Sgr025648 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr025648
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00152936: 1446809 .. 1450065 (+)
RNA-Seq Expression: Sgr025648
Synteny: Sgr025648
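The Location field above can be split into scaffold, coordinates and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for such records, not something this page states); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'scaffold: start .. end (strand)' into its components.

    Assumes 1-based, inclusive coordinates, so the span is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

loc = parse_location("tig00152936: 1446809 .. 1450065 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # tig00152936 + 3257
```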
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGATCTCCATGTTCTTTTCAAGCCAAGGCTCGCCTTCTTCAATGCAATGTCTTCTTCATCGTCACCCCAGATTCCATCTCTGGTAACCCATTTCATCGATCTTATTCATGCTTCCGATACCCCCCGCAAGCTCCGGCAGATCCACGCTCAACTCCTCCGCTGCAATATCTTCTCCAGCAGCCGGGTCGTGACCCAGTTTATCTCTTCGTGTTTTTCGGTAAATTCTGTTGACTATGCCGTCTCGATCTTTCAGCTGTTCGAGTTGAAGAACAATTTCCTCTTTAACGCGTTGATACGTGGGCTTGCTGAAAACTCCAGGTTCGAGAGCTCAATTTCTTACTTTGTCCTAATGCTCAGGTTGAAAATCAGCCCTGATAGGCTTACTTTTCCATTTGTGCTCAAGTCAGCCGCGGCTCTTCCCGATGGAGGTGTTGGGAGGGCCTTGCATGGTGGGATTTTGAAGTTTGGCCTTGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTGGATATGTACGTGAAAGTTGAAGATTTGAGTTCTGCCTTGAAGCTGTTTGATGAAAGTCCTGATAGAATTAAGAATGAAAGTGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAGTGGGAGATTTGATAAAAGCTTCGGAGCTGTTCGAGACAATGCCAAAGAAGGATGCAGGATCTTGGAATAGTTTGATTAATGGCTTCATGAGAAAAGGGGATTTGGGTCGAGCAAAGGAACTGTTTGAGAAAATACCTGAAAAGAATGTTGTTTCTTGGACTACCATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTATATGCTTGAAGAAGGTGCACGACCAAACGATTACACAATTGTTTCTGCACTTTCAGCTTGTGCAAAAGTTGGTGCCTTAGATACTGGTTTAAGGATCCATAAATACCTTTCAGGCAATGGTTTCAAACTGAATGTAACGATTGGAACTGCACTTGTGGATATGTATGCAAAATGTGGAAATATTGAGTGTGCAGGAGCAGTGTTCTGCAAAACCAAAGAAAAGTGCCTTCTTACTTGGAGTGTTATGATCTGGGGCTGGGCGATCCATGGACATTATAAGAAAGCTATACAATACTTTGAATGGATGAAGTCTACAGGTTTGACTTCACATTGTGTTCATTATTGCCTGAAATTTATATTTTGTTGTTCACAGACTCAAAACTCAGCATGTTTGCAGGAACAAAGCCAGATGGTGTGGTCTTTCTTGCTGTTCTTACAGCATGCTCACATTCTGGGCAAGTAAATGACGGACTTGAGTTTTTCAATAGTATGAGGCACGTGTACTTGATTGAGCCTTCTATGAAGCACTACACACTGGTTGTAGACATGCTAGGCAGGGCTGGTAGACTAGACGAAGCTCTGAAGTTCATAAGAAACATGCCCATAAATCCTGATTTCGTGGTATGGGGTGCACTGTTTTGTGCTTGTAGCGCTCATAAGAATATTGAAATGGCAGAGCTAGCATCAGAAAAGCTTCTTCAGCTTGAACCTAAGCATCCGGGAAGTTACGTCTTTTTGTCAAATGCATATGCTGCAGTAGGAAGATGGGAGATGCAGAGAGAGTGAGGATTTCAATGCGAGATAGAGGTGCACAAAAAGATCCGGGATGGAGCTTTATTGAAGTGGATGATAAATTACATAGATTTATAGCTGGTGATATTACTCATGACCATGCTCGAGAGATATACTTGAGATTAGATGAAATAAGTGCAGGTGCCAGGGAAAAAGGATACACATCAGAAATTGAGTGTGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGATGCATTGGGACATCACAGTGAGAAGTTGGCACTTGCTTTGGGCTCATTAGCACAGCCCCCGGGGCGACCATTAGGATAGTGAAAAACCTTAGAGTCTGTGTGGATTGCCATTCTTTCATGAAATATGCCAGTAAAATGAG
TCAGAGGGAGATCGTTTTGCGGGATATAAAGCGATTCCATCATTTTAATGATGGTGATTGTTCATGTGGAGATTATTGGTAAAAAGGTAAATAGAACAAGTTGTGGTGTAGTATTGATAATATTTTACTTTTATGTGTTACTTATATCCATAATCTTTCCATATGGCTGCAGGTGCACAGATTAATTTGTCTTTGTTCTCCGGATTAGTGGGGTTGGGCCATAAGGATAATGTAAGAAGTGGGGCTCCCAACACTCGGTTATTAAAAAGTTAATAATAATTGAAATGCAGACTCTACTTTCAGTTCCAACTTCAGATCAGTCAACAAATTGCTTGCTGATATGTTATCCCCCACTCCTATACTCCTGCTGCTTCCAAGAATACATCATATCAAATGCAGACTCTACTCTTCCAATACAGGTTTGATGCCCTGACATGATTTCCCTCAATGATCTTAGGTTCTTAACTGCTGATATGATTTGTGCATCTCATCTGGTCTCTTTGCAGGTGAATCCCATTTTCATGCTGTGTTTAGAGTATAAGTTTCCCCATTGAACTTTTACAACAAAGTCGGCTTGGAACGCTAAGAACATGAGAAAATTTCTTGCACTCGGGTTGTTCCGCTCTTTGTCGGATAAATTTACAGATGTTATAGCTTATGTATATGTTCATAAACATTCTGCCAGAGGTCTGCTAACTTCAGTGTAGAGATGAGGAGAAGATCTTCAGGAACATATCAACGATGGGGTTGGGGTCAAGGAGGGGGAGGAGGCCTTCTCAAGGTGCTGTTGAGTGATCATGCAGCATAATTCTGTGAATTTCGGAGTCATCTTTGCTAATTTTTCATCGGAGGACACTTTCCGGGATTCCAAATCCCAAGCATCAACATGTGCAGAACAGTAAGAGGAAGCTCCAGTTTTAGGGACACGGTCAGAAGGGCAACTTCATTTGTTGTTACCAAGCTGCATAATAATCAAGGAATGAAGTCTTGGGGATCCATTTCATTAATTGAAATAATTTGATTGTGGCTCCTCACAGGAACAAGAGGGATTCACCGAAGTATACTTCTCCTTCCCTATTGAGGATACGACAAAATCGATGAAACGAACGAGATCCATTGCCTTCTCAGGTCTCGCAGATGTTGCTTGTAGGTCGTCCTATGCCTCAAGATGGCTCTCTTGTCCCCTACCGGTTTGAAATGCGCTCTCGGGTTTTCGCTCTATTGCATCTGGCAAGTTTTTCTTTCCTCTGTCATTGA

mRNA sequence

ATGAAAGATCTCCATGTTCTTTTCAAGCCAAGGCTCGCCTTCTTCAATGCAATGTCTTCTTCATCGTCACCCCAGATTCCATCTCTGGTAACCCATTTCATCGATCTTATTCATGCTTCCGATACCCCCCGCAAGCTCCGGCAGATCCACGCTCAACTCCTCCGCTGCAATATCTTCTCCAGCAGCCGGGTCGTGACCCAGTTTATCTCTTCGTGTTTTTCGGTAAATTCTGTTGACTATGCCGTCTCGATCTTTCAGCTGTTCGAGTTGAAGAACAATTTCCTCTTTAACGCGTTGATACGTGGGCTTGCTGAAAACTCCAGGTTGAAAATCAGCCCTGATAGGCTTACTTTTCCATTTGTGCTCAAGTCAGCCGCGGCTCTTCCCGATGGAGGTGTTGGGAGGGCCTTGCATGGTGGGATTTTGAAGTTTGGCCTTGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTGGATATGTACGTGAAAGTTGAAGATTTGAGTTCTGCCTTGAAGCTGTTTGATGAAAGTCCTGATAGAATTAAGAATGAAAGTGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAGTGGGAGATTTGATAAAAGCTTCGGAGCTGTTCGAGACAATGCCAAAGAAGGATGCAGGATCTTGGAATAGTTTGATTAATGGCTTCATGAGAAAAGGGGATTTGGGTCGAGCAAAGGAACTGTTTGAGAAAATACCTGAAAAGAATGTTGTTTCTTGGACTACCATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTATATGCTTGAAGAAGGTGCACGACCAAACGATTACACAATTGTTTCTGCACTTTCAGCTTGTGCAAAAGTTGGTGCCTTAGATACTGGTTTAAGGATCCATAAATACCTTTCAGGCAATGGTTTCAAACTGAATGTAACGATTGGAACTGCACTTGTGGATATGTATGCAAAATGTGGAAATATTGAGTGTGCAGGAGCAGTGTTCTGCAAAACCAAAGAAAAGTGCCTTCTTACTTGGAGTGTTATGATCTGGGGCTGGGCGATCCATGGACATTATAAGAAAGCTATACAATACTTTGAATGGATGAAGTCTACAGACTCAAAACTCAGCATGTTTGCAGGAACAAAGCCAGATGGTGTGGTCTTTCTTGCTGTTCTTACAGCATGCTCACATTCTGGGCAAGTAAATGACGGACTTGAGTTTTTCAATAGTATGAGGCACGTGTACTTGATTGAGCCTTCTATGAAGCACTACACACTGGTTGTAGACATGCTAGGCAGGGCTGGTAGACTAGACGAAGCTCTGAAGTTCATAAGAAACATGCCCATAAATCCTGATTTCGTGGTATGGGGTGCACTGTTTTGTGCTTGTAGCGCTCATAAGAATATTGAAATGGCAGAGCTAGCATCAGAAAAGCTTCTTCAGCTTGAACCTAAGCATCCGGGAAGTTACGTCTTTTTGTCAAATGCATATGCTGCAGTAGGAAGATGGGAGATGCAGAGAGAATTTATAGCTGGTGATATTACTCATGACCATGCTCGAGAGATATACTTGAGATTAGATGAAATAAGTGCAGGTGCCAGGGAAAAAGGATACACATCAGAAATTGAGTGTGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGATGCATTGGGACATCACAGTGAGAAGTTGGCACTTGCTTTGGGCTCATTAGCACAGCCCCCGGGGCGACCATTAGGATATTCCAACTTCAGATCAGTCAACAAATTGCTTGCTGATATGTTATCCCCCACTCCTATACTCCTGCTGCTTCCAAGAATACATCATATCAAATGCAGACTCTACTCTTCCAATACAGGTGAATCCCATTTTCATGCTGTGTTTAGAAGGTCTGCTAACTTCAGTGTAGAGATGAGGAGAAGATCTTCAGGAACATATCAACGATGGGGTTGGGGTCAAGGAGGGGGAGGAGGCCTTCT
CAAGGTCTCGCAGATGTTGCTTGTAGGTCGTCCTATGCCTCAAGATGGCTCTCTTGTCCCCTACCGGTTTGAAATGCGCTCTCGGGTTTTCGCTCTATTGCATCTGGCAAGTTTTTCTTTCCTCTGTCATTGA

Coding sequence (CDS)

ATGAAAGATCTCCATGTTCTTTTCAAGCCAAGGCTCGCCTTCTTCAATGCAATGTCTTCTTCATCGTCACCCCAGATTCCATCTCTGGTAACCCATTTCATCGATCTTATTCATGCTTCCGATACCCCCCGCAAGCTCCGGCAGATCCACGCTCAACTCCTCCGCTGCAATATCTTCTCCAGCAGCCGGGTCGTGACCCAGTTTATCTCTTCGTGTTTTTCGGTAAATTCTGTTGACTATGCCGTCTCGATCTTTCAGCTGTTCGAGTTGAAGAACAATTTCCTCTTTAACGCGTTGATACGTGGGCTTGCTGAAAACTCCAGGTTGAAAATCAGCCCTGATAGGCTTACTTTTCCATTTGTGCTCAAGTCAGCCGCGGCTCTTCCCGATGGAGGTGTTGGGAGGGCCTTGCATGGTGGGATTTTGAAGTTTGGCCTTGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTGGATATGTACGTGAAAGTTGAAGATTTGAGTTCTGCCTTGAAGCTGTTTGATGAAAGTCCTGATAGAATTAAGAATGAAAGTGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAGTGGGAGATTTGATAAAAGCTTCGGAGCTGTTCGAGACAATGCCAAAGAAGGATGCAGGATCTTGGAATAGTTTGATTAATGGCTTCATGAGAAAAGGGGATTTGGGTCGAGCAAAGGAACTGTTTGAGAAAATACCTGAAAAGAATGTTGTTTCTTGGACTACCATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTATATGCTTGAAGAAGGTGCACGACCAAACGATTACACAATTGTTTCTGCACTTTCAGCTTGTGCAAAAGTTGGTGCCTTAGATACTGGTTTAAGGATCCATAAATACCTTTCAGGCAATGGTTTCAAACTGAATGTAACGATTGGAACTGCACTTGTGGATATGTATGCAAAATGTGGAAATATTGAGTGTGCAGGAGCAGTGTTCTGCAAAACCAAAGAAAAGTGCCTTCTTACTTGGAGTGTTATGATCTGGGGCTGGGCGATCCATGGACATTATAAGAAAGCTATACAATACTTTGAATGGATGAAGTCTACAGACTCAAAACTCAGCATGTTTGCAGGAACAAAGCCAGATGGTGTGGTCTTTCTTGCTGTTCTTACAGCATGCTCACATTCTGGGCAAGTAAATGACGGACTTGAGTTTTTCAATAGTATGAGGCACGTGTACTTGATTGAGCCTTCTATGAAGCACTACACACTGGTTGTAGACATGCTAGGCAGGGCTGGTAGACTAGACGAAGCTCTGAAGTTCATAAGAAACATGCCCATAAATCCTGATTTCGTGGTATGGGGTGCACTGTTTTGTGCTTGTAGCGCTCATAAGAATATTGAAATGGCAGAGCTAGCATCAGAAAAGCTTCTTCAGCTTGAACCTAAGCATCCGGGAAGTTACGTCTTTTTGTCAAATGCATATGCTGCAGTAGGAAGATGGGAGATGCAGAGAGAATTTATAGCTGGTGATATTACTCATGACCATGCTCGAGAGATATACTTGAGATTAGATGAAATAAGTGCAGGTGCCAGGGAAAAAGGATACACATCAGAAATTGAGTGTGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGATGCATTGGGACATCACAGTGAGAAGTTGGCACTTGCTTTGGGCTCATTAGCACAGCCCCCGGGGCGACCATTAGGATATTCCAACTTCAGATCAGTCAACAAATTGCTTGCTGATATGTTATCCCCCACTCCTATACTCCTGCTGCTTCCAAGAATACATCATATCAAATGCAGACTCTACTCTTCCAATACAGGTGAATCCCATTTTCATGCTGTGTTTAGAAGGTCTGCTAACTTCAGTGTAGAGATGAGGAGAAGATCTTCAGGAACATATCAACGATGGGGTTGGGGTCAAGGAGGGGGAGGAGGCCTTCT
CAAGGTCTCGCAGATGTTGCTTGTAGGTCGTCCTATGCCTCAAGATGGCTCTCTTGTCCCCTACCGGTTTGAAATGCGCTCTCGGGTTTTCGCTCTATTGCATCTGGCAAGTTTTTCTTTCCTCTGTCATTGA

Protein sequence

MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRLKISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVSALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKCLLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFCACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWEMQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEKEDALGHHSEKLALALGSLAQPPGRPLGYSNFRSVNKLLADMLSPTPILLLLPRIHHIKCRLYSSNTGESHFHAVFRRSANFSVEMRRRSSGTYQRWGWGQGGGGGLLKVSQMLLVGRPMPQDGSLVPYRFEMRSRVFALLHLASFSFLCH
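The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code, the hypothetical `translate` function below converts the first 33 bases of the CDS into the first 11 residues of the protein. The codon table is built from the conventional TCAG ordering.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGAAAGATCTCCATGTTCTTTTCAAGCCAAGG"  # first 33 bases of the CDS above
print(translate(cds_start))  # MKDLHVLFKPR
```

Passing the full CDS string instead of `cds_start` should reproduce the whole protein sequence, provided the two line fragments of the CDS are joined first.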
Homology
BLAST of Sgr025648 vs. NCBI nr
Match: XP_022138400.1 (pentatricopeptide repeat-containing protein At1g04840 [Momordica charantia] >XP_022138401.1 pentatricopeptide repeat-containing protein At1g04840 [Momordica charantia])

HSP 1 Score: 1001.9 bits (2589), Expect = 2.8e-288
Identity = 515/665 (77.44%), Positives = 546/665 (82.11%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MKDLHVL KP LAFFN+M SSSS Q+ SL THFIDLIHAS+T  KLRQIHAQL RCNIFS
Sbjct: 1   MKDLHVLLKPSLAFFNSMPSSSSSQVQSLETHFIDLIHASNTAHKLRQIHAQLFRCNIFS 60

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+N VDYA+SIFQ FE+KN+FLFNALIRGLAENSR            
Sbjct: 61  SSRVVTQFISSCSSLNLVDYAISIFQRFEMKNSFLFNALIRGLAENSRFEGSIYYFVLML 120

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL  GGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS
Sbjct: 121 KWKISPDRLTFPFVLKSAAALSSGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIKN SVLIWNVLIHGYCRVGDL+KA+ELFETMPKKD GSWNSLINGFM
Sbjct: 181 SALKVFDESPDRIKNGSVLIWNVLIHGYCRVGDLVKATELFETMPKKDTGSWNSLINGFM 240

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKGDLGRAKELFE++PEKNVVSWTTMVNGFSQNGDPEKALETF  MLEEGA+PNDYTIVS
Sbjct: 241 RKGDLGRAKELFERMPEKNVVSWTTMVNGFSQNGDPEKALETFSCMLEEGAQPNDYTIVS 300

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAKVGALD GLRIH YLS NGF+LN+TIGTALVDMYAKCGNIE AG VFC+TKEK 
Sbjct: 301 ALSACAKVGALDAGLRIHNYLSSNGFRLNLTIGTALVDMYAKCGNIESAGEVFCETKEKG 360

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KKAI YFEWMKST        G KPDGVVFLAVLT+CSHSG+V
Sbjct: 361 LLTWSVMIWGWAIHGHFKKAILYFEWMKST--------GMKPDGVVFLAVLTSCSHSGKV 420

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           NDGL+FF+SMR  YLIEPSMKHYTLVVDMLGRAGRLDEAL FIRNMPINPDFVVWGALFC
Sbjct: 421 NDGLKFFDSMRRDYLIEPSMKHYTLVVDMLGRAGRLDEALNFIRNMPINPDFVVWGALFC 480

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRW------------------ 540
           AC AHKNIEMAELASEKLL+LEPKHPGSYVFLSNAYAAVGRW                  
Sbjct: 481 ACRAHKNIEMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWDDAERVRISMRDRGAQKD 540

Query: 541 ---------EMQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                    +    F+AGD TH+ AREIY RLDEISAGAREKGYTS IECVLHNIEEEEK
Sbjct: 541 PGWSFIEVDDKLHRFVAGDKTHNRAREIYSRLDEISAGAREKGYTSGIECVLHNIEEEEK 600

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 619
           EDALGHHSEKLALA G ++  PG  +    N R      S  K  + M     +L  + R
Sbjct: 601 EDALGHHSEKLALAFGLISTTPGTTIRIVKNLRVCVDCHSFMKYASKMNQREIVLRDIKR 657
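The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. A minimal sketch for parsing such a header line and recomputing the percentages follows. `parse_hsp_stats` is a hypothetical helper, and the pattern also accepts the "Postives" spelling sometimes seen in this kind of output.

```python
import re

HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return match counts and recomputed percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length = int(match.group(1)), int(match.group(2))
    positives = int(match.group(4))
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 515/665 (77.44%), Positives = 546/665 (82.11%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 77.44 82.11
```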

BLAST of Sgr025648 vs. NCBI nr
Match: XP_023513771.1 (pentatricopeptide repeat-containing protein At1g04840 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 969.5 bits (2505), Expect = 1.5e-278
Identity = 498/663 (75.11%), Positives = 540/663 (81.45%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MK+LHVLFKPR+AFFN+ SSSSSPQI S  THFIDLIHASD+  KLRQIH QL RCNIFS
Sbjct: 14  MKNLHVLFKPRIAFFNSTSSSSSPQISSQETHFIDLIHASDSTHKLRQIHGQLYRCNIFS 73

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYAV IFQ FELKN+FLFNALIRGLAENSR            
Sbjct: 74  SSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSIAYFVCML 133

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             +ISPDRLTFPFVLKSAAAL +GGVG ALH GI+KFGLEFDSFVRVSLVDMYVKV+DL 
Sbjct: 134 RWEISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLG 193

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIK E+VLIWNVLIHGYCRVG+L+KA+ELFETMPKKD GSWNSLINGFM
Sbjct: 194 SALKVFDESPDRIKKENVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLINGFM 253

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKG LG A ELFEK+PEKNVVSWTTMVNGFSQNGDPEKAL+ FF MLEEGARPNDYTIVS
Sbjct: 254 RKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYTIVS 313

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIH+YLS +GFKLN TIGTALVDMYAKCGNIE AG VF + K+K 
Sbjct: 314 ALSACAKLGALDAGLRIHRYLSSHGFKLNQTIGTALVDMYAKCGNIESAGEVFREIKQKG 373

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KK+IQYFEWMKST        GTKPDGVVFLAVLTACSHSGQV
Sbjct: 374 LLTWSVMIWGWAIHGHFKKSIQYFEWMKST--------GTKPDGVVFLAVLTACSHSGQV 433

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           +DGLEFF+SMR  YLIEPSMKHYTL+VDMLGRAGRLDEALKF+R+MPINPDFVVWGALFC
Sbjct: 434 DDGLEFFDSMRRDYLIEPSMKHYTLIVDMLGRAGRLDEALKFLRDMPINPDFVVWGALFC 493

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKNI+MAELASEKLL+LEPKHPGSYVFLSNAYAAVGRWE                 
Sbjct: 494 ACRAHKNIKMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKD 553

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH+ A+EIY +LDEI+AGAREKGYT  IECVLHNIEEEEK
Sbjct: 554 PGWSFMEVDDKLHRFVAGDNTHNRAQEIYSKLDEINAGAREKGYTKGIECVLHNIEEEEK 613

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALGHHSEKLALA G ++  P   +    N R      S  K  + M     IL  + R
Sbjct: 614 EEALGHHSEKLALAFGLVSTAPETTIRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKR 668

BLAST of Sgr025648 vs. NCBI nr
Match: KAG7026055.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 966.1 bits (2496), Expect = 1.7e-277
Identity = 493/639 (77.15%), Positives = 532/639 (83.26%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MK+L VLFKPR+AFFN+ SSSSSPQI SL THFIDLIHASD+  KLRQIH QL RCNIFS
Sbjct: 16  MKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHKLRQIHGQLYRCNIFS 75

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYAV IFQ FELKN+FLFNALIRGLAENSR            
Sbjct: 76  SSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSISYFVCML 135

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVG ALH GILKFGLEFDSFVRVSLVDMYVKV+DL 
Sbjct: 136 RWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFVRVSLVDMYVKVDDLG 195

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIK  +VLIWNVLIHGYCRVG+L+KA+ELFETMP+KD GSWNSLINGFM
Sbjct: 196 SALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMPEKDTGSWNSLINGFM 255

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKG LG A ELFEK+PEKNVVSWTTMVNGFSQNGDPEKAL+ FF MLEEGARPNDYTIVS
Sbjct: 256 RKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYTIVS 315

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIHKYLS +GFKLN TIGTA+VDMYAKCGNIE AG VF + K+K 
Sbjct: 316 ALSACAKLGALDAGLRIHKYLSSHGFKLNQTIGTAVVDMYAKCGNIESAGEVFREIKQKG 375

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KK+IQYFEWMKS        AGTKPDGVVFLAVLTACSHSGQV
Sbjct: 376 LLTWSVMIWGWAIHGHFKKSIQYFEWMKS--------AGTKPDGVVFLAVLTACSHSGQV 435

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           +DGLEFF+SMR  YLIEPSMKHYTL+VDMLGRAGRLDEALKFIR+MPINPDFVVWGALFC
Sbjct: 436 DDGLEFFDSMRRDYLIEPSMKHYTLIVDMLGRAGRLDEALKFIRDMPINPDFVVWGALFC 495

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKNI+MAELASEKLL+LEPKHPGSYVFLSNAYAAVGRWE                 
Sbjct: 496 ACRAHKNIKMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKD 555

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 599
                         F+AGD TH+ A+EIY +LDEI+AGAREKGYT  IECVLHNIEEEEK
Sbjct: 556 PGWSFMEVDDKLHRFVAGDNTHNRAQEIYSKLDEINAGAREKGYTKGIECVLHNIEEEEK 615

BLAST of Sgr025648 vs. NCBI nr
Match: XP_022964045.1 (pentatricopeptide repeat-containing protein At1g04840 [Cucurbita moschata])

HSP 1 Score: 965.7 bits (2495), Expect = 2.2e-277
Identity = 499/663 (75.26%), Positives = 539/663 (81.30%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MK+L VLFKPR+AFFN+ SSSSSPQI SL THFIDLIHASD+  KLRQIH QL RCNIFS
Sbjct: 16  MKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHKLRQIHGQLYRCNIFS 75

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYAV IFQ FELKN+FLFNALIRGLAENSR            
Sbjct: 76  SSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSISYFVCML 135

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVG ALH GILKFGLEFDSFVRVSLVDMYVKV+DL 
Sbjct: 136 RWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFVRVSLVDMYVKVDDLG 195

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIK  +VLIWNVLIHGYCRVG+L+KA+ELFETMP+KD GSWNSLINGFM
Sbjct: 196 SALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMPEKDTGSWNSLINGFM 255

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKG LG A ELFEK+PEKNVVSWTTMVNGFSQNGDPEKAL+ FF MLEEGARPNDYTIVS
Sbjct: 256 RKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYTIVS 315

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIHKYLS +GFKLN TIGTA+VDMYAKCGNIE AG VF + K+K 
Sbjct: 316 ALSACAKLGALDAGLRIHKYLSSHGFKLNQTIGTAVVDMYAKCGNIESAGEVFREIKQKG 375

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KK+IQYFEWMKS        AGTKPDGVVFLAVLTACSHSGQV
Sbjct: 376 LLTWSVMIWGWAIHGHFKKSIQYFEWMKS--------AGTKPDGVVFLAVLTACSHSGQV 435

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           +DGLEFF+SMR  YLIEPSMKHYTL+VDMLGRAGRLDEALKFIR+MPINPDFVVWGALFC
Sbjct: 436 DDGLEFFDSMRRDYLIEPSMKHYTLIVDMLGRAGRLDEALKFIRDMPINPDFVVWGALFC 495

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKNI+MAELASEKLL+LEPKHPGSYVFLSNAYAAVGRWE                 
Sbjct: 496 ACRAHKNIKMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKD 555

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH+ A+EIY +LDEI+AGAREKGYT  IECVLHNIEEEEK
Sbjct: 556 PGWSFMEVDDKLHRFVAGDNTHNRAQEIYSKLDEINAGAREKGYTKGIECVLHNIEEEEK 615

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALGHHSEKLALA G ++  P   +    N R      S  K  + M     IL  + R
Sbjct: 616 EEALGHHSEKLALAFGLVSTAPETTIRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKR 670

BLAST of Sgr025648 vs. NCBI nr
Match: XP_023000600.1 (pentatricopeptide repeat-containing protein At1g04840 [Cucurbita maxima])

HSP 1 Score: 965.3 bits (2494), Expect = 2.9e-277
Identity = 497/663 (74.96%), Positives = 539/663 (81.30%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MK+LHVLFKPR+AFFN+ SSSSSPQI SL T+FIDLIHASD+  KLRQIH QL RCNIFS
Sbjct: 13  MKNLHVLFKPRIAFFNSTSSSSSPQISSLETYFIDLIHASDSTHKLRQIHGQLYRCNIFS 72

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYAV IFQ FELKN+FLFNALIRGLAENSR            
Sbjct: 73  SSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSISYFVCML 132

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVG ALH GI+KFGLEFDSFVRVSLVDMYVKV+DL 
Sbjct: 133 RWKISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLG 192

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIK  +VLIWNVLIHGYCRVG+L+KA+ELFETMPKKD GSWNSLINGFM
Sbjct: 193 SALKVFDESPDRIKQGNVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLINGFM 252

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKG LG A ELFEK+PEKNVVSWTTMVNGFSQNGDPEKAL+ FF MLEEGA+PNDYTIVS
Sbjct: 253 RKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGAQPNDYTIVS 312

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIHKYLS +GFKLN TIGTA+VDMYAKCGNIE AG VF + K+K 
Sbjct: 313 ALSACAKLGALDAGLRIHKYLSSHGFKLNQTIGTAVVDMYAKCGNIESAGEVFGEIKQKG 372

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KK+IQYFEWMKST        GTKPDGVVFLAVLTACSHSGQV
Sbjct: 373 LLTWSVMIWGWAIHGHFKKSIQYFEWMKST--------GTKPDGVVFLAVLTACSHSGQV 432

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           +DGLEFF+SMR  YLIEPSMKHYTL+VDMLGRAGRLDEALKFIR+MPINPDFVVWGALFC
Sbjct: 433 DDGLEFFDSMRRDYLIEPSMKHYTLIVDMLGRAGRLDEALKFIRDMPINPDFVVWGALFC 492

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKNI+MAELASEKLL+LEPKHPGSYVFLSNAYAAVGRWE                 
Sbjct: 493 ACRAHKNIKMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKD 552

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH+ A+EIY +LDEI+A AREKGYT  IECVLHNIEEEEK
Sbjct: 553 PGWSFMEVDDKLHRFVAGDNTHNRAQEIYSKLDEINASAREKGYTKGIECVLHNIEEEEK 612

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALGHHSEKLALA G ++  P   +    N R      S  K  + M     IL  + R
Sbjct: 613 EEALGHHSEKLALAFGLISTAPETMIRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKR 667

BLAST of Sgr025648 vs. ExPASy Swiss-Prot
Match: Q9MAT2 (Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H64 PE=2 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 3.0e-168
Identity = 315/627 (50.24%), Positives = 412/627 (65.71%), Query Frame = 0

Query: 1   MKDLHVLFKPRLA----FFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRC 60
           MK L V+FKP+ +    +F A   +S  +     +HFI LIHA      LR +HAQ+LR 
Sbjct: 1   MKSLSVIFKPKSSPAKIYFPADRQASPDE-----SHFISLIHACKDTASLRHVHAQILRR 60

Query: 61  NIFSSSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENS--------- 120
            +  SSRV  Q +S    + S DY++SIF+  E +N F+ NALIRGL EN+         
Sbjct: 61  GVL-SSRVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHF 120

Query: 121 ----RLKISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKV 180
               RL + PDRLTFPFVLKS + L    +GRALH   LK  ++ DSFVR+SLVDMY K 
Sbjct: 121 ILMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKT 180

Query: 181 EDLSSALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLI 240
             L  A ++F+ESPDRIK ES+LIWNVLI+GYCR  D+  A+ LF +MP++++GSW++LI
Sbjct: 181 GQLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLI 240

Query: 241 NGFMRKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDY 300
            G++  G+L RAK+LFE +PEKNVVSWTT++NGFSQ GD E A+ T+F MLE+G +PN+Y
Sbjct: 241 KGYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEY 300

Query: 301 TIVSALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKT 360
           TI + LSAC+K GAL +G+RIH Y+  NG KL+  IGTALVDMYAKCG ++CA  VF   
Sbjct: 301 TIAAVLSACSKSGALGSGIRIHGYILDNGIKLDRAIGTALVDMYAKCGELDCAATVFSNM 360

Query: 361 KEKCLLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSH 420
             K +L+W+ MI GWA+HG + +AIQ F  M        M++G KPD VVFLAVLTAC +
Sbjct: 361 NHKDILSWTAMIQGWAVHGRFHQAIQCFRQM--------MYSGEKPDEVVFLAVLTACLN 420

Query: 421 SGQVNDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWG 480
           S +V+ GL FF+SMR  Y IEP++KHY LVVD+LGRAG+L+EA + + NMPINPD   W 
Sbjct: 421 SSEVDLGLNFFDSMRLDYAIEPTLKHYVLVVDLLGRAGKLNEAHELVENMPINPDLTTWA 480

Query: 481 ALFCACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVG--------RWEMQR-- 540
           AL+ AC AHK    AE  S+ LL+L+P+  GSY+FL   +A+ G        R  +Q+  
Sbjct: 481 ALYRACKAHKGYRRAESVSQNLLELDPELCGSYIFLDKTHASKGNIQDVEKRRLSLQKRI 540

Query: 541 -----------------EFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIE 584
                            +F AGD +H   +EI L+LDEI + A +KGY    +  +H+IE
Sbjct: 541 KERSLGWSYIELDGQLNKFSAGDYSHKLTQEIGLKLDEIISLAIQKGYNPGADWSIHDIE 600

BLAST of Sgr025648 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 7.5e-103
Identity = 237/734 (32.29%), Positives = 354/734 (48.23%), Query Frame = 0

Query: 15  FNAMSSSSSPQIPSLVTH-FIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFISSCF 74
           F+ + SSS P   S+  H  + L+H   T + LR IHAQ+++  + +++  +++ I  C 
Sbjct: 17  FHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCI 76

Query: 75  ---SVNSVDYAVSIFQLFELKNNFLFNALIRGLAENS-------------RLKISPDRLT 134
                  + YA+S+F+  +  N  ++N + RG A +S              L + P+  T
Sbjct: 77  LSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYT 136

Query: 135 FPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESP 194
           FPFVLKS A       G+ +HG +LK G + D +V  SL+ MYV+   L  A K+FD+SP
Sbjct: 137 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 196

Query: 195 DRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKG------- 254
            R     V+ +  LI GY   G +  A +LF+ +P KD  SWN++I+G+   G       
Sbjct: 197 HR----DVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALE 256

Query: 255 ------------------------------DLGR-------------------------- 314
                                         +LGR                          
Sbjct: 257 LFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYS 316

Query: 315 -------AKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 374
                  A  LFE++P K+V+SW T++ G++     ++AL  F  ML  G  PND T++S
Sbjct: 317 KCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLS 376

Query: 375 ALSACAKVGALDTGLRIHKYLSG--NGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKE 434
            L ACA +GA+D G  IH Y+     G     ++ T+L+DMYAKCG+IE A  VF     
Sbjct: 377 ILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILH 436

Query: 435 KCLLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSG 494
           K L +W+ MI+G+A+HG    +   F  M+          G +PD + F+ +L+ACSHSG
Sbjct: 437 KSLSSWNAMIFGFAMHGRADASFDLFSRMRK--------IGIQPDDITFVGLLSACSHSG 496

Query: 495 QVNDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGAL 554
            ++ G   F +M   Y + P ++HY  ++D+LG +G   EA + I  M + PD V+W +L
Sbjct: 497 MLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSL 556

Query: 555 FCACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRW---------------- 614
             AC  H N+E+ E  +E L+++EP++PGSYV LSN YA+ GRW                
Sbjct: 557 LKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMK 616

Query: 615 -----------EMQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEE 626
                       +  EFI GD  H   REIY  L+E+     + G+  +   VL  +EEE
Sbjct: 617 KVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEE 676

BLAST of Sgr025648 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 1.2e-97
Identity = 207/695 (29.78%), Positives = 352/695 (50.65%), Query Frame = 0

Query: 18  MSSSSSPQIPSLVTH--------FIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFI 77
           MS +S    P L T+        +  LI ++    +L+QIHA+LL   +  S  ++T+ I
Sbjct: 1   MSEASCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLI 60

Query: 78  SSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL-------------KISPDRL 137
            +  S   + +A  +F        F +NA+IRG + N+               ++SPD  
Sbjct: 61  HASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSF 120

Query: 138 TFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDES 197
           TFP +LK+ + L    +GR +H  + + G + D FV+  L+ +Y K   L SA  +F+  
Sbjct: 121 TFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGL 180

Query: 198 PDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDA-GSWNSLIN----------- 257
           P  +   +++ W  ++  Y + G+ ++A E+F  M K D    W +L++           
Sbjct: 181 P--LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDL 240

Query: 258 ---------------------------GFMRKGDLGRAKELFEKIPEKNVVSWTTMVNGF 317
                                       + + G +  AK LF+K+   N++ W  M++G+
Sbjct: 241 KQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGY 300

Query: 318 SQNGDPEKALETFFYMLEEGARPNDYTIVSALSACAKVGALDTGLRIHKYLSGNGFKLNV 377
           ++NG   +A++ F  M+ +  RP+  +I SA+SACA+VG+L+    +++Y+  + ++ +V
Sbjct: 301 AKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDV 360

Query: 378 TIGTALVDMYAKCGNIECAGAVFCKTKEKCLLTWSVMIWGWAIHGHYKKAIQYFEWMKST 437
            I +AL+DM+AKCG++E A  VF +T ++ ++ WS MI G+ +HG  ++AI  +  M+  
Sbjct: 361 FISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMER- 420

Query: 438 DSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGLEFFNSMRHVYLIEPSMKHYTLVVDML 497
                   G  P+ V FL +L AC+HSG V +G  FFN M   + I P  +HY  V+D+L
Sbjct: 421 -------GGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 480

Query: 498 GRAGRLDEALKFIRNMPINPDFVVWGALFCACSAHKNIEMAELASEKLLQLEPKHPGSYV 557
           GRAG LD+A + I+ MP+ P   VWGAL  AC  H+++E+ E A+++L  ++P + G YV
Sbjct: 481 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 540

Query: 558 FLSNAYAAVGRWEMQRE---------------------------FIAGDITHDHAREIYL 617
            LSN YAA   W+   E                           F  GD +H    EI  
Sbjct: 541 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 600

Query: 618 RLDEISAGAREKGYTSEIECVLHNIEEEEKEDALGHHSEKLALALGSLAQPPGRPLGYS- 619
           +++ I +  +E G+ +  +  LH++ +EE E+ L  HSE++A+A G ++ P G PL  + 
Sbjct: 601 QVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITK 660

BLAST of Sgr025648 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 2.8e-97
Identity = 226/722 (31.30%), Positives = 351/722 (48.61%), Query Frame = 0

Query: 19  SSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQF--ISSCFSVN 78
           S+ + P   +  +  I LI    + R+L+Q H  ++R   FS     ++   +++  S  
Sbjct: 19  SNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA 78

Query: 79  SVDYAVSIFQLFELKNNFLFNALIRGLAEN--------------SRLKISPDRLTFPFVL 138
           S++YA  +F      N+F +N LIR  A                S  +  P++ TFPF++
Sbjct: 79  SLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLI 138

Query: 139 KSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESPDRIKN 198
           K+AA +    +G++LHG  +K  +  D FV  SL+  Y    DL SA K+F      IK 
Sbjct: 139 KAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVF----TTIKE 198

Query: 199 ESVLIWNVLIHG------------------------------------------------ 258
           + V+ WN +I+G                                                
Sbjct: 199 KDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQV 258

Query: 259 ----------------------YCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKGDL 318
                                 Y + G +  A  LF+ M +KD  +W ++++G+    D 
Sbjct: 259 CSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDY 318

Query: 319 GRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYM-LEEGARPNDYTIVSALSA 378
             A+E+   +P+K++V+W  +++ + QNG P +AL  F  + L++  + N  T+VS LSA
Sbjct: 319 EAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSA 378

Query: 379 CAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKCLLTW 438
           CA+VGAL+ G  IH Y+  +G ++N  + +AL+ MY+KCG++E +  VF   +++ +  W
Sbjct: 379 CAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVW 438

Query: 439 SVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGL 498
           S MI G A+HG   +A+  F  M+         A  KP+GV F  V  ACSH+G V++  
Sbjct: 439 SAMIGGLAMHGCGNEAVDMFYKMQE--------ANVKPNGVTFTNVFCACSHTGLVDEAE 498

Query: 499 EFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFCACSA 558
             F+ M   Y I P  KHY  +VD+LGR+G L++A+KFI  MPI P   VWGAL  AC  
Sbjct: 499 SLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKI 558

Query: 559 HKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE--------------------- 618
           H N+ +AE+A  +LL+LEP++ G++V LSN YA +G+WE                     
Sbjct: 559 HANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCS 618

BLAST of Sgr025648 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 3.7e-94
Identity = 199/596 (33.39%), Positives = 322/596 (54.03%), Query Frame = 0

Query: 30  VTHFIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFISSC-FSV-NSVDYAVSIFQL 89
           + H + L+++    R L QIH   ++  + + S    + I  C  S+ +++ YA  +   
Sbjct: 5   IHHCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLC 64

Query: 90  FELKNNFLFNALIRGLAENSRLK--------------ISPDRLTFPFVLKSAAALPDGGV 149
           F   + F+FN L+RG +E+                  + PD  +F FV+K+         
Sbjct: 65  FPEPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRT 124

Query: 150 GRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESPDRIKNESVLIWNVLIH 209
           G  +H   LK GLE   FV  +L+ MY     +  A K+FDE    +   +++ WN +I 
Sbjct: 125 GFQMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDE----MHQPNLVAWNAVIT 184

Query: 210 GYCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKGDLGRAKELFEKIPEKNVVSWTTM 269
              R  D+  A E+F+ M  ++  SWN ++ G+++ G+L  AK +F ++P ++ VSW+TM
Sbjct: 185 ACFRGNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTM 244

Query: 270 VNGFSQNGDPEKALETFFYMLEEGARPNDYTIVSALSACAKVGALDTGLRIHKYLSGNGF 329
           + G + NG   ++   F  +   G  PN+ ++   LSAC++ G+ + G  +H ++   G+
Sbjct: 245 IVGIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGY 304

Query: 330 KLNVTIGTALVDMYAKCGNIECAGAVFCKTKEK-CLLTWSVMIWGWAIHGHYKKAIQYFE 389
              V++  AL+DMY++CGN+  A  VF   +EK C+++W+ MI G A+HG  ++A++ F 
Sbjct: 305 SWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFN 364

Query: 390 WMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGLEFFNSMRHVYLIEPSMKHYTL 449
            M +         G  PDG+ F+++L ACSH+G + +G ++F+ M+ VY IEP ++HY  
Sbjct: 365 EMTA--------YGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGC 424

Query: 450 VVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFCACSAHKNIEMAELASEKLLQLEPKH 509
           +VD+ GR+G+L +A  FI  MPI P  +VW  L  ACS+H NIE+AE   ++L +L+P +
Sbjct: 425 MVDLYGRSGKLQKAYDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNN 484

Query: 510 PGSYVFLSNAYAAVGRWE----------MQR-----------------EFIAGD------ 569
            G  V LSNAYA  G+W+          +QR                 +F AG+      
Sbjct: 485 SGDLVLLSNAYATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGID 544

Query: 570 -ITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEKEDALGHHSEKLALA 575
              H+  +EI LRL +      E GYT E+   L+++EEEEKED +  HSEKLALA
Sbjct: 545 IEAHEKLKEIILRLKD------EAGYTPEVASALYDVEEEEKEDQVSKHSEKLALA 582

BLAST of Sgr025648 vs. ExPASy TrEMBL
Match: A0A6J1C9L2 (pentatricopeptide repeat-containing protein At1g04840 OS=Momordica charantia OX=3673 GN=LOC111009584 PE=3 SV=1)

HSP 1 Score: 1001.9 bits (2589), Expect = 1.3e-288
Identity = 515/665 (77.44%), Positives = 546/665 (82.11%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MKDLHVL KP LAFFN+M SSSS Q+ SL THFIDLIHAS+T  KLRQIHAQL RCNIFS
Sbjct: 1   MKDLHVLLKPSLAFFNSMPSSSSSQVQSLETHFIDLIHASNTAHKLRQIHAQLFRCNIFS 60

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+N VDYA+SIFQ FE+KN+FLFNALIRGLAENSR            
Sbjct: 61  SSRVVTQFISSCSSLNLVDYAISIFQRFEMKNSFLFNALIRGLAENSRFEGSIYYFVLML 120

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL  GGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS
Sbjct: 121 KWKISPDRLTFPFVLKSAAALSSGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIKN SVLIWNVLIHGYCRVGDL+KA+ELFETMPKKD GSWNSLINGFM
Sbjct: 181 SALKVFDESPDRIKNGSVLIWNVLIHGYCRVGDLVKATELFETMPKKDTGSWNSLINGFM 240

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKGDLGRAKELFE++PEKNVVSWTTMVNGFSQNGDPEKALETF  MLEEGA+PNDYTIVS
Sbjct: 241 RKGDLGRAKELFERMPEKNVVSWTTMVNGFSQNGDPEKALETFSCMLEEGAQPNDYTIVS 300

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAKVGALD GLRIH YLS NGF+LN+TIGTALVDMYAKCGNIE AG VFC+TKEK 
Sbjct: 301 ALSACAKVGALDAGLRIHNYLSSNGFRLNLTIGTALVDMYAKCGNIESAGEVFCETKEKG 360

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KKAI YFEWMKST        G KPDGVVFLAVLT+CSHSG+V
Sbjct: 361 LLTWSVMIWGWAIHGHFKKAILYFEWMKST--------GMKPDGVVFLAVLTSCSHSGKV 420

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           NDGL+FF+SMR  YLIEPSMKHYTLVVDMLGRAGRLDEAL FIRNMPINPDFVVWGALFC
Sbjct: 421 NDGLKFFDSMRRDYLIEPSMKHYTLVVDMLGRAGRLDEALNFIRNMPINPDFVVWGALFC 480

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRW------------------ 540
           AC AHKNIEMAELASEKLL+LEPKHPGSYVFLSNAYAAVGRW                  
Sbjct: 481 ACRAHKNIEMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWDDAERVRISMRDRGAQKD 540

Query: 541 ---------EMQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                    +    F+AGD TH+ AREIY RLDEISAGAREKGYTS IECVLHNIEEEEK
Sbjct: 541 PGWSFIEVDDKLHRFVAGDKTHNRAREIYSRLDEISAGAREKGYTSGIECVLHNIEEEEK 600

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 619
           EDALGHHSEKLALA G ++  PG  +    N R      S  K  + M     +L  + R
Sbjct: 601 EDALGHHSEKLALAFGLISTTPGTTIRIVKNLRVCVDCHSFMKYASKMNQREIVLRDIKR 657

BLAST of Sgr025648 vs. ExPASy TrEMBL
Match: A0A6J1HJP9 (pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita moschata OX=3662 GN=LOC111464188 PE=3 SV=1)

HSP 1 Score: 965.7 bits (2495), Expect = 1.1e-277
Identity = 499/663 (75.26%), Positives = 539/663 (81.30%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MK+L VLFKPR+AFFN+ SSSSSPQI SL THFIDLIHASD+  KLRQIH QL RCNIFS
Sbjct: 16  MKNLLVLFKPRIAFFNSTSSSSSPQISSLETHFIDLIHASDSTHKLRQIHGQLYRCNIFS 75

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYAV IFQ FELKN+FLFNALIRGLAENSR            
Sbjct: 76  SSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSISYFVCML 135

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVG ALH GILKFGLEFDSFVRVSLVDMYVKV+DL 
Sbjct: 136 RWKISPDRLTFPFVLKSAAALSNGGVGSALHSGILKFGLEFDSFVRVSLVDMYVKVDDLG 195

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIK  +VLIWNVLIHGYCRVG+L+KA+ELFETMP+KD GSWNSLINGFM
Sbjct: 196 SALKVFDESPDRIKKGNVLIWNVLIHGYCRVGNLVKATELFETMPEKDTGSWNSLINGFM 255

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKG LG A ELFEK+PEKNVVSWTTMVNGFSQNGDPEKAL+ FF MLEEGARPNDYTIVS
Sbjct: 256 RKGQLGPAHELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGARPNDYTIVS 315

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIHKYLS +GFKLN TIGTA+VDMYAKCGNIE AG VF + K+K 
Sbjct: 316 ALSACAKLGALDAGLRIHKYLSSHGFKLNQTIGTAVVDMYAKCGNIESAGEVFREIKQKG 375

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KK+IQYFEWMKS        AGTKPDGVVFLAVLTACSHSGQV
Sbjct: 376 LLTWSVMIWGWAIHGHFKKSIQYFEWMKS--------AGTKPDGVVFLAVLTACSHSGQV 435

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           +DGLEFF+SMR  YLIEPSMKHYTL+VDMLGRAGRLDEALKFIR+MPINPDFVVWGALFC
Sbjct: 436 DDGLEFFDSMRRDYLIEPSMKHYTLIVDMLGRAGRLDEALKFIRDMPINPDFVVWGALFC 495

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKNI+MAELASEKLL+LEPKHPGSYVFLSNAYAAVGRWE                 
Sbjct: 496 ACRAHKNIKMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKD 555

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH+ A+EIY +LDEI+AGAREKGYT  IECVLHNIEEEEK
Sbjct: 556 PGWSFMEVDDKLHRFVAGDNTHNRAQEIYSKLDEINAGAREKGYTKGIECVLHNIEEEEK 615

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALGHHSEKLALA G ++  P   +    N R      S  K  + M     IL  + R
Sbjct: 616 EEALGHHSEKLALAFGLVSTAPETTIRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKR 670

BLAST of Sgr025648 vs. ExPASy TrEMBL
Match: A0A6J1KIT8 (pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita maxima OX=3661 GN=LOC111494840 PE=3 SV=1)

HSP 1 Score: 965.3 bits (2494), Expect = 1.4e-277
Identity = 497/663 (74.96%), Positives = 539/663 (81.30%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MK+LHVLFKPR+AFFN+ SSSSSPQI SL T+FIDLIHASD+  KLRQIH QL RCNIFS
Sbjct: 13  MKNLHVLFKPRIAFFNSTSSSSSPQISSLETYFIDLIHASDSTHKLRQIHGQLYRCNIFS 72

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYAV IFQ FELKN+FLFNALIRGLAENSR            
Sbjct: 73  SSRVVTQFISSCSSLNSVDYAVLIFQRFELKNSFLFNALIRGLAENSRFESSISYFVCML 132

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVG ALH GI+KFGLEFDSFVRVSLVDMYVKV+DL 
Sbjct: 133 RWKISPDRLTFPFVLKSAAALSNGGVGSALHSGIVKFGLEFDSFVRVSLVDMYVKVDDLG 192

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESPDRIK  +VLIWNVLIHGYCRVG+L+KA+ELFETMPKKD GSWNSLINGFM
Sbjct: 193 SALKVFDESPDRIKQGNVLIWNVLIHGYCRVGNLVKATELFETMPKKDTGSWNSLINGFM 252

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           RKG LG A ELFEK+PEKNVVSWTTMVNGFSQNGDPEKAL+ FF MLEEGA+PNDYTIVS
Sbjct: 253 RKGQLGPANELFEKMPEKNVVSWTTMVNGFSQNGDPEKALQFFFCMLEEGAQPNDYTIVS 312

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIHKYLS +GFKLN TIGTA+VDMYAKCGNIE AG VF + K+K 
Sbjct: 313 ALSACAKLGALDAGLRIHKYLSSHGFKLNQTIGTAVVDMYAKCGNIESAGEVFGEIKQKG 372

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LLTWSVMIWGWAIHGH+KK+IQYFEWMKST        GTKPDGVVFLAVLTACSHSGQV
Sbjct: 373 LLTWSVMIWGWAIHGHFKKSIQYFEWMKST--------GTKPDGVVFLAVLTACSHSGQV 432

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           +DGLEFF+SMR  YLIEPSMKHYTL+VDMLGRAGRLDEALKFIR+MPINPDFVVWGALFC
Sbjct: 433 DDGLEFFDSMRRDYLIEPSMKHYTLIVDMLGRAGRLDEALKFIRDMPINPDFVVWGALFC 492

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKNI+MAELASEKLL+LEPKHPGSYVFLSNAYAAVGRWE                 
Sbjct: 493 ACRAHKNIKMAELASEKLLELEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKD 552

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH+ A+EIY +LDEI+A AREKGYT  IECVLHNIEEEEK
Sbjct: 553 PGWSFMEVDDKLHRFVAGDNTHNRAQEIYSKLDEINASAREKGYTKGIECVLHNIEEEEK 612

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALGHHSEKLALA G ++  P   +    N R      S  K  + M     IL  + R
Sbjct: 613 EEALGHHSEKLALAFGLISTAPETMIRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKR 667

BLAST of Sgr025648 vs. ExPASy TrEMBL
Match: A0A0A0LI86 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G139850 PE=3 SV=1)

HSP 1 Score: 952.2 bits (2460), Expect = 1.2e-273
Identity = 487/663 (73.45%), Positives = 534/663 (80.54%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MKDLHVLF PR+AFF++M SSSSP I  L THFIDLIHAS++  KLRQIH QL RCN+FS
Sbjct: 13  MKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCNVFS 72

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC S+NSVDYA+SIFQ FELKN++LFNALIRGLAENSR            
Sbjct: 73  SSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLML 132

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVGRALH GILKFGLEFDSFVRVSLVDMYVKVE+L 
Sbjct: 133 KWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELG 192

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESP+ +KN SVLIWNVLIHGYCR+GDL+KA+ELF++MPKKD GSWNSLINGFM
Sbjct: 193 SALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFM 252

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           + GD+GRAKELF K+PEKNVVSWTTMVNGFSQNGDPEKALETFF MLEEGARPNDYTIVS
Sbjct: 253 KMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVS 312

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GLRIH YLSGNGFKLN+ IGTALVDMYAKCGNIE A  VF +TKEK 
Sbjct: 313 ALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKG 372

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LL WSVMIWGWAIHGH++KA+QYFEWMK        F GTKPD VVFLAVL ACSHSGQV
Sbjct: 373 LLIWSVMIWGWAIHGHFRKALQYFEWMK--------FTGTKPDSVVFLAVLNACSHSGQV 432

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           N+GL+FF++MR  YLIEPSMKHYTLVVDMLGRAGRLDEALKFIR MPI PDFVVWGALFC
Sbjct: 433 NEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFC 492

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC  HKN+EMAELAS+KLLQLEPKHPGSYVFLSNAYA+VGRW+                 
Sbjct: 493 ACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKD 552

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH+ A EIY +LDEISA AREKGYT EIECVLHNIEEEEK
Sbjct: 553 PGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEK 612

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALG+HSEKLALA G ++  PG  +    N R      S  K  + M     IL  + R
Sbjct: 613 EEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKR 667

BLAST of Sgr025648 vs. ExPASy TrEMBL
Match: A0A5A7SRY4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold275G00910 PE=3 SV=1)

HSP 1 Score: 941.8 bits (2433), Expect = 1.7e-270
Identity = 484/663 (73.00%), Positives = 530/663 (79.94%), Query Frame = 0

Query: 1   MKDLHVLFKPRLAFFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFS 60
           MKDLHVLF PR+AF ++M SSSS +I SL THFIDLIHAS++  KLRQIH QL RCN+FS
Sbjct: 13  MKDLHVLFNPRIAFLSSMFSSSSLRISSLETHFIDLIHASNSTHKLRQIHGQLYRCNVFS 72

Query: 61  SSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL----------- 120
           SSRVVTQFISSC  +N+VDYAVSIFQ FELKN++LFNALIRGLAENSR            
Sbjct: 73  SSRVVTQFISSCSLLNAVDYAVSIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLML 132

Query: 121 --KISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLS 180
             KISPDRLTFPFVLKSAAAL +GGVGRALH GILKFGL FDSFVRVSLVDMYVKV +L 
Sbjct: 133 KWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLVFDSFVRVSLVDMYVKVGELG 192

Query: 181 SALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFM 240
           SALK+FDESP+ +KN SVLIWNVLIHGYCR+GDL+KA+ELF++MPKKD GSWNSLINGFM
Sbjct: 193 SALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFM 252

Query: 241 RKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 300
           + GD+GRAKELFEK+PEKNVVSWTTMVNGFSQNGDP+KALETFF MLEEGARPNDYTIVS
Sbjct: 253 KMGDMGRAKELFEKMPEKNVVSWTTMVNGFSQNGDPQKALETFFCMLEEGARPNDYTIVS 312

Query: 301 ALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKC 360
           ALSACAK+GALD GL IH YLSGNGFKLN+ IGTALVDM+AKCGNIE A  VF +TKEK 
Sbjct: 313 ALSACAKIGALDAGLSIHNYLSGNGFKLNLVIGTALVDMHAKCGNIEYAEKVFHETKEKG 372

Query: 361 LLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQV 420
           LL WSVMIWGWAIHGH++KA+QYFEWMK        F GTKPD VVFLAVL ACSHSGQV
Sbjct: 373 LLIWSVMIWGWAIHGHFRKALQYFEWMK--------FTGTKPDSVVFLAVLNACSHSGQV 432

Query: 421 NDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFC 480
           N+GL+FF+SMR  YLIEPSMKHYTLVVDMLGRAGRLDEALKFIR MPI PDFVVWGALFC
Sbjct: 433 NEGLKFFDSMRRSYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFC 492

Query: 481 ACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE----------------- 540
           AC AHKN+EMAELASEKLLQLEPKHPGSYVFLSNAYA+VGRW+                 
Sbjct: 493 ACRAHKNVEMAELASEKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDSGAHKD 552

Query: 541 ----------MQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEK 600
                         F+AGD TH  A EIY  LDEISA AREKGYT EIECVLHNIEEEEK
Sbjct: 553 PGWSFIEVDHKLHRFVAGDNTHSRAVEIYSMLDEISASAREKGYTKEIECVLHNIEEEEK 612

Query: 601 EDALGHHSEKLALALGSLAQPPGRPLG-YSNFR------SVNKLLADMLSPTPILLLLPR 617
           E+ALG+HSEKLALA G L+  PG  +    N R      S  K  + +     IL  + R
Sbjct: 613 EEALGYHSEKLALAFGILSTRPGTTVRIVKNLRVCVDCHSFMKYTSKLTKREIILRDMKR 667

BLAST of Sgr025648 vs. TAIR 10
Match: AT1G04840.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 593.6 bits (1529), Expect = 2.1e-169
Identity = 315/627 (50.24%), Positives = 412/627 (65.71%), Query Frame = 0

Query: 1   MKDLHVLFKPRLA----FFNAMSSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRC 60
           MK L V+FKP+ +    +F A   +S  +     +HFI LIHA      LR +HAQ+LR 
Sbjct: 1   MKSLSVIFKPKSSPAKIYFPADRQASPDE-----SHFISLIHACKDTASLRHVHAQILRR 60

Query: 61  NIFSSSRVVTQFISSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENS--------- 120
            +  SSRV  Q +S    + S DY++SIF+  E +N F+ NALIRGL EN+         
Sbjct: 61  GVL-SSRVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHF 120

Query: 121 ----RLKISPDRLTFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKV 180
               RL + PDRLTFPFVLKS + L    +GRALH   LK  ++ DSFVR+SLVDMY K 
Sbjct: 121 ILMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKT 180

Query: 181 EDLSSALKLFDESPDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLI 240
             L  A ++F+ESPDRIK ES+LIWNVLI+GYCR  D+  A+ LF +MP++++GSW++LI
Sbjct: 181 GQLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLI 240

Query: 241 NGFMRKGDLGRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDY 300
            G++  G+L RAK+LFE +PEKNVVSWTT++NGFSQ GD E A+ T+F MLE+G +PN+Y
Sbjct: 241 KGYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEY 300

Query: 301 TIVSALSACAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKT 360
           TI + LSAC+K GAL +G+RIH Y+  NG KL+  IGTALVDMYAKCG ++CA  VF   
Sbjct: 301 TIAAVLSACSKSGALGSGIRIHGYILDNGIKLDRAIGTALVDMYAKCGELDCAATVFSNM 360

Query: 361 KEKCLLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSH 420
             K +L+W+ MI GWA+HG + +AIQ F  M        M++G KPD VVFLAVLTAC +
Sbjct: 361 NHKDILSWTAMIQGWAVHGRFHQAIQCFRQM--------MYSGEKPDEVVFLAVLTACLN 420

Query: 421 SGQVNDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWG 480
           S +V+ GL FF+SMR  Y IEP++KHY LVVD+LGRAG+L+EA + + NMPINPD   W 
Sbjct: 421 SSEVDLGLNFFDSMRLDYAIEPTLKHYVLVVDLLGRAGKLNEAHELVENMPINPDLTTWA 480

Query: 481 ALFCACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVG--------RWEMQR-- 540
           AL+ AC AHK    AE  S+ LL+L+P+  GSY+FL   +A+ G        R  +Q+  
Sbjct: 481 ALYRACKAHKGYRRAESVSQNLLELDPELCGSYIFLDKTHASKGNIQDVEKRRLSLQKRI 540

Query: 541 -----------------EFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIE 584
                            +F AGD +H   +EI L+LDEI + A +KGY    +  +H+IE
Sbjct: 541 KERSLGWSYIELDGQLNKFSAGDYSHKLTQEIGLKLDEIISLAIQKGYNPGADWSIHDIE 600

BLAST of Sgr025648 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 376.3 bits (965), Expect = 5.3e-104
Identity = 237/734 (32.29%), Positives = 354/734 (48.23%), Query Frame = 0

Query: 15  FNAMSSSSSPQIPSLVTH-FIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFISSCF 74
           F+ + SSS P   S+  H  + L+H   T + LR IHAQ+++  + +++  +++ I  C 
Sbjct: 17  FHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCI 76

Query: 75  ---SVNSVDYAVSIFQLFELKNNFLFNALIRGLAENS-------------RLKISPDRLT 134
                  + YA+S+F+  +  N  ++N + RG A +S              L + P+  T
Sbjct: 77  LSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYT 136

Query: 135 FPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESP 194
           FPFVLKS A       G+ +HG +LK G + D +V  SL+ MYV+   L  A K+FD+SP
Sbjct: 137 FPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP 196

Query: 195 DRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKG------- 254
            R     V+ +  LI GY   G +  A +LF+ +P KD  SWN++I+G+   G       
Sbjct: 197 HR----DVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALE 256

Query: 255 ------------------------------DLGR-------------------------- 314
                                         +LGR                          
Sbjct: 257 LFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYS 316

Query: 315 -------AKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYMLEEGARPNDYTIVS 374
                  A  LFE++P K+V+SW T++ G++     ++AL  F  ML  G  PND T++S
Sbjct: 317 KCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLS 376

Query: 375 ALSACAKVGALDTGLRIHKYLSG--NGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKE 434
            L ACA +GA+D G  IH Y+     G     ++ T+L+DMYAKCG+IE A  VF     
Sbjct: 377 ILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILH 436

Query: 435 KCLLTWSVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSG 494
           K L +W+ MI+G+A+HG    +   F  M+          G +PD + F+ +L+ACSHSG
Sbjct: 437 KSLSSWNAMIFGFAMHGRADASFDLFSRMRK--------IGIQPDDITFVGLLSACSHSG 496

Query: 495 QVNDGLEFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGAL 554
            ++ G   F +M   Y + P ++HY  ++D+LG +G   EA + I  M + PD V+W +L
Sbjct: 497 MLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSL 556

Query: 555 FCACSAHKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRW---------------- 614
             AC  H N+E+ E  +E L+++EP++PGSYV LSN YA+ GRW                
Sbjct: 557 LKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMK 616

Query: 615 -----------EMQREFIAGDITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEE 626
                       +  EFI GD  H   REIY  L+E+     + G+  +   VL  +EEE
Sbjct: 617 KVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEE 676

BLAST of Sgr025648 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 359.0 bits (920), Expect = 8.8e-99
Identity = 207/695 (29.78%), Positives = 352/695 (50.65%), Query Frame = 0

Query: 18  MSSSSSPQIPSLVTH--------FIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFI 77
           MS +S    P L T+        +  LI ++    +L+QIHA+LL   +  S  ++T+ I
Sbjct: 1   MSEASCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLI 60

Query: 78  SSCFSVNSVDYAVSIFQLFELKNNFLFNALIRGLAENSRL-------------KISPDRL 137
            +  S   + +A  +F        F +NA+IRG + N+               ++SPD  
Sbjct: 61  HASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSF 120

Query: 138 TFPFVLKSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDES 197
           TFP +LK+ + L    +GR +H  + + G + D FV+  L+ +Y K   L SA  +F+  
Sbjct: 121 TFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGL 180

Query: 198 PDRIKNESVLIWNVLIHGYCRVGDLIKASELFETMPKKDA-GSWNSLIN----------- 257
           P  +   +++ W  ++  Y + G+ ++A E+F  M K D    W +L++           
Sbjct: 181 P--LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDL 240

Query: 258 ---------------------------GFMRKGDLGRAKELFEKIPEKNVVSWTTMVNGF 317
                                       + + G +  AK LF+K+   N++ W  M++G+
Sbjct: 241 KQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGY 300

Query: 318 SQNGDPEKALETFFYMLEEGARPNDYTIVSALSACAKVGALDTGLRIHKYLSGNGFKLNV 377
           ++NG   +A++ F  M+ +  RP+  +I SA+SACA+VG+L+    +++Y+  + ++ +V
Sbjct: 301 AKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDV 360

Query: 378 TIGTALVDMYAKCGNIECAGAVFCKTKEKCLLTWSVMIWGWAIHGHYKKAIQYFEWMKST 437
            I +AL+DM+AKCG++E A  VF +T ++ ++ WS MI G+ +HG  ++AI  +  M+  
Sbjct: 361 FISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMER- 420

Query: 438 DSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGLEFFNSMRHVYLIEPSMKHYTLVVDML 497
                   G  P+ V FL +L AC+HSG V +G  FFN M   + I P  +HY  V+D+L
Sbjct: 421 -------GGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 480

Query: 498 GRAGRLDEALKFIRNMPINPDFVVWGALFCACSAHKNIEMAELASEKLLQLEPKHPGSYV 557
           GRAG LD+A + I+ MP+ P   VWGAL  AC  H+++E+ E A+++L  ++P + G YV
Sbjct: 481 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 540

Query: 558 FLSNAYAAVGRWEMQRE---------------------------FIAGDITHDHAREIYL 617
            LSN YAA   W+   E                           F  GD +H    EI  
Sbjct: 541 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 600

Query: 618 RLDEISAGAREKGYTSEIECVLHNIEEEEKEDALGHHSEKLALALGSLAQPPGRPLGYS- 619
           +++ I +  +E G+ +  +  LH++ +EE E+ L  HSE++A+A G ++ P G PL  + 
Sbjct: 601 QVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITK 660

BLAST of Sgr025648 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 357.8 bits (917), Expect = 2.0e-98
Identity = 226/722 (31.30%), Positives = 351/722 (48.61%), Query Frame = 0

Query: 19  SSSSSPQIPSLVTHFIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQF--ISSCFSVN 78
           S+ + P   +  +  I LI    + R+L+Q H  ++R   FS     ++   +++  S  
Sbjct: 19  SNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA 78

Query: 79  SVDYAVSIFQLFELKNNFLFNALIRGLAEN--------------SRLKISPDRLTFPFVL 138
           S++YA  +F      N+F +N LIR  A                S  +  P++ TFPF++
Sbjct: 79  SLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLI 138

Query: 139 KSAAALPDGGVGRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESPDRIKN 198
           K+AA +    +G++LHG  +K  +  D FV  SL+  Y    DL SA K+F      IK 
Sbjct: 139 KAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVF----TTIKE 198

Query: 199 ESVLIWNVLIHG------------------------------------------------ 258
           + V+ WN +I+G                                                
Sbjct: 199 KDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQV 258

Query: 259 ----------------------YCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKGDL 318
                                 Y + G +  A  LF+ M +KD  +W ++++G+    D 
Sbjct: 259 CSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDY 318

Query: 319 GRAKELFEKIPEKNVVSWTTMVNGFSQNGDPEKALETFFYM-LEEGARPNDYTIVSALSA 378
             A+E+   +P+K++V+W  +++ + QNG P +AL  F  + L++  + N  T+VS LSA
Sbjct: 319 EAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSA 378

Query: 379 CAKVGALDTGLRIHKYLSGNGFKLNVTIGTALVDMYAKCGNIECAGAVFCKTKEKCLLTW 438
           CA+VGAL+ G  IH Y+  +G ++N  + +AL+ MY+KCG++E +  VF   +++ +  W
Sbjct: 379 CAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVW 438

Query: 439 SVMIWGWAIHGHYKKAIQYFEWMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGL 498
           S MI G A+HG   +A+  F  M+         A  KP+GV F  V  ACSH+G V++  
Sbjct: 439 SAMIGGLAMHGCGNEAVDMFYKMQE--------ANVKPNGVTFTNVFCACSHTGLVDEAE 498

Query: 499 EFFNSMRHVYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFCACSA 558
             F+ M   Y I P  KHY  +VD+LGR+G L++A+KFI  MPI P   VWGAL  AC  
Sbjct: 499 SLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKI 558

Query: 559 HKNIEMAELASEKLLQLEPKHPGSYVFLSNAYAAVGRWE--------------------- 618
           H N+ +AE+A  +LL+LEP++ G++V LSN YA +G+WE                     
Sbjct: 559 HANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCS 618

BLAST of Sgr025648 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 347.4 bits (890), Expect = 2.7e-95
Identity = 199/596 (33.39%), Positives = 322/596 (54.03%), Query Frame = 0

Query: 30  VTHFIDLIHASDTPRKLRQIHAQLLRCNIFSSSRVVTQFISSC-FSV-NSVDYAVSIFQL 89
           + H + L+++    R L QIH   ++  + + S    + I  C  S+ +++ YA  +   
Sbjct: 5   IHHCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLC 64

Query: 90  FELKNNFLFNALIRGLAENSRLK--------------ISPDRLTFPFVLKSAAALPDGGV 149
           F   + F+FN L+RG +E+                  + PD  +F FV+K+         
Sbjct: 65  FPEPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRT 124

Query: 150 GRALHGGILKFGLEFDSFVRVSLVDMYVKVEDLSSALKLFDESPDRIKNESVLIWNVLIH 209
           G  +H   LK GLE   FV  +L+ MY     +  A K+FDE    +   +++ WN +I 
Sbjct: 125 GFQMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDE----MHQPNLVAWNAVIT 184

Query: 210 GYCRVGDLIKASELFETMPKKDAGSWNSLINGFMRKGDLGRAKELFEKIPEKNVVSWTTM 269
              R  D+  A E+F+ M  ++  SWN ++ G+++ G+L  AK +F ++P ++ VSW+TM
Sbjct: 185 ACFRGNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTM 244

Query: 270 VNGFSQNGDPEKALETFFYMLEEGARPNDYTIVSALSACAKVGALDTGLRIHKYLSGNGF 329
           + G + NG   ++   F  +   G  PN+ ++   LSAC++ G+ + G  +H ++   G+
Sbjct: 245 IVGIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGY 304

Query: 330 KLNVTIGTALVDMYAKCGNIECAGAVFCKTKEK-CLLTWSVMIWGWAIHGHYKKAIQYFE 389
              V++  AL+DMY++CGN+  A  VF   +EK C+++W+ MI G A+HG  ++A++ F 
Sbjct: 305 SWIVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFN 364

Query: 390 WMKSTDSKLSMFAGTKPDGVVFLAVLTACSHSGQVNDGLEFFNSMRHVYLIEPSMKHYTL 449
            M +         G  PDG+ F+++L ACSH+G + +G ++F+ M+ VY IEP ++HY  
Sbjct: 365 EMTA--------YGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGC 424

Query: 450 VVDMLGRAGRLDEALKFIRNMPINPDFVVWGALFCACSAHKNIEMAELASEKLLQLEPKH 509
           +VD+ GR+G+L +A  FI  MPI P  +VW  L  ACS+H NIE+AE   ++L +L+P +
Sbjct: 425 MVDLYGRSGKLQKAYDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNN 484

Query: 510 PGSYVFLSNAYAAVGRWE----------MQR-----------------EFIAGD------ 569
            G  V LSNAYA  G+W+          +QR                 +F AG+      
Sbjct: 485 SGDLVLLSNAYATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGID 544

Query: 570 -ITHDHAREIYLRLDEISAGAREKGYTSEIECVLHNIEEEEKEDALGHHSEKLALA 575
              H+  +EI LRL +      E GYT E+   L+++EEEEKED +  HSEKLALA
Sbjct: 545 IEAHEKLKEIILRLKD------EAGYTPEVASALYDVEEEEKEDQVSKHSEKLALA 582

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022138400.1  2.8e-288  77.44         pentatricopeptide repeat-containing protein At1g04840 [Momordica charantia] >XP_...
XP_023513771.1  1.5e-278  75.11         pentatricopeptide repeat-containing protein At1g04840 [Cucurbita pepo subsp. pep...
KAG7026055.1    1.7e-277  77.15         Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022964045.1  2.2e-277  75.26         pentatricopeptide repeat-containing protein At1g04840 [Cucurbita moschata]
XP_023000600.1  2.9e-277  74.96         pentatricopeptide repeat-containing protein At1g04840 [Cucurbita maxima]

Match Name      E-value   Identity (%)  Description
Q9MAT2          3.0e-168  50.24         Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana OX...
Q9LN01          7.5e-103  32.29         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9LTV8          1.2e-97   29.78         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
O82380          2.8e-97   31.30         Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9CA54          3.7e-94   33.39         Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity (%)  Description
A0A6J1C9L2      1.3e-288  77.44         pentatricopeptide repeat-containing protein At1g04840 OS=Momordica charantia OX=...
A0A6J1HJP9      1.1e-277  75.26         pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita moschata OX=3...
A0A6J1KIT8      1.4e-277  74.96         pentatricopeptide repeat-containing protein At1g04840 OS=Cucurbita maxima OX=366...
A0A0A0LI86      1.2e-273  73.45         DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G1398...
A0A5A7SRY4      1.7e-270  73.00         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name      E-value   Identity (%)  Description
AT1G04840.1     2.1e-169  50.24         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1     5.3e-104  32.29         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1     8.8e-99   29.78         mitochondrial editing factor 22
AT2G29760.1     2.0e-98   31.30         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74630.1     2.7e-95   33.39         Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source       Source Term       Source Description                                           Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain                              coord: 300..382, e-value: 1.0E-11, score: 46.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain                              coord: 197..299, e-value: 1.5E-26, score: 94.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain                              coord: 12..179, e-value: 6.6E-8, score: 34.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain                              coord: 383..527, e-value: 1.2E-22, score: 82.7
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452             TPR-like                                                     coord: 226..510
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                                                    coord: 187..214, e-value: 1.6E-5, score: 22.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                                                    coord: 218..248, e-value: 1.0E-4, score: 20.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                                                    coord: 248..281, e-value: 1.1E-7, score: 29.6
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                                                          coord: 429..453, e-value: 0.023, score: 14.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                                                          coord: 350..374, e-value: 0.16, score: 12.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                                                          coord: 392..418, e-value: 0.2, score: 12.0
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                                                          coord: 187..214, e-value: 6.4E-7, score: 29.2
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                                                        coord: 245..294, e-value: 1.1E-10, score: 41.6
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                                                          coord: 184..218, score: 10.544828
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                                                          coord: 246..280, score: 12.375365
IPR032867  DYW domain                                         PFAM         PF14432           DYW_deaminase                                                coord: 508..585, e-value: 2.7E-13, score: 49.9
None       No IPR available                                   PANTHER      PTHR24015         OS07G0578800 PROTEIN-RELATED                                 coord: 19..510
None       No IPR available                                   PANTHER      PTHR24015:SF1865  TETRATRICOPEPTIDE REPEAT-LIKE SUPERFAMILY PROTEIN ISOFORM 1  coord: 19..510

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr025648.1   Sgr025648.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0008270      zinc ion binding