Sgr018481 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr018481
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153204: 409124 .. 411904 (+)
RNA-Seq Expression: Sgr018481
Synteny: Sgr018481
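
The location field above packs contig, start, end and strand into one string. As a minimal sketch, the snippet below splits that string and derives the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Pattern for "contig: start .. end (strand)" as printed in the overview.
LOCATION = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into contig, start, end, strand and length."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "contig": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: 1-based inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("tig00153204: 409124 .. 411904 (+)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 2781 bases on the plus strand.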
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATGGAGTCCTCACCGCCGTCCGATACCCTACGATGATTAGATATTCTACTGCCATTAACTCAGGTCAGCTCCTCATCATCCTTGGATTCAGGCTCAGACTCACATTCACACTCACACTCAAGTTCTTCACATCAACTGCTTCTCTCCCTCAAAGCCTTCCCGTAGAACACGATATCTCAGCGCAGCTCTTCTCCATTCTCTCCCACCCCAATTGGCAGAAGCATCCTTCTCTGAAAAATTTAATCCCATCCATTGCTCCGTCTCATATATCTACCCTTTTCACCCACAATCTCGATCCTCAAACTGCCCTTGCGTTTTTCAATTGGATCGGACAGAAGCATGGATTCAAACACAATGTTCAATCCTATATGTCTATGTTAAATATCCTTGTTCCCAATGGGTACCTCCGCATTGCTGAAAAGATGCGAATTTTAATGATAAAGTCGACGGATTCCTCAGAAAATGCGCTGTTCGTGTTGGAAATGCTGCGAAGCATGAACCGCCGGGGGGATGATTTCAAATTTAAGCTCACTCTCAGGTGCTATAACATGCTTTTGATGTTGTTGTCGAGGTTTCTCATGATTGATGAAATGAAAAGTGTGTATTTAGAGATGTTGGATGATATGGTTACACCGAATATATATACACTCAACACAATGGTAAATGGATATTGTAAATTGGGTAATGTAGTTGAAGCAGAGTTGTATGTCAGTAAGATAGTGCAAGCCGGTTTGAGTTTGGATACTTTTACTTACACGTCTTTGATATTAGGATATTGTAGGAATAGGAATGTAGATGGTGCATATAGAATTTTTCTGTCAATGCCGAGTAAAGGTTGCCGCAGAAATGAGGTTTCTTATACTAATCTGATTCATGGGCTTTGTGAAGCCAGGAGAATTGATGAAGCTCTAAAATTGTTTTCACAAATGCATGAGGATAATTGTTGGCCAACTGTTCGTACGTATACAATTATCATATGTGCATTGTGCCAATTGAGCAGGAAACTAGAAGGATTTAATATGTTTAAGGAGATGACTGAGAAAGGTTGTGAACCAAATGTACATACCTATACGGTCCTTATACATAGTTTATGTGAGGACCAAAACTTTGATGATGCCAAGAAAATGCTAAATGGTATGCTTGAGAAAGGATTGGTTCCAAGTGTGGTCACTTACAATGCCTTAATCGATGGTTATTGCAAGAAAGGAATGAGTATGAGTGCCTTGGAAATTTTGAGCTTGATGGAATCAAATAATTGTAGTCCAAATGCTCGCACTTATAATGAATTGATATTGGGGTTTTGCAGGGCAAAGGATGTCCACAAGGCCATGGCACTACTATATAAAATGCTTGAGCGGAAGCTTCAACCAGATGTAGTTACCTACAACCTATTAATCCATGGACAGTGCAAAGGAGGTCATCTGGGTAGTGCTTATAAGCTGCTTGGTTTGATGAATGAAAGTGGTTTGGTTCCTGATGAATGGACTTACAGTGTCTTCGTGGATACACTTTGTAAAAGAGGGCGGGTTGAAGAAGCTCATCTTCTCTTTGACTCTCTTAAGGAGAAAGGCGTAAAGGCAAATGAAGTAATATATAGTGCTTTGATTGATGGTTATTGCAAAGTTGGAAAAGTCAGTGATGGTCACTCCTTACTTGATAAAATGCTTGGTGATGGATGCATTCCGAATTCAATTACTTATAATTCCTTGATTGATGGATATTGCAGAGAGAAAAATTTTCAAGAAGCTCTTGTACTTGTGGAAATAATGATAAAGAGGGACATTAATCCTACTGCTGATACTTACACCATTCTTATAGAAAATTTATTAGAAGATGGTGAGTTTGACCGTGCCCATAATATGTTTGATCAAATGCTTTCCACAGGTTCTCATCCTGATGTATTTACATATACTGCATTTATTCATGCATATTGTAGTCAGGGTAGACTAAAAGACGCAGAGCTTTTTATTTATAAAATGAATGAAAAAGGAATAAT
GCCAGACACTTTGCTTTATACATTATTGATTGATGCGTATGGACGGTTTGGATCAATTGGTTGTGCTTTTGACATTCTGAAGCGTATGTATGATGTTGGTTGTGAGCCATCTTTTCACACATATTCTTATTTAATTAAACATCTATCAAATGCAAAGCTGACAAAAGTAAATAGCAGTTCAGAGTTGAATGACCTGTCATCAGGGGTTGCCTCCAATGATTTTGCCAACTTATGGAAGAGAGTAGATTATGAATTCGCTTTGGAGTTATTTGAGAAAATGGTCAAGCATGGTTGTGCACCTAATGCTAATACTTATGGCAAGTTTATTACAGGTCTTTGCAAGGTGGGATGCTTGGAAGTAGCCCACAGGTTATATGACCATATGAAAGAAAAAGGACTATCGCCTAATGAAGACATTTACAACTGTCTTCTTGGTTGTTCTTGTCAATTGGGATTGTATGACAAAGCAATAAGGTGGTTAGATATCATGATAGAGCATGGATATTTACCGCATTTAGATTCGTGCAAGCTGCTGGTTTGTGGGTTGTATGATGAAGGAAATAATGAGAAAGCAAAAATAGTGTTTTATAGTTTACTTCAGTGTGGGTATAATTATGATGAAATGGCTTGGAAAGTACTTATCGATGGCTTACTTAAGAAGGGCCTTGTTGATAAATGCTCTGAACTATTTGGCATCATGGAGAGACAAGGTTGCCGAATTCATCCTAAGACATATAGTATGTTGATTGAGGGATTTGATGGAATTCAGGATATGGATTAG

mRNA sequence

ATGCATGGAGTCCTCACCGCCGTCCGATACCCTACGATGATTAGATATTCTACTGCCATTAACTCAGGTCAGCTCCTCATCATCCTTGGATTCAGGCTCAGACTCACATTCACACTCACACTCAAGTTCTTCACATCAACTGCTTCTCTCCCTCAAAGCCTTCCCGTAGAACACGATATCTCAGCGCAGCTCTTCTCCATTCTCTCCCACCCCAATTGGCAGAAGCATCCTTCTCTGAAAAATTTAATCCCATCCATTGCTCCGTCTCATATATCTACCCTTTTCACCCACAATCTCGATCCTCAAACTGCCCTTGCGTTTTTCAATTGGATCGGACAGAAGCATGGATTCAAACACAATGTTCAATCCTATATGTCTATGTTAAATATCCTTGTTCCCAATGGGTACCTCCGCATTGCTGAAAAGATGCGAATTTTAATGATAAAGTCGACGGATTCCTCAGAAAATGCGCTGTTCGTGTTGGAAATGCTGCGAAGCATGAACCGCCGGGGGGATGATTTCAAATTTAAGCTCACTCTCAGGTGCTATAACATGCTTTTGATGTTGTTGTCGAGGTTTCTCATGATTGATGAAATGAAAAGTGTGTATTTAGAGATGTTGGATGATATGGTTACACCGAATATATATACACTCAACACAATGGTAAATGGATATTGTAAATTGGGTAATGTAGTTGAAGCAGAGTTGTATGTCAGTAAGATAGTGCAAGCCGGTTTGAGTTTGGATACTTTTACTTACACGTCTTTGATATTAGGATATTGTAGGAATAGGAATGTAGATGGTGCATATAGAATTTTTCTGTCAATGCCGAGTAAAGGTTGCCGCAGAAATGAGGTTTCTTATACTAATCTGATTCATGGGCTTTGTGAAGCCAGGAGAATTGATGAAGCTCTAAAATTGTTTTCACAAATGCATGAGGATAATTGTTGGCCAACTGTTCGTACGTATACAATTATCATATGTGCATTGTGCCAATTGAGCAGGAAACTAGAAGGATTTAATATGTTTAAGGAGATGACTGAGAAAGGTTGTGAACCAAATGTACATACCTATACGGTCCTTATACATAGTTTATGTGAGGACCAAAACTTTGATGATGCCAAGAAAATGCTAAATGGTATGCTTGAGAAAGGATTGGTTCCAAGTGTGGTCACTTACAATGCCTTAATCGATGGTTATTGCAAGAAAGGAATGAGTATGAGTGCCTTGGAAATTTTGAGCTTGATGGAATCAAATAATTGTAGTCCAAATGCTCGCACTTATAATGAATTGATATTGGGGTTTTGCAGGGCAAAGGATGTCCACAAGGCCATGGCACTACTATATAAAATGCTTGAGCGGAAGCTTCAACCAGATGTAGTTACCTACAACCTATTAATCCATGGACAGTGCAAAGGAGGTCATCTGGGTAGTGCTTATAAGCTGCTTGGTTTGATGAATGAAAGTGGTTTGGTTCCTGATGAATGGACTTACAGTGTCTTCGTGGATACACTTTGTAAAAGAGGGCGGGTTGAAGAAGCTCATCTTCTCTTTGACTCTCTTAAGGAGAAAGGCGTAAAGGCAAATGAAGTAATATATAGTGCTTTGATTGATGGTTATTGCAAAGTTGGAAAAGTCAGTGATGGTCACTCCTTACTTGATAAAATGCTTGGTGATGGATGCATTCCGAATTCAATTACTTATAATTCCTTGATTGATGGATATTGCAGAGAGAAAAATTTTCAAGAAGCTCTTGTACTTGTGGAAATAATGATAAAGAGGGACATTAATCCTACTGCTGATACTTACACCATTCTTATAGAAAATTTATTAGAAGATGGTGAGTTTGACCGTGCCCATAATATGTTTGATCAAATGCTTTCCACAGGTTCTCATCCTGATGTATTTACATATACTGCATTTATTCATGCATATTGTAGTCAGGGTAGACTAAAAGACGCAGAGCTTTTTATTTATAAAATGAATGAAAAAGGAATAAT
GCCAGACACTTTGCTTTATACATTATTGATTGATGCGTATGGACGGTTTGGATCAATTGGTTGTGCTTTTGACATTCTGAAGCGTATGTATGATGTTGGTTGTGAGCCATCTTTTCACACATATTCTTATTTAATTAAACATCTATCAAATGCAAAGCTGACAAAAGTAAATAGCAGTTCAGAGTTGAATGACCTGTCATCAGGGGTTGCCTCCAATGATTTTGCCAACTTATGGAAGAGAGTAGATTATGAATTCGCTTTGGAGTTATTTGAGAAAATGGTCAAGCATGGTTGTGCACCTAATGCTAATACTTATGGCAAGTTTATTACAGGTCTTTGCAAGGTGGGATGCTTGGAAGTAGCCCACAGGTTATATGACCATATGAAAGAAAAAGGACTATCGCCTAATGAAGACATTTACAACTGTCTTCTTGGTTGTTCTTGTCAATTGGGATTGTATGACAAAGCAATAAGGTGGTTAGATATCATGATAGAGCATGGATATTTACCGCATTTAGATTCGTGCAAGCTGCTGGTTTGTGGGTTGTATGATGAAGGAAATAATGAGAAAGCAAAAATAGTGTTTTATAGTTTACTTCAGTGTGGGTATAATTATGATGAAATGGCTTGGAAAGTACTTATCGATGGCTTACTTAAGAAGGGCCTTGTTGATAAATGCTCTGAACTATTTGGCATCATGGAGAGACAAGGTTGCCGAATTCATCCTAAGACATATAGTATGTTGATTGAGGGATTTGATGGAATTCAGGATATGGATTAG

Coding sequence (CDS)

ATGCATGGAGTCCTCACCGCCGTCCGATACCCTACGATGATTAGATATTCTACTGCCATTAACTCAGGTCAGCTCCTCATCATCCTTGGATTCAGGCTCAGACTCACATTCACACTCACACTCAAGTTCTTCACATCAACTGCTTCTCTCCCTCAAAGCCTTCCCGTAGAACACGATATCTCAGCGCAGCTCTTCTCCATTCTCTCCCACCCCAATTGGCAGAAGCATCCTTCTCTGAAAAATTTAATCCCATCCATTGCTCCGTCTCATATATCTACCCTTTTCACCCACAATCTCGATCCTCAAACTGCCCTTGCGTTTTTCAATTGGATCGGACAGAAGCATGGATTCAAACACAATGTTCAATCCTATATGTCTATGTTAAATATCCTTGTTCCCAATGGGTACCTCCGCATTGCTGAAAAGATGCGAATTTTAATGATAAAGTCGACGGATTCCTCAGAAAATGCGCTGTTCGTGTTGGAAATGCTGCGAAGCATGAACCGCCGGGGGGATGATTTCAAATTTAAGCTCACTCTCAGGTGCTATAACATGCTTTTGATGTTGTTGTCGAGGTTTCTCATGATTGATGAAATGAAAAGTGTGTATTTAGAGATGTTGGATGATATGGTTACACCGAATATATATACACTCAACACAATGGTAAATGGATATTGTAAATTGGGTAATGTAGTTGAAGCAGAGTTGTATGTCAGTAAGATAGTGCAAGCCGGTTTGAGTTTGGATACTTTTACTTACACGTCTTTGATATTAGGATATTGTAGGAATAGGAATGTAGATGGTGCATATAGAATTTTTCTGTCAATGCCGAGTAAAGGTTGCCGCAGAAATGAGGTTTCTTATACTAATCTGATTCATGGGCTTTGTGAAGCCAGGAGAATTGATGAAGCTCTAAAATTGTTTTCACAAATGCATGAGGATAATTGTTGGCCAACTGTTCGTACGTATACAATTATCATATGTGCATTGTGCCAATTGAGCAGGAAACTAGAAGGATTTAATATGTTTAAGGAGATGACTGAGAAAGGTTGTGAACCAAATGTACATACCTATACGGTCCTTATACATAGTTTATGTGAGGACCAAAACTTTGATGATGCCAAGAAAATGCTAAATGGTATGCTTGAGAAAGGATTGGTTCCAAGTGTGGTCACTTACAATGCCTTAATCGATGGTTATTGCAAGAAAGGAATGAGTATGAGTGCCTTGGAAATTTTGAGCTTGATGGAATCAAATAATTGTAGTCCAAATGCTCGCACTTATAATGAATTGATATTGGGGTTTTGCAGGGCAAAGGATGTCCACAAGGCCATGGCACTACTATATAAAATGCTTGAGCGGAAGCTTCAACCAGATGTAGTTACCTACAACCTATTAATCCATGGACAGTGCAAAGGAGGTCATCTGGGTAGTGCTTATAAGCTGCTTGGTTTGATGAATGAAAGTGGTTTGGTTCCTGATGAATGGACTTACAGTGTCTTCGTGGATACACTTTGTAAAAGAGGGCGGGTTGAAGAAGCTCATCTTCTCTTTGACTCTCTTAAGGAGAAAGGCGTAAAGGCAAATGAAGTAATATATAGTGCTTTGATTGATGGTTATTGCAAAGTTGGAAAAGTCAGTGATGGTCACTCCTTACTTGATAAAATGCTTGGTGATGGATGCATTCCGAATTCAATTACTTATAATTCCTTGATTGATGGATATTGCAGAGAGAAAAATTTTCAAGAAGCTCTTGTACTTGTGGAAATAATGATAAAGAGGGACATTAATCCTACTGCTGATACTTACACCATTCTTATAGAAAATTTATTAGAAGATGGTGAGTTTGACCGTGCCCATAATATGTTTGATCAAATGCTTTCCACAGGTTCTCATCCTGATGTATTTACATATACTGCATTTATTCATGCATATTGTAGTCAGGGTAGACTAAAAGACGCAGAGCTTTTTATTTATAAAATGAATGAAAAAGGAATAAT
GCCAGACACTTTGCTTTATACATTATTGATTGATGCGTATGGACGGTTTGGATCAATTGGTTGTGCTTTTGACATTCTGAAGCGTATGTATGATGTTGGTTGTGAGCCATCTTTTCACACATATTCTTATTTAATTAAACATCTATCAAATGCAAAGCTGACAAAAGTAAATAGCAGTTCAGAGTTGAATGACCTGTCATCAGGGGTTGCCTCCAATGATTTTGCCAACTTATGGAAGAGAGTAGATTATGAATTCGCTTTGGAGTTATTTGAGAAAATGGTCAAGCATGGTTGTGCACCTAATGCTAATACTTATGGCAAGTTTATTACAGGTCTTTGCAAGGTGGGATGCTTGGAAGTAGCCCACAGGTTATATGACCATATGAAAGAAAAAGGACTATCGCCTAATGAAGACATTTACAACTGTCTTCTTGGTTGTTCTTGTCAATTGGGATTGTATGACAAAGCAATAAGGTGGTTAGATATCATGATAGAGCATGGATATTTACCGCATTTAGATTCGTGCAAGCTGCTGGTTTGTGGGTTGTATGATGAAGGAAATAATGAGAAAGCAAAAATAGTGTTTTATAGTTTACTTCAGTGTGGGTATAATTATGATGAAATGGCTTGGAAAGTACTTATCGATGGCTTACTTAAGAAGGGCCTTGTTGATAAATGCTCTGAACTATTTGGCATCATGGAGAGACAAGGTTGCCGAATTCATCCTAAGACATATAGTATGTTGATTGAGGGATTTGATGGAATTCAGGATATGGATTAG

Protein sequence

MHGVLTAVRYPTMIRYSTAINSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHDISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLTLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPTADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGLCKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGIMERQGCRIHPKTYSMLIEGFDGIQDMD
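
The protein sequence is the conceptual translation of the coding sequence. The following is a minimal sketch of that translation using the standard genetic code (NCBI translation table 1), which is assumed here because the page does not name the table it used. `translate` is a hypothetical helper, and whitespace is stripped first because the sequences on this page wrap across lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS shown above.
print(translate("ATGCATGGAGTCCTCACCGCC"))
print(translate("GATATGGATTAG"))
```

The two calls print `MHGVLTA` and `DMD`, matching the start and end of the protein sequence listed above.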
Homology
BLAST of Sgr018481 vs. NCBI nr
Match: XP_022153102.1 (pentatricopeptide repeat-containing protein At5g65560 isoform X1 [Momordica charantia])

HSP 1 Score: 1751.5 bits (4535), Expect = 0.0e+00
Identity = 849/927 (91.59%), Positives = 884/927 (95.36%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGVLTAVR  TMIRY TA INSGQL I+LGFRLRLTFTL LKFFTSTASLPQSLPVEHD
Sbjct: 13  MHGVLTAVRCRTMIRYPTAIINSGQLFIVLGFRLRLTFTLNLKFFTSTASLPQSLPVEHD 72

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           ISAQLFSILS PNWQKHPSLKNLIPSIAPSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 73  ISAQLFSILSRPNWQKHPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFKH 132

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY SMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT
Sbjct: 133 NVQSYTSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 192

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFL++DEM+SVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS
Sbjct: 193 LRCYNMLLMLLSRFLLVDEMRSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 252

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVDGAYRIFLSMP+KGCRRNEVSYTNLIHG C+A+
Sbjct: 253 KIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPNKGCRRNEVSYTNLIHGFCDAK 312

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           R DEALKLFSQMHEDNCWPTVRTYT+IICALCQL RK E FN FKEMTEKGCEPNVHTYT
Sbjct: 313 RTDEALKLFSQMHEDNCWPTVRTYTVIICALCQLGRKSEAFNTFKEMTEKGCEPNVHTYT 372

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLIHSLCED NFDDAK MLNGML+KGLVPSVVTYNALIDGYCKKGMS+SALEILSLMESN
Sbjct: 373 VLIHSLCEDNNFDDAKNMLNGMLQKGLVPSVVTYNALIDGYCKKGMSLSALEILSLMESN 432

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFC+AK+VHKAM+LL+KMLERKLQPDVVTYNLLIHGQCK GHLGSA
Sbjct: 433 NCSPNARTYNELILGFCKAKNVHKAMSLLHKMLERKLQPDVVTYNLLIHGQCKDGHLGSA 492

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLLGLMNESGLVPDEWTYSVFVDTLCKRG+VEEA  LFDSLKEKG++ANEVIYSALIDG
Sbjct: 493 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGQVEEARFLFDSLKEKGIRANEVIYSALIDG 552

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKV+DGHSL DKM GDGC+PNSITYNSLIDGYCREKNFQEAL+L+EIMIKRDI PT
Sbjct: 553 YCKVGKVTDGHSLFDKMHGDGCVPNSITYNSLIDGYCREKNFQEALLLLEIMIKRDIKPT 612

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
           ADTYTILIE+LL+DGEFDRAHNMFDQMLSTGS PDVFTYTAFIHAYCSQGRLKDAELFIY
Sbjct: 613 ADTYTILIESLLKDGEFDRAHNMFDQMLSTGSRPDVFTYTAFIHAYCSQGRLKDAELFIY 672

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KMNEKGIMPDTLLYTLLIDAYG+FGSIG AFDILKRMYDVGCEPSFHTYSYLIKHLSN+K
Sbjct: 673 KMNEKGIMPDTLLYTLLIDAYGQFGSIGRAFDILKRMYDVGCEPSFHTYSYLIKHLSNSK 732

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
             KV+SS ELNDLSSGV SNDFA+LW++VDYEFAL+LFEKMVKHGC PNANTY KFITGL
Sbjct: 733 SIKVDSSLELNDLSSGVTSNDFASLWRKVDYEFALDLFEKMVKHGCEPNANTYSKFITGL 792

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKVGCLEVAHRLYDHMK KGLSPNED YN LLGCSCQLG Y KAI+WLDIMIEHG LPHL
Sbjct: 793 CKVGCLEVAHRLYDHMKAKGLSPNEDSYNSLLGCSCQLGSYGKAIKWLDIMIEHGLLPHL 852

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLLVCGLYDEGNNEKAK V YSLLQCGYN DE+AWKVLIDGLLKKGLVDKCSELFGI
Sbjct: 853 DSCKLLVCGLYDEGNNEKAKTVLYSLLQCGYNNDELAWKVLIDGLLKKGLVDKCSELFGI 912

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           MERQGC+IHPKTYSMLIEGFDGI D+D
Sbjct: 913 MERQGCQIHPKTYSMLIEGFDGIHDID 939
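
Each HSP summary line reports counts as a fraction followed by a percentage. As a rough sketch, the snippet below pulls every `label = n/m (p%)` pair out of such a line and recomputes the percentage from the two counts, which is a quick consistency check when transcribing hits. `parse_hsp_stats` is a hypothetical helper and the regular expression is an assumption about the layout seen on this page, not a general BLAST parser.

```python
import re

# Matches fragments such as "Identity = 849/927 (91.59%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: counts and percentages} for one HSP summary line."""
    stats = {}
    for label, num, den, pct in STAT.findall(line):
        stats[label] = {
            "count": int(num),
            "total": int(den),
            "reported": float(pct),
            # Recomputed from the counts, rounded like the printed value.
            "computed": round(100 * int(num) / int(den), 2),
        }
    return stats

line = "Identity = 849/927 (91.59%), Positives = 884/927 (95.36%), Query Frame = 0"
for label, s in parse_hsp_stats(line).items():
    print(label, s["count"], s["total"], s["reported"], s["computed"])
```

For the first hit the recomputed values agree with the reported 91.59% and 95.36%.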

BLAST of Sgr018481 vs. NCBI nr
Match: XP_038885361.1 (pentatricopeptide repeat-containing protein At5g65560 [Benincasa hispida])

HSP 1 Score: 1703.7 bits (4411), Expect = 0.0e+00
Identity = 824/927 (88.89%), Positives = 872/927 (94.07%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGV TAVR P MIR S A INSGQLL+++ FRLRLTF LT KFFTSTASLPQSL VEHD
Sbjct: 13  MHGVFTAVRCPIMIRNSAAIINSGQLLVVIEFRLRLTFALTPKFFTSTASLPQSLSVEHD 72

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           ISAQLFSILS PNWQK PSLKNLIPSIAPSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 73  ISAQLFSILSRPNWQKQPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFKH 132

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           N+QSY+SMLNILVPNGY  +AEKMRILMIKSTDSSENALF+LE+LRSMNRRGD+FKFKLT
Sbjct: 133 NIQSYISMLNILVPNGYHHVAEKMRILMIKSTDSSENALFLLEILRSMNRRGDNFKFKLT 192

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLG VVEAELYVS
Sbjct: 193 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGRVVEAELYVS 252

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVD AY+ FLSMPSKGCRRNEVSYTNLIHG CEAR
Sbjct: 253 KIVQAGLSLDTFTYTSLILGYCRNKNVDAAYKTFLSMPSKGCRRNEVSYTNLIHGFCEAR 312

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQL RK E FNMFKEMTEKGCEPNVHTYT
Sbjct: 313 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLGRKTEAFNMFKEMTEKGCEPNVHTYT 372

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLIH LCED NFDDAKKMLNGMLEKGL+PSVVTYNALIDGYCKKG+SMSALEILSLMESN
Sbjct: 373 VLIHRLCEDNNFDDAKKMLNGMLEKGLIPSVVTYNALIDGYCKKGLSMSALEILSLMESN 432

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFCRAK++HKAM++L+KMLERKLQPDVVTYNLLIHGQCK GHLGSA
Sbjct: 433 NCSPNARTYNELILGFCRAKNIHKAMSILHKMLERKLQPDVVTYNLLIHGQCKEGHLGSA 492

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLL LMNESGLVPDEWTYSVF+DTLCKRG+VEEAH LFDSLKEKG+KANEVIYS LIDG
Sbjct: 493 YKLLSLMNESGLVPDEWTYSVFIDTLCKRGQVEEAHSLFDSLKEKGIKANEVIYSTLIDG 552

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKVSDGHSLLDKM+  GC+PNSITYNSLIDGYC+EKNFQEAL+LVEIMIKRDI P 
Sbjct: 553 YCKVGKVSDGHSLLDKMVSAGCVPNSITYNSLIDGYCKEKNFQEALLLVEIMIKRDIMPA 612

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
           ADTYTILIENLL++GEFDRAH+MFDQMLSTGSHPDVF YTAF+HAYCSQGRLKDAE+ IY
Sbjct: 613 ADTYTILIENLLKNGEFDRAHDMFDQMLSTGSHPDVFIYTAFVHAYCSQGRLKDAEVLIY 672

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KMNEKGI+PDTLLY+LLIDAYGRFGSI  AFD LKRM DVGCEPS++TYSYLIKHLSN+K
Sbjct: 673 KMNEKGILPDTLLYSLLIDAYGRFGSIDGAFDTLKRMNDVGCEPSYYTYSYLIKHLSNSK 732

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
             +V SS EL++LSSGVASNDF+N W+RVDYEFALELF KM KHGCAPNANTYGKFITGL
Sbjct: 733 PKEVISSLELSELSSGVASNDFSNFWRRVDYEFALELFGKMFKHGCAPNANTYGKFITGL 792

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKVGCLEVAHRL+DHMKEKGLSPNEDIYN LLGCSCQLGLY K+ RWLDIMIE+G+LPHL
Sbjct: 793 CKVGCLEVAHRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYGKSTRWLDIMIENGHLPHL 852

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLL+CGLY+EGNNEKAK VFY LLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI
Sbjct: 853 DSCKLLLCGLYEEGNNEKAKTVFYRLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 912

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           ME QGC+IHPKTYSMLIEGFDGI+  D
Sbjct: 913 METQGCQIHPKTYSMLIEGFDGIEGFD 939

BLAST of Sgr018481 vs. NCBI nr
Match: XP_023545913.1 (pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1678.3 bits (4345), Expect = 0.0e+00
Identity = 818/927 (88.24%), Positives = 865/927 (93.31%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGV TAVR PTMIR S   INSGQLLI+ GFRLR TF+LT KFFTSTASLPQ+LPVEHD
Sbjct: 13  MHGVFTAVRCPTMIRNSAVIINSGQLLIVHGFRLRFTFSLTFKFFTSTASLPQNLPVEHD 72

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           ISAQLFSILS PNWQKHPSLK LIPSI+PSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 73  ISAQLFSILSRPNWQKHPSLKVLIPSISPSHISALFALNLDPQTALAFFNWIGQKHGFKH 132

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY+S++NILVPNGYL IAEKMRILMIKSTDS ENALFVLEMLRSMNRRGDDFKFKLT
Sbjct: 133 NVQSYVSIINILVPNGYLHIAEKMRILMIKSTDSLENALFVLEMLRSMNRRGDDFKFKLT 192

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLML+SRFLMIDEMKSVYLEMLDDMVTPNIYT NTMVNGYCKLG VVEAELYVS
Sbjct: 193 LRCYNMLLMLMSRFLMIDEMKSVYLEMLDDMVTPNIYTFNTMVNGYCKLGYVVEAELYVS 252

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVDGAYRIFLSMPSKGCRRNEVSYTN+I+G CEAR
Sbjct: 253 KIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPSKGCRRNEVSYTNMINGFCEAR 312

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           RIDEALKLF QMHEDNC PTVRTYTI+I A+CQL RK E F+MFKEMTEKG EPNV+T+T
Sbjct: 313 RIDEALKLFLQMHEDNCSPTVRTYTILIHAMCQLGRKTEAFSMFKEMTEKGSEPNVYTWT 372

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLIHSLCED NFDDAKKMLNGMLEKGLVPS+VTYNALIDGYCKKGMSMSALEILSLME N
Sbjct: 373 VLIHSLCEDNNFDDAKKMLNGMLEKGLVPSLVTYNALIDGYCKKGMSMSALEILSLMELN 432

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFCRAK+VHKAM+LL +MLERKLQPDVVTYNLLIHGQCK GHL SA
Sbjct: 433 NCSPNARTYNELILGFCRAKNVHKAMSLLNEMLERKLQPDVVTYNLLIHGQCKEGHLDSA 492

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLL LMNESGLVPDEWTYSVFVDTLCKR +VEEA LLFDSLK KG+KANEVIYSALIDG
Sbjct: 493 YKLLSLMNESGLVPDEWTYSVFVDTLCKREQVEEARLLFDSLKVKGIKANEVIYSALIDG 552

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKVSDGHSLLDKML DG +PNS TYNSLIDGYC+EKN+QEAL+L+EIMIKR I P 
Sbjct: 553 YCKVGKVSDGHSLLDKMLSDGWVPNSFTYNSLIDGYCKEKNYQEALLLMEIMIKRGIKPA 612

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
            DTYTI IENLL+DGEFDRAHNMFDQMLSTGSHPDVF YTAFIHAYCSQGRLKDAE+ IY
Sbjct: 613 VDTYTIFIENLLKDGEFDRAHNMFDQMLSTGSHPDVFIYTAFIHAYCSQGRLKDAEVLIY 672

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KMNEKGI+PDTLL+TLLIDAYGRFGSI  AFDILK M+DVGCEPSF+TYSYLIKHLSN K
Sbjct: 673 KMNEKGILPDTLLHTLLIDAYGRFGSIDDAFDILKHMHDVGCEPSFYTYSYLIKHLSNEK 732

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
           L +VNS+SEL+DLSSGVASNDF+N W+RVDYEFALELF KMVKHGCAPNANTY KFITGL
Sbjct: 733 LKEVNSNSELSDLSSGVASNDFSNFWRRVDYEFALELFGKMVKHGCAPNANTYSKFITGL 792

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKV CLE+A RL+DHMKEKGL PNEDIYN LLGCSC+LGLY  A+RWLDIMIE G+LPHL
Sbjct: 793 CKVECLEIAQRLFDHMKEKGLLPNEDIYNSLLGCSCRLGLYGNAVRWLDIMIEQGHLPHL 852

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLL+CGLYDEGNNEKAK VFYSLLQCGYNYDEM WKVLIDGLLKKGLVDKCSELFGI
Sbjct: 853 DSCKLLLCGLYDEGNNEKAKTVFYSLLQCGYNYDEMTWKVLIDGLLKKGLVDKCSELFGI 912

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           ME+QGC+IHPKTYSMLIEGFDGIQD+D
Sbjct: 913 MEKQGCQIHPKTYSMLIEGFDGIQDID 939

BLAST of Sgr018481 vs. NCBI nr
Match: KAG6599094.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1674.8 bits (4336), Expect = 0.0e+00
Identity = 815/927 (87.92%), Positives = 865/927 (93.31%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGV TAVR P MIR S   INSGQLLI+ GFRLR TF+LT KFFTSTASLPQ+LPVEHD
Sbjct: 1   MHGVFTAVRCPAMIRNSAVIINSGQLLIVHGFRLRFTFSLTFKFFTSTASLPQNLPVEHD 60

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           ISAQL++ILS PNWQK+PSLK LIPSI+PSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 61  ISAQLYTILSRPNWQKNPSLKVLIPSISPSHISALFALNLDPQTALAFFNWIGQKHGFKH 120

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY+S++NILVPNGYL IAEKMRILMIKSTDS ENALFVLEMLRSMNRRGDDFKFKLT
Sbjct: 121 NVQSYVSIINILVPNGYLHIAEKMRILMIKSTDSLENALFVLEMLRSMNRRGDDFKFKLT 180

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYT NTMVNGYCKLG VVEAELYVS
Sbjct: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTFNTMVNGYCKLGYVVEAELYVS 240

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVDGAYRIFLSMPSKGCRRNEVSYTN+I+G CEAR
Sbjct: 241 KIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPSKGCRRNEVSYTNMINGFCEAR 300

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           RIDEALKLFSQMHEDNC PTVRTYT++I A+CQL RK E F+MFKEMTEKG EPNV+T+T
Sbjct: 301 RIDEALKLFSQMHEDNCSPTVRTYTVLIHAMCQLGRKTEAFSMFKEMTEKGSEPNVYTWT 360

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLIHSLCED NFDDAKKMLNGMLEKGLVPS+VTYNALIDGYCKKGMS SALEILSLMESN
Sbjct: 361 VLIHSLCEDNNFDDAKKMLNGMLEKGLVPSLVTYNALIDGYCKKGMSTSALEILSLMESN 420

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFCRAK+VHKAM+LL +MLERKLQPDVVTYNLLIHGQCK GHL SA
Sbjct: 421 NCSPNARTYNELILGFCRAKNVHKAMSLLNEMLERKLQPDVVTYNLLIHGQCKEGHLDSA 480

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLL LMNESGLVPDEWTYSVFVDTLCKR +VEEA LLFDSLK KG+KANEVIYSALIDG
Sbjct: 481 YKLLSLMNESGLVPDEWTYSVFVDTLCKREQVEEARLLFDSLKVKGIKANEVIYSALIDG 540

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKVSDGHSLLDKML DG +PNS TYNSLIDGYC+EKN+QEAL+L+EIMIKR I P 
Sbjct: 541 YCKVGKVSDGHSLLDKMLSDGWVPNSFTYNSLIDGYCKEKNYQEALLLMEIMIKRGIKPA 600

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
            DTYTILIENLL+DGEFDRAHNMFDQMLSTGSHPDVF YTAFIHAYCSQGRLKDAE+ IY
Sbjct: 601 VDTYTILIENLLKDGEFDRAHNMFDQMLSTGSHPDVFIYTAFIHAYCSQGRLKDAEVLIY 660

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KM EKGI+PDTLLYTLLIDAYGRFGSI  AFDILK M+DVGCEPSF+TYSYLIKHLSN K
Sbjct: 661 KMKEKGILPDTLLYTLLIDAYGRFGSIDDAFDILKHMHDVGCEPSFYTYSYLIKHLSNEK 720

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
           L +VNS+SEL+DLSSGVASNDF+N W+RVDYEFALELF KMVKHGCAPNANTY KFITGL
Sbjct: 721 LKEVNSNSELSDLSSGVASNDFSNFWRRVDYEFALELFGKMVKHGCAPNANTYSKFITGL 780

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKV CLE+A RL+DHMKEKGL PNEDIYN LLGCSC+LGLY  A+RWLDIMIE G+LPHL
Sbjct: 781 CKVECLEIAQRLFDHMKEKGLLPNEDIYNSLLGCSCRLGLYGNAVRWLDIMIEQGHLPHL 840

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLL+CGLYDEGNNEKAK VFYSLLQCGYNYDEM WKVLIDGLLKKGLVDKCSELFGI
Sbjct: 841 DSCKLLLCGLYDEGNNEKAKTVFYSLLQCGYNYDEMTWKVLIDGLLKKGLVDKCSELFGI 900

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           ME+QGC+IHPKTYSMLIEGFDG+QD+D
Sbjct: 901 MEKQGCQIHPKTYSMLIEGFDGVQDID 927

BLAST of Sgr018481 vs. NCBI nr
Match: KAG7030032.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1667.9 bits (4318), Expect = 0.0e+00
Identity = 812/927 (87.59%), Positives = 863/927 (93.10%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGV TAVR P MIR S   INSGQLLI+ GFRLR TF+LT KFFTSTASLPQ+LPVEHD
Sbjct: 1   MHGVFTAVRCPAMIRNSAVIINSGQLLIVHGFRLRFTFSLTFKFFTSTASLPQNLPVEHD 60

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           ISAQL++ILS PNWQK+PSLK LIPSI+PSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 61  ISAQLYTILSRPNWQKNPSLKVLIPSISPSHISALFALNLDPQTALAFFNWIGQKHGFKH 120

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY+S++NILVPNGYL IAEKMRILMIKSTDS ENALFVLEMLRSMNRRGDDFKFKLT
Sbjct: 121 NVQSYVSIINILVPNGYLHIAEKMRILMIKSTDSLENALFVLEMLRSMNRRGDDFKFKLT 180

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYT NTMVNGYCKLG VVEAELYVS
Sbjct: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTFNTMVNGYCKLGYVVEAELYVS 240

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVDGAYRIFLSMPSKGCRRNEVSYTN+I+G CEAR
Sbjct: 241 KIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPSKGCRRNEVSYTNMINGFCEAR 300

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           RIDEALKLFSQMHEDNC PTVRTYT++I A+CQL RK E F+MFKEMTEKG EPNV+T+T
Sbjct: 301 RIDEALKLFSQMHEDNCSPTVRTYTVLIHAMCQLGRKTEAFSMFKEMTEKGSEPNVYTWT 360

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLIHSLCED NFDDAKKMLNGMLEKGLVPS+VTYNALIDGYCKKGMS SALEILSLMESN
Sbjct: 361 VLIHSLCEDNNFDDAKKMLNGMLEKGLVPSLVTYNALIDGYCKKGMSTSALEILSLMESN 420

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFCRAK+VHKAM+LL +ML RKLQPDVVTYNLLIHGQCK GHL SA
Sbjct: 421 NCSPNARTYNELILGFCRAKNVHKAMSLLNEMLNRKLQPDVVTYNLLIHGQCKEGHLDSA 480

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLL LMNESGLVPDEWTYSVFVD+LCKR +VEEA LLFDSLK KG+KANEVIYSALIDG
Sbjct: 481 YKLLSLMNESGLVPDEWTYSVFVDSLCKREQVEEARLLFDSLKVKGIKANEVIYSALIDG 540

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKVSDGHSLLDKML DG +PNS TYNSLIDGYC+EKN+QEAL+L+EIMIKR I P 
Sbjct: 541 YCKVGKVSDGHSLLDKMLSDGWVPNSFTYNSLIDGYCKEKNYQEALLLMEIMIKRGIKPA 600

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
            DTYTILIENLL+DGEFDRAHNMFDQMLST SHPDVF YTAFIHAYCSQGRLKDAE+ IY
Sbjct: 601 VDTYTILIENLLKDGEFDRAHNMFDQMLSTVSHPDVFIYTAFIHAYCSQGRLKDAEVLIY 660

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KM EKGI+PDTLLYTLLIDAYGRFGSI  AFDILK M+DVGCEPSF+TYSYLIKHLSN K
Sbjct: 661 KMKEKGILPDTLLYTLLIDAYGRFGSIDDAFDILKHMHDVGCEPSFYTYSYLIKHLSNEK 720

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
           L +VNS+SEL+DLSSGVASNDF+N W+RVDYEFALELF KMVKHGCAPNANTY KFITGL
Sbjct: 721 LKEVNSNSELSDLSSGVASNDFSNFWRRVDYEFALELFGKMVKHGCAPNANTYSKFITGL 780

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKV CLE+A RL+DHMKEKGL PNEDIYN LLGCSC+LGLY  A+RWLDIMIE G+LPHL
Sbjct: 781 CKVECLEIAQRLFDHMKEKGLLPNEDIYNSLLGCSCRLGLYGNAVRWLDIMIEQGHLPHL 840

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLL+CGLYDEGNNEKAK VFYSLLQCGYNYDEM WKVLIDGLLKKGLVDKCSELFGI
Sbjct: 841 DSCKLLLCGLYDEGNNEKAKTVFYSLLQCGYNYDEMTWKVLIDGLLKKGLVDKCSELFGI 900

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           ME+QGC+IHPKTYSMLIEGFDG+QD+D
Sbjct: 901 MEKQGCQIHPKTYSMLIEGFDGVQDID 927

BLAST of Sgr018481 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 984.2 bits (2543), Expect = 1.0e-285
Identity = 477/886 (53.84%), Positives = 637/886 (71.90%), Query Frame = 0

Query: 39  LTLKFFTSTASLPQSLPVEH----DISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTL 98
           +T + F S + L ++LP E      +  +L SILS PNW K PSLK+++ +I+PSH+S+L
Sbjct: 37  VTRRQFCSVSPLLRNLPEEESDSMSVPHRLLSILSKPNWHKSPSLKSMVSAISPSHVSSL 96

Query: 99  FTHNLDPQTALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSS 158
           F+ +LDP+TAL F +WI Q   +KH+V SY S+L +L+ NGY+ +  K+R+LMIKS DS 
Sbjct: 97  FSLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKIRLLMIKSCDSV 156

Query: 159 ENALFVLEMLRSMNR-RGDDFKFKLTLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTP 218
            +AL+VL++ R MN+    + K+KL + CYN LL  L+RF ++DEMK VY+EML+D V P
Sbjct: 157 GDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEMLEDKVCP 216

Query: 219 NIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIF 278
           NIYT N MVNGYCKLGNV EA  YVSKIV+AGL  D FTYTSLI+GYC+ +++D A+++F
Sbjct: 217 NIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVF 276

Query: 279 LSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQL 338
             MP KGCRRNEV+YT+LIHGLC ARRIDEA+ LF +M +D C+PTVRTYT++I +LC  
Sbjct: 277 NEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGS 336

Query: 339 SRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTY 398
            RK E  N+ KEM E G +PN+HTYTVLI SLC    F+ A+++L  MLEKGL+P+V+TY
Sbjct: 337 ERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITY 396

Query: 399 NALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDVHKAMALLYKMLE 458
           NALI+GYCK+GM   A++++ LMES   SPN RTYNELI G+C++ +VHKAM +L KMLE
Sbjct: 397 NALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKS-NVHKAMGVLNKMLE 456

Query: 459 RKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEE 518
           RK+ PDVVTYN LI GQC+ G+  SAY+LL LMN+ GLVPD+WTY+  +D+LCK  RVEE
Sbjct: 457 RKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEE 516

Query: 519 AHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLID 578
           A  LFDSL++KGV  N V+Y+ALIDGYCK GKV + H +L+KML   C+PNS+T+N+LI 
Sbjct: 517 ACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIH 576

Query: 579 GYCREKNFQEALVLVEIMIKRDINPTADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHP 638
           G C +   +EA +L E M+K  + PT  T TILI  LL+DG+FD A++ F QMLS+G+ P
Sbjct: 577 GLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKP 636

Query: 639 DVFTYTAFIHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDIL 698
           D  TYT FI  YC +GRL DAE  + KM E G+ PD   Y+ LI  YG  G    AFD+L
Sbjct: 637 DAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVL 696

Query: 699 KRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFA 758
           KRM D GCEPS HT+  LIKHL   K  K   S             +   +   ++++  
Sbjct: 697 KRMRDTGCEPSQHTFLSLIKHLLEMKYGKQKGSEP-----------ELCAMSNMMEFDTV 756

Query: 759 LELFEKMVKHGCAPNANTYGKFITGLCKVGCLEVAHRLYDHM-KEKGLSPNEDIYNCLLG 818
           +EL EKMV+H   PNA +Y K I G+C+VG L VA +++DHM + +G+SP+E ++N LL 
Sbjct: 757 VELLEKMVEHSVTPNAKSYEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLS 816

Query: 819 CSCQLGLYDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNY 878
           C C+L  +++A + +D MI  G+LP L+SCK+L+CGLY +G  E+   VF +LLQCGY  
Sbjct: 817 CCCKLKKHNEAAKVVDDMICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQCGYYE 876

Query: 879 DEMAWKVLIDGLLKKGLVDKCSELFGIMERQGCRIHPKTYSMLIEG 919
           DE+AWK++IDG+ K+GLV+   ELF +ME+ GC+   +TYS+LIEG
Sbjct: 877 DELAWKIIIDGVGKQGLVEAFYELFNVMEKNGCKFSSQTYSLLIEG 910

BLAST of Sgr018481 vs. ExPASy Swiss-Prot
Match: Q9SFV9 (Pentatricopeptide repeat-containing protein At3g07290, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g07290 PE=2 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 4.5e-132
Identity = 271/870 (31.15%), Positives = 461/870 (52.99%), Query Frame = 0

Query: 44  FTSTASLPQSLPVEHDISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFT-HNLDPQ 103
           F S +S P     +   +  + S+L  PNW+K+ SLK+L+  + P+  S + +    D  
Sbjct: 25  FFSVSSRPSLSSSDEVAAHDVASLLKTPNWEKNSSLKSLVSHMNPNVASQVISLQRSDND 84

Query: 104 TALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLE 163
             + FF W+ +   +  +      +L ++V +G  R+A  + + +IK     E  +  L+
Sbjct: 85  ICVRFFMWVCKHSSYCFDPTQKNQLLKLIVSSGLYRVAHAVIVALIKECSRCEKEM--LK 144

Query: 164 MLRSMNRRGDDFKFKLTLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMV 223
           ++   +   + F F+L   CY+ LLM L++  +       Y  M  D     +    T+V
Sbjct: 145 LMYCFDELREVFGFRLNYPCYSSLLMSLAKLDLGFLAYVTYRRMEADGFVVGMIDYRTIV 204

Query: 224 NGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSK-GC 283
           N  CK G    AE+++SKI++ G  LD+   TSL+LG+CR  N+  A ++F  M  +  C
Sbjct: 205 NALCKNGYTEAAEMFMSKILKIGFVLDSHIGTSLLLGFCRGLNLRDALKVFDVMSKEVTC 264

Query: 284 RRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFN 343
             N VSY+ LIHGLCE  R++EA  L  QM E  C P+ RTYT++I ALC      + FN
Sbjct: 265 APNSVSYSILIHGLCEVGRLEEAFGLKDQMGEKGCQPSTRTYTVLIKALCDRGLIDKAFN 324

Query: 344 MFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYC 403
           +F EM  +GC+PNVHTYTVLI  LC D   ++A  +   M++  + PSV+TYNALI+GYC
Sbjct: 325 LFDEMIPRGCKPNVHTYTVLIDGLCRDGKIEEANGVCRKMVKDRIFPSVITYNALINGYC 384

Query: 404 KKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVV 463
           K G  + A E+L++ME   C PN RT+NEL+ G CR    +KA+ LL +ML+  L PD+V
Sbjct: 385 KDGRVVPAFELLTVMEKRACKPNVRTFNELMEGLCRVGKPYKAVHLLKRMLDNGLSPDIV 444

Query: 464 TYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSL 523
           +YN+LI G C+ GH+ +AYKLL  MN   + PD  T++  ++  CK+G+ + A      +
Sbjct: 445 SYNVLIDGLCREGHMNTAYKLLSSMNCFDIEPDCLTFTAIINAFCKQGKADVASAFLGLM 504

Query: 524 KEKGVKANEVIYSALIDGYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNF 583
             KG+  +EV  + LIDG CKVGK  D   +L+ ++    +    + N ++D   +    
Sbjct: 505 LRKGISLDEVTGTTLIDGVCKVGKTRDALFILETLVKMRILTTPHSLNVILDMLSKGCKV 564

Query: 584 QEALVLVEIMIKRDINPTADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAF 643
           +E L ++  + K  + P+  TYT L++ L+  G+   +  + + M  +G  P+V+ YT  
Sbjct: 565 KEELAMLGKINKLGLVPSVVTYTTLVDGLIRSGDITGSFRILELMKLSGCLPNVYPYTII 624

Query: 644 IHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGC 703
           I+  C  GR+++AE  +  M + G+ P+ + YT+++  Y   G +  A + ++ M + G 
Sbjct: 625 INGLCQFGRVEEAEKLLSAMQDSGVSPNHVTYTVMVKGYVNNGKLDRALETVRAMVERGY 684

Query: 704 EPSFHTYSYLIK-HLSNAKLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKM 763
           E +   YS L++  + + K    +  S ++D++            +  D E   EL   +
Sbjct: 685 ELNDRIYSSLLQGFVLSQKGIDNSEESTVSDIA-----------LRETDPECINELISVV 744

Query: 764 VK-HGCAPNANTYGKFITGLCKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGL 823
            +  GC      +   +T LCK G  + ++ L  ++ E+G+   E   + ++   C    
Sbjct: 745 EQLGGCISGLCIF--LVTRLCKEGRTDESNDLVQNVLERGVF-LEKAMDIIMESYCSKKK 804

Query: 824 YDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKV 883
           + K +  + ++++ G++P   S  L++ GL  EG+ E+A+ +   LL      ++     
Sbjct: 805 HTKCMELITLVLKSGFVPSFKSFCLVIQGLKKEGDAERARELVMELLTSNGVVEKSGVLT 864

Query: 884 LIDGLLKKGLVDKCSELFGIMERQGCRIHP 910
            ++ L++      CSE+  ++++  CR  P
Sbjct: 865 YVECLMEGDETGDCSEVIDLVDQLHCRERP 878

BLAST of Sgr018481 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 320.5 bits (820), Expect = 6.4e-86
Identity = 193/651 (29.65%), Positives = 313/651 (48.08%), Query Frame = 0

Query: 256 LILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEARR-IDEALKLFSQMHED 315
           ++  Y R   +D A  I     + G     +SY  ++     ++R I  A  +F +M E 
Sbjct: 140 VVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLES 199

Query: 316 NCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDA 375
              P V TY I+I   C          +F +M  KGC PNV TY  LI   C+ +  DD 
Sbjct: 200 QVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDG 259

Query: 376 KKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILG 435
            K+L  M  KGL P++++YN +I+G C++G       +L+ M     S +  TYN LI G
Sbjct: 260 FKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKG 319

Query: 436 FCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPD 495
           +C+  + H+A+ +  +ML   L P V+TY  LIH  CK G++  A + L  M   GL P+
Sbjct: 320 YCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPN 379

Query: 496 EWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSDGHSLLD 555
           E TY+  VD   ++G + EA+ +   + + G   + V Y+ALI+G+C  GK+ D  ++L+
Sbjct: 380 ERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLE 439

Query: 556 KMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPTADTYTILIENLLEDG 615
            M   G  P+ ++Y++++ G+CR  +  EAL +   M+++ I P   TY+ LI+   E  
Sbjct: 440 DMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQR 499

Query: 616 EFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYT 675
               A +++++ML  G  PD FTYTA I+AYC +G L+ A     +M EKG++PD + Y+
Sbjct: 500 RTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYS 559

Query: 676 LLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSSELNDLSS 735
           +LI+   +      A  +L +++     PS  TY  LI++ SN +   V S         
Sbjct: 560 VLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVS--------- 619

Query: 736 GVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGLCKVGCLEVAHRLYDH 795
                                                    I G C  G +  A ++++ 
Sbjct: 620 ----------------------------------------LIKGFCMKGMMTEADQVFES 679

Query: 796 MKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGN 855
           M  K   P+   YN ++   C+ G   KA      M++ G+L H  +   LV  L+ EG 
Sbjct: 680 MLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIALVKALHKEGK 739

Query: 856 -NEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGIMERQG 905
            NE   ++ + L  C  +  E A KVL++   ++G +D   ++   M + G
Sbjct: 740 VNELNSVIVHVLRSCELSEAEQA-KVLVEINHREGNMDVVLDVLAEMAKDG 740

BLAST of Sgr018481 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 6.6e-83
Identity = 239/844 (28.32%), Positives = 380/844 (45.02%), Query Frame = 0

Query: 100 DPQTALAFFNWIGQKHGFK------HNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDS 159
           D  T L  F  +  K G K        ++ +  +LN    NG +        L++KS   
Sbjct: 152 DTNTYLTIFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYNGLIH-------LLLKSRFC 211

Query: 160 SENALFVLEMLRSMNRRGDDFKFKLTLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTP 219
           +E     +E+ R M   G    F+ +L+ Y+ L++ L +   ID +  +  EM    + P
Sbjct: 212 TE----AMEVYRRMILEG----FRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETLGLKP 271

Query: 220 NIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIF 279
           N+YT    +    + G + EA   + ++   G   D  TYT LI   C  R +D A  +F
Sbjct: 272 NVYTFTICIRVLGRAGKINEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCAKEVF 331

Query: 280 LSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQL 339
             M +   + + V+Y  L+    + R +D   + +S+M +D   P V T+TI++ ALC+ 
Sbjct: 332 EKMKTGRHKPDRVTYITLLDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKA 391

Query: 340 SRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTY 399
               E F+    M ++G  PN+HTY  LI  L      DDA ++   M   G+ P+  TY
Sbjct: 392 GNFGEAFDTLDVMRDQGILPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTY 451

Query: 400 NALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDVHKAMALLYKMLE 459
              ID Y K G S+SALE    M++   +PN    N  +    +A    +A  + Y + +
Sbjct: 452 IVFIDYYGKSGDSVSALETFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKD 511

Query: 460 RKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEE 519
             L PD VTYN+++    K G +  A KLL  M E+G  PD    +  ++TL K  RV+E
Sbjct: 512 IGLVPDSVTYNMMMKCYSKVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDE 571

Query: 520 AHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLID 579
           A  +F  +KE  +K   V Y+ L+ G  K GK+ +   L + M+  GC PN+IT+N+L D
Sbjct: 572 AWKMFMRMKEMKLKPTVVTYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFD 631

Query: 580 GYCREKNFQEALVLVEIMIKRDINPTADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHP 639
             C  KN +  L L                                  M  +M+  G  P
Sbjct: 632 CLC--KNDEVTLAL---------------------------------KMLFKMMDMGCVP 691

Query: 640 DVFTYTAFIHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDIL 699
           DVFTY   I      G++K+A  F ++M +K + PD +    L+    +   I  A+ I+
Sbjct: 692 DVFTYNTIIFGLVKNGQVKEAMCFFHQM-KKLVYPDFVTLCTLLPGVVKASLIEDAYKII 751

Query: 700 KR-MYDVGCEPSFHTYSYLI-KHLSNAKLTKVNSSSELNDLSSGVASNDFANLWKRVDYE 759
              +Y+   +P+   +  LI   L+ A +    S SE   +++G+  +  + L   + Y 
Sbjct: 752 TNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSFSE-RLVANGICRDGDSILVPIIRYS 811

Query: 760 F-------ALELFEKMVKH-GCAPNANTYGKFITGLCKVGCLEVAHRLYDHMKEKGLSPN 819
                   A  LFEK  K  G  P   TY   I GL +   +E+A  ++  +K  G  P+
Sbjct: 812 CKHNNVSGARTLFEKFTKDLGVQPKLPTYNLLIGGLLEADMIEIAQDVFLQVKSTGCIPD 871

Query: 820 EDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGNNEKAKIVFY 879
              YN LL    + G  D+       M  H    +  +  +++ GL   GN + A  ++Y
Sbjct: 872 VATYNFLLDAYGKSGKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDDALDLYY 931

Query: 880 SLL-QCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGIMERQGCRIHPKTYSMLIEGFDGI 927
            L+    ++     +  LIDGL K G + +  +LF  M   GCR +   Y++LI GF   
Sbjct: 932 DLMSDRDFSPTACTYGPLIDGLSKSGRLYEAKQLFEGMLDYGCRPNCAIYNILINGFGKA 943

BLAST of Sgr018481 vs. ExPASy Swiss-Prot
Match: Q9CA58 (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana OX=3702 GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 304.7 bits (779), Expect = 3.6e-81
Identity = 199/735 (27.07%), Positives = 339/735 (46.12%), Query Frame = 0

Query: 84  PSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKM 143
           P + P H++ +     DP  AL  FN + ++ GFKH + +Y S++  L   GY    E M
Sbjct: 3   PPLLPKHVTAVIKCQKDPMKALEMFNSMRKEVGFKHTLSTYRSVIEKL---GYYGKFEAM 62

Query: 144 RILMIKSTDSSENALF---VLEMLRSMNRRG------------DDFKFKLTLRCYNMLLM 203
             +++   ++  N +     +  +++  R+G            D +  + T+  YN ++ 
Sbjct: 63  EEVLVDMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMS 122

Query: 204 LLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSL 263
           +L      D+   VY+ M D  +TP++Y+    +  +CK      A   ++ +   G  +
Sbjct: 123 VLVDSGYFDQAHKVYMRMRDRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEM 182

Query: 264 DTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLF 323
           +   Y +++ G+         Y +F  M + G      ++  L+  LC+   + E  KL 
Sbjct: 183 NVVAYCTVVGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLL 242

Query: 324 SQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCED 383
            ++ +    P + TY + I  LCQ         M   + E+G +P+V TY  LI+ LC++
Sbjct: 243 DKVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKN 302

Query: 384 QNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESNNCSPNARTY 443
             F +A+  L  M+ +GL P   TYN LI GYCK GM   A  I+     N   P+  TY
Sbjct: 303 SKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQFTY 362

Query: 444 NELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNE 503
             LI G C   + ++A+AL  + L + ++P+V+ YN LI G    G +  A +L   M+E
Sbjct: 363 RSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQLANEMSE 422

Query: 504 SGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSD 563
            GL+P+  T+++ V+ LCK G V +A  L   +  KG   +   ++ LI GY    K+ +
Sbjct: 423 KGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKMEN 482

Query: 564 GHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPTADTYTILIE 623
              +LD ML +G  P+  TYNSL++G C+   F++ +   + M+++   P   T+ IL+E
Sbjct: 483 ALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLE 542

Query: 624 NLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIYKMNEK-GIM 683
           +L    + D A  + ++M +   +PD  T+   I  +C  G L  A     KM E   + 
Sbjct: 543 SLCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVS 602

Query: 684 PDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSS 743
             T  Y ++I A+    ++  A  + + M D    P  +TY  ++               
Sbjct: 603 SSTPTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLGPDGYTYRLMV--------------- 662

Query: 744 ELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGLCKVGCLEV 803
                  G       NL     Y+F LE    M+++G  P+  T G+ I  LC    +  
Sbjct: 663 ------DGFCKTGNVNL----GYKFLLE----MMENGFIPSLTTLGRVINCLCVEDRVYE 705

BLAST of Sgr018481 vs. ExPASy TrEMBL
Match: A0A6J1DI13 (pentatricopeptide repeat-containing protein At5g65560 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020685 PE=4 SV=1)

HSP 1 Score: 1751.5 bits (4535), Expect = 0.0e+00
Identity = 849/927 (91.59%), Positives = 884/927 (95.36%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGVLTAVR  TMIRY TA INSGQL I+LGFRLRLTFTL LKFFTSTASLPQSLPVEHD
Sbjct: 13  MHGVLTAVRCRTMIRYPTAIINSGQLFIVLGFRLRLTFTLNLKFFTSTASLPQSLPVEHD 72

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           ISAQLFSILS PNWQKHPSLKNLIPSIAPSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 73  ISAQLFSILSRPNWQKHPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFKH 132

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY SMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT
Sbjct: 133 NVQSYTSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 192

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFL++DEM+SVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS
Sbjct: 193 LRCYNMLLMLLSRFLLVDEMRSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 252

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVDGAYRIFLSMP+KGCRRNEVSYTNLIHG C+A+
Sbjct: 253 KIVQAGLSLDTFTYTSLILGYCRNKNVDGAYRIFLSMPNKGCRRNEVSYTNLIHGFCDAK 312

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           R DEALKLFSQMHEDNCWPTVRTYT+IICALCQL RK E FN FKEMTEKGCEPNVHTYT
Sbjct: 313 RTDEALKLFSQMHEDNCWPTVRTYTVIICALCQLGRKSEAFNTFKEMTEKGCEPNVHTYT 372

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLIHSLCED NFDDAK MLNGML+KGLVPSVVTYNALIDGYCKKGMS+SALEILSLMESN
Sbjct: 373 VLIHSLCEDNNFDDAKNMLNGMLQKGLVPSVVTYNALIDGYCKKGMSLSALEILSLMESN 432

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFC+AK+VHKAM+LL+KMLERKLQPDVVTYNLLIHGQCK GHLGSA
Sbjct: 433 NCSPNARTYNELILGFCKAKNVHKAMSLLHKMLERKLQPDVVTYNLLIHGQCKDGHLGSA 492

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLLGLMNESGLVPDEWTYSVFVDTLCKRG+VEEA  LFDSLKEKG++ANEVIYSALIDG
Sbjct: 493 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGQVEEARFLFDSLKEKGIRANEVIYSALIDG 552

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKV+DGHSL DKM GDGC+PNSITYNSLIDGYCREKNFQEAL+L+EIMIKRDI PT
Sbjct: 553 YCKVGKVTDGHSLFDKMHGDGCVPNSITYNSLIDGYCREKNFQEALLLLEIMIKRDIKPT 612

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
           ADTYTILIE+LL+DGEFDRAHNMFDQMLSTGS PDVFTYTAFIHAYCSQGRLKDAELFIY
Sbjct: 613 ADTYTILIESLLKDGEFDRAHNMFDQMLSTGSRPDVFTYTAFIHAYCSQGRLKDAELFIY 672

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KMNEKGIMPDTLLYTLLIDAYG+FGSIG AFDILKRMYDVGCEPSFHTYSYLIKHLSN+K
Sbjct: 673 KMNEKGIMPDTLLYTLLIDAYGQFGSIGRAFDILKRMYDVGCEPSFHTYSYLIKHLSNSK 732

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
             KV+SS ELNDLSSGV SNDFA+LW++VDYEFAL+LFEKMVKHGC PNANTY KFITGL
Sbjct: 733 SIKVDSSLELNDLSSGVTSNDFASLWRKVDYEFALDLFEKMVKHGCEPNANTYSKFITGL 792

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKVGCLEVAHRLYDHMK KGLSPNED YN LLGCSCQLG Y KAI+WLDIMIEHG LPHL
Sbjct: 793 CKVGCLEVAHRLYDHMKAKGLSPNEDSYNSLLGCSCQLGSYGKAIKWLDIMIEHGLLPHL 852

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLLVCGLYDEGNNEKAK V YSLLQCGYN DE+AWKVLIDGLLKKGLVDKCSELFGI
Sbjct: 853 DSCKLLVCGLYDEGNNEKAKTVLYSLLQCGYNNDELAWKVLIDGLLKKGLVDKCSELFGI 912

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           MERQGC+IHPKTYSMLIEGFDGI D+D
Sbjct: 913 MERQGCQIHPKTYSMLIEGFDGIHDID 939

BLAST of Sgr018481 vs. ExPASy TrEMBL
Match: A0A6J1GH11 (pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita moschata OX=3662 GN=LOC111454140 PE=4 SV=1)

HSP 1 Score: 1661.7 bits (4302), Expect = 0.0e+00
Identity = 804/928 (86.64%), Positives = 869/928 (93.64%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFT-STASLPQSLPVEH 60
           M+GV TA+R PTMIR S+A INSGQLLI+LGFRLR TFTL  KFFT STASLPQSLPVEH
Sbjct: 13  MYGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLAFKFFTSSTASLPQSLPVEH 72

Query: 61  DISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFK 120
           D+ AQLFSILS P+WQKHPSLK LIPSIAPSH+S+LF  NLDP+TALAFFNWI QKHGFK
Sbjct: 73  DVPAQLFSILSRPDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKL 180
           HNVQSY+SMLNILVPNGYLRIAEK+RILMIKST+S+ENALFVLEMLRSMNRRGDD +FKL
Sbjct: 133 HNVQSYVSMLNILVPNGYLRIAEKLRILMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYV 240
           TL+ YNMLLMLLSRFLMIDEMK+VYLEMLDDMV+PN+YTLNT+VNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTLVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEA 300
           SKIVQAGLSLDTFTYTSLILGYCRN+NVDGA +IFLSMPSKGCRRNEVSYTNLIHG CEA
Sbjct: 253 SKIVQAGLSLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 RRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTY 360
           RRIDEALKL SQMHEDNCWPTVRTYT+IICALCQ+ RK E F++FKEMTEKGCEPNVHTY
Sbjct: 313 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMES 420
           TVLI SLCED  FDDAKK+L+GMLEKGLVPSVVTYNA IDGYCKKGMS SALEILSLMES
Sbjct: 373 TVLIRSLCEDSKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMES 432

Query: 421 NNCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGS 480
           NNC+PN RTYNELILGFCRAK+VHKAM LL+KMLE KLQPDVVTYNLLIHGQCK G LGS
Sbjct: 433 NNCNPNTRTYNELILGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGQLGS 492

Query: 481 AYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALID 540
           AYKLL LMNE+GLVPDEWTYSVF+  LCKRGRVE+A  LFDSLKEKGVKANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEDARFLFDSLKEKGVKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINP 600
           GYCKVGKVSDGHSLLDKML DGC+PNSITYNSLIDG+C+EKNFQEAL+LVEIMIKRDI  
Sbjct: 553 GYCKVGKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKL 612

Query: 601 TADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFI 660
           TADTYTILI+NLL+DGEFDRAH MFDQMLS GSHPDV  YT FIHAYCS GRL+DAELF+
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLRDAELFL 672

Query: 661 YKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNA 720
           +KMN+KGI+PDTLLY+LLIDAYG  GSIG AFDILKRM+DVGCEPSF+TYSYLIKHL +A
Sbjct: 673 HKMNDKGILPDTLLYSLLIDAYGWSGSIGIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITG 780
           KL +VNSS+EL DLSSGV SNDFANLW+RVD+EFALELFE+MVK GCAPNANTY KFI+G
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDFEFALELFEEMVKQGCAPNANTYSKFISG 792

Query: 781 LCKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPH 840
           LCKVGCLEV  RL+DHMKEKGLSPNEDIYN LLGCSCQLGLY+KAIRWLDIM+EHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYEKAIRWLDIMVEHGYLPH 852

Query: 841 LDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFG 900
           LDSCKLL+CGL+DEGNNEKAK VF+SLLQCGYNYDE+AWK+LIDGLL+KGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGNNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 IMERQGCRIHPKTYSMLIEGFDGIQDMD 927
           IMERQGC+IHPKTYSMLIEGFDGIQD+D
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIQDID 940

BLAST of Sgr018481 vs. ExPASy TrEMBL
Match: A0A6J1KKQ2 (pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita maxima OX=3661 GN=LOC111496590 PE=3 SV=1)

HSP 1 Score: 1649.8 bits (4271), Expect = 0.0e+00
Identity = 800/928 (86.21%), Positives = 863/928 (93.00%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTA-INSGQLLIILGFRLRLTFTLTLKFFTS-TASLPQSLPVEH 60
           +HGV TA+R PTMIR S+A INSGQLLI+LGFRLR TFTL LKFFTS TASLPQSLPVEH
Sbjct: 13  VHGVFTAIRCPTMIRNSSAIINSGQLLIVLGFRLRFTFTLALKFFTSTTASLPQSLPVEH 72

Query: 61  DISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFK 120
           D+ AQLFSILS  +WQKHPSLK LIPSIAPSH+S+LF  NLDP+TALAFFNWI QKHGFK
Sbjct: 73  DVPAQLFSILSRLDWQKHPSLKILIPSIAPSHVSSLFALNLDPKTALAFFNWIEQKHGFK 132

Query: 121 HNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKL 180
           HNVQSY+S+LNILVPNGYLRIAEK+RI MIKST+S+ENALFVLEMLRSMNRRGDD +FKL
Sbjct: 133 HNVQSYVSILNILVPNGYLRIAEKLRISMIKSTNSAENALFVLEMLRSMNRRGDDLRFKL 192

Query: 181 TLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYV 240
           TL+ YNMLLMLLSRFLMIDEMK+VYLEMLDDMV+PN+YTLNTMVNGYCKLGNVVEAELYV
Sbjct: 193 TLKSYNMLLMLLSRFLMIDEMKNVYLEMLDDMVSPNMYTLNTMVNGYCKLGNVVEAELYV 252

Query: 241 SKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEA 300
           SKIVQ GL LDTFTYTSLILGYCRN+NVDGA +IFLSMPSKGCRRNEVSYTNLIHG CEA
Sbjct: 253 SKIVQTGLCLDTFTYTSLILGYCRNKNVDGANKIFLSMPSKGCRRNEVSYTNLIHGFCEA 312

Query: 301 RRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTY 360
           RRIDEALKL SQMHEDNCWPTVRTYT+IICALCQ+ RK E F++FKEMTEKGCEPNVHTY
Sbjct: 313 RRIDEALKLLSQMHEDNCWPTVRTYTVIICALCQMGRKSEAFDVFKEMTEKGCEPNVHTY 372

Query: 361 TVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMES 420
           TVLIHSLCED  FDDAKK+L+GMLEKGLVPSVVTYNA IDGYCKKGMS SALEILSLME 
Sbjct: 373 TVLIHSLCEDNKFDDAKKLLDGMLEKGLVPSVVTYNAFIDGYCKKGMSTSALEILSLMEL 432

Query: 421 NNCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGS 480
           NNCSPN RTYNELI+GFCRAK+VHKAM LL+KMLE KLQPDVVTYNLLIHGQCK GHLGS
Sbjct: 433 NNCSPNTRTYNELIMGFCRAKNVHKAMLLLHKMLELKLQPDVVTYNLLIHGQCKEGHLGS 492

Query: 481 AYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALID 540
           AYKLL LMNE+GLVPDEWTYSVF+  LCKRGRVEEA  LFDSLKEKG+KANEVIYSALID
Sbjct: 493 AYKLLSLMNENGLVPDEWTYSVFIVVLCKRGRVEEARFLFDSLKEKGIKANEVIYSALID 552

Query: 541 GYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINP 600
           GYCKV KVSDGHSLLDKML DGC+PNSITYNSLIDG+C+EKNFQEAL+LVEIMIKRDI P
Sbjct: 553 GYCKVEKVSDGHSLLDKMLSDGCVPNSITYNSLIDGHCKEKNFQEALLLVEIMIKRDIKP 612

Query: 601 TADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFI 660
           TADTYTILI+NLL+DGEFDRAH MFDQMLS GSHPDV  YT FIHAYCS GRL+DAELF+
Sbjct: 613 TADTYTILIKNLLKDGEFDRAHQMFDQMLSAGSHPDVVIYTVFIHAYCSLGRLQDAELFL 672

Query: 661 YKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNA 720
           +KMNEKGI+PD LLY+LLIDAYG  GSI  AFDILKRM+DVGCEPSF+TYSYLIKHL +A
Sbjct: 673 HKMNEKGILPDALLYSLLIDAYGWSGSIEIAFDILKRMHDVGCEPSFYTYSYLIKHLLSA 732

Query: 721 KLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITG 780
           KL +VNSS+EL DLSSGV SNDFANLW+RVDYEFALELFE MVK GCAPNANTYGKFI+G
Sbjct: 733 KLIEVNSSTELGDLSSGVVSNDFANLWRRVDYEFALELFEGMVKQGCAPNANTYGKFISG 792

Query: 781 LCKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPH 840
           LCKVGCLEV  RL+DHMKEKGLSPNEDIYN LL CSCQLGLY+KAIRWLD M+EHGYLPH
Sbjct: 793 LCKVGCLEVGRRLFDHMKEKGLSPNEDIYNSLLCCSCQLGLYEKAIRWLDGMVEHGYLPH 852

Query: 841 LDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFG 900
           LDSCKLL+CGL+DEG+NEKAK VF+SLLQCGYNYDE+AWK+LIDGLL+KGLVDKCSELFG
Sbjct: 853 LDSCKLLLCGLFDEGSNEKAKTVFHSLLQCGYNYDEIAWKLLIDGLLQKGLVDKCSELFG 912

Query: 901 IMERQGCRIHPKTYSMLIEGFDGIQDMD 927
           IMERQGC+IHPKTYSMLIEGFDGIQD+D
Sbjct: 913 IMERQGCQIHPKTYSMLIEGFDGIQDID 940

BLAST of Sgr018481 vs. ExPASy TrEMBL
Match: A0A5A7T899 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold119G00120 PE=4 SV=1)

HSP 1 Score: 1635.5 bits (4234), Expect = 0.0e+00
Identity = 796/927 (85.87%), Positives = 854/927 (92.13%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTAI-NSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGV T VR PTMIR STAI  SGQLL++LGFRLRLTF LT +FFTSTAS PQSL VEHD
Sbjct: 1   MHGVFTPVRCPTMIRNSTAIFKSGQLLVVLGFRLRLTFPLTHRFFTSTASFPQSLSVEHD 60

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           I AQLF+ILS PNWQKHPSLKNLIPSI+PSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 61  IPAQLFTILSRPNWQKHPSLKNLIPSISPSHISALFALNLDPQTALAFFNWIGQKHGFKH 120

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY+SMLNILVPNGYLRIAE MRILMIKSTDSSENA+FVLEMLRSMNRR D FKFKL+
Sbjct: 121 NVQSYVSMLNILVPNGYLRIAENMRILMIKSTDSSENAVFVLEMLRSMNRRVDAFKFKLS 180

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNI+TLNTMVNGYCKLGNVVEAELYVS
Sbjct: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIFTLNTMVNGYCKLGNVVEAELYVS 240

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVD A  IFLSMP+KGCRRNEVSYTNLIHG CEAR
Sbjct: 241 KIVQAGLSLDTFTYTSLILGYCRNKNVDAANAIFLSMPNKGCRRNEVSYTNLIHGFCEAR 300

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           R+ EALKLFSQMHEDNCWPTVRTYT++I ALCQL RK E  NMFKEMTEK C+PNVHTYT
Sbjct: 301 RVGEALKLFSQMHEDNCWPTVRTYTVLIFALCQLGRKTEALNMFKEMTEKRCQPNVHTYT 360

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLI SLCED NFDDAKK+LNGMLEKGL+PSVVTYNALIDGYCKKG+S SALEILSLMESN
Sbjct: 361 VLICSLCEDGNFDDAKKILNGMLEKGLIPSVVTYNALIDGYCKKGLSASALEILSLMESN 420

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFCRAK++HKAM+LL+KMLERKLQP+VVTYN+LIHGQCK G LGSA
Sbjct: 421 NCSPNARTYNELILGFCRAKNIHKAMSLLHKMLERKLQPNVVTYNILIHGQCKEGDLGSA 480

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLL LMNESGLVPDEWTY VF+DTLCKRG VEEA  LF+SLKEKG+KANEV+YS LIDG
Sbjct: 481 YKLLSLMNESGLVPDEWTYGVFIDTLCKRGLVEEARSLFESLKEKGIKANEVMYSTLIDG 540

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKVSDG  LLDKML  GC+PNSITYNSLIDGYC+EKNF+EA +LVE+MIKRDI P 
Sbjct: 541 YCKVGKVSDGRFLLDKMLSAGCVPNSITYNSLIDGYCKEKNFKEARLLVEVMIKRDIQPA 600

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
           ADTYTILI+NLL+DGE D AH++FDQMLSTGSHPDVF YTAFIHAYCSQGRLKDAE+ I 
Sbjct: 601 ADTYTILIDNLLKDGEIDHAHDVFDQMLSTGSHPDVFIYTAFIHAYCSQGRLKDAEVLIC 660

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KMN KGIMPDT+LYTL IDAYGRFGSI  AF ILKRM+DVGCEPS+HTYSYLIKHLSNAK
Sbjct: 661 KMNAKGIMPDTILYTLFIDAYGRFGSIDGAFGILKRMHDVGCEPSYHTYSYLIKHLSNAK 720

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
             +V+SSSEL+DLSSGVASNDF+N W+RVDYEF LELF KMV+HGCAPNANTYGKFITGL
Sbjct: 721 PKEVSSSSELSDLSSGVASNDFSNCWRRVDYEFTLELFGKMVEHGCAPNANTYGKFITGL 780

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKVG LEVA RL+DHMKEKGLSPNEDIYN LLGCSCQLGLY +AIRWLDI+IE+G+LP L
Sbjct: 781 CKVGYLEVADRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYGEAIRWLDILIENGHLPRL 840

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLL+CGLYDEGN+EKAK VF SLLQCGYN DEMAWKVLIDGLLKKGL DKCS+LFGI
Sbjct: 841 DSCKLLLCGLYDEGNDEKAKRVFCSLLQCGYNCDEMAWKVLIDGLLKKGLSDKCSDLFGI 900

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           ME QGC IHPKTYSMLIEGFDG+Q++D
Sbjct: 901 METQGCHIHPKTYSMLIEGFDGVQEID 927

BLAST of Sgr018481 vs. ExPASy TrEMBL
Match: A0A1S4E4V7 (pentatricopeptide repeat-containing protein At5g65560 OS=Cucumis melo OX=3656 GN=LOC107990278 PE=4 SV=1)

HSP 1 Score: 1635.5 bits (4234), Expect = 0.0e+00
Identity = 797/927 (85.98%), Positives = 854/927 (92.13%), Query Frame = 0

Query: 1   MHGVLTAVRYPTMIRYSTAI-NSGQLLIILGFRLRLTFTLTLKFFTSTASLPQSLPVEHD 60
           MHGV T VR PTMIR STAI  SGQLL++LGFRLRLTF LT +FFTSTAS PQSL VEHD
Sbjct: 13  MHGVFTPVRCPTMIRNSTAIFKSGQLLVVLGFRLRLTFPLTHRFFTSTASFPQSLSVEHD 72

Query: 61  ISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKH 120
           I AQLF+ILS PNWQKHPSLKNLIPSIAPSHIS LF  NLDPQTALAFFNWIGQKHGFKH
Sbjct: 73  IPAQLFTILSRPNWQKHPSLKNLIPSIAPSHISALFALNLDPQTALAFFNWIGQKHGFKH 132

Query: 121 NVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLEMLRSMNRRGDDFKFKLT 180
           NVQSY+SMLNILVPNGYLRIAE MRILMIKSTDSSENA+FVLEMLRSMNRR D FKFKL+
Sbjct: 133 NVQSYVSMLNILVPNGYLRIAENMRILMIKSTDSSENAVFVLEMLRSMNRRVDAFKFKLS 192

Query: 181 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVS 240
           LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNI+TLNTMVNGYCKLGNVVEAELYVS
Sbjct: 193 LRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIFTLNTMVNGYCKLGNVVEAELYVS 252

Query: 241 KIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEAR 300
           KIVQAGLSLDTFTYTSLILGYCRN+NVD A  IFLSMP+KGCRRNEVSYTNLIHG CEAR
Sbjct: 253 KIVQAGLSLDTFTYTSLILGYCRNKNVDAANAIFLSMPNKGCRRNEVSYTNLIHGFCEAR 312

Query: 301 RIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYT 360
           R+ EALKLFSQMHEDNCWPTVRTYT++I ALCQL RK E  NMFKEMTEK C+PNVHTYT
Sbjct: 313 RVGEALKLFSQMHEDNCWPTVRTYTVLIFALCQLGRKTEALNMFKEMTEKRCQPNVHTYT 372

Query: 361 VLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESN 420
           VLI SLCED NFDDAKK+LNGMLEKGL+PSVVTYNALIDGYCKKG+S SALEILSLMESN
Sbjct: 373 VLICSLCEDGNFDDAKKILNGMLEKGLIPSVVTYNALIDGYCKKGLSASALEILSLMESN 432

Query: 421 NCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSA 480
           NCSPNARTYNELILGFCRAK++HKAM+LL+KMLERKLQP+VVTYN+LIHGQCK G LGSA
Sbjct: 433 NCSPNARTYNELILGFCRAKNIHKAMSLLHKMLERKLQPNVVTYNILIHGQCKEGDLGSA 492

Query: 481 YKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDG 540
           YKLL LMNESGLVPDEWTY VF+DTLCKRG VEEA  LF+SLKEKG+KANEV+YS LIDG
Sbjct: 493 YKLLSLMNESGLVPDEWTYGVFIDTLCKRGLVEEACSLFESLKEKGIKANEVMYSTLIDG 552

Query: 541 YCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPT 600
           YCKVGKVSDG  LLDKML  GC+PNSITYNSLIDGYC+EKNF+EA +LVE+MIKRDI P 
Sbjct: 553 YCKVGKVSDGRFLLDKMLSAGCVPNSITYNSLIDGYCKEKNFKEARLLVEVMIKRDIQPA 612

Query: 601 ADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIY 660
           ADTYTILI+NLL+DGE D AH++FDQMLSTGSHPDVF YTAFIHAYCSQGRLKDAE+ I 
Sbjct: 613 ADTYTILIDNLLKDGEIDHAHDVFDQMLSTGSHPDVFIYTAFIHAYCSQGRLKDAEVLIC 672

Query: 661 KMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAK 720
           KMN KGIMPDT+LYTL IDAYGRFGSI  AF ILKRM+DVGCEPS+HTYSYLIKHLSNAK
Sbjct: 673 KMNAKGIMPDTILYTLFIDAYGRFGSIDGAFGILKRMHDVGCEPSYHTYSYLIKHLSNAK 732

Query: 721 LTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGL 780
             +V+SSSEL+DLSSGVASNDF+N W+RVDYEF LELF KMV+HGCAPNANTYGKFITGL
Sbjct: 733 PKEVSSSSELSDLSSGVASNDFSNCWRRVDYEFTLELFGKMVEHGCAPNANTYGKFITGL 792

Query: 781 CKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHL 840
           CKVG LEVA RL+DHMKEKGLSPNEDIYN LLGCSCQLGLY +AIRWLDI+IE+G+LP L
Sbjct: 793 CKVGYLEVADRLFDHMKEKGLSPNEDIYNSLLGCSCQLGLYGEAIRWLDILIENGHLPRL 852

Query: 841 DSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGI 900
           DSCKLL+CGLYDEGN+EKAK VF SLLQCGYN DEMAWKVLIDGLLKKGL DKCS+LFGI
Sbjct: 853 DSCKLLLCGLYDEGNDEKAKRVFCSLLQCGYNCDEMAWKVLIDGLLKKGLSDKCSDLFGI 912

Query: 901 MERQGCRIHPKTYSMLIEGFDGIQDMD 927
           ME QGC IHPKTYSMLIEGFDG+Q++D
Sbjct: 913 METQGCHIHPKTYSMLIEGFDGVQEID 939

BLAST of Sgr018481 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 984.2 bits (2543), Expect = 7.3e-287
Identity = 477/886 (53.84%), Positives = 637/886 (71.90%), Query Frame = 0

Query: 39  LTLKFFTSTASLPQSLPVEH----DISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTL 98
           +T + F S + L ++LP E      +  +L SILS PNW K PSLK+++ +I+PSH+S+L
Sbjct: 37  VTRRQFCSVSPLLRNLPEEESDSMSVPHRLLSILSKPNWHKSPSLKSMVSAISPSHVSSL 96

Query: 99  FTHNLDPQTALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSS 158
           F+ +LDP+TAL F +WI Q   +KH+V SY S+L +L+ NGY+ +  K+R+LMIKS DS 
Sbjct: 97  FSLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKIRLLMIKSCDSV 156

Query: 159 ENALFVLEMLRSMNR-RGDDFKFKLTLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTP 218
            +AL+VL++ R MN+    + K+KL + CYN LL  L+RF ++DEMK VY+EML+D V P
Sbjct: 157 GDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEMLEDKVCP 216

Query: 219 NIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIF 278
           NIYT N MVNGYCKLGNV EA  YVSKIV+AGL  D FTYTSLI+GYC+ +++D A+++F
Sbjct: 217 NIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVF 276

Query: 279 LSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQL 338
             MP KGCRRNEV+YT+LIHGLC ARRIDEA+ LF +M +D C+PTVRTYT++I +LC  
Sbjct: 277 NEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGS 336

Query: 339 SRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTY 398
            RK E  N+ KEM E G +PN+HTYTVLI SLC    F+ A+++L  MLEKGL+P+V+TY
Sbjct: 337 ERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITY 396

Query: 399 NALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDVHKAMALLYKMLE 458
           NALI+GYCK+GM   A++++ LMES   SPN RTYNELI G+C++ +VHKAM +L KMLE
Sbjct: 397 NALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKS-NVHKAMGVLNKMLE 456

Query: 459 RKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEE 518
           RK+ PDVVTYN LI GQC+ G+  SAY+LL LMN+ GLVPD+WTY+  +D+LCK  RVEE
Sbjct: 457 RKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEE 516

Query: 519 AHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLID 578
           A  LFDSL++KGV  N V+Y+ALIDGYCK GKV + H +L+KML   C+PNS+T+N+LI 
Sbjct: 517 ACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIH 576

Query: 579 GYCREKNFQEALVLVEIMIKRDINPTADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHP 638
           G C +   +EA +L E M+K  + PT  T TILI  LL+DG+FD A++ F QMLS+G+ P
Sbjct: 577 GLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKP 636

Query: 639 DVFTYTAFIHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDIL 698
           D  TYT FI  YC +GRL DAE  + KM E G+ PD   Y+ LI  YG  G    AFD+L
Sbjct: 637 DAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVL 696

Query: 699 KRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFA 758
           KRM D GCEPS HT+  LIKHL   K  K   S             +   +   ++++  
Sbjct: 697 KRMRDTGCEPSQHTFLSLIKHLLEMKYGKQKGSEP-----------ELCAMSNMMEFDTV 756

Query: 759 LELFEKMVKHGCAPNANTYGKFITGLCKVGCLEVAHRLYDHM-KEKGLSPNEDIYNCLLG 818
           +EL EKMV+H   PNA +Y K I G+C+VG L VA +++DHM + +G+SP+E ++N LL 
Sbjct: 757 VELLEKMVEHSVTPNAKSYEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLS 816

Query: 819 CSCQLGLYDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNY 878
           C C+L  +++A + +D MI  G+LP L+SCK+L+CGLY +G  E+   VF +LLQCGY  
Sbjct: 817 CCCKLKKHNEAAKVVDDMICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQCGYYE 876

Query: 879 DEMAWKVLIDGLLKKGLVDKCSELFGIMERQGCRIHPKTYSMLIEG 919
           DE+AWK++IDG+ K+GLV+   ELF +ME+ GC+   +TYS+LIEG
Sbjct: 877 DELAWKIIIDGVGKQGLVEAFYELFNVMEKNGCKFSSQTYSLLIEG 910
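
The percentages in an HSP header are the listed counts divided by the alignment length (for the hit above, 477 identical positions over 886 aligned columns). Below is a minimal Python sketch for pulling those numbers out of a header line and recomputing the percentages. It assumes the header keeps the layout shown on this page. `parse_hsp_stats` is a hypothetical helper name, not an existing library function.

```python
import re

# Matches e.g. "Identity = 477/886 (53.84%), Positives = 637/886 (71.90%)".
# "Posi?tives" also tolerates the "Postives" misspelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, aligned, positive, aligned_pos = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positive / aligned_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 477/886 (53.84%), Positives = 637/886 (71.90%), Query Frame = 0"
)
print(stats)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, is a cheap consistency check when many hits are parsed in bulk.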

BLAST of Sgr018481 vs. TAIR 10
Match: AT3G07290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 473.8 bits (1218), Expect = 3.2e-133
Identity = 271/870 (31.15%), Positives = 461/870 (52.99%), Query Frame = 0

Query: 44  FTSTASLPQSLPVEHDISAQLFSILSHPNWQKHPSLKNLIPSIAPSHISTLFT-HNLDPQ 103
           F S +S P     +   +  + S+L  PNW+K+ SLK+L+  + P+  S + +    D  
Sbjct: 25  FFSVSSRPSLSSSDEVAAHDVASLLKTPNWEKNSSLKSLVSHMNPNVASQVISLQRSDND 84

Query: 104 TALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKMRILMIKSTDSSENALFVLE 163
             + FF W+ +   +  +      +L ++V +G  R+A  + + +IK     E  +  L+
Sbjct: 85  ICVRFFMWVCKHSSYCFDPTQKNQLLKLIVSSGLYRVAHAVIVALIKECSRCEKEM--LK 144

Query: 164 MLRSMNRRGDDFKFKLTLRCYNMLLMLLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMV 223
           ++   +   + F F+L   CY+ LLM L++  +       Y  M  D     +    T+V
Sbjct: 145 LMYCFDELREVFGFRLNYPCYSSLLMSLAKLDLGFLAYVTYRRMEADGFVVGMIDYRTIV 204

Query: 224 NGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYCRNRNVDGAYRIFLSMPSK-GC 283
           N  CK G    AE+++SKI++ G  LD+   TSL+LG+CR  N+  A ++F  M  +  C
Sbjct: 205 NALCKNGYTEAAEMFMSKILKIGFVLDSHIGTSLLLGFCRGLNLRDALKVFDVMSKEVTC 264

Query: 284 RRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVRTYTIIICALCQLSRKLEGFN 343
             N VSY+ LIHGLCE  R++EA  L  QM E  C P+ RTYT++I ALC      + FN
Sbjct: 265 APNSVSYSILIHGLCEVGRLEEAFGLKDQMGEKGCQPSTRTYTVLIKALCDRGLIDKAFN 324

Query: 344 MFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYC 403
           +F EM  +GC+PNVHTYTVLI  LC D   ++A  +   M++  + PSV+TYNALI+GYC
Sbjct: 325 LFDEMIPRGCKPNVHTYTVLIDGLCRDGKIEEANGVCRKMVKDRIFPSVITYNALINGYC 384

Query: 404 KKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDVHKAMALLYKMLERKLQPDVV 463
           K G  + A E+L++ME   C PN RT+NEL+ G CR    +KA+ LL +ML+  L PD+V
Sbjct: 385 KDGRVVPAFELLTVMEKRACKPNVRTFNELMEGLCRVGKPYKAVHLLKRMLDNGLSPDIV 444

Query: 464 TYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSL 523
           +YN+LI G C+ GH+ +AYKLL  MN   + PD  T++  ++  CK+G+ + A      +
Sbjct: 445 SYNVLIDGLCREGHMNTAYKLLSSMNCFDIEPDCLTFTAIINAFCKQGKADVASAFLGLM 504

Query: 524 KEKGVKANEVIYSALIDGYCKVGKVSDGHSLLDKMLGDGCIPNSITYNSLIDGYCREKNF 583
             KG+  +EV  + LIDG CKVGK  D   +L+ ++    +    + N ++D   +    
Sbjct: 505 LRKGISLDEVTGTTLIDGVCKVGKTRDALFILETLVKMRILTTPHSLNVILDMLSKGCKV 564

Query: 584 QEALVLVEIMIKRDINPTADTYTILIENLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAF 643
           +E L ++  + K  + P+  TYT L++ L+  G+   +  + + M  +G  P+V+ YT  
Sbjct: 565 KEELAMLGKINKLGLVPSVVTYTTLVDGLIRSGDITGSFRILELMKLSGCLPNVYPYTII 624

Query: 644 IHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGC 703
           I+  C  GR+++AE  +  M + G+ P+ + YT+++  Y   G +  A + ++ M + G 
Sbjct: 625 INGLCQFGRVEEAEKLLSAMQDSGVSPNHVTYTVMVKGYVNNGKLDRALETVRAMVERGY 684

Query: 704 EPSFHTYSYLIK-HLSNAKLTKVNSSSELNDLSSGVASNDFANLWKRVDYEFALELFEKM 763
           E +   YS L++  + + K    +  S ++D++            +  D E   EL   +
Sbjct: 685 ELNDRIYSSLLQGFVLSQKGIDNSEESTVSDIA-----------LRETDPECINELISVV 744

Query: 764 VK-HGCAPNANTYGKFITGLCKVGCLEVAHRLYDHMKEKGLSPNEDIYNCLLGCSCQLGL 823
            +  GC      +   +T LCK G  + ++ L  ++ E+G+   E   + ++   C    
Sbjct: 745 EQLGGCISGLCIF--LVTRLCKEGRTDESNDLVQNVLERGVF-LEKAMDIIMESYCSKKK 804

Query: 824 YDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGNNEKAKIVFYSLLQCGYNYDEMAWKV 883
           + K +  + ++++ G++P   S  L++ GL  EG+ E+A+ +   LL      ++     
Sbjct: 805 HTKCMELITLVLKSGFVPSFKSFCLVIQGLKKEGDAERARELVMELLTSNGVVEKSGVLT 864

Query: 884 LIDGLLKKGLVDKCSELFGIMERQGCRIHP 910
            ++ L++      CSE+  ++++  CR  P
Sbjct: 865 YVECLMEGDETGDCSEVIDLVDQLHCRERP 878

BLAST of Sgr018481 vs. TAIR 10
Match: AT1G77340.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 410.2 bits (1053), Expect = 4.3e-114
Identity = 213/416 (51.20%), Positives = 277/416 (66.59%), Query Frame = 0

Query: 84  PSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKM 143
           P   PSH+S+LF+ NLDPQTAL+F +WI +   FKHNV SY S++ +L          K+
Sbjct: 23  PFYTPSHVSSLFSLNLDPQTALSFSDWISRIPNFKHNVTSYASLVTLLCSQEIPYEVPKI 82

Query: 144 RILMIKSTDSSENALFVLEMLRSMNRRGDDF--KFKLTLRCYNMLLMLLSRFLMIDEMKS 203
            ILMIKS +S  +ALFV++  R+M R+GD F  K+KLT +CYN LL  L+RF +++EMK 
Sbjct: 83  TILMIKSCNSVRDALFVVDFCRTM-RKGDSFEIKYKLTPKCYNNLLSSLARFGLVEEMKR 142

Query: 204 VYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSLDTFTYTSLILGYC 263
           +Y EML+D+V+P+IYT NT+VNGYCKLG VVEA+ YV+ ++QAG   D FTYTS I G+C
Sbjct: 143 LYTEMLEDLVSPDIYTFNTLVNGYCKLGYVVEAKQYVTWLIQAGCDPDYFTYTSFITGHC 202

Query: 264 RNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLFSQMHEDNCWPTVR 323
           R + VD A+++F  M   GC RNEVSYT LI+GL EA++IDEAL L  +M +DNC P VR
Sbjct: 203 RRKEVDAAFKVFKEMTQNGCHRNEVSYTQLIYGLFEAKKIDEALSLLVKMKDDNCCPNVR 262

Query: 324 TYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDAKKMLNGM 383
           TYT++I ALC   +K E  N+FK+M+E G +P+   YTVLI S C     D+A  +L  M
Sbjct: 263 TYTVLIDALCGSGQKSEAMNLFKQMSESGIKPDDCMYTVLIQSFCSGDTLDEASGLLEHM 322

Query: 384 LEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILGFCRAKDV 443
           LE GL+P+V+TYNALI G+CK                                    K+V
Sbjct: 323 LENGLMPNVITYNALIKGFCK------------------------------------KNV 382

Query: 444 HKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPDEWT 498
           HKAM LL KMLE+ L PD++TYN LI GQC  G+L SAY+LL LM ESGLVP++ T
Sbjct: 383 HKAMGLLSKMLEQNLVPDLITYNTLIAGQCSSGNLDSAYRLLSLMEESGLVPNQRT 401

BLAST of Sgr018481 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 320.5 bits (820), Expect = 4.5e-87
Identity = 193/651 (29.65%), Positives = 313/651 (48.08%), Query Frame = 0

Query: 256 LILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEARR-IDEALKLFSQMHED 315
           ++  Y R   +D A  I     + G     +SY  ++     ++R I  A  +F +M E 
Sbjct: 140 VVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLES 199

Query: 316 NCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCEDQNFDDA 375
              P V TY I+I   C          +F +M  KGC PNV TY  LI   C+ +  DD 
Sbjct: 200 QVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDG 259

Query: 376 KKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESNNCSPNARTYNELILG 435
            K+L  M  KGL P++++YN +I+G C++G       +L+ M     S +  TYN LI G
Sbjct: 260 FKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKG 319

Query: 436 FCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNESGLVPD 495
           +C+  + H+A+ +  +ML   L P V+TY  LIH  CK G++  A + L  M   GL P+
Sbjct: 320 YCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPN 379

Query: 496 EWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSDGHSLLD 555
           E TY+  VD   ++G + EA+ +   + + G   + V Y+ALI+G+C  GK+ D  ++L+
Sbjct: 380 ERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLE 439

Query: 556 KMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPTADTYTILIENLLEDG 615
            M   G  P+ ++Y++++ G+CR  +  EAL +   M+++ I P   TY+ LI+   E  
Sbjct: 440 DMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQR 499

Query: 616 EFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIYKMNEKGIMPDTLLYT 675
               A +++++ML  G  PD FTYTA I+AYC +G L+ A     +M EKG++PD + Y+
Sbjct: 500 RTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYS 559

Query: 676 LLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSSELNDLSS 735
           +LI+   +      A  +L +++     PS  TY  LI++ SN +   V S         
Sbjct: 560 VLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIENCSNIEFKSVVS--------- 619

Query: 736 GVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGLCKVGCLEVAHRLYDH 795
                                                    I G C  G +  A ++++ 
Sbjct: 620 ----------------------------------------LIKGFCMKGMMTEADQVFES 679

Query: 796 MKEKGLSPNEDIYNCLLGCSCQLGLYDKAIRWLDIMIEHGYLPHLDSCKLLVCGLYDEGN 855
           M  K   P+   YN ++   C+ G   KA      M++ G+L H  +   LV  L+ EG 
Sbjct: 680 MLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIALVKALHKEGK 739

Query: 856 -NEKAKIVFYSLLQCGYNYDEMAWKVLIDGLLKKGLVDKCSELFGIMERQG 905
            NE   ++ + L  C  +  E A KVL++   ++G +D   ++   M + G
Sbjct: 740 VNELNSVIVHVLRSCELSEAEQA-KVLVEINHREGNMDVVLDVLAEMAKDG 740

BLAST of Sgr018481 vs. TAIR 10
Match: AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 304.7 bits (779), Expect = 2.6e-82
Identity = 199/735 (27.07%), Positives = 339/735 (46.12%), Query Frame = 0

Query: 84  PSIAPSHISTLFTHNLDPQTALAFFNWIGQKHGFKHNVQSYMSMLNILVPNGYLRIAEKM 143
           P + P H++ +     DP  AL  FN + ++ GFKH + +Y S++  L   GY    E M
Sbjct: 3   PPLLPKHVTAVIKCQKDPMKALEMFNSMRKEVGFKHTLSTYRSVIEKL---GYYGKFEAM 62

Query: 144 RILMIKSTDSSENALF---VLEMLRSMNRRG------------DDFKFKLTLRCYNMLLM 203
             +++   ++  N +     +  +++  R+G            D +  + T+  YN ++ 
Sbjct: 63  EEVLVDMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMS 122

Query: 204 LLSRFLMIDEMKSVYLEMLDDMVTPNIYTLNTMVNGYCKLGNVVEAELYVSKIVQAGLSL 263
           +L      D+   VY+ M D  +TP++Y+    +  +CK      A   ++ +   G  +
Sbjct: 123 VLVDSGYFDQAHKVYMRMRDRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEM 182

Query: 264 DTFTYTSLILGYCRNRNVDGAYRIFLSMPSKGCRRNEVSYTNLIHGLCEARRIDEALKLF 323
           +   Y +++ G+         Y +F  M + G      ++  L+  LC+   + E  KL 
Sbjct: 183 NVVAYCTVVGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLL 242

Query: 324 SQMHEDNCWPTVRTYTIIICALCQLSRKLEGFNMFKEMTEKGCEPNVHTYTVLIHSLCED 383
            ++ +    P + TY + I  LCQ         M   + E+G +P+V TY  LI+ LC++
Sbjct: 243 DKVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKN 302

Query: 384 QNFDDAKKMLNGMLEKGLVPSVVTYNALIDGYCKKGMSMSALEILSLMESNNCSPNARTY 443
             F +A+  L  M+ +GL P   TYN LI GYCK GM   A  I+     N   P+  TY
Sbjct: 303 SKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQFTY 362

Query: 444 NELILGFCRAKDVHKAMALLYKMLERKLQPDVVTYNLLIHGQCKGGHLGSAYKLLGLMNE 503
             LI G C   + ++A+AL  + L + ++P+V+ YN LI G    G +  A +L   M+E
Sbjct: 363 RSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQLANEMSE 422

Query: 504 SGLVPDEWTYSVFVDTLCKRGRVEEAHLLFDSLKEKGVKANEVIYSALIDGYCKVGKVSD 563
            GL+P+  T+++ V+ LCK G V +A  L   +  KG   +   ++ LI GY    K+ +
Sbjct: 423 KGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKMEN 482

Query: 564 GHSLLDKMLGDGCIPNSITYNSLIDGYCREKNFQEALVLVEIMIKRDINPTADTYTILIE 623
              +LD ML +G  P+  TYNSL++G C+   F++ +   + M+++   P   T+ IL+E
Sbjct: 483 ALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLE 542

Query: 624 NLLEDGEFDRAHNMFDQMLSTGSHPDVFTYTAFIHAYCSQGRLKDAELFIYKMNEK-GIM 683
           +L    + D A  + ++M +   +PD  T+   I  +C  G L  A     KM E   + 
Sbjct: 543 SLCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVS 602

Query: 684 PDTLLYTLLIDAYGRFGSIGCAFDILKRMYDVGCEPSFHTYSYLIKHLSNAKLTKVNSSS 743
             T  Y ++I A+    ++  A  + + M D    P  +TY  ++               
Sbjct: 603 SSTPTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLGPDGYTYRLMV--------------- 662

Query: 744 ELNDLSSGVASNDFANLWKRVDYEFALELFEKMVKHGCAPNANTYGKFITGLCKVGCLEV 803
                  G       NL     Y+F LE    M+++G  P+  T G+ I  LC    +  
Sbjct: 663 ------DGFCKTGNVNL----GYKFLLE----MMENGFIPSLTTLGRVINCLCVEDRVYE 705

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_022153102.1  0.0e+00  91.59     pentatricopeptide repeat-containing protein At5g65560 isoform X1 [Momordica char...
XP_038885361.1  0.0e+00  88.89     pentatricopeptide repeat-containing protein At5g65560 [Benincasa hispida]
XP_023545913.1  0.0e+00  88.24     pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucurbita...
KAG6599094.1    0.0e+00  87.92     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG7030032.1    0.0e+00  87.59     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
Match Name  E-value   Identity  Description
Q9LSL9      1.0e-285  53.84     Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Q9SFV9      4.5e-132  31.15     Pentatricopeptide repeat-containing protein At3g07290, mitochondrial OS=Arabidop...
Q9FIX3      6.4e-86   29.65     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9SZ52      6.6e-83   28.32     Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop...
Q9CA58      3.6e-81   27.07     Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th...
Match Name  E-value  Identity  Description
A0A6J1DI13  0.0e+00  91.59     pentatricopeptide repeat-containing protein At5g65560 isoform X1 OS=Momordica ch...
A0A6J1GH11  0.0e+00  86.64     pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita moschata...
A0A6J1KKQ2  0.0e+00  86.21     pentatricopeptide repeat-containing protein At5g65560-like OS=Cucurbita maxima O...
A0A5A7T899  0.0e+00  85.87     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S4E4V7  0.0e+00  85.98     pentatricopeptide repeat-containing protein At5g65560 OS=Cucumis melo OX=3656 GN...
Match Name   E-value   Identity  Description
AT5G65560.1  7.3e-287  53.84     Pentatricopeptide repeat (PPR) superfamily protein
AT3G07290.1  3.2e-133  31.15     Pentatricopeptide repeat (PPR) superfamily protein
AT1G77340.1  4.3e-114  51.20     Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  4.5e-87   29.65     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74580.1  2.6e-82   27.07     Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 529..577  e-value: 3.0E-14  score: 52.9
    coord: 875..919  e-value: 1.5E-7  score: 31.5
    coord: 388..437  e-value: 5.2E-16  score: 58.6
    coord: 458..507  e-value: 1.4E-13  score: 50.8
    coord: 284..332  e-value: 7.6E-15  score: 54.9
    coord: 606..647  e-value: 5.4E-8  score: 32.9
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 349..381  e-value: 1.1E-9  score: 37.9
    coord: 210..235  e-value: 5.7E-7  score: 29.1
    coord: 245..276  e-value: 4.1E-8  score: 32.8
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 182..208  e-value: 1.2  score: 9.5
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 566..599  e-value: 6.8E-9  score: 33.4
    coord: 216..249  e-value: 1.1E-5  score: 23.2
    coord: 251..284  e-value: 8.9E-8  score: 29.9
    coord: 673..704  e-value: 6.4E-5  score: 20.9
    coord: 806..837  e-value: 1.6E-4  score: 19.6
    coord: 771..804  e-value: 7.4E-8  score: 30.1
    coord: 427..460  e-value: 1.5E-7  score: 29.1
    coord: 461..495  e-value: 1.8E-6  score: 25.8
    coord: 286..316  e-value: 1.1E-8  score: 32.7
    coord: 322..355  e-value: 1.4E-7  score: 29.2
    coord: 636..670  e-value: 1.4E-6  score: 26.1
    coord: 531..564  e-value: 8.6E-9  score: 33.0
    coord: 602..635  e-value: 1.4E-6  score: 26.1
    coord: 391..425  e-value: 7.0E-10  score: 36.5
    coord: 356..390  e-value: 1.3E-8  score: 32.5
    coord: 497..529  e-value: 1.0E-6  score: 26.5
    coord: 876..906  e-value: 4.9E-4  score: 18.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 756..812  e-value: 4.3E-9  score: 36.3
    coord: 660..715  e-value: 3.3E-9  score: 36.7
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 424..458  score: 11.73961
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 599..633  score: 10.533867
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 529..563  score: 12.517862
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 284..318  score: 12.780933
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 389..423  score: 12.627475
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 768..802  score: 11.882107
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 494..528  score: 11.728648
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 873..907  score: 10.117337
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 459..493  score: 12.024604
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 214..248  score: 10.161182
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 564..598  score: 12.945353
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 803..837  score: 10.709248
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 669..703  score: 10.446177
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 249..283  score: 12.046526
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 319..353  score: 11.114816
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 354..388  score: 12.178061
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 634..668  score: 12.386327
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 454..522  e-value: 2.3E-18  score: 68.4
    coord: 239..312  e-value: 2.1E-20  score: 75.0
    coord: 590..659  e-value: 1.9E-15  score: 58.9
    coord: 76..238  e-value: 6.5E-21  score: 76.7
    coord: 383..453  e-value: 4.5E-20  score: 74.0
    coord: 523..589  e-value: 1.2E-19  score: 72.6
    coord: 313..382  e-value: 4.6E-19  score: 70.7
    coord: 660..729  e-value: 2.4E-11  score: 45.5
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 741..926  e-value: 2.9E-36  score: 127.4
None  No IPR available  PANTHER  PTHR47938  RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED
    coord: 878..919
    coord: 27..873
None  No IPR available  PANTHER  PTHR47938:SF4  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 878..919
None  No IPR available  PANTHER  PTHR47938:SF4  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 27..873
None  No IPR available  SUPERFAMILY  81901  HCP-like
    coord: 356..664
None  No IPR available  SUPERFAMILY  81901  HCP-like
    coord: 742..873
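
The PROSITE PS51375 hits in the table above are listed out of order, which hides how the repeats are arranged along the protein. The following Python sketch, using only the coordinates copied from those rows, sorts the hits and merges back-to-back repeats into runs. `parse_coords` and `tandem_runs` are hypothetical helper names introduced for illustration.

```python
import re

# Coordinates copied from the PROSITE PS51375 rows of the InterPro table.
PROSITE_HITS = """
coord: 424..458  coord: 599..633  coord: 529..563  coord: 284..318
coord: 389..423  coord: 768..802  coord: 494..528  coord: 873..907
coord: 459..493  coord: 214..248  coord: 564..598  coord: 803..837
coord: 669..703  coord: 249..283  coord: 319..353  coord: 354..388
coord: 634..668
"""


def parse_coords(text):
    """Return sorted (start, end) pairs from 'coord: a..b' entries."""
    pairs = re.findall(r"coord: (\d+)\.\.(\d+)", text)
    return sorted((int(a), int(b)) for a, b in pairs)


def tandem_runs(spans):
    """Merge spans that begin immediately after the previous span ends.

    Returns a list of (run_start, run_end, n_repeats) tuples.
    """
    runs = []
    for start, end in spans:
        if runs and start == runs[-1][1] + 1:
            first, _, count = runs[-1]
            runs[-1] = (first, end, count + 1)
        else:
            runs.append((start, end, 1))
    return runs


spans = parse_coords(PROSITE_HITS)
runs = tandem_runs(spans)
print(len(spans), runs)
```

On these coordinates the 17 hits are each 35 residues long and fall into three runs: 14 consecutive repeats spanning 214..703, two spanning 768..837, and a single repeat at 873..907.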

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr018481.1   Sgr018481.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding