Sgr029061 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr029061
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153210: 2922061 .. 2926077 (-)
RNA-Seq Expression: Sgr029061
Synteny: Sgr029061
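
The Location value in the Overview packs the contig, coordinates, and strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be split into fields and the genomic span computed. The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

# Hypothetical helper for location strings shaped like the one in the Overview:
#   "<contig>: <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into contig, start, end, strand, and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
        "length": end - start + 1,
    }


loc = parse_location("tig00153210: 2922061 .. 2926077 (-)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 4,017 bp on the minus strand of contig tig00153210.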
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGATCAATAACATGAAACCCACCTACTCGATCTTCATATCCAAGCTTCTGCTGTTGCAGTATCTCTCTGTTGTCTGTCTTTCCCAGGATTTCGATTTCTTCTACTTTGTTCAGCAGGCGAGTCTCTTTATTATTTCCTGATCTCTCTCTGATTCATTATGGGGTTTCTGTTCAGATAATTAATCATTTTTTAGTTTCATCCGATCTAAAAAATTCTGGGTTTTGCAGTGGCCGGGAGCATATTGCGATACAAAGCGCAGCTGCTGCTATCCTAAAACAGGAAGGCCTTCTGCAGATTTTGGCATTCATGGTCTTTGGCCTAATTACAAAGATGGAACCTACCCGTCCAGCTGCGATCCCGACAGCGTTTTCGACCTAACTCAGGTAAACTTGCAAGCTAAGATTCATCAACATGTGTTCATATATTTAGCGGTTTGATCAGTTTCCGTGCAACTCATGTTCTTCTTGATATCTCCTTAATAGATCTCAGATCTATTAAGCAGCATGCAGAAGCACTGGCCATCGTTGAGCTGCCCAAGCAGCAACGGGTTAAGATTTTGGTCACACGAATGGGAGAAACACGGGACTTGCTCCGAGTCTGAACTCGACCAGAAAGAATATTTCGAAGCAGCTATCAAACTGAAGCAGAAGGCCAACCTGCTCAAAGTTCTTAATGATGCAGGAATTGAACCGAATGATGAGTTCTACAGCTTGGAGAGTATCAGAAATGCCATTGAAGAGGGGGTTGGATTTACTCCCGGGATTGAATGTAACAGGGATTCAGCTGGCAACACTCAACTTTACCAGATTTATCTGTGTGTGGATACTTCTGGGTCTGTGTTCATCAAATGTCCAGTTCTTCCCAAGGGAAGATGTGCTTCCAGTATTCAATTCCCCAAGTTTTAAATTAAGCACTCTCTGATCTGTCATTAAAGAGTATATGGACATGTCTTTTTTGTCTCTTTCTTTCTTTCTTTCTTGTAATTTCATTCGCTAATGGCTCCTCTGATAGCATTGTGGATGAACCAGCAGCCATTTGGTTGGTAATAAAATTACTGTCTTTGTGTTAGAGATACAAAGAAACCCAATTGGGGAGTTTAGATTCATGTTTGGGAATTTGTGTTTTGTTTTGTTCCCCTAAATTGTATAAATGTCCACATTTTGTTACTAAATTTTTTTAAAAATTTAACTTTATAATTTCAATTTTATCCATTTTAACTTTTAACTTGATCAATAGATTAAAAAAAAATTAATACGCCCTATCTAGTCACTACTAAAAATTTGTTTGTTTCGTTAAATTAGTTAATTTCAACTTTAAAATTTCAATTTTGTTCACTTATTATATTATTTTAATTAATTTTGTTACTATTTTTTGTGGGTTTCGATTTTCGAATCCTTTCGAGAAAACGAAAATAAAAGGACAAATTTTAGTTGAAAAGTAGTTTTTATTAATCGGGTCAATTAATTACTTAAACTCAATAAAATATTAAAGTTCAGAAAGATGGAGATAAGATAAGTAAAATCATTATTAAAAAAAAAAAAAAAAAAAAAAGGTATTTTTGTCATGAGTTTAAAAATGTTGATATTTACTAAAAAAAATTCCAAGGATCGTTCGTCTAAACTATCCATCCAGATTCCAGACCAAGCCACGCCAAAGGAGTAAAATTTGCCAGAGGGGATTGTTTCCTTCTTAAAGATGGATTTACGTCAACCGTTTGTGGCGGTGGGAGAGGCGGCCAAGTTACCGGGAAAATGGTCGCTCTTTTACTTCCTCCCCATCTCCCCTCATCTCCCTCTTTCTCGCCTGCTTCACTTTCTCGGCATCTTTTCCAAAAGATGATCCTTTTCGCATTTTCAAGGTAAATTCTCTCTCTCTCACCTTGCTCCAATCATTTCCTTCTTCACTGGACTGTTAAGCTTCATCGTGCTATACTTTTTTGGGTCGAGTTTCAAGTGGTTTTAGCACTCTGGAACGGCGTCTTTTGATTCATATCTTT
TCGGATGTGCTTCGTCGTTTCCGTGGCCGGAATTTTGTTCAAGTTGTTTGAAGTTGAGATAATTTACTTTCCTAGTAGTTCCTTTGCTCTTTGGGCTCTCTTGTTTAGGCAAACAGCTGCAGTCTGCAGCATATAGGTTGTTCGCATCTGCAGCTGTGTCCTCTGCAATGTACTACATCGTTTAAGAGTCAAATTAAGTACTGAAATGGCTATCTGGTGTATTCCAATATGCCAATTCACTCTCACAAAACCAAACTCAGCCTTTTCGAAGAATGAGTTCATAAATCAACCCCACCCTCTCTCTCTTTTGTCCAAATGTACATCTTTCAGGGAGCTCAAGCAAATTCAAGCCTTTACCATCAAAACCAATCTTCACAATGACATCTCTGTCCTCACCAAGCTCATTAATTTTTGCACACTCAACCCCACAACTTCATCCATGGACCACGCCCACCATCTTTTCGATCAAATTCCAGACAAGGATATTGTCCTTTTTAACTTACTGGCACGTGGTTATGCTCGCTCTAACACTCCCTATCTTGCATTTTCTCTTTTTGCTCAAATTCTCTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCATCCCTTCTCAAGGCATGTGCTAGTTCTAAGGCCTTGAAGGAAGGTAGGCAGTTGCATTGTTTTGCTATTAAACTCGGGCTGAATCATAATATTTACCTATGCCCAACCCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCAGCACGTGAAGTCTTTGATGGAATGGACGGGCCATGTATTGTTAGCTATAATTCAATTATCACAGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCAAGTAATCTTGAGCCTACTGATGTTACTATGCTTAGTATAATTATGTCATGTGCTCTGTTGGGAGCACTAGACATAGGAAGGTGGATTCATGAATATGTTAAGAAGAATGGGTTTGATAAGTATGTGAAGGTGAACACTGCACTTATAGATATGTATGCGAAATGCGGAAGTCTAGCTGATGCTATTTCTATCTTTGAGGGAATGCATGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTAGCATATGCAACTCATGGGGATGGCTTGAAAGCCATCTCATTGTTTGACGAGATGAAGAGGGCAGGAGTTCGACCTGATGAGATTACTTTTTTGGGGCTTTTGTATGCTTGCAGTCATGCTGGGCTAGTAGAGGAAGGTACAGGATATTTTTATAGCATGTCTAAAAACCATGGAATAGTCCCAGGGATCAAGCATTATGGATGTATGGTGGATTTGCTTGGTCGAACAGGACGTTTAGATGAGGCTTATAAGTTCATAGATGGATTGGAAATCAAGCCCACACCTATACTCTGGCGCACCCTGTTATCTGCTTGCAGCAACCATGGTAATGTCGACTTAGCAAAGCAGGTTATTGAACGAATTTTTGAATTAGATGACTCCCATGGAGGGGACTATGTCATATTATCGAACTTGTGTGCTAGAGTAGGAAGATGGGAAGATGTGAACCATCTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGCTCTGTAGAGGTAAACAATGTAGTGCATGAATTCTTCTCTGGAGATGGAGTTCACTCCATTTCGGTGGAGTTGCACCGAGCACTTGATGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTACCAGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTTAGATATCACAGTGAAAAATTGGCCATTGCTTTCGGGCTCCTAAATACACCTCCTGGTACAACTATAAGGGTAGTCAAGAACCTTCGTATTTGCGGAGATTGCCATACTGCTGCAAAACTTATATCATTAATTTTTGGGAGGCAGATCATCATTAGGGACGTTCAACGATTCCATCGATTCAAAGATGGGAAGTGCTCCT
GCTGTGATTTCTGGTAA

mRNA sequence

ATGTCGATCAATAACATGAAACCCACCTACTCGATCTTCATATCCAAGCTTCTGCTGTTGCAGTATCTCTCTGTTGTCTGTCTTTCCCAGGATTTCGATTTCTTCTACTTTTGGCCGGGAGCATATTGCGATACAAAGCGCAGCTGCTGCTATCCTAAAACAGGAAGGCCTTCTGCAGATTTTGGCATTCATGGTCTTTGGCCTAATTACAAAGATGGAACCTACCCGTCCAGCTGCGATCCCGACAGCGTTTTCGACCTAACTCAGATCTCAGATCTATTAAGCAGCATGCAGAAGCACTGGCCATCGTTGAGCTGCCCAAGCAGCAACGGGTTAAGATTTTGGTCACACGAATGGGAGAAACACGGGACTTGCTCCGAGTCTGAACTCGACCAGAAAGAATATTTCGAAGCAGCTATCAAACTGAAGCAGAAGGCCAACCTGCTCAAAGTTCTTAATGATGCAGGAATTGAACCGAATGATGAGTTCTACAGCTTGGAGAGTATCAGAAATGCCATTGAAGAGGGGGTTGGATTTACTCCCGGGATTGAATGTAACAGGGATTCAGCTGGCAACACTCAACTTTACCAGATTTATCTGTGTGTGGATACTTCTGGGTCTGTGTTCATCAAATGTCCAGTTCTTCCCAAGGGAAGATGCAAACAGCTGCAGTCTGCAGCATATAGGTTGTTCGCATCTGCAGCTGTGTCCTCTGCAATGGAGCTCAAGCAAATTCAAGCCTTTACCATCAAAACCAATCTTCACAATGACATCTCTGTCCTCACCAAGCTCATTAATTTTTGCACACTCAACCCCACAACTTCATCCATGGACCACGCCCACCATCTTTTCGATCAAATTCCAGACAAGGATATTGTCCTTTTTAACTTACTGGCACGTGGTTATGCTCGCTCTAACACTCCCTATCTTGCATTTTCTCTTTTTGCTCAAATTCTCTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCATCCCTTCTCAAGGCATGTGCTAGTTCTAAGGCCTTGAAGGAAGGTAGGCAGTTGCATTGTTTTGCTATTAAACTCGGGCTGAATCATAATATTTACCTATGCCCAACCCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCAGCACGTGAAGTCTTTGATGGAATGGACGGGCCATGTATTGTTAGCTATAATTCAATTATCACAGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCAAGTAATCTTGAGCCTACTGATGTTACTATGCTTAGTATAATTATGTCATGTGCTCTGTTGGGAGCACTAGACATAGGAAGGTGGATTCATGAATATGTTAAGAAGAATGGGTTTGATAAGTATGTGAAGGTGAACACTGCACTTATAGATATGTATGCGAAATGCGGAAGTCTAGCTGATGCTATTTCTATCTTTGAGGGAATGCATGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTAGCATATGCAACTCATGGGGATGGCTTGAAAGCCATCTCATTGTTTGACGAGATGAAGAGGGCAGGAGTTCGACCTGATGAGATTACTTTTTTGGGGCTTTTGTATGCTTGCAGTCATGCTGGGCTAGTAGAGGAAGGTACAGGATATTTTTATAGCATGTCTAAAAACCATGGAATAGTCCCAGGGATCAAGCATTATGGATGTATGGTGGATTTGCTTGGTCGAACAGGACGTTTAGATGAGGCTTATAAGTTCATAGATGGATTGGAAATCAAGCCCACACCTATACTCTGGCGCACCCTGTTATCTGCTTGCAGCAACCATGGTAATGTCGACTTAGCAAAGCAGGTTATTGAACGAATTTTTGAATTAGATGACTCCCATGGAGGGGACTATGTCATATTATCGAACTTGTGTGCTAGAGTAGGAAGATGGGAAGATGTGAACCATCTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGG
GTGTAGCTCTGTAGAGGTAAACAATGTAGTGCATGAATTCTTCTCTGGAGATGGAGTTCACTCCATTTCGGTGGAGTTGCACCGAGCACTTGATGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTACCAGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTTAGATATCACAGTGAAAAATTGGCCATTGCTTTCGGGCTCCTAAATACACCTCCTGGTACAACTATAAGGGTAGTCAAGAACCTTCGTATTTGCGGAGATTGCCATACTGCTGCAAAACTTATATCATTAATTTTTGGGAGGCAGATCATCATTAGGGACGTTCAACGATTCCATCGATTCAAAGATGGGAAGTGCTCCTGCTGTGATTTCTGGTAA

Coding sequence (CDS)

ATGTCGATCAATAACATGAAACCCACCTACTCGATCTTCATATCCAAGCTTCTGCTGTTGCAGTATCTCTCTGTTGTCTGTCTTTCCCAGGATTTCGATTTCTTCTACTTTTGGCCGGGAGCATATTGCGATACAAAGCGCAGCTGCTGCTATCCTAAAACAGGAAGGCCTTCTGCAGATTTTGGCATTCATGGTCTTTGGCCTAATTACAAAGATGGAACCTACCCGTCCAGCTGCGATCCCGACAGCGTTTTCGACCTAACTCAGATCTCAGATCTATTAAGCAGCATGCAGAAGCACTGGCCATCGTTGAGCTGCCCAAGCAGCAACGGGTTAAGATTTTGGTCACACGAATGGGAGAAACACGGGACTTGCTCCGAGTCTGAACTCGACCAGAAAGAATATTTCGAAGCAGCTATCAAACTGAAGCAGAAGGCCAACCTGCTCAAAGTTCTTAATGATGCAGGAATTGAACCGAATGATGAGTTCTACAGCTTGGAGAGTATCAGAAATGCCATTGAAGAGGGGGTTGGATTTACTCCCGGGATTGAATGTAACAGGGATTCAGCTGGCAACACTCAACTTTACCAGATTTATCTGTGTGTGGATACTTCTGGGTCTGTGTTCATCAAATGTCCAGTTCTTCCCAAGGGAAGATGCAAACAGCTGCAGTCTGCAGCATATAGGTTGTTCGCATCTGCAGCTGTGTCCTCTGCAATGGAGCTCAAGCAAATTCAAGCCTTTACCATCAAAACCAATCTTCACAATGACATCTCTGTCCTCACCAAGCTCATTAATTTTTGCACACTCAACCCCACAACTTCATCCATGGACCACGCCCACCATCTTTTCGATCAAATTCCAGACAAGGATATTGTCCTTTTTAACTTACTGGCACGTGGTTATGCTCGCTCTAACACTCCCTATCTTGCATTTTCTCTTTTTGCTCAAATTCTCTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCATCCCTTCTCAAGGCATGTGCTAGTTCTAAGGCCTTGAAGGAAGGTAGGCAGTTGCATTGTTTTGCTATTAAACTCGGGCTGAATCATAATATTTACCTATGCCCAACCCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCAGCACGTGAAGTCTTTGATGGAATGGACGGGCCATGTATTGTTAGCTATAATTCAATTATCACAGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCAAGTAATCTTGAGCCTACTGATGTTACTATGCTTAGTATAATTATGTCATGTGCTCTGTTGGGAGCACTAGACATAGGAAGGTGGATTCATGAATATGTTAAGAAGAATGGGTTTGATAAGTATGTGAAGGTGAACACTGCACTTATAGATATGTATGCGAAATGCGGAAGTCTAGCTGATGCTATTTCTATCTTTGAGGGAATGCATGTGAGAGATACACAAGCTTGGTCTGCAATGATAGTAGCATATGCAACTCATGGGGATGGCTTGAAAGCCATCTCATTGTTTGACGAGATGAAGAGGGCAGGAGTTCGACCTGATGAGATTACTTTTTTGGGGCTTTTGTATGCTTGCAGTCATGCTGGGCTAGTAGAGGAAGGTACAGGATATTTTTATAGCATGTCTAAAAACCATGGAATAGTCCCAGGGATCAAGCATTATGGATGTATGGTGGATTTGCTTGGTCGAACAGGACGTTTAGATGAGGCTTATAAGTTCATAGATGGATTGGAAATCAAGCCCACACCTATACTCTGGCGCACCCTGTTATCTGCTTGCAGCAACCATGGTAATGTCGACTTAGCAAAGCAGGTTATTGAACGAATTTTTGAATTAGATGACTCCCATGGAGGGGACTATGTCATATTATCGAACTTGTGTGCTAGAGTAGGAAGATGGGAAGATGTGAACCATCTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGG
GTGTAGCTCTGTAGAGGTAAACAATGTAGTGCATGAATTCTTCTCTGGAGATGGAGTTCACTCCATTTCGGTGGAGTTGCACCGAGCACTTGATGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTACCAGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCCTTAGATATCACAGTGAAAAATTGGCCATTGCTTTCGGGCTCCTAAATACACCTCCTGGTACAACTATAAGGGTAGTCAAGAACCTTCGTATTTGCGGAGATTGCCATACTGCTGCAAAACTTATATCATTAATTTTTGGGAGGCAGATCATCATTAGGGACGTTCAACGATTCCATCGATTCAAAGATGGGAAGTGCTCCTGCTGTGATTTCTGGTAA

Protein sequence

MSINNMKPTYSIFISKLLLLQYLSVVCLSQDFDFFYFWPGAYCDTKRSCCYPKTGRPSADFGIHGLWPNYKDGTYPSSCDPDSVFDLTQISDLLSSMQKHWPSLSCPSSNGLRFWSHEWEKHGTCSESELDQKEYFEAAIKLKQKANLLKVLNDAGIEPNDEFYSLESIRNAIEEGVGFTPGIECNRDSAGNTQLYQIYLCVDTSGSVFIKCPVLPKGRCKQLQSAAYRLFASAAVSSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW
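
The protein sequence above should correspond to a translation of the coding sequence. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (the record does not name the translation table). The function `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order for each codon position
# (assumption: the annotation used the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last eight codons of the CDS listed in this record.
print(translate("ATGTCGATCAATAACATG"))        # start of the protein
print(translate("TGCTCCTGCTGTGATTTCTGGTAA"))  # end of the protein, up to the stop codon
```

To check the whole record, the CDS lines would first be joined into one string (they are wrapped across lines here) and the result compared with the protein sequence.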
Homology
BLAST of Sgr029061 vs. NCBI nr
Match: XP_022151060.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 1083.2 bits (2800), Expect = 0.0e+00
Identity = 522/572 (91.26%), Positives = 546/572 (95.45%), Query Frame = 0

Query: 230 LFASAAVSSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPD 289
           LF  +  +S  ELKQIQAFTIKTNL NDISVLTK+INFCTLNP+TSSMDHAHHLFDQIPD
Sbjct: 32  LFLLSKCTSLRELKQIQAFTIKTNLQNDISVLTKIINFCTLNPSTSSMDHAHHLFDQIPD 91

Query: 290 KDIVLFNLLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQL 349
           KDIVLFN++ARGYARSNTPYLAFSLF+Q+LCSGLLPDDYTFSSLLKACASSKA  EGRQL
Sbjct: 92  KDIVLFNIMARGYARSNTPYLAFSLFSQVLCSGLLPDDYTFSSLLKACASSKAFSEGRQL 151

Query: 350 HCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQP 409
           HCFAIKLGLNHNIY+CP+LIN+YAECNDMNAAR VFD M+ PCIVSYN+IITG+ARSSQP
Sbjct: 152 HCFAIKLGLNHNIYICPSLINLYAECNDMNAARGVFDEMEAPCIVSYNAIITGHARSSQP 211

Query: 410 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTAL 469
           NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALD+GRWIHEYVKK GFDK+VKVNTAL
Sbjct: 212 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDLGRWIHEYVKKKGFDKFVKVNTAL 271

Query: 470 IDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDE 529
           IDMYAKCGSL DAISIFE M VRDTQAWSAMIVAYATHGDGLKAIS+F+EMKRAGVRPDE
Sbjct: 272 IDMYAKCGSLVDAISIFEDMRVRDTQAWSAMIVAYATHGDGLKAISMFEEMKRAGVRPDE 331

Query: 530 ITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFID 589
           ITFLGLLYACSHAGLVEEG GYF SMSK +GI PGIKHYGCMVDLLGRTG LDEAYKFID
Sbjct: 332 ITFLGLLYACSHAGLVEEGRGYFNSMSKYYGIAPGIKHYGCMVDLLGRTGHLDEAYKFID 391

Query: 590 GLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWED 649
           G EIKPTPILWRTLLSACSN GNVDLAK+VIERIFELDDSHGGDYVILSNLCARVGRWED
Sbjct: 392 GSEIKPTPILWRTLLSACSNRGNVDLAKRVIERIFELDDSHGGDYVILSNLCARVGRWED 451

Query: 650 VNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGY 709
           VNH+RKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVEL RALDELIKEIKLVGY
Sbjct: 452 VNHIRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELRRALDELIKEIKLVGY 511

Query: 710 VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAK 769
           VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLN+PPGT IRVVKNLRICGDCHTAAK
Sbjct: 512 VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNSPPGTPIRVVKNLRICGDCHTAAK 571

Query: 770 LISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
           LIS IFGRQI+IRDVQRFHRF+DGKCSCCDFW
Sbjct: 572 LISFIFGRQIVIRDVQRFHRFEDGKCSCCDFW 603
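
Each HSP statistics line reports identities and positives as a count over the alignment length, followed by a percentage. As a minimal sketch, the percentages can be recomputed from the counts to confirm they are consistent. The function name `check_hsp_percentages` is hypothetical, and the regular expression assumes the exact `label = n/m (p%)` layout used in this record.

```python
import re

# Matches fields such as "Identity = 522/572 (91.26%)".
FRACTION_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_percentages(stats_line):
    """Return {label: (recomputed_percent, reported_percent)} for one HSP line."""
    results = {}
    for label, matched, aligned, reported in FRACTION_RE.findall(stats_line):
        recomputed = round(100 * int(matched) / int(aligned), 2)
        results[label] = (recomputed, float(reported))
    return results


line = "Identity = 522/572 (91.26%), Positives = 546/572 (95.45%), Query Frame = 0"
for label, (recomputed, reported) in check_hsp_percentages(line).items():
    status = "ok" if abs(recomputed - reported) < 0.005 else "mismatch"
    print(label, recomputed, reported, status)
```

For the first hit, 522/572 and 546/572 round to the reported 91.26% and 95.45%.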

BLAST of Sgr029061 vs. NCBI nr
Match: XP_023541252.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1077.8 bits (2786), Expect = 0.0e+00
Identity = 517/565 (91.50%), Positives = 541/565 (95.75%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           SS  ELKQIQA+TIKTNLHNDISVLTKLINFCT NPT SSMDHAHHLFD++ DKDIVLFN
Sbjct: 34  SSLRELKQIQAYTIKTNLHNDISVLTKLINFCTRNPTISSMDHAHHLFDKMLDKDIVLFN 93

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
           ++ARGYARSN+PYL FSLFAQ+LCSGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKL
Sbjct: 94  IMARGYARSNSPYLVFSLFAQVLCSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKL 153

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLF 416
           GL HNIY+CPTLINMYA CNDMNAAR VFDGM+ PCIVSYN+IITGYARSSQPNEALSLF
Sbjct: 154 GLGHNIYICPTLINMYAACNDMNAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLF 213

Query: 417 RELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKC 476
           RELQASNLEPTDVTMLSIIMSCALLGALD+GRWIHEYVKK GFDK+VKVNTALIDMYAKC
Sbjct: 214 RELQASNLEPTDVTMLSIIMSCALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKC 273

Query: 477 GSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLL 536
           GS+ DAISIFEGM VRDTQAWSAMIVA+ATHGDGLKAIS+F+EMK+AGVRPDEITFLGLL
Sbjct: 274 GSIVDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLL 333

Query: 537 YACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPT 596
           YACSHAGLVEEG GYFYSM KNHGI PGIKHYGCMVDLLGRTGRLDEAYKFID L IKPT
Sbjct: 334 YACSHAGLVEEGRGYFYSMYKNHGITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPT 393

Query: 597 PILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKL 656
           PILWRTLLSACSNHGNVDLAK+VIERIFELDDSHGGDYVILSNLCAR+GRWEDVN LRKL
Sbjct: 394 PILWRTLLSACSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKL 453

Query: 657 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLV 716
           MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVEL RALDELIKEIKL GYVPDTSLV
Sbjct: 454 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELRRALDELIKEIKLAGYVPDTSLV 513

Query: 717 YHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFG 776
           YHADMEEE KELVLRYHSEKLA+AFGLLNTPPGTTIRVVKNLRICGDCH AAKLISLIFG
Sbjct: 514 YHADMEEEAKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFG 573

Query: 777 RQIIIRDVQRFHRFKDGKCSCCDFW 802
           RQI+IRDVQRFHRF+DG+CSCCDFW
Sbjct: 574 RQIVIRDVQRFHRFEDGQCSCCDFW 598

BLAST of Sgr029061 vs. NCBI nr
Match: KAG7013140.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1074.7 bits (2778), Expect = 6.2e-310
Identity = 522/601 (86.86%), Positives = 553/601 (92.01%), Query Frame = 0

Query: 201 CVDTSGSVFIKCPVLPKGRCKQLQSAAYRLFASAAVSSAMELKQIQAFTIKTNLHNDISV 260
           C+ TS     K    PK   K+  +  + L   +  +S  ELKQIQA+TIKTNLHNDISV
Sbjct: 5   CIPTSQFALTK----PK---KEFINQPHPLSLFSKCTSLRELKQIQAYTIKTNLHNDISV 64

Query: 261 LTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGYARSNTPYLAFSLFAQILC 320
           LTKLINFCT NPTTSSMDHAHHLFD++ DKDIVLFN++ARGYARSN+PYL FSLFAQ+LC
Sbjct: 65  LTKLINFCTRNPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQVLC 124

Query: 321 SGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNIYLCPTLINMYAECNDMNA 380
           SGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLGL HNIY+CPTLINMYA CNDMNA
Sbjct: 125 SGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGLGHNIYICPTLINMYAACNDMNA 184

Query: 381 AREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCAL 440
           AR VFDGM+ PCIVSYN+IITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCAL
Sbjct: 185 ARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCAL 244

Query: 441 LGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVRDTQAWSAM 500
           LGALD+GRWIHEYVKK GFDK+VKVNTALIDMYAKCGS+ DAISIFEGM VRDTQAWSAM
Sbjct: 245 LGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWSAM 304

Query: 501 IVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHG 560
           IVA+ATHGDGLKAIS+F+EMK+AGVRPDEITFLGLLYACSHAGLV+EG GYFYSM KNHG
Sbjct: 305 IVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVDEGRGYFYSMYKNHG 364

Query: 561 IVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVI 620
           I PGIKHYGCMVDLLGRTGRLDEAYKFID L IKPTPILWRTLLSACSNHGNVDLAK+VI
Sbjct: 365 ITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKRVI 424

Query: 621 ERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEF 680
           ERIFELDDSHGGDYVILSNLCAR+GRWEDVN LRKLMKDRGV KVPGCSSVEVNNVVHEF
Sbjct: 425 ERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVGKVPGCSSVEVNNVVHEF 484

Query: 681 FSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIA 740
           FSGDGVHSISVEL RALDELIKEIKL GY PDTSLVYHADMEEE KELVLRYHSEKLA+A
Sbjct: 485 FSGDGVHSISVELRRALDELIKEIKLAGYAPDTSLVYHADMEEEAKELVLRYHSEKLAMA 544

Query: 741 FGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDF 800
           FGLLNTPPGTTIRVVKNLRICGDCH AAKLISLIFGRQI+IRDVQRFHRF+DG+CSCCDF
Sbjct: 545 FGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCCDF 598

Query: 801 W 802
           W
Sbjct: 605 W 598

BLAST of Sgr029061 vs. NCBI nr
Match: KAG6574081.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1073.2 bits (2774), Expect = 1.2e-309
Identity = 522/600 (87.00%), Positives = 553/600 (92.17%), Query Frame = 0

Query: 201 CVDTSGSVFIKCPVLPKGRCKQLQSAAYRLFASAAVSSAMELKQIQAFTIKTNLHNDISV 260
           C+ TS     K    PK   K+  +  + L   +  +S  ELKQIQA+TIKTNLHNDISV
Sbjct: 5   CIPTSQFALTK----PK---KEFINQPHPLSLFSKCTSLRELKQIQAYTIKTNLHNDISV 64

Query: 261 LTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGYARSNTPYLAFSLFAQILC 320
           LTKLINFCT NPTTSSMDHAHHLFD++ DKDIVLFN++ARGYARSN+PYL FSLFAQ+LC
Sbjct: 65  LTKLINFCTRNPTTSSMDHAHHLFDKMLDKDIVLFNIMARGYARSNSPYLVFSLFAQVLC 124

Query: 321 SGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNIYLCPTLINMYAECNDMNA 380
           SGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKLGL HNIY+CPTLINMYA CNDMNA
Sbjct: 125 SGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKLGLGHNIYICPTLINMYAACNDMNA 184

Query: 381 AREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCAL 440
           AR VFDGM+ PCIVSYN+IITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCAL
Sbjct: 185 ARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSIIMSCAL 244

Query: 441 LGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVRDTQAWSAM 500
           LGALD+GRWIHEYVKK GFDK+VKVNTALIDMYAKCGS+ DAISIFEGM VRDTQAWSAM
Sbjct: 245 LGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKCGSIVDAISIFEGMRVRDTQAWSAM 304

Query: 501 IVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHG 560
           IVA+ATHGDGLKAIS+F+EMK+AGVRPDEITFLGLLYACSHAGLV+EG GYFYSM KNHG
Sbjct: 305 IVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLLYACSHAGLVDEGRGYFYSMYKNHG 364

Query: 561 IVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVI 620
           I PGIKHYGCMVDLLGRTGRLDEAYKFID L IKPTPILWRTLLSACSNHGNVDLAK+VI
Sbjct: 365 ITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPTPILWRTLLSACSNHGNVDLAKRVI 424

Query: 621 ERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEF 680
           ERIFELDDSHGGDYVILSNLCAR+GRWEDVN LRKLMKDRGVVKVPGCSSVEVNNVVHEF
Sbjct: 425 ERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKLMKDRGVVKVPGCSSVEVNNVVHEF 484

Query: 681 FSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIA 740
           FSGDGVHSISVEL RALDELIKEIKL GY PDTSLVYHADMEEE KELVLRYHSEKLA+A
Sbjct: 485 FSGDGVHSISVELRRALDELIKEIKLAGYAPDTSLVYHADMEEEAKELVLRYHSEKLAMA 544

Query: 741 FGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDF 800
           FGLLNTPPGTTIRVVKNLRICGDCH AAKLISLIFGRQI+IRDVQRFHRF+DG+CSCCDF
Sbjct: 545 FGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGQCSCCDF 597

BLAST of Sgr029061 vs. NCBI nr
Match: XP_022968061.1 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1072.8 bits (2773), Expect = 1.2e-309
Identity = 517/572 (90.38%), Positives = 543/572 (94.93%), Query Frame = 0

Query: 230 LFASAAVSSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPD 289
           LF+  A  S  ELKQIQA+TIKTNLHNDISVLTKLINFCT  PTTSSMDHAHHLFD++ D
Sbjct: 29  LFSKCA--SLRELKQIQAYTIKTNLHNDISVLTKLINFCTRYPTTSSMDHAHHLFDKMLD 88

Query: 290 KDIVLFNLLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQL 349
           KDIVLFN++ARGYARSN+PYL FSLFAQ+LCSGLLPDDYTFSSLLKACA SKAL+EGRQL
Sbjct: 89  KDIVLFNIMARGYARSNSPYLVFSLFAQVLCSGLLPDDYTFSSLLKACAGSKALEEGRQL 148

Query: 350 HCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQP 409
           HCFAIKLG  HNIY+CPTLINMYA CNDMNAAR VFDGM+ PCIVSYN+IITGYARSSQP
Sbjct: 149 HCFAIKLGFGHNIYICPTLINMYAACNDMNAARGVFDGMEEPCIVSYNAIITGYARSSQP 208

Query: 410 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTAL 469
           NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALD+GRWIHEYVKK GFDK+VKVNTAL
Sbjct: 209 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDLGRWIHEYVKKKGFDKFVKVNTAL 268

Query: 470 IDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDE 529
           IDMYAKCGS+ DAISIFEGM VRDTQAWSAMIVAYATHGDGLKAIS+F+EMK+AGVRPDE
Sbjct: 269 IDMYAKCGSIVDAISIFEGMRVRDTQAWSAMIVAYATHGDGLKAISMFEEMKKAGVRPDE 328

Query: 530 ITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFID 589
           ITFLGLLYACSHAGLVEEG GYFYSM KNHG+ PGIKHYGCMVDLLGRTGRLDEAYKFID
Sbjct: 329 ITFLGLLYACSHAGLVEEGRGYFYSMYKNHGMTPGIKHYGCMVDLLGRTGRLDEAYKFID 388

Query: 590 GLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWED 649
            L IKPTPILWRTLLSACSNHGNVDLAK+VIERIFELDDSHGGDYVILSNLCAR+GRWED
Sbjct: 389 ELAIKPTPILWRTLLSACSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLCARLGRWED 448

Query: 650 VNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGY 709
           VN LRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVEL RALDELI+EIKL GY
Sbjct: 449 VNRLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELRRALDELIQEIKLAGY 508

Query: 710 VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAK 769
           VPDTSLVYHADMEEE KELVLRYHSEKLA+AFGLLNTPPGTTIRVVKNLRICGDCH AAK
Sbjct: 509 VPDTSLVYHADMEEEAKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAK 568

Query: 770 LISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
           LISLIFGRQI+IRDVQRFHRF+DG+CSCCDFW
Sbjct: 569 LISLIFGRQIVIRDVQRFHRFEDGQCSCCDFW 598

BLAST of Sgr029061 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 810.4 bits (2092), Expect = 1.8e-233
Identity = 381/565 (67.43%), Positives = 465/565 (82.30%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           +S  EL QIQA+ IK+++  D+S + KLINFCT +PT SSM +A HLF+ + + DIV+FN
Sbjct: 40  NSLRELMQIQAYAIKSHI-EDVSFVAKLINFCTESPTESSMSYARHLFEAMSEPDIVIFN 99

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
            +ARGY+R   P   FSLF +IL  G+LPD+YTF SLLKACA +KAL+EGRQLHC ++KL
Sbjct: 100 SMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKL 159

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLF 416
           GL+ N+Y+CPTLINMY EC D+++AR VFD +  PC+V YN++ITGYAR ++PNEALSLF
Sbjct: 160 GLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLF 219

Query: 417 RELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKC 476
           RE+Q   L+P ++T+LS++ SCALLG+LD+G+WIH+Y KK+ F KYVKVNTALIDM+AKC
Sbjct: 220 REMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKC 279

Query: 477 GSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLL 536
           GSL DA+SIFE M  +DTQAWSAMIVAYA HG   K++ +F+ M+   V+PDEITFLGLL
Sbjct: 280 GSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLL 339

Query: 537 YACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPT 596
            ACSH G VEEG  YF  M    GIVP IKHYG MVDLL R G L++AY+FID L I PT
Sbjct: 340 NACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPT 399

Query: 597 PILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKL 656
           P+LWR LL+ACS+H N+DLA++V ERIFELDDSHGGDYVILSNL AR  +WE V+ LRK+
Sbjct: 400 PMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKV 459

Query: 657 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLV 716
           MKDR  VKVPGCSS+EVNNVVHEFFSGDGV S + +LHRALDE++KE+KL GYVPDTS+V
Sbjct: 460 MKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMV 519

Query: 717 YHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFG 776
            HA+M ++ KE+ LRYHSEKLAI FGLLNTPPGTTIRVVKNLR+C DCH AAKLISLIFG
Sbjct: 520 VHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFG 579

Query: 777 RQIIIRDVQRFHRFKDGKCSCCDFW 802
           R++++RDVQRFH F+DGKCSC DFW
Sbjct: 580 RKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Sgr029061 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 1.1e-137
Identity = 235/549 (42.81%), Positives = 356/549 (64.85%), Query Frame = 0

Query: 255 HNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGYARSNTPYLAFSL 314
           H D+   T LI       +   +++A  LFD+IP KD+V +N +  GYA +     A  L
Sbjct: 197 HRDVVSYTALIKGYA---SRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 256

Query: 315 FAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNIYLCPTLINMYAE 374
           F  ++ + + PD+ T  +++ ACA S +++ GRQ+H +    G   N+ +   LI++Y++
Sbjct: 257 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 316

Query: 375 CNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSI 434
           C ++  A  +F+ +    ++S+N++I GY   +   EAL LF+E+  S   P DVTMLSI
Sbjct: 317 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 376

Query: 435 IMSCALLGALDIGRWIHEYVKK--NGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVR 494
           + +CA LGA+DIGRWIH Y+ K   G      + T+LIDMYAKCG +  A  +F  +  +
Sbjct: 377 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 436

Query: 495 DTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYF 554
              +W+AMI  +A HG    +  LF  M++ G++PD+ITF+GLL ACSH+G+++ G   F
Sbjct: 437 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 496

Query: 555 YSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGN 614
            +M++++ + P ++HYGCM+DLLG +G   EA + I+ +E++P  ++W +LL AC  HGN
Sbjct: 497 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 556

Query: 615 VDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVE 674
           V+L +   E + +++  + G YV+LSN+ A  GRW +V   R L+ D+G+ KVPGCSS+E
Sbjct: 557 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 616

Query: 675 VNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRY 734
           +++VVHEF  GD  H  + E++  L+E+   ++  G+VPDTS V   +MEEE KE  LR+
Sbjct: 617 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRH 676

Query: 735 HSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKD 794
           HSEKLAIAFGL++T PGT + +VKNLR+C +CH A KLIS I+ R+II RD  RFH F+D
Sbjct: 677 HSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRD 736

Query: 795 GKCSCCDFW 802
           G CSC D+W
Sbjct: 737 GVCSCNDYW 741

BLAST of Sgr029061 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 488.0 bits (1255), Expect = 2.0e-136
Identity = 238/596 (39.93%), Positives = 360/596 (60.40%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           S   ELKQI A  +KT L  D   +TK ++FC  + ++  + +A  +FD     D  L+N
Sbjct: 25  SKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDTFLWN 84

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
           L+ RG++ S+ P  +  L+ ++LCS    + YTF SLLKAC++  A +E  Q+H    KL
Sbjct: 85  LMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKL 144

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPN------ 416
           G  +++Y   +LIN YA   +   A  +FD +  P  VS+NS+I GY ++ + +      
Sbjct: 145 GYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLF 204

Query: 417 -------------------------EALSLFRELQASNLEPTDVTMLSIIMSCALLGALD 476
                                    EAL LF E+Q S++EP +V++ + + +CA LGAL+
Sbjct: 205 RKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGALE 264

Query: 477 IGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYA 536
            G+WIH Y+ K        +   LIDMYAKCG + +A+ +F+ +  +  QAW+A+I  YA
Sbjct: 265 QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYA 324

Query: 537 THGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGI 596
            HG G +AIS F EM++ G++P+ ITF  +L ACS+ GLVEEG   FYSM +++ + P I
Sbjct: 325 YHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPTI 384

Query: 597 KHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFE 656
           +HYGC+VDLLGR G LDEA +FI  + +KP  ++W  LL AC  H N++L +++ E +  
Sbjct: 385 EHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIA 444

Query: 657 LDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDG 716
           +D  HGG YV  +N+ A   +W+     R+LMK++GV KVPGCS++ +    HEF +GD 
Sbjct: 445 IDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGDR 504

Query: 717 VHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLN 776
            H    ++      + ++++  GYVP+   +    ++++ +E ++  HSEKLAI +GL+ 
Sbjct: 505 SHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITYGLIK 564

Query: 777 TPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
           T PGT IR++KNLR+C DCH   KLIS I+ R I++RD  RFH F+DGKCSC D+W
Sbjct: 565 TKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of Sgr029061 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 485.0 bits (1247), Expect = 1.7e-135
Identity = 242/576 (42.01%), Positives = 372/576 (64.58%), Query Frame = 0

Query: 230 LFASAAVSSAMELKQIQAFTIKTNLH-NDISVLTKLINFCTLNPTTSSMDHAHHLFDQIP 289
           L  +  VSS  +L+QI AF+I+  +  +D  +   LI +    P+   M +AH +F +I 
Sbjct: 21  LLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIE 80

Query: 290 DK-DIVLFNLLARGYARSNTPYLAFSLFAQILCSGLL-PDDYTFSSLLKACASSKALKEG 349
              ++ ++N L RGYA       AFSL+ ++  SGL+ PD +T+  L+KA  +   ++ G
Sbjct: 81  KPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLG 140

Query: 350 RQLHCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARS 409
             +H   I+ G    IY+  +L+++YA C D+ +A +VFD M    +V++NS+I G+A +
Sbjct: 141 ETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAEN 200

Query: 410 SQPNEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVN 469
            +P EAL+L+ E+ +  ++P   T++S++ +CA +GAL +G+ +H Y+ K G  + +  +
Sbjct: 201 GKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSS 260

Query: 470 TALIDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRA-GV 529
             L+D+YA+CG + +A ++F+ M  +++ +W+++IV  A +G G +AI LF  M+   G+
Sbjct: 261 NVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGL 320

Query: 530 RPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAY 589
            P EITF+G+LYACSH G+V+EG  YF  M + + I P I+H+GCMVDLL R G++ +AY
Sbjct: 321 LPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAY 380

Query: 590 KFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVG 649
           ++I  + ++P  ++WRTLL AC+ HG+ DLA+    +I +L+ +H GDYV+LSN+ A   
Sbjct: 381 EYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQ 440

Query: 650 RWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIK 709
           RW DV  +RK M   GV KVPG S VEV N VHEF  GD  H  S  ++  L E+   ++
Sbjct: 441 RWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLR 500

Query: 710 LVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCH 769
             GYVP  S VY  D+EEE KE  + YHSEK+AIAF L++TP  + I VVKNLR+C DCH
Sbjct: 501 SEGYVPQISNVY-VDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCH 560

Query: 770 TAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
            A KL+S ++ R+I++RD  RFH FK+G CSC D+W
Sbjct: 561 LAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595
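
The percentages in each HSP summary line follow directly from the identity and positive counts divided by the alignment length. The sketch below is illustrative only and not part of the database record: `parse_hsp_summary` is a hypothetical helper, and the regular expression assumes the summary-line layout shown on this page. It recomputes both percentages for the A8MQA3 hit.

```python
import re

# Hypothetical helper, not part of the database export.
# The pattern tolerates a missing "i" in "Positives", which some report exports drop.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse one HSP summary line and recompute its percentages."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(ident_len),
        # Recomputed values: count / alignment length, as a percentage.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


summary = parse_hsp_summary(
    "Identity = 242/576 (42.01%), Positives = 372/576 (64.58%), Query Frame = 0"
)
print(summary["identity_pct"], summary["positive_pct"])
```

Comparing the recomputed and reported fields is a cheap sanity check when these lines are scraped from a page rather than taken from the original BLAST output.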

BLAST of Sgr029061 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 1.1e-134
Identity = 234/558 (41.94%), Positives = 353/558 (63.26%), Query Frame = 0

Query: 245 IQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGYAR 304
           + A  +++   +DI +   L+N   +     S++ A  +F+++P +D V +  L  GY++
Sbjct: 82  VHAHILQSIFRHDIVMGNTLLN---MYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQ 141

Query: 305 SNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNIYL 364
            + P  A   F Q+L  G  P+++T SS++KA A+ +    G QLH F +K G + N+++
Sbjct: 142 HDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHV 201

Query: 365 CPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQASNL 424
              L+++Y     M+ A+ VFD ++    VS+N++I G+AR S   +AL LF+ +     
Sbjct: 202 GSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGF 261

Query: 425 EPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADAIS 484
            P+  +  S+  +C+  G L+ G+W+H Y+ K+G          L+DMYAK GS+ DA  
Sbjct: 262 RPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARK 321

Query: 485 IFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGL 544
           IF+ +  RD  +W++++ AYA HG G +A+  F+EM+R G+RP+EI+FL +L ACSH+GL
Sbjct: 322 IFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGL 381

Query: 545 VEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLL 604
           ++EG  ++Y + K  GIVP   HY  +VDLLGR G L+ A +FI+ + I+PT  +W+ LL
Sbjct: 382 LDEG-WHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALL 441

Query: 605 SACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVK 664
           +AC  H N +L     E +FELD    G +VIL N+ A  GRW D   +RK MK+ GV K
Sbjct: 442 NACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKK 501

Query: 665 VPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTS-LVYHADMEE 724
            P CS VE+ N +H F + D  H    E+ R  +E++ +IK +GYVPDTS ++ H D +E
Sbjct: 502 EPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVDQQE 561

Query: 725 EGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRD 784
             +E+ L+YHSEK+A+AF LLNTPPG+TI + KN+R+CGDCHTA KL S + GR+II+RD
Sbjct: 562 --REVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRD 621

Query: 785 VQRFHRFKDGKCSCCDFW 802
             RFH FKDG CSC D+W
Sbjct: 622 TNRFHHFKDGNCSCKDYW 633

BLAST of Sgr029061 vs. ExPASy TrEMBL
Match: A0A6J1DA68 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019083 PE=3 SV=1)

HSP 1 Score: 1083.2 bits (2800), Expect = 0.0e+00
Identity = 522/572 (91.26%), Positives = 546/572 (95.45%), Query Frame = 0

Query: 230 LFASAAVSSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPD 289
           LF  +  +S  ELKQIQAFTIKTNL NDISVLTK+INFCTLNP+TSSMDHAHHLFDQIPD
Sbjct: 32  LFLLSKCTSLRELKQIQAFTIKTNLQNDISVLTKIINFCTLNPSTSSMDHAHHLFDQIPD 91

Query: 290 KDIVLFNLLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQL 349
           KDIVLFN++ARGYARSNTPYLAFSLF+Q+LCSGLLPDDYTFSSLLKACASSKA  EGRQL
Sbjct: 92  KDIVLFNIMARGYARSNTPYLAFSLFSQVLCSGLLPDDYTFSSLLKACASSKAFSEGRQL 151

Query: 350 HCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQP 409
           HCFAIKLGLNHNIY+CP+LIN+YAECNDMNAAR VFD M+ PCIVSYN+IITG+ARSSQP
Sbjct: 152 HCFAIKLGLNHNIYICPSLINLYAECNDMNAARGVFDEMEAPCIVSYNAIITGHARSSQP 211

Query: 410 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTAL 469
           NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALD+GRWIHEYVKK GFDK+VKVNTAL
Sbjct: 212 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDLGRWIHEYVKKKGFDKFVKVNTAL 271

Query: 470 IDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDE 529
           IDMYAKCGSL DAISIFE M VRDTQAWSAMIVAYATHGDGLKAIS+F+EMKRAGVRPDE
Sbjct: 272 IDMYAKCGSLVDAISIFEDMRVRDTQAWSAMIVAYATHGDGLKAISMFEEMKRAGVRPDE 331

Query: 530 ITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFID 589
           ITFLGLLYACSHAGLVEEG GYF SMSK +GI PGIKHYGCMVDLLGRTG LDEAYKFID
Sbjct: 332 ITFLGLLYACSHAGLVEEGRGYFNSMSKYYGIAPGIKHYGCMVDLLGRTGHLDEAYKFID 391

Query: 590 GLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWED 649
           G EIKPTPILWRTLLSACSN GNVDLAK+VIERIFELDDSHGGDYVILSNLCARVGRWED
Sbjct: 392 GSEIKPTPILWRTLLSACSNRGNVDLAKRVIERIFELDDSHGGDYVILSNLCARVGRWED 451

Query: 650 VNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGY 709
           VNH+RKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVEL RALDELIKEIKLVGY
Sbjct: 452 VNHIRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELRRALDELIKEIKLVGY 511

Query: 710 VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAK 769
           VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLN+PPGT IRVVKNLRICGDCHTAAK
Sbjct: 512 VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNSPPGTPIRVVKNLRICGDCHTAAK 571

Query: 770 LISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
           LIS IFGRQI+IRDVQRFHRF+DGKCSCCDFW
Sbjct: 572 LISFIFGRQIVIRDVQRFHRFEDGKCSCCDFW 603

BLAST of Sgr029061 vs. ExPASy TrEMBL
Match: A0A6J1HTT7 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111467412 PE=3 SV=1)

HSP 1 Score: 1072.8 bits (2773), Expect = 6.0e-310
Identity = 517/572 (90.38%), Positives = 543/572 (94.93%), Query Frame = 0

Query: 230 LFASAAVSSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPD 289
           LF+  A  S  ELKQIQA+TIKTNLHNDISVLTKLINFCT  PTTSSMDHAHHLFD++ D
Sbjct: 29  LFSKCA--SLRELKQIQAYTIKTNLHNDISVLTKLINFCTRYPTTSSMDHAHHLFDKMLD 88

Query: 290 KDIVLFNLLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQL 349
           KDIVLFN++ARGYARSN+PYL FSLFAQ+LCSGLLPDDYTFSSLLKACA SKAL+EGRQL
Sbjct: 89  KDIVLFNIMARGYARSNSPYLVFSLFAQVLCSGLLPDDYTFSSLLKACAGSKALEEGRQL 148

Query: 350 HCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQP 409
           HCFAIKLG  HNIY+CPTLINMYA CNDMNAAR VFDGM+ PCIVSYN+IITGYARSSQP
Sbjct: 149 HCFAIKLGFGHNIYICPTLINMYAACNDMNAARGVFDGMEEPCIVSYNAIITGYARSSQP 208

Query: 410 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTAL 469
           NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALD+GRWIHEYVKK GFDK+VKVNTAL
Sbjct: 209 NEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDLGRWIHEYVKKKGFDKFVKVNTAL 268

Query: 470 IDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDE 529
           IDMYAKCGS+ DAISIFEGM VRDTQAWSAMIVAYATHGDGLKAIS+F+EMK+AGVRPDE
Sbjct: 269 IDMYAKCGSIVDAISIFEGMRVRDTQAWSAMIVAYATHGDGLKAISMFEEMKKAGVRPDE 328

Query: 530 ITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFID 589
           ITFLGLLYACSHAGLVEEG GYFYSM KNHG+ PGIKHYGCMVDLLGRTGRLDEAYKFID
Sbjct: 329 ITFLGLLYACSHAGLVEEGRGYFYSMYKNHGMTPGIKHYGCMVDLLGRTGRLDEAYKFID 388

Query: 590 GLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWED 649
            L IKPTPILWRTLLSACSNHGNVDLAK+VIERIFELDDSHGGDYVILSNLCAR+GRWED
Sbjct: 389 ELAIKPTPILWRTLLSACSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLCARLGRWED 448

Query: 650 VNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGY 709
           VN LRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVEL RALDELI+EIKL GY
Sbjct: 449 VNRLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELRRALDELIQEIKLAGY 508

Query: 710 VPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAK 769
           VPDTSLVYHADMEEE KELVLRYHSEKLA+AFGLLNTPPGTTIRVVKNLRICGDCH AAK
Sbjct: 509 VPDTSLVYHADMEEEAKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAK 568

Query: 770 LISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
           LISLIFGRQI+IRDVQRFHRF+DG+CSCCDFW
Sbjct: 569 LISLIFGRQIVIRDVQRFHRFEDGQCSCCDFW 598

BLAST of Sgr029061 vs. ExPASy TrEMBL
Match: A0A6J1FZ61 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111449232 PE=3 SV=1)

HSP 1 Score: 1072.0 bits (2771), Expect = 1.2e-309
Identity = 514/565 (90.97%), Positives = 541/565 (95.75%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           +S  ELKQIQA+TIKTNLHNDISVLTKLINFCT +PTTSSMDHAHHLFD++ DKDIVLFN
Sbjct: 34  TSLRELKQIQAYTIKTNLHNDISVLTKLINFCTRSPTTSSMDHAHHLFDKMLDKDIVLFN 93

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
           ++ARGYARSN+PYL FSLFAQ+L SGLLPDDYTFSSLLKACA SKAL+EGRQLHCFAIKL
Sbjct: 94  IMARGYARSNSPYLIFSLFAQVLFSGLLPDDYTFSSLLKACAGSKALEEGRQLHCFAIKL 153

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLF 416
           G  HNIY+CPTLINMYA CNDMNAAR VFDGM+ PCIVSYN+IITGYARSSQPNEALSLF
Sbjct: 154 GFGHNIYICPTLINMYAACNDMNAARGVFDGMEEPCIVSYNAIITGYARSSQPNEALSLF 213

Query: 417 RELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKC 476
           RELQASNLEPTDVTMLSIIMSCALLGALD+GRWIHEYVKK GFDK+VKVNTALIDMYAKC
Sbjct: 214 RELQASNLEPTDVTMLSIIMSCALLGALDLGRWIHEYVKKKGFDKFVKVNTALIDMYAKC 273

Query: 477 GSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLL 536
           GS+ DAISIFEGM VRDTQAWSAMIVA+ATHGDGLKAIS+F+EMK+AGVRPDEITFLGLL
Sbjct: 274 GSIVDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISMFEEMKKAGVRPDEITFLGLL 333

Query: 537 YACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPT 596
           YACSHAGLV+EGTGYFYSM KNHGI PGIKHYGCMVDLLGRTGRLDEAYKFID L IKPT
Sbjct: 334 YACSHAGLVDEGTGYFYSMYKNHGITPGIKHYGCMVDLLGRTGRLDEAYKFIDELAIKPT 393

Query: 597 PILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKL 656
           PILWRTLLSACSNHGNVDLAK+VIERIFELDDSHGGDYVILSNLCAR+GRWEDVN LRKL
Sbjct: 394 PILWRTLLSACSNHGNVDLAKRVIERIFELDDSHGGDYVILSNLCARLGRWEDVNRLRKL 453

Query: 657 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLV 716
           MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVEL RALDELIKEIKL GYVPDTSLV
Sbjct: 454 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELRRALDELIKEIKLAGYVPDTSLV 513

Query: 717 YHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFG 776
           YHADMEEE KELVLRYHSEKLA+AFGLLNTPPGTTIRVVKNLRICGDCH AAKLISLIFG
Sbjct: 514 YHADMEEEAKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFG 573

Query: 777 RQIIIRDVQRFHRFKDGKCSCCDFW 802
           RQI+IRDVQRFHRF+DG+CSCCDFW
Sbjct: 574 RQIVIRDVQRFHRFEDGQCSCCDFW 598

BLAST of Sgr029061 vs. ExPASy TrEMBL
Match: A0A5A7STH8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1932G00400 PE=3 SV=1)

HSP 1 Score: 1048.1 bits (2709), Expect = 1.8e-302
Identity = 498/565 (88.14%), Positives = 538/565 (95.22%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           +S  ELKQIQA+TIKTNL +DISVLTKLINFCTLNPTTS MDHAHHLFDQI DKDI+LFN
Sbjct: 40  TSLKELKQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFN 99

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
           ++ARGYARSN+PYLAFSLFAQ+LCSGLLPDDYTFSSLLKACASSKAL++G  LHCFA+KL
Sbjct: 100 IMARGYARSNSPYLAFSLFAQLLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKL 159

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLF 416
           GLNHNIY+CPTLINMYAECNDMNAAR VFD M+ PCIVSYN+IITGYARSSQPNEALSLF
Sbjct: 160 GLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLF 219

Query: 417 RELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKC 476
           RELQAS++EPTDVTMLS+IMSCALLGALD+G+WIHEYVKK GFDKYVKVNTALIDM+AKC
Sbjct: 220 RELQASDIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKC 279

Query: 477 GSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLL 536
           GSL DAISIFEGM VRDTQAWSAMIVA+ATHGDGLK+IS+F+EMKRAGVRPDEITFLGLL
Sbjct: 280 GSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLL 339

Query: 537 YACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPT 596
           YACSHAGLVE+G GYFYSMS+ +GI PGIKHYGCMVDLLGRTG LDEAY F+D LEIKPT
Sbjct: 340 YACSHAGLVEQGRGYFYSMSRTYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPT 399

Query: 597 PILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKL 656
           PILWRTLLSACS HGNV++AK+VIERIFELDDSHGGDYVILSNL ARVGRWEDVNHLRKL
Sbjct: 400 PILWRTLLSACSTHGNVEMAKRVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKL 459

Query: 657 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLV 716
           MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVH +SVEL RALDEL+KEIKLVGY+PDTSLV
Sbjct: 460 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLV 519

Query: 717 YHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFG 776
           YHADM+EEGKELVLRYHSEKLA+AFGLLNTPPGTTIRV KNLRICGDCH AAKLIS IFG
Sbjct: 520 YHADMDEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFG 579

Query: 777 RQIIIRDVQRFHRFKDGKCSCCDFW 802
           R+I+IRDVQRFH+F+DGKCSC DFW
Sbjct: 580 RKIVIRDVQRFHQFEDGKCSCGDFW 604

BLAST of Sgr029061 vs. ExPASy TrEMBL
Match: A0A1S3BFK0 (pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103489124 PE=3 SV=1)

HSP 1 Score: 1048.1 bits (2709), Expect = 1.8e-302
Identity = 498/565 (88.14%), Positives = 538/565 (95.22%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           +S  ELKQIQA+TIKTNL +DISVLTKLINFCTLNPTTS MDHAHHLFDQI DKDI+LFN
Sbjct: 40  TSLKELKQIQAYTIKTNLQSDISVLTKLINFCTLNPTTSYMDHAHHLFDQILDKDIILFN 99

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
           ++ARGYARSN+PYLAFSLFAQ+LCSGLLPDDYTFSSLLKACASSKAL++G  LHCFA+KL
Sbjct: 100 IMARGYARSNSPYLAFSLFAQLLCSGLLPDDYTFSSLLKACASSKALRQGMGLHCFAVKL 159

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLF 416
           GLNHNIY+CPTLINMYAECNDMNAAR VFD M+ PCIVSYN+IITGYARSSQPNEALSLF
Sbjct: 160 GLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLF 219

Query: 417 RELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKC 476
           RELQAS++EPTDVTMLS+IMSCALLGALD+G+WIHEYVKK GFDKYVKVNTALIDM+AKC
Sbjct: 220 RELQASDIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKKKGFDKYVKVNTALIDMFAKC 279

Query: 477 GSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLL 536
           GSL DAISIFEGM VRDTQAWSAMIVA+ATHGDGLK+IS+F+EMKRAGVRPDEITFLGLL
Sbjct: 280 GSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKSISIFEEMKRAGVRPDEITFLGLL 339

Query: 537 YACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPT 596
           YACSHAGLVE+G GYFYSMS+ +GI PGIKHYGCMVDLLGRTG LDEAY F+D LEIKPT
Sbjct: 340 YACSHAGLVEQGRGYFYSMSRTYGITPGIKHYGCMVDLLGRTGCLDEAYNFVDELEIKPT 399

Query: 597 PILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKL 656
           PILWRTLLSACS HGNV++AK+VIERIFELDDSHGGDYVILSNL ARVGRWEDVNHLRKL
Sbjct: 400 PILWRTLLSACSTHGNVEMAKRVIERIFELDDSHGGDYVILSNLYARVGRWEDVNHLRKL 459

Query: 657 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLV 716
           MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVH +SVEL RALDEL+KEIKLVGY+PDTSLV
Sbjct: 460 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRALDELMKEIKLVGYIPDTSLV 519

Query: 717 YHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFG 776
           YHADM+EEGKELVLRYHSEKLA+AFGLLNTPPGTTIRV KNLRICGDCH AAKLIS IFG
Sbjct: 520 YHADMDEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAKNLRICGDCHNAAKLISFIFG 579

Query: 777 RQIIIRDVQRFHRFKDGKCSCCDFW 802
           R+I+IRDVQRFH+F+DGKCSC DFW
Sbjct: 580 RKIVIRDVQRFHQFEDGKCSCGDFW 604

BLAST of Sgr029061 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 810.4 bits (2092), Expect = 1.2e-234
Identity = 381/565 (67.43%), Positives = 465/565 (82.30%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           +S  EL QIQA+ IK+++  D+S + KLINFCT +PT SSM +A HLF+ + + DIV+FN
Sbjct: 40  NSLRELMQIQAYAIKSHI-EDVSFVAKLINFCTESPTESSMSYARHLFEAMSEPDIVIFN 99

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
            +ARGY+R   P   FSLF +IL  G+LPD+YTF SLLKACA +KAL+EGRQLHC ++KL
Sbjct: 100 SMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKL 159

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLF 416
           GL+ N+Y+CPTLINMY EC D+++AR VFD +  PC+V YN++ITGYAR ++PNEALSLF
Sbjct: 160 GLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLF 219

Query: 417 RELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKC 476
           RE+Q   L+P ++T+LS++ SCALLG+LD+G+WIH+Y KK+ F KYVKVNTALIDM+AKC
Sbjct: 220 REMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKC 279

Query: 477 GSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLL 536
           GSL DA+SIFE M  +DTQAWSAMIVAYA HG   K++ +F+ M+   V+PDEITFLGLL
Sbjct: 280 GSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLL 339

Query: 537 YACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPT 596
            ACSH G VEEG  YF  M    GIVP IKHYG MVDLL R G L++AY+FID L I PT
Sbjct: 340 NACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPT 399

Query: 597 PILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKL 656
           P+LWR LL+ACS+H N+DLA++V ERIFELDDSHGGDYVILSNL AR  +WE V+ LRK+
Sbjct: 400 PMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKV 459

Query: 657 MKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLV 716
           MKDR  VKVPGCSS+EVNNVVHEFFSGDGV S + +LHRALDE++KE+KL GYVPDTS+V
Sbjct: 460 MKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMV 519

Query: 717 YHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFG 776
            HA+M ++ KE+ LRYHSEKLAI FGLLNTPPGTTIRVVKNLR+C DCH AAKLISLIFG
Sbjct: 520 VHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFG 579

Query: 777 RQIIIRDVQRFHRFKDGKCSCCDFW 802
           R++++RDVQRFH F+DGKCSC DFW
Sbjct: 580 RKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Sgr029061 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 492.3 bits (1266), Expect = 7.5e-139
Identity = 235/549 (42.81%), Positives = 356/549 (64.85%), Query Frame = 0

Query: 255 HNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGYARSNTPYLAFSL 314
           H D+   T LI       +   +++A  LFD+IP KD+V +N +  GYA +     A  L
Sbjct: 197 HRDVVSYTALIKGYA---SRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALEL 256

Query: 315 FAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNIYLCPTLINMYAE 374
           F  ++ + + PD+ T  +++ ACA S +++ GRQ+H +    G   N+ +   LI++Y++
Sbjct: 257 FKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSK 316

Query: 375 CNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSI 434
           C ++  A  +F+ +    ++S+N++I GY   +   EAL LF+E+  S   P DVTMLSI
Sbjct: 317 CGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSI 376

Query: 435 IMSCALLGALDIGRWIHEYVKK--NGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVR 494
           + +CA LGA+DIGRWIH Y+ K   G      + T+LIDMYAKCG +  A  +F  +  +
Sbjct: 377 LPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHK 436

Query: 495 DTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYF 554
              +W+AMI  +A HG    +  LF  M++ G++PD+ITF+GLL ACSH+G+++ G   F
Sbjct: 437 SLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIF 496

Query: 555 YSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGN 614
            +M++++ + P ++HYGCM+DLLG +G   EA + I+ +E++P  ++W +LL AC  HGN
Sbjct: 497 RTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGN 556

Query: 615 VDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVE 674
           V+L +   E + +++  + G YV+LSN+ A  GRW +V   R L+ D+G+ KVPGCSS+E
Sbjct: 557 VELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIE 616

Query: 675 VNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRY 734
           +++VVHEF  GD  H  + E++  L+E+   ++  G+VPDTS V   +MEEE KE  LR+
Sbjct: 617 IDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRH 676

Query: 735 HSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKD 794
           HSEKLAIAFGL++T PGT + +VKNLR+C +CH A KLIS I+ R+II RD  RFH F+D
Sbjct: 677 HSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRD 736

Query: 795 GKCSCCDFW 802
           G CSC D+W
Sbjct: 737 GVCSCNDYW 741

BLAST of Sgr029061 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 488.0 bits (1255), Expect = 1.4e-137
Identity = 238/596 (39.93%), Positives = 360/596 (60.40%), Query Frame = 0

Query: 237 SSAMELKQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFN 296
           S   ELKQI A  +KT L  D   +TK ++FC  + ++  + +A  +FD     D  L+N
Sbjct: 25  SKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDTFLWN 84

Query: 297 LLARGYARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKL 356
           L+ RG++ S+ P  +  L+ ++LCS    + YTF SLLKAC++  A +E  Q+H    KL
Sbjct: 85  LMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKL 144

Query: 357 GLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPN------ 416
           G  +++Y   +LIN YA   +   A  +FD +  P  VS+NS+I GY ++ + +      
Sbjct: 145 GYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLF 204

Query: 417 -------------------------EALSLFRELQASNLEPTDVTMLSIIMSCALLGALD 476
                                    EAL LF E+Q S++EP +V++ + + +CA LGAL+
Sbjct: 205 RKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGALE 264

Query: 477 IGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYA 536
            G+WIH Y+ K        +   LIDMYAKCG + +A+ +F+ +  +  QAW+A+I  YA
Sbjct: 265 QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYA 324

Query: 537 THGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGI 596
            HG G +AIS F EM++ G++P+ ITF  +L ACS+ GLVEEG   FYSM +++ + P I
Sbjct: 325 YHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPTI 384

Query: 597 KHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFE 656
           +HYGC+VDLLGR G LDEA +FI  + +KP  ++W  LL AC  H N++L +++ E +  
Sbjct: 385 EHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIA 444

Query: 657 LDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDG 716
           +D  HGG YV  +N+ A   +W+     R+LMK++GV KVPGCS++ +    HEF +GD 
Sbjct: 445 IDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGDR 504

Query: 717 VHSISVELHRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLN 776
            H    ++      + ++++  GYVP+   +    ++++ +E ++  HSEKLAI +GL+ 
Sbjct: 505 SHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITYGLIK 564

Query: 777 TPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
           T PGT IR++KNLR+C DCH   KLIS I+ R I++RD  RFH F+DGKCSC D+W
Sbjct: 565 TKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of Sgr029061 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 485.0 bits (1247), Expect = 1.2e-136
Identity = 242/576 (42.01%), Positives = 372/576 (64.58%), Query Frame = 0

Query: 230 LFASAAVSSAMELKQIQAFTIKTNLH-NDISVLTKLINFCTLNPTTSSMDHAHHLFDQIP 289
           L  +  VSS  +L+QI AF+I+  +  +D  +   LI +    P+   M +AH +F +I 
Sbjct: 21  LLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIE 80

Query: 290 DK-DIVLFNLLARGYARSNTPYLAFSLFAQILCSGLL-PDDYTFSSLLKACASSKALKEG 349
              ++ ++N L RGYA       AFSL+ ++  SGL+ PD +T+  L+KA  +   ++ G
Sbjct: 81  KPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLG 140

Query: 350 RQLHCFAIKLGLNHNIYLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARS 409
             +H   I+ G    IY+  +L+++YA C D+ +A +VFD M    +V++NS+I G+A +
Sbjct: 141 ETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAEN 200

Query: 410 SQPNEALSLFRELQASNLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVN 469
            +P EAL+L+ E+ +  ++P   T++S++ +CA +GAL +G+ +H Y+ K G  + +  +
Sbjct: 201 GKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSS 260

Query: 470 TALIDMYAKCGSLADAISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRA-GV 529
             L+D+YA+CG + +A ++F+ M  +++ +W+++IV  A +G G +AI LF  M+   G+
Sbjct: 261 NVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGL 320

Query: 530 RPDEITFLGLLYACSHAGLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAY 589
            P EITF+G+LYACSH G+V+EG  YF  M + + I P I+H+GCMVDLL R G++ +AY
Sbjct: 321 LPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAY 380

Query: 590 KFIDGLEIKPTPILWRTLLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVG 649
           ++I  + ++P  ++WRTLL AC+ HG+ DLA+    +I +L+ +H GDYV+LSN+ A   
Sbjct: 381 EYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQ 440

Query: 650 RWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIK 709
           RW DV  +RK M   GV KVPG S VEV N VHEF  GD  H  S  ++  L E+   ++
Sbjct: 441 RWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLR 500

Query: 710 LVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCH 769
             GYVP  S VY  D+EEE KE  + YHSEK+AIAF L++TP  + I VVKNLR+C DCH
Sbjct: 501 SEGYVPQISNVY-VDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCH 560

Query: 770 TAAKLISLIFGRQIIIRDVQRFHRFKDGKCSCCDFW 802
            A KL+S ++ R+I++RD  RFH FK+G CSC D+W
Sbjct: 561 LAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Sgr029061 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 478.4 bits (1230), Expect = 1.1e-134
Identity = 232/559 (41.50%), Positives = 355/559 (63.51%), Query Frame = 0

Query: 243 KQIQAFTIKTNLHNDISVLTKLINFCTLNPTTSSMDHAHHLFDQIPDKDIVLFNLLARGY 302
           K+I  + +++   + +++ T L++   +     S++ A  LFD + ++++V +N +   Y
Sbjct: 256 KEIHGYAMRSGFDSLVNISTALVD---MYAKCGSLETARQLFDGMLERNVVSWNSMIDAY 315

Query: 303 ARSNTPYLAFSLFAQILCSGLLPDDYTFSSLLKACASSKALKEGRQLHCFAIKLGLNHNI 362
            ++  P  A  +F ++L  G+ P D +    L ACA    L+ GR +H  +++LGL+ N+
Sbjct: 316 VQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNV 375

Query: 363 YLCPTLINMYAECNDMNAAREVFDGMDGPCIVSYNSIITGYARSSQPNEALSLFRELQAS 422
            +  +LI+MY +C +++ A  +F  +    +VS+N++I G+A++ +P +AL+ F ++++ 
Sbjct: 376 SVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSR 435

Query: 423 NLEPTDVTMLSIIMSCALLGALDIGRWIHEYVKKNGFDKYVKVNTALIDMYAKCGSLADA 482
            ++P   T +S+I + A L      +WIH  V ++  DK V V TAL+DMYAKCG++  A
Sbjct: 436 TVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIA 495

Query: 483 ISIFEGMHVRDTQAWSAMIVAYATHGDGLKAISLFDEMKRAGVRPDEITFLGLLYACSHA 542
             IF+ M  R    W+AMI  Y THG G  A+ LF+EM++  ++P+ +TFL ++ ACSH+
Sbjct: 496 RLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHS 555

Query: 543 GLVEEGTGYFYSMSKNHGIVPGIKHYGCMVDLLGRTGRLDEAYKFIDGLEIKPTPILWRT 602
           GLVE G   FY M +N+ I   + HYG MVDLLGR GRL+EA+ FI  + +KP   ++  
Sbjct: 556 GLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGA 615

Query: 603 LLSACSNHGNVDLAKQVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGV 662
           +L AC  H NV+ A++  ER+FEL+   GG +V+L+N+      WE V  +R  M  +G+
Sbjct: 616 MLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGL 675

Query: 663 VKVPGCSSVEVNNVVHEFFSGDGVHSISVELHRALDELIKEIKLVGYVPDTSLVYHADME 722
            K PGCS VE+ N VH FFSG   H  S +++  L++LI  IK  GYVPDT+LV    +E
Sbjct: 676 RKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV--LGVE 735

Query: 723 EEGKELVLRYHSEKLAIAFGLLNTPPGTTIRVVKNLRICGDCHTAAKLISLIFGRQIIIR 782
            + KE +L  HSEKLAI+FGLLNT  GTTI V KNLR+C DCH A K ISL+ GR+I++R
Sbjct: 736 NDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVR 795

Query: 783 DVQRFHRFKDGKCSCCDFW 802
           D+QRFH FK+G CSC D+W
Sbjct: 796 DMQRFHHFKNGACSCGDYW 809

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022151060.1 | 0.0e+00 | 91.26 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 ...
XP_023541252.1 | 0.0e+00 | 91.50 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita ...
KAG7013140.1 | 6.2e-310 | 86.86 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
KAG6574081.1 | 1.2e-309 | 87.00 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_022968061.1 | 1.2e-309 | 90.38 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic [Cucurbita ...

Match Name | E-value | Identity | Description
Q8LK93 | 1.8e-233 | 67.43 | Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
Q9LN01 | 1.1e-137 | 42.81 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9FJY7 | 2.0e-136 | 39.93 | Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
A8MQA3 | 1.7e-135 | 42.01 | Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...
Q9LIQ7 | 1.1e-134 | 41.94 | Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...

Match Name | E-value | Identity | Description
A0A6J1DA68 | 0.0e+00 | 91.26 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic isoform X1 ...
A0A6J1HTT7 | 6.0e-310 | 90.38 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbit...
A0A6J1FZ61 | 1.2e-309 | 90.97 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucurbit...
A0A5A7STH8 | 1.8e-302 | 88.14 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BFK0 | 1.8e-302 | 88.14 | pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Cucumis ...

Match Name | E-value | Identity | Description
AT2G02980.1 | 1.2e-234 | 67.43 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 7.5e-139 | 42.81 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G66520.1 | 1.4e-137 | 39.93 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1 | 1.2e-136 | 42.01 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1 | 1.1e-134 | 41.50 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 666..791, e-value: 2.3E-37, score: 127.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 444..556, e-value: 1.1E-24, score: 89.5; coord: 557..707, e-value: 6.9E-15, score: 57.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 233..348, e-value: 6.7E-14, score: 53.6; coord: 349..443, e-value: 2.2E-19, score: 71.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 343..490
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 634..662, e-value: 1.3, score: 9.5; coord: 599..623, e-value: 0.16, score: 12.3; coord: 568..589, e-value: 0.45, score: 10.9; coord: 367..388, e-value: 1.0, score: 9.7; coord: 467..491, e-value: 0.019, score: 15.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 391..438, e-value: 1.0E-9, score: 38.5; coord: 290..338, e-value: 8.9E-8, score: 32.2; coord: 493..539, e-value: 4.0E-10, score: 39.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 394..427, e-value: 2.4E-7, score: 28.5; coord: 496..529, e-value: 6.4E-7, score: 27.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 291..325, score: 9.141782
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 392..426, score: 11.684803
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 493..527, score: 11.925952
IPR036430 | Ribonuclease T2-like superfamily | GENE3D | 3.90.730.10 |  | coord: 26..232, e-value: 4.5E-72, score: 244.5
IPR036430 | Ribonuclease T2-like superfamily | SUPERFAMILY | 55895 | Ribonuclease Rh-like | coord: 27..221
IPR001568 | Ribonuclease T2-like | PFAM | PF00445 | Ribonuclease_T2 | coord: 35..212, e-value: 3.3E-53, score: 180.6
None | No IPR available | PANTHER | PTHR47926:SF132 | BNAA02G26650D PROTEIN | coord: 229..787
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 229..787
IPR018188 | Ribonuclease T2, His active site 1 | PROSITE | PS00530 | RNASE_T2_1 | coord: 61..68
IPR033130 | Ribonuclease T2, His active site 2 | PROSITE | PS00531 | RNASE_T2_2 | coord: 114..125
IPR033697 | Ribonuclease T2, eukaryotic | CDD | cd01061 | RNase_T2_euk | coord: 32..221, e-value: 2.07669E-76, score: 243.779
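
The `coord:` entries in the InterPro table are inclusive start..end positions on the protein. The sketch below is illustrative only: `parse_coord`, `span_length`, and `overlaps` are hypothetical helpers, and the three coordinate strings are copied from the table above. It shows one way to turn these entries into ranges and compare where the hits fall.

```python
import re

# Hypothetical helpers, not part of the database export.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' entry."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Number of residues covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive spans share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


rnase_t2 = parse_coord("coord: 35..212")   # PFAM PF00445, Ribonuclease_T2
dyw = parse_coord("coord: 666..791")       # PFAM PF14432, DYW_deaminase
panther = parse_coord("coord: 229..787")   # PANTHER PTHR47926

print(span_length(rnase_t2), span_length(dyw))
print(overlaps(rnase_t2, panther), overlaps(dyw, panther))
```

With these inputs, the Ribonuclease_T2 hit does not overlap the PANTHER PTHR47926 span, while the DYW_deaminase hit does.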

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr029061.1 | Sgr029061.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:1900865 | chloroplast RNA modification
biological_process | GO:0031425 | chloroplast RNA processing
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0033897 | ribonuclease T2 activity
molecular_function | GO:0003723 | RNA binding
molecular_function | GO:0008270 | zinc ion binding
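
For downstream use, the GO assignments can be grouped by category. The sketch below is illustrative only: `group_go_terms` is a hypothetical helper, and it assumes each row is laid out as category, accession, then term name separated by whitespace, with the rows copied from the list above.

```python
from collections import defaultdict

# GO rows copied from this record: category, accession, term name.
GO_ROWS = """\
biological_process GO:1900865 chloroplast RNA modification
biological_process GO:0031425 chloroplast RNA processing
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0033897 ribonuclease T2 activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two whitespace runs; the rest is the term name.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, len(terms))
```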