Moc03g30900 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g30900
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr3: 21976076 .. 21982496 (+)
RNA-Seq ExpressionMoc03g30900
SyntenyMoc03g30900
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCTCTCACCACATCTGTCCCTGCCGAATCATTTTCGAACTTCCTCGGCGTTTCAGAAACGGTTCTTAGAGTCTCACCAATTTCCATTTCTCTCCAAACTTTCCAGTTTTCATGTTCGCCGTAGGTTTCTCTCAGATGACTGGAGCTTCTCCTCCAACGTGGGGAAGTGCAGAGCGAAGCCGAAGGATCTAGTTCTTGGAAATCCGTCGGTAATTGTTGAGAAGGGCAAATACAGCTATGACGTAGAAACCCTAATCAACAAACTTAGTAGCCTCCCGCCACGAGGCAGCATAGCTCGCTGTCTCGATATCTTTAAGAACAGGCTCTCGCTCAATGACTTCAGCTTCGTTTTCAAGGAATTCGCGGCGCGCGGGGATTGGCAACGGTCGTTACGCCTCTTCAAGTATATGCAGCGTCAGATATGGTGCAAGCCGAACGAGCACATCTATACCATCATGATCAGCTTGCTCGGCCGCGAAGGATTGCTAGAGAAATGTAGCGAGATATTCGATGAAATGGCGAGCCAAGGCGTGATACGTAGTGTGTTTTCTTACACCGCTTTGATAAATGCCTACGGGCGCAATGGTCAGTATGAAACCTCACTTGAACTTATTGAAAGGATGAAGAGAGAGAGAGTGTCACCTAATATATTGACTTACAATACCGTGATAAATGCCTGTGCTAGAGGTGATTTGGATTGGGAGGGATTGTTGGGATTGTTCGCCGAGATGAGGCATGAAGGGGTTCAACCTGATCTTGTTACTTATAATACTTTGCTTAGTGCTTGTGCTGCCCGGGGTCTAGGTGACGAGGCAGAGATGGTCTTCAAGACTATGATAGAGGGAGGGATAGTCCCTGAGATAACAACATATAGTTATATTGTTGAAACATTTGGAAAATTGGGTAAGCTTGAGAAAGTTGCTATGCTGCTAAAAGAGATGGAGTCCGAAGGGTATTTGCCTGATATATCATCCTACAATGTGTTAATAGAGGCACATGCAAAATTGGGGTCGATTAAGGAGGCAATGGATGTGTTTAAGCAGATGCAAGCAGCAGGATGTGTGCCAAATGCATCCACTTACAGTATTCTGTTAAATTTATATGGGAAGCATGGGAGGTATGATGATGTTCGGGAGCTTTTTCTTGAAATGAAAGAGAGCAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGGGAGGGTGGATACTTCAAAGAGGTTGTAACTCTGTTTCATGATTTGGTGGAGGAGAAGATTGACCCAAATATGGAGACATATGAGGGGTTAGTATTTGCTTGTGGAAAGGGAGGGTTGCACGAGGATGCCAAGAAAATTTTACTTCACATGAATGAGAAAGGAATAGTCCCAAGTTCCAAAGCATACAGTGGACTGATTGAAGCATACGGACTGGCTGCATTGTATGATGAAGCTGTCGTTGCGTTTAACACTATGAATGAAGTGGGAAGCAAGTCAACTGTCGATACCTACAATTCACTAATTCACACATTTGCTAGGGGCGGACTCTACAAGGAGTTTGAAGCAATCTTATTGAGAATGAGAGAATCCGGCATTTCAAGGAATGTGAAGTCATTTAGTGGTATTATTGAAGGTTATAGGCAAAGTGGTCAGTTTGAAGAAGCTATAAAGGCCTTTGTTGAAATGGAAAAGATGAGATGTGAACCTGATGAGCAAACCCTGGAGGCAGTTTTAGGTGTTTATTGCTTTGCAGGTCTTGTTGATGAGAGCAAGGAGCAGTTTCATGAAATTAAAGCTACAGGAATATTGCCTAGTGTGTTGTGCTACTGCATGATGCTTGCGGTGTATGCCAAGAATGGCAGGTAATGGTTACTATATGTATATATATATTTGTTAATGAAACTTATGTTTATTGTCAACTTATGTTTTCTTAATTTTATTTCCAGTTAATATAACCTGTACATGCTACTTTTCGTCTAAAAACAAATTGTGTTCAAGATTCAGTCTAATTAGAACTTTATTCTATCTTGAAATCTGTCAGGGCATTGGGGGTTAGTTAGTTAGAGTATGGGTAGTTATATGAGATGGATGGAGCTTATTAGAGTTCTTCATAGGGAATTGTTAATGGCAAAGGAAAACAGTTATAAATAAGAGAAGAGGGCAGTGAGGGGAGGTGTACTGAAGAGGAGATGTTCAAGTTTCTCAAATTAAACGTTTATATTTGAATTTTGATATAGATAACATTTATTATTTCTTCTCTCTCTAGAAATTATAGCTTTTATATGATATTGAGAACTTGGAGATTTGTCCACACCAAGTCAGCTTTGGTGTATCACGTTCATGTATGACATATTGACATGTAAAATTACCATATGAGACTACCAATTCTTCGATGGGATACATTAAGTGAATGAAAGACGAGCTAATCTATTAACAGGGTGTAGTGTCTTCTTGTCAACTGGGAAGATATGGGAGGTTGAAGTAAGGTCTTGATGAGAAGATCGTTCTCTATTCAAGTTGTCAACTTTGACTTTATTGCATAGCATAAATAGATCTGCAGGAGTGGCTTGAAGTTGAACGATTCATTCTTCATCTGCCAAGCCATGAACTTTCTGCGGCTGATTGCCTTAGTAGAATGCTATTAAGAAACTTAATGTAGGGATCTTGGAGAGTTCCAACGTGGTAGGAAACTTCATCATTATCAATCATCTACAGTTCGTGGATAGACTATCATCTTCTCCTCCTCAATAGAAAAATTTGGCAATCCTTTCAAAATCATATATCTTCTTGAGGTATCCTTGGGACTGGATGTCAACCTCTCAAAATTGAAATTGGCAGGCATCAACTATACTATCAATGTATTTAAATAATTGGAAGATGCCTGGGGCTGTAGAGTAGACAAATAGCTCTGCTTTTTTTATTCTCTCTCTGAATGGAACAAAATAAAACATCAACTCCCGGAGCACATTATTGACAAAATTCAGGGTAAACTATAATGTGGGAAATGAAACCTTATTTCCAAGGATGGCGGTCTTACTTTAGTTCAAGCTGCTCTTTCAAAATTTGCCTACATATTATATGTCTACTTTCTGGATGCCTAATAAGGTAACTCTCATTATAAAAATGCCGCTAGGGATTTCATTTGGGATTGGCATAATTTTGAATGAAGTTTTCATCTCGTGGGATGGAACAAGGTTACACTTCCTACACTCTCGGGGTCTGGGCTTTGGTGACATATATGGAACGTCTCTCCCTCCTTCTCCTAAATGGTTGTGGAGATTTGTCTTAGAGAAAGCAGCCCTTTGGGGAAGGCGGCGGTGGTGTACTGTATGGAATCTCCAAGGAGGATTGGTTGCGCTCTGCTCCTGAATCCTCCAAGGGACTCCAGAATGGCATTTCTAATCTGAAAGAGTTGTTGGCCTCAAGAGTAAAAATTATGGTGGGGAGTGGAGAAATACCAAATTCTGGGAAGACCATTGGAAAGATACAGAAACTCTCTCTACACTCTTCCTTGCACTTTATGTTATCACCCATAGCAATGGCCTCTACATGATCCCAGTGTGATGACTCGGAAGTCGGAGAATGGATCTTCCACATGGACATCCTCACCGAATTTTATTCCAGGCTGGTTGAATGAAAATGAAGATGGCTGTGTTTCACCTATTCTTTCATTTATAAATGTTGACTTGAAGACTTCACCTAGAGTACTTAATTTCTTAACTGAAGCCTTTGGGAATGGAGGATAAATACCAAAGGTAAAATTCAAAGGAAAGTCCAAACCTCTGAATTTCCCCCACTTCGGTTTTATGTGAAAAGCCGACAAGAAGACACAATAATCTGTTCCTCTTCCCCATGACCTTCCGACAATGTAACTACATTTTGGTAATTTTAACTGGCCTGCTCCTTTTCTCAGCAACATTACCGGCCTGCTGTACTTCCTCTTTGGAGTACATCTATGGATCTTTTGGAATTAAAGGAACCAAAAGGCTTCCTTAGATGAAGACGAACCTTTCTCCTTTTTGCTGGATAAGAATACTTTTGTGGTTCCCACTTTGTGTAAACTTGACAATTCTTTTTGTAATTGTAATTCAGCCCCTGTTTGTGATAATTGGAAGTTTTTTTGGGATTGAGGCCTTCTTCTCTCTCCTTTATAGGCCCATTTTATCAAACAGTTTGTTATAAAAGAAAAAGTACGAGCTTTCTGCATGGTTGCATTTGTATTGGTATTACTGGTCGACAGGTCCTTTTAATTTTTGATGTTGCAGCCTAACAAGTTCCAAAGTTTAGATAAACATCCTTTGCTTTCCTTGTTGCTATCAAGGTTGCAATGTCCTCAAAATTATCTGATTTACCTCTAAAACATTCTTCTTCCTTGAAATCTCTCATGTCGTGCAACATGTTCGTGACCTTGGCATATTTATTTGCCTCGTGTAGAAAATAAATTCACTACATGAACCATGAAGTCATTATGGATGTAGTGAAGAGTTTGTCACTTGTCAAGTTGTCATTAGTTTGTTTACTTTTACCTAGTAAGAACATCTTTCTTTTTTCTGTGATGAAATTTTGTTTGATAATTTCCCCAGCAAATTATCACTAACGGTCCTAAAGTTCTCATTTCATAAAATTTAATTTTCGAACCTTGAATCTTCTGATTGATTGCTTCACAAATACTTTATTTTAATGATCTGTCTTACATCTCAGAGTCCATTCTTTCCAGGTGGGATAATGCCTATGAGTTACTCGATGAGATGATTACGAACAGGGTATCTAGTATTCATCAAGTCATTGGACAGATGATCAAGGGAGATTACGATGATGATTCTAATTGGCAGATGGTTGAGTATGTTTTTGACAAACTTAACGCTGAAGGGTGTGGATTGGGAACAAGATTTTACAATACACTCTTAGACGCACTTTGGTGGCTTGGCCAGAAAGGCCGTGCAGCAAGGGTTCTCATTGAAGCAACAAAACGGGGCCTTTTCCCTGAACTTTTCCGCAAAAGCAAACTAGTGTGGTCTGTGGATGTGCACAGGTATGATAGTAATCTCCCTTTTCCATGCTGGACTGGCTGATATCTGAGCTTTAGTTTATATAATCCGTATATCTGGTTTTTCACAGAATGTGGGAAGGTGGTGCATATACAGCAATATCACTTTGGGTTAACACTATGAACAAAATGATCAAGGATGAGGAGGATCTCCCTCAGCTTGCAGCTGTTGTAGTTGGGTAAGGTTCAGTTATCTTGCAATCAATTTGATCAAAGTTAATTTTTTTGAATTAAAGTGTCATGCATGGTGGAGGCATATATGATAAGAATGATAAAAGGCTCTCATAATCCAGCTGAAGCAAATCTTCTCATTTTGGGCTTCTTGAACTTGACTAATTCTGTTAGCCTAAAAAACGCACATAGCTAATAGGCTGGTTGAAAAATGCTCTCTTTAGTTTGGCATCCCAAATTTCACATAGCTTTGAAACGTGCAGTAAATCTTCATTATTTTGCTAATTTCTAGACTATATGTAAGATAATAGTGTTACAACGTTTATGTGCAGGCACTTATGACTTATGGGCTGCAATATTTTGTGTGTGCATGCATGCATGTTATCATTATTTTCTTCAGGTCTAGCTCTGTGTTGTTTGCGCCTTTTATTTATGTATTTGCTTTCCGCACTACAATTAGAAAAAAAACTGGTCACCATAAGGCCAAACTATGTAATTAGACAGTACGTATTCTAGACGTGTGGATGAGCATGTGCTGTAGAGAGGTTATATGACCAATTAAAGTAGATTAAGTTTAAGTAGTTCTTTTCAAAACGTGGGCTGAATACTATCGATTTTAAATAAAAGCCATCAAATGAAAATTACAGGGTAAGGGTGAAGGGAATGGAAGCCAAACCATGTATTTCTGACCAGTAGTTACTGCATTTCTTTAGCAGAAAATTTGTTGATTTTTCCTGTACTTTGAATTTACATGAAACGACAGCTAGTTGGAGAAATCATGTAGTTTCTCTGGAAATTTACTGGCTTTGTTTTACTTTACGAGGGTTCACTAATATTTTGTCAACTGTGTATTTCCAAATTATCAGAAGAGGATGGTTGGAGAAAGACTCAAAAGCGCAAAACTTGCCTATTGCAAAGGCTGTCTATTCATTTCTGCAGGATAACGTGTCGTCATCCTTCGATTTTCCCAGGTGGAACAACTGTCGAATTGTGTGCCAACAGTCTCAACTGAAGCAGCTTCTCTCAGACGCAGAATCATCATCAAAAGGTCCCAAAACTAATGAAATTATTACTTTAAACAATTCCCCCTTGAATCTTCCAGCAGGATCCAAGATATCCAGATCCGGTATAAACAATGACAAATACAAAGATGTTGATTCTAAATCAAGTAACAGGACAGGAACTGAGCTTCTGACCACAACTGTTTGA

mRNA sequence

ATGGCGCTCTCACCACATCTGTCCCTGCCGAATCATTTTCGAACTTCCTCGGCGTTTCAGAAACGGTTCTTAGAGTCTCACCAATTTCCATTTCTCTCCAAACTTTCCAGTTTTCATGTTCGCCGTAGGTTTCTCTCAGATGACTGGAGCTTCTCCTCCAACGTGGGGAAGTGCAGAGCGAAGCCGAAGGATCTAGTTCTTGGAAATCCGTCGGTAATTGTTGAGAAGGGCAAATACAGCTATGACGTAGAAACCCTAATCAACAAACTTAGTAGCCTCCCGCCACGAGGCAGCATAGCTCGCTGTCTCGATATCTTTAAGAACAGGCTCTCGCTCAATGACTTCAGCTTCGTTTTCAAGGAATTCGCGGCGCGCGGGGATTGGCAACGGTCGTTACGCCTCTTCAAGTATATGCAGCGTCAGATATGGTGCAAGCCGAACGAGCACATCTATACCATCATGATCAGCTTGCTCGGCCGCGAAGGATTGCTAGAGAAATGTAGCGAGATATTCGATGAAATGGCGAGCCAAGGCGTGATACGTAGTGTGTTTTCTTACACCGCTTTGATAAATGCCTACGGGCGCAATGGTCAGTATGAAACCTCACTTGAACTTATTGAAAGGATGAAGAGAGAGAGAGTGTCACCTAATATATTGACTTACAATACCGTGATAAATGCCTGTGCTAGAGGTGATTTGGATTGGGAGGGATTGTTGGGATTGTTCGCCGAGATGAGGCATGAAGGGGTTCAACCTGATCTTGTTACTTATAATACTTTGCTTAGTGCTTGTGCTGCCCGGGGTCTAGGTGACGAGGCAGAGATGGTCTTCAAGACTATGATAGAGGGAGGGATAGTCCCTGAGATAACAACATATAGTTATATTGTTGAAACATTTGGAAAATTGGGTAAGCTTGAGAAAGTTGCTATGCTGCTAAAAGAGATGGAGTCCGAAGGGTATTTGCCTGATATATCATCCTACAATGTGTTAATAGAGGCACATGCAAAATTGGGGTCGATTAAGGAGGCAATGGATGTGTTTAAGCAGATGCAAGCAGCAGGATGTGTGCCAAATGCATCCACTTACAGTATTCTGTTAAATTTATATGGGAAGCATGGGAGGTATGATGATGTTCGGGAGCTTTTTCTTGAAATGAAAGAGAGCAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGGGAGGGTGGATACTTCAAAGAGGTTGTAACTCTGTTTCATGATTTGGTGGAGGAGAAGATTGACCCAAATATGGAGACATATGAGGGGTTAGTATTTGCTTGTGGAAAGGGAGGGTTGCACGAGGATGCCAAGAAAATTTTACTTCACATGAATGAGAAAGGAATAGTCCCAAGTTCCAAAGCATACAGTGGACTGATTGAAGCATACGGACTGGCTGCATTGTATGATGAAGCTGTCGTTGCGTTTAACACTATGAATGAAGTGGGAAGCAAGTCAACTGTCGATACCTACAATTCACTAATTCACACATTTGCTAGGGGCGGACTCTACAAGGAGTTTGAAGCAATCTTATTGAGAATGAGAGAATCCGGCATTTCAAGGAATGTGAAGTCATTTAGTGGTATTATTGAAGGTTATAGGCAAAGTGGTCAGTTTGAAGAAGCTATAAAGGCCTTTGTTGAAATGGAAAAGATGAGATGTGAACCTGATGAGCAAACCCTGGAGGCAGTTTTAGGTGTTTATTGCTTTGCAGGTCTTGTTGATGAGAGCAAGGAGCAGTTTCATGAAATTAAAGCTACAGGAATATTGCCTAGTGTGTTGTGCTACTGCATGATGCTTGCGGTGTATGCCAAGAATGGCAGGTGGGATAATGCCTATGAGTTACTCGATGAGATGATTACGAACAGGGTATCTAGTATTCATCAAGTCATTGGACAGATGATCAAGGGAGATTACGATGATGATTCTAATTGGCAGATGGTTGAGTATGTTTTTGACAAACTTAACGCTGAAGGGTGTGGATTGGGAACAAGATTTTACAATACACTCTTAGACGCACTTTGGTGGCTTGGCCAGAAAGGCCGTGCAGCAAGGGTTCTCATTGAAGCAACAAAACGGGGCCTTTTCCCTGAACTTTTCCGCAAAAGCAAACTAGTGTGGTCTGTGGATGTGCACAGAATGTGGGAAGGTGGTGCATATACAGCAATATCACTTTGGGTTAACACTATGAACAAAATGATCAAGGATGAGGAGGATCTCCCTCAGCTTGCAGCTGTTGTAGTTGGAAGAGGATGGTTGGAGAAAGACTCAAAAGCGCAAAACTTGCCTATTGCAAAGGCTGTCTATTCATTTCTGCAGGATAACGTGTCGTCATCCTTCGATTTTCCCAGGTGGAACAACTGTCGAATTGTGTGCCAACAGTCTCAACTGAAGCAGCTTCTCTCAGACGCAGAATCATCATCAAAAGGTCCCAAAACTAATGAAATTATTACTTTAAACAATTCCCCCTTGAATCTTCCAGCAGGATCCAAGATATCCAGATCCGGTATAAACAATGACAAATACAAAGATGTTGATTCTAAATCAAGTAACAGGACAGGAACTGAGCTTCTGACCACAACTGTTTGA

Coding sequence (CDS)

ATGGCGCTCTCACCACATCTGTCCCTGCCGAATCATTTTCGAACTTCCTCGGCGTTTCAGAAACGGTTCTTAGAGTCTCACCAATTTCCATTTCTCTCCAAACTTTCCAGTTTTCATGTTCGCCGTAGGTTTCTCTCAGATGACTGGAGCTTCTCCTCCAACGTGGGGAAGTGCAGAGCGAAGCCGAAGGATCTAGTTCTTGGAAATCCGTCGGTAATTGTTGAGAAGGGCAAATACAGCTATGACGTAGAAACCCTAATCAACAAACTTAGTAGCCTCCCGCCACGAGGCAGCATAGCTCGCTGTCTCGATATCTTTAAGAACAGGCTCTCGCTCAATGACTTCAGCTTCGTTTTCAAGGAATTCGCGGCGCGCGGGGATTGGCAACGGTCGTTACGCCTCTTCAAGTATATGCAGCGTCAGATATGGTGCAAGCCGAACGAGCACATCTATACCATCATGATCAGCTTGCTCGGCCGCGAAGGATTGCTAGAGAAATGTAGCGAGATATTCGATGAAATGGCGAGCCAAGGCGTGATACGTAGTGTGTTTTCTTACACCGCTTTGATAAATGCCTACGGGCGCAATGGTCAGTATGAAACCTCACTTGAACTTATTGAAAGGATGAAGAGAGAGAGAGTGTCACCTAATATATTGACTTACAATACCGTGATAAATGCCTGTGCTAGAGGTGATTTGGATTGGGAGGGATTGTTGGGATTGTTCGCCGAGATGAGGCATGAAGGGGTTCAACCTGATCTTGTTACTTATAATACTTTGCTTAGTGCTTGTGCTGCCCGGGGTCTAGGTGACGAGGCAGAGATGGTCTTCAAGACTATGATAGAGGGAGGGATAGTCCCTGAGATAACAACATATAGTTATATTGTTGAAACATTTGGAAAATTGGGTAAGCTTGAGAAAGTTGCTATGCTGCTAAAAGAGATGGAGTCCGAAGGGTATTTGCCTGATATATCATCCTACAATGTGTTAATAGAGGCACATGCAAAATTGGGGTCGATTAAGGAGGCAATGGATGTGTTTAAGCAGATGCAAGCAGCAGGATGTGTGCCAAATGCATCCACTTACAGTATTCTGTTAAATTTATATGGGAAGCATGGGAGGTATGATGATGTTCGGGAGCTTTTTCTTGAAATGAAAGAGAGCAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGGGAGGGTGGATACTTCAAAGAGGTTGTAACTCTGTTTCATGATTTGGTGGAGGAGAAGATTGACCCAAATATGGAGACATATGAGGGGTTAGTATTTGCTTGTGGAAAGGGAGGGTTGCACGAGGATGCCAAGAAAATTTTACTTCACATGAATGAGAAAGGAATAGTCCCAAGTTCCAAAGCATACAGTGGACTGATTGAAGCATACGGACTGGCTGCATTGTATGATGAAGCTGTCGTTGCGTTTAACACTATGAATGAAGTGGGAAGCAAGTCAACTGTCGATACCTACAATTCACTAATTCACACATTTGCTAGGGGCGGACTCTACAAGGAGTTTGAAGCAATCTTATTGAGAATGAGAGAATCCGGCATTTCAAGGAATGTGAAGTCATTTAGTGGTATTATTGAAGGTTATAGGCAAAGTGGTCAGTTTGAAGAAGCTATAAAGGCCTTTGTTGAAATGGAAAAGATGAGATGTGAACCTGATGAGCAAACCCTGGAGGCAGTTTTAGGTGTTTATTGCTTTGCAGGTCTTGTTGATGAGAGCAAGGAGCAGTTTCATGAAATTAAAGCTACAGGAATATTGCCTAGTGTGTTGTGCTACTGCATGATGCTTGCGGTGTATGCCAAGAATGGCAGGTGGGATAATGCCTATGAGTTACTCGATGAGATGATTACGAACAGGGTATCTAGTATTCATCAAGTCATTGGACAGATGATCAAGGGAGATTACGATGATGATTCTAATTGGCAGATGGTTGAGTATGTTTTTGACAAACTTAACGCTGAAGGGTGTGGATTGGGAACAAGATTTTACAATACACTCTTAGACGCACTTTGGTGGCTTGGCCAGAAAGGCCGTGCAGCAAGGGTTCTCATTGAAGCAACAAAACGGGGCCTTTTCCCTGAACTTTTCCGCAAAAGCAAACTAGTGTGGTCTGTGGATGTGCACAGAATGTGGGAAGGTGGTGCATATACAGCAATATCACTTTGGGTTAACACTATGAACAAAATGATCAAGGATGAGGAGGATCTCCCTCAGCTTGCAGCTGTTGTAGTTGGAAGAGGATGGTTGGAGAAAGACTCAAAAGCGCAAAACTTGCCTATTGCAAAGGCTGTCTATTCATTTCTGCAGGATAACGTGTCGTCATCCTTCGATTTTCCCAGGTGGAACAACTGTCGAATTGTGTGCCAACAGTCTCAACTGAAGCAGCTTCTCTCAGACGCAGAATCATCATCAAAAGGTCCCAAAACTAATGAAATTATTACTTTAAACAATTCCCCCTTGAATCTTCCAGCAGGATCCAAGATATCCAGATCCGGTATAAACAATGACAAATACAAAGATGTTGATTCTAAATCAAGTAACAGGACAGGAACTGAGCTTCTGACCACAACTGTTTGA

Protein sequence

MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRAKPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFKEFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSVDVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVYSFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAGSKISRSGINNDKYKDVDSKSSNRTGTELLTTTV
Homology
BLAST of Moc03g30900 vs. NCBI nr
Match: XP_022137367.1 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic [Momordica charantia])

HSP 1 Score: 1757.7 bits (4551), Expect = 0.0e+00
Identity = 873/873 (100.00%), Postives = 873/873 (100.00%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA
Sbjct: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY
Sbjct: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI
Sbjct: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI
Sbjct: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV
Sbjct: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY
Sbjct: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG
Sbjct: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV
Sbjct: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 873

BLAST of Moc03g30900 vs. NCBI nr
Match: XP_038894203.1 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1630.2 bits (4220), Expect = 0.0e+00
Identity = 812/873 (93.01%), Postives = 838/873 (95.99%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+LS+PN F+TS+ FQKR+LE  Q PFLSKLS+F VRRRF SDDWS SS+V KCRA
Sbjct: 1   MALSPYLSVPNPFKTSTTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWSLSSDVAKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLEL+ERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTM+EGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMVEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAK GSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKSGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLV+ACGKGGLHEDAKKI+ HMNEKG+VPSSKAYSGLIEAYG AALY
Sbjct: 421 VEEKIDPNMETYEGLVYACGKGGLHEDAKKIIRHMNEKGMVPSSKAYSGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEA+VAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRE GISRN KSFSGI
Sbjct: 481 DEALVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMREHGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQFEEAIKAFVEMEKMR EPDEQTLEAVLGVYCFAGLVDESKEQFHEIKA+GI
Sbjct: 541 IEGYRQSGQFEEAIKAFVEMEKMRFEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LP+VLCYCMMLAVYAKNGRWD+AYELLDEMI NRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPNVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLG RFYNTLL+ALWWLGQKGRAARVL EATKRGLFPELFRK KLVWSV
Sbjct: 661 YVFDKLNAEGCGLGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRKRKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEG AYTA+SLWVN MN+M+ D EDLPQLAAVVVGRGWLEKDS AQNLPIA+AVY
Sbjct: 721 DVHRMWEGSAYTALSLWVNKMNEMLMDGEDLPQLAAVVVGRGWLEKDSTAQNLPIARAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQQSQLKQLLS+AE+SS     +EIITLNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFSFPGWNNGRIVCQQSQLKQLLSNAEASS-----SEIITLNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGINNDKYKDVDSKSSNRTGTELLTTT+
Sbjct: 841 AKISRSGINNDKYKDVDSKSSNRTGTELLTTTI 867

BLAST of Moc03g30900 vs. NCBI nr
Match: XP_038894204.1 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 [Benincasa hispida])

HSP 1 Score: 1623.6 bits (4203), Expect = 0.0e+00
Identity = 811/873 (92.90%), Postives = 837/873 (95.88%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+LS+PN F+TS+ FQKR+LE  Q PFLSKLS+F VRRRF SDDWS SS+V KCRA
Sbjct: 1   MALSPYLSVPNPFKTSTTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWSLSSDVAKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLEL+ERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTM+EGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMVEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAK GSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKSGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLV+ACGKGGLHEDAKKI+ HMNEKG+VPSSKAYSGLIEAYG AALY
Sbjct: 421 VEEKIDPNMETYEGLVYACGKGGLHEDAKKIIRHMNEKGMVPSSKAYSGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEA+VAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRE GISRN KSFSGI
Sbjct: 481 DEALVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMREHGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQFEEAIKAFVEMEKMR EPDEQTLEAVLGVYCFAGLVDESKEQFHEIKA+GI
Sbjct: 541 IEGYRQSGQFEEAIKAFVEMEKMRFEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LP+VLCYCMMLAVYAKNGRWD+AYELLDEMI NRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPNVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLG RFYNTLL+ALWWLGQKGRAARVL EATKRGLFPELFRK KLVWSV
Sbjct: 661 YVFDKLNAEGCGLGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRKRKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEG AYTA+SLWVN MN+M+ D EDLPQLAAVVVG GWLEKDS AQNLPIA+AVY
Sbjct: 721 DVHRMWEGSAYTALSLWVNKMNEMLMDGEDLPQLAAVVVG-GWLEKDSTAQNLPIARAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQQSQLKQLLS+AE+SS     +EIITLNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFSFPGWNNGRIVCQQSQLKQLLSNAEASS-----SEIITLNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGINNDKYKDVDSKSSNRTGTELLTTT+
Sbjct: 841 AKISRSGINNDKYKDVDSKSSNRTGTELLTTTI 866

BLAST of Moc03g30900 vs. NCBI nr
Match: XP_008437850.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 1601.3 bits (4145), Expect = 0.0e+00
Identity = 801/873 (91.75%), Postives = 825/873 (94.50%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+ S+ N FRTSS FQKR+LE  Q PFLSKLS+F VRRRF SDDW  SS+VGK RA
Sbjct: 1   MALSPYFSVSNPFRTSSTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWRLSSDVGKARA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           K KDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KAKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTI+ISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIIISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLEL+ERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLQMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEE IDPNMETYEGLVFACGKGGLHEDAKKIL HMNEKGIVPSSKAY+GLIEAYG AALY
Sbjct: 421 VEENIDPNMETYEGLVFACGKGGLHEDAKKILFHMNEKGIVPSSKAYTGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEAVVAFNTMNEVGSKST+DTYNSLIHTFARGGLYKEFEAIL RMRE GISRN KSFSGI
Sbjct: 481 DEAVVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQ+EEAIKAFVEMEKMRCE DEQTLEAVLGVYCFAGLVDESKEQF EIKA+GI
Sbjct: 541 IEGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEAVLGVYCFAGLVDESKEQFVEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LPSVLCYCMMLAVYAKNGRWD+AYELLDEMI NRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPSVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCG G RFYNTLL+ALWWLGQKGRA RVL EATKRGLFPELFR+SKLVWSV
Sbjct: 661 YVFDKLNAEGCGFGMRFYNTLLEALWWLGQKGRAGRVLTEATKRGLFPELFRQSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVN MN+M+ D ED+PQLAAVVVGRGWLEKDS A+NLPIA+AV 
Sbjct: 721 DVHRMWEGGAYTAISLWVNKMNEMLMDGEDIPQLAAVVVGRGWLEKDSTARNLPIARAVN 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQQSQLKQLL+ + S        EII LNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFSFPGWNNGRIVCQQSQLKQLLTASSS--------EIIALNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGINNDKYKD+DSKSSNRTGTELLTTTV
Sbjct: 841 AKISRSGINNDKYKDIDSKSSNRTGTELLTTTV 864

BLAST of Moc03g30900 vs. NCBI nr
Match: KAG6584246.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1595.5 bits (4130), Expect = 0.0e+00
Identity = 794/873 (90.95%), Postives = 825/873 (94.50%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+LS+PNH   S+ FQ+R+L S Q PFLS  S+F VRRRF SD+WS  S+VGKCRA
Sbjct: 1   MALSPYLSVPNHLEISTTFQQRYLLSQQLPFLSNRSNFSVRRRFFSDNWSLFSDVGKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYE SLEL+ERMKR+RVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYEISLELLERMKRDRVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAAR LGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARSLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLI+AHAK GSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIDAHAKSGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKG+VPSSKAY GLIEAYG AALY
Sbjct: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGMVPSSKAYGGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEA+VAFNTMNEVGSKSTVDTYNSLIHTFA+GGLYKE EAI+ RM ESGISRN KSFSGI
Sbjct: 481 DEALVAFNTMNEVGSKSTVDTYNSLIHTFAKGGLYKELEAIISRMGESGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IE YRQSGQFEEAIKAFVEMEKMRC+PDEQTLEAVLGVYCFAGLVDESKEQFHEIKA+GI
Sbjct: 541 IECYRQSGQFEEAIKAFVEMEKMRCKPDEQTLEAVLGVYCFAGLVDESKEQFHEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           +PSVLCYCMMLAVYAKNGRWDNAYELLDEMI NR S IHQVIGQMIKG YDDDSNWQMVE
Sbjct: 601 VPSVLCYCMMLAVYAKNGRWDNAYELLDEMIKNRESGIHQVIGQMIKGGYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLG RFYNTLL+ALWWLGQKGRAARVL EATKRGLFPELFRKSKLVWSV
Sbjct: 661 YVFDKLNAEGCGLGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRKSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVN MN+M+ D E+LPQ+AAVVVGRGWLEKD+ AQNLPI +AVY
Sbjct: 721 DVHRMWEGGAYTAISLWVNKMNEMLMDGEELPQVAAVVVGRGWLEKDTTAQNLPIPRAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQ SQLKQLL+D E+S     T+EIITLNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFTFPGWNNSRIVCQPSQLKQLLADTEAS-----TSEIITLNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGI+NDKY+DVDSKSSNRTGTELLT TV
Sbjct: 841 AKISRSGISNDKYEDVDSKSSNRTGTELLTATV 867

BLAST of Moc03g30900 vs. ExPASy Swiss-Prot
Match: Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 1226.1 bits (3171), Expect = 0.0e+00
Identity = 613/859 (71.36%), Postives = 714/859 (83.12%), Query Frame = 0

Query: 26  SHQFPFLSKLSSFHVRRRFLSDD------------WSFSSNVGKCRAKPKDLVLGNPSVI 85
           SH   FL + SSF   RRF   +             SFS   GK +AK KDLVLGNPSV 
Sbjct: 10  SHHLSFLIQNSSFIGNRRFADGNRLRFLSGGNRKPCSFS---GKIKAKTKDLVLGNPSVS 69

Query: 86  VEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFKEFAARGDWQRSLR 145
           VEKGKYSYDVE+LINKLSSLPPRGSIARCLDIFKN+LSLNDF+ VFKEFA RGDWQRSLR
Sbjct: 70  VEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLR 129

Query: 146 LFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALINAY 205
           LFKYMQRQIWCKPNEHIYTIMISLLGREGLL+KC E+FDEM SQGV RSVFSYTALINAY
Sbjct: 130 LFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAY 189

Query: 206 GRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPD 265
           GRNG+YETSLEL++RMK E++SP+ILTYNTVINACARG LDWEGLLGLFAEMRHEG+QPD
Sbjct: 190 GRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPD 249

Query: 266 LVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAMLLK 325
           +VTYNTLLSACA RGLGDEAEMVF+TM +GGIVP++TTYS++VETFGKL +LEKV  LL 
Sbjct: 250 IVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLG 309

Query: 326 EMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHG 385
           EM S G LPDI+SYNVL+EA+AK GSIKEAM VF QMQAAGC PNA+TYS+LLNL+G+ G
Sbjct: 310 EMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSG 369

Query: 386 RYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYE 445
           RYDDVR+LFLEMK S+ +PDA TYNILI VFGEGGYFKEVVTLFHD+VEE I+P+METYE
Sbjct: 370 RYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYE 429

Query: 446 GLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEV 505
           G++FACGKGGLHEDA+KIL +M    IVPSSKAY+G+IEA+G AALY+EA+VAFNTM+EV
Sbjct: 430 GIIFACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEV 489

Query: 506 GSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEA 565
           GS  +++T++SL+++FARGGL KE EAIL R+ +SGI RN  +F+  IE Y+Q G+FEEA
Sbjct: 490 GSNPSIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEA 549

Query: 566 IKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAV 625
           +K +V+MEK RC+PDE+TLEAVL VY FA LVDE +EQF E+KA+ ILPS++CYCMMLAV
Sbjct: 550 VKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLAV 609

Query: 626 YAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEGCGL 685
           Y K  RWD+  ELL+EM++NRVS+IHQVIGQMIKGDYDDDSNWQ+VEYV DKLN+EGCGL
Sbjct: 610 YGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQIVEYVLDKLNSEGCGL 669

Query: 686 GTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSVDVHRMWEGGAYTA 745
           G RFYN LLDALWWLGQK RAARVL EATKRGLFPELFRK+KLVWSVDVHRM EGG YTA
Sbjct: 670 GIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLVWSVDVHRMSEGGMYTA 729

Query: 746 ISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVYSFLQDNVSSSFDF 805
           +S+W+N +N M+  + DLPQLA VV  RG LEK S A+  PIAKA +SFLQD+VSSSF F
Sbjct: 730 LSVWLNDINDMLL-KGDLPQLAVVVSVRGQLEKSSAARESPIAKAAFSFLQDHVSSSFSF 789

Query: 806 PRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAGSKISRSGINNDKY 865
             WN  RI+CQ+SQLKQLLS  E +S+  +   ++ L NSP+   AG++ S S   N  +
Sbjct: 790 TGWNGGRIMCQRSQLKQLLSTKEPTSEESENKNLVALANSPI-FAAGTRASTSSDTN--H 849

Query: 866 KDVDSKSSNRTGTELLTTT 873
               ++   RT  EL  +T
Sbjct: 850 SGNPTQRRTRTKKELAGST 861

BLAST of Moc03g30900 vs. ExPASy Swiss-Prot
Match: Q9SIC9 (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 9.3e-79
Identity = 188/706 (26.63%), Postives = 346/706 (49.01%), Query Frame = 0

Query: 113 NDFSFVFKEFAARGDWQRSLRLFKY-MQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIF 172
           +D +++ +E   R +  +++  +++ ++R+        + + MIS LGR G +     IF
Sbjct: 197 DDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTIAKRIF 256

Query: 173 DEMASQGVIRSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARG 232
           +   + G   +V++++ALI+AYGR+G +E ++ +   MK   + PN++TYN VI+AC +G
Sbjct: 257 ETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVIDACGKG 316

Query: 233 DLDWEGLLGLFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITT 292
            ++++ +   F EM+  GVQPD +T+N+LL+ C+  GL + A  +F  M    I  ++ +
Sbjct: 317 GMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVFS 376

Query: 293 YSYIVETFGKLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQ 352
           Y+ +++   K G+++    +L +M  +  +P++ SY+ +I+  AK G   EA+++F +M+
Sbjct: 377 YNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMR 436

Query: 353 AAGCVPNASTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFK 412
             G   +  +Y+ LL++Y K GR ++  ++  EM     + D  TYN L+  +G+ G + 
Sbjct: 437 YLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYD 496

Query: 413 EVVTLFHDLVEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLI 472
           EV  +F ++  E + PN+ TY  L+    KGGL+++A +I       G+      YS LI
Sbjct: 497 EVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVLYSALI 556

Query: 473 EAYGLAALYDEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGIS 532
           +A     L   AV   + M + G    V TYNS+I  F R     +  A          S
Sbjct: 557 DALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSAT-MDRSADYSNGGSLPFS 616

Query: 533 RNVKSFSGIIEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQ 592
            +  S     EG R    F +            CE   Q L  +L V+           +
Sbjct: 617 SSALSALTETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILEVF----------RK 676

Query: 593 FHEIKATGILPSVLCYCMMLAVYAKNGRWDNAYELLDE--MITNRVSSIHQVIGQMIKGD 652
            H+++   I P+V+ +  +L   ++   +++A  LL+E  +  N+V   + V+  ++ G 
Sbjct: 677 MHQLE---IKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKV---YGVVHGLLMG- 736

Query: 653 YDDDSNWQMVEYVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPE 712
              ++ W   + +FDK+N       + FYN L D LW  GQK  A  V +E   R ++  
Sbjct: 737 -QRENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWEN 796

Query: 713 LFRKSKLVWSVDVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSK 772
           ++  S L    D+H M  G A   +  W+  +  ++ +  +LP++ +++ G G   K SK
Sbjct: 797 VWSDSCL----DLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWG---KHSK 856

Query: 773 AQNLPIAKAVYSFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDA 816
                  +     L   + + F   + N  R     S +   L ++
Sbjct: 857 VVGDGALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRES 876

BLAST of Moc03g30900 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 1.0e-77
Identity = 173/635 (27.24%), Postives = 305/635 (48.03%), Query Frame = 0

Query: 86  LINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFKEFAARGDWQRSLRLFKYM---QRQI 145
           L+N +   P  G ++R  D  K+ L   D   + K     G W+R++ LF+++       
Sbjct: 111 LVNSIVEQPLTG-LSRFFDSVKSELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSG 170

Query: 146 WCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALINAYGRNGQYETS 205
             K +  +  I + +LGRE      +++ D++  Q  +  V +YT +++AY R G+YE +
Sbjct: 171 ALKLDHQVIEIFVRILGRESQYSVAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKA 230

Query: 206 LELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPDLVTYNTLLS 265
           ++L ERMK    SP ++TYN +++   +    W  +LG+  EMR +G++ D  T +T+LS
Sbjct: 231 IDLFERMKEMGPSPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLS 290

Query: 266 ACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAMLLKEMESEGYLP 325
           ACA  GL  EA+  F  +   G  P   TY+ +++ FGK G   +   +LKEME      
Sbjct: 291 ACAREGLLREAKEFFAELKSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPA 350

Query: 326 DISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYDDVRELF 385
           D  +YN L+ A+ + G  KEA  V + M   G +PNA TY+ +++ YGK G+ D+  +LF
Sbjct: 351 DSVTYNELVAAYVRAGFSKEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLF 410

Query: 386 LEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYEGLVFACGKG 445
             MKE+   P+  TYN ++ + G+     E++ +  D+      PN  T+  ++  CG  
Sbjct: 411 YSMKEAGCVPNTCTYNAVLSLLGKKSRSNEMIKMLCDMKSNGCSPNRATWNTMLALCGNK 470

Query: 446 GLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEVGSKSTVDTY 505
           G+ +   ++   M   G  P    ++ LI AYG      +A   +  M   G  + V TY
Sbjct: 471 GMDKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAGFNACVTTY 530

Query: 506 NSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEAIKAFVEMEK 565
           N+L++  AR G ++  E ++  M+  G      S+S +++ Y + G +    +    +++
Sbjct: 531 NALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIENRIKE 590

Query: 566 MRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAVYAKNGRWDN 625
            +  P    L  +L        +  S+  F   K  G  P ++ +  ML+++ +N  +D 
Sbjct: 591 GQIFPSWMLLRTLLLANFKCRALAGSERAFTLFKKHGYKPDMVIFNSMLSIFTRNNMYDQ 650

Query: 626 AYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEGCGLGTRFYNTLL 685
           A  +L+ +  + +S        ++         W+  E +   L           YNT++
Sbjct: 651 AEGILESIREDGLSPDLVTYNSLMDMYVRRGECWK-AEEILKTLEKSQLKPDLVSYNTVI 710

Query: 686 DALWWLGQKGRAARVLIEATKRGLFPELFRKSKLV 718
                 G    A R+L E T+RG+ P +F  +  V
Sbjct: 711 KGFCRRGLMQEAVRMLSEMTERGIRPCIFTYNTFV 743

BLAST of Moc03g30900 vs. ExPASy Swiss-Prot
Match: B8Y6I0 (Pentatricopeptide repeat-containing protein 10, chloroplastic OS=Zea mays OX=4577 GN=PPR10 PE=1 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 1.8e-74
Identity = 176/633 (27.80%), Postives = 306/633 (48.34%), Query Frame = 0

Query: 80  SYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFKEFAARGDWQRSLRLFKYMQ 139
           S D + L+  +SS  P  ++A  L   ++ L   D + + K     G W+ +L L ++  
Sbjct: 73  SPDAQVLVLAISS-HPLPTLAAFLASRRDELLRADITSLLKALELSGHWEWALALLRWAG 132

Query: 140 RQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEM-ASQGVIRSVFSYTALINAYGRNGQ 199
           ++     +     +++  LGREG  +    + DE     G    V +YT +++A  R G+
Sbjct: 133 KE--GAADASALEMVVRALGREGQHDAVCALLDETPLPPGSRLDVRAYTTVLHALSRAGR 192

Query: 200 YETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPDLVTYN 259
           YE +LEL   ++R+ V+P ++TYN V++   R    W  ++ L  EMR  GV+PD  T +
Sbjct: 193 YERALELFAELRRQGVAPTLVTYNVVLDVYGRMGRSWPRIVALLDEMRAAGVEPDGFTAS 252

Query: 260 TLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAMLLKEMESE 319
           T+++AC   GL DEA   F+ +   G  P + TY+ +++ FGK G   +   +L EME  
Sbjct: 253 TVIAACCRDGLVDEAVAFFEDLKARGHAPCVVTYNALLQVFGKAGNYTEALRVLGEMEQN 312

Query: 320 GYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYDDV 379
           G  PD  +YN L   +A+ G  +EA      M + G +PNA TY+ ++  YG  G+ D+ 
Sbjct: 313 GCQPDAVTYNELAGTYARAGFFEEAARCLDTMASKGLLPNAFTYNTVMTAYGNVGKVDEA 372

Query: 380 RELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYEGLVFA 439
             LF +MK++   P+  TYN+++ + G+   F  ++ +  ++      PN  T+  ++  
Sbjct: 373 LALFDQMKKTGFVPNVNTYNLVLGMLGKKSRFTVMLEMLGEMSRSGCTPNRVTWNTMLAV 432

Query: 440 CGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEVGSKST 499
           CGK G+ +   ++L  M   G+  S   Y+ LI AYG       A   +N M   G    
Sbjct: 433 CGKRGMEDYVTRVLEGMRSCGVELSRDTYNTLIAAYGRCGSRTNAFKMYNEMTSAGFTPC 492

Query: 500 VDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEAIKAFV 559
           + TYN+L++  +R G +   ++I+ +MR  G   N +S+S +++ Y + G    A  A +
Sbjct: 493 ITTYNALLNVLSRQGDWSTAQSIVSKMRTKGFKPNEQSYSLLLQCYAKGGNV--AGIAAI 552

Query: 560 EME---KMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAVYA 619
           E E        P    L  ++        +D  +  F E+KA G  P ++ +  ML++YA
Sbjct: 553 ENEVYGSGAVFPSWVILRTLVIANFKCRRLDGMETAFQEVKARGYNPDLVIFNSMLSIYA 612

Query: 620 KNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEGCGLGT 679
           KNG +  A E+ D +  + +S        ++        +W+  + +     ++      
Sbjct: 613 KNGMYSKATEVFDSIKRSGLSPDLITYNSLMDMYAKCSESWEAEKILNQLKCSQTMKPDV 672

Query: 680 RFYNTLLDALWWLGQKGRAARVLIEATKRGLFP 709
             YNT+++     G    A RVL E    G+ P
Sbjct: 673 VSYNTVINGFCKQGLVKEAQRVLSEMVADGMAP 700

BLAST of Moc03g30900 vs. ExPASy Swiss-Prot
Match: Q9LYZ9 (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 3.4e-73
Identity = 150/599 (25.04%), Postives = 295/599 (49.25%), Query Frame = 0

Query: 131 SLRLFKYM--QRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTA 190
           +LR F +   Q+      +  +  I+IS+LG+EG +   + +F+ +   G    V+SYT+
Sbjct: 154 ALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTS 213

Query: 191 LINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHE 250
           LI+A+  +G+Y  ++ + ++M+ +   P ++TYN ++N   +    W  +  L  +M+ +
Sbjct: 214 LISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVEKMKSD 273

Query: 251 GVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKV 310
           G+ PD  TYNTL++ C    L  EA  VF+ M   G   +  TY+ +++ +GK  + ++ 
Sbjct: 274 GIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEA 333

Query: 311 AMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNL 370
             +L EM   G+ P I +YN LI A+A+ G + EAM++  QM   G  P+  TY+ LL+ 
Sbjct: 334 MKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSG 393

Query: 371 YGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPN 430
           + + G+ +    +F EM+ +  +P+  T+N  I+++G  G F E++ +F ++    + P+
Sbjct: 394 FERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPD 453

Query: 431 METYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFN 490
           + T+  L+   G+ G+  +   +   M   G VP  + ++ LI AY     +++A+  + 
Sbjct: 454 IVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYR 513

Query: 491 TMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSG 550
            M + G    + TYN+++   ARGG++++ E +L  M +     N  ++  ++  Y    
Sbjct: 514 RMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGK 573

Query: 551 QFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYC 610
           +         E+     EP    L+ ++ V     L+ E++  F E+K  G  P +    
Sbjct: 574 EIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLN 633

Query: 611 MMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNA 670
            M+++Y +      A  +LD M   R  +        +   +   +++   E +  ++ A
Sbjct: 634 SMVSIYGRRQMVAKANGVLDYM-KERGFTPSMATYNSLMYMHSRSADFGKSEEILREILA 693

Query: 671 EGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSVDVHRMWE 728
           +G       YNT++ A     +   A+R+  E    G+ P++   +  + S     M+E
Sbjct: 694 KGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYAADSMFE 751

BLAST of Moc03g30900 vs. ExPASy TrEMBL
Match: A0A6J1C827 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111008838 PE=4 SV=1)

HSP 1 Score: 1757.7 bits (4551), Expect = 0.0e+00
Identity = 873/873 (100.00%), Postives = 873/873 (100.00%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA
Sbjct: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY
Sbjct: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI
Sbjct: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI
Sbjct: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV
Sbjct: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY
Sbjct: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG
Sbjct: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV
Sbjct: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 873

BLAST of Moc03g30900 vs. ExPASy TrEMBL
Match: A0A1S3AUM4 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103483154 PE=3 SV=1)

HSP 1 Score: 1601.3 bits (4145), Expect = 0.0e+00
Identity = 801/873 (91.75%), Postives = 825/873 (94.50%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+ S+ N FRTSS FQKR+LE  Q PFLSKLS+F VRRRF SDDW  SS+VGK RA
Sbjct: 1   MALSPYFSVSNPFRTSSTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWRLSSDVGKARA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           K KDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KAKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTI+ISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIIISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLEL+ERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLQMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEE IDPNMETYEGLVFACGKGGLHEDAKKIL HMNEKGIVPSSKAY+GLIEAYG AALY
Sbjct: 421 VEENIDPNMETYEGLVFACGKGGLHEDAKKILFHMNEKGIVPSSKAYTGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEAVVAFNTMNEVGSKST+DTYNSLIHTFARGGLYKEFEAIL RMRE GISRN KSFSGI
Sbjct: 481 DEAVVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQ+EEAIKAFVEMEKMRCE DEQTLEAVLGVYCFAGLVDESKEQF EIKA+GI
Sbjct: 541 IEGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEAVLGVYCFAGLVDESKEQFVEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LPSVLCYCMMLAVYAKNGRWD+AYELLDEMI NRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPSVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCG G RFYNTLL+ALWWLGQKGRA RVL EATKRGLFPELFR+SKLVWSV
Sbjct: 661 YVFDKLNAEGCGFGMRFYNTLLEALWWLGQKGRAGRVLTEATKRGLFPELFRQSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVN MN+M+ D ED+PQLAAVVVGRGWLEKDS A+NLPIA+AV 
Sbjct: 721 DVHRMWEGGAYTAISLWVNKMNEMLMDGEDIPQLAAVVVGRGWLEKDSTARNLPIARAVN 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQQSQLKQLL+ + S        EII LNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFSFPGWNNGRIVCQQSQLKQLLTASSS--------EIIALNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGINNDKYKD+DSKSSNRTGTELLTTTV
Sbjct: 841 AKISRSGINNDKYKDIDSKSSNRTGTELLTTTV 864

BLAST of Moc03g30900 vs. ExPASy TrEMBL
Match: A0A1S3AVM1 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483154 PE=3 SV=1)

HSP 1 Score: 1594.7 bits (4128), Expect = 0.0e+00
Identity = 800/873 (91.64%), Postives = 824/873 (94.39%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+ S+ N FRTSS FQKR+LE  Q PFLSKLS+F VRRRF SDDW  SS+VGK RA
Sbjct: 1   MALSPYFSVSNPFRTSSTFQKRYLECQQLPFLSKLSNFSVRRRFFSDDWRLSSDVGKARA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           K KDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KAKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTI+ISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIIISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYETSLEL+ERMKRERVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYETSLELLERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLQMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEE IDPNMETYEGLVFACGKGGLHEDAKKIL HMNEKGIVPSSKAY+GLIEAYG AALY
Sbjct: 421 VEENIDPNMETYEGLVFACGKGGLHEDAKKILFHMNEKGIVPSSKAYTGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEAVVAFNTMNEVGSKST+DTYNSLIHTFARGGLYKEFEAIL RMRE GISRN KSFSGI
Sbjct: 481 DEAVVAFNTMNEVGSKSTIDTYNSLIHTFARGGLYKEFEAILSRMREYGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IEGYRQSGQ+EEAIKAFVEMEKMRCE DEQTLEAVLGVYCFAGLVDESKEQF EIKA+GI
Sbjct: 541 IEGYRQSGQYEEAIKAFVEMEKMRCELDEQTLEAVLGVYCFAGLVDESKEQFVEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           LPSVLCYCMMLAVYAKNGRWD+AYELLDEMI NRVSSIHQVIGQMIKGDYDDDSNWQMVE
Sbjct: 601 LPSVLCYCMMLAVYAKNGRWDDAYELLDEMIKNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCG G RFYNTLL+ALWWLGQKGRA RVL EATKRGLFPELFR+SKLVWSV
Sbjct: 661 YVFDKLNAEGCGFGMRFYNTLLEALWWLGQKGRAGRVLTEATKRGLFPELFRQSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVN MN+M+ D ED+PQLAAVVVG GWLEKDS A+NLPIA+AV 
Sbjct: 721 DVHRMWEGGAYTAISLWVNKMNEMLMDGEDIPQLAAVVVG-GWLEKDSTARNLPIARAVN 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQQSQLKQLL+ + S        EII LNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFSFPGWNNGRIVCQQSQLKQLLTASSS--------EIIALNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGINNDKYKD+DSKSSNRTGTELLTTTV
Sbjct: 841 AKISRSGINNDKYKDIDSKSSNRTGTELLTTTV 863

BLAST of Moc03g30900 vs. ExPASy TrEMBL
Match: A0A6J1E7U5 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111431586 PE=3 SV=1)

HSP 1 Score: 1594.3 bits (4127), Expect = 0.0e+00
Identity = 793/873 (90.84%), Postives = 825/873 (94.50%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+LS+PNH   S+ FQ+R+L S Q PFLS  S+F VRRRF SD+WS  S+VGKCRA
Sbjct: 1   MALSPYLSVPNHLEISTTFQQRYLLSQQLPFLSNRSNFSVRRRFFSDNWSLFSDVGKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYE SLEL+ERMKR+RVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYEISLELLERMKRDRVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAAR LGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARSLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLI+AHAK GSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIDAHAKSGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKG+VPSSKAY GLIEAYG AALY
Sbjct: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGMVPSSKAYGGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEA+VAFNTMNEVGSKSTVDTYNSLIHTFA+GGLYKE EAI+ RM ESGISRN KSFSGI
Sbjct: 481 DEALVAFNTMNEVGSKSTVDTYNSLIHTFAKGGLYKELEAIISRMGESGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IE YRQSGQFEEAIKAFVEMEKMRC+PDEQTLEAVLGVYCFAGLVDESKEQFHEIKA+GI
Sbjct: 541 IECYRQSGQFEEAIKAFVEMEKMRCKPDEQTLEAVLGVYCFAGLVDESKEQFHEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           +PSVLCYCMMLAVYAKNGRWDNAYELLDEMI NR S IHQVIGQMIKG YDDDSNWQMVE
Sbjct: 601 VPSVLCYCMMLAVYAKNGRWDNAYELLDEMIKNRESGIHQVIGQMIKGGYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLG RFYNTLL+ALWWLGQKGRAARVL EATKRGLFPELFRKSKLVWSV
Sbjct: 661 YVFDKLNAEGCGLGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRKSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVN MN+M+ D E+LPQ+AAVVVGRGWLEKD+ AQNLPI +AVY
Sbjct: 721 DVHRMWEGGAYTAISLWVNKMNEMLMDGEELPQVAAVVVGRGWLEKDTTAQNLPIPRAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQ SQLKQLL+D E+S     T+EI+TLNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFTFPGWNNGRIVCQPSQLKQLLTDTEAS-----TSEIVTLNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGI+NDKY+DVDSKSSNRTGTELLT TV
Sbjct: 841 AKISRSGISNDKYEDVDSKSSNRTGTELLTATV 867

BLAST of Moc03g30900 vs. ExPASy TrEMBL
Match: A0A6J1KL33 (pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495167 PE=3 SV=1)

HSP 1 Score: 1592.8 bits (4123), Expect = 0.0e+00
Identity = 794/873 (90.95%), Postives = 824/873 (94.39%), Query Frame = 0

Query: 1   MALSPHLSLPNHFRTSSAFQKRFLESHQFPFLSKLSSFHVRRRFLSDDWSFSSNVGKCRA 60
           MALSP+LS+PNH   S+ FQ+R+L S Q PFLS  S+F VRRRF SD+WS  S+VGKCRA
Sbjct: 1   MALSPYLSVPNHLEISTTFQQRYLLSQQLPFLSNRSNFSVRRRFFSDNWSLFSDVGKCRA 60

Query: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFK 120
           KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFS VFK
Sbjct: 61  KPKDLVLGNPSVIVEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSLVFK 120

Query: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180
           EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI
Sbjct: 121 EFAARGDWQRSLRLFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVI 180

Query: 181 RSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLG 240
           RSVFSYTALINAYGRNGQYE SLEL+ERMKR+RVSPNILTYNTVINACARGDLDWEGLLG
Sbjct: 181 RSVFSYTALINAYGRNGQYEISLELLERMKRDRVSPNILTYNTVINACARGDLDWEGLLG 240

Query: 241 LFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300
           LFAEMRHEGVQPDLVTYNTLLSACAAR LGDEAEMVFKTMIEGGIVPEITTYSYIVETFG
Sbjct: 241 LFAEMRHEGVQPDLVTYNTLLSACAARSLGDEAEMVFKTMIEGGIVPEITTYSYIVETFG 300

Query: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNAS 360
           KLGKLEKVAMLLKEMESEGYLPDISSYNVLI+AHAK GSIKEAMDVFKQMQAAGCVPNAS
Sbjct: 301 KLGKLEKVAMLLKEMESEGYLPDISSYNVLIDAHAKSGSIKEAMDVFKQMQAAGCVPNAS 360

Query: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420
           TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL
Sbjct: 361 TYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDL 420

Query: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALY 480
           VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKG+VPSSKAY GLIEAYG AALY
Sbjct: 421 VEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGMVPSSKAYGGLIEAYGQAALY 480

Query: 481 DEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGI 540
           DEA+VAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKE EAI+ RM ESGISRN KSFSGI
Sbjct: 481 DEALVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKELEAIISRMGESGISRNAKSFSGI 540

Query: 541 IEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGI 600
           IE YRQSGQFEEAIKAFVEMEKMRC+PDEQTLEAVLGVYCFAGLVDESKEQFHEIKA+GI
Sbjct: 541 IECYRQSGQFEEAIKAFVEMEKMRCKPDEQTLEAVLGVYCFAGLVDESKEQFHEIKASGI 600

Query: 601 LPSVLCYCMMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVE 660
           +PSVLCYCMMLAVYAKN RWDNAYELLDEMI NR S IHQVIGQMIKG YDDDSNWQMVE
Sbjct: 601 VPSVLCYCMMLAVYAKNARWDNAYELLDEMIKNRESGIHQVIGQMIKGGYDDDSNWQMVE 660

Query: 661 YVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSV 720
           YVFDKLNAEGCGLG RFYNTLL+ALWWLGQKGRAARVL EATKRGLFPELFRKSKLVWSV
Sbjct: 661 YVFDKLNAEGCGLGMRFYNTLLEALWWLGQKGRAARVLTEATKRGLFPELFRKSKLVWSV 720

Query: 721 DVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVY 780
           DVHRMWEGGAYTAISLWVN MN+M+ D E+LPQ+AAVVVGRGWLEKD+ AQNLPI +AVY
Sbjct: 721 DVHRMWEGGAYTAISLWVNKMNEMLMDGEELPQVAAVVVGRGWLEKDTTAQNLPIPRAVY 780

Query: 781 SFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAG 840
           SFLQDNVSSSF FP WNN RIVCQ SQLKQLL+D E+SS     +EIITLNNSPLNLP  
Sbjct: 781 SFLQDNVSSSFTFPGWNNGRIVCQPSQLKQLLADTEASS-----SEIITLNNSPLNLPE- 840

Query: 841 SKISRSGINNDKYKDVDSKSSNRTGTELLTTTV 874
           +KISRSGI+NDKY+DVDSKSSNRTGTELLT TV
Sbjct: 841 AKISRSGISNDKYEDVDSKSSNRTGTELLTATV 867

BLAST of Moc03g30900 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2 )

HSP 1 Score: 1226.1 bits (3171), Expect = 0.0e+00
Identity = 613/859 (71.36%), Postives = 714/859 (83.12%), Query Frame = 0

Query: 26  SHQFPFLSKLSSFHVRRRFLSDD------------WSFSSNVGKCRAKPKDLVLGNPSVI 85
           SH   FL + SSF   RRF   +             SFS   GK +AK KDLVLGNPSV 
Sbjct: 10  SHHLSFLIQNSSFIGNRRFADGNRLRFLSGGNRKPCSFS---GKIKAKTKDLVLGNPSVS 69

Query: 86  VEKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFKEFAARGDWQRSLR 145
           VEKGKYSYDVE+LINKLSSLPPRGSIARCLDIFKN+LSLNDF+ VFKEFA RGDWQRSLR
Sbjct: 70  VEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSLR 129

Query: 146 LFKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALINAY 205
           LFKYMQRQIWCKPNEHIYTIMISLLGREGLL+KC E+FDEM SQGV RSVFSYTALINAY
Sbjct: 130 LFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAY 189

Query: 206 GRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPD 265
           GRNG+YETSLEL++RMK E++SP+ILTYNTVINACARG LDWEGLLGLFAEMRHEG+QPD
Sbjct: 190 GRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPD 249

Query: 266 LVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAMLLK 325
           +VTYNTLLSACA RGLGDEAEMVF+TM +GGIVP++TTYS++VETFGKL +LEKV  LL 
Sbjct: 250 IVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLG 309

Query: 326 EMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHG 385
           EM S G LPDI+SYNVL+EA+AK GSIKEAM VF QMQAAGC PNA+TYS+LLNL+G+ G
Sbjct: 310 EMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSG 369

Query: 386 RYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYE 445
           RYDDVR+LFLEMK S+ +PDA TYNILI VFGEGGYFKEVVTLFHD+VEE I+P+METYE
Sbjct: 370 RYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETYE 429

Query: 446 GLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEV 505
           G++FACGKGGLHEDA+KIL +M    IVPSSKAY+G+IEA+G AALY+EA+VAFNTM+EV
Sbjct: 430 GIIFACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEV 489

Query: 506 GSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEA 565
           GS  +++T++SL+++FARGGL KE EAIL R+ +SGI RN  +F+  IE Y+Q G+FEEA
Sbjct: 490 GSNPSIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEA 549

Query: 566 IKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAV 625
           +K +V+MEK RC+PDE+TLEAVL VY FA LVDE +EQF E+KA+ ILPS++CYCMMLAV
Sbjct: 550 VKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLAV 609

Query: 626 YAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEGCGL 685
           Y K  RWD+  ELL+EM++NRVS+IHQVIGQMIKGDYDDDSNWQ+VEYV DKLN+EGCGL
Sbjct: 610 YGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQIVEYVLDKLNSEGCGL 669

Query: 686 GTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSVDVHRMWEGGAYTA 745
           G RFYN LLDALWWLGQK RAARVL EATKRGLFPELFRK+KLVWSVDVHRM EGG YTA
Sbjct: 670 GIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLVWSVDVHRMSEGGMYTA 729

Query: 746 ISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSKAQNLPIAKAVYSFLQDNVSSSFDF 805
           +S+W+N +N M+  + DLPQLA VV  RG LEK S A+  PIAKA +SFLQD+VSSSF F
Sbjct: 730 LSVWLNDINDMLL-KGDLPQLAVVVSVRGQLEKSSAARESPIAKAAFSFLQDHVSSSFSF 789

Query: 806 PRWNNCRIVCQQSQLKQLLSDAESSSKGPKTNEIITLNNSPLNLPAGSKISRSGINNDKY 865
             WN  RI+CQ+SQLKQLLS  E +S+  +   ++ L NSP+   AG++ S S   N  +
Sbjct: 790 TGWNGGRIMCQRSQLKQLLSTKEPTSEESENKNLVALANSPI-FAAGTRASTSSDTN--H 849

Query: 866 KDVDSKSSNRTGTELLTTT 873
               ++   RT  EL  +T
Sbjct: 850 SGNPTQRRTRTKKELAGST 861

BLAST of Moc03g30900 vs. TAIR 10
Match: AT2G31400.1 (genomes uncoupled 1 )

HSP 1 Score: 296.6 bits (758), Expect = 6.6e-80
Identity = 188/706 (26.63%), Postives = 346/706 (49.01%), Query Frame = 0

Query: 113 NDFSFVFKEFAARGDWQRSLRLFKY-MQRQIWCKPNEHIYTIMISLLGREGLLEKCSEIF 172
           +D +++ +E   R +  +++  +++ ++R+        + + MIS LGR G +     IF
Sbjct: 197 DDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKVTIAKRIF 256

Query: 173 DEMASQGVIRSVFSYTALINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARG 232
           +   + G   +V++++ALI+AYGR+G +E ++ +   MK   + PN++TYN VI+AC +G
Sbjct: 257 ETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAVIDACGKG 316

Query: 233 DLDWEGLLGLFAEMRHEGVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITT 292
            ++++ +   F EM+  GVQPD +T+N+LL+ C+  GL + A  +F  M    I  ++ +
Sbjct: 317 GMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVFS 376

Query: 293 YSYIVETFGKLGKLEKVAMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQ 352
           Y+ +++   K G+++    +L +M  +  +P++ SY+ +I+  AK G   EA+++F +M+
Sbjct: 377 YNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMR 436

Query: 353 AAGCVPNASTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFK 412
             G   +  +Y+ LL++Y K GR ++  ++  EM     + D  TYN L+  +G+ G + 
Sbjct: 437 YLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYD 496

Query: 413 EVVTLFHDLVEEKIDPNMETYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLI 472
           EV  +F ++  E + PN+ TY  L+    KGGL+++A +I       G+      YS LI
Sbjct: 497 EVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVLYSALI 556

Query: 473 EAYGLAALYDEAVVAFNTMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGIS 532
           +A     L   AV   + M + G    V TYNS+I  F R     +  A          S
Sbjct: 557 DALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSAT-MDRSADYSNGGSLPFS 616

Query: 533 RNVKSFSGIIEGYRQSGQFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQ 592
            +  S     EG R    F +            CE   Q L  +L V+           +
Sbjct: 617 SSALSALTETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILEVF----------RK 676

Query: 593 FHEIKATGILPSVLCYCMMLAVYAKNGRWDNAYELLDE--MITNRVSSIHQVIGQMIKGD 652
            H+++   I P+V+ +  +L   ++   +++A  LL+E  +  N+V   + V+  ++ G 
Sbjct: 677 MHQLE---IKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKV---YGVVHGLLMG- 736

Query: 653 YDDDSNWQMVEYVFDKLNAEGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPE 712
              ++ W   + +FDK+N       + FYN L D LW  GQK  A  V +E   R ++  
Sbjct: 737 -QRENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWEN 796

Query: 713 LFRKSKLVWSVDVHRMWEGGAYTAISLWVNTMNKMIKDEEDLPQLAAVVVGRGWLEKDSK 772
           ++  S L    D+H M  G A   +  W+  +  ++ +  +LP++ +++ G G   K SK
Sbjct: 797 VWSDSCL----DLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWG---KHSK 856

Query: 773 AQNLPIAKAVYSFLQDNVSSSFDFPRWNNCRIVCQQSQLKQLLSDA 816
                  +     L   + + F   + N  R     S +   L ++
Sbjct: 857 VVGDGALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRES 876

BLAST of Moc03g30900 vs. TAIR 10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 293.1 bits (749), Expect = 7.3e-79
Identity = 173/635 (27.24%), Postives = 305/635 (48.03%), Query Frame = 0

Query: 86  LINKLSSLPPRGSIARCLDIFKNRLSLNDFSFVFKEFAARGDWQRSLRLFKYM---QRQI 145
           L+N +   P  G ++R  D  K+ L   D   + K     G W+R++ LF+++       
Sbjct: 111 LVNSIVEQPLTG-LSRFFDSVKSELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSG 170

Query: 146 WCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALINAYGRNGQYETS 205
             K +  +  I + +LGRE      +++ D++  Q  +  V +YT +++AY R G+YE +
Sbjct: 171 ALKLDHQVIEIFVRILGRESQYSVAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKA 230

Query: 206 LELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPDLVTYNTLLS 265
           ++L ERMK    SP ++TYN +++   +    W  +LG+  EMR +G++ D  T +T+LS
Sbjct: 231 IDLFERMKEMGPSPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLS 290

Query: 266 ACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKVAMLLKEMESEGYLP 325
           ACA  GL  EA+  F  +   G  P   TY+ +++ FGK G   +   +LKEME      
Sbjct: 291 ACAREGLLREAKEFFAELKSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPA 350

Query: 326 DISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYDDVRELF 385
           D  +YN L+ A+ + G  KEA  V + M   G +PNA TY+ +++ YGK G+ D+  +LF
Sbjct: 351 DSVTYNELVAAYVRAGFSKEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLF 410

Query: 386 LEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYEGLVFACGKG 445
             MKE+   P+  TYN ++ + G+     E++ +  D+      PN  T+  ++  CG  
Sbjct: 411 YSMKEAGCVPNTCTYNAVLSLLGKKSRSNEMIKMLCDMKSNGCSPNRATWNTMLALCGNK 470

Query: 446 GLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEVGSKSTVDTY 505
           G+ +   ++   M   G  P    ++ LI AYG      +A   +  M   G  + V TY
Sbjct: 471 GMDKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAGFNACVTTY 530

Query: 506 NSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEAIKAFVEMEK 565
           N+L++  AR G ++  E ++  M+  G      S+S +++ Y + G +    +    +++
Sbjct: 531 NALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIENRIKE 590

Query: 566 MRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAVYAKNGRWDN 625
            +  P    L  +L        +  S+  F   K  G  P ++ +  ML+++ +N  +D 
Sbjct: 591 GQIFPSWMLLRTLLLANFKCRALAGSERAFTLFKKHGYKPDMVIFNSMLSIFTRNNMYDQ 650

Query: 626 AYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNAEGCGLGTRFYNTLL 685
           A  +L+ +  + +S        ++         W+  E +   L           YNT++
Sbjct: 651 AEGILESIREDGLSPDLVTYNSLMDMYVRRGECWK-AEEILKTLEKSQLKPDLVSYNTVI 710

Query: 686 DALWWLGQKGRAARVLIEATKRGLFPELFRKSKLV 718
                 G    A R+L E T+RG+ P +F  +  V
Sbjct: 711 KGFCRRGLMQEAVRMLSEMTERGIRPCIFTYNTFV 743

BLAST of Moc03g30900 vs. TAIR 10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 278.1 bits (710), Expect = 2.4e-74
Identity = 150/599 (25.04%), Postives = 295/599 (49.25%), Query Frame = 0

Query: 131 SLRLFKYM--QRQIWCKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTA 190
           +LR F +   Q+      +  +  I+IS+LG+EG +   + +F+ +   G    V+SYT+
Sbjct: 154 ALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTS 213

Query: 191 LINAYGRNGQYETSLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHE 250
           LI+A+  +G+Y  ++ + ++M+ +   P ++TYN ++N   +    W  +  L  +M+ +
Sbjct: 214 LISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVEKMKSD 273

Query: 251 GVQPDLVTYNTLLSACAARGLGDEAEMVFKTMIEGGIVPEITTYSYIVETFGKLGKLEKV 310
           G+ PD  TYNTL++ C    L  EA  VF+ M   G   +  TY+ +++ +GK  + ++ 
Sbjct: 274 GIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEA 333

Query: 311 AMLLKEMESEGYLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNL 370
             +L EM   G+ P I +YN LI A+A+ G + EAM++  QM   G  P+  TY+ LL+ 
Sbjct: 334 MKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSG 393

Query: 371 YGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPN 430
           + + G+ +    +F EM+ +  +P+  T+N  I+++G  G F E++ +F ++    + P+
Sbjct: 394 FERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPD 453

Query: 431 METYEGLVFACGKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFN 490
           + T+  L+   G+ G+  +   +   M   G VP  + ++ LI AY     +++A+  + 
Sbjct: 454 IVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYR 513

Query: 491 TMNEVGSKSTVDTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSG 550
            M + G    + TYN+++   ARGG++++ E +L  M +     N  ++  ++  Y    
Sbjct: 514 RMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGK 573

Query: 551 QFEEAIKAFVEMEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYC 610
           +         E+     EP    L+ ++ V     L+ E++  F E+K  G  P +    
Sbjct: 574 EIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLN 633

Query: 611 MMLAVYAKNGRWDNAYELLDEMITNRVSSIHQVIGQMIKGDYDDDSNWQMVEYVFDKLNA 670
            M+++Y +      A  +LD M   R  +        +   +   +++   E +  ++ A
Sbjct: 634 SMVSIYGRRQMVAKANGVLDYM-KERGFTPSMATYNSLMYMHSRSADFGKSEEILREILA 693

Query: 671 EGCGLGTRFYNTLLDALWWLGQKGRAARVLIEATKRGLFPELFRKSKLVWSVDVHRMWE 728
           +G       YNT++ A     +   A+R+  E    G+ P++   +  + S     M+E
Sbjct: 694 KGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYAADSMFE 751

BLAST of Moc03g30900 vs. TAIR 10
Match: AT2G41720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 246.5 bits (628), Expect = 7.8e-65
Identity = 156/551 (28.31%), Postives = 270/551 (49.00%), Query Frame = 0

Query: 90  LSSLPPRGSIARCLDIF------KNRLSLNDFSFVFKEFAARGDWQRSLR-LFKYMQRQI 149
           +  L  RG I  C+++F      KN  + ND   +     AR +W    R LF  MQ+  
Sbjct: 114 IRELSRRGCIELCVNVFKWMKIQKNYCARNDIYNMMIRLHARHNWVDQARGLFFEMQK-- 173

Query: 150 W-CKPNEHIYTIMISLLGREGLLEKCSEIFDEMASQGVIRSVFSYTALINAYGRNGQYET 209
           W CKP+   Y  +I+  GR G       + D+M    +  S  +Y  LINA G +G +  
Sbjct: 174 WSCKPDAETYDALINAHGRAGQWRWAMNLMDDMLRAAIAPSRSTYNNLINACGSSGNWRE 233

Query: 210 SLELIERMKRERVSPNILTYNTVINACARGDLDWEGLLGLFAEMRHEGVQPDLVTYNTLL 269
           +LE+ ++M    V P+++T+N V++A   G   +   L  F  M+   V+PD  T+N ++
Sbjct: 234 ALEVCKKMTDNGVGPDLVTHNIVLSAYKSG-RQYSKALSYFELMKGAKVRPDTTTFNIII 293

Query: 270 SACAARGLGDEAEMVFKTMIE--GGIVPEITTYSYIVETFGKLGKLEKVAMLLKEMESEG 329
              +  G   +A  +F +M E      P++ T++ I+  +   G++E    + + M +EG
Sbjct: 294 YCLSKLGQSSQALDLFNSMREKRAECRPDVVTFTSIMHLYSVKGEIENCRAVFEAMVAEG 353

Query: 330 YLPDISSYNVLIEAHAKLGSIKEAMDVFKQMQAAGCVPNASTYSILLNLYGKHGRYDDVR 389
             P+I SYN L+ A+A  G    A+ V   ++  G +P+  +Y+ LLN YG+  +    +
Sbjct: 354 LKPNIVSYNALMGAYAVHGMSGTALSVLGDIKQNGIIPDVVSYTCLLNSYGRSRQPGKAK 413

Query: 390 ELFLEMKESSAEPDATTYNILIRVFGEGGYFKEVVTLFHDLVEEKIDPNMETYEGLVFAC 449
           E+FL M++   +P+  TYN LI  +G  G+  E V +F  + ++ I PN+ +   L+ AC
Sbjct: 414 EVFLMMRKERRKPNVVTYNALIDAYGSNGFLAEAVEIFRQMEQDGIKPNVVSVCTLLAAC 473

Query: 450 GKGGLHEDAKKILLHMNEKGIVPSSKAYSGLIEAYGLAALYDEAVVAFNTMNEVGSKSTV 509
            +     +   +L     +GI  ++ AY+  I +Y  AA  ++A+  + +M +   K+  
Sbjct: 474 SRSKKKVNVDTVLSAAQSRGINLNTAAYNSAIGSYINAAELEKAIALYQSMRKKKVKADS 533

Query: 510 DTYNSLIHTFARGGLYKEFEAILLRMRESGISRNVKSFSGIIEGYRQSGQFEEAIKAFVE 569
            T+  LI    R   Y E  + L  M +  I    + +S ++  Y + GQ  EA   F +
Sbjct: 534 VTFTILISGSCRMSKYPEAISYLKEMEDLSIPLTKEVYSSVLCAYSKQGQVTEAESIFNQ 593

Query: 570 MEKMRCEPDEQTLEAVLGVYCFAGLVDESKEQFHEIKATGILPSVLCYCMMLAVYAKNGR 629
           M+   CEPD     ++L  Y  +    ++ E F E++A GI P  +    ++  + K G+
Sbjct: 594 MKMAGCEPDVIAYTSMLHAYNASEKWGKACELFLEMEANGIEPDSIACSALMRAFNKGGQ 653

Query: 630 WDNAYELLDEM 631
             N + L+D M
Sbjct: 654 PSNVFVLMDLM 661

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022137367.10.0e+00100.00pentatricopeptide repeat-containing protein At1g74850, chloroplastic [Momordica ... [more]
XP_038894203.10.0e+0093.01pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 ... [more]
XP_038894204.10.0e+0092.90pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 ... [more]
XP_008437850.10.0e+0091.75PREDICTED: pentatricopeptide repeat-containing protein At1g74850, chloroplastic ... [more]
KAG6584246.10.0e+0090.95Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
Q9S7Q20.0e+0071.36Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop... [more]
Q9SIC99.3e-7926.63Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop... [more]
O646241.0e-7727.24Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop... [more]
B8Y6I01.8e-7427.80Pentatricopeptide repeat-containing protein 10, chloroplastic OS=Zea mays OX=457... [more]
Q9LYZ93.4e-7325.04Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1C8270.0e+00100.00pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Momordic... [more]
A0A1S3AUM40.0e+0091.75pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 ... [more]
A0A1S3AVM10.0e+0091.64pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 ... [more]
A0A6J1E7U50.0e+0090.84pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Cucurbit... [more]
A0A6J1KL330.0e+0090.95pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 ... [more]
Match NameE-valueIdentityDescription
AT1G74850.10.0e+0071.36plastid transcriptionally active 2 [more]
AT2G31400.16.6e-8026.63genomes uncoupled 1 [more]
AT2G18940.17.3e-7927.24Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G02860.12.4e-7425.04Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G41720.17.8e-6528.31Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 562..636
e-value: 4.2E-14
score: 54.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 94..232
e-value: 6.2E-36
score: 125.5
coord: 354..439
e-value: 1.1E-21
score: 79.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 233..351
e-value: 5.2E-32
score: 113.5
coord: 443..561
e-value: 1.2E-23
score: 86.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 501..530
e-value: 4.3E-4
score: 20.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 533..580
e-value: 1.6E-7
score: 31.4
coord: 322..368
e-value: 1.3E-13
score: 51.0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 599..630
e-value: 1.0E-5
score: 25.1
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 431..463
e-value: 0.0014
score: 16.7
coord: 291..324
e-value: 2.6E-5
score: 22.1
coord: 326..359
e-value: 1.9E-10
score: 38.3
coord: 501..533
e-value: 1.1E-5
score: 23.3
coord: 184..218
e-value: 2.7E-8
score: 31.5
coord: 150..180
e-value: 4.0E-6
score: 24.7
coord: 536..569
e-value: 7.4E-7
score: 27.0
coord: 255..287
e-value: 1.3E-7
score: 29.3
coord: 396..428
e-value: 4.4E-6
score: 24.5
coord: 606..635
e-value: 1.2E-5
score: 23.1
coord: 361..394
e-value: 4.1E-8
score: 30.9
coord: 220..253
e-value: 3.9E-4
score: 18.4
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 373..406
e-value: 0.0066
score: 16.5
coord: 241..301
e-value: 1.9E-13
score: 50.2
coord: 170..228
e-value: 3.5E-15
score: 55.8
coord: 416..475
e-value: 4.5E-6
score: 26.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 217..252
score: 11.279235
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 288..322
score: 10.172144
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 393..427
score: 10.621557
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 498..532
score: 10.281757
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 147..181
score: 10.259834
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 182..216
score: 12.386327
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 358..392
score: 12.33152
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 428..462
score: 10.292718
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 568..602
score: 9.163705
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 603..637
score: 10.226951
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 533..567
score: 11.268274
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 323..357
score: 13.153618
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 253..287
score: 12.58363
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 839..873
NoneNo IPR availablePANTHERPTHR47937:SF1PLASTID TRANSCRIPTIONALLY ACTIVE CHROMOSOME 2-LIKE PROTEINcoord: 34..834
NoneNo IPR availablePANTHERPTHR47937PLASTID TRANSCRIPTIONALLY ACTIVE CHROMOSOME 2-LIKE PROTEINcoord: 34..834

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g30900.1Moc03g30900.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0042793 plastid transcription
biological_process GO:0045893 positive regulation of transcription, DNA-templated
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding