Moc02g08840 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc02g08840
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr2: 6247752 .. 6259822 (+)
RNA-Seq ExpressionMoc02g08840
SyntenyMoc02g08840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTCCAGGTTGGCCTTGAGATTCTTCAATTTCTTGGGATTGCATAGGAATTTCCATCACTCCACTGCCTCATTTTGTATTTTAATTCATTCTTTGGTTCAGAGTAGTCTGTTTTGGCCTGCCTCTTCGCTTTTGCAAACCCTTTTGCTCCGTGGGCTAAACCCACTGGAGATTTTTTCGAATTTTTTGGAATCTTATAGGAAGTACAAATTTTCTTCTAGTTCGGGTTTTGATATGTTGATTCAGTATTACGTGCAAAACAAGAGGGCAATGGATGGTGTTCTGGTTATAAATCTCATGAGAGACCATGGGATTTGGCCTGAAGTTAGGACTTTAAGTGCTCTGTTAAATGCTCTTGCGAGAATCAGGAAGTTCCGCCAGGTCTTGGAACTCTTTGATACCCTTGTGAATGCGGGTGTTAAGCCTGATAGTTATATTTACACGGTGGCGGTTCGATGCTTGTGTGAATTGAAGGACTTTAGCAAGGCCAAGGAAGTAATCAATCAGGCCGAGTGCAATGGATAAAATTAATCATTTTCATGCAAAACTAAAACATTATGAACCATAATTTTCCATAGCAATTTATAAAAGATAAATTATCATGCAAATAAAAAGTGGTACATATTATAATAGTACTACAAACTTTTCCCATTAAACACACATTTGTGTGTAGGGATGTCTACTTAATACTTTCCATCAATAGACAAAAGACATGCTTAATTAGTCAATGCGAACTTAGCTCAATTAGTTAAGATAAAAAAATCGCGAGTTTGAATTTCCACTCTAATATATCTAACATGCATGTTGTCGTATTAAAAAAAAAAGCTAAATTATATTACGTTTGTAAAAATAAAATTATGTAATATCTATGAAGAAAAAAGAAAAGAGGGAGATGACTAAATTGCAAAATTTTTATTGTAAAACATATATAATAAAAAAACAGAAGCAACAGAAAAGGCCGGAGCCGCATAGCCCAGGCCCAAATATTTTCAGTAGATAATGGAAATGGACATGGGCTAACGTGACTTAATATAGGCCCAGTTTAGCAAATAAATCATATAGGCCCAAAAACACGAGAATACAAAAAAAAAAAAAGAAGAAAAAAACTGTGGCTGCTGAGATTCGAGCCCAGGTCTCCACGGCCACAACGTGGAATTCTTACCACTAAACTACAGCCACTTGTTGTGCTAAAGAATTCCAACTAAACAAATTAAGCAAAACAAAAGAAGACTAAGAAGTAATTTATGATTAATAAAAGGTTATTGGATTCATAATATAAATATTAGAATGAGATTCTATAATTTGTTATAAAGTACTTTTTTAAGAGTTATCTTTTAGACTATAGTTTTGTCACAGTGATATAGATATATGTTTATATGATCTTTACGAGAAAACAAATGTATAAAGTATAGAACAACAAATTGTCGTTTTCATTTGACACTATTAGGTACACGTAATATAAAAACTATAAATTTTAGCATATAAACAAATATTTAAATAAAAAGTCAAATCGTTAGGCAAATTAGATACACATGACACCATCGCACCTTTACTTACGTCAAAGTTTATACCTCATTAATCTCGTTTATTTTATTTTTTACATCATTCACGCATGAATTGGTGTTTTAAACTAAAATAAAATTGTAACTTATTGAACTGGGTAGTCTTAAAATTTAAATTGCAATTAGAGAAGTTTAATAATTTAAGAACATAGTATTGAAATTGTACTAAATTTAAGACAAATAAGGTTTTTGAAATTGTATTAAATCATGGTTTAATAGTATAATATAATGTTTTTCGAAATCGGACATTTGCGTAATTTGGGCCTTCCCGCTGCCCGGTTTCAGGAAAATTTTACCCCCGCCCGCCAAAATACATTTCCAAAGCACAATCCGCAAAGCGTTCACACGGATCGATCTTTACGACCGTCGATTCATTTACAATCCAACGGACGGCGTGTTTCCCGGATGAACTTTAAACCCAGGGCGTGAGATTTTTAACTTCCATAAATTCCCCAAAATCGCAACATTTCGGCAGCGAGATCATAATCTCTGATCGATCGAAGCGTTGTTTGTTCTCCAAATTCAGTCGATAAAATGTCTGGGCGTGGGAAGGGAGGCAAGGGATTGGGAAAGGGAGGAGCCAAGCGTCACAGGAAGGTTCTGAGAGATAACATTCAGGGCATTACGAAGCCTGCAATTCGTCGTCTCGCTCGTAGAGGTGGCGTGAAGCGTATCAGCGGGTTGATTTACGAAGAAACCAGAGGAGTTTTGAAGATTTTCCTCGAGAATGTTATCCGTGATGCTGTTACCTACACTGAGCACGCCAGGAGGAAGACGGTGACTGCCATGGATGTGGTTTATGCATTGAAAAGGCAGGGAAGGACTTTGTATGGTTTCGGAGGTTAGGTTTTAGGTTAATTTCACAAATGAATGTATATCTCGGCTCTGGCTTCCCAGCCATTGTTGTTCTTTTAGTTTACCTGGAAGTCGTTCACCGTTGTTCTGGCAATGAAATTTCTGTTCATTGTGTATTCATTCCTGGTTTTGCCTTTCGTTTTATTACCAGTATATCTTATAGCTACTCGGTTTTTTTGTTCTTACCCGTAATCGCTATAAAGAGGTCTGGCGGAACAGAGTTCCAACATCCTATTCAGTTGCTGAAATTCAATACGTTTTCATGTCTAAATCGATTATAAGAGAACTACACTATCATCTATAAGATTGTTTCTCACGCATCCATCATAATGTTTAAGGTCATGAAGCGTAAGCAAGAAACGGTATACCAGCCTCCTTAATGCGCTATGCTAATGTTATCCAGTTGGCTTGCAATCTCCTGGATTATTCGCATAAAATTGTTCGGAGAAATTTTATAACTTAGAAAACTACATCAAGTATCTCTGCCTCTATGCTTGAGTTTTAATTGTTATTTATGTTCAACAAGTTTTAGAAATAAATGTTGAATATGAAACCCTAGTATAGTAGGTTAAACGAAGTAGGAATGCGTCCTTATGTATGAATTCGAGAATGCGATGGTATTGGTTACATGTTCCTCTTGCAAAAGATTAGCTAAATGATTACAGCACTAGACTTTCAATTTGATATTTCTGAGCTACTTTTGATAAACCCTCGAAGCAGGTGTAGCAACTAAGAGGATGGGTGATGAACATTACCTAAATCAGAGACCAATGAAATAGTATTATAGTGTATGCAATGGGAGAGACATCTAGTTTAGTGTAGCTCTTATACACAACACCTTTAGTTAATGTTCGTGGTTTGAGAATTCGATCTCAAAGATGCTGTTTTTCTTTTTAGTTTGGGGCGAAGATTTGAATCTTTAATCTTTTGGTTGATGACATGTGCCTTAATTAGTTGAGCGATGTTTAGCTCAAAATGTTTAATCCATAGGTTAAGCATAGATCCAAATGCTGAATCCACTTGTAGGTACGAATGCTGAATCCACTTGTAGGTACGAAGAGCTGTTGGTGAGAGAGAGAGAGAGAGATTAACCATAAAATTTGAGGTATGAAGTCTATGCGTCATTTTCTTTGGTGAGGTCTTCAGGCTGTGCAGGTTCTTTGTTATTGGGCATTTTAATGGATTCAAATGTGAACTCCTTCCTTCCAAGGTGTCAAAGGCGTGCAGGTATGTGCTATTAACTTCTCATTGATTTTTCGAGACCAGTTTTCAATAATTTTGACAAACTAGTAAGACAGCTTTCATACTATTGTCGAATCTTATCTCAAGTTCGTGACGTTATGTCCCTTAGCAAAATAATACAATTCAGGAGTAACTCCTGTGAATCCTCGGCCAAATTGATGCATACATCTAGCATAAACTTCTCTGCTTGTTCTGGTCCTGGAAGGGTGCTTTATATCTCAATTTGTTGGTCTCATCACTCATATTGTCTTATATCTGTGTTTTTTTGACATGTTTCTTGTCTTTTTTTGACATGTTTCTTGTCTATGGGCCTATGTCTCGATGTAGATCCTAACTCATTCTGTTGGGAAATTGGGATGTTGTACATATTGGACATGGTAAATATGGACAGGCTACCACAAGGCTTAGATCCGCACTTATTTCAAATTAATCCATCTGAAAAACCTGAATTCAACTGTGGAAGTAGAATGGAAACTGCTTTTTAAGTCTCATCTCTACTAATTTACTAACCTTACTCCTATGCTAGGATGTTTATGATCTGCATTCTCCTGTTGCTTTCGCCCAATATGAAGTTGGTAAGAAGTGGTGGGTCATATTTGCCTGTAGATAAGTGAATCAATGTGATTCCAGTTATCCCATGTTCAAGTTCCCAAGATGCATTGTTTCTTGGATGCTTTATATTTTTCCAGATGGCGATGAACATTCTAATGCTTTCACCACAGCCATTTCTTGCTCTATTGGTTGCAACATTGCGTTCTGATTAGACAGTACGGTGATGAACATTCTGATTCCCCTTTGAAAACTCTACTACAAATAAAGAGGTATTGAGATCAGATTAGATGGCACAAGGACTATACAAGTGTAATAATCTCTTCTGCGTTCTCAAACATTTCATGAATAACACAGATGAGGACTTTTATGAGCACTCCTTGCAGTAATGGAATTGTGTAGTAGATATGATTTTGAATAATGAAGGTTGAACCCCTAAAGTTAAAAGAATGCAAAAGATCCACTCCTTCGGAAAGGGAACTCATTCCCTCTGAGTCCTTTACTATTTACAAAGGCCTTCATTGATCACTTGAGCCATCCCATGATGGTTTCCTAATTTTCTATATTGAAAAATAATACAAACTTGAGAAATAGGTATTACAAATCAGAAGTATAAATTGAAAACCCATGTTTTCAATACTGCCATTTTCCTTCTCATTGTTGCATTTCTTGGATTCTAATCTAAATATGTGCTTTTCACTTTTCATTTTTCTAGTTTAGTCCGTATATATCTTACTTTTAATAATTATATTATCTTTATCAATTAGTATTTTCATTTCATTCAATTATGGAGTTTCATTTTTCCTTATTATGATCTAACTCTGTGTATTTCATTTATATATATATATATATATATATATATATATTTCACTTTTAATAATTAAATAATTTTCGTAAATTATTCTTCTAACTTCATTAGATTATGGACTTTGAACCAAGAAAAGGACATTATATATGCATATTTGTATTAGATAAATTTTAGAAACTTTACTTTTCTATTAATTTGATTGATCCTTGTGTTAGATAGAATCATTTATGGTCAATGAGAACCAATCTAAATAGTAAAGGGTTTAGAGGGAATGAGTTCAAACCATGGTGACCACCTAAGTAAGATTTAATATCCTACAAGTTTAGGTGCAAATCCCTTTTTTTTTATTTTCAGGAAGGCCAAATTATTATTTTTTCCTTTCAATAAATTATGTAATTTAAATCAGGTATACAACATTGAAGTAATGTAAAGTCAATTAAAATATATGCAGCATTACGAGAACAAAAGATAAAATTGATGAGGAAAACAACTAATATTTAATATTTAAAACACAATCCAAATTATATCAATTAAATTAAACCAATAATTTATATCGGTAATTTTCCCCTCAAAATTGGTGTAAATTGTAATTTATTTGGCCTCAATTGTACTTTTGGGCCACAAAATTTGGAATTTCGCGCTGCCCGGCTTCCCGGAATTTCCCCTGCCCGCCAAAACCTGACTGAAACTCACTTTCAAAGCTCGATCCACAGCGCTTTCACGCGGATCGATCTTTTCAACCGTCGATTCATTATAAATCCAACGGACTAAGTGATTCGCACCCGAAACTTCCGTACCATAAATACCCCAAACCAGTGCAGCAGAGCAACAAATCGACAATTTCCGATCAGTTCTAGCGCGAATTCTTCCCGAAAATCAAACGCGAAAATGTCTGGGCGTGGGAAGGGAGGCAAGGGCCTGGGAAAGGGAGGAGCCAAGCGTCACAGGAAGGTTTTGAGAGACAACATTCAGGGCATCACGAAGCCTGCAATTCGCCGTCTCGCTCGTAGAGGTGGTGTGAAGCGCATCAGTGGTTTGATCTATGAGGAAACCAGAGGCGTTTTGAAGATTTTCCTCGAGAATGTGATTCGCGGTGCTGTTACCTACACGGAGCACGCCAGGAGGAAGACGGTGACCGCCATGGATGTGGTCTATGCGTTGAAGAGGCAGGGAAGAACTCTGTATGGATTCGGAGGTTAGGTTACAGCTTGATTTAACGAATCGTCGTACTGTATAGCTTGTCTCTTGTTTTCTACTCGTTTGGCAAGTTGTTTACCAATTGTTCTGGCAACGAGATTCTGGTTTATTGTCAAATTATAATATAAGGTTCTATTCCTTCTGTGGATTTGAAAATTGATTCTGTCGTCCAATTATCAAGTTCAAGAGTGCTCGGTTTTTGCTTTATAATTGATGAACATGTTTAATCAAGAAACGGAAGACCGCGTTCAAAATTTCGCCCTATTATCGATGAACTCAAGAATATAATTCCTAAACATGACTGTGACGAACCTTGCGAAACTATTGAATAAATTTGACGTCCTTGTGAAAGAACTTCAAAATTTGGAGAGGAGACGATCCTTGGTACGAAACAACCATTAAATTAACAGTTTGGTACTACAATGAGCAAGTAGTCGTCCTCTCACAGCAATTGATTATTATGATGTCAATTAGAATTATGAATATATTTGTCAATTCTTGCAATTTCCAATTCCAAAATATAAATGTCAATGAATCAAGCTTTCAAGTTTAGAACTGATAATGTGTAAAGATGCTTAATAGAATAATGTATAAGCACGGTTGATATTTGGGTGCCTCTTATTTATTACCATTAGAAAATTGATTTTCAAGCAGTGAATGCCTTATTGGATGGCTAAGAGAGAAAAGTCTACAAACAAGATGAGAAAATGTTTATTCCACACAAAAAAAAAATGATAAATAAATTTTTACTGAAAAAGATATGTCAATTTATTCATGGGCATTTGGGCGAAATAGCCTTTTTCAAAGGTGTGGCATCTCTCCTCATTTGTATGTTTTGTAAATTTTGTGGAAATGATACGTATTTACTCTCTTGATTTTTTCTTCCATTTATTACTTTTTTCTCCTCTTTCTTCCCTATATCGTTACAAATTCTCTTTTTCTTGAAAAATTCAACAAACTTTCTCTATCTTATTTACTCACCATAAAAAAACCTTAAATATTTATTACTATCTTCCTCCTTATCACTCTTAAGTTGCCTCTTCTCATTTCCATTCTTTTTTCTTCACCCTTTTTGAAAAAACAAGCAACAAGCTACTCCATTTACCATTTTTTTCGATGAGAGACATACGTTGAGAGCGAGAGCAGGAGCGAGAGGGGCGAGAGAGCAAGAGCCTTTTTTTGTGTGTTCTGTTTTTTTCTTTTTTATTTGGGGGGTTTATGTCAACTCTTTTTCTTAGTGTTCTTTGTTGAGTTTTTACCTCTCTTTTATGTGTGATTGGTTTTTTGTGCATGCATTTGTTATTTTGTTTTTGAATTTTAAGCTCCAACAATGGAGTCAACATAGAGAGGAGATAAATGAAAAATAAAGAAGAGAGAGGAGAGAGAAAAATGCATGCCTTCAAAATCAAAACAAAAAAATCCAAAAGCAAAGGAAATCGAATGGGATGTGCTATAAAAATATTTATATATAACTTGATGTATACTTGTGTCATAGGATTTTTGGTAGGTTATTAGATATCCAAATTTACCAAACCTACCAGTGGTAGGTTTAGCTTTTGGATTGAAGATGAGTAAACGACTAATTAATCGAAAAGCTTAAGTTGGATAAATTTAATTCATATATTAGATATCTATTATACGTAAATGAATTCATTATTATAAAACTTGATACTATTTTTACAAATTTTATTGAGGTATTCTAGTAAGTGAGGTGGGGATTTGAACTTTTGGCCTTTGACTCGAGGATAATTATCTAAATTTGTTGGACTATGCTTTGATTGACCTACTAATTTGAATGCAAGTATTTGTACAAATTACCAACCATTACAACTTAAATGGTATTTTTCACATAGAATTTTGCAATATCTTTTGAAGTTTACAACATTATAAATACTTGATTTAATATCAATTTGAGTCTAGATGAATTGGTTAAGGTGTGTATTATTGATAAGAAAGCTGTGTGTCCGTCTATTTGAGCCTTACTTCCACGTGTTATTGTATGAAAAAAAGAGGCACTTAACTAATTAATAATTTTTAATGCACTTAAAATAACATTCAGAATACATTTTTCATAATATGTTTTGGTTCCTTTTCGGTACAATTGAGAGCTATAAAACACAGTTTAAATATTAATTCTAAGGATAAAATTTATCTATCAATTTTAAATAAATCATAAATGATATTTTCTGTTGCTCTTCATCTCGAGTGATTTCAAATTCTAGTGGATTTAAAAATGTTATCATATGAAAGAAAACAATTTGAAATATTCGTGTTTTTAACCAAAAAAAGGAAAAAAAATCACCAAACACAATTGGATATTGTATAATTTGATTTCAAATTCTACTGGATTTTAAATGTTACCCTATGGAAGAAAACAATTTGAAACATAAGGTTTTTATCCAAAAAAAAAAAAAAAAAAAACTAACGCAATTGGATATCCTATAATTTTATAGAATTTTATCATAAATTACTTATTTAACGTTTTCATATATTCTACTTTCAATTAACAAATTCAAATTACAAAATTTCTTCTAGTAAAATTTATCTTTTATTTTTATTTCCGACAATTATTTAAACAAATAAATTTTGGATCCAAAACGAGATAAAAAAAATTAAAAGTCATTTATTAAATTTAAATGTTCACAATACACTTTAAAAAAAAAAAAAAAATCTCACTATAATTTCATACAGCTGTCGGTTAGGTGTGGTGATTGGGCTTTTTACGATAATTTTGGGCCTTCCCGCTGCCCGGATTCAGGAATTTCACCATGCCCGCCAAAAATGAAACACAAATCCACTTTTCCAAAGCACAATCCGCAGAGATTTCACGCGGATCGACCTTTAGAACCGTCGATTTTCAAAAGATCCAACGGACTCTGTAATTCCCAGCTGAATTTGCACCTCCATAAATTCCCCAATTCGCTACGTTTGTTCGCAGCATAATCAGAACCTTCTGATCGGAACAAGCGCGGTTTATCCCCAAAATCAGTGCAAGAAAATGTCCGGCCGTTGAAAGGGAGGCAAGGGTTTGGGAAAGGGAGGAGCGAAGCGTCACCGGAAGGTTTTGAGAGACAACATTCAGGGCATTACGAAGCCTGCAATTCGCCGTCTCGCCCGTAGAAGTTGAAGCATTGAGTTTCGAACTCGATGAAGCTCATTCAGTCTCGTCGATCGCTGAGAACTCCTAATCTAGGCAGAAGGAGATTCAAGAAGTATTGTACATGGCGGAGGAACCTCGAAGAGGACTGTGAAAATGATTCGCAGTTCATTTACGCACTCGAACAAATTGTGCGAGGAAAGCAAAGCTGGAAGATCGCCTTCGACAACGCATTCATTTCAGGGACTTTAAAGCCCCATCACGTAGAAAATGTTTTGATTCGAACTCTCGATGACTCCAGGTTGGCCTTGAGATTCTTCAATTTCTTGGGATTGCATAGGAATTTCCATCACTCCACTGCCTCATTTTGTATTTTAATTCATTCTTTGGTTCAGAGTAGTCTGTTTTGGCCTGCCTCTTCGCTTTTGCAAACCCTTTTGCTCCGTGGGCTAAACCCACTGGAGATTTTTTCGAATTTTTTGGAATCTTATAGGAAGTACAAATTTTCTTCTAGTTCGGGTTTTGATATGTTGATTCAGTATTACGTGCAAAACAAGAGGGAAATGGATGGTGTTCTGGTTATAAATCTCATGAGGGACCATGGGATTTGGCCTGAAGTTAGGACTTTAAGTGCTCTGTTAAATGCTCTTGCGAGAATTAGGAAGTTCCGCCAGGTCTTGGAACTCTTTGATACCCTTGTGAATGCGGGTGTTAAGCCTGATAGTTATATTTACACGGTGGCGGTTCGATGCTTGTGTGAATTGAAGTACTTTAGCAAGGCCAAGGAAGTAATCAATCAGGCCGAGTGCAATGGATGTGGTTTGAATATTGTAACTTATAATGTGTTTATCCACGGGCTCTGCAAGAGCAAGAGAGTTTGCGAGGCTATTGAGATCAAGAGATTGCTAGGTGAAAAGGGTTTGAAAGCCGATTTGGTTACATATTGCACATTGGTACTCGGATTGTGCAGAATACAGGAATTCGAGATTGGTATGGAGGTGATGGATGAAATGATTGTGTTGGGTTTTGCTCCGAGTGAAGCTGCTGTTTCAGGAGTCATAGAGGGGTTGAGGAGAATGGGGAATATCCAATGTGCTTTTGAGTTGCTAAAAAAGGTTGGGAAACTTGGAGTAGTGCCTAATCTATTTGTTTATAATTCAGTGATCAATTCATTGTGCAAAAGTGGAAAATTGGAAGAAGCTGAGTCGCTTTTCAGTGTAATGACTGAAAGGGGTTTGTTTCCAAATGATGTCACATATAGCATCTTGATAGAGGGGTTTGGAAGAAGGGCCGAATTGGATGTTGCTATCAATTTCTTCAATAAAATGGTTGAATCTGGCATAAGTGCAACTGTGTATTCCTATAATTCTTTGATAAGTGGTCAATGCAAGTTTGGGGACATGAGAACAGCAGAGTTTTTCTTCAAGGAGATGGTTGACAGAGGATTGATACCAACTGTGGCAACTTATACTTCATTGATAAGTGGATATTGCAGAGAAGGATTAGTACCCAAGGCATTCAGGATATATCATGAAATGACTGGAAAAGGCATTGCCCCAAATACTTTTACATTCACTGCTCTTATTTGTGGTCTTTGTCATATCAATAAAATGGCTGAAGCCAGTAAATTATTTGATGAAATGGTTGAACTCAACATTCTTCCAAATGAGGTGACCTATAATGTTTTGATAGAGGGCCACTGTAGGGAAGGTAACACTACAAGAGCTTTTGAATTGCTGGATGAAATGATTAAGAAGGGCCTATTACCAGACACATACACCTACAGGCCCCTAATTGCTGGTCTTTGTTCTACAGGTAGAGTTTCTGAAGCAAAGGAGTTCGTAAACGACCTTCACCACGAGCATCAAAAGTTGAATGAGTTGTGCTATACTGCACTTCTGCAAGGTTTTTGCAAGGAAGGAAGAATTAAGGAAGCGTTAATTGCTCGCCAGGAGATGGTAGGACGTGGATTACGCATGGATCTAATAAGTTATGCTGTGCTTATATATGGAGCTTTGAAGAAAAATGATGGAAGGTTGTTTGATCTTCTGAGGGAAATGCATAGTCAAGGAATGAAACCCGACAGTGTAATATACACCACTTTGATTGATGGGTACATCAAAGCAGGAAATCTCAGAAAGGCGTTTGGATTTTGGGACATTATGACTGGTGAAGGATGCATTCCCAACACTGTGACATACACAACGTTGGTGAATGGATTATTCAAGGCAGGATATGTCAGCAAGGCCAAACAACTTTTAAAGCGTATGCTAGTCGGTGAGGCCTTTCCCAATCACATAACTTATGGTTGTTTTCTGGATCACCTCACAAAAGAAGGAAATATGGAGAAAGCTCTGCAACTTCACGATACAATGCTAATAGAAACTTTAGCAAATCCTGTCACATATAATATACTAATCCGGGGTTATTGCCAGATGGGAAAATTTCAGGAGGCAGCCAAGCTTCTTGATGGAATGATTGGAAACGGTATCGTTCCAGATTGTATCACTTACTCCACATTTATCTATGAATACTGTAGGAGGGGTAATGTTGATGCAGCTATTAAGATGTGGGAGCGTATGTTTAATAGGGGCCTGAAACCTGATACAGTAGCATTTAACTTTCTAATATATGCCTGCTGTCTTACTGGTGAACTAAACCGGGCTCTGCAATTGCGCGACGAAATGACGTTGAGGGGTTTAAAACCGACTCGATCAACATATTATTCCCTGATTCATGGGACTTGCTTAACGAGCTAG

mRNA sequence

ATGACTCCAGGTAGAGTTTCTGAAGCAAAGGAGTTCGTAAACGACCTTCACCACGAGCATCAAAAGTTGAATGAGTTGTGCTATACTGCACTTCTGCAAGGTTTTTGCAAGGAAGGAAGAATTAAGGAAGCGTTAATTGCTCGCCAGGAGATGGTAGGACGTGGATTACGCATGGATCTAATAAGTTATGCTGTGCTTATATATGGAGCTTTGAAGAAAAATGATGGAAGGTTGTTTGATCTTCTGAGGGAAATGCATAGTCAAGGAATGAAACCCGACAGTGTAATATACACCACTTTGATTGATGGGTACATCAAAGCAGGAAATCTCAGAAAGGCGTTTGGATTTTGGGACATTATGACTGGTGAAGGATGCATTCCCAACACTGTGACATACACAACGTTGGTGAATGGATTATTCAAGGCAGGATATGTCAGCAAGGCCAAACAACTTTTAAAGCGTATGCTAGTCGGTGAGGCCTTTCCCAATCACATAACTTATGGTTGTTTTCTGGATCACCTCACAAAAGAAGGAAATATGGAGAAAGCTCTGCAACTTCACGATACAATGCTAATAGAAACTTTAGCAAATCCTGTCACATATAATATACTAATCCGGGGTTATTGCCAGATGGGAAAATTTCAGGAGGCAGCCAAGCTTCTTGATGGAATGATTGGAAACGGTATCGTTCCAGATTGTATCACTTACTCCACATTTATCTATGAATACTGTAGGAGGGGTAATGTTGATGCAGCTATTAAGATGTGGGAGCGTATGTTTAATAGGGGCCTGAAACCTGATACAGTAGCATTTAACTTTCTAATATATGCCTGCTGTCTTACTGGTGAACTAAACCGGGCTCTGCAATTGCGCGACGAAATGACGTTGAGGGGTTTAAAACCGACTCGATCAACATATTATTCCCTGATTCATGGGACTTGCTTAACGAGCTAG

Coding sequence (CDS)

ATGACTCCAGGTAGAGTTTCTGAAGCAAAGGAGTTCGTAAACGACCTTCACCACGAGCATCAAAAGTTGAATGAGTTGTGCTATACTGCACTTCTGCAAGGTTTTTGCAAGGAAGGAAGAATTAAGGAAGCGTTAATTGCTCGCCAGGAGATGGTAGGACGTGGATTACGCATGGATCTAATAAGTTATGCTGTGCTTATATATGGAGCTTTGAAGAAAAATGATGGAAGGTTGTTTGATCTTCTGAGGGAAATGCATAGTCAAGGAATGAAACCCGACAGTGTAATATACACCACTTTGATTGATGGGTACATCAAAGCAGGAAATCTCAGAAAGGCGTTTGGATTTTGGGACATTATGACTGGTGAAGGATGCATTCCCAACACTGTGACATACACAACGTTGGTGAATGGATTATTCAAGGCAGGATATGTCAGCAAGGCCAAACAACTTTTAAAGCGTATGCTAGTCGGTGAGGCCTTTCCCAATCACATAACTTATGGTTGTTTTCTGGATCACCTCACAAAAGAAGGAAATATGGAGAAAGCTCTGCAACTTCACGATACAATGCTAATAGAAACTTTAGCAAATCCTGTCACATATAATATACTAATCCGGGGTTATTGCCAGATGGGAAAATTTCAGGAGGCAGCCAAGCTTCTTGATGGAATGATTGGAAACGGTATCGTTCCAGATTGTATCACTTACTCCACATTTATCTATGAATACTGTAGGAGGGGTAATGTTGATGCAGCTATTAAGATGTGGGAGCGTATGTTTAATAGGGGCCTGAAACCTGATACAGTAGCATTTAACTTTCTAATATATGCCTGCTGTCTTACTGGTGAACTAAACCGGGCTCTGCAATTGCGCGACGAAATGACGTTGAGGGGTTTAAAACCGACTCGATCAACATATTATTCCCTGATTCATGGGACTTGCTTAACGAGCTAG

Protein sequence

MTPGRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISYAVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGEGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKALQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTRSTYYSLIHGTCLTS
Homology
BLAST of Moc02g08840 vs. NCBI nr
Match: XP_022148373.1 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 [Momordica charantia])

HSP 1 Score: 654.1 bits (1686), Expect = 6.3e-184
Identity = 314/314 (100.00%), Postives = 314/314 (100.00%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY
Sbjct: 554 GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 613

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE
Sbjct: 614 AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 673

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA
Sbjct: 674 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 733

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 734 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 793

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR
Sbjct: 794 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 853

Query: 304 STYYSLIHGTCLTS 318
           STYYSLIHGTCLTS
Sbjct: 854 STYYSLIHGTCLTS 867

BLAST of Moc02g08840 vs. NCBI nr
Match: XP_022148372.1 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 [Momordica charantia])

HSP 1 Score: 654.1 bits (1686), Expect = 6.3e-184
Identity = 314/314 (100.00%), Postives = 314/314 (100.00%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY
Sbjct: 589 GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE
Sbjct: 649 AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA
Sbjct: 709 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR
Sbjct: 829 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 888

Query: 304 STYYSLIHGTCLTS 318
           STYYSLIHGTCLTS
Sbjct: 889 STYYSLIHGTCLTS 902

BLAST of Moc02g08840 vs. NCBI nr
Match: XP_023511929.1 (putative pentatricopeptide repeat-containing protein At5g59900 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 562.0 bits (1447), Expect = 3.3e-156
Identity = 266/313 (84.98%), Postives = 288/313 (92.01%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEF+NDLHHEH++LNELCYT LLQGFCKEGR+KEAL+ARQEMVGRG+RMDLISY
Sbjct: 589 GRVSEAKEFINDLHHEHRRLNELCYTELLQGFCKEGRVKEALVARQEMVGRGMRMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALK+ND RLFDLLREMHSQGMKPD VIYTTLIDG IKAG+LRKAFGFWDIM GE
Sbjct: 649 AVLIYGALKQNDRRLFDLLREMHSQGMKPDKVIYTTLIDGSIKAGDLRKAFGFWDIMIGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYT LVNGL KAGYV++AK L KRMLV EA PNHITYGCFLDHLTKEGNME A
Sbjct: 709 GCIPNTVTYTALVNGLLKAGYVNEAKLLFKRMLVHEATPNHITYGCFLDHLTKEGNMENA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLH+ ML  TLANPVTYNILIRGYCQ+GKF EAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHNAMLKGTLANPVTYNILIRGYCQIGKFHEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           C+RGNV AA++MWE M  RGLKPDTVAFNFLI+ACCLTGEL++AL+LR++M  RGLKPTR
Sbjct: 829 CKRGNVTAAVEMWECMLRRGLKPDTVAFNFLIHACCLTGELDQALRLRNDMMSRGLKPTR 888

Query: 304 STYYSLIHGTCLT 317
           STYYSLI  +C T
Sbjct: 889 STYYSLIGASCST 901

BLAST of Moc02g08840 vs. NCBI nr
Match: KAG6570725.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 560.5 bits (1443), Expect = 9.6e-156
Identity = 265/313 (84.66%), Postives = 287/313 (91.69%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEF+NDLHHEH++LNELCYT LLQGFCKEGR+KEAL+ARQEMVGRG+ MDLISY
Sbjct: 589 GRVSEAKEFINDLHHEHRRLNELCYTELLQGFCKEGRVKEALVARQEMVGRGMHMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALK+ND RLFDLLREMHSQGMKPD VIYTTLIDG IKAG+LRKAFG WDIM GE
Sbjct: 649 AVLIYGALKQNDRRLFDLLREMHSQGMKPDKVIYTTLIDGSIKAGDLRKAFGIWDIMIGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYT LVNGLFKAGYV++AK L KRMLV EA PNHITYGCFLDHLTKEGNME A
Sbjct: 709 GCIPNTVTYTALVNGLFKAGYVNEAKLLFKRMLVHEATPNHITYGCFLDHLTKEGNMENA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLH+ ML  TLANPVTYNILIRGYCQ+GKF EAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHNAMLKGTLANPVTYNILIRGYCQIGKFHEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           C+RGNV AA++MWE M  RGLKPDTVAFNFLI+ACCLTGEL++AL+LR++M  RGLKPTR
Sbjct: 829 CKRGNVTAAVEMWECMLRRGLKPDTVAFNFLIHACCLTGELDKALRLRNDMMSRGLKPTR 888

Query: 304 STYYSLIHGTCLT 317
           STYYSLI  +C T
Sbjct: 889 STYYSLIGASCST 901

BLAST of Moc02g08840 vs. NCBI nr
Match: KAG7010569.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 560.5 bits (1443), Expect = 9.6e-156
Identity = 265/313 (84.66%), Postives = 287/313 (91.69%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEF+NDLHHEH++LNELCYT LLQGFCKEGR+KEAL+ARQEMVGRG+ MDLISY
Sbjct: 589 GRVSEAKEFINDLHHEHRRLNELCYTELLQGFCKEGRVKEALVARQEMVGRGMHMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALK+ND RLFDLLREMHSQGMKPD VIYTTLIDG IKAG+LRKAFG WDIM GE
Sbjct: 649 AVLIYGALKQNDRRLFDLLREMHSQGMKPDKVIYTTLIDGSIKAGDLRKAFGIWDIMIGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYT LVNGLFKAGYV++AK L KRMLV EA PNHITYGCFLDHLTKEGNME A
Sbjct: 709 GCIPNTVTYTALVNGLFKAGYVNEAKLLFKRMLVHEATPNHITYGCFLDHLTKEGNMENA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLH+ ML  TLANPVTYNILIRGYCQ+GKF EAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHNAMLKGTLANPVTYNILIRGYCQIGKFHEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           C+RGNV AA++MWE M  RGLKPDTVAFNFLI+ACCLTGEL++AL+LR++M  RGLKPTR
Sbjct: 829 CKRGNVTAAVEMWECMLRRGLKPDTVAFNFLIHACCLTGELDKALRLRNDMMSRGLKPTR 888

Query: 304 STYYSLIHGTCLT 317
           STYYSLI  +C T
Sbjct: 889 STYYSLIGASCST 901

BLAST of Moc02g08840 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 1.2e-95
Identity = 161/304 (52.96%), Postives = 225/304 (74.01%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           G+ SEAK FV+ LH  + +LNE+CYT LL GFC+EG+++EAL   QEMV RG+ +DL+ Y
Sbjct: 591 GQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCY 650

Query: 64  AVLIYGALKKNDGRL-FDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTG 123
            VLI G+LK  D +L F LL+EMH +G+KPD VIYT++ID   K G+ ++AFG WD+M  
Sbjct: 651 GVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMIN 710

Query: 124 EGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTK-EGNME 183
           EGC+PN VTYT ++NGL KAG+V++A+ L  +M    + PN +TYGCFLD LTK E +M+
Sbjct: 711 EGCVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQ 770

Query: 184 KALQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIY 243
           KA++LH+ +L   LAN  TYN+LIRG+C+ G+ +EA++L+  MIG+G+ PDCITY+T I 
Sbjct: 771 KAVELHNAILKGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMIN 830

Query: 244 EYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKP 303
           E CRR +V  AI++W  M  +G++PD VA+N LI+ CC+ GE+ +A +LR+EM  +GL P
Sbjct: 831 ELCRRNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGCCVAGEMGKATELRNEMLRQGLIP 890

Query: 304 TRST 306
              T
Sbjct: 891 NNKT 894

BLAST of Moc02g08840 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.5e-50
Identity = 108/315 (34.29%), Postives = 173/315 (54.92%), Query Frame = 0

Query: 5   RVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISYA 64
           +++EA+E  +++  +    + + YT L+ GFCK G I+ A     EM  R +  D+++Y 
Sbjct: 331 KLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYT 390

Query: 65  VLIYGALKKND----GRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIM 124
            +I G  +  D    G+LF    EM  +G++PDSV +T LI+GY KAG+++ AF   + M
Sbjct: 391 AIISGFCQIGDMVEAGKLF---HEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHM 450

Query: 125 TGEGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNM 184
              GC PN VTYTTL++GL K G +  A +LL  M      PN  TY   ++ L K GN+
Sbjct: 451 IQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNI 510

Query: 185 EKALQLHDTMLIETL-ANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTF 244
           E+A++L        L A+ VTY  L+  YC+ G+  +A ++L  M+G G+ P  +T++  
Sbjct: 511 EEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVL 570

Query: 245 IYEYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGL 304
           +  +C  G ++   K+   M  +G+ P+   FN L+   C+   L  A  +  +M  RG+
Sbjct: 571 MNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGV 630

Query: 305 KPTRSTYYSLIHGTC 315
            P   TY +L+ G C
Sbjct: 631 GPDGKTYENLVKGHC 642

BLAST of Moc02g08840 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 3.1e-48
Identity = 100/313 (31.95%), Postives = 170/313 (54.31%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEA   V+ +    Q+ + +  + L+ G C +GR+ EAL+    MV  G + D ++Y
Sbjct: 154 GRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTY 213

Query: 64  AVLIYGALKKNDGRL-FDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTG 123
             ++    K  +  L  DL R+M  + +K   V Y+ +ID   K G+   A   ++ M  
Sbjct: 214 GPVLNRLCKSGNSALALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEM 273

Query: 124 EGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEK 183
           +G   + VTY++L+ GL   G      ++L+ M+     P+ +T+   +D   KEG + +
Sbjct: 274 KGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLE 333

Query: 184 ALQLHDTMLIETLA-NPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIY 243
           A +L++ M+   +A + +TYN LI G+C+     EA ++ D M+  G  PD +TYS  I 
Sbjct: 334 AKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILIN 393

Query: 244 EYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKP 303
            YC+   VD  ++++  + ++GL P+T+ +N L+   C +G+LN A +L  EM  RG+ P
Sbjct: 394 SYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPP 453

Query: 304 TRSTYYSLIHGTC 315
           +  TY  L+ G C
Sbjct: 454 SVVTYGILLDGLC 466

BLAST of Moc02g08840 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 2.6e-47
Identity = 101/313 (32.27%), Postives = 168/313 (53.67%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           G + +A  + N +       + + YT L+QG+C++G I  A+  R EM+ +G  MD+++Y
Sbjct: 389 GNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTY 448

Query: 64  AVLIYGALK-KNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTG 123
             +++G  K K  G    L  EM  + + PDS   T LIDG+ K GNL+ A   +  M  
Sbjct: 449 NTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKE 508

Query: 124 EGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEK 183
           +    + VTY TL++G  K G +  AK++   M+  E  P  I+Y   ++ L  +G++ +
Sbjct: 509 KRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAE 568

Query: 184 ALQLHDTMLIETLANPVTY-NILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIY 243
           A ++ D M+ + +   V   N +I+GYC+ G   +    L+ MI  G VPDCI+Y+T IY
Sbjct: 569 AFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIY 628

Query: 244 EYCRRGNVDAAIKMWERMFNR--GLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGL 303
            + R  N+  A  + ++M     GL PD   +N +++  C   ++  A  +  +M  RG+
Sbjct: 629 GFVREENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGV 688

Query: 304 KPTRSTYYSLIHG 313
            P RSTY  +I+G
Sbjct: 689 NPDRSTYTCMING 701

BLAST of Moc02g08840 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 4.4e-47
Identity = 103/296 (34.80%), Postives = 163/296 (55.07%), Query Frame = 0

Query: 24  NELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISYAVLIYGALKKNDGRLFD--- 83
           N + Y  L+ G+CK  +I +     + M  +GL  +LISY V+I G  +  +GR+ +   
Sbjct: 239 NVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCR--EGRMKEVSF 298

Query: 84  LLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGEGCIPNTVTYTTLVNGLF 143
           +L EM+ +G   D V Y TLI GY K GN  +A      M   G  P+ +TYT+L++ + 
Sbjct: 299 VLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMC 358

Query: 144 KAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKALQLHDTMLIETLA-NPV 203
           KAG +++A + L +M V    PN  TY   +D  +++G M +A ++   M     + + V
Sbjct: 359 KAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVV 418

Query: 204 TYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEYCRRGNVDAAIKMWERM 263
           TYN LI G+C  GK ++A  +L+ M   G+ PD ++YST +  +CR  +VD A+++   M
Sbjct: 419 TYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREM 478

Query: 264 FNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTRSTYYSLIHGTCL 316
             +G+KPDT+ ++ LI   C       A  L +EM   GL P   TY +LI+  C+
Sbjct: 479 VEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCM 532

BLAST of Moc02g08840 vs. ExPASy TrEMBL
Match: A0A6J1D3X3 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111017039 PE=4 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 3.1e-184
Identity = 314/314 (100.00%), Postives = 314/314 (100.00%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY
Sbjct: 554 GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 613

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE
Sbjct: 614 AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 673

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA
Sbjct: 674 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 733

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 734 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 793

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR
Sbjct: 794 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 853

Query: 304 STYYSLIHGTCLTS 318
           STYYSLIHGTCLTS
Sbjct: 854 STYYSLIHGTCLTS 867

BLAST of Moc02g08840 vs. ExPASy TrEMBL
Match: A0A6J1D4W4 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017039 PE=4 SV=1)

HSP 1 Score: 654.1 bits (1686), Expect = 3.1e-184
Identity = 314/314 (100.00%), Postives = 314/314 (100.00%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY
Sbjct: 589 GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE
Sbjct: 649 AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA
Sbjct: 709 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR
Sbjct: 829 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 888

Query: 304 STYYSLIHGTCLTS 318
           STYYSLIHGTCLTS
Sbjct: 889 STYYSLIHGTCLTS 902

BLAST of Moc02g08840 vs. ExPASy TrEMBL
Match: A0A6J1FVS2 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448927 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 7.9e-156
Identity = 264/311 (84.89%), Postives = 286/311 (91.96%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEF+NDLHHEH++LNELCYT LLQGFCKEGR+KEAL+ARQEMVGRG+ MDLISY
Sbjct: 589 GRVSEAKEFINDLHHEHRRLNELCYTELLQGFCKEGRVKEALVARQEMVGRGMHMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALK+ND RLFDLLREMHSQGMKPD VIYTTLIDG IKAG+LRKAFG WDIM GE
Sbjct: 649 AVLIYGALKQNDRRLFDLLREMHSQGMKPDKVIYTTLIDGSIKAGDLRKAFGIWDIMIGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYT LVNGLFKAGYV++AK L KRMLV EA PNHITYGCFLDHLTKEGNME A
Sbjct: 709 GCIPNTVTYTALVNGLFKAGYVNEAKLLFKRMLVHEATPNHITYGCFLDHLTKEGNMENA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLH+ ML  TLANPVTYNILIRGYCQ+GKF EAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHNAMLKGTLANPVTYNILIRGYCQIGKFHEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           C+RGNV AA++MWE M  RGLKPDTVAFNFLI+ACCLTGEL++AL+LR++M  RGLKPTR
Sbjct: 829 CKRGNVTAAVEMWECMLRRGLKPDTVAFNFLIHACCLTGELDKALRLRNDMMSRGLKPTR 888

Query: 304 STYYSLIHGTC 315
           STYYSLI  +C
Sbjct: 889 STYYSLIGASC 899

BLAST of Moc02g08840 vs. ExPASy TrEMBL
Match: A0A6J1FY36 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111448927 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 7.9e-156
Identity = 264/311 (84.89%), Postives = 286/311 (91.96%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEF+NDLHHEH++LNELCYT LLQGFCKEGR+KEAL+ARQEMVGRG+ MDLISY
Sbjct: 589 GRVSEAKEFINDLHHEHRRLNELCYTELLQGFCKEGRVKEALVARQEMVGRGMHMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALK+ND RLFDLLREMHSQGMKPD VIYTTLIDG IKAG+LRKAFG WDIM GE
Sbjct: 649 AVLIYGALKQNDRRLFDLLREMHSQGMKPDKVIYTTLIDGSIKAGDLRKAFGIWDIMIGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPNTVTYT LVNGLFKAGYV++AK L KRMLV EA PNHITYGCFLDHLTKEGNME A
Sbjct: 709 GCIPNTVTYTALVNGLFKAGYVNEAKLLFKRMLVHEATPNHITYGCFLDHLTKEGNMENA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLH+ ML  TLANPVTYNILIRGYCQ+GKF EAAKLLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHNAMLKGTLANPVTYNILIRGYCQIGKFHEAAKLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           C+RGNV AA++MWE M  RGLKPDTVAFNFLI+ACCLTGEL++AL+LR++M  RGLKPTR
Sbjct: 829 CKRGNVTAAVEMWECMLRRGLKPDTVAFNFLIHACCLTGELDKALRLRNDMMSRGLKPTR 888

Query: 304 STYYSLIHGTC 315
           STYYSLI  +C
Sbjct: 889 STYYSLIGASC 899

BLAST of Moc02g08840 vs. ExPASy TrEMBL
Match: A0A6J1JC67 (putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484424 PE=4 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 1.5e-154
Identity = 262/313 (83.71%), Postives = 286/313 (91.37%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEAKEF+NDLHHEH++LNELCYT LLQGFCKEGR+KEAL+ARQEMVGRG+ MDLISY
Sbjct: 589 GRVSEAKEFINDLHHEHRRLNELCYTELLQGFCKEGRVKEALVARQEMVGRGMHMDLISY 648

Query: 64  AVLIYGALKKNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTGE 123
           AVLIYGALK+ND RLFDLLREMHSQGMKPD VIYTTLIDG IKAG+LRKAFGFWDIM GE
Sbjct: 649 AVLIYGALKQNDRRLFDLLREMHSQGMKPDKVIYTTLIDGSIKAGDLRKAFGFWDIMIGE 708

Query: 124 GCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEKA 183
           GCIPN+VTYT LVNGL KAGYV++AK L KRMLV EA PNHITYGCFLDHLTKEGNME A
Sbjct: 709 GCIPNSVTYTALVNGLLKAGYVNEAKLLFKRMLVHEATPNHITYGCFLDHLTKEGNMENA 768

Query: 184 LQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIYEY 243
           LQLH+ ML  TLANPVTYNILIRGYCQ+GKF EAA+LLDGMIGNGIVPDCITYSTFIYEY
Sbjct: 769 LQLHNAMLKGTLANPVTYNILIRGYCQIGKFHEAAQLLDGMIGNGIVPDCITYSTFIYEY 828

Query: 244 CRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKPTR 303
           C+RGNV AA++MWE M  RGLKPDTV FNFLI+ACCLTGEL++AL+LR++M  RGLKPTR
Sbjct: 829 CKRGNVTAAVEMWECMLRRGLKPDTVVFNFLIHACCLTGELDQALRLRNDMMSRGLKPTR 888

Query: 304 STYYSLIHGTCLT 317
           STYYSLI  +C T
Sbjct: 889 STYYSLIGASCST 901

BLAST of Moc02g08840 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 351.3 bits (900), Expect = 8.2e-97
Identity = 161/304 (52.96%), Postives = 225/304 (74.01%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           G+ SEAK FV+ LH  + +LNE+CYT LL GFC+EG+++EAL   QEMV RG+ +DL+ Y
Sbjct: 591 GQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCY 650

Query: 64  AVLIYGALKKNDGRL-FDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTG 123
            VLI G+LK  D +L F LL+EMH +G+KPD VIYT++ID   K G+ ++AFG WD+M  
Sbjct: 651 GVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMIN 710

Query: 124 EGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTK-EGNME 183
           EGC+PN VTYT ++NGL KAG+V++A+ L  +M    + PN +TYGCFLD LTK E +M+
Sbjct: 711 EGCVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQ 770

Query: 184 KALQLHDTMLIETLANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIY 243
           KA++LH+ +L   LAN  TYN+LIRG+C+ G+ +EA++L+  MIG+G+ PDCITY+T I 
Sbjct: 771 KAVELHNAILKGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMIN 830

Query: 244 EYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKP 303
           E CRR +V  AI++W  M  +G++PD VA+N LI+ CC+ GE+ +A +LR+EM  +GL P
Sbjct: 831 ELCRRNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGCCVAGEMGKATELRNEMLRQGLIP 890

Query: 304 TRST 306
              T
Sbjct: 891 NNKT 894

BLAST of Moc02g08840 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 200.7 bits (509), Expect = 1.8e-51
Identity = 108/315 (34.29%), Postives = 173/315 (54.92%), Query Frame = 0

Query: 5   RVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISYA 64
           +++EA+E  +++  +    + + YT L+ GFCK G I+ A     EM  R +  D+++Y 
Sbjct: 331 KLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYT 390

Query: 65  VLIYGALKKND----GRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIM 124
            +I G  +  D    G+LF    EM  +G++PDSV +T LI+GY KAG+++ AF   + M
Sbjct: 391 AIISGFCQIGDMVEAGKLF---HEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHM 450

Query: 125 TGEGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNM 184
              GC PN VTYTTL++GL K G +  A +LL  M      PN  TY   ++ L K GN+
Sbjct: 451 IQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNI 510

Query: 185 EKALQLHDTMLIETL-ANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTF 244
           E+A++L        L A+ VTY  L+  YC+ G+  +A ++L  M+G G+ P  +T++  
Sbjct: 511 EEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVL 570

Query: 245 IYEYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGL 304
           +  +C  G ++   K+   M  +G+ P+   FN L+   C+   L  A  +  +M  RG+
Sbjct: 571 MNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGV 630

Query: 305 KPTRSTYYSLIHGTC 315
            P   TY +L+ G C
Sbjct: 631 GPDGKTYENLVKGHC 642

BLAST of Moc02g08840 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 200.7 bits (509), Expect = 1.8e-51
Identity = 108/315 (34.29%), Postives = 173/315 (54.92%), Query Frame = 0

Query: 5   RVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISYA 64
           +++EA+E  +++  +    + + YT L+ GFCK G I+ A     EM  R +  D+++Y 
Sbjct: 331 KLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYT 390

Query: 65  VLIYGALKKND----GRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIM 124
            +I G  +  D    G+LF    EM  +G++PDSV +T LI+GY KAG+++ AF   + M
Sbjct: 391 AIISGFCQIGDMVEAGKLF---HEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHM 450

Query: 125 TGEGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNM 184
              GC PN VTYTTL++GL K G +  A +LL  M      PN  TY   ++ L K GN+
Sbjct: 451 IQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNI 510

Query: 185 EKALQLHDTMLIETL-ANPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTF 244
           E+A++L        L A+ VTY  L+  YC+ G+  +A ++L  M+G G+ P  +T++  
Sbjct: 511 EEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVL 570

Query: 245 IYEYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGL 304
           +  +C  G ++   K+   M  +G+ P+   FN L+   C+   L  A  +  +M  RG+
Sbjct: 571 MNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGV 630

Query: 305 KPTRSTYYSLIHGTC 315
            P   TY +L+ G C
Sbjct: 631 GPDGKTYENLVKGHC 642

BLAST of Moc02g08840 vs. TAIR 10
Match: AT3G22470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 193.7 bits (491), Expect = 2.2e-49
Identity = 100/313 (31.95%), Postives = 170/313 (54.31%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           GRVSEA   V+ +    Q+ + +  + L+ G C +GR+ EAL+    MV  G + D ++Y
Sbjct: 154 GRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMVEYGFQPDEVTY 213

Query: 64  AVLIYGALKKNDGRL-FDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTG 123
             ++    K  +  L  DL R+M  + +K   V Y+ +ID   K G+   A   ++ M  
Sbjct: 214 GPVLNRLCKSGNSALALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEM 273

Query: 124 EGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEK 183
           +G   + VTY++L+ GL   G      ++L+ M+     P+ +T+   +D   KEG + +
Sbjct: 274 KGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLE 333

Query: 184 ALQLHDTMLIETLA-NPVTYNILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIY 243
           A +L++ M+   +A + +TYN LI G+C+     EA ++ D M+  G  PD +TYS  I 
Sbjct: 334 AKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILIN 393

Query: 244 EYCRRGNVDAAIKMWERMFNRGLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGLKP 303
            YC+   VD  ++++  + ++GL P+T+ +N L+   C +G+LN A +L  EM  RG+ P
Sbjct: 394 SYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPP 453

Query: 304 TRSTYYSLIHGTC 315
           +  TY  L+ G C
Sbjct: 454 SVVTYGILLDGLC 466

BLAST of Moc02g08840 vs. TAIR 10
Match: AT5G01110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 190.7 bits (483), Expect = 1.9e-48
Identity = 101/313 (32.27%), Postives = 168/313 (53.67%), Query Frame = 0

Query: 4   GRVSEAKEFVNDLHHEHQKLNELCYTALLQGFCKEGRIKEALIARQEMVGRGLRMDLISY 63
           G + +A  + N +       + + YT L+QG+C++G I  A+  R EM+ +G  MD+++Y
Sbjct: 389 GNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTY 448

Query: 64  AVLIYGALK-KNDGRLFDLLREMHSQGMKPDSVIYTTLIDGYIKAGNLRKAFGFWDIMTG 123
             +++G  K K  G    L  EM  + + PDS   T LIDG+ K GNL+ A   +  M  
Sbjct: 449 NTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKE 508

Query: 124 EGCIPNTVTYTTLVNGLFKAGYVSKAKQLLKRMLVGEAFPNHITYGCFLDHLTKEGNMEK 183
           +    + VTY TL++G  K G +  AK++   M+  E  P  I+Y   ++ L  +G++ +
Sbjct: 509 KRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAE 568

Query: 184 ALQLHDTMLIETLANPVTY-NILIRGYCQMGKFQEAAKLLDGMIGNGIVPDCITYSTFIY 243
           A ++ D M+ + +   V   N +I+GYC+ G   +    L+ MI  G VPDCI+Y+T IY
Sbjct: 569 AFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIY 628

Query: 244 EYCRRGNVDAAIKMWERMFNR--GLKPDTVAFNFLIYACCLTGELNRALQLRDEMTLRGL 303
            + R  N+  A  + ++M     GL PD   +N +++  C   ++  A  +  +M  RG+
Sbjct: 629 GFVREENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGV 688

Query: 304 KPTRSTYYSLIHG 313
            P RSTY  +I+G
Sbjct: 689 NPDRSTYTCMING 701

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022148373.16.3e-184100.00putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 [Momor... [more]
XP_022148372.16.3e-184100.00putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 [Momor... [more]
XP_023511929.13.3e-15684.98putative pentatricopeptide repeat-containing protein At5g59900 [Cucurbita pepo s... [more]
KAG6570725.19.6e-15684.66putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
KAG7010569.19.6e-15684.66putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
Match NameE-valueIdentityDescription
Q9FJE61.2e-9552.96Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th... [more]
Q0WVK72.5e-5034.29Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
Q6NQ833.1e-4831.95Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
Q9LFC52.6e-4732.27Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX... [more]
Q9FIX34.4e-4734.80Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1D3X33.1e-184100.00putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 OS=Mom... [more]
A0A6J1D4W43.1e-184100.00putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 OS=Mom... [more]
A0A6J1FVS27.9e-15684.89putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 OS=Cuc... [more]
A0A6J1FY367.9e-15684.89putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 OS=Cuc... [more]
A0A6J1JC671.5e-15483.71putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT5G59900.18.2e-9752.96Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G05670.11.8e-5134.29Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G05670.21.8e-5134.29Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G22470.12.2e-4931.95Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G01110.11.9e-4832.27Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 1..109
e-value: 5.5E-23
score: 83.9
coord: 110..225
e-value: 5.1E-30
score: 107.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 227..316
e-value: 2.4E-26
score: 94.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 234..268
e-value: 3.0E-9
score: 34.5
coord: 269..302
e-value: 2.7E-5
score: 22.0
coord: 199..233
e-value: 9.1E-12
score: 42.4
coord: 165..191
e-value: 5.1E-4
score: 18.0
coord: 27..59
e-value: 7.2E-5
score: 20.7
coord: 130..156
e-value: 1.4E-6
score: 26.1
coord: 95..129
e-value: 2.2E-8
score: 31.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 165..191
e-value: 0.024
score: 14.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 197..245
e-value: 5.7E-14
score: 52.1
coord: 24..69
e-value: 1.8E-8
score: 34.5
coord: 92..141
e-value: 9.2E-15
score: 54.6
coord: 266..314
e-value: 5.0E-11
score: 42.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 24..58
score: 10.742131
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 93..127
score: 11.794416
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 267..301
score: 11.355965
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 128..162
score: 10.358486
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 232..266
score: 12.67132
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 197..231
score: 14.01956
NoneNo IPR availablePANTHERPTHR47932:SF12OS01G0153250 PROTEINcoord: 266..314
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 4..275
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 266..314
NoneNo IPR availablePANTHERPTHR47932:SF12OS01G0153250 PROTEINcoord: 4..275
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 103..269

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc02g08840.1Moc02g08840.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding