MC07g0149 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC07g0149
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Location: MC07: 2752494 .. 2777207 (-)
RNA-Seq Expression: MC07g0149
Synteny: MC07g0149
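
The Location field packs the chromosome, coordinates, and strand into one string. As a minimal sketch, the snippet below splits such a string and derives the length of the gene span. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("MC07: 2752494 .. 2777207 (-)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # MC07 - 24714
```

Under that assumption the gene region covers 24,714 bp on the minus strand of MC07.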
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGTCGAGATGGTCTTATGGATGATGCTATGTCAGCATTTCAAGATATGAAAGAGCATTGGCCTTCAACCATCTTTGGGTATTTACAACTCCCTCATCCATGGTTTTGCTGCAAAAGGTGAGTTTGAAACTGCTATGTTTTTCGTCAATGAGATGAAAGACATCAATATGACTCGAGAACCAGAAACCTACGATGGCCTAATCGAAGCCTATGGAAAATATAGAATGTACGATAAGATGGTCGAGTGTCTGAAACAGATGGAATTGGATGGATATTTTCCGGACCAAATAACTTACAATTTGCTTATCAGAGATTTTTCCAAAGGTGGATTGTTTAAAAAGATGGAAGGTTTATACCAAACTATGCTTTCAAAAAGAATGGATTTGCAGTCATCTACCTTGGTTGCCATGTTGGAAGCTTACACCAAATTTGGCATATTGGATAAAATGGAAATGTTCTATAGAAGTATCCTGAACTCAAAGACCAATCTGAAGGAAGACCTAATCAGAAAGTTGACTTTAGTATATATCAAGAACTTCATGTATTCAAGATTGGAGACCTGTTAAGAACAACGGTACCCCAAGTATCCTACGTATCTCAACAATAGTATGATATTGTCTGCTTTGGGCATAGCCCTCACGGTTTTGTTTTTTGTTTCACCCAAAAGGCCTCATACCAATAGAGATAATTGTCCTGTCTTATATACCTATGATCATTCCCAGTCAATGTGGGACTTTGTTTGCAATCACCTTACAAACCCAACAATCCCCCCCTCGAAAAAAGTACCACCCGAGCCTTCCCTCAAGTAGTCACTCATCCGTCTACTCTCGGAATCCAGAAACGAGAGTTTGCTCTTATCCAAGGCTCAAGGATGCAACACAACCTTGATAGAGCACACCCACGGATTAAACCATGACCACACCTACCGACACAGCTAGGCTTAGACCATGGCTCTGATACCACTTGTTAAGGACAACGGTACCCCAAGTACCCCACGTATCTCCACAATGGTATGATATTGTCCGCTTTGGGCATAGCCCTCACGGCTTTATTTTTTGTTTCACCCAAAAGACCTCATACCAATGGAGATAATTGTCCTCTCTTATATACCCATGATCATTCTCTTCCCTAGTCAATGTGGGACTTTGTTTGCAATCACCTTACAAACCCAACAAGACCTTGGGCGTAAATCTTTACATTAAAATCGGGAAGACCGATCTTTTTTGGTGCCTGCATCTTCTATCGCATGCTTGTCTATCAAGCCGAAGGGGTATGAACTCTGTTGTTCAGGAAATGGACAGAGCTAACAAGATTTGGAATGTGACAGTTGCAAACATTATACTTCTAACTTATTTGAAAATGAAAGATTTTAAGCAGCTCCGAATGTCGTTCTCTGAAATGCGAGCAAGAAATGTCAAACCCGATATAGTTACCATTGGAATTATTTTAGATGCAAATAACAAAGGATTGAACGGCACCGGAACTTTAGAGGCATGGAGAAGGATGAACTTGCTATCTAGAGCTGTGGAGATGAACACTGACTCGCTGGTTCTAGCTGCATTTGGAGAGGGAAATTTCCTTAAAAGCTGTGAAGAGACATACGCTGCTCTGGAGCCCAAGGATAGGGACAATAAAATATGGACTTATGAAGGACTCGTTGATCTGGTTCTTGAAAAGGGGGAAACCGGATTTAAAACTAGATAAACTAGCTTATTGCACATACCTACATCAAATTGTTTTAAAAGATATGATAATCCATTGACATCTGAGAGAACTTTTGCACGAAAACCTTACTGTAGGGAGATTTTTATGTACAAAACCAGTCTATGTTTCGTGTGTTGTAAATTTGATGTATTCCCAATAGTTAGTACTTCACTTGAATCAATTAGTACAAATGACAGATTAGTTGTGATCCTTCACTGCTAAAGCCCTAAAAAACAACTCTATTTCGTTTAATAATCCTTTATTCTCATGCGAGCCTGATTGCATGATCAAGTGAAG
GAGATACTTACAAACGTCTCCCAGATCATCCACATTAGCAAACAGATTCCCGATTTCTTCCTCTGTTTGAGCACCCCCATCATTCTCATAGTACATCTGCTTCCTGCAAGCCTGGAAGAGACCACGTTTAAATTATAATATTTTTTAATTTATAATAATGTTAATAAATAAATTATTTAATGAACATAAGGAAGAAATTTGTAAAAAACCAAAAAGGCATATTGAATAGGCATGAATATCAACCATTCTTTCTGAAAATAAATGAGGTTATATTGTTTTCCTCGTTTTAAATATCAATAGGAAAATACTCGAGAAACTTTGTCCTTCTAACATTTAATTGTGAAGAGGACAAGGTTCCTCCTCGTTCTATTTTGGGACGCCTTAATCGCGAAGAATTTGACGAAAGTCGGGTTTTGAGAAACGTTACCTGAAGAACCGAATTTAGGCTGGCAATTTGATCTGCCTGCAGGAACTCAGAGTTCAATAGTTCCCGGAAGAAAGATACCATCTCCGGGCTCAAATTGCTGGTTGTGACATTTTGAATTATGCTAGAGATGCATGCTTTATCAAATATTGATCGCATCCATACAAAATCCTTCTGCAGCTTCAAAAACGCCGCTGTAACATAAATCGCAGGTTTGTGACCAATGATCGCATTTGGAGCCAAAGATGGAAGAGATATGGAAGCTGCCCATTCTTCAACGATTATCCTACACCACTTGTTCTTTACAATAACCCTCGCCTCCTGCATATTCATTTGTTTCCACCCTAAAACCATCGATAAACACAGGCTACAATTCGTAGCAACACGATGGTCACCGTATACAACCAGGCCTTCCAATGTCGCGATCACGGACATAAGGTAATTTGTGGTGCATCTCAGCTCCTCCTGTTTGCTAGTTCTCTGCTCTGATAGTCGAGTAAACAACTCCAACAGGCAATATGAAGCCACGAGTTTAACCGAAGCAGATCCAAAATGCAGGAGTCTACACAAATCGTGGCAGGTGATGCCAATGGAAGAGAGCGGTGTTCCATTTGATTGACCAAGGTTGTTTTGCCAATTGACAACCTCGGGTAGAACAGCTTGAAGACTGAAAAAAGTGAGGGAAAACAAAAAACTGAGTTGGATCGTTTACCTGAACAGAAAATTAGCAAATTTTCATTTCAACATTTGCACCTAGTAGGGAACGCTAACCTTCTCATCGAAAAGTAAACTAGAAAAAGAACAAGAATTACAGTTTTTCCCATGTTTGTTCCTTCGTGATCATCAATTAAAGCAGGCCCCTTTGAGCAAGCTTCATGCAACACAGACTTGATTGCAGATGCCACGGGAGTATGAAAAAGTACGGATTTTGAAGCTTCAATGAGTCCTCCATTCGTTGAGTGGTGCAGAATCAAGGATAAAATAGCAAGAACTAAGAGATTTTCAGGAGTCCACCTATCCGCTATATCAATTGGAGAAAGGCAATCGACCAGCTTTCAAAGAAAAATGAGAACTTTATGACAATTGCTTAAAACTATGAGTAAGCAAAGATGAACAATAAATAGTCTATCTGAACACTGAAATCTAGTTACCTTCACAGTGACTGCAAGCCAAGCTTCATCGTCGGAAAGTATTCCAGAATGGCTTGATCTCAAAATGCTGAAAACCAACAGTAAGACAGCTTTAAATGTCTGTTTCGAGTATGAATTATTAGTATCGTAATATAGAAGCTTTATTGCATTTCCTATGCCATGCACACGTAATTGATCTGCGGAACTTGGAAAGATATTCACGATGGTTGACAAAAAATTCACCACCGAACTTATTTCATGTTCGACACCTTCCTCAACGAGCTGTTCCAGCAAGCGTATCAGAAGGATTGCAGCATAATTTTCTCCTTCTGCAACTAATTTTGCTATTTCTCGCGCACCAATAAACTGATTGTGGATGGTGGCAGTGGCGGTCCCGTTTGCGCCAAGGAGTTGGCATATTTTCAAAACCTGATAACATAACGGGT
TTCTGATTCTCTCTTGTTTAAACAACCATTTCAACGACGACCAGTGAATCCTTGAAGAATGCAAATCCCATTCACTTTCAGTCACCAGTTGGAACACTATCCTCTCAGCTTCTGGACTGTAGGAGATCTGGCAACTAGTGTCAGCAACAAACCTACAAAAACCATATATATTCACCAATTGTGTGACTGTAAATGGATCAGGGTACCCACAAAGCAAGCCACTGTTGCTGACAAGAATATACTGTTCCAGAGACGCTAAGACCATCTTCTCATCAGCAAGTCTGGCGAATAAATAGTAATTTGGTGAGCAGACAAGCAAACAGATGTATACCAGGAAGAGAACATAATAAAGAATAAACATAATCGTGTTACCTATCATCGTGTAATGAACTTGTGTATAGTAACAGCAAGATGGCAGATTGACAAGAAGAGAGCTCCAAGTCATTGGAGCTTTTCTGTCCAAGTAGAAACAGTAAATCAATTGGATCAGATGGCAGAAATGATACAGATTCTCTAATGCATTGTCCAGAATCATTTCCGAACATGACATCCACAAGAGAACTAAGCATTAGATACACTCTCCATTTCATCTTTTCAGAAGGAAATAAGCCTAGACATGTAAAAGAGAAACTGAACCAAGAAACTGATAGAAGCGTGTTTGCAAAATCAAGGATCCTGATGTCAGGATCTTGGAGCAAAATCGAATGAAAAGTTTCCATTACCCCCAAGACAAGTTCATCTTCAACTACGTTAATAGTTGCTAAAACCCAAGGTAATAAATGTGTTGTACATACTTCAAGAACACAATTTTGCAAATCTTTGGTGGTGGGGTCGTCGATGAAACTATTGTCATGACTATACACGTAGAACTCCTTGAGAAGGTATAAGGAATGCAAGAGCTGAGTTGGCTGTGCCTCAAAAGTACTAAGACAAAATGAAACTAAATGTTCTAGTACTTCTCGAACTGATCTTGCCAGATGAGGCACCCTGTGAGATGGAGACTTCATAACTGTAACCAAGATTGAGCAGGTTGTTGCGAATGTGTCTGGAAGTAAGCCCATCTCCCCATTCACATTCCTAAGGAGCATACTCGTCAAAATAAGAACTAGTTCTTCAATGTGAGATGCAGACACTACTCCAGGACACCGAGAAATGCATTTCCAAATGAGAGTAAGAGTTTGGTTATGAACCGGGTGGAAAGGAACTTCAGCAACATGACGCAAAACTGGGATTAGTGTTGCAAATCCGACCGCAAGCCTTTGATTGAAGGGCTGTTCAGCTCTTGAAAGAAGATCAAGGGCCTGAACACAAACCTTTGCCAGGGGATCCTTACCTTCTGCAACATTCAACTCATTTCATTATCCTAAAATTTGAAGTTTATTTCAGGCAGATTATACTCATTCTAAAATAATCAGGACTACGATTGCTCACCTGAAAATCGTACTATCTCAAATACATAATCCACTATATTTTCTTCAACTAATAATTGGATCTCCTTGATGGAAGCGACTTGTGAGGACAAATAACGAATTATTAGCTCCAGTGTGCTAAGTTGGACCTCTCTGTCTGACGAAAGCAGAGGACCTTTGATGGCCTCAGCAAACAAAGTATTCAAAGGAGGTTCACCTTGATCCTTTTCGTTAAACTTCAAATATTTTGCATGTTCATTTCCCAAGAACCCCCTTTGAGCCAGAACAGTTAGAAGTGCTACGAAAATAAATCAATGCCAATCAGATAAGAAGATGAGAGACGAAATTTTATGCATATGCAAACATTCAGAATATCACTTAAACTCTTATGCATCACCTGCTACATCTAAAACAATTGTGTTAGTACCATAACTTCCATTGCTTAACTAAGAGATCTGGCAGAAAGATGTATAAAAGGTTGGCCTTAATCATTTGCCTACCGCACTTGTGTTAAGATATCCAATAAGTAAATTAAATTTATCTAACTCATCAGCTTAAACCTTTTGATTCATGGGTGGTTTATTCATGTTT
AATACGGTATCGGAGCAAAAAAAGAGCTCAAACGAGCATCCGGTCAAAAATGGGATCCAAATTTATCCAACCCATCAGCTTAGGCATTTGAGTTCTTTGGTGGTTTATTCATTCATGTTTAATATTTGTCCTAAAATTGGGGTTTAAGAAGGAATGAATTCTAGTGGCAGAGTAAATGACGAAAAGTTAACTGTATAATTAGTACAGGTCTGTCAAAGGAGAACTAAATGCAAGTAGTATTTAGAACATAATCTGAAGGCAGGGTAATAATCAAACTGAGTCAAAAAGCTGACGCAAAAGTTAGTTTTGAAGAAGTCGACAAACCTACACAGTTCAATCTGACATCATCATTTTGAGTCTTCATCAGCGCCTCCAAAGATAGGTACACAAGTTTTGGGCAAAATTCAGAGAGAACGTCAATTTCAGTGCTAGTGGACAATTTGTATAGAACAAAGAGGATCTCTCCCCGTATCTCCTCGCTGCATTTAGAGAGTCATTCAAAACTTGCATAAAATACCCGTTCATAATGTAAGATAATTTTCATCAGTTCATCACCTTGGCAATTCAAGGCCTGCCACGAGATTAGATAAGAGCCCATCGTTCTTCTTAATTAGGTCATGGAGATTCTTTGTCCGATAGTTCAGAAGCATTCCATAGCAGTGAAGCTGCAAGAGGAAAACAAAATTCTCTCACTATTTCGTCAATGAATCACTTCAATAGGTAATGAAGAACTGTAGAACAGAGAACATTACCATATACACCTGACGACGACTCCAAGCCAACGCACCAGACGAAAGCCGATCGGAAACTCTGGCGATAAAATCCTCGGAAAGAGATCCGTCACCGTCAGCGGCACTGAAATCACAGAGCTGGCGAACAAGATCAGTGAGTTGGCGAGCAATCGGCTCGTCATCGAAGCAGCAGAGCGCGGCGACGAAGGGGGAGACAATGAAGTGAGAGTGGAAAGTGAGGAAGGTGCGTAGGAACGGCGGCTGAGCAAGAGCCTGGGAGAGCTGGGAAAGCGCATAAGAGACGTGAACTGTGGGAGAGAGAGGATCGGAGATGAGATTTGAGAAGCAGAGGAGGCAAATCGTTCCTCCTTCCTGAGTATGGAGGCAGAGCGACGATCGGTGGCCATGGGAGCAAGATTGTGGAGAATTGTAGCGATTTTGATCGTCTTCTTCCTCTGCAGGTTCGAGGTCTGGATCCTGCGAATCGCCGAAGAACATTTCTGGTAATCTCTCGACAAGAAACGGAGTGGTTTTCGAAATTTTGGCGCGGAAACGATGAAGAGTTTTCAATTTTGTTCTTTTTATAGCCTGTCTGAACGGTTAAAATTATTTTGATGACATTTTTTAAAAAATATTTTTACAATGTTTATTTTTCATCTCTTTTTCTTGTTTATTGTTTCATATGCATATTTAGAAAATTAAAAAATAGTACAAAAGAATATCTATTTATATATTTTACCATTAGGTTTAAAATTTTACAAAAATGCACAAATATAAACTTATACTATCATTGACCGACTAACCAAAAATAAGATTGGATGATCAATTAAGCTAAAAAGTACAAACTAGTTTGGTTTTAAATAAATTGTTATAATAAAATTTATTTACTATCAATAACAAAATGATAATAAAATAAATTGAAATTTATCATTGTTTTGATTAGAAATGATATTAGCACATTTTTAGTGAATGTATCCGTCGCAAAGCTAAGAATAGTAAAGGGTATTTTACTACGTCAATAGGTTTTCGATTCTGATTTTATAAATTTTTTAGTACCACCAATTGCACTTTGGATCGTAACCCGTCTAATAACCATTTATGTTTTCTTTATTTTTTATTTTTGGTTTTTTTTAGTTTTTAGTTTTTTATTTTAAAAAAACAGAAATCATATATATTTGATTACTGTTTATTATTTTTTGGTTTTTAAATAACATAAAGGTTTAGTTTTCTATTTCAAAAGACAAAAAACCAAAAACCAAAGAATTTTTT
GATAACTAATTTGTTGGGAAGGAGGACAAATTGGAATAATAAAAATAATTGTCATTGTGTGTAGAACCACCCTGAAAGTAAAATCATTGCTGGAAAGCTCCCAAGATTAACACAACTTCCAACTATGTACTCTTCGTATCAAAGTCAAATCAAACTTATGCTCGGAAATACCTTAGAAAACTCAAATGAAATAATAGGGTAAAGAAGGAGAGAAGAGAATGGAGTACTTTGCGTTCAAAAGTTCCAGGGAGATCTTATCAACAAGGATTTCACTATACGAACAGACTCAAAAGCAAGCAAATACATCTTCGAAAAAAATGTAAAGAATCTTGTCTCAAAACAAATCTTCGCAAGGTGGCAGGCCATATTATCTTGCTTTGATTTCAAAATAGAGCCTATAAAAGGAAGTGAGAACTCCCTTGCTGATTACCTCTCAAGAGAAAATCTCTTGAAGACATCCAAATCAGCCTTGATCTCTCTCCCTGATGGAACCTCCTCCCGGCCGGAGACGGCCAAATTCTCAGCGGCCTCCGCCGCAAAACTCCCAGCGGCCTCCGCCACAAGTCAATCAGCGACTTCCGTCGCCGAGAAATGAATCAAGCATTTCTCCCCAAGCAGCAGCATCCTCTTCTAGGACTGCTATCTCAAAGGGCAAAAAGTCCATCACTCAAACATCTGTACCATCTCCAATGAGTGCAGAAAATTATGCTATGGATATCCAGTTTGAAACGGTATCCAGGCGTCAGCAAGGTTCTTCCCAGAGAGCCTTGACTATTCAAACAGGCCCTCCAAGCCTTCCGACCCCTTCAAGCACGTTGTTACGCCCTCGCGGCAATACAACGAGGAACAGGCGCCCTGCTACGGCAGCCGCCGTTTCCAGACCAACGATTCCGAGGAACCCTTCCTCGTTTTCTCAAATAGTTAGGCCGAAGATTTTTCAGCCAAGACCTCCAATTACTGGGTATTTCACCAAGACTACCCTAGTAGATTCAATTATTGAACCAGAGTTCGACGGACCTTCAGTCCAAGAAGTCTGCAAGCAAATATTTCCTCAAGGCTTCAACTACCTGCCAGAGGATATTCAAAAAACCCAAACTTATTATGAGTTTATTCTGGTAGATTCGAAGTCTGCAGAAATAACTCATGTTCCAGATAGAAATGATCCTTCTAGGACCATTTACTCAAAGCACAGGATCTTCCGCATCCTTACCCCTTCCTCGTGGAAACAGGGTATGTTTGTAGGGAAGAGACTGTCAGTAACCTTCCAACCGCAAACTTACAATTACCGCGACTACATGAAAGCGTGGTATATTGTCTTCTGGTTGCAAGGCTATAACCATTCCTGGTTTGTGACATTCTGTAAGCAAGCTTACAAGTCTCACTTTCCAATTTGGTTTCAAACATGGTGGACTTACTTTGGACTCTCCGAAGAGATTTTTCCGGTAGAAGTTCAGAGATCTTACCACCTATTCCAACAGAGTATCTATTCGTCTCCTCTCTCCAAGACGTTTAGATTCGCTTTGTATTTTCAATTACCATGGATCTTTTGCTGGAACTTCCAGCTAGGACCCAGTGGAAACTTTAAAGCGTTGAGCAAAGCTCTCTGCGTCAAATGGTGGAAAAAAATTCGATTATTCCTACCTAGAATCAGACAAGATGAAGGATTGGTTAAAGATCAACGTTCATCTCCAAGACGTCGCAAGGCAAGAAGATGAAAGCTTCCTTCTGGCGAAAAATACCATCACGAGTTCACTTGCTGGAGCCGGATCTCAAGCCGACTTCAACTCGGTCCTCAATACCGTCGCAGTTCAGATCTCTGACCCCGACGAAGCCCAGACGGATGTAGATTCCTCCACCTCTGTCAACGATGATGCCGTAGACGACGAAGAAGACTTCGATCCCTTCGAAGGATACGACATCAACGACCCATATCTAGATTCTCAGCCTAGCTGAAACTAGATATTTCAATTGTTTAATTTAAATTTCTTATTATGTAAATT
AATTTGTTTAAATTTCGTTTTGAATTTTGGATAAGATCATTGTATTCCGATCTTATCATCACTATCCCAGTCTGTTTTTGTGAAGGACTTGCAGAAGAAATCACGATGCAGTAGATGGCATGCTTCTGATGTGGAATAACCAACCACAAGGCAACGCAGGTAGCGTACATGATTGACTCAGATGAGAGGATTCCATACCTCACAAGGCAACTCTAGTCGAAAATCTATGGTGTTCCTTGGGCAGTGAAAGGCGGGAAGCCGAGAGTAAGACCCTAGATGGAACGGCCTAATTGGAAACCAGAGAAGCCTTTACGATTAATATACAGAATCCGCCAATCCTCGCTAAATCAGAAGAAGCCGGGTGACTTCGTCAGATTTCTAAAGCAAGGCATCCAGTCCAAAAACAGTCATATCAGCTTTTATTTTCCAACTTTTCTTATTTTGTCAAGTTGTAATTATTTCCGCTTTACTTTTTCTCTTCCTGCTTTTTCAAAGATAAGATGCCTCTTGTCGGCCACGCTTATCATTACTTTTTAAAAAGTAGATTCCAAAGAGATAAGAGGCCAGTCAAGGACTTGTCTTGTCGGCCATACTCTATCTCTTTAGGTTGTATCAAATTATGGTGGACTTGACTTGTCTTGTCTTGTCGGCCAGTCAAGTTGTATCAAAGTAGATTCCAAAGAGATAAGAGACCAGAGATAAGTGCTATATATAGAAGAGAACCGATAATTGCACCAGGGTACAATTATTTAAAAAGTACAATTATTGTCTTGACACTTGTCTCAACATAATTTGATACAACCTAAAGAGATAGAGTATGGCCGACAAGACAAGTCCTTGACTGGCCTCTTATCTCTTTGGAATCTACTTTTTAAAAAGTAATGATAAGCGTGGCCGACAAGAGGCATCTTATCTTTGAAAAAGCAGGAAGAGAAAAAGTAAAGCGGAAATAATTACAACTTGACAAAATAAGAAAAGTTGGAAAATAAAAGCTGATATGACTGTTTTTGGACTGGATGCCTTGCTTTAGAAATCTGATGAAGTCACCCGGCTTCTTCTGATTTAGCGAGGATTGGCGGATTCTGTATATCAATCGTAAAGGCTTCTCTGGTTTCCAATTAGGCCGTTCCATCTAGGGTCTTACTCTCGGCTTCCCGCCTTTCACTGCCCAAGGAACACCATAGATTTTCGACTAGAGTTGCCTTGTGAGATATGGAATCCTCTCATCTGAGTCAATCATGTACGCTACCTGCGTTGCCTTGTGGTTGGTTATTCCACATCAGAAGCATGCCATCTACTGCATCGTGATTTCTTCTGCAAGTCCTTCACAAAAACAGACTGGGATAGTGATGATAAGATCGGAATACAATGATCTTATCCAAAATTCAAAACGAAATTTAAACAAATTAATTTACATAATAAGAAATTTAAATTAAACAATTGAAATATCTAGTTTCAGCTAGGCTGAGAATCTAGATATGGGTCGTTGATGTCGTATCCTTCGAAGGGATCGAAGTCTTCTTCGTCGTCTACGGCATCATCGTTGACAGAGGCGGAGGAATCTACATCCGTCTGGGCTTCGTCGGGGTCAAAGATCTGAACTGCGACGGTATTGAGGACCGAGTTGAAGTCGGCTTGAGATCCGGCTCCAGCAAGTGAACTCGTGATGGTATTTTTCGCCAGAAGGAAGCTTTCTTCTTCTTGCCTTGTGACGTCTTGGAGATGAACGTTGGTCTTTAACCAATCCTTCATCTTGTCTGATTCTAGGTAGGAATAATCGAATTTTTGCTTCCAAATATCTGTTCTGCACACAATGGTATAAAGACGTGCCAGGAAGGTGTCTTTGTACCATTTGTAGTTACTCATCTTTCGACATCGAAGGCTTAAAAGAGCTTCGGCGTTGAGATCTGAGTAGATTTGAGTGCTACCAATAAAGTTCTTGGTCATAGCATAGATTAATTGATTTACCATATCTGGCTCATCAACTTGCATAGCACTAA
AACCTTCCTGTTTGACAACTGTTTTAGTTGCCGTTAGGATCCTGGTTCTGGCTTCTTCGGTTAGCTGGTTATGCCACCAGCTTCTTAGGTTTCCAGAGAGACCCGAAATAAGGATCTGAGCTGTCTATAAAACCTGTTTCTTGGTGCTGAAGGCAGTGGCTACCATCATCATTTCTTGAAACGTGTTCATCATTTGAGCTTCAGAATACCCATCGATATTCCAAGTGATTATAGATCCATCATAAGATCGTTGATCATGGCGGAGATCGTCCCATCCTAAATTCGGAGGAGAGGGTTTAGGATAATGACTCTTCATTTCCGTATGCATTGTGACCGGAAGTATTGTTGAGGAAGAAGCTTGAGAAGTAGTAGGCATGGTAGCCACTACATTTATACTCTTGGCCGCTTCATTCTTCTGGGAAGATTCTCCTTTGTTGATGGACAAAGAGGAAAGTCTTCTGTTGATCTCAGCAAAAAGATCTGAGGGGTCTTCTTTGAGAGGTCCAATCTTAAAACTGTTTGGTTGGAAGATTGGCTGGCAGGGGTCTACTGGTGGGATCCCTGGTATTTTTGAGACGGTAGGAAGAACTGGATTCTCAATTCTCTCTACAGTTTTAGATACCTCAGAGAGGATCTTGTTTGAGTAGTTGAGTTGATGCTGGATGTTCTTGATTTCTCGAACACCAACTTTCTGGACTTTGTCTTCATCGATTGTTTTATAGGGTGAAGACACCATCTTTATGGCAGATATTACTGGGTGAGAGAAGGTGGCTTCTTCTTCCGGAGGGAAGGTGGAAGTTATCTCTTTGCCTCTGGCTGTGACCCAGACAGTGCCATTTCTTATGACTGCTACCTGTTCATCCGCCTGGAGTTTTTTTACTAGAGCTTCGTTCTTAACTAGCTCTCGTTCTTGTATCTTGAAGTGAAATCGCCAAAGGTCATGACGGGTTTCCTTGTTTCTTTTGGTGCAGCAATCCACATGTCGATGTACTTGCTGTATAGTTCTTCATAACGTTTCTCAAGTTTTGAGATTACGTTTATTTGATTGAAGGCGGATTCAGTTCTCCTTTCCATGTCAGATTGGGTTGGAGAGAGTGATTCTTCTTCGTAGGGGACGTCAGGGATTGGATGAGTGAAATCTACTGACGCTCGCATAGATTCAGATCTTTTAAAAGATCTGTTAGCAGTATTTACCGACCTAGTATCTGAAGATATACTTGGGCGGCTACTCATAACTTCTCTAATTCTGGGATTGGGAATAATATATATATATATATATATATATATATTATATAGAAAGGCCAAAAGTAGCAAAAATAAGATTTTAAAAGTGAATTTTGTTTTAATAATAAAGTTTTAAATTAAAGCTTTAGAAAACAAAATTAAAAAAATCTTTTTAAAAAGAAAAAGAAGTTTTTAAATCTATCCGTTCTAAAATAACTTTTTTTAATAATTATTTCAAAATTCCAATGACAAAGATTGAAAATGATAATCTTCCACTATTCTCGAGGTATTTATAAAGTATTATTAATTTTTAATGCATATAAATTATAAAGATAAAATTACTTGGTTAATTAAAAATTATTTATATTTAGGAATTAAACTTTTATTTTACTGTTTATTAAAATTCGACAACCCAATATTTGGAAAAAAAAATTATTTTATATTTATCTATTTTAGTTGTAAGATAATTTAATGGTCGCTGGAAGGAAATCTTTACTAATAATTACTTGGAGATACTAGACAAAAAAATATTGAAGGTTTGATAAGTAAGAATAATAAATGGTAAAATATAGATTTGTTTAAAAAAAACAACTGAGTGTTAGATTTTTAAGGGTGGGGAGTTAAAATTTAGAGTAAAATACATTTTTAGTCCCTAGATTTGGAGATAGGCTTTTATTTGGTCCTTAAGTTTCAAAAAGTGACAGTTTAATCCTCGACCTTTAGAAATATTGACCTGACTTGATAAAAAAAATGACACACGTGTTAAAAATAATATA
AAATTAAATAAATATCCAAATCAGCAAGTCAACATTCTGTTGGTGTCACTAACGGAAGTTTAACCCAATGACCTTTTAGAAAAAGGTTGGGGAACAAACTGTCAAATTTTGAAACTTAGAGATCAAATAGAAACCTATTTCAAACCTGGATTTTTAATCTCTCTAAAAAACCTAATTTGTGATTCATTTTCTTCTAATTTGTAAAACCTTTTCACTTTTTGTCTTTATAAAATTACTAAAATATCCCTATTTTTTCCGTTGGTTCTTTCTTTTCTACTCAAAACATGTAGGAGGAAAAAAAGTGTTGACGAGTATTTATCGTTTCTTCTATAAATTTTAAAATGAAAAAGGAAAGTTTTACTCCAAAGATAAAGATTTAAATTATGGTGTCATTTTAATACTATTATATTATAATGCATTTTATAAGTCATATTAATGAAATAAGATTTTCATTTATGTTTTTTTTAACGAAAATAACAACAAACAAGAAAACAATTATTTTCCTAAATTTGTAAATAATCTTTGAAGATATTTCTTAAAAATAAAATGGTTATAAAACGTATATACACCTTAGGGCATATTGAGTAATGTTCCATTTTTTAATTCTCATTTCTAATTTGTTTTAGTAACAAAAGCATATAAGCATTTAGTAATTGTTTCCATTTCTTGTTTCCATAGTTATGTAGAAATTTTGAAAACAAAATCAAGTTTTTATTTCTTGTTCTTGTTCACAGTTTTCAATTTTTATATTTCTATTTTTTCCAAAACATAGAATAAAGTATTACTTTATCATTTTTTATTTTTAAATATACAATAATAAAAATAAATAAATAAAACTTTTACTACTTTTATTTTAAATTTTTAGATAACAAGAAAGAAGAAATAGTTGTCGTTTCTACTAAAGAGATAACAAGAAATAAGAAATAGTTATCAAATACAATTTTGTTTTTCTTGATTGAAAAAGAGGAAACAAAAAATAAGAAACAATAACATTACTAAATTGACCCTCAAAAGCCAGAAAAAATAAACAAAAACGTTGTCAAACAACCCCATAAGACAGCCAGAAAACAAGAAATAAAAGAAGTGAAAGGAAGCATTAAGATATTACCTCAAGTAACCTACTGCCTCCCTATAAGCTTTATATGATGAGACAAATTTAACAGCTACTGTCTCAATGCCTATTGTTGTTGCTTTTTGGGGTCAGTTGAAACATCAAATCCAGGTGGAATGAAGACTACATCATCCCATGTTGAAATTGAATTATCCTATCAGTGAAACAAAATTACAACTTAGATCATAGGTTTACAACTACTTTTTTTCTATTAACAAGCATTTAGAGATATATATATATATATATATATATAAAAAGAGAGACAGCGAGTTCTTAAAAATAAACGGCTAAATTACAAAAATACACCTGCATTGTGCCCCTTGTTTCAAAAATACACGTATCCTTCCTTCCTTTCAACACGTTCAATACTATCCTTGACCTTTCATAAACATTTCAAACATACCCTTTGAGCAAACATCCGTTAATATTTTGAACAAAAAGTTGATGTGACATCATCTAGAACGACGTGACACTTAACAGATACGGATGTTTGCTCAAAGAGTATTTTTGAAACGTTTATGAAATGTCAAATGTAGTATTGAAACTTTTAAAATGATGAGGGCATTTTTTGTAATTTAGCCAAAATAAAAATGAGATAGGCAGTACGATCCTCACCTTCATTCCTTTTATTACTTCTTCAAAACCAGCAACCTACAAGGTTCAGACACAATTTTATAAGTTCCACTCCACAGAACACACTTCACTAGTGACAAGCAGAGAGAGCTATACCTGGATATCGCCCGGTCGGAAGGCAGGATTCACTTGATTTTGTTCTCGAGATGCTCTTTGGAGAGATTTTTCTAAATCCTCCTAAAAGAATCCAAGAATGAGGGTTGTAAAAATGAATCCATGTAAAGATGATAGTCCAGGTCAAGAAAACTACTCTTCT
TGACTAACCTTTCTGAAAAATGCAGGACGGTAGCTCTTGTTTTGGCTCCTCAGTATCAAGCTTTTCGACTATGCAAAGTGCATAAAATTGAAAACGTCAGTCTCAAATGTTTAGATCCTCAAAAAAGGTAATATAAACAAGATAAAAAACAAAATCCCAACATAATAATGGTTGAGGTGAGGAGAATGGAACTTAATATAATTATGCTTCATTAAGTCTAAAATCGAGAGTATTGTTCAACCCTATTAATTACTATATTTTATACTTTTATAGTCAAGAAATTCAAGGGTATATTCAAGTCCCCAGATAGGCAGCTCAATAGAGTTCTGAATCCATGATGACTTGGCAGAATCCAGAGGTAGCCGATTACATAAAACTTCTCCCGTTGAACAAAACTACATAAGTATAGAAAAACTATAATAATGAAAGAGTGGAGATAATGTACACCAAGAGATAGCCAAAAATGCTATAGTATAAAAACAAAAACGATAAAAATTCTCTCTGTCGCTGAAGATCCTTTGGTCCCTCTTTTCATACAATGCTGCCCAAAAAAAACACTCAAGTGTATCTAAAGAAGCTTCTTTTTCCTTTATAAAGGGGTTGCCTGATAAGGTAAGAGAAAGGAGAGTTTTCAGGTCATTTGGAAGAGCCGTAGTGGAAGGGCTCCAGGATAATCTTCCAGAATGTAGCTGCAAGGCTGCAAGAAATGAAGATTTGGCTTTGCATAATTGCATCTCTGCATTCTTCTCGCAAAGCGACAGTATTGATGTGTATTTTAAGGATCAGCAGCATTGACAACTCTCTTAGCTTCTCATCATTTCCCTAACAGTGGGGATTCTTGGAGGTGGAATCTTGAAATGGGAGTATGCTTTTCTACAAAATCCTTGGTTAGCAGGGTCCTCATCAAACACCAGCGATATATACAAAAAAACTCCTTGGCTGGCCTTATCTCCAAACTGGTGTAGCTTGCGTAAAATGGTTCATGAAACTCATCGAAAATCTGAGCACATTTCCTTCCAGCTTTCAGTTGGTCTTTGGCTCTCCCTTCAAAATTCAAGAGTGGTTTACCCCCATCTTAAAGGAGCATCCCTTCAAGCAAGAGAAGAAATCTCTCTGGTTCATCAACAGAGCCTTTATATGGATCATTTGAAATGAACAAAATGTTAGAAGATAAAGAAAAATCCTATGAAGCAGTGGTGGAACATATCTTATTCTTAGCCTTATTTTGGAGCAAAAAGGTTCCTTGTTTTCGTAATTATAGCTTGAATTCTCATATATCCAATTGGAGATGTTTTCTTTTAACTTCATTGGATTCATTTTCTCCCTTTTGTTAATTTCATATCATCAATTAAACTTGTTTCCCTTCCCAAAAAAAAAAAGATAGGACAACCTAAAGTGACATCCAATAGTTGGAGTGGGACCAGTGAAACCTTTCAAAAGAAAGATTTAAGAGGAGGAAGGTCAGAGGCAATCAAAAGATTGAGGAAGAGGAGAACAAATTTCCTTTATGGCTGAAATAGGCCAAATGCAAGGAGGCATAGATTATTGTTCATCCCTTGATACTTAAATGGAAATCAAAGAAGCGGAAAGATTTTCTTTTGTTTTCCTTTCATTCCTGTTTCGAGCTTTGAAAAGGAAGGGAGTTTTGGTTCAAGCTTCAAAGGCCATTAGGACAAGCTGAAGAAAGTTGTTGGGTGACTTTTCTTTTCTCCCTATAATGCACAAGTTTTTCTCCTTCTAATAATTTTAGTTCATCTCGACTAATCCACTATGACTCACCATGAACTTTAAGCTGGTGTGACGACAATGCCGACCATGGCATGCACTTCTTTATGTGCACGACATTGACTTGGGCTGAGGCATTGAAGGCAAGAGGCACCTTGCCTCTAGCCATAGGACATTGAGGCACTCGCCTCAATAACACTGCTTCTAAAGAGTCTTAGTTTCAATTGTTGAAAGAAAGCAGGGGAAAAGACAGGCCGCCTTTTCCTTGCTTCGTT
TTCTAGCTGCAATATTCTTTTTGTTATTGTGGTACACAATTACAGTCATATTTTTTTGTTTATTTGGTTTTATGTTGTTTGCTTCTATGCTTTTTTACAGGGATACTGCTCTTGATCAAATCTCAGAAACCAATTTCTGGAGTTTCAAGATGTCAAACTCCTATCAATATCACAAATAATTAAACATAGTTCAGCGGTAATTGGCACTTGTTTATGATCAAAAGGTCGTTATTTCAAATCCCCGCCCTCACTTCTTATCACAAATAATTAACCACCTTATGGACTTTTAAATAAAAACTAAATTCTCAAAAGTGGTACAATTATGCATCATATACAATCATGATTAGAAAAAGGAAAAAAAAAACAAAAGAAAAAGGATTGGCTCATATATCTGAATATGGGTTTGCTTTTCTAAAATGTTAATATACCAACTGATCTTTCAACTCAATGTTCAGCAATACATCATTGGTGAAGAAAATGACGACATTCAAATTTTACTTCCCAATTGAAAGGTTAACACTTGGCAGATTTGTAGCAGTACATCTTTCTCCACCATTGAAAGTGAATGGCATAGTTGCCAACAATGGCTATGAATGAATAACAATTTTCATTAAGGAATGCCTAGAAATTAACCTGGAAAACAGGAACCCCAGAAAATCCATCAGCAGAAATACCAGCCTTTTTTCTCTTTCCTACGCAGGCAAATTAATATTGTCAATGGACAGACAACTACACGATAAGTAAAGATGATTGCCAGAGCAGAGATTGAATTTTGGGCAACAGCCACAAGGGGATAAACAAAGACAGCAAATGATGATATACAGGCAAAACAGATATCGTGATTTGACATTTTTAAGGGAAATGAAAACAATATCAGAGAAACTAAATTCAAACACCATTCCCGCTTTCCAACAAACACAACAGAGAGAGAGGGAGGGGGGAGGAGAGAACTTACACTCAGAGAGTTTCTCACCTGAGAGCACTCCGGAATCAATCTGAAGGCCACCCCATTAACATGGAGCTGAAAAACCTACAAAAACAAGCAACAAATTTGAAAGAAGTATTTGGGTCATTTCCATGGAAACTCATTTCGTCGAGGGAAAGAGGGGAGAGTTGCAGAAGATTGAGCTGCGATTTCTTCGCAAGTGAGACAATGAATTAGACTGAGCGAGCATGATGCAGGGGAGGGGGTCTTTTCAAAATTATATGATTTGAGAATTTCATTATTTTAATGATATAATTTGAGTTAAATTATCCTTTTCACTTTTTCAATATAATTTTTAGGTTAAAATAGTATTTTCCTCTTGGTTTATTTTAGTCTTAGACTTTAAACATGTTTATTTTGATCTATGTAAATTCATCTTAGTCATTGAACCTTGAAAATGTTCATTTTAGTCCCTATACTTTTAAAAAGTGATCATTTTAGTCTATATTTGCAATATTTCAACACAAATTTTACAAATAATGAAAACTGTTAAGTATTGTTTACATTTTAAAAAATATATATTGATTGTACTGTGGTGTTGTAATTGTGTTAGAAAAAATTAGATTTAAAATAAAAACAAATGGACCAAAATGATCATTTTTTAAAAGTACATGTTTCAAAATAAACATTTTCAGAGTACAGGGACTAAAATGTCAAAGCGAGCATAGCTAACGGTAATTGGCATATACCACAAACCATGAAATTGTGAGTTCAAATGTCCTATTCAACATGTTGTACTAAAAAAAGTACGAACTAAAATAACCAATATTTAAAGTACAAATACCAAAATGAACATTTTAGAAGTATAAAGATTAAAATGAACTATGACTGAAAGCATAGGGTCCAAACTAGTATTTTAACCTAATTTTCATGATTTAGTCTGTAATTTGAGTAAGGATTTGAATATCCGAGGTATGTATTAACCAATCAAGTTGTCCAAATTAACAACAATTATCCCATGTATGTAAATAAGAAGTGAGTCTCCACCCTACTCATCTTGTGTCTAAAGAAAATG
TATCAGGTAAGAAACTTGTTATTGTACCATAATAATTTATAGTTTAAAATTTCATGATTTATTGTAGATTAAACTTGTATTCTCACGATTTAAAAATGTTGGGTTATAACACTATTTTTATCCTTTTACTTTCCAGCTTTAGCTTATTTTACTTCTGGTCCATATACTTTTCGAATTTGATTCATTTTTGTGTATGTACTTTCAAAATGTTCTTTTTGTCACTTACTTTTAAAAATAATCATTTTTGTCTCTATTTTCAATACAATACAATGACGCTATTGCATATCTTTTTGCTAGATATATAAAATTCAAAATTGGAATAAAAATCTAAAACGAAACAACTTAAACATGTTTATTGTTGTACTGGATCCGCAATTAAGCCTATGGATCCTTAGGATTGGAGTAGTCGTTTAATATATTGTATCTCGTAGCATCGTTCAAATCATACATACAGTCAAACATCACAACTTATTATGCCCGTCCTTTATATTGTTCTTTTTGTTTTGTTTTGGAATACAAATTTTGTAGTAGACGATATTTCACGAAAAAGGTTCTTCATATTCATAGGGAGAACCACTTTTTTTTTAATGTGAATTTGACATAACTAGTGAGAAGCCCCTATTTATAAATTAATAATTTGATTGAAAAAAAATTATCTTTTGCGAGAGCCATCAATATCACTTCACTATATATGACGAAACCTTTTTTTAAGATCCACTCAAAAATTATTCAACATAAAATCTATTGCATTTCAATAAATATATTGAATAGAGTGTGGGTTGAAATTGTGAAAGAAAAAATGAGATTGAAAATAAAATTAATAGAAAAAAATGGTCACATTTTGAAAGTATACTAGCGAAACCACTTTACCTATTGCAGGTTCGGCCCCTCTTAGTTAGCCTCGAATTAAGAAAAATTATTTAAAAAAAATATTAAGATTTTAAAGTTTTATAAAGTTGGAATGTAAAATTTTCTGTTCTTAATTAAATTTATGGACTAACTCCTAATTATATAATTTTTAAAATTTTGAAGTTTAGTTAAGTCTCCAACAAAATCCCTTTCACGACGCCCCCCGGCACTCTCTCTCTTTCAGTGACCATCTCCGGCGGCGGCGGCCATATGCAGCGGTGATCATATCCAGCAGCGGCGACGCTCTCTGTGAACGAACGTTCGTCTCGATTCAACTCTCACCTCACCGAAGCTCTTCATCTGTGTACGTCTTGTATACAAACCCATGTTCTTCCCTTGTTCTTGCATTTCTTCTGTCAAAGTCCGTGTTGTGCGAACTCGATTAAAGGGTGGTTTATCTGATTAAGGGATATGTTATTCTGGGTTGTTATCAAAAGGGTAGAATTTTATTGGGTTGTGCAAAATTTTGTTAAAAAAAAAAAGAAATCGATTTTGGGTTTTTCTTCTATGTTTTCGATCATTCTCCTTAATTTCCAGTCTGAAAATTGGGCTTTTTCTTCTATGAATGCATTCATGGGTCAATGTGTATCAATTACCCCCACAAACTTGAAGCAATGGGTGTAAAAATATATGTTAATGAATTGATAAGTTAATCTACAAACCGGAGAACCCGAAGAAGTTGGTCCACAAAATCTGGGTTGCTATTGATTGCTGCCGTCATATGTAGTTCTAAGTTCGTTGTTCTTACCTTGCCCACAAGTCTTATGAAGTTGTTAATTATGAAATATGGATTCATTTATATTTTTAATTTTCATTCTAGTTATTGCAAATTGCTACAGGAAATTGAATTCATAAAGCAGGGATGGAAAATTGAGCACGAAAGAGCATGATTCTTTGGTTTCTGTTTTAAAATTAGCTTGTGGAGGGTTGGGAGAAAGTAAATCCTTTGAAAGCAAATCGAGTCTTCTGAAAAATCTATATCTAGCCTTGAAAAACTGGTTAGATTCCAATTTCTAACCCTTCTTCGCTTTTGCATTTACTTGGGATGTTTACTCGATGAGATTCGATGATGTTGATGTGCATTTTCTCGA
GGAAACGTGGAGTACTAATGATTTTGTTTTTAGTTGGAGTTTAATGATTCCAAGATGTGCATTTTCTAAGTTTAATGATTATGTTCTTAGTTTTTTAATGGAGTTTCTTGAACTTTTTTGTTTGTTTTTATTTAGGAGGTCTAATTTTATCTTATTAATTCACAGGATGACTTCATATGAATTTTCTAAAATAAGTCATTATAGTCAGAGTGTTCCATTAGTTGAACTTGACAACTCTTATTGCCAGAAAATCGATTTCGGTTTTCCACTCCCTACCCGAAAACCGAACCGAATGAGGATATTTACACTCCTAGCTAACAATATTGAGAAGAGGGGAGAAGGTATATGTGTGTATATATATATTTCCTTTTAAAAAAAGATATATATGTTGATCAGGTCGGGTTGGTTTTGAGAGTTTTTTCCTAGCCGATTGACCAACAGTTTTGGAAAAATCCCAAATCGACCGCCGACCAGCCGACGTCAATTTGATCGGTTTCAACTTATCGGGTCGGTTTGGGAAATTCTCATATACACCCCTAGTTAAAATTATCAAAAATAAAAAATTCATAGACGAAAGCAAAAAAACAGTATTTAAAGAAAAGGAAAAAAAAAAAGTCTTACCTAACCTATGTTTGTTTTAGCAGTGAAATAAGAGGAGTGAGCAAATTAGGAGGACCTGGATGGAAACTTGTGTTCTCCATGGTTACGACTCTATTGGTTGCGGCTCCAAATTGGCGTTTCGGCTCCAATTCCCATACTATTCTTCTTCTTCTTCGTACCCTTTTTACCTTAACAAAATCAGGTGCGTCCCCTCCAAAAACCCAACTCCACAGCCTCTTCTATCCCTAACAACTTCTTGTGCAGTTCCAACGCACAGTCCTACAAAGCATGCCACTCTATTACTAGACACTTTTCACCAACACCATGCCCTCAAAACCTTGCTTTCCCACCTCCACAAAACCGATTCCTGCCCTTTGCTACTACTTACACACCATGGAGATTGGAGCACAGACCATTTCTGGGTTGTTGTCAAATTCCTCACACAAGCATCCAGATCCCATCAACTCCTCAAGGTGCTGCTTCTCCTTCTCCTACACATTTTAGATTACAATGTACTTTGCCTTCTTTTAGCTTTTGGACACGTTTGAGAGTGATTTTTTACTCTTGTCATTTTCTAGACCTGTTATACCTCCAATCAATCAGGTGTACGTGAGAAAAGAGGGGCAGTAAAAGTGGGGACTTGTGTTAGTTCCTGTTTTTATGGTTCAATCCTGCATGTCTTAGTGGGCTAGGTTGTGTTTCCCTAGCCTAGAGGTTTAGTTTGTGAGGAGGCAAGCAATTTAAGTGTGTGTCTAGGGAGAGATCGAAGTCCTCTCAAATTCCCTCTCTTTTAAATTGTTTCTTTCTATTTTTCCCCGATAGGAAGGGTAAAGATACCTATCAAAATCACTTGTATGTCATTGTAAAATATTATTATTTTAAATGGAGGTGATTTTGATAATGACAAAACTTGTCTTGTCAAAATCACTTTCAAACATGTCCTAATATTTTAGGATGATATACGAGATCTTAGAAAATGGCTGAAAGTGATTTTTGCCAAACATGTTCTTATTCTGGAATGATTTGGTTCTATTACCTGGGGAGAGGAGGCCTAAAAATGTGTTTTAACTTTTAACCTAGATTGTATAATATTCTCCCTCACTTGAAACAAAGGTTTGCCAATATATGTACAAAGTGATACATGTATTTGTACATGACTGATTAAGGGTGTGTTTGAGAGTGATGATAGGATAAGTAATTTTGAAAAAAAGAGATTTAAGTATAATTGATTTTTCAAAATCAACGTCCAAGTCATATCTTAGAAATCAACTTAAAGTGATTTTAAACTTTTCTAAATTCAATTTTTTAATTGACCAAACATGATCAATGGGAAGGGAAGTGATTTTGACAATGCCAAAAATCACTCCTAAGCGCATCCTAAATGCTCAACCTTTTATTTT
GTTGCCAAGAAATTATCGTTACAACTGAGAATCTACTTTCTGCGGCCATATTATTTTAAAGGAAACATTAAATCTGTTCTGAAATTTAATCTTCTGCATCCAAGGGAAAGTTGATAGATGCAAGGTTCTGATATACATTCCTTTAAATAACCGCCATCTATTTTCTTGGTTCTTTTCGGGATCCTATTGGGACTTGATAGTGGATTGGATCTATGTACATTTTGGGAGTTCTAGTGCATGTAGTGAATGAAAATTTGAAACTTTGTGCAGCTATTTGATGCGTGGAAGAACATTGAGAGATCACGGATTAACGAGAGTAACTATGAGAAGGTAATAGTTCTATTGAGTCGAGATGGTCTTATGGATGATGCTATGTCAGCATTTCAAGATATGAAGAGCATTGGCCTTCAACCATCTTTGGGTATTTACAATTCTCTCATCCATGGTTTTGCTGCAAAAGGTGAGTTTGAAACTGCTATGTTTTTCGTCAATGAGATGAAAGACATCAATATGACTCGAGAACCAGATACCTACGATGGCCTAATCGAAGCCTATGGAAAATATAGAATGTACGATGAGATGGTCAAGTGTCTGAAACAGATGGAATTGGATGGATGTTTTCCAGACCAAATAACTTACAATTTGCTTATCAGGGAGTTTTCCAAAGGTGGATTGCTTAAAAAGATGGAAGGTTTATACCAAACTATGCTTTCA

mRNA sequence

AGTCGAGATGGTCTTATGGATGATGCTATGTCAGCATTTCAAGATATGAAAGAGCATCTTCAACCATCTTTGGGTATTTACAACTCCCTCATCCATGGTTTTGCTGCAAAAGGTGAGTTTGAAACTGCTATGTTTTTCGTCAATGAGATGAAAGACATCAATATGACTCGAGAACCAGAAACCTACGATGGCCTAATCGAAGCCTATGGAAAATATAGAATGTACGATAAGATGGTCGAGTGTCTGAAACAGATGGAATTGGATGGATATTTTCCGGACCAAATAACTTACAATTTGCTTATCAGAGATTTTTCCAAAGATGGTCTTATGGATGATGCTATGTCAGCATTTCAAGATATGAAGAGCATTGGCCTTCAACCATCTTTGGGTATTTACAATTCTCTCATCCATGGTTTTGCTGCAAAAGGTGAGTTTGAAACTGCTATGTTTTTCGTCAATGAGATGAAAGACATCAATATGACTCGAGAACCAGATACCTACGATGGCCTAATCGAAGCCTATGGAAAATATAGAATGTACGATGAGATGGTCAAGTGTCTGAAACAGATGGAATTGGATGGATGTTTTCCAGACCAAATAACTTACAATTTGCTTATCAGGGAGTTTTCCAAAGGTGGATTGCTTAAAAAGATGGAAGGTTTATACCAAACTATGCTTTCA

Coding sequence (CDS)

AGTCGAGATGGTCTTATGGATGATGCTATGTCAGCATTTCAAGATATGAAAGAGCATCTTCAACCATCTTTGGGTATTTACAACTCCCTCATCCATGGTTTTGCTGCAAAAGGTGAGTTTGAAACTGCTATGTTTTTCGTCAATGAGATGAAAGACATCAATATGACTCGAGAACCAGAAACCTACGATGGCCTAATCGAAGCCTATGGAAAATATAGAATGTACGATAAGATGGTCGAGTGTCTGAAACAGATGGAATTGGATGGATATTTTCCGGACCAAATAACTTACAATTTGCTTATCAGAGATTTTTCCAAAGATGGTCTTATGGATGATGCTATGTCAGCATTTCAAGATATGAAGAGCATTGGCCTTCAACCATCTTTGGGTATTTACAATTCTCTCATCCATGGTTTTGCTGCAAAAGGTGAGTTTGAAACTGCTATGTTTTTCGTCAATGAGATGAAAGACATCAATATGACTCGAGAACCAGATACCTACGATGGCCTAATCGAAGCCTATGGAAAATATAGAATGTACGATGAGATGGTCAAGTGTCTGAAACAGATGGAATTGGATGGATGTTTTCCAGACCAAATAACTTACAATTTGCTTATCAGGGAGTTTTCCAAAGGTGGATTGCTTAAAAAGATGGAAGGTTTATACCAAACTATGCTTTCA

Protein sequence

SRDGLMDDAMSAFQDMKEHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAYGKYRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTMLS
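
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this with the standard genetic code, under the assumption that this is the code the annotation uses. The helper `translate` is hypothetical and is not part of any database tooling. The listed CDS begins with AGT rather than ATG, so the sketch translates from the first base and does not search for a start codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 bases of the CDS listed above.
prefix = "AGTCGAGATGGTCTTATGGATGATGCTATG"
print(translate(prefix))  # SRDGLMDDAM
```

The output matches the first ten residues of the protein sequence above.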
Homology
BLAST of MC07g0149 vs. ExPASy Swiss-Prot
Match: O23278 (Pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g14190 PE=2 SV=2)

HSP 1 Score: 152.5 bits (384), Expect = 5.6e-36
Identity = 73/132 (55.30%), Positives = 98/132 (74.24%), Query Frame = 0

Query: 97  YNLLIRDFSKDGLMDDAMSAFQDM-KSIGLQPSLGIYNSLIHGFAAKGEFETAMFFVNEM 156
           Y  +IR   ++  M +A+ AF+ M     L PSL IYNS+IH +A  G+FE AMF++N M
Sbjct: 134 YERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHM 193

Query: 157 KDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKGGLL 216
           K+  +    +TYDGLIEAYGK++MYDE+V CLK+ME DGC  D +TYNLLIREFS+GGLL
Sbjct: 194 KENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLL 253

Query: 217 KKMEGLYQTMLS 228
           K+ME +YQ+++S
Sbjct: 254 KRMEQMYQSLMS 265
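
The percentages in each HSP summary are the match counts divided by the alignment length. As a small sketch, the calculation for the O23278 hit above can be reproduced as follows. The function name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"

# Counts taken from the HSP summary of the O23278 hit.
print(percent(73, 132))  # 55.30  (identical residues)
print(percent(98, 132))  # 74.24  (identical or conservatively substituted residues)
```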

BLAST of MC07g0149 vs. ExPASy Swiss-Prot
Match: Q9LYZ9 (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 3.1e-26
Identity = 63/216 (29.17%), Positives = 112/216 (51.85%), Query Frame = 0

Query: 11  SAFQDMK-EHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAY 70
           S  + MK + + P    YN+LI         + A     EMK    + +  TY+ L++ Y
Sbjct: 265 SLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVY 324

Query: 71  GKYRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSL 130
           GK     + ++ L +M L+G+ P  +TYN LI  +++DG++D+AM     M   G +P +
Sbjct: 325 GKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDV 384

Query: 131 GIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQ 190
             Y +L+ GF   G+ E+AM    EM++        T++  I+ YG    + EM+K   +
Sbjct: 385 FTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDE 444

Query: 191 MELDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTM 226
           + + G  PD +T+N L+  F + G+  ++ G+++ M
Sbjct: 445 INVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEM 480

BLAST of MC07g0149 vs. ExPASy Swiss-Prot
Match: Q9S7Q2 (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 2.6e-25
Identity = 68/227 (29.96%), Positives = 112/227 (49.34%), Query Frame = 0

Query: 1   SRDGL-MDDAMSAFQDMK-EHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTRE 60
           +R GL  +  +  F +M+ E +QP +  YN+L+   A +G  + A      M D  +  +
Sbjct: 222 ARGGLDWEGLLGLFAEMRHEGIQPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPD 281

Query: 61  PETYDGLIEAYGKYRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQ 120
             TY  L+E +GK R  +K+ + L +M   G  PD  +YN+L+  ++K G + +AM  F 
Sbjct: 282 LTTYSHLVETFGKLRRLEKVCDLLGEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFH 341

Query: 121 DMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYR 180
            M++ G  P+   Y+ L++ F   G ++       EMK  N   +  TY+ LIE +G+  
Sbjct: 342 QMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGG 401

Query: 181 MYDEMVKCLKQMELDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTM 226
            + E+V     M  +   PD  TY  +I    KGGL +    + Q M
Sbjct: 402 YFKEVVTLFHDMVEENIEPDMETYEGIIFACGKGGLHEDARKILQYM 448

BLAST of MC07g0149 vs. ExPASy Swiss-Prot
Match: Q9FMQ1 (Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g12100 PE=2 SV=1)

HSP 1 Score: 114.8 bits (286), Expect = 1.3e-24
Identity = 70/209 (33.49%), Positives = 103/209 (49.28%), Query Frame = 0

Query: 17  KEHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAYGKYRMYD 76
           K+ ++P    YN LI  F   GE E A   VN+MK   ++   ETY+ LI  YG+   +D
Sbjct: 417 KQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVETYNILIGGYGRKYEFD 476

Query: 77  KMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLI 136
           K  + LK+ME +G  P+ ++Y  LI    K   + +A    +DM+  G+ P + IYN LI
Sbjct: 477 KCFDILKEMEDNGTMPNVVSYGTLINCLCKGSKLLEAQIVKRDMEDRGVSPKVRIYNMLI 536

Query: 137 HGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCF 196
            G  +KG+ E A  F  EM    +     TY+ LI+         E    L ++   G  
Sbjct: 537 DGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLSEAEDLLLEISRKGLK 596

Query: 197 PDQITYNLLIREFSKGGLLKKMEGLYQTM 226
           PD  TYN LI  +   G +++   LY+ M
Sbjct: 597 PDVFTYNSLISGYGFAGNVQRCIALYEEM 625

BLAST of MC07g0149 vs. ExPASy Swiss-Prot
Match: Q9SIC9 (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 2.2e-24
Identity = 63/216 (29.17%), Positives = 117/216 (54.17%), Query Frame = 0

Query: 13  FQDMKEH-LQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAYGK 72
           F +M+ + +QP    +NSL+   +  G +E A    +EM +  + ++  +Y+ L++A  K
Sbjct: 327 FDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVFSYNTLLDAICK 386

Query: 73  YRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGI 132
               D   E L QM +    P+ ++Y+ +I  F+K G  D+A++ F +M+ +G+      
Sbjct: 387 GGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMRYLGIALDRVS 446

Query: 133 YNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQME 192
           YN+L+  +   G  E A+  + EM  + + ++  TY+ L+  YGK   YDE+ K   +M+
Sbjct: 447 YNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYDEVKKVFTEMK 506

Query: 193 LDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTMLS 228
            +   P+ +TY+ LI  +SKGGL K+   +++   S
Sbjct: 507 REHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKS 542

BLAST of MC07g0149 vs. NCBI nr
Match: XP_022150398.1 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Momordica charantia] >XP_022150399.1 pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Momordica charantia] >XP_022150400.1 pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Momordica charantia] >XP_022150401.1 pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Momordica charantia])

HSP 1 Score: 254 bits (648), Expect = 3.49e-78
Identity = 124/135 (91.85%), Positives = 128/135 (94.81%), Query Frame = 0

Query: 93  DQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV 152
           ++  Y  +I   S+DGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV
Sbjct: 145 NESNYEKVIVLLSRDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV 204

Query: 153 NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG 212
           NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG
Sbjct: 205 NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG 264

Query: 213 GLLKKMEGLYQTMLS 227
           GLLKKMEGLYQTMLS
Sbjct: 265 GLLKKMEGLYQTMLS 279

BLAST of MC07g0149 vs. NCBI nr
Match: XP_023514113.1 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 223 bits (569), Expect = 2.12e-66
Identity = 105/141 (74.47%), Positives = 125/141 (88.65%), Query Frame = 0

Query: 87  LDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFE 146
           ++G    +  Y  +I   S+DGLM+DA+SAFQDMKS+GL+PSLG YN+LIHGFAA+G+FE
Sbjct: 139 IEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFE 198

Query: 147 TAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLI 206
            AM F++EMK+INMTRE DTYDGLIEAYGKYRMYDEM++CLKQMELDGCFPD ITYNLLI
Sbjct: 199 IAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLI 258

Query: 207 REFSKGGLLKKMEGLYQTMLS 227
           REFSKGGLLKKMEGLY+++LS
Sbjct: 259 REFSKGGLLKKMEGLYRSILS 279

BLAST of MC07g0149 vs. NCBI nr
Match: XP_023004999.1 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Cucurbita maxima])

HSP 1 Score: 223 bits (569), Expect = 2.27e-66
Identity = 106/141 (75.18%), Positives = 125/141 (88.65%), Query Frame = 0

Query: 87  LDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFE 146
           ++G   ++  Y  +I   S+DGLM+DA+SAFQDMKSIGL+PSLG YNSLIHGFAA+G+FE
Sbjct: 142 IEGSRINESNYEKVIVLLSQDGLMEDAVSAFQDMKSIGLRPSLGTYNSLIHGFAARGKFE 201

Query: 147 TAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLI 206
            AM F++EMK+INM RE DTYDGLIEAYGKYRMYDEM++CLKQMELDGCFPD ITYNLLI
Sbjct: 202 IAMLFIDEMKEINMIREADTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLI 261

Query: 207 REFSKGGLLKKMEGLYQTMLS 227
           REFSKGGLLKKMEGLY+++LS
Sbjct: 262 REFSKGGLLKKMEGLYRSILS 282

BLAST of MC07g0149 vs. NCBI nr
Match: XP_022959989.1 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Cucurbita moschata])

HSP 1 Score: 222 bits (566), Expect = 5.92e-66
Identity = 105/141 (74.47%), Positives = 125/141 (88.65%), Query Frame = 0

Query: 87  LDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFE 146
           ++G   ++  Y  +I   S+DGLM+DA+S+FQDMKSIGL+PSLG YNSLIHGFAA+G+FE
Sbjct: 139 IEGSRINESNYEKVIVLLSQDGLMEDAVSSFQDMKSIGLRPSLGTYNSLIHGFAARGKFE 198

Query: 147 TAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLI 206
            AM F++EMK+INMTRE DTYDGLIEAYGKYRMYDEM++CLKQMELDGC PD ITYNLLI
Sbjct: 199 IAMLFIDEMKEINMTREADTYDGLIEAYGKYRMYDEMIECLKQMELDGCIPDHITYNLLI 258

Query: 207 REFSKGGLLKKMEGLYQTMLS 227
           REFSKGGLLKKMEGLY+++LS
Sbjct: 259 REFSKGGLLKKMEGLYRSILS 279

BLAST of MC07g0149 vs. NCBI nr
Match: KAG6592896.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 221 bits (563), Expect = 1.47e-65
Identity = 104/141 (73.76%), Positives = 125/141 (88.65%), Query Frame = 0

Query: 87  LDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFE 146
           ++G   ++  Y  +I   S+DGLM+DA+S+FQDMKSIGL+PSLG YNSLIHGFAA+G+FE
Sbjct: 134 IEGSRINESNYEKVIVLLSQDGLMEDAVSSFQDMKSIGLRPSLGTYNSLIHGFAARGKFE 193

Query: 147 TAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLI 206
            A+ F++EMK+INMTRE DTYDGLIEAYGKYRMYDEM++CLKQMELDGC PD ITYNLLI
Sbjct: 194 IALLFIDEMKEINMTREADTYDGLIEAYGKYRMYDEMIECLKQMELDGCIPDHITYNLLI 253

Query: 207 REFSKGGLLKKMEGLYQTMLS 227
           REFSKGGLLKKMEGLY+++LS
Sbjct: 254 REFSKGGLLKKMEGLYRSILS 274

BLAST of MC07g0149 vs. ExPASy TrEMBL
Match: A0A6J1DBE2 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018567 PE=3 SV=1)

HSP 1 Score: 254 bits (648), Expect = 1.69e-78
Identity = 124/135 (91.85%), Positives = 128/135 (94.81%), Query Frame = 0

Query: 93  DQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV 152
           ++  Y  +I   S+DGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV
Sbjct: 145 NESNYEKVIVLLSRDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV 204

Query: 153 NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG 212
           NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG
Sbjct: 205 NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG 264

Query: 213 GLLKKMEGLYQTMLS 227
           GLLKKMEGLYQTMLS
Sbjct: 265 GLLKKMEGLYQTMLS 279

BLAST of MC07g0149 vs. ExPASy TrEMBL
Match: A0A6J1KXW9 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111498121 PE=3 SV=1)

HSP 1 Score: 223 bits (569), Expect = 1.10e-66
Identity = 106/141 (75.18%), Positives = 125/141 (88.65%), Query Frame = 0

Query: 87  LDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFE 146
           ++G   ++  Y  +I   S+DGLM+DA+SAFQDMKSIGL+PSLG YNSLIHGFAA+G+FE
Sbjct: 142 IEGSRINESNYEKVIVLLSQDGLMEDAVSAFQDMKSIGLRPSLGTYNSLIHGFAARGKFE 201

Query: 147 TAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLI 206
            AM F++EMK+INM RE DTYDGLIEAYGKYRMYDEM++CLKQMELDGCFPD ITYNLLI
Sbjct: 202 IAMLFIDEMKEINMIREADTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLI 261

Query: 207 REFSKGGLLKKMEGLYQTMLS 227
           REFSKGGLLKKMEGLY+++LS
Sbjct: 262 REFSKGGLLKKMEGLYRSILS 282

BLAST of MC07g0149 vs. ExPASy TrEMBL
Match: A0A6J1H9M9 (pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111460878 PE=3 SV=1)

HSP 1 Score: 222 bits (566), Expect = 2.87e-66
Identity = 105/141 (74.47%), Positives = 125/141 (88.65%), Query Frame = 0

Query: 87  LDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFE 146
           ++G   ++  Y  +I   S+DGLM+DA+S+FQDMKSIGL+PSLG YNSLIHGFAA+G+FE
Sbjct: 139 IEGSRINESNYEKVIVLLSQDGLMEDAVSSFQDMKSIGLRPSLGTYNSLIHGFAARGKFE 198

Query: 147 TAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLI 206
            AM F++EMK+INMTRE DTYDGLIEAYGKYRMYDEM++CLKQMELDGC PD ITYNLLI
Sbjct: 199 IAMLFIDEMKEINMTREADTYDGLIEAYGKYRMYDEMIECLKQMELDGCIPDHITYNLLI 258

Query: 207 REFSKGGLLKKMEGLYQTMLS 227
           REFSKGGLLKKMEGLY+++LS
Sbjct: 259 REFSKGGLLKKMEGLYRSILS 279

BLAST of MC07g0149 vs. ExPASy TrEMBL
Match: A0A540MIL8 (Uncharacterized protein OS=Malus baccata OX=106549 GN=C1H46_016376 PE=4 SV=1)

HSP 1 Score: 179 bits (454), Expect = 6.34e-51
Identity = 82/135 (60.74%), Positives = 110/135 (81.48%), Query Frame = 0

Query: 93  DQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFV 152
           ++  Y  +I   S++GLM++A   FQ+MKS  L+PSL +YNS+IHGFA +G F+ A+F+ 
Sbjct: 11  NEFNYIKIIGLLSEEGLMEEAAPCFQEMKSHDLRPSLEVYNSMIHGFARQGNFDDALFYF 70

Query: 153 NEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKG 212
           NEM+++N+  E DTYDGLIEAYGKY+MYDEM  C+K+M+L+GC PD ITYNLLIREFS+G
Sbjct: 71  NEMREMNVAPETDTYDGLIEAYGKYKMYDEMGTCVKKMKLNGCPPDHITYNLLIREFSRG 130

Query: 213 GLLKKMEGLYQTMLS 227
           GLLK+ME +YQ+MLS
Sbjct: 131 GLLKRMESVYQSMLS 145

BLAST of MC07g0149 vs. ExPASy TrEMBL
Match: A0A5B7BLA3 (PPR_long domain-containing protein (Fragment) OS=Davidia involucrata OX=16924 GN=Din_038595 PE=3 SV=1)

HSP 1 Score: 182 bits (463), Expect = 6.58e-51
Identity = 82/134 (61.19%), Positives = 110/134 (82.09%), Query Frame = 0

Query: 94  QITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFVN 153
           +  Y  ++    ++GLM+DA+SAF+ MKS GL+PSL IYNS+IHGFA+KG FE ++F++ 
Sbjct: 165 EFNYEKILGLLVEEGLMEDAVSAFRGMKSHGLRPSLEIYNSMIHGFASKGRFEDSLFYLE 224

Query: 154 EMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKGG 213
           EM++IN+  + +TYDGLIEAYG Y+MYDEM KC+K+ME DGC PD +TYNLLIREFS+ G
Sbjct: 225 EMEEINLKPDTETYDGLIEAYGNYKMYDEMGKCMKKMEYDGCLPDHVTYNLLIREFSRAG 284

Query: 214 LLKKMEGLYQTMLS 227
           LLK+ME +YQT+LS
Sbjct: 285 LLKRMERVYQTLLS 298

BLAST of MC07g0149 vs. TAIR 10
Match: AT4G14190.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 152.5 bits (384), Expect = 4.0e-37
Identity = 73/132 (55.30%), Positives = 98/132 (74.24%), Query Frame = 0

Query: 97  YNLLIRDFSKDGLMDDAMSAFQDM-KSIGLQPSLGIYNSLIHGFAAKGEFETAMFFVNEM 156
           Y  +IR   ++  M +A+ AF+ M     L PSL IYNS+IH +A  G+FE AMF++N M
Sbjct: 134 YERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNHM 193

Query: 157 KDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCFPDQITYNLLIREFSKGGLL 216
           K+  +    +TYDGLIEAYGK++MYDE+V CLK+ME DGC  D +TYNLLIREFS+GGLL
Sbjct: 194 KENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGLL 253

Query: 217 KKMEGLYQTMLS 228
           K+ME +YQ+++S
Sbjct: 254 KRMEQMYQSLMS 265

BLAST of MC07g0149 vs. TAIR 10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 120.2 bits (300), Expect = 2.2e-27
Identity = 63/216 (29.17%), Positives = 112/216 (51.85%), Query Frame = 0

Query: 11  SAFQDMK-EHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAY 70
           S  + MK + + P    YN+LI         + A     EMK    + +  TY+ L++ Y
Sbjct: 265 SLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVY 324

Query: 71  GKYRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSL 130
           GK     + ++ L +M L+G+ P  +TYN LI  +++DG++D+AM     M   G +P +
Sbjct: 325 GKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDV 384

Query: 131 GIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQ 190
             Y +L+ GF   G+ E+AM    EM++        T++  I+ YG    + EM+K   +
Sbjct: 385 FTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDE 444

Query: 191 MELDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTM 226
           + + G  PD +T+N L+  F + G+  ++ G+++ M
Sbjct: 445 INVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEM 480

BLAST of MC07g0149 vs. TAIR 10
Match: AT1G74850.1 (plastid transcriptionally active 2 )

HSP 1 Score: 117.1 bits (292), Expect = 1.9e-26
Identity = 68/227 (29.96%), Positives = 112/227 (49.34%), Query Frame = 0

Query: 1   SRDGL-MDDAMSAFQDMK-EHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTRE 60
           +R GL  +  +  F +M+ E +QP +  YN+L+   A +G  + A      M D  +  +
Sbjct: 222 ARGGLDWEGLLGLFAEMRHEGIQPDIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPD 281

Query: 61  PETYDGLIEAYGKYRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQ 120
             TY  L+E +GK R  +K+ + L +M   G  PD  +YN+L+  ++K G + +AM  F 
Sbjct: 282 LTTYSHLVETFGKLRRLEKVCDLLGEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFH 341

Query: 121 DMKSIGLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYR 180
            M++ G  P+   Y+ L++ F   G ++       EMK  N   +  TY+ LIE +G+  
Sbjct: 342 QMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGG 401

Query: 181 MYDEMVKCLKQMELDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTM 226
            + E+V     M  +   PD  TY  +I    KGGL +    + Q M
Sbjct: 402 YFKEVVTLFHDMVEENIEPDMETYEGIIFACGKGGLHEDARKILQYM 448

BLAST of MC07g0149 vs. TAIR 10
Match: AT5G12100.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 114.8 bits (286), Expect = 9.2e-26
Identity = 70/209 (33.49%), Positives = 103/209 (49.28%), Query Frame = 0

Query: 17  KEHLQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAYGKYRMYD 76
           K+ ++P    YN LI  F   GE E A   VN+MK   ++   ETY+ LI  YG+   +D
Sbjct: 417 KQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVETYNILIGGYGRKYEFD 476

Query: 77  KMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGIYNSLI 136
           K  + LK+ME +G  P+ ++Y  LI    K   + +A    +DM+  G+ P + IYN LI
Sbjct: 477 KCFDILKEMEDNGTMPNVVSYGTLINCLCKGSKLLEAQIVKRDMEDRGVSPKVRIYNMLI 536

Query: 137 HGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQMELDGCF 196
            G  +KG+ E A  F  EM    +     TY+ LI+         E    L ++   G  
Sbjct: 537 DGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLSEAEDLLLEISRKGLK 596

Query: 197 PDQITYNLLIREFSKGGLLKKMEGLYQTM 226
           PD  TYN LI  +   G +++   LY+ M
Sbjct: 597 PDVFTYNSLISGYGFAGNVQRCIALYEEM 625

BLAST of MC07g0149 vs. TAIR 10
Match: AT2G31400.1 (genomes uncoupled 1 )

HSP 1 Score: 114.0 bits (284), Expect = 1.6e-25
Identity = 63/216 (29.17%), Positives = 117/216 (54.17%), Query Frame = 0

Query: 13  FQDMKEH-LQPSLGIYNSLIHGFAAKGEFETAMFFVNEMKDINMTREPETYDGLIEAYGK 72
           F +M+ + +QP    +NSL+   +  G +E A    +EM +  + ++  +Y+ L++A  K
Sbjct: 327 FDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVFSYNTLLDAICK 386

Query: 73  YRMYDKMVECLKQMELDGYFPDQITYNLLIRDFSKDGLMDDAMSAFQDMKSIGLQPSLGI 132
               D   E L QM +    P+ ++Y+ +I  F+K G  D+A++ F +M+ +G+      
Sbjct: 387 GGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMRYLGIALDRVS 446

Query: 133 YNSLIHGFAAKGEFETAMFFVNEMKDINMTREPDTYDGLIEAYGKYRMYDEMVKCLKQME 192
           YN+L+  +   G  E A+  + EM  + + ++  TY+ L+  YGK   YDE+ K   +M+
Sbjct: 447 YNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYDEVKKVFTEMK 506

Query: 193 LDGCFPDQITYNLLIREFSKGGLLKKMEGLYQTMLS 228
            +   P+ +TY+ LI  +SKGGL K+   +++   S
Sbjct: 507 REHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKS 542
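
The percentages in each HSP statistics line follow directly from the counts next to them: identical (or positive-scoring) columns divided by the alignment length. The Python sketch below is only an illustration of that arithmetic, not part of the database's own pipeline. The names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the line layout shown in the alignments above while tolerating either spelling of the Positives label.

```python
import re

# Assumed layout, taken from the alignments above:
# "Identity = 70/209 (33.49%), Positives = 103/209 (49.28%), Query Frame = 0"
# "Posi?tives" accepts the label with or without the second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, total):
    """Recompute a percentage the way it is reported (two decimal places)."""
    return round(100.0 * count / total, 2)


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 70/209 (33.49%), Positives = 103/209 (49.28%), Query Frame = 0"
    )
    print(stats)
    print(percent(stats["identical"], stats["aligned_length"]))
    print(percent(stats["positives"], stats["aligned_length"]))
```

Comparing the recomputed value with the reported one is a quick sanity check when these lines are copied or re-parsed from the page.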

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
O23278 | 5.6e-36 | 55.30 | Pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Arabidop...
Q9LYZ9 | 3.1e-26 | 29.17 | Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX...
Q9S7Q2 | 2.6e-25 | 29.96 | Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
Q9FMQ1 | 1.3e-24 | 33.49 | Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidop...
Q9SIC9 | 2.2e-24 | 29.17 | Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop...

Match Name | E-value | Identity (%) | Description
XP_022150398.1 | 3.49e-78 | 91.85 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Momordica ...
XP_023514113.1 | 2.12e-66 | 74.47 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Cucurbita ...
XP_023004999.1 | 2.27e-66 | 75.18 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Cucurbita ...
XP_022959989.1 | 5.92e-66 | 74.47 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Cucurbita ...
KAG6592896.1 | 1.47e-65 | 73.76 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name | E-value | Identity (%) | Description
A0A6J1DBE2 | 1.69e-78 | 91.85 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Momordic...
A0A6J1KXW9 | 1.10e-66 | 75.18 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Cucurbit...
A0A6J1H9M9 | 2.87e-66 | 74.47 | pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Cucurbit...
A0A540MIL8 | 6.34e-51 | 60.74 | Uncharacterized protein OS=Malus baccata OX=106549 GN=C1H46_016376 PE=4 SV=1
A0A5B7BLA3 | 6.58e-51 | 61.19 | PPR_long domain-containing protein (Fragment) OS=Davidia involucrata OX=16924 GN...

Match Name | E-value | Identity (%) | Description
AT4G14190.1 | 4.0e-37 | 55.30 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G02860.1 | 2.2e-27 | 29.17 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G74850.1 | 1.9e-26 | 29.96 | plastid transcriptionally active 2
AT5G12100.1 | 9.2e-26 | 33.49 | pentatricopeptide (PPR) repeat-containing protein
AT2G31400.1 | 1.6e-25 | 29.17 | genomes uncoupled 1

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
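IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 1..84; e-value: 1.3E-16; score: 62.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 85..227; e-value: 1.5E-29; score: 105.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 95..128; e-value: 1.6E-4; score: 19.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 26..51; e-value: 1.8E-4; score: 19.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 131..156; e-value: 1.8E-4; score: 19.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 200..226; e-value: 0.0026; score: 15.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 166..198; e-value: 8.5E-8; score: 29.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 61..93; e-value: 7.2E-7; score: 27.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 166..210; e-value: 1.0E-7; score: 32.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 88..121; e-value: 2.0E-7; score: 30.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 124..156; e-value: 1.9E-5; score: 24.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 26..53; e-value: 5.6E-5; score: 23.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 1..18; e-value: 0.43; score: 10.9
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 93..127; score: 12.167101
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 128..162; score: 9.097937
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 163..197; score: 9.941957
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 23..57; score: 9.097937
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 198..227; score: 9.097937
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 58..92; score: 9.624079
None | No IPR available | PANTHER | PTHR47493:SF3 | BNAC09G53180D PROTEIN | coord: 3..110
None | No IPR available | PANTHER | PTHR47493:SF3 | BNAC09G53180D PROTEIN | coord: 107..227
None | No IPR available | PANTHER | PTHR47493 | OS08G0520200 PROTEIN | coord: 3..110
None | No IPR available | PANTHER | PTHR47493 | OS08G0520200 PROTEIN | coord: 107..227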
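
The PROSITE PS51375 coordinates listed above appear to tile the protein in consecutive windows. A minimal Python sketch of how that could be checked is given here, assuming the coordinates are 1-based and inclusive. The helper names `merge_intervals` and `covered_residues` are hypothetical, and the coordinate list is copied by hand from the InterPro table.

```python
# PROSITE PS51375 (PPR) hit coordinates as listed in the InterPro table,
# assumed to be 1-based and inclusive.
PPR_PROFILE_HITS = [
    (93, 127),
    (128, 162),
    (163, 197),
    (23, 57),
    (198, 227),
    (58, 92),
]


def merge_intervals(hits):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous interval when this one touches or overlaps it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(hits):
    """Count residues covered by at least one hit."""
    return sum(end - start + 1 for start, end in merge_intervals(hits))


if __name__ == "__main__":
    print(merge_intervals(PPR_PROFILE_HITS))
    print(covered_residues(PPR_PROFILE_HITS))
```

Under these assumptions the six hits merge into a single block spanning residues 23 to 227, which is consistent with back-to-back PPR motifs.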

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC07g0149.1 | MC07g0149.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding