IVF0012595 (gene) Melon (IVF77) v1

Overview
NameIVF0012595
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Locationchr01: 2027743 .. 2042600 (+)
RNA-Seq ExpressionIVF0012595
SyntenyIVF0012595
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTATTACCCAAATACTACTCGGGACAGTGGCGCCCTTCACTCCACCATGCTCATCCGGAGGTTTTATAGAGCCGCGGCATGGGCGACACCTCTGTTGCGACACCCAACCGTAAGTCTTCGAATTCGTTGTTCTCTTCATTTGTCTCCTCGAACCTTTGAATTCTTGGATTTCCGTTAGTTCTTTTTGGTTTCAATGTTTGTTAGTTGTCTTCATGTGCTTTTCCGTTTTGCTAATTGGATTGGCAACAGGAAATGCTTAGTTTCTATTTTTTTTTTTTCTGCCTAAATCCCTTGCATTATTGTTTATTTTTATGTTTTCAACAAATGGCTAATGAAAGGACCAAGGGGAAGAACTGGATACCTCTTCTTACAACCCTGAACTTACATTTGTTATGTATTTATTTATTTTTATTTTTTATTCGGGAATTTCTGGGCTGATTTACTTGCACTTCGATTAATCTCATGGGAGAATCTGCTTGACCTTACAATATTGAAGTGTCAACTCGTAGGATATTATCATAAGTAGATGACCACAATGTTTGAAACTGAACTTATTTGGTTGTTGTGATCTTTCCCTAATAATGTGAATCTTGCATTAGGATTTTTTGGATTGCAATCATCTATTGTTGATAAATCCTCTGTTTTGCATTACTTGGGGAATAATTAACTGTGGCTAAAGGCTCTCCCCGTCATTGATTTCTTATTGTCTATATTAGGAAATCCTTGGTTTACCCAGAACTAAGCCTTTGTTGGCCTAGTCTTATTTTGGAACCTCAAGACCTTGGTTGGATATACTTAGAGCAGAATTGCGTATTAATTTCCTGTTAATATATTAACCTATGATACCTTTCTCGACCTTTTGGCTAAGATCAAGTGTATAATATATTAACCTATGCTAACTAGTTTCAGACTGCTTTTAAGGAATCTAAATTTTTCAATAAATGAGCACACAGGTAGGGAAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGTGCTCTTGGTACTGCACAATGATACAAGATCAAATGTATAAACAGCTTGCTGATAAAGATAGAAAAAATAAGGTATCTGAGATAAATTATATAGTCCATGCTTCGAAAGTATGACATACTTCTGAAGTCCTTCGATGCCTCGGCAGATTAAAATTAGTTTGGTTGATATTTTAAAGATTTGGCCTCTTTTTCCTTTTGGTATAACTGTTGTAGCTCTCACTGTTAATATTGCATTATTTGGCAGCAGAAATGATATTAAGGTTGATGAAATATTGTTTATCGAAGGGTTTGAAGAAAAAAGAGAGTCCTGTCATCTTTGTATTTGGACTTGGGAGGATAAAATAGCTCATCTGGACTTGTCAAGTTTCATGTGCTTTCAAATCTTTCCTGTAGTTGTTAGTCTCTTCTCTTATTGTGCATTAATATCGGGACTTATAAAATGTACTTTATTTATTACTTTTGTTTGTTTCTTTTTTTCCGGACAAAGATTACACAAATTGATATTCCTTCACAGAGTAACAGTGAATATTGAAATTATTGTAAATTTATTTGTTGGTAAATTAGTTGACTGTTGAATTCTTTTTACCAATGATCCAGGATGTTGACAATAGTAAAGCTTTGGGGCACATTTCAGAGCAAAATATTGGAGACATTAGAAAGCACAAAATTGGGGAAAATGTTTCACGGAAAGACAAAATTAGTTTTCTTGTAAATACGGCAAGTATAAATGGGAAAATATCTCCCATCTCTCTCTCTCTCATCACATATGGGTGTATTTTGATTTCATAGAAAGCTTTAGGAATAACTAATAGTCTAATTCTGAAAGGGGAAAAAAAAAGCAGAGAACGGCACTTTGAGTATTCAAATGCTAAAACTGTTTGCATCAATAAGAGTTGCATACGTCAAACAGTAATTGGCAAGACTTCTTGTTTCCTGCTAAAAGTAATAGCGATAAGCCTTTCAGATCTCCTATTTCCTTCTGTAACAAACCTAACACTTGATTGTACATATCGGTTGGGTTCAACACAGCTTCTTGCCTCATCATGGTGTTTTTAATCAAAAGTTTTTCCTAGTTATTCCGTTCAGAACATTTGTCTGAAGTTTGAACTTGAATGCTTTCATTAATTCAGTTTAATTTATGCTTTCATTAATTCGGTTTAATTTATTTCTTTGTCTTTTTAGTCTTGGTATCTAGTATTTTGGAGCACTAGACTCTTTCATTAAGTCGATGAAAAGTATTGTAAGTTGTTTCTGTTAAAAAAAATCCCTATTTTCTTCTGTCGTTACTATACATTAGTGATATATAATTAAATTTGTCTTCAATCACTAACTTAAGCATTTGGTTGAATTGGTGATTTAAGATGGTATCAAAGAAGGTGGTCTAGGGAGGTCCTGTGTTCAATCCCTTGTATTGTTGTTTTCTCCCAATTAACATTGATTTCCATTTGTTGGGTCTTTCAAATATTTTAAGCCCACAAGTGAGGAGGAGTGTTAGTGATATAATTAAATTTGCCTTTAACCACCAACTTAAACTTTTGGTTGAATTGGTGATTTAAGACTATTATTTACTTCATTACATCTTTTCGCTTCTTCTTTTTCACACTGGGTAAGAATGGATCAAGAGATGGTCAACAAGCTATGGTTGTGTCTAGATTGTTTGCTCCTAATAACTGCAAGTAGATTAAAGGTGCTCAGGTAGATCATTTACCATCTCTATTTCGCTTAATCTTTCCATGGCAGATATGGCTCTACTCAAACTTGATGATAGTAAGATTAGTGGGTTGGAAGCAGACCCCTCTGGTCGATGTTCCCTCGTCTCTTTCATCTTTTTTCCTTTAAAAAAATCATGTGGTGTTTTGGTTGAGAAGATCCTTTTCCTTTTCATTGCTCTTCATTCGATGGGGAAGCGATGGATGTCCCTTTGTCCTTTCTTTATGGGGCTGTTTTTAGACAGGGGAGAAAGGATCTTAGGGTGTGGAGTCCTAATCTTTCAAAGGAATTTTCGTGTAACTCCTTTTTTTTGTATGCTTATTAGTGCCCTCCCTTAGGCGATTTGGATTTTCTAGCTTTTTTGGAGGATTAAGATTCCTAGGAAAACTAAGTCCAATTGGCGAGTTCTACATGGAAAAGTTAACACTTTCCTTCTTTTTTTCATTTTTTTAAGAAACAGAGGAATATATACATATATAAACAAGAAGAGAACAGCCAAAGACCGAGGGATTGAGGGACCCCCTAATCATGAGAAGGTTGTAATTACAAAATAACTGTGTAGTTTGTACGCCACCAAGAGGCTACATGCTGCATAAGAGCCCCAAAAAGAACCAAAAGAACTAAACTTATCCTGAAACACCTAATATTTCTTCCGAAGATGTCACAAAAGCATGCAGGCTCCACAACTCCATAACATTCTTCCTTTTCCACTAAAGCTGCGTCCGTTCAAACCCTCCATCATTCTATCATCAATCCACAAGGGGAGACATCCATCCATCCACTCCTGTCCCTATGACGAACATAAAAGATACCAAACGTCCAGAAGAAGAACTCGCAAACTCATATCGCCAAAGCAAATGATCCAAATTTTCTTCCGCCTTTCAACCAAGAATACAACATAAAGGACCAATATCCGGACGCACCAGGACATCTTAGTTAGGTTGGCAATTCCTCAAGGGGGGCGAGTAAAGGCATTGAATGTGTCAGCCTTCAAAGAATATACTTTAGATAGACAAATAGAACTGAGTACATCCCTTAATCGCTCAAAGTAATGCACCTGTTTTGGATGTAGCATAATTCAACCAAATAACTATTAGACCAAGAGGGGAATCACCTGACTGTTCTTCGCTTCTTTTACCCAAAAAAAAAGAAAAAAAGAACATTGTATTTTCAGACCAAATACAAAAATACCTTTTACAGATGCAACTCCTTGTCATCATAATGCAAATTAACAAAATAATCTTTTCTGAATATGTTAGCAATAGAATATAGATGAAGAAGTAAACGAAATCACTGAAATCTGACACGAAAACCAAATAACTCTTCAAGGCTAAGAATGAAGGGATGGATATGGGATGGGAGGAGAGAGAAGAGAGAAGAGAGCATTTCCTTTGCTCTCATTCCAGTTGCCTTGATGCAGTCTTATGCTAAATTGCATTTTACAGGGAAGATGATTCGTGAATGATACAAGTTCAATATTTTCTTTACGAAAAGTATATTGTATACTTTGCCCATACCAGATGCTCTTGTTTCTTATGTATATTTTTATGGTCAATATCAAACTTTTATTTATTTTTAAGTAATTATACGAACTTGGAAGTTTATGAGTTCTCTAAATGTTGCAGCTGCTCGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATTGCATCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAATTGTACAGGTACGAGTTGTACCAGCTATTCTTTATCTACTCTCTTTGCCCTTTACCCACTCTTCCCATTTTTCTTCTGAAACTGAAAAAAAATCTCTTCATTGAAATAGATATGAAATGAAATGTAAGGAATCATCAAATGATGATGCAATGGCATAAATAGAGCAATGGTATTTGTAAAACAAACAAACATAGTTAAGAGGAGATACATGCCTACCAATTTAAACTAATATTTTAAATATAGTATTCAGCAAATAATTTGGATAGAGAGCACCAGGAAGAAGTTTTGACCCAAACAATCTCATAACGATCAAACCAAGGGAGAGACTTATTTTGAAAAGTATGCTGGTTCCTTTCAAACAGAATTTTTAAGAGAATCATTTTGACTCTGTTCACCTATAAAAGGCGTGGGATAGCTTTTAACGAAGGCCCCACTAGAAGCTGAATTATATTCTCCTTGAAAACAGTGCTTTACACCACACCTGATGCAGATTAAACATAGCAAATTTTTTTGGAGAAGCTACAATAAAAAAAATATGTGCCGGTAGTCTTCCAGATTTTCCATACCCAGTGGGCAGATTGAGGGCATTAACGAATGGCTTGGGTGCTATCTTTGAAAACACTTGGAGCAATTAAGAGCTCCATAAATCGTTACACAAGTTATGATGTTAACTCTTCAAGGACAGTTTGATTTCCACACAGCTTTGTACAGCTTTGAATCCATGGGAGAAGAGGCCGTCAAATGCCTGGATAAAGATTTAACCAAAAACTGATTTGAAACTTCCAATTTCTGTCTTTAGACCACTTTTTGACCCTCTAATACAACCAATCAATTCTTGAAATCCAGCAGCTCTTAATTCTTCAGAAACCTTCAAAAGTAATAGACCGTGAAACAACATTCAAGTTCCGGTGGTCGGAAACTGATCCGTTAGGAACTGAGGCCATACGACCATGCATCCAATGTCAGGCCCTAAATGGGCTCTATTTGACCCTGCATACAATTTCAGATGCACCTTATCAAACAACTTACTAAGTAGACGTAAATTTTCAGAACTGTTTGATCCAAACCAAAATTTGGTCATTTTTTGTAGGCTAACATTGCCATTTTTCGAACGATTTGGGTTCCACCTAAGTTCACCACTAAGGAGCACCTATCATGGCCCTTAGCTCACTTTTCTGTGCTAGACCTTGCTAGAGATACGATTCCACACTTATCATCCCATAATCATTCAAATAGAGAGGTGCAGGACAAAAAAAAAAAAAAACTATTGGAAGACTAGTTTGGACCAAAAAATAAAAACCGAGTTTGGGGCTTCATTGATCAATTTTTTCTCAAAAAACTTCAACATTCCTTAAAACGAGACCACTTACTTAGTAGATGTAAACTTTTAGGACTTTTTGACCCCGACCAAAATTTGATCATTTTTTTGTAGGCTAATTTTTTGATCATTTTGGGTTCCACCATAGTCCACCACTAAGGAGCACCAATTGCAACCCTTAGATTATAAGGTATTAAATTATCCACAATGTGTACTTTTTTAATTCCAAGTTATAATTAATAATTGTGAAGATTAATCATTGTTCCCCAGGTAAGTTCTCTCTCCTTTCTCCTTTATCCTTTCTCCTCTCCTTTTGTCCTCTACCTGCTTCAACATTTTCAAATATCAGTTTTCCAGCAAGTAAGTTAGAAATGGAGGTGGAAAGATGCAGAGTTCTAGACTCTCCCTATTGTATTTGGTAAGAGAAGGAGTTATTCCAAGTTGAAGATGTGGAAGCAAAAAAACTCTGCCTTTATCAGGTTCCCAACTCTGCTGGTTCCAAGAGGCTGTTGAAGTACTATGTTAACAATCAGAAAAGAAGTACATCTACTAGAAAGAAATTATAAAAATGGCGTAACAGAAGTCTGAAAATTCAGAGTTGGGAAAGGTTGGGTTTTAAGATGCAACTTTTGATTTAGAACAGGAGGAAGATCTTTCATACATATCCTATCGGGCAAGGGAAACAGAGATGGCAAGCGTTTCTGGAAATGATGAAAAGCTTCAAATCCAAGCACTATTATATGACATGGTTTTCAGTCCAGAACAAGTTGAAAAGCGGTCACTCTAAAGGTTTTGGCCGATCACAAGAGAGAAGGATAGTTATGCGGACAAAATTAAGCAGAATAAGACCTTGTTTTGGCCCCGGAAAAACAGAGGAATCGCCAACATTGGATTATCAAGAATCCAAAGGTCTTTCGAGCATATTTCAACAATCTTTGGGTTGTCACATGTCTATTTGAGTTCAATAGCTGGGTCGAGATTGATGAAACTTTACAAACTGCCTTCAAACCACTGTTATCATCAACTCGTTGTTTGCAGAGAATGTTGTTAGATACCTAGATTAGTATAGAGCAGGGTATAAAGGTAAATTAGATATATGGCAGTTAGATGGTTATAAATAGGAAGTTGGGGAGTGATGAGAGAGCATGGAGATTTTAGTAAAGTCAAGGGACTGCTATACTCCTTGAGAGAGAGGGGAAGCAGAAGGCCATGTGCTTTTGTTCTAGTCAATTCATATTGTAATTTCATATTAGATATCAATAAAGAGAAATATTATATTAATGTTCTATCAAATTGGTATCAGAGCAGTGAATCTTGGGAAGGAAATCAACAATGGAGCAAACACAGATTGAAGAAAAACTGGTAGCCTTTGAACAAAAAGTGATTGGGATGAAGAAATAATTAAGCAAAATTTCGGTAATTGAAGAAAATCTGAGATCCTTAACAAAGAGCCTCCGGCGGTTAAGAATTCAAGCCGAGGAAAATCAACAGTTGTTTTTACAATGCATTGAAACCATGGTGAAGGATAAATCGACAATGAGTGAAAGAATCACAACGAATCTACCAGTGATGAAGAGTATAGTTGAAGGAGGAAGTACATCTATGAAAAGGATCGAGAACGAAGGACAAAGGAAGGAAATAGAAACTGAAGAGAAGGTGAGCGAAGAACAAACGGAGAAAATAGAAACTGAAGAGAAAGTGAGCGAAGAACGAATGGAGAAAATAGAAATTGAAGAGAAGGCGAGCGATCGAAATAAGTTCGAATGGTCAGTGTTTAGCGTAGCTTCAGGTTGTTATCGGTCACTGGAGGAACACGATCCATTCAAGAATTGGTTGGAATTAAAACAGAGGCTGATATTTAGTTTTCGATCACTAAGGGAAGAACTTAGCCGCGAAATTTTTTAGCAAACAAGCCGTAAATCATGGAAGATCGACAAGAATTACCGGCGAGACTCGAAACGAAAATCTGAGTGAAGAAAAAGTTGCTGTTCGGAGAAGCTGAAATTGATTGGCTGCGGTGGAGAAGGAAGACACCGGAAGTCGCCTTCTCGGACAACCACCATTTGCCGGCGTTTTCTTCCTCCTTCTCATCCACACGACTGCTGCCGTCTTCTTTTTCCATCGAAACCAGGCAACTTCCGGCGTCTTCCTTCTCCACCGTAGCCAATCAATTTTAGCTTCTCCGAACAACTTTTTCTTCACTCGGATTTTCATTTCGGGTCTCGCCGGTAGTTCTCGCCGATATTCCATGATTTCCTGCTTGTTTGCTAAAATCATCCGCTGCTAAGTTCTTCCCTTAGTGATCGGAAACTAAATATCAGCCTCTGTTTTAATTCCAACCAGTTCTTGAATGAATCGTGTTCCTCCGGTGATCGATAACAACCTGAAGTTACGCTAAACACTGACCATTCAAACTTATTTCGATCGCTCACCTTCTCTTCAATTTCTATTTTCTCCGTTCGTTCTTTGCTCACCTTCTCTTCAGTTTCTATTTCCTTTCTTTGTTCTTCGTTCTCGATCCTTTTCGTAGATGTACTTCCTCCTTCAACTATACTCTTCATCACTGGTAGATTCGTTGTGATTCTTTCACTCATTGTCGATTTATCCTTCACCATGGTTTCAACGCATTGTAAAAACAACTGTTGATTTTCCTTGGCTTGAATTCTCAACCGCTAGAAGAGGCTCTTTGTTAGGGATCTCAGATTTTCTTCAATTGCTGCAATTTTGCTTAATTATTTCTTCATCCCAATCACTTCTTGTTCAAAGGCTTCCAGTTTTTCTTCAATCTGTGTTTGCTCCATTGTTGATTTCCTTCCTAAGATTCACTACTCTGGTACCAATTTGATAGAACATTGATATAATATTTCTCTTTATTGATATCTAATATGAAATTACAATATGAATTGACTAGAACAAAAACACCTGACCTTCTGCTTCCCCTCTCTCTTAAGGAGTATAGCAGTCCCTTGACTTTACTAAAATCTTCATGCCCTCTTTGCACTCCCTAACTTCCTATTTATAACCATCTAACCGCCACATATTTAATTTACCTTTATACCCTTGTTCTATACTAATCTAGATATCTAACAAATGCTTTAATTGATATTGAGCAAAGATCATTGGATGAAGTGATAGAGCAACCCGGGAGATTGCAGGATTTTGGTGCTTTCCACTTAAAATTTGAAAAATGGGATAAGATATTACACAGTATACCACTTTATGCAAGAGGATATGGTGGATTGATCTCGATTAAGAACTTACCTCTAGACTACTGGATTAATCAAACTTTTGAAACTATTAGGGCTTACTTTGGGGGCTTATTAGATATATCCATGGAAACGTTGAAATTTGTAAATGTTTCAAAAGCTAAAATTAAAGTTAAAGAGAATCTTTGTGGGTTCATTCCGGCAACACTTGGAATCAATGTTGAGAAAATAGGTAATATTTTCTTGAACTTCGGTGATATTTCATCAGTCAATCCACCTAGTAAAGTCAGACATTTGTCTATAATTTTTCAAATTCGGTTGATGCTGTTAGATTAAATCAAGTTTTGAAGGATGAAGGGGTTGATTTGTTTATGATCAATTTGAGTTTCTTCTACCAGCAAGGATGCTTGAACAATACACTAGAAAGCCATTTATCCAGTCATAAAAATTCAACTAGTTGAAACGTTAGAAGTCGAAATCTGTTTCACTGGAGCCAGAGAAGGTGGTTGTGCGTGAGGTAAATCCAAAGGTCACTGAATTCTCTGTTGGAATTTGTAAAGGTGCAATCAAAGAAGAGAGAAAGTTCTTATGGGCTCTTCAGTTTTCTTGAATGGGGAAAGCCAATCAATGACTTGCAATCCAAAGGTAGTACTGGATTCTTTGGTGCAGCCACCTGAAATTACTAAGGAAGTCAATCAGTCGTTGTTTAATCAAACAATCTCTTCATTTTTGGAGGGAATATTGAAGCAGAAGTCGAAGAGGTGCTTTTAAAGAAGAAAGAGAGTTTTTCAGCTGCTACTTCTTTATTGCCCACTGAGAACCCAAGAGACACTCGACCTATCTCCTTTGATTTAGTCCAATCGACTAAATTTCAGAAATTTTCCGCTTCTTGTTCTATCGTTAGAAGGTCTCTCTCTCCTACTTCTTCAGCATCAGGTGCCTCTTCTGCGAAGGTCAGGTTAGTGCCAATAGTGCTCCTTCTAAGAAAGCCTCAAGCTTCTTCTCTTCATTTGTGCATTTATCTTCTTCATCCCATGCGTCTTCGGCCATGTCTAGTCTAAGAGGTTTTTTTTGATAAGCATACAAAATCAAAGTAACCCTCGCTTCCCTTTAAGACGTTACCCAAGCATTTTGTTAGAAGGAGGCCGAAGGATCTCTTAAGAAAGGCTTTCGTAAAGACAAAGATGTCAGACCAAAATCCTCAATTTTTAAAGGTAAAAGACATCGAAGGCCAAGCTGCTTCATCTTCAATAAGGTTAGACATCCAGAGTTCAGAAAAGATTGCTTTAAGGGTAGAATGGAGTTTCGATCAAAACCCTGATTCTTTGGAGGTTAGTTGCACATGGATTCAAGTTCCTTCTCTTTCAACCAGATCAAACCCTCGGAGATCCTGCCAACCACATTTTAAATACTTTATTTTTTCCTTACCTTCTTGAAAAGTAAAGTTTTGAGAGGCTCTCCTTGCCAAACTCTAGGCCTTCCATCCAAAAAGAATTGTGATTCAAATTGTGACTCAAGTATTAGTGTGAGCTGTGTCGAAGATGATCATTTGGGGAAGCTAAATGAGTTAGAGGATCATTCTTCGAAGGACCCCTTAGAAACCAATCTCAATGATTTGTTTCAGAAGGAAGAGGAATTGAATGACGTAAAACAGGCCTCTGGAAACCCCTCTTAGTCTAGGCATTGTGAAATTCCAACCCATTTAAAGTCGATAGTAGAAAAATGCAAACTAGTTTTTGTTTGAAATTTAATTTTAGTTGCTTTTTGTTTCAATAATCGGTTTCTTTGTGAGGTTTGAAGGCTGCAGATAAAATGTTTGGGATTCAAGTCTCTTGAACATTTAATTCAGAACTAGTTCTCCCCTTAGAATCATTGAAGAAGTTGGGCTGAAGAATTGTTTCCAATCGATTGAACAGTTAGTATTTTTAGTAGTTCCTGGTTTTCATCCTAGGAAGATTCAGCAAGGATGGTTGTTATTCTCTTGGCATTCCTTTTCACTCATCTCGAAGCTTATTGTGAGCATCATAATTTAGTTTTCGACATTTCGGTTGAGTTGGTTTGTTCGTTTCATCTTTTCTCATTGGGTGCAAGAATGGCTTCTTCTTTTCAAAGAAATTAATGTTACTTTGCTGTTTGTTGGTTTGAAGATTGTTGGCAAGTCTTCTCGGGCTCAAGATCGTTTTTGTCTCCCATTGGTGTTCCATCTACATTATTTTCGAAGCTATTAGTGTGCAATTCTTCCACAAGAATTGGTTTCTTTTTGAAAGAGCATTATTTTGCTTGGTTATTCTTTCTCCTTTTTGGCTTGGTTAACTTTATAGAGGATTTTGGCTTGTTTGTATCGTTATTTTGATTGTCCTTTTAGTCATTTTTCTCCTGATTGCTCATAGTATATTGCTCTCGTACTTTGAGCGTTAGTCTCATTTACTATCATTAATAGAGAGGTTCGTCTCTGTTTCAAAAAGAAAAAATAATAATTGTAATTGTGAAATGTAGATTAGTTAAAAACTTGTACTTGTTTAGATTACTTACCATAAACTGAAGAGTACTATGTGTCTTGGGGAATACTTTGGCCAAATAATATTATAAATATTTTGAGTTAAGCTGTGGAAACATCAAGTTCTAAAATAAATCTTCAAAGCTTGTTGCTATCGAATTTAGACAAAGTCATGTGTAATTCTATAGGTCATGATAGAGATAGATGCTGTTAGATTCATGCGAATGTCTTCCAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATTCGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACACAAATTTTGGGTTATGAAGATTGGTTCGGATCTTCATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATAGCAATATACTACCGAAATAAAATGCTAGAAGACCTTGTAAAGGTATAATTTTTTTTTAGATTTATTTAGTCTATTGCGGTTCTAGTGCCTTCTGGAAATGACTTTATGTTTGGGATGTTTGTTTTTATTATTTATTTATTTTTTAACTTTCCCACTAAACTATTTGGAATAGGAAATAAATTTTTCCTCCCATTTTCAAAACAAAAATAATTTAAATGTTTCCATTAACAGAGGAAAACATTTTTTTATGAAGGAAACAAGCCTTTTCATTGATAAAATGGTAAGAGAATTGAAGCGAAATGAAACAAACTATCAAATATTTCAGCTAAAGACAACCAACATTATCTTCCTAATACTTAAAAACTTCCTTGTAATGGACCAAGTTCTAGTACAGTGGCGAACTCTCACTCTTTCCAAGCGTGTACTTATTCTACCATTCAAATTTGGTCCCCATCCTTTTGAGTGGTTGTCTAGAGAGTGTAAAGGCAATGCACCGGAACCTTTGGAAAGAAATTTCGAAAGAGCTTCCTAAATTTACTCCTTGGGTCCTTTTTGTGGTGGGGGAAGGGAAAACTACTTATTTTTGGGAGGATTTCTGGGTTGGGGATAAACCTTTTTGTGCCTCCTTTCCCACTTTGTATCTTTTCTCTTACCTGAAATATCACTTCATCGCCGACTTCTTAGTATGGTTGGGGAATTCATAATCTTTTCACTTTGGTTTTAGTTGACACCTTACTGATGGGGGAAACGATGGAGGTGGTTACCACTTTACCCTCCTTTCCTTATTGGAGGATCACCACTTTCGTCAAGGTAGAAGGGATGTTAGAATATGGAGCCCCAATCCCATGAATGTCTATTCTTGCAAATCTTTCTTTCAAACCTTGGAGGATTAAGATTTCTAGAAAGGTGAGATTCTTTTCTTGGCTGGTGCAATATGTATGACAGACTCGCTAGGGATTATGTCCCTGTTAGTTTGGCTGGCTCTTTTTGCTCTGTTCTCTTTTACAAGTCAGCGGAAGACTTAGACCATATCCTTTGGAGCTGTGATTATGTGAGTACTGTATGGGATTACTTTGCTCGATGTTTGGTAGGGATACTGCTCGCCCAGTGCAGTAGGTATGATTGAGGAATACCTCCTCAATTCGCTTGTTGAGGAGAAAGGAAAGTTTCTTTGGAGTGCTACTGTATGCAATCTTGTGGTTTGTTTGTCTGCAATCTTTTAATTGTTTGTCTGCAATCTTAGAGTTTGTTTGTCTGCAATCTTGTTTGTTTGTTATCTTCGTTTGGATCTTACTAACCTTAGCTGGGATCCTTTTTTGTAATGTGACTTTCTTGGGGCTTGGCTATTGCCGCCCTTGTAGTTTATAAACCAATAGATTTCACAACTCTTTTTCTTCTGAATTCTTGACTAAAATACACTCCTAGACTTGATTTTGATCTGTACTGATCCGACGTGGTTCATAGTGCGTCAAGTTAATGGTTCTTTTTACTAGGCTCATGTTTTGTGGGTCTTCGCAAACATTTCAAACCTTAGTTTTGTTCTTAAGATCTGAACTAGAGTTCAATTTCTCAAGTGACGAATGGTAAATGATTATCATATATTCCTATGGCCAGCAGGATAAAAAATCAAGAACGGGGCTATGTAGATAAAATATGGAAATAAACAAAGGAGCACTAGGATTTCAATAAACAAGAAAGAGCACCAAAAATCTTTTTAGGAATGATCCATTTGATATGGAACAGGGGTTCAGTTATCTGTTGACCTTATCTTCGTTGCCAAAATAAAGCGTGAGGTTATCCATTGAAACTCTATGTTCTTATGCCCCTTTGATAGTTCCCACTTGAAGAAAGGATAAACAATGGACATTTGGAACCCAGTTGTTTTGTAATTTTAAGGGAGGACTATCACTCCAACAGCGCATCAAACAAAGGAAGAAAAACAGGAAAAAAGGTCTCCTGTTTCCTATGAATTAGAGCTCCACATGAAATTTACTTCACAACTTCGTATAAAATCTAATTCCCTTGCAGGCCATATTTCAAACTTTGATATTAATTTATAGGTAGGTATATTGTTATTGAACTCCATATTGGTAGAACCTTCACTTGGTCCGGCGGTCATCCCTCCTAGACTGAAATAAATGCCCCAAACAAGAAGAGAAATTTCCGTTTGTCTTAGAGGTAGGAATGGTCAATGAGAAAAGAGTGTGTTTGATAACCACTACACTTCAGAATGATTGATTTACACTTCACAATGATTGATTTACAGAAAGATGGTTGATGTATTGGTGCAAAATGGTTGTATTGATGAGTTTTAGATTGTTGTGGATCTGATTGCATTACCCTCTATTCTGTGTTAATCTAAACTTGTCTTCTATAATCAAAAGCTTTTTAAGGATCTCGAAGCCTTTGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTAGCAGATGCTTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTACTCGTAAAGTACAAATACCTTTTTGATGAGAAGCAAGAGTCCATGAAGAAATATAAGAGGGTTTCGTTTGAAAAACCAAAGAGAAAACGAAAATCAACAAAGGGCAGTGAAGACAATAGCAACCTTGTGAAGTCTGAATGAATGAAAGAAACAGGTAAAAATATGAAAGAAATTATGTGTTAATTAACTTATTCATTTGTCTCAGCAACCCTGATTATAATGGTGACGTCTTACGGGTATCAATATTTTGCTTCGGCTTTTGTAAATAATTGGTTTTTATTGTTAGAATTTTAGTTCTCGAACTTGTAGCACTGACATGTATTTAGCCCATAAGCAATTACCCTTTTGAAATCCATATTAATTCGAGAGTATTTGTAGTGAATGATAAAATTAGCTAATTCTGGGCATAGCAAGTTGGTTTTCTTTTTCATCTAGATTTTCTTCAATAGCTTAAGCCGATATTAATG

mRNA sequence

GTATTACCCAAATACTACTCGGGACAGTGGCGCCCTTCACTCCACCATGCTCATCCGGAGGTTTTATAGAGCCGCGGCATGGGCGACACCTCTGTTGCGACACCCAACCGTAGGGAAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGTGCTCTTGGTACTGCACAATGATACAAGATCAAATGTATAAACAGCTTGCTGATAAAGATAGAAAAAATAAGGATGTTGACAATAGTAAAGCTTTGGGGCACATTTCAGAGCAAAATATTGGAGACATTAGAAAGCACAAAATTGGGGAAAATGTTTCACGGAAAGACAAAATTAGTTTTCTTGTAAATACGCTGCTCGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATTGCATCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAATTGTACAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATTCGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACACAAATTTTGGGTTATGAAGATTGGTTCGGATCTTCATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATAGCAATATACTACCGAAATAAAATGCTAGAAGACCTTGTAAAGCTTTTTAAGGATCTCGAAGCCTTTGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTAGCAGATGCTTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTACTCGTAAAGTACAAATACCTTTTTGATGAGAAGCAAGAGTCCATGAAGAAATATAAGAGGGTTTCGTTTGAAAAACCAAAGAGAAAACGAAAATCAACAAAGGGCAGTGAAGACAATAGCAACCTTGTGAAGTCTGAATGAATGAAAGAAACAGGTAAAAATATGAAAGAAATTATGTGTTAATTAACTTATTCATTTGTCTCAGCAACCCTGATTATAATGGTGACGTCTTACGGGTATCAATATTTTGCTTCGGCTTTTGTAAATAATTGGTTTTTATTGTTAGAATTTTAGTTCTCGAACTTGTAGCACTGACATGTATTTAGCCCATAAGCAATTACCCTTTTGAAATCCATATTAATTCGAGAGTATTTGTAGTGAATGATAAAATTAGCTAATTCTGGGCATAGCAAGTTGGTTTTCTTTTTCATCTAGATTTTCTTCAATAGCTTAAGCCGATATTAATG

Coding sequence (CDS)

ATGCTCATCCGGAGGTTTTATAGAGCCGCGGCATGGGCGACACCTCTGTTGCGACACCCAACCGTAGGGAAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGTGCTCTTGGTACTGCACAATGATACAAGATCAAATGTATAAACAGCTTGCTGATAAAGATAGAAAAAATAAGGATGTTGACAATAGTAAAGCTTTGGGGCACATTTCAGAGCAAAATATTGGAGACATTAGAAAGCACAAAATTGGGGAAAATGTTTCACGGAAAGACAAAATTAGTTTTCTTGTAAATACGCTGCTCGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATTGCATCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAATTGTACAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATTCGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACACAAATTTTGGGTTATGAAGATTGGTTCGGATCTTCATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATAGCAATATACTACCGAAATAAAATGCTAGAAGACCTTGTAAAGCTTTTTAAGGATCTCGAAGCCTTTGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTAGCAGATGCTTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTACTCGTAAAGTACAAATACCTTTTTGATGAGAAGCAAGAGTCCATGAAGAAATATAAGAGGGTTTCGTTTGAAAAACCAAAGAGAAAACGAAAATCAACAAAGGGCAGTGAAGACAATAGCAACCTTGTGAAGTCTGAATGA

Protein sequence

MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNKDVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVKSE
Homology
BLAST of IVF0012595 vs. ExPASy Swiss-Prot
Match: Q2V3H0 (Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g18975 PE=2 SV=2)

HSP 1 Score: 142.5 bits (358), Expect = 7.8e-33
Identity = 83/219 (37.90%), Postives = 124/219 (56.62%), Query Frame = 0

Query: 69  GHISEQNIGDIRK--------HKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 128
           G+++  N  +I+K         K  ++     K   LV  L  L + KEAVYGAL+ WVA
Sbjct: 65  GYVATVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVA 124

Query: 129 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 188
           WE +FPI +    L  L K  QWHR++Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E
Sbjct: 125 WEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADE 184

Query: 189 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 248
           A   W M + +   S+P +L   MIA+Y  + + + ++++F D+E   +  PD+   +RV
Sbjct: 185 AESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEEL-KVSPDEDSARRV 244

Query: 249 ADACEMLGLLEEKE----RVLVKYKYL-FDEKQESMKKY 275
           A A   L   E ++    R L +YKY+ F+ ++  +K+Y
Sbjct: 245 ARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRVKRY 282

BLAST of IVF0012595 vs. ExPASy Swiss-Prot
Match: Q8LG95 (Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana OX=3702 GN=EMB1417 PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 2.3e-29
Identity = 68/181 (37.57%), Postives = 104/181 (57.46%), Query Frame = 0

Query: 80  RKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHVLAALEK 139
           R  K  + +    K + ++  +  L + KE VYGALD+++AWE +FP+  +K  L  LE 
Sbjct: 44  RVWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILED 103

Query: 140 EQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQ 199
           E++W +I+QV KWMLSKGQG TM  Y  L+ AL  D+R +EA + W       L   P +
Sbjct: 104 EKEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRK 163

Query: 200 LCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVK 259
               MI+IYY+  M + L ++F D+E  G K P+ +IV  V      L + ++ E+++ K
Sbjct: 164 FFNKMISIYYKRDMHQKLFEVFADMEELGVK-PNVAIVSMVGKVFVKLEMKDKYEKLMKK 223

Query: 260 Y 261
           Y
Sbjct: 224 Y 223

BLAST of IVF0012595 vs. ExPASy TrEMBL
Match: A0A1S3C174 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495459 PE=4 SV=1)

HSP 1 Score: 602.4 bits (1552), Expect = 1.0e-168
Identity = 302/302 (100.00%), Postives = 302/302 (100.00%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK
Sbjct: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA
Sbjct: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK 300
           ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK
Sbjct: 241 ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK 300

Query: 301 SE 303
           SE
Sbjct: 301 SE 302

BLAST of IVF0012595 vs. ExPASy TrEMBL
Match: A0A6J1HUZ4 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467059 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 1.6e-134
Identity = 248/301 (82.39%), Postives = 272/301 (90.37%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+ ADKD  +K
Sbjct: 1   MLIRRFHRAATWATPLLRDTTVGQVMELGVNKLQIGNSCYCTMLQNQMPKRFADKDMTDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DV+NSK L   SE+NIGDIRKH+IGENVSRKDKI+FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFKDLEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKY        K KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKY--------KGKRKSTKGNQDNSDLM 293

BLAST of IVF0012595 vs. ExPASy TrEMBL
Match: A0A6J1HGC4 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463832 PE=4 SV=1)

HSP 1 Score: 485.0 bits (1247), Expect = 2.4e-133
Identity = 246/301 (81.73%), Postives = 270/301 (89.70%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+  DKD  +K
Sbjct: 1   MLIRRFHRAATWATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKDMTDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DV+NSK L   SE+NIGDIRKH+IGENVSRKDKI FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFK+LEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKY        K KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKY--------KGKRKSTKGNQDNSDLM 293

BLAST of IVF0012595 vs. ExPASy TrEMBL
Match: A0A6J1HSN2 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467059 PE=4 SV=1)

HSP 1 Score: 471.5 bits (1212), Expect = 2.7e-129
Identity = 242/301 (80.40%), Postives = 265/301 (88.04%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+ ADKD  +K
Sbjct: 1   MLIRRFHRAATWATPLLRDTTVGQVMELGVNKLQIGNSCYCTMLQNQMPKRFADKDMTDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
                      SE+NIGDIRKH+IGENVSRKDKI+FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  ----------TSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFKDLEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKY        K KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKY--------KGKRKSTKGNQDNSDLM 283

BLAST of IVF0012595 vs. ExPASy TrEMBL
Match: A0A6J1HFH8 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111463832 PE=4 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 3.9e-128
Identity = 240/301 (79.73%), Postives = 263/301 (87.38%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+  DKD  +K
Sbjct: 1   MLIRRFHRAATWATPLLRDTTVGQIMELGVNKLQIGNSCYCTMLQNQMSKRFGDKDMTDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
                      SE+NIGDIRKH+IGENVSRKDKI FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  ----------TSERNIGDIRKHQIGENVSRKDKIDFLVNTLMDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFK+LEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKNLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKY        K KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKY--------KGKRKSTKGNQDNSDLM 283

BLAST of IVF0012595 vs. NCBI nr
Match: XP_008455250.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo])

HSP 1 Score: 601 bits (1550), Expect = 8.82e-217
Identity = 302/302 (100.00%), Postives = 302/302 (100.00%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK
Sbjct: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA
Sbjct: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK 300
           ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK
Sbjct: 241 ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK 300

Query: 301 SE 302
           SE
Sbjct: 301 SE 302

BLAST of IVF0012595 vs. NCBI nr
Match: XP_004136857.2 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sativus] >KAE8645790.1 hypothetical protein Csa_017311 [Cucumis sativus])

HSP 1 Score: 550 bits (1416), Expect = 2.35e-196
Identity = 276/302 (91.39%), Postives = 290/302 (96.03%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRR +RAAAWATPLLRHPTVG+TMELGVSRLQVG S YCT IQDQM +QLADKDRK+K
Sbjct: 1   MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DV++SKALGHISEQNIGDIRKH+IG+N+SRKDKI FLVNTLLDLRDSKEAVYGALDAWVA
Sbjct: 61  DVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIA LKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EE
Sbjct: 121 WEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQ+CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPPDKSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLVK 300
           ADACEMLGLLEEKERVLVKYKYLFDEK+  MKKYKR+SFEK KRKRKSTKG+EDNSNLVK
Sbjct: 241 ADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK 300

Query: 301 SE 302
           SE
Sbjct: 301 SE 302

BLAST of IVF0012595 vs. NCBI nr
Match: XP_038887984.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 518 bits (1333), Expect = 1.08e-183
Identity = 263/303 (86.80%), Postives = 281/303 (92.74%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           ML+RRF+RA AWATPLLR  TVG+ MELGVSRLQVG   YCTMIQDQM KQLA KD KNK
Sbjct: 1   MLVRRFHRATAWATPLLRDLTVGQIMELGVSRLQVGSFCYCTMIQDQMSKQLAVKDIKNK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           D +NSKALG  SEQNIGD+RKH+IG+NV RKDKI+FLVNTLLDLRDSKEAVYGALDAWVA
Sbjct: 61  DFNNSKALGQTSEQNIGDVRKHQIGKNVPRKDKINFLVNTLLDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPI SLKHVL  LEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIGSLKHVLTVLEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACE+LGLLEEKERVL+KYKYLF DEK+ S+KKYKRVSFEK K KRKSTK +EDNSNL+
Sbjct: 241 ADACEILGLLEEKERVLMKYKYLFTDEKEGSIKKYKRVSFEKSKGKRKSTKSTEDNSNLM 300

Query: 301 KSE 302
           K++
Sbjct: 301 KAQ 303

BLAST of IVF0012595 vs. NCBI nr
Match: XP_022967610.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 488 bits (1257), Expect = 3.16e-172
Identity = 248/301 (82.39%), Postives = 272/301 (90.37%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+ ADKD  +K
Sbjct: 1   MLIRRFHRAATWATPLLRDTTVGQVMELGVNKLQIGNSCYCTMLQNQMPKRFADKDMTDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DV+NSK L   SE+NIGDIRKH+IGENVSRKDKI+FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHALAVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFKDLEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKYK         KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYKG--------KRKSTKGNQDNSDLM 293

BLAST of IVF0012595 vs. NCBI nr
Match: XP_023511579.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 486 bits (1252), Expect = 1.82e-171
Identity = 247/301 (82.06%), Postives = 271/301 (90.03%), Query Frame = 0

Query: 1   MLIRRFYRAAAWATPLLRHPTVGKTMELGVSRLQVGCSWYCTMIQDQMYKQLADKDRKNK 60
           MLIRRF+RAA WATPLLR  TVG+ MELGV++LQ+G S YCTM+Q+QM K+ ADKD  +K
Sbjct: 1   MLIRRFHRAATWATPLLRDKTVGQIMELGVNKLQIGNSSYCTMLQNQMSKRFADKDMTDK 60

Query: 61  DVDNSKALGHISEQNIGDIRKHKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 120
           DV+NSK L   SE+NIGDIRKH+IGENVSRKDKI+FLVNTL+DLRDSKEAVYGALDAWVA
Sbjct: 61  DVNNSKPLYQTSERNIGDIRKHQIGENVSRKDKINFLVNTLMDLRDSKEAVYGALDAWVA 120

Query: 121 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180
           WEQDFPIASLKH L  LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE
Sbjct: 121 WEQDFPIASLKHALTVLEKENQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 180

Query: 181 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 240
           AHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLEDLVKLFKDLEAFGRKPP+KSIVQRV
Sbjct: 181 AHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRKPPEKSIVQRV 240

Query: 241 ADACEMLGLLEEKERVLVKYKYLF-DEKQESMKKYKRVSFEKPKRKRKSTKGSEDNSNLV 300
           ADACEMLGL+EEKERVLVKY YLF DEK+ S+KKYK         KRKSTKG++DNS+L+
Sbjct: 241 ADACEMLGLVEEKERVLVKYNYLFTDEKKGSIKKYKG--------KRKSTKGNQDNSDLM 293

BLAST of IVF0012595 vs. TAIR 10
Match: AT1G04590.1 (BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1); Has 111 Blast hits to 111 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 292.0 bits (746), Expect = 5.6e-79
Identity = 152/261 (58.24%), Postives = 191/261 (73.18%), Query Frame = 0

Query: 44  IQDQMYKQLADKDRKNKDVDNSKALGHISEQ----NIGDIRKHKIGENVSRKDKISFLVN 103
           +Q   Y+ +AD     K +  ++     S+     N  + RKH+IGEN+ +KDKI FLVN
Sbjct: 92  VQSMSYQFVADSHSSPKRIVKNEDEEDFSDSSKKGNAENPRKHQIGENIPKKDKIKFLVN 151

Query: 104 TLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQG 163
           TLLD+ D+KEAVYGALDAWVAWE++FPIASLK V+A+LEKE QWHR+VQVIKW+LSKGQG
Sbjct: 152 TLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSKGQG 211

Query: 164 TTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVK 223
            TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  M+ IY+RN ML++LVK
Sbjct: 212 NTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQELVK 271

Query: 224 LFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF------DEKQESMKK 283
           LFKDLE++ RKPPDK IVQ VADA E+LG+L+EKERV+ KY +L       D+   S +K
Sbjct: 272 LFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKYSHLLLGTPSDDKPSRSSRK 331

Query: 284 YKRVSFEKPKRKRKSTKGSED 295
            K+     P+    +T+G+ D
Sbjct: 332 KKKPELRIPE---ATTEGAVD 349

BLAST of IVF0012595 vs. TAIR 10
Match: AT1G04590.2 (BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4); Has 111 Blast hits to 111 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 109; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 286.6 bits (732), Expect = 2.4e-77
Identity = 152/264 (57.58%), Postives = 191/264 (72.35%), Query Frame = 0

Query: 44  IQDQMYKQLADKDRKNKDVDNSKALGHISEQ----NIGDIRKHKIGENVSRKDKISFLVN 103
           +Q   Y+ +AD     K +  ++     S+     N  + RKH+IGEN+ +KDKI FLVN
Sbjct: 92  VQSMSYQFVADSHSSPKRIVKNEDEEDFSDSSKKGNAENPRKHQIGENIPKKDKIKFLVN 151

Query: 104 TLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQG 163
           TLLD+ D+KEAVYGALDAWVAWE++FPIASLK V+A+LEKE QWHR+VQVIKW+LSKGQG
Sbjct: 152 TLLDIEDNKEAVYGALDAWVAWERNFPIASLKIVIASLEKEHQWHRMVQVIKWILSKGQG 211

Query: 164 TTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLV- 223
            TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  M+ IY+RN ML++LV 
Sbjct: 212 NTMGTYGQLIRALDMDRRAEEAHVIWRKKVGNDLHSVPWQLCLQMMRIYFRNNMLQELVK 271

Query: 224 --KLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF------DEKQES 283
             KLFKDLE++ RKPPDK IVQ VADA E+LG+L+EKERV+ KY +L       D+   S
Sbjct: 272 VMKLFKDLESYDRKPPDKHIVQTVADAYELLGMLDEKERVVTKYSHLLLGTPSDDKPSRS 331

Query: 284 MKKYKRVSFEKPKRKRKSTKGSED 295
            +K K+     P+    +T+G+ D
Sbjct: 332 SRKKKKPELRIPE---ATTEGAVD 352

BLAST of IVF0012595 vs. TAIR 10
Match: AT4G18975.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 142.5 bits (358), Expect = 5.5e-34
Identity = 83/219 (37.90%), Postives = 124/219 (56.62%), Query Frame = 0

Query: 69  GHISEQNIGDIRK--------HKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 128
           G+++  N  +I+K         K  ++     K   LV  L  L + KEAVYGAL+ WVA
Sbjct: 65  GYVATVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVA 124

Query: 129 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 188
           WE +FPI +    L  L K  QWHR++Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E
Sbjct: 125 WEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADE 184

Query: 189 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 248
           A   W M + +   S+P +L   MIA+Y  + + + ++++F D+E   +  PD+   +RV
Sbjct: 185 AESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEEL-KVSPDEDSARRV 244

Query: 249 ADACEMLGLLEEKE----RVLVKYKYL-FDEKQESMKKY 275
           A A   L   E ++    R L +YKY+ F+ ++  +K+Y
Sbjct: 245 ARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRVKRY 282

BLAST of IVF0012595 vs. TAIR 10
Match: AT4G18975.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 142.5 bits (358), Expect = 5.5e-34
Identity = 83/219 (37.90%), Postives = 124/219 (56.62%), Query Frame = 0

Query: 69  GHISEQNIGDIRK--------HKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 128
           G+++  N  +I+K         K  ++     K   LV  L  L + KEAVYGAL+ WVA
Sbjct: 38  GYVATVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVA 97

Query: 129 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 188
           WE +FPI +    L  L K  QWHR++Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E
Sbjct: 98  WEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADE 157

Query: 189 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 248
           A   W M + +   S+P +L   MIA+Y  + + + ++++F D+E   +  PD+   +RV
Sbjct: 158 AESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEEL-KVSPDEDSARRV 217

Query: 249 ADACEMLGLLEEKE----RVLVKYKYL-FDEKQESMKKY 275
           A A   L   E ++    R L +YKY+ F+ ++  +K+Y
Sbjct: 218 ARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRVKRY 255

BLAST of IVF0012595 vs. TAIR 10
Match: AT4G18975.3 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 142.5 bits (358), Expect = 5.5e-34
Identity = 83/219 (37.90%), Postives = 124/219 (56.62%), Query Frame = 0

Query: 69  GHISEQNIGDIRK--------HKIGENVSRKDKISFLVNTLLDLRDSKEAVYGALDAWVA 128
           G+++  N  +I+K         K  ++     K   LV  L  L + KEAVYGAL+ WVA
Sbjct: 65  GYVATVNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVA 124

Query: 129 WEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEE 188
           WE +FPI +    L  L K  QWHR++Q+ KWMLSKGQG TM  Y  L+ A DMD RA+E
Sbjct: 125 WEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADE 184

Query: 189 AHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLEDLVKLFKDLEAFGRKPPDKSIVQRV 248
           A   W M + +   S+P +L   MIA+Y  + + + ++++F D+E   +  PD+   +RV
Sbjct: 185 AESLWNMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEEL-KVSPDEDSARRV 244

Query: 249 ADACEMLGLLEEKE----RVLVKYKYL-FDEKQESMKKY 275
           A A   L   E ++    R L +YKY+ F+ ++  +K+Y
Sbjct: 245 ARAFRELNQEENRKLILRRYLSEYKYIYFNGERVRVKRY 282

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q2V3H07.8e-3337.90Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidop... [more]
Q8LG952.3e-2937.57Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A1S3C1741.0e-168100.00pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Cucumis ... [more]
A0A6J1HUZ41.6e-13482.39pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ... [more]
A0A6J1HGC42.4e-13381.73pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ... [more]
A0A6J1HSN22.7e-12980.40pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 ... [more]
A0A6J1HFH83.9e-12879.73pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 ... [more]
Match NameE-valueIdentityDescription
XP_008455250.18.82e-217100.00PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic ... [more]
XP_004136857.22.35e-19691.39pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sa... [more]
XP_038887984.11.08e-18386.80pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ... [more]
XP_022967610.13.16e-17282.39pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ... [more]
XP_023511579.11.82e-17182.06pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ... [more]
Match NameE-valueIdentityDescription
AT1G04590.15.6e-7958.24BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) super... [more]
AT1G04590.22.4e-7757.58BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) super... [more]
AT4G18975.15.5e-3437.90Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G18975.25.5e-3437.90Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G18975.35.5e-3437.90Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 92..277
e-value: 2.9E-9
score: 38.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 272..302
NoneNo IPR availablePANTHERPTHR47603:SF1PPR CONTAINING-LIKE PROTEINcoord: 15..291
NoneNo IPR availablePANTHERPTHR47603PPR CONTAINING-LIKE PROTEINcoord: 15..291

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0012595.1IVF0012595.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding