Cucsa.079810.2 (mRNA) Cucumber (Gy14) v1

Name: Cucsa.079810.2
Type: mRNA
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Pentatricopeptide repeat-containing protein
Location: scaffold00793 : 2660524 .. 2672222 (-)
Sequence length: 3243
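
The location field above can be read programmatically. The following is a minimal Python sketch, not part of the original record: the helper name `parse_location` and the `Location` tuple are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one feature location."""
    scaffold: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold00793 : 2660524 .. 2672222 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    scaffold, start, end, strand = match.groups()
    return Location(scaffold, int(start), int(end), strand)


loc = parse_location("scaffold00793 : 2660524 .. 2672222 (-)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.scaffold, loc.strand, span)
```

Under that assumption the interval spans 11699 positions on the minus strand, which is larger than the listed sequence length of 3243.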
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCCACCTTTCTTCACCCATTCGCATCTGAGCTCTCTCAGTCGCCGTCCGCCGCCGTCGCAGGTCTCAATCGCCCTTCGACGTTCGCCGTCGAAACTGGTTTTTATTGGATTTCAACTGGTCGTCCTCTTCTTTCTTTGAGTTTTGTGGTTACTTCTTCCTTTTGCGACTGTGGATTTTGGGTTTTTATTGGGTGATTGCACACCGTCCAGCTCCACCTTTATCTCTCTCTGTTCTGTCTCCTTAAGAATCAAGTCGTGTGATTGTTAGATTCCCAATTCCACTCCAACATCTATTCGCCATTTCAGGCCTCAATTTTTTCGGACCATTTAGACAAATAGAGAAATGGGGTTGGGTTAAAATGGGGATGCTAAGAGGAAATTAGGATGAACTTAGAGGAGGAAAACAGAATTCAGTGAAGACGATGAATGTGCAGCAACTTCCGAGAAGAGGAAAGAGGGAGGAAGCCTTTAGTTTATTCTAACAGAACAAATCCTATTTCTACACCAACTGTATTCGGCTGCAGAGCAAGAATATTGCACTTATGGCTTTGTTCGCACTGCCATTTCATTCGGAATCTTGTCTTTAATAAGCTTTTGTCGCCGTCTCTGCTGGGAAGACAACAATGCACAGGCATGTAAAATTCTTTCCGGGTCATGGTAGTAGTTACAGATGCCCTAACCTCATTTCTACTTTGCATTTTCCTCAGGGCAAGCTTTCGTTTGGGATCAATAGCTGACTCGATATATAGATTCAAGCCGCACGAACATGTAAGCATTTTTAGTTCGGAAGTATCAAGCATGGCTGGATTTTATAAATTTGTTTTTTTTAAAATCTTGAAAGACAATGAAAAAGTGTAGGGAGCTCACTTGAGAGGTTATCTGATTCAGTTAATGCGTAATTTTTCTTTGAAGAATTGCTAAGCATGGCAGTTGTTCAATGAACTTATATTTGGTTTAGCGTTTGTGCAAAATATTTATTGATTAGTGGCCATCATGGTTTCAACAAAATGTAGTAGGGTCGTGTAGTTGTCTCGTGAGATTAGTTATTGCCTCAAAACTAGCAACATAGGAATTCCTGTTGAATTTTGTGAGTTGTTATTTATTCTCCTTACTACTATCAACAGGTGCGAAAACAGGATGCAAGCAAATTGGTATTCCATCGGGCGCTTCTCATCTCCAGTGGTAATCTATTGAAAATAAGTTGTTTGAATACTTCATGTCTTTCGAAATGAGTGTTTCAATTGTTTTTCTTTGGCATAGTAAATGTAATTGGTCTTATTACATTTGTTTACAAATTACAAATTTGGAAAATAGTTTGTTAATAGGCTATGTTTTCTGTCAGAACATACAGTTAAATTTCTTGTACGACATGAATATATGAATCATAATGGTGTAGTTTTTTGAAATCTTCAACTCATGTGGAAAATGACATATTAATGTTTCAGTGTAGGTAGTGAAATTTGGGGAAATGGAGCAGAGTCAACTGCATTCATGCAGATGCAGATTGTCAATGCACTTCGTCTGGGTGATAGAAGTAGGGCATCCAACCTGCTTATGGTTCTTGGCCAGGAAAAGTTCTCTTTAACTGCAGATAATTTTGTTCGCATTTTGAGTTACTGTGCAAAATCCCCTGATCCGCTGGTAGGTTTTTTGTTCTTTGTTATCCTCTATTGCATCTCAAGGTTATAATTATTATTATTTTAAATTTACGGGTGTTGAGAAAGCAAAATTTGTTTGTATATTTATCCATTATGATTCAATATTACTGTGAATAAGCTAGAAACGGATTATGCTAGCTAGCATTTACCCATTTCAGTGGCTAGACCAGTGAAAAAATGAATTGAAGGTCTAGAGTTTATACTGTACATAAGTATTTATTTGCAACTGTGTGCATTGCATTGTAATTATACTGTATGCACTTGTGAACATTGAATCATAATGTCCCAGCAAAAACATCACGCAGCCACAAGAATCACATAAACACATAAAATGTTTGGA
AGAAGTTAAACTCTTCTTATTGATACTAGTAGAAAGTGGCTATAAATGGGAACTAAAGCTCAGTATTCATAAGACAACTCTTACTCACCCATTTCCTTGATGATACCTGTCCAGTTTCTGTTAAGATTCTAAACCTGAAACTCAATAAGATCTCAACTTACCAAGTAGAGACAATACTGTGCTGTAATGTATTAATAAAACGATCCAAGATTTCCAGACAAACAAGAATACAAAATAACCAGATGACAGGAATAGTAAAAGAAAAATCCAGCTCCTTTAAGAATAGCCGGAAACTTCTCCTAAAAGCTTATAACAACTTTCACCCCAAAATCGAGACCTAGTTTTCATACCTAAAACATTCCTCATTTATGTCTATGTCTCCAGTGGGCCCTCGGGTGTTTCTTTCCCTCTCTCCTTAGCTCTCCCTTGACATATTCCGTGAGTTTCCCTTTTTACCCTTTCTTTTATATGTATGAAGGAATGGAGATCTAACATTACCACACGCCTCGAAATGCACCTTGTCCTTAAGGTGGAAGGAAGGAAATTGTTAGTTCATCAAGTACATTGATTCCCAAGTCTTTTCACTATTCGGAAATCCCTTCCCCTTGTCCGACCATTCATTTACTGCTAATTCTCTACTCCCTCTTACCCATAGAACTGTTTTAGGCCTCAATCGTAATTCGAATTCTTCGGTTACGGCTGGAGGTGTGTATTGAACTCATTGATTCTGCCTCAATTCCAGCTTCAATTGTGACACATGGAACACATCATGGATGCTTGCTTCCGGTGGTAACTCGAGACGATAAGCTACTGCCCATATCTTTCCGGTATAATATGGCCCATAGAACTTTGGTGTTGGTTTTTCACTTCTCTTTCTTGCTAATGATCGTTGCCTGTAAGGTCCCAACTTTAAATACACTTCTTCCCCTTGTTTGAAATTCATCTCTCTTTATGCGAATCTATCTGCTTCTTCATCTTACTTTGTGCAAGGAGAACATCTAGTTTGAACCCTGAAGGATCAGTTTTGCTCACAAAGGTACTGCGATATTGGGTGAAGTCAGTACTTTGGCTAAAAGTGGGGGTTGCCATGTCTTCATTCTTCTTGACAAACCTCCCGTAATATCCCTTCAGCCCCAATAATCCTCTCAATTTTGATTCATTCTTGGGTGTGGTCAATTTATCATAGCTCTGTCCTTCTCCCCATCGGTTTCCACCCTTTTTTGCGCGATCCACTGTCCTAAGTGTTGTATTTTGAAATGGCCGATAACACACTTCTTATTTTCTACACATTTCTTTTAAAGCTTATACGACACAATGACTTGATTCCACTTGCAGTAATCTACACAAATGCACCACCTCCCCTCCTTCTTCTTAACTAAAAACACTTAACTTGAGTAAGGGTTTCTCCTTGGCTTAATGATTTATGCTTGCCACCTTTTGATTTCCAATTTCTCACTCTCCCCTTCTTGAATGTACCCATACCTATAGGGTCTCACGATGATGGTTTTTTTCCTTCCCACACGAGAATTCGATGGTCAGCTTTTGAGGTAGCCTTCTAGGTGTGTCAAAGATATTATCATACTGCTTTAGGAGAGCTTGTATCACGTGCAACCATTCTTCGTCCCCCCCTTCCCGAATCTTGTTTCCAGTCTTCTCTTCTTCCTCAACCTCAATTCACGACCCCTGTTCTTCCTCCTTCCCAACCTTCATCAAGCTTCTCAATGAACACTCAACTGTAGTCAAGGTCGAATTCCCCCTGAGCACAATGGGTTGGTTTTTAATTGCAAATGACTTCGGCGGCCAATGTATCCCCATCAATCCCGTCGAGCACAACCATGTTATTCCTACTACTACATCTCTCTTTCCCAGATTAATGGCTAGGAATTTTACGTGGATGGTTATTTTGAGAAGTTTCAGTTCAGTTCTTCCGCAGATTCCTTCCCCTTCAACTGTCGTAATAACTCCCATAGTTACTCCAAATACCGAACCTCTAGTGATGG
GTAGTTTGAGTTCCTCTACAATCTTTTGTTGTATAAAATTGTGAGTTGCACCATTATTAATCTAAAGGATTACGCTCCTCCCTTTAATCGTTCCTTTCAGCTTAATCGTTCCCATTTTTTCAATTCCGTGGATGGCTTGCAGAGCGATTTCCTTATCATCACCAACCTCCATCATCTCTAGCTCTACAGTTCCGTTCACCCCATGTCGGTATCAAATCCCTCTTCAAAATCCTCTTCCTCGTTTGTGAAAAAAAACATAAACTCCCTATTTTCTCTCACCTTACACCCATGGTCATGCAACCATTTTTCGTTGCACCGTAAACACAACTCCTTATCTAGTCGTGATTGAAGCTTACTATCGAACAACTGCTTGATTGGAGTTTTTTTTTTCGTGTAATTCCCTCACATGGGTATAATAATCTGCCGTGTATGGGTTTCTCTCCCTTTTACCCCGTTTCTCTCCGTGTAATTTTGATGGGTTTTGGAGAATTGGGTTTCCCTCCTACCCAGCCCATGGGCTCCCCATTCACTAAGCGCCAATTTTAGGGCAATTTTTTAGTCATTCATTAGTTTGATCTCCCTCATACCATCTTCCAAAGTTCCCGGATAACGGCTCACCAATTTCGCTTGCAACTCTGGTTCCCGCTCGTCTTGTGTGATCCTAATTAGTCGAACTCGCCAACTTCTTTCGTCGAGAAATTTGAAATGCTCGCACAACTTCTGTTTCAAGTCTTCTTCTACCCATATCATCTTACGATTGTTACTCTGACAGGACAAACTTACTTCATCTTGAGCGAAGTCTACCATGATAACCTTGATTTTCTCCACCACTGTTGAGTCATTGATCTCGAAGAAATACTTATCTTTGCATAGCCCGAATTCTCGATTCACTCCATTGAAGACGGACCTTTCCAACTTCTTATATTTGCTCTTATCCACCGTCTTAGGTTTGGCCATCGAAGGCGTTTCTAACTCCTTTTTCTCTTTTGTCTTGCACACCGAAACGTTAGATGCTTCTAACTCTTCTTTTCTCCTACTTGCCACACACTTATTCATCTCTTCAGCTAAATGGTCCTCACTCTCCTTTTGTCCGAGAATGATTTCCTTGAGGTCTATAACCAGCCTTTCAGTTGATTTGATTCTTTTTTCAAACTTTTTTTGATCCATGGAATGCGTCCTTCCCAGGATGATTGGCTCTCATACTAATTTGTTAAGATTCTAAACCTGAAACTCAGTAAGATCTCAACAAAAGGATCAAATAGAGACAATATTCTGCTGTAATATATTAATAAAATGATCCAAGATTCCCAGACCAACAAGAATACATAATAACAAGGAATATAGATAACCGGATGACAGGGATAGTAAAAGAAAAATCCAGCTCCTTCAAGAATAGCCGGCAACTTCTCCTAAAAGCTTATAACAGCTTTCACCCCAAAATCCAGACCTAGTTTTCATACCCAAAACATTCTTTATTTATATCCATGTTTCTAGTGGGTCCTCAGGTGTTTCTTTCCCTCTCTCCTCAGCTCTCCCTTAGCATATTCCGTGAGTTCCCCTTTTTACCCTTTCTTTATATGTATGAAGTATTGGAGATCTAACAGTTTCCTCCTCTCTTTACATTACTCTGCCACCACAACTAATTCACTCTCCACCTTGAAACTAGGAAATTGCATGTCTTACTAGCATTGGAACACGAATGATGACAGCCTTATTACTTTCTATAAACTGATGAGTGTGCTGAAGCCTAAGGCTATATATATGTGTGTGTGTATTTTCTTGCAGTTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTTCTGAATAACACATGCTCCTTACTTATGATAGAAGCACTCTGTAAAGGGGGTTACTTGGATGAGGTATAAACATATAATCTACACTGAGGTTGTATGCAATCTTTTGGCCACACATCCAGTTGTAGTAATAACAATAATTTTCTCCTGTTGGAATGATACAGGCATTTGGT
CTAATAAATTTCCTAGCAGAAAGTCATGTGATGTTCCCTGCTCTGCCTGCGTACAATTGTTTCTTGAGAGCCTGTGCCATAAGGCAAAGTATGGTTCATGCTAGTCAATGTTTGGATCTTATGGATCACAAAATGGTTGGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGGTCTGCAAGGAGCCTGATTCCTTATCATTTTTATAAACATTATTTCAAGATTTCTTTTGAACTAGTGATCATCATGATGATACTTGGAAGTTCTCTTATGATATTATTTCTACACCAAAATTGGATCAGTCATAGGCATCTCACTGTCAATCTTACTGGTAAAAATAAATTAAAATTTGATGGAGATACGATTCTAAATTTTGAAGTGTATACGTAACATTCCATGCACTTTTGTTTAGTGTGATAAGAACATATTGATACATGAAGTGCAGTACCTTTACAGAATAAAGTGGATCGTGGGATAGGTTCTAGTAGTCATTGATCTCTTCAAAACACTCTCCTGTTTCAATTGCATGAGTCCTTTTGACAGGAAGGTTCTGTTTTAGGAGAATTATTATATCATATATGTACCTATAAGTTAAGATGACCATCCTTTAAAACATATATGTAATGAATGAACACTAATCTTTCTTTTTTCAATCAGCTTGCAGTTTGTCAGAAAAACTTGTCTTCTGTGCATGAAATCTGGAGGGACTTTGTAAAAAATTATAGTCCAAGCGTTTCATCGCTGAGGAAGTTTATATGGTCCTACGCAAGGATGGGAGATGTGAAATCTGCATATACTGCACTGCAAAAGATGGTGACTTTGAATAATGGAGCCGCAGGAAGAAAGTTACAATCTTTGGACATTCCAATACCTTCAAGAACTGAACTTTATCGTTACAATTTTAATTTTGAGGAAAAAGAACCCTCTATTGATGAGTTTTTCTATAAGAAAATGGTCCCCTGGAATGGTGACGTAGGGGGGATTTCTGTTAGTGGTATAAAATGTGGAGAAGTTGAAACTGGTCCATTAACTGTGCCAAACAATCACAAAAGCAGTTTTGTAAGGAAGGTTTTGAGATGGTCTTCCAATGATGTGATGCGTGCATGCTCTCTTGCTGGGAACTGTGGTCTTGCAGAGCAGCTAATGCAACAGGTCTTATTTCTTTCCTTCGAGATCATTTTTATTCAATGATACAGCCAGTTGGTACAAGAAATCTTGAATTGAGATAATATTTTTCATAATGTTATCTCTTGGTACCAAAAACTTTTACTTTTATAAACAAGTAATATATATTGTGTATATATATTTTCATAAAATAATGGGTTTCAAATGCTTTAAATAGTTGCCACTTTAAGTTATTTGGATTTTCTTGTTCTTGCTCAGATTTCATCATTGGTGAAATAATTTTTGTTATTTTATACGTTGTACCTTGTGTGAATATTTTTAGAGTTCATTTAGTTCCGTATTTCGTGTGTACTGATGAGACTCGGACATGTCTTTGAAGATGCATAAACTTGGATTGCAACCGTCATCCCACACATTTGATGGTTTTGTTAGATCAGTTGTCTCAGAGAGAGGTTTCAGTGCTGGCATGGAAATAGTAAGTTATTTGGGCTTTGATTAGATCTTGATTATCATTACAAGCTAATCTTTTGCATTCATTTTTTTATTTGTTTCATTTTTCAAGGAGTTTTTAATTTTTATTATGAGGCCAAATATTTATTACTCCCCTAAACGAAGAAACCCCTCTTTTCCAGTTAAAAGTAATGCAACAGAGGGGATTGGAGCCATATGATTCAACTCTTGCTGCTGTTTCAGTAAGTTGTAGCAAGGCGCTAGAACTTGATTTGGCTGAAGCTCTACTTGAACGACTTTCAGCTTGTCCTTACCCATACCCCTTCAATGCATTTTTTTCTGCATGTGACATGATGGTAAGTTGACCTTTTTTAAGTAAGAAATTGGCCTAAAATGGAGAAAAATGAAAGAATTTTCATGAGCA
TACAAAAAGAGTCCTAAACTACAAGAAAAACTTCATGACAAGAAGCTGATAATTACGAAATAAATCTGGTTAATAGAAGCCAGACACATTAAATCTGCACCCTTCCATGCCTCCTTCCAAGTCCTCTCAACCCCTCTAAAAATCCTATGATTCCTTTTGATTCACTCTCCAGAAAAAACCCTAAACCCTTGCCACCAGACTCTCCATTATTCTCAAAAGGGAGAATTCATTAGAATCTTCCAATTTAACAATTTAAGGCGTAGCTTCTTACTGTAATTGCTTCTCAATTTTCTGAGTTTGGTGTAATTAGACTATTCCTTATATGCTCCAAACGCAAAGCTGATGCTAAAACGCTACTCAGGCTCCTTCTAAATAGTTATATGCCAGCATTAAGTTTCTGACATAATTATTGTAAAAGCCCCTAAATATTTTTAATTCTGACTCATGGTTTGGATTTTTCTTTTGTTTCTTCTGAACTGGGTACTAGTGCATTGTGACACTAGAGGATATAAATTATAAATGCAATCAAGTTTATGGGCTACAAGGGAACTGTCCAATGAAAGATTTTATAGTGATATCTTAGTTTGTCTTCATATGTCCTATAGGACATTATTCGATCCAACTTCGATTGTGGGATCCTTATTGCAGGATCAGCCTGAACGTGCCATGCGTATGCTTGTTAAAATGAAACAAATGAAGGTGGCTCCAGATGTCAGGACCTATGAGCTTCTATATTCTTTATTTGGTAATGTGAATGCTCCATATGAGGAGGGGGACAATTTGTCACAGGTGGATGCTGCCAAAAGGGTACGCATGATAGAGATGGATATGGGAAAACATGGGATACAATATAGTCATTTCTCTATGATGAACTTGGTAAGACGACAGCTATCCTCTCTCTCTCTCTCTCTCTCCTCCTCCCCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCTCTCTCTCTCTCTCTCTCTCCCCCCCCACCACAAAAAGGAATAAAATAAAGAGAGAGAAAAGAACTTGCAGTTGTGTTATGTATTGACTGATGATCCACATCTGACTAGAGAACATTGCAGGAAGTTGTCTACTTCCCTCACAAAATTTCAAGCAGAATTTGGTTCTCTTTTCGAAGTTTCTTTCATTTTTTCCTTGTACTAATTTATTGTTTGTTCAGCACTTTGAGGGAAGTGGATATGGTCACTTTTCAATTGGAGAATTGATTAACATTTAGGAAAAGGTTTTGATGAACTATGACGGATAAATCTGATTTTATGTCTTTTCAGATCCCATTTAGTGCTGTTGTAGAACCATCACTGTTTTTGTTGTACATAGTCCTGATTATGTGTACTAGCTAAGAAAAATAAAATGTGGTGGTTAACTGGCTTAGCCCTTTGAAGTACGATCATTAGCATTTAGCAGTTAGCACTACTATGTAACTAGAAAAGCATTAAGTTCACTCAAATTTATTGGGTGCACGGGTGCACCGTGCAGATTAAACTGTTTTTACTTTTTTACTTTAATTGTTAAATCTTTCTTCACTGCTTATGGAAATATTGTTGTTTCTATTTAACTTTGAGTCCATTTGTTCCTTTAGTTGAAAGCTCTAGGTACAGAGGGGATGAAGAAGGAGGTTCTTCAGTATTTAAATTTGGCAGAGAACCTCTTTTATTACAACAACACTAGTCTGGGGATGCCCGTTTATAACACAGTGTTGCATTTTTTGGTTGATTCCAAGGAAGTAAGTAATTGTGGTCATGCACTCTATTCTATTAAGCTATAGAGTTCCTCATTTGTGTTAAACAGATAGTTCTGT
TCCATATTTTTGCAGACTCACATGGCAATAGAATTATTCAATAATATGAAGCGTTCTGGTTTCTTTCCAGATGCTGCAACATTTGAGATGATGTTAGACTGTTGTAGTGTGATCGGATGCTTGAAATCAGCTTTTGCTCTTCTTTCCTTGATGATTCGCTCAGGGTTTTGTCCACAGATATTAACTTATACGAGCCTAGTAAAGGTACCCAATGAATTTCTTATTGGTAGATACATATATATACATTACTCACTCCATTTATACTTCTAAAGCGTCCATATTGAGTTAACAGATCGTGGGCTTTCAATGAAATCACTTTGTGGACTATTAGCTATGCATTCATGTTTTCAGATTTATATGAGTTGTCAACCTGAATGTTTGCTGATGCTCTCATACACAAGTCTGAACACAAAAAGTCATGTTTTAAAGGGGAAGGGTCTGTAGAGAGGGAGAATTTCTATTAATGTTCTCTTTTGACAATACTATAGTTTGGCTAATTTACAAGTCATTGTACGGTTTGAACCGTCCATGAAATTTTTGTTGGGCTAGTGCTATGGCGGTTGACGCCCATGGTTCGGACTTAATTTATTTTGTATTCATGTATATCTATTCTTGACACAAATGAACCATCCTAGTAGCTTCGAGGTTAACCTACTGGAGGATGCAATGTTAAATTTTCCATAAGTTTCCATATCAATTATTACCTATTAATACTTTCTTATTGTTAGTCTTGTCCTTTATCATTCAAAATATTTAACGTTTGGATTATTCCGATACAGATTGTGTTGGGATTTGAGAGATTTGATGATGCCTTGAACCTTTTGGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTCATTATAATGAATACAATCATGCGGAAAGCTTGTGAAAAGGTACCTCCCATGGCCTTGTGTCACACTTGGTAGTATTTGCTCTTGTTGGTTTTACACATTTTGACATTCTTGGAAGTCTTGGTGTACCTGTAAACCATATTTTGTTATTGAATGCCTTTGTGATTAGTGATACATGTGCTAAACAATTTCATTTTGTTGCTCTTTCTCGAATTGGATATTAGGCAAGGATTGATGTTATTGAGTTTCTCGTTGAGAAGATGAACCGTGAAAAGATCCCACCCGACCCTTCAACTTGTCAAAATGTCTTCTCTACATATGTGAACCTCGGTTATCACAGCACTGCCATGGAAGCACTGCAAGTACTGAGCATGCGCATGTTATTATGCGAAGAAGATGATGCCTCCGTGACAGAATATATGGAAAACTTTGTGCTTGCAGAAGACACCGGAGCCGATTCACGTATTGCGGAGTTCTTCAAATGCTCTAGAGAGTACCTGAGTTTTGCTCTCTTCAACTTGAGATGGTGTGCCATGCTGGGATATCCAGTTTGTTATGCCCCTAACCAAAGTCCATGGGCAATGAGACTTGCAAGTTCCTACGATGGCTACAACAACCTCCTTAGATGAAATCCTCTGTTCTGTCAAGTTGAAATACCCCATCTTTTCACCATATTGGACAAATCATGTTTCTGTAAAAGATCATATTTCTTTTTAGATGTGGAGCAATTGATTCCAACGTGGAATAGGAACAGGAGATATTTAAATTGATGTAACTTTTAATGAGAGAAACTGTCAAGGCATGAGTAAAACAGTCAGCTATTTTTGTTTAGTTTGAGGGTG

mRNA sequence

TTTCCACCTTTCTTCACCCATTCGCATCTGAGCTCTCTCAGTCGCCGTCCGCCGCCGTCGCAGGTCTCAATCGCCCTTCGACGTTCGCCGTCGAAACTGGTTTTTATTGGATTTCAACTGGTCGTCCTCTTCTTTCTTTGAGTTTTGTGGTTACTTCTTCCTTTTGCGACTGTGGATTTTGGGTTTTTATTGGGTGATTGCACACCGTCCAGCTCCACCTTTATCTCTCTCTGTTCTGTCTCCTTAAGAATCAAGTCGTGTGATTGTTAGATTCCCAATTCCACTCCAACATCTATTCGCCATTTCAGGCCTCAATTTTTTCGGACCATTTAGACAAATAGAGAAATGGGGTTGGGTTAAAATGGGGATGCTAAGAGGAAATTAGGATGAACTTAGAGGAGGAAAACAGAATTCAGTGAAGACGATGAATGTGCAGCAACTTCCGAGAAGAGGAAAGAGGGAGGAAGCCTTTAGTTTATTCTAACAGAACAAATCCTATTTCTACACCAACTGTATTCGGCTGCAGAGCAAGAATATTGCACTTATGGCTTTGTTCGCACTGCCATTTCATTCGGAATCTTGTCTTTAATAAGCTTTTGTCGCCGTCTCTGCTGGGAAGACAACAATGCACAGGGCAAGCTTTCGTTTGGGATCAATAGCTGACTCGATATATAGATTCAAGCCGCACGAACATGTGCGAAAACAGGATGCAAGCAAATTGGTATTCCATCGGGCGCTTCTCATCTCCAGTGGTAGTGAAATTTGGGGAAATGGAGCAGAGTCAACTGCATTCATGCAGATGCAGATTGTCAATGCACTTCGTCTGGGTGATAGAAGTAGGGCATCCAACCTGCTTATGGTTCTTGGCCAGGAAAAGTTCTCTTTAACTGCAGATAATTTTGTTCGCATTTTGAGTTACTGTGCAAAATCCCCTGATCCGCTGTTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTTCTGAATAACACATGCTCCTTACTTATGATAGAAGCACTCTGTAAAGGGGGTTACTTGGATGAGGCATTTGGTCTAATAAATTTCCTAGCAGAAAGTCATGTGATGTTCCCTGCTCTGCCTGCGTACAATTGTTTCTTGAGAGCCTGTGCCATAAGGCAAAGTATGGTTCATGCTAGTCAATGTTTGGATCTTATGGATCACAAAATGGTTGGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGCTTGCAGTTTGTCAGAAAAACTTGTCTTCTGTGCATGAAATCTGGAGGGACTTTGTAAAAAATTATAGTCCAAGCGTTTCATCGCTGAGGAAGTTTATATGGTCCTACGCAAGGATGGGAGATGTGAAATCTGCATATACTGCACTGCAAAAGATGGTGACTTTGAATAATGGAGCCGCAGGAAGAAAGTTACAATCTTTGGACATTCCAATACCTTCAAGAACTGAACTTTATCGTTACAATTTTAATTTTGAGGAAAAAGAACCCTCTATTGATGAGTTTTTCTATAAGAAAATGGTCCCCTGGAATGGTGACGTAGGGGGGATTTCTGTTAGTGGTATAAAATGTGGAGAAGTTGAAACTGGTCCATTAACTGTGCCAAACAATCACAAAAGCAGTTTTGTAAGGAAGGTTTTGAGATGGTCTTCCAATGATGTGATGCGTGCATGCTCTCTTGCTGGGAACTGTGGTCTTGCAGAGCAGCTAATGCAACAGATGCATAAACTTGGATTGCAACCGTCATCCCACACATTTGATGGTTTTGTTAGATCAGTTGTCTCAGAGAGAGGTTTCAGTGCTGGCATGGAAATATTAAAAGTAATGCAACAGAGGGGATTGGAGCCATATGATTCAACTCTTGCTGCTGTTTCAGTAAGTTGTAGCAAGGCGCTAGAACTTGATTTGGCTGAAGCTCTACTTGAACGACTTTCAGCTTGTCCTTACCCATACCCCTTCAATGCATTTTTTTCTGCATGTGACATGATGG
ATCAGCCTGAACGTGCCATGCGTATGCTTGTTAAAATGAAACAAATGAAGGTGGCTCCAGATGTCAGGACCTATGAGCTTCTATATTCTTTATTTGGTAATGTGAATGCTCCATATGAGGAGGGGGACAATTTGTCACAGGTGGATGCTGCCAAAAGGGTACGCATGATAGAGATGGATATGGGAAAACATGGGATACAATATAGTCATTTCTCTATGATGAACTTGTTGAAAGCTCTAGGTACAGAGGGGATGAAGAAGGAGGTTCTTCAGTATTTAAATTTGGCAGAGAACCTCTTTTATTACAACAACACTAGTCTGGGGATGCCCGTTTATAACACAGTGTTGCATTTTTTGGTTGATTCCAAGGAAATGCTGCAACATTTGAGATGATGTTAGACTGTTGTAGTGTGATCGGATGCTTGAAATCAGCTTTTGCTCTTCTTTCCTTGATGATTCGCTCAGGGTTTTGTCCACAGATATTAACTTATACGAGCCTAGTAAAGATTGTGTTGGGATTTGAGAGATTTGATGATGCCTTGAACCTTTTGGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTCATTATAATGAATACAATCATGCGGAAAGCTTGTGAAAAGGCAAGGATTGATGTTATTGAGTTTCTCGTTGAGAAGATGAACCGTGAAAAGATCCCACCCGACCCTTCAACTTGTCAAAATGTCTTCTCTACATATGTGAACCTCGGTTATCACAGCACTGCCATGGAAGCACTGCAAGTACTGAGCATGCGCATGTTATTATGCGAAGAAGATGATGCCTCCGTGACAGAATATATGGAAAACTTTGTGCTTGCAGAAGACACCGGAGCCGATTCACGTATTGCGGAGTTCTTCAAATGCTCTAGAGAGTACCTGAGTTTTGCTCTCTTCAACTTGAGATGGTGTGCCATGCTGGGATATCCAGTTTGTTATGCCCCTAACCAAAGTCCATGGGCAATGAGACTTGCAAGTTCCTACGATGGCTACAACAACCTCCTTAGATGAAATCCTCTGTTCTGTCAAGTTGAAATACCCCATCTTTTCACCATATTGGACAAATCATGTTTCTGTAAAAGATCATATTTCTTTTTAGATGTGGAGCAATTGATTCCAACGTGGAATAGGAACAGGAGATATTTAAATTGATGTAACTTTTAATGAGAGAAACTGTCAAGGCATGAGTAAAACAGTCAGCTATTTTTGTTTAGTTTGAGGGTG

Coding sequence (CDS)

ATGCACAGGGCAAGCTTTCGTTTGGGATCAATAGCTGACTCGATATATAGATTCAAGCCGCACGAACATGTGCGAAAACAGGATGCAAGCAAATTGGTATTCCATCGGGCGCTTCTCATCTCCAGTGGTAGTGAAATTTGGGGAAATGGAGCAGAGTCAACTGCATTCATGCAGATGCAGATTGTCAATGCACTTCGTCTGGGTGATAGAAGTAGGGCATCCAACCTGCTTATGGTTCTTGGCCAGGAAAAGTTCTCTTTAACTGCAGATAATTTTGTTCGCATTTTGAGTTACTGTGCAAAATCCCCTGATCCGCTGTTTGTCATGGAGACTTGGAAAATAATGGAAGAAAGAGGAATTTTTCTGAATAACACATGCTCCTTACTTATGATAGAAGCACTCTGTAAAGGGGGTTACTTGGATGAGGCATTTGGTCTAATAAATTTCCTAGCAGAAAGTCATGTGATGTTCCCTGCTCTGCCTGCGTACAATTGTTTCTTGAGAGCCTGTGCCATAAGGCAAAGTATGGTTCATGCTAGTCAATGTTTGGATCTTATGGATCACAAAATGGTTGGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGCTTGCAGTTTGTCAGAAAAACTTGTCTTCTGTGCATGAAATCTGGAGGGACTTTGTAAAAAATTATAGTCCAAGCGTTTCATCGCTGAGGAAGTTTATATGGTCCTACGCAAGGATGGGAGATGTGAAATCTGCATATACTGCACTGCAAAAGATGGTGACTTTGAATAATGGAGCCGCAGGAAGAAAGTTACAATCTTTGGACATTCCAATACCTTCAAGAACTGAACTTTATCGTTACAATTTTAATTTTGAGGAAAAAGAACCCTCTATTGATGAGTTTTTCTATAAGAAAATGGTCCCCTGGAATGGTGACGTAGGGGGGATTTCTGTTAGTGGTATAAAATGTGGAGAAGTTGAAACTGGTCCATTAACTGTGCCAAACAATCACAAAAGCAGTTTTGTAAGGAAGGTTTTGAGATGGTCTTCCAATGATGTGATGCGTGCATGCTCTCTTGCTGGGAACTGTGGTCTTGCAGAGCAGCTAATGCAACAGATGCATAAACTTGGATTGCAACCGTCATCCCACACATTTGATGGTTTTGTTAGATCAGTTGTCTCAGAGAGAGGTTTCAGTGCTGGCATGGAAATATTAAAAGTAATGCAACAGAGGGGATTGGAGCCATATGATTCAACTCTTGCTGCTGTTTCAGTAAGTTGTAGCAAGGCGCTAGAACTTGATTTGGCTGAAGCTCTACTTGAACGACTTTCAGCTTGTCCTTACCCATACCCCTTCAATGCATTTTTTTCTGCATGTGACATGATGGATCAGCCTGAACGTGCCATGCGTATGCTTGTTAAAATGAAACAAATGAAGGTGGCTCCAGATGTCAGGACCTATGAGCTTCTATATTCTTTATTTGGTAATGTGAATGCTCCATATGAGGAGGGGGACAATTTGTCACAGGTGGATGCTGCCAAAAGGGTACGCATGATAGAGATGGATATGGGAAAACATGGGATACAATATAGTCATTTCTCTATGATGAACTTGTTGAAAGCTCTAGGTACAGAGGGGATGAAGAAGGAGGTTCTTCAGTATTTAAATTTGGCAGAGAACCTCTTTTATTACAACAACACTAGTCTGGGGATGCCCGTTTATAACACAGTGTTGCATTTTTTGGTTGATTCCAAGGAAATGCTGCAACATTTGAGATGA

Protein sequence

MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYARMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKVAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLKALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKEMLQHLR*
BLAST of Cucsa.079810.2 vs. Swiss-Prot
Match: PP126_ARATH (Pentatricopeptide repeat-containing protein At1g76280 OS=Arabidopsis thaliana GN=At1g76280 PE=2 SV=2)

HSP 1 Score: 424.9 bits (1091), Expect = 1.5e-117
Identity = 245/546 (44.87%), Positives = 336/546 (61.54%), Query Frame = 1

Query: 52  ESTAFMQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMET 111
           + +  +Q+QIV+ALR G+R  AS LL  L Q  +SL+AD+F  IL YCA+SPDP+FVMET
Sbjct: 59  DESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARSPDPVFVMET 118

Query: 112 WKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACA 171
           + +M ++ I L++   L ++++LC GG+LD+A   I+ + E   + P LP YN FL ACA
Sbjct: 119 YSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVREDDRISPLLPIYNFFLGACA 178

Query: 172 IRQSMVHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSS 231
             +S+ HAS+CL+LMD + VGKN  TY  LLKLAV Q+NLS+V++IW+ +V +Y+  + S
Sbjct: 179 RTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLSTVNDIWKHYVNHYNLDILS 238

Query: 232 LRKFIWSYARMGDVKSAYTALQKMVTL----------NNGAAGRKLQS--LDIPIPSRTE 291
           LR+FIWS+ R+GD+KSAY  LQ MV L          N G    KL S  L IP+PS+ E
Sbjct: 239 LRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRG----KLHSTRLYIPVPSKDE 298

Query: 292 LYRYNFNFEEKEPSIDEFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVR 351
                F F                       G++   + C    +  + +P  H      
Sbjct: 299 TGSEKFAF-----------------------GVTDRIVDCN--SSSKVALPKGHNKILAI 358

Query: 352 KVLRWSSNDVMRACSLAGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGME 411
           +VLRWS NDV+ AC  + N  LAEQLM Q                               
Sbjct: 359 RVLRWSFNDVIHACGQSKNSELAEQLMLQ------------------------------- 418

Query: 412 ILKVMQQRGLEP---YDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACD 471
            LKVMQQ+ L+P     +T+AA    CSKAL++DLAE LL+++S C Y YPFN   +A D
Sbjct: 419 -LKVMQQQNLKPYDSTLATVAAY---CSKALQVDLAEHLLDQISECSYSYPFNNLLAAYD 478

Query: 472 MMDQPERAMRMLVKMKQMKVAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIE 531
            +DQPERA+R+L +MK++K+ PD+RTYELL+SLFGNVNAPYEEG+ LSQVD  KR+  IE
Sbjct: 479 SLDQPERAVRVLARMKELKLRPDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIE 538

Query: 532 MDMGKHGIQYSHFSMMNLLKALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHF 583
           MDM ++G Q+S  S +N+L+ALG EGM  E++++L  AENL  ++N  LG P YN VLH 
Sbjct: 539 MDMMRNGFQHSPISRLNVLRALGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHS 540
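
The percentages in the HSP summary line follow directly from the match counts and the alignment length. The short Python sketch below reproduces the two figures reported for the Swiss-Prot hit above; the function name `percent` is illustrative only and is not taken from any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the Swiss-Prot HSP summary: 245 identical and
# 336 positive-scoring positions over an alignment of 546 columns.
identity = percent(245, 546)
positives = percent(336, 546)
print(f"Identity = {identity}%, positive-scoring = {positives}%")
```

The alignment length (546) counts alignment columns including gaps, so it can differ from the length of either sequence segment.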

BLAST of Cucsa.079810.2 vs. TrEMBL
Match: M5X5C4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022231mg PE=4 SV=1)

HSP 1 Score: 644.0 bits (1660), Expect = 1.7e-181
Identity = 329/535 (61.50%), Positives = 409/535 (76.45%), Query Frame = 1

Query: 57  MQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIME 116
           MQMQIV+ALRLG+R +ASNLL+ LG    SL AD+F+ IL+YCAKSPDPLFVMETW+IM+
Sbjct: 1   MQMQIVDALRLGERGQASNLLLNLGHGNDSLRADDFIYILNYCAKSPDPLFVMETWRIMD 60

Query: 117 ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSM 176
           E+ I LNN CSLLM+++LCKGGYL+EAF LINFL E   + P LP YN FLRACA  QS+
Sbjct: 61  EKEIGLNNICSLLMVQSLCKGGYLEEAFKLINFLGEIPGIHPVLPIYNSFLRACAKMQSI 120

Query: 177 VHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFI 236
            +A+QCLDLM+ +MVGKNE TYSELLKLAV Q+NL + HEIW+D++K YS S+  LRKFI
Sbjct: 121 KNANQCLDLMERQMVGKNEVTYSELLKLAVWQQNLPAAHEIWKDYIKCYSLSIIPLRKFI 180

Query: 237 WSYARMGDVKSAYTALQKMVTL--------NNGAAGRKLQS-LDIPIPSRTELYRYNFNF 296
           WS+ R+GD+KSAY  LQ MV L        N  + G+   S LDIPIPS  EL     + 
Sbjct: 181 WSFTRLGDLKSAYEKLQYMVALAIRGNTYVNRTSEGKLYSSRLDIPIPSICELDLKKLDL 240

Query: 297 EEKEPSIDEFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSN 356
           EE + SI   + + +     +    +  G+  GEVE   + + + H S  V K+LRWS +
Sbjct: 241 EENKHSIPSIYCENLDDHAVNADQCTTFGLGVGEVENVGMDMLDIHISQPVMKILRWSFS 300

Query: 357 DVMRACSLAGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQR 416
           DV+ AC+   N GLAEQL+ QM K GLQPSSHT+DGFVR+V SERGFS+GMEIL++MQQR
Sbjct: 301 DVIHACARLRNGGLAEQLILQMQKFGLQPSSHTYDGFVRAVTSERGFSSGMEILRIMQQR 360

Query: 417 GLEPYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRM 476
            L+PYDSTLA +S+ CSK LELD AEALL ++S C YP+PFNAF +ACD +DQPERA++M
Sbjct: 361 NLKPYDSTLANLSIGCSKVLELDFAEALLVQISECSYPHPFNAFLAACDTVDQPERAVQM 420

Query: 477 LVKMKQMKVAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYS 536
           L KMKQ+KV PD+RTYELL+SLFGNVNAPYEEG+ LSQVDAAKR+  IEMDM ++GIQ+S
Sbjct: 421 LAKMKQLKVVPDIRTYELLFSLFGNVNAPYEEGNMLSQVDAAKRINAIEMDMARYGIQHS 480

Query: 537 HFSMMNLLKALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 583
           + SM NLLKALG EGM +E++QYL++AEN+F  NN  LG P+YNTVLH LV++KE
Sbjct: 481 YLSMKNLLKALGAEGMIRELIQYLDVAENIFCRNNIYLGTPIYNTVLHSLVEAKE 535

BLAST of Cucsa.079810.2 vs. TrEMBL
Match: A0A061FEB6_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 (Fragment) OS=Theobroma cacao GN=TCM_034369 PE=4 SV=1)

HSP 1 Score: 629.4 bits (1622), Expect = 4.4e-177
Identity = 332/592 (56.08%), Positives = 419/592 (70.78%), Query Frame = 1

Query: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAES-TAFMQM 60
           M R    L S+AD++ + K  EH R+    +L   R +   +G    G G ES T  +Q+
Sbjct: 1   MRRVRIPLRSVADTLCKSKSREHGRRNVNGRLELCRTVATINGHVFLGYGEESRTKTLQL 60

Query: 61  QIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERG 120
           QIV+ALRLG+RSRAS LL  LG     L AD+ V IL+YCAKSPDPLF METW+++EE+ 
Sbjct: 61  QIVDALRLGERSRASRLLSDLGDGNQPLKADDIVYILNYCAKSPDPLFFMETWRLIEEKE 120

Query: 121 IFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHA 180
           I LNN C LLM++ALC+GGYL+EA  LI FL E+  ++P L  YN FL ACA  Q+ VHA
Sbjct: 121 IGLNNKCYLLMVQALCRGGYLEEACNLIKFLGENRGIYPFLSIYNSFLGACAKMQTAVHA 180

Query: 181 SQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSY 240
           +QCLDLM+ + VGKNE TYSELLKLAV Q+NLS+VHEIW+D++K+YS ++ SLRKFIWS+
Sbjct: 181 NQCLDLMERQRVGKNEITYSELLKLAVWQQNLSAVHEIWKDYIKHYSLNIISLRKFIWSF 240

Query: 241 ARMGDVKSAYTALQKMVTL--------NNGAAGRKLQS-LDIPIPSRTELYRYNFNFEEK 300
            R+ D+KSAY  LQ MV L        +    GR   S LDIPIPS+ EL        E 
Sbjct: 241 TRLKDLKSAYETLQHMVALAISGKIFVSRTGEGRLYSSRLDIPIPSKGELGSQKVELGEN 300

Query: 301 EPSIDEFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVM 360
           E      F         D    +V   K      G L   NN+K   V KVLRWS +DV+
Sbjct: 301 EQDFALKF---------DTDASNVEICKSVSATVGML---NNYKRMPVMKVLRWSFSDVI 360

Query: 361 RACSLAGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLE 420
            AC+ A +  LAE LM QM  LGLQPSSHT+DGF+R+V+  RGFSAGME+LKVM++R ++
Sbjct: 361 HACAQARDYKLAEHLMVQMQNLGLQPSSHTYDGFMRAVIPTRGFSAGMEMLKVMEERNMK 420

Query: 421 PYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVK 480
           PY ST AA+SV CSKALELDLAEALL+++  CP+PYP+NAF  ACD MDQPERA+R+L K
Sbjct: 421 PYASTFAALSVQCSKALELDLAEALLDQVCECPHPYPYNAFLGACDTMDQPERAIRLLAK 480

Query: 481 MKQMKVAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFS 540
           M+Q K+ PD+RTYELL+SLFGNVNAPYEEGD LSQVD++KR++ IEMDM K+G+Q+SH S
Sbjct: 481 MRQRKLQPDIRTYELLFSLFGNVNAPYEEGDMLSQVDSSKRIKAIEMDMAKNGVQHSHLS 540

Query: 541 MMNLLKALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 583
           M NLLKALG EGM +E+L YL++AE LF + NT +G P+YNTVLH L++++E
Sbjct: 541 MKNLLKALGAEGMTRELLHYLHIAEKLFCHTNTYMGAPIYNTVLHSLIEAEE 580

BLAST of Cucsa.079810.2 vs. TrEMBL
Match: A0A061FEW4_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_034369 PE=4 SV=1)

HSP 1 Score: 629.4 bits (1622), Expect = 4.4e-177
Identity = 332/592 (56.08%), Positives = 419/592 (70.78%), Query Frame = 1

Query: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAES-TAFMQM 60
           M R    L S+AD++ + K  EH R+    +L   R +   +G    G G ES T  +Q+
Sbjct: 1   MRRVRIPLRSVADTLCKSKSREHGRRNVNGRLELCRTVATINGHVFLGYGEESRTKTLQL 60

Query: 61  QIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERG 120
           QIV+ALRLG+RSRAS LL  LG     L AD+ V IL+YCAKSPDPLF METW+++EE+ 
Sbjct: 61  QIVDALRLGERSRASRLLSDLGDGNQPLKADDIVYILNYCAKSPDPLFFMETWRLIEEKE 120

Query: 121 IFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHA 180
           I LNN C LLM++ALC+GGYL+EA  LI FL E+  ++P L  YN FL ACA  Q+ VHA
Sbjct: 121 IGLNNKCYLLMVQALCRGGYLEEACNLIKFLGENRGIYPFLSIYNSFLGACAKMQTAVHA 180

Query: 181 SQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSY 240
           +QCLDLM+ + VGKNE TYSELLKLAV Q+NLS+VHEIW+D++K+YS ++ SLRKFIWS+
Sbjct: 181 NQCLDLMERQRVGKNEITYSELLKLAVWQQNLSAVHEIWKDYIKHYSLNIISLRKFIWSF 240

Query: 241 ARMGDVKSAYTALQKMVTL--------NNGAAGRKLQS-LDIPIPSRTELYRYNFNFEEK 300
            R+ D+KSAY  LQ MV L        +    GR   S LDIPIPS+ EL        E 
Sbjct: 241 TRLKDLKSAYETLQHMVALAISGKIFVSRTGEGRLYSSRLDIPIPSKGELGSQKVELGEN 300

Query: 301 EPSIDEFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVM 360
           E      F         D    +V   K      G L   NN+K   V KVLRWS +DV+
Sbjct: 301 EQDFALKF---------DTDASNVEICKSVSATVGML---NNYKRMPVMKVLRWSFSDVI 360

Query: 361 RACSLAGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLE 420
            AC+ A +  LAE LM QM  LGLQPSSHT+DGF+R+V+  RGFSAGME+LKVM++R ++
Sbjct: 361 HACAQARDYKLAEHLMVQMQNLGLQPSSHTYDGFMRAVIPTRGFSAGMEMLKVMEERNMK 420

Query: 421 PYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVK 480
           PY ST AA+SV CSKALELDLAEALL+++  CP+PYP+NAF  ACD MDQPERA+R+L K
Sbjct: 421 PYASTFAALSVQCSKALELDLAEALLDQVCECPHPYPYNAFLGACDTMDQPERAIRLLAK 480

Query: 481 MKQMKVAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFS 540
           M+Q K+ PD+RTYELL+SLFGNVNAPYEEGD LSQVD++KR++ IEMDM K+G+Q+SH S
Sbjct: 481 MRQRKLQPDIRTYELLFSLFGNVNAPYEEGDMLSQVDSSKRIKAIEMDMAKNGVQHSHLS 540

Query: 541 MMNLLKALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 583
           M NLLKALG EGM +E+L YL++AE LF + NT +G P+YNTVLH L++++E
Sbjct: 541 MKNLLKALGAEGMTRELLHYLHIAEKLFCHTNTYMGAPIYNTVLHSLIEAEE 580

BLAST of Cucsa.079810.2 vs. TrEMBL
Match: A0A0D2R4H9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G118000 PE=4 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 1.7e-176
Identity = 333/588 (56.63%), Positives = 425/588 (72.28%), Query Frame = 1

Query: 8   LGSIADSIYRFKPHEHVRKQD--ASKLVFHRALLISSGSEIWGNGAES-TAFMQMQIVNA 67
           L SIAD++ RFK  E  R +     +L   R++   +G+   G G E  T  +Q+QIV+A
Sbjct: 5   LRSIADTLCRFKSGELERGRGNFIRRLELCRSVATINGNVFLGYGGEPLTKSIQVQIVDA 64

Query: 68  LRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGIFLNN 127
           LRLG+RSRAS+LL+  G    SL A++FV IL+YCA+SPDPLFVMETW++MEE+ I LNN
Sbjct: 65  LRLGERSRASSLLLDFGNGNQSLKANDFVYILNYCARSPDPLFVMETWRLMEEKEIDLNN 124

Query: 128 TCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHASQCLD 187
           TC LLM+ ALC+GGYL+EA   + FL E+H  +P LP YNCFL ACA  +S++HA+QCLD
Sbjct: 125 TCYLLMVRALCRGGYLEEACKFMKFLRENHGTYPLLPVYNCFLGACAKMKSIIHANQCLD 184

Query: 188 LMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYARMGD 247
           LM+ + VGKNE TYS LLKLAV Q++LS+V EIW D++K+YS ++ SLR+FIWS+ R+ D
Sbjct: 185 LMELQRVGKNEITYSVLLKLAVWQQDLSAVREIWEDYIKHYSLNIISLRRFIWSFTRLKD 244

Query: 248 VKSAYTALQKMVTL--------NNGAAGRKLQS-LDIPIPSRTELYRYNFNFEEKEPSID 307
           +KSAY  LQ MV L        +    GR   S LDIPIPS++EL   N    E E S+ 
Sbjct: 245 LKSAYETLQHMVALAISGKHFVSRTDEGRLYSSRLDIPIPSKSELGSQNVQSGENEQSLA 304

Query: 308 EFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSL 367
             F         D+   ++   K      G L   NN+K+  V +VLR S NDV+ AC+ 
Sbjct: 305 FKF---------DIDSSNIERSKSISATVGML---NNYKNLPVMEVLRLSINDVLHACAQ 364

Query: 368 AGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDST 427
           A   GLAEQLM  M  LGLQPSSHT+DGFVR+++  RGF AGME+LKVM++R L+P+DST
Sbjct: 365 ARAYGLAEQLMMLMQNLGLQPSSHTYDGFVRAIIQRRGFGAGMEMLKVMEERNLKPHDST 424

Query: 428 LAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMK 487
           LAA+SV CSKALELDLAEALLE++  CPYPYPFNAF  ACD MDQP+RA+R+L KM+Q+K
Sbjct: 425 LAALSVQCSKALELDLAEALLEQVCECPYPYPFNAFLEACDNMDQPKRALRILAKMRQLK 484

Query: 488 VAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLL 547
           + PD+RTYELL+S+FGNVNAPYEEG+ LS VD+ KR+  IEMDM K+G+Q+SH SM NLL
Sbjct: 485 LQPDIRTYELLFSMFGNVNAPYEEGNRLSHVDSRKRINAIEMDMAKNGVQHSHLSMKNLL 544

Query: 548 KALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKEM 584
           KALG EGM  E+LQYL++AENLF + NT LG P+YN VLH LV++ E+
Sbjct: 545 KALGAEGMTIELLQYLHVAENLFCHTNTKLGAPMYNVVLHSLVEANEV 580

BLAST of Cucsa.079810.2 vs. TrEMBL
Match: A0A0D2QBL1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_002G118000 PE=4 SV=1)

HSP 1 Score: 627.1 bits (1616), Expect = 2.2e-176
Identity = 333/587 (56.73%), Positives = 424/587 (72.23%), Query Frame = 1

Query: 8   LGSIADSIYRFKPHEHVRKQD--ASKLVFHRALLISSGSEIWGNGAES-TAFMQMQIVNA 67
           L SIAD++ RFK  E  R +     +L   R++   +G+   G G E  T  +Q+QIV+A
Sbjct: 5   LRSIADTLCRFKSGELERGRGNFIRRLELCRSVATINGNVFLGYGGEPLTKSIQVQIVDA 64

Query: 68  LRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGIFLNN 127
           LRLG+RSRAS+LL+  G    SL A++FV IL+YCA+SPDPLFVMETW++MEE+ I LNN
Sbjct: 65  LRLGERSRASSLLLDFGNGNQSLKANDFVYILNYCARSPDPLFVMETWRLMEEKEIDLNN 124

Query: 128 TCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHASQCLD 187
           TC LLM+ ALC+GGYL+EA   + FL E+H  +P LP YNCFL ACA  +S++HA+QCLD
Sbjct: 125 TCYLLMVRALCRGGYLEEACKFMKFLRENHGTYPLLPVYNCFLGACAKMKSIIHANQCLD 184

Query: 188 LMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYARMGD 247
           LM+ + VGKNE TYS LLKLAV Q++LS+V EIW D++K+YS ++ SLR+FIWS+ R+ D
Sbjct: 185 LMELQRVGKNEITYSVLLKLAVWQQDLSAVREIWEDYIKHYSLNIISLRRFIWSFTRLKD 244

Query: 248 VKSAYTALQKMVTL--------NNGAAGRKLQS-LDIPIPSRTELYRYNFNFEEKEPSID 307
           +KSAY  LQ MV L        +    GR   S LDIPIPS++EL   N    E E S+ 
Sbjct: 245 LKSAYETLQHMVALAISGKHFVSRTDEGRLYSSRLDIPIPSKSELGSQNVQSGENEQSLA 304

Query: 308 EFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSL 367
             F         D+   ++   K      G L   NN+K+  V +VLR S NDV+ AC+ 
Sbjct: 305 FKF---------DIDSSNIERSKSISATVGML---NNYKNLPVMEVLRLSINDVLHACAQ 364

Query: 368 AGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDST 427
           A   GLAEQLM  M  LGLQPSSHT+DGFVR+++  RGF AGME+LKVM++R L+P+DST
Sbjct: 365 ARAYGLAEQLMMLMQNLGLQPSSHTYDGFVRAIIQRRGFGAGMEMLKVMEERNLKPHDST 424

Query: 428 LAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMK 487
           LAA+SV CSKALELDLAEALLE++  CPYPYPFNAF  ACD MDQP+RA+R+L KM+Q+K
Sbjct: 425 LAALSVQCSKALELDLAEALLEQVCECPYPYPFNAFLEACDNMDQPKRALRILAKMRQLK 484

Query: 488 VAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLL 547
           + PD+RTYELL+S+FGNVNAPYEEG+ LS VD+ KR+  IEMDM K+G+Q+SH SM NLL
Sbjct: 485 LQPDIRTYELLFSMFGNVNAPYEEGNRLSHVDSRKRINAIEMDMAKNGVQHSHLSMKNLL 544

Query: 548 KALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 583
           KALG EGM  E+LQYL++AENLF + NT LG P+YN VLH LV++ E
Sbjct: 545 KALGAEGMTIELLQYLHVAENLFCHTNTKLGAPMYNVVLHSLVEANE 579
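
The percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line as a consistency check. The function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 333/587 (56.73%)".
# The label is captured generically, so any spelling of the field name is accepted.
HSP_STAT = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_stats(line):
    """Return {label: (matches, length, recomputed_percent, reported_percent)}."""
    stats = {}
    for label, matches, length, reported in HSP_STAT.findall(line):
        matches, length = int(matches), int(length)
        recomputed = round(100 * matches / length, 2)
        stats[label] = (matches, length, recomputed, float(reported))
    return stats


# Header line of the HSP shown above (hypothetical variable name).
header = "Identity = 333/587 (56.73%), Positives = 424/587 (72.23%), Query Frame = 1"
stats = check_hsp_stats(header)
for label, (matches, length, recomputed, reported) in stats.items():
    print(label, matches, length, recomputed, reported)
```

"Query Frame = 1" is skipped because it has no count/length pair.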

BLAST of Cucsa.079810.2 vs. TAIR10
Match: AT1G76280.3 (AT1G76280.3 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 485.0 bits (1247), Expect = 6.8e-137
Identity = 262/543 (48.25%), Positives = 357/543 (65.75%), Query Frame = 1

Query: 52  ESTAFMQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMET 111
           + +  +Q+QIV+ALR G+R  AS LL  L Q  +SL+AD+F  IL YCA+SPDP+    T
Sbjct: 59  DESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARSPDPV----T 118

Query: 112 WKIMEERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACA 171
           + +M ++ I L++   L ++++LC GG+LD+A   I+ + E   + P LP YN FL ACA
Sbjct: 119 YSVMCKKEISLDSRSLLFIVKSLCNGGHLDKASEFIHAVREDDRISPLLPIYNFFLGACA 178

Query: 172 IRQSMVHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSS 231
             +S+ HAS+CL+LMD + VGKN  TY  LLKLAV Q+NLS+V++IW+ +V +Y+  + S
Sbjct: 179 RTRSVYHASKCLELMDQRRVGKNGITYVALLKLAVFQRNLSTVNDIWKHYVNHYNLDILS 238

Query: 232 LRKFIWSYARMGDVKSAYTALQKMVTL----------NNGAAGRKLQS--LDIPIPSRTE 291
           LR+FIWS+ R+GD+KSAY  LQ MV L          N G    KL S  L IP+PS+ E
Sbjct: 239 LRRFIWSFTRLGDLKSAYELLQHMVYLALRGEFFVKSNRG----KLHSTRLYIPVPSKDE 298

Query: 292 LYRYNFNFEEKEPSIDEFFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVR 351
                F F                       G++   + C    +  + +P  H      
Sbjct: 299 TGSEKFAF-----------------------GVTDRIVDCNS--SSKVALPKGHNKILAI 358

Query: 352 KVLRWSSNDVMRACSLAGNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGME 411
           +VLRWS NDV+ AC  + N  LAEQLM QM  LGL PSSHT+DGF+R+V    G+  GM 
Sbjct: 359 RVLRWSFNDVIHACGQSKNSELAEQLMLQMQNLGLLPSSHTYDGFIRAVAFPEGYEYGMT 418

Query: 412 ILKVMQQRGLEPYDSTLAAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMD 471
           +LKVMQQ+ L+PYDSTLA V+  CSKAL++DLAE LL+++S C Y YPFN   +A D +D
Sbjct: 419 LLKVMQQQNLKPYDSTLATVAAYCSKALQVDLAEHLLDQISECSYSYPFNNLLAAYDSLD 478

Query: 472 QPERAMRMLVKMKQMKVAPDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDM 531
           QPERA+R+L +MK++K+ PD+RTYELL+SLFGNVNAPYEEG+ LSQVD  KR+  IEMDM
Sbjct: 479 QPERAVRVLARMKELKLRPDMRTYELLFSLFGNVNAPYEEGNMLSQVDCCKRINAIEMDM 538

Query: 532 GKHGIQYSHFSMMNLLKALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVD 583
            ++G Q+S  S +N+L+ALG EGM  E++++L  AENL  ++N  LG P YN VLH L++
Sbjct: 539 MRNGFQHSPISRLNVLRALGAEGMVNEMIRHLQKAENLSAHSNMYLGTPTYNIVLHSLLE 568
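
The Identity count in an HSP header is the number of alignment columns where the query and subject residues are the same. As a minimal sketch, the code below counts identical and gapped columns for the first 60-column block of the TAIR10 alignment above. The function name `column_stats` is an assumption made for illustration. Positives are not computed, since they depend on the substitution matrix used for the search, which this page does not state.

```python
def column_stats(query, sbjct):
    """Count identical and gapped columns in one aligned block."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # A column is gapped when either row carries '-'.
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, sbjct))
    return identical, gaps, len(query)


# First block of the alignment against AT1G76280.3, copied from above.
query = "ESTAFMQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMET"
sbjct = "DESKILQLQIVDALRSGERQGASALLFKLIQGNYSLSADDFHDILYYCARSPDPV----T"

identical, gaps, columns = column_stats(query, sbjct)
print(identical, gaps, columns)
```

Summing the identical-column counts over every block of the HSP should reproduce the numerator reported in its header.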

BLAST of Cucsa.079810.2 vs. NCBI nr
Match: gi|778726439|ref|XP_004139754.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucumis sativus])

HSP 1 Score: 1168.7 bits (3022), Expect = 0.0e+00
Identity = 582/582 (100.00%), Positives = 582/582 (100.00%), Query Frame = 1

Query: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ 60
           MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ
Sbjct: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ 60

Query: 61  IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120
           IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI
Sbjct: 61  IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120

Query: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS 180
           FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS
Sbjct: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS 180

Query: 181 QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA 240
           QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA
Sbjct: 181 QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA 240

Query: 241 RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK 300
           RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK
Sbjct: 241 RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK 300

Query: 301 KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG 360
           KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG
Sbjct: 301 KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG 360

Query: 361 LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVS 420
           LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVS
Sbjct: 361 LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVS 420

Query: 421 VSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKVAPDV 480
           VSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKVAPDV
Sbjct: 421 VSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKVAPDV 480

Query: 481 RTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLKALGT 540
           RTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLKALGT
Sbjct: 481 RTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLKALGT 540

Query: 541 EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 583
           EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE
Sbjct: 541 EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 582

BLAST of Cucsa.079810.2 vs. NCBI nr
Match: gi|659123018|ref|XP_008461447.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucumis melo])

HSP 1 Score: 1086.6 bits (2809), Expect = 0.0e+00
Identity = 542/583 (92.97%), Positives = 557/583 (95.54%), Query Frame = 1

Query: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ 60
           MHRASFRLGSIADSIYRFKPHE VRKQDASKLVFHRALLIS GSEIWGNGAESTAFMQ+Q
Sbjct: 1   MHRASFRLGSIADSIYRFKPHELVRKQDASKLVFHRALLISKGSEIWGNGAESTAFMQIQ 60

Query: 61  IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120
           IV+ALRLGDRS+ASNLLMVLGQEK SLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI
Sbjct: 61  IVDALRLGDRSKASNLLMVLGQEKCSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120

Query: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS 180
           FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFP LP YNCFLRACAIRQS VHAS
Sbjct: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACAIRQSTVHAS 180

Query: 181 QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA 240
           QCLDLMDH+MVGKNEATYSELLKLAVCQ+N SSVHEIW DFVKNYSPSVSSLRKFIWS+A
Sbjct: 181 QCLDLMDHRMVGKNEATYSELLKLAVCQENSSSVHEIWTDFVKNYSPSVSSLRKFIWSFA 240

Query: 241 RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK 300
           R+GD+ SAYTALQKMV L  GA GRKLQSLDIPIP RTE Y  NFNFEEKEPSIDEFF K
Sbjct: 241 RLGDLTSAYTALQKMVALATGATGRKLQSLDIPIPLRTEFYHNNFNFEEKEPSIDEFFCK 300

Query: 301 KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG 360
           KMVPWNGDVGGISV+ +KCG  ETGPLTVPNNH+SSFVRKVLRWSSNDVMR+CSLAGNCG
Sbjct: 301 KMVPWNGDVGGISVNDMKCG--ETGPLTVPNNHRSSFVRKVLRWSSNDVMRSCSLAGNCG 360

Query: 361 LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVS 420
           LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVS
Sbjct: 361 LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTLAAVS 420

Query: 421 VSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKVAPDV 480
           VSCSKALELDLAEALLERLSACPYPYPFNAF SAC +MDQPERAMRMLVKMKQMKV PDV
Sbjct: 421 VSCSKALELDLAEALLERLSACPYPYPFNAFLSACGVMDQPERAMRMLVKMKQMKVVPDV 480

Query: 481 RTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLKALGT 540
           RTYELLYSLFGNVNAPYEEGD LSQVDAAKR+RMIEMDMGKHGIQYSHFSMMNLLKALG 
Sbjct: 481 RTYELLYSLFGNVNAPYEEGDKLSQVDAAKRIRMIEMDMGKHGIQYSHFSMMNLLKALGA 540

Query: 541 EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKEM 584
           EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE+
Sbjct: 541 EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKEI 581

BLAST of Cucsa.079810.2 vs. NCBI nr
Match: gi|778726443|ref|XP_011659099.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cucumis sativus])

HSP 1 Score: 1057.4 bits (2733), Expect = 9.4e-306
Identity = 526/526 (100.00%), Positives = 526/526 (100.00%), Query Frame = 1

Query: 57  MQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIME 116
           MQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIME
Sbjct: 1   MQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIME 60

Query: 117 ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSM 176
           ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSM
Sbjct: 61  ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSM 120

Query: 177 VHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFI 236
           VHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFI
Sbjct: 121 VHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFI 180

Query: 237 WSYARMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDE 296
           WSYARMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDE
Sbjct: 181 WSYARMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDE 240

Query: 297 FFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLA 356
           FFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLA
Sbjct: 241 FFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLA 300

Query: 357 GNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTL 416
           GNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTL
Sbjct: 301 GNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTL 360

Query: 417 AAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKV 476
           AAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKV
Sbjct: 361 AAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKV 420

Query: 477 APDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLK 536
           APDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLK
Sbjct: 421 APDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLK 480

Query: 537 ALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 583
           ALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE
Sbjct: 481 ALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE 526

BLAST of Cucsa.079810.2 vs. NCBI nr
Match: gi|659123020|ref|XP_008461448.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cucumis melo])

HSP 1 Score: 981.1 bits (2535), Expect = 8.5e-283
Identity = 488/527 (92.60%), Positives = 503/527 (95.45%), Query Frame = 1

Query: 57  MQMQIVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIME 116
           MQ+QIV+ALRLGDRS+ASNLLMVLGQEK SLTADNFVRILSYCAKSPDPLFVMETWKIME
Sbjct: 1   MQIQIVDALRLGDRSKASNLLMVLGQEKCSLTADNFVRILSYCAKSPDPLFVMETWKIME 60

Query: 117 ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSM 176
           ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFP LP YNCFLRACAIRQS 
Sbjct: 61  ERGIFLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPVLPVYNCFLRACAIRQST 120

Query: 177 VHASQCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFI 236
           VHASQCLDLMDH+MVGKNEATYSELLKLAVCQ+N SSVHEIW DFVKNYSPSVSSLRKFI
Sbjct: 121 VHASQCLDLMDHRMVGKNEATYSELLKLAVCQENSSSVHEIWTDFVKNYSPSVSSLRKFI 180

Query: 237 WSYARMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDE 296
           WS+AR+GD+ SAYTALQKMV L  GA GRKLQSLDIPIP RTE Y  NFNFEEKEPSIDE
Sbjct: 181 WSFARLGDLTSAYTALQKMVALATGATGRKLQSLDIPIPLRTEFYHNNFNFEEKEPSIDE 240

Query: 297 FFYKKMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLA 356
           FF KKMVPWNGDVGGISV+ +KCG  ETGPLTVPNNH+SSFVRKVLRWSSNDVMR+CSLA
Sbjct: 241 FFCKKMVPWNGDVGGISVNDMKCG--ETGPLTVPNNHRSSFVRKVLRWSSNDVMRSCSLA 300

Query: 357 GNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTL 416
           GNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTL
Sbjct: 301 GNCGLAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEILKVMQQRGLEPYDSTL 360

Query: 417 AAVSVSCSKALELDLAEALLERLSACPYPYPFNAFFSACDMMDQPERAMRMLVKMKQMKV 476
           AAVSVSCSKALELDLAEALLERLSACPYPYPFNAF SAC +MDQPERAMRMLVKMKQMKV
Sbjct: 361 AAVSVSCSKALELDLAEALLERLSACPYPYPFNAFLSACGVMDQPERAMRMLVKMKQMKV 420

Query: 477 APDVRTYELLYSLFGNVNAPYEEGDNLSQVDAAKRVRMIEMDMGKHGIQYSHFSMMNLLK 536
            PDVRTYELLYSLFGNVNAPYEEGD LSQVDAAKR+RMIEMDMGKHGIQYSHFSMMNLLK
Sbjct: 421 VPDVRTYELLYSLFGNVNAPYEEGDKLSQVDAAKRIRMIEMDMGKHGIQYSHFSMMNLLK 480

Query: 537 ALGTEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKEM 584
           ALG EGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKE+
Sbjct: 481 ALGAEGMKKEVLQYLNLAENLFYYNNTSLGMPVYNTVLHFLVDSKEI 525

BLAST of Cucsa.079810.2 vs. NCBI nr
Match: gi|778726446|ref|XP_011659100.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X3 [Cucumis sativus])

HSP 1 Score: 807.7 bits (2085), Expect = 1.3e-230
Identity = 400/400 (100.00%), Positives = 400/400 (100.00%), Query Frame = 1

Query: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ 60
           MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ
Sbjct: 1   MHRASFRLGSIADSIYRFKPHEHVRKQDASKLVFHRALLISSGSEIWGNGAESTAFMQMQ 60

Query: 61  IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120
           IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI
Sbjct: 61  IVNALRLGDRSRASNLLMVLGQEKFSLTADNFVRILSYCAKSPDPLFVMETWKIMEERGI 120

Query: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS 180
           FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS
Sbjct: 121 FLNNTCSLLMIEALCKGGYLDEAFGLINFLAESHVMFPALPAYNCFLRACAIRQSMVHAS 180

Query: 181 QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA 240
           QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA
Sbjct: 181 QCLDLMDHKMVGKNEATYSELLKLAVCQKNLSSVHEIWRDFVKNYSPSVSSLRKFIWSYA 240

Query: 241 RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK 300
           RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK
Sbjct: 241 RMGDVKSAYTALQKMVTLNNGAAGRKLQSLDIPIPSRTELYRYNFNFEEKEPSIDEFFYK 300

Query: 301 KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG 360
           KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG
Sbjct: 301 KMVPWNGDVGGISVSGIKCGEVETGPLTVPNNHKSSFVRKVLRWSSNDVMRACSLAGNCG 360

Query: 361 LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEI 401
           LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEI
Sbjct: 361 LAEQLMQQMHKLGLQPSSHTFDGFVRSVVSERGFSAGMEI 400
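
In standard BLAST statistics the bit score S' and the E-value are related by E ≈ m·n·2^(−S'), where m·n is the effective search space. As a rough sketch, the code below rearranges that relation to estimate log2(m·n) from the values reported for three of the NCBI nr hits above. The function name is hypothetical, and the estimate ignores the length adjustments BLAST applies, so it is only an approximate consistency check. Hits reported with Expect = 0.0e+00 cannot be used, because the logarithm of zero is undefined.

```python
import math


def implied_log2_search_space(bit_score, e_value):
    """Rearranged E = m*n * 2**(-S'): log2(m*n) = S' + log2(E)."""
    return bit_score + math.log2(e_value)


# (bit score, E-value) pairs copied from the NCBI nr HSP headers above.
hits = {
    "XP_011659099.1": (1057.4, 9.4e-306),
    "XP_008461448.1": (981.1, 8.5e-283),
    "XP_011659100.1": (807.7, 1.3e-230),
}

log2_space = {
    accession: implied_log2_search_space(score, e_value)
    for accession, (score, e_value) in hits.items()
}
for accession, value in log2_space.items():
    print(accession, round(value, 2))
```

Hits from the same database search should give similar values. A markedly different value would suggest the score and E-value came from different searches or were mis-transcribed.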

The following BLAST results are available for this feature:
Match Name     E-value    Identity  Description
PP126_ARATH    1.5e-117   44.87     Pentatricopeptide repeat-containing protein At1g76280 OS=Arabidopsis thaliana GN...

Match Name          E-value    Identity  Description
M5X5C4_PRUPE        1.7e-181   61.50     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022231mg PE=4 SV=1
A0A061FEB6_THECC    4.4e-177   56.08     Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 2 (Fra...
A0A061FEW4_THECC    4.4e-177   56.08     Tetratricopeptide repeat (TPR)-like superfamily protein, putative isoform 1 OS=T...
A0A0D2R4H9_GOSRA    1.7e-176   56.63     Uncharacterized protein OS=Gossypium raimondii GN=B456_002G118000 PE=4 SV=1
A0A0D2QBL1_GOSRA    2.2e-176   56.73     Uncharacterized protein OS=Gossypium raimondii GN=B456_002G118000 PE=4 SV=1

Match Name     E-value    Identity  Description
AT1G76280.3    6.8e-137   48.25     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                          E-value    Identity  Description
gi|778726439|ref|XP_004139754.2|    0.0e+00    100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cuc...
gi|659123018|ref|XP_008461447.1|    0.0e+00    92.97     PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cuc...
gi|778726443|ref|XP_011659099.1|    9.4e-306   100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cuc...
gi|659123020|ref|XP_008461448.1|    8.5e-283   92.60     PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cuc...
gi|778726446|ref|XP_011659100.1|    1.3e-230   100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g76280 isoform X3 [Cuc...
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
Cucsa.079810    Cucsa.079810    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
Cucsa.079810.2    Cucsa.079810.2-protein    polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                        Unique Name                         Type
Cucsa.079810.2.three_prime_UTR.3    Cucsa.079810.2.three_prime_UTR.3    three_prime_UTR
Cucsa.079810.2.three_prime_UTR.2    Cucsa.079810.2.three_prime_UTR.2    three_prime_UTR
Cucsa.079810.2.three_prime_UTR.1    Cucsa.079810.2.three_prime_UTR.1    three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
Cucsa.079810.2.CDS.12    Cucsa.079810.2.CDS.12    CDS
Cucsa.079810.2.CDS.11    Cucsa.079810.2.CDS.11    CDS
Cucsa.079810.2.CDS.10    Cucsa.079810.2.CDS.10    CDS
Cucsa.079810.2.CDS.9     Cucsa.079810.2.CDS.9     CDS
Cucsa.079810.2.CDS.8     Cucsa.079810.2.CDS.8     CDS
Cucsa.079810.2.CDS.7     Cucsa.079810.2.CDS.7     CDS
Cucsa.079810.2.CDS.6     Cucsa.079810.2.CDS.6     CDS
Cucsa.079810.2.CDS.5     Cucsa.079810.2.CDS.5     CDS
Cucsa.079810.2.CDS.4     Cucsa.079810.2.CDS.4     CDS
Cucsa.079810.2.CDS.3     Cucsa.079810.2.CDS.3     CDS
Cucsa.079810.2.CDS.2     Cucsa.079810.2.CDS.2     CDS
Cucsa.079810.2.CDS.1     Cucsa.079810.2.CDS.1     CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
Cucsa.079810.2.five_prime_UTR.1    Cucsa.079810.2.five_prime_UTR.1    five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description           Source   Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM     PF01535          PPR                  coord: 130..148, score: 0.09
                                                                                    coord: 347..374, score:
IPR002885  Pentatricopeptide repeat  PFAM     PF13812          PPR_3                coord: 447..492, score: 0.
None       No IPR available          PANTHER  PTHR24015        FAMILY NOT NAMED     coord: 346..479, score: 1.2E-136
                                                                                    coord: 565..582, score: 1.2E-136
                                                                                    coord: 528..547, score: 1.2E-136
                                                                                    coord: 59..197, score: 1.2E
None       No IPR available          PANTHER  PTHR24015:SF450  SUBFAMILY NOT NAMED  coord: 565..582, score: 1.2E-136
                                                                                    coord: 59..197, score: 1.2E-136
                                                                                    coord: 528..547, score: 1.2E-136
                                                                                    coord: 346..479, score: 1.2E
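
The `coord: start..end` entries in the Alignment column give the position of each match on the polypeptide. A minimal sketch for extracting them and computing match lengths follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names `COORD` and `span_lengths` are illustrative.

```python
import re

# Matches "coord: 130..148" and captures the start and end positions.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every coord entry in the text.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    return [
        (int(start), int(end), int(end) - int(start) + 1)
        for start, end in COORD.findall(text)
    ]


# The three PFAM pentatricopeptide-repeat matches listed above.
pfam = "coord: 130..148 coord: 347..374 coord: 447..492"
spans = span_lengths(pfam)
for start, end, length in spans:
    print(start, end, length)
```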