Cmc08g0225141 (gene) Melon (Charmono) v1.1

Overview
NameCmc08g0225141
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag protease polyprotein
LocationCMiso1.1chr08: 15880138 .. 15900293 (+)
RNA-Seq ExpressionCmc08g0225141
SyntenyCmc08g0225141
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCGAAACTGAGATTGTAAAGGGCAAAAACGATGAAGAGTTGAGTTATTATTACAAATTGAACTCAAAATGTTCACGATTCAAATATGAACTTTTTATTACATTACATAACAAACTCAACACATTACACGTTACCAAACATTCCTTATATAAATAAGAAGTAAAAACCAAACTGAAATATTTTTTATGGGTAAAATAGGAAGAAAAAAAATAGATACAAAATTGTAATTGAAATAAAGAGAACAAAATTTCAAAAATCCTTTTTAATTTTCATCCTCCATCACACTCGTTTTCTGCTTAAAATTCTTAAGCAATTTGGGCCTAGGGTTCATGGGCTTCAAAAACCCAGCTTGTAACCTAACCATTTCCCTCAAGACCCCAATCAATTTCTCCTCAACCGCCGCCCCATCCAACTCCATCACTCTCAAACACCTCGCCACAGCCTCCATCGTACTAACACATCCGCCGTGCGGTTCCTTCCTCAGAATCAACTCCGAATCAAATATGCTTCCGCCTTCAACACTCTCATCGATGTCCAAACAAACTCTCCTCGCAAACCTTGACAGAAACTCCTCGCTCGATCGCACCATCTCCTTCGCGTGCTTCCACGTTCCGTCGAACGCTATCAGGACCAGATCTTCAGTTATGTTGAGATCCGAGATGTTGATCGGCGGCGAGGAGTGAGGTGTAGGCGGGAAAAGGAAGACTGCGGGAGGGGATTGGTCGAGGAGTTTGGAGAGACCGGGCTTGAGGCGGCGATTGACGATGGTTGTTGCGTTGAGGAGGCATTTCGTGAGGATTGGTGTAGTGGAGAGCTTGTGTTGTGATTCGTGAGGATGTTGTAGTATGATGATTTTTGTTTTTGTTGCGATTGGTGAAGATGGAAGGAATTTGCAGAGGCAGACTGGTTGGGGACGGTTGCAATTAGGGCAAATTGGCCGCCGAGGCGGCCGGGACGACGAGGAATATCCGGCGGCGAAGTTGTGGTTGTTTGTTGACTCCATGCCTCTGGAGAAAACGATACCGTTTTGATGTGTTGCTTCCTCGACTTTCGTTTTAGGGTAAGGCATGATAGCATCTTACCTTAGGGTGTTCGACGGTTGGGTTCATTTTTTTTAAACCGAAATTTTCAAAAAAAAAAAAAAAAGAGAAAATATAGTTAATATTTGAGGTGTTTTCAAAAGTATAACACAGAAAACAAAAAAATTCCACCTTGCAATAAAAAATACCTCTACGTCTAGCATTGGCTAGATTTGGTATTCTTATTATATTATCCTTAGATTTTTTAGATTTGTTTATTTATATTTAAATAACAATTATTTTTAGATTATTTTTTATATTTGATATTTGATTTAATATAAATCAAAATTTATGGTTGATTGAAAATAAATATCATAATTTTTAGAAAATATATAAAAGATTACAGAAGAGGAAAGAGAAATTGCAGAAAAAGAGAAGAAAGAAGATATAAAATGTGGTCAAAGTTAGAATATTTTTAAAATTATAATTGGCTTCATGTATTTTAACTACAGTAAATATTTTTGTGTTTTATATTATTATATTTATGAAAATTAACATGTGAAGATTTCTATTTTTTTTTTAATTTTCTTTAAATTTATGTAATTTTCTATTAAATGAAGGTGTGTCTAGGAGTTCTCTTCCCGAATTCATTTTGGGACACATGTTTCATAATATGGATATTTAATTTATAGGATTTGCGAAATCACTTGAATGTTGGATGTTGTTTCATGGGTATAATTCAATAATTTGTAAAAAATAAAATTAATTAATTACTTAACTAAAGAACGGAGATCAATTAAATCTAATGGGGTTTTCATAAAATATTTAACAATTTTGAAAACGAAAAAAGTCATACGTTCATGATTAGAAATACAAAAAATGCTCCAATTAACTATGCACAATCGTTCACAATGCTTGTGTAATATTCTTCTAAATGAGTGATCTAATATCATTTAGGTCATGATACTAGATTGTGTAGTTTTTTTTAACAATTGAAAAAAAATGCTTTAAATTCAAACGATTATGCACGATTGTGTATTTCTTTTAGCAATGAGAAAAAATATTTCAAATTTAAGCGATTGTATTGTCCATGTCAAACGATTGTGTTGACTATAACAAATGATCAACACAATCATGTTGATTATGAAAAACAATCGTGTTGATCATAGTAAGCAATCATATAATTCAATGTAAATAATCATTAAAAACTTCAAATAAGATGATCGTGTTTACCATGCTAAATAATCGTGTTAACTATGGTAAGTGATCGTTAAGACGATGGGGAAATAATTTCAAATTTAAAAGATTGTGTTGATCATGGTAAATGATCATATTAACAATGTTAAACGATCGTTTAGATCATGTCAAATGATATTTAAACGATTTTAATATTCTCGAATATGAGAAAGATGAGTAACTTCATAGATTTTTGTCGAAAATGGTGAAGATGTAATAAAAGATTTAAACTGAAGAATAAATCGTTTAAAAATAGAAAAGAGAAAATCTGAAAGAGAAGATGAAGAAATCTAGAAGAGAATATAGAGTAAATGTCACACCCCCTCCCGGACCACCTGCTACCTTAGACCGAAAGATGGCGTGAAGCCGACGAACAACGCTTTTCTGTACTTGTATCTGTCGACTCTGCTTAAAACTTTCATGTGATGAACTTGCCAATCTAGTATATAAACTATTGTACATGCCACAAAACACAGCGGATCCTAGGCTAGTCAAACACAGCTAGTCAAACACATACATGAAATGTACCTCAAAATTTACCGATTTCACGAGCAAACCTAAAGACACCCTAATTAAAATATAACCAACAAAATTTACACAACTACAGCTCCTAAAGCTTATACAAACTGTGGAGTATCTATAACATACAAGGGTGTGAATACATCACAACACAACAGACTTTATGCCCAACTCAAAACACGTACTGGCAATCGATAATGAAGCTTCCGCAGACTAAGGCAGTCTATCAGGCACCGGGAAGTCGATCTCTACCTGGAAAATGGGTAAAACATTTTGGAAAGGGTGAGCTATGAAGCTCAGTGAGTGACTCAGTTTAAAACTATGAATTTAAAGATAAGAGCACGTGAAAACTTTACACATATAAATCTATTTATACAAGTCGTAAATGTAAATGTGTAATAGTAAGAATAGCTTCCTTAAATACTCAGCAAGTTCATCATTGAAATCTAGCAAGGCATATCAAGTATATCGAAGCATTTTAACTAACATGACGAATCTACGTTGTGTAGGGTACACCTTGTACAACAACGTTTACAAGCACTACGATGTAGTGGGGTCCGCCCAGCACTCCCATCGTCTTTGTCGTAGTGGGGCCCGCCCAGCACTCACGACAGCATTGTTGTTACGGAGTACACTCAACGTCAACAACGAAATCTAACTAGTGTGCACACGGTAATCTCTAGCCTCAGGCTCAAGATCTCAGTCATGTAGTTAATGAAAATTTCATAAATCATCATAAAATATTAAAACATGCCTGCTTATAAATTTTACACAAAGCTCGAACTTTACTCAGCTCTTTCGTGGAAAATTGTAGTTCTTAAAACAAAACTTCATACTAATTCATACTTTGCTTAAAATATTGTGGTTTAAATACTTTCCAAACATTTAATATCTACGTGCTAGCGTAAATTTCTTTAAGAAATCTAGTCTCCAAATGTATACTTGAAAATAGTCATGAAATTCAAATACCTTTCATGGCTTCGTAAATAAACATTTATCGCATTCAATCATAATAACAGTTATGGAAAATATTTCAAAGTTGGTTTTGTCACTCACAGTCTTGGGCTAGATCCTTGGTCTATAAGCTCGGTTTAATTCTTCTCGGCCTGCCAACAACCCAACCGAGTATTAAAACCCTAAACATTCTGTTTTCCTAAAGATATACATAACATGTCGCACCAAAGATGACGCTCACGAAAACAAAGCTTAAGAAATAAAATCCTATTTCAACAATTCTCTAAAATTTCCAGATTTCTAACATAAACTTCAGAGAACTATAACTTTCTCAATACTTAACCAAAATGAGTGTTCTTTATATCAAACTCTTCGTTTTGAGATCCTCTAAAACTTACCTGAAGACATATAAATCTGATTCCCCGTGCATAAACACATATAACCCTCAAAACAGAACCACTTCCAGAATCGCGCGCACACGACCTCTTTAACTTGTTCCTACGATTTAACCAATTTTTAGTCGTTTTTGTTTACAATTTCTTCATGAAAGTTGTAGTAAATTGAATAAGCTTTCCAACCATACCAAATAGAAATTCTGTAACGACCCAACTTTTCCGGACTAAGCTGAGGTCACTACCAAATACCAAAACTCGACCACTCAACTTAAAAGTTTAAAACGGACCAGATACGATTCATTAAAACGTTATAAACCTTACAAAAGACAGTTTTGGGCCCTATTTTAAATAATTCAAAAAAAAAATATCACAAAATAAAATATCAAGTCACCAGTCCAAAATCACAGTCAAAATATTCTGACAAAATACATAGCGGAAGCGAAAAGAAAACCAGACGCGTCCATATGGCCTTCACGCACCCTTCCTGCCTCTCGTCGGTCTGCCCCTCGCTGTACCCCTACCTGAAAAGTTAAAGAAAAGAAAGGGTGAGTATAAACATACCCAGTAAGGGACCCACTACTGGGCCCGTTAGGGAACAACAGTTAACTTCCTATTCGGGGGTACCCTACATAACAGTCTAGTGGTTCCGTAGAACGCACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATGCACGTATCCGTAAGGTACACTACCCCATAGATGAAGCTAACCGTTACCCCTCAGCCCTTACCAAACTGTCTACATCAGTCACATCTCAACGGCATTCATATTACAGTCCCGCCATAGGCTTTGTCAGTCAGATAGTATAGGTTTAAACATCTACACCCTCAGTTGCTATATGCATTACCGATTCGTACACCAATAGGGAAACCCTAGGTCCAATCGACTAACCAACCAAAACCGGACTCACTGTCCTTCCCCGTCCAAACTCCATGAACCACATCCAGACCAGTGTTTAATACATACTAAACCGTAACTTTAAGGTCCATCAACATATAATTTCAATCCACAAACAGCAGTCACAGTATATTTTCATCAGACAGAATATAATATCAGTACTTAACAGTCAACACGCATGCAGATATTCAGTACAGTCACTAACGTGTAATCCCCTGTGGATTACTACGGTTTTAGCCTGGACTCGGGGTCCAGTAGTAGGGAAACCCTTACCTGAAACTCGGTTATGCCCCTCGATCGAAATCCACGCTCAACAGATCCACCTAACGAAACACGAGGTTTTAGTTGATGACTTTAGTAGCAGTTGTTTTAAAAAGGTCAGAAGCGATATCCGTTGACTTACCCAAGGAAAGGAATCTATCTCTAACCAGGCCTGCCGCGGAACCCGAGAACTTAAGCGTCAACTCTGCACTACCTTCAAGGGAGAAAAGTAGGGCCATATCTTAAATCAACATTCAAACTCAAATCGGACGAGGTAACATTAGGGAATTCATCAGAACAATCCTTACCGAAAACTCACCGTGAACCGAAATGAAGGAAGGAAGGCTTAGGTGGCTCGGCTCGGCTCGGCTCGGCTCGGCTCGGCTCGGCTTGGTTCTCGGCTCGGCTCGGCTTAACTCGGCTCGCGGCTCGGCTCACTCGGCTCGCGGCTCGGCTCGCGGCTCGGCTCGCGGCTCGCTCGGCTTAGGGCTCGGCTCGGACAAGGATGGCGGCTCGGGTACGGCTCGCGGCTCGGCTTGTCCAACACGGCTCACCCGGATCTGTGTGGCTCGGATCGGAAACGGCTCGGGTCAGTTGGAACCGGGTCGGGTCTGGACGAAAGAAAATGGGCAGCACGGTGATCGATTTTTCCCCCGGCGAGCGACGGCGGCGGCGATTCGCGGACGACACACGAGGGATGCGGGTGGTTGCTTCGTGGTGCGGCGACGAGCGGCGGTGGCGTTGTTGTGTGAGGCCGAGAAGGAATCGGAAATGGATGGCTACGGCGGACGGCTCCGACGACGTGGACAGAGACGGAAACGCGCTCGACGGAAGGAAAAGAAGAGAAACCGGGCGGCGCACGACGGAAAGGAGAGAGAGAGAGAGAGATGACGGCGGCGGCTTTGTGAACGTTGAAGACGAAGAAGAGAACGAAGGAGAAGAAGAAGGTGCGGCGGCTGGGCTGAGGGTGTGCGTTCGCGCGTTAGGGTTTCTGAAAAAAAAAAATATTTTATTTTATATACATATATATATATATATATATATATATACAGAAATATTAAATTAAATTATAATTATAATAATAATATTTAATAATAACAATAATATTAATAATAATATTAATTAATAAAAATGAATTAATAATATTTAATAATAATAATAATATTAGTAATAATATTAATTAATAATAATGAATTAATAATATTAAATAATAATAATAATAATATTAAGTAATAGTAATAATAACAATAATATTTTAATCATAATAAATAATATTAATAATATAGTTAATACAATATTATTATTATAGCAATTAACAAAACATTAAGTTTTTTTTAAAAAAATTTCGTATCCCCACGAAAGATAAAATTTACCTCGAAAAGATTCGGGACGTTACAAATTCTAACTCCACCGATACACTCTGAAATACAAGAAAAACCAGAGACCTCCTGATTTCGAGTGAAAACCAACTGCCCTGTTTTCTTCATCAAAAAGCACCAAATGAAAATCCGAATTTGCTCCAACGTTCTTCATAAAGTTTGTTTAAAAATGAGTTAACTTTCAGTAAATGTAAATCTCACCTCTTAATTCCTTTTGAAGAGTTCAAACCGAATTAAAAACCAGTGATGTGCAGAATGAACTAAAATTCAGCCATTGATACACTTTGCTTGAGTTCTCCTCCTCTTCTTCAACCTAAAAATTCTAGTAAAATTTCTCTTCTTCTTCCAAACTCTCTCTTAGAAGTTCTCTGCAAATTGAAGAAGGAAAAAGAAAAAAAAAAATAAATAAAGGAAACTTGGATGTCTTCAAACTTGGCGGTGAAGGGAAGAGAAGAAGAAGAAGAAAAGAGAAAATGAAATTTTCTTCTCCTTTTCTTCTTTTTGTCCTTTCTTTTCTTTAATTAATTTAATAAAACTATTTAATATATAAATATATATATTTATATTTACACTTAATTTCCATTTTCTTTTTCTTTTGTTTTTCCACAATTAAAGAAATTAAACCAAGAAAATATCAGGTTTACAACTTACCTTCCCTTTAGAAAATTTCGTCCTCGAAATTTAGAATGTCCTTAACTGAAAAGTATCGGAGAGCTCTTCTTCATCTGATATTCTGGTTCCCAAGTTGCCTCCTCCGCTCCATGATGTCTCCACAAAACTTTTATGAGTGGAATCGTTTTGTTTCTCAAAACTTGTTCCTTTCTGTCGAGAATCTGAACTGGTTCTTCAACATAACTCAAATCTTCTTTTAATTCAACTGGTTGATCTTGCAAAACATGCGATGGATCTGGTATATATTTTCTTAACATGGATACATGGAAAACATCATGTATTCGTGCAAGTTCTATTGGCAACTCAAGTCTATACGCTGCTGGTCCCACTCGTTCCGTTATCTGATATGGCCCAATATATCTAGGACATAACTTACCTTTTCTTCCGAAACGAATAACACCTCGCCATGGAGATAATTTCAAGAAAACTTGATCTCCAACTTGGAATTCTAGGTTTCTTCGTCGCTTATCCGCATAACTTTTCTGTCGATCTTGGGCTTTCCTCAGATTTTCTCTGATCAACTTAATATTGTTTGTCGTAATCTGAACCAACTCAGGACCTACTAACTTCCGCTCTCCCACTTCATTCCAGCACACAGGAGTTCTGCATGGTCTCCCGTATAAGGCTTCATATGGTGCCATACCGATACTAGACTGATAGTTATTATTATAAGCAAACTTCATAAGTGGCAAGTGGGTATCCCAACTTCCTTTAAGTTGTAGGACACATGCTCTCAACATGTCCTCTAAAGTTTGGATGGTCCTCTCGGACTGACCATCTGTTTGGGGATGAAATGATGTACTAAACTTTAGCCCTGTTCCCATTGCTTTCTGTAAACTAGGCCAAAATTTAGAAGTAAACCTCAGATCCCTATCTGAAACTATGGACACTGGTACTCCATACTGACTCACAATCTTATCAACATATAATCTCGCTAGCTGGTCTAACGTAGATGTCATTTTAATCGGTATAAATCGTGTCGTCTTGGTGAGTCTGTCTACTATTACCCATATACCATCATGTCCACTGGATGTACGAGGTAATCCAAATAGAAAATCCATAGTAATATGCTCCTCAGCTCATAATCATTCTACATATTAAATACATCTTTGTACCTTTCCTCAGCTCAAGCGTTTACCACGCTCTTTTGCCAAGTTGTAACTATGTGGAGTAGTCTTAATTTAAGAAAGTCACTCACGCAGAAGTACATAATTTAAGAAAGTCACTCACGTGAGAGTGTGGAGTTAAGAAAGTCACTCACGCCAGCTTCCGATCCTTTCTTCCCTGAATGCCTCTTCTTGCCTAGAGTTCTCTAGGTAGCAACTCTACTCTCTGCATTCTTTTAACAAAACTTCTAAAATGCTAGCACTTTTCTTAAGGTTTTTAACCCTATTTATACTAATCTTTTAGACCTGTTACTTTCTTTACTTGCCAGTTTAGAATTATAACTTTACACGTGTTGTTCTACCGCTTCGCCGCACTATGCAGCAAAACCAAACCCTACATGTAAACCAACGATCAACTTCCGATTTGACATTTCCTCGATACGTAAACCTTATCATCTAGAGAAGCTAATCTCTTATCACCTTATCCTTTCGGGAAGCTAAACTCCTTTTTCCTGCCATCACCTTATCTAACTTCTCCTCGCATTACTTTGGTAATTGCCTAACTCTTCTTGTGATAGGACTCCTTCGTCAATGCCTCTCTTCTTCAACAAGATTCCTTTTCTCGGCTCTGAGCCTCACATCTTTGCATACTCTTTAGCCGCGAACCCCCTTGCACTTCCTTGTAATACTTGACGGATTCAAGCTTCTAAAAATCTCTAGAAATTTTGGAAATTCTCTCAATTCTTTCTTAAATGAGCATAAGGTAAAAGCAGCTTCACAATTCTCAAACTTCACCTATTTATAAGCCTTTCCAGTGACATTTTCTATGCAGAGTTGATGACTTCTTCCGAGCGATGGGAGTTAGCAAATACCAGAATGCCACATATTACTACCATCTTCTTATCCTAAATTCAATTGGACGACAATGTCAAATCAATTCATCGGCCTTTCTACACAAGAATTACGTCAACTACTACATGGTGAAGTAAGCTCATAGCGTGACAAAGATTTTCGCATAGTCTTTTCTCATCGCATAATAGTTTCTATCGCATAGTGTAACGCCCCGACGTTCGAGGTAAAATTTTAGCATTTTTAAGTAAATTGTTCGAAAATTTGAGTTTTGGTAAAACGGTAAAATAATAATATAATATTGATTATATTATCATTATCGGGAATTATTATTATTATTTTAATGTTAATTTTTTTTCTCTTTTATTTTATTTAAAATAAATATTAATATTCTTATTTAATATTAATATAATAATATATATTAATATTCCATTATAAATTTATATTAATTTTATTCTATATATATATATATATATCATTATTATTTTTATTTTTATTATAATTAAGATTATTATTATTATTATTATTATTAGTTATTTTTATTACATATATATATATTATTATTTATATATGTATATATATATATATATATATATATATATATATATATATATATATATATATTTATTACAAAAAAAAAAAGAAAAACCCTAAGTTCGCGCCGCTACCCCCCCCCCTCGAATTTTTCTTCTCCCTTCCGTTTCCTCCTTCCACGCCGCGCCACCGCCCCGGGTTCCCTTCTCTTTTTCCTTTCCGACGGCCTCTCTCCCTCCATTTCTTTTCCGTCTCTGCCTCGCCACCATCACCATCATCCCTCCTTCTTGTTCTTCTCCTTCTCCCTCTTCTTTACAGCCCCCACCGACGACAGCAGACGTCCGTCCGCCGCGGCCTCCATCCGTTTTCGCTTCGCCCCCAACCGCCGGCAGCAGATAACTCTCTCTCTCTTCGTTTCTATCACGGCTGGCTTACTCTCCGTCACCAGCTCGTGTCTCCGACGCCGACAGAGCACGGAGGAAGCGCCGTCGTTCGCCGGAAGGAAGAAGCCCCAGAACGTGCTGCCGTTTTTCGTCCAACCCGACCCGGTTCCAAGCTGACCCGAACCCGTTTCCGGTCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAACCAAGCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAACCAAGCCGAGCCGAGCCGAGCCAAGCCACCTAAGCCTTCCTTCCTTCATTTCGGTTCACGGTGAGTCTTCGGTAAGGATTGTTCTGATAAATTCCTTAATGTTACCTCGTCCGGTTCGAGTTTGAATGTTGGATTAAGATATGGCCCTACTTTTCTCCCTTGAAGGTAGTGCAGAGTTGACGCTTAAGTTCTCGGGTTCCGCGGCAGACCTGGTTGGAGTAGCTTCTTTTCCTTGGGTAAGTCAGCGGATGACCTTTTAAAACAACTGCTACTAAAATCATCAACTAAAACTTTCGTGTTTCGTTAGGTAGATCTGTCGAGCGTGGTTTTCGATCGAGGGGCATAACCGAGTATCAGGTAAGGGTTTTCCTACTACTGGACCCCGAGTCCAGGTTAAAACCGTAGTAATCCACAGGGGATTACACGTTAGTGACTGTACTGAATATCTGTATGCGTGTTGACTGTTAAGTACTGACATTATATTTTGTCTGATGAAAATATACTGTGACTGCTATTTGTGGATTGAAATCATATGTTGATGGATCTTAAAGTTACGATTTAGTATGTAGTAAAGCACTGGTCTGGATGTGGTTCATGGAATTTGGATGGGGAAGAACAGTGAGTCCGGTTTTGGTTGGTTAGTCGATTGGACCTAGGGTTTCCCTATTGGTGTGCGAAACGGTAATGCATATAACAACTGAGGGTGTAGATGTTTAAACCTATACTATCTGACTGACAAAGCCTATGGCGGGACTGTGATATGAATGCCGTTGGGATGTAGCTGATGTAGACAGTTTAATTTGGACTGAGGGGTAACGGTTAGCTTCATCTATGGGGTAGTGTGCCTTACGGATATGTGCGTCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACGGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACCGATATGTGCATCCTTCGGGAGCACTAGACCGATATGTGCATCCTTCGGGAGCACTAGACCGATATGTGCGTCCTTCGGGAGCACTAGACTGATATGTGCGTTCTACGGAACCACTAGACTGTTATGTAGGGTACCCCCGAATAGGAAGTTAACTGTTGTTCCTTAATGGGCCCAGTAGTGGGTCCCTTACTGGGTATGTTTATACTCACCCTTTCTCTTCTTTAACTTTTCAGGTAGGGGTACTGCGAGGGGCAGACCGACGAGAGGCAGGAAGGATGCGTGAAGGCCATATGGACGCGTCTGATTTTCTTTCGCTTCCGCTATGTATTTTGTCAGAATATTTTGATTGTGATTTTTGGACTGGTGACTTGACATTTTATCTTGTGACTTTTGAATTATTTAAAATAGGGCCCGAAACTGTCTTTTGTAAGGTTTATACTGTTTTAATGAATCGTATCTGGTCCGTTTTAAATTTTACGTTGAATGGTCGAGTTTTGGTATTTGGTAGTGATCTCAGCTTAGTCCGGAAAAGTTGGGTCGTTACAGTTGGTATCAGAGCGTAAGTTTTAGGTTCTGTAGACTGACTTATAATGTGAGTCTGTGTTTTGTGTCCTTATGGCTGAAACGATCCTTGCCGCTCGTCAGGTACGCTCTCATGAAAGTATATGTATAACTCTACATGCATTACCTTACCTAAGTTAAACTGCAAATTCAATTACCATTTATGACTAAAAGAATCGTTTGGTGGTTGTTAGGAAAATGCCACCAAGGAGAGGTGCACGTAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCGGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGCGCCAGTTACTCATGCGGACCTAGCCGCCATCGAGCAGAGGTTTAGAGATATGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGTACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCAGCAGAGGCTAAACATCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGACCCCACCAGGGCTCAGATGTGGTTATCATCCTTGGAAACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAGGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGTTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACCTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAATTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAAACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGATGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTGTGTACCAGTTGTGGGAAGCACCATTTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTCCTGGATATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCGGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGTGTGGTGGATACTAGAGAGGCGGATGTATCCCTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCGGGGTTACCTCCGCACAGGGAGGTTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCTTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTCTTGTTCGTTAAGAAGAAGGACGGATCGATGCGTCTGTGCATTGACTATAGGGAGTTGAACAAAGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATAGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGGTACCATCAGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTTCTAGACACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGATGGAGGCCGAACACGAGGAGCATTTACGCATGGTTTTGCAAACACTTCGGGATAATAAATTATACGCAAAGTTCTCGAAGTGTGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTCAATGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGTTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCGCCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCTAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAAGTATCACATTCAGCAGCACTCATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAACCGACTTTGAGGCAGAGGATCATTGATGCTCAGAATCACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAGACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGATTAAGGCAGAATTATTAACTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGAGAAGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAAGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTACCGAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCATTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCCCTGTACGGCAGATGTTGTAGATCCCCGGTTTGCTGGGATGAGGTAGGTGAGCAAAGATTGATGGGTCCTGAGTTAGTCCAGTCTACTAACGAAGCTATACAGAAGATTAGATCACGCATGCATACCGCACAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGACCTTGAGTTTGAGGTAGGGGATAAGGTGTTCTTAAAGGTAGCACCTATGAAAGGTGTCTTGCGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATAATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCACGTAGTGGATTACGAGCCACTAGAGATTGATGAGAACTTGAGCTATGTTGAACAACCTGTTGAGGTGCTTGCTAGAGAGGTGAAGACGTTGAGAAATAAAGAAATTCCCCTGGTTAAAGTCTTGTGGCGGAATCATCGGGTAGAAGAGGCTACGTGGGAGCGAGAAGATGACATGAGATCTCGTTATCCCGAGCTGTTCGAGGAATAAAACTTTCGAGGACGAAAGTTCCCTAAGGAGGGAAGAATGTAACGCCCCGACGTTCGAGGTAAAATTTTAGCATTTTTAAGTAAATTGTTCGGAAATTTGAGTTTTGGTAAAACGGTAAAATAATAATATAATATTGATTATATTATCATTATCGGGAATTATTATTATTATTTTAATGTTAATTTTTTTTTCTCTTTTATTTTATTTAAAATAAATATTAATATTCTTATTTAATATTAATATAATAATATATATTAATATTCCATTATAAATTTATATTAATTTTATTCTATATATATATATATATATCATTATTATTTTTATTTTTATTATAATTAAGATTATTATTATTATTATTATTATTAGTTATTTTTATTACATATATATATATTATTATTTATATATGTATATATATATATATATATATTTATTACAAAAAAAAAAAGAAAAACCCTAAGTTCGCGCCGCTACCCCCCCCTCGAATTTTTCTTCTCCCTTCCGTTTCCTCCTTCCACGCCGCGCCACCGCCCCGGTTCCCTTCTCTTTTTCCTTTCCGACGGCCTCTCTCCCTCCATTTCTTTTCCGTCTCTGCCTCGCCACCATCACCATCATCCCTCCTTCTTGTTCTTCTCCTTCTCCCTCTTCTTTACAGCCCCCATCGACGACAGCAGACGTCCGTCCGCCGCGGCCTCCATCCGTTTTCGCTTCGCCCCCAACCGCCGGCAGCAGATAACTCTCTCTCTCTTCGTTTCTATCACGGCCGGCTTACTCTCCGTCACCAGCTCGTGTCTCCGACGCCGACAGAGCACGGAGGAAGCGCCGTCGTTCGCCGGAAGGAAGAAGCCCCAGAACGTGCTGCCGTTTTTCGTCCAACCCGACCCGGTTCCAAGCTGACCCGAACCCGTTTCCGGTCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAGTGAGCCGAGCCGCGAGCCGAACCAAGCCGAGCCGAGCCGAGCCGCGCCGAGCCGAGCCGAGCCACCTAAGCCTTCCTTCCTTCATTTCGGTTCACGGTGAGTCTTCGGTAAGGATTGTTCTGATAAATTCCTTAATGTTACCTCGTCCGGTTCGAGTTTGAATGTTGGATTAAGATATGGCCCTACTTTTCTCCCTTGAAGGTAGTGCAGAGTTGACGCTTAAGTTCTCGGGTTCCGCGGCAGACCTGGTTGGAGTAGCTTCTTTTCCTTGGGTAAGTCAGCGGATGACCTTTTAAAACAACTGCTACTAAAATCATCAACTAAAACTTTCGTGTTTCGTTAGGTAGATCTGTCGAGCGTGGTTTTCGATCGAGGGGCATAACCGAGTATCAGGTAAGGGTTTTCCTACTACTGGACCCCGAGTCCAGGTTAAAACCGTAGTAATCCACAGGGGATTACACGTTAGTGACTGTACTGAATATCTGTATGCGTGTTGACTGTTAAGTACTGACATTATATTTTGTCTGATGAAAATATACTGTGACTGCTATTTGTGGATTGAAATCATATGTTGATGGACCTTAAAGTTACGATTTAGTATGTAGTAAAGCACTGGTCTGGATGTGGTTCATGGAATTTGGATGGGGAAGAACAGTGAGTCCGGTTTTGGTTGGTTAGTCGATTGGACCTAGGGTTTCCCTATTGGTGTGCGAATCGGTAATGCATATAACAACTGAGGGTGTAGATGTTTAAACCTATACTATCTGACTGACAAAGCCTATGGCGGGACTGTGATATGAATGCCGTTGGGATGTAGCTGATGTAGACAGTTTAATTTGGACTGAGGGGTAACGGTTAGCTTCATCTATGGGGTAGTGTGCCTTACGGATATGTGCGTCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACGGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACCGATATGTGCATCCTTCGGGAGCACTAGACCGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAGCACTAGACTGATATGTGCGTTCTACGGAACCACTAGACTGTTATGTAGGGTACCCCCGAATAGGAAGTTAACTGTTGTTCCCTAATGGGCCCAGTAGTGGGTCCCTTACTGGGTATGTTTATACTCACCCTTTCTCTTCTTTAACTTTTCAGGTAGGGGTACTGCGAGGGGCAGACCGACGAGAGGCAGGAAGGATGCGTGAAGGCCATATGGACGCGTCTGATTTTCTTTCGCTTCCGCTATGTATTTTGTCAGAATATTTTGATTGTGATTTTTGGACTGGTGACTTGACATTTTATCTTGTGACTTTTGAATTATTTAAAATAGGGCCCGAAACTGTCTTTTGTAAGGTTTATATTGTTTTAATGAATCGTATCTGGTCCGTTTTAAATTTTACGTTGAATGGTCGAGTTTTGGTATTTGGTAGTGATCTCAGCTTAGTCCGAAAAAGTTGGGTCGTTACACATAGGCTATCCCCATCGCATGGGATCTTATTACGTAAAGGTAGATCGTAATTAGAGTTTCATAACTACTACCTCCTTATTGGAATTTTGCCATTGAAATTCAGTAAGTCTTATTTGAAAAGTTGTTGGTATTTAGCCTTCATTAGCTCTTCATTTTCCTAATGGACTTCTTCGACGCCATGGTTGCTCCATGAAATCTTAACCATGTGAATCACTTTTATCCTCAATACTTGCTTTTTTCTGTCCAACATGTGCAAAGACCTTTCTTTGTAAGTCAAGTTCTCTTGCACTTCTAAGGGTTGTTCTTGGAGGATATGTGAAGGATCTGGTACCTATTTTCTTAACATGGAAACATGGAAGACATTATGAATCCTACACAATTTTGCGGGCAGTTTTAGTTCATAAGCCATTTGTCCTACGAGTGTTAGAATTTCATAAGGTTCTATATGTCGTGGGCTTAACTTCATTTTTCTCCTAAATCGGATCACACCCTTCCATGGTGAAACTCTCAAGAAAACTTTGTCTCCAACATTGAATT

mRNA sequence

TCGAAACTGAGATTGTAAAGGGCAAAAACGATGAAGAGTTGAGTTATTATTACAAATTGAACTCAAAATGTTCACGATTCAAATATGAACTTTTTATTACATTACATAACAAACTCAACACATTACACGTTACCAAACATTCCTTATATAAATAAGAAGTAAAAACCAAACTGAAATATTTTTTATGGGTAAAATAGGAAGAAAAAAAATAGATACAAAATTGTAATTGAAATAAAGAGAACAAAATTTCAAAAATCCTTTTTAATTTTCATCCTCCATCACACTCGTTTTCTGCTTAAAATTCTTAAGCAATTTGGGCCTAGGGTTCATGGGCTTCAAAAACCCAGCTTGTAACCTAACCATTTCCCTCAAGACCCCAATCAATTTCTCCTCAACCGCCGCCCCATCCAACTCCATCACTCTCAAACACCTCGCCACAGCCTCCATCGTACTAACACATCCGCCGTGCGGTTCCTTCCTCAGAATCAACTCCGAATCAAATATGCTTCCGCCTTCAACACTCTCATCGATGTCCAAACAAACTCTCCTCGCAAACCTTGACAGAAACTCCTCGCTCGATCGCACCATCTCCTTCGCGTGCTTCCACGTTCCGTCGAACGCTATCAGGACCAGATCTTCAGTTATGTTGAGATCCGAGATGTTGATCGGCGGCGAGGAGTGAGGTGTAGGCGGGAAAAGGAAGACTGCGGGAGGGGATTGGTCGAGGAGTTTGGAGAGACCGGGCTTGAGGCGGCGATTGACGATGGTTGTTGCGTTGAGGAGGCATTTCGTGAGGATTGGTGTAGTGGAGAGCTTGTGTTGTGATTCGTGAGGATGTTGTAGTATGATGATTTTTGTTTTTGTTGCGATTGGTGAAGATGGAAGGAATTTGCAGAGGCAGACTGGTTGGGGACGGTTGCAATTAGGGCAAATTGGCCGCCGAGGCGGCCGGGACGACGAGGAATATCCGGCGGCGAAGTTGTGGTTGTTTGTTGACTCCATGCCTCTGGAGAAAACGATACCGTTTTGATGTGTTGCTTCCTCGACTTTCGTTTTAGGGAAAATGCCACCAAGGAGAGGTGCACGTAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCGGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGCGCCAGTTACTCATGCGGACCTAGCCGCCATCGAGCAGAGGTTTAGAGATATGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGTACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCAGCAGAGGCTAAACATCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGACCCCACCAGGGCTCAGATGTGGTTATCATCCTTGGAAACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAGGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGTTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACCTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAATTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAAACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGATGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTGTGTACCAGTTGTGGGAAGCACCATTTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGGGTACTGCGAGGGGCAGACCGACGAGAGGCAGGAAGGATGCGTGAAGGCCATATGGACGCGTCTGATTTTCTTTCGCTTCCGCTATGTATTTTGTCAGAATATTTTGATTGTGATTTTTGGACTGGTGACTTGACATTTTATCTTGTGACTTTTGAATTATTTAAAATAGGGCCCGAAACTGTCTTTTGTAAGGTTTATATTGTTTTAATGAATCGTATCTGGTCCGTTTTAAATTTTACGTTGAATGGTCGAGTTTTGGTATTTGGTAGTGATCTCAGCTTAGTCCGAAAAAGTTGGGTCGTTACACATAGGCTATCCCCATCGCATGGGATCTTATTACGTAAAGGTAGATCGTAATTAGAGTTTCATAACTACTACCTCCTTATTGGAATTTTGCCATTGAAATTCAGTAAGTCTTATTTGAAAAGTTGTTGGTATTTAGCCTTCATTAGCTCTTCATTTTCCTAATGGACTTCTTCGACGCCATGGTTGCTCCATGAAATCTTAACCATGTGAATCACTTTTATCCTCAATACTTGCTTTTTTCTGTCCAACATGTGCAAAGACCTTTCTTTGTAAGTCAAGTTCTCTTGCACTTCTAAGGGTTGTTCTTGGAGGATATGTGAAGGATCTGGTACCTATTTTCTTAACATGGAAACATGGAAGACATTATGAATCCTACACAATTTTGCGGGCAGTTTTAGTTCATAAGCCATTTGTCCTACGAGTGTTAGAATTTCATAAGGTTCTATATGTCGTGGGCTTAACTTCATTTTTCTCCTAAATCGGATCACACCCTTCCATGGTGAAACTCTCAAGAAAACTTTGTCTCCAACATTGAATT

Coding sequence (CDS)

ATGCCACCAAGGAGAGGTGCACGTAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCGGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGCGCCAGTTACTCATGCGGACCTAGCCGCCATCGAGCAGAGGTTTAGAGATATGATTATGCAGATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGTACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCAGCAGAGGCTAAACATCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTCGATGGGTCTTTGGAGGACCCCACCAGGGCTCAGATGTGGTTATCATCCTTGGAAACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAGGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCATGGTGGGAGACTACAGAGAGGATGTTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACCTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAATTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAAACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGATGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTGTGTACCAGTTGTGGGAAGCACCATTTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGATAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGGGTACTGCGAGGGGCAGACCGACGAGAGGCAGGAAGGATGCGTGAAGGCCATATGGACGCGTCTGATTTTCTTTCGCTTCCGCTATGTATTTTGTCAGAATATTTTGATTGTGATTTTTGGACTGGTGACTTGACATTTTATCTTGTGACTTTTGAATTATTTAAAATAGGGCCCGAAACTGTCTTTTGTAAGGTTTATATTGTTTTAATGAATCGTATCTGGTCCGTTTTAAATTTTACGTTGAATGGTCGAGTTTTGGTATTTGGTAGTGATCTCAGCTTAGTCCGAAAAAGTTGGGTCGTTACACATAGGCTATCCCCATCGCATGGGATCTTATTACGTAAAGGTAGATCGTAA

Protein sequence

MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMREQQKPASPTPAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVLRGADRREAGRMREGHMDASDFLSLPLCILSEYFDCDFWTGDLTFYLVTFELFKIGPETVFCKVYIVLMNRIWSVLNFTLNGRVLVFGSDLSLVRKSWVVTHRLSPSHGILLRKGRS
Homology
BLAST of Cmc08g0225141 vs. NCBI nr
Match: ADN33767.1 (gag protease polyprotein, partial [Cucumis melo subsp. melo])

HSP 1 Score: 778.5 bits (2009), Expect = 3.6e-221
Identity = 394/401 (98.25%), Postives = 396/401 (98.75%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 231 MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 290

Query: 61  QQKPASPTPAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 120
           QQKPASPTPAPAP PVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP
Sbjct: 291 QQKPASPTPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 350

Query: 121 TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 180
           TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK
Sbjct: 351 TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 410

Query: 181 ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 240
           ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR
Sbjct: 411 ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 470

Query: 241 GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 300
           GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP
Sbjct: 471 GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 530

Query: 301 QRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGHTADR 360
           QRNFRPG EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGHTADR
Sbjct: 531 QRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR 590

Query: 361 CPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           CPLR TGIAQNQGAGAP QGRVFATNRTEAEKAGTVVTG L
Sbjct: 591 CPLRPTGIAQNQGAGAPLQGRVFATNRTEAEKAGTVVTGTL 631

BLAST of Cmc08g0225141 vs. NCBI nr
Match: TYK01613.1 (pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 778.1 bits (2008), Expect = 4.8e-221
Identity = 395/405 (97.53%), Postives = 398/405 (98.27%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 228 MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 287

Query: 61  QQKPASPT----PAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120
           QQKPASPT    PAPAP PVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 288 QQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 347

Query: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180
           LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 348 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 407

Query: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240
           QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 408 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 467

Query: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300
           KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 468 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 527

Query: 301 VPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGH 360
           VPVPQRNFRPG EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 528 VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 587

Query: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           TADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 588 TADRCPLRLTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTL 632

BLAST of Cmc08g0225141 vs. NCBI nr
Match: KAA0054678.1 (gag protease polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 772.7 bits (1994), Expect = 2.0e-219
Identity = 394/405 (97.28%), Postives = 397/405 (98.02%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 61  QQKPASPT----PAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120
           QQKPASPT    PAPAP PVPAPAPAPVPVAPQF+PDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61  QQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFMPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180
           LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTA WETTERMLGGDVSQITW
Sbjct: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTARWETTERMLGGDVSQITW 180

Query: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240
           QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300
           KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300

Query: 301 VPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGH 360
           VPVPQRNFRPG EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301 VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTL 405

BLAST of Cmc08g0225141 vs. NCBI nr
Match: KAA0053290.1 (gag protease polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 769.2 bits (1985), Expect = 2.2e-218
Identity = 391/405 (96.54%), Postives = 393/405 (97.04%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEV PVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 1   MPPRRGARRGGRGGRGRGAGRVQPEVHPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 61  QQKPASPTPAPAPVP----VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120
           QQKPASPTPAPAP P    VPAPAPAPVPVAPQ VPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61  QQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQLVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180
           LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240
           QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300
           KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGR STSGQKRKAEQQP
Sbjct: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRDSTSGQKRKAEQQP 300

Query: 301 VPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGH 360
           VPVPQRNFRPG EFRSFQQKPFE GEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301 VPVPQRNFRPGGEFRSFQQKPFETGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           TADRCPLRVTGIAQNQGAG PHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 361 TADRCPLRVTGIAQNQGAGVPHQGRVFATNRTEAEKAGTVVTGTL 405

BLAST of Cmc08g0225141 vs. NCBI nr
Match: KAA0036197.1 (gag protease polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 768.5 bits (1983), Expect = 3.8e-218
Identity = 392/409 (95.84%), Postives = 395/409 (96.58%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 61  QQKPASPTPAPAPVP--------VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTT 120
           QQKPAS TPAPAP P        VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTT
Sbjct: 61  QQKPASRTPAPAPAPAPAPAPALVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTT 120

Query: 121 FDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVS 180
           FDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVS
Sbjct: 121 FDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVS 180

Query: 181 QITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEA 240
           QITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEA
Sbjct: 181 QITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEA 240

Query: 241 ARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA 300
           ARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Sbjct: 241 ARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA 300

Query: 301 EQQPVPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCR 360
           EQQPVPVPQRNFRP  EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCR
Sbjct: 301 EQQPVPVPQRNFRPDGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR 360

Query: 361 QEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           QEGHTADRCPLR+TGIA NQGAGAPHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 361 QEGHTADRCPLRLTGIAHNQGAGAPHQGRVFATNRTEAEKAGTVVTGTL 409

BLAST of Cmc08g0225141 vs. ExPASy TrEMBL
Match: E5GBB7 (Gag protease polyprotein (Fragment) OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 778.5 bits (2009), Expect = 1.8e-221
Identity = 394/401 (98.25%), Postives = 396/401 (98.75%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 231 MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 290

Query: 61  QQKPASPTPAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 120
           QQKPASPTPAPAP PVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP
Sbjct: 291 QQKPASPTPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 350

Query: 121 TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 180
           TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK
Sbjct: 351 TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 410

Query: 181 ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 240
           ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR
Sbjct: 411 ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 470

Query: 241 GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 300
           GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP
Sbjct: 471 GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 530

Query: 301 QRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGHTADR 360
           QRNFRPG EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGHTADR
Sbjct: 531 QRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR 590

Query: 361 CPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           CPLR TGIAQNQGAGAP QGRVFATNRTEAEKAGTVVTG L
Sbjct: 591 CPLRPTGIAQNQGAGAPLQGRVFATNRTEAEKAGTVVTGTL 631

BLAST of Cmc08g0225141 vs. ExPASy TrEMBL
Match: A0A5D3BPI1 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G002120 PE=4 SV=1)

HSP 1 Score: 778.1 bits (2008), Expect = 2.3e-221
Identity = 395/405 (97.53%), Postives = 398/405 (98.27%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 228 MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 287

Query: 61  QQKPASPT----PAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120
           QQKPASPT    PAPAP PVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 288 QQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 347

Query: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180
           LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 348 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 407

Query: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240
           QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 408 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 467

Query: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300
           KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 468 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 527

Query: 301 VPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGH 360
           VPVPQRNFRPG EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 528 VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 587

Query: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           TADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 588 TADRCPLRLTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTL 632

BLAST of Cmc08g0225141 vs. ExPASy TrEMBL
Match: A0A5A7UI54 (Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold24G004990 PE=4 SV=1)

HSP 1 Score: 772.7 bits (1994), Expect = 9.7e-220
Identity = 394/405 (97.28%), Postives = 397/405 (98.02%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 61  QQKPASPT----PAPAPVPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120
           QQKPASPT    PAPAP PVPAPAPAPVPVAPQF+PDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61  QQKPASPTPAPAPAPAPAPVPAPAPAPVPVAPQFMPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180
           LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTA WETTERMLGGDVSQITW
Sbjct: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTARWETTERMLGGDVSQITW 180

Query: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240
           QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300
           KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300

Query: 301 VPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGH 360
           VPVPQRNFRPG EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301 VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTL 405

BLAST of Cmc08g0225141 vs. ExPASy TrEMBL
Match: A0A5A7UDK9 (Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold102G001010 PE=4 SV=1)

HSP 1 Score: 769.2 bits (1985), Expect = 1.1e-218
Identity = 391/405 (96.54%), Postives = 393/405 (97.04%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEV PVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 1   MPPRRGARRGGRGGRGRGAGRVQPEVHPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 61  QQKPASPTPAPAPVP----VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120
           QQKPASPTPAPAP P    VPAPAPAPVPVAPQ VPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61  QQKPASPTPAPAPAPAPARVPAPAPAPVPVAPQLVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180
           LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121 LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240
           QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181 QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300
           KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGR STSGQKRKAEQQP
Sbjct: 241 KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRDSTSGQKRKAEQQP 300

Query: 301 VPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCRQEGH 360
           VPVPQRNFRPG EFRSFQQKPFE GEAARGKPLCT+CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301 VPVPQRNFRPGGEFRSFQQKPFETGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 361 TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           TADRCPLRVTGIAQNQGAG PHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 361 TADRCPLRVTGIAQNQGAGVPHQGRVFATNRTEAEKAGTVVTGTL 405

BLAST of Cmc08g0225141 vs. ExPASy TrEMBL
Match: A0A5A7T3M7 (Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold69G00720 PE=4 SV=1)

HSP 1 Score: 768.5 bits (1983), Expect = 1.8e-218
Identity = 392/409 (95.84%), Postives = 395/409 (96.58%), Query Frame = 0

Query: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAIEQRFRDMIMQMRE 60
           MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAA+EQRFRDMIMQMRE
Sbjct: 1   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 61  QQKPASPTPAPAPVP--------VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTT 120
           QQKPAS TPAPAP P        VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTT
Sbjct: 61  QQKPASRTPAPAPAPAPAPAPALVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTT 120

Query: 121 FDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVS 180
           FDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVS
Sbjct: 121 FDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVS 180

Query: 181 QITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEA 240
           QITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEA
Sbjct: 181 QITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEA 240

Query: 241 ARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA 300
           ARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA
Sbjct: 241 ARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKA 300

Query: 301 EQQPVPVPQRNFRPGDEFRSFQQKPFEAGEAARGKPLCTSCGKHHLGRCLFGTRTCFKCR 360
           EQQPVPVPQRNFRP  EFRSFQQKPFEAGEAARGKPLCT+CGKHHLGRCLFGTRTCFKCR
Sbjct: 301 EQQPVPVPQRNFRPDGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR 360

Query: 361 QEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGVL 402
           QEGHTADRCPLR+TGIA NQGAGAPHQGRVFATNRTEAEKAGTVVTG L
Sbjct: 361 QEGHTADRCPLRLTGIAHNQGAGAPHQGRVFATNRTEAEKAGTVVTGTL 409

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ADN33767.13.6e-22198.25gag protease polyprotein, partial [Cucumis melo subsp. melo][more]
TYK01613.14.8e-22197.53pol protein [Cucumis melo var. makuwa][more]
KAA0054678.12.0e-21997.28gag protease polyprotein [Cucumis melo var. makuwa][more]
KAA0053290.12.2e-21896.54gag protease polyprotein [Cucumis melo var. makuwa][more]
KAA0036197.13.8e-21895.84gag protease polyprotein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
E5GBB71.8e-22198.25Gag protease polyprotein (Fragment) OS=Cucumis melo subsp. melo OX=412675 PE=4 S... [more]
A0A5D3BPI12.3e-22197.53Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45... [more]
A0A5A7UI549.7e-22097.28Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol... [more]
A0A5A7UDK91.1e-21896.54Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol... [more]
A0A5A7T3M71.8e-21895.84Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 147..243
e-value: 8.4E-16
score: 58.0
NoneNo IPR availableGENE3D4.10.60.10coord: 274..368
e-value: 2.3E-5
score: 26.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 275..292
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..42
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..15
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 275..309
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 66..81
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 59..81
NoneNo IPR availablePANTHERPTHR34482:SF4POLYMERASES SUPERFAMILY PROTEIN, PUTATIVE ISOFORM 1-RELATEDcoord: 120..364
NoneNo IPR availablePANTHERPTHR34482DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKEcoord: 120..364
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 348..362
score: 9.438442
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 344..368

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc08g0225141.1Cmc08g0225141.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006508 proteolysis
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
cellular_component GO:0043227 membrane-bounded organelle
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding