CcUC05G081410 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC05G081410
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUnknown protein
LocationCicolChr05: 505395 .. 538772 (+)
RNA-Seq ExpressionCcUC05G081410
SyntenyCcUC05G081410
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATCTGGAAACCCATGTGGTAGAAATACCACAGAATTTGACAGCATCAAACACATGATGATATTTACATATATATAGAAGAGAACGGTAAGTGCTCTTCCTTCTTGTGGAGTTGATATTCTTCTTCTTCATTCACACCATTTCTCTCTGACCTTCTTTCTCATCTCAATGCTGTAATTCTTCTCCAACCCTTTTCCCCTTCTTCTTCTTCTTCTTCCCAAAACCACATTATTTCTCTCTCTTTCATTCTGTCTCTCTATGTATATGCCTGTGGTTAATTGGTGCACAACAGTGGAAAGTAAAATAGGGGAGAGAGGTGGACACATGCTTTGAGGGTTTGAGCTTCCTCTTTGTCAAAACATTGCCTGAATTCTCTCCAACCCTTTCTTAGCTCTGTCCTTCCTCTCTCTCCCTTCCTCTCTTTGCCATCTACGCTCCCTTTTCATCTATTCCGTGCTCAATTGGGACGTTTTTCAATCAATCATATTCAATGTGTCGTGATTTGTTCTAAAAGGTTGAAGATTTCAGGGTTTACCGGCGGACCCAGATAGCAAAAACTATCAGGTTTAAGGAGGCGTCGTTTTTCTTTCCTATAATCAATCCATAAGGCAGATTAATTTTGTTCTTTGTTCTTTTTTCCTCATCCCATTGTCCACAAACTTGGGTTTTTTTGGTTCACATTAAGAGTTTTCAATCAATTGTTACGCAATCATAACTTCTTTCTTTGGAGGACTAGGGTTTTGAAGTAATAGCATAGCTTCGCGTGGGTTTGTATACCCCCATCCAGCGTCTTAAGGAATTCGGAATTTGGGTTGGAAAAGAATGGTTTTGTGTATAACTGTGTTTCTTTTTTGTTGTTCGAAATTTCCCTGAGGTTTGGGGTGGCAGTATCTGGGTTTCTGGGATTGGCTGGAACATGAATCGGAGGGTGAGGAGGAAGGTGACAAGAAAAGGGAAGGAGAAGCTGATTTTGCCAAGCTACCCTGAAATTGAAATTGAGATTGCTGATTTGGACAATAAACAGACTGTAGATTGGACTAGTTTGCCTGATGATACAGTCATTCAGCTTTTCTCTTGTTTGAATTATCGTGACCGGGCAAACTTGTCATCGACTTGTAGAACATGGAGACTTCTTGGTTCATCTTCATGCTTGTGGACTTCATTTGATCTTCGAGCACACAGAATTGATGCTGCAATGGCTGCTTCTCTTGCTTCTAGGTGCAAGAATCTTCAGAAGCTCAGGTTTCGTGGGGCAGAGTCTGCTGATGCAATAATTCTACTTCTTGCAAAGAATTTGCGTGAAATAAGTGGTGATTACTGTAGAAAAATTACTGATGCTACACTCTCTGCCATTGCAGCTCGACACCAGGCACTTGAAAGCCTCCAGCTTGGGCCAGATTTCTGTGAAAGGATCAGTAGCGATGCTATAAAAGCAATAGCTATTTGTTGTCATAAGTTGAAAAAACTTAGGCTTTCTGGAATTAGGGATGTCAATGCAGAGGCTCTCAATGCTCTATCAAAGCATTGCCCTAATTTGCTGGAAATAGGGTTCATTGATTGTCAGAATATAGATGAGATGGCCCTTGGAAATGTATCATCGGTTCGTTTTCTCTCAGTTGCAGGGACCTCAAATATGAAGTGGGGTGCTGTTTCACATCAGTGGCACAAGCTGCCTAACTTGGTTGGTTTAGATGTGTCACGAACTGATATTGGTCCTGTTGCTGTATCAAGATTAATTTCATCTTCTCAGAGCTTAAAAGTCTTGTGTGCCTTCAATTGTGCAGTTCTAGAAGAAGATGCTGGCTTCACTGTCAGCAAATATAAAGGCAAGCTGTTGCTTGCCCTTTTCACTGATGTTGTGAAGGAAATAGCTTCTTTATTTGTCGATATCATAACGAAAGGGGAAAACATGTTGTTAGATTGGAGGAATTTGAAGAATAAAAACAAGATTTTGGACGAGATAATGATGTGGCTTGAGTGGATATTATCTCATAATCTTCTGCGCATTGCTGAGAGCAATCAACATGGTCTGGACAATTTTTGGCTCAATCAAGGTGCAGCTTTGTTACTTAGTTTGATGCAGAGCTCACAAGAGGATGTTCAAGAAAGGGCAGCGACAGGTCTTGCAACTTTTGTTGTCATTGATGATGAAAATGCTAGTATTGACTCTGGAAGGGCAGAAGAAGTTATGCGGCGTGGTGGTATTCGTCTCCTTCTAAACTTGGCAAAGTCTTGGAGAGAAGGGCTTCAGTCTGAGGCAGCAAAGGTAAATTTCATGTGGATATACTAACTGGTAAATAAGCATATTACAAAACTTTTGTTGGTTTATTCGATTTTGATGATAATTGAGACATCTATTCTATTTGTACCCGTTGCTCATAAATATTACAATGCTAATATGAATTGTAGTTGATGGGTTTGAGCCAAAACAACTTTAACATGCTTTTCTTTTGATTGTGTAGGCCATAGCAAACTTGTCTGTGAATGCTAATGTTGCAAAGGCAGTAGCCGAAGAAGGTGGAATTGATATTCTTGCAGGCCTTGCAAGATCCATGAACAGGCTAGTTGCAGAAGAGGCTGCTGGAGGATTGTGGAATCTTTCTGTTGGCGAGGAACACAAAGTCTGATGTTTACATATTGTATTTTTCTCATTAGTAGCATGACTGGAGACTCTTAACCTCTAGTTCTGATAATGGATTTTTAAAATTAAAAATAGGGTGCGATTGCTGAGGCTGGTGGAGTAAGAGCTTTAGTTGATTTGATATTTAAATGGTCTTCTGGTGGTGATGGAGTTCTTGTAAGTTTCACCCTTTCATTATTTGTTTTTGATATCCATGAGTGGCTAGTTATTTTTGTTTTATCTTCATTCCCTTATTCTCCTTTTGTATTCCCCATTTTGGAAATGATGAATTAAGTTCCTCAGAAATCACAGATTTTAATTTCATTTTACATGTATTTTCAGGAACGTGCAGCTGGTGCACTAGCAAATTTGGCAGCTGATGATAGGTGTAGTACTGAAGTTGCTTTAGCAGGTGGCGTGCATGCACTGGTGATGCTTGCTCGCAACTGCAAGTTTGAAGGAGTGCAAGAACAGGTGACCACCCTGTTGAATAACAAGGATTTTTTTTTTTTTTTTTGTCTAAAAAAAGTTGCATACTAGAAGTTGCATATTTTGATCATTCTGTAAGATTGAAGACCACATTTCCTCTTGATTGAAGTAACTTTAATTTTGGTTATTTTCCTTCTGAAATATGCATTTTCCTAAATTATTCTTGACATGGTATTATACTCCCAACAATGAATGCTATGTAAACCTTTAATTACCATTGCGTTATTGTCTATGGCTATGTATTTAAATTTCAAATTCTAGCTCTTTTGCCCCTCTAGAATGCAACCAAGGATGCGTGAATAAAGTGATGGAAGATGTTCTCGCTTTCTATCTTGCACATTGTGTGAACTCCAGTGATTAATTTGCTTTAAGGGCATTTCGTCAATGTCTTCTCTCTTGAAGGTTAGTAGGTTCGATTCAAAGATTTATTCGATGGATTTTGGTTAGATGATTTCTAGTGCCTTTGCCTTTTCCATCTCACTTCAATTTTCTTTGGTAGATGGAGAAAGGTTTTTGGCTATCATGTTTATGTTTCGACGTCGTACCTTAGTAACATTTTTTTGGGAAGAGTTGAAGTATGTGGGGAGATGATTTGGTGTTTAAGAGGCAACTAGTGTTTCGATTATGAGGTGTTTAACAATTTGATCATTGGTAAGGAATTGAATGAAAGACCTCCGGAAAATTATATGTTAACCTGGTCCTTGTGTAACTTACAGGTCAATCTATTCCTTACTCTATAGATTTATAGTTCCTAGCAATTAGAAGAGGAAGAACTATTATTTAGCACATCTTTTCATAAGGTCCTTACCTCCTTCATGGGAGATGTTGCCTTTGTAGTCCTTGTTTGTCTAAAGGCTGTTCCTACAGTTCTTTCTTCTCGTGTTTGGTCTGTCTCTCCCTGTCGAGGGTTCTATTTTCTCCTCTTTGTGGCAGGTGAATAACCTTTCCCAAGAAGGTTAAGTTTCTGTGTGGCAGGTTATGACAAAGGAGTTACACCTTGGATTCGATATCGACTGGGAGGCCTATTTTGGTCAGGCCACTTTGTTGTATTCTTTGTAAAAGGGTGGTGGCGCACCTCGATCATATTATCTATTGATGTGATTTTGCTGGTGCTGTCTGGAGTTTATTCTTGGAGGCGTTTGACTTTTATTGAAAGGGCCCTTGACTCTTGAACAAAGCAAAGGGATTGTTGGAGTTGTTGGCTTGAAATGCTCTTTGAGGAGGTGATCGTGGGAGCTATTTTCAGTTTAGGGGATCTTAAAGCAATTCAGAATCCCATGCTCATCTATTTATTCATTGTTCTTTTGCCTCTCGGTTCTGGCATATTATTTTGGATGCTTATGGTTGGTCAATGACTTATTCATGGTTCAAAGAAGACGGTTTGGTTGACCATTATTCGAGCTTTCTTTTGGACTTTAAGGGATGAGTGCAACAAACTAGAGACTTATTTTCAGTTTCTGATAGTTTTATGGATTTGGTTCTATCTACAGCTTTGTATTTGTGCAAAAATAGGCAACCTTTTTAACACTTCAGTCTTTCTTACTTACTTTCCAATTGAAAATTTCTTTTGTAATCACCTTTAGGTGCTTATTTCATTTTATCGACGAAATGTTGTTATCTAAAAAAAGTCGTAGGGGCTTGATGGTTTCAATGTTGAATTCTTAGGAAAAAAAGTTGGAACATCTATGAAACTGACTTTTTTAAGGTGTTCCAAGAGTATTTTGAGAACAGTATTTAAGGTATTCTGAATAAGAAAGAGTGAAAGACGGATGAAACATATATTTGTTTGAGCCAGAAAAGTCATATGATAGCAATGAGTAGGAGAATTTTGGCCCCTTAGCCTAGTCACATGTCTTGATGAGGTCGTTGACTCATTGGTCATTGTTAAATTTTTATTGCAAGGTTCTCTCAATGGCTCCTGTTGAAGGAGGCAAATCCCAGACACTATCCATTTAGGTGCAAAATCAATGGATTGGTACTTTAGCAACCACAAAAAAAGGTTTATTTCGTAAAATAGACTATGTGAAGCTCATGTCATTGTGGATTGGTCCGTTTTAGAGCACTTGTGGAATAATAGGGGTTTGAGATGAAATAGAGAAAATGGATCCATGGATGCCTTTCTTCTTATGACTTCTTCCTAATTAATGGAAGGCCTAAAGGTACATTCTAGATGCTTCAAAAACGATTAGACAAGATGATCCCCTCTACACTTTTTTGTTCATTTGCATAACCGATGCCTTCCGATAATTTTATAGAAACCTGTACCAGGATAAAACGTTATTGAAGGTTTTTTTTGGGTAAGTAATTGAGGTTTCATTGTGAAAAAAGAAAGAATGTATGAGGGATTATAAAAAAAGTTCACTAAAAAGGAGGCCAAACCCCAACTAAAGAAACAAACTCCAACCTATGAGGATCAAACCAAGCTCTTAATCACAAAAAGGTCGGACAATAGATGCTCATAAAAAACATCAAACCTAGCAACCTGCCAAATCTTGTCCACCTCTCGACCTCTCTAAAAATTTTGCTATTTCTTTCCAACCGAATTGCTCACAAACCAAAAGGAACCAGACTGCCACAAAATTTTCCTTTTATCGTGAAATGGGAGATTCAAGAGCACCTCCTTAATCATGGTGCCACCCTCCCTACCATGTGCCAAATACAAGTCAAATGAACTCTAGCATTTGCTTTACAAGTAATGGGCAAACCGACACCCCACAGCCAATGGTCAATATCCTTTTCATCCTTCGTGCAAAGAATGCATCAATAGGAGCCCAACACAACACAATATCATCTTTGGATACAACTCAAGATACTAGTTTTTATATGCAGAACCTGCCACACTAATAATTTGACCTTACTGGGAATTTAACTCTTTCAAAGAGAGGAGAAGACAAAGGCATCTGGTAGGAACAAGGTTACATAAAAGAATAGAAAGAACAACACAAAGGATTAGGAGTCTAGACTCTAACGTCTCTTCTCTTTAGGCGCACTTAAGGGCTCTGAAGAATAGAAAGTAAGATTAGCGCTTCCGTAGTTTCTTTGTTAGATAGAGGCTAACAACAACCCAAGAAGCATGAGGAAAAAGGGTTAGAGGAAGGCAAAACATAAGCCATCAAGTGAAGTTTCTTGACGGAAAAGTGATAGAGATGAGGAAATAAAGTACCAAGGGGGAACTCTGCAACCCCCACCAATCCTTCCATAAGTGCACCTTAGAGCCTTCTCCAACGTGACATTTCTAAACTAGGAGATAAAGGAAACTTGAAGAAATAACTAACTAGAGGTTTTTATGCAAACCTTTAAGACCGCACATTGTACCTCACACTTAAGGGTGACTGCCATATTTGCTCATACTAGTCCTATGTCTCAAAGCATTTAGGTATGAAAAGAACCTTGCGACCATTTTGCTTGTAAAGCTTCATTAGGGAGTATCAGATCCTCAATACCTAACCCCCTCCCAAGGGTAATGGTTTTGAAACCACATCACACTTAACAAGATGGGACCCCTCTCCCCCATCAACCCTTTCCAACAAAGAACCCCCCGTCGTAGGCTTGCTCACCAAAACCAACATCCTAGACAAAGACAATAAATAACTAGCAATGCTACAAAGATCAATTGGATAAGAGTTAATCTGCTCCCTTTGGAAAAGAAGGGGCTTCCTTAAGAAACCAAACACTTTTGCACCTTTTTCACCACATGATTCGAGAAGATTGGTACTTCGAGTTACCACTCAAGGAGAACCCTAAAATTACCGCCTAAAAAGGACGAGATAATTAATGATTCCATTTAAGTTCTCGACCGGATAAGATTTATTCAACCCCTTTGGAAACGAAGGCTCTTCATCGAGAACCCAAACACTTTTGCAACTTAACTTCCTAAGATACTAGTTTGAACCTATGGTGGAGCTTTCACCCATGTTAATCTTGAGGATCAAAATTATTGAAGGTTTTAAGAACTCCTTGGCCAAACAAGGATTAGAGTTAATAACTCCGCTCCACCCGATCCAACTCCTCCCTCCGAACACCCCAAAGTTCCTTTTCTCCCATATCGCCACAAAATATTGAAAGAACCTTAGCCTTGTGTAGAGGCTAGGTTAAGTTGCAGGCACTTAACCTAGAGTAAAGATAAGCCGACTTGTCTTGCAAAAATGATAAATGCACGTAGATGCAGTGTGACTAACGATTGATAAAGTTGCATGTGTGGTAGCGCTTGTGACCTTAATACCAAACCAATTATGAGCAAAAGGATGATGCTTACTCTGTGGGTAGTTCGTGTAAACGTTCAAGATGCCATCAGGATTCAAACAAAGAGAGCTTGGGCAGAGAAGAAGAATCACATCAACAGATAGTATGATCAAGGTCTGCCACCACACTTTTGTAGAGAATACAACAAAGTGGCTAGCTGTCACACCTCCAAAGTTCAAAGCACGCGAGAAGGGCTGTGTTGCCGGGCGCGACGCACGCCTAGGGCTACAAGGGGTTAGCCAAGGCGTTACCCATGGGGGCAGTGGACATGCGATGCTGGCGCATCACGCCCATGCCCTGTGTCCAGAGTAGGCATGGCAATGCGCGCAGGGCCGAGCAGTGTGGGAAAAGGTGACACCCAAGCCATGTGCGGGGAAGCACGGTGGAGGCGCTTGCGGCACGTTCACGGGGCTGAGCAGCATGGGCAGGTGGCGCAGAGACCGCATAGGCGCGCGCAGGACGTTTGAGGTTAGCGCGGAGAGGCACGAGTAGTGCAGGGGAAATACCGCCCTGACGATGGTGCATGCAGGCGCTAGGATCGAATACGGAAGTCTGCGGGAGCGAAGGTGCGGGTGAGGCGCGCGACACGCTAGGCTGACACAAGGGTGTCGGGCGCTCACCGAGGATGGGGCACGCAGGCCACACGCACACGCGCAGTGCACGGAAATGTGTGGGCCATGCGCCTCCCTGGTTTGCATCCGTGCATGGTGATGCACGTCGATGGCCCTTCGGCAGGCTCCTCAAGGGCGTCTCCTACGACCACATGGCCGGCCAAAATATGTGGGTAGTGGGCGCTCAACTGTAGGAGGCCGAAGGATGATCTCCCCTATTTTTGGATAGTGTGGAACGTGCCCCACAAAAGTCGGGTTGTAGATGCTAAATAGGAGTAGTTGATAGTGAGGGAGAAAATGTTGGAGTAGTGGCCATGTTCTCTCCTAAAAAGTAGGAGTTAGTGGGGAAATGGGAGAGTGGCCCTATTTTGTAGGGTAGTGGGCTGTTACACCCCTACTAAGGGCCACTCCTCCACAAATGGTGCCTACATTCTCGCCCCTCCCCTCCTATATAAGGTGAACCTTGAGAGTTGTTCAAGGGCACCGAAAGTAGCCAACACATCTCATGTTGCCAAAGATTTGGAGTGAGAGCTTCTCTTTGTGTATAGTGTTTGTTTGAGTGCCTCTTGTCAAAACGAGCTCCAAGCTCCACCCTCTGTATTTAGTGGTTTTGATATAATAAAAAGTTATATTTCCACTCAAGCACTTGTGTGTCTTCCTTTAGCCATCAAGAAGAGTGTCCTAGGCGAAAGTCTCAGAGAGAGATTTGGAAGGCGCTCTAGGCTCCCTCGAGAACGAAAAGGTCGAGTCGTTAGGCTGGCCAAGGGAGTACCTTGGCGCCGCACGGGGTCGATCCGCGAAAAGAATATATACTCATATACATCCCACGCTTCCGCCTTTCACCCTTGTGACACTAGACATGGTTGAAGTAGAATCATGGCATCACAGAGATGAAGATTGCTCCTTTTTCCTTTTTCCTTATTCAATGGCCATTAATACTTTCTCAATAACTATGGGTGGCAGCTCGCACAACTCAATTGCCACCCATTGACTTGGTAATTGAATTAACAACCTGGATATAAAATGCTGGTGGAAACTCCTCTTCTTCTTTTTTTTTCTTTCAGATAATGGTGGAAGACTTTCTTATGCTCAAATATTGGTATTTAAGTGTATTATCTGTCACTACGTATTATTATAATATTAATTTGGCTCATATATTTTTTGCTACTTTTATTTTTCATGGAACCCTTTCCAGCTGTCAGGAATGCATTGCTCATAAGATAAAAAAAGTCTAAATCTGCACAAATTTCAAGTGAAAATTTGTACAATTTGAAATTGTGCAGGCCGCTCGAGCATTGGCTAACTTAGCTGCCCATGGGGATAGCAACACAAACAACTCTGCTGTTGGACAAGAGGCAGGTGCACTTGAAGCACTTGTTCAACTTACACATTCTCCTCATGAAGGCGTCAGGTAGTTTTACAACGACTCAATATTTTTCTTTTTTATTGTTGGTTTTATTATTTATGTGTAGAATATCATCCAATCCTGTTCAAATTTTTCTTCATTTAGTCTTTACGCCATTGGATGCCAAAATGTTGAAAATAGACTATAGCATAACTTGCTAGAGAATGAAATGACATCTAAAGATAAATTCTATTGATTTTGACCATAAATGTGATATAGTAGGATTAGAGTAACGTAGGGTATCTTGATCCTTGATTGATTAGGATTAAGACTAGTTTCCTTGGTCTATTAGGACTAGAATTAGTTTCTTTTATTAATTATTAATTAGGATTAGAACTAGTTTGTTTTCCAATTCTCTCTATAAATAGAGTAATTGTCTTCTTGGATTGATAACTTTTTATTCATAATAAAGACTTCGATTTTATTATTGGAGATTTCTCTCCTTTTATCTTGTAGGCTACATCAGAATGTCACACATGCTGATGCAAATCTTTTAAGTCTCAATCTTTCTGGATGTCTTTTTGGTTTTCTTTTGTATTGTTAGGATCAGCTTGTTCCATTCTTGGGCAATCTTAGTGTTTTTTGAATTAAGTTTTAGTTTTAAGGTTCATTGTATTTGATTCTTGAGAGATCTTTATTTCTTTGTGTTTTTCAGTTGGTTGTAACTTGCAACCTGCCCTGCCGGTAAAGTCCCTATTTAAGCTGGCTGTTAACACTTTTGGAGGTTAATGAATGATATTCTTTAGAGTCTTATTTGAGTTTCTATCACATGCACTTAGGGGATAAATCCAGGTTGTCCGGTAAGTTTAAGTATTGATCTTGTTGCGAGTAAAGGATTAGTGAAAGAACTTCAGTTGTACATATATTTGTGGGAAAATTCTCCTAACTTGTAGATATTTAATTTTAACGAAATTAGGACAGAAAGCTTTTGACATTGGAACTGAAATAAAATGTTCATAAGGTGGGTCTTCCGCAAGGTTGTTAAAAACATACGATTCATGATGATACGTATCAAAACTAAGGCAAAAGGATTCAACTTGTAGGTATGAATCACAAATTTCATGTATCGCATGATTCACGTTTCTGAATTATGTGAATCGTGCTAAATCGGAACGTTCCGATTCACTTTTTAAAAAAAGTTTCCATTTTCGAGTTGCATGGCACCGGAGAAGATGAGTTTTCTTTCCCGAACTGTGTTCTCCTTCTCTTCAAGTTGCAGTGCGCCAGACTCTAGAGAAGAAGGTTCTGATTTTTGAACCGTGAAGAAAGATGAAAACTTCTTGCCAGAGATGAAGCTCTACACAAAGTGTTCTCTAATCTCCTTCTCTTCGAGTTGCAGGGTGTTGGAATGGAGAAGAAGACTCTGGTTTTTGGACTTCGATTGAAGAAATATGGATTGATGGAAGGGAAAACGAAGAGAGAAACTAGATAGAGACGTTTAGCATAGAGATTTGAAGAAGAGAGAGATTGGTGGAATGGAAAAGAAGAAATGAAGAGACTAGAGAGAAACTTTTTTTCTTTTTTTTTCTTTTTTTTAAATAATATAAACTTTTCATATTCTCTTTTTTGAATAATAAAAATAAGGGATTCCAAGGCATTAATTATTTTCTTTCCGTTTTCATGATCTCCACGTGCCTTTTCTCTCTCTCTCTCTTTTTCAAAATTTTTTTCCATACTTATTTTGAAGTTAATTGGATGCTTTATTTTCTTATCTATTTTGAACTTTGAAGTTAGTTTATGTTTTTAATTTTTATATATATTTTAGATTAATTTAGTGTATGACTATATATTTCTCTATTTTGAGTTTTCATATCTATATTTATATTTATATGTTATCTACAAATTAAAACTCATGCTGTACGATACGTTACGATACACGATACACTAAAATGAAATTTCGATATATGTTCTGTTTCACGAGTTAACAACCTTGGTCTTTCGTATCTTCCAACTGTATGGACACTCTCGGTGCGAGAAACTGTTGAAACTTGGATAAAGTGTATAGTAAGGTTTTAGTCCAACATATTATTTTTGTCTCATGGAGCAGGCAAGAGGCTGCTGGTGCCCTATGGAATTTATCATTTGATGACAGAAATAGAGAAGCAATTGCAGCTGCAGGTGGTGTTGAGGCATTGGTAAGATCATTAACCCTTCACTTTAAATTGTTCATTATTGTAAAAGCTGACTGGACTCTTGTACAACGTTTGAATCTTTTTTTTTTTTTTTTTTTTTTTTTTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGATAAGAGACAATTTCATTGAAAAAATGAAATATCCGAAGATATCCAAGGGTAAAAGGAACCAATCATATGGTCAAAAGTTGTTTCTAATTTGCAATGAGGAAAGAAAAACTATGGCAATTTATAGCTGGTATTTGGGTAGTTAAAGAATAATCCAACCCTGTCTACCTGCTGTTGAATCACCCGGACTCATTATTGTAGAGGTTGACTGGATGTATCTTACACTGTTGAACTTTTGCCTGCATGAGATCTTAAAAAAACCATTCTTCATGAGGTGTAGGTTTTGATGATTAAATGAGAGTACATTTTGTTTTAGTAAACCGATGAAAAAGGTTTGGAATTGGTATTCTCTTGAACTCTAATTGGAGAAATGTTTCTTCTGTGTTGGAATACGTATCTTCTTTATTATGTATGCATTAGATTTTATTTTTGATAAGAAATAACAATTTCATTATCAATAAGGGAAGGTACGAAAAGGAGATAAGGGGATTACAAAAAGCTTCCCAATCGGCATAAATTTGAAGAAAAGGAGAAACTACAAAAGTTACTTGAGTTGGCACACCAACCAGAAGCAAAAGTTCTAACACTCTCCCAATATGATTCAAAGCTACTCTCTTCTCCGTGGATTAGCCTATTGTTTTTTTCAAACCAAATTCTACACAAGAGATCTGCCTCGTTTAGCCAGTCTTTGCTTTGCCTTAGATCTACCCCACAAATCAGCTGGAACATGGCCTCTGGGAAATTAATATGTTAGGCATTCTCCTTTGTGGGGCTAAGCTTGCCGTGAGTCAAAATCAATAAGGCACTTTTTTTTTTTTTTTTTTGGAATTTTGGCTCTCCAATGACTTGCATAGTGGGGAAGTTCTTTGCTAGTTGAGAGTTGGTTGACCACAGATTTGACTGTAAGAGATCACTTCCTTCTAGCTTCCATACTCTTTGATCCTCTTGGTTTTTGACCGTGAAGTTATCCCAAATATCAGAAAGCTGAGCCCACTCCTCCACCTCTTTCTCTTCAAACTCCTTCTAAGCCGAAGATCCCACCTTGTCACACCGTCTAACTTGCCTGGGAAATAGATAACCCTTAGAGTTGGCGGGTACAATCTTGGGAATAAAAGCTGGAATGAAGAGTTTGAAATATAATATTTTTATTGTCTATTTCTAGGCAGAGTTATGTGCATTTGATATCAAATGAATTGTTTTTCTTTTTTCTTTTCTTTTGATTGAAAAGGAGACCACCATTGAGGGGAATGAAAGTGCAAAAAGAAAAGCTAGAGGAGACAACGAGAGATGCTTTCTGAGTAATTAGTAGACATGCTTTTGTTATGCTCAACTGCTCAAGTGTTTGGGTTCAAGTCAGGAAGGTTCTTAAACTTATGATGGGAACTACTTTCCGTTAAAAAAATAGAAATAAGAATAATGGAAGTGCAAACAAAAATTGTTAAATGTACAAAACTCCAAATCTTTTATAGATATTTTTTTCTTCCTTTGTGATAAAAGACCAATTTTCATGGTGAAAAAAGGAAACAATACACGGGCATATTGAAAACATGGTCCTCAGAAGGGAACACCCTAAAGAAAGGGACTCCAATCAAGCAATGTAAGACCTAGGAAATTACAAAATAAACTTGTCACCAATGCCCAAAGACGCATTAAACCTCCCAGGAGATCAGACATCGATGGCTTCCCTCTCAGTCCCTCTAAAAATTCTATTATTTCTCTCATTCAAATATCCCACAACAATGTGCACAACTTGCTTTCCATAAAAGTCTACTTGGCTTTCCACAAAAATCTCCCTTTCTGTCGGAACGGTGGTTGGAGGGAGCTTACTATGCTCCAACTAAAGGATCTTAAGAAGTGGCAAAAAATCATTTTTGCCATAAAATCGTTTCATTGATATAATGAAAAGAGACTAATGCTCAAAATACATAGAATCAGTTTTGTTTCCTATCAAAAAAGAAATTCTGCTAACTGTTGAAGGAAGGAACTCTCAGCTATTAGTGTTTTCCTCCTTAAAAAAAACCCTCAACCCTATATTTGGTCTCAGAGCTCATTTAGGTGAGGTAATAGAAACTATTCTATATACTATGAGAAGAAAGCAGTGATGGAGAAACAGAAGTTGGCAATGGTCCTCTGAATTATATTCTGCAGAGGATGTATTGTTTTGGTGGAGGCGTAAAAGCACCTTCCTCACATATAATGTTTGTGGGGTCTAACCCTTCTATTATGCTTTTGGCTCACGCGTACCGTTTTCCGAAGGACCTGCCCCCTCTAAAGTCAATTCTGGCATATAAAGTTAGAAGAGTTAATGCTTATGGTAGCCACTAACCTTGTAGTAGCTTTGAACATTCCAGCTCAGCAAGGTTAATTTGGATATTAGGAATGCAATCAAACCTGAAAGACTCTCATCATTATGCTCTGTTTTCTTAATTGAAGTCAACATAGATTAGAGTGTATTGCTTTTCCAACTGTCTATGCCCTTAAATACTAGTTTTCCAATCGAGATATGAACCATTACAGAAAAACAAGCAAAGTTGGATACTATACTTGACTGGAATATGTTATATGATGTAAGCTCACCCATCGTTTGCATGTACGTTTTTATTTATTTATTTTCTTTTAATTTTAGGTTGCTCTAGCACAATCTTGTTCAAATGCATCCCCGGGTCTTCAGGAAAGGGCTGCTGGTGCTCTGTGGGGATTGTCAGTTTCCGAAGCCAACAGGTATAAAATTTGAGGTTCGGAGATGGTATCACCTAATACGTTGCAGAGGTAGCAAGCCAATAAAATTTGTTAATTTTACTGCAAGGAAGATTTTTATATTTAAAAAAATAGCTGTAGGGATGAATGCGTCAGTCAACTATTCTCTATGAAGAGTGCCGGTGATGTAGAGGATCTCTATAGGATATGCCTTGTATGTTTAGTAAAATTCCCATGATAAAGGAATTTGTGGAAAATTGGCATGAAGACTTCCTCGATTATAAATGTCTTGCTTGTTCAGAACCAGACAAGGGAAATTGTCTTTTTTCCTTTACAAGAATCTTTGTTTGGTAGTTTGAAAGACAATGATAGGAGTTGATTAATTACTTAATGTAGGGGATCATGACCGTTCAAGAAAGATTGCTTTTCCCTTTAGAAAAATTTGTGTCCGTAGTTTTTCTTCTAAAGGTTTTGATTTAAAGTAATACTATCCCTGCTATTTATCCCTGCTATTCAACTTACTTTTTCTTCTTTCACATTACTTTCACTGAAGAGTAACCATTATTTTTCTACGTGAAGGTCAGGTTCAACTAACCTTTTTTCTTTTGGGTTTTTTTAGCATCGCTATTGGTCAGCAAGGGGGCGTTGCACCATTAATTGCTTTGGCACGTTCAGATGCTGAAGTAAGGATTTCAAATTATTCTTGAGTTTCTCGATAAGGAGCACTAAAATTTGATTGCGAAATATGTGTGAATGTTCAACTGAACTACATATTTACCATTTATACACTCAACTGTTTAACATTATTTATTTTCAACAGGATGTTCACGAGACTGCTGCTGGAGCTCTTTGGAATCTCGCATTCAACCCTGGTAATGCCCTTCGTATAGTTGAAGAAGGGGGTGTTCCAGCCCTAGTTCATCTTTGTTATGCATCAGTATCAAAAATGGCACGCTTCATGGCTGCTTTGGCATTGGCTTACATGTTTGATGGGAGGTACATGAAATGACTTTACTAGACCTACTGTCATGCTGCCTTCATTTTTCTTCATAAAAGAATTAACTTTCATTGATTCTATAACGTTTTAGAGTCTTCAACCTTTTGCAGGATGGATGAATGTGCCTTGCCAGGAAGCTCATCAGAAGGCATTTCCAAGAGTGTGAGCTTAGATGGGGCTAGAAGGATGGCATTAAAGAACATTGAAGCATTTGTCCAGACATTTTCAGATCCACAAGCATTTGCCTCTGCTGCTGCTTCCTCGGCACCTGCAGCATTGGTGCAAGTAACAGAACGAGCTCGTATTCAAGAAGCGGGCCATCTGCGATGCAGGTTTACTCGAACCTCTTGTATTCTCATGCCCATTTAATTTAACATGCTCAGGTTTTTATGCCTATGAATGTTGGGTAATCAATTATCTTCAGCAGAACCTTGTTGATAACAAAATATGCATTGCCTGTATTTCCCTTCAGCATATGATGGATGTCTATTCTAGTCATGCTTATATTTCTTTAGTGAATCGATTAAAATTTTCGTCTAAATTCTTTTACAAAATATTCAATGTACTTTGTGTTGTAAAAGGTTATCCTTTTGGCTGTTTGGTTTTTGTATGCAAAAATGCTCTAGTCAGGGAGGAATGTTGGTTGTGGGGCAAGGAAATTGTCTTAAAATTTTATTAATCTTACTCTTGTTTTCTTATGTCTTACTTTTTCATAACTGTTAGTGGAGCTGAAATTGGAAGATTTGTTGCAATGCTTCGAAATCCATCACCTACGCTAAAAGCATGTGCAGCTTTCGCTCTTCTACAGGCAAGTGCACAGAAGTTCATAGTGTTTTTCCTGATTTTATAAGCTACCCTTAGTTTTGGCACAAATTTTCTTCAATACATGTATTCTTCATATGCAGTTTACTATCCCGGGGGGTCGGCACGCCTTACACCATGCAAGCCTTATGCAGAATGCAGGAGCATCAAGAGCCCTGCGTACTGCAGCTGCAGCAGCAACTGCGCCATTACAAGCGAAAATCTTCGCTAGAATTGTTCTTAGAAATCTAGAGCACCACAGCATTGAATCTTCCCTTTAAAGACAAATGCAACATAAATTTGCAACAGAAGGTGAGTTCTTGTTCAACTCAACTCATGGAGCTTAAACGAGCTGCATGGCATGCCCGAACCAGGTGCTCATGTAAATGCCCCATTAATCATATACCGAGTTTCTGATGCCAAGTAGTACACAAACTATATGGTTTGACTTCTCAGCTTCATCGTTACTTCCTGGTCTATTACCTTTATCAGAAGCGTCAAAAGGTCGTTTTTCGAAGCATTTTCTAATTTTTCTTCGTGTACTTGCAGTCTGCATTCTATCAGAGTTCTTATATACTGATGGTACTTTATGCAGGGAGTCGATAAGAATCACATTACCTTGAAACCTTTTTCTTCTTTTCCTTCTTTTGGGTTAGCCATTTGTTTTGGTTGTTAATATTGATCCTGGTGGTATTTTCCCTGTAATGATAGAAGTTCTGAATATGCTAAAGATTGTACAATGTTTAGCTCTCATTTTTGCAGTTGGACTGGTTGAAGATGAATTAAGGATGTTTAGCTATCGAACTTCATTTAATTTCATGTTTATCCAGTTGAGGGTGAAGTTGTTTAATAAATGGTGATTTGAATGTACTGAGATACAAAATCTAGGTTCAAATCAAATATGTACACATTGTTAAAGGTATTAACTAAACCATTACTTGCAAAAACATATTTTTAAGATTTTGTTTATGATCTTTCATTACATGAATATAACTTTTGTCCATTAACTATTATTCACTTCATGGATGAACTCTGCAGGGAAGCACCTTATTGGATTCTCACTTAGTTTTGTGTAAGTATTGGATTTTCACTCTTGCTTGCGCCTTTCACCACCACTACTATTCTTTTGTTGTTATTGAGTTCAATGATTGTAGTATGGAAAATTAAATCATGGATCTTTTATGATCATAAATGATAATGATGTTTTATTTATTAAATTATGCTCGAATTGAGACAACTTGTATTATTAAAACATATATTTTTATTTTAAAAAAAAAGTTTTTACGAATTTGAGTACCACTCAAAATATTATAAGAGATCTAAATTTTTGTTACACATTGTTGAATTAAAGAAAACAAAATTGGAATTAGAATTGTGGAATGTGCATACCTTCAACTTGACACATCAAAGAAAACTAAGAGATAAATTAATTTATATTTTAGCTTAATTTAAATGGTTAAATTGTACCTAAAAAAGGGTTAAAATTAAGATATTGATTATCTCCACTACTTATAGGTTTAAATTTAATTTGTTGATTTAAAGCTACGACTTCCACCTTTTAGACCTTTAAATTTAATTTGTTGATTTAAAGCTACGACTTCCACCTTCCTCTAAAAGTTGAAAGTTTGATCTTTTATAAACTGTTGTTTTTTAAAAAAAAAATCTTGAATTGAAAAAATAAGTTAAGAAAGATCATAGTTTGCTTATTTAGTTGTTGTTTTTTTAAACAAAATCTTGAATTGAAAAAATAAGTTAAGAAAGATCATAGTTTGCTTATTTAGTAGTTGGGTTGAGGAGAAAGAGTTGTTTGCCTTCATGAGTTGAGTAAACAAATCAAAAGCTTGATCACACTAATATAACAAGACGCATGTGTCGAGATCATCTCGGTTTTTGATGTAGCTGTGCAAATAATTGGATTTGAATTTAGTGGTGGTGTGACAAATTAAGGTAATATTATCAGAATCGGAATGAATTTTGTTGTGGATCCGGGACAGGTTTTACATGGAAATCGTAACAAAACAATTAACCAAGGATGGAAAATAAAAATAATAATAATAAAAAGTGGAAAATCATGTGAAAAAACCGGACGGACGGTGGACCCCACGATTAAAAACATGATGGGGTCCATTTTATGCATCATAAAACTATAAATGAAACCAAAATTGGCGGTGTTTCTGTTTTGTTCTGGCGTAAAACGTTCCCACGTTCTGACTCTACAGTTTTCTCTCAATTCCGTTCACTTTTCTCCTCTTTCGCTTCCTGTTCCGTTCACTTTACCGGATCTGACTTCCCTCAGTCATCGCCCCACTATCCCGACTCTGCCTTGCCCTTGCCCTTCGATTTGCTCGCTGATCGACGACAAGAATCCTGCGTAATCAATCGACCCCCAAACCCAGGTTTGGTTCTTTTCAATTCTGATGAAAACCCAGACGCGGTTTCTCTTCACTCGTCCAAAATTATGAACTGGGAAGAAGAGGGGTTTCTATTTGGCTGTTGAAAGTATTCGATTGTCGTAACAAGTTAAGTCTGTATAGGACAGTACCGCTTTCCCGGAATTTTGGTAATTTGGATGCTCGGAAAATTCGTATTAGTTTTAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTTGTTTTTTGCGTTCTTCTTTTTCTGTTTTTTTTTATTTTTCGAAAATTTTTTTTTTTAACCCTTAATTTTTTATTAAAAAAAAGTTTTTTTTTTTTTTTCTTGTTTTTTGCGTTCTTCTTTTTCTGTTTTTTTTTATTTTTCGAAAATTTTTTTTTTTAACCCTTAATTTTTTATTAAAAAAAAGTTTTTTTTTTTTTTTCTTGTTTTTTGCGTTCTTCTTTTTCTGTTTTTTTTTATTTTTCGAAAATTTTTTTTTTTAACCCTTAATTTTTTATTAAAAAAAAGTTTTTTTTTTTTTTTCTTGTTTTTTGCGTTCTTCTTTTTCTGTTTTTTTTTATTTTTCGAAAATTTTTTTTTTTAACCCTTAATTTTTTATTAAAAAAAAGTTTTTTTTTTTTTTTCTTGTTTTTTGCGTTCTTCTTTTTCTGTTTTTTTTTATTTTTCGAAAATTTTTTTTTTTAACCCTTAATTTTTTATTAAAAAAAAGTTTTTTTTTTTTTTTCTTGTTTTTTGCGTTCTTCTTTTTCTGTTTTTTTTTATTTTTCGAAAATTTTTTTTTTTAACCCTTAATTTTTTATTAAAAAAAAGTTTTTTTTTTTTTTTAATTTTGAATTTTCTAATCGGAAACGGAATATTCTATTTGTTGAAGACTCTCTGTGGCGAAAAATCCTTTTTGGAGCGTTCCCTTTGACATGAAATTGACTGTTTTTGTGATCGGAGTGTAATGGAAGTTGTCCCTTTTCACTTCCCACTTTCTTGTTGGTGAAATCGTTTTCTTTGCCCATTATTTTCTTCCCGGAGGTTTTTTGTGTTTTGTGCTACCGTTCACTAATTGGGCCCTTTTATTTTTGTCGATATAATCTGTGTTTTTCCCTTTGCTTTCCGAGAAGAACGATTGATAGTGAGGTCAAGACTACTTTCTTCATCTTCTTTTTTTTGAATGAAAGTGGGCAGGTAATCTTTTCTTAATTTCGGGCGCTCATGGATTTCAAACAAGCAATAACGAATTCTTACTATTTTTGGCTCTTATTACAAAAGAAAAGAAGGAATTCTTACTATTCTGGTCACCAAGTGAATTCTTAAGTGAATTCCTTCTTTCTCTATTTTCAACCTATGGATAACATCTTCTTTGTGGAACTGAACACAGATGTAGGCTATTAAGAATAAATTATTGGACCTCCAATTTAGGTTTTCCTTTATGCTATTCTCATTTGGAAAGCTTGAAATATTATTGTGTACTTAAGTTACTTGTGTTAAGCTCTGAATAAGTATTGGGTATGTGTATTGTATTCTGATGCTGTTCATGTGTTTAGATGTTATTGCCCGATTGCACTCAAGAGTGCCTAAATTCGGGCCTATACTAGCATGGGACTTGAAAATGAATCAAAAGTGAAGGAGGAAGCTTTGGTGGAGGTTTCAAGGTGTGTGGAGGACGGAAATGCTGCTCAAGCTAAGCAGAGTGCAAGTAGTTGTCAGGAAAACATTCATGACATGGAAGCTTCATCCTTTGAACGAAGTACGATGTTAGGTAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGAGAATTGTGAGGGAGGTCCTAGTAATGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGAGATACTGTTTCTGGGACAGATTATGGTTTGCTATTAGATGATGAAGAAGTCGAATCCCAATTATATGGTGATAATAATTTGCAGCCTATGTCTAATGGATACAGCGAATTATTTCCAAGGTAGTACATGTATCTTTTCTTTTTCTTTTTGCCGTTGCTGCTTTGTTTTTTTCTTCTTTTCCCCCACTTTTTCTTTATACTTAATTGCGTGCTATTGGTGGGTGAACAATTTGATTTTTTTTTTTCTTTTCCAAAACGAAATGCATAGAATTGTAAGGGGAAAGTGGAACTGCTTTAGTTACTGTTTGGACTTGGAGCCTTCATCAAAATAATGCCTCATAATCATAGCTTCACGAGGAATAATAGGATTTCTAGAGAGATTGAGAGATCTGGTGAGGAGGTTTGGGAGGGAGCCTAGTTTAATGCCTTCTTGTGTGCACTTGTTTATAGACTCTTCTGTAATTATGATCTTGGCTTGAAGCCTTTTTTGTAGTTAGCTTGGGGCTTCCTCTGCTTTTTTTGTGAGCTGTCTTTCTTATATGCCCATTTTCTTTCACTACTTTGTGAAATGTTGGTCTCGGTTACTTAAAAAAAAAAAAAAAGAAAAAAAAAGTAATAAATAAATGGAAAATTATGCAACCATCTTATAATGAACAAACATGATCTTTAGGGGGTGTGGAGTGTTATATGGAGGAGGTTTGATCCTTAGCTAGATTTAATAGTCACTTTGGGGTGTTGTGTTTATTTTTATTTTATTATTATTTTTTCGATATCTGTGGGTGCTTGGGTTAGTTCACGCGCACCTCGACTAATCTTATAAAACAATTTGCTTGACCCTACATGACCTATGTTTAAAGATTTTTGTAACTATCCATTAGGTTTTATTCTCTTAGACTGGATCTTTTTTCTTGTTCATTGTTCCCTGTTTGGCTTCTTTTGGCTGTTTGTTTTTATAGGCCGTTTTTGTTTTTTGGCCATTGTTGACCGTCTTGTATTCTTTCATATAGTTTGGTTTTCTTTAAAAAAAGAAAAAAAATATATATTTTATTATTATTAAAATAAAATGTAATGAACAAGGCAATAATGTCAAAATGAACCTATGATTAACACAAAGAAGATTGCAAATATCATATCTTGAGTTTCTTTGCTCTAAAACAATATATCCTATCTTATATACTGGTTGCGTTGTTATTGACTTGGTTGATGGTCAAATTTTAAACCATGTCAAGTCAAATCAGGGTGTGTTGAATTTTGTCTTTTTAAATTTTTAACTCCTTGAAATATGTATAGTTGGACTCAACGAGCCTTTTCTATCTTGGTTAAGTTTTGAACCATGGGTTAGGGCACACTTGCCCATTCCTAAAAATGTTCAAGACGTCATATGTTGTCACTTTTCATGGCCGACATTGTGAGAACAAGGCAACAATTTTATGACTAATACAATCAAGTTTATATATTCCTCTATTATGATATCTTCTACCTCAATCTATGATATTCACGTGTTTATACATGCATCTTACTTTCTAGGAAGAAAAAGTTGACAGCTCACTGGAGGAAGTTTATAAGTCCTCTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCGTTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTATCTACGAACACTTCTCAACCGAAGGTTTTGATGTAAAGTCAACAGGATTCTCAAGTCACACTCAAAAACACAGGGTTATGAAAAGAAAGGGAAGGAAGAAAGTTGAAGAGACTACTGATGTAGCTTCATATATGGCACATCATAATCTGTTCTCCTATTATGGTATCAAAGCCTCAACAATTTCTATCCAATACCTTTTTTTCCTAGTGTCTGTCTCTGTCTATGACATTGCAGTCTTCTTTTTCCATTCCAGAGAAGAAGAGGTCTCTTGCTGATGACATGTCTTTGGAAGATACTTTCCTTAAATTAGGTAAGTGAAGTGGATCGTAGTATTTACTTTGGTTGATTGCATTAAATTGTACATATGGTATACTAAATAGGAGTTTGCTGCAGATTAATAATAGCCGCAGAATTGAAAGTCATTTCATATCTGACTATTGGATGAATTAGTTCTGTACTTGATATTTTACGATTAGAAGCTATAATTGTTGGTTTTCTTAGGGTATGTGATACTGTTATCCCTTTGTTTTTTTTCCATACGTAGGTTTGACATTGTTGATCCTCATGAATATACCAACAGTCCTTGTGGAGATTCTTTCAAGAGAATATAACAAAAACTTGAACCTCCCAATTATTTCCCCTAGATCTCTTGACCCTTAAAAAGTTCCTGCATTTTCTCTCAGATGATAGTTTTAACAACAATGAGCCATAAAATCTTCTTCTTCTTCTCCTTTTTTTTTTTTTTTCTTTTTCCTTGGGATGGCCACAAAGAGTCTCAATTCAGAAATTCTTTGGAGTTTAGAGTGTTGGGAAGGACAAGGTTAGTTCCAAGGCAGAGTAGTATTAGGAATATTAGTATGGAAATTTTAAGAATATCAATAGGATACTTGGGACGGAGGCATCTTTGTCATCAATAAGAGTTGGTTAATGAAAGATTTTGGAGGTAGGGAGAATTTGGAGTGAAGTAATTTTAGGAGATGAACTTTCTACCTATTTCAATTGCAAAGGAGAAATACTCAAAACACCATATTGGTGGATACAAGCAAGATCAATGGGATAAAAAATAATATAATAAGCTATAAAAAGGAATGGGTGTTTATTGTACACCAAAAGAAAGCAGAAGACACATATCCATAAAACACTCCAAGGAAGAGGAAACATTTCTGAAAAATACGACCATTTCATTTACCCCAAAGATGCCAGAACAACGCTCAGATTTTGGCCAACCATAACATTTTCTTCATACCGCAAAAAGGCTGGCCCAGCAAAACCAACACCAAGGCCACCCATCAATTATGGAAGCCAATAAACATTATGGACACCACCATCACCACTAGAGAACCCTCCCAGAAAAATGATAAGTAATATGTTGGCAAACTGGCAAGAGAAGCTTGTGTTGAATTCGCTCAATACCAGGTTTCCAGCAGGAAACTGTCTTAGGACTACACCACTCAAAGGAAGACCAAGGTATAGATGGGCCAGTGACCCTATTTACAACCAAAATTAGATAACATCCAATCCATCTTCCAATCAGCAATATGAATCCCAAGGAGTTCACTTTTGGAAAGGTTGATCTTCAGTCAAAAGGTCTGTTAGTAAATATGAACAATCTCGAAAAGATGTTTTAAAGCAACCCAATCAGCCATGGAGAACCAAAGAGCGTCACCAACAAATTGTAGATGATTCAAATAAATAAAGGATGCACCAACAGGATGAGTCACAATATTACCCAAATCATCACTATGGGCTAGAAGATGACTGAGGCAATTAGCAACTATAATAAATAAAAAGGCGGGAGAGAATCACCTTGACGGATGCCCCCCTAGATCTAACATCAGTAATAATAGAGTAATTGACACGTGATACACAACCTCCTATCCAAGATTGCCAAAGAAAGCAAAACCTTGCCTGAAAACAACATCCAAAAATTCCAATCAACTGTGTCGAAAGTCTTGTCAAGATCCAGCTTGAGAATTACACCCTCTTGTGGGAGACATTGCAATAATCAATCAGCATGTTCACCATTGAAGAAGCATCCAAAAATTGTCTATTAGGAACAAAGGCTAACTGGTTATCAGCAATGGTAAAAGCAAGAACCTCCTTTAAACGATTAGAGAAATCCTGAGTAATGATCTTTTTAAGCACAAGGAATGAGGTTGATAATCAACAACAGGTTTAGGTCAAATCCACAGAATAGTCAAGGTTGTCCGTTAGTCCCTATTTTTGAGGATGATAAACTTTACGGGCATGGTGATGGTGCAAATAAAACATTCTTCCATACGATATCTCCTGACAGGTATTATCTCTGCCAACAGAATTTGGAGACTTAGACCATGTCGTTACTTCCAAATACTGGTGTTATGATGTGTTGAATGCGTATAGTCTCTCCTGAAGCTTACCCAAGAATGTTAATGATTTTGTTGAAAACTTGGGTGGGATTCCCTTGACATGAAAAACTCAAACCATCTCAAAATGTGAGCTGTGATGCCATTTCTTTGACTCTGGTTAGGAAGAGACCATAGGGCTATCAGAAGTTCCCTTTTTTTCCTTTTTGTGGCACCTCATTCTGGTCTGATCTATTCAAAAGTTCTAACCTTTGCAGGGTCTTTCCTAGGAGTAGTTCAGATGCATTTTTCTCACTCATCTCTGGGACTTTCTTTGGTGGAAATACTAGAATCCTTTGGTCCAAATGTGGTGCACTCACCATCATGACATTTTTGGAGATTTAGCTATACTTCACTCTTCTACTTGGTGTTTTCTCTATGAAGACTGAAGAGTTTATGGAGATTTATAATTATGGTTTGACGCATATTTCTGCCCATTGAGAGTCTTCTTTAAAATCCTCTTGGCTTTTTAGGGACATGTCATCTCCTTTTTGTAAATGGTCAATACTATGGTTTTGTTTCCTAGAACAGAAAAGAAAAGAAAATAGATGTAGTGTTTGTAACAACCTTAAGGATCCTATTCGTCTTTTTTGTGCTTCACAGCTTACACTTTTGAACTTTTCAGTAACCATGCCTTCTTTCTTTTTGCTAATTAGAGTAAGATTCATGAGTTTCTACAATGTATAAACAAACAATTCTGTTTCTTATACTTAAAAAGAATAGCCAGATATAGGACTTTAAGTTTGAGAAGACTATCTATGTTTGAAGAGCTCCTGGCTTACAGTAGTCAGTTGTAGGCCATTGATTCTCTGCCCTGATATCTACAGACAAAACAAGGAATATAAAACGTGAAGACATCAATGACTTCGGAACAATGGCAACTGATGGATGGGCATCCTCCATGTTGGGAGATAACGATAATAATTTGGAGGACGTTTTTCTAAAAATTGAAGCTGCACAGTTGAAAGTTCATGAGTTGAAGAACAGAATTGATAAGGTGGTGAATGAAAATCCCATGAAGTTCTCTGCAATCAATCAGCTATACTCTCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGATGGAAATGATGAGTTAGTTAGGTCTTTGCATGAAGCATCGCAACACATATCTGAGAATGCCTTAGATGTACTTATGCCTGAAACCGCAATTAAAACTCATGGAGAGGTCATGCTACTTCCTGATATTACTCAGAGCGCAGATTGTGGAACTGTAAGTTCTCTACTGCTATATGCATTTTTTTTTTTTATTGAAATAAAAATCTTTTCACTGAATTTATGAAAAACAAAAACAAAAGTCAACTCCAATGGGAGTGAAAAGGACTAAAAACCATAAAGCACACACAAAACTATAATGGAGACATAAAGACCTTCCAATTCATATTTAATTTCTTGCGAAGAAAACTCGGAAAGAGAACCACCATTGAGAAGTTTTCCATCTAGCCGTCTCAAACCGATCAAACCATTTAGTTTCCTTGTTCTCAAAAACTCCTTGATTTTGTTCCAACCAAACTTATGTAATAATAGCTTGGCTACTGTGTTGCAACTGTTATTATATTAAAAATTGGCAGTTCTGAACATTGGAACTCACCTGTATGTTTGCAGACTCAAAAACTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGTTGCAACTTTCCGAAGAGGTTAAAGGTCAGTTGATTGAGCCTCAAAAATTAGAGCAGAAAATCATTTCTCTAGCTGCAGTTTCTCAAGTTGACTTAACCTCAAAGGACGACATGCTGCACAAAACAAAATCCCCTTCTGCTGTCAAGCCTAATTCATCTAAGAAAACAAGAAAGCGAGGAAGGCGAAAAATCGGTTTGAGTAAGAAGAATAGGAAAGCAACAGGTTAGCGGAATAATTGGAACTAGAAGAAGGTGGTTATTTTGCTTCTCATTATGTGGTTGATGGAGGTTAGAATTTTGTCAATGGTTCTTCGTTCTACTTCTAATTGTATATTTTCCATGTCTTGCTCTCATCTGCACATATAAACTCAGATTATTTGTACATGCTGCAAATATTCATTAATTATGTTAATATGAAGATAGCATTAAATCACGATCGATAGGTGTAGTTGAGGAAATTTCAGGTACCACGGGAGTGTGATACTCCTATTTATTGGCTGTTGTGTTAGTATCTTTTTAATATGGCTCTAGTAGCATTAAAAAGTTATGTACTCCTGTAGGATTAGAAATGGGAGTACCAAACTTCATGCTCAACTAAGTTCTAAATCATAATAAATGAAGTCAAGATTTTTGGTAGTTGATTTTTTACAGTTCTAACAAGTTAACTTTTTTCTAAGTATGTTTTTAACATTCATTGTAGAGAATTGGCTATGAAACATATTTGGTTTCTAAGTGGCGTTGTGTAACTTATGTCTGAATTATAAAAATTAGGTTATATAAAACTTAGATTATATAGTTTAGGTTATAACATCTTTTTGTCTCTAAATCTTCGCCAAAGAAACAGTTTAATCTCTCTTTTTTTTAGTACAATAATTTAATTTCCATACATTTAAAGTTTGTACTTCAATCTTCATAACAATTCCTGTCATAAAAGTAGTAATAATGTCTAATGAGATTTTTTCTAATGATTATTGGATAAGGTATGGATTGAATTATTATAAATTTAAGTATAGAGGTTGTATATTGGACTAATTGTTACAAATTAAAAAATACATGAATTACAATATTACTCTACGTTGTAATCACTTGGTATTTACACAATTGAATTTACTGACATCTTCTACAACAAAATTACAAAAATCTCGATGATGAAACCCAATTGTAAAAATAGAACATTGTTGTCTATGACGTGTTTGGAATGATTTTTATAAATGCTAGTAAACAATTTTTTTTACTCAAGAATACATTCTTAAACACTTAAATAGTCATTTCAAATAGTCGTCGCAGCTGTTGTTCATTTTGACACCGAACTACATAGGATTATATACAGAACTCTCGATTTTCCAGCGTCCTGACAATAGGACGATCAGCAAACATCCAACTTTTCAATTATTTCAAAATGCATCTGCATGTCCACAGGCCTATCAGCAAAATAAAATTTCTCACAAACCTGCAAGTTTCCTCCACTTCTAGGAGCTTCGGATTTAAGAAAACTCATCATTATTTCCTTCTCATCTCGAATCCAAACTCTTCTGCTGTCACACCATTGTCTTGGAGTACCATGAAGAACAAAACGGAGTCCTCTTGACTCCTGAACAAGGCTGCTGGAGATTCTTTTGATTACTTCAGGGTCTGTAAGATATGCACCATGGGAGTTCAACCCAACATCTATGTAATGGATCTCTGTTATGCTTTTCAAAAAGCTTTGTTCTGTAGTTGATATAAACTGAACCTCACCAAGTTTAGAACATTCAACACCTAGGTCTTGCTTGAAATGAGGTAGATTCTCATCAGCAGCTATCAAGTCTTTGTTGCCAAGTTCAGTAACTAGCTGGTTAACCACAGTACCTCCCTTGCTAAATCCAAGAATGATTGTTTTGGGGGTGCGGCAGCCTAGTGTGGATATGGTGGTTTCCTGTGTTCCTGGTTTTCCGCTAGAAATTATCTTCTTTACCTGCCAATTCCACAACTGAGGGCATTAGTCCACATCAAAATCACAAGCAGACAATCAACCTAAACATAAGCAGTCAACCTAAACATACCTATTAGTTAGTTTAGATACACACATTCTCGATTCAAAAGATTCAAACCTCTCAACCACAGCTCAACAAGTTCAGCCATACATAGCTTAGTTTAGACATCAAACTCTTGAGGTTAGAGATTCGAAACTCTCAACCATATAGCTCAACAAGTTTAGGCATACAGTACCAATCAAGAGGTCAAAGAGTCAAATCTTTTTAACCAACTGATTTAATAATACACCATCGATCAAGAAGTTGGTATTCAGAATATTTTTCTTACCATAACTCAACTAACTGGTTTAATTAAACATACCATTAAACTAAGAGACTAAATGGTTAGAGATTCGCATATACTCAACCCCTTAAAATCAAAAGATCACAATCAGCGACCCGATTTGGACCATACCTCATTGGAGCATCTTCCCAAAAGTGAAATGGTTGACCGTGAAGCAGGAAACCCATTTGGAGTATATGATTTTGGTTCTCCCCACCGATTGATTAAAGGTATAAAATCATGATACATAGCAAAAGCCCCATTGAAATCAGAAGCCTCGACAACCCATGCATTAGTGGAATCTCCAAACTTCGAAACTAAAATTTCAGCTATGTTCTGCAGATTAGACAATCTCTCAATCACTGGGTTGCCTGTCCCTTCGACCCGATCCCCATTGAAAAAAATGGCGTTTGCACAAGGAACCTATTAAAACATCATAAAAGGAAGGCGTTTAGTTTAGGGCAATTTGTGATAGTGATGCAATGGCAGAATATTCGAGAACTTACAGTTAATTTTTTGGAGGTTGGGGAAAGGGAGAGTGATGCAGCAACTCGATAAAACTTTTCGCTGTTGGGATGCAATGGAACCTTCAAAATTCCACTCCAACGATCCATGCTTGAAGCCTTGAATTGAATCGCCTACCGAAAATCATTCGCACGTTCCATTTTTTGTTTTTTTTTTTCTTTTTTCTTTTGTTTTTTACACAGAATCAATCCATTTACATATTTTAAAAAAATTAAATTCCATAAAATTTAGTATTTGTGTCTAATAGGTCTTAATATATAATAAATTCGTTTGTATTTATGCCAAAACGAGATAACATGAAGGTTGGATCTTGTATAAAAAAAAGAAAAGAAAAAGTACCAACTTCTTTAGATTTTGTGTTTAATATGTATCAACAACATTAAACTTGCAATGAGTTTATATAAAATTGATCTTTTTTTTTTTGGTCTTATTTTACAAAGCAAGAATCAATCCATTTAAATTTTTTTTATTAAATTGCAAATTTAGTTGATAGACTTTAGTGTTTGTGGTTAACAAATCCAGAACTGTGTTTAATGGGTTCTAAACAGACTTTTGAAATTTGAATTTAGCATATTTATAGGTTCTTCAACATTTTAAATCAACAATTCTATTAGTCATTAAATTAAAATTCCACGATAACACATTAAATTTGTAAAGTTGAATAATTTTGTGAATTTACATTGATTCCATATATTTTATTTTTATTTTTTATTTGCCGTTTTTTTAGACATAAAGAATTAATCACGTTATGTTCGTGTTTTATAAATCTCAATATTTAATATATTCTTTAAGTCTGAATTTTGTATTTGATAGAATATTAAAAAAACTAATTTAGATATAAATTAAAATTTTATGTCTAGGGAATTTTTAATTTTGCGTAGTTAGTGGATTTTTAAATTTTAAAAAAAAATATATCTAAGTTAGAAGAACAAAATGTGTTATTTTAAAAAATTAGATCAAATACACACATTTTAAATTTTAGGACTTAAATTAAATGTGTAATTTAACTAAAATATATAAATTTCTGGAGGGTGTGGATGCCAATCATTAAGTGTTGAGGGCCAAATTGGTATTTCAACAAATTATAAGATATTAAGGACGGCACAATATGGGCAGGTTTTAAACCCAAAGCTTAAGGCGTAACTCGCTCCTCCGTTGGTTTGTTATCTGCGGCCGCCACTCCTACAATCTTCAAGATGAAGGTTCAGCTCGACTGTTTTCTCTTCTTCTCGACTGTTTTCTCTTCTTCTCGATTTTTCTAGTTTTACTTCTATACTAACTTTGAGTTTTTATGTTTCCAGTTTAATATCGCGAATCCGCCTACTGGATGCCAGAAGAAGCTCGAAATCGACGATGATCAGAAACTGTTAGCTTTCAAATTCCCTTATGCTTCAATCTATTGTCTAATTGACGAACGTTGTATTAAATGTGGATGAATTTTTCTGTCCATCTTCAGTCGAAAGTTCGTACTGTAAACATTACATAATTACATGCGGTTGGTCTAATTGAATGTTGAAAAGTATATCTTGATATGCATAAATCCTTGATTCCAATACTATCACTTTCTGGAACTAGGTGTTTTGAGATGCATAATTGATAAAATGGGCCCCTTTTGATCAGTAGAGTTGTGTTGAAATCGATTGTTCGATATCTTTGCAGTTTAAATTTTTTAGTTAACGTTCAGCCTTGCTATGGGGGATTTAGCTTAGGGCTGTTTGGTATGGATTCTATTTCATGTTTTCAAACCTACGCATGCCTTAACTGAAATTATTTTTTTGAAAAATGTTTTCCAATTTCGTAATCTGTAAACTCTGAAAACATTTTTCGATGTATCTGATTTTAATTTTGAAATTTAAAAATGTGGATTAATATTAAAAAATATTATAATGTAAATAAATATAGTATTTCATATTATAAACTCAATTCATTTTTGTTAAATTAAATCAATTTTATTGTAATTATACAATATTACTTGGTGAGACAACAAACACAATTTTTAAAAATAGAAAACCAAATATCAAATGGTTATTGAACGTGGCTTAGCTATCCCGTTTTTGTTTGTTTGTATGTTTTCCATCATTTTTTTACCCGGTTTAAATTGTCATCATGTACTAAACTTATCATTTGTCATTGTCTAGCCGTGTTTTTTTTACAAGAGGATCTCCCAGGAGGTCAGTGGAGATTCCCTCGGAGAGGTATGTGGGTGCTCTTTTGTCATCCAATCCAATGAGTGGGATCTTAGTTCCTAGTTGCTGTCTCTGAGTTTACTGATTGTGGATTTTCGTTTGCTCAGGAATTCAAGGGCTATGTTTTCAAGATTATGGGAGGTTGTGACAAGCAGGGTTTCCCAATGAAACAGGGAGTTTTGACCCCTGGCCGTGTTCGTCTCTTGCTACACAGAGGTTTGTGGAACTGTTTATACTAGTGAATAGTTAACCCATTTGATCATATTTGTGGCCTTTATTTGGTATTCTATCATTTATCCAAATCAAGATTAGTTTTCTTACCAAAAACATATTTGTGGCCTTTGTTATTTGTTTTCATTAGTTTATAGGACTATCCAAGACTACAATGCAACCATCTCGTATATCACTTCCTTATTTCCCTAAGTTGGTGTAAATTGATCTTCCATTTTTGTAACTATAGTTCATCTAATCTTATGAGTTATGACCAACTGGAGAAGCTTCTTTTTTAATCCATTGAACATGATCTTTTATAATCTCATCTCACCAATGAAATTTTTGTTTCCTATAAAAAGAATAGTTTATAGCACTATTGTAAGTAATGCAAGCATGTGTTTGATTTCTGTGGGCAGGTACCCCTTGTTTCCGTGGTTATGGTAGGCGTAATGGAGAACGCAGAAGGAAGTCTGTACGTGGATGCATCGTGAGCCCTGACCTTTCTGTTCTGAATCTGGTTATTGTGAAGAAGGGAGATAATGATCTTCCTGGCTTGACAGACACCGAGAAACCCAGAATGAGGGGTCCCAAGAGGGCTTCCAAAATTCGCAAACTTTTCAATTTGTCAAAGGATGATGATGTTAGAAAGTATGTTAACACATACCGTAGATCATTCACCACAAAATCTGGTATGGTTTATGAATGCTTTTTATGGTTATTGAACTTGTCAAACAAATTGAAGGCTTGTTTGGATGGACATTATGAGTGCTCACACTTTTTTTTTTATGCTCGAAGACATTTTTTTTCACTTGAAAACATTTTTGTAAGCATTTAGAAAATCAATCCAAAAACACTTTTTTTTTTCACTTAAAAATACTTTTTTCAACTGTGACTCATAGATGGTGCAATTACCGGTTGAGGTCTAATACTTTTCTCAAGTTGCATACTCGGATGGATGAATTCTTATAATAACCTATACTTGGTAATTTGTATGGATAGTTCTAATGTTTCTAGATGAACTGAACCTTCATTTGACTTTGGTTTGCATTGTTGTTCAATATTGCTCCTGTTTGCACATCTAGATGGATTGTGCTTTGTAACTGGACTAATCTCCTGGTCTCATTTTGGCTATTAATAAAATTTTATTTTATTCGAATTGTTTTTTCAGTTCCATATATTTTGGAATCAAGTTTCTGATGTTTATTTGATTGGCAGGCAAAAAGGTTAGCAAAGCACCGAAGATTCAGAGGTTGGTTACCCCACTGACCCTGCAGAGGAAGCGAGGAAGAATTGCTGAGAAGAAGAAGAGAATTGCGAAGGCCAAGTCTGAGGCTGCTGAGTATCAGAAGCTCCTGGCCTCTAGGTTGAAGGAGCAAAGAGAACGTCGCAGTGAGAGTTTGGCCAAGAAGAGATCCAGGCTCTCTGCTGCTTCAAAGCCGTCTATTGCTGCTGCTTAGTTCATTTGCCTTGTTCGACTCAGTGTTCTATTATCTTTATATGTGCATCTTGTTTTCCTTTCAAGAGAATGTTTTTTTTGCGGCTGCTTATATGAGATCTGGAACGTTTTGGTACTTTTGGTGTCTGTTAGCCCATATTTTCTTTTTTCACTTTGTTTATGGATTCATGCAAACCGTTAATTGTTTCTCACCAGAAATACAGATACAGATTAGAATCTCCAGCATTTAGCAAGCTGAGAGAATTTCCTCCATAGTCCATTCGCTTGTATTCACTTTATGAAAGTCATTTATTGACATGCTTCATATTTTATGAAAGTCTAGAG

mRNA sequence

AAAATCTGGAAACCCATGTGGTAGAAATACCACAGAATTTGACAGCATCAAACACATGATGATATTTACATATATATAGAAGAGAACGGTAAGTGCTCTTCCTTCTTGTGGAGTTGATATTCTTCTTCTTCATTCACACCATTTCTCTCTGACCTTCTTTCTCATCTCAATGCTGTAATTCTTCTCCAACCCTTTTCCCCTTCTTCTTCTTCTTCTTCCCAAAACCACATTATTTCTCTCTCTTTCATTCTGTCTCTCTATGTATATGCCTGTGGTTAATTGGTGCACAACAGTGGAAAGTAAAATAGGGGAGAGAGGTGGACACATGCTTTGAGGGTTTGAGCTTCCTCTTTGTCAAAACATTGCCTGAATTCTCTCCAACCCTTTCTTAGCTCTGTCCTTCCTCTCTCTCCCTTCCTCTCTTTGCCATCTACGCTCCCTTTTCATCTATTCCGTGCTCAATTGGGACGTTTTTCAATCAATCATATTCAATGTGTCGTGATTTGTTCTAAAAGGTTGAAGATTTCAGGGTTTACCGGCGGACCCAGATAGCAAAAACTATCAGGTTTAAGGAGGCGTCGTTTTTCTTTCCTATAATCAATCCATAAGGCAGATTAATTTTGTTCTTTGTTCTTTTTTCCTCATCCCATTGTCCACAAACTTGGGTTTTTTTGGTTCACATTAAGAGTTTTCAATCAATTGTTACGCAATCATAACTTCTTTCTTTGGAGGACTAGGGTTTTGAAGTAATAGCATAGCTTCGCGTGGGTTTGTATACCCCCATCCAGCGTCTTAAGGAATTCGGAATTTGGGTTGGAAAAGAATGGTTTTGTGTATAACTGTGTTTCTTTTTTGTTGTTCGAAATTTCCCTGAGGTTTGGGGTGGCAGTATCTGGGTTTCTGGGATTGGCTGGAACATGAATCGGAGGGTGAGGAGGAAGGTGACAAGAAAAGGGAAGGAGAAGCTGATTTTGCCAAGCTACCCTGAAATTGAAATTGAGATTGCTGATTTGGACAATAAACAGACTGTAGATTGGACTAGTTTGCCTGATGATACAGTCATTCAGCTTTTCTCTTGTTTGAATTATCGTGACCGGGCAAACTTGTCATCGACTTGTAGAACATGGAGACTTCTTGGTTCATCTTCATGCTTGTGGACTTCATTTGATCTTCGAGCACACAGAATTGATGCTGCAATGGCTGCTTCTCTTGCTTCTAGGTGCAAGAATCTTCAGAAGCTCAGGTTTCGTGGGGCAGAGTCTGCTGATGCAATAATTCTACTTCTTGCAAAGAATTTGCGTGAAATAAGTGGTGATTACTGTAGAAAAATTACTGATGCTACACTCTCTGCCATTGCAGCTCGACACCAGGCACTTGAAAGCCTCCAGCTTGGGCCAGATTTCTGTGAAAGGATCAGTAGCGATGCTATAAAAGCAATAGCTATTTGTTGTCATAAGTTGAAAAAACTTAGGCTTTCTGGAATTAGGGATGTCAATGCAGAGGCTCTCAATGCTCTATCAAAGCATTGCCCTAATTTGCTGGAAATAGGGTTCATTGATTGTCAGAATATAGATGAGATGGCCCTTGGAAATGTATCATCGGTTCGTTTTCTCTCAGTTGCAGGGACCTCAAATATGAAGTGGGGTGCTGTTTCACATCAGTGGCACAAGCTGCCTAACTTGGTTGGTTTAGATGTGTCACGAACTGATATTGGTCCTGTTGCTGTATCAAGATTAATTTCATCTTCTCAGAGCTTAAAAGTCTTGTGTGCCTTCAATTGTGCAGTTCTAGAAGAAGATGCTGGCTTCACTGTCAGCAAATATAAAGGCAAGCTGTTGCTTGCCCTTTTCACTGATGTTGTGAAGGAAATAGCTTCTTTATTTGTCGATATCATAACGAAAGGGGAAAACATGTTGTTAGATTGGAGGAATTTGAAGAATAAAAACAAGATTTTGGACGAGATAATGATGTGGCTTGAGTGGATATTATCTCATAATCTTCTGCGCATTGCTGAGAGCAATCAACATGGTCTGGACAATTTTTGGCTCAATCAAGGTGCAGCTTTGTTACTTAGTTTGATGCAGAGCTCACAAGAGGATGTTCAAGAAAGGGCAGCGACAGGTCTTGCAACTTTTGTTGTCATTGATGATGAAAATGCTAGTATTGACTCTGGAAGGGCAGAAGAAGTTATGCGGCGTGGTGGTATTCGTCTCCTTCTAAACTTGGCAAAGTCTTGGAGAGAAGGGCTTCAGTCTGAGGCAGCAAAGGCCATAGCAAACTTGTCTGTGAATGCTAATGTTGCAAAGGCAGTAGCCGAAGAAGGTGGAATTGATATTCTTGCAGGCCTTGCAAGATCCATGAACAGGCTAGTTGCAGAAGAGGCTGCTGGAGGATTGTGGAATCTTTCTGTTGGCGAGGAACACAAAGGTGCGATTGCTGAGGCTGGTGGAGTAAGAGCTTTAGTTGATTTGATATTTAAATGGTCTTCTGGTGGTGATGGAGTTCTTGAACGTGCAGCTGGTGCACTAGCAAATTTGGCAGCTGATGATAGGTGTAGTACTGAAGTTGCTTTAGCAGGTGGCGTGCATGCACTGGTGATGCTTGCTCGCAACTGCAAGTTTGAAGGAGTGCAAGAACAGGCCGCTCGAGCATTGGCTAACTTAGCTGCCCATGGGGATAGCAACACAAACAACTCTGCTGTTGGACAAGAGGCAGGTGCACTTGAAGCACTTGTTCAACTTACACATTCTCCTCATGAAGGCGTCAGGCAAGAGGCTGCTGGTGCCCTATGGAATTTATCATTTGATGACAGAAATAGAGAAGCAATTGCAGCTGCAGGTGGTGTTGAGGCATTGGTTGCTCTAGCACAATCTTGTTCAAATGCATCCCCGGGTCTTCAGGAAAGGGCTGCTGGTGCTCTGTGGGGATTGTCAGTTTCCGAAGCCAACAGCATCGCTATTGGTCAGCAAGGGGGCGTTGCACCATTAATTGCTTTGGCACGTTCAGATGCTGAAGATGTTCACGAGACTGCTGCTGGAGCTCTTTGGAATCTCGCATTCAACCCTGGTAATGCCCTTCGTATAGTTGAAGAAGGGGGTGTTCCAGCCCTAGTTCATCTTTGTTATGCATCAGTATCAAAAATGGCACGCTTCATGGCTGCTTTGGCATTGGCTTACATGTTTGATGGGAGGATGGATGAATGTGCCTTGCCAGGAAGCTCATCAGAAGGCATTTCCAAGAGTGTGAGCTTAGATGGGGCTAGAAGGATGGCATTAAAGAACATTGAAGCATTTGTCCAGACATTTTCAGATCCACAAGCATTTGCCTCTGCTGCTGCTTCCTCGGCACCTGCAGCATTGGTGCAAGTAACAGAACGAGCTCGTATTCAAGAAGCGGGCCATCTGCGATGCAGTGGAGCTGAAATTGGAAGATTTGTTGCAATGCTTCGAAATCCATCACCTACGCTAAAAGCATGTGCAGCTTTCGCTCTTCTACAGGCAAGTGCACAGAAGTTCATAGTGTTTTTCCTGATTTTATAAGCTACCCTTAGTTTTGGCACAAATTTTCTTCAATACATGTATTCTTCATATGCAGTTTACTATCCCGGGGGGTCGGCACGCCTTACACCATGCAAGCCTTATGCAGAATGCAGGAGCATCAAGAGCCCTGCGTACTGCAGCTGCAGCAGCAACTGCGCCATTACAAGCGAAAATCTTCGCTAGAATTGTTCTTAGAAATCTAGAGCACCACAGCATTGAATCTTCCCTTTAAAGACAAATGCAACATAAATTTGCAACAGAAGGTGAGTTCTTGTTCAACTCAACTCATGGAGCTTAAACGAGCTGCATGGCATGCCCGAACCAGGTGCTCATGTAAATGCCCCATTAATCATATACCGAGTTTCTGATGCCAAGTAGTACACAAACTATATGGTTTGACTTCTCAGCTTCATCGTTACTTCCTGGTCTATTACCTTTATCAGAAGCGTCAAAAGGTCGTTTTTCGAAGCATTTTCTAATTTTTCTTCGTGTACTTGCAGTCTGCATTCTATCAGAGTTCTTATATACTGATGGTACTTTATGCAGGGAGTCGATAAGAATCACATTACCTTGAAACCTTTTTCTTCTTTTCCTTCTTTTGGGTTAGCCATTTGTTTTGGTTGTTAATATTGATCCTGGTGGTATTTTCCCTGTAATGATAGAAGTTCTGAATATGCTAAAGATTGTACAATGTTTAGCTCTCATTTTTGCAGTTGGACTGGTTGAAGATGAATTAAGGATGTTTAGCTATCGAACTTCATTTAATTTCATGTTTATCCAGTTGAGGGTGAAGTTGTTTAATAAATGGTGATTTGAATGTACTGAGATACAAAATCTAGGTTCAAATCAAATATGTACACATTGTTAAAGGGAAGCACCTTATTGGATTCTCACTTAGTTTTGTATGTTATTGCCCGATTGCACTCAAGAGTGCCTAAATTCGGGCCTATACTAGCATGGGACTTGAAAATGAATCAAAAGTGAAGGAGGAAGCTTTGGTGGAGGTTTCAAGGTGTGTGGAGGACGGAAATGCTGCTCAAGCTAAGCAGAGTGCAAGTAGTTGTCAGGAAAACATTCATGACATGGAAGCTTCATCCTTTGAACGAAGTACGATGTTAGGTAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGAGAATTGTGAGGGAGGTCCTAGTAATGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGAGATACTGTTTCTGGGACAGATTATGGTTTGCTATTAGATGATGAAGAAGTCGAATCCCAATTATATGGTGATAATAATTTGCAGCCTATGTCTAATGGATACAGCGAATTATTTCCAAGGAAGAAAAAGTTGACAGCTCACTGGAGGAAGTTTATAAGTCCTCTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCGTTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTATCTACGAACACTTCTCAACCGAAGGTTTTGATGTAAAGTCAACAGGATTCTCAAGTCACACTCAAAAACACAGGGTTATGAAAAGAAAGGGAAGGAAGAAAGTTGAAGAGACTACTGATGTAGCTTCATATATGGCACATCATAATCTGTTCTCCTATTATGAGAAGAAGAGGTCTCTTGCTGATGACATGTCTTTGGAAGATACTTTCCTTAAATTAGACAAAACAAGGAATATAAAACGTGAAGACATCAATGACTTCGGAACAATGGCAACTGATGGATGGGCATCCTCCATGTTGGGAGATAACGATAATAATTTGGAGGACGTTTTTCTAAAAATTGAAGCTGCACAGTTGAAAGTTCATGAGTTGAAGAACAGAATTGATAAGGTGGTGAATGAAAATCCCATGAAGTTCTCTGCAATCAATCAGCTATACTCTCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGATGGAAATGATGAGTTAGTTAGGTCTTTGCATGAAGCATCGCAACACATATCTGAGAATGCCTTAGATGTACTTATGCCTGAAACCGCAATTAAAACTCATGGAGAGGTCATGCTACTTCCTGATATTACTCAGAGCGCAGATTGTGGAACTACTCAAAAACTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGTTGCAACTTTCCGAAGAGGTTAAAGGTCAGTTGATTGAGCCTCAAAAATTAGAGCAGAAAATCATTTCTCTAGCTGCAGTTTCTCAAGTTGACTTAACCTCAAAGGACGACATGCTGCACAAAACAAAATCCCCTTCTGCTGTCAAGCCTAATTCATCTAAGAAAACAAGAAAGCGAGGAAGGCGAAAAATCGGTTTGAGTAAGAAGAATAGGAAAGCAACAGGTTAGCGGAATAATTGGAACTAGAAGAAGGTGGTTATTTTGCTTCTCATTATGTGGTTGATGGAGGTTTTAAACCCAAAGCTTAAGGCGTAACTCGCTCCTCCGTTGGTTTGTTATCTGCGGCCGCCACTCCTACAATCTTCAAGATGAAGTTTAATATCGCGAATCCGCCTACTGGATGCCAGAAGAAGCTCGAAATCGACGATGATCAGAAACTCCGTGTTTTTTTTACAAGAGGATCTCCCAGGAGGTCAGTGGAGATTCCCTCGGAGAGGAATTCAAGGGCTATGTTTTCAAGATTATGGGAGGTTGTGACAAGCAGGGTTTCCCAATGAAACAGGGAGTTTTGACCCCTGGCCGTGTTCGTCTCTTGCTACACAGAGGTACCCCTTGTTTCCGTGGTTATGGTAGGCGTAATGGAGAACGCAGAAGGAAGTCTGTACGTGGATGCATCGTGAGCCCTGACCTTTCTGTTCTGAATCTGGTTATTGTGAAGAAGGGAGATAATGATCTTCCTGGCTTGACAGACACCGAGAAACCCAGAATGAGGGGTCCCAAGAGGGCTTCCAAAATTCGCAAACTTTTCAATTTGTCAAAGGATGATGATGTTAGAAAGTATGTTAACACATACCGTAGATCATTCACCACAAAATCTGGCAAAAAGGTTAGCAAAGCACCGAAGATTCAGAGGTTGGTTACCCCACTGACCCTGCAGAGGAAGCGAGGAAGAATTGCTGAGAAGAAGAAGAGAATTGCGAAGGCCAAGTCTGAGGCTGCTGAGTATCAGAAGCTCCTGGCCTCTAGGTTGAAGGAGCAAAGAGAACGTCGCAGTGAGAGTTTGGCCAAGAAGAGATCCAGGCTCTCTGCTGCTTCAAAGCCGTCTATTGCTGCTGCTTAGTTCATTTGCCTTGTTCGACTCAGTGTTCTATTATCTTTATATGTGCATCTTGTTTTCCTTTCAAGAGAATGTTTTTTTTGCGGCTGCTTATATGAGATCTGGAACGTTTTGGTACTTTTGGTGTCTGTTAGCCCATATTTTCTTTTTTCACTTTGTTTATGGATTCATGCAAACCGTTAATTGTTTCTCACCAGAAATACAGATACAGATTAGAATCTCCAGCATTTAGCAAGCTGAGAGAATTTCCTCCATAGTCCATTCGCTTGTATTCACTTTATGAAAGTCATTTATTGACATGCTTCATATTTTATGAAAGTCTAGAG

Coding sequence (CDS)

ATGGGACTTGAAAATGAATCAAAAGTGAAGGAGGAAGCTTTGGTGGAGGTTTCAAGGTGTGTGGAGGACGGAAATGCTGCTCAAGCTAAGCAGAGTGCAAGTAGTTGTCAGGAAAACATTCATGACATGGAAGCTTCATCCTTTGAACGAAGTACGATGTTAGGTAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGAGAATTGTGAGGGAGGTCCTAGTAATGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGAGATACTGTTTCTGGGACAGATTATGGTTTGCTATTAGATGATGAAGAAGTCGAATCCCAATTATATGGTGATAATAATTTGCAGCCTATGTCTAATGGATACAGCGAATTATTTCCAAGGAAGAAAAAGTTGACAGCTCACTGGAGGAAGTTTATAAGTCCTCTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCGTTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTATCTACGAACACTTCTCAACCGAAGGTTTTGATGTAAAGTCAACAGGATTCTCAAGTCACACTCAAAAACACAGGGTTATGAAAAGAAAGGGAAGGAAGAAAGTTGAAGAGACTACTGATGTAGCTTCATATATGGCACATCATAATCTGTTCTCCTATTATGAGAAGAAGAGGTCTCTTGCTGATGACATGTCTTTGGAAGATACTTTCCTTAAATTAGACAAAACAAGGAATATAAAACGTGAAGACATCAATGACTTCGGAACAATGGCAACTGATGGATGGGCATCCTCCATGTTGGGAGATAACGATAATAATTTGGAGGACGTTTTTCTAAAAATTGAAGCTGCACAGTTGAAAGTTCATGAGTTGAAGAACAGAATTGATAAGGTGGTGAATGAAAATCCCATGAAGTTCTCTGCAATCAATCAGCTATACTCTCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGATGGAAATGATGAGTTAGTTAGGTCTTTGCATGAAGCATCGCAACACATATCTGAGAATGCCTTAGATGTACTTATGCCTGAAACCGCAATTAAAACTCATGGAGAGGTCATGCTACTTCCTGATATTACTCAGAGCGCAGATTGTGGAACTACTCAAAAACTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGTTGCAACTTTCCGAAGAGGTTAAAGGTCAGTTGATTGAGCCTCAAAAATTAGAGCAGAAAATCATTTCTCTAGCTGCAGTTTCTCAAGTTGACTTAACCTCAAAGGACGACATGCTGCACAAAACAAAATCCCCTTCTGCTGTCAAGCCTAATTCATCTAAGAAAACAAGAAAGCGAGGAAGGCGAAAAATCGGTTTGAGTAAGAAGAATAGGAAAGCAACAGGTTAG

Protein sequence

MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDMELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMSNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSIYEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLADDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKVHELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALDVLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKLEQKIISLAAVSQVDLTSKDDMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKATG
Homology
BLAST of CcUC05G081410 vs. NCBI nr
Match: XP_038886802.1 (uncharacterized protein LOC120076910 [Benincasa hispida])

HSP 1 Score: 832.0 bits (2148), Expect = 2.6e-237
Identity = 438/481 (91.06%), Postives = 456/481 (94.80%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           MGLE+ SKVKEE L+EVSRCVEDGNAAQ KQSA SCQ+NIHD+EA SFE S ML RSEDM
Sbjct: 1   MGLEDGSKVKEEVLMEVSRCVEDGNAAQDKQSAGSCQDNIHDIEAPSFEPSMMLDRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           ELDIIGCTENCEGGPSNECNVSTE SSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ M
Sbjct: 61  ELDIIGCTENCEGGPSNECNVSTEYSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQHM 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGYSE+FPRKKKLTAHWRKFI PLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQS+
Sbjct: 121 SNGYSEVFPRKKKLTAHWRKFIGPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSV 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           YEHFSTEGFDVKSTGFSSHTQ+HRVMKRKGRKKVEETTD+ASYMAHHNLFSYYEKKRSLA
Sbjct: 181 YEHFSTEGFDVKSTGFSSHTQRHRVMKRKGRKKVEETTDIASYMAHHNLFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTF KLDKTRNIKR+DINDFGTMATDGWASSMLGD+DNNL+DVFLKIEAAQ KV
Sbjct: 241 DDMSLEDTFFKLDKTRNIKRDDINDFGTMATDGWASSMLGDSDNNLKDVFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALD 360
           HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGND+LVRSLHEASQHISE+ALD
Sbjct: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDKLVRSLHEASQHISEHALD 360

Query: 361 VLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL- 420
           VLMPETAIKTHGEVMLLPD+TQSADCGTT+K+L QDSAVKEELQLSE VKGQLIE QKL 
Sbjct: 361 VLMPETAIKTHGEVMLLPDMTQSADCGTTEKVLRQDSAVKEELQLSEGVKGQLIESQKLE 420

Query: 421 EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKAT 478
           EQKIISLAAVSQ DLTS D   + LHKTKSPSA KPNSSK+TRKRGRRKIG SKKNRKAT
Sbjct: 421 EQKIISLAAVSQSDLTSNDKEPNTLHKTKSPSAAKPNSSKRTRKRGRRKIGSSKKNRKAT 480

BLAST of CcUC05G081410 vs. NCBI nr
Match: XP_004133783.1 (uncharacterized protein LOC101222847 isoform X1 [Cucumis sativus] >XP_011650651.1 uncharacterized protein LOC101222847 isoform X1 [Cucumis sativus] >KGN56377.1 hypothetical protein Csa_011573 [Cucumis sativus])

HSP 1 Score: 792.0 bits (2044), Expect = 2.9e-225
Identity = 417/481 (86.69%), Postives = 445/481 (92.52%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           M LEN SK KEE L+EVSRCVED NA Q KQ+ASS QENIHD+EASSFERS ML RSEDM
Sbjct: 1   MELENGSKAKEEVLMEVSRCVEDENATQDKQNASSGQENIHDIEASSFERSMMLNRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           E+D+IGC+ENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ  
Sbjct: 61  EVDVIGCSENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQSN 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGY E+FPRKKKLTAHWRKFISP+MWRCRWLE+QIKKLQ+QSLKYDRELALYDQRKQS 
Sbjct: 121 SNGYGEVFPRKKKLTAHWRKFISPIMWRCRWLEVQIKKLQAQSLKYDRELALYDQRKQSF 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           Y+ FS +GF VKSTGFS+HTQ+HR MKRKGRK VEETTD ASYMAHHN+FSYYEKKRSLA
Sbjct: 181 YKDFSADGFSVKSTGFSNHTQRHRFMKRKGRKMVEETTDAASYMAHHNVFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTFLKLDKTRNIKR+DINDFGT+ATDGWASSMLG+NDNNLED+FLKIEAAQ KV
Sbjct: 241 DDMSLEDTFLKLDKTRNIKRDDINDFGTIATDGWASSMLGNNDNNLEDIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLA-SSDDPASPEDGNDELVRSLHEASQHISENAL 360
           HELKNRIDKVVNENPMKFS INQLYSLA SSDDPASP DGNDELVRSLHEASQH+SE+AL
Sbjct: 301 HELKNRIDKVVNENPMKFSVINQLYSLASSSDDPASPGDGNDELVRSLHEASQHMSEHAL 360

Query: 361 DVLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL 420
           DVLMPETAIKTHGEVMLLPD+ +S DCGTTQK+LMQDSAVKEELQLS+EVKGQL+E Q  
Sbjct: 361 DVLMPETAIKTHGEVMLLPDMMRSTDCGTTQKVLMQDSAVKEELQLSKEVKGQLVELQNS 420

Query: 421 -EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKA 477
            EQK ISLAA+SQ DLTSKD   DMLHKTKSPSA+KPNSSKKTRKRGRRKIG SKKNRKA
Sbjct: 421 EEQKSISLAAISQADLTSKDKEPDMLHKTKSPSAMKPNSSKKTRKRGRRKIGSSKKNRKA 480

BLAST of CcUC05G081410 vs. NCBI nr
Match: XP_008437823.1 (PREDICTED: uncharacterized protein LOC103483139 [Cucumis melo] >XP_016898997.1 PREDICTED: uncharacterized protein LOC103483139 [Cucumis melo] >KAA0048848.1 uncharacterized protein E6C27_scaffold171G00620 [Cucumis melo var. makuwa] >TYK20800.1 uncharacterized protein E5676_scaffold291G00610 [Cucumis melo var. makuwa])

HSP 1 Score: 789.3 bits (2037), Expect = 1.9e-224
Identity = 419/481 (87.11%), Postives = 445/481 (92.52%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           M LEN SK KEEAL+EVSRCVEDGNA + KQ+ASS QENI D+EASSFERS ML RSEDM
Sbjct: 1   MELENGSKAKEEALMEVSRCVEDGNATRDKQNASSGQENIRDIEASSFERSMMLDRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           E+DIIGC+ENCEGGPSNECNV TENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ  
Sbjct: 61  EVDIIGCSENCEGGPSNECNVLTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQSN 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGY E+FPRKKKLTAHWRKFISP+MWRCRWLE+QIKKLQ+QSLKYDRELALYDQRKQS 
Sbjct: 121 SNGYGEVFPRKKKLTAHWRKFISPIMWRCRWLEVQIKKLQAQSLKYDRELALYDQRKQSF 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           YE FS +GF VKSTGFS+HTQ+HR MKRKGRKKVEETTDVASYMAHHN+FSYYEKKRSLA
Sbjct: 181 YEDFSGDGFAVKSTGFSNHTQRHRFMKRKGRKKVEETTDVASYMAHHNVFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTFLKLDKTRNIKR+DIND GT+ATDGWASSMLG+NDNNLED+FLKIEAAQ KV
Sbjct: 241 DDMSLEDTFLKLDKTRNIKRDDINDLGTIATDGWASSMLGNNDNNLEDIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALD 360
           HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQH+SE+ALD
Sbjct: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHMSEHALD 360

Query: 361 VLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL- 420
           VLMPETAIKTHGEVMLLPD+TQS DCGTTQK+LMQDSAVKEELQLS+E K QLIE Q   
Sbjct: 361 VLMPETAIKTHGEVMLLPDMTQSTDCGTTQKVLMQDSAVKEELQLSKEDKDQLIELQNSE 420

Query: 421 EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKAT 478
           EQK +SLAA+SQ D +SKD   DMLHK KS SAVKPNSSKKTRKRGRRKIG SKKNRKAT
Sbjct: 421 EQKSVSLAAISQAD-SSKDKEPDMLHKAKSSSAVKPNSSKKTRKRGRRKIGSSKKNRKAT 480

BLAST of CcUC05G081410 vs. NCBI nr
Match: XP_031737513.1 (uncharacterized protein LOC101222847 isoform X2 [Cucumis sativus])

HSP 1 Score: 745.7 bits (1924), Expect = 2.4e-211
Identity = 398/481 (82.74%), Postives = 426/481 (88.57%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           M LEN SK KEE L+EVSRCVED NA Q KQ+ASS QENIHD+EASSFERS ML RSEDM
Sbjct: 1   MELENGSKAKEEVLMEVSRCVEDENATQDKQNASSGQENIHDIEASSFERSMMLNRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           E+D+IGC+ENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ  
Sbjct: 61  EVDVIGCSENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQSN 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGY E+FPRKKKLTAHWRKFISP+MWRCRWLE+QIKKLQ+QSLKYDRELALYDQRKQS 
Sbjct: 121 SNGYGEVFPRKKKLTAHWRKFISPIMWRCRWLEVQIKKLQAQSLKYDRELALYDQRKQSF 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           Y+ FS +GF VKSTGFS+HTQ+HR MKRKGRK VEETTD ASYMAHHN+FSYY       
Sbjct: 181 YKDFSADGFSVKSTGFSNHTQRHRFMKRKGRKMVEETTDAASYMAHHNVFSYY------- 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
                       DKTRNIKR+DINDFGT+ATDGWASSMLG+NDNNLED+FLKIEAAQ KV
Sbjct: 241 ------------DKTRNIKRDDINDFGTIATDGWASSMLGNNDNNLEDIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLA-SSDDPASPEDGNDELVRSLHEASQHISENAL 360
           HELKNRIDKVVNENPMKFS INQLYSLA SSDDPASP DGNDELVRSLHEASQH+SE+AL
Sbjct: 301 HELKNRIDKVVNENPMKFSVINQLYSLASSSDDPASPGDGNDELVRSLHEASQHMSEHAL 360

Query: 361 DVLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL 420
           DVLMPETAIKTHGEVMLLPD+ +S DCGTTQK+LMQDSAVKEELQLS+EVKGQL+E Q  
Sbjct: 361 DVLMPETAIKTHGEVMLLPDMMRSTDCGTTQKVLMQDSAVKEELQLSKEVKGQLVELQNS 420

Query: 421 -EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKA 477
            EQK ISLAA+SQ DLTSKD   DMLHKTKSPSA+KPNSSKKTRKRGRRKIG SKKNRKA
Sbjct: 421 EEQKSISLAAISQADLTSKDKEPDMLHKTKSPSAMKPNSSKKTRKRGRRKIGSSKKNRKA 462

BLAST of CcUC05G081410 vs. NCBI nr
Match: XP_023527403.1 (uncharacterized protein LOC111790643 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 714.5 bits (1843), Expect = 6.0e-202
Identity = 394/482 (81.74%), Postives = 424/482 (87.97%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           MGLEN SKVKEEAL+EV  CVED NAAQ  QSASS QENI DMEA S +R+ ML RSEDM
Sbjct: 1   MGLENGSKVKEEALMEV--CVEDRNAAQDMQSASSGQENIRDMEAPSCKRTMMLDRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           ELDIIGCT+ CEGGPSNE NVSTENSSSFGDT+SGTDYGLLLDDEEVES LYGDNNLQPM
Sbjct: 61  ELDIIGCTDYCEGGPSNERNVSTENSSSFGDTISGTDYGLLLDDEEVESHLYGDNNLQPM 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           S+GYS++FPRKKKLT HWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELAL+DQRKQS+
Sbjct: 121 SDGYSQVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALFDQRKQSV 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           YEHFSTEG DVKSTGFSSHTQ+HRVMKRK RKK EETT+VASYMAHHNLFSYYEKKRSLA
Sbjct: 181 YEHFSTEGLDVKSTGFSSHTQRHRVMKRKRRKKTEETTEVASYMAHHNLFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTF KLDKTRN+KR+DINDFG MATDGWA  MLGDNDN LED+FLKIE AQ KV
Sbjct: 241 DDMSLEDTF-KLDKTRNMKRDDINDFGAMATDGWAPFMLGDNDNYLEDIFLKIEVAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALD 360
           HELKNRI+KVV ENPMKFS+INQL  LASSDDPASPEDGN +LVRSLHEASQ ISE+ALD
Sbjct: 301 HELKNRIEKVVTENPMKFSSINQLCFLASSDDPASPEDGNVDLVRSLHEASQLISEHALD 360

Query: 361 VLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL- 420
           VLMPETAIKTHGEVMLLP + Q+ DCG TQK++MQDSAVKEELQ S +VKG LIEPQKL 
Sbjct: 361 VLMPETAIKTHGEVMLLPAMMQNVDCGITQKVVMQDSAVKEELQFSGKVKGGLIEPQKLG 420

Query: 421 EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKP-NSSKKTRKR-GRRKIGLSKKNRK 477
           EQKI      S+ DLTSK+   +M+ KTK  S VKP +SSKKTRKR GRRK G SK+ RK
Sbjct: 421 EQKI-----ASEADLTSKNKEPNMVQKTKPSSTVKPTSSSKKTRKRGGRRKTGSSKQRRK 474

BLAST of CcUC05G081410 vs. ExPASy TrEMBL
Match: A0A0A0L704 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118120 PE=4 SV=1)

HSP 1 Score: 792.0 bits (2044), Expect = 1.4e-225
Identity = 417/481 (86.69%), Postives = 445/481 (92.52%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           M LEN SK KEE L+EVSRCVED NA Q KQ+ASS QENIHD+EASSFERS ML RSEDM
Sbjct: 1   MELENGSKAKEEVLMEVSRCVEDENATQDKQNASSGQENIHDIEASSFERSMMLNRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           E+D+IGC+ENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ  
Sbjct: 61  EVDVIGCSENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQSN 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGY E+FPRKKKLTAHWRKFISP+MWRCRWLE+QIKKLQ+QSLKYDRELALYDQRKQS 
Sbjct: 121 SNGYGEVFPRKKKLTAHWRKFISPIMWRCRWLEVQIKKLQAQSLKYDRELALYDQRKQSF 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           Y+ FS +GF VKSTGFS+HTQ+HR MKRKGRK VEETTD ASYMAHHN+FSYYEKKRSLA
Sbjct: 181 YKDFSADGFSVKSTGFSNHTQRHRFMKRKGRKMVEETTDAASYMAHHNVFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTFLKLDKTRNIKR+DINDFGT+ATDGWASSMLG+NDNNLED+FLKIEAAQ KV
Sbjct: 241 DDMSLEDTFLKLDKTRNIKRDDINDFGTIATDGWASSMLGNNDNNLEDIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLA-SSDDPASPEDGNDELVRSLHEASQHISENAL 360
           HELKNRIDKVVNENPMKFS INQLYSLA SSDDPASP DGNDELVRSLHEASQH+SE+AL
Sbjct: 301 HELKNRIDKVVNENPMKFSVINQLYSLASSSDDPASPGDGNDELVRSLHEASQHMSEHAL 360

Query: 361 DVLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL 420
           DVLMPETAIKTHGEVMLLPD+ +S DCGTTQK+LMQDSAVKEELQLS+EVKGQL+E Q  
Sbjct: 361 DVLMPETAIKTHGEVMLLPDMMRSTDCGTTQKVLMQDSAVKEELQLSKEVKGQLVELQNS 420

Query: 421 -EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKA 477
            EQK ISLAA+SQ DLTSKD   DMLHKTKSPSA+KPNSSKKTRKRGRRKIG SKKNRKA
Sbjct: 421 EEQKSISLAAISQADLTSKDKEPDMLHKTKSPSAMKPNSSKKTRKRGRRKIGSSKKNRKA 480

BLAST of CcUC05G081410 vs. ExPASy TrEMBL
Match: A0A1S3AUK2 (uncharacterized protein LOC103483139 OS=Cucumis melo OX=3656 GN=LOC103483139 PE=4 SV=1)

HSP 1 Score: 789.3 bits (2037), Expect = 9.2e-225
Identity = 419/481 (87.11%), Postives = 445/481 (92.52%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           M LEN SK KEEAL+EVSRCVEDGNA + KQ+ASS QENI D+EASSFERS ML RSEDM
Sbjct: 1   MELENGSKAKEEALMEVSRCVEDGNATRDKQNASSGQENIRDIEASSFERSMMLDRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           E+DIIGC+ENCEGGPSNECNV TENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ  
Sbjct: 61  EVDIIGCSENCEGGPSNECNVLTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQSN 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGY E+FPRKKKLTAHWRKFISP+MWRCRWLE+QIKKLQ+QSLKYDRELALYDQRKQS 
Sbjct: 121 SNGYGEVFPRKKKLTAHWRKFISPIMWRCRWLEVQIKKLQAQSLKYDRELALYDQRKQSF 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           YE FS +GF VKSTGFS+HTQ+HR MKRKGRKKVEETTDVASYMAHHN+FSYYEKKRSLA
Sbjct: 181 YEDFSGDGFAVKSTGFSNHTQRHRFMKRKGRKKVEETTDVASYMAHHNVFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTFLKLDKTRNIKR+DIND GT+ATDGWASSMLG+NDNNLED+FLKIEAAQ KV
Sbjct: 241 DDMSLEDTFLKLDKTRNIKRDDINDLGTIATDGWASSMLGNNDNNLEDIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALD 360
           HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQH+SE+ALD
Sbjct: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHMSEHALD 360

Query: 361 VLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL- 420
           VLMPETAIKTHGEVMLLPD+TQS DCGTTQK+LMQDSAVKEELQLS+E K QLIE Q   
Sbjct: 361 VLMPETAIKTHGEVMLLPDMTQSTDCGTTQKVLMQDSAVKEELQLSKEDKDQLIELQNSE 420

Query: 421 EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKAT 478
           EQK +SLAA+SQ D +SKD   DMLHK KS SAVKPNSSKKTRKRGRRKIG SKKNRKAT
Sbjct: 421 EQKSVSLAAISQAD-SSKDKEPDMLHKAKSSSAVKPNSSKKTRKRGRRKIGSSKKNRKAT 480

BLAST of CcUC05G081410 vs. ExPASy TrEMBL
Match: A0A5A7U0V8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00610 PE=4 SV=1)

HSP 1 Score: 789.3 bits (2037), Expect = 9.2e-225
Identity = 419/481 (87.11%), Postives = 445/481 (92.52%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           M LEN SK KEEAL+EVSRCVEDGNA + KQ+ASS QENI D+EASSFERS ML RSEDM
Sbjct: 1   MELENGSKAKEEALMEVSRCVEDGNATRDKQNASSGQENIRDIEASSFERSMMLDRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           E+DIIGC+ENCEGGPSNECNV TENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQ  
Sbjct: 61  EVDIIGCSENCEGGPSNECNVLTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQSN 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           SNGY E+FPRKKKLTAHWRKFISP+MWRCRWLE+QIKKLQ+QSLKYDRELALYDQRKQS 
Sbjct: 121 SNGYGEVFPRKKKLTAHWRKFISPIMWRCRWLEVQIKKLQAQSLKYDRELALYDQRKQSF 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           YE FS +GF VKSTGFS+HTQ+HR MKRKGRKKVEETTDVASYMAHHN+FSYYEKKRSLA
Sbjct: 181 YEDFSGDGFAVKSTGFSNHTQRHRFMKRKGRKKVEETTDVASYMAHHNVFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLEDTFLKLDKTRNIKR+DIND GT+ATDGWASSMLG+NDNNLED+FLKIEAAQ KV
Sbjct: 241 DDMSLEDTFLKLDKTRNIKRDDINDLGTIATDGWASSMLGNNDNNLEDIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALD 360
           HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQH+SE+ALD
Sbjct: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHMSEHALD 360

Query: 361 VLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL- 420
           VLMPETAIKTHGEVMLLPD+TQS DCGTTQK+LMQDSAVKEELQLS+E K QLIE Q   
Sbjct: 361 VLMPETAIKTHGEVMLLPDMTQSTDCGTTQKVLMQDSAVKEELQLSKEDKDQLIELQNSE 420

Query: 421 EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLSKKNRKAT 478
           EQK +SLAA+SQ D +SKD   DMLHK KS SAVKPNSSKKTRKRGRRKIG SKKNRKAT
Sbjct: 421 EQKSVSLAAISQAD-SSKDKEPDMLHKAKSSSAVKPNSSKKTRKRGRRKIGSSKKNRKAT 480

BLAST of CcUC05G081410 vs. ExPASy TrEMBL
Match: A0A6J1IQ89 (uncharacterized protein LOC111479028 OS=Cucurbita maxima OX=3661 GN=LOC111479028 PE=4 SV=1)

HSP 1 Score: 708.4 bits (1827), Expect = 2.1e-200
Identity = 391/483 (80.95%), Postives = 423/483 (87.58%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAKQSASSCQENIHDMEASSFERSTMLGRSEDM 60
           MGLEN SKVKEEA +EV  CVED NAAQ  QSASS QENI DMEA S +R+ ML RSEDM
Sbjct: 1   MGLENGSKVKEEAFMEV--CVEDRNAAQDMQSASSGQENIRDMEAPSCKRTMMLDRSEDM 60

Query: 61  ELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPM 120
           ELDIIGCT+ CEGGPSNE NVSTENSSSFGDT+SGTDYGLLLDDEEVES LYGDNNLQ M
Sbjct: 61  ELDIIGCTDYCEGGPSNERNVSTENSSSFGDTISGTDYGLLLDDEEVESHLYGDNNLQSM 120

Query: 121 SNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSI 180
           S+GYSE+FPRKKKLT HWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELAL+DQRKQS+
Sbjct: 121 SDGYSEVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALFDQRKQSV 180

Query: 181 YEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLA 240
           YEHFST G DVKSTGFSSHTQ+HRVMKRK RKK EETT+VASYMAHHNLFSYYEKKRSLA
Sbjct: 181 YEHFSTVGLDVKSTGFSSHTQRHRVMKRKRRKKTEETTEVASYMAHHNLFSYYEKKRSLA 240

Query: 241 DDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKV 300
           DDMSLED F KLDKTRN+KR+DINDFG MA DGWA SMLGDNDN LE +FLKIEAAQ KV
Sbjct: 241 DDMSLEDAF-KLDKTRNMKRDDINDFGAMAADGWAPSMLGDNDNYLEHIFLKIEAAQSKV 300

Query: 301 HELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHISENALD 360
           HELKNRI+KVV ENPMKFS+INQL  LASSDDPASPEDGN +LVRSLHEAS+ IS++ALD
Sbjct: 301 HELKNRIEKVVTENPMKFSSINQLCFLASSDDPASPEDGNVDLVRSLHEASRLISKHALD 360

Query: 361 VLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLIEPQKL- 420
           VLMPETA+KTHGEVMLLP   Q+ DCG TQK++MQDSAVKEELQ S +VK +L+EPQKL 
Sbjct: 361 VLMPETAMKTHGEVMLLPATMQNVDCGITQKVVMQDSAVKEELQFSGKVKDRLVEPQKLG 420

Query: 421 EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKP-NSSKKTRKR-GRRKIGLSKKNRK 478
           EQKII     SQ DLTSK+   +M+HKTK  SAVKP +SSKKTRKR GRRK G SK+ RK
Sbjct: 421 EQKII-----SQADLTSKNKEPNMVHKTKPSSAVKPTSSSKKTRKRGGRRKTGSSKQRRK 475

BLAST of CcUC05G081410 vs. ExPASy TrEMBL
Match: A0A6J1CVZ4 (uncharacterized protein LOC111014899 OS=Momordica charantia OX=3673 GN=LOC111014899 PE=4 SV=1)

HSP 1 Score: 679.1 bits (1751), Expect = 1.3e-191
Identity = 371/488 (76.02%), Postives = 415/488 (85.04%), Query Frame = 0

Query: 1   MGLENESKVKEEALVEVSRCVEDGNAAQAK-----QSASSCQENIHDMEASSFERSTMLG 60
           MG E   KVKEEAL+EVSR +ED NAAQ +     QSASSCQ+ I DMEA S  R+ ML 
Sbjct: 1   MGPEIVPKVKEEALMEVSRGMEDKNAAQEEKNNFLQSASSCQDKILDMEAISVGRTVMLD 60

Query: 61  RSEDMELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDN 120
            S++MELD+IGC++NC+ GP+ ECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYG +
Sbjct: 61  GSDNMELDVIGCSDNCDEGPNGECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGGD 120

Query: 121 NLQPMSNGYSELFPRKKKLTAHWRKFISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQ 180
           NL+ MSNGY E+FPRKKKLT HWRKFISPLMWRCRWLELQIKKLQSQ+ KYDRELALYDQ
Sbjct: 121 NLRRMSNGYREVFPRKKKLTVHWRKFISPLMWRCRWLELQIKKLQSQAFKYDRELALYDQ 180

Query: 181 RKQSIYEHFSTEGFDVKSTGFSSHTQKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEK 240
           RKQS+Y +FS EGFDVKS GFSSHTQ+HRVMKRK RKK EETTDVASYM HHNLFSYYEK
Sbjct: 181 RKQSVYGNFSMEGFDVKSIGFSSHTQRHRVMKRKRRKKTEETTDVASYMGHHNLFSYYEK 240

Query: 241 KRSLADDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEA 300
           KRSLADDM+LEDTFLKLDKT+N++R DIN FGT AT+GW  SMLG NDN LED+FLKIEA
Sbjct: 241 KRSLADDMALEDTFLKLDKTKNMRRYDINYFGTNATEGWEPSMLGGNDNILEDIFLKIEA 300

Query: 301 AQLKVHELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPEDGNDELVRSLHEASQHIS 360
            Q KVH LKNRIDKVVNENPMKF++INQL  L SSDDP+SPEDGND LVRSLHEASQHIS
Sbjct: 301 VQSKVHMLKNRIDKVVNENPMKFASINQLNFLESSDDPSSPEDGNDALVRSLHEASQHIS 360

Query: 361 ENAL-DVLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSAVKEELQLSEEVKGQLI 420
           E+A  DVLMPE+A K HGEV+LLPD+ QSADCG+T+K+ MQ+ AVKEELQLSEEVKGQ I
Sbjct: 361 EHAFGDVLMPESANKNHGEVILLPDMIQSADCGSTRKVQMQNCAVKEELQLSEEVKGQSI 420

Query: 421 E-PQKL-EQKIISLAAVSQVDLTSKD---DMLHKTKSPSAVKPNSSKKTRKRGRRKIGLS 478
           E PQ+L EQK I  AAVS+ DL SK+   ++ H TK  SA KPN SKKT+KRGRRK G S
Sbjct: 421 EQPQELEEQKTIPPAAVSEADLASKNTEPNLQHDTKPLSAAKPNPSKKTKKRGRRKTGSS 480

BLAST of CcUC05G081410 vs. TAIR 10
Match: AT3G59670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37440.2); Has 77 Blast hits to 77 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 73; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 189.1 bits (479), Expect = 8.1e-48
Identity = 136/356 (38.20%), Postives = 202/356 (56.74%), Query Frame = 0

Query: 46  SSFERSTMLGRSEDMELDIIGCTENCEGGPSNECNVSTENSSSFGDTVSGTDYGLLLD-- 105
           +S E  T +   E++++DI+   EN       + N +TE SSSF DT S  +  +LLD  
Sbjct: 44  TSEETVTSVSGGEELDVDIVESDENKTSTTDEDPN-ATEYSSSFSDTAS-ENAEMLLDGL 103

Query: 106 --DEEVESQLYGDNNLQPMSNGYSELFP-RKKKLTAHWRKFISPLMWRCRWLELQIKKLQ 165
             + EVES  + + +L P  + +S +F  RKK+LT HWR+FI PLMWR +W+EL+I++L+
Sbjct: 104 TGEAEVESHYWDETDLGPAYDSFSSIFHFRKKRLTNHWRRFIRPLMWRSKWVELRIRELE 163

Query: 166 SQSLKYDRELALYDQRK------QSIYEHFSTEGFDVKSTGFSSHTQKHR-VMKRKGRKK 225
           S++L+Y +EL LYDQ K       S+ E   + G  +KS  FS+   K R   KR+ RKK
Sbjct: 164 SRALEYPKELELYDQEKLEANIDPSVLE---SCGEGIKSLPFSNPCYKKRAAKKRRKRKK 223

Query: 226 VEETTDVASYMAHHNLFSYYEKKRSLADDMSLEDTFLKLDKTRNIKREDINDFGTMATDG 285
           VE T D+ASYMA HNLFSY E KR  +D M L D F      R+   E ++       D 
Sbjct: 224 VESTDDIASYMACHNLFSYIETKRLSSDGMGLADDFGDAKDPRSDSNEPVD-----LDDA 283

Query: 286 WASSMLGDNDNNLEDVFLKIEAAQLKVHELKNRIDKVVNENPMKFSAINQLYSLASSD-- 345
            +     D D+ LE+V  KIE    +VH LK ++D V+++N  +FS+   L  LA+S   
Sbjct: 284 DSLFHHRDGDSVLEEVLWKIELVHSQVHRLKTQVDVVLSKNTARFSSSENLSLLAASSAP 343

Query: 346 DPASPEDGNDELVR--SLHEASQHISENALD--VLMPETAIKTHGEVMLLPDITQS 384
            P     GN +++   +++ ASQH+++  L   V   E  I ++G+   +PDI +S
Sbjct: 344 SPTVSAGGNGDVISFGAIYNASQHMADYGLGDIVFSSEGVISSYGDAFHIPDIIES 389

BLAST of CcUC05G081410 vs. TAIR 10
Match: AT4G37440.2 (unknown protein; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50040.1); Has 121 Blast hits to 117 proteins in 32 species: Archae - 0; Bacteria - 6; Metazoa - 13; Fungi - 5; Plants - 66; Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink). )

HSP 1 Score: 172.2 bits (435), Expect = 1.0e-42
Identity = 130/361 (36.01%), Postives = 198/361 (54.85%), Query Frame = 0

Query: 42  DMEASSFERSTMLGRSEDMELDIIGCTENCEGGPSNECNVSTEN-SSSFGDTVSGTDYGL 101
           D++A +  +  +    ED E+DI+ C +N E   S  C+  T+  SSSFG T S  +   
Sbjct: 54  DIDADASIKKEVAEFDED-EVDILECNDNIEIQVSG-CDDGTDGYSSSFGGTDSEHE--- 113

Query: 102 LLDDEEVESQLYGDNNLQPMSNGYSELFPRKKKLTAHWRKFISP-LMWRCRWLELQIKKL 161
             +D+EV+S +  + +L         L+ RK+KLT HWR+F+ P LMWRC+W+EL+ K+L
Sbjct: 114 --NDQEVDSMICNETSL--------PLWVRKRKLTDHWRRFVQPTLMWRCKWIELKYKEL 173

Query: 162 QSQSLKYDRELALYDQRKQSIYEHFSTEGFDVKS-TGFSSHTQKHRVMKRKGRKKVEETT 221
           Q+Q+ KYD+E+  Y Q K+   E+  +E   VK+      +TQK R+MKRK RK+VEET 
Sbjct: 174 QNQAQKYDKEVEEYYQAKKLELENVKSEELGVKALPPLPCYTQKTRLMKRKTRKRVEETA 233

Query: 222 DVASYMAHHNLFSYYEKKRSLADDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSM 281
           DV SY ++HNLFSYY+ ++SLA D++L D    LDK     +++     T  ++      
Sbjct: 234 DVTSYASNHNLFSYYDCRKSLA-DIALNDNSRNLDKKNKSAKDE-----TAFSEETPPLE 293

Query: 282 LGDNDNNLEDVFLKIEAAQLKVHELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPED 341
             + D  LE + LKIEAA+ +   LK R+DKV++ENP  F   N +  L ++D   S E 
Sbjct: 294 FREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENPSIFPLANTVNPLGAADVYTSSEQ 353

Query: 342 ----------------GNDELVRSLHEASQHIS---ENALDVLMPE-TAIKTHGEVMLLP 380
                             ++ V+S   +S H+S   +   D+L+ E  A K      ++P
Sbjct: 354 QKPLLAIKNEDEKSIISEEKPVKSASVSSHHVSPEDDETTDILLSEILASKRREGKSIIP 393

BLAST of CcUC05G081410 vs. TAIR 10
Match: AT4G37440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50040.1); Has 220 Blast hits to 205 proteins in 55 species: Archae - 0; Bacteria - 15; Metazoa - 50; Fungi - 11; Plants - 76; Viruses - 3; Other Eukaryotes - 65 (source: NCBI BLink). )

HSP 1 Score: 164.5 bits (415), Expect = 2.1e-40
Identity = 143/445 (32.13%), Postives = 232/445 (52.13%), Query Frame = 0

Query: 42  DMEASSFERSTMLGRSEDMELDIIGCTENCEGGPSNECNVSTEN-SSSFGDTVSGTDYGL 101
           D++A +  +  +    ED E+DI+ C +N E   S  C+  T+  SSSFG T S  +   
Sbjct: 54  DIDADASIKKEVAEFDED-EVDILECNDNIEIQVSG-CDDGTDGYSSSFGGTDSEHE--- 113

Query: 102 LLDDEEVESQLYGDNNLQPMSNGYSELFPRKKKLTAHWRKFISP-LMWRCRWLELQIKKL 161
             +D+EV+S +  + +L         L+ RK+KLT HWR+F+ P LMWRC+W+EL+ K+L
Sbjct: 114 --NDQEVDSMICNETSL--------PLWVRKRKLTDHWRRFVQPTLMWRCKWIELKYKEL 173

Query: 162 QSQSLKYDRELALYDQRKQSIYEHFSTEGFDVKS-TGFSSHTQKHRVMKRKGRKKVEETT 221
           Q+Q+ KYD+E+  Y Q K+   E+  +E   VK+      +TQK R+MKRK RK+VEET 
Sbjct: 174 QNQAQKYDKEVEEYYQAKKLELENVKSEELGVKALPPLPCYTQKTRLMKRKTRKRVEETA 233

Query: 222 DVASYMAHHNLFSYYEKKRSLADDMSLEDTFLKLDKTRNIKREDINDFGTMATDGWASSM 281
           DV SY ++HNLFSYY+ ++SLA D++L D    LDK     +++     T  ++      
Sbjct: 234 DVTSYASNHNLFSYYDCRKSLA-DIALNDNSRNLDKKNKSAKDE-----TAFSEETPPLE 293

Query: 282 LGDNDNNLEDVFLKIEAAQLKVHELKNRIDKVVNENPMKFSAINQLYSLASSDDPASPED 341
             + D  LE + LKIEAA+ +   LK R+DKV++ENP  F   N +  L ++D   S E 
Sbjct: 294 FREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENPSIFPLANTVNPLGAADVYTSSEQ 353

Query: 342 GNDELVRSLHEASQHISENALDVLMPETAIKTHGEVMLLPDITQSADCGTTQKLLMQDSA 401
               L     +    ISE   +  +   ++ +H    + P+  ++ D   ++ L  +   
Sbjct: 354 QKPLLAIKNEDEKSIISE---EKPVKSASVSSH---HVSPEDDETTDILLSEILASKRRE 413

Query: 402 VKEELQLSEEVKGQLIEPQKLEQKIISLAAVSQVDLTSKDDMLHKTKSPSAVKPNS---- 461
            K  +     VK +    ++   + +        ++ +K++   K +  S  KP S    
Sbjct: 414 GKSIIPDKNLVKTEQASIEEGPSRPVRKRTPRNREIITKEESNPKRRRVSREKPKSNAVM 471

Query: 462 ----SKKTRKRGRRKIGLSKKNRKA 476
               S + RKRG+R+ G +   R++
Sbjct: 474 ASRFSNRKRKRGKRRSGSAGLRRRS 471

BLAST of CcUC05G081410 vs. TAIR 10
Match: AT3G50040.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37440.2); Has 70 Blast hits to 70 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 127.9 bits (320), Expect = 2.2e-29
Identity = 89/252 (35.32%), Postives = 135/252 (53.57%), Query Frame = 0

Query: 85  NSSSFGDTV---SGTDYGLLLDDEEVESQLYGDNNLQ-PMSNGYSELFPRKKKLTAHWRK 144
           +SSSFGD++    G D+G     +E +S L  D  L     +G   L   KKK    WR+
Sbjct: 61  SSSSFGDSMCARDGDDFGF---GDEAQSMLSNDYPLPGTCDDGTEFLGLPKKKTNDRWRR 120

Query: 145 FISPLMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSIYEHFSTEGFDVKSTGFSSHT 204
              P+MWRC+W+EL++K++QSQ+  Y++E+  Y   KQ   E    EGFD KS  F  + 
Sbjct: 121 LTKPIMWRCKWIELKVKEIQSQARGYEKEVKDYYLTKQFDLEKSKLEGFDGKSIPFRENN 180

Query: 205 QKHRVMKRKGRKKVEETTDVASYMAHHNLFSYYEKKRSLADDMSLEDTFLKLDKTRNIKR 264
           Q+  V KR  RK+VEETTDVA+YM++HNLFSY +K+  +       D+     +    K+
Sbjct: 181 QRRNVFKRGRRKRVEETTDVAAYMSNHNLFSYADKRVPVNVKGQYLDSDFGTGRKATGKQ 240

Query: 265 EDINDFGTMATDGWASSMLGDNDNNLEDVFLKIEAAQLKVHELKNRIDKVV-NENPMKFS 324
           + I D   +       S L  +D+ L     KI+ AQ K   L+ R+D+++ +  P   S
Sbjct: 241 DAIEDDSLI-------SELDCSDDVLAKFLCKIDEAQGKARRLRKRVDQLMWDSQPAHTS 300

Query: 325 AINQLYSLASSD 332
           ++ Q+ +    D
Sbjct: 301 SMPQMVAPCHRD 302

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886802.12.6e-23791.06uncharacterized protein LOC120076910 [Benincasa hispida][more]
XP_004133783.12.9e-22586.69uncharacterized protein LOC101222847 isoform X1 [Cucumis sativus] >XP_011650651.... [more]
XP_008437823.11.9e-22487.11PREDICTED: uncharacterized protein LOC103483139 [Cucumis melo] >XP_016898997.1 P... [more]
XP_031737513.12.4e-21182.74uncharacterized protein LOC101222847 isoform X2 [Cucumis sativus][more]
XP_023527403.16.0e-20281.74uncharacterized protein LOC111790643 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L7041.4e-22586.69Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118120 PE=4 SV=1[more]
A0A1S3AUK29.2e-22587.11uncharacterized protein LOC103483139 OS=Cucumis melo OX=3656 GN=LOC103483139 PE=... [more]
A0A5A7U0V89.2e-22587.11Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1IQ892.1e-20080.95uncharacterized protein LOC111479028 OS=Cucurbita maxima OX=3661 GN=LOC111479028... [more]
A0A6J1CVZ41.3e-19176.02uncharacterized protein LOC111014899 OS=Momordica charantia OX=3673 GN=LOC111014... [more]
Match NameE-valueIdentityDescription
AT3G59670.18.1e-4838.20unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G37440.21.0e-4236.01unknown protein; LOCATED IN: cellular_component unknown; BEST Arabidopsis thalia... [more]
AT4G37440.12.1e-4032.13unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G50040.12.2e-2935.32unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 286..313
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 454..477
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 436..477
NoneNo IPR availablePANTHERPTHR34057ELONGATION FACTORcoord: 22..475
NoneNo IPR availablePANTHERPTHR34057:SF10BNAA08G15670D PROTEINcoord: 22..475
IPR038745AT4G37440-likeCDDcd11650AT4G37440_likecoord: 56..313
e-value: 6.279E-81
score: 249.647

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC05G081410.1CcUC05G081410.1mRNA