Tan0002976 (gene) Snake gourd v1

Overview
NameTan0002976
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotransposon protein
LocationLG08: 25367035 .. 25394327 (+)
RNA-Seq ExpressionTan0002976
SyntenyTan0002976
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGACAACTGGTGGTTTAGAAGGGATTGAATACGTAGATGTTGAGGAAATGGTTGCAATGTTCTTACACATCCTAGCTCATGACGTCAAGAATCGAATAATTCGTACACAATTTGCAAGGTCTGGTGAGACAGTATCTAGGCACAATTCTGTACTTAGCGCAGTATTGCAACTCCATGAAATTCTACTGAAGACTCCAGAGCCAATCACAAATTCTTGTACTGATTCTAAGTGGAAATGGTTTGAGGTATTCATATTTTTAATGTCCCTCAATTACATGATGTAGTTAGATTTAAAATGATATACGTAACTCTTGTTCGCCCTGTCTTTCCCAAAACAGAATTGCTTGGGTGCATTGGATGGCACATACATTCAAGTCAATGTAAGTGTCGTCGATCGCCCGAGATATAGGATGAGGAAGGGTGAAATTCCAACGAATGTCCTTGTTGTTTGTTCACCAAGTGGAGAGTTCATATTCGTTCTACCAGGATGGGAGGGGTCTGCGGCTGATTCACGTGTGCTTAGGGATGCAATATCAAGACCTAATGGGTTGAAAGTCCCCAAGGGTAATAAAATCAAAAATTAAACCTTATTGCATTTGATTCTTTGAAATCAACCACTAATAGGGTCATGTGTGCAATGGTTCACATCAACAGAGTATTATTACCTATGTGATGTTGGGTACCCTAATGTAGAGGGATTCTTGGCGCCGTATAGGGGAGAGCGCTACCATCTCACTGACTGGCGTGGGGCAAGGAATGCACCAACTACTCTGAGAGAGTTCTTTAACATGAAACATTCATCTGCGAGGAACATAATCGAGAGAGCATTTGGTGTGTTGAAAGGAAAATGGGGCATTTTGTTGAGAAAATCATTCTACTCTGTTTGCACACAATGTCGAACCATTACAGCATGTTGCCTCCTCCATAATCTTATCAATCGAGAGATGGGTTCAGTGGTAGGGTGTTGGGATGTATGTTCTAATCTCGTTTTGTTTGTAATATTACCATTGGGAATACATAAGAGTTATTTCTTTCATCACTTTGTGTCTCACAATTTTACAAACTTTGTCCAATAAACTAAGCTCCAGGGTCATTAAATTTATGAGAGAACTAGAACTTGAACGGTGTGTAGTGGACACAGAGGGAGAATCACGTTCGAGTTAATGACTGAACGGTCTACAGTATATGGATATGGTTGGAGACCTAATCCTGGATATGTTGTGGATGCGGCCCGCTTTGTATTAGATACAAACGAGTTGATCCGACTCGTTCATGTGGCTGACATGCGAGTGAGGACATCCTGTGCCATGAGATTGTACAAGACCGGATCGTGAATTATTAGACCTACTGATGTAACACCGTTAACAGTATTGGACTAGTAACTTCGTAGATGACCAATAGTGACTTGACCATAATCCTAAGTGTTGGGTTTTAATGTCCTAAAACTCGTGATTTTGTAAACTGAATGTATATTCTATTTCACAATAAAGTTGTTATTGAAATTTATTCAGTAAAGGTGTTATTGACTATTGTGTATTGTGTAACCTTAATCCAATAAAGAACTTTATGTGTGACATAAAAGTGGATCAAGTTCAAGTTATAGCCAGAACAGTCTATAGTATAAGGATAGGGTTGGGCACCTTATCCTGGGGACAATATAGACGCGACCCACTTTGTAAAGTACAAATGATGTGATCCTAAATCGTTCATGTGGAGACATGTGAGTGGGGGCACCCTATGCAATGAGTTTGCATAAGACTGGACCGAAATAGTCACTTAGACTTTATAACTCCGTCTACTGTCTTAACTGACTATATTCATACGATGACCTAGGTAACTCGATCTTAATCCTGAGCGTTCTATGGACTCCTGTTTATTCGGATTATCCTTAGATTTGCATGGGTGAGAGTGGCTCGAGTTGCGACTCAATAAGCCTCCCATTTCAGGGACAAAACCGGGTAGATAGCTGGGAACATAGTTCTGCAAGATGGAATTCACTCCTACCCGCTTTAGGGATAAGTAGAGAGATTGTTCCCTTAAGTGCTGACTCCCGGTCTTGAACAATGGGCCCACCCCTCTCATTGGCCTGAGAGGGATTCGGTTTGCTGGTTGGACCATAAACCAATTGTTCATTAGAGGATCAGTGGTACTTAAGGAACAAAAGGTAATTACAGGGGTAAAACGGTATTTTTGACCCAACTGTAATTACAAACGACTTGTGAAGGATCGACTTACTAATCATGGTTATATCGAGTGGACAGAAATATATCTACAGTGAGGAGAATGCAGCCACTAGGCTTTAGTGGAGTGTCCCGGTTGTTAACGAATGTTGATTAGCTAGGCTAGAGAGTTTGGCCGGTTATTCTCGTATCGTTGGAGCTCATGATCTGTAGGTCCATTAGGTCCTCCTTACTAGCTCAGACTTAGCCTTATAACATTTTGAGAGAAAAGAATTTGAATTGTTCAAATTCGAGTTTGAAAGAAAGCGTTAATTATATGTTATATATTTAACGAATCATAGCTTAATTATGAATTAAACTATGATTAATAAGAGGTGGATATATTTAAATATGATTTAAATATCAGATCGGTTTATAGCATGAATAAGGATTCATGTTCAGAAAAACAATCGAGATCGGGCAATTTATTTAATTTAATATTTGATATTAAATTAAATTAAAAATTAATTAATTAAATTAATTTTGAATTATTTAATTAATTATATTAATTTTAATTAAAAATCAATTTAATATTGATTTTCGAATTAAAATTAAATGAAAAATAAATAGATTTTCATGATTTTCATGAAAATCAACTTGACAAGTTGCCTAATCCTCATCATCTTCAAAATTGTCCACCTCCACCTCAAAGCATCCTCATCACCCTTGAGTTGTCACTCCATTGAAGCTTATTGAAGCTTTATATAATGGTGATTTGAGAGATGTTGTGAGAATCTTACAGAAAATTCAATATTCCCGAATTAGTGACCTTTGAGGTCTTTTCTTCCAAATTCTCTCAACCTAGGATCCCACAAACTGTTCTAAGGCCTGAGGATAGTAGGGAAGACTCTAACGGTTGTCCACGAAGTGTTCGTGTTGAAAACCATCCACCAGGTGAAGAGAATTTGATCTTCAAAGTTGAGTTTTCTATCCCTCTGTTTTTCCAGCTTTGAGCATGTTTTTTATTTTACAAAATTGATCAACTTAGAGTGTTCAAGATCCAAAATTGCTTCCGCTTTCTTAATCTATACTCTTTCACTAAGCGTGTTACGGACTCCTATCTGCGGGGGTCTGTCCTTTGACTAGTACGGGTGAGGGTGGTCACAGTTACTCGCCCAATATGTCTATCTTCTCAAGGACAGGACCACGTGGGGAGCTGGGAACTTGACTACGCGAGATGGAATTCACTCGTTCCCGACTTTAGGGAAACTAGAGGGGTTGTTCCCTTAAATGCGGCCTCCAGGGCTTGGACAACGGGCGCCCACCCTCTCCCTGGCCCGAGAGGGATGTTGTATATGGTTGGACAATTACAATGTTGTTTTTCATTAGAGGAGCAGTGGGGACTTAAGGAGCAAAAAGTACACACAGGGGTAAAACGGTAATTTGACCCAACAGTGTCTACGAACAACCTGTGAAGGGTCGACTCATCGGCATTGGTTATATCAGTGGACACAACTTGTCCTACAGTGACGGGAGTGCAACTACGGGCTTTAGTGGAGTGTTACCGTCGCTAATGAATGTTGATTAACCAGGTCAAAGAGTTTGACAGGTTAGTCTCGGATCATTGGAACTCGTCACCTTTAGGTCCTTAAGGTCCCTCGGTTGGCTCAAACTGGATAAATTGAGGGTTTACAGTTGGTGTAGATTTGAAGTGTTCAAATTCTATATAAGGGTATTTATGTCATTTATATGAGATATGATTGACATATTATATAGTTTAATGTGAATGAGATTCATAATAAACTATAGGTAGCCTAATTGCTTAATTAGGGCTAATTTGGAGAAATTGGATTGAGAGAAGTTAAATTGAATGTGATTAATTTAAAATCCACAATTATACAAATGTGATTTGTATAATAGCATATGAGATATCTTATTATATGTGGTTATGCAAATATGATTTGCATAACCTATGGTTGTGGTTAAATGGTAAATAGGTGGAGATTTCAACTCCACCTATCACCATTATTTAAACTCCTTTGCAAGAAGGATTTCATACTTGAAAGTTCATTTTTTGGATTTCTCACCAAAAGAACTCAAGAATTCTCTCTCAAATCTCTCCTCCCTTTCACCAAAAAGGAGTCCCACATACTCTCCTTCTACCTAAAAGAGAACAACCGAGAAGCCTATCGGAGGTGTCCGGCTCGTTGGTTTGTCCAAGGAGATCGAGTTCGTTGAGATCGTGTTCGCGAAGAGTTCATGATGGAGGAGGAAAACGTGAAGAACGGTTCTTCAACAAGTAAGTATTCTTTACTGTTTTTCCTTTCTAAAGCATGTTTAATTTAGTTTTTATGTGATAGAAACAATTTTAAGAAAACATTTTTCACAATGATACCACAGTGATTCAATCGCTTTTTCGCTGCGTAGGATTTTCATTCCTTCAATTGGTATCAGAATCGTTGGCTTTTCGTTGTGCAAATGTTTTTCTTAAAATTATGTGGGTATTGCTTCAATTCATGAGATGAAATTGTTGGTTTTGCATAATTGGATGTTTTCAGTTTTTGCACTAGTTTTAGGTTTATTTTCTGTTGATTAGATCAATGTAAAGACCCATGTTTTTGGGGCGTTTAGAAAATAAATCGAGTCTGTAAGTCCGGGTCTTTGTCGGCAAGAGTTGCTGTGAAGAAATCGGAGCAGCAAACGCGAAAAATCAAGAAAACAAAGAAAGGAGTTTGTTATTCAATAGATGGCATTGAGACCCTGTTCTTCGGCGTCTCAACGTCGTCCGCGCGGGCCAAGGCTGATCTCCAGAGCAGCGTTGCAACGCCGAGGCATGGAGCACGCGAGGTGCAACGCCGAAGCCGCAAATACGTTACTAACGCTACCCACGGCGTTACAACGCGTAATGCGTGATGGGCGCCCAATGCGCGCTTTCTTTGGTTGCAAGTGTTGCACTTCGTGGGATGTGATTTTAATTTCTTTTATTTATTTTTAATTTATAAAATGTATTTTTCTTTTAAATTAAATTAAATTTATTAATTAATATTAATATAATAGTTATATTAATTTTAATTAATTCATTTAATGTCCCCTAAGTGCCAAAACCAGTCCAATTTTTGCTATTTATTTATTTTATGCATGAGATGTATGAAATATTTAATTCTATGTGCATAATGTTTTTTATATACATTTAAAATCTCAACTTAGGTTAAACATTTCATGCATCATATTTATATTATAAGTGTTATAATATATAGTATGCATGTTAGGTTATATTATAAGTGTTATAATATAATGTATGCATGTTTTAATTTATTTAAATATAAGTGTTATATTTTTAGATTAAAAGCATGCTCATGCATAATTCATGTGATGAAGGTATGTTATAGTATATATAGTATTATTCATGTGATAAATGATCAAGCACTATGACATAGAAATGCATGTAACCTAAGTTAATTATTAGTTTTTGGACCGGTTCGGCTCGATTTTGGTCCGGTTTGGGTGGTTTTAGACCGGTTCAGACTATTTCGAGGTCGATTTTTTTATTTTGGAGGTCCGTTTCACCAATTGAAGGCCCGGTTCAGCCTTTTACAGCCCAGTTTAACCTTTTGAAGGTCCGGTTCACACATTTGTAAGTCGGTTTGACCCAAATTCGAAGTGTTCGAAGTGGTTCGATATTATTTCTAGATTATTATTGTAATAATTTAGTAGTTTGCTTGCATTAGATGCCAAAACCAATCCACTTTAGTGTGTTTAATTAATTATGTATTGTGAATGTATGTTTTAATTAAAATCCCTGATTGTATGTGCATAAAGTATGCCATATAGATTTTAAAATCCCACCATAGGAAGCATGCTCCATGCATTGAAAGTATGTTATAAGTGTTATAATATATTTTATGCATGTTTATTGGGTTTCATAAGAATTAATATAAGTGTTATATTAATTTATGAAACAAATATTATATGTTTTAATTATAATTAATATAAGAGTAATATTAGTTAAAAATTAAACATATAATTGCTATGCATTTATTCATGTGATGAATTTGTTATAAATGTATGTTTATTCATGTGATGAATGATGAAACAATTTATAACATAGAAATGCATGTAACCTAAGGTTAATTAATTAGTTTTAAAATGGTTTTAGAATTGGAATTAATTAGACCTTAGGTTATCTAGTTCTAATAAGATTAAAATTAGGAAACCAATAATTTATAAACTTTGATTAAAGAGGGACTCTGACTAAGGAAGGTTCTGGCTAGGTTGGGGTACTTAAGCTGACGGAAACGGAACACCCTTACCTGGGAACTGACCTGAGAGTGAATTTAATCCATAGGTTTATAAATTATCTTTCGTGAGTGGCGTTTATTAAAAGTTAATATATTAAACTCATAGACCTAGGTTTATTAAAAGGTTAATAGACTTAGGTAATTCCAGTATAATTGTTATATTGGTAGACTTAGGTCATATTGTTTTAGCCAAATATGCACCGAGTAAAAGTATACCCAAACAGCTTTAAAATACTTAGTGGGAGTGAAAAGGTATATATGATATATCAATTTCCTCTTACGCTCTCCACGGTTCACACCGTGAGATTTATGCTCGGCCTCGTGATGTCAATAGGGCATCCCCATTCGGATGGTGTTTGCATAAATCAAGAGCAAGGTGAATGGAGAAAGTATTTATAGTAAGTGAGCGAAGGAAGTGTGTCAACACGTCCTACGGTCTCCTCCATTAGGTTGGACCGTGAGATTCCCATGTTGCGCTGCAGTTGTTGCCCGAAGTAGCACCATCCATTCTAAGGATGTAACATGGGATTTGGAACAACTCAAACTCCAAAAATGGATAGGGTCTCTTAGGTCCATTCCCAGCTTTGTCTCTCCAGTTCGGTAGCATTGTTGGGGCCGACCTCTGAGGTCTGAAAATGGCGGGTCACACTTACAAGAATTGTTAAGTGTTAGTAAGTTCTTGACCAAAGTAGCGATAACTAGAATGTTATAGGAATAAGAGTTATTCCTAGATTAGCATTTAGTCAAGAATGTCTTGATTCAGATGAAGGAGTGATTGATCGCTCCTCAGAGCGCGTTGCTCGACTCACGAAGTATCGTTGCAAAAATAATAATGAGTTTATGGATACATTATTACTTTTGCTAAATTCTTAAATTGGATAGGTTACAAACAATTCACTAATAGAATTCATTCTTTTTCAGCATGTCGAATTCTTTTATTCGCCACTCGCCTTCAATAAACTTAGCGGCGATAATTATGGAACTGGAAATCAAACTTGAATACGATTCTTGTTCTTGATGATCCGAGGTTCGCCTTTAACGGAGGAATGTCCTCCCCTCCCACTTCGATCGCAAACCCAATTGTTCGGGATGCTTATGACAGATGGATTAGAGCTAATGAGAAGGCCTGGGTCTATATCTTAGCCAAAGATATCTGATGTGTTGTCTAAGAAACATGAGACCATGGTCACCGCAAAGGAGATCATGGGATCATTACAGGCGATGTTTGGACAACCATCCTCAACGGTCCATTATGATGCTGTCAAATACGTTTACAACTCCCGTATGAAGGAGAGAGCCTCTGTTAGGGAACATGCCCTTGACATGATGACCCATTCAACGTGGTTGAAGTAAATGGGGCAGTCATAGATGAGAAAAGTCAGGTAACCTTTATTATGAAATCTCTTCCGAAGAGTTTCTCGCCATTCCGCACAAAATGCGGTGATGAATAAAATAGAGTATAACCCGACTACTCTCCTCAACGAGCTCTGCACTTTTGAGTCCTCATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTGAAGAAGTTCCTAAGAGGATCGCCCTCTGGGACCAAAAGCAGTCCTTCTTTTCTAAGAATAAGGGTATTCGTAAAAGAAGAAAAAGGACAAAGGAAGGGGACAGGCTCCCACACGCATGCAAGGCCAAAGCCACGAGAAAATGTTTCCACTGTGGAGTAGACGGGCACTGGAAGAGGAACTGCCCGAAATACCTTGCAGAAAAGAATGCTGAGAAAGAAAAACAAGGTAAATATGATTTACTCGTTATGGAAACATGTTTAGTAGAACATGATGATTCCGCCTGGATATTAGATTCAGGAGCCACTAACCATGTTTGTTCTTCTTTTAAGGAAACTAGTTCCTGGCAGCAGCTTGCTGATGGGCAGATATCTCTCAGGGTTGGAACGGGAGAGGTTGTCTCAACCAAAGGAGTGGGAGCTGTGAAGTTGTTGTTTAGAGATAGATTTATTTTATTAGAAAATGTACTTTTGGTTCCGGGAATCAAAAGAAATCTTGTATCTATCTCTTGTTTGCTTGAACATATGTATAAAGTATCTTTTGATCATAATGAAGTGTTCATTTGCAAAAGAGGTGTACGAATATGTTCTGCTAAACTTGAAAATAACTTATACGCGTTAAGACCAACTGAAGTAAAAGCTATTTTGAACACCGAGATGTTTAAAACAGCTGATACTCAAAATAAAAGCCAGAAACTTTCTCCTAGTATCTATCTTTGGCACTTAAGACTAGGCCACATTAATCTCAATAGGATTGAGAGATTGGTCAAGAGTGGTCTCCTAAGTCAGTTAGAGGACAACTCTTTACCACCATGTGAGTCCTTTCTCGAAGGAAAAATGACTAAAAGACCTTTTTCTGAAAAAGGTTATAGAGCCAAAGAACCCTTGGAACTCGTGCATTCGGATCTTTGTGGTCCTATGAATGTCAAGGCACGATGAGGGTATGAATATTTCATCAGTTTCATTGATGATTATTCTAGGTATGACTATCTTTACCTAATGCATCATAAGTCTGAAACCCTTGAAAAGTTCAAGGAATATAAGGCAGAAGTTGAGAACACATTAGGTAAAACAATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGATTTGAGATTCCAAGACTATATGTTAGAACATAGAATCGTATCCTAACTCTCAGCACCTGGTACACCTCAGCAGAATGGTGTATCTGAGAGGAAAAATAGAACCCTGTTAGACATGGTTCGATCCATGATGAGCTATGCTCAGCTGCCTAATTCGTTTTGGGGTTATGCAGTAGAAGTTGTTGTATATATTTTGAACATGGTTCCCTCAAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTAGTTTACGTCACTTCAGGATTTGGGGATGTCCAACACATGTGTTGGTGTCCAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCCTATTTGTAGGATACTCCAAAGAAACGAAATGTGGTCTATTTTATGATCCTCAAGAGGACAAGGTGCTTGTGTCGACAAATGTCACATTCTTAGAGGAAGACCACATGAGAGATCATCAGCCTCGAAGTAGGATTTTCTTAAGTGAAATTTCCAGGGAAGCTACAGATAAATCAACAAAAGTTGTTGATCAAGCTGGTCCTTCTCAAGTGTTGAGAATGCCTCGACGTAGTGGGAGGGATGTTAGACAACCCGACCGTTACATGGGTTTGACTGAAACCCAAGTCGTCATACCTGATGATGGCGTTGAGGATCCATTGACCTATAAACATGCAATGAATGATATAGATAGGGACCAGTGGATTAAAGCCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTGTAGATCAACCAGATGGTGAAAAACCTATCGGTTGCAAGTGGATCTACAAGAGGAAACGAGATCAAGCTAGTAAGGTGCAAACCTTTAAGGCTCGACTTGTGGCAAAGGGTTATACCCAAAGGGAAGGGGTGGACTATGAAGAAACCTTCTCCCCTGTTGCCATGCTTAAGTCCATTAGAATACTCTTGTCCATTGCCATGTTTTATGACTATGAAATTTGGCAAATGGATATCAAGACAGCCTTTCTGAATGGCTATCTTGAGGAGAGTATCAATATGGATAAACCAGAGGGGTTCATAGTTCAGGGTTAAGAGAAAAAAGTTTGCAAGCTTAAACGATCCATTTATGGATTGAAACAAGCGTCTAGATCCTGGAATATAAGATTTGATACTACGATCAAATCTTATGGCTTTGAACAGAATATTGACGAGCCTTGTGTTTATAAGAAGATAGTCAATACCACTGTAGCTTTCTTAGTTCCATACGTAGACGATATCTTGCTCATTGGGAATGATGCAGGGTTCCTGACTGGCGTTAAGCAATGGCTAGCGACCCAATTCCTAATGAAAGATTTGGGAGAGGCTCAGTTTGTTCTTGGAATCCAAATCGTTCGGAATCGCAAGAACAAAATGCTAGCACTTTCTCAGGCATCTTATATTGACAAGATGTTGGTTAGATATAAGATGCAGAATAACAAGAAGGGATTATTGCCTTTCAGGCATGGAATTCATTTAACTAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGCATTCCCTATGCATCAGCTGTCAGTAGTATGATGTATGTCATACTATGTACTAGACCCGACATATGTTACGCAGTGGGAATTGTCAGCAGGTATCAGTCTAATCCAGGATATGATCACTGGACTGCCGTTAAGAAAATCCTCAAGTATCTTAGGAGAACGAGGGACTATATGCTTGTGTATGGCGCAAAGGATTTGATCCTTGCAAGATACATTGACTCTGATTTTCAAACTGATGTAGATTCGAGGAAATCCACATCATGATCAGTGTTCACTCTTAACGGAGGAGCTATAGTATGGAAGAGTATAAAATAAGGTTGTTGAAGGGAATGAGAATCCCCGTGCAGCGGAAGACGATCGAATCGTCGTGCTATCATCATGCAACATGCTTATTAAAATTGATCTAACTATTAGATCTAGGCTAAACATGCTTTGAAATTTTACAAAAGAAGAAGAAAACCACTTACTTGTTGTAGACCAATTCAAAGAACTCTCCTTGTTAATCACTTAACTCTTCGTGCTTCTCGATCTCTACGAACTCTTCCACGATCAGATCCCAAACACTACCAAATGATCTTCTTGGTTGTTCTCTTTTGGAGAAGGGAAGTTGGTGCGAACTTTTGTTTTTGGAAAGAGAGGAGAAATGAGAGAATGGAGAGAGTTCACTTTTGGATTCTTAGGTCAAAATCCAAAAATGAGAACTTAGCTTTTCTTAGCTCTCTAAACCCATCTTACAAAGGGTTTATATAGTGGAATAATGAAGTTCACAACTTCACATTATTACCACTTTACCACAACCATATGTTATGTAAATCTTATTTACATAATTATATAATATAAGATATCTCATATCTTATGTTGCAATAATTGTGGATTTAATTGAATCACATTCAATTTATTTTCTCTCTATCAATTTCTCCAATTAGCCTTTAATTCAACAATTAGGCTAATATATAGTTTATTATGAATCTCATTCATATTAAACTATATATTATATCATCTATATGATATAAATATTCTTAAAGTGAAATTGAACACTTCAATTTCATGCCAAAACATTAACCCTTCTTTTATCCCGTTTGAGCCAACCAAGAGACCTAATGGACCTACATTGACGGGCTCCAATGATTTGAGGTTAACCACCAAACTCTTTTACCCACAACCAATGTTCATTAGCTACAGTGAACACTCCACTAAAGCCCAGTAGTTGTACTCCCATCGATGTAGAATATTACGTGTCCACCGATATAACCAATGCCCATGAGTCGACCATTCACGGGTTGTTCGTGAGACACGATCGGGTCAAATTACCGCTTTACCCCGTGTCTCACATCTTGTTCCTTAAGTCCCCACAGCTCCTCTAATGAACAACATATTGTGATGGTCTAACCATACACAACACCCTTCTCTGGCGGTGAGAAGGTGGGCGCGCGTTGTCCAAGCCCGGAGACAACACTTAAGGAACAACCCCTAGATTCCCTATACGCGAGAACGAGTGAATTTCATCTCGCGTAAGTAGGTTCCCACTCTCTACTTGGTTCTGTCCCTAAGAAGATAGGCATATTGGGCAAGTAACTAATACTACCCTCACCCGTATTAGTCAAAGGACGGACCATGAGGCGGAGTCCGTAATCGCTCGAGGATTCGTGTCGAGTCACTAATGGTCATCTACGAATTTATTAGTCTTTCACTGTTGTCAACGGTGTTAGATCGATAAGTCTAATAATTCACGATCCTGATCTTATACAATCTCATTGTGCAGATGCCCCCCACTCGCATGTCAACCACATGAACGAGTCGATCACCTCGTTTGTATCTAATACAAAGCGGGTCGCATCCATGAATGTATCCGAGATTAGGTCTCCAACCCCATCCTTATATCAGATCGTGCTTGGGTCATTAACTCGACGTGATCCTCCTTGTGTGTCAACTACACACACGCTCAAGTTCTAGTTCTCTCTCATATATTCAATGACCCCGGAGCTTAGTTTATTGGAAATGTTTAAAATATTTCCGAGACACAAAAGTGAAGAAATTAATAACTCTTATTAATTTCAAAATAATATTTTACAACCACGAGATTAGGACAGAAATCCCAACAGTTGTTTCGCTGACTCCACCATGGAGGTTGAGTATGTCGCAGCTTGCGAAGCAGTGAAGGAGGTTGTATGGCTTAGGAAGTTCTTGACTGATTTGGAAGTTGTTCCAAATATGAATTTGCCTATCACCCTTTATTGTGATAATAGTGGTGCAGTGGCAAATTCCAAAGAACCTAGAAGCCATAATCGCGGAAAGCACATAGAGCGCAAATATCATCTCATCGAGAGAGATTGTGCAACGAGGAGACGTGATCGCCACGCGGATCGCCTGAGCACAACATTGTTGATCCGCTTACAAATCCTCTCACGGCTAAAGTGTTTGAGGACACCTAGAGAGTCTAGGACTACGGGTTGTACAAGAGGGCGCTTTTTCTTTGCTCAAAGATGTTTTACCCCACAAACGTTTTAGTCCAAGTGGGAAATTGTTGGGTTTTAATGTCCTAAAACTCGTGGTTTTGTAAACTGAATGTATATTCTATTTCACAATAAAGTTGTTATTGAAATTTATTCAGTAAAGGTGTTATTGACTATTGTGTATTGTGTAACCTTAATCCAATAAAGAACCCTTGGCTATAGGACGAATACTTGAACTTTATGTGTGACATAAAAGTGGATCAAGTTCAAGTTATAGCCAGAACAGTCTATAGTATAAGGATAGGGTTAGGTACCTTATCCTGGGGACACTCTGGGCGCGACCCACTTTGTAGAGTACAAACGATGTGATCCTAAATTGTTCATGTGGAGACATGTGAGTGGGGGCGCCCTATGCAATGAGTTTGCATAAGACTGGACCGCGAAATAGTCACTTAGACTTTATAACTCCGTCTGTCTTAACTGACTATATTCATACGATGACCTAGGTAACTCGATCTTAATCCTGAGCGTTCTATGGGCTCCTGTTTATTCGGATTATCCTTAGATTTGCATGGGTGAGAGTGGCTCGAGATTGCCGAGTCAATAAGCCTCCCATTTCAGGGATAAAACTGGGTAGAAAGCTGGGAACATAGTTCTGCAAGATGGAATTCACTCCTACCCGCTTTAGGGATAAGTAGAGAGGTTGTTCCCTTAAATGCTGACTCCAGGTCTTGAACAATGGGCCCACCCCTCTCATTGGCCCGAGAGGGATTCGGTTTGCTGGTTGGACCACAAACCAATTGTTCATTAGAGGATCAGTGGTACTTAAGGAACAAGAGGTAATTACAGGGGTAAAACGGTATTTTTGATCCAGCTGTAATTACGAACGACCTGTGAATGATCGACTTACTAATCATGGTTATATCGAGTGAACAGAAATATATCTACAGTGAGGAAAATGCAGCCACTGGGCTTTAGTGGAGTGTCCCGGTGGTTAACGAATGTTGATTAGCTCGGCTAAAGAGTTTGGTCGATTATTCTCGTATCGTTGGAGCTCATGATCTGTAGGTCCATTAGGTCCTCCTTACTAGCTCAAACTTAGCCTTATAACATTTTGAGAGAAAAGAATTTGAATTGTTCAAATTCGAGTTTGGAAGAAAGCGTTAATTAAATGTGATATATTTAACGAATCATAGCTTAATTATGAATTAAACTATGATTAATAAGAGGTGGATATATTTAAATATGATTTAAATATCGGATCGGTTTATAGCATGAATAAGGATTCATGTTCGAAAAAACGATCGAGATCGGGAAATTTATTTAATTTAACATTTGATATTAAATTAAATTAAAAATTAATTTTGAATTATTTAATTAATTATATTAATTTTAATTAAAAATCAATTTAATATTGATTTTTGAATTAAAATTAAATGAAAAATAAATAGATTTTCATGATTTTCATGAAAATCAACTTGACAAGTTGCCTAATCCTCATCATCTTCAAAATTGTCCACCTCCACCTCAAAGCATCCTCATCACCCTTGAGTTGTCACTCAATTGAAGCTTATTGAAGCTTTATATAATGGTGATTTGAGGGATGTTGTGAGAATTTTACAGAAAATTCAATATTCCCGAATCAGTGACCTTTGAGGTCTTTTCTTCCAAATTCTCTCAACCTAGGATCCCACAATCCGTTCTAAGGCCTGAGGATAGTAGGGAAGACTCTAACGGTTGTCCACGAAGTGTTCGTGTTGAAAACCATCCACCGGGTGAAGATAATTTCATCTTCAAAGTTGAGTTTTCTATACCTCTGTTTTTCCAGCTTTGAGCATGTTTTTTATTTTACAAAATTGATCAACTTAGAGTGTTCAAGATCCAAAATTGCTTCCGCTTTCTTAATCTAGACTCTTTCACTAACCGTGTTACGGACTCCTATCTGCGGGGGTCTGTCCTTTGACTAGTACGGATGAGGGTGATCACAGTTACTCGCACAATATGCCTATCTTCTCAAGGACAGGACCACGTGGGGAGCTGGGAACTTGACTACGCGAGATGAAATTCACTCGTTCCCGACTTTAGGGAAACTAGAGGGGTTGTTCCCTTAAATGTTGTCGCCAGGGCTTGGACAACGGGCGCCCACCCTCTCACTGGCCCGAGAGGGATGTTGTATATGGTTGGACAATTATAATGTTGTTGTTCATTAGAGGAGCAGTGGGGACTTAAGGAGCAAGAAGTACACACAGGGGTAAAACGGTAATTTGACCCAGCAGTGTCTACGAACAACCTGTGAAGGGTCGACTCATCGGCATTGGTTATATCAGTGGACACAACTTGTCCTACAGTGACGGGAGTGCAACTACGGGCTTTAGTGGAGTGTTACCGTCGCTAATGAATGTTGATTAACCAGGTCAAAGAGTTTGACAGGTTAGTCTCGGATCATTGGAGCTCGTCACCTTTAGGTCCTTTAGGTCCCTCGGTTGGCTCAAACTGGATAAATTGAGGGTTTACAGTTGGTGTGGATTTGAAGTGTTCAAATTCTATATAAGGGTATTTATGTCATTTATATGAGATATGATTGACATATTATATAGTTTAATGTGAATGAGATTCATAATAAACTATAGGTAGCCTAATTGCTTAATTAGGGCTAATTTGGAGAAATTGGATTGAGAGAAGTTAAATTGAATGTGATTAATTTAAAATCCACAATTATACAAATGTGATTTGTATAATAGCATATGAGATATCTTATTATATGTGGTTATGCAAATATGATTTGCATAACCTATGGTTGTGGTTAAATGGTAAATAGGTGGAGATTTCAACTCCACCTATCACCATTATTTAAACTCCTTTGCAAGAAGGATGTCATACTTGAAAGTTCATTTTTTGGATTTCTCACCAAAAGAACTCAAGAATTCTCTCTCAACTCTCATCTCAAATCTCTCCTCCCTTTCACCAAAAAGGAGTCCCACATACTCTCCTTCTACCTAAAAGAGAACAACCGAGAAGCCTATCGGAGGTGTCCGGCTCGTTGGTTTGTCCAAGGAGATCGAGTTCGTTGAGATCGTGTTCGCGAAGAGTTCATGATGGAGGAGGAAAACGTGAAGAACGGTTCTTCAACAAGTAAGTATTCTTTACTGGTTTTCCTTTCTAAAGCATGTTTAATCTAGTTTTTATGTGATAGAAACAATTTTAAGAAAGCATTTTTCACAATGATACCACAGTGATTCAATCGCTTTTTCGCTGCGTAGGATTTTCATTCCTTCAATTGGTATCAGAATCGTTGGCTTTTCATTGTGCAAATGTTTTTCTTAAAATTATGTGGGTATTGCTTCAATTCATGAGATGAAATTGTTGGTTTTGCATAATTGGATGTTTTCGGTTTTTGCACTAGTTTTAGGTTTATTTTCTGTTGATTAGATCAATGTAAAGGCCCATGTTTTTGGGGCGTTTAGAAAATAAATCGAGTCTGTAAGTCCGGGTCTTTGTCGGCAAGAGTTGTTGTGAAGAAATCGGAGCGAAAACGGGCGAAAAATCGAAGAAAACAGCAAAGGAGTTTGTTATTCAATAGATGGCATTGAGACCCTGTTCTTCGGCGTCTCAACGTCGTCCGCGCGGGCCAAGGCTGATCTCCAGAGCAGCGTTGCAACGCCGAGGCATGGAGCACGCGAGGTGCAACGCTGAGCGCTGCAGCGTTGCAACGCTACCCACGGCGTTGCAACGTTGTGTGCGTGATGGGCAGTTCCAGCGGCAGTTTCGCGCGAGACGTGAGAAGGTTGCAAGTGTTGCACTTCGTGGGATGTGATTTTAATTTGTTTTATTTATTTTTAATTTGTAAAATGTATTTTTCTTTTTAATTAAATTAAATTTATTAATTAATATTAATATAATAGTTATATTAATGTTAATTAATTCATTTAATGTCCCCTAAGTGCCAAAATCGGTCCAAAATTTTGTATTTATTTATTTTATGCATGAGATGTATGAAATATTTAGTTCTATGTGCATAATGTTTATCATATACATTTAAAATCTCAACTTAGGTTAAACATTTCATGCATCATATTTATATTATAAGTGTTATAATATATAGTATGCATGTTAGGTTATATTATAAGTGTTATAATATAATGTATGCATGTTTTAATTTATTTAAATATAAGTGTTATATTTTTATATTAAAAGCATGCTCATGCATAATTCATGTAATGAATGTATGTTATAGTATATATAGTATTATTCATGTGATGAATGATCAAGCACTATGACATAGAAATGCATGTAACCTAAGTTAATTATTAGTTTTAAAATGTTTAGAACTTGAATAATTAGACTTCGGTTTTCTACTTCTAATAAGATTAGAATTAGGAAATCAAATATGACCTTTGAATGGAGACTAATTAGTAGCTCTGTCTAAGGAAGGTTCTATCTAAGGTTGCGGTATTTAAGCTGACGCTTTACGGAACACCCCTACCAGGGAACCGACCTGGGAGTGAATTTGGTCTGGAGATTCTTAGGTTATATTATCGTGAGTGGCGTTAATTAAAGGTTGATTAATTTACTCACATAGACCTAAGTTAATAATGTGATTATAAACTTAGGTTAAATCAATATAAGAGTTATATTGATAGACTTAGGTAAATATTAGTTGATTTATCTAGGTCAATCCCGAATAATATAAAGTGTTGAGTGGGAGGAAATAAAGTGTATAAGATACATTTGATTTTTCGACCACAAATCTCTGTAAAGTCTGCACCGTGAGATCCATACTCGGCCTCGTGGCCCAAATGGGAGTGTCCCCCTTCGGATGGTATTTGTATGGGTCAATATCATAGTGGATGGAGAGAGTACTTATAGTGAGTGGGAGCAGGATCTGTGAAAACGGGACCTGTGAAAATGGGACCTACGATCTCTTCTTTCGATTCACACCGTGAGACCCCTATGATCCGCCTGCTAGTTTGGCTTGGACCAGACAATCCCTTCGGAGGGCCTTGATCATGGGAGTCGAAACACCGTGAATTTTCGAAAGGGATACAGTTTCTTATTGGTTTTCCTTCTTGGTTCGCCCTTCTGTGCCTATTGTTTGGACGGATACTTGGAAACTTAAAGATTGAGGGTGACACTCACAGGATTCATACTGAAAAATGGTAACTGGACGCCTAATGGAATAAGAGTTATTCTGGTATTAGGGTCTGATCGATATAGTCTCGTCCTAGAGGAGGAGTGTTTTTGTTCACCCTTCGGTGGCGGTCATTCTAAACCTTTGGAATATCATTGCAAAATTTTAATGACCTAGGGAACATAGGATAAATTTTTGCTAAAACTTGATTGGATTTCAAAAAGTTTTGCGAGACTGACACAAAGTTTTTATTTGTTTTTCAGCAAAATGTCTAGCTCAATTATAGTTTTACTAGCATCCGACAAATTAGTGGGAGATAATTTCCAAACTTGGAAAAACAATATAAACACGATTCTAGTAACTGACGACCTTAAGTTCGTGCTCACTGAGGAGTGTCCTCAGTTGTCGAGCTCGACCGCGTCACGAAGTGTTCGTGATGCGTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATAATTGCCAGCTTGTCTGAAGTCATGGCAAAGAAGCATGAGCTGATGATCACCGCTAAAGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTTCTTTCAGGTCTGGCATGACTCGCTCAAACATGTCTTCGTCCGGATGAAAGAAGGGGCGTTTGTCCGTGAACACGTTCTGGACATGATGACCCACTTTAATCTGGCGGAGATGAACGGGGCTTCGATCGATGAGTCAAGCCAGGTCAGCTATATTCTGGAGACTCTTCCGAAGAGTTTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATGAGCTATTGTTGCGTATGATTTATGATATGCAATACCGCAGTGTACGGGTCAAGTTTTAGTATAGTAATTTAAGAGTCCAAGTATCGTCCTCGGGAATAGATTTCTAGATTAATTAATTACTAAGATACTACTTCTCAATTTTAGCTAAAGTAACAACATATGGATAGGTGATTTGAAAGTAAACACAAATAACAGACTAACATGTGGAGACACGAATTGATAGAATTGAGGAAAGTGATTTTACAAGTTAATAACCTAACTAACACATGCATTGAACTAAGAGATGAAGTTATAAACAAGAAGTAGAATACTCAAGAAAACCTCTCGATTCTTAGTGAGCCTCGCACATGCAATAAACTCCATAGTCTCCATATGAACCTACGACATGCACGGCATTAAGCAACTTGTTTATATGTTTCCCTTAAGTTTCTTAGTCATTCTAATGTCAGATTCCTAAGGGTTTTGACAACTTTGCTCTAATTCAAGTTTTATCTCTCAGTCCACTTAAACCAACAAAATTAAGAGTTCAAGAGCTAATTGCCAAAAATAAGAACAAAACATGCTTCAATACATAACAATTGAGTCTTAACAAGCAAAACAAATTCTAACTACAACATTTAGCTACTCTAAATCACATGAAAACAAAGCTAAACGTGTAGAAAACGTCTTAAAACGGTATTACAAGCTGAAGAAAAGGAAATTAAATTTATGTGTCGATGTCGGTCCTTCGTCTCCGAGCTCCCGTCTTCGATCTCGAACGTCCACGGATGCTTTGGCTCCCAAAAACGCTCTTTTCTCTCTTCTTACAAAGTGTATTTCGAAAATAGGGTCAAAAGCCTACCAAATTCGGGTCCCCTTGCAATGAAAACATAAGGTTCTATATATAGGCTCAATCGGTACAGCGTTGCAACGTTGTACCCTAGGCGGCTGGCATATATGGAAATTATCGATGGCGTTGCAACGTTGAGGGCAGCGTTGCAACGTCGCTACCCTAACGTGTGCACCTCTTATCGAAATGGCGTTGCGACGCTGCAAGGCAGCGTTGCAACGCCCTCTATGTGGTTCTTCAAGTGTGCCTTTTTTATCGAGATAGCGTTGCGACGCTGCTAGGCAGCGTTGCAACGCTATCGTCTGGGATCTCCAATGCCTCTAAACAGCAAGCACGACATTGCAACGCTTTGCATAGCGTTACAACGCCATCACCTCGGGTCTTCAAGCTTGGTTTGGTCCTTTCACTTCCAATTGATGTTATTTTACGCTCCAAATAGCCTCGTTTTGATCCCGAGTGATCCAAAATGTCTCCAATTGCTCTGTTTAAGCTCTTAACCTGAAATTGAGTAATTTAGACACGTAAAAGCTCTGAAATCATTACAAAACTAGCAATAATTAAGCCTAAGGATAACACATTTTGTGTGCTATAAGCTACACTCTGACCACCCTTCTGACGAGCTACAGAACTTTCAGTCCTTGATGAGGATCAGGACACCGGAAGCTGAGGCATATATTTCCTTCAGGTTCTTTCACAGGGGTTTGGCCTCTGGGACAAAATCTGTGGCTCTTTCTCACCCGAAAGGGAAGAAGAAAAAGATAAAGAAAGGTAAAGTTGACTGCGTTCCTGCCCAAAAGGGCAAAAAGGTCAAGGAAGTTACAGAGAAAGGAAAGTGTTTCCACTGTCATGGGGATGGCCACTGGAAGCGTAACTGTCCCAAGTTCCTTGCCGACAGGAAGAATCAAGGTAAATGTGATTTACTAATGTGAAACTTTGCTTAGTGGAGAGTAATGACTCTGCCTGGATATTAGATTCGGGCGCCACTAACCATGTTTGTTCTTCTTTTCAGCGAATTAGTTCCTGGCAGCAGCTGCAAGAGGGTGAGGTGACTCTACGGGTTGGATCTGGGGAGGTTGTCTCTGCTACAGTGGTCGGCACGGTGAAGCTCTATTTCGGCAGGAATTACATTTTATTAGATAATATATATATAGTTCCAGGGTTTACTAGAAACCTAGTTTCTATTTCCTGCCTACTTGAACACTTTATTTCTGTAAATTTTTCTAGTAATAAATCGTTTATTTCTATAAATGGAAATTTCTTGTGTTCTGCTTCACTTGAGAATAATCTGTATGTTTTGAAACCTAATTCGGTCAAAAGTGTTTTGAATACGGAATTATTTAAAACAGCAGAACCACGAACTAAGAGAATGAAAGTTTTTCCTAAAGAAAATGTCCATCTTTGACATCTAAGGTTAGGCCACATTAATCTCAATAGGATTGAGGACTAGTGAAGAGTGGACTTCTGAACGAGTTGGAAGAAAACTCTTTACCGGTGTGTGAGTCATGCCTCGAAGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGTCCCTTGAGCTTATACATTCTGACCTCTGTGGTCCGATGAGTGTTAAAGCACGAGGAGGTTATAAATACTTTGTATCTTTTATAGATGACTATTCAAGGTATGGGTATATCTACATGGACCAGCCCAAGGGGTTCATTACCCAAGGCCAAGAGCAAAAGGTTTGTCGGCTTCATAAGTCTATTATGGAGTGAAACAAGCCTCAAGGTCCTGGAATATAAGGTTTGATGAGACGATCAAATCTTATGGCTTTGATCAGAATGTTGACGAGCCTTGTGTCTACAAGAAAATCGTCGACAAAACTGTTGCATTTCTGATATTGTACGTGGATGATATCCTTCTCATTGGGAATGAGGTAGGATTACTTACTGACATAAAGAAATGGTTGGCTTCGCAATTCCAAATGAAAGATTTGGGAGAAGCACGCAGTATGTTCTAGGATATCCAGTTGTCCTAACCTAAGAATAGAAACGTTGGCCATGTCTTAGGCATCTTATATTGACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGACTCCTCAAGAGGTTGACGACATGAGACGAATCCCTTATGCTTCAGCTGTTGGAAGCCTCATGTATGTCATGCTATGTACTAGGCCCGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCCAATCCAGGATTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCCTAGGAGACCAAGGAACTACATCCTTATGTATGGGAGTGGGGATTTGATCCTTACGGGATACATAGACTCTAACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGAGGTCAGCCTTCATTCTGAATGGAGGAGCTGTAGCATGGCGAAGCATCAAGCAGGGATGCATCGTTGATTCCACTATGGAAGTTGAGTACGTTGCAGCTTGTGAAGCTGCAAAGGACGTTGTTTGGCTTATGAAGTTCATGATAGATTTGGAAGTTGTTCCAAATATGAACTTGTCGATCACGTTGTTTTGTGACAACAGTGGTGTTGTAGCCGACTCCAGAGAGCCTCGGCGTCATAAAAGGGGCAAGCACATTGAGCGAAAGTATCACTTGATACGGGAGATTGTGCATCGCGGCAACGTGACAGTCACGCAGATAGCGTCGGAGCACAACGTTGTTGATCCATTTACAAAGGCCCTCACAGCTAAGGAGTTTGAGGGTCACCTAAAGAGTTTAGGTCTTCGAGTGCTTCCTGACTAGGGCAAGTGGGAGTATTGCAAAGGGTATTCAATGCCCTAGTTTATTGTATTACTTTATTGTAATTCTTGTACATTTGAGAAATTTATTATTTACTCCACTAGCTTAGTCCAAGTGGGAGTTTGTTGGGATGTATGTCCTAATCTCGTTTGGTTTGTAATATTATCATTTGAAATAAATAAGAGTTATTTCTTTCATCACTTTTTGTCTCGCAATTTTACAAACTTTGTCCAATAAACTAAGCTCTAGGGTCATTGAATTTATGAGAGAACTAAAACTTGAACGGTGTGTAGTGGACATACAGGAAGGATCACGTTCGAGTAAAGGACCCTAACGGTCTACAGTATATGGATAGGGTTGGAGACCTAATCTGGATACATTGTAGATGTGGCCCGCTTTATATTAGATACAAACGAGTTGATCCAACTCGTTCATGTCGCTGACATGCGAGTGGGGGCATCCTATGCAAATGAGATTGTACAAGACCGAACTGTGGATTATTAGGCCTACTAATGTAACACCGTTAACAGTATTGGACTAATAACTTCGTAGATGACCAATAGTGACTTGACCTTAATCCTGAGCGTACGGGACTCACATGCGGGGAGTCTCGTCCCTTTGACTAGTACGGGTGAGAGTGGTCACAGTTACTCGCCCAATATGCCTATCTTCTCAAGGACAGGACCACATGGGGAGCTGGAAACTTGACTACGCGAGATGGAATTCACTTCTTCCCGACTTTATGGAAACTAGAGGGGTTGTTCCCTTAAATGTTGTCTCCAGGGCTTGGACAACGGGCGCCCACCCTCTCACTGGCCCGAGAGGGATGTTGTATATGGTTGGACAATTACAATGTTGTTGTTCATTAGAGGAGCAGTGGGGACTTAAGGAGCAAGAAGTACACACAGGGGTAAAACGGTAATTTGATCCAACAGTGTCTACGAACAACCTGTGAAGGGTCGACTCACGGGCATTGGTTATATCAGTGGACACAACTTGTCCTACAGTGACGGGAGTGAAACTACAGGCTTTAGTGGAGTGTTACCGTAGCTAATGAATGTTGATTAACCGGGTCAAAGAGTTTGACAGGTTAGTCTCGGATCATTGGAGCTCGTCACCTTTAGGTCCATTAGGTCCCTCGGTTGGCTCAAACTGGATAAATTGAGGGTTTACAGTTTGGTGTGGATTTAAAGTGTTCAAATTCAATATAAGGGTATTTATGTCATTTATATGAGATATGATTGACATATTATATAGTTTAATGTGAATAAGATTCATAATAAACTATAGGTTAGCCTAATTGCTTAATTTGGAGAAATTGGATTGAGAGAAGTTAAATTGAATGTGATTAATTTAAAATCCAAAATTATACAAATGTGATTTGTATAATGACATATGAGATATCTTATTATATGTGGTTATGCAAATATGATTTGCATAACCTATGGTTGTGGTTAAATGGTAAATAGGTGGAGATTTCAACTCCACCTATCACCATTATTTAAACTCCTTTGCAAGAAGGATTTCATACTTGAAAGTTCATTTTTTGGATTGCTCACCAAAAGAACTCAAGAATTCTCTCTCAACTCTCATCTCAAATCTCTCCTCCCTTTCACCAAAAAGGAGTCCCACATACTCTCCTTCTACCTAAAAGAGAACAACCGAGAAGCCTATCGGAGGTGTCCGGCTCGTTGGTTTGTTCGAGGAGATCGAGTTCGTTGAGATCGTGTTGGCGAAGAGTTTGTGACCGGAGGAGGAAAACGTGAAGAACGGTTCTTCAACAAGTAAGTATTCCTTACTGTTTTTCCCTCTTAAAGCATATTTAATCTAGTTTTTATGTGATAGAAACAATTTTAAGAAAGCGTTTTTCACAACGATACCACGGCGATTCAATCGTTTTTCCGCTGCGTAGGATTTTCATTCCTTCATAGGGTACGAAGAATGGGGAGAGGAAGACTCACCTTCAATTGCATTAGGTGTAGATAATATATCTTTCATTGAAAATTCTGATGAATGGAGTCGACGACGAGATGAAATGGTTGAATGAATGTTTTCTGAATGGGCAAGGTCATAATTACATGCTTGTGACAACTAGTATGACGTACTTATGTTGAATGTGAATTCATGTATTGTATTTTACATGGTTTAAGTACTTATGTTGAATGTGAACGTCTACACCGAACCTTTAACCTATTAATTGAATGTATGATTTTGTCCACTTATGGTCTTACATTAGTGTGATATATCCTAACAAAACTAACTAGAGTTGATGTATATATATGCATTTAGAAAAATGGCAGGTGATAGCACACAAAAGGTGCTATCCTTAAGCTTAATTATTGCTAGTTTTATGATAATTTTAGGGTTTTGATGTGTTTAAATCACTCAATTTCAGGTTAAGAGCGTAAACATGACAATTGGAGACATTTTGGTTCACTCGGGATCGAAACGAGGCTATTTGGAGCTAAAAATAACATCAATTGGAAGTAGAAGGACCAATCCAAGCTTGAAGACCCGAGGTGATGGCGTTGCAACGCCATGATTGTACCTTTGATGCATAGAAGAACTCTGTCGATAGCGTTGCAACGTTGCCTTGCAGCGTCGCAACGTTATCTCGATAAGAAGACTTATATGAAGATCCGAAGCGAGGGTGTCGCAACGCTGCCTTGCAGCGTCGCAACGCCAGATCGATAGAAACGCGCTTGTTAAGGTAGCTGTGTTGCAATGTTGTCCTCAGCATTGCAACGCCATCGATAGTTTCCATATTTGATACGCGCTTAGGGTACAGCGTTGCAACGTTGTACCAAAGATCATTATAAATAGAACCTTATCTTTTCTTACAAAGGGAGCCGAATTTGATAGATAATTCACCCTAATTTCGAAATACCTTCTGTAAGAAGAGAAAAGAGAGCGTTCCTTGGGAGCCAAAGCATTCGTGGAGGTTCGAGATCGATACGGGAGCTCGGAGACAAAGGACCGACATCGACACATAAGTTTAATTTCTTTTTCTTTCAGATTGTAACCCCGTTTAAGACGTTTTCTACACTTTTAGTTTGGCTTTATTGTGATTTAGAGTAGTTAAATGTTGTTGTTAGGATTTGATTTGCATGTTAAGACTTATTTGTTATGTATTGAAGTATGTTTTGTTCTTTATTTTTGGCAATTCGTTCTTTAACTCTTTGATATTGTTGGTTTAAGTGGACTCGAGAGATAAAACTTAGATTAGAGCAAAGTTGTCAAAACCATTAGAAGTCTGGCATGAGAATGACTTAGAAGTCTAAGGGAAACACGTAAAGAAATTGCTTAATATCGTGCATGTTGTAGGTTCATATGGAGACTATAGAATTTATTGCATGTGCGAGACTCACTAAGAATCGAGAGATTTTCTTGAGTATTCTACTTCTTGTTTATAAATTCATCGCATTAGTTCATTGTATGTGTTAATTAAGTCGTTAACTTGTGAAGTCACTTTCCTCTAAGTCTATCAATTCGTATTTCCACTTGTTTGTTTGTTGTTTGTCTTTATTTCCATCACCTATCCATGTTATGTTACTTTAGCTAAAATTGAGAATTAGTATCTTGGTAATTAATTAATCTAGAAACTCATTCCCGAGGACGATACTTGGACTCTTAAATCACTATACTAAAACTTGACCCGTACACTTGTGGTATTGCATTATCATAATTATACGCAACAGTTGGTATTGAGAAAACTAAAAAGTATTGTTGGTCTAAGTTCGAGGATGCTAAGTTGGTTGAGTGCCTCGTTCGAATGGTACAAAAGGGATGTTGGAGGTCAGACAATGGCACATTCAGACCCGAATACATATCGCATCTTCTCCGGCTGTTGCGAGAAAAAATACCAAACTGTAGTCTTCAATCAATAAGTACAATAGACTGCAAGGTTAGGAATCTTAAAAAGCAATACACAACAATTGTTGAGATGCTTGGATCTGAATGTAGTGGATTCGGATGGAATGAGGAGTTCAAGTGTGTGGATGTGGAGAGGAACATCTTTGACGAATGGGTCAAGGTAAGACAACAACTTTAATTCATAGTCGTAGTTACATTACCAACTACACATAATACTATATATTTTTTGAAATGCAGAGTCATACTGGTGCCAAAGGACTCCGGAATAAGCCCTTCCCTCATTTTGATGAGTTAGCATTTGTATTTGGAAAAGATCGTGCGACGGAAGTAGCAGTAGAGACCCCTGGTGACATGGCATCAAATTATGATGTTGAAACACTGGGTGAAGACGTGGAGATTAGAGTATCCCAAGACCATTATGTACCTGAACCCCCGTTTGTCGATGGAACAAGTAATCTAGATGGGGAAGAAAATCCCGAGACACCGACTAGTAGGTTTCATATGTCAGCCACGACATCACGCGGAAGCAAGAGAAAGCGTTCATCTTATCAGTCTGAAATGCTAGATGTTGTGCGCGCAACAATGGACATGCAGGGTACTCAACTAGAGCGGATTGCGTCATGGCCTCAATCAAATGTTGATATAGATGATAGCCGACGCACAAAAGTGTCAAACATTTTAAAATAA

mRNA sequence

ATGTTGACAACTGGTGGTTTAGAAGGGATTGAATACGTAGATGTTGAGGAAATGGTTGCAATGTTCTTACACATCCTAGCTCATGACGTCAAGAATCGAATAATTCGTACACAATTTGCAAGGTCTGGTGAGACAGTATCTAGGCACAATTCTGTACTTAGCGCAGTATTGCAACTCCATGAAATTCTACTGAAGACTCCAGAGCCAATCACAAATTCTTGTACTGATTCTAAAAAATTCAATATTCCCGAATCAGTGACCTTTGAGGTCTTTTCTTCCAAATTCTCTCAACCTAGGATCCCACAATCCGTTCTAAGGCCTGAGGATAGTAGGGAAGACTCTAACGACAATGGCACATTCAGACCCGAATACATATCGCATCTTCTCCGGCTGTTGCGAGAAAAAATACCAAACTGTAGTCTTCAATCAATAAGTACAATAGACTGCAAGGTTAGGAATCTTAAAAAGCAATACACAACAATTGTTGAGATGCTTGGATCTGAATGTAGTGGATTCGGATGGAATGAGGAGTTCAAGTGTGTGGATGTGGAGAGGAACATCTTTGACGAATGGGTCAAGAGTCATACTGGTGCCAAAGGACTCCGGAATAAGCCCTTCCCTCATTTTGATGAGTTAGCATTTGTATTTGGAAAAGATCGTGCGACGGAAGTAGCAGTAGAGACCCCTGGTGACATGGCATCAAATTATGATGTTGAAACACTGGGTGAAGACGTGGAGATTAGAGTATCCCAAGACCATTATGTACCTGAACCCCCGTTTGTCGATGGAACAAGTAATCTAGATGGGGAAGAAAATCCCGAGACACCGACTAGTAGGTTTCATATGTCAGCCACGACATCACGCGGAAGCAAGAGAAAGCGTTCATCTTATCAGTCTGAAATGCTAGATGTTGTGCGCGCAACAATGGACATGCAGGGTACTCAACTAGAGCGGATTGCGTCATGGCCTCAATCAAATGTTGATATAGATGATAGCCGACGCACAAAAGTGTCAAACATTTTAAAATAA

Coding sequence (CDS)

ATGTTGACAACTGGTGGTTTAGAAGGGATTGAATACGTAGATGTTGAGGAAATGGTTGCAATGTTCTTACACATCCTAGCTCATGACGTCAAGAATCGAATAATTCGTACACAATTTGCAAGGTCTGGTGAGACAGTATCTAGGCACAATTCTGTACTTAGCGCAGTATTGCAACTCCATGAAATTCTACTGAAGACTCCAGAGCCAATCACAAATTCTTGTACTGATTCTAAAAAATTCAATATTCCCGAATCAGTGACCTTTGAGGTCTTTTCTTCCAAATTCTCTCAACCTAGGATCCCACAATCCGTTCTAAGGCCTGAGGATAGTAGGGAAGACTCTAACGACAATGGCACATTCAGACCCGAATACATATCGCATCTTCTCCGGCTGTTGCGAGAAAAAATACCAAACTGTAGTCTTCAATCAATAAGTACAATAGACTGCAAGGTTAGGAATCTTAAAAAGCAATACACAACAATTGTTGAGATGCTTGGATCTGAATGTAGTGGATTCGGATGGAATGAGGAGTTCAAGTGTGTGGATGTGGAGAGGAACATCTTTGACGAATGGGTCAAGAGTCATACTGGTGCCAAAGGACTCCGGAATAAGCCCTTCCCTCATTTTGATGAGTTAGCATTTGTATTTGGAAAAGATCGTGCGACGGAAGTAGCAGTAGAGACCCCTGGTGACATGGCATCAAATTATGATGTTGAAACACTGGGTGAAGACGTGGAGATTAGAGTATCCCAAGACCATTATGTACCTGAACCCCCGTTTGTCGATGGAACAAGTAATCTAGATGGGGAAGAAAATCCCGAGACACCGACTAGTAGGTTTCATATGTCAGCCACGACATCACGCGGAAGCAAGAGAAAGCGTTCATCTTATCAGTCTGAAATGCTAGATGTTGTGCGCGCAACAATGGACATGCAGGGTACTCAACTAGAGCGGATTGCGTCATGGCCTCAATCAAATGTTGATATAGATGATAGCCGACGCACAAAAGTGTCAAACATTTTAAAATAA

Protein sequence

MLTTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRHNSVLSAVLQLHEILLKTPEPITNSCTDSKKFNIPESVTFEVFSSKFSQPRIPQSVLRPEDSREDSNDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNEEFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVAVETPGDMASNYDVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSATTSRGSKRKRSSYQSEMLDVVRATMDMQGTQLERIASWPQSNVDIDDSRRTKVSNILK
Homology
BLAST of Tan0002976 vs. NCBI nr
Match: TYK26842.1 (uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa])

HSP 1 Score: 250.4 bits (638), Expect = 2.3e-62
Identity = 146/329 (44.38%), Postives = 204/329 (62.01%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T GGLE  +YVDVEEMV +FLHI+AHDVKNR+ R  FARSGETVSRH N VL+ VL+LHE
Sbjct: 4   TRGGLEATQYVDVEEMVTIFLHIVAHDVKNRVARRHFARSGETVSRHFNVVLNVVLRLHE 63

Query: 63  ILLKTPEPITNSCTDSK-----KFNIPESVTFEVFSSKFSQPRIPQSVLRPEDSREDSND 122
           ILLK P+ +T+SC+  K       +I   VT   +++      + + +L+  +      D
Sbjct: 64  ILLKQPDSVTHSCSHEKWRWFQMASINSKVTKHWWTT-IEDEALVECLLQLVEKGCWRVD 123

Query: 123 NGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNE 182
           NGTF+P Y+  + +L++EKI   ++Q    ++  V+ LKKQYTTI EM+G  CSGF WN+
Sbjct: 124 NGTFKPGYLVQVQKLMKEKILESNIQVTPNLESGVKILKKQYTTIAEMMGPVCSGFSWNK 183

Query: 183 EFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVAVETPGDMASNY 242
           E KC++ E+++ ++WVK H  A+ L NKPFP+F +L  VFG+DRAT    +TP +M S  
Sbjct: 184 ERKCIEAEKSVSNDWVKGHLNARYLLNKPFPYFYDLEIVFGRDRATGGKCKTPVEMGSQT 243

Query: 243 DVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSATTSRGSKRKRSS 302
             +T  +D+ I + +D  +P P    G     GE+ P TPTS  H  A +SR SK KR S
Sbjct: 244 ARDTEEDDMIINL-EDFDIPNP---HGLEPPSGEDMPSTPTSMAH-DAGSSRPSK-KRRS 303

Query: 303 YQSEMLDVVRATMDM--QGTQLERIASWP 324
           Y  +++D  RAT  +    T L     +P
Sbjct: 304 YSRDLMDTFRATESLLPDPTMLHAFLDYP 325

BLAST of Tan0002976 vs. NCBI nr
Match: KAA0033487.1 (uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa])

HSP 1 Score: 248.1 bits (632), Expect = 1.1e-61
Identity = 145/329 (44.07%), Postives = 203/329 (61.70%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T GGLE  +YVDVEEMV +FLHI+AHDVKNR+ R  FARSGETVSRH N VL+ VL+LHE
Sbjct: 4   TRGGLEATQYVDVEEMVTIFLHIVAHDVKNRVARRHFARSGETVSRHFNVVLNVVLRLHE 63

Query: 63  ILLKTPEPITNSCTDSK-----KFNIPESVTFEVFSSKFSQPRIPQSVLRPEDSREDSND 122
           ILLK P+ +T+SC+  K       +I   VT   +++      + + +L+  +      D
Sbjct: 64  ILLKQPDSVTHSCSHEKWRWFQMASINSKVTKHWWTT-IEDEALVECLLQLVEKGCWRVD 123

Query: 123 NGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNE 182
           NGTF+P Y+  + +L++EKI   ++Q    ++  V+ LKKQYTTI EM+G  CSGF WN+
Sbjct: 124 NGTFKPGYLVQVQKLMKEKILESNIQVTPNLESGVKILKKQYTTIAEMMGPVCSGFSWNK 183

Query: 183 EFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVAVETPGDMASNY 242
           E KC++ E+++ ++WVK H  A+ L NKPFP+F +L  VFG+DRAT    +TP +M S  
Sbjct: 184 ERKCIEAEKSVSNDWVKGHLNARYLLNKPFPYFYDLEIVFGRDRATGGKCKTPVEMGSQT 243

Query: 243 DVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSATTSRGSKRKRSS 302
             +T  +D+ I + +D  +P P    G     GE+ P TPTS  H  A + R SK KR S
Sbjct: 244 ARDTEEDDMIINL-EDFDIPNP---HGLEPPSGEDMPSTPTSMAH-DAGSFRPSK-KRRS 303

Query: 303 YQSEMLDVVRATMDM--QGTQLERIASWP 324
           Y  +++D  RAT  +    T L     +P
Sbjct: 304 YSRDLMDTFRATESLLPDPTMLHAFLDYP 325

BLAST of Tan0002976 vs. NCBI nr
Match: TYK07921.1 (hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa])

HSP 1 Score: 235.3 bits (599), Expect = 7.6e-58
Identity = 145/347 (41.79%), Postives = 202/347 (58.21%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T GGLE  +YVDV+EMV +FLHI+AHDVKNR+ R   ARSGETVSRH N+VL+AVL+LHE
Sbjct: 15  TRGGLEATQYVDVKEMVVIFLHIVAHDVKNRVARRHCARSGETVSRHFNAVLNAVLRLHE 74

Query: 63  ILLKTPEPITNSCT-DSKKFNIPESVTFEVFSSKFSQPRIPQSVLRP---------EDSR 122
           ILLK P+P+T+SC  D     +  S++ +    +F    I  +V            +DS 
Sbjct: 75  ILLKQPDPVTHSCALDGTHIKVNVSMS-DCPRYRFRNGDITTNVTTTYVMLVIQMLKDSS 134

Query: 123 EDSNDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRN------LKKQYTTIVEML 182
             +    T           +  E +  C LQ +     +  N        KQYT I EM+
Sbjct: 135 LHTEMASTNSKATKHRWTTIEDEVLVECLLQLVEEGGWRADNGTFKLGYLKQYTAIAEMM 194

Query: 183 GSECSGFGWNEEFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVA 242
           G  CSGFGWNE  KC++VE+ +FD+WVK H  A+GL NKPFP+F +L  VFG+DRAT   
Sbjct: 195 GPACSGFGWNEGQKCIEVEKPVFDDWVKGHPNAQGLLNKPFPYFYDLEVVFGRDRATGGR 254

Query: 243 VETPGDMASNYDVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSAT 302
            +TP +M+S    +T  +D++I + +D  +P P    G     GE+ P TPTS  H  A 
Sbjct: 255 CKTPVEMSSQTARDTEEDDMDINL-EDFDIPNP---HGLEPPSGEDMPSTPTSMTH-DAG 314

Query: 303 TSRGSKRKRSSYQSEMLDVVRATMDMQGTQLERIASWPQSNVDIDDS 333
           +SR SK KR SY  +++D  RA+M     ++ +IA+W +  ++I+ S
Sbjct: 315 SSRPSK-KRRSYSGDLMDTFRASMRETSKEIGKIATWQREKMEIESS 354

BLAST of Tan0002976 vs. NCBI nr
Match: KAA0034843.1 (retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 193.4 bits (490), Expect = 3.3e-45
Identity = 153/508 (30.12%), Postives = 206/508 (40.55%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T  GL   E VDVEEMVAMFLHILAHDVKNR+I+ +F RSGET+SRH N VL AV++LH+
Sbjct: 81  TIAGLTSTEVVDVEEMVAMFLHILAHDVKNRVIQREFMRSGETISRHFNMVLLAVIRLHD 140

Query: 63  ILLKTPEPITNSCTDSK----------------KFNIP---------------------- 122
            LLK P+P+ N CTD +                K N+P                      
Sbjct: 141 ELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRKGEVATNVLGVY 200

Query: 123 ------------------------------------------------------------ 182
                                                                       
Sbjct: 201 DTKGDFVYVLTGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDAGYPNAEGFLAPYRGQ 260

Query: 183 -------------ESVTFEVFSSKFSQPR------------------------------- 242
                         S + E F+ K S  R                               
Sbjct: 261 RYHLQEWRGPKNAPSTSKEFFNMKHSSARNVIERAFGVLKGRWAILRGKSYHPVEVQCHT 320

Query: 243 -IPQSVLRPEDSREDSN----------------------------------------DNG 302
            +   +L    +RE +N                                        DNG
Sbjct: 321 ILACCLLHNLINREMTNFDIEDNIVSMTSSSRLPKHTWTKEEEAGLVELVNAGGWRSDNG 380

Query: 303 TFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNEEF 324
           TFRP Y++ L R++  KIP C++ + STID +++ +K+ +  + EM G  CSGFGWN+E 
Sbjct: 381 TFRPGYLNQLARMMAFKIPGCNIHA-STIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEK 440

BLAST of Tan0002976 vs. NCBI nr
Match: ADN33754.1 (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 193.4 bits (490), Expect = 3.3e-45
Identity = 159/553 (28.75%), Postives = 217/553 (39.24%), Query Frame = 0

Query: 6   GLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHEILL 65
           GL   E VDVEEMVAMFLH+LAHDVKNR+I+ +F RSGETVSRH N VL AVL+L+E L+
Sbjct: 32  GLSSTEIVDVEEMVAMFLHVLAHDVKNRVIQQEFVRSGETVSRHFNIVLLAVLRLYEELI 91

Query: 66  KTPEPITNSCTDSK----------------KFNIP--ESVTF------------------ 125
           K P P+T++C D +                K N+P  +  TF                  
Sbjct: 92  KRPVPVTSNCNDQRWKCFENCLGALDGTYIKVNVPAGDRPTFRTRKGEIATNVLGVCDMK 151

Query: 126 ------------------------------------------------------------ 185
                                                                       
Sbjct: 152 GDFVYVLAGWEGSAADSRILRDAISQENGLQVPKGYYYLCDAGYPNAEGFLAPYKGQRYH 211

Query: 186 ---------------EVFSSKFSQPR----------------------IPQSV------- 245
                          E F+ K S  R                       P  V       
Sbjct: 212 LQEWRGAANAPTNAKEYFNMKHSSARNVIERAFGVLKGRWTILRGKSYYPLQVQCRTILA 271

Query: 246 -----------------LRPEDS--------------------------RED-------- 305
                            +  ED                           R+D        
Sbjct: 272 CTLLHNLINREMTYCNDVEDEDEGDSTYATTTASEDIQYIETTNEWSQWRDDLATSMFTD 331

Query: 306 -------------------SNDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNL 343
                               +DNGTFRP Y++ L+R++ EK+  C +++ + IDC+++ L
Sbjct: 332 WQFRGGDSCGMELVSMGGWKSDNGTFRPGYLAQLVRMMAEKLSGCQVRATTVIDCRIKTL 391

BLAST of Tan0002976 vs. ExPASy TrEMBL
Match: A0A5D3DTL0 (Myb_DNA-bind_3 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold260G00340 PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 1.1e-62
Identity = 146/329 (44.38%), Postives = 204/329 (62.01%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T GGLE  +YVDVEEMV +FLHI+AHDVKNR+ R  FARSGETVSRH N VL+ VL+LHE
Sbjct: 4   TRGGLEATQYVDVEEMVTIFLHIVAHDVKNRVARRHFARSGETVSRHFNVVLNVVLRLHE 63

Query: 63  ILLKTPEPITNSCTDSK-----KFNIPESVTFEVFSSKFSQPRIPQSVLRPEDSREDSND 122
           ILLK P+ +T+SC+  K       +I   VT   +++      + + +L+  +      D
Sbjct: 64  ILLKQPDSVTHSCSHEKWRWFQMASINSKVTKHWWTT-IEDEALVECLLQLVEKGCWRVD 123

Query: 123 NGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNE 182
           NGTF+P Y+  + +L++EKI   ++Q    ++  V+ LKKQYTTI EM+G  CSGF WN+
Sbjct: 124 NGTFKPGYLVQVQKLMKEKILESNIQVTPNLESGVKILKKQYTTIAEMMGPVCSGFSWNK 183

Query: 183 EFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVAVETPGDMASNY 242
           E KC++ E+++ ++WVK H  A+ L NKPFP+F +L  VFG+DRAT    +TP +M S  
Sbjct: 184 ERKCIEAEKSVSNDWVKGHLNARYLLNKPFPYFYDLEIVFGRDRATGGKCKTPVEMGSQT 243

Query: 243 DVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSATTSRGSKRKRSS 302
             +T  +D+ I + +D  +P P    G     GE+ P TPTS  H  A +SR SK KR S
Sbjct: 244 ARDTEEDDMIINL-EDFDIPNP---HGLEPPSGEDMPSTPTSMAH-DAGSSRPSK-KRRS 303

Query: 303 YQSEMLDVVRATMDM--QGTQLERIASWP 324
           Y  +++D  RAT  +    T L     +P
Sbjct: 304 YSRDLMDTFRATESLLPDPTMLHAFLDYP 325

BLAST of Tan0002976 vs. ExPASy TrEMBL
Match: A0A5A7SW62 (Myb_DNA-bind_3 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold261G00210 PE=4 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 5.5e-62
Identity = 145/329 (44.07%), Postives = 203/329 (61.70%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T GGLE  +YVDVEEMV +FLHI+AHDVKNR+ R  FARSGETVSRH N VL+ VL+LHE
Sbjct: 4   TRGGLEATQYVDVEEMVTIFLHIVAHDVKNRVARRHFARSGETVSRHFNVVLNVVLRLHE 63

Query: 63  ILLKTPEPITNSCTDSK-----KFNIPESVTFEVFSSKFSQPRIPQSVLRPEDSREDSND 122
           ILLK P+ +T+SC+  K       +I   VT   +++      + + +L+  +      D
Sbjct: 64  ILLKQPDSVTHSCSHEKWRWFQMASINSKVTKHWWTT-IEDEALVECLLQLVEKGCWRVD 123

Query: 123 NGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNE 182
           NGTF+P Y+  + +L++EKI   ++Q    ++  V+ LKKQYTTI EM+G  CSGF WN+
Sbjct: 124 NGTFKPGYLVQVQKLMKEKILESNIQVTPNLESGVKILKKQYTTIAEMMGPVCSGFSWNK 183

Query: 183 EFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVAVETPGDMASNY 242
           E KC++ E+++ ++WVK H  A+ L NKPFP+F +L  VFG+DRAT    +TP +M S  
Sbjct: 184 ERKCIEAEKSVSNDWVKGHLNARYLLNKPFPYFYDLEIVFGRDRATGGKCKTPVEMGSQT 243

Query: 243 DVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSATTSRGSKRKRSS 302
             +T  +D+ I + +D  +P P    G     GE+ P TPTS  H  A + R SK KR S
Sbjct: 244 ARDTEEDDMIINL-EDFDIPNP---HGLEPPSGEDMPSTPTSMAH-DAGSFRPSK-KRRS 303

Query: 303 YQSEMLDVVRATMDM--QGTQLERIASWP 324
           Y  +++D  RAT  +    T L     +P
Sbjct: 304 YSRDLMDTFRATESLLPDPTMLHAFLDYP 325

BLAST of Tan0002976 vs. ExPASy TrEMBL
Match: A0A5D3C7T4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G00330 PE=4 SV=1)

HSP 1 Score: 235.3 bits (599), Expect = 3.7e-58
Identity = 145/347 (41.79%), Postives = 202/347 (58.21%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T GGLE  +YVDV+EMV +FLHI+AHDVKNR+ R   ARSGETVSRH N+VL+AVL+LHE
Sbjct: 15  TRGGLEATQYVDVKEMVVIFLHIVAHDVKNRVARRHCARSGETVSRHFNAVLNAVLRLHE 74

Query: 63  ILLKTPEPITNSCT-DSKKFNIPESVTFEVFSSKFSQPRIPQSVLRP---------EDSR 122
           ILLK P+P+T+SC  D     +  S++ +    +F    I  +V            +DS 
Sbjct: 75  ILLKQPDPVTHSCALDGTHIKVNVSMS-DCPRYRFRNGDITTNVTTTYVMLVIQMLKDSS 134

Query: 123 EDSNDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRN------LKKQYTTIVEML 182
             +    T           +  E +  C LQ +     +  N        KQYT I EM+
Sbjct: 135 LHTEMASTNSKATKHRWTTIEDEVLVECLLQLVEEGGWRADNGTFKLGYLKQYTAIAEMM 194

Query: 183 GSECSGFGWNEEFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVA 242
           G  CSGFGWNE  KC++VE+ +FD+WVK H  A+GL NKPFP+F +L  VFG+DRAT   
Sbjct: 195 GPACSGFGWNEGQKCIEVEKPVFDDWVKGHPNAQGLLNKPFPYFYDLEVVFGRDRATGGR 254

Query: 243 VETPGDMASNYDVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSAT 302
            +TP +M+S    +T  +D++I + +D  +P P    G     GE+ P TPTS  H  A 
Sbjct: 255 CKTPVEMSSQTARDTEEDDMDINL-EDFDIPNP---HGLEPPSGEDMPSTPTSMTH-DAG 314

Query: 303 TSRGSKRKRSSYQSEMLDVVRATMDMQGTQLERIASWPQSNVDIDDS 333
           +SR SK KR SY  +++D  RA+M     ++ +IA+W +  ++I+ S
Sbjct: 315 SSRPSK-KRRSYSGDLMDTFRASMRETSKEIGKIATWQREKMEIESS 354

BLAST of Tan0002976 vs. ExPASy TrEMBL
Match: A0A5A7SWD8 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold515G00010 PE=3 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 1.6e-45
Identity = 153/508 (30.12%), Postives = 206/508 (40.55%), Query Frame = 0

Query: 3   TTGGLEGIEYVDVEEMVAMFLHILAHDVKNRIIRTQFARSGETVSRH-NSVLSAVLQLHE 62
           T  GL   E VDVEEMVAMFLHILAHDVKNR+I+ +F RSGET+SRH N VL AV++LH+
Sbjct: 81  TIAGLTSTEVVDVEEMVAMFLHILAHDVKNRVIQREFMRSGETISRHFNMVLLAVIRLHD 140

Query: 63  ILLKTPEPITNSCTDSK----------------KFNIP---------------------- 122
            LLK P+P+ N CTD +                K N+P                      
Sbjct: 141 ELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASDRARYRTRKGEVATNVLGVY 200

Query: 123 ------------------------------------------------------------ 182
                                                                       
Sbjct: 201 DTKGDFVYVLTGWEGSAADSRILRDALSRPNRLKVPKGYYYLVDAGYPNAEGFLAPYRGQ 260

Query: 183 -------------ESVTFEVFSSKFSQPR------------------------------- 242
                         S + E F+ K S  R                               
Sbjct: 261 RYHLQEWRGPKNAPSTSKEFFNMKHSSARNVIERAFGVLKGRWAILRGKSYHPVEVQCHT 320

Query: 243 -IPQSVLRPEDSREDSN----------------------------------------DNG 302
            +   +L    +RE +N                                        DNG
Sbjct: 321 ILACCLLHNLINREMTNFDIEDNIVSMTSSSRLPKHTWTKEEEAGLVELVNAGGWRSDNG 380

Query: 303 TFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGWNEEF 324
           TFRP Y++ L R++  KIP C++ + STID +++ +K+ +  + EM G  CSGFGWN+E 
Sbjct: 381 TFRPGYLNQLARMMAFKIPGCNIHA-STIDSRIKLMKRMFHALAEMRGPNCSGFGWNDEK 440

BLAST of Tan0002976 vs. ExPASy TrEMBL
Match: A0A5A7TFA4 (Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold93G00550 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 1.8e-44
Identity = 96/233 (41.20%), Postives = 139/233 (59.66%), Query Frame = 0

Query: 115 NDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGSECSGFGW 174
           +DNGTFRP Y++ L+R++ EK+P C +++ + IDC+++ LK+ +  IVEM G  CSGFGW
Sbjct: 35  SDNGTFRPGYLAQLVRMMAEKLPGCQVRATTVIDCRIKTLKRTFQAIVEMRGPACSGFGW 94

Query: 175 NEEFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFGKDRATEVAVETPGDMAS 234
           N+E KC+  E+ +FD WV+SH  AKGL NKPFP++DEL +VFG+DRAT+   ET  D+ S
Sbjct: 95  NDEEKCIVAEKELFDNWVRSHPAAKGLLNKPFPYYDELTYVFGRDRATDRFAETFADVGS 154

Query: 235 N-----YDVETLGEDVEIRVSQDHYVPEPPFVDGTSNLDGEENPETPTSRFHMSATTSRG 294
           N     YD   +G+  E           PP      ++  ++   +  SR     T S G
Sbjct: 155 NEPGGGYDRFDMGDGNE---------DFPPVYSQGVDISQDDVRASRPSRASEGRTGSSG 214

Query: 295 SKRKRSSYQSEMLDVVRATMDMQGTQLERIASWPQSNVDIDDSRRTKVSNILK 343
           SKRKR S +   L+ +   +D    QL +IA WP  N+  D+  RT+   IL+
Sbjct: 215 SKRKRGSQRDFELEAIHLALDQTNEQLRQIAEWPARNLANDNHVRTEFFRILR 258

BLAST of Tan0002976 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 5.9e-08
Identity = 25/110 (22.73%), Postives = 57/110 (51.82%), Query Frame = 0

Query: 108 EDSREDSNDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGS 167
           + +R  +   G FR +  + ++ L   K    S   +  +  + ++L++Q+  I  +L S
Sbjct: 200 DQARRGNQIEGVFRKQAWTEMVNLFNAKFE--SNFDVDVLKNRYKSLRRQFNAIKSILRS 259

Query: 168 ECSGFGWNEEFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFG 218
           +  GF W+ E + V  + N++ +++K+H  A+    +P P++ +L  + G
Sbjct: 260 D--GFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVLCG 305

BLAST of Tan0002976 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 56.2 bits (134), Expect = 5.9e-08
Identity = 25/110 (22.73%), Postives = 57/110 (51.82%), Query Frame = 0

Query: 108 EDSREDSNDNGTFRPEYISHLLRLLREKIPNCSLQSISTIDCKVRNLKKQYTTIVEMLGS 167
           + +R  +   G FR +  + ++ L   K    S   +  +  + ++L++Q+  I  +L S
Sbjct: 200 DQARRGNQIEGVFRKQAWTEMVNLFNAKFE--SNFDVDVLKNRYKSLRRQFNAIKSILRS 259

Query: 168 ECSGFGWNEEFKCVDVERNIFDEWVKSHTGAKGLRNKPFPHFDELAFVFG 218
           +  GF W+ E + V  + N++ +++K+H  A+    +P P++ +L  + G
Sbjct: 260 D--GFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKDLCVLCG 305

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
TYK26842.12.3e-6244.38uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa][more]
KAA0033487.11.1e-6144.07uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa][more]
TYK07921.17.6e-5841.79hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa][more]
KAA0034843.13.3e-4530.12retrotransposon protein [Cucumis melo var. makuwa][more]
ADN33754.13.3e-4528.75retrotransposon protein [Cucumis melo subsp. melo][more]
Match NameE-valueIdentityDescription
A0A5D3DTL01.1e-6244.38Myb_DNA-bind_3 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ... [more]
A0A5A7SW625.5e-6244.07Myb_DNA-bind_3 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ... [more]
A0A5D3C7T43.7e-5841.79Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A5A7SWD81.6e-4530.12Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A5A7TFA41.8e-4441.20Retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
Match NameE-valueIdentityDescription
AT4G02210.15.9e-0822.73unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G02210.25.9e-0822.73unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 268..292
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 255..297
NoneNo IPR availablePANTHERPTHR46250MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEIN-RELATEDcoord: 115..305

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002976.1Tan0002976.1mRNA