Cla97C02G036070 (gene) Watermelon (97103) v2

NameCla97C02G036070
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionU11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2
LocationCla97Chr02 : 14837220 .. 14875277 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGATGCTGCCACAAAAGATGCCATATCTAAGGACCATAGGAGTAAATATTATACTGAGGGTATGAGAGCTAGGCCTCTGCATTTTGGTGAGCCAATTGCTGCAAGGCTTGGTGTGGACTATCCATTCCCTCCTCATCTTGAGTACGGTGCTCTCTTACTAATATTTAATGTTTTCATCTTACATTATAATTTCTAATGTATTACTGTGCATGCTAGGTCTACAAATTCTTTAGGGTTGGGAGTAGTTGAAGTTGATGACTCTTTAATATATTATTTTCTTGAAAATTCAAGATGTTCTTTAAGTACTAGACTGTCGAACATTTGTATTAAAGTTTTCAGCATCTCCACAATTAGACATACAATAGTTTAAACGAGCCCCACAAAAGGTACTAAATCAACATTTTAGGTCTAGGCTTGTTCTACTCGAATCAAATATACTTCCTTGTTATATATAAATAATGTATATGAGAAAGGACATGCCAATTAGGGACATAAGTGAAAACTTAAAACTTCGTAAAAAATGAAGATAAAATTGGGGATTTACAGTGGTGGCTTAAAATTAACAAGAATGTAACATAAATAAGAGTTGGTGTGGGTTTAATGTAAGAAGCTGAATACTAGTGATGTGCTACTAAGGGAAAGCCCACATAGGGTTTTTATTTATTTATTTTTTATTTTTTTATTGAATTGATGGATTTTAATGCTACTCGACTCCAAAACTAGTAGAGGAAGGAAGCGATGATCGCTTGAGACTGTTAAGTGTAATCCTGTTAGCCAATGAAAGGACTACCTTTCTTAGACTTCAAGCAATGATACAGAGCTAATGTAATGAGACCCAAGAGGATCTTTACCTCCTCTTTTCCAGAGACTTCTATCTCGCCTCCATTTAATAAGTTCTTAGCACATTTGGTATTAGTTGGTTTTCTCCTAACATTGGTCTATCACATTTATAGCGTCTGTGTCATTATGGAAAATATGGCTTGAGAGGTATCTCTTCAACGTAGGAAATCGGATTCCAATTTCTATAAATCACTGGCCCACCAAATGGTCCGTGGGGATTCACCTACAAATTAAATTTGATCAGAGGCGATAGAATCCATTTTTGGGAGGAGGATTTGTGGATTGAAGATAGGGCGCTCAAGTTTTCTTTCCCAAGAATCCACAATATAACCTCTACAAAGAAGTTATTTTTTATTTTAATAATAAATAAGTGCTGGTCTGCTTCCTCAAATGGTTGGGTGGTGCTATTAAGAAGGAATTTTAAGAGATGTTGAGCCCTTGAATGGATGGATTTCCACAATGCTATAGAGACAATTCGAATGTCTAACAGAGAGGATTCCAAGTTGTGGATTTGATAAGATTCCAACCAAACTTTCTTATTGAATTTCATCAAAATTACAGCCCTACACTAGAACACAAGGAAACTGCCCTCAGTCTCCTTCCCCAACTGCTGTATAAGCAATTCCCGAAAACATAAAAACTGTCTTCCCCCTCTCACCTATTCCTCCCTTAACTACCTCCCTCCCCCACAATAAAACAAAGACATAAATAATTACTAATTAAATTAACTACTAATTAAGTTTGGCTAGAAGAAGCCACACTCCCACACACCATACCCTTCCCTTTATTTCTTTTTCTGCTGTAAGTTAATATTATAGGAGGTCTAACACTACTCTCCCCTTCAAGAGCCACCTTGTCCTCAAGGTGAAAATCTGGAAATTGTTGTTTGAAATCATCAAAAAGTTCCCAAGTCGCCTCATGGGTCGGCAACCCTTTCCATCCTACCAACACCTCCCAAACATTGGTTGTCGAATTCTTATGATATCCAAAAATCTCAGATGGTTCAGTTAACCGCTCATGGCTTTCCGACAATTGTTGCACCATAGGTTGAATAGTATGATGTTCTCCCAAAACCTTTTTCAACTGCGACACATGGAAAACCGGATGTATATGCCAATGATGGTAACTCTAAACGATACACGACCTCCCCAATTCTATCCAACACTTTAAAAGGCCCAAAATACTTAGGAGAAAGCTTCTCGTTCCGCCGTTGTCGCAAAGTTGACTGACGATATGGTCTAATCTTTATAAACACCCAATCTCCAACTTGATGCTCAACCTCACGCCTCCTCCTATCCGCATTTTTCTTCATCTTCTCTTGAGCCACTCGTAAATGCTCCTTCAATGCTCGTAATGCCACATCTCTCTCCTTCAATTGTTGATCTAATGTGGAATTAACAATAGTAGTATCTCCATAATATAAAAGAGGAGGTGGTAAACGACCATAAACTGCTTGAAATGGTGTAATACCGATGGAACTATGGTATGTGGTATTGTACCAATATTCAGCCCAATGTAACCACTCAACCCACTCCTTCGGCCTTTCACTACAAAAACATCTCAAATACGCCTCAACGCTACGATTCACCACTTCAGTTTGACCATCTGTTTGGGGATGATAAGCTGAACTTTGTCGTAATTTTGTACCTGCCAACTTGAATAACTCACGCCAGAAATGACTTAGAAAAACCTTATCTCTATCAGATACAATAGAACTCGGAAACCCATGTAGGCGCACGATCTCCTTTACAAATACTTCTGCTACCACCTTAGCATTGAATGGATGTTTCAGTGCAATAAAGTGACTATACTTACTCAATCTATCAACCACTACGTAAATAACATCATATTCCTTCGGTTTTGGTAATCCTTCAATGAAATCCATGGAAATATCACTCCAAATAACATCAGGAATCTCCAATGGCGTCAATAACCCAGCAGGGGATAAAGCTAGAGTCTTATTCCGTTGACACACTATACATTCTTCCACATACCGTTGAACGTCTCCTTTCATACCCTCCTAGTACAATTCATCGATCAATCTCTTATAAGTACGGAGATACCACGAATGCCCCCCAAATACAGAATCATGATAAATGTGTAACACCGTAGGAATAAGGGTAGAAGATCGAGATAATACCAGGCGTCCCTTGTATTTCAAAGTTCCCTGTTGCCACACAAAACCAAGAACACTCTCAACATCCTGCTCCAACTTTGCAATAATTTCCTGCAACTTCGGGTCATTGGAAACTTCGGTTTTAATAATCTCTACATCGATGATAGTAGGGGCTGATAAAGAAGCCAAATGGACAGTAGGTGGAATATGAGATAAAGCATCAGTAGCCTTGTTTTCCAACCCTGGTCGTAAACTACCTGGAAACCATAACCCAACAACTTTGCAATCCATTTCTGATGTTGTGGTTGTATCACTCTCTGCTCCAACAAAAACTTCAAAACCTGCTGATCAGTCTTCACAATAAACTTCCTACCCAACAAATAAGGTCTCTAACGTTGCACCGCCATCACTACGGCCATCAATTCTCTCTCATATACTGGTTTCCCCCTATCTCGTAGGGACAAAGTGTGACTATAATAAGCTATGGGCCTCCTCAATTGTGTCAAAACTGCCCCCAATCCAAAACCAGAAGCATCCGTTTCAACCTCAAAAGGTAAATCAAAATCAGGTAATGTCAAAACGGGCAATGTCATCATAGCATTTTTCAGACGTTCAAAAGCAACTTGCGCATCTGCATTCCATTCAAAGGCTCCCCCTTTCAACAACTGACTCAAAGGAGTTGCGAGGCTACCGTAATGATGAACAAAACGCCTATAATACCCAGTCAATCCCAAAAAACCATGCACCTCCCTGACGTTTGTAGGTACAGGCCATTCTAACATTGCTCGAACTTTTTCAGGATCTGCCTCTACACCTTTACCTGATATCTAATGCCCCAAATATTCCACCCTTTCCTTAGCAAACTGACATTTAGTCTTATTAGCATACAATTCATTGTCCCTCAATATCAGAAAGACTTTCTCCAAATGCTTAACATGATCCTCTACTGTTCGACTATAGACTGAAATGACATCAAAGAAAACTAGTACGAACCTACTCAAATATGGTTTAAAAATGGTATTCATAAGTGCTTGAAATGTTGATGGAGCATTAGTCAATCCAAATGGCATTACCAAAAACTCATAATGGCCCTCATGAGTCCGAAAAGCTGTTTTCTCCACATCTCTCGGGTACATTCTAATTTGATGATAACCCGACTTCAAATCGATCTTGGAAAATAATGTGGCACCATTCAATTCATCAAACAATTCTTCAATTACCGGAATGCAAAACTTATCAGGTATAGTAACATTATTCAAAGCTCGATAATCAACACAAAATCTCCAACCTCCGTCCTTCTTCTTAACCAACAAAACAGGACTCGAATACGGACTTGTGCTAGGACGAATAACTCCTGATGCTAACATTTCATCCACCAACTTCTCCATCTCCTCCTTTTGATGATAGGCGTATCTATAAGGTCTCACATTAACTGGATCCGTATCTATAAGGTCTCACATTAACTGGATCCGTGCCCTGCTTCAGATGGATGTGATGTTCAATACTCCTCCTTGGTGGTAATGTTTCCGGCCACTCAAACACATCAACAAAGGTATCTAGCACTGCTGGAATTGAATCAGAAATAGTATACACTTCATCGATACCATACCATTTAGCCAACGAAACTCCGCCCTCCATGAACCGACATTCGATGAGAAATCCCTGATCCGCTCTGTCCATGATCGTATCATACTTTTTAAACTCACCCGCGACTTTGTGAGACTAGAATCTCCCTTTAATACCACGGTCTTCCCATCCTGGTTAAATGACATTGTCAAATTCTGCCAATTAACTTTTGTCATTCCTAAAGTATGTAACCACTGCATGCCTAGTATTACATCCACTCCTCCCAATTCTAAAGGTAAGAAATTACCACAGACAATAAAATCCCCAATCACTAATTTCATCACATTACAGATACCTTTTCCTTTAATTGCGGACCCCGATCCCAAAATTACCCCATAATTGGAGGTTTCCACAATTGGTAGGCGTAGAGACGAAACTAATCTTTCAGCAATAAAATTATGTGTCGCTCCACAATCAATTAGCACTATAACTGACTCTCCTTCAATGTGACCTCTCATCTTCATAGTTCCCAGATTGGATAATCCTACAACAGAATTAATTGACAATTTCGCTATGGCGGTCATAGTGTCATCTGTTTTAATCATCTGTAACTCTGTCTGAACCTCCTGATAATCCTCATCTTCAACAATTTCCAAATCGTCAACCCCATCCGCAACAACCAGCACCCGAAGTTCCCGTTGATCGTGATTTTTACAACGATGACCTGTTGAATACCGTTCATCACAACGAAAACACAACCCTTTATCATGTTTGGCCTGAAATTCAGCATCTGTCAATCGTCTCGCTAGTCCTTCCCGTTTGTTACCCACAATCGACAATCCCCTTAATGTTATCGTCCTCATTGTAGGCGTTTCCGTCGTCTTCCCTCCTTCCTTCAGCCCAGCAGGCAAGGACGAATCCTTCTGATCAAACAGCTGCGGTCGTCCAACTGATTTTGTGCCCAGACCAACATCAAAACGACGCAATTCCCTATTTTCAACCTTCTGAGCCAACTTCATCATCTGTGCCAGACCGACTGGTTCCCAACACTCAACCTCAGCTTTGATCCACGGCAATAGGCCATTCATGAACGTCTCTTCCAGAACCTCATTTGACAAATGTGGCAATGGTGCAACCAATTTATCGAACAAATTCCGATAAGCCTCCACGGTAGTCTCTTGTTTGATGGCTAAAAACCGACCACACAACGACCCCTCCCTCTCAGAACGAAAACGCATCAACAAACGACGCTTCAAATTTGCCCAATCCGTAAAAGAGTCGTGTTCTTCCTGAGCCCGATACCAGTTGAGTGCAACACCATCAAAACTAATGACTGCCACCGTCATCTTCTCCGTCTCCGACAACCGGTGTATGTGAAAATATCGATCTGCTCTGAATAGCCACCCATCCGGATCGATTCCGCTGAACACGGGCATTTCCACCTTCTTGAATTTACTCCGATCCATGGCCGCGTTAGAAGATTCCGCGTTTGTTTGATCAGTTATCGACGGTTGCGGCAACATACTTGTGGTACGACTGCCCGACTCATCCCCTGCCGGCAATTTCTGACCACGGGCTAAGTCTACGGGTATCGTAGTCAACATGATTTGCTGCTTTTCAGCTTGAACTCCCAACTTTTCAATACGGACAGCGAAAGTAGAGATTCTTCAATACCGGGAAGTTTCTGAATTTCAGCGTGAAGTTCTGAAACCTCTTGATCCATTGCTTCCATCCGCTCATCAAATTTCTGCTGCGCCATTGTACCGTGCTTCCCCAGGGTGATGCTCTGATACCAATTTGATAAGATTCCAACCAAACTTTCTTATTGAATTTCATCAAAATTACAGCCCTACACTACACAAGGAAACTGCCCTCAGTCCCCTTCCCCAACTGTTGTATAAGCAATTCCCGAAAACATAAAAACTGTCTTCCCCCTCTCACCTATTCCTCCCTTAACTACCTCCCTCCCCCACAATAAAACAAAGACATAAATAATTACTAATTAAATTAACTACTAATTAAGTTTGGCTAGAAGAAGCCACACTCCCACACACCATACCCTTCCCTTTATTTCTTTTTCTGCTGTAAGTTAATATTATAGGAGGTCTAACAGGATTCTTCAGTGCTGGTCTGCTTCCTCAAAAGGTTGGGCAGTGGATGGCTTATCTGTGAAGACACAGGCATTGTTTTCCAACTTGAGATTCCAATATAAAGCTTGAAGATGACTGAGCTAGAGGATCTTTTGGTCCTTTTTTGAAAGGATGCTTCAAAAGAATCATAAATATATGAGTAAAGATGTCATGGCTTTGTTATCTTTACTTGTGGGCTCCCCCTTCAGTGATTGGGGTATATCTGCCTTTGGACTCCTTGCCCTTCCAAACCTTTTCCTGTTAGCTTCTTCTGTCGTTGTTTGGCTGGCTCCTTTCAGTCAAATGTCTCCACTTTTCCTCTATGTGGAAGGTAAACATCTAAGAAGGTTCAGTTTTTTTGTGTGGCAAGTTATGCATAGAAGAGTTAACATCTTGGATACAATATTGGCTACTAGGTCCTCTTTGGTTGGGCTCATTTTTTCTCAAAGAAAGTTTGGTGTATCATTATAAAAAAAAGTCCAAAGAGAAAATACATACAATCCCTAATTGGAAAATGTCTAAATATTCTTGGCATACCATTTGTCCTGTATCACCTGTTTTTCATGTCTTGATTAGATTTGTTTCCTTTAAAAGTACTATTTTACTTTAGGTTCTTCATTATCCATTGTAATTGATCTGATCAAAATTGTTTGTTCTTTACTAAGTTTTTGTGGAGACAATTGTATTTGGTTGTGCATTCAATAAGATTTGGTAAAAACTTTTTTTAGGTATGCTTATCCTCCACCAGATGGTAATATCCTGACCAACATTGTTAATGCCCTCATAGCTGTCCCTCGCTTCTATACTCAGGTTACTACAAATGAAATGATAATTTATACGTTTAATTGCTGGAGCATATTTGAAGTAAATTCTTACAACTTAGTTTTTTAATTGTAGGTGCTGCATTTAATGAACAAGATGAACATTCCAGCTCCATTCCGATTGCCTCTACCCACTCCACCTCTCCCACCTTGCATTCCTGCACCACCATCACCATCATTAACGCTGCCACCTCTTCCTCCTCTTGCCACTCCAAGGCCTCAAGTAACAAATATGTCCAGTGATGAGTCAGAAATGGAGTCCTCAGAAGAGGTAACTTGCCTAGAGCTGGTAAAATTATTTCACTTTTCTAGCTTGGATGAAATTCTGTCTTCAATTGGAGCCCAAAAGTCTGCTCTGACTAATACTAAAGGTATATCCAAATAAGATTATGACCATTTTTGTTGGGAGTATTCCGTAAGAATCAGCTGATTAATTTTGAGCTATGGTGTGGAGGACAACTCTTTAGTCTTTTAGTTAGGACCTGGGCAACAATTTTGCAAAGTCTTTTAGCTAGGACCTTGCAACAATTTTGTACAGACTTGTAGTGAGGTTACTTGGCTGATCGTCAACAACGTTGAAGATCCCTCCTCCCCCTTTTTTTTTGGTGGAATCATATAAAGGTAGGTCTCATTACACAAGAATTAAGGATTCCATTCTAAAAAAATCTTGGGACACCTCAACCATATCCTCTTTGACGATATTCCAAGTTTGGACCCTCTCCCATCTTTTATAAATTTCATATATAAATGAAACTTGTTCTTAAAAAATATTTTTTCATAAATTTCCCCATATATCCATTGGGCCCCATAAATCTTGTACTTCTAAAAAAAAATTATTTTATTGTTGTTGAAATTATACTGAGAGTGTGTCAGCAAAATTTATTCACCTTCCTTTATTCCTATGGGCGATGATGTCAATATTTCTTTTGTCAATTTTGTTGATGTTGATGTCATCTACAATTTTGTCATCTAAAATTTTTAGGTGCATCCTATCTTTTATTTTTTCATGTTGATAAATGTTCTTTCCTTTGTCTTTGTCTTCTTAATTCTCCTGTACAATTTTATTTCATATTCTATTTGTTTCTTGTTCTAGAATGTTTCTAGCTGAGACGAGGTATAATATTGTACTCCCATTATGTCTTGGAACTTACTGATTAATTGGATTTAAGTTTGCATCAGGATGTGGAAGGAAAATATAATTCAAGTGGGGTTACAAAAACTGCAAAATCAGGAAACAAGCGTTCTAGGTGTGAAACGATTGTAGGCCCTGGGATAGACAAAGATGTGGCTCATGAGGCTGTTGGGGTGAAACCTTCCACATTGGTTCCGAAAGAGATTCCTATGATAAAAAAGAAAAACCCAGTGATTCAGGTTGCTTTTATCATAAGATGTAACTAATTGGCATATGATCTTATTTATATGATTCCTTCGCCCTTGTTTAATCATTGCAGATTAGAATTGCCCCTAAAGTGATTCACGATGGAAAAGTCAATAACATTAATAATGATGGTGAAGAGAGAGAAAAGGAAGCAAATGATCTTAAGCCCTATGCAACACTCGAAGATTTAGAGAGAGGAAAATTGCCTCCAGAAGAAATTCTTTCACTTCCAATGTTCAAGGTACTATACCATGTGATATTTCAATGTTTAAGTATGAAACTAATGCATGTTGCTTGTTCCTTGGTTCATTGAAGCTCCATTTTTTATAAATAAACAAATTAGTGACAAAAAAGAGCTTTGTTTTAGATTTTTGATAGACTAACAACTTTCATTGAGAAAAAATTGAAAGAATAGAAGGATATACGAAAAAAAAACCCACAAAAACACCTCCTCTAAAGAAAGGGATTCCAACTAAGTAAGATGTTACCTAAAGAATAACTATAAAAAAGCTTCGATACTAAAACCCAAAGTAAAATATGAAACCTCACTAAGGACCAAACCTCACTAGGGTGATCCATACCTTAAAACCCAAGAAGAATAAGAGTAAATATTGTATTGATATTCAAAATGAGGGGTTGGAAGCCTATTTATACATAAGTTTGGAAACTGAAAATAATTGTAAAATAATACTTAAAATATCTACAACAGTTCTTCCCTTCCCAGAAAATAAACTCGTCCCGGGACGATTGTGATAAAGTGAATTGGTTTTGAGCATGATATGGGAATATGTCTGAGACATTGAAAATGTTGCTGATCTTCACTGTTGGAGGAAGATCAATCGTGTATGCATAAGGGCCAATTCTTTCTAGAATAGGAAGTGGTCCTATCTTCTTATTTATGAGTTTGGAGTGTTGGCCAGTGGGTAGTCTACTTTTCTTCAAGTGGATCATTACTAAATCTCCAATCTGGAATTCCACAGCTCGTTTGTGCTTATCTGCTTTGGACTTATAAGAAGGGTCGGTTGCTTCTAGGTGTTGTTGAACTTCTTCATGAATTGTTTTTATTCGATCAGCCATAATGGTAGCCTCGGCACTAAGATCAACGTAGATGGGTAGTTTAGCAAGATCTAATGTTAATCTAGGAACATGAGTATATACAACTTCAAATGGGCACTTCCCTGTTGAGCCATTAGTCATATTGTTGAATGCAAACTCTGCTTGAGCAAGGTAAAGATCCCATTGGCGAGGTCTATCTCCACTTAGACTTCTTAAAAGATTGCCGAGTGTGCGGTTTGTAACCTCTGTTTGTCCATCAGTTTGAGGATGGTTTATTGTGCTATATAGGAGGTTTGTTCCAAATTTTTTCCATAATGATCGCCAAAAATGACCCATGAATTTGACATCCCTGTCTGATACTATGCTTTTAGGTATACCATGAAGACGGACAATTTCCCTTAAAAAAAGATTGGCAACATTTAAAGCATCATTAGTCTTTTTGCAAGCTAAAAAATGTGCCATTTTGCTAAAGCGGTCAACAACCACTAAAATGGATTCAAAGCCTCTTTGGGTCCTAGGTAAACCAAGGACGAAATCCATAGAGAGATCCTCCCAAATATTGTTTGGTATTGGTAACGTGGAGTACAGACCTTGGTTTAAACTAGCACCCTTGGAAGATTGACAAATATAGCAACGTTTAACAAAATTAGCAACATCCTTTTTAAGTTGTGGCCAGAAAAATTTAGAGGAAAGTAGAGCCAAAGTTTTATCTCGACCAAAGTGGCCAGCAAGACCTCCCGAATGGGCTTCTCGTAACAATGATTCATGAAGAGATGTTTGAGGGATACATAAGGTGTCATTTTTAAATAAAAACCCATAAAAAATATGGAAATCTTTCGCTGGAATATGATTTGACCATGCTTGCCAAATATTTTTGAAATCCAAATCATTAGCACATAATTCAGGTAGATGATCAAAAGCTATAATTTTTCCACGAAGTAAAGTTAGAAGTGTACCTTTTCTGCTTAATGTGTCTGCAAATTTATTAGCTTTACCTGGAGTGTGTTTAATGACGAAGTCAAACCTTTGAATGAATTGGATCCACCTAGCATGCATACAGCTAATTTCCTTTTGAAATTCCAAAAATTTCAAAAAAAAATGATCTATTAGCAACACAAATTCTTTCCCAAGCAAGTAATGTTCCCACTGTTTTAAAGATTGAACAAGAGAGTAGAGTTCTTGTTCATAGGTTGACCATTTTTGTCTGGAAGGGCTCAATTTTTCACTAAAAAATTCTATAGGGTGCTTGTCTTGACTTAGCACAACTCCAATACGGATTCCAGAAGCATCTACTGCGACCTCAAATGGTTTATTAAAGTCTGGAAATCCTAGGACTGGAGTTGAGCTAAGCATTCTCTTTAATTCAGAAAAACTTTGTGATTGTTTCTCCCCCCATTTGAATGGAATATTGGCCTTTAAACAAGCTGTCATTAATGCTGCAATGGTGCTAAAATGTGGGATGAATTTTCTATAAAATGAAGCTAAACAAAGAAAGCTTCGGATGTCTTTGGTAGTTTTTGGTTCTGACCATTCAACTATCGCCTCTATTTTTCTCGGGTCTACACTAATTCCCTTTGGGCTGATTAGAAATCCCAAAAAATAAAGTTCAGATTGAAGAAAAGAACATTTTTTGAAATTTGTATATAATGCATTTTCCTTTAAAACAGAAAAAACTGAATGTAAATGTTCTAAGTGTTTGTCATAGGAAGAGCTATAAGTTAGGATGTCATCAAAATAAACTACAAGGAATTTATTGCGAAATGGTAACAAGATTTGGTTTATTAAACGCATAAACGTACTTGGGACATTTGACAAACCAAAGGGTATCACAAGCCATTCAAAAAGGCCTTCATTTGTTTTAAAAGTCGTTCTCCACTCATCCCCCGATTGGATCCTAATTTGGTGATAACCACTTTTAAGGTCAATTTAGAAAAACTGGAGGCTCCTCCTAGTTGGTCTAGCAAATCAGAGAGTCGAGGTATTGGGAACCTATATTTTATAGTTATTTTGTTAATGGCTCTACTATCCACACAAAGCCTCCAAGATCCATCTTTTTTTGGTGCCAGTAGAGCAGGGACAGCACATGGGCTAAGGCTTGGTTGTAAATGTCCTTTATCTAATAAGGTGTGGATTTCATTGTGCAAAGCTTCATATTCTTTTGGACTCGTGCGATAATGTGGCAAATGAGGTAAGGAGCTCCCTAGTATTTGATCTATATGATGTTGAATGTTCCAAAGTGGTGGCAGAGAATTGGGTTCTTCTATTAGAGTAGGGAATTCATTTAATAGGTTGGATATGTGAGGATCTAAGATGTTAGAATTTGGACTCTCAAAAGGGTCTTGTTTTAAAATAAATGCTAAAAGTGTATTGTCTTTTTCTAATAGTAGATCCTTACCTGGACAAATGGAAAAGTTGTTTTTCAGTTGTAGAAGGTACCTTAAGAGCTTGCTGTCATTCTTACCGAGGGGAAGGAGAACCACCTTCTTGCCCATCCATCTGAATTCGTAGGTGTTGTCTCGTCCATTATGTGTTGTATTGTTGTTGTATTGTCATGGCCTTCCCAAGAGTAGGTGACACATCCATTTCCAACACATCGCATGTAATTTGGTATTGGTAAGTGTTTCCAATTTGGTCTTTAACCATTGTTTCTCCCCCTTTTTTGATCCAGCTAACCCTATACGGATTGGGTGTGCTTTAGCCTTCAAATTGAGTGCAGTAACCAACTTTTTTGACACAATATTCTTGCTGCTGCCACTGTCTATAATTATTTGACACACCCGACCATTTATAGTACATCTTGTCCGGAATAAGGAGTGTCTCTGAGGGTGGCTTTCAGATTTAGGTGTGAGGAGCACTTGTTGAAGCACACAAGACAATTGATCACCATCGTCAGGTTGTAAGTAAGCCAATTCTTCATTATCCTCTGAATTGCAGCCGTCTGCTCCATCCTCTTCTTCTTGTACAGCCAACGTCCTGCGTTGGGGACATTCATTAGAAAGGTGGCCTTGCTGTCCACAACGAAAGCACTTTGCTAATGTTGGTGGATTGTAGGGATTATTGTTTCTTCGGGCAACGCTTGTATCATTGCTTCGAATAGGTTGCTGTTTTAGAATTGAATCATCTTTCTTGGAATGTTGGTTCAATGTATTTCCAGAGATATGGTTCTTGAAGTTTCAATAGGTGTTTTTCTTGAAGCAGGTTGCTGTCTCTCCCATGCGTTTTTTCTTGAGTAGGTCTTTGTTTGCTTAACCCCTTGATGTATTAATAATTGAGTTTACTTTTTAATCTTTTTAGTTAGTAGCCGGTTGCTATGTATAACGGGTTTGTTAGCTAGTTAGTTGCATTTTAGTTTGTTGAGATCCTTGCCTATAAATAAAGGCTGGATCCCCTCTTTTAATCATCAATAAAAATACAATTTGAGTTGAGAATTTTCTCCAAGAATTGGTCTTACACCAATTGGTATCAGGGCCCCAAAAATCCTTGGAGCTTACGGCAGGGAAGAACGAACAATGGTGGCAAGTCCTAAGGAACTAGTAAAGAAGCGGAGTCTACCACTGCCATATCACCAAGAACTTTAACCGGACGCTTGCTGTCTATTGAAGATTCCATTGGTGGTATTCCTAGGTTACTAGAAAGAATGTCGACATGATTAGAAGAAATCGCCATTGAAGTTCAAGCCACTCGACCTCCACCAAATAATCAAGAAAACCAGCTAAGATTTATCAATCTACCTCAAGAAAGACAAAATGTTCATGTGCGGCAGTTCAACCAACCAAGAAACATCCAAGAACCCCATGCTATTCATCTAGAACACCTAAGAATGCAAGGTCACCGTGGGAGACAATGGCAGTTCAACGATTCTTCAAGTGAGGAAGGAGAAAACGATGAAGTTAGAAGATGGAGAGCTGGAACACAGCATCAGCACGAGCCGAATGAATACAAAGTCAAGTTAGACACATCCTCACATTGAAGAATTCTTGGAATGGATTAAAAATGTGGAAGGGTTCTTCAAATATGCCAACACCCCCAAGCATAAGAAGGTTCAATTAGTCGCCTTAAAATTGCATGGTGGTGCTTCTGCATGGTGGGAGCAATTGGAAGTAAACCGATATAGATGTGGCAAAAGTCCAATCACAAGTTGGGAGCAAACGAAGAGGTTGATGAAAGATTGATTCTTGCCAACTAACTATGAACACTATAATCAGTACCAAAATCTTCAACAAGGAGCTCGATCCGGGACTGAATATACTAAAGATTTTCATTGATTGGGAGCAAGAACCAACTTACCTGAAAGTGAACAGCACCTGATAGCAAGATTTGTTGGTGGTTTGCGGTCATATATAAAAGAAAAAGTTAAGCTGCAGCCGCTGGGTTTTCTTACAGACGCGATTTCCTTGGCAGAAACAGTGGAAGAGCTACAAGAATCTCAATCCAAGAGGTTCAGCCGAGGAAATCAGTGGGAATCCTCATCAAACAACCTGCAAAAGGGGAAGGTAACCGCTGCAAAATCCTCTGCCAATAGCTCTGTGAAGGCCAAAGAAACAGAGGCAGTTTCAAATAAAGATGTAACCCCTCCATACATCAAGAAACAGAGGCAGCTCAATCCGTAATGGTAGGCCAACTTTGGGAAAATGTTTTAGATGTGGGAAGAGTGGCCATCTATCCAATAATTGCCCTGATAGAAAAACAGTAGCCGTGGTGGATGATGAAGATTGCCAAGATCAAAAATCCGACAACCAAGAAAATGAAGAAACTGATTTCTTGGAACCGAATGAAGGAGATAGAGTGTCATGTGTTATTCAACATATTCTTCTCTCCCCTAAATCAGAATCTCCAAATCAAAACACTCCTTATTTAAAACAAGATGCACGGTAAAAGGAAAAATTTGCAATGTTATCATAGACAATGGGAGTTCGGAAAATGTGGTGTCCAAAAAATTAGTTAATGCCTTAAATTTGAAGTCGGAACCCCACCCAAATCCATACAAAGTCAGCTGGATAAAGAAAGGAGGTTAAGCTTAAGTAACACATACTTGCACGGTACAACTTTCAATAGGAAACAGCTACAAAGATCAGATAATTTGTGATGTTTTGGATATGGATGTTTGTTATATCCTGTTGGGACGCCCTTGGCAATACGACATTCAAGCAATCCACAAAGGGCGTGACAATACATATGAATTCTCTTGGCATGGAAGGAAGATTGTGCTTCTGCCATTACCAAAATCAAACTCACAGCACTCTTCCAAAGATCAAAATGGTAAGCTCTTTTGCATTCTTAATTGCAAACAATTTATCCAACTGTGGTGTGGGGCTTAGTAGCAAAAGGAATTTCTCCTCTTGATCCATCTTCTAAAAAGTTCAAGAATTACTATCCAAATTTCCGAATATTACCAATCCACCAGCCACTTTGCCTCCTTTGAGGGATATCCAACATCAAATTGATTTTTTACCGGGGTCCTCTCTTCCTAACCTTCCTCATTATCGAATGAGTCCAGCTGAATACAATATTTTGCATGAACAAATCCAAAATTTGCTAAATAAAGGGTACATCTAGTCAAGTATCAGCCCCTGTGCAGTTCTGGCTCTTTTAACTCCAAAAAAGGATGGCAGTTGGCGCATGAGTGTAGATAGCAGGGCCATCAACAAAATTACAGTTAAGTACCGGTTTTCGATTCCTCAGATTACTAACCTTTTGGATTAATTAGGTGGTGCCTACATATTTTCCAAGTTAGATCTTCAAAGTGGCTACTACCAGATAAGGATTAAACTGAGAGATGAATGGAAGACCGCTTTCAAAACAAATGAAGGATTGTTTGAATGGTTGGTCATGCCGTTTGGCCTCTCCAATGCTCCTAATACATTCATGAGACTAATGAATCAGGTTTTTCTTCCCTTTCTCAATAAATTTGTTGTCCTTTATTTTGATTACATCCTTATTTTTAGCAAAACCTCAGAGGACCATATGATACACCTACACCCTGTCTTTCAAACTTTATAAAAAAATCAAATCAACTATCTTTAAACCAAAAAAAGTGTCTTTTTAGCCCACCTCTTCATTTTCTTTGATTGGGGAAGCCAAGATAATTGCATTAGCCAAATACGCAACAGGGTGTAGCTTGACAACTTCTTTGATTTCATTGCGAAGAACAAGAATGAATCTTGATACTAACTGTTGCTCGGTCTCGGGTAGATTGTTGCAGGCACCTAAGCACTGGAACTCTTCAGTGTAATAAAAAATTGATCGATCTCCCTGTTTGCAAAGTTGGTACTGACTGTATAGAATCTGCTGATAGTTCATAGGAAGGAATCTTTTCTTCCTCAGCCTTAGTAACTTGGGCCAATTTCTGATAGGTGGTTTCCCATAGTACTGGCGATTTGTTTCTAGTTGATCCCACCATGTTGAGGCTTCTTTTTTAAATTTAAAAGCCACCAACTTTACCTTCTTTTCCTCTGGAATAATAGCATACTCAAAAAAAAAATTTCGACCTTCTTACCCAATCAAGGAAAGCTTCTACTTCCATTCAATCATCGAAACTTGGAAGATCTACCTTCATCTTGTATTCTGGTGCTTCTTGCTGATATTGATATCTCCCCTATCTATTCTCCCTTCTTTCTTGAATTCCGAAGTAGCCGTGATCATCTTCATCGGAAGAATCCGAATCTTGCACCATATATCTGGCTAATTGGTTTTCTTGCTGATGAAATCTTGGTTGGAAAACGTTTCTTCTTGCTGGCAAATTTTGAGTTCCAAGCAGATTGTGTCCAATTACCTTTGCTTTCATTTGATTCTTGAAGGGTTTTTGAAATCTTTCTTGAATTCTTGCGCCCATTTGATCATCAATCGGTCCTTCTTCTTGAAGGTTTTCATTGATTTGAAGAATTCCTTTGTTTTTCTGATTTCCTTGCACTTGCTGACCCTCATTCTCTTTTGGCTTTTAAATATCCAGATGTTGAAGAATTTGAGCTAACATAACTTGCAAGTCACCCACTCCATTTTTCATCTCGATCATGGCAGTTCCTACTGAAGTTTGTTCTCCCAAGCTTTAATTGAAGAGGAAGCTTCTGCGTTTATGAGTTTGTTAATCCAATTTTAAGAATGCGCATTTGGATCGAATGCGCTGATACCAATTGATGCAAATGCTCCAAGAAGAATAAGAATAAATCTTGTATTGATATTCAAAATGAAGGGGTGGAAGCCTATTTATACATAAGTTTGGAAACTGAAAATAATTGTAAAATAATACTTAAAATATCCCTAAAGAAAGACTTACAAGGAGCTTCATTATAGATGGTTAAATACGTTTCTTAAATGTCTCTCTTTAGGTTGTTGTTGGATCCTTCCCCTATTAGTGAGTCGGTCTTTGATGTGGTTTGGAGGATTAAGATTCCAAAGAAAATCAAGTTATTTACCTCGCAAGTTTTTCTAGGTCAGGTGAATACTTTAGATAGGCTTGCTAGGAAGATGACTTTGTTAATGGGTCCTTTCTGCTGTATTCTTTGTCGGAAGGTGGAGGAAGACCTGGATCACCTCCGTTGGGCTATCAGTATGCGAGGTCTATGTGGAATTGCTTCCTGTAGGAGTTTGATGTTGAAATGGCTAGCCAGAGGGATATTTGTGCGATAATCGGGGAATTCCTCCCCCATCTGCCTTTCAGATAGAAAGATCGTTTTTTGTCTCCAAGAAACCAACAAGTCTTTATTATGGATAAAAAGTTGATCTTATAAAAGGGGACTTTCTCTATTTATAGAGAATTACAGGAGTTCTAAAAGGATAACTAAAAGAGGTTAAAACTAATTTGGTAATTAAAATGTAACTAGTCGAGGATTAACCAAGATTAACTTACAAGGAGGTCTATTGGGAAGAAAAGTCCCCTTGTAACTAATATCCCTAAAGAATGACTTACAAGGAGCTTTGTTATAGATGGTTAAACACGTTTCTTAAATAAAAGCTTACCTCATGTGCTTAAGAAAAAAAGCTTACTACATAAGAAATTTGACATGATTAGCTGACCCCCCACCCCACATTCGTTTTAAATTCAAAATTCAAAAAGTCTTCTTGAACATGTGCTCATGGGTATCTTAGCCCTTTTGTCTCTTGTTAGAAAAATATGCGTCTCTAGATGCTCTGCTATTTTCAGTTTCTGCTTTGTTGGCAGCAAATGGGACATCAATGTTTACCCTTGTGTTGATGCATGTTGGGTTTTTTAAGTAATGTTTTTATAATTATAATTTAAATAAATAGGATTTGATTTTGAATTGTAACAGTTATCTTTTAGTTCTGCTTTATTGGAGACTTTTTCCAATCCCACATTTATTGCAATTTTCTTTTCCTGGGTTTAATAAATTTTGCTTTGATTTCCATGGTTTAGGGTTGAACATATAGCTCCCTTTTTCTCCTAATTTACCTTCGAGTATGATGACAATTTATCTGTGATAAAAATAATGTTGGAAATTTCATGGATATTTCAGAACTATACTACTGGGAGTCCTACTTCGGTGCTGTATATCAAGAACTTAGCCAAAGATGTATTGAATGATGACCTCTACTATATTTTTGGTCAGTTTCCTCGCTCCAATTCAGCCATCATGCATATATGGAAATTCATTTCCTGTTGTGTTGATTATTTGATAAGCTTTTTTACTTTTAGTTTATTATTTGTTGTTATTTATTATTTTCATGTGCTTTGTTATTAGGACGGACCTAGGGCAATTTGTATTGCTTTAATTTTTAAAATAATTTCATATTTTTTTATTTTAATACAGCTTTTAATTTTTTTACTTTTTCTTAGGGGCAATGCCCCCATCTAAAGGAGTTAGATAAAACCTCTCCAATAGGATATGAGAGAATTTAATGTGGCAAAATTAAGGGTAACTTCATATTAATCTTGGTTAAGCATGATTAATTCCTTCTTAGTTTTCGATGTATATTTTATATGTAGTTTGATCTGTAGTTTGTTGTCAATAGAGAACTACCATTGCCCTTATGTATGCAATGACTTTTCATTTGTAATAAGGATTAAATTGATTTTGAAGAGAATTTTCCTTTTATCTTCTTAGGCTTCACCAAAACTATAGCTGTAATGCTGAAAGGGGGAAATATTTACACTAGGAAACAGCCAAAAGAACTATAGTATTAGAAAGATGGTAGAAAGAAAGATAATCTTGAAAGATTCGATGATTCCTTTACTGCCAGACACTCCGGCTGAAAGCTCTTATGTAGTTCTTCCAAAGAAGCTTTTTCTCCCTTTTAAAAGGAAGAGATAGGAGAAGGGAAGGATCATTTTGAAGAGCAAAAGTCCACCCAGACGAATAATATCTCAAAAGGACTGGAAATAGAAGACAAAAATAATCTCTCTACGACTTCTCTCCACCCTGCTTCTCTTCCCCACGCCCATCCCCATTGGAGATCACCATGCCCAATGCCCCTAACCCCTTCGATCAACAACCTCATTCATGGCTTCCCCTCCCATCTCCTCCATCGAACAACATCCATATCTCCCTCATCAACAACTATCGAGAGAAAGAACTTTTCAATTGAAACAGATAACACATATAGAGGTTCTCGTATCACAATCACAGAAATGACCTGTGATTCAGCATTCTTCATTTCTATCCATTGGTCTTCCCTACAATGGCTTTGTTCTACTTTCTTCGCCATCCTTACTTCATCTATGAACCAAAAATTGTTTAGGGTTTTTTTTTTCTTCGTCACTGTATAGGTTTTCTTGTTGAAAGGTTGGATGAGGGCGGAATGGCCATGATACATTTTCTGTGCCTGCGGAGAGTTCAAGGGCTGGGTTTTCTTGTTGATATAGGTTTTCTGAACTGCTAAGCTTTAGAATAGATATGAAAGAATCCTAGGTAGGGTTTCAAATATTTAGGGGAAGGTTGTAAGATGGGTAGAGGTTTGGGATGTACCCGTGTGAAATACCTTTGACATATTTATGGATGAACTCCCCCTTTGACCTTTTTTTCTGTAGCCCCTTGACACCGATTAACCCCCTCACTAAACAAGTTGGCTCTTTTGCCAACATCTAAAACAGACAACTTCTTGTGATCCAGATCTCCCATTCATCCTGCGGGGCCTTTCCTCCTAATTGTGTGTCATGGATTTATATTGTTCCCATTCCCATTTAGAGGAAAAGGAGCAGGAATACCAAAAAAAAAAAAAATAAAAAAATAAAAAAAAAAATAAAAAAAAATAAAAAACTAGCCTGATGCATTAGAGAACTGATGAGGCTTGGGCCATCAAGTATGATAGGTAAGGTGGCCTTGGGATAAATCATACTAGATAAAGAAACAATGCCTGGTTCAGAGTCATGTTAGATATCCAAGGTGTTCAAGTTTGGGCTTAGTTGAAGTATTTTGTTAAGTTTGATACTGGGTAGCAGTACTTTGGATGATCTGTTGACTTTATTGTCTTGGTTTCTCTTATTCTTTCTAGGGTGCTAGAGATTTTTGGGGCTTTGTTTTCTTTGCCGGATCTTTTGTTTGTGGGAGAAGTTTTGTTGTTCTAGTATTTTCGTTTTTTGTTTGAATGGTGTTTGGTTGTTGCTCATTCGTTTGGTTGTTCATTGAGGATTGTTTCTTATGAGTTTGTTTTCTTTTGCTCTCTGTTTTTGGTAGCTTTTCACAGTAGTCAGTATCAATAAAAGGTTTGTTTCTTGTTAAAAACAAAATTTCCTAGTTGGACACTTCATCTTCCCATTGGAAGCAAGGGATGGCAAGGAATGTTGTGTGATAAATACTTGCCCACGGAGCTTGATTGAAATATTATATCTCCCAGCTGGAAGAATCCAGATATCCCAATTGAGAAGTGGCATCTCTTGTCAATTATGTGTTAGTATATGTGTTTCCATATTTCACTGTTGCACCAACATTAGCTGTGGAATACGTCCCAGCAACCAACAGTAGTTCCTCTCACACCTCTGCTCACACTGGTAGAAATCTTTTCGTGCAACCATTCTTCCAAACCTTCTCAATGTTTTTTTCTTTGATAGCTGTGATTGTTCGGGCTAGGTTACATGCACCTCAACAAATTTCATGGGACAACCAGCCAGACCCTACAACATTTGAGTGGCAAGGAAACTTGTTAATTCCTAGGTAGGTGGCCACCATGGATTGAACCCATGACCTCTTAACCATTTATTGATATTATGTCTTCTTTTTTACCACTAGGCCAACCCATGATGGTTTTTGCACTGATACATTGTTCCGTGACCATTTACTAGCGCGATACTCATTTTGTCTCGTGATGCAACTCATCATAGTTTCTGTTTGATCCTTTATTACATCCATCCTTGCATTCATTCTTTTTGTGGCCATATCTTGTATGGTTTTGATACCGGTTGAAACTCTTTTGCAAAAAGGAACCATCATAGGTTGACCTAGTAATAAATAAAGGGCATGAGTTTGATAAAGGGCTAAGAGGGAATGTGATGTAGGGGCATTTTGGGTATTTTCCAATTAATTAAGCTGAATTGTATTTTTGTCTTTTATTTTTATTATTCAGTTTTTAAATTCTTATTTTTAATTGTAGATTAGGTGCGAACAAACCCTTCCTCTTTGTGTCTATTTAAAGAGGGTTGAGTGTCGTTTATTCATTCATAAAAGTGCTTGATTGTGACCAAGAACACCTCCAAGGACCCCGACCCTCCTTCCTTACATGCAACAGCATCATCTTCTTCTCCTCAGTCCCATCAACCGACGCTTGTCTCATGTAGAAGCTTCTCTTGGCGAGCTTAACACAAGGCTATTGGAAAGACAATCCAACATGGTGGGATTTATGGCTAGTATGGCAGGCATACAGAAAGCAGTGGAACTCCTTACTCAAGAAAGAGGTAAAACTAATACAGCCAACATTCAAGAACAAAACCTTAACTCTCAATGCGTCCATGAACATATTACCAATCAACGCAGAGAACAAGAATGAGCAGCTGTACCTCAAATGCTGCAAGAAAGAGGAAATCCTAGACATCCAAGATTCCAAGAAACACAAAGAACCCAAGAGAGAGGCAGAACCACCTCGAACCTCAAGTTGGTTTCGATCGAAGAGGCCAATTTGGGTTGAATTTACAAAATCCTGTACAAGGCCACCGATTCTTGCATCAAAGACCTCTTCCCCACAATGTAGTTGGCTTGGATTCAGACTCTTTGGATCCTCGGCTCAAATATAATCTTTGTTCGAGGTACTCTATGCCCTGAAAACCAAGTTGTGCAACACAACAAAGAAGAGGATGGGGAATCAATTGTCAGTGCAAGCATGAATTGGAATGCTCAGGCTCAGAGGAGGACTTAGAAAAGGAGGCCCCTCTAATAGAAAATGGCAGTGTTTTGAAGAACTTGCTCCAATCAAAGGAAGATCAAGCTCTTGATAATGTGGAAGCTGCAACTCTTAAATCAATTGAGAAGAAAGTCATTCCTCTAAACCTGATACCAATAATTGAAGATTGTGATCTGATTCTTGGGTGAAAACTCACTGTCAGCTTAGTCCTAATGAAGATCTTGGCATGGAACACCAAAGGCTTGGGCGATAAGTGAAAAAAAGAGTAGCAATTAAACTAAGCTTAAAAAGATTGAATTCGGATTTAGCCTTAATCCAAGAAACAAAGAAGGATAACTTCGACATGGAGAACTGTGGAGCTCCAAAGATATTGGATGGACGTTTGTGGAGGCAATTGGAAGGTTTGGAGGAATTTTAACCATGTGGGATGAAAGCAAGATATTGGTGATCGAAGTGTTAGAAGGAGGGTACTCACTATCAGTCAAATGTCTGACTATTAACAAGAAAAGCTGCTGGATAACTAATGTCTATGGGCCTAATGATTACCGTGAGAGGAAGCATTTATGGGCAGAATTGAATTCTTTGGTGGCCTACTGTGAAGAGCCTTGGTGTATTGGAGGAGATTTTAATATCACAAGGAGGTTTCAAGAGAGATTTCCACAAGGGAGAGCTACAAGGGGAATGAGAAAGTTTAATAAATTCATAGCTTTGTCACAACTGATGGAGATTCCACTTAGCAAGGTAGATTCACTTGGTCTCGAGAAGGAAATGCTTCAGCTCGGTCCCTCTTGGACAGATTATTAATATCTAGCAACTGGGATGATACCTTTGCAAACACAAGAGTTACAAGACAGGTGCACACTATCTTCGATCACTTCCCTATTTTGCTTGAAGCCTGATCTTTTGAATGGGGTTCCCCCATTTAGATTTTGCAACAGCTGGTTAGACTTGAAGGAATGTTGCCTTGTAATGGAAAATTCACTTTATCTGGAAGCGCAACAAGGGTGGGCTGGCTTTGTTATTTTTGATAAACTTAGAAGAGTAAAGGCTGCTTTGAAGACTTGGAAGTGTAACTTTGAAAAGGAAAAAAAGAGAATAGAAGAAGGGCTCGTAAATGAAATTGAAACCCACGAATTGGAGGCTGAAAATTCAAAGGAAGGCACAGCGAGAGATATGTTTATCACTTAAGGCAGAACTATTGTCACTTTACCGCATTGACGAAAGAAACCTAATGCAAGCTGAATTGGTTAAGCTTGGGGTGACAAAAATTCAAAATTTTTCCACTAGTTCTTGTCAGCCAAGAAAAGAAGAAACTTGATTTCTGAATTGATCAATGAGCAAGGGGCAGTAACAACTTCCTACTAGGAAATAGAAGAGCTAATCTTGAGGTTCTACAACAATTTCCAATGGCTGGTAGTTAATAAAGATCATAACAGGATGCTGGTTTCTTGATTCTCCTTGGAGGAAATTAGAAATGCACTAAAGATGTTAGGGAAAAAACAAAGCTTCGGGCCCAGATGGTTTCAAGTAGAATTTTTCCTGAAATTTTGGAAAAATTAAGTCTCTTTATTGAATTTGTTTGAAGAATTTACGAAAATGGAAAATTAAATTCATGTGTAAGGGAGAACTACATTTGCTTGATCTAGAAAAAGGAAGACGCCACCCTAGTGAAGGATTTCAGACCAATTAGCCTTACCACTTCAGTTTGTAAGATTGTGGCCAAGGTGCTAGCGGAAAGACTGAAGAAAATAATGCCAAGAATAATCGCCCTAGGTGGATATTGTGGGAAGACAAATTCTTGACCCTGTCCTCATAGCCAATGAAGTGGTTGAGGAGTATAAAATCAAAAAGAAGAAAGGATGGATGAAAAAGCTTCTGACCATGTAGATTGGGCATTCCTTGAGAAAGTCTTAGTTGGAAAGAATTTTGACCCTAGGTGGATATTGTGGATTATGGGTTGTGTGTCTAACCCTTGAGAAAGTCTTAGTTGGAAAGAATTCTCAATTTTCATCAATGGAAGACCAAGGGGAAGAATACAAGCTTCTAGAGGCATTAGGCAAGGGGATCCCCTCTCACCCTTCCTCTTTCTACTTGTTAGTGAGGTGCTTAGCGGATTATTATCAAGGCTGCATGATAAGGTCAAATATGAGGGATTCATCGTTGGAAAGGATGCTGTCCATGTTTCTTTGCTCCAATTTGTGGATGACACTTTGATATGTTGCAAACATGATGTCGCTATGCTAGAAAATTTAAGAAAGACCATAGAACTTTTCGAGTGGTGCTCGGGACAAAAAGTTAATTGGGAGAAATCAACACTTTGTGGGATAAACATCGAAGATAGCAAGCTGATTTCAGTGGTAGCTAAGCTCAATTGTAAGGTTGAATACCTCCCTATCATGTACCTCGATCTACCTTTAGGAGGACACCCTAAAAAAGAAGCTTTTTGGCATCCGATCATTGGAAAGATTCAAGACAAATTAGATAAGTGGAAAATATATAATTTGTCAAGAGGTGGGTGTGTTACTCTTTGCAAATCAGTCCTTTCACATCTCCCCACTTACACTATATGTCCATCTGCTTAATGCAAGGTGATTTCAACCATTGAACGTGTGATGAGGAACTTCTTTTGGGAAGGACACAAAGGGGGTAAGTTGAATCACTTAGTTAAATGGGACTTGTTTACTAGAAACCAATCCGATGGTGGCCTTGGAATCAGCGGCTTGAAATCAAAGAATATTGCTCTCTTAGCCAAAGAAGACTCCCTTTGGTGTCAAGTAGTGCGCTTCATTCATGGAAGAAGTCCGCTCGGATGGCACACAAGGAGAGAGGTCAAGAATAGTCTTCGTAGCCCGTGGAATAGCATCTCAAGGTCTTGGCTAAAAGTTTGAGGCTTTGGCCGTCTACAAAGTTGGTAATGGAAGAAGAGTGGCTTTTTGGCTTGATCCTTGGGTGGATAAATCACCTCTTAATATCTTCTTTCTGAGACTTTTAAGAATTTCTTTGAACAAAAGTGGTGCTATCGCCTACTTTTGGGACACCTCTCTTTCATCTTGGTCCGTTAATTTTAGAAGAGCACTTAAGGAAGAGGAGGTTGTTGACTTTCAATCCCTCCTTAGCCTTGTTTCAAAATTTCCCATTACAACAAAAAATGACAAACGAGTGTGGGATTTAGAATCCAATGGAATTTTCTTAGTAAACTCCCTTGTCAAGCACCTCTCCAAGGCCTCCCCCATTGGTAAACTACTAGAAGGAACCCTTTGGAAATCCAAAGGCCCTCAGAGAGTTAATATCATCGTTTGGATAATGCTAAAGGGGCGATTAAACTATGCTTCGGTGCTACAATCAAAGTTGCCTTCTCATTGCCTATCACCTCATGCTTGTCACATCTACTTGAAAGATCATGAGGACCTACAACACCTTTTCTTCAGCTGTAAATTTGCCAAAAACTGCTGGTTTAGGCTGTTTGACAGCTTTAATTTAAGCTGGGTGTTCAGAGTATTCAAGGAAAATATGTTGCAGCTGCTGCTCGGACCTAAGCTGAAATTAAAGCCAAAATTGCTGTGGACTAATGCTGTCAAAGCTATTTTATCAGAAATCTGGTTTGAAAGGAATCAGAGGGCGTTTCACAATAATGCTTCCTCTTGCGCGATCAGATTTGAGACGGCTCATTTAAATGCTTCCTCATGGTGTTCTCTCTCCAAGGGTGTTTAAAGACGTATCCATCCAAGACATCAACATTAATTGGAAGGTTTTTATATTCCCCCCTTCCTAGTTTTTCTAAGATAGGTGTCTAGTTCGTTATTTTGGTGTTTTTGGAGTAGTTAGGTTGTTTGTAGCATGATCTTATTCACTTTTGTACTTGTAATCAAGGTTGTTAAATCATAAAACCGAATGTATATCAAAATGTTAATTTTAGTGTATCGTGTATCGTAACGTGTCGTGTATCGGAAGTTTTAATTGGGTAAATAACATATACATGTAAATATAAATATAAAAGCTTAAAATAGAGATAAATAGTCATGCATTAAATTAATTCAAAACAAAAACATAAACTAACTTCAAAGTTCAAAATAGCTAAAAAAAGAAAACATGCAACTAACTTAACATCCAACTTCAAAATATGTATGAAAAAACATTAAAAATAATAGAATTTCTTTGTTTCCAAGTTTCAATGTTTAACTATTGACTTCAATATCTTCAAGATCATTATCAGCTAATAGTTCGCCATTGTCTTGTTCATGTGTCTCTTCCCTATTGCTTTCTTCTTCTATTAGTTCAATAACTTCCTTCCCTTTGTCAGTCTTATAGTTTTTTGATAGATGTTTAAGTCCTAAATCAAAAAATAAATATGAAAGGTTACTTAGAAAAAAAAAAAAATAACTAACATATAAAAAAAAATGATAGATCATTACCCCTCTTTCTCTTTTTGTTGCTCTCAATTTCTTGCATGTCAAAACAATCATTTACATTCATCCAAGAGCTATCATTTGGTAATGTCGGATCTTCTTTCTTTGTAATCCATTCATCATCAGATTCTAAATCCGATAGACAAATAGGATCATATGATTCTCCTTCTAAGTTCTCTTCTTATGTCTTGCTTCCAACCGAAGATTATATTTCACCAACACAAGTGCACTCAACCTTTGTTGTTCCAAACAATTTCTTCTCTTTGAATGCACCTAAAAACAATTCATGTTAATATTTTAATATTAATCCTAGTTTTTATGAAATTTGGTTTTCATAATACCATACCTGATCAAATGTACTCCAATTTCTCTCACATCTTGTTGCACTACAAGTCAAGGTAAGAATTCTAATAGCAAGATCTTGGAGCTCTTTGCAGCCTCCTCCAAAGGATTCCCACCATAATGCTTGAATACAATTCACATACATTTTAGTCATAACATCTTAGAAAATTGCAACTCTAACTAGTGTCTTTCTTCTAGAATTTAAGCTAGTACCTGGTTGTTTTTTGTGGCGCATTTGTACTGCCATATCGATTCCAAACATATTCTCAGCATTATGGAATAGGTCTATCTGCTTATCAATTTTTTCTCTGACCATTGGTGAAGGGCACATTCTTTGAATAGTTTGATAAAGTCCCATCTTGACCTCATAGTCGGCATTCAATTTAGGAGCATAATGGAACCTATGTTTTAATTTTTATAACCTAATTAGTAATTAGTAAAAAAAGAAAATACTAATAAACTGAAATTACTAATTCTAAAAATTAATACATAGGTTCTAACCTTGGGTTAAGATAATATGTTGTGGCATGTAGGGGCCTGTGAAGTTGCATAGCCCATCTCTTATCAATTATATCCCAAATAGGTTTATAATGTTTCTGAATATTGTTGAAAATTTTCTCAATCTGTTCCTTTGCCCTGTCCATTGCTTCATATATATAGCCCATTGCAGGTCTCACATCACCATCAACGTGTCTTAAAACTTTTATCAAGGGGATAACACATTTTAAACAATATTTTATTGATGACCAAAAGTTTGAATCAGCAAGAATAATTTTCATTACGTCAACCCCGTCAGACTTCTTTGAGTAGTTGCTTCTTTGCCAATCTTCAGATCCAAACATTGCCCTCAAACCTATTTTATTTTCTGTTATGCTTCGCAAAGTCAAGTAAGAAGTTGCAAATCGTGTCACACCAGCCCTTATCAATTCTCTCCCTTTAGTATACTTCCTCATAAGGTTTAAAACCCAAGAGTATTGATAAATATAAATAGTAACTTTCCTTGCCTTAGACACTATATCATTATGTAAGGGCTATCACCAATATCAGATAAAATCAAGTCTAGGCAATGTGCGGCACAAGGAGACCAAAATAGTTGTTTACGTTTAGCCATTAACATTTCTCCAGCTTTGACATAAGCAGATGCACTATCTGTTACCACTTGAACTACATTATTTTCCCCAATCTCTTCTATCACTGAATCTAGTAGTTCAAACAAATTATCAGCACTTCTGTATATTCCAGATGCTTTTCAGAAATATAGTGCCTTTGGGACTATTTACTAAAAAGTTTGTTATTGATCGATTCTTCCCATCCGTCCAACCACGGACATCAATGTGCATCCTGATGATTTCCATTTGGCTGTATACTTTTCTAACAACTCCATTTTGATTCTTTCCACCTCTTTCTTCAAAAAAGTACATTAACCTCATGATATGAGGGAGGTTTCAATCCAGGGCCAAATGAAGCAACAGAATCTAACATAATTTTGAAGTAGGGATCTTTGACCAAGTTAAAAGAAAGTGCATTTGCATACCAAAATCTAAAAATATCTCCACAAACACTCTCTTTGTCACCTTTCTTCCACATAGTATTAAGGGTAGCAGAACTATATTTTCTTATGCTTTTACCAAAAGCACTCATCATAAGTTTCTTTTTACTATCGGTATCATGACATCATGAAAATCATCTCCATCTTTATTTCTTTTATTTTCATTCAACAAACTCTTGAAAATATCTCGAACTCATCTGAAACTTTAGTACATGGAGCAACATCTTTTTTAGTATGTGCTAGATGATGCTTAATCCTATAAATACCTCCCCAATAAGAATTTCCACAGTAGTTACATCTTAGTTTGAGTCTTCTATTTTTGCCTTCTTCTACTTTAACACAATGGTCCCATGTTGAATCCCCACTTTTCTCTTTACTCCCTTCTACCATTTTTTTCCCAATAAAATAGTAACTAATAGTCTAATAACTAATAAGGAAGTAAAAGGAAAAAAAAAATTCTAACCTTTAGTTGATCTTCTTCAAATATTGCCACGGAGAAATGAGAATAGCTCTCCAATTTACTATTTTTTTGAATGAAATGTGTTGTTGTGTTTTGAATAGAGGTGAAAAGCTTATTTAAAAAGACAATAATAACAACAATAAAATAATATTATTTAAAAATAACAATAATAAAATAATAAAAGAAAAGGAAAACAAAAAAAACCTTTTTTTTAAAAAAAGATGGGAAAATGAAGGAATTAAATGTTATCTTGTTTTTTTAAAAAAGAATTGGGAAACGAAAAGACTTAAAGTTTTAAATAAATGTTTCTCTTTTTTTAAAAAAAATGAAAGGACATTTTAACTGAAAACACACGCCGGCCACAACCAAACTTTTTTTTTTTTTTTCTTTTTGTATCTTTCTTCTAGGCTAGCATTATTCTTTTCTTCATCTTCTTCTTCTCCAATCTCCATTAATCCGACTGCAAGTGCGAACTGCTTCATCTTCTTCTTATCCAATCTCCACTGATCTAGCTGTAAAATGCGAACTGTGAAGCCCATCTTCGCCCGTCTGGCGTGCTGCAACTCAAAGAGAAGGAGAATGGATACTTCCAGTGACGATTCGACTTTGTTCTCCTGTCTGGCGTGCTGCAACTCGATAGAGAACCCATCTTCTCCGGCGATCGGGCGATGAAACTCGAAAATGGTATTTTTTTTTCCTTCTAAAAACTTAATCGGAACATTACGATTAGCACGATCCGCACGATTCATAACCTTGAATCGTGAGATACATGAGATTTTCGATTCAAAATCACGAGTTGAATCGTTTTGGCTTTGTTTTGATACGTATCGCGAATCGTGCGTTTTTAACAACCTTGCTTGTAATTTGCTTATTTGGTCTAGAATATTTTGATCTTATATATTGAAAATGATGGGGATGCTAAAGGGGTGTTAACCTAGTTCCGATGCCTGGGTGCACTCGCTGATCCTTAGGTCTCTATATCTTTTTATTATCTTAATGAAGAGGTCGCTTCCTTTTTCAAAAAAAGATTCACTCTTTTGGATGTGGAAGATTTTGAAAACCCAATGCAAGATTATTACCATAGATTTCCTCCTTACCAAGAGGAGCACCTCTAGTACCTAGAAAATTTCCAAGATCAACCATATCACAGGAGAATGACCAAGAATATTTTCAAGATTGGCACCCACCTTGTGCAGCTTATAGTGGGCAGCCATGGAGACGAGTCTGAGTATAAAATAAAAGTTAACCTCCCTTGTTATAGTGGGTGTATGAACATTGAAGTATGTCAAGATTGGGTCAAGAACGTAGAAAGCTTCTTTAATTATAGGGAACCCCGGAAGATAGAAAAGTCAAACTTGTTTGCTTTAAAGTAATAAGCACGGATACGGATACGAGACACGAATACAATATGACACGAACACGACGATGTCATTTTTTAAAATCTAGGACAATGACATGACAAGGACACATTTATTAAAATATACATTTTTAAAAATATATATCATTTTTATACCATTAAAAAATTTAAAGTTAATGAGTTGATACATTTATATGCTTAAAAAATTAGTTTGATGTATTTTACACTCAAATTTTTTTTATTATTGTCATATATATGTCCTTTTAGTCTACTTACTAGCGCTCTCTAACGCAAGTGTTCGATATGTCTCCAACAAATGTCCAAGTGTCCGACACGTGTCGGACACAGATACATTAGCCAAACTAAAGTGTCTATGCTTCTTAGTCTTTAAGCTAAAAAGTGGTGCTTCCGCTTGGTGGGACCAAGTCCAACACAACTGAAAAGTATTTGGTAAACGACCAATTGTTAGTTGGCCGAAAATGCTGAAGTTAATGAGGAAAAGATTCCTTCCAACCAACTATGAAAGGCTGTTGTACAATCAATTTCAACAATGCAAGCATACACGAAATGTTGAAGAGTACAATGAAGAATTCCATCGACTCAGTGCTCGAAACAACCTAGTGGAAAGTGAGAACCAATATATTGCTCGCTATATTGATGGGTTGTGATATGATATTAAGGAAAAGCTTCACCTTCAACCCCTAATGAGCATTGCAGAAGTTGTGTCCTTTGCTACAACAGTAGGCAGAAGCGCTGGCCGGGTAGGAAGAATAATTGGGAAAAGGCTTCATCACAACCTAGGAAAATTGCAAAAGACCCTACCAAGAATGCGATTCAAGTCAGCTCTGCATAGGGCAAAGAGACAACAACAAATAAATCTGAAGCAGACTAGAAAGGGAGTAACAATACTTCTTTTAAAAATAACCCTTATAATCAGCCTAACCTTAGGAAATGTTTTAGGTGTGGCAAAATCAGGCATCTTTCCAACGTTGCCCTCAAAGAAAAAAATCTTAGAATTCAAAAAGGCAAGAAGACCAAGAAGAAATCAATCTTGATGAGCAAAAGTGAAGATGTTGCTTATGCAGAGCCCGATAATGGTGATGCACTGTCTTGTGTCATCCAAAAAATTCTTTTAGCACCAACTACGGTGACAATGCCCCAGCGGCATGAGCTATTTCGAACAAGGTTTACGATAAAAGGAAAGATTTGCAACATTATCATGGATAGTGGAAGCAATGAGAATATGGTTTCGAAGAAGCTTGTTTAAGCTCTTAAACTAAAGACTGATGGTCATCCTAGCCCTTAGAAGGTGGGCTGGATAAAAAAAGAGGGGCAGAAGTCTTAGTAAATGAAACATGTATGGCCCCTCTGTCAATAGGCAATCACTACAAGGCATATATAAGTCCATGTGAGAACCATACACTGAAATATGGAACCAGTCCTGGCCAATTGCTAAAACTATGCATCAGGCTTTTTGTTAGATCAATATGTTGAGATGTAAATTTGAAGATTTCGAGAATGTGCAACTAATTATAAGTTGAATTCCACCTCATTGTAACCCTAGGATAGGCCTATATATCTGGTCTTTTGATGCATGGAAAAACATTACTTTCACATTCTTCTAATGCTTTTCTTTGCCAACTCTTAAAATTTTAAAATTTTAAAATTTTCTATGGTGCTTATCTTTTCCAATGAGAGGTTTAGTTTTAACTAGGTGGTTTTTATTTTGTAGGATCATTATTTGAAGGTATTGATGAAGCCAAGTCCGCTCTAACTGTGAAGCTAATGCAGGTTAGGTATTGTTGTCAGATGTGTAGTTTGACAATAAGTTTTTGAATAATCGAGAATTATGTAATTTCTTCTCTGATTACTTGAAGAGCATTACAGAAACATCTGCAAAGTTGCCTCATTGTCTGAACTGGGAAGCTAATAATATGTATAATATGCTTTATTATTAGGCATCAGCTTTTACAAAGATAAGGCAAGCATAAATAAGGGAAGTCTTAGACAAAGCCTATTATGGAAAGATACGGCTGAAGATTGCCTAGGTTGATTCTCCACACTCGAGAAATGCTATACATCTGTGATTGAAAAAAATTAAAAAAAGATGATGTTTAGTAAAATTGTCCAACTCTTCTATACCATTAAATCTCACTTTGGTCATTATTTTTGAAGTGTGAGATGTACTAACGTACAATGACTTATATAGAAAGTAGTAAAATACACTTTCTTAGATTTTTTAATACTATAAAGCATGGAACGGTAAAAATACTCCCAAGGATTGGATTGCTTTCATTTTCTTTGTTTAATTTAGTTTGTATCCTTCAGTTGAACTTTCATTTTCAGTTTCTGCATTATTGTATTTTAAGCATTAGACTCTTTTCATTATATCAATCAAGGGTCTCGTTTCCTTTTCAAAGGAAAAAAGGAAAAAAAGAACTCCAAAGATTTGAAGTCAAACGTATTAGTCTTGGACTCTTGCTGACCAGGCTCAGGGACATTTCCCTAGAAGTTTGAGGAAGGGAGAAATTGTCCTGTTAGCATGATAGTGTTGCAAAGATCAGAGTGTAATGTATGACCCTTTATCAAAAGAACAAAACCTGTATCAAAATTTTTCTCCGAAAAAGGAAAAAAAAGAAGAAAGAATAAAAAAGGTATCTGAATGATGCAATGCAAGTTACAAACCATGGGAAAAAAATCAAATTAAATATGACCTAGGTTATCATTAGTTCAAAGAATACAACCTTGTGTAATATGATCTCTTATCAATGTGTGTTAGAAAGTTTCTTGAAGAAAGAGCAGAAAATAGTGATTGCTCATCTGAGTGTATAAAATTATGACCTGAGTTTCAAAATTTTTCTTATTCTGAATTTCAGGAGGGAAGAATGAGGGGCCAAGCATTTGTAACATTTCCATCGGTTGAGCTTGCACAACATGCTTTGGTATGTATTTGATTATTTGGATCCATGTATTTTCTTGATCACTATTCACTTGACATATTGATCATGAGTGCTCGTGGTTGTCTAGGAACAACCAGAATTCCCCTCCACTTTGTGGAATTATGGAGTAATTGTTCTTTTAGATGTTTGAGTTGTTGCAATATAAATGAAAGTTAGAGGAGATTTTGTTTAACTTAAGTTCTATGCAGCTGGCTCATAATTGGTACATTACTATGAAGTACAAACACTTCATTGTAAGGTGGTATCAGTGTGAGACACAAGTTTGTCCAGACACGCAGGGGACACATGTCTGAAATGCAAATTTACATGTCACTTTTTTTAATTTTTAAATTCAGACATGTTGAGACACAGTAAGGATACTTCTTGGAAATGACTGAAAAAAAGTTAAAGAAACAATTGTTTAAAGTATGGGCCTAAATTTAAAGGCCAATAACTTTCTTTTTAAACTTATAAAAAATTAAAAAAAAAACAAAAAAAAACCTAAACAGTAAAAACGAAATAAACATTAACTTTTCAACTCTCCTCCTACCCTTATCGTGCAATTCATGGAAACGTACAGGCACAATAGTTTTCTTAGTTTTCCTTCACACTAGAGCAGAAGATGTTGTCGACGTGTTTTACTCACCACCGCTGCTATTTCTGGTTCACTGCCGATGTGTTCTAGCCTCAATCTTCAAACTCTTCGATATCAATTATTAAGTTTTTCTTTTCTTTTTTCTTTTATGTCTACCATTATCTAGTTACTCATTGCCTACTAGATCTTCAAAGTTCAACATCTTTTATATGTTCTTTTTACCCACTGTCATACGTTGTCCCCTAATCTTCAAACTCTTTTACATGTATCATTTCATATATATATATATTAGCCACTGTCGTTTTCCAATCCTCAAAGTTCAAGCGTTTTTAGATTTTTCTTCTTAAATCTATCAAAGTTAGCCACGGTTGATCTTCAAAGTTGAAACTCTTTTATATATATATTTTCAAGTGTATAAAGTTGGCCATTGACATCTTCAGACCCTTTTAAGTTATATATGTTTTTTTCTTTGTTAAATGTTAGGATGACATCTACAAACTTTAGCCATTGTCAATCATCAAATAGTTCCATGGAATTTACCTCCTTTGAAGATGATTTAGAACTCCTATAGTGGATTAAGATTCAACAGCTAAGTGATGGGGTGAAAATTTTCATGGAAATGCAACTTTTATCATGTCACAAAGAAGAGCTCACATACAAGAGTCAAAGCTCACTTGTTGAAGATAAATGGTGAGGAGTTGGGCCCTGTCTAAACGTCACTCTTAAAGATACGACTGATATGCAAAAATTAGAAGATGGGCCAAACATTCGAATAGAAAGAAAGACTCCTAAAAATGTTACTTTGCCACCCTTAGCCATATCAATTAGCGGAGTTAATGTGAGTAAGTCTCGCACTGAGTAGCTTTGACTTAGAGAACAAGAAAAGAAAGGGCAGTACAAGTCAAAGTGCACTTGAAAGATCATTCAACAAGATATCACAAGACCAAGTGGATATGCTTAATGCTTGAATGTTTTATTGTGTGAGCTTTCCGTTTAATTCAACGAGGAATCCACATTTTATAGGTGCCTTTACTTGTGTCACAAACAATCTGTTGTTGGAATATATGTCTCCTGTATATAATAATGTTAAAGATAAGTCTTCTCCAAAGAGAAAAGACAAATATTGAGAGATTATCGGTACCTATTAAAGGTGAATGACGTAAAAGGAAGGAGTATTGTGAGCGATGGATGAAGTGACTCACAAAGGAGGCCGTTGATTAACTTCATGTCCATTTTTTATGGCAAACCAACGTCTCTAAAGTCAATGGATTGTTTTGGTGAGATCAAAGATAAGTAATTCATTGAAAATTTGATGGACAATTAATGAAGTTGGTCACAAGTGATAATAGATAATGCTCTTATTGACAAAGGTGCAGGACAAATTAACGAAGCATAATTTTTTACGATTATAAAAATTATAAATAAGTAAAAGAATTGCAAGTTAAAAATCACTTTTGGTTCTTGAACTTTCATGAGGTGGCAAAATTTAGTCCCTGATTGTAATGTGCTCTTATACTTTCAATTTTGTAAAAATTTAATTCATTTAACTTTCAAATGTGTAATAGTTTAATTATTCCCATGAAATTTTCTTTAAAATTAGAAGTCCATTTTTATATTAATAGTTTTTGACTTTGTACATAAACATGCTCCAAGGTTTAAAATATCAATGTCGACAAAATGTCAAGGTCCTAATTATATGGAAATGTTGATGGAACTTCGATAATTGTGCATGGTGTCCATACAATCATCGGAAGTTGTGCGTCAATAATTTGTCCTACACCTTTGCTTGCAATTAGGAGTATTATCAGTTATCACTTGAACAACATTCTCATATCTAACTTCAATAATCACTTCTTTCATCAAATTTGCAATGAAGTATTTATCTTTGATCTCACGAACAATCCACCATAAAAAATGGTCTTGAAGTTAATCAATGACCTCCTTTGTGAGTAACTCCATCCATCACTCCCAATACTCACTCTCTTATTATGCCATTCACCTTGCATAGGTACTAATAATCTTTCAATATTTGCCTTTTCTCTTTGGAAAAAGACTTGTCGTCAACATATTAAATCCAGGAAGCACATATCCTGTTCATATCCTGACAACAGATCATTTGCGGCATAAGTAAAGATACCTATAACATGTGGATTCCTCACTAAATGAAACGACAAGCCAACAAAATAGAACATTCGAGCAATAAGCGCATCCAATTGGTATTGTGATGTCTTGTAGAATGATTTTTCAAGTGCATTTGTACTACCCTTTCTTTTCTTGTTCTCTAAGTCAAAGCTACCAGTGGAAAAATCATCAAACTATTGATTGATATGACTTAGGGTGGTTTAGGTGGCAAAGAAACTATTTTAGAAGTTTTTCTTTCTATTCGAATGTTTGTTTCATCTTCAAAATTTAGCATATTAGCTATACCTTTAGGAGTGACTTTTAAAGACAGCCCAACTCCTTAACTATTTATCTTCAACAAGTGATAATTCTTGGTTATGGACTCTTTGTGACATGACAAAAGTTGCATTGCCATGAAAAATTTCCACCACCCATTATTTAACTTTTGAATCTTAGTCACATATTGCCGTAGGGGTTTTGAATCATCTTCAAAGGGGGAAGATTCTATGGAACCATTTGATGATTGGTAGTGGCTAGAGTTTGTAGATGTTATCCAAACAACGCAAAGAAAAAAAACAACATATATATATATATAATGCAGAAGGGATTAAGATGACAGTGACCAACTTTGATACACTCAAAAGGAAAAAAAAGAAAAATATATATATATATATATATATATATATATATATATATATATATATATATATAAGAGTTTTAGAGTTTCAACTTTGAAGATCGACAATGGTTAACTTGATAGATTTAAAGAAAAAAATCTAGAAAAAGTTTGAAGTTTGAGGATTGGTAAACAACACTGGCTAATATATATAAATGTAAGAGTTTGAAGATTGGAAAACGGCATACGACCATGGGGCAGTGGCTTAAAAATACATTTATAAAAGAGGTTGAATTTTGAAGATAGTCGGACGACGGTGGGTAACTTGGTAATTATAGACTTAGAAAAAAAACATACTTGATAGTTGATATCATAGAGTTTGAAGATTGAGGCTAGAACATATCGGTGGCGAACAAAGAACAGCAGCGTCGGCGACAACTTTTGCTCCAATGAGAAGGAGAATAGAGAAGACTACTGTTTCTTTTTTCCTTTTCTTTCTTTTATGTGAGTGTCTCGGCTAGTTTACGTGCTCCTCCACTAATCTCACGAGACAACTCGCCTAACCCTACAACATTTGGGTGTTAAGGAAACCCGTAAGATATTAAATCCAATAGGTGGTCACCATGGATTGAACTCATGATCTCTTAGCCCTTTATCAATGTCATGCCCTCTATTTACTACTAGGCCAACCCATAATGGTTATTTAGAAGACTACCGTGCTTGTTCGTTTTCGTGAACTGCATGACAAGAGTGAGAGGGGAGTTGACAAGGTTATGTTATATAATGTTTAGTTTTTACTTTTTAGGTTTTATGTTTTTCTTTTTCATCTTTGTTTAAAAAGATAGTTATTGGGCTTCAAATTTGGGCCCATACTTCAAACAATTGTTTCTTAAATCTTTCTCGTATCCTTACTGTGTCTCAACGTGTCTGAATGTTTTTTTTAGAAAAACAAAAATAGGACATGCAAATTGGCATTTTGGATGCATCCCCATGTGACAAAACAAATATGTGTCTGACATTGATACCACCATTAAAATGAAGTGTCCGTGCTTCGTAGTGAGAGGGCTGGTATGTGGTATCTCTCTCCCATTTTAGAACCTCAGGAAAATGCAAATGTCCCTATTCCCGTGCTCATTGAAGATGCATAAGATAAATAACTGTCAGACCCAACTTTTCCCATGTTACCTCTCCTAAGATATGTTTTTACAATTGGAGGCCTTACAAATTACTGTGTATACTTTTCTTCTCGATGCTCTTTTGTTTGAAATCCTATAAAATTGCTATTCCAAATATCTCAAGGTGAAGGCTGTAGTATTATTTGTTTTTGGTTTGTTTTATTCTGCCGGTAGATATAAGCTAAATATCTAATATGCCAAACTCATTGTTTCGACAGAATCTAGTAAATGGTTATGTGTTCAAAGGCAAGCCAATGATTATCCAGTTTGGACGCAATCCAGGAGCCGCCAAGGGGAGTTAA

mRNA sequence

ATGGAGGATGCTGCCACAAAAGATGCCATATCTAAGGACCATAGGAGTAAATATTATACTGAGGGTATGAGAGCTAGGCCTCTGCATTTTGGTGAGCCAATTGCTGCAAGGCTTGGTGTGGACTATCCATTCCCTCCTCATCTTGAGTATGCTTATCCTCCACCAGATGGTAATATCCTGACCAACATTGTTAATGCCCTCATAGCTGTCCCTCGCTTCTATACTCAGGTGCTGCATTTAATGAACAAGATGAACATTCCAGCTCCATTCCGATTGCCTCTACCCACTCCACCTCTCCCACCTTGCATTCCTGCACCACCATCACCATCATTAACGCTGCCACCTCTTCCTCCTCTTGCCACTCCAAGGCCTCAAGTAACAAATATGTCCAGTGATGAGTCAGAAATGGAGTCCTCAGAAGAGGATGTGGAAGGAAAATATAATTCAAGTGGGGTTACAAAAACTGCAAAATCAGGAAACAAGCGTTCTAGGTGTGAAACGATTGTAGGCCCTGGGATAGACAAAGATGTGGCTCATGAGGCTGTTGGGGTGAAACCTTCCACATTGGTTCCGAAAGAGATTCCTATGATAAAAAAGAAAAACCCAGTGATTCAGATTAGAATTGCCCCTAAAGTGATTCACGATGGAAAAGTCAATAACATTAATAATGATGGTGAAGAGAGAGAAAAGGAAGCAAATGATCTTAAGCCCTATGCAACACTCGAAGATTTAGAGAGAGGAAAATTGCCTCCAGAAGAAATTCTTTCACTTCCAATGTTCAAGAACTATACTACTGGGAGTCCTACTTCGGTGCTGTATATCAAGAACTTAGCCAAAGATGTATTGAATGATGACCTCTACTATATTTTTGGATCATTATTTGAAGGTATTGATGAAGCCAAGTCCGCTCTAACTGTGAAGCTAATGCAGGAGGGAAGAATGAGGGGCCAAGCATTTGTAACATTTCCATCGGTTGAGCTTGCACAACATGCTTTGAATCTAGTAAATGGTTATGTGTTCAAAGGCAAGCCAATGATTATCCAGTTTGGACGCAATCCAGGAGCCGCCAAGGGGAGTTAA

Coding sequence (CDS)

ATGGAGGATGCTGCCACAAAAGATGCCATATCTAAGGACCATAGGAGTAAATATTATACTGAGGGTATGAGAGCTAGGCCTCTGCATTTTGGTGAGCCAATTGCTGCAAGGCTTGGTGTGGACTATCCATTCCCTCCTCATCTTGAGTATGCTTATCCTCCACCAGATGGTAATATCCTGACCAACATTGTTAATGCCCTCATAGCTGTCCCTCGCTTCTATACTCAGGTGCTGCATTTAATGAACAAGATGAACATTCCAGCTCCATTCCGATTGCCTCTACCCACTCCACCTCTCCCACCTTGCATTCCTGCACCACCATCACCATCATTAACGCTGCCACCTCTTCCTCCTCTTGCCACTCCAAGGCCTCAAGTAACAAATATGTCCAGTGATGAGTCAGAAATGGAGTCCTCAGAAGAGGATGTGGAAGGAAAATATAATTCAAGTGGGGTTACAAAAACTGCAAAATCAGGAAACAAGCGTTCTAGGTGTGAAACGATTGTAGGCCCTGGGATAGACAAAGATGTGGCTCATGAGGCTGTTGGGGTGAAACCTTCCACATTGGTTCCGAAAGAGATTCCTATGATAAAAAAGAAAAACCCAGTGATTCAGATTAGAATTGCCCCTAAAGTGATTCACGATGGAAAAGTCAATAACATTAATAATGATGGTGAAGAGAGAGAAAAGGAAGCAAATGATCTTAAGCCCTATGCAACACTCGAAGATTTAGAGAGAGGAAAATTGCCTCCAGAAGAAATTCTTTCACTTCCAATGTTCAAGAACTATACTACTGGGAGTCCTACTTCGGTGCTGTATATCAAGAACTTAGCCAAAGATGTATTGAATGATGACCTCTACTATATTTTTGGATCATTATTTGAAGGTATTGATGAAGCCAAGTCCGCTCTAACTGTGAAGCTAATGCAGGAGGGAAGAATGAGGGGCCAAGCATTTGTAACATTTCCATCGGTTGAGCTTGCACAACATGCTTTGAATCTAGTAAATGGTTATGTGTTCAAAGGCAAGCCAATGATTATCCAGTTTGGACGCAATCCAGGAGCCGCCAAGGGGAGTTAA

Protein sequence

MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPPLPPCIPAPPSPSLTLPPLPPLATPRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHEAVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYATLEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
BLAST of Cla97C02G036070 vs. NCBI nr
Match: XP_004139973.1 (PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis sativus] >XP_011656911.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis sativus])

HSP 1 Score: 610.5 bits (1573), Expect = 3.6e-171
Identity = 334/359 (93.04%), Postives = 340/359 (94.71%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           ME AA KDAISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEGAAIKDAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTP  XXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPPLXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
             R QVTNM SDESEMESSEEDVEGKYN SGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 TTRSQVTNMPSDESEMESSEEDVEGKYNPSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV NINNDGEE EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVGNINNDGEEAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT G+PTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGTPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS 360
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ ALNLVNGYVFKGKPMIIQFGRNPG  KGS
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQRALNLVNGYVFKGKPMIIQFGRNPGGVKGS 359

BLAST of Cla97C02G036070 vs. NCBI nr
Match: XP_008448178.1 (PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 [Cucumis melo])

HSP 1 Score: 604.4 bits (1557), Expect = 2.6e-169
Identity = 336/359 (93.59%), Postives = 342/359 (95.26%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           MEDAA K AISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEDAAIKGAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPT XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
           X R QVTNM SDESEMESSEEDVEGKYNSSGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 XSRSQVTNMPSDESEMESSEEDVEGKYNSSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV+  NNDGE  EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVDKTNNDGEGAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT GSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS 360
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ ALNLVNGYVFKGKPMIIQFGRNPGA KGS
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQLALNLVNGYVFKGKPMIIQFGRNPGAVKGS 359

BLAST of Cla97C02G036070 vs. NCBI nr
Match: KGN46649.1 (hypothetical protein Csa_6G118290 [Cucumis sativus])

HSP 1 Score: 558.1 bits (1437), Expect = 2.1e-155
Identity = 309/332 (93.07%), Postives = 315/332 (94.88%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           ME AA KDAISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEGAAIKDAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTP  XXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPPLXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
             R QVTNM SDESEMESSEEDVEGKYN SGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 TTRSQVTNMPSDESEMESSEEDVEGKYNPSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV NINNDGEE EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVGNINNDGEEAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT G+PTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGTPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHAL 333
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ AL
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQRAL 332

BLAST of Cla97C02G036070 vs. NCBI nr
Match: XP_011656912.1 (PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 [Cucumis sativus])

HSP 1 Score: 558.1 bits (1437), Expect = 2.1e-155
Identity = 309/332 (93.07%), Postives = 315/332 (94.88%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           ME AA KDAISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEGAAIKDAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTP  XXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPPLXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
             R QVTNM SDESEMESSEEDVEGKYN SGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 TTRSQVTNMPSDESEMESSEEDVEGKYNPSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV NINNDGEE EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVGNINNDGEEAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT G+PTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGTPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHAL 333
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ AL
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQRAL 332

BLAST of Cla97C02G036070 vs. NCBI nr
Match: XP_008448172.1 (PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis melo] >XP_008448174.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis melo] >XP_008448175.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis melo] >XP_008448176.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis melo] >XP_008448177.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cucumis melo])

HSP 1 Score: 550.4 bits (1417), Expect = 4.4e-153
Identity = 310/332 (93.37%), Postives = 316/332 (95.18%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           MEDAA K AISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEDAAIKGAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPT XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
           X R QVTNM SDESEMESSEEDVEGKYNSSGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 XSRSQVTNMPSDESEMESSEEDVEGKYNSSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV+  NNDGE  EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVDKTNNDGEGAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT GSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHAL 333
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ AL
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQLAL 332

BLAST of Cla97C02G036070 vs. TrEMBL
Match: tr|A0A1S3BJY8|A0A1S3BJY8_CUCME (U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490451 PE=4 SV=1)

HSP 1 Score: 604.4 bits (1557), Expect = 1.7e-169
Identity = 336/359 (93.59%), Postives = 342/359 (95.26%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           MEDAA K AISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEDAAIKGAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPT XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
           X R QVTNM SDESEMESSEEDVEGKYNSSGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 XSRSQVTNMPSDESEMESSEEDVEGKYNSSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV+  NNDGE  EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVDKTNNDGEGAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT GSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS 360
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ ALNLVNGYVFKGKPMIIQFGRNPGA KGS
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQLALNLVNGYVFKGKPMIIQFGRNPGAVKGS 359

BLAST of Cla97C02G036070 vs. TrEMBL
Match: tr|A0A0A0KCT1|A0A0A0KCT1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G118290 PE=4 SV=1)

HSP 1 Score: 558.1 bits (1437), Expect = 1.4e-155
Identity = 309/332 (93.07%), Postives = 315/332 (94.88%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           ME AA KDAISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEGAAIKDAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTP  XXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPPLXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
             R QVTNM SDESEMESSEEDVEGKYN SGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 TTRSQVTNMPSDESEMESSEEDVEGKYNPSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV NINNDGEE EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVGNINNDGEEAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT G+PTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGTPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHAL 333
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ AL
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQRAL 332

BLAST of Cla97C02G036070 vs. TrEMBL
Match: tr|A0A1S3BIH7|A0A1S3BIH7_CUCME (U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490451 PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 2.9e-153
Identity = 310/332 (93.37%), Postives = 316/332 (95.18%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           MEDAA K AISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEDAAIKGAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPT XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
           X R QVTNM SDESEMESSEEDVEGKYNSSGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 XSRSQVTNMPSDESEMESSEEDVEGKYNSSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV+  NNDGE  EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVDKTNNDGEGAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300
           LEDLERGKLPPEEILSLPMFKNYT GSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEA 300

Query: 301 KSALTVKLMQEGRMRGQAFVTFPSVELAQHAL 333
           KSALTVKLMQEGRMRGQAFVTFPS+ELAQ AL
Sbjct: 301 KSALTVKLMQEGRMRGQAFVTFPSIELAQLAL 332

BLAST of Cla97C02G036070 vs. TrEMBL
Match: tr|A0A1S3BJ40|A0A1S3BJ40_CUCME (U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X4 OS=Cucumis melo OX=3656 GN=LOC103490451 PE=4 SV=1)

HSP 1 Score: 478.4 bits (1230), Expect = 1.4e-131
Identity = 272/292 (93.15%), Postives = 277/292 (94.86%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           MEDAA K AISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEDAAIKGAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPT XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
           X R QVTNM SDESEMESSEEDVEGKYNSSGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 XSRSQVTNMPSDESEMESSEEDVEGKYNSSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV+  NNDGE  EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVDKTNNDGEGAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGS 293
           LEDLERGKLPPEEILSLPMFKNYT GSPTSVLYIKNLAKDVLNDDLYYIFGS
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGSPTSVLYIKNLAKDVLNDDLYYIFGS 292

BLAST of Cla97C02G036070 vs. TrEMBL
Match: tr|A0A1S4DX29|A0A1S4DX29_CUCME (U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X8 OS=Cucumis melo OX=3656 GN=LOC103490451 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 4.1e-131
Identity = 271/291 (93.13%), Postives = 276/291 (94.85%), Query Frame = 0

Query: 1   MEDAATKDAISKDHRSKYYTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60
           MEDAA K AISKDH S+ +TEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL
Sbjct: 1   MEDAAIKGAISKDHWSRDHTEGMRARPLHFGEPIAARLGVDYPFPPHLEYAYPPPDGNIL 60

Query: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTPXXXXXXXXXXXXXXXXXXXXXXX 120
           TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPT XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  TNIVNALIAVPRFYTQVLHLMNKMNIPAPFRLPLPTXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHE 180
           X R QVTNM SDESEMESSEEDVEGKYNSSGV+KTAKSG KR RCETIVGPGIDKDVAHE
Sbjct: 121 XSRSQVTNMPSDESEMESSEEDVEGKYNSSGVSKTAKSGYKRPRCETIVGPGIDKDVAHE 180

Query: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDLKPYAT 240
           AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKV+  NNDGE  EKEA+DLKPYAT
Sbjct: 181 AVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVDKTNNDGEGAEKEADDLKPYAT 240

Query: 241 LEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFG 292
           LEDLERGKLPPEEILSLPMFKNYT GSPTSVLYIKNLAKDVLNDDLYYIFG
Sbjct: 241 LEDLERGKLPPEEILSLPMFKNYTAGSPTSVLYIKNLAKDVLNDDLYYIFG 291

BLAST of Cla97C02G036070 vs. Swiss-Prot
Match: sp|Q8RWV8|U1165_ARATH (U11/U12 small nuclear ribonucleoprotein 65 kDa protein OS=Arabidopsis thaliana OX=3702 GN=SNRNP65 PE=1 SV=1)

HSP 1 Score: 374.4 bits (960), Expect = 1.4e-102
Identity = 206/333 (61.86%), Postives = 241/333 (72.37%), Query Frame = 0

Query: 31  GEPIAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPF 90
           GEPIA +LG+DYPFPPHL+YAYPPPD NIL NI NALIAVP  YTQVLHLMNKMN+P PF
Sbjct: 139 GEPIAPKLGIDYPFPPHLQYAYPPPDANILANITNALIAVPPLYTQVLHLMNKMNLPPPF 198

Query: 91  RLPLPTPXXXXXXXXXXXXXXXXXXXXXXXXXRPQVTNM---SSDESEMESSEEDVEGKY 150
           RL LPTP                          PQ T++   SS ESEMES E+      
Sbjct: 199 RLALPTPPLPKAG--------------------PQQTDLEHQSSSESEMESDED------ 258

Query: 151 NSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHEAVGVKPSTLVPKEIPMIKKKNPVIQIR 210
                  T+KSG KR+R   +VG G+DKDV HE VGVKPS+L PKEIP I+K   V+QI+
Sbjct: 259 -----IGTSKSGRKRARHGFLVGLGMDKDVPHETVGVKPSSLTPKEIPRIRKNKHVMQIK 318

Query: 211 IAPKVIHDG-KVNNINND--GEEREKEANDLKPYATLEDLERGKLPPEEILSLPMFKNYT 270
           I  KV  D  K  + N D   E +EK++N LKP+A+LE+LE+G+LPP++ILSLPMFKNYT
Sbjct: 319 ITSKVTQDEYKEESENEDPADEPKEKDSN-LKPFASLEELEKGRLPPQDILSLPMFKNYT 378

Query: 271 TGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPS 330
            G+P+ VLYIKNLAKDV+ DD YYIFGS FE  + AKS+L V+LMQEGRMRGQAF+TFPS
Sbjct: 379 AGNPSVVLYIKNLAKDVVIDDFYYIFGSQFESSEVAKSSLGVRLMQEGRMRGQAFLTFPS 438

Query: 331 VELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK 358
           VE+A  ALNLVNG+VFKGKPMIIQFGR PGAAK
Sbjct: 439 VEVAHRALNLVNGFVFKGKPMIIQFGRTPGAAK 439

BLAST of Cla97C02G036070 vs. Swiss-Prot
Match: sp|F1Q8J0|RBM40_DANRE (RNA-binding protein 40 OS=Danio rerio OX=7955 GN=rnpc3 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 3.2e-30
Identity = 117/360 (32.50%), Postives = 171/360 (47.50%), Query Frame = 0

Query: 34  IAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPFRL- 93
           IA  LG+ +   P L+Y YPPP   ILTNI + L++VP+FY QVLHLMNKMN+P+PFR  
Sbjct: 130 IAPSLGLKFQTNPTLKYLYPPPSSGILTNITHTLLSVPKFYVQVLHLMNKMNLPSPFRPV 189

Query: 94  ------------PLP--------------------TPXXXXXXXXXXXXXXXXXXXXXXX 153
                       PLP                      XXXXXXXXXXXXX          
Sbjct: 190 TAPPPMFEMPSGPLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIRLMGLVNQP 249

Query: 154 XXRPQVTNMSSDESEMESSEEDVEGKYNSSGVT-KTAKSGNKRSRCETIVGPGIDKDVAH 213
             RP  T  SS   + +  +     K +S G +    +  +   +   +    I+  ++ 
Sbjct: 250 CKRPLRTKTSSKRKKPKLKDLLFIPKPDSHGPSGPVLQPADVFEQPHALGQKKIEFHISS 309

Query: 214 EAVGVKPSTLVPKEIPMIKKKNPVIQIRIAPKVIHDGKVNNINNDGEEREKEANDL---- 273
           E   +       +++P          + ++ +   +G    I    +   +E        
Sbjct: 310 EVSAILEGPGQNQKLPFADATEDTPDMEVSAQETTEG-FGKIYPSAQVPRQEXXXXXXXX 369

Query: 274 --KPYATLEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSL 333
               + +  +LERG+L  +EI  L +FK Y  G PT  LY+KN+AK V   DL +I+G  
Sbjct: 370 IPSEFISRRELERGRLSRDEIKKLSVFKKYEPGEPTCRLYVKNVAKHVEEKDLKFIYGRY 429

Query: 334 FE-GIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN 353
            +   +E ++   + LM+EGRM+GQAF+  PS   AQ AL   NGYV K KP+++QF R+
Sbjct: 430 IDISSEEERNMFDIVLMKEGRMKGQAFIGLPSERSAQKALKETNGYVLKDKPLVVQFARS 488

BLAST of Cla97C02G036070 vs. Swiss-Prot
Match: sp|Q5R6C7|RBM40_PONAB (RNA-binding protein 40 OS=Pongo abelii OX=9601 GN=RNPC3 PE=2 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 1.1e-27
Identity = 131/361 (36.29%), Postives = 178/361 (49.31%), Query Frame = 0

Query: 34  IAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPF--- 93
           IA   G+ +P    L+Y YPPP   IL NIVNAL +VP+FY QVLHLMNKMN+P PF   
Sbjct: 143 IAPNHGLTFPLNSCLKYMYPPPSSTILANIVNALASVPKFYVQVLHLMNKMNLPTPFGPI 202

Query: 94  ---------RLPLPTPXXXXXXXXXXXXXXXXXXXXXXXXXRPQVTNMSSDESEMESSEE 153
                     +PL  P XXXXXXXXXXXXXXXXXXXXXXXX                   
Sbjct: 203 TARPPMYEDYMPLHAPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 262

Query: 154 DVEGKYNS---------------SGVTKTAKSGNKRSRCETIVGPGI---DKDVAHEAVG 213
                                    +  T    +  S    ++   +    + V ++ + 
Sbjct: 263 XXXXXXXXXXXXXXXXXXXXXXIKDMLNTPLCPSHSSLHPVLLPSDVFDQPQPVGNKRIE 322

Query: 214 VKPSTLVP--------KEIPMIKKKNPVIQIRIAPKVIHDGKV-NNINNDGEEREKEAND 273
              ST +P        KE    +K + +    +    I  GK+    N D  E  KE +D
Sbjct: 323 FHISTDMPAAFKKDLEKEQNCEEKNHDLPATEVDASNIGFGKIFPKPNLDITEEIKEDSD 382

Query: 274 LKPYATL--EDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGS 333
             P   +   +LE+G++  EE+ +L +F++Y  G P   +Y+KNLAK V   DL YIFG 
Sbjct: 383 EMPSECISRRELEKGRISREEMETLSVFRSYEPGEPNCRIYVKNLAKHVEEKDLKYIFGR 442

Query: 334 LFEGIDEAKSAL-TVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGR 353
             +   E +  +  ++LM+EGRM+GQAFV  P+ + A  AL   NGYV  GKPM++QF R
Sbjct: 443 YVDFSSETRRIMFDIRLMKEGRMKGQAFVGLPNEKAAAKALKEANGYVLFGKPMVVQFAR 502

BLAST of Cla97C02G036070 vs. Swiss-Prot
Match: sp|Q96LT9|RBM40_HUMAN (RNA-binding protein 40 OS=Homo sapiens OX=9606 GN=RNPC3 PE=1 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.5e-27
Identity = 131/361 (36.29%), Postives = 179/361 (49.58%), Query Frame = 0

Query: 34  IAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPF--- 93
           IA   G+ +P    L+Y YPPP   IL NIVNAL +VP+FY QVLHLMNKMN+P PF   
Sbjct: 143 IAPNHGLTFPLNSCLKYMYPPPSSTILANIVNALASVPKFYVQVLHLMNKMNLPTPFGPI 202

Query: 94  ---------RLPLPTP---------------XXXXXXXXXXXXXXXXXXXXXXXXXRPQV 153
                     +PL  P               XXXXXXXXXXXXXXXXXXXXXXXXX    
Sbjct: 203 TARPPMYEDYMPLHAPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 262

Query: 154 TNMSSDESEMESSEEDVEGKYNSSGVTKTAKSGNKRSRCETIVGPGI---DKDVAHEAVG 213
                                    +  T    +  S    ++   +    + V ++ + 
Sbjct: 263 XXXXXXXXXXXXXXXXXXXXXKIKDMLNTPLCPSHSSLHPVLLPSDVFDQPQPVGNKRIE 322

Query: 214 VKPSTLVP--------KEIPMIKKKNPVIQIRIAPKVIHDGKV-NNINNDGEEREKEAND 273
              ST +P        KE    +K + +    +    I  GK+    N D  E  KE +D
Sbjct: 323 FHISTDMPAAFKKDLEKEQNCEEKNHDLPATEVDASNIGFGKIFPKPNLDITEEIKEDSD 382

Query: 274 LKPYATL--EDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGS 333
             P   +   +LE+G++  EE+ +L +F++Y  G P   +Y+KNLAK V   DL YIFG 
Sbjct: 383 EMPSECISRRELEKGRISREEMETLSVFRSYEPGEPNCRIYVKNLAKHVQEKDLKYIFGR 442

Query: 334 LFEGIDEAKSAL-TVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGR 353
             +   E +  +  ++LM+EGRM+GQAF+  P+ + A  AL   NGYV  GKPM++QF R
Sbjct: 443 YVDFSSETQRIMFDIRLMKEGRMKGQAFIGLPNEKAAAKALKEANGYVLFGKPMVVQFAR 502

BLAST of Cla97C02G036070 vs. Swiss-Prot
Match: sp|Q4G055|RBM40_RAT (RNA-binding protein 40 OS=Rattus norvegicus OX=10116 GN=Rnpc3 PE=1 SV=2)

HSP 1 Score: 125.2 bits (313), Expect = 1.5e-27
Identity = 125/362 (34.53%), Postives = 170/362 (46.96%), Query Frame = 0

Query: 34  IAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPF--- 93
           IA   G+ +P    L+Y YPPP   IL NIVNAL +VP+FY QVLHLMNKMN+P PF   
Sbjct: 143 IAPNHGLTFPLNSCLKYMYPPPSSTILANIVNALASVPKFYVQVLHLMNKMNLPTPFGPI 202

Query: 94  ---------RLPLPTPXXXXXXXXXXXXXXXXXXXXXXXXXRPQVTNMSSDESEMESSEE 153
                     + L  P  XXXXXXXXXXXXXXXXXXXXXXX                   
Sbjct: 203 TARPPMYEDYVQLHAPLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 262

Query: 154 DVEGKYNSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHEAVGVKPSTLVPKEIPMIKKKN 213
                              KR   + +  P       H    + PS +  +  P+  KK 
Sbjct: 263 XXXXXXXXXXXXXXXXXXKKRKIKDMLTVPSPASQSLHPV--LLPSDVFDQPQPVGNKKI 322

Query: 214 PVIQIRIAPKVIHDGKVNNINNDGE----------------------------EREKEAN 273
                   P  ++       NN+ +                            E  KE +
Sbjct: 323 EFNISTNVPAALNKDLETEQNNEEKNSDSPDTGLDDSNTGFGKLFPKPNVNITEEIKEDS 382

Query: 274 DLKP--YATLEDLERGKLPPEEILSLPMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFG 333
           D  P  + +  +LE+G++  EE+ +L +F++Y  G P   +Y+KNLA+ V   DL +IFG
Sbjct: 383 DEMPSQFISRRELEKGRISREEMETLSVFRSYEPGEPNCRIYVKNLARHVQEKDLKFIFG 442

Query: 334 SLFEGIDEAKSAL-TVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFG 353
              +   E +  +  ++LM+EGRM+GQAFV  P+ + A  AL   NGYV  GKPM++QF 
Sbjct: 443 RYVDFSSETQRIMFDIRLMKEGRMKGQAFVGLPNEKAAAKALKEANGYVLFGKPMVVQFA 502

BLAST of Cla97C02G036070 vs. TAIR10
Match: AT1G09230.1 (RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 374.4 bits (960), Expect = 7.9e-104
Identity = 206/333 (61.86%), Postives = 241/333 (72.37%), Query Frame = 0

Query: 31  GEPIAARLGVDYPFPPHLEYAYPPPDGNILTNIVNALIAVPRFYTQVLHLMNKMNIPAPF 90
           GEPIA +LG+DYPFPPHL+YAYPPPD NIL NI NALIAVP  YTQVLHLMNKMN+P PF
Sbjct: 139 GEPIAPKLGIDYPFPPHLQYAYPPPDANILANITNALIAVPPLYTQVLHLMNKMNLPPPF 198

Query: 91  RLPLPTPXXXXXXXXXXXXXXXXXXXXXXXXXRPQVTNM---SSDESEMESSEEDVEGKY 150
           RL LPTP                          PQ T++   SS ESEMES E+      
Sbjct: 199 RLALPTPPLPKAG--------------------PQQTDLEHQSSSESEMESDED------ 258

Query: 151 NSSGVTKTAKSGNKRSRCETIVGPGIDKDVAHEAVGVKPSTLVPKEIPMIKKKNPVIQIR 210
                  T+KSG KR+R   +VG G+DKDV HE VGVKPS+L PKEIP I+K   V+QI+
Sbjct: 259 -----IGTSKSGRKRARHGFLVGLGMDKDVPHETVGVKPSSLTPKEIPRIRKNKHVMQIK 318

Query: 211 IAPKVIHDG-KVNNINND--GEEREKEANDLKPYATLEDLERGKLPPEEILSLPMFKNYT 270
           I  KV  D  K  + N D   E +EK++N LKP+A+LE+LE+G+LPP++ILSLPMFKNYT
Sbjct: 319 ITSKVTQDEYKEESENEDPADEPKEKDSN-LKPFASLEELEKGRLPPQDILSLPMFKNYT 378

Query: 271 TGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPS 330
            G+P+ VLYIKNLAKDV+ DD YYIFGS FE  + AKS+L V+LMQEGRMRGQAF+TFPS
Sbjct: 379 AGNPSVVLYIKNLAKDVVIDDFYYIFGSQFESSEVAKSSLGVRLMQEGRMRGQAFLTFPS 438

Query: 331 VELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK 358
           VE+A  ALNLVNG+VFKGKPMIIQFGR PGAAK
Sbjct: 439 VEVAHRALNLVNGFVFKGKPMIIQFGRTPGAAK 439

BLAST of Cla97C02G036070 vs. TAIR10
Match: AT2G30260.1 (U2 small nuclear ribonucleoprotein B)

HSP 1 Score: 47.0 bits (110), Expect = 2.9e-05
Identity = 26/84 (30.95%), Postives = 44/84 (52.38%), Query Frame = 0

Query: 268 PTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVEL 327
           P   +YI+NL + +  ++L      LF    +    L V  ++  ++RGQA+VTF  V  
Sbjct: 8   PNQSIYIQNLNERIKKEELKRSLYCLF---SQFGRILDVVALKTPKLRGQAWVTFSEVTA 67

Query: 328 AQHALNLVNGYVFKGKPMIIQFGR 352
           A HA+  +  + F  KPM +Q+ +
Sbjct: 68  AGHAVRQMQNFPFYDKPMRLQYAK 88

BLAST of Cla97C02G036070 vs. TAIR10
Match: AT1G67770.1 (terminal EAR1-like 2)

HSP 1 Score: 45.4 bits (106), Expect = 8.4e-05
Identity = 36/101 (35.64%), Postives = 44/101 (43.56%), Query Frame = 0

Query: 258 PMFKNYTTGSPTSVLYIKNLAKDVLNDDLYYIFGSLFEGIDEAKSALTVKLMQEGR-MRG 317
           P       G+    L I NL   V +  L +IF    E          VK ++E    R 
Sbjct: 171 PQLNAVPEGNNQGSLVIMNLEPTVSSSTLRHIFQVYGE----------VKQVRETPCKRE 230

Query: 318 QAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK 358
           Q FV F  V  A  AL ++NG V  GKPM+IQF R  G  K
Sbjct: 231 QRFVEFFDVRDAAKALRVMNGKVISGKPMVIQFSRPGGLTK 261

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004139973.13.6e-17193.04PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cu... [more]
XP_008448178.12.6e-16993.59PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 [Cu... [more]
KGN46649.12.1e-15593.07hypothetical protein Csa_6G118290 [Cucumis sativus][more]
XP_011656912.12.1e-15593.07PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 [Cu... [more]
XP_008448172.14.4e-15393.37PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Cu... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3BJY8|A0A1S3BJY8_CUCME1.7e-16993.59U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 OS=Cucumis mel... [more]
tr|A0A0A0KCT1|A0A0A0KCT1_CUCSA1.4e-15593.07Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G118290 PE=4 SV=1[more]
tr|A0A1S3BIH7|A0A1S3BIH7_CUCME2.9e-15393.37U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 OS=Cucumis mel... [more]
tr|A0A1S3BJ40|A0A1S3BJ40_CUCME1.4e-13193.15U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X4 OS=Cucumis mel... [more]
tr|A0A1S4DX29|A0A1S4DX29_CUCME4.1e-13193.13U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X8 OS=Cucumis mel... [more]
Match NameE-valueIdentityDescription
sp|Q8RWV8|U1165_ARATH1.4e-10261.86U11/U12 small nuclear ribonucleoprotein 65 kDa protein OS=Arabidopsis thaliana O... [more]
sp|F1Q8J0|RBM40_DANRE3.2e-3032.50RNA-binding protein 40 OS=Danio rerio OX=7955 GN=rnpc3 PE=2 SV=1[more]
sp|Q5R6C7|RBM40_PONAB1.1e-2736.29RNA-binding protein 40 OS=Pongo abelii OX=9601 GN=RNPC3 PE=2 SV=1[more]
sp|Q96LT9|RBM40_HUMAN1.5e-2736.29RNA-binding protein 40 OS=Homo sapiens OX=9606 GN=RNPC3 PE=1 SV=1[more]
sp|Q4G055|RBM40_RAT1.5e-2734.53RNA-binding protein 40 OS=Rattus norvegicus OX=10116 GN=Rnpc3 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
AT1G09230.17.9e-10461.86RNA-binding (RRM/RBD/RNP motifs) family protein[more]
AT2G30260.12.9e-0530.95U2 small nuclear ribonucleoprotein B[more]
AT1G67770.18.4e-0535.64terminal EAR1-like 2[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
Vocabulary: INTERPRO
TermDefinition
IPR035979RBD_domain_sf
IPR012677Nucleotide-bd_a/b_plait_sf
IPR000504RRM_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0010229 inflorescence development
biological_process GO:0000398 mRNA splicing, via spliceosome
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0097157 pre-mRNA intronic binding
molecular_function GO:0030626 U12 snRNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G036070.1Cla97C02G036070.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000504RNA recognition motif domainSMARTSM00360rrm1_1coord: 271..348
e-value: 2.3E-12
score: 57.1
IPR000504RNA recognition motif domainPFAMPF00076RRM_1coord: 272..344
e-value: 7.8E-12
score: 44.8
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 270..352
score: 14.078
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3DG3DSA:3.30.70.330coord: 264..358
e-value: 2.9E-20
score: 74.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 105..159
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 105..122
NoneNo IPR availablePANTHERPTHR16105UNCHARACTERIZEDcoord: 23..355
NoneNo IPR availablePANTHERPTHR16105:SF0RNA-BINDING PROTEIN 40coord: 23..355
NoneNo IPR availableCDDcd12239RRM2_RBM40_likecoord: 269..350
e-value: 5.40551E-41
score: 139.621
IPR035979RNA-binding domain superfamilySUPERFAMILYSSF54928RNA-binding domain, RBDcoord: 250..354

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C02G036070Cla002125Watermelon (97103) v1wmwmbB321
Cla97C02G036070ClCG02G009830Watermelon (Charleston Gray)wcgwmbB138
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C02G036070Silver-seed gourdcarwmbB0345
Cla97C02G036070Silver-seed gourdcarwmbB0845
Cla97C02G036070Cucumber (Gy14) v2cgybwmbB111
Cla97C02G036070Cucumber (Gy14) v1cgywmbB098
Cla97C02G036070Cucurbita maxima (Rimu)cmawmbB575
Cla97C02G036070Cucurbita maxima (Rimu)cmawmbB631
Cla97C02G036070Cucurbita moschata (Rifu)cmowmbB560
Cla97C02G036070Cucurbita moschata (Rifu)cmowmbB604
Cla97C02G036070Wild cucumber (PI 183967)cpiwmbB118
Cla97C02G036070Cucumber (Chinese Long) v3cucwmbB122
Cla97C02G036070Cucumber (Chinese Long) v2cuwmbB116
Cla97C02G036070Bottle gourd (USVL1VR-Ls)lsiwmbB046
Cla97C02G036070Wax gourdwgowmbB355