Cp4.1LG03g05890 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG03g05890
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionMajor latex-like protein
LocationCp4.1LG03 : 4442904 .. 4474190 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTGAATCAAATCAAAACAAAATGAGCCAAATTGAAAGCATTTGGGGAAAGGTTCAGCTAAAATCGTCTCCTGAGAAGTTCTTTGGCTTCTTCAGGAACCATATGGGCGATTTGGTCCATATGTTCCCTGACCACTTCCAGAGCTTCCACTTTGTCGAAGGACAAAACTTCGACGATGGCAGCGTCGTGCACTGGAAATACCACCTCGGTGAGATATTATTATCTCGTTTCTTCTGTCTTTTAAAATGTTTACTTTAATCTTTACGATCTCAAGCTCTAGTTTTCGAGTTTTACCATGTTTATTATTTGTCTAGATCGTAATTTATAGGGTAGAGATTAAAATTTCTAAACGGTTGGTTAAAGTTATAATATCTTAGCTAATCGAACAATATTTAATTTTAATTATACTTAACAAAATAAACTCCGGCTTAATTCTAAAATNAAAAAAAAAAAAAAAAAAAAAAAAAATAAATACTACTACAGTCTTTTAGTGTGAGATCCCACATCAGGTAAGGAGAAAACTAAACATTGATTATAAGGGTGTGGAAACCTCTCCCTAACAGACGCGTTTTGAAACCATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTCTTTTAGCGTGAGATCACACGTCAGTTGAGAGAGAAAACAAAACATTGGGTGTGGAAACCTCTCCTTAACAGACACGTTTTAAAATTATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTGTTTTAGTGTGAGATCCCACGTCAGTTGAGAGAGAAAACAAAACATTGGGTGTGGAAACCTCTCCTTAACAGACACGTTTTAAAATTATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTCTTTTAGCGTGAGATCACACGTCAGTTGAGAGAGAAAACAAAACATTGGGTGTGGAAACCTCTCCTTAACAGACACGTTTTAAAATTATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTGTTTTAGTGTGAGATCCCACGTCAGTTGAGAGAGAAATGAAACATTGGGTGTGGAAACCTCTCCCTAACAGACGCGTTTTGAAACAATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGTAGTGTTTTAGTGTGAGATCACACGTCAGTTGAGAGAGAAAACAAAACATTCGGTGTGGAAACCTCTCCTTAACAGACACGTTTTAAAATTATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTCTTTTAGTGTGAGATCCCACGTCAGTTGAGAGAGAAAACGAAACATTGGGTGTGGAAACCTCTGCCTAACAGACGCGTTTAGAAACCATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTCTTTTAGTGTGAGATCCCACGTCAGTTGAGAGAGAAAACGAAACATTGGGTGTGGAAACCTCTCCTTAACAGACACGTTTTAAAATTATAAGGTTGACGATGATACATAACTAGCTAAAACAGACAATATTTGCTAGCTAAGCAGTCTTTTAGTGTGAGATCCCACGTCAGTTGAGAGAGAAAACAAAACATTGGGTGTGAAAACCTCTCCTTAACAGACACGTTTTAAAATTATAAGGTTGGCGACGATACATAACGAGTTAAAACAGACAATATTTACTAGTAATGAGCTTGAATTATTACGTTTAGTAGAAAATTAAGAACAAAATCGATATGAAACGAGATCAAATTTTACACGGGAAGGTATGGTTGTTGACGCATGAAGATACTAAAGAAAGGGTGTTGAATGATGAACAAATGAACAGGAATTCCAGAAGCAGTAAAGATAAAGATGAAGAATAGGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCATTAAAGCATTACAAAGTTTTCAGAGCCAAACTTGAAACTGTTAGTGGAGGGTTAAACAAAGTGGGAGGAAGCTTTGCAAAATGGACAATTGAGTATGAAAAGGCTCATGAAAACGTTCCTTCACCAGAAACCTACATGGAATTGGCTCTCAAAGTCAGCAAAGGTCTGGATGCTTATCTTTATCGGAACTAAACCTCTATAACAACCAAGTGCATGCCCTAAGTTAATGCCTTTTCTACTTGGTTTGCTCTTAAATAACAACTTCTATGTGTAATGTAATGTTATGTTGTATAATTATGTGGGATTTGGTGAAAATGTGTGTTTATATTTGATCGAAAGGGTATGTATGTGTGTGTCAACCCTCTTAAAAACTCAATGGGAAATTTGATTTCAAGTGTGTTTGTAAGAGTCTAAGCCTATCGCTAATGGATATTGTAACGACCCTAAATTTCTACATACTTAGAGTTGTTACTAACATTTCATGCTCATCTCTAATGCAAAAATATAACTCTCCAAATGAACATCATTTATTACATAAACTTTGAAAAAGTTTAAAACAACTTTCTTTTTTCTATAAAATACAACCATGGTGTTACGTGTTTAAATACGAAATACTCGAGTTTAAACAAAACAAAAACAAATATAAACTAATTTAAAACGATGAAATGCAAAACAACCTATTCTAACCTAGAATGAGTATTTCGGAGAATACTCATAAGTGGCCCCATTATTGGGAGCCATCTAAACCCTAAACACATGCAATCATGATTTAGGACCTATCTCCGACATCATTAGGGTGGCCTCCAGTATCAGGCAAATTCGATGCATAATACTTCCCTACACACGACTCACACATGCGAGTGTGAATCCACAGGGCGCTCGCACACCCCCTGGACCATTTGGTCAAGCAAACAGATATTATTTGTGTTGGCTTATTATGTATCACCGTCAACCTCATGGTTTTAAAACAAGTTTGCTGGGGAGATACTTTCATACTCTTATATCTTTATTTCTCTCTCTAACCGACGTGGATCTCATGGTGTTCAAAATGAAATATCAATGCTTTCAACTTTTTAATGATATTGTTAATATTTAACTTATTAGAATTCAAAATCAGTATTTAAAAAAAAAAAAAAAAAAAATAAATAAATAAAAAAAATACAAATGGCTCATAGTAACTTTCATCTCTTTTCTTAATCATCATATTTTTAGGTGAGGACAATAGGGTAAGTGCTAGCCTACTTGTCAGCCAAGGGAGGTTAGTTGACAAACCCATCCTACAAGCCATGTAGTCCCCAAGGTTGGGGGCGCTACCCCCGATACATGCTAGATCCAAAGGTTACCTACATGGTCAACACTCTCTTGTAGGACGGATCTTCCGTGCGGGGTAACCACAGATTACTCACATGATCAACACCCTACCACAAGAAGAATCCATCGTGCAGGGTAATCCATCAAACCACTAAGTCACACATCCCTAATTAGCCTCATGCCTAGCTCAACGATACCGTACTGGAGTGGAAAAGGAAGTGGAGGTCCTTAAACACTATACTAGCTGCTATCGGCCTAGGAGAGCTACCTTTGTCAATCCTAGATGCCGTGGAGGTTTGCTAAGTTCTTAGGGGCGTCTAGTGGTTTGAATTCCATCTCTCATTTTAGAACGTTCAAAGATTAACTCCACAAGTTAACCCAATCTTTTGGTGCACATAGATCTAAACTTCTGATTGCTTATGATCCTGGGGTGCATGGTCTATTGGAAACGTACTAACTTGAGTCAACATGTAAATCACATTAGTTGTAGCGTCTTAAATTTAACTGAGTTCTCTCATGCATGTTTTGATATGGCGAAATTCTTATCGATCCCCTAAGCACGTAATTGATAATTTAAACTTCACCTTAATCATATCTGGGTGTTGAGGAAGTTTATTGGATTCCTAGGGCAACTAGGTGACACACAGACTTCTCATTCCAGACTGCTTTGTTTTGACTCCTCAAGTCTTTCTTAGTCTGGGAGCACTAGGTCTACACCTTTTGACTTTCAATCACGATACTAAGGTGCTCAGTCTACTAGAGGCGTCCAAATCTAGGTCATTTTCTAGCCAACCTAACTCTAACGCCCTAGGTCTGTAGCACACTCTACAGGGAGAACACACAATCAAACACGTGGAACTAACACATACTAGCACACTCTACAGGGAGAACACACAATCAAACACGTGGAACTAACACATACTATATCATTGCAAACATACAAGCAAAGAAGCTCAACAAGAAGATAACATCCATAAATCATCATAGCACATATCACAAAAACCCATCATCTTTTCTAAAAAGCATATAAAACAAAATTCATCACAAATCTCATTACAGATGCATTTTGATAATCTTTTATATCATGCATAACATACTCATCCATATTGATCAACTCGTATTCTAGTCATGCATATTCTCTTTCTAATTTTGCATAATACATAAAACAACATGCTTCTTAGAAATCTCAACCTAAGAACACAACGTTAATTTACCATAACATACATACAATACATACTCATTAAAAAACGCCCATCCGAAGTCCTTAACATGCTGCTTAGAAATCTCAATATTGATTTTTTTTTAAGATTAAAAAAACCCAAATGAAACTTAAACAAGCAACAAGAAAAAATGATCAAACATCTTAAAAATTGAGAAACAAAAAGGCTTCCCAAAAATTATTTGTCAAGTGGATTGAGCCAAAATGAATCATAAAACAAACTAGAGATTATTTTACAATCTTAAGCTGATCTCCAAGAGATAACAACTCCACAAACTGGATTTTAATACGAGTTAGTGGGGGAAAAATGTGTATGATGAAGTGTCACAAGAGGGTTAAAAAATATAAAAAGTTGGGCCAAATTATTGAGTGGAAGTTCAAATAATCCCTAAAATATTCACACAAATACACTATAAAATCCCTCACTTGGCTTAAACTTCGAATCAAATCAAAACAAAATGAGCCAAAGTGATAGCATTTGGGGAAAGGTTCAGCTGAAATCGTCTCCTGAGAAGTTCTATGGGTTCTTCAGGAACCATATGGGCGAATTGGTCCATATGTTCCCTGATCACTTCCAGAGCTTCCACTTTTTGGAAGGACAAAACTTCGACGATGGGAGTGTTGTGCAATGGAAATACCACCTTGGTGAGATATTATCATTATTATTATTATTTGGCTGTTTTCTTCGATTTCTTTTAGGTTTAAATCCTATGTTTTTGTCTTTTAAAATGTTTATTTTAATCTTTACGAATTTTGAAAAGACGATCAATCTCTAGTTTTCGAGTTTTTATCATATGTAGTAATTTATAGGGTAGAGATTAAAAATTGTAAACAATTGATTAAAATTATAATATCTTAACTTGTGAAATCGGACGTCAGTCAGTAGAAGAATGGAACAAAACATTACTTATAAAGGTATGAAAACCGCTCTCTAATATACGTATTTTAAAACTATTTGTTAGCAGTGTGTTTGAGTGGTTACAAATGGTATCAGACTCAGACACCGGAGAGTATGCTATCAAGGACGTTGGCCCCAAAAGAAGGTGGATTGTGAGATCCCACGTCGGATGGAGAAGGGAGTGAACCATTCCTTTTAAGAGTGTGAAAACCTCTCTCTAATAGACGACGATACGTAACGGGCTAAAACAGATAATATTTGCTAGCAGGGTAGAAGATCAAATTTTGTATGGTTGTTGATGCATGAAGATACTAAAAAAAGGGTGTTTGAATGATGAACAAATTAACAGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAGTTACCAAAGGTTTAGATGCTTATCTTTATCGGAACTAAACCCCTAAAACAACCAAGTGTATGCACTAAGTTCATGCTTTTTCTAATTGGTTTGCTCTTAAGTAAGAACTTTTATGTGTAATGTAATGTTATGTTGTATAATTATGTGGGCTTTGGTGAAAATGTGTGTTTATATTTGATCCAAAGGTTATGTATGTGTGAATGCTCTTAAAAACGTGAAAAATTTGATTTTAAGTGTGTTTATAATCGCTTAAATCTACCGCTAACAAATATTGTCCGTTTTGGCTCGTTATGTATCACCGTCGGCCTCACGGTTTTAAAACGCGTCGAGAGAGGCTTTCATACNAAATTATTGAGTGGAAGTTCAAATAATCCCTAAAATATTCACACAAATACACTATAAAATCCCTCACTTGGCTTAAACTTCGAATCAAATCAAAACAAAATGAGCCAAAGTGATAGCATTTGGGGAAAGGTTCAGCTGAAATCGTCTCCTGAGAAGTTCTATGGGTTCTTCAGGAACCATATGGGCGAATTGGTCCATATGTTCCCTGATCACTTCCAGAGCTTCCACTTTTTGGAAGGACAAAACTTCGACGATGGGAGTGTTGTGCAATGGAAATACCACCTTGGTGAGATATTATCATTATTATTATTATTTGGCTGTTTTCTTCGATTTCTTTTAGGTTTAAATCCTATGTTTTTGTCTTTTAAAATGTTTATTTTAATCTTTACGAATTTTGAAAAGACGATCAATCTCTAGTTTTCGAGTTTTTATCATATGTAGTAATTTATAGGGTAGAGATTAAAAATTGTAAACAATTGATTAAAATTATAATATCTTAACTTGTGAAATCGGACGTCAGTCAGTAGAAGAATGGAACAAAACATTACTTATAAAGGTATGAAAACCGCTCTCTAATATACGTATTTTAAAACTATTTGTTAGCAGTGTGTTTGAGTGGTTACAAATGGTATCAGACTCAGACACCGGAGAGTATGCTATCAAGGACGTTGGCCCCAAAAGAAGGTGGATTGTGAGATCCCACGTCGGATGGAGAAGGGAGTGAACCATTCCTTTTAAGAGTGTGAAAACCTCTCTCTAATAGACGACGATACGTAACGGGCTAAAACAGATAATATTTGCTAGCAGGGTAGAAGATCAAATTTTGTATGGTTGTTGATGCATGAAGATACTAAAAAAAGGGTGTTTGAATGATGAACAAATTAACAGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAGTTACCAAAGGTTTAGATGCTTATCTTTATCGGAACTAAACCCCTAAAACAACCAAGTGTATGCACTAAGTTCATGCTTTTTCTAATTGGTTTGCTCTTAAGTAAGAACTTTTATGTGTAATGTAATGTTATGTTGTATAATTATGTGGGCTTTGGTGAAAATGTGTGTTTATATTTGATCCAAAGGTTATGTATGTGTGAATGCTCTTAAAAACGTGAAAAATTTGATTTTAAGTGTGTTTATAATCGCTTAAATCTACCGCTAACAAATATTGTCCGTTTTGGCTCGTTATGTATCACCGTCGGCCTCACGGTTTTAAAACGCGTCGAGAGAGGCTTTCATACCCTTATCAAAAAAATGATTCATTCTTCTCAAGTGAAACGTCAATGCTTTCAACCTTTTAATGATATTGTTAATAATTCATCTTAATACAAATAAAAAACAATTAATTATGTAATTATTCATGTTCAAAATTCAGATCACGATCCAGAATCTGAATTCGACACCAAAAAAAGAAGGTTTACATTGTTTATATTGGTAGTAAATTTCATTTCCTTTCAGATTTTCTTAATCATCTTAGTTTTAGGATCGCACTCGTTCGGATATAATATTGGTTCATTCATGTACCCTCCTTTATAGAGTGTCACGCATCTAAAATTATGTAATATAATTAAATTGTTAGTAGCGGTTGAACCAAACATTAATATTATTTATTTATATGAAATTCTGATAGTATTTCATGCAACACCAAATATTGTTATATGATTTTTAGATATCTTAAACAACTTTCAACCAAACGACCCCTTAAATGCGATGTACATACAGAAGTACTCTTCAAAGCGTTGGAACGTGACGTGATCATCGGGTGACCATCTTTGAGTTTTTTTAGAAAGATGATATTGATTTCTTGAAGGTGAAGTGATCATCGAGTCACTTTATTTTTTTTGTGAATAATGAACTCTGTATGATCTTTATTTTTTTTTTGATCATATAGACATTCACGACCCCACATACGCCAATTGTTGATGCTAAAATATTACCTTAAAAAATCTCACTTGACTTTTACGGCCTCAACAGTAAATTTAGTGAGATCGAGAACAAACTCTATATAATCTTTATTTTTTTCGATCATGCATCTCTGACAAAACACTCTCATCCCCATAGATGACAACGGTTGATACCAAAATCTTGTCATGAAAAATTTCACCTTACTTTCACGGTCTCAACGGTAAATTTGGTGAAATCGAAAATAAGCTTAATCTGATCTATATTTTTCTTGGATTCTTGACCTTTTTTCGAGTAGTGTTCTAATAATTTTTTCTTCTACACGATCTTGAAATTCTACCTCTAAANGCTGTTTTCTTCGATTTCTTTTAGGTTTAAATCCTATGTTTTTGTCTTTTAAAATGTTTATTTTAATCTTTACGAATTTTGAAAAGACGATCAATCTCTAGTTTTCGAGTTTTTATCATATGTAGTAATTTATAGGGTAGAGATTAAAAATTGTAAACAATTGATTAAAATTATAATATCTTAACTTGTGAAATCGGACGTCAGTCAGTAGAAGAATGGAACAAAACATTACTTATAAAGGTATGAAAACCGCTCTCTAATATACGTATTTTAAAACTATTTGTTAGCAGTGTGTTTGAGTGGTTACAAATGGTATCAGACTCAGACACCGGAGAGTATGCTATCAAGGACGTTGGCCCCAAAAGAAGGTGGATTGTGAGATCCCACGTCGGATGGAGAAGGGAGTGAACCATTCCTTTTAAGAGTGTGAAAACCTCTCTCTAATAGACGACGATACGTAACGGGCTAAAACAGATAATATTTGCTAGCAGGGTAGAAGATCAAATTTTGTATGGTTGTTGATGCATGAAGATACTAAAAAAAGGGTGTTTGAATGATGAACAAATTAACAGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAGTTACCAAAGGTTTAGATGCTTATCTTTATCGGAACTAAACCCCTAAAACAACCAAGTGTATGCACTAAGTTCATGCTTTTTCTAATTGGTTTGCTCTTAAGTAAGAACTTTTATGTGTAATGTAATGTTATGTTGTATAATTATGTGGGCTTTGGTGAAAATGTGTGTTTATATTTGATCCAAAGGTTATGTATGTGTGAATGCTCTTAAAAACGTGAAAAATTTGATTTTAAGTGTGTTTATAATCGCTTAAATCTACCGCTAACAAATATTGTCCGTTTTGGCTCGTTATGTATCACCGTCGGCCTCACGGTTTTAAAACGCGTCGAGAGAGGCTTTCATACCCTTATCAAAAAAATGATTCATTCTTCTCAAGTGAAACGTCAATGCTTTCAACCTTTTAATGATATTGTTAATAATTCATCTTAATACAAATAAAAAACAATTAATTATGTAATTATTCATGTTCAAAATTCAGATCACGATCCAGAATCTGAATTCGACACCAAAAAAAGAAGGTTTACATTGTTTATATTGGTAGTAAATTTCATTTCCTTTCAGATTTTCTTAATCATCTTAGTTTTAGGATCGCACTCGTTCGGATATAATATTGGTTCATTCATGTACCCTCCTTTATAGAGTGTCACGCATCTAAAATTATGTAATATAATTAAATTGTTAGTAGCGGTTGAACCAAACATTAATATTATTTATTTATATGAAATTCTGATAGTATTTCATGCAACACCAAATATTGTTATATGATTTTTAGATATCTTAAACAACTTTCAACCAAACGACCCCTTAAATGCGATGTACATACAGAAGTACTCTTCAAAGCGTTGGAACGTGACGTGATCATCGGGTGACCATCTTTGAGTTTTTTTAGAAAGATGATATTGATTTCTTGAAGGTGAAGTGATCATCGAGTCACTTTATTTTTTTTGTGAATAATGAACTCTGTATGATCTTTATTTTTTTTTTGATCATATAGACATTCACGACCCCACATACGCCAATTGTTGATGCTAAAATATTACCTTAAAAAATCTCACTTGACTTTTACGGCCTCAACAGTAAATTTAGTGAGATCGAGAACAAACTCTATATAATCTTTATTTTTTTCGATCATGCATCTCTGACAAAACACTCTCATCCCCATAGATGACAACGGTTGATACCAAAATCTTGTCATGAAAAATTTCACCTTACTTTCACGGTCTCAACGGTAAATTTGGTGAAATCGAAAATAAGCTTAATCTGATCTATATTTTTCTTGGATTCTTGACCTTTTTTCGAGTAGTGTTCTAATAATTTTTTCTTCTACACGATCTTGAAATTCTACCTCTAAAAACCCAAAAAAATCTAAAAATTAGAACTATAAAGAAAAAATTGAAAAATTAATTCTCGTTGGAACTCAGATATTACATTCAACCTCGCCTTTTCACCTCTGCTATAACGATGACAATATCTACGTTAAGACGCCAACTATTGATACCAAAATCTTGTTACGTAAAATCTCACCTGACTTTCACGTACTCACATATATTATATTACTATTTTCTAAATTGAGTAAGTGGAACCAACAAACAAACATTATTTTACACAATCTTGCTGATATGCAACAGATAACAACTCTAATACTTGGATTTAAAATAAAGTTAGTGGGAAAAATAAAAGATATTAAAGTGTTCATAATGAACTAATCACAAGAGTGTTTAAAGAATGATAAAAAGTTGGCCAAATTAGATAGCAAATATTAATGATTAAATTTATCACTACCAATCAACTTAAGTTGAGTTTATTCATAATTTAATAGACATAATAGGAGAAGAGTTTGTGATCGAATTCCCTTGTGGGTTACTTTTATACTTTTAATTATTTAATCTTTTTCGATTTTTCTGTTAGCTAATCGAATAGGAGAAGAAATATAAAGGTGACTACTCGTCAAAGCATAGTTCTTTATCTTTTTAGTAGCTAGTCAAAATGAACCTGACACCCGTCTCGAGAAATGAATAAATGAATCTTGATGTAATTTTAAGTCTATTTTTAAGTCAAAGAATTTTTAGGGACACAAAGAACTAAGTGAATTGGGACTAAAAATTAACAATTAAGGTGCTTTAGGATAGAGTAGGGTTAAGTTGTAACCCTATTATCAGGTTGGTCGAGTTGACCGGACTAAGAAAATGTCAACTCGCAAGCTGATCCAAATTTTCGATTTTCAAAAAATTTAACTCAACTGAAATCGAAAAAAAAAAAAAAAAAAACTTAACCATACCCAACTTTAATATCTAATTGTAAGTTATAGACTAAGAATTTAATATCTAATTGTTTTGTTTAGTGGATGTAGCACTCAAAAGATAGACAAAAGTTCTTAAACTAACCTTTAATTTCTCTTAAAATATTGGGAACTTGCATATAATGCAAATAAAATCCAATTAATTTCAAGTATATTGGCACTAACATTATGTATTTGCAAATAATATACTAATTTTAAATATTTTTCGTTTGCTTACTTCAATATTTTTATGTTATCTTCTAATACTCGTTTACGTCATTAGTGATGCATTGAGATGTTCGTAAAGGAATAGGACGTCTTTCTCGTTTTCATCTCCGGTTTTAATCTTTATTCTTTCAAATTTTTTCATTTATTTTTAAAAGAATTGGGGTGAAGATTTACCGTAGAGAATTCGTGTAGGGCTCGAGATCCGTGTAGAAAACTTTTATCCATTTTATATTTCTCTCTCTATACGTATATTTATTTATATTTATATATAAAAATGAAGCGAGACGAGTCGGGATGGGGATTTTTGCAAGGTTGGAGAGGGGGACTGGATGAGATATATAGTTTCAATTCCCGACCCCATTTAGTTAATAAGGATATGTGAGATCCCACGTTGGTTGGAAATGAGTATTATAAAGGTGTGGAAACTTCTTTCTAGTAGATGTGTTTTAAAACCCTAAGATTGGCGGCGATACACAACGAGTTAAAGCGGACAATATCTACTAACGATGGACTTGGATTGTTACAAATGAAATCAGAGCCAGACACCGGGCGGTGTGCTAGCGAGGGCGCTGGCCCCCAAGGGAGTAAATTGCGAGATCCTACATCAATCGGAGAGAGAACAAAACATTTCTTATAACGGTGTGGAAACCTCTTCCTACTAAACATATTTTAAAACTGTGAGGCTGACAGCAGGACAAAACAGACAATATCTACTGGCGATGACCTCAGACTTTTACATGATAAATCTTATTTCGCTCAATTATTTTCTGTTTGAACGAAGATTTTTCACTCTATTCAAGGCAAATAAAATAAATATCTTGATAGATTTTTAACATATATTTATGCTATTAGTCTATCAATGACACGCTACCAAATGCTATTGTTGATATGGTCTAATTTAAACAAAAACCCTAGTGACTTCTCTTTAAAGATGTCATACCAATTGGACTTTATTTGCACCTAAAATATGGGATTTAAAATACAATCAGTGGGGAAAAAAGTGATCATTCATCCAACTTTGCTATAAAATCCCCCTCTTAGCTTCTACTTTGTATCACATCGAAAGAAAATGGGCAAAAGTGATAGCATTTGGGCAAAGATTGACCTAAAATCTTCTCCTGAGAAGTTCTATGGGTTCTTTAGGAACCATTTGGGAGATTTGGTGGATTTGTTTCCTGAGAATTACAAGAGCATTCAACTTGTGGAAGGACAACATTTCTCCGGTGGCAATGTTGTTCTATTTAAATTCCAATTTGGATTTGGTGAGACAATAATTCCTTCTCATATTGCATAACGTTGATTATATATATTTGTTCATGCATGATCAGGATATTAAACATAGTTTTGTTTTATCTTTTCTTATGAATATGATCGAACAGGTCATCAACTTCGAGTCGAAAAATGGGCAATAAGAGCTGTGGATGATGTGAAGAAATACATAATCTATGAAGCTGTTGAAGGAGATGTTCTAAAGCAATTCAAAGTGCTTAGAGTAAAAGTTGAAGCTGTTCATGGAGGATCAACCAAAGTAGGAGGAGGAAACTTTACTAAATGGACTGTTGAGTTTGAAAAGGCAAATCAAAATGTGGCCTCACCACAAAACTACTTGGAGTTGTTCGTCAAAATCTCAAAAGGGGTGGATGCATATTTCTCCAAGAACTAAAATCCTACAATAATGAAGGGAGTGATCTTAAGTTTGGCTGTGTTTTCTACTTGGTTTGTTCTAAATAAGAAATTATGTGTGTAATGTGTTTAAATATGGGTTATAAGGATGCATGTGAGGCTATGGAGTATCTATTATATGTAAGTCTACTTATGTGTGTGTGTAAGATCTCAAATGTGATATCTAATGTGGGTTTATGGTCTAATGAATTATAACTCATCAAAATGTTTATCTTATAAATCCATGAGTCTTTATTTATGGAGAAATCATAATTGAATTGAAAATATTGAAAGTTTTGTTTAAATTTGGACACCAAAAAAAAATTGAAGTGGTTTGATATGTGAAGACTCAACTAGAGGCGTGTATTTAATTTGTGGGGCCAAGTGTCATAATCGCACTCTTGTGGTAGTGCGACACTTGTTCTGATCCAGCGAACAAGTCAGTCGTGAGAAATCTGGACTTCGCGGACAATGTCGATGTATGCTTGAGTCTCGAGTCACGTTTGAGAGGAATGATTTAGAAAATGTAAGCGAGGAAATATGTTGAAAACGATTTGAAAGAAAGGAATGTTATAAGCTAAAAGTGATTAGACAACAACAAATAACAACATGTTTTATAGCATATAACTTGGGTAGGAAGACGTGACAACTAGTCATATCCCCTTATAATGCATTCTGGGGCATTTTAGCCTCGACAGAGGTAGACATGATTTTGGTGAACGGTCATACACCTCTTTTGATACAACATAAAATAGTAAAGACTTATTTAAGATGAAAGCATGCAAAAGGGGTTTGGAACGGACATGCAGCCATGAACAATATGTCCTGGACAAGCATGACTAGGACATCCAACAAGCGGCTATACACAACATGACTTAGATAAGCATGACCGTGACACCAATACCTGAACTTGCCCCTCATTTAAATGGGGTATTTTGGGGAGTAGATAGAGATTTCTCTTTGTTAACTAAACCAGATTGAGGAAAAATTATATTCCCCATCCTCCTTTCTGCCCCCCGACCTTGTCGAAATCCCTGCCTCGCCTCATTTTTTATACATAAATATATTTGTATATATATAAAAAAAAAACGTATTATATTATATTTTTGTTTACGTAGTATTATTGGATTTATTAATTTTTTAATAGAAATTATATTAGAAAATAAAAATTATCCATTAAAAATTTATTTTAAACTAATATTTTTTTCTTTTGAACCCTTGGGAATTTTTTAGCACGAACTCGTGGTAAATCTCATCTTCTCATCTTCTCGTGGTAAACGGAGTTAAAACATTTGTAATGCAATCATTTAAAACAATATTGTTATCAACAACATAACGAAAAATAGACAACGTGAACTTTAAATAAAATAATTAAAACGATCCTCTGATATAATTAAAGGATGACAAAAATAAATATAAATGCAACTATCCTAACTTCGATCCCATGATCTTCACCAAGACATCACCATCCTGTTAGACGAACACGACTCTCCACAATGGTATGATATTGTCCACTTTGAGCATAAGCATTCATGACTTTGCTTTGGGCTTCCCCAAAAAGGCCTCATACTAACTCCATTGGTATGAGGCCTTTTGGAGAAGCCCAAAGCAAAGCCATGAGAGCTTAGGCTCAAAGTGGACAAGGAGAGTCGTGTTCGTCTAACATGGTATCAGAGTCATGCCCTAAACTTAGTCGGGCCAATAGATTGGTAAATTCTCAAATATCGAACAAAGAACTCCAAAAAGAAAAGGAGCAAAGCCTCCTCGAAGGCAGTAAAAAATGACTAAGACTCCAAAGGAGTCGAGCCTCGATTAAGGGGAGGCGCACTTTGCACTTTGTTCGAGGGGAGGTGTTGGATGATGGAAGTCCCACATCGACTAATTTAGGGAATGATCATGGGTTTATAAGTGAGGAATACTAACTCCATTGGTATGAGGTCTTTTGGGGAAGCCCAAAGCAAAGCCATGAGAGCTTAGGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGTTCGTCTAACACATCCGAATAAAACAAAGTTTACTTGTGAATTTGAATTTCATGGATGAATCTCATAAGCATTCAATAAGTTGTTTTATCTACGAGAAATTATGATAATTTGAGCGTTGCTATCAAGAAACTTTTATAATAATGTATTAATAAGGAGAATGACCTAAAGTTTCTATCTTCGTCCATTATTATTATTATATCTTCAAATCGGGACTTAAACAAACATATATTATATTATTATTTTCTAAATTGAGTAAGTGGAACCAACAAACAAACATTATTTTACACATCTTGCTGATATGCGACATTAACAACTCTAATATTTGATTTAGAATAAAGTTAGTGGGGAAAAATAAAAGATATTAAAGTGTTCATCATGAACTGATCACAAGAATGTTTAAAAAATGATAAAAAAATTGGCCAAACTCCTCAAACCCCACTCTCAATAATTCATGCAAAAAGAAAAAGAAAAAAAAAATGGCCTTGAAGTTTAGAAAAATTTAATTCTACTCAAGAAAAATACTTACATGCGGTAGATGTAGACGAGAGTTGTAGTGTAACGACGTTATTGTTTTTATTGAAAGAGTCGCTACGATATGCACGACGTAATATTTTTATGCGAAAACATTCATTTAAAATAGTTCATAAAACGATATCACAAACTTAAATATTCGTGTTAAGAGTCTTAAAACAAAACAACAAAAAATAATCCATGTGAAATATAAATAAAACCAAACAAGTTTTAATCTAAGATCTACAAAATACAAATATAACTACCTTAACCGGTTGCATAGTCTTTCTGCGATGCCGCTGTCAACCGTACATGAATGTCTTGCCTTGACCTAAAATATAAAAACAGCACAATATTTGAGTATTTTAAGAAATAATCAGTAAGTGACCCACTATTAAGTTCGGGAAATACAAACACATATAGAATGATACGACCTATCATTTAAGATTGTATTTTTAAGCGGCTTTCGAGTTCGGACAAATTGGATGGGATGGAAGCTATGGTAAATTATAACATAAGAATTTAATATCTAATTGCTTTGTGGAGTGGATGTACCGCTCAAAAGAGAATCAAAAGTTCTTAAACTAACATTTAATCTCTTAAAATATTGGTAACTTGCAAATAGTGCAAATAAAATCCAATTAATTGCAAATAATATACTAATTTAAAATCTTTTCGTTTGCTTGCTTAAATATTTTTATGATATCTTTGATACTCATTTACGTCATCAGTGATCTATGCATAGAGATGTTCACGAACTTTAAGTTGAATAAAAATATCTTTTTCCGTTCCACTTCTTATTTTAATCTCTATCCTTTAAAATATTTTCATTTATTTTTAAGACTGTGGGCGAAGATTTTCGGTAAAAAAAAAAAAAGAAAAATTTCACCAAACGAGTCTGGGTGAAGATTTTGGTAAGGTTGGAGAGGGAGACAAGATGAAAAATATAGTCTCATTCCCGGCCCCATTTAGTTCACGAAGATAAATCTCTTTTCACTCATTGTTTTCTACTTGAACCAAGATTTTTCATTCTATTCAAGGCCCGTAAAATATATATCTGGATAGATTTTTAACATGCTTATAATATTTATTGTAACAACCAAAGTCCACCGCTAGCAAATATTCCGCTTGAGCCCGTTACGTGTTGCGATCCTTCTAACGATTTAAAAACGTGTGTATTAGGGAGATGTTTCACACCCTTATAAGAAATGTTTCGTTTCTGTCTCCAACCGATATGGGATCTCACATTTATGCTAATAGTCTATCAACGATAACGTCTATCAAATATTATTGCTGATATGGTCTAGTTTCGACATAAACCCTCGTAATTTTTCTTTTAGTTTCACCGAAAAGATGTCATACCAATAGAGATAATTGTCCTCAACTAGGTACAAAAAAATTCATCCCTTTTCTAACTAATGCAACACTTTATTTGCACTCCTAACCTAACTTTTCGAATCATTGATTAGTGGCGTTTAAAATAATTCCTAAAATATTCACCCAACTTCGCTATAAAATCCCCCTCTTAGCTTCAAGTTTGTATCACATCGAAAGACAATGGGCAAAAGTGATAGCATTTGGGCAAAGGTTGACCTAAAATCTTCCCATGAGAAGTTCTATGGGTTCTTCAGGAACCATTTGGGAGATTTGGTGGACTTGTTTCCTGAGAATTTCAATAGCATTCAACTTGTGGAAGGACAACATTTCGATAGGGGCAGTCTTGTTCTAAGTAAATACGAATTTGGTGAGAGATTCTCTTCGTTTAAATTGGTCGACACATTTCATTTCATTTACGTTTCGATAATTATTTTGGTTTTCGACATTTTAACTCATATGTTAACCTTGAGATAAACGATAGGGTCATCTAATTACATTCCAGCCGACTACTTTTTTATTATTTATATTTTTTATAGCCAATTCTAATCTCAATTTTTTATAGTTTGGATTGGTTCATCAATTTATTCGAGTTGTTTGTGTTGGAAGATGAAAGTTCCACATCGGCTAATTTAGGGAATGATCATGAATTTATAAATAAAGAATACTCTCTTCATTGGTGTGAGGCTTTTTGGGGAAGCCGAAAGCAAAGCCATGAAAGCTTATGCTCAAAGTGGACAATATCATACCATTGTGGAGAGTCGTGTTCATCTAACATGGTATCAGAGTCGTGTCAATAGAGTGGTAAATCCTCAAATATCGAACGGAGTCAAGCCTCTTTGAAGGCAGTAAGAAATGATTTCAACCACACATTTAGTTTTTTTTTTTTTTTGAAATTAAGGCTAAAACAAATAAATTTCATCAAACAAAATCGATATCAAACGGAACTAAATATTACATAACGTTGTTCTTATGAATATTATGAAACAGGACATGAACATAGAGTCGAAAAATGGGTAATAAGAGCTGTGGATGATGTGAAGAAATACATAGTCTATGAAGCTGTTGAAGGAGAGGCTCTAAAGCAATTCAAAGTGCTTAGAGCAAAGGTTGAAGCTGTTCATGGAGGATCAACCAAAGTAGGAGGAGGAAACTTTACAAAATTGACTATTGAGTTTGAAAAGGCAAATGAAAATGTGGCCTCGCCTGAAATCTACTTGGAATTGTTCGTCAAAATCGCAAAAGGGGTGGATGCTTATTTCTCCAACAACTAAAATCCTACAACAATCGAGGGAGTGATCTTAAGTTTGGCTGTGTTTTCTACTTGGTTTGTTCTGAATAAGAAATTATGTGTGTAATGTGTTTAAATATGGGTTATAAGGATGCATGTGGGGCTATGGAGTATCTATTATATGCAAGTCTACTTATGTGTGTGTGTGTAAGATCTCAAATGTGATATCTAATGTGGGTTTATTGGTCTAATGAATAATAACTCATGGAAATGTTTTATGTTATAAATCCAAGACCAAATTATGCCTAATTTTGTCGTTTTTAGAATTGAGTGTTATCAAATATTTAAAGTTTGGTTTAAATTTGGACTGTGAAAATTTAATGAGAGATTTTCATTGATGAGTAGGGTAAAGATCTTATGTGGATATGCCTTTCATTTAAATGGGGTATTTGGGACGTGCCCTTCATTTAAACGGAGTATTTGGGATGAAAAGGACATAAATTTCTCTCTATTAGCTAGACCGAAGATGGAAATTATATTCTCCATTCCTATCTCTACCTCGACCTTCCCCAATTCCTACCTCGTCTCACGTTTTTATATATAAATATAAATAAAGATACAAAACAAACTTATGTTCTTTGCTTAGTAAATAAACAAATAAATGTATATTTTTAAACATATAGTATTTGAGTTTTTGAAATTATATTTTTGCTCATGTATCATTATTGGATTTACAAGAATATTTTGCAATTTTGCATTTTTCAATAGAAATTATTTATGATGAAAAAAAAATCTATGGGAAACCGATTCCCTTTAACTCGTCCTTGAAATCTTATCTTGATCCCAGCAAAAATAAACGGAAAATTTTGTACCCAAAGTGATCGAAATAGGTGGCAGATGATTATTATTATTATTATTATTATTATTATTTTCTTTTTTTTGTCCTCGGCCTCCGTAGACATCTTGGATCCAACTTATTTAAAAAATGCTTACAACGACTTGTTAATGGAATAGAATCAAATTTAAGATTTTCGCAGGAGCCAAAAGAGAGAGGAGAACCTCGTTCAAGGATGTACGCCACATTATTTATAATTCCATTTATGTAAATTGACGTGACAACTAATATTTAAAAATTGATTAGGTAATGAATTTAAACAATATATTTTAAGATCAAATAAATGGTAATAATAATCTTAGGCATCTCATTTTTATATATCTGTCACAATATTTTTTATCCTTAATTTTGGTAGAAAAGGGATTATCGAACAATTTCAAAGCTACGTTGTTCATGGGTGTTTTTTTAAGAACAAATTTCAAAGGCAAAGGTAAATTTTTTAACACAACCTAATTTTAAATTTTAGGTTTACGTATCAATAAAAACCATATTCTAAAAGCATACGGTTGTATGAAATCACTAAAAGATCTCGTAGACCTCGAATTAAGGTTGCTCTCGAATTATCTTTAGAATATCTCATTAAACTAAGTATGGATGACTCTGAGTTATAGATGACAATATATTTCTTTAGAAATTATATGATCATTATGTGACGATCTAAATGAACTCGAAGATGACGTGTTCATTGATAATCTTTGAGATTTATGATTTTTTTTTTCTAGAAAGATGATATTAAATTTTTGAAGGCAAACTGATTAGGTACATCAAATATTTTCATTATTATTATTATTATTATTATTTTTTTTTTTTTTTGTGAATAATGAAGTCTATATGATCTTTATTTTTTCGATCATATAGACTTTCACAGCCCCACATACGTCAATTGTCGATGCATTTTATCTTGAAAATTTTCATGTGATTGTCACGGTCTCAACGGTAAATCTAGTGAGACCGAAAATAGACTTTATATGATCTTTATTTTTTTCAATCATGCATCTTTGACAAAGCACTCTCATCCCCATAGATGACACCTATTGATACCGAAATCTTGTCAGTAAAATTTCTCAACAGTAAATTTGGTGAGATAGAGAAGAAGCTCTATCTAATCTATATTTTTCTTGGGTTCCTAACCTTTTTCTGAACAGTTTTCTAATCATTTTTCTGTTGGGTGTAATCTTGAAATTCTGGATCTAAGGTTTTCAAGAAAATCTCAAAAGTTAGAACTTTAAAGAAAAAAATTGTAATATTTTTCGTTGGAACTTGGATGTTACTCAACCTCACCTTTTCGCCTCTACTATAATGATGGCGAACAAATAGGACTTTCTTGAAGCGTTGCAGTTTATTCATATGAAGTTATAAGGATGTATTAATAAGAAGAACAACCTAGGGTTTCTCTCTTCGTCAATTATTATTATTATATCTCGTAAAGAAAAACTAGATATTTTAATTAGCCATTGAAGTGTAAGATCACTATGAAAAAATAACTCCAAACATATATTATTTTACTACTTTCTAAATTGAGTAAGTGGAACCAACAAACAAACATTACTTTACACATCTTGCTTTATGTAACGGATAACAACTCTAATACTTAATTTAAAATAAAGTTAGTGGGAAAAATAAAGGATACCGATGTGTTCATCATCAACTCATCACAAGAATGCTTAAAAAATGATAAAAAGTTGGTTCAAATCATTCCTAAAATATTCACCAAATTCACCATAAAATCTCTCTTAGCTTAGAAAATGACCCAAATTGATAGCAAGTATTAATATGATTAAATTTATTAATAAGTAATCGATTCATAATTTAATTGATATAAGAGGAGAAGATTTTGTGTTCGAATTCCTAAAACGTCAAAATTAATAGAATTAGGTGATTATGTCTAGTGGGTTACTTTTATATTTTAATGTTTTTATAGACTAGCCTCCCGTTGAGTTGAGCACTCTTTTTTTTTTGTGCATCTTTCTTGATTTTCGATTTCTCTTTGAGCTAAAGGTTTAAGAATAAGAGAAGAAAGACAAAGGACTGAGTGAAATTGGGATTACCCATTAGCAATTTAGGTGTTTATGGTAAGGTAGAGTTGGGCTGTAACCCTATTTTTAGGTTGGTTAAGTTGACCAAACTAAAACAATGTCAACTGGCAAGCTGATCCAAATGTTGGATTCTTAAAAAATTCAACCCAACCCTAATGAAAAAAAATTAATCCTACCCAACCTGTAAGATGAGATTAGTGAATAATTCTCATACTTATATTATTCTTAAAAATAGTTTGAGAAACACAATATCATTCTTAAAAATGGTTTGAAAAGACAAATATTTTTGTAAATTTTTGTTTAAATAATTGTCTTTTACAACTTAATCTGTTGTTTTGAGTGGATAATTAATAATGTATGCTTAATTTAGTGCAATAAAGGGATTATTGATGGAAGCTGTTAGCCTAATGGTAAGTTATAAGGTAAGTATTTAATATCTAATTGCATTGTGTAGTGGATGTACCACTCAAAACAGAATCAAAAGTTCTTTAATTTCTCTTAAAATATTGGTAACTTGCAAATAGTGCAAACAAAATCCAATTAATTGCAAGTATATTGGCACTAACATTATGTAATTGTAAATAATATACTAATTTTAAAATCTTTTCCTTCTATGCTATCTTCTAATACTCGTTTACGTCATTCGTGATGCATATAGATGTTCATGAAGTTTGAGTAGAATGGAGATGTCTTCCTCCATCCCACCACCTATTTTCATGTTCATCATATCAATTTTTATTTATTTATTTTTAAAGGAATTGAGGTGAAAATTTACCGTGGAGAATTTTCGGGGCTCGAAATCCGTGTAGAAAAATTTTCTTCATTTCATATATCTATCTATACATATATTTATTTATATTTGTATATAAAAAATGAGGCAAGACGAGTCGGGTGGAGATTTTGACAAGGTTGGAGAGGAGAACAACGATGAGAAATGTAATCTCCATTCCCGACCCTATTTAGTTAATGGAGATAAATCTATATTTACTCCTTAATTGTTTTCTGTTTGAACAGATGTTTTTCACAATTTTGAAGGGGTAAAATAGATATCTTGATAGATTTTTAACATGCTTATAATATTTATTGTAACAGCTCAAGTCCACCAATAACAGATATTGTTCGTTTTGACCCGTTACGTATCATCATCATTAGTCTCCCGATTTTAAAACGCGTCTACTAGGGAGAGATTTGCTTATAAGGAACATTTCATTTCTCTCTCCAATCGATATGGGATCTCACATTTATGCTATTAGTCTATCAAATGTTATTGCTGCTATGATCTAGTTTGGACACAAACCCTCGCAACTCTTCTTTTAGTTTCACGAGAAAGATGTCATATCGATAAAGATAATTCATCCATTTTTCTAATTAATATGAGACTTTATTTGCACTCCTAACCTAACTTTTTGGATTATTGATGAATGGCGATTAAAATAATTCCTAAAATATTCACCCAACTTGGCTATAAAATCCCCCTCTTAGCTTCAAGTTTGCATCACATCAAAAGAAAATGGTCCAAACTGATAGCATTTGGGTAAAGGTTGACCTAAAATCTTCTCCAGAGAAGGTCTATGGGTTCTTCAGGAACCATTTGGGAGATTTGGTGGACTTGTTTCCTGAGACTTACCAGAGCATTCAACTTGTGGAAGGACAACATTTCTCAAGTGGCAGTGTTGTTCAATTTAAATTCCAATTTGGTGAGAGATGTGGTTCCCATTATATTAGGTTTAGATAGGTTGAAACGTTCTATTTTAGTCATTTAACTTTGAGATACAATGATATAATCATCTAATTACGTTCTTGCTAACTACTATTTTATTATTCAATATTTTTTATAGAAAATTCTAATCTCGACTTACTTATGTAACTTTAATATAAGGGTCGATATAATTATTAATTTCAAACTAAGAAAATTAAAATAAACATGTTAAATGAAGATTGAAATTGCTCATTTTTGTTAATTATTAGGGTTTATTGTGAAGTATTGCTATTTGATTATGTGTTTTTCATTCTTCTTGGTGTTTCGATTTTTCTTTTCTTATGAATATAATCAAACAGGAGATGAACTTAGAGCAGAAAAATGGGCAATAAGAGTTGTGGATGATGTGAAGAAATACATAATCTATGAAGCTGTTGAAGGAGATCCACTAAAGGAATTCAAAGTGCTAAGAGCAAAATTTGAAGTTGTTAATGGAGGATTAAGCAAAGTGAGAAGAGGAAACTTTACAAAATGGACTGTTGAGTTTGAAAAGGCAAATCAAAATGTGGCCTCACCACAAAACTACTTGGAGTTGTTCGTCAAAATCTCAAAAGGGGTGGATGCTTATTTCTCCAACAACTAAAATCCTACAATAATGAAGGGAGTGATCTTAAGTTTGGCTGTGTTTTCTACTTGGTTTGTTCTAAATAAGAAATTATGTGTGTAATGTGTTTAAATATGGGTTATAAGGATGCATGTGAGGCTATGGAGTATCTATTATATGTAAGTCTACTTATGTGTGTGTGTGTAAGATCTCAAATGTGATATCTAATGTGGGTTTATGGTCCGATGAATCATAACTCATGGAAATGTTTTATGTTATAAATTTAAGACCAAATTATGCCTAATTTTGTCATCTTTATTACATTGTAAATAAAGGGAGGATTTGGTATTTGGAGACTCAATTAGAGATATCCATTTAATCGGTGGGGTTAAAGTCCTATATGGACTTGACCTTCATTTAAATGAGATATTTGGGAGGGAGTGGATAGAGATTTCTCTCGATTAGCTAAAAACAAGAATTACATTCTCCATCTCTGTAACACCCCAAGCCCAACACGTACAATTCACTCTATCCTTGTGACTAGCACCTTCTTATCCATTGTGAGTTTCTAGATTTTATTGGAAGACTTAGAGTGTGATTTTCATATCGGTCTAGAAGAAAGAAATGCTCGGGATGCATCAAGGAACTTAGAAAAAAATTTAAAAAAAAAATGGTAAAGTAATCGAAATTTGGGTCAAAGTCAATCGGGGGCAAAAAGGTAATTTTGCCTCACACATACCACCTAGGGGGTCTGGACAGACCTGGAAGAAAGGAAAAGGAAAAGAAAAAAAAGAGAGAGAAAGATAAATGAACAGTTTAGGTTTTTTTAAGAAGAGGAAGGAGGACGGAGAAACGCCTCAAACCTAAACCGAAGCACCCTCAATCTTAGGTCTCCTACTAGAGCTACTCAGAGTATCAAAAGAAGTGGTCTGAGTGTGGATCGTTCGTTTCCAGCAAGAAACGGGCAAGAACACGAGGTAAGTCTACGACCTCTCCTGTTTCAAGATTTCACCATAGATTTGGGGGCTTCACAAAGATTTAGACTTTAATAATCTAGTTGGGGGAGGAGTATTACAAGTTTTTGAGAGGAAATGCCGCCAAAAATAGCATTGAAAACAAAATGGGACACGTGTCAAGCGTCAATTGGAGAAGAAAATCAAAAAATATTTCGTGTCCGGCTGGATCGAAGGCCGGTGGCCCGCTACTCGAGTCCACGACCCGATGACCAGCGCTCAACCCGGAATACCAACCCGGCTCCTCGTTACCACGCGTCACGTAAAAGAATTCAGATTTCCAGCGCGTCTTTCCGGTTTCGAACCGACGACTCGTCCACGGGTGACCGTTCATCGGGCACCACGTGGCAATACCTCAGCGCGCGTCTTCCCTCCCTCATGCGCGTCCGTACGCGCGCGTGTGAACGCGTGCATGCGCGTGAGTTGCTGTTTTTAGCCTCAGACTTCTGTTTGACCGATTTTCGTTCGGATTCGAGTGCTAATCCTATTTTACTGTGAACTAGGTCTCTGTATAGATAATTCATAACTTTTGGATTCTGTTACGACTGTAATGAAGCTCGGGAAGGCACGCAACATTTTGGAAGTAACCATACGAACTTAGGAAGTAATAGTTGTAAGGTGCGCTTGATTTTGCATGCATGCTAGAGCAAATTACCCGAAGGTGGCTAGATCTAATGATCCTCTATACTTAGAGAATTAAATATATATCTATTGGACTTGCTGCCCTAGTTAGATTTTGTACTCGTTGTTTGTATGCTTGCTGGACTATCTTACTACTACTTCTATTTGTTCATACTATGTTGGCAAACGGGGGAATTTGTGTATCTGTGCTCTGGGTGTTGATAGATGTTACCGCCCCGTTGAGCTTGTATGATTGTATGCGTTGTTAATCATACATAGACTCCTCTTGAATGATTGTGTGAACTTATGATATGATTGATAAGTTGTAGTGTTGTGATATGCCTAGGGTGAACTCTCGTAGGGATCGAGGCGCTCTGGGTGTTGATGGATGTTACCGTCCCGTTGAGCTTGTATGATTGTATGCATTGTTAATCATACATAAATTCCTCTTGAATGATTGTGTGAACTTATGATATGATTGATAAGTTGTAGTGTTATGAAATGCCTAGGGTGAACTCTTGTAGGGATTCAGTTGGGTTTAGTTAGGGGACCCTCCTGATGTTACGGTCTATGGTGTCGTGAGCGACGCTTAGAAGGAGAGACTCTTTAATCCTGGACTGTTATGGACTTGGGAGACTTTATTTAGTGAATCTGGAAGCCAGGGCCCCTTACTCAATAGTTTTAACTACCCTTGAGATATAGAGAACCACATTAGGACCCCGAGGTGTAGAGAGATGACCTCTTAGCCAACAGTATCTGGTATAGAATTCGAGTATACTCTGCTTCCCTTCCTACTCCAGTGTGGTACTGTTGAGATAGGCGAGATGCCAGTTAGGGAGACGTTACCCTCCCCTTCCTACTCCCGTGTGGTACCGTTGAGCTAGGCGAGACGCCAGTTAGGGAGATGTTACTCTTCCCTTCCTACTTCAGTGTGGTACCGTTGAGCTAGACGAGACGCCAATTAGGGAGGTGTTACTCTTCCCTTCATACTCCAGTGTGGTACTGTTGAGCTTGGCGAGACGCCTGTTAGGGAAGTGTTACTTAGTGGTAGATCGACTACTCTGCATGGGGATCCGTCTCATGGAAAGATGTGATCACGCGAGTGGGTCAAGGTCACTTGCATGATGGGTCCGTCCTATGGAAGGATGTTGACCATGCAGTGACTAATTCTAGCTGGGATGTACGAGTGGTACATGAAGATATGAGGGAGCACCTCATCCTCAAACGTTGAATACCGCTTTAAAAGTCAAGTCCTTCTGCCCTGAACTGTCTACTGAGAGTACTGAGACTCTAGGGTCTATAGTATAGTCGGTGGTTCTCTCCTTAACTTAGATGCGATTAAGAGCGTACAAGTAGGTTGCTTACTGAGTATATTTTTATACTCACCCTTGTTTTCCCTTCTTTTTTTTCAGGTACTAGTCGAGGTGGGGAGGTGTTCGAGAGCTGCATTATCACAGAGGGTTGTCGTTTCGGAGCGTCAGTCCCTTGTCGGTCTTTTAGAATCTTTTGTATTTTTCTTTCATAATTGTATTTCTCGTCTGTAATATTTGACTTTAATTTTGATGTCATTGACTCTTTTTTCTTGTTTTAAATTTTATTAAATTTTCATCTCCTATTTTATCTTCTTTTTAATTTCATTTGTTTTCACTTTCATCTTTTTAGTCACGATTCAATTTTTCGAAGTCTCGAAAATTGGGTCGTGACAGTCTCCGCCCCCGACTTTGTCTAAATTCTTATGCAATTGGAGTCATGAGTCGGTACATGCAAAATCCAAAGAAACTTCATTTGGATGCGGCTCGATGGATTTTGATATAATCGACTATGATATTTTGTACAAAAGAAGCGAAAACAGCAAGCTAATTGGATACTGTGATGCTGGCTATGCAGTAGACCATGATATCCAAAAATCAACCACTAGGTCTATGTTCAAGTTTGGTTGAGGAACGATTTCTTGGTGCAGTAAAAGACAACCAATAATATCATTTTCAACTACATAAACAAAGTACAGAACAAACGGCTGGAGCAGCTCAGAAAAGTACGAAACTCTTGGTGAAAGATTTGCACTAGAAAATTGACTATCTAATACCACTTTATTGCGTCAACTTATTTGCGATTCGTCTAACAGAAAATCCGGCATCATTTCGTGCTAGTGGTACGCTACCATTATCCTTAATCTCAAATAAGAAACTTGAATTCATCCATCATCACTCATTCTATCGCATAATCAACTAATATATATATATATTAAGCCGTGAGATTGTTAAGGATATTTAGTAATAAATTAGAGTTTACTATATATATTATATTTGATATTTTATTATAAGGCAAATATTATATATTTGCGCACCTTTCTATTTATTTCTATATCCTTATAATTTATTTGATTATAAATAAGATAACTTTCACACCATTTATTAAGTGTAGTGGATTAATCAAACATTCTCATGGTATCAGAGACGTTTCGGTTTAACAACCTTGATCCTTTCTTTTCAATGGCTTTTGAAATCTTCTACTATTCTTCTCCACCACCACTGCTGGCTTCGACGCCACTGGGGCAGTCGCCATGGCTGCCACATGCAGAGCCGATTGATTACTCTTCGGCTCCACCCTTTTGACCTCCAAATCACCCTTCGCCATACTTTGTACATACAACCTTACCTTCAACACAATCGATTTTCCCTCGACCAATATAGTCTCTTTCATTCTCTTCTCAATCTTCACTTCATCCTTTACATAAACCCTAACCTAAACCTATCAACATTCTGCAGAGATTTACCTTTCGAATTTGGGTGCCGCACTGGCCTCTTCGCCAACCAAGTCGCCGCCACCCTTCTAGCCAAAATTGGCTCCTTCGCCCACTATGGCAACTCTTCTGGCTTCAACATTGTCTTTATCCCTTGCTCTAGGTTTTCCCTTCTAATTTTTGGGTTGACATATTCTCACCAGAGCCAAGGCAGCATCGTCGGTAAGCGACGATCCCTCTTCATGGGTTAGACATATTCTCGCCGAAGTCAAGGCAGCATCGTCGCTGAGTGACATTCCCTCTGCAGTTTTCGATGGCCAAGAAAGTGTTTCTCTACGCCTCCTCTGAACCAGGTTGTTGACATCTTCACGAAAAGTGTTTCTCAACCTCTTTGAATTTTTCAGATTCAAGCTTCACGTTCGTTTAAATCCGACGTTTAGCTGCGGTGGGGTGTTAAGGGTATTTAATAATAAATTATAGTTTACCATATATATTATATTTGATATTTTATTATAAGTCAAATATTATATATTTGTTTTATTACCATAGGCATGTTTCTATTTATTGTTATTTCCATTTATTACTATTTCTATATCCTTGTAATTTATTTGATTATAAATAAAATAACTTTCACACTATTTAAGTGGGGTGGATTAATCAAACATTCTCAGAGATTATACTTTCAAAATATCATCATATAATTTTATAATAGCAACGAAATATAGCTTATGGAGCGAATGAGAATAAACTTTATTCTTCGTATAGAGATGATATAATTTATTTAAAGGATTTCATAACAAAATTGGTCCAATAGTTTAACAAGTTTGAGTTTTAGTCCTTTGAGTTTATTTCAATGGGGGCTTATAGTCTTAAGAATTCCATTATAGAAAGACATAACGTGAAACGGAGTAAAAGAGAGGTACATGAATGAATTGAAAACGATTCTTGAAGATATGAATACTAATAAATACTTAAAATAATTGAAGATTACTATTTATACTAATAAAGTCTACTATTCTCTTTACTGACTCGACTATAAGAACTCCAAAATTAAGTATGATTTATTTGAAACAATTCTATATTTGGTAACTTTTTGAAAATTTTCTTAAGAATAGTGGAGTGAATATAAAATAGGTTGAAAAGATCTATGTTAATACGTTGAGATAGTCTTTACTTTTAGAAGGAAGTGAAGAGTAAGTAGATCATAACATAGATAATACCGGGAAACGCCGAAACTATCAGGTACCAAACCCGAATTTCGACATTAAAAAAAAAAATTGAAACTTAAATTATAAGGACTAAATTTCGGGTTAAACTACCCGTCTTCGAGATGAATAATGAAAGTAGTGGGATTGGTGGCTTTTGAGTCACCATAATATAATATTGTGATAAAAATGAGAACACAATCGATGAAAATTATATATATCAATTGTGTATAGTTTGATTAATTGAATTATATGGTCAAGTTAGTTAGTAGACCTAAAATATATGTTTAAAGTAGACAATACTAAACTAGATATGTTCAAAATTTTAAACTCCATAATTCTCGAGAAAAAAAAAAAAAAAAAACCAAGATCGCAATAGAGATGTAATCAAATAAGAATCTAGTAGTCCAATGTTAGAGAAATGGTATCGCTCATGGCATTGTTATCCACTGCAAATGGGTGAACCCGTAAACTAGTCTTGGAAGAGACTTGAGACTGTAAGGGTAGGTGTCAAGATGAGAATTTTTATTGAGCGTAGAAACGGTCGAGATTGACGTCCCGATTCTGTTAATCAGTCTTGGAAAAGACTTGGGGCTACATAGGTAGTTGGCACATGATGAACCGTTTGGATCTTTGTGTCCTACATAGAATCTCAATTGGTTACAAGAATAGGCCCAAACTTACTAGAGATAGAGAGATGGAAACTTATTAAGAAATGGATGTGAACAGAGCCAACAAGGAGGTTGCTTACTATGTATTTATATACTCACCTTTTTCTCTTTTTCTTTTTCAAGTATCGGTCGAGGTAAATGAGTGTTCGAAAGCTGCGCGGTCACAGAAGAGTCTTGTACTTCGTTTTATAACTGTATCCCAAACTCATTTCCCGTTTTTAATAAATTTGAGCAACATCGTCTTATAATTGTACGTAATTTTGTACTTTAACATGGTGTTGTTCGATTTTATTTTGATTTTAGTTTTACTGCATTTTCTTATTTTATTTTCTTTTACTTTTTGTTTTTATTTGTTTTCACTATCATCTTTTATAGTCTCGGTTAGGATTTTGAAGATTTAAAATTTGAGTTGTGACATATAGTTGATGGAGAATTCAAATAAAAGTGTTACTCGACTCGCATAATTTGTGAAAGATTATCGATAGAGGATATACTGAAGTGAAAGAAAACTGCAGAAAAAGACGAGAAAACTTTATTCTTCCTTTATCAAGTGTTGATGAAAGAATATTCTAAAGTAGGGGTTGATTGATGATAACTAGTGATAGTTGCCCAAAAAAAATGTGTGGAGACCTGTTAGGAACTGGATGCTATTCGAGTGTACAAATAGATTGTTTATGGAATATTTTTATATTCATCCCTTTATCTTTTCTTTTTCAGATACTAGTCGATGTGGAGAGTGTTCAAAAGTTGTGTGGCTATAATGTAGGTTGTTTCCGAGCATCAATTCATTTTTTTTATTTGAATCTTTTGTATTTTGATTTATAAGGTTACC

mRNA sequence

CTTTGAATCAAATCAAAACAAAATGAGCCAAATTGAAAGCATTTGGGGAAAGGTTCAGCTAAAATCGTCTCCTGAGAAGTTCTTTGGCTTCTTCAGGAACCATATGGGCGATTTGGTCCATATGTTCCCTGACCACTTCCAGAGCTTCCACTTTGTCGAAGGACAAAACTTCGACGATGGCAGCGTCGTGCACTGGAAATACCACCTCGGAATTCCAGAAGCAGTAAAGATAAAGATGAAGAATAGGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCATTAAAGCATTACAAAGTTTTCAGAGCCAAACTTGAAACTGTTAGTGGAGGGTTAAACAAAGTGGGAGGAAGCTTTGCAAAATGGACAATTGAGTATGAAAAGGCTCATGAAAACGTTCCTTCACCAGAAACCTACATGGAATTGGCTCTCAAACTGAAATCGTCTCCTGAGAAGTTCTATGGGTTCTTCAGGAACCATATGGGCGAATTGGTCCATATGTTCCCTGATCACTTCCAGAGCTTCCACTTTTTGGAAGGACAAAACTTCGACGATGGGAGTGTTGTGCAATGGAAATACCACCTTGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAACTGAAATCGTCTCCTGAGAAGTTCTATGGGTTCTTCAGGAACCATATGGGCGAATTGGTCCATATGTTCCCTGATCACTTCCAGAGCTTCCACTTTTTGGAAGGACAAAACTTCGACGATGGGAGTGTTGTGCAATGGAAATACCACCTTGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAGTTACCAAAGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAAAAATGGGCAAAAGTGATAGCATTTGGGCAAAGATTGACCTAAAATCTTCTCCTGAGAAGTTCTATGGGTTCTTTAGGAACCATTTGGGAGATTTGGTGGATTTGTTTCCTGAGAATTACAAGAGCATTCAACTTGTGGAAGGACAACATTTCTCCGGTGGCAATGTTGTTCTATTTAAATTCCAATTTGGATTTGGTCATCAACTTCGAGTCGAAAAATGGGCAATAAGAGCTGTGGATGATGTGAAGAAATACATAATCTATGAAGCTGTTGAAGGAGATGTTCTAAAGCAATTCAAAGTGCTTAGAGTAAAAGTTGAAGCTGTTCATGGAGGATCAACCAAAGTAGGAGGAGGAAACTTTACTAAATGGACTGTTGAGTTTGAAAAGGCAAATCAAAATGTGGCCTCACCACAAAACTACTTGGAGTTGTTCGTCAAAATCTCAAAAGGGACAATGGGCAAAAGTGATAGCATTTGGGCAAAGGTTGACCTAAAATCTTCCCATGAGAAGTTCTATGGGTTCTTCAGGAACCATTTGGGAGATTTGGTGGACTTGTTTCCTGAGAATTTCAATAGCATTCAACTTGTGGAAGGACAACATTTCGATAGGGGCAGTCTTGTTCTAAGACATGAACATAGAGTCGAAAAATGGGTAATAAGAGCTGTGGATGATGTGAAGAAATACATAGTCTATGAAGCTGTTGAAGGAGAGGCTCTAAAGCAATTCAAAGTGCTTAGAGCAAAGGTTGAAGCTGTTCATGGAGGATCAACCAAAGTAGGAGGAGGAAACTTTACAAAATTGACTATTGAGTTTGAAAAGGCAAATGAAAATGTGGCCTCGCCTGAAATCTACTTGGAATTGTTCGTCAAAATCGCAAAAGGGAAAATGGTCCAAACTGATAGCATTTGGGTAAAGGTTGACCTAAAATCTTCTCCAGAGAAGGTCTATGGGTTCTTCAGGAACCATTTGGGAGATTTGGTGGACTTGTTTCCTGAGACTTACCAGAGCATTCAACTTGTGGAAGGACAACATTTCTCAAGTGGCAGTGTTGTTCAATTTAAATTCCAATTTGGAGATGAACTTAGAGCAGAAAAATGGGCAATAAGAGTTGTGGATGATGTGAAGAAATACATAATCTATGAAGCTGTTGAAGGAGATCCACTAAAGGAATTCAAAGTGCTAAGAGCAAAATTTGAAGTTGTTAATGGAGGATTAAGCAAAGTGAGAAGAGGAAACTTTACAAAATGGACTGTTGAGTTTGAAAAGGCAAATCAAAATGTGGCCTCACCACAAAACTACTTGGAGTTGTTCGTCAAAATCTCAAAAGGGATACTAGTCGATGTGGAGAGTGTTCAAAAGTTGTGTGGCTATAATGTTACC

Coding sequence (CDS)

ATGAGCCAAATTGAAAGCATTTGGGGAAAGGTTCAGCTAAAATCGTCTCCTGAGAAGTTCTTTGGCTTCTTCAGGAACCATATGGGCGATTTGGTCCATATGTTCCCTGACCACTTCCAGAGCTTCCACTTTGTCGAAGGACAAAACTTCGACGATGGCAGCGTCGTGCACTGGAAATACCACCTCGGAATTCCAGAAGCAGTAAAGATAAAGATGAAGAATAGGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCATTAAAGCATTACAAAGTTTTCAGAGCCAAACTTGAAACTGTTAGTGGAGGGTTAAACAAAGTGGGAGGAAGCTTTGCAAAATGGACAATTGAGTATGAAAAGGCTCATGAAAACGTTCCTTCACCAGAAACCTACATGGAATTGGCTCTCAAACTGAAATCGTCTCCTGAGAAGTTCTATGGGTTCTTCAGGAACCATATGGGCGAATTGGTCCATATGTTCCCTGATCACTTCCAGAGCTTCCACTTTTTGGAAGGACAAAACTTCGACGATGGGAGTGTTGTGCAATGGAAATACCACCTTGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAACTGAAATCGTCTCCTGAGAAGTTCTATGGGTTCTTCAGGAACCATATGGGCGAATTGGTCCATATGTTCCCTGATCACTTCCAGAGCTTCCACTTTTTGGAAGGACAAAACTTCGACGATGGGAGTGTTGTGCAATGGAAATACCACCTTGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAGTTACCAAAGGATTTCCAGAAGCAGCAAAGGTACGGATGAGAGTTATGGATGAAGCAAGGACCATAATTTATGAAGTTGTTGAAGGAGATGCACTAAAGCATTACAAAGCTTTCAGAGTAAAACTTGAAACTGTTAGTGGAGACTTAAACAAAGTGGGAGCAAACTTTGCAAAATGGACAATTGAGTATGAAAAGGCACATCAAAACGTAGCTTCACCAGAAACCTACCTGGAATTGGCTCTCCAAAAAATGGGCAAAAGTGATAGCATTTGGGCAAAGATTGACCTAAAATCTTCTCCTGAGAAGTTCTATGGGTTCTTTAGGAACCATTTGGGAGATTTGGTGGATTTGTTTCCTGAGAATTACAAGAGCATTCAACTTGTGGAAGGACAACATTTCTCCGGTGGCAATGTTGTTCTATTTAAATTCCAATTTGGATTTGGTCATCAACTTCGAGTCGAAAAATGGGCAATAAGAGCTGTGGATGATGTGAAGAAATACATAATCTATGAAGCTGTTGAAGGAGATGTTCTAAAGCAATTCAAAGTGCTTAGAGTAAAAGTTGAAGCTGTTCATGGAGGATCAACCAAAGTAGGAGGAGGAAACTTTACTAAATGGACTGTTGAGTTTGAAAAGGCAAATCAAAATGTGGCCTCACCACAAAACTACTTGGAGTTGTTCGTCAAAATCTCAAAAGGGACAATGGGCAAAAGTGATAGCATTTGGGCAAAGGTTGACCTAAAATCTTCCCATGAGAAGTTCTATGGGTTCTTCAGGAACCATTTGGGAGATTTGGTGGACTTGTTTCCTGAGAATTTCAATAGCATTCAACTTGTGGAAGGACAACATTTCGATAGGGGCAGTCTTGTTCTAAGACATGAACATAGAGTCGAAAAATGGGTAATAAGAGCTGTGGATGATGTGAAGAAATACATAGTCTATGAAGCTGTTGAAGGAGAGGCTCTAAAGCAATTCAAAGTGCTTAGAGCAAAGGTTGAAGCTGTTCATGGAGGATCAACCAAAGTAGGAGGAGGAAACTTTACAAAATTGACTATTGAGTTTGAAAAGGCAAATGAAAATGTGGCCTCGCCTGAAATCTACTTGGAATTGTTCGTCAAAATCGCAAAAGGGAAAATGGTCCAAACTGATAGCATTTGGGTAAAGGTTGACCTAAAATCTTCTCCAGAGAAGGTCTATGGGTTCTTCAGGAACCATTTGGGAGATTTGGTGGACTTGTTTCCTGAGACTTACCAGAGCATTCAACTTGTGGAAGGACAACATTTCTCAAGTGGCAGTGTTGTTCAATTTAAATTCCAATTTGGAGATGAACTTAGAGCAGAAAAATGGGCAATAAGAGTTGTGGATGATGTGAAGAAATACATAATCTATGAAGCTGTTGAAGGAGATCCACTAAAGGAATTCAAAGTGCTAAGAGCAAAATTTGAAGTTGTTAATGGAGGATTAAGCAAAGTGAGAAGAGGAAACTTTACAAAATGGACTGTTGAGTTTGAAAAGGCAAATCAAAATGTGGCCTCACCACAAAACTACTTGGAGTTGTTCGTCAAAATCTCAAAAGGGATACTAGTCGATGTGGAGAGTGTTCAAAAGTTGTGTGGCTATAATGTTACC

Protein sequence

MSQIESIWGKVQLKSSPEKFFGFFRNHMGDLVHMFPDHFQSFHFVEGQNFDDGSVVHWKYHLGIPEAVKIKMKNRDEARTIIYEVVEGDALKHYKVFRAKLETVSGGLNKVGGSFAKWTIEYEKAHENVPSPETYMELALKLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKYHLGFPEAAKVRMRVMDEARTIIYEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEKAHQNVASPETYLELALQLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKYHLGFPEAAKVRMRVMDEARTIIYEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEKAHQNVASPETYLELALQVTKGFPEAAKVRMRVMDEARTIIYEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEKAHQNVASPETYLELALQKMGKSDSIWAKIDLKSSPEKFYGFFRNHLGDLVDLFPENYKSIQLVEGQHFSGGNVVLFKFQFGFGHQLRVEKWAIRAVDDVKKYIIYEAVEGDVLKQFKVLRVKVEAVHGGSTKVGGGNFTKWTVEFEKANQNVASPQNYLELFVKISKGTMGKSDSIWAKVDLKSSHEKFYGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVLRHEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTKLTIEFEKANENVASPEIYLELFVKIAKGKMVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKFQFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTKWTVEFEKANQNVASPQNYLELFVKISKGILVDVESVQKLCGYNVT
BLAST of Cp4.1LG03g05890 vs. Swiss-Prot
Match: MLP34_ARATH (MLP-like protein 34 OS=Arabidopsis thaliana GN=MLP34 PE=2 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 5.0e-27
Identity = 88/303 (29.04%), Postives = 152/303 (50.17%), Query Frame = 1

Query: 492 KIDLKSSPEKFYGFFRNHLGDLVDLFPENYKSIQLVEGQHFSGGNVVLFKFQFGFGHQLR 551
           ++++K+S  +F+  F      +    P N +S  L EG   + G++V + +      + +
Sbjct: 16  EVEIKASAGQFHHMFAGKPHHVSKASPGNIQSCDLHEGDWGTVGSIVFWNYVHD--GEAK 75

Query: 552 VEKWAIRAVDDVKKYIIYEAVEGDVLKQFK--VLRVKVEAVHGGSTKVGGGNFTKWTVEF 611
           V K  I AV+  K  I +  +EGD++K++K  ++ ++V   HGG      G+   W +E+
Sbjct: 76  VAKERIEAVEPEKNLITFRVIEGDLMKEYKSFLITIQVTPKHGGP-----GSIVHWHLEY 135

Query: 612 EKANQNVASPQNYLELFVKIS----------KGTMGKSDSIWAKVDLKSSHEKFYGFFRN 671
           EK +  VA P+  L+  V++S          +  +  ++++  +V++K+S EKF+  F  
Sbjct: 136 EKISDEVAHPETLLQFCVEVSQEIDEHLLSEEEEVKTTETLETEVEIKASAEKFHHMFAG 195

Query: 672 HLGDLVDLFPENFNSIQLVEGQHFDRGSLVLRH-----EHRVEKWVIRAVDDVKKYIVYE 731
               +    P N  S  L EG     GS+V  +     E +V K  I AVD  K  I + 
Sbjct: 196 KPHHVSKATPGNIQSCDLHEGDWGTVGSIVFWNYVHDGEAKVAKERIEAVDPEKNLITFR 255

Query: 732 AVEGEALKQFK--VLRAKVEAVHGGSTKVGGGNFTKLTIEFEKANENVASPEIYLELFVK 776
            +EG+ +K++K  V+  +V   HGGS  V   +F     E+EK NE VA PE  L+  V+
Sbjct: 256 VIEGDLMKEYKSFVITIQVTPKHGGSGSVVHWHF-----EYEKINEEVAHPETLLQFAVE 306

BLAST of Cp4.1LG03g05890 vs. Swiss-Prot
Match: MLP28_ARATH (MLP-like protein 28 OS=Arabidopsis thaliana GN=MLP28 PE=1 SV=1)

HSP 1 Score: 116.7 bits (291), Expect = 1.4e-24
Identity = 86/310 (27.74%), Postives = 143/310 (46.13%), Query Frame = 1

Query: 136 MELALKLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 195
           +E  +++K+S +KF+  F      +    P + Q     EG     GS+V W Y H G  
Sbjct: 26  LETDVEIKASADKFHHMFAGKPHHVSKASPGNIQGCDLHEGDWGTVGSIVFWNYVHDGEA 85

Query: 196 EAAKVRMRVMDEARTII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGA--NFAKWTIEY 255
           + AK R+  ++  + +I + V+EGD +K YK+F + ++       K G   +   W +EY
Sbjct: 86  KVAKERIEAVEPDKNLITFRVIEGDLMKEYKSFLLTIQVTP----KPGGPGSIVHWHLEY 145

Query: 256 EKAHQNVASPETYLELALQ---------------------------------LKSSPEKF 315
           EK  + VA PET L+  ++                                 +K+S EKF
Sbjct: 146 EKISEEVAHPETLLQFCVEVSKEIDEHLLAEEEEVKTPETPSLVGKLETDVEIKASAEKF 205

Query: 316 YGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFPEAAKVRMRVMDEAR 375
           +  F      +    P + Q     EG     GS+V W Y H    + AK R+  ++  +
Sbjct: 206 HHMFAGKPHHVSKASPGNIQGCDLHEGDWGQVGSIVFWNYVHDREAKVAKERIEAVEPNK 265

Query: 376 TII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEKAHQNVASPETYLEL 407
            +I + V++GD +K YK+F + ++ V+  L   G+    W +EYEK  + VA PET L+ 
Sbjct: 266 NLITFRVIDGDLMKEYKSFLLTIQ-VTPKLGGPGS-IVHWHLEYEKISEEVAHPETLLQF 325

BLAST of Cp4.1LG03g05890 vs. Swiss-Prot
Match: MLP31_ARATH (MLP-like protein 31 OS=Arabidopsis thaliana GN=MLP31 PE=2 SV=2)

HSP 1 Score: 75.1 bits (183), Expect = 4.6e-12
Identity = 47/146 (32.19%), Postives = 73/146 (50.00%), Query Frame = 1

Query: 265 LELALQLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 324
           LE  +++K+S  KF+  F      +    P   Q     EG     GS+V W Y H G  
Sbjct: 24  LETDIEIKASAGKFHHMFAGRPHHVSKATPGKIQGCELHEGDWGKVGSIVFWNYVHDGEA 83

Query: 325 EAAKVRMRVMDEARTII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGA--NFAKWTIEY 384
           + AK R+  ++  + +I + V+EGD LK YK+F + ++       K G   +   W +EY
Sbjct: 84  KVAKERIEAVEPEKNLITFRVIEGDLLKEYKSFVITIQVTP----KRGGPGSVVHWHVEY 143

Query: 385 EKAHQNVASPETYLELALQVTKGFPE 407
           EK    VA PET+L+  ++V+K   E
Sbjct: 144 EKIDDKVAHPETFLDFCVEVSKEIDE 165

BLAST of Cp4.1LG03g05890 vs. Swiss-Prot
Match: MLP43_ARATH (MLP-like protein 43 OS=Arabidopsis thaliana GN=MLP43 PE=1 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 1.0e-11
Identity = 46/144 (31.94%), Postives = 75/144 (52.08%), Query Frame = 1

Query: 265 LELALQLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 324
           LE  +++K+S +KF+  F      +    PD        EG     GS+V WKY H G  
Sbjct: 11  LETEVEIKASAKKFHHMFTERPHHVSKATPDKIHGCELHEGDWGKVGSIVIWKYVHDGKL 70

Query: 325 EAAKVRMRVMDEARTII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEK 384
              K ++  +D  + +I ++V+EGD +  YK+F   L+ V+    + G+  A W +EYEK
Sbjct: 71  TVGKNKIEAVDPEKNLITFKVLEGDLMNEYKSFAFTLQ-VTPKQGESGS-IAHWHLEYEK 130

Query: 385 AHQNVASPETYLELALQVTKGFPE 407
             + VA PET L+  ++++K   E
Sbjct: 131 ISEEVAHPETLLQFCVEISKEIDE 152

BLAST of Cp4.1LG03g05890 vs. Swiss-Prot
Match: ML165_ARATH (MLP-like protein 165 OS=Arabidopsis thaliana GN=MLP165 PE=2 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 2.5e-10
Identity = 45/148 (30.41%), Postives = 73/148 (49.32%), Query Frame = 1

Query: 782 DSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKFQFGD 841
           + I V VD+K+  +K + F R      V       +   L+EG+    GS++ +K  F  
Sbjct: 4   EEIEVDVDIKTRADKFHKFIRR--SQHVPKATHYIKGCDLLEGEWGKVGSILLWKLVFDG 63

Query: 842 ELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVV---NGGLSKVRRGNFTKW 901
           E R  K  I V+D+ K  I    +EG   KE+K      +V+   +GG      G+  KW
Sbjct: 64  EPRVSKDMIEVIDEEKNVIQLRVLEGPLKKEYKSFLKTMKVMSPKHGG-----PGSVVKW 123

Query: 902 TVEFEKANQNVASPQNYLELFVKISKGI 927
            +++E+ +QNV  P   L+ FV+++K I
Sbjct: 124 NMKYERIDQNVDHPNRLLQFFVEVTKEI 144

BLAST of Cp4.1LG03g05890 vs. TrEMBL
Match: N0DK19_CUCPE (Major latex-like protein OS=Cucurbita pepo subsp. pepo GN=MLP-GR1 PE=2 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 8.8e-79
Identity = 148/149 (99.33%), Postives = 149/149 (100.00%), Query Frame = 1

Query: 778 MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 837
           MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF
Sbjct: 1   MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 60

Query: 838 QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 897
           QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK
Sbjct: 61  QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 120

Query: 898 WTVEFEKANQNVASPQNYLELFVKISKGI 927
           WTVEFEKANQNVASPQNYLELFVKISKG+
Sbjct: 121 WTVEFEKANQNVASPQNYLELFVKISKGV 149

BLAST of Cp4.1LG03g05890 vs. TrEMBL
Match: N0DKL1_CUCPE (Major latex-like protein OS=Cucurbita pepo subsp. ovifera GN=MLP-PG1 PE=2 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 8.8e-79
Identity = 148/149 (99.33%), Postives = 149/149 (100.00%), Query Frame = 1

Query: 778 MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 837
           MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF
Sbjct: 1   MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 60

Query: 838 QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 897
           QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK
Sbjct: 61  QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 120

Query: 898 WTVEFEKANQNVASPQNYLELFVKISKGI 927
           WTVEFEKANQNVASPQNYLELFVKISKG+
Sbjct: 121 WTVEFEKANQNVASPQNYLELFVKISKGV 149

BLAST of Cp4.1LG03g05890 vs. TrEMBL
Match: N0DK07_CUCPE (Major latex-like protein OS=Cucurbita pepo subsp. pepo GN=MLP-GR3 PE=2 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 9.2e-68
Identity = 136/148 (91.89%), Postives = 138/148 (93.24%), Query Frame = 1

Query: 634 MGKSDSIWAKVDLKSSHEKFYGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVLR-- 693
           M ++DSIW KVDLKSS EK YGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVL   
Sbjct: 1   MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVLSKY 60

Query: 694 ---HEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTK 753
              HEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTK
Sbjct: 61  EFGHEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTK 120

Query: 754 LTIEFEKANENVASPEIYLELFVKIAKG 777
           LTIEFEKANENVASPEIYLELFVKIAKG
Sbjct: 121 LTIEFEKANENVASPEIYLELFVKIAKG 148

BLAST of Cp4.1LG03g05890 vs. TrEMBL
Match: A0A0A0LA34_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G342860 PE=4 SV=1)

HSP 1 Score: 230.7 bits (587), Expect = 7.3e-57
Identity = 108/156 (69.23%), Postives = 131/156 (83.97%), Query Frame = 1

Query: 1   MSQIESIWGKVQLKSSPEKFFGFFRNHMGDLVHMFPDHFQSFHFVEGQNFDDGSVVHWKY 60
           MSQ ESIW KV LKS PEKF+GFFRNHMGDLVHMFPD+FQSF F+EG++F  GSV+HW+Y
Sbjct: 1   MSQTESIWAKVPLKSPPEKFYGFFRNHMGDLVHMFPDNFQSFQFLEGESFTTGSVMHWQY 60

Query: 61  HLGIPEAVKIKMKNRDEA-RTIIYEVVEGDALKHYKVFRAKLETVSGGLNKVGGSFAKWT 120
           HLG P A KIKM+  D+  ++I+YE ++GD LKHYKVFRAKLE V+GGLNKVGG+FAKWT
Sbjct: 61  HLGSPAAAKIKMRVVDDVKKSIVYEFMDGDVLKHYKVFRAKLEAVNGGLNKVGGNFAKWT 120

Query: 121 IEYEKAHENVPSPETYMELALKLKSSPEKFYGFFRN 156
           IEY+KA+ENVPSPETYMELA+K+    + +   F+N
Sbjct: 121 IEYQKANENVPSPETYMELAVKVSKGLDAY--IFKN 154

BLAST of Cp4.1LG03g05890 vs. TrEMBL
Match: A0A0A0LAY7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G342350 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 4.7e-48
Identity = 95/150 (63.33%), Postives = 122/150 (81.33%), Query Frame = 1

Query: 483 MGKSDSIWAKIDLKSSPEKFYGFFRNHLGDLVDLFPENYKSIQLVEGQHFSGGNVVLFKF 542
           M +SDSI AK++LKS+ EKFYGFFRNH+ DL++LFP+ Y+ I LVEGQ+ S G+V+LFK+
Sbjct: 1   MSRSDSIVAKVELKSNIEKFYGFFRNHVEDLMNLFPDLYQGIDLVEGQYLSAGSVILFKY 60

Query: 543 QFGFGHQLRVEKWAIRAVDDVKKYIIYEAVEGDVLKQFKVLRVKVEAVHGGSTKVGGGNF 602
             G   Q+  EKW IRAVDD KK IIYEA+EGD+ K +KVLR K+E VHG S+K+G G+F
Sbjct: 61  HLG-ADQVVSEKWLIRAVDDAKKCIIYEAIEGDLQKYYKVLRAKLEVVHGRSSKIGRGSF 120

Query: 603 TKWTVEFEKANQNVASPQNYLELFVKISKG 633
            KWT+EFEKAN+NV SP +++E+FVKISKG
Sbjct: 121 AKWTIEFEKANENVPSPDSHMEIFVKISKG 149

BLAST of Cp4.1LG03g05890 vs. TAIR10
Match: AT1G70850.1 (AT1G70850.1 MLP-like protein 34)

HSP 1 Score: 124.8 bits (312), Expect = 2.8e-28
Identity = 88/303 (29.04%), Postives = 152/303 (50.17%), Query Frame = 1

Query: 492 KIDLKSSPEKFYGFFRNHLGDLVDLFPENYKSIQLVEGQHFSGGNVVLFKFQFGFGHQLR 551
           ++++K+S  +F+  F      +    P N +S  L EG   + G++V + +      + +
Sbjct: 16  EVEIKASAGQFHHMFAGKPHHVSKASPGNIQSCDLHEGDWGTVGSIVFWNYVHD--GEAK 75

Query: 552 VEKWAIRAVDDVKKYIIYEAVEGDVLKQFK--VLRVKVEAVHGGSTKVGGGNFTKWTVEF 611
           V K  I AV+  K  I +  +EGD++K++K  ++ ++V   HGG      G+   W +E+
Sbjct: 76  VAKERIEAVEPEKNLITFRVIEGDLMKEYKSFLITIQVTPKHGGP-----GSIVHWHLEY 135

Query: 612 EKANQNVASPQNYLELFVKIS----------KGTMGKSDSIWAKVDLKSSHEKFYGFFRN 671
           EK +  VA P+  L+  V++S          +  +  ++++  +V++K+S EKF+  F  
Sbjct: 136 EKISDEVAHPETLLQFCVEVSQEIDEHLLSEEEEVKTTETLETEVEIKASAEKFHHMFAG 195

Query: 672 HLGDLVDLFPENFNSIQLVEGQHFDRGSLVLRH-----EHRVEKWVIRAVDDVKKYIVYE 731
               +    P N  S  L EG     GS+V  +     E +V K  I AVD  K  I + 
Sbjct: 196 KPHHVSKATPGNIQSCDLHEGDWGTVGSIVFWNYVHDGEAKVAKERIEAVDPEKNLITFR 255

Query: 732 AVEGEALKQFK--VLRAKVEAVHGGSTKVGGGNFTKLTIEFEKANENVASPEIYLELFVK 776
            +EG+ +K++K  V+  +V   HGGS  V   +F     E+EK NE VA PE  L+  V+
Sbjct: 256 VIEGDLMKEYKSFVITIQVTPKHGGSGSVVHWHF-----EYEKINEEVAHPETLLQFAVE 306

BLAST of Cp4.1LG03g05890 vs. TAIR10
Match: AT1G70830.1 (AT1G70830.1 MLP-like protein 28)

HSP 1 Score: 116.7 bits (291), Expect = 7.7e-26
Identity = 86/310 (27.74%), Postives = 143/310 (46.13%), Query Frame = 1

Query: 136 MELALKLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 195
           +E  +++K+S +KF+  F      +    P + Q     EG     GS+V W Y H G  
Sbjct: 26  LETDVEIKASADKFHHMFAGKPHHVSKASPGNIQGCDLHEGDWGTVGSIVFWNYVHDGEA 85

Query: 196 EAAKVRMRVMDEARTII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGA--NFAKWTIEY 255
           + AK R+  ++  + +I + V+EGD +K YK+F + ++       K G   +   W +EY
Sbjct: 86  KVAKERIEAVEPDKNLITFRVIEGDLMKEYKSFLLTIQVTP----KPGGPGSIVHWHLEY 145

Query: 256 EKAHQNVASPETYLELALQ---------------------------------LKSSPEKF 315
           EK  + VA PET L+  ++                                 +K+S EKF
Sbjct: 146 EKISEEVAHPETLLQFCVEVSKEIDEHLLAEEEEVKTPETPSLVGKLETDVEIKASAEKF 205

Query: 316 YGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFPEAAKVRMRVMDEAR 375
           +  F      +    P + Q     EG     GS+V W Y H    + AK R+  ++  +
Sbjct: 206 HHMFAGKPHHVSKASPGNIQGCDLHEGDWGQVGSIVFWNYVHDREAKVAKERIEAVEPNK 265

Query: 376 TII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEKAHQNVASPETYLEL 407
            +I + V++GD +K YK+F + ++ V+  L   G+    W +EYEK  + VA PET L+ 
Sbjct: 266 NLITFRVIDGDLMKEYKSFLLTIQ-VTPKLGGPGS-IVHWHLEYEKISEEVAHPETLLQF 325

BLAST of Cp4.1LG03g05890 vs. TAIR10
Match: AT1G70880.1 (AT1G70880.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein)

HSP 1 Score: 84.3 bits (207), Expect = 4.3e-16
Identity = 52/144 (36.11%), Postives = 75/144 (52.08%), Query Frame = 1

Query: 265 LELALQLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 324
           +E  ++LKSS EKF+         + +  P + QS    EG+    G+V+ W Y H G  
Sbjct: 12  VETTVELKSSVEKFHDLLVGRPHHMSNATPSNIQSAELQEGEMGQVGAVILWNYVHDGEA 71

Query: 325 EAAKVRMRVMD-EARTIIYEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEK 384
           ++AK R+  +D E   I Y VVEGD LK Y +F    +    +      + A W  EYEK
Sbjct: 72  KSAKQRIESLDPEKNRITYRVVEGDLLKEYTSFVTTFQVTPKEGEP--GSVAHWHFEYEK 131

Query: 385 AHQNVASPETYLELALQVTKGFPE 407
            ++ VA PET L+LA +V+K   E
Sbjct: 132 INEEVAHPETLLQLATEVSKDMDE 153

BLAST of Cp4.1LG03g05890 vs. TAIR10
Match: AT1G70840.1 (AT1G70840.1 MLP-like protein 31)

HSP 1 Score: 75.1 bits (183), Expect = 2.6e-13
Identity = 47/146 (32.19%), Postives = 73/146 (50.00%), Query Frame = 1

Query: 265 LELALQLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 324
           LE  +++K+S  KF+  F      +    P   Q     EG     GS+V W Y H G  
Sbjct: 24  LETDIEIKASAGKFHHMFAGRPHHVSKATPGKIQGCELHEGDWGKVGSIVFWNYVHDGEA 83

Query: 325 EAAKVRMRVMDEARTII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGA--NFAKWTIEY 384
           + AK R+  ++  + +I + V+EGD LK YK+F + ++       K G   +   W +EY
Sbjct: 84  KVAKERIEAVEPEKNLITFRVIEGDLLKEYKSFVITIQVTP----KRGGPGSVVHWHVEY 143

Query: 385 EKAHQNVASPETYLELALQVTKGFPE 407
           EK    VA PET+L+  ++V+K   E
Sbjct: 144 EKIDDKVAHPETFLDFCVEVSKEIDE 165

BLAST of Cp4.1LG03g05890 vs. TAIR10
Match: AT1G70890.1 (AT1G70890.1 MLP-like protein 43)

HSP 1 Score: 73.9 bits (180), Expect = 5.8e-13
Identity = 46/144 (31.94%), Postives = 75/144 (52.08%), Query Frame = 1

Query: 265 LELALQLKSSPEKFYGFFRNHMGELVHMFPDHFQSFHFLEGQNFDDGSVVQWKY-HLGFP 324
           LE  +++K+S +KF+  F      +    PD        EG     GS+V WKY H G  
Sbjct: 11  LETEVEIKASAKKFHHMFTERPHHVSKATPDKIHGCELHEGDWGKVGSIVIWKYVHDGKL 70

Query: 325 EAAKVRMRVMDEARTII-YEVVEGDALKHYKAFRVKLETVSGDLNKVGANFAKWTIEYEK 384
              K ++  +D  + +I ++V+EGD +  YK+F   L+ V+    + G+  A W +EYEK
Sbjct: 71  TVGKNKIEAVDPEKNLITFKVLEGDLMNEYKSFAFTLQ-VTPKQGESGS-IAHWHLEYEK 130

Query: 385 AHQNVASPETYLELALQVTKGFPE 407
             + VA PET L+  ++++K   E
Sbjct: 131 ISEEVAHPETLLQFCVEISKEIDE 152

BLAST of Cp4.1LG03g05890 vs. NCBI nr
Match: gi|477504342|dbj|BAN14688.1| (major latex-like protein [Cucurbita pepo subsp. ovifera])

HSP 1 Score: 303.5 bits (776), Expect = 1.3e-78
Identity = 148/149 (99.33%), Postives = 149/149 (100.00%), Query Frame = 1

Query: 778 MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 837
           MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF
Sbjct: 1   MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 60

Query: 838 QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 897
           QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK
Sbjct: 61  QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 120

Query: 898 WTVEFEKANQNVASPQNYLELFVKISKGI 927
           WTVEFEKANQNVASPQNYLELFVKISKG+
Sbjct: 121 WTVEFEKANQNVASPQNYLELFVKISKGV 149

BLAST of Cp4.1LG03g05890 vs. NCBI nr
Match: gi|477504344|dbj|BAN14689.1| (major latex-like protein [Cucurbita pepo subsp. pepo])

HSP 1 Score: 303.5 bits (776), Expect = 1.3e-78
Identity = 148/149 (99.33%), Postives = 149/149 (100.00%), Query Frame = 1

Query: 778 MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 837
           MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF
Sbjct: 1   MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPETYQSIQLVEGQHFSSGSVVQFKF 60

Query: 838 QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 897
           QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK
Sbjct: 61  QFGDELRAEKWAIRVVDDVKKYIIYEAVEGDPLKEFKVLRAKFEVVNGGLSKVRRGNFTK 120

Query: 898 WTVEFEKANQNVASPQNYLELFVKISKGI 927
           WTVEFEKANQNVASPQNYLELFVKISKG+
Sbjct: 121 WTVEFEKANQNVASPQNYLELFVKISKGV 149

BLAST of Cp4.1LG03g05890 vs. NCBI nr
Match: gi|477504346|dbj|BAN14690.1| (major latex-like protein [Cucurbita pepo subsp. pepo])

HSP 1 Score: 266.9 bits (681), Expect = 1.3e-67
Identity = 136/148 (91.89%), Postives = 138/148 (93.24%), Query Frame = 1

Query: 634 MGKSDSIWAKVDLKSSHEKFYGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVLR-- 693
           M ++DSIW KVDLKSS EK YGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVL   
Sbjct: 1   MVQTDSIWVKVDLKSSPEKVYGFFRNHLGDLVDLFPENFNSIQLVEGQHFDRGSLVLSKY 60

Query: 694 ---HEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTK 753
              HEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTK
Sbjct: 61  EFGHEHRVEKWVIRAVDDVKKYIVYEAVEGEALKQFKVLRAKVEAVHGGSTKVGGGNFTK 120

Query: 754 LTIEFEKANENVASPEIYLELFVKIAKG 777
           LTIEFEKANENVASPEIYLELFVKIAKG
Sbjct: 121 LTIEFEKANENVASPEIYLELFVKIAKG 148

BLAST of Cp4.1LG03g05890 vs. NCBI nr
Match: gi|449464144|ref|XP_004149789.1| (PREDICTED: MLP-like protein 34 [Cucumis sativus])

HSP 1 Score: 230.7 bits (587), Expect = 1.0e-56
Identity = 108/156 (69.23%), Postives = 131/156 (83.97%), Query Frame = 1

Query: 1   MSQIESIWGKVQLKSSPEKFFGFFRNHMGDLVHMFPDHFQSFHFVEGQNFDDGSVVHWKY 60
           MSQ ESIW KV LKS PEKF+GFFRNHMGDLVHMFPD+FQSF F+EG++F  GSV+HW+Y
Sbjct: 1   MSQTESIWAKVPLKSPPEKFYGFFRNHMGDLVHMFPDNFQSFQFLEGESFTTGSVMHWQY 60

Query: 61  HLGIPEAVKIKMKNRDEA-RTIIYEVVEGDALKHYKVFRAKLETVSGGLNKVGGSFAKWT 120
           HLG P A KIKM+  D+  ++I+YE ++GD LKHYKVFRAKLE V+GGLNKVGG+FAKWT
Sbjct: 61  HLGSPAAAKIKMRVVDDVKKSIVYEFMDGDVLKHYKVFRAKLEAVNGGLNKVGGNFAKWT 120

Query: 121 IEYEKAHENVPSPETYMELALKLKSSPEKFYGFFRN 156
           IEY+KA+ENVPSPETYMELA+K+    + +   F+N
Sbjct: 121 IEYQKANENVPSPETYMELAVKVSKGLDAY--IFKN 154

BLAST of Cp4.1LG03g05890 vs. NCBI nr
Match: gi|659114461|ref|XP_008457062.1| (PREDICTED: MLP-like protein 423 [Cucumis melo])

HSP 1 Score: 230.3 bits (586), Expect = 1.4e-56
Identity = 107/156 (68.59%), Postives = 132/156 (84.62%), Query Frame = 1

Query: 1   MSQIESIWGKVQLKSSPEKFFGFFRNHMGDLVHMFPDHFQSFHFVEGQNFDDGSVVHWKY 60
           MSQ ESIW KVQLKS PEKF+GFFRNHMGDLVHMFPD+FQSF F+EG++F  GSV+HW+Y
Sbjct: 1   MSQTESIWAKVQLKSPPEKFYGFFRNHMGDLVHMFPDNFQSFQFLEGESFTTGSVMHWQY 60

Query: 61  HLGIPEAVKIKMKNRDEA-RTIIYEVVEGDALKHYKVFRAKLETVSGGLNKVGGSFAKWT 120
           HLG P A KIKM+  D+  ++I+YE+++GD LK+YKVFRAKLE V+GGLNKVGG+FAKWT
Sbjct: 61  HLGSPAAAKIKMRLVDDVKKSIVYEIMDGDVLKYYKVFRAKLEAVNGGLNKVGGNFAKWT 120

Query: 121 IEYEKAHENVPSPETYMELALKLKSSPEKFYGFFRN 156
           IEY+KA+ENVPSPE YMELA+K+    + +   F+N
Sbjct: 121 IEYQKANENVPSPENYMELAVKVSKGLDAY--IFKN 154

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MLP34_ARATH5.0e-2729.04MLP-like protein 34 OS=Arabidopsis thaliana GN=MLP34 PE=2 SV=1[more]
MLP28_ARATH1.4e-2427.74MLP-like protein 28 OS=Arabidopsis thaliana GN=MLP28 PE=1 SV=1[more]
MLP31_ARATH4.6e-1232.19MLP-like protein 31 OS=Arabidopsis thaliana GN=MLP31 PE=2 SV=2[more]
MLP43_ARATH1.0e-1131.94MLP-like protein 43 OS=Arabidopsis thaliana GN=MLP43 PE=1 SV=1[more]
ML165_ARATH2.5e-1030.41MLP-like protein 165 OS=Arabidopsis thaliana GN=MLP165 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
N0DK19_CUCPE8.8e-7999.33Major latex-like protein OS=Cucurbita pepo subsp. pepo GN=MLP-GR1 PE=2 SV=1[more]
N0DKL1_CUCPE8.8e-7999.33Major latex-like protein OS=Cucurbita pepo subsp. ovifera GN=MLP-PG1 PE=2 SV=1[more]
N0DK07_CUCPE9.2e-6891.89Major latex-like protein OS=Cucurbita pepo subsp. pepo GN=MLP-GR3 PE=2 SV=1[more]
A0A0A0LA34_CUCSA7.3e-5769.23Uncharacterized protein OS=Cucumis sativus GN=Csa_3G342860 PE=4 SV=1[more]
A0A0A0LAY7_CUCSA4.7e-4863.33Uncharacterized protein OS=Cucumis sativus GN=Csa_3G342350 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G70850.12.8e-2829.04 MLP-like protein 34[more]
AT1G70830.17.7e-2627.74 MLP-like protein 28[more]
AT1G70880.14.3e-1636.11 Polyketide cyclase/dehydrase and lipid transport superfamily protein[more]
AT1G70840.12.6e-1332.19 MLP-like protein 31[more]
AT1G70890.15.8e-1331.94 MLP-like protein 43[more]
Match NameE-valueIdentityDescription
gi|477504342|dbj|BAN14688.1|1.3e-7899.33major latex-like protein [Cucurbita pepo subsp. ovifera][more]
gi|477504344|dbj|BAN14689.1|1.3e-7899.33major latex-like protein [Cucurbita pepo subsp. pepo][more]
gi|477504346|dbj|BAN14690.1|1.3e-6791.89major latex-like protein [Cucurbita pepo subsp. pepo][more]
gi|449464144|ref|XP_004149789.1|1.0e-5669.23PREDICTED: MLP-like protein 34 [Cucumis sativus][more]
gi|659114461|ref|XP_008457062.1|1.4e-5668.59PREDICTED: MLP-like protein 423 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0009607response to biotic stimulus
GO:0006952defense response
Vocabulary: INTERPRO
TermDefinition
IPR023393START-like_dom_sf
IPR000916Bet_v_I/MLP
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006952 defense response
biological_process GO:0009607 response to biotic stimulus
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g05890.1Cp4.1LG03g05890.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000916Bet v I/Major latex proteinPFAMPF00407Bet_v_1coord: 9..141
score: 1.2E-23coord: 782..926
score: 2.5E-21coord: 638..775
score: 5.8E-15coord: 267..402
score: 4.0E-22coord: 403..480
score: 1.7E-10coord: 487..631
score: 7.2
IPR000916Bet v I/Major latex proteinSMARTSM01037Bet_v_1_2coord: 635..781
score: 1.3E-7coord: 270..411
score: 2.7E-8coord: 6..142
score: 9.7E-8coord: 782..933
score: 3.8E-10coord: 143..268
score: 3.0E-5coord: 484..633
score: 8.6
IPR023393START-like domainGENE3DG3DSA:3.30.530.20coord: 412..483
score: 2.3E-13coord: 639..775
score: 8.5E-18coord: 268..403
score: 1.1E-22coord: 777..927
score: 9.1E-23coord: 142..267
score: 4.2E-21coord: 1..141
score: 1.3E-23coord: 484..631
score: 4.2
NoneNo IPR availablePANTHERPTHR31907FAMILY NOT NAMEDcoord: 260..402
score: 7.3
NoneNo IPR availablePANTHERPTHR31907:SF4MLP-LIKE PROTEIN 165-RELATEDcoord: 260..402
score: 7.3
NoneNo IPR availableunknownSSF55961Bet v1-likecoord: 638..668
score: 7.69E-13coord: 697..775
score: 7.69E-13coord: 413..481
score: 8.55E-11coord: 846..926
score: 1.75E-16coord: 780..812
score: 1.75E-16coord: 197..270
score: 3.9E-15coord: 139..168
score: 3.9E-15coord: 487..517
score: 4.29E-16coord: 553..631
score: 4.29E-16coord: 1..39
score: 8.24E-19coord: 68..141
score: 8.24E-19coord: 326..404
score: 8.24E-17coord: 267..297
score: 8.24

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG03g05890CmaCh14G008460Cucurbita maxima (Rimu)cmacpeB283
Cp4.1LG03g05890MELO3C032036.2Melon (DHL92) v3.6.1cpemedB694
Cp4.1LG03g05890CsaV3_3G021380Cucumber (Chinese Long) v3cpecucB0749
Cp4.1LG03g05890CsGy3G020930Cucumber (Gy14) v2cgybcpeB397
The following gene(s) are paralogous to this gene:

None