Lsi06G005790 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi06G005790
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionUnknown protein
Locationchr06 : 7657690 .. 7682529 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAATGTAAAGATTCATAAGGACCTAAGGCAGTTAAGATGTTCCTTACCTGGGGCTTGAGCTGGGCTAGTAGCTTACCATTTTGATTCTCCAATTTGTTCGTGGTGGTGGCCAACTGAGTAATCTGGTTGGCTAAGTGTTGCATGCTTGTCCTCGTTTCCTTCAACAACAACGAATTTTCTTGTTGAATATTTCTCATCTCTTGTTGGAAGGTCATTACATCTTGTTGGAATTTCATTGAGGACTCAGCTAGGGACTTCACCACATCCTCCAGTGGGGTTCTAGAATTCGAGTTGCTAGGTTGGTCAACCTGTCCTCCCCCTCCTCAACGGAAGTTTGGATGATCCTTCTGTTGGATTTTATGTCATAAAACTCGTAGTTTGTAAACTTTTAACATCTTCTATAATCTATAAAGTTGTTATCGAGGTTATTCAATAAAGTTGTTATTAAATATGTGAATTGTTGTTGATATAAGGCTAAATCCAATTAACTGAGAACCCATGGCTATATTATGAGTACTTGAACTATATGTGGAAATATAAAGTGGACCAAGTTTGAGTAAATAGTCTAAACGGTCTATAGTATAGGGATTAGGATGGGTACCTTATCCTTGGAACACTATTGGATGCGGCCCACTTTGTATTTGATACAAACGATGTGATTCTGAATCATTCATATAGAGACATGTGAGTGGGGACGTTCTGTACAATGAGTTGTACAAGACTGGACCACGAAATAGTCATTGTTGCCTTATAACGCAGTTTACTGTTAACACTAACTAATTTTAAATTGATGACCTAGGTAACTCAATCTTAATCATAAGCTAACTATGAACTCCTGTTTATTCGAAATTATCCTTAGATCTGCATGGGTGAGAGTGGCCTGACTTCGCTGACTTAATAAGTCCCCCATTTCAGGGGTAAGACCGGGTAGATAGTTGAAAACATAGTCCTGCAAGACAGAATTCACTCCTACCCGACTTTAGGGTTAATAGAGAGATTGTTCCCTTAAGTGCTAATTCTAGGTCTTGAACAAGGATCTCACCCTCTCATTGGCCCAAGAGAGACTTAGTTTGTTGGTTAGACCACAAACCGATTGTTCATTAGAGGGTCAGTAGTACTTAAGAAACAAGAAGTAATCTAGGGGTAAAATGATAATTGACCCAGCTGAGATTACGAACAACCTGTGAAAGGTCGACTTACTAATGATGGTTATATCAAGTGGACAAAAATATATCTATAGTGAGGGAAGTGCAACTACAGGCTATAGTGGAATGACCCGTTAGTTAACAAATGTTGATTAGCTAGGTTAAAGAGTTTGGCTGGTTAATCTCGGATCATTAGAGCCCATGATCTGTAGGTCCATCGGGTCCCCTTGCTAGCTCACATCGAATAAAAACAAAGAACTATATGATGAACAAGTTTGAAACGTTCAAATTCGGTTTAGGAAGAAAACATCAATTATATTTTATATAATTAACAATCATTTATTTAATCGAGAATTAAATAAATTAGAGAATTAGAAATATTTAAATTAGATTTACATTTTTTTAATGTTTCAAGTTTAATCAATGAGATTGGTTATTTAGGGTTTGGTATATTTTTTAATTAATTAATTAAAATTATTTTGATTAATTAAGTTTAAAATTATTAAATTAAAATTAGATTTTAATTTAATGCATTTTAAATTAATAAAGAGATTATTGGAAAAATCCATTTCGTTCTTGGATTTTTCCACTAAATCTCTTTAGATTGTTTGACATCACCATGAACTTACACCTATTCCCTTCCTGCATAGTGCAAGCTTGAGTTTCAAACATGCAAGACTCCTTTATGCATGGATCTTCTCTATATATACTTATTTAGAAGAAGAAGAAGAGGTCGGTCACTTAGTAGTTTTTGCTAGAAATATTCACCAAGTATCTTGAAGGTTTAACTCTTCAAATTATTTACTTTCTCCCTAAATTTTCTCACTAAAAATGGTCCCACAACCTTGTTCCTTGGCCGGAGAATAGTGAGGACTGTTCGATGGTAATCCTAAGAGTTTTCGGGAAGCATTTCATCCATTGTTGCAAGGAGAGAAGAAGAGTTTCAAAGTCATCAAGGGTAATGTACTCTCTTCTCTTTATCTTTATTTATTTTTCTAGCATGCTTATTCTTTAATTTTTTGCAATTTAGAGTGTAATATGTTCTTGGTTTCGCTGCGCATGTCTTTTACTTCGATCACCTTCCACACAGATTATATGTGTTGCTACAAGGATCATATTTCCGTCCTCCTTTGTTTTCAACATAATTAACCTACTCGACGAGAGCCTTGGGCTACAATTGAGTACAAGTCATATGTCTAACAAGAGCAGTAAGGTTATTGTTATTGTTATTGTCTAACCAATTTTAATAAAATAAAAAAAATAATAAGATTCATTGAAAGGACATGAACTAGAATCGAATAATTAAATTCCAAAGTCCATGAACCATAATGGTATTTTAACCTAAATTTTTTTACCAACTTAAAACTTTGGATGAGAAAAATTATAATCTTTCCTCTCTGTTGACTTGATTTCTTCCTTCCTTTCTTTTTTACTACATCTGCCACTAAAACTTATTGTCATTCACATTTATGGTTGTTTGAGGTGTTAAGTTGGTTATGATAGTCTGTGATTATTATAGTTTGTGGAGTAATAATAGTTTATGTTTGGAGTGCAAACTATTAGTATGAGTTTAAATAGTTCATGTTTACACCACAAGAAAAACAATGTTTATCGATATTTTTTCAATGTCAAGTGTGATTGTGCTTGACGTTTTTAAAAACGTCAAGGAAGCCCGTGTCAAGAATGTCGGGGTTCTTGACATAGTATAAACGTCAAGAACAGTGACAGTTGACATTTTTAAAATGTCAAGAATATGAGTTCAGAAAGTCTGTGTTTGGAGTGTATACTATTTTAGCATGGGTTCAAGGAGTCTTTGTTTGGGTAGCATATTATTTTAGTCCAAGTTTAGGGTTTATGTGTTTGAGTTGCCAATTTGTTTAACATTGAGTTCTTTAAGGGTTAGGGTTTAGAATTGTTTTCATATATATATATATATATATATATATTTGCATTGTATTATACAAGTAGTAGTACCTTAAATATCGCATGTTGTGATAATAAAAATAAAAAAAATTATTCCACAAATTTTTAAAAATTTGTTATATTTTAAGTTACATTAGTCTTCGTTTATTTGATCATCGTTGGTGGGTGGAGTCATTGGAAGTTTTGGAGAGAGAGAGAGATTGTTATTTTAATTACTGATTTGTAAATAGTAGTACTATAGCAAAACGTGAGTTAAAATAACCTTTATCCACTATTTTTTTAATACACACCCCAAAGGCCGAGTAGGCTATTATATCCCACTCAACTCATTCTAAGTTGGAACCTCAAACACCTCCTTAAAATCCACATGCAAAAGTTGTACGATTAAAAATTGTTGAACCATAAATAAATATATTGAAACTTTTTAATAATTGATACTCAATGGATTATTGTGGAAGGAGTCTAATTCAAGTAGATATTTAACTTTGTTATAAATAATCAATAAAAAAATTAAGAATAATGACTCAAAAATAGGGTTGTTTTCTAATAGAATAAAAGCCAAAATATTCACAAATATATAATCAATATATTTTGTTTTCATTTGTTTTTTAGAAAATATAGTTAACTAATTTTTTTTAATTTTTTTACAATAGGTGGGGTGAGAGATGAGAGAGTCTAATCTCTAACTTCAAAGTTGATAGTACAAACTTTTCATCAACTGAGTTATGCTCATGTTACCTAATTAAAGTTAAATAACACATTTAATCAAATATATAGTAGTTTGTGTTTATGACTATCAAGAACATGAGTAGAGCAATATAGGGGTTTATTCATCAAAAGCTAGGGGTGGCTCGGAAGAATGCCTTGGGACAGTATATGGATCTTCTTTTTTAGTATGTTCGATGGAAAAGTGTGATCTTCTCATCAACTAAGGATTGGTTGCGGAAAGTCCATCAAGTGGAAAGAGAAGTTTTTTTTTCTTTTCCTGGGGAAAAAGGTGCTTACCAAGACAGTGGTGCAAGTCATCCCGACGTCTATGATGAGTTGTTTCAACCTTCCGGGATCTCTGTGTGATGAATTAAATTCTATGTGTGCCAATTTTTAGTGGGGTTCGGGCGACGAGCATAGGAAGTTGCATTAAAGGAGGTGGAGAAGGATGTGTGTGGAAAAGGAGATGGGGAGCTTAGGCTTGTGGGATTTGAAAATATTTAATCAAGCAATGTTGGCAAAGTAGTGTTGGCGGTTGTGTAGGCAACCTAAGAGTCTTTTATTTTGTGTTCTTCGTGGTTGGTACTTTCAAACTGACTTTTTTTTTTTTGCAGGCTCAATTAGGGCCAAATTTGTCATATGCATGGAGAAACTATCTGTGGGGGCATGTGTTGTTCGTCCAAGGGTATAGATGGAGGGTGGGTAACAAGTTGTCCATCAAATTAATAGAGATCCTTGGTCGCCTAGGCAAGAAAATAGCAAGGTGTGTCAGGTGGGAGAAAGAGTTAGGGGCGGTTTTGTTACATCTCTTTTGACAGACAGTGAGCGGCGCCTTTGTTTCTTAGAGGAGGAGGCGAACTTGATTTTGACAATTGTGGTGAGTGGGCTTGAGAGAGAGGATGATATTTTTTTGGGGTTTTGGTAAGAAGGGTATTTCCTCGATGAAAAATGTGTATAGATTGGGTTTGTGTATGGATTCGAGTGGCGAGGGATCCTCATCCTCTACTAAGGCGATGATGGCTTTATGTTATTGTCTTTGGAAAAGAAATTTACCATCTAAGGTAAAAATTTGTCGTTGGCAGGCGTTGCAAGGTATAATTCCTACTATAGGGGGAATTTGGTTGGAAGAGGAGTGGTGTTTGATCTGCGTTGTCCTCTATGTGTAGGAAAGCCTGAGACTTCGATGCAGTTATTCTAGGAGTGCAAAAAGGTTAGGAAGGTTTGACTCAATATTTTTGTATCTAATGCAGTTTCTATTTTATATGGCAGGGACTTTGGGACTCTGATGGGTCGGTTTGACTAGTTACGAGAACGTCGGGGAATCTGCTACTTATGTCACTATGAAGAATTTGGACCTTACGTAATCAAGTGGTGTTTCATGGAGGTGGTGTCTCAGTTCCCTTGCTTGTAGATTCCATTCACGATTTTGTTCAAGAGTTTGAGGGGTAAGTGCGAGCGAGGGTGGGGACTCTGGACTGAGTTTTGCAAGATCATTCAGGGCGGCTCAAAGGCGTAGGGTGTAGCATGTGCGTGGAGATATGATCCATTGCTTGTGGTGGTCGGCCATAGCTAATGGGTTAAAGGTTGTTGCATCTTAGGGCTTCTATTCAATCAGAGTGGAGTCTGATAATTTGACAACAATTCAATTATTGAAGATGGAGGAGCTCAATTTTATTATTATGAGCTCATTTGTGGAGGAGATTCTTGAGTTGAATATAAATAGGATAGTGCATCAATTTTTGCATATTAGTTGTTCATAGAATAAAGTGGCAGACCGGTTGGACTCTTAGGCGCTTAACAAGATATGATGTTAAGTGTGGTGGGAATATTTACTTGTTTGGATAGTTAGCTTGCTAAGGCAGATTATAACCTTATTAGTTCTTGTTAGTTTTTCTCGAATGTTCTTCAATATATATGTGTGTGTTAACAACATATATCCAAAAAAATCGTGGAGATTTTGTTGGATTTTTTGCCCTTCATCTCAAATTAATTAATTTTAGTTAATTAATTAATTAATATTTTGATAATAACAAACATGCTTTTACTCACTAACAATTTTAGGGTTTTATAGAGCTCAGAACAACGATACCAATGCATCTTGAATCACTTAAATCATAGCTCAAACGAAGAAGATATAGTCAAAACAAACTTAACAGAAAAAACCCAAGATGATGATGTGGCAGATTTTTTTAATCAAAGTGGACCAATTGAAAGGTGTCAGTTGAAGCAATGGAGGAGTGACACAAGGTGGATATTTGAGTTTTTGTAGGTTTTTGAAAGAAAATGATTTTAAAAAATATACATTATTATATTATTTTATGTTTGTGTCTAGGGAGCTAATTGAACACATGGTGGACGAAAAATTATATTATTTTATGTTTGTGTTTAATGGCTAGTTAATTTTAAAAAAGAAAAATCTCCACCATTGGCTTTTCTGTTTCAAGATCTGAAGTCTCAAATTGAATTTGGTTTTTACCTTTTTCCCTATCTTTTTAAACATTTCATCTTCTCCAAATTTCTTCAACAAATTTTCATTATTTTCAATAATCTATAAAACACAAAAAGTTCTCATTTTTACTCTCTCCAAGCTTCAATTTTCTTCTTTTCATCTTATTCTTAAGAGAGATATTTTATGTTGTAAGGGCTAATTTGTTGTGTGCTACAACGTGTAAATCCACTCTATTGAGAGAGTTGTAAGTGTGTGTTTTAATTTTTTTAAAACTTTTGTAAGGGTTGCTCTTGATCCTGGAAAAAGAGTTGTTTTGATGGTTTGATCCTCAATCGTGGAAAAGATCGGGTTTGTTCTTACGCCCGAAAAAGGAATGTTGTAGTGGTTTAACTTTAAGTCGTGGAAAGAGTCAAGTTCGTAATGACATACCTCAGAGAAGCTTGAGAAGTGGATGTAGGCCGGTTGTGCTAAACCAGTGTTCCCTCTCCTAATTTATTTTTGCAATTAATTTTTATTATTATATTTCATTGCATGAAATTAATTTCACTTTTTAATTCTTGAAATCAATTGCTCATATTTGTTTATTTTGATTGGTAGTTCTTAAAATTATATATCATTTACCAATTTTATTATTAGAATTAAATAAGGTGCTTAAAAATGTTAGTTTTAAGTTCTTGATTACTTTGGGTTGAATTTATTTGTATGAAATTCTTGTTTAAATTGCTTTCTAGCAAAAAGTTTATTAACTTCTTTAATTGGGTCCATTTAGAGAAAAGATTTCATTAATTTTAATAAACTCTATTCACTCTCCTCTAGAGTTGCCATACCAATCCTACAATTGGTATCAGAGCAAGGAGCTCTTATACATAAAATTATTTTTCAAATATAATTTTGATACTTGAATCATTTTATTAATATTTTGAAAAATTATTTGAAAGCTTAATTGCTAGAGCTATGGCAACTTTAGAAGACTATTCTATTGATAGGCTCCCATTCTTTGATGGTATTAGAAATTATGATGTTTGGAGAAATAAAATGAAAACTTTTATGCTTGCTTTAGATTTTAACATTTTGTACGTTTGTGAGCATGATTTGTCAAAATTAGAATCGAGTAATGAGTTGCTTACTTTTAAATGTCAAAGCCATCAAATATTATATATTGTGCATTGGATAGACAAATATGTGATAAAATCTTAGATGTTGATTATGCATATGATGTTTGAAATTTTCTTGATAATATGTTTAGTGTAGAAAATTCTCCTTGTACTACTAATGTTTTATTTGATGTGTGAAGGAGAATTATAATTCTAATAATCAAGAAAAGTTTCATGAAGTGAAAAAGGGGAAATGTGATAGATTCTCCTTGTGCTTAATGGCTCATTTGGATAATAACTCTGGAGATGGAAGTGATGTAGTAGGATTAGGGTATTATGAGCAAATCTTTAATTTATTAGGATTAGGATTAGTTTCTTTGATTTATTAGGATTAGGATTAGTTTTCTTAATTAATTAGGATCCATCTAAGATTAGTTTCTTTGATTTATTAGGATTAGGATTAGTTTCTTTAATTAATTAGGATTAGAATTAGTTTGCTTTGCAATTCTCTATAAATAGAGGGATTGTCTTCTTGAATTGACGTGAATTTTTAATTGATTTAAATAAAAGTCTCTCTTGATATTCATCAGAAAGTGACAACAATAAGGTAAATCAAAAATCTCCTTCTTAATTATGATGAATTGTAAGATGCTTTTGAAGATATGCAATATAATTTAGAAAAGACTAGCTCTAAATTTCTAAAATTATCCAAAAAGTTTAAGGCATTGTCTATTGAGAATGAACCTTTGATAAATGAAAATGCATTGTTTGAAAAATTGAGCTAATAATGTCTTGTTGCATGATTCTTGTTGTGATGAGAAAAATGTCTTGAATGATAGAACTGCTTTCCTTGAAAAAGAGAATGATGAGCTCAAATCTTTATCATGTGAAGTGAATGTTGAATTGAATGATTTGAGGAGTGAAAATGAGAAATTTAAATCTTATTCTTTGGAATTAAAGGATGAAATTGCTAGTTTAAAAACTAGAATTAATTGATTTTGAAACTATCAATATAGCATTAGAAAAAAAGAAAGTTTCTTTTGTTGATAAGATTAAATTTCTTGAATGTAATAGTCATGAGAAAAAAGATCGTTTGCATTTGCTTAAAGAAAAAGAATTGCTTGCTAATAAAGAACTTGAAATTACAAAAGAGTCCATAAAGAAATTGACTATTGGTGCTCAAAAGTTAGATACAATTATTGACATGGGTAAACCTTTTAATGACAAAAAGGGATTAGGCTATGTGAATGAAAATTGTCCATCTCTGTCTCAAGAAGTTTTACCCAATGCGCTTAATTTTGTTGAATTTAAATTTGTCTCAAAGCATGAAAAATCTGGAAATATACATGGAAAATCAAATATTGTGCCTAATCAATTTCATGCTAGATTAAAATGTGCTCATAATGTGAACTTTGCGAATAAGCATGAAAAGGCTAGGCCTTTCCATGAAAAATCAAAGTTTATACCTAAACATGTTCATAAAAATTTTAAATGTATGCATAATGCTAAATTTTTGTCTAGTCATGAAAATGCTAGAAATGAGCATGAAAAATCAAAATTTGTAACTACTTGTCATTATTGTGGAGTTAGTCATATTAGACCAAATTATTTTAAATTGAAAAATGTTCCTAGAAAGAAATTCTTGAATAGAAATAAAGTGCAAAATCAATTTATTAAGAATAAATTCTTGTCTAATGTTACTTGTTATTCATGTGGCAAAATTGGTCATAGATTTAATACTTGTTATTTGCCAAAAATCAATCCTAACGATGTTGGAACTAAATTTAAATGGGTTCCAAAATCTTTGCTCACTAACAACGAAGAACCCAAGAAATGCCTACCAAAATCTTTTAGATGAACTTTCTTTATAGGTGTGATTCAAGAAGTAATTGTTGCTTGGACTTCAAACAAGAAATTAATTCAATTTGGAGCAACTAATATGACTTTCTTGAATCTAAGAAAAGAAAATTTATATATTTGGTAACCTTTATTTTATTGAGATAATTATTCATTTGGTATCTTTATGGAAATTGTATCTCTTGAATAGTTTTGAATGGTTAAAATTTGAGATTTTTGTGAATTACATTGATAATTGGATTTAATGTCATGAAAAATGATTGAATATCTATCTATTTGGCATTTGTCTTTAGTGCTTATTACTTGACTTTTTGGTCATTTAGATGCTTGCTCTATCCTTATATTTTTCTTGGAAAGTTTTGAGCGAAATGATATTGATTTTGGAATATGTTGTTTGCTCTATGTCGAGAAGAATTGATCATTATGAGCATGTTGTTTTGAAGAGTTGTTAAATTTTTTTCCATAGGGAGAGTAGTTATTGAATTGACTTACTTTCTTGATCCTCTTTAGTGATTTGCTCAAAATTGGTGTAATTCTTGTTTATGTTGTCACTATACTTATTTCATAAAATTGATTTGCTCTTATTTGGTATCGTGATTTGTTTTTCAACAATTGATGCTCATTTTATCATATTTGATACTTTGGTATCTCAATTTCTTTAACTTTACTCAAATAGTTTTATCTCCTTGGTTTTTATTGATTTCTCTTATTAGAAGGAGATTGCGTGTTTGTAGGAGAAGTTATACACTTTTATATATGAATTTCATGTTTAAATGAAATTGTTTCCTGTCGTGTGCCCCTCTTATGTATATTGTTTATACTTCTTTTAGGGGGAGTAATTGCCTTTGGAGTGAAATTTTATTTGAAAATATGTTTGATATTTCTACCCCAAGTTGAATTATGTGCTAAATGTCTTTTTGTTCCTTATTAAGGGGGAGTGTTCAATTGATTTCAAAAGGGGGAGAAATTTGTTAAATTGGACTTAATTTGAATTAGTCTAGACTTGATTGATTCTTTGTGATACCTCTTGAATTCATTTTTCTTGAGTAGTTTGTCATCATCAAAAAGGGGAGATTGTTGGCTTTTTGGCCCTTAATCTCAAATTAATTAATTTTAATTAATTAATTTTAATTAATTAATTAATTGATGTTTTGATGATAACAAACTTGTTTTTATTCACTTACAATTTTAGTGTTTTGAAGTGTCCATTGCATAGAGCTCGGATCAACGATTCCAATGCATTTTCAACCACTTAAATCGAAGCTCAGAAACAAGCTTAACGGAAAAATTCTGAGTTGATGACGGGGAAGATTTCTTTTTATCAAAGTTGACCAATTGAAATGTGCCACTTGGACCAATGGAGGAGTGACACATGATCGGCTTTTGAGTTTTTGTTGGTTTTTAAAATAAAATTATTTTTCTATATTTGTAAATAATTTGATGTTTTTTCTATTTATAATCATTTTTCATTATTATATTATTTTATCTTTATGCCTAACGGCTAGTTGGACACATGATGAACTAAACAATATATTATTTTATATTTGTGTCTAACGGCGTTAATTTAAAAAAGAAAAATTTTTCACCATTGATTTTTATTTTTCAAGATCTAAAGGCCCAAATTGAATTTGGTTTTTACCTTTTCCCTCTCTTTTTAATACTTCATCTTCTCCAAATTTCTATAACAAATTTTCATTATGTTCAATCCTTCATAAAACACAAAGTTCTCTTTGTTGCTTTCTTCAAGTTTCAATTTTCTTCTTTTTATCTTATTCTTGAAAGAGATACTTTATGTTGTGAGGATCGGTTTGTTGTGTGCTAGAACTTGTAAATCCACTCTATTGAGAGAGTTGTAAGTGTGTGCTTAAATTTTTAACTTTTGTAAGGGTTGCTAGCTCTTGACTCCGGAAAAATTGTTGTGTTTGACGGGTTGATCCTCAATCGTGGAAAGATGAGTTTGTTCTTGCGCCTGAAAAAGGAACGTTGTAGTGATTCGACTCTAAGTTGTGAAAAAAGTTAAGTTCGTAGTAATATATCCAAGAAGAGCTTGGAAAGTGGATGTAGGTCGGTTGTGCCGAACCAGAGTCAATTTTTCTCTCTCTCTCTCTTAATTTATTTTTGTAATTAATTTTTATTATTATATTTCATTGTATGAAATTAATTGCACTTTTTAATTCTTGAAATCAATTGTTCATAGTTGTTTATTTTGATTGGTAAATCTTAAAATTACATATTATTTACCAATCTGATTATTAGAATTAAATATGGTGCTTAAAAATGTTAGTTTTAATTTATTGATTACTTTGGATTGAATTTATTTGTATGAAATTATTATTTAAATTACTTTCTAGCAAAAAGTTTAGTAACTTGTTTAATCGGGTCTATTTAGAGAAAAGTTTTCATTAATTTTAATAAATTTTATTTACCCCTCTAGAATTGTTATACCGATTCTACAGATTTAAGTTAAAAATTTTGAACTATAAAAGCTAAATTTAAAATTAAGCCTATATAGTACGAGAACAAACTATAATTATGTCAAAAAGCTTTGTGGTTTAAGAATAGTTTCGATTTCATCTCTCTTTATTTTAAAAATTTTCAATTTGGGTTGCATGGTTTTTGTTTGGTTTTCATTTTTTTTTTCTAGTTCTAAAGTTTCAATTCGTCAAAACGAGTATAGTTCAACTTACATATAGTATGCTTTAGTGACCATGAGGTTTATGGTTTGAATCTCTCTATCTCCTATTGTACTTGAAAAAAGAAAAAATATCAATTCAATTTTATATGTTAGTTAAACCTTACCAATTATATTCCACTTAATTGAAAACCCGTCTACTATTCTCTTTTGTTTTTTTCTCCGAAACGCTTATACGGTTAACTAGCACAAAAAAACAAATAAAAAGCATCAACACATCGAAAAAGAAAATGTTAATTTGGTAACAAAAGAGACCATTTATAAGATTTGACTAAACAATAAAATCAAATTGAAACTGACTCTATAAAAATAAAAACTGAATTGAAACTTTTAAAACCGTAAAAGTGAAAGAGATACCATACTCACTACTCAAACTATATATACCAAATTTGTAACCTAATAAAAAATATAAAAAGAAAAAATAAAAAAGAAAAAGAAAAAGAAAAAGAAAACTACTTCTTAGATTATGTTCAAATACGAAATTGTCTTTAAGGTTTCAAATATAATACAAATGTTAAGGTCTCAATTTATGAATATGTCACTAAAAATATATTAATATATTAATGTCCGATGAATATTCATAAACACGTTATAAATATGAGAAAACTCATAAAAACATATCTAGATTAACAAACAAACCTTTCACAATTTTAAACAAGTTTCAAATTTGCCCATTGTTAATATTTGGTTTACATTAATAACATTTAGTTGACATTTTAATATTTTAATTACAACAATTCACATCTCACATCCATGTCGTGAACTAAAGTTTAAAAGGTAATTGCAATTGGTAGTATTTTTAGGACTTATAATTAAGTATATAATAATATTTTAAAGGAAAATTTTCACATATGGAAAAAATATCAAACTATTTACATAAAATGGCAAAAAAGACACTGATAGACATTGATAGACTTCTATCAGTATCTAAAAATTTTGCTATTTTGTGTAAATAGTTTCCCTTATTTTTCTATTTTTAAAAACTCCCCTATTTTAAATGAATTGCAAAATATAGACATTTTTTTTCCAATCTCTTCATTTATTTTATCTATTATACTCTTATTATTTCAAATTGGCTAAACCCAAAGCTGTTTTTTGCATTTATGCCCGTTACTAGAGATGATTGTGGGGGCAGGGAAGAGTTGAGGATGGCTTCCCCATCCTCGCTCCTTACTCCCATTTCATTTCTCATCCCCACAAAGTTACGCACAGGGATCAGACATGGGAATTTTCCATGAGTAATTTGAGGGAATCACGTTCTCTATGGATTTTTTTCCCATTTGGTTGTTTTTTCAAAAAAAAAAATCAATTAACTTAAATCGCTACTATAGAAAAGTTTTAATTAAATAGTTAAATTTCTCACTAAATTTTTTCTACGTTAGTTCCACATGAAAATTTTGAATTTAAAGAGAAATAACTTTAATTTTCTAATATATATATATATATATATACTAAAAATTTAAATGTTCGATTCAAACATATATATATATCTTGACCATTAAACAATAAAACTTGAAATTGAAATATAATTTTATAAAATAATAGTAATAACATATCATTAAATTATTATAATAAATCAAATAGAGGCTAATGGAGTGGCGACGGGAATAGGGAACATTCCCCGTCTCCATCCCCGTTTAGCTTACGAAGAATTTTTCCCCATTCCCACCCTCATTCCACGCCTAAAGGGAAAATCCCAGCCCTGTTAAAGGCGAGTCCCAGCGGGGCTCCGTATGTACTTGCAGGGAAAATGGACATCTCTACCTACTACGATATCAATTTTTCTACTCTCAATGTTGTTTCGGGTCCTAACAACATCATCTACGACAATATTTGTTTTGACTTTTTTAATCAAAAGTTTGTATCTTATGTCAGTTGAACTATGTTCATGTTATCAGTGTAACTATGCTCATCTTGGCTTCTATCATTGTAGATTTTTTTTATAGTTGGAAGTCTACCAATGATAGAATTCTATCACTAACATACTTTATTTATAATAACTCAAATCTTTCAATGATAGACTATTGTGGCCGATAGATTTTAATCACAATCAACAACCAACATATAGTCTATCTCTTCTATCATTGATAAATTTCAAACAAATATTTTTTTTCTTTTTTTTTTTTCTTTCTCCATTTCCTCACTTTCTTCCTCTAATTTTGTTTTCTCCATCTCTTTTATTGACAACTCAACAAAATATTCAAAATAATTTAATGTAATAATCTTTTTTTCCCAAAACAACCATAAAAAACTAAACTATATCTACAATCGAAGCTTAAAGTATGTAACAAAAGTCTATTAGTGATAACCTTCTATATGAGCTATTTAAAATCTATAAATGGCTATTGTTGATAGACTTCAAATAATATAAACTTATATATCATTGACTCTAATTAATAATGAAGTTTATCACTGATACCTCATATCACCTACTAGATTCTATTGGTGATACCTTATATCACTAAGAGGCTGTTTGGGGCGCTGAGTTGAGATATGATGTCTGGATATATGATGTCTGTGGAGTTTATATGTCTGTGGAGTTCATATGTTTGTGTTTGGGGTGCAGAGTTGTTTTACCCTTAATTCAGATGTCTGTGTTTGAGGTGCAGAGTTAAGTTCATATGTCAGTGGTATCTAGTACAGTTTTTTAAAATCTAAATAAACAGATGTTATGATGATTATTTTTGTTTTGAAAATTTTGTATTCAATAATTCTACACATCTTTTGCTTGGATAAATATGGATTGTAATTTAATTTTCATTTTTTTTTAAAATACACATCTATGATTAATGATCTTGAAATATGGTCAATGTTTTAATTTTAGACCATTAATTCCTAATATTATTTTTATACAAAACCAATCATTCTAAAAATTTGATTTTCTATTAATTGGTTTGTTTAGTTGGACAAAATTATTTGGATTAATTTGGTGAAGATAACTGTTGACTTAAATATTATGTGGAATACATCACCAAAATGTGTTCATAATAAGATGCCTATATTGACGGTCATATTACATGCTTTATACAAAATTATGTAATACCTGATCAACGAGCCACATTTTACCGCTGGTTTACATGCACACAAAAAATATATATACATGGTCAACAAATTACCAAGTTCACTCACAAAAAAAAAACTACACAATCACACATACATCATTCAGATCAGTTCGTCTTGCCCAACATATGTAGGCAATATCCTCTTCTCGCATGGGAGGGAATTGTTAGGAAACAATCTGTTTTTTGAATATCAGTCACTATCATATCAATAAGGGCCAATCGATCAACTTCCTCCAAACCATCAACTTGATATATCTGATCGATGACTTCCTTTCGTCTTTTGGATTCCATTTCATATTTCTCCTTCTGCCATGATACGAGCTTGTCCAACTTTTGTTCATGGATGTCCATTGATGTTCGTACTAGGTCAAACATTTCAGATTGGAAGACATTCCTCTTTCGCTTGCTGCCTCTCGAACCACTGCTAGGTCCCGCACGCCTATTTGCCGTTGTATCTGAGAAAACCTCATCCATACCGTCAACCGTCGATATGTTCTCAGACAATCCATGGTCAGATTGCTCATTATCCTGTGATCCCAACCTAACATCCTCCAAATGATCATCTGTTTCAGATGCTTGATCCAAAGGAGTTTGACAACCGTCATCAGTAGCCCTGTCCTTTCCAAACACGATGGAAAGTTCATCATAGTGAGGGAATGGTTTATGTCTCATACCTTTCGCGTTGGGATGAGATTACAAGTGAATATATAATATGTTAGTAAAAAATATAACAACTGTGGTAATATATTGTAATGCCTTGTCTTACTCGAACCCATAGATCAAAAACATCCTGCTCTACGTCTACACACTTGAACTCCTCGTTCCAACTGAATCTGCTACATGCGTTACTCAACATCTCAGCAATCGCTGTATACTGTCACTTTAAAAACCTAACCTTGCACTCTACTGTGGCTATCCCTATGCCACAGTTAGGAACTTTCTCATGCAGCAGACGGTGAAGTTGTTGTAAATAGCCAGGCTTGAAAGTACCATTATCAGCTCTCCACCCACTCTCAACCAATTGTAGTAACGACTCGACCAACTTTGCACCTTCAACCTTAGACCAAATGTGTTTTTTGATTCTCTCACGTCCTGCCATTAAGCTACATACATGCATATATCTAACTCAATTAAAAATTTTGGGAACATATAGATAAATGAATTTATAAGTACGATATACGAAAGACAATTTTCATAACTCAAACCAACAAACAAGCATAATATTGCATAATTGAAAGTAGTACATTGTTTGAGCTTAAAAAAAACATGAACTAAATCAAAGGAAGTTTCAATTCAAAGTAAAGCAATTCGAACAATAAAATATTTTTTCATGATGCTTGTTCCCATGCATTGAACATTAGGGTCGCCAAGTCATCCCTAAAGCTAGTCCATTCGTCAGACGTTTCAATAAATTGAATGTTTTCATCGCCCAGTTGAGAAGATGAAGAGTCTGAATCCTCTGGTGTATCAAACAATGCAGTTGGGCCCATCTCCCTAGTGATAAGGTTGTGAAGTAGGCAACATGCAGTAATTGTTCGACATTGGACCTTCACTGAGTAGAATGACTTCCCACGAAGTATTGTCCACCGACCCTTAAGTACGCCAAATTCTCTTTCTATGCAGTTTCTAGCATATGAGTGTTTCATGTTGAAAAATTCCCTTGGATTAGTCGGTGCATTTCCAGCACCACGCCATTCTGAAAGGTGATAACGTTCTCCCCTATATGGTGCCAAGAATCCCTCGGCATTGGGATATCCAGCATCACATAAGTAGTAATAACCTAGACATGGTATACATAGTTAATTGACAAGCTATTTGCTTTTGAACACAGAAAATGTAGTTACAAAAGCCTATACCCTTGGGAACCTTCAATCCAGCAGGTCGAGAAATTGCATCTCTAAGCACTCTAGAATCGGTTGCGGACCCTTCCCATCCTGGAATCACAAATATGAACTCTCCATTCGGAGAACAGACTGCAAGCACATTTGTGGCAATCTCACCCTTTCTGGTATGATATCGGGGTCGATCAGTCGCACTTACATTCACTTTAATGTATGTGTCATCAATGCACCCAAACAATTCTGTTTGAGTTCATACATCAGAAAAAAAAAAAAGTCAACAAATTTGTTTTCCTTGAAAAACAAATACCTAGTACTTGACCATGAACGGAGATACAATCGTTGTAATACCTCAAACCATTTCCACCTATTGTCAGTGCATTCGTTTGTGATAGACTCTGGTTTTTTTAGCAGTACCTCATGAAGTTGTAGCACTGCAGTGAGGACTGACTTGAAATGCGTTGAAACAGTTTCACCAAACCGTGCAAACTGTCGTCGAACTACACGATTTTTAACGTCATGCGCCAGTATGTGTAGGAACATTGCCACCATCTCCTCGACATCAACACACTGGGTTGGTAGTAGACGACCGGTCGTCCTAAGCATATTACAGAGGATTGTGAACGTCCTTCTATCCAGCCGGGTGCTTTCATGACAACATATGTCATTCTCATAAATAAGTCGAAAGAAGTTCAGCTGCCTAATCCGATTTCTAATGTATGACAGTTGGTTCTCTATTCTACAATGGTCATTCAATAATAAGATTACAGATAGCAAGAGTTGGTACTGGGAGGTGGTATTGATTGAGAGAATGGTAAATAGTTCTGTGTTTTCCATATTAAGGATAGGAAAGATGTGAAACTGTTCCTGAAATCATAAAAGGTAATTTTAATTTAAAAATTATCAAAAGATACATCTAGTGTTGTGCACAACATGCACATCACTTCAACTCTAGTAGACTAATCCCTACATACATAACGACCATTCCAATGATAAACCCTAGTATTAGATATTTTGTATTATTTAATTGTCATCCTTCATTCTCACTAATTGGAGAAGTGTGACGTCCAACTTTACTAGGTGGAACAGTTAGTGATTGGTCAACAGTTGACAATCTAAACTTGATATATGCATACTCTACTTCCTCAACAGTTTCGTATGACCGGTGAAGTGCTCCCCTAAACTCATTGACTTGTCGATAGCATTCGTGCCACGACATGTAAATCCCTGGCTTACGACCAACGAATACAACATAAAACTTCGTTTTTCCCATACTCCTAAACAATATCGAAATCAACAATTAACTATGCTATCAGCAATACAAAAAACTATAGTCATGTTGTAAAAGGGTTAGGTATAGTGTTTTATTTGGGTTAGGCCTCATTTCAAATATATTAAATCGAGATATAGATATTGGGCGTCGGTTTGTATGAAAGTCTAAGCCTACTTTATCCAAGTTTTCCAATTAATTAAATCAAGTATCATACCTAAAAAATTATCTAATATATATGTTTAGTATTTTGAAATACATAAACAAATTTAATATTTAACAAATATAATAATTACATTAATTTGACTTAGAAACACTATTATGTCCCTGTAATTTAAATCATCTTAAATTTAGTCATTCGAAGTTAATTCACTTTTAACTTGGCTAAATAATTATACTAGTTTTAATGCAATAAAGTACAATATATTAATAAATTTTTAAATTTATAATGAAAATGCTAATAACAAAAAATCTTTTGAGTAGTTTGTGTAATTTGCTGTTAAAAATTGAACTTAGCATCTTTATTTAATAAATTAATTCCCACAAAGTAGAAAATTTTACTAGTTTTAATGCAATAAAGCACAATATATGAATAAATTTTTAAATTTATAGTGAAAATACTAATAACAAAAAACCGTTTGAGTAGTTTGTGTAATTCGCTGTTAAAAATTGAAATTAACATCTTTATTTAATAAATAAATTCACACAAAGTAGTAAATTTTGGGTTTGGCTAAAATTTCTCTTCAACTTTGTTTTCTAAATCATACATAAACTAATTAGTCAATATCAATATATTTTTTTAAATCATACAAATTTTCTGTTGATATATGTTTTGTTCTAATGTAGCAAATGTTGTCAGACAAATACAAAATGAGTCAAAGAATGATCGAAAATACCCTAAACAAATCCTAAGAGAATAAACATGAGAAAATGTCCAACAAAAGTCATAGACATCAAAGTATTATACGAGGGTCATAAACATTAAAATAATTTAAATAAAAAGAAAATTACACAAATGTCCAACAAAAATCAAACATCAAACTAGTGAACAAAAGTCATAAACCTTATAGCAGCAAAAAAAAATTACAAGAATGTCCAAGAATTATCCAACATAAGTAGAATGAACAAAACAATTAAATACATCATATTGATGTAACTAACTACAAAGGAAAAAACACATTCCACTAACCTCAAACTAATTTGTGCGAAATGTATTAGTTTGGGGCCTGCATATAATAGAAAATAATGTATTAAGATCAAACGACAAATCTTTGTAAAAAATGTAAGAGGAGTTAATTTTGTGTGTGTGTGTTGGAGGGGGAGGGGGGAAGTTGGGCATATCTAATACAAATTACCTTCTTTCATTCAAATGGTCAAGAAGGAGATCTACACGATAGACCAACTGAAAACCAAAATATAGAACATTAGTACTCAAAGTTGGGTGGAAAGAAACCATAAAAATACATAATGAAAAAAAAAGCCAAGGATGGCTATTAGGAATAACTCACCAGACTACAAAGGAGATCAGTAGGCAATGGAGATTTCTCAAGAGTATAAAACGTGGGATACAAGTGATTTATAGAAGAATAAAGGTGGTGGTTACCTCCTAATCTTATTAATTGATGGTGTGGGAAGTAATGTTGTGCAATAAACACAACCATCTAACCCAATAAAGCATTAAAAAGAGGATAAGACAACACTAATAACTGACCTCAGACATCAATTTTGGGTGTATTTCATGTTAAAAAAATTGTGGTTCAGCAGGTGAACAGGAGACATATTAAATGAAGGTTGGCTGCAGTAAATGTTATACTAATAAGACAGCCAGAATGCATTAACTATAGCAATAGTACATATCGAAAACAGAGATTAAAATTCATGAAAATAACAAGATGAGAGGCACCACACATCCATGAGTGATCAACATTACCAAAAATTCTATGTAATTGGAATGACTTAGAAGAAAAAAAATTTAGATAAAAAAAAATCAGAGAATTTCATCCATCAAGACGAAGTAATAAAAAAAATAGAGAAATCAATCGACAAATTTTAACATCCACATATTTACAGTTGCTATGCATGAAAAAAATGGCATAAATCATGAAAGAAAAATTTAGATTTAGTTCATACCGAAACAAAAAAAAATACAAATTAAATCATGAAGAAATCAAGATGCAAAACAAATTAAGATGCAAATAAAAAAAAATGGAGAAATTAAGATGCAAATAAAAGAAGTGAGAATGGAAGCATCGAACAAAACAAATAAAAAAAAATAAACAGAGGAATCCTATCCAGCAAAACGATCTAATAAAAAATAGAGAAATCAAGATGCAAATAAGAACATCCAACATATTCATAGATTGAAAATCGACATGTATGAAGAAAATAACATAGATTGTGAAAGAAAAATCAGATCTAGATCATTAAAAAAAATCAATCTTGAAAATCAATCATCCAAAAGATTTTACCTGATATGTAAATCAAGAGGTGGGAGTGGAATCATCATTGGCCGATGTAGGTTTAGAAAACATACCTCTTCCCCCTCCCCCCACCTAACTTCGATTTGGAACCCATGGACCCCTACCATGTATTTCCTTCCGCAAATCGAACATTCATTTGATCCCACAAACCCAAACAAAGGAGCGAACTATGAATGAACCAGGAGAATATCAAATCCATGAGGAAAATGAAGGAACCGACAAGGATTTGTCATGGATCATGGTTGAAGTGGCGAGGGGAGTGAGATCTGAGCAACGATCGAGAGAAAAAATCCTTCTCCATAAAAGAATCATGAAATCTGAGAAGTTTCGGAGAAAAAACCGTTCGCCATGGAAGAATTAAGAAATATGAAATGAGGTTTGTGAGGGTTTCAATCAGAGAACCAAATCGGAGAAGAAAAATCATTCTCCATGGAAGAATCAAACAAATCTAAGATGAAAATTGTAAGGGTTTCGAAGCGTATAAAGAGAGTAATGAGTGAGAGAACCAAAGCGTTTAAAGAGAGTAATGAGTGAGAGAACCAAAGCGTCTGAAGAGAGAGGAAAGAGATCACATATCACTGTGCAGATGCCATTTTACCTGATGTGGGGAGTTATCATCTCAGTGATGAGTTGGGTTAAAAACCATCTCATCTCATCTCATATCAGTGGGCCAAACAGCCCCTAATAGAGTTTTTACCAATGATAGAGTATCAATTATACACTCTATCAATAATACGAGTTTATCAATGATAGACTTCGGTTGGACATTTTGAAAATTTGAAAATTGTGTTATATCCATAAATTATTTGATATTGTACTATATCTACTAATATTTTATGCCTAATTGTTATATTTGCAAATGCTACAACTTTAAAACCCTAATTATAAAAAGTAAAAAAAAATGCAAAAATAAGCCCTAAAACTGGGTCCCACTCCTTTCTTCTCCAAAATCCAAACGGGCCCACAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAATAGGCATGATAACTATTTAGTTTTTGCATGATAATTATTTATTTATTTATTTTTAATTTTAAAAATTAAACTTATAAATACTACTACTTCTACTTAATTAACTAGTTTCTAGTTAAGTACTTAAAAAAAAATTCTGATTGCGTAATCTACTTTTTATAAGTGTTTTAAAAAATTAGGTCAAGTTTTGGAATGTAACTAAGAATTCACATGATTATTTAGAAGAAGATGAAAATTATTCTAAAGATATGGTGTAAAAACCAGCACAAATTTTGAGATGAAAAATTAAATGGTTGTCAAGCGAAATTTTAATTTTTCTTATTTTTTATTTTTAAAAATTAAGTTTAGAAACATTAATTTCATTTTTTTTATCTATTTTGAACCATGTTTTAAAAAATCAAGTTTAGAATTTGATTGAAAATTCAAAAATTTTCTCAAGAATGATGAAAAACATAACAAGCATAAATTTTAGAAACATAAAACTTAAAATGGAATTGGTTATTAAACAAATCTTAACTTTCTAAAGTAGAAAATAAACGACAATAATATTGATTATGAAACCCTTTTTTTTTTTAAAGAAAAGTATCGAGAGAATTTAAATTGTTATTAGTTATGGTCAATTTATAAATCTTTACAAATCATATTCTTGACAAGGTGTTCGGAGGCATAGTTGGCGGGAAAGTCACAGCAGCGAGTCTCGTGGTGGTGGTTGCGGCGACGTTTATAAATCCAGTGTTCCATCGACTACCAAGCGAGACGACGGAGGACGACGACGATGTTGATATGACGAAATCCGCCATCAACACTACCGATGAGTCTCCCGTCACAGCAACTACAACAAGCTCGGCAACTCCGATGGCGGTCTGCGCATACAATGCTCCATCTCCCCCGGATCATGGAATGCCATGGGGGCCGAGTTCTCGATCATCTTACTGA

mRNA sequence

ATGCAATGCGTTGCAAGGTATAATTCCTACTATAGGGGGAATTTGGTTGGAAGAGGAGTGGTGTTCGGAGGCATAGTTGGCGGGAAAGTCACAGCAGCGAGTCTCGTGGTGGTGGTTGCGGCGACGTTTATAAATCCAGTGTTCCATCGACTACCAAGCGAGACGACGGAGGACGACGACGATGTTGATATGACGAAATCCGCCATCAACACTACCGATGAGTCTCCCGTCACAGCAACTACAACAAGCTCGGCAACTCCGATGGCGGTCTGCGCATACAATGCTCCATCTCCCCCGGATCATGGAATGCCATGGGGGCCGAGTTCTCGATCATCTTACTGA

Coding sequence (CDS)

ATGCAATGCGTTGCAAGGTATAATTCCTACTATAGGGGGAATTTGGTTGGAAGAGGAGTGGTGTTCGGAGGCATAGTTGGCGGGAAAGTCACAGCAGCGAGTCTCGTGGTGGTGGTTGCGGCGACGTTTATAAATCCAGTGTTCCATCGACTACCAAGCGAGACGACGGAGGACGACGACGATGTTGATATGACGAAATCCGCCATCAACACTACCGATGAGTCTCCCGTCACAGCAACTACAACAAGCTCGGCAACTCCGATGGCGGTCTGCGCATACAATGCTCCATCTCCCCCGGATCATGGAATGCCATGGGGGCCGAGTTCTCGATCATCTTACTGA

Protein sequence

MQCVARYNSYYRGNLVGRGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINTTDESPVTATTTSSATPMAVCAYNAPSPPDHGMPWGPSSRSSY
BLAST of Lsi06G005790 vs. TrEMBL
Match: A0A0A0LXI2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G629020 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 2.7e-35
Identity = 84/97 (86.60%), Postives = 86/97 (88.66%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDD-VDMTKSAINTTDESP 77
           +G VFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTE +DD VDM K  IN TDESP
Sbjct: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240

Query: 78  VTATTTSSATPMAVCAYNAPSPPDHGMPWGPSSRSSY 114
           VTATTTSSATPM VC YNAPSPPDH MPW PSSRSSY
Sbjct: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 277

BLAST of Lsi06G005790 vs. TrEMBL
Match: A0A0D2QB68_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G075100 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 2.8e-08
Identity = 43/101 (42.57%), Postives = 57/101 (56.44%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINTTDESPV 77
           +G VFGG++GGKV AA+ V+VVAATF+NP FHRLP +   +D   +              
Sbjct: 192 QGQVFGGMIGGKVIAATQVIVVAATFVNPAFHRLPCKGDNEDTHQETKHCIHGNVGGGAS 251

Query: 78  TATTTSSATPMAVCAYNAPSP--------PDHGMPWGPSSR 111
            AT + S+T M+   Y++P P        PD  MPWGP SR
Sbjct: 252 GATESCSSTGMSTAVYSSPCPTPLNCQISPD-VMPWGPPSR 291

BLAST of Lsi06G005790 vs. TrEMBL
Match: A0A061GSR7_THECC (AT-hook DNA-binding family protein OS=Theobroma cacao GN=TCM_040931 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 4.9e-08
Identity = 49/108 (45.37%), Postives = 63/108 (58.33%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINT------ 77
           +G VFGGIVGGKV AA+ V+VVAATFINP  HRLP E  +++D    TK  +++      
Sbjct: 305 QGQVFGGIVGGKVMAATQVIVVAATFINPALHRLPCE-GDNEDRHQETKPGVHSNVGGGG 364

Query: 78  -TDESPVTATTTSSATPMAVCAYNAPSP--------PDHGMPWGPSSR 111
               + V AT + S+  M++  Y   SP        PD  MPWGPSSR
Sbjct: 365 GATAAAVGATESCSSAGMSMSVYGVASPSPLSCQISPD-VMPWGPSSR 410

BLAST of Lsi06G005790 vs. TrEMBL
Match: W9T373_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000140 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 4.9e-08
Identity = 47/105 (44.76%), Postives = 60/105 (57.14%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPS------ETTEDDDDVDMTKSAINT 77
           +G VFGGIVGGKV AAS V VVA TF+NP FHRLP       E T + D  ++    +  
Sbjct: 197 QGQVFGGIVGGKVIAASAVAVVATTFVNPSFHRLPGDNQDNVEGTHNHDHQEIKPCVVGG 256

Query: 78  TDESPVTATTTSSATPMAVCAYNA--PSPPDHGMP-WGPSSRSSY 114
           T E+  T+T+++    ++    N    SP  H MP WG SSRS Y
Sbjct: 257 THETTYTSTSSACTHVVSPTPINCQLSSPDHHVMPSWGHSSRSPY 301

BLAST of Lsi06G005790 vs. TrEMBL
Match: A0A0D2LYC4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G133900 PE=4 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 1.1e-07
Identity = 44/98 (44.90%), Postives = 57/98 (58.16%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINTTDESPV 77
           +G VFGG VGGKV AA+LV+V AATF+NP FH LP E    D + + +K + +       
Sbjct: 199 QGQVFGGKVGGKVMAATLVIVAAATFVNPEFHMLPGEGDNKDHNQE-SKPSTHGCVAGGA 258

Query: 78  TATTTSSATPMAVCAYNAPSP-----PDHGMPWGPSSR 111
           T + TS+   M V    +P+P     P   MPWGPSSR
Sbjct: 259 TESCTSTGLSMPVYGVASPTPLNCQIPPDVMPWGPSSR 295

BLAST of Lsi06G005790 vs. NCBI nr
Match: gi|449442723|ref|XP_004139130.1| (PREDICTED: AT-hook motif nuclear-localized protein 17-like [Cucumis sativus])

HSP 1 Score: 156.0 bits (393), Expect = 3.9e-35
Identity = 84/97 (86.60%), Postives = 86/97 (88.66%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDD-VDMTKSAINTTDESP 77
           +G VFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTE +DD VDM K  IN TDESP
Sbjct: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240

Query: 78  VTATTTSSATPMAVCAYNAPSPPDHGMPWGPSSRSSY 114
           VTATTTSSATPM VC YNAPSPPDH MPW PSSRSSY
Sbjct: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 277

BLAST of Lsi06G005790 vs. NCBI nr
Match: gi|659098900|ref|XP_008450342.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 153.7 bits (387), Expect = 1.9e-34
Identity = 82/97 (84.54%), Postives = 85/97 (87.63%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETT-EDDDDVDMTKSAINTTDESP 77
           +G VFGGIVGGKVTAASLVVVVAATFINPVFHRLPSET  EDD+ +DM K  IN TDESP
Sbjct: 180 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETAAEDDEGIDMAKPTINATDESP 239

Query: 78  VTATTTSSATPMAVCAYNAPSPPDHGMPWGPSSRSSY 114
           VTATTTSSATPM VC YNAPSPPDH MPW PSSRSSY
Sbjct: 240 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 276

BLAST of Lsi06G005790 vs. NCBI nr
Match: gi|1009114395|ref|XP_015873664.1| (PREDICTED: AT-hook motif nuclear-localized protein 17-like [Ziziphus jujuba])

HSP 1 Score: 68.6 bits (166), Expect = 8.2e-09
Identity = 50/102 (49.02%), Postives = 61/102 (59.80%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINT-TDESP 77
           +G VFGGIVGGKV AASLVVVVAATF++P FHRLP+E     D+ + TK   N+ T    
Sbjct: 210 QGQVFGGIVGGKVIAASLVVVVAATFLSPSFHRLPNE----GDEAEETKPIRNSNTSGGG 269

Query: 78  VTATTTSSATPMAVCAYNAPSP--------PDHGMPWGPSSR 111
                +   T M++  YN  SP        PD  MPWGP+SR
Sbjct: 270 ANEGCSGPCTGMSMSVYNVASPNPINCQISPD-VMPWGPTSR 306

BLAST of Lsi06G005790 vs. NCBI nr
Match: gi|823227259|ref|XP_012446464.1| (PREDICTED: AT-hook motif nuclear-localized protein 17-like [Gossypium raimondii])

HSP 1 Score: 66.2 bits (160), Expect = 4.1e-08
Identity = 43/101 (42.57%), Postives = 57/101 (56.44%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINTTDESPV 77
           +G VFGG++GGKV AA+ V+VVAATF+NP FHRLP +   +D   +              
Sbjct: 192 QGQVFGGMIGGKVIAATQVIVVAATFVNPAFHRLPCKGDNEDTHQETKHCIHGNVGGGAS 251

Query: 78  TATTTSSATPMAVCAYNAPSP--------PDHGMPWGPSSR 111
            AT + S+T M+   Y++P P        PD  MPWGP SR
Sbjct: 252 GATESCSSTGMSTAVYSSPCPTPLNCQISPD-VMPWGPPSR 291

BLAST of Lsi06G005790 vs. NCBI nr
Match: gi|590584830|ref|XP_007015286.1| (AT-hook DNA-binding family protein [Theobroma cacao])

HSP 1 Score: 65.5 bits (158), Expect = 7.0e-08
Identity = 49/108 (45.37%), Postives = 63/108 (58.33%), Query Frame = 1

Query: 18  RGVVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEDDDDVDMTKSAINT------ 77
           +G VFGGIVGGKV AA+ V+VVAATFINP  HRLP E  +++D    TK  +++      
Sbjct: 305 QGQVFGGIVGGKVMAATQVIVVAATFINPALHRLPCE-GDNEDRHQETKPGVHSNVGGGG 364

Query: 78  -TDESPVTATTTSSATPMAVCAYNAPSP--------PDHGMPWGPSSR 111
               + V AT + S+  M++  Y   SP        PD  MPWGPSSR
Sbjct: 365 GATAAAVGATESCSSAGMSMSVYGVASPSPLSCQISPD-VMPWGPSSR 410

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LXI2_CUCSA2.7e-3586.60Uncharacterized protein OS=Cucumis sativus GN=Csa_1G629020 PE=4 SV=1[more]
A0A0D2QB68_GOSRA2.8e-0842.57Uncharacterized protein OS=Gossypium raimondii GN=B456_009G075100 PE=4 SV=1[more]
A0A061GSR7_THECC4.9e-0845.37AT-hook DNA-binding family protein OS=Theobroma cacao GN=TCM_040931 PE=4 SV=1[more]
W9T373_9ROSA4.9e-0844.76Uncharacterized protein OS=Morus notabilis GN=L484_000140 PE=4 SV=1[more]
A0A0D2LYC4_GOSRA1.1e-0744.90Uncharacterized protein OS=Gossypium raimondii GN=B456_001G133900 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
gi|449442723|ref|XP_004139130.1|3.9e-3586.60PREDICTED: AT-hook motif nuclear-localized protein 17-like [Cucumis sativus][more]
gi|659098900|ref|XP_008450342.1|1.9e-3484.54PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo][more]
gi|1009114395|ref|XP_015873664.1|8.2e-0949.02PREDICTED: AT-hook motif nuclear-localized protein 17-like [Ziziphus jujuba][more]
gi|823227259|ref|XP_012446464.1|4.1e-0842.57PREDICTED: AT-hook motif nuclear-localized protein 17-like [Gossypium raimondii][more]
gi|590584830|ref|XP_007015286.1|7.0e-0845.37AT-hook DNA-binding family protein [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function
molecular_function GO:0003680 AT DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi06G005790.1Lsi06G005790.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3DG3DSA:3.30.1330.80coord: 18..57
score: 6.
NoneNo IPR availablePANTHERPTHR31100FAMILY NOT NAMEDcoord: 17..113
score: 2.0