Tan0015692 (gene) Snake gourd v1

Overview
Name: Tan0015692
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Polyketide_cyc domain-containing protein
Location: LG06: 80572380 .. 80604347 (+)
RNA-Seq Expression: Tan0015692
Synteny: Tan0015692
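The Location field packs the linkage group, start, end, and strand into one string. A minimal sketch of how it could be split apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("LG06: 80572380 .. 80604347 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene span on LG06 works out to 31,968 bp.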
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCTCCCTTCTTAATTCATCAGAACCGTCGCATTCTTCTGGAGCGCTCTCCTGCTTCCTTTCCCGTTTAGCTCCGACATTGCCGGCCATCGCCTCAGCCGCCGTAGTAGTTAAGTTTAGAGTCGACCCTTCTCTCTCATGTGTCGCCATCCCCAGCACCAAATCAAGTGACCCTTCCTCAGATACTACTTATCCTGCCAAACATTTTCGGTCCAGGTTCCGAAACTATTATTCGAATTCCGACCCCACGTTCTCAGATAGTGATGTCAATGGCGATTACTCTGACGCATCAGAGCCAGAAACCATATTGGAAGACGGTGGTGTAAGCATCAAAATCGAGAAGTTGGGAAACAACTCTCGCAGAATTTACTCGAGAATTGGTATTGACGCCCCACTTCAGGCCGTGTGGAACATCTTGACAGATTATGGTAGACTGGCAGATTTCATACCCGGTCTTGCTCTCAGCCAAATACTCTATAAGACTGGCAACCATGCCCGACTCTTTCAGGTACTTTTTTCTTCTCTTCTTTACCGCCCCTTTCCCTCTTCTGGTTTTAACTATGCAGGGGGCTAGAACCCATTTAATGACTTCTGGAGCGCTTTCTTTCCTTCTCTCTGTTTCTCTTGATTCCATCCTCCAGTTTGTTATGAATATATATAAAAGGTTAAATAGACCATAACCCGGCTTGGGTGTTCGAATTTAAGATAATTTGACATGGTAATTCAAATTCCAACAGAGTCATATTTCCAACTGATATTCACAATTTTGTGCAAAATGTCATGAATTAAATTAATATAGTAAGTCATATTTTGAGGAGAGTGGTAAAAATTTGTATAAAAGATTACATATTCCATGTGTGAGAAGAGTGCCTATAGCCCACAACATTTCTACCTGTGTCTGTAAGAATTATATGAGAACCCAAAAGTAATTCCTAGTCATTTCACCAAGGGGTTGGTTGGAGCACTGAGTTGGTTATAATAGTCGGGTATATAATCTGTGTTTAAGATGTAGAGTTATTTAGCTTAGGTTATAATAATTTGTGTTTGGGATGCAGATACAAGTTATTATAACATGCACCCCAAACATGCTTGAATATTTAGTTTCACCCTTCTTCTTGAATAAACTACAAAAGCTCATTCAATATTTTTTTGGGAAGCATCTCCTCCGTGACTCTTGCAAGAATGGCGAAGTGCGTCTTAGATTTCGTTTGCTGTTTTGTATGGAAGAAAGAGATACTTTCCCGTATAATGATTGCTTAATTTATCAACCTTGACATCACTATATATATGTAACATTGTACATTTTGGTCACATTGCTTCTCCTTCTTTAGGTTTTCAACACCCTCTTTCTATTAGGGAAACTGAGGACGTGACGACTCTCCTGTTCTTGCTGGAGAACCTTCGCTGTATCCCAGATACTCAGATAGGAAGGATATCCGTTGTTGGAGTCTTAGTCCAGCTGTGGGGTTTTCTTGTCGTTCCTTTTTTCTAGCCCTAGCCTTCTCTCTTTGCTTCTAGGGGGGAGTCTGTCTTTACGACTGTGTGGAAGGTGAAAATTCCAAAGAAGGTTAAGTTTTTTACTTGGCAGGTTCTACACGGGAGAGTTAATACTCTCGATAAAATTTCGAGAAGGGTACCTTCTTTGGTCGGACCGTGGTGTTGTATTCTCTGCAGGAGGGCGGTAAAAGATTTTGAGCATTTGTTTTGGAGTTGTGACTTGACATTAGGCGTGTGAAATTTCTTTTTTGAGGAGTTGGGTCTATCAGACCTTAACCACAACGTTAGCAGAGAGAGGTTGGAGGAGTTTATCTTGCATCCCCCTTTGAAGGGGAAGGGTTTGTTCTTGTGGCACGCTGGGGTGTGCGCTATTCTTTGGGGGCTTTGGGGTGAGAGAAATAATAGAATCTTTCGAGGGCTTGAACGCTCTCTGACGTCTGGTCTCTCATAAGATTCTCTCCCCCCTTGTTGGGCTAGTCTTTTTGTATGCCCTTGTATTCT
TTCATTCTTTCTCAATGAAAGCTTGGTTCCTTATAAAAAAAGAAAAGGTATTGTAACACTGTATTGAGAGAAAATACTCATTGTATTCACATTAGAGAGTCACTAACACATATATATAGGTATACAAGGAAACTCTAATACTAAAGGATATATATAGGTATACAAGGAAACTCTAGTTAAGAGTCATTTTTTCAATTTTCTTAGAAAAATAAAATGATATTTAATAAAATTTTAGACTTATGGTTTATTAGATGTATATAAATATATATATTTAAAACTAAATAAATTAATCGGGGCGGGGACGGGAGCGGGGGCGAGGACGGAGAATGTATTCCCCGTTCCCGTTCCCTCCTTCCCCCGTTCCCCGCTTAAACGAGGATAACGGGGTGGGTCCCCGCGGGGAAAATAGACATCTCTAGGTAAAGGGTAAATCCTCAAAAAACGCAACATAAGGAGAGACAAGATACTTGTTAAGACTAGGACAATAACAACGATACCCCTTTTGAACACGAGAATAACCTAGAAAGATGCATTTTAAGGACTTAGGATCTAATTTCGTACGATGAGGACTGATATCTCGAGCAAAACAAGTGCAACCAAATATTTTGGGTGTAATTGGGAACACAGATTTAGTTGGACAAAGGACACGATAAGGGATCTCACCATGAAGAACTGAGGAAGACATTCTATTTATCAAAAAACAAGCTGTGGAAACTGCATCTGCCCAAAAGTGTTTTGGAACATTCATTTGTAAGGATAATGCTCTAACTGTTTCAAGTAAGTGTCGATTTTTTCATTCAGCGACTCCATTTTGGGATGGAGTATCAACGCAGATTGATGAAGAATGCCATTTATACTCAAGTAAGATCCCAACATGTTAGAAAAATATTCACCCACATTATCTTTTTTTTGACAAGGAAACAAGAAATATTCATTGAAGTAATGAAAAGAGGCGGAGCTCACAAAACAAAACAAGTACAAAACAAGAGACTAATGCTCGATGTACAAACAAAACAACCCAAGCAAGAAAGACCAAAATAAACCCAAAGGCAAAACCAATACAAGGCAAAGAAACCAAGGGTGAAGACCAAAAGAATTCCCCAAAGGAGAAGCAAGTAAAACCAAAAAACAACTCCAAAAACCTTAAACCAAAGCTTTACTTTCCCTTCAACCCCGAAACAAACACAACCCGAGACCTGCCATGAGACCCTCGAGAAACCAAACACTCCAAGCCTCCACCAAGCAGCTAGGCTGACTCCGAGAAGCCACCAAACCAAACTTCAGAAACGAGCAAACTAGAAGAATCTACAAAGGCATCGACCTTCCCAAAACTGCCAGAAGCTTCAACATGAAACCGACCAAGCCGAACGAAAACAAACTGATCACCAAAAACAAACACGAAAACGAACTCTGATCCCTGGAAAAGCACCTCAACTGGGCAAAACTTCAGAAATAAATCAACAAAAATACTTCTCTTAAAACCACCCCACACCCCAAAAAGAATACGAAACAAACACCATTAAATAGGAATGGATTTATCAACTCCCAACACTTGTTTAAATTGTAACCCACTCGCCTTTATGAGAGATCTAAACCTGCCCAACTCCTTAGGGCACTCCGAATCACCCTCTTCCACCAACAATGGACTTCCTGGGGAATCAAACAACCTTGCCAGATTGAGAATAGGTGGATCCACCCCTAAATCTTTAGTTAAGGAGAAGCAATTGGATCCTAACTCCGAATTGCTCAAACTAACCACCGAACCCACCATCGAGTCAAGATCAACCAAAGGGGAGAATCCACCCACATCTTTAGGAGCTGGTGTAGAATGCTGGAAAGAAGCTTTAAACAAAGATACCCCCATAGGGCCAACCTGACATTCCTCAATGGTAAAGGAAGGGATGCTTGAATCAACTTTGTCAAAAACTTTAGCACGAGGCAGAGGAGAGATCAGTGGTTGCAAGATACACTCTGGATGGTCCAAAAAATCAGGAAAATCT
CCCTTCTGCTCCACAAGCTCCACTGTGCTCACCTTTCCTTTCCTTGAATATACCTTTCCACAGAAAGCCTCCTTCATTCCATGAGAGCAACAGCGCGCGGGAGGAGAAAAAGACCTTATGCTGGTAAGACAACCGCACATCCTGCCAAACAAGCTCCAGACCTACCAACCTCTACTAAAGGCCCAACTGACAGATCGGTTGCAACTCCATCCAAGTTACGAACCAAACTATCCTGCCCTAACTCAATAAACAAATCATCCCCCACCTTGACGACCTCCACTGAACCTTCCATCACAATCCCAGAGGGTGCCCCCACCATCGGATCGCACAAACTGTGAGAAAAAAGATCATCAACAGGGGAAGAGTACAAATTAAACCCCAGCTCATCATTCATAATCGCAGAGATTCGAGCCAGATCCAATGAATTTGAGACATCCCTTAATACCAAGGAACCCGATAACAAAGGAGGAGGATCCGCACACTCCACCTCACCAAACTTCAACAAAAACTCCCCCAGAATTGGGTCAACCACACTAATTGAAGCCGGGAGGAAACCACACAAGTTCTGAGCCACCTTGATTCTAGCCCCTGAAAGATCAAGAAGATTCAAGGTCTCCAAAGAAATATCCACCAAACCCCCCAGTCGCAAACCAATTGCTTCAAAAACCTCCCTTTTCCAGAGCTTCAAAGGCAAATTCTTTATTACTAACCAACCCCCATACCCCCCAATTACGGCCGGTAAACTATGTAGACGTTCCGACCACCTTTCAATTTTCAAATGGAGATTACTAATTACTTGCCATTTCCCAATGCTGGAAACAAAAAAATCATTGGCAATACCATCAACCAACAAAATGGCTTTGTCAACCATGAACGGATTGATTTGAAACACCACTCGAAAATGAGCTTCTAACCCTTTCTTGATAGTCAACCAAGGAATATGGGCAAACAGACGAGTCACCACAAGGATCCTTGAGAAGTCAACATCAACTACCTCTCCAGCCTTCCAAACCCAAAGCCTTTTCACCTGAGCAGAATCACTTTCCAGCAACGAGCGATCCCTCACAGCCCTCAAAGCAACCTCGCCGAGTCCCTTAACGGGAACCCCCTTAAAACTACTAATGCCATCCCCAACCAAATTTGGCGAAGAGATACCCGCACAACAAGGTTCCAGCGCACCCACACCCTCCGAGGAATTGATGAAGTCACCAAGCAAATCCTTAAAGACTTTCCAACCATTCATAAACCTTCCAAGAGGAACGCAAAATGTTTTCTTACCTCCTGAAGGCGGCCATACAACACCCACCGCATCCCAATCTGCATACGAACACACCTTAAAGAGGGAAAACCTCCTTAACTGTTGAATTCCCAATCCAAACATCCTCCCAAAAACGTATCTGGAGACCATTTCCCAACCGATAATGCACCAACGACCTCACCTCCTCCCAGTGCCTCATAATACTTTTTTTTTTGACAAAGTGACAATCAAGATTTCATTAAGATAATGAAAGAGACTAGCGCTCAAAGTACAAAAAAAGACAAAACTCAAAAGAGAAGGGTACAAGAAAAGATTCATAAACCACAAAGTCATGACCATAACCCCAATACCCCTGGAAAACAACCTGTTTAGCAAACCAGACCAACCACCAAGAACTATCCCTATAACAACACTACATTTTGAATAAGAAAGGTGCCAGAAGCGGTTAGATCAGAAATAGAGTAGCTCTGTAAACTCTTTGCTTTGGGAGCTCCAAGAAGAGGCTTTGATAATAGCTGAGTCAAACCTTTGCGCCCACACCCAACTTTTTTCCTCAAACAATCTCCTATTTCTTTCGAACCACAATTCGAAAGAAGTGCTTTCACCCCGCCAACCACAAGACACCTTTGCCTTCGAACCAAGGGCGGCCTACCAACGGTGACACACGCTTCAGTCCATCGAGCAATCAAAGACCCAACTAACATTAAACACTCCAAAAACTCCTCCCAACAATTACTAGCAT
CACGGCAATGAAAAAAAAAATGCACCATTGTTTCTCATTCCCCAACATAGCCCATGGATGGATGGTGGAGCTTATAGTTTGGGTTGTTTTCTTTGGAGAACCTCTGCGTGTTCAACTTCCCTTTTATCGTAATCCACATAAGGATGTTTATCCTCTTAGGATGTAGAAGGAGGACAGATTGCTTTGAACAACAAATTGTCCATCAACTCTCCATTGCCAAGATAAGCCACAAGAGATTTAACTGAAAAGACCCCCGAGGCCTCAATTGTCCACCTCCTCCCATCCACCCCATCTACTGGCGTTAAATGATGAATAAGCCCCAGCAATCCCATCAACTCTTCAGATTCTTCATCCTTCAAGTTTCTCCTAAAAGAGATACTCCATGACCTTGTGTCGCCATCCCAAAAATCAGAGACGCACCCCTGGGGATTACACGCAACCCGACTCAGCAGAGGGAATTGAGCCTTAAGAGGGAGATCCCCGGCCCACACATCATTCCAAAACAAAATTCTTCTGCCATTCCCAAGCTGAAATCGGACTAAAGACTCTGCCATCCCCCACTGTTTTGCTATGCTCAGCCAGGGACTCCTCAAGCTTCTTCCTCTATTTACCAACGCCGTCCACCCCTCCGAGGAGACACCATGTATGCTCGCCACAACCTGACACCATAGGGCTTAGGATTAACGCCAAACTTCTGTGCCCATTTGGCCAATGAGAGCCTGCTTCTTCTTCTTCTTAAATCACCCACCCCAGATCCTCCAACCGCACGATCCCGCCACCCGATCCCATCTCACTAAATGATTAACCCTTGACCCCTGCTCCCCTCCTGAGAAAGTTTCGATGAACTTTTCCAATTGAAGGCACACACCTGCTGGGACCTCAAATAAGGAGAAGAAGTACAACGGAAGAGCAGTCAAAACCCGACGAACATAAGGTCATCCTCCCTCCTCTAGATAGGTTGAACTTCTTCCATCTATCAATCTTTTCCCAAAATTATCCACAATTGGTTGCGAGAATCCAATGTGCTGCCGGATGGCCCCCAAAGGCGACTCTAAGTAGGCAAAGGAAGAGTCCCCGCCACACAACCCAACTTGCTTGCCATTGACTCACCTCGCTTTAGCAATATTAATACCAACCACCGAGGACTTCATCCAATTAACCTTTAGACCGAACAAGACTCGAAGAGTGCTATTGTTTTTATCAACGCATCCATCATCCTATCATCAAACTTGCAAAAGAACAATGTGTCATCCGTGATACCGCAATATTGATATATGAACCCTTTGCCTTCCAACCACAAATCCTCTAGCAGGACCTGCTCCTCATTCGCCGAATGAGGTAATTTAGACCTCTCAAACCAATAAAAAGCAGAAGGGGAGAGAGAGGTCCCCTGTCGCAACCCCTAGAAGCCTTAATTCTCCCCGCCGGACTACCATTGATGTAAACCGAGAAGATAGGCTCTCTAATGCACCCCATAATCCATCGAATCCATTTTACATCAAACTTTTTAGAACCATTACCTCCCGCAGAAAGTCCCAATCCACTTTGTCAAAGGCCTTTTCCAAGTCAATCTCGTAATCCATCCCTTCGCTTCTTAACCCCGAATTCCTCCACCACCTCACCGGCAATCAACACAAGGTCTAAGATGTTTCTCCCACTGATGAAAGCACTTTGAGTATCCGCAATTATCGAGGGCATGACATCTTTCAATCTTTCGGCCAATACCTTTGCAAGGATCTTATATGAAGAATTAACCAGGCTTATTGGCCGAAAGTCCTTTACAGCCAGTGCCTCATAATACTAATCCATGGGCTTCTCAAACTTCGCCCCTTTGCGACGGTAGAGTTCCATCCATCTTTCGAAATCCCATGTATACTAGCCACCACCCTCCGCCATAATGCCGAGGGCTCACCCCCAAATCTCCACCCCCACTTTGCCAATAAGGTTATATTTTTCCTCCGTAAATCCACAAAACCTAGCCCTCCCAGCGACCTATCATG
AGTTACAAAATCCCATCGATCTAAATGATTAATTCTCAACCCACTCCTCCCTTCCCAAAAGAAGTTTCTCATGATTTGCTCCATTTTTTGCGAAACTGCTGAAGGCATCTCAAATACCGAAAAGTAGTATAGCGGCAGGCTAGAGAGAACCGATGAACACAAGGTCATTCGACCTCCCCTTGACAAATTGAATTTCTTCCATCTATCCACTTTCGTTTCAAAGGCATCAATCACTGGTTGCCAAAATCCCAAAGTCCTTGGATGACCACCCAAAGGTAATCCTAAGTAAGAAAAGGGCAAACAATCTGCCTTGCATCCCAAACGGAGAACTCTACTAAAAACTTCATCCAAAGAGGGGATCTCGGATCTTGAGAGAATTTGTGATTTGGCCATATCAAATTCAGAGGAAAGATCATTAAGAAAGCTCATGACACCAATCTTTTCTCTTTGAGTTTGTTGAACCTTTATGTTTGTACTAAATGGGAGCAAAGTGTTAAATTCTGCACATGTCTTCTTAAATTGCATAAAATAATTCGTGACAGATCGGTCTCCCTGTTTCGCATGAAAAAGAGCGGTGCAAACTTCATACATTCGGTTAACTTGCTCTTTTCCTGAGTCTAAGAATTCTAAGAATTCGAGCAGTTCTTTTACAGAATCACAGTGATTTACCAATCCAACCACCTCACTCTCAATTGAATTCTTGATTTGAATATACAAACGAGCATCATCGCGCAACCAGGTCCTCCTTGTATCGTCCTCTGGTGGATCCTGATCGATATGATCATCCATCTTAGTGCTTCGTAAATAGAACCTAATTGTTCGACTCCAATCATAATAGTTTGAACCATTCAACTTATGCTCTGTGATCTTTGACGAAAGGGGAACCAGATTAGGTACAACCATAGGTTTCATCTCAACCATTATTAAAACAAAACAGTAGGTTAACTTCAAGGAAACAGAAAAACCAGATCTGAACAGTACCAAAAAGTGCCTTAATCTCACCCGAATGCAAAAATTGAGTCAAACGAAGGGACTAATGTCGAATCCGAGGTGATAGACAAGCCGGGAAATACGAACAGAGATGAACCCAAGCGGCGGCGACCCGATCGACACGGTCACGCACCCCCATGCGCCGACGCGTGACGGCGACAACAGAAGCTTTCGGCGGCGCGTGAGCCTCACGCGCAGCGGATTCCGACGAACTTTCGACGGTGAACGTCCTTCCCTGACGACGACACTCCTCCTAAGGTAGGTGGTGTTGACAAATGGCCACCCTAAAATGATGACCCAAACCTCAAGCCCTAACCTTACTTAGGAGGAAGAAACCCTAATTGTCAAATGCAAACCCTAAACCCTAGGGAATACTCTGATACCATGTAAATCAGAATTGAGAAAGAAAATATTCGTTCTATTCACATTTGAGTTCTATGACAAATATATATAGGTATACAAGGAAACCCTAATACTAAAGAATGTACAATTACGATAAAGGACAACAATATATATTATAACAATAACAACTTCCATTGATATAATGAAATGAATCTGAAACATCAAAAACAAACTCCAACGGGAGGAAACACGAGCAAGAAAAAAAGAAAGACTAAGCCTCACACAGAGCAACCAAAAAATAAAAAAACAGAAGCCTTCCAATACCCCAAGACTCACAGTTGGCTTCTGTATTATATAAATTTTTTACTGGGGGGCGGGGGGCTTTGAATTTTAATGTGATCAAGTACTTCAGTATACTGCCAGTACGCTTTATTCCTTGGCCAGGAAACCCTTGATTTGTGTTCCTGTATCAAAATTTTTATTGAATAATGCCTCTTACAACCTAGGGACCCCTATTTAAAGGCTACAAGCCAACCTTTAAGTTAAAGACATAGTAACTAAGTCAAATGACAGCAGTAAAACTTAAATGACAGCAGTAAAACTTAAGACCTAAAAAACTAGAAAAATAAAAAGACAGAAGCCTTGCCTTTATAAAAGGACACAAAATA
GTAGGGCACAAAAGAGTACACTACAGTACACTATATCAATTCCCTCCCCCTAAAGATAACTCATCCTCGAATTAATCAAGTAGCCAATTGAAACTCATCCGGAGCATGGTAAGCTGTGAGATCCGAAATATTAAAGGTAGGAGAAATTTTGAATTGTGGCGGAAGATCAACTTTGTAGGCATTTGGTCCATAGCGTTCAACAATAGGGAAAGGCCCAATTTTCTTAGGCTGAAGTTTGCCATGAACACCCAAAGGTAATCTAGATTTCCGCAAATGAACCATACAAGATCTCCAATTGCAAAGGATTGAGGTCGGCAGTGCTTATCAGCCAAAAGCTTGTAAGAAGCATTGGCAGCCTCCAAATGGTCACGAACTTCTTTATGGAGTTTGGTGATACGTTCTGCCATTTCCTCGGCCTCCTGATGAACATTCAAAGTAGCAGGTAATGTAGCAAGGTCAATTGTTAAACATGGAAGTTTAGTATAGACTATTTCAAAAGGTGACTTCCCTGTAGATCGATTTCGCATATGATTAAAAGCAAATTCCGCTTGAGGCAAAGCTAGGTCCCATTGTCGAGGTTTTTCTCCACTAAGACATCGAATCAAGTTACCAAGTGTGCGATTTGTGACTTCGGTTTGGCCATCTGTTTGAGGATGACTAGTGGAGCTAAACTTCAAACATGTATCAAATTTCCTCCAAAGAGTTCTCCAAAAATGACTTAGAAATTTAACATCCCGATCAGAGACTATAGATTTAGGAACACCATGCAATCGAACAATCTCTTTAAAAAACAAGTTAGCAATTGCAATGGCATTAGAAGTTTTTTTTGCGAGCCAAAAAGTGCGTCATTTTGCTAAAATGGTCTACTACTACTAAAACGGAGTCATAACCACGTTGTGTCCTAGGCAATCCTAAAACAAAATCCATAGAAAGGTCTTCCCAAATAGATTGTGGAATAGGTAAAGGGGCATAAAGTCCTGTGTTTTGTGATTGACCCTTTGCAGTTTGGCAAATAAAACATCTTAGAACAAAGTTATTGCATCCTTTCGAAGTTGTGGCCAAAAATACCTTCCCGAAACAAGCTCAATGGTTTTATCTCTCCCTAAGTGTCCTGCTAAACCCCCAGAATGTAGTTCCTTTATCACTTGCTCACTTAAAGATGTGTGAGGTATGCACAAGACATTTCCCTTAAACAAATATCCATCAATTATATGGAAGTCACTAGGGTTAACATGGTGGCAACATTTATTCCAAATATCACGAAAATCAACATCAAATTCATATAAAGATGGTAACTCCTCAAATGCAATAATTTCACCCCGGAGTAAAGTAAGTAGAGAATGTTTCCTACTAAGGGCATCCGCTACTTTGTTAGTATTTCCAGCTTTATGTTTAATGAGAAAATCAAATTTTTGAATAAAGGTGATCCACCTAGCATGCATCCGGTTGATATTCTTTTGTGTTTGTAAGAACTTAAGAGAATAATGATCGGTATATAAAATGAATTCCTTACTTATCAAATAATGTTCCCATTGTTTGAGTGCTCTCACTAAGGAGTATAGTTCTTGCTCATAAGTGCTCCACCGTTGCCTAGGTTCACTAAGTTTTTCGCTATAATACTCAATGGGATGGTTTTCCTGAGACAAAACAACACCTATTCCCACTCCTAAGGCATCTACAGATACTTCAAAAATTTTATTAAAGTCTGGTAATGCTAAAACAAGGGCTGCACATAATTTGTTTTTTAGGTTCGTAAAACTGTCTTCTTGTTCATTTGTCCAACCAAACCTACCTTTCTTTAAACATTCAGTTAGTGGTGCAGCTATTGTGCTAAAGTTTTTGATGAACTTATGATAAAAAGAAGCTAATCCCAAAAAACATTGTATGTCTCTAGCATTTTTCAATGGGGGCCAATCTCTAATAGCCTTTATTTTTAGGGGGTCCACGGTTATGCTGTTGCTGGAAATGATAAAACCTAAAAACTTAATTTTTGTTTGTAA
GAAAGAGCATTTTTTTAGGTTAATGTAAAGCCGGTTCTTCTCTAATGTGGTAAATAGTGTCATAACGTGTTCTAGATGTTCTTCTTGGGACTTACTATATATTAGAATGTCATCAAAATAGACCACAATAAATTTATTAAGGTAAGGATGCAGGACCTGATTCATCAATCTCATAAAAGTGCTTGGGGCGTTGGAGAGGCCAAAAGGCATTACCAACCATTCAAAGAGTCCGACATTTGTCTTGAAGTCGGTCTTCCATTCATCTCCGGGTCGAATTCAAATATGGTGGTAACCACTTCGTAGATCCACCTTAGAGAATACCCTAGCACCACCCAATTGATCAATTAGGTCAGATAAACGAGGAATAGGAAATTTGTATTTTACCGTTATTTTGTTGATTGCTCGGCTATCCACGCACATTCTCCATGTGCCATCCTTTTTAGGAATTAATAAGACTGGTACAACACATGGGCTAACACTAGGTTGTATATGCCCTTTTTCTAGGAGTTCTTGAATGTGTTGTTGAAGTATTTCATACTCCTTGGGGTTCATTCTGTAATGCGGCAAATTGGGCAAATTAGAGCTTGGTATAAAGTCAATTTGGTGCTATATGTCTCTAAGTGGAGGCAGTGTGGTCGGATTGTTTGTCAAATGTGGAAATTTCTTCAATATTTTCTCAACCATGGGGTCAAGTTGTTCCTCAGTGTCTTCCTTTTTGTCTTGTTTCTGTACAACAACCCAGATTTCTTTATTTATGTTTTGGGTGAATTCTTTAGATCCCAAATGGTGAATAGGTTAGAAGAACCTTCAATCCTTGGTTTTTAGTGGTGTTGTTCAATGGCATGAGTACTATCTTTTTGCTCATCCAATGAAACTCATAAGTATTCTCTCGCCCTTTATGTGTGGCTTGAACATCATATTGCCATGGTCGTCCCAAAAGAACGTGGCATGCATCCATATCAAGGACATCACAAATAATTTGATCCTTGTAATTACTCCCTATAGATAGGGGAATAGTGCAAATTTCATTAACATGGGCCTCTCCCCCCTTTCGTATCCAACTTACTTTGTATGGATTTGGGTGAGGATCGGGCTTCAATTGAAGAATTTGGACAGCCTTGCGTGAAATTATGTTTTCACTACTTCCGCTATCAATGATCACATTACAAATCTTTCCATTGATAGTGCAACGTGTTCTAAAAAGAGCATGGCGTTGAGGGTGAGACTCACTTGTGGGAGTGAGTAATATTCTTTGGATAATGCAAGATAGATGCTCTCCATCATCGGCTTCCACATAAGACACATCTTGTTCATCGGTCTCTTCTTCAAATTCTTGCTCCTCATTGTGTGGATTGTCCATAATGGTTAGAGTCTTCCTTTGAGGGCATTCATTAGATAAATGTCCTTGTTGGCCACATCTAAAGCATTTGCCCAATGTTGGTCTGTTGTATATATTTGCTTTCTTCATGTTGTTTCCTTCATTACCCTTATTACCGGAAGTTCTAGGATTACCCTCTTCTTTACCTTTGTCAATATGGGGTGTAGAAGAAGGGCCTCCAATCTGCCAATTTTTTTGTACTTCATTAGATCTTCTAAAATTTTGGCTGCTTCTTTCCCAAGGTAATTTTCTGCCGGGGGTGTTCCTCTTTGTTTGAAGCTATGGCTTTCTTCCACCTTTGTTGCTAAAGCTACAACTTCTGTAAGATAAAAAAGGGGCTGTAAATTGACCTTTTTCTTGATATCTTCTCGTAAACCATCTACAAAGCGAGTGATCTTGTGTTGTTCAGTTTCAGACAAATTGTTTCTTGCACATAATCTATGAAACTCTTCTGAATATTCAGCTACTGTTCGGTTATTTTGAGTGCAATGTTGATATTGATTATATAGGAGTTGTTCGTAATTAACGGGAAGGAATGTGCTTTTCAATAGTTTTAGCATCTTGGACCAAGACCTAATAGGTCCTTTTCCATATCTTCTTCTATTGATTTGCACTTGGTCCC
ACCATGCGGAAGCTCCCTCTCTTAGTTTGTATTCACCAACTTCACTTTCTTGCCTTCGGGAGTGTTGGTATAGTCAAAGAAAGCTTCAACTTCTTTGGACCAATCTAAAAAGGCCTCAATATCAAATTTCTCACTAAAGCCTTCATTTTGTATTCATAATTTTGTGGAGGGTAATTTGGTGCATTGTCCCTTTGTCTTTCTTGAAATTCATAATTTCTTCCTCCAAACGGTTCACCCTCATCATTAGAGCTATCAAGTCCTATAAATCTTGGATCTTGAATTTGAGGAGGTGGAAATGCATTTCGTGGATATTCTTGCTATCTTCTTGCCCTTTCTTGGGGATGTCTTTGGTGGAATAACAATTCTTCCTCCCTGTAATAATCTTGCAGTGGATTGTGGTTGTGGGGTGGTATATTTCTTGCAACTTCTTGATAAAGGTTTGACCCATGTTGAGGCTGTCTTCCAAGGGATTGACTGGGGCCACCTTGAAGATCTTGGCTTCTTGGTTGTTGGTTATTATTTGGCCGAATGGGCATCAAGTTCATTTGTTCGGTCAAGGCTTTCACCATTTGGTGAAGCTCAAGTTGGCTATTTCTCATTTTTTCTATGGAATTTTCCATTCTAGAGAGATGATGAAAGATAGATCTTGAAGATAGGACTTCAACTTCTTCACCGGTTCGAACAACGTCAGTACGGAAGTGTCACCGGCTGGTTGCCTTTTTCCAACCATTTTGGTTCCCAATTTTGGTCGCTCTGATACCAAAACTAATGTAGAACCAATCTTGAAGGTTTCTTGAATCAGAATTTTTATTGAATAGTGCCTCTTACAACCTAGGGGCCCCTATTTAAAGGCTACAAGCCAACCTTTAAGTTAAAGACATAGTAACTAAGTCAAATGACAACAGTAAAACTTAAATGACAGCAGTAAAACTTAAATGACAACAGTAAAACTTAAGACCTAAAAAACTAGAAAAATAAAAAGAGAGAAGCCTTGCCTTTATAAAAGGACACAAAAGAGTAGGGCACAAAAGAGTACGCTACAGTACACTACATCATCCTGCCTCCTGCATTCATGATTTTTGTCAAATATATGAAGGAATTAGTATCTCCACATAGAGAGATGATGAGTTGGAGATAGAAAGCTCAAGTGAGATGGACGTTATTTGGAGATGTTAATTCAGCTTATTTCCATAGAATTGCGAGTGGAAGACGCAACAAAAACCTCATCAGTTCCCTAGAGAGTGATAGTGGTGCGGTGATTCAGGGTGATGAGGAATTGAAAGAAAGATCATCTCTTTCTACACTTCCCTTTATGCTCCTCCTGTCCAAGCTCGGGGGTTCTTGGAAGGGGTGGATTGGAGTCCCATCTCGAGTGCTGCCAACGATGAGCTTGTTAGTCCTTTTACTTTGGAAGAAATTCGGAAGGTGGTGTTTAGTTTTGATAGGAGTAAAGCGCCTGGCCCTGATGGCTTCTCCATGGCCTTCTTCCAGGATAGCTGGGACACTATCAAAGAGGATATGGAGAAAGTTTTTCGTGAATTCTTTGAGAGGGGGATCCTGGATAGCTCTCTTTATGATACGTTTATTTGCTTAATTCCTAAAAATGAGGTTTCTCGTAAGGTTCGTGACTTTAGACCGATAATCCTGGTGACAAGTGTGTATAAGATTATTGCTAAGGTTCTGGCTTGTAGGTTAAGGAAGGTGCTGCCCTTGACTGTGGCGGATTCACAGGGAGCTTTTATTTCGGGTAGACAGATTTTGGACCAGTCCTTGATTGCCAATAAGGCTATTGAGGATTACAGATCCAGGAAGAAGGAATGGATCCTTTTCAAAATTGATTTTGAAAAGGCTTATGACCACGTGGATTGGAATTTCCTGGATAAGGTCTTGGCTAGAAAAGGCTTTGGATATAAATGGAGATCCTGGGTGTGGAGTTGCGTAAGATCCGTCAAGTATTCTATTATGGTAAATGGAAAACCGAAAGGACGGATTTGTGCTAC
AAGAGGTCTCAGACAAGGGGATCCAATCTTGCCCTTCCTCTTCCTTCTGGTGGTGGATGTCCTCAGCAGGATAATCTCGAAGGGGGTTGAGCGAGGGGTCCTTGAAGGTTTCAGGATGGGGATGGACAATATACCCTTATCCCATCTTCAATTTGCTGATGATACCCTTTTCTTTTGCTCTGGGGATGCGGCTTCTTTCTGTCATCTGAACAACTTCTTAGCCTTCTTTGAATCTATATCGGGGTTGGAGATAAATAGGGGAAAGTGTCAGCTATTGGGCCTGAATTGTGATCCCGAGAAGCTGAGGACTTGGGCTAACCGAGTTGGTTGTGAGGTGGGATCCTTTCCTTCCTCTTATTTGGGGCTTCCCCTTGGGGATAATCCTAGGACCCTTTTTTTTTGGGATCCCGTGGTGAGAAAGGTTCAAAAGAGATTAACTTCTTGGAAGAAGAGTTTTTTCTCCAAAGGAGGAAGACTTACCTTGATTCCGTCTGTGTTAAGTGGAATACTTGTTTATTTCTTGTCACTCTTTAAGATCCCTGCTGCGGTGTGTAAGCGAGTTGAAAAGCTTATGAGAGACTTTCTTTGGGAGGGTGTGGGAGAGGGAGGTGGCTCGCACTTGGTTAATTGGGACCTTGTTTCAAAACCTGTGGATCTCGGGGGTTTGGGGATAGGAAACCTGAGAAATCGCAACAAAGCCCTTTTGGCGAAGTGGCTGTGGCGTTTTTACCATGAAACCTCCACTCTTTGGCACATGGTCATTGTGAGTAAGTACGGGGCGTCCCCTTTTGAGTGGATTACTTCCGGAGGGAGAGGCATTTCTAGAAACCCTTGGAAGGGCATTGCTGAGGGTTTTCCTTTTTTCTCCGACCTGGTTACTTGTTGTGTGGGGGATGGGAAAGATACTTTGTTTTGGGAGGATAGGTGGTTGAGGGATAGACCTCTCTGTTCGATGTTCCCTTGTCTTTATCATTTATCTCGGGCTAAAAATAGGGTCGTGGCGGAGGTTCTCTTCCTTTCTGGTCACATCGCTTCTCCTTCTTTGGGTTTTTAACGCCCTCTTTCTATTAGGGAAACCGAGGACGTGTCGGTTCTCCTATCCTTGGTGGCAGTGTTATCAAGGCGCACAGAGGCGCTCGCCCGAGGCGCAAGGCGAGGCGATGTGAGGGCGTGGCGCCTCCAACACCTGCGAGGCGGGCGACTTCAATGGGCGCGCGCTGCTTTTGCCTGGATGCGCCCGGACGGTCACGCAGCGCTTTGTCATGAGCGTTCAAGCTTTTTTTTTTTAAACTTAACTTCATATGCACGAAACCCTAAATGCCAGCCCACACATAAGCCCATAAACTCCATGCAGCATGCATTTGACGCTTCGTAGGTTTTTTTTTTAATGAAAAGTTGAAAACGTTAAACGGCAATGACTTCTCTTTTCCTCTATCTATTGAGTATCGATCAACAAGTCTCCATCTCCAAATAGTTTCTTCTTCTTCATCGACCAATTCTGAACACCAAGTTTCTCTTGCTACTATCGATCAAGTCATCATTAATTCGCATTGGTTCTGTTGGACGTTGGCTTTCTTCAAATTCTTCTTTCTTTTTCATCGACTGCTCACTGCTGGTAGTGGTACTGTTTCTTCTTCTTTCTTATTTTGTTCTTCATCGATTGCTTCTGCTGGTGGTGTTGCCGTGTTGGTTGCTTCTTCTCCTCCTCCACTCCATTTGTTTGGCATTCGTTGGTTGCCTGGCTGCTCCTAATTCTTCTTCTTTCTTCTTCATTGATTGTTGGTTGCTACTTGCTATTTGATAGCTACTGCCTACTGCTTGGTGCTTTGCTTCATCGGTTGTTTCTTCTCCATCAGGTAAGCAGAACATGGTTACTTCCTCACTAACTCGTTCATTCCTCTTCTTCTATTTAACTTCAGCCTTGAACCAAATTATTCATCTTCTTTAATTATTCTTCTTCTAATAGGAATCAATCAATATGGGAGATGTTGAGACAAAAA
GAAAGCACCCCAGTTGGAAATATGTCATCTTGGAGGATGAGGACACATACTTGCAAATGTACCTTTTGTGGAAAAATCACAAAAGGTGGAATTTGTCGAGCAACACATCATTTAGTTGGTGGAAATCGAAATGTCAAAGCATGCTTAAAGTGTCCAACTAATGTAAGAGAGGAATTGAAGGAGTATCAAACAAAAACAAAAACAAAGCGAGAAGAAAGTAATGCATGAACAAACTATGAGATTATGATTTGAATGCAATTGATGAAGATGATGAGATGGATGTCATTGGAGGATCAAGTTCAAGTACTGGAACATCTAGAAAGCGAGCTCTTGGTAGTGGAGAAGGATTTAGTAGAAGTCGATCATAATTGAACTTGAAGCCACTACGACAAAAGGGAGCCATGGATAAATTTGTTTATCCCGATCCAACCAAAGTTGTTGATGATAGGAATAGGATGAAGAAGTTGAAGCAATCTAGCATTACAGAAAAATATAACAAAGAGTGAGGTGTTGGGTTGGTGGTTTTTGTTGGGTTCACGTTGAAGCTTTTGGCAGATTTGATGGGTTGTTGGTGTTGTGGTGTTTAAAGCAGTGGCTACTGGAAGGGTTTGGCTCGGTTGGGTGAGATCTCCAGGTTGGTGGTGTTGTCAATAGAGGAAGGTGTTGGGTTGGTGGTTTTTGTTGGGCTCCTTCTCTGCTGGTCTTAATTATGGAAATTTAGCTTTGTTAGAGCTTATGTTTGGTTCCAAGAGTAGTTTTTGGTTGTTGGAGGTTTGGGTGGTGTGTTTTGGTTCTGTTGCTTTGTTGGCTTCATTCTGGAGTGTTTTTCTATGTTAGCGTTATTGGCTTCTCCTCTTTGGGTTTACTTGTATTGCTTATTAACTTTCTGGTTTTGGATGGTCTAGGTTTACTTTTTTTTTCTATATGAAAATTGTATTTTGAGCATTAGACTCATTTCATTATTTCAATGAAAATTTTGTTGTTGCCTTGTCAAAAACAAAGAGGGAAGGGATACTACGATTCAATTCATCATGCGTTGGATTTATCAAGCACGTATCCCGTTAAACACCATTCGCTTAGAAGCAGTTGGTCAATTTGGTCCCGGTTTGAAGCCTCCTTCTTATCATGAGGCTAGAGTTACATGTTTGAAAAATGAGGTAGAATTTACTAAAACAATAGTTAGTAATCGTGAGGTTGAATGGAAAAAAAATGGATGTTCGTTGAAGTCAGATGGGTAGACAGATAGAAGAGATAGAACTTTAATAAACTTTTTAGTTCATTCTGCGGCTGGAACATTATTTTTGGAATCCATTGATGCATCTTCATGCATCAAAACTAGTGAAAAAGTATTTGAACTCTTGGATGGAATGGTAGAGAAGATTGGAGAAGAAAATGTCATCCAAGTTGTTACTGATAATGCCTCAAATTATGTCCTAGCTGGTAAATATTTAGAAGCGAAACGACCACATTTATATTGGACACCATGTGCTGCTCATTGTTTAGATTTGATGTTAGAAGATATTGGAAAGATGGATCAAATCAAAAGGTGTATTAAAAAGGCCGTGGCACTTAGTGGTTTTATTTATAACCACTTGTATGTGCTAAACATGATGAGAGAATTCACTAATCAACATGAATTGGTAAGACCCGCCGTTGCTCGATTTGCAACATCATTCTTGACACTATCGAGTATTCATAAAAATAAGGCGAATTTGAGAAATATGTTCATCTCTGAGAAATGGATAACTTCTAAATGGGCAAAGGATTCAAAGGGTAAGAGAGCAGCTGACACTATCTTGATGACATCGTTTTGGAACTCAGTTGTCTATACGTTAAAGGCATCGAGGCCACTAGTTCGTGTTCTTAGATTAGTTGATGGTGACAAACCAACTATGGGGTACATATATGACACTATGGATCGAGCGAAGGAGGCTATCAAGAGCGGGTTTAATGATAATGAAGCAAAATATAAACCTATTTGGGATATCATTGACAAAAGA
TGGGATTGCCAACATCATTGACCTTTACATGCAACCGGTTATTATTTAAATCCATCTCTTTTTTATGATCACAAAGATAGAATTATTCAAGATGTAGAAGTTATGAGTGGTCTACTTCAAGTTGTGCAACGACTAATACCATCAAAGAATGACCAAAACCGATGCACGGTGGAATTGGAGTTGTATTCGGAAGCTAAAGGCATGTTTAGCATCGATTTAGCTATCAACACGAGGAAAATTAAGACGCCAGGTAATGTTATTCTTTAAATGTTTGGGTTTTAAACTTAAGTTATTCCAAGAGTTAATCCTAAACCATTTTAAACATGTGACAATGTGAACAACATCATGGTGGGCTCTTTATGGGGGTAGTACTCCAATTCTACAAAGGTTGGCCATGAGAGTTCTAAATCTCACATGTAGTGCTTCTGGTTGTGAACGCAATTGGAGTGTTTTTGAACACGTGAGTGGTTAATTTGTCTTCTACCTTATTAATTTAATATTTAAACATTACTTTTTAACTCTATATTCATTATTAATTTTGTAGATTCATTCGAACAAGAGAAACAGGTTAGAGCAGAAGCGTTTAAATGATTTGGTTTATATAAAATACAACCAAGCACTTAAAGAACGCTTCGATTTGAAGGATCAACTCGATCCAATTTCTCTTAATCATATTGATGAGAGTAATGAATGGTTGGTTGGAACGCTTGAAGAAGAAGATAATGAACTTGAAGCTGATAATGAGTTGGTTTTTGATGATGACGACCACACATGGGGAGATGTGGCAGATGCTAGTGGTGTTAGAGAACCTCTAAAGTACACTAGGTCTAAAGGGAAGTCAATCTGTGCATGACTTCATCTTCTAGAATTCGAACACCAACACCTACTATACAAAGCAAGTTAGTGATGAAGATGATGACAATGTAGAGGAATTGGATCTTGAAGAGGAGAATGATCAAGGAGTAGCAGGAGATGACGAGTTAGATTTAGATGGGGAGGAAAATTATGATTATGACATTTGATTGATATTTTAGACTTTTAAATTCAGAGTTTTATACTAGTTTTTCCTTTATTGTTGGTTCACTAATTCATTATTTCATAATTTTCATTCTTGATGACTTTAGACATTTGATTGATATATTAAGTTTTTAAGGTTTTTTTCATTTTATATGATATTTACTATACAATTTATGGTGTATATATTATTTATTTATATATTTAATTATTGCGCCTCGCATCACTCGGGCCATGCCTTTTTGTCGCCTCTCGCCTTCGGGTGATCAAATGACTTGTCGTCCTGAGGTGCGCCTTGCGCTTTGAAAACACTGCTTGGTGGAGAACCTTCGTTGTATCCCAGATAGGAAGGATATTCGCTATTGGAGGCCTTGTCCGGCCATGGGATTTTCTTGTCGTTCCTTTTTTCTAGCTCTTACTACTTCTTCTCCCTCTGCTTATAGGGGGGAGTCTGTCTTCGCGACTGTGTGGAGGGTGAAAATTCCAAAGAAGGTTAAGTTTTTTGCCTGGCAGGTCCTACACTAGAGAGTTAATACTCTCGATAAAGTCTTGAGAAGGGTACCTTCTTTGGTTGGCCGTGGTGTTGTATTCTCTGCAGGAGGGCGGCAGAGGACTTCGAGCATTTGTTTTGGACTTGTGATCTGACAGTTGGCGTGTGGAATTTGTTTTTTGAGGAGCTGGGCCTTTCTGATCACAACCACATCGTTAGCAGAGAGAGGCTGGAGGAGTTTTTCTTGCACCCCCCTCTGAAGGGGAAGAGTTTGTTTTTGTGGCACGCTGGGGTTTGCGCTATTCTTTGGGGGCTTTGGGGTGAGAGAAATAATAGAATCTTTCGAGGGATAAAACGCTCTCCCTATGACGTTTGGTCTCTCATTAGATTTTACATTTCTCTTTGGGTCTCGTTAACCAAACCCTTTTGTAATTATCCTTTACATCTTATTTTGTTGGGGTGGAGGCCTTTGTTGTAAGCATCGCTTTTTTTGTTG
GGCTTTTCTTTTTGTCTGCCCTTGTATTCTTTCATTCTTACTCAATGAAAGGGTGGTTTTTATCAAAAAAAAAAAAAAAAAGTATCGTACTTTCTAATTAAAGCTGGTGGTGCACATTGAGCTCATTCAGTTGTGACTTAGTCTCTTGTTAACCATCGTTGCATCGGTTGCTGCATTTTCTATGTACTAAAATAGTGTTTTCCTGAATTTAAAGCTGGTGGTGCACATCGAGCTGCATTCATGATTTTTGTCAAATATATGAAGGAATTCGTATCTCCACATATTTGTACTTTCTAATTAAAGCTGGTGGTGCACATTGAGCTCATTCAGTTGTGACTTAGTCTCCTCTCTTGTTAGCCATCGTTGCATCGGTTGCTAATGCATTTTCTATGTACTAAAGTAGTGTTTTCCTGAATTTAAAGCGGTTTATTCTATGCTATTTGAAGTGAGAAGTATTATTCAATCCTATGGAACTTAAAACCACATCTTCATATCCTCTTTTGCTTTTCTTCATGTGTACTAACATTCTTTGACATTCTTTCAATAGGTTGGACAGCAAAACTTGGCCTTTGGTTTGAAATTTAATGCTAAAGGAACCATTGATTGTTATGAGAACGACCTGGAAAGACTTCCTTTGGGTAAAAAGCGAGTTATCAAATTCAAGATGATTGAAGGTGACTTTGAACTCTTTGAGGGAGAATGGTCAATTGAGCAGGTAATTATCCTTTGCATAGATCAACATTTTTCTATCAGAAGCAATTGCTATTCATATATCAATCATGAGAAATTATATTAGAAGTCCTTTTATTTATAATTAATAAACAAGCTTGTAGCTAATCTTGTAGTGATTTGTGGTCCCTTGTTAGATTTATGCCTCTCTTTGGGCTTCGGTTTCCAAGCTTTTTTGTAATTATTCACTTATCTCATTTTGTTAGACTAGAGCCCTTTTTGCTAAGGGTTTCAGTTGGTAAAGAAACATCCTTTTTTCAGACCCTTAATTTCTGTGGATCTCTTTATTGTGTAAATATTTCCCTTTTATTTCTCTTTCATTTTTGTTAAGATTTCTCTCTCTCCTTCCTTGTATATTTATTCCCGTTACCCTTCTTTCCTTTTGTGTAAATTCATTTAGTCTTTAAATCTAGGGTTGTATCCTTTTTTGAAGAATAATTCAAGACAAGAAGAAGATTTCTCTTACATAAAACAAGATGGTATCAGAGCCCACTTTCTGATTTTTTTAATTGTGTCAGAGGCATTTTTATCAATTCTTCACCTCGTGTCAGAGGTTGATTTGTGGGTATTCTGCCCTTTCTCAAGGTGCTTCGTCAATTGAGGATTCTTGACACCCATTGACTAATAAGCTTCGTGTCAGAGGCTACGTGTGCATCTCCATAGAAAATATTGAGCTGGTGAAGTATTGTGTGAAGTTCAATTTCACACATTAGAAGTATTGTCGGGTAGAAAACTACTATTGTTGGCAGGAAGTTAAATTACTGCATTTTTTATCTATCTATTTCTGTCTTTATTTTGGTGAAAAGCTGCAAAATAGCTAAATGGCTTCTTCCTCAAATACAATGCCTATGTCAGAAGGAACAATTCTCCAAATAACCTCTCATAAGCTCAACAGGAGGAACTACCTTTCATGGTCCCAATCGATAATGATGTTTATTTGTGGGAAGAGAAAAGAAGACTACCTGTCCGACAAGAACTCAACCCCCGCCAGCACAGGTCAAAGGTACAAGATATGGAAAGCTGAAAATAACATGGTCATGTCATGGTTAATCAATTCCATGACCACGGAAATTGGTGAAAACTTCTTACTTTACTCTACAACAAGGGAGATTTGGTAAGCGGTACGTGAAACGTACTCTGATACTGAAAATACTTATGAATTATTTGAAATAGAAGGGAAACTTCACGAGTTGAATCAAGGTGATATGCAAGTTACCCAATACTTCACTCAACTATCCCGACTGTGGCAACAACTAGATATGTTCAAAAAT
CATAAGTGAGACTGCACTTCAGATTCGTTAACATACAAGAAAATTGTTGAAATGAAAAGAGTATTCAAATTCCTACTTGGTCTCAACAAAGAACTAGATGAGGTACGTGGTAGAATTCTAGCTACTAAACCTTTACCTAATATTAGAGAAACTTTTGCAGAAGTAAGAAGAGAAGAAAGCCGGAAAAAGCTAATGTTGGGTGTTGGAACAAACTCTGAACCTGAGGGGTCGACTTTAATTGCTCAAATACACATGCATTGGGAAATGAGGTACACAATGCAAGTACAAATTTTCATAATCGAAATCAAAATTACAACAGAAATGAGAGTCGAACACGAAAAAGTAGACCATGGTGTGAGCATTGTAAAAAACGAGGACATACCAAAGAAGTATGTTGGGACATTCATGGCAAACCTCTGGATTGGAAGTCAAACAAAGGAACTAGTGATCGACGAGGAAAGGCTTTTCAAGCAAGTATACCATCAGAAGGTAATGTCACATCTGAAAGTAATTTATTTACTAAAGAGCAGTTGAAACTTCTTCAAAAATTATTTGGGCAAAGTAAATCTACTCAAAAGGCGCATGAAGCAAACATGGTATAGCTCAAACAGGTAATTTTTCGATTGCTCTTAAGGTAACAAAAGAAAATTTTAGTACATGGATTATAGACTCGGGAGCTTCAGATCATATGACAGGTGGTATTTCTCTTTTGACTAATTACTCAATTTGTAGCAATAAACCTGAAGTAAAAATTGCCGATGGTACTATTTTCAAAGTAATAGGAATGGGTAATGTCTTAGTCTCTAATGATCTAGTCTTAGAAAATGTTCTTCATGTTCCAAACCTCAAATATAATCTTCTCTCAATAAGCAAACTCACTCAAATGAAGAAATGCATGGCAAATTTTTACTCACATCATTGTGTTTTTCAGGATTTGAGCTCGGGGAAGACGATTAATGCTGAAGCATTCTCATGTCTCTATATCCTGAATATTCCAAAGTCCCAAGTCAAGCAATTAGGTGCAAGCAAATCTATTTTTGTTGGTTCGAAAGACGATTACAATAGTAGAATAATGTTATGGCATCGTCGCTTAGGTCATCCAAATTTTATGTATTTGAAACGACTTTTTCCAAACATCTTTAATACAAGTTCTCAGGTTTTTCAATGAGAATCTTGTGAATTAGCTAAACTGGCTCGTTCTATCTATCAGGTACGCACTTATAAACCTTCTCATCCTTTTTCTATGATACGTAGTGATGTCTGGGGGCCTAGTAGAGTTAAAAATATATCAGGAGCGAGATGGTTTGTATCGTTTATAGATGATCACACCAGGCTAGCTTGGGTGTTTCTAATGAAAGAGAAATCTGAAGCAACCAACATATTCCAAAGTCGGTATGTTTTGATCCTCAACCTTTTTAACTTAACACCAAAATTCAAATTCAACATTCTGATAATGCCAAAGATTATTTTAATACTACCCTAGTTTTTTTAAAGGAAAAAGGAATTGTTCACCTAAGTACTTGCACCGACACACCTCAACAAAATGAAATTGAAATTTTTGGATGTACAATATTTATCCAAGATCATCAACCCAATAGAAGTAAATTAGATCCTAAGGCAATCAAATGTATCTTCCTTGGATATGCCTCAAATCAAAAGGGGTACAAATGTTATTATCCTCCTTCTAGGAAATTCTATGTATCCATGAACATGTCCTTCTTTGAAAATCAACCTTATTTTCAAAATTCCAACAATTTCATCAACCCTGGGGAAATCAATATTGAACCATCTTTCCTCATTGACAATGAAGAAACCTCCAACTCCACATTTACAGAAGAATGTACAATGGAGGAACCACCTATCCAAAATCTTAGTATCATTGATCAAGATCCTCAAGAACCTGAGAGAAACCCGATCGTCTATACAAGGAGAAGGATGAACATTCTTGAGAATGAACAAGCACTCAATCAAATTTCTACAAATATACAGGAAAATTTAAC
TACACACTCTAATCTCCCAACTGACACCTTTTTTGATGATGTTGATCTTCCTATAGCAAAAAGGAAAGGGGTGAGAACCGCACTAACCATCCTATACAAAACTACCTTTCCTACAAACATCTCTCTCCCAACTTCCAAGCCTTTACAACTAGCCTATCTACCTTACAAATCCCAAATAATATCTACGAGGCTCTTAGAGACCACAATTAGAAGAAGGCTGTGATGGAAAAAGTAAATGCCTTAGAGAATAACAAAACATGGGAAATAACTGATCTACCCAAAGGCAAGAAAACTGTAGGATGCAAATAGATTTTTACCATCAAATACAAAGCCAATGGCGAAATAGATAGGTTTAAAGTAAGACTTGTTGCAAAAGGGTTCACTCAATCATATGGAATTGATTACCAAGAAACATTTGCTCCAGTAGCAAAATTAAACACTATCCGTGTTATTCTCTCTCTAGCTGTGAATAATGATTGGACTCTTAACCAATTAGACATAAAAAATGCTTTCCTCAATGGTGATCTAGAAGAAGTCTACATGGATATTCCTGTAGGGATGGAAACAACAAAAACCATCAATAAAGTGTGCAAACTCAAGAAAGCCCTTTATGGTTTAAAACAATCTCTAAGGGCCTGGTTTGAGAAATTTTCTAAAGTTGTAAAGAATAAAGGGTACTTGCAATGTCAATCCGATCACACCCTTTTTGTCAAACATTTTCTTCAAGGAAAAATAGCTATCATTATTGTGTATGTAGATGATATTATTATTACAGGTGACCATGAAGAAGAAATAAGGAACATCAAATCACTTTTGGCAAAAGAATTTGAAGTCAAGGATTTAGGACATCTCAAATACTTCCTAGGCATGGAAGTAGCTCGATCAAAGAAAGGTTTGTGTATCTCTTAAAGAAAATATGTGCTTGATCTCCTAAAGGAAACATGTATGCTTTGGGTGTAAACCCGCCAACACTCCGATTGAACCTAAGTGCAACTTGGACAAAGGAAATAGATTAGTTGACAAAGGAAAGTATCAAAGACTTGTTGGAAAACTCATATATCTCACTCATACTAGACTTGACATTTGTTTTGCAGTTAGCTTTGTAAGTCAATTTATGAACAACCCTACTGAAGAAAATATGCAAGCTGTTTACAGAATCCTAAGATATTTGAAAGGGACTCCTAGAGAGGGGTTATATTTTACGAAGCCACCAACGAGAAAGGTTGAAGTCTTTACAGATGCTGATTGGGCAAGCTCTAAAGTCGATCGAAGATCAACCACTGGTTATTGTGCCTTTGTATGGGGCAACCTAGTAACATGGTGGAGTAAAAAACAATCGGTTGTATCAAGAAGTAGTGCTGAAGCTGAATGAAGAGCCTTAACAAGGATATGTGAAGGAATATGACTGCAAAGAATCCTATCAGAATTGAAATTATCTTCAGAAGAACGAATGGAGGTTCATTGTGATAATCAAGCTACCATCAATATTGCCAAAGACCCAATTCACCATGATCGTATAAAGCATGTGGAAATTGATCGTCATTTCATCAAAGAGAAAATTGAAAACAAGAGTCTCATTCTCAACTACATATCGACCAAGATGCAAATCGCAGGTATCCTCACAAAGGCTTTATTTAAGAATAGACTTGAAGATCTTAAAGCCAAGCTGAATGTCATTAACATATGCAATCCAGCTTGAGGAGGAGTGTTGGTAAAGAAACATCCCTTTTTCAGACCCTTAATTGCCGTGGATCTCTTTATTATGTAAATATTTCCCTTTTATTTCTCTTTCTTTTTTGTTAAGATTTCTCTCTCTCCTTCCTTGTATATTTATTCCCGTTACCCTTCTTTCCTTTTGTGTAAATTCATTTAGTCTTTAAATCTAGGGTTGTATCCTTTTGAATAATAATTCAAGACAAGAAGAAAATTTCTCTTGCATACAAGAGTTTCCTTTTTTGTGAGCTTTATTTTTGTATGCCCTTGTATTCTTTCCGTTGTTC
TCAATGAAAGGTTGGTTTTTCATCCCAAAAAAAAAGCTTGTAGCTAATTGGTGCACACGTATACTTTCATTATTTTATTGCTGGATCAAACTTCTAGCCACGATGCTTTTTTCATATATTTCTTCAAGTTTTAACGTGTTTCTGATCTTATCTATAAAACCACCTGCAATATAATATACCTTAAACTTTCCGTTTTAAATATTTGATGATTTTACTTTTTTTTTTGACAAGGAAACAACAAACTTTCATGAAAATAATGAAATGAGATTAATGCTTAAAGTACAACCTCAATATGTACCATCTCAAAAAGGAGAACAAAATTGAAGTGATGCGACAATAAACCTAACTAGGAACAAAAAAACAAGAACATCACACAACGAACAAACAAATAAAACCCAAGAACAAAAGAAGAGTGCCAACAAGCCAAACTTTTAAGAACCAAAAATAAATAACTGTCAAAAACCAAAACTCGGTAATGCCAATGAGGAGAGAACTCCATAAAGAAGCCATCAAAATGTGATCACCAACTCATCAACAAACAACCAGCAATGGCAATGAGCATGGACCGCAAATGTGATCAAACGCACCAAAGAAGAAGAAAACTTTGCTCCATAAACTGCTTTGTAGGGGTGGTGTACATAAGCCAAGTTGGGTTGGGTTGAGGGAGATTTTTGGACCAACCCGAAATTTTGGGTTGGTCATTCCTTCAACCCAACCAACTCTATTCATGAGAGTAATTTAACCCAACCCAACCCAATATTTTTGGGTTGGGTCAAGTCAAATCATCAAGTTAACTTTTTATTTTTTAAATTATTTAAAATATTTTTTTCTTGAATCAATAACTAAAATCACATAAAATTCAAATATCAAACTTCAAAATCCATAACATCTTAACATTTTGTTAGAAGTCTAGAAGAAAACATAAAAAAGTATATATTAATTAAAATAAAATTTAAAAAATATAAACTTTGAGTTGGGTTGGGTCAACCCGAGCCTAAAAGGCTCAACCCGAGAACCGACCCGAACTTTTCGGTTCTCCAAACTTCTAACCCAACCCAAACCGAAAAAAATCTAACCCTACCCAACCCTTACGGTTTGGGTTGGGTAGTCCGGGTTGGTCGGGTTATCGGGTTACTTGTACACCCCTACTGCTTTGTACTTGACTCAACCATTATGATGCTTGTTTGCACTCTCTGAAGTGGCAACATTGAAAATAACCCAAAGACCAGCCAAGAAAAAAACCTCACAAAAAGCAATTGAGAGCTACTAAATTAAAAAAATCTAGCAGGAGCATGCTCAACAATTCCACCCGCTTCCACACTCCTTCTATCAACACTGTAGAAACAAAGAACCAAAACGCAGACCCACAAGATTCAAGACATGATCTTTATTATAAGTAGGACGAACATCCACCAACAGGTACGTTCTTAAATTGTAAACCACTTGCCTTAATCAGGGAGCTAAACTGACTCAGTTCTGGATAATGAATTACCGAATCCGATACCCCCAAGGAGAACACCCCTTTGGAGAATCAAATAAGCTTGCAAGATTAATGATATGGGGCCCATCCGTCAAACTTACCAATGAATTCCTAACCGAACCCAAATCAGGACTACTTAAACTCACTAATGAATCTATGATTGAACCACCTCCAAGCAAAGGAGAGGAAAAGTCTAAAACCCGAGCATCCTCTACCGAATGTAAAAAGGATGCTTTAACAAACGAAACTCCCTTCGGACCAACCTGGCATTCCCCAATATTGAATGTCTAAAACCCGATCATCCTCTAACCTCATCAAATTCAACATTGAGACCAACTAAGAATTTGTAAATACGACCATCTTCAACAATTTTCCTGTAGTGCTTTTGATCCTTTGTAGACTTCCATTCATAGGTATCAAACAGGTCTAGGTCTTGCCAAATCTTTTTGAGAGAGTGAAAGTACTGAGTAACATAATTACCTCCATGTTATGTCACCTCAATTTCAAATTTAGCTAAAAA
ACTTGTGATTTGTTGCCCAAATCCGAATACATTTCAATCACGTTCTCCAAAGCTCCTTTGCGGTAGTGTAGTTACAACTTATGTGTCCGACCATAGAATTAACTAACCAGGTCATCACCATGGAATTTTCAGCATCCCATCCGACAAATGAAGGGTCGTCTTTGGCTGGTGCCACTGTATCTCCAGTGAGGTAGTCGATCTTTCCTTATCCTCGAATATACATTCGTACACTTTGGGACCAACGAAGAAAATTTTCACCATTAAGCCGAATGGTGGTAATTTGAACAATATTGGAATGAATGCAATTATCCGATGCTTTAATAATTGCAGACTTGTTTTTCGACATTCTTGCCTTGCTGGTAGTAGTAGTGGTGGGTTGCAGTTCACCGATGGTGCAGACTTCAGGCGGCGGCAAAGAGATAAGGACGGTCGACGCAAAGCAGATGGCTGCGAAACCTTGCTGGCAAAACAGATGTCTGCGAAGATGGCGTTCGACGCGAGCAAGGATGCGAAGCAGCAGTTGACGCAAGCAAGGAAGGCCGTGTAATCTTCGATCCTCCGGCGGCAACGACTGCGTGGTCGGTCGATGACGTTGGCTTAGCGGACAATCGACGACAACAATCGAAGATGTAGTCGGCGGCGCTGGTTAGGGTTTCTAGAAGTTGACGGCTAGGGTTGGGTTTTTTGTTTTATTATAAACAATTAGGATTTATTGCTTTGATACCATGGGCAAAGTATGAAGAGAGATAATCTTTTCTCTATATTCATTATAAGAAGTTTAGGGTTTCTTAAATATAAGAAATACCCATATACACATACGTAAATAAATTTCCAGCACTATACTACATTAAATACAAATATACTAATTAGCCATAAAAGGAAAGAATAAATTAGAAATAACCAACAAACAAGATACGTGTAGACGAGAGAGCAATTATGAGATATCTTCACCAAGGTACTGAATGAAGCACAAATCGATTACCGCTCTAACAAGTTGGGCATGATGGATATATGCTCCAACTTGAGGGGGAGTGTTAGAGTCCTTATATTAGTAAAATTGTATTTTCTGTATATATATTAATTAGGTTGTCCTTTATTGTCTTTTACATCGTCCTATTTAGGTTAGCTTTATGTACTTGTATAAATACATCTTTTGAGTGAATGAAATGATTCATTGACTCTCTAAAACCTTTAACATTAACGGTAGAAAAGAAAATCATGTGACATTTATGTAGGATACCTCATAATTAAGAAGAAAGAGACCAAAGTGAACATTGCTCAACGACAATTGACATGTACCATCGATCAAGAGGTCAGAGGTTTGAATCTCTCGCTTTCAAATGTTGTAAAATAATAAAAAAGGATATGATACCTTTTACTGTTTAATAAAGACATGGTTTAAACTAAGTTGCAAACACACTAAGCTACATTGGTGTATTTAGATTGAGATTTATTTCCTCTTGTTGCTGATTTTATTTCTTACAGTAGCTGCTTCCTGTTTTAAATATGAATATATACACACTGTTTACCATGGATGTGTTTGAGAAATCCGGTAAAAAGTTAACTACTCATTCCGACTAGAAACAAATGTGTTTTTTGCCTGGTGATTTGATGCTATGCTTACTGCAACCTGTGATCACTCGTTAGGTTGAATGGTTGTTTCCCAGAGGCTGCCTGGACTTTCCATCATTTGAACTTTGGAAATTTGTTTCGATATTTTTTGACTTGCCGTACAATGTCTCATACTTGTGGTTGGTCATGTTGATACAGTTTGACAAGGATAGACATGAAGATGATGGTAACTTGCAAGATTAAGATATACATACAATTCTATTGTATAGGGTTGATGTAAAGCCAAAACTTCTGTTGCCCGTTCAGCTTCTTGAGGGTAGGCTTTGTGATGAGATAAAGGTGAACCTAATGTGTATTCGAGAAGAAGTATATAAAGCTCGCTCAAGCACCTGCTAA

mRNA sequence

ATGCTCTCCCTTCTTAATTCATCAGAACCGTCGCATTCTTCTGGAGCGCTCTCCTGCTTCCTTTCCCGTTTAGCTCCGACATTGCCGGCCATCGCCTCAGCCGCCGTAGTAGTTAAGTTTAGAGTCGACCCTTCTCTCTCATGTGTCGCCATCCCCAGCACCAAATCAAGTGACCCTTCCTCAGATACTACTTATCCTGCCAAACATTTTCGGTCCAGGTTCCGAAACTATTATTCGAATTCCGACCCCACGTTCTCAGATAGTGATGTCAATGGCGATTACTCTGACGCATCAGAGCCAGAAACCATATTGGAAGACGGTGGTGTAAGCATCAAAATCGAGAAGTTGGGAAACAACTCTCGCAGAATTTACTCGAGAATTGGTATTGACGCCCCACTTCAGGCCGTGTGGAACATCTTGACAGATTATGGTAGACTGGCAGATTTCATACCCGGTCTTGCTCTCAGCCAAATACTCTATAAGACTGGCAACCATGCCCGACTCTTTCAGGTACTTTTTTCTTCTCTTCTTTACCGCCCCTTTCCCTCTTCTGATATACATACAATTCTATTGTATAGGGTTGATGTAAAGCCAAAACTTCTGTTGCCCGTTCAGCTTCTTGAGGGTAGGCTTTGTGATGAGATAAAGGTGAACCTAATGTGTATTCGAGAAGAAGTATATAAAGCTCGCTCAAGCACCTGCTAA

Coding sequence (CDS)

ATGCTCTCCCTTCTTAATTCATCAGAACCGTCGCATTCTTCTGGAGCGCTCTCCTGCTTCCTTTCCCGTTTAGCTCCGACATTGCCGGCCATCGCCTCAGCCGCCGTAGTAGTTAAGTTTAGAGTCGACCCTTCTCTCTCATGTGTCGCCATCCCCAGCACCAAATCAAGTGACCCTTCCTCAGATACTACTTATCCTGCCAAACATTTTCGGTCCAGGTTCCGAAACTATTATTCGAATTCCGACCCCACGTTCTCAGATAGTGATGTCAATGGCGATTACTCTGACGCATCAGAGCCAGAAACCATATTGGAAGACGGTGGTGTAAGCATCAAAATCGAGAAGTTGGGAAACAACTCTCGCAGAATTTACTCGAGAATTGGTATTGACGCCCCACTTCAGGCCGTGTGGAACATCTTGACAGATTATGGTAGACTGGCAGATTTCATACCCGGTCTTGCTCTCAGCCAAATACTCTATAAGACTGGCAACCATGCCCGACTCTTTCAGGTACTTTTTTCTTCTCTTCTTTACCGCCCCTTTCCCTCTTCTGATATACATACAATTCTATTGTATAGGGTTGATGTAAAGCCAAAACTTCTGTTGCCCGTTCAGCTTCTTGAGGGTAGGCTTTGTGATGAGATAAAGGTGAACCTAATGTGTATTCGAGAAGAAGTATATAAAGCTCGCTCAAGCACCTGCTAA

Protein sequence

MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASAAVVVKFRVDPSLSCVAIPSTKSSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILEDGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSLLYRPFPSSDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVNLMCIREEVYKARSSTC
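As a quick consistency check between the coding sequence and the protein sequence listed above, the following minimal Python sketch translates a CDS using the standard genetic code. The function name `translate` and the compact codon-table construction are illustrative assumptions and are not part of the database record. Only the first 15 nucleotides of the CDS are used in the example call.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate("ATGCTCTCCCTTCTT"))  # prints MLSLL
```

Passing the full CDS string to `translate` should allow a direct comparison with the protein sequence shown in this record.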
Homology
BLAST of Tan0015692 vs. NCBI nr
Match: XP_023548259.1 (uncharacterized protein LOC111806945 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 289.3 bits (739), Expect = 3.0e-74
Identity = 174/296 (58.78%), Positives = 197/296 (66.55%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL P  PA ASA  AV V FR DPSLS VA+ S   TK
Sbjct: 2   MLSFLNSSDPTYSSPLISCSPSRLPPAFPATASAAVAVAVNFRADPSLSRVAVSSSSRTK 61

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TYPAKHFRSRFR YYSNSDP FSD+D N DYSDASE ETI E DGGVSI+IE
Sbjct: 62  SSNLFSDSTYPAKHFRSRFRKYYSNSDPAFSDTDDNDDYSDASESETIFEDDGGVSIQIE 121

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 122 KLGNNSRRIYSRIGIDASLQAVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 181

Query: 181 SLLY-----------------RPFPS---------------------------------- 234
           +L +                    PS                                  
Sbjct: 182 NLAFGFKFNAKGTIDCYENDLEILPSGRRRVIKFKMIEGDFALFEGEWSIEQFDEDRLED 241

BLAST of Tan0015692 vs. NCBI nr
Match: XP_023006623.1 (uncharacterized protein LOC111499296 [Cucurbita maxima])

HSP 1 Score: 288.9 bits (738), Expect = 4.0e-74
Identity = 173/297 (58.25%), Positives = 199/297 (67.00%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL PT PA ASA  AVVV FR DPSLS VA+ S   TK
Sbjct: 1   MLSFLNSSDPTYSSPLISCSPSRLPPTFPATASAAVAVVVNFRADPSLSRVAVSSSSRTK 60

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TYPAK+FRSRFR YYSNSDPTFSD+D N +YSDASE ETI E DGGVSI+IE
Sbjct: 61  SSNIFSDSTYPAKYFRSRFRKYYSNSDPTFSDTDDNDEYSDASESETIFEDDGGVSIQIE 120

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGID  LQ VWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 121 KLGNNSRRIYSRIGIDVSLQTVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 180

Query: 181 SLLY-----------------RPFPS---------------------------------- 235
           +L +                    PS                                  
Sbjct: 181 NLAFGFKFNAKGTIDCYENDLEILPSGKRRVIKFKMIEGDFALFEGEWSIEQFDEDRLED 240

BLAST of Tan0015692 vs. NCBI nr
Match: XP_038874642.1 (uncharacterized protein LOC120067209 [Benincasa hispida])

HSP 1 Score: 286.6 bits (732), Expect = 2.0e-73
Identity = 174/289 (60.21%), Positives = 193/289 (66.78%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPSTKSSD 60
           MLS L+SSEP++SS   S  LSRLA T PA A A  AVVV FRVDPSLS + IP+TKS+ 
Sbjct: 5   MLSFLHSSEPTYSSSLTSSSLSRLASTFPATAPAALAVVVTFRVDPSLSRLVIPTTKSTI 64

Query: 61  PSSDT-TYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILED-GGVSIKIEKL 120
            SSDT TYP KHFRS FRNYYSNSD TFSDSD NGDYSDASE ET  +D GG+SI+IEKL
Sbjct: 65  SSSDTITYPPKHFRSAFRNYYSNSDSTFSDSDDNGDYSDASESETSFDDGGGLSIQIEKL 124

Query: 121 GNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFSSL 180
           G+NSRRIYSRIGIDAPLQAVWNILTDY RLADFIPGLALSQIL+K GNH RLFQV   +L
Sbjct: 125 GSNSRRIYSRIGIDAPLQAVWNILTDYERLADFIPGLALSQILFKIGNHVRLFQVGEQNL 184

Query: 181 LY----------------------------------------------------RPFPSS 234
            +                                                          
Sbjct: 185 AFGLKFNAKGIIDCYENDLERLPLGKRRVIKFKMIEGDFELFEGEWSIEQLDEDDNLQDQ 244

BLAST of Tan0015692 vs. NCBI nr
Match: KAG6575162.1 (hypothetical protein SDJN03_25801, partial [Cucurbita argyrosperma subsp. sororia] >KAG7013718.1 hypothetical protein SDJN02_23885, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 284.3 bits (726), Expect = 9.8e-73
Identity = 172/296 (58.11%), Positives = 196/296 (66.22%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TK
Sbjct: 1   MLSFLNSSDPTYSSPLISCSPSRLPPTFPATASAAVAVAVNFRADPSLSRVAVSSSSGTK 60

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TY AKHFRSRFR YYSNSDP FSD+D N DYSDASE ETI E DGGVSI+IE
Sbjct: 61  SSNLFSDSTYHAKHFRSRFRKYYSNSDPAFSDTDDNDDYSDASESETIFEDDGGVSIQIE 120

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGIDA LQ VWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 121 KLGNNSRRIYSRIGIDASLQTVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 180

Query: 181 SLLY-----------------RPFPS---------------------------------- 234
           +L +                    PS                                  
Sbjct: 181 NLAFGFKFNAKGTIDCCENDLEILPSGKRRVIKFKMIEGDFALFEGEWSIEQFDEDRLED 240

BLAST of Tan0015692 vs. NCBI nr
Match: XP_022959182.1 (uncharacterized protein LOC111460246 isoform X2 [Cucurbita moschata])

HSP 1 Score: 281.6 bits (719), Expect = 6.3e-72
Identity = 171/296 (57.77%), Positives = 195/296 (65.88%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TK
Sbjct: 1   MLSFLNSSDPTYSSPLISCSPSRLPPTFPATASAAVAVAVNFRADPSLSRVAVSSSSGTK 60

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TY AKHFRSRF  YYSNSDP FSD+D N DYSDASE ETI E DGGVSI+IE
Sbjct: 61  SSNLFSDSTYHAKHFRSRFGKYYSNSDPAFSDTDDNDDYSDASESETIFEDDGGVSIQIE 120

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 121 KLGNNSRRIYSRIGIDASLQAVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 180

Query: 181 SLLY-----------------RPFPS---------------------------------- 234
           +L +                    PS                                  
Sbjct: 181 NLAFGFKFNAKGTIDCYENDLEILPSGKRRVIKFKMIEGDFALFEGEWSIEQFDEDRLKD 240

BLAST of Tan0015692 vs. ExPASy TrEMBL
Match: A0A6J1L5G6 (uncharacterized protein LOC111499296 OS=Cucurbita maxima OX=3661 GN=LOC111499296 PE=3 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 1.9e-74
Identity = 173/297 (58.25%), Positives = 199/297 (67.00%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL PT PA ASA  AVVV FR DPSLS VA+ S   TK
Sbjct: 1   MLSFLNSSDPTYSSPLISCSPSRLPPTFPATASAAVAVVVNFRADPSLSRVAVSSSSRTK 60

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TYPAK+FRSRFR YYSNSDPTFSD+D N +YSDASE ETI E DGGVSI+IE
Sbjct: 61  SSNIFSDSTYPAKYFRSRFRKYYSNSDPTFSDTDDNDEYSDASESETIFEDDGGVSIQIE 120

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGID  LQ VWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 121 KLGNNSRRIYSRIGIDVSLQTVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 180

Query: 181 SLLY-----------------RPFPS---------------------------------- 235
           +L +                    PS                                  
Sbjct: 181 NLAFGFKFNAKGTIDCYENDLEILPSGKRRVIKFKMIEGDFALFEGEWSIEQFDEDRLED 240

BLAST of Tan0015692 vs. ExPASy TrEMBL
Match: A0A6J1H3U6 (uncharacterized protein LOC111460246 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111460246 PE=3 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 3.1e-72
Identity = 171/296 (57.77%), Positives = 195/296 (65.88%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TK
Sbjct: 1   MLSFLNSSDPTYSSPLISCSPSRLPPTFPATASAAVAVAVNFRADPSLSRVAVSSSSGTK 60

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TY AKHFRSRF  YYSNSDP FSD+D N DYSDASE ETI E DGGVSI+IE
Sbjct: 61  SSNLFSDSTYHAKHFRSRFGKYYSNSDPAFSDTDDNDDYSDASESETIFEDDGGVSIQIE 120

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 121 KLGNNSRRIYSRIGIDASLQAVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 180

Query: 181 SLLY-----------------RPFPS---------------------------------- 234
           +L +                    PS                                  
Sbjct: 181 NLAFGFKFNAKGTIDCYENDLEILPSGKRRVIKFKMIEGDFALFEGEWSIEQFDEDRLKD 240

BLAST of Tan0015692 vs. ExPASy TrEMBL
Match: A0A6J1H450 (uncharacterized protein LOC111460246 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460246 PE=3 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 7.5e-71
Identity = 171/308 (55.52%), Positives = 195/308 (63.31%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLSRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS---TK 60
           MLS LNSS+P++SS  +SC  SRL PT PA ASA  AV V FR DPSLS VA+ S   TK
Sbjct: 1   MLSFLNSSDPTYSSPLISCSPSRLPPTFPATASAAVAVAVNFRADPSLSRVAVSSSSGTK 60

Query: 61  SSDPSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILE-DGGVSIKIE 120
           SS+  SD+TY AKHFRSRF  YYSNSDP FSD+D N DYSDASE ETI E DGGVSI+IE
Sbjct: 61  SSNLFSDSTYHAKHFRSRFGKYYSNSDPAFSDTDDNDDYSDASESETIFEDDGGVSIQIE 120

Query: 121 KLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLFQVLFS 180
           KLGNNSRRIYSRIGIDA LQAVWNILTDY +LADFIPGLALSQ+++KTGNHARLFQV   
Sbjct: 121 KLGNNSRRIYSRIGIDASLQAVWNILTDYEKLADFIPGLALSQLIFKTGNHARLFQVGQQ 180

Query: 181 SLLY-----------------RPFPS---------------------------------- 234
           +L +                    PS                                  
Sbjct: 181 NLAFGFKFNAKGTIDCYENDLEILPSGKRRVIKFKMIEGDFALFEGEWSIEQFDEDRLKD 240

BLAST of Tan0015692 vs. ExPASy TrEMBL
Match: A0A1S3C7G7 (uncharacterized protein LOC103497743 OS=Cucumis melo OX=3656 GN=LOC103497743 PE=3 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 7.1e-69
Identity = 169/297 (56.90%), Positives = 192/297 (64.65%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFL-----SRLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS 60
           M S LNSSEP++SS + S  L     SRL+ T PA +SA  AVV  FRV PSLS +AI +
Sbjct: 1   MRSFLNSSEPTYSSSSSSSSLTSSSISRLSSTSPATSSAALAVVPTFRVHPSLSRLAILA 60

Query: 61  TKSSD-PS--SDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILED-GGV 120
           TKS+  PS  S TTYP KHFRSRFRNYYSNS+PTFSDSD NGDYSD S+ ETI +D GG+
Sbjct: 61  TKSTTIPSSYSSTTYPPKHFRSRFRNYYSNSEPTFSDSDENGDYSDVSDSETIFDDGGGL 120

Query: 121 SIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLF 180
            I+IEKLG NSRRIYSRIGIDAPLQAVWNILTDY RLADFIPGLA+SQIL+K GNHARLF
Sbjct: 121 CIQIEKLGTNSRRIYSRIGIDAPLQAVWNILTDYERLADFIPGLAISQILFKIGNHARLF 180

Query: 181 QVLFSSLLY--------------------------------------------------- 234
           QV   +L +                                                   
Sbjct: 181 QVGEQNLAFGFKFNAKGTIDCYENDLERLPFGKRRVIKFKMIEGDFELFEGEWSIEQFGE 240

BLAST of Tan0015692 vs. ExPASy TrEMBL
Match: A0A0A0KCX4 (Polyketide_cyc domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G001770 PE=3 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 4.6e-68
Identity = 167/297 (56.23%), Positives = 187/297 (62.96%), Query Frame = 0

Query: 1   MLSLLNSSEPSHSSGALSCFLS-----RLAPTLPAIASA--AVVVKFRVDPSLSCVAIPS 60
           MLS LNSSEPS SS + S  L+     RLAPT PA  SA  AVV  FRV PSLS +AI +
Sbjct: 1   MLSFLNSSEPSFSSSSSSSSLTSSSLPRLAPTSPATTSAALAVVPTFRVHPSLSSLAILT 60

Query: 61  TKSSD---PSSDTTYPAKHFRSRFRNYYSNSDPTFSDSDVNGDYSDASEPETILED-GGV 120
           TK +      S TTYP KHFRSRFRNYYSNS+PTFSD D NGDYSD S+ ETI +D GG+
Sbjct: 61  TKPTTIPFSYSSTTYPPKHFRSRFRNYYSNSEPTFSDRDENGDYSDVSDSETIFDDGGGL 120

Query: 121 SIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARLF 180
           SI+IEKLG NSRRIYSRIGIDAPLQAVWNILTDY RLADFIPGLA+SQIL+K  NH RLF
Sbjct: 121 SIQIEKLGTNSRRIYSRIGIDAPLQAVWNILTDYERLADFIPGLAISQILFKIDNHVRLF 180

Query: 181 QVLFSSLLY--------------------------------------------------- 234
           QV   +L +                                                   
Sbjct: 181 QVGEQNLAFGLKFNAKGTIDCYENDLERLPFGKRRVIKFKMIEGDFELFEGEWSIEQFGE 240

BLAST of Tan0015692 vs. TAIR 10
Match: AT4G01650.1 (Polyketide cyclase / dehydrase and lipid transport protein )

HSP 1 Score: 112.8 bits (281), Expect = 3.6e-25
Identity = 81/237 (34.18%), Positives = 116/237 (48.95%), Query Frame = 0

Query: 56  SSDPSSDTTYPAKH-FRSRFRN----YYSNSDPTFSDSDVNGDY--SDASEPETILEDGG 115
           S  PSS     ++  F  RF +    + SN D T +++D   DY  +D    E ++ D G
Sbjct: 43  SFSPSSTLLASSRRCFTCRFGDSSPRFNSNEDETETETDDEDDYCLTDGKTEELVVGDDG 102

Query: 116 VSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALSQILYKTGNHARL 175
           V I+++KL  +SRRI S+IG++A L +VW++LTDY +L+DFIPGL +S+++ K GN  RL
Sbjct: 103 VLIELKKLEKSSRRIRSKIGMEASLDSVWSVLTDYEKLSDFIPGLVVSELVEKEGNRVRL 162

Query: 176 FQVLFSSLL-------------------------------------YRPFPS-------- 229
           FQ+   +L                                      ++ F          
Sbjct: 163 FQMGQQNLALGLKFNAKAVLDCYEKELEVLPHGRRREIDFKMVEGDFQLFEGKWSIEQLD 222

BLAST of Tan0015692 vs. TAIR 10
Match: AT4G01650.2 (Polyketide cyclase / dehydrase and lipid transport protein )

HSP 1 Score: 107.8 bits (268), Expect = 1.2e-23
Identity = 67/190 (35.26%), Positives = 94/190 (49.47%), Query Frame = 0

Query: 96  DASEPETILEDGGVSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLAL 155
           D    E ++ D GV I+++KL  +SRRI S+IG++A L +VW++LTDY +L+DFIPGL +
Sbjct: 13  DGKTEELVVGDDGVLIELKKLEKSSRRIRSKIGMEASLDSVWSVLTDYEKLSDFIPGLVV 72

Query: 156 SQILYKTGNHARLFQVLFSSLL-------------------------------------Y 215
           S+++ K GN  RLFQ+   +L                                      +
Sbjct: 73  SELVEKEGNRVRLFQMGQQNLALGLKFNAKAVLDCYEKELEVLPHGRRREIDFKMVEGDF 132

Query: 216 RPFPS--------------------SDIHTILLYRVDVKPKLLLPVQLLEGRLCDEIKVN 229
           + F                       D  T L Y VDVKPK+ LPV+L+EGRLC EI+ N
Sbjct: 133 QLFEGKWSIEQLDKGIHGEALDLQFKDFRTTLAYTVDVKPKMWLPVRLVEGRLCKEIRTN 192

BLAST of Tan0015692 vs. TAIR 10
Match: AT5G08720.1 (CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031); BEST Arabidopsis thaliana protein match is: Polyketide cyclase / dehydrase and lipid transport protein (TAIR:AT4G01650.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 51.6 bits (122), Expect = 9.9e-07
Identity = 23/48 (47.92%), Positives = 31/48 (64.58%), Query Frame = 0

Query: 109 VSIKIEKLGNNSRRIYSRIGIDAPLQAVWNILTDYGRLADFIPGLALS 157
           V  +++ +    RRI   I +D+  Q+VWN+LTDY RLADFIP L  S
Sbjct: 85  VRCEVDVISWRERRIRGEIWVDSDSQSVWNVLTDYERLADFIPNLVWS 132
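The percentages reported in the HSP headers above are plain ratios of matching (or conservatively substituted) positions to the alignment length, gaps included. The short Python sketch below reproduces that arithmetic. The helper name `percent` is a hypothetical choice for illustration, and the input values are copied from the HSP headers in this record.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return f"{100 * matches / alignment_length:.2f}"


# Values from the first NCBI nr hit (XP_023548259.1): 174 identical and
# 197 positive positions over an alignment length of 296.
print(percent(174, 296))  # prints 58.78
print(percent(197, 296))  # prints 66.55
```

Because the denominator is the alignment length rather than the query length, hits with long gapped regions can show a lower percentage than the ungapped portion of the alignment alone would suggest.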

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_023548259.1	3.0e-74	58.78	uncharacterized protein LOC111806945 [Cucurbita pepo subsp. pepo]
XP_023006623.1	4.0e-74	58.25	uncharacterized protein LOC111499296 [Cucurbita maxima]
XP_038874642.1	2.0e-73	60.21	uncharacterized protein LOC120067209 [Benincasa hispida]
KAG6575162.1	9.8e-73	58.11	hypothetical protein SDJN03_25801, partial [Cucurbita argyrosperma subsp. sorori...
XP_022959182.1	6.3e-72	57.77	uncharacterized protein LOC111460246 isoform X2 [Cucurbita moschata]
Match Name	E-value	Identity	Description
A0A6J1L5G6	1.9e-74	58.25	uncharacterized protein LOC111499296 OS=Cucurbita maxima OX=3661 GN=LOC111499296...
A0A6J1H3U6	3.1e-72	57.77	uncharacterized protein LOC111460246 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1H450	7.5e-71	55.52	uncharacterized protein LOC111460246 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A1S3C7G7	7.1e-69	56.90	uncharacterized protein LOC103497743 OS=Cucumis melo OX=3656 GN=LOC103497743 PE=...
A0A0A0KCX4	4.6e-68	56.23	Polyketide_cyc domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G001...
Match Name	E-value	Identity	Description
AT4G01650.1	3.6e-25	34.18	Polyketide cyclase / dehydrase and lipid transport protein
AT4G01650.2	1.2e-23	35.26	Polyketide cyclase / dehydrase and lipid transport protein
AT5G08720.1	9.9e-07	47.92	CONTAINS InterPro DOMAIN/s: Streptomyces cyclase/dehydrase (InterPro:IPR005031);...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005031	Coenzyme Q-binding protein COQ10, START domain	PFAM	PF03364	Polyketide_cyc	coord: 129..221; e-value: 1.1E-7; score: 32.1
IPR023393	START-like domain superfamily	GENE3D	3.30.530.20	-	coord: 122..177; e-value: 2.9E-8; score: 35.6
None	No IPR available	PANTHER	PTHR34060:SF1	POLYKETIDE CYCLASE / DEHYDRASE AND LIPID TRANSPORT PROTEIN	coord: 184..230
None	No IPR available	PANTHER	PTHR34060	POLYKETIDE CYCLASE / DEHYDRASE AND LIPID TRANSPORT PROTEIN	coord: 184..230
None	No IPR available	PANTHER	PTHR34060	POLYKETIDE CYCLASE / DEHYDRASE AND LIPID TRANSPORT PROTEIN	coord: 56..171
None	No IPR available	PANTHER	PTHR34060:SF1	POLYKETIDE CYCLASE / DEHYDRASE AND LIPID TRANSPORT PROTEIN	coord: 56..171
None	No IPR available	SUPERFAMILY	55961	Bet v1-like	coord: 122..173

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0015692.1	Tan0015692.1	mRNA