Lsi03G006210 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi03G006210
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionCAP, cysteine-rich secretory protein, antigen 5
Locationchr03 : 7723692 .. 7749632 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTGGTAGGTTTTCTCCAAATTTCCATCTCCAAGGCACAAAACTCCCCTCAAGACTTCGTCGATACCCACAACGTCGTCCGAGCTGCAGTGGGTGTCGGTCCAGTCTCCTGGGACGACACCCTAGCCACCTATGCTCAGAATTACGCCGATAGTAAGATTGATACCTGCGAGATGGAGCACTCCAATGGGCCCTACGGTGAAAACCTCGCTGAAGGGTACGACGAGATGACGGCGATAGAGGCAGTGAAGTTCTGGGCGACCGAGAAAAAGTTCTACAACCACCATTTGAATCGGTGCATCGGTGACGAGTGTGGCCACTATACGCAAATAGTTTGGAGGGACACTAAAAACATAGGGTGTGCTAGAATGAAGTGTGAGAACAATTGGATTTTTGTGATATGCAACTATAATCCTCCTGGTAACTATATAGGCCAACATCCCTACTAAAATCTAAGTACGAGAGACTTTATAAGAAAGAATTAATATGATGTTCCTCTTGATAATATGCATTTATTTGCATCTATGTATTAGAATATTAAATAGTTACATGAAATATATATGTATATCAGTTATTATATGTCTTCATGTTTAAATTTGAGAGAAGTTACTTTATTTTTGCTATGGAGTAAAAAAAATAAAAAAAAAACAATAAATAATTTTATCTCCATTCTTAAAAGAAATTACTCGCTCATATTTACGTAGCTAATAAGGAGAAAAAATTAATTGCAAACTCTAATACTGGTTCTCGTAATAGATTAGTTAACATTTTGGTGGATCATTTATTTATATTGTTTTATTTTCCTAAATATTTTCATACATTCAATTAATTAAATAAAATCATAAAGTTTAATATCATTATATATGTTGCTAATATTAACATATGCACTAATAATTATATCAATTTGAATCTAAATTTTGGGATGCTAATGTATACTCTTATGGTTTAGAGGTTTTATATGAATTTAACCTATGCACTAATAATTATATCAATTTGAATCTAAAATTTTCACGAATATATCAATTTACACTCTTATTCAAAGTTAATTCATCCAAAAAAAATATTGGGTAGGATTTATAATTTTTTTTCAATTAAAATCTTAAATTGTCATAAGTGAATCACTATGAACCTTTTATTTAAATTACTTTTGAAAATCGTCCGTTTATTAATTTTTTTACACGCGTTTCTTCACATATTAGATTAAGAAAATGGACAATTAGAATTGTTACATGAACGATTTTAAAAAAAAAAAAAATCTTAATTGAGTGTTAAAATTAATTTGCAAAATTTAGGATTTTAAGTTGATAAAATTATATGTCTCACATACTATTTATCTAAATGGAACGTAATAAAGAGTATAGATTAATACATTTATAAAAGCTTAAATTTTAAATCGATACAATTATTTACTTAAAAACTTTTAATTGATACCATCCTAAAATGATATTTCTTTTTCTTATTTAATTGTCACGACATGAATATAAAACGATATATTTTTTTTGGTTGGTTATTAAATACGATGCAACCGAATTTTGTCAGAAAAATTAAAAGAGATATAGATGTGGCATTAATAGTTGTGAAGTTTAGTAAGTATATATAATTGTTGTCACCATCAAGTAGAGAGAGTTAATAAAGGGTTTGAATTTTCCTTCTGTTGTATTACAAGGACGACAAATCGAAAGAGTTACAGATATAAATTAGATCAGTTCTCTTAAATTGAATTAATTGGTGTTCATTCTAGAATTTAGAACAACCAAAATTTGTTTTTGTTAGTTTAATGAGACGGCAGATATATCATATAAGAAATAGAACAAACAAAATGATTGGAGCAATGTGTAGATTTTAACTAGAAATGTTAAGAAATAGATGTGGAACCTTTGCACATCCCCATGGTTAATAGATTTGATAGTTAATTTTTTTTTTTTTATTTTAAATAAGGGTAAAAGAATATCGGGATTTAACTTTTTTGTCATCTTATAAATTGGGTAGGTTTTATTAGTTTCTAGGAGGATAAGAATTATCTCTGGTACATTCCCTACTAGAAACGAGAGTTTGTTCTTACTAATAATCGTGGAGATGGAGGAAGGAATCTCTTACTTCAAGAGAAAATTACCCCTGAGCTAAATAAGTTTACTTTGGCCACAAGGTCACATAGTTGGATCTATTTGGTGTTGACCCACTCCTTTGTCACACTCTTACCTTTCCTCCTCAAATGAAGATATACTTCTCTTATCCTAGAGGCTACAATTGGGTACAACCCTATGTAGGCTTATAAAAACTTGAGAAAGGTCAAGAATGTAGGCTTCATAAAATCTTGTGAGAACCCGACCAACCTGGACCAACCTCAACCCTAGGTTGGGTTGGGTTAGATTTGTTTTCTGTTTGGGTTGGGTTAGAATGTAGGAGAATGAAAATTTTCGAGTTAGTTCTCGAGTTATTTGTTGTGTTCACATAATTAACCTAGTTGTTAATAATGGGCTAGAAGATATGCATGACTTTATAACTAGTATTTGCAAAGTAGTATGATAGCATACTATTGATGTCAAAATTTGGACAGGGTGAATAGTGCCATTTTGACAGAGACCTTTCTGGGCATCGTTTAATTCCCGAAAAACATTCCAATCAAAAGAAATTTATAATACAGTGAATCCTATACTACTACTACGATGACGTAATAATAGTGATAAGTTTGGGTCGAACCATAGGGAGACAAGCGGTAATGATGCTTTGTCGAGTTAATTATCAAATTAGCGCGTAAAATGAAGGGGGTTTGGATTGGAAAGAGATAAAAGCGTGCAAGAAAATAAAGTGGAAGGGTAAATGTAACCTAGGATTGGGTTTATTTAGTTTCGAGAGCAAGAGGGATCCTATGCAGTTTTCAATTATCAATTTCTGTTAATTAGACATGCATTTGATTTATGCGTGCACAACAACCTACATAGCCTAACCTTAACTCAAAGACACTACAAGCAGCTGAAAAGCTCAATTAAACTAAGCAAGCGTCCTAAATATTAATTAAAGTCAGACAACCAAGCTCAATTAACCTACATGCCCTTAAGTCTATTTGGTTCGATAGCGCCCATATAGAAACTAAAAACATGCGCATTAATTTTAAGGATTTTATTGTGTTGATTAACTAATTGGATAACCTAATCAATTAATCAAGTTTTCCTTTGAGCCTAGTTGAAACATGCATTCTAAGATCTAGGATAGCTATAAACCCTAGCATGCAAGATTCAACTAATTGCTACTTAGGTGATTAAGCTAGATGAACATAAGAGAAATACGCAAGCCTAATCTATTAGCTAAGACATACATTCATGATAGGGGTGTCAATCCACATAAAAACATACATCTAAATTGAAATAGATTCAAAGAAAGGCATGAAATCAAAATATATTCAGCACAATCCACAAAAATCTCAAGAAAACTACTTAGGTCTTGCTCCCAAAACCAAATAAATATACATACCTCTACAATCAATAGACAAAATATAGAGGGAAGCTCTAATTCATGAACTACAACCCAAAATAAAACTTAAATCTAACCTAAATATAGAAGAAAAGATGAAAGAGAAGAAGAAATTTTAGTATAACTTCGGCCTCCATGAGTTCCTTCATCCCCTAACCCTAAAAGAAAAGAAAGATATTTATAGATCTGGCGAGGGCGTCGTTATGCTCATGAAGAGCATAGCGACGCTGGCTTTTCCAGCTGCAGCGTAGCTACACTGCCTCCATTTTCGCCCAGCGCCAGCTATTTCTTTGGAACGCAGCGCAGCTACGCTGCCTCTAGGGGCATTTTAGCATTTTGCATCTTTGAATCTTCGAGTGTCCGATTTCACTTCCAATAACCCAAATCAACCCCAAACGGCCCATAACACTCTAAAATGCATGTTTTGCTCAAATTCCCTATAAAATAAACATTTAATCATATAAAACATAATAAGGAAGCTAGATTAAAGATATAAAAATAGCTCTTTTCAAGAGCTATTAGTGCCCAATCCTTTTGCAGCAACAACTGTGAATTTGTAATTCAAGAACGACAGCAACCTGGAAGACAAAAAGAATCTGTACATCGGTGTGGTGCTTTCCACATAAGACTCTGATGCTTACGTTAACAGGAAAACTATGGGGCAATAATAGAAATTTGGGGGTGAGTGGACTTACTTCCCTTAGGGTCTTCCCCGTTTATTTATAGAGCTTTTCTTGACCTAATTCCTTTCTTTCTAGGATTTATGAGTTATACTGTCAGCCAAATCTCTCTCCTTTGGCTGGATTTTCCAAATAAAACCTATTTGTGGTTAACCATTTGGGTCGACCCATGCCACCCCTTTCCTTTTGGCCTTGCGCCTTTTGGCTTTATTTTTTTGGCCTGTTACGTCATTACTTCCTTCATTATTTTACTCATTTGCCCTTGGGTCTAAAATTTTCCTCAACATTAGCCCCACTCCTAAGTTTGATCTGAAGAATTGGTATGTTCATCGATACAAACTTAGAAGTATTGTTCGATCCTTCGTCGCTTCTTTAATTGCTCTATCTTCCTTATTATATGACATACCAGTCTTATGTTGGTATAACCGTATCGAATTTGGCTAATCCAAAATCATTCTCAGATCTTTAGCCATGTTGGAAATATCTAGCCAGATTTCTTTTGGAAATCCGGTTAGATTCCAGGCGCGCATTAACCATAATTGAGTCGTTAAGCACAAAATTTATGTCATGCCCGTTCTTTTCCGAGGAGCATAGACACATGAGCTTGGCTTTTCTTCTATCAAGGATCATTCTCCTCTTCTCTAGACACGCGTAGTTGGGAATATTGGTTAAGATTTCGTTGGTCTCCATCCAGTTCCCTAATTTCCATCCTCCTATACTTAAGTAGAGTCAGCCTACCTGCCTTTCATGATGGGCTTTGACATTTTTCTTTAATTTGACCCGAGCGATTAACTCAGATTTTGGATTTGTAGTCTTGTTGATTACCTTGGTTTTCAATCATGTATTGTTTCCTCATTGGTCAGCTATATTCGTTAATCTTTCTCTCAACTTCTCCATGTAACTTCCCTTCACGCCATGTGCCCATTATAGGGTGGTCGACCTCGGTAATGAGTTGAAATACCTATCTCAATTTTGTAATTTCGAAACATCACATCCACGTCTGATAGAAACGTAGTCATTTCTCTACGCTCTGAGTCTTAAATGTGCTGAGTCTTGGGTGGTCGTCCTCGACAATAGGCCAATCTAGTCCTTTTAATTGGTTTTCAACTTCTCGATTATTGCTTGAAAAGATTTTGAACAAGCGATAAGATACCTCACCACTTAAACCACAAATGTTAAAAAAAAAAAGCACAGACATGAACTGTAACAGCCATAAAACCAACAGCAAGTGATGCCTTAGGCTACCCCTAAGACAGTGAATGTCTTCACAGTTTTGAGATTGACAACTAAAGAACTTGTACAGTGGAAAACACAGCAAGCACAGATCAAAACAATGAAGTACAACGTCAGCCACTTAATACAGAAAACTCTCTTGAAAATAAACAATTGTGACAACCTTAGGAGAGGCCACAACTTCTTTTAACATCCGAGTTCAGATAAGGAAAAAGTTCCATATGAAATATTTTCATTTTCTACAAAATATGCTTCACCATAAAAGGAAAATATCCATCAATAAACTTGTACCGAGAATTCGTAGTTTAGGAAATTAAAATATTCCACATTAAAATTCTAGATTAGGAAGTAAAATATTCTTCTAGAATTATGGTATGTCGGAACCAAATGTTAAGACATAATTTGTGACATCTGAGAAAGCCACGTAAAACAACCTTATCAAAATAGCCTTATTAGAAGCCCCGACTTTCTCAACTAAACAGTTTACAGCCCAAGGGTAACCAAAAGGGGGGTTGATATTGGTTGTTTGCAAGAAATTAATCGCATAAAATAATATAAAGCGCAAAGTGCGCAAAGTAAAGGGTTTAAAAATTAAGATGAGAAAGCCTAGTTTAGGTTGTAACAAAATTTTCCTATTCAAAATATTCGATCATTGAACTCTCATTTACCAAAGTCAATTAATTGCTTACACCTAACGAAATTAATTAGAAAAGCTAATTAAATTGTCCTAAACAACTAACTATTCCAAATTGATTAAATGAAAAGCATGCAAATCAAGTTGTTCTAATTGTATTAGGATCTAGGGAACGTTAGGGTGAATATTTAACTTTCAAGCACCTAGTTAAATGGTTCAATCCTAGGTTGATTAGATCAATACTTTTCTATTCGATTCGCAAGCTAACCTAACCTATTGCTTTCTACGAAACAACATATAACCAAGTATCAGGAATTGCATGTTAATTAAAAACAATAACTATACATCCACAAGAGTACAATAATACAATCAGAATCTCATAAACATAATTGAATCCAACAATGCAATATGAAACTCAGTTCGAAAACTCACAAATACATTGAAACAGAGAAAAGATCCTAAAACCCAAGCTCGAAACGATTACCTTAGCTCATAATCAACATATCTGACATCATTTGACTTGATTATAGAAATAAGATGGAAAGAACTCTGAAATATTATGCTTTAAATGGATAGAGCACAAAAAAACAAAGAAAACGAGTGGTCATATGCCCAAAACGGCCACTTGGATAAAAAGATATGACCTAAAAATCGAGCTACGATTTTTATATCGCGAGACAATGTCGTGGCGCCGTTAAATAAGTGGCAATTTTGTAATTCTTACTTCTTTTGGCCTAATTGCTTCGTTAGGACTCCAAATTGGTCCATTTTAGATCCGTTTTCGTTCTAACATTTGACTCTACCTCTAAATGCTCGATACCTACAAAATAATATATTTAAGCACATTAAATATCATCAACGGGCAAGATTAAAGGCATAAAGATAACACATTTTAGGTGCTATCAAACTTCCTTGTCAAACTCAAAAAAGAGTGAACGAATTGAGTAGGTAATAAGAACCCTTAAATTCAATTAGAAGAAAATAGCAAAAGGGGATGGTTGGCCTCAGCCACCCTTGGAATTCCACTGCACCTGATTTCTTATCAGCTTCCGCAATGCTGTGTAGTAATACTCTCTTGACATCTTTGCTCCTCCTTTGATTACTCAAGTTCCTTCTACAGTAAAAAACTTGATGGCTTGAGGATAAATTAAAGGAATTGCCTTTAATTCATTCAAAGTTTATCTTCGTAGGATTGCACTGTATGTAAATGAATAGTCTGCTACTAGGAAGTTCACTCTTTACGTAATAATGCTAGGCTCTTCCCTGAAGGTCACTGGTAGCTCTATGATGCCTTCAGGATGCACTCTCTCTCCTCTAACCCATCTTTGTGAAGGTTGTCGATGATAGCACATCAATTGAGGTTCCTTCATCTACTAGTACCCTATGAACTGCTATGTTTGCAATGTTTAAGGTTATGACCAAGGCATCGTTGTGAGATTCGTGAAGAGTTTGAGTCTCAGCCTCAGTGAATTCAAGTTTTGCATCCTAGTTCTTGCTTATCACTGAATACATCCTTAACCCATCTGATTCACTCCTAGCACGTCTTGTACTTGCTTTCTGTTTTCTACTTGAATGTCCTTCTGTCGGTCTTCCAATTATGGTTCAAATCTACCTTGGTGGAAAATCTTATTGCTTTGGAGATTGATTTTCCCTGCTCAGCTCGATTCCCTGTTGGTTGCTTTGGAGACTGATTTTCCCACTATTTGCTTCTCCCACATATTTTTTCACAAAAAACATTCCTTATTAGGGCTTTAATCCCATCTTTAACCTCCATTCATTCTTTATGAGGTATCCATGCTCATCATGGTAGAGACAATATTTCTCTTTATTCCTCTTATTTGGTCATGCCCTCATCTTTTTTAGTTTTTTTAACAAGTGCATGTGGCATATTTCTATAATTATCTCTTCCAGCTCTGACATGGTTTGCATATACTTCTCGAATTTATAAGTGGGTTGAAATCGTGTTGGTTCTCTCTCGTCCAATCATCGTACTTGAATTTTGTGGTAGCTTTGCATCCACAATCTCGTCAGTGAGCGGTGGGTCTACTTGATCGATAGGGCTTCTAGGTCAAATCCCTATCTTGATCTAGCTTTCTGCATATCCCTTATCTTGTCCTACATCTTTGCAAGTTTAGCCTTGCATGCCTCTACTTCATTCTTGACGTTCTGACTGTCTTTTTCTACTGTTGTATTTCTGCTCATTTTCTTCCCCAAGAAGCTTCTCAAATCCGTCTCTAGGAAGCTTTTTGATTGGTCCATCTCTCTCATCGTTTCTGTAGGCTAGGCTTGGCCTCGACTTAGCCCTTCCTTCACCTTTTTTTGTTGCTTTGTGTATCTCTAACTTTTGGCCCCAACACTGGCTATCGACTTTTTTAGCAGATTCGTATTTCCTTTCTTCCTTCTTATCGTTCTTGCTTTTCAAATCACGGGGATCCTTGTCTCTCTCCCCTTTATGCCATTTTTTGTTCGTTCCTACCTCAAAACTGTCTTTCCTTTAATTTTGAACAGTGTGGATAGGATTCTCTCATGCTCTCATTTAATGGAACTTTTTTCTTTGGCGGGATCGCAAATAAGCTCATCAAAATTATCTCGATTCTATCGATTGAGAACTTCAAGTAGCAGTGCAACGTTCTGATCTATACCATTCATCTTTCTATCCTTTAATGTGACAGGACTTGATTCCAATGGTTCTGTCGTTGACTTTTTCTTGGTTGGCTCGGACTCGGCCTCGACCTATTCCTTATCTTGATTGGAGAACTGATTGTTGATATGCCAAACTTGGTGTTACTGCAAGTGCAAGTGCACGTGTCAAGTTATAATATATAAAATGGCCTAAGCCAAGTATTGTCCTCAGGGAATCAGATTGACTATCTTTTAGCTATTTTAGAGACCATTCCGATTTTATTTAGGGGATCGACTCATGATGATACGTAATTAAACTAATCTAAGAAAACAAATAAAAGTGATTTAATATGGAAATTATCAACTAAGAGACGTTTTAAGGAATAAATTTCATCAAAGGTATTATTAAGCATAATTGCTTATTCTTGAAGTAATTAACACAACACCTAGATCTTGAAGTCTTTTATCCGGATTGTCCCTCTCGAGATCAACCTTTGCTTTCACTTAATCACTAAGACTAGTCTCTTGAAATTAGATAAATAAGCAAAGCATTAAGATTCTTCAACCACAATCTCTACCTGTTATCTGACTATTTTAGTGTTTTCAAATGATAGTCTAGCATGCATGATATAATTTAAACATACTCTCTCAAGAAAGATTCAAATCATAAAACCATTCCAATGTTGATCAAATATTCAAAAGCATTAAGAACGGTCATGCTAGAAAGCATAAATGCAAATTAATATTCAAGAAACAAAGTGTTCAATAATTTAAATCTAAGAATTCATCTACGCCCCCAAAACTATGAGAAAAAGCCTAACATGGCTTCCATGGATATAACCAAAATTAGAACACAAGAATACATAAAATCAACCAAGAGTAAAGAAGGAAATGAGAAGTTATCTTGCTTCCAAGCTCGTCTTGGGGATGGACTTGGCCTTCCAACGATCCTTAAACCTTCTGGCATCATGAAATCTTCATATTTTGTTCTAAACTCTCTCTTGGTCTCAAATACTCCAAATTCTGATGAGGAGGTTGGCCAAAAACTCTAGGTTAAATAGGCAAAGTTTTTCAAAATCTGCAACAAATCTTTCCAAAAATACACTGGCGCCGCGACGCTACCTTGTTAAGGCCACGATTGCCACAACGCAATGCTGCAGAGCACGCGACTACTTCGACATTACAGGCTCTCTAAAAAATGCGTCGGGTTAGCCCTATTTTTGGTCACTTTGTCTTTTTTTCCACTTTTTCTTTCCCACTTTGATCCTTCGGTCTTGGGTTATCTTGTTTATGCATTATTGGATCATTTTAGCTTGTTTTAGCTTAAAATTTTGTCCTATTGCTCAAATTTTAGAAATCACTAAGAAACAACATCAAAGCTAATATAAAAAAGTGAAATTTACTTTAAATCAGAAGCCTAATAAAAGATCTTTTTAGGTGTTTTTCAATTGTTCTTTTTCTCTTTTGTGAAATTGGAAACTCCTTTTGATTAAGGCTTCCCTTCCTTAAAAGCACAAGGGTAGACCTCGACCTTGGTCAAATGAGTCTGCTTTGTTTCAGCTTCGCTTACTTTACCTTCTTCTTTACCTGTCGCTTCAATCTACAACTTTAAAATTTTTAGCTCCTCTGGTTTTGGGTTTTGGGATCCAGTTCTCTCATCCCTTCTCCCTAGGCTTCGTTCTCTTCCTCTGGTCGAAGTTGGGGAATCCCATCCTCGGCTCGGTCTCCTTCTGACCAAGAGGCCAATGAGACTTGTTTTCTCTAACTCCCGTGGATGTTGGTCTTGATCTGAATCCTTACCATGCAAGGTCATGTTTTGATAATTTTGACTCATCCATATGCAATCTCTTATTGTCCTTGGATAATTTTTTTATTGTTCTTCTCCAGACGACACCAACTATTGATGGCAAAATTTGGACGGAATGAATAGTGTCCAATGCTCCTATGGAAACAATTGTAACTTTATAAAGGACGACAAAAACTTGGGGGACAAAAAGAATCTTCACATCGGTGTTGTCCTTGCCGCATAGACTCTGATGCTTAAGTTGGAAACAAAAGGGCAAGTATAGAAATTTGGGGTGAGTGGCTTACTTCCCTTGGAGTTTTCTTCTTCTACTTATAGAGCTTTTTTGCCTAATTCCTTTCTTTCTAGGGTTTCTAAGCTATTGTGTTGGCCAATTCTCCCTCCTTTGGTCGGATCCCTCGAATGGCAGCCATTCGAGGGTTAATCATTTGGGTCATTCCACTCCCCTCCTTTTGGCTATGCATGCCCTTTGGCTTCGCCTTTTGGTCGGTCACATCATTACTTCCTTTGTTATTTTACTCCTTAGCTCTTGGGTCTAAAATTATCCTCAACACATACGATCATTACTTAAGAGTACTTCATGTAACGCCCTAGATTTTTAAGATGAGATGAATTTGATTACTCATAAAGTGTCAATGCTTTACCTTAATGTCATGTTTTCAAGACATTTTGAATGTTTTATTGTGAACTATTGATTTTAATAGCCTTAAATTTAAATGTTCTTATTTAATTAAAGTTTTTAGGAGTCCATAACTAAAGTTTTGTTTTGATTGTTCCTTAAATATGGTCTTAACAATGTTTTATTTATTAAGGAAAATTTTACTTTAAGTTTAAATGGACAACTAATTTTGGTTTTCAAATTGCACATATAGTAATGACTCTAATTAGAGATCTCGAAAAGTCAAATGACTACCCTTCATATTAACAACTATGCAAAGTTGCTAGTTATTAGTATTATAGTCACAACAATCCCAAATTAAACTTCGTCTTATTTTAAAGTTTAGATTCCATGATCCTAATTAATATATATATATATATATATATATATACACCAAAAGAACTAAAAGTTATATATTTAATCTTTTTAAAAAAACTGATCTTTATATACTTTCATCTGATCAAAACTAAAAATTACATTTATACGGAGGGAAAAAAAAAGTTAAACACAAAGTGGCCCCATTTGATAACTATTGTTTTTTTTTTTAAATTAAGCTTCTAAATACTACTTTTACCTAATAAGATTCCTCATTTTGTTATCCACTTTCTATCGGTGCTTTCAAAAGTCAATTTTTAAAAATTAAAAAAGAAAAAAATCAAGTCTTTGTTTATATAATTTGGCTGAAAATTCAACTATTTTTTTTTTAGAAAAAAAAAGATAAAACAAAATTGATATGAAATGGAGCCAATCTTTTATTTTTGTCATATATGAATTATATTGATATAAAATACGGGTATATGGTTTATATTAAGCTCAATAACCCTAAAAGAAAGTCTAGACATTGACATGCATGTTCTTATGTTATAAGGAGATAAATGAATGGAAGAAGATTATGAAAGTGATGCTCTCACTCACATGGGCAAGCTAGGGAGGTGTCATTATATATTAAAAACCATGGCTTTGCATTGGCAGTCGCAATCTTTATATTAGACTTTGTCAGTTATGGGGTTTGTAGTGTAGCCATGTGAACTAATGTCAATTACTGGATGTTGACACATTATCAGATTAAAAATAAAATAAAAACATTTGTCCCTACGTTTTGGGTATTCAACTTTATAAAAATGTCTTAATTAGTGTTTTTATGCGGCAAGTCTTGGGATAGATTCGGTTGGCGTATGGATCTCACATGTGTTTGTCATCGTTGTAACTCTCGTGGACTTGACTTTCAGTTTGCTTTTTTTGGTCTAGTCTCGCTTTTCAAACCGACCTGCGTGAAAGATTCTCGTACTGGTGTGGTGTCATGCCTATATCACTTCTATGTGTAAGTTAGTAACTGGGAATGTATAGTTCAACTAAGCATACAAAAGTAAGGTCAGAGTTTTCCTTAAGAGATTGATTACCCTTTCTCCTTTGGGGTCATCGAAATAGGCTATTTATAATGTCCTTCCTTTGACAGTCGGCTGCAGTTATGTGTAACTCTAGCGATATCAAAGTAGTTTACAGCACTTGAGGTTGCAACCAATAGCGCAACTCAGTTATTTCCTGATTGGTAATATATTTGTCTACTCGGTTGATCAGCCTAGAACCTGAGGCTCTGCCTTTCCCTTATGGGATGCATGTTTCCCCACGTCGCCTACCAGTTTTCATTAGGATTCAACTCGCTCGTAGGCTTGGGGCCTTGCCGTAGAACAACCCTCAACAGATTTATTATTGTCATATTGTTTCATTAACAACGTTGTTTATAAATAATGTTCAATCTGAAAATGAGATGATATGAATATTAAAGTAAATGAGATTATTTATTCAAAGTGGAACACTTTGTTATGTTATTTTCTCTTAAATTGATATTATTAATAGTTCACTATATATTCGACTATGAAAATATCAAATTACTCAATATATTTGGAAATAGTATTTAAGAAAACAGATGAATCTATGTTAGGAGGTTGGAGTCTTTACTAATTTAGATTTAGTTTTTGAATTTTTTATTTTTTCATTTTTCTAGAAATATCCATTGAAATCAATATTTTATCGATATTTCCATCGCAATTTCCATAAAATTAAAGTCTCGATATTTTTGTCGACATCAACATTTAAATGTACGTGTACTATATATATAACTATTGATGTGGAAATGATGAAAAGATATTAATATATTTTGTATTAAACAAAAAATAGAAAAGAGAAAGGACGATAAGATCTTATATAAACAATGCCAAGACCAGCTGACATATAAAATAAATTTATTTTTTATTTTCAAGAATTATTACATGTGTGCACTAACATTGCATAATATAGAGGACCTTCTCCTCAACTCATGGCTCTAGATACCTAGTTTCAAAAGAAACCATCATCATCTTAAGCACCCTTTTATTTTCGAGAACAAAAGCATATACGAACCTCATTATTAGGAGAAAATATTACTCTTTTTTGTTTTCGCTAGGTTGAGAATATAGTTTTCATTTGTTACCTAATTTTCAAAATGTGATATTTATATCTCTGAATTTTATGTTTAATTTTAATTTGGTCTCTATTTCAAAATGTTACATTTGTACCTTTTAGTTTTGAGTTTAATCCTCATTTAGTCCCTAGGTTTCAATAGTTTACGATTTAATTCTAAAGTTTTAACTAGAACTCACTTTGTTCTTTTGTGTAAAATTTCCATTATTTACTTAAAAGAATTATGATGTGAAAATATTTCATTAATTTTAATATCATTGAAAATTAATGAGAACTAATTGAATTGTAATTAAGTAATTCTTTAAAATTAATTAATAGATATTAACACCAAGGACAAAATGTAAACATTTAGTGAAAATTTAAGGTTGAAAGTGTAAATTTGAAAACTTAGGGATCAAATGAAAACAAAACTCAAAACTCAGGGATAAAAATGTAGCATTTTGAAACTTAGGAACCAAATAGAAATCAAAACCAAAACACATAAGTAAACGTGTAAGATTTTGAAAGGACCATAAACGAAAATTAGACTCAAACCATAAAGACGAAAAACATATTTTCCCTTATTATTATTATTATTATTATCATACTAACACTAATTAAACCATATGTTTAGAAAAAAAAAAGTAGCTAATAAGGCCACTCCCCATCCCAATTGCCCGGAGGATCATAATTGCAAGACACAAACCACCAAGCATCATTACGAGCACGTTGGGGCAGAAGATTGGCGATATGCCAATACATCCCGCGTTCCTTGTCACGTGTAACTCCGATGGAGGTTATAGGACACGTTGGGAAAAAAGATTGGCACTACGCCAAGGCGTCCTGCGTTCCTTGTCACGCACAACCCCGATGGAGGTTATAGGGCACGTTGCGTCCTACGTTCCTTGTCACGCCAATCCCAATGGAGGTTATAGGGCACGTTGAGGCAGAAGATTGGCACTAGGCTAATGCGTCTCGTGTTCCTTGTCACGTGCAACTCTATGGAGGTTATAGGGCACGTTCGGGCAGAAGATTGTCGTTACGCCAATGTGCCCCGCATTCCTTGTCACGCGCAACTCCGATGGAGGTTATAGGGCACGTTGGGGCAAAAGGTTGGTACTACGCCAATGCGCCTCGCGTTCCTTATCACCCGCAACTTCGATGGAGGTTATCCACTAGTAGACATGTAGCAGATATGTCGGTGTTAACACGTCTTGAAAGATGTTAATGTGTCCTGCATTCCATCAAACCTATAGATAGAGAAGTGATAGACGCAAGCGCACGTGTCGAGTAATAATATAGTTAAAACGGCCTTGACCGAGTATCGTCCTCAGGAAATCGAATTGGTTAATTTTTTAACTATTTAGAGATCATTCCAATTTTATTTAGGAGACCGTATTGTAATGACACTTAGGGTAAACTAAGCTAAGAAAGCGAATAAAAGTGATCGAGATAAGGAAATTATCAACTAAGAGACGTTTTAGGAAATAGATTTCATTAAAAGTATTATTAAACACAATTGTTTGGTTCTTGAAGTAATTAACATAACACTTAGTTTTAAAAGTCTTTTATCTGGATTGAATCTTTCGAAAACAACATTTTCTTTTGTTTAAACCCTAATTACTAATCGCTTGAAACTAAATGATTAAGCAAAGCATTGAGAGCCTTCATTAGCAATCTCTACCGTTATCCAACTACTTTTTGTCATTTCAAACGATAGTCTAGCATGCATGATCTAATATAAAAGTCACTTTTCAAGCAAGATTCTAAACCATGAAATCATTCAAACATTGATCAAACATTCAAAAGCATTAAGAACGGTCATGCTAGAAAGCATAAATGTAAATTAACATTTAAGAAGCAAAGTGTTCAACAATTTAAACCTAAGAATTTATCCACTTCTAAAACTATGAGAAATTTAGCCTAACATGACTTCCATAGATAAAATCCACATTAGAACAACAAAATATAGCAAATAAACCAAGAATAAAGAAAGGAAAGAGAAGTTATCGAACTTCCAAACTTGTCTCCGGGATGGATATGGTCTTGCAACGATCCTCGAACCTTCCGACATCATGGTATCCTCATACTTTGCTCTAAACTCTCTCTCTTTCTAAGTATCAAATACTTCAAATCTTATGATTAGGGCTGCTGAGGACCTCCCAATTATAGAAGGTTAGAAACTTGGAGCTCAATTAATTTCCAAAATTACAAAATTTTCATTTTTTTGGAAAGAGAGCGTCGCTACGCTCCCTTACAGCGTCGCGGCACTCTGGTCAATTTTTCGCAACATAATGCTACGCTCAAAAGAGTGCAACTATGCTCCCTCAAATGCCAAAAATTGTTGCTCTCGGGTCTAGAGCGTAGAAGGCCAGCATAGCTACGCTCTAGCTGTTACGCTGCCCTGTTTTCTAGGGTTTTTTATTTGTTTTCATCCGATTTTTGTTCAGTTTGGTTCCTTTAGCTCCTGAAACTCATTTTCTTCATTAATTACCTGAAAACCACTAGCAAACACATTACATCTACTAAAATATCCCTAAATAAATAGTAAATAATGCAAATTTTAAGTGTGTTTCAAGAAACAACAACGCGACTATCAAACTTGGAGATGAAGAAGCAACAGTACTACCAATGAACTTGGAGATGGAGATGCAGCAACGATATCATCAAATTTAAAAAAATAGATAAATTGCAACGCAGTTGTCGAACTTGGAGATGGAGTAAAGCTGCACCAGCATTTCGGAGGAGAAGCAGAGCTGCACCGACATTTCAAGCACCATTGCGAAGCTGCACTAATATTTTGGAGGAGGAGCGAAGTTGCGCCAACATTTTGAGCGCTATTGCATAGTTGTGCCAACATTTTGGAGGAGGAACAGAGTTGTGCCAACATTTTGGATGAAGCGGAGTTGCGCCAATATTTTGGAGATGGATCGAGCTGTACCAGCATTTTGGAGATGGAGCGGAGCTGAGCCAATATTTTGGAGATAAAACGGAGCTGCACCAACATTTTGGAGGAGGAGAAAAGCTGTATCGACATTTCGAGCACAATTGCGGAGCTGCACCAATATTTTGGAGATGGAACGGAGCTGTGCTAATATTTTGGAAACAGAACGGAGATGCCCTAAAATTTTGGAGGAGGAGCGAAACTGAACCAATATTTTGAAGGAGGAGCGGAGTTGAACCAATATTTTGGAGATGGAGCGGGCGACACCAATATTTTAAAGATGGAGCAGACCTGATCTTTCAGATGAGGTAGAGTAAAGCATATCGAACTTGATCTCACATATGAGGTGAAGTCAGGCAGATCAGACTTGATCTCGCAAATAAGGCGGAGCCAAGCAAGCTGGACCTAATCTCGCAAATGAGGTGGAGCTAGGCAGACTAAACTTCATCTTGCATATGAGGCCAATCATCCAAACTTGAATTTGATAAAGCATCGACAAATCTTTCGAATTTGAATCTGAAGAAGCATCGACAAAACCTTTGAACTTGTATTGGAAGAAGCATTGGCGAAGTCTCCGAACTTGAGTTTGATGAATCATTCATGAAGTCTCCGAACTTGAAGGTGAAAATACATCAACGAAGCCTTCGAGCTAGAATATAAGGAAACATCAATGAAGCCTCTGAACTAAAGCGAGCAACCCCTGTACTCAAAGCTTATTTAAAGCGAGCAGAGAATAGCACAAAGTTTCTTCGAAGGCTACTTTATAAGCAGACAACTTGTAGTGTCTTGCTTCTAGTGCTACACTCCTTTGATTTGTTGGGTTGCTGTTAGTAAGGCATCTTCTGTTGGGTTTTATGTCCTAAAACTTGTAGTTTGTAAACATTAATATGCATTCTATTTATCAATAAAGTTTTTATCGAGGTTATTCGGTAAACATGTTATTGAATAAAGTGAATTGTTGTTAATATAACTTTAAATCCAATAAACTAAGAACCCATGACTTTAGTATGAAAACTTGAACTATATGTAGTGACATAAAAGTGGATCAAGTTCGAGTAAAATAGCCAGAACAGTCAATAGTATAGGGATGAGGATGGGTACCTTATCTTGGGGACACTATTGAATGTGGCCCACTTTGTATTTAGTACAAATGATTTGATCCTAAATCATTAATGTGGAGACATGTGAGTGGGGGCATTCTGTGCAATGAGTTTGCATAAGACCGAACCACGAAATAGTTACTACTACTTTATAACGCCATTTACTGTTAAATACTAACTATTTCAACTAGATAACGTAGGTAACTCGATCTCAATCCTGAACTAACTATGAACTCCTATTTATTCGGGATTATCCTTAGATCTGCATGGGTGAAAGTGGCCCAGGTGCGCCGACTCAATAAGCCTCCCATTTCAGGGGTAAGACCAAGTAGATAGGTGGAGACATAGTCTTGCAAGACGGAATTCACTCCTACCCGACTTAGATTAGTAGATAGGTTGTTCCCTTAAATATTGATTTCGGGTCTTGAACAAGGAGCCACACCCTCTCATTGGCCCGAGAGGGACTCGGTTTAGTGATAGGATCACAAACCAATTGTTCATTAGAGGATCAGTTGGACTTAAGGAACAAGATGTATTCATAAGGGTAAAACGGTAATTTTGACCTAGCTGTGATTACGGACAACCTATGAAGAATCGACTTACTGATTATGGTTATATCAAGTGGACATAAATATATCTACAATGAGGAGAGTGCAGCTACGGGCTTTAGTAGAGTGTTCTGTTAGTTAACGAATGTTGGTTAACTAGGTTAAAGAATTTAGCCGGTTAATCTCGGATCGTTGGAGCTCATGATCTGTAGGTCCATTAGGTCCCTCTACTAGTTCATAAATGGATAGAAACCTTAGATAGTGTGGTGAGTAAGTTTGAACCGTTCAATCTCGGCTTAAGGAATGAAAGTCAAATATATACGATATATCAACGTTTAATTACAAATTAAACGGACAAATAGAGTTAATTATAATATTTAAATATGATTTAAATATTATTTATATGGATAGAAATTCATATTGGTTTTTTAGAAAAATCGGAATAAGTGGAAAAAATAATATTATTTTATTATCTAATATAAAAATAATATTATATTTAATTAATTAAATATTAAATTATTTTTTATTTAATTAATTATTTTTAAATTAATTTTTGAAATTAAAAATCAAATTAGAAATTGATTTTTTTATTCAATTTTGAAAATTCATTTAAAATTGAACTTAAAATTAATTTTTGAAAAATTGGATTTTCCCACTAACTAAGCTACTACCTTAGTTGTTAACTTCTCAATTAGCCAACAAATTGGAGTTTTGTGTTGTCAATTTGTGATGCAGTTAGATTGCATGTTAATCCTTAATAAGTAAAGATAATTTTGGCTAAAAATGGATTAAGAATTTTGTGAAATTAAGAGTTAATTTTGAAAAATTCTTTCATAGCCCATTTTCTTCTTCAACCTTCATTCCATCACAAATTCACCACAAAATTGGATCCACCATTTAATCTAACGCCAGAGAATAATAGAGAAGACTCTTGTGGTGATCTACAAACAAGTGGTATTCATTTTCATCAAGAAACAATAGAGAATTTAGAGCTTCAATGGTAAGTATTCTTAAAACTCTAGTTTTCTCCTTTTAATTGCATGTTTTATCCTCAAAATTGAGCATCTTATAGTGAAAATTGATCAAATTAATTTTCGCTGCATGCTCGTATGTCCTTGATCTTCAACCTTTAGTGACATCCTTTAACAAACACCATCAATTTTCTAGTGTCACGCTTTAGGCGATTGTGACACTTTTCTTTTTTTGGATTGAACTTAAATATCTTTAAGAAATCTACGTACCCTTTATAATGTATAGGGATCAAGTCATAACGTAATTCAACTGAGTTTTTTTGCTTTTTACTGGACTGAACTTAAATTTCTTTAAGTTGCCTACGTACCCTTTATAATGTATAGGGATCAAGTAATAACGTAGTTCAACTGAGTTTTTTTGTTTTTTACTGGACTGAACTTAAATTTCTTTAAGTTGCCTACATATCTTTTATAATGTATAGGGATCAAGTCATAACGTAGTTTAACTGAGTTTTTTTTCTTTTTACTGGATTGAATTTAAATTTTTTTAAGTTGCGTATGTACCCTTTATAATGTATAGGGATCAAGTCATACTGTAGTTCAAATACTTTTTTTTTTTATCATTCACCTTTTAAGCTCATGCGGACCAGGAGCACCATGCAGTTTAGGCTCTTGCGATGGGGTGCTTCTACATTTAAACTTTAGAAGCTCTTATTTCATCATGACTTTGTTCATCTTCTTCAATCGAAGGATTCGTCAATATGATGAGTTTTGACTTCACCGTCAAGGAACCTTTTGTATTTATGTCACTAGACAACTTCCTCTTCATGCGTGAAGGGACACAACTGTGAATCTTCTCGTTGTTGTTTTCTTCATGAAATGGTTTTGCTCCAAGGTTTTCATCTTTCTTTCATGTTCACCGCTTGTCATTTTGAGATGATCAAAAGCAGATGTTGAAGGTCGATCCTTCTTTGATGTGGAGATACTTAGCCTTTGAAGCCTAAACATTGATCTCTTCTTTTATCTTGGCCATACCCAGTCTTTGGAAGACTAAAGATTGAAGGCTTGAGACGATCGAATACAGAAGTTCTTTACTTGGCTTCTTTTGAATCGTCGACTTCTTCAACAGTTATATGATTGCTTTCTTGTCTTGCCCTTCCCAGTTATACGAACCAGCTTAATGGGACTTATATCCAAATCCTTTTCTCGACGTGGGTATAGCATACCCCTATTTCAAAAGCTTCTTCTAGAATGGAGAAAGTTCAGGCTGCTCATAGATTTTCAAGCTCTTGAATTTAGTGTGAGTTATGAAGTCATAGCCTACTTTTGCCAGCAATTTATATGTAACGACCCGACTTTCTAGGACTTGTACTAGGCCGTTACTATATGCATGCTTACATCACAAACGACTCTTTTTTATAAATTTAACAAAAATTAACAAATTTCATAAATCAAATAATTTTTATAAATTCTAAATAACAGAAAATTCCTAATCGAGGTACCCCTAAAATTATAAAACAAAACTGAAAGTTTGTATGATTTGTTTAACAGCGCAAATACTAGAATTTAAAATTTAAAACATCAAGCCTAGAATTTCAAAACATGAAAACATGGAAAATGTGTGGGGAAGCTTCTATTTGGTCCCAATAGTCCACTACCAGGGTCAGGCGGTGCATTTATATCCTATAAATGCAATCATATTAGTGGGATATGCATACATCAACTAATCCTAGATGGCCGATGCGCGATTTCCGCAAGTGCATGGGTTAAGTTATAATATAAAACAGTTAAGACCGAGTATCGTTCCACTAAGGATTATAGCACGTTTAATTACTAAGCTATTCAAAAATTGAATCAACTTTATCTAAGAAATCGAATTTGAGTCTGTGTTGAATTGGAAAACAAACAAAAAAAGTAAAGACAAGTGAAAATTAAGAGATAAAAGCGTTCTAGGGTGTTGATATCATTAATTACTTCATTAAGTCATACTTTATATTTTCATGGAATTATGTGTAAACAATATGTTTAAGTCTAGAGACTATTATCCAGATTATTTTCTCTAAAAAACAAACATTTCTCATGCAACTTAATTGATTACTGATTTCTTGAAACCAAGATAAGAAGCATAACATTAAACACAATCATTAACAACTTCCTCTACCAACCTAGCTATTTTCATGAAAGATTGATAAAAGTTCAATACCATATATCCAATTAGACATATAATAAATTACTCAGTCATACATCCAACCTAACAATTATTGATGATCAATCAAAGGATAATGAAGCACAATAGTATTGAAAAACATATCATTCAACAACATCCATGAAAAGCACTAAAATATTAACTTAAATTCATAATTAAATCATTCAACACCCTAAAACTAAGAGTTTAGCCACACAGTCATCAAATACATACTATCATATGAGTTCTCAAGCATGAGAGTAAAGAGAAAGAGAAAACTTAGGAAGAATAACTCTTGAATTGCTTCTCGAGTTGAATTCCTTCGATCTTTACTTAGTCAGCACCTCGGGATCTCGAACGGCCTCCAACTCTCCTCCAATCAATTTTCTCTCTTTTTAGTAGCAATCTCCCACGTTTAGGCTTTGATATTTCGTGATTAGGATTGCTAAACACCATCTAATTTATAGAATTTCTCAAACTTGCAGCCCAAAAATTTTCCAATTGTGCAAACTTTTTAATTTTAGAAAGAGAGCGTCGTACACTCCCTTAGAGCGTCGTGACACTCTGGCTAATTTCTACAGCTCAACACTCTCTAATGAGCGTAGCTATGTTCCCTTAGCTTCCAGATTTTCTTGCTCTCAAGTCTAGAGCGTAAAGGGGCACATAGCGACGCTCTAGCGTTACGCTGCCCTGTTTTGAGGGTTTTAAACTTGTTTTGATTCGGTTTTGGCTCGATTTGGTTCCTTTGGCTCGAAACTCAAATAAAGAGTATGTTGGGAAAAACCTCGCACTTCTATCACAGGAATAAGCAGAAAAATAATAGAGAAATTAATAAAAGAAAGCAGTAAAACTGGAAACACAAGAACTAACGTGGAAAACTCCCAACTCAGAGAAAAACCACGAACCAGAAAAAATTCACTATGTGAAAAATCGTTACAATCATAGAGAATAATTCTCTTTCCCGATCCTAATTACAAGAGCACTCTCTCAAAATTTTTATACTACTCACACCCTTTTCCCACTCTCAAACTAGAGAATACATAAAGGAAATTTAACTAGAGTTAGCACACTAGGCTTAAAGTATTTCTAACTAGGACGACTTGAAACTAGAGGCATAGGCTCCTTTTATAGGCTTGGAGTCCATCCCCATTCTCCAATTTAACCGATGTGGGACAACTACATTTCTAATATTTTGCAAAAAATCCAATTCCAAATCAAATAAAACAATAATCTATGATGCCAAATTCGAAAAAATCTATTTGCAAAACCCAACATTCAAAATGTAGAGTTAAAATACCCATCTTTTCTCTTAAATTTGTTTCAAATTGAATATCAAAGCATAACCTGGCTCTGATACCAATTGAAGGAATTGAATTTTGAGCGGAAGTGCATGAACATCGTGCAATTGATATTTAATTATGAAACACCAAAACACAGAGAAACAAAAACTACAGGATAAACAAAATAATTCCCTTAGCATGCTTTCTACAGAAGAATTAAGGAAGAAATTTCACAAATCTTTGAAGACCAATCTTCATAGTATTTCTTCTTTTCCTTCTTTGGATTACGAACTCGTACAGAGAACAGTAATAGGGACACCACTGTAAATCAATCCAGAGGAGTTGTGGGCTCTCTGTTCTTTTTGGTAAAAGAATTTAGATTGTTCTTGAGAGAGTTTTCTATATTCCAGAGAGAGTACCTTTTTTTTTTCTATGTTCTGTTAGTGTATCTCTATGAGAGAAGAGAAGATGGGAGTTATTATAATAACAATAATATTATTAATATTAAAATAATTTTAATAATAATAGTAATAATATTGATTTAATATATATATATATATATATATATATATATATATATATATATATTAAATCATATTTAATATATATAATGAAATAATAATTAATTAACAATTAATTAATTAATAAATTAAATATCACATATTTAATTTTATTTGAATCATATTCAAATGATAAAACTCTCACATGACCTATAGTTTAATATGAATCATATATTCACATTAAATTTAACCTATAGTTTTTATATGAATCTAATTCATATAATTAATATTTGATCTTATTCAAATATTTTGTCTCTCTCATCATATAAAATTTAAATTTGAATCTCATTCAAACTAACTTTATATTATAATGTATCTATATACATTATCTTAATTGTATCATATACAATTAATTTCCTTATTTAATTTGAACAATTCAAATTAATCCAAAATAATTTATTCTTATCAAAACCCTCAATGAGTTAGAAAGGGGACCTTATGGACCTATAGATTAGAAACTCCAATGATATGAGATTAATTAATTAAACTCTTAAATTATCTTAAATTAAATTAATCAATATTCATTAATTGTAGGTCACTCCACTAAAGACCTACAGTTGCACTCTTTGCACTATAGATATATTTCCGTGTCTACGGATATAACCAATCAGCAGTAAGTTGACCCTTCACAATTGCTCATAACTACAGCTAAGTCAAATTACCGTTTTACCCCTGTAGTACATCTAACTCCTTAAGTACCATTGATCCCTCTAATGAACAAACAGTTTATAGTCCAACTATAAACTATACTCTCTTGGGCCAGAGAGAGGGTGAGGCCTCATTGATCAAGACCCGAAATCAGCCCTCAAGGGATGAATTATCCACTTATCCTAAAGATGGGAGGGAGTGAGTTCCATCTTGTGAGAATACGTTCCCAACTCCCTAATGAGACAAATCCCCAAAATGATAGGCTTACTGAGTCGACGATCTGCCCACTCTCACCCATACAGATCAAAGGATTGCCCTCATAAGCAGGAGTTGATATCCACTTAGGATTAAGGTCAAGTCACCTATGATCATCCTATGAAATGCTAATTTCTTCAATTAATGGTGTTATAAGGAGAAATTAAACATTTCGCAGTCGGTCTTATACCAACTCTTTGCATAGGATACCCCCAATCACATGTCTCCACATGAACGATTAGGATTACATCGTTTGTAACACTTTACAACTCTTATAACAATTACAAAGTGGGCAGTATTCGTAGTGTTACCAGGTTAAGGCACCCAACTTTATCCCTATACTACAAACCTTTTTGGTTATTACTTAAACATGATCCACCCTTGTTTGTAAGCGAATCGTGTGGATGTACTTGAAGTTGGAATCTCTGAGGCATGTATGTATTCCAAAGAACTTCAGTCTTCAAGTGTGATTCAACTTCCGGAGTTAAGGAGTCTTCTGATTGTAGGGAGAATTCTCTCTAGGCTCTCTGGTGTCTAAAATTTCTGATCACCTCAAATGAAGGAGAGCTCATATATTTATAGAGTTCTCATGTTGGCTTTATGGACTTAGACTTGGCTAGTCCATGAACCTAGCCCTTAGGCTCAACTAATTAGATTTGTGCCTTTGTTGGCCTTTGGGCCAAATTAAGTTTACAATTTTTGGACTCAATTGGATTTTAGTGCAAATATTAATAAATAATTGAACAAAAAATAATTAATTTGATCCAACAGTGAGGACGAAAACATGTGACACCATCATCATTTGTCAATTTACATCTTCAATAGCAATTTGGGACACATGTCAACTTTTAATTAGTCCCAAATTTAATTATTTCTAATTTCACCATTAATTTGAGAAATGATGTGACAATTTGTGGTTGGTCCAAAATTTCTCATTCAACACATGTTGTTCAAGTTTTGCCTGTAACAAAATCAGTCAATCATACTGCAATAACAACATTTGTGTGTTTTATGATATATATTTTTAGTTATATGAACAATTTTTTAAGGATATAATGGAGATGTTTATTTATCTCCATGTCAATATTGAACCCATGAACATATGAAATTATTGCAGAGACAAGAATTTAAAACCTTAATTCATACACGGTATTATCCTATTAATAGTAATATATATAGTAATATATATATATATTACTATGTATATATATATTGAATTAGTTATGAGAAATTATGGAGAATTTATTTGGTGCCTCACTATTATTAGACCTTTGCTCTGTCTTCCAAAACAACCCCTAAAATGGAACCATACTCATCTTTATTCTCTGTATAAAAGCTTAGCTAAGCCAAACCCCAAACCCCAAATCTCCATATCCAACACAATCAATTATTCCTCCAACTTGCTTATAAATATGGCATTTACCAACATTTTCTTCGCCCTTTGTTTGTTGGGACTAACCCTAACTCTAGCACCCATTGCCCCCATCATGGCTAAAAGCTACCCTAAAAACTACATTGCCGCCCACAATGCCATTCGTTTGCAGGTTGGTGTCGAGCCCCTCCATTGGAACGCCACCTTGGCAGCGTATGCTCAAAATTATGCCAACACGAAGATCGCCACCTGTCAAATGGAGCACTCTGGAGGACCTTATGGCGAAAACTTAGCGGAGGGATACGAAGTGATGACAGCAGAGACGGCGGTGAGTCTCTGGACTGATGAGAAGAAACATTATGACTACAATTCTAACACGTGTGTTAACGACTCTAGCCATTGCCTCCATTATACACAGTTGGTGTGGAGTAACACCAAATCTATTGGTGCCCAAGTGAAGTGCCAAAACAATTGGGTTTTTCTAATTTGCAACTATTATCCTCCAGGAAACTATATGGGCCAACGTCCATATTAA

mRNA sequence

ATGCTTGGTTTTCTCCAAATTTCCATCTCCAAGGCACAAAACTCCCCTCAAGACTTCGTCGATACCCACAACGTCGTCCGAGCTGCAGTGGGTGTCGGTCCAGTCTCCTGGGACGACACCCTAGCCACCTATGCTCAGAATTACGCCGATAGTAAGATTGATACCTGCGAGATGGAGCACTCCAATGGGCCCTACGGTGAAAACCTCGCTGAAGGGTACGACGAGATGACGGCGATAGAGGCAGTGAAGTTCTGGGCGACCGAGAAAAAGTTCTACAACCACCATTTGAATCGGTGCATCGGTGACGAGTGTGGCCACTATACGCAAATAGTTTGGAGGGACACTAAAAACATAGGGTGTGCTAGAATGAAGTGTGAGAACAATTGGATTTTTGTGATATGCAACTATAATCCTCCTGCTAAGCCAAACCCCAAACCCCAAATCTCCATATCCAACACAATCAATTATTCCTCCAACTTGCTTATAAATATGGCATTTACCAACATTTTCTTCGCCCTTTGTTTGTTGGGACTAACCCTAACTCTAGCACCCATTGCCCCCATCATGGCTAAAAGCTACCCTAAAAACTACATTGCCGCCCACAATGCCATTCGTTTGCAGGTTGGTGTCGAGCCCCTCCATTGGAACGCCACCTTGGCAGCGTATGCTCAAAATTATGCCAACACGAAGATCGCCACCTGTCAAATGGAGCACTCTGGAGGACCTTATGGCGAAAACTTAGCGGAGGGATACGAAGTGATGACAGCAGAGACGGCGGTGAGTCTCTGGACTGATGAGAAGAAACATTATGACTACAATTCTAACACGTGTGTTAACGACTCTAGCCATTGCCTCCATTATACACAGTTGGTGTGGAGTAACACCAAATCTATTGGTGCCCAAGTGAAGTGCCAAAACAATTGGGTTTTTCTAATTTGCAACTATTATCCTCCAGGAAACTATATGGGCCAACGTCCATATTAA

Coding sequence (CDS)

ATGCTTGGTTTTCTCCAAATTTCCATCTCCAAGGCACAAAACTCCCCTCAAGACTTCGTCGATACCCACAACGTCGTCCGAGCTGCAGTGGGTGTCGGTCCAGTCTCCTGGGACGACACCCTAGCCACCTATGCTCAGAATTACGCCGATAGTAAGATTGATACCTGCGAGATGGAGCACTCCAATGGGCCCTACGGTGAAAACCTCGCTGAAGGGTACGACGAGATGACGGCGATAGAGGCAGTGAAGTTCTGGGCGACCGAGAAAAAGTTCTACAACCACCATTTGAATCGGTGCATCGGTGACGAGTGTGGCCACTATACGCAAATAGTTTGGAGGGACACTAAAAACATAGGGTGTGCTAGAATGAAGTGTGAGAACAATTGGATTTTTGTGATATGCAACTATAATCCTCCTGCTAAGCCAAACCCCAAACCCCAAATCTCCATATCCAACACAATCAATTATTCCTCCAACTTGCTTATAAATATGGCATTTACCAACATTTTCTTCGCCCTTTGTTTGTTGGGACTAACCCTAACTCTAGCACCCATTGCCCCCATCATGGCTAAAAGCTACCCTAAAAACTACATTGCCGCCCACAATGCCATTCGTTTGCAGGTTGGTGTCGAGCCCCTCCATTGGAACGCCACCTTGGCAGCGTATGCTCAAAATTATGCCAACACGAAGATCGCCACCTGTCAAATGGAGCACTCTGGAGGACCTTATGGCGAAAACTTAGCGGAGGGATACGAAGTGATGACAGCAGAGACGGCGGTGAGTCTCTGGACTGATGAGAAGAAACATTATGACTACAATTCTAACACGTGTGTTAACGACTCTAGCCATTGCCTCCATTATACACAGTTGGTGTGGAGTAACACCAAATCTATTGGTGCCCAAGTGAAGTGCCAAAACAATTGGGTTTTTCTAATTTGCAACTATTATCCTCCAGGAAACTATATGGGCCAACGTCCATATTAA

Protein sequence

MLGFLQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGCARMKCENNWIFVICNYNPPAKPNPKPQISISNTINYSSNLLINMAFTNIFFALCLLGLTLTLAPIAPIMAKSYPKNYIAAHNAIRLQVGVEPLHWNATLAAYAQNYANTKIATCQMEHSGGPYGENLAEGYEVMTAETAVSLWTDEKKHYDYNSNTCVNDSSHCLHYTQLVWSNTKSIGAQVKCQNNWVFLICNYYPPGNYMGQRPY
BLAST of Lsi03G006210 vs. Swiss-Prot
Match: PRB1_TOBAC (Basic form of pathogenesis-related protein 1 OS=Nicotiana tabacum PE=3 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 3.2e-45
Identity = 74/133 (55.64%), Postives = 99/133 (74.44%), Query Frame = 1

Query: 7   ISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYG 66
           I  SKAQNSPQD+++ HN  R  VGVGP++WD+ LA YAQNYA+ +I  C M HS+GPYG
Sbjct: 18  IHSSKAQNSPQDYLNPHNAARRQVGVGPMTWDNRLAAYAQNYANQRIGDCGMIHSHGPYG 77

Query: 67  ENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGCARMKCE 126
           ENLA  + ++ A  AVK W  EK+FY+++ N C+G  CGHYTQ+VWR++  +GCAR++  
Sbjct: 78  ENLAAAFPQLNAAGAVKMWVDEKRFYDYNSNSCVGGVCGHYTQVVWRNSVRLGCARVRSN 137

Query: 127 NNWIFVICNYNPP 140
           N W F+ CNY+PP
Sbjct: 138 NGWFFITCNYDPP 150

BLAST of Lsi03G006210 vs. Swiss-Prot
Match: PR1A_SOLLC (Pathogenesis-related protein 1A1 OS=Solanum lycopersicum PE=2 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 1.1e-40
Identity = 69/137 (50.36%), Postives = 97/137 (70.80%), Query Frame = 1

Query: 4   FLQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNG 63
           F+    S+AQ   ++F++ HN  R  VGVGP++WDD LA YAQNYA+ + D C M HS+G
Sbjct: 13  FIIFHSSQAQTPRENFLNAHNAARRRVGVGPMTWDDGLAAYAQNYANQRADDCGMIHSDG 72

Query: 64  PYGENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKNIGCAR 123
           PYGENLA  + ++ A  AVK W  EK++Y+++ N C  G  CGHYTQ+VWR +  +GCAR
Sbjct: 73  PYGENLAAAFPQLNAAGAVKMWDDEKQWYDYNSNTCAPGKVCGHYTQVVWRKSVRLGCAR 132

Query: 124 MKCENNWIFVICNYNPP 140
           ++C + W+F+ CNY+PP
Sbjct: 133 VRCNSGWVFITCNYDPP 149

BLAST of Lsi03G006210 vs. Swiss-Prot
Match: PR1A_TOBAC (Pathogenesis-related protein 1A OS=Nicotiana tabacum PE=1 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 5.3e-40
Identity = 78/142 (54.93%), Postives = 99/142 (69.72%), Query Frame = 1

Query: 1   MLGFLQISIS-KAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEME 60
           +L FL IS S +AQNS QD++D HN  RA VGV P++WDD +A YAQNYA      C + 
Sbjct: 18  LLLFLVISHSCRAQNSQQDYLDAHNTARADVGVEPLTWDDQVAAYAQNYASQLAADCNLV 77

Query: 61  HSNGPYGENLAEGY-DEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKN 120
           HS+G YGENLAEG  D MTA +AV+ W  EK++Y+H  N C  G  CGHYTQ+VWR++  
Sbjct: 78  HSHGQYGENLAEGSGDFMTAAKAVEMWVDEKQYYDHDSNTCAQGQVCGHYTQVVWRNSVR 137

Query: 121 IGCARMKCENNWIFVICNYNPP 140
           +GCAR++C N    V CNY+PP
Sbjct: 138 VGCARVQCNNGGYVVSCNYDPP 159

BLAST of Lsi03G006210 vs. Swiss-Prot
Match: PR1B_TOBAC (Pathogenesis-related protein 1B OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 1.3e-38
Identity = 76/142 (53.52%), Postives = 97/142 (68.31%), Query Frame = 1

Query: 1   MLGFLQISISK-AQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEME 60
           +L FL IS S  AQNS QD++D HN  RA VGV P++WD+ +A YAQNY       C + 
Sbjct: 18  LLLFLIISHSSHAQNSQQDYLDAHNTARADVGVEPLTWDNGVAAYAQNYVSQLAADCNLV 77

Query: 61  HSNGPYGENLAEGY-DEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKN 120
           HS+G YGENLA+G  D MTA +AV+ W  EK++Y+H  N C  G  CGHYTQ+VWR++  
Sbjct: 78  HSHGQYGENLAQGSGDFMTAAKAVEMWVDEKQYYDHDSNTCAQGQVCGHYTQVVWRNSVR 137

Query: 121 IGCARMKCENNWIFVICNYNPP 140
           +GCAR+KC N    V CNY+PP
Sbjct: 138 VGCARVKCNNGGYVVSCNYDPP 159

BLAST of Lsi03G006210 vs. Swiss-Prot
Match: PR1C_TOBAC (Pathogenesis-related protein 1C OS=Nicotiana tabacum PE=2 SV=3)

HSP 1 Score: 160.2 bits (404), Expect = 3.8e-38
Identity = 76/142 (53.52%), Postives = 96/142 (67.61%), Query Frame = 1

Query: 1   MLGFLQISIS-KAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEME 60
           +L FL IS S  AQNS QD++D HN  RA VGV P++WDD +A YAQNYA      C + 
Sbjct: 18  LLLFLIISHSCHAQNSQQDYLDAHNTARADVGVEPLTWDDQVAAYAQNYASQLAADCNLV 77

Query: 61  HSNGPYGENLAEGY-DEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKN 120
           HS+G YGENLA G  D +TA +AV+ W  EK++Y H  N C  G  CGHYTQ+VWR++  
Sbjct: 78  HSHGQYGENLAWGSGDFLTAAKAVEMWVNEKQYYAHDSNTCAQGQVCGHYTQVVWRNSVR 137

Query: 121 IGCARMKCENNWIFVICNYNPP 140
           +GCAR++C N    V CNY+PP
Sbjct: 138 VGCARVQCNNGGYIVSCNYDPP 159

BLAST of Lsi03G006210 vs. TrEMBL
Match: A0A067KWY2_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09851 PE=3 SV=1)

HSP 1 Score: 295.0 bits (754), Expect = 1.1e-76
Identity = 147/313 (46.96%), Postives = 192/313 (61.34%), Query Frame = 1

Query: 12  AQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGENLAE 71
           A++SPQD+V+ HN  RAAVGVGPV+WD T+A YA+NYA+  I  C + HS GPYGENLA 
Sbjct: 11  ARDSPQDYVNAHNTARAAVGVGPVTWDSTVAAYARNYANQHIGDCRLVHSGGPYGENLAG 70

Query: 72  GYDEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKNIGCARMKCENNWI 131
           G   ++   AVK W  +K FY+++ + C  G  CGHYTQ+VWR++ +IGCA++KC N   
Sbjct: 71  GSGGLSGTAAVKLWVDQKAFYDNNSSSCAAGKVCGHYTQVVWRNSVHIGCAKVKCNNGGT 130

Query: 132 FVICNYNPPAKPNPKPQISISNTINYSSNLLINMAFTNIFFALCLLGLTLTLAPIAPIMA 191
           F+ CNY+PP               NY+               LC    TLT+    P+ A
Sbjct: 131 FITCNYDPPG--------------NYN---------------LC----TLTI----PLHA 190

Query: 192 KSYPKNYIAAHNAIRLQVGVEPLHWNATLAAYAQNYANTKIATCQMEHSGGPYGENLAEG 251
              P++Y+ AHN  R  VGV P+ W+ T+AAYA+NYAN  I  C++ HS GPYGENLA  
Sbjct: 191 LDSPQDYVNAHNTARAAVGVGPVTWDNTVAAYARNYANQHIGDCRLVHSAGPYGENLAWS 250

Query: 252 YEVMTAETAVSLWTDEKKHYDYNSNTCVNDSSHCLHYTQLVWSNTKSIG-AQVKCQNNWV 311
              ++   AV LW DEK  YDYNS++C      C HYTQ+VW N+  IG A+ KC N   
Sbjct: 251 SGDLSGIDAVKLWVDEKAFYDYNSSSCAT-GKVCRHYTQVVWCNSVRIGCAKAKCNNGGT 285

Query: 312 FLICNYYPPGNYM 323
           F+ CNY PPGNY+
Sbjct: 311 FITCNYDPPGNYL 285

BLAST of Lsi03G006210 vs. TrEMBL
Match: A0A0A0K7N5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G070232 PE=3 SV=1)

HSP 1 Score: 287.7 bits (735), Expect = 1.7e-74
Identity = 140/171 (81.87%), Postives = 146/171 (85.38%), Query Frame = 1

Query: 164 MAFTNIFFALCLLGLTLTL------APIAPIMAKSYPKNYIAAHNAIRLQVGVEPLHWNA 223
           M F NI   LCLLGLTLTL      A  AP +AKS PKNYI AHNA+R  VGVEPLHWN+
Sbjct: 1   MGFANILSTLCLLGLTLTLTLTLTLASTAPSLAKSSPKNYIDAHNAVRAAVGVEPLHWNS 60

Query: 224 TLAAYAQNYANTKIATCQMEHSGGPYGENLAEGYEVMTAETAVSLWTDEKKHYDYNSNTC 283
           TLA YAQNYANTKIATCQMEHSGGPYGENLAEG EVMTAETAVSLW DEKKHYDYNSNTC
Sbjct: 61  TLADYAQNYANTKIATCQMEHSGGPYGENLAEGNEVMTAETAVSLWADEKKHYDYNSNTC 120

Query: 284 VNDSSHCLHYTQLVWSNTKSIG-AQVKCQNNWVFLICNYYPPGNYMGQRPY 328
            ND S+CLHYTQLVWSNTKS+G AQVKCQNNWVFLIC+YYPPGNY GQRPY
Sbjct: 121 SNDPSNCLHYTQLVWSNTKSVGCAQVKCQNNWVFLICSYYPPGNYNGQRPY 171

BLAST of Lsi03G006210 vs. TrEMBL
Match: A0A0A0K6A0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G070235 PE=3 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 9.9e-70
Identity = 117/135 (86.67%), Postives = 126/135 (93.33%), Query Frame = 1

Query: 5   LQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGP 64
           L  +IS AQNSPQDFVDTHN +RAAVGVGPVSWDDTLA YAQ+YADSK+DTCEMEHSNGP
Sbjct: 18  LATTISNAQNSPQDFVDTHNDIRAAVGVGPVSWDDTLAAYAQSYADSKMDTCEMEHSNGP 77

Query: 65  YGENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGCARMK 124
           YGENLAEGYDEMT +EAV+FWATEKKFYNHHLNRC+GDECGHYTQIVWR T NIGC R+K
Sbjct: 78  YGENLAEGYDEMTGVEAVRFWATEKKFYNHHLNRCVGDECGHYTQIVWRHTTNIGCGRVK 137

Query: 125 CENNWIFVICNYNPP 140
           CENNW+FVICNYNPP
Sbjct: 138 CENNWVFVICNYNPP 152

BLAST of Lsi03G006210 vs. TrEMBL
Match: A0A0A0K4C1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G070225 PE=3 SV=1)

HSP 1 Score: 230.7 bits (587), Expect = 2.5e-57
Identity = 99/132 (75.00%), Postives = 113/132 (85.61%), Query Frame = 1

Query: 8   SISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGE 67
           SI+ AQNS QDFV+ HN  RA VGVGPVSW+ TLA YAQ YA+ KI TCEM+HS GPYGE
Sbjct: 22  SITLAQNSHQDFVNAHNAARAKVGVGPVSWNYTLAAYAQTYANKKIGTCEMQHSYGPYGE 81

Query: 68  NLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGCARMKCEN 127
           NLAEGY EMTA+EAV FW +EKK+Y+HH NRCIGDEC HYTQ+VWR TK++GCAR+KC N
Sbjct: 82  NLAEGYGEMTAVEAVNFWVSEKKYYDHHSNRCIGDECRHYTQVVWRGTKHVGCARVKCHN 141

Query: 128 NWIFVICNYNPP 140
           NWIFVICNY+PP
Sbjct: 142 NWIFVICNYDPP 153

BLAST of Lsi03G006210 vs. TrEMBL
Match: A0A103XME5_CYNCS (Uncharacterized protein OS=Cynara cardunculus var. scolymus GN=Ccrd_004588 PE=3 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 2.4e-55
Identity = 128/321 (39.88%), Postives = 174/321 (54.21%), Query Frame = 1

Query: 7   ISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHS-NGPY 66
           +  S  QN+PQD+V+ HN  R  VGVGPV W+  LA +A+N A+ +   C M+HS +  Y
Sbjct: 19  LHFSYVQNAPQDYVNAHNQARKEVGVGPVMWNAKLAKFAENQANQRKTDCAMQHSRSSQY 78

Query: 67  GENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIG-DECGHYTQIVWRDTKNIGCARMK 126
           GENLA G  E + ++AVK W   K  Y++  N C+   +CG YTQ+VWR +  IGCAR+K
Sbjct: 79  GENLATGTGEFSGMDAVKLWIKGKANYDYKSNSCVQMRKCGGYTQVVWRKSTLIGCARVK 138

Query: 127 C-ENNWIFVICNYNPPAKPNPKPQISISNTINY--SSNLLINMAFTNIFFALCLLGLTLT 186
           C +NNW                       T NY  S N   N+A+ +  +A         
Sbjct: 139 CNKNNWFV---------------------TCNYDPSGN---NIAYLHFSYA--------- 198

Query: 187 LAPIAPIMAKSYPKNYIAAHNAIRLQVGVEPLHWNATLAAYAQNYANTKIATCQMEHS-G 246
                    ++ P++++ AHN  R +VGV P+ W+A LA +A+NYAN +   C  +HS  
Sbjct: 199 ---------QNAPQDFVNAHNQARKEVGVGPVTWDAKLAKFAENYANQRKTDCAPQHSHS 258

Query: 247 GPYGENLAEGYEVMTAETAVSLWTDEKKHYDYNSNTCVNDSSHCLHYTQLVWSNTKSIG- 306
             YGENLA G    +   AV LW  EK +YDY SN+C      C  YTQ+VW  +  IG 
Sbjct: 259 SQYGENLATGTGEFSGMDAVKLWITEKANYDYKSNSCA-QMRRCGSYTQVVWRKSTLIGC 296

Query: 307 AQVKCQNNWVFLICNYYPPGN 321
           A+VKC  N  F+ CNY P GN
Sbjct: 319 ARVKCNTNGWFVTCNYDPSGN 296

BLAST of Lsi03G006210 vs. TAIR10
Match: AT1G50060.1 (AT1G50060.1 CAP (Cysteine-rich secretory proteins, Antigen 5, and Pathogenesis-related 1 protein) superfamily protein)

HSP 1 Score: 162.9 bits (411), Expect = 3.3e-40
Identity = 74/140 (52.86%), Postives = 97/140 (69.29%), Query Frame = 1

Query: 2   LGFLQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHS 61
           + FL ++ + AQN+PQD++++HN  RA VGV  V WD TLA YA NY++ +   C + HS
Sbjct: 14  ISFLVVA-TNAQNTPQDYLNSHNTARAQVGVPNVVWDTTLAAYALNYSNFRKADCNLVHS 73

Query: 62  NGPYGENLAEG-YDEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKNIG 121
           NGPYGENLA+G     +AI AVK W  EK +Y++  N C  G +C HYTQ+VWRD+  IG
Sbjct: 74  NGPYGENLAKGSSSSFSAISAVKLWVDEKPYYSYAYNNCTGGKQCLHYTQVVWRDSVKIG 133

Query: 122 CARMKCENNWIFVICNYNPP 140
           CAR++C N W FV CNYN P
Sbjct: 134 CARVQCTNTWWFVSCNYNSP 152

BLAST of Lsi03G006210 vs. TAIR10
Match: AT4G33720.1 (AT4G33720.1 CAP (Cysteine-rich secretory proteins, Antigen 5, and Pathogenesis-related 1 protein) superfamily protein)

HSP 1 Score: 156.8 bits (395), Expect = 2.3e-38
Identity = 65/130 (50.00%), Postives = 91/130 (70.00%), Query Frame = 1

Query: 11  KAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGENLA 70
           KAQ+SPQDF+  HN  RA VGVGP+ WD+ +A YA+NYA+ +   C M+HS+G YGEN+A
Sbjct: 25  KAQDSPQDFLAVHNRARAEVGVGPLRWDEKVAAYARNYANQRKGDCAMKHSSGSYGENIA 84

Query: 71  EGYDEMTAIEAVKFWATEKKFYNHHLNRCIGD-ECGHYTQIVWRDTKNIGCARMKCENNW 130
                MT + AV  W  E+  Y++  N C  D +CGHYTQ+VWR+++ +GCA+++C N  
Sbjct: 85  WSSGSMTGVAAVDMWVDEQFDYDYDSNTCAWDKQCGHYTQVVWRNSERLGCAKVRCNNGQ 144

Query: 131 IFVICNYNPP 140
            F+ CNY+PP
Sbjct: 145 TFITCNYDPP 154

BLAST of Lsi03G006210 vs. TAIR10
Match: AT2G14580.1 (AT2G14580.1 basic pathogenesis-related protein 1)

HSP 1 Score: 156.0 bits (393), Expect = 4.0e-38
Identity = 68/147 (46.26%), Postives = 97/147 (65.99%), Query Frame = 1

Query: 1   MLGFLQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEH 60
           ++G L + + KAQ+S QD+V+ HN  R+ +GVGP+ WD+ LA YA+NYA+     C + H
Sbjct: 16  LVGALVVPL-KAQDSQQDYVNAHNQARSQIGVGPMQWDEGLAAYARNYANQLKGDCRLVH 75

Query: 61  SNGPYGENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGC 120
           S GPYGENLA+   +++ + AV  W  EK  YN+  N C G  CGHYTQ+VWR++  +GC
Sbjct: 76  SRGPYGENLAKSGGDLSGVAAVNLWVNEKANYNYDTNTCNG-VCGHYTQVVWRNSVRLGC 135

Query: 121 ARMKCENNWIFVICNYNPPAK-PNPKP 147
           A+++C N    + CNY+PP    N KP
Sbjct: 136 AKVRCNNGGTIISCNYDPPGNYANQKP 160

BLAST of Lsi03G006210 vs. TAIR10
Match: AT2G14610.1 (AT2G14610.1 pathogenesis-related gene 1)

HSP 1 Score: 154.5 bits (389), Expect = 1.2e-37
Identity = 67/138 (48.55%), Postives = 91/138 (65.94%), Query Frame = 1

Query: 10  SKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGENL 69
           SKAQ+SPQD++  HN  R AVGVGP+ WD+ +A YA++YA+     C + HS GPYGENL
Sbjct: 24  SKAQDSPQDYLRVHNQARGAVGVGPMQWDERVAAYARSYAEQLRGNCRLIHSGGPYGENL 83

Query: 70  AEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGCARMKCENNW 129
           A G  +++ + AV  W +EK  YN+  N C G  CGHYTQ+VWR +  +GCA+++C N  
Sbjct: 84  AWGSGDLSGVSAVNMWVSEKANYNYAANTCNG-VCGHYTQVVWRKSVRLGCAKVRCNNGG 143

Query: 130 IFVICNYNPPAK-PNPKP 147
             + CNY+P     N KP
Sbjct: 144 TIISCNYDPRGNYVNEKP 160

BLAST of Lsi03G006210 vs. TAIR10
Match: AT2G19990.1 (AT2G19990.1 pathogenesis-related protein-1-like)

HSP 1 Score: 151.4 bits (381), Expect = 9.9e-37
Identity = 67/126 (53.17%), Postives = 93/126 (73.81%), Query Frame = 1

Query: 16  PQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGENLAEGYDE 75
           PQ+ +  HN  RA VGVGP+ W++TLATYAQ+YA  +   C M+HS GP+GENLA G+  
Sbjct: 42  PQETLVVHNKARAMVGVGPMVWNETLATYAQSYAHERARDCAMKHSLGPFGENLAAGWGT 101

Query: 76  MTAIEAVKFWATEKKFYNHHLNRCIGD-ECGHYTQIVWRDTKNIGCARMKCENN-WIFVI 135
           M+   A ++W TEK+ Y++  N C GD  CGHYTQIVWRD+  +GCA ++C+N+ +I+VI
Sbjct: 102 MSGPVATEYWMTEKENYDYDSNTCGGDGVCGHYTQIVWRDSVRLGCASVRCKNDEYIWVI 161

Query: 136 CNYNPP 140
           C+Y+PP
Sbjct: 162 CSYDPP 167

BLAST of Lsi03G006210 vs. NCBI nr
Match: gi|659110139|ref|XP_008455069.1| (PREDICTED: basic form of pathogenesis-related protein 1-like [Cucumis melo])

HSP 1 Score: 299.3 bits (765), Expect = 8.3e-78
Identity = 138/165 (83.64%), Postives = 149/165 (90.30%), Query Frame = 1

Query: 164 MAFTNIFFALCLLGLTLTLAPIAPIMAKSYPKNYIAAHNAIRLQVGVEPLHWNATLAAYA 223
           M F NI    CLLGLTLTLA  API+AKSYPKNY+ AHNAIR +VGV+PLHWN+TLAAYA
Sbjct: 1   MGFANILSTFCLLGLTLTLASTAPILAKSYPKNYVDAHNAIRAEVGVDPLHWNSTLAAYA 60

Query: 224 QNYANTKIATCQMEHSGGPYGENLAEGYEVMTAETAVSLWTDEKKHYDYNSNTCVNDSSH 283
           QNYANTKIATCQMEHSGGPYGENLAEGYE MTAE AVSLW DEKKHYDYNSNTC ND+S+
Sbjct: 61  QNYANTKIATCQMEHSGGPYGENLAEGYEEMTAEMAVSLWADEKKHYDYNSNTCTNDASN 120

Query: 284 CLHYTQLVWSNTKSIG-AQVKCQNNWVFLICNYYPPGNYMGQRPY 328
           CLHYTQLVW NTKS+G A+VKCQNNWVFLIC+YYPPGNY+GQRPY
Sbjct: 121 CLHYTQLVWRNTKSVGCAEVKCQNNWVFLICSYYPPGNYIGQRPY 165

BLAST of Lsi03G006210 vs. NCBI nr
Match: gi|1009121091|ref|XP_015877272.1| (PREDICTED: STS14 protein-like [Ziziphus jujuba])

HSP 1 Score: 298.9 bits (764), Expect = 1.1e-77
Identity = 148/335 (44.18%), Postives = 202/335 (60.30%), Query Frame = 1

Query: 1   MLGFLQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEH 60
           ++ F  +  S AQNSPQD+++ HN  R A GVGP+ WDD +A +AQ+YA+     C M H
Sbjct: 14  IISFSLLQTSHAQNSPQDYLNAHNSARTAFGVGPLVWDDRIAAFAQDYANKHQGDCNMVH 73

Query: 61  SNGPYGENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGC 120
           S GPYGENLA+   +++  EAV  W  EK  YN++ N C+G +C HYTQ+VW D+  +GC
Sbjct: 74  SGGPYGENLAKSTGDLSGTEAVNLWVEEKPNYNYNSNSCVGGQCLHYTQVVWEDSARVGC 133

Query: 121 ARMKCENNWIFVICNYNPPAKPNPKPQISISNTI------NYSSNLLINMAFTNIFFALC 180
            +++C N    + CNY+P      +  ISI   I        +S    N  F   F    
Sbjct: 134 GKVRCNNGGTLIGCNYDPRGNIQLRNTISILLFITSIISRKETSTKNHNYKFPQYFHNTL 193

Query: 181 LLGLTLTLAPIAPIM-AKSYPKNYIAAHNAIRLQVGVEPLHWNATLAAYAQNYANTKIAT 240
              L +TL+P AP+  A+++ +N + AHN+ R  VGV PL WN  +AAYAQNY+N +   
Sbjct: 194 ---LPVTLSPAAPLSGAQNFTQNCLNAHNSARAVVGVGPLTWNDNVAAYAQNYSNQRRGD 253

Query: 241 CQMEHSGGPYGENLAEGYEVMTAETAVSLWTDEKKHYDYNSNTCVNDSSHCLHYTQLVWS 300
           C + HS G YGENLA     ++   AV LW  EK  YDYNSN+C+     C HYTQ+VW 
Sbjct: 254 CSLVHSQGQYGENLAWSSADLSCAEAVKLWVAEKADYDYNSNSCIG-GRQCGHYTQVVWR 313

Query: 301 NTKSIG-AQVKCQNNWVFLICNYYPPGNYMGQRPY 328
           N+  +G A+V+C +   F+ CNY PPGNY+GQRPY
Sbjct: 314 NSIHLGCAKVRCNDGATFITCNYDPPGNYVGQRPY 344

BLAST of Lsi03G006210 vs. NCBI nr
Match: gi|643728186|gb|KDP36354.1| (hypothetical protein JCGZ_09851 [Jatropha curcas])

HSP 1 Score: 295.0 bits (754), Expect = 1.6e-76
Identity = 147/313 (46.96%), Postives = 192/313 (61.34%), Query Frame = 1

Query: 12  AQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGPYGENLAE 71
           A++SPQD+V+ HN  RAAVGVGPV+WD T+A YA+NYA+  I  C + HS GPYGENLA 
Sbjct: 11  ARDSPQDYVNAHNTARAAVGVGPVTWDSTVAAYARNYANQHIGDCRLVHSGGPYGENLAG 70

Query: 72  GYDEMTAIEAVKFWATEKKFYNHHLNRCI-GDECGHYTQIVWRDTKNIGCARMKCENNWI 131
           G   ++   AVK W  +K FY+++ + C  G  CGHYTQ+VWR++ +IGCA++KC N   
Sbjct: 71  GSGGLSGTAAVKLWVDQKAFYDNNSSSCAAGKVCGHYTQVVWRNSVHIGCAKVKCNNGGT 130

Query: 132 FVICNYNPPAKPNPKPQISISNTINYSSNLLINMAFTNIFFALCLLGLTLTLAPIAPIMA 191
           F+ CNY+PP               NY+               LC    TLT+    P+ A
Sbjct: 131 FITCNYDPPG--------------NYN---------------LC----TLTI----PLHA 190

Query: 192 KSYPKNYIAAHNAIRLQVGVEPLHWNATLAAYAQNYANTKIATCQMEHSGGPYGENLAEG 251
              P++Y+ AHN  R  VGV P+ W+ T+AAYA+NYAN  I  C++ HS GPYGENLA  
Sbjct: 191 LDSPQDYVNAHNTARAAVGVGPVTWDNTVAAYARNYANQHIGDCRLVHSAGPYGENLAWS 250

Query: 252 YEVMTAETAVSLWTDEKKHYDYNSNTCVNDSSHCLHYTQLVWSNTKSIG-AQVKCQNNWV 311
              ++   AV LW DEK  YDYNS++C      C HYTQ+VW N+  IG A+ KC N   
Sbjct: 251 SGDLSGIDAVKLWVDEKAFYDYNSSSCAT-GKVCRHYTQVVWCNSVRIGCAKAKCNNGGT 285

Query: 312 FLICNYYPPGNYM 323
           F+ CNY PPGNY+
Sbjct: 311 FITCNYDPPGNYL 285

BLAST of Lsi03G006210 vs. NCBI nr
Match: gi|778724756|ref|XP_004137080.2| (PREDICTED: pathogenesis-related protein 1A-like [Cucumis sativus])

HSP 1 Score: 287.7 bits (735), Expect = 2.5e-74
Identity = 140/171 (81.87%), Postives = 146/171 (85.38%), Query Frame = 1

Query: 164 MAFTNIFFALCLLGLTLTL------APIAPIMAKSYPKNYIAAHNAIRLQVGVEPLHWNA 223
           M F NI   LCLLGLTLTL      A  AP +AKS PKNYI AHNA+R  VGVEPLHWN+
Sbjct: 1   MGFANILSTLCLLGLTLTLTLTLTLASTAPSLAKSSPKNYIDAHNAVRAAVGVEPLHWNS 60

Query: 224 TLAAYAQNYANTKIATCQMEHSGGPYGENLAEGYEVMTAETAVSLWTDEKKHYDYNSNTC 283
           TLA YAQNYANTKIATCQMEHSGGPYGENLAEG EVMTAETAVSLW DEKKHYDYNSNTC
Sbjct: 61  TLADYAQNYANTKIATCQMEHSGGPYGENLAEGNEVMTAETAVSLWADEKKHYDYNSNTC 120

Query: 284 VNDSSHCLHYTQLVWSNTKSIG-AQVKCQNNWVFLICNYYPPGNYMGQRPY 328
            ND S+CLHYTQLVWSNTKS+G AQVKCQNNWVFLIC+YYPPGNY GQRPY
Sbjct: 121 SNDPSNCLHYTQLVWSNTKSVGCAQVKCQNNWVFLICSYYPPGNYNGQRPY 171

BLAST of Lsi03G006210 vs. NCBI nr
Match: gi|659110137|ref|XP_008455068.1| (PREDICTED: basic form of pathogenesis-related protein 1-like [Cucumis melo])

HSP 1 Score: 274.2 bits (700), Expect = 2.9e-70
Identity = 119/135 (88.15%), Postives = 126/135 (93.33%), Query Frame = 1

Query: 5   LQISISKAQNSPQDFVDTHNVVRAAVGVGPVSWDDTLATYAQNYADSKIDTCEMEHSNGP 64
           L  +IS AQNSPQDFVDTHN +RAAVGVGPVSWDDTLA YAQ+YADSKIDTCEMEHSNGP
Sbjct: 19  LATTISNAQNSPQDFVDTHNDIRAAVGVGPVSWDDTLAAYAQSYADSKIDTCEMEHSNGP 78

Query: 65  YGENLAEGYDEMTAIEAVKFWATEKKFYNHHLNRCIGDECGHYTQIVWRDTKNIGCARMK 124
           YGENLAEGYDEMT +EAVKFWATEKKFYNHHLNRC+GDECGHYTQIVWR T NIGC R+K
Sbjct: 79  YGENLAEGYDEMTGVEAVKFWATEKKFYNHHLNRCVGDECGHYTQIVWRHTTNIGCGRVK 138

Query: 125 CENNWIFVICNYNPP 140
           CENNW+FVICNYNPP
Sbjct: 139 CENNWVFVICNYNPP 153

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PRB1_TOBAC3.2e-4555.64Basic form of pathogenesis-related protein 1 OS=Nicotiana tabacum PE=3 SV=1[more]
PR1A_SOLLC1.1e-4050.36Pathogenesis-related protein 1A1 OS=Solanum lycopersicum PE=2 SV=1[more]
PR1A_TOBAC5.3e-4054.93Pathogenesis-related protein 1A OS=Nicotiana tabacum PE=1 SV=1[more]
PR1B_TOBAC1.3e-3853.52Pathogenesis-related protein 1B OS=Nicotiana tabacum PE=2 SV=1[more]
PR1C_TOBAC3.8e-3853.52Pathogenesis-related protein 1C OS=Nicotiana tabacum PE=2 SV=3[more]
Match NameE-valueIdentityDescription
A0A067KWY2_JATCU1.1e-7646.96Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09851 PE=3 SV=1[more]
A0A0A0K7N5_CUCSA1.7e-7481.87Uncharacterized protein OS=Cucumis sativus GN=Csa_7G070232 PE=3 SV=1[more]
A0A0A0K6A0_CUCSA9.9e-7086.67Uncharacterized protein OS=Cucumis sativus GN=Csa_7G070235 PE=3 SV=1[more]
A0A0A0K4C1_CUCSA2.5e-5775.00Uncharacterized protein OS=Cucumis sativus GN=Csa_7G070225 PE=3 SV=1[more]
A0A103XME5_CYNCS2.4e-5539.88Uncharacterized protein OS=Cynara cardunculus var. scolymus GN=Ccrd_004588 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT1G50060.13.3e-4052.86 CAP (Cysteine-rich secretory proteins, Antigen 5, and Pathogenesis-r... [more]
AT4G33720.12.3e-3850.00 CAP (Cysteine-rich secretory proteins, Antigen 5, and Pathogenesis-r... [more]
AT2G14580.14.0e-3846.26 basic pathogenesis-related protein 1[more]
AT2G14610.11.2e-3748.55 pathogenesis-related gene 1[more]
AT2G19990.19.9e-3753.17 pathogenesis-related protein-1-like[more]
Match NameE-valueIdentityDescription
gi|659110139|ref|XP_008455069.1|8.3e-7883.64PREDICTED: basic form of pathogenesis-related protein 1-like [Cucumis melo][more]
gi|1009121091|ref|XP_015877272.1|1.1e-7744.18PREDICTED: STS14 protein-like [Ziziphus jujuba][more]
gi|643728186|gb|KDP36354.1|1.6e-7646.96hypothetical protein JCGZ_09851 [Jatropha curcas][more]
gi|778724756|ref|XP_004137080.2|2.5e-7481.87PREDICTED: pathogenesis-related protein 1A-like [Cucumis sativus][more]
gi|659110137|ref|XP_008455068.1|2.9e-7088.15PREDICTED: basic form of pathogenesis-related protein 1-like [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: Cellular Component
TermDefinition
GO:0005576extracellular region
Vocabulary: INTERPRO
TermDefinition
IPR018244Allrgn_V5/Tpx1_CS
IPR014044CAP_domain
IPR002413V5_allergen
IPR001283Allrgn_V5/Tpx1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0007010 cytoskeleton organization
cellular_component GO:0005576 extracellular region
cellular_component GO:0045298 tubulin complex
molecular_function GO:0003674 molecular_function
molecular_function GO:0008017 microtubule binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi03G006210.1Lsi03G006210.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001283Cysteine-rich secretory protein, allergen V5/Tpx-1-relatedPRINTSPR00837V5TPXLIKEcoord: 104..120
score: 2.9E-18coord: 35..53
score: 2.9E-18coord: 310..323
score: 2.9E-18coord: 81..94
score: 2.9
IPR001283Cysteine-rich secretory protein, allergen V5/Tpx-1-relatedPANTHERPTHR10334CYSTEINE-RICH SECRETORY PROTEIN-RELATEDcoord: 7..232
score: 5.6
IPR002413Ves allergenPRINTSPR00838V5ALLERGENcoord: 103..122
score: 3.1E-5coord: 35..53
score: 3.
IPR014044CAP domainGENE3DG3DSA:3.40.33.10coord: 189..327
score: 9.0E-41coord: 13..140
score: 9.5
IPR014044CAP domainPFAMPF00188CAPcoord: 21..136
score: 7.7E-23coord: 199..315
score: 4.0
IPR014044CAP domainSMARTSM00198SCP_3coord: 192..323
score: 1.6E-46coord: 14..144
score: 3.2
IPR014044CAP domainunknownSSF55797PR-1-likecoord: 14..139
score: 1.16E-43coord: 189..327
score: 3.4
IPR018244Allergen V5/Tpx-1-related, conserved sitePROSITEPS01009CRISP_1coord: 105..115
scor
IPR018244Allergen V5/Tpx-1-related, conserved sitePROSITEPS01010CRISP_2coord: 310..321
scor
NoneNo IPR availablePANTHERPTHR10334:SF223F2J10.6 PROTEIN-RELATEDcoord: 7..232
score: 5.6