Tan0012330 (gene) Snake gourd v1

Overview
Name: Tan0012330
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: vacuolar protein sorting-associated protein 20 homolog 2-like
Location: LG03: 32769111 .. 32790563 (+)
RNA-Seq Expression: Tan0012330
Synteny: Tan0012330
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTTATCTCCAATCCACCGGACATTTCCATCAAATCGATTTCAATCTTCGTCTACAAAATTAAAATAGGGAGCCAAAATGGCATTTTAACCGAGAAAAGAGAAAAAAAGAAAGCTCACCCCTCCCTGACACGCCATCCGAGTCCGAGTTAGCCAGTTCTGGTTGGTTGTTCTTCCTAAAACCAAAATTCCCCTTTCTTCTCTCAATCTTCAGTTGGTCACTCCGGTGATTCTTCATCCGTCAAAGGAATCAAACACGAAGAATGAGGGATTGACGGATCTGTGCTGTTCAATCCCTCTGCTAAGACAATCGATCATATTCCCCCAAGTTGAACGAAAAAAGTCATCGGTGTTCCAATTGGGAGAAGTAGATGGGGAATTTGTTCGTCAAGAAGCCCAAAGTCACCGATGTCGATAAAGCGATTCTCTCTCTCAAGACGCAGAGACGCAAACTCGGTCAATATCAACAGCAGGTTTTCAATTTATCCCCTTTTTCATTTTTCTAAATTTCTTCTTCTTTTTGGTAATATGTCTTTGTTTTCTAATTAATTTACTCCCGTGGTTGGATTTTTGAAATTGGGTATCAAAGAAACTGGGTTGGATAGAAGTAAATATGTTTTCAGGATTTGCTTGCGGTGGTGAAATGGCTGTTTTATCTTGTGGCTGCTAGAGCCTAACTTTAGCTTTATTTGGTTAATAATTATATGATAAATAAGGCCCTTATCATCTAATTTGTTCTCCTTTCTTTCTCCGTCTGCTGTATATTATAGTTCTGCAATTTCAATACTTCTCTGTTGAAATGACTGTTGATTAAGTAGCTCAATGGTGACTTGAAAATTATTGAAATGCACTAACCTTTAAAATGATGTAATAGCTTATATTGTAAGGGTATTTTACTATCTAATTCTTGGGAATTTTTTATAGGTCTAAGATTTATATCTGATGATCTTTTTTTGTTAGTGAAGTTTTGAGCTCAAAGAACCGTTGAAATGTGATATAATTTCTGAGCGTGTCAATAAGTTTAGAATAATTCATTGTTGGTCACTGCATGTACTTGTTGGAGTTAGTGCAGATGTCAGAGCTATGTTCGGTCTCTGTTGCCAAGAATTCTTGTCATTATTAGTTTTCTCTCCTTTTGAACTAGAGCCCTTTTTTTGTGGTTGTGTAGTTTGTTTTGTTTTTGGCTCCATTCTTGGTCTTTCCCTTTTCTGTTATTGCTCTTGTGGATTCTTTTTTGTCATTTCATCCATGGAAGTTTGTTTCTTTATTAAAAAAAGAAAAAAAGAAAAAACTACGAAACATGAATTTGGATAACCTGTTTGACAGTGTGATTTGCGCTTGTAGGATACGTTGTATTTGATATTTTATTATAGTTATTTCGGGACATGGACATAAGAATATATACAAGAGAATCTCTGAACCAACATTCCCAAATTTTCATTTTCAATGATGATGAGGATGGATCTTTTTCCCACTCTCTTCTTTCGTTCGCAACTTTCAAGATTGTTTGATAGGAGTCTTTTCAAGTGAAAGGTCGGGCATCAAAAATGTGTCGTTCAGAAGTCAAGTAATGAATGATTTAGAAAAGAGGCAACTTTCAAAATAAAAAAGAACCCGATGACGTGGGTTTCAGCCTAGGTATATTATTCTCAATGAAGAAAGAACTGTTCTCATCACCCTTTAAAGTCCATCATGTTCGACATTTTTGCCGCCAAGCGATTTCATCTTTGAAGGAGAGCTCCATAAGCTCGGCTTTAATATCCACACGTCTCTATGTATGGATGGAATCATGAGAGTTAGAGCCTTCCAAATCATCAATTTGGCTTAGTTCATTTATTAGATCTGCCCTTCTAGTTTCGGTACAGCCAAATGTGGACACATTCCATGATTTCACAACTCCTTTTAATCCTTTGAGTTTTTGGATAAACCCATGTCCAGGCCAACCCTGTAAAATGTTGTTGTTCCACCATTACCTGAGGAGAGGTTCAAAAGAGTGATGACT
CATCCACATGTTTTCGAACTTGAAGTAGCCTGGTCCCCATTTCTGCCTCTGAGTCAAAAGGATGGGATAGTGATCTGATGTGATTCGAGGAAGCCTTTTAACCCAAGCACTATTGAATTTGTCTATACAAAGACCTGTTAAAAGAAAATGATCTATGAGGGACATAACCGGTTGTTCTCGTAAATTCGACCAAGTGTATTGCCCATTTGAGAGGGGAGGGCCGATGAGGGCAGCTTCAGAGATAAACTTGTTGAAGAGATTCATGCTTTTGGTGTTGGCTTTACCAGAGGATTTTTCAAAGGACCATCTTACCATATTAAAGTCTCCCCCAAGGATCCAATGGTACACAGGTACGATAGGTCATATAACTCCTCCCAAAGGGGCTGTCGGAGAGATGCTTTCGAAGGACCATAAATACCTGTTATCCAACAGTTGTAACCGTCCGTAAGAGATAAGTGTAGTGTAAGAGAATGTGTACCTTCGATGACTTCTAACACGTTTATGTTTGGGTTGTTCCACATTATCAAAATACCTCCCGAACTGTCAAAAGCATCTAAAGTTGCCCAAACAATATTACGAGAGCTCTGGAGGGTTTTGATAAATCTACTATCGATGGAATTTCGCTTTGTCTCCTGTAAGATGACAATGGTGGCATTTAATTGTTGGAGGAAATCTTTTATCAAAGCTCTTTTCTTTCTAGAGCCCAAACCTCTCACGTTCCATGACAGAATAATCATTTAGAGGGAGGTGACCCGTGGGTGACAAGTTGGCTGGAACTTTTGTCATAGTTAACCGAGGAGGTGAGATTTCTAACTTCTCTGATCAATTTGGTGGTCTTTGCTGGTGTTTTACCTTTCTTCGATCTAGAGGGGATGGCCATGAGGCAAAGGCCATTTGCTATCAACCAAGGAGCTAGCTTGAGCATAAAGGTGTCATTCATCAGACTTCTATTGGTGCCACTTGATTGATAGAATCTGCTCTGCTTTTGGGGGGAACATCATGTTGTTCGGAGGCAAGTTGGTGAAACAAATATGTAAAGGTGGGGGAAATGCAAAGTTCTTCGGTAAAAAGGTTTTTCGGGATGATGGAGTGGGAAGGGGATAAGGGGTGGGAAAGAAAGGTGGAATTTGTGTCTGGGCAGGGGCTAGAGAGCAAATCTTCGGAGTCAGAAAGTTGAATGGGAGCAGCAGGGGAGTGCTTTTTACCGTGAAGAAGAAAGGTTTGCTTGCCTTCGATAATGACGACTTTTTTAGCTCTGGACGAAGGCTGATTAAAAGGGGTTACTGGGCTTTTTTGTGGGGCCAAGGTTATTGAAGAAGATGGGTGTGGGGCAAAGGTTATTGGCCATTTTTGGGTCGGGGTAGGGTCGCAAAGAGATGGGGGTGAGAGGGGAGTGGGTTGGACGACCTTTTTGGTAGGTAAAAACATGGGCTTGGGGCCCAACGAAGAAATATCATTTAGAGAATCGAAATCGGGATTAAATGAAGATGAGGTTGTCAAATTAGAGTGCCCTGGGCGAAAGGCAGTCGGTGGGTCCGGTTTGTCAAAAGTCAAATCGGCGGTTGGTAAGCGCGTGGCATTGGGATAGGTGTACAACAGGGGTTTGTGTGAGAGGCGCAGGTAATTCGAGCCGACTGGACCCTCATTCGATCTCACAGGTTTCTTTGCTCCATGAACTTCGACAATGAATCCAATGTGGTAATCAGAATTAAAAAAAGGTTCAATGGCAACCTTGTTGCATCCTGTCGACGAGGATGGCATTTGGACAAAAGGTGGAAGAAAACCAGTTGTGTTACTGTGTACTTTGAGGCATACCTCCATCATATCCAACCTAGAGAGGGTTTTCTTTGAGACTTCACAGAGTCCTCCGCAGGCATCGCCTATGAACTTAAACGATTCCCAACTCCATTTGTCAATCGGTAGGTATCGCACCTTTACCCATCCCCCATAAGATGGGACCTTTACATCGTTAAGAAAGACTTCTTGGCTCCACTGTTCGA
ACTTCACCAAAAAGGAGCCAACCTTCTGCCAATGTTCTTTTTGTGCGAGATTATAAGCCTGTGTATGGTTTTCGCACATGATCAGTGCCTTGTTTGGGTGGATGGGGCTTATCGCACTTAACTCGCTGATTTCCTTTCTTAATGCCTCTGAAATTTGGAACCAGTCGTTTTGGAAGTTGTGGCGGTGAAGGACAATCGTAGAAGATAGGTTTATGGTGGCTAGGTCATATGGTTGAGTGTCGGAGGATGGGTGGGCAGTGGCATCCTTAGATGGGGGAAGAGGGGAGGGTGAGGAGTTGAGCAGGGCGTCTTTGAAGGAAACAGATTTGGGGGCCGGTAGGGAGGGTGGGTAGGTAAAAGAAGGGGTAAGTCGAGGGTGTGTGGGGTGAAACCTGACTAACATTGAAGACAAAAGATGCCAACCTTTCTTGTTTTCACCTGGAGGGATAATGATTTTGGAACATCCTCCATTGGTCTCAAGTTTAGCGATTTCGACACAATGTCCTCGCCTATTCGAGTGTTTTTCGAGCCAGAAGACGTCACCATCAAGCCTAGTTTCTTTGAAGAACTTTTGTGTGAGGGGGGATGAGAGAAGAAGACTAAGCGATGACCGGAACCAGTGCAACGAAGACAAGGGAGCCGTAAAGGAGATGGTGCGATCTCTCTGCAGTTTGGTGAGTCAAAAACGGTTCCCATTGGATCGGTTATCAAGGTCAATGACAAAGGACTTTCGTTCGTGGTTGAAGCTATGGCGGTCAAAAATTCTGGCCATGGGTAAGATGAGGGACGAGGAAGGGGGATGTGATTGCGGTGGGGGAAGAAAATCATGGTTGGAGGTCGGGGCGAAGGGGTGAGGGAGCGAGAGGAGGGAGAGAGAGAGAGCGTCATTAACAACTCCTTTTACCTTTGGAAAAGCTATAGGATAGGAGGAAAAAAGAGAACCGGTTTTACACCAAGATATAATATGGAAAAAAAACACTATAAAAAAACGATTGAATAGGTTTTTTTCTTTGGAATCAAAGATCTTTCCAACCAAATCGACAACAAAAAAGCTCGAATCTAATTCTTCTAAAGGACCTCTTTCCTTTCTCTAGATGGATGACCTAAAAAGACAATGCTTAGGAGAATGTGTGAATCCTTTGGGAATACTATCTTCCATCTGAGTGCAGTGCGAAATTGCTAATTGATTATTTAAATTTAAATAGAATGGAATGATTATTGTATATTTTTACTTCTTTTCATGTATAAAATATTCATGTTTCATGTTTAGGCATTGTATAACTATTTTCCTTTACAAATTATTGCTATATATAATTTTATTTGTTTATTTATTCCTGGGCACCTCGCTCCACTTAGATGATGCATTTTTTACGCCTCTTGTCTTGAGGTGGGGAGGCTTGTCACTTTGGTATGTGCTTTGGTATGCATCTTGTGCTTTGAAAACATGATGGGGATTAAACAAATATAAGTTTCATTAATGCATTTGTTATTATTTTTTTTGGGACACGAAACAAGGCTTTTCATTGAATTAATGAAAGAGTCTGGCTCCAAAATACAAAACACCACATAATGTGAAAAAGAAACAAACAAACTATAAACAAACTAACAAACCAAGAAAATAAAACAACAACATAACTAATGCAGATTAAGAGGGAGCAATAAAAGCATTCCAATTAGTCCTTCGCCGTGCATATGGCTGGAAAAAAGACAAAATTTAGCTTCTCCTTAGACCGGAAAACCTTTGTCATTGACCTGGACAACCTCTCCCATGGTCGGAACTTTCGACTCACTGAATTCCAACGTGATCGCTCGTTCTCTCTGGTAGCCCCTCTCCCTTTGCTTCATTGGCTTTACTCCACACTCAGAACCCTGTTCGAGCACCCCCTCAACTAGAAATTCTTCAAAGAAAACAGAGTAGATGGTCATGTTTTCTGGGTTGAAAAGCACACAAACTGTCGAGGCCACTGTGTGGAGCTAGCAAAACTTGAAACAAATGGAGG
TTGTTCCAAACTAATCATTCCTGCTGGAGAAAACTGAAAAGGCTGGCACCTTTTATTGACTTTACTCTCCAACCACCTACCAACCCCATTCCAAAACACCCGCCCCATTTAATTCAAAGGCCAACCAACCTCCCAAAACTTCACAAACTTATAAATCCGTCCTACAGGGCCCCAATTCTTCCCAACCCAGGAATCCCCCAAAGCCATATGATCTTGTCGGAATTGATTTGTCTGCTGTAGTCATTCTACCACGCTCAAATTTCCACGACGATTGGTTCTCGCTCCAAGAGGTCATAAAAGACTCCATCAGCGATCTTTGTGCCATCAACCCTATCCATCCCATCACTGTGACAGTGTTGAACGTGCTCATCTTCTCACTACCCAAACCGAATGGCAGCAGATAGGTCCCTTTTTGGTGAAATTCGAAAAATGGAGCCATGAAGCTTTCTATAGCGATCTCAAGGTCCCCTCTCATGGCGGTTGGATTAAAATAAGAAACCTTTCCATTGACAAATGGAGCTGGGTGACACGGTCGCACCTCTTAGAGGGTCTCGACACGACCGTGCGACACTTGCTCTCGCCCACGAGCAAGTCAGCCAACTCGAATTCGGATCGCCGGTGGCCTACAAGGCCTATCTCGCAACCTCCACCCCCATTCTTGGGAAAACAGTTTTTAGAAAACGGGGTCAGAGAGACTTTGTGAAAACGTTTTGGAAATCATTTGGAAGCATTTTAGAAAGTTAGTTATACAGCTGATAAGCAATTTCAATGAAAGCCACGCTTAACTAGCATATAACTCTGGGTAGGGAGAATCGATGAATAGTCGTATCCCCCTATAGTACAAACTAAGGATTCTAGCCATGACCCCCTTGCACGTCTTGGCGAGGAGTCATACGGAAGCAAAACTGCAAGGAAACTATCCTAACATGCGACAACGATAAAAGTAAAGTGAAAGCAAGATAAAAGAGTGGGAACGGGCATGCGGCCCACAACAGCACTCCACTGGACAAGCATGACCGTGACATTCTCCCCACTTAAACGGTTGACGTCCCTGTCAACCAACTAGACTCAAACCCTCGGTTCTACAAGCAGCTCTCTTCAGCTTCTGAGGCGGTCCCAGCGGACCTCTACAACCTGACTTCTTTTCCATTCTTCAAGAAACTCTTTCACTTCCTTACTCTGTACACTCTTCTTCGGGGTGTCATGCCTCCGCTAACACCCCAAACGCATTCTTCACAACCACCGGTGGAGCCGACTCCACATTTCTTCTCGTCAAAAGGGGTCGTTTTCGCTTGACCCAATCCTCGCACAACTCGGTGTCTCCACAGGCCTTCGAGTACATTCGAGCTTCGACATGGTTATGGGGTTGACTTCCCCAACCTTCCTTGGGCTCTCCCTTAGTTGGAGGGCCGACAACCGCTTCACACCACTTGGTTGTTTGAGTCTAGTGTGCACAACAGTTGGATTAGTCCCTGTGACCACTAGACACTCCGCCAATGGTATCGGTATGACCTTGTGTTCAAGGAGGAACTCCATCCTTAACACAACGTCGAAGTCATCCATGTTGACAACCACAAAATCTGTTTGTCCTGGCCCAAGCCCCAATCGCATCGTCACTCTTTGCGCCACTCCTAATACTGGCAGGGCCGCAGAGACTGACGCGACCTTCACCTTTTCAACATCTCTAGTCCAACTTAAATTCAATCTACGGGCTTCAATTTCAGTCATAAAGTTATGGGTTGCTCCTGAGTCAATCATAGTGCTCTTTGTTGGCTTCTGGTTAATAACCGTGTCCACATACATAAGGCCATGTGTGGACTCTCTAGTCCCAACTTCTTCTTTCTGCAACGCTGACAAAAATCTCAAGGCCCCGATGCGGGGTTTGTCCCCCTCTCCTTCAATTGGGGTATCTTCCTCAGAAGTTCCAACCTTGGTACCCGAAGTACTTGTCAGTGAGGCTTGAAAATTTTGTAAGGCAATCTTATCTTGGCACTCTACCC
ACCGTGAGGCCAAGTTGCAAAGGTAACAAGAAGGGGGGCCTCCGACTCACCCCATATTGTTGGTTCGACGTTCGCCATGTACTTCCTTCCCACTTCTTGTCTTATCCTCCTCCACTCTTGGGGAGTTAGGTGACGCCCCATTGTCTCTACTTGGGAGGTCCGACTCGCCTCACATCCTGGGGCTCTTCTTCGCTCCCATCACTTACCAGATCAAACACCCGTTCTGCCACAGCATACGCTTCTGACAAGGTCTGCACCCTTTGTTCATAGAGTTTGGACCTTGCCCATGGTTGTAATCCTTCAACAAACGAAAATACTTTGTCCTTCTCCGTCATGTCTCGAATATCTAACATTACCATGAAGAAATTCCTTAACATACTCCCGAATAGTCTTATCATGTTTTAGCTCGCGCAATCTGCGTCTTGCCAAAATCTCCACATTCTCAGGGAAGAATTGCGAGCGCAATTCTTGTTTCAATGTCTCCCACGTATCAATGGTACAACGCCCTTCTTGGACGTCCATATAACGTGTTCTCCACCACAACTTGGCGTCGTCTGTCAGGTGCATTGTCGCCAAGGTAACTCTCTTCTCTTCCATCGTTGTGTTTGTAGCCCTGAAATATTGCTCCATATCAAATATGAAGTTCTCAAGGGCTTTTGCATCACGAACCCCACAGAAGGGTTTCGGTCTAGGTACTTTGAGCTTGTCTGTTTGAGCTGCACCCCCAGCTGGGGCTTGATTTTCCACCGCTCTCATGGTGACATTAACTCGGGTGTTCAAATCAACCACCCCTTCCTTCAACTCTGCCACTTCTGCTCTCACAACTTCTATGGCAGTTCTAACTTCATTAGTCAAGTCCGTCATCATCTGAGCCATCAATTGTTGGGAGTGGTCTAAACCTTCGACACGCCCCTCCATGCGGGCAACAGGGCCCAACGAGCTTTCCCCACTCCCGAGATCGCAATCTCGTATAGCTTTATCCTCGAGGATCTCAACCCTCGTCATCAACTCTCGGACGGGCATTCCATCCAGGCGGCCAGCTACCGCATCGATCTTATTTGCTTTGGTTGCAATCTCATCCAGACGAGATTCCAAAACCAACATCCTACCAGGGACTTCCATCAGGAAGGGCATTTGCCCCTCAATCTCGCTCGCCCAGTCAATACGGGGCTCGCCAGCTTTCTCTCTCAAAGACATGATTATCGCTCTTGATTACGTAAGCCAAAATGGCTCGATACCAACCGACACTGTCGCACCTCTTAGAGGGTCTCGACACGACGAGGCGGCACTTGCTCGCCCACGAGCAAGTCACCAACTCGAATTGGATCGCGGTGGCCTACAAGGCCTATCTCGCAACCTCCACCCCGTTCTTGGGAAAACAGCTTTTAGAAAACGGGTCGAGAGACTTTGTGAAAACGCTTTGAAATCATTTGGAAGCATTTTAGAAAGTTAGTTATACGGTGATAAGCAATTTCAATGAAAGCCACGCTTAACTAGCATATAACTCGGGTAGGAGAATCGATGAATAGTCGTATCCCCATAGTACAAACTAAGGATTCTAGCCATGACCCCTTGCACGCCATTTTGGCGAGGAGTCATACTGGAAGCAAAACCGCAAGAAAACTATCCTAACATGCGACAACGATAAAAGTAAAGTGAAAGCAAGATAAAAGAGTGGGAACGGGCATGCGCCACGAACGCAATGCACTGACAAGCATGACCGTGGCACTGGGAATCGTTTAAATATATTGGTGAGGCCTGCGGTGGTTTATGTGAAGTTTCCAGGAAAACCTTATCCAGATTGGATATGATGGAGATTTCCCTCAAGGTTGAAGCAAACAAAACCGGATTCCTACCAACAACCATTCACCTGCCCTCTTCGTCTTCTTCGATTCTCTCTGTAAGTGTTGACCCCCTTTTTAATGCCAATTACTACACAGGCTTCATCGTCGGTGTTCATGGAAACAAACCGTCGTCAACGGTCGAAGAGGACTTTCCTA
AAAACGTGGGTTCGACACGCTTCCAACATGAGCCACAGCCGACCTCCAAACCCAAGGTCACGCGCCCTCTGACTTCGAAAGTGACCACAGAAAAGTCAGTTGATAAGAGCCCCTGTACTCTCCCTTTTTGTCCTTTACCGATAGACCCACGTGATAACCCTTTCCATTCTGGTCCTAATCCGGTCAAACCCTTTACAAACCCCAAAAAAGTCTGCCCCACACAAGCCCCCTAACCTTTCGTCCAAATCCTTAAAGCTTTCTTCCCGTCCTGGCCCATTAATAAAACCCAATCCAACCCATCACTGCAGCCATCTGTCGCCTCCCCCTCCCCCCGGGCCCACCCTCTGAGATTCCTTCCTCTCAGGCGAAAAAAGTGGTTATTATCGATGGTAAACAAACGAGACTCCTTCAAGGTAATAAACACGTCACATCTGAAGTCCCCTCCCCTATGATTTCGATAACCTTCTTTCAAGTTCTTGTTCGAATTCAGTTTCTCCCTCTATCCCCACATACCTGTCACCCTCTCACTCCATCACCCGACACCGTCTTTTTGAAGAGGAACCTACCAACCTCGAACAATCTTTCAGTTGAGCATTGTTCCGGAAAGCACCGCAATCCCAACCCAGGGATATTCGGTTGATTATGAGGTTTTACTAGATACAGAGGGCATATCCTCCCACGGAGATTGGTCGGATGAAGACTTCCTAAAACAAATTGCACCTTGGTTGACTGAAAATGGCCTTTGCATTATGGCCATCCCCTCTCGTGGAAAGAAAACCAAAACTCCAGCAAGGACGAGCCGTCTGATCAGGGAAGTCAAAAACCTAACCTCTACCATTAATTATGACAAACAACCCAATGCTCCTGCCATCAATGGGCCGCCGTCGGTTACATGATTGTTCTCTCTTGGAATGTTCAAGGATTGGGCTTGAGGAAAAAAAGAGCCCTTATCAAAGATTTCCTTCAAAACACAAATGTCACCATTGTGATCTTGCAGGAAACAAAGCGATCATATATAGACAGGAGATTTATAAAATCCATTTGGAGTTCACGTAACATCGCATGGGCGGTCATTGACGCTATTGACAACTCCGACGGTATTCTAATCATGTGGAACGACCCCAACATCATTGTCTTAGAAACTATTGAAGGTATCCACTCCCTCACCCTTCATTTATCTTTGACAGATGGTTATAGGTTTTGGCTTACAGGTATATATGGGCCCTCAAAAGCCTCATCTCATGGAAAGAGCTTTATGACCTCTCTCACCTTTGCACTGAACATTGGATCGTGGGGGGAGATTTCAATCTAACTAGATGGTCTCTTGAAAAATCTTCCGGTAAAGCTAGCACTAAGAGCATGAAGCTTTTCAACAAATTTATCTCCGATTCGGTCCTCATTGGCCCCCCTTTATCCAGCGGTCAATTCACATGGTCTAGCCTTCGAGAACAACCTATGACGACTCTTATAGACTGTTTTCTTCTCACTGAACCCTGTATTGGGAAATTTAGTAGTGCCATGGTGAAAAGACATCCCTGGATTACCTCGGATCACTACCCGATACAGGGTTCACTGAGAAGAAAACAGTCTATAAGAGTCGTCCAGGTTACTTTAAATTTGAAAACATGTGGTTGGAGCATCGAACCTTTGAGCCTTCACTCAATTCTTGGTGGACAAGCAACCACATTCAAGGTTGGCCAGGTCATGGGATTATCCAAAAACTAAAATCCTTCAAAGCTACATTAAAAGATTGGAATTGGAAGACCTTTGGATGCACCAAAACCATAAAGGCGGATCTGACTAGAGAACTTACTCGCATTGACCGTTTGGAGGAATCTAACAATTGGGATCCCGCACATGTTCAACGAAGAATAGCCATCAAAGCACAAGATATCTCCTTCATGGATGAAATGGCATGGAGACAAAAATGCAAAGCCAAATGGGCTATCGATGGAGATGAAAATAACTCATTTTTCCACCGTATTTGTTCTGCCAGGAAG
CGAAAAAAACACCATTATGGAGTTATTACCCAGAAAAGGTGAAAGTCTTGTTGTTGACAAACATATTGAGGAAGAATTCCTTGATTTTTTCTCCAACCTTTACACAAAGGCAGGTGGCACTAGAACTCTCCCAATGCTGGACTCTTGGAATCCTATTACTTCGACCCAAGCCTTGTCACTAGAGACCCCCTTCACCGAAGAAGAAATTAGAATAGCAGTTATGGACCTTGGCACCAATAAATCCCCGGGCCCCAATGGCTTCACTCGAGAATTCTTTAAAAAACATTGGAACATTCTCAAAGAAGACATAAAGGGGGTGTTCCAAGATTTTTTTTGAGAAGGGCATCATCAATACTAGCCTAAACGAATCATATATCTGCCTCATTCCAAAGAAGACTGGCGCTAAGATGGTGGGTGATTACTGGCCTATTAGCCTCACCACTTGTGTTTACAAAATGGTTGCTAGAGTTTTATCCGAACGCCTCAAGAGAGTTCTCCCTGATGTAATTACAGAGTACCAGTTTGCCTTTGTTGAGGGTCGTCAGATTATTGATGCCTCCCTGATTGCCAATGAACTCATAGGGGAATACGCTAAAACGGGTAAGCAGGGGGTTGTGATCAAACTTGACATTGAAAGGCGTTTGACATGGTTGATTGGGATTTCCTCGATATTGTTCCTGAAGCTAGAAGATTTGGCTCAAGATGGAAGAAGTGGATTCGTGGTTGTCTCTTCGGCCAATTATTCAATTATCATTAACGGAAGACCCCGAGGAAAGATTCGTGCTTCTAGGGGGCTTAGACAAGGGGATCGCCTTTCTCCCTTCCGTTTTATCCTCATTATTGACTGTCTTAGCAGTATGCTGAATCATGAAGCTAAACAGGGCAACATCAACGGTTTGGTTATTGGTGGTGGACAGTTAGCCATAAATCACTTGCTTTTTGCAGATGACACCATTCTTTTCTCTACTCTTGACAAAGAAAAGATTGCATCCCTCTTTAAAATTGTGAAAAAATTTGAAGAGGCAACTGACCTTCGTATAAATCCTCAGAAATCAGAATTCCTTGGCAGTAATCTTCAGGCGAACGAGGTTACCTCCCTAGCTAAGGATTGTGGTTGCAAAACCATGTGGCCATCAGTATATCTCAGATTGCCTCTAGATGGTAAGCCTCACACTAGCTCTTTTTGGAACCCAATCATAGAAAAGATTGAAAGAAGGCTCAGGGGTTGGAATAACTCATATATCTCAAAAGGAGGTCGCCTCACCCTTATTCAAGCCACACTGTCCACAGAAATCTCCTATGGAATGGTAGTAAGGAGACACAGGGATTTCATTCTATTAATTGGGCGACAATCCAACTTCCATTTGAGGAAGGTGGTCTTGGAATTTTTGATTTGAAAGATAAGAACTGCAACCTTTTAGCCAAGTGGATTTGGAGGTTTCATCTTGGAAAAAATGCTCTTTGGAGGAGAGTTGTGGCAACCAAATACGGTACCACCCACTTTGGCCTTCTACCAGATCCTAAGAAGCTCAAATCCCACCGTGGCCCTTGGGGGTAGGATTATGGAGCAACAAAGCTTGATCCTGCAAAACATCTCCAACATAGTGGGGGATGGCAAATCGACTTCTTTTTGGAATGATAAATGGGCTTCCAATATTCCATTCCGAGACACTTTCCCTCATCTCTATCAGTTGACTTGAAGGAAACAAGCTTCTATACATGACATGCTAAATGAAGAGGGTTCTTCATGGAATCTTGGCCTCCAGAGAAGCCTTAAGGAGAATGAATTTGAGGAATGGTCGTGTCTCTCTACCATTCTCTTTTTCCCCTCAAAGCCGAGGACCAAAGATTCGTGGAAGTGGAATTTAGATAAACATGGTTCCTTCACTGTTAGATCCCTCATCAATTCTTTGGTTTCTAATGCAAATGGCAACCACAAGATAATGTATAGAAATATTTGGAAGGGCCCAACCCCTAAAAACGTATGTTTTTTCCTTTGG
GAAGTCAACCATAGAGGCATTAATACTGCAGACACTCTATTGAGGAAGGCTCCTTGGATTGTCTCCTCCCCAAATTGATGTATTCTTTGCAAACAGGATAGCGAATCCGCGAGACACATCTTTGTTGCTTGCCCATACACAACTAACATTTGGGGTCCAATCCTAAATGCTTCACATCTTCAGATTGTTCTCCCCGAATCATTGGTCGACATTTTGGAAGTCTTGCTAACGGGCCATCCATTCACAAACGGGAAAGGTTGCCTTTGAGGGTAAGTATTGTCAAGGCTCTTTTTTGGTCCATTTGGTTAGAAAGAAACAACAAACTCTTCAACGACAAGGCTCGCGATCCAAATGTGGTTTTAGAACATTTTCTTGGCTTTATCTTGGTGTAAATTTTCCCCCTAATTTAGCAACTACAGTCTTACTTCCCTCTTATGTTCTTGGGAAAGCCTTTTGTAACGGGTCTTTTGATCCTTTTTTGTACATTTCATATCATCAATGAAATGGTTCACTTGTTTCTCATCAAAAAAATAAAAAATAAAAAATAAAAGCATTCCAATTCAAACAAATTTCTTGGATAGAAAACCCGACAAGCAATTTAATGATATCTTTCTAAATAACCATGGAACTTTAATGATATCTCATTAAATTAATACATTCTTTTTAAAAAAACATGGAAACTTGAAGATATCTCCTTTTATGATGTTCCAATTTCTAAGTCTATTTAAAAAATTCTCTGAAAGTTGCGAGATGACAGGTTGGAGGAGTTCCTTTTGCACCCCCTGTTAGGGGGAAAGGTTCGTTTTTGTGGCAATCGGGGATTTGTGCTATCCTTTGGGGGCTTTGAGGGGAGAGAAACAATAGAATCTTTCTAGGGATTGAGTGATCCTCTCATGATATTTGGCTCCTTACTAGATTCTATGTTTCTCTCTTGGCCTCGGTGGCTAAGTCTTTTTGTAATTATCCTTTAGGGAGTGTTTGGTTTATGGTGTTGGGTTGGGTTGAGTTGGGTTAATAACCCAACTCCTTGTTTGGTTCACCAACAATTATAACCGGTGACTTAATAACCCAACCCCATGGAGTTGAGTTATTAACTCAACCGCATTTGCACTTTCTCCGGGCGTTAACGACTTTCCAAATGACGGTCCAGTGACTTTCCGACGACGGTCCGACGACTTTCTGATGACTTTTCCGGCGAGATGAAGAAAGTGGAGAAAAGAAGAAGATGAGGAGAGTTTTGAAGTGTGTCTTTAAGTTGCTAATACTCTAGTTTAAATAAGCACCAAAAAGTCCTAATCCTTTTAGAACAAGGAATTCTAATCATTTTAGAAAAAAGAAATCCTAAACATTTTTAAAAAAGAAATCATAATCGTTTTTTAAACTTTGAAACACCCTTTCCTAATCCTGATCTGATTATCCAAAAATTCAAATTTAAAACAAATACAATTAAAACACCTAATTTTACGGGGCGTCACAATATACCCTTCTAAGGAAAACTTTCGTCCTCGAAAATTGCTAGAACTTTACAACGGTCACGTCTTCGAGCCATAATATAAACTTAAACTCTAAACTACCTTAACACAAGTACAAAGAAGTCAAAAGCATAGCATAATAAGTGTCAACACACAAGAAAACTCAAATAACTTTAACTACTTTACAAAATAAAAGCAAACCCGAAGATAACGAAGTGATTTCCTGCAGGAGGCTATAAGGATCGTTGCCTACAATGTAGGCGAGTATGCAAAATCTAAAACCCATGTTTTGATACCAATTGTAACGACCATAATTTTTCAAACTCAAAATTTCCTAGGGACGTTACTAAGCAACACCGCCTGCACTGAAAATTTCGGTAGCATAACGGCGTTGTTAGGACCTTGATGACATATCTAATATTTTGTAGATATGCAACTTCAATATGACATAAAATATTTAAGTATGTTGGCATATTTTTAACATCTTCACAAGGTAAACTATTTATTCCATTCTAAAGATGTCCATAG
ACAATGCATATGTTTAGACACATTTAAATAAACACTCTTCATATGCAAAGTATTAATACTAAATTAATACCTACTTAAACTAAAATAAAAAAGTAGTGGTGAACATATGCATAAAAAAAGGAATAAAATTCATATTGAAGAGTAAAAGTTCTAAATAATGAATTTTATTGGAAAAGAAATATTGATATCAAACCCAATATCCATTACTTTTGCAAATATCCCTAACATACATTTGGCAAATGGTAATATATATTTTAATTTTATAATTGATCACATAATTGAGTTTTGAAATTGTAGACAAATGTGTTTGTATTTGATTTGTTGTGAAAACAAAGTAAGGAAAAAATGTTACAAAAAAAATCAGATCTTTCACTTTACATAAGATAAAAATGATTTTTAGGGGATAAGAAAAATTGATATCAATCAATCTCAATATCCAGTATTGTTCTAAAAGTCTCATACATCTTGCAAATGGTAATATATATTGTAAATTTGTAATTGATCTCATAATGGAGTTTTGAAATTGTAGACAAATATGTTTGCAATTGATTTGGTGGAAACAAAAAAAAGTAAAAAAAAAAAAGTTACAAAAAATCAGACTTATTCACTTTACAAAAAACTGCACCAGATAATCCTGACATAATAATTCAACTCTGCAAACCAAATACTGACTTCCTGACTCAACTCAACTCTACAAACCAAACACTGATTTCCAGAACTCACTGACAAAACAACTCTGCACGACTCAACCCAACTCAGCGATCCAAACGCTTAGATTTTATTTTGCTTGGGTAGAGCGCTTTGTTGTAAGCTCCCCTCGTTCTCTTTTGTGGGCTGGTTTTTTTATGCCCTTGTTTTCTTTCATTCTTTCTCAATGAAAGCTCGGTCTTTTAATTAAAAAAAAAATTCTCTGAAAGTTCATCTCTTTTAAATATTTATGATTTCCCATTCTTTTGAATTCTGTCATAAATTACTCTCTAATTATCTAACTTCAATTTATCCGTTATGAGTATTAATTAACTATATTTTAAATAATTTTTAAGTCATCTATAAATTATGAAATTGAATTATGCATGACCACAAATCATCATAAAGAAAAGTTGCATTTTACCCTGGACGGAAGACACAACTTATAAAGATCTCTAAAAGAGGTCAGATATTACACTAAATAACTTCTTTGTGGGTTTCTTTCATAGGATATTACACTAAATAACTTATTTTTGGGTTTCTTTCATAGGAATGCTGAATGCAAGTGCATTTTACTTTGAAATTTGAATAATTCTTTTTTGTCTTTCCGTGACTTTTTATCCCCTAGCTTGATGCTGTCATTGAAGCAGAAAAGAATGCTGCAAGAGAGTTAGTTCGTCAAAAGAAAAAGGATCGAGCCTTGTTAGCTTTGAAGAAGAAAAAAGCACAAGAAGAATTATTAAAGCAAGTTGATGCCTGGCTGATAAATGTTGAGCAGCAAGTGAGTTATGTTAATTTTTCCCTTGTTTATTCAATCTTATTTTTGCCTCCCTTTTAACTATACTTTTTTGTTTCTAATAAAATAATGATTTCCTTGGATCTTAAAATAAAAATTTTGATAGTTGCAATAGCTAGGTTTGGTTTTCTGGTTCGATATTAACAATTAAATTGTCTCCTGTGCTGTATTCACGTTTAAAACTTTTTGTACTTTAGAAATTTGTGTTCATGCTTTAATATAGTTGTTGGCTCAGTTGTCAGATATTGAGTTGGCCAGCAAACAGAAAGCTGTTTTTGAAAGTTTAAAGGCCGGTAATAATGCTATCAAAGCTATACAAAGTGAAATTAATATAGATGATGTCCAAAAGCTGATGGATGATACTGCTGAAGCCAAGGCATATCAAGATGTAAGTTCGTGAATTTTTCAATTCATCGTTTTATTAGGATTCATGTTTGAATATTTTATTTTCTTACTTGGTTTACATGTCAGACTATGGTTGAAATAAAATTATATGAGGTCTAGTACTCTCCCAAAA
CTATTTCCGAAGTGTCTAATTCCAGAATTTCCCAAGTTCCCAACTCTACTCTAATCTATTTATCAGGTAATTACAAATATGCCACTATTGATATTCCTTATGTGCCCCTACTAGTAGTCCTATCAATACTGTCTCTAAAACACACTTTGCCTGACAATCTGATCATAACTTAATTGGTAAAGACATATCCACGACCAAGAGGTTGGAGGTTCAAATCTCTTCACTTCAAATGTTGTGTTGAACTAAAAAAAAAAAAAACTACACCTTGGCTCTCAATTATTCCCAATATTGTTGTCAAACACGTTGATTAACACCCATGTTACCCCCACTCATGACCCTTAGTTACTCCTTTTGAGTAGGGTCTCAATGATTCTTTTATGTGATGCTCCACACAAGATTTTTTGTGCTTTGAAAATCCATGTGTCCTCGTTCTTTCTCAACCTGTTCTTTCCTTCTTTCTTAACTAGTATGAGTTTTTGTGGCTCCTTAGATGGTGGTTTCCAAATACGTTTGGGTCTCTTAAGTAGGAAAGACCCAATCTTTCTTCCTTTGCTCCATGTCGGTGCTTCTATCGTTGTGGTCAGATCAATGCTTATGGTTATTAGCCTATTTAGTACCCACAATCAATTTTTTTTAAGCTGTGCATGTTGCCATCCTTCTTCCACTACCTTTGGCTTCTTGTATGGGACATTTGGTCTCATTTGAATTTTTGTCTTGCAAATATTTCTCTTATTCTTGTTAAGAAATTGAGCTTTCAGAAATGAAAGAATATACAAAGGCATGCTAAAGAAAGTCCTACAAACTCTCATTTGGCCTTGAGTGTGGCAATTCCAATGTTCTCATGGTAAACCACCAATTAACCCAAAAGCTTAAGTCGATAGATTATGACAAATTTAATTATATCAACACCAACACTCCCCCTCACTTGTGGGCTTAGAAATTTGAACAAAGACCCAACAAGTGGAAATCAAACACTCTCCCTAACTTGTGGGCTTGAAAATTTGAACATTGGCCCAACAAATACAAATCAATATTGATTGGGAGAAAATGACTTCACCAGGGTTCGAACACAGGAACTCCTGCTCTGATACCATGTAAATCACCAATTAACCCAAAAGTGTAAGCTGATAGGTTATGACAAATTTAATTATATCAACACCAACACTCGATCACCTCAACCTCGATCATGATATGGGGATCTTCTTCTTAGATAACAATCTCATAATTCTGTGGATCTTTTTTATTTTATTAAGCATGTATTTTTTTAATTCTTAGATAAAGTCTTCCCCTGCCCCGAGGCTTAAACTTTGGAGACATCAAGAGTTTTCCTTATTAAGCTCACCCGAAGATTTGAACACGCGTTCTTCGAGTAGGTGTTACTGGCCTTCACCAGGGGTTTGCCCCTTGGGGGCTATTTTATTAAGCATATATTGAATCCTAGAGCCAGCGTGCTCCTTAGTTTGACCAATTTCTTGATGAGAACTCCTTTATAAACTTATTAACCTTGCCATTTCAGAAACCCAACACAGAAAATACTAATGTGCCCAGAAGATTATCAAACAAATACAAATTCATTCACTTTGAAAGCACTCCCTCCAAACTATGCTCCAAAATCTGCCTAATTCTCAACATTTCCAACTCACCCTATTTATGATCAATTAGCATGTCCTTGCTGCTATTCCTAATAGAAAGGTGTTTTCTCATTTTCTTAAAAAAAATTACTGCCGAGGTTCTCTAATACCTTCTCGGTAACTCTTCCAGCTTGGTGAGAGGAGATTATGGTTTTACTACAGACTGGTTATTGTACATCTTTTCTTCTTAGTCATGATAGGAAATGCTCCATTCTCTCCTCTTGCATCTCTACTTTGTTCCTTTTTTCTTTAATTTATTTTCATTTTCTTTCTTTCTTTCTTTCTTTCTTTATTATTTTTTTTTTTTTTGGGGGTGTAAAATCACTTTGAACTTCTAGTCATAATAGTGGTTCAGGGATGTTATAGAGT
TGAAAGATTGATTTAGATCTTAGCTTTTCATCAACTCAAAACATATCATTAACACAGGAATTTGACTTTCTTAACATTGTATTGGTGAGGAAGAGATTTGATCTTGGTTTTGTTAATTCCTCCCATGCAGACGACTTTCTGCATTAGTTGACATGTGATGTTACCAGATTCTTGTTTTGTTGGGCTCTGTAATCTATCGTACAAAATTTGAAGTTGACTAGCATGATTGAGCTCAAATTTTATCTATATTTTTGAAGGAAATTAATGCCATCTTAGGAGAGAAATTGTCAGAAGAAGACGAAGAGCAAATCTTAGCAGAATTTGAGAGCTTGGAAGCTCAGGTATTGTGTGTTTTGTTGTCTTATTCATTATTACTTTTCATCCTGCATGATAGGGTCAGTCCGTTCAGTTGTTTAATTAGATTTCTACTTACAGAATGTCATTTACTCCCGTAGGAGGGAGTACTAAAGTGCCTTGACTGGTAAATTAGTTTAATTTTGAATTTAGTTAGGAATAAGGTTTTTTGAGAATGCACCTTCACTTGGCTGTTATTGCAAATTGAATTTGGTCAATCGCAGTTGTAGTGCAAATTGCAGTATAAGATTGGTTATTCTTTTTGTTCTATTCTGGTCTCTAAACTTTGGAACAATTCTATTTTAGTCTTTGCACTTGCACTTTACACAAAAGACAAATCCTTGTCAATTGTCACTAGTTTTTTGATTAATTATTTAATAGAAACCTAGATTCATCTTCTGTCTATTTTAATTTAACTAACTCTGTGATGTCTACGCATGTTTGAAGCCAATTGTTGGGTTTGCAAATGGGTTCTCGTTAAATGTTTGATGACAAAAATCAACTTTAAGGGCTAAAATAGTCTTTTTTTTAATACAAAATTTAGAGACTAAATTAAACCTTTGAAAGTCATAAAGACAGATAGTGATAAGTATATAGAGTGGCTACTGCTGATTGCCTTTTAGTATACTATCTCAATGCGATCCCCCCATCGACACTTTTTTCAGCTCACTGTTCAAGATCTGCCCGAGGTTCCAACGCCTAAAGAAAATGCTGAAAAGTTGGATCTTCCCGATGTACCAACCAAAAAACCTGTTGCAACTGATGCCATTGAAGAAGATGCTGAAATTGTTTCCTCTGATGCCCTCACAAAGAGAAGAGGTCTCTTTCTCTCTCGTTTTCTACTCTCTTTTACATCCACTGGTTTCATAATACATTAAATTTGTTGGAATGCAGTTATGGAAGAACCATTGGCAGCATAAGTGTCGGTACTTGATCGAGACTGCTCAAAATGTTCGAGTACTGGTGAAGTGGTTTAAGCTGAGAATCTGTTATTTGATTTGTGAATTCATACTAATGGTCATAATGGCTTCATTCACCAATATGATGTTGTACAGATACTTCCTCATGCACGTATATAAATTTATTTTCTTTCTGCTCA

mRNA sequence

ATTTATCTCCAATCCACCGGACATTTCCATCAAATCGATTTCAATCTTCGTCTACAAAATTAAAATAGGGAGCCAAAATGGCATTTTAACCGAGAAAAGAGAAAAAAAGAAAGCTCACCCCTCCCTGACACGCCATCCGAGTCCGAGTTAGCCAGTTCTGGTTGGTTGTTCTTCCTAAAACCAAAATTCCCCTTTCTTCTCTCAATCTTCAGTTGGTCACTCCGGTGATTCTTCATCCGTCAAAGGAATCAAACACGAAGAATGAGGGATTGACGGATCTGTGCTGTTCAATCCCTCTGCTAAGACAATCGATCATATTCCCCCAAGTTGAACGAAAAAAGTCATCGGTGTTCCAATTGGGAGAAGTAGATGGGGAATTTGTTCGTCAAGAAGCCCAAAGTCACCGATGTCGATAAAGCGATTCTCTCTCTCAAGACGCAGAGACGCAAACTCGGTCAATATCAACAGCAGCTTGATGCTGTCATTGAAGCAGAAAAGAATGCTGCAAGAGAGTTAGTTCGTCAAAAGAAAAAGGATCGAGCCTTGTTAGCTTTGAAGAAGAAAAAAGCACAAGAAGAATTATTAAAGCAAGTTGATGCCTGGCTGATAAATGTTGAGCAGCAATTGTCAGATATTGAGTTGGCCAGCAAACAGAAAGCTGTTTTTGAAAGTTTAAAGGCCGGTAATAATGCTATCAAAGCTATACAAAGTGAAATTAATATAGATGATGTCCAAAAGCTGATGGATGATACTGCTGAAGCCAAGGCATATCAAGATGAAATTAATGCCATCTTAGGAGAGAAATTGTCAGAAGAAGACGAAGAGCAAATCTTAGCAGAATTTGAGAGCTTGGAAGCTCAGCTCACTGTTCAAGATCTGCCCGAGGTTCCAACGCCTAAAGAAAATGCTGAAAAGTTGGATCTTCCCGATGTACCAACCAAAAAACCTGTTGCAACTGATGCCATTGAAGAAGATGCTGAAATTGTTTCCTCTGATGCCCTCACAAAGAGAAGAGTTATGGAAGAACCATTGGCAGCATAAGTGTCGGTACTTGATCGAGACTGCTCAAAATGTTCGAGTACTGGTGAAGTGGTTTAAGCTGAGAATCTGTTATTTGATTTGTGAATTCATACTAATGGTCATAATGGCTTCATTCACCAATATGATGTTGTACAGATACTTCCTCATGCACGTATATAAATTTATTTTCTTTCTGCTCA

Coding sequence (CDS)

ATGGGGAATTTGTTCGTCAAGAAGCCCAAAGTCACCGATGTCGATAAAGCGATTCTCTCTCTCAAGACGCAGAGACGCAAACTCGGTCAATATCAACAGCAGCTTGATGCTGTCATTGAAGCAGAAAAGAATGCTGCAAGAGAGTTAGTTCGTCAAAAGAAAAAGGATCGAGCCTTGTTAGCTTTGAAGAAGAAAAAAGCACAAGAAGAATTATTAAAGCAAGTTGATGCCTGGCTGATAAATGTTGAGCAGCAATTGTCAGATATTGAGTTGGCCAGCAAACAGAAAGCTGTTTTTGAAAGTTTAAAGGCCGGTAATAATGCTATCAAAGCTATACAAAGTGAAATTAATATAGATGATGTCCAAAAGCTGATGGATGATACTGCTGAAGCCAAGGCATATCAAGATGAAATTAATGCCATCTTAGGAGAGAAATTGTCAGAAGAAGACGAAGAGCAAATCTTAGCAGAATTTGAGAGCTTGGAAGCTCAGCTCACTGTTCAAGATCTGCCCGAGGTTCCAACGCCTAAAGAAAATGCTGAAAAGTTGGATCTTCCCGATGTACCAACCAAAAAACCTGTTGCAACTGATGCCATTGAAGAAGATGCTGAAATTGTTTCCTCTGATGCCCTCACAAAGAGAAGAGTTATGGAAGAACCATTGGCAGCATAA

Protein sequence

MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALLALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDDVQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENAEKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA
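
The protein sequence above corresponds to the coding sequence read codon by codon. A minimal Python sketch of that translation is given here, assuming the standard nuclear genetic code; the function name `translate` and the table-building approach are illustrative choices, not part of the database record. The short input is the first 30 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the Tan0012330 CDS shown above.
print(translate("ATGGGGAATTTGTTCGTCAAGAAGCCCAAA"))
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly with the protein listed above.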
Homology
BLAST of Tan0012330 vs. ExPASy Swiss-Prot
Match: Q9FY89 (Vacuolar protein sorting-associated protein 20 homolog 2 OS=Arabidopsis thaliana OX=3702 GN=VPS20.2 PE=1 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 1.9e-81
Identity = 169/226 (74.78%), Positives = 195/226 (86.28%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPK+T+VD+AILSLKTQRRKLGQYQQQL+ VIEAEK AAR+L+R+K+KDRALL
Sbjct: 1   MGNLFVKKPKITEVDRAILSLKTQRRKLGQYQQQLEKVIEAEKQAARDLIREKRKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKK+ QEELLKQVD WLINVEQQL+DIEL SKQKAVFESLK GNNAIKAIQSE+N+DD
Sbjct: 61  ALKKKRTQEELLKQVDQWLINVEQQLADIELTSKQKAVFESLKQGNNAIKAIQSEVNLDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPK--- 180
           VQKLMDDTAEAKAYQDE++AILGEKLS EDEE+ILAEF++LE+ L V+D+PEVPT +   
Sbjct: 121 VQKLMDDTAEAKAYQDELSAILGEKLSAEDEEEILAEFDNLESLLIVEDMPEVPTTELMP 180

Query: 181 ENAEKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           E  EK+DLPDVPTK PVA+          +    TKR+V+EEPL A
Sbjct: 181 EEPEKMDLPDVPTKAPVAS----------NETTSTKRKVLEEPLEA 216
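
The Identity and Positives figures in each HSP header are simple ratios of counted alignment columns to the alignment length. The sketch below, with hypothetical helper names `percent` and `count_identities`, shows how such a percentage can be derived; it is an illustration of the arithmetic, not the BLAST implementation.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


def count_identities(query: str, sbjct: str) -> int:
    """Count columns where two equal-length aligned strings carry the same residue.

    Gap characters ("-") never count as identities.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have the same length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Figures reported for the Q9FY89 hit above: 169 identities and
# 195 positives over an alignment of 226 columns.
print(percent(169, 226))
print(percent(195, 226))
```

`count_identities` expects the two aligned rows joined across all blocks of an HSP, with gaps included, so that both strings have the alignment length.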

BLAST of Tan0012330 vs. ExPASy Swiss-Prot
Match: Q8GXN6 (Vacuolar protein sorting-associated protein 20 homolog 1 OS=Arabidopsis thaliana OX=3702 GN=VPS20.1 PE=1 SV=1)

HSP 1 Score: 302.4 bits (773), Expect = 4.3e-81
Identity = 166/224 (74.11%), Positives = 200/224 (89.29%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKP++T+VD+AILSLKTQRRKLGQYQQ+L+ VIEAEK AAR+L+R+K+KDRALL
Sbjct: 1   MGNLFVKKPQITEVDRAILSLKTQRRKLGQYQQKLEKVIEAEKQAARDLIREKRKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           AL+KK+ QEELLKQVD W+INVEQQL+DIEL SKQKAVFESLK GN+AIKAIQSE+++DD
Sbjct: 61  ALRKKRTQEELLKQVDQWVINVEQQLTDIELTSKQKAVFESLKQGNSAIKAIQSELDLDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPK-EN 180
           VQKLMDDTA+AKAYQDE+NAILGEKLS EDEE ILAEF++LE+QL V ++PEVPT + E 
Sbjct: 121 VQKLMDDTADAKAYQDELNAILGEKLSAEDEEDILAEFDNLESQLIVDEMPEVPTKESEE 180

Query: 181 AEKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           +EKLDLPDVPTK PVA+     +AEI  +++ TK +V+EEPL A
Sbjct: 181 SEKLDLPDVPTKTPVAS-----NAEITPAESATKTKVLEEPLPA 219

BLAST of Tan0012330 vs. ExPASy Swiss-Prot
Match: Q503V0 (Charged multivesicular body protein 6 OS=Danio rerio OX=7955 GN=chmp6 PE=2 SV=3)

HSP 1 Score: 135.2 bits (339), Expect = 9.1e-31
Identity = 81/196 (41.33%), Positives = 131/196 (66.84%), Query Frame = 0

Query: 1   MGNLFVKK--PKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRA 60
           MGNLF KK   +VT+ D+A+L LK QR KL QYQ+++   ++ E+  A++L++  KKD+A
Sbjct: 1   MGNLFGKKKATRVTEQDRAVLQLKQQRDKLKQYQKRITLQMDKERQLAKQLLKDGKKDKA 60

Query: 61  LLALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINI 120
           LL LKKK+ Q++LL++ +  + N+E+ + DIE A  +  V E LK GN+ +K +   ++I
Sbjct: 61  LLLLKKKRYQDQLLEKTENQISNLERMVQDIEFAQIEMKVIEGLKVGNDCLKKMHEVLSI 120

Query: 121 DDVQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKE 180
           ++V+K+MD+T +A  YQ +I+ +L   L++EDEE +LAE E++       +LPEVP    
Sbjct: 121 EEVEKIMDETHDAIEYQKQIDEMLAGSLTQEDEEAVLAELEAITQGEADLELPEVPGE-- 180

Query: 181 NAEKLDLPDVPTKKPV 195
                +LP+VP ++PV
Sbjct: 181 -----ELPEVPEQEPV 189

BLAST of Tan0012330 vs. ExPASy Swiss-Prot
Match: Q5ZL55 (Charged multivesicular body protein 6 OS=Gallus gallus OX=9031 GN=CHMP6 PE=2 SV=3)

HSP 1 Score: 126.7 bits (317), Expect = 3.3e-28
Identity = 78/198 (39.39%), Positives = 127/198 (64.14%), Query Frame = 0

Query: 1   MGNLFVKK--PKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRA 60
           MGNLF +K   +VT+ DKA+L LK QR KL QYQ+++   +E E+  AR+L+++ KK++A
Sbjct: 1   MGNLFGRKRRSRVTEQDKAVLQLKQQRDKLRQYQKRISLGLERERELARQLLKEGKKEKA 60

Query: 61  LLALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINI 120
           +L LKKK+ QE+LL + D  + N+E+ + DIE    +  V E LK GN  +  +   ++I
Sbjct: 61  MLLLKKKRYQEQLLDKTDNQISNLERMVQDIEFTQIEMKVIEGLKIGNECLNKMHQVMSI 120

Query: 121 DDVQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESL-EAQLTVQDLPEVPTPK 180
           ++V++++ +T +A  YQ +I+ IL   L+EEDE+ IL E  ++ + QL + D+P  P P+
Sbjct: 121 EEVERIIGETQDAVEYQRQIDEILAGSLTEEDEDAILEELNAITQEQLELPDVPSEPLPE 180

Query: 181 ENAEKLDLPDVPTKKPVA 196
           E  E   + + P  + VA
Sbjct: 181 EPPEATPVKNRPKPELVA 198

BLAST of Tan0012330 vs. ExPASy Swiss-Prot
Match: Q6GMA4 (Charged multivesicular body protein 6-A OS=Xenopus laevis OX=8355 GN=chmp6-a PE=2 SV=3)

HSP 1 Score: 124.4 bits (311), Expect = 1.6e-27
Identity = 77/198 (38.89%), Positives = 124/198 (62.63%), Query Frame = 0

Query: 1   MGNLFVKK--PKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRA 60
           MGNLF +K   +VT+ DKA+L LK QR KL QYQ+++   ++ E+  A++L+   KK++A
Sbjct: 1   MGNLFARKRRSRVTEQDKAVLQLKQQRDKLKQYQKKITLQLQRERELAKQLLHDGKKEKA 60

Query: 61  LLALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINI 120
            L LKKK+ QE+LL++ D  + N+E+ + DIE A  +  V E LK GN  +K +   ++I
Sbjct: 61  KLLLKKKRYQEQLLEKTDNQISNLEKMVEDIEFAQIEMKVIEGLKVGNECLKKMHEVMSI 120

Query: 121 DDVQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESL-EAQLTVQDLPEVPTPK 180
           ++V+++MD+T E   YQ +I+ +L   L+ EDEE IL E E++ +  L + + P  P P 
Sbjct: 121 EEVERIMDETQEGIEYQRQIDEMLSGSLTAEDEEAILEELEAITQEDLELPEAPSEPLPD 180

Query: 181 ENAEKLDLPDVPTKKPVA 196
              EK  + + P  + +A
Sbjct: 181 TIPEKQAVKNKPKPQMIA 198

BLAST of Tan0012330 vs. NCBI nr
Match: XP_038883019.1 (vacuolar protein sorting-associated protein 20 homolog 2-like [Benincasa hispida])

HSP 1 Score: 383.6 bits (984), Expect = 1.1e-102
Identity = 216/223 (96.86%), Positives = 218/223 (97.76%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQL VQDLPEVPT +ENA
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLIVQDLPEVPTSEENA 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKL LPDVP+KKPVA DAIEEDAEIVSSDA TKRRVMEEPLAA
Sbjct: 181 EKLVLPDVPSKKPVAADAIEEDAEIVSSDAPTKRRVMEEPLAA 223

BLAST of Tan0012330 vs. NCBI nr
Match: XP_022928588.1 (vacuolar protein sorting-associated protein 20 homolog 2-like [Cucurbita moschata])

HSP 1 Score: 380.9 bits (977), Expect = 7.3e-102
Identity = 213/223 (95.52%), Positives = 217/223 (97.31%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDE+NAILGEKLSEEDEEQILAEFE LEAQLTV+DLPEVPT +ENA
Sbjct: 121 VQKLMDDTAEAKAYQDEMNAILGEKLSEEDEEQILAEFEGLEAQLTVEDLPEVPTSEENA 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLP+VPTKKPVA DAIEEDA IVSSDA TKRRVMEEPL A
Sbjct: 181 EKLDLPNVPTKKPVAADAIEEDAVIVSSDASTKRRVMEEPLPA 223

BLAST of Tan0012330 vs. NCBI nr
Match: XP_022149415.1 (vacuolar protein sorting-associated protein 20 homolog 2-like isoform X1 [Momordica charantia])

HSP 1 Score: 380.9 bits (977), Expect = 7.3e-102
Identity = 213/223 (95.52%), Positives = 216/223 (96.86%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAA+ELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAAKELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEIN+DD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINLDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQ+TVQDLPEVPT  EN 
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQITVQDLPEVPTSNENV 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLPDVPTKKPVA D IEEDAEIVSS A TKRRVMEEPLAA
Sbjct: 181 EKLDLPDVPTKKPVAADPIEEDAEIVSSVAPTKRRVMEEPLAA 223

BLAST of Tan0012330 vs. NCBI nr
Match: KAG6603293.1 (Vacuolar protein sorting-associated protein 20-like 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 380.6 bits (976), Expect = 9.5e-102
Identity = 213/223 (95.52%), Positives = 216/223 (96.86%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLK GNNAIKAIQSEINIDD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKTGNNAIKAIQSEINIDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFE LEAQLTV+DLPEVPT +ENA
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFEGLEAQLTVEDLPEVPTSEENA 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLP+VPTKKPVA DAIEEDA IVSSDA TKRRVMEEPL A
Sbjct: 181 EKLDLPNVPTKKPVAADAIEEDAVIVSSDASTKRRVMEEPLPA 223

BLAST of Tan0012330 vs. NCBI nr
Match: XP_023520722.1 (vacuolar protein sorting-associated protein 20 homolog 2-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 379.0 bits (972), Expect = 2.8e-101
Identity = 215/224 (95.98%), Positives = 218/224 (97.32%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTV+DLPEVPT +ENA
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVEDLPEVPTSEENA 180

Query: 181 EKLDLPDVPTKKPVAT-DAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLP+VPTKKPVA  DAIEEDA IVSSDA TKRRVMEEPL A
Sbjct: 181 EKLDLPNVPTKKPVAADDAIEEDAVIVSSDASTKRRVMEEPLPA 224

BLAST of Tan0012330 vs. ExPASy TrEMBL
Match: A0A6J1D5M8 (vacuolar protein sorting-associated protein 20 homolog 2-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017847 PE=4 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 3.5e-102
Identity = 213/223 (95.52%), Positives = 216/223 (96.86%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAA+ELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAAKELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEIN+DD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINLDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQ+TVQDLPEVPT  EN 
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQITVQDLPEVPTSNENV 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLPDVPTKKPVA D IEEDAEIVSS A TKRRVMEEPLAA
Sbjct: 181 EKLDLPDVPTKKPVAADPIEEDAEIVSSVAPTKRRVMEEPLAA 223

BLAST of Tan0012330 vs. ExPASy TrEMBL
Match: A0A6J1EPH4 (vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucurbita moschata OX=3662 GN=LOC111435445 PE=4 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 3.5e-102
Identity = 213/223 (95.52%), Positives = 217/223 (97.31%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDE+NAILGEKLSEEDEEQILAEFE LEAQLTV+DLPEVPT +ENA
Sbjct: 121 VQKLMDDTAEAKAYQDEMNAILGEKLSEEDEEQILAEFEGLEAQLTVEDLPEVPTSEENA 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLP+VPTKKPVA DAIEEDA IVSSDA TKRRVMEEPL A
Sbjct: 181 EKLDLPNVPTKKPVAADAIEEDAVIVSSDASTKRRVMEEPLPA 223

BLAST of Tan0012330 vs. ExPASy TrEMBL
Match: A0A1S3B5R7 (vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucumis melo OX=3656 GN=LOC103486109 PE=4 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 8.7e-101
Identity = 211/223 (94.62%), Positives = 215/223 (96.41%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLK VDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD
Sbjct: 61  ALKKKKAQEELLKHVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTV+DLPEVPT +E  
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVEDLPEVPTSEEKE 180

Query: 181 EKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           EKLDLP+VP+KKPVA DAIEEDAE VSSDA TKRRVME PLAA
Sbjct: 181 EKLDLPEVPSKKPVAADAIEEDAETVSSDASTKRRVMEGPLAA 223

BLAST of Tan0012330 vs. ExPASy TrEMBL
Match: A0A6J1EBQ2 (vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucurbita moschata OX=3662 GN=LOC111432566 PE=4 SV=1)

HSP 1 Score: 374.0 bits (959), Expect = 4.3e-100
Identity = 211/224 (94.20%), Positives = 216/224 (96.43%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINI+D
Sbjct: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIED 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTV+DLPE+PT KEN 
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVEDLPEIPTSKENG 180

Query: 181 EKLDLPDVPTKKPVAT-DAIEEDAEIVSSDALTKRRVMEEPLAA 224
           E LDLPDVPTKKPVA+  A+EEDAEIV SDA TKRR MEEPLAA
Sbjct: 181 EGLDLPDVPTKKPVASAAAVEEDAEIVPSDAPTKRRAMEEPLAA 224

BLAST of Tan0012330 vs. ExPASy TrEMBL
Match: A0A6J1HS74 (vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucurbita maxima OX=3661 GN=LOC111466101 PE=4 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 2.8e-99
Identity = 209/224 (93.30%), Positives = 214/224 (95.54%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL
Sbjct: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKKKAQEELLKQVD WLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINI+D
Sbjct: 61  ALKKKKAQEELLKQVDVWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIED 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPKENA 180
           VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTV+DLPE+P  KEN 
Sbjct: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVEDLPEIPMAKENG 180

Query: 181 EKLDLPDVPTKKPV-ATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           E+LDLPDVPTKKPV A  A+EEDAEIV SDA TKRR MEEPLAA
Sbjct: 181 ERLDLPDVPTKKPVAAAAAVEEDAEIVPSDAPTKRRAMEEPLAA 224

BLAST of Tan0012330 vs. TAIR 10
Match: AT5G09260.1 (vacuolar protein sorting-associated protein 20.2)

HSP 1 Score: 303.5 bits (776), Expect = 1.4e-82
Identity = 169/226 (74.78%), Positives = 195/226 (86.28%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKPK+T+VD+AILSLKTQRRKLGQYQQQL+ VIEAEK AAR+L+R+K+KDRALL
Sbjct: 1   MGNLFVKKPKITEVDRAILSLKTQRRKLGQYQQQLEKVIEAEKQAARDLIREKRKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           ALKKK+ QEELLKQVD WLINVEQQL+DIEL SKQKAVFESLK GNNAIKAIQSE+N+DD
Sbjct: 61  ALKKKRTQEELLKQVDQWLINVEQQLADIELTSKQKAVFESLKQGNNAIKAIQSEVNLDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPK--- 180
           VQKLMDDTAEAKAYQDE++AILGEKLS EDEE+ILAEF++LE+ L V+D+PEVPT +   
Sbjct: 121 VQKLMDDTAEAKAYQDELSAILGEKLSAEDEEEILAEFDNLESLLIVEDMPEVPTTELMP 180

Query: 181 ENAEKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           E  EK+DLPDVPTK PVA+          +    TKR+V+EEPL A
Sbjct: 181 EEPEKMDLPDVPTKAPVAS----------NETTSTKRKVLEEPLEA 216

BLAST of Tan0012330 vs. TAIR 10
Match: AT5G63880.1 (SNF7 family protein)

HSP 1 Score: 302.4 bits (773), Expect = 3.1e-82
Identity = 166/224 (74.11%), Positives = 200/224 (89.29%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKP++T+VD+AILSLKTQRRKLGQYQQ+L+ VIEAEK AAR+L+R+K+KDRALL
Sbjct: 1   MGNLFVKKPQITEVDRAILSLKTQRRKLGQYQQKLEKVIEAEKQAARDLIREKRKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           AL+KK+ QEELLKQVD W+INVEQQL+DIEL SKQKAVFESLK GN+AIKAIQSE+++DD
Sbjct: 61  ALRKKRTQEELLKQVDQWVINVEQQLTDIELTSKQKAVFESLKQGNSAIKAIQSELDLDD 120

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKLSEEDEEQILAEFESLEAQLTVQDLPEVPTPK-EN 180
           VQKLMDDTA+AKAYQDE+NAILGEKLS EDEE ILAEF++LE+QL V ++PEVPT + E 
Sbjct: 121 VQKLMDDTADAKAYQDELNAILGEKLSAEDEEDILAEFDNLESQLIVDEMPEVPTKESEE 180

Query: 181 AEKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRRVMEEPLAA 224
           +EKLDLPDVPTK PVA+     +AEI  +++ TK +V+EEPL A
Sbjct: 181 SEKLDLPDVPTKTPVAS-----NAEITPAESATKTKVLEEPLPA 219

BLAST of Tan0012330 vs. TAIR 10
Match: AT5G63880.2 (SNF7 family protein)

HSP 1 Score: 288.9 bits (738), Expect = 3.5e-78
Identity = 166/248 (66.94%), Positives = 200/248 (80.65%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           MGNLFVKKP++T+VD+AILSLKTQRRKLGQYQQ+L+ VIEAEK AAR+L+R+K+KDRALL
Sbjct: 1   MGNLFVKKPQITEVDRAILSLKTQRRKLGQYQQKLEKVIEAEKQAARDLIREKRKDRALL 60

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
           AL+KK+ QEELLKQVD W+INVEQQL+DIEL SKQKAVFESLK GN+AIKAIQSE+++DD
Sbjct: 61  ALRKKRTQEELLKQVDQWVINVEQQLTDIELTSKQKAVFESLKQGNSAIKAIQSELDLDD 120

Query: 121 VQKLMDDTAEAKAYQD------------------------EINAILGEKLSEEDEEQILA 180
           VQKLMDDTA+AKAYQD                        E+NAILGEKLS EDEE ILA
Sbjct: 121 VQKLMDDTADAKAYQDCENLYNLFCRAFVFDDSIYISECQELNAILGEKLSAEDEEDILA 180

Query: 181 EFESLEAQLTVQDLPEVPTPK-ENAEKLDLPDVPTKKPVATDAIEEDAEIVSSDALTKRR 224
           EF++LE+QL V ++PEVPT + E +EKLDLPDVPTK PVA+     +AEI  +++ TK +
Sbjct: 181 EFDNLESQLIVDEMPEVPTKESEESEKLDLPDVPTKTPVAS-----NAEITPAESATKTK 240

BLAST of Tan0012330 vs. TAIR 10
Match: AT2G19830.1 (SNF7 family protein)

HSP 1 Score: 56.6 bits (135), Expect = 2.9e-08
Identity = 60/215 (27.91%), Positives = 99/215 (46.05%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           M  LF K  + T   + +  L      L + +  L      E   A+E  R K K  A+ 
Sbjct: 3   MNRLFGKPKQETSTLQTLDKLNETLEMLEKKENVLLKKATGEVEKAKEFSRAKNKRAAIQ 62

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
            LK+K+  E+ ++Q+  + + +  Q+  +E A       ++L+ G +A+KA+Q   NIDD
Sbjct: 63  CLKRKRLYEQQVEQLGNFQLRIHDQMIMLEGAKATTETVDALRTGASAMKAMQKATNIDD 122

Query: 121 VQKLMDD----TAEAKAYQDEINAILGEKLSEEDEEQILAEFESLE-AQLTVQDLPEVPT 180
           V K MD+    T   K  Q+ ++A  G   ++ DE+++ AE + LE A+L  Q L  VP 
Sbjct: 123 VDKTMDEINEQTENMKQIQEALSAPFG--ANDFDEDELEAELDELEGAELEEQLLQPVPI 182

Query: 181 PKENAEKLDLPDVPTKKPVATDAIEEDAEIVSSDA 211
                 K      P + P       E+ E+ +  A
Sbjct: 183 HVPQGNK------PARAPAQKQPTAEEDELAALQA 209

BLAST of Tan0012330 vs. TAIR 10
Match: AT4G29160.1 (SNF7 family protein)

HSP 1 Score: 53.9 bits (128), Expect = 1.9e-07
Identity = 55/215 (25.58%), Positives = 97/215 (45.12%), Query Frame = 0

Query: 1   MGNLFVKKPKVTDVDKAILSLKTQRRKLGQYQQQLDAVIEAEKNAARELVRQKKKDRALL 60
           M  LF K  +  +  + +  L      L + ++ L     AE   A+E  R K K  A+ 
Sbjct: 2   MNRLFGKPKQEANALQTLDKLNETLEMLEKKEKVLLKKAGAEVEKAKEYSRAKNKRAAIQ 61

Query: 61  ALKKKKAQEELLKQVDAWLINVEQQLSDIELASKQKAVFESLKAGNNAIKAIQSEINIDD 120
            LK+K+  E  ++Q+  + + +  Q+  +E A       ++L++G +A+KA+Q   NIDD
Sbjct: 62  CLKRKRLYEGQVEQLGNFQLRIHDQMIMLEGAKATTETVDALRSGASAMKAMQKATNIDD 121

Query: 121 VQKLMDDTAEAKAYQDEINAILGEKL---SEEDEEQILAEFESLEAQ----------LTV 180
           V K MD+  E      +I   L   +   ++ DE+++ AE + LE++           T 
Sbjct: 122 VDKTMDEINEQTENMKQIQEALATPMGAAADFDEDELAAELDELESEELESQLLQPATTA 181

Query: 181 QDLPEVPTPKENAEKLDLPDVPTKKPVATDAIEED 203
             LP VP P        +P   T +     A++ +
Sbjct: 182 PPLPSVPVPAGRQPARPVPQKRTAEEEELAALQAE 216

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q9FY89          1.9e-81   74.78         Vacuolar protein sorting-associated protein 20 homolog 2 OS=Arabidopsis thaliana...
Q8GXN6          4.3e-81   74.11         Vacuolar protein sorting-associated protein 20 homolog 1 OS=Arabidopsis thaliana...
Q503V0          9.1e-31   41.33         Charged multivesicular body protein 6 OS=Danio rerio OX=7955 GN=chmp6 PE=2 SV=3
Q5ZL55          3.3e-28   39.39         Charged multivesicular body protein 6 OS=Gallus gallus OX=9031 GN=CHMP6 PE=2 SV=...
Q6GMA4          1.6e-27   38.89         Charged multivesicular body protein 6-A OS=Xenopus laevis OX=8355 GN=chmp6-a PE=...

Match Name      E-value   Identity (%)  Description
XP_038883019.1  1.1e-102  96.86         vacuolar protein sorting-associated protein 20 homolog 2-like [Benincasa hispida...
XP_022928588.1  7.3e-102  95.52         vacuolar protein sorting-associated protein 20 homolog 2-like [Cucurbita moschat...
XP_022149415.1  7.3e-102  95.52         vacuolar protein sorting-associated protein 20 homolog 2-like isoform X1 [Momord...
KAG6603293.1    9.5e-102  95.52         Vacuolar protein sorting-associated protein 20-like 2, partial [Cucurbita argyro...
XP_023520722.1  2.8e-101  95.98         vacuolar protein sorting-associated protein 20 homolog 2-like isoform X2 [Cucurb...

Match Name      E-value   Identity (%)  Description
A0A6J1D5M8      3.5e-102  95.52         vacuolar protein sorting-associated protein 20 homolog 2-like isoform X1 OS=Momo...
A0A6J1EPH4      3.5e-102  95.52         vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucurbita mosch...
A0A1S3B5R7      8.7e-101  94.62         vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucumis melo OX...
A0A6J1EBQ2      4.3e-100  94.20         vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucurbita mosch...
A0A6J1HS74      2.8e-99   93.30         vacuolar protein sorting-associated protein 20 homolog 2-like OS=Cucurbita maxim...

Match Name      E-value   Identity (%)  Description
AT5G09260.1     1.4e-82   74.78         vacuolar protein sorting-associated protein 20.2
AT5G63880.1     3.1e-82   74.11         SNF7 family protein
AT5G63880.2     3.5e-78   66.94         SNF7 family protein
AT2G19830.1     2.9e-08   27.91         SNF7 family protein
AT4G29160.1     1.9e-07   25.58         SNF7 family protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source       Source Term     Source Description                                        Alignment
None       No IPR available  COILS        Coil            Coil                                                      coord: 50..70
None       No IPR available  COILS        Coil            Coil                                                      coord: 21..41
None       No IPR available  GENE3D       6.10.250.1710   -                                                         coord: 116..168; e-value: 3.3E-5; score: 25.7
None       No IPR available  GENE3D       1.10.287.1060   -                                                         coord: 14..90; e-value: 3.9E-10; score: 41.7
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                       coord: 171..197
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                       coord: 175..197
None       No IPR available  PANTHER      PTHR22761:SF50  VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN 20 HOMOLOG 1  coord: 2..223
None       No IPR available  PANTHER      PTHR22761       CHARGED MULTIVESICULAR BODY PROTEIN                       coord: 2..223
IPR005024  Snf7 family       PFAM         PF03357         Snf7                                                      coord: 16..192; e-value: 1.2E-41; score: 142.2
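
The coord fields in the InterPro rows are inclusive residue ranges on the protein, so the length of a match is end minus start plus one. The Python sketch below reads one such range and reports what fraction of the protein it covers. The helper name span_length is hypothetical, and the protein length of 223 residues is an assumption taken from the upper bound of the PANTHER range (2..223), not a value stated on this page.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment: str) -> int:
    """Return the inclusive residue count of the first 'coord: start..end' range."""
    match = COORD.search(alignment)
    if match is None:
        raise ValueError("no 'coord: start..end' range found")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("range end precedes range start")
    return end - start + 1


# Assumption: protein length read off the PANTHER range 2..223.
PROTEIN_LENGTH = 223

pfam_len = span_length("coord: 16..192")  # Pfam PF03357 (Snf7) match
print(pfam_len, round(pfam_len / PROTEIN_LENGTH, 3))
```

Under that length assumption, the Snf7 match spans 177 residues, a little under four fifths of the protein.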

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0012330.1  Tan0012330.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032511 late endosome to vacuole transport via multivesicular body sorting pathway
biological_process GO:0006900 vesicle budding from membrane
biological_process GO:0007034 vacuolar transport
cellular_component GO:0000815 ESCRT III complex
cellular_component GO:0005771 multivesicular body
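
For downstream use, the GO rows above can be grouped by category. The Python sketch below does this, assuming each row is whitespace-separated as "category accession term name" in the layout shown here. The function name group_go_terms is hypothetical.

```python
from collections import defaultdict

GO_ROWS = """\
Category Term Accession Term Name
biological_process GO:0032511 late endosome to vacuole transport via multivesicular body sorting pathway
biological_process GO:0006900 vesicle budding from membrane
biological_process GO:0007034 vacuolar transport
cellular_component GO:0000815 ESCRT III complex
cellular_component GO:0005771 multivesicular body
"""


def group_go_terms(text: str) -> dict[str, list[tuple[str, str]]]:
    """Group (accession, term name) pairs by GO category."""
    grouped: dict[str, list[tuple[str, str]]] = defaultdict(list)
    for line in text.splitlines():
        parts = line.split(maxsplit=2)
        # Skip the header row and anything without a GO accession in column two.
        if len(parts) == 3 and parts[1].startswith("GO:"):
            category, accession, name = parts
            grouped[category].append((accession, name))
    return dict(grouped)


for category, terms in group_go_terms(GO_ROWS).items():
    print(category, [accession for accession, _ in terms])
```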