Tan0001619 (gene) Snake gourd v1

Overview
NameTan0001619
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
LocationLG10: 21591935 .. 21607973 (+)
RNA-Seq ExpressionTan0001619
SyntenyTan0001619
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACGATGCGTGGTCTGGCACGCGTAAGGACTACAGGAGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAAGTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGTAAGAATAATAGTATTTTTCCCTAAAGGCTTCTTAATATATAGAGTTATTTAACCCTTTATTATGTGTTTCCTATGTAGATGTCATTTGTGGTAGACCCTCGGTCCAAGAATTGTATTCTTCAATCAGCGGCAAATAAATTTTGAACATTTCGATACACGTTGTATCAAAAGCACATACTTCCATTTAAGGATGAGCCGTTCTTGTTGAAACATCCTCCACAAAAGTATTTTCATATTGATAAAAAACAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGAGAGGTAAATATATTATACTTATTGCTTTTAAAAGTTAGTAAAAAAAGTATTAAAATGATTTAAATTGATTCTAATATGTTATACAGGCACAAAGTCGTGTGCAAAAGGAAAGAAGAGAGAAATGTAAATATAATCATCACATCTCTCGCAAAGGATATGCAAACTTAGCTGAAGAACTAGTGAGTAACATTATTTCTTATAATTTTCCATACATGAGATTTAATAACTTGTATGTTTATATAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAAGGACGAAAAGGAAAAAATAATGAATACTTCGATGAAGACACCAAACAATGTGCTGGTCAAATCGTAAGTTAATATGTTTTATTTTAACATGATGTAAAAATGATTCTTAGTTTAAATATCTTATGGTTAGTTTCAATGTCAATCTAAATGGTTGTGACAGGATGAACTAGTTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGCACGCCAGAACACAGAGGGCGTGTTAGAGGAATAGGTTTGTCTGTTAATCCATCAACATACTTTAACATTCCTTGAGGGAAATCAAAATCAAGCAAAGAGTCTGGAAACAAAATGTCGACTGATTGCTCACCTTCCAAAAAGTCTACAAGCATAGGCACTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGTAATTAGAATTTTAAAATATTAAAAGTGTGATCTATGATAATTTATTTTGACATTCTTTTTTAATTTTTGAAGGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCATAATGTACACGTCTGACGCTCAATTTCTCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATATTAGAGTGGTAGTGGACATGATCGTAGGTGAAAATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGAGAAATTTTGTGGCATGGCCTCGGGACCTTGTCATTTTTAATAAGGGGAAATAGGTATGAAATTCTTCAACTCAATGTTTAAAATTACTATATTCTTTGTTTACAAATTTAAATGTTGTTTTGTTGTGATAGGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTATGTACATGTGCCTACTTCACATTCTACAAAGTACACAAATGCTCATGTGACTATTAAGCTTCTGAATCGTTATGCAATGTTATCGATGCAAGAAGATGATACGACAAGTCAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTGAAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTAAGTATACAACATCTTTCATTTATATTGACTACGCTTTTGAATGTAGTATACAACATCTTTCGTTTTATAACTTTACTTTTGTTTAGGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAATCATTTCTAATTTTATTAAAAGTCAAGAAACACTTTGTATAAATCTGGCTAACAGGTTAAAAATGATTTATTTGTACTTGGATCAACAAGTTTTCATCCCATATAATACTGGGTAAGTTTAAATTATTTCAACCTATTTATATGTTTTTTGTTCATAAAACAATGATATTAGAATATTCATTGTTTAATAGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCGAACGCCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAACAATAAATATGTAAGTGTTTCAAACCTTTAGAATTAATTACTCAAACCTTTAGAATTAATTACTTTCTATTAATTTTTCTAATACTTCAAACTCTTTAGATCCTTGAGAATGTGGCAAACCAAGCACTCACTTCCACAATATTGCTCCGCCATCACTTGGAAACTTGTAAAGGTAAACATAATTGATATATTCGATTGTTTTGATCAACAGAACGAAATTTAACTCAAACATGCTATTTTTTAAATTATTTTGCATGTAGTGCCCCTGTCAACCGAGTTCTGTAGAGTGCGGGTACTATGTACAGAAGTATATACGAGAAATCGTATATAATTCTAGTATCCCTATAATGAACCTTGTAAGTACTCTTATTTTATCTTCTTAGTGATTCAAAATATAATTTAGTCTAATGTTATTATATATTATTTCTCTTTGTAGTTTAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTACAATGGGCGGATTTGTTGGCAGATTTGTGTAAGCCATTTGTGGATTTCTGAATTTTGTGAAAAAGTACTAGATTTGTATGGAGTAAGGTAAGTTAATATGTTGGTTAGGTGTACAATATACTTGTTTATAATGTATATAGTAGAAGTGTATATAATATTTAGTAGTTTGTGGAGAGGTGGGTGGCCTCATTCTTCTGAATCTGGGATTGGAGTTTGTTGGATGTTGTCTATGAATGAATGTGTTTTTTGATGTATATACTTGTTTCTTGGATGTTGTCTATGGAGAGGCGGGTGACCTCATTCCCTACACCGTTTTTTTTTTGGATAGAAGTATTCTGCAGTGTTTCGGGTTATGGTTAGCTGATGTTCTGTAGAGGTCATGCCGAAATTTTTACTTATTGACTTGATTTCGTGTTAGTTTTAGAGTTTATTGTTTTTGTGATATAGGTGTTAAGGTTTGTGATATAGGAGTTTATGGTTTTAGAGTTATGGTTAAAGTTTGTGATATACTAGTTTATGGTTTTTGTGATATAGGCGATAAGGTTATGTAGAGATCGTCTCATGTGTATTCTTTATAATGAAGTGTTATGATTTTCTCGTAAAAGGAATCTTGTTGGATGTTTGTGGATGAATTAGTGATTTCATGTTTTTTTTTATATATAGAAGTGTTCTGCAATGTTTCAGGTTATGGTTAGCTGATGTTCTGTAGAGGTCATGCCGAAATTTTTACTTATTAACTTGATTTCGTGTTAGTTTTAGAGTTTATTGTTTTTGTGATATAGGTGTTAAGGTTTGTGATATAGGAGTTTATGGTTTTAGAGTTATGGTTAAGATTTGTGATATACGAGTTTATGGTTTTAGAGTTATATTTAAGGTTTAATTTGTGATATATGAGTTTATAGTTTTTGTGATACAGGCGTTAAAGTTTGTGATATAGGAGTTTATGGTTTGTAATATAGTAAGTTTATGGTTTTGGAGTTTTGGATATAAAAGTTTTCTATGTGCATTGTTTTGGTTAAAAAAGAATTTTGTTGGATGTTGTTATGGATTAGTGATTTCATGTATGTTTTGGATATAAAAGTGTTTTTGTTGCATGTTTGTAAATTTGTATAGTGATTTCATGTGTGGTTTCAATATAGAAGTTAATATATTTGCAATATTTCAGGTAATTAGTTATTAAATATTGTTGGTTAGCCATCGGGCTATCAACTTTTTTTTATTAAATTATGCTAAAAACGATGACGGACATTGTTTGTCATTAAATAAAAAATTATGAACTAAAATAATGACGAACAACATATAATGACGGGCAATGGGTTTTTAATTTTTTTTTCAAATATAAATGACGCGTAATGCCCGTGATAAAAAGTGACCCATTTTTTAATGACGGACAACACTTTTAATGACGGGCATTGCCCATCATCAAAACTCGACCGTCATTGTATGTTACAATGACGGGCGGTTTTTAATGACGGGCATATCAAAACCCTTTAATGACGGGCAATGCCCGTCATTAAAACCTTTTTTTCTTCTAGTGAACTAAAAATGAACAAAAGAATGTTAATAGCACATTCATGGTTTTTTTTTTAGTAAAAGAAGTTGGAGGATTTGAATCACAAATCTCGTAATCGTTAAAACGTTTGATATGTCAATTGAGTTATGCTCTCTTTGGCATAGCGTATTCACAGTTATTGTGTAGAGATGCTTTGAGATCTACAAAACTATTCTAAATACTAATATTTATGGGTAAGATATCTCAGCTTTGTACACAAATTTATTTTTCAGGCATGGAATCATTGATTACCTCCTCCATATTAGAGCATTTTTAAGTGGAAAATATAAAAAATGCTAAAACTTTCACTTTCAGAGTATTGTATCCCACAAATATTGATTTAATTCAATTAAGTATTGCGAATAATCTCATTTTACTTTATTTAAAGACAAGAAAAGAAGAGCAAATAGAGAAAATAAACTTCTAAATCTAGAAAGCAAGATAATGAGAGACGCAAATCAATTGGTAATTCTATGGATGTTGGCATTCTTTACCTTAATTTACTTAATTAAGCTAGCAATTAGGCAAAAACAAGTGTTCAAATCATAAATCTAGAAGCCTAAGGTCATGGACTCTCCAGTCTCCTCCAAATTTAGAGTGTAATTAATTAAATCATGTGTCCGTCTCATTAATTATTTCTCACCCGACATTAACAACCTTTATTAATTATTGCTCTAGCTATCACTCTAGTAATTAATCTATAGTCATGCATCTAGTTCTAGTTTAAGCTTATCTTTGTCGAGCCTCCACTTCGTAACTTATCAATTAGACGGTGCCTAATTAATTAATCTTGAATATTAACTATATTTAATTAACCTTTTATCCTACTGTCACTCGGGTAATTAACCAATTATACATTAAACTAATTAGGAATCCTTCGTCGAGCTTAACTTAGTCTGGCCTCAATTAATTAACTAACAAATCTAAGGGAATTAAGCCTAAAATGCATAATTGAACAAGAACATTTAATCAAACCACAATTACTAACTAAATTAAGAAAAATTAAGTACAATGGTCAAACAATTACAACAACCCATAGAAATTAGGGATTTAACTCAACATATTCAGCAATTACAAATCAAAAACCCCTCATTGCTACTCCTAAAGTAACACAATTGAAAGAAAAAGGGAAAGAAAATGCAAACGGAGGACGATCTCGTGTTCTCGAGGTGATCCCGAACTCGAGCTCTAAGGCCTTCAAACTTGCTCCAAAACGCTTACAAATGCTCCCAAATACCCTCAATTGTGGTATGGAATCGAGATTAGAGCTTCAACCCTCGAAATAGCCTCCTTTTTTCATAGAGAAAACGATGCATTTATATTGTAATTCGACGCAACATTAGCGTCTCGACACTGCCCAAAACATCCCACTTTTTCTTGTGTGTGTGCCTCTAGCGTTGAGACGCTGAGTCTTAACGCTACAGATGAAAATCCTCTTCTCTACACCTTTTAAGTGTCTAGACGCTATAGGGTAGCGTCTTGACGCTGACCCTCGGGTTTGAGAGATCAAATACAATGGCTCAATGCCATTTTGAGCATTGAGACGCTATCCTCTCGACCGACAACACTTAAGAAGACTGTCTTCTTTATATTTAGCTCCATTTTGAGTTCTCTTAGGCTCCACAACGAACCCCATGCTCCAAAATTACCAATTCTAGTGTCAAATGATCGACTTCCTGAAATAAGAAAAATATTGACATAAAAACTCACAATTATCGCATAAAACTAGCTAAAATAAGACATAAATATAACACTAATTAATTAGAGCAATCAAATATCCCCACTTTAACCTTTGCTAGTCTTGAGCAAAGCTAAAACAAAATCACATATGATTGCATCATAATGAACCTCAAAGTTCAAAATAACAGGGTAATCACAATTAAAGGAAATAATTGAACACATCAAATCATAAAGCAAATTTAAATTTAAACTTTTCATTCCTTGAAAATTTGTGCATAACCTTCCTTATTTGATCCCTACTAGCCTAGAAAGTTTTTAAAATCCTCAATTCATCTTTTTCCTACACTAACCTTAAGCACCAATCGTCCAACATCCCAACTAGGATGCCCAAACACTACAGGGCTTAAAGCTTGCATTTTTCTCTAAAACTCAATGACACTAGCTTTCACATAGCTGTTATAGCGTCCCGTACTTAGATCACTCCCTCCATGAGTGGCTTGGCTAATCACAATAGAGGAAGTTTTTTTTTCTCCTGATCAATAAACACAAGAAGTTATTTTATTTTATTTTTCTTTTCTTCAATATTTTTATTTTTTATGATCAGACTGAAACGTTGGGGCTTTAAGAAAAGATTTTTTTAATTCAAAATAAAGCTCAAATCTAGTGCAGGTAGATACTGTCGATTGCAAGTAATAATTCTGGGGTATTCTAGAATATCGTACCCAAGGGAACAAATTTTAACAAAATCTATGAGGGTAACACAAATTTGTTTAGAAAATACAATTGTGGTAACCAAAGAAGTAAAAATGTTGAGAGGTTCTTTTTGCTAATGGAAATATGAGAAAAATCAATAAGTATAGAAAAAGGGAATTCAAGTTGCTCATGCCCCTTTGTAAAGAAATGTTGTAAAAAATGTATGTAACGACCCTGAACTCCGGATGCCTTTTAAGTTATACCTTTGGGTCAAATTTTAATGGAAATTTAATTATTTGGGCCTTAGGCTTCGTAATTTAAATTAAGTTTATGTAAATTGAAAATTTTTGAGTTTATTAGAGACAAATTTAGAACTTTTAAATAGGCTCAATGACATATTTGTAGTTAGTAGAAGGTTGAGGGGCATTTTCGTAATTTTAAAGAAAAGTCAATTAGGGTTTCTTAGGTTTGTAAATCACGTTTTTTGAGGCATCTTCACCATCTCCTCCTTCTTTCTCGTGAAAACGACTGCCGACCTCACACCTTCCCTCTCGACTCGCCGTCGTTCTTCGCCGCCCGCGCCTCAGACGTGTCGAAGCCACCTTCGTGTCACGCCGTCGTCCACCGCACGACAACCAACAACCTCCTCGCACAGTGGACGCTTCGCCATCGTCCTTCGTAGGATTCCTGCCGCTGCACAACTTGTCGGGCGCCTCGAGCAGTCAACGAAGCAAGCCGCCACACGTTCGTTTGCAAGTCGCTCGTCGAGTTGTGGGTTGGAGCCGCTGTTATAGCCTCGCTGAAATCCTAATCCAACACAAGTCACAGATGACGACCTTCTTCTGCAACACGCGAACCAGACTCCAAATCGCTGACTTGGTTTGTTTGGTTTGCCTTAATTTCAATCGGTTTTTGGAGTTTTAGGATCGGACCACGAGGATTCCAGGGGTACACTCCACGCCACCACCATCGCACGCGCCGCCACACGCGCCGTCATCGACCAGCCACGGGCTGCCGCTCGTCGGACTTTGTCTGATCCCACCAGGTTCATCTCCCGAGTCTGTTTTAAGCCTCCGTTAGTGGTATTTTAGCCTGATTAACGTTGAATTTCAAGAATAAGGTTATTTATGAGTATGAATCTAAATTAGGAGAATATGATTTTGGTTTTAGGGGTTTGTTCATTGGCCAAGTGGGAGGACATCAAGTAAGAACGAATCCTAGCATATTTAAGAGATAAGACTAAGGTAAGGGACCTTGTGCCGAACTTTCTAAAATGAGATAAAAAAAGGAGTTTTTGTTACTTTATTTCAAAAGTATTTTATTATTTACACTTTTAAAATCTATTATGTTCTAAAGTATTATTTAAGTCAAGTTATGACAGTTATGTCATGAATGTTAACTTATTGTTTTTTCACCACGAGACATCATCAATACACATACTTCATGCTAGTTTCTCGTCACATAAGGCTGCAAGACAACGTCCTAAATAAGGAGTAAGGTAGGGACTGCTAGCCTGAGGGAGTCCTATGACACAACTCTCTTGACTTGGTACCAAAGTCATATTTGCTAACAATCATGTGAACAATACAAGCTACATATCATTTATGATCCTCAACCACACATGCATCAAATAAAACTCAAGTCAACCCAAAGTTGGCTAACAACCAAAATTTATATTCACGTTTACTTTACTCCTTACCTCATATTTTGTGTGCGAGGAATAATCCAAATAACTACTTATATTTGCTTATCTAAAACCCATAAACATGCTATCAAGCACCTAGTTTGTTAAACTAGAGCTTAATAATACAAACAATTCAACCCCCACCTCACATAATAAGACACATAGTCCATAGATACATGATAAATACGATCAAACATGATTCAAAGGCTCATGATAACATACACATAACCATGTCAAGTATAAAGCTCATAAAACATAATAAGACACATAATCCATATATACATATAAATATGATCAAACATGATTCATAGGCTCATGATAACATACACATAATCATACCAAGTATAAAGCTTATAAAACATAATAAGACACCTAGTCTATAGATACATGATAAATATGATCAAACATGATTTATAGACTCAGGATAACATACACATAATCATGCCAAGTATAAAGCTCATAAAACATACAACACAAGTTTCATAAACACTTTAAATAGTTAAACCAAAGTTTAATGCTAGAAACAACTAAAAAAATACACAAACAAGCAATGCCGATCACCTTGCTCCTTCACTTAGGAGCTTGGTTTTAATCCAAATGACTACTTACCTTATGTTTGAAGATTCACAAAAGAGTGGGTGCTTCTTAAACAACCAAAACAGGGTCTAAACCAACTTAAGAAGTTGAAACTGAGCGGGATAAATCAACTCCGGCGAGCGGCAGCGACTTTGGTCCTGTGGCGGTCCGACGAATTTTCGGCAGCGGTCCGACGAAACTCCAACGGTGGTCCGATGAAATAAAAAGTGTGGAGGAAAGAAGAAGAAGGAGAAGAAGAGAAGAGTTTTGAGGTACACCCTTTAGGTTGCAATTACTCTAGTTTAAATAGGTTGCAAAAAGCCCTAATACTTTTTAGAAAAGGAAATTCAATTTTTAGAAAAAGAAATTCTAATCTTTTCTAAAAGGAAATCTTAATTCTTTTTTTTTTTAAACATAAATCATAATCTTTTTTAGATTTTGAAGCACCCTTTCCTAAAAATCTCTAGATTCCTAATCCCTATCCGATTATCCAAAAAAATTTAAATTTAAAAATAAATACAAATTAAAGCACCTAATTTTACGAGGCGTTACAATATACTCATCTAAGAAAAACTTTCGACCTCAAAAGTTGCTAGAACTTAACAACGGCCACGCCCTCGAGCCATAATCTAAAGTTAAACTCTAAAATACCTTAGCAAAAGAACAAAGAAGTCAGAAGCATATCATAACAAGTGTCAACACACAAGCAAAACAAATAGGCTTAACTACATTACAAAATAAAAGCAAACCTAGAGATGACGAGGTGACTTCTTGCGAGAGGCTTTAAGAACCGTTGCTTACGATGTAAGCGAGTTTGCAGAATCTAAAACCTATGCTATGGTACCAACTGTAACGACCCTAATTTTTTAAACTCAAAACTCTGAGGGGTGTTGCTAAGCAACACCGTCTGCGCCAAAACTTTGACAACATAATGGTGTTGGTAAGGTAAACCCCACATTTTTCTTGACCTGTTTAAAACATAACACACTCCAAAGGTCTATTAAAACATCTAACCTCTTCAAACAACAATCAACAATAGGCTGGATTTCAAACAGATCAGGAATTTACAACAGTTATGCAACAACATAAGTAAACTTCTAATCTATTAAGAAGGTACAACAGAAATATAAAGTACTAGAACAACTACTATGAGCTAAAAGACACCAAAGAACATCCTTATAGCCTCGCCAGGTTGCCACCATCGTCATCCATCCCACAGTTTACCTTTTATCTGCAAAAAAAAAAAAAAAACAACAACAAAGAAAGGAATGAGTATTAAAATACCCAGTAAGTAGCCCAATTACGTCCAACCCTATTTCAGCAGGAGGAAACAACAAAACTCATCTACTAAACTCACTCAAACCCCAGAACAGGTCTCACAACTTGCACTTCTCACCTGAAACTACCAAGTTTTCACTTTCAGACAACACTAAGCTAGACGTCCGACCTAACCTTACCCCAAGCCCACACAGGATTGCACAACTCCTTGTTGTACGACGATATACGTCTCCCTAACTAGCATGCTGACATAGTCTAGACGCACCCTGCAGGACGCTCGTAAGCTATGTCTCACTCCAGGTCTGTTGCCACACTCATTGTGACTCACCTTGGAGCATTCTCGTCTTTGGTCTGTTACCATGACTACTAAGGCTCACCCCAAGACATCATTGCCCCTGGTCTGTTGTCACACACATTGTGACACACCACGAGACATCATCGCCCAGGGACTATTGTCACACACACTGTGACTCACCACGAGATGACCCCGAGACTTTTTTCACACACACTGTGACTCACCACGAGACATCATCAATACACATACTCCATGAGTTTCTCGCCACACAAGGCTGCAAGACAACATCACCTAAATAAGGAGTAAGGTAGGGACTACTAGCCTGAGGGAGTCTTATGACACAACTTTCTTAACTTGGTACCAAAGTCATATTCAGAGATACTTACTAACAATCATGTGAACAATACAAACTACATATCTCAACCATACATGCATCAATTAGATCTCAAGTCAACCCAAAGTTGACTAATAACCAAGATTCATATTCACGTTTACTTTACTCCTTATCTCATATTTTGTGTGCGAGGAATAATCCAAATGGCTACTTACCTTTGCTAATCTAAAACCCATAAACATGCTATCAAACACCTAGTTTGTTAAACTAGAGCTTAACAATACAAACAATTCAACCCCCACCTCACATAATAAGACACATAGTCCATAGATACTTGATAAATACGATCAAACATGATTCATAGGCTCATGATAACATACACATAATCATGCCAAGTATAAAGCTCATAAAACATAATAAGACACATAGTCCATAGATACATATAAATATGATCAAACATGATTCATAGGCTAATGATAACATACACATAATCATACCAAGAATAAAGTTCATAAAACATAATAAGACACAATCTATAGATACATGATAAATATGATCAAACATGATTCATAGGCTCACGATAACATACACATAATCATGCCAAGTATAAAGCTCATAAAACATACTACACAAGTTTCATAAACACTTTAAATAGCTAAAACAAAGTTTAATGCTAGAAACAACTAAAAAACACACAAACAAGCAATGTCGATCACCTTCCTCCTTCACTTAGGAGCTTGGTTTTAATCCAAATGACTACTTACCTTATGTTTGAAGATTCACAAAAGAGTGGGTGCTTCATAAACAACAAAAATAGGGTCTAAAACAACTTAAGAAGATGAAACTGAGCGGGATAAATCAACTCCGACGAGTGGTGGTGGTCCGACGACAGTAGTGGCGGTCCGCCGACTTTTCGACGACTTTCGGGCAGCGATCCGGTGAAACTTCAGCGGTGGTCCGACGAGATGACAAATGTGGAGGAAAGAAGAAAAAGGAGAAGAAAAGAGTTTTGAGGTACACTCTTTAGGTTGCAATTACTCTAGTTTAAATAGGTTGCAAAAAGTCCTAATCCTTTTTAGAAAAGGAAATCCTATTTTTAGAAAAAGAAATCCTAATCTTTTTTTAAAAGAAATCTTAATCATTTTTTTTTAACATAAATCCTAATTGTTTTTAGACTTTGAAGCACCCTTTCCTAAAAATCTCTAGATTCATAATCCTTATCCGATTATCCAAAAAATTTAAATTTAAAAAATAAATACAAATTAAAGCATCTAATTTTACGGGGCGTTACAGTTACAATTTTCATATATCATTAAATTTCATTGTATATTTTAAACTTTAATATTTATTTTTATTAATGCTAGGTACTTATGCCAATTTGAATCCATCATAATAGAGAAATATGCATTCCTTCAACCATCCTTAATTTCATATGGCTTAGGACTTGAAGAGCAATGGTCTCGATTCTTATGTAATAGGCTAAGAGAAACCAAGAACAGACTATTGATATGTTCTTGTAATTCAGTGTAAGTATGTAAAACATGTACCCCCTTATTAACTTACTTGTATTTTTTGTATATTCTAACTCTATTGGTTGGTAATTGTATATTTAGAAACCATTGGTTGTTGGTTGTTATATCATTGAAAACTTCCACAATTTTGTCGATTGACTCCATATATATAGGATAGGACATCCACGATTATGTGAAAAGTATAGTTAACATGTAAGTATATTATTCTTTTACCATTGATATATTTTTTTTATTGAAAACGTTATTCACATATTTTTATTATTATTTGTACTTGTATAGGACTATTAGGATGTTCTATGCTCAAGAAACATTAGAAGTCCAATTTTTAATTAGACAGTTGTTTAGGTAATCATTTCAATCTACCCTTTTGTTTATGTATTCGAGTCTTTCTTATTACATATTTTAATTTATTTTTTTTCCTTTTTCTTTCTTTTTCTTTTTCTTTTTTCTTCTTCTTCTTCCTCCTCCCTCATTTTTTTTTCTCTTCTCCTCCTCCCTTCCGACCAGCCCGCCGCCCACGCCACGGCCGCCGCCTGCGCGCAGCGCTGCACGCACGCACCGCGCTCAATCTGCGCTCATCCGCGCCCGCCGCCGCACACCCGTCGTCCATTCGCGCGCCCAACGCGTGCTCACACTCCGCGCGCGTCGCGGCGTTCGCCTCTCGCGCTCGCCGATCGCGGCGCCGTAGCCTCATCACCGGTGAGTCTTTCCTCAAAACCATTTAGTTTTGGTTCAAGCGGTTCACTAGCTTTTAGTCCGGTTCTCTTAAAATGGTTGTTGGTTTTGAATAGATCGGGTCATGGACATCCCGGATCTATGCTCCACCCGCCGCCACTCGTCGGATTCCGCCGCCGCCGCCGCCGGCGGATAGGTATAACTGGTCGACTTCGTTTTCTCGATCCCGTTTTAAGTCTCCATGCTTTAGTTTGGTCATCCAAGTCAAATTAGTTGTTTATGGTTGATAGAAGCATGTTTTAAGTCGTTGTTTGTTGTTTAAGGGCTCGAACATGTTAAGAAAGTACGAGGAGCTCGGGATCGAAGCCTTCGTCATCCAAGTAAGTAGTCATTTGGAATAAGCCCCTTAAAGTTATTATTCGTTTTGTTGTATGAGGTATTCTATGAATTTTGTGTAATGCACGAAAATCACGATTTGATGCTTATGTATGAGTATGGGTTCGGGTTGGTGTTGCGTGTACCTCTAATGGACTAAGAAAGTAGTTTATAGGGGAATAGGGGCTTAAGGCTTGAAGTTAAGAGTGAGACTTAAAGCCTTTGATCGTTATGAGTTGAGAACCAGCGTTGTGAGTCTCGAGTAAACCCTAATCCGAGTGCGGCAAAGCGTGGTTGACCAAACCAATTAGAGGATTACTGCGTTGGGCTTAAGGTCGACGACTCGCATTTTAGTTGTTTAGACCTCAATAACCGTTGGTGGGCATTAAGGGTGAGCTCTAGGCAACGTTTGTGACCTTTCATCTCCTTGGAATTGTTTTGGGCGAGCTTGTGTTGGACTTTCGAAGGAAGGGGTTGTTTTGTGGATTCTCGAATCGTTGTTTGTAGCCGACTTGATCCGGTAAGCGAGCTCGTGAGCGAATGAGCTTGTTTGCATAGCTTGTGGATTGAACCTTGTGTGGTCGCGTGTGTTAAAGTGGTTGGTTCAGATCGTGAATGAGTTTGCTTATGTGTGTTTGAGGATCATATTCGTTATGCAGCTTGTATTGCTCGCATAGTTGTTAGAATGTATGTTCGAGTGTGGTTTTAGTATCAGTCAAGAGAGTTGTGTCAGGTAACCCCTTAGTCTTGAAGTCCCTACCTTACTCCTTGTGTAGGTGATGTCATCCTACGGCCTTGAGTGGCGAGGAACTAGCATGGATTACGTGTATGATGATGCCTCGTGGTGAGTCACAGTGTGTGTGGCAATAGACAGGGGCAGTGATGTCTTGTGGTGAGCCATAGTAGTCTTGGCAACAAACCAAAGACGAGGATGCTTCGAGGTGAGTCACTATGAGAGTGGCAACAGACTCGGAGTGGGACATAGCTTACGAGCATCCTGCGGGGTGGGTCTGGGCTATGTCAGAATGCTAGTCAGGGAAATGTTTACCAACGTGAAGTAAGGAGCTGAGTAAGCCTGAGTGAGCTTGGGGTGGGCTAAGCCAGACTTCTAACTGAGTGTGGTCTGACAGTGGATTTGGGTAGTTGAGGTGTTGGCTAAAAGTGAGAACCTAGCCGAATTTGAGACCCTTTGAGATTGGGTCGAGGATATTAGAAATACCTACATAGAACCATAAGTAACCTTATAAAGCGCTGAACCCAACACTTACTAAACCTGAGTTACGCCTGGGCGAGTCGGAACCTCTAGGTTGACGATATATATAGGCTCATGAGATTGGGTTGCGCCCCCATTGAGATCAGGAGTGGTTGCTTTCCATTATCCCGTACTAGCCTAGGGGCGATTCCCCAATGGGTGTGTTGGTTACGTACCGCATTGCACCATGCTAGTGTATTATGAGGGCCTGGGGTGCATCCCCACCAGGTAGTGGTTGCTTTCCATGATCCTGGGCTGGCTTAGGGGTGATTCCCCATAGAGCGTGTAGGATACGTACCTGCTCTGCTCTATGCCAGCGTGTGGCGTGAGAGTTGCTTTCTCTCATTGCATCGTCAATCAAAGTTCCAAGACCGAAGGTAGAGCTCGAGTTATTACCATAGGAGGGGGCTCAGGGCGAAGTGTGCTAAATGTGAGACTAGTTTGGGGTTGGGTGAGTGAGAAGACGAGTTTTGATTGGTTGTTTGTTTGCTTGGTTGGACGTGAGTGGGCTACTTACTGAGTATTTTTATACTCACCCGTCTACTTTGTTGTTTTTGGTTGCAGGTAAGAAAGCTTGAGGCGGAGGACGAGGACGACGACCTGTTGAGGCTGTGGGACCCTGTTTCCTTCTTGTTTCCTTTGCGCTTAGAGTTTTCATTTGCCATTGTCCTGTTTTAGTTTCTATTTTAGATGTCTAGTCGCGCGACACTTTGTTTTTTACTTGTATTCCTTGGTTACTGAGTTTCCTTTCCGCTCGTGTATGTTTTTATCTAAGAGATCGTGCTTTGGAACTCTGATTGTAGCCCAACCCATTTTATGCGTTGAATTTTGGTGGGGGTTGGGCATTGTTGTTTCTTTGAAGTGTTTCTTCTGCTTATTAGGTTCCAGGAAAAATTTTGGTTTTCTCCTTACTACGTCGTTATGCTGCCAAAATTTCAGCGTGAACGGTGTAGCGGAGTAACTTCCCTAACGTAGATTCATCTCTGAACTTTAGGGTCGTTACAGTAAAGTTCATTTGAGATATAGTAACCCAAAGAAGTTGAGTGATTACAGACGTGGTAATAATACTTTGATCTTATTTTATGTTTTTGAAATTTCTTAATTTATTCAATTTTTTAATCTATTATGTACACTTATTTGTTTGCAATTGACTAGAAAACATAAGTACAACCAAGATGAGTTGGATGAGATTAGAGTTGAGCTATGTGAATTTGTATCTCAGTACTTATGA

mRNA sequence

ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACGATGCGTGGTCTGGCACGCGTAAGGACTACAGGAGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAAGTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAAGGACGAAAAGGAAAAAATAATGAATACTTCGATGAAGACACCAAACAATGTGCTGGTCAAATCTCTGGAAACAAAATGTCGACTGATTGCTCACCTTCCAAAAAGTCTACAAGCATAGGCACTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCATAATGTACACGTCTGACGCTCAATTTCTCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATATTAGAGTGGTAGTGGACATGATCGTAGGTGAAAATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGAGAAATTTTGTGGCATGGCCTCGGGACCTTGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTATTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTGAAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAATCATTTCTAATTTTATTAAAAGTCAAGAAACACTTTGTATAAATCTGGCTAACAGGTTAAAAATGATTTATTTGTACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCGAACGCCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAACAATAAATATAAAACATAAGTACAACCAAGATGAGTTGGATGAGATTAGAGTTGAGCTATGTGAATTTGTATCTCAGTACTTATGA

Coding sequence (CDS)

ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACGATGCGTGGTCTGGCACGCGTAAGGACTACAGGAGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAAGTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAAGGACGAAAAGGAAAAAATAATGAATACTTCGATGAAGACACCAAACAATGTGCTGGTCAAATCTCTGGAAACAAAATGTCGACTGATTGCTCACCTTCCAAAAAGTCTACAAGCATAGGCACTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCATAATGTACACGTCTGACGCTCAATTTCTCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATATTAGAGTGGTAGTGGACATGATCGTAGGTGAAAATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGAGAAATTTTGTGGCATGGCCTCGGGACCTTGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTATTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTGAAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAATCATTTCTAATTTTATTAAAAGTCAAGAAACACTTTGTATAAATCTGGCTAACAGGTTAAAAATGATTTATTTGTACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATATCATTGGATGTTGATCGTTATCCATCCGCGGGCGAACGCCGTTTATGTCTTAAACTCGTTGAGGAGTAAGATCGAAGAAAGTTTTCAAGGAACAATAAATATAAAACATAAGTACAACCAAGATGAGTTGGATGAGATTAGAGTTGAGCTATGTGAATTTGTATCTCAGTACTTATGA

Protein sequence

MSGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQISGNKMSTDCSPSKKSTSIGTNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLVASPAKHKSDVYLIYLHRDDILHYCGMVKIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQIIISNFIKSQETLCINLANRLKMIYLYLDQQVFIPYNTGYHWMLIVIHPRANAVYVLNSLRSKIEESFQGTINIKHKYNQDELDEIRVELCEFVSQYL
Homology
BLAST of Tan0001619 vs. NCBI nr
Match: XP_022136077.1 (uncharacterized protein LOC111007859 isoform X2 [Momordica charantia])

HSP 1 Score: 322.8 bits (826), Expect = 4.3e-84
Identity = 217/584 (37.16%), Postives = 287/584 (49.14%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS
Sbjct: 5   SSSSQDERSVVIHTEVKKVARRGPTTMHELTCIRNLGKRKTIEYNDQGQPIGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE                           
Sbjct: 65  FIGVCVRQKIPVTYNHWKEVPQELKDKIFNCVESFVLDWRSKHHILQSASKKFRTFKSTL 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 TRTYILPFKDEPICLQNPPEKYPHIDQEQWNSFVNARLSEEWETLSRAHKEIRAKCLYNH 184

Query: 182 --------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGN 241
                         +L+ DPS RA LW + RKGKNNEYFD+ T++CA +I        G 
Sbjct: 185 HISRKGYANLAQELDLSSDPSNRAILWKEARKGKNNEYFDDATRECAARIDELAAIHKGE 244

Query: 242 KMSTDC-----------------SPS-----------------KKSTSIGTN----HPKD 301
            + T+                  SPS                  KST+ G+N      K 
Sbjct: 245 DILTEALGTSEHSGRVRGVGEFVSPSLYFNVVKGKSKTQELQPNKSTTEGSNPSKKKSKG 304

Query: 302 KEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIR 361
           KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+R
Sbjct: 305 KEIVNVHEEIYVTDEQKMEGKPCHLAVESVDNIVAVGTIFDNNVQCPTVHGVPLGVDNVR 364

Query: 362 VVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-----------------ASPA 383
           V+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV                    +
Sbjct: 365 VMVDIVIDEYATIPIPVRGEIETLNQTIGGFVAWPRRLVILSEEKNISSSRTSQTRTQLS 424

BLAST of Tan0001619 vs. NCBI nr
Match: XP_022136080.1 (uncharacterized protein LOC111007859 isoform X4 [Momordica charantia])

HSP 1 Score: 322.4 bits (825), Expect = 5.6e-84
Identity = 217/585 (37.09%), Postives = 287/585 (49.06%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS
Sbjct: 5   SSSSQDERSVVIHTEVKKVARRGPTTMHELTCIRNLGKRKTIEYNDQGQPIGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE                           
Sbjct: 65  FIGVCVRQKIPVTYNHWKEVPQELKDKIFNCVEKSFVLDWRSKHHILQSASKKFRTFKST 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 LTRTYILPFKDEPICLQNPPEKYPHIDQEQWNSFVNARLSEEWETLSRAHKEIRAKCLYN 184

Query: 182 ---------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SG 241
                          +L+ DPS RA LW + RKGKNNEYFD+ T++CA +I        G
Sbjct: 185 HHISRKGYANLAQELDLSSDPSNRAILWKEARKGKNNEYFDDATRECAARIDELAAIHKG 244

Query: 242 NKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK 301
             + T+                  SPS                  KST+ G+N      K
Sbjct: 245 EDILTEALGTSEHSGRVRGVGEFVSPSLYFNVVKGKSKTQELQPNKSTTEGSNPSKKKSK 304

Query: 302 DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENI 361
            KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+
Sbjct: 305 GKEIVNVHEEIYVTDEQKMEGKPCHLAVESVDNIVAVGTIFDNNVQCPTVHGVPLGVDNV 364

Query: 362 RVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-----------------ASP 383
           RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV                    
Sbjct: 365 RVMVDIVIDEYATIPIPVRGEIETLNQTIGGFVAWPRRLVILSEEKNISSSRTSQTRTQL 424

BLAST of Tan0001619 vs. NCBI nr
Match: XP_022136076.1 (uncharacterized protein LOC111007859 isoform X1 [Momordica charantia])

HSP 1 Score: 322.4 bits (825), Expect = 5.6e-84
Identity = 217/585 (37.09%), Postives = 287/585 (49.06%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS
Sbjct: 5   SSSSQDERSVVIHTEVKKVARRGPTTMHELTCIRNLGKRKTIEYNDQGQPIGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE                           
Sbjct: 65  FIGVCVRQKIPVTYNHWKEVPQELKDKIFNCVEKSFVLDWRSKHHILQSASKKFRTFKST 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 LTRTYILPFKDEPICLQNPPEKYPHIDQEQWNSFVNARLSEEWETLSRAHKEIRAKCLYN 184

Query: 182 ---------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SG 241
                          +L+ DPS RA LW + RKGKNNEYFD+ T++CA +I        G
Sbjct: 185 HHISRKGYANLAQELDLSSDPSNRAILWKEARKGKNNEYFDDATRECAARIDELAAIHKG 244

Query: 242 NKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK 301
             + T+                  SPS                  KST+ G+N      K
Sbjct: 245 EDILTEALGTSEHSGRVRGVGEFVSPSLYFNVVKGKSKTQELQPNKSTTEGSNPSKKKSK 304

Query: 302 DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENI 361
            KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+
Sbjct: 305 GKEIVNVHEEIYVTDEQKMEGKPCHLAVESVDNIVAVGTIFDNNVQCPTVHGVPLGVDNV 364

Query: 362 RVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-----------------ASP 383
           RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV                    
Sbjct: 365 RVMVDIVIDEYATIPIPVRGEIETLNQTIGGFVAWPRRLVILSEEKNISSSRTSQTRTQL 424

BLAST of Tan0001619 vs. NCBI nr
Match: XP_038895930.1 (uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida])

HSP 1 Score: 313.2 bits (801), Expect = 3.4e-81
Identity = 220/630 (34.92%), Postives = 285/630 (45.24%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS
Sbjct: 5   SSSSQDEGNVLIRYEVKKTARRGPTIMPELIHIRNSGERKTIEYNDLGQPVGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQQIPL+YK+WK VPQELKD IFD ++                           
Sbjct: 65  FIGVCVRQQIPLTYKSWKAVPQELKDTIFDCIQMSFVVDLGSKHYILQSASKKFRTFKST 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 LTQQYILPYKDEPSRLQNPPEKYSHIDKKQWESFVKARLSEEWETFSSAQRERRAKCIYN 184

Query: 182 ---------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI--------- 241
                          EL+ DP  RATLW + RK KNNEY D  T++CA +I         
Sbjct: 185 HHISRKGYANLAQELELSSDPCNRATLWKEARKRKNNEYSDIATRECAKRIDELAAIRKG 244

Query: 242 ------------------------------------------SGNKMSTDCSPSKKSTSI 301
                                                     S N+  T  S  K  T  
Sbjct: 245 QDILTEALGTPEHRGRIRGVGEFVSPALHYNVAKGKLKLIQESQNEAETQQSQDKDETQQ 304

Query: 302 GTNH--------------------------------PKDKEVIDEVEEILEGTPCHLAIG 361
             +                                 PK K V+ + EEILEG PCHLAIG
Sbjct: 305 SQDKDETRQSRSSVVEKKTKRKRVQKGRNVQKRKKVPKGKMVVKDPEEILEGIPCHLAIG 364

Query: 362 SKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSM 396
           S DN+VAVG M+ SDAQ  +++ +PLG +N+R +VD+++GE+  LPIP + ++++L Q++
Sbjct: 365 SVDNIVAVGTMFESDAQCPSINEIPLGPDNVRAMVDIVMGEDVALPIPQKDKIKTLDQAI 424

BLAST of Tan0001619 vs. NCBI nr
Match: XP_038895921.1 (uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] >XP_038895924.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] >XP_038895927.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida])

HSP 1 Score: 308.5 bits (789), Expect = 8.3e-80
Identity = 220/631 (34.87%), Postives = 285/631 (45.17%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS
Sbjct: 5   SSSSQDEGNVLIRYEVKKTARRGPTIMPELIHIRNSGERKTIEYNDLGQPVGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQQIPL+YK+WK VPQELKD IFD ++                           
Sbjct: 65  FIGVCVRQQIPLTYKSWKAVPQELKDTIFDCIQMSFVVDLGSKHYILQSASKKFRTFKST 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 LTQQYILPYKDEPSRLQNPPEKYSHIDKKQWESFVKARLSEEWETFSSAQRERRAKCIYN 184

Query: 182 ---------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI--------- 241
                          EL+ DP  RATLW + RK KNNEY D  T++CA +I         
Sbjct: 185 HHISRKGYANLAQELELSSDPCNRATLWKEARKRKNNEYSDIATRECAKRIDELAAIRKG 244

Query: 242 ------------------------------------------SGNKMSTDCSPSKKSTSI 301
                                                     S N+  T  S  K  T  
Sbjct: 245 QDILTEALGTPEHRGRIRGVGEFVSPALHYNVAKGKLKLIQESQNEAETQQSQDKDETQQ 304

Query: 302 GTNH--------------------------------PKDKEVIDEVEEILEGTPCHLAIG 361
             +                                 PK K V+ + EEILEG PCHLAIG
Sbjct: 305 SQDKDETRQSRSSVVEKKTKRKRVQKGRNVQKRKKVPKGKMVVKDPEEILEGIPCHLAIG 364

Query: 362 SKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPIPIRGEVESLSQSM 396
           S DN+VAVG M+ SDAQ  +++ +PLG +N+R +VD+++GE+  LPIP + ++++L Q++
Sbjct: 365 SVDNIVAVGTMFESDAQCPSINEIPLGPDNVRAMVDIVMGEDVALPIPQKDKIKTLDQAI 424

BLAST of Tan0001619 vs. ExPASy TrEMBL
Match: A0A6J1C4J7 (uncharacterized protein LOC111007859 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111007859 PE=3 SV=1)

HSP 1 Score: 322.8 bits (826), Expect = 2.1e-84
Identity = 217/584 (37.16%), Postives = 287/584 (49.14%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS
Sbjct: 5   SSSSQDERSVVIHTEVKKVARRGPTTMHELTCIRNLGKRKTIEYNDQGQPIGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE                           
Sbjct: 65  FIGVCVRQKIPVTYNHWKEVPQELKDKIFNCVESFVLDWRSKHHILQSASKKFRTFKSTL 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 TRTYILPFKDEPICLQNPPEKYPHIDQEQWNSFVNARLSEEWETLSRAHKEIRAKCLYNH 184

Query: 182 --------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SGN 241
                         +L+ DPS RA LW + RKGKNNEYFD+ T++CA +I        G 
Sbjct: 185 HISRKGYANLAQELDLSSDPSNRAILWKEARKGKNNEYFDDATRECAARIDELAAIHKGE 244

Query: 242 KMSTDC-----------------SPS-----------------KKSTSIGTN----HPKD 301
            + T+                  SPS                  KST+ G+N      K 
Sbjct: 245 DILTEALGTSEHSGRVRGVGEFVSPSLYFNVVKGKSKTQELQPNKSTTEGSNPSKKKSKG 304

Query: 302 KEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIR 361
           KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+R
Sbjct: 305 KEIVNVHEEIYVTDEQKMEGKPCHLAVESVDNIVAVGTIFDNNVQCPTVHGVPLGVDNVR 364

Query: 362 VVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-----------------ASPA 383
           V+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV                    +
Sbjct: 365 VMVDIVIDEYATIPIPVRGEIETLNQTIGGFVAWPRRLVILSEEKNISSSRTSQTRTQLS 424

BLAST of Tan0001619 vs. ExPASy TrEMBL
Match: A0A6J1C2V2 (uncharacterized protein LOC111007859 isoform X4 OS=Momordica charantia OX=3673 GN=LOC111007859 PE=3 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 2.7e-84
Identity = 217/585 (37.09%), Postives = 287/585 (49.06%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS
Sbjct: 5   SSSSQDERSVVIHTEVKKVARRGPTTMHELTCIRNLGKRKTIEYNDQGQPIGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE                           
Sbjct: 65  FIGVCVRQKIPVTYNHWKEVPQELKDKIFNCVEKSFVLDWRSKHHILQSASKKFRTFKST 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 LTRTYILPFKDEPICLQNPPEKYPHIDQEQWNSFVNARLSEEWETLSRAHKEIRAKCLYN 184

Query: 182 ---------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SG 241
                          +L+ DPS RA LW + RKGKNNEYFD+ T++CA +I        G
Sbjct: 185 HHISRKGYANLAQELDLSSDPSNRAILWKEARKGKNNEYFDDATRECAARIDELAAIHKG 244

Query: 242 NKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK 301
             + T+                  SPS                  KST+ G+N      K
Sbjct: 245 EDILTEALGTSEHSGRVRGVGEFVSPSLYFNVVKGKSKTQELQPNKSTTEGSNPSKKKSK 304

Query: 302 DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENI 361
            KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+
Sbjct: 305 GKEIVNVHEEIYVTDEQKMEGKPCHLAVESVDNIVAVGTIFDNNVQCPTVHGVPLGVDNV 364

Query: 362 RVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-----------------ASP 383
           RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV                    
Sbjct: 365 RVMVDIVIDEYATIPIPVRGEIETLNQTIGGFVAWPRRLVILSEEKNISSSRTSQTRTQL 424

BLAST of Tan0001619 vs. ExPASy TrEMBL
Match: A0A6J1C2H7 (uncharacterized protein LOC111007859 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007859 PE=3 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 2.7e-84
Identity = 217/585 (37.09%), Postives = 287/585 (49.06%), Query Frame = 0

Query: 2   SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQS 61
           S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS
Sbjct: 5   SSSSQDERSVVIHTEVKKVARRGPTTMHELTCIRNLGKRKTIEYNDQGQPIGENAKKMQS 64

Query: 62  YIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVE--------------------------- 121
           +IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE                           
Sbjct: 65  FIGVCVRQKIPVTYNHWKEVPQELKDKIFNCVEKSFVLDWRSKHHILQSASKKFRTFKST 124

Query: 122 ------------------------------------------------------------ 181
                                                                       
Sbjct: 125 LTRTYILPFKDEPICLQNPPEKYPHIDQEQWNSFVNARLSEEWETLSRAHKEIRAKCLYN 184

Query: 182 ---------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCAGQI-------SG 241
                          +L+ DPS RA LW + RKGKNNEYFD+ T++CA +I        G
Sbjct: 185 HHISRKGYANLAQELDLSSDPSNRAILWKEARKGKNNEYFDDATRECAARIDELAAIHKG 244

Query: 242 NKMSTDC-----------------SPS-----------------KKSTSIGTN----HPK 301
             + T+                  SPS                  KST+ G+N      K
Sbjct: 245 EDILTEALGTSEHSGRVRGVGEFVSPSLYFNVVKGKSKTQELQPNKSTTEGSNPSKKKSK 304

Query: 302 DKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENI 361
            KE+++  EEI       +EG PCHLA+ S DN+VAVG ++ ++ Q  TVHGVPLGV+N+
Sbjct: 305 GKEIVNVHEEIYVTDEQKMEGKPCHLAVESVDNIVAVGTIFDNNVQCPTVHGVPLGVDNV 364

Query: 362 RVVVDMIVGENAPLPIPIRGEVESLSQSMRNFVAWPRDLV-----------------ASP 383
           RV+VD+++ E A +PIP+RGE+E+L+Q++  FVAWPR LV                    
Sbjct: 365 RVMVDIVIDEYATIPIPVRGEIETLNQTIGGFVAWPRRLVILSEEKNISSSRTSQTRTQL 424

BLAST of Tan0001619 vs. ExPASy TrEMBL
Match: A0A1S3BRX5 (uncharacterized protein LOC103493028 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493028 PE=3 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 3.2e-77
Identity = 221/634 (34.86%), Postives = 278/634 (43.85%), Query Frame = 0

Query: 4   SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYI 63
           SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+I
Sbjct: 7   SSQDEGNVLIRYEVKRTARRGPTIMPELLHIRNSGERKTIEYNDRGQPVGENAKKMQSFI 66

Query: 64  GVCVRQQIPLSYKTWKEVPQELKDKIFDSVE----------------------------- 123
           GVCVRQQIP++Y +WKEVPQELKD IFD ++                             
Sbjct: 67  GVCVRQQIPVTYNSWKEVPQELKDTIFDCIQMSFVVDLSSKHYILQSASKKFRSFKSTLT 126

Query: 124 ------------------------------------------------------------ 183
                                                                       
Sbjct: 127 QMYILPYKDEPSRLQYPPEKYSHIDKKQWESFVKARLSEEWEVFSCAQRERRAKCIYNHH 186

Query: 184 -------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCA-------------- 243
                        EL+ DP  RATLW + RK KNN  FD+ T++C               
Sbjct: 187 ISRKGYANLAQELELSSDPCNRATLWKEARKRKNNGCFDDATRECVKRIDELAAIRKGQD 246

Query: 244 ------------GQISG------------------------------------------- 303
                       G+I G                                           
Sbjct: 247 ILTEALGTPEHRGRIRGVGEFVSPALHVNVARGNLKLSQQSQDKDETQQSQDENETQQSK 306

Query: 304 ----NKMSTDCSPSKKSTSI-------------GTNHPKDKEVIDEVEEILE-------- 363
                + S D + +++S S              G   PK K V+ E EE LE        
Sbjct: 307 AENETQQSNDENETQQSRSSVSRKKTKGKKVQKGKKGPKGKMVVKESEETLEVQVLQEPE 366

Query: 364 ----GTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPI 388
               G PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG ENIRV VD+ + E+  LPI
Sbjct: 367 NISKGIPCHLAIGSLDNVVAVGKMFESDVQCPTIHGIPLGAENIRVTVDIAMVEDVALPI 426

BLAST of Tan0001619 vs. ExPASy TrEMBL
Match: A0A5D3CYL9 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G002480 PE=3 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 3.2e-77
Identity = 221/634 (34.86%), Postives = 278/634 (43.85%), Query Frame = 0

Query: 4   SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYI 63
           SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+I
Sbjct: 7   SSQDEGNVLIRYEVKRTARRGPTIMPELLHIRNSGERKTIEYNDRGQPVGENAKKMQSFI 66

Query: 64  GVCVRQQIPLSYKTWKEVPQELKDKIFDSVE----------------------------- 123
           GVCVRQQIP++Y +WKEVPQELKD IFD ++                             
Sbjct: 67  GVCVRQQIPVTYNSWKEVPQELKDTIFDCIQMSFVVDLSSKHYILQSASKKFRSFKSTLT 126

Query: 124 ------------------------------------------------------------ 183
                                                                       
Sbjct: 127 QMYILPYKDEPSRLQYPPEKYSHIDKKQWESFVKARLSEEWEVFSCAQRERRAKCIYNHH 186

Query: 184 -------------ELAEDPSTRATLWIQGRKGKNNEYFDEDTKQCA-------------- 243
                        EL+ DP  RATLW + RK KNN  FD+ T++C               
Sbjct: 187 ISRKGYANLAQELELSSDPCNRATLWKEARKRKNNGCFDDATRECVKRIDELAAIRKGQD 246

Query: 244 ------------GQISG------------------------------------------- 303
                       G+I G                                           
Sbjct: 247 ILTEALGTPEHRGRIRGVGEFVSPALHVNVARGNLKLSQQSQDKDETQQSQDENETQQSK 306

Query: 304 ----NKMSTDCSPSKKSTSI-------------GTNHPKDKEVIDEVEEILE-------- 363
                + S D + +++S S              G   PK K V+ E EE LE        
Sbjct: 307 AENETQQSNDENETQQSRSSVSRKKTKGKKVQKGKKGPKGKMVVKESEETLEVQVLQEPE 366

Query: 364 ----GTPCHLAIGSKDNVVAVGIMYTSDAQFLTVHGVPLGVENIRVVVDMIVGENAPLPI 388
               G PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG ENIRV VD+ + E+  LPI
Sbjct: 367 NISKGIPCHLAIGSLDNVVAVGKMFESDVQCPTIHGIPLGAENIRVTVDIAMVEDVALPI 426

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022136077.14.3e-8437.16uncharacterized protein LOC111007859 isoform X2 [Momordica charantia][more]
XP_022136080.15.6e-8437.09uncharacterized protein LOC111007859 isoform X4 [Momordica charantia][more]
XP_022136076.15.6e-8437.09uncharacterized protein LOC111007859 isoform X1 [Momordica charantia][more]
XP_038895930.13.4e-8134.92uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida][more]
XP_038895921.18.3e-8034.87uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] >XP_03889592... [more]
Match NameE-valueIdentityDescription
A0A6J1C4J72.1e-8437.16uncharacterized protein LOC111007859 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A6J1C2V22.7e-8437.09uncharacterized protein LOC111007859 isoform X4 OS=Momordica charantia OX=3673 G... [more]
A0A6J1C2H72.7e-8437.09uncharacterized protein LOC111007859 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A1S3BRX53.2e-7734.86uncharacterized protein LOC103493028 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5D3CYL93.2e-7734.86ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003653Ulp1 protease family, C-terminal catalytic domainPFAMPF02902Peptidase_C48coord: 337..370
e-value: 1.2E-4
score: 22.0
NoneNo IPR availableGENE3D3.40.395.10Adenoviral Proteinase; Chain Acoord: 241..401
e-value: 4.7E-9
score: 37.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 132..148
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 132..153
NoneNo IPR availablePANTHERPTHR33018OS10G0338966 PROTEIN-RELATEDcoord: 21..107
coord: 134..248
NoneNo IPR availablePANTHERPTHR33018:SF19MYB/SANT-LIKE DNA-BINDING DOMAIN PROTEINcoord: 21..107
coord: 134..248
IPR004264Transposase, Tnp1/En/Spm-likePFAMPF03017Transposase_23coord: 174..224
e-value: 3.3E-4
score: 20.6
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 252..372

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0001619.1Tan0001619.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity