Tan0014729 (gene) Snake gourd v1

Overview
NameTan0014729
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionkunitz trypsin inhibitor 2
LocationLG05: 71982101 .. 72001926 (+)
RNA-Seq ExpressionTan0014729
SyntenyTan0014729
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGAACTTCGCAATACTGTCTTTTCTTTTCTTTGTCCTTGCCTCTACTGAGGTACACTTCTGCAGAGCCGACGCCTCGCCGGACGCCGTCGTCGACATCGACGGAAAGAAGCTCCGCGCCGGCGACAACTACTACATCCTCCCTGTTTTCCGGCGAAACATCGGCGGAGTAGCCATCGGCGGTATCCCTGGATACAACAATCAATGTCCGATCAACGTCGTCCCGGAAACTTACGAAGCATCTAACGGTGCTCCAACGATATTTACGCCCATAAACCCTAAGAAAGGCGTGGTTCGAGTTTCCACCGATTTGAACATCCAATTCGAGGCGAACACGAAATGCGCCAAATCGACGGTGTGGAAAATAGGTAAATTTGATGAACATATGAGGCAATATTTCGTGACAATCGGCGGAACGAAAGGAAATCCAGGGCGCGAGACGTTGGAGAGTTGGTTCAAAATTGAGAAGCATGGCAATAATAATTATTACAAGTTTGTGTATTGTCCAACTGTGTGTAAGTACTGCAAAGTTATATGCAAAAATGTTGGTTTATTTTATGCCAAAGGAAGGATGGTCCTTGCTTTGAACGATACGCCATTCCTTGTTACGTTCAAGAAAGTTTAGTTTAAGAGTTTTGGTTTGCATTTGCTCACTCAAAACTTTTAATTACCTCCATCTCTAATGTAATAATCACGAGTTCGTGAAATAATAGTATTGTGTTTATCTAAATAAAATAAAAGTTCGTTCAGTTGTTTATGAGGTTAAAAGAATCTTACTCGCAATTGTTAAAAATCAAAATTGATTATGTCCAATGAAGAAAGTTCCTTTAAGATTAGGTTTATAATTTCCGAAATTTGGTTGAGAAATCGAGTAATGTAGCGAACTTCTCATATAATTAATGATTAGCCAAAACGACTCTCTCTCTCTCAAAATTATGAGAGTTTGAACTAGGGTGTACAAATAACCTGTTAACTCGAACAATTCAGACTACTCAACCCAAATCGAAAGGGTTGGGTTCGTCGGAATTTTGGAGAATCGAAACTTCGATTGACTTTTTTAGGCTCGGGTTGACCCGACCCAACACAACACGACCCAAAATTTATGTTTTTTAATAAATAAATTATTTTAAATAGTTTTCTTCACAGTTGATAACAAATATTAAGATGTTATGAAATTTAAAGTTTGATAATTAAATTATATGTAATTTTAGTTATAAATTCAAGAAAAAAAAGTTGTTTTAAATAATTAAGAAAATTATTAAAAAAATAAAAAATAAAAATCAACCCAATAACTCAAGGATAATTCAGGTTGGCAACGCGACCAACTCAAAAAATTCAGGTTGGCTCCAAAAACTCTTCAATCCAACCTGTGTACACCCCTCGTTTGAACATCCAAACGGAAAGAAATTAAAGAGTACATACAATTAATGCTAAATTGGGCTCATATTGCTACTCATTACTATTTATTTTGTTTTAAGCTTTTCATCTGTAGTCATTGGTTGCTGTGACATTGTAAAATTGGAGAAGCTTACTATACGTGGAGATCATTGGAATAGATGATCAAATTTAATTTTGACGTCTGAGATTGCCCATGCCATGGAGGCATGGACTAATTACGATTTGTTTTCTTCATATAATTAATAATGAACATTGAAAAAAAAAAAAACAAACTTTCTCTTAACTTCATCTCTAATTCATCTCATCTACGTGATATTCTTTCGAAAACGAAAAAGATCTTCACAAATTCTCGTGAATTTAAAGATAGAAAAGATAATCTTGTTCTCAATCTTGACCCATCCTATTGATTTCCCTAACACTACACAAATTTATTAACGCGTATAAAAATCAAAGTTTTGCACAAAATTAAATTGAATCATGCAAAAAAATGAATAATTTTAAGAAAAAAGAAAAAATAAACTCTAAAGTCATTTTTGTCTTAAACCCAATAAATTTTGTACTAAATATATATTTTAATTATAATTCATTGACACATGATTTTTTTTTTCTTTTAACAATTCTTCAAATGACAACTTATTAAATTAAAATTCCCAGTACTTATTCCAGTAAAAAGATTACAAGGTTTTCTTTAAGTTTTTTAAGGTGCAATTCATCAATTCAAAGTTTAGAATTAAAACCAAGTAAATTTTGATTTTTTTTTTCAAGAAAGTTATCTCTACTAACAAATCCCGAAAACATGATACGATCAAAACAGGTTTTGACTATTAAGTTTGATATATATATTGTTAAGCAATTTTAAATTTTATTAGCTTTGGTCAAAACCTTCTTGTCTATTGAATGTGACTCCCGGATTTATTTAAAAATTATCAATAGATATTTGGAAATTGTTTTCTAAACTTGAGTTTTGGTTTGAAACTTTAAATTTCATTATCTTTGATAAAAAACTTAATGTCTATTGAATGTGACTTACGGATTTTTTTTAAAAACTAATAGGTTGTTTGAAAATTATTTTCTAAATTTTAGTTTTGGTTTTCAAACTTCGTCTCTTAAAATTCACTCCGCTAATTAATAAGAAGGATCAATTTTATTGTGTGTAAAATATCACTCATAAGCATATGATTATAAAGTTTTTATTCCATTCAGAATTCTAAACAAAAAAAATTGAACATATATTAAAAAATTAATACAAACTATTTTTTTTTAAAAAAAAAACAAATAATTTTCTAATAGGCTCTATATGCCTAAATTTGATTACAAAATTATCTTTTGATTGTTAGATTAACTAATTAAACTCTACAAACCTTTTTTCAAATCTTTTTTCAAGGGGAGGAAGTGGTTTTTAGGTTTTGATATATATTTAGTTAATACATGGTGTGATTACCATACATATATGCGAATTAAATATTGATCAGCTTATACTTTTCATCAATAATAAGTATCATTCATATGATTTTCGCATAACTAGGGTCGATTTTGTTCCTAGTCAACATATTTTTATAAAAAATATATATAAAATAATAATTCGTTTTTCAATTATTTAAATATATGTGCAATACAACTATACAAATGACTAGCTTGGCGCCCGCGGGGAAAATAATTCCATGAGAAATAAAAAACAGTGAGCAATTTCGACTCTCTGGCTCTCTCTCATTTCTTATAAATACGAGAAATTATTAGACTCTCTCTCATTTCCTATAAATACGAGAAATTATTAGGTGAGACCGTCATTAATGTGGTTAACAATTTTTCTCCATTAGATGAAGAAAAAAAAGTATCTCGAATTTATATGCAATAATTATTAAGGACATGAAAATTGTTGAATCCACAATGGTTATATCATGAGAAATCCACATTGAGTAATTTATCTATAAATACAGAGTTTTTCCAATCCCATCATGCAAAAAAAAAAAAAAAAAGAAAAAGAAAATCAGTGAAAAACAAAAAACAATATTCAGTTTAAAGAAGTATGAAGAAGTTTGTATTACTCTCTTTTCTTTTCATCGCCATCGCCTCTACTGAGGTACGCTTCTGCAGAGCCAACGCCTCGCCAGACGCCGTTCGCGACATCGACGGGAAGAAGCTCCGAGCCGGCGACGATTACTACATCCTCTCGGCTTTCCATGAAAACGGCGACGGATTAACCACCGGGAATATCCAGCAGGAATACGATGTCATCTCAACGTCGTCCAAGAACCGTACGAAGAATACAACGGTCTTGGCAACATTTGAACCTATAAACCCTAAGAAAGGCATGAGTTGTTGGGTGTGCAACCAAGTTTCCCATCAATTAGAGAAGGGAAGATTATGAGTATATAAGTGAGAGATAGTATCTTGACACTTTTTGGGTGAAACAAAAAAACAAAGCCATGAAGACATATGCTTAAAGTAGACAATATCATATTAATGTGGAGATATTCGTGTACCGTCAAACTGTGTGCAAGTACTGCAAAGTTATATGCAAAAATGTCGGGTTATTTGAATCGAATGGAAAGAAGGCTCTTGTTTTGAGCGATACGCCATTTCCTGTTATGTTTAAGAAAGTTCATATATAATTATTGAAGCTGAGAGAGAGAAGAGGTTGGTTTTGGTTTGCATTTGGTCATCACTCTCAAACCTTTCCTTCATCTCTCTCACCACAGTAATAATCACGAGTTTGCGAAATAAATTACCCTAAAAGTGTTTAATCAAATAAAATATGATTGTAAGAGGAAAAATTTCTGGTCCTACTTCCTCCAAGTATTAACTACATGTCATGTTTTAAATTAATTTTTTTTAGGTGGGTCTCACTCACTTAAATGATATTCTCTCTTTCTTTTTCGTGTATCTCTCCTCCTCTTCTCCAAATTGAATATTTAAATTTTGGGTTTAAACTATATGTTTTAAACTGTTTTATTATTTCATTTCTCACTCTAAAAAATAACCTAATGAGTCTAAATTTAAAATGGAGTAAATTGTGAGTTGCAGATAAAATGAAAAACAAATCTAAAGAGAGAAAAAAAAAATAATAAAGAGAGAGAAAAATAATATATTGAGAGAATCAAGGAAGAGATACATGAAAAAAAAAAAAAAGAGGGATGTTATCATAATCTATATCTATAATCTATAATATATTAAAAAACTACAATTTTTTAAACATTTTTTTAGACTATTTTGTCCCTTTTTCAAGTTCTAAATTTACAAAATATGCCACTTACAATTATAATTACCAACAAAAATTATATCTATAATTCTTATAATTATTAATTAACATAAAAGTGGTTAATAAAAAAAATACCTTCACACTACCACCCACGTTTTCTTTTAAAAAAATAGTAATTTAATTAGTAAATTAATTAGATGTGAACAATAAATATTTACATTTCTTAAAGCTTGGAATGTAAAAAGTTTTCTATAACTTAAATAATTGTTTGATGTTTGATTTAATTATAATTCATAATAGAATAAAATTTTAAAAGTGGTAGAATTCAAAAAGTTTTATAACTCTCACATTTTACTTCTATAAATATGAGTTTTTGATTACCATTCTCTCACATTGTTTGATGTTTGAGTGTGAGATCTTTCATTCACATTTTTTTTCCATCAAGAACATTTTATTTCATTATTTCTTTATTTTTTTGTCTCCCAACTCATTGTTCTCAAAGTCTCTTCAATCTTCAAAATATCTTCAAATGTTTGCAACACATTCAAGTTGCAAGACGACGATGTTGATGCATGTGTCAAATTTAAATCCTAAACACAAACATCTTCAAATATTGGTGGTACGTTCAAGTTGTAATGAATGCTTGTATCGAGTCTATATCTTAGACCTTAGACAAATATACATTGTAAAGTAGGATAATAGTGTTTGTCCATCAATTGTCAAATGTGTCTATCTATTTAAAAAATGATTGTATAGTAGTTAGAACATTTATTTTTTTTACTATTATTTATTTATTTTATTGTTATTTTATAACAAGTGTATTGTTGTTATTTTTTAAAATATAATTTTGTAAAAAAAAAAAAAACTTGTTGATAAATTGTACAAGTTTTATCTTCCTTTTGTTGGAAAAAAAACATCATGGAGTTTCAAATATATTTCTCATAAAGTTTTTAATTAACTTTAATAGTAAGTGAATTAATTATCAATAATGGATTAAAATAATTATCAATTAATAATTTCTATGTAATATGAAAAGTTGTAATACACAAAGTATTGTTATTTCATTCTCCATGGAGTTAAACAAGTTTTCTTTAATTATAATTATAACTAATAAAATTAATTATAATATAAAATTAATTTAGTAAAATAATTTTATTTTTATTAAGGTTGACAAGTTCAAAATTTAACAAAATATTATAAAAAAAAAATTAACAATCCTAAAATTCGAGACGAACCTACGGTAGGGAAGTTAATCCGCCACACCGTTCGCGCCGAAATTTCGGCAGCATAACAACATGGCAAGGAGAAAACACCTTAAAGAATTTCAAAGAATGCCCAACCTCCACCAATTTCAATAAAAAAAATGGGTTGGGCTACAACCAAAGTTCCAATACACGATTCCAGACAGATGAACATGCAACAGAAATTAACTAAGACAAAATGAAAATCAACCACTACGACTATGAAACATCACAGATAAAGAGGAATCAAAACTGTCGAGCATCTATGTGAAACAAGCTCTAACAAGACAAACTCAAGAAAGGAAAACTAAGCCTACAGACAAGAGACAGGAGGGGTCCACAGCCTCGACAGGTCGTCGTCGTCGTCGTCAACTACCTAAGGGTATCTTAACTGCAACCAAAAACATCAAAGTAGACGGGTGAGTATAAGAATACTCAGTAAGTAGCCCACTCACGTCCAAGTTACAACTGCACACACAGGAATAAAGCCACAACTGCACACATAGGAATAAAGTCACAACTGCACACACAGGAATAGATCGAGTTTGCAACCAAAATCATCTTAAGGACTCACGACTAGGCACCCTCGTGCATACCTCACTCGAGACTCACCGACGATCCAGGAACCTAGACGACCACAGACGGGCACACACCCCAATGCAAAAGAGTGATCACTCTAACGCCCACACAGAAAAGCAATAGAGTCTCGACTCTAAAGCCCACACAGATCGCACAAGGGCGGCCCACCCAAGTGCCCATACAACTCGCAAAAGGGCGCGCTCGACCCTAATGCCCACACAGGGTTCACAAAGGGCGCTCGACCCTATGCCACACAGCTCGCAGTAGGATGCTCGACCCCACCACCCACACAACTCGCAATAGGATGCTCGACCCTATCGCCCACACAGCTCGCAGTAGGATGCTCGACCCTACCGCCCACATAGCTCGCATTAGGATGCTCGACCCTAATGCCCACACAGAATCACCGACGTAGAGGTTCGGTCTCGTCTGGGGTCACTCAATGGGTTTGGTGATGCTAGCCTTGAAGGTCAGACTCTCTTGTCACTTCTAAACCTTCCCCTAAACTCATCTCGTTGTCGTCTTAGGTGTCGAGTCCTTAACCACCAATCCAACCTCACCAACCACACAACCCGAGCCAAGCAACCCTTGACCCAATCACACAGCTAACCAACCAAGCACCTTGGCCTAACCAACCACACAACTTAAATCACTCAACTTCCAGGAGTTGAACACACAACTCATACTCCCACATAGGAATCCATGCTCAAGTCTCCCCACAAGCAACCAGTCACAGGTCATAAAACATCACATCCTCGAAATGTAGGAAAACCAAACTCAACACAAAAGGAAAAATATAAGACAAACAGGGTCTCATGAACTCTTCTCCTACCAAACAGTAACCAACACTCCACACAACAAGATACAAGCCATCCATAAGCGACCCTAGACACAAACTCTAATGCTTAGAAGTTTGATGCTTAAACCAAACGACTACTTACCTTTGGTCGAGAACCCAACAAGCCAACCCATTCTTTCCAACTAGAGTCATTCCTAGGCCTTGGGTACCTGCTCAGCCAGAGTTTACTCATGACTCTCAAACTCCAAATATCTACCACACGGTCCCACAAATCAGCTACACCTTTAAGTTGTTAACACGGAAAACACGAGGTCAAACAACCTCTTATCCACACAGTCAGTCCTTAGATTTCTTCGGTCGATGTCAAGAAGTCTCACACTCAACTCGAAGCTTTAAGCTCCTATTTCCATACCAATTACTTTCTTAGCCCATTCTTGGTTTCAATAACGCTAGCACAAGCCTGATATCAACAATATACCTCGTCACATACTTGTAGTGCGTTACTTTACAATTCACGAATTACCACATTCCAATCAAAACAACTAACAACTTAAAGGGACTTAATCCAATTGACTACTTACCTCGATGACGAAGGTTTGGTAGGTCGATTCCGAGTTTCACGCACTTTCTTAACACGATCGAGCCCTTAATTAACGAACAAGGGCTTAAAACACACTCCTAATAACATTATTTGAACTAATCCAACATAAGTATTCGAAACAACACATGGAGGCTGAAAACGACATCAACAAAACGAAAGCGACTGATCGTACCTAATCGGCAGCGGTGGTTTTCGGCGAGCAGCAGCGACTTTCGGTGAGTGGAGGTGAGTGAAAAACGGATCCGGGATGCCCGTATGGCTCGATCTATTCAAAACCAATCTCCAACCAATGAGAACCGACCTAAAACATAGTGAACCAAGCGAACCAAATCTAAAACGGTTAGAGAAAAACTTATCGGCGAAGGCTACGGCGAGCGGGGCGTCGGTGAGGAGAGGGACGACGACCAAAAGAGAACGAAACCGGCAAGAAGCGGCGGGAGGAGGGTGTTGAACTGGTCGACAACTCGAAGGAAAACTTGGGTTGTTGCTCGATGGCTTTTGCGAAAGGGAGGACGAAGAACGACGACTAGCGAGTCGCTTGCAGTTGGCATCGGCGCGCTGGTCGGCAACGAGGATGGCGACTGTTGGACAACGAGAAGAAGAAGACCGATTGAGAAGGCAATGTCGAAGGGGACTAGCAGCGACATAAACACAATAAGGAGAAGAAGGAGAGAAACACACAGGGGCCTATTCGCTGCTTCCGACGTGAAAGAGAAGGAAAAAGATTTTTTTTAAAAAAAAAATTCATTTTATTTAAAACAAATCATTTAATTCTTTTTTTTATAAAAACCTTCCTTTTTCCTTTTCTTTTCCTTATTTAACTTAAACTTATTTCCAAAATTACGAAACAAGCACCGATTTAAATTATGAAAAACATTTAGAAAATTACCTGAAAATTATCGGGCGTCACAACTAATAAATTTAATTATAATATAAAATTAATTTAGTAAAATAATTTTATTTTTATTAAGGTTGACAAGTTCAAAATATAACAAAATATTATAAAAAAAATATATAGAATTTTTAATTAACTATCAAAGTAATTAAAAATAATAAAGGATTCCCATTATGTAAAAATAATTTTTTCAAACAAAAAAAATGAAAGTATGAATATATTTTTTAAATAGATATATTAAAAATAATAAAATTTAGCCTAAATTTCACTAAGAATATGACATAAGATGATGTAATTTAAATGCACTCTAAAAATTAAATGCATTGTAATGCCCGAGACTTTCATGTACCTTTAATTTTGAAGTTTTGACTTGAAATTTAATGGATCTAGTCAAGTTAGACTAAGATTAATGGTGTTGTGATTATTTCGTTATTTAGAAATTTGGAATAATGAGTTTTGTAATTTAGTGAATAAGGGAGGGGCAAAGTAGCAATTTTGAGCTTATGGCATTTTTGTAGATTCACTGAACTTGTACGACAAAATGGTAATTTTGTTTGTCCTTTAGTGAATCATCTTCTTCTTCTTCGCCCTATCTTCAACTTCACGAGCAGCCGCCTGCACCTAGAGCCTCTTTCTTCTCGTCATGTCGCCACTCAACCAGATCCGATGCCCTCAGTCACCAGTTCGCAAGCCAACTGCTGAATGCAACTCGTGGTCGTCGGTCAGCCCCTCGCGAGATGCAAGACCGCGCCGTAAGAAGTCGCTTACCGTTTGTTCGCCTGAGAAGAGCATCGCTAGCCTCGTCGGAAGTTCGTCCGCCTTTCAGATCATTGAAGCCTCGCCTGAAAATCGATTGTCGCAAAGAGCGAAGTCGTCGATTGACGAGCAACAAGAGCTGGAGCCTGCTCTATCTTCGACCATATCTGTCTTCGTCCATGTCGAATCACGCTAGCTCGAGCTCCGCCTCATCAGATCCTCTACGGTCACGAGCTCATGTCGTGCATATCCGTTGGCAGCGCTTCCCTGGAGTTCGTTTAGTTCGAATACAATTTTGATTCAGACCTTCAATCATCGGATACCCTTTGTGGTTCGATGTTGGATTAACAAAGATTTGGACGATTTTAGTCACAATCGAGAGACGATCTAGCTTGAGGTGAGTTTGAAGGAATGTTTAATACCTCTAAGTTGTAATGCTAAAATGGATTTTTAATTTAGGATGTATGGATGAGCAGGTTCGAAAAGGGATCGTTATTTTAGGAACTGAAATTTATGAGTTGGGGCTTTAGGCCAAGGTAAGGGAACCCACACCAAACTCTAAAATTCAAAATTTAACGGTTTTACTCGAAAAATGCCATGATATGTTTGAAAAGTTCTTTATAAATGCCAAGATATGTTTGAAAAGTTCTTTATAAATGTCATGACATGTTTGAAAAGTCCTTTATGAATGTCATGATATGTTTGAGAGGTCTGTTCTAAATATCATGATCTGTTTGATTTGATGTTTATTTCAAAGGCCATGAATGTTATGATGTTGTTAGGTTAAAACTTGCTTTTCCAAAACCTTCAATCAAGTCTGCAGAACCTGATATAGTATTTACTTTTGCTTCCAGCATTGTCAATTTAGAAAAATACTTTCTGTTTATAAGTATTGTGAGCGTAGTTATGCAGTCTGCCAGACATAAATCTTCATTGTTCATTTTTGAGTCATTCAACACTGTCCATATTTCTTCATTAAACAAAAATAATTACAATAAGAGATATATAATATAAAAAATTTAAAGCTAAAAAGAACATAAAACTGAACATTAGTATTAAAAATTCTCAAAATCAAAAGAAACACTTGTTATTCCATCAATTGCTTCAATCTTCTCTTTAGGAGATGCAAAAAAGTCTTCCACATCCAAATTTGTCATATTAGATTGGTCGAGAATATATCATTGTCTTGATAAGCAAGATTTGCTTCTACATTTTTTCCCTTTTCCTTCAGGGAAGCTTGATAGAGATCAACTAAGTGTTTGGATGTACGACAAACACGTGACCAGTGTCCAGTCATTCTACATCGAAAACATTTATTTTCATCACCTCCTGAATTTTTATTTTGTGGAGCTTTTCCTTTATGGTCATCATTTCGTGTGGATTTCTTGAAATTTAGATAATTATAATGCCCACCACGAAAAAAATTATTTCTTTCCCTACCGCGGTCATGACCTCGACCACGACCACGACCTCGATTGTTAAAATTCACAACATTCACTTCAGGGAATGGTGTTGCTCCAGTTGGTCGAGATTCATGATTTTTCATTAATAGCTCGTTATTTTGTTCAGTCACGAGAAGACATGAAATGAGTTCTGAATACTTTCTAAAACCTTTCTCTCGATATTGTTGCTGCAGGAGCATATTCGAGACATGAAAGGTATAAAATGTCTTCTCTAACATAACCACACCAATAATGTTTTCTCCACATAAGTTCAATTTCGAACAAATTTTAAATAATGCGGAGTTATATTCACTTACTGATTTAAAATCTTGCAACCTCAGGTGCATCCAATCATAACGAGCTTTTGGAAGAATAATAGTTTTTTGATGATCATACCTTTCTTTCAAGTTCTTCCACAAGACATAGGGATCTTTTATCGTCGAATATTACATTTTCAATCCCTCGCGGAAATGATGACGAATAAAGATCATGACCTTAGCCTTTTTTTGTCTGGATGTCGTATTTCCTTCTTTAATGGTCTCTCCAAGGTTCATAGCATCCAAATGTTTTTGAACATCAAACACCCATGACAAATAATTATTACCATTAATGTCAAGGGTCGTGAATTCTAATTTTGCAAGATTTTACATAGTAACGCTATCAAAAGGCATTATATTTATATTAGAAAATTAATATAAACTAAATATGCAAAATAACATTAAAAAAAATGACTTGCCCGGAGGACCTACCTTCAGTGAAAAAACCGACAGAAGTTTCGTGTTGATAACGTGTTGTAAAAATAGGATACCCCTTTGCCTGTTTGCCATCAAACTTTCATCCCAAGAAACTAGTTGCTCTTATCTTGTGTCACGGTCGCATCGAGTACTTATGGAAGAATTTTAAGGCATTGACTTCACTCGAGAACATCATAATTATATACTGCAAGAAACTGGTGTAACTGAACCTGAATTTTGATTTTGACTTTGATCTTTTAACTGACTTTTCAGGTCTTTAACAATTTAAAAATTGTGGATCTGAGCTTCTGCAAGTTTCTTGTTGAAATCCCTGATTTTTCAAGAATCCCAAATGTTGAGTCATTGGACTTCAATCACTGTACAAGTTTAGTAGAGGTTCATGAATCTGTTGGTTTGCTTCAAAAGCTTGCTACTTTAAAACCTTTTGTTCTGCTCGAGCCTTCAGAGACTTCCAACTAGCATCAAATTGATATCTCTGAGCAATCTTTTCCTTGCTGACAGCCCCAAGCTGGAGGCTTTTCCTAGTATTCTAAAACAAATGAAATATATTGAAAGTATACATTTAGAAATGTCTACTATTAGGTGGTGTTTGGCCCATTGGTTTGGGTTAGGGTGGTCAAACCCAACACCATGTTTGGTACAAACGGTTTTGAAAACACAACCCCATAGGTTTTGTACCATGTTTTCCACTTCTATTTGTCGTTTTTCATTTTCAATGACCACTTCAACAACAATTCTGATGACAACTTCGACAACAACTCCGACGACTACACCGGTGACCCGACCAACTCCGATGACAACTCAAACGATCACCAACAACGACAACTCCGACGATTACACTGGCAACTCAACCAACTTCGATGAAAACTTCGATAATCACACCGACGACTCGACCAACTCCGATGAAAATTCCGACGACCGCACCGACGACCCGACGAACTCCGATGATAATTCCGACGACCACACCGGCGACCCAGACCAACTCCGATGACAATTCCAACAACCCCACCGACAATCCAGACCAACTCCGATGACAATTTCAACGACCACATCGCCCGACTCCAACCAACTCCGATAACAATTCCGACGACCACACCGGCGACCCCGACAAACTCTTATGACTATTTTGACGACCACACCAGCGACCCCGACCAACTCCGACTCGACCACCCAAACAACTTGATACTTTGATGCCCCAATCACCGCCAATCCGAAAGAGTTGTTCAACATGAGACATTCATTCGCTCGAAATGTCATTGAGAGAGCATTCGGTATGCTGAAGGGTCGATGGGTGATACTAAGAGGAAAGTCTTTTTATCGATTAGAAGTCTAGTACAGGACTATAATTGCATGTTGTTTGTTGCACAACTTAATTATACAAAAGATGGACCCAAATCTCACATTCGAGGAGGCACACACAAGTGACCCTAGTTCAACTGGGATGAACACACACAATGTTGGATTTGTAGAGACAACAGATGTCTGGACTGAATGTTGGGAATATCTCGCAAACCATATATGGATGAATTGGAATGAGAATTGACGATTGTTGAATTAAACATATATATATATATATATTGATTAGATTTTGTATATATGCACTACATACATTGTGGTGGATAATATTTTTGTGTGTAAATATTATTAATATAATATTATTAATATAATATAATATTGCCTAAAGATGACAAGGCAAGACTGCATGATTGTTGCAAAAACGCTTCTCTTGAATAGTATGATGTTGAACTATTTCTTTGCATGAACCACCCCAAATTGGAAGTATGATTACTGTGTGGAAGTTCTGGGGAAAGCACTGGGAACCTGATTTTATTTCCATCTACCACTCTCTATGAGTTTTTTAGAATTTTATGTTTAATTGAGCTTCGTTGGACAACTATCTCTATTTTTTTATTATTTATTTTAGTCAGCGTAACATGTTATTTTTAACCAACATAGTTCTATGCATCGATTATTTTACTTTATCATTGGCATCACAGTTCTTACATTACACATTATAACATATAGTTAGTTGTTTTACTTTATGTCATCATGAGTGAAGTGCATTATAAATATAATATATACTCCTACAAAGTATTATAAACTCAAATTAGATACATTAGATAAATTGTTATAATTATGTAGTTATGAAGTTAAAAAATAAAGTAAGAAGACAAAATTACTATATTTAAAAAATATTAAGTATTTATTTTAGAACAAACAGACTGCACACACCTAACCCAACACAACACTGCACCCCAAACACTGTTTTATGCAACCTATTCCAGCACAACACGGCATCTCAAACACTGTTTTCTGCAACACAGTCTAACAAAATCCTGCGCTCCAAACACTATTTTCGTCCAGTGTTTCTGATTCCAGTCAAGAAACCCCAGGGTTTCTGATTCCAACACAGCATGCCAAACAGTCACTTAAAAGCTTCCTTTATCAATCAACAATCTCATTGGACTTAAAGCTTTGAACTTAGGATTCTGCAAAAATCTGGAGGTTCTTCCCAATAGCATTTGCGAATTGCGGCTTCTTCAACTTTTAAATCTTATGGGTTGCTCAAAACTCCGAGAAATTCCAAAGCTCCCTTCTAATTTGTTCTATCTATGTGCAGATGATTGTAAATCACTGGAAAGTTACTCACAATTGTCAGAATTGATTATGTTCAATGCAGAAAATCCATCAAGATTAAGCATGTCCACATTAATTGCCAAAAAATAATTGAGAATCAAAATAATGATGTTGTGAAGTTTTCATTAAGTGAGGTTAGTTTTTCTCTAGTTTCTTACTTATACATCCTATTGGTATACGTATAAATGTATCGTTGGATTGTATCAATCATTCTAAATAACTTCTCTCTTGTAATTGAACATGTCTCTTTTTTTGCATCAGGGATCTATTATGGTTTGATTTTCATGGTACAGAAGGCTTAATTCATCTCCAGATTGCTTCAAGTTTATATGGGAAACCTGCAACTTGGTTTTTTTTATGCAGTTTTGGGACCTGAGGACAGTGATGAAGCTACTGGAAAATTTTCTCGTGGAGTTGTTGTCTCCATTAATCACCAGAAGAATATAGGCCACGTTTAATAATCATTTTGTTTTTTGTTTTTGGTTTTTGAAATTTAAGTTTATTTTCACCTAACTTATCTACGATAGAGATCACCCCCTAAGGAAGTATTTAAATTCTTAGCCAAATTCTAAAAACAAAATCAAGTTTTTTAAAGCTACTTTAGTTTCTAAAAGTTAGCTTAGTTTTTGAAAATATAGATACAGGGTAGATAAGAAAACATATAAATTCATTAGTCTAATTAGTATTTTTAGGCTTAAATTTCAAAAACTAAAAACTAAAAACCAAATGGTTATCAAACGGAACCATAGTAATTGAAAGAGTTTTTGGTTCTTTGAAAGATCATATCTGGATCATGAGCTTCTGCCCAAGGCCATTGACATGGATCTTGAAGAAATGTGGTTCCACATAGTAATTTGAATGACTTGTCATTGGTAAGAAGATATGCAAAAGGAACCCATTAAAAGATTGATGGTGATAATTATATATATAAACAAAGACTATTACATACATCACAAATTTGGGTTTGAACTTGCACCTTTAAAAAAAAGTACAAATGCTTTTATACCATAATTGGTTGTTTGTTTGTTATAGTTTAAGTAATGTATTAGAATCAACGTGAGCTTAGCTTAATTGGTTAAGGTGCATATTTTCGATCATGAGGTCAAAGGGTTCGATTCTCCCACTTTATTTATTTTTGAACTCAACAAAAAGTGGCGTATTGAGATTTCAATGTGCTATAATGTATTTAAAATTTGAAATGTCATTTCCTCTTAATTAGTTTATATTGTTCGAAGATTGTTTAGATCTCATGCTAATTATTTTAGCTGCAAATGTTTGGATACACCTATGGTTTTATGTCACCATGATTTATGGAATCTCCTAATGATACCTAAATCTTTTTTACGAGTAGTAGAGATGGGAGAGAAGAGAATATAAGTATTATTTATATATATCAATAGTGTGTTATAATAATTTTCTTTTTTTGGCCTAGAAATATATATTTTATCTTTTGATTCATATCTAATCACTAGAGCTGGTCATCGACCGACTCCGAACCGTTGGTCGGTTTCTAATATTTTTAAATAATTTATTTCATTATTATTTAATATATTTATAAAATATAATATAAATATATAAATTTTTTCTGGTTTTGGTTTTATTATTATTGTATATGATTTTTTTAAAAAGATCCGTCAGTTTGTCGGTTTTTTCCCTGACCGATTCTTGTGAAAACCGACCGATCGACCTTAATTTGGTCGGTTTTGGTCGATCGAGTTAGTTTTCGGTCTTTTTTTTCTCACCTCCATTTAATGTAACAACATTTTCCCAACATAATCCATTATTCTCAGATCATTCATTAAATTCGTTGTAATTCTTTTAAAAACAAAGAATATGACAAAGTATACATTATAAATTCTCATCGATCTAAAAAAAATAGAAAATTTAACCTCTCGCTCACAATTTGATCCTGTCTTTTTTTTAATTCAACAATATATAGAGAAATATTCAAACTTCTAATCATTTAAGTTGAAGATATAATATCAATCTGAATATAGTTTAGTAGATAGGATACAAATTACTATCTCAAATGTCAATGATTCTATCATCAACCCCATAATTATTAAACTCAAAAAGAAAAATTGAAGATATAATGTCTTATCCAATTAAGTTATGTTCAAGTTAGTTCATCTCATTATTTCTATCTGTTGAGGATCTCTCATCTAAAATATGAAGAGAAATTTCACAATTTATAAGATATAGGAGTTATTCATTTCATTGTCCATTAATTTTGAGATATGAAATTCTAACCAATCTAATATCATCTGGCCCTTCAAACTATATAAATTAAAAATAAATAAGTCAAATTTGTGTGTTGGCCATAGGTTAGCTTTTCTTGCAAGGAAGCACAACTCCCTCGTTTAATGGGAAAAATGATTGAGACTTTTAAGCAATAAGATATATATGATTGGAATTTGGGTTCTTTTGGAAATAGAACAATGATCTTATTGTTGATGTTAACTGTGGAAGCCATCTGTTTGAACAAATGTGATAAACTATAAGGGTCTGTTTGAGACACATGTTATAACAACCTGTGTTATTATAATCGGAATACTTTGTACCACAAACACAAATTATTATAAATCTAACTATACTACTTTGCACTTTAAATAATATTATATGATTCTACCGACTATTATAACTTATTGTTATTATAACTCACTCAGCGCCCCAAACAGTTCCTAATCTTATGTTTTTATGTCTCATTATTCAAACTAAATACCAAAAAAATATGATTTGAAAATGCAAACTATATATATATATTCTCCATTTAAAACTTAGAGAAGAAGAAAACCCCTTGATGAATATAATTCAAAGTCAACCATTTTCTAGATTCAACACACTTCATCTTCAAAGCTAAAGACTATTAAATGCACTCACGAGGGGCAAAGGGCTTGACTTTTAACTATTTAAAACTTATCTCTACTAACGAATCTGACGTGTCGATCTTAGAACATGATAGGCTCAAAAATATTTTTTAGTATTAGTTTTGATATATAGTTGAATAATTTTAATATTATTGTCTTTGGTCAAAAACTACCCAATCTGGATCTCCGGATTTCTTTAAAAACAAATACAAGTTGTTTGGAAATTGTTTTCTAATTTTCAGTTTCCTTTTTTTTAACAATTTTTAAGTCTTCATTTTCAAAATTATTTCTAAAAGTGGTGAATGAATTAATAATTAAAATCCATATTAATGGGTAAAATATTACTCATCAATTGTGAATAGAGAGAATAATTTCATTTATTCTAAAAGAATTATTCTAATTCTTAAGTATCAAGTTAAGCTAACTTTTTTCCAATTTCGTTTCTTTTTTGCCATATCCAAAAACTTCTACTTTTAGGATTTCGTACAAATAGAGTATTTTTTTTCTTGTTGCTATTGATTCATTATTTGAGTGGTTGTGTAGCAAGTGATTCTGTATCATTGAGGGGGCGTTTGGAGCACCATTTCGAGATCAAAATCCTTGAAATTGAAACCCTGAACGATTTGTTCTAAGAGTTTGAAATCGTGAGAATTAGAAAACGATATTTGATATGTAATGTTTGATGTCTGGGAATTAGCTAAATAGTATTGGAATTAGCAAAACTGTGTTGGAAGTAGCAAAATTGTGTTGGAAGTAGCAAAACAAAACTATAAGGTTGTGTTTCAAATCCATTGAATTTTGAACACAGTGTTCCAAACGTTCCCCGAATGTTTAAGAGAATTTAAAAACCATATTTTAAATATATACTAAAATGTTATTATTTGTAGCTGGTCAAATGTCATATCAATACTAAAGGATGACTAAGGACAATAAACAATTTTTCTTGGCAACGTAAATATATTTTGAAGTTAAAGTAAAATATATTTAAAATTAAGAACTATAATAATTTTGTAATTAATGGTTTTTCTAATTAATGGCATGTTTCCATACTAACTACTACTAGGACATAAAAAGCTTAACTAATTCCCCTCTCACATTTAGTTTTTTTCTTTTTCTTTTTTTTCAGATAAACACTATCACATTTAATTAACCATAATTTGTAATTTCTCTCCTAACAGTACATTTTCATATTGTAATTGTCTCCTCTAAATCCATTAAAACCAACTAAATTTATAGTTAAAATATCATTTTTCTTTTTTTTTTTCTCTAGAGATCTTTGAAATGACTTATTGAGTGTTTAAAAATATTTTTTTTCAAGTTTCGTTTGATAATCATTTAGCTTTTAATTTTTTTCCTTTGAAAATGAGGCTCGTTTTTCAACAATTTTTCTTCATATTCTTTATAATCATTTTCATCTTTGTTGAGATATCATTTAAATTTTTAAAAATAAAAATAAATTTTTGAAAATTACTTTTCTTAATTTTTAAAATTTAACTTAATTTTGTAAGACATAGATACAAAATAGAAAAAGAAAAGTAGAAACTAAGAGGTCATTTGGATTGAGGAGTTGACAGAGTGGAATTGGGAGGAGTTAATAACTCTTTGTCTCACCGAAAAGAGTTATTAACTTCTCTTTGTTGGTTCCACAAATTTATTAACTTCTTTCCCAACTCTTTCTTACATGCCCCACAGATTTATTTATTAACTCCTCCTCCAACTCCCCATAACTTATTTTCCAACTCTTACTTCCTACAACTTCTTCCCTTAAACACAGTTTCTCAGAACTGTGAACAACTCTTACACCCCAACCACAGATTAAGACTTCTTTACTCCTATTCCCTAAAACTCTTAGTCTCTTATCCCCTCAATCCAAAGGCCCCTAAATAGTTATTCTTAAAATTTAAAAGTCATTCCACATCCAAACGAGCATCTTCTCTCTCTACTTTCCTATAAATACGGACCTTTCCCTATCTTATTCATCCAAAAAGAAAAAGAAAAAGAAAAAACGCATAAAACCAATCAGGAAAAAAACAAATGAAGAACTTCGCATTACTGTCTTTTCTTTTTATTCTCCTTGCCTCTACTGAGTCTGAGGTCCGCTTCTGCAGAGCCGACGCCTCCCCAGACGCCGTCCGCGACATCGACGGGAAGAAGCTCCGCGCCGGCGTCAACTATTACATCCTCCCTGTTATCCGCGGAAGAGGAGGCGGCCTAACCCTAGGCAACCTCCAATCCGAGAAATGTCCAGTCAACGTGGTTCAAGAACAATTCGAGTTAATGAACGGCCTTCCGGCGACATTCGCGCCTGTAAACCCTAAGAAAGGAGTGGTTCGAGTTTCGACGGATTTGAATGTGCAATTCGAGGCCAGTACGATCTGCGCCACATCGACGGTGTGGAAATTGGACAAAATCGACGAATCGACGGGACAGTGGTTCGTGACAATCGGCGGAACGAGAGGAAATCCAGGGCGGGAGACGCTCGACAATTGGTTCAAAATTGAGAAGCACGGCAGCGATTATAAGTTTGTTTTCTGTCCGACTGTGTGTGATTTCTGTAAAGTTATGTGCAGAGATGTTGGAATCTTCTTCAAGAATGGAAAGAGGGCTCTGGCTTTGAGCGATACGCCATTCCCAGTTATGTTCAAGAAAGTTTAA

mRNA sequence

ATGAAGAACTTCGCAATACTGTCTTTTCTTTTCTTTGTCCTTGCCTCTACTGAGGTACACTTCTGCAGAGCCGACGCCTCGCCGGACGCCGTCGTCGACATCGACGGAAAGAAGCTCCGCGCCGGCGACAACTACTACATCCTCCCTGTTTTCCGGCGAAACATCGGCGGAGTAGCCATCGGCGGTATCCCTGGATACAACAATCAATGTCCGATCAACGTCGTCCCGGAAACTTACGAAGCATCTAACGGTGCTCCAACGATATTTACGCCCATAAACCCTAAGAAAGGCGTGGTTCGAGTTTCCACCGATTTGAACATCCAATTCGAGGCGAACACGAAATGCGCCAAATCGACGGTGTGGAAAATAGGTAAATTTGATGAACATATGAGGCAATATTTCGTGACAATCGGCGGAACGAAAGGAAATCCAGGGCGCGAGACGTTGGAGAGTTGGTTCAAAATTGAGAAGCATGGCAATAATAATTATTACAAGTTTGTGTATTGTCCAACTGTGTGTAAGTACTGCAAAGTTATATGCAAAAATGTTGGTTTATTTTATAACTTCGCATTACTGTCTTTTCTTTTTATTCTCCTTGCCTCTACTGAGTCTGAGGTCCGCTTCTGCAGAGCCGACGCCTCCCCAGACGCCGTCCGCGACATCGACGGGAAGAAGCTCCGCGCCGGCGTCAACTATTACATCCTCCCTGTTATCCGCGGAAGAGGAGGCGGCCTAACCCTAGGCAACCTCCAATCCGAGAAATGTCCAGTCAACGTGGTTCAAGAACAATTCGAGTTAATGAACGGCCTTCCGGCGACATTCGCGCCTGTAAACCCTAAGAAAGGAGTGGTTCGAGTTTCGACGGATTTGAATGTGCAATTCGAGGCCAGTACGATCTGCGCCACATCGACGGTGTGGAAATTGGACAAAATCGACGAATCGACGGGACAGTGGTTCGTGACAATCGGCGGAACGAGAGGAAATCCAGGGCGGGAGACGCTCGACAATTGGTTCAAAATTGAGAAGCACGGCAGCGATTATAAGTTTGTTTTCTGTCCGACTGTGTGTGATTTCTGTAAAGTTATGTGCAGAGATGTTGGAATCTTCTTCAAGAATGGAAAGAGGGCTCTGGCTTTGAGCGATACGCCATTCCCAGTTATGTTCAAGAAAGTTTAA

Coding sequence (CDS)

ATGAAGAACTTCGCAATACTGTCTTTTCTTTTCTTTGTCCTTGCCTCTACTGAGGTACACTTCTGCAGAGCCGACGCCTCGCCGGACGCCGTCGTCGACATCGACGGAAAGAAGCTCCGCGCCGGCGACAACTACTACATCCTCCCTGTTTTCCGGCGAAACATCGGCGGAGTAGCCATCGGCGGTATCCCTGGATACAACAATCAATGTCCGATCAACGTCGTCCCGGAAACTTACGAAGCATCTAACGGTGCTCCAACGATATTTACGCCCATAAACCCTAAGAAAGGCGTGGTTCGAGTTTCCACCGATTTGAACATCCAATTCGAGGCGAACACGAAATGCGCCAAATCGACGGTGTGGAAAATAGGTAAATTTGATGAACATATGAGGCAATATTTCGTGACAATCGGCGGAACGAAAGGAAATCCAGGGCGCGAGACGTTGGAGAGTTGGTTCAAAATTGAGAAGCATGGCAATAATAATTATTACAAGTTTGTGTATTGTCCAACTGTGTGTAAGTACTGCAAAGTTATATGCAAAAATGTTGGTTTATTTTATAACTTCGCATTACTGTCTTTTCTTTTTATTCTCCTTGCCTCTACTGAGTCTGAGGTCCGCTTCTGCAGAGCCGACGCCTCCCCAGACGCCGTCCGCGACATCGACGGGAAGAAGCTCCGCGCCGGCGTCAACTATTACATCCTCCCTGTTATCCGCGGAAGAGGAGGCGGCCTAACCCTAGGCAACCTCCAATCCGAGAAATGTCCAGTCAACGTGGTTCAAGAACAATTCGAGTTAATGAACGGCCTTCCGGCGACATTCGCGCCTGTAAACCCTAAGAAAGGAGTGGTTCGAGTTTCGACGGATTTGAATGTGCAATTCGAGGCCAGTACGATCTGCGCCACATCGACGGTGTGGAAATTGGACAAAATCGACGAATCGACGGGACAGTGGTTCGTGACAATCGGCGGAACGAGAGGAAATCCAGGGCGGGAGACGCTCGACAATTGGTTCAAAATTGAGAAGCACGGCAGCGATTATAAGTTTGTTTTCTGTCCGACTGTGTGTGATTTCTGTAAAGTTATGTGCAGAGATGTTGGAATCTTCTTCAAGAATGGAAAGAGGGCTCTGGCTTTGAGCGATACGCCATTCCCAGTTATGTTCAAGAAAGTTTAA

Protein sequence

MKNFAILSFLFFVLASTEVHFCRADASPDAVVDIDGKKLRAGDNYYILPVFRRNIGGVAIGGIPGYNNQCPINVVPETYEASNGAPTIFTPINPKKGVVRVSTDLNIQFEANTKCAKSTVWKIGKFDEHMRQYFVTIGGTKGNPGRETLESWFKIEKHGNNNYYKFVYCPTVCKYCKVICKNVGLFYNFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVGIFFKNGKRALALSDTPFPVMFKKV
Homology
BLAST of Tan0014729 vs. ExPASy Swiss-Prot
Match: Q9LMU2 (Kunitz trypsin inhibitor 5 OS=Arabidopsis thaliana OX=3702 GN=KTI5 PE=2 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 6.2e-59
Identity = 107/199 (53.77%), Postives = 146/199 (73.37%), Query Frame = 0

Query: 192 LSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTLGNLQ 251
           L ++F+LLA   S  R    +A+ + V+DI+GK L  GVNYYILPVIRGRGGGLT+ NL+
Sbjct: 4   LLYIFLLLAVFISH-RGVTTEAAVEPVKDINGKSLLTGVNYYILPVIRGRGGGLTMSNLK 63

Query: 252 SEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWKLDKI 311
           +E CP +V+Q+QFE+  GLP  F+P + K   + VSTD+N++F      + +++W+L   
Sbjct: 64  TETCPTSVIQDQFEVSQGLPVKFSPYD-KSRTIPVSTDVNIKF------SPTSIWELANF 123

Query: 312 DESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVGIFFK 371
           DE+T QWF++  G  GNPG++T+DNWFKI+K   DYK  FCPTVC+FCKV+CRDVG+F +
Sbjct: 124 DETTKQWFISTCGVEGNPGQKTVDNWFKIDKFEKDYKIRFCPTVCNFCKVICRDVGVFVQ 183

Query: 372 NGKRALALSDTPFPVMFKK 391
           +GKR LALSD P  VMFK+
Sbjct: 184 DGKRRLALSDVPLKVMFKR 194

BLAST of Tan0014729 vs. ExPASy Swiss-Prot
Match: P13087 (Miraculin OS=Synsepalum dulcificum OX=3743 PE=1 SV=3)

HSP 1 Score: 211.5 bits (537), Expect = 1.8e-53
Identity = 115/209 (55.02%), Postives = 141/209 (67.46%), Query Frame = 0

Query: 192 LSFLFI--LLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTLGN 251
           LSF F+  LLA+  + +    AD++P+ V DIDG+KLR G NYYI+PV+R  GGGLT+  
Sbjct: 9   LSFFFVSALLAAAANPL-LSAADSAPNPVLDIDGEKLRTGTNYYIVPVLRDHGGGLTVSA 68

Query: 252 LQSE---KCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTIC--ATST 311
                   CP  VVQ + E+ +  P  F P NPK+ VVRVSTDLN+ F A   C   +ST
Sbjct: 69  TTPNGTFVCPPRVVQTRKEVDHDRPLAFFPENPKEDVVRVSTDLNINFSAFMPCRWTSST 128

Query: 312 VWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKH-GSD-YKFVFCPTVCDFCKVM 371
           VW+LDK DESTGQ+FVTIGG +GNPG ET+ +WFKIE+  GS  YK VFCPTVC  CKV 
Sbjct: 129 VWRLDKYDESTGQYFVTIGGVKGNPGPETISSWFKIEEFCGSGFYKLVFCPTVCGSCKVK 188

Query: 372 CRDVGIFF-KNGKRALALSDTPFPVMFKK 391
           C DVGI+  + G+R LALSD PF   F K
Sbjct: 189 CGDVGIYIDQKGRRRLALSDKPFAFEFNK 216

BLAST of Tan0014729 vs. ExPASy Swiss-Prot
Match: P32765 (21 kDa seed protein OS=Theobroma cacao OX=3641 GN=ASP PE=2 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 2.2e-43
Identity = 94/201 (46.77%), Postives = 122/201 (60.70%), Query Frame = 0

Query: 195 LFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGR-GGGLTLGNLQSE 254
           + +L A T     F  A+A+   V D DG +L+ GV YY+L  I G  GGGL LG    +
Sbjct: 8   VLLLFAFTSKSYFFGVANAANSPVLDTDGDELQTGVQYYVLSSISGAGGGGLALGRATGQ 67

Query: 255 KCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFE--ASTICATSTVWKLDKI 314
            CP  VVQ + +L NG P  F+  + K  VVRVSTD+N++F      +C+TSTVW+LD  
Sbjct: 68  SCPEIVVQRRSDLDNGTPVIFSNADSKDDVVRVSTDVNIEFVPIRDRLCSTSTVWRLDNY 127

Query: 315 DESTGQWFVTIGGTRGNPGRETLDNWFKIEKHG-SDYKFVFCPTVCDFCKVMCRDVGIFF 374
           D S G+W+VT  G +G PG  TL +WFKIEK G   YKF FCP+VCD C  +C D+G   
Sbjct: 128 DNSAGKWWVTTDGVKGEPGPNTLCSWFKIEKAGVLGYKFRFCPSVCDSCTTLCSDIGRHS 187

Query: 375 -KNGKRALALSDTPFPVMFKK 391
             +G+  LALSD  +  MFKK
Sbjct: 188 DDDGQIRLALSDNEWAWMFKK 208

BLAST of Tan0014729 vs. ExPASy Swiss-Prot
Match: Q8RXD5 (Kunitz trypsin inhibitor 4 OS=Arabidopsis thaliana OX=3702 GN=KTI4 PE=2 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 5.7e-36
Identity = 84/177 (47.46%), Postives = 107/177 (60.45%), Query Frame = 0

Query: 217 AVRDIDGKKLRAGVNYYILPVIRGRGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAP 276
           AV DIDG  +    +YY+LPVIRGRGGGLTL     + CP ++VQE  E+  G+P  F+ 
Sbjct: 29  AVVDIDGNAM-FHESYYVLPVIRGRGGGLTLAGRGGQPCPYDIVQESSEVDEGIPVKFSN 88

Query: 277 VNPKKGVVRVSTDLNVQFE-ASTICATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLD 336
              K   V  S +LN++ +  +TIC  ST W++ + D    Q+FV  G      G+++L 
Sbjct: 89  WRLKVAFVPESQNLNIETDVGATICIQSTYWRVGEFDHERKQYFVVAGPKPEGFGQDSLK 148

Query: 337 NWFKIEKHGSD-YKFVFCPTVCDFCKVMCRDVGIFFKN-GKRALALSDTPFPVMFKK 391
           ++FKIEK G D YKFVFCP  CD     C DVGIF    G R LALSD PF VMFKK
Sbjct: 149 SFFKIEKSGEDAYKFVFCPRTCDSGNPKCSDVGIFIDELGVRRLALSDKPFLVMFKK 204

BLAST of Tan0014729 vs. ExPASy Swiss-Prot
Match: P83667 (Kunitz-type serine protease inhibitor DrTI OS=Delonix regia OX=72433 PE=1 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.5e-23
Identity = 67/180 (37.22%), Postives = 99/180 (55.00%), Query Frame = 0

Query: 216 DAVRDIDGKKLRAGVNYYILPVIRGR-GGGLTLGNLQSEKCPVNVVQEQFELMNGLPATF 275
           + V DI+G  +  G  YYI+  I G  GGG+  G  +   CP++++QEQ +L  GLP  F
Sbjct: 4   EKVYDIEGYPVFLGSEYYIVSAIIGAGGGGVRPGRTRGSMCPMSIIQEQSDLQMGLPVRF 63

Query: 276 APVNPKKGVVRVSTDLNVQFEASTICATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETL 335
           +     +G +   T+L ++F     CA S+ W + K    +G+  V IGG+  +P  E +
Sbjct: 64  SSPEESQGKIYTDTELEIEFVEKPDCAESSKWVIVK---DSGEARVAIGGSEDHPQGELV 123

Query: 336 DNWFKIEKHGS-DYKFVFCPTVCDFCKVMCRDVGIFFKNGKRALAL---SDTPFPVMFKK 391
             +FKIEK GS  YK VFCP         C D+GI ++ G+R+L L    D+PF V+F K
Sbjct: 124 RGFFKIEKLGSLAYKLVFCP---KSSSGSCSDIGINYE-GRRSLVLKSSDDSPFRVVFVK 176

BLAST of Tan0014729 vs. NCBI nr
Match: KAF9664255.1 (hypothetical protein SADUNF_Sadunf17G0137100 [Salix dunnii])

HSP 1 Score: 385.6 bits (989), Expect = 5.2e-103
Identity = 204/410 (49.76%), Postives = 263/410 (64.15%), Query Frame = 0

Query: 6   ILSFLFFVLASTE-VHFCRADASPDAVVDIDGKKLRAGDNYYILPVFRRNIGGVAIGGIP 65
           +LSFL   LA+ + +    A+A+PD V+D++GK L  G +YYILPV R   GG+ +    
Sbjct: 7   LLSFLLSALAANQYLPRVAANAAPDPVLDVNGKILTTGSSYYILPVIRGRGGGLKMAST- 66

Query: 66  GYNNQCPINVVPETYEASNGAPTIFTPINPKKGVVRVSTDLNIQFEANTKCAKSTVWKIG 125
                CP++VV + YEASNG P  FTP+N KKGV+RV TDLNI+F A + C +STVWK+ 
Sbjct: 67  -VRKTCPLDVVQDRYEASNGLPLKFTPVNSKKGVIRVRTDLNIKFSAPSICHQSTVWKLD 126

Query: 126 KFDEHMRQYFVTIGGTKGNPGRETLESWFKIEKHGNNNYYKFVYCPTVCKYCKVICKNVG 185
            +DE  +Q+FVT  G +GNPG ET  +WFKIEK    N YK V+CPTVC++CKV+CK+VG
Sbjct: 127 SYDEWAKQWFVTTNGVEGNPGPETTSNWFKIEKF--QNKYKLVFCPTVCRHCKVMCKDVG 186

Query: 186 LF-------------------YNFALLSFLFIL-LASTESEVRFCRADASPDAVRDIDGK 245
           ++                    N+ LL   F+L  A T+  +      ++ + V DIDG+
Sbjct: 187 IYIDAKGARRLALSNVPFKIKMNYPLLLLCFLLAFACTKQSI------SAAEPVLDIDGE 246

Query: 246 KLRAGVNYYILPVIRGRGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAP-VNPKKGV 305
           KL AG  YYILPV RGRGGG+T+       CP+ VVQ++ EL  GLP TF P V+ KKGV
Sbjct: 247 KLVAGTKYYILPVFRGRGGGITMAR-NKTSCPLAVVQDRLELSKGLPLTFTPAVDSKKGV 306

Query: 306 VRVSTDLNVQFEASTICATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKH 365
           + VSTDLN++F A+T C  STVWK+ K   S  QWFV+ GG  GNPG  T+ NWFKIEK 
Sbjct: 307 ILVSTDLNIKFLATTTCPQSTVWKIIKSSNSKVQWFVSTGGVEGNPGFNTVTNWFKIEKA 366

Query: 366 GSDYKFVFCPTVCDFCKVMCRDVGIFFK-NGKRALALSDT--PFPVMFKK 391
             DYK VFCPT    C V+CRD+GI+ + NG R L+LSD   PF V FKK
Sbjct: 367 DGDYKLVFCPTKVCNCGVLCRDIGIYIEDNGTRTLSLSDALQPFKVQFKK 405

BLAST of Tan0014729 vs. NCBI nr
Match: XP_022923703.1 (kunitz trypsin inhibitor 2 [Cucurbita moschata] >XP_023001808.1 kunitz trypsin inhibitor 2 [Cucurbita maxima] >KAG6584289.1 Kunitz trypsin inhibitor 5, partial [Cucurbita argyrosperma subsp. sororia] >KAG7019882.1 Kunitz trypsin inhibitor 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 382.1 bits (980), Expect = 5.7e-102
Identity = 184/204 (90.20%), Postives = 191/204 (93.63%), Query Frame = 0

Query: 188 NFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTL 247
           NFALLSFLFILLA   SEVR  RADASPDAVRDIDGKKLRAGVNYYILPV RGRGGGL L
Sbjct: 3   NFALLSFLFILLA---SEVRVSRADASPDAVRDIDGKKLRAGVNYYILPVFRGRGGGLAL 62

Query: 248 GNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 307
           GNLQS+KCP+NVVQEQFELMNGLPA F PVNPKKGVVRVSTDLNVQFEASTICATSTVWK
Sbjct: 63  GNLQSDKCPLNVVQEQFELMNGLPAAFLPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 122

Query: 308 LDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVG 367
           LDK DEST QWF+TIGGTRGNPG +T+DNWFKIEKHG+DYKF FCPTVCDFCKVMCRDVG
Sbjct: 123 LDKFDESTKQWFITIGGTRGNPGVKTVDNWFKIEKHGNDYKFKFCPTVCDFCKVMCRDVG 182

Query: 368 IFFKNGKRALALSDTPFPVMFKKV 392
           IFFKNGKRALALSDTPFPVMFK+V
Sbjct: 183 IFFKNGKRALALSDTPFPVMFKEV 203

BLAST of Tan0014729 vs. NCBI nr
Match: XP_038895203.1 (kunitz trypsin inhibitor 5-like [Benincasa hispida])

HSP 1 Score: 382.1 bits (980), Expect = 5.7e-102
Identity = 186/211 (88.15%), Postives = 193/211 (91.47%), Query Frame = 0

Query: 181 KNVGLFYNFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRG 240
           KN G+     LLSFLFILLAST  EVRF RADASP+AVRDIDGKKLRAGVNYYILPVIRG
Sbjct: 2   KNFGI-----LLSFLFILLAST--EVRFSRADASPEAVRDIDGKKLRAGVNYYILPVIRG 61

Query: 241 RGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTIC 300
           RGGGLTL NLQSE CPVNVVQEQFELMNG P TF PVNPKKGVVRVSTDLNVQFEASTIC
Sbjct: 62  RGGGLTLSNLQSENCPVNVVQEQFELMNGFPTTFLPVNPKKGVVRVSTDLNVQFEASTIC 121

Query: 301 ATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCK 360
           ATSTVWKLDK+DESTGQWFVTIGG+RGNPG ET+DNWFKI KHG DYK VFCP+VCDFCK
Sbjct: 122 ATSTVWKLDKVDESTGQWFVTIGGSRGNPGVETVDNWFKIVKHGKDYKLVFCPSVCDFCK 181

Query: 361 VMCRDVGIFFKNGKRALALSDTPFPVMFKKV 392
           VMCRD+GIFFKNGKRALALSDTPFPVMFKKV
Sbjct: 182 VMCRDIGIFFKNGKRALALSDTPFPVMFKKV 205

BLAST of Tan0014729 vs. NCBI nr
Match: KAF9664260.1 (hypothetical protein SADUNF_Sadunf17G0137600 [Salix dunnii])

HSP 1 Score: 379.0 bits (972), Expect = 4.9e-101
Identity = 196/394 (49.75%), Postives = 256/394 (64.97%), Query Frame = 0

Query: 6   ILSFLFFVLASTE-VHFCRADASPDAVVDIDGKKLRAGDNYYILPVFRRNIGGVAIGGIP 65
           +LSFL   LA+ + +    A ++PD V+D++GK L  G +YYILPV R   GG+ +    
Sbjct: 64  LLSFLLSALAANQYLPRVAAASAPDPVLDVNGKILTTGSSYYILPVIRGRGGGLKMAST- 123

Query: 66  GYNNQCPINVVPETYEASNGAPTIFTPINPKKGVVRVSTDLNIQFEANTKCAKSTVWKIG 125
                CP++VV + YEASNG P  FTP+N KKGV+RV TDLNI+F A + C +STVWK+ 
Sbjct: 124 -VRKTCPLDVVQDRYEASNGLPLKFTPVNSKKGVIRVHTDLNIKFSAASICHQSTVWKLD 183

Query: 126 KFDEHMRQYFVTIGGTKGNPGRETLESWFKIEKHGNNNYYKFVYCPTVCKYCKVICKNVG 185
            +DE  +Q+FVT  G +GNPG ET  +WFKIEK    N YK V+CPTVC++CKV+CK+VG
Sbjct: 184 SYDEWAKQWFVTTNGVEGNPGPETTSNWFKIEKF--QNKYKLVFCPTVCRHCKVMCKDVG 243

Query: 186 LFYN------FALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVI 245
           ++ +       AL    F ++ +         A A+PD V D++GK L  G +YYILPVI
Sbjct: 244 IYIDAKGERRLALSDVPFKVMVA---------ATAAPDPVLDVNGKILTIGTSYYILPVI 303

Query: 246 RGR-GGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEAS 305
           RGR GGGL + +   + CP++VVQ+++E  N             GV+RV TDLN++F A 
Sbjct: 304 RGRGGGGLKMASTVRKTCPLDVVQDRYEASN-------------GVIRVHTDLNIKFSAP 363

Query: 306 TICATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCD 365
           +IC  STVWKLD  DE   QWFVT  G  GNPG ET  NWFKIEK  + YK VFCPTVC 
Sbjct: 364 SICHQSTVWKLDSYDEWAKQWFVTTNGVEGNPGPETTSNWFKIEKFQNKYKLVFCPTVCR 423

Query: 366 FCKVMCRDVGIFF-KNGKRALALSDTPFPVMFKK 391
            CKVMC+D+GI+    G+R LALS+ PF VMFK+
Sbjct: 424 HCKVMCKDIGIYIDAKGERRLALSNVPFKVMFKR 431

BLAST of Tan0014729 vs. NCBI nr
Match: XP_023520252.1 (kunitz trypsin inhibitor 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 379.0 bits (972), Expect = 4.9e-101
Identity = 183/204 (89.71%), Postives = 189/204 (92.65%), Query Frame = 0

Query: 188 NFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTL 247
           NFALLSFLFILLA   SEVR  RADASPDAVRDIDGKKLRAGVNYYILPV RGRGGGL L
Sbjct: 3   NFALLSFLFILLA---SEVRVSRADASPDAVRDIDGKKLRAGVNYYILPVFRGRGGGLAL 62

Query: 248 GNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 307
           GNLQS+KCP+NVVQEQFELMNGLPA F PVNPKKGVVRVSTDLNVQFEASTICATSTVWK
Sbjct: 63  GNLQSDKCPLNVVQEQFELMNGLPAAFLPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 122

Query: 308 LDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVG 367
           LDK DEST QWF+TIGG RGNPG +T+DNWFKIEKHG+DYKF FCPTVCDFCKVMCRDVG
Sbjct: 123 LDKFDESTKQWFITIGGARGNPGVKTVDNWFKIEKHGNDYKFKFCPTVCDFCKVMCRDVG 182

Query: 368 IFFKNGKRALALSDTPFPVMFKKV 392
           IFFKNGKRALALSDTPFPVMFK V
Sbjct: 183 IFFKNGKRALALSDTPFPVMFKVV 203

BLAST of Tan0014729 vs. ExPASy TrEMBL
Match: A0A6J1KRM6 (kunitz trypsin inhibitor 2 OS=Cucurbita maxima OX=3661 GN=LOC111495838 PE=3 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 2.8e-102
Identity = 184/204 (90.20%), Postives = 191/204 (93.63%), Query Frame = 0

Query: 188 NFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTL 247
           NFALLSFLFILLA   SEVR  RADASPDAVRDIDGKKLRAGVNYYILPV RGRGGGL L
Sbjct: 3   NFALLSFLFILLA---SEVRVSRADASPDAVRDIDGKKLRAGVNYYILPVFRGRGGGLAL 62

Query: 248 GNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 307
           GNLQS+KCP+NVVQEQFELMNGLPA F PVNPKKGVVRVSTDLNVQFEASTICATSTVWK
Sbjct: 63  GNLQSDKCPLNVVQEQFELMNGLPAAFLPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 122

Query: 308 LDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVG 367
           LDK DEST QWF+TIGGTRGNPG +T+DNWFKIEKHG+DYKF FCPTVCDFCKVMCRDVG
Sbjct: 123 LDKFDESTKQWFITIGGTRGNPGVKTVDNWFKIEKHGNDYKFKFCPTVCDFCKVMCRDVG 182

Query: 368 IFFKNGKRALALSDTPFPVMFKKV 392
           IFFKNGKRALALSDTPFPVMFK+V
Sbjct: 183 IFFKNGKRALALSDTPFPVMFKEV 203

BLAST of Tan0014729 vs. ExPASy TrEMBL
Match: A0A6J1EAD6 (kunitz trypsin inhibitor 2 OS=Cucurbita moschata OX=3662 GN=LOC111431333 PE=3 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 2.8e-102
Identity = 184/204 (90.20%), Postives = 191/204 (93.63%), Query Frame = 0

Query: 188 NFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTL 247
           NFALLSFLFILLA   SEVR  RADASPDAVRDIDGKKLRAGVNYYILPV RGRGGGL L
Sbjct: 3   NFALLSFLFILLA---SEVRVSRADASPDAVRDIDGKKLRAGVNYYILPVFRGRGGGLAL 62

Query: 248 GNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 307
           GNLQS+KCP+NVVQEQFELMNGLPA F PVNPKKGVVRVSTDLNVQFEASTICATSTVWK
Sbjct: 63  GNLQSDKCPLNVVQEQFELMNGLPAAFLPVNPKKGVVRVSTDLNVQFEASTICATSTVWK 122

Query: 308 LDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVG 367
           LDK DEST QWF+TIGGTRGNPG +T+DNWFKIEKHG+DYKF FCPTVCDFCKVMCRDVG
Sbjct: 123 LDKFDESTKQWFITIGGTRGNPGVKTVDNWFKIEKHGNDYKFKFCPTVCDFCKVMCRDVG 182

Query: 368 IFFKNGKRALALSDTPFPVMFKKV 392
           IFFKNGKRALALSDTPFPVMFK+V
Sbjct: 183 IFFKNGKRALALSDTPFPVMFKEV 203

BLAST of Tan0014729 vs. ExPASy TrEMBL
Match: A0A5J5A047 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_009107 PE=3 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 8.9e-101
Identity = 189/387 (48.84%), Postives = 255/387 (65.89%), Query Frame = 0

Query: 6   ILSFLFFVLASTEVHFCRADASPDAVVDIDGKKLRAGDNYYILPVFRRNIGGVAIGGIPG 65
           +LSF  F  ++  +     D + + V+D+ GK ++ G +YYILPV R   GG+ +     
Sbjct: 7   LLSFFLFAFSTNPLLGVADDDTRNPVLDVAGKTVQTGVDYYILPVVRGRGGGLTLASNRN 66

Query: 66  YNNQCPINVVPETYEASNGAPTIFTPINPKKGVVRVSTDLNIQFEANTKCAKSTVWKIGK 125
            +N CP++VV E  E +NG P  F+P++  +GVVR +TDLNI+F A T CA+STVW++G 
Sbjct: 67  GSN-CPLDVVQEQQEVNNGLPVTFSPVS-NEGVVRETTDLNIKFSAATICAQSTVWQLGD 126

Query: 126 FDEHMRQYFVTIGGTKGNPGRETLESWFKIEKHGNNNYYKFVYCPTVCKYCKVICKNVGL 185
           FD  + + FVT GG +GNPGRETL +WF I+K+ ++  YK V+CPTVC  C+  C ++G+
Sbjct: 127 FDNSVGRSFVTTGGVEGNPGRETLSNWFNIQKYDDD--YKLVFCPTVCNICRPRCGDLGI 186

Query: 186 FYNFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGL 245
           F    +                  R D  P+ V D+ G  ++ GV+YYILPV+RG GGGL
Sbjct: 187 FIENGI------------------RPDDEPNPVLDVAGNTVQTGVDYYILPVVRGSGGGL 246

Query: 246 TLGNLQS-EKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATST 305
           TL N ++   CP++VVQEQ E+ NGLP TF+PV   +GVVR  TDLN+Q  A+TIC  S 
Sbjct: 247 TLANNRNGSNCPLDVVQEQQEVDNGLPLTFSPVT-GEGVVREITDLNIQSSAATICIQSL 306

Query: 306 VWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCR 365
           VWKL   DES G+ FVT GG  GNPG+ETL NWF+IEK  +DYK VFCP+VCD C+ +C 
Sbjct: 307 VWKLGDFDESVGRSFVTTGGVEGNPGQETLSNWFRIEKDDNDYKIVFCPSVCDICRPLCG 366

Query: 366 DVGIFFKNGKRALALSDTPFPVMFKKV 392
           D+GIF +NG R LALSD P  VMF++V
Sbjct: 367 DIGIFIENGIRRLALSDEPLRVMFRRV 370

BLAST of Tan0014729 vs. ExPASy TrEMBL
Match: A0A5D3BIG0 (Miraculin-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002770 PE=3 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 2.2e-99
Identity = 180/211 (85.31%), Postives = 189/211 (89.57%), Query Frame = 0

Query: 181 KNVGLFYNFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRG 240
           KN G+FY      F+FILLAST  E+RF  ADASP+AV DIDGKKLRAGVNYYILPV RG
Sbjct: 3   KNFGIFY------FIFILLAST--ELRFSTADASPEAVLDIDGKKLRAGVNYYILPVFRG 62

Query: 241 RGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTIC 300
           RGGGLTLGNLQSE CPVNVVQEQFELMNG P TF PVNPKKGVVRVSTDLNVQF+ASTIC
Sbjct: 63  RGGGLTLGNLQSEICPVNVVQEQFELMNGFPTTFHPVNPKKGVVRVSTDLNVQFDASTIC 122

Query: 301 ATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCK 360
            TSTVWKLDK DESTGQWFVTIGG+RGNPG ET+DNWFKIEKHG DYK VFCPTVC+FCK
Sbjct: 123 VTSTVWKLDKFDESTGQWFVTIGGSRGNPGVETVDNWFKIEKHGKDYKLVFCPTVCNFCK 182

Query: 361 VMCRDVGIFFKNGKRALALSDTPFPVMFKKV 392
           VMCRD+GIFFKNGKRALALSDTPFPVMFKKV
Sbjct: 183 VMCRDIGIFFKNGKRALALSDTPFPVMFKKV 205

BLAST of Tan0014729 vs. ExPASy TrEMBL
Match: A0A1S3ASR3 (miraculin-like OS=Cucumis melo OX=3656 GN=LOC103482596 PE=3 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 2.2e-99
Identity = 180/211 (85.31%), Postives = 189/211 (89.57%), Query Frame = 0

Query: 181 KNVGLFYNFALLSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRG 240
           KN G+FY      F+FILLAST  E+RF  ADASP+AV DIDGKKLRAGVNYYILPV RG
Sbjct: 3   KNFGIFY------FIFILLAST--ELRFSTADASPEAVLDIDGKKLRAGVNYYILPVFRG 62

Query: 241 RGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTIC 300
           RGGGLTLGNLQSE CPVNVVQEQFELMNG P TF PVNPKKGVVRVSTDLNVQF+ASTIC
Sbjct: 63  RGGGLTLGNLQSEICPVNVVQEQFELMNGFPTTFHPVNPKKGVVRVSTDLNVQFDASTIC 122

Query: 301 ATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCK 360
            TSTVWKLDK DESTGQWFVTIGG+RGNPG ET+DNWFKIEKHG DYK VFCPTVC+FCK
Sbjct: 123 VTSTVWKLDKFDESTGQWFVTIGGSRGNPGVETVDNWFKIEKHGKDYKLVFCPTVCNFCK 182

Query: 361 VMCRDVGIFFKNGKRALALSDTPFPVMFKKV 392
           VMCRD+GIFFKNGKRALALSDTPFPVMFKKV
Sbjct: 183 VMCRDIGIFFKNGKRALALSDTPFPVMFKKV 205

BLAST of Tan0014729 vs. TAIR 10
Match: AT1G17860.1 (Kunitz family trypsin and protease inhibitor protein )

HSP 1 Score: 229.6 bits (584), Expect = 4.4e-60
Identity = 107/199 (53.77%), Postives = 146/199 (73.37%), Query Frame = 0

Query: 192 LSFLFILLASTESEVRFCRADASPDAVRDIDGKKLRAGVNYYILPVIRGRGGGLTLGNLQ 251
           L ++F+LLA   S  R    +A+ + V+DI+GK L  GVNYYILPVIRGRGGGLT+ NL+
Sbjct: 4   LLYIFLLLAVFISH-RGVTTEAAVEPVKDINGKSLLTGVNYYILPVIRGRGGGLTMSNLK 63

Query: 252 SEKCPVNVVQEQFELMNGLPATFAPVNPKKGVVRVSTDLNVQFEASTICATSTVWKLDKI 311
           +E CP +V+Q+QFE+  GLP  F+P + K   + VSTD+N++F      + +++W+L   
Sbjct: 64  TETCPTSVIQDQFEVSQGLPVKFSPYD-KSRTIPVSTDVNIKF------SPTSIWELANF 123

Query: 312 DESTGQWFVTIGGTRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVGIFFK 371
           DE+T QWF++  G  GNPG++T+DNWFKI+K   DYK  FCPTVC+FCKV+CRDVG+F +
Sbjct: 124 DETTKQWFISTCGVEGNPGQKTVDNWFKIDKFEKDYKIRFCPTVCNFCKVICRDVGVFVQ 183

Query: 372 NGKRALALSDTPFPVMFKK 391
           +GKR LALSD P  VMFK+
Sbjct: 184 DGKRRLALSDVPLKVMFKR 194

BLAST of Tan0014729 vs. TAIR 10
Match: AT1G73260.1 (kunitz trypsin inhibitor 1 )

HSP 1 Score: 153.3 bits (386), Expect = 4.0e-37
Identity = 84/177 (47.46%), Postives = 107/177 (60.45%), Query Frame = 0

Query: 217 AVRDIDGKKLRAGVNYYILPVIRGRGGGLTLGNLQSEKCPVNVVQEQFELMNGLPATFAP 276
           AV DIDG  +    +YY+LPVIRGRGGGLTL     + CP ++VQE  E+  G+P  F+ 
Sbjct: 29  AVVDIDGNAM-FHESYYVLPVIRGRGGGLTLAGRGGQPCPYDIVQESSEVDEGIPVKFSN 88

Query: 277 VNPKKGVVRVSTDLNVQFE-ASTICATSTVWKLDKIDESTGQWFVTIGGTRGNPGRETLD 336
              K   V  S +LN++ +  +TIC  ST W++ + D    Q+FV  G      G+++L 
Sbjct: 89  WRLKVAFVPESQNLNIETDVGATICIQSTYWRVGEFDHERKQYFVVAGPKPEGFGQDSLK 148

Query: 337 NWFKIEKHGSD-YKFVFCPTVCDFCKVMCRDVGIFFKN-GKRALALSDTPFPVMFKK 391
           ++FKIEK G D YKFVFCP  CD     C DVGIF    G R LALSD PF VMFKK
Sbjct: 149 SFFKIEKSGEDAYKFVFCPRTCDSGNPKCSDVGIFIDELGVRRLALSDKPFLVMFKK 204

BLAST of Tan0014729 vs. TAIR 10
Match: AT1G73325.1 (Kunitz family trypsin and protease inhibitor protein )

HSP 1 Score: 102.1 bits (253), Expect = 1.1e-21
Identity = 68/188 (36.17%), Postives = 106/188 (56.38%), Query Frame = 0

Query: 211 ADASP-DAVRDIDGKKLRAGVNYYILPVIRGRGGGL--TLGNLQSEKCPVN--VVQEQFE 270
           ADA+P   V DI G  +++ V YYI+P   G GGGL  +  NL ++   +N  +VQ    
Sbjct: 24  ADATPSQVVLDIAGHPVQSNVQYYIIPAKIGTGGGLIPSNRNLSTQDLCLNLDIVQSSSP 83

Query: 271 LMNGLPATFAPVNPKKGVVRVSTDLNVQFEAST-ICATSTVWKLDKIDESTGQWFVTIGG 330
            ++GLP TF+P+N K   V++S  LN++F+++  +C  S VW++D       + FV+IGG
Sbjct: 84  FVSGLPVTFSPLNTKVKHVQLSASLNLEFDSTVWLCPDSKVWRIDH-SVQLRKSFVSIGG 143

Query: 331 TRGNPGRETLDNWFKIEKHGSDYKFVFCPTVCDFCKVMCRDVGI-FFKNGKRALALS-DT 390
            +G       ++WF+I++ G  YK ++CP       V C +V +    +G R L LS D 
Sbjct: 144 QKGKG-----NSWFQIQEDGDAYKLMYCPI---SSIVACINVSLEIDDHGVRRLVLSTDQ 202

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LMU26.2e-5953.77Kunitz trypsin inhibitor 5 OS=Arabidopsis thaliana OX=3702 GN=KTI5 PE=2 SV=1[more]
P130871.8e-5355.02Miraculin OS=Synsepalum dulcificum OX=3743 PE=1 SV=3[more]
P327652.2e-4346.7721 kDa seed protein OS=Theobroma cacao OX=3641 GN=ASP PE=2 SV=1[more]
Q8RXD55.7e-3647.46Kunitz trypsin inhibitor 4 OS=Arabidopsis thaliana OX=3702 GN=KTI4 PE=2 SV=1[more]
P836672.5e-2337.22Kunitz-type serine protease inhibitor DrTI OS=Delonix regia OX=72433 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
KAF9664255.15.2e-10349.76hypothetical protein SADUNF_Sadunf17G0137100 [Salix dunnii][more]
XP_022923703.15.7e-10290.20kunitz trypsin inhibitor 2 [Cucurbita moschata] >XP_023001808.1 kunitz trypsin i... [more]
XP_038895203.15.7e-10288.15kunitz trypsin inhibitor 5-like [Benincasa hispida][more]
KAF9664260.14.9e-10149.75hypothetical protein SADUNF_Sadunf17G0137600 [Salix dunnii][more]
XP_023520252.14.9e-10189.71kunitz trypsin inhibitor 2 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1KRM62.8e-10290.20kunitz trypsin inhibitor 2 OS=Cucurbita maxima OX=3661 GN=LOC111495838 PE=3 SV=1[more]
A0A6J1EAD62.8e-10290.20kunitz trypsin inhibitor 2 OS=Cucurbita moschata OX=3662 GN=LOC111431333 PE=3 SV... [more]
A0A5J5A0478.9e-10148.84Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_009107 PE=3 SV=1[more]
A0A5D3BIG02.2e-9985.31Miraculin-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G00277... [more]
A0A1S3ASR32.2e-9985.31miraculin-like OS=Cucumis melo OX=3656 GN=LOC103482596 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G17860.14.4e-6053.77Kunitz family trypsin and protease inhibitor protein [more]
AT1G73260.14.0e-3747.46kunitz trypsin inhibitor 1 [more]
AT1G73325.11.1e-2136.17Kunitz family trypsin and protease inhibitor protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002160Proteinase inhibitor I3, Kunitz legumePRINTSPR00291KUNITZINHBTRcoord: 360..389
score: 25.95
coord: 255..275
score: 31.37
coord: 217..246
score: 44.71
coord: 337..356
score: 25.98
IPR002160Proteinase inhibitor I3, Kunitz legumeSMARTSM00452kul_2coord: 217..391
e-value: 3.6E-72
score: 255.7
coord: 30..212
e-value: 1.1E-55
score: 201.0
IPR002160Proteinase inhibitor I3, Kunitz legumePFAMPF00197Kunitz_legumecoord: 218..390
e-value: 1.4E-58
score: 197.7
coord: 31..208
e-value: 7.4E-47
score: 159.5
IPR002160Proteinase inhibitor I3, Kunitz legumePANTHERPTHR33107KUNITZ TRYPSIN INHIBITOR 2coord: 193..391
IPR002160Proteinase inhibitor I3, Kunitz legumePANTHERPTHR33107KUNITZ TRYPSIN INHIBITOR 2coord: 8..186
IPR002160Proteinase inhibitor I3, Kunitz legumePROSITEPS00283SOYBEAN_KUNITZcoord: 31..47
IPR002160Proteinase inhibitor I3, Kunitz legumePROSITEPS00283SOYBEAN_KUNITZcoord: 218..234
IPR002160Proteinase inhibitor I3, Kunitz legumeCDDcd00178STIcoord: 217..390
e-value: 7.60853E-55
score: 176.764
IPR002160Proteinase inhibitor I3, Kunitz legumeCDDcd00178STIcoord: 30..188
e-value: 3.72413E-52
score: 169.83
NoneNo IPR availableGENE3D2.80.10.50coord: 214..391
e-value: 2.5E-75
score: 254.0
NoneNo IPR availableGENE3D2.80.10.50coord: 29..213
e-value: 5.3E-59
score: 200.9
NoneNo IPR availablePANTHERPTHR33107:SF5KUNITZ TRYPSIN INHIBITOR 2coord: 8..186
NoneNo IPR availablePANTHERPTHR33107:SF5KUNITZ TRYPSIN INHIBITOR 2coord: 193..391
IPR011065Kunitz inhibitor STI-like superfamilySUPERFAMILY50386STI-likecoord: 27..184
IPR011065Kunitz inhibitor STI-like superfamilySUPERFAMILY50386STI-likecoord: 214..391

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0014729.1Tan0014729.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010951 negative regulation of endopeptidase activity
molecular_function GO:0004866 endopeptidase inhibitor activity