Tan0002099 (gene) Snake gourd v1

Overview
NameTan0002099
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG10: 19340939 .. 19357819 (+)
RNA-Seq ExpressionTan0002099
SyntenyTan0002099
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCGCTTTAGGTTTATAACGTTAACTTCTCTGCTCTCTCTCTCTCTCTCTCTCTTCTTCTAGCCGCGCCGAAATTTCTCTCTCCGCACTCTCTCCCGCGCTCACGCCTCTCTCTCCTCTCTCTTTCTCTCTTCCGTTTCATTGCTCCTCCATAGCTTTTTTTCACTCCCTCGCCTTTCTACATTGCTCTTTCTTGCCTCCTCTGCTTCTTCCTTCTTCGGTCTGCTGCATTTTCTTTCTCTGTTGCTGTCCTCTGACTCTCGCTATGTAATTCCTTCGTTCCTTCTTCTTATGTGATCTTTGAGTGGATTCCATTGCTGGGTTTGGAAATTGAAATTCGTTTGTGCATCGTCTTTGTGTTTTTCGGCAAAGTTGTTTTATACTGAATGTATGAGGTGCTGTGATTTTGGGGTTGTTGTCGTTTTTTTTTTTGTACTTGGAATTCTTTTTTTTTTTTCCCTTTCTTTCTTGTCTGAACTTCTATGCGGATTGTTTTGGCATGTTTGGGTCTGAGTGTACTGAAGTAAACTTTTTGGAGTGGGACATGTAAATTGGGATGGAATTAAGAAGAAGGAAAAGTGCAGCTGTTGGTGTTCAAGTTGTAGTGAAATAATGGATGTTCAGCGTAATTACTCTGCTAGACGTTCTAATTTAGGCTCTATTGTTGAGTAGCTTTGGGTGACAAGATGCCGGAAGTTATAAGTTTATTCTAGGCTGCTTACCTTTGTGAAAGCTAGGCGAGATTAATTTCTAACATGACTTATGAAGAACTTGAAACAATTTTCACTATGAAGGATTTGTGGGTCTGTTTTTCCGTTCCATAAGAAAGAATTACCACCACTTCCCTCCATTTTGAAGGGTTGGTAGCGTGTTGTTTAAGTGACTAGAATGGGATGACTTGAGAACCATTGAATTATTGGGGATAAGCTATCCTGAGGTCGTGGCATTTATTTGTCATGTAAATGTCCGCCTATTTGATTTGAAAGGGAGTGCGGAGTGCCAATTTAAATATTGAGTTTACAACTTGCACATTGAATTCAGTGCCCGCTATTGTCTTAGTAAATTTAATTTCATGATATATATTAATACGTTTTCTTTTGGACTTTTTTTTTTGACAGAATATAATTGTGCTAAATCATTGATGGACCTCTATCTGCACAAGACAATGGATTAAAGAAAAATGCTGCACGTATTAAGTTGGAGATTTTCTTGGTAAATTCTAGTGGGAAAGGCAGGGGCAGGAGCTAGTCTGGGAAACCATTTCTAGGAATGGAAACTCATAGCAAAGGGAAGTGCCGGGAGGATTTCGAGGTTGATATAATTGAATGTTCAAATAAAACTGATCCCAAATTTTGTGGGAAAGAAGACCCTGATGCAACTGAATATTCAAGCTCATTTGCTGAGACATCTGATGCGGATAATTGTTCAGGATTCAACGAAGGAGAAGTAGAAACTCAATTTTTTGGAGACATTGGCTTGCCACCCACATTTGGTTCATTTAGTGGTGCACTTCCAATAAGGTCCTTTTTCTTTTGGCTGCGATTATAATCTATGTTTTAGCGTCATCGTATATATTTTGTTAGAAGAAAAACTATATGAAAGTGGTAGGATTATGAGTCTAACATGATATCCTTGCTTCTCACTTTTCTGAAACGTTTCAGTTGGCCTTGATTGCACTGCTTAATGATATCTATAAATTATAAATACCTGATCTTTGAAAGTTGAAATATATGATTCTATGTTTTTTTTTGTATAGGATATTTTCACCTTGAAATCTTAGCAATTTAGGAACTTTCAAGTTTCAATTTAGTGGGCGTGAAAACAGAAGATTGGTATAACTTTTAATTATGCATACAAATTAAAAGCTTCAACTCTTTGAACTTGAAGAACTATGAGACCCATCAGCTCCTCTGTAATCAGTTTGCTTGTGAATGTGATGAATCAATATTCAATAACATTATAGTAAAAGTCATGCAAAAATTTTGGAGGAAATGGTGCCCCATGGATTATTTTTGTTTATTTTACAATGCACACTATATTCATTCCTGCATTTTGTGTTAAGAGTATTGCTTTTGACTGTAGAATGGTTCTATACCATCATACTTGAAGTCATTGATTATATAAAAATGATACTAATTTCATTTTTTCAATGAAATTTCTTTCCTATTGCTATGGAAAAGTAGAAAAACTTTCCAATTTGGAAGGCCATAATTGTTTAATCCTCTTGTCCTTATTGAATGGTTTTCACAAGAGATTGCTGCTGGGTTGAGTCAACTAGTTGTCTCATAAAAGTATTTGGGGTGTAAATAGGCTAGAATACCCAATTATACGAAGTAAAAGATGCATTTAATGTTTGTCCTCCTTAAGTAGTTGTATAGATTCTTTAAAGAATGGAGCTTCCTTTTCCTGCTCTATCATCACGAGCAAATCCAGGAAGCCCACTAATAGTTAGAAATCTAATAACCTTTTAGACACTTTTAAAGTCTAGGGGCCTATTAGACTCCAAATTGAAAGTTTAGGACCTGTAAACTTTTTAAAATTTTTACGAACTTATTGTGCACAATCCTAAAAGTGTAGGCCCCGTTTGTTAACTATTTGGTTTTTGAAAATTAAGCCTATAAATACTACTTCTACCCATGATTTCTTTGTTTGGTTATCCACTTTTTACCTATGTTTTCAAAAACCAACTCCTTTGTTCACCTGTCTTTTTATATGCCCTTTTGTATTTTTTTTTTCATTTTTCTCAATAAAAGCTCTTTTCCTCATTGAAAAAGGATATTGAGAAACTTAATTAATCTACCAAAATAGAGTAGGATCTTTGTCTTCCTAGGGAAAAATTTGCTTCTAACAAATAAAGGACAGATCCACAAGCAAAACACCTTTTAGGTAGTATATTAAACCAGGAGTTTAGGACTACTGTATGTCTGGAAAATAATAGATGGCTGGTAGGTAGATAATCGGTAGCCAAGTAAGAAACATTTAACACATTAGAGATGGTGAGATCAGGTGTAGTTCAATGACATAAGCATTTTCTCCCACTCATTTCATAACTCGACAAGGACCAATTTTCTTCTTATGCTAGTTTAGTACATGTAGGAAACCTACTTTACCCCAAGTACACTAGAGATGTTGAGATTTGAAGCAGTTCAATGACATAAATACTTTATCCAGCTCGCTTCATTACTCAACAAGGACCAATTTTCTTCTTCTCAAGCTTGTTATATGTACTGTTAGAATTAAAACCTACTAAAGTGCACCATGACGAACTCCCTACTTCCGAATCCTTGCTACAATTTCCTATTTCCTGCTGTTTTATACAACGTGTTAGCTTTAGTTAAATGATCAACTATCTCATCACATGTATCTACTTGGTTTGATCAGCTACCGATTCAGCCTCAAAGTTTTCATTAGATCAATAGTTAAAGGAGGGAGTTTAGAATGTTCTACCTCTAAAGATATTTTGCCTGTAGATCTGTTCATGTTGTTCAAAGAAAATTTTAGTTACTGCAAGAAATCCAACTACATTTCTTAATATTCAGTTGGTCACTTCAATTTGGCCCTCTGTTTGGGGTGTTAAGTTGCATTATACTGCCCTTGCGCATCAAACTTTTGCTACAAACTAGTCACATTGTGACGAAGAGGATGTCATTGTACAAAGTTGTCAACATCCTTTCTTAATTTGAACCTATTTAATTGATCTAATATTTTATCATGACTGAAATGTGCTCCCAATCCACTAGCGCAGGTATATCCACTAAAATTTTATGTTGAAGGAATGTAGGGGTAATTATAGTTCATCATTTTTAAATAAAAATCTATTAATCACCTTCTAGGAAGTACAGACACGGACACGGACACGGACACGACACGATACGGCGATACGCCAATTTCTAGAAAACTAGGACACGGACACGTTGAGGATACGTTATTATTATTTTCTTCTATATATAATATATATATATATATATATAACATAAACTACTAAATGTACTCCAAAACATAAAATGTTAAATCAATTGAGATTTTCATTTGATTATCCATATCCAACAAAGTTGCAAAATAAGATATCCAATGACAATGACCAACCAAAACCCAATAAAAGTTACAAACTATTGATCTAGAGTAAAAAAATCCAAGAAAGGAAATGATTCAAATGAAAAATAGTTTTCCATACAAAACATAGTTCTCAAGCAATATAAAGAAGAGTCTCAAATCTCCACATCTTCATTTTTATCACCAACAACACCCTCATCATCTTCATCATCAATAATCCCAGCCTCCAAATCTGGTTCATCCAAAGACAAATTAGCTATTTCAAGCATGCTGACATCTTCAAAAGAATCAAAAGCATCTCCAGCAATATCCCATAACTTAGTATCTCCTTTTAAATACTCTGGAGTTCTTGACAAAAGACGAAGATTACTATGGATAAATACCAAATCTTCTGCACGTTGTGGCGTCATTTTGTTTCTCTTGATAGAGTTAATAAAGGAATATGTGCTCCAATTCCTCTCACAACCTGAAGAGGAGGAAGGTTGTCCAAGGACTTTAAAAGCAATTGCCTGAAGTGTTGGGGCATATACACCATGAGTTGCCCACCAACTTACTGGATCTATAGTATACCTTTGAGATATTGAATCATATTCAGCAAAATCTCCTGTTTTAGTAGAAAATCTAGCAAATTCTACATTCACTTTTGCACGATCTTCAGAATTAGTAAAATATCTCTTAAAGCACTTCATTCTCTCTCGAGTTAGTTCCATATCTTGATGCGGAGCCACTCGATTAGGGTCTTGTCTAAGCCATTCTTCACTATAGTACCTAAAGATACGTTTTGAAAAAAAAAAGGGTTAAATTTTAAATATAAACTACAAGATTAGAATAGGACCAATAGGTCATAACCACCATAAGGATATAAAAAGAGTAATTAAGTACCTTGGATTTAAAGAATGTGCTAAACAGTGAAGTGGAGTATTATTCTTGTTCCAACGATCAATGAGAATGTTATGCACCACATCATAGAAAGAGGAGTGTTCACTTGGTTGCAATCCTTCATGTCTATATATTGTTGTCTTCACCTTTTCAATCATTGTATCCCACATATCATAAACCAAATGTAAACAAGGTTTGTCAGTATCACAAGCTCTGATCATATCATATATAGGTGAAGTGAAGGAAATAATATAATCAATCTTGTCCCACCATACGTCATCAAGGACCAACTCCTTTACATGCCTTGCCTTTCCCACATCATCTTCTCTATAACTTGCCCATTACTCACTAATAACCATATTTTGTAAACCACCTTTAATAAGTTTGAACCTCTTAAGCATGATGATTACGGATGCAAAACGTGTTTCAGCTATAGAAAGCAATTTAAGAGACACAAATTCATTAAACATGGCGGGCCTCATGGAATGATTCATGATAAAATTTTTCACTACCATGATTTCACCAGCAATATCAGATATCCAACTGCATTCCCCATATACAAGTTGATTGTTTTCAACATTTTTGGCGACATATATGTTTTTCAAAGCAAGATTCGGTATGCACTACACATGGTGTCCAAATAATTGTCGAAAATTGTGCTTCGATGATTTGCCTGCACCTTTGCAATTAGGAGCATTATCATGTTATCACTTGAATCACATTTTCAGGGCCGACTTCATTTATCACTTCTTTCATCAAATTTGCAATGAAATACTTATCTTTGATCTCACCAGAGCAATCCGCTGCTTTTAGAAACATTGGTGTGCCATCAGACATTGCCATAAAATTAATCAAAGGTCTCTTTGTGAGTCACTCCATCCATCACTCACAATACTCACCCCTTTGTGCGCCATTCACTTTTAATAGGTGTCAATAGTCTCTCAATATTTGCCTTTTCTTGTTGGAGAAGACTTGTCCTCAATAAATTATATCCGGGAGGAACATATCCCGACAACATATGGTTTGCGACATATAAGTAAAGGCCCCTATAAAATGAGGATTCCTCGCTAGATGGAAAGGCAAGCCAGTGAGAATAAAACATTCGAGCAATAAGTGCATGTAATTGGTCTCGTGATGTCTTGTTGAAAGATCTTTCAATTGCACTTGTACCAACCTTTCTTTTCTTTAGCTCACTTGTAATGGGAGTGGGAGAATTCATCGACATTAACACCACCAATTGATATGATTGAAGGCGTAGGTGGTAAAGGAACATTTTTTGGACTTTTTCTTTCCATTCGAGTCTTTGCCTCATCTTCTAACTTTTGCATCTCAGCCATATCTTTAGGAGTGACTTTTAGACACATTCCAATCCCTTGACCATTTATCTTCAACAAATGAGCCCTAACTCTTGTATAAGAACTCTTTTTTGTGGCGTGACAAAAATTGCGTTGCCATAAAAAATTTCCACCCCCATCACTCAATTTTTTATTCTTAGTCACATATTGCCATAGAGGTTTGAAATCATCTTCAAAAGTAGAGGAAGGGGATTCTATGGAAGGATTTGATGAATGAGAGTGGCTAGAGGTTGAAGATGCCATCCTTTTTCCTTCCTAATAAAATTATAAACTTGTAATTTTATTGAATGACAATGTTAGAAATGATGGAATAAGAATTCACACAGTTTGATACTAGTTTAGTTTAAGTTTTATTATATGAAGGAAACTAGGAAAGGAAGGAAAAAAAACATACAGTTCAGTTATGTGAAATGAAAACCAAAACAGTGGCAAAATGCAGATTGGACAAGACGAAGAGTTGCAGTTGCGGACGGCACGACGACAGACAGCAAGGCGGCGGAAAACACGGCGGTGGACGGTTGGCGGCGCGAACAAATCCTTCGCTGGTTGTGGTCGATGCGTGTTGCAGGCGCTGGCTTCGTGCGAAATGGAGTAGGGTTGAAATCGATGGGGGTGGGCGGCTCTTTGTGTAAATTGTGGGAATGGGAAAACAGATTGGAAACTTTAGCCCTTTGGGCCTTTATTTGGGCTGGGAAACGGAAACTTTGGCCCTTTGGCAAAATGAAGATCTTTGATTTCCATTTTTTTACGTGTCCTTGCGTGTCTGATATTAAAAAAAAAAAAAATGGATACGCAAAATTGCGTGTCCGGCGCGTGTCTGGACGTATCCGTGTCGGACACTGATTCTTAGCCTAATTTAAAGTGTCCGTGCTTCATTGATCACCTTTAAATACCCCACGATTTTCTGGAGTTGCCTCTTCGTTTCTTCGAAGGAATGTAGGTGGTATTTATAGTTCATCATTTTTAAATAAAAAAACTATTAATCACCCAGAAATACTCTACTAAGTCAGACTTCTTAGTCGGGTTAGGAAAGAGAGCGTTCGTTGATGATGGAGGAGGTGCTCTTCCATCCCCCTTTTTGCAACAAGGGGTCCTGTGCCATTGTGCTTTCTTTGCTGTTGTTTTGGGATTATGACTAAAGAGGAATATGAGAGTTTTTTAGATGCTTGATAGATCTTGAAAGGAGTTGTGGTCTCTTGTGAGATTTAATGCTTCTCTCTAGGCATTTGTCACTAAGGCTTTTTGTAGTTTTCCTTTAGGCCTTGTCCTTCTGGTCTGGTCCCCTTTTTGTTTAGCTAGTTTGTGGAGCTTTCTCCATTTTGTTGGGCTCTTTTTAAAATGCTCTTTTGTTTCTTCTTAGTTAAAGCACATTTTCTTATCAATAATAATAGTAATAATAAATACACTACTATTTTTCTGCCACTTTTTAAAATGCTCTTTTATTTCTTCTTAGTTGAAACACATTTTCTTATCAATAATAGTAATAGTAATAATAAGTACACTACCATTTTTCTTCCTGGGCATCACACATCACTGCCAAAAAAACAAACACTACCATTTTTACAATTATCCCAAATATTAAGAAAGTCTTAACCTGTCAAATATAATAATTAACATAAATTGTTAGATTATTTAATCTCTCCATGTACAATGGAAAGTAGGTGAGATCTCCTACATAGAGTACTAGCTAGCTAATTTGTTAGGTCAGACTTATGATTGATGGAAAAATCAAATCTTTGTATAAGAGTTATCCACCTAGGATGCATGCGGGATAAGTTCTTTTGTGAATTTACGTGTTTTAAGATTGGCGATTAGTAATTAATATGAACTCTCATCTTCACAAATGGTGTTCCCATTGTTTTAGGTCTTGTACTTAAGAATATATTTCTTATCCCAATTATAAATTTTTTAAAGTTAATGTAAAGTCGATTATCTGGCTAGACAGAAAATGTCATAAATATTTGACATGCTCATCTAGTGTGGTTCTGGAAACCAAAATGCCTAAGTAAACTACAACAAATTTATTATAATAGATTGTAAAATTTGATTCACTAATCATGAAAGTATTTAGATCAAAATTTTAGAAAAGGATTTTGAATCATGTAGTTGATCTAAGAGGTGACTTAATCTAGAAGGTATCAATCGAAGTTTTCTTAATGGCTTTATTTTCAATGCGCATACTCTAGGACCCTCTAGGCTCATGTTGTACATATTAACCCTTCAATTAATGTTCTACATATTAGCCCTTTATCTGGTAGTAGTAGTTCATGTACTCGGAGGCCCATCCTGTGGTGTCATTGTGATAAGTTGGTAAAGTAGCCCCTAGTTTTAAGCCTGTATGGTGTTGGATATCTCTTAAACACAAAACCTGGTCAACTATGAATATAGGCATCCCACCAAAGTTCGACCAGCATTTTTGAAAGACAGATACTCTTCTAGCCAAACATTTCAAGACCTAAACGGTGTTCTACCTTATTTTTATTTCCTCAACAAAAGAATAAGGGAAAGGTGATCCAAAGTGATTCTAGGGAGTTTAATCACCTTAACTGGTTGAAGAAGTTTAGCCAACCCTTGGAAGCGAAAAATCTGTCAATCTTTGTAAAGATGGGGTTTTCCCTCGTTACGAAGTGTTGCAGCCTTTGTAAGCACCATGGGGAAAATATTGGCTTTTTTCTTGGGTTGTCATTTTAGCAAAAAGTTCATGGAATTGAATGTTTAATGAGTTCAAGTTCTCTAGTGCTGACACAATAATACGAAAGATTGCTTAGATCACCTTTTCAGTAGCTTCTCCCACTTTAAAAGCTTACACAATCTGGGTGATAGTAGTAGAAAGTATCTATGGTTAATTTAGAAAGAAAGGAATTCCAGAACTTTAAAAATAAAGGGAGAAATACTGAGAACAAATATTTGATTTATCCCATTTTAATGTTTTTACTTGGTGTACCCTTTCTACTCATGGATTTATGATTCATTTGTCCTCATCAATACCAGTTGCATTTTTTTTTTTTTTGAAAGATTGACCTTTCATATTTTTTATGTAATTGATGACATTTTTTTTTTGGATAGTAAATGGATCATTTCATTGATAAAATGAAATTACAAAATGAAATGCTCATTCAGAGGATTACAAAAAACTTTTCCAATGACCTATAAGTGAAGATAAACCATAATTACAAATTGGGGGGGGGGGGGGTCAATTTACAACATGAAAGAGAGATAAATAATTGATGAGCTAAAATCCTCGAAGTCCTATTCTTTTCCAACAAAAATACAAGAGTTTCTCTCCAACCAAATTCACCAGAAGAAAGCCCGGATAGAATTAAGACTTAAGCCAAAGGGTCTTCTTCTCCTTTTTGAAAGAGTGCCTAATTTTGACAAGTTTGTTCCTTATAGGAAAAAAAAAGAAGCTACTTTCACATGTAATTAATACCTATGAAGCATGGACACTTTAAATTAGGCTAAGAATCAGTGTCGGACACGTGTCGGACACGGATACGTCCGGACACGCGCCGGACACGTGTCGGACACGCAATTTTGCGTATCCTTTTTTTTTTTTTTTTTTTTACATCGGACACTCTGGGACACGCAAAAAATGGAAATAAACATTATCCATTTCGTTTCCCAGCCGGCCAACCCAAATAAAGGCCTAAAGGCCCAAAGAGTCTTTCCCATTCCCACGATTTACACAAGTGCCGCCCCCCCCAATCGAGATTTCAACCCTACTCCCTCCATTTTGCACGAAGCGAGCGCCTGCCAACCATTGCCCACGAGCAGCGGAGGATTTGTTTGCGCGCCGCCGCCACCGCCCCCACGCAGTGTTTCCGCCGCCTTGCCGTCCGCGTCGTCGTCTCGTCCAAACCGCAATCCAATCTGCATTTTTCCATTGTTTTGGTTCTCATTTCACAGAACTAAACTAAACAAAACTGTATGTTTTTTTTCCTTCCTTTCCTAGTTTCCTTCATATAATAAAACTTAATCTAAACTAGTATGATAGTATGAGTATCAAACTGTGTGAATTCTTATCCCATCATTTCTAAGTTTATAATTTTATTAGGAAGGAAAAAGGATGGCATCTTCAACCTCTAGCCACTCTCATTCATCAAATCCTTCCATAGAATCCCCCTCCTCTACTTTTGAAGATGATTCCAAACCTCTATGGCAATATGTGACTAAGATTAAAAATTGAGTGATGGGGTGGAAACTTTTATGGCAATGCAATTTTTGTCACGCCACAAAAAAGAGCTCTTATACAAGAGTTAGGGCTCATTTGTTGAAGATAAATGGTCAAGGGATTGGAATGTGTCTAAAAGTCACTCCTAAAGATATGGCTGAGATGCAAAAGTTAGACGATGAGACAAAGACTCGAATGGAAAGAAAAAATCCAAAAAATGTTCCTTTACCACCTACGCCTTCAATCATATCAATTGGTGGTGTTAATGTCAGTAATTCTCCCACTCCCATTACAAGTGAGCTAAAGAAAAGAAAGGTTGGTACAAGTGCAATTGAAAGATCTTTCAACAAGACATCACGAGACCAATTACATGCACTTATTGCTCGAATGTTTTATTCTCGCGCTTGCCCTTCCATCTAGCGAGGAATCCTCATTTTATAGGGGCCTTTACTTATCTCATGCAAACCATATGTTGTCGGGATATGTTCCTCCTGGATATAATTTATTGAGGACAAGTCTTCTCCAACAAGAAAAGGCAAATATTGAGAGACTATTGACACCTATTAAAAGTGAATGGCGTACAAAGGGGGTGAGCATTGTGAGTGATGGATGGAGTGACTCACAAAGGAGACCTTTGATTAATTTTATGGCAATGTCTGATGGCACACCAATGTTTCTAAAAGCAGTGGATTGCTCTGGTGAGATCAAAGATAAGTATTTCATTGCAAATTTGATGAAAGAAGTGATAAATGAAGTCGACCCTGAAAATGTGATTCAAGTGATAACAGATAATGCTCCTAATTGCAAAGGTGCAGGGCAAATCATCGAAGCACAATTTCCGACAATTATTTGGACACCATGTGTAGTGCATACCCTGAATCTTGCTTTGAAAAACATATGTGCTGCCAAAAATGTTGAAAACAATCAACTTGTATATGGGGAATGCAGTTGGATATCTGATATTGCTGGTGAAATCATGGTAGTGAAAAATTTTATCATGAATCATTCCATGAGGCTCGCCATGTTTAATGAATTTGTGTCTCTTAAATTGCTTTCTATAGCTGAAACACGTTTTGCATCAGTAATCATCATGCTTAAGAGGTTCAAACTTATTAAAGGTGGTTTACAAAATATGGTTATTAGTGAGCAATGGGCAAGTTATAGAGAAGATGATGTGGGAAAGGCAAGGCATGTAAAGGAGTTGGTCCTTGATGACGTATGGTGGGACAAGATTGATTATATTATTTCCTTCACTTCACCTATATATGATATGATCAGAGCTTGTGATACTGACAAACCTTGTTTACATTTGGTTTATGATATGTGGGATACAATGATTGAAAAGGTGAAGACAACAATATATAGACATGAAGGATTGCAACCAAGTGAACACTCCTCTTTCTATGATGTGGTGCATAACATTCTCATCGATCGTTGGAACAAGAATAATACTCCACTTCACTGTTTAGCATATTCTTTAAATCCAAGGTACTTAATTACTCGTTTTATATCCTTATGGTGGTTATGACCTATTGGTCCTATTCTAATATTGAAGTTTATATTTAAAATTTAACCTTTTTTTTCAAAACGTATCTTTAGGTACTATAGTGAAGAATGGCTTAGACAAGACCCTAATCGAGTGGCTCCGCATCAAGATATGGAACTAACTCAAGAGAGAATGAAGTGCTTTAAGAGATATTTTACTAATTCTGAAGATCGTGCAAAAGTGAATGTAGAATTTGCTAGATTTTCTACCAAAACAGGAGATTTTGCTGAATATGATTCAATATCTCAAAGGTATAGTATAGATCCAGTAAGTTGGTGGGCAACTCATGGTGTATATGCACCAACACTTCAGGCAATTGCTTTTAAAGTCCTTGGACAACCTTCCTCCTCTTCATGTTGTGAGAGGAATTGGAGCACATATTCCTTTATTAACTCTATCAAGAGAAACAAAATGACGCCACAACGTGCAGAAGATTTGGTATTTATCCATAGTAATCTTCGTCTTTTGTCAAGAAGAACTCCAGAGTATTTAAAAGGAGATACTAAGTTATGGGATATTGCTGGAGATGCTTTTGATTCTTTTGAAGATGTCAGCATGCTTGAAATAGCTAATTTGTCTTTGGATGAACCAGATTTGGAGGCTGGGATTATTGATGATGAAGATGATGAGGGTGTTGTTGGTGATAAAGATGAAGATGTGGAGATTTGAGACTCTTCTTTTTATTGCTTGGGAACTTTGTTTTGTATGGAATGGAAAACTATTTTTCATTTGAATCATTTCCTTTCTTGTATTTTTTTACTCTAGATCAATAGTTTGTAACTTTTATTGGGTTTTGGTTGGTCATTGTCATTGGATATCTTGTTTTGCAACTTTGTTGGATATGGATAATCATATGAAAATCTCAATTGATTTAACATTTTATGTTTTGGAGTACATTTAGATTTAGTAGTTTATGTTATATATATTATATATATATATTATATATATATATTATATATAGAAGAAAATAATAATATAATAACGTATCCTCAACGTGTCCGTGTCCTAGTTTTCTAGAAATTGGCGTATCGCCGTATCGTGTCGTGTCCGTGTCTGTGCTTCCTAGATTAATACTAACTGAAGTGGGTTTTCTTGTAATTCTTCTGTGGGAGAGAAGGGGTCCTTTTCTTCTGGTTATTCCTTTTCAGTAAGTCAATGAAAGACGTCTCTTGTTTTGGAAAAAGTTTGAACATTATATATTTGGTAACCATGGAAAATCTGGCAGTCCTTAGATCTTTATGGTAACATGAAGACCAGGAATAACATCAAGCATCTTGAACATTATTTTGTTTCCAATTCCAATTGCCGAGCTATTCAGGCAGGTACTTTAATCAGCTTTATCTCTATATTCTAGGTGATTGAACGGTTCAACTGGCTCTGCTCTTTTGTAGGAAGAGGAAGTTAACAAATCACTGGCAAAACTTCATCCGCCCTCTAATGTGGCGTTGCAAGTGGACAGAATTGAGAATTAAGGAAATTGAGTCACAGACATTAAAATATTCCAGAGCACTTGCAGTGTATGAACAAGGAAAAAATTCAGGCCTTGATCCAACAATGGAAGATTTTTCTTCAAAAGCATTTCCATTTTCTAGCCCATATTATAGAAGAAAGGCAATGAAGAGAAGAAAACGAAAGAGAGCTGAAGATACAAATGATATATCATCCTATATGTCACAACATAACCTTTTCTCCTACTATGGTACTGGCTTTAACCTCGTACTTTTTACTTCTTGTTTTTTTTCTCTAAATGCAACTGTATAATTTGGTAACTTTTTTTTTTTAAATACAGAAAATAAGAGAGCTGAACTAGATGGTACTTCTGTAGCTGATGAATTTGCTAATCCAGGTAAAATCTTTCTGCTGTTACCTATATTGTCTGTATGGCATGCAGTATTTTCTTGGTGATGATAGCCAATGGGAAGGTCAAGCAAGACACTTGCACTGGATATTTGAGCCTGGCGAGTGATATATTGAACAATGGAACTCTTCAATTACTTTATACATTGGACAAAAGACTTTTATTTTCGTTTTCATGCATTTTTCTCTTAAAAACTAAAGGAGTCCGTTTGTCAAATGATAATATACTTTTGCCTACTCATGTAGTTTTAGCGCATTTAACTAAAGTATCCTCTTTTTTTTTTTTTTTGACAAGAACTAAAGTATCCTCTTTCGTCAACTCAAGAGATTTTTTTGCCCAATTGAATATTTATCTTATTTTAATATCTGATTAATACACAAACTGGGGGTTTTTCAGTTCATTCTGATGCCTCTTTTCTTAGTTTAGTTACCGTCCTAAAATTCTTCATACTTGAATATTTTTAAACTAAAAGTTGGCTTTCTGTTATTATTGGGTCAGTGAAAATGGAGAAAAATGCTGTTTCAGACGACAAATTTGGGATTAATAATGACTCTGTTCTCGAGTTCAGAGACACTAATAATTCTTTGGAGCAAGTACTCTGGAAAATTGAAGCGGTGCACTCTCGACTTCACAAGCTAAAGGGTCAAATGGATAAGGTGATGTCCAAAAATGCTGCAAAATTTTCTTCCTCAGAGAATCTGAGTCTTCTTGCACCTTGTGAGGCACAGACCAGCTCTGCCCCTAGTCCTACGTTCTCTGCTGGTAATGGAGAATTATCAGTTGGAGTTATGTGTGCCTCGGCCCAACATATATCAGAGTGTGATATTGGTGAGCTGATGAAGCCTGAAAGTGCTATTTCAAGCTATGGGGAGGCAATACTAGTTCCTGATATTATTGAAAGTACAGTAGGTCTTTTGACTGCTACTGCTGATGTTTCAATTCCTCAACCACAAATTGGGGACTCAACTGAGGATGTAAGAATTCTTGATGTCTAGCTAATTAATTTCTCTCTTACACACAAATCCTTCTCTGGTGATGATAATGCTAGAGACAATTTTTGGTGTGAACTCTGCAAAATGGGTTTGCTTTTCTTTCTTCAATTGGAAAGAAGTTGTCTGATTGTTTAAATTTTTTTTCTACTTTCTTTTACCTTTCTCCACATCTCTATTGTAAGGGGATGTAGCTCAAATGGCAGAGCCCTTGCTCTGCATGCAGAGGTACAGGGTTCAATCCCCTGCACCTTCATCTTCTTCTAAAAACCAATTTCTGGAAAATTTTGGTCTCCTCTTCCTTTGATCTCTTTGATATAATGCATGTCTACGGAATGAAAAATAGATATTTTAACCACTTGCTTTCATGAGAATCGATATGTTTATCTCTTAAGAAGCTATATTGCACATTTGGATTTCTGTCTCATGGTCTCATGGTCTCTGTTTTCATTTGATCTCTTGATATACTGCAATGTCTATGAAATGAAATTTACATCTTTTAACTACTTGCTTTCATGAGAGTAGATCGTTTTAGCTCATGGTCACTGCTCTTCTTTGAATTAGTAAACCAATAGATTCCTGCTTTCTAACATCAAACTAACCTTTTTGCAGATTGTCGATAATGTTCTGATACACAATGAGGTGGCTGAGGCAGAGAGAAACCCAGATAGCAGGATTGTTGTGCAGCCCGTGGAAAAACATGAAGAACAAGAAAAAGGCAAGCAAAGTGAAGGTACCTGTCTTGGCTCAGTTCCAACTACACAACCTGATCCTATGGGAAAAGCTTTGATCTCCGACGAACAATCAGCTCTTAAGAAATGTTTAGCTTCAGATATCAATTTCCCTAGGAACAAGAGAAAGCGAGGGGAAAGAAAAGCAGGCCCGGGTAGTTGGAACAAGAAACATTCGAGCGAACCCGATAGCCAGTAAGTTATTGTAGCTGTCCATTAGAGGTCTGGAGGTGGTCAGACGTAAATTTTCCACTTTGCCATTAGCCACTGGCATAGGTACGTAATGACCACTGCTCAGGTTTAGTCTTTATTAAGCAATTTCCAGAATGAAGCAGCTCTCAGTTGCTGCTGCCACAATGTGTTAGAGAGAGATCCTGAGTTTTTCTCGTGTAAGTTCTGTCTGTATCATGTTTATTGGGAACTCATCTCTCTCGTACATCATTTTAGATCTCAATAAGATAACTTTCGCCAA

mRNA sequence

TTCGCTTTAGGTTTATAACGTTAACTTCTCTGCTCTCTCTCTCTCTCTCTCTCTTCTTCTAGCCGCGCCGAAATTTCTCTCTCCGCACTCTCTCCCGCGCTCACGCCTCTCTCTCCTCTCTCTTTCTCTCTTCCGTTTCATTGCTCCTCCATAGCTTTTTTTCACTCCCTCGCCTTTCTACATTGCTCTTTCTTGCCTCCTCTGCTTCTTCCTTCTTCGGTCTGCTGCATTTTCTTTCTCTGTTGCTGTCCTCTGACTCTCGCTATAATATAATTGTGCTAAATCATTGATGGACCTCTATCTGCACAAGACAATGGATTAAAGAAAAATGCTGCACGTATTAAGTTGGAGATTTTCTTGGTAAATTCTAGTGGGAAAGGCAGGGGCAGGAGCTAGTCTGGGAAACCATTTCTAGGAATGGAAACTCATAGCAAAGGGAAGTGCCGGGAGGATTTCGAGGTTGATATAATTGAATGTTCAAATAAAACTGATCCCAAATTTTGTGGGAAAGAAGACCCTGATGCAACTGAATATTCAAGCTCATTTGCTGAGACATCTGATGCGGATAATTGTTCAGGATTCAACGAAGGAGAAGTAGAAACTCAATTTTTTGGAGACATTGGCTTGCCACCCACATTTGGTTCATTTAGTGGTGCACTTCCAATAAGGAAGAGGAAGTTAACAAATCACTGGCAAAACTTCATCCGCCCTCTAATGTGGCGTTGCAAGTGGACAGAATTGAGAATTAAGGAAATTGAGTCACAGACATTAAAATATTCCAGAGCACTTGCAGTGTATGAACAAGGAAAAAATTCAGGCCTTGATCCAACAATGGAAGATTTTTCTTCAAAAGCATTTCCATTTTCTAGCCCATATTATAGAAGAAAGGCAATGAAGAGAAGAAAACGAAAGAGAGCTGAAGATACAAATGATATATCATCCTATATGTCACAACATAACCTTTTCTCCTACTATGAAAATAAGAGAGCTGAACTAGATGGTACTTCTGTAGCTGATGAATTTGCTAATCCAGTGAAAATGGAGAAAAATGCTGTTTCAGACGACAAATTTGGGATTAATAATGACTCTGTTCTCGAGTTCAGAGACACTAATAATTCTTTGGAGCAAGTACTCTGGAAAATTGAAGCGGTGCACTCTCGACTTCACAAGCTAAAGGGTCAAATGGATAAGGTGATGTCCAAAAATGCTGCAAAATTTTCTTCCTCAGAGAATCTGAGTCTTCTTGCACCTTGTGAGGCACAGACCAGCTCTGCCCCTAGTCCTACGTTCTCTGCTGGTAATGGAGAATTATCAGTTGGAGTTATGTGTGCCTCGGCCCAACATATATCAGAGTGTGATATTGGTGAGCTGATGAAGCCTGAAAGTGCTATTTCAAGCTATGGGGAGGCAATACTAGTTCCTGATATTATTGAAAGTACAGTAGGTCTTTTGACTGCTACTGCTGATGTTTCAATTCCTCAACCACAAATTGGGGACTCAACTGAGGATATTGTCGATAATGTTCTGATACACAATGAGGTGGCTGAGGCAGAGAGAAACCCAGATAGCAGGATTGTTGTGCAGCCCGTGGAAAAACATGAAGAACAAGAAAAAGGCAAGCAAAGTGAAGGTACCTGTCTTGGCTCAGTTCCAACTACACAACCTGATCCTATGGGAAAAGCTTTGATCTCCGACGAACAATCAGCTCTTAAGAAATGTTTAGCTTCAGATATCAATTTCCCTAGGAACAAGAGAAAGCGAGGGGAAAGAAAAGCAGGCCCGGGTAGTTGGAACAAGAAACATTCGAGCGAACCCGATAGCCAGTAAGTTATTGTAGCTGTCCATTAGAGGTCTGGAGGTGGTCAGACGTAAATTTTCCACTTTGCCATTAGCCACTGGCATAGGTACGTAATGACCACTGCTCAGGTTTAGTCTTTATTAAGCAATTTCCAGAATGAAGCAGCTCTCAGTTGCTGCTGCCACAATGTGTTAGAGAGAGATCCTGAGTTTTTCTCGTGTAAGTTCTGTCTGTATCATGTTTATTGGGAACTCATCTCTCTCGTACATCATTTTAGATCTCAATAAGATAACTTTCGCCAA

Coding sequence (CDS)

ATGGAAACTCATAGCAAAGGGAAGTGCCGGGAGGATTTCGAGGTTGATATAATTGAATGTTCAAATAAAACTGATCCCAAATTTTGTGGGAAAGAAGACCCTGATGCAACTGAATATTCAAGCTCATTTGCTGAGACATCTGATGCGGATAATTGTTCAGGATTCAACGAAGGAGAAGTAGAAACTCAATTTTTTGGAGACATTGGCTTGCCACCCACATTTGGTTCATTTAGTGGTGCACTTCCAATAAGGAAGAGGAAGTTAACAAATCACTGGCAAAACTTCATCCGCCCTCTAATGTGGCGTTGCAAGTGGACAGAATTGAGAATTAAGGAAATTGAGTCACAGACATTAAAATATTCCAGAGCACTTGCAGTGTATGAACAAGGAAAAAATTCAGGCCTTGATCCAACAATGGAAGATTTTTCTTCAAAAGCATTTCCATTTTCTAGCCCATATTATAGAAGAAAGGCAATGAAGAGAAGAAAACGAAAGAGAGCTGAAGATACAAATGATATATCATCCTATATGTCACAACATAACCTTTTCTCCTACTATGAAAATAAGAGAGCTGAACTAGATGGTACTTCTGTAGCTGATGAATTTGCTAATCCAGTGAAAATGGAGAAAAATGCTGTTTCAGACGACAAATTTGGGATTAATAATGACTCTGTTCTCGAGTTCAGAGACACTAATAATTCTTTGGAGCAAGTACTCTGGAAAATTGAAGCGGTGCACTCTCGACTTCACAAGCTAAAGGGTCAAATGGATAAGGTGATGTCCAAAAATGCTGCAAAATTTTCTTCCTCAGAGAATCTGAGTCTTCTTGCACCTTGTGAGGCACAGACCAGCTCTGCCCCTAGTCCTACGTTCTCTGCTGGTAATGGAGAATTATCAGTTGGAGTTATGTGTGCCTCGGCCCAACATATATCAGAGTGTGATATTGGTGAGCTGATGAAGCCTGAAAGTGCTATTTCAAGCTATGGGGAGGCAATACTAGTTCCTGATATTATTGAAAGTACAGTAGGTCTTTTGACTGCTACTGCTGATGTTTCAATTCCTCAACCACAAATTGGGGACTCAACTGAGGATATTGTCGATAATGTTCTGATACACAATGAGGTGGCTGAGGCAGAGAGAAACCCAGATAGCAGGATTGTTGTGCAGCCCGTGGAAAAACATGAAGAACAAGAAAAAGGCAAGCAAAGTGAAGGTACCTGTCTTGGCTCAGTTCCAACTACACAACCTGATCCTATGGGAAAAGCTTTGATCTCCGACGAACAATCAGCTCTTAAGAAATGTTTAGCTTCAGATATCAATTTCCCTAGGAACAAGAGAAAGCGAGGGGAAAGAAAAGCAGGCCCGGGTAGTTGGAACAAGAAACATTCGAGCGAACCCGATAGCCAGTAA

Protein sequence

METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQHNLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSVGVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMGKALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Homology
BLAST of Tan0002099 vs. NCBI nr
Match: XP_023540257.1 (uncharacterized protein LOC111800683 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 856.7 bits (2212), Expect = 9.5e-245
Identity = 437/469 (93.18%), Postives = 446/469 (95.10%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKMEKNADSDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-EVSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLIHNE+AEAERN    IV QPVEKHE+ EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIHNELAEAERNTHDTIVAQPVEKHEKPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. NCBI nr
Match: XP_022972330.1 (uncharacterized protein LOC111470900 [Cucurbita maxima])

HSP 1 Score: 854.7 bits (2207), Expect = 3.6e-244
Identity = 436/469 (92.96%), Postives = 445/469 (94.88%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVK+EKNA  DDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKIEKNADCDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-EVSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLIHNE+AEAERN    IV QPVEKHEE EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIHNELAEAERNTHDTIVAQPVEKHEEPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. NCBI nr
Match: KAG7029121.1 (hypothetical protein SDJN02_10306, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 851.3 bits (2198), Expect = 4.0e-243
Identity = 435/469 (92.75%), Postives = 444/469 (94.67%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKMEKNADSDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-EVSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIPNELAEAERNTHDTIAAQPVEKHEKPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. NCBI nr
Match: KAG6597677.1 (Protein IQ-DOMAIN 14, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 851.3 bits (2198), Expect = 4.0e-243
Identity = 435/469 (92.75%), Postives = 444/469 (94.67%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKMEKNADSDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-EVSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIPNELAEAERNTHDTIAAQPVEKHEKPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. NCBI nr
Match: XP_022932640.1 (uncharacterized protein LOC111439133 isoform X1 [Cucurbita moschata])

HSP 1 Score: 850.1 bits (2195), Expect = 8.9e-243
Identity = 434/469 (92.54%), Postives = 444/469 (94.67%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKMEKNADSDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT ++SIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-ELSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIPNELAEAERNTHDTIAAQPVEKHEKPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. ExPASy TrEMBL
Match: A0A6J1I8A3 (uncharacterized protein LOC111470900 OS=Cucurbita maxima OX=3661 GN=LOC111470900 PE=4 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 1.8e-244
Identity = 436/469 (92.96%), Postives = 445/469 (94.88%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVK+EKNA  DDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKIEKNADCDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT +VSIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-EVSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLIHNE+AEAERN    IV QPVEKHEE EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIHNELAEAERNTHDTIVAQPVEKHEEPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. ExPASy TrEMBL
Match: A0A6J1F2Q4 (uncharacterized protein LOC111439133 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439133 PE=4 SV=1)

HSP 1 Score: 850.1 bits (2195), Expect = 4.3e-243
Identity = 434/469 (92.54%), Postives = 444/469 (94.67%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           METHSK KCRED EVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSD DNC+ FNEGEV
Sbjct: 1   METHSKAKCREDLEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDTDNCAEFNEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS ALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPAFGSFSSALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGKN GLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH
Sbjct: 121 SRALAVYEQGKNLGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSYYENK+AELDGTSVADEFANPVKMEKNA SDDKFGINNDSVLEFRDTN+SLEQVLW
Sbjct: 181 NLFSYYENKKAELDGTSVADEFANPVKMEKNADSDDKFGINNDSVLEFRDTNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNA+KFSSSENLSLLAPCEAQTSSAP+PTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNASKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT ++SIP PQIGD
Sbjct: 301 GVMGASTQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTAT-ELSIPLPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIV NVLI NE+AEAERN    I  QPVEKHE+ EKGKQSEGT L SVPTTQPDPMG
Sbjct: 361 STEDIVTNVLIPNELAEAERNTHDTIAAQPVEKHEKPEKGKQSEGTSLSSVPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+SDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. ExPASy TrEMBL
Match: A0A6J1E2X4 (uncharacterized protein LOC111025545 OS=Momordica charantia OX=3673 GN=LOC111025545 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 3.0e-228
Identity = 407/469 (86.78%), Postives = 434/469 (92.54%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           MET SK KCRED EV+IIECSN+TDPK  GKEDPDATEYSSSFAETSDAD+CSGF+EG+V
Sbjct: 1   METLSKNKCREDLEVEIIECSNETDPKLYGKEDPDATEYSSSFAETSDADDCSGFSEGDV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPP FGSFS  LPIRKRKLT+HWQNFIRPLMWRCKWTELRIKE+ESQ L+Y
Sbjct: 61  ETQFFGDIGLPPAFGSFSSTLPIRKRKLTSHWQNFIRPLMWRCKWTELRIKELESQALRY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQGK+S LDPT+E+FSSK  P SS YYRRKAMKRRKRKR EDTNDI SYM+QH
Sbjct: 121 SRALAVYEQGKSSWLDPTVEEFSSKELPLSSQYYRRKAMKRRKRKRVEDTNDILSYMTQH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSY+ENKR+ELDG  VADEFANPVK+EKNA SDDKFGINNDSVLEFR++N+SLEQVLW
Sbjct: 181 NLFSYHENKRSELDGIPVADEFANPVKVEKNADSDDKFGINNDSVLEFRESNSSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIEAVHSRLHKLKGQMD VMSKNAAKFSSSENLSLLAPCEAQTSSAP+PTFSAGNGE+SV
Sbjct: 241 KIEAVHSRLHKLKGQMDMVMSKNAAKFSSSENLSLLAPCEAQTSSAPTPTFSAGNGEISV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
           GVM AS QHISECDIG+LMKPESAISSYGEAILVPDIIESTVGLLTAT DVS+PQPQIGD
Sbjct: 301 GVMYASTQHISECDIGDLMKPESAISSYGEAILVPDIIESTVGLLTAT-DVSVPQPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STEDIVDNVLIHNEVAEAERN  SRIVVQPVEKH E EK K  EGT L S+PTTQ DPMG
Sbjct: 361 STEDIVDNVLIHNEVAEAERNTRSRIVVQPVEKHREPEKSKLGEGTSLNSIPTTQSDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KA++ +EQS LKKCLASDINFPRNKRKRGERKAGPGSWNK+HSSEPDSQ
Sbjct: 421 KAVVCEEQSTLKKCLASDINFPRNKRKRGERKAGPGSWNKRHSSEPDSQ 468

BLAST of Tan0002099 vs. ExPASy TrEMBL
Match: A0A5A7TML9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G002900 PE=4 SV=1)

HSP 1 Score: 792.3 bits (2045), Expect = 1.1e-225
Identity = 408/469 (86.99%), Postives = 424/469 (90.41%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           ME+HSK   RED EVDIIE SNKTDPKFCGKEDPDATEYSSSF ETSDADNCSGF+EGEV
Sbjct: 1   MESHSKANSREDLEVDIIEGSNKTDPKFCGKEDPDATEYSSSFGETSDADNCSGFSEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPPTFGSFS  L IRKRKLT HWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPTFGSFSSTLQIRKRKLTAHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQ K    DPTMEDF SK FPFSS YYRRKAMKRRKRK+ ED  DISSYMS H
Sbjct: 121 SRALAVYEQEKVPAHDPTMEDFFSKTFPFSSQYYRRKAMKRRKRKKIEDAIDISSYMSHH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSY+ENKR+ELDGTSVADEFANPVK+EKNA SDDKFGINNDS+LE RDT+NSLEQVLW
Sbjct: 181 NLFSYFENKRSELDGTSVADEFANPVKVEKNADSDDKFGINNDSILESRDTDNSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNAA FSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNAAIFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
            VMCAS Q ISECDIG+LMKPESAISS+G+AILVPDIIESTVG LTAT DVS+PQPQIGD
Sbjct: 301 SVMCASTQRISECDIGDLMKPESAISSFGDAILVPDIIESTVGNLTAT-DVSLPQPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STE IVDNVL HNEV EAERN DS++V QPVEKH E EK  Q EGT L S PTTQPDPMG
Sbjct: 361 STEAIVDNVLTHNEVVEAERNTDSKVVAQPVEKHREPEKVSQGEGTSLSSNPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+S+EQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSEEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. ExPASy TrEMBL
Match: A0A1S3BXR7 (uncharacterized protein LOC103494594 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494594 PE=4 SV=1)

HSP 1 Score: 790.8 bits (2041), Expect = 3.1e-225
Identity = 407/469 (86.78%), Postives = 424/469 (90.41%), Query Frame = 0

Query: 1   METHSKGKCREDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEV 60
           ME+HSK   RED EVDIIE SNKTDPKFCGKEDPDATEYSSSF ETSDADNCSGF+EGEV
Sbjct: 1   MESHSKANSREDLEVDIIEGSNKTDPKFCGKEDPDATEYSSSFGETSDADNCSGFSEGEV 60

Query: 61  ETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKY 120
           ETQFFGDIGLPPTFGSFS  L IRKRKLT HWQNFIRPLMWRCKWTELRIKEIESQ LKY
Sbjct: 61  ETQFFGDIGLPPTFGSFSSTLQIRKRKLTAHWQNFIRPLMWRCKWTELRIKEIESQALKY 120

Query: 121 SRALAVYEQGKNSGLDPTMEDFSSKAFPFSSPYYRRKAMKRRKRKRAEDTNDISSYMSQH 180
           SRALAVYEQ K    DPTME+F SK FPFSS YYRRKAMKRRKRK+ ED  DISSYMS H
Sbjct: 121 SRALAVYEQEKVPAHDPTMEEFFSKTFPFSSQYYRRKAMKRRKRKKIEDAIDISSYMSHH 180

Query: 181 NLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLW 240
           NLFSY+ENKR+ELDGTSVADEFANPVK+EKNA SDDKFGINNDS+LE RDT+NSLEQVLW
Sbjct: 181 NLFSYFENKRSELDGTSVADEFANPVKVEKNADSDDKFGINNDSILESRDTDNSLEQVLW 240

Query: 241 KIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300
           KIE VHSRLHKLKGQMDKVMSKNAA FSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV
Sbjct: 241 KIEVVHSRLHKLKGQMDKVMSKNAAIFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSV 300

Query: 301 GVMCASAQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGD 360
            VMCAS Q ISECDIG+LMKPESAISS+G+AILVPDIIESTVG LTAT DVS+PQPQIGD
Sbjct: 301 SVMCASTQRISECDIGDLMKPESAISSFGDAILVPDIIESTVGNLTAT-DVSLPQPQIGD 360

Query: 361 STEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMG 420
           STE IVDNVL HNEV EAERN DS++V QPVEKH E EK  Q EGT L S PTTQPDPMG
Sbjct: 361 STEAIVDNVLTHNEVVEAERNTDSKVVAQPVEKHHEPEKVSQGEGTSLSSNPTTQPDPMG 420

Query: 421 KALISDEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 470
           KAL+S+EQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ
Sbjct: 421 KALVSEEQSALKKCLASDINFPRNKRKRGERKAGPGSWNKKHSSEPDSQ 468

BLAST of Tan0002099 vs. TAIR 10
Match: AT3G59670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37440.2); Has 77 Blast hits to 77 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 73; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 353.2 bits (905), Expect = 3.2e-97
Identity = 228/482 (47.30%), Postives = 314/482 (65.15%), Query Frame = 0

Query: 11  EDFEVDIIEC-SNKTDPKFCGKEDPDATEYSSSFAETSD------ADNCSGFNEGEVETQ 70
           E+ +VDI+E   NKT       EDP+ATEYSSSF++T+        D  +G  E EVE+ 
Sbjct: 56  EELDVDIVESDENKTSTT---DEDPNATEYSSSFSDTASENAEMLLDGLTG--EAEVESH 115

Query: 71  FFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRPLMWRCKWTELRIKEIESQTLKYSRA 130
           ++ +  L P + SFS     RK++LTNHW+ FIRPLMWR KW ELRI+E+ES+ L+Y + 
Sbjct: 116 YWDETDLGPAYDSFSSIFHFRKKRLTNHWRRFIRPLMWRSKWVELRIRELESRALEYPKE 175

Query: 131 LAVYEQGK-NSGLDPTMEDFSS---KAFPFSSP-YYRRKAMKRRKRKRAEDTNDISSYMS 190
           L +Y+Q K  + +DP++ +      K+ PFS+P Y +R A KRRKRK+ E T+DI+SYM+
Sbjct: 176 LELYDQEKLEANIDPSVLESCGEGIKSLPFSNPCYKKRAAKKRRKRKKVESTDDIASYMA 235

Query: 191 QHNLFSYYENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINN-DSVLEFRDTNNSLEQ 250
            HNLFSY E KR   DG  +AD+F +    +  + S++   +++ DS+   RD ++ LE+
Sbjct: 236 CHNLFSYIETKRLSSDGMGLADDFGD--AKDPRSDSNEPVDLDDADSLFHHRDGDSVLEE 295

Query: 251 VLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSA-GNG 310
           VLWKIE VHS++H+LK Q+D V+SKN A+FSSSENLSLLA      SSAPSPT SA GNG
Sbjct: 296 VLWKIELVHSQVHRLKTQVDVVLSKNTARFSSSENLSLLA-----ASSAPSPTVSAGGNG 355

Query: 311 E-LSVGVMCASAQHISECDIGELM-KPESAISSYGEAILVPDIIESTVGLLTATADVSIP 370
           + +S G +  ++QH+++  +G+++   E  ISSYG+A  +PDIIESTVGL  A ADV++ 
Sbjct: 356 DVISFGAIYNASQHMADYGLGDIVFSSEGVISSYGDAFHIPDIIESTVGLF-ADADVTLH 415

Query: 371 QPQIGDSTEDIVDNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTT 430
             QIGDS EDI+DN+LI N VAE E N D    +     H+E EK ++ EGT +  +  T
Sbjct: 416 HHQIGDSCEDILDNILIRNGVAE-EMNGD----LMETSCHDEAEKAEEGEGTSVPPLQQT 475

Query: 431 Q------PDPMGKALISDEQSALKKCLASDINFPRNKRKR-GERKAGPGSWNKKHSSEPD 470
           +       +     L   E S L+ CLAS++  PRNKR R GERKA   SW KKH S+P+
Sbjct: 476 EETEEYNQEEKSLVLQGREDSVLRSCLASEMLVPRNKRTRGGERKA--SSWCKKHLSDPE 517

BLAST of Tan0002099 vs. TAIR 10
Match: AT4G37440.2 (unknown protein; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50040.1); Has 121 Blast hits to 117 proteins in 32 species: Archae - 0; Bacteria - 6; Metazoa - 13; Fungi - 5; Plants - 66; Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink). )

HSP 1 Score: 156.0 bits (393), Expect = 7.5e-38
Identity = 106/279 (37.99%), Postives = 156/279 (55.91%), Query Frame = 0

Query: 11  EDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGL 70
           ++ EVDI+EC++  + +  G +D     YSSSF  T         N+ EV++    +  L
Sbjct: 69  DEDEVDILECNDNIEIQVSGCDD-GTDGYSSSFGGTDSEHE----NDQEVDSMICNETSL 128

Query: 71  PPTFGSFSGALPIRKRKLTNHWQNFIRP-LMWRCKWTELRIKEIESQTLKYSRALAVYEQ 130
           P         L +RKRKLT+HW+ F++P LMWRCKW EL+ KE+++Q  KY + +  Y Q
Sbjct: 129 P---------LWVRKRKLTDHWRRFVQPTLMWRCKWIELKYKELQNQAQKYDKEVEEYYQ 188

Query: 131 GKNSGLDPT-MEDFSSKAFPFSSPYYRRKA--MKRRKRKRAEDTNDISSYMSQHNLFSYY 190
            K   L+    E+   KA P   P Y +K   MKR+ RKR E+T D++SY S HNLFSYY
Sbjct: 189 AKKLELENVKSEELGVKALP-PLPCYTQKTRLMKRKTRKRVEETADVTSYASNHNLFSYY 248

Query: 191 ENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVH 250
           + +++  D  ++ D   N  K  K+A  +  F       LEFR+ +  LEQ+L KIEA  
Sbjct: 249 DCRKSLAD-IALNDNSRNLDKKNKSAKDETAFS-EETPPLEFREGDAYLEQILLKIEAAK 308

Query: 251 SRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSS 286
           S    LK ++DKV+S+N + F  +  ++ L   +  TSS
Sbjct: 309 SEARNLKIRVDKVLSENPSIFPLANTVNPLGAADVYTSS 330

BLAST of Tan0002099 vs. TAIR 10
Match: AT4G37440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50040.1); Has 220 Blast hits to 205 proteins in 55 species: Archae - 0; Bacteria - 15; Metazoa - 50; Fungi - 11; Plants - 76; Viruses - 3; Other Eukaryotes - 65 (source: NCBI BLink). )

HSP 1 Score: 154.8 bits (390), Expect = 1.7e-37
Identity = 144/455 (31.65%), Postives = 222/455 (48.79%), Query Frame = 0

Query: 11  EDFEVDIIECSNKTDPKFCGKEDPDATEYSSSFAETSDADNCSGFNEGEVETQFFGDIGL 70
           ++ EVDI+EC++  + +  G +D     YSSSF  T         N+ EV++    +  L
Sbjct: 69  DEDEVDILECNDNIEIQVSGCDD-GTDGYSSSFGGTDSEHE----NDQEVDSMICNETSL 128

Query: 71  PPTFGSFSGALPIRKRKLTNHWQNFIRP-LMWRCKWTELRIKEIESQTLKYSRALAVYEQ 130
           P         L +RKRKLT+HW+ F++P LMWRCKW EL+ KE+++Q  KY + +  Y Q
Sbjct: 129 P---------LWVRKRKLTDHWRRFVQPTLMWRCKWIELKYKELQNQAQKYDKEVEEYYQ 188

Query: 131 GKNSGLDPT-MEDFSSKAFPFSSPYYRRKA--MKRRKRKRAEDTNDISSYMSQHNLFSYY 190
            K   L+    E+   KA P   P Y +K   MKR+ RKR E+T D++SY S HNLFSYY
Sbjct: 189 AKKLELENVKSEELGVKALP-PLPCYTQKTRLMKRKTRKRVEETADVTSYASNHNLFSYY 248

Query: 191 ENKRAELDGTSVADEFANPVKMEKNAVSDDKFGINNDSVLEFRDTNNSLEQVLWKIEAVH 250
           + +++  D  ++ D   N  K  K+A  +  F       LEFR+ +  LEQ+L KIEA  
Sbjct: 249 DCRKSLAD-IALNDNSRNLDKKNKSAKDETAFS-EETPPLEFREGDAYLEQILLKIEAAK 308

Query: 251 SRLHKLKGQMDKVMSKNAAKFSSSENLSLLAPCEAQTSSAPSPTFSAGNGELSVGVMCAS 310
           S    LK ++DKV+S+N + F  +  ++ L   +  TSS       A   E    +    
Sbjct: 309 SEARNLKIRVDKVLSENPSIFPLANTVNPLGAADVYTSSEQQKPLLAIKNEDEKSI---- 368

Query: 311 AQHISECDIGELMKPESAISSYGEAILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIV 370
              ISE    E     +++SS+    + P+  E+T  LL   +++   + + G S   I 
Sbjct: 369 ---ISE----EKPVKSASVSSHH---VSPEDDETTDILL---SEILASKRREGKSI--IP 428

Query: 371 DNVLIHNEVAEAERNPDSRIVVQPVEKHEEQEKGKQSEGTCLGSVPTTQPDPMGKALISD 430
           D  L+  E A  E  P      +PV K   + +           + T +     +  +S 
Sbjct: 429 DKNLVKTEQASIEEGPS-----RPVRKRTPRNR----------EIITKEESNPKRRRVSR 470

Query: 431 EQSALKKCLASDINFPRNKRKRGERKAGPGSWNKK 462
           E+      +AS   F   KRKRG+R++G     ++
Sbjct: 489 EKPKSNAVMAS--RFSNRKRKRGKRRSGSAGLRRR 470

BLAST of Tan0002099 vs. TAIR 10
Match: AT3G50040.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37440.2); Has 70 Blast hits to 70 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 121.7 bits (304), Expect = 1.6e-27
Identity = 100/363 (27.55%), Postives = 170/363 (46.83%), Query Frame = 0

Query: 40  SSSFAETSDADNCSGFNEG-EVETQFFGDIGLPPTFGSFSGALPIRKRKLTNHWQNFIRP 99
           SSSF ++  A +   F  G E ++    D  LP T    +  L + K+K  + W+   +P
Sbjct: 62  SSSFGDSMCARDGDDFGFGDEAQSMLSNDYPLPGTCDDGTEFLGLPKKKTNDRWRRLTKP 121

Query: 100 LMWRCKWTELRIKEIESQTLKYSRALAVYEQGKNSGLDPT-MEDFSSKAFPFSSPYYRRK 159
           +MWRCKW EL++KEI+SQ   Y + +  Y   K   L+ + +E F  K+ PF     RR 
Sbjct: 122 IMWRCKWIELKVKEIQSQARGYEKEVKDYYLTKQFDLEKSKLEGFDGKSIPFRENNQRRN 181

Query: 160 AMKRRKRKRAEDTNDISSYMSQHNLFSYYENK-RAELDGTSVADEFANPVKM--EKNAVS 219
             KR +RKR E+T D+++YMS HNLFSY + +    + G  +  +F    K   +++A+ 
Sbjct: 182 VFKRGRRKRVEETTDVAAYMSNHNLFSYADKRVPVNVKGQYLDSDFGTGRKATGKQDAIE 241

Query: 220 DDKFGINNDSVLEFRDTNNSLEQVLWKIEAVHSRLHKLKGQMDKVMSKNAAKFSSSENLS 279
           DD        + E   +++ L + L KI+    +  +L+ ++D++M  +    +SS    
Sbjct: 242 DDSL------ISELDCSDDVLAKFLCKIDEAQGKARRLRKRVDQLMWDSQPAHTSSMP-Q 301

Query: 280 LLAPCEAQT---SSAPSPTFSAGNGELSVGVMCASAQHISECDIGELMKPESAISSYGEA 339
           ++APC   +   +        A    +  G  C  A HI       LM P++ I   G+ 
Sbjct: 302 MVAPCHRDSMIQTGKKCALVEAPLTHVQNGQQCIPADHIE-----HLMVPQTHIG--GQC 361

Query: 340 ILVPDIIESTVGLLTATADVSIPQPQIGDSTEDIVDNVL------------IHNEVAEAE 383
           +     I S++       D+ + +P++ D   +  D  L            +  E A+AE
Sbjct: 362 LTNNSPISSSLRFHPILEDLLMDEPEMNDHEMEGDDKKLDYFMKIINQITGVPREEADAE 410

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023540257.19.5e-24593.18uncharacterized protein LOC111800683 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022972330.13.6e-24492.96uncharacterized protein LOC111470900 [Cucurbita maxima][more]
KAG7029121.14.0e-24392.75hypothetical protein SDJN02_10306, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG6597677.14.0e-24392.75Protein IQ-DOMAIN 14, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022932640.18.9e-24392.54uncharacterized protein LOC111439133 isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1I8A31.8e-24492.96uncharacterized protein LOC111470900 OS=Cucurbita maxima OX=3661 GN=LOC111470900... [more]
A0A6J1F2Q44.3e-24392.54uncharacterized protein LOC111439133 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1E2X43.0e-22886.78uncharacterized protein LOC111025545 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A5A7TML91.1e-22586.99Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A1S3BXR73.1e-22586.78uncharacterized protein LOC103494594 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
AT3G59670.13.2e-9747.30unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G37440.27.5e-3837.99unknown protein; LOCATED IN: cellular_component unknown; BEST Arabidopsis thalia... [more]
AT4G37440.11.7e-3731.65unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G50040.11.6e-2727.55unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 381..402
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 381..469
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 439..469
NoneNo IPR availablePANTHERPTHR34057:SF1ELONGATION FACTORcoord: 5..468
NoneNo IPR availablePANTHERPTHR34057ELONGATION FACTORcoord: 5..468
IPR038745AT4G37440-likeCDDcd11650AT4G37440_likecoord: 11..262
e-value: 4.75489E-80
score: 247.336

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002099.1Tan0002099.1mRNA
Tan0002099.2Tan0002099.2mRNA