Tan0011674 (gene) Snake gourd v1

Overview
Name: Tan0011674
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Exosome complex exonuclease
Location: LG01: 101534152 .. 101554777 (-)
RNA-Seq Expression: Tan0011674
Synteny: Tan0011674
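
The Location field above packs the linkage group, the start and end coordinates, and the strand into one string. A minimal sketch of splitting it apart and deriving the gene span follows. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'LG01: 101534152 .. 101554777 (-)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. Adjust if the source database uses another convention.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("LG01: 101534152 .. 101554777 (-)")
print(loc)
```

Under that assumption the gene spans 20626 bp on the minus strand of LG01.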
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAGACCTAAAAATATTGTTTTCTTCGGTTTAAGAAGCGGTCAGTGAAATTCATTGATTTGCTTCATCATTCATTCAAAGCAGTTTGCTGCTCCCATCGACGTTACGAGTATTTAACTTCATCGGATTCTTTTCGCTTGATTTCGGTATTGTTCTTCTCAATCACCTTGTTCCGGGAAGCCAATCTTCGATTGCGTTCTGGTTAGACAAGGATTTCAACAACCTTGATTCTCAGCTTTCATTTCAGTAATGGTATGCCCTCCACCTGTTCGACAAAATGCCACAATGAAGTTCCAGCCTATGCCAGAGTCACTTCCGCTGTTCGATGTTGAACCAATGAGAATATACAAAGCTACTGCTCGGAATTTTTCAAGGGAGTAACTTCATGAAGTGATTTTGGAGGCATGATTATGGCATATAAGCATTCTCAGAGGTTGCAGTTCATCTGTCGCGTAATTCATCTTAACAACACGAGATGTGGGGCTTTGTATTCAAATCCGATGCAATTTCACTGCAAAGAGGACTCCTTCGCCAATCAAAAGCTGTTACCCGCCGATTGGTACGAGAAGGTATTTCCGAAGGTAAAGAAATTAAGCTGCTCGCTGAAGAATGTCGATTTGATCGATGGACGACTTGTTAACGTTAACGACGATTCAACCGTTATCGATGAGCGTATTGAACAGAGAATGCGTACTTTGAAGTCCCTTGTAAGAGTCTTCGTTGGTTCTCCATCAGCTCAGAGGAGAGTAACAGAAATGGCTGAATCGACTTCTACAAATTGCCGGCCTCACGCATATTTCAGAAATCCAAGTGAAAGAGAGCCAATAGTTGTTGATTCCCTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACACCATATGTCCACAGGTTAGTCTTTACATCACCGCTAAACTCAAAAGTTTAAAAGCATATAGATTAGGAACTATTTAGTCTTGTATAAATGTGTGTATATATATTGTTAGACCCCAATTGTGAATAAGTACGTTTGTCACACCCCTAAAGGTCGGGCATCCCAATGGCCTAGAGGATGCTCCGGACACCGATCGGAGGGAGGCAAACGTGCAGGTAGGAGTTCGATGGAGAAAGACAACAACACTTGGCTGATTCGAATTAAGTGTTGTCACATACACTCGAAGATGGAAGAAGCCCATGGACACTCAAAGATGCATGTGTTCCTTTGAGTGCCATGTCCCTTGCTCAATGATGATCCATGCTTGGTAGCAGTCCATAGGTGGGCGTGAAAGGACCAACAATGGTGGGTAGGAGATGTCGCCCACTCAGGAGTGACGCAACAGGACGGGGAGTGTCTGTCGCAGCCAGCTGCAACGTCAGCATCAAGGAGCAAGACACCAGGCTCGCAGGAGGCGAGCTGACCATGGACGTGCCCAGGACGCAGCTGCACGCGCCCCACCTCTCGGCCGTTAAGGGGGGCAGGCGCGCCTCTTCGTCGAGTGCGGCAACCCCATCCAACAAGGCGCACGAATGTTGTCCTAGGGAAGGAGTCACCTATGGGAGACAGGCAACCGACCAGGCAAAGACCACCATCAAGCCCATTGAACGCACGAAGCGTGTGCCAGCTTATCGGAACTAGCAAGCATGAGGCTTGTAGGAGGAGCGCTAGGCTTGTGCACAACCGGAGGGTGCGCAGGGCTTTGGTGCGCGTCGACACCAACCCATGGGTGTGCGCCATGGTGGGCATGCGCCAACACAACAATGCCAGCATGAGAGCGGACGACATGTGAGACGCCAACGCATGCGCGCGCCAATGCAACGGAGACGCAGGAGTGCACATTAGGCTGGCGGACCTTCACGCACGCACGGAGCGTGCGCGCCTCAGCCTACCCAGATGCGCAACACCTGCCGCCCAGTGTCACAACCATACTAGAAGAGAGTTGGTTATGACTCCCACAAGTGCAGACAGATGCAAGGGTGGCACGCGAGTGTCGCCCACCAGG
CCATGCGAGGGCCGAGAGAAGACAAGTGCCATGATGCGATCGAAGAAGACGGTGCATCATCCAACACACCAAGAACAGTGGGCATGAATGCCAAGTTGAGGATCGAGCAAAGTTGCGAGAAACTATGTCGTTAGTGGTGTCAGTGACACGACTAGCAGGGGTCAGATGGGGCCCCGGGTCATAGCCTCGAGTCGGGGTTCCGCCGTGAGTTTTGACGAGCGGCTTTGGAGTTAAGTCCGTATATGGGCAAGATGTAAGCTCGCCAGATTTCGTGTCAATCGGAGTTCAGACGAGCGCTCTACGCTGGCGTATGACTCCTCGCCAAATTGACGTGCATGCGTGTCAAGGCTAAAAATCCTCATAATCTACTATAGGGGGGAGGCATGGTGTTGGCTCTCCACCCCCCTGGGCCTCATGGCCTAAGTTAGCTACCCCATGGCACATGCAAGTGTGTTATGGTTGGGGCGAGCGTGTCAAGACGAGTGTCTTGGGATGTCTTCCTATGGAATGAGTTTTTCAAGGACCGTGTCGGGCGGCACGGGTCTGCCGACATTACACCTTTTCCCATGGACTAGAGGTTGTACCATAGAAGCGACACGCGATACGATTATGACTCGACATGGACGCAGAAGGGTTCATGCCTAGATGACTCGGATGGATGGATGTCCGAGAGGTCCCGATCATGAAGTTAAGTCATGGGAAGGATTCAATAGTATGTCTGAGCGAGCAGCTAAGTTCGATGACCTCCCGATGCTCGGGATGGCATCACTATTTATCAGTGGGACCTGATAGTTTTCGGTTGGCTCGATATTTGCCCGTTATGTTCTGGTAAGCGTTATAATGGCCCGAAAAGCTTCAGTTTGCAATGTTTGCTCTCGATAAGCGTTATAATGCCTCAGTAAGTCCCGTTAAGTCTCGAAATCACCCGATAAGCTTCGATTAACCGTTTTTAGGCCGTTTTGACCCTATGAAGGATCTGAATGAGTCCCGATGAAGTCCTAGATCGATGTGAGACCTTTACAGACATGTCTAAGAGCTTGGAGATGATGTCTTGAGAAGCGACAGAGCATTCTAATGACCTTGGCTTAGTGTAGAGGCAAGGTGGAGCCTTGAGACTTGGCCTAGAGTAGAGGCAAGTCGGCTAGAGTCGTGTTCGAGTGAAAATACACGTACGGGCGACGTGTATGACCAAGTATACTTCCGCTGCATATCAAGAGAAGCTTGGTGGAGGGAAAGAACCTCGCCTAAGGTCCGACGAAACCACAAAAGGTTGCGATCAGAGAGACGGGGCTTGAATTCTCTTTAAGGCTGGCCATGATTGAGCAAAAAAATTATGGCGCCGCACGGTTGGAGAGATAAGACCGTGACATTTGGTATCAAAACCATCTTTTGTGTGTGTAGTCGGACAAGACAGACAGAACAAACCCAGGATTGTGTTCAGGACTTCACAAGTGCGTCAAGAGTGCCAAGAGAACAACAGGGTGGTTGTGCTGTCAGTATTAGAGACAGTGTGTGGGTCAAAGCCAGACTCCGTGAAGTGAGTCCAACCCGTCAGTGAGCAAGAACCAAGTCTATTTGCCAAGGGCTTGAGTTACATTGTGCTAAAGAGAGTTGATCAGAAGGTGTACCGGGTAGGGTGCCTTCGAAAGAGCCACTTCCATCAGAGGAATAATGTGAATGGTTGCAAACAAGAAGTCGGTAGTCAGTGTGACATATGGGTTGACGTTTTGGCTGAGTAGAGCCAATCAAGTTGAGTGGGAAAGTGCTTGTGAAAAAGGGTTGCTTGTCCCGTGAGAGTGCTCGTAAATTAGAGCACGTGTGTCCCTTAAGTCATTCCATTGGGGGGTGGGTGACTTCACGGGGTTGTCGAAACCGGTATTAGCTCATGTTGGAGCATCAGACCGGAGACAAGAGCAGAAGGGAGAGAAGCTGCAGCAAGATCAGAGAAGCATCTTTGCAGCGTTGCCACATGAGGGTGGCCACTGGCCAGTTGAGAGTTG
GCTGAAGGGGAGTGCCAACAGAGAAAAGGGCTATGTCAATGCGCTTTCAGACTTTGCTACCGTGAGAGTTGGTGCCACGATGAGTCTATTCCGTGGTTGGTTTGCGAGCCTTACCCCGAGGGGTGCATCGTTATGTGCCGCTCAGAGAATATGTGATGTGGTGGCCAGAGTGGCCAAGTGAGCTTGGGGGAGTGCTTCCCGGGGCTAGGCGAAGAGAATGCCTATGCCTCATTGACATGTCCTGTTTGGTGATTCCTGGGTCGAGTCCTAGAGTTGGGCGAGGTAGGCCCCAGAGAGGGATAGCCCGCAGGGTAGCAGTTGATCGTCGTGCTTGTGTTTCTGGCTGTCGGGCCAAGTCTTAGAGTTGGGCGCAGTAGGTCCCTGGGGCGGTTCAGGACATGAGTAGTTGACAGTTTCCATATTCGATCTGATGCACGATTATATCCAACAAGTCAGTCAGTTTCAAGCACGAGAACGGATCGTCATCGGGACGATGCTGAGTTTAAGTGGGGGAGAGTGTTACAATCCATTGTGTGATGATTTGGATTGAACCCCCCGAGTAAGTCATCGAGGACGATGACTAGTTTAAGTGGGGGAGAGTGTCACAACCATACTTGAAGAGAGTTGGTTGTGACTCCCACAAGCGCAGACAGATGCAAGGGTGGCACACGAGTGTCGCCCGCCAGGCCATGCGAGGGCCGAGAGAAGACAAGTGCCATGATGCGATCGAAGAGGACGGTGCATCATCCAACATACCAAGAACAGTGGGCTTGAATGCCAAGTTGAGGATTGAGCAAAGTTGCGAGAAACTATGTCGTCAGTGGTGTCAGTGACACGCCTAGTAGGGGTTAGATGGGCCCCGGGCCATAGCCTCGAAAGTCGGGGTTCCGCCGTGAATTTTGACGAGCGACTCTTGAGTTAAATGCGTATATGGACAAGATGTAATCTCGCCAGATTTTGTGTCAATTGGAGTCCAGGCGAGCGCTCTACGCTGGCGTATGACTCCTCGCCAAATTGATGTGCATGGGTGTTAAGGCTAAAAATCCTCATAATCTACTATAGGGGTGAGGCATGGTGTTGGCTCTCCACCACCCCTGGGCCTCATGGCCTAAGTGAGCTACCCCATGACACATGCAAATGTGTCATGGCTGGGGTGAGCGTGTCAAGATGAGTGTCTTGGGGCGTATTTCTATGGAATGAGTTTTTCAAGGACAGTGTCGGGTGTCACAATCATGCTTGTGAACCTCACAGTGATGAGACGTGCCATGATGCGACTGAGGAGGGCGGTGCATCACCAAGATAGGTGAGCTTGATGGCTAATGTTAGCACAAGTGAGATTGGGGCCAACCGTGTCGTCGGAGGGGTCGATCTATGGGCGTATGACTCCTCATCATTTCGACGTGCAAGGGAGTCAGACCTAATAATCCTCATAATATACTATAGGGGGTACCATAGTGTCGACTCTCCACCACTCCTGGGCTTGAGGGCCTAAGTGAGTCACCCCGTGGCACGTTAGCGTGTCATGGCTGGGGCGAGCGTCACAAGAGAAGTGTCTTGGGCGTCTTCCTTTGTAACGGGTTTTCAAGGGACAGTGTCAGACGACACAAGTGTGTCGACATTACACCCCTTCCCATGGCTTGGAGGTTCAGATATAGGCCAATTAGCGTCCAACGATCCGAGAATGTCCCGTTAAAGCCAACTCATGGGAAAGACCGAATGCATGTACACGGACGGGCGACGTGTACGACTGAGTTGTTTTCCGCTGCTTATTTACTTGATGCTTGCTTGAGGACTGAAGAACCTCGCCTAAGGTCCGACAAAGCCACAGAGCGTTGCGATCATAGAGATGGGGCGGGAGTTCCCCTTAAGGTTGGCCATGTTCGAGGGATGATTCATGGCGTTGCACGGTGCGAGAGGTGCGACCGTGACACTTGGTATTAGAGTCAACCTTTATGCGCGCAGTCGGACAAGAGTTCAATGGGGTCCGTTGTGGCAATTAT
TGTGACTTTCGTGGGGTGAACCAGTTGGGTGGCATCGTGCAAATCGAGACCGTTTTGACATTGACTTTCAAGGAAGGATCTGGGCAATGTTAAACACTTACACTTTCATGGAAGAGAGCCTGGGTGAAGTGTTGGTCGCCGATAGCCACTTCCGAGGAAGAGCCTGGGTGGTATCGGTTGCGTGACCTTGACGAGTCGGAAAAATCACTTTCGAGATGGAGTCTGGATGATGTTTGTTCGAGTTGCATGAAATCGATGTTTAGTGAATTGAATTGACGTTAGCAAGAGAAATCGTATTTCAAACAAGTATGGGCAATAGTTTGATGTATGAGGACAATGTTGGCGTTGACTCAGTTCGAGAGAGTTGTCAAACGAGACACGACTTCCAAACGAGGCCCAAGAGGGGCAAGTGAGCAGTTGGCGTGTGCAAAGCCAACGAGTGGGTTGTGTGGGCGGTGGTGCCCGAGTGAAGCGCAAGAGTGCGCAAGTGGTGGTGATATGGCCAGGAGTGCCAGTGTGGAAGTGCTTGTGCAGAGTGAGCACGTTGTGGTTGGAAGAGAAGCGCAAAGGCGCTGTGTTGAGGAGTTTCCACGTGGGGTGGCCTGGTGCAATGAGTTAGCCAGTGGAGTGCCAACGGAGTGAGAAGCTTTCCGACTTTGCTAAAGTTGTATGTTGCTCAAAGCATACGTATGTTGTGGCCAAGGTGGCCAAGTGAGTTTTGGGACGTGTTCCCGTAACTAGGCGAGAGGAAACCCAGGTTTGCTGCCATGTTTGTCTGGCAAGTTTCGGGCCAAAGCCCCAAGAGTTGTGCGATGAGAGAGCCCCGTAGGAGATAACACACAAACCAACAGTAGTTGTTGTGGCGAAATTTTCGTATCTGTCTCAGGCGCAATTTTCAAGCAAGTCCGTTAGCAGTGGGACGACATCGGGGCGATGCCAACTTCGAGTGAGGGAGGGTGTCACAATCCATTGTGCGATGGAAGTGGGTTGTGCCCCTTAGTGATGTCATCGAGGACGATGACGAGATTAAGTGGGGGAGAATGTCACAATCAGACTAGAAGAGCGTATGCTTGTGAACCTCACAGTGATGATACGTGCCATGATGCGACCGAGGAAGACGGTGCATCATCTAACCCACCAAGATAGGTGGGCTTGATGGCTAATGTGAGGACAAGTGAGATAACCGTGTCGTCGGAGGGGTCGATGTATGAGCGTATGACTCCTCGCTATTTTGACGTGCAAGGGAGGCAAACCTAATAATCCTCATAATATACTATAACGGGTACTATGGTGTCGACTCTCCACCACCCCTGGGCTTGGGGGCCTAAATGTGTCACCCCGTGGCACGTTTGCGTGTCATGGCTGGGGCGAGCGTCACAAGAGAAGTGTCTTGAGCGTCTTCCTTTGTAACAGGTTTTCAAGTGACAGTGTCGGACGACACAGGTGTGCCCACATTGTGCCCCTTCCCATGGCTTGGAGGTTCAGATACAGGGGCCGTTAGCGGCCAGCGATCCGAGAATGTCCCGTTAAAGGCAACTCATGGGAAAGACCGAGTGCATGTACACGGACGGGCGACGTGTACGACCGAGTTGTATTTCGTTGCTTATTTACTTGATGCTTGGTTGAGGACTGAAGAACCTGCCTAAGGTCCGGCGAAGCCACAGAGCGTTGCGATCATAGAGATGGGGCGGAAGTTCCCCTTAAGGTTGGCCATGTTCAAGGGATGACTCATGGGCCGCACGGTGCGACCGTGACATCGGGTGGCACGGGTGTGCCGACATTGCGCCCTTTCTCATGGACTAGAGGTTGCACCATAGGACCCGGTAGCGGCACGCGATACGATAATGACTCGGCATGGACACGGAAGGGTTCATGCCTAGATGACTCAGATGGATGGATGTCTGAGAGGCCCCGATCTTGAAGTTAAGTCATTGGAAGGATTCGATAGCATGTCCGAGCGAGCAGCTAAATTCGATGACCTCCCGATGCTCGGGATGGCA
TCACTATTCATCAGTGGCAGTGGCAGATCCAAAATTTGTATCGAGTGGGGCTTGAATATGACATTGGAAAAAATTATTTAAAACTGTACAAACCAGAGGTTGAACCTGTGACTTGAGGCTTATAAGCCATTAACCACTATGCCAAAAGGCTTTTTTGTTATTTAACCTGACTTTTTCTATACATATTATTAAAGGTAAAAATTTGAGAGGGGGCTCGAGCCCCCCTGAGCCCATGGTAAATCCGCCCCTGATCAGTGGGACTCGATAGTTTCCGGTTGGCTCGATATTTGCTTGTTATGTCCTGGTAAGCGTTATAATGGCCCGAAAAGCTCCAGTTTGCAATGTTTGCTCCCGGTAAGCGTCATAATGCCCCGGTAAGTCTCGGTAAGTCTCGAAATCACCCGGTAAGTTCCGGTTGAGCGTTTTAGGCCGTTTTGACCCTATGAAGGGTTCGAATGAGTCCCGATGAAGTCTTGAATTTATGTGAGACCTTTACGGACATGTCTAAGAGCTTGAAGATGATGTGTTGAGAAGCAATAGAGCATTCTAATGACCTTGGCCTAGTGTAGAGGCAAGGTGGAGTCCTGAGACTTGGCTTAGAGTAGAAGCAAGTCGGTTAGAGTCGCGTTCAAGTGAAAATACACGTATGGGCGACGTGTATGACCAAGTATACTTCTGCTGCATATCAAGAGAAGCTTGGTTGAGGGAAAGAACCTCGCCTAAGGTCCGACGAAGCCACAAAAGGTTGCGATCAGAGAGACAAGGCTTGAGTTCCCTTAAGGTTGGCAATGATTGAGGAAAAAAATTATGGCGCCGCACGGTTGGAGAGATACGACCGTGACACCCAGTGGCCAGTGTCACAATCATACTAGAAGAGCGTATGTTTGTGAACCTCACAGTGATGAGACGTGTCATGATGTGGCCGAGGAGGACGGTGCATCATCCAACACACCAAGATAGGTGGGCTTGATGGCTAATGTGAGAACAAACGAGATTGGGGCCAACCATGCCGTCAGAGGGGTCGATGTATGGGCTTTTGACGTGCAAGGGAGACAGACCTAATATTTCTCATAATATACTATAGGGGGTATCATGGTGTCGACTCTCCACCACTTTTGGGCTTGAGGGCCTAAGTGAGTCACCCCGTGGGACGTTAGCGTGTCATGGCTGGGGCGAGCGTCACAAGAGAAGTGTCTTGGGCGTTTTCCTTTGTAACGGGTTTCGGCGGGCGTCGAGCGACGACACGTGTGTGCGACATTGCGCCCCTTCCCATGGCTTGGAGGTTTAGATATAGGGGTCGTTAGCAGCCAGCGATCTGAGAATGTCCCGTCAAAGCCAACTCATGGGAAAGGCCGAGTGCAAGTACACGGACGGGCGACGTGTACGACCGAGTTGTGTTCCGCTGCTTATTTACTTGATGCTTGGTTGAGGACTAAAGAACCTCGCCTATGGTCCGACGAAACCACAGAGCGTTGCGATCATAGAGATGGGGCGGGAGTTCCCCTTAAGGCTGACCATGTTCGAGGGATGATTCATGGCGCCACACGGTGCGAGAGGTGCGACCGTGACACTTGGTATCTGAGTCAACATTTGTGCGCGTAGTTGGACAAGAGTTCAATGGGGTCTGACAATTATTGTGACTTTCGTGGGGTGAACCAGTTGAGTGACATCGTGCAAATCGAGATCGTTTTGATATTGACTTTCGAGGAAGGACCTGGGCAATGTTAAACACTTACACTTTCATGGAAGAGAGTCTGGGTGAAGTGTTGGTCGTCGATTGCCACTTCCGAGGAAGAGCCTGGGTGGTATTGGTTGCGAGACCTTGACGAGTCGGGAAAGTCACTTTGGAGATGGAGTCTGGGTGATGTTTGTTCGAGTTGCCTGAAATCGATGTTTAGTGAATTGAATTGACGTTAGCAAGAGAAATCGTATGTCAAACAAGTATGGACAACAGTTTGATGTATGAAGACAATGTTGGTGTTGGCTCAGTTCGAGAGAGT
TGTCAAACGAGACGCGACTTCCAAACGAGGCCCAAGAGGGGCAAGTTAAGCAGTTGGCGTGTGCAAAGCCAACGAGTGGGTTGTGCGGGCGGTTGTGCCCGAGTGAAGCGCAAGAGTGCGCAAGTGGTGGTGATATGGCCAGGAGTGTCAGTGTGGAAGTGCTCGTATGCGAGTGAGCACGCCAGTGAGAGAGGAGAAGCGCCAGTTGTGGAGTCGCCACGTGGGGTGGCCTGTTGCGGTGAGTTGGCCAGCAGAGTGCCAACGGAGTGAGAAGCTTTCCTACTTTGCTGAAGTTGTATATCGCTCAGAGCATACATGTGTTGTGGTCAAGGTGGCCTAGTGAGTTTGGACGGGTTCCCGTAACTAGGCGAGAGGAAGCCTAGGTTTGCTGCCATGGTTTGTCTGGCGAGTCTCGGGCTAAAGCCCCAGGAGTTGGGCGAGGAGAGGGCCCCGTAGGGGATAGCACACAAGACAGCGGTAGCTGATGTGGTGAAATTTCCGTATCCGTCTAAGACATAATTTTTCAAGCAAGTCTGTTAGCAGTTGGACGGCATCGGGGCAATGCCGTGTTTGAGTGGGGGAGGATGTCAAAATCCATTGTGCGATGGAAGTGGATTGTGCCCCTTAGTGATGTCATCGAGGACGATGAAGAGATTAAGTGGGGGAGAATGTCACAATCATACTAGAAGAGCGTATGTTTGTGAACCTCATAGTGATGAGACGTGCCATGATGCGACCGAGGAGGACGGTGCATTATGCAACACACCAAGATAGGTGGGCTTGATGGCTAGTATGAGGACAAGCGAGATTGTGGTCAACTGTGTCGTCGGAGGGGTCGATGTATGGACGTATGACTCCTCGCCATTTTGACGTGCAAGGAAGGCAAATCTAATATTTCTCATAATATACTATAGGGGGTACCATGGTGTCGACTCTCCACCACCCTAGGCTTGAGGGCCTAAGTGAGTCACGCCGTGGCACGTTAGCGTGTCATGGCTGGGGCGAGTGTCACATGAGAAGTGTCTTGGGCGTCTTCCTTTGTAATGGGTTTTCAAGGGACAGTGTCGGACGACACAAGTGTGCCGACATTGCGCCCCTTCCTATGTCTTGGAGGTTCAGATATAGGGGTCGTTAGCGACCAATGATCCGAGAATGTCCCGTCAAAGCCAACTCATGGGAAAGACCGAGTGCAAGTACACGGACAGTGCAATTACACGGACAGGCGACGTGTACGATCGAGTTGTGTTCCGTTGCTTATTTACTTGATGCTTGGTTGAGGACTGAAGAACCTCGCTTAAGATCTGGTGAAGCCACGGAGCGTTGCGATCATAGAGATGGGGCGAGAGTTCCCCCCAAAGGCCGCCATGTTCGAGGATGATTCATGGCGCCACACTGCAGACGCAGAGGTGCGACCGCGGCCGACACACCCGGCCTACGAGCACCACCAGCGATCGCCAAGGAGTCGCAGACGCGCGCACATGGACAATGCCGTCAACGTCGTTGTCCCGCGCGCCAATTGGTGGAAACTCCAAGAAGTTGGCGCCATCCAGCAAGGGGTCGTGTGCATAGGAGCAGTTGTGCAGAGGACTGTGAAGACACTCATTTCCTAATGAGTGAGACAACCACAGTTACCCTTCGTGGGTACTTGGGGAAGAAATTTAGAAGCTACTTTCCTTGGAGCTAAAACGACCGAGAGGTGGAGCGCGCGCGACTGCGTCCTGGGCACGTCCATGGTTAGCTCGCCTCCTGCGAGCCTGGTGTCTTGCTCCTTGATGCTGACGTTGCGACTGGTTGTGATGGACACTCGTCGTCATGTTGCGCCACTCCTGAGTGGACGACATCTCCTGCCCACCATTGCTGGTCCTTTCACGCCCACCTATGGATTGCTACCAAGCATGGATCATCATTGAGCAAGGGACATGGCACTCAAAGGAACACAAACGTCTTTGAGTGTCCATGGGCTTCTTCCATCTTCGAGTGTATGGGACAACACTTAATTC
GAACCAGCCAAGTGTTGTCTTCCTTCTCCATCGAACTCCTACCTGCACGTTTGCCTCCCTCTGATCGGTGTCCGGAGCATCCTCTAGGCCATTGGGATGCCCGACCCTTAGGGGTGTGACAAACCTCCCTCACTTAATCTCCAATGTCCTCGTTGAGATCCCAAAATAACCTCTGAGAAGTCAAAAAGAGAACTCGAACAAGTGAGTGGATACCTTTATTTAACGCATGCGGTTTGGCCCTCCGTTGGTCAAATTGAAATGAACTACTACTACTACTCCAAGAAGAAAAGTTGTTGGCTAGTTGCATTGAAGCCGTGTCTTGCGACACCCAAGGTCCATAATCTCCTTCTAAATCGGCTTAGAAGTTAGATGGTCCCGTCGGATCCATCTGTCCCCATATGCTTCTTCTTGGACAATCGTCTGCGTCGTGGCTGATCTCGCACAGTTCGCACTCGTCTTCATTGCAATCGGGATGCAAACACGAACCCTCTGACTCGAGATCGATCACCATTACTCCTGCATGCTCCCGACGCGCGCGCTGGGGGGTGTTCGTTCACTTGGTGGCGTGCGTTAGGTATGGCTCCTCGTTGGCCATCCTCTCGGTATCTCGCATGGTCGGCCATGGTGGTCCTCAATTGGTGAATCTCATCTCTAAGATTGTAGAGATCTCCTTGGAGTGTATCGAGACTCATCCGGGCGGTCTCTTGAATAGCAAGGAGGCATTGATTCGCTTGGCTTATGGTTTGAACGGACTCCTCGAACGCCATCACCTTGTCGTATTGTTGATCAACAAGGTCCATGATGGGTGGTGCTTGCATCAAGGGACGTTCAGGACTTAGAAATTCCGAGGTGAGTTGACCTCTCGGAATCCCCATCCTTTACTTAACACCGAGGATGCGTTCTTCGATCCTTCTTGTTTTGGCGGCGTGCAACAAATGTGCCATCTTCGTTGATAGGTATGCCCAACAAAGTGTCTGAAATTTGCTAGCTCCAACGGAACCTGGTTTGAGGTAACCCAGCTTGAACGAAATGTTTGTATGGGGGAGGGACCACTGTGGTTCGTGCTGAGGGTATTTGTAAGGGGAATGTTTCCTTGTGGGAGGTAGGCTGTTTTTGGTAGTAAGTTGGGCGTTTTAGGAGAGAACTAACTCTCTCTAATAGCTGGGGGTTTTCTTTGTATTTCTGCACTTTTTCCTTGATTGGTTGTAGCTTGAACCTTCTTTTGAATGGTATAACCACTATCTATTGTAGAGTTAGTCAATAGAACAGTATTTTCCAGGGTTGTTTGGGTTATTTACTTTGTGTTTGGCAGGGAAAGGTGAAGGATCCTAACATCTTGGTATCAGAGCACGTGAAATCGTGGGGAGAACAAGGTGAATGATGCAACGACAGATGGAGGAAAGGGTTGAAGCGACCGAAAGGGAGATTACATATCTCAAAGCAGCGATGCTGGAACTTTCCAGGAAACATGGGGATTCTGTATGTTAGGGTCTCAGATAGCCAACGAAATATGGAAGAGTCGAGAGAAATTCACCGGAAACAGTTGGAGGAGATAAAAGAAAATCAGAAAATGTTGGTTGTTTTGCTTGCGAGAAATGGAGGAGGACCTCGCGAAAGTTCAGCAACAATTACAAATCCAAACCAAGCCGCTTTTCGAGTGGGAGAAAGCTCAGGAACACGGTCGGAAAAAAATGTGGAAGCAGGAGAACAGTCGTACCCCGAAAATGGACATCTCTAGTCGTACCCGAAAATGGAGCCTCCGTCTGATCGAGGTAAGTTTAAGAAACTGGAAATGCCCGTATTTGATGGATCTAATCCCGATTCTTAGTTGTTCAGGGTTGAGACCTACTTCGAAGTTCACCAACTGAGTGGGGTAAAGAAAATCAATGTTTCCATCGTAAGCTTTGATCCCGAAGTAATCGAATGGTATCGATATACCAATACGCGATCTAAGATTTCTTCCTGGAAGGATTTGAAGTTGAGAATCTTCGAGCGATATC
GCCCCACGCAAGAAGCTAGTCTATGCACCCGATTTTTGGTCATTAAACAAGAAGGGACGGTAGCTGAGTATCGAAAAAAATTTGAGGTATCTCAGCGTCGATGCTGCACCTTTTGCCGGAAGACGTGCTGGAAAATACGTCCTTAAACGGTCTACGACCAACTGTCAGAGCGACGGTTATTAGTCGAGAACCTAAGGGGTTGGATGACATGGTTAGGCAAGCCCAACTTATCGATGACAGAGATCTTGCCCTAAAGTTGGCCCAAGAGGAGGAGGCCCAGTGGTGGGCGTGTTCAAAAACGCGGAATGGGCCAGACCCATGGAAGCAACCACTTAAGCCCCAGTTATTCCATGGGTAATAAGCCCAACCCTATTTCTGTCGGCAAATCTAATTCAAAACCTGCAGACTTCACGAAGACGGTACCGGTTCCTGAGAAGAGAGACATGAACCGTCGGGAACCTTCAAGGAGGTTATCTAACGCTGTGTTTCAGGCTAAACGAGAGAAGGGACTCTGTTTTCGTTGCGATGAGAAATATTCGGCCAACCATAGGTGTAAGAACCGTGAAGTTCGCGAGTTACGGGTCTTGCTGGTAGGAGAATTTTGGGGAATCGAGCTCGTACAGGGCGTCGGAGAGGATTTAGTCGAAGAAGCTGAACAAATTGAGCTCCAAGCATCAGAGGTGGTCGAGGTAGCAAAGCTTGACTTGAAGTTCGTTCTGGGATTTTCTACACCTGGAACAATGAGAGTTCGAGGGAGAGTTGGGGAGAAAGAAGTGATCATCCTGATCGATAGCAGGGCTACACACAACTTCATTTCGCATGGGCAACGGTAAGGCGATAAAAGGGAAGGTGGTGTGCAAAGATGTGGTCGTCCACCTCAAAGAGATATCGGTAATTAAAGATTTTCTCCCTTTCGATCTAGGTAGAGTTGATTTAGTGTTGGGAATGCAGTGGTTAAAAACAATAGGGTATATGGGAGTGGACTGGAATGAACTCACAATGACAGTCGGACATGGCGACACGAAGGTCACCATCCAAAGTGATCCCTCTCTGACCACCACCGAAATCTCGTTGAAAAATATGGCTAAGACTTGGGCAGAACGGGATGTGGGATTCCTAGTTGAATACAGGGGAATCATTGTTGAAGACGAGGAAGAGGATTCAGAAGGTGAAAACGAACAGAGCAATGGAGAACTTACTCGAGGGGTTTGTGAACTACTGCAGGAGTATGATGAGGTTTTTCATTTACCCAAAAGGTTACCTCCTAAACGAAAGGTAGACCATCGAATTGTTCAGAGAGAAGATAAACAAACTGTGAACGTCAGGCCATATCAGTACGCTCATGCTCAAAAGGTAGAAATCGAACGCTTGATTGGAGAGATGCTGGTTGCAGGGATAATTCGAACAAGCAACCACCCTTTTTCAAGTCCTGTTCTTTTAATTTTTTTAAAAAAAAGATGGAGGATGAAGATTTTGTGTTGACTATAAAACTCTAAATCAAGCAACCATACCCGACAAATTTCTAATCCCAGTCATAGAGGAGTTGTTAGATGAGTTACATGGATCGCAGATATACTCGAAGTTAGATCTTAAAGTAGGGTATCATCTAATTCGAATGTGCGAACAAGATATACCCAAGACGGCTTTCAGGACTCACGAAGGCCATTATGAGTTTTTAGTAATGCCATTTGGCTTAACCAATGCGTCAACTACCTTCCAATCATTAATGAACCAGGTATTTTGCCCCTTTTTGAGGAGGTTCATAATTGTTTTCTTTGATGTTATTTTAGTGTATAGCCCTGATTATGATACTCACCTTAAACATCTAGCTATTGTATTTGACACTCTTAAGGAGAATTCGTTGTTTGCTAACCGTGCTAAGTGTGTTTTTGCACGAAGTAGAGTGGAATATTTGGGGCACTGGGTTTCTACAGATGGTGTAGAGGCAGATCGTAGCAAAGTGCAAGCGATGTTGGATTGGCCACAACCACGATCAGTGA
GGGAGTTGAGGGGGTTCCTGGAGCTCACAGGGTATTATAGAAGATTTGTATTGAACTATAGTTTGATTGTAGGACCGTTGAATCAATTGTTGAAAAAAGACTCCTTCACATGGAATGTGGAAGCAACCCAAGCTTTTGAACAACTAAAAATAGCCATGGTAACACTTCCCGTCCTTGCTCTACCAAATTTCTCCTTACCCTTTATTATTGAAACAAATGCATCCGACAAGGGGGTGAGAGCAGTTTTGTCTCAAAGGCAACGACCCATAGCCTATTTTAGCCAGTCACTTTCAGCCAAGTCTAGACTGAAATCGGTGTATGAGCGGGAGTTAATGGCTATAGTACTCGCATTTAAAAAATGGAGACACTATCTGATTGGCCAGCGTTTTACAGTCCTTACAGATCAACGAGCATTAAAACACCTCTTAGAGCAAACGAAGGTCCAACCAAAGTACCAGAAATGGCTTACCAGATTGCTTGGCTATGACTGAGAGATCAGGTATCACCCCGACCTCTTGAATAAAGCTGTTGAGGCTTTATCTAGAATGCCAGAGAAAGAGGGAGCCACCTTGACTGGGGAGTTACTCTTACTATCAACCCTGTCTCTACTGGACCATTCAGTTATTCAAAAAGAAGTACAACAAGATCCAGAGCTAGTAAAAGTGATGGAAGAAATAACACAGGATCCTCTGAAGCATCCCAAATTCTCCTTGCAGAAGGATCAATTATGTTACAAAGGTCGAATGGTTCTCTCTCATAAGTCCTCACTGATTCCCTCCATGTTGACCACTTTTCATGACTCGATGATGAGAGGACACTTGGGGTTTTTGAGAACCTACAAACGCTTAACTGGTGAGCTATATTGGAAGGGTATGAAAGCGGATGCGAAGAAGCATGTGGAAGGGTGTTCAACCTGTCAGCGAAACAAATCCGATTTTGTTTCTCCAGCTGGGCTACTCCAACCCTTGCCTGCTCCTAATGGTATTTGGGAGGAACTCACGATGAATTTTATCGAAGGGTTACCTAGATCACATGGTAAGAGCTCAATTTTTGTGGTAGTAGACCGATTAAGCAAGTATGCTCACTTCATGGCTTTACATCATCCATTCACGGCAAAGGAGGTAGCAGATGTGTATATCAGGGAAATAGTGAGGCTACATGGTTTTCCAAAGTCGATAGTGTCCGACAGAGATAAAATTTTTGTAAGTAGATTTTAGAATGAACTTTTTCGGATGCAGGAAACCCAATTGCGCAAAAGTACTTCTTTCCATCCTCAAACGGATGGCCAGACAGAAAGGTGTTTGGAGAACTATCTTCGTTGTTTTTGTAGTGAGAAACCTAGTCAGTGTGAGCATTGGCTTCATTGGGCAGAGTATTGGTATAATACTATTTTCCATATCTCTATCCACACTACTCCCTTTAATGTGGTCTATGGTCGACAACCACCCCCCTTGTTGTTCTATGGTGAGAAGAAAACCACTAACGATTCTCTTGATCAAATGTTGTTGGAGCGTGATAAAGCACTTTTGGCTCTAAAGGAACATCTTCGAATGACCCAAGATCGTATGAAAAAGAATGTGGATATGAAGAGGAGAGATGTGGAGTTCAGGGTGGGAGACATGGTCTATCTTAAGCTTCGTCCTTATAGACAGAAGTCCATGGCCAAAAGACGGTGTGAGAAACTTTCTCCCAAGTATTTCGGTCCTTACAAGGTGCTGGAAAGGATTGGACCTGTTGCATACAAGCTTGAATTACCAACCAAGGCTGCAATCCATGATGTTTTTCATGTGTCTCAACTGAAGAAAGTGATCGGGCCCAATAAGGTGATACAATCAACAACCCCTTTATTTACTCCTGAGTATGAATGGGCAACCACACCAATGGAACGTTTGGGCGTGTGTTGAAATGAAAATTTGAACGAGGAACAATGGTTGGTTACGTGGCAAGGGGGAACCGAATGTGATGCAACTTGGGAATCGACTACGGCTTTGAGCGATCAA
TTCCCAGGACTCCACCTTGAGGACAAGGTGCCTCAAAATCTGGAGGGTATTGTTAGACCCCAATTGTGAATACGTACTCTAGAAAGGGTAAAAAGGGAACTGAGTGAAAGTTAGTTATGTGTGGCAAGGGGAGGGACCATTGTGGTTCGTGTTGAGGGTATTTGTAAGGGGGATGTTTCCTTGTGGGAGGTAGGCTGTTTTTGGTAGTAAGTTGGGCATTTTGGGAGAGAATCAGCTCTCTCTAATAGCTAGGGGTTTTCTTTGTATTTCTGCACTTTTTCCTTGCTTGGTTGTAGCTTGAACTTTCTTTTGAATGATATAACCACCGTCTATTGCATAGTTAGTCAATAGAACTATGTTTTCCAGGGCTGTTTGGGTTATTTACTGTGTGTTTGGCAGGAAAAGGTCAAGGATCCTAACATTCTCATATGTGAGCTTGAGTATTAATTTGATTTGTTCAGTTAAATACATGAGCTATTGATAACAATTGGTGTTAGCTTACTTCGATTAGTTTCATGGAACAACTGCCAACTGCTCATATACGCGCATTTACGTGTATATATACTCGATCAATAGATTAGAAGTTTGAATTCTTCGAACTAAAAAAACATTTCATGAAACAACTTGTTAGGATGCTCCTAACAAGAATACTAAACCACAATATATTGCAATATAAAAGGTAAAAAGAACTCCAAGATAAACTAAGTCTTTCGAGGTACTTGGACTCTCTCCTCTTCAGATCTCTCTCAAGCTCTAATTCTCTCACACCCCCTCTATTTATAACCAAACTTCCTTTAGCTAATTACCACTAATGTCCTCATTACTATTCCCAAGATATCTCTATTAGTACTCTCACACATCTACAATACCTTACCATATAGAACTATAGAAGATGGTTAAGAAATGCTAGGATTTATTTTGGTGCTCAGACAAATCACTACTAAACCTAAAAGTTTGAACGGATATATTAGTATATATTTAATCTTTTATATCTATTATTTTAACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAAAAATGGAGTTGGATCCACTGGCTCATCAGTCACCCAACAAAGGAATCAAAATGGGGCAGCAGATAGTTTCAAGTTGCCTAAAGTTTTTGGATGATGCCACCAATTCAAATGCTCACTTCACTTCATGGATGCGGCCAGCACCGTTGCAACCCGTTGTCGATTCATCTGCATCGCCCAGATGGGAAGACATGCTCGAGATGTTCACCGATCTGATCGACTCTATGAAAGACGAAAAGAGTTTGCTCCATTATGTAACAAAGCTTGAGGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTGCTGACTGATAAAAGCATTGGATTCAAGGAAGCAAGGCATCAAGAAAGCCTGGTGCAGAAAAAGCTTTCAAAGACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTTTACTATCTTTTTGGGCATTTTAGAGATGTTGAAGTTGATCTTTGTGGTGGGTTTTTGAAGGCTGTTGAAAATGACAAGTTTTTGTTGTTCATGGGGAGGGTTTTGAGTTGTGATGAGGAGAAAATTGTTTGGAATGGGGTGAGGCAGCTTGATAGAGCAATGGGGCTTTTTAAATTTGTTTGGGAAACAGCTGGAATGAAGGGAGAATTGGAATTGCAAGGCCATTTATTTTGTGTTGGGGCTGAGAATAGGCAGCTTAGTTATAAAGGAAATGCTTATTTATTACATGAGGTCAAATTATAACCAAACATCTCCCTTGATTATTTCTTATAACTTTGTGCTTAAGAAAATAAAGAACTCTTTTATCTTACCTTTATTTTTTTTTGCTCAAAGATTACTGCTTTTAAAGTCGTGATAACTAAATAAAGGAAAGATCTTGCTAACTAATGAGTAGGATATAACTTATAAAGTCTTGTTTTTAGGTGTCAAATTGCTCAGCAGGTTAAGGCTATATCATCAACC
AAGAGACTAGAGGTTCAAATCTCCACCCTACATGTTGAATTAAAAAGAAAACATGTTTTTAGGTGAGAAATTTTTTAAGTGCTTTTGATATTTAATGATTTCCAGTAGTAGGGCAGGTCGGAATGATTTTCAAATGCTTAGAAAAATATTTTCGAGCATTAAAAAATATTTTTAAGCACTTAAAAAGTCATACTAAATAAACCCATAATGTTTCTATGCATTTTTTAAAGTAATGTTTTCAATTGATGCAAAACATTTTTTGGTTGAGGTCATAATGTCCTTTTTTTTAAAGATTTATTTTTACTTTTCCTTCATCTTCTCTTATTTTCTTCTTCCTCTATCTCTATTTCTTCTTGCTCTAACGTGACCTATTTTTTATTTTCTTTTTATTTCCTTGGCTCCAACACCACCGTTGAAAACGATAATGACAAGAAAGATTTTCTCGCCGCGTTGATGTAGTGCCTTACTCAGTCCACCCTTCTTGATTTTCAAAAACTTGTTTCTGCCCACATATATGAAGTACCCATCTTTTGTGGTTGGTTTCTGATATTTGGCAATTCTGTCTTTTTATTCTGTTGAGTGGTTTGATTGGATAAATAGAAAATAAAACAATTGAAGAGCGTGGC

mRNA sequence

CAGACCTAAAAATATTGTTTTCTTCGGTTTAAGAAGCGGTCAGTGAAATTCATTGATTTGCTTCATCATTCATTCAAAGCAGTTTGCTGCTCCCATCGACGTTACGAGTATTTAACTTCATCGGATTCTTTTCGCTTGATTTCGGTATTGTTCTTCTCAATCACCTTGTTCCGGGAAGCCAATCTTCGATTGCGTTCTGGTTAGACAAGGATTTCAACAACCTTGATTCTCAGCTTTCATTTCAGTAATGGTATGCCCTCCACCTGTTCGACAAAATGCCACAATGAAGTTCCAGCCTATGCCAGAGTCACTTCCGCTGTTCGATGTTGAACCAATGAGAATATACAAAGCTACTGCTCGGAATTTTTCAAGGGAGTAACTTCATGAAGTGATTTTGGAGGCATGATTATGGCATATAAGCATTCTCAGAGGTTGCAGTTCATCTGTCGCGTAATTCATCTTAACAACACGAGATGTGGGGCTTTGTATTCAAATCCGATGCAATTTCACTGCAAAGAGGACTCCTTCGCCAATCAAAAGCTGTTACCCGCCGATTGGTACGAGAAGGTATTTCCGAAGGTAAAGAAATTAAGCTGCTCGCTGAAGAATGTCGATTTGATCGATGGACGACTTGTTAACGTTAACGACGATTCAACCGTTATCGATGAGCGTATTGAACAGAGAATGCGTACTTTGAAGTCCCTTGTAAGAGTCTTCGTTGGTTCTCCATCAGCTCAGAGGAGAGTAACAGAAATGGCTGAATCGACTTCTACAAATTGCCGGCCTCACGCATATTTCAGAAATCCAAGTGAAAGAGAGCCAATAGTTGTTGATTCCCTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACACCATATGTCCACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAAAAATGGAGTTGGATCCACTGGCTCATCAGTCACCCAACAAAGGAATCAAAATGGGGCAGCAGATAGTTTCAAGTTGCCTAAAGTTTTTGGATGATGCCACCAATTCAAATGCTCACTTCACTTCATGGATGCGGCCAGCACCGTTGCAACCCGTTGTCGATTCATCTGCATCGCCCAGATGGGAAGACATGCTCGAGATGTTCACCGATCTGATCGACTCTATGAAAGACGAAAAGAGTTTGCTCCATTATGTAACAAAGCTTGAGGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTGCTGACTGATAAAAGCATTGGATTCAAGGAAGCAAGGCATCAAGAAAGCCTGGTGCAGAAAAAGCTTTCAAAGACACTGGGCCACTCATCCAGGTGTCAAATTGCTCAGCAGGTTAAGGCTATATCATCAACCAAGAGACTAGAGGTTCAAATCTCCACCCTACATGTTGAATTAAAAAGAAAACATGTTTTTAGATTTTCTCGCCGCGTTGATGTAGTGCCTTACTCAGTCCACCCTTCTTGATTTTCAAAAACTTGTTTCTGCCCACATATATGAAGTACCCATCTTTTGTGGTTGGTTTCTGATATTTGGCAATTCTGTCTTTTTATTCTGTTGAGTGGTTTGATTGGATAAATAGAAAATAAAACAATTGAAGAGCGTGGC

Coding sequence (CDS)

ATGATTATGGCATATAAGCATTCTCAGAGGTTGCAGTTCATCTGTCGCGTAATTCATCTTAACAACACGAGATGTGGGGCTTTGTATTCAAATCCGATGCAATTTCACTGCAAAGAGGACTCCTTCGCCAATCAAAAGCTGTTACCCGCCGATTGGTACGAGAAGGTATTTCCGAAGGTAAAGAAATTAAGCTGCTCGCTGAAGAATGTCGATTTGATCGATGGACGACTTGTTAACGTTAACGACGATTCAACCGTTATCGATGAGCGTATTGAACAGAGAATGCGTACTTTGAAGTCCCTTGTAAGAGTCTTCGTTGGTTCTCCATCAGCTCAGAGGAGAGTAACAGAAATGGCTGAATCGACTTCTACAAATTGCCGGCCTCACGCATATTTCAGAAATCCAAGTGAAAGAGAGCCAATAGTTGTTGATTCCCTCACCAAGGTCAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACACCATATGTCCACAGGTTACACAACATCACATTTGGACTGGTGCATTGGATCATATGCTGAAAGAGTTAAAAATGGAGTTGGATCCACTGGCTCATCAGTCACCCAACAAAGGAATCAAAATGGGGCAGCAGATAGTTTCAAGTTGCCTAAAGTTTTTGGATGATGCCACCAATTCAAATGCTCACTTCACTTCATGGATGCGGCCAGCACCGTTGCAACCCGTTGTCGATTCATCTGCATCGCCCAGATGGGAAGACATGCTCGAGATGTTCACCGATCTGATCGACTCTATGAAAGACGAAAAGAGTTTGCTCCATTATGTAACAAAGCTTGAGGTTATGAAAGAGGGGCTTTCCCAGATCAAAGATGTGCTGACTGATAAAAGCATTGGATTCAAGGAAGCAAGGCATCAAGAAAGCCTGGTGCAGAAAAAGCTTTCAAAGACACTGGGCCACTCATCCAGGTGTCAAATTGCTCAGCAGGTTAAGGCTATATCATCAACCAAGAGACTAGAGGTTCAAATCTCCACCCTACATGTTGAATTAAAAAGAAAACATGTTTTTAGATTTTCTCGCCGCGTTGATGTAGTGCCTTACTCAGTCCACCCTTCTTGA

Protein sequence

MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKVKKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAESTSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPVVDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEARHQESLVQKKLSKTLGHSSRCQIAQQVKAISSTKRLEVQISTLHVELKRKHVFRFSRRVDVVPYSVHPS
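
The protein sequence above is the conceptual translation of the coding sequence (CDS). As a sketch of that relationship, the following translates DNA with the standard genetic code and stops at the first stop codon. It assumes the standard nuclear code applies to this gene; `translate` is an illustrative helper, not a function from the database or from any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0 up to the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last six codons of the CDS shown in this record.
print(translate("ATGATTATGGCATATAAGCAT"))
print(translate("TCAGTCCACCCTTCTTGA"))
```

The two fragments correspond to the start (MIMAYKH) and the end (SVHPS) of the protein sequence listed in this record.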
Homology
BLAST of Tan0011674 vs. NCBI nr
Match: XP_022982191.1 (uncharacterized protein LOC111481093 [Cucurbita maxima] >XP_022982192.1 uncharacterized protein LOC111481093 [Cucurbita maxima])

HSP 1 Score: 749.2 bits (1933), Expect = 2.0e-212
Identity = 362/427 (84.78%), Positives = 391/427 (91.57%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIMAYKHSQRL F+CRVIHLN TRC ALYSNPM +HC EDSF +Q+ LPADWYEK FPK+
Sbjct: 1   MIMAYKHSQRLIFVCRVIHLNITRCAALYSNPMLYHCSEDSFDDQERLPADWYEKAFPKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLSCSLKNVDLIDGRLVNVNDDST++DERIEQRMR  KSLVRVF+GS S QRRVTEMA 
Sbjct: 61  KKLSCSLKNVDLIDGRLVNVNDDSTILDERIEQRMRIFKSLVRVFIGSSSVQRRVTEMAA 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           ST+ N +P A FRN SEREP+VVDS TKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGA
Sbjct: 121 STTINWQPQACFRNSSEREPMVVDSFTKVSNFLNVSAQQRKLVRHTICPQATQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPV 240
           LDH+LKELKMELDPLAH SPNKGIKMGQQIVSSCLKFL+DATNSNAH TSWMRPAPLQ  
Sbjct: 181 LDHVLKELKMELDPLAHHSPNKGIKMGQQIVSSCLKFLNDATNSNAHITSWMRPAPLQRN 240

Query: 241 VDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEA 300
           VDSS SP+WEDMLEMFTDLI ++KDEK L  YVTKLEVMKEGL+QI+DVL DKSIGFKEA
Sbjct: 241 VDSSTSPKWEDMLEMFTDLIGTLKDEKGLHQYVTKLEVMKEGLTQIRDVLADKSIGFKEA 300

Query: 301 RHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLLFMGR 360
           +HQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGG LKAVE +K+L+FMGR
Sbjct: 301 KHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGLLKAVEKEKYLVFMGR 360

Query: 361 VLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNAY 420
           +LSCDEE+ VWNGVRQLDRAMGLFKFVWETAGMKG+L L+GHLFCVGAE+RQLSYKGN Y
Sbjct: 361 ILSCDEERTVWNGVRQLDRAMGLFKFVWETAGMKGDLVLRGHLFCVGAEDRQLSYKGNVY 420

Query: 421 LLHEVKL 428
           L+HE+ L
Sbjct: 421 LVHEISL 427
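
Each HSP summary line reports identities and positives as counts over the alignment length, followed by a percentage. A minimal sketch of parsing such a line and recomputing the percentages from the counts is given below. The regular expression and the `parse_hsp_stats` name are assumptions made for illustration; the pattern deliberately tolerates spelling variants of the positives label.

```python
import re

# Matches e.g. "Identity = 362/427 (84.78%), Positives = 391/427 (91.57%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positive_pct": round(100 * positive / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 362/427 (84.78%), Positives = 391/427 (91.57%), Query Frame = 0"
)
print(stats)
```

Recomputing from the counts is a cheap consistency check: for the Cucurbita maxima hit, 362/427 and 391/427 reproduce the reported 84.78% and 91.57%.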

BLAST of Tan0011674 vs. NCBI nr
Match: XP_022940414.1 (uncharacterized protein LOC111446029 [Cucurbita moschata] >XP_022940415.1 uncharacterized protein LOC111446029 [Cucurbita moschata])

HSP 1 Score: 746.1 bits (1925), Expect = 1.7e-211
Identity = 362/428 (84.58%), Positives = 391/428 (91.36%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIMAYKHSQRL F+CRVIHLN TR  A YSNPM +HC EDSF + + LPADWYEK FPK+
Sbjct: 1   MIMAYKHSQRLMFVCRVIHLNITRRSAFYSNPMLYHCTEDSFDDHERLPADWYEKAFPKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLSCSLKNVDLIDGRLVNVNDDST++DERIEQRMR  KSLVRVF+GSPS QRRVTEMA 
Sbjct: 61  KKLSCSLKNVDLIDGRLVNVNDDSTILDERIEQRMRIFKSLVRVFIGSPSVQRRVTEMAA 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           ST+TN +P   FRN SEREP+VVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGA
Sbjct: 121 STATNWQPQTCFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQATQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPV 240
           LDH+LKELKMELDPLAH SPNKGIKMGQQIVSSCL FL+DATNSNAH TSWMRPAPLQ  
Sbjct: 181 LDHVLKELKMELDPLAHHSPNKGIKMGQQIVSSCLNFLNDATNSNAHITSWMRPAPLQHN 240

Query: 241 VDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEA 300
           VDSS SP+WEDMLEMFTDLI ++KDEK L  YVTKLEVMKEGL+QI+DVLTDKSIGFKEA
Sbjct: 241 VDSSTSPKWEDMLEMFTDLISTLKDEKGLHQYVTKLEVMKEGLTQIRDVLTDKSIGFKEA 300

Query: 301 RHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVE-NDKFLLFMG 360
           +HQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGG LKAVE  +K+L+FMG
Sbjct: 301 KHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGLLKAVEKEEKYLVFMG 360

Query: 361 RVLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNA 420
           R+LSCDEE++VWNGVRQLDRAMGLFKFVWETAGMKG+L LQGHLFCVGAE+RQLSYKGN 
Sbjct: 361 RILSCDEERVVWNGVRQLDRAMGLFKFVWETAGMKGDLVLQGHLFCVGAEDRQLSYKGNV 420

Query: 421 YLLHEVKL 428
           YLLH++ L
Sbjct: 421 YLLHQISL 428

BLAST of Tan0011674 vs. NCBI nr
Match: XP_023524216.1 (uncharacterized protein LOC111788189 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 742.7 bits (1916), Expect = 1.8e-210
Identity = 360/428 (84.11%), Positives = 390/428 (91.12%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIMAYKHS RL F+CRVIHLN TRC ALYSNPM +HC EDSF +Q+ LPADWYEK FPK+
Sbjct: 1   MIMAYKHSHRLMFVCRVIHLNITRCAALYSNPMLYHCTEDSFDDQERLPADWYEKAFPKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLS SLKNVDLIDGRLVNVNDDST++DERIEQRMR  KSLVRVF+GSPS QRRVTEMA 
Sbjct: 61  KKLSSSLKNVDLIDGRLVNVNDDSTILDERIEQRMRIFKSLVRVFIGSPSVQRRVTEMAA 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           ST+TN +P A FRN SEREP+VVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGA
Sbjct: 121 STATNWQPQACFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQATQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPV 240
           LDH+LKELKMELDP AH SPN+GIKMGQQIVSSCL FL+DATNSN H TSWMRPAPLQ  
Sbjct: 181 LDHVLKELKMELDPRAHHSPNEGIKMGQQIVSSCLNFLNDATNSNTHITSWMRPAPLQRN 240

Query: 241 VDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEA 300
           VDSS SP+WEDMLEMFTDLI ++KDEK L  YVTKLEVMKEGL+QI+DVLTDKSIGFKEA
Sbjct: 241 VDSSTSPKWEDMLEMFTDLIGTLKDEKGLHQYVTKLEVMKEGLTQIRDVLTDKSIGFKEA 300

Query: 301 RHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLLFMGR 360
           +HQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGG LKA E +K+L+FMGR
Sbjct: 301 KHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGLLKAAEKEKYLVFMGR 360

Query: 361 VLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAE-NRQLSYKGNA 420
           +LSCDEE++VWNGVRQLDRAMGLFKFVWETAGMKG+L LQGHLFCVGAE +RQLSYKGN 
Sbjct: 361 ILSCDEERVVWNGVRQLDRAMGLFKFVWETAGMKGDLVLQGHLFCVGAEDSRQLSYKGNV 420

Query: 421 YLLHEVKL 428
           YLLH++ L
Sbjct: 421 YLLHQISL 428

BLAST of Tan0011674 vs. NCBI nr
Match: XP_038896888.1 (uncharacterized protein LOC120085101 isoform X1 [Benincasa hispida])

HSP 1 Score: 722.2 bits (1863), Expect = 2.6e-204
Identity = 354/428 (82.71%), Positives = 388/428 (90.65%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIMAYK+ QRL FI R+ HLNNTR GAL SN M +HC E S A+Q++LP++WYE  F K+
Sbjct: 1   MIMAYKNLQRLFFISRLKHLNNTRFGALQSNSMLYHCAEHSSADQEVLPSEWYENAFRKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLSCSLKNVDLIDGRLVNVNDDST+IDE IEQRMRT KSLV V +GSP+A+RR+TEMA 
Sbjct: 61  KKLSCSLKNVDLIDGRLVNVNDDSTIIDELIEQRMRTFKSLVGVLIGSPTARRRITEMAV 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           S+S  C+PHA+FRN SEREP++VDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWTGA
Sbjct: 121 SSSITCQPHAWFRNLSEREPMIVDSLTKISNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQ-P 240
           LDHMLKEL +EL PL+ QS NKGIKMG QIVSSCLKFLDDATNSNAHFTSWMRPAPL+  
Sbjct: 181 LDHMLKELNLELVPLSRQSTNKGIKMGHQIVSSCLKFLDDATNSNAHFTSWMRPAPLRAA 240

Query: 241 VVDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKE 300
           VVDSSA PRWEDMLEMFTDLID +K+EK L+HYVTKL+VMKEGLSQIKDVLTDKSIG+KE
Sbjct: 241 VVDSSAPPRWEDMLEMFTDLIDCLKEEKCLVHYVTKLKVMKEGLSQIKDVLTDKSIGYKE 300

Query: 301 ARHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLLFMG 360
           A HQESLVQKKLSKTLGHSSRCLFTLLLYY+FGHFRD+EVDLCGG LKA  NDKFLLFMG
Sbjct: 301 ASHQESLVQKKLSKTLGHSSRCLFTLLLYYIFGHFRDIEVDLCGGLLKADGNDKFLLFMG 360

Query: 361 RVLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNA 420
           RVLS DEEKIVWNG+RQLDR MGLFKFVWETAGMKG+LELQGHLFCVG E+RQLSYKGNA
Sbjct: 361 RVLSSDEEKIVWNGMRQLDRVMGLFKFVWETAGMKGQLELQGHLFCVGTEDRQLSYKGNA 420

Query: 421 YLLHEVKL 428
           YLLHE+ L
Sbjct: 421 YLLHEINL 428

BLAST of Tan0011674 vs. NCBI nr
Match: XP_022139027.1 (uncharacterized protein LOC111010055 isoform X1 [Momordica charantia] >XP_022139037.1 uncharacterized protein LOC111010055 isoform X1 [Momordica charantia] >XP_022139045.1 uncharacterized protein LOC111010055 isoform X1 [Momordica charantia] >XP_022139054.1 uncharacterized protein LOC111010055 isoform X1 [Momordica charantia])

HSP 1 Score: 688.0 bits (1774), Expect = 5.3e-194
Identity = 345/431 (80.05%), Positives = 386/431 (89.56%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MI+A+K SQRL FI R+ HLNNTR GAL+SNPM +H  E+S A+Q+LLP++WYE  + K+
Sbjct: 1   MILAHKLSQRL-FIPRLNHLNNTRYGALHSNPMLYHSAENSSADQELLPSEWYENAYRKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTE--M 120
           +KLSCSLKNVDLIDGRLVNV DDST+ DERIEQRMR  KSLVRVFVGSPSA+RRVTE  M
Sbjct: 61  QKLSCSLKNVDLIDGRLVNVVDDSTIFDERIEQRMRAFKSLVRVFVGSPSARRRVTETMM 120

Query: 121 AESTSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWT 180
           AES++TNC+P   F N SEREP+VVDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWT
Sbjct: 121 AESSTTNCQPPWCFGNSSEREPMVVDSLTKISNFLNVSAQQRKLVRHTICPQVTQHHIWT 180

Query: 181 GALDHMLKELKMELDPLAHQSP-NKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPL 240
           GALDHMLKELK+ELDPLAHQS  NKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAP 
Sbjct: 181 GALDHMLKELKLELDPLAHQSTNNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPS 240

Query: 241 QPVVDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTD-KSIG 300
           QPVVD SASPRWEDMLEMF DLI S+K EK LL +V KLEVMKEGLSQIKDVL+D KSIG
Sbjct: 241 QPVVDPSASPRWEDMLEMFDDLIGSLKGEKPLLRHVAKLEVMKEGLSQIKDVLSDHKSIG 300

Query: 301 FKEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLL 360
            KE++HQESLVQ+KLSKTLGHSSRCLFTLL++YL+GH RD+EVD CGG LK VEN+KF L
Sbjct: 301 HKESKHQESLVQRKLSKTLGHSSRCLFTLLMFYLWGHIRDIEVDFCGGVLKDVENEKFWL 360

Query: 361 FMGRVLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYK 420
            MGR+LSCDEEK+VWNGV+QLDRAMG+FKFVWETAGMKG LELQGHL+ VGA+ RQLSYK
Sbjct: 361 VMGRILSCDEEKMVWNGVKQLDRAMGVFKFVWETAGMKGGLELQGHLWSVGAQQRQLSYK 420

Query: 421 GNAYLLHEVKL 428
           GNAY+LH++ L
Sbjct: 421 GNAYILHDITL 430

BLAST of Tan0011674 vs. ExPASy TrEMBL
Match: A0A6J1IW03 (uncharacterized protein LOC111481093 OS=Cucurbita maxima OX=3661 GN=LOC111481093 PE=4 SV=1)

HSP 1 Score: 749.2 bits (1933), Expect = 9.5e-213
Identity = 362/427 (84.78%), Positives = 391/427 (91.57%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIMAYKHSQRL F+CRVIHLN TRC ALYSNPM +HC EDSF +Q+ LPADWYEK FPK+
Sbjct: 1   MIMAYKHSQRLIFVCRVIHLNITRCAALYSNPMLYHCSEDSFDDQERLPADWYEKAFPKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLSCSLKNVDLIDGRLVNVNDDST++DERIEQRMR  KSLVRVF+GS S QRRVTEMA 
Sbjct: 61  KKLSCSLKNVDLIDGRLVNVNDDSTILDERIEQRMRIFKSLVRVFIGSSSVQRRVTEMAA 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           ST+ N +P A FRN SEREP+VVDS TKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGA
Sbjct: 121 STTINWQPQACFRNSSEREPMVVDSFTKVSNFLNVSAQQRKLVRHTICPQATQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPV 240
           LDH+LKELKMELDPLAH SPNKGIKMGQQIVSSCLKFL+DATNSNAH TSWMRPAPLQ  
Sbjct: 181 LDHVLKELKMELDPLAHHSPNKGIKMGQQIVSSCLKFLNDATNSNAHITSWMRPAPLQRN 240

Query: 241 VDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEA 300
           VDSS SP+WEDMLEMFTDLI ++KDEK L  YVTKLEVMKEGL+QI+DVL DKSIGFKEA
Sbjct: 241 VDSSTSPKWEDMLEMFTDLIGTLKDEKGLHQYVTKLEVMKEGLTQIRDVLADKSIGFKEA 300

Query: 301 RHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLLFMGR 360
           +HQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGG LKAVE +K+L+FMGR
Sbjct: 301 KHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGLLKAVEKEKYLVFMGR 360

Query: 361 VLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNAY 420
           +LSCDEE+ VWNGVRQLDRAMGLFKFVWETAGMKG+L L+GHLFCVGAE+RQLSYKGN Y
Sbjct: 361 ILSCDEERTVWNGVRQLDRAMGLFKFVWETAGMKGDLVLRGHLFCVGAEDRQLSYKGNVY 420

Query: 421 LLHEVKL 428
           L+HE+ L
Sbjct: 421 LVHEISL 427

BLAST of Tan0011674 vs. ExPASy TrEMBL
Match: A0A6J1FK17 (uncharacterized protein LOC111446029 OS=Cucurbita moschata OX=3662 GN=LOC111446029 PE=4 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 8.0e-212
Identity = 362/428 (84.58%), Positives = 391/428 (91.36%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIMAYKHSQRL F+CRVIHLN TR  A YSNPM +HC EDSF + + LPADWYEK FPK+
Sbjct: 1   MIMAYKHSQRLMFVCRVIHLNITRRSAFYSNPMLYHCTEDSFDDHERLPADWYEKAFPKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLSCSLKNVDLIDGRLVNVNDDST++DERIEQRMR  KSLVRVF+GSPS QRRVTEMA 
Sbjct: 61  KKLSCSLKNVDLIDGRLVNVNDDSTILDERIEQRMRIFKSLVRVFIGSPSVQRRVTEMAA 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           ST+TN +P   FRN SEREP+VVDSLTKVSNFLNVSAQQRKLVRHTICPQ TQHHIWTGA
Sbjct: 121 STATNWQPQTCFRNSSEREPMVVDSLTKVSNFLNVSAQQRKLVRHTICPQATQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPV 240
           LDH+LKELKMELDPLAH SPNKGIKMGQQIVSSCL FL+DATNSNAH TSWMRPAPLQ  
Sbjct: 181 LDHVLKELKMELDPLAHHSPNKGIKMGQQIVSSCLNFLNDATNSNAHITSWMRPAPLQHN 240

Query: 241 VDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEA 300
           VDSS SP+WEDMLEMFTDLI ++KDEK L  YVTKLEVMKEGL+QI+DVLTDKSIGFKEA
Sbjct: 241 VDSSTSPKWEDMLEMFTDLISTLKDEKGLHQYVTKLEVMKEGLTQIRDVLTDKSIGFKEA 300

Query: 301 RHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVE-NDKFLLFMG 360
           +HQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGG LKAVE  +K+L+FMG
Sbjct: 301 KHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGLLKAVEKEEKYLVFMG 360

Query: 361 RVLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNA 420
           R+LSCDEE++VWNGVRQLDRAMGLFKFVWETAGMKG+L LQGHLFCVGAE+RQLSYKGN 
Sbjct: 361 RILSCDEERVVWNGVRQLDRAMGLFKFVWETAGMKGDLVLQGHLFCVGAEDRQLSYKGNV 420

Query: 421 YLLHEVKL 428
           YLLH++ L
Sbjct: 421 YLLHQISL 428

BLAST of Tan0011674 vs. ExPASy TrEMBL
Match: A0A6J1CBU2 (uncharacterized protein LOC111010055 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010055 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 2.6e-194
Identity = 345/431 (80.05%), Positives = 386/431 (89.56%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MI+A+K SQRL FI R+ HLNNTR GAL+SNPM +H  E+S A+Q+LLP++WYE  + K+
Sbjct: 1   MILAHKLSQRL-FIPRLNHLNNTRYGALHSNPMLYHSAENSSADQELLPSEWYENAYRKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTE--M 120
           +KLSCSLKNVDLIDGRLVNV DDST+ DERIEQRMR  KSLVRVFVGSPSA+RRVTE  M
Sbjct: 61  QKLSCSLKNVDLIDGRLVNVVDDSTIFDERIEQRMRAFKSLVRVFVGSPSARRRVTETMM 120

Query: 121 AESTSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWT 180
           AES++TNC+P   F N SEREP+VVDSLTK+SNFLNVSAQQRKLVRHTICPQVTQHHIWT
Sbjct: 121 AESSTTNCQPPWCFGNSSEREPMVVDSLTKISNFLNVSAQQRKLVRHTICPQVTQHHIWT 180

Query: 181 GALDHMLKELKMELDPLAHQSP-NKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPL 240
           GALDHMLKELK+ELDPLAHQS  NKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAP 
Sbjct: 181 GALDHMLKELKLELDPLAHQSTNNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPS 240

Query: 241 QPVVDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTD-KSIG 300
           QPVVD SASPRWEDMLEMF DLI S+K EK LL +V KLEVMKEGLSQIKDVL+D KSIG
Sbjct: 241 QPVVDPSASPRWEDMLEMFDDLIGSLKGEKPLLRHVAKLEVMKEGLSQIKDVLSDHKSIG 300

Query: 301 FKEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLL 360
            KE++HQESLVQ+KLSKTLGHSSRCLFTLL++YL+GH RD+EVD CGG LK VEN+KF L
Sbjct: 301 HKESKHQESLVQRKLSKTLGHSSRCLFTLLMFYLWGHIRDIEVDFCGGVLKDVENEKFWL 360

Query: 361 FMGRVLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYK 420
            MGR+LSCDEEK+VWNGV+QLDRAMG+FKFVWETAGMKG LELQGHL+ VGA+ RQLSYK
Sbjct: 361 VMGRILSCDEEKMVWNGVKQLDRAMGVFKFVWETAGMKGGLELQGHLWSVGAQQRQLSYK 420

Query: 421 GNAYLLHEVKL 428
           GNAY+LH++ L
Sbjct: 421 GNAYILHDITL 430

BLAST of Tan0011674 vs. ExPASy TrEMBL
Match: A0A0A0LED2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G878820 PE=4 SV=1)

HSP 1 Score: 671.8 bits (1732), Expect = 1.9e-189
Identity = 327/425 (76.94%), Positives = 366/425 (86.12%), Query Frame = 0

Query: 3   MAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKVKK 62
           M Y   QRL FI R+ HL NTRCGA  SN M +H  EDS A Q++LP++WYEK F K+KK
Sbjct: 1   MIYNQLQRLFFISRLKHLTNTRCGASQSNSMLYHSAEDSSAVQEVLPSEWYEKAFGKIKK 60

Query: 63  LSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAEST 122
           LSC L+NVDL+DGR+VN +DDST+ DERIEQ MRT KSLVR+ +GSPSAQRR+TE+A S+
Sbjct: 61  LSCKLRNVDLMDGRVVNASDDSTISDERIEQEMRTFKSLVRILIGSPSAQRRITEIAGSS 120

Query: 123 STNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALD 182
           S NC+PHA+FRN SERE +VVDSLTKV N L V+ QQRKLVRHTICPQVTQHHIWTGALD
Sbjct: 121 SINCQPHAWFRNSSEREAMVVDSLTKVCNILGVTVQQRKLVRHTICPQVTQHHIWTGALD 180

Query: 183 HMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTSWMRPAPLQPVVD 242
            +LKEL +EL PL+H+S +KGIKM  QIVSSCLKFLD ATNSN HF+SW+RPAP + VV 
Sbjct: 181 QILKELNLELLPLSHRSTDKGIKMSLQIVSSCLKFLDTATNSNVHFSSWIRPAPSRTVVK 240

Query: 243 SSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEARH 302
           SS  PRWEDMLEMF DLI  +KDEKSL+HYVTKLEVMKEGLSQIKDV +D+SIGF+EA+ 
Sbjct: 241 SSPPPRWEDMLEMFNDLIGYLKDEKSLVHYVTKLEVMKEGLSQIKDVWSDRSIGFREAKL 300

Query: 303 QESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLLFMGRVL 362
           QESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRD+EVD CGG LK   NDKFLLFMGRVL
Sbjct: 301 QESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDFCGGLLKGDGNDKFLLFMGRVL 360

Query: 363 SCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNAYLL 422
           SCDEEKIVWNGVRQLDRAMG+FK VWETAGMKGEL L+GHLFCVG E RQLSYKGNAYLL
Sbjct: 361 SCDEEKIVWNGVRQLDRAMGIFKLVWETAGMKGELGLEGHLFCVGTEVRQLSYKGNAYLL 420

Query: 423 HEVKL 428
           HE+KL
Sbjct: 421 HEIKL 425

BLAST of Tan0011674 vs. ExPASy TrEMBL
Match: A0A1S3CR45 (uncharacterized protein LOC103503810 OS=Cucumis melo OX=3656 GN=LOC103503810 PE=4 SV=1)

HSP 1 Score: 662.5 bits (1708), Expect = 1.2e-186
Identity = 325/428 (75.93%), Positives = 367/428 (85.75%), Query Frame = 0

Query: 1   MIMAYKHSQRLQFICRVIHLNNTRCGALYSNPMQFHCKEDSFANQKLLPADWYEKVFPKV 60
           MIM Y H QR  FI R+ HL +TRCGA  SN M +H  E S  +Q++LP++WYEK F K+
Sbjct: 1   MIMLYNHLQRRFFISRLKHLTDTRCGASQSNSMLYHSPEQSSTDQEVLPSEWYEKAFGKI 60

Query: 61  KKLSCSLKNVDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAE 120
           KKLSC L+NVDL+DGR+VN +DDST+IDERIEQ+MRT KSLVR+ +GSPSAQRR+TEMA 
Sbjct: 61  KKLSCKLRNVDLMDGRVVNASDDSTIIDERIEQKMRTFKSLVRILIGSPSAQRRITEMAG 120

Query: 121 STSTNCRPHAYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGA 180
           S+S N + HA+FRN SERE +VVDSLTK  NFL V+ QQRKL+RHTICPQ+TQHHIWTGA
Sbjct: 121 SSSINGQTHAWFRNSSEREAMVVDSLTKACNFLGVTVQQRKLLRHTICPQITQHHIWTGA 180

Query: 181 LDHMLKELKMELDPLAHQSPNKGIKMGQQIVSSCLKFLDDATNSNAHFTS-WMRPAPLQP 240
           LD +LKEL +EL PL+++S NKGI M  QIVSSCLKFLDDATNSN HFTS W+RPAP + 
Sbjct: 181 LDQILKELNLELLPLSNRSTNKGIIMALQIVSSCLKFLDDATNSNVHFTSTWIRPAPKRT 240

Query: 241 VVDSSASPRWEDMLEMFTDLIDSMKDEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKE 300
           +V+SS  PRWEDMLEMF DLI  +KDEKSL+HYVTKLEVMKEGLSQIKDV +D+SIGFKE
Sbjct: 241 IVNSSPPPRWEDMLEMFNDLIGYLKDEKSLVHYVTKLEVMKEGLSQIKDVWSDRSIGFKE 300

Query: 301 ARHQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFLLFMG 360
           A+ QESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRD+EVD CGG LK   NDKFLLFMG
Sbjct: 301 AKLQESLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDIEVDFCGGLLKGDGNDKFLLFMG 360

Query: 361 RVLSCDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNA 420
           RVLSCDEEKIVWNGVRQLDRAMG+FK VWETAGMKGEL LQGHLFCV  E RQLSYKGNA
Sbjct: 361 RVLSCDEEKIVWNGVRQLDRAMGIFKLVWETAGMKGELGLQGHLFCVETEVRQLSYKGNA 420

Query: 421 YLLHEVKL 428
           YLLHE+KL
Sbjct: 421 YLLHEIKL 428

BLAST of Tan0011674 vs. TAIR 10
Match: AT5G25500.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 374.4 bits (960), Expect = 1.2e-103
Identity = 204/424 (48.11%), Positives = 290/424 (68.40%), Query Frame = 0

Query: 18  IHLNNTRCGALYSNPMQF--------HCKEDSFANQKLLPADWYEKVFPKVKKLSCSLKN 77
           IHLN+    +  +NP +F        H   DS+  + +LP +WYE   P +KKL+ +L++
Sbjct: 11  IHLNSEL--SFKANPSRFFRSFQVLYHPSVDSY--EDVLPHEWYETKLPVLKKLNRALRD 70

Query: 78  VDLIDGRLVNVNDDSTVIDERIEQRMRTLKSLVRVFVGSPSAQRRVTEMAESTSTNCRPH 137
           VDL+DG+L ++N    V D+ I ++M+  KSL R+F+GSPS Q+++ E            
Sbjct: 71  VDLVDGKLEDIN-GVIVYDDGITKKMQAFKSLARIFIGSPSIQQKLREEGRF------KF 130

Query: 138 AYFRNPSEREPIVVDSLTKVSNFLNVSAQQRKLVRHTICPQVTQHHIWTGALDHMLKELK 197
            +F + SEREP+VV+SLTKV NFLNVSAQQRKLVR T+C QVTQ+ IW G L+ +L  LK
Sbjct: 131 PFFGSESEREPLVVNSLTKVCNFLNVSAQQRKLVRSTVCSQVTQYRIWRGTLEDILNGLK 190

Query: 198 MELDPLA-HQSPNKGIKMGQQIVSSCLKFLDDATNS--NAHFTSWMRPAPLQPVVDSSAS 257
            E+D L  H+  ++G  + QQ++ SCL+FL +++ S      TSWMRP P +    ++AS
Sbjct: 191 EEVDWLVEHREMSQGRVLAQQVILSCLRFLSESSVSFEVEKSTSWMRPVPAR-YAKANAS 250

Query: 258 PRWEDMLEMFTDLIDSMK--DEKSLLHYVTKLEVMKEGLSQIKDVLTDKSIGFKEARHQE 317
            +WED+L+M  DL   ++  +E ++L+++ KL  MKEGL QIKDV  D +IGF+E RHQE
Sbjct: 251 AKWEDVLDMVNDLRRYLEHDEEITVLYHLDKLVSMKEGLLQIKDVFLDNTIGFREVRHQE 310

Query: 318 SLVQKKLSKTLGHSSRCLFTLLLYYLFGHFRDVEVDLCGGFLKAVENDKFL-LFMGRVLS 377
            LV +KLSK LG  S CLF L++Y+L+G  RD+EVDLCGGF K  E  +FL L MGR+L+
Sbjct: 311 HLVYRKLSKLLGSPSPCLFALVMYFLYGRVRDIEVDLCGGFYK--EKSEFLCLSMGRILT 370

Query: 378 CDEEKIVWNGVRQLDRAMGLFKFVWETAGMKGELELQGHLFCVGAENRQLSYKGNAYLLH 428
             +EK++  G++QLDRA+GLF+FVWETAGMK  L LQGHL+C+GAE R ++Y+G  + +H
Sbjct: 371 STDEKMLERGMKQLDRALGLFEFVWETAGMKETLNLQGHLWCLGAEERSITYRGKTFFVH 420

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022982191.1  2.0e-212  84.78     uncharacterized protein LOC111481093 [Cucurbita maxima] >XP_022982192.1 uncharac...
XP_022940414.1  1.7e-211  84.58     uncharacterized protein LOC111446029 [Cucurbita moschata] >XP_022940415.1 unchar...
XP_023524216.1  1.8e-210  84.11     uncharacterized protein LOC111788189 [Cucurbita pepo subsp. pepo]
XP_038896888.1  2.6e-204  82.71     uncharacterized protein LOC120085101 isoform X1 [Benincasa hispida]
XP_022139027.1  5.3e-194  80.05     uncharacterized protein LOC111010055 isoform X1 [Momordica charantia] >XP_022139...

Match Name      E-value   Identity  Description
A0A6J1IW03      9.5e-213  84.78     uncharacterized protein LOC111481093 OS=Cucurbita maxima OX=3661 GN=LOC111481093...
A0A6J1FK17      8.0e-212  84.58     uncharacterized protein LOC111446029 OS=Cucurbita moschata OX=3662 GN=LOC1114460...
A0A6J1CBU2      2.6e-194  80.05     uncharacterized protein LOC111010055 isoform X1 OS=Momordica charantia OX=3673 G...
A0A0A0LED2      1.9e-189  76.94     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G878820 PE=4 SV=1
A0A1S3CR45      1.2e-186  75.93     uncharacterized protein LOC103503810 OS=Cucumis melo OX=3656 GN=LOC103503810 PE=...

Match Name      E-value   Identity  Description
AT5G25500.1     1.2e-103  48.11     unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term  Source Description           Alignment
None      No IPR available  PANTHER  PTHR37763    EXOSOME COMPLEX EXONUCLEASE  coord: 12..321

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0011674.2  Tan0011674.2  mRNA
Tan0011674.1  Tan0011674.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006464      cellular protein modification process
cellular_component  GO:0016020      membrane
molecular_function  GO:0140096      catalytic activity, acting on a protein
molecular_function  GO:0000166      nucleotide binding
molecular_function  GO:0016740      transferase activity