Tan0009634 (gene) Snake gourd v1

Overview
Name: Tan0009634
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ubinuclein-1-like isoform X2
Location: LG11: 10537731 .. 10558422 (+)
RNA-Seq Expression: Tan0009634
Synteny: Tan0009634
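
The location string in the overview can be split into linkage group, coordinates, and strand. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> dict:
    """Parse a location such as 'LG11: 10537731 .. 10558422 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("LG11: 10537731 .. 10558422 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # LG11 + 20692
```

Under the inclusive-coordinate assumption, the gene spans 20,692 bp on the forward strand of LG11.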
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACCTAACCAGCCGAATACCCGTCCGTCGGCTTCCTTCCCTCCTTCTATAGTCTCACAGCTTCTCTGAAAAACGAACCAGAAAAAATAAAAGACAGAGCGAGACCATTTCCAAGCCTCAGTTCGATGAGCAGTTAACAAACTCCACCTTCTAAATTCTCTTCCGTCTCTCAATTTCTCTCTCATTCGATTTGGGTTTCATCTACAAAGGGTTATGTCCCTCCATCTGTGCTTCCAGCCGTTCGATTCTTTAATTTCTGATTGAGCAACCCTAAATTCGTTGAACCGGTCATTGTAATTCCATGGAAGAGGAAAAATTTAGTGGCGGCGCCGGCGTTGGAGCTCGCACTGGGAATGGTGGTTCCGGTGATTCGTCGAGAGCTTCTTCTTCGTTTCTGAAATCGGGGGACCGGCAAATGTTCACTGTGGAGCTCCGACCGGGAGAGACTACCATTGTTTCGTGGAAGAAGCTTGTCAAAGATGCCAACAAGGTTAATGGAGTCAACACTGTGCCTGAACCTCCTGCCAACCCCAACCCTGCCGTCGAGTGTCGCATTGATCCGGTGAGGACTTTGTTTTCACTCTAAATGTATTCGATGTATAATGTGTTTTATTTCTTGCTAAAGTGTAATTGGAATTTGCCAGTGCTTGTTTTAGTTTTGGCATTGCTTCAATCATTTGCCACTTCAGTTTCAGAGTTCATTGCTTGTAATTTTATTAGGTAAAAAGACTAGGGTCAAGTTTAAGTGACTAAACAACCTGGGGAATGATCCTTCAAGCAAGATTCTTGAAGTGGTCCAGAGTACCTCTATAATACTTTGGATGGGCAGTGCGTTAGAACTTTAATGGGTCTATTTTTCATATGAAAGTGGTTAAAAGATGTAGAACTCGAGTTTGGTCTACATACATCCAAGTCGTGTGATGCAGTTACCAGAAAGTGTTAATGGTCCGAGACTGATGCGAAAAAAGAGATGCTCATATGATAGTCTTTTTCCTTGACTTCAGGGTTACCCTACTGATCATTGGTGTGACCATTTAATTATCTATTCACATTTTAAAATTTTATGTGCACTTTTTAAATTATTTTTACTAACGTGTACATGTAAATATAATTTAGCTATGTTCTAATACAATGTATGTTCTCGAGCATGCCGCCACAGGAATTGGATGTCCATTCACTTGAATTACAAATTCATATTTATGGATTTCAGTTATTGGGGCTATTGTTAGAAATGATTTAAACCCTGGGTTTAGCAAGGTACTGCAAGAAAAATTTTGCTGTCTGTTATCAGCCACAGTGTTGCAGCTTTGATATCCTCATGGCTCACCTGCTATAACTGATTATTAAGAAAATTGCACGATCAACTTTTTTTCTGAATTGCATCAATTGCACCATGAAGCTATAGGTGAACATGTTGTAGGTGCCAGACATGGTCAGCATCCAATTTTGAAGTACAGATACCTTAGCTTGTATAGCTTTACGTCATATACCCTTTGTAAACATCTGTTGCACACCCAAACTAGAGTGCTAGTTTATGTATATTTATATATGTATGTACATACTTTTTGATTGAAAACAAACGATGTAGGGAAATGATTCATCTTGAGTGCGGAGAGGGCTGATAAACTCAGTGAGAAGATGGAGTGATTGTTTTTGTAGTTCATTTCATTCTTTCATTTGAATTTATGGACATAACAAAAACTACTGGGCAAAAGAACTAAACTAATGAATTTTGATCGGGAAACACTATTCTTTCTGTTTGCAGCATTGCTTGAGCAGTTTGATATGAAATTTTACTTTCTCATTAGATATTAAATCACTGAGAGATATATTTAATTTGTTTGAGACTTCCTTTATAAACTTTTCTTACAAGACGTTGCATCGTTACAAATATGTAATTGGTTTATTAAAGTTCTTATAGCTATTGTATAACTCAACTGTTTTTCCATTCATTACATTTCAGTACAAATGTCATAACTGAAAGGCAAATTCTGGAGCATGA
AAACCATTGATATGTGTTAAAATATATACTCGTAGAATGTAATATATTTGTTTAAATGTTTTCAAAGGTGTTGTGTATGTGCCATCCATTTCCATTCTGTTAAGGTATTTTTCATTAAAAAAGGGCTCATAGCCATGTACGTTCGATATGTATTTCAATTTTTCCTTCTTTATTTTTTCAAAAAAGAAAATGGCTTGTTGGCGTGTCTTTATCTTGATATGTTTATAACTTGCGTAATTCCATTCTGTACTTATTTATTTATATGTCATGCCAGTATGTTTTGTTTATGTCCATATTAGTTCTTTTTAGATGTCCAACCATTTATTTACTATTTAGGATTTAGGGGTCGATCTTTAAAGTAACTGAAGAGAAAATGATCTATAAATCATGAAGTGCTTGTCTTTTTATTTGCCATTAGGGACAACCAATTGAAGACGAAGTGAAAGATGCAACAGCACCAAATCGTTTTAATGCTGTTATTGAGAAGATTGAACGCCTATATATGGTACTTTTGTTTCCTTCTTAACACTGTCAATGTTTGTTTTTCAATTAATAGTTTTGCTATACATCTGTTGCATTCAATAATCAAAGAATCACTCAATACTTCAATCTCATGGCTACTAACTTGACTATTAAAATCTTATAGGGTAAGGATAGTAGCGACGAGGAAGATTTAATTCCAGATGATGATCAGTATGATACAGAGGACTCATTCATTGACGATACCGAGTTGGTTGGTTTCTTTCTGTTGCTCGTGTGGTCTTGTTTGTCCTTTCTGCAATGTAAATTACAATTTCTTTTTTCCCTGAGAGTGTGATGATAACCTTTTATGTGTGGTGATTATTGGTGACTACTTGTTCTGCAGGATGAATATTTTGAAGTTGATAATTCGGCAATAAAACACGATGGATTTTTTGTTAATAGGGGGAAGTTGGAGCGCATGTGAGTTTCTGACTTTGTAGATCCTTCTTTTCCTTTTCTTCAGTTTTATAATTTGTTAGAAACCACGTTAGTTAACTTGTTAAGTCAACTATTTTGTTGGATTCTTTTTTGCTCAAGTGGTTGATGTTGCATGCACCACTGTTTTCTTCTGTTTGTGGAATTAAATGATCTAGCTTCATAAAGTTACGTTCCTTACCATGCTTTTATTATGGAAGAGTGTTACTTATGGACTTGGTATGTGTGTGTTGATCCGGTCAATCTTATTGAACTTGTGGACTTGGTAGGTATGAGTGGTTGCCTGGGATGAAGTGTTGGCATGACACTACATTTTATTTCATGCATTAAAATACCATGTTTTAATTCTGATCTTACTTTGATTTTGTTAGGTTGGAGTCCCTTTATGTAGTTTCTGGGATGTTCTTTTGTGGGACTTGTTTTTTGTATGCCCTTGTATTGTCTTTTACTCTTCTCAATGAAAGCGCGTTTTCTTACCCAAAAAAAACCATGTTTTAAGGTTCTACATAAAGTTTCGTATCTCGAAAAAAAGTCCACCAACAATACATGATGTAGGTGGCATGACGATAGTGGTGTCTTCTGTAGATCAACAACTCCTGGATAGTATCGAGCTATCAGCTACTAGAGGAAATTCTCCTCTCTTCGTTTAGCCCTAGTGTCTCCGACTGTTCCTCTACCTCCTCGCCGTCGTTTTCCGGCCGATTCTCTCGCCGTCATTCCTCCACCCCTTCTCTGCTCTCTTTCTTTCCTTCTGGTATGTTTCTTTCGATCGCAATGTTTCGATAGTACTCGGTAAAGCGCGAAGACTCGCCGACTGAAGTCAGTCGCTGTCGTGCCGTCGATCCCTGCCGACCGACAACCTCCCTACCCGACCGCCATGTCTCAGCCTTTGCCGACATCCTCCTTTCTCTCGTTTCGCGAATTCGCCCGCGTCTGTTCAATCTCCCTGTCCGACTGCTCGTGTCTGTTCACTCTCCTGAAAGCCCCCCCTGGCGTTTTCGTCTGTCTCCTTCCCTTGCTTTGAACTTTCAAACGATTTCTTC
TTCTGGCGGTAAGACTTCCCGTCTCTGCCTAACCTTCGTTTCGTTTCCAACGACTATTGGGTCTTGTCACTCCCTTCATTGGGGAAGTTCAGTTGCTGTATAGGTGAGAACTTTTTTCACATTTGGGGTGATTCTCTACGACCTATATCCAAGATGGGATTCAGAATCTTCCGGTTTCATTGACTCTCAATCACATGAAGTGGTTTCATGATTCATTTATCGATTTGGCTACAAAGGTCGATTCATTCTTTGCAAGGAAAAAATTCAAGGATGACGAGGCCACGCTCGGGTTATTCAAGGTCAAGAAAAGTTTCGGTTGGGCTGCAGAAGGTATGATTTGAATACAAATTTGGATGTCAAAGGTGGGAAGTGGCAAATTATCCATAACCTTCATTTGAAGATTGAAAAGTGGGAGGATAAACTTCATAGCTCTCCTTCAGTAATGATTGGATATGGAGGTTGGCTTGCTGTAAAAAACTTACCCATGAGGCTATGGAAGAGGAGGGTTTTTGAAGTGATTTGAAGTTTCTTTGGAGGGCTTGAAGAGGTCTCATTGGACACGTTGAACTTTGTGGATATATCAAAGGCAAAAATCAAGGTGTCCCCAAATCTCTGTAGATTCTGGCCGACTGTAATTCCTATCGAGGATCCGGTTGTGGGAAAATTCTTGATAGATTTTAAAATTTTGGGGACTATAAATCCACCTGCAAGGCCGTTGGATTCGTTAGTTTTAAGGGACATTATCAACCTAATAGATTTGGTTAGAATTCACCAGGTAATAGAAGATGAACTGGAAGTTGATTCGACTGGGATTCCCTTGGAGATGGATGTATCTGTTGGCATCCCCTTGATCTCCAAAGCGAAATGTGCTACTGGGAATGGAGGGGAAAATGTGTTCCTCGAAGCTGGAGATTACGAAGAGGGTGAGGATCATGAGTTTTCAGATATTGAATCGAGTATGGGTTTCCATCAGCAGGTTAATGGCATTAATGAAGATCTAAATGAGACTTTCTCTTATTTGAAGACGTTGGAAGCAGGGGAATCAATTGGCTGTGAGAAGGAATTTTGGAGGAGTGGGAGCTTGTCAAAAGGCCATGAAAATGTCAAGCCGGATGAGCCGAGGGTATTGGATACAGAAGACTTGCCTTCGCGGATTAAAGACTCTGTGATTGATGAGCCTCTATTCTCTTCTGTTACCCCAATCAGTGTTAAGGAAGTCAAAGATACTTTTTGTGGCAAGTACTATTCAAGAAAGGGTGGGAGAAACTCGACAATGAATTTGGAAGAACATTTGGTTGAAAACTTTCCAGATTTCCTCGAGCATTCCGAGGCCTCCATTCAACAATCAGACCATGTAGTATCAGAGTCCCAGGTTACTTATGCTCCCTTTTTCAAAGCTCCATTAATTGCTGTTGAAGAATGCCACCTTGGGCTGAAAGAGGTGCCATTTGTTAAAGCTTTTTTTCGGTCCCCAACGGTTCTTAACCGAGGAACGCCTTCCCCTTCCCATGTTCTTGGTCAATCTTCGATACCAGATTCGGTTGTGGCTTTAAGCAACCCTGATTTAGAGTTTGCAAAGAGCTTAAAACATAAGGATTTAAAAGATTTTTCCCCAACAATTAATCTGCAGGAATTTTTTGATTTAGAAGTCTCTCCGCCTTGTTTGGAAAAGGAAAGTAATAAAGGGAACCTTGATTGCCCCATTGAATTAGCTCGTTTTAATGACTTAATAAAAGCAAGTGGGCTGCAATTCAAAGAAATTGTAACTAGATAAGTAGTATTGTCCTTATGACCATAGGCTGATCAAGGACAGTGGGCTCCATTTTCGTGGGGATGGCCCTGTGGTTCTTCCTTTTGATAAATGGTGACTTTATTATGGAATTCCAGAGAGTTGGAGGATTACTCTTAAAGGGAAAGCCTGAAAATTATGTTATGAAGCTTTGTACTGGTGCAGTATGATTATAAGAATCAAAAGCTGGGTGTATTGATGCTTTTATTGG
GTATTCAGACGTTCCCAATGGCTCGTTGGTTGTGGAGAACTGGAGTCAAACACTGCTATCATATTTTTGCTTGCGAGAATTCAAAGATTGTTTGAGGATAAGAAGGTGGATTGGGATAGGAAATTCAATGCAATTCTTTTCAAAGCTTCTACCTGATGTGTTCTTTCCAAGGATTTTGTTAATTATTCCTTTTATGACTTACGCTTTTTTTGAAGCCCTTCTTGTCTTGTATCAGTTGTTTTGTAGGCTATTTTTGGTGTACTGGTTCTGTACAAGGTTTAGATGCCTTTAATTTGGGTTTCTTTGGGCCATGTCTCATGTTGTTTGGATATTTCCTGTTCTTTATTGTTGTTGTATTTTGAGCATTAGTCTCTCTTCATTATTTCAATTAATCTCGTTTGTTGCCTTGTCAAAAAAAAAAAAATTCCACTTGCATTGTGCCCAACATCTCATGGAAAGTGTAGGAAGAGGAGAGGGGCTAGTTTTGAGGTTACAAACCAATCTTACCACTCTATTTAAAACTCCTTGGGCAAGATTTTCGGCGAAAAGGAGAAATGGGAAGCTGACCACTATATTGCTTCAATCTTTACATGGCAATGAGAAGATTGGAACTATAGTGTATCACTGTGTGTATTAGAAGCTACCTGCCTAACATTTCATATATCTATCTTAGAAACTAGTAATGGGTACACCTTGAATGGGTGATTGCTTCCTCTGCACCTTCCCATGATATGGAATTCATAGTGGAATTTGAAGAAGTTTCCCTCGTGAGATTATCGTCTGACCTCAAGGTTTAAATGGATACAAAGGTTTTTATCTATAATATTTTTGCTAACTCAAGTAGTAAGGTTAGGAGATTGTTCTGTGAGAATAGTCTAGGTGCGTGCAAGTCGACTTGAACACTCACGGATATGAAGAAAAATTTATCTACGCTAAGTCAAATACAAATCTATATTTTTTTGGAACTTGATTCTACACGATTAGATATTGGAATTTTAAAAAGTGGAAATTTTCTTGCAAATTGGCTACTTTCAAGCTCAATTGTAATCACACCAATCTGTGTTCCTTGTTGACAAAGCTCACTTGGGGTTAGTACCTTGAAAAAGTTCAGCTCTACTAAAAAATAAAGTGCCAAGAGGAATTCCCTTCAATTTTTTTTTGTCGAATCTACCATAATTGCAAAAAAATTGTTGCTGTTTGAAAGTTACTTAGGAGAAGTATGGCTAGATTACGACATTCTCAGGCTTTCGCCTTGAGGACAATTGACTCAATAGATAGATTTAACAATACCAAGGGTGGGAGTGTTGATGGACCACCCATTAGGTTTACTTTCTCCTAATAGAAGAAGTTTGCTGAGGGCAGGTGCCAGTAAGCCCAGTTGGTCGGTCAAGCCAATTGATCAACTGAAGTAGCTGCTACGGCTGTGCCTATGACAAAATATATATCACCTACCTATATGTTGATAAGCTTGTAGTTTGTTAGAAGGGGGAATTTGTGTATGTGGGACTAGGGGCACATGTCTTTTGGAGGATAGGTGACTGCTGACTGCTATATATTGAAATTAGTTTAGAGAGGTATTGGGAATTCTTTTCAGCTTTTTCTATATCAAATGGGGTTTCGGTTCCAATTCTTCAGCCAATTTACTCCAGCTTTTAGTTGGTTCACTTTTAAAAGTGATCTAAAGCTCACCTCATTTGGGTTAATGCTCTTATTTCTGAATTATGATTGGAACATAATAAGCGATCTTTTAAGATAAGTGCTTAGCTTGGAGTGAGCGTTTTGATTTGGCCAAGATCAATGTTTCTCTTTGGTGTTCTTTGTCTTCTTTGTATGCTAATTATTCTTTTAATGATATTCGTCTTCATTGGGATGCATTTATATCCCCTTGTTAAATTTTATTTTGTCTTTTTTCTCCCCCAATTTAGGGTTGTATCTTTGAATATATCAATGAAAAGTTTGTTTCTTGTTCTAAAAAGAGGTATTGAGAATTCTAGTCTGGA
ATCAGCTCAGCTTAGGGAGGGAGACCAGGCCTCTACGTGGTTATGTTATTTTGACCCCTTTTAACCAATATATATACCTCAGTTTTTTTTCAACTTTTTTTAAAGAGTAATATTACAAAATTTTAAATCTAGTTCTCTATTGATCTCATTGTGGGTGTGTAGTTTCAGCATCGTGCTCACAAGATTTATTTTATTTTATGCTTGGAAATTGGAATATTGAATTTTAGGATATCATTTTTTAGTTTGCTTTTTGGAATTTTCTATGCTTATTTTTACTTCTTTTCTTTGCAAAGAAGCGAACCATCTGGACAACCTAATCAACAGTTGAAGAAAAGGCGCAGAAAAGATTTAGAGAAAGGTCATCCTGATAACCATGATGCTCGCTCATCAAATAAGCACACAAAAGTGGGAAAGACAACTGCGGGAAAGAGTGCACTGATGGTTGCAAAGAGTTTTTCAAATCTGTCTCAAAACATGGCCGTCACCCATGGACATCATGAGGATGAAAAATTGCAGAATCAAGTGAATATGCCTGGACATAGTTCCAAAAAAAAATCTGGTGATACGAAAATGATATTGGACCCTTCTCCATCTTCAAAAATTTATAATGGCGATACATCTGTAGCAGAAGCGAAGGACATTGATCCACCAAAGCCTGGTGTTTATCCATCCAAGAACCTTGTTAGCAAATCGAAAGAGTCATGTGGACCATCTGATTGTTTACAACAGAAAGTTCTTGAAAAAAGTGCCCATGCACAATCCAAACCCCAACCTGGGAGACCGTTGAATAACACAGATGAGATGGATTCGTCAGTTCAATTGAAAGAGAAACAAGGCATTCGTGAATTGCCGGACATCAATTTGCCAGAGGGCAAGTATTCCATGCAAACAGCAGTAAGTTTTCTAAAGTAATTAAAATGATGATCATTCACCGCAAAGTACTGATTCTACCATTGTTTATGAGAAAGAATCATTTTCTTTACTAGTCTATAACAAACTTTCTGTTTGAATTGTTCATCAATTGAATTCCTTTTGACTCCCCCTGGATTTGGGATGGGTGTTTGTGATTCATGCTACTGGTCTTCTTAGTTAAATGTATAATTTTTCCTTTACTCTAATAGAAATGGAATTTGCTTCCCTCCCTCTATTAATAAGAATATGATTTGTGGTTTAATGGATGGGGAATTTCTTCAACTTTCCGAATTAAGGATTGGTCTGTTCTATACTCATTTCTATACAAATTTAAGCATGCCGTGTTTGTTTACTGGAAGGAAAAGGAAAAGCCTGAGAGGCGCTGAGTAGGTGACCTGTGATTGTATCAATTGATGAAACTCAAGATATATGTTATTATATGTTTGATTTCCTGGAAATGAGCTAGCTAGCAAGGTGCCTGTTGGTATGGGTTTTGCATCGTTTATTAATTGTTTTTGGCATTTTTTTAATTACATTAAACCAATGCAAAATTCTGAACTTTCACTCAGTTTTCATGACAGTCCTTAGACTCTATTGTTTTGATAGGACCTTTATCTTTGGTTAAGTGTCAAAATTATTTCTTATTGTTACTTCTCCAAAATAATTTTAAAATTTTATAACATCATTGAAGTTTTTGTTTTTCTTATAAAATATGCTGGACTTAAGAATTTGATTTTCAACTTGGTCGGAAAATTTGGAGTATCATAATTCTCACTTGATTACCTAGGGGCACACTTTTTTTGTTTCTTGAAAGACACATGGGAGATTAATGTTTGGCTTAGTGACATGTCATATGGTAACTAATAAAATTTCAGGCTTCTCTTAGGCCGTGATTTTAAAAGTGTGAAACACATCAAAGCACAATAATATTTTGCGAGCACAAAAGAAGTGAGGGCTTCATAGAGCAAAAGTGTACAATATATAAATATGTACACAATACAACCTTACTTGAATAGGGTCTGTTCTATTAGTTGTATATCTGTGTCCTCGAGTTCTAAACATTCACGTAATATTAAATCAGAATGAAG
CATGCAATAGAGAAAAATAAAAACAAACAAAGATTAACGGTTAAAGATGAAGTTAATCCTCTTATCATTTGTTTGATAAGAGGATGCATTTGGTAGCCTCTGTTTTGCCCTCTTTTGATAACTCTTCCTCACTTTCTTTGGGTTTTTGTTGCCCCCTCACTGATCGAGAGGCAGTTGAGGCTGTCGGTCTCCTTTCTATTATTTTTGACCAGGTGACTCATCATGAGAGAAGAGATTTCAGGCTTTGGGTTTCGAAGCCTTCTGAGGGTTTCTCTTGCAGCTCTTTTTTCCATATCCTGTCTGCCACTTCTTCCCCGGGGAGCCTCTGTGTTCTACTTTCTTTGGAAGGTTAAAATTCCTAAGAAAGTAAAATTCTTTGCCAGGCAAGTTTTGCATGGGAGGGTGAACACTATGGATCGTCTTTAGAGGCTTTCGCCTTTTGTGTTGCAGCCACAATGGTGTGTTCTTTGCAGAAAGCAGGAGGAGGAGCTCGACCATCTCTTATGGGGTGCCCTTTTGTTTGCTCCCATTGGAGTCAGTTTTTCAGGACCTTCAGGGTTGTTCTGGCTCGTAATAGAGGGTGTTGCTCTATATTTGAGGAAGTTCTTCTAAACCCTCCTTTTTGCAATAAAGGAAGAGTTTGGTGGCAGACTTGCTTCTTTGCCATTTTGTGGGGCATTTGACTTGAGAGACCGGGTGAGGAGCTTTGGGAGGTGATTAGATTTAACTCGTCTGTGAGGGCATTCGTTAGTCGGGTTTTTTGTAATTATCACCTCGGCATCATTCTTTTGGATTGAGTTTCGTTTTTGTAATTAGCTTTAGAGTTTTTTTGTGGTGGTTTTTTTTGTTTGCTTCTGTATATTTTTTCATCTCTCTCTTAAAAGCTCGGTTTCTTATTAAAAGAACGGTTAAAGATGAAGTTCAGGTACAAAGAGTCTACAAAATTTTAAAATCTAACACTATGATCACATTGCAAAATGCAAAGACCTAAAACTTAAAACTGTCACTAAATATGTCCTCATCACTATTTTCCCCCATCATTAGACCTGTAGCTGTTTATCTTTCCCTCCCTAGTTTAGTCTTATTTCTATTTTCTTCCACGTCTATTGGAGTAGGAGGGCATACCTCGAGTGAGCTGATATAGCTTTTTGATTAAGTTTTCTGTTCTTTACTTAATTTAACCATAGTTTATTAAAAAAAACTACTTTAAGCAAGAAGCACACACTCAACAAAGCTTGCCACTCGACCTGACTTAAGTGCACAAATTTTTTATGCTTTTTAACACAATGCTCATAGATGACATGGGCTTTGAGTAGCAGCTAGCAACTCTCTTTCATGTGTCATGCCTTCTAAAAATATATTTTGACTAGTCCAATCTCTGCATTTGAGTATAGTTGCTTTGGAGATTCTTGATCTGATATCCTCTTATTTTCACTATTATGCGGCTATAGAATGAATCTACTGCTGTTTGTGCCCCTCTTTAGGTCAAGTGTGGTCTGTTAGGGTTGGGAAAAATGTACAAACTTTAAATGTTTTAAAAATTATCACTGGCTTCTCTTTTTGTATTTTTGTTCTTCTTTATTGGAGGATACGCCATCATCTTTCCCTTGTAATGTCTCTCTTATTAATATGATTAGGCTTTCTATTAAAAACTTGTACCTTCGTTTATTTATCATTTCTTGGTATTATTGCAAATTTTGGACATTTTATTTGTTTATCATCTCTACCCTAGCTTCAAACTTGTGGTGTCACCTTGCAGAAAACACCATATGTGCACAAAAAGGACGGTTCCAGTGTTAGACCAAAAAGCTCTCTGCTAGAAAAGGCTATAAGAGAGTTGGAGAAGATGGTTGCAGAATGTAAGCTTAATTTTCTTTTCTATGTCTAATTTTTAATTCACGTACGGGACTTCTTTGCGATTTATCTAATTATCATTTAATGTAAACCTGTTGCAGCTAGGCCACCACTTACGGAGAATCCAGAGGCAGACAATACATCTC
AGGCTATCAAAAGGAGATTGCCAAGAGAAATCAAACTCAAGCTTGCTAAAGTTGCTAGATTAGCGGTATGATGCTGTTGTTAATTTGTTATTAGCTGTTTGATGTAATGTCATTGAGACTTGTGGCGTGAATGGGACTGCTGCTAGGGATGTAAATAGTTTTGCTTTTCAGGTTAACTTTGGTCTCCCTTTCAGACACAAAATGGAAACGATATGCATTGGCTTGTTAGAATTTTTAATATCAGTAGGAGAGTGCAATAGGGGAACATGGAATGTAGTTTGGTTCAATTTTTATAGCTTGCTAGCTTCTTCCAACAATTTTTGTTGCAACCTTCTTCTTTGTTTTACCAACCTGTTGGCCTTTTTTTCCTGACTCCCATTGTACATACTTTTTCTTATCAACAAACATGCATACACGTGGTTGAATGAAGTGCCTATGATCAATCATAGAAAAGTTTTTTCTATTTTATATTGAATACTAAAAGGTTCTAACTTTGTTAGTTATTTAACAGGCAAGCCATGGGAAATTGTCAAAGGGGTTGATTAATCGACTTATGAGTATTCTTGGTCACTTGATACAACTGCGAACTCTAAAGGTATTGCTTCCATTATATAATATATGGATATAGTGCTTAGAATTTTTTTTCCTTTTTTTTAGTTCCTCTTTTTTTTTTTTTGCTTGGGGACTCATCTTCATCCCTGAGGCTGTTTAGGTGGTTCTCCTTCTTGTTAATATATCTCTCTCATCATTTCTCATAAAAAATTTATTTCCCTTTTTAATTCTGGGATTTTAGTTTTTCTTTTACGTCAAATCCATTCATATGGTGAAGTAATGTTATCTCTGACATTTTTTTAGTTAAAAAAATATTTATCTTTGAGATTTTGTTATCCCGTTTTTATTATTTGTTTTGGGGTTGAGTTTGTTAAATCAGCAATCAATCCAAAAGCTTTTGGGTTGATTGGTAATTTAACATGGTATTAGAGTATGATTTGTGATTTAACATGGTATCAAAGCAGGAGGTCCTCTGAGTTTAAACTCCTGCAAAATCGTTTCCTCTCCAATTAATATTGATTTCCATTTGTTTGATTTTTCTTCAAAATTTTCAAGTCTACAAGTGAGAGGGAGTATTGGAGTGTTGATATAATTAAATTTACCATACTCCATTAGTTTAAACTTTTGGGTTGATTGGTGATTTAACACAATTTATTATTGAATTGGACACTAGGGTCCTGAAGTTTTTATAATATCACGTAGGTACAACTCATTCAATTTAACTTCAAGACCTTTGCTTGCCAGATTAATAATTTGTTAAGCTTAATATAGATTGTTTTTTTCCCTGTAAATTAATAGGTGAGTTGCATCTGTAAATTTGGACTCCATGTTATTGTATATGTAACTATGCTTGTATTTTCTGATGAGTGATGATGATGAAGAGGGTTATAGACTTTGGGCGGTGTATCAGTTTATTGGCTTCGTGATTGATATTTTTGTCTCACAATTCTAGATTCTAATTGCATTTGTAAACTCAGTGTAGAAAAAATCTGTTTTATCCATCTTTTCTTTTATTCTGTGGAAACTTGATCCTATCATATTTTCAACTTGACCCTATATTATTTTTGCTGCTAATTGATCTTTTTCATATTTCTATATGCAATGTCATATTGTAGAGAAATTTAAAAGTCATGATCAACATGGGTATCTCAGTGAAGCAGGAGAAGGATGATAGGTTTCAACAGATAAAGAAAGAAGTTGTTGAGATGATTAAAATCCGGCCTTTGTCCATGGAATCCAAGGTATTATATTCTGAAACCACCTTTACTTTTTCCTATGTCTGCCGTGCATAACACATTGCTTTACTCCAAAGATGCACGTTTGGCAGGCTGGGTGGTAAAGTTTAATCGGTTCACCCTACCTTCTCTCTTTCTTTTATTTTATTTACTTGCACATATATTTTGTCATAAGAAACAAACCTTTCATCAAAGGAAGAAGAAAGGTACAAA
AAAGTAAAGACACTTTCACTCAAACAACAAAGGAGAGTATAAAAAAGCCGCCCAATTGATCAAAATATAGGAGTAGCCAGTAGGAGTTTGACTAATCTCTCTCTCTCTGCCAAAATCACTCTGTGAAAATGGTTAGTGTTACCTCACCTTCCACTAATCTAAAATCTAAATTTGCCTTTTTTAAAATATGCTTCCGTTTTTAAACTTAGTGTTTTTTCATTTGAGGTGTATATTGATTTTAACGAAGAGAACAATTACAAAAGCTCCAACTTGACATATGAAAGAAAAATAAAATCCCGAGTGGACACAGAAAATTCTTCTATTTCTCTCCATCTCAAGCTCCTAGAAAGTTACAAAATGAAATTAGTCCGAGATGACTTACAAGGTTACTTAACAAAATGAAGTTGCTCCTTTAAGGGTTGGAATATTAGCCAGCACTTTTGTTGGAGCAAATAACACAATCATGTACTCAGAGTGTCCCTCATGTATTAAAAAAGTAGTTTCGTGGGTGTTCTCTGATTTTAGACATATTTGATGGAAACTTTGTTTTAAATCAATTTTGGGAAAAACAGATTGGAATAATTTAACTCATCTAGTAGTTCCTCCACAAGAATTGGAAAATTATCTGGGATAGTTTTAAGGTTTAGAGCTCTATAAATGGCTCAAAACCTCTAGTTGCTTTCATTTTTATTGACTAAGAATACTGGACTAGTAAGGGCAGGTGCTTGGTGCGATCCCTCCTATTGCCAACATATCTGGTGTCCTGACCAAGCATTCAATCTCCCTATTTTGAATCATGGGCGAAAGATAATATTGAATATTTACAGTTGTTGTACTTGATTTTAAAGTTATAGCATGATCTCTACCCGACTTCTGGTTGGAGGAAGTCTTGTAAGGGGGAAAACATGGGCAAATCATTGTAAAACCCATTGACAATCTCCTCCGTGCTAGTCTTTTGTGCCACTGCAACTTTCTTTAGCTCTGACCACAATGAGTTTTATCAGGGTTCTGCCCTTCAATAAATGTTCATCACAGATGTCAAGGAAGCCTGCGATTTGGATAACCTTGGGCGACCTTTTAAAGTTACTTCCTCTCCAATTTTAAGCTTTATTGTCAGATTGCTCCAATCGACTCTCATTGAGCCCAATGTGGGCAACCATTGTCCTTTGAGAATACATTTCACAGGGATGGTGGTAAAAATCTTCTAGGAGTCGATTTATAGTTCCTTTAGCTTGTTTCTTATTAAAAAAATAGTTCCTATAGCGTCAGGAGCACGATTTGAAACATCTCCCCTGTAATATGAAGTCTTGGTGGTTGTCAAGTCCAATTCTTGCACGATCTCTTCAAAGATGAAGATTATGCATAGCCTCACCAATTACCTTTCATCACATAAAACAAACAAGCCTTTCTCTTGCTGAGCTTGCAGTTTACCTTTCATTAATATTTGATATACTTGATCTTGTCGGCTAGTTACAATTTTCTCTGTTACATTGGAGCCCCTACTTGCTACCTATGGGTTATATATCATACTTTGGATGTGGATTGCCCTCTGACCTAGGGTTTTGGAATGAACGTAGCTCCTTATTGGACCTCTAAGGCCAAATTCTTCCTCAATTCTATGAGCCGAAATTGTAACATCTTTTCTTCAGCTTTGATCTCTGTTTTTAGCCCGTTGACAAATTGTTTGTCCTATGACCATTCTGGCATTTCTTATAATGGTCGTCCTGCTAATGATCCAAATTGGTGCAGATATCTTTCATTGTTACATCCTGATGATGAGCTTAAAGCCTCCCACATCGTGTTCCATCCTACGAAGTTCTGAATCTTTTTAACAATTGTTCTCGCAACATGATTCGGCCCTATCCAAATTGGTGCTGATATCTTTCACTGTTACATCCTGATGACGAGCTTAAAGCCTCCCACGCCGTGTGCCATCGTACGAAGGTCTAAATCAATGGTTCTCGCAACATGATTCAACCCCTCAAAGTGTTTTTTCTCTT
CCTTCCAATGTAACCATGAAAGAGCTTCCCCCTTACTGCAACCAGCTCAATCATCTCTTCTTCCAACAGTTTATTGACTTCAAAATACCACTCAGCTCCAAACAATCAGTTACTTGGCTCCTCCAATTGAAGATTGGTATCTCTAGGCGACTGGGCCCCCATCGACTACTTTCTTTCTTTCTCTGGAATCTTCTCTGACTACCTCTGGATCCCGTTGAAGTTGCCCTTGTTGAGGATTTAACCTTGTCCTCCTGCCTTAGAGCAGTATGTTCCTTCCATTGCTCTGACTGTTGCCACATTTCTTCCATCCTTGGTATCTTCCATTTCCTGAAGTTCTCACCGCTTTGTAGATTCGACTTGGTTATTTTATCAAGCAGTCGGATTACAATAAACTTGAACCAAAATATTCTTTGATATCACTTAGTGTTATATGCCTCCCGCATTGTATTTTGCGGCCTCGGTGTTACCTTCCTCTGATCCTTCTTTGATTTCTTTAGGATTCAGTCGTCCTCTGTCGGATAGAGAGGCAACCAATGTGTCAGGTTTTCTTTCTATTTTGTGAGACCATCTTGTCGTTCCAGGGAGAAGGGATGTTTGAATTTGGTCTCCCGATCCTTCTAGGCAGTTCTTTTATCATTCTTCGTTCATATCTTGTGTATCCTTTCTCCCTCTTTGATTGCCTCTGTTTTCCCGTTCTCTTTGAGTCAAGGTTTTTATGTGGTAGGTTTTATATAGGAGAGTTAATACCTTGGATCATGTCCAAAGACATTCTTCCTTTGTGTTGTTCTTGTATTGATGCATTCTCTACAGGAGGCATGAGGAGGACCTTGATCATTTGCTGTGGGATCGTTAGTTTGCTCAGTTTCTCTGGAGGATCTCGTTTGGTGTGTGTGTGGGTCTCGGGATAGAGATGGTAGATGTTGGAGGAGGTAGTCTTAAGTCTTGGATCCTCTTTTTAGGGAGAAGGACAAAGTTCTATGGCAAGCTAGCTTCTTTCTTATTTTGTGGGCTATTTGGATGGAAAAGAACAATAGAATGTTTAGAGAGGTGAATAGAACTTTTAGAGAGGTGGAGAGATGGGGTGAGGAGGGTTGGGAGGTGGCCAGGTTTAATGTCTTCATATGAGCTTTGGTGACTAGGTCTTTGTGTAATCATGAGCTTGGTCTTGTTCTTTTGGATTGGGGTCCTTTCTATTGTTAGGTTCTCCTTTTGTTCGGGCTTGTTTTTTATATGCCCTTGTATATTCTTTAAAATGGAAGCACGGTTTCTTGCCAAAAGAAAAAAAAACTTCGCTCTTTTGTTGCCTCAACCTTTTCTTTTCTATCCCCCTTATTAATCTTCTAGAATATGCATCTAAGGAGAGGCCCATGAGATTTCTAAGTTAGGTTCCTTTTTTTTTTTTTAAAGAAACTCGAACCATTTCATTGATATATGAAACATTTACAAAAGACCTATCTGAACTACTATATGAAGCATCTCCAATGAGAGTAGAGAGTTGATAGGCTATAATTACAAAAAGGAAGAGAATGGAAACTCCAATATAAAGCAAGAAAAGTTAGGTTCATTCTCCAAGTCTTAAGGGCTGTTTGGGAATTAAGTTTGAAAATTGTTGTTTGAGAAGTTTAGGCCCCGTTTGATAACCATTTGGTTTTTGGTTTTTAGTTTTTGAAATTTAAGCCTAAAAATACTACTTCTACCAATGAGTTTATAAGTTTTCTTATCTACTTTGTACCTATGTTTTCAAAAACCAAGCTAACTTTTGAAAACTAAAAAAAAGTAGTTTTCAAAAAATTGATTTTGTTTTTAGAATTTGGCTTAGAATTCAAATGCTTCCTTAAGGGGGATGATCTCTATTGTAGAGAAATTAGGTGAAATAAGCTTAATTTTCAAAAATAAAAAATCAAAACAAAATGGTTATCAAACGGGGCCTTAGTGAGACACATTTGATTAGCTTTTTAAAAAGTTCTTTTTCCCTAGTTTATTTTATTTTGCAGGGGCTT
ATTTTCTAGGACGATTGAAGTTATTTTTCCTTTAAAAATCACTTGTGCATACTAATGTAAAGTGATGTTAATTAATTTCAAGCTTAACAAACTCCAATTGATTACCTTTGTAAGAATTATTCTGCAGAACACGATTACCCCCTTTCCAAAATTATTCTGCAGGCAAGCTATTGAAGAATCCCTTAATTTTAGTCTAATATAGATCATAACCCTACCGGGCTAATTTAGATAAGTTTGGGGTTTGTTGAATATTTAATTTTAATTGTCAATTTGTCTGTGGTTAAGGCATGAGTAATCTTCCTTCCTTACAGATTGAAGGCAAATCCCTGCCCTCTCATTCATAATGTTAATTTTTTATTTTTTATTTTTTGAAGATGATTTTCATTTTACATTGTTAATAGTGTATCAAATATCATTCAGGCAATCGAACAACAAGCTGGATCACCTCATGATGTCCGTGAACTTGTTTCGGAAGAAAAAGGAGTTCTGAAAAAGAAATTTGTTATGGATCCTGTATTGGAGGACAAGATTTGTGATCTCTACGATCTGTTTGTTGATGTATTGCTTCCTGTTCCGAGTGACCTTCAAATGATTTCTTTTCTGATAAGCTTTAAACTTTCTGACGTCTCTGTTTCTTTATGATTCTAGGGACTGGATGAGGATGCTGGTCCACAAATCAGAAAGTTGTATGCCGAGGTAGTTGATCTCTCATTTATTCTTCAATTTCTCAATCAGCTATTTATATCAGATTGCTGCCTCCATCGTTTTGGGAAGGGTGTATGATGATTTTAGCAAGCGGTCTCAAAACATGTCACAGAAGCAGTCAGTTCGTTTATTTTTCATGAAAATTTAATTGCAGAAAACATGTCACAAAAACGCCCATTTTTGCAATTAAATTTTCTTAACTGCACACTACTGTAAACGCCCACTTTGTGTTAGATCTATTAGTTGGTCCTTTTTGTCAGTTGTGCAGTCTCTCTCAAGTTTTTCTGAAGTTCCTTCCCAATGTAGCCAATCATCTTATTTTAGTTATAGCTCCTCTTCTTATCTATCTTAGGCCCCATTTGATAACCCTTTGATTTTTGAAAATTAAGCCTAAAAATACTACTTTTACTAATGAGTTTCTATGTTTTCTTATTTACATTGTACCTATGTTTTAAAAAACCTAGTTAAGTTTTGAAAACTAAAAAAAGTAGCTTTCAAAAACTTGTTTTTTTTTTTGAAATTGACTAGAAATTTAAATAGTACCTCAAGAAAGATGGAAATCTGGAAATCATTGTAGAGAAGATGGAGAAAAATTGTGAAAAAAACAAGCCTAATTTTCAAAAAGCAAAAAGCAAAAATCAAATGGTTATCAAACGGGGCTTAGTTTTTAAACTTGTACTAACTTTAATCTGTGTAATTGGCCATCATCTTTCCAGCTTGCAGAATTGTGGCCAAATGGGTTCATGGATAATCATGGGATCAAACGTGCAATATGTAGGGCAAAAGAGAGGCGGAGAGCATTGCACGGAAGACATAAGGTAATGTACTCTCCCTTTCTCTATCATTCCCTATTTAAGCTATTAGAAAATTGCGAAGTTCTACTCTCTGATAATTGATACAGAAGTGAGACGCTTGCCTCTTATTAGGCATGTAAACATTAGTAAACAGATTCTTCTATTTCGTCGACACACCCATCCCCTTGAGTTGATATTCTTTCGATTTAAGATCCATGTTTCTGGATTAATATTGGCCGTCACTTCTTATATCAAAAAAAAATATTGGCTGTCACTTATAACGTTGTCTTTTTTGCACTCACTCCAAATTTGTAGGATCAAGAAAAAGTCAAGAGGAAAAAGATATTACCACCTAGAGTAGACGAAACTGTAAGAAATGAGGTCAGTTTGGTTGCTCAGCCACAGCATGCTCGAGAGCGATTAGCCTCTGAATCAGTTTTACAGCCAGCCCCAGCAACCAAGCCTGTATCTGTTTCATCAGCTACAATGGCCCAGCTACCA
AGTCCTTCCATTAGTGTTGGAAATCTAGACAGGCTAAAATCCGAGAAGTTGAAGGGAAGCTTAAGCAATTCCCATGAAGATGCAAAAATGGTGGACGGTGCGTTAACCAAGAAAAAGACAAAGAGGAAGGCAGAGATGGAGTTAGATGAAACTCATAATCGCCTCGAGAAGGCGTCGATCCAACATGGGGATGAAAAACACAAGGCCCTCAACAAGCCAACTGCCAGTCTTCCTCCTAAGACAAACATTCAATCAGCTGCTCCTCCAAGTCTGGAACAGTCAAGCTAACATGATATTACTTCCCATGAGTCTAATCTACACCATAATAATGTTCTATCACACATCACCATTCTCACTCCATCCCACAAAATTTTCTTCCCTTCTCCCTCCTCCCTTCTCTTACTCTCGGGTTTCATTTATCGACCCGCGAAAAGCCAACGAAAAAGAAGCCATTTTTGTAATTACTGCATGTTCGTGCATGGGGCTTTCTGGATTAGAAATGTATCTTAAATTAGGGAGAGCTGAAGTAGCAATTTTATTTGGCAAGTAGTCAATTTGAAGGCCCTCTTTTTCCAACTCTTTAAGGCTCAAATCATTTGTTTTTCCTCTGGTTTTCCCTGTCTAATTTTCTATTAGTTTTCTGTTGATAGTTTCCATGCCATAAAATTCCAATTTTAAAGCTTAAAACTCAC

mRNA sequence

ACCTAACCAGCCGAATACCCGTCCGTCGGCTTCCTTCCCTCCTTCTATAGTCTCACAGCTTCTCTGAAAAACGAACCAGAAAAAATAAAAGACAGAGCGAGACCATTTCCAAGCCTCAGTTCGATGAGCAGTTAACAAACTCCACCTTCTAAATTCTCTTCCGTCTCTCAATTTCTCTCTCATTCGATTTGGGTTTCATCTACAAAGGGTTATGTCCCTCCATCTGTGCTTCCAGCCGTTCGATTCTTTAATTTCTGATTGAGCAACCCTAAATTCGTTGAACCGGTCATTGTAATTCCATGGAAGAGGAAAAATTTAGTGGCGGCGCCGGCGTTGGAGCTCGCACTGGGAATGGTGGTTCCGGTGATTCGTCGAGAGCTTCTTCTTCGTTTCTGAAATCGGGGGACCGGCAAATGTTCACTGTGGAGCTCCGACCGGGAGAGACTACCATTGTTTCGTGGAAGAAGCTTGTCAAAGATGCCAACAAGGTTAATGGAGTCAACACTGTGCCTGAACCTCCTGCCAACCCCAACCCTGCCGTCGAGTGTCGCATTGATCCGGGACAACCAATTGAAGACGAAGTGAAAGATGCAACAGCACCAAATCGTTTTAATGCTGTTATTGAGAAGATTGAACGCCTATATATGGGTAAGGATAGTAGCGACGAGGAAGATTTAATTCCAGATGATGATCAGTATGATACAGAGGACTCATTCATTGACGATACCGAGTTGGATGAATATTTTGAAGTTGATAATTCGGCAATAAAACACGATGGATTTTTTGTTAATAGGGGGAAGTTGGAGCGCATCGAACCATCTGGACAACCTAATCAACAGTTGAAGAAAAGGCGCAGAAAAGATTTAGAGAAAGGTCATCCTGATAACCATGATGCTCGCTCATCAAATAAGCACACAAAAGTGGGAAAGACAACTGCGGGAAAGAGTGCACTGATGGTTGCAAAGAGTTTTTCAAATCTGTCTCAAAACATGGCCGTCACCCATGGACATCATGAGGATGAAAAATTGCAGAATCAAGTGAATATGCCTGGACATAGTTCCAAAAAAAAATCTGGTGATACGAAAATGATATTGGACCCTTCTCCATCTTCAAAAATTTATAATGGCGATACATCTGTAGCAGAAGCGAAGGACATTGATCCACCAAAGCCTGGTGTTTATCCATCCAAGAACCTTGTTAGCAAATCGAAAGAGTCATGTGGACCATCTGATTGTTTACAACAGAAAGTTCTTGAAAAAAGTGCCCATGCACAATCCAAACCCCAACCTGGGAGACCGTTGAATAACACAGATGAGATGGATTCGTCAGTTCAATTGAAAGAGAAACAAGGCATTCGTGAATTGCCGGACATCAATTTGCCAGAGGGCAAGTATTCCATGCAAACAGCAAAAACACCATATGTGCACAAAAAGGACGGTTCCAGTGTTAGACCAAAAAGCTCTCTGCTAGAAAAGGCTATAAGAGAGTTGGAGAAGATGGTTGCAGAATCTAGGCCACCACTTACGGAGAATCCAGAGGCAGACAATACATCTCAGGCTATCAAAAGGAGATTGCCAAGAGAAATCAAACTCAAGCTTGCTAAAGTTGCTAGATTAGCGGCAAGCCATGGGAAATTGTCAAAGGGGTTGATTAATCGACTTATGAGTATTCTTGGTCACTTGATACAACTGCGAACTCTAAAGAGAAATTTAAAAGTCATGATCAACATGGGTATCTCAGTGAAGCAGGAGAAGGATGATAGGTTTCAACAGATAAAGAAAGAAGTTGTTGAGATGATTAAAATCCGGCCTTTGTCCATGGAATCCAAGGCAATCGAACAACAAGCTGGATCACCTCATGATGTCCGTGAACTTGTTTCGGAAGAAAAAGGAGTTCTGAAAAAGAAATTTGTTATGGATCCTGTATTGGAGGACAAGATTTGTGATCTCTACGATCTGTTTGTTGATGGACTGGATGAGGATGCTGGTCCACAAATCAGA
AAGTTGTATGCCGAGGTAGTTGATCTCTCATTTATTCTTCAATTTCTCAATCAGCTATTTATATCAGATTGCTGCCTCCATCGTTTTGGGAAGGGTGTATGATGATTTTAGCAAGCGGTCTCAAAACATGTCACAGAAGCAGTCAGTTCGTTTATTTTTCATGAAAATTTAATTGCAGAAAACATGTCACAAAAACGCCCATTTTTGCAATTAAATTTTCTTAACTGCACACTACTGTAAACGCCCACTTTGTGTTAGATCTATTAGTTGGTCCTTTTTGTCAGTTGTGCAGTCTCTCTCAAGTTTTTCTGAAGTTCCTTCCCAATGTAGCCAATCATCTTATTTTAGTTATAGCTCCTCTTCTTATCTATCTTAGGCCCCATTTGATAACCCTTTGATTTTTGAAAATTAAGCCTAAAAATACTACTTTTACTAATGAGTTTCTATGTTTTCTTATTTACATTGTACCTATGTTTTAAAAAACCTAGTTAAGTTTTGAAAACTAAAAAAAGTAGCTTTCAAAAACTTGTTTTTTTTTTTGAAATTGACTAGAAATTTAAATAGTACCTCAAGAAAGATGGAAATCTGGAAATCATTGTAGAGAAGATGGAGAAAAATTGTGAAAAAAACAAGCCTAATTTTCAAAAAGCAAAAAGCAAAAATCAAATGGTTATCAAACGGGGCTTAGTTTTTAAACTTGTACTAACTTTAATCTGTGTAATTGGCCATCATCTTTCCAGCTTGCAGAATTGTGGCCAAATGGGTTCATGGATAATCATGGGATCAAACGTGCAATATGTAGGGCAAAAGAGAGGCGGAGAGCATTGCACGGAAGACATAAGGATCAAGAAAAAGTCAAGAGGAAAAAGATATTACCACCTAGAGTAGACGAAACTGTAAGAAATGAGGTCAGTTTGGTTGCTCAGCCACAGCATGCTCGAGAGCGATTAGCCTCTGAATCAGTTTTACAGCCAGCCCCAGCAACCAAGCCTGTATCTGTTTCATCAGCTACAATGGCCCAGCTACCAAGTCCTTCCATTAGTGTTGGAAATCTAGACAGGCTAAAATCCGAGAAGTTGAAGGGAAGCTTAAGCAATTCCCATGAAGATGCAAAAATGGTGGACGGTGCGTTAACCAAGAAAAAGACAAAGAGGAAGGCAGAGATGGAGTTAGATGAAACTCATAATCGCCTCGAGAAGGCGTCGATCCAACATGGGGATGAAAAACACAAGGCCCTCAACAAGCCAACTGCCAGTCTTCCTCCTAAGACAAACATTCAATCAGCTGCTCCTCCAAGTCTGGAACAGTCAAGCTAACATGATATTACTTCCCATGAGTCTAATCTACACCATAATAATGTTCTATCACACATCACCATTCTCACTCCATCCCACAAAATTTTCTTCCCTTCTCCCTCCTCCCTTCTCTTACTCTCGGGTTTCATTTATCGACCCGCGAAAAGCCAACGAAAAAGAAGCCATTTTTGTAATTACTGCATGTTCGTGCATGGGGCTTTCTGGATTAGAAATGTATCTTAAATTAGGGAGAGCTGAAGTAGCAATTTTATTTGGCAAGTAGTCAATTTGAAGGCCCTCTTTTTCCAACTCTTTAAGGCTCAAATCATTTGTTTTTCCTCTGGTTTTCCCTGTCTAATTTTCTATTAGTTTTCTGTTGATAGTTTCCATGCCATAAAATTCCAATTTTAAAGCTTAAAACTCAC

Coding sequence (CDS)

ATGGAAGAGGAAAAATTTAGTGGCGGCGCCGGCGTTGGAGCTCGCACTGGGAATGGTGGTTCCGGTGATTCGTCGAGAGCTTCTTCTTCGTTTCTGAAATCGGGGGACCGGCAAATGTTCACTGTGGAGCTCCGACCGGGAGAGACTACCATTGTTTCGTGGAAGAAGCTTGTCAAAGATGCCAACAAGGTTAATGGAGTCAACACTGTGCCTGAACCTCCTGCCAACCCCAACCCTGCCGTCGAGTGTCGCATTGATCCGGGACAACCAATTGAAGACGAAGTGAAAGATGCAACAGCACCAAATCGTTTTAATGCTGTTATTGAGAAGATTGAACGCCTATATATGGGTAAGGATAGTAGCGACGAGGAAGATTTAATTCCAGATGATGATCAGTATGATACAGAGGACTCATTCATTGACGATACCGAGTTGGATGAATATTTTGAAGTTGATAATTCGGCAATAAAACACGATGGATTTTTTGTTAATAGGGGGAAGTTGGAGCGCATCGAACCATCTGGACAACCTAATCAACAGTTGAAGAAAAGGCGCAGAAAAGATTTAGAGAAAGGTCATCCTGATAACCATGATGCTCGCTCATCAAATAAGCACACAAAAGTGGGAAAGACAACTGCGGGAAAGAGTGCACTGATGGTTGCAAAGAGTTTTTCAAATCTGTCTCAAAACATGGCCGTCACCCATGGACATCATGAGGATGAAAAATTGCAGAATCAAGTGAATATGCCTGGACATAGTTCCAAAAAAAAATCTGGTGATACGAAAATGATATTGGACCCTTCTCCATCTTCAAAAATTTATAATGGCGATACATCTGTAGCAGAAGCGAAGGACATTGATCCACCAAAGCCTGGTGTTTATCCATCCAAGAACCTTGTTAGCAAATCGAAAGAGTCATGTGGACCATCTGATTGTTTACAACAGAAAGTTCTTGAAAAAAGTGCCCATGCACAATCCAAACCCCAACCTGGGAGACCGTTGAATAACACAGATGAGATGGATTCGTCAGTTCAATTGAAAGAGAAACAAGGCATTCGTGAATTGCCGGACATCAATTTGCCAGAGGGCAAGTATTCCATGCAAACAGCAAAAACACCATATGTGCACAAAAAGGACGGTTCCAGTGTTAGACCAAAAAGCTCTCTGCTAGAAAAGGCTATAAGAGAGTTGGAGAAGATGGTTGCAGAATCTAGGCCACCACTTACGGAGAATCCAGAGGCAGACAATACATCTCAGGCTATCAAAAGGAGATTGCCAAGAGAAATCAAACTCAAGCTTGCTAAAGTTGCTAGATTAGCGGCAAGCCATGGGAAATTGTCAAAGGGGTTGATTAATCGACTTATGAGTATTCTTGGTCACTTGATACAACTGCGAACTCTAAAGAGAAATTTAAAAGTCATGATCAACATGGGTATCTCAGTGAAGCAGGAGAAGGATGATAGGTTTCAACAGATAAAGAAAGAAGTTGTTGAGATGATTAAAATCCGGCCTTTGTCCATGGAATCCAAGGCAATCGAACAACAAGCTGGATCACCTCATGATGTCCGTGAACTTGTTTCGGAAGAAAAAGGAGTTCTGAAAAAGAAATTTGTTATGGATCCTGTATTGGAGGACAAGATTTGTGATCTCTACGATCTGTTTGTTGATGGACTGGATGAGGATGCTGGTCCACAAATCAGAAAGTTGTATGCCGAGGTAGTTGATCTCTCATTTATTCTTCAATTTCTCAATCAGCTATTTATATCAGATTGCTGCCTCCATCGTTTTGGGAAGGGTGTATGA

Protein sequence

MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQLKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNTSQAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDLSFILQFLNQLFISDCCLHRFGKGV
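
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code, the following translates the first and last few codons of the CDS and reproduces the matching ends of the protein. The function name `translate` is illustrative and is not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the CDS listed above.
print(translate("ATGGAAGAGGAAAAATTTAGTGGCGGCGCC"))  # MEEEKFSGGA
print(translate("TTTGGGAAGGGTGTATGA"))              # FGKGV
```

The full CDS can be passed to the same function. Ambiguity codes such as N are not handled in this sketch.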
Homology
BLAST of Tan0009634 vs. ExPASy Swiss-Prot
Match: Q8RX78 (Ubinuclein-1 OS=Arabidopsis thaliana OX=3702 GN=UBN1 PE=1 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 8.5e-124
Identity = 292/570 (51.23%), Positives = 362/570 (63.51%), Query Frame = 0

Query: 12  VGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNGVN-TV 71
           V   +G    G+  RAS   L +GDR++  VELRPG+TT VSWKKL++DA KVNG++ +V
Sbjct: 4   VNEMSGGSIGGELLRASPKVLTAGDRKLLKVELRPGDTTYVSWKKLMRDAGKVNGLSASV 63

Query: 72  PEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDL--IP 131
           P+PP N NP +E RI PG P+E E  +    NRFNAVIEKIERLY G DSSD E+L   P
Sbjct: 64  PDPPPNANPNLEFRIAPGHPVEIETNEQPHSNRFNAVIEKIERLYKGNDSSDGEELDGAP 123

Query: 132 DDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQLKKRRRKD 191
           DDD+YDTEDSFIDD ELDEYFEVDNS +KHDGF+VNRGKLER+EPS   NQQ KKRRRKD
Sbjct: 124 DDDEYDTEDSFIDDAELDEYFEVDNSTVKHDGFYVNRGKLERMEPSTTSNQQPKKRRRKD 183

Query: 192 LEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVN 251
             K   D  D   S+KHTK+  T   K                             +Q  
Sbjct: 184 SAKPCRDAVDV--SDKHTKLSITARKK-----------------------------DQST 243

Query: 252 MPGHSSKKKSGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCG 311
            PG    ++S        P PS    + +TSV    D+       + S+N  S      G
Sbjct: 244 APGSWKTQES--------PLPSG-AQDANTSV-PLDDVKHSDRANHQSRNDTSHKSRETG 303

Query: 312 PSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINLPEGKYSMQ 371
            S  L QK   KS H QS    G+   N     + V+ KE  G+ +L   N+   + S Q
Sbjct: 304 SSSALHQKYSNKSLHQQSTSLLGKSPPNVFAEVTVVRQKENNGMHQL--ANVTGSRQSSQ 363

Query: 372 TAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPP-LTENPEADNTSQAIKRRLPR 431
            +      KKDGS+V+ K+S+LEKAIRELEK+V ESRPP +TEN EAD +SQA+KRRLPR
Sbjct: 364 AS------KKDGSNVKSKTSILEKAIRELEKVVVESRPPAITENQEADTSSQAVKRRLPR 423

Query: 432 EIKLKLAKVARLA-ASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKD 491
           ++KLKLAKVAR+A AS GK S  LINRLMSI+GHLIQLR+LKRNLK+MI+MG S  +EKD
Sbjct: 424 DVKLKLAKVARIAQASQGKHSTELINRLMSIVGHLIQLRSLKRNLKIMIDMGDSATREKD 483

Query: 492 DRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDK 551
            RF+QI  EV++MIK +   MES+AI+ +  +  D ++  S EK  L KKFVMD  LEDK
Sbjct: 484 TRFKQINNEVLDMIKAKVSLMESQAIKPEGATSDDFQD--SVEKPSL-KKFVMDAALEDK 521

Query: 552 ICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           +CDLYD+F+DGLDED GPQ +KLY  + +L
Sbjct: 544 LCDLYDIFIDGLDEDQGPQTKKLYVNLAEL 521
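
The percentages in each HSP header are the match counts divided by the alignment length, which includes gap columns. A small sketch that recomputes them from the reported counts follows. The helper name `percent` is illustrative, and rounding to two decimals is an assumption about how the report formats its values.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the Q8RX78 (UBN1) HSP header above.
print(percent(292, 570))  # 51.23 (identical residues)
print(percent(362, 570))  # 63.51 (identical or conservatively substituted)
```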

BLAST of Tan0009634 vs. ExPASy Swiss-Prot
Match: F4I700 (Ubinuclein-2 OS=Arabidopsis thaliana OX=3702 GN=UBN2 PE=1 SV=1)

HSP 1 Score: 411.0 bits (1055), Expect = 2.3e-113
Identity = 271/562 (48.22%), Positives = 350/562 (62.28%), Query Frame = 0

Query: 23  DSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNG--VNTVPEPPANPNPA 82
           +S + SS  L +GDR++  VEL   ETT+VSWKKL+ +A+K NG    + PE   N NP 
Sbjct: 17  ESCKISSEILTAGDRKLLKVELLKEETTLVSWKKLMDEASKENGGLFVSAPERLLNANPN 76

Query: 83  VECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDL--IPDDDQYDTEDS 142
           +E R+ PG   E+E+ +   PNR N+VI KIERLYMGKD SD E+L   PDDD YDTEDS
Sbjct: 77  LEFRLAPGAQTENEMVNQPHPNRLNSVIAKIERLYMGKDGSDGEELDGAPDDDDYDTEDS 136

Query: 143 FIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQL-KKRRRKDLEKGHPDNH 202
           FIDD ELDEYFEVDNS IKHDGFFVNRGKLERIEPS   NQQ  KKRRRK+  K   D  
Sbjct: 137 FIDDAELDEYFEVDNSPIKHDGFFVNRGKLERIEPSATSNQQQPKKRRRKESAKPCGDVV 196

Query: 203 DARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVNMPGHSSKKK 262
           D   S K  K+ KT  GK                             +Q   PG SSKK 
Sbjct: 197 DV--SRKRAKMAKTAGGK-----------------------------DQSASPGPSSKKI 256

Query: 263 SGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCGPSDCLQQKV 322
           S D+K + D     K  NG+ S+   +++       +   N  S   ++ G S  L  K 
Sbjct: 257 SNDSKTVQDSFSPLKAQNGNDSLV-LENVKHTDKANHQPMNATSPKSKAAGSSGPLHPKC 316

Query: 323 LEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINL-PEGKYSMQTAKTPYVH 382
             KS H QS   PG+   N     + V+ +   G   +PD+++  E K S+Q      + 
Sbjct: 317 SSKSVHEQSNSPPGKSRPNVSAKSAVVRQQVNNG---MPDLDIATESKTSIQ------IS 376

Query: 383 KKDGSSVRPKSSLLEKAIRELEKMVAESRPP-LTENPEADNTSQAIKRRLPREIKLKLAK 442
           KK GS+ RPK S LEKAIR LEK+VAESRPP  TEN +AD +SQA+KR LP ++KL LAK
Sbjct: 377 KKSGSNGRPKYSTLEKAIRNLEKLVAESRPPAATENQDADISSQAVKRGLPGDVKLHLAK 436

Query: 443 VARLA-ASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKDDRFQQIKK 502
           VAR+A AS G++S  LINRLM I+GHLIQ+R+LKRNLK+MI+  ++  +EKD RFQ+IK 
Sbjct: 437 VARIAYASQGEISGELINRLMGIVGHLIQIRSLKRNLKIMIDSIVTANREKDTRFQRIKS 496

Query: 503 EVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDKICDLYDLF 562
           E+ EM+K +   +ES+   Q+AG+  D +++ S  K  + KKFVMD  LE+K+CDLYD+F
Sbjct: 497 EITEMLKTQVPLVESQETNQEAGTSDDFQDVGSLGKSPV-KKFVMDVALEEKLCDLYDVF 536

Query: 563 VDGLDEDAGPQIRKLYAEVVDL 577
           V+G+DE +G QIRKLY+++  L
Sbjct: 557 VEGMDEHSGSQIRKLYSDLAQL 536

BLAST of Tan0009634 vs. NCBI nr
Match: XP_022990762.1 (ubinuclein-1-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 1008.1 bits (2605), Expect = 3.3e-290
Identity = 530/578 (91.70%), Positives = 550/578 (95.16%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEP+GQPNQQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPAGQPNQQ 180

Query: 181 LKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHED 240
           LKKRRRKD+EKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNM VTH HH+D
Sbjct: 181 LKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMVVTHEHHDD 240

Query: 241 EKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSKN 300
           EKLQNQ+NMPGHS KKKSGD KMILDPSPSSKIYNGD  TSVAEAKDIDPP+PGV PSKN
Sbjct: 241 EKLQNQLNMPGHSGKKKSGDPKMILDPSPSSKIYNGDTSTSVAEAKDIDPPRPGVLPSKN 300

Query: 301 LVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDI 360
           +VSK KESCGPSD LQQ VLEKSA A SKPQPG+PLN+TDE+D SVQLKEKQ IRELPDI
Sbjct: 301 VVSKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNSTDEIDLSVQLKEKQSIRELPDI 360

Query: 361 NLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNTS 420
           NLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN S
Sbjct: 361 NLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNAS 420

Query: 421 QAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMG 480
           QAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINMG
Sbjct: 421 QAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINMG 480

Query: 481 ISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFV 540
           ISVKQEKD RFQQIKKEVVEMIKIRPLSMESKAIEQQ G+ HD+RELVSEEKGV +KKF 
Sbjct: 481 ISVKQEKDGRFQQIKKEVVEMIKIRPLSMESKAIEQQTGALHDIRELVSEEKGVPRKKFA 540

Query: 541 MDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 MDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 578
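
The percentages on each HSP summary line are the matching-column counts divided by the alignment length. A minimal sketch of that arithmetic follows, assuming the summary line keeps the layout shown above; the helper name `parse_hsp_summary` and the dictionary keys are hypothetical and not part of the database or of any BLAST tool.

```python
import re

# Hypothetical pattern for the summary line as it appears on this page.
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_summary(line):
    """Recompute identity and positive percentages from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }

summary = parse_hsp_summary(
    "Identity = 530/578 (91.70%), Positives = 550/578 (95.16%), Query Frame = 0"
)
print(summary)
```

The alignment length (578 here) counts aligned columns including gaps, so it can exceed the 577-residue query.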

BLAST of Tan0009634 vs. NCBI nr
Match: XP_022990761.1 (ubinuclein-1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1003.4 bits (2593), Expect = 8.1e-289
Identity = 530/579 (91.54%), Positives = 550/579 (94.99%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI-EPSGQPNQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI EP+GQPNQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERISEPAGQPNQ 180

Query: 181 QLKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHE 240
           QLKKRRRKD+EKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNM VTH HH+
Sbjct: 181 QLKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMVVTHEHHD 240

Query: 241 DEKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSK 300
           DEKLQNQ+NMPGHS KKKSGD KMILDPSPSSKIYNGD  TSVAEAKDIDPP+PGV PSK
Sbjct: 241 DEKLQNQLNMPGHSGKKKSGDPKMILDPSPSSKIYNGDTSTSVAEAKDIDPPRPGVLPSK 300

Query: 301 NLVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPD 360
           N+VSK KESCGPSD LQQ VLEKSA A SKPQPG+PLN+TDE+D SVQLKEKQ IRELPD
Sbjct: 301 NVVSKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNSTDEIDLSVQLKEKQSIRELPD 360

Query: 361 INLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNT 420
           INLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN 
Sbjct: 361 INLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNA 420

Query: 421 SQAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINM 480
           SQAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINM
Sbjct: 421 SQAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINM 480

Query: 481 GISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKF 540
           GISVKQEKD RFQQIKKEVVEMIKIRPLSMESKAIEQQ G+ HD+RELVSEEKGV +KKF
Sbjct: 481 GISVKQEKDGRFQQIKKEVVEMIKIRPLSMESKAIEQQTGALHDIRELVSEEKGVPRKKF 540

Query: 541 VMDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
            MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 AMDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 579

BLAST of Tan0009634 vs. NCBI nr
Match: XP_022956026.1 (ubinuclein-1-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 1001.5 bits (2588), Expect = 3.1e-288
Identity = 525/578 (90.83%), Positives = 547/578 (94.64%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEP+GQPNQQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPAGQPNQQ 180

Query: 181 LKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHED 240
           LKKRRRKD+EKGHPDNHDARSSNKHTKVGKTT GKSALMVAKSFSNLSQNM VTH HH+D
Sbjct: 181 LKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTVGKSALMVAKSFSNLSQNMVVTHEHHDD 240

Query: 241 EKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSKN 300
           EKLQNQ+NMPGHS KKKSGDTKMILDPSPSSK+YNGD  TSVAEAKDIDP +PGV PSKN
Sbjct: 241 EKLQNQLNMPGHSGKKKSGDTKMILDPSPSSKVYNGDTSTSVAEAKDIDPLRPGVLPSKN 300

Query: 301 LVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDI 360
           + SK KESCGPSD LQQ VLEKSA A SKPQPG+PLNNTDE+D S+QLKEKQ IRELPDI
Sbjct: 301 VASKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNNTDEIDLSIQLKEKQSIRELPDI 360

Query: 361 NLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNTS 420
           NLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN S
Sbjct: 361 NLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNAS 420

Query: 421 QAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMG 480
           QAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINMG
Sbjct: 421 QAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINMG 480

Query: 481 ISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFV 540
           ISVKQEKD RFQQIKKEVVEMIKIRPLSME+KAIE Q G+ HD+RELVSEEKGV +KKF 
Sbjct: 481 ISVKQEKDGRFQQIKKEVVEMIKIRPLSMETKAIEHQTGALHDIRELVSEEKGVPRKKFA 540

Query: 541 MDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 MDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 578

BLAST of Tan0009634 vs. NCBI nr
Match: XP_023552943.1 (ubinuclein-1-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1001.1 bits (2587), Expect = 4.0e-288
Identity = 525/578 (90.83%), Positives = 547/578 (94.64%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEP+GQPNQQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPAGQPNQQ 180

Query: 181 LKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHED 240
           LKKRRRKD+EKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNM VTH HH+D
Sbjct: 181 LKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMVVTHEHHDD 240

Query: 241 EKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSKN 300
           EKLQNQ+NMPGHS KKKSGD KMILDPSPSSK+YNGD  TSVAEAKD DP +PGV PSKN
Sbjct: 241 EKLQNQLNMPGHSGKKKSGDAKMILDPSPSSKVYNGDTSTSVAEAKDTDPLRPGVLPSKN 300

Query: 301 LVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDI 360
           + SK KESCGPSD LQQ VLEKSA A SKPQPG+PLNNTDE+D S+QLKEKQ IRELPDI
Sbjct: 301 VASKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNNTDEIDLSIQLKEKQSIRELPDI 360

Query: 361 NLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNTS 420
           NLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN S
Sbjct: 361 NLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNAS 420

Query: 421 QAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMG 480
           QAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINMG
Sbjct: 421 QAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINMG 480

Query: 481 ISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFV 540
           ISVKQEKD RFQQIKKEVVEMIKIRPLSME+KAIEQQ G+ HD+RELVSEEKGV +KKF 
Sbjct: 481 ISVKQEKDGRFQQIKKEVVEMIKIRPLSMETKAIEQQTGALHDIRELVSEEKGVPRKKFA 540

Query: 541 MDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 MDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 578

BLAST of Tan0009634 vs. NCBI nr
Match: XP_022956019.1 (ubinuclein-1-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 996.9 bits (2576), Expect = 7.6e-287
Identity = 525/579 (90.67%), Positives = 547/579 (94.47%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI-EPSGQPNQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI EP+GQPNQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERISEPAGQPNQ 180

Query: 181 QLKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHE 240
           QLKKRRRKD+EKGHPDNHDARSSNKHTKVGKTT GKSALMVAKSFSNLSQNM VTH HH+
Sbjct: 181 QLKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTVGKSALMVAKSFSNLSQNMVVTHEHHD 240

Query: 241 DEKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSK 300
           DEKLQNQ+NMPGHS KKKSGDTKMILDPSPSSK+YNGD  TSVAEAKDIDP +PGV PSK
Sbjct: 241 DEKLQNQLNMPGHSGKKKSGDTKMILDPSPSSKVYNGDTSTSVAEAKDIDPLRPGVLPSK 300

Query: 301 NLVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPD 360
           N+ SK KESCGPSD LQQ VLEKSA A SKPQPG+PLNNTDE+D S+QLKEKQ IRELPD
Sbjct: 301 NVASKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNNTDEIDLSIQLKEKQSIRELPD 360

Query: 361 INLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNT 420
           INLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN 
Sbjct: 361 INLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNA 420

Query: 421 SQAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINM 480
           SQAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINM
Sbjct: 421 SQAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINM 480

Query: 481 GISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKF 540
           GISVKQEKD RFQQIKKEVVEMIKIRPLSME+KAIE Q G+ HD+RELVSEEKGV +KKF
Sbjct: 481 GISVKQEKDGRFQQIKKEVVEMIKIRPLSMETKAIEHQTGALHDIRELVSEEKGVPRKKF 540

Query: 541 VMDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
            MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 AMDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 579

BLAST of Tan0009634 vs. ExPASy TrEMBL
Match: A0A6J1JJR4 (ubinuclein-1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111487545 PE=4 SV=1)

HSP 1 Score: 1008.1 bits (2605), Expect = 1.6e-290
Identity = 530/578 (91.70%), Positives = 550/578 (95.16%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEP+GQPNQQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPAGQPNQQ 180

Query: 181 LKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHED 240
           LKKRRRKD+EKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNM VTH HH+D
Sbjct: 181 LKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMVVTHEHHDD 240

Query: 241 EKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSKN 300
           EKLQNQ+NMPGHS KKKSGD KMILDPSPSSKIYNGD  TSVAEAKDIDPP+PGV PSKN
Sbjct: 241 EKLQNQLNMPGHSGKKKSGDPKMILDPSPSSKIYNGDTSTSVAEAKDIDPPRPGVLPSKN 300

Query: 301 LVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDI 360
           +VSK KESCGPSD LQQ VLEKSA A SKPQPG+PLN+TDE+D SVQLKEKQ IRELPDI
Sbjct: 301 VVSKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNSTDEIDLSVQLKEKQSIRELPDI 360

Query: 361 NLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNTS 420
           NLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN S
Sbjct: 361 NLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNAS 420

Query: 421 QAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMG 480
           QAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINMG
Sbjct: 421 QAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINMG 480

Query: 481 ISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFV 540
           ISVKQEKD RFQQIKKEVVEMIKIRPLSMESKAIEQQ G+ HD+RELVSEEKGV +KKF 
Sbjct: 481 ISVKQEKDGRFQQIKKEVVEMIKIRPLSMESKAIEQQTGALHDIRELVSEEKGVPRKKFA 540

Query: 541 MDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 MDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 578

BLAST of Tan0009634 vs. ExPASy TrEMBL
Match: A0A6J1JNV3 (ubinuclein-1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487545 PE=4 SV=1)

HSP 1 Score: 1003.4 bits (2593), Expect = 3.9e-289
Identity = 530/579 (91.54%), Positives = 550/579 (94.99%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI-EPSGQPNQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI EP+GQPNQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERISEPAGQPNQ 180

Query: 181 QLKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHE 240
           QLKKRRRKD+EKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNM VTH HH+
Sbjct: 181 QLKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMVVTHEHHD 240

Query: 241 DEKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSK 300
           DEKLQNQ+NMPGHS KKKSGD KMILDPSPSSKIYNGD  TSVAEAKDIDPP+PGV PSK
Sbjct: 241 DEKLQNQLNMPGHSGKKKSGDPKMILDPSPSSKIYNGDTSTSVAEAKDIDPPRPGVLPSK 300

Query: 301 NLVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPD 360
           N+VSK KESCGPSD LQQ VLEKSA A SKPQPG+PLN+TDE+D SVQLKEKQ IRELPD
Sbjct: 301 NVVSKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNSTDEIDLSVQLKEKQSIRELPD 360

Query: 361 INLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNT 420
           INLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN 
Sbjct: 361 INLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNA 420

Query: 421 SQAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINM 480
           SQAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINM
Sbjct: 421 SQAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINM 480

Query: 481 GISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKF 540
           GISVKQEKD RFQQIKKEVVEMIKIRPLSMESKAIEQQ G+ HD+RELVSEEKGV +KKF
Sbjct: 481 GISVKQEKDGRFQQIKKEVVEMIKIRPLSMESKAIEQQTGALHDIRELVSEEKGVPRKKF 540

Query: 541 VMDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
            MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 AMDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 579

BLAST of Tan0009634 vs. ExPASy TrEMBL
Match: A0A6J1GVG2 (ubinuclein-1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111457837 PE=4 SV=1)

HSP 1 Score: 1001.5 bits (2588), Expect = 1.5e-288
Identity = 525/578 (90.83%), Positives = 547/578 (94.64%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEP+GQPNQQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPAGQPNQQ 180

Query: 181 LKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHED 240
           LKKRRRKD+EKGHPDNHDARSSNKHTKVGKTT GKSALMVAKSFSNLSQNM VTH HH+D
Sbjct: 181 LKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTVGKSALMVAKSFSNLSQNMVVTHEHHDD 240

Query: 241 EKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSKN 300
           EKLQNQ+NMPGHS KKKSGDTKMILDPSPSSK+YNGD  TSVAEAKDIDP +PGV PSKN
Sbjct: 241 EKLQNQLNMPGHSGKKKSGDTKMILDPSPSSKVYNGDTSTSVAEAKDIDPLRPGVLPSKN 300

Query: 301 LVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDI 360
           + SK KESCGPSD LQQ VLEKSA A SKPQPG+PLNNTDE+D S+QLKEKQ IRELPDI
Sbjct: 301 VASKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNNTDEIDLSIQLKEKQSIRELPDI 360

Query: 361 NLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNTS 420
           NLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN S
Sbjct: 361 NLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNAS 420

Query: 421 QAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMG 480
           QAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINMG
Sbjct: 421 QAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINMG 480

Query: 481 ISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFV 540
           ISVKQEKD RFQQIKKEVVEMIKIRPLSME+KAIE Q G+ HD+RELVSEEKGV +KKF 
Sbjct: 481 ISVKQEKDGRFQQIKKEVVEMIKIRPLSMETKAIEHQTGALHDIRELVSEEKGVPRKKFA 540

Query: 541 MDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 MDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 578

BLAST of Tan0009634 vs. ExPASy TrEMBL
Match: A0A6J1GXV5 (ubinuclein-1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457837 PE=4 SV=1)

HSP 1 Score: 996.9 bits (2576), Expect = 3.7e-287
Identity = 525/579 (90.67%), Positives = 547/579 (94.47%), Query Frame = 0

Query: 1   MEEEKFSGGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60
           MEEEKFSGGAG GARTGNGGSGD SRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD
Sbjct: 1   MEEEKFSGGAGGGARTGNGGSGDLSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKD 60

Query: 61  ANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120
           +N++NGVNT+P PPANP PAVECRIDPGQ IEDEVKDATAPNRFNAVIEKIERLYMGKDS
Sbjct: 61  SNRINGVNTLPVPPANPIPAVECRIDPGQSIEDEVKDATAPNRFNAVIEKIERLYMGKDS 120

Query: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI-EPSGQPNQ 180
           SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERI EP+GQPNQ
Sbjct: 121 SDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERISEPAGQPNQ 180

Query: 181 QLKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHE 240
           QLKKRRRKD+EKGHPDNHDARSSNKHTKVGKTT GKSALMVAKSFSNLSQNM VTH HH+
Sbjct: 181 QLKKRRRKDIEKGHPDNHDARSSNKHTKVGKTTVGKSALMVAKSFSNLSQNMVVTHEHHD 240

Query: 241 DEKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYPSK 300
           DEKLQNQ+NMPGHS KKKSGDTKMILDPSPSSK+YNGD  TSVAEAKDIDP +PGV PSK
Sbjct: 241 DEKLQNQLNMPGHSGKKKSGDTKMILDPSPSSKVYNGDTSTSVAEAKDIDPLRPGVLPSK 300

Query: 301 NLVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPD 360
           N+ SK KESCGPSD LQQ VLEKSA A SKPQPG+PLNNTDE+D S+QLKEKQ IRELPD
Sbjct: 301 NVASKLKESCGPSDSLQQNVLEKSALAPSKPQPGKPLNNTDEIDLSIQLKEKQSIRELPD 360

Query: 361 INLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNT 420
           INLPEGKYS QTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADN 
Sbjct: 361 INLPEGKYSTQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEADNA 420

Query: 421 SQAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINM 480
           SQAIKRRLPREIKLKLAKVARLAA+HGKLSKGLINRLMSILGHLIQLRTLKRNLK+MINM
Sbjct: 421 SQAIKRRLPREIKLKLAKVARLAANHGKLSKGLINRLMSILGHLIQLRTLKRNLKIMINM 480

Query: 481 GISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKF 540
           GISVKQEKD RFQQIKKEVVEMIKIRPLSME+KAIE Q G+ HD+RELVSEEKGV +KKF
Sbjct: 481 GISVKQEKDGRFQQIKKEVVEMIKIRPLSMETKAIEHQTGALHDIRELVSEEKGVPRKKF 540

Query: 541 VMDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
            MDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 AMDPALEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 579

BLAST of Tan0009634 vs. ExPASy TrEMBL
Match: A0A6J1CJZ6 (ubinuclein-1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011989 PE=4 SV=1)

HSP 1 Score: 989.6 bits (2557), Expect = 5.8e-285
Identity = 520/581 (89.50%), Positives = 545/581 (93.80%), Query Frame = 0

Query: 1   MEEEKFS---GGAGVGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKL 60
           MEEEK S   GG+GVG RTGNGGSGDSSRASSSF+KSGDRQMFTVELRPGETTIVSWKKL
Sbjct: 1   MEEEKISGGGGGSGVGGRTGNGGSGDSSRASSSFVKSGDRQMFTVELRPGETTIVSWKKL 60

Query: 61  VKDANKVNGVNTVPEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMG 120
           VKDANKVNG+NTVPEPPANPNP VECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMG
Sbjct: 61  VKDANKVNGLNTVPEPPANPNPVVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMG 120

Query: 121 KDSSDEEDLIPDDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQP 180
           KDSSD+ED+IPDDD+YDTEDSFIDDTELDEYFEVD+ AIKHDGFFVNRGKLERIEPSGQP
Sbjct: 121 KDSSDDEDIIPDDDRYDTEDSFIDDTELDEYFEVDDLAIKHDGFFVNRGKLERIEPSGQP 180

Query: 181 NQQLKKRRRKDLEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGH 240
           NQQLKKRRRKDLEKGH +NHD RSSNKHTKVGK   GK+ALMVAKSFSNLSQNMA+TH H
Sbjct: 181 NQQLKKRRRKDLEKGHSENHDGRSSNKHTKVGKIATGKTALMVAKSFSNLSQNMAITHEH 240

Query: 241 HEDEKLQNQVNMPGHSSKKKSGDTKMILDPSPSSKIYNGD--TSVAEAKDIDPPKPGVYP 300
            EDEKLQNQ+NMPGH SKKK GDTKM LDPS S K+YNGD  TSVAEAKD+DP KPG+ P
Sbjct: 241 REDEKLQNQLNMPGHGSKKKIGDTKMTLDPSTSIKVYNGDTSTSVAEAKDLDPSKPGIVP 300

Query: 301 SKNLVSKSKESCGPSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIREL 360
           SKNL SKSKESCGPSD LQQ +LEKSAHA  KPQPGRPLN+ +E+DSS QLKEK GIREL
Sbjct: 301 SKNLASKSKESCGPSDSLQQNLLEKSAHAPFKPQPGRPLNSVEEVDSSTQLKEKHGIREL 360

Query: 361 PDINLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEAD 420
           PDINLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEAD
Sbjct: 361 PDINLPEGKYSMQTAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPPLTENPEAD 420

Query: 421 NTSQAIKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMI 480
           N+SQA+KRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMI
Sbjct: 421 NSSQAVKRRLPREIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMI 480

Query: 481 NMGISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKK 540
           NMGISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKA EQQAG+P ++RELVSEEKGV KK
Sbjct: 481 NMGISVKQEKDDRFQQIKKEVVEMIKIRPLSMESKAFEQQAGAPQNLRELVSEEKGVPKK 540

Query: 541 KFVMDPVLEDKICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           KFVMDP LEDKICDLYDLFVDGLDEDAGPQIRKLYAE+ +L
Sbjct: 541 KFVMDPSLEDKICDLYDLFVDGLDEDAGPQIRKLYAELAEL 581

BLAST of Tan0009634 vs. TAIR 10
Match: AT1G21610.2 (wound-responsive family protein)

HSP 1 Score: 450.3 bits (1157), Expect = 2.5e-126
Identity = 292/569 (51.32%), Positives = 362/569 (63.62%), Query Frame = 0

Query: 12  VGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNGVN-TV 71
           V   +G    G+  RAS   L +GDR++  VELRPG+TT VSWKKL++DA KVNG++ +V
Sbjct: 4   VNEMSGGSIGGELLRASPKVLTAGDRKLLKVELRPGDTTYVSWKKLMRDAGKVNGLSASV 63

Query: 72  PEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDL--IP 131
           P+PP N NP +E RI PG P+E E  +    NRFNAVIEKIERLY G DSSD E+L   P
Sbjct: 64  PDPPPNANPNLEFRIAPGHPVEIETNEQPHSNRFNAVIEKIERLYKGNDSSDGEELDGAP 123

Query: 132 DDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQLKKRRRKD 191
           DDD+YDTEDSFIDD ELDEYFEVDNS +KHDGF+VNRGKLER+EPS   NQQ KKRRRKD
Sbjct: 124 DDDEYDTEDSFIDDAELDEYFEVDNSTVKHDGFYVNRGKLERMEPSTTSNQQPKKRRRKD 183

Query: 192 LEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVN 251
             K   D  D   S+KHTK+  T   K                             +Q  
Sbjct: 184 SAKPCRDAVDV--SDKHTKLSITARKK-----------------------------DQST 243

Query: 252 MPGHSSKKKSGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCG 311
            PG    ++S        P PS    + +TSV    D+       + S+N  S      G
Sbjct: 244 APGSWKTQES--------PLPSG-AQDANTSV-PLDDVKHSDRANHQSRNDTSHKSRETG 303

Query: 312 PSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINLPEGKYSMQ 371
            S  L QK   KS H QS    G+   N     + V+ KE  G+ +L   N+   + S Q
Sbjct: 304 SSSALHQKYSNKSLHQQSTSLLGKSPPNVFAEVTVVRQKENNGMHQL--ANVTGSRQSSQ 363

Query: 372 TAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPP-LTENPEADNTSQAIKRRLPR 431
            +      KKDGS+V+ K+S+LEKAIRELEK+V ESRPP +TEN EAD +SQA+KRRLPR
Sbjct: 364 AS------KKDGSNVKSKTSILEKAIRELEKVVVESRPPAITENQEADTSSQAVKRRLPR 423

Query: 432 EIKLKLAKVARLAASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKDD 491
           ++KLKLAKVAR+AAS GK S  LINRLMSI+GHLIQLR+LKRNLK+MI+MG S  +EKD 
Sbjct: 424 DVKLKLAKVARIAASQGKHSTELINRLMSIVGHLIQLRSLKRNLKIMIDMGDSATREKDT 483

Query: 492 RFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDKI 551
           RF+QI  EV++MIK +   MES+AI+ +  +  D ++  S EK  L KKFVMD  LEDK+
Sbjct: 484 RFKQINNEVLDMIKAKVSLMESQAIKPEGATSDDFQD--SVEKPSL-KKFVMDAALEDKL 520

Query: 552 CDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           CDLYD+F+DGLDED GPQ +KLY  + +L
Sbjct: 544 CDLYDIFIDGLDEDQGPQTKKLYVNLAEL 520

BLAST of Tan0009634 vs. TAIR 10
Match: AT1G21610.1 (wound-responsive family protein)

HSP 1 Score: 445.7 bits (1145), Expect = 6.0e-125
Identity = 292/570 (51.23%), Positives = 362/570 (63.51%), Query Frame = 0

Query: 12  VGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNGVN-TV 71
           V   +G    G+  RAS   L +GDR++  VELRPG+TT VSWKKL++DA KVNG++ +V
Sbjct: 4   VNEMSGGSIGGELLRASPKVLTAGDRKLLKVELRPGDTTYVSWKKLMRDAGKVNGLSASV 63

Query: 72  PEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDL--IP 131
           P+PP N NP +E RI PG P+E E  +    NRFNAVIEKIERLY G DSSD E+L   P
Sbjct: 64  PDPPPNANPNLEFRIAPGHPVEIETNEQPHSNRFNAVIEKIERLYKGNDSSDGEELDGAP 123

Query: 132 DDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQLKKRRRKD 191
           DDD+YDTEDSFIDD ELDEYFEVDNS +KHDGF+VNRGKLER+EPS   NQQ KKRRRKD
Sbjct: 124 DDDEYDTEDSFIDDAELDEYFEVDNSTVKHDGFYVNRGKLERMEPSTTSNQQPKKRRRKD 183

Query: 192 LEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVN 251
             K   D  D   S+KHTK+  T   K                             +Q  
Sbjct: 184 SAKPCRDAVDV--SDKHTKLSITARKK-----------------------------DQST 243

Query: 252 MPGHSSKKKSGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCG 311
            PG    ++S        P PS    + +TSV    D+       + S+N  S      G
Sbjct: 244 APGSWKTQES--------PLPSG-AQDANTSV-PLDDVKHSDRANHQSRNDTSHKSRETG 303

Query: 312 PSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINLPEGKYSMQ 371
            S  L QK   KS H QS    G+   N     + V+ KE  G+ +L   N+   + S Q
Sbjct: 304 SSSALHQKYSNKSLHQQSTSLLGKSPPNVFAEVTVVRQKENNGMHQL--ANVTGSRQSSQ 363

Query: 372 TAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPP-LTENPEADNTSQAIKRRLPR 431
            +      KKDGS+V+ K+S+LEKAIRELEK+V ESRPP +TEN EAD +SQA+KRRLPR
Sbjct: 364 AS------KKDGSNVKSKTSILEKAIRELEKVVVESRPPAITENQEADTSSQAVKRRLPR 423

Query: 432 EIKLKLAKVARLA-ASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKD 491
           ++KLKLAKVAR+A AS GK S  LINRLMSI+GHLIQLR+LKRNLK+MI+MG S  +EKD
Sbjct: 424 DVKLKLAKVARIAQASQGKHSTELINRLMSIVGHLIQLRSLKRNLKIMIDMGDSATREKD 483

Query: 492 DRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDK 551
            RF+QI  EV++MIK +   MES+AI+ +  +  D ++  S EK  L KKFVMD  LEDK
Sbjct: 484 TRFKQINNEVLDMIKAKVSLMESQAIKPEGATSDDFQD--SVEKPSL-KKFVMDAALEDK 521

Query: 552 ICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           +CDLYD+F+DGLDED GPQ +KLY  + +L
Sbjct: 544 LCDLYDIFIDGLDEDQGPQTKKLYVNLAEL 521

BLAST of Tan0009634 vs. TAIR 10
Match: AT1G21610.3 (wound-responsive family protein)

HSP 1 Score: 445.7 bits (1145), Expect = 6.0e-125
Identity = 292/570 (51.23%), Positives = 362/570 (63.51%), Query Frame = 0

Query: 12  VGARTGNGGSGDSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNGVN-TV 71
           V   +G    G+  RAS   L +GDR++  VELRPG+TT VSWKKL++DA KVNG++ +V
Sbjct: 4   VNEMSGGSIGGELLRASPKVLTAGDRKLLKVELRPGDTTYVSWKKLMRDAGKVNGLSASV 63

Query: 72  PEPPANPNPAVECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDL--IP 131
           P+PP N NP +E RI PG P+E E  +    NRFNAVIEKIERLY G DSSD E+L   P
Sbjct: 64  PDPPPNANPNLEFRIAPGHPVEIETNEQPHSNRFNAVIEKIERLYKGNDSSDGEELDGAP 123

Query: 132 DDDQYDTEDSFIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQLKKRRRKD 191
           DDD+YDTEDSFIDD ELDEYFEVDNS +KHDGF+VNRGKLER+EPS   NQQ KKRRRKD
Sbjct: 124 DDDEYDTEDSFIDDAELDEYFEVDNSTVKHDGFYVNRGKLERMEPSTTSNQQPKKRRRKD 183

Query: 192 LEKGHPDNHDARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVN 251
             K   D  D   S+KHTK+  T   K                             +Q  
Sbjct: 184 SAKPCRDAVDV--SDKHTKLSITARKK-----------------------------DQST 243

Query: 252 MPGHSSKKKSGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCG 311
            PG    ++S        P PS    + +TSV    D+       + S+N  S      G
Sbjct: 244 APGSWKTQES--------PLPSG-AQDANTSV-PLDDVKHSDRANHQSRNDTSHKSRETG 303

Query: 312 PSDCLQQKVLEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINLPEGKYSMQ 371
            S  L QK   KS H QS    G+   N     + V+ KE  G+ +L   N+   + S Q
Sbjct: 304 SSSALHQKYSNKSLHQQSTSLLGKSPPNVFAEVTVVRQKENNGMHQL--ANVTGSRQSSQ 363

Query: 372 TAKTPYVHKKDGSSVRPKSSLLEKAIRELEKMVAESRPP-LTENPEADNTSQAIKRRLPR 431
            +      KKDGS+V+ K+S+LEKAIRELEK+V ESRPP +TEN EAD +SQA+KRRLPR
Sbjct: 364 AS------KKDGSNVKSKTSILEKAIRELEKVVVESRPPAITENQEADTSSQAVKRRLPR 423

Query: 432 EIKLKLAKVARLA-ASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKD 491
           ++KLKLAKVAR+A AS GK S  LINRLMSI+GHLIQLR+LKRNLK+MI+MG S  +EKD
Sbjct: 424 DVKLKLAKVARIAQASQGKHSTELINRLMSIVGHLIQLRSLKRNLKIMIDMGDSATREKD 483

Query: 492 DRFQQIKKEVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDK 551
            RF+QI  EV++MIK +   MES+AI+ +  +  D ++  S EK  L KKFVMD  LEDK
Sbjct: 484 TRFKQINNEVLDMIKAKVSLMESQAIKPEGATSDDFQD--SVEKPSL-KKFVMDAALEDK 521

Query: 552 ICDLYDLFVDGLDEDAGPQIRKLYAEVVDL 577
           +CDLYD+F+DGLDED GPQ +KLY  + +L
Sbjct: 544 LCDLYDIFIDGLDEDQGPQTKKLYVNLAEL 521

BLAST of Tan0009634 vs. TAIR 10
Match: AT1G77310.1 (BEST Arabidopsis thaliana protein match is: wound-responsive family protein (TAIR:AT1G21610.1); Has 493 Blast hits to 482 proteins in 163 species: Archaea - 0; Bacteria - 100; Metazoa - 172; Fungi - 66; Plants - 65; Viruses - 7; Other Eukaryotes - 83 (source: NCBI BLink).)

HSP 1 Score: 411.0 bits (1055), Expect = 1.7e-114
Identity = 271/562 (48.22%), Positives = 350/562 (62.28%), Query Frame = 0

Query: 23  DSSRASSSFLKSGDRQMFTVELRPGETTIVSWKKLVKDANKVNG--VNTVPEPPANPNPA 82
           +S + SS  L +GDR++  VEL   ETT+VSWKKL+ +A+K NG    + PE   N NP 
Sbjct: 17  ESCKISSEILTAGDRKLLKVELLKEETTLVSWKKLMDEASKENGGLFVSAPERLLNANPN 76

Query: 83  VECRIDPGQPIEDEVKDATAPNRFNAVIEKIERLYMGKDSSDEEDL--IPDDDQYDTEDS 142
           +E R+ PG   E+E+ +   PNR N+VI KIERLYMGKD SD E+L   PDDD YDTEDS
Sbjct: 77  LEFRLAPGAQTENEMVNQPHPNRLNSVIAKIERLYMGKDGSDGEELDGAPDDDDYDTEDS 136

Query: 143 FIDDTELDEYFEVDNSAIKHDGFFVNRGKLERIEPSGQPNQQL-KKRRRKDLEKGHPDNH 202
           FIDD ELDEYFEVDNS IKHDGFFVNRGKLERIEPS   NQQ  KKRRRK+  K   D  
Sbjct: 137 FIDDAELDEYFEVDNSPIKHDGFFVNRGKLERIEPSATSNQQQPKKRRRKESAKPCGDVV 196

Query: 203 DARSSNKHTKVGKTTAGKSALMVAKSFSNLSQNMAVTHGHHEDEKLQNQVNMPGHSSKKK 262
           D   S K  K+ KT  GK                             +Q   PG SSKK 
Sbjct: 197 DV--SRKRAKMAKTAGGK-----------------------------DQSASPGPSSKKI 256

Query: 263 SGDTKMILDPSPSSKIYNGDTSVAEAKDIDPPKPGVYPSKNLVSKSKESCGPSDCLQQKV 322
           S D+K + D     K  NG+ S+   +++       +   N  S   ++ G S  L  K 
Sbjct: 257 SNDSKTVQDSFSPLKAQNGNDSLV-LENVKHTDKANHQPMNATSPKSKAAGSSGPLHPKC 316

Query: 323 LEKSAHAQSKPQPGRPLNNTDEMDSSVQLKEKQGIRELPDINL-PEGKYSMQTAKTPYVH 382
             KS H QS   PG+   N     + V+ +   G   +PD+++  E K S+Q      + 
Sbjct: 317 SSKSVHEQSNSPPGKSRPNVSAKSAVVRQQVNNG---MPDLDIATESKTSIQ------IS 376

Query: 383 KKDGSSVRPKSSLLEKAIRELEKMVAESRPP-LTENPEADNTSQAIKRRLPREIKLKLAK 442
           KK GS+ RPK S LEKAIR LEK+VAESRPP  TEN +AD +SQA+KR LP ++KL LAK
Sbjct: 377 KKSGSNGRPKYSTLEKAIRNLEKLVAESRPPAATENQDADISSQAVKRGLPGDVKLHLAK 436

Query: 443 VARLA-ASHGKLSKGLINRLMSILGHLIQLRTLKRNLKVMINMGISVKQEKDDRFQQIKK 502
           VAR+A AS G++S  LINRLM I+GHLIQ+R+LKRNLK+MI+  ++  +EKD RFQ+IK 
Sbjct: 437 VARIAYASQGEISGELINRLMGIVGHLIQIRSLKRNLKIMIDSIVTANREKDTRFQRIKS 496

Query: 503 EVVEMIKIRPLSMESKAIEQQAGSPHDVRELVSEEKGVLKKKFVMDPVLEDKICDLYDLF 562
           E+ EM+K +   +ES+   Q+AG+  D +++ S  K  + KKFVMD  LE+K+CDLYD+F
Sbjct: 497 EITEMLKTQVPLVESQETNQEAGTSDDFQDVGSLGKSPV-KKFVMDVALEEKLCDLYDVF 536

Query: 563 VDGLDEDAGPQIRKLYAEVVDL 577
           V+G+DE +G QIRKLY+++  L
Sbjct: 557 VEGMDEHSGSQIRKLYSDLAQL 536
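
The percentages reported for the AT1G77310.1 HSP can be recomputed from the match counts as a quick sanity check. The sketch below is illustrative only: `hsp_percentages` is a hypothetical helper, not part of BLAST or of this database, and it assumes the statistics line follows the `label = matches/length` shape shown in the HSP header.

```python
import re

# Statistics line for the AT1G77310.1 HSP, copied from the record.
HSP_LINE = "Identity = 271/562 (48.22%), Positives = 350/562 (62.28%), Query Frame = 0"


def hsp_percentages(line):
    """Hypothetical helper: recompute percentages from 'label = matches/length' pairs."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", line)
    return {label: round(100 * int(m) / int(n), 2) for label, m, n in pairs}


percentages = hsp_percentages(HSP_LINE)
print(percentages)
```

Both recomputed values agree with the figures printed in the HSP header, and the identity value matches the one listed for AT1G77310.1 in the hit summary.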

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
Q8RX78      8.5e-124  51.23         Ubinuclein-1 OS=Arabidopsis thaliana OX=3702 GN=UBN1 PE=1 SV=1
F4I700      2.3e-113  48.22         Ubinuclein-2 OS=Arabidopsis thaliana OX=3702 GN=UBN2 PE=1 SV=1

Match Name      E-value   Identity (%)  Description
XP_022990762.1  3.3e-290  91.70         ubinuclein-1-like isoform X2 [Cucurbita maxima]
XP_022990761.1  8.1e-289  91.54         ubinuclein-1-like isoform X1 [Cucurbita maxima]
XP_022956026.1  3.1e-288  90.83         ubinuclein-1-like isoform X2 [Cucurbita moschata]
XP_023552943.1  4.0e-288  90.83         ubinuclein-1-like isoform X2 [Cucurbita pepo subsp. pepo]
XP_022956019.1  7.6e-287  90.67         ubinuclein-1-like isoform X1 [Cucurbita moschata]

Match Name  E-value   Identity (%)  Description
A0A6J1JJR4  1.6e-290  91.70         ubinuclein-1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111487545 PE=4 SV...
A0A6J1JNV3  3.9e-289  91.54         ubinuclein-1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487545 PE=4 SV...
A0A6J1GVG2  1.5e-288  90.83         ubinuclein-1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111457837 PE=4 ...
A0A6J1GXV5  3.7e-287  90.67         ubinuclein-1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457837 PE=4 ...
A0A6J1CJZ6  5.8e-285  89.50         ubinuclein-1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011989 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT1G21610.2  2.5e-126  51.32         wound-responsive family protein
AT1G21610.1  6.0e-125  51.23         wound-responsive family protein
AT1G21610.3  6.0e-125  51.23         wound-responsive family protein
AT1G77310.1  1.7e-114  48.22         BEST Arabidopsis thaliana protein match is: wound-responsive family protein (TAI...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description      Source       Source Term     Source Description                             Alignment
IPR014840  Hpc2-related domain  PFAM         PF08729         HUN                                            coord: 119..169; e-value: 1.7E-13; score: 50.4
None       No IPR available     MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 168..212
None       No IPR available     MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 179..205
None       No IPR available     MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 18..36
None       No IPR available     MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 234..345
None       No IPR available     MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 1..38
None       No IPR available     PANTHER      PTHR21669       CAPZ-INTERACTING PROTEIN AND RELATED PROTEINS  coord: 25..576
None       No IPR available     PANTHER      PTHR21669:SF27  WOUND-RESPONSIVE FAMILY PROTEIN                coord: 25..576
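
The `coord: start..end` values in the Alignment column can be turned into span lengths with a few lines of code. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so); `span_length` and the dictionary keys are names chosen here for illustration.

```python
import re

# Coordinates copied from the InterPro table; the keys are shorthand labels chosen here.
FEATURES = {
    "PF08729 (HUN)": "coord: 119..169",
    "PTHR21669": "coord: 25..576",
}


def span_length(coord):
    """Length in residues of a 'coord: start..end' range, assuming inclusive ends."""
    start, end = map(int, re.search(r"(\d+)\.\.(\d+)", coord).groups())
    return end - start + 1


lengths = {name: span_length(coord) for name, coord in FEATURES.items()}
print(lengths)
```

Under that assumption the Pfam HUN match covers 51 residues, while the PANTHER family match spans 552 residues.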

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0009634.1  Tan0009634.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006336      DNA replication-independent chromatin assembly
cellular_component  GO:0005634      nucleus