Tan0004012 (gene) Snake gourd v1

Overview
Name: Tan0004012
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: THO complex subunit 4D-like
Location: LG04: 73612457 .. 73627651 (+)
RNA-Seq Expression: Tan0004012
Synteny: Tan0004012
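
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. A minimal sketch of splitting it apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG04: 73612457 .. 73627651 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("LG04: 73612457 .. 73627651 (+)")
print(location)
```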
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATAAAACGCAATTGTAAAAACAGAGGGAAAGAACAAAAGGTTCGATGAAAGAAATATCGTTGCCTTAGTCCCGAAGGAAAAGTCTCTCTCAAGCCGCCGCCGCAACCGCCGCCTCTCGCCAGAAGAATCTCTGTCTCTCTCTTAGAAGCCCTAAAAATCCCACACGCACATATCGCTTTTCATCATCATCCTCTCGGAAATTCTCTCCTTGATTTCGTTGAAATTTTTATCTTCGCTCTCTCTGTCGGAAGTCCTTCGACTGTTTCTACGGATCATCTTCAGGTAACTTTTGTTTCTCTCTTTCTGTTTTTTTTTTAAACTTTAATTACGGGGATTTGTTATGGCGTTATACTGTTGTATTATTTGTCAATCGGAGTTTTGAATAGAGATGTTGTTATTTTTTGAGAATGGTCGATGCGATCGGGATATTGATTTTGGGTTTTAGATCTGAAATCCGAAATTAATAGGACTGTGTTAAGAGTGTCTTTATTATAATCTTATTGATTTTGGAAGTAAATGAATTATGTGTGGGCGTGGATATTGGAGTTGGCATAATCATTGACTTGCATTTTGTTTTTAAATATATGTCTACGACTCTGTTCTGTATTACTTCCAAAGTCCAAATTATTGAAGGCTAGAGGACGATCTTAAACAAAACATCATGAAGAAACGATAATTACATTTATTTCGTTTTTCTACTCTTTTATCATCAAGACTAAATTAATTTTTTTGACTTTTAAAATCATCATTTTGGTCCACGAAAAAGAACCCAAAAGCTCATGGTGGAGTATGGTTGGAAAAGAAATAGACTTGCAGACTTAGGATGGGGTTGTTTAACATGGTAATTTTTTTTTCCTTTTCAAAAGCTCGAAGTGCGGATAACATACAAAAACAACACAATGGTTGAATAATTTAATTCTTTCAAAAACAACATGGTAAAAACAATCTTCATACTCACAGTTTAGAGATCCAAGCAATAACTAAATATGAAGTCCTCCATTTTTTCTTCTCTACTTTTGCTCATTGCCATAGTCCTAATCTGTAAGGGTGGATGAAGATGAGGCTTTATTTATATGAAGATGAGGCTTTATTTATTGTCTTTGATATAATCTTATGTTTTGTAATGCTTGATTATGCATTTTTATGAATGCCTTTTTGTTTAAAATTTAGGCTTATGCCTCACACTGTCTCAATGTTTATGCATTGTCTTATTATTATGTTTACCTTTAATAGTGATATGAAGTTCATTTGCACTTCTATTTCATTTTTATGTAATTCTATCTCCTTCTAGTCTTTTCCACGTGTTAAGTACATAAAAATTTGGTTGGCGGGTTATGTTAATCTCAGCTTTGACTTACTACCTCAATTTTTTTTAGAAATATTTATTTTGGCACTTCGGAGTCTTGGTTGGGTGTGGCGGGGAGTGTGTTTTAGATGGATGTTAAGTAGGATGAGCAGGTCATTTTCAAGTAATACTGAATACATATTACAGTTGATATAAATTATTGTTTTATTAAGGTTCAGGCAAGTATTATAATATGTACCTTTTATGTTACCTTTCTGCAGGACTTCTTTTCTAGAATTCTTTGAAATCTGTTTCTTCTTATCAAACCAATGGCTACTCCTTTGGATATGTCACTAGAGGATGTGATAAAGAAGAATAATCGTGAGAAGCTTAGAGCCCGAGGTAGGGCCCGTCGTGGGCGTGGAGCAGGTGGGTCTTTTAATGGTGGAAGAGGAGTAGGGATAGGGTCAGTTCGTAGAGGTCCTCTCAGCTTAAATGCGCGACCATCTGCTTACTCAATTAGCAAGGCAAGCTTCAAACTGTGATGTTTAATGCATATGCTTGAGAACCTGATATTCCTTGAAAATACATGCTCATGGGGATCTCAAATAACACTATAGTAACTCTAACAGAAGAGGATCTGGTAAAATTGTTGTTCTATGGAGAATCTCATTGTGGCTCACCCTCATATCCGAATCGTCTTCTGGTGTATTTG
TACCATAATCTTTGCAAATATTGAATACGTGTGGGTTATCTGGGTTATTTTCAAAGGACTTTCAAGAGTGGAGAATTATATTCTTATCGATTTATAAGCTGATATGTGTTTTCCCCGTCTGGATTCTTATCTGCATGCTTTCATCAGCCCCCACGCAGAATGAAGAATGTTCAATGGCAGCATGATTTATTTGAAGATAGTCTTAGAGCTTCAGGGATTTCTGGAATTGAAATTGGCACAAAGTTGTACGTTTCCAACTTGGATTATGGGGTGACCAAAGAAGATATAAGGGTACAACTCAAGCCATATATATTTGACTAAAGATGCTTTCTCTTCATTTTTTTATTAATATTCTCTTGGAACCTGACGGTGAACTCAGGAGTTGTTCTCTGAGATTGGAGACCTGAAAAGATTTGCACTTCATTATGACAAAAATGGCAGTTCAAGTGTAAGTTCCTACTCTGTCTTCTTGTAAAAATATGCAGTAATGTGTTGCTGTTAGGCTTTCTCTAGAATCAAGTATCAATATTCTTTTGGTAGTCAGAACTGTGTGATCATCGAATCAATTTGCTCTTTTGGCAGGGATAGTGTTGGAGAAACCTATTACAAGAAACTTGTTACTAGAATACATGTAACGAATGCCAAGAAATTTAAGTTAATTAATAAAATATGCGTGTGTGTGTATATATTGCATGTTTTTATATATCAAATTTAAAATTTCTTTTCAATGATCTATTATTTTATTGCATGAAGAAAATTGGACTTCAAAGTACTTAATTGATTATTAGTAATAGTTGGTGCTAAAGAGACAGTTTTGAATTGAAGTATGGATACAAGCAATCTTATTTTGAGTAAAATATTTTGCATTTTTGACTAATTTACTTGTTGATATGAAAAACCCTAGGGAATTAGCATTAGGAACCTGGGGTATGTGAACCCCTAATTTATGAATACTAAAATCCTCTAATAAACTCGATCTCAACCCTAAGGTTTAGTTTAGACATAGGTTGTCTTAAGCTCTTGAGACAAAGAACAAGTTCTTGACCCAAGAGTCCAACACTCCACAAGTTAGGATGATCAACCCGACTTGAATGTTTTTGGCATGAAACTCAAACATGAATTGCAAAATTCAGAAACCTCTGGCTAAACTTGAAAGGAAAAGTTCAAAGGCTAAAAATCCAATTCGTCATTATCTTATCTATCCAAAACAACCTACTGACAAGACTTTATAGAGCCTCTCAACTAAATCTAAATGCCCACATCTAGTGTTTTAAAAGGCCTAGTGAGCACACACGACTGAGCCTCAACACTTCTCCCTTAATTAAGTCTCCTTAGGGTGTGTTCCTCAAGCCTGGCTTCCTATTTTATTTTATTTATTTGTTTTTTAATTTTTCTAGTTTTTAAGGAAGAAACAATGAATATTCTTATAACCTTGAATTAAATCTCTATTTTCTTAAACTTTTTATCATCTTACTATTCATCATACTATTTTTTTTTCATATATTTAAAATTTATTTACATTTTTGTATTGTTTGCCTCATAAAAAAGCCTTTTTTTGCGCCTCATGCTTAAGCCTTAGAGGGCTATTACGCTTTAGTATGCCTCGAGCTTTAAAAAACACTGGCCATATCATCGATTGTAGAAAAGAGACAAAAAGACCTCTACTCAATGTGCTCAAAAAATGATAAAATTGAAAACAATAAAATTTCATCATAAAGTGTGCATAATAACTTGCTCCTTGGTCCATATCACTTGTGCTGCCCGGGGTCAACACAATTTTTAAACAAAGTTCGAACCTAGCTAATTTGAATGTATTCAATTCATTTCAATCGATATCTCAATAACTTATTAATCCAATCCAACTCATTAAGGTTTTTAGGGTTGAGTTTTTGCCCCTTCGTTTAGGACGTAACCAATGTGAAAACAATTCCATGAGATTTATAAGCACATAAGATGATTATTGTACTTCTGTTTTAAATGTGTTCTTAATAGTTAGCTCT
CTTATCAAGCTTTAGTTGCCTGATAAACTTTCCATTGTTTATTAAAGGGCTCGGCAGAGGTGGTCTATACTCGTAGAAGTGATGCATTTGCAGCTCTGAAGCGCTATAACAATGTGTTACTGGATGGGAAGCCAATGAAGATTGAAATGCTTGGTGATAATGGTGAAATGCCAGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACTGTTGTTCTAACGTATGTGGTATTCTGGTTACTTTTTCTCTCTTCCTTTTTGTCCCTTTTCTTATTTAATTTTAACATCCTTATAACTAATTATTTGCCCCTGGGAGGTTGAAAGTTACTGTCCAGTATTATATTTCTTGGTCATTTTTATCTCGCTACTTTTAAGTTGCTTCTTATTCCCTAAAAGGAAACTATCATTTGCCTCATTCCATCTTGTCTCATTCCCAAAGGTGACATGACCGGGATTTCAGGCCAATTAGTTGTATCAATAGTGTTTGTAAACATTCTTCAAAGGTGCTTATAGGTTTGTGTAGGAATCCAACACAGAAAATAACGAGAACACTAATAAATTGATATATTGCAATAACTATGAAGAAATTACAATACCCAATAGCCTTTCGAGAGGGCTATCTCTCTCCAAAGTCCCACAATGGACTCTCTACCAAAATGATCCACACCAAAAAAAGGACCCTAACACTTCTATTTATAACCAAGACACCTTAACCTAATTACCACACCCCTCACTGATATACTAATAATCTACCACATATACTTCACATAATACCATACTAGTACTCTCACAGTTTGATGGTCCTCCCTTTAACAATCTCTCCTTGCTTTCGTTGATAAACTTGCTTGTAGATTGACAGACAAATAACAGACATTTTATAGCCAATGAAACAGCTGATGCTGATATGTAGCAAGACTCAAGAATAAATCTCAAAGTTAGACTTGGAAAAAGCCTGTGAAATAGTCAGTTAGGACTTCTATTCGAAGCTTTTGGGAAGAAAGGTTTTGAGCCAAATGGGTTAGGTGGATGCTTAATTGTTTGATTGTTTCACCTATACCAGATTTTTTTAATCTTTAGCCATGGAAGGACCATATAAAAAGCTTCAAAGTTCTTGAATAAGGCGACTCTCTATATGTGTAGCATTAGATAATTGAGAGAAAATATGGATTTCATTCCAATGGCTGGGATGCAGGGTTAGTTTTGAAGGGCACCTCAATCTTTCTAGTTTTCTCCCTTTCATTACCTTTTCCATGGGGGATGGTAGTATAGTTTGGTTCTAGATGATTGTTGGATCCATATAGGTCATGTTGGTTGCAATTCCCTTTTTTGTACATAGTGGCCCTTTCACAAGTCTTCTGGTTGCTTATGTCATTTGTTCACAGAGAGGACCTCTTTTTTTAGAGAAAAGGCAAGAATCGTGTGGAAATGTGCTACCAGAGCATTTGTGTGGGAAATTTAGAAAGAAAGAAATTGGAGAATCTTTGAAGATAAGTCCTTTTCTTCAGGTTCTTTTTGTAACAATGTACAATTTATTTCTTTCTGGAGTTTTAGACACCAGTATTTTTTTTTTAAATTATTGCCCAGCTCAATCCTCCAAAATTGGGAAGTAGTGCTTCATGAGTTTTTTTTGGGAGAGGATTCCTTGCCCCTCTGCCCTCAGGCTGTGTCTTCTTCCCCCCACCTCAAAAAAAAAAAAATTGTAACCACCATCTCGAGAGGCTCCTTTTTAGACAAAGTCCCTAATAAGAGTTCCTCCCACCTCTTAATGACTTTTGTAGGGCATCTGAATATAGATAAGAAATAGTTGGGGATGTTTGTCAAACATTGCATTAACAAGGGTAATTCTACCCTTTTAAAACAAATGAATTTTCTCCACTTAGAAAATTTAGCACCAAGCTTCTTTTGAGAAGCGCCAAAGAAATTTGCCTTTTGGTTTCCTCCCAATGTCTTTCCAAGGTAAGAGGAGGGGTTGGGCACCTTCCGTGC
ATCTTTATGGGATTCGTTAGTGGGATTTTTCAACATCGTCACGGATTCCTTTGGTTTTCATCTCTTCGCCGTGCATGTGGTGTTTGCTGCTAGGTCGAATGTGAAGCTAAAGTCGGTTAAAAACCAAAACAAACTGGTGGTAACTGAAATTGAGGAAAACATTGGTTTCGTCGGTTCAGTTCAAAACACTTCAAAATCAACATGTTTGGTTCGGTAATTGGTTTTGGCTAAAATTAGACCGAATTGAACCTTTGCATACCCCTAATAGAGTGACAACCCCTATTACACCCATGGCAACCCCAAACGACTCCAAAGGGAGTAAGAAAATTGACACCTACACAAGGATGATCGAGATCCTTCTCTTGTTGTCGGGAAAGGGCACACCACTGTGGACGCAAAATCAAGGAAGAATGCCCTTGGATATGATCTTGAGTGTTAATTCTTGCTAGGCGAAAAATTTCAGTTTGGATGGGAGTAGGGAACAAAGGATGAGAAAGAACGAATGACAGAAGAAACCATTCGACTTCCCCTTAAATAAACCTCCGATAAGAAACAATAGAAAGAAGGCCCACCACCTCTAACGACTCACAATTGGTAAAAGGACAATGAAAGTCCAAAAAACAAGGGGAAAGGCTGGGAGGAAATCAATAAAGAGGTGAGAAAATGCAACTTCTTCTTAGACAATAGGAACAACTCACAAAGGGGTGCTCCGATTTTATTTTTTATTTTTTATTTTTGGGGTAAGAAATCGAGCTTTCATTAAGAAAAAATGAAAGGGAAGCAAGGGCATACAAAAAAATGAGCCCAACAAAAAGAGAGTCCAGAAACAAACTACAGAAACAGACTCCAATCCAACAAAATCAAACCAAGATCATATTTACAAGAAGGACTAGTAACTGACGCCCACAAAAACGCATGAAACCTCACAAGCTGCCAACCTCGTCACAAGACCTCTCAACCTCTAAAAATCCTATTATTACGTTTGAATCAAATACCCCACAACGTGGCAAAAAAGCTCACCACAAGAATTTGCCCTTATAAGGGTGAACCCAAAAGCACCTCCACCATCGAGGAAAACCATCTCTATTCCAAGCCATACAAATCTCAAACAAAGACAATTGACGATCCTAAAAAGATGGGGCGAATTGGCAACCCCACAACAGATGATAGAGGTCCTCAGTCTGGCATTTGTAGAGGATGCACCAATACAGGCACGACACCAAGGGGGAATGCCTCTGAACGTGATTCATGGTATTAACTCTCCCAAGTAAGACCAACCACACAAAAAACTTTACCTTCTTAGGGATTTTGACCTTTCATAGAGGAGAAAGCGAAAGCAAGATGGAGAGAGGAGGGGAAGGGGATGACAACAAATGAAAGGACTTACAAGAGACCCCCTTGACGCATCAGGGTACCAAAATCTAACATCCCTTCTCCCTCAAACAACAAACTTCTCACCAACCAGAGAAATAAGACTCACCACCTCATGAGCCTCTCTATTAGTAAGAGAACGTCGAAACCCCAACGACAAAGACGAGGAAGACTCCAAGGGTAATACGGAAGCCACATAATGCAACTTCTTGCTCGAGAGATGATAAAAGCGAGGAAACAACTCATGCAACGACTTATCACTGACCCAAAAATCCTCCCAAAAGTACACACTTTAACCTTCCCCAACTGAACACCTAACAAAGGATGAAAAAGAAGGGAGACCAAGGGCAGAGGTCTAATGGTACCACCGCAAGCAACGAACTGGCCTATCCCCAATAGAGCATTCAACGAAGGAATCCCATAGCCACAACAATCCAACGATATTTACTAGAGTCTTACAATGCACCACTTGGGGACCACTCAAAAGGGTGTGGTCCATACCTACTCACAATGATCCTACACCATAAAGCATTAGATTCCTTGAGAACTACCACGACCATTTTGCTAGCAAAGCCTTATTCCTTAATCTTTGGGTTCCAATGCCTAAATGTCTGACCTACGCTAGCTTT
GAAACAATCTCCGATCTTTTACTAGGTGTGACCTCCCACCTTCCTCAACTCCTTCCATTCCGATATAAAATCCTCATTAGATTTTATGTTTTTACACATAAAAACTGAGATTCTAAATAGAGAAAGGAAATAAATAGGGATCCCACTTAGCATAGATTGAATTAAGGCGAGCCTTCCTTACCTTAGAGAAAAAGGCCTTTTTTTTTTAAATTTTTGAAGGAGATGGCCCTTTTTTGATTTGCTCTGCCACAAGGTTCTAGAATGACACACTATTCGAATTATTGCCTAAGGGAAGACCCAAATATATTGAAGATCTGGCTAACCTCACAACCCACCAAAGACACCCAAGATTCCACTATAGACGTTCACAATTGAAGCCAATAATAGTACTCTTCTCCCTATTTGTCTTAAGGCTCGAAATAGCTTCAAAGAAGGATTGAAAGTCATAAAGATTATTCAAGGAACTCTCATTTCTTGAGCAGAAAAAGAAAGTCTTATAATTGAAGGAGCAAAAGATGTAGACTTTTAGAGCCCTCTCCACTAAACCCTTCTCAACACTTGTGGACAACCTGCTCAAAACATCAATGATCAGAGTGAAGAGAAAAGGTGAGAGAGGGTATCTTGTCTCAAACCTCTAGATGCAAAAATCTTGCTTCTTAGCCTCCCAGTGACGAAAATGGAGAAATTAAGGCCCCGTTTGATAACGTTCCTGTTTCTTGTTTCCTGTTTCTTGTTTCTTGTTCCCTGTTTCTTGTTTCTTGTTTCTCATTTTTTAAGAATAAGAAACAAGAATATGTTTGATAACTATTTCTGTTTCTTGTTTCCTTGAAAACAAGAAATAGAAACATAAACTTGTTTGATAACCGTTTCTTGTTTCTTGTTCTGGAAGGTTTTCTTTATCTCATTTTTTTTGTCATTTTTTATAGTTTAAAAAGAATTTAGTTATATAATATAAAATATATAACATGCATTAAAATATATAAATATACAATTAATTATAACAAATTTATATTATCAAATACCAACAAGTCTCGATGCAAAATTAATTAATAATAAAAGTAAATAAATAAATTTATTTTATTCTTCAAATGTTGTTCAAATTTGATCTGCAATTTTATCTCTTACTCGAACTATTCGTCTTAAATGATGTCGACTTAGATCAAACTCAATTATATTGATAAATGTTGTAGTTTGTAAAATATTAAAATGAGGTTAAAGGCATGTTTTAGTGAGCAAAATTTATTTATTGCAAAATTACAATAATATATAAATATAATTATATTATTACTATGTATAAAAATTAAAACTATATATATTGCAAAAAAACAGAATTGATAAAAACGAAAATCAAAAAGTTGTGCTTACCAAGGATCGAACTTGGGGCTTGAAATGGAAGGAGTATTGTTTTGACATGCATTTCACCAATAGGTTGGAGGCTAAGGATGGTAAATTTGGGAAACATTTAATATAAAAAGAAACGAAACAGAAAGAAACTACTATTTGAAGTTTCTCAAATTTAGTCCAAATTTTAGAAACGTTTCTCAAATTTTGGGAACAAGAAACGGGATGGATTATCAAACAAGTCTGTTTCTTAAAAAATGAGAAACAGAAACAAGAAACAGGAAACGAGAACGTTATCAAACGAGCCCTAAACGATCTTATATAGTTCCAAATCCAAGCTCGCCGTTTCCAGCCAAACCCTTTATTTTCCCGAGGACATTGTCTAGAAAATCCCAGTCAACTGATGATAGGCCTTCTCAAAATCGATCTCGAAAATCACATCTTCTATCTCTCTTACTATAGCCCTCAATCACCTCATTTAGCAATCAACACTTGATCCAAAATTTGCCTACTTGCAACAAAGACTCCTTGGGAATTGAAAATAGTTCTAGGGAGCACCTTCTCAAGCCTACTGCCTACCTGCTAGAGTCTTCGCAATTATCTTGTACAAACCAGTCACAAAGCTAATTGGCCCGAAATCTTTATGAAACTTGCAAAAA
ACTTTCCAAATGGCCTCCTTATACTCCTTCCAATTATCCTAATAAAAAGGCATCATGTAAAACCATCTAGCCTATGGGCTTTAGTCCTCTCAAACCTAAACTCCACCTGAATTTCCTATTGGGTGAAGGGAGCTTCCAGCTTCCAACTCATACCTGTTAGCTGAGGAAATCGGTCTCCAATCTAACCTTTCCAAAAAGGTCTAGGGACAGGAATGGGGGCATAGGGGTTTGAGAAAAAAGAAATGATCTCCTCCTCAATCTAATTCTCCTACTTCACATAGAATTGAAATGAGATTTTTTTAACAAGAAACAAACTTTTTATTAACCATTATAGGGACAATGTAAATAATAAAGGACTTACAAGGAATGAGTTCAAACTTAGTGTTTTAAAAAGCCCCTTCAGGCGCGCGCCTAGGCGCAAGGCGCATGTAAGAAGGGGCGCGCCTCTGGGGCTTTTTAAGTTTATATATATTTTTTTTCAGTTTTAACAGTATAACGTATTGTTTTACTTATTTTAAATATATATTTTAAAAAAAAACCAAAACACAAGAAACCCACTTCATTTAGGGTTTACATATACAACTTTTACGTTAAAACTTAAAAGAATTGCACGGCAGTCCCACACTTTCTGCCTTCTCCTTCAATCTATGTTCAAAAACTAATACTTGAACCTTATTTCTTTTACCTATTTTTCTTGGTTAAAACTTAATTTCTTCCCATGAAGGGGGTTTTCTTATTATGTATGCATACATATATATATTTTATATTTTTTTTATATATAGCGCGCCTTAGAAAAAAAGCCCGCGGCTTTTGGTGCGCCTTACGCCTAGACTCCAAAGGGCTATTGCGCCTTAGTGCGCCTTGAGCCTTTAAAAACACTGGTTCAAACCATGGTAACAACAACCTAGCTTTTAATATCCTTTGAGTTACCTTGGTAAATGTAGTAGGATTTGGTGGGCATCCTAAGAGAATAGTCGAGGTGCACACAAAATGACCTAGATACTCACAAAAATGAAAAGGGAAAGAAAAAACAATCCTTTTATTGATAGATGAAAAGAAGGAAAAAAGTTTAGGAATACAACCCTTTGGACACTCACGGATATAAAAAAAAAGAGACAAGCTTTTCATTGACGGATGCAAAGAAAGAAAAAAGTTAAGGAATACAAATTCCAAAGGGAGTGAAAAGAACAAGCAACTATACCACTCAAATAAAGTACAAGTCTCTAAAAATTACAATTGCCAAAATAAAACATTGAAGAACAAGCTCCCAAGGGAGAAGAAAACAAGCAAAATAATCCCAGAAGTGAAGCCTTTAGATTTGAAAAGCCAGGGATGCTAAAACACAATCCATCCAAATGCAGGACCAACATCGAAACCCACTACCCAAAACTCTGGAAGAGATACATGAAGACTCTGGTTTGACAGACTGCTCCGAATTCTTTCAAAGCTACTGAACAACACGAGGAATCACTGTTCAGATAAGCGACCTGTTGCATGCTTGAAAAATTTAGAAAGGTCCTACAAAATATGTAAAAAGTCCTTTGGATCTTTGAACATGTGCTTGGTGTCACGGTCGTACCTCTCGAACCGTGCGGCGCCATGATTCATCCCTCGAACATGGCCAGCCTTAAGGGGAACTCCTGCCCCATCGCTATGATCGCAACGCTTTGTGGCTTTGTCGGGCCTTAGGCGAGGTTCTTCAGTCCTCAACCAAGCATCAAGTAAATAAGCAGCGGAATGTAACTTGGTCGTACACGTCGCCCGTCCGTGTACTTGCACTCGATCTTCCCATAAGTTGGCTTTGACGTCATTCTCGGATCGGCCGCCGCTAATCGGACCCTATATCTAACCTCGGCCCATGGGAAGGGGCGCAATGTCGGCACACCTGTGCCGTCCGACACTGTCCCTTGAAAACCCGTTACAAAGGAAACGCCCAAGAAACTTCTTCTTGTGACGCTCGCCCCAGCCATGACACGCTAACGTGCCACATGGTGACTCAC
TTAGGCCCACAAGCCCAGGGGTGGTGGAGAGTCGACACCATGGTACCCCCTAAGAAGCACAAACACTTTAATTTGGCTAGCGTGTCCGTGTCTGACACCTGTCGGACACATAGACACTCCGACACTTGTTGGACACGTATCGGACATTTGTTAGCATTATAGATGTGTTAGAAACTAGTTGTTCAAAGTCAATATAGGTCCATTATTTGTTAGGCACATAATGAACACTAGTCAAGTATACTAAAATACATGTAATATAGGACAAAAATAATGAAATTTGAGAACGAAATATATCAAAATCATTTTTTTAAGCATATAGATACATAAACTTATTAACTTTAAATTTCTTTCTATATAAAATGATATATATTTTAAAAAATGTATACTTAATAAACGTGTCCTAGCCGTGTCCGTGTCTTAGATTTTAAAGAAATGATGTGTCGCCGTGTCCGTGTCGTATCGTATTCGTGTCTCGTATCCGTATCCGTGCTTCTTAGGTACCCCCTATAGTATATTATGTGAAATATTGGGTCTTCCTCCCTTGCACGTCATTTTGGCGAGGAGTCATACGCCCATACATCGACCCCTCCGACGACACGGTTGGCCCCAGTCTCACATGCCTTCAAGCCCACCTATCTTGGTGTGTTAGATGATGCATCGTCCTTCTCGATCGCATCATGGCACGACTTGTCACTGTGAGGTTCACAAACATATGCTCTTCTAGTATGATTGTGACACTTGGCACCAATGAAAGGGGAATTTCGTCTTGAAGAGGGATAAATGAGCCCTATTTTGCTTGGATCTTCAGGAAGGCCACTAGAAGAATTTTAAATTCTGAATCAAAGCTGAAAATTTTGAATTGAATCTGCTCACACTATCAATCGGATTGTCACTAAAATCCAAAAACTCCTCTAATGAACCTCTAAAAACAACTTCATTAGTTTCATTACTAGTGCCTTGAGAAAGGGTTGGATCTCTTTTGGAAACTCCTTTAACAGCCCAACAGATTTGAAATCCTCAAATCAACCGAACCAGGGGAAGGGCAGCCTTATTTGAATGGTTTTCTTTAGCTGTAGACAAGAAGGAAAGGATCCATTGTGATCACCTTGGAGATAATACTAATCTAAGTCTTGACGGAACATTCTTCAGGAGTATTTTAAAAAAATTATAGTGTTCATTAAAGACAAAAATAGTAACTTAACCTCTAGTTTGTTATAATATGATCTCTACCTGCTATCTCAAGACTGGAAGAGTCTGATGGGTTTATTATGTCATTTTAAAGTTTAATGCATTGAAAGATATGTAATAAACGCCCTTTTCATCTAAATGGTATTTTCAGTTTCATTATATGTAAAAATCACTAAAAAGAAGTTCCATTGTGAGTACGTTTTCTAAAAGCTCACTCAGAATCTACTGTATTAGACACCCTTGTAAAGATTTGTTTTTGACATGCTGGCCTTAGATAAATTTTATAGTAGTAGTTTGCTAATGGCTTCTTATCTGTTACAGACGTGCTCTCCTTTGAAACTCTTGTTTCTTGAAATGATTGATGAAAACAACATGAAATTTGCTACAGATATTGAGTAGTATTAATATTTGTTGGGTATATGCAGGTCTGAATCTGGTCGTACTGGTAGCTCCAATGTGGCCAACCCTTTTCCTGGGTAAGTTCTAACTGAAGATTGTTTTTGACATGAAATTTTAAGATTCAATAGTGGAGCAGCTGGGTTTTTTTATTATTATAATTATTATATTTGGGCATTGTTTTACTGCATGCTAGTTGGATTGGTTTTGAAATTCATTTGTTCTTACTATTATCTCTCATTTTCTTAGGGATGTTACAGTTGAGCGTTGAGATCTTTTCTACCTGTTTGGAAGCCAGGCTAATGAATTTTTGTCCTTTTATTTTAATGAAAATAATTTTATGTAGAAAATTATTTGGTAAGATAATATTTAATAAACAATCTCTAGAACAAAAAGGAAAAAGCGAACAATAGATG
ATAGCAGTGCAATGCTTTTGTGTGTGTGTGTGTGTTTTTTTAAGTTCTGCAAATTAACTGCTTGTATATGCCTTTATTTAAATGTATTATATTTGAAAATTAATATGGAGAAGAAAGAAATGAAAACACAAAAAACAAAATAATATTTTTCTTGTAGTTGTAAAATTTAAAATAATAAGAAACAATTCTAAAATATTAGTTCTATGAACTTTGGTGCATCAGTGGTGGTATGAGTGCTCCCAACCATTTTAAAATATTAGTTACAATTTAAAATAACAGGAAAGAGACCCCAAGTAATAGTATGCAAAACAGTTTTCAAATAAATGTTTTTGTAGGTGTGGTTATGAGTGCTTCATAGTTGTGAAACAAAGGAGAGAAAGGAAAAAAGGAAATGTGGGGTAGAGTTGTAGGCTTAAATCTCCACCCTTTCTTCCAAAAGAAAGGAGTGATAATATAAAGATGCACGTCATGTGTTGGAAATGGAGTAATAAGATGAATTAACTTTGCTTAAGTTGGTAAAGCTCGTTTTTGGTTTTATTGAAGTTTTTGGATTTTGTAGTCCAAGCCATCGTGGAGGGCTGAGGAATGGCCGTGGCCGTGGGCGAGGAGGCTGGAGCCGTGGTTTAGGAGGTCTAGGTGGGGGAGGGCGTGGCCGAGGGCGTGGGCGTGGTCGTGGTGGCCGTGGCCAGGGAAGAAAGAAACCTGTGGAGAAGTCCTCAGATGAACTTGACAAGGAGCTCGAAAACTACCATGCAGAAGCCATGCAAACCTGAGGAACCTTGGAATATATATTCAAAAAGCGAAGTATGATGTGATTAACCACTATGTTAGCTGTAAGAATATATGTCAACTGCTTGATTATTTTGTTATCATTGGTAGTTTTCCATGTTTATTTTTCCTCTTCAGTAGACTAGATTGTATTTGCCTGTTAATGGAGATGTTGCTGTTTAGACCCTCTGTTACCAAAACTTCTACCCTTAAGTTGTCATTATACGATGTTGAGAATAGGAGTATCTTGTATTTATGGGACCTATTCTTCAACACCTTATCGTCTGGCACTCGATATCTTCCATTTTCCCTCGACAATCTCAAAATCATACTCCATCAGTTTCTTTTCTCTTGAGCTGTCCTTTCTCAGTGTCGACTCCGACCATTCTCTTATCTCCTTTTGTATTTGATCAAATCTAAATTAAAG

mRNA sequence

ATAAAACGCAATTGTAAAAACAGAGGGAAAGAACAAAAGGTTCGATGAAAGAAATATCGTTGCCTTAGTCCCGAAGGAAAAGTCTCTCTCAAGCCGCCGCCGCAACCGCCGCCTCTCGCCAGAAGAATCTCTGTCTCTCTCTTAGAAGCCCTAAAAATCCCACACGCACATATCGCTTTTCATCATCATCCTCTCGGAAATTCTCTCCTTGATTTCGTTGAAATTTTTATCTTCGCTCTCTCTGTCGGAAGTCCTTCGACTGTTTCTACGGATCATCTTCAGGACTTCTTTTCTAGAATTCTTTGAAATCTGTTTCTTCTTATCAAACCAATGGCTACTCCTTTGGATATGTCACTAGAGGATGTGATAAAGAAGAATAATCGTGAGAAGCTTAGAGCCCGAGGTAGGGCCCGTCGTGGGCGTGGAGCAGGTGGGTCTTTTAATGGTGGAAGAGGAGTAGGGATAGGGTCAGTTCGTAGAGGTCCTCTCAGCTTAAATGCGCGACCATCTGCTTACTCAATTAGCAAGCCCCCACGCAGAATGAAGAATGTTCAATGGCAGCATGATTTATTTGAAGATAGTCTTAGAGCTTCAGGGATTTCTGGAATTGAAATTGGCACAAAGTTGTACGTTTCCAACTTGGATTATGGGGTGACCAAAGAAGATATAAGGGAGTTGTTCTCTGAGATTGGAGACCTGAAAAGATTTGCACTTCATTATGACAAAAATGGCAGTTCAAGTGGCTCGGCAGAGGTGGTCTATACTCGTAGAAGTGATGCATTTGCAGCTCTGAAGCGCTATAACAATGTGTTACTGGATGGGAAGCCAATGAAGATTGAAATGCTTGGTGATAATGGTGAAATGCCAGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACTGTTGTTCTAACGTCTGAATCTGGTCGTACTGGTAGCTCCAATGTGGCCAACCCTTTTCCTGGTCCAAGCCATCGTGGAGGGCTGAGGAATGGCCGTGGCCGTGGGCGAGGAGGCTGGAGCCGTGGTTTAGGAGGTCTAGGTGGGGGAGGGCGTGGCCGAGGGCGTGGGCGTGGTCGTGGTGGCCGTGGCCAGGGAAGAAAGAAACCTGTGGAGAAGTCCTCAGATGAACTTGACAAGGAGCTCGAAAACTACCATGCAGAAGCCATGCAAACCTGAGGAACCTTGGAATATATATTCAAAAAGCGAAGTATGATGTGATTAACCACTATGTTAGCTGTAAGAATATATGTCAACTGCTTGATTATTTTGTTATCATTGGTAGTTTTCCATGTTTATTTTTCCTCTTCAGTAGACTAGATTGTATTTGCCTGTTAATGGAGATGTTGCTGTTTAGACCCTCTGTTACCAAAACTTCTACCCTTAAGTTGTCATTATACGATGTTGAGAATAGGAGTATCTTGTATTTATGGGACCTATTCTTCAACACCTTATCGTCTGGCACTCGATATCTTCCATTTTCCCTCGACAATCTCAAAATCATACTCCATCAGTTTCTTTTCTCTTGAGCTGTCCTTTCTCAGTGTCGACTCCGACCATTCTCTTATCTCCTTTTGTATTTGATCAAATCTAAATTAAAG

Coding sequence (CDS)

ATGGCTACTCCTTTGGATATGTCACTAGAGGATGTGATAAAGAAGAATAATCGTGAGAAGCTTAGAGCCCGAGGTAGGGCCCGTCGTGGGCGTGGAGCAGGTGGGTCTTTTAATGGTGGAAGAGGAGTAGGGATAGGGTCAGTTCGTAGAGGTCCTCTCAGCTTAAATGCGCGACCATCTGCTTACTCAATTAGCAAGCCCCCACGCAGAATGAAGAATGTTCAATGGCAGCATGATTTATTTGAAGATAGTCTTAGAGCTTCAGGGATTTCTGGAATTGAAATTGGCACAAAGTTGTACGTTTCCAACTTGGATTATGGGGTGACCAAAGAAGATATAAGGGAGTTGTTCTCTGAGATTGGAGACCTGAAAAGATTTGCACTTCATTATGACAAAAATGGCAGTTCAAGTGGCTCGGCAGAGGTGGTCTATACTCGTAGAAGTGATGCATTTGCAGCTCTGAAGCGCTATAACAATGTGTTACTGGATGGGAAGCCAATGAAGATTGAAATGCTTGGTGATAATGGTGAAATGCCAGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACTGTTGTTCTAACGTCTGAATCTGGTCGTACTGGTAGCTCCAATGTGGCCAACCCTTTTCCTGGTCCAAGCCATCGTGGAGGGCTGAGGAATGGCCGTGGCCGTGGGCGAGGAGGCTGGAGCCGTGGTTTAGGAGGTCTAGGTGGGGGAGGGCGTGGCCGAGGGCGTGGGCGTGGTCGTGGTGGCCGTGGCCAGGGAAGAAAGAAACCTGTGGAGAAGTCCTCAGATGAACTTGACAAGGAGCTCGAAAACTACCATGCAGAAGCCATGCAAACCTGA

Protein sequence

MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPSAYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGWSRGLGGLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT
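
The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates that relationship on the first and last codons of the CDS; it assumes the standard nuclear genetic code and uses a hypothetical helper named `translate`. It is not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)

# First twelve codons of the CDS listed above.
print(translate("ATGGCTACTCCTTTGGATATGTCACTAGAGGATGTG"))
# Last seven codons of the CDS, including the TGA stop.
print(translate("GCAGAAGCCATGCAAACCTGA"))
```

Pasting the full CDS string into `translate` should reproduce the protein sequence if the standard code applies.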
Homology
BLAST of Tan0004012 vs. ExPASy Swiss-Prot
Match: Q6NQ72 (THO complex subunit 4D OS=Arabidopsis thaliana OX=3702 GN=ALY4 PE=1 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 9.8e-70
Identity = 165/297 (55.56%), Positives = 214/297 (72.05%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRAR-RGRGAGGSFNGGRGVGIGSVRRGPLSLNARP 60
           M+  L+M+L++++K+    +   RG +R RGRG GG   GGRG   G  RRGPL++NARP
Sbjct: 1   MSGALNMTLDEIVKRGKTARSGGRGISRGRGRGRGG---GGRGA--GPARRGPLAVNARP 60

Query: 61  SAYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSE 120
           S+++I+KP RR++++ WQ  LFED LRA+G SG+E+GT+L+V+NLD GVT EDIRELFSE
Sbjct: 61  SSFTINKPVRRVRSLPWQSGLFEDGLRAAGASGVEVGTRLHVTNLDQGVTNEDIRELFSE 120

Query: 121 IGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN--GE 180
           IG+++R+A+HYDKNG  SG+AEVVY RRSDAF ALK+YNNVLLDG+PM++E+LG N   E
Sbjct: 121 IGEVERYAIHYDKNGRPSGTAEVVYPRRSDAFQALKKYNNVLLDGRPMRLEILGGNNSSE 180

Query: 181 MPVSAR--INVTGVNGRSRRTVVLTSESGR----TGSSNVANPFPGPSHRGGLRNGRGRG 240
            P+S R  +NVTG+NGR +RTVV+    G      G      P P  S R  + N +G  
Sbjct: 181 APLSGRVNVNVTGLNGRLKRTVVIQQGGGGRGRVRGGRGGRGPAPTVSRRLPIHNQQG-- 240

Query: 241 RGGWSRGLGGLGGGGRGR-GRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
            GG   G GG    GRG  GRGRG GGRG G KKPVEKS+ +LDK+LE+YHA+AM T
Sbjct: 241 -GGMRGGRGGFRARGRGNGGRGRG-GGRGNG-KKPVEKSAADLDKDLESYHADAMNT 287
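
The percentages in each HSP summary line are the matched-column counts divided by the alignment length (for this hit, 165 and 214 out of 297 columns). A small sketch of recomputing them from the summary line follows; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption based only on the line layout shown on this page.

```python
import re

def parse_hsp_stats(line):
    """Extract identity/positive counts from a BLAST HSP summary line.

    Returns a dict mapping 'identity' and 'positives' to a tuple of
    (matches, alignment_length, percent rounded to 2 decimals).
    The pattern also tolerates a misspelled 'Positives' label.
    """
    stats = {}
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    for label, matches, length in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        matches, length = int(matches), int(length)
        stats[key] = (matches, length, round(100 * matches / length, 2))
    return stats

summary = "Identity = 165/297 (55.56%), Positives = 214/297 (72.05%), Query Frame = 0"
print(parse_hsp_stats(summary))
```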

BLAST of Tan0004012 vs. ExPASy Swiss-Prot
Match: Q94EH8 (THO complex subunit 4C OS=Arabidopsis thaliana OX=3702 GN=ALY3 PE=1 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.1e-65
Identity = 163/311 (52.41%), Positives = 216/311 (69.45%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRA-----RGRARR-GRGAGGSFNG--GRGVGIGSVRRGP 60
           M+  L+M+L++++KK+  E+  A     +G +R+ GRG GG  NG  G G G G VRRGP
Sbjct: 1   MSDALNMTLDEIVKKSKSERSAAARSGGKGVSRKSGRGRGGP-NGVVGGGRGGGPVRRGP 60

Query: 61  LSLNARP-SAYSISKPPRRMKNVQW--QHDLFEDSLRASGISGIEIGTKLYVSNLDYGVT 120
           L++N RP S++SI+K  RR +++ W  Q+DL+E++LRA G+SG+E+GT +Y++NLD GVT
Sbjct: 61  LAVNTRPSSSFSINKLARRKRSLPWQNQNDLYEETLRAVGVSGVEVGTTVYITNLDQGVT 120

Query: 121 KEDIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKI 180
            EDIREL++EIG+LKR+A+HYDKNG  SGSAEVVY RRSDA  A+++YNNVLLDG+PMK+
Sbjct: 121 NEDIRELYAEIGELKRYAIHYDKNGRPSGSAEVVYMRRSDAIQAMRKYNNVLLDGRPMKL 180

Query: 181 EMLGDNGE-MPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGR 240
           E+LG N E  PV+AR+NVTG+NGR +R+V                 F G   RGG R GR
Sbjct: 181 EILGGNTESAPVAARVNVTGLNGRMKRSV-----------------FIGQGVRGG-RVGR 240

Query: 241 GRG--------------RGGWSRGLGGLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELD 286
           GRG              +GG + G GG  G GRG G GRG    G+G KKPVEKS+ +LD
Sbjct: 241 GRGSGPSGRRLPLQQNQQGGVTAGRGGFRGRGRGNGGGRGNKSGGRGGKKPVEKSAADLD 292

BLAST of Tan0004012 vs. ExPASy Swiss-Prot
Match: Q8L719 (THO complex subunit 4B OS=Arabidopsis thaliana OX=3702 GN=ALY2 PE=1 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 4.6e-43
Identity = 137/304 (45.07%), Positives = 179/304 (58.88%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           M+  LDMSL+D+I K+NR+   +RGR   G G      GG G   G  RR    + AR +
Sbjct: 1   MSGGLDMSLDDII-KSNRKPTGSRGRGGIGGGNNTGGRGGSGSNSGPSRRFANRVGARTA 60

Query: 61  AYSISKPPRRMKNVQWQHDLF--EDSLRAS----------GISGIEIGTKLYVSNLDYGV 120
            YS     ++  +  WQ+D+F  + S+ A+          G S IE GTKLY+SNLDYGV
Sbjct: 61  PYSRPIQQQQAHDAMWQNDVFATDASVAAAFGHHQTAVVGGGSSIETGTKLYISNLDYGV 120

Query: 121 TKEDIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMK 180
           + EDI+ELFSE+GDLKR+ +HYD++G S G+AEVV++RR DA AA+KRYNNV LDGK MK
Sbjct: 121 SNEDIKELFSEVGDLKRYGIHYDRSGRSKGTAEVVFSRRGDALAAVKRYNNVQLDGKLMK 180

Query: 181 IEMLGDNGEMPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGR 240
           IE++G N   P    +    +             +G  G+ N    F G  +     N R
Sbjct: 181 IEIVGTNLSAPALPILATAQIP---------FPTNGILGNFN--ENFNGNFNGNFNGNFR 240

Query: 241 GRGRGGW---SRGLGGLGGGGRGRGRG-RGRGGRGQ-GRKKPVEKSSDELDKELENYHAE 288
           GRGRGG+    RG GG GGG    GRG RGRGGRG  GR +    S+++LD EL+ YH E
Sbjct: 241 GRGRGGFMGRPRG-GGFGGGNFRGGRGARGRGGRGSGGRGRDENVSAEDLDAELDKYHKE 291

BLAST of Tan0004012 vs. ExPASy Swiss-Prot
Match: Q8L773 (THO complex subunit 4A OS=Arabidopsis thaliana OX=3702 GN=ALY1 PE=1 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 1.8e-42
Identity = 129/294 (43.88%), Positives = 167/294 (56.80%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNG-GRGVGIGSVRR-GPLSLNAR 60
           M+T LDMSL+D+I KN           R+ RG  G   G G G G G  RR  P   + R
Sbjct: 1   MSTGLDMSLDDMIAKN-----------RKSRGGAGPARGTGSGSGPGPTRRNNPNRKSTR 60

Query: 61  PSAYSISKPPRRMKNVQWQHDLF----EDSLRASGISGIEIGTKLYVSNLDYGVTKEDIR 120
            + Y  +K P       W HD+F    ED       +GIE GTKLY+SNLDYGV  EDI+
Sbjct: 61  SAPYQSAKAPES----TWGHDMFSDRSEDHRSGRSSAGIETGTKLYISNLDYGVMNEDIK 120

Query: 121 ELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGD 180
           ELF+E+G+LKR+ +H+D++G S G+AEVVY+RR DA AA+K+YN+V LDGKPMKIE++G 
Sbjct: 121 ELFAEVGELKRYTVHFDRSGRSKGTAEVVYSRRGDALAAVKKYNDVQLDGKPMKIEIVGT 180

Query: 181 NGEMPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGG 240
           N                   +T    S     G+SN A P+ G   RGG +  RG GRGG
Sbjct: 181 N------------------LQTAAAPSGRPANGNSNGA-PWRGGQGRGGQQ--RGGGRGG 240

Query: 241 WSRGLGGLGGGGRGRGRGRGRGGRGQGRKKPVEK-SSDELDKELENYHAEAMQT 288
                GG GGGGRGR  G+G          P EK S+++LD +L+ YH+  M+T
Sbjct: 241 -----GGRGGGGRGRRPGKG----------PAEKISAEDLDADLDKYHSGDMET 243

BLAST of Tan0004012 vs. ExPASy Swiss-Prot
Match: B5FXN8 (THO complex subunit 4 OS=Taeniopygia guttata OX=59729 GN=ALYREF PE=2 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 3.7e-32
Identity = 118/294 (40.14%), Positives = 159/294 (54.08%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGG----RGVGIGSVRRGPL--- 60
           MA  +DMSL+D+IK N  ++  +RG  R GRG GG+  GG     GVG G    GP+   
Sbjct: 1   MADKMDMSLDDIIKLNRSQRGASRG-GRGGRGRGGTARGGGPGRGGVGGGRAGGGPVRNR 60

Query: 61  SLNARPSAYSISKPPRRMKNV--QWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKE 120
            + AR    +   P  R K +  +WQHDLF+    A   +G+E G KL VSNLD+GV+  
Sbjct: 61  PVMARGGGRNRPAPYSRPKQLPEKWQHDLFDSGFGAG--AGVETGGKLLVSNLDFGVSDA 120

Query: 121 DIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEM 180
           DI+ELF+E G LK+ A+HYD++G S G+A+V + R++DA  A+K+YN V LDG+PM I++
Sbjct: 121 DIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADALKAMKQYNGVPLDGRPMNIQL 180

Query: 181 LGDNGEMPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPS-HRGGLRNGRGR 240
                         VT      RR                    P  S +RGG+   RG 
Sbjct: 181 --------------VTSQIDTQRR--------------------PAQSVNRGGMTRNRG- 240

Query: 241 GRGGWSRGLGGLGGGGRGRG-RGRGRG-GRGQGRKKPVEKSSDELDKELENYHA 283
                   LGG GGGG  RG RG  RG GRG GR    + S++ELD +L+ Y+A
Sbjct: 241 -------VLGGFGGGGNRRGTRGGNRGRGRGAGRTSKQQLSAEELDAQLDAYNA 249

BLAST of Tan0004012 vs. NCBI nr
Match: XP_038896761.1 (THO complex subunit 4D-like [Benincasa hispida])

HSP 1 Score: 492.7 bits (1267), Expect = 2.2e-135
Identity = 265/287 (92.33%), Positives = 273/287 (95.12%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKK+NREKLRARGRARRGRGAGGSFNGGRGV +GSVRRGPL +NAR S
Sbjct: 1   MATPLDMSLEDVIKKSNREKLRARGRARRGRGAGGSFNGGRGVVVGSVRRGPLGINARAS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           AYSI KPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGV+KEDIRELFSEI
Sbjct: 61  AYSIRKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVSKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDG+PMKIEMLGDN EMPV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGRPMKIEMLGDNAEMPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGWSRGLG 240
           SARINVTGVNGRSRRTVVLT ESGRT SSNV NPFPGPSHRGGLRNGRGRGRGGW+RG  
Sbjct: 181 SARINVTGVNGRSRRTVVLTPESGRTASSNVVNPFPGPSHRGGLRNGRGRGRGGWNRG-Q 240

Query: 241 GLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
           G+GGGG GRGRGRGR GRGQGRKKPVEKSSDELDKELENYHAEAMQT
Sbjct: 241 GVGGGGGGRGRGRGR-GRGQGRKKPVEKSSDELDKELENYHAEAMQT 285

BLAST of Tan0004012 vs. NCBI nr
Match: XP_023527122.1 (THO complex subunit 4D [Cucurbita pepo subsp. pepo])

HSP 1 Score: 492.3 bits (1266), Expect = 2.9e-135
Identity = 269/295 (91.19%), Positives = 275/295 (93.22%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR V IGSVRRGPL +NARPS
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN + PV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNADTPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRN--GRGRGRGGWSRG 240
           SARINVTGVNGRSRRTVVLTSESGRTGSSN  NPFPGPSHRGGLR+  GRGRGRGGWSRG
Sbjct: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNAVNPFPGPSHRGGLRSGRGRGRGRGGWSRG 240

Query: 241 LG-----GLGGGGRGRGRGRGRG-GRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
           LG     GLGGGGRGRGRG G G GRGQGRKKPVEKSS ELDKELENYHAEAMQT
Sbjct: 241 LGGGGGRGLGGGGRGRGRGSGSGRGRGQGRKKPVEKSSAELDKELENYHAEAMQT 295

BLAST of Tan0004012 vs. NCBI nr
Match: XP_022982649.1 (THO complex subunit 4D-like [Cucurbita maxima] >XP_022982650.1 THO complex subunit 4D-like [Cucurbita maxima] >XP_022982651.1 THO complex subunit 4D-like [Cucurbita maxima])

HSP 1 Score: 491.9 bits (1265), Expect = 3.8e-135
Identity = 269/305 (88.20%), Positives = 275/305 (90.16%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR V IGSVRRGPL +NARPS
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN + PV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNADTPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGWSRGLG 240
           SARINVTGVNGRSRRTVVLTSESGRTGSSN  NPFPGPSHRGGLR+GRGRGRGGWSRGLG
Sbjct: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNAVNPFPGPSHRGGLRSGRGRGRGGWSRGLG 240

Query: 241 GLGG-----------GGRGRGRGRGRG-------GRGQGRKKPVEKSSDELDKELENYHA 288
           G GG           GGRGRGRGRG G       GRGQGRKKPVEKSS ELDKELENYHA
Sbjct: 241 GGGGRGLGGGGGRGLGGRGRGRGRGSGSGSGSGRGRGQGRKKPVEKSSAELDKELENYHA 300

BLAST of Tan0004012 vs. NCBI nr
Match: XP_022935328.1 (THO complex subunit 4D-like [Cucurbita moschata] >XP_022935329.1 THO complex subunit 4D-like [Cucurbita moschata] >XP_022935330.1 THO complex subunit 4D-like [Cucurbita moschata])

HSP 1 Score: 490.3 bits (1261), Expect = 1.1e-134
Identity = 268/295 (90.85%), Positives = 275/295 (93.22%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR V IGSVRRGPL +NARPS
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN + PV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNADTPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRN--GRGRGRGGWSRG 240
           SARINVTGVNGRSRRTVVLTSESGRTGSS+  NPFPGPSHRGGLR+  GRGRGRGGWSRG
Sbjct: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSHAVNPFPGPSHRGGLRSGRGRGRGRGGWSRG 240

Query: 241 LG-----GLGGGGRGRGRGRGRG-GRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
           LG     GLGGGGRGRGRG G G GRGQGRKKPVEKSS ELDKELENYHAEAMQT
Sbjct: 241 LGGGGGRGLGGGGRGRGRGSGSGRGRGQGRKKPVEKSSAELDKELENYHAEAMQT 295

BLAST of Tan0004012 vs. NCBI nr
Match: KAG6581189.1 (THO complex subunit 4C, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 485.0 bits (1247), Expect = 4.6e-133
Identity = 268/311 (86.17%), Positives = 275/311 (88.42%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGG+ V IGSVRRGPL +NARPS
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGKAVVIGSVRRGPLGINARPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN + PV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNADTPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRN--GRGRGRGGWSRG 240
           SARINVTGVNGRSRRTVVLTSESGRTGSSN  NPFPGPSHRGGLR+  GRGRGRGGWSRG
Sbjct: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNAVNPFPGPSHRGGLRSGRGRGRGRGGWSRG 240

Query: 241 LG-------------GLGGGGRGRGRGRGRG---------GRGQGRKKPVEKSSDELDKE 288
           LG             GLGGGGRGRGRG G G         GRGQGRKKPVEKSS ELDKE
Sbjct: 241 LGGGGGRGLGGGGGRGLGGGGRGRGRGSGSGSGSGSGSGRGRGQGRKKPVEKSSAELDKE 300

BLAST of Tan0004012 vs. ExPASy TrEMBL
Match: A0A6J1J564 (THO complex subunit 4D-like OS=Cucurbita maxima OX=3661 GN=LOC111481464 PE=4 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 1.8e-135
Identity = 269/305 (88.20%), Positives = 275/305 (90.16%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR V IGSVRRGPL +NARPS
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN + PV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNADTPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGWSRGLG 240
           SARINVTGVNGRSRRTVVLTSESGRTGSSN  NPFPGPSHRGGLR+GRGRGRGGWSRGLG
Sbjct: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNAVNPFPGPSHRGGLRSGRGRGRGGWSRGLG 240

Query: 241 GLGG-----------GGRGRGRGRGRG-------GRGQGRKKPVEKSSDELDKELENYHA 288
           G GG           GGRGRGRGRG G       GRGQGRKKPVEKSS ELDKELENYHA
Sbjct: 241 GGGGRGLGGGGGRGLGGRGRGRGRGSGSGSGSGRGRGQGRKKPVEKSSAELDKELENYHA 300

BLAST of Tan0004012 vs. ExPASy TrEMBL
Match: A0A6J1FAB9 (THO complex subunit 4D-like OS=Cucurbita moschata OX=3662 GN=LOC111442248 PE=4 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 5.3e-135
Identity = 268/295 (90.85%), Positives = 275/295 (93.22%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR V IGSVRRGPL +NARPS
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN + PV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNADTPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRN--GRGRGRGGWSRG 240
           SARINVTGVNGRSRRTVVLTSESGRTGSS+  NPFPGPSHRGGLR+  GRGRGRGGWSRG
Sbjct: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSHAVNPFPGPSHRGGLRSGRGRGRGRGGWSRG 240

Query: 241 LG-----GLGGGGRGRGRGRGRG-GRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
           LG     GLGGGGRGRGRG G G GRGQGRKKPVEKSS ELDKELENYHAEAMQT
Sbjct: 241 LGGGGGRGLGGGGRGRGRGSGSGRGRGQGRKKPVEKSSAELDKELENYHAEAMQT 295

BLAST of Tan0004012 vs. ExPASy TrEMBL
Match: A0A1S3C452 (THO complex subunit 4D OS=Cucumis melo OX=3656 GN=LOC103496802 PE=4 SV=1)

HSP 1 Score: 482.3 bits (1240), Expect = 1.4e-132
Identity = 259/287 (90.24%), Positives = 266/287 (92.68%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           M TPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGV IGSVRRGPL +NAR S
Sbjct: 1   MTTPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVIGSVRRGPLGINARAS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           AYSI KPP RMKNVQWQHDLFEDSLRASGISGI+IGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AYSIRKPPHRMKNVQWQHDLFEDSLRASGISGIQIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN EMPV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAEMPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGWSRGLG 240
           SARINVTG NGR+RRTVVLT ESGR  + NV NPFPGPSHRGGLRN RGRGRG W+RG+ 
Sbjct: 181 SARINVTGTNGRNRRTVVLTPESGRNATFNVVNPFPGPSHRGGLRNARGRGRGAWTRGV- 240

Query: 241 GLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
           GLGG G GRGRGRGR GRGQGRKKPVEKSSDELDKELENYHAEAMQT
Sbjct: 241 GLGGSGGGRGRGRGR-GRGQGRKKPVEKSSDELDKELENYHAEAMQT 285

BLAST of Tan0004012 vs. ExPASy TrEMBL
Match: A0A6J1CP51 (THO complex subunit 4D-like OS=Momordica charantia OX=3673 GN=LOC111013262 PE=4 SV=1)

HSP 1 Score: 467.2 bits (1201), Expect = 4.8e-128
Identity = 257/287 (89.55%), Positives = 267/287 (93.03%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           MATPLDMSLED+IKKNNREKLRARGRARRGRGAGGSFNGGR V IGS+RRGPLS+N RPS
Sbjct: 1   MATPLDMSLEDMIKKNNREKLRARGRARRGRGAGGSFNGGR-VVIGSIRRGPLSINTRPS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           A+SISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI
Sbjct: 61  AFSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GDLKRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIE+LGDN EMPV
Sbjct: 121 GDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEILGDNAEMPV 180

Query: 181 SARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGWSRGLG 240
           SARINVTG+NGRSRRTVVLTSESGRT SS V N FPGPS+RG LR GRGRGRGGWSRG G
Sbjct: 181 SARINVTGLNGRSRRTVVLTSESGRTDSSTVVNHFPGPSNRGALR-GRGRGRGGWSRGQG 240

Query: 241 GLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
            +GG   GRGRGRGR GRG GRKK VEKSSDELDK+LENYHAEAMQT
Sbjct: 241 QVGG---GRGRGRGR-GRGLGRKKTVEKSSDELDKDLENYHAEAMQT 281

BLAST of Tan0004012 vs. ExPASy TrEMBL
Match: A0A0A0L717 (RRM domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G302100 PE=4 SV=1)

HSP 1 Score: 362.5 bits (929), Expect = 1.7e-96
Identity = 184/200 (92.00%), Positives = 190/200 (95.00%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           M TPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGV IGSVRRGPL +NAR S
Sbjct: 1   MTTPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVIGSVRRGPLGINARAS 60

Query: 61  AYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEI 120
           AYSI KPP RMKNVQWQHDLFEDSLRASGISGI+IGTKLYVSNLDYGVTKEDI+ELFSEI
Sbjct: 61  AYSIRKPPHRMKNVQWQHDLFEDSLRASGISGIQIGTKLYVSNLDYGVTKEDIKELFSEI 120

Query: 121 GDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNGEMPV 180
           GD+KRFA+HYDKNG  SGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN EMPV
Sbjct: 121 GDVKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAEMPV 180

Query: 181 SARINVTGVNGRSRRTVVLT 201
           SARINVTG NGR+RRTVVLT
Sbjct: 181 SARINVTGTNGRNRRTVVLT 200

BLAST of Tan0004012 vs. TAIR 10
Match: AT5G37720.2 (ALWAYS EARLY 4)

HSP 1 Score: 268.9 bits (686), Expect = 4.8e-72
Identity = 166/293 (56.66%), Positives = 215/293 (73.38%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRAR-RGRGAGGSFNGGRGVGIGSVRRGPLSLNARP 60
           M+  L+M+L++++K+    +   RG +R RGRG GG   GGRG   G  RRGPL++NARP
Sbjct: 1   MSGALNMTLDEIVKRGKTARSGGRGISRGRGRGRGG---GGRGA--GPARRGPLAVNARP 60

Query: 61  SAYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSE 120
           S+++I+KP RR++++ WQ  LFED LRA+G SG+E+GT+L+V+NLD GVT EDIRELFSE
Sbjct: 61  SSFTINKPVRRVRSLPWQSGLFEDGLRAAGASGVEVGTRLHVTNLDQGVTNEDIRELFSE 120

Query: 121 IGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN--GE 180
           IG+++R+A+HYDKNG  SG+AEVVY RRSDAF ALK+YNNVLLDG+PM++E+LG N   E
Sbjct: 121 IGEVERYAIHYDKNGRPSGTAEVVYPRRSDAFQALKKYNNVLLDGRPMRLEILGGNNSSE 180

Query: 181 MPVSAR--INVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGRGRGRGGW 240
            P+S R  +NVTG+NGR +RTVV+    GR G      P P  S R  + N +G   GG 
Sbjct: 181 APLSGRVNVNVTGLNGRLKRTVVIQVRGGRGG----RGPAPTVSRRLPIHNQQG---GGM 240

Query: 241 SRGLGGLGGGGRGR-GRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
             G GG    GRG  GRGRG GGRG G KKPVEKS+ +LDK+LE+YHA+AM T
Sbjct: 241 RGGRGGFRARGRGNGGRGRG-GGRGNG-KKPVEKSAADLDKDLESYHADAMNT 279

BLAST of Tan0004012 vs. TAIR 10
Match: AT5G37720.1 (ALWAYS EARLY 4)

HSP 1 Score: 265.0 bits (676), Expect = 7.0e-71
Identity = 165/297 (55.56%), Positives = 214/297 (72.05%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRAR-RGRGAGGSFNGGRGVGIGSVRRGPLSLNARP 60
           M+  L+M+L++++K+    +   RG +R RGRG GG   GGRG   G  RRGPL++NARP
Sbjct: 1   MSGALNMTLDEIVKRGKTARSGGRGISRGRGRGRGG---GGRGA--GPARRGPLAVNARP 60

Query: 61  SAYSISKPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSE 120
           S+++I+KP RR++++ WQ  LFED LRA+G SG+E+GT+L+V+NLD GVT EDIRELFSE
Sbjct: 61  SSFTINKPVRRVRSLPWQSGLFEDGLRAAGASGVEVGTRLHVTNLDQGVTNEDIRELFSE 120

Query: 121 IGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDN--GE 180
           IG+++R+A+HYDKNG  SG+AEVVY RRSDAF ALK+YNNVLLDG+PM++E+LG N   E
Sbjct: 121 IGEVERYAIHYDKNGRPSGTAEVVYPRRSDAFQALKKYNNVLLDGRPMRLEILGGNNSSE 180

Query: 181 MPVSAR--INVTGVNGRSRRTVVLTSESGR----TGSSNVANPFPGPSHRGGLRNGRGRG 240
            P+S R  +NVTG+NGR +RTVV+    G      G      P P  S R  + N +G  
Sbjct: 181 APLSGRVNVNVTGLNGRLKRTVVIQQGGGGRGRVRGGRGGRGPAPTVSRRLPIHNQQG-- 240

Query: 241 RGGWSRGLGGLGGGGRGR-GRGRGRGGRGQGRKKPVEKSSDELDKELENYHAEAMQT 288
            GG   G GG    GRG  GRGRG GGRG G KKPVEKS+ +LDK+LE+YHA+AM T
Sbjct: 241 -GGMRGGRGGFRARGRGNGGRGRG-GGRGNG-KKPVEKSAADLDKDLESYHADAMNT 287

BLAST of Tan0004012 vs. TAIR 10
Match: AT1G66260.1 (RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 251.5 bits (641), Expect = 8.0e-67
Identity = 163/311 (52.41%), Positives = 216/311 (69.45%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRA-----RGRARR-GRGAGGSFNG--GRGVGIGSVRRGP 60
           M+  L+M+L++++KK+  E+  A     +G +R+ GRG GG  NG  G G G G VRRGP
Sbjct: 1   MSDALNMTLDEIVKKSKSERSAAARSGGKGVSRKSGRGRGGP-NGVVGGGRGGGPVRRGP 60

Query: 61  LSLNARP-SAYSISKPPRRMKNVQW--QHDLFEDSLRASGISGIEIGTKLYVSNLDYGVT 120
           L++N RP S++SI+K  RR +++ W  Q+DL+E++LRA G+SG+E+GT +Y++NLD GVT
Sbjct: 61  LAVNTRPSSSFSINKLARRKRSLPWQNQNDLYEETLRAVGVSGVEVGTTVYITNLDQGVT 120

Query: 121 KEDIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKI 180
            EDIREL++EIG+LKR+A+HYDKNG  SGSAEVVY RRSDA  A+++YNNVLLDG+PMK+
Sbjct: 121 NEDIRELYAEIGELKRYAIHYDKNGRPSGSAEVVYMRRSDAIQAMRKYNNVLLDGRPMKL 180

Query: 181 EMLGDNGE-MPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGR 240
           E+LG N E  PV+AR+NVTG+NGR +R+V                 F G   RGG R GR
Sbjct: 181 EILGGNTESAPVAARVNVTGLNGRMKRSV-----------------FIGQGVRGG-RVGR 240

Query: 241 GRG--------------RGGWSRGLGGLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELD 286
           GRG              +GG + G GG  G GRG G GRG    G+G KKPVEKS+ +LD
Sbjct: 241 GRGSGPSGRRLPLQQNQQGGVTAGRGGFRGRGRGNGGGRGNKSGGRGGKKPVEKSAADLD 292

BLAST of Tan0004012 vs. TAIR 10
Match: AT1G66260.2 (RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 251.5 bits (641), Expect = 8.0e-67
Identity = 163/311 (52.41%), Positives = 216/311 (69.45%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRA-----RGRARR-GRGAGGSFNG--GRGVGIGSVRRGP 60
           M+  L+M+L++++KK+  E+  A     +G +R+ GRG GG  NG  G G G G VRRGP
Sbjct: 1   MSDALNMTLDEIVKKSKSERSAAARSGGKGVSRKSGRGRGGP-NGVVGGGRGGGPVRRGP 60

Query: 61  LSLNARP-SAYSISKPPRRMKNVQW--QHDLFEDSLRASGISGIEIGTKLYVSNLDYGVT 120
           L++N RP S++SI+K  RR +++ W  Q+DL+E++LRA G+SG+E+GT +Y++NLD GVT
Sbjct: 61  LAVNTRPSSSFSINKLARRKRSLPWQNQNDLYEETLRAVGVSGVEVGTTVYITNLDQGVT 120

Query: 121 KEDIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKI 180
            EDIREL++EIG+LKR+A+HYDKNG  SGSAEVVY RRSDA  A+++YNNVLLDG+PMK+
Sbjct: 121 NEDIRELYAEIGELKRYAIHYDKNGRPSGSAEVVYMRRSDAIQAMRKYNNVLLDGRPMKL 180

Query: 181 EMLGDNGE-MPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGR 240
           E+LG N E  PV+AR+NVTG+NGR +R+V                 F G   RGG R GR
Sbjct: 181 EILGGNTESAPVAARVNVTGLNGRMKRSV-----------------FIGQGVRGG-RVGR 240

Query: 241 GRG--------------RGGWSRGLGGLGGGGRGRGRGRGRGGRGQGRKKPVEKSSDELD 286
           GRG              +GG + G GG  G GRG G GRG    G+G KKPVEKS+ +LD
Sbjct: 241 GRGSGPSGRRLPLQQNQQGGVTAGRGGFRGRGRGNGGGRGNKSGGRGGKKPVEKSAADLD 292

BLAST of Tan0004012 vs. TAIR 10
Match: AT5G02530.1 (RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 176.4 bits (446), Expect = 3.3e-44
Identity = 137/304 (45.07%), Positives = 179/304 (58.88%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVGIGSVRRGPLSLNARPS 60
           M+  LDMSL+D+I K+NR+   +RGR   G G      GG G   G  RR    + AR +
Sbjct: 1   MSGGLDMSLDDII-KSNRKPTGSRGRGGIGGGNNTGGRGGSGSNSGPSRRFANRVGARTA 60

Query: 61  AYSISKPPRRMKNVQWQHDLF--EDSLRAS----------GISGIEIGTKLYVSNLDYGV 120
            YS     ++  +  WQ+D+F  + S+ A+          G S IE GTKLY+SNLDYGV
Sbjct: 61  PYSRPIQQQQAHDAMWQNDVFATDASVAAAFGHHQTAVVGGGSSIETGTKLYISNLDYGV 120

Query: 121 TKEDIRELFSEIGDLKRFALHYDKNGSSSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMK 180
           + EDI+ELFSE+GDLKR+ +HYD++G S G+AEVV++RR DA AA+KRYNNV LDGK MK
Sbjct: 121 SNEDIKELFSEVGDLKRYGIHYDRSGRSKGTAEVVFSRRGDALAAVKRYNNVQLDGKLMK 180

Query: 181 IEMLGDNGEMPVSARINVTGVNGRSRRTVVLTSESGRTGSSNVANPFPGPSHRGGLRNGR 240
           IE++G N   P    +    +             +G  G+ N    F G  +     N R
Sbjct: 181 IEIVGTNLSAPALPILATAQIP---------FPTNGILGNFN--ENFNGNFNGNFNGNFR 240

Query: 241 GRGRGGW---SRGLGGLGGGGRGRGRG-RGRGGRGQ-GRKKPVEKSSDELDKELENYHAE 288
           GRGRGG+    RG GG GGG    GRG RGRGGRG  GR +    S+++LD EL+ YH E
Sbjct: 241 GRGRGGFMGRPRG-GGFGGGNFRGGRGARGRGGRGSGGRGRDENVSAEDLDAELDKYHKE 291

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q6NQ72 | 9.8e-70 | 55.56 | THO complex subunit 4D OS=Arabidopsis thaliana OX=3702 GN=ALY4 PE=1 SV=1
Q94EH8 | 1.1e-65 | 52.41 | THO complex subunit 4C OS=Arabidopsis thaliana OX=3702 GN=ALY3 PE=1 SV=1
Q8L719 | 4.6e-43 | 45.07 | THO complex subunit 4B OS=Arabidopsis thaliana OX=3702 GN=ALY2 PE=1 SV=1
Q8L773 | 1.8e-42 | 43.88 | THO complex subunit 4A OS=Arabidopsis thaliana OX=3702 GN=ALY1 PE=1 SV=1
B5FXN8 | 3.7e-32 | 40.14 | THO complex subunit 4 OS=Taeniopygia guttata OX=59729 GN=ALYREF PE=2 SV=1
Match Name | E-value | Identity | Description
XP_038896761.1 | 2.2e-135 | 92.33 | THO complex subunit 4D-like [Benincasa hispida]
XP_023527122.1 | 2.9e-135 | 91.19 | THO complex subunit 4D [Cucurbita pepo subsp. pepo]
XP_022982649.1 | 3.8e-135 | 88.20 | THO complex subunit 4D-like [Cucurbita maxima] >XP_022982650.1 THO complex subun...
XP_022935328.1 | 1.1e-134 | 90.85 | THO complex subunit 4D-like [Cucurbita moschata] >XP_022935329.1 THO complex sub...
KAG6581189.1 | 4.6e-133 | 86.17 | THO complex subunit 4C, partial [Cucurbita argyrosperma subsp. sororia]
Match Name | E-value | Identity | Description
A0A6J1J564 | 1.8e-135 | 88.20 | THO complex subunit 4D-like OS=Cucurbita maxima OX=3661 GN=LOC111481464 PE=4 SV=...
A0A6J1FAB9 | 5.3e-135 | 90.85 | THO complex subunit 4D-like OS=Cucurbita moschata OX=3662 GN=LOC111442248 PE=4 S...
A0A1S3C452 | 1.4e-132 | 90.24 | THO complex subunit 4D OS=Cucumis melo OX=3656 GN=LOC103496802 PE=4 SV=1
A0A6J1CP51 | 4.8e-128 | 89.55 | THO complex subunit 4D-like OS=Momordica charantia OX=3673 GN=LOC111013262 PE=4 ...
A0A0A0L717 | 1.7e-96 | 92.00 | RRM domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G302100 PE=4 SV...
Match Name | E-value | Identity | Description
AT5G37720.2 | 4.8e-72 | 56.66 | ALWAYS EARLY 4
AT5G37720.1 | 7.0e-71 | 55.56 | ALWAYS EARLY 4
AT1G66260.1 | 8.0e-67 | 52.41 | RNA-binding (RRM/RBD/RNP motifs) family protein
AT1G66260.2 | 8.0e-67 | 52.41 | RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G02530.1 | 3.3e-44 | 45.07 | RNA-binding (RRM/RBD/RNP motifs) family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000504 | RNA recognition motif domain | SMART | SM00360 | rrm1_1 | coord: 98..170; e-value: 3.2E-18; score: 76.5
IPR000504 | RNA recognition motif domain | PFAM | PF00076 | RRM_1 | coord: 99..167; e-value: 8.9E-15; score: 54.3
IPR000504 | RNA recognition motif domain | PROSITE | PS50102 | RRM | coord: 97..174; score: 14.999257
IPR025715 | Chromatin target of PRMT1 protein, C-terminal | SMART | SM01218 | FoP_duplication_2 | coord: 216..287; e-value: 2.0E-10; score: 50.7
IPR025715 | Chromatin target of PRMT1 protein, C-terminal | PFAM | PF13865 | FoP_duplication | coord: 224..282; e-value: 6.7E-6; score: 26.7
IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | GENE3D | 3.30.70.330 | | coord: 94..173; e-value: 4.7E-21; score: 76.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 17..51
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 201..215
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 260..281
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 201..287
None | No IPR available | PANTHER | PTHR19965 | RNA AND EXPORT FACTOR BINDING PROTEIN | coord: 1..287
None | No IPR available | PANTHER | PTHR19965:SF33 | THO COMPLEX SUBUNIT 4C | coord: 1..287
None | No IPR available | CDD | cd12680 | RRM_THOC4 | coord: 97..171; e-value: 8.07849E-41; score: 134.304
IPR035979 | RNA-binding domain superfamily | SUPERFAMILY | 54928 | RNA-binding domain, RBD | coord: 31..171

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0004012.1 | Tan0004012.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003723 | RNA binding
molecular_function | GO:0003676 | nucleic acid binding