CcUC05G094210.1 (mRNA) Watermelon (PI 537277) v1

Overview
Name: CcUC05G094210.1
Type: mRNA
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: THO complex subunit 4D
Location: CicolChr05: 13476577 .. 13486209 (-)
Sequence length: 1343
RNA-Seq Expression: CcUC05G094210.1
Synteny: CcUC05G094210.1
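The location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and compute the genomic span. The helper name `parse_location` is hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its four parts.

    Hypothetical helper; the field layout is inferred from this record only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CicolChr05: 13476577 .. 13486209 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature covers 9633 bp of the chromosome on the minus strand, compared with the 1343-nt sequence length reported for the mRNA; the difference is accounted for by introns.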
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTACTCCTTTGGATATGTCACTAGAGGATGTGATAAAGAAGAATAACCGTGAGAAGCTTAGAGCACGAGGTAGGGCCCGTCGTGGACGTGGAGCAGGTGGGTCTTTTAATGGTGGAAGAGGAGTAGTGTTAGGATCAGTTCGTAGAGGTCCTCTCGGCATAAACGCACGGGCATCTGCTTACTCAATTCGCAAGGCAAGCTTCAAACTGTGATGTTTAATGCATATGCTTGAGAACCTGATATTCCTTGAAAATACATGCTCACGGGGATTTCAAATAACACTATAGTAACTCTTAACAGAAGTGGATCAGTAAAATTGTTGTTCTGTGGAGAATCTCATTGTGGGTCACCCTCTGATCCGAATCGTCTTCTGGTGTATTTGTACCATTATCTTTGCAAATATTGAATACGTGTGGGTTATCTGGGTTATTTTCAAAGGACTTTCAAGGGTGGAGACTTATATTCTTGTCGATTTATAAGCTGATATGTATTTTCCCCGTCTGGATTCTTATCTGCATGCTTTCATCAGCCCCCACGCAGAATGAAGAATGTTCAATGGCAGCATGATTTATTTGAAGATAGTCTTAGAGCTTCAGGGATTTCTGGAATTGAAATTGGCACAAAGTTGTATGTTTCCAACTTGGATTATGGGGTGACCAAAGAAGATATAAGGGTACAACTCAAGCCATATATATTTGACTAAAGATGCTTTCTCTTCATTTTTTTGATTAATATTCACTTGGAACCTGACGGTGAACTCAGGAGTTGTTCTCTGAGATTGGAGACCTGAAACGATTTGCAATTCATTATGACAAAAATGGTCGTCCAAGCGTAAGTTCCTCCACTGTCTTTGTAAAATTATGTAGTAATGTGTTGCTATTTGGCTTTCTCTGCAATCAAGTATCAATATTGTTTTGATAGTCACAACTGTGTGACCATTGAATCAATATTTGCTTTTGATGGATTCAACTTTATGTGTATGCAGTACTGAATATTTTTTACTTTAGTAGTTGGATAGTGCTGGAAAAAACTATTATAAGGAACTTTTGACTAGAATACATGTTACAACTTCAAGTTAATGAATAAATTATATATATTGCATGTTTTTATATAATTTCTCTTTAATGGTTTAATATTTTATTTCATGAAGAAAATTATGCTCGAAAGTAATTAATTGATCATTAGTAATATGTGGTGCAGAACTTTCCGTTTATAATTAAAATATGGATGGAATAGAAGTGATCTTATTTCGAGTAAACAAATGTTGTATTTTTAATTAACTAACCTGTGCTGGCTAGGCTCAACACACTTCATTTTAACAAGTTTGAATGTAGCTCATTTAATATATTCAGTTCATTTCAATCTAGATATCTCTAATTGTTGCTCCTCTTCATTGTCTTTGCTTATCATTATTATTCAAATTTAGTTTCTCTTAAGTTAGCATGAGATTTATAATAAAATATGATAATTACTGTATTTCTGCTTTAAGATGCATATATAAAAGTTGGCTTCCTTATCAAGCTTTAGTTGGCTGATAAACTTTCCGTTGTTTATTGAAGGGCTCAGCAGAGGTGGTATATACTCGAAGAAGTGATGCATTTGCTGCTCTGAAGCGCTATAACAATGTGTTATTGGATGGGAAGCCAATGAAGATTGAAATGCTTGGTGATAATGCTGAAATGCCGGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACAGTTGTTCTAACGTATGTAGTATTTTTTAGTTACTGTTTCTCTTCCTTTTTGTCCCTTTTCTTAATTAATTTTAACATCTTTATAGGTAATTATTGTGTGCTTGGAGTAGAAAGTTACTGCACATTATTCTATTTCTCGGTCATTTCTATCTTGCTACTTTTAAGTTGCTTCTTATTCCCCAAAGGGAAACTATCACCGGCCTCATTCCCAATTAGTCGGATCAATAGTGTTTGAAAATCTTTCTGAAGGCATCTCTAGAT
TGGCTGATCAAGGTTCTCCCTTTAACATCTCAACAAATTCCAAACTTGCTTGTAGATTGACAAACAAATAACAGACTCCATTTGTTTAGCTAGTGGAATAGCTGATGCTAATAAGGAGCAGACTCAAGGATAAATCTTTATATTAGACTTGGAAAAGACCTATGAGCTAGTTAGTCAAAAACTTTCATCCAAAGACTTTGAATAGAAAGGTTCTGAGCCAATTGGGTTAGATGGAAGCTTAATTGTTTGATAGTTTACCTAGTATCAAAATTTCCATGTTCATCAATGGAAAGCCTAGAGTATGAGTTAAACTTTAAAGTTCTTAAATAGGTGATTTTGGGGGTGTTTGGGGCGCCGAGTTGAGTTTTGAATTTTGGAGTTAGGAAAAATTTGTGTTTGGGGTGCAAAGTTGTGAAGATGGAGTTGCTCGATAAATTTACAAACTTTAATAGTGTTCCCTATTGAAGTTGGTAGAGTTGAGTTATTTAACACCGACTTATAAAGTTGTTGGGCCAAACACTCCATTTTTGTCTATGTAGTACTAGATAATTGGGATAAAATATGGATTTTATTCAAATGATTGGGATGGAGGCCTAGATTCTTAGGAACCTTCCGAAGTTTGTGGAAGGTGAACTCCTATTTTTTCAATTTTCACTCTTTATTTTCCTTTTCCATGGGGGAGGGTAGTAAACTGAAACCTAAAATGTATCAATTGATACAACCCAGCTGCAATATATATCTTGAATAGAAAGCCCGTCAAATTTCTCGGAGGATCACCATGAGGAAAATTTTCAATATGGGTTGTATCAATTGATCCATTTTAGGTTTCACTTTACTACCCTCCCTCCAAGATGTTTGAAGGGCTTTCTATTTGAGATTTATATTGCAGTTGGGTTGTATCAATTGATCCATTTTAGGCTTCACTTGGACTTTAAGACTTCCAAATTCAGTTAATAATAGTAGGTTTTTTTTTTTAATGATAACAACTGCTTTCATTGAGAAAAAAATGAAAGAATACAAGGGTATACAAAAAACTAGCCCACTGGAACACCCCAACTAAAAGAAGGGTTCCAGGTAAGTAAAATGTTTCCTAAGGAATAGTTACAAAAAAGCTTCGAAACAGAAACCCAAAGGGACACATGAAATCTCACCAAATACCGAACGGTCCCTAGCTCTAAACACTCTATCATTCCTCTCCCCCCAAATGTCCCAAACAACCGCACACACCCAGCAAGCCACAAAAAAACGACCTTTCTCTCTGAAAGGCGGATGGAGGAGAAACTCCTCGATCGTTGCACGAATGTCTCTTTACCCAACAAAGCTAACATTGAACTCCTGCAAGAAGAGACTGCACATCGTCCTTGCATATTTGTAGTTCCAAAGGAGATGATCTCGATCTTCCTCAACCCTCCAATAAAGAATACAATAAAAAGGTCCCACAAGCGAAGTTCTTCTCCTAACAAGCCTATCAAAAGTGTTCACACGACCAAGCAAGACTTGCCAAATAAAAAACCTGACTGCCTTAGGAATCTTAATCCTCCAAATCACATCGAACACAGACTCCCCAATGGGAGAGGGACCCAATAACAATCTAAAGAAGGATTTGTTGCAGGAATATTGAGTTACTTTGCCTTCGTTGTTCTTCCCAGGCACAGTGAATTAGAGCCTGCTAAATCTCTTATCGATATTGAATGAAGTACAGGTCTCCGAAAATTACTATTGCTGAAATTAAACATTGAAGAATAAGGCAGCCAAAGAAAAAAACAAGCAAAATACTCCCAGAAGTAAGGCCATAGATTTTAAAATTCATGGCAGCTAAAACATCCATCCAAATGCCAAACCACCATCATAACCCATGACCCAAAATAATCTCAGCACCAATAAAAGAGGAATTTCGTCTTGAAGAGGGATAAAGAAGAGCTGCCTTTGTTTGTATACGAAGGCCACTAGAAGAATTTCTTATTCTGAATCGAAGTTGAAAATTCTGTATTGAAGCTGCT
CACTCTATCAATTGGATTGCCACTAAAATCTAAAAACTTCTCTATTACACTTATATAAACTACTTCCTTGGTTTAATTAATAGTACCATAATAAAAGGGGTGAACCTTCTGATAACCCCCATAACAACCCAACATGATTTGAGGATGTATCTAGTTTGAACGTAAAATGTCAAAAGTTGGAATTTTTGGGGGTCAACCTTGATTCATACGTTCACTTTATTAAGGAATTTGGATTGCAAGGTTGGTGATTGACCAGCCACCTATCTAGGCTTGCCTCTGAATGCTGAGCATAAAACCTAATCATTTTTCGGTTTCCATCATCGAGAAAATTTAGGAAAGACTGTACTCATACATCTCTAAAGAAGGCCGATAGATTCTCATTCAAGCTATACATTGGCCAACCTTCCCATCTACTATTTACCCCTTGGCCCATCTATCCCCTCAGCATCTGCTGAAAGACTCTTTTTGAACTTTTTGTGGATGGACAACGAATAAAAGCGTTCTTAACCTTAGATGGGAAGAAGTAAGTTAACTTTTGGAGAAGGAGGCTTATGGATTGTTAACCTAAAATAGAATAGTATATCCTTGTTGGCTAAATGGATTTGGAGATTTCATAAAGAGAAATCCACCCTGTGGAGGGAAGTTATTATTTCACACATCACTTGCAAAGTTGGTAAGGTTATTGGTACTCCTGATTTGTGGATTGGTATTGAGGCTCTGAAAGACAGCTTTCCATCCTTGTTTGCTAACACAGGGTTGATTGTTTTGTGGCTGATATGTGGAACAATCTTCAGCTTCTTGGAAATTTTACCCTAGGAGATATTTGATGTATGGAGAAACTGTGATTGGGCCAGCCTTCAAATGCTCTTGTCGGTGTGAATCTCAGTGAATCCTCTGATGACATTTGGTGCTGGAATTTAGAGATGGATGTCCCCCTCTTTTCAAATTCGTTATCTTCGATGGGGGAATTGGGCTGTAAAGATCTATCTAAGCATTAAGGAAGAATTATTTCGCAAAAAGGTGAAGTTCTTTTTAGGGGAGTCAAGTCACTCATGTTTTGATACTTACGACAGCCATCAAAGGCATCCTTGGATATCCATCTCTCCTAACTGCTGCATTTTGTGCAAAATCAATGGGGAATTGCATAGTTATCTCTTGTTTTGGTGACAAGCCAACATCTTTGGGACTTCATTTGTCAAGACAAGAGTTTAGGTGGCAGATCACCTTTCCTTTTGATGTTAGGGACACTCTTTCTATTGTTTTCAGTGGAGACCTTTCCAAGGAGAAGAAGATTGTCTGTCTGAACCTCATTTGGGCTTTCTTGTGGTCTTACTGGTTGGAACGCAACTCTAGTATCTTCATAGATAAGAGGAACAAGATTGAAAATTCTGTCCCCGTTGTTACAAACATACGATTCATGGGGAGATAAAGAAATAGATCGAACCGATCGTCTGTGTGTCACCCTTTTCCAATGCAAGGAAGATTAAAAATTACATATATTAGACATATTTTCGATTTAAATATAAACAAACCAAATCCTATTTCTTAATTCTAAATCATTTTAGATCCGATCGAAATAGGATTTTCGGGATAAGGAATAAACAAACCATGGATAAATCCAAGCGACTCCCTCTTAAATCCGAACGATCTTTTCGTAGGCGTTTGCCCCCGATTCAATCGGGGGATCGAATTGATTATAGAAACATGAGTTTAATTAGTCGATTTATTAGTGAACAAGGAAAAATATTATCTAGACGGGTAAATAGATTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGATTAAAAATTACATATATTAGACATATTTTCGATTTAAATATAAACAAACCAAATCCTATTTCTTAATTCTAAATCATTTTAGATCTGATCGAAATAGGATTTTCGGGATAAGGAATAAACAAACTAT
GGATAAATCCAAGCGACTCCCTCTTAAATCCAAACGATCTTTTCGTAGGCGTTTGCCCCCGATTCAATCAGGGGATTGAATTGATTATAGAAACATGAGTTTAATCAGTCGATTTATTAGTGAACAAGGAAAAATATTATCTAGACGGGTAAATAGATTGACTTTAAAACAGCAACGCTTAATTACTATTGCTATAGAACAAGCTCGTATTTTATCTTTGTTACCTTTTCTTAATAATGAGAAACAATTTGAAAGAAGCGAGTCGACCGCTAGAACTATTGGTCCTAGAACCAGGAATAAATAGGCTTACTCTTTAATTGAATTAAAATTCTAATCCAAACTCAACCGCAGATTGATGCTTTGTTCGAAAAATCCGAAAATCTAGATTTGATTGTCATGTCGTAAGAAAAAAAGAATAGTGGAAGAAGAATAAATCGTTTTTTTATTGANTCATGTCGTAAGAAAAAAAGAATAGTGGAAGAAGAATAAATCGTTTTCTTATTGAACATATTTGCTCATTTTGACTAATTTTTACTCTACCTTCTCGGAGTTCATTCTCCAGAGAACTCCATTTTAAGCATTCCGCTGCATTCTTTCCAATCTTCTTATTTTATGATCTTATTTCACATCTATTAAATACTTATCCTCTAATTCCAATCAAATTGAGCTCACCTTTTGTAGTTATAGTTGATCAAATCTCATGACCAATTGGAGATGTTTTGAACTGTTTCCTATAGAAAGAAAACCAACATGTTTTGGAATATCCTTAGAAGAACTTAAATCAACTGCCAGAGGGAAGAGCAGCCATTTTTTAGTGCTTTTCTTCGTCTTTAAATAATAGGGAAAGGCTTCATTGAGATCACCTCGGAGATAATGCTAATCCTTGACAGAAAACAGCATTTGGGAGTATTCCAAATAGTGGTTATTAGAGACAAAATAGTATGTAGTAAGTTTTGGGAGTTTAATGCTTTGAAGATATGTAATAAACACCATTTTCATCTAAATAGTTTTCAGTTCCATCATATGTAAAATCACTAAAAAGAAGTTCGATTGTGAGTACACTTCCAATGTAATGTAACTATACTAGATGCACAAATTCTTCCAACTGTCCTAAACTATCCTTTTCCGATTGCATTGTAGTTACTTCTTAAAATGCATCGACATCGTTTCTTGCTTTTAACTTGGAGAAAACTTACTCCACCTTAGAGGTAACTGTAGCATATCAATCCTCATGTCACACTGGAGGAACTTCTTGAGATTACTGAGTCACATATAGGACTTTTCTTGTAACCTATGTCATCAATGAATGTATAGAATTCTTGGGGACTTGGAGAAATCTGGAGAGGAGGTTTGAGAGTTGGTGACCTTTAATGCTTCCGTTTGGGCATTGGTCACTAGACTATTTTGTAATTATGATATCGGACAGGTTCTTTTGGACTGGAGTCCCTTCTCTAGTTCTGTTGGACTTTTGTTTTTTGGGGCTTATTTTTGTTTGTCCTTGTATTCTTTCATCTGTCTCAGAAAAAAGAAAAGAAAAGAAAAAGAAATGTTTAGATTGGCCTAATTTTGCACTGGTTTGTTAATGGTTTCTTATCTGATATAGACACTCCCTTGTCCTTGGCTGCGGTGTTGGAAACTGTCCCCCTCTGGAAATTGTATTTTGACATGTTTGATGAAAAAGAAACAGTCTGCTACAGATGATGAGTAATATTAATATTTGTTGGGTATATGCAGGCCTGAATCTGGTCGTACTGCTAGTTCTAACGTGGTCAACCCTTTTCCTGGGTAAGTTCTAATTGGACATTGCTTTGGGCATGAAATTTTTACATTCAATAGTGGAGTAGCTGGGTTTTATTGTTATTATTATACTTATAATCATGTTCTGGGATTGTTTCACTGCCTCCTAGTTGGATTGGTTTTGAAGTCATTTGTTTTTGGTTACTCGTCTCTCTGATTTTCTTTGGGGTGTTAGAATCGAGAGTTATATCTTTTCTACCTGT
TAGGAAGTGTGGAAGACAAACTAATGACTTTTTTTTTTTTTTTCTTTTTTTTATTTTTTCTCTCTTCTTTTTCTTCTTATAAAAGTATTTTAAGGAAATTTTCAAATATAGCAAAATATGCCTTTGTTATTTTCTAAAATTAATATGGAGAAGAAAGAAACGAATACACGAGCAAAATATTATTTTTCTTGTAGTTTTTATTAGGTTTAATTATAAGTTTAGTCTCTAAACTTTTAAGATTTGTGCCTATATGGTCCATGAGTTTTCAAAAGTGTCTAATAGAACCCCAAACTTTCAATTTTGTATGTTGCAGATCTCTGACTTTAAAAATATCTAATAAGTCTTTTATTATTGTTTTTCTGTTTTTTTAATAAGAAACAAGTCAAAATAAAATATATTAATGGAAAGAAAACATAACCTAAAGGGCTATGGGATGAGAGGTCCCTAAGCCTAATTTGAGGAATTGGCAAAAGCCCTCCACTGTGTATTACAGTAAGAATTTTGCAAAGCTCTCACATGGATGCAGTAAGCTGTGCATTTTCCTTGTGTCTAATGGGTCGTTGACCTATTGAATATTTTTTAAAATTCATGGATCTACCATAATACAAACTTGAAACTTTAGGGTTCCATTAGATTAAAATTCAAATTTATATCCATTAAACATAGACCTTTAAAAGTTTGATTATGCTAGGCCAAAATTGGAAGTTGAGAAAGCAATCAGTGTAGGATAGGATTGAAGTCACTGGAGGTATCGTTTGGAGGGTTTTTTGTTTTTTTTTTTTTTTGGGATAGGATTGAAGGGTTACTGGTTTAAGTCTGAACTCCTATATTTATATATTTTACTTCTGTATTTTGTTTTTTAAAAAGGATTGATATTGTATATATGGTGGTTCAGCTTCTGTGACCAAATCTTTAAAAATGTTTGTCATGAGTTGGAAATGGAGTAAGATGAGTTAACTTTGCTTGAATTGGTAGAACTTGTTCTTATTGCTGTCTAGTTTTATTGAAGTTCTTGGGTTTTGTATAGTCCAAGCCGGCGTGGAGGGCTGAGGAATGCCCTTGGCCGTGGGCGAGGTGGCTGGAACCGTGGTTTAGGTCTAGGTGGAGGAGGCCGTGGCCGAGGGCGTGGTCGTGGTCGTGGTCGTGGCCGTGGCCAGGGAAAAAAGAAACCTGTGGAGAAGTCTTCAGATGAACTTGACAAGGAGCTTGAAAACTACCATGCAGAAGCCATGCAAACCTGAGGAATTTCGAATATGCAAAAATCGAAGTATGATCTGATTAAACCACTTATGTTAGCTGTGTAAGAATATATATAGCATTCTCGGGGGTGATTATTATTTTATTTTAATTTTTGTTCCAGTAGACTAGATTTGTATTTTGCCTGTTAATGGAGAATGATGCTGTTTGACCCTCTGTTATCAAAACTTCTACCTTTTTTGGTTGATCCTGAATGGAACGAGCTCGGATCATCGATCTCTCAAATAGGTAAATTGCTTTATGTATAAAGGATTTAGTTTAGTTGTCTTTATACATTGTTAAGAATGGGAATATCCTGTATTTAAAGGACCTATTCTTCAAAACCTTGTCGTCTGGCCATCACTGTACTCTTTTCACTTGCTATTGAGCTTCTCAT

mRNA sequence

ATGGCTACTCCTTTGGATATGTCACTAGAGGATGTGATAAAGAAGAATAACCGTGAGAAGCTTAGAGCACGAGGTAGGGCCCGTCGTGGACGTGGAGCAGGTGGGTCTTTTAATGGTGGAAGAGGAGTAGTGTTAGGATCAGTTCGTAGAGGTCCTCTCGGCATAAACGCACGGGCATCTGCTTACTCAATTCGCAAGGCAAGCTTCAAACTAAGTGGATCAGTAAAATTGTTGTTCTGTGGAGAATCTCATTGTGGGTCACCCTCTGATCCGAATCGTCTTCTGCCCCCACGCAGAATGAAGAATGTTCAATGGCAGCATGATTTATTTGAAGATAGTCTTAGAGCTTCAGGGATTTCTGGAATTGAAATTGGCACAAAGTTGTATGTTTCCAACTTGGATTATGGGGTGACCAAAGAAGATATAAGGGAGTTGTTCTCTGAGATTGGAGACCTGAAACGATTTGCAATTCATTATGACAAAAATGGTCGTCCAAGCGGCTCAGCAGAGGTGGTATATACTCGAAGAAGTGATGCATTTGCTGCTCTGAAGCGCTATAACAATGTGTTATTGGATGGGAAGCCAATGAAGATTGAAATGCTTGGTGATAATGCTGAAATGCCGGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACAGTTGTTCTAACGCCTGAATCTGGTCGTACTGCTAGTTCTAACGTGGTCAACCCTTTTCCTGGTCCAAGCCGGCGTGGAGGGCTGAGGAATGCCCTTGGCCGTGGGCGAGGTGGCTGGAACCGTGGTTTAGGTCTAGGTGGAGGAGGCCGTGGCCGAGGGCGTGGTCGTGGTCGTGGTCGTGGCCGTGGCCAGGGAAAAAAGAAACCTGTGGAGAAGTCTTCAGATGAACTTGACAAGGAGCTTGAAAACTACCATGCAGAAGCCATGCAAACCTGAGGAATTTCGAATATGCAAAAATCGAAGTATGATCTGATTAAACCACTTATGTTAGCTGTGTAAGAATATATATAGCATTCTCGGGGGTGATTATTATTTTATTTTAATTTTTGTTCCAGTAGACTAGATTTGTATTTTGCCTGTTAATGGAGAATGATGCTGTTTGACCCTCTGTTATCAAAACTTCTACCTTTTTTGGTTGATCCTGAATGGAACGAGCTCGGATCATCGATCTCTCAAATAGGTAAATTGCTTTATGTATAAAGGATTTAGTTTAGTTGTCTTTATACATTGTTAAGAATGGGAATATCCTGTATTTAAAGGACCTATTCTTCAAAACCTTGTCGTCTGGCCATCACTGTACTCTTTTCACTTGCTATTGAGCTTCTCAT

Coding sequence (CDS)

ATGGCTACTCCTTTGGATATGTCACTAGAGGATGTGATAAAGAAGAATAACCGTGAGAAGCTTAGAGCACGAGGTAGGGCCCGTCGTGGACGTGGAGCAGGTGGGTCTTTTAATGGTGGAAGAGGAGTAGTGTTAGGATCAGTTCGTAGAGGTCCTCTCGGCATAAACGCACGGGCATCTGCTTACTCAATTCGCAAGGCAAGCTTCAAACTAAGTGGATCAGTAAAATTGTTGTTCTGTGGAGAATCTCATTGTGGGTCACCCTCTGATCCGAATCGTCTTCTGCCCCCACGCAGAATGAAGAATGTTCAATGGCAGCATGATTTATTTGAAGATAGTCTTAGAGCTTCAGGGATTTCTGGAATTGAAATTGGCACAAAGTTGTATGTTTCCAACTTGGATTATGGGGTGACCAAAGAAGATATAAGGGAGTTGTTCTCTGAGATTGGAGACCTGAAACGATTTGCAATTCATTATGACAAAAATGGTCGTCCAAGCGGCTCAGCAGAGGTGGTATATACTCGAAGAAGTGATGCATTTGCTGCTCTGAAGCGCTATAACAATGTGTTATTGGATGGGAAGCCAATGAAGATTGAAATGCTTGGTGATAATGCTGAAATGCCGGTTTCTGCACGTATAAATGTTACTGGAGTGAATGGAAGAAGCAGGAGGACAGTTGTTCTAACGCCTGAATCTGGTCGTACTGCTAGTTCTAACGTGGTCAACCCTTTTCCTGGTCCAAGCCGGCGTGGAGGGCTGAGGAATGCCCTTGGCCGTGGGCGAGGTGGCTGGAACCGTGGTTTAGGTCTAGGTGGAGGAGGCCGTGGCCGAGGGCGTGGTCGTGGTCGTGGTCGTGGCCGTGGCCAGGGAAAAAAGAAACCTGTGGAGAAGTCTTCAGATGAACTTGACAAGGAGCTTGAAAACTACCATGCAGAAGCCATGCAAACCTGA

Protein sequence

MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARASAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNVVNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVEKSSDELDKELENYHAEAMQT
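As a quick consistency check between the coding and protein sequences, the sketch below translates short stretches of the CDS with the standard genetic code. The function `translate` and the table construction are illustrative and not part of the database; only the first 30 and last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCTACTCCTTTGGATATGTCACTAGAG"  # first 30 nt of the CDS
cds_end = "GCCATGCAAACCTGA"                  # last 15 nt of the CDS, including the stop codon

print(translate(cds_start))  # MATPLDMSLE
print(translate(cds_end))    # AMQT
```

Both results agree with the start (MATPLDMSLE) and end (AMQT) of the protein sequence listed for this feature.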
Homology
BLAST of CcUC05G094210.1 vs. NCBI nr
Match: XP_038896761.1 (THO complex subunit 4D-like [Benincasa hispida])

HSP 1 Score: 505.4 bits (1300), Expect = 3.6e-139
Identity = 274/316 (86.71%), Positives = 280/316 (88.61%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MATPLDMSLEDVIKK+NREKLRARGRARRGRGAGGSFNGGRGVV+GSVRRGPLGINARAS
Sbjct: 1   MATPLDMSLEDVIKKSNREKLRARGRARRGRGAGGSFNGGRGVVVGSVRRGPLGINARAS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           AYSIRK                             PPRRMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AYSIRK-----------------------------PPRRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GIEIGTKLYVSNLDYGV+KEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIEIGTKLYVSNLDYGVSKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDG+PMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV
Sbjct: 181 AALKRYNNVLLDGRPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240

Query: 241 VNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVEKSSD 300
           VNPFPGPS RGGLRN  GRGRGGWNRG G+GGG  G GRGRGRGRGRGQG+KKPVEKSSD
Sbjct: 241 VNPFPGPSHRGGLRNGRGRGRGGWNRGQGVGGG--GGGRGRGRGRGRGQGRKKPVEKSSD 285

Query: 301 ELDKELENYHAEAMQT 317
           ELDKELENYHAEAMQT
Sbjct: 301 ELDKELENYHAEAMQT 285
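The percentages on each HSP statistics line are the matching count divided by the alignment length. The sketch below recomputes them from the raw counts of the first hit; the helper name `hsp_fractions` is hypothetical, and the regular expression assumes the "Label = n/m" layout used on this page.

```python
import re

def hsp_fractions(stat_line):
    """Return {label: percent} for every 'Label = n/m' pair on an HSP statistics line."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", stat_line)
    return {label: round(100 * int(n) / int(m), 2) for label, n, m in pairs}

# Statistics line of the Benincasa hispida hit (XP_038896761.1).
line = "Identity = 274/316 (86.71%), Positives = 280/316 (88.61%), Query Frame = 0"
print(hsp_fractions(line))  # {'Identity': 86.71, 'Positives': 88.61}
```

The denominator is the alignment length including gap columns, which is why it varies between hits (316, 324, 334 and so on) even though the query protein is the same.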

BLAST of CcUC05G094210.1 vs. NCBI nr
Match: XP_008457020.1 (PREDICTED: THO complex subunit 4D [Cucumis melo])

HSP 1 Score: 494.6 bits (1272), Expect = 6.4e-136
Identity = 268/316 (84.81%), Positives = 274/316 (86.71%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M TPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVV+GSVRRGPLGINARAS
Sbjct: 1   MTTPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVIGSVRRGPLGINARAS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           AYSIRK                             PP RMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AYSIRK-----------------------------PPHRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GI+IGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIQIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTG NGR+RRTVVLTPESGR A+ NV
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGTNGRNRRTVVLTPESGRNATFNV 240

Query: 241 VNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVEKSSD 300
           VNPFPGPS RGGLRNA GRGRG W RG+GLGG   G GRGRGRGRGRGQG+KKPVEKSSD
Sbjct: 241 VNPFPGPSHRGGLRNARGRGRGAWTRGVGLGGS--GGGRGRGRGRGRGQGRKKPVEKSSD 285

Query: 301 ELDKELENYHAEAMQT 317
           ELDKELENYHAEAMQT
Sbjct: 301 ELDKELENYHAEAMQT 285

BLAST of CcUC05G094210.1 vs. NCBI nr
Match: XP_004145851.1 (THO complex subunit 4D [Cucumis sativus] >KAE8650594.1 hypothetical protein Csa_011773 [Cucumis sativus])

HSP 1 Score: 490.3 bits (1261), Expect = 1.2e-134
Identity = 266/316 (84.18%), Positives = 275/316 (87.03%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M TPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVV+GSVRRGPLGINARAS
Sbjct: 1   MTTPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVIGSVRRGPLGINARAS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           AYSIRK                             PP RMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AYSIRK-----------------------------PPHRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GI+IGTKLYVSNLDYGVTKEDI+ELFSEIGD+KRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIQIGTKLYVSNLDYGVTKEDIKELFSEIGDVKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTG NGR+RRTVVLT ESGR A+SNV
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGTNGRNRRTVVLTSESGRNATSNV 240

Query: 241 VNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVEKSSD 300
           VN FPGPS RGGLRNA GRGRG W+RG+GLGGG  G GRGRGRGRGRGQG+KKPVEKSSD
Sbjct: 241 VNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGS-GGGRGRGRGRGRGQGRKKPVEKSSD 286

Query: 301 ELDKELENYHAEAMQT 317
           ELDKELENYHAEAMQT
Sbjct: 301 ELDKELENYHAEAMQT 286

BLAST of CcUC05G094210.1 vs. NCBI nr
Match: XP_023527122.1 (THO complex subunit 4D [Cucurbita pepo subsp. pepo])

HSP 1 Score: 482.6 bits (1241), Expect = 2.5e-132
Identity = 267/324 (82.41%), Positives = 274/324 (84.57%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR VV+GSVRRGPLGINAR S
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           A+SI K                             PPRRMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AFSISK-----------------------------PPRRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNA+ PVSARINVTGVNGRSRRTVVLT ESGRT SSN 
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNADTPVSARINVTGVNGRSRRTVVLTSESGRTGSSNA 240

Query: 241 VNPFPGPSRRGGLRN--ALGRGRGGWNRGL------GLGGGGRGRGRGRGRGRGRGQGKK 300
           VNPFPGPS RGGLR+    GRGRGGW+RGL      GLGGGGRGRGRG G GRGRGQG+K
Sbjct: 241 VNPFPGPSHRGGLRSGRGRGRGRGGWSRGLGGGGGRGLGGGGRGRGRGSGSGRGRGQGRK 295

Query: 301 KPVEKSSDELDKELENYHAEAMQT 317
           KPVEKSS ELDKELENYHAEAMQT
Sbjct: 301 KPVEKSSAELDKELENYHAEAMQT 295

BLAST of CcUC05G094210.1 vs. NCBI nr
Match: XP_022935328.1 (THO complex subunit 4D-like [Cucurbita moschata] >XP_022935329.1 THO complex subunit 4D-like [Cucurbita moschata] >XP_022935330.1 THO complex subunit 4D-like [Cucurbita moschata])

HSP 1 Score: 480.7 bits (1236), Expect = 9.6e-132
Identity = 266/324 (82.10%), Positives = 274/324 (84.57%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR VV+GSVRRGPLGINAR S
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           A+SI K                             PPRRMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AFSISK-----------------------------PPRRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNA+ PVSARINVTGVNGRSRRTVVLT ESGRT SS+ 
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNADTPVSARINVTGVNGRSRRTVVLTSESGRTGSSHA 240

Query: 241 VNPFPGPSRRGGLRN--ALGRGRGGWNRGL------GLGGGGRGRGRGRGRGRGRGQGKK 300
           VNPFPGPS RGGLR+    GRGRGGW+RGL      GLGGGGRGRGRG G GRGRGQG+K
Sbjct: 241 VNPFPGPSHRGGLRSGRGRGRGRGGWSRGLGGGGGRGLGGGGRGRGRGSGSGRGRGQGRK 295

Query: 301 KPVEKSSDELDKELENYHAEAMQT 317
           KPVEKSS ELDKELENYHAEAMQT
Sbjct: 301 KPVEKSSAELDKELENYHAEAMQT 295

BLAST of CcUC05G094210.1 vs. ExPASy Swiss-Prot
Match: Q6NQ72 (THO complex subunit 4D OS=Arabidopsis thaliana OX=3702 GN=ALY4 PE=1 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 8.6e-67
Identity = 169/326 (51.84%), Positives = 215/326 (65.95%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRAR-RGRGAGGSFNGGRGVVLGSVRRGPLGINARA 60
           M+  L+M+L++++K+    +   RG +R RGRG GG   GGRG   G  RRGPL +NAR 
Sbjct: 1   MSGALNMTLDEIVKRGKTARSGGRGISRGRGRGRGG---GGRGA--GPARRGPLAVNARP 60

Query: 61  SAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGI 120
           S+++I K                             P RR++++ WQ  LFED LRA+G 
Sbjct: 61  SSFTINK-----------------------------PVRRVRSLPWQSGLFEDGLRAAGA 120

Query: 121 SGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDA 180
           SG+E+GT+L+V+NLD GVT EDIRELFSEIG+++R+AIHYDKNGRPSG+AEVVY RRSDA
Sbjct: 121 SGVEVGTRLHVTNLDQGVTNEDIRELFSEIGEVERYAIHYDKNGRPSGTAEVVYPRRSDA 180

Query: 181 FAALKRYNNVLLDGKPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRRTVVLTPESGR- 240
           F ALK+YNNVLLDG+PM++E+LG N  +E P+S R  +NVTG+NGR +RTVV+    G  
Sbjct: 181 FQALKKYNNVLLDGRPMRLEILGGNNSSEAPLSGRVNVNVTGLNGRLKRTVVIQQGGGGR 240

Query: 241 ---TASSNVVNPFPGPSRRGGLRNALGRG-RGGWNRGLGLGGGGRGRGRGRGRGRGRGQG 300
                      P P  SRR  + N  G G RGG  RG G    GRG G GRGRG GRG G
Sbjct: 241 GRVRGGRGGRGPAPTVSRRLPIHNQQGGGMRGG--RG-GFRARGRGNG-GRGRGGGRGNG 287

Query: 301 KKKPVEKSSDELDKELENYHAEAMQT 317
            KKPVEKS+ +LDK+LE+YHA+AM T
Sbjct: 301 -KKPVEKSAADLDKDLESYHADAMNT 287

BLAST of CcUC05G094210.1 vs. ExPASy Swiss-Prot
Match: Q94EH8 (THO complex subunit 4C OS=Arabidopsis thaliana OX=3702 GN=ALY3 PE=1 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 2.2e-62
Identity = 165/341 (48.39%), Positives = 215/341 (63.05%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRA-----RGRARR-GRGAGGS---FNGGRGVVLGSVRRG 60
           M+  L+M+L++++KK+  E+  A     +G +R+ GRG GG      GGRG   G VRRG
Sbjct: 1   MSDALNMTLDEIVKKSKSERSAAARSGGKGVSRKSGRGRGGPNGVVGGGRGG--GPVRRG 60

Query: 61  PLGINAR-ASAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQW--QHD 120
           PL +N R +S++SI K +                             RR +++ W  Q+D
Sbjct: 61  PLAVNTRPSSSFSINKLA-----------------------------RRKRSLPWQNQND 120

Query: 121 LFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGS 180
           L+E++LRA G+SG+E+GT +Y++NLD GVT EDIREL++EIG+LKR+AIHYDKNGRPSGS
Sbjct: 121 LYEETLRAVGVSGVEVGTTVYITNLDQGVTNEDIRELYAEIGELKRYAIHYDKNGRPSGS 180

Query: 181 AEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAE-MPVSARINVTGVNGRSRRTVV 240
           AEVVY RRSDA  A+++YNNVLLDG+PMK+E+LG N E  PV+AR+NVTG+NGR +R+V 
Sbjct: 181 AEVVYMRRSDAIQAMRKYNNVLLDGRPMKLEILGGNTESAPVAARVNVTGLNGRMKRSV- 240

Query: 241 LTPESGRTASSNVVNPFPGPSRRGGLRNALGRGRGGWNRGLGL-----------GGGGRG 300
                           F G   RGG R   GRG G   R L L            GG RG
Sbjct: 241 ----------------FIGQGVRGG-RVGRGRGSGPSGRRLPLQQNQQGGVTAGRGGFRG 292

Query: 301 RGRGRGRGRGR---GQGKKKPVEKSSDELDKELENYHAEAM 315
           RGRG G GRG    G+G KKPVEKS+ +LDK+LE+YHAEAM
Sbjct: 301 RGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAM 292

BLAST of CcUC05G094210.1 vs. ExPASy Swiss-Prot
Match: Q8L719 (THO complex subunit 4B OS=Arabidopsis thaliana OX=3702 GN=ALY2 PE=1 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 1.4e-40
Identity = 137/333 (41.14%), Positives = 178/333 (53.45%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M+  LDMSL+D+I K+NR+   +RGR   G G      GG G   G  RR    + AR +
Sbjct: 1   MSGGLDMSLDDII-KSNRKPTGSRGRGGIGGGNNTGGRGGSGSNSGPSRRFANRVGARTA 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLF--EDSLRAS- 120
            YS                             R +  ++  +  WQ+D+F  + S+ A+ 
Sbjct: 61  PYS-----------------------------RPIQQQQAHDAMWQNDVFATDASVAAAF 120

Query: 121 ---------GISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGS 180
                    G S IE GTKLY+SNLDYGV+ EDI+ELFSE+GDLKR+ IHYD++GR  G+
Sbjct: 121 GHHQTAVVGGGSSIETGTKLYISNLDYGVSNEDIKELFSEVGDLKRYGIHYDRSGRSKGT 180

Query: 181 AEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVL 240
           AEVV++RR DA AA+KRYNNV LDGK MKIE++G N   P    +    +          
Sbjct: 181 AEVVFSRRGDALAAVKRYNNVQLDGKLMKIEIVGTNLSAPALPILATAQIP--------- 240

Query: 241 TPESGRTASSNVVNPFPGPSRRGGLRNALGRGRGGW---NRGLGLGGGGRGRGRG-RGRG 300
            P +G   + N    F G        N  GRGRGG+    RG G GGG    GRG RGRG
Sbjct: 241 FPTNGILGNFN--ENFNGNFNGNFNGNFRGRGRGGFMGRPRGGGFGGGNFRGGRGARGRG 291

Query: 301 RGRGQGKKKPVEK-SSDELDKELENYHAEAMQT 317
            GRG G +   E  S+++LD EL+ YH EAM+T
Sbjct: 301 -GRGSGGRGRDENVSAEDLDAELDKYHKEAMET 291

BLAST of CcUC05G094210.1 vs. ExPASy Swiss-Prot
Match: Q8L773 (THO complex subunit 4A OS=Arabidopsis thaliana OX=3702 GN=ALY1 PE=1 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 4.4e-39
Identity = 127/321 (39.56%), Positives = 167/321 (52.02%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M+T LDMSL+D+I KN     ++RG A   RG G     G      + R  P   + R++
Sbjct: 1   MSTGLDMSLDDMIAKNR----KSRGGAGPARGTGSGSGPG-----PTRRNNPNRKSTRSA 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLF----EDSLRA 120
            Y   KA                                     W HD+F    ED    
Sbjct: 61  PYQSAKA---------------------------------PESTWGHDMFSDRSEDHRSG 120

Query: 121 SGISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRR 180
              +GIE GTKLY+SNLDYGV  EDI+ELF+E+G+LKR+ +H+D++GR  G+AEVVY+RR
Sbjct: 121 RSSAGIETGTKLYISNLDYGVMNEDIKELFAEVGELKRYTVHFDRSGRSKGTAEVVYSRR 180

Query: 181 SDAFAALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTA 240
            DA AA+K+YN+V LDGKPMKIE++G N +   +                     SGR A
Sbjct: 181 GDALAAVKKYNDVQLDGKPMKIEIVGTNLQTAAA--------------------PSGRPA 240

Query: 241 SSNVVNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVE 300
           + N      G   RG      G+GRGG  RG G GGGGRG G GRGR  G+G     P E
Sbjct: 241 NGN----SNGAPWRG------GQGRGGQQRGGGRGGGGRG-GGGRGRRPGKG-----PAE 243

Query: 301 K-SSDELDKELENYHAEAMQT 317
           K S+++LD +L+ YH+  M+T
Sbjct: 301 KISAEDLDADLDKYHSGDMET 243

BLAST of CcUC05G094210.1 vs. ExPASy Swiss-Prot
Match: B5FXN8 (THO complex subunit 4 OS=Taeniopygia guttata OX=59729 GN=ALYREF PE=2 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 1.8e-32
Identity = 116/316 (36.71%), Positives = 165/316 (52.22%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MA  +DMSL+D+IK N  ++  +RG  R GRG GG+  G      G   RG +G   RA 
Sbjct: 1   MADKMDMSLDDIIKLNRSQRGASRG-GRGGRGRGGTARG------GGPGRGGVG-GGRAG 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNV--QWQHDLFEDSLRASG 120
              +R       G  +               NR  P  R K +  +WQHDLF+    A  
Sbjct: 61  GGPVRNRPVMARGGGR---------------NRPAPYSRPKQLPEKWQHDLFDSGFGAG- 120

Query: 121 ISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSD 180
            +G+E G KL VSNLD+GV+  DI+ELF+E G LK+ A+HYD++GR  G+A+V + R++D
Sbjct: 121 -AGVETGGKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKAD 180

Query: 181 AFAALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASS 240
           A  A+K+YN V LDG+PM I++        V+++I                 ++ R  + 
Sbjct: 181 ALKAMKQYNGVPLDGRPMNIQL--------VTSQI-----------------DTQRRPAQ 240

Query: 241 NVVNPFPGPSRRGGLRNALGRGRGGWNRGL--GLGGGGRGRG-RGRGRGRGRGQGKKKPV 300
           +V         RGG+           NRG+  G GGGG  RG RG  RGRGRG G+    
Sbjct: 241 SV--------NRGGMTR---------NRGVLGGFGGGGNRRGTRGGNRGRGRGAGRTSKQ 249

Query: 301 EKSSDELDKELENYHA 312
           + S++ELD +L+ Y+A
Sbjct: 301 QLSAEELDAQLDAYNA 249

BLAST of CcUC05G094210.1 vs. ExPASy TrEMBL
Match: A0A1S3C452 (THO complex subunit 4D OS=Cucumis melo OX=3656 GN=LOC103496802 PE=4 SV=1)

HSP 1 Score: 494.6 bits (1272), Expect = 3.1e-136
Identity = 268/316 (84.81%), Positives = 274/316 (86.71%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M TPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVV+GSVRRGPLGINARAS
Sbjct: 1   MTTPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVIGSVRRGPLGINARAS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           AYSIRK                             PP RMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AYSIRK-----------------------------PPHRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GI+IGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIQIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTG NGR+RRTVVLTPESGR A+ NV
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGTNGRNRRTVVLTPESGRNATFNV 240

Query: 241 VNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVEKSSD 300
           VNPFPGPS RGGLRNA GRGRG W RG+GLGG   G GRGRGRGRGRGQG+KKPVEKSSD
Sbjct: 241 VNPFPGPSHRGGLRNARGRGRGAWTRGVGLGGS--GGGRGRGRGRGRGQGRKKPVEKSSD 285

Query: 301 ELDKELENYHAEAMQT 317
           ELDKELENYHAEAMQT
Sbjct: 301 ELDKELENYHAEAMQT 285

BLAST of CcUC05G094210.1 vs. ExPASy TrEMBL
Match: A0A6J1FAB9 (THO complex subunit 4D-like OS=Cucurbita moschata OX=3662 GN=LOC111442248 PE=4 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 4.6e-132
Identity = 266/324 (82.10%), Positives = 274/324 (84.57%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR VV+GSVRRGPLGINAR S
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           A+SI K                             PPRRMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AFSISK-----------------------------PPRRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNA+ PVSARINVTGVNGRSRRTVVLT ESGRT SS+ 
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNADTPVSARINVTGVNGRSRRTVVLTSESGRTGSSHA 240

Query: 241 VNPFPGPSRRGGLRN--ALGRGRGGWNRGL------GLGGGGRGRGRGRGRGRGRGQGKK 300
           VNPFPGPS RGGLR+    GRGRGGW+RGL      GLGGGGRGRGRG G GRGRGQG+K
Sbjct: 241 VNPFPGPSHRGGLRSGRGRGRGRGGWSRGLGGGGGRGLGGGGRGRGRGSGSGRGRGQGRK 295

Query: 301 KPVEKSSDELDKELENYHAEAMQT 317
           KPVEKSS ELDKELENYHAEAMQT
Sbjct: 301 KPVEKSSAELDKELENYHAEAMQT 295

BLAST of CcUC05G094210.1 vs. ExPASy TrEMBL
Match: A0A6J1J564 (THO complex subunit 4D-like OS=Cucurbita maxima OX=3661 GN=LOC111481464 PE=4 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 3.0e-131
Identity = 267/334 (79.94%), Positives = 274/334 (82.04%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGS+NGGR VV+GSVRRGPLGINAR S
Sbjct: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSYNGGRAVVIGSVRRGPLGINARPS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           A+SI K                             PPRRMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AFSISK-----------------------------PPRRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIEMLGDNA+ PVSARINVTGVNGRSRRTVVLT ESGRT SSN 
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNADTPVSARINVTGVNGRSRRTVVLTSESGRTGSSNA 240

Query: 241 VNPFPGPSRRGGLRNALGRGRGGWNRGLGLGG------------GGRGRGRGR------G 300
           VNPFPGPS RGGLR+  GRGRGGW+RGLG GG            GGRGRGRGR      G
Sbjct: 241 VNPFPGPSHRGGLRSGRGRGRGGWSRGLGGGGGRGLGGGGGRGLGGRGRGRGRGSGSGSG 300

Query: 301 RGRGRGQGKKKPVEKSSDELDKELENYHAEAMQT 317
            GRGRGQG+KKPVEKSS ELDKELENYHAEAMQT
Sbjct: 301 SGRGRGQGRKKPVEKSSAELDKELENYHAEAMQT 305

BLAST of CcUC05G094210.1 vs. ExPASy TrEMBL
Match: A0A6J1CP51 (THO complex subunit 4D-like OS=Momordica charantia OX=3673 GN=LOC111013262 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 1.6e-124
Identity = 256/316 (81.01%), Positives = 264/316 (83.54%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           MATPLDMSLED+IKKNNREKLRARGRARRGRGAGGSFNGGR VV+GS+RRGPL IN R S
Sbjct: 1   MATPLDMSLEDMIKKNNREKLRARGRARRGRGAGGSFNGGR-VVIGSIRRGPLSINTRPS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           A+SI K                             PPRRMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AFSISK-----------------------------PPRRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTASSNV 240
           AALKRYNNVLLDGKPMKIE+LGDNAEMPVSARINVTG+NGRSRRTVVLT ESGRT SS V
Sbjct: 181 AALKRYNNVLLDGKPMKIEILGDNAEMPVSARINVTGLNGRSRRTVVLTSESGRTDSSTV 240

Query: 241 VNPFPGPSRRGGLRNALGRGRGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKPVEKSSD 300
           VN FPGPS RG LR   GRGRGGW+RG G  GGGRGRGRGRGRG GR    KK VEKSSD
Sbjct: 241 VNHFPGPSNRGALRGR-GRGRGGWSRGQGQVGGGRGRGRGRGRGLGR----KKTVEKSSD 281

Query: 301 ELDKELENYHAEAMQT 317
           ELDK+LENYHAEAMQT
Sbjct: 301 ELDKDLENYHAEAMQT 281

BLAST of CcUC05G094210.1 vs. ExPASy TrEMBL
Match: A0A5D3BPA9 (THO complex subunit 4D OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold451G002100 PE=4 SV=1)

HSP 1 Score: 435.6 bits (1119), Expect = 1.7e-118
Identity = 225/237 (94.94%), Positives = 229/237 (96.62%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M TPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVV+GSVRRGPLGINARAS
Sbjct: 1   MTTPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVIGSVRRGPLGINARAS 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGIS 120
           AYSIRKASFKLSGSV +LFCGESHCGSPSDPN LLPP RMKNVQWQHDLFEDSLRASGIS
Sbjct: 61  AYSIRKASFKLSGSV-MLFCGESHCGSPSDPNCLLPPHRMKNVQWQHDLFEDSLRASGIS 120

Query: 121 GIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180
           GI+IGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF
Sbjct: 121 GIQIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDAF 180

Query: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVLTPESGRTAS 238
           AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTG NGR+RRTVVLT  SG TAS
Sbjct: 181 AALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGTNGRNRRTVVLTYVSG-TAS 235

BLAST of CcUC05G094210.1 vs. TAIR 10
Match: AT5G37720.2 (ALWAYS EARLY 4 )

HSP 1 Score: 260.0 bits (663), Expect = 2.5e-69
Identity = 170/322 (52.80%), Positives = 216/322 (67.08%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRAR-RGRGAGGSFNGGRGVVLGSVRRGPLGINARA 60
           M+  L+M+L++++K+    +   RG +R RGRG GG   GGRG   G  RRGPL +NAR 
Sbjct: 1   MSGALNMTLDEIVKRGKTARSGGRGISRGRGRGRGG---GGRGA--GPARRGPLAVNARP 60

Query: 61  SAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGI 120
           S+++I K                             P RR++++ WQ  LFED LRA+G 
Sbjct: 61  SSFTINK-----------------------------PVRRVRSLPWQSGLFEDGLRAAGA 120

Query: 121 SGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDA 180
           SG+E+GT+L+V+NLD GVT EDIRELFSEIG+++R+AIHYDKNGRPSG+AEVVY RRSDA
Sbjct: 121 SGVEVGTRLHVTNLDQGVTNEDIRELFSEIGEVERYAIHYDKNGRPSGTAEVVYPRRSDA 180

Query: 181 FAALKRYNNVLLDGKPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRRTVVLTPESGRT 240
           F ALK+YNNVLLDG+PM++E+LG N  +E P+S R  +NVTG+NGR +RTVV+    GR 
Sbjct: 181 FQALKKYNNVLLDGRPMRLEILGGNNSSEAPLSGRVNVNVTGLNGRLKRTVVIQVRGGRG 240

Query: 241 ASSNVVNPFPGPSRRGGLRNALGRG-RGGWNRGLGLGGGGRGRGRGRGRGRGRGQGKKKP 300
                  P P  SRR  + N  G G RGG  RG G    GRG G GRGRG GRG G KKP
Sbjct: 241 GR----GPAPTVSRRLPIHNQQGGGMRGG--RG-GFRARGRGNG-GRGRGGGRGNG-KKP 279

Query: 301 VEKSSDELDKELENYHAEAMQT 317
           VEKS+ +LDK+LE+YHA+AM T
Sbjct: 301 VEKSAADLDKDLESYHADAMNT 279

BLAST of CcUC05G094210.1 vs. TAIR 10
Match: AT5G37720.1 (ALWAYS EARLY 4 )

HSP 1 Score: 255.4 bits (651), Expect = 6.1e-68
Identity = 169/326 (51.84%), Positives = 215/326 (65.95%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRAR-RGRGAGGSFNGGRGVVLGSVRRGPLGINARA 60
           M+  L+M+L++++K+    +   RG +R RGRG GG   GGRG   G  RRGPL +NAR 
Sbjct: 1   MSGALNMTLDEIVKRGKTARSGGRGISRGRGRGRGG---GGRGA--GPARRGPLAVNARP 60

Query: 61  SAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLFEDSLRASGI 120
           S+++I K                             P RR++++ WQ  LFED LRA+G 
Sbjct: 61  SSFTINK-----------------------------PVRRVRSLPWQSGLFEDGLRAAGA 120

Query: 121 SGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGSAEVVYTRRSDA 180
           SG+E+GT+L+V+NLD GVT EDIRELFSEIG+++R+AIHYDKNGRPSG+AEVVY RRSDA
Sbjct: 121 SGVEVGTRLHVTNLDQGVTNEDIRELFSEIGEVERYAIHYDKNGRPSGTAEVVYPRRSDA 180

Query: 181 FAALKRYNNVLLDGKPMKIEMLGDN--AEMPVSAR--INVTGVNGRSRRTVVLTPESGR- 240
           F ALK+YNNVLLDG+PM++E+LG N  +E P+S R  +NVTG+NGR +RTVV+    G  
Sbjct: 181 FQALKKYNNVLLDGRPMRLEILGGNNSSEAPLSGRVNVNVTGLNGRLKRTVVIQQGGGGR 240

Query: 241 ---TASSNVVNPFPGPSRRGGLRNALGRG-RGGWNRGLGLGGGGRGRGRGRGRGRGRGQG 300
                      P P  SRR  + N  G G RGG  RG G    GRG G GRGRG GRG G
Sbjct: 241 GRVRGGRGGRGPAPTVSRRLPIHNQQGGGMRGG--RG-GFRARGRGNG-GRGRGGGRGNG 287

Query: 301 KKKPVEKSSDELDKELENYHAEAMQT 317
            KKPVEKS+ +LDK+LE+YHA+AM T
Sbjct: 301 -KKPVEKSAADLDKDLESYHADAMNT 287

BLAST of CcUC05G094210.1 vs. TAIR 10
Match: AT1G66260.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 240.7 bits (613), Expect = 1.6e-63
Identity = 165/341 (48.39%), Positives = 215/341 (63.05%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRA-----RGRARR-GRGAGGS---FNGGRGVVLGSVRRG 60
           M+  L+M+L++++KK+  E+  A     +G +R+ GRG GG      GGRG   G VRRG
Sbjct: 1   MSDALNMTLDEIVKKSKSERSAAARSGGKGVSRKSGRGRGGPNGVVGGGRGG--GPVRRG 60

Query: 61  PLGINAR-ASAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQW--QHD 120
           PL +N R +S++SI K +                             RR +++ W  Q+D
Sbjct: 61  PLAVNTRPSSSFSINKLA-----------------------------RRKRSLPWQNQND 120

Query: 121 LFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGS 180
           L+E++LRA G+SG+E+GT +Y++NLD GVT EDIREL++EIG+LKR+AIHYDKNGRPSGS
Sbjct: 121 LYEETLRAVGVSGVEVGTTVYITNLDQGVTNEDIRELYAEIGELKRYAIHYDKNGRPSGS 180

Query: 181 AEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAE-MPVSARINVTGVNGRSRRTVV 240
           AEVVY RRSDA  A+++YNNVLLDG+PMK+E+LG N E  PV+AR+NVTG+NGR +R+V 
Sbjct: 181 AEVVYMRRSDAIQAMRKYNNVLLDGRPMKLEILGGNTESAPVAARVNVTGLNGRMKRSV- 240

Query: 241 LTPESGRTASSNVVNPFPGPSRRGGLRNALGRGRGGWNRGLGL-----------GGGGRG 300
                           F G   RGG R   GRG G   R L L            GG RG
Sbjct: 241 ----------------FIGQGVRGG-RVGRGRGSGPSGRRLPLQQNQQGGVTAGRGGFRG 292

Query: 301 RGRGRGRGRGR---GQGKKKPVEKSSDELDKELENYHAEAM 315
           RGRG G GRG    G+G KKPVEKS+ +LDK+LE+YHAEAM
Sbjct: 301 RGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAM 292

BLAST of CcUC05G094210.1 vs. TAIR 10
Match: AT1G66260.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 240.7 bits (613), Expect = 1.6e-63
Identity = 165/341 (48.39%), Positives = 215/341 (63.05%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRA-----RGRARR-GRGAGGS---FNGGRGVVLGSVRRG 60
           M+  L+M+L++++KK+  E+  A     +G +R+ GRG GG      GGRG   G VRRG
Sbjct: 1   MSDALNMTLDEIVKKSKSERSAAARSGGKGVSRKSGRGRGGPNGVVGGGRGG--GPVRRG 60

Query: 61  PLGINAR-ASAYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQW--QHD 120
           PL +N R +S++SI K +                             RR +++ W  Q+D
Sbjct: 61  PLAVNTRPSSSFSINKLA-----------------------------RRKRSLPWQNQND 120

Query: 121 LFEDSLRASGISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGS 180
           L+E++LRA G+SG+E+GT +Y++NLD GVT EDIREL++EIG+LKR+AIHYDKNGRPSGS
Sbjct: 121 LYEETLRAVGVSGVEVGTTVYITNLDQGVTNEDIRELYAEIGELKRYAIHYDKNGRPSGS 180

Query: 181 AEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAE-MPVSARINVTGVNGRSRRTVV 240
           AEVVY RRSDA  A+++YNNVLLDG+PMK+E+LG N E  PV+AR+NVTG+NGR +R+V 
Sbjct: 181 AEVVYMRRSDAIQAMRKYNNVLLDGRPMKLEILGGNTESAPVAARVNVTGLNGRMKRSV- 240

Query: 241 LTPESGRTASSNVVNPFPGPSRRGGLRNALGRGRGGWNRGLGL-----------GGGGRG 300
                           F G   RGG R   GRG G   R L L            GG RG
Sbjct: 241 ----------------FIGQGVRGG-RVGRGRGSGPSGRRLPLQQNQQGGVTAGRGGFRG 292

Query: 301 RGRGRGRGRGR---GQGKKKPVEKSSDELDKELENYHAEAM 315
           RGRG G GRG    G+G KKPVEKS+ +LDK+LE+YHAEAM
Sbjct: 301 RGRGNGGGRGNKSGGRGGKKPVEKSAADLDKDLESYHAEAM 292

BLAST of CcUC05G094210.1 vs. TAIR 10
Match: AT5G02530.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 168.3 bits (425), Expect = 9.8e-42
Identity = 137/333 (41.14%), Positives = 178/333 (53.45%), Query Frame = 0

Query: 1   MATPLDMSLEDVIKKNNREKLRARGRARRGRGAGGSFNGGRGVVLGSVRRGPLGINARAS 60
           M+  LDMSL+D+I K+NR+   +RGR   G G      GG G   G  RR    + AR +
Sbjct: 1   MSGGLDMSLDDII-KSNRKPTGSRGRGGIGGGNNTGGRGGSGSNSGPSRRFANRVGARTA 60

Query: 61  AYSIRKASFKLSGSVKLLFCGESHCGSPSDPNRLLPPRRMKNVQWQHDLF--EDSLRAS- 120
            YS                             R +  ++  +  WQ+D+F  + S+ A+ 
Sbjct: 61  PYS-----------------------------RPIQQQQAHDAMWQNDVFATDASVAAAF 120

Query: 121 ---------GISGIEIGTKLYVSNLDYGVTKEDIRELFSEIGDLKRFAIHYDKNGRPSGS 180
                    G S IE GTKLY+SNLDYGV+ EDI+ELFSE+GDLKR+ IHYD++GR  G+
Sbjct: 121 GHHQTAVVGGGSSIETGTKLYISNLDYGVSNEDIKELFSEVGDLKRYGIHYDRSGRSKGT 180

Query: 181 AEVVYTRRSDAFAALKRYNNVLLDGKPMKIEMLGDNAEMPVSARINVTGVNGRSRRTVVL 240
           AEVV++RR DA AA+KRYNNV LDGK MKIE++G N   P    +    +          
Sbjct: 181 AEVVFSRRGDALAAVKRYNNVQLDGKLMKIEIVGTNLSAPALPILATAQIP--------- 240

Query: 241 TPESGRTASSNVVNPFPGPSRRGGLRNALGRGRGGW---NRGLGLGGGGRGRGRG-RGRG 300
            P +G   + N    F G        N  GRGRGG+    RG G GGG    GRG RGRG
Sbjct: 241 FPTNGILGNFN--ENFNGNFNGNFNGNFRGRGRGGFMGRPRGGGFGGGNFRGGRGARGRG 291

Query: 301 RGRGQGKKKPVEK-SSDELDKELENYHAEAMQT 317
            GRG G +   E  S+++LD EL+ YH EAM+T
Sbjct: 301 -GRGSGGRGRDENVSAEDLDAELDKYHKEAMET 291

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038896761.1 | 3.6e-139 | 86.71 | THO complex subunit 4D-like [Benincasa hispida]
XP_008457020.1 | 6.4e-136 | 84.81 | PREDICTED: THO complex subunit 4D [Cucumis melo]
XP_004145851.1 | 1.2e-134 | 84.18 | THO complex subunit 4D [Cucumis sativus] >KAE8650594.1 hypothetical protein Csa_...
XP_023527122.1 | 2.5e-132 | 82.41 | THO complex subunit 4D [Cucurbita pepo subsp. pepo]
XP_022935328.1 | 9.6e-132 | 82.10 | THO complex subunit 4D-like [Cucurbita moschata] >XP_022935329.1 THO complex sub...

Match Name | E-value | Identity | Description
Q6NQ72 | 8.6e-67 | 51.84 | THO complex subunit 4D OS=Arabidopsis thaliana OX=3702 GN=ALY4 PE=1 SV=1
Q94EH8 | 2.2e-62 | 48.39 | THO complex subunit 4C OS=Arabidopsis thaliana OX=3702 GN=ALY3 PE=1 SV=1
Q8L719 | 1.4e-40 | 41.14 | THO complex subunit 4B OS=Arabidopsis thaliana OX=3702 GN=ALY2 PE=1 SV=1
Q8L773 | 4.4e-39 | 39.56 | THO complex subunit 4A OS=Arabidopsis thaliana OX=3702 GN=ALY1 PE=1 SV=1
B5FXN8 | 1.8e-32 | 36.71 | THO complex subunit 4 OS=Taeniopygia guttata OX=59729 GN=ALYREF PE=2 SV=1

Match Name | E-value | Identity | Description
A0A1S3C452 | 3.1e-136 | 84.81 | THO complex subunit 4D OS=Cucumis melo OX=3656 GN=LOC103496802 PE=4 SV=1
A0A6J1FAB9 | 4.6e-132 | 82.10 | THO complex subunit 4D-like OS=Cucurbita moschata OX=3662 GN=LOC111442248 PE=4 S...
A0A6J1J564 | 3.0e-131 | 79.94 | THO complex subunit 4D-like OS=Cucurbita maxima OX=3661 GN=LOC111481464 PE=4 SV=...
A0A6J1CP51 | 1.6e-124 | 81.01 | THO complex subunit 4D-like OS=Momordica charantia OX=3673 GN=LOC111013262 PE=4 ...
A0A5D3BPA9 | 1.7e-118 | 94.94 | THO complex subunit 4D OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold4...

Match Name | E-value | Identity | Description
AT5G37720.2 | 2.5e-69 | 52.80 | ALWAYS EARLY 4
AT5G37720.1 | 6.1e-68 | 51.84 | ALWAYS EARLY 4
AT1G66260.1 | 1.6e-63 | 48.39 | RNA-binding (RRM/RBD/RNP motifs) family protein
AT1G66260.2 | 1.6e-63 | 48.39 | RNA-binding (RRM/RBD/RNP motifs) family protein
AT5G02530.1 | 9.8e-42 | 41.14 | RNA-binding (RRM/RBD/RNP motifs) family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000504 | RNA recognition motif domain | SMART | SM00360 | rrm1_1 | coord: 127..199; e-value: 8.3E-19; score: 78.5
IPR000504 | RNA recognition motif domain | PFAM | PF00076 | RRM_1 | coord: 128..196; e-value: 1.1E-14; score: 54.1
IPR000504 | RNA recognition motif domain | PROSITE | PS50102 | RRM | coord: 126..203; score: 15.203922
IPR025715 | Chromatin target of PRMT1 protein, C-terminal | SMART | SM01218 | FoP_duplication_2 | coord: 245..316; e-value: 2.2E-10; score: 50.5
IPR025715 | Chromatin target of PRMT1 protein, C-terminal | PFAM | PF13865 | FoP_duplication | coord: 258..311; e-value: 2.5E-9; score: 37.6
IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | GENE3D | 3.30.70.330 |  | coord: 123..202; e-value: 1.6E-21; score: 78.3
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 289..310
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 232..316
None | No IPR available | PANTHER | PTHR19965:SF33 | THO COMPLEX SUBUNIT 4C | coord: 1..316
None | No IPR available | PANTHER | PTHR19965 | RNA AND EXPORT FACTOR BINDING PROTEIN | coord: 1..316
None | No IPR available | CDD | cd12680 | RRM_THOC4 | coord: 126..200; e-value: 8.23875E-42; score: 137.771
IPR035979 | RNA-binding domain superfamily | SUPERFAMILY | 54928 | RNA-binding domain, RBD | coord: 90..200

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CcUC05G094210 | CcUC05G094210 | gene


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13476577..13477182 | exon
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13478425..13478475 | exon
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13484459..13484646 | exon
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13485375..13485443 | exon
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13485533..13485676 | exon
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13485833..13485905 | exon
CcUC05G094210.1-exon | CcUC05G094210.1-exon-CicolChr05:13485998..13486209 | exon


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CcUC05G094210.1-three_prime_utr | CcUC05G094210.1-three_prime_utr-CicolChr05:13476577..13476968 | three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13476969..13477182 | CDS
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13478425..13478475 | CDS
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13484459..13484646 | CDS
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13485375..13485443 | CDS
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13485533..13485676 | CDS
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13485833..13485905 | CDS
CcUC05G094210.1-cds | CcUC05G094210.1-cds-CicolChr05:13485998..13486209 | CDS
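
As a consistency check on the coordinates listed above, the CDS and 3' UTR segment lengths can be summed and compared with the sequence length reported in the Overview (1343). The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as is usual for this kind of feature listing); the variable and function names are illustrative and not part of the database.

```python
# Intervals copied from the CDS and three_prime_UTR tables for CcUC05G094210.1.
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
CDS = [
    (13476969, 13477182),
    (13478425, 13478475),
    (13484459, 13484646),
    (13485375, 13485443),
    (13485533, 13485676),
    (13485833, 13485905),
    (13485998, 13486209),
]
THREE_PRIME_UTR = [(13476577, 13476968)]


def total_length(intervals):
    """Sum the lengths of 1-based, inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


cds_len = total_length(CDS)              # coding bases across all CDS segments
utr_len = total_length(THREE_PRIME_UTR)  # bases in the 3' UTR
mrna_len = cds_len + utr_len             # spliced transcript length

print(cds_len, utr_len, mrna_len)  # 951 392 1343
```

Under that assumption the CDS totals 951 bases (317 codons, which would correspond to 316 residues plus a stop codon, in line with the 1..316 coordinates in the InterPro table), and CDS plus 3' UTR gives 1343, matching the reported sequence length.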


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
CcUC05G094210.1 | CcUC05G094210.1-protein | polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0003723 | RNA binding