ClCG01G012750 (gene) Watermelon (Charleston Gray)

Name: ClCG01G012750
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Splicing factor U2af 38 kDa subunit
Location: CG_Chr01 : 25612636 .. 25615815 (-)
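
The location string above can be unpacked into its parts to get the gene span. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF3 convention; the page itself does not state this). The helper names `parse_location` and `span_length` are hypothetical and chosen only for illustration.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr01 : 25612636 .. 25615815 (-)")
print(seqid, strand, span_length(start, end))  # CG_Chr01 - 3180
```

Under that assumption the gene spans 3180 bp on the minus strand of CG_Chr01.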
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AAAATAACTCCATTTTCCTTCCCCCTTAATCACAGCCGCATCCTCAACCTCGAACCCTAACTTGATTCGAATTTGATTCTCCAAATTTGTGGGCAGTTTCTTCTTTTTCATCTGCTCTTTGAATTTCGAGTCTGCAATTTCTCTTTCGGCCTCTGAGAACGACGGGGAAGATTACGAACGAACATTCTTTGGTGTCGCTGAGATCCGAGGGGTGGAATGGCGGAGCATTTGGCATCCATATTCGGGACGGAGAAGGATAGGGTTAATTGTCCTTTCTACTTCAAGATCGGGGCCTGTCGGCACGGTGATAGGTGTTCACGTTTACACACGAAGCCTAGTATAAGCCCCACCCTCTTGCTTTCTAATATGTACCAGAGACCCGACATGATTACTCCTGGTGTCGATGCGCAAGGGAATCCAATTGACCCTCGCAAGATCCAGGACCATTTTGAGGTAGATCTTGTTCTGAATTCCGTTCTTTTCCTTTTCAATCTTCTAATATTTGCTAATACATATTTTTCGTCATGAATTTTGATATATTTTGGTGAAAGGATGGGTGCGGATCTTGGCTTTTTTGATTTATTTTATCATAATCTTGCTGTAAATTTAGAGGTGGTCTAATTTGGTTGATTTTCGAGCGATCGTGTTATCATGGATATTGTTTGTTGAATCTTTTTCGTGTCCTAATTTACCTCTTTATGGTCTTGGTGTGGCTTGTAAGCTCTGTTATTTTGAAGTTGGGTTGTTTTTTTCTTGTGGCATGTGGAAGAATTGTAGCTTAATTGTTGATCTGTGTTCGTTGCAGGAGTTCTATGAAGATCTATTCCAGGAACTGAACAAGTACGGGGAGATTGAAAGTTTGAATGTCTGTGACAATCTAGCTGACCATATGGTAATACTGTCCTCTTTGCCCATTGTTTGAAGGGAACTGATTTTACAATCAGCCTTTCAACTTTGATATATGCTATTTCATTGTAGAATTAGCTAATGATCCATCTGATGTGTTTGATTAGGTGGGCAATGTTTATGTTCAATTTCGAGAGGAGGAGCAAGCTGCAAATGCTCTTCGGAATCTTAGTGGTAGATTCTATGCTGGTCAGTTGGAGTTCCTGTCCCCTCCCTTCCCTCTCTCCTGTCCTTTATTTACCGACCATGGTGTTTATATTACTTGTATGCAGCCTTTGATTTTAAAATTGGATGAAATTGGATATGTAACATATGGAGATCTATTTCTTTGCAGGACGTCCGATCATTGTTGACTTCTCTCCAGTTACGGACTTTAGAGAAGCTACATGCAGGCAATATGAAGAAAACATGTGTAATCGTGGTGGATATTGCAATTTCATGCATTTGAAGAGGATCAGTAGGTATATTTTCTTCACCACCTCCTTCTCTTCCCTCTAAGATGCATCATTCTTCCTGTTGATTTTGTATTGCCACTTCTGCTATGTGTACTTCATTTGATAGCCAATCCTTTCCTGTCTGTCAGAGAGCTGAGGCATGAACTGTTTGCAATTTATCGTCGAAGGCGTAGTCACAGCAGGAGCAGAAGCCGCAGCCCCTATAGGCATAGGAGTTATGAAGAACGTTCTTATGGCAGGCATGGTCATAGTCGAAGGTATGATGAGAGAGATGCTTATCATGAAAGTCGGAGTAGGAGGCACAGGACTACAAGCCCTGGTCATAGAAGCAGAAGTCGTAGTCCACGAGGAAGGAAGAACCAAAGTCCAGTTAGGGAAGGGAGTGAAGAGAGGCGTGCTAAAATTGAGCAGTGGAACAGGGAAAGAGAACAAGGAAATATCGATAATAATCCCAATTCTAATGACAACAAAAATAGCCATGAGAACAGCCACAATGGTGATGTGAAGTATGCAAATCAAACTTGTGGCTATGAGGAGCAGCGGCAGCGGCAGCCTCCTGAGCAAGGCTATGGTTACTGATCTGTTTACTTAAACTTAGGTACTACTACCAAGGTTTCTATTTGAGGTCTTGCATTCATT
TTTTTAAAATTTCCATAGTACTCTTATCCCAATTCGTGAGTGGTTGCTGGTTCTAGTGATCTGATCTCCCCAACCTGATCTCTTCTTTAGCTCGTAAGAGTAGGTAACAGCAAGTTCTTTTTATAGGGGATAGGTCAGGTACATGTCCAGCTTAGAGAACCCAGAAAAATAGAAAAAGAAAAAGAAAAGAAAAAAAAAAACTAGAAAAAGGGAAGAAAGGGGGAGCTGTTTACGAGATAGAACACACGAGCTGGTGCTAGAAGATGCAAGAAATAATGCTTGAGAAATAAGTTCTTCCCCCCTATTTGATGTCAACCAGTTTGTTCCCTTTGATTTGTGGAAAGTTATCAAATGTGTAGAAGGAAACCCCCCTAGCTTACTCGGACGTTCTCCAGTTACGTCGTTGGAGTATCGTTGGAGTAAATCTAGAAAGCATCATAATTACAGAAAAAAGTTGACGTAAATCTTTTGACAATCTCTTTTTCTTGTCAGTGCTTTGTTTTTGGATGGATGTTCATCTTATCCTATTTAAATTTCGTTAACATGCATCAATTGCACATATGCATATTTCTAATATCCTTTATTCTATATAGCAAATGTATCATTGATTTTACAGTCAGGGATTAAGACTGCTGCTCATGAGGAACTTTTGAACGATTCCATTATCATACTACGGTTTCTTTTCCCTATCAACTTCTTTGCATTACCCTTTTGTTGACATTAAGAAAAATGTGGATCTCATCATTATCTAAGTTTTGCAGGTTCAGGTTGTTTCCCTGTTTCTAAAGTTGCCAAGCTAAGATGTGACCATGAAATTTTTGTACAGGTTAATATGGTCCACTGTGTAGGATGATTGGCTTTACAGGATGTGCAGGTTATGGATCCAGAAGAACATTTGAAGACGATGCACGTGGCAAAGACGTCATTGGCTCACTTTGGTGTTTTAGCATTGATGGTTGTGGGCATTGCAGGTTTTGAGATGATGCTAAAACATTTTGTATTTGTTTCATATCTTCATGTTAAGAGGAAGAAAACTTGGGGGTGTGCATGATCTCTTTGTTTTCATATTTGAAAGATAGTTGTGTGGAGGAAAACAACGTTATGGTGAACAGGTATTACTGATGAAATGGATAGCCTTAACATATCTGATGTGATGTTTTTTCTTCTGGAAGTTGTCAAG

mRNA sequence

AAAATAACTCCATTTTCCTTCCCCCTTAATCACAGCCGCATCCTCAACCTCGAACCCTAACTTGATTCGAATTTGATTCTCCAAATTTGTGGGCAGTTTCTTCTTTTTCATCTGCTCTTTGAATTTCGAGTCTGCAATTTCTCTTTCGGCCTCTGAGAACGACGGGGAAGATTACGAACGAACATTCTTTGGTGTCGCTGAGATCCGAGGGGTGGAATGGCGGAGCATTTGGCATCCATATTCGGGACGGAGAAGGATAGGGTTAATTGTCCTTTCTACTTCAAGATCGGGGCCTGTCGGCACGGTGATAGGTGTTCACGTTTACACACGAAGCCTAGTATAAGCCCCACCCTCTTGCTTTCTAATATGTACCAGAGACCCGACATGATTACTCCTGGTGTCGATGCGCAAGGGAATCCAATTGACCCTCGCAAGATCCAGGACCATTTTGAGGAGTTCTATGAAGATCTATTCCAGGAACTGAACAAGTACGGGGAGATTGAAAGTTTGAATGTCTGTGACAATCTAGCTGACCATATGGTGGGCAATGTTTATGTTCAATTTCGAGAGGAGGAGCAAGCTGCAAATGCTCTTCGGAATCTTAGTGGTAGATTCTATGCTGGACGTCCGATCATTGTTGACTTCTCTCCAGTTACGGACTTTAGAGAAGCTACATGCAGGCAATATGAAGAAAACATGTGTAATCGTGGTGGATATTGCAATTTCATGCATTTGAAGAGGATCAGTAGAGAGCTGAGGCATGAACTGTTTGCAATTTATCGTCGAAGGCGTAGTCACAGCAGGAGCAGAAGCCGCAGCCCCTATAGGCATAGGAGTTATGAAGAACGTTCTTATGGCAGGCATGGTCATAGTCGAAGGTATGATGAGAGAGATGCTTATCATGAAAGTCGGAGTAGGAGGCACAGGACTACAAGCCCTGGTCATAGAAGCAGAAGTCGTAGTCCACGAGGAAGGAAGAACCAAAGTCCAGTTAGGGAAGGGAGTGAAGAGAGGCGTGCTAAAATTGAGCAGTGGAACAGGGAAAGAGAACAAGGAAATATCGATAATAATCCCAATTCTAATGACAACAAAAATAGCCATGAGAACAGCCACAATGGTGATGTGAAGTATGCAAATCAAACTTGTGGCTATGAGGAGCAGCGGCAGCGGCAGCCTCCTGAGCAAGGCTATGGTTACTGATCTGTTTACTTAAACTTAGGTTAATATGGTCCACTGTGTAGGATGATTGGCTTTACAGGATGTGCAGGTTATGGATCCAGAAGAACATTTGAAGACGATGCACGTGGCAAAGACGTCATTGGCTCACTTTGGTGTTTTAGCATTGATGGTTGTGGGCATTGCAGGTTTTGAGATGATGCTAAAACATTTTGTATTTGTTTCATATCTTCATGTTAAGAGGAAGAAAACTTGGGGGTGTGCATGATCTCTTTGTTTTCATATTTGAAAGATAGTTGTGTGGAGGAAAACAACGTTATGGTGAACAGGTATTACTGATGAAATGGATAGCCTTAACATATCTGATGTGATGTTTTTTCTTCTGGAAGTTGTCAAG

Coding sequence (CDS)

ATGGCGGAGCATTTGGCATCCATATTCGGGACGGAGAAGGATAGGGTTAATTGTCCTTTCTACTTCAAGATCGGGGCCTGTCGGCACGGTGATAGGTGTTCACGTTTACACACGAAGCCTAGTATAAGCCCCACCCTCTTGCTTTCTAATATGTACCAGAGACCCGACATGATTACTCCTGGTGTCGATGCGCAAGGGAATCCAATTGACCCTCGCAAGATCCAGGACCATTTTGAGGAGTTCTATGAAGATCTATTCCAGGAACTGAACAAGTACGGGGAGATTGAAAGTTTGAATGTCTGTGACAATCTAGCTGACCATATGGTGGGCAATGTTTATGTTCAATTTCGAGAGGAGGAGCAAGCTGCAAATGCTCTTCGGAATCTTAGTGGTAGATTCTATGCTGGACGTCCGATCATTGTTGACTTCTCTCCAGTTACGGACTTTAGAGAAGCTACATGCAGGCAATATGAAGAAAACATGTGTAATCGTGGTGGATATTGCAATTTCATGCATTTGAAGAGGATCAGTAGAGAGCTGAGGCATGAACTGTTTGCAATTTATCGTCGAAGGCGTAGTCACAGCAGGAGCAGAAGCCGCAGCCCCTATAGGCATAGGAGTTATGAAGAACGTTCTTATGGCAGGCATGGTCATAGTCGAAGGTATGATGAGAGAGATGCTTATCATGAAAGTCGGAGTAGGAGGCACAGGACTACAAGCCCTGGTCATAGAAGCAGAAGTCGTAGTCCACGAGGAAGGAAGAACCAAAGTCCAGTTAGGGAAGGGAGTGAAGAGAGGCGTGCTAAAATTGAGCAGTGGAACAGGGAAAGAGAACAAGGAAATATCGATAATAATCCCAATTCTAATGACAACAAAAATAGCCATGAGAACAGCCACAATGGTGATGTGAAGTATGCAAATCAAACTTGTGGCTATGAGGAGCAGCGGCAGCGGCAGCCTCCTGAGCAAGGCTATGGTTACTGA

Protein sequence

MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITPGVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEEQAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISRELRHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTSPGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENSHNGDVKYANQTCGYEEQRQRQPPEQGYGY
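
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, the function below translates a DNA coding sequence with the standard genetic code, assuming the input is in frame, contains only A/C/G/T, and should be read up to the first stop codon. The name `translate` is a hypothetical helper, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First five codons of the CDS listed above.
print(translate("ATGGCGGAGCATTTG"))  # MAEHL
```

The first five codons of the CDS give MAEHL, which matches the start of the protein sequence.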
BLAST of ClCG01G012750 vs. Swiss-Prot
Match: U2AFB_ARATH (Splicing factor U2af small subunit B OS=Arabidopsis thaliana GN=U2AF35B PE=1 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 1.9e-114
Identity = 209/293 (71.33%), Positives = 231/293 (78.84%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLH +P+ISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHNRPTISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVD QG P+DP KIQDHFE+FYED+F+ELNK+GE+ESLNVCDNLADHM+GNVYV F+EE+
Sbjct: 61  GVDPQGQPLDPSKIQDHFEDFYEDIFEELNKFGEVESLNVCDNLADHMIGNVYVLFKEED 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            AA AL+ L GRFY+GRPII DFSPVTDFREATCRQYEEN CNRGGYCNFMH+K+ISREL
Sbjct: 121 HAAAALQALQGRFYSGRPIIADFSPVTDFREATCRQYEENSCNRGGYCNFMHVKQISREL 180

Query: 181 RHELFAIYR---RRRSHSRSRSRSPYRHRSY-EERSYG------RHGHSRRYDERDAYHE 240
           R +LF  YR   RR S SRSRS SP R R +  ER  G      RHG+ +R  +R   H+
Sbjct: 181 RRKLFGRYRRSYRRGSRSRSRSISPRRKREHSRERERGDVRDRDRHGNGKRSSDRSERHD 240

Query: 241 ---SRSRRHRTTSPGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQG 281
                 RRH     G   RSRSPR       VREGSEERRA+IEQWNRER++G
Sbjct: 241 RDGGGRRRH-----GSPKRSRSPRN------VREGSEERRARIEQWNRERDEG 282
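
The percentages in each HSP summary line are the match counts divided by the alignment length. The short sketch below reproduces the figures for this first hit; `percent` is a hypothetical helper name used only for illustration.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"

# Counts taken from the U2AFB_ARATH HSP summary.
print(percent(209, 293))  # 71.33  (identical residues)
print(percent(231, 293))  # 78.84  (identical or conservatively substituted residues)
```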

BLAST of ClCG01G012750 vs. Swiss-Prot
Match: U2AFA_ORYSJ (Splicing factor U2af small subunit A OS=Oryza sativa subsp. japonica GN=U2AF35A PE=2 SV=1)

HSP 1 Score: 409.8 bits (1052), Expect = 2.7e-113
Identity = 207/298 (69.46%), Positives = 232/298 (77.85%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLH KPS+SPTLLLSNMY RPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHNKPSVSPTLLLSNMYLRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           G+DAQGNPIDP KIQ  FE+FYED+F+EL+KYGEIESL+VCDN ADHM+GNVYVQFREE+
Sbjct: 61  GIDAQGNPIDPEKIQADFEDFYEDIFEELSKYGEIESLHVCDNFADHMIGNVYVQFREED 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QAA AL+ L+GR+Y+GRPIIV+FSPV+DFREATCRQYEEN CNRGGYCNFMH+K I R+L
Sbjct: 121 QAARALQALTGRYYSGRPIIVEFSPVSDFREATCRQYEENSCNRGGYCNFMHVKEIGRDL 180

Query: 181 RHELFA-IYRRRRSHS--RSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYH-------- 240
           R  LF  ++R RRSHS  RSRS SPY +R    R Y R   SR  D  D Y         
Sbjct: 181 RKRLFGHLHRSRRSHSHGRSRSPSPYHYR----RDYDRRSSSRSRDHDDYYRGGSHDYYR 240

Query: 241 ---ESRSRRHRTT--SPGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNI 283
                 S RHR++  S G R R RS    + +SPVR+GSEERRA+IEQWNRERE   +
Sbjct: 241 GGSRRSSERHRSSYDSDGSRRRHRS----RTRSPVRDGSEERRAQIEQWNREREAAQV 290

BLAST of ClCG01G012750 vs. Swiss-Prot
Match: U2AFA_ARATH (Splicing factor U2af small subunit A OS=Arabidopsis thaliana GN=U2AF35A PE=1 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 1.4e-112
Identity = 201/292 (68.84%), Positives = 225/292 (77.05%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLH +P+ISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHNRPTISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQG P+DPRKIQ+HFE+F+EDLF+EL K+GEIESLN+CDNLADHM+GNVYVQF+EE+
Sbjct: 61  GVDAQGQPLDPRKIQEHFEDFFEDLFEELGKFGEIESLNICDNLADHMIGNVYVQFKEED 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QAA AL+ L GRFY+GRPII DFSPVTDFREATCRQYEEN CNRGGYCNFMH+K +SREL
Sbjct: 121 QAAAALQALQGRFYSGRPIIADFSPVTDFREATCRQYEENNCNRGGYCNFMHVKLVSREL 180

Query: 181 RHELFAIYRR-----RRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRR 240
           R +LF  YRR      RS SRSRS SP   R  + R       S R  +R+ Y     + 
Sbjct: 181 RRKLFGRYRRSYRRGSRSRSRSRSISPRNKRDNDRRDPSHREFSHRDRDREFYRHGSGK- 240

Query: 241 HRTTSPGHRSRSRSPRGRKNQSPV--------REGSEERRAKIEQWNREREQ 280
            R++    R      RGR+  SP         REGSEERRA+IEQWNRERE+
Sbjct: 241 -RSSERSERQERDGSRGRRQASPKRGGSPGGGREGSEERRARIEQWNREREE 290

BLAST of ClCG01G012750 vs. Swiss-Prot
Match: U2AFB_ORYSJ (Splicing factor U2af small subunit B OS=Oryza sativa subsp. japonica GN=U2AF35B PE=2 SV=1)

HSP 1 Score: 387.1 bits (993), Expect = 1.9e-106
Identity = 194/304 (63.82%), Positives = 223/304 (73.36%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLH +P++SPT++L+NMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHNRPTVSPTIVLANMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQG PIDP K+Q+HFE+FYED+++EL+K+GE+E+LNVCDNLADHM+GNVYVQFREEE
Sbjct: 61  GVDAQGQPIDPEKMQEHFEDFYEDIYEELSKFGEVETLNVCDNLADHMIGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QA  A   L GRFY+GRPIIV++SPVTDFREATCRQ+EEN CNRGGYCNFMH+K+I REL
Sbjct: 121 QAVAAHNALQGRFYSGRPIIVEYSPVTDFREATCRQFEENSCNRGGYCNFMHVKQIGREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRH------------------GHSRRY 240
           R +L+   R RRSH RSRS SP   R   +R   R                   G    Y
Sbjct: 181 RRKLYG-GRSRRSHGRSRSPSPRHRRGNRDRDDFRRERDGYRGGGDGYRGGGGGGGGDGY 240

Query: 241 DERDAYH-------ESRSRRHRTTSPGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNR 280
              D+Y             R+     G R R  SP  R+ +SPVRE SEERRAKIEQWNR
Sbjct: 241 RGGDSYRGGGGGGRRGGGSRYDRYDDGGRRRHGSPP-RRARSPVRESSEERRAKIEQWNR 300

BLAST of ClCG01G012750 vs. Swiss-Prot
Match: U2AF1_BOVIN (Splicing factor U2AF 35 kDa subunit OS=Bos taurus GN=U2AF1 PE=2 SV=1)

HSP 1 Score: 263.5 bits (672), Expect = 3.2e-69
Identity = 133/235 (56.60%), Positives = 161/235 (68.51%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAE+LASIFGTEKD+VNC FYFKIGACRHGDRCSRLH KP+ S T+ L N+Y+ P   + 
Sbjct: 1   MAEYLASIFGTEKDKVNCSFYFKIGACRHGDRCSRLHNKPTFSQTIALLNIYRNPQNSSQ 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELN-KYGEIESLNVCDNLADHMVGNVYVQFREE 120
             D     +   ++Q+H++EF+E++F E+  KYGE+E +NVCDNL DH+VGNVYV+FR E
Sbjct: 61  SADGLRCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDHLVGNVYVKFRRE 120

Query: 121 EQAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISRE 180
           E A  A+ +L+ R++ G+PI  + SPVTDFREA CRQYE   C RGG+CNFMHLK ISRE
Sbjct: 121 EDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMGECTRGGFCNFMHLKPISRE 180

Query: 181 LRHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSR 235
           LR EL   Y RRR   RSRSRS  R     +R  G  G      ERD    SR R
Sbjct: 181 LRREL---YGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGGRERDR-RRSRDR 231

BLAST of ClCG01G012750 vs. TrEMBL
Match: A0A0A0LVA4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G145960 PE=4 SV=1)

HSP 1 Score: 655.6 bits (1690), Expect = 3.2e-185
Identity = 309/327 (94.50%), Positives = 320/327 (97.86%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQGNPIDPR IQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE
Sbjct: 61  GVDAQGNPIDPRNIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRI REL
Sbjct: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRIGREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTS 240
           RHELFA+YRRR SHSRSRSRSPYRHRSYEERSYG+HGHSRRYDERDAYHESRSRRHRTTS
Sbjct: 181 RHELFAMYRRRHSHSRSRSRSPYRHRSYEERSYGKHGHSRRYDERDAYHESRSRRHRTTS 240

Query: 241 PGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENSHN 300
           PGHRSRSRSPRGRKN+SPVREGSEERRAKIEQWN+EREQGN DNN NS+DN+N+HE S++
Sbjct: 241 PGHRSRSRSPRGRKNRSPVREGSEERRAKIEQWNKEREQGN-DNNANSDDNRNNHEKSYD 300

Query: 301 GDVKYANQTCGYEEQRQRQPPEQGYGY 328
            +VKYANQTCGYEEQ+QRQPPEQGYGY
Sbjct: 301 SEVKYANQTCGYEEQQQRQPPEQGYGY 326

BLAST of ClCG01G012750 vs. TrEMBL
Match: A0A061GVP9_THECC (Zinc finger C-x8-C-x5-C-x3-H type family protein isoform 1 OS=Theobroma cacao GN=TCM_041518 PE=4 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 9.0e-148
Identity = 257/319 (80.56%), Positives = 286/319 (89.66%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GV+ QGNPIDPRKIQ+HFEEFYEDLF+EL+KYGEIESLN+CDNLADHMVGNVYVQFREEE
Sbjct: 61  GVENQGNPIDPRKIQEHFEEFYEDLFEELSKYGEIESLNICDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            AANALRNLSGR+Y+GRPIIVDFSPVTDFREATCRQYEEN CNRGGYCNFMHLK ISREL
Sbjct: 121 HAANALRNLSGRYYSGRPIIVDFSPVTDFREATCRQYEENTCNRGGYCNFMHLKTISREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHR-SYEERSYGRHGHSRRYDERDAYHESRSRRHRTT 240
           R +LF  YRRRRSHS+SRSRSP +HR S+EERS+G  GH RRY +RD YHESRS+RHR+T
Sbjct: 181 RRQLFGRYRRRRSHSQSRSRSPPKHRGSHEERSHGGRGHIRRYGDRDHYHESRSKRHRST 240

Query: 241 SPGH-RSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENS 300
           SPGH R RSRSP G++N+SPVREGSEERRAKIEQWNREREQ N +   N   N N++EN 
Sbjct: 241 SPGHRRGRSRSPGGKRNRSPVREGSEERRAKIEQWNREREQENANRVDNDAAN-NNNENG 300

Query: 301 HNGDVKYANQTCGYEEQRQ 318
           +NG  K  ++  G+++Q++
Sbjct: 301 NNGYAKNDDKYYGHQQQQE 318

BLAST of ClCG01G012750 vs. TrEMBL
Match: A0A059DB43_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B03897 PE=4 SV=1)

HSP 1 Score: 530.4 bits (1365), Expect = 1.5e-147
Identity = 263/331 (79.46%), Positives = 286/331 (86.40%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQG PIDPRKIQ+HFE+FYEDLF+ELNKYGE+ESLNVCDNLADHMVGNVYVQFREEE
Sbjct: 61  GVDAQGQPIDPRKIQEHFEDFYEDLFEELNKYGEMESLNVCDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QA  AL++LSGR+YAGRPIIVD+SPVTDFREATCRQYEE+ CNRGGYCNFMHLK ISREL
Sbjct: 121 QAQRALQSLSGRYYAGRPIIVDYSPVTDFREATCRQYEEDKCNRGGYCNFMHLKSISREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTS 240
           R +LF  YRRR S SRSRSRSPYRHRSYEE SYG  G+ RR+DE D YH+SRSRRHR+TS
Sbjct: 181 RRQLFGRYRRRHSRSRSRSRSPYRHRSYEEHSYGGRGYRRRHDEYD-YHDSRSRRHRSTS 240

Query: 241 PGH-RSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSH-ENS 300
           PGH R RSRSP+GR+N SPVREGSEERRA+IEQWNRERE     N  NSN N N + EN+
Sbjct: 241 PGHRRGRSRSPQGRRNASPVREGSEERRARIEQWNREREWQENANVANSNHNSNGNLENA 300

Query: 301 HNGDVKYANQTCGYEEQRQRQPPEQG--YGY 328
            N   + +   C Y  Q+Q+QPP Q   YGY
Sbjct: 301 MNHHAR-SGDLCAY--QKQQQPPSQDGVYGY 327

BLAST of ClCG01G012750 vs. TrEMBL
Match: A0A061GZT2_THECC (Zinc finger C-x8-C-x5-C-x3-H type family protein OS=Theobroma cacao GN=TCM_041519 PE=4 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 3.4e-147
Identity = 258/319 (80.88%), Positives = 284/319 (89.03%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQGNPIDPRKIQ+HFE FYEDLF+EL+KYGE+ESLN+CDNLADHMVGNVYVQF+EEE
Sbjct: 61  GVDAQGNPIDPRKIQEHFEGFYEDLFEELSKYGELESLNICDNLADHMVGNVYVQFKEEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            AANALRNLSGRFYA RPIIVDFSPVTDFREATCRQY+EN CNRGGYCNFMHLKRISREL
Sbjct: 121 HAANALRNLSGRFYAARPIIVDFSPVTDFREATCRQYDENTCNRGGYCNFMHLKRISREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHR-SYEERSYGRHGHSRRYDERDAYHESRSRRHRTT 240
           + +LF  YRRRRSH  SRSRSP RHR S+EERS+G  GHSRRYD+RD YHE+RSRRHR+T
Sbjct: 181 KRQLFGRYRRRRSH--SRSRSPQRHRSSHEERSHGGRGHSRRYDDRDRYHENRSRRHRST 240

Query: 241 SPGH-RSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENS 300
           SPGH R RSRSP G++N+SPVREGSEERRAKIEQWNREREQ    N  ++N   N++EN 
Sbjct: 241 SPGHRRGRSRSPGGKRNRSPVREGSEERRAKIEQWNREREQEENANKVDNNAADNNNENG 300

Query: 301 HNGDVKYANQTCGYEEQRQ 318
           +NG V+  N    Y+ Q++
Sbjct: 301 NNGYVQ--NDDKNYQHQQE 315

BLAST of ClCG01G012750 vs. TrEMBL
Match: U5GJS9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s20030g PE=4 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 8.5e-146
Identity = 258/331 (77.95%), Positives = 281/331 (84.89%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPS+SPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSVSPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQGNPIDPR+IQ HFEEFYEDLF+EL KYGEIESLNVCDNLADHMVGNVYVQFREEE
Sbjct: 61  GVDAQGNPIDPRRIQQHFEEFYEDLFEELRKYGEIESLNVCDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            A+NAL+NL+GRFYAGRPIIVDFSPVTDFREATCRQYEEN CNRGGYCNFMHLKRI REL
Sbjct: 121 HASNALKNLTGRFYAGRPIIVDFSPVTDFREATCRQYEENACNRGGYCNFMHLKRIGREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTS 240
           R +LF  YRRRRSH  SRSRSPYRHRS+EE S+   G  RRYD+R+ Y+ESRSRRHR+TS
Sbjct: 181 RRQLFGSYRRRRSH--SRSRSPYRHRSHEEHSHSGRGSGRRYDDREHYYESRSRRHRSTS 240

Query: 241 PGHR-SRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQ--GNIDNNPNSNDNKNSHEN 300
           PGHR  RSRSP GR+ +SPVREGSEERRAKI QWNREREQ  G  +NN N++ + N +  
Sbjct: 241 PGHRKGRSRSPGGRRKRSPVREGSEERRAKIAQWNREREQQEGTANNNVNADSSNNDNGQ 300

Query: 301 SHNGDVKYANQTCGYEEQRQRQPPEQG-YGY 328
             NG           +E +Q  PP+QG Y Y
Sbjct: 301 MQNG---------SGQEYQQHIPPQQGEYAY 320

BLAST of ClCG01G012750 vs. TAIR10
Match: AT5G42820.2 (AT5G42820.2 Zinc finger C-x8-C-x5-C-x3-H type family protein)

HSP 1 Score: 413.7 bits (1062), Expect = 1.1e-115
Identity = 209/293 (71.33%), Positives = 231/293 (78.84%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLH +P+ISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHNRPTISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVD QG P+DP KIQDHFE+FYED+F+ELNK+GE+ESLNVCDNLADHM+GNVYV F+EE+
Sbjct: 61  GVDPQGQPLDPSKIQDHFEDFYEDIFEELNKFGEVESLNVCDNLADHMIGNVYVLFKEED 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            AA AL+ L GRFY+GRPII DFSPVTDFREATCRQYEEN CNRGGYCNFMH+K+ISREL
Sbjct: 121 HAAAALQALQGRFYSGRPIIADFSPVTDFREATCRQYEENSCNRGGYCNFMHVKQISREL 180

Query: 181 RHELFAIYR---RRRSHSRSRSRSPYRHRSY-EERSYG------RHGHSRRYDERDAYHE 240
           R +LF  YR   RR S SRSRS SP R R +  ER  G      RHG+ +R  +R   H+
Sbjct: 181 RRKLFGRYRRSYRRGSRSRSRSISPRRKREHSRERERGDVRDRDRHGNGKRSSDRSERHD 240

Query: 241 ---SRSRRHRTTSPGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQG 281
                 RRH     G   RSRSPR       VREGSEERRA+IEQWNRER++G
Sbjct: 241 RDGGGRRRH-----GSPKRSRSPRN------VREGSEERRARIEQWNRERDEG 282

BLAST of ClCG01G012750 vs. TAIR10
Match: AT1G27650.1 (AT1G27650.1 U2 snRNP auxiliary factor small subunit, putative)

HSP 1 Score: 407.5 bits (1046), Expect = 7.6e-114
Identity = 201/292 (68.84%), Positives = 225/292 (77.05%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLH +P+ISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHNRPTISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQG P+DPRKIQ+HFE+F+EDLF+EL K+GEIESLN+CDNLADHM+GNVYVQF+EE+
Sbjct: 61  GVDAQGQPLDPRKIQEHFEDFFEDLFEELGKFGEIESLNICDNLADHMIGNVYVQFKEED 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QAA AL+ L GRFY+GRPII DFSPVTDFREATCRQYEEN CNRGGYCNFMH+K +SREL
Sbjct: 121 QAAAALQALQGRFYSGRPIIADFSPVTDFREATCRQYEENNCNRGGYCNFMHVKLVSREL 180

Query: 181 RHELFAIYRR-----RRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRR 240
           R +LF  YRR      RS SRSRS SP   R  + R       S R  +R+ Y     + 
Sbjct: 181 RRKLFGRYRRSYRRGSRSRSRSRSISPRNKRDNDRRDPSHREFSHRDRDREFYRHGSGK- 240

Query: 241 HRTTSPGHRSRSRSPRGRKNQSPV--------REGSEERRAKIEQWNREREQ 280
            R++    R      RGR+  SP         REGSEERRA+IEQWNRERE+
Sbjct: 241 -RSSERSERQERDGSRGRRQASPKRGGSPGGGREGSEERRARIEQWNREREE 290

BLAST of ClCG01G012750 vs. TAIR10
Match: AT1G10320.1 (AT1G10320.1 Zinc finger C-x8-C-x5-C-x3-H type family protein)

HSP 1 Score: 143.3 bits (360), Expect = 2.7e-34
Identity = 92/297 (30.98%), Positives = 142/297 (47.81%), Query Frame = 1

Query: 9   FGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITPGVDAQGNP 68
           FGTE+D+ +CPF+ K GACR G RCSR+H  P+ S TLL+ NMY  P  IT   D +G  
Sbjct: 237 FGTEQDKAHCPFHLKTGACRFGQRCSRVHFYPNKSCTLLMKNMYNGPG-ITWEQD-EGLE 296

Query: 69  IDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEEQAANALRN 128
               + +  +EEFYED+  E  KYGE+ +  VC N + H+ GNVYV +R  E A  A ++
Sbjct: 297 YTDEEAELCYEEFYEDVHTEFLKYGELVNFKVCRNGSFHLKGNVYVHYRSLESAILAYQS 356

Query: 129 LSGRFYAGRPIIVDFSPVTDFREATCRQYEEN---MCNRGGYCNFMHLKRISRELRHELF 188
           ++GR++AG+ +  +F  ++ ++ A C +Y ++    C+RG  CNF+H             
Sbjct: 357 INGRYFAGKQVNCEFVNISRWKVAICGEYMKSRLKTCSRGSACNFIH------------- 416

Query: 189 AIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTSPGHRS 248
             +R                R +  +     G+S   DE+   HES    + + S     
Sbjct: 417 -CFRNPGGDYEWADHDRPPPRFWIHKMTSLFGYS---DEKHMEHESSGSLNDSISDLSTD 476

Query: 249 RSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENSHNGD 303
             R P  R   S  R+           +   +  G+  ++   +  +   EN H+GD
Sbjct: 477 SHRQPSRR---SRSRDHDHANVGSTPSYRSRKYHGDTQDSTREDKLRRHAENCHDGD 511

BLAST of ClCG01G012750 vs. TAIR10
Match: AT3G44785.1 (AT3G44785.1 Zinc finger C-x8-C-x5-C-x3-H type family protein)

HSP 1 Score: 105.9 bits (263), Expect = 4.7e-23
Identity = 51/73 (69.86%), Positives = 55/73 (75.34%), Query Frame = 1

Query: 1  MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
          M EHLASI+GTEKDRVNCPFYFKIG CR+GDRCSRL+TKPSISPTLLLSN YQ+  +   
Sbjct: 1  MVEHLASIYGTEKDRVNCPFYFKIGVCRNGDRCSRLYTKPSISPTLLLSNTYQQGRLKQF 60

Query: 61 GVDAQGNPIDPRK 74
              Q    DP+K
Sbjct: 61 LDPVQSREKDPKK 73

BLAST of ClCG01G012750 vs. TAIR10
Match: AT5G64200.1 (AT5G64200.1 ortholog of human splicing factor SC35)

HSP 1 Score: 59.7 bits (143), Expect = 3.9e-09
Identity = 53/176 (30.11%), Positives = 73/176 (41.48%), Query Frame = 1

Query: 78  FEEFYEDLFQELNKYGEIESLNVC-DNLADHMVGNVYVQFREEEQAANALRNLSGRFYAG 137
           F    +DL+    KYG++  + +  D       G  +V+++ +++A  A+  L GR   G
Sbjct: 25  FRTTADDLYPLFAKYGKVVDVFIPRDRRTGDSRGFAFVRYKYKDEAHKAVERLDGRVVDG 84

Query: 138 RPIIVDFSPVTDFRE--ATCRQYEENMCNRGGYCNFMHLKRISRELRHELFAIYRRRRSH 197
           R I V F+      E  +  R  E    +R          R  R  R        RRRS 
Sbjct: 85  REITVQFAKYGPNAEKISKGRVVEPPPKSRRSRSRSPRRSRSPRRSRSP-----PRRRSP 144

Query: 198 SRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTSPGHRSRSRSP 251
            RSRS        Y E+ Y +   SR YD R+  HE + R HR  +   RSRS SP
Sbjct: 145 RRSRSPRRRSRDDYREKDYRKRSRSRSYDRRER-HEEKDRDHRRRT---RSRSASP 191

BLAST of ClCG01G012750 vs. NCBI nr
Match: gi|449443402|ref|XP_004139466.1| (PREDICTED: splicing factor U2af small subunit B-like [Cucumis sativus])

HSP 1 Score: 655.6 bits (1690), Expect = 4.6e-185
Identity = 309/327 (94.50%), Positives = 320/327 (97.86%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQGNPIDPR IQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE
Sbjct: 61  GVDAQGNPIDPRNIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRI REL
Sbjct: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRIGREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTS 240
           RHELFA+YRRR SHSRSRSRSPYRHRSYEERSYG+HGHSRRYDERDAYHESRSRRHRTTS
Sbjct: 181 RHELFAMYRRRHSHSRSRSRSPYRHRSYEERSYGKHGHSRRYDERDAYHESRSRRHRTTS 240

Query: 241 PGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENSHN 300
           PGHRSRSRSPRGRKN+SPVREGSEERRAKIEQWN+EREQGN DNN NS+DN+N+HE S++
Sbjct: 241 PGHRSRSRSPRGRKNRSPVREGSEERRAKIEQWNKEREQGN-DNNANSDDNRNNHEKSYD 300

Query: 301 GDVKYANQTCGYEEQRQRQPPEQGYGY 328
            +VKYANQTCGYEEQ+QRQPPEQGYGY
Sbjct: 301 SEVKYANQTCGYEEQQQRQPPEQGYGY 326

BLAST of ClCG01G012750 vs. NCBI nr
Match: gi|659071800|ref|XP_008461981.1| (PREDICTED: splicing factor U2af small subunit B-like [Cucumis melo])

HSP 1 Score: 654.8 bits (1688), Expect = 7.8e-185
Identity = 310/327 (94.80%), Positives = 319/327 (97.55%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE
Sbjct: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRI REL
Sbjct: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRIGREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTS 240
           RHELFA+YRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHE+RSRR RTTS
Sbjct: 181 RHELFAMYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHENRSRRRRTTS 240

Query: 241 PGHRSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENSHN 300
           PGHRSRSRSPRGRKN+SP REGSEERRAKIEQWNREREQGN DNN  S+DN+NSHE S++
Sbjct: 241 PGHRSRSRSPRGRKNRSPAREGSEERRAKIEQWNREREQGN-DNNAKSDDNRNSHEKSYD 300

Query: 301 GDVKYANQTCGYEEQRQRQPPEQGYGY 328
            +VKYANQTCGYEEQ+QRQPPEQGYGY
Sbjct: 301 SEVKYANQTCGYEEQQQRQPPEQGYGY 326

BLAST of ClCG01G012750 vs. NCBI nr
Match: gi|590587412|ref|XP_007015957.1| (Zinc finger C-x8-C-x5-C-x3-H type family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 531.2 bits (1367), Expect = 1.3e-147
Identity = 257/319 (80.56%), Positives = 286/319 (89.66%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GV+ QGNPIDPRKIQ+HFEEFYEDLF+EL+KYGEIESLN+CDNLADHMVGNVYVQFREEE
Sbjct: 61  GVENQGNPIDPRKIQEHFEEFYEDLFEELSKYGEIESLNICDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            AANALRNLSGR+Y+GRPIIVDFSPVTDFREATCRQYEEN CNRGGYCNFMHLK ISREL
Sbjct: 121 HAANALRNLSGRYYSGRPIIVDFSPVTDFREATCRQYEENTCNRGGYCNFMHLKTISREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHR-SYEERSYGRHGHSRRYDERDAYHESRSRRHRTT 240
           R +LF  YRRRRSHS+SRSRSP +HR S+EERS+G  GH RRY +RD YHESRS+RHR+T
Sbjct: 181 RRQLFGRYRRRRSHSQSRSRSPPKHRGSHEERSHGGRGHIRRYGDRDHYHESRSKRHRST 240

Query: 241 SPGH-RSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENS 300
           SPGH R RSRSP G++N+SPVREGSEERRAKIEQWNREREQ N +   N   N N++EN 
Sbjct: 241 SPGHRRGRSRSPGGKRNRSPVREGSEERRAKIEQWNREREQENANRVDNDAAN-NNNENG 300

Query: 301 HNGDVKYANQTCGYEEQRQ 318
           +NG  K  ++  G+++Q++
Sbjct: 301 NNGYAKNDDKYYGHQQQQE 318

BLAST of ClCG01G012750 vs. NCBI nr
Match: gi|702280342|ref|XP_010045262.1| (PREDICTED: splicing factor U2af small subunit B-like [Eucalyptus grandis])

HSP 1 Score: 530.4 bits (1365), Expect = 2.2e-147
Identity = 263/331 (79.46%), Positives = 286/331 (86.40%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQG PIDPRKIQ+HFE+FYEDLF+ELNKYGE+ESLNVCDNLADHMVGNVYVQFREEE
Sbjct: 61  GVDAQGQPIDPRKIQEHFEDFYEDLFEELNKYGEMESLNVCDNLADHMVGNVYVQFREEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
           QA  AL++LSGR+YAGRPIIVD+SPVTDFREATCRQYEE+ CNRGGYCNFMHLK ISREL
Sbjct: 121 QAQRALQSLSGRYYAGRPIIVDYSPVTDFREATCRQYEEDKCNRGGYCNFMHLKSISREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHRSYEERSYGRHGHSRRYDERDAYHESRSRRHRTTS 240
           R +LF  YRRR S SRSRSRSPYRHRSYEE SYG  G+ RR+DE D YH+SRSRRHR+TS
Sbjct: 181 RRQLFGRYRRRHSRSRSRSRSPYRHRSYEEHSYGGRGYRRRHDEYD-YHDSRSRRHRSTS 240

Query: 241 PGH-RSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSH-ENS 300
           PGH R RSRSP+GR+N SPVREGSEERRA+IEQWNRERE     N  NSN N N + EN+
Sbjct: 241 PGHRRGRSRSPQGRRNASPVREGSEERRARIEQWNREREWQENANVANSNHNSNGNLENA 300

Query: 301 HNGDVKYANQTCGYEEQRQRQPPEQG--YGY 328
            N   + +   C Y  Q+Q+QPP Q   YGY
Sbjct: 301 MNHHAR-SGDLCAY--QKQQQPPSQDGVYGY 327
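The Identity and Positives figures in an HSP header can be re-derived from the alignment itself. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution, a blank marks a mismatch or gap); the function names `midline_counts` and `percent` are hypothetical helpers, not part of any BLAST tool. It counts the final 31-column block of the Eucalyptus grandis HSP and reproduces the header percentages from the reported totals.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from one BLAST midline string.

    Assumption: letters are identical residues and '+' marks a conservative
    substitution; positives = identities + conservative substitutions.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


def percent(numerator: int, denominator: int) -> str:
    """Format a ratio as a percentage with two decimals, as in the HSP header."""
    return f"{100 * numerator / denominator:.2f}"


# Midline of the last alignment block (query residues labelled 301..328).
last_midline = " N   + +   C Y  Q+Q+QPP Q   YGY"
ident, pos = midline_counts(last_midline)
print(ident, pos)  # identities and positives within this block only

# Totals reported in the HSP header: 263 identities, 286 positives, 331 columns.
print(percent(263, 331), percent(286, 331))
```

Summing the per-block counts over all six blocks of the HSP should give the header totals; the denominator (331) is the number of alignment columns, gaps included, not the protein length.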

BLAST of ClCG01G012750 vs. NCBI nr
Match: gi|590587422|ref|XP_007015960.1| (Zinc finger C-x8-C-x5-C-x3-H type family protein [Theobroma cacao])

HSP 1 Score: 529.3 bits (1362), Expect = 4.9e-147
Identity = 258/319 (80.88%), Positives = 284/319 (89.03%), Query Frame = 1

Query: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60
           MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP
Sbjct: 1   MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP 60

Query: 61  GVDAQGNPIDPRKIQDHFEEFYEDLFQELNKYGEIESLNVCDNLADHMVGNVYVQFREEE 120
           GVDAQGNPIDPRKIQ+HFE FYEDLF+EL+KYGE+ESLN+CDNLADHMVGNVYVQF+EEE
Sbjct: 61  GVDAQGNPIDPRKIQEHFEGFYEDLFEELSKYGELESLNICDNLADHMVGNVYVQFKEEE 120

Query: 121 QAANALRNLSGRFYAGRPIIVDFSPVTDFREATCRQYEENMCNRGGYCNFMHLKRISREL 180
            AANALRNLSGRFYA RPIIVDFSPVTDFREATCRQY+EN CNRGGYCNFMHLKRISREL
Sbjct: 121 HAANALRNLSGRFYAARPIIVDFSPVTDFREATCRQYDENTCNRGGYCNFMHLKRISREL 180

Query: 181 RHELFAIYRRRRSHSRSRSRSPYRHR-SYEERSYGRHGHSRRYDERDAYHESRSRRHRTT 240
           + +LF  YRRRRSH  SRSRSP RHR S+EERS+G  GHSRRYD+RD YHE+RSRRHR+T
Sbjct: 181 KRQLFGRYRRRRSH--SRSRSPQRHRSSHEERSHGGRGHSRRYDDRDRYHENRSRRHRST 240

Query: 241 SPGH-RSRSRSPRGRKNQSPVREGSEERRAKIEQWNREREQGNIDNNPNSNDNKNSHENS 300
           SPGH R RSRSP G++N+SPVREGSEERRAKIEQWNREREQ    N  ++N   N++EN 
Sbjct: 241 SPGHRRGRSRSPGGKRNRSPVREGSEERRAKIEQWNREREQEENANKVDNNAADNNNENG 300

Query: 301 HNGDVKYANQTCGYEEQRQ 318
           +NG V+  N    Y+ Q++
Sbjct: 301 NNGYVQ--NDDKNYQHQQE 315

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
U2AFB_ARATH	1.9e-114	71.33	Splicing factor U2af small subunit B OS=Arabidopsis thaliana GN=U2AF35B PE=1 SV=...
U2AFA_ORYSJ	2.7e-113	69.46	Splicing factor U2af small subunit A OS=Oryza sativa subsp. japonica GN=U2AF35A ...
U2AFA_ARATH	1.4e-112	68.84	Splicing factor U2af small subunit A OS=Arabidopsis thaliana GN=U2AF35A PE=1 SV=...
U2AFB_ORYSJ	1.9e-106	63.82	Splicing factor U2af small subunit B OS=Oryza sativa subsp. japonica GN=U2AF35B ...
U2AF1_BOVIN	3.2e-69	56.60	Splicing factor U2AF 35 kDa subunit OS=Bos taurus GN=U2AF1 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A0A0LVA4_CUCSA	3.2e-185	94.50	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G145960 PE=4 SV=1
A0A061GVP9_THECC	9.0e-148	80.56	Zinc finger C-x8-C-x5-C-x3-H type family protein isoform 1 OS=Theobroma cacao GN...
A0A059DB43_EUCGR	1.5e-147	79.46	Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B03897 PE=4 SV=1
A0A061GZT2_THECC	3.4e-147	80.88	Zinc finger C-x8-C-x5-C-x3-H type family protein OS=Theobroma cacao GN=TCM_04151...
U5GJS9_POPTR	8.5e-146	77.95	Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s20030g PE=4 SV=1

Match Name	E-value	Identity	Description
AT5G42820.2	1.1e-115	71.33	Zinc finger C-x8-C-x5-C-x3-H type family protein
AT1G27650.1	7.6e-114	68.84	U2 snRNP auxiliary factor small subunit, putative
AT1G10320.1	2.7e-34	30.98	Zinc finger C-x8-C-x5-C-x3-H type family protein
AT3G44785.1	4.7e-23	69.86	Zinc finger C-x8-C-x5-C-x3-H type family protein
AT5G64200.1	3.9e-09	30.11	ortholog of human splicing factor SC35

Match Name	E-value	Identity	Description
gi|449443402|ref|XP_004139466.1|	4.6e-185	94.50	PREDICTED: splicing factor U2af small subunit B-like [Cucumis sativus]
gi|659071800|ref|XP_008461981.1|	7.8e-185	94.80	PREDICTED: splicing factor U2af small subunit B-like [Cucumis melo]
gi|590587412|ref|XP_007015957.1|	1.3e-147	80.56	Zinc finger C-x8-C-x5-C-x3-H type family protein isoform 1 [Theobroma cacao]
gi|702280342|ref|XP_010045262.1|	2.2e-147	79.46	PREDICTED: splicing factor U2af small subunit B-like [Eucalyptus grandis]
gi|590587422|ref|XP_007015960.1|	4.9e-147	80.88	Zinc finger C-x8-C-x5-C-x3-H type family protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000504	RRM_dom
IPR000571	Znf_CCCH
IPR003954	RRM_dom_euk
IPR009145	U2AF_small
IPR012677	Nucleotide-bd_a/b_plait_sf
Vocabulary: Molecular Function
Term	Definition
GO:0003676	nucleic acid binding
GO:0046872	metal ion binding
GO:0003723	RNA binding
GO:0000166	nucleotide binding
Vocabulary: Biological Process
Term	Definition
GO:0000398	mRNA splicing, via spliceosome
Vocabulary: Cellular Component
Term	Definition
GO:0089701	U2AF
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000398 mRNA splicing, via spliceosome
cellular_component GO:0089701 U2AF
molecular_function GO:0046872 metal ion binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
ClCG01G012750.1	ClCG01G012750.1	mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000504	RNA recognition motif domain	PFAM	PF00076	RRM_1	coord: 83..139, score: 4.
IPR000504	RNA recognition motif domain	SMART	SM00360	rrm1_1	coord: 45..142, score: 4.
IPR000504	RNA recognition motif domain	PROFILE	PS50102	RRM	coord: 44..146, score: 11
IPR000571	Zinc finger, CCCH-type	PFAM	PF00642	zf-CCCH	coord: 14..37, score: 4.
IPR000571	Zinc finger, CCCH-type	SMART	SM00356	c3hfinal6	coord: 148..174, score: 0.21; coord: 13..39, score: 0.
IPR000571	Zinc finger, CCCH-type	PROFILE	PS50103	ZF_C3H1	coord: 148..175, score: 10.775; coord: 12..40, score: 13
IPR003954	RNA recognition motif domain, eukaryote	SMART	SM00361	rrm2_1	coord: 73..142, score: 6.
IPR009145	U2 auxiliary factor small subunit	PRINTS	PR01848	U2AUXFACTOR	coord: 37..57, score: 8.6E-63; coord: 18..37, score: 8.6E-63; coord: 75..90, score: 8.6E-63; coord: 130..154, score: 8.6E-63; coord: 103..125, score: 8.6E-63; coord: 165..177, score: 8.6
IPR009145	U2 auxiliary factor small subunit	PANTHER	PTHR12620	U2 SNRNP AUXILIARY FACTOR, SMALL SUBUNIT	coord: 1..298, score: 7.8E
IPR012677	Nucleotide-binding alpha-beta plait domain	GENE3D	G3DSA:3.30.70.330		coord: 44..145, score: 2.1
IPR012677	Nucleotide-binding alpha-beta plait domain	unknown	SSF54928	RNA-binding domain, RBD	coord: 32..154, score: 7.47
None	No IPR available	PANTHER	PTHR12620:SF16	SUBFAMILY NOT NAMED	coord: 1..298, score: 7.8E
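The `coord` ranges in the InterPro table can be mapped back onto the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the query protein (the usual InterProScan convention, but an assumption here), and takes the first 60 residues of the query from the BLAST alignments above. The helper `domain_slice` and the regular expression are illustrative only. It pulls out the PFAM zf-CCCH hit at 14..37 and checks it against the C-x8-C-x5-C-x3-H spacing named in several hit descriptions.

```python
import re

# First 60 residues of the query protein, copied from the alignments above.
QUERY_N_TERM = "MAEHLASIFGTEKDRVNCPFYFKIGACRHGDRCSRLHTKPSISPTLLLSNMYQRPDMITP"


def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    return protein[start - 1:end]


# PFAM PF00642 (zf-CCCH) hit, coord: 14..37
zf = domain_slice(QUERY_N_TERM, 14, 37)

# C-x8-C-x5-C-x3-H: three cysteines and a histidine with fixed spacing.
CCCH = re.compile(r"C.{8}C.{5}C.{3}H")
match = CCCH.search(zf)

print(zf)
print(match.group() if match else "no CCCH motif found")
```

Under these assumptions the motif starts at Cys18 and ends at His37, which falls inside every zinc-finger range reported for the N-terminal hit (12..40, 13..39, 14..37).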