CmoCh16G007190 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G007190
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: RNA polymerase-associated protein RTF1 like
Location: Cmo_Chr16 : 3594325 .. 3596298 (-)
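The span of the feature follows directly from the location coordinates. The short sketch below assumes the coordinates are 1-based and inclusive, as is usual for GFF-style annotation; the record itself does not state the convention, and the helper name `span_length` is purely illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = span_length(3594325, 3596298)

print(gene_length)      # 1974
print(gene_length % 3)  # 0, i.e. the span is a whole number of codons
```

Under that assumption the span is 1974 bp, which is a multiple of three.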
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGATCTAGAAAATTTACTTCTTGAGGCTGCGGGAAGAACTAAGGCATCAGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAAGGTTCATATTCTGATGCTGGAAGTGACTCGAGGGATGATGACTCTGATGATGAACGTGGTTATGCGAGCAGGAAGCCATCTGGATCTCAGGTTCCTCTGAAGAAGAGGTTAGATCCTAATGAGAGGGATGATGATGTGGGCAGCCCAGAAGAAGGGGAAGACGAAGATGTTGGTTCAGAACATGAGGGTGACAGTAGTGATGAATCTGATGTTGGGGATGATCTTTACAAAGATGACGATGACAGGCGTAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTTTCAGACAGAGCATCGAAGAAGAATGATAAGCATTTATATGAAAGCTTGAGATCTAAGATGGATAAAGGGAAGGCTGCCCCGTCTCGGAAAGAAAATCCTCCTCTCCCATCATCTCGTATTAGATCGTCAGCCCGATCTGCTGATAGAGCAGCTGCAAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAGGACCCCGAGGCTCACCGCAAATTGAGAGATACATCTAGAGGAAACGCAAATAATCGAAGGTTCTCACCAACAAAGCGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAAAGTAGGTTTCAAAGTGACGATGAAGGATCTACAGGAGATGGTGGAATGATTGACAGTGACGATGAAAGAACCATGCCTGGTTCAAATGGGCCAACATTTGATGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAGCCATTCTTTGAGGAGTTGATAGTTGGATGCTTTGTGAGAGTCGGAATTGGGAGATCAAGATCCGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCCGATCGTCAGTATAAACTCGAGAACAAAATCACGCATAAATATCTTAATGTTATTTGGGGAAATGAAAGTTCTGCTGCCAGATGGCAAATGGCTATGGTTTCGGACTCTGGTCCACTCGAGGATGAATATAAACAGTGGGTGAAGGAAGTAGAGCGAACGGGCGGTCGGATGCTGAGCAAGCAGGATATCTTGGAAAAGAAGGAAGCTATACAGAAAGCCAACAACTTCGTGTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATCTGCCTCATCAAGGCCATTAAATATTGCAGCCGAGAAGGACCGACTGAGGAATCAGATGGACGTAGCACTAAGCAAAAACAATGAAGCCGAGGTGGAGAGGATCAAAGCAAAACTGCGCCAATTAGACGCATCCAGGAGGTCACAAATGAAAGATGCCAAGGCTATTAGGTTATCTGAGATGAACAGGAAGAATAGAGTGGAAAACTTCAAGAACGCATCAGAACTAAGACCCACGAAAGATTTGAAAGCCGGTGAGGCTGGTTATGATCCCTTCTCTAGGAGATGGACGAGGTCGAGGAATTATTATGTTTCAAACGCTGGTCAAGTCGATGGGGCTGCCGAGGCAGCTGGCAACAGTGACAATATAACTCCTGCATCAGTGACTACTGGAACAGGATCTGGAGAAGCTGGCGTGGCAGCTACCGCAGCAGCTTTGGAAGCTGCTGCTGGTGCTGGAAAGTTGGTCGATACTAACGCTCCAGTAGATGGAGGAACAGAATCGAACTTGCTGCACAACTTCGAGCTGCCTATATCATTGACTGTGCTTCAGAAATTTGGTGGCGCTCTGGGAGCTCAGGCTGGGTTCTTAGCAAGGAAACAAAGGATAGAAGCCACAGTTGGGCGTCAAGTCCCTGAGAACGATGGTAGGCGGCATGCACTGACACTCACGGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGA

mRNA sequence

ATGGCAGATCTAGAAAATTTACTTCTTGAGGCTGCGGGAAGAACTAAGGCATCAGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAAGGTTCATATTCTGATGCTGGAAGTGACTCGAGGGATGATGACTCTGATGATGAACGTGGTTATGCGAGCAGGAAGCCATCTGGATCTCAGGTTCCTCTGAAGAAGAGGTTAGATCCTAATGAGAGGGATGATGATGTGGGCAGCCCAGAAGAAGGGGAAGACGAAGATGTTGGTTCAGAACATGAGGGTGACAGTAGTGATGAATCTGATGTTGGGGATGATCTTTACAAAGATGACGATGACAGGCGTAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTTTCAGACAGAGCATCGAAGAAGAATGATAAGCATTTATATGAAAGCTTGAGATCTAAGATGGATAAAGGGAAGGCTGCCCCGTCTCGGAAAGAAAATCCTCCTCTCCCATCATCTCGTATTAGATCGTCAGCCCGATCTGCTGATAGAGCAGCTGCAAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAGGACCCCGAGGCTCACCGCAAATTGAGAGATACATCTAGAGGAAACGCAAATAATCGAAGGTTCTCACCAACAAAGCGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAAAGTAGGTTTCAAAGTGACGATGAAGGATCTACAGGAGATGGTGGAATGATTGACAGTGACGATGAAAGAACCATGCCTGGTTCAAATGGGCCAACATTTGATGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAGCCATTCTTTGAGGAGTTGATAGTTGGATGCTTTGTGAGAGTCGGAATTGGGAGATCAAGATCCGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCCGATCGTCAGTATAAACTCGAGAACAAAATCACGCATAAATATCTTAATGTTATTTGGGGAAATGAAAGTTCTGCTGCCAGATGGCAAATGGCTATGGTTTCGGACTCTGGTCCACTCGAGGATGAATATAAACAGTGGGTGAAGGAAGTAGAGCGAACGGGCGGTCGGATGCTGAGCAAGCAGGATATCTTGGAAAAGAAGGAAGCTATACAGAAAGCCAACAACTTCGTGTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATCTGCCTCATCAAGGCCATTAAATATTGCAGCCGAGAAGGACCGACTGAGGAATCAGATGGACGTAGCACTAAGCAAAAACAATGAAGCCGAGGTGGAGAGGATCAAAGCAAAACTGCGCCAATTAGACGCATCCAGGAGGTCACAAATGAAAGATGCCAAGGCTATTAGGTTATCTGAGATGAACAGGAAGAATAGAGTGGAAAACTTCAAGAACGCATCAGAACTAAGACCCACGAAAGATTTGAAAGCCGGTGAGGCTGGTTATGATCCCTTCTCTAGGAGATGGACGAGGTCGAGGAATTATTATGTTTCAAACGCTGGTCAAGTCGATGGGGCTGCCGAGGCAGCTGGCAACAGTGACAATATAACTCCTGCATCAGTGACTACTGGAACAGGATCTGGAGAAGCTGGCGTGGCAGCTACCGCAGCAGCTTTGGAAGCTGCTGCTGGTGCTGGAAAGTTGGTCGATACTAACGCTCCAGTAGATGGAGGAACAGAATCGAACTTGCTGCACAACTTCGAGCTGCCTATATCATTGACTGTGCTTCAGAAATTTGGTGGCGCTCTGGGAGCTCAGGCTGGGTTCTTAGCAAGGAAACAAAGGATAGAAGCCACAGTTGGGCGTCAAGTCCCTGAGAACGATGGTAGGCGGCATGCACTGACACTCACGGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGA

Coding sequence (CDS)

ATGGCAGATCTAGAAAATTTACTTCTTGAGGCTGCGGGAAGAACTAAGGCATCAGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAAGGTTCATATTCTGATGCTGGAAGTGACTCGAGGGATGATGACTCTGATGATGAACGTGGTTATGCGAGCAGGAAGCCATCTGGATCTCAGGTTCCTCTGAAGAAGAGGTTAGATCCTAATGAGAGGGATGATGATGTGGGCAGCCCAGAAGAAGGGGAAGACGAAGATGTTGGTTCAGAACATGAGGGTGACAGTAGTGATGAATCTGATGTTGGGGATGATCTTTACAAAGATGACGATGACAGGCGTAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTTTCAGACAGAGCATCGAAGAAGAATGATAAGCATTTATATGAAAGCTTGAGATCTAAGATGGATAAAGGGAAGGCTGCCCCGTCTCGGAAAGAAAATCCTCCTCTCCCATCATCTCGTATTAGATCGTCAGCCCGATCTGCTGATAGAGCAGCTGCAAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAGGACCCCGAGGCTCACCGCAAATTGAGAGATACATCTAGAGGAAACGCAAATAATCGAAGGTTCTCACCAACAAAGCGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAAAGTAGGTTTCAAAGTGACGATGAAGGATCTACAGGAGATGGTGGAATGATTGACAGTGACGATGAAAGAACCATGCCTGGTTCAAATGGGCCAACATTTGATGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAGCCATTCTTTGAGGAGTTGATAGTTGGATGCTTTGTGAGAGTCGGAATTGGGAGATCAAGATCCGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCCGATCGTCAGTATAAACTCGAGAACAAAATCACGCATAAATATCTTAATGTTATTTGGGGAAATGAAAGTTCTGCTGCCAGATGGCAAATGGCTATGGTTTCGGACTCTGGTCCACTCGAGGATGAATATAAACAGTGGGTGAAGGAAGTAGAGCGAACGGGCGGTCGGATGCTGAGCAAGCAGGATATCTTGGAAAAGAAGGAAGCTATACAGAAAGCCAACAACTTCGTGTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATCTGCCTCATCAAGGCCATTAAATATTGCAGCCGAGAAGGACCGACTGAGGAATCAGATGGACGTAGCACTAAGCAAAAACAATGAAGCCGAGGTGGAGAGGATCAAAGCAAAACTGCGCCAATTAGACGCATCCAGGAGGTCACAAATGAAAGATGCCAAGGCTATTAGGTTATCTGAGATGAACAGGAAGAATAGAGTGGAAAACTTCAAGAACGCATCAGAACTAAGACCCACGAAAGATTTGAAAGCCGGTGAGGCTGGTTATGATCCCTTCTCTAGGAGATGGACGAGGTCGAGGAATTATTATGTTTCAAACGCTGGTCAAGTCGATGGGGCTGCCGAGGCAGCTGGCAACAGTGACAATATAACTCCTGCATCAGTGACTACTGGAACAGGATCTGGAGAAGCTGGCGTGGCAGCTACCGCAGCAGCTTTGGAAGCTGCTGCTGGTGCTGGAAAGTTGGTCGATACTAACGCTCCAGTAGATGGAGGAACAGAATCGAACTTGCTGCACAACTTCGAGCTGCCTATATCATTGACTGTGCTTCAGAAATTTGGTGGCGCTCTGGGAGCTCAGGCTGGGTTCTTAGCAAGGAAACAAAGGATAGAAGCCACAGTTGGGCGTCAAGTCCCTGAGAACGATGGTAGGCGGCATGCACTGACACTCACGGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGA
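As a rough illustration of how the coding sequence relates to the protein used as the BLAST query, the sketch below translates a nucleotide string codon by codon. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the function name `translate` and the two short fragments are chosen here only for demonstration, with the fragments copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 1; optionally stop at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGCAGATCTAGAAAATTTACTTCTTGAG"))  # MADLENLLLE
print(translate("AGAGGGCTTCTTTGA"))                 # RGLL
```

The two fragments give `MADLENLLLE` and `RGLL`, matching the first and last residues of the query in the alignments that follow.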
BLAST of CmoCh16G007190 vs. Swiss-Prot
Match: VIP5_ARATH (Protein RTF1 homolog OS=Arabidopsis thaliana GN=VIP5 PE=1 SV=1)

HSP 1 Score: 800.4 bits (2066), Expect = 1.4e-230
Identity = 456/659 (69.20%), Positives = 531/659 (80.58%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           M DLENLLLEAAGRT ++GR+RH  PPS R+REGSYSD  SDSRDD SD++RGYASRKPS
Sbjct: 1   MGDLENLLLEAAGRTNSAGRSRH--PPSSRRREGSYSDGSSDSRDD-SDEDRGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRL+  ER+D     E G   D  S+ EGDSS+ESD GDDLYK+++DR+KLAG
Sbjct: 61  GSQVPLKKRLEA-EREDRAARVEGGYG-DGPSDREGDSSEESDFGDDLYKNEEDRQKLAG 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 180
           M+E QREMILS+RA KK DK+  E LRSK +  K   S+KE  PLP+SR +RSSARSADR
Sbjct: 121 MTEFQREMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKR+KQQDP A RKLRD S+G + +R FS TKRKP  + +LSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSD 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           S+SR QSDDEGS  +GGM+DSDD+R    S+ PTF+D+KE+TIRRSKLAKWLMEPFFEEL
Sbjct: 241 SDSRSQSDDEGS--NGGMLDSDDDR----SDVPTFEDVKEVTIRRSKLAKWLMEPFFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGCFVRVGIGRS+SGPIYRLC V+NVDAT+PD+ YKLENK THKYLNV+WGNE+SAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAM+SD  PLE+EY+QW++EVERT GRM +KQDI EKKEAIQ+ N+FVYSA TVKQMLQ
Sbjct: 361 QMAMISDGHPLEEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKSAS RP+N+AAEKDRLR ++++A SKN+EA VERIK+K++QLDASR  +  D KA++
Sbjct: 421 EKKSASVRPMNVAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALK 480

Query: 481 LSEMNRKNRVENFKNASELRP-TKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 540
           L+EMN+KNR ENFKNASE++  T  LKAGEAGYDPFSRRWTRS NYY       DG    
Sbjct: 481 LAEMNKKNRAENFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENE 540

Query: 541 AGNSDNITPASVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFEL 600
           A        A+V T      AGV AT AALEAAA AGKL+DT AP+  G E N LHNFEL
Sbjct: 541 AA-----VAAAVETNGADAGAGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFEL 600

Query: 601 PISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
            +SLT LQK+GG  G Q  F+ARKQ  EATVG +V ENDG+RH LTLTVSDYKRRRGLL
Sbjct: 601 SLSLTALQKYGGPQGVQKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL 643

BLAST of CmoCh16G007190 vs. Swiss-Prot
Match: RTF1_HUMAN (RNA polymerase-associated protein RTF1 homolog OS=Homo sapiens GN=RTF1 PE=1 SV=4)

HSP 1 Score: 100.5 bits (249), Expect = 7.1e-20
Identity = 169/662 (25.53%), Positives = 286/662 (43.20%), Query Frame = 1

Query: 12  AGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPSGSQVPLKKRLD 71
           + + K  G+ R        +++ + + A S S D DS  E    S  P   +V       
Sbjct: 108 SNKNKKKGKARKIEKKGTMKKQANKT-ASSGSSDKDSSAE----SSAPEEGEVS------ 167

Query: 72  PNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAGMSELQREMILS 131
               D D  S     D D  SE   D       G+DL  D++DR +L  M+E +RE  L 
Sbjct: 168 ----DSDSNSSSSSSDSDSSSE---DEEFHDGYGEDLMGDEEDRARLEQMTEKEREQELF 227

Query: 132 DRASK----KNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRAAAKDDAL 191
           +R  K    K    + + L++   K K    +K+       ++     S   +  K    
Sbjct: 228 NRIEKREVLKRRFEIKKKLKTAKKKEKKEKKKKQEEEQEKKKLTQIQESQVTSHNK---- 287

Query: 192 NELRAKRLKQQDP--EAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSESESRFQ 251
            E R+KR ++ D   +A  +L+       N       K++P     + S  + E E    
Sbjct: 288 -ERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELLAKKQPLKTSEVYSDDEEEEEDDKS 347

Query: 252 SDDEGSTGDGGMIDSDDERT-MPGSNGPTF--DDIKEITIRRSKLAKWLMEPFFEELIVG 311
           S+    +      D ++E+  +P  + P    +++  + + R KL +W   PFF + + G
Sbjct: 348 SEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELNRVRLSRHKLERWCHMPFFAKTVTG 407

Query: 312 CFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQMA 371
           CFVR+GIG   S P+YR+  +  V   E  + Y+L    T+K L +  GN+    R  + 
Sbjct: 408 CFVRIGIGNHNSKPVYRVAEITGV--VETAKVYQLGGTRTNKGLQLRHGNDQRVFR--LE 467

Query: 372 MVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKE-AIQKANNFVYSAATVKQMLQEK 431
            VS+    E E+ +W KE   + G  L   D + KKE +I++A N+ ++   ++++++EK
Sbjct: 468 FVSNQEFTESEFMKW-KEAMFSAGMQLPTLDEINKKELSIKEALNYKFNDQDIEEIVKEK 527

Query: 432 KSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQL----DASRRSQMKDAKA 491
           +     P N A +K +L  +  +A    ++ + ++I+ +L +L    +A  R + K+  A
Sbjct: 528 ERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLNELEERAEALDRQRTKNISA 587

Query: 492 IRLSEMNRKNRVENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAE 551
           I  S +N++NR  N   + +    +         DPF+RR  + +   VSN+   D A +
Sbjct: 588 I--SYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRR--QCKPTIVSNSR--DPAVQ 647

Query: 552 AAGNSDNITPASVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNL--LHN 611
           AA        A +    GSG        A  E + G GK  D N+        +L  +H+
Sbjct: 648 AA------ILAQLNAKYGSG----VLPDAPKEMSKGQGKDKDLNSKSASDLSEDLFKVHD 707

Query: 612 FELPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRG 658
           F++ I L V      AL              A   +  P  DG     +L + DYK+RRG
Sbjct: 708 FDVKIDLQVPSSESKAL--------------AITSKAPPAKDGAPRR-SLNLEDYKKRRG 710

BLAST of CmoCh16G007190 vs. Swiss-Prot
Match: RTF1_MOUSE (RNA polymerase-associated protein RTF1 homolog OS=Mus musculus GN=Rtf1 PE=1 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 1.2e-19
Identity = 175/692 (25.29%), Positives = 299/692 (43.21%), Query Frame = 1

Query: 3   DLENLLLEAAGRTKASGRNRH---SHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKP 62
           +L+  LL  A R ++    +    S P +    E S SD   D     S+  +     + 
Sbjct: 68  NLDQELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSD---DEWTFGSNKNKKKGKTRK 127

Query: 63  SGSQVPLKKRLDP------NERDDDVGS--PEEGEDEDVGSEHEGDSSDESD-------- 122
              +  +KK+ +       ++RD    S  PEEGE  D  S     SSD           
Sbjct: 128 VEKKGAMKKQANKAASSGSSDRDSSAESSAPEEGEVSDSESSSSSSSSDSDSSSEDEEFH 187

Query: 123 --VGDDLYKDDDDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRSKMDKGKAAP 182
              G+DL  D++DR +L  M+E +RE  L +R  K    K    + + L++   K K   
Sbjct: 188 DGYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEK 247

Query: 183 SRKENPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDP--EAHRKLRDTSRGNAN 242
            +K+       ++     S   +  K     E R+KR ++ D   +A  +L+       N
Sbjct: 248 KKKQEEEQEKKKLTQIQESQVTSHNK-----ERRSKRDEKLDKKSQAMEELKAEREKRKN 307

Query: 243 NRRFSPTKRKPFTAPSLSSSSQSESESRFQSDDEGSTGDGGMIDSDDERT-MPGSNGPTF 302
                  K++P     + S  + E +    S+    +      D ++E+  +P  + P  
Sbjct: 308 RTAELLAKKQPLKTSEVYSDDEEEEDDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVS 367

Query: 303 --DDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPD 362
             +++  + + R KL +W   PFF + + GCFVR+GIG   S P+YR+  +  V   E  
Sbjct: 368 LPEELNRVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGV--VETA 427

Query: 363 RQYKLENKITHKYLNVIWGNESSAARWQMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQ 422
           + Y+L    T+K L +  GN+    R  +  VS+    E E+ +W KE   + G  L   
Sbjct: 428 KVYQLGGTRTNKGLQLRHGNDQRVFR--LEFVSNQEFTESEFMKW-KEAMFSAGMQLPTL 487

Query: 423 DILEKKE-AIQKANNFVYSAATVKQMLQEKKSASSRPLNIAAEKDRLRNQMDVALSKNNE 482
           D + KKE +I++A N+ ++   ++++++EK+     P N A +K +L  +  +A    ++
Sbjct: 488 DEINKKELSIKEALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQ 547

Query: 483 AEVERIKAKLRQL----DASRRSQMKDAKAIRLSEMNRKNRVENFKNASELRPTKDLKAG 542
            + ++I+ +L +L    +A  R + K+  AI  S +N++NR  N   + +    +     
Sbjct: 548 DKAKQIQDQLNELEERAEALDRQRTKNISAI--SYINQRNREWNIVESEKALVAESHNMR 607

Query: 543 EAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAGNSDNITPASVTTGTGSGEAGVAATAAA 602
               DPF+RR  + +   VSN+   D A +AA        A +    GSG        A 
Sbjct: 608 NQQMDPFTRR--QCKPTIVSNSR--DPAVQAA------ILAQLNAKYGSG----VLPDAP 667

Query: 603 LEAAAGAGKLVDTNAPVDGGTESNL--LHNFELPISLTVLQKFGGALGAQAGFLARKQRI 658
            E + G GK  D N+        +L  +H+F++ I L V      AL             
Sbjct: 668 KEMSKGQGKDKDLNSKTASDLSEDLFKVHDFDVKIDLQVPSSESKAL------------- 715

BLAST of CmoCh16G007190 vs. Swiss-Prot
Match: RTF1_PONAB (RNA polymerase-associated protein RTF1 homolog (Fragment) OS=Pongo abelii GN=RTF1 PE=2 SV=2)

HSP 1 Score: 88.6 bits (218), Expect = 2.8e-16
Identity = 154/607 (25.37%), Positives = 266/607 (43.82%), Query Frame = 1

Query: 12  AGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPSGSQVPLKKRLD 71
           + + K  G+ R        +++ + + A S S D DS  E    S  P   +V       
Sbjct: 103 SNKNKKKGKARKIEKKGTMKKQANKT-ASSGSSDKDSSAE----SSAPEEGEVS------ 162

Query: 72  PNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAGMSELQREMILS 131
               D D  S     D D  SE   D       G+DL  D++DR +L  M+E +RE  L 
Sbjct: 163 ----DSDSNSSSSSSDSDSSSE---DEEFHDGYGEDLMGDEEDRARLEQMTEKEREQELF 222

Query: 132 DRASK----KNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRAAAKDDAL 191
           +R  K    K    + + L++   K K    +K+       ++     S   +  K    
Sbjct: 223 NRIEKREVLKRRFEIKKKLKTAKKKEKKEKKKKQEEEQEKKKLTQIQESQVTSHNK---- 282

Query: 192 NELRAKRLKQQDP--EAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSESESRFQ 251
            E R+KR ++ D   +A  +L+       N       K++P     + S  + E E    
Sbjct: 283 -ERRSKRDEKLDKKSQAMEELKAEREKRKNRTVELLAKKQPLKTSEVYSDDEEEEEDDKS 342

Query: 252 SDDEGSTGDGGMIDSDDERT-MPGSNGPTF--DDIKEITIRRSKLAKWLMEPFFEELIVG 311
           S+    +      D ++E+  +P  + P    +++  + + R KL +W   PFF + + G
Sbjct: 343 SEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELNRVRLSRHKLERWCHMPFFAKTVTG 402

Query: 312 CFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQMA 371
           CFVR+GIG   S P+YR+  +  V   E  + Y+L    T+K L +  GN+    R  + 
Sbjct: 403 CFVRIGIGNHNSKPVYRVAEITGV--VETAKVYQLGGTRTNKGLQLRHGNDQRVFR--LE 462

Query: 372 MVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKE-AIQKANNFVYSAATVKQMLQEK 431
            VS+    E E+ +W KE   + G  L   D + KKE +I++A N+ ++   ++++++EK
Sbjct: 463 FVSNQEFTESEFMKW-KEAMFSAGMQLPTLDEINKKELSIKEALNYKFNDQDIEEIVKEK 522

Query: 432 KSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQL----DASRRSQMKDAKA 491
           +     P N A +K +L  +  +A    ++ + ++I+ +L +L    +A  R + K+  A
Sbjct: 523 ERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLNELEERAEALDRQRTKNISA 582

Query: 492 IRLSEMNRKNRVENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAE 551
           I  S +N++NR  N   + +    +         DPF+RR  + +   VSN+   D A +
Sbjct: 583 I--SYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRR--QCKPTIVSNSR--DPAVQ 642

Query: 552 AAGNSDNITPASVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNL--LHN 603
           AA        A +    GSG        A  E + G GK  D N+        +L  +H+
Sbjct: 643 AA------ILAQLNAKYGSG----VLPDAPKEMSKGQGKDKDLNSKSASDLSEDLFKVHD 665

BLAST of CmoCh16G007190 vs. Swiss-Prot
Match: RTF1_SCHPO (RNA polymerase-associated protein C651.09c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPBC651.09c PE=1 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 1.7e-13
Identity = 108/438 (24.66%), Positives = 191/438 (43.61%), Query Frame = 1

Query: 77  DDV--GSPEEGEDEDVGSEH------EGDSSDESDVGDDL---------YKDDDDRRKLA 136
           DDV   S +E  +E+VG ++      EG+   E +  +           +KD+ DR K+ 
Sbjct: 31  DDVLSSSSDEDNNENVGQDYAEESGGEGNEKSEDEFEEKFKNPYRLEGKFKDEADRAKIM 90

Query: 137 GMSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPS-RKENPPLPSSRIRSSARS-- 196
            M+E++RE IL +R          E +   M++ + A    ++N    +   R S R   
Sbjct: 91  AMTEIERESILFERE---------EEISKLMERRELAIRLHQQNAQYMAQSTRRSTRDKP 150

Query: 197 -ADRAAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSS 256
               AA K D L EL+ +R ++       + R               KR P       S 
Sbjct: 151 LTSAAAGKRDKLTELKKRRQERSARSVSERTR---------------KRSPV------SD 210

Query: 257 SQSESESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPF 316
            + ++ES    ++EG +      +   E+    +      D+  I + R  +A+++  P 
Sbjct: 211 YEEQNESEKSEEEEGYSPS--YAEEKVEQVSKDNASANLYDLNAIRLGRKHVAEYMYHPI 270

Query: 317 FEELIVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESS 376
           FE  + GCFVRV IG      +YRLC V+ +   E  + Y+++  +T   L    G   S
Sbjct: 271 FESTVTGCFVRVKIGERDGQGVYRLCQVKGI--LESRKPYRVDGVLTKVSLECFHGR--S 330

Query: 377 AARWQMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVK 436
              + + ++S+    + ++++W  ++      M SK  +  K   ++  + +V S   V 
Sbjct: 331 KRVFDVNVLSNEPFSDHDFQRWHHQMMEDKLSMPSKNFVQRKLNDLRDMSKYVLSEKEVS 390

Query: 437 QMLQEKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLD--ASRRSQMK 492
            ++  KK  S  P NIAAEK RLR +   A    N   V+ I  +L  L+  +   +Q  
Sbjct: 391 DIINRKKELSRVPSNIAAEKTRLRQRRQAAYVAGNAELVKEIDDQLNTLEELSMGSNQNS 432

BLAST of CmoCh16G007190 vs. TrEMBL
Match: A0A0A0KXW1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G038650 PE=4 SV=1)

HSP 1 Score: 1137.5 bits (2941), Expect = 0.0e+00
Identity = 609/660 (92.27%), Positives = 636/660 (96.36%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT A+GRNRHSHPPSRRQREGSYSDAGSDSRDDDSDD+RGYASRKPS
Sbjct: 1   MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEGEDEDVGSE EGDSSDESDVGDDLYKDDDDRRKLAG
Sbjct: 61  GSQVPLKKRLDPTERDDDGGSQEEGEDEDVGSEREGDSSDESDVGDDLYKDDDDRRKLAG 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRA 180
           MSELQREMILSDRASKKNDKHLYESLR+KMDKGK+APSRKE PPLPSSRIRSSARSADRA
Sbjct: 121 MSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRA 180

Query: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSES 240
           AAKDDALNELRAKRLKQQDPEAHRKLRD SRGNAN+RRFSPTKRKPFTAPSLSSSSQSES
Sbjct: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDASRGNANSRRFSPTKRKPFTAPSLSSSSQSES 240

Query: 241 ESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEELI 300
           ESRFQSDDEGSTGDGGMIDSDDER++PGS+GPTF+DIKE+TIRRSKLAKWLMEPFFEELI
Sbjct: 241 ESRFQSDDEGSTGDGGMIDSDDERSIPGSDGPTFEDIKEVTIRRSKLAKWLMEPFFEELI 300

Query: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQ 360
           VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQ
Sbjct: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNEASAARWQ 360

Query: 361 MAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQE 420
           MAMVSDS PLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQK NNFVYSAATVKQMLQ+
Sbjct: 361 MAMVSDSAPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKVNNFVYSAATVKQMLQD 420

Query: 421 KKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIRL 480
           KKSAS+RPLNIAAEKDRLR +MDVA+SKN+EAEVERIK +L+QL+ASRR QMKDAKAIRL
Sbjct: 421 KKSASARPLNIAAEKDRLRREMDVAVSKNDEAEVERIKTRLQQLEASRRLQMKDAKAIRL 480

Query: 481 SEMNRKNRVENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAG 540
           +EMNRKNRVENFKNASELRP KDLKAGEAGYDPFSRRWTRSRNYYVSNAG+ +GAAEAAG
Sbjct: 481 AEMNRKNRVENFKNASELRPLKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAAG 540

Query: 541 NSDNITPASVTTGT---GSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFE 600
           NSDN+TPA   T T   G+ +AG+AATAAALEAAAGAGKLVDTNAPVDGGTESN LHNFE
Sbjct: 541 NSDNVTPALENTRTEAGGTSDAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE 600

Query: 601 LPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
           LPISL +LQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Sbjct: 601 LPISLAMLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 660

BLAST of CmoCh16G007190 vs. TrEMBL
Match: A0A067JDU6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21409 PE=4 SV=1)

HSP 1 Score: 926.0 bits (2392), Expect = 2.5e-266
Identity = 514/660 (77.88%), Positives = 577/660 (87.42%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT +SGRNR++HPPSRR+REGSYSD GSDSRD+DSDD+RGYASRKPS
Sbjct: 1   MADLENLLLEAAGRTGSSGRNRNAHPPSRRRREGSYSDGGSDSRDEDSDDDRGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEG  +D  S+ EGDSSDESDVGDDLYKD+DDRRKLA 
Sbjct: 61  GSQVPLKKRLDPAERDDDQGSQEEGGYDDGASDREGDSSDESDVGDDLYKDEDDRRKLAQ 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 180
           MSEL+REMILS+RA KK DK+L E +RSK D  +A  SRKE PPLPSSR +R+SARSADR
Sbjct: 121 MSELEREMILSERADKKGDKNLTERIRSKRDSERATRSRKETPPLPSSRGVRTSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKRLKQQDPEAHRKLRD SRG + +R  SP +RK FT+ SLSSSS SE
Sbjct: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDVSRGTSGSRGVSPVRRKRFTSASLSSSS-SE 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           S+SR  S+DE STGDGGM DSD++R  PGS G T+DDI+E+TIRRSKLAKWLMEP+FEEL
Sbjct: 241 SDSRSHSEDEASTGDGGMADSDEDRE-PGSEGLTYDDIREVTIRRSKLAKWLMEPWFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGCFVRVGIGRS+SGPIYRLCLVRNVDA +PDR YKLENK T+KYLNVIWGNESSAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCLVRNVDAADPDRPYKLENKTTYKYLNVIWGNESSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAMVSDS P EDEYKQWV+EVER+GGRM +KQDILEKKEAI+K+N FVYSAATVKQMLQ
Sbjct: 361 QMAMVSDSAPTEDEYKQWVREVERSGGRMPTKQDILEKKEAIKKSNTFVYSAATVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKSAS+RPLN+AAEKDRLR +++VA  K ++AEVERI+A++++L+ASR++Q KDAKAIR
Sbjct: 421 EKKSASTRPLNVAAEKDRLRRELEVAQMKQDDAEVERIRARIQELEASRQAQEKDAKAIR 480

Query: 481 LSEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 540
           L+EMNRKNR ENF+NASEL+P    LKAGEAGYDPFSRRWTRSRNYYVS  G  D AAEA
Sbjct: 481 LAEMNRKNRAENFRNASELKPVNTSLKAGEAGYDPFSRRWTRSRNYYVSKPGGADVAAEA 540

Query: 541 AGNSDNITPASVTTGTGS-GEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFE 600
             N       S    TG+  EAG+AATAAALEAAA AGKLVDT APVD GTESN LH+F+
Sbjct: 541 NNNGTAAVAHSNGAATGTLAEAGMAATAAALEAAADAGKLVDTAAPVDQGTESNTLHDFD 600

Query: 601 LPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
           LPISLT L+KFGGA GA+AGF+ARKQ+IEATVG +VPENDGRRHALTLTVSDYKRRRGLL
Sbjct: 601 LPISLTALEKFGGAKGAKAGFMARKQQIEATVGCRVPENDGRRHALTLTVSDYKRRRGLL 658

BLAST of CmoCh16G007190 vs. TrEMBL
Match: A0A061GFX4_THECC (PAF1 complex component isoform 1 OS=Theobroma cacao GN=TCM_029914 PE=4 SV=1)

HSP 1 Score: 924.9 bits (2389), Expect = 5.6e-266
Identity = 507/664 (76.36%), Positives = 573/664 (86.30%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT   GRNRHS PPSRR+REGSYSD GSDSRDDDSDD+ GYASRKPS
Sbjct: 1   MADLENLLLEAAGRTGTGGRNRHSLPPSRRRREGSYSDGGSDSRDDDSDDDHGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEG+ +D  S HEGDSSDESDVGDDLYK++DDRRKLA 
Sbjct: 61  GSQVPLKKRLDPAERDDDQGSQEEGDYDDGVSVHEGDSSDESDVGDDLYKNEDDRRKLAQ 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 180
           M+EL+RE+ILS+RA K+ DK   E +RSK +  + + SRKE PPLPSSR +RSSARSADR
Sbjct: 121 MTELERELILSERADKRGDKKFTEKIRSKRENDRPSRSRKETPPLPSSRGVRSSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKRLKQQDPEAHRKLRD SRG++ +R  SP KRKPFTA SLSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDASRGSSGSRGLSPVKRKPFTASSLSSSSQSD 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           SESR  S+DEGSTGDGGM+DSDD+R M G +GPTFDDIKEITIRRSKLAKW MEPFFEEL
Sbjct: 241 SESRSNSEDEGSTGDGGMVDSDDDRGMQGPDGPTFDDIKEITIRRSKLAKWFMEPFFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGC+VRVGIGRS+SGPIYRLC+VRNVDATEP+R YKLENK T+KYLNV+WGNESSAARW
Sbjct: 301 IVGCYVRVGIGRSKSGPIYRLCMVRNVDATEPERTYKLENKTTYKYLNVVWGNESSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAM+SDS P E+E++Q ++E+ER+GGRM SKQD+LEKKEA+QKA  FVYSAATVKQMLQ
Sbjct: 361 QMAMISDSPPQEEEFRQLIRELERSGGRMPSKQDVLEKKEALQKAKTFVYSAATVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKS SSRPLNIAAEKDRLR  +++A SK++EAEVERIK +L+QL+ASR++Q KDAKA+R
Sbjct: 421 EKKSTSSRPLNIAAEKDRLRRDLEIAQSKHDEAEVERIKMRLQQLEASRQAQEKDAKAVR 480

Query: 481 LSEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 540
           L+EMNRKNR ENFKNASEL+P    LKAGEAGYDPFSRRWTRSRNYYV+     D AA A
Sbjct: 481 LAEMNRKNRAENFKNASELKPVNTGLKAGEAGYDPFSRRWTRSRNYYVAKPPGADAAAVA 540

Query: 541 AGNSDNITPASVTTGTG-----SGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLL 600
            G+   +    + +G G     + EAG AAT AAL+ AAGAGKLVDT+APVD GTESN+L
Sbjct: 541 NGDRIGV----IASGNGNDARAAAEAGRAATVAALQEAAGAGKLVDTSAPVDEGTESNML 600

Query: 601 HNFELPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRR 658
           H+FE+PISL  LQ+FGG  GA AGF+ARKQRIEATVG QVPENDGRRHALTLTVSDYKRR
Sbjct: 601 HDFEIPISLNALQRFGGPQGAVAGFMARKQRIEATVGCQVPENDGRRHALTLTVSDYKRR 660

BLAST of CmoCh16G007190 vs. TrEMBL
Match: W9RDA9_9ROSA (RNA polymerase-associated protein RTF1-like protein OS=Morus notabilis GN=L484_012115 PE=4 SV=1)

HSP 1 Score: 924.9 bits (2389), Expect = 5.6e-266
Identity = 516/662 (77.95%), Positives = 577/662 (87.16%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MA+LENLLLEAAGRT+++GRNRHS PPSRR+REGSYSD GSDSRDDDSDD+RGYA+RKPS
Sbjct: 1   MAELENLLLEAAGRTRSAGRNRHSIPPSRRRREGSYSDGGSDSRDDDSDDDRGYANRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP E DDD GS EEG+D D GS+ EGDS  ESDVG DLYKDDDDRRKLA 
Sbjct: 61  GSQVPLKKRLDPTEMDDDQGSEEEGDD-DRGSDREGDS--ESDVGSDLYKDDDDRRKLAE 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMD-KGKAAPSRKENPPLPSSRIRSSARSADR 180
           M+ELQREMIL DRASKK DK+L E LR K D KGKA  SRKE  PLPSSR+RSSARSADR
Sbjct: 121 MTELQREMILLDRASKKEDKNLKEKLRPKWDNKGKATQSRKET-PLPSSRVRSSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKRLKQQDPEAH KLRD SRG + +R     KRK +TA SLSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRLKQQDPEAHGKLRDASRGGSGSRNLLHNKRKSYTATSLSSSSQSD 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           SES  QS+DEGSTGDGGMIDSDDER +PGS G TFDDIKE+T+RRSKLAKWLMEPFFEEL
Sbjct: 241 SESESQSEDEGSTGDGGMIDSDDERGIPGSEGLTFDDIKEVTVRRSKLAKWLMEPFFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGCFVRVGIGRS+SGPIYRLC+VRNVDA+EPDRQYKL+NKITHKYLNV+WGNE+SAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCMVRNVDASEPDRQYKLDNKITHKYLNVVWGNENSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAMVSDS P E+E+KQWV+EVER+GGRM +K DIL+KKE+I+K N FVYSAATVKQMLQ
Sbjct: 361 QMAMVSDSVPNEEEFKQWVREVERSGGRMPTKHDILDKKESIKKINTFVYSAATVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKSAS+RPLNIA EKDRLR +++VA SKN+E EV+RIK +L++L+ASR+++  DAKAIR
Sbjct: 421 EKKSASARPLNIALEKDRLRRELEVAQSKNDEVEVDRIKTRLQELEASRKAKQTDAKAIR 480

Query: 481 LSEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQV-DGAAE 540
           L+EMNRKNRVENFKNASEL+P    LKAGEAGYDPFSRRWTRSRNYYV   G+V + +  
Sbjct: 481 LAEMNRKNRVENFKNASELKPVNTGLKAGEAGYDPFSRRWTRSRNYYVGKPGEVKEDSGA 540

Query: 541 AAGNSDNITPA--SVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHN 600
            AGN+   T A  +   G  + EAG+AAT AALEAAA AGKLVDTNAPVD GT SN+LHN
Sbjct: 541 NAGNNGASTDAENNGRHGIVAAEAGIAATEAALEAAADAGKLVDTNAPVDQGTVSNMLHN 600

Query: 601 FELPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRG 658
           FELPISL+VLQKFGG  GAQAGF+ARKQRIEATVG +VPENDGRRHALTL+V DYKRRRG
Sbjct: 601 FELPISLSVLQKFGGPQGAQAGFMARKQRIEATVGCRVPENDGRRHALTLSVGDYKRRRG 658

BLAST of CmoCh16G007190 vs. TrEMBL
Match: A0A0D2T1B1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G025800 PE=4 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 1.5e-263
Identity = 503/660 (76.21%), Positives = 570/660 (86.36%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT   GRNRHS PPSRR+REGSYSD GSDSRDDDSDD+ GYASRKPS
Sbjct: 1   MADLENLLLEAAGRTGTGGRNRHSLPPSRRRREGSYSDGGSDSRDDDSDDDHGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEG+  D GS  E DSSDESDVGDDLYK+++DRR+LA 
Sbjct: 61  GSQVPLKKRLDPAERDDDQGSQEEGDYNDAGSGRERDSSDESDVGDDLYKNEEDRRQLAQ 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRA 180
           ++EL+REMILS+RA K+ DK   E +RSK +  + + S++E PPLPS  +RSSARSADRA
Sbjct: 121 LTELEREMILSERADKRGDKKFTEKIRSKRENDRPSRSQRETPPLPSRGVRSSARSADRA 180

Query: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSES 240
           AAKDDALNELRAKRLKQQDPEAHRKLRD SRG++ NR  SP KRKPFTA SLSSSSQSES
Sbjct: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDASRGSSGNRGLSPVKRKPFTASSLSSSSQSES 240

Query: 241 ESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEELI 300
           ESR  S+DEGSTGDGGM+DS+DER   G NGPTF+DIKEITIRRSKLAKWLMEPFFEELI
Sbjct: 241 ESRSNSEDEGSTGDGGMVDSEDERGTWGPNGPTFNDIKEITIRRSKLAKWLMEPFFEELI 300

Query: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQ 360
           VGCFVRVGIGRS++G IYRLC+VRNVDAT+PDR YKLENK T+KYLNV+WGNESSAARWQ
Sbjct: 301 VGCFVRVGIGRSKTGAIYRLCMVRNVDATDPDRTYKLENKTTYKYLNVVWGNESSAARWQ 360

Query: 361 MAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQE 420
           MAM+SDS PLE+E++Q ++EVER+GGRM SKQD+LEKKEA+QKA  FVYSAATVKQMLQE
Sbjct: 361 MAMISDSPPLEEEFRQLIREVERSGGRMPSKQDVLEKKEALQKAKTFVYSAATVKQMLQE 420

Query: 421 KKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIRL 480
           KKS+SSRPLN+AAEKDRLR  +++A SK+++ EVERIK +L+QL+ASR+SQ KDAKA+RL
Sbjct: 421 KKSSSSRPLNVAAEKDRLRRDLEIAQSKHDDVEVERIKKRLQQLEASRQSQEKDAKAVRL 480

Query: 481 SEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAA 540
           +EMNRKNRVENFKNAS L+P    LKAGEAGYDPFSRRWTRSRNYY + A   D AA A 
Sbjct: 481 AEMNRKNRVENFKNASGLKPVNTGLKAGEAGYDPFSRRWTRSRNYYNAKAPGGDAAAVAN 540

Query: 541 GNSDNITPA--SVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFE 600
           G+++    +      G  + EAG AATAAAL+ AAGAGKLVDTNAPVD GTESN+LH+FE
Sbjct: 541 GDTNGAIGSGNGNDAGAAAAEAGRAATAAALQEAAGAGKLVDTNAPVDEGTESNMLHDFE 600

Query: 601 LPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
           LPISL VL+KFGG  GA AGF+ARKQRIEATVG +VPENDGRRHALTLTVSDYKRRRGLL
Sbjct: 601 LPISLDVLRKFGGHEGAVAGFMARKQRIEATVGCRVPENDGRRHALTLTVSDYKRRRGLL 660

BLAST of CmoCh16G007190 vs. TAIR10
Match: AT1G61040.1 (AT1G61040.1 plus-3 domain-containing protein)

HSP 1 Score: 800.4 bits (2066), Expect = 8.1e-232
Identity = 456/659 (69.20%), Positives = 531/659 (80.58%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           M DLENLLLEAAGRT ++GR+RH  PPS R+REGSYSD  SDSRDD SD++RGYASRKPS
Sbjct: 1   MGDLENLLLEAAGRTNSAGRSRH--PPSSRRREGSYSDGSSDSRDD-SDEDRGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRL+  ER+D     E G   D  S+ EGDSS+ESD GDDLYK+++DR+KLAG
Sbjct: 61  GSQVPLKKRLEA-EREDRAARVEGGYG-DGPSDREGDSSEESDFGDDLYKNEEDRQKLAG 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 180
           M+E QREMILS+RA KK DK+  E LRSK +  K   S+KE  PLP+SR +RSSARSADR
Sbjct: 121 MTEFQREMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKR+KQQDP A RKLRD S+G + +R FS TKRKP  + +LSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSD 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           S+SR QSDDEGS  +GGM+DSDD+R    S+ PTF+D+KE+TIRRSKLAKWLMEPFFEEL
Sbjct: 241 SDSRSQSDDEGS--NGGMLDSDDDR----SDVPTFEDVKEVTIRRSKLAKWLMEPFFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGCFVRVGIGRS+SGPIYRLC V+NVDAT+PD+ YKLENK THKYLNV+WGNE+SAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAM+SD  PLE+EY+QW++EVERT GRM +KQDI EKKEAIQ+ N+FVYSA TVKQMLQ
Sbjct: 361 QMAMISDGHPLEEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKSAS RP+N+AAEKDRLR ++++A SKN+EA VERIK+K++QLDASR  +  D KA++
Sbjct: 421 EKKSASVRPMNVAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALK 480

Query: 481 LSEMNRKNRVENFKNASELRP-TKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 540
           L+EMN+KNR ENFKNASE++  T  LKAGEAGYDPFSRRWTRS NYY       DG    
Sbjct: 481 LAEMNKKNRAENFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENE 540

Query: 541 AGNSDNITPASVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFEL 600
           A        A+V T      AGV AT AALEAAA AGKL+DT AP+  G E N LHNFEL
Sbjct: 541 AA-----VAAAVETNGADAGAGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFEL 600

Query: 601 PISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
            +SLT LQK+GG  G Q  F+ARKQ  EATVG +V ENDG+RH LTLTVSDYKRRRGLL
Sbjct: 601 SLSLTALQKYGGPQGVQKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL 643

BLAST of CmoCh16G007190 vs. NCBI nr
Match: gi|659107572|ref|XP_008453742.1| (PREDICTED: RNA polymerase-associated protein RTF1 homolog [Cucumis melo])

HSP 1 Score: 1141.3 bits (2951), Expect = 0.0e+00
Identity = 610/660 (92.42%), Positives = 635/660 (96.21%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT A G NRHSHPPSRRQREGSYSD GSDSRDDDSDDERGYASRKPS
Sbjct: 1   MADLENLLLEAAGRTNAGGGNRHSHPPSRRQREGSYSDGGSDSRDDDSDDERGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG
Sbjct: 61  GSQVPLKKRLDPTERDDDGGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRA 180
           MSELQREMILSDRASKKNDKHLYESLR+KMDKGK APSRKE PPLPSSRIRSSARSADRA
Sbjct: 121 MSELQREMILSDRASKKNDKHLYESLRAKMDKGKTAPSRKETPPLPSSRIRSSARSADRA 180

Query: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSES 240
           AAKDDALNELRAKRLKQQDPEAHRKLRD SRGN+NNRRFSPTKRKPFTAPSLSSSSQSES
Sbjct: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDASRGNSNNRRFSPTKRKPFTAPSLSSSSQSES 240

Query: 241 ESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEELI 300
           ESRFQSDDEGSTGDGGMIDSDDER+MPGS+GPTF+DIKEITIRRSKLAKWLMEPFFEELI
Sbjct: 241 ESRFQSDDEGSTGDGGMIDSDDERSMPGSDGPTFEDIKEITIRRSKLAKWLMEPFFEELI 300

Query: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQ 360
           VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQ
Sbjct: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNENSAARWQ 360

Query: 361 MAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQE 420
           MAMVSDS PLEDEYKQWVKEVERTGGRMLSKQD+LEKK+AIQK NNFVYSAATVKQMLQ+
Sbjct: 361 MAMVSDSAPLEDEYKQWVKEVERTGGRMLSKQDVLEKKDAIQKVNNFVYSAATVKQMLQD 420

Query: 421 KKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIRL 480
           KKSAS+RPLNIAAEKDRLR +MDVA+SKN+EAEVERIK +L+QL+ASRR QMKDAKAIRL
Sbjct: 421 KKSASARPLNIAAEKDRLRREMDVAVSKNDEAEVERIKGRLQQLEASRRLQMKDAKAIRL 480

Query: 481 SEMNRKNRVENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAG 540
           +EMNRKNRVENFKNASELRP KDLKAGEAGYDPFSRRWTRSRNYYVSNAG+ +GAAEAAG
Sbjct: 481 AEMNRKNRVENFKNASELRPLKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAAG 540

Query: 541 NSDNITPASVTTGTGSG---EAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFE 600
           NSD +TPA  +T TG+G   +AG+AATAAALEAAAGAGKLVDTNAPVDGGTESN LHNFE
Sbjct: 541 NSDTVTPALESTRTGAGGTSDAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE 600

Query: 601 LPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
           LPISL +LQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Sbjct: 601 LPISLAMLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 660

BLAST of CmoCh16G007190 vs. NCBI nr
Match: gi|449462844|ref|XP_004149150.1| (PREDICTED: RNA polymerase-associated protein RTF1 homolog [Cucumis sativus])

HSP 1 Score: 1137.5 bits (2941), Expect = 0.0e+00
Identity = 609/660 (92.27%), Positives = 636/660 (96.36%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT A+GRNRHSHPPSRRQREGSYSDAGSDSRDDDSDD+RGYASRKPS
Sbjct: 1   MADLENLLLEAAGRTNAAGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDDRGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEGEDEDVGSE EGDSSDESDVGDDLYKDDDDRRKLAG
Sbjct: 61  GSQVPLKKRLDPTERDDDGGSQEEGEDEDVGSEREGDSSDESDVGDDLYKDDDDRRKLAG 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRA 180
           MSELQREMILSDRASKKNDKHLYESLR+KMDKGK+APSRKE PPLPSSRIRSSARSADRA
Sbjct: 121 MSELQREMILSDRASKKNDKHLYESLRAKMDKGKSAPSRKETPPLPSSRIRSSARSADRA 180

Query: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSES 240
           AAKDDALNELRAKRLKQQDPEAHRKLRD SRGNAN+RRFSPTKRKPFTAPSLSSSSQSES
Sbjct: 181 AAKDDALNELRAKRLKQQDPEAHRKLRDASRGNANSRRFSPTKRKPFTAPSLSSSSQSES 240

Query: 241 ESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEELI 300
           ESRFQSDDEGSTGDGGMIDSDDER++PGS+GPTF+DIKE+TIRRSKLAKWLMEPFFEELI
Sbjct: 241 ESRFQSDDEGSTGDGGMIDSDDERSIPGSDGPTFEDIKEVTIRRSKLAKWLMEPFFEELI 300

Query: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQ 360
           VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNE+SAARWQ
Sbjct: 301 VGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNEASAARWQ 360

Query: 361 MAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQE 420
           MAMVSDS PLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQK NNFVYSAATVKQMLQ+
Sbjct: 361 MAMVSDSAPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKVNNFVYSAATVKQMLQD 420

Query: 421 KKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIRL 480
           KKSAS+RPLNIAAEKDRLR +MDVA+SKN+EAEVERIK +L+QL+ASRR QMKDAKAIRL
Sbjct: 421 KKSASARPLNIAAEKDRLRREMDVAVSKNDEAEVERIKTRLQQLEASRRLQMKDAKAIRL 480

Query: 481 SEMNRKNRVENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAG 540
           +EMNRKNRVENFKNASELRP KDLKAGEAGYDPFSRRWTRSRNYYVSNAG+ +GAAEAAG
Sbjct: 481 AEMNRKNRVENFKNASELRPLKDLKAGEAGYDPFSRRWTRSRNYYVSNAGEANGAAEAAG 540

Query: 541 NSDNITPASVTTGT---GSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFE 600
           NSDN+TPA   T T   G+ +AG+AATAAALEAAAGAGKLVDTNAPVDGGTESN LHNFE
Sbjct: 541 NSDNVTPALENTRTEAGGTSDAGMAATAAALEAAAGAGKLVDTNAPVDGGTESNSLHNFE 600

Query: 601 LPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
           LPISL +LQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
Sbjct: 601 LPISLAMLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 660

BLAST of CmoCh16G007190 vs. NCBI nr
Match: gi|802784101|ref|XP_012091565.1| (PREDICTED: RNA polymerase-associated protein RTF1 homolog [Jatropha curcas])

HSP 1 Score: 926.0 bits (2392), Expect = 3.6e-266
Identity = 514/660 (77.88%), Positives = 577/660 (87.42%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT +SGRNR++HPPSRR+REGSYSD GSDSRD+DSDD+RGYASRKPS
Sbjct: 1   MADLENLLLEAAGRTGSSGRNRNAHPPSRRRREGSYSDGGSDSRDEDSDDDRGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEG  +D  S+ EGDSSDESDVGDDLYKD+DDRRKLA 
Sbjct: 61  GSQVPLKKRLDPAERDDDQGSQEEGGYDDGASDREGDSSDESDVGDDLYKDEDDRRKLAQ 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 180
           MSEL+REMILS+RA KK DK+L E +RSK D  +A  SRKE PPLPSSR +R+SARSADR
Sbjct: 121 MSELEREMILSERADKKGDKNLTERIRSKRDSERATRSRKETPPLPSSRGVRTSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKRLKQQDPEAHRKLRD SRG + +R  SP +RK FT+ SLSSSS SE
Sbjct: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDVSRGTSGSRGVSPVRRKRFTSASLSSSS-SE 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           S+SR  S+DE STGDGGM DSD++R  PGS G T+DDI+E+TIRRSKLAKWLMEP+FEEL
Sbjct: 241 SDSRSHSEDEASTGDGGMADSDEDRE-PGSEGLTYDDIREVTIRRSKLAKWLMEPWFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGCFVRVGIGRS+SGPIYRLCLVRNVDA +PDR YKLENK T+KYLNVIWGNESSAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCLVRNVDAADPDRPYKLENKTTYKYLNVIWGNESSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAMVSDS P EDEYKQWV+EVER+GGRM +KQDILEKKEAI+K+N FVYSAATVKQMLQ
Sbjct: 361 QMAMVSDSAPTEDEYKQWVREVERSGGRMPTKQDILEKKEAIKKSNTFVYSAATVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKSAS+RPLN+AAEKDRLR +++VA  K ++AEVERI+A++++L+ASR++Q KDAKAIR
Sbjct: 421 EKKSASTRPLNVAAEKDRLRRELEVAQMKQDDAEVERIRARIQELEASRQAQEKDAKAIR 480

Query: 481 LSEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 540
           L+EMNRKNR ENF+NASEL+P    LKAGEAGYDPFSRRWTRSRNYYVS  G  D AAEA
Sbjct: 481 LAEMNRKNRAENFRNASELKPVNTSLKAGEAGYDPFSRRWTRSRNYYVSKPGGADVAAEA 540

Query: 541 AGNSDNITPASVTTGTGS-GEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFE 600
             N       S    TG+  EAG+AATAAALEAAA AGKLVDT APVD GTESN LH+F+
Sbjct: 541 NNNGTAAVAHSNGAATGTLAEAGMAATAAALEAAADAGKLVDTAAPVDQGTESNTLHDFD 600

Query: 601 LPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 658
           LPISLT L+KFGGA GA+AGF+ARKQ+IEATVG +VPENDGRRHALTLTVSDYKRRRGLL
Sbjct: 601 LPISLTALEKFGGAKGAKAGFMARKQQIEATVGCRVPENDGRRHALTLTVSDYKRRRGLL 658

BLAST of CmoCh16G007190 vs. NCBI nr
Match: gi|703114113|ref|XP_010100559.1| (RNA polymerase-associated protein RTF1-like protein [Morus notabilis])

HSP 1 Score: 924.9 bits (2389), Expect = 8.1e-266
Identity = 516/662 (77.95%), Positives = 577/662 (87.16%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MA+LENLLLEAAGRT+++GRNRHS PPSRR+REGSYSD GSDSRDDDSDD+RGYA+RKPS
Sbjct: 1   MAELENLLLEAAGRTRSAGRNRHSIPPSRRRREGSYSDGGSDSRDDDSDDDRGYANRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP E DDD GS EEG+D D GS+ EGDS  ESDVG DLYKDDDDRRKLA 
Sbjct: 61  GSQVPLKKRLDPTEMDDDQGSEEEGDD-DRGSDREGDS--ESDVGSDLYKDDDDRRKLAE 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMD-KGKAAPSRKENPPLPSSRIRSSARSADR 180
           M+ELQREMIL DRASKK DK+L E LR K D KGKA  SRKE  PLPSSR+RSSARSADR
Sbjct: 121 MTELQREMILLDRASKKEDKNLKEKLRPKWDNKGKATQSRKET-PLPSSRVRSSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKRLKQQDPEAH KLRD SRG + +R     KRK +TA SLSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRLKQQDPEAHGKLRDASRGGSGSRNLLHNKRKSYTATSLSSSSQSD 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           SES  QS+DEGSTGDGGMIDSDDER +PGS G TFDDIKE+T+RRSKLAKWLMEPFFEEL
Sbjct: 241 SESESQSEDEGSTGDGGMIDSDDERGIPGSEGLTFDDIKEVTVRRSKLAKWLMEPFFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGCFVRVGIGRS+SGPIYRLC+VRNVDA+EPDRQYKL+NKITHKYLNV+WGNE+SAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCMVRNVDASEPDRQYKLDNKITHKYLNVVWGNENSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAMVSDS P E+E+KQWV+EVER+GGRM +K DIL+KKE+I+K N FVYSAATVKQMLQ
Sbjct: 361 QMAMVSDSVPNEEEFKQWVREVERSGGRMPTKHDILDKKESIKKINTFVYSAATVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKSAS+RPLNIA EKDRLR +++VA SKN+E EV+RIK +L++L+ASR+++  DAKAIR
Sbjct: 421 EKKSASARPLNIALEKDRLRRELEVAQSKNDEVEVDRIKTRLQELEASRKAKQTDAKAIR 480

Query: 481 LSEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQV-DGAAE 540
           L+EMNRKNRVENFKNASEL+P    LKAGEAGYDPFSRRWTRSRNYYV   G+V + +  
Sbjct: 481 LAEMNRKNRVENFKNASELKPVNTGLKAGEAGYDPFSRRWTRSRNYYVGKPGEVKEDSGA 540

Query: 541 AAGNSDNITPA--SVTTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHN 600
            AGN+   T A  +   G  + EAG+AAT AALEAAA AGKLVDTNAPVD GT SN+LHN
Sbjct: 541 NAGNNGASTDAENNGRHGIVAAEAGIAATEAALEAAADAGKLVDTNAPVDQGTVSNMLHN 600

Query: 601 FELPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRG 658
           FELPISL+VLQKFGG  GAQAGF+ARKQRIEATVG +VPENDGRRHALTL+V DYKRRRG
Sbjct: 601 FELPISLSVLQKFGGPQGAQAGFMARKQRIEATVGCRVPENDGRRHALTLSVGDYKRRRG 658

BLAST of CmoCh16G007190 vs. NCBI nr
Match: gi|590624747|ref|XP_007025691.1| (PAF1 complex component isoform 1 [Theobroma cacao])

HSP 1 Score: 924.9 bits (2389), Expect = 8.1e-266
Identity = 507/664 (76.36%), Positives = 573/664 (86.30%), Query Frame = 1

Query: 1   MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 60
           MADLENLLLEAAGRT   GRNRHS PPSRR+REGSYSD GSDSRDDDSDD+ GYASRKPS
Sbjct: 1   MADLENLLLEAAGRTGTGGRNRHSLPPSRRRREGSYSDGGSDSRDDDSDDDHGYASRKPS 60

Query: 61  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 120
           GSQVPLKKRLDP ERDDD GS EEG+ +D  S HEGDSSDESDVGDDLYK++DDRRKLA 
Sbjct: 61  GSQVPLKKRLDPAERDDDQGSQEEGDYDDGVSVHEGDSSDESDVGDDLYKNEDDRRKLAQ 120

Query: 121 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 180
           M+EL+RE+ILS+RA K+ DK   E +RSK +  + + SRKE PPLPSSR +RSSARSADR
Sbjct: 121 MTELERELILSERADKRGDKKFTEKIRSKRENDRPSRSRKETPPLPSSRGVRSSARSADR 180

Query: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 240
           AAAKDDALNELRAKRLKQQDPEAHRKLRD SRG++ +R  SP KRKPFTA SLSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRLKQQDPEAHRKLRDASRGSSGSRGLSPVKRKPFTASSLSSSSQSD 240

Query: 241 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 300
           SESR  S+DEGSTGDGGM+DSDD+R M G +GPTFDDIKEITIRRSKLAKW MEPFFEEL
Sbjct: 241 SESRSNSEDEGSTGDGGMVDSDDDRGMQGPDGPTFDDIKEITIRRSKLAKWFMEPFFEEL 300

Query: 301 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 360
           IVGC+VRVGIGRS+SGPIYRLC+VRNVDATEP+R YKLENK T+KYLNV+WGNESSAARW
Sbjct: 301 IVGCYVRVGIGRSKSGPIYRLCMVRNVDATEPERTYKLENKTTYKYLNVVWGNESSAARW 360

Query: 361 QMAMVSDSGPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 420
           QMAM+SDS P E+E++Q ++E+ER+GGRM SKQD+LEKKEA+QKA  FVYSAATVKQMLQ
Sbjct: 361 QMAMISDSPPQEEEFRQLIRELERSGGRMPSKQDVLEKKEALQKAKTFVYSAATVKQMLQ 420

Query: 421 EKKSASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 480
           EKKS SSRPLNIAAEKDRLR  +++A SK++EAEVERIK +L+QL+ASR++Q KDAKA+R
Sbjct: 421 EKKSTSSRPLNIAAEKDRLRRDLEIAQSKHDEAEVERIKMRLQQLEASRQAQEKDAKAVR 480

Query: 481 LSEMNRKNRVENFKNASELRPTK-DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 540
           L+EMNRKNR ENFKNASEL+P    LKAGEAGYDPFSRRWTRSRNYYV+     D AA A
Sbjct: 481 LAEMNRKNRAENFKNASELKPVNTGLKAGEAGYDPFSRRWTRSRNYYVAKPPGADAAAVA 540

Query: 541 AGNSDNITPASVTTGTG-----SGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLL 600
            G+   +    + +G G     + EAG AAT AAL+ AAGAGKLVDT+APVD GTESN+L
Sbjct: 541 NGDRIGV----IASGNGNDARAAAEAGRAATVAALQEAAGAGKLVDTSAPVDEGTESNML 600

Query: 601 HNFELPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRR 658
           H+FE+PISL  LQ+FGG  GA AGF+ARKQRIEATVG QVPENDGRRHALTLTVSDYKRR
Sbjct: 601 HDFEIPISLNALQRFGGPQGAVAGFMARKQRIEATVGCQVPENDGRRHALTLTVSDYKRR 660

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
VIP5_ARATH        1.4e-230  69.20         Protein RTF1 homolog OS=Arabidopsis thaliana GN=VIP5 PE=1 SV=1
RTF1_HUMAN        7.1e-20   25.53         RNA polymerase-associated protein RTF1 homolog OS=Homo sapiens GN=RTF1 PE=1 SV=4
RTF1_MOUSE        1.2e-19   25.29         RNA polymerase-associated protein RTF1 homolog OS=Mus musculus GN=Rtf1 PE=1 SV=1
RTF1_PONAB        2.8e-16   25.37         RNA polymerase-associated protein RTF1 homolog (Fragment) OS=Pongo abelii GN=RTF...
RTF1_SCHPO        1.7e-13   24.66         RNA polymerase-associated protein C651.09c OS=Schizosaccharomyces pombe (strain ...

Match Name        E-value   Identity (%)  Description
A0A0A0KXW1_CUCSA  0.0e+00   92.27         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G038650 PE=4 SV=1
A0A067JDU6_JATCU  2.5e-266  77.88         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21409 PE=4 SV=1
A0A061GFX4_THECC  5.6e-266  76.36         PAF1 complex component isoform 1 OS=Theobroma cacao GN=TCM_029914 PE=4 SV=1
W9RDA9_9ROSA      5.6e-266  77.95         RNA polymerase-associated protein RTF1-like protein OS=Morus notabilis GN=L484_0...
A0A0D2T1B1_GOSRA  1.5e-263  76.21         Uncharacterized protein OS=Gossypium raimondii GN=B456_011G025800 PE=4 SV=1

Match Name        E-value   Identity (%)  Description
AT1G61040.1       8.1e-232  69.20         plus-3 domain-containing protein

Match Name                        E-value   Identity (%)  Description
gi|659107572|ref|XP_008453742.1|  0.0e+00   92.42         PREDICTED: RNA polymerase-associated protein RTF1 homolog [Cucumis melo]
gi|449462844|ref|XP_004149150.1|  0.0e+00   92.27         PREDICTED: RNA polymerase-associated protein RTF1 homolog [Cucumis sativus]
gi|802784101|ref|XP_012091565.1|  3.6e-266  77.88         PREDICTED: RNA polymerase-associated protein RTF1 homolog [Jatropha curcas]
gi|703114113|ref|XP_010100559.1|  8.1e-266  77.95         RNA polymerase-associated protein RTF1-like protein [Morus notabilis]
gi|590624747|ref|XP_007025691.1|  8.1e-266  76.36         PAF1 complex component isoform 1 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004343   Plus-3_dom
Vocabulary: Biological Process
Term        Definition
GO:0006368  transcription elongation from RNA polymerase II promoter
GO:0016570  histone modification
Vocabulary: Cellular Component
Term        Definition
GO:0016593  Cdc73/Paf1 complex
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006352 DNA-templated transcription, initiation
biological_process GO:0016570 histone modification
biological_process GO:0009910 negative regulation of flower development
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006368 transcription elongation from RNA polymerase II promoter
cellular_component GO:0016593 Cdc73/Paf1 complex
molecular_function GO:0003677 DNA binding
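Each row of the GO assignment table holds a category, an accession, and a term name that may itself contain spaces and commas. As a small illustration, not part of the original record, the Python sketch below groups the rows by category; the function name `group_go_terms` is hypothetical, and the parsing assumes the whitespace-separated layout shown above.

```python
from collections import defaultdict

# Rows copied from the GO assignment table above (header row omitted).
GO_TABLE = """\
biological_process GO:0006352 DNA-templated transcription, initiation
biological_process GO:0016570 histone modification
biological_process GO:0009910 negative regulation of flower development
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006368 transcription elongation from RNA polymerase II promoter
cellular_component GO:0016593 Cdc73/Paf1 complex
molecular_function GO:0003677 DNA binding
"""


def group_go_terms(table_text):
    """Group (accession, term name) pairs by GO category."""
    groups = defaultdict(list)
    for line in table_text.splitlines():
        if not line.strip():
            continue
        # Split on the first two runs of whitespace only, so the term
        # name keeps its internal spaces and commas.
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


groups = group_go_terms(GO_TABLE)
for category, terms in groups.items():
    print(category, len(terms))
```

Limiting the split to two separators is the key design choice here, since term names such as "DNA-templated transcription, initiation" would otherwise be broken apart.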

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G007190.1  CmoCh16G007190.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description   Source   Source Term    Source Description                              Alignment
IPR004343  Plus-3 domain     PFAM     PF03126        Plus-3                                          coord: 276..381; score: 8.3
IPR004343  Plus-3 domain     SMART    SM00719        rtf1                                            coord: 271..383; score: 2.4
IPR004343  Plus-3 domain     PROFILE  PS51360        PLUS3                                           coord: 271..406; score: 36
IPR004343  Plus-3 domain     unknown  SSF159042      Plus3-like                                      coord: 273..404; score: 2.09
None       No IPR available  unknown  Coil           Coil                                            coord: 435..467; scor
None       No IPR available  PANTHER  PTHR13115:SF8  RNA POLYMERASE-ASSOCIATED PROTEIN RTF1 HOMOLOG  coord: 1..657; score: 3.7E