Csa6G486670 (gene) Cucumber (Chinese Long) v2

Name: Csa6G486670
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Transcription elongation factor, putative, expressed; contains IPR017923 (Transcription factor IIS, N-terminal)
Location: Chr6 : 22529085 .. 22532034 (-)
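
The location field packs a chromosome name, a start and end coordinate, and a strand into one string. The sketch below shows one way to split it and derive the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Location`, `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed location string."""
    chrom: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr6 : 22529085 .. 22532034 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Chr6 : 22529085 .. 22532034 (-)")
print(loc.chrom, loc.strand, span_length(loc))
```

Under that coordinate assumption the gene spans 2950 bp on the minus strand of Chr6.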
   



The following sequences are available for this feature:

Gene sequence (with intron)

TTCTTCTTCTTCTTCTTCTTCTTCTACCTAATGTGAATTGACCCACTTCCAAAATTTTCCCTTTTAATTCTTTGAGGACGAAATTCCGGCTGTCTTCGTCTTCTCAAGGAATATTCCGTAGTTTTGATTTGGGTTCTTTTTAATTTATTGTTTCAGTTGCTTTTTGTGTTCTTAATTTTGTTTCCTGACTTGGGTTTCTGTAATTTTGACGGGTTGTCTTCTTCTACTTCTTCTTTTTCCTTCCGAACGGCAACTGAGAGTTTGGATTCTGAAAACGATCGGTTAAGCTTTTCTTTAGGATCGAATCAGATCTATTCTATGTCATTTGGGTTTCTTTTCTTTTTTGTTTCTTTCACAAAATTTCATTAATGGATTATGACGATTTTGCTTTTCCACCACTGGGGTTTCTGTGCAATTTCTAGCGTTTTTCCTTCTCTTTTGTTTATTGTCAGAGTTTGGAAGTAGAATAATAAACTTCCATCTTAGATCCGTTTCTGTTTCCTGAGTCCATTACTGAAAACAAGAACTACGAACGTTTTTTTCTTTTTCAGATTGTTTATCGGGAAGACAACCTTTCTCTTCTCTCAATCAATCTCCGATTCTTCTAATCGGACTCTTAAAAAAATGGTTCGGTTGGATTTAGATGATTTCCGAACTATATTAGATACAGCCGGCGTCGATGTTTGGACTTTCATCGATACAGCTATGGAGGTCGCTTCATTAGATTACGGTAACCAATTGAAGAATCGGAGAGATGGAATCGTTGAGAGGCTCTACGCGTTGACTTCGCCGCCATCTCGATGTCGGAACTGTGATACGGATCGCAACCATGACGGTCGCTCTAATGGCTGTGAAATCAAACAGGGGTCAGGTGAGGTTAAGGAAGCTTCCCCTTCAACGCCGCAGTTCGTTGTAGTGGAAGGGGATGACGATGGAGCAGACCCATATGCAGGATTGTTTGATGATGAACAGAAGAAGGTTCTGGAAATAAAGGAACAGCTCGAAATTCCTCAGCAGGTCTTCTTCTTTTGATACTGATTGTACAGAATTCTTTGATTGTTTTGAGGGATTTTTTCTCTTATCTTTACCAATTTGTTGTTTTTTGGCCTGTTTTAGCCTGAAGATGCTTTGGTTGAACTGCTTCAAAACTTAGCAGACATGGATATAACATTCCAAGCTCTTAAGGTAAGTGTTTATGGGGTGGTTTTTCTTGTTCTTTGTGTATGTAAATTCCTTTTCTTTTTCTTCTTCTCTGTTTCTTAATTCTTCTTGTTCTTGTTAGGAAACTGATATTGGAAGGCATGTGAATCGGTTGAGAAAGCATCCATCAAATGATGTTCGTAGATTGGTGAAACATCTTGTCAGGTACATAATTTTTTACATATTTGAAGGAATGGTTGATGCTTGTAATTTTCAGAAGAGGATAGTTGGTGAATTATCAGTTCAATTCAACTGTTTGTTAACGATATCAATTTGTTTTTCGACAGAAAATGGAAGGAGATTGTTGATGAATGGGTCAGGTTGAATCAGCCTGGGGAGCAAACAGCTACACTCTTAGGTAGAGACCAGTTTTGAACTTCGTCATTCAATTCCTTCGTAAGGTTAAATCTTAGTCACTCTGTTTGTAGAAAGATTTAGCTAAGAATGGTTTCTACTGTCAATGACAGCTGATGGCGACTCACCCCAGCAAAAAGCTCCTCAAAATGGCTATCATCAGGTGAACTGAAATCATTCATCGTATTCGGTTCTAATGTTCGTTAAGAAGTTTATCCTTTTCATAGTTTCTAATGTTTTATTGTTGTTGTTGTGTAGGTTCCTGATTTTGCATACTCTCCGAATCCACACAGTGAGTATTGAATTAGAGCTTATTGTTTCACTTGTCATCATGTACCACGGATTGATTTGGTTTCTCATGATCCCTCCCTTTTTCTTTGTTTTGGTAGATGGAGGTTCTGGGTCCGATAGAAATAACTCAGAACCGGAACCGAAGGGGAAATCCG
TTCCAAGAAGAGATGCCCTTCCGAAACCAACACAACAAGCTCCAACGTCGTCTTTAACTCCTCAAAATGTATGGTAACCAAATTAGAGTCTTTTAACTCTATGTTTGGTTGCCAAAAATGGAGGAAGAAAATTGAAACTTACGATTCTTATGCTTCCGAAAACATAACAAACTTAACTCAAACTGGGATCAATTGTGTTGAAATTTGACAAAATTTGAGTGGTAGGTCTAAATTTTCTCAGGTCCCAGATGATATGCATGGTCTTACCTTACTTACAGTCAGCCAAATTTTGTCTTCAACAACTTAGCTGTTTTAGGAATGTTTTTCTTATTATGTCAATTGAATTGATGGCCATTTGCAGAGACAGAAAGAACAACAGAAAGAAGCAAATTTTGATTCCCAAAAGCTTGCATCGGCTAGAAAAAGGCTTCAAGAGAATTACAAGGAAGCTGAAAATGGTTTGGCCTACATCTAAGTTCTCTTTGATTTCATTTCTGGCTCTTTTTAGAACAAACAAAACATACTAATTTCTGTTTGTCTGTTTCTTTTTTTGAAGCTAAAAGGCAAAGAACAATTCAAGTCATGGACATCCATGAGATCCCAAAGCCAAAGAATGCCTTCTTCTCAAAGAACAAAGGCAGTGGTGGTGGTTCTCAAGGCAGACACTGGTGATAGATAATTTGGCAATGCAGTAAAAGCAAACAACATCTTCTAAATTAGGGTTATCCGTTATCGTCGAACGAATCGGTTTATCATCATCCTTTCTATTTGTTACATCTTTTGATCATTAAATTCTTAACTTCTCTATCAAGTTTAAAAAGAGAATCAAAAAACTACTTTTCGCGTAAATTTTCTCTTAATAAGCAATGCTAATACTGATATATTGTTAATCTACTGAGTCTTGAAACCTCACTGCCTGTTCTATATCATGTAAAACTCAAATAGGAAGG

mRNA sequence

ATGGTTCGGTTGGATTTAGATGATTTCCGAACTATATTAGATACAGCCGGCGTCGATGTTTGGACTTTCATCGATACAGCTATGGAGGTCGCTTCATTAGATTACGGTAACCAATTGAAGAATCGGAGAGATGGAATCGTTGAGAGGCTCTACGCGTTGACTTCGCCGCCATCTCGATGTCGGAACTGTGATACGGATCGCAACCATGACGGTCGCTCTAATGGCTGTGAAATCAAACAGGGGTCAGGTGAGGTTAAGGAAGCTTCCCCTTCAACGCCGCAGTTCGTTGTAGTGGAAGGGGATGACGATGGAGCAGACCCATATGCAGGATTGTTTGATGATGAACAGAAGAAGGTTCTGGAAATAAAGGAACAGCTCGAAATTCCTCAGCAGCCTGAAGATGCTTTGGTTGAACTGCTTCAAAACTTAGCAGACATGGATATAACATTCCAAGCTCTTAAGGAAACTGATATTGGAAGGCATGTGAATCGGTTGAGAAAGCATCCATCAAATGATGTTCGTAGATTGGTGAAACATCTTGTCAGAAAATGGAAGGAGATTGTTGATGAATGGGTCAGGTTGAATCAGCCTGGGGAGCAAACAGCTACACTCTTAGCTGATGGCGACTCACCCCAGCAAAAAGCTCCTCAAAATGGCTATCATCAGGTTCCTGATTTTGCATACTCTCCGAATCCACACAATGGAGGTTCTGGGTCCGATAGAAATAACTCAGAACCGGAACCGAAGGGGAAATCCGTTCCAAGAAGAGATGCCCTTCCGAAACCAACACAACAAGCTCCAACGTCGTCTTTAACTCCTCAAAATGTATGGTAA

Coding sequence (CDS)

ATGGTTCGGTTGGATTTAGATGATTTCCGAACTATATTAGATACAGCCGGCGTCGATGTTTGGACTTTCATCGATACAGCTATGGAGGTCGCTTCATTAGATTACGGTAACCAATTGAAGAATCGGAGAGATGGAATCGTTGAGAGGCTCTACGCGTTGACTTCGCCGCCATCTCGATGTCGGAACTGTGATACGGATCGCAACCATGACGGTCGCTCTAATGGCTGTGAAATCAAACAGGGGTCAGGTGAGGTTAAGGAAGCTTCCCCTTCAACGCCGCAGTTCGTTGTAGTGGAAGGGGATGACGATGGAGCAGACCCATATGCAGGATTGTTTGATGATGAACAGAAGAAGGTTCTGGAAATAAAGGAACAGCTCGAAATTCCTCAGCAGCCTGAAGATGCTTTGGTTGAACTGCTTCAAAACTTAGCAGACATGGATATAACATTCCAAGCTCTTAAGGAAACTGATATTGGAAGGCATGTGAATCGGTTGAGAAAGCATCCATCAAATGATGTTCGTAGATTGGTGAAACATCTTGTCAGAAAATGGAAGGAGATTGTTGATGAATGGGTCAGGTTGAATCAGCCTGGGGAGCAAACAGCTACACTCTTAGCTGATGGCGACTCACCCCAGCAAAAAGCTCCTCAAAATGGCTATCATCAGGTTCCTGATTTTGCATACTCTCCGAATCCACACAATGGAGGTTCTGGGTCCGATAGAAATAACTCAGAACCGGAACCGAAGGGGAAATCCGTTCCAAGAAGAGATGCCCTTCCGAAACCAACACAACAAGCTCCAACGTCGTCTTTAACTCCTCAAAATGTATGGTAA

Protein sequence

MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVLEIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHLVRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSDRNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNVW*
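
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. As a minimal sketch, the function below translates a DNA string with the standard genetic code (assumed here; the record does not name a translation table). `translate` is a hypothetical helper, and the two short inputs are the first and last codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First nine and last five codons of the CDS in this record.
print(translate("ATGGTTCGGTTGGATTTAGATGATTTC"))
print(translate("CAAAATGTATGGTAA"))
```

The two fragments translate to `MVRLDLDDF` and `QNVW*`, matching the start and end of the protein sequence shown above.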
BLAST of Csa6G486670 vs. Swiss-Prot
Match: MD26C_ARATH (Probable mediator of RNA polymerase II transcription subunit 26c OS=Arabidopsis thaliana GN=MED26C PE=1 SV=1)

HSP 1 Score: 365.5 bits (937), Expect = 6.2e-100
Identity = 204/363 (56.20%), Positives = 251/363 (69.15%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR+++D AGVDVWTFIDTA+ VASLDYG +LK RRD IVERLYA TS  ++CRNC
Sbjct: 1   MDLDDFRSVMDNAGVDVWTFIDTAILVASLDYGQELKRRRDNIVERLYA-TSMANKCRNC 60

Query: 64  DTDRNHD------GRSNGCEIKQGSGEVKE------ASPSTPQFVVVEGDDDGADPYAGL 123
           D     +      GR N   + + + E  E      A     +  V   DDD  DP+AGL
Sbjct: 61  DFGGGGNVTEAAIGRVNNGRVHEETEEEDEEGVTAAAEEEVREKSVNVEDDDDFDPFAGL 120

Query: 124 FDDEQKKVLEIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSN 183
           FDDEQK ++EIKE+LE P   E++LVELLQNL DMDITFQAL+ETDIGRHVNR+RKHPSN
Sbjct: 121 FDDEQKSIVEIKEKLEDPDLSEESLVELLQNLEDMDITFQALQETDIGRHVNRVRKHPSN 180

Query: 184 DVRRLVKHLVRKWKEIVDEWVRLNQPGE-QTATLLADGDSPQQKAPQNG-YHQVPDFAYS 243
           +VRRL K LV+KWKE VDEWV+ NQPG+ +  +L+AD DSP QKA  NG   QVPDF YS
Sbjct: 181 NVRRLAKQLVKKWKETVDEWVKFNQPGDLEPPSLIADEDSPVQKALHNGSRQQVPDFGYS 240

Query: 244 PNPHNGGSGSDRNN--SEPEPKGKSV---PRRD--ALPKPTQQAPTSSLTPQNRQKEQQK 303
           P P NG S S +N+  +EPE K + V   PRR+  +  KP++ +P+    P+++   + K
Sbjct: 241 PVPQNGYSSSSKNSNITEPERKPRPVAPQPRRESPSPAKPSRPSPSQQTIPRDK---EHK 300

Query: 304 EANFDSQKLASARKRLQENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQG 346
           E +FD     +ARKRLQ+NY++AENAK+QRTIQVMDIH+IPKPK   F   KG G    G
Sbjct: 301 EVDFD-----TARKRLQQNYRQAENAKKQRTIQVMDIHDIPKPKKGGFFPRKG-GSSQGG 353

BLAST of Csa6G486670 vs. Swiss-Prot
Match: MD26B_ARATH (Probable mediator of RNA polymerase II transcription subunit 26b OS=Arabidopsis thaliana GN=MED26B PE=2 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 6.6e-17
Identity = 99/362 (27.35%), Positives = 151/362 (41.71%), Query Frame = 1

Query: 6   LDDFRTILDTAG-VDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNCD 65
           LD +R      G  D++  ID A+ VA+ D  N+ K+RRD I E L++     +RC  CD
Sbjct: 7   LDSWREYFRRRGDSDIFGIIDHAIMVAATDCPNKFKSRRDKIAELLFSCRV--NRCVGCD 66

Query: 66  -----------TDRNHDGRSNG-------CEIKQGSGEVKEASPSTPQFVVVEG----DD 125
                       +R   G   G        E+  GS E K  S       +V      + 
Sbjct: 67  HLELSVPGDDEANRGTTGNGGGGTAVDEDYEVAGGSKESKANSSRGDNNQIVSNYTFDEA 126

Query: 126 DGADPYAGLFDDEQKKVLEIKE-QLEIPQQPEDALVELLQNLADMDITFQALKETDIGRH 185
           +        F    K+V  IKE  L    +P   L++ L++L  M +    LK T+IG+ 
Sbjct: 127 EALSDEIEEFSVVSKEVARIKEILLNKEDEPNSVLLDSLRHLKLMSLNVDILKSTEIGKA 186

Query: 186 VNRLRKHPSNDVRRLVKHLVRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQ--NG 245
           VN LRKH S+ +R+L K L+ +WKE+VD+WV  N   E T    A+G +P+   P   + 
Sbjct: 187 VNGLRKHSSDKIRQLAKTLIAEWKELVDQWV--NTTKEITG---AEG-TPESANPSVLDE 246

Query: 246 YHQVPDFAYSPNPHNGGSGSDRNNSEPEPKGKSVPR------RDALPKPTQQAPTS---S 305
               P   Y           D +   PEP G  +         D  P+ +++  TS    
Sbjct: 247 EEAFPSLPY-----------DVDIFTPEPNGFEISHFFDSLDFDGNPRNSEEHNTSREHE 306

Query: 306 LTPQNRQKEQQ-------KEANFDSQKLASARKRLQENYKEAENAKRQRTIQVMDIHEIP 326
             PQN  K +        ++A F S K +SA           ++ +++   + + +H+  
Sbjct: 307 RRPQNIAKRKPEGTQMRIQDAPFRSIKPSSATDFDGTRRPVKQSTEQRMKNETVSVHKSE 349


HSP 2 Score: 46.2 bits (108), Expect = 8.4e-04
Identity = 35/109 (32.11%), Positives = 57/109 (52.29%), Query Frame = 1

Query: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQK---EQQKEANFDSQ-KLASARK 300
           + ++E   K ++V    +     Q+ P   +T Q R+    +Q+K    D+  K   A++
Sbjct: 329 KQSTEQRMKNETVSVHKSEKPMIQRKPV--VTEQKRKAPGPQQEKLKGLDADAKFEFAKR 388

Query: 301 RLQENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           +LQE+Y+  ENAK+QRTIQV+++  IPK  +A   K +    G   R+W
Sbjct: 389 KLQESYQHHENAKKQRTIQVLEM--IPKQGSA--QKPQLKRPGMSNRNW 431

BLAST of Csa6G486670 vs. Swiss-Prot
Match: MD26A_ARATH (Probable mediator of RNA polymerase II transcription subunit 26a OS=Arabidopsis thaliana GN=MED26A PE=2 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.8e-15
Identity = 76/279 (27.24%), Positives = 126/279 (45.16%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           + LD +R        D++  ID A+ VA+ D+  + K+R D I E L++     SRC  C
Sbjct: 8   VSLDTWREYFRRGDSDIFGIIDHAIMVAAADWPKEFKSRSDRIAELLFSCKV--SRCIGC 67

Query: 64  D-TDRNHDGRSNGCEIKQ--GSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 123
           D  + +  G     EI    G G+  ++  +T      EG++      A +  DE  ++ 
Sbjct: 68  DHLELSIAGDEAAVEIVGVGGGGDRGDSGVATG-----EGEE------ASVSVDEVMRIR 127

Query: 124 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 183
           +I    +   + +  L+E L+ L  M ++   LK+T+IG+ VN LR+H S+ + +L K L
Sbjct: 128 DILSNKD--DEKDSVLLESLRKLESMSMSVDILKDTEIGKAVNGLRRHSSDKISKLAKTL 187

Query: 184 VRKWKEIVDEWVRLNQPGEQTAT-----LLADGDSPQQKAPQNGYHQVPDFAYSPNPHN- 243
             +WK +VD+W  +N P E   T      L      +++A  +  H +  +A  PN    
Sbjct: 188 FAEWKRLVDQW--MNTPEEMAGTEGTPESLNLSVIDEEEAFPSPPHDLDIYAPEPNGFEL 247

Query: 244 -------GGSGSDRNNSEPEPKGKSVPRRDALPKPTQQA 267
                     G+ R++ E + + KS       PK T  A
Sbjct: 248 SQILDCLDCDGNPRHSVESKHERKSQSSAGRRPKGTNDA 269


HSP 2 Score: 50.4 bits (119), Expect = 4.4e-05
Identity = 34/83 (40.96%), Positives = 50/83 (60.24%), Query Frame = 1

Query: 247 EPKGKSVPRRDALPKPTQQAPTSSLTPQNR-----QKEQQKEANFDSQKLASARKRLQEN 306
           EPK ++   R+ +    Q+ PT+ +T Q R     Q+++ K  + DS K   A+++LQE+
Sbjct: 300 EPKRQTKQSREQMVSAIQRKPTA-VTEQKRKLAGPQQDKLKALDPDS-KFEFAKRKLQES 359

Query: 307 YKEAENAKRQRTIQVMDIHEIPK 325
           Y + ENAKRQRTIQV++   IPK
Sbjct: 360 YHQHENAKRQRTIQVLE--TIPK 378

BLAST of Csa6G486670 vs. Swiss-Prot
Match: TEAN2_HUMAN (Transcription elongation factor A N-terminal and central domain-containing protein 2 OS=Homo sapiens GN=TCEANC2 PE=1 SV=1)

HSP 1 Score: 53.1 bits (126), Expect = 6.9e-06
Identity = 30/95 (31.58%), Positives = 46/95 (48.42%), Query Frame = 1

Query: 102 DDGADPYAGLFDDEQKKVLEI------KEQLEIPQQPEDALVELLQNLADMDITFQALKE 161
           D G   Y     +  K+V+ +      K  LE+P Q ++ LVE LQ L     + + LK 
Sbjct: 19  DSGGKVYKQATIESLKRVVVVEDIKRWKTMLELPDQTKENLVEALQELKKKIPSREVLKS 78

Query: 162 TDIGRHVNRLRKHPSNDVRRLVKHLVRKWKEIVDE 191
           T IG  VN++RKH  ++V  L + +  +WK   ++
Sbjct: 79  TRIGHTVNKMRKHSDSEVASLAREVYTEWKTFTEK 113

BLAST of Csa6G486670 vs. TrEMBL
Match: A0A0A0KHV9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G486670 PE=4 SV=1)

HSP 1 Score: 707.6 bits (1825), Expect = 7.5e-201
Identity = 345/345 (100.00%), Positives = 345/345 (100.00%), Query Frame = 1

Query: 1   MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC 60
           MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC
Sbjct: 1   MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC 60

Query: 61  RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 120
           RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL
Sbjct: 61  RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 120

Query: 121 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 180
           EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL
Sbjct: 121 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 180

Query: 181 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 240
           VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD
Sbjct: 181 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 240

Query: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE 300
           RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE
Sbjct: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE 300

Query: 301 NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW
Sbjct: 301 NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 345

BLAST of Csa6G486670 vs. TrEMBL
Match: A0A061DKK6_THECC (Transcription elongation factor (TFIIS) family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_001525 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 2.6e-129
Identity = 242/344 (70.35%), Positives = 274/344 (79.65%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR++L+TA VDVWTFIDTA+ VASLDYG +LK RRD IVERLYA TS  ++CRNC
Sbjct: 1   MDLDDFRSVLETAEVDVWTFIDTAILVASLDYGPELKQRRDRIVERLYA-TSMVTQCRNC 60

Query: 64  DTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVLEIK 123
           D            ++K+ S    +    +P     +  DD  DPY GLFDDEQKK+LEIK
Sbjct: 61  DFGERPSDYEVKADLKRESSHEDKRRGGSPNAPQSDNGDDELDPYGGLFDDEQKKILEIK 120

Query: 124 EQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHLVRK 183
           E LE P Q ED+L++LLQ+LADMDITFQALKETDIGRHVN LRKH SNDVRRLVK LVRK
Sbjct: 121 EHLEEPHQSEDSLIDLLQSLADMDITFQALKETDIGRHVNILRKHSSNDVRRLVKQLVRK 180

Query: 184 WKEIVDEWVRLNQPGE-QTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSDRN 243
           WKEIVDEWVRLNQPGE +++ L+ADGDSPQ+K PQNGYHQVPDFAYSPNPHNG SGSD+N
Sbjct: 181 WKEIVDEWVRLNQPGELESSALMADGDSPQRKPPQNGYHQVPDFAYSPNPHNGSSGSDKN 240

Query: 244 NSEPEPKGKSV-PRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQEN 303
           NSEPE K K + PR +  PKPT  AP      QNRQ+E QKE+NFDS++LASARKRLQEN
Sbjct: 241 NSEPERKPKPIPPRNEPPPKPTYSAPVL----QNRQRE-QKESNFDSERLASARKRLQEN 300

Query: 304 YKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           YKEAENAKRQRTIQVMDIHE+PKPKNAFF KNK  GG SQGRHW
Sbjct: 301 YKEAENAKRQRTIQVMDIHELPKPKNAFFGKNK--GGSSQGRHW 336

BLAST of Csa6G486670 vs. TrEMBL
Match: A0A0D2SWX2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G216800 PE=4 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 2.2e-128
Identity = 237/346 (68.50%), Positives = 278/346 (80.35%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR++L+TAGVDVWTFIDTA+ VASLDYG +LK RRDGIVERLYA TS  ++C++C
Sbjct: 1   MDLDDFRSVLETAGVDVWTFIDTAILVASLDYGQELKQRRDGIVERLYA-TSMVTKCKSC 60

Query: 64  DTDRNHDGR--SNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVLE 123
           D     +G   +N     +G  E    SP +PQ    + ++D  DPY GLFDDEQK+VLE
Sbjct: 61  DFGEGSNGYQLNNESNPHEGGEEGVTGSPFSPQS---DNENDDFDPYGGLFDDEQKRVLE 120

Query: 124 IKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHLV 183
           IKE+LE+P Q ED LV+LLQ+LADMDITFQALKETDIGRHVN+LRKH SNDVRRLVKHLV
Sbjct: 121 IKERLELPDQSEDTLVDLLQSLADMDITFQALKETDIGRHVNKLRKHSSNDVRRLVKHLV 180

Query: 184 RKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSDR 243
           RKWK+IVDEWVR+NQPGE     L DGDSPQQK PQNG  QVPDFAYSPNPHNGGSGS++
Sbjct: 181 RKWKDIVDEWVRVNQPGEHEPAALMDGDSPQQKPPQNGRQQVPDFAYSPNPHNGGSGSEK 240

Query: 244 NNSEPEPKGKSV-PRRDALPKPTQQAPTSSLTPQNRQKE-QQKEANFDSQKLASARKRLQ 303
           NNSEPE K K + PR+D   +PT   P     PQN Q++ +QKE NFD+++LASARKRLQ
Sbjct: 241 NNSEPERKPKPIPPRKDPPSRPTHLTP-----PQNVQRQREQKETNFDAERLASARKRLQ 300

Query: 304 ENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           E+YKEAENAK+QRT+QVMDIHE+PKPKNAFF +NK  GGGSQGRHW
Sbjct: 301 ESYKEAENAKKQRTVQVMDIHELPKPKNAFFGRNK--GGGSQGRHW 335

BLAST of Csa6G486670 vs. TrEMBL
Match: W9QGB8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_016512 PE=4 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 3.8e-128
Identity = 243/349 (69.63%), Positives = 276/349 (79.08%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +D DDFR ILD++GVDVWTFIDTA+ VAS+DYG +L+ RRDGIVERLYA +S  S     
Sbjct: 1   MDYDDFRAILDSSGVDVWTFIDTAIAVASMDYGGELRQRRDGIVERLYAASSSSS----- 60

Query: 64  DTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDD---DGADPYAGLFDDEQKKVL 123
               +H  R    E K  +GE +  SP TPQ +  E  D   +G DPY GLFDDEQKK+L
Sbjct: 61  -ASDHHRPRHYDREAKT-AGEKERVSPPTPQSIDRENSDNDGEGLDPYGGLFDDEQKKIL 120

Query: 124 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 183
           EIKEQLE   Q ED LVELLQ+LADMDITFQALKETDIGRHVNRLRKHPS++V+RLVK +
Sbjct: 121 EIKEQLEDSDQSEDTLVELLQSLADMDITFQALKETDIGRHVNRLRKHPSSEVKRLVKQV 180

Query: 184 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 243
           VRKWK+ VDEWV+LNQPGE  +  L DGDSPQQK PQNG+HQVPDFAYSPNPHNG SGSD
Sbjct: 181 VRKWKDTVDEWVKLNQPGEHASNSLMDGDSPQQKIPQNGHHQVPDFAYSPNPHNGSSGSD 240

Query: 244 RNNSEPEPKGKSVP-RRDALPKPTQQAPTSSL--TPQNRQKEQQKEANFDSQKLASARKR 303
           +NNSEPE K K+VP RR+ALPKP   +   S   TPQNRQ+E Q+E +FD++KLASARKR
Sbjct: 241 KNNSEPEQKPKAVPSRREALPKPMNPSVPQSAHSTPQNRQRE-QREGSFDAEKLASARKR 300

Query: 304 LQENYKEAENAKRQRTIQVMDIHEIPKP--KNAFFSKNKGSGGGSQGRH 345
           LQENYKEAENAK+QRTIQVMDIHEIPKP  KNAFF KNKG GGGS GRH
Sbjct: 301 LQENYKEAENAKKQRTIQVMDIHEIPKPKAKNAFFPKNKGGGGGSLGRH 341

BLAST of Csa6G486670 vs. TrEMBL
Match: A0A061FYN3_THECC (Transcription elongation factor (TFIIS) family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_045322 PE=4 SV=1)

HSP 1 Score: 465.3 bits (1196), Expect = 6.4e-128
Identity = 244/352 (69.32%), Positives = 279/352 (79.26%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR++L+TAGVDVWTFID+A+ VASLDYG +LK RRDGIVERLYA TS  +RC++C
Sbjct: 1   MDLDDFRSVLETAGVDVWTFIDSAILVASLDYGQELKQRRDGIVERLYA-TSMVTRCKSC 60

Query: 64  DTDRNHDGRSNGCEI-KQGS------GEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQ 123
           D       RSNG ++ K+GS      GE    SP TPQ    + +D   DPY GLFDDEQ
Sbjct: 61  DFGE----RSNGYQVNKEGSPNEGKGGEGGRESPFTPQS---DNEDGDLDPYGGLFDDEQ 120

Query: 124 KKVLEIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRL 183
           K+VLEIKE LE P Q ED+LV+LLQ+LADMDITFQALKETDIGRHVN+LRKH SNDVRRL
Sbjct: 121 KRVLEIKESLEEPDQSEDSLVDLLQSLADMDITFQALKETDIGRHVNKLRKHSSNDVRRL 180

Query: 184 VKHLVRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGG 243
           VK LVRKWKEIVDEWVR+NQPGE  +  L DGDSPQQK PQNG  QVPDFAYSPNPHNG 
Sbjct: 181 VKQLVRKWKEIVDEWVRVNQPGELESAALMDGDSPQQKLPQNGRQQVPDFAYSPNPHNGS 240

Query: 244 SGSDRNNSEPEPKGKSV--PRRDALPKPTQQAPTSSLTPQNRQKE-QQKEANFDSQKLAS 303
            G ++NNSEPE K K +  PR+D  P+PT   P     PQN Q++ +QKE+NFDS++LAS
Sbjct: 241 FGLEKNNSEPERKPKPIPPPRKDPPPRPTHSTP-----PQNVQRQREQKESNFDSERLAS 300

Query: 304 ARKRLQENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           ARKRLQ NYKEAENAK+QRTIQVMDIHE+PKPKNAFF KNK  GGGSQGRHW
Sbjct: 301 ARKRLQANYKEAENAKKQRTIQVMDIHELPKPKNAFFGKNK--GGGSQGRHW 337

BLAST of Csa6G486670 vs. TAIR10
Match: AT5G09850.1 (AT5G09850.1 Transcription elongation factor (TFIIS) family protein)

HSP 1 Score: 365.5 bits (937), Expect = 3.5e-101
Identity = 204/363 (56.20%), Positives = 251/363 (69.15%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR+++D AGVDVWTFIDTA+ VASLDYG +LK RRD IVERLYA TS  ++CRNC
Sbjct: 1   MDLDDFRSVMDNAGVDVWTFIDTAILVASLDYGQELKRRRDNIVERLYA-TSMANKCRNC 60

Query: 64  DTDRNHD------GRSNGCEIKQGSGEVKE------ASPSTPQFVVVEGDDDGADPYAGL 123
           D     +      GR N   + + + E  E      A     +  V   DDD  DP+AGL
Sbjct: 61  DFGGGGNVTEAAIGRVNNGRVHEETEEEDEEGVTAAAEEEVREKSVNVEDDDDFDPFAGL 120

Query: 124 FDDEQKKVLEIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSN 183
           FDDEQK ++EIKE+LE P   E++LVELLQNL DMDITFQAL+ETDIGRHVNR+RKHPSN
Sbjct: 121 FDDEQKSIVEIKEKLEDPDLSEESLVELLQNLEDMDITFQALQETDIGRHVNRVRKHPSN 180

Query: 184 DVRRLVKHLVRKWKEIVDEWVRLNQPGE-QTATLLADGDSPQQKAPQNG-YHQVPDFAYS 243
           +VRRL K LV+KWKE VDEWV+ NQPG+ +  +L+AD DSP QKA  NG   QVPDF YS
Sbjct: 181 NVRRLAKQLVKKWKETVDEWVKFNQPGDLEPPSLIADEDSPVQKALHNGSRQQVPDFGYS 240

Query: 244 PNPHNGGSGSDRNN--SEPEPKGKSV---PRRD--ALPKPTQQAPTSSLTPQNRQKEQQK 303
           P P NG S S +N+  +EPE K + V   PRR+  +  KP++ +P+    P+++   + K
Sbjct: 241 PVPQNGYSSSSKNSNITEPERKPRPVAPQPRRESPSPAKPSRPSPSQQTIPRDK---EHK 300

Query: 304 EANFDSQKLASARKRLQENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQG 346
           E +FD     +ARKRLQ+NY++AENAK+QRTIQVMDIH+IPKPK   F   KG G    G
Sbjct: 301 EVDFD-----TARKRLQQNYRQAENAKKQRTIQVMDIHDIPKPKKGGFFPRKG-GSSQGG 353

BLAST of Csa6G486670 vs. TAIR10
Match: AT5G05140.1 (AT5G05140.1 Transcription elongation factor (TFIIS) family protein)

HSP 1 Score: 89.7 bits (221), Expect = 3.7e-18
Identity = 99/362 (27.35%), Positives = 151/362 (41.71%), Query Frame = 1

Query: 6   LDDFRTILDTAG-VDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNCD 65
           LD +R      G  D++  ID A+ VA+ D  N+ K+RRD I E L++     +RC  CD
Sbjct: 7   LDSWREYFRRRGDSDIFGIIDHAIMVAATDCPNKFKSRRDKIAELLFSCRV--NRCVGCD 66

Query: 66  -----------TDRNHDGRSNG-------CEIKQGSGEVKEASPSTPQFVVVEG----DD 125
                       +R   G   G        E+  GS E K  S       +V      + 
Sbjct: 67  HLELSVPGDDEANRGTTGNGGGGTAVDEDYEVAGGSKESKANSSRGDNNQIVSNYTFDEA 126

Query: 126 DGADPYAGLFDDEQKKVLEIKE-QLEIPQQPEDALVELLQNLADMDITFQALKETDIGRH 185
           +        F    K+V  IKE  L    +P   L++ L++L  M +    LK T+IG+ 
Sbjct: 127 EALSDEIEEFSVVSKEVARIKEILLNKEDEPNSVLLDSLRHLKLMSLNVDILKSTEIGKA 186

Query: 186 VNRLRKHPSNDVRRLVKHLVRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQ--NG 245
           VN LRKH S+ +R+L K L+ +WKE+VD+WV  N   E T    A+G +P+   P   + 
Sbjct: 187 VNGLRKHSSDKIRQLAKTLIAEWKELVDQWV--NTTKEITG---AEG-TPESANPSVLDE 246

Query: 246 YHQVPDFAYSPNPHNGGSGSDRNNSEPEPKGKSVPR------RDALPKPTQQAPTS---S 305
               P   Y           D +   PEP G  +         D  P+ +++  TS    
Sbjct: 247 EEAFPSLPY-----------DVDIFTPEPNGFEISHFFDSLDFDGNPRNSEEHNTSREHE 306

Query: 306 LTPQNRQKEQQ-------KEANFDSQKLASARKRLQENYKEAENAKRQRTIQVMDIHEIP 326
             PQN  K +        ++A F S K +SA           ++ +++   + + +H+  
Sbjct: 307 RRPQNIAKRKPEGTQMRIQDAPFRSIKPSSATDFDGTRRPVKQSTEQRMKNETVSVHKSE 349


HSP 2 Score: 46.2 bits (108), Expect = 4.7e-05
Identity = 35/109 (32.11%), Positives = 57/109 (52.29%), Query Frame = 1

Query: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQK---EQQKEANFDSQ-KLASARK 300
           + ++E   K ++V    +     Q+ P   +T Q R+    +Q+K    D+  K   A++
Sbjct: 329 KQSTEQRMKNETVSVHKSEKPMIQRKPV--VTEQKRKAPGPQQEKLKGLDADAKFEFAKR 388

Query: 301 RLQENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           +LQE+Y+  ENAK+QRTIQV+++  IPK  +A   K +    G   R+W
Sbjct: 389 KLQESYQHHENAKKQRTIQVLEM--IPKQGSA--QKPQLKRPGMSNRNW 431

BLAST of Csa6G486670 vs. TAIR10
Match: AT3G10820.2 (AT3G10820.2 Transcription elongation factor (TFIIS) family protein)

HSP 1 Score: 84.3 bits (207), Expect = 1.6e-16
Identity = 76/279 (27.24%), Positives = 126/279 (45.16%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           + LD +R        D++  ID A+ VA+ D+  + K+R D I E L++     SRC  C
Sbjct: 8   VSLDTWREYFRRGDSDIFGIIDHAIMVAAADWPKEFKSRSDRIAELLFSCKV--SRCIGC 67

Query: 64  D-TDRNHDGRSNGCEIKQ--GSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 123
           D  + +  G     EI    G G+  ++  +T      EG++      A +  DE  ++ 
Sbjct: 68  DHLELSIAGDEAAVEIVGVGGGGDRGDSGVATG-----EGEE------ASVSVDEVMRIR 127

Query: 124 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 183
           +I    +   + +  L+E L+ L  M ++   LK+T+IG+ VN LR+H S+ + +L K L
Sbjct: 128 DILSNKD--DEKDSVLLESLRKLESMSMSVDILKDTEIGKAVNGLRRHSSDKISKLAKTL 187

Query: 184 VRKWKEIVDEWVRLNQPGEQTAT-----LLADGDSPQQKAPQNGYHQVPDFAYSPNPHN- 243
             +WK +VD+W  +N P E   T      L      +++A  +  H +  +A  PN    
Sbjct: 188 FAEWKRLVDQW--MNTPEEMAGTEGTPESLNLSVIDEEEAFPSPPHDLDIYAPEPNGFEL 247

Query: 244 -------GGSGSDRNNSEPEPKGKSVPRRDALPKPTQQA 267
                     G+ R++ E + + KS       PK T  A
Sbjct: 248 SQILDCLDCDGNPRHSVESKHERKSQSSAGRRPKGTNDA 269


HSP 2 Score: 50.4 bits (119), Expect = 2.5e-06
Identity = 34/83 (40.96%), Positives = 50/83 (60.24%), Query Frame = 1

Query: 247 EPKGKSVPRRDALPKPTQQAPTSSLTPQNR-----QKEQQKEANFDSQKLASARKRLQEN 306
           EPK ++   R+ +    Q+ PT+ +T Q R     Q+++ K  + DS K   A+++LQE+
Sbjct: 300 EPKRQTKQSREQMVSAIQRKPTA-VTEQKRKLAGPQQDKLKALDPDS-KFEFAKRKLQES 359

Query: 307 YKEAENAKRQRTIQVMDIHEIPK 325
           Y + ENAKRQRTIQV++   IPK
Sbjct: 360 YHQHENAKRQRTIQVLE--TIPK 378

BLAST of Csa6G486670 vs. NCBI nr
Match: gi|449448454|ref|XP_004141981.1| (PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Cucumis sativus])

HSP 1 Score: 707.6 bits (1825), Expect = 1.1e-200
Identity = 345/345 (100.00%), Positives = 345/345 (100.00%), Query Frame = 1

Query: 1   MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC 60
           MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC
Sbjct: 1   MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC 60

Query: 61  RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 120
           RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL
Sbjct: 61  RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 120

Query: 121 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 180
           EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL
Sbjct: 121 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 180

Query: 181 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 240
           VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD
Sbjct: 181 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 240

Query: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE 300
           RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE
Sbjct: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE 300

Query: 301 NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW
Sbjct: 301 NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 345

BLAST of Csa6G486670 vs. NCBI nr
Match: gi|659080819|ref|XP_008440997.1| (PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Cucumis melo])

HSP 1 Score: 697.2 bits (1798), Expect = 1.4e-197
Identity = 341/345 (98.84%), Positives = 341/345 (98.84%), Query Frame = 1

Query: 1   MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC 60
           MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC
Sbjct: 1   MVRLDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRC 60

Query: 61  RNCDTDRNHDGRSNGCEIKQGSGEVKEASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 120
           RNCDTDRNHDGRSNGCEIKQGSGEVK ASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL
Sbjct: 61  RNCDTDRNHDGRSNGCEIKQGSGEVKGASPSTPQFVVVEGDDDGADPYAGLFDDEQKKVL 120

Query: 121 EIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKHL 180
           EIKEQLEIP QPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKH SNDVRRLVKHL
Sbjct: 121 EIKEQLEIPHQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHSSNDVRRLVKHL 180

Query: 181 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 240
           VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD
Sbjct: 181 VRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGSD 240

Query: 241 RNNSEPEPKGKSVPRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE 300
           RNNSEPEPKGKSVPRRDA PKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE
Sbjct: 241 RNNSEPEPKGKSVPRRDAPPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRLQE 300

Query: 301 NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW
Sbjct: 301 NYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 345

BLAST of Csa6G486670 vs. NCBI nr
Match: gi|1009146734|ref|XP_015891030.1| (PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Ziziphus jujuba])

HSP 1 Score: 496.1 bits (1276), Expect = 4.9e-137
Identity = 251/347 (72.33%), Positives = 284/347 (81.84%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +D DDFRTILD+AGVDVWTFIDTA+ VAS+DYG +LK RRDGIVERLYA +S   R RNC
Sbjct: 1   MDYDDFRTILDSAGVDVWTFIDTAIAVASVDYGGELKQRRDGIVERLYAASSALPRLRNC 60

Query: 64  DTDRNHDGRSNGCEIKQG--SGEVKEASPSTPQFVVVE--GDDDGADPYAGLFDDEQKKV 123
           D    ++  S   + ++   SGE +  SP+TPQ V  +   DDD  DPY GLFDDEQKK+
Sbjct: 61  DAAEQNNRPSIHYDHRESKTSGEKERVSPATPQSVSRDDGNDDDELDPYGGLFDDEQKKI 120

Query: 124 LEIKEQLEIPQQPEDALVELLQNLADMDITFQALKETDIGRHVNRLRKHPSNDVRRLVKH 183
           LEI+E LE P Q ED+LVELLQ+LADMDITFQALKETDIGRHVN+LRKH S +VR LVK 
Sbjct: 121 LEIREHLEEPDQSEDSLVELLQSLADMDITFQALKETDIGRHVNKLRKHSSGEVRGLVKQ 180

Query: 184 LVRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQKAPQNGYHQVPDFAYSPNPHNGGSGS 243
           LVRKWKE VDEWV+LNQPGEQT++ L DGDSPQQK  QNG+HQVPDFAYSPNPHNG SGS
Sbjct: 181 LVRKWKETVDEWVKLNQPGEQTSSALMDGDSPQQKTFQNGHHQVPDFAYSPNPHNGSSGS 240

Query: 244 DRNNSEPEPKGKSV-PRRDALPKPTQQAPTSSLTPQNRQKEQQKEANFDSQKLASARKRL 303
           D+NNSE E K K++ PRR+A PKP+    T    PQNRQ+EQQKE+NFDSQ+LA+ARKRL
Sbjct: 241 DKNNSELERKPKALPPRREAPPKPSTIQSTPVSVPQNRQREQQKESNFDSQRLATARKRL 300

Query: 304 QENYKEAENAKRQRTIQVMDIHEIPKPKNAFFSKNKGSGGGSQGRHW 346
           QENYKEAENAKRQRTIQVMDIHEIPKPKNAFF KNKG GGGSQGRHW
Sbjct: 301 QENYKEAENAKRQRTIQVMDIHEIPKPKNAFFVKNKGGGGGSQGRHW 347

BLAST of Csa6G486670 vs. NCBI nr
Match: gi|1012257508|ref|XP_015945225.1| (PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Arachis duranensis])

HSP 1 Score: 483.8 bits (1244), Expect = 2.5e-133
Identity = 255/376 (67.82%), Positives = 289/376 (76.86%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR+ILDTAGVDVW FIDTA+ VASLD G++L+ RRDGI+ER++A T+PP  CRNC
Sbjct: 1   MDLDDFRSILDTAGVDVWLFIDTAITVASLDCGDELRRRRDGIIERIFAATTPPLPCRNC 60

Query: 64  DTDRNHDGRSNGCEIKQ--------------------GSGEVKEASPSTPQFV------- 123
           D DRN   RSNG +IK+                    G G     SPSTPQ +       
Sbjct: 61  DGDRNL--RSNGYQIKKRLSPSPSPQRQHSHHQQRRGGRGAAAAVSPSTPQSLGDDDDNG 120

Query: 124 ---VVEGDDDGADPYAGLFDDEQKKVLEIKEQLEIPQQPEDALVELLQNLADMDITFQAL 183
                E D +  DPY GLFDDEQKK+LEIKEQLE P Q E++LVELLQNLADMDITFQAL
Sbjct: 121 HADAAEDDREDLDPYGGLFDDEQKKILEIKEQLEEPDQSEESLVELLQNLADMDITFQAL 180

Query: 184 KETDIGRHVNRLRKHPSNDVRRLVKHLVRKWKEIVDEWVRLNQPGEQTATLLADGDSPQQ 243
           KETDIGRHVNRLRKH SNDVRRLVK LVRKWKEIVDEWVRLNQPG  TA+L+ADGDSP  
Sbjct: 181 KETDIGRHVNRLRKHSSNDVRRLVKLLVRKWKEIVDEWVRLNQPG-GTASLMADGDSPPL 240

Query: 244 KAPQNGYHQVPDFAYSPNPHNGGSGSDRNNSEPEPKGKSVPRRDALPKPT--QQAPTSSL 303
           K  QNG+HQ+PDFAYSPNPHNG SGSDRN SE EPK K +PR++A PKPT     PT S+
Sbjct: 241 KTTQNGHHQIPDFAYSPNPHNGSSGSDRNTSEAEPKPKVIPRKEAPPKPTPPPSVPTPSI 300

Query: 304 TPQNRQKEQQKEANFDSQKLASARKRLQENYKEAENAKRQRTIQVMDIHEIP--KPKNAF 346
             QNRQ+E Q++ +FD+++LASARKRLQENYKEAENAK+QRTIQVMDIHE+P  KPKNAF
Sbjct: 301 AFQNRQRE-QRDRDFDAERLASARKRLQENYKEAENAKKQRTIQVMDIHELPKSKPKNAF 360

BLAST of Csa6G486670 vs. NCBI nr
Match: gi|1021581102|ref|XP_016179913.1| (PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Arachis ipaensis])

HSP 1 Score: 480.7 bits (1236), Expect = 2.1e-132
Identity = 254/378 (67.20%), Positives = 288/378 (76.19%), Query Frame = 1

Query: 4   LDLDDFRTILDTAGVDVWTFIDTAMEVASLDYGNQLKNRRDGIVERLYALTSPPSRCRNC 63
           +DLDDFR+ILDTAGVDVW FIDTA+ VASLD G++L+ RRDGI+ER++A T+PP  CRNC
Sbjct: 1   MDLDDFRSILDTAGVDVWLFIDTAITVASLDCGDELRRRRDGIIERIFAATTPPLPCRNC 60

Query: 64  DTDRNHDGRSNGCEIKQ--------------------GSGEVKEASPSTPQFV------- 123
           D DRN   RSNG +IK+                    G G     SPSTPQ +       
Sbjct: 61  DGDRNL--RSNGHQIKKRLSPSPSPQRQHSHHQQRRGGRGAAAAVSPSTPQSLGDDDDNG 120

Query: 124 -----VVEGDDDGADPYAGLFDDEQKKVLEIKEQLEIPQQPEDALVELLQNLADMDITFQ 183
                  E D +  DPY GLFDDEQKK+LEIKEQLE P Q E++LVELLQNLADMDITFQ
Sbjct: 121 HADADAAEDDREDLDPYGGLFDDEQKKILEIKEQLEEPDQSEESLVELLQNLADMDITFQ 180

Query: 184 ALKETDIGRHVNRLRKHPSNDVRRLVKHLVRKWKEIVDEWVRLNQPGEQTATLLADGDSP 243
           ALKETDIGRHVNRLRKH SNDVRRLVK LVRKWKEIVDEWVRLNQPG   A+L+ADGDSP
Sbjct: 181 ALKETDIGRHVNRLRKHSSNDVRRLVKLLVRKWKEIVDEWVRLNQPG-GAASLMADGDSP 240

Query: 244 QQKAPQNGYHQVPDFAYSPNPHNGGSGSDRNNSEPEPKGKSVPRRDALPKPT--QQAPTS 303
             K  QNG+HQ+PDFAYSPNPHNG SGSDRN SE EPK K +PR++A PKPT     PT 
Sbjct: 241 PLKTTQNGHHQIPDFAYSPNPHNGSSGSDRNTSEAEPKPKVIPRKEAPPKPTPPPSVPTP 300

Query: 304 SLTPQNRQKEQQKEANFDSQKLASARKRLQENYKEAENAKRQRTIQVMDIHEIP--KPKN 346
           S+  QNRQ+E Q++ +FD+++LASARKRLQENYKEAENAK+QRTIQVMDIHE+P  KPKN
Sbjct: 301 SIAFQNRQRE-QRDRDFDAERLASARKRLQENYKEAENAKKQRTIQVMDIHELPKSKPKN 360

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
MD26C_ARATH	6.2e-100	56.20	Probable mediator of RNA polymerase II transcription subunit 26c OS=Arabidopsis ...
MD26B_ARATH	6.6e-17	27.35	Probable mediator of RNA polymerase II transcription subunit 26b OS=Arabidopsis ...
MD26A_ARATH	2.8e-15	27.24	Probable mediator of RNA polymerase II transcription subunit 26a OS=Arabidopsis ...
TEAN2_HUMAN	6.9e-06	31.58	Transcription elongation factor A N-terminal and central domain-containing prote...

Match Name	E-value	Identity	Description
A0A0A0KHV9_CUCSA	7.5e-201	100.00	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G486670 PE=4 SV=1
A0A061DKK6_THECC	2.6e-129	70.35	Transcription elongation factor (TFIIS) family protein, putative isoform 1 OS=Th...
A0A0D2SWX2_GOSRA	2.2e-128	68.50	Uncharacterized protein OS=Gossypium raimondii GN=B456_010G216800 PE=4 SV=1
W9QGB8_9ROSA	3.8e-128	69.63	Uncharacterized protein OS=Morus notabilis GN=L484_016512 PE=4 SV=1
A0A061FYN3_THECC	6.4e-128	69.32	Transcription elongation factor (TFIIS) family protein, putative isoform 1 OS=Th...

Match Name	E-value	Identity	Description
AT5G09850.1	3.5e-101	56.20	Transcription elongation factor (TFIIS) family protein
AT5G05140.1	3.7e-18	27.35	Transcription elongation factor (TFIIS) family protein
AT3G10820.2	1.6e-16	27.24	Transcription elongation factor (TFIIS) family protein

Match Name	E-value	Identity	Description
gi|449448454|ref|XP_004141981.1|	1.1e-200	100.00	PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Cuc...
gi|659080819|ref|XP_008440997.1|	1.4e-197	98.84	PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Cuc...
gi|1009146734|ref|XP_015891030.1|	4.9e-137	72.33	PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Ziz...
gi|1012257508|ref|XP_015945225.1|	2.5e-133	67.82	PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Ara...
gi|1021581102|ref|XP_016179913.1|	2.1e-132	67.20	PREDICTED: probable mediator of RNA polymerase II transcription subunit 26c [Ara...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR003617	TFIIS/CRSP70_N_sub
IPR017923	TFIIS_N
Vocabulary: Biological Process
Term	Definition
GO:0006351	transcription, DNA-templated
Vocabulary: Cellular Component
Term	Definition
GO:0005634	nucleus
Vocabulary: Molecular Function
Term	Definition
GO:0003677	DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006448 regulation of translational elongation
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0006414 translational elongation
biological_process GO:0008150 biological_process
cellular_component GO:0005634 nucleus
cellular_component GO:0005840 ribosome
cellular_component GO:0005829 cytosol
molecular_function GO:0003677 DNA binding
molecular_function GO:0003746 translation elongation factor activity
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene Name	Analysis Name	Sequence type in Unigene
CU090965	cucumber EST collection version 3.0	transcribed_cluster
CU093681	cucumber EST collection version 3.0	transcribed_cluster
CU094826	cucumber EST collection version 3.0	transcribed_cluster
CU129666	cucumber EST collection version 3.0	transcribed_cluster
CU172145	cucumber EST collection version 3.0	transcribed_cluster
CU177450	cucumber EST collection version 3.0	transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Csa6G486670.3	Csa6G486670.3	mRNA
Csa6G486670.2	Csa6G486670.2	mRNA
Csa6G486670.1	Csa6G486670.1	mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name	Unique Name	Type
CU094826	CU094826	transcribed_cluster
CU090965	CU090965	transcribed_cluster
CU129666	CU129666	transcribed_cluster
CU093681	CU093681	transcribed_cluster
CU177450	CU177450	transcribed_cluster
CU172145	CU172145	transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003617	Transcription elongation factor, TFIIS/CRSP70, N-terminal, sub-type	SMART	SM00509	TFS2_5	coord: 119..190; score: 3.3
IPR017923	Transcription factor IIS, N-terminal	GENE3D	G3DSA:1.20.930.10		coord: 116..189; score: 6.7
IPR017923	Transcription factor IIS, N-terminal	PFAM	PF08711	Med26	coord: 138..188; score: 3.3
IPR017923	Transcription factor IIS, N-terminal	PROFILE	PS51319	TFIIS_N	coord: 114..191; score: 25
IPR017923	Transcription factor IIS, N-terminal	unknown	SSF47676	Conserved domain common to transcription factors TFIIS, elongin A, CRSP70	coord: 117..198; score: 1.14
None	No IPR available	PANTHER	PTHR15141	TRANSCRIPTION ELONGATION FACTOR B POLYPEPTIDE 3	coord: 1..270; score: 4.9
None	No IPR available	PANTHER	PTHR15141:SF14	TRANSCRIPTION ELONGATION FACTOR B POLYPEPTIDE 3	coord: 1..270; score: 4.9
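
The four IPR017923 hits above place the TFIIS N-terminal domain at slightly different protein coordinates. One way to summarize them is the overall span covered by any hit and the core region shared by all hits. The sketch below does this for the coordinates listed in the table; the names `IPR017923_HITS`, `parse_coord`, and `span_summary` are hypothetical and chosen only for illustration.

```python
import re

# Coordinates copied from the IPR017923 rows of the InterPro table.
IPR017923_HITS = {
    "G3DSA:1.20.930.10": "coord: 116..189",
    "PF08711": "coord: 138..188",
    "PS51319": "coord: 114..191",
    "SSF47676": "coord: 117..198",
}

def parse_coord(text):
    """Return (start, end) from a 'coord: START..END' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates in {text!r}")
    return int(m.group(1)), int(m.group(2))

def span_summary(hits):
    """Return (overall span, shared core) for a set of domain hits.

    The overall span is reported as a single interval, which is only
    meaningful when the hits overlap; the core is None if they do not
    all share at least one position.
    """
    spans = [parse_coord(text) for text in hits.values()]
    starts = [start for start, _ in spans]
    ends = [end for _, end in spans]
    overall = (min(starts), max(ends))
    core = (max(starts), min(ends))
    if core[0] > core[1]:
        core = None
    return overall, core

overall, core = span_summary(IPR017923_HITS)
print("overall span:", overall)
print("shared core:", core)
```

With the table's values, the hits together cover residues 114..198, and all four agree on residues 138..188, which is the range of the Pfam PF08711 hit.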

The following gene(s) are paralogous to this gene:

None