CSPI04G04000 (gene) Wild cucumber (PI 183967)

NameCSPI04G04000
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionDNA-directed RNA polymerase III subunit RPC4
LocationChr4 : 2535425 .. 2540389 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGTCTTCGTTGGTATCTCATCATTTCTTCATTCCGCCTGGTCTAGGGTTCGATTCTTTCCCTTTTCTTCTTTTTCACCATTTTTGTCCCGTTTCCCTTCACCATCATCTTCAACCTCGAGCTCATGAAGAATTCCGATCCATCTCCTCCACGCAGAAAGGTAGTAATCAAAATAACTTTTCAAATCACGCCTAAATAATATCCTAAACTTCTTCTTTAACTCGATTTAGATCTAAACCCTTTTCCTTCTGCAGGTCAAATTCGCTCCAAAATCCTCACAGCGCAAGAGGCCACCACCACCACCGGTTCAGAAAACGTACTCCTCTGAATCTCATCTTCTTCCACATTTTGAATTTGTAAAATCCTAATTTGTTTGGGGAATTGCTGCAGTGAAGATGAAGATGGTGAGGGTTATGTTGCTCAAACTCGCTACCTATTACGCCGTGCCAATGTAAGCTTCTTCTTCTTCGTCTTCTTGTTATAATGGTTTTTCGAATTCTTTTGTGTGTTTTGGTTTTGATGTTTAGATGACTTTTGGTTTTCTGATCAGTTAGAATTTGGATGATTTGAAAAGATTTTCTTCTTCTACTTCTACTTCTGCTTTCCCGTTTGGTTCCCGAGGAAATTGTTTTTATGGAAATGAATCGTTTTCCGGACAACGTTCTTTCTTCTTTATTTCTCTTCCTGGAGAAGCGCTTGGGGTTTCTCTTTTTCTTTTCGTCGATAAAAATTAAATTAAAGGTTAAAATATTTGGTTAGTCTTAGCCTAAAAAAAAAGTGAACTTTATTATTTTAGTAAAAAATCTTCCAATTTTAAAAAGTCTTAAAAGTACACTTAAATTTTTAAAATGAGGACAAAAATACAATGCAAATTTATTTTTTGAACTTTACAAAATTTCACAACTAGAAAAAACATTAGAAATTCCCTTATTTCATTAATATCGTCTTACTTCTCTTTTCTTCTTCATTTTCGTTAATTATTTGACACTTGTTTAGCTCAAATAAATAAATAAAATCGCTAAGATATATAAACTATAGAAGATAAATAAAAGGGCTAATGATTTGAAGGCTTAAATATTATAACTAAATTATATGTTAGTTGGTGTGTATGTGTTAGTTAATATTCAAATACTTTTTTATTTGAATATTGATCGTATGTTAATAAAATATTAAATTAGTAAAAATTGTCATATATAACTTGCCAACTTTTATTTATAGACCATTTAAAATAATTTACAAACCTCAAATAACAAAACTTGGAGATCAACTAATCTCAAACCTTAATAATTAAAAAAACTTTTAATGAACTTGGAAAATGTTAGTAGTACTATCCTCCCCCCTAATTTGTCCGGTCACCAAATATATATATTAATTTATTGTTTATTTTCTTTATGAACTTCACATATTTTGTTGCAATCATTAAAAGGAGCTAATATAGAAAATTTCTTGTTATGACCAAACAGTAGTTATTTTTTGAAAAATAAATTCTTATTTTTAGAAAACAATTTCACTAATGTTAGTTGCTTAAAAAAATTATTAATATTAGTTTTGCATGTAACTATATAATTTAGTTTTTTTTTTTTTTGTTTACGGTTCAATCTGCAATAATTAAAAGATGATTATTTTTTGGTTATTTATGTAACTTACATTTCTCAACTCGTTATTCGTGTTAGTTTTTCTATAATAATTACCAACTCAACTCTGACTTTTATCACCAACCAAACGTTAAATCTAGATATTCAATCTTTTGTTAAACTCACTATGTTTGATTATTTTTTGAAATGTTTGTGCTTTATTTTAGCTATTTGGTTTTCTAGATGAGTTTTTTTGTCCCTTTTTTTAGAATAAGATGAGTTTTTTGGCCTGCTATTGCTATATTTGAGCATTTTCTTTACGCTATTATGGATTATTTCTTTCTTGAATAGTAATTTTATCAATTACCTTTTGTGTTTAGGCCTCAAAGTTAAAACTATGGACTTATTAAGTTCAAAGCAAAATTTCATATCTTCTTAAACAATATTACAATAGTACACATCACAAATAGAACAATTTTCAACAATATCTTCATACGAAATAATTTTGTTTCATTTTTAGTGTTGTAGATTTATGTTCCTTGATCAAATACGATTTAATTCACCTTTAAAATATGTTAACATGTTTGAAAACATAGCCTATGGGTGGAACTATATCCACCGATGATGGCTAGGTAATGAAACAATTGTTTGATTTTTCACTTTCTTTACTCATTTGAAAGAGCATCGAGCATCCATTAGTGTTAGACAGTACAGACTACCTGTTAAGAGTTAAAACAGCAAAGGGGCAACAATTGACTCCTTAACCTTTGGCCTTAATATGTAACTGTAAGAACTGATTTGGTTTTCTTTCCACTTTGCAGGAGAATCTCGGAAAACGAGCGAACAAGGTCGAAAAAAAATGTATATGAAGTATTACTTTAAAATTTTGAATTAATATCAATAGCACTGTATTGGTTTTTTAATATAATTTGGACGGATTATCCATGTAAACGGGTTTGTTTTCATTCACAAGAGAGTGTGGGCTGCTACAATGTGGCTAATAATTTTGTAGAGTTGAATGTATGAAGGAAAAAATTGCAAAAGCACCTTGAACATTCGTATCTAATAATTATTGGGTGCATTCAGATTTTTAAATCTACCTCCAACGACTAAATAATTTGTCATCACGAGAAAAGAAAATCCCAAGTTCTTAGCAACGGAAGATGAAGTTTCCATTGTTGCCTTAACATGATCTCTATTTTCATTTGCATATCTGCAGCTTCAGTGCAAGTTGCATTCGGTCCTGGCGCTGAATCAACGTCATCTTCTATTAGAACATATGGAGTTCCCAAAGTAGAAAACGGTAGCAGAAAAAATGACATTGAGCCTGAAGTTGATGAAGATGAGGAGTTTGTCTTGCCAGTGGCTAGGGATGTTAACGAAGATGGAAAATATTTTGATAAAAAAACCAAAGATGGAATTACTGAATCCTCTAGTTCGGCTATGGAAACCAAGACCAAGAGGGATTACAAAGAACCTTGGGTAATCTCTTTTTCTCTCTCACTTTTGACACACACATAAAGATTAACTTCCTGATCTCATGATACTTTTTCAGGATTACCAAAATTCTTACTATCCTACTACACTCCCTTTACGGATGCCTTACTCTGGAGACCCTGGTATGTCTTTATTCTCTCCCAAAGAGGAATAGCAGAAGTGTCACATTGGCCACCTTTAGCTTTTGGTTTGTGAAAACTGATAAGTGATTTATGATATAAAATGATCTTAATGGCATCTTAGACATGTTTGTGAGTGATTTTGAAAGAATTAAAATCACTTTTGTCATGTTCAAAGTTGCTATGAAACAATGTCTTTATTCATATGTTTGATTTCACACGTTGAACTGTGAATATCTATACAATCAAAATTGGTTTTGAAGCATTAGAACTATTTTATGGAGCTAATACTTAATCACGCAAAAGCTTATAATCAAATCTTGAAAACAAAAGCAGACTTTTAAGAGTCATTAGATAAGGGATTGCTCATCTATGATGAAATGAACTTTTGACCTTAGTGGAGGATCGTTCTGACTTTGTTCTGTAGTTCAGCTAACTCTCCTGCCTCTTCTTGGATTTAGAACGACTTGATGAAGCCGAGTTTGGGCAGGATGTGATGAATAGAGAGTATGATGAGAACTCTGTAATACCTGCCTTGGATCTCGGGTTGCTGGTGGGTGAATAATTGGGAACTGTGGCTTATTTCATATATACATTGGTATTTCTTTATGCAGAATCTAGTTATAACTTATAATATGGTGGTGTTTTTTGATGCAGGATGAAAACACAGAGAGTACTAAGTACTTCTTTCAGCTTCCTGCACGTCTTCCTTTACCCAAACAATCGTCTACTGCAACGGGAAAGGAGAAAGTAGGAAATTCGAGATCTTCAAACAGCACGAGCTCGTCCGATCTCGACGATTTGAAGAAACTGTCTGCTGGATGTATGGGGAAACTACTGATATACAAGAGTGGAGCTATTAAGTTGAGGCTGGGAGACATTCTTTACGATGTGAGTAAAGTCGTTAACTTTCTACAACCAATATCAAGCATTGTTGAGTTCTTTTTTTTGCTGCCCGCCTCCAGTTTAGATGTCATATATTGTGATATTATTTGTGTTTGGAATGCATTTAATTTGTTGTGTGCCACCCCAAATGTGACCTATCGCAATCAGTGAACCGAGGTCTAAAGATGACTCCAGCCCTCAAAATCGAAATTGTATTCATATTTATGTTATTTACAATTTAGACCGTAGGGTTTAGAGGAGTGCACTATATAAACTCAAACTCAAAAGGCACTTGAATCAAGTTAGCAGATATACTGTTAAAAATCAAGTGAATGTCAATTTTTTATTGTTTACGTTTTCCGTTTATTTTTTAAGTAGATTTCTTAACAATTTGGTCCTGTTTTGCGGCTGTTGTAGGTTTCTTCTGGATCAAACTGCAGTTTTCTTCAACATGTTGTGGCAATCAACAGTGAAGAGGGACAGTGCTGTGATCTCGGAGACATTGGCAACCGAGTTGTTGTAACACCGGATATTAGTTCCCTTTTAAATTCAGTGACTAATTTGAGATGATTGTGAAGAAAAAGCCTGTGGATAGAACAACCAACAAAAGAACACAACGAAGAAGGGCAAATCCTTACAATGTCAAAATGGGACTTTTACGTCTGGCTGTGTAGTGAAGGTGCTCATTTAGTAAGGAAGTTGGAGGGATGTCAAGTTCTTGTGTAATCTTTCTCCTTTGTTATTTCTATGGAAACAACAACGTATTCAGTATCTTTGATTTCTAGCTGTGGAATAGGTTTTTACCCGATAATGTTGATAGTTGTTGTAAAGGACTTTTTGTATATAGTCTTTCTTGATGGTGGTAATATTAGGATCCACACTTATCATGCA

mRNA sequence

ATGAAGAATTCCGATCCATCTCCTCCACGCAGAAAGGTCAAATTCGCTCCAAAATCCTCACAGCGCAAGAGGCCACCACCACCACCGGTTCAGAAAACTGAAGATGAAGATGGTGAGGGTTATGTTGCTCAAACTCGCTACCTATTACGCCGTGCCAATGAGAATCTCGGAAAACGAGCGAACAAGGTCGAAAAAAAATCTTCAGTGCAAGTTGCATTCGGTCCTGGCGCTGAATCAACGTCATCTTCTATTAGAACATATGGAGTTCCCAAAGTAGAAAACGGTAGCAGAAAAAATGACATTGAGCCTGAAGTTGATGAAGATGAGGAGTTTGTCTTGCCAGTGGCTAGGGATGTTAACGAAGATGGAAAATATTTTGATAAAAAAACCAAAGATGGAATTACTGAATCCTCTAGTTCGGCTATGGAAACCAAGACCAAGAGGGATTACAAAGAACCTTGGGATTACCAAAATTCTTACTATCCTACTACACTCCCTTTACGGATGCCTTACTCTGGAGACCCTGAACGACTTGATGAAGCCGAGTTTGGGCAGGATGTGATGAATAGAGAGTATGATGAGAACTCTGTAATACCTGCCTTGGATCTCGGGTTGCTGGATGAAAACACAGAGAGTACTAAGTACTTCTTTCAGCTTCCTGCACGTCTTCCTTTACCCAAACAATCGTCTACTGCAACGGGAAAGGAGAAAGTAGGAAATTCGAGATCTTCAAACAGCACGAGCTCGTCCGATCTCGACGATTTGAAGAAACTGTCTGCTGGATGTATGGGGAAACTACTGATATACAAGAGTGGAGCTATTAAGTTGAGGCTGGGAGACATTCTTTACGATGTTTCTTCTGGATCAAACTGCAGTTTTCTTCAACATGTTGTGGCAATCAACAGTGAAGAGGGACAGTGCTGTGATCTCGGAGACATTGGCAACCGAGTTGTTGTAACACCGGATATTAGTTCCCTTTTAAATTCAGTGACTAATTTGAGATGA

Coding sequence (CDS)

ATGAAGAATTCCGATCCATCTCCTCCACGCAGAAAGGTCAAATTCGCTCCAAAATCCTCACAGCGCAAGAGGCCACCACCACCACCGGTTCAGAAAACTGAAGATGAAGATGGTGAGGGTTATGTTGCTCAAACTCGCTACCTATTACGCCGTGCCAATGAGAATCTCGGAAAACGAGCGAACAAGGTCGAAAAAAAATCTTCAGTGCAAGTTGCATTCGGTCCTGGCGCTGAATCAACGTCATCTTCTATTAGAACATATGGAGTTCCCAAAGTAGAAAACGGTAGCAGAAAAAATGACATTGAGCCTGAAGTTGATGAAGATGAGGAGTTTGTCTTGCCAGTGGCTAGGGATGTTAACGAAGATGGAAAATATTTTGATAAAAAAACCAAAGATGGAATTACTGAATCCTCTAGTTCGGCTATGGAAACCAAGACCAAGAGGGATTACAAAGAACCTTGGGATTACCAAAATTCTTACTATCCTACTACACTCCCTTTACGGATGCCTTACTCTGGAGACCCTGAACGACTTGATGAAGCCGAGTTTGGGCAGGATGTGATGAATAGAGAGTATGATGAGAACTCTGTAATACCTGCCTTGGATCTCGGGTTGCTGGATGAAAACACAGAGAGTACTAAGTACTTCTTTCAGCTTCCTGCACGTCTTCCTTTACCCAAACAATCGTCTACTGCAACGGGAAAGGAGAAAGTAGGAAATTCGAGATCTTCAAACAGCACGAGCTCGTCCGATCTCGACGATTTGAAGAAACTGTCTGCTGGATGTATGGGGAAACTACTGATATACAAGAGTGGAGCTATTAAGTTGAGGCTGGGAGACATTCTTTACGATGTTTCTTCTGGATCAAACTGCAGTTTTCTTCAACATGTTGTGGCAATCAACAGTGAAGAGGGACAGTGCTGTGATCTCGGAGACATTGGCAACCGAGTTGTTGTAACACCGGATATTAGTTCCCTTTTAAATTCAGTGACTAATTTGAGATGA
BLAST of CSPI04G04000 vs. Swiss-Prot
Match: RPC4_MOUSE (DNA-directed RNA polymerase III subunit RPC4 OS=Mus musculus GN=Polr3d PE=2 SV=2)

HSP 1 Score: 59.3 bits (142), Expect = 9.3e-08
Identity = 43/127 (33.86%), Postives = 65/127 (51.18%), Query Frame = 1

Query: 216 FFQLPARLP--LPKQSSTATGKEKVGNS---------RSSNSTSSSDLDDLKKLSAGCMG 275
           F QLP  LP   P Q       E  G           +   +  + +   L  L+ G +G
Sbjct: 268 FLQLPDTLPGQPPTQDIKPVKTEVQGEDGQMVVIKQEKDREARLAENACTLADLTEGQVG 327

Query: 276 KLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI---NSEEGQCCDLGDIGNRVVVTP 329
           KLLI KSG ++L LG +  DV+ G+ CSFLQ +V++   +S  G+   LG + +++V +P
Sbjct: 328 KLLIRKSGKVQLLLGKVTLDVTMGTTCSFLQELVSVGLGDSRTGEMTVLGHVKHKLVCSP 387

BLAST of CSPI04G04000 vs. Swiss-Prot
Match: RPC4_HUMAN (DNA-directed RNA polymerase III subunit RPC4 OS=Homo sapiens GN=POLR3D PE=1 SV=2)

HSP 1 Score: 58.5 bits (140), Expect = 1.6e-07
Identity = 44/132 (33.33%), Postives = 66/132 (50.00%), Query Frame = 1

Query: 211 ESTKYFFQLPARLP--LPKQSSTATGKEKVGNS---------RSSNSTSSSDLDDLKKLS 270
           E    F QLP  LP   P Q       E  G           +   +  + +   L  L+
Sbjct: 263 EEELLFLQLPDTLPGQPPTQDIKPIKTEVQGEDGQVVLIKQEKDREAKLAENACTLADLT 322

Query: 271 AGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI---NSEEGQCCDLGDIGNR 329
            G +GKLLI KSG ++L LG +  DV+ G+ CSFLQ +V++   +S  G+   LG + ++
Sbjct: 323 EGQVGKLLIRKSGRVQLLLGKVTLDVTMGTACSFLQELVSVGLGDSRTGEMTVLGHVKHK 382

BLAST of CSPI04G04000 vs. Swiss-Prot
Match: RPC4_BOVIN (DNA-directed RNA polymerase III subunit RPC4 OS=Bos taurus GN=POLR3D PE=2 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 2.1e-07
Identity = 37/97 (38.14%), Postives = 59/97 (60.82%), Query Frame = 1

Query: 235 KEKVGNSRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFL 294
           +EK   +R + +T +     L  L+ G +GKLLI KSG ++L LG +  DV+ G+ CSFL
Sbjct: 303 QEKDREARLAENTCT-----LADLTEGQVGKLLIRKSGKVQLLLGKVTLDVTMGTTCSFL 362

Query: 295 QHVVAI---NSEEGQCCDLGDIGNRVVVTPDISSLLN 329
           Q +V++   +S  G    LG I +++V +P+  SLL+
Sbjct: 363 QELVSVGLGDSRTGDMTVLGHIKHKLVCSPNFESLLD 394

BLAST of CSPI04G04000 vs. Swiss-Prot
Match: RPC4_YEAST (DNA-directed RNA polymerase III subunit RPC4 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=RPC53 PE=1 SV=2)

HSP 1 Score: 54.3 bits (129), Expect = 3.0e-06
Identity = 35/116 (30.17%), Postives = 65/116 (56.03%), Query Frame = 1

Query: 217 FQLPARLPLPKQSSTATGKEKVGNSRSSNSTSSSDL---DDLKKLS----AGCMGKLLIY 276
           FQLP RLP  ++ +    KE +    S  S    ++   D    LS    AG +G + ++
Sbjct: 307 FQLPTRLPAFERPAVKEEKEDMETQASDPSKKKKNIKKKDTKDALSTRELAGKVGSIRVH 366

Query: 277 KSGAIKLRLGDILYDVSSGSNCSFLQHVVAIN-SEEGQCCD-LGDIGNRVVVTPDI 324
           KSG + +++G+++ D+  G+  +FLQ V+A++ +++    + LG +  ++VVTP I
Sbjct: 367 KSGKLSVKIGNVVMDIGKGAETTFLQDVIALSIADDASSAELLGRVDGKIVVTPQI 422

BLAST of CSPI04G04000 vs. TrEMBL
Match: A0A0A0KZ61_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G022870 PE=4 SV=1)

HSP 1 Score: 648.3 bits (1671), Expect = 5.2e-183
Identity = 333/334 (99.70%), Postives = 334/334 (100.00%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60
           MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA
Sbjct: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60

Query: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN 120
           NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN
Sbjct: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN 120

Query: 121 EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE 180
           EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE
Sbjct: 121 EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE 180

Query: 181 AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN 240
           AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN
Sbjct: 181 AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN 240

Query: 241 SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI 300
           SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI
Sbjct: 241 SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI 300

Query: 301 NSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR 335
           N+EEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR
Sbjct: 301 NTEEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR 334

BLAST of CSPI04G04000 vs. TrEMBL
Match: A0A067JSB7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26275 PE=4 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 3.4e-73
Identity = 167/341 (48.97%), Postives = 221/341 (64.81%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDG---EGYVAQTRYLLRRANENLG 60
           M    PSP  RK KF PK+  +++P P  V K+E  D    E   AQ + L+R+ NENL 
Sbjct: 1   MDQEQPSPTPRKAKFTPKAPPQRKPIPT-VTKSEVNDSNNDEDEAAQAQKLMRKFNENLR 60

Query: 61  KRANKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENG--SRKNDIEPEVDEDEEFVLPV 120
           ++  KVEKKSSVQVAFGPGA + S+SIR YGVP  EN   S + DI+   ++D   V+  
Sbjct: 61  RKVPKVEKKSSVQVAFGPGA-APSTSIRKYGVPGCENAGSSSRLDIKDSDNDDRRIVVSS 120

Query: 121 ARDVNEDG--KYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSG 180
                EDG  KY  +         +   +  K K+DY+EPWDY ++YYPTTLPLR PYSG
Sbjct: 121 LSTAEEDGASKYHSE---------AIGVLPLKIKKDYREPWDYNHTYYPTTLPLRRPYSG 180

Query: 181 DPERLDEAEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTAT 240
           DPE L+E EFG+     EY+EN++ PA DLGLL+E+ +   +FFQLP++LPL K+S+   
Sbjct: 181 DPELLNEEEFGEAARKLEYNENTIKPASDLGLLEESDKERLFFFQLPSKLPLVKRSAITK 240

Query: 241 GKEKVGNSRSSNSTSSSDLD-DLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCS 300
           GKEKV  S  S  TS+   +   + LS G MGK+L+Y+SGA+K++LGD LYDVS GS+C+
Sbjct: 241 GKEKVEGSTPSQGTSALKKESSFQGLSEGYMGKMLVYRSGAVKIKLGDTLYDVSPGSDCT 300

Query: 301 FLQHVVAINSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           F Q ++AI+    QCC +G +  R VVTPD+ SLLNSV NL
Sbjct: 301 FAQDLMAIDIASKQCCTIGKLRKRAVVTPDVDSLLNSVINL 330

BLAST of CSPI04G04000 vs. TrEMBL
Match: B9R9V4_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1500880 PE=4 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 5.9e-70
Identity = 162/336 (48.21%), Postives = 217/336 (64.58%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRP--PPPPVQKTEDEDGEGYVAQTRYLLRRANENLGK 60
           M +  PSP +RKVKF PK+  ++RP    P  +    ++ E    Q + L+R+ NEN  +
Sbjct: 1   MDDEQPSPSQRKVKFTPKAPSQRRPRRTVPKTEVNGVDNNEDEAVQAQKLMRKFNENFRR 60

Query: 61  RANKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARD 120
           +  +VEKKS+VQVAFGPGA S S+SIRT+GV K EN    + I+   D+D + V+     
Sbjct: 61  QGPRVEKKSTVQVAFGPGATS-STSIRTFGVSKGENPV-SSGIKDSTDDDGKIVISSLST 120

Query: 121 VNEDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERL 180
             ED +  +  ++D        A+  K K+DY+EPWDY  +YYPTTLPLR PYSGDP  L
Sbjct: 121 DKED-EIINCASED------IDALPLKIKKDYREPWDYDRTYYPTTLPLRRPYSGDPVLL 180

Query: 181 DEAEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKV 240
           DEAEFG+     EYDE+++ PA DL LL+E       FFQLPA+LPL K+S++A GKEK 
Sbjct: 181 DEAEFGEAARKLEYDESTMNPASDLELLEECDTEKMIFFQLPAKLPLVKRSASAKGKEKA 240

Query: 241 -GNSRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHV 300
            G+  S    ++     L  LSAG MGK+L+Y+SGA+KL+LGD LYDVS GS+C F Q V
Sbjct: 241 EGSIPSQGKNAAKKESSLDGLSAGYMGKMLVYRSGAVKLKLGDTLYDVSQGSDCMFAQDV 300

Query: 301 VAINSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           +AIN+    CC +G++  R VVTPD+ SLL+SV NL
Sbjct: 301 MAINTAAKHCCTIGELEKRAVVTPDVDSLLDSVVNL 327

BLAST of CSPI04G04000 vs. TrEMBL
Match: A0A061FZZ6_THECC (DNA binding protein, putative isoform 2 OS=Theobroma cacao GN=TCM_014896 PE=4 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 9.5e-68
Identity = 160/337 (47.48%), Postives = 209/337 (62.02%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60
           M    PS  RRKV+FAPK+ Q  R     V K+E  D +G  AQ +YLL R NEN  ++ 
Sbjct: 1   MDQDGPSSGRRKVRFAPKAPQSSRRLKTTVSKSEVNDEDGEAAQAQYLLGRFNENQTRQR 60

Query: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN 120
            KVEKKSS Q++FGPGA S S+ +R YG  +     +  D      +D         D  
Sbjct: 61  PKVEKKSSAQISFGPGAPS-SNLLRAYGSQRGGTSGKSTDSRQRSPDD--------NDGQ 120

Query: 121 EDGKYFDKKTKDGITESSSSAMET---KTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPER 180
             G +     +D     SS A+E    K KR+Y+EPWDY ++YYP TLPLR PYSGDPE 
Sbjct: 121 IIGSFPSASKEDRTDICSSDAIEASAPKIKREYREPWDYHHTYYPITLPLRRPYSGDPEL 180

Query: 181 LDEAEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEK 240
           LD+AEF  +   +EYDE ++ PA DLGLL+E  +   +FFQLPA LP+ K+ ++  GKEK
Sbjct: 181 LDQAEF-VEAARKEYDEKTINPASDLGLLEEGEKGKMFFFQLPANLPVIKRLASTKGKEK 240

Query: 241 VGNSRSSNSTSSSDLD-DLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQH 300
             N  SS    +      L++L  G MGK+L+YKSGA+KL+LG+ LYDVS GS+C F Q 
Sbjct: 241 AENLGSSERFGALKKGCQLEELPGGFMGKMLVYKSGAVKLKLGETLYDVSPGSDCIFAQD 300

Query: 301 VVAINSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           V A+N+ E  CC +G++G RVVVTPDISS+LNSV +L
Sbjct: 301 VAAVNTTEKHCCVIGELGKRVVVTPDISSVLNSVIDL 327

BLAST of CSPI04G04000 vs. TrEMBL
Match: A0A058ZWX9_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K00171 PE=4 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 1.0e-66
Identity = 149/330 (45.15%), Postives = 205/330 (62.12%), Query Frame = 1

Query: 11  RKVKFAPKSSQRKRPPPPPVQKTEDEDGEG-----YVAQTRYLLRRANENLGKRANKVEK 70
           RKVKF PK+   ++ PP    KTE   G+G       A+ + LLRR NEN G++  +V  
Sbjct: 7   RKVKFTPKAPAPRKKPPTVSPKTEANGGDGGEEAAVEAEAQMLLRRFNENFGQQGARVGN 66

Query: 71  KSSVQVAFGPGAESTSSSIRTYGVPKVEN---GSRKNDIEPEVDEDEEFVLPVARDVNED 130
           KSSVQVAFGPGA S++S+IRT+GVPK E    G           + +    P+  + +  
Sbjct: 67  KSSVQVAFGPGAPSSASTIRTFGVPKEERSDQGCGPRLRSYAAAKGQTISPPIVAETD-- 126

Query: 131 GKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDEAE 190
                    D   E +++ +  K+K++YKEPWD+ +SYYPTTLPLRMPYSGDPE L+E E
Sbjct: 127 -------ATDATMEDATAEVPRKSKKEYKEPWDFHHSYYPTTLPLRMPYSGDPEILNEKE 186

Query: 191 FGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGNSR 250
           FG+   N EYDE  +  A DLGLL+++      FFQLP  LP  ++S++  GKEK  +S 
Sbjct: 187 FGEATRNFEYDEGIINAASDLGLLEDSGRGKMLFFQLPPNLPSVRRSTSVKGKEKAESSA 246

Query: 251 SSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAINS 310
           SS + +S     L+ LS G MGKL++YKSGA+KL+LG+ LYDVS GS+C F Q V  +N+
Sbjct: 247 SSAADASERDCKLEDLSGGYMGKLVVYKSGAVKLKLGETLYDVSPGSDCIFAQDVAVVNT 306

Query: 311 EEGQCCDLGDIGNRVVVTPDISSLLNSVTN 333
            +   C LG++G R ++TPD  SLLN + +
Sbjct: 307 ADKHFCSLGEVGKRGIITPDFDSLLNYIAD 327

BLAST of CSPI04G04000 vs. TAIR10
Match: AT4G25180.1 (AT4G25180.1 RNA polymerase III RPC4)

HSP 1 Score: 173.3 bits (438), Expect = 2.5e-43
Identity = 125/334 (37.43%), Postives = 176/334 (52.69%), Query Frame = 1

Query: 5   DPSPPRRKVKFAPKSSQ-RKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRANKV 64
           D    + K +F P   +  +R P  P   TE E+ E  +  +R   RR    +G+R    
Sbjct: 2   DSGEQKSKRRFQPNPPRPSRRLPIAPTSNTEAEEDEENIKASRQFDRRI---VGRRPKTE 61

Query: 65  EKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN--- 124
            K SS +VAF P     +  IR++GVPK E+    +D+ P        +LP    V    
Sbjct: 62  TKASSPEVAFQPSLSPLA--IRSFGVPK-EDDKPNSDVNPSSPAS---ILPAVSSVTAAQ 121

Query: 125 EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE 184
           EDG+                   T+T  DY EPWDY+NSYYPT LPLR P SGD E LD+
Sbjct: 122 EDGEEVHN-------------FVTRTGDDYVEPWDYRNSYYPTVLPLRKPNSGDIELLDQ 181

Query: 185 AEFGQDVMNREYDENSVIPALDLGLLD-ENTESTKYFFQLPARLPLPKQSSTATGKEKVG 244
            EFG+   NR+YDEN++  A +LGL   ++++   + F++P  LP+ KQ++ AT K    
Sbjct: 182 EEFGEVAKNRDYDENTINSAEELGLTSVQHSKKQMFIFKIPDCLPVVKQTTGATTK---- 241

Query: 245 NSRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVA 304
             RS    SS   +  + L  G MGK+L+YKSGA+KL++GD L+DVS G        VVA
Sbjct: 242 --RSVREYSSGISNPFEGLPEGFMGKMLVYKSGAVKLKVGDALFDVSPGPGTKIPNDVVA 301

Query: 305 INSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           I+ +   C  +G     V VTPD+ SLLN  +++
Sbjct: 302 IDIKGRNCSRIGSSAKFVTVTPDVESLLNPASDM 307

BLAST of CSPI04G04000 vs. TAIR10
Match: AT5G09380.1 (AT5G09380.1 RNA polymerase III RPC4)

HSP 1 Score: 147.9 bits (372), Expect = 1.1e-35
Identity = 84/201 (41.79%), Postives = 122/201 (60.70%), Query Frame = 1

Query: 134 ITESSSSAMETKTKR-DYKEPWDYQNSYYPTTLPLRMPYSGDPERLDEAEFGQDVMNREY 193
           +  S+ +   T T+  +YKEPWDY  SYYP TLP+R PY+GDPE LD  EF Q      +
Sbjct: 82  LNRSNGAYGSTSTQEIEYKEPWDYY-SYYPITLPMRRPYAGDPEVLDVEEFMQ---AGGH 141

Query: 194 DENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGNSRSSNSTSSSDL 253
            E+S+  A +LGL++++ E   +F +LP+ +PL      +T  E +    +         
Sbjct: 142 HEDSLNTAANLGLMEDSGEQKMFFMRLPS-VPL-----ASTPTENLETRPNIKGPVEKKT 201

Query: 254 DDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAINSEEGQCCDLGD 313
            DLK L  G MGKLL+YKSGA+K++LG++LYDVS G    F Q V+ +N+E+  CC +GD
Sbjct: 202 VDLKALPEGYMGKLLVYKSGAVKMKLGEVLYDVSPGLKSEFAQDVMVVNTEQKNCCLVGD 261

Query: 314 IGNRVVVTPDISSLLNSVTNL 334
           +    V+TPDI S+L  + N+
Sbjct: 262 VYKHAVLTPDIDSILKDIENI 272

BLAST of CSPI04G04000 vs. NCBI nr
Match: gi|449458357|ref|XP_004146914.1| (PREDICTED: DNA-directed RNA polymerase III subunit rpc4-like [Cucumis sativus])

HSP 1 Score: 648.3 bits (1671), Expect = 7.5e-183
Identity = 333/334 (99.70%), Postives = 334/334 (100.00%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60
           MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA
Sbjct: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60

Query: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN 120
           NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN
Sbjct: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN 120

Query: 121 EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE 180
           EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE
Sbjct: 121 EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE 180

Query: 181 AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN 240
           AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN
Sbjct: 181 AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN 240

Query: 241 SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI 300
           SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI
Sbjct: 241 SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI 300

Query: 301 NSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR 335
           N+EEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR
Sbjct: 301 NTEEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR 334

BLAST of CSPI04G04000 vs. NCBI nr
Match: gi|659107882|ref|XP_008453902.1| (PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like [Cucumis melo])

HSP 1 Score: 619.8 bits (1597), Expect = 2.8e-174
Identity = 319/334 (95.51%), Postives = 325/334 (97.31%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60
           MKN D SPPRRKVKFAPKSSQR+RPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA
Sbjct: 1   MKNQDSSPPRRKVKFAPKSSQRRRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRA 60

Query: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDVN 120
           NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVA DVN
Sbjct: 61  NKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVAMDVN 120

Query: 121 EDGKYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDPERLDE 180
           EDGK+FDKKTKDGI ESSSSAM TKTKRDYKEPWDYQNSYYPTTLPLRMPYSGD ERLDE
Sbjct: 121 EDGKFFDKKTKDGIAESSSSAMVTKTKRDYKEPWDYQNSYYPTTLPLRMPYSGDHERLDE 180

Query: 181 AEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGN 240
           AEFGQD MNREYDENSVIPALDLGL DENTE+TKYFFQLPARLPLPKQSSTATGKEKVGN
Sbjct: 181 AEFGQDAMNREYDENSVIPALDLGLQDENTENTKYFFQLPARLPLPKQSSTATGKEKVGN 240

Query: 241 SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQHVVAI 300
           SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGS+CSFLQHVVAI
Sbjct: 241 SRSSNSTSSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSDCSFLQHVVAI 300

Query: 301 NSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNLR 335
           N+E+GQCCDLGDIG RVVVTPDISSLLNSVTNLR
Sbjct: 301 NTEKGQCCDLGDIGKRVVVTPDISSLLNSVTNLR 334

BLAST of CSPI04G04000 vs. NCBI nr
Match: gi|1009122078|ref|XP_015877810.1| (PREDICTED: uncharacterized protein LOC107414218 [Ziziphus jujuba])

HSP 1 Score: 295.4 bits (755), Expect = 1.2e-76
Identity = 166/337 (49.26%), Postives = 230/337 (68.25%), Query Frame = 1

Query: 2   KNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRAN 61
           ++   S  R+K++FAPK   R++P PPP +K   ++      + + LLR+ NENL +R +
Sbjct: 3   QDGSSSTTRKKLRFAPKPPPRRKPRPPPPKKEAGDEDVDEAKEAKSLLRQFNENLTRRGS 62

Query: 62  KVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDED---EEFVLPVARD 121
           + EKKSSVQVAFGPGA + SSSIRT+GV K  N  + +D+EP+V E    E+ + P+  D
Sbjct: 63  RAEKKSSVQVAFGPGA-THSSSIRTFGVDKARNSYKSSDVEPKVSESSDSEKIISPLPLD 122

Query: 122 VNEDGKYFDKKTKDGITESSSSAMETKT-KRDYKEPWDYQNSYYPTTLPLRMPYSGDPER 181
                     K   G        + T+T K++YKEPWDY +SYYP TLPLR PYSGDPE 
Sbjct: 123 ----------KAGTGAALEEVRDLSTQTVKKEYKEPWDYNHSYYPITLPLRPPYSGDPEL 182

Query: 182 LDEAEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEK 241
           L+EAEFG+   N+EYDE ++ PA +LGL++EN E   +FFQLPA LP+ KQS+++ GKEK
Sbjct: 183 LNEAEFGEAARNKEYDETAIHPASELGLMEENGERKMFFFQLPATLPMVKQSASSKGKEK 242

Query: 242 VGNSRSSNST-SSSDLDDLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCSFLQH 301
           VG+S SS S  ++S    L++L  GCMGK+L+YKSGA+KL+LGD L+DVS GS+    Q+
Sbjct: 243 VGSSTSSGSVGTTSKGSKLEELHGGCMGKMLVYKSGAVKLKLGDTLFDVSPGSDFQVSQN 302

Query: 302 VVAINSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           VVAIN+ E +C  LG++   VVV+PD+ S+LNS+ +L
Sbjct: 303 VVAINTAEKECNVLGELNKLVVVSPDVCSVLNSIIDL 328

BLAST of CSPI04G04000 vs. NCBI nr
Match: gi|643706312|gb|KDP22444.1| (hypothetical protein JCGZ_26275 [Jatropha curcas])

HSP 1 Score: 283.5 bits (724), Expect = 4.8e-73
Identity = 167/341 (48.97%), Postives = 221/341 (64.81%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDG---EGYVAQTRYLLRRANENLG 60
           M    PSP  RK KF PK+  +++P P  V K+E  D    E   AQ + L+R+ NENL 
Sbjct: 1   MDQEQPSPTPRKAKFTPKAPPQRKPIPT-VTKSEVNDSNNDEDEAAQAQKLMRKFNENLR 60

Query: 61  KRANKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENG--SRKNDIEPEVDEDEEFVLPV 120
           ++  KVEKKSSVQVAFGPGA + S+SIR YGVP  EN   S + DI+   ++D   V+  
Sbjct: 61  RKVPKVEKKSSVQVAFGPGA-APSTSIRKYGVPGCENAGSSSRLDIKDSDNDDRRIVVSS 120

Query: 121 ARDVNEDG--KYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSG 180
                EDG  KY  +         +   +  K K+DY+EPWDY ++YYPTTLPLR PYSG
Sbjct: 121 LSTAEEDGASKYHSE---------AIGVLPLKIKKDYREPWDYNHTYYPTTLPLRRPYSG 180

Query: 181 DPERLDEAEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTAT 240
           DPE L+E EFG+     EY+EN++ PA DLGLL+E+ +   +FFQLP++LPL K+S+   
Sbjct: 181 DPELLNEEEFGEAARKLEYNENTIKPASDLGLLEESDKERLFFFQLPSKLPLVKRSAITK 240

Query: 241 GKEKVGNSRSSNSTSSSDLD-DLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCS 300
           GKEKV  S  S  TS+   +   + LS G MGK+L+Y+SGA+K++LGD LYDVS GS+C+
Sbjct: 241 GKEKVEGSTPSQGTSALKKESSFQGLSEGYMGKMLVYRSGAVKIKLGDTLYDVSPGSDCT 300

Query: 301 FLQHVVAINSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           F Q ++AI+    QCC +G +  R VVTPD+ SLLNSV NL
Sbjct: 301 FAQDLMAIDIASKQCCTIGKLRKRAVVTPDVDSLLNSVINL 330

BLAST of CSPI04G04000 vs. NCBI nr
Match: gi|802769842|ref|XP_012090466.1| (PREDICTED: DNA-directed RNA polymerase III subunit rpc4 isoform X1 [Jatropha curcas])

HSP 1 Score: 283.5 bits (724), Expect = 4.8e-73
Identity = 167/341 (48.97%), Postives = 221/341 (64.81%), Query Frame = 1

Query: 1   MKNSDPSPPRRKVKFAPKSSQRKRPPPPPVQKTEDEDG---EGYVAQTRYLLRRANENLG 60
           M    PSP  RK KF PK+  +++P P  V K+E  D    E   AQ + L+R+ NENL 
Sbjct: 20  MDQEQPSPTPRKAKFTPKAPPQRKPIPT-VTKSEVNDSNNDEDEAAQAQKLMRKFNENLR 79

Query: 61  KRANKVEKKSSVQVAFGPGAESTSSSIRTYGVPKVENG--SRKNDIEPEVDEDEEFVLPV 120
           ++  KVEKKSSVQVAFGPGA + S+SIR YGVP  EN   S + DI+   ++D   V+  
Sbjct: 80  RKVPKVEKKSSVQVAFGPGA-APSTSIRKYGVPGCENAGSSSRLDIKDSDNDDRRIVVSS 139

Query: 121 ARDVNEDG--KYFDKKTKDGITESSSSAMETKTKRDYKEPWDYQNSYYPTTLPLRMPYSG 180
                EDG  KY  +         +   +  K K+DY+EPWDY ++YYPTTLPLR PYSG
Sbjct: 140 LSTAEEDGASKYHSE---------AIGVLPLKIKKDYREPWDYNHTYYPTTLPLRRPYSG 199

Query: 181 DPERLDEAEFGQDVMNREYDENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTAT 240
           DPE L+E EFG+     EY+EN++ PA DLGLL+E+ +   +FFQLP++LPL K+S+   
Sbjct: 200 DPELLNEEEFGEAARKLEYNENTIKPASDLGLLEESDKERLFFFQLPSKLPLVKRSAITK 259

Query: 241 GKEKVGNSRSSNSTSSSDLD-DLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVSSGSNCS 300
           GKEKV  S  S  TS+   +   + LS G MGK+L+Y+SGA+K++LGD LYDVS GS+C+
Sbjct: 260 GKEKVEGSTPSQGTSALKKESSFQGLSEGYMGKMLVYRSGAVKIKLGDTLYDVSPGSDCT 319

Query: 301 FLQHVVAINSEEGQCCDLGDIGNRVVVTPDISSLLNSVTNL 334
           F Q ++AI+    QCC +G +  R VVTPD+ SLLNSV NL
Sbjct: 320 FAQDLMAIDIASKQCCTIGKLRKRAVVTPDVDSLLNSVINL 349

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RPC4_MOUSE9.3e-0833.86DNA-directed RNA polymerase III subunit RPC4 OS=Mus musculus GN=Polr3d PE=2 SV=2[more]
RPC4_HUMAN1.6e-0733.33DNA-directed RNA polymerase III subunit RPC4 OS=Homo sapiens GN=POLR3D PE=1 SV=2[more]
RPC4_BOVIN2.1e-0738.14DNA-directed RNA polymerase III subunit RPC4 OS=Bos taurus GN=POLR3D PE=2 SV=1[more]
RPC4_YEAST3.0e-0630.17DNA-directed RNA polymerase III subunit RPC4 OS=Saccharomyces cerevisiae (strain... [more]
Match NameE-valueIdentityDescription
A0A0A0KZ61_CUCSA5.2e-18399.70Uncharacterized protein OS=Cucumis sativus GN=Csa_4G022870 PE=4 SV=1[more]
A0A067JSB7_JATCU3.4e-7348.97Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26275 PE=4 SV=1[more]
B9R9V4_RICCO5.9e-7048.21DNA binding protein, putative OS=Ricinus communis GN=RCOM_1500880 PE=4 SV=1[more]
A0A061FZZ6_THECC9.5e-6847.48DNA binding protein, putative isoform 2 OS=Theobroma cacao GN=TCM_014896 PE=4 SV... [more]
A0A058ZWX9_EUCGR1.0e-6645.15Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K00171 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G25180.12.5e-4337.43 RNA polymerase III RPC4[more]
AT5G09380.11.1e-3541.79 RNA polymerase III RPC4[more]
Match NameE-valueIdentityDescription
gi|449458357|ref|XP_004146914.1|7.5e-18399.70PREDICTED: DNA-directed RNA polymerase III subunit rpc4-like [Cucumis sativus][more]
gi|659107882|ref|XP_008453902.1|2.8e-17495.51PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like [Cucumis melo][more]
gi|1009122078|ref|XP_015877810.1|1.2e-7649.26PREDICTED: uncharacterized protein LOC107414218 [Ziziphus jujuba][more]
gi|643706312|gb|KDP22444.1|4.8e-7348.97hypothetical protein JCGZ_26275 [Jatropha curcas][more]
gi|802769842|ref|XP_012090466.1|4.8e-7348.97PREDICTED: DNA-directed RNA polymerase III subunit rpc4 isoform X1 [Jatropha cur... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR007811RPC4
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
GO:0003899DNA-directed RNA polymerase activity
Vocabulary: Cellular Component
TermDefinition
GO:0005666DNA-directed RNA polymerase III complex
Vocabulary: Biological Process
TermDefinition
GO:0006383transcription from RNA polymerase III promoter
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006383 transcription from RNA polymerase III promoter
cellular_component GO:0005666 DNA-directed RNA polymerase III complex
cellular_component GO:0005730 nucleolus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003899 DNA-directed RNA polymerase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G04000.1CSPI04G04000.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR007811DNA-directed RNA polymerase III subunit RPC4PANTHERPTHR13408DNA-DIRECTED RNA POLYMERASE IIIcoord: 132..334
score: 8.8E-109coord: 7..115
score: 8.8E
IPR007811DNA-directed RNA polymerase III subunit RPC4PFAMPF05132RNA_pol_Rpc4coord: 215..323
score: 1.2
NoneNo IPR availablePANTHERPTHR13408:SF3SUBFAMILY NOT NAMEDcoord: 132..334
score: 8.8E-109coord: 7..115
score: 8.8E