CSPI04G04310 (gene) Wild cucumber (PI 183967)

Name: CSPI04G04310
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Mediator of RNA polymerase II transcription subunit 31
Location: Chr4 : 2809673 .. 2812829 (-)
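
The location value packs the chromosome, start, end, and strand into one string. A minimal Python sketch of how it could be parsed, assuming the `Chr : start .. end (strand)` layout shown above and 1-based inclusive coordinates (both are assumptions; `parse_location` is a hypothetical helper, not part of the database):

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr4 : 2809673 .. 2812829 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Chr4 : 2809673 .. 2812829 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 3157 bp, and the `(-)` strand means the gene is read from the reverse complement of the chromosome sequence over that interval.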
The following sequences are available for this feature:

Gene sequence (with intron)

GAGAAGTATTTTGAGGTTTTTATAGCATTGGTAGCGCGGGCATATTTCATTGTTCTACGTTGGGTCATCGATTTCAATCGCTTGCTGCTCTAAGTTGAATGTAGCAATTGTTGAGAGCTTTGAAGGGTATCTCCATATCGCTCCACTACGAATCTCGCAACTTCTTCTTAATTTTTGGGTTCAATTTGAACTTCGACTGGCAGCCAGATTCAATGGCGTCCAATCATGGATTAAAAGAAGAAGAAGCTGATAACCCATCATCGTAAGTTTGTTTCTCTTTTTCATTCTCCTGTTCTATCATTTTCCCTCCTCCGTCTCTCTTTACCCAATTCTTCTTTCATTTTGTATATCGCTTCAATTCTTCATTTTTGTTCCGTTGCTGAGGGATTGGTTTATTGGGCTCATTGTGAATTGATCAATTTTCCCTTTCTGTTTGGTTAATTATTACCAGACCTACCAATGTTTACAAAGATCCCGATGACGGACGGCAGCGGTTCTTGCTCGAATTGGAGTTTGTTCAATGCCTTGCCAATCCTACCTACATTCATTGTAAGTTTTTTGTAAAATTTAGGGTTTCTTTATAGGGGGTTAGGCTTATTTCTTATCCTTGAAACTTCAGCTTGAAGGGTGGGCAAAGCTATTGGATGTTCCAAAATGAGTTTCGAAAATTCAGCGTTTATCCTAAGATTTGCTTTCGAACCTTTTCAGTTGAATTTAGTTTCAATTTGGATTTCTTATGAGCTAACTATACATGAAAGTGTGCTTTACATGAAAAGCAACCAAGACGAAAGTAAATTCTACGTATTTCTTCTAATGATAAAGTGGATTGAACATATGTGTATAATAAATGTTAATGTAAAATTAAATAATCACTTGGAGTGGGTTCAATCTACGACGACCATCTACCAATGATTTAATACCGTATGAGTTTCCTTGATAACAAATATCAGTATAGTATTGTTATAGTTATTCCATTCGATTAGTTGAGGTCTAACAAGCTAGCTTGAACATCCAGGATATGAAAGAAAAAGAACTGTTGGCTTGAATTTTTATCATATAATATCTCACGTGAACCATCTGTTTTATTTGAATTTTGAACATGATTTTCTCAGTTGCATTTCCTGTGAAGTGATTTTTTATTGACCAATGTGAAGAAGGAAGTCACAATGAAAGATCAGATTTTGACATCTAGAACCAAAATTCATTCAATTGCATCTTCTTGAGTTAGAGTAGTGCCTAGATGAAACTTTTTCTTTTATGCTGGTTGCCTGACTACTACTGGGGCAGCAACCTACTGTTTCTACTACAAAATCAACTTATAATTTACCAGAGATCAGTGCTCGTCTGATCTTTCTGCCTGTTGAGAAAAACTGGCGTTGTACAATTTTTACATATAGTGAATTTCCTAGAATTTTATCCTTTTGATGATATGCTACAGTGTCTTACTGAAGTACATTTCTGAAGTTGGCACTTACATATTTGGTGATAGTCATCTTCACGTACACATTTCTTCTTAATGAATGATGTATTTGCAAATTGATATTGAAATGTTCGAACAAATGCTTTTCTCAGTTTTGGTTGCTGATCATCTAGTTTTATACAGATCTGGCGCAGAATCGTTACCTCGAGGATGAAGCTTTTATTGGTTACTTGAAGTACCTTCAATATTGGCAACGGCCAGAGTATATCAAGTTTATAATGTGAGTCAGTTCTTTCCCTGAACTGATGAATGAGAACTGGTTGTACTTTTGGTATTGACTTCAATGTTGTTTCTAGGTACCCTCATTGTCTTTTTTTTCTTGAACTTCTACAAAATTCAAATTTCCGGAATGCAATGGCTCATCCTGGCAACAAGGTTAGCTTCTGTTATTTTTTGTTCAACTTCTCTTTCGAGATCGGAGAATAAATATTTAGTTGGATAGTGAATTTTTTCTGTGATCACGACTATGCACATACATCTCTGATAAATGCTCAGTAGACTTATATATTTTCATTTCAGT
TTATAGTGAGATGAAAAAATCTAAACTGTCTCCCATGTCCATCGATTTCAAAAGAGATATTTGTTGCTTCTTTCCAAATGGTTTGCTCATATGCGGATATGTGATATTTCCCCTTTTCCTTTTTCTTAGTATTTTAACACAGTTGCATCATTTGGAATTTCTTAGGAATTGGCACACAGACAGCAATTTTACTTTTGGAAGAACTATAGGAACAATCGATTGAAACACATTTTGCCTCGACCTCTTCCCGAACCTGCAGCATTACCACCCCCAGTCACTGCTCCACCTCAAGCAGCTGTACCGGCTCCAACCTCAGCCCCCAATATGGCAGCTTCACCTGCTGCACCTGCACTTTCTCCCATGCAGTATGGTATTCCACCTGGTCCTGGTCTTCCAAAGAACGACATGAAGGGTGCAGGAATTGATCGACGAAAGAGAAAACATGTAATTTATCACATATAGTTTGTTTTCTTAAGTTATCAATGTACTCTACTTATTAGGTTAGAAGACACACCTCTTCCTTAAACCTTCAAATTTTCATTTACTTATTTTGTGTGTTTGACAGGAAAGAAGTATGACGTAGTTGGTTTCATTAGTCAAAACTAGGGTGGATGCTGGAGCAGCTCATGAGGTAAAATCTGCATACAGTGAAAATTAAAGTTCAAAGATGATGCCTTTCATAAGCTTTCCAAGAGTTATCCTTCACTTCAGGTCGTTCAAAATCTCTCTCTCGCATGTTCGTGAACACGAGCTACATGACATCCATGTTTATGGGTTCATGTAATATATCCTTTGATGAAGATCTGCATGACGTTGGGATAAAATATTGCTAAGAATCCTTAAAATTTTTGGTTTATCTATATGTTTGGCTGTTATAGATAATACATTGATAAGCAGTGTCTTTGACTATCTTTTTATCTCCATCTATATTAGACATTTTTTTTTTGTAGCAGCATGAAATCCTATAAGTAGGTATGGTGGGAGTGAGCGCTAACGTGAATATAGCAAAATATTATTGTTTACTAATGATGTATATTGATAGATTATTATCATCTATTGGTGATAGACACAAATAGATGTTTACTATGTTTATTTATCACGGTGACATTTTGTTATATTTGTAAATATTATTCGCTATATATATTATTGATTATGAT

mRNA sequence

ATGGCGTCCAATCATGGATTAAAAGAAGAAGAAGCTGATAACCCATCATCACCTACCAATGTTTACAAAGATCCCGATGACGGACGGCAGCGGTTCTTGCTCGAATTGGAGTTTGTTCAATGCCTTGCCAATCCTACCTACATTCATTATCTGGCGCAGAATCGTTACCTCGAGGATGAAGCTTTTATTGGTTACTTGAAGTACCTTCAATATTGGCAACGGCCAGAGTATATCAAGTTTATAATGTACCCTCATTGTCTTTTTTTTCTTGAACTTCTACAAAATTCAAATTTCCGGAATGCAATGGCTCATCCTGGCAACAAGGAATTGGCACACAGACAGCAATTTTACTTTTGGAAGAACTATAGGAACAATCGATTGAAACACATTTTGCCTCGACCTCTTCCCGAACCTGCAGCATTACCACCCCCAGTCACTGCTCCACCTCAAGCAGCTGTACCGGCTCCAACCTCAGCCCCCAATATGGCAGCTTCACCTGCTGCACCTGCACTTTCTCCCATGCAGTATGGTATTCCACCTGGTCCTGGTCTTCCAAAGAACGACATGAAGGGTGCAGGAATTGATCGACGAAAGAGAAAACATGAAAGAAGTATGACGTAG

Coding sequence (CDS)

ATGGCGTCCAATCATGGATTAAAAGAAGAAGAAGCTGATAACCCATCATCACCTACCAATGTTTACAAAGATCCCGATGACGGACGGCAGCGGTTCTTGCTCGAATTGGAGTTTGTTCAATGCCTTGCCAATCCTACCTACATTCATTATCTGGCGCAGAATCGTTACCTCGAGGATGAAGCTTTTATTGGTTACTTGAAGTACCTTCAATATTGGCAACGGCCAGAGTATATCAAGTTTATAATGTACCCTCATTGTCTTTTTTTTCTTGAACTTCTACAAAATTCAAATTTCCGGAATGCAATGGCTCATCCTGGCAACAAGGAATTGGCACACAGACAGCAATTTTACTTTTGGAAGAACTATAGGAACAATCGATTGAAACACATTTTGCCTCGACCTCTTCCCGAACCTGCAGCATTACCACCCCCAGTCACTGCTCCACCTCAAGCAGCTGTACCGGCTCCAACCTCAGCCCCCAATATGGCAGCTTCACCTGCTGCACCTGCACTTTCTCCCATGCAGTATGGTATTCCACCTGGTCCTGGTCTTCCAAAGAACGACATGAAGGGTGCAGGAATTGATCGACGAAAGAGAAAACATGAAAGAAGTATGACGTAG
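
The CDS can be checked against the protein used as the BLAST query by translating it codon by codon. The sketch below assumes the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, and only the first and last few codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven and last six codons of the CDS listed above.
n_term = translate("ATGGCGTCCAATCATGGATTA")
c_term = translate("GAAAGAAGTATGACGTAG")
print(n_term, c_term)
```

The two fragments translate to `MASNHGL` and `ERSMT*`, which agree with the start and end of the query protein in the alignments that follow.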
BLAST of CSPI04G04310 vs. Swiss-Prot
Match: MED31_ARATH (Mediator of RNA polymerase II transcription subunit 31 OS=Arabidopsis thaliana GN=MED31 PE=1 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 1.9e-72
Identity = 146/203 (71.92%), Positives = 158/203 (77.83%), Query Frame = 1

Query: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60
           MAS   + ++ ++ PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRY EDE
Sbjct: 1   MASPEEMGDDASEIPSPPKNTYKDPDGGRQRFLLELEFIQCLANPTYIHYLAQNRYFEDE 60

Query: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120
           AFIGYLKYLQYWQRPEYIKFIMYPHCL+FLELLQN NFR AMAHP NKELAHRQQFY+WK
Sbjct: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLYFLELLQNPNFRTAMAHPANKELAHRQQFYYWK 120

Query: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPP 180
           NYRNNRLKHILPRPLPEP    PPV AP  +  PAP+     A +  +PALSPMQY    
Sbjct: 121 NYRNNRLKHILPRPLPEPVPPQPPV-APSTSLPPAPS-----ATAALSPALSPMQY---- 180

Query: 181 GPGLPKND---MKGAGIDRRKRK 201
              L KND   M   GIDRRKRK
Sbjct: 181 NNMLSKNDTRNMGATGIDRRKRK 193
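
The identity percentage in each HSP summary is the number of identical positions divided by the alignment length. A small sketch that recomputes it from the summary text, assuming the `Identity = matches/length` layout used on this page (`percent_identity` is a hypothetical helper):

```python
import re

def percent_identity(hsp_line):
    """Recompute percent identity from an 'Identity = a/b (...)' summary."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", hsp_line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    return round(100 * matches / length, 2)

arath = percent_identity("Identity = 146/203 (71.92%)")
dicdi = percent_identity("Identity = 57/97 (58.76%)")
print(arath, dicdi)
```

Note that the denominator is the alignment length (including gap columns), not the length of either protein, so values from hits with different alignment lengths are not directly comparable.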

BLAST of CSPI04G04310 vs. Swiss-Prot
Match: MED31_DICDI (Putative mediator of RNA polymerase II transcription subunit 31 OS=Dictyostelium discoideum GN=med31 PE=3 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 1.3e-31
Identity = 57/97 (58.76%), Positives = 76/97 (78.35%), Query Frame = 1

Query: 31  RFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFL 90
           RF++ELEF+QCL+NP Y++YLAQNRY +D+AF+ YL YLQYW++PEY KFI+YP  L+FL
Sbjct: 55  RFIMELEFIQCLSNPRYLNYLAQNRYFQDKAFVNYLVYLQYWKKPEYAKFIVYPQSLYFL 114

Query: 91  ELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRL 128
           +LLQ   FR  + H  + +  H QQFY W+ YRNNR+
Sbjct: 115 DLLQEERFRQELNHSQSTDFIHEQQFYHWQYYRNNRM 151

BLAST of CSPI04G04310 vs. Swiss-Prot
Match: MED31_DANRE (Mediator of RNA polymerase II transcription subunit 31 OS=Danio rerio GN=med31 PE=2 SV=2)

HSP 1 Score: 117.9 bits (294), Expect = 1.4e-25
Identity = 53/108 (49.07%), Positives = 70/108 (64.81%), Query Frame = 1

Query: 21  VYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKF 80
           V +  +  RQRF LELEFVQCLANP Y+++LAQ  YL ++ F+ YLKYL YW+ PEY KF
Sbjct: 4   VMETDEQARQRFQLELEFVQCLANPNYLNFLAQRGYLREKPFVNYLKYLLYWKEPEYAKF 63

Query: 81  IMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 129
           + YPHCL  LELLQ  +FR  + +    +    QQ   W++Y   R +
Sbjct: 64  LKYPHCLHMLELLQYEHFRKELVNAQCAKFIDEQQILHWQHYSRKRTR 111

BLAST of CSPI04G04310 vs. Swiss-Prot
Match: MED31_BOVIN (Mediator of RNA polymerase II transcription subunit 31 OS=Bos taurus GN=MED31 PE=2 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 3.3e-24
Identity = 51/105 (48.57%), Positives = 68/105 (64.76%), Query Frame = 1

Query: 24  DPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMY 83
           D    R RF LELEFVQCLANP Y+++LAQ  Y +D+AF+ YLKYL YW+ PEY K++ Y
Sbjct: 10  DDAGNRLRFQLELEFVQCLANPNYLNFLAQRGYFKDKAFVNYLKYLLYWKEPEYAKYLKY 69

Query: 84  PHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 129
           P CL  LELLQ  +FR  + +    +    QQ   W++Y   R++
Sbjct: 70  PQCLHMLELLQYEHFRKELVNAQCAKFIDEQQILHWQHYSRKRMR 114

BLAST of CSPI04G04310 vs. Swiss-Prot
Match: MED31_MOUSE (Mediator of RNA polymerase II transcription subunit 31 OS=Mus musculus GN=Med31 PE=1 SV=2)

HSP 1 Score: 112.8 bits (281), Expect = 4.4e-24
Identity = 51/105 (48.57%), Positives = 68/105 (64.76%), Query Frame = 1

Query: 24  DPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMY 83
           D    R RF LELEFVQCLANP Y+++LAQ  Y +D+AF+ YLKYL YW+ PEY K++ Y
Sbjct: 10  DDAGNRLRFQLELEFVQCLANPNYLNFLAQRGYFKDKAFVNYLKYLLYWKEPEYAKYLKY 69

Query: 84  PHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 129
           P CL  LELLQ  +FR  + +    +    QQ   W++Y   R++
Sbjct: 70  PQCLHMLELLQYEHFRKELVNAQCAKFIDEQQILHWQHYSRKRVR 114

BLAST of CSPI04G04310 vs. TrEMBL
Match: A0A0A0KZ88_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G025160 PE=4 SV=1)

HSP 1 Score: 417.9 bits (1073), Expect = 7.1e-114
Identity = 205/206 (99.51%), Positives = 206/206 (100.00%), Query Frame = 1

Query: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60
           MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE
Sbjct: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60

Query: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120
           AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK
Sbjct: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120

Query: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPP 180
           NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSP+QYGIPP
Sbjct: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPIQYGIPP 180

Query: 181 GPGLPKNDMKGAGIDRRKRKHERSMT 207
           GPGLPKNDMKGAGIDRRKRKHERSMT
Sbjct: 181 GPGLPKNDMKGAGIDRRKRKHERSMT 206

BLAST of CSPI04G04310 vs. TrEMBL
Match: V4TKG6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032791mg PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 1.2e-81
Identity = 156/194 (80.41%), Positives = 166/194 (85.57%), Query Frame = 1

Query: 9   EEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKY 68
           EE +D PSSP  VYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKY
Sbjct: 8   EEASDAPSSPKKVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFEDEAFIGYLKY 67

Query: 69  LQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 128
           LQYWQRPEYIKFIMYPHCL+FLELLQN+NFRNAMAHP NKELAHRQQF+FWKNYRNNRLK
Sbjct: 68  LQYWQRPEYIKFIMYPHCLYFLELLQNANFRNAMAHPANKELAHRQQFFFWKNYRNNRLK 127

Query: 129 HILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPPGPGLPKND 188
           HILPRPLPEP+  PPP  APP    PAP     + A+P  PALSPMQYGIPPG  L KND
Sbjct: 128 HILPRPLPEPSEAPPPAAAPP--LPPAPPVLTPVTAAP-GPALSPMQYGIPPGSALMKND 187

Query: 189 MKGAGIDRRKRKHE 203
           M+ + IDRRKRK +
Sbjct: 188 MRSSSIDRRKRKKD 198

BLAST of CSPI04G04310 vs. TrEMBL
Match: A0A061G047_THECC (Mediator of RNA polymerase II transcription subunit 31 isoform 2 OS=Theobroma cacao GN=TCM_014808 PE=4 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 3.3e-79
Identity = 154/204 (75.49%), Positives = 170/204 (83.33%), Query Frame = 1

Query: 9   EEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKY 68
           +  +++PSSP  VYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKY
Sbjct: 8   DNASNSPSSPKTVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFEDEAFIGYLKY 67

Query: 69  LQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 128
           LQYWQRPEYIKFIMYPHCL+FLELLQN++FRN MAHP NKELAHRQQF+FWKNYRNNRLK
Sbjct: 68  LQYWQRPEYIKFIMYPHCLYFLELLQNASFRNGMAHPVNKELAHRQQFFFWKNYRNNRLK 127

Query: 129 HILPRPLPEPAALP---PPVTAPPQAA---VPAPTSAPNMAASPAAPALSPMQYGIPPGP 188
            ILP+P PEP A P   PP   PPQAA   VPA T A   A+   + ALSPM YG+PPG 
Sbjct: 128 FILPKPPPEPVAAPAPLPPTAVPPQAAMPPVPATTIAMTSASPAPSSALSPMPYGLPPGS 187

Query: 189 GLPKNDMKGAGIDRRKRKHERSMT 207
            L KNDM+ +GIDRRKRK+ERS+T
Sbjct: 188 VLAKNDMRNSGIDRRKRKYERSLT 211

BLAST of CSPI04G04310 vs. TrEMBL
Match: A0A0J8CG17_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_5g104190 PE=4 SV=1)

HSP 1 Score: 302.0 bits (772), Expect = 5.6e-79
Identity = 154/202 (76.24%), Positives = 171/202 (84.65%), Query Frame = 1

Query: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60
           MAS++   ++ +++PS   NVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY +DE
Sbjct: 1   MASSNDA-DDTSNSPSLTQNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDE 60

Query: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120
           AFIGYLKYLQYWQ+PEYIKFIMYPHCLFFLELLQN+NFRNAMAHPG+KELAHRQQFYFWK
Sbjct: 61  AFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGSKELAHRQQFYFWK 120

Query: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPP 180
           NYRNNRLKHILPRPLPEP   PP  TA P    P P + P  A S ++PALSPMQY IPP
Sbjct: 121 NYRNNRLKHILPRPLPEPDPAPPASTAAPP---PLPPAIP--ATSASSPALSPMQYAIPP 180

Query: 181 GPGLPKNDMKGAGIDRRKRKHE 203
           G G+ KNDM+ +G DRRKRK E
Sbjct: 181 GSGVAKNDMRNSGTDRRKRKKE 196

BLAST of CSPI04G04310 vs. TrEMBL
Match: A0A0D2N5R9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G052400 PE=4 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 1.3e-78
Identity = 152/202 (75.25%), Positives = 168/202 (83.17%), Query Frame = 1

Query: 9   EEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKY 68
           +  +D PSSP NVYKDPDDGRQRFLLELEF+QCLANPTYIHYLAQNRY EDEAFIGYLKY
Sbjct: 8   DNASDTPSSPKNVYKDPDDGRQRFLLELEFLQCLANPTYIHYLAQNRYFEDEAFIGYLKY 67

Query: 69  LQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 128
           LQYWQRPEYIKFIMYPHCL+FLELLQN+NFRNAMAHP NKE+AHRQQF+FWKNYRNNRLK
Sbjct: 68  LQYWQRPEYIKFIMYPHCLYFLELLQNANFRNAMAHPANKEVAHRQQFFFWKNYRNNRLK 127

Query: 129 HILPRPLPEPAALP---PPVTAPPQAAVPAPTSAPNMAASPAAPAL--SPMQYGIPPGPG 188
            ILP+P PE    P   PP +APPQ ++PA   A  M  +P APA   SPM YG+P G  
Sbjct: 128 FILPKPPPEEVPTPAPLPPASAPPQQSLPASNIA--MTTAPPAPASTHSPMPYGLPSGSA 187

Query: 189 LPKNDMKGAGIDRRKRKHERSM 206
           L KNDM+ +GIDRRKRKHERS+
Sbjct: 188 LAKNDMRNSGIDRRKRKHERSL 207

BLAST of CSPI04G04310 vs. TAIR10
Match: AT5G19910.2 (AT5G19910.2 SOH1 family protein)

HSP 1 Score: 257.7 bits (657), Expect = 6.2e-69
Identity = 146/233 (62.66%), Positives = 158/233 (67.81%), Query Frame = 1

Query: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60
           MAS   + ++ ++ PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRY EDE
Sbjct: 1   MASPEEMGDDASEIPSPPKNTYKDPDGGRQRFLLELEFIQCLANPTYIHYLAQNRYFEDE 60

Query: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNK------------ 120
           AFIGYLKYLQYWQRPEYIKFIMYPHCL+FLELLQN NFR AMAHP NK            
Sbjct: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLYFLELLQNPNFRTAMAHPANKWFKMLGYWCFWN 120

Query: 121 ------------------ELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVTAPPQ 180
                             ELAHRQQFY+WKNYRNNRLKHILPRPLPEP    PPV AP  
Sbjct: 121 VFWGLVEFRVSLVQNLAYELAHRQQFYYWKNYRNNRLKHILPRPLPEPVPPQPPV-APST 180

Query: 181 AAVPAPTSAPNMAASPAAPALSPMQYGIPPGPGLPKND---MKGAGIDRRKRK 201
           +  PAP+     A +  +PALSPMQY       L KND   M   GIDRRKRK
Sbjct: 181 SLPPAPS-----ATAALSPALSPMQY----NNMLSKNDTRNMGATGIDRRKRK 223

BLAST of CSPI04G04310 vs. NCBI nr
Match: gi|449458311|ref|XP_004146891.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 31 [Cucumis sativus])

HSP 1 Score: 417.9 bits (1073), Expect = 1.0e-113
Identity = 205/206 (99.51%), Positives = 206/206 (100.00%), Query Frame = 1

Query: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60
           MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE
Sbjct: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60

Query: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120
           AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK
Sbjct: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120

Query: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPP 180
           NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSP+QYGIPP
Sbjct: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPIQYGIPP 180

Query: 181 GPGLPKNDMKGAGIDRRKRKHERSMT 207
           GPGLPKNDMKGAGIDRRKRKHERSMT
Sbjct: 181 GPGLPKNDMKGAGIDRRKRKHERSMT 206

BLAST of CSPI04G04310 vs. NCBI nr
Match: gi|659107763|ref|XP_008453845.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 31 [Cucumis melo])

HSP 1 Score: 401.4 bits (1030), Expect = 9.8e-109
Identity = 197/206 (95.63%), Positives = 199/206 (96.60%), Query Frame = 1

Query: 1   MASNHGLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60
           MASNHGL EE ADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE
Sbjct: 1   MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDE 60

Query: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120
           AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK
Sbjct: 61  AFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWK 120

Query: 121 NYRNNRLKHILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPP 180
           NYRNNRLKHILPRPLPEPAALPPPV+APPQA VPAPT AP +AASPA  ALSPMQYGIPP
Sbjct: 121 NYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPTPAPTVAASPATAALSPMQYGIPP 180

Query: 181 GPGLPKNDMKGAGIDRRKRKHERSMT 207
           GPGLPKNDMKGAGIDRRKRKHERSMT
Sbjct: 181 GPGLPKNDMKGAGIDRRKRKHERSMT 206

BLAST of CSPI04G04310 vs. NCBI nr
Match: gi|567889026|ref|XP_006437035.1| (hypothetical protein CICLE_v10032791mg [Citrus clementina])

HSP 1 Score: 310.8 bits (795), Expect = 1.7e-81
Identity = 156/194 (80.41%), Positives = 166/194 (85.57%), Query Frame = 1

Query: 9   EEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKY 68
           EE +D PSSP  VYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKY
Sbjct: 8   EEASDAPSSPKKVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFEDEAFIGYLKY 67

Query: 69  LQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 128
           LQYWQRPEYIKFIMYPHCL+FLELLQN+NFRNAMAHP NKELAHRQQF+FWKNYRNNRLK
Sbjct: 68  LQYWQRPEYIKFIMYPHCLYFLELLQNANFRNAMAHPANKELAHRQQFFFWKNYRNNRLK 127

Query: 129 HILPRPLPEPAALPPPVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPPGPGLPKND 188
           HILPRPLPEP+  PPP  APP    PAP     + A+P  PALSPMQYGIPPG  L KND
Sbjct: 128 HILPRPLPEPSEAPPPAAAPP--LPPAPPVLTPVTAAP-GPALSPMQYGIPPGSALMKND 187

Query: 189 MKGAGIDRRKRKHE 203
           M+ + IDRRKRK +
Sbjct: 188 MRSSSIDRRKRKKD 198

BLAST of CSPI04G04310 vs. NCBI nr
Match: gi|720044213|ref|XP_010269818.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 31 isoform X1 [Nelumbo nucifera])

HSP 1 Score: 310.5 bits (794), Expect = 2.3e-81
Identity = 157/207 (75.85%), Positives = 169/207 (81.64%), Query Frame = 1

Query: 6   GLKEEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGY 65
           G + ++   PS P N+YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGY
Sbjct: 4   GKESDDVQTPSMPKNIYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFEDEAFIGY 63

Query: 66  LKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNN 125
           LKYLQYWQRPEY+KFIMYPHCLFFLELLQNSNFRNAMAHPG+KE+AHRQQF+FWKNYRNN
Sbjct: 64  LKYLQYWQRPEYVKFIMYPHCLFFLELLQNSNFRNAMAHPGSKEIAHRQQFFFWKNYRNN 123

Query: 126 RLKHILPRPLPEPAALPP---------PVTAPPQAAVPAPTSAPNMA-ASPAAPALSPMQ 185
           RLKHILPRPLPEP A PP         P T  P AA PAP   P  A A+P A ALSPM 
Sbjct: 124 RLKHILPRPLPEPVAAPPAPIPPPAPLPATNVPVAASPAPVLVPVPAPAAPPASALSPMP 183

Query: 186 YGIPPGPGLPKNDMKGAGIDRRKRKHE 203
           YG+PPGP L KND + +GIDRRKRK E
Sbjct: 184 YGLPPGPTLSKNDPRNSGIDRRKRKKE 210

BLAST of CSPI04G04310 vs. NCBI nr
Match: gi|802770058|ref|XP_012090529.1| (PREDICTED: mediator of RNA polymerase II transcription subunit 31 [Jatropha curcas])

HSP 1 Score: 308.5 bits (789), Expect = 8.7e-81
Identity = 154/197 (78.17%), Positives = 168/197 (85.28%), Query Frame = 1

Query: 9   EEEADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKY 68
           ++  DNPSSP N+YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKY
Sbjct: 8   DDTLDNPSSPKNIYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFEDEAFIGYLKY 67

Query: 69  LQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLK 128
           LQYWQRPEY+KFIMYPHCL+FLELLQN+NFRNAMAHPGNKELAHRQQF+FWKNYRNNRLK
Sbjct: 68  LQYWQRPEYLKFIMYPHCLYFLELLQNANFRNAMAHPGNKELAHRQQFFFWKNYRNNRLK 127

Query: 129 HILPRPLPEPAALPP---PVTAPPQAAVPAPTSAPNMAASPAAPALSPMQYGIPPGPGLP 188
           HILPRPLPEPA  PP   P+  P Q   P P +   + A+PA+ ALSPM YG+ PG  L 
Sbjct: 128 HILPRPLPEPAPAPPASAPLPPPVQPVPPMPATTIPVPAAPAS-ALSPMPYGMAPGSALA 187

Query: 189 KNDMKGAGIDRRKRKHE 203
           KNDM+ AGIDRRKRK E
Sbjct: 188 KNDMRNAGIDRRKRKKE 203

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name   E-value  Identity  Description
MED31_ARATH  1.9e-72  71.92     Mediator of RNA polymerase II transcription subunit 31 OS=Arabidopsis thaliana G...
MED31_DICDI  1.3e-31  58.76     Putative mediator of RNA polymerase II transcription subunit 31 OS=Dictyostelium...
MED31_DANRE  1.4e-25  49.07     Mediator of RNA polymerase II transcription subunit 31 OS=Danio rerio GN=med31 P...
MED31_BOVIN  3.3e-24  48.57     Mediator of RNA polymerase II transcription subunit 31 OS=Bos taurus GN=MED31 PE...
MED31_MOUSE  4.4e-24  48.57     Mediator of RNA polymerase II transcription subunit 31 OS=Mus musculus GN=Med31 ...

vs. TrEMBL
Match Name        E-value   Identity  Description
A0A0A0KZ88_CUCSA  7.1e-114  99.51     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G025160 PE=4 SV=1
V4TKG6_9ROSI      1.2e-81   80.41     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032791mg PE=4 SV=1
A0A061G047_THECC  3.3e-79   75.49     Mediator of RNA polymerase II transcription subunit 31 isoform 2 OS=Theobroma ca...
A0A0J8CG17_BETVU  5.6e-79   76.24     Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_5g104190 PE=4 S...
A0A0D2N5R9_GOSRA  1.3e-78   75.25     Uncharacterized protein OS=Gossypium raimondii GN=B456_001G052400 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity  Description
AT5G19910.2  6.2e-69  62.66     SOH1 family protein

vs. NCBI nr
Match Name                        E-value   Identity  Description
gi|449458311|ref|XP_004146891.1|  1.0e-113  99.51     PREDICTED: mediator of RNA polymerase II transcription subunit 31 [Cucumis sativ...
gi|659107763|ref|XP_008453845.1|  9.8e-109  95.63     PREDICTED: mediator of RNA polymerase II transcription subunit 31 [Cucumis melo]
gi|567889026|ref|XP_006437035.1|  1.7e-81   80.41     hypothetical protein CICLE_v10032791mg [Citrus clementina]
gi|720044213|ref|XP_010269818.1|  2.3e-81   75.85     PREDICTED: mediator of RNA polymerase II transcription subunit 31 isoform X1 [Ne...
gi|802770058|ref|XP_012090529.1|  8.7e-81   78.17     PREDICTED: mediator of RNA polymerase II transcription subunit 31 [Jatropha curc...

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR008831  Mediator_Med31

Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated

Vocabulary: Molecular Function
Term        Definition
GO:0001104  RNA polymerase II transcription cofactor activity

Vocabulary: Cellular Component
Term        Definition
GO:0016592  mediator complex

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006357 regulation of transcription from RNA polymerase II promoter
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0016592 mediator complex
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0001104 RNA polymerase II transcription cofactor activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0003712 transcription cofactor activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G04310.1  CSPI04G04310.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                  Source   Source Term  Source Description                                        Alignment
IPR008831  Mediator complex, subunit Med31  PANTHER  PTHR13186    MEDIATOR OF RNA POLYMERASE II TRANSCRIPTION SUBUNIT SOH1  coord: 1..175, score: 2.1
IPR008831  Mediator complex, subunit Med31  PFAM     PF05669      Med31                                                     coord: 30..122, score: 6.5
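
The two domain hits can be compared by their residue coordinates. A short sketch, assuming the `coord: start..end` values are 1-based inclusive positions on the protein (an assumption; `parse_coord` is a hypothetical helper):

```python
import re

def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' annotation."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coord field found")
    return int(m.group(1)), int(m.group(2))

panther = parse_coord("coord: 1..175")
pfam = parse_coord("coord: 30..122")

# Assumption: inclusive coordinates, so span = end - start + 1.
pfam_span = pfam[1] - pfam[0] + 1
pfam_inside_panther = panther[0] <= pfam[0] and pfam[1] <= panther[1]
print(pfam_span, pfam_inside_panther)
```

Under that assumption the Pfam Med31 hit covers 93 residues and lies entirely within the PANTHER family hit.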

The following gene(s) are paralogous to this gene:

None