CSPI03G21000 (gene) Wild cucumber (PI 183967)

Name: CSPI03G21000
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon 297 family
Location: Chr3 : 17103937 .. 17104328 (+)
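The span of the feature can be derived from the coordinates above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr3 : 17103937 .. 17104328 (+)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3 : 17103937 .. 17104328 (+)")
print(chrom, strand, span_length(start, end))  # Chr3 + 392
```

Under that assumption the feature covers 392 bases on the plus strand of Chr3.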
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAGGAAAAGGGACTATGTTTTCGATGTGACGAAAAATTCAGTCTGGGGCACCGTTGCAAAAGACGAGAATTAAATATCATTGTTATTCAAGAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAAAGGAGACTGAAGATGAGAATGAGCAAATCAATACTGAGATTGCGAATTTGTCTTTACATTCGTTGGTAGGTTTTAGTTCTCCTAAAACCATAAAAATAAAAGGCGAAATCAGGAATCGCGAAGTTGTCGTGTCGATGGGGGAGCTACACATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAATAGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACATGA

mRNA sequence

ATGAAGAAGGAAAAGGGACTATGTTTTCGATGTGACGAAAAATTCAGTCTGGGGCACCGTTGCAAAAGACGAGAATTAAATATCATTGTTATTCAAGAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAAAGGAGACTGAAGATGAGAATGAGCAAATCAATACTGAGATTGCGAATTTGTCTTTACATTCGTTGGAATCGCGAAGTTGTCGTGTCGATGGGGGAGCTACACATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAATAGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACATGA

Coding sequence (CDS)

ATGAAGAAGGAAAAGGGACTATGTTTTCGATGTGACGAAAAATTCAGTCTGGGGCACCGTTGCAAAAGACGAGAATTAAATATCATTGTTATTCAAGAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAAAGGAGACTGAAGATGAGAATGAGCAAATCAATACTGAGATTGCGAATTTGTCTTTACATTCGTTGGAATCGCGAAGTTGTCGTGTCGATGGGGGAGCTACACATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAATAGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACATGA
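The coding sequence can be translated to check it against the query residues shown in the BLAST alignments. This is a minimal sketch, assuming the standard genetic code applies; `translate` is a hypothetical helper written for illustration, and only the first 45 nucleotides of the CDS above are used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 45 nucleotides of the CDS listed above.
print(translate("ATGAAGAAGGAAAAGGGACTATGTTTTCGATGTGACGAAAAATTC"))
# MKKEKGLCFRCDEKF
```

Residues 2 to 15 of this translation (KKEKGLCFRCDEKF) agree with the start of the query lines in the alignments.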
BLAST of CSPI03G21000 vs. TrEMBL
Match: A0A067D9Z8_CITSI (Uncharacterized protein (Fragment) OS=Citrus sinensis GN=CISIN_1g045527mg PE=4 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 2.4e-15
Identity = 51/131 (38.93%), Positives = 78/131 (59.54%), Query Frame = 1

Query: 3   KEKGLCFRCDEKFSLGHRCKRRELNIIVIQEGEDLSGEIDKVAKETEDEN---EQINTEI 62
           +E GLC++CDEKFS GHRC+++EL ++++QE E  +  ++ V +E E E+   E    ++
Sbjct: 62  QECGLCYKCDEKFSPGHRCRKQELQVVLLQEYEAEAQAVEDVGQERELESKPTEGAKNQV 121

Query: 63  ANLSLHSL-------------ESRSCRV----DGGATHNFISEEVVKELKIPIETLDAYG 114
             +SL+S+             E  + +V    D GA+HNFIS EVV  LK+PI   + YG
Sbjct: 122 VEVSLNSVVGLTSPKTLKLASEINNKKVVVLTDSGASHNFISNEVVLVLKLPITNTEPYG 181
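The identity and positives percentages reported for each HSP are the match counts divided by the alignment length. A minimal sketch for the first HSP above; `percent` is a hypothetical helper, not part of BLAST.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / alignment_length, 2)

# Counts taken from the HSP above: 51 identical and 78 positive
# positions over 131 aligned columns.
print(percent(51, 131), percent(78, 131))  # 38.93 59.54
```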

BLAST of CSPI03G21000 vs. TrEMBL
Match: E2DMZ5_BETVU (Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 1.0e-13
Identity = 48/125 (38.40%), Positives = 73/125 (58.40%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCKRRELNIIV-IQEGEDLSGEIDKVAKETEDENEQINTEIA 61
           K+E GLCFRCDEK+++GHRCK++EL+I++  +E E+  G + +  +    ++ Q+     
Sbjct: 345 KREHGLCFRCDEKWAIGHRCKKKELSILLGHEEEEEEYGSLMENIQPAHPDDSQLEIHSP 404

Query: 62  NLSLHSLESRS-----------------CRVDGGATHNFISEEVVKELKIPIETLDAYGV 109
            +SL+S+   S                   VD GATHNFIS + V+ L+IPI +   +GV
Sbjct: 405 EISLNSVMGISSPKTLKMEGTIYGQKVIVMVDPGATHNFISLDTVRRLQIPISSSRPFGV 464

BLAST of CSPI03G21000 vs. TrEMBL
Match: A0A087GEK8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 1.1e-12
Identity = 47/129 (36.43%), Positives = 72/129 (55.81%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCKRRELNIIVIQE-GEDLSGEIDKVAKETEDENEQINTEIA 61
           +K  GLCFRCDEK+ + H+C ++E+N++++QE G D+  E D    +  D  +Q  TE+A
Sbjct: 341 RKADGLCFRCDEKWHIRHQCPKKEVNVLLVQEDGPDILWEAD---DDFTDATDQAITELA 400

Query: 62  NLSLHSLESRS-----------------CRVDGGATHNFISEEVVKELKIPIETLDAYGV 113
            LSL+S+   S                   +D GA+HNF+SE++V  L +      +YGV
Sbjct: 401 ELSLNSMVGISSPSTMKLMGTIQTTEVVVLIDSGASHNFVSEQLVHRLGLQSAKTGSYGV 460

BLAST of CSPI03G21000 vs. TrEMBL
Match: A0A087GW89_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G106200 PE=4 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 2.5e-12
Identity = 45/122 (36.89%), Positives = 70/122 (57.38%), Query Frame = 1

Query: 3   KEKGLCFRCDEKFSLGHRCKRRELNIIVIQEGEDLSGEIDKVAKETEDENEQINTEIANL 62
           + +GLCFRCDEK+   HRC RREL+++++QE       +++   ++++E   +  E+A L
Sbjct: 365 RAEGLCFRCDEKWYERHRCPRRELSVVIVQEEGPDKEWVEEDETDSDEEGVTV-AEMATL 424

Query: 63  SLHSLESRS-----------------CRVDGGATHNFISEEVVKELKIPIETLDAYGVVL 108
           SL+SL   S                   +D GA+HNFISE +VK+L +  E    YGV++
Sbjct: 425 SLNSLVGISSPRTMKLKAKMLGTEVVVMIDSGASHNFISEPLVKKLSMKTEESHCYGVMM 484

BLAST of CSPI03G21000 vs. TrEMBL
Match: A0A0D3BSK9_BRAOL (Uncharacterized protein OS=Brassica oleracea var. oleracea PE=3 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 5.6e-12
Identity = 47/128 (36.72%), Positives = 70/128 (54.69%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCKRRELNIIVIQEGEDLSGEIDKVAKETEDENEQINTEIAN 61
           +K+  LCFRCDEK+   H C R+EL ++V+ E        ++  +  EDE E+I TE+A 
Sbjct: 466 RKKDDLCFRCDEKYVYPHVCSRKELMVLVVHENGTEIEISEEQMEHREDEEEEI-TEVAE 525

Query: 62  LSLHSLESRSC-----------------RVDGGATHNFISEEVVKELKIPIETLDAYGVV 113
           LS++S+   S                   +D GATHNFISE +V+ L +   T   YGV+
Sbjct: 526 LSVNSVVGLSAPHTIKLRGTINGEEVVVLIDSGATHNFISESLVRRLGLTRGTSRGYGVM 585

BLAST of CSPI03G21000 vs. NCBI nr
Match: gi|729344250|ref|XP_010541181.1| (PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana])

HSP 1 Score: 92.4 bits (228), Expect = 5.4e-16
Identity = 55/127 (43.31%), Positives = 77/127 (60.63%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCKRRELNIIVIQE----GEDLSGEIDKVAKETEDENE---- 61
           K++KGLCFRCDEKF +GHRCK++EL +I+ +E    GE+L  E D  A   EDE E    
Sbjct: 577 KRKKGLCFRCDEKFFVGHRCKQKELQVILAEEITETGEELEEEQDNEAGNREDEGEFAEL 636

Query: 62  QINTEIANLSLHSLESRS--------CRVDGGATHNFISEEVVKELKIPIETLDAYGVVL 113
            +N+ +   S  +L+ R           +D GATHNFIS +++K+LK+  E    +GV L
Sbjct: 637 SLNSVVGLTSPKTLKIRGSIEGQEVVVLIDSGATHNFISLKLMKKLKLRPEGNTQFGVSL 696

BLAST of CSPI03G21000 vs. NCBI nr
Match: gi|641816240|gb|KDO38385.1| (hypothetical protein CISIN_1g045527mg, partial [Citrus sinensis])

HSP 1 Score: 89.7 bits (221), Expect = 3.5e-15
Identity = 51/131 (38.93%), Positives = 78/131 (59.54%), Query Frame = 1

Query: 3   KEKGLCFRCDEKFSLGHRCKRRELNIIVIQEGEDLSGEIDKVAKETEDEN---EQINTEI 62
           +E GLC++CDEKFS GHRC+++EL ++++QE E  +  ++ V +E E E+   E    ++
Sbjct: 62  QECGLCYKCDEKFSPGHRCRKQELQVVLLQEYEAEAQAVEDVGQERELESKPTEGAKNQV 121

Query: 63  ANLSLHSL-------------ESRSCRV----DGGATHNFISEEVVKELKIPIETLDAYG 114
             +SL+S+             E  + +V    D GA+HNFIS EVV  LK+PI   + YG
Sbjct: 122 VEVSLNSVVGLTSPKTLKLASEINNKKVVVLTDSGASHNFISNEVVLVLKLPITNTEPYG 181

BLAST of CSPI03G21000 vs. NCBI nr
Match: gi|731341463|ref|XP_010681914.1| (PREDICTED: uncharacterized protein LOC104896819 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 87.0 bits (214), Expect = 2.3e-14
Identity = 52/126 (41.27%), Positives = 73/126 (57.94%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCKRRELNIIVIQEGEDLSG-EIDKVAKETEDENEQINTEIA 61
           K+ KGLCFRCDEK+++GHRCKRREL++++ Q   DL G E  +++   E     I +EI+
Sbjct: 377 KRAKGLCFRCDEKWNVGHRCKRRELSVLLTQ---DLDGEEAQELSVMEEGAPPSIQSEIS 436

Query: 62  NLSLHSLESRS--------------CRVDGGATHNFISEEVVKELKIPIETLDAYGVVLG 113
             S+  +++                  VD GATHNFIS   VK+L +PI     +GV LG
Sbjct: 437 LNSVLGIDAPKTLKMKGQINGQDVVVMVDPGATHNFISLATVKKLSLPISPTQNFGVTLG 496

BLAST of CSPI03G21000 vs. NCBI nr
Match: gi|659119521|ref|XP_008459701.1| (PREDICTED: uncharacterized protein LOC103498741 [Cucumis melo])

HSP 1 Score: 86.3 bits (212), Expect = 3.9e-14
Identity = 51/135 (37.78%), Positives = 79/135 (58.52%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCK---RRELNIIVIQEGEDLSGEIDKVAKETE----DENEQ 61
           +KEKGLCFRC+EK+S  H+C+   +REL + V+ EG +    +++  +E E    + NE 
Sbjct: 351 RKEKGLCFRCNEKYSADHKCRLKEQRELRMFVVTEGREEYEIVEEEKEEKELGRIEVNED 410

Query: 62  INTEIANLSLHSL-----------------ESRSCRVDGGATHNFISEEVVKELKIPIET 113
           I T +  LS++S+                 E     +D GATHNF+SE++VK+L +PI+ 
Sbjct: 411 ITT-VVELSINSVVGLNDPGTMKVRGKLLGEEVIILIDCGATHNFVSEKLVKKLILPIKE 470

BLAST of CSPI03G21000 vs. NCBI nr
Match: gi|729346269|ref|XP_010541862.1| (PREDICTED: uncharacterized protein LOC104815241 isoform X1 [Tarenaya hassleriana])

HSP 1 Score: 85.5 bits (210), Expect = 6.6e-14
Identity = 53/129 (41.09%), Positives = 72/129 (55.81%), Query Frame = 1

Query: 2   KKEKGLCFRCDEKFSLGHRCKRRELNIIVIQE-GEDLSGEIDKVAKETEDENEQINTEIA 61
           ++++GLCFRCDEK+  GH+CK +EL +IV+QE GE L         +   E   +  EIA
Sbjct: 368 RRKRGLCFRCDEKYFFGHKCKLKELQVIVVQEDGETLLAA--DAHPDPVPEEPPVAPEIA 427

Query: 62  NLSLHSLESRS-----------------CRVDGGATHNFISEEVVKELKIPIETLDAYGV 113
            LSL+ +   +                   VD GATHNFIS EV+++L+I  ET   YGV
Sbjct: 428 ELSLNFVVGLTSPKTLKLQGSISKLPVIVMVDSGATHNFISWEVIRKLRICPETTTGYGV 487

The following BLAST results are available for this feature:
BLAST of CSPI03G21000 vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A067D9Z8_CITSI  2.4e-15  38.93         Uncharacterized protein (Fragment) OS=Citrus sinensis GN=CISIN_1g045527mg PE=4 S...
E2DMZ5_BETVU      1.0e-13  38.40         Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1
A0A087GEK8_ARAAL  1.1e-12  36.43         Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1
A0A087GW89_ARAAL  2.5e-12  36.89         Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G106200 PE=4 SV=1
A0A0D3BSK9_BRAOL  5.6e-12  36.72         Uncharacterized protein OS=Brassica oleracea var. oleracea PE=3 SV=1
BLAST of CSPI03G21000 vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|729344250|ref|XP_010541181.1|  5.4e-16  43.31         PREDICTED: uncharacterized protein LOC104814705 [Tarenaya hassleriana]
gi|641816240|gb|KDO38385.1|       3.5e-15  38.93         hypothetical protein CISIN_1g045527mg, partial [Citrus sinensis]
gi|731341463|ref|XP_010681914.1|  2.3e-14  41.27         PREDICTED: uncharacterized protein LOC104896819 [Beta vulgaris subsp. vulgaris]
gi|659119521|ref|XP_008459701.1|  3.9e-14  37.78         PREDICTED: uncharacterized protein LOC103498741 [Cucumis melo]
gi|729346269|ref|XP_010541862.1|  6.6e-14  41.09         PREDICTED: uncharacterized protein LOC104815241 isoform X1 [Tarenaya hassleriana...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G21000.1  CSPI03G21000.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  unknown  Coil         Coil                coord: 45..65

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None