CSPI04G11760 (gene) Wild cucumber (PI 183967)

Name: CSPI04G11760
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Transposon Ty3-G Gag-Pol polyprotein
Location: Chr4 : 10037586 .. 10038128 (+)
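The location field can be turned into a feature length as a quick sanity check. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state the convention), and the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'Chr4 : 10037586 .. 10038128 (+)' into (chrom, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr4 : 10037586 .. 10038128 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length_bp = end - start + 1
codons, remainder = divmod(length_bp, 3)
print(chrom, strand, length_bp, codons, remainder)  # Chr4 + 543 181 0
```

Under that assumption the feature spans 543 bp, which divides evenly into 181 codons (the stop codon included).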
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAGAAGACATGTCGAATTCGAAGAAGGAGATCAAGTGTTCCTAAAAATTAGGCCATACAGACAGGTATCCTTACGGAGAAAGAGGAATGAGAAGCTATCACCGAAGTATTTCGGGCCTTATATGATAGTGAAGCGAATTGGTCAGGTGGCATATCGGCTGGAACTACCAGCGGCAGCAACAATCCACCCTGTGTTCCATGTGTCACAGTTGAAAAAAGCTTTTGGGGAGAGTGCGAATAGCGAAGAGCTGTTGCCGTTTTTGACTGCAAATCACGAGTGGAAGGCCGTGCCACAAGAGACTTACGGTTATAGAAAAAACGAAACAGGAGGGTGGGAGGTTTTAATGAGTTGGAAAGGTCTGCCGCATCATGAAGCCACATGGGAAAGCTATGACGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGGAATGCAATGTTAGACCACCCATCATACACCAATACAGTAGGAGAAGGAACAGGAAAGAGAAGAAGGACGGGAAGTTGGTTATGTAA

mRNA sequence

ATGAAGAGAAGACATGTCGAATTCGAAGAAGGAGATCAAGTGTTCCTAAAAATTAGGCCATACAGACAGGTATCCTTACGGAGAAAGAGGAATGAGAAGCTATCACCGAAGTATTTCGGGCCTTATATGATAGTGAAGCGAATTGGTCAGGTGGCATATCGGCTGGAACTACCAGCGGCAGCAACAATCCACCCTGTGTTCCATGTGTCACAGTTGAAAAAAGCTTTTGGGGAGAGTGCGAATAGCGAAGAGCTGTTGCCGTTTTTGACTGCAAATCACGAGTGGAAGGCCGTGCCACAAGAGACTTACGGTTATAGAAAAAACGAAACAGGAGGGTGGGAGGTTTTAATGAGTTGGAAAGGTCTGCCGCATCATGAAGCCACATGGGAAAGCTATGACGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGGAATGCAATGTTAGACCACCCATCATACACCAATACAGTAGGAGAAGGAACAGGAAAGAGAAGAAGGACGGGAAGTTGGTTATGTAA

Coding sequence (CDS)

ATGAAGAGAAGACATGTCGAATTCGAAGAAGGAGATCAAGTGTTCCTAAAAATTAGGCCATACAGACAGGTATCCTTACGGAGAAAGAGGAATGAGAAGCTATCACCGAAGTATTTCGGGCCTTATATGATAGTGAAGCGAATTGGTCAGGTGGCATATCGGCTGGAACTACCAGCGGCAGCAACAATCCACCCTGTGTTCCATGTGTCACAGTTGAAAAAAGCTTTTGGGGAGAGTGCGAATAGCGAAGAGCTGTTGCCGTTTTTGACTGCAAATCACGAGTGGAAGGCCGTGCCACAAGAGACTTACGGTTATAGAAAAAACGAAACAGGAGGGTGGGAGGTTTTAATGAGTTGGAAAGGTCTGCCGCATCATGAAGCCACATGGGAAAGCTATGACGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGGAATGCAATGTTAGACCACCCATCATACACCAATACAGTAGGAGAAGGAACAGGAAAGAGAAGAAGGACGGGAAGTTGGTTATGTAA
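The protein used as the BLAST query can be recovered by translating the CDS. The following is a minimal sketch that assumes the standard genetic code and reading frame 1; the function name `translate` and the variable `cds_prefix` (the first 48 bases of the CDS above) are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 48 bases of the CDS listed above.
cds_prefix = "ATGAAGAGAAGACATGTCGAATTCGAAGAAGGAGATCAAGTGTTCCTA"
print(translate(cds_prefix))  # MKRRHVEFEEGDQVFL
```

The translated prefix matches the start of the query line in the alignments (`MKRRHVEFEEGDQVFL...`); passing the full CDS string to `translate` would yield the complete protein followed by `*` for the stop codon.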
BLAST of CSPI04G11760 vs. TrEMBL
Match: E2DMZ5_BETVU (Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 1.8e-36
Identity = 88/178 (49.44%), Positives = 113/178 (63.48%), Query Frame = 1

Query: 3    RRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAAT 62
            RR V FE G  V+LKI+PYR  SL +KRNEKL+P+++GP+ ++KRIGQVAY+L+LP  A 
Sbjct: 1379 RRAVHFEPGAMVYLKIQPYRHQSLAKKRNEKLAPRFYGPFSVLKRIGQVAYQLQLPLGAK 1438

Query: 63   IHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYR---KNETGGWEVLMSW 122
            +HPVFH+SQLKKA G   +S  + P LT +    A P+     R   +      EVL+ W
Sbjct: 1439 LHPVFHISQLKKAVGSLQSSPTIPPQLTNDLVLDAQPESLLNIRSHPQKPAEVTEVLIKW 1498

Query: 123  KGLPHHEATWESYDDFQQSFPDFHLEDKVKLDRECNVR-------PPIIHQYSRRRNR 171
              LP  EATWE    F   FPDFHLEDKV L+ E ++        PPI+H YSRRR +
Sbjct: 1499 LNLPAFEATWEDAALFNARFPDFHLEDKV-LNWEGSIAKSPTRIIPPIVHTYSRRRKK 1555
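The percentages in an HSP header are the match counts divided by the alignment length. A small sketch, with the hypothetical helper `hsp_percentages`, recomputes them from the `n/m` pairs in the header line of the hit above:

```python
import re

def hsp_percentages(header):
    """Return {label: percent} for every 'Label = n/m' pair in an HSP header."""
    result = {}
    for label, hits, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", header):
        result[label] = round(100 * int(hits) / int(length), 2)
    return result

header = "Identity = 88/178 (49.44%), Positives = 113/178 (63.48%), Query Frame = 1"
print(hsp_percentages(header))  # {'Identity': 49.44, 'Positives': 63.48}
```

The `Query Frame = 1` field is skipped because it is not an `n/m` ratio.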

BLAST of CSPI04G11760 vs. TrEMBL
Match: A5B2I6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 2.3e-36
Identity = 82/170 (48.24%), Positives = 108/170 (63.53%), Query Frame = 1

Query: 11   GDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAATIHPVFHVS 70
            GD V++K+RPYR  SL ++ NEKLSP+YFGPY +V++IG VAYRL+LP++ +IHPVFHVS
Sbjct: 1886 GDFVYIKLRPYRLRSLAKRPNEKLSPRYFGPYKVVQQIGPVAYRLKLPSSTSIHPVFHVS 1945

Query: 71   QLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKNETGGWEVLMSWKGLPHHEATWE 130
            QLK+A G +   + L P LT + EW   P +                  KGLP  EA+WE
Sbjct: 1946 QLKRALGSADLCQPLSPILTEDLEWLVEPDQ------------------KGLPQFEASWE 2005

Query: 131  SYDDFQQSFPDFHLEDKVKLDRECNVRPPIIHQYSRRRNRKEKKDGKLVM 181
            S D  ++ FPDFHLEDKV L    N RPPI + Y+R+  R     G+ V+
Sbjct: 2006 SVDTIKEHFPDFHLEDKVSLIEGGNDRPPIRYVYNRKGKRNGLPPGRAVL 2037

BLAST of CSPI04G11760 vs. TrEMBL
Match: A0A087GEK8_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.2e-35
Identity = 86/181 (47.51%), Positives = 116/181 (64.09%), Query Frame = 1

Query: 3    RRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAAT 62
            RR VEF+ GD VFLK++PYRQ SL R+ NEKL+ +++GPY ++ R+G VAY+L+LPA + 
Sbjct: 1371 RREVEFKVGDMVFLKLKPYRQQSLARRVNEKLAARFYGPYEVLARVGVVAYQLKLPADSK 1430

Query: 63   IHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKN-ETGGWEVLMSWKG 122
            IH  FHVSQLK A G S     L P LTA +  +A P+   G R N  +G  EVL+ WKG
Sbjct: 1431 IHDTFHVSQLKLAVGSSFQPAALPPHLTAENVLEAEPEAHMGVRINSRSGQQEVLIKWKG 1490

Query: 123  LPHHEATWESYDDFQQSFPDFHLEDK-----VKLDRECNVRPPIIHQYSRRRNRKEKKDG 178
            LP  ++TWE     Q+ FP+F LEDK       +  E + + P++HQY  RR +K  K G
Sbjct: 1491 LPECDSTWEWVGVIQEQFPEFDLEDKALFKAAGIVTEISEKTPLVHQY--RRRKKFGKQG 1549

BLAST of CSPI04G11760 vs. TrEMBL
Match: A0A087H2U0_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA4G124900 PE=4 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.6e-35
Identity = 82/181 (45.30%), Positives = 115/181 (63.54%), Query Frame = 1

Query: 3    RRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAAT 62
            RR VEF+ GD VFLK++PYRQ SL R+ NEKL+ +++GP+ +  R+G V+Y+L+LP +A 
Sbjct: 1217 RREVEFKVGDMVFLKLKPYRQQSLARRVNEKLAARFYGPFEVAARVGAVSYKLKLPPSAK 1276

Query: 63   IHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKN-ETGGWEVLMSWKG 122
            IH  FH+SQLK A G S    EL   L+     +A P+   G+R N ++G  EVL+ WK 
Sbjct: 1277 IHHTFHISQLKLAVGSSFAPSELPAPLSKEGAIEATPEAYMGFRVNKQSGQEEVLIKWKD 1336

Query: 123  LPHHEATWESYDDFQQSFPDFHLEDKV-----KLDRECNVRPPIIHQYSRRRNRKEKKDG 178
            LP  ++TWE     Q+ FP F+LEDKV      +D     + P++HQY RR+NR    +G
Sbjct: 1337 LPDCDSTWEWKGVIQEQFPHFNLEDKVLFKAAGIDTVIREKTPLVHQYRRRKNRMGNSEG 1396

BLAST of CSPI04G11760 vs. TrEMBL
Match: A0A087H2T9_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA4G124900 PE=4 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.6e-35
Identity = 82/181 (45.30%), Positives = 115/181 (63.54%), Query Frame = 1

Query: 3   RRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAAT 62
           RR VEF+ GD VFLK++PYRQ SL R+ NEKL+ +++GP+ +  R+G V+Y+L+LP +A 
Sbjct: 584 RREVEFKVGDMVFLKLKPYRQQSLARRVNEKLAARFYGPFEVAARVGAVSYKLKLPPSAK 643

Query: 63  IHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKN-ETGGWEVLMSWKG 122
           IH  FH+SQLK A G S    EL   L+     +A P+   G+R N ++G  EVL+ WK 
Sbjct: 644 IHHTFHISQLKLAVGSSFAPSELPAPLSKEGAIEATPEAYMGFRVNKQSGQEEVLIKWKD 703

Query: 123 LPHHEATWESYDDFQQSFPDFHLEDKV-----KLDRECNVRPPIIHQYSRRRNRKEKKDG 178
           LP  ++TWE     Q+ FP F+LEDKV      +D     + P++HQY RR+NR    +G
Sbjct: 704 LPDCDSTWEWKGVIQEQFPHFNLEDKVLFKAAGIDTVIREKTPLVHQYRRRKNRMGNSEG 763

BLAST of CSPI04G11760 vs. NCBI nr
Match: gi|659131181|ref|XP_008465551.1| (PREDICTED: uncharacterized protein LOC103503179 [Cucumis melo])

HSP 1 Score: 217.2 bits (552), Expect = 2.3e-53
Identity = 99/140 (70.71%), Positives = 116/140 (82.86%), Query Frame = 1

Query: 1   MKRRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAA 60
           +KRRHVEFEE + V+LKIRPYRQVS+R++RNEKLS KYFGPY I+KRIG VAY+LELP +
Sbjct: 53  LKRRHVEFEEREMVYLKIRPYRQVSMRKRRNEKLSAKYFGPYRILKRIGLVAYKLELPTS 112

Query: 61  ATIHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKNETGGWEVLMSWK 120
           ATIHPVFHVSQLK+AFGE  + +E++P+LT NHEW  +  E YGY KN+ G WEVLMSWK
Sbjct: 113 ATIHPVFHVSQLKRAFGECKDKQEVVPYLTMNHEWLTIHDEVYGYHKNDKGEWEVLMSWK 172

Query: 121 GLPHHEATWESYDDFQQSFP 141
           GLP HEA  E YDDFQQ  P
Sbjct: 173 GLPRHEAMKEPYDDFQQVIP 192

BLAST of CSPI04G11760 vs. NCBI nr
Match: gi|778730254|ref|XP_011659739.1| (PREDICTED: uncharacterized protein LOC105436252 [Cucumis sativus])

HSP 1 Score: 216.9 bits (551), Expect = 3.0e-53
Identity = 103/122 (84.43%), Positives = 111/122 (90.98%), Query Frame = 1

Query: 1   MKRRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAA 60
           MKRRHVEFE GD+VFLKIRPYRQ SLR+K NEKLSPKYFGPY IVKRIG V YRLELP A
Sbjct: 262 MKRRHVEFE-GDKVFLKIRPYRQASLRKKTNEKLSPKYFGPYKIVKRIGPVTYRLELPTA 321

Query: 61  ATIHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKNETGGWEVLMSWK 120
           AT HPVFH+SQLK+AFGESAN++ELLPFLTANHEWKAV QE +GY+KNE GGWEVLMSWK
Sbjct: 322 ATTHPVFHISQLKRAFGESANNDELLPFLTANHEWKAVTQEVFGYQKNEKGGWEVLMSWK 381

Query: 121 GL 123
           GL
Sbjct: 382 GL 382

BLAST of CSPI04G11760 vs. NCBI nr
Match: gi|659072240|ref|XP_008464306.1| (PREDICTED: uncharacterized protein LOC103502220 [Cucumis melo])

HSP 1 Score: 206.5 bits (524), Expect = 4.0e-50
Identity = 101/170 (59.41%), Positives = 119/170 (70.00%), Query Frame = 1

Query: 2   KRRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAA 61
           KRR VE+  GD+VFLKIRPYRQ+SLRRKRNEKLS KYFGPY I++RI  VAY+LEL    
Sbjct: 60  KRRDVEYAVGDRVFLKIRPYRQLSLRRKRNEKLSAKYFGPYKILERIVPVAYKLELLEGT 119

Query: 62  TIHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKNETGGWEVLMSWKG 121
            IHPVFHVSQLKK  GE  N +  +  L  N  W   P ET  YR+N+   WEV++ W G
Sbjct: 120 LIHPVFHVSQLKKLVGEYINVQPTVQQLDENFVWTTHPVETLDYRQNKAKEWEVMIQWDG 179

Query: 122 LPHHEATWESYDDFQQSFPDFHLEDKVKLDRECNVRPPIIHQYSRRRNRK 172
           L  HEATWE Y+D    +P+FHLEDK  L+   NVRPPI+ QYSR+  RK
Sbjct: 180 LSTHEATWEQYNDIADKYPNFHLEDKASLEWRSNVRPPILFQYSRKNKRK 229

BLAST of CSPI04G11760 vs. NCBI nr
Match: gi|659118736|ref|XP_008459277.1| (PREDICTED: uncharacterized protein LOC103498453 [Cucumis melo])

HSP 1 Score: 177.2 bits (448), Expect = 2.6e-41
Identity = 88/160 (55.00%), Positives = 108/160 (67.50%), Query Frame = 1

Query: 2   KRRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAAA 61
           +RR VE+E  D VF KIRPYRQ+SLRRKRNEKLS KYFGPY I++RIG VAY+LELP   
Sbjct: 60  RRRDVEYEVRDLVFFKIRPYRQLSLRRKRNEKLSAKYFGPYKILERIGPVAYKLELPEGT 119

Query: 62  TIHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKNETGGWEVLMSWKG 121
            IHPVFHVSQLKK  GE  + +  +  L  +  W   P E   YR+N+ G WEV++ W G
Sbjct: 120 LIHPVFHVSQLKKLVGEHIDIQPTVQQLDESFVWTTHPVEALDYRQNKAGEWEVMIRWDG 179

Query: 122 LPHHEATWESYDDFQQSFPDFHLEDK-VKLDRECNVRPPI 161
           L  HEATWE Y D    + +FHLEDK +K + E  +  P+
Sbjct: 180 LSSHEATWEQYADISGKYSNFHLEDKGLKTNHEKAIVHPL 219

BLAST of CSPI04G11760 vs. NCBI nr
Match: gi|659094491|ref|XP_008448087.1| (PREDICTED: uncharacterized protein LOC103490375 [Cucumis melo])

HSP 1 Score: 164.9 bits (416), Expect = 1.4e-37
Identity = 85/172 (49.42%), Positives = 114/172 (66.28%), Query Frame = 1

Query: 1    MKRRHVEFEEGDQVFLKIRPYRQVSLRRKRNEKLSPKYFGPYMIVKRIGQVAYRLELPAA 60
            +KRR ++ + G++V+LK++PYRQ SL RK++EKL+P+Y+GPY I++ IG VAYRL+LP  
Sbjct: 965  LKRRELKLKVGEEVYLKLKPYRQRSLARKKSEKLAPRYYGPYKIIEEIGAVAYRLDLPPE 1024

Query: 61   ATIHPVFHVSQLKKAFGESANSEELLPFLTANHEWKAVPQETYGYRKN-ETGGWEVLMSW 120
            A IH VFH+SQLK   G     +     LT N E +  P+   G R N E G  E L+ W
Sbjct: 1025 AAIHNVFHISQLKPKLGAQQVVQHQHLMLTENFELQLQPENVLGIRWNKELGANEWLIKW 1084

Query: 121  KGLPHHEATWESYDDFQQSFPDFHLEDKVKLDRECNVRPPIIHQYSRRRNRK 172
            +GL   +ATWES     Q FP FHLEDKV ++    VRPPI+H Y +RR+RK
Sbjct: 1085 QGLQESDATWESVYRMNQQFPSFHLEDKVNVEPRGIVRPPILHIY-KRRDRK 1135

The following BLAST results are available for this feature:
TrEMBL:
Match Name          E-value    Identity    Description
E2DMZ5_BETVU        1.8e-36    49.44       Putative uncharacterized protein OS=Beta vulgaris PE=4 SV=1
A5B2I6_VITVI        2.3e-36    48.24       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1
A0A087GEK8_ARAAL    1.2e-35    47.51       Uncharacterized protein OS=Arabis alpina GN=AALP_AA8G499800 PE=4 SV=1
A0A087H2U0_ARAAL    2.6e-35    45.30       Uncharacterized protein OS=Arabis alpina GN=AALP_AA4G124900 PE=4 SV=1
A0A087H2T9_ARAAL    2.6e-35    45.30       Uncharacterized protein OS=Arabis alpina GN=AALP_AA4G124900 PE=4 SV=1

NCBI nr:
Match Name                          E-value    Identity    Description
gi|659131181|ref|XP_008465551.1|    2.3e-53    70.71       PREDICTED: uncharacterized protein LOC103503179 [Cucumis melo]
gi|778730254|ref|XP_011659739.1|    3.0e-53    84.43       PREDICTED: uncharacterized protein LOC105436252 [Cucumis sativus]
gi|659072240|ref|XP_008464306.1|    4.0e-50    59.41       PREDICTED: uncharacterized protein LOC103502220 [Cucumis melo]
gi|659118736|ref|XP_008459277.1|    2.6e-41    55.00       PREDICTED: uncharacterized protein LOC103498453 [Cucumis melo]
gi|659094491|ref|XP_008448087.1|    1.4e-37    49.42       PREDICTED: uncharacterized protein LOC103490375 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR000953    Chromo/chromo_shadow_dom
IPR016197    Chromo-like_dom_sf
IPR023780    Chromo_domain
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043170 macromolecule metabolic process
biological_process GO:0044238 primary metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI04G11760.1    CSPI04G11760.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term     IPR Description                Source     Source Term         Source Description     Alignment
IPR000953    Chromo/chromo shadow domain    PROFILE    PS50013             CHROMO_2               coord: 97..176; score: 9
IPR016197    Chromo domain-like             unknown    SSF54160            Chromo domain-like     coord: 61..138; score: 8.44
IPR023780    Chromo domain                  PFAM       PF00385             Chromo                 coord: 106..141; score: 5.
None         No IPR available               GENE3D     G3DSA:2.40.50.40    -                      coord: 108..135; score: 1.
None         No IPR available               PANTHER    PTHR24559           FAMILY NOT NAMED       coord: 1..142; score: 4.5
None         No IPR available               PANTHER    PTHR24559:SF186     SUBFAMILY NOT NAMED    coord: 1..142; score: 4.5