CSPI05G18890 (gene) Wild cucumber (PI 183967)

Name: CSPI05G18890
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr5 : 20060723 .. 20061184 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
ATATCATTAGTCGTCTTCTCCCCCATATTCTTTTATTACGTCTCTTGCTTCCACATCTATTCCTAACACTGTTTAGGAAACTTTATCTCATCCTGTTTGGCATAGTGCAATGATTGAGGAGATGACTGTTTTAGATGATAATGGTACTTGGGATTTAGTATCTTGCTTTGCAGAAAAGAAGGCCATTGGATGTAAATGGGTGTTCGCTATCAAGGTTGATCATGATGGAGCAATGCCTTGGTTGAGTGTTGTGTTTGTCTTGTTGCGAAAGGTTATGTCCAAATCTATGAAATTGATTATTCAAATACATTTTCTCCCGACTTTTTTTTTCATGGCTGCTACCCATGATTGGCCTTTACATGAGTTTGACATACATGAGCTTGACATCAAGAATGCTTTTCTACATGGTGATCTTTAGGAGGAGGTTTATATGGAGTAGCCACCTTTGTTGCTTAGGGGGAG

mRNA sequence

ATGATTGAGGAGATGACTGTTTTAGATGATAATGGTACTTGGGATTTAGTATCTTGCTTTGCAGAAAAGAAGGCCATTGGATGTAAATGGGTGTTCGCTATCAAGGTTGATCATGATGGAGCAATGCCTTGGTTGAGTGTTGTGTTTGTCTTGTTGCGAAAGGTTATGTCCAAATCTATGAAATTGATTATTCAAATACATTTTCTCCCGACTTTTTTTTTCATGGCTGCTACCCATGATTGGCCTTTACATGAGTTTGACATACATGAGCTTGACATCAAGAATGCTTTTCTACATGGTGATCTTTAG

Coding sequence (CDS)

ATGATTGAGGAGATGACTGTTTTAGATGATAATGGTACTTGGGATTTAGTATCTTGCTTTGCAGAAAAGAAGGCCATTGGATGTAAATGGGTGTTCGCTATCAAGGTTGATCATGATGGAGCAATGCCTTGGTTGAGTGTTGTGTTTGTCTTGTTGCGAAAGGTTATGTCCAAATCTATGAAATTGATTATTCAAATACATTTTCTCCCGACTTTTTTTTTCATGGCTGCTACCCATGATTGGCCTTTACATGAGTTTGACATACATGAGCTTGACATCAAGAATGCTTTTCTACATGGTGATCTTTAG
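
The Query lines in the BLAST alignments below show the protein encoded by this CDS. As a minimal sketch of how that translation can be reproduced, assuming the standard genetic code (NCBI translation table 1) and using a hypothetical helper named `translate`, the first 14 codons of the CDS can be translated like this:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Codons containing unexpected characters (for example N) become 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS listed above.
cds_prefix = "ATGATTGAGGAGATGACTGTTTTAGATGATAATGGTACTTGG"
print(translate(cds_prefix))  # prints MIEEMTVLDDNGTW
```

The printed residues match the start of the Query sequence in the alignments; passing the full CDS string instead of the prefix would give the complete protein.
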
BLAST of CSPI05G18890 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 7.8e-06
Identity = 36/108 (33.33%), Positives = 54/108 (50.00%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLS---VVFVLLRKV-- 60
           M EEM  L  NGT+ LV     K+ + CKWVF +K D D  +       VV    +K   
Sbjct: 830 MQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGI 889

Query: 61  -MSKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
              +    ++++  + T   +AA+ D      ++ +LD+K AFLHGDL
Sbjct: 890 DFDEIFSPVVKMTSIRTILSLAASLD-----LEVEQLDVKTAFLHGDL 932
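
The percentages reported for each HSP follow directly from the counts given beside them. A minimal sketch, assuming (as in standard BLAST output) that the denominator is the alignment length including gap columns, with a hypothetical helper named `percent`:

```python
def percent(matched, alignment_length):
    """Return matched / alignment_length as a percentage, rounded to 2 places."""
    return round(100.0 * matched / alignment_length, 2)


# Counts reported for HSP 1 of the Swiss-Prot hit POLX_TOBAC.
identity = percent(36, 108)   # identical positions
positives = percent(54, 108)  # identical or similar positions
print(identity, positives)
```

The same calculation applies to every HSP in this report; only the counts change.
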

BLAST of CSPI05G18890 vs. TrEMBL
Match: A0A061ECC7_THECC (Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 OS=Theobroma cacao GN=TCM_009017 PE=4 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 5.6e-19
Identity = 55/108 (50.93%), Positives = 68/108 (62.96%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
           M+EEM  LD NGTWDLV   A KKAIGCKWVFA+KV+ DG++  L   FV          
Sbjct: 276 MVEEMVALDGNGTWDLVDLPAGKKAIGCKWVFAVKVNSDGSVARLKGRFVAKGYAQTYGV 335

Query: 61  --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
             S +   + +++ +  F  MAAT+DWPL     H+LDIKNAFLHGDL
Sbjct: 336 DYSDTFSPVAKLNSVRLFISMAATYDWPL-----HQLDIKNAFLHGDL 378

BLAST of CSPI05G18890 vs. TrEMBL
Match: A0A061EJR6_THECC (Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 OS=Theobroma cacao GN=TCM_020052 PE=4 SV=1)

HSP 1 Score: 93.2 bits (230), Expect = 2.0e-16
Identity = 51/108 (47.22%), Positives = 62/108 (57.41%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
           ++EEM  LD NGTWD V   A KK IGCKWVFA+KV+ DG+M  L    V          
Sbjct: 357 IVEEMMALDGNGTWDSVDLLAGKKVIGCKWVFAVKVNPDGSMARLKARLVAKGYAQTYGV 416

Query: 61  --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
             S +   + ++  +  F  M AT DWPL     H+LDIKNAFLHGDL
Sbjct: 417 DYSNTFSPVAKLTSVRLFISMVATCDWPL-----HQLDIKNAFLHGDL 459

BLAST of CSPI05G18890 vs. TrEMBL
Match: A0A061EWC9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_024268 PE=4 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 1.7e-15
Identity = 51/110 (46.36%), Positives = 66/110 (60.00%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVMSKSM 60
           M+EEM  LD N TWD V   A KKAIGCKWV A+KVD +G+      V  L+ K  +++ 
Sbjct: 368 MVEEMVALDGNCTWDSVDLPAGKKAIGCKWVLAVKVDPNGS------VASLVAKGYAQTY 427

Query: 61  KL--------IIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
            +        + ++ F+  F  M AT+DWPL     H+LDIKNAFLHGDL
Sbjct: 428 SIDYFVTFSPVAKLTFVRLFISMVATYDWPL-----HQLDIKNAFLHGDL 466

BLAST of CSPI05G18890 vs. TrEMBL
Match: A0A0L9U7K1_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g217200 PE=4 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 3.5e-13
Identity = 47/108 (43.52%), Positives = 59/108 (54.63%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
           MI+EM  LD +GTWDLV    +KK +GC+WV+AIKV   GA+  L    V          
Sbjct: 622 MIDEMQALDKSGTWDLVPLPPDKKPVGCRWVYAIKVGPTGAIDRLKARLVAKGYTQVYGL 681

Query: 61  --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
               +   + +I  +  F  MAA   WPL     H+LDIKNAFLHGDL
Sbjct: 682 DYCDTFSPVAKISSVRLFLAMAAIRQWPL-----HQLDIKNAFLHGDL 724

BLAST of CSPI05G18890 vs. TrEMBL
Match: Q6L3Q0_SOLDE (Polyprotein, putative OS=Solanum demissum GN=SDM1_42t00018 PE=4 SV=2)

HSP 1 Score: 82.4 bits (202), Expect = 3.5e-13
Identity = 46/108 (42.59%), Positives = 62/108 (57.41%), Query Frame = 1

Query: 1    MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
            M++E+  LDDN TW+LV     KKA+GCKWVF IKV+ DG+M  L    V          
Sbjct: 930  MLDEIHALDDNHTWNLVDLPKGKKAVGCKWVFTIKVNPDGSMARLKARLVAKGYAQTYGV 989

Query: 61   --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
              S +   + ++  +  F  +AA+ +WPL     H+L IKNAFLHGDL
Sbjct: 990  DYSDTFSPVAKLTSVRLFISLAASQNWPL-----HQLAIKNAFLHGDL 1032

BLAST of CSPI05G18890 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 57.0 bits (136), Expect = 8.0e-09
Identity = 34/109 (31.19%), Positives = 54/109 (49.54%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVMSKSM 60
           M +E+  ++   TW++ +    KK IGCKWV+ IK + DG +           ++++K  
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKA------RLVAKGY 161

Query: 61  KLIIQIHFLPTFFFMAATHDWPL-------HEFDIHELDIKNAFLHGDL 103
                I F+ TF  +       L       + F +H+LDI NAFL+GDL
Sbjct: 162 TQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204

BLAST of CSPI05G18890 vs. NCBI nr
Match: gi|590692641|ref|XP_007044112.1| (Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao])

HSP 1 Score: 101.7 bits (252), Expect = 8.0e-19
Identity = 55/108 (50.93%), Positives = 68/108 (62.96%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
           M+EEM  LD NGTWDLV   A KKAIGCKWVFA+KV+ DG++  L   FV          
Sbjct: 276 MVEEMVALDGNGTWDLVDLPAGKKAIGCKWVFAVKVNSDGSVARLKGRFVAKGYAQTYGV 335

Query: 61  --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
             S +   + +++ +  F  MAAT+DWPL     H+LDIKNAFLHGDL
Sbjct: 336 DYSDTFSPVAKLNSVRLFISMAATYDWPL-----HQLDIKNAFLHGDL 378

BLAST of CSPI05G18890 vs. NCBI nr
Match: gi|590655404|ref|XP_007033977.1| (Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao])

HSP 1 Score: 93.2 bits (230), Expect = 2.8e-16
Identity = 51/108 (47.22%), Positives = 62/108 (57.41%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
           ++EEM  LD NGTWD V   A KK IGCKWVFA+KV+ DG+M  L    V          
Sbjct: 357 IVEEMMALDGNGTWDSVDLLAGKKVIGCKWVFAVKVNPDGSMARLKARLVAKGYAQTYGV 416

Query: 61  --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
             S +   + ++  +  F  M AT DWPL     H+LDIKNAFLHGDL
Sbjct: 417 DYSNTFSPVAKLTSVRLFISMVATCDWPL-----HQLDIKNAFLHGDL 459

BLAST of CSPI05G18890 vs. NCBI nr
Match: gi|590634785|ref|XP_007028466.1| (Uncharacterized protein TCM_024268 [Theobroma cacao])

HSP 1 Score: 90.1 bits (222), Expect = 2.4e-15
Identity = 51/110 (46.36%), Positives = 66/110 (60.00%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVMSKSM 60
           M+EEM  LD N TWD V   A KKAIGCKWV A+KVD +G+      V  L+ K  +++ 
Sbjct: 368 MVEEMVALDGNCTWDSVDLPAGKKAIGCKWVLAVKVDPNGS------VASLVAKGYAQTY 427

Query: 61  KL--------IIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
            +        + ++ F+  F  M AT+DWPL     H+LDIKNAFLHGDL
Sbjct: 428 SIDYFVTFSPVAKLTFVRLFISMVATYDWPL-----HQLDIKNAFLHGDL 466

BLAST of CSPI05G18890 vs. NCBI nr
Match: gi|720026103|ref|XP_010264152.1| (PREDICTED: uncharacterized protein LOC104602236 [Nelumbo nucifera])

HSP 1 Score: 89.0 bits (219), Expect = 5.4e-15
Identity = 48/114 (42.11%), Positives = 66/114 (57.89%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVMSKSM 60
           M+EEM  L +NGTWDLV   A KKA+GCKWV A+K ++DG++  L        ++++K  
Sbjct: 226 MLEEMNALTENGTWDLVDLPASKKAVGCKWVLAVKFNYDGSVARLKA------RLVAKGY 285

Query: 61  KLIIQIHFLPTFF------------FMAATHDWPLHEFDIHELDIKNAFLHGDL 103
                + +  TFF             +AAT+DWPL     ++LDIKNAFLHG L
Sbjct: 286 AQTYGVDYFDTFFPVAKLTSVRLFISLAATYDWPL-----YQLDIKNAFLHGVL 328

BLAST of CSPI05G18890 vs. NCBI nr
Match: gi|659121366|ref|XP_008460624.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103499401 [Cucumis melo])

HSP 1 Score: 88.6 bits (218), Expect = 7.0e-15
Identity = 52/108 (48.15%), Positives = 64/108 (59.26%), Query Frame = 1

Query: 1   MIEEMTVLDDNGTWDLVSCFAEKKAIGCKWVFAIKVDHDGAMPWLSVVFVLLRKVM---- 60
           MIEEMT  DDNGT DLVS  A KKAIGCK VF++KV+ DG +  L    V          
Sbjct: 436 MIEEMTAFDDNGTGDLVSXPAGKKAIGCKLVFSLKVNPDGTVARLKARLVAKDYAQTYGI 495

Query: 61  --SKSMKLIIQIHFLPTFFFMAATHDWPLHEFDIHELDIKNAFLHGDL 103
             S +   + ++  +  F  MAATH+W L     H+LDIKN+FLHGDL
Sbjct: 496 DYSDTFSPVAKLTSIRLFLSMAATHNWSL-----HQLDIKNSFLHGDL 538

The following BLAST results are available for this feature:

Database: Swiss-Prot
Match Name    E-value    Identity (%)    Description
POLX_TOBAC    7.8e-06    33.33           Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...

Database: TrEMBL
Match Name          E-value    Identity (%)    Description
A0A061ECC7_THECC    5.6e-19    50.93           Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 OS=Theobroma cacao GN=TCM_009...
A0A061EJR6_THECC    2.0e-16    47.22           Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 OS=Theobroma cacao GN=TCM_020...
A0A061EWC9_THECC    1.7e-15    46.36           Uncharacterized protein OS=Theobroma cacao GN=TCM_024268 PE=4 SV=1
A0A0L9U7K1_PHAAN    3.5e-13    43.52           Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g217200 PE=4 SV=1
Q6L3Q0_SOLDE        3.5e-13    42.59           Polyprotein, putative OS=Solanum demissum GN=SDM1_42t00018 PE=4 SV=2

Database: TAIR10
Match Name     E-value    Identity (%)    Description
AT4G23160.1    8.0e-09    31.19           cysteine-rich RLK (RECEPTOR-like protein kinase) 8

Database: NCBI nr
Match Name                          E-value    Identity (%)    Description
gi|590692641|ref|XP_007044112.1|    8.0e-19    50.93           Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao]
gi|590655404|ref|XP_007033977.1|    2.8e-16    47.22           Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao]
gi|590634785|ref|XP_007028466.1|    2.4e-15    46.36           Uncharacterized protein TCM_024268 [Theobroma cacao]
gi|720026103|ref|XP_010264152.1|    5.4e-15    42.11           PREDICTED: uncharacterized protein LOC104602236 [Nelumbo nucifera]
gi|659121366|ref|XP_008460624.1|    7.0e-15    48.15           PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103499401 [Cucumis me...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term Definition
IPR013103 RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI05G18890.1    CSPI05G18890.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term     IPR Description                                        Source     Source Term        Source Description                 Alignment
IPR013103    Reverse transcriptase, RNA-dependent DNA polymerase    PFAM       PF07727            RVT_2                              coord: 11..102; score: 5.
None         No IPR available                                       PANTHER    PTHR11439          GAG-POL-RELATED RETROTRANSPOSON    coord: 1..102; score: 2.4
None         No IPR available                                       PANTHER    PTHR11439:SF163    SUBFAMILY NOT NAMED                coord: 1..102; score: 2.4

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None