Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAATGCGAAGGATCTTCGAGGAGAAGATGAAACGAAAAACGAAGAGAAGAAACCGAATGACGAAGGGAAAAATGACCGAAACAAATTCAAGAAGGTGGAGATGCCGATATTCAACAGAGATGATCTCGATTCGTGGTTATTTCTTGCTGAGAGGTATTTTCATATCCATAGACTCATTGAATCTGAGAAAATGACAATTTCTACTATAAGTTTCGAAGGACCAGCGCTGAATTGGTTTCGTTCTCAAGAGGAACGGGAGAAGTTTGTTGATTGGGCGAATATGAAGGAGAGGTTGTTAGAGAGATTCCGTTCATCGAGAGAAGGATCCTTGTATGGGCGATTTTTGCGTATCCAACAAACAACAATTGTGGATGAATATCAAAATTTATTCGATAAGTGGGTATCACCACTAACTGATTTACCTGAAAAAGTAGTAGAAGAGACGTTTGTTTCGGGATTGAAACCATGGATTCAAGCAGAGATGGACTTTTGCGAACCGAAAGGTTTAGCCCATATGATGAAGATAGCGTAG
mRNA sequence
ATGAATGCGAAGGATCTTCGAGGAGAAGATGAAACGAAAAACGAAGAGAAGAAACCGAATGACGAAGGGAAAAATGACCGAAACAAATTCAAGAAGGTGGAGATGCCGATATTCAACAGAGATGATCTCGATTCGTGGTTATTTCTTGCTGAGAGGTATTTTCATATCCATAGACTCATTGAATCTGAGAAAATGACAATTTCTACTATAAGTTTCGAAGGACCAGCGCTGAATTGGTTTCGTTCTCAAGAGGAACGGGAGAAGTTTGTTGATTGGGCGAATATGAAGGAGAGGTTGTTAGAGAGATTCCGTTCATCGAGAGAAGGATCCTTGTATGGGCGATTTTTGCGTATCCAACAAACAACAATTGTGGATGAATATCAAAATTTATTCGATAAGTGGGTATCACCACTAACTGATTTACCTGAAAAAGTAGTAGAAGAGACGTTTGTTTCGGGATTGAAACCATGGATTCAAGCAGAGATGGACTTTTGCGAACCGAAAGGTTTAGCCCATATGATGAAGATAGCGTAG
Coding sequence (CDS)
ATGAATGCGAAGGATCTTCGAGGAGAAGATGAAACGAAAAACGAAGAGAAGAAACCGAATGACGAAGGGAAAAATGACCGAAACAAATTCAAGAAGGTGGAGATGCCGATATTCAACAGAGATGATCTCGATTCGTGGTTATTTCTTGCTGAGAGGTATTTTCATATCCATAGACTCATTGAATCTGAGAAAATGACAATTTCTACTATAAGTTTCGAAGGACCAGCGCTGAATTGGTTTCGTTCTCAAGAGGAACGGGAGAAGTTTGTTGATTGGGCGAATATGAAGGAGAGGTTGTTAGAGAGATTCCGTTCATCGAGAGAAGGATCCTTGTATGGGCGATTTTTGCGTATCCAACAAACAACAATTGTGGATGAATATCAAAATTTATTCGATAAGTGGGTATCACCACTAACTGATTTACCTGAAAAAGTAGTAGAAGAGACGTTTGTTTCGGGATTGAAACCATGGATTCAAGCAGAGATGGACTTTTGCGAACCGAAAGGTTTAGCCCATATGATGAAGATAGCGTAG
Protein sequence
MNAKDLRGEDETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA*
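As a quick consistency check, the protein sequence above can be reproduced by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, not the pipeline that produced this page: the function name `translate` and the choice of the standard codon table (NCBI translation table 1) are assumptions, and only the first ten and last six codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# over the bases T, C, A, G (first base varies slowest, third base fastest).
# The use of table 1 is an assumption; the page does not state it.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons and last six codons of the CDS listed above.
cds_start = "ATGAATGCGAAGGATCTTCGAGGAGAAGAT"
cds_end = "ATGATGAAGATAGCGTAG"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MNAKDLRGED and MMKIA*, matching the start and end of the protein sequence above.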
Homology
BLAST of CSPI01G21520 vs. ExPASy TrEMBL
Match:
A0A5A7SZK8 (Transposon Tf2-1 polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold112G00030 PE=4 SV=1)
HSP 1 Score: 239.2 bits (609), Expect = 1.3e-59
Identity = 111/164 (67.68%), Positives = 141/164 (85.98%), Query Frame = 0
Query: 14 NEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFE 73
NE +K +E NDR+KFKKVEMP+FN +D D+WLF A+RYF IHRL +SEKMTI+TISFE
Sbjct: 30 NENEKNGEEKNNDRSKFKKVEMPVFNGEDPDAWLFRADRYFQIHRLTDSEKMTIATISFE 89
Query: 74 GPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNLFDK 133
GPALNW+R+QEER+KF DWAN+KERLL RFRSSREGS+Y +FLRIQQ + V+EYQN FD+
Sbjct: 90 GPALNWYRAQEERDKFKDWANLKERLLVRFRSSREGSIYLQFLRIQQESSVEEYQNKFDR 149
Query: 134 WVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
++P++DLP++V+EETF+ GL PWI+AE++FC P GLA MM +A
Sbjct: 150 LMAPVSDLPDRVIEETFMGGLFPWIKAEVEFCRPTGLAEMMLLA 193
BLAST of CSPI01G21520 vs. ExPASy TrEMBL
Match:
A0A5A7VAG8 (Transposon Ty3-I Gag-Pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold79G001800 PE=4 SV=1)
HSP 1 Score: 236.9 bits (603), Expect = 6.6e-59
Identity = 114/174 (65.52%), Positives = 146/174 (83.91%), Query Frame = 0
Query: 5 DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESE 64
D+ G E +NE K ND+ DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESE
Sbjct: 124 DIEGIAENGRNERKTENDDTTTDRSKFKKVEMPVFSGEDPDSWLFQAERYFQIHKLIESE 183
Query: 65 KMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTI 124
KM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RFRSSR+G+L G+FLRI+Q T
Sbjct: 184 KMLVSTISFDGPALNWYRSQEERDKFLSWANLKERLLIRFRSSRDGTLLGKFLRIKQETT 243
Query: 125 VDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC PKGL+ MM++A
Sbjct: 244 VEEYRNLFDKLVAPLSEVQEDVVEDTFMNGLLPWIRAEVAFCHPKGLSEMMQVA 297
BLAST of CSPI01G21520 vs. ExPASy TrEMBL
Match:
A0A5D3BPU7 (Transposon Ty3-I Gag-Pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2044G00410 PE=4 SV=1)
HSP 1 Score: 233.4 bits (594), Expect = 7.3e-58
Identity = 113/174 (64.94%), Positives = 145/174 (83.33%), Query Frame = 0
Query: 5 DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESE 64
D+ G E +NE K ND+ DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESE
Sbjct: 124 DIEGIAENGRNERKTENDDTTTDRSKFKKVEMPVFSGEDPDSWLFQAERYFQIHKLIESE 183
Query: 65 KMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTI 124
KM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RFRSSR+G+L G+FLRI+Q T
Sbjct: 184 KMLVSTISFDGPALNWYRSQEERDKFLSWANLKERLLIRFRSSRDGTLLGKFLRIKQETT 243
Query: 125 VDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC KGL+ MM++A
Sbjct: 244 VEEYRNLFDKLVAPLSEVQEDVVEDTFMNGLLPWIRAEVAFCHRKGLSEMMQVA 297
BLAST of CSPI01G21520 vs. ExPASy TrEMBL
Match:
A0A5D3BEL2 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold10G00340 PE=4 SV=1)
HSP 1 Score: 232.6 bits (592), Expect = 1.2e-57
Identity = 111/167 (66.47%), Positives = 137/167 (82.04%), Query Frame = 0
Query: 11 ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTI 70
E N +K DE NDR+KFKKVEMP+F +D +SWLF AERYF IH+L ESEKM +STI
Sbjct: 98 EEINTKKNEPDENSNDRSKFKKVEMPVFTGEDPESWLFRAERYFQIHKLTESEKMLVSTI 157
Query: 71 SFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNL 130
F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+ +GRFLRIQQ T V+EY+NL
Sbjct: 158 CFDGPALNWYRAQEEREKFVSWTNLKERLLIRFQSTREGTAFGRFLRIQQETTVEEYRNL 217
Query: 131 FDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
FDK V+PL+D+ ++VVEETF+SGL PWI+AE+ C PKGLA MM+ A
Sbjct: 218 FDKLVAPLSDVEDRVVEETFMSGLFPWIRAEVILCRPKGLAEMMRTA 264
BLAST of CSPI01G21520 vs. ExPASy TrEMBL
Match:
A0A5D3DC20 (Transposon Tf2-1 polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold392G00900 PE=4 SV=1)
HSP 1 Score: 232.6 bits (592), Expect = 1.2e-57
Identity = 111/167 (66.47%), Positives = 137/167 (82.04%), Query Frame = 0
Query: 11 ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTI 70
E N +K DE NDR+KFKKVEMP+F +D +SWLF AERYF IH+L ESEKM +STI
Sbjct: 98 EEINTKKNEPDENSNDRSKFKKVEMPVFTGEDPESWLFRAERYFQIHKLTESEKMLVSTI 157
Query: 71 SFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNL 130
F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+ +GRFLRIQQ T V+EY+NL
Sbjct: 158 CFDGPALNWYRAQEEREKFVSWTNLKERLLIRFQSTREGTAFGRFLRIQQETTVEEYRNL 217
Query: 131 FDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
FDK V+PL+D+ ++VVEETF+SGL PWI+AE+ C PKGLA MM+ A
Sbjct: 218 FDKLVAPLSDVEDRVVEETFMSGLFPWIRAEVILCRPKGLAEMMRTA 264
BLAST of CSPI01G21520 vs. NCBI nr
Match:
KAA0036018.1 (transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 239.2 bits (609), Expect = 2.7e-59
Identity = 111/164 (67.68%), Positives = 141/164 (85.98%), Query Frame = 0
Query: 14 NEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFE 73
NE +K +E NDR+KFKKVEMP+FN +D D+WLF A+RYF IHRL +SEKMTI+TISFE
Sbjct: 30 NENEKNGEEKNNDRSKFKKVEMPVFNGEDPDAWLFRADRYFQIHRLTDSEKMTIATISFE 89
Query: 74 GPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNLFDK 133
GPALNW+R+QEER+KF DWAN+KERLL RFRSSREGS+Y +FLRIQQ + V+EYQN FD+
Sbjct: 90 GPALNWYRAQEERDKFKDWANLKERLLVRFRSSREGSIYLQFLRIQQESSVEEYQNKFDR 149
Query: 134 WVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
++P++DLP++V+EETF+ GL PWI+AE++FC P GLA MM +A
Sbjct: 150 LMAPVSDLPDRVIEETFMGGLFPWIKAEVEFCRPTGLAEMMLLA 193
BLAST of CSPI01G21520 vs. NCBI nr
Match:
KAA0062661.1 (Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa])
HSP 1 Score: 236.9 bits (603), Expect = 1.4e-58
Identity = 114/174 (65.52%), Positives = 146/174 (83.91%), Query Frame = 0
Query: 5 DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESE 64
D+ G E +NE K ND+ DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESE
Sbjct: 124 DIEGIAENGRNERKTENDDTTTDRSKFKKVEMPVFSGEDPDSWLFQAERYFQIHKLIESE 183
Query: 65 KMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTI 124
KM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RFRSSR+G+L G+FLRI+Q T
Sbjct: 184 KMLVSTISFDGPALNWYRSQEERDKFLSWANLKERLLIRFRSSRDGTLLGKFLRIKQETT 243
Query: 125 VDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC PKGL+ MM++A
Sbjct: 244 VEEYRNLFDKLVAPLSEVQEDVVEDTFMNGLLPWIRAEVAFCHPKGLSEMMQVA 297
BLAST of CSPI01G21520 vs. NCBI nr
Match:
TYK01195.1 (Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa])
HSP 1 Score: 233.4 bits (594), Expect = 1.5e-57
Identity = 113/174 (64.94%), Positives = 145/174 (83.33%), Query Frame = 0
Query: 5 DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESE 64
D+ G E +NE K ND+ DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESE
Sbjct: 124 DIEGIAENGRNERKTENDDTTTDRSKFKKVEMPVFSGEDPDSWLFQAERYFQIHKLIESE 183
Query: 65 KMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTI 124
KM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RFRSSR+G+L G+FLRI+Q T
Sbjct: 184 KMLVSTISFDGPALNWYRSQEERDKFLSWANLKERLLIRFRSSRDGTLLGKFLRIKQETT 243
Query: 125 VDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC KGL+ MM++A
Sbjct: 244 VEEYRNLFDKLVAPLSEVQEDVVEDTFMNGLLPWIRAEVAFCHRKGLSEMMQVA 297
BLAST of CSPI01G21520 vs. NCBI nr
Match:
TYJ96875.1 (Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] >TYK19540.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa])
HSP 1 Score: 232.6 bits (592), Expect = 2.6e-57
Identity = 111/167 (66.47%), Positives = 137/167 (82.04%), Query Frame = 0
Query: 11 ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTI 70
E N +K DE NDR+KFKKVEMP+F +D +SWLF AERYF IH+L ESEKM +STI
Sbjct: 98 EEINTKKNEPDENSNDRSKFKKVEMPVFTGEDPESWLFRAERYFQIHKLTESEKMLVSTI 157
Query: 71 SFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNL 130
F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+ +GRFLRIQQ T V+EY+NL
Sbjct: 158 CFDGPALNWYRAQEEREKFVSWTNLKERLLIRFQSTREGTAFGRFLRIQQETTVEEYRNL 217
Query: 131 FDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
FDK V+PL+D+ ++VVEETF+SGL PWI+AE+ C PKGLA MM+ A
Sbjct: 218 FDKLVAPLSDVEDRVVEETFMSGLFPWIRAEVILCRPKGLAEMMRTA 264
BLAST of CSPI01G21520 vs. NCBI nr
Match:
TYK21115.1 (transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 232.6 bits (592), Expect = 2.6e-57
Identity = 111/167 (66.47%), Positives = 137/167 (82.04%), Query Frame = 0
Query: 11 ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTI 70
E N +K DE NDR+KFKKVEMP+F +D +SWLF AERYF IH+L ESEKM +STI
Sbjct: 98 EEINTKKNEPDENSNDRSKFKKVEMPVFTGEDPESWLFRAERYFQIHKLTESEKMLVSTI 157
Query: 71 SFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYGRFLRIQQTTIVDEYQNL 130
F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+ +GRFLRIQQ T V+EY+NL
Sbjct: 158 CFDGPALNWYRAQEEREKFVSWTNLKERLLIRFQSTREGTAFGRFLRIQQETTVEEYRNL 217
Query: 131 FDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA 178
FDK V+PL+D+ ++VVEETF+SGL PWI+AE+ C PKGLA MM+ A
Sbjct: 218 FDKLVAPLSDVEDRVVEETFMSGLFPWIRAEVILCRPKGLAEMMRTA 264
BLAST of CSPI01G21520 vs. TAIR 10
Match:
AT1G67020.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: leaf; Has 72 Blast hits to 72 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 72; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 61.2 bits (147), Expect = 9.5e-10
Identity = 27/79 (34.18%), Positives = 46/79 (58.23%), Query Frame = 0
Query: 25 NDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQE 84
N + +++EMP+F+ + W ER+F + R +S+K+ + +S EG AL WF +
Sbjct: 102 NRSSLIRRIEMPVFDGSGVYEWFSKVERFFRVGRYQDSDKLDLVALSLEGVALKWFLREM 161
Query: 85 EREKFVDWANMKERLLERF 104
+F DW + ++RLL RF
Sbjct: 162 STLEFRDWNSFEQRLLARF 180
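The Identity and Positives percentages in each HSP header follow from the reported counts: matched positions divided by alignment length, times 100. The sketch below parses one such header line and recomputes both values. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches header lines such as
# "Identity = 111/164 (67.68%), Positives = 141/164 (85.98%), Query Frame = 0".
# "Pos\w+" also tolerates misspelled labels in raw exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return reported and recomputed identity/positive percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
    }


# Header of the top hit listed above.
stats = parse_hsp_stats(
    "Identity = 111/164 (67.68%), Positives = 141/164 (85.98%), Query Frame = 0"
)
print(stats)
```

For the top hit, 111/164 gives 67.68% identity and 141/164 gives 85.98% positives, in agreement with the reported values.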
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7SZK8 | 1.3e-59 | 67.68 | Transposon Tf2-1 polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7VAG8 | 6.6e-59 | 65.52 | Transposon Ty3-I Gag-Pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A5D3BPU7 | 7.3e-58 | 64.94 | Transposon Ty3-I Gag-Pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A5D3BEL2 | 1.2e-57 | 66.47 | Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5D3DC20 | 1.2e-57 | 66.47 | Transposon Tf2-1 polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 G...
NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0036018.1 | 2.7e-59 | 67.68 | transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]
KAA0062661.1 | 1.4e-58 | 65.52 | Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]
TYK01195.1 | 1.5e-57 | 64.94 | Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]
TYJ96875.1 | 2.6e-57 | 66.47 | Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] >TYK19540.1 Ty3/gyp...
TYK21115.1 | 2.6e-57 | 66.47 | transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G67020.1 | 9.5e-10 | 34.18 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...