CSPI03G20730 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G20730
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr3: 16746729 .. 16747616 (-)
RNA-Seq Expression: CSPI03G20730
Synteny: CSPI03G20730
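As a quick consistency check, the span of the feature can be computed from the Location coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome annotations; this page does not state it), and the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates copied from the Location field (Chr3, minus strand).
length_nt = feature_length(16746729, 16747616)

# Split the span into whole codons and any leftover bases.
codons, remainder = divmod(length_nt, 3)
print(length_nt, codons, remainder)
```

Under that assumption the span is 888 nt, which divides evenly into 296 codons (295 residues plus the stop codon).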
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACAACAATGGCAATGCTATGGGTACAACACAACCACTCATTCTAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATGCGTATGAAGACTCTTCTCAGATCTCAAGACTTATGGGACTTAGTAGAACACAACTATGCGGATCCTGACGACGAAGGCAAGTTGCGGGAGAAGAGGAGAAAGGACTCTAAGGCGTTAGTGATTATTCAACAAGCAGTCCATGACAGTGGTTTTTCGCGGATTGGTACAACAACAACGTCAAAAGAAGCGTGGCTGATTTTGCAAAAGGCATTTCGAGGAGATTTAAGAGTACTTGTGGTAAAATTGCAATCACTTAGACAAGAATTTGAGACCTTGATGATGAAAAATAGAGAATCAATTGCTAATTTTTTGTCACGGGCAACGACAATTATTAGTCAGATGCAAACATACGGCGAGATGATTACAGATCAGACTATAGTGGAGAAAGTATTGAGAAGTTTGACTCTAAAGTTCGATCAAGTTGTGGCCGCAATAGAAGAATCAAAGGATCTGTCCACTTTCACATTTATTGAATTAATGGGATCTCTTCAAGCACATGAGTCAAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATAACAGTGATCGTGTGATGACTCGAGGCAGAGGAAGAGGAGGATATCGTGGTCAAGGTCGTGGAACTGAAAAAGGATGCAAACAAAATGAAGAAAAAGGGCAGTTCAGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATGGCAAGAAGTTTGGTCATGTAAAGGCAGACTGCTGGTACAAAAATCAGCGAGCCAATTTTTCAGCAGAGAATGAAGCATAA

mRNA sequence

ATGGACAACAATGGCAATGCTATGGGTACAACACAACCACTCATTCTAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATGCGTATGAAGACTCTTCTCAGATCTCAAGACTTATGGGACTTAGTAGAACACAACTATGCGGATCCTGACGACGAAGGCAAGTTGCGGGAGAAGAGGAGAAAGGACTCTAAGGCGTTAGTGATTATTCAACAAGCAGTCCATGACAGTGGTTTTTCGCGGATTGGTACAACAACAACGTCAAAAGAAGCGTGGCTGATTTTGCAAAAGGCATTTCGAGGAGATTTAAGAGTACTTGTGGTAAAATTGCAATCACTTAGACAAGAATTTGAGACCTTGATGATGAAAAATAGAGAATCAATTGCTAATTTTTTGTCACGGGCAACGACAATTATTAGTCAGATGCAAACATACGGCGAGATGATTACAGATCAGACTATAGTGGAGAAAGTATTGAGAAGTTTGACTCTAAAGTTCGATCAAGTTGTGGCCGCAATAGAAGAATCAAAGGATCTGTCCACTTTCACATTTATTGAATTAATGGGATCTCTTCAAGCACATGAGTCAAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATAACAGTGATCGTGTGATGACTCGAGGCAGAGGAAGAGGAGGATATCGTGGTCAAGGTCGTGGAACTGAAAAAGGATGCAAACAAAATGAAGAAAAAGGGCAGTTCAGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATGGCAAGAAGTTTGGTCATGTAAAGGCAGACTGCTGGTACAAAAATCAGCGAGCCAATTTTTCAGCAGAGAATGAAGCATAA

Coding sequence (CDS)

ATGGACAACAATGGCAATGCTATGGGTACAACACAACCACTCATTCTAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATGCGTATGAAGACTCTTCTCAGATCTCAAGACTTATGGGACTTAGTAGAACACAACTATGCGGATCCTGACGACGAAGGCAAGTTGCGGGAGAAGAGGAGAAAGGACTCTAAGGCGTTAGTGATTATTCAACAAGCAGTCCATGACAGTGGTTTTTCGCGGATTGGTACAACAACAACGTCAAAAGAAGCGTGGCTGATTTTGCAAAAGGCATTTCGAGGAGATTTAAGAGTACTTGTGGTAAAATTGCAATCACTTAGACAAGAATTTGAGACCTTGATGATGAAAAATAGAGAATCAATTGCTAATTTTTTGTCACGGGCAACGACAATTATTAGTCAGATGCAAACATACGGCGAGATGATTACAGATCAGACTATAGTGGAGAAAGTATTGAGAAGTTTGACTCTAAAGTTCGATCAAGTTGTGGCCGCAATAGAAGAATCAAAGGATCTGTCCACTTTCACATTTATTGAATTAATGGGATCTCTTCAAGCACATGAGTCAAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATAACAGTGATCGTGTGATGACTCGAGGCAGAGGAAGAGGAGGATATCGTGGTCAAGGTCGTGGAACTGAAAAAGGATGCAAACAAAATGAAGAAAAAGGGCAGTTCAGAGTGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATGGCAAGAAGTTTGGTCATGTAAAGGCAGACTGCTGGTACAAAAATCAGCGAGCCAATTTTTCAGCAGAGAATGAAGCATAA

Protein sequence

MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKRRKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETLMMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLSTFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGRGTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA*
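The relationship between the CDS and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, assuming the standard genetic code; the function name `translate` and the choice to check only the first 30 and last 15 nucleotides of the CDS are illustrative, not part of the database's pipeline.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGACAACAATGGCAATGCTATGGGTACA"
cds_end = "GAGAATGAAGCATAA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues and stop
```

The two fragments translate to the first ten residues and the last four residues plus stop of the protein sequence shown above.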
Homology
BLAST of CSPI03G20730 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 47.0 bits (110), Expect = 4.3e-04
Identity = 62/280 (22.14%), Positives = 118/280 (42.14%), Query Frame = 0

Query: 18  FKGEGYEFWSMRMKTLLRSQDLWDLVEHNYA-DPDDEGKLREKRRKDSKALVIIQQAVHD 77
           F GE Y  W  R++ LL  QD+  +V+     + DD  K  E+  K +     I + + D
Sbjct: 11  FDGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEVDDSWKKAERCAKST-----IIEYLSD 70

Query: 78  SGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETLMMKNRESIANFLSRAT 137
           S  +   +  T+++    L   +    R  +    +LR+   +L + +  S+ +      
Sbjct: 71  SFLNFATSDITARQILENLDAVYE---RKSLASQLALRKRLLSLKLSSEMSLLSHFHIFD 130

Query: 138 TIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEE-SKDLSTFTFIELMGSLQAHE 197
            +IS++   G  I +   +  +L +L   +D ++ AIE  S++  T  F++    L   E
Sbjct: 131 ELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVK--NRLLDQE 190

Query: 198 SRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGRGTEKGCKQNEEKGQF 257
            +I    + N+     +  +V   NN+ +                      K    K + 
Sbjct: 191 IKIKN--DHNDTSKKVMNAIVHNNNNTYK------------------NNLFKNRVTKPKK 250

Query: 258 RVQSSNKANIQCYHGKKFGHVKADCW-YKNQRANFSAENE 295
             + ++K  ++C+H  + GH+K DC+ YK    N + ENE
Sbjct: 251 IFKGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENE 260

BLAST of CSPI03G20730 vs. ExPASy TrEMBL
Match: A0A5A7UQM0 (Copia protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold319G00340 PE=4 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 4.3e-132
Identity = 246/295 (83.39%), Positives = 266/295 (90.17%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           M NNGN MGTTQPLI IFKGEGYEFWS+RMKTLL SQDLWDLVE  Y DPDDEGKL+E R
Sbjct: 1   MSNNGNVMGTTQPLIPIFKGEGYEFWSIRMKTLLISQDLWDLVEQGYTDPDDEGKLQENR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
            KD KALVI+QQAVHD+ FSRI   TTSK+AWLILQKAF+GD RVLVVKLQSL+++FETL
Sbjct: 61  EKDPKALVIVQQAVHDNVFSRIAAATTSKQAWLILQKAFQGDSRVLVVKLQSLKRDFETL 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
           MMKN ESIA+FLSRATTIISQMQTYGE ITDQTIVEKVLRSLT KFD VVAAIEESKDLS
Sbjct: 121 MMKNGESIADFLSRATTIISQMQTYGETITDQTIVEKVLRSLTPKFDHVVAAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRIN SME+N+EKAF+VKDVVPKYN+SD VMT+G+G GGYR +GR
Sbjct: 181 TFTFIELMGSLQAHESRINISMEKNKEKAFKVKDVVPKYNDSDCVMTQGQGSGGYRSRGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 296
           GT KGC QNEE+ QF VQSSNKANIQCYH KKFGHVKADCWYKNQRANF+ +NEA
Sbjct: 241 GTGKGCNQNEEQRQFGVQSSNKANIQCYHCKKFGHVKADCWYKNQRANFTEQNEA 295

BLAST of CSPI03G20730 vs. ExPASy TrEMBL
Match: A0A5D3DWP2 (Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold225G00850 PE=4 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 2.6e-121
Identity = 230/266 (86.47%), Positives = 245/266 (92.11%), Query Frame = 0

Query: 30  MKTLLRSQDLWDLVEHNYADPDDEGKLREKRRKDSKALVIIQQAVHDSGFSRIGTTTTSK 89
           +KTLLRSQDLWDLVE  Y DPDDEGKLRE R+KDSKALVIIQQAVHDS FSRI T TTSK
Sbjct: 117 VKTLLRSQDLWDLVEQGYVDPDDEGKLRENRKKDSKALVIIQQAVHDSVFSRIATATTSK 176

Query: 90  EAWLILQKAFRGDLRVLVVKLQSLRQEFETLMMKNRESIANFLSRATTIISQMQTYGEMI 149
           +AWLILQKAF+GD RVL+VKLQSLR++FETLMMKN ESIA+FLSRATTIISQMQTYGE I
Sbjct: 177 QAWLILQKAFQGDSRVLMVKLQSLRRDFETLMMKNGESIADFLSRATTIISQMQTYGETI 236

Query: 150 TDQTIVEKVLRSLTLKFDQVVAAIEESKDLSTFTFIELMGSLQAHESRINRSMERNEEKA 209
            DQTIVEKVLRSLT KFD VVAAIEESK+L TFTFIELMGSL+AHESRINRSMERNEEKA
Sbjct: 237 KDQTIVEKVLRSLTPKFDHVVAAIEESKNLFTFTFIELMGSLEAHESRINRSMERNEEKA 296

Query: 210 FQVKDVVPKYNNSDRVMTRGRGRGGYRGQGRGTEKGCKQNEEKGQFRVQSSNKANIQCYH 269
           FQVKD VPKYN+SDRVMTRGRGRGGYRG+G GTEKGC +NE + QF VQSSNKANIQCYH
Sbjct: 297 FQVKDAVPKYNDSDRVMTRGRGRGGYRGRGHGTEKGCNRNEAQRQFGVQSSNKANIQCYH 356

Query: 270 GKKFGHVKADCWYKNQRANFSAENEA 296
            KKFGHVKADCWYKNQRANF+AENEA
Sbjct: 357 CKKFGHVKADCWYKNQRANFAAENEA 382

BLAST of CSPI03G20730 vs. ExPASy TrEMBL
Match: A0A5D3DWC7 (Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold289G00230 PE=4 SV=1)

HSP 1 Score: 421.4 bits (1082), Expect = 3.1e-114
Identity = 223/295 (75.59%), Positives = 238/295 (80.68%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           M NNGN MGT QPLI IFKGEGYEFWS+RMKTLL SQDLWDLVE  Y DPDDEGKL+E R
Sbjct: 1   MSNNGNVMGTAQPLIPIFKGEGYEFWSIRMKTLLISQDLWDLVEQGYTDPDDEGKLQENR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
            KDSKALVIIQQAVHD+ FSRI   TT                           ++FETL
Sbjct: 61  EKDSKALVIIQQAVHDNVFSRIAAATT---------------------------RDFETL 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
           MMKN ESIA+FLSRATTIISQMQTYGE ITDQTIVEKVLRSLT KFD VV AIEESKDLS
Sbjct: 121 MMKNGESIADFLSRATTIISQMQTYGETITDQTIVEKVLRSLTPKFDHVVVAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRIN SME+NEEKAF+VKDVVPKYN+SD VMT+G+G GGYR +GR
Sbjct: 181 TFTFIELMGSLQAHESRINISMEKNEEKAFKVKDVVPKYNDSDCVMTQGQGSGGYRSRGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 296
           GT KGC QNEE+ QF VQSSNKANIQCYH KKFGHVKADCWYKN RANF+ +NEA
Sbjct: 241 GTGKGCNQNEEQRQFGVQSSNKANIQCYHCKKFGHVKADCWYKNHRANFTEQNEA 268

BLAST of CSPI03G20730 vs. ExPASy TrEMBL
Match: A0A5A7UDE3 (DUF4219 domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold101G00260 PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 3.4e-108
Identity = 215/295 (72.88%), Positives = 230/295 (77.97%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           M +N NAMG+TQPLI IFKGEGYEFWS+R KTLLRSQ LWDLVE  YADP+DEGKL+E R
Sbjct: 1   MGSNDNAMGSTQPLIPIFKGEGYEFWSIRTKTLLRSQYLWDLVEQGYADPNDEGKLQENR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
           +KDSK LVIIQQAVHD+ FSRI   TTSK+ WLILQKA +GD RVLV+            
Sbjct: 61  KKDSKMLVIIQQAVHDNVFSRIVAATTSKQVWLILQKALQGDSRVLVI------------ 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
                                 QTYGE I DQTIVEKVLRSLT KFD VVAAIEESKDLS
Sbjct: 121 ----------------------QTYGETIKDQTIVEKVLRSLTPKFDHVVAAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRINRS+E NEEKAFQVKDVVPKYN+SDRVMTRGRGRG Y G+GR
Sbjct: 181 TFTFIELMGSLQAHESRINRSIEINEEKAFQVKDVVPKYNDSDRVMTRGRGRGEYHGRGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 296
           GT KG  QNEE+ QF VQSSNKANIQCYH KKFGHVK DCWYKN RANF+AENEA
Sbjct: 241 GTGKGYNQNEEQRQFGVQSSNKANIQCYHCKKFGHVKVDCWYKNHRANFAAENEA 261

BLAST of CSPI03G20730 vs. ExPASy TrEMBL
Match: A0A5D3CL10 (UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00080 PE=4 SV=1)

HSP 1 Score: 395.2 bits (1014), Expect = 2.4e-106
Identity = 209/248 (84.27%), Positives = 221/248 (89.11%), Query Frame = 0

Query: 41  DLVEHNYADPDDEGKLREKRRKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFR 100
           +LVE  YADPDDEGKLR  ++KDSK LVIIQQAVHDS FS+I   TTSK+AWLILQK F+
Sbjct: 2   NLVEQGYADPDDEGKLRMNKKKDSKVLVIIQQAVHDSVFSQIVIATTSKQAWLILQKEFQ 61

Query: 101 GDLRVLVVKLQSLRQEFETLMMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLR 160
           GD RVLVVKLQSLR++FETLMMKN ESIA+FLSRATTIISQMQTY E I D TIVEKVLR
Sbjct: 62  GDSRVLVVKLQSLRRDFETLMMKNGESIADFLSRATTIISQMQTYSETIKDHTIVEKVLR 121

Query: 161 SLTLKFDQVVAAIEESKDLSTFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYN 220
           SLT KFD VVA IEESKDLSTFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVV KYN
Sbjct: 122 SLTPKFDHVVATIEESKDLSTFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVLKYN 181

Query: 221 NSDRVMTRGRGRGGYRGQGRGTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADC 280
           +SDRV TRGRGRGGYRG+G G EKGC QNEE+ QF VQSSNKANIQCYH KKFGHVKADC
Sbjct: 182 DSDRVTTRGRGRGGYRGRGCGIEKGCNQNEEQRQFGVQSSNKANIQCYHFKKFGHVKADC 241

Query: 281 WYKNQRAN 289
           WYKNQRAN
Sbjct: 242 WYKNQRAN 249

BLAST of CSPI03G20730 vs. NCBI nr
Match: XP_031738054.1 (uncharacterized protein LOC116402652 [Cucumis sativus])

HSP 1 Score: 575.1 bits (1481), Expect = 3.5e-160
Identity = 293/295 (99.32%), Positives = 294/295 (99.66%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR
Sbjct: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
           RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLR+EFETL
Sbjct: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRKEFETL 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
           MMKNRESIANFLSRATTIISQMQTYGE ITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS
Sbjct: 121 MMKNRESIANFLSRATTIISQMQTYGETITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR
Sbjct: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 296
           GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA
Sbjct: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 295

BLAST of CSPI03G20730 vs. NCBI nr
Match: KAE8650579.1 (hypothetical protein Csa_010963 [Cucumis sativus])

HSP 1 Score: 543.1 bits (1398), Expect = 1.5e-150
Identity = 278/280 (99.29%), Positives = 279/280 (99.64%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR
Sbjct: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
           RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLR+EFETL
Sbjct: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRKEFETL 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
           MMKNRESIANFLSRATTIISQMQTYGE ITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS
Sbjct: 121 MMKNRESIANFLSRATTIISQMQTYGETITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR
Sbjct: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADC 281
           GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADC
Sbjct: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADC 280

BLAST of CSPI03G20730 vs. NCBI nr
Match: KAA0055915.1 (copia protein [Cucumis melo var. makuwa])

HSP 1 Score: 480.7 bits (1236), Expect = 9.0e-132
Identity = 246/295 (83.39%), Positives = 266/295 (90.17%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           M NNGN MGTTQPLI IFKGEGYEFWS+RMKTLL SQDLWDLVE  Y DPDDEGKL+E R
Sbjct: 1   MSNNGNVMGTTQPLIPIFKGEGYEFWSIRMKTLLISQDLWDLVEQGYTDPDDEGKLQENR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
            KD KALVI+QQAVHD+ FSRI   TTSK+AWLILQKAF+GD RVLVVKLQSL+++FETL
Sbjct: 61  EKDPKALVIVQQAVHDNVFSRIAAATTSKQAWLILQKAFQGDSRVLVVKLQSLKRDFETL 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
           MMKN ESIA+FLSRATTIISQMQTYGE ITDQTIVEKVLRSLT KFD VVAAIEESKDLS
Sbjct: 121 MMKNGESIADFLSRATTIISQMQTYGETITDQTIVEKVLRSLTPKFDHVVAAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRIN SME+N+EKAF+VKDVVPKYN+SD VMT+G+G GGYR +GR
Sbjct: 181 TFTFIELMGSLQAHESRINISMEKNKEKAFKVKDVVPKYNDSDCVMTQGQGSGGYRSRGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 296
           GT KGC QNEE+ QF VQSSNKANIQCYH KKFGHVKADCWYKNQRANF+ +NEA
Sbjct: 241 GTGKGCNQNEEQRQFGVQSSNKANIQCYHCKKFGHVKADCWYKNQRANFTEQNEA 295

BLAST of CSPI03G20730 vs. NCBI nr
Match: TYK27735.1 (putative gag-pol polyprotein, identical [Cucumis melo var. makuwa])

HSP 1 Score: 444.9 bits (1143), Expect = 5.5e-121
Identity = 230/266 (86.47%), Positives = 245/266 (92.11%), Query Frame = 0

Query: 30  MKTLLRSQDLWDLVEHNYADPDDEGKLREKRRKDSKALVIIQQAVHDSGFSRIGTTTTSK 89
           +KTLLRSQDLWDLVE  Y DPDDEGKLRE R+KDSKALVIIQQAVHDS FSRI T TTSK
Sbjct: 117 VKTLLRSQDLWDLVEQGYVDPDDEGKLRENRKKDSKALVIIQQAVHDSVFSRIATATTSK 176

Query: 90  EAWLILQKAFRGDLRVLVVKLQSLRQEFETLMMKNRESIANFLSRATTIISQMQTYGEMI 149
           +AWLILQKAF+GD RVL+VKLQSLR++FETLMMKN ESIA+FLSRATTIISQMQTYGE I
Sbjct: 177 QAWLILQKAFQGDSRVLMVKLQSLRRDFETLMMKNGESIADFLSRATTIISQMQTYGETI 236

Query: 150 TDQTIVEKVLRSLTLKFDQVVAAIEESKDLSTFTFIELMGSLQAHESRINRSMERNEEKA 209
            DQTIVEKVLRSLT KFD VVAAIEESK+L TFTFIELMGSL+AHESRINRSMERNEEKA
Sbjct: 237 KDQTIVEKVLRSLTPKFDHVVAAIEESKNLFTFTFIELMGSLEAHESRINRSMERNEEKA 296

Query: 210 FQVKDVVPKYNNSDRVMTRGRGRGGYRGQGRGTEKGCKQNEEKGQFRVQSSNKANIQCYH 269
           FQVKD VPKYN+SDRVMTRGRGRGGYRG+G GTEKGC +NE + QF VQSSNKANIQCYH
Sbjct: 297 FQVKDAVPKYNDSDRVMTRGRGRGGYRGRGHGTEKGCNRNEAQRQFGVQSSNKANIQCYH 356

Query: 270 GKKFGHVKADCWYKNQRANFSAENEA 296
            KKFGHVKADCWYKNQRANF+AENEA
Sbjct: 357 CKKFGHVKADCWYKNQRANFAAENEA 382

BLAST of CSPI03G20730 vs. NCBI nr
Match: TYK28117.1 (putative gag-pol polyprotein, identical [Cucumis melo var. makuwa])

HSP 1 Score: 421.4 bits (1082), Expect = 6.5e-114
Identity = 223/295 (75.59%), Positives = 238/295 (80.68%), Query Frame = 0

Query: 1   MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR 60
           M NNGN MGT QPLI IFKGEGYEFWS+RMKTLL SQDLWDLVE  Y DPDDEGKL+E R
Sbjct: 1   MSNNGNVMGTAQPLIPIFKGEGYEFWSIRMKTLLISQDLWDLVEQGYTDPDDEGKLQENR 60

Query: 61  RKDSKALVIIQQAVHDSGFSRIGTTTTSKEAWLILQKAFRGDLRVLVVKLQSLRQEFETL 120
            KDSKALVIIQQAVHD+ FSRI   TT                           ++FETL
Sbjct: 61  EKDSKALVIIQQAVHDNVFSRIAAATT---------------------------RDFETL 120

Query: 121 MMKNRESIANFLSRATTIISQMQTYGEMITDQTIVEKVLRSLTLKFDQVVAAIEESKDLS 180
           MMKN ESIA+FLSRATTIISQMQTYGE ITDQTIVEKVLRSLT KFD VV AIEESKDLS
Sbjct: 121 MMKNGESIADFLSRATTIISQMQTYGETITDQTIVEKVLRSLTPKFDHVVVAIEESKDLS 180

Query: 181 TFTFIELMGSLQAHESRINRSMERNEEKAFQVKDVVPKYNNSDRVMTRGRGRGGYRGQGR 240
           TFTFIELMGSLQAHESRIN SME+NEEKAF+VKDVVPKYN+SD VMT+G+G GGYR +GR
Sbjct: 181 TFTFIELMGSLQAHESRINISMEKNEEKAFKVKDVVPKYNDSDCVMTQGQGSGGYRSRGR 240

Query: 241 GTEKGCKQNEEKGQFRVQSSNKANIQCYHGKKFGHVKADCWYKNQRANFSAENEA 296
           GT KGC QNEE+ QF VQSSNKANIQCYH KKFGHVKADCWYKN RANF+ +NEA
Sbjct: 241 GTGKGCNQNEEQRQFGVQSSNKANIQCYHCKKFGHVKADCWYKNHRANFTEQNEA 268

BLAST of CSPI03G20730 vs. TAIR 10
Match: AT1G48720.1 (unknown protein; Has 229 Blast hits to 229 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 0; Plants - 228; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 62.4 bits (150), Expect = 7.1e-10
Identity = 26/76 (34.21%), Positives = 48/76 (63.16%), Query Frame = 0

Query: 23 YEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGK--------LREKRRKDSKALVIIQQAV 82
          Y+ WS+RMK +L + D+W++VE  + +P++EG         LR+ R++D KAL +I Q +
Sbjct: 18 YDNWSLRMKAILGAHDVWEIVEKGFIEPENEGSLSQTQKDGLRDSRKRDKKALCLIYQGL 77

Query: 83 HDSGFSRIGTTTTSKE 91
           +  F ++   T++K+
Sbjct: 78 DEDTFEKVVEATSAKD 93
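The Identity figures in the HSP headers are the number of alignment columns with identical residues divided by the alignment length. The sketch below shows that arithmetic on the last alignment block above; the helper names `count_identities` and `percent` are hypothetical, and rounding to two decimals is an assumption chosen to match how the page displays the values.

```python
def count_identities(query: str, subject: str) -> int:
    """Count columns where both aligned sequences carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, subject))


def percent(matches: int, length: int) -> float:
    """Percentage of matching columns, rounded to two decimals."""
    return round(100 * matches / length, 2)


# Final block of the AT1G48720.1 alignment (query and subject rows).
query_block = "HDSGFSRIGTTTTSKE"
sbjct_block = "DEDTFEKVVEATSAKD"
print(count_identities(query_block, sbjct_block))

# Whole-HSP counts as reported in the header: 26 identities, 48 positives, 76 columns.
print(percent(26, 76), percent(48, 76))
```

Positives cannot be recomputed this way without the substitution matrix used for the search, so only the reported count is converted to a percentage here.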

The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name       E-value     Identity (%)   Description
P04146           4.3e-04     22.14          Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3

ExPASy TrEMBL
Match Name       E-value     Identity (%)   Description
A0A5A7UQM0       4.3e-132    83.39          Copia protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold319G00340 ...
A0A5D3DWP2       2.6e-121    86.47          Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5D3DWC7       3.1e-114    75.59          Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7UDE3       3.4e-108    72.88          DUF4219 domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo...
A0A5D3CL10       2.4e-106    84.27          UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...

NCBI nr
Match Name       E-value     Identity (%)   Description
XP_031738054.1   3.5e-160    99.32          uncharacterized protein LOC116402652 [Cucumis sativus]
KAE8650579.1     1.5e-150    99.29          hypothetical protein Csa_010963 [Cucumis sativus]
KAA0055915.1     9.0e-132    83.39          copia protein [Cucumis melo var. makuwa]
TYK27735.1       5.5e-121    86.47          putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]
TYK28117.1       6.5e-114    75.59          putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]

TAIR 10
Match Name       E-value     Identity (%)   Description
AT1G48720.1      7.1e-10     34.21          unknown protein; Has 229 Blast hits to 229 proteins in 10 species: Archae - 0; B...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                      Source        Source Term      Source Description    Alignment
None        No IPR available                     PFAM          PF14223          Retrotran_gag_2       coord: 60..199; e-value: 1.0E-23; score: 83.6
None        No IPR available                     MOBIDB_LITE   mobidb-lite      disorder_prediction   coord: 222..255
None        No IPR available                     PANTHER       PTHR34222        FAMILY NOT NAMED      coord: 28..277
None        No IPR available                     PANTHER       PTHR34222:SF30   SUBFAMILY NOT NAMED   coord: 28..277
IPR025314   Domain of unknown function DUF4219   PFAM          PF13961          DUF4219               coord: 20..44; e-value: 2.1E-6; score: 27.3
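The `coord` values give residue ranges on the protein sequence. A minimal sketch of extracting one such range follows, assuming the coordinates are 1-based and inclusive (the page does not say so explicitly); `domain_slice` is a hypothetical helper, and only the first 60 residues of the protein are embedded to keep the example short.

```python
def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]


# First 60 residues of the protein sequence listed above.
protein_prefix = "MDNNGNAMGTTQPLILIFKGEGYEFWSMRMKTLLRSQDLWDLVEHNYADPDDEGKLREKR"

# DUF4219 (PF13961) is annotated at coord 20..44.
duf4219 = domain_slice(protein_prefix, 20, 44)
print(duf4219, len(duf4219))
```

The same helper would apply to the other ranges (for example 60..199 for PF14223) when given the full-length sequence.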

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI03G20730.1    CSPI03G20730.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0015074       DNA integration
molecular_function    GO:0003676       nucleic acid binding
molecular_function    GO:0008270       zinc ion binding