Cucsa.117920 (gene) Cucumber (Gy14) v1

NameCucsa.117920
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionRetrotransposon protein
Locationscaffold00997 : 240732 .. 241390 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGCTTGAGTGAGACAATTTTACATCATTTCAACATGGTATTGCTTGCAGTCATTCACTTACATGATGAGCTGTTGAAAAAACCaCAACCAGTGACAAATGGTTGTACAGATCCAAGATGGAGGTGGTTTGTGGTTCATGTTTCTTTCTACAACTATGTACATCCACCGATCAAACGTGCGATCTAGTCATTAATCATTAGTTGTACGTTATATAATTGTCTTGGCCTATTAGATGCCCATTAGATGACACATACATCAAGGTAAACGTTCCAGCATGTGAGCGGCCTAGATATATAACACGAAAGGGCAAAGTTGCCACAATTGTCCTTGGTGTATGTGATACAATATACAATTTTATGTTCGTATTAGCCGGCTGGGAAGGATTGGCTGCTAACTCACATATTCTTCGAGATGTCATTTCAAGACCGAACAGACTAAGGGTGTCGAAGAGTAATTACTTGCAAAATTGTATATAGAAGTACAAATTTTCCATTATCAGTTACGTATTTTTAATCGTCGAATGTGCTCACAAGCTATTACTACCTAGTCGATGGCGGATACCCAAATGCTAAGGGTTTCTTGGCACCATACAGAGGGAAACGCTACTACCTGCAGGACTGGTGTGGTGTTGAAAATGCACCATCAACTGTGAAAGA

mRNA sequence

atgtgcttgagtgagacaattttacatcatttcaacatggtattgcttgcagtcattcacttacatgatgagctgttgaaaaaaccacaaccagtgacaaatggttgtacagatccaagatggaggtggtttgtggtaaacgttccagcatgtgagcggcctagatatataacacgaaagggcaaagttgccacaattgtccttggtgtatgtgatacaatatacaattttatgttcgtattagccggctgggaaggattggctgctaactcacatattcttcgagatgtcatttcaagaccgaacagactaagggtgtcgaagatcgatggcggatacccaaatgctaagggtttcttggcaccatacagagggaaacgctactacctgcaggactggtgtggtgttgaaaatgcaccatcaactgtgaaaga

Coding sequence (CDS)

ATGTGCTTGAGTGAGACAATTTTACATCATTTCAACATGGTATTGCTTGCAGTCATTCACTTACATGATGAGCTGTTGAAAAAACCaCAACCAGTGACAAATGGTTGTACAGATCCAAGATGGAGGTGGTTTGTGGTAAACGTTCCAGCATGTGAGCGGCCTAGATATATAACACGAAAGGGCAAAGTTGCCACAATTGTCCTTGGTGTATGTGATACAATATACAATTTTATGTTCGTATTAGCCGGCTGGGAAGGATTGGCTGCTAACTCACATATTCTTCGAGATGTCATTTCAAGACCGAACAGACTAAGGGTGTCGAAGATCGATGGCGGATACCCAAATGCTAAGGGTTTCTTGGCACCATACAGAGGGAAACGCTACTACCTGCAGGACTGGTGTGGTGTTGAAAATGCACCATCAACTGTGAAAGA

Protein sequence

MCLSETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFVVNVPACERPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSKIDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVKX
BLAST of Cucsa.117920 vs. TrEMBL
Match: E5GCB5_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 5.0e-50
Identity = 105/157 (66.88%), Postives = 119/157 (75.80%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           ETI  HFNMVLLAVI LH+ELLKKPQPV N CTD RWRWF             VNVPA +
Sbjct: 98  ETISRHFNMVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASD 157

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           R RY TRKG+VAT VLGVCDT  +F++VLAGWEG AA+S ILRD +SRPNRL+V K    
Sbjct: 158 RARYRTRKGEVATNVLGVCDTKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKGYYY 217

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
            +D GYPNA+GFLAPYRG+RY+LQ+W G ENAPST K
Sbjct: 218 LVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSK 254

BLAST of Cucsa.117920 vs. TrEMBL
Match: E5GBB2_CUCME (Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 4.7e-40
Identity = 86/157 (54.78%), Postives = 111/157 (70.70%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           ET+  HFN+VLLAV+ L++EL+K+P PVT+ C D RW+ F             VNVPA +
Sbjct: 70  ETVSRHFNIVLLAVLRLYEELIKRPVPVTSNCNDQRWKCFENCLGALDGTYIKVNVPAGD 129

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           RP + TRKG++AT VLGVCD   +F++VLAGWEG AA+S ILRD IS+ N L+V K    
Sbjct: 130 RPTFRTRKGEIATNVLGVCDMKGDFVYVLAGWEGSAADSRILRDAISQENGLQVPKGYYY 189

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
             D GYPNA+GFLAPY+G+RY+LQ+W G  NAP+  K
Sbjct: 190 LCDAGYPNAEGFLAPYKGQRYHLQEWRGAANAPTNAK 226

BLAST of Cucsa.117920 vs. TrEMBL
Match: A5BND9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027369 PE=4 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 1.4e-31
Identity = 75/146 (51.37%), Postives = 91/146 (62.33%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           ETI  HFN VL AVI L   LLKKP+PV+   TD RW+WF             VNV   +
Sbjct: 129 ETISRHFNAVLNAVIRLQGVLLKKPEPVSENSTDERWKWFKNCLGALDGTYIKVNVREGD 188

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           +PRY TRK ++AT VLGVC     F++VL GWEG  ++S +LRD +SR N L V      
Sbjct: 189 KPRYRTRKNEIATNVLGVCSQDMQFIYVLPGWEGSTSDSRVLRDAVSRRNGLTVPHGYYY 248

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDW 134
            +D GY N KGFLAPYRG+RY+L DW
Sbjct: 249 LVDVGYTNGKGFLAPYRGQRYHLNDW 274

BLAST of Cucsa.117920 vs. TrEMBL
Match: A0A061FY73_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_014176 PE=4 SV=1)

HSP 1 Score: 137.5 bits (345), Expect = 1.3e-29
Identity = 66/146 (45.21%), Postives = 90/146 (61.64%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           E+I  HF+ VL AV+ L + L +KP+P+    TD +W+WF             V VP+ +
Sbjct: 136 ESISRHFHNVLAAVLKLQEHLFRKPEPIPTNSTDNQWKWFKNCLGALDGTYIRVKVPSAD 195

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           +PRY TRKG +AT +LGVC     F+FVL GWEG  A+  +LRD + R N L+V      
Sbjct: 196 KPRYRTRKGNIATNMLGVCTPDMQFVFVLPGWEGSVADGRVLRDALRRRNGLKVPNGCYY 255

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDW 134
            +D GY N +GFLAPYRG+RY+L +W
Sbjct: 256 LVDAGYTNCEGFLAPYRGQRYHLNEW 281

BLAST of Cucsa.117920 vs. TrEMBL
Match: A0A061GDT4_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_029406 PE=4 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 2.9e-29
Identity = 66/146 (45.21%), Postives = 90/146 (61.64%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           E+I  HF+ VL AV+ L + L +KP+P+    TD RW+WF             V VP+ +
Sbjct: 136 ESISRHFHNVLAAVLKLQEYLFRKPEPIPTNSTDNRWKWFKNCLGALDGTYIRVKVPSAD 195

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           +PRY TRKG +AT +LGVC     F+FVL GWEG  A+  +LRD + R N L+V      
Sbjct: 196 KPRYRTRKGDIATNMLGVCTLDMQFVFVLPGWEGSVADGRVLRDALRRRNGLKVPNGCYY 255

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDW 134
            +D GY N +GFLAP+RG+RY+L +W
Sbjct: 256 LVDAGYSNCEGFLAPFRGQRYHLNEW 281

BLAST of Cucsa.117920 vs. TAIR10
Match: AT5G41980.1 (AT5G41980.1 Putative harbinger transposase-derived nuclease (InterPro:IPR006912))

HSP 1 Score: 65.1 bits (157), Expect = 4.1e-11
Identity = 48/139 (34.53%), Postives = 67/139 (48.20%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCT----DPRWRWFV---------VNVPAC 64
           ETI  HFN VL AVI +  +     QP +N  T    DP ++  V         V V   
Sbjct: 103 ETISRHFNNVLNAVIAISKDFF---QPNSNSDTLENDDPYFKDCVGVVDSFHIPVMVGVD 162

Query: 65  ERPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK--- 124
           E+  +    G +   VL        F +VLAGWEG A++  +L   ++R N+L+V +   
Sbjct: 163 EQGPFRNGNGLLTQNVLAASSFDLRFNYVLAGWEGSASDQQVLNAALTRRNKLQVPQGKY 222

Query: 125 --IDGGYPNAKGFLAPYRG 126
             +D  YPN  GF+APY G
Sbjct: 223 YIVDNKYPNLPGFIAPYHG 238

BLAST of Cucsa.117920 vs. TAIR10
Match: AT5G28950.1 (AT5G28950.1 unknown protein)

HSP 1 Score: 48.1 bits (113), Expect = 5.2e-06
Identity = 24/58 (41.38%), Postives = 36/58 (62.07%), Query Frame = 1

Query: 54  PRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISR-PNRLRVSKID 111
           P +  RKG ++  +L  C+    FM+VL+GWEG A +S +L D ++R  NRL V + D
Sbjct: 44  PSFRNRKGDISQNMLAACNFDVEFMYVLSGWEGSAHDSKVLNDALTRNSNRLPVPEED 101

BLAST of Cucsa.117920 vs. NCBI nr
Match: gi|307136287|gb|ADN34114.1| (retrotransposon protein [Cucumis melo subsp. melo])

HSP 1 Score: 205.3 bits (521), Expect = 7.2e-50
Identity = 105/157 (66.88%), Postives = 119/157 (75.80%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           ETI  HFNMVLLAVI LH+ELLKKPQPV N CTD RWRWF             VNVPA +
Sbjct: 98  ETISRHFNMVLLAVIRLHEELLKKPQPVPNECTDQRWRWFENCLGALDGTYIKVNVPASD 157

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           R RY TRKG+VAT VLGVCDT  +F++VLAGWEG AA+S ILRD +SRPNRL+V K    
Sbjct: 158 RARYRTRKGEVATNVLGVCDTKGDFVYVLAGWEGSAADSRILRDALSRPNRLKVPKGYYY 217

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
            +D GYPNA+GFLAPYRG+RY+LQ+W G ENAPST K
Sbjct: 218 LVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSK 254

BLAST of Cucsa.117920 vs. NCBI nr
Match: gi|659118458|ref|XP_008459130.1| (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 203.0 bits (515), Expect = 3.6e-49
Identity = 105/161 (65.22%), Postives = 118/161 (73.29%), Query Frame = 1

Query: 1   MCLSETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNV 60
           M L ETI  HFNMVLLAVI LH ELLKKPQPV N CTD RWRWF             VNV
Sbjct: 52  MRLGETISRHFNMVLLAVIRLHQELLKKPQPVPNDCTDQRWRWFENCLGALDGTYIKVNV 111

Query: 61  PACERPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK 120
           P  +R RY TRKG+VAT VLGVCDT  +F++VLAGWEG AA+S ILRD +SRPN L+V K
Sbjct: 112 PVSDRARYRTRKGEVATNVLGVCDTKGDFIYVLAGWEGSAADSRILRDALSRPNGLKVPK 171

Query: 121 -----IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
                +D GYPNA+GFLAPYRG+RY+LQ+W G ENAPST K
Sbjct: 172 GYYYLVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSK 212

BLAST of Cucsa.117920 vs. NCBI nr
Match: gi|659111563|ref|XP_008455792.1| (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 191.8 bits (486), Expect = 8.3e-46
Identity = 97/157 (61.78%), Postives = 112/157 (71.34%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNVPACE 64
           ET+  HFN+VLLA   LHDELLKKPQPVTN CTDPRW+WF             VNV A +
Sbjct: 56  ETVSRHFNIVLLAGFRLHDELLKKPQPVTNSCTDPRWKWFENCLGALDGTYIKVNVSATD 115

Query: 65  RPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK---- 124
           RPRY TRKG+VAT VLG CDT  +F+FVL GWEG AA+S ILRD ISR N L+V K    
Sbjct: 116 RPRYRTRKGEVATNVLGACDTKGDFVFVLFGWEGSAADSRILRDAISRHNGLKVPKGYYY 175

Query: 125 -IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
             D GYPNA+GFLAPYRG+RY+L +W G  NAP+T +
Sbjct: 176 LCDAGYPNAEGFLAPYRGERYHLSEWRGESNAPTTAR 212

BLAST of Cucsa.117920 vs. NCBI nr
Match: gi|659109385|ref|XP_008454689.1| (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 189.9 bits (481), Expect = 3.1e-45
Identity = 100/161 (62.11%), Postives = 115/161 (71.43%), Query Frame = 1

Query: 1   MCLSETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV------------VNV 60
           M L ETI  HF+MVLL VI LHDELLKKPQPV N CTD RWRWF             VNV
Sbjct: 1   MRLGETISRHFHMVLL-VIRLHDELLKKPQPVANDCTDQRWRWFENYLGALDGTYIKVNV 60

Query: 61  PACERPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK 120
              +R RY TRKG+VAT VLGVCDT  +F++VL+GWEG AA+S ILRD ISRPN L+V K
Sbjct: 61  LESDRARYKTRKGEVATNVLGVCDTKGDFVYVLSGWEGSAADSRILRDAISRPNGLKVPK 120

Query: 121 -----IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
                +D GYPNA+ FLAPYRG+RY+LQ+W G EN PS +K
Sbjct: 121 GYYYLVDAGYPNAEDFLAPYRGQRYHLQEWRGAENVPSNLK 160

BLAST of Cucsa.117920 vs. NCBI nr
Match: gi|659114872|ref|XP_008457266.1| (PREDICTED: putative nuclease HARBI1 [Cucumis melo])

HSP 1 Score: 187.6 bits (475), Expect = 1.6e-44
Identity = 97/158 (61.39%), Postives = 115/158 (72.78%), Query Frame = 1

Query: 5   ETILHHFNMVLLAVIHLHDELLKKPQPVTNGCTDPRWRWFV-------------VNVPAC 64
           ET+  HFNMVLLAVI LH+ELLKKPQPV N  TD +W + +             VNVPA 
Sbjct: 37  ETMSRHFNMVLLAVIRLHEELLKKPQPVPNEYTDKKWSYVLQNCLGALDGTYIKVNVPAS 96

Query: 65  ERPRYITRKGKVATIVLGVCDTIYNFMFVLAGWEGLAANSHILRDVISRPNRLRVSK--- 124
           +R RY TRKG+VAT VLGVCDT  +F++VL GWEG AA+S ILRD +SRPN L+V K   
Sbjct: 97  DRARYRTRKGEVATNVLGVCDTKGDFVYVLVGWEGSAADSRILRDALSRPNELKVPKGYY 156

Query: 125 --IDGGYPNAKGFLAPYRGKRYYLQDWCGVENAPSTVK 145
             +D GYPNA+GFLAPYRG+RY+LQ+W G ENAPST K
Sbjct: 157 YLVDVGYPNAEGFLAPYRGQRYHLQEWRGPENAPSTSK 194

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
E5GCB5_CUCME5.0e-5066.88Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
E5GBB2_CUCME4.7e-4054.78Retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
A5BND9_VITVI1.4e-3151.37Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027369 PE=4 SV=1[more]
A0A061FY73_THECC1.3e-2945.21Uncharacterized protein OS=Theobroma cacao GN=TCM_014176 PE=4 SV=1[more]
A0A061GDT4_THECC2.9e-2945.21Uncharacterized protein OS=Theobroma cacao GN=TCM_029406 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G41980.14.1e-1134.53 Putative harbinger transposase-derived nuclease (InterPro:IPR006912)[more]
AT5G28950.15.2e-0641.38 unknown protein[more]
Match NameE-valueIdentityDescription
gi|307136287|gb|ADN34114.1|7.2e-5066.88retrotransposon protein [Cucumis melo subsp. melo][more]
gi|659118458|ref|XP_008459130.1|3.6e-4965.22PREDICTED: putative nuclease HARBI1 [Cucumis melo][more]
gi|659111563|ref|XP_008455792.1|8.3e-4661.78PREDICTED: putative nuclease HARBI1 [Cucumis melo][more]
gi|659109385|ref|XP_008454689.1|3.1e-4562.11PREDICTED: putative nuclease HARBI1 [Cucumis melo][more]
gi|659114872|ref|XP_008457266.1|1.6e-4461.39PREDICTED: putative nuclease HARBI1 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.117920.1Cucsa.117920.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR22930UNCHARACTERIZEDcoord: 3..133
score: 4.7
NoneNo IPR availablePANTHERPTHR22930:SF27SUBFAMILY NOT NAMEDcoord: 3..133
score: 4.7

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None