CsGy1G011020 (gene) Cucumber (Gy14) v2

Name: CsGy1G011020
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Transposon Ty3-I Gag-Pol polyprotein
Location: Chr1 : 6908485 .. 6909513 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAACCCAGTAGAAGCATTAGAAGCCAAATTGAAGAAGGTTGAGGATGAAGAAGTTCACAACATCCCACTAACTGACAACAACCACACCACCCTCTTACACCAAATGACTCTGCTCCTGAACGCACACCAACAACGCCTACTTCAATATGCACCCACCCACCGGATGCCCAACCTGGAGCTACCCATGTTCGACGGAACAGATACCCTCATGTGGATCTTGAAAATGGAGCGATACCTTGAAGTTCACCACATCGACGACATTGCCCAGATGATGGAGACCATCCTTCTCTGTATGTCTGGCCAAGCCCTTGCCTGGTTCCGGTGCTTTCAAAACTGGGGAAATCCGCCGGAATCGTGGGGTGAGTTCCGGGGTTCATTGTATAAGAGATTTGGAGACGGTCACAGGGTGCTTTCCAGGTTCATTGGCTTGCAGCAAGAGGGGAGTGTTGGGGAATATTGTAGCAAGTTCGAGTCACTAGGGGCGCTCCTTCCAGAACTCTCTCACTGTGTTGTTGAAGCTAAATTTATGAACGGCTTGAAGACAGAGATTCGAGGGGAGGTTCGGATGTTAGATACAGAAGGTATACTGGACATTATGCATAGAGCAAGGTTAGCGGAAATTAAGAACAACGTCGCTTTGAACTCAACCAAGCGTAAGGGCACGGTCAAGGAGAGGTCTGTGGTTGTTAAGGTCGTCAGTTGGACAGACTATAACTTAATCTCTAAACACTTAGCTACAGACTTGAAGCTTAAATTGGACATGTATGGCGACTACAGTGTGGTTTTAGGTTCCGGAAAGACGGTGAAAGGAGACGGAATTTGCCGTGGAGTGTTGCTGCAGATCGGGAATGAAACCTATGCGGAGGATTTCTTTCCCCTTCAAATGGGAGAGGATGATGAGGTGATATTGGGAAATCTGTGGCTGGTGGATTTAGGCAAGATGGAAGTTGATTGGAAAAACCTTCCGATGAAGCTGGAGGTGGGGAAGGAGATTGTTACTTTAAGAAAAGATCCATCTCTCTGA

mRNA sequence

ATGGAAAACCCAGTAGAAGCATTAGAAGCCAAATTGAAGAAGGTTGAGGATGAAGAAGTTCACAACATCCCACTAACTGACAACAACCACACCACCCTCTTACACCAAATGACTCTGCTCCTGAACGCACACCAACAACGCCTACTTCAATATGCACCCACCCACCGGATGCCCAACCTGGAGCTACCCATGTTCGACGGAACAGATACCCTCATGTGGATCTTGAAAATGGAGCGATACCTTGAAGTTCACCACATCGACGACATTGCCCAGATGATGGAGACCATCCTTCTCTGTATGTCTGGCCAAGCCCTTGCCTGGTTCCGGTGCTTTCAAAACTGGGGAAATCCGCCGGAATCGTGGGGTGAGTTCCGGGGTTCATTGTATAAGAGATTTGGAGACGGTCACAGGGTGCTTTCCAGGTTCATTGGCTTGCAGCAAGAGGGGAGTGTTGGGGAATATTGTAGCAAGTTCGAGTCACTAGGGGCGCTCCTTCCAGAACTCTCTCACTGTGTTGTTGAAGCTAAATTTATGAACGGCTTGAAGACAGAGATTCGAGGGGAGGTTCGGATGTTAGATACAGAAGGTATACTGGACATTATGCATAGAGCAAGGTTAGCGGAAATTAAGAACAACGTCGCTTTGAACTCAACCAAGCGTAAGGGCACGGTCAAGGAGAGGTCTGTGGTTGTTAAGGTCGTCAGTTGGACAGACTATAACTTAATCTCTAAACACTTAGCTACAGACTTGAAGCTTAAATTGGACATGTATGGCGACTACAGTGTGGTTTTAGGTTCCGGAAAGACGGTGAAAGGAGACGGAATTTGCCGTGGAGTGTTGCTGCAGATCGGGAATGAAACCTATGCGGAGGATTTCTTTCCCCTTCAAATGGGAGAGGATGATGAGGTGATATTGGGAAATCTGTGGCTGGTGGATTTAGGCAAGATGGAAGTTGATTGGAAAAACCTTCCGATGAAGCTGGAGGTGGGGAAGGAGATTGTTACTTTAAGAAAAGATCCATCTCTCTGA

Coding sequence (CDS)

ATGGAAAACCCAGTAGAAGCATTAGAAGCCAAATTGAAGAAGGTTGAGGATGAAGAAGTTCACAACATCCCACTAACTGACAACAACCACACCACCCTCTTACACCAAATGACTCTGCTCCTGAACGCACACCAACAACGCCTACTTCAATATGCACCCACCCACCGGATGCCCAACCTGGAGCTACCCATGTTCGACGGAACAGATACCCTCATGTGGATCTTGAAAATGGAGCGATACCTTGAAGTTCACCACATCGACGACATTGCCCAGATGATGGAGACCATCCTTCTCTGTATGTCTGGCCAAGCCCTTGCCTGGTTCCGGTGCTTTCAAAACTGGGGAAATCCGCCGGAATCGTGGGGTGAGTTCCGGGGTTCATTGTATAAGAGATTTGGAGACGGTCACAGGGTGCTTTCCAGGTTCATTGGCTTGCAGCAAGAGGGGAGTGTTGGGGAATATTGTAGCAAGTTCGAGTCACTAGGGGCGCTCCTTCCAGAACTCTCTCACTGTGTTGTTGAAGCTAAATTTATGAACGGCTTGAAGACAGAGATTCGAGGGGAGGTTCGGATGTTAGATACAGAAGGTATACTGGACATTATGCATAGAGCAAGGTTAGCGGAAATTAAGAACAACGTCGCTTTGAACTCAACCAAGCGTAAGGGCACGGTCAAGGAGAGGTCTGTGGTTGTTAAGGTCGTCAGTTGGACAGACTATAACTTAATCTCTAAACACTTAGCTACAGACTTGAAGCTTAAATTGGACATGTATGGCGACTACAGTGTGGTTTTAGGTTCCGGAAAGACGGTGAAAGGAGACGGAATTTGCCGTGGAGTGTTGCTGCAGATCGGGAATGAAACCTATGCGGAGGATTTCTTTCCCCTTCAAATGGGAGAGGATGATGAGGTGATATTGGGAAATCTGTGGCTGGTGGATTTAGGCAAGATGGAAGTTGATTGGAAAAACCTTCCGATGAAGCTGGAGGTGGGGAAGGAGATTGTTACTTTAAGAAAAGATCCATCTCTCTGA

Protein sequence

MENPVEALEAKLKKVEDEEVHNIPLTDNNHTTLLHQMTLLLNAHQQRLLQYAPTHRMPNLELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPESWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDYNLISKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGEDDEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL
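
The location and the sequence lengths above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene is a single exon with no UTRs (the gene, mRNA, and CDS sequences listed above appear identical). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re


def parse_location(text):
    """Parse a location such as 'Chr1 : 6908485 .. 6909513 (-)'.

    Returns (chromosome, start, end, strand). Hypothetical helper.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a 1-based, inclusive interval (an assumption about the coordinates)."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1 : 6908485 .. 6909513 (-)")
gene_length = span_length(start, end)

# If the whole span is coding, it should divide evenly into codons,
# and the final codon is the stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(chrom, strand, gene_length, codons, remainder, protein_length)
```

Under these assumptions the span is 1029 bp, which is 343 codons, giving a 342-residue protein. That agrees with the denominator of 342 in the first BLAST hit below.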
BLAST of CsGy1G011020 vs. NCBI nr
Match: XP_011652863.1 (PREDICTED: uncharacterized protein LOC105435132 [Cucumis sativus])

HSP 1 Score: 684.9 bits (1766), Expect = 1.4e-193
Identity = 336/342 (98.25%), Positives = 338/342 (98.83%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPLTDNNHTTLLHQMTLLLNAHQQRLLQYAPTHRMPNL 60
           MENPVEALEAKLKKVEDEEVHNIP+TDNNHTTLLHQM LLLNAHQQRLLQYAPTHRMPNL
Sbjct: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60

Query: 61  ELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120
           ELPMFDGTDTLMWILKMERY EVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES
Sbjct: 61  ELPMFDGTDTLMWILKMERYFEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120

Query: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180
           WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG
Sbjct: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180

Query: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDYN 240
           LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA NSTKRKGTVKERSVVVKVVSWTDYN
Sbjct: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVAFNSTKRKGTVKERSVVVKVVSWTDYN 240

Query: 241 LISKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED 300
           LISK+LATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED
Sbjct: 241 LISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED 300

Query: 301 DEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL 343
           DEVILGNLWLVDLGKMEVDWKNL MKLEVGKEIVTLRKDPSL
Sbjct: 301 DEVILGNLWLVDLGKMEVDWKNLTMKLEVGKEIVTLRKDPSL 342
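
The Identity figure reported for each HSP is the number of identical aligned positions divided by the alignment length. The sketch below shows that calculation on the last alignment block above. It is a simplified illustration, not BLAST's own code: `percent_identity` is a hypothetical helper, it expects two equal-length aligned strings, and it does not compute the Positives score, which would need a substitution matrix.

```python
def percent_identity(query, subject):
    """Count identical columns in two aligned, equal-length sequences.

    Gap characters ('-') never count as matches.
    Returns (matches, alignment_length, percent rounded to 2 decimals).
    """
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    length = len(query)
    return matches, length, round(100 * matches / length, 2)


# Final block of the alignment against XP_011652863.1 (query residues 301 onward).
query_block = "DEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL"
sbjct_block = "DEVILGNLWLVDLGKMEVDWKNLTMKLEVGKEIVTLRKDPSL"
block_result = percent_identity(query_block, sbjct_block)

# Whole-HSP figure recomputed from the counts reported in the header line.
hsp_identity = round(100 * 336 / 342, 2)

print(block_result, hsp_identity)
```

The single P/T difference in this block gives 41 identical positions out of 42, and the reported whole-HSP counts of 336/342 reproduce the stated 98.25%.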

BLAST of CsGy1G011020 vs. NCBI nr
Match: XP_016900762.1 (PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo])

HSP 1 Score: 580.5 bits (1495), Expect = 3.8e-162
Identity = 286/329 (86.93%), Positives = 301/329 (91.49%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPLT-DNNHTTLLHQMTLLLNAHQQRLLQYAPTHRMPN 60
           MENPVEAL+AKLKKVEDEEVHNIP T DNN TTL+HQM LLLN HQQ L QYAPTHRMPN
Sbjct: 1   MENPVEALDAKLKKVEDEEVHNIPTTHDNNLTTLMHQMALLLNGHQQCLPQYAPTHRMPN 60

Query: 61  LELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPE 120
           LELPMFDGTDTLMWILKMERY EVHHIDDIA+MM+TILLCMSGQALAWFRCFQNWG PPE
Sbjct: 61  LELPMFDGTDTLMWILKMERYFEVHHIDDIARMMDTILLCMSGQALAWFRCFQNWGKPPE 120

Query: 121 SWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 180
           SW EFR SLY RFGD   V S+F+GL+QEGSV EYCSKFE+LGALLPEL H V+EAKFMN
Sbjct: 121 SWDEFRDSLYMRFGDARNVCSKFLGLEQEGSVEEYCSKFETLGALLPELQHVVLEAKFMN 180

Query: 181 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDY 240
           GLKTEIR +VRML  + ILDIMHRARL E KNNVALNSTKRKGTVKERSV+VKVVSWTDY
Sbjct: 181 GLKTEIREDVRMLHPKDILDIMHRARLGEKKNNVALNSTKRKGTVKERSVIVKVVSWTDY 240

Query: 241 NLISKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGE 300
           NLISK+LATDLKLKLDMYGDYSVVLGSGK VKGDGICRGVLLQIGN+TY EDFFPLQMGE
Sbjct: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKKVKGDGICRGVLLQIGNDTYVEDFFPLQMGE 300

Query: 301 DDEVILGNLWLVDLGKMEVDWKNLPMKLE 329
           DDEVILGNLWLV LGKMEVDWKNLPMKL+
Sbjct: 301 DDEVILGNLWLVALGKMEVDWKNLPMKLK 329

BLAST of CsGy1G011020 vs. NCBI nr
Match: KGN64564.1 (hypothetical protein Csa_1G064860 [Cucumis sativus])

HSP 1 Score: 460.3 bits (1183), Expect = 5.7e-126
Identity = 224/235 (95.32%), Positives = 230/235 (97.87%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPLTDNNHTTLLHQMTLLLNAHQQRLLQYAPTHRMPNL 60
           MENPVEALEAKLKKVEDEEVHNIP+TDNNHTTLLHQM LLLNAHQQRLLQYAPTHRMPNL
Sbjct: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60

Query: 61  ELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120
           ELPMFDGTDTLMWILKMERY EVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES
Sbjct: 61  ELPMFDGTDTLMWILKMERYFEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120

Query: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180
           WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG
Sbjct: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180

Query: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVS 236
           LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA NSTKRKGTVKE S+++ ++S
Sbjct: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVAFNSTKRKGTVKESSLIMIMLS 235

BLAST of CsGy1G011020 vs. NCBI nr
Match: XP_024027268.1 (uncharacterized protein LOC112093319 [Morus notabilis])

HSP 1 Score: 151.4 bits (381), Expect = 5.7e-33
Identity = 98/340 (28.82%), Positives = 157/340 (46.18%), Query Frame = 0

Query: 55  HRMPNLELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNW 114
           HR   LE+P+F G D   W+ + +RY  V+ ++D  ++M    +C+ GQAL WF+C  + 
Sbjct: 36  HRGRRLEMPLFQGDDPQGWVFRADRYFAVNDVEDEEKVM-VASVCLEGQALGWFQC-ADA 95

Query: 115 GNPPESWGEFRGSLYKRFGDGHRV--LSRFIGLQQEGSVGEYCSKFESLGALLPELSHCV 174
            NP  SW E R ++ +RFG    V  + + + L+Q  SV EY  +FE   A +  +   V
Sbjct: 96  QNPFRSWQELRAAVLRRFGRAREVDPVEQLMALRQRMSVVEYRDEFEVTAAHMHGVPKTV 155

Query: 175 VEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA------------------ 234
            +  F++GL+ +IR E+++    G+ ++M  A+  E +N  A                  
Sbjct: 156 FKGAFLHGLREDIRAELKLHRPNGLHEMMDLAQQVEARNEAADRVLIVAEEGDKPGEDCE 215

Query: 235 --------------------------------LNSTKRKGTVKERSVVVKVVSWTDYNLI 294
                                            N+ K +G +  R +V  V S   ++ I
Sbjct: 216 EEPEPLEMDNKGKEVLNVKVDLSLNSMRGMSSSNTRKLRGWIGGREIVALVDSGATHSFI 275

Query: 295 SKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGEDDE 343
           SK +  +L L +D     ++++G+G  VK  G+C GV + I      E+FF  ++G  D 
Sbjct: 276 SKLVVKELHLPMDSTVSNNILVGNGMCVKQTGVCWGVRVWIQGHLVEENFFSFELGGAD- 335

BLAST of CsGy1G011020 vs. NCBI nr
Match: XP_022897442.1 (uncharacterized protein LOC111411108 [Olea europaea var. sylvestris])

HSP 1 Score: 149.4 bits (376), Expect = 2.2e-32
Identity = 112/434 (25.81%), Positives = 175/434 (40.32%), Query Frame = 0

Query: 56  RMPNLELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWG 115
           R+  LE+P+F+G D   W+ ++ERY  V+ + +  + +E   +C  G+ALAWF+ ++   
Sbjct: 147 RIRRLEMPVFEGNDPDGWVFRVERYFSVNRLSE-EEKLEAAAVCFDGEALAWFQ-WEERR 206

Query: 116 NPPESWGEFRGSLYKRFGDGHR--VLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVV 175
            P ++W + +  L +RF       + ++F+ LQQ  +V EY  +FE+L + L  +S  V+
Sbjct: 207 RPVKAWEDLKAHLLRRFRPSQEGSLCAQFLSLQQTTTVREYRRRFETLASPLLGISEEVM 266

Query: 176 EAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIKN---------------------- 235
           E  F+NGL+ EIR EV++L   G+  IM  A++ E +N                      
Sbjct: 267 ECSFLNGLRAEIRAEVQLLRPLGLEQIMEVAQMVEDRNLITHQGPGPTKPKSIPYQSPNP 326

Query: 236 ------------------------------------------------------------ 295
                                                                       
Sbjct: 327 PQGTLKQNKNRIPVNILPQRPNFVPKNPAAFKRLSDSEMQIKREKGLCFKCDERFTVGHR 386

Query: 296 ------------------------------------------------------NVALNS 343
                                                                  +++NS
Sbjct: 387 RRNKELQVLLTHEFEGSDERKETGQNGELDAGDVEADGDLAFADTDAMGSGQPVELSINS 446

BLAST of CsGy1G011020 vs. TAIR10
Match: AT3G29750.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 78.2 bits (191), Expect = 1.1e-14
Identity = 64/235 (27.23%), Positives = 104/235 (44.26%), Query Frame = 0

Query: 142 FIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNGLKTEIRGEVRMLDTEGI---- 201
           + G+QQEGSV +Y  +FE+L      L     E  F+ GL+  ++  VR L   GI    
Sbjct: 9   YSGIQQEGSVRDYRERFEALCLRSVTLPGQGFEEMFLQGLQPSLQTAVRELKPNGINSYQ 68

Query: 202 ---------------LDIMHRAR-----LAEIKNN----------VALNSTKRKGT---- 261
                          LD++ + +     L E++ +          + ++ T+ KG     
Sbjct: 69  SRQAELMSLTLVQAKLDVVKKKKGVINELEELEQDSYTLRQGMEQLVIDLTRNKGMRFYG 128

Query: 262 -VKERSVVVKVVSWTDYNLISKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQ 321
            + +  VVV + S    N I   LA  LKL   +    SV+LG  + ++  G C G+ L 
Sbjct: 129 FILDHKVVVAIDSGATDNFILVELAFSLKLPTSITNQASVLLGQRQCIQSVGTCLGIRLW 188

Query: 322 IGNETYAEDFFPLQMGEDD-EVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTL 337
           +      E+F  L + + D +VILG  WL  LG+  V+W+N        ++ +TL
Sbjct: 189 VQEVEITENFLLLDLAKTDVDVILGYEWLSKLGETMVNWQNQDFSFSHNQQWITL 243

BLAST of CsGy1G011020 vs. TAIR10
Match: AT3G42723.1 (aminoacyl-tRNA ligases;ATP binding;nucleotide binding)

HSP 1 Score: 62.0 bits (149), Expect = 8.2e-10
Identity = 72/263 (27.38%), Positives = 109/263 (41.44%), Query Frame = 0

Query: 116 NPPESWGEFRGSLYKRFGDGHRV--LSRFIGLQQEGSVGEYCSKFESL---GALLPELSH 175
           N P SW EF+  + +      +V     + G+QQEGSV EY  +FE+L     +LP    
Sbjct: 315 NSPTSWKEFKCMMARETKTTMKVNHQPHYSGIQQEGSVREYRERFEALCLGSVILPGQG- 374

Query: 176 CVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVV 235
             +EA F+ GL+  ++  VR L   GI+ +M  A+  E  N++ +      G+       
Sbjct: 375 --LEALFLQGLQPSLQTAVRELKPNGIVQMMDTAQWLEESNSLMV-----YGSGLSVQTE 434

Query: 236 VKVVSWTDYNLISK----HLATDLK------------LK---------------LDMYG- 295
            KV   T   L S     ++  DLK            LK               +  YG 
Sbjct: 435 PKVYPTTQAELRSMVLMGYMREDLKDTPRPANEDTGTLKQEHELPGTEVATCRGMRFYGY 494

Query: 296 ----DYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED-DEVILGNLWLVDL 337
               + S        VK    C+ + L+I +    ED+    +  D  +VILG  WL  L
Sbjct: 495 ILQQEVSFSYRLASDVKRS--CQEISLRINDIDIVEDYCVWDLKRDVVDVILGYEWLSKL 554

BLAST of CsGy1G011020 vs. TrEMBL
Match: tr|A0A1S4DXQ7|A0A1S4DXQ7_CUCME (uncharacterized protein LOC107991016 OS=Cucumis melo OX=3656 GN=LOC107991016 PE=4 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 2.5e-162
Identity = 286/329 (86.93%), Positives = 301/329 (91.49%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPLT-DNNHTTLLHQMTLLLNAHQQRLLQYAPTHRMPN 60
           MENPVEAL+AKLKKVEDEEVHNIP T DNN TTL+HQM LLLN HQQ L QYAPTHRMPN
Sbjct: 1   MENPVEALDAKLKKVEDEEVHNIPTTHDNNLTTLMHQMALLLNGHQQCLPQYAPTHRMPN 60

Query: 61  LELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPE 120
           LELPMFDGTDTLMWILKMERY EVHHIDDIA+MM+TILLCMSGQALAWFRCFQNWG PPE
Sbjct: 61  LELPMFDGTDTLMWILKMERYFEVHHIDDIARMMDTILLCMSGQALAWFRCFQNWGKPPE 120

Query: 121 SWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 180
           SW EFR SLY RFGD   V S+F+GL+QEGSV EYCSKFE+LGALLPEL H V+EAKFMN
Sbjct: 121 SWDEFRDSLYMRFGDARNVCSKFLGLEQEGSVEEYCSKFETLGALLPELQHVVLEAKFMN 180

Query: 181 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDY 240
           GLKTEIR +VRML  + ILDIMHRARL E KNNVALNSTKRKGTVKERSV+VKVVSWTDY
Sbjct: 181 GLKTEIREDVRMLHPKDILDIMHRARLGEKKNNVALNSTKRKGTVKERSVIVKVVSWTDY 240

Query: 241 NLISKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGE 300
           NLISK+LATDLKLKLDMYGDYSVVLGSGK VKGDGICRGVLLQIGN+TY EDFFPLQMGE
Sbjct: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKKVKGDGICRGVLLQIGNDTYVEDFFPLQMGE 300

Query: 301 DDEVILGNLWLVDLGKMEVDWKNLPMKLE 329
           DDEVILGNLWLV LGKMEVDWKNLPMKL+
Sbjct: 301 DDEVILGNLWLVALGKMEVDWKNLPMKLK 329

BLAST of CsGy1G011020 vs. TrEMBL
Match: tr|A0A0A0LUB3|A0A0A0LUB3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G064860 PE=4 SV=1)

HSP 1 Score: 460.3 bits (1183), Expect = 3.8e-126
Identity = 224/235 (95.32%), Positives = 230/235 (97.87%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPLTDNNHTTLLHQMTLLLNAHQQRLLQYAPTHRMPNL 60
           MENPVEALEAKLKKVEDEEVHNIP+TDNNHTTLLHQM LLLNAHQQRLLQYAPTHRMPNL
Sbjct: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60

Query: 61  ELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120
           ELPMFDGTDTLMWILKMERY EVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES
Sbjct: 61  ELPMFDGTDTLMWILKMERYFEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120

Query: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180
           WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG
Sbjct: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180

Query: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVS 236
           LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA NSTKRKGTVKE S+++ ++S
Sbjct: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVAFNSTKRKGTVKESSLIMIMLS 235

BLAST of CsGy1G011020 vs. TrEMBL
Match: tr|A0A2I0VWY1|A0A2I0VWY1_9ASPA (Putative mitochondrial protein OS=Dendrobium catenatum OX=906689 GN=MA16_Dca006942 PE=4 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 2.1e-28
Identity = 101/365 (27.67%), Positives = 157/365 (43.01%), Query Frame = 0

Query: 62  LPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPESW 121
           +P+F+  +   WI K+ERY  V+ + +  ++M    +C+ G A AWF+    W +P  +W
Sbjct: 1   MPVFEEGNVDEWIHKVERYFAVNGLMEEERLM-AASMCLEGAAFAWFKYMDKW-DPVRTW 60

Query: 122 GEFRGSLYKRF--GDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 181
            +F+ ++ +RF       +  +F  L QEG+V EY  KFESL   +  LS+  +   FM 
Sbjct: 61  RDFKEAIRERFRGSSPWEISEQFYALTQEGTVEEYRKKFESLVGDMEGLSNSTLGGNFMK 120

Query: 182 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKN---------------------------- 241
           GLK EIR  V+++    + + M  A+L E ++                            
Sbjct: 121 GLKPEIRDAVKVMRPRDLREAMELAQLVENESGKEKSGVTRSAGSFKKLTEEEMQEKRAK 180

Query: 242 --------------------------------------------------NVALNST--- 301
                                                              V+LNS    
Sbjct: 181 GLCFRCEEKFVPGHRCKDRALRALTVYIDEAPEEGDDSDEEQSDPQLEVAEVSLNSVMGF 240

Query: 302 ------KRKGTVKERSVVVKVVSWTDYNLISKHLATDLKLKLDMYGDYSVVLGSGKTVKG 338
                 K KG +  R VVV + S   +N IS  +A +L ++    G Y V++G+GK    
Sbjct: 241 TPSHTMKVKGKIHGREVVVLIDSGATHNFISTQVAEELGIEPTETGSYGVMMGTGKIESS 300

BLAST of CsGy1G011020 vs. TrEMBL
Match: tr|E5GC35|E5GC35_CUCME (Gypsy/ty3 element polyprotein (Fragment) OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 1.8e-27
Identity = 104/381 (27.30%), Positives = 156/381 (40.94%), Query Frame = 0

Query: 56  RMPNLELPMFDGTDTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWG 115
           +   LE+P+F+G D   WI K E Y ++H +++  + ++  ++ M G+ L WFR  +N  
Sbjct: 95  KFKKLEMPVFNGEDPDGWIYKAEYYFQMHLLNE-QEKLKIAIVSMEGKGLCWFRWAEN-R 154

Query: 116 NPPESWGEFRGSLYKRF-----GDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSH 175
               SW E +  +Y RF     G G    +RF+ ++ EGSVGEY  +FE L   LPE++ 
Sbjct: 155 KRFRSWKELKERMYNRFCNREYGTG---CARFLAIKHEGSVGEYLQRFEELSTPLPEMAE 214

Query: 176 CVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA---------------- 235
            V+   F  GL   IR EV  +   G+ D++  ARLAE K  +A                
Sbjct: 215 DVLVGTFTKGLDPVIRTEVFAMRVVGLEDMVDVARLAEEKLEIARASHSPYAKDGKPAQK 274

Query: 236 ----------------------------------------------------------LN 295
                                                                     ++
Sbjct: 275 HAPKNSEIPSTKIVTLAERIPTSLNQASNPQNGATGMGGRILNCKHDVKRDELEDVEMVD 334

Query: 296 ST----------------------------KRKGTVKERSVVVKVVSWTDYNLISKHLAT 330
           ST                            K KG V++R +V+ V     +N IS  L  
Sbjct: 335 STHEGEMVEVSPMVELSLNFVVGLTAPGTFKIKGRVEDREIVIMVDCGATHNFISLKLVE 394

BLAST of CsGy1G011020 vs. TrEMBL
Match: tr|A0A1U8Q8J7|A0A1U8Q8J7_NELNU (uncharacterized protein LOC109115387 OS=Nelumbo nucifera OX=4432 GN=LOC109115387 PE=4 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 5.3e-27
Identity = 98/349 (28.08%), Positives = 150/349 (42.98%), Query Frame = 0

Query: 69  DTLMWILKMERYLEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPESWGEFRGSL 128
           D      + E Y E++ +   A+ +   ++C+ G AL W+     W  P  SW EF+  L
Sbjct: 48  DLTTGFFRAELYFEINGLLP-AERLRAAVVCLEGDALVWYSWEDGW-QPFRSWAEFKELL 107

Query: 129 YKRFGDGH--RVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNGLKTEIR 188
            +RF       +  + + L+Q  +V EY  +FE L   L +L   V+EA F+NGL+ +I+
Sbjct: 108 LERFRSTQEGNLQEQLLSLRQSTTVKEYRRQFEVLSVPLRDLPESVLEAAFVNGLRPDIQ 167

Query: 189 GEVRMLDTEGILDIMHRARLAEIKN----------------------------------- 248
            E+R ++  G++  M  A+  + KN                                   
Sbjct: 168 SELRQMEPVGLMRKMVAAQKIKEKNQALWAYQSGSFPRGPTTSLSGTHFNSLVSQVLWVQ 227

Query: 249 --------------------NVAL------------------NSTKRKGTVKERSVVVKV 308
                                +AL                  NS K +G +  R VV+ V
Sbjct: 228 DDVAEITSDASNLDCTSGDTEIALTPESATLCVSSLVGLCVANSLKVRGRILSREVVILV 287

Query: 309 VSWTDYNLISKHLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFF 343
            S   +N IS+ L  +LKL      ++ V +G+G  V+  G+CR V L +       DFF
Sbjct: 288 DSGASHNFISETLVKELKLPRTPTHEFGVQMGNGDEVRISGVCRQVCLNLPELDVIADFF 347

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_011652863.1  1.4e-193  98.25     PREDICTED: uncharacterized protein LOC105435132 [Cucumis sativus]
XP_016900762.1  3.8e-162  86.93     PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo]
KGN64564.1      5.7e-126  95.32     hypothetical protein Csa_1G064860 [Cucumis sativus]
XP_024027268.1  5.7e-33   28.82     uncharacterized protein LOC112093319 [Morus notabilis]
XP_022897442.1  2.2e-32   25.81     uncharacterized protein LOC111411108 [Olea europaea var. sylvestris]

TAIR10
Match Name   E-value  Identity  Description
AT3G29750.1  1.1e-14  27.23     Eukaryotic aspartyl protease family protein
AT3G42723.1  8.2e-10  27.38     aminoacyl-tRNA ligases;ATP binding;nucleotide binding

TrEMBL
Match Name                      E-value   Identity  Description
tr|A0A1S4DXQ7|A0A1S4DXQ7_CUCME  2.5e-162  86.93     uncharacterized protein LOC107991016 OS=Cucumis melo OX=3656 GN=LOC107991016 PE=...
tr|A0A0A0LUB3|A0A0A0LUB3_CUCSA  3.8e-126  95.32     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G064860 PE=4 SV=1
tr|A0A2I0VWY1|A0A2I0VWY1_9ASPA  2.1e-28   27.67     Putative mitochondrial protein OS=Dendrobium catenatum OX=906689 GN=MA16_Dca0069...
tr|E5GC35|E5GC35_CUCME          1.8e-27   27.30     Gypsy/ty3 element polyprotein (Fragment) OS=Cucumis melo subsp. melo OX=412675 P...
tr|A0A1U8Q8J7|A0A1U8Q8J7_NELNU  5.3e-27   28.08     uncharacterized protein LOC109115387 OS=Nelumbo nucifera OX=4432 GN=LOC109115387...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005162  Retrotrans_gag_dom
IPR021109  Peptidase_aspartic_dom_sf

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G011020.1  CsGy1G011020.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                       Source   Source Term       Source Description  Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D   G3DSA:2.40.70.10                      coord: 221..338; e-value: 6.3E-8; score: 34.5
IPR005162  Retrotransposon gag domain             PFAM     PF03732           Retrotrans_gag      coord: 98..182; e-value: 1.0E-8; score: 35.2
None       No IPR available                       PANTHER  PTHR34482         FAMILY NOT NAMED    coord: 58..213
None       No IPR available                       CDD      cd00303           retropepsin_like    coord: 221..311; e-value: 6.84736E-8; score: 48.4868

The following gene(s) are paralogous to this gene:

None