CSPI01G11090 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G11090
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Location: Chr1: 6929892 .. 6930941 (-)
RNA-Seq Expression: CSPI01G11090
Synteny: CSPI01G11090
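
The Location field packs the chromosome, start, end and strand into one string. The following is a minimal sketch of unpacking it, assuming the `Chr: start .. end (strand)` layout shown in this record and assuming the coordinates are 1-based and inclusive. The names `Location`, `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr1: 6929892 .. 6930941 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Chr1: 6929892 .. 6930941 (-)")
print(loc.chrom, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the feature spans 1050 bp on the minus strand of Chr1.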
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAAATTTGCTAAGAAACAAAATGGAAAACCCAGTAGAAGCATTAGAAGCCAAATTGAAGAAGGTTGAGGATGAAGAAGTTCACAACATCCCAATAACTGACAACAACCACACCACCCTCTTACACCAAATGGCTCTGCTCCTGAACGCACACCAACAACGCCTACTTCAATATGCACCCACCCACCGGATGCCCAACCTGGAGCTACCCATGTTCGACGGAACAGATACCCTCATGTGGATCTTGAAAATGGAGCGATACTTTGAAGTTCCCCACATCGACGACATTGCCCAGATGATGGAGACCATCCTTCTCTGTATGTCTGGCCAAGCCCTTGCCTGGTTCCGGTGCTTTCAAAACTGGGGAAATCCGCCGGAATCGTGGGGTGAGTTCCGGGGTTCATTGTATAAGAGATTTGGAGACGGTCACAGGGTGCTTTCCAGGTTCATTGGCTTGCAGCAAGAGGGGAGTGTTGGGGAATATTGTAGCAAGTTCGAGTCACTAGGGGCGCTCCTTCCAGAACTCTCTCACTGTGTTGTTGAAGCTAAATTTATGAACGGCTTGAAGACAGAGATTCGAGGGGAGGTTCGGATGTTAGATACAGAAGGTATACTGGACATTATGCATAGAGCAAGGTTAGCGGAAATTAAGAACAACGTCGCTTTGAACTCAACCAAGCGTAAGGGCACGGTCAAGGAGAGGTCTGTGGTTGTTAAGGTCGTCAGTTGGACAGACTATAACTTAATCTCTAAAAACTTAGCTACAGACTTGAAGCTTAAATTGGACATGTATGGCGACTACAGTGTGGTTTTAGGTTCCGGAAAGACGGTGAAAGGAGACGGAATTTGCCGTGGAGTGTTGCTGCAGATCGGGAATGAAACCTATGCGGAGGATTTCTTTCCCCTTCAAATGGGAGAGGATGATGAGGTGATATTGGGAAATCTGTGGCTGGTGGATTTAGGCAAGATGGAAGTTGATTGGAAAAACCTTCCGATGAAGCTGGAGGTGGGGAAGGAGATTGTTACTTTAAGAAAAGATCCATCTCTCTGA

mRNA sequence

GGAAATTTGCTAAGAAACAAAATGGAAAACCCAGTAGAAGCATTAGAAGCCAAATTGAAGAAGGTTGAGGATGAAGAAGTTCACAACATCCCAATAACTGACAACAACCACACCACCCTCTTACACCAAATGGCTCTGCTCCTGAACGCACACCAACAACGCCTACTTCAATATGCACCCACCCACCGGATGCCCAACCTGGAGCTACCCATGTTCGACGGAACAGATACCCTCATGTGGATCTTGAAAATGGAGCGATACTTTGAAGTTCCCCACATCGACGACATTGCCCAGATGATGGAGACCATCCTTCTCTGTATGTCTGGCCAAGCCCTTGCCTGGTTCCGGTGCTTTCAAAACTGGGGAAATCCGCCGGAATCGTGGGGTGAGTTCCGGGGTTCATTGTATAAGAGATTTGGAGACGGTCACAGGGTGCTTTCCAGGTTCATTGGCTTGCAGCAAGAGGGGAGTGTTGGGGAATATTGTAGCAAGTTCGAGTCACTAGGGGCGCTCCTTCCAGAACTCTCTCACTGTGTTGTTGAAGCTAAATTTATGAACGGCTTGAAGACAGAGATTCGAGGGGAGGTTCGGATGTTAGATACAGAAGGTATACTGGACATTATGCATAGAGCAAGGTTAGCGGAAATTAAGAACAACGTCGCTTTGAACTCAACCAAGCGTAAGGGCACGGTCAAGGAGAGGTCTGTGGTTGTTAAGGTCGTCAGTTGGACAGACTATAACTTAATCTCTAAAAACTTAGCTACAGACTTGAAGCTTAAATTGGACATGTATGGCGACTACAGTGTGGTTTTAGGTTCCGGAAAGACGGTGAAAGGAGACGGAATTTGCCGTGGAGTGTTGCTGCAGATCGGGAATGAAACCTATGCGGAGGATTTCTTTCCCCTTCAAATGGGAGAGGATGATGAGGTGATATTGGGAAATCTGTGGCTGGTGGATTTAGGCAAGATGGAAGTTGATTGGAAAAACCTTCCGATGAAGCTGGAGGTGGGGAAGGAGATTGTTACTTTAAGAAAAGATCCATCTCTCTGA

Coding sequence (CDS)

ATGGAAAACCCAGTAGAAGCATTAGAAGCCAAATTGAAGAAGGTTGAGGATGAAGAAGTTCACAACATCCCAATAACTGACAACAACCACACCACCCTCTTACACCAAATGGCTCTGCTCCTGAACGCACACCAACAACGCCTACTTCAATATGCACCCACCCACCGGATGCCCAACCTGGAGCTACCCATGTTCGACGGAACAGATACCCTCATGTGGATCTTGAAAATGGAGCGATACTTTGAAGTTCCCCACATCGACGACATTGCCCAGATGATGGAGACCATCCTTCTCTGTATGTCTGGCCAAGCCCTTGCCTGGTTCCGGTGCTTTCAAAACTGGGGAAATCCGCCGGAATCGTGGGGTGAGTTCCGGGGTTCATTGTATAAGAGATTTGGAGACGGTCACAGGGTGCTTTCCAGGTTCATTGGCTTGCAGCAAGAGGGGAGTGTTGGGGAATATTGTAGCAAGTTCGAGTCACTAGGGGCGCTCCTTCCAGAACTCTCTCACTGTGTTGTTGAAGCTAAATTTATGAACGGCTTGAAGACAGAGATTCGAGGGGAGGTTCGGATGTTAGATACAGAAGGTATACTGGACATTATGCATAGAGCAAGGTTAGCGGAAATTAAGAACAACGTCGCTTTGAACTCAACCAAGCGTAAGGGCACGGTCAAGGAGAGGTCTGTGGTTGTTAAGGTCGTCAGTTGGACAGACTATAACTTAATCTCTAAAAACTTAGCTACAGACTTGAAGCTTAAATTGGACATGTATGGCGACTACAGTGTGGTTTTAGGTTCCGGAAAGACGGTGAAAGGAGACGGAATTTGCCGTGGAGTGTTGCTGCAGATCGGGAATGAAACCTATGCGGAGGATTTCTTTCCCCTTCAAATGGGAGAGGATGATGAGGTGATATTGGGAAATCTGTGGCTGGTGGATTTAGGCAAGATGGAAGTTGATTGGAAAAACCTTCCGATGAAGCTGGAGGTGGGGAAGGAGATTGTTACTTTAAGAAAAGATCCATCTCTCTGA

Protein sequence

MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNLELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPESWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDYNLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGEDDEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL*
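
The protein sequence above corresponds to a codon-by-codon translation of the CDS, with the trailing `*` marking the stop codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is illustrative, and unknown codons are rendered as `X` by choice.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        for i in range(0, len(cds), 3)
    )


# First seven and last five codons of the CDS listed above.
print(translate("ATGGAAAACCCAGTAGAAGCA"))
print(translate("GATCCATCTCTCTGA"))
```

Passing the full CDS string from this record to `translate` should give the protein listed above, provided the standard-code assumption holds.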
Homology
BLAST of CSPI01G11090 vs. ExPASy TrEMBL
Match: A0A5D3BJD9 (Aminoacyl-tRNA ligase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G006110 PE=4 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 1.0e-169
Identity = 300/343 (87.46%), Positives = 314/343 (91.55%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPIT-DNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPN 60
           MENPVEAL+AKLKKVEDEEVHNIP T DNN TTL+HQMALLLN HQQ L QYAPTHRMPN
Sbjct: 1   MENPVEALDAKLKKVEDEEVHNIPTTHDNNLTTLMHQMALLLNGHQQCLPQYAPTHRMPN 60

Query: 61  LELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPE 120
           LELPMFDGTDTLMWILKMERYFEV HIDDIA+MM+TILLCMSGQALAWFRCFQNWG PPE
Sbjct: 61  LELPMFDGTDTLMWILKMERYFEVHHIDDIARMMDTILLCMSGQALAWFRCFQNWGKPPE 120

Query: 121 SWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 180
           SW EFR SLY RFGD   V S+F+GL+QEGSV EYCSKFE+LGALLPEL H V+EAKFMN
Sbjct: 121 SWDEFRDSLYMRFGDARNVCSKFLGLEQEGSVEEYCSKFETLGALLPELQHVVLEAKFMN 180

Query: 181 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDY 240
           GLKTEIR +VRML  + ILDIMHRARL E KNNVALNSTKRKGTVKERSV+VKVVSWTDY
Sbjct: 181 GLKTEIREDVRMLHPKDILDIMHRARLGEKKNNVALNSTKRKGTVKERSVIVKVVSWTDY 240

Query: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGE 300
           NLISKNLATDLKLKLDMYGDYSVVLGSGK VKGDGICRGVLLQIGN+TY EDFFPLQMGE
Sbjct: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKKVKGDGICRGVLLQIGNDTYVEDFFPLQMGE 300

Query: 301 DDEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL 343
           DDEVILGNLWLV LGKMEVDWKNLPMKL+VGKE VTLRKDP L
Sbjct: 301 DDEVILGNLWLVALGKMEVDWKNLPMKLKVGKETVTLRKDPFL 343
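
The identity figure in an HSP header is the number of alignment columns where query and subject carry the same residue, divided by the alignment length. The following is a minimal sketch of that calculation on one pair of aligned rows. The helper names `alignment_identity` and `percent` are hypothetical. The "positives" count is not reproduced here because it depends on a substitution matrix.

```python
def alignment_identity(query: str, sbjct: str) -> tuple:
    """Return (identical columns, total columns) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identical = sum(
        1 for q, s in zip(query, sbjct)
        if q == s and q != "-"  # a gap never counts as a match
    )
    return identical, len(query)


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100 * count / total, 2)


# Last block of the alignment against A0A5D3BJD9.
query = "DDEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL"
sbjct = "DDEVILGNLWLVALGKMEVDWKNLPMKLKVGKETVTLRKDPFL"
print(alignment_identity(query, sbjct))

# Whole-HSP figures quoted in the header: 300/343 identical columns.
print(percent(300, 343))
```

Summing the per-block counts over all blocks of an HSP gives the numerator reported in its header.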

BLAST of CSPI01G11090 vs. ExPASy TrEMBL
Match: A0A1S4DXQ7 (uncharacterized protein LOC107991016 OS=Cucumis melo OX=3656 GN=LOC107991016 PE=4 SV=1)

HSP 1 Score: 583.6 bits (1503), Expect = 5.5e-163
Identity = 288/329 (87.54%), Positives = 302/329 (91.79%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPIT-DNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPN 60
           MENPVEAL+AKLKKVEDEEVHNIP T DNN TTL+HQMALLLN HQQ L QYAPTHRMPN
Sbjct: 1   MENPVEALDAKLKKVEDEEVHNIPTTHDNNLTTLMHQMALLLNGHQQCLPQYAPTHRMPN 60

Query: 61  LELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPE 120
           LELPMFDGTDTLMWILKMERYFEV HIDDIA+MM+TILLCMSGQALAWFRCFQNWG PPE
Sbjct: 61  LELPMFDGTDTLMWILKMERYFEVHHIDDIARMMDTILLCMSGQALAWFRCFQNWGKPPE 120

Query: 121 SWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 180
           SW EFR SLY RFGD   V S+F+GL+QEGSV EYCSKFE+LGALLPEL H V+EAKFMN
Sbjct: 121 SWDEFRDSLYMRFGDARNVCSKFLGLEQEGSVEEYCSKFETLGALLPELQHVVLEAKFMN 180

Query: 181 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDY 240
           GLKTEIR +VRML  + ILDIMHRARL E KNNVALNSTKRKGTVKERSV+VKVVSWTDY
Sbjct: 181 GLKTEIREDVRMLHPKDILDIMHRARLGEKKNNVALNSTKRKGTVKERSVIVKVVSWTDY 240

Query: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGE 300
           NLISKNLATDLKLKLDMYGDYSVVLGSGK VKGDGICRGVLLQIGN+TY EDFFPLQMGE
Sbjct: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKKVKGDGICRGVLLQIGNDTYVEDFFPLQMGE 300

Query: 301 DDEVILGNLWLVDLGKMEVDWKNLPMKLE 329
           DDEVILGNLWLV LGKMEVDWKNLPMKL+
Sbjct: 301 DDEVILGNLWLVALGKMEVDWKNLPMKLK 329

BLAST of CSPI01G11090 vs. ExPASy TrEMBL
Match: A0A0A0LUB3 (Retrotrans_gag domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G064860 PE=4 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 1.4e-126
Identity = 226/235 (96.17%), Positives = 231/235 (98.30%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60
           MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL
Sbjct: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60

Query: 61  ELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120
           ELPMFDGTDTLMWILKMERYFEV HIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES
Sbjct: 61  ELPMFDGTDTLMWILKMERYFEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120

Query: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180
           WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG
Sbjct: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180

Query: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVS 236
           LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA NSTKRKGTVKE S+++ ++S
Sbjct: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVAFNSTKRKGTVKESSLIMIMLS 235

BLAST of CSPI01G11090 vs. ExPASy TrEMBL
Match: A0A5D3C860 (Transposon Tf2-1 polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold453G00050 PE=4 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 1.8e-36
Identity = 105/344 (30.52%), Positives = 164/344 (47.67%), Query Frame = 0

Query: 52  APTHRMPNLELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCF 111
           A  ++   +E+P+F G D   W+ + ERYF++  + D  +M+ +  +   G AL W+R  
Sbjct: 119 ADRNKFKKVEMPVFAGEDPDSWLFRAERYFQIHKLSDSEKMLVS-TISFDGPALNWYRS- 178

Query: 112 QNWGNPPESWGEFRGSLYKRFGDG--HRVLSRFIGLQQEGSVGEYCSKFESLGALLPELS 171
           Q       SW   +  L  RF      +VL RF+ ++QE +V +Y + F+ L A L ++ 
Sbjct: 179 QEEREKFTSWANLKERLLVRFRSSRDEKVLERFLRVKQESTVDDYRNLFDKLVAPLSDVP 238

Query: 172 HCVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIK------------------- 231
             VV+  FMNGL   IR EVR+   +G+ ++M  A+L E +                   
Sbjct: 239 DPVVKDTFMNGLFPWIRAEVRICRPKGLAEMMEFAQLVENREIERNEVNLNNFAGGKYSQ 298

Query: 232 --------------------------------NNVALNSTKRKGTVKERSVVVKVVSWTD 291
                                           NN  + + K KG ++ER V++ +     
Sbjct: 299 QNTVNNRTPANTNSDSKTNTNFPMRTITLRSSNNAEICTMKVKGKIQEREVIILIDYGAT 358

Query: 292 YNLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMG 343
           +N IS+ L   L+L +   G Y V+LGSG  V+G GIC  V +Q+ N    E+F PL++G
Sbjct: 359 HNFISEKLVESLQLPVKETGHYGVILGSGTVVQGKGICENVEIQLSNWKVKEEFLPLELG 418

BLAST of CSPI01G11090 vs. ExPASy TrEMBL
Match: A0A5A7T6B1 (Transposon Ty3-G Gag-Pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold86G002030 PE=4 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 3.0e-36
Identity = 105/344 (30.52%), Positives = 163/344 (47.38%), Query Frame = 0

Query: 52  APTHRMPNLELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCF 111
           A  ++   +E+P+F G D   W+ + ERYF++  + D  +M+ +  +   G AL W+R  
Sbjct: 119 ADRNKFKKVEMPVFAGEDPDSWLFRAERYFQIHKLSDSEKMLVS-TISFDGPALNWYRS- 178

Query: 112 QNWGNPPESWGEFRGSLYKRFGDG--HRVLSRFIGLQQEGSVGEYCSKFESLGALLPELS 171
           Q       SW   +  L  RF      +VL RF+ ++QE +V +Y + F+ L A L ++ 
Sbjct: 179 QEEREKFTSWANLKERLLVRFRSSRDEKVLERFLRVKQESTVDDYRNLFDKLVAPLSDVP 238

Query: 172 HCVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIK------------------- 231
             VV+  FMNGL   IR EVR+   +G+ ++M  A+L E +                   
Sbjct: 239 DPVVKDTFMNGLFPWIRAEVRICRPKGLAEMMEFAQLVENREIERNEVNLNNFAGGKYSQ 298

Query: 232 --------------------------------NNVALNSTKRKGTVKERSVVVKVVSWTD 291
                                           NN  + + K KG ++ER V++ +     
Sbjct: 299 QNTVNNRTPANTNSDSKTNTNFPMRTITLRSSNNAEICTMKVKGKIQEREVIILIDYGAT 358

Query: 292 YNLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMG 343
           +N IS+ L   L+L +   G Y V+LGSG  V+G GIC  V +Q+ N    E+F PL++G
Sbjct: 359 HNFISEKLVESLQLPVKETGHYGVILGSGTVVQGKGICENVEIQLSNWKVKEEFLPLELG 418

BLAST of CSPI01G11090 vs. NCBI nr
Match: KAE8652678.1 (hypothetical protein Csa_013756 [Cucumis sativus])

HSP 1 Score: 689.1 bits (1777), Expect = 1.9e-194
Identity = 339/342 (99.12%), Positives = 339/342 (99.12%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60
           MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL
Sbjct: 1   MENPVEALEAKLKKVEDEEVHNIPITDNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPNL 60

Query: 61  ELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120
           ELPMFDGTDTLMWILKMERYFEV HIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES
Sbjct: 61  ELPMFDGTDTLMWILKMERYFEVHHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPES 120

Query: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180
           WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG
Sbjct: 121 WGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNG 180

Query: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDYN 240
           LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVA NSTKRKGTVKERSVVVKVVSWTDYN
Sbjct: 181 LKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVAFNSTKRKGTVKERSVVVKVVSWTDYN 240

Query: 241 LISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED 300
           LISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED
Sbjct: 241 LISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGED 300

Query: 301 DEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL 343
           DEVILGNLWLVDLGKMEVDWKNL MKLEVGKEIVTLRKDPSL
Sbjct: 301 DEVILGNLWLVDLGKMEVDWKNLTMKLEVGKEIVTLRKDPSL 342

BLAST of CSPI01G11090 vs. NCBI nr
Match: KAA0056890.1 (aminoacyl-tRNA ligase [Cucumis melo var. makuwa] >TYJ99393.1 aminoacyl-tRNA ligase [Cucumis melo var. makuwa])

HSP 1 Score: 605.9 bits (1561), Expect = 2.1e-169
Identity = 300/343 (87.46%), Positives = 314/343 (91.55%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPIT-DNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPN 60
           MENPVEAL+AKLKKVEDEEVHNIP T DNN TTL+HQMALLLN HQQ L QYAPTHRMPN
Sbjct: 1   MENPVEALDAKLKKVEDEEVHNIPTTHDNNLTTLMHQMALLLNGHQQCLPQYAPTHRMPN 60

Query: 61  LELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPE 120
           LELPMFDGTDTLMWILKMERYFEV HIDDIA+MM+TILLCMSGQALAWFRCFQNWG PPE
Sbjct: 61  LELPMFDGTDTLMWILKMERYFEVHHIDDIARMMDTILLCMSGQALAWFRCFQNWGKPPE 120

Query: 121 SWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 180
           SW EFR SLY RFGD   V S+F+GL+QEGSV EYCSKFE+LGALLPEL H V+EAKFMN
Sbjct: 121 SWDEFRDSLYMRFGDARNVCSKFLGLEQEGSVEEYCSKFETLGALLPELQHVVLEAKFMN 180

Query: 181 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDY 240
           GLKTEIR +VRML  + ILDIMHRARL E KNNVALNSTKRKGTVKERSV+VKVVSWTDY
Sbjct: 181 GLKTEIREDVRMLHPKDILDIMHRARLGEKKNNVALNSTKRKGTVKERSVIVKVVSWTDY 240

Query: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGE 300
           NLISKNLATDLKLKLDMYGDYSVVLGSGK VKGDGICRGVLLQIGN+TY EDFFPLQMGE
Sbjct: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKKVKGDGICRGVLLQIGNDTYVEDFFPLQMGE 300

Query: 301 DDEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTLRKDPSL 343
           DDEVILGNLWLV LGKMEVDWKNLPMKL+VGKE VTLRKDP L
Sbjct: 301 DDEVILGNLWLVALGKMEVDWKNLPMKLKVGKETVTLRKDPFL 343

BLAST of CSPI01G11090 vs. NCBI nr
Match: XP_016900762.1 (PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo])

HSP 1 Score: 583.6 bits (1503), Expect = 1.1e-162
Identity = 288/329 (87.54%), Positives = 302/329 (91.79%), Query Frame = 0

Query: 1   MENPVEALEAKLKKVEDEEVHNIPIT-DNNHTTLLHQMALLLNAHQQRLLQYAPTHRMPN 60
           MENPVEAL+AKLKKVEDEEVHNIP T DNN TTL+HQMALLLN HQQ L QYAPTHRMPN
Sbjct: 1   MENPVEALDAKLKKVEDEEVHNIPTTHDNNLTTLMHQMALLLNGHQQCLPQYAPTHRMPN 60

Query: 61  LELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPE 120
           LELPMFDGTDTLMWILKMERYFEV HIDDIA+MM+TILLCMSGQALAWFRCFQNWG PPE
Sbjct: 61  LELPMFDGTDTLMWILKMERYFEVHHIDDIARMMDTILLCMSGQALAWFRCFQNWGKPPE 120

Query: 121 SWGEFRGSLYKRFGDGHRVLSRFIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMN 180
           SW EFR SLY RFGD   V S+F+GL+QEGSV EYCSKFE+LGALLPEL H V+EAKFMN
Sbjct: 121 SWDEFRDSLYMRFGDARNVCSKFLGLEQEGSVEEYCSKFETLGALLPELQHVVLEAKFMN 180

Query: 181 GLKTEIRGEVRMLDTEGILDIMHRARLAEIKNNVALNSTKRKGTVKERSVVVKVVSWTDY 240
           GLKTEIR +VRML  + ILDIMHRARL E KNNVALNSTKRKGTVKERSV+VKVVSWTDY
Sbjct: 181 GLKTEIREDVRMLHPKDILDIMHRARLGEKKNNVALNSTKRKGTVKERSVIVKVVSWTDY 240

Query: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMGE 300
           NLISKNLATDLKLKLDMYGDYSVVLGSGK VKGDGICRGVLLQIGN+TY EDFFPLQMGE
Sbjct: 241 NLISKNLATDLKLKLDMYGDYSVVLGSGKKVKGDGICRGVLLQIGNDTYVEDFFPLQMGE 300

Query: 301 DDEVILGNLWLVDLGKMEVDWKNLPMKLE 329
           DDEVILGNLWLV LGKMEVDWKNLPMKL+
Sbjct: 301 DDEVILGNLWLVALGKMEVDWKNLPMKLK 329

BLAST of CSPI01G11090 vs. NCBI nr
Match: TYK06549.1 (transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 163.3 bits (412), Expect = 3.7e-36
Identity = 105/344 (30.52%), Positives = 164/344 (47.67%), Query Frame = 0

Query: 52  APTHRMPNLELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCF 111
           A  ++   +E+P+F G D   W+ + ERYF++  + D  +M+ +  +   G AL W+R  
Sbjct: 119 ADRNKFKKVEMPVFAGEDPDSWLFRAERYFQIHKLSDSEKMLVS-TISFDGPALNWYRS- 178

Query: 112 QNWGNPPESWGEFRGSLYKRFGDG--HRVLSRFIGLQQEGSVGEYCSKFESLGALLPELS 171
           Q       SW   +  L  RF      +VL RF+ ++QE +V +Y + F+ L A L ++ 
Sbjct: 179 QEEREKFTSWANLKERLLVRFRSSRDEKVLERFLRVKQESTVDDYRNLFDKLVAPLSDVP 238

Query: 172 HCVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIK------------------- 231
             VV+  FMNGL   IR EVR+   +G+ ++M  A+L E +                   
Sbjct: 239 DPVVKDTFMNGLFPWIRAEVRICRPKGLAEMMEFAQLVENREIERNEVNLNNFAGGKYSQ 298

Query: 232 --------------------------------NNVALNSTKRKGTVKERSVVVKVVSWTD 291
                                           NN  + + K KG ++ER V++ +     
Sbjct: 299 QNTVNNRTPANTNSDSKTNTNFPMRTITLRSSNNAEICTMKVKGKIQEREVIILIDYGAT 358

Query: 292 YNLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMG 343
           +N IS+ L   L+L +   G Y V+LGSG  V+G GIC  V +Q+ N    E+F PL++G
Sbjct: 359 HNFISEKLVESLQLPVKETGHYGVILGSGTVVQGKGICENVEIQLSNWKVKEEFLPLELG 418

BLAST of CSPI01G11090 vs. NCBI nr
Match: KAA0037097.1 (Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 162.5 bits (410), Expect = 6.3e-36
Identity = 105/344 (30.52%), Positives = 163/344 (47.38%), Query Frame = 0

Query: 52  APTHRMPNLELPMFDGTDTLMWILKMERYFEVPHIDDIAQMMETILLCMSGQALAWFRCF 111
           A  ++   +E+P+F G D   W+ + ERYF++  + D  +M+ +  +   G AL W+R  
Sbjct: 119 ADRNKFKKVEMPVFAGEDPDSWLFRAERYFQIHKLSDSEKMLVS-TISFDGPALNWYRS- 178

Query: 112 QNWGNPPESWGEFRGSLYKRFGDG--HRVLSRFIGLQQEGSVGEYCSKFESLGALLPELS 171
           Q       SW   +  L  RF      +VL RF+ ++QE +V +Y + F+ L A L ++ 
Sbjct: 179 QEEREKFTSWANLKERLLVRFRSSRDEKVLERFLRVKQESTVDDYRNLFDKLVAPLSDVP 238

Query: 172 HCVVEAKFMNGLKTEIRGEVRMLDTEGILDIMHRARLAEIK------------------- 231
             VV+  FMNGL   IR EVR+   +G+ ++M  A+L E +                   
Sbjct: 239 DPVVKDTFMNGLFPWIRAEVRICRPKGLAEMMEFAQLVENREIERNEVNLNNFAGGKYSQ 298

Query: 232 --------------------------------NNVALNSTKRKGTVKERSVVVKVVSWTD 291
                                           NN  + + K KG ++ER V++ +     
Sbjct: 299 QNTVNNRTPANTNSDSKTNTNFPMRTITLRSSNNAEICTMKVKGKIQEREVIILIDYGAT 358

Query: 292 YNLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQIGNETYAEDFFPLQMG 343
           +N IS+ L   L+L +   G Y V+LGSG  V+G GIC  V +Q+ N    E+F PL++G
Sbjct: 359 HNFISEKLVESLQLPVKETGHYGVILGSGTVVQGKGICENVEIQLSNWKVKEEFLPLELG 418

BLAST of CSPI01G11090 vs. TAIR 10
Match: AT3G29750.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-14
Identity = 64/235 (27.23%), Positives = 104/235 (44.26%), Query Frame = 0

Query: 142 FIGLQQEGSVGEYCSKFESLGALLPELSHCVVEAKFMNGLKTEIRGEVRMLDTEGI---- 201
           + G+QQEGSV +Y  +FE+L      L     E  F+ GL+  ++  VR L   GI    
Sbjct: 9   YSGIQQEGSVRDYRERFEALCLRSVTLPGQGFEEMFLQGLQPSLQTAVRELKPNGINSYQ 68

Query: 202 ---------------LDIMHRAR-----LAEIKNN----------VALNSTKRKGT---- 261
                          LD++ + +     L E++ +          + ++ T+ KG     
Sbjct: 69  SRQAELMSLTLVQAKLDVVKKKKGVINELEELEQDSYTLRQGMEQLVIDLTRNKGMRFYG 128

Query: 262 -VKERSVVVKVVSWTDYNLISKNLATDLKLKLDMYGDYSVVLGSGKTVKGDGICRGVLLQ 321
            + +  VVV + S    N I   LA  LKL   +    SV+LG  + ++  G C G+ L 
Sbjct: 129 FILDHKVVVAIDSGATDNFILVELAFSLKLPTSITNQASVLLGQRQCIQSVGTCLGIRLW 188

Query: 322 IGNETYAEDFFPLQMGEDD-EVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTL 337
           +      E+F  L + + D +VILG  WL  LG+  V+W+N        ++ +TL
Sbjct: 189 VQEVEITENFLLLDLAKTDVDVILGYEWLSKLGETMVNWQNQDFSFSHNQQWITL 243

BLAST of CSPI01G11090 vs. TAIR 10
Match: AT3G42723.1 (aminoacyl-tRNA ligases; ATP binding; nucleotide binding)

HSP 1 Score: 63.9 bits (154), Expect = 2.8e-10
Identity = 76/296 (25.68%), Positives = 125/296 (42.23%), Query Frame = 0

Query: 78  ERYFEVPHIDDIAQMMETILLCMSGQALAWFRCFQNWGNPPESWGEFRGSLYKRFGDGHR 137
           E YF   +I +  + ++ +   + G    W +      N P SW EF+  + +      +
Sbjct: 279 ENYFGENNIPE-QERLQIVYSNLEGDIGQWIKHLWK-KNSPTSWKEFKCMMARETKTTMK 338

Query: 138 V--LSRFIGLQQEGSVGEYCSKFESL---GALLPELSHCVVEAKFMNGLKTEIRGEVRML 197
           V     + G+QQEGSV EY  +FE+L     +LP      +EA F+ GL+  ++  VR L
Sbjct: 339 VNHQPHYSGIQQEGSVREYRERFEALCLGSVILPGQG---LEALFLQGLQPSLQTAVREL 398

Query: 198 DTEGILDIMHRARLAEIKNNVALN------STKRK----GTVKERSVVVKVVSWTDYNLI 257
              GI+ +M  A+  E  N++ +        T+ K       + RS+V+      D    
Sbjct: 399 KPNGIVQMMDTAQWLEESNSLMVYGSGLSVQTEPKVYPTTQAELRSMVLMGYMREDLKDT 458

Query: 258 SKNLATD---LKLKLDMYGD----------YSVVLGS--------GKTVKGDGICRGVLL 317
            +    D   LK + ++ G           Y  +L             VK    C+ + L
Sbjct: 459 PRPANEDTGTLKQEHELPGTEVATCRGMRFYGYILQQEVSFSYRLASDVKRS--CQEISL 518

Query: 318 QIGNETYAEDFFPLQMGED-DEVILGNLWLVDLGKMEVDWKNLPMKLEVGKEIVTL 337
           +I +    ED+    +  D  +VILG  WL  LG+ EV+W+N        ++ VTL
Sbjct: 519 RINDIDIVEDYCVWDLKRDVVDVILGYEWLSKLGETEVNWQNQSFSFIHNQDWVTL 567

The following BLAST results are available for this feature:

vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3BJD9 | 1.0e-169 | 87.46 | Aminoacyl-tRNA ligase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold24...
A0A1S4DXQ7 | 5.5e-163 | 87.54 | uncharacterized protein LOC107991016 OS=Cucumis melo OX=3656 GN=LOC107991016 PE=...
A0A0A0LUB3 | 1.4e-126 | 96.17 | Retrotrans_gag domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G064...
A0A5D3C860 | 1.8e-36 | 30.52 | Transposon Tf2-1 polyprotein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7T6B1 | 3.0e-36 | 30.52 | Transposon Ty3-G Gag-Pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E...

vs. NCBI nr
Match Name | E-value | Identity (%) | Description
KAE8652678.1 | 1.9e-194 | 99.12 | hypothetical protein Csa_013756 [Cucumis sativus]
KAA0056890.1 | 2.1e-169 | 87.46 | aminoacyl-tRNA ligase [Cucumis melo var. makuwa] >TYJ99393.1 aminoacyl-tRNA liga...
XP_016900762.1 | 1.1e-162 | 87.54 | PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo]
TYK06549.1 | 3.7e-36 | 30.52 | transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]
KAA0037097.1 | 6.3e-36 | 30.52 | Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]

vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G29750.1 | 1.1e-14 | 27.23 | Eukaryotic aspartyl protease family protein
AT3G42723.1 | 2.8e-10 | 25.68 | aminoacyl-tRNA ligases; ATP binding; nucleotide binding
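
Summary rows like these are typically filtered and ranked before further use. The following is a minimal sketch, using the five NCBI nr hits copied from this page. The 80% identity cutoff and the names `Hit` and `rank_hits` are illustrative assumptions, not settings used by the database.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity


NR_HITS = [
    Hit("KAE8652678.1", float("1.9e-194"), 99.12),
    Hit("KAA0056890.1", float("2.1e-169"), 87.46),
    Hit("XP_016900762.1", float("1.1e-162"), 87.54),
    Hit("TYK06549.1", float("3.7e-36"), 30.52),
    Hit("KAA0037097.1", float("6.3e-36"), 30.52),
]


def rank_hits(hits: List[Hit], min_identity: float = 80.0) -> List[Hit]:
    """Keep hits at or above min_identity, ordered by ascending E-value."""
    kept = [h for h in hits if h.identity >= min_identity]
    return sorted(kept, key=lambda h: h.evalue)


for hit in rank_hits(NR_HITS):
    print(hit.name, hit.evalue, hit.identity)
```

With this cutoff only the three close Cucumis hits remain, and the two distant polyprotein matches near 30% identity are dropped.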
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 216..338; e-value: 1.4E-8; score: 36.4
IPR005162 | Retrotransposon gag domain | PFAM | PF03732 | Retrotrans_gag | coord: 98..182; e-value: 5.9E-9; score: 36.1
None | No IPR available | PANTHER | PTHR34482:SF14 | RETROTRANSPOSON GAG DOMAIN-CONTAINING PROTEIN-RELATED | coord: 2..220
None | No IPR available | PANTHER | PTHR34482 | DNA DAMAGE-INDUCIBLE PROTEIN 1-LIKE | coord: 2..220
None | No IPR available | CDD | cd00303 | retropepsin_like | coord: 221..311; e-value: 1.11731E-8; score: 50.0276
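
The `coord` values in the InterPro annotations give residue ranges on the protein. The following is a minimal sketch of working with them, assuming the ranges are 1-based and inclusive. The helper names `domain_length`, `extract` and `contains` are hypothetical.

```python
# (source, source term, start, end) copied from the InterPro annotations above.
DOMAINS = [
    ("GENE3D", "2.40.70.10", 216, 338),
    ("PFAM", "PF03732", 98, 182),
    ("PANTHER", "PTHR34482", 2, 220),
    ("CDD", "cd00303", 221, 311),
]


def domain_length(start: int, end: int) -> int:
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


def extract(protein: str, start: int, end: int) -> str:
    """Slice a 1-based inclusive range out of a protein string."""
    return protein[start - 1:end]


def contains(outer: tuple, inner: tuple) -> bool:
    """True if the (start, end) range `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for source, term, start, end in DOMAINS:
    print(source, term, domain_length(start, end))

# The Pfam gag domain falls inside the PANTHER family match.
print(contains((2, 220), (98, 182)))
```

Under these assumptions the gag-related matches cover the N-terminal part of the protein and the retropepsin-like matches cover the C-terminal part, with the CDD range lying inside the Gene3D range.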

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI01G11090.1 | CSPI01G11090.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0016874 | ligase activity