CSPI03G21030 (gene) Wild cucumber (PI 183967)

NameCSPI03G21030
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionTransposon Ty3-G Gag-Pol polyprotein
LocationChr3 : 17111058 .. 17112624 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTCTGAAGGAGATGATGCTGGAAATGAAGAAATTGATGGATCGAATGACAGACGAGTTGAGAGAAAACCATGGCACTAAAAAAAGAGAAGAAGCTGGGACTACGGAAGGCCCCATGTTGAAACTGAAGGGAAAGCTGGAAGATACCGAAACCACAGCCGAGTCGGGAGGAAGCAACGCAGGCCGAAGTAAGTACAAAAAATTAGAAATGTTATTGTTTACAGGGGTGAACCCTGAATCATGGGCTTACCGAGCAGAACATTTCTTTGATATAAACAATTTATCGGAAGCTGAGAAGGTCAAAGTAGCAGTCGTGAGTTTTGGACAGGAAGAGGTTGACTAGTTTAGGTGGAGTCATCATCGAAAGAGGGTGGAATCTTGGGAGGATTTGAAGGAGAGGATGTTTGATTTTTTCAATGATACGGGACAGCAGAGCTTGGGAGCAAGAGTAATCTGTATCAAGCAAAAAGGCTCATACAGTGATTATGTTAAGAAGTTTGTTACATATTCAGCCCCCCTCCCGGACATGGCCGAAAGTGTTCTGCTGGATGCGTTTTTTACGGGTCTTGAGCCAGCACTGCAAGCAGAAGTAATCAGCAGGCACCCTCAAACTTTAGAGGAGTGTATGAAGAAAGCTCAACTGGTGAATGATAGGAATATAGCATTAAAACTGGCGATGGAAGAGATGGATAAAGTTGAGCCCAAAAGAAGTGAGAGCAGTAGCAAACTTCAAGGAAAAGGTGAGAAGGTAGAAACAAAAAAAGACAAAGTTTGTCATGAAGCAGGTAACTATCCCACTAAAAGGTGATTACCAGAAAAAAGATCCCCCAATTAAAAGGCTGTTGGATATGGAATTTAGAGCAAGGCTGGATAAAGGGTTTTGTTTCAGGTGTAATGAAAAGTATTCCCCCCGTCATAGATGTAAAGGCAGAGAAAAAAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGATGACAACGAAAGGGAATCAAATACAGAGGATGAGGCAGGCGAAATAATAGAACTGAATCAATTGGAATTGAATGAGGATACCCCTATTGAGTTAAGGTTGATCACGAGGGTTACGTCCAAGGGAACGATGAAACTAAAAGGACATGTGAATGGAAAGGAAGTAGTCATCCTGACTAACAGCGGTGCAACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAACTGAGCATCGATTCGGGGACTCGTTTCGGAGTTACCATTAGGAATGGCACCCGATGTGAAGGGAGAGGGATCTGTAAGAGGGTAAAAGTGAAGTTGAAAGAGTTAACGATCGTAGCAGACTTCCTGGCGGGAGAATTAGGAAGGGTAGACTTGGTGTTGGGGATACAATGGTTAGATTCGACAGGGACTATGAAAGTACACTGGCCATCTCTAACCATGACGTTTTGGACAAAGGGTAGAAGAATAATCCTAAAAGGGGACCCTTCTCTAACAAAGTTAGAATGTTCGTTGAAAACCTTAGAGAAAACTTGGCAATCTGGGGACCAAGGATTTCTCTTGGAATTTCAAAATTATGAAGTGTGA

mRNA sequence

ATGAGTCTGAAGGAGATGATGCTGGAAATGAAGAAATTGATGGATCGAATGACAGACGAGTTGAGAGAAAACCATGGCACTAAAAAAAGAGAAGAAGCTGGGACTACGGAAGGCCCCATGTTGAAACTGAAGGGAAAGCTGGAAGATACCGAAACCACAGCCGAGTCGGGAGGAAGCAACGCAGGCCGAAGGGTGAACCCTGAATCATGGGCTTACCGAGCAGAACATTTCTTTGATATAAACAATTTATCGGAAGCTGAGAAGGTCAAAGTAGCAGTCGTGAGTTTTGGACAGGAAGAGCAGAGCTTGGGAGCAAGAGTAATCTGTATCAAGCAAAAAGGCTCATACAGTGATTATGTTAAGAAGTTTGTTACATATTCAGCCCCCCTCCCGGACATGGCCGAAAGTGTTCTGCTGGATGCGTTTTTTACGGGTCTTGAGCCAGCACTGCAAGCAGAAGTAATCAGCAGGCACCCTCAAACTTTAGAGGAGTGTATGAAGAAAGCTCAACTGGTGAATGATAGGAATATAGCATTAAAACTGGCGATGGAAGAGATGGATAAAGTTGAGCCCAAAAGAAGTGAGAGCAGTAGCAAACTTCAAGGAAAAGGTGAGAAGGTAACTATCCCACTAAAAGGTGATTACCAGAAAAAAGATCCCCCAATTAAAAGGCTGTTGGATATGGAATTTAGAGCAAGGCTGGATAAAGGGTTTTGTTTCAGGTGTAATGAAAAGTATTCCCCCCGTCATAGATGTAAAGGCAGAGAAAAAAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGATGACAACGAAAGGGAATCAAATACAGAGGATGAGGCAGGCGAAATAATAGAACTGAATCAATTGGAATTGAATGAGGATACCCCTATTGAGTTAAGGTTGATCACGAGGGTTACGTCCAAGGGAACGATGAAACTAAAAGGACATGTGAATGGAAAGGAAGTAGTCATCCTGACTAACAGCGGTGCAACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAACTGAGCATCGATTCGGGGACTCGTTTCGGAGTTACCATTAGGAATGGCACCCGATGTGAAGGGAGAGGGATCTGTAAGAGGGTAAAAGTGAAGTTGAAAGAGTTAACGATCGTAGCAGACTTCCTGGCGGGAGAATTAGGAAGGGTAGACTTGGTGTTGGGGATACAATGGTTAGATTCGACAGGGACTATGAAAGTACACTGGCCATCTCTAACCATGACGTTTTGGACAAAGGGTAGAAGAATAATCCTAAAAGGGGACCCTTCTCTAACAAAGTTAGAATGTTCGTTGAAAACCTTAGAGAAAACTTGGCAATCTGGGGACCAAGGATTTCTCTTGGAATTTCAAAATTATGAAGTGTGA

Coding sequence (CDS)

ATGAGTCTGAAGGAGATGATGCTGGAAATGAAGAAATTGATGGATCGAATGACAGACGAGTTGAGAGAAAACCATGGCACTAAAAAAAGAGAAGAAGCTGGGACTACGGAAGGCCCCATGTTGAAACTGAAGGGAAAGCTGGAAGATACCGAAACCACAGCCGAGTCGGGAGGAAGCAACGCAGGCCGAAGGGTGAACCCTGAATCATGGGCTTACCGAGCAGAACATTTCTTTGATATAAACAATTTATCGGAAGCTGAGAAGGTCAAAGTAGCAGTCGTGAGTTTTGGACAGGAAGAGCAGAGCTTGGGAGCAAGAGTAATCTGTATCAAGCAAAAAGGCTCATACAGTGATTATGTTAAGAAGTTTGTTACATATTCAGCCCCCCTCCCGGACATGGCCGAAAGTGTTCTGCTGGATGCGTTTTTTACGGGTCTTGAGCCAGCACTGCAAGCAGAAGTAATCAGCAGGCACCCTCAAACTTTAGAGGAGTGTATGAAGAAAGCTCAACTGGTGAATGATAGGAATATAGCATTAAAACTGGCGATGGAAGAGATGGATAAAGTTGAGCCCAAAAGAAGTGAGAGCAGTAGCAAACTTCAAGGAAAAGGTGAGAAGGTAACTATCCCACTAAAAGGTGATTACCAGAAAAAAGATCCCCCAATTAAAAGGCTGTTGGATATGGAATTTAGAGCAAGGCTGGATAAAGGGTTTTGTTTCAGGTGTAATGAAAAGTATTCCCCCCGTCATAGATGTAAAGGCAGAGAAAAAAGAGAGTTGATGCTCCTTATACTAAATGAGGAAGATGACAACGAAAGGGAATCAAATACAGAGGATGAGGCAGGCGAAATAATAGAACTGAATCAATTGGAATTGAATGAGGATACCCCTATTGAGTTAAGGTTGATCACGAGGGTTACGTCCAAGGGAACGATGAAACTAAAAGGACATGTGAATGGAAAGGAAGTAGTCATCCTGACTAACAGCGGTGCAACCAATAACTTCATCAGCCAAGTGTTGGTAGATGAACTACAACTGAGCATCGATTCGGGGACTCGTTTCGGAGTTACCATTAGGAATGGCACCCGATGTGAAGGGAGAGGGATCTGTAAGAGGGTAAAAGTGAAGTTGAAAGAGTTAACGATCGTAGCAGACTTCCTGGCGGGAGAATTAGGAAGGGTAGACTTGGTGTTGGGGATACAATGGTTAGATTCGACAGGGACTATGAAAGTACACTGGCCATCTCTAACCATGACGTTTTGGACAAAGGGTAGAAGAATAATCCTAAAAGGGGACCCTTCTCTAACAAAGTTAGAATGTTCGTTGAAAACCTTAGAGAAAACTTGGCAATCTGGGGACCAAGGATTTCTCTTGGAATTTCAAAATTATGAAGTGTGA
BLAST of CSPI03G21030 vs. TrEMBL
Match: E5GC18_CUCME (Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 6.8e-56
Identity = 154/465 (33.12%), Postives = 242/465 (52.04%), Query Frame = 1

Query: 19  DELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSNAGRRVNPESWAYRAEHFF 78
           +E+ E  G ++ E + + E    + + K    E    +  S  GR +N   WA   + F 
Sbjct: 70  EEVTEEEG-EEGETSLSVETEAGQKRFKFRKLEMPVFNVVSLEGRGMNWFRWAENRKWFR 129

Query: 79  DINNLSEAEKVKVAVVSFGQEEQSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVL 138
               L E    +     +G    +  AR + IKQ+GS S+Y++KF   SAPLP+MAE VL
Sbjct: 130 SWKELKERMYNRFRCRDYG----TACARFLAIKQEGSVSEYLQKFEELSAPLPEMAEEVL 189

Query: 139 LDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALK-----------LAMEEMD 198
              F  GL+P ++ EV +     LE+ M+ AQL  ++    K            A   + 
Sbjct: 190 EGTFTNGLDPRIRKEVFAMRVVGLEDLMEAAQLAEEKAEVTKGGPYPYPYSKEAAKTNLG 249

Query: 199 KVEPKRSESSSKLQGKGEKVTIPLK----------GDYQKKDPPIKRLLDMEFRARLDKG 258
                     +K+    +KV               G   +++P  +R +D + +AR +KG
Sbjct: 250 SSPKTFGSPPTKMVTLAKKVVNQTSNSNTSQSHTVGGGGRREPSYRRWIDSKLQARKEKG 309

Query: 259 FCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTP 318
            C+RC++ +S  HRCK RE +    L +  +D  + E + ED  G ++E+  +       
Sbjct: 310 LCYRCDKPFSKGHRCKNRELK----LCVVADDLIDTEMSEEDNDGGMVEIGPI-----VE 369

Query: 319 IELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVT 378
           + L  +  +T+ GT K+KG V  +EVV++ + GAT+NFIS  LV+E+Q++    T++GV 
Sbjct: 370 LSLSSVVGLTAPGTSKIKGKVEDREVVVMIDCGATHNFISLRLVEEMQIATTETTQYGVI 429

Query: 379 IRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLT 438
           + +G   +G+G+C  V V L  LT+V DFL  ELG +D+VLG+QWL   G M V W +L 
Sbjct: 430 MGSGKAVQGKGMCTGVVVGLPGLTVVEDFLPLELGHLDMVLGMQWLPKQGAMTVDWRNLA 489

Query: 439 MTFWTKGRRIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEFQN 463
           MTF  +  +++L+GD SLT++  SLK L K WQ  D+GFL   Q+
Sbjct: 490 MTFAVRDVKVMLRGDLSLTRMAISLKMLMKQWQPEDRGFLSLLQS 520

BLAST of CSPI03G21030 vs. TrEMBL
Match: A5B2I6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.4e-53
Identity = 140/401 (34.91%), Postives = 223/401 (55.61%), Query Frame = 1

Query: 47   LEDTETTAESGGSNAGRRVNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEEQSLGAR 106
            L + E    +  S  G  ++   W    E F    NL     ++  +     +E SL  +
Sbjct: 799  LTEEEKLVAAAMSLDGDALSWYQWTDSREVFGSWENLKRRLLLRFRLT----QEGSLCEQ 858

Query: 107  VICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVISRHPQTLEECM 166
             + ++Q+G+ + Y ++F     PL  ++E V+   F  GL P ++AE     P  L   M
Sbjct: 859  FLAVRQQGTVAAYWREFEILETPLKGISEEVMESTFMNGLLPEIRAEQRLLQPYGLGHLM 918

Query: 167  KKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGK-GEK-----VTIPLKGDYQKKDP 226
            + AQ V DRN+A++ A E       K   ++++ + K GE      V +  K   Q+++ 
Sbjct: 919  EMAQRVEDRNLAMRAAREPNGPKSTKMLSTANRGEWKIGENFQTRAVAVGEKTMSQRREI 978

Query: 227  PIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDE 286
            PIKRL + E +AR +KG  F+C EK+SP HRC    K+EL +L+++ ED+ E ++  +D 
Sbjct: 979  PIKRLTESELQARREKGLWFKCEEKFSPGHRC----KKELRVLLVH-EDEEEDDNQFDDR 1038

Query: 287  AGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVL 346
            A E  E   +EL +   + L  +  +T+ GTMK+KG +  KEV+IL +SGAT+NF+S  L
Sbjct: 1039 ATE--EPALIELKDAVELSLNSVVGLTTPGTMKIKGTIGSKEVIILVDSGATHNFLSLEL 1098

Query: 347  VDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGI 406
            V +L L + + T +GV +  G   +G+GIC+ V + ++ LT+V DFL  ELG  D++LG+
Sbjct: 1099 VQQLTLPLTTTTSYGVMMGTGISVKGKGICRGVCISMQGLTVVEDFLPLELGNTDVILGM 1158

Query: 407  QWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECS 442
             WL + G +KV+W  LTM        ++LKGDPSL++ E S
Sbjct: 1159 PWLGTLGDVKVNWKMLTMKIKMGKAVMVLKGDPSLSRTETS 1188

BLAST of CSPI03G21030 vs. TrEMBL
Match: E5GCI2_CUCME (Retrotransposon protein (Fragment) OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 3.5e-52
Identity = 144/322 (44.72%), Postives = 178/322 (55.28%), Query Frame = 1

Query: 1   MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLK-GKLEDTETTAESGGS 60
           M+ KE+  EMK++M  +   +  + G +     G T G   + K  KLE      E    
Sbjct: 12  MNEKEIR-EMKEMMLNLLKSMESDGGRETETTPGFTVGSSDRSKYKKLEIPVFNGE---- 71

Query: 61  NAGRRVNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQE-------------------- 120
                 NPE+W YRAEH+FDIN L + EKVKVAVVSFG +                    
Sbjct: 72  ------NPETWIYRAEHYFDINELVDEEKVKVAVVSFGPDEVNWFRWSNNRKKVKTWEDL 131

Query: 121 ------------EQSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLE 180
                       E SLGAR+I IKQ G YSDY+KKF+ YSAPLP+MAESVL+DAF TGLE
Sbjct: 132 KRRMFEHFKSPGEGSLGARLIRIKQDGCYSDYLKKFLEYSAPLPEMAESVLIDAFVTGLE 191

Query: 181 PALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEKV 240
             LQAEV SRHP TLEEC  K                      PK ++ + K      ++
Sbjct: 192 TNLQAEVKSRHPVTLEECSGK----------------------PKWADFAMK------QL 251

Query: 241 TIPLKGDYQKKD--PPIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLIL 288
           T+P+KG++ KK+  PP+KRL D EFRARLDKG CFRCN+KYSP HRCK +  RELM  I 
Sbjct: 252 TLPIKGNFVKKEPQPPVKRLSDSEFRARLDKGLCFRCNDKYSPGHRCKAKTNRELMFFIT 294

BLAST of CSPI03G21030 vs. TrEMBL
Match: A0A087H2U0_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA4G124900 PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 1.9e-45
Identity = 137/461 (29.72%), Postives = 215/461 (46.64%), Query Frame = 1

Query: 66  NPESWAYRAEHFFDINNLSEAEK--------------------------------VKVAV 125
           N ESW  R E +F+I  L E +K                                 +V  
Sbjct: 123 NAESWVSRVEQYFEIEELVEYQKLNAVRACFIDKALDWYRWERDRNPFRSWKDMRSRVVA 182

Query: 126 VSFGQEEQSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAE 185
                     G R++ +KQ+GS +DY ++F+  +   P++ E +L   F  GL+P +++ 
Sbjct: 183 TYASHNNTCAGKRLLVLKQEGSVADYCREFIGLATNAPEVPEFILEWTFMNGLKPQIRSR 242

Query: 186 VISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSESSSKLQGKGEKVTIPLKG 245
           V++  PQTL+  M  A++V+D +     + E++  V    S+     QG G    + LK 
Sbjct: 243 VLTFAPQTLDTMMSVAKMVDDWSDENWRSPEKIS-VSAGLSDRPGYNQGSGLSTGLGLKN 302

Query: 246 ------------------------------DYQKKDPPIKRLLDMEFRARLDKGFCFRCN 305
                                         ++ +  PP KRL   E   R   G CFRC+
Sbjct: 303 GTGPSWSKPNSQLNPADHTTAFRPGDKTNPNHIRPKPPTKRLSPTEMAQRKAAGLCFRCD 362

Query: 306 EKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLI 365
           EK+  RH C    KRELM+LI   +  +    N E+E  +  +L   EL E   + L  +
Sbjct: 363 EKWHIRHSCA---KRELMILIAQPDGTDIVWDNGEEEFSDATDLPITELAE---LSLNSV 422

Query: 366 TRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTR 425
             ++S  TMKL G +   +VV++ +SGA++NFIS  LV++  L+  +   +GV    G  
Sbjct: 423 VGISSPSTMKLTGSLGNTDVVVMIDSGASHNFISTRLVNQPALTPHTAGNYGVLTGAGIP 482

Query: 426 CEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTK 465
            +G GIC+ + + ++ L I ADFL   LG  D++LG+QWL S G MKV+W    M F   
Sbjct: 483 VKGEGICRELTLLVQGLRIRADFLPLALGSADVILGMQWLASLGEMKVNWGLQWMRFTVD 542

BLAST of CSPI03G21030 vs. TrEMBL
Match: A0A087GW89_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G106200 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 7.0e-45
Identity = 124/395 (31.39%), Postives = 197/395 (49.87%), Query Frame = 1

Query: 99  EEQSLGARVICIKQKGSYSDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVISRH 158
           E++S G R+  +KQ G+  DY + F+  +A   D+ ES L  AF  G +P ++A + S  
Sbjct: 202 EDKSAGERLFTLKQVGTVKDYCRDFIFLAAQALDLQESALELAFMIGRKPTIRARMKSFD 261

Query: 159 PQTLEECMKKAQLVND--------------------RNIALKLAMEEMDKVEPKRSESSS 218
           P  LE+ M  A+ V D                    R   +K         +   +   +
Sbjct: 262 PHHLEKMMSVAKTVADWDLTEEEGPVSNHRGGERGSRGNQIKSGGPNQYSGQKHGTRPQN 321

Query: 219 KLQGKGEKVTIPL--------KGDYQKKDPPIKRLLDMEFRARLDKGFCFRCNEKYSPRH 278
           K    G   T             ++ +  PP +RL   E      +G CFRC+EK+  RH
Sbjct: 322 KKPNNGSSTTTTSFCGGDTKNPNNHNRVKPPFRRLTQAEMAQGRAEGLCFRCDEKWYERH 381

Query: 279 RCKGREKRELMLLILNEEDDN-----ERESNTEDEAGEIIELNQLELNEDTPIELRLITR 338
           RC    +REL ++I+ EE  +     E E+++++E   + E+  L LN         +  
Sbjct: 382 RCP---RRELSVVIVQEEGPDKEWVEEDETDSDEEGVTVAEMATLSLNS--------LVG 441

Query: 339 VTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTRCE 398
           ++S  TMKLK  + G EVV++ +SGA++NFIS+ LV +L +  +    +GV +   T   
Sbjct: 442 ISSPRTMKLKAKMLGTEVVVMIDSGASHNFISEPLVKKLSMKTEESHCYGVMMGTRTEVV 501

Query: 399 GRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGR 458
           GRGIC+ V + ++++T+V DFL  ELG  D++LGIQWL++ G MKV+W      F   G+
Sbjct: 502 GRGICREVNLVMQDITVVTDFLPIELGGADVILGIQWLETLGEMKVNWKLQRAKFRVNGQ 561

Query: 459 RIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEF 461
           ++ ++ DP L     +LK L K      QG ++EF
Sbjct: 562 KVTIQRDPELVCAPITLKALWKAIGDESQGVIVEF 585

BLAST of CSPI03G21030 vs. TAIR10
Match: AT3G29750.1 (AT3G29750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 59.7 bits (143), Expect = 5.6e-09
Identity = 43/144 (29.86%), Postives = 73/144 (50.69%), Query Frame = 1

Query: 282 GEIIELNQLELNEDT---PIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQ 341
           G I EL +LE +  T    +E  +I    +KG M+  G +   +VV+  +SGAT+NFI  
Sbjct: 92  GVINELEELEQDSYTLRQGMEQLVIDLTRNKG-MRFYGFILDHKVVVAIDSGATDNFILV 151

Query: 342 VLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGR--VDL 401
            L   L+L      +  V +      +  G C  +++ ++E+ I  +FL  +L +  VD+
Sbjct: 152 ELAFSLKLPTSITNQASVLLGQRQCIQSVGTCLGIRLWVQEVEITENFLLLDLAKTDVDV 211

Query: 402 VLGIQWLDSTGTMKVHWPSLTMTF 421
           +LG +WL   G   V+W +   +F
Sbjct: 212 ILGYEWLSKLGETMVNWQNQDFSF 234

BLAST of CSPI03G21030 vs. NCBI nr
Match: gi|659094491|ref|XP_008448087.1| (PREDICTED: uncharacterized protein LOC103490375 [Cucumis melo])

HSP 1 Score: 508.1 bits (1307), Expect = 1.7e-140
Identity = 269/482 (55.81%), Postives = 342/482 (70.95%), Query Frame = 1

Query: 41  LKLKGKLEDTETTAESGGSNAGR------------RVNPESWAYRAEHFFDINNLSEAEK 100
           +KLKGK+++TE   E  G+   R              NPESW YRAEHFF+INNL E+EK
Sbjct: 1   MKLKGKMDETEPITEINGTVTDRSKYKKLEMPMFLEENPESWVYRAEHFFEINNLPESEK 60

Query: 101 VKVAVVSFGQEE----------------QSLGARVI----------------CIKQKGSY 160
           VKVAVV FGQ+E                + L +R+                  I+Q GSY
Sbjct: 61  VKVAVVIFGQDEVDWYRWSHNKKKVESWEDLKSRMFEFFRDSGQKSLGARLIRIQQDGSY 120

Query: 161 SDYVKKFVTYSAPLPDMAESVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRN 220
           ++YVKKFV YSAPLP MA SVL+DAF TGLEP+LQAEVISRHPQTLE+CM++AQLVNDRN
Sbjct: 121 NEYVKKFVIYSAPLPYMAGSVLVDAFVTGLEPSLQAEVISRHPQTLEDCMREAQLVNDRN 180

Query: 221 IALKLAMEEMDKVEPKRSESSS-------------KLQGKGEKVTIPLKGDYQKKDPPIK 280
           +ALKL+  E+   E +   SS              K   + +++TIP+KG+++K +PP+K
Sbjct: 181 LALKLSKMELGMTEWEEGGSSKVKKLGDVDKPPPRKTDFQMKQITIPIKGNFKKGEPPVK 240

Query: 281 RLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGE 340
           RL D EFRARLD+G CFRCN+KYSP HRCK +EKRELM  I+NEE+++E   + E+    
Sbjct: 241 RLSDAEFRARLDRGLCFRCNDKYSPGHRCKTKEKRELMFFIMNEEEEDEEGESHEEVTEG 300

Query: 341 IIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDE 400
            +EL  LEL ED  IE++ +TR++SKGTMK+KG +  KE+VIL +SGAT+NFI Q LV +
Sbjct: 301 TVELKTLELTEDVAIEMKTMTRLSSKGTMKIKGWIRQKEIVILIDSGATHNFIHQSLVVD 360

Query: 401 LQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWL 460
           L+L ++  T+FG TI NGTRC+G+GIC+RV+VKL+E+TI+ADFLA ELG VD VL +QWL
Sbjct: 361 LKLGMEQHTQFGYTIGNGTRCKGKGICRRVEVKLEEITIIADFLAVELGSVDAVLEMQWL 420

Query: 461 DSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKTLEKTWQSGDQGFLLEFQNY 466
           D+TGTMK+HWPSLTM+FW  GR+IILKGDPSL + ECSL+TLEKTWQ  DQGFLLE+ N 
Sbjct: 421 DTTGTMKIHWPSLTMSFWNGGRQIILKGDPSLIRAECSLRTLEKTWQEDDQGFLLEWANM 480

BLAST of CSPI03G21030 vs. NCBI nr
Match: gi|778697580|ref|XP_011654353.1| (PREDICTED: uncharacterized protein LOC105435354 [Cucumis sativus])

HSP 1 Score: 505.8 bits (1301), Expect = 8.3e-140
Identity = 278/522 (53.26%), Postives = 356/522 (68.20%), Query Frame = 1

Query: 1   MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSN 60
           M LKEM+LEMKK M+RM + LRENH  K+REE+GT++G ++KLKGK E+T+   E G   
Sbjct: 18  MVLKEMLLEMKKAMERMAEVLRENHSYKRREESGTSDGSVMKLKGKAEETDVHNE-GNLT 77

Query: 61  AGRR-------------VNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEEQSLGARV 120
            G R              NPESW YRAEHFF+INNL E EKVKVAVVSFGQ+E     R 
Sbjct: 78  MGDRSKYKKLEMPMFLGENPESWVYRAEHFFEINNLPETEKVKVAVVSFGQDEVDWYRRS 137

Query: 121 ICIKQKGSYSDYVKKF------------------VTYSAPLPDMAE-------------- 180
              K+  S+ D  ++                   +       D  +              
Sbjct: 138 HNRKKVESWEDLKERMFDFFKDTGQKSLVARLIRIEQDGSYNDYVKKFVNYSAPLPHMTE 197

Query: 181 SVLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSE 240
           SVL DAF TGLEP LQAEV+S +P TLEECM++AQLVNDRN+AL+ +  E   +  K+ E
Sbjct: 198 SVLRDAFLTGLEPNLQAEVVSHNPLTLEECMREAQLVNDRNLALQWSKAEGGGLNYKKGE 257

Query: 241 SSSKLQGKG-------------EKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRC 300
            S+    +G             ++VTIP+KG+YQK +PP+KRLLD+EF+ARLDKG CF+C
Sbjct: 258 GSTNKGPEGGEKGITRKTEFPLKQVTIPIKGNYQKSEPPVKRLLDVEFKARLDKGLCFKC 317

Query: 301 NEKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRL 360
           NE+YSP HRCK ++KRELML I+NEE+  E E  TE+   E++ELNQL L E T IEL+ 
Sbjct: 318 NERYSPGHRCKMKDKRELMLFIMNEEESLEDEDRTEETNEEVLELNQLTLEEGTEIELKA 377

Query: 361 ITRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGT 420
           I  +TSKGTMK+KG + GKEV+IL +SGAT+NFI   +V+E+ L +++ T FGVTI +GT
Sbjct: 378 IHGLTSKGTMKIKGEIKGKEVLILIDSGATHNFIHNKIVEEVGLELENHTPFGVTIGDGT 437

Query: 421 RCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWT 465
           RC+GRG+C R+++KLKE+TIVADFLA ELG VD++LG+QWL++TGTMK+HWPSLTMTF  
Sbjct: 438 RCQGRGVCNRLELKLKEITIVADFLAIELGSVDVILGMQWLNTTGTMKIHWPSLTMTFRM 497

BLAST of CSPI03G21030 vs. NCBI nr
Match: gi|659093726|ref|XP_008447685.1| (PREDICTED: uncharacterized protein LOC103490096 [Cucumis melo])

HSP 1 Score: 455.7 bits (1171), Expect = 9.9e-125
Identity = 258/522 (49.43%), Postives = 340/522 (65.13%), Query Frame = 1

Query: 1   MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSN 60
           +SLK+MMLEMKK MDR+ DELREN   KK+EE+ T++G ++K+KGK+E+T+ T E   + 
Sbjct: 18  LSLKKMMLEMKKNMDRLADELRENQSYKKKEESDTSDGSIMKMKGKMEETDITTEVNTNL 77

Query: 61  AGRR------------VNPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEE-------- 120
             R              NPES  YRAEHFF+INNL EAEKVKVAVVSFGQ++        
Sbjct: 78  VDRSKYKKLEMPMFFGENPESCVYRAEHFFEINNLPEAEKVKVAVVSFGQDKVDWYRWSH 137

Query: 121 --------QSLGARV----------------ICIKQKGSYSDYVKKFVTYSAPLPDMAES 180
                   + L  R+                I I+Q+GSY+DYVKKFV YSAPLP M ES
Sbjct: 138 NRKKVESWEDLKTRMFEFFRDTGQKSLGARLIRIQQEGSYNDYVKKFVNYSAPLPHMVES 197

Query: 181 VLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSES 240
           VL D F TGLEP LQ EV+S HPQTLEECM  AQLVNDRN+ALKLA  EM  +EPKRSE+
Sbjct: 198 VLRDTFLTGLEPTLQVEVMSCHPQTLEECMMAAQLVNDRNLALKLAQAEMGIMEPKRSEA 257

Query: 241 SS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCN 300
           ++             K + + +++TIPLK  Y K +PP+KRL D EFRARLDKG CFRCN
Sbjct: 258 TNTKVPWNNDKGSMRKNEFQMKQITIPLKRSYHKGEPPVKRLSDAEFRARLDKGLCFRCN 317

Query: 301 EKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLI 360
           EKYS  H CK +EKR+LML ILNEE+  E    ++ +  E +E+NQLE+ E+  IE R I
Sbjct: 318 EKYSHEHHCKIKEKRDLMLFILNEEESTEEGEGSDTQKTEPLEINQLEVLEEAVIEYRAI 377

Query: 361 TRVTSKGTMKLKGHVNGKEVVILTNSGATNNFIS-QVLVDELQLSIDSGTRFGVTIRNGT 420
           T +T+KGTMKL+G V GKE+ +L NSG T+NFI  + +  ++++ ++     G+ +    
Sbjct: 378 TSLTTKGTMKLRGVVKGKEIFVLINSGETHNFIHWKGICSQVEIQLE-----GLKV---- 437

Query: 421 RCEGRGICKRVKVKLKELTIVADFLAGELGRVDLVLGIQWLDSTGTMKVHWPSLTMTFWT 465
                 +   + V+L ++ +V               G++WLD+TGTMK+HWPSLTM FW 
Sbjct: 438 ------VTDLLVVELGKVNVVL--------------GMKWLDTTGTMKIHWPSLTMVFWK 497

BLAST of CSPI03G21030 vs. NCBI nr
Match: gi|659112485|ref|XP_008456244.1| (PREDICTED: uncharacterized protein LOC103496243 [Cucumis melo])

HSP 1 Score: 438.7 bits (1127), Expect = 1.2e-119
Identity = 247/469 (52.67%), Postives = 320/469 (68.23%), Query Frame = 1

Query: 1   MSLKEMMLEMKKLMDRMTDELRENHGTKKREEAGTTEGPMLKLKGKLEDTETTAESGGSN 60
           + LKEMM EMKK M+R+ +E+RE+   KK+EE+GT +G ++KLKGK+E+ + TAE   + 
Sbjct: 18  LGLKEMMREMKKTMERLAEEMRESQYYKKKEESGTFDGFVMKLKGKMEELDVTAEVNTNT 77

Query: 61  AGRRV------------NPESWAYRAEHFFDINNLSEAEKVKVAVVSFGQEE-------- 120
             R              NPESW YR EHFF+INNLSEAEKVKV VVSFGQ+E        
Sbjct: 78  VDRSKYKKLEMPMFLGENPESWVYRVEHFFEINNLSEAEKVKVVVVSFGQDEVDWYRWSH 137

Query: 121 --------QSLGARV----------------ICIKQKGSYSDYVKKFVTYSAPLPDMAES 180
                   + L  R+                I I+Q GSY++YVKKFV YSAPLP MAES
Sbjct: 138 NPKKVESWEDLKTRMFEFFRDTGQKSLGARLIWIQQDGSYNEYVKKFVNYSAPLPYMAES 197

Query: 181 VLLDAFFTGLEPALQAEVISRHPQTLEECMKKAQLVNDRNIALKLAMEEMDKVEPKRSES 240
           VL DAF TGLEP LQAEV+SRHPQTLEECM +A LVND N+ALKL+  E+   + K  E 
Sbjct: 198 VLRDAFLTGLEPTLQAEVVSRHPQTLEECMMEALLVNDCNLALKLSRAELGIHKYKGGEP 257

Query: 241 SS-------------KLQGKGEKVTIPLKGDYQKKDPPIKRLLDMEFRARLDKGFCFRCN 300
           ++             K + + +++TIPLKG YQK DP +KRL D EFR+RL++G CFRCN
Sbjct: 258 ANTKAPVSNEKGNPRKNEFQMKQITIPLKGSYQKGDPLVKRLSDAEFRSRLERGLCFRCN 317

Query: 301 EKYSPRHRCKGREKRELMLLILNEEDDNERESNTEDEAGEIIELNQLELNEDTPIELRLI 360
           EKYS  H CK +EKRELML ILNEE+  +   N+E +  +I+EL QL+  E+  IE R I
Sbjct: 318 EKYSHGHHCKVKEKRELMLFILNEEESADEGENSETQREKIMELKQLDTLEEAVIEYRTI 377

Query: 361 TRVTSKGTMKLKGHVNGKEVVILTNSGATNNFISQVLVDELQLSIDSGTRFGVTIRNGTR 413
           T +T+KGTMKL+G V GK +++L +SGAT+NFI   LV E ++ ++S T+F VTI +GT 
Sbjct: 378 TSLTTKGTMKLQGEVKGKAIIVLIDSGATHNFIHYELVKEKRIPMESDTQFRVTIGDGTS 437

BLAST of CSPI03G21030 vs. NCBI nr
Match: gi|659077522|ref|XP_008439250.1| (PREDICTED: uncharacterized protein LOC103484090 [Cucumis melo])

HSP 1 Score: 374.4 bits (960), Expect = 2.9e-100
Identity = 189/313 (60.38%), Postives = 239/313 (76.36%), Query Frame = 1

Query: 166 MKKAQLVNDRNIALKLAMEEMDKVEPKRSESSS-KLQGKGEK------------VTIPLK 225
           MK+AQLVNDRN+ALKL+  E+   E +   SS  K  G  +K            +TIP+K
Sbjct: 1   MKEAQLVNDRNLALKLSKMELGMTEWEEGGSSKVKKLGDADKPPPRKTDFQMKQITIPIK 60

Query: 226 GDYQKKDPPIKRLLDMEFRARLDKGFCFRCNEKYSPRHRCKGREKRELMLLILNEEDDNE 285
           G+++K +PP+KRL D EFRAR+D+G CFRCN+KYSP HRCK +EKRELM  I+NEE++NE
Sbjct: 61  GNFKKGEPPMKRLSDAEFRARIDRGLCFRCNDKYSPGHRCKTKEKRELMFFIMNEEEENE 120

Query: 286 RESNTEDEAGEIIELNQLELNEDTPIELRLITRVTSKGTMKLKGHVNGKEVVILTNSGAT 345
              + E+     +EL  LEL ED  IEL+ +TR +SKGTMKLKG +  KE+VIL +SGAT
Sbjct: 121 EGDSHEEVTEGTVELKTLELTEDIAIELKTMTRFSSKGTMKLKGWIRQKEIVILIDSGAT 180

Query: 346 NNFISQVLVDELQLSIDSGTRFGVTIRNGTRCEGRGICKRVKVKLKELTIVADFLAGELG 405
           +NFI Q L  +L+L ++  T+FG TI NGTRC+G+GIC+RV+VKL+E+TI+ADFLA ELG
Sbjct: 181 HNFIHQSLAVDLKLGLEQHTQFGYTIGNGTRCKGKGICRRVEVKLEEITIIADFLAVELG 240

Query: 406 RVDLVLGIQWLDSTGTMKVHWPSLTMTFWTKGRRIILKGDPSLTKLECSLKTLEKTWQSG 465
            VD VLG+QWLD+TGTMK+HWPSLTM+FW +GR+IILKGDPSL K ECSL+TLEKTWQ  
Sbjct: 241 SVDAVLGMQWLDTTGTMKIHWPSLTMSFWNEGRQIILKGDPSLIKAECSLRTLEKTWQED 300

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
E5GC18_CUCME6.8e-5633.12Ty3-gypsy retrotransposon protein OS=Cucumis melo subsp. melo PE=4 SV=1[more]
A5B2I6_VITVI1.4e-5334.91Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043911 PE=4 SV=1[more]
E5GCI2_CUCME3.5e-5244.72Retrotransposon protein (Fragment) OS=Cucumis melo subsp. melo PE=4 SV=1[more]
A0A087H2U0_ARAAL1.9e-4529.72Uncharacterized protein OS=Arabis alpina GN=AALP_AA4G124900 PE=4 SV=1[more]
A0A087GW89_ARAAL7.0e-4531.39Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G106200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G29750.15.6e-0929.86 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659094491|ref|XP_008448087.1|1.7e-14055.81PREDICTED: uncharacterized protein LOC103490375 [Cucumis melo][more]
gi|778697580|ref|XP_011654353.1|8.3e-14053.26PREDICTED: uncharacterized protein LOC105435354 [Cucumis sativus][more]
gi|659093726|ref|XP_008447685.1|9.9e-12549.43PREDICTED: uncharacterized protein LOC103490096 [Cucumis melo][more]
gi|659112485|ref|XP_008456244.1|1.2e-11952.67PREDICTED: uncharacterized protein LOC103496243 [Cucumis melo][more]
gi|659077522|ref|XP_008439250.1|2.9e-10060.38PREDICTED: uncharacterized protein LOC103484090 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G21030.1CSPI03G21030.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableunknownCoilCoilcoord: 3..23
scor
NoneNo IPR availablePFAMPF13975gag-asp_proteascoord: 315..403
score: 1.

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None