CSPI02G15830 (gene) Wild cucumber (PI 183967)

NameCSPI02G15830
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionLate embryogenesis abundant hydroxyproline-rich glycoprotein, putative
LocationChr2 : 15298933 .. 15299964 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCCTCCTTCCCCTCGTCTCCCCCCATGGCGGAGCCGGAGCCGGAGCCGGTTCACCCTCCTCCTCCTCCGCCCGACTCTCTTTTCTTCTCCTCTGGTACCTACGTTGTCCAAATCCCTAAAGACCAAATCTACCGTATCCCTCCCCCCGAAAATGCCCTCATCGTCGAACGTCATCGTAACCCCTCCGTCGTCACCTCCTCTCGTCGCCGCTCATGTTGCTTTCGTATCTTCCTCCCTATCTTCGTCGTTCTTCTCCTCATCATCATCCTAGCCCTCCTCCTCCCTCCTCTTCTCACCCTTCCCAAACCCCCTGTCATTGAGCTCAAGAAATTCAAACTCACTCCGTCCACACGCAATTTCCTCATCAACCTTGATATTCTTAATCCTAACTCCGTCGGCTCCATCTCCTTCAAATCTCCCTCACGTGTCTCCCTCTCCTTCCGAAAAAACCAACTTGCCACAACTAAATTCCCTCTCATTCGTCAACAACATGGCTCCGAGAAAAAGGTTGCTTTGTCACTTCGCGCCAAATCGGCGTTCCCTAAGGAGTTGAAACGGCGGATGAAAAATAATAAGACGAAGCTCCACACTTCCTTGTCGTTAAAGATGAATCTTGCTGCCCAGACGATTGGACGACTGTCGAATCGTCGGAATGTCAAATTTGTTGTCACTTGTAGCTTCACTGTCAACACGTTGGGTAAAAATTCACGAATATTATCGCAAGATTGTGAAAGTGAACGCCAATGACACAATGCAGTTCAAGGTAACACATGTCGTCCAAAAAAGAGGAAACAAATTTGTTGGTATTTTGGCTTCAATTTCGATATCTCTCGTAAGCGATCCCATATGACCATATCATATCTCTCTTTCCTATCGTTTCGATGTAAGGGCAGTTTTTTTTCTTTATTTTTTTATTTTTTTTACCCGAATTTCCATCTCTGTTAATCAATGATTCTTAGGGTTTGTGTTTTTGTATTTTGTTTGTATGATTCTTCTTCTTCTTTTGGGCTTTTGAAAATTTGAAAAGAAAA

mRNA sequence

ATGGCGGAGCCGGAGCCGGAGCCGGTTCACCCTCCTCCTCCTCCGCCCGACTCTCTTTTCTTCTCCTCTGGTACCTACGTTGTCCAAATCCCTAAAGACCAAATCTACCGTATCCCTCCCCCCGAAAATGCCCTCATCGTCGAACGTCATCGTAACCCCTCCGTCGTCACCTCCTCTCGTCGCCGCTCATGTTGCTTTCGTATCTTCCTCCCTATCTTCGTCGTTCTTCTCCTCATCATCATCCTAGCCCTCCTCCTCCCTCCTCTTCTCACCCTTCCCAAACCCCCTGTCATTGAGCTCAAGAAATTCAAACTCACTCCGTCCACACGCAATTTCCTCATCAACCTTGATATTCTTAATCCTAACTCCGTCGGCTCCATCTCCTTCAAATCTCCCTCACGTGTCTCCCTCTCCTTCCGAAAAAACCAACTTGCCACAACTAAATTCCCTCTCATTCGTCAACAACATGGCTCCGAGAAAAAGGTTGCTTTGTCACTTCGCGCCAAATCGGCGTTCCCTAAGGAGTTGAAACGGCGGATGAAAAATAATAAGACGAAGCTCCACACTTCCTTGTCGTTAAAGATGAATCTTGCTGCCCAGACGATTGGACGACTGTCGAATCGTCGGAATGTCAAATTTGTTGTCACTTGTAGCTTCACTGTCAACACGTTGGGTAAAAATTCACGAATATTATCGCAAGATTGTGAAAGTGAACGCCAATGA

Coding sequence (CDS)

ATGGCGGAGCCGGAGCCGGAGCCGGTTCACCCTCCTCCTCCTCCGCCCGACTCTCTTTTCTTCTCCTCTGGTACCTACGTTGTCCAAATCCCTAAAGACCAAATCTACCGTATCCCTCCCCCCGAAAATGCCCTCATCGTCGAACGTCATCGTAACCCCTCCGTCGTCACCTCCTCTCGTCGCCGCTCATGTTGCTTTCGTATCTTCCTCCCTATCTTCGTCGTTCTTCTCCTCATCATCATCCTAGCCCTCCTCCTCCCTCCTCTTCTCACCCTTCCCAAACCCCCTGTCATTGAGCTCAAGAAATTCAAACTCACTCCGTCCACACGCAATTTCCTCATCAACCTTGATATTCTTAATCCTAACTCCGTCGGCTCCATCTCCTTCAAATCTCCCTCACGTGTCTCCCTCTCCTTCCGAAAAAACCAACTTGCCACAACTAAATTCCCTCTCATTCGTCAACAACATGGCTCCGAGAAAAAGGTTGCTTTGTCACTTCGCGCCAAATCGGCGTTCCCTAAGGAGTTGAAACGGCGGATGAAAAATAATAAGACGAAGCTCCACACTTCCTTGTCGTTAAAGATGAATCTTGCTGCCCAGACGATTGGACGACTGTCGAATCGTCGGAATGTCAAATTTGTTGTCACTTGTAGCTTCACTGTCAACACGTTGGGTAAAAATTCACGAATATTATCGCAAGATTGTGAAAGTGAACGCCAATGA
BLAST of CSPI02G15830 vs. TrEMBL
Match: A0A0A0LK43_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G307330 PE=4 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 1.5e-123
Identity = 239/240 (99.58%), Postives = 240/240 (100.00%), Query Frame = 1

Query: 1   MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR 60
           MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR
Sbjct: 1   MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR 60

Query: 61  RRSCCFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN 120
           RRSCCFRIFLPIFV+LLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN
Sbjct: 61  RRSCCFRIFLPIFVLLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN 120

Query: 121 PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM 180
           PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM
Sbjct: 121 PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM 180

Query: 181 KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ 240
           KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ
Sbjct: 181 KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ 240

BLAST of CSPI02G15830 vs. TrEMBL
Match: A0A059BRD6_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02438 PE=4 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 4.3e-30
Identity = 95/245 (38.78%), Postives = 148/245 (60.41%), Query Frame = 1

Query: 12  PPPPPDSL--FFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRR-----SC 71
           P P PDS    F+S TYVVQIPKDQIYR+PPPENALIV+RHR P   T  ++R     SC
Sbjct: 36  PVPNPDSPEHAFASDTYVVQIPKDQIYRVPPPENALIVKRHRKP---TEPKKRSFCCSSC 95

Query: 72  CFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRN--------FLINL 131
           C  +FL I V++L++ ILA++    L L K P   +  F +   T++        + + L
Sbjct: 96  CCWLFLAIIVIVLVVGILAIVSSVFLKL-KNPNFHVDHFVVKDLTKSHDKNTKLVYDVKL 155

Query: 132 DILNPNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLR-AKSAFPKE 191
            + NPN+  S ++K    VSL+F++  +A  KF    Q   + K V + L+ + +A PKE
Sbjct: 156 KVENPNTYSSFTYKQGGAVSLAFKQKAIAMGKFVAFDQDRKTSKAVDIVLKGSNTALPKE 215

Query: 192 LKRRMKNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDC 241
           +++ +++ KTK H + +L ++  A+    +    + +FV +C FTV+ LGK++R+LSQ C
Sbjct: 216 MQKSLRSKKTKNHLTFALHVDAPARRKIGIIKGSSSRFVASCQFTVDKLGKDARVLSQKC 275

BLAST of CSPI02G15830 vs. TrEMBL
Match: M5Y5Z5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa014613mg PE=4 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 2.0e-27
Identity = 99/255 (38.82%), Postives = 147/255 (57.65%), Query Frame = 1

Query: 6   PEPVHPPPPPPDS------------LFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNP 65
           PE + PP PPP+S              F SGTY+VQ+PKDQIYR+PPPE+A IVERHR+ 
Sbjct: 22  PEFIPPPLPPPNSQQLILSNSNGSTATFRSGTYIVQVPKDQIYRMPPPEHATIVERHRD- 81

Query: 66  SVVTSSRRRSCCFRIFLPIFVVLLLIIILALLLPPLLTLPKPPV-IELKKFKLTPSTRNF 125
           S V       CC  I    F+VLL+I ++A++L  L     P   +E    K      ++
Sbjct: 82  SGVNKKSCSYCCLGII--AFIVLLIITLVAVILTMLAKSGDPKFSVERVVVKGKSGRPDY 141

Query: 126 LINLDILNPNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKS-A 185
            + L+  NPNS  +I +K     SL F++ ++A  K+P + Q  G  K+VAL L   +  
Sbjct: 142 DLTLEARNPNSRVAIVYKDGGGASLYFKQKKIANGKYPSLYQGSGKSKEVALVLHGSNMK 201

Query: 186 FPKELKRRMKNNKTKLH-----TSLSLKMNLAAQ-TIGRLSNRRNVKFVVTCSFTVNTLG 241
            PKE+++ +K++ +  +      SLSL M++ A+  IG L N R+ KF VTC  TV+TL 
Sbjct: 202 LPKEIEKSLKSSYSSTYKKGHRVSLSLNMDIPARMRIGTL-NSRSRKFHVTCDITVDTLA 261

BLAST of CSPI02G15830 vs. TrEMBL
Match: K7KJB0_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_04G103700 PE=4 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 8.4e-26
Identity = 86/247 (34.82%), Postives = 136/247 (55.06%), Query Frame = 1

Query: 8   PVHPPPPPP----------DSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVT 67
           P  PPPPPP          +   F+ GTYVVQ+PKDQ+YR+PPPENA I E H+      
Sbjct: 8   PSPPPPPPPLEKKHSTNKLELPDFNPGTYVVQVPKDQVYRVPPPENARIAESHKKAPPKA 67

Query: 68  SSRRRSCCFRIFLPIFVVLLLIIILALL--LPPLLTLPKPPVIELKKFKL--TPSTRNFL 127
           +   R C F +   I   +LLI++ A+L  L  +L  PK P   + +FK+  T     + 
Sbjct: 68  AKTSRCCLFCVLFFIIFFVLLILLGAVLGGLFSMLLTPKDPQFSITRFKVVETKPHPKYD 127

Query: 128 INLDILNPNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSL-RAKSAF 187
           + L++ N NS   +S+K+   VSLS R+ ++A+  +P   Q         ++L  +K   
Sbjct: 128 VTLEVHNLNSDVGVSYKNKGHVSLSLRRQEVASGAYPSFNQDAHDRTTFGVTLTSSKVGL 187

Query: 188 PKELKRRMKNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILS 240
           PKE++  + N+K K++ + SL ++  A+    L     +KF VTC+  ++TL K +++LS
Sbjct: 188 PKEVEESVTNDKKKVNVTFSLAIHALARMKMGLLRSGTMKFDVTCNVKLDTLAKTTQVLS 247

BLAST of CSPI02G15830 vs. TrEMBL
Match: B9H747_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s16920g PE=4 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 8.4e-26
Identity = 89/246 (36.18%), Postives = 136/246 (55.28%), Query Frame = 1

Query: 9   VHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRRSCC-FR 68
           +H P    +  FF  GTYV+QIP+DQIYR+PPP NA + +R RNP      + +SCC   
Sbjct: 17  IHDPVATSNQPFFEPGTYVIQIPRDQIYRVPPPGNASVAQRQRNPH---QKKHKSCCGCS 76

Query: 69  IFLPIFVVLLLIIILALLLPPL---LTLPKPPVIELKKF--------KLTPSTRNFLINL 128
            F   F+ +   I +A+ +  L   L  PK P  ++++F        K   S  N+ I L
Sbjct: 77  CFWCCFIAIASGIAVAVTIVGLSFILLKPKDPEFQVQRFVVKNPQVSKHKYSYTNYDIRL 136

Query: 129 DILNPNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKS-AFPKE 188
           ++ N N   SI ++    VSLSFR+  +AT KFP   Q H +   + + L+      PK+
Sbjct: 137 NVHNSNRRSSILYQQGGAVSLSFRQQNVATGKFPTFHQGHKNSTDIGIVLKGTGVGLPKD 196

Query: 189 LKRRMKNNKTKLHTSLSLKMNLAA--QTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQ 240
           ++  ++N K+K+  S SLKMN+    +T G  + R  +  VVTC FTV +L +++ ILSQ
Sbjct: 197 VQNSLRNRKSKVPDSFSLKMNVPVKMKTSGFKTGRAEI--VVTCDFTVQSLAQDTHILSQ 256

BLAST of CSPI02G15830 vs. TAIR10
Match: AT2G22180.1 (AT2G22180.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 74.3 bits (181), Expect = 1.1e-13
Identity = 77/242 (31.82%), Postives = 116/242 (47.93%), Query Frame = 1

Query: 4   PEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVE-RHRNPSVVTSSRRR 63
           P P    PP PPPDS+     TYVVQ+P+DQ+Y  PPPE+A  VE R +NP     ++++
Sbjct: 59  PLPTTSSPPLPPPDSIP-ELETYVVQVPRDQVYWTPPPEHAKYVEKRSKNPE---KNKKK 118

Query: 64  SCCFRI--FLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN 123
            C  R+  F  I V+   ++   +L+      P  PV  +++  + PS  NF + L   N
Sbjct: 119 GCSKRLLWFFIILVIFGFLLGAIILILHFAFNPTLPVFAVERLTVNPS--NFEVTLRAEN 178

Query: 124 PNSVGSISFKSPSR--VSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKR 183
           P S   + +       VSL+++   L + KFP + Q      KV + L   S     ++ 
Sbjct: 179 PTSNMGVRYMMEKNGVVSLTYKNKSLGSGKFPGLSQAASGSDKVNVKLNG-STKNAVVQP 238

Query: 184 RMKNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTL--GKNSRILSQDCE 239
           R       L  ++ LK    A  +     +RN + VVTC   V  L   K   I+S++CE
Sbjct: 239 RGSKQPVVLMLNMELKAEYEAGPV-----KRNKEVVVTCDVKVKGLLDAKKVEIVSENCE 288

BLAST of CSPI02G15830 vs. TAIR10
Match: AT5G21130.1 (AT5G21130.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 56.2 bits (134), Expect = 3.2e-08
Identity = 63/242 (26.03%), Postives = 105/242 (43.39%), Query Frame = 1

Query: 14  PPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRRSCCFRIFLPIF 73
           PPP       GTYV+++PKDQIYR+PPPENA     HR   +      +SCC R      
Sbjct: 53  PPP-------GTYVIKLPKDQIYRVPPPENA-----HRYEYLSRRKTNKSCCRRCLCYSL 112

Query: 74  VVLLLIIILALLLPPLLTL---PKPPVIELKKFKLT------PSTRNFLINLDILNPNSV 133
             LL+II+LA +      L   P  P   +    +T       S  + +I + + + N  
Sbjct: 113 SALLIIIVLAAIAFGFFYLVYQPHKPQFSVSGVSVTGINLTSSSPFSPVIRIKLRSQNVK 172

Query: 134 GSIS--FKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVAL-----SLRAKSAFPKELK 193
           G +   ++  +   + F   +L   +F   +Q  G+   +       S++ KS+  KEL 
Sbjct: 173 GKLGLIYEKGNEADVFFNGTKLGNGEFTAFKQPAGNVTVIVTVLKGSSVKLKSSSRKELT 232

Query: 194 RRMKNNKTK--LHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDC 238
              K  K    L     +K  + + T   ++        V C  TV+ L  ++ + +++C
Sbjct: 233 ESQKKGKVPFGLRIKAPVKFKVGSVTTWTMT------ITVDCKITVDKLTASATVKTENC 276

BLAST of CSPI02G15830 vs. TAIR10
Match: AT4G39745.1 (AT4G39745.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 52.0 bits (123), Expect = 6.0e-07
Identity = 71/240 (29.58%), Postives = 93/240 (38.75%), Query Frame = 1

Query: 4   PEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRRS 63
           P   P  PPPP P        TYVV +P+DQ+Y IPPP+NA             SS+   
Sbjct: 64  PHVSPSSPPPPDPIP---EIETYVVHVPRDQVYWIPPPDNA------------GSSKDAG 123

Query: 64  CCFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILNPNS 123
              R                      +  P+PPV  +KK +    +R+F I L   NP S
Sbjct: 124 IAIR--------------------GEIIKPEPPVFNVKKLE---KSRHFEIMLTSKNPTS 183

Query: 124 VGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRMKNN 183
              +++K    VSL+++   L    FP             LSL    +    LK     N
Sbjct: 184 TMWVTYK--GLVSLTYKNKNLGQGNFP------------ELSLAVSGSHTVNLKLDRSMN 243

Query: 184 KTKLH---TSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTL--GKNSRILSQDCESE 239
              L     SL L M L A   G    +R  +  VTC   VN L       I+S+ CESE
Sbjct: 244 AAVLPPEVVSLVLTMGLDA-GFGTGLVKRAKEVAVTCDIKVNGLLDAHKVEIVSESCESE 250

BLAST of CSPI02G15830 vs. TAIR10
Match: AT2G27080.1 (AT2G27080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 50.1 bits (118), Expect = 2.3e-06
Identity = 65/249 (26.10%), Postives = 107/249 (42.97%), Query Frame = 1

Query: 8   PVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRRS---C 67
           P  P PPP         TYV+Q+PKDQIYRIPPPENA     HR   +      RS   C
Sbjct: 27  PKKPAPPP--------STYVIQVPKDQIYRIPPPENA-----HRFEQLSRKKTNRSNCRC 86

Query: 68  CFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILNPNSV 127
           CF  FL    +L+++  ++  +  L+  P+ P   ++ F ++       INL+  +P S 
Sbjct: 87  CFCSFLAAVFILIVLAGISFAVLYLIYRPEAPKYSIEGFSVSG------INLNSTSPISP 146

Query: 128 G-SISFKSPS-------------RVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLR-AK 187
             +++ +S +              V + +    ++    P+  Q   +   V L L  +K
Sbjct: 147 SFNVTVRSRNGNGKIGVYYEKESSVDVYYNDVDISNGVMPVFYQPAKNVTVVKLVLSGSK 206

Query: 188 SAFPKELKRRMKNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSR 239
                 +++ M+N  +K      LK+    +          +   V C  TV+ L   SR
Sbjct: 207 IQLTSGMRKEMRNEVSKKTVPFKLKIKAPVKIKFGSVKTWTMIVNVDCDVTVDKLTAPSR 256

BLAST of CSPI02G15830 vs. NCBI nr
Match: gi|449465892|ref|XP_004150661.1| (PREDICTED: uncharacterized protein LOC101202802 [Cucumis sativus])

HSP 1 Score: 450.3 bits (1157), Expect = 2.1e-123
Identity = 239/240 (99.58%), Postives = 240/240 (100.00%), Query Frame = 1

Query: 1   MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR 60
           MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR
Sbjct: 1   MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR 60

Query: 61  RRSCCFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN 120
           RRSCCFRIFLPIFV+LLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN
Sbjct: 61  RRSCCFRIFLPIFVLLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN 120

Query: 121 PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM 180
           PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM
Sbjct: 121 PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM 180

Query: 181 KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ 240
           KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ
Sbjct: 181 KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ 240

BLAST of CSPI02G15830 vs. NCBI nr
Match: gi|659089589|ref|XP_008445591.1| (PREDICTED: uncharacterized protein LOC103488571 [Cucumis melo])

HSP 1 Score: 403.7 bits (1036), Expect = 2.3e-109
Identity = 215/240 (89.58%), Postives = 226/240 (94.17%), Query Frame = 1

Query: 1   MAEPEPEPVHPPPPPPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSR 60
           MAE E EPVHPPPPPPD  FFSSGTYVVQIPKDQIYRIPPPENALIV+RHRNPSVVTSSR
Sbjct: 1   MAELELEPVHPPPPPPDPPFFSSGTYVVQIPKDQIYRIPPPENALIVQRHRNPSVVTSSR 60

Query: 61  RRSCCFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRNFLINLDILN 120
           RRSCCFRIFLPIF+VLLLIIILALL+PPL+ LPKPPV +L KFKLTPSTRNF INLDILN
Sbjct: 61  RRSCCFRIFLPIFIVLLLIIILALLIPPLIALPKPPVFKLTKFKLTPSTRNFNINLDILN 120

Query: 121 PNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKSAFPKELKRRM 180
           PNS GSISFKSPSRVSLSFRK+QLATTKFPLIRQ HGS+K VALSLRAKSAFPKEL+RRM
Sbjct: 121 PNSAGSISFKSPSRVSLSFRKSQLATTKFPLIRQDHGSKKNVALSLRAKSAFPKELQRRM 180

Query: 181 KNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDCESERQ 240
           K++KTKLHTSLSLKMNL A+T GRLSNRRNVKFVVTCSFTVNTLGK SRILSQDCESERQ
Sbjct: 181 KSSKTKLHTSLSLKMNLLAETKGRLSNRRNVKFVVTCSFTVNTLGKTSRILSQDCESERQ 240

BLAST of CSPI02G15830 vs. NCBI nr
Match: gi|702386443|ref|XP_010064764.1| (PREDICTED: uncharacterized protein LOC104451904 [Eucalyptus grandis])

HSP 1 Score: 139.8 bits (351), Expect = 6.2e-30
Identity = 95/245 (38.78%), Postives = 148/245 (60.41%), Query Frame = 1

Query: 12  PPPPPDSL--FFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRR-----SC 71
           P P PDS    F+S TYVVQIPKDQIYR+PPPENALIV+RHR P   T  ++R     SC
Sbjct: 36  PVPNPDSPEHAFASDTYVVQIPKDQIYRVPPPENALIVKRHRKP---TEPKKRSFCCSSC 95

Query: 72  CFRIFLPIFVVLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRN--------FLINL 131
           C  +FL I V++L++ ILA++    L L K P   +  F +   T++        + + L
Sbjct: 96  CCWLFLAIIVIVLVVGILAIVSSVFLKL-KNPNFHVDHFVVKDLTKSHDKNTKLVYDVKL 155

Query: 132 DILNPNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLR-AKSAFPKE 191
            + NPN+  S ++K    VSL+F++  +A  KF    Q   + K V + L+ + +A PKE
Sbjct: 156 KVENPNTYSSFTYKQGGAVSLAFKQKAIAMGKFVAFDQDRKTSKAVDIVLKGSNTALPKE 215

Query: 192 LKRRMKNNKTKLHTSLSLKMNLAAQTIGRLSNRRNVKFVVTCSFTVNTLGKNSRILSQDC 241
           +++ +++ KTK H + +L ++  A+    +    + +FV +C FTV+ LGK++R+LSQ C
Sbjct: 216 MQKSLRSKKTKNHLTFALHVDAPARRKIGIIKGSSSRFVASCQFTVDKLGKDARVLSQKC 275

BLAST of CSPI02G15830 vs. NCBI nr
Match: gi|731428359|ref|XP_010664315.1| (PREDICTED: uncharacterized protein LOC104882481 [Vitis vinifera])

HSP 1 Score: 131.0 bits (328), Expect = 2.9e-27
Identity = 89/228 (39.04%), Postives = 134/228 (58.77%), Query Frame = 1

Query: 15  PPDSLFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNPSVVTSSRRRSCCFRIFLPIFV 74
           P     F SGTYVVQ+PKDQIYR+PPPENALI ERHR+P+    S  RS CF + + I +
Sbjct: 19  PDSQNIFRSGTYVVQVPKDQIYRVPPPENALIAERHRSPAQKKKS-YRSRCFILCILICI 78

Query: 75  VLLLIIILALLLPPLLTLPKPPVIELKKFKLTPSTRN---FLINLDILNPNSVGSISFKS 134
           + +++ I A +   +L  PK P+  ++   +T S+ +   + I L   NPNS   I++++
Sbjct: 79  IAVIVAIAAAVSSRVLH-PKSPIFHIQHLVVTKSSHSRPQYKITLKAKNPNSHTGITYEA 138

Query: 135 PSRVSLSFRKNQLATTKFPLIRQ-QHGSEKKVALSLRAKSAFPKELKRRMKNNKTKLHTS 194
               SLSF+ +++A    P   Q Q  S   V     +KSA PKE++R +K+  +  HTS
Sbjct: 139 GGHASLSFKNHEIADGDCPTFSQGQKDSTVFVVPLSGSKSALPKEIERSIKSQNSTAHTS 198

Query: 195 LSLKMNLAAQTIGRL-SNRRNVKFVVTCSFTVNTLGKNSRILSQDCES 238
           LSL +++      R+   +R  +  V C+ TV+TL K +R+LSQ+C S
Sbjct: 199 LSLSLSMDYPIKRRVWLFKRGKRLAVLCNVTVDTLAKGTRVLSQECHS 244

BLAST of CSPI02G15830 vs. NCBI nr
Match: gi|596290130|ref|XP_007226131.1| (hypothetical protein PRUPE_ppa014613mg [Prunus persica])

HSP 1 Score: 131.0 bits (328), Expect = 2.9e-27
Identity = 99/255 (38.82%), Postives = 147/255 (57.65%), Query Frame = 1

Query: 6   PEPVHPPPPPPDS------------LFFSSGTYVVQIPKDQIYRIPPPENALIVERHRNP 65
           PE + PP PPP+S              F SGTY+VQ+PKDQIYR+PPPE+A IVERHR+ 
Sbjct: 22  PEFIPPPLPPPNSQQLILSNSNGSTATFRSGTYIVQVPKDQIYRMPPPEHATIVERHRD- 81

Query: 66  SVVTSSRRRSCCFRIFLPIFVVLLLIIILALLLPPLLTLPKPPV-IELKKFKLTPSTRNF 125
           S V       CC  I    F+VLL+I ++A++L  L     P   +E    K      ++
Sbjct: 82  SGVNKKSCSYCCLGII--AFIVLLIITLVAVILTMLAKSGDPKFSVERVVVKGKSGRPDY 141

Query: 126 LINLDILNPNSVGSISFKSPSRVSLSFRKNQLATTKFPLIRQQHGSEKKVALSLRAKS-A 185
            + L+  NPNS  +I +K     SL F++ ++A  K+P + Q  G  K+VAL L   +  
Sbjct: 142 DLTLEARNPNSRVAIVYKDGGGASLYFKQKKIANGKYPSLYQGSGKSKEVALVLHGSNMK 201

Query: 186 FPKELKRRMKNNKTKLH-----TSLSLKMNLAAQ-TIGRLSNRRNVKFVVTCSFTVNTLG 241
            PKE+++ +K++ +  +      SLSL M++ A+  IG L N R+ KF VTC  TV+TL 
Sbjct: 202 LPKEIEKSLKSSYSSTYKKGHRVSLSLNMDIPARMRIGTL-NSRSRKFHVTCDITVDTLA 261

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LK43_CUCSA1.5e-12399.58Uncharacterized protein OS=Cucumis sativus GN=Csa_2G307330 PE=4 SV=1[more]
A0A059BRD6_EUCGR4.3e-3038.78Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02438 PE=4 SV=1[more]
M5Y5Z5_PRUPE2.0e-2738.82Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa014613mg PE=4 SV=1[more]
K7KJB0_SOYBN8.4e-2634.82Uncharacterized protein OS=Glycine max GN=GLYMA_04G103700 PE=4 SV=1[more]
B9H747_POPTR8.4e-2636.18Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s16920g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G22180.11.1e-1331.82 hydroxyproline-rich glycoprotein family protein[more]
AT5G21130.13.2e-0826.03 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT4G39745.16.0e-0729.58 hydroxyproline-rich glycoprotein family protein[more]
AT2G27080.12.3e-0626.10 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
Match NameE-valueIdentityDescription
gi|449465892|ref|XP_004150661.1|2.1e-12399.58PREDICTED: uncharacterized protein LOC101202802 [Cucumis sativus][more]
gi|659089589|ref|XP_008445591.1|2.3e-10989.58PREDICTED: uncharacterized protein LOC103488571 [Cucumis melo][more]
gi|702386443|ref|XP_010064764.1|6.2e-3038.78PREDICTED: uncharacterized protein LOC104451904 [Eucalyptus grandis][more]
gi|731428359|ref|XP_010664315.1|2.9e-2739.04PREDICTED: uncharacterized protein LOC104882481 [Vitis vinifera][more]
gi|596290130|ref|XP_007226131.1|2.9e-2738.82hypothetical protein PRUPE_ppa014613mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005774 vacuolar membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G15830.1CSPI02G15830.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA-14PFAMPF03168LEA_2coord: 116..217
score: 3.
NoneNo IPR availablePANTHERPTHR31852FAMILY NOT NAMEDcoord: 12..238
score: 1.1
NoneNo IPR availablePANTHERPTHR31852:SF47HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 12..238
score: 1.1