Cucsa.364900 (gene) Cucumber (Gy14) v1

NameCucsa.364900
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionLate embryogenesis abundant hydroxyproline-rich glycoprotein family, putative
Locationscaffold03611 : 2354281 .. 2355391 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGATTTGCCATTGAAACCGCCTCTCCAAAAGCCCCCTGGCTACAAGGATCACAACACCACCGCCACCTCCTCCTCCTCCGCCTCCACCGCCACCCACCTTCCTCCACCTCTCCGTCCCAAACCTCGTCCCCcTTCCTCCTACAAACCCAAGAAACGCAAACGCAATTGCTGCAGAACATGCTGCTGCATTTTTTGCTTCCTCATCCTCTTCCTCATCGTTGTTGCCGCCCTCGCCCTCGCTCTCTTCTACCTACTCTACGACCCAAAACTCCCCGTCTTCCACCTCCTCGCTTTCCGGATCTCGTCCTTCAAAGTCTCCACCACACCGGACGGGTCGTTCCTCGACTCGCAAGTGTCGATTCGAGTGGAATTCAAGAATCCAAATGAGAAGCTTTCGATAAAGTATGGTAAGATTGAGTATGATGTCACGGTGGGGCAGGCGACGGAGTTTGGGAGGAGAGAGTTGGCTGGATTTACGCAGGGGAGGAGGAGTACAACGACGGTGAAGGCGGAGGCGGCGGTGAAGAATAAGATGCTGGCAGTTGAGGATGGGGGGAGGCTGTTGTCGAAGTTTCAGAGTAAGGCGCTGGAGGTGAAAGTTGAAGCGGAGACGGAGGTGGGTGTGGTTGTTCAAGGATGGGGATTGGGTCCGATCACCGTCAAGTTGGATTGTGAGTCTAAATTGAAGAATATTGATGGTGGTGATATGCCTACTTGCAACATCAATTTGCTCAGATGGTATTTTCTTCCATTCTCTTTTTCTCTTTTTCAATCATATCTTATATTCCTATAAATATAATTTTACTTACATATATTAACACTCTCCTATAGTAACAATAATTTCTAATGTCTCTAATTCAATACTCTCATTATCTAATTCAATTCTtTTTATAGAACAAAAAGATTGGCTTAAATATAAATGACTTTTTGAACCACCAAGAAATATCATCATGATTGCATCTTCCAGGGCTATCACTTCTTTGTTCCACATCTAATGAATTTGAAATATATATATATATATAAACAAATGAATTATTGTAAAGTACTTATGGGTTCTTTTTTGCTTCTAATTGCAGGATCAATATACGTGGATGAAATTATTCCTC

mRNA sequence

ATGGCGGATTTGCCATTGAAACCGCCTCTCCAAAAGCCCCCTGGCTACAAGGATCACAACAccaccgccacctcctcctcctccgcctccaccgccacccaccttcctccacctctccgtcccaaacctcgtcccccttcctcctACAAACCCAAGAAACGCAAACGCAATTGCTGCAGAACATGCTGCTGCATTTTTTGCTTCCTCATCCTCTTCCTCATCGTTGTTGCCGCCCTCGCCCTCGCTCTCTTCTACCTACTCTACGACCCAAAACTCCCCGTCTTCCACCTCCTCGCTTTCCGGATCTCGTCCTTCAAAGTCTCCACCACACCGGACGGGTCGTTCCTCGACTCGCAAGTGTCGATTCGAGTGGAATTCAAGAATCCAAATGAGAAGCTTTCGATAAAGTATGGTAAGATTGAGTATGATGTCACGGTGGGGCAGGCGACGGAGTTTGGGAGGAGAGAGTTGGCTGGATTTACGCAGGGGAGGAGGAGTACAACGACGGTGAAGGCGGAGGCGGCGGTGAAGAATAAGATGCTGGCAGTTGAGGATGGGGGGAGGCTGTTGTCGAAGTTTCAGAGTAAGGCGCTGGAGGTGAAAGTTGAAGCGGAGACGGAGGTGGGTGTGGTTGTTCAAGGATGGGGATTGGGTCCGATCACCGTCAAGTTGGATTGTGAGTCTAAATTGAAGAATATTGATGGTGGTGATATGCCTACTTGCAACATCAATTTGCTCAGATGGATCAATATACGTGGATGAAATTATTCCTC

Coding sequence (CDS)

ATGGCGGATTTGCCATTGAAACCGCCTCTCCAAAAGCCCCCTGGCTACAAGGATCACAACACCACCGCCACCTCCTCCTCCTCCGCCTCCACCGCCACCCACCTTCCTCCACCTCTCCGTCCCAAACCTCGTCCCCcTTCCTCCTACAAACCCAAGAAACGCAAACGCAATTGCTGCAGAACATGCTGCTGCATTTTTTGCTTCCTCATCCTCTTCCTCATCGTTGTTGCCGCCCTCGCCCTCGCTCTCTTCTACCTACTCTACGACCCAAAACTCCCCGTCTTCCACCTCCTCGCTTTCCGGATCTCGTCCTTCAAAGTCTCCACCACACCGGACGGGTCGTTCCTCGACTCGCAAGTGTCGATTCGAGTGGAATTCAAGAATCCAAATGAGAAGCTTTCGATAAAGTATGGTAAGATTGAGTATGATGTCACGGTGGGGCAGGCGACGGAGTTTGGGAGGAGAGAGTTGGCTGGATTTACGCAGGGGAGGAGGAGTACAACGACGGTGAAGGCGGAGGCGGCGGTGAAGAATAAGATGCTGGCAGTTGAGGATGGGGGGAGGCTGTTGTCGAAGTTTCAGAGTAAGGCGCTGGAGGTGAAAGTTGAAGCGGAGACGGAGGTGGGTGTGGTTGTTCAAGGATGGGGATTGGGTCCGATCACCGTCAAGTTGGATTGTGAGTCTAAATTGAAGAATATTGATGGTGGTGATATGCCTACTTGCAACATCAATTTGCTCAGATGGATCAATATACGTGGATGA

Protein sequence

MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQVSIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPTCNINLLRWINIRG*
BLAST of Cucsa.364900 vs. Swiss-Prot
Match: YLS9_ARATH (Protein YLS9 OS=Arabidopsis thaliana GN=YLS9 PE=2 SV=1)

HSP 1 Score: 57.0 bits (136), Expect = 3.5e-07
Identity = 46/191 (24.08%), Postives = 87/191 (45.55%), Query Frame = 1

Query: 38  PLRPKPRPPSSYKPKKRKRNCCRTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHL 97
           P  P P P   Y+ +   R C      +F  +I+ LIV+  +A  +F+L+  P+   FH+
Sbjct: 14  PSVPPPAPKGYYR-RGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHV 73

Query: 98  LAFRISSFKVSTTPDGSFLDSQVSIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRREL 157
               ++ F   T+PD + L   +++ V  +NPN+++ + Y +IE      +   F    L
Sbjct: 74  TDASLTRFD-HTSPD-NILRYNLALTVPVRNPNKRIGLYYDRIEAHAYY-EGKRFSTITL 133

Query: 158 AGFTQGRRSTTTVKAEAAVKNKMLAVEDGGRLLSKFQ-SKALEVKVEAETEVGVVVQGWG 217
             F QG ++TT +      +N ++      R L+  + S    ++++    V   +    
Sbjct: 134 TPFYQGHKNTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLK 193

Query: 218 LGPITVKLDCE 228
              I  K+DC+
Sbjct: 194 FRRIKPKVDCD 200

BLAST of Cucsa.364900 vs. TrEMBL
Match: A0A0A0KCD8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G042460 PE=4 SV=1)

HSP 1 Score: 513.8 bits (1322), Expect = 1.2e-142
Identity = 253/253 (100.00%), Postives = 253/253 (100.00%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR
Sbjct: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
           TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180
           SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM
Sbjct: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180

Query: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240
           LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT
Sbjct: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240

Query: 241 CNINLLRWINIRG 254
           CNINLLRWINIRG
Sbjct: 241 CNINLLRWINIRG 253

BLAST of Cucsa.364900 vs. TrEMBL
Match: A0A061DTS6_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS=Theobroma cacao GN=TCM_005206 PE=4 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 4.4e-57
Identity = 117/252 (46.43%), Postives = 161/252 (63.89%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M + PLKP LQKPPGYKD +  A            PPP   KP  P S+ PKKR+  CCR
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFR------PPPR--KPVLPPSFHPKKRRGGCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
            CCC FC   L LI++  +  A+FYL +DPKLP FH+ + RIS F V+  PDG++LD+Q 
Sbjct: 61  VCCCCFCIFFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQT 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K++  YG  E DV+VG+    TE G   + GFT G+++TT++K E  V 
Sbjct: 121 TTRLEVKNPNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVI 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++    G RL ++++SK+L V VEA T++G+ V G  +G + V + C+   LK +DGG
Sbjct: 181 NKLVDDGVGTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGG 240

Query: 241 DMPTCNINLLRW 249
           DMP C IN+L+W
Sbjct: 241 DMPKCVINMLKW 244

BLAST of Cucsa.364900 vs. TrEMBL
Match: W9SAG5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 4.8e-56
Identity = 117/253 (46.25%), Postives = 164/253 (64.82%), Query Frame = 1

Query: 1   MADLPLKPP-LQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCC 60
           MA+ PLKPP LQKPPGY+D          A+    +  P + KP  P+S+ P+KR+RN C
Sbjct: 1   MAEQPLKPPPLQKPPGYRD---------PAAPGKPVARPPQRKPVLPASFHPRKRRRNWC 60

Query: 61  RTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQ 120
           RTCCC     +L L +  A+A  +FYL ++PKLPVFHL + RI  F V+  PDG++LD+ 
Sbjct: 61  RTCCCFVFVFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAG 120

Query: 121 VSIRVEFKNPNEKLSIKYGKIEYDVTVG--QATEFGRRELAGFTQGRRSTTTVKAEAAVK 180
              R+E KNPN KL + YG    +V+VG  +  E GR++L GFTQG+ +TT++K E  VK
Sbjct: 121 TVTRIEVKNPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVK 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           N+++    G RL S ++SK L VK+EA+T VG +VQG  +G + V + C    LK +D G
Sbjct: 181 NQLVDDGLGKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSG 240

Query: 241 DMPTCNINLLRWI 250
           DMP C+I+LL+W+
Sbjct: 241 DMPKCSIDLLKWV 244

BLAST of Cucsa.364900 vs. TrEMBL
Match: A0A0B0NJM7_GOSAR (D-alanine--poly (Phosphoribitol) ligase subunit 1 OS=Gossypium arboreum GN=F383_18367 PE=4 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 3.8e-53
Identity = 111/255 (43.53%), Postives = 159/255 (62.35%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M++ P+KP LQKPPGYKD       SS A      PPP   KP  P S+ PKKRK +  R
Sbjct: 1   MSEPPVKPVLQKPPGYKD------PSSPAGQRRFRPPPR--KPVLPPSFHPKKRKTSYGR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
            CCC FC   L  +++  +  A+FYL +DPKLP FH+ +FRIS F V+  PDG++LD++ 
Sbjct: 61  ACCCCFCIFFLIFLLLILICGAVFYLWFDPKLPGFHIQSFRISRFNVTKRPDGTYLDART 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K+   YG  E +V++G+    TE G   +  FT   ++T +++ E    
Sbjct: 121 TTRLEVKNPNRKMIYYYGDTEVEVSLGEGGYETELGTTTVPAFTMLEKNTRSLRVETKAS 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++  E G +L ++++SK+L V VEA T+VGV V G  +G + V + C+    K +DGG
Sbjct: 181 NKLVVDEVGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGG 240

Query: 241 DMPTCNINLLRWINI 252
           DMP C IN+L+W+NI
Sbjct: 241 DMPKCVINMLKWLNI 247

BLAST of Cucsa.364900 vs. TrEMBL
Match: A0A0D2QQD3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 1.5e-52
Identity = 109/255 (42.75%), Postives = 159/255 (62.35%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M++ P+KP LQKPPGYKD N      S A      PPP   KP  P S+ PKKRK +  R
Sbjct: 1   MSEPPVKPVLQKPPGYKDPN------SPAGQRRFRPPPR--KPVLPPSFHPKKRKTSYGR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
            CCC FC   L  +++  +  A+FYL +DP+LP FH+ +FRIS F V+  PDG++LD++ 
Sbjct: 61  ACCCCFCIFFLIFLLLILICGAVFYLWFDPQLPGFHIQSFRISRFNVTKRPDGTYLDART 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K++  YG  E +++ G+    TE G   +  FT   ++T +++ E    
Sbjct: 121 TTRLEVKNPNGKMTYYYGDTEVEISFGEGGYETELGTTTVPAFTMLEKNTRSLRVETIAS 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++  E G +L ++++SK+L V VEA T+VGV V G  +G + V + C+    K +DGG
Sbjct: 181 NKLVVDEVGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGG 240

Query: 241 DMPTCNINLLRWINI 252
           DMP C IN+L+W+NI
Sbjct: 241 DMPKCVINMLKWLNI 247

BLAST of Cucsa.364900 vs. TAIR10
Match: AT2G46300.1 (AT2G46300.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 201.4 bits (511), Expect = 6.4e-52
Identity = 105/260 (40.38%), Postives = 150/260 (57.69%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRP-----PSSYKPKKRK 60
           MAD  + P LQKPPGY+D N ++            PPP++ +P       P+SY+PKK++
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPP--------PPPIQQQPMRKAVPMPTSYRPKKKR 60

Query: 61  RNCCRTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSF 120
           R+CCR CCC  C  ++  I +  +  A+FYL +DPKLP F L +FR+  FK++  PDG+ 
Sbjct: 61  RSCCRFCCCCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGAS 120

Query: 121 LDSQVSIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKA 180
           L +    RVE KNPN KL   YG    D++VG     T  G   + GF QG +++T+VK 
Sbjct: 121 LSATAVARVEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKV 180

Query: 181 EAAVKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKN 240
           E  VKN+++      RL +KFQSK L + V A+T+VG+ V G  +G + V L C     N
Sbjct: 181 ETTVKNQLVERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLN 240

Query: 241 IDGGDMPTCNINLLRWINIR 253
               D P C +N L+W+ I+
Sbjct: 241 KLDTDSPKCILNTLKWVTIQ 252

BLAST of Cucsa.364900 vs. TAIR10
Match: AT4G01110.1 (AT4G01110.1 unknown protein)

HSP 1 Score: 140.2 bits (352), Expect = 1.8e-33
Identity = 90/257 (35.02%), Postives = 140/257 (54.47%), Query Frame = 1

Query: 6   LKPPLQKPPGYKD-HNTTAT---SSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRT 65
           LKP LQKPPGY++ H+   T   SSSS+S+    PP       P + Y  KKR+ + CR 
Sbjct: 7   LKPVLQKPPGYRELHSQPQTPLGSSSSSSSMLRRPPK---HAIPAAFYPTKKRQWSRCRV 66

Query: 66  CCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDG---SFLDS 125
            CC  C  +  +I++  L +++F+L Y P+LPV  L +FR+S+F  S    G   S L +
Sbjct: 67  FCCCVCITVAIVILLLILTVSVFFLYYSPRLPVVRLSSFRVSNFNFSGGKAGDGLSQLTA 126

Query: 126 QVSIRVEFKNPNEKLSIKYGKIEYDVTVGQ---ATEFGRRELAGFTQGRRSTTTVKAEAA 185
           + + R++F+NPN KL   YG ++  V+VG+    T  G  ++ GF +   + T V     
Sbjct: 127 EATARLDFRNPNGKLRYYYGNVDVAVSVGEDDFETSLGSTKVKGFVEKPGNRTVVIVPIK 186

Query: 186 VKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNID 245
           VK + +      RL +  +SK L VKV A+T+VG+ V    +  + V + C   +L+ +D
Sbjct: 187 VKKQQVDDPTVKRLRADMKSKKLVVKVMAKTKVGLGVGRRKIVTVGVTISCGGVRLQTLD 246

Query: 246 GGDMPTCNINLLRWINI 252
              M  C I +L+WI +
Sbjct: 247 -SKMSKCTIKMLKWIKL 259

BLAST of Cucsa.364900 vs. TAIR10
Match: AT1G01453.2 (AT1G01453.2 unknown protein)

HSP 1 Score: 136.7 bits (343), Expect = 2.0e-32
Identity = 83/253 (32.81%), Postives = 132/253 (52.17%), Query Frame = 1

Query: 2   ADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRT 61
           A+ PL+P LQKPPG++D     ++  S +      P  RP+P  P+    KKR+ + CR 
Sbjct: 16  AEKPLQPALQKPPGFRDQQNQPSAPPSGTATL---PRRRPRPIHPAD---KKRRCSFCRV 75

Query: 62  CCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVS--TTPDG-SFLDS 121
            CC  C L   ++++  +A+A+F+L Y PKLPV  L +F+IS+F  S   + DG SFL +
Sbjct: 76  FCCCVCILFAVILLLILIAVAVFFLWYSPKLPVVRLASFKISNFNFSDGKSDDGWSFLSA 135

Query: 122 QVSIRVEFKNPNEKLSIKYGKIEYDVTVGQ---ATEFGRRELAGFTQGRRSTTTVKAEAA 181
             +  ++F+NPN KL+  YG  +  V +G+    T     ++ GF +   + T V     
Sbjct: 136 DTTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIVPTT 195

Query: 182 VKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDG 241
           V+ + +      RL  + +SK L V V A+T+VG+ V    +  + V L C   +     
Sbjct: 196 VRKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQTLD 255

Query: 242 GDMPTCNINLLRW 249
             M  C I +L+W
Sbjct: 256 SKMAQCTIKMLKW 262

BLAST of Cucsa.364900 vs. TAIR10
Match: AT1G17620.1 (AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 89.4 bits (220), Expect = 3.6e-18
Identity = 61/229 (26.64%), Postives = 108/229 (47.16%), Query Frame = 1

Query: 9   PLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRTCCCIFCF 68
           P  KPP         T+ +  +    L    RP  RPP+  +     R CC  CCC   F
Sbjct: 8   PASKPPAIVGGGAPTTNPTFPANKAQLYNANRPAYRPPAGRRRTSHTRGCCCRCCCWTIF 67

Query: 69  LILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSF--LDSQVSIRVEF 128
           +I+ L+++ A A A+ YL+Y P+ P     +F +S  K+ST    S   L + +S+ V  
Sbjct: 68  VIILLLLIVAAASAVVYLIYRPQRP-----SFTVSELKISTLNFTSAVRLTTAISLSVIA 127

Query: 129 KNPNEKLSIKYGKIEYDVTVGQATE-------FGRRELAGFTQGRRSTTTVKAEAAVKNK 188
           +NPN+ +   Y     D+T+ +A+         G+  +A F+ G+++TTT+++       
Sbjct: 128 RNPNKNVGFIYDVT--DITLYKASTGGDDDVVIGKGTIAAFSHGKKNTTTLRSTIGSPPD 187

Query: 189 MLAVEDGGRLLSKFQS-KALEVKVEAETEVGVVVQGWGLGPITVKLDCE 228
            L     G+L    ++ KA+ +K+   ++V V +         +++ CE
Sbjct: 188 ELDEISAGKLKGDLKAKKAVAIKIVLNSKVKVKMGALKTPKSGIRVTCE 229

BLAST of Cucsa.364900 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 76.6 bits (187), Expect = 2.4e-14
Identity = 48/169 (28.40%), Postives = 77/169 (45.56%), Query Frame = 1

Query: 9   PLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSY----KPKKRKRNCCRTCCC 68
           P+Q P       T       +S + H  P   P  + P  +     PKKR+  CCR  C 
Sbjct: 9   PVQDPEAATARPTAPLVPRGSSRSEHGDPSKVPLNQRPQRFVPLAPPKKRRSCCCRCFCY 68

Query: 69  IFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQVSIRV 128
            FCFL+L ++ V A ++ + YL++ PKLP + +   +++ F ++     S L +  ++ +
Sbjct: 69  TFCFLLLLVVAVGA-SIGILYLVFKPKLPDYSIDRLQLTRFALN---QDSSLTTAFNVTI 128

Query: 129 EFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAE 174
             KNPNEK+ I Y             +     L  F QG  +TT +  E
Sbjct: 129 TAKNPNEKIGIYYEDGSKITVWYMEHQLSNGSLPKFYQGHENTTVIYVE 173

BLAST of Cucsa.364900 vs. NCBI nr
Match: gi|449446257|ref|XP_004140888.1| (PREDICTED: uncharacterized protein LOC101205096 [Cucumis sativus])

HSP 1 Score: 513.8 bits (1322), Expect = 1.7e-142
Identity = 253/253 (100.00%), Postives = 253/253 (100.00%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR
Sbjct: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
           TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180
           SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM
Sbjct: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180

Query: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240
           LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT
Sbjct: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240

Query: 241 CNINLLRWINIRG 254
           CNINLLRWINIRG
Sbjct: 241 CNINLLRWINIRG 253

BLAST of Cucsa.364900 vs. NCBI nr
Match: gi|659089922|ref|XP_008445748.1| (PREDICTED: uncharacterized protein LOC103488682 [Cucumis melo])

HSP 1 Score: 476.9 bits (1226), Expect = 2.3e-131
Identity = 233/253 (92.09%), Postives = 242/253 (95.65%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           MADLP+KPPLQKPPGYKDH+T ATSSSSAST THLPPP R KPR PSSYKPKKRKRNCCR
Sbjct: 1   MADLPMKPPLQKPPGYKDHHTAATSSSSASTVTHLPPPPRSKPRLPSSYKPKKRKRNCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
           TCCCIFCFLILFLIVVAALALALFYL+YDPKLPVFHLLAFRIS+FKVS TPDGSFLD+QV
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLIYDPKLPVFHLLAFRISTFKVSATPDGSFLDAQV 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180
           SIRVEFKNPN+KLSIKYGKIEYDV VGQATEFGRRELAGFTQ RRSTTTVKAEAAVKNKM
Sbjct: 121 SIRVEFKNPNDKLSIKYGKIEYDVMVGQATEFGRRELAGFTQDRRSTTTVKAEAAVKNKM 180

Query: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240
           LAVEDG RLLSKFQSKALEVKVEAET VGVV+QGWGLGPITVKLDCE+KLKNI+GGDMP 
Sbjct: 181 LAVEDGARLLSKFQSKALEVKVEAETAVGVVIQGWGLGPITVKLDCETKLKNIEGGDMPI 240

Query: 241 CNINLLRWINIRG 254
           CNINLLRWINIRG
Sbjct: 241 CNINLLRWINIRG 253

BLAST of Cucsa.364900 vs. NCBI nr
Match: gi|590721513|ref|XP_007051635.1| (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative [Theobroma cacao])

HSP 1 Score: 229.6 bits (584), Expect = 6.3e-57
Identity = 117/252 (46.43%), Postives = 161/252 (63.89%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M + PLKP LQKPPGYKD +  A            PPP   KP  P S+ PKKR+  CCR
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFR------PPPR--KPVLPPSFHPKKRRGGCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120
            CCC FC   L LI++  +  A+FYL +DPKLP FH+ + RIS F V+  PDG++LD+Q 
Sbjct: 61  VCCCCFCIFFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQT 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K++  YG  E DV+VG+    TE G   + GFT G+++TT++K E  V 
Sbjct: 121 TTRLEVKNPNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVI 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++    G RL ++++SK+L V VEA T++G+ V G  +G + V + C+   LK +DGG
Sbjct: 181 NKLVDDGVGTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGG 240

Query: 241 DMPTCNINLLRW 249
           DMP C IN+L+W
Sbjct: 241 DMPKCVINMLKW 244

BLAST of Cucsa.364900 vs. NCBI nr
Match: gi|702333839|ref|XP_010055051.1| (PREDICTED: protein YLS9-like [Eucalyptus grandis])

HSP 1 Score: 228.8 bits (582), Expect = 1.1e-56
Identity = 114/254 (44.88%), Postives = 162/254 (63.78%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPP--PLRPKPRPPSSYKPKKRKRNC 60
           MA+ P KP LQKPPGY+D           S     PP  P R    PPS Y P+K++R+C
Sbjct: 1   MAEPPQKPMLQKPPGYRD----------PSVVVQQPPTQPYRKPVMPPSMY-PRKKRRSC 60

Query: 61  CRTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDS 120
           CR+CCC  C LI  ++ V  LA AL YL + PK+PVFHL +FRI  F V+  PDG++L +
Sbjct: 61  CRSCCCCLCVLIFLILCVLILAGALSYLWFGPKIPVFHLQSFRIPRFNVTAKPDGTYLKA 120

Query: 121 QVSIRVEFKNPNEKLSIKYGKIEYDVTVGQ--ATEFGRRELAGFTQGRRSTTTVKAEAAV 180
           Q  +RVE KNPN+KL + YG  + D+++G+    E G   L GFTQG+++ T++K    V
Sbjct: 121 QTVLRVEVKNPNQKLGLYYGGTDVDISLGRGGGIELGSDSLPGFTQGKKNVTSLKVTTEV 180

Query: 181 KNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDC-ESKLKNIDG 240
           +++++    G  L S ++SK+L VKV+  T VG ++QGW +G + V ++C E  +K ++G
Sbjct: 181 RDELVEDGAGAELRSGYRSKSLVVKVKVRTSVGAIIQGWKVGRVRVNVECGEVAMKEVEG 240

Query: 241 GDMPTCNINLLRWI 250
           G+MP C INLLRWI
Sbjct: 241 GEMPKCKINLLRWI 243

BLAST of Cucsa.364900 vs. NCBI nr
Match: gi|703148826|ref|XP_010109444.1| (hypothetical protein L484_003064 [Morus notabilis])

HSP 1 Score: 226.1 bits (575), Expect = 6.9e-56
Identity = 117/253 (46.25%), Postives = 164/253 (64.82%), Query Frame = 1

Query: 1   MADLPLKPP-LQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCC 60
           MA+ PLKPP LQKPPGY+D          A+    +  P + KP  P+S+ P+KR+RN C
Sbjct: 1   MAEQPLKPPPLQKPPGYRD---------PAAPGKPVARPPQRKPVLPASFHPRKRRRNWC 60

Query: 61  RTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQ 120
           RTCCC     +L L +  A+A  +FYL ++PKLPVFHL + RI  F V+  PDG++LD+ 
Sbjct: 61  RTCCCFVFVFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAG 120

Query: 121 VSIRVEFKNPNEKLSIKYGKIEYDVTVG--QATEFGRRELAGFTQGRRSTTTVKAEAAVK 180
              R+E KNPN KL + YG    +V+VG  +  E GR++L GFTQG+ +TT++K E  VK
Sbjct: 121 TVTRIEVKNPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVK 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           N+++    G RL S ++SK L VK+EA+T VG +VQG  +G + V + C    LK +D G
Sbjct: 181 NQLVDDGLGKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSG 240

Query: 241 DMPTCNINLLRWI 250
           DMP C+I+LL+W+
Sbjct: 241 DMPKCSIDLLKWV 244

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
YLS9_ARATH3.5e-0724.08Protein YLS9 OS=Arabidopsis thaliana GN=YLS9 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KCD8_CUCSA1.2e-142100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_6G042460 PE=4 SV=1[more]
A0A061DTS6_THECC4.4e-5746.43Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS... [more]
W9SAG5_9ROSA4.8e-5646.25Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1[more]
A0A0B0NJM7_GOSAR3.8e-5343.53D-alanine--poly (Phosphoribitol) ligase subunit 1 OS=Gossypium arboreum GN=F383_... [more]
A0A0D2QQD3_GOSRA1.5e-5242.75Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G46300.16.4e-5240.38 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT4G01110.11.8e-3335.02 unknown protein[more]
AT1G01453.22.0e-3232.81 unknown protein[more]
AT1G17620.13.6e-1826.64 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT1G65690.12.4e-1428.40 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
Match NameE-valueIdentityDescription
gi|449446257|ref|XP_004140888.1|1.7e-142100.00PREDICTED: uncharacterized protein LOC101205096 [Cucumis sativus][more]
gi|659089922|ref|XP_008445748.1|2.3e-13192.09PREDICTED: uncharacterized protein LOC103488682 [Cucumis melo][more]
gi|590721513|ref|XP_007051635.1|6.3e-5746.43Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative [T... [more]
gi|702333839|ref|XP_010055051.1|1.1e-5644.88PREDICTED: protein YLS9-like [Eucalyptus grandis][more]
gi|703148826|ref|XP_010109444.1|6.9e-5646.25hypothetical protein L484_003064 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.364900.1Cucsa.364900.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA-14PFAMPF03168LEA_2coord: 124..227
score: 9.9
NoneNo IPR availablePANTHERPTHR31234FAMILY NOT NAMEDcoord: 5..252
score: 7.7
NoneNo IPR availablePANTHERPTHR31234:SF12LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEINcoord: 5..252
score: 7.7

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cucsa.364900Watermelon (97103) v2cgywmbB628