CSPI06G03690 (gene) Wild cucumber (PI 183967)

Name: CSPI06G03690
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative
Location: Chr6 : 3398261 .. 3399681 (-)
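
The location string packs a chromosome, a start..end interval, and a strand into one field. A minimal Python sketch for pulling those parts apart and computing the span follows. It assumes the coordinates are 1-based and inclusive, which is common for feature pages of this kind but is not stated here; `parse_location` and `span_length` are hypothetical helper names.

```python
import re

# Hypothetical helper: parse a location string such as
# "Chr6 : 3398261 .. 3399681 (-)" into chromosome, start, end, strand.
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")

def parse_location(text):
    m = LOCATION_RE.match(text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr6 : 3398261 .. 3399681 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the interval covers 1421 bp on the minus strand of Chr6.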
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
ATCTGTTTCTCTCTGTTTCTCTCCATTTTTCCAAAAATATTTCCCCCATATTCTCTCATTCTCTCTCACCCCCTCCTCCATTCAATTCAATCCCCTTCCATCTTTTCAATCCCTCATTTCCCATCCATTCCCTCCCTTTGATCCAAATCTCACTTCCTCAATACTCTCTCTCTTCCATGGCGGATTTGCCATTGAAACCGCCTCTCCAAAAGCCCCCTGGCTACAAGGATCACAACACCACCGCCACCTCCTCCTCCTCCGCCTCCACCGCCACCCACCTTCCTCCACCTCTCCGTCCCAAACCTCGTCCCCCTTCCTCCTACAAACCCAAGAAACGCAAACGCAATTGCTGCAGAACATGCTGCTGCATTTTTTGCTTCCTCATCCTCTTCCTCATCGTTGTTGCCGCCCTCGCCCTCGCTCTCTTCTACCTACTCTACGACCCAAAACTCCCCGTCTTCCACCTCCTCGCTTTCCGGATCTCGTCCTTCAAAGTCTCCGCCACACCGGACGGGTCGTTCCTCGACTCGCAAGTGTCGATTCGAGTGGAATTCAAGAATCCAAATGAGAAGCTTTCGATAAAGTATGGTAAGATTGAGTATGATGTCACGGTGGGGCAGGCGACGGAGTTTGGGAGGAGAGAGTTGGCTGGATTTACGCAGGGGAGGAGGAGTACAACGACGGTGAAGGCGGAGGCGGCGGTGAAGAATAAGATGCTGGCAGTTGAGGATGGGGGGAGGCTGTTGTCGAAGTTTCAGAGTAAGGCGCTGGAGGTGAAAGTTGAAGCGGAGACGGAGGTGGGTGTGGTTGTTCAAGGATGGGGATTGGGTCCGATCACCGTCAAGTTGGATTGTGAGTCTAAATTGAAGAATATTGATGGTGGTGATATGCCTACTTGCAACATCAATTTGCTCAGATGGTATTTTCTTCCATTCTCTTTTTCTCTTTTTCAATCATATCTTATATTCCTATAAATATAATTTTACTTACATATATTAACACTCTCCTATAATAATAATAATTTCTAATGTCTCTAATTCAATACTCTCATTATCTAATTCAATTGTTTTTATAAAACAAAAAGATTGGCTTAAATGTAAATGACTTTTTGAACCACCAAGAAATATCATCATGATTGCATCTTCCAAGGCTATCACTTCTTTGTTCCACATCTAATGAATTTGAAATATATATATAAACAAATGAATTATTGTAAAGTACTTATGGGTTCTTTTTTGCTTCTAACTGCAGGATCAATATACGTGGATGAAATTATTCCTCCCAAGAATTATATTTATATTGAAAATGATAATATAATCATTTTCTCGATAATATAATCATTTTCTCTCTCTTTTTTAATTTTTTATATTGGGGTGAAAATTTTCTTTTTGCAATTCTTTTCCTTTTTTCTTTTCCTACCC

mRNA sequence

ATGGCGGATTTGCCATTGAAACCGCCTCTCCAAAAGCCCCCTGGCTACAAGGATCACAACACCACCGCCACCTCCTCCTCCTCCGCCTCCACCGCCACCCACCTTCCTCCACCTCTCCGTCCCAAACCTCGTCCCCCTTCCTCCTACAAACCCAAGAAACGCAAACGCAATTGCTGCAGAACATGCTGCTGCATTTTTTGCTTCCTCATCCTCTTCCTCATCGTTGTTGCCGCCCTCGCCCTCGCTCTCTTCTACCTACTCTACGACCCAAAACTCCCCGTCTTCCACCTCCTCGCTTTCCGGATCTCGTCCTTCAAAGTCTCCGCCACACCGGACGGGTCGTTCCTCGACTCGCAAGTGTCGATTCGAGTGGAATTCAAGAATCCAAATGAGAAGCTTTCGATAAAGTATGGTAAGATTGAGTATGATGTCACGGTGGGGCAGGCGACGGAGTTTGGGAGGAGAGAGTTGGCTGGATTTACGCAGGGGAGGAGGAGTACAACGACGGTGAAGGCGGAGGCGGCGGTGAAGAATAAGATGCTGGCAGTTGAGGATGGGGGGAGGCTGTTGTCGAAGTTTCAGAGTAAGGCGCTGGAGGTGAAAGTTGAAGCGGAGACGGAGGTGGGTGTGGTTGTTCAAGGATGGGGATTGGGTCCGATCACCGTCAAGTTGGATTGTGAGTCTAAATTGAAGAATATTGATGGTGGTGATATGCCTACTTGCAACATCAATTTGCTCAGATGGATCAATATACGTGGATGA

Coding sequence (CDS)

ATGGCGGATTTGCCATTGAAACCGCCTCTCCAAAAGCCCCCTGGCTACAAGGATCACAACACCACCGCCACCTCCTCCTCCTCCGCCTCCACCGCCACCCACCTTCCTCCACCTCTCCGTCCCAAACCTCGTCCCCCTTCCTCCTACAAACCCAAGAAACGCAAACGCAATTGCTGCAGAACATGCTGCTGCATTTTTTGCTTCCTCATCCTCTTCCTCATCGTTGTTGCCGCCCTCGCCCTCGCTCTCTTCTACCTACTCTACGACCCAAAACTCCCCGTCTTCCACCTCCTCGCTTTCCGGATCTCGTCCTTCAAAGTCTCCGCCACACCGGACGGGTCGTTCCTCGACTCGCAAGTGTCGATTCGAGTGGAATTCAAGAATCCAAATGAGAAGCTTTCGATAAAGTATGGTAAGATTGAGTATGATGTCACGGTGGGGCAGGCGACGGAGTTTGGGAGGAGAGAGTTGGCTGGATTTACGCAGGGGAGGAGGAGTACAACGACGGTGAAGGCGGAGGCGGCGGTGAAGAATAAGATGCTGGCAGTTGAGGATGGGGGGAGGCTGTTGTCGAAGTTTCAGAGTAAGGCGCTGGAGGTGAAAGTTGAAGCGGAGACGGAGGTGGGTGTGGTTGTTCAAGGATGGGGATTGGGTCCGATCACCGTCAAGTTGGATTGTGAGTCTAAATTGAAGAATATTGATGGTGGTGATATGCCTACTTGCAACATCAATTTGCTCAGATGGATCAATATACGTGGATGA
BLAST of CSPI06G03690 vs. TrEMBL
Match: A0A0A0KCD8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G042460 PE=4 SV=1)

HSP 1 Score: 497.3 bits (1279), Expect = 1.1e-137
Identity = 252/253 (99.60%), Positives = 252/253 (99.60%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR
Sbjct: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
           TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVS TPDGSFLDSQV
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180
           SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM
Sbjct: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180

Query: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240
           LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT
Sbjct: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240

Query: 241 CNINLLRWINIRG 254
           CNINLLRWINIRG
Sbjct: 241 CNINLLRWINIRG 253
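
The identity figure reported for an HSP is the number of identical aligned positions divided by the alignment length. A minimal sketch of that calculation, assuming the two aligned rows have equal length and use `-` for gaps; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query, subject):
    """Count identical aligned positions; gap characters never count as matches."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), 100.0 * matches / len(query)

# Second alignment row of the HSP above, which holds the single A/T substitution.
query   = "TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV"
subject = "TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV"
matches, length, pct = percent_identity(query, subject)
print(matches, length, f"{pct:.2f}%")

# Whole-HSP figure from the reported counts (252 identical of 253 aligned).
print(f"{100.0 * 252 / 253:.2f}%")  # 99.60%
```

That one substitution accounts for the single non-identical position in the 252/253 match.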

BLAST of CSPI06G03690 vs. TrEMBL
Match: A0A061DTS6_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS=Theobroma cacao GN=TCM_005206 PE=4 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.1e-52
Identity = 117/252 (46.43%), Positives = 161/252 (63.89%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M + PLKP LQKPPGYKD +  A            PPP   KP  P S+ PKKR+  CCR
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFR------PPPR--KPVLPPSFHPKKRRGGCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
            CCC FC   L LI++  +  A+FYL +DPKLP FH+ + RIS F V+  PDG++LD+Q 
Sbjct: 61  VCCCCFCIFFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQT 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K++  YG  E DV+VG+    TE G   + GFT G+++TT++K E  V 
Sbjct: 121 TTRLEVKNPNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVI 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++    G RL ++++SK+L V VEA T++G+ V G  +G + V + C+   LK +DGG
Sbjct: 181 NKLVDDGVGTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGG 240

Query: 241 DMPTCNINLLRW 249
           DMP C IN+L+W
Sbjct: 241 DMPKCVINMLKW 244

BLAST of CSPI06G03690 vs. TrEMBL
Match: W9SAG5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 9.4e-52
Identity = 117/253 (46.25%), Positives = 164/253 (64.82%), Query Frame = 1

Query: 1   MADLPLKPP-LQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCC 60
           MA+ PLKPP LQKPPGY+D          A+    +  P + KP  P+S+ P+KR+RN C
Sbjct: 1   MAEQPLKPPPLQKPPGYRD---------PAAPGKPVARPPQRKPVLPASFHPRKRRRNWC 60

Query: 61  RTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQ 120
           RTCCC     +L L +  A+A  +FYL ++PKLPVFHL + RI  F V+  PDG++LD+ 
Sbjct: 61  RTCCCFVFVFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAG 120

Query: 121 VSIRVEFKNPNEKLSIKYGKIEYDVTVG--QATEFGRRELAGFTQGRRSTTTVKAEAAVK 180
              R+E KNPN KL + YG    +V+VG  +  E GR++L GFTQG+ +TT++K E  VK
Sbjct: 121 TVTRIEVKNPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVK 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           N+++    G RL S ++SK L VK+EA+T VG +VQG  +G + V + C    LK +D G
Sbjct: 181 NQLVDDGLGKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSG 240

Query: 241 DMPTCNINLLRWI 250
           DMP C+I+LL+W+
Sbjct: 241 DMPKCSIDLLKWV 244

BLAST of CSPI06G03690 vs. TrEMBL
Match: A0A0B0NJM7_GOSAR (D-alanine--poly (Phosphoribitol) ligase subunit 1 OS=Gossypium arboreum GN=F383_18367 PE=4 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 5.7e-49
Identity = 111/255 (43.53%), Positives = 159/255 (62.35%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M++ P+KP LQKPPGYKD       SS A      PPP   KP  P S+ PKKRK +  R
Sbjct: 1   MSEPPVKPVLQKPPGYKD------PSSPAGQRRFRPPPR--KPVLPPSFHPKKRKTSYGR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
            CCC FC   L  +++  +  A+FYL +DPKLP FH+ +FRIS F V+  PDG++LD++ 
Sbjct: 61  ACCCCFCIFFLIFLLLILICGAVFYLWFDPKLPGFHIQSFRISRFNVTKRPDGTYLDART 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K+   YG  E +V++G+    TE G   +  FT   ++T +++ E    
Sbjct: 121 TTRLEVKNPNRKMIYYYGDTEVEVSLGEGGYETELGTTTVPAFTMLEKNTRSLRVETKAS 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++  E G +L ++++SK+L V VEA T+VGV V G  +G + V + C+    K +DGG
Sbjct: 181 NKLVVDEVGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGG 240

Query: 241 DMPTCNINLLRWINI 252
           DMP C IN+L+W+NI
Sbjct: 241 DMPKCVINMLKWLNI 247

BLAST of CSPI06G03690 vs. TrEMBL
Match: A0A0D2QQD3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 1.7e-48
Identity = 109/255 (42.75%), Positives = 159/255 (62.35%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M++ P+KP LQKPPGYKD N      S A      PPP   KP  P S+ PKKRK +  R
Sbjct: 1   MSEPPVKPVLQKPPGYKDPN------SPAGQRRFRPPPR--KPVLPPSFHPKKRKTSYGR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
            CCC FC   L  +++  +  A+FYL +DP+LP FH+ +FRIS F V+  PDG++LD++ 
Sbjct: 61  ACCCCFCIFFLIFLLLILICGAVFYLWFDPQLPGFHIQSFRISRFNVTKRPDGTYLDART 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K++  YG  E +++ G+    TE G   +  FT   ++T +++ E    
Sbjct: 121 TTRLEVKNPNGKMTYYYGDTEVEISFGEGGYETELGTTTVPAFTMLEKNTRSLRVETIAS 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++  E G +L ++++SK+L V VEA T+VGV V G  +G + V + C+    K +DGG
Sbjct: 181 NKLVVDEVGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGG 240

Query: 241 DMPTCNINLLRWINI 252
           DMP C IN+L+W+NI
Sbjct: 241 DMPKCVINMLKWLNI 247

BLAST of CSPI06G03690 vs. TAIR10
Match: AT2G46300.1 (AT2G46300.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 187.2 bits (474), Expect = 1.3e-47
Identity = 105/260 (40.38%), Positives = 150/260 (57.69%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRP-----PSSYKPKKRK 60
           MAD  + P LQKPPGY+D N ++            PPP++ +P       P+SY+PKK++
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPP--------PPPIQQQPMRKAVPMPTSYRPKKKR 60

Query: 61  RNCCRTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSF 120
           R+CCR CCC  C  ++  I +  +  A+FYL +DPKLP F L +FR+  FK++  PDG+ 
Sbjct: 61  RSCCRFCCCCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGAS 120

Query: 121 LDSQVSIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKA 180
           L +    RVE KNPN KL   YG    D++VG     T  G   + GF QG +++T+VK 
Sbjct: 121 LSATAVARVEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKV 180

Query: 181 EAAVKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKN 240
           E  VKN+++      RL +KFQSK L + V A+T+VG+ V G  +G + V L C     N
Sbjct: 181 ETTVKNQLVERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLN 240

Query: 241 IDGGDMPTCNINLLRWINIR 253
               D P C +N L+W+ I+
Sbjct: 241 KLDTDSPKCILNTLKWVTIQ 252

BLAST of CSPI06G03690 vs. TAIR10
Match: AT4G01110.1 (AT4G01110.1 unknown protein)

HSP 1 Score: 126.3 bits (316), Expect = 2.6e-29
Identity = 90/257 (35.02%), Positives = 140/257 (54.47%), Query Frame = 1

Query: 6   LKPPLQKPPGYKD-HNTTAT---SSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRT 65
           LKP LQKPPGY++ H+   T   SSSS+S+    PP       P + Y  KKR+ + CR 
Sbjct: 7   LKPVLQKPPGYRELHSQPQTPLGSSSSSSSMLRRPPK---HAIPAAFYPTKKRQWSRCRV 66

Query: 66  CCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDG---SFLDS 125
            CC  C  +  +I++  L +++F+L Y P+LPV  L +FR+S+F  S    G   S L +
Sbjct: 67  FCCCVCITVAIVILLLILTVSVFFLYYSPRLPVVRLSSFRVSNFNFSGGKAGDGLSQLTA 126

Query: 126 QVSIRVEFKNPNEKLSIKYGKIEYDVTVGQ---ATEFGRRELAGFTQGRRSTTTVKAEAA 185
           + + R++F+NPN KL   YG ++  V+VG+    T  G  ++ GF +   + T V     
Sbjct: 127 EATARLDFRNPNGKLRYYYGNVDVAVSVGEDDFETSLGSTKVKGFVEKPGNRTVVIVPIK 186

Query: 186 VKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNID 245
           VK + +      RL +  +SK L VKV A+T+VG+ V    +  + V + C   +L+ +D
Sbjct: 187 VKKQQVDDPTVKRLRADMKSKKLVVKVMAKTKVGLGVGRRKIVTVGVTISCGGVRLQTLD 246

Query: 246 GGDMPTCNINLLRWINI 252
              M  C I +L+WI +
Sbjct: 247 -SKMSKCTIKMLKWIKL 259

BLAST of CSPI06G03690 vs. TAIR10
Match: AT1G01453.2 (AT1G01453.2 unknown protein)

HSP 1 Score: 122.1 bits (305), Expect = 5.0e-28
Identity = 83/253 (32.81%), Positives = 132/253 (52.17%), Query Frame = 1

Query: 2   ADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRT 61
           A+ PL+P LQKPPG++D     ++  S +      P  RP+P  P+    KKR+ + CR 
Sbjct: 16  AEKPLQPALQKPPGFRDQQNQPSAPPSGTATL---PRRRPRPIHPAD---KKRRCSFCRV 75

Query: 62  CCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVS--ATPDG-SFLDS 121
            CC  C L   ++++  +A+A+F+L Y PKLPV  L +F+IS+F  S   + DG SFL +
Sbjct: 76  FCCCVCILFAVILLLILIAVAVFFLWYSPKLPVVRLASFKISNFNFSDGKSDDGWSFLSA 135

Query: 122 QVSIRVEFKNPNEKLSIKYGKIEYDVTVGQ---ATEFGRRELAGFTQGRRSTTTVKAEAA 181
             +  ++F+NPN KL+  YG  +  V +G+    T     ++ GF +   + T V     
Sbjct: 136 DTTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIVPTT 195

Query: 182 VKNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDG 241
           V+ + +      RL  + +SK L V V A+T+VG+ V    +  + V L C   +     
Sbjct: 196 VRKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQTLD 255

Query: 242 GDMPTCNINLLRW 249
             M  C I +L+W
Sbjct: 256 SKMAQCTIKMLKW 262

BLAST of CSPI06G03690 vs. TAIR10
Match: AT1G17620.1 (AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 72.4 bits (176), Expect = 4.5e-13
Identity = 58/227 (25.55%), Positives = 107/227 (47.14%), Query Frame = 1

Query: 9   PLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCRTCCCIFCF 68
           P  KPP         T+ +  +    L    RP  RPP+  +     R CC  CCC   F
Sbjct: 8   PASKPPAIVGGGAPTTNPTFPANKAQLYNANRPAYRPPAGRRRTSHTRGCCCRCCCWTIF 67

Query: 69  LILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQVSIRVEFKN 128
           +I+ L+++ A A A+ YL+Y P+ P F +   +IS+   ++      L + +S+ V  +N
Sbjct: 68  VIILLLLIVAAASAVVYLIYRPQRPSFTVSELKISTLNFTSAVR---LTTAISLSVIARN 127

Query: 129 PNEKLSIKYGKIEYDVTVGQATE-------FGRRELAGFTQGRRSTTTVKAEAAVKNKML 188
           PN+ +   Y     D+T+ +A+         G+  +A F+ G+++TTT+++        L
Sbjct: 128 PNKNVGFIYDVT--DITLYKASTGGDDDVVIGKGTIAAFSHGKKNTTTLRSTIGSPPDEL 187

Query: 189 AVEDGGRLLSKFQS-KALEVKVEAETEVGVVVQGWGLGPITVKLDCE 228
                G+L    ++ KA+ +K+   ++V V +         +++ CE
Sbjct: 188 DEISAGKLKGDLKAKKAVAIKIVLNSKVKVKMGALKTPKSGIRVTCE 229

BLAST of CSPI06G03690 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 60.5 bits (145), Expect = 1.8e-09
Identity = 49/169 (28.99%), Positives = 76/169 (44.97%), Query Frame = 1

Query: 9   PLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSY----KPKKRKRNCCRTCCC 68
           P+Q P       T       +S + H  P   P  + P  +     PKKR+  CCR  C 
Sbjct: 9   PVQDPEAATARPTAPLVPRGSSRSEHGDPSKVPLNQRPQRFVPLAPPKKRRSCCCRCFCY 68

Query: 69  IFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQVSIRV 128
            FCFL+L ++ V A ++ + YL++ PKLP + +   +++ F   A    S L +  ++ +
Sbjct: 69  TFCFLLLLVVAVGA-SIGILYLVFKPKLPDYSIDRLQLTRF---ALNQDSSLTTAFNVTI 128

Query: 129 EFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAE 174
             KNPNEK+ I Y             +     L  F QG  +TT +  E
Sbjct: 129 TAKNPNEKIGIYYEDGSKITVWYMEHQLSNGSLPKFYQGHENTTVIYVE 173

BLAST of CSPI06G03690 vs. NCBI nr
Match: gi|449446257|ref|XP_004140888.1| (PREDICTED: uncharacterized protein LOC101205096 [Cucumis sativus])

HSP 1 Score: 497.3 bits (1279), Expect = 1.6e-137
Identity = 252/253 (99.60%), Positives = 252/253 (99.60%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR
Sbjct: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
           TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVS TPDGSFLDSQV
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180
           SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM
Sbjct: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180

Query: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240
           LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT
Sbjct: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240

Query: 241 CNINLLRWINIRG 254
           CNINLLRWINIRG
Sbjct: 241 CNINLLRWINIRG 253

BLAST of CSPI06G03690 vs. NCBI nr
Match: gi|659089922|ref|XP_008445748.1| (PREDICTED: uncharacterized protein LOC103488682 [Cucumis melo])

HSP 1 Score: 463.8 bits (1192), Expect = 2.0e-127
Identity = 234/253 (92.49%), Positives = 243/253 (96.05%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           MADLP+KPPLQKPPGYKDH+T ATSSSSAST THLPPP R KPR PSSYKPKKRKRNCCR
Sbjct: 1   MADLPMKPPLQKPPGYKDHHTAATSSSSASTVTHLPPPPRSKPRLPSSYKPKKRKRNCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
           TCCCIFCFLILFLIVVAALALALFYL+YDPKLPVFHLLAFRIS+FKVSATPDGSFLD+QV
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLIYDPKLPVFHLLAFRISTFKVSATPDGSFLDAQV 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180
           SIRVEFKNPN+KLSIKYGKIEYDV VGQATEFGRRELAGFTQ RRSTTTVKAEAAVKNKM
Sbjct: 121 SIRVEFKNPNDKLSIKYGKIEYDVMVGQATEFGRRELAGFTQDRRSTTTVKAEAAVKNKM 180

Query: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCESKLKNIDGGDMPT 240
           LAVEDG RLLSKFQSKALEVKVEAET VGVV+QGWGLGPITVKLDCE+KLKNI+GGDMP 
Sbjct: 181 LAVEDGARLLSKFQSKALEVKVEAETAVGVVIQGWGLGPITVKLDCETKLKNIEGGDMPI 240

Query: 241 CNINLLRWINIRG 254
           CNINLLRWINIRG
Sbjct: 241 CNINLLRWINIRG 253

BLAST of CSPI06G03690 vs. NCBI nr
Match: gi|702333839|ref|XP_010055051.1| (PREDICTED: protein YLS9-like [Eucalyptus grandis])

HSP 1 Score: 216.5 bits (550), Expect = 5.5e-53
Identity = 115/254 (45.28%), Positives = 163/254 (64.17%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPP--PLRPKPRPPSSYKPKKRKRNC 60
           MA+ P KP LQKPPGY+D           S     PP  P R    PPS Y P+K++R+C
Sbjct: 1   MAEPPQKPMLQKPPGYRD----------PSVVVQQPPTQPYRKPVMPPSMY-PRKKRRSC 60

Query: 61  CRTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDS 120
           CR+CCC  C LI  ++ V  LA AL YL + PK+PVFHL +FRI  F V+A PDG++L +
Sbjct: 61  CRSCCCCLCVLIFLILCVLILAGALSYLWFGPKIPVFHLQSFRIPRFNVTAKPDGTYLKA 120

Query: 121 QVSIRVEFKNPNEKLSIKYGKIEYDVTVGQ--ATEFGRRELAGFTQGRRSTTTVKAEAAV 180
           Q  +RVE KNPN+KL + YG  + D+++G+    E G   L GFTQG+++ T++K    V
Sbjct: 121 QTVLRVEVKNPNQKLGLYYGGTDVDISLGRGGGIELGSDSLPGFTQGKKNVTSLKVTTEV 180

Query: 181 KNKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDC-ESKLKNIDG 240
           +++++    G  L S ++SK+L VKV+  T VG ++QGW +G + V ++C E  +K ++G
Sbjct: 181 RDELVEDGAGAELRSGYRSKSLVVKVKVRTSVGAIIQGWKVGRVRVNVECGEVAMKEVEG 240

Query: 241 GDMPTCNINLLRWI 250
           G+MP C INLLRWI
Sbjct: 241 GEMPKCKINLLRWI 243

BLAST of CSPI06G03690 vs. NCBI nr
Match: gi|590721513|ref|XP_007051635.1| (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative [Theobroma cacao])

HSP 1 Score: 214.9 bits (546), Expect = 1.6e-52
Identity = 117/252 (46.43%), Positives = 161/252 (63.89%), Query Frame = 1

Query: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60
           M + PLKP LQKPPGYKD +  A            PPP   KP  P S+ PKKR+  CCR
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFR------PPPR--KPVLPPSFHPKKRRGGCCR 60

Query: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQV 120
            CCC FC   L LI++  +  A+FYL +DPKLP FH+ + RIS F V+  PDG++LD+Q 
Sbjct: 61  VCCCCFCIFFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQT 120

Query: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQA---TEFGRRELAGFTQGRRSTTTVKAEAAVK 180
           + R+E KNPN K++  YG  E DV+VG+    TE G   + GFT G+++TT++K E  V 
Sbjct: 121 TTRLEVKNPNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVI 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           NK++    G RL ++++SK+L V VEA T++G+ V G  +G + V + C+   LK +DGG
Sbjct: 181 NKLVDDGVGTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGG 240

Query: 241 DMPTCNINLLRW 249
           DMP C IN+L+W
Sbjct: 241 DMPKCVINMLKW 244

BLAST of CSPI06G03690 vs. NCBI nr
Match: gi|703148826|ref|XP_010109444.1| (hypothetical protein L484_003064 [Morus notabilis])

HSP 1 Score: 211.8 bits (538), Expect = 1.4e-51
Identity = 117/253 (46.25%), Positives = 164/253 (64.82%), Query Frame = 1

Query: 1   MADLPLKPP-LQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCC 60
           MA+ PLKPP LQKPPGY+D          A+    +  P + KP  P+S+ P+KR+RN C
Sbjct: 1   MAEQPLKPPPLQKPPGYRD---------PAAPGKPVARPPQRKPVLPASFHPRKRRRNWC 60

Query: 61  RTCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSATPDGSFLDSQ 120
           RTCCC     +L L +  A+A  +FYL ++PKLPVFHL + RI  F V+  PDG++LD+ 
Sbjct: 61  RTCCCFVFVFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAG 120

Query: 121 VSIRVEFKNPNEKLSIKYGKIEYDVTVG--QATEFGRRELAGFTQGRRSTTTVKAEAAVK 180
              R+E KNPN KL + YG    +V+VG  +  E GR++L GFTQG+ +TT++K E  VK
Sbjct: 121 TVTRIEVKNPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVK 180

Query: 181 NKMLAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGG 240
           N+++    G RL S ++SK L VK+EA+T VG +VQG  +G + V + C    LK +D G
Sbjct: 181 NQLVDDGLGKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSG 240

Query: 241 DMPTCNINLLRWI 250
           DMP C+I+LL+W+
Sbjct: 241 DMPKCSIDLLKWV 244

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name        E-value   Identity  Description
A0A0A0KCD8_CUCSA  1.1e-137  99.60     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G042460 PE=4 SV=1
A0A061DTS6_THECC  1.1e-52   46.43     Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS...
W9SAG5_9ROSA      9.4e-52   46.25     Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1
A0A0B0NJM7_GOSAR  5.7e-49   43.53     D-alanine--poly (Phosphoribitol) ligase subunit 1 OS=Gossypium arboreum GN=F383_...
A0A0D2QQD3_GOSRA  1.7e-48   42.75     Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity  Description
AT2G46300.1  1.3e-47  40.38     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT4G01110.1  2.6e-29  35.02     unknown protein
AT1G01453.2  5.0e-28  32.81     unknown protein
AT1G17620.1  4.5e-13  25.55     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G65690.1  1.8e-09  28.99     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

vs. NCBI nr
Match Name                        E-value   Identity  Description
gi|449446257|ref|XP_004140888.1|  1.6e-137  99.60     PREDICTED: uncharacterized protein LOC101205096 [Cucumis sativus]
gi|659089922|ref|XP_008445748.1|  2.0e-127  92.49     PREDICTED: uncharacterized protein LOC103488682 [Cucumis melo]
gi|702333839|ref|XP_010055051.1|  5.5e-53   45.28     PREDICTED: protein YLS9-like [Eucalyptus grandis]
gi|590721513|ref|XP_007051635.1|  1.6e-52   46.43     Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative [T...
gi|703148826|ref|XP_010109444.1|  1.4e-51   46.25     hypothetical protein L484_003064 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI06G03690.1  CSPI06G03690.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                              Source   Source Term     Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168         LEA_2                                                         coord: 124..227, score: 9.9
None       No IPR available                             PANTHER  PTHR31234       FAMILY NOT NAMED                                              coord: 5..252, score: 9.0
None       No IPR available                             PANTHER  PTHR31234:SF12  LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 5..252, score: 9.0

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism               Block
CSPI06G03690  Watermelon (97103) v2  cpiwmbB531