CSPI06G00820 (gene) Wild cucumber (PI 183967)

Name: CSPI06G00820
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative
Location: Chr6 : 600439 .. 601428 (-)
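The location string can be split into its parts to get the span of the feature. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr6 : 600439 .. 601428 (-)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr6 : 600439 .. 601428 (-)")
print(chrom, start, end, strand, span_length(start, end))
# Chr6 600439 601428 - 990
```

Under that assumption the gene spans 990 bases on the minus strand of Chr6.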
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GCATGTCTTTGCTTGCCTTTAATTCAACATTGTTCAATGGCTTCAAATTTTGGGCACTTCCCTAAACCATCGCCTTATAAACAAAACTCAATAAGCAGTACTTAAAATCTCTGATAACCTTAAAGTTTTTCTCCACTGTTCCATAACCATGGTGGACAAGGACCAAGCTCAGCCTCTCACTCCAGCTACCCTCAATCGTTTGAGTAGCGACAGCGGCGAAACAAGATTACATCTAAAGAGAATCCAACGAAAAAGATTCATAAAATGTTGCAGTTTCATAGTAGCTCTTCTCATGATTCCAACGATAGTAATCATCATCATCTTGATGTTCACTCTATTTCAAATCAAGGATCCCATAATCCAAATGAATAGAGTTTCAATCACAAAGCTGGAGTTGATCAACAATGTCATACCAAAGCCAGGATCCAACGTGTCACTGACTGCCGATGTGTCAGTAAAAAATCCCAACATGGCATCGTTCAAGTACAGTAACACCACTACAACTCTATTCATCAATGAGACAGTGATAGGGGAGGTACGAGGGCCGTCGGGGAAAGCCAAGGCACGACAAACTGTGCGAATGAATGTCACCATTGACATTGTTGCCGATCGAGTTTTGTCGAACCTCAACAACGATGTGAGCTTGGGGAAGGTGAGATTGAGAAGCTTCTCGAGGATCCCAGGAAAAGTGAAGTTGTTGCATTTTATAGGGAGAAACGTTGTTGTCAAGATGAATTGTACATTCGTGATCAATATATTCAGTAAGTCAATTGAGGATCAGAAATGCAAGAGGAAGATGAAGATGTAGACTTTAATACTATATTTGTTCTTCATTCGAAAGCTGTTTTTTTTGTACACGCTGTGAAATTTAGAGTATGTTAATCTTTTTATATCCTCTATTCTTTTGGTTCTATATATACGAATAGCCTTTACAAAAGTTGTTTCACTCATTTGATCTTTTACCAAAATATCAAAATCTAACATACCTTA

mRNA sequence

ATGGTGGACAAGGACCAAGCTCAGCCTCTCACTCCAGCTACCCTCAATCGTTTGAGTAGCGACAGCGGCGAAACAAGATTACATCTAAAGAGAATCCAACGAAAAAGATTCATAAAATGTTGCAGTTTCATAGTAGCTCTTCTCATGATTCCAACGATAGTAATCATCATCATCTTGATGTTCACTCTATTTCAAATCAAGGATCCCATAATCCAAATGAATAGAGTTTCAATCACAAAGCTGGAGTTGATCAACAATGTCATACCAAAGCCAGGATCCAACGTGTCACTGACTGCCGATGTGTCAGTAAAAAATCCCAACATGGCATCGTTCAAGTACAGTAACACCACTACAACTCTATTCATCAATGAGACAGTGATAGGGGAGGTACGAGGGCCGTCGGGGAAAGCCAAGGCACGACAAACTGTGCGAATGAATGTCACCATTGACATTGTTGCCGATCGAGTTTTGTCGAACCTCAACAACGATGTGAGCTTGGGGAAGGTGAGATTGAGAAGCTTCTCGAGGATCCCAGGAAAAGTGAAGTTGTTGCATTTTATAGGGAGAAACGTTGTTGTCAAGATGAATTGTACATTCGTGATCAATATATTCAGTAAGTCAATTGAGGATCAGAAATGCAAGAGGAAGATGAAGATGTAG

Coding sequence (CDS)

ATGGTGGACAAGGACCAAGCTCAGCCTCTCACTCCAGCTACCCTCAATCGTTTGAGTAGCGACAGCGGCGAAACAAGATTACATCTAAAGAGAATCCAACGAAAAAGATTCATAAAATGTTGCAGTTTCATAGTAGCTCTTCTCATGATTCCAACGATAGTAATCATCATCATCTTGATGTTCACTCTATTTCAAATCAAGGATCCCATAATCCAAATGAATAGAGTTTCAATCACAAAGCTGGAGTTGATCAACAATGTCATACCAAAGCCAGGATCCAACGTGTCACTGACTGCCGATGTGTCAGTAAAAAATCCCAACATGGCATCGTTCAAGTACAGTAACACCACTACAACTCTATTCATCAATGAGACAGTGATAGGGGAGGTACGAGGGCCGTCGGGGAAAGCCAAGGCACGACAAACTGTGCGAATGAATGTCACCATTGACATTGTTGCCGATCGAGTTTTGTCGAACCTCAACAACGATGTGAGCTTGGGGAAGGTGAGATTGAGAAGCTTCTCGAGGATCCCAGGAAAAGTGAAGTTGTTGCATTTTATAGGGAGAAACGTTGTTGTCAAGATGAATTGTACATTCGTGATCAATATATTCAGTAAGTCAATTGAGGATCAGAAATGCAAGAGGAAGATGAAGATGTAG
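As a quick consistency check, the coding sequence can be translated and compared with the protein query used in the BLAST alignments. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 30 and last 15 bases of the CDS above; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGTGGACAAGGACCAAGCTCAGCCTCTC"))  # MVDKDQAQPL
print(translate("AAGATGAAGATGTAG"))                 # KMKM
```

The two fragments translate to the first ten residues (MVDKDQAQPL) and the last four residues (KMKM) of the query protein shown in the alignments.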
BLAST of CSPI06G00820 vs. Swiss-Prot
Match: Y1465_ARATH (Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana GN=At1g64065 PE=2 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 1.2e-08
Identity = 56/220 (25.45%), Positives = 112/220 (50.91%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MVD+D+          R   +    R+  ++ +     KC  + + +++I    + +IL 
Sbjct: 1   MVDEDRITLAPTEIYGRSDEEQSGPRIWRRKTEEPPG-KCLVYSLTIIVI-IFALCLILS 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTT-TT 120
               +I  P I+   +S   L    N    P  N +L +D+S++N N  +F++ ++T   
Sbjct: 61  SIFLRISKPEIETRSISTRDLRSGGNST-NPYFNATLVSDISIRNSNFGAFEFEDSTLRV 120

Query: 121 LFINETVIGEVRGPSGKAKARQTVRM-NVTIDIVADRVL--SNLNNDVSLGKVRLRSFSR 180
           ++ +  V+GE +    + +A +TVR+  V ++I + R+L   +L+ D+ LG + LRS + 
Sbjct: 121 VYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLLDTKDLDKDLRLGFLELRSVAE 180

Query: 181 IPGKVKLLHFIGRN--VVVKMNCTFVINIFSKSIEDQKCK 215
           + G++K+L   GR    V  M+CT  +N+  + I++  C+
Sbjct: 181 VRGRIKVL---GRKRWKVSVMSCTMRLNLTGRFIQNLLCE 214
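The percentages in an HSP header are the reported counts divided by the alignment length. A minimal sketch that reproduces the two figures for the HSP above; `percent` is a hypothetical helper name.

```python
def percent(count, length):
    """Return count/length as a percentage string with two decimals."""
    return f"{100 * count / length:.2f}"

# Counts taken from the HSP header above (alignment length 220).
print(percent(56, 220))   # identical residues -> 25.45
print(percent(112, 220))  # positive-scoring residues -> 50.91
```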

BLAST of CSPI06G00820 vs. TrEMBL
Match: A0A0A0KD33_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006820 PE=4 SV=1)

HSP 1 Score: 411.0 bits (1055), Expect = 9.2e-112
Identity = 218/219 (99.54%), Positives = 219/219 (100.00%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MVDKDQAQPLTPATLNRLSSD+GETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM
Sbjct: 1   MVDKDQAQPLTPATLNRLSSDNGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180
           FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK
Sbjct: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM
Sbjct: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of CSPI06G00820 vs. TrEMBL
Match: A0A061E0Q4_THECC (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 OS=Theobroma cacao GN=TCM_005250 PE=4 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 3.3e-61
Identity = 124/221 (56.11%), Positives = 176/221 (79.64%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           +VD+DQ +PL PA+ +  SSD GE  L LK++QRK+ +KCC  I AL++I  +VIII L+
Sbjct: 2   VVDRDQVRPLAPAS-DLPSSDDGEAALQLKKVQRKKCVKCCGCIAALMIIQAVVIII-LV 61

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+F++KDP+I+MN V++T LELIN   PKPGSN+SL ADVSVKNPN+ASFKY NTTTTL
Sbjct: 62  FTVFRVKDPVIKMNGVAVTHLELINGTTPKPGSNISLIADVSVKNPNVASFKYKNTTTTL 121

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLS--NLNNDVSLGKVRLRSFSRIP 180
           +   T++GE RGP+G+AKAR+T+RMN+++DI+ DR+L+  NL  DV+ G + + S+SRI 
Sbjct: 122 YYYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGTLTMSSYSRIG 181

Query: 181 GKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           G+V +L+ I ++V VKMNC+  +NI S++I++QKCKRK+ +
Sbjct: 182 GRVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCKRKVDL 220

BLAST of CSPI06G00820 vs. TrEMBL
Match: A0A059BZB0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00095 PE=4 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 1.5e-58
Identity = 121/221 (54.75%), Positives = 167/221 (75.57%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MV++DQ  PL P+    L SD  E  +  K  +++RFIKCC  I A ++I  +VIII L 
Sbjct: 1   MVERDQVSPLAPS--GSLRSDQDEASVFAKNFRKRRFIKCCGCIAAFMLIQAVVIII-LA 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+F++KDP+I+MN V+ITKLELIN  IPKPGSN+SL AD+SVKNPN+ASFKY NTTTTL
Sbjct: 61  FTVFRVKDPVIKMNGVTITKLELINGTIPKPGSNMSLLADISVKNPNVASFKYKNTTTTL 120

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLN--NDVSLGKVRLRSFSRIP 180
           + + TV+GE RGP GK++AR+T+RMN+++DI+ D +LSN N   D+    + + S+SRIP
Sbjct: 121 YYHGTVVGEARGPPGKSRARRTMRMNISVDIITDMLLSNPNLIEDMKQQLLPMSSYSRIP 180

Query: 181 GKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           G+V +L+ I ++V VKMNCT  INI S++I++QKCKR + +
Sbjct: 181 GRVNMLNIIKKHVTVKMNCTMTINITSRAIQEQKCKRHVNI 218

BLAST of CSPI06G00820 vs. TrEMBL
Match: M5X4C9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023497mg PE=4 SV=1)

HSP 1 Score: 231.5 bits (589), Expect = 1.0e-57
Identity = 125/219 (57.08%), Positives = 167/219 (76.26%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MV+K+Q +PL PA  N  SSD+ E  LH K+   K+FI CC  I ALL+I   V+IIIL 
Sbjct: 1   MVEKEQVRPLAPAA-NGQSSDADEAALHSKKFGLKKFIYCCGGITALLLI-LAVVIIILA 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELIN-NVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTT 120
           FT+F++K+P I+MN+V++T+LELIN N  PKPGSN+SLTADVSVKNPN ASF+Y+NTTTT
Sbjct: 61  FTVFRLKEPKIKMNKVTVTRLELINDNTTPKPGSNISLTADVSVKNPNAASFRYNNTTTT 120

Query: 121 LFINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSN--LNNDVSLGKVRLRSFSRI 180
           L+ +  V+GE  G  GKAKAR+T+RMN+T+D++ DR+ SN     DV  G + + S+SRI
Sbjct: 121 LYYHGVVVGEAHGSPGKAKARRTMRMNITVDVITDRLTSNPKWGADVGSGLLTMSSYSRI 180

Query: 181 PGKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRK 217
           PG+V + + I R+VVVKMNCT  +NI S++I++QKCKRK
Sbjct: 181 PGRVNMWNIIKRHVVVKMNCTMTVNISSQAIQEQKCKRK 217

BLAST of CSPI06G00820 vs. TrEMBL
Match: B9RDY9_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1616750 PE=4 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 2.5e-56
Identity = 115/221 (52.04%), Positives = 169/221 (76.47%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MV+ +Q +PL P+  +R SSD  E  +HLK+ +R+R IKCC  I A L++P IVI+I L+
Sbjct: 1   MVEHEQVRPLAPSA-DRTSSDDEEATIHLKKTRRRRCIKCCGCITASLLVPAIVIVI-LI 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+F++KDP I++N V IT +ELINN IPKPG+N+SL AD+SVKNPN+ SFKY NTT+ L
Sbjct: 61  FTVFRVKDPTIKLNNVIITHMELINNTIPKPGTNISLVADLSVKNPNIVSFKYDNTTSAL 120

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLS--NLNNDVSLGKVRLRSFSRIP 180
           + +  ++GE RGP G +KAR+T+R+N TID+VAD+++S  NLN D + G + + S++++P
Sbjct: 121 YYHGVLVGEARGPPGHSKARRTMRLNATIDLVADKLISNPNLNTDAATGLLTVDSYTKLP 180

Query: 181 GKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           GKVK+L  I ++V +KMNC+  +NI S++I+ QKCK K+ +
Sbjct: 181 GKVKIL-IIKKHVTIKMNCSLTVNISSQAIQSQKCKNKVDL 218

BLAST of CSPI06G00820 vs. TAIR10
Match: AT2G46150.1 (AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 166.0 bits (419), Expect = 2.6e-41
Identity = 94/224 (41.96%), Positives = 144/224 (64.29%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           M D +  +PL PAT+  +S +S     ++K   R R    CS  V    +    I++ L+
Sbjct: 1   MADSEHVRPLAPATILPVSDESAS---NIKNTHRSRNRIKCSICVTATSLILTTIVLTLV 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELIN--NVIPKPGSNVSLTADVSVKNPNMASFKYSNTTT 120
           FT+F++KDPII+MN V +  L+ +   N +   G+N+S+  DVSVKNPN ASFKYSNTTT
Sbjct: 61  FTVFRVKDPIIKMNGVMVNGLDSVTGTNQVQLLGTNISMIVDVSVKNPNTASFKYSNTTT 120

Query: 121 TLFINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSN--LNNDVS-LGKVRLRSFS 180
            ++   T++GE  G  GKA+  +T RMNVT+DI+ DR+LS+  L  ++S  G V + S++
Sbjct: 121 DIYYKGTLVGEAHGLPGKARPHRTSRMNVTVDIMLDRILSDPGLGREISRSGLVNVWSYT 180

Query: 181 RIPGKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           R+ GKVK++  + ++V VKMNCT  +NI  ++I+D  CK+K+ +
Sbjct: 181 RVGGKVKIMGIVKKHVTVKMNCTMAVNITGQAIQDVDCKKKIDL 221

BLAST of CSPI06G00820 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 105.9 bits (263), Expect = 3.2e-23
Identity = 65/193 (33.68%), Positives = 112/193 (58.03%), Query Frame = 1

Query: 30  KRIQRKRFIKCCSFIVALLMIPTIVIIIILMFTLFQIKDPIIQMNRVSITKLEL-INNVI 89
           K+++RKR  K C     LL++   ++I+IL FTLF+ K P   ++ V++ +L+  +N ++
Sbjct: 43  KKLRRKRNCKICICFTILLILLIAIVIVILAFTLFKPKRPTTTIDSVTVDRLQASVNPLL 102

Query: 90  PKPGSNVSLTADVSVKNPNMASFKYSNTTTTLFINETVIGEVRGPSGKAKARQTVRMNVT 149
            K   N++L  D+S+KNPN   F Y +++  L     VIGE   P+ +  AR+TV +N+T
Sbjct: 103 LKVLLNLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAARKTVPLNIT 162

Query: 150 IDIVADRVLS--NLNNDVSLGKVRLRSFSRIPGKVKLLHFIGRNVVVKMNCTFVINIFSK 209
           + ++ADR+LS   L +DV  G + L +F ++ GKV +L      V    +C   I++  +
Sbjct: 163 LTLMADRLLSETQLLSDVMAGVIPLNTFVKVTGKVTVLKIFKIKVQSSSSCDLSISVSDR 222

Query: 210 SIEDQKCKRKMKM 220
           ++  Q CK   K+
Sbjct: 223 NVTSQHCKYSTKL 235

BLAST of CSPI06G00820 vs. TAIR10
Match: AT4G23610.1 (AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 71.2 bits (173), Expect = 8.7e-13
Identity = 58/206 (28.16%), Positives = 107/206 (51.94%), Query Frame = 1

Query: 2   VDKDQAQPLTPATLN-RLSSDSGETRLHLKRIQ----RKRFIKCCSFIVALLMIPTIVII 61
           +++DQA+PL P  L  R      E + H  R +    + + I CC FI +L M+   V  
Sbjct: 8   INEDQAKPLAPLFLTTRSDQPDEEDQYHHDRTKYVHSQTKLILCCGFIASLTML-IAVTF 67

Query: 62  IILMFTLFQIKDPIIQMNRVSIT-KLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSN 121
           I+L  T+F +  P + ++ +S   + + +N  +     N +++ ++S+ NPN A F   N
Sbjct: 68  IVLSLTVFHLHSPNLTVDSISFNQRFDFVNGKV-NTNQNTTVSVEISLHNPNPALFIVKN 127

Query: 122 TTTTLFINE-TVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSN---LNNDVSLGKVRL 181
              + +  E  V+GE    S    A++TV+MN+T +IV  ++L++   L  D++   V L
Sbjct: 128 VNVSFYHGELVVVGESIRRSETIPAKRTVKMNLTAEIVKTKLLASLPGLMEDLNGRGVDL 187

Query: 182 RSFSRIPGKVKLLHFIGRNVVVKMNC 198
           +S   + G+VK +    + V ++ +C
Sbjct: 188 KSSVEVRGRVKKMKIFRKTVHLQTDC 211

BLAST of CSPI06G00820 vs. TAIR10
Match: AT3G05975.1 (AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 65.5 bits (158), Expect = 4.8e-11
Identity = 45/183 (24.59%), Positives = 93/183 (50.82%), Query Frame = 1

Query: 40  CCSFIVALLMIPTIVIIIILMFTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTA 99
           CC     + ++  I +  +++  +F+ K PI+Q    ++  +    ++  +   N +LT 
Sbjct: 7   CCIVSGIIFVLFVIFMTALILAQVFKPKHPILQTVSSTVDGISTNISLPYEVQLNFTLTL 66

Query: 100 DVSVKNPNMASFKYSNTTTTLFINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSN 159
           ++ +KNPN+A F+Y      ++  +T++G +  PS    A+ +V +   + +  D+ ++N
Sbjct: 67  EMLLKNPNVADFEYKTVENLVYYRDTLVGNLTLPSSTLPAKGSVLLPCPLFLQLDKFVAN 126

Query: 160 LN---NDVSLGKVRLRSFSRIPGKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRK 219
           L     DV  GK+ + + +++PGK+ LL      +    +C  V+   S  +EDQ C  K
Sbjct: 127 LGDIVQDVLHGKIVMETRAKMPGKITLLGIFKIPLDSISHCNLVLGFPSMVVEDQVCDLK 186

BLAST of CSPI06G00820 vs. TAIR10
Match: AT1G64065.1 (AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 61.6 bits (148), Expect = 6.9e-10
Identity = 56/220 (25.45%), Positives = 112/220 (50.91%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MVD+D+          R   +    R+  ++ +     KC  + + +++I    + +IL 
Sbjct: 1   MVDEDRITLAPTEIYGRSDEEQSGPRIWRRKTEEPPG-KCLVYSLTIIVI-IFALCLILS 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTT-TT 120
               +I  P I+   +S   L    N    P  N +L +D+S++N N  +F++ ++T   
Sbjct: 61  SIFLRISKPEIETRSISTRDLRSGGNST-NPYFNATLVSDISIRNSNFGAFEFEDSTLRV 120

Query: 121 LFINETVIGEVRGPSGKAKARQTVRM-NVTIDIVADRVL--SNLNNDVSLGKVRLRSFSR 180
           ++ +  V+GE +    + +A +TVR+  V ++I + R+L   +L+ D+ LG + LRS + 
Sbjct: 121 VYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLLDTKDLDKDLRLGFLELRSVAE 180

Query: 181 IPGKVKLLHFIGRN--VVVKMNCTFVINIFSKSIEDQKCK 215
           + G++K+L   GR    V  M+CT  +N+  + I++  C+
Sbjct: 181 VRGRIKVL---GRKRWKVSVMSCTMRLNLTGRFIQNLLCE 214

BLAST of CSPI06G00820 vs. NCBI nr
Match: gi|778709203|ref|XP_011656360.1| (PREDICTED: uncharacterized protein LOC105435724 [Cucumis sativus])

HSP 1 Score: 411.0 bits (1055), Expect = 1.3e-111
Identity = 218/219 (99.54%), Positives = 219/219 (100.00%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MVDKDQAQPLTPATLNRLSSD+GETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM
Sbjct: 1   MVDKDQAQPLTPATLNRLSSDNGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180
           FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK
Sbjct: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM
Sbjct: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of CSPI06G00820 vs. NCBI nr
Match: gi|659116614|ref|XP_008458164.1| (PREDICTED: uncharacterized protein LOC103497685 [Cucumis melo])

HSP 1 Score: 395.6 bits (1015), Expect = 5.7e-107
Identity = 210/219 (95.89%), Positives = 214/219 (97.72%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MV KDQAQPLTPATL+RLSSD+GET LHLKRIQRKRFIKCCSFI ALL+IPTIVIIIILM
Sbjct: 1   MVGKDQAQPLTPATLDRLSSDNGETELHLKRIQRKRFIKCCSFIAALLIIPTIVIIIILM 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FTLFQIKDPII+MNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL
Sbjct: 61  FTLFQIKDPIIRMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180
           FINETVIGEVRGP GKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK
Sbjct: 121 FINETVIGEVRGPPGKAKARQTVRMNVTIDIVADRVLSNLNNDVSLGKVRLRSFSRIPGK 180

Query: 181 VKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           VKLLH IGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM
Sbjct: 181 VKLLHLIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 219

BLAST of CSPI06G00820 vs. NCBI nr
Match: gi|470142034|ref|XP_004306727.1| (PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca subsp. vesca])

HSP 1 Score: 247.3 bits (630), Expect = 2.5e-62
Identity = 126/221 (57.01%), Positives = 174/221 (78.73%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MV+K+QA+PL PA   R SSD  E  LH+K  +RK+FI CC  I A+++I  +VIII L 
Sbjct: 1   MVEKEQARPLAPAGY-RPSSDDNEAALHMKIARRKKFINCCGCITAIVLIQAVVIII-LA 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+F++K+P I MN+V++TKLEL+N   PKPG+N+SLTADVSVKNPN+ASFKYSNTTTTL
Sbjct: 61  FTVFRVKEPKIMMNKVTVTKLELVNGTTPKPGTNISLTADVSVKNPNVASFKYSNTTTTL 120

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLS--NLNNDVSLGKVRLRSFSRIP 180
           + + TV+GE RGP G+AKAR+T+RMN+T+DI+ D + +  NL  DV  G + + S+SRIP
Sbjct: 121 YYHGTVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGLLTMSSYSRIP 180

Query: 181 GKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           G+V +L+ + ++VVVKMNCT  +NI S++I++QKCKRK+ +
Sbjct: 181 GRVNMLNIVKKHVVVKMNCTMTVNISSQAIQEQKCKRKVSL 219

BLAST of CSPI06G00820 vs. NCBI nr
Match: gi|590721704|ref|XP_007051691.1| (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 243.0 bits (619), Expect = 4.7e-61
Identity = 124/221 (56.11%), Positives = 176/221 (79.64%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           +VD+DQ +PL PA+ +  SSD GE  L LK++QRK+ +KCC  I AL++I  +VIII L+
Sbjct: 2   VVDRDQVRPLAPAS-DLPSSDDGEAALQLKKVQRKKCVKCCGCIAALMIIQAVVIII-LV 61

Query: 61  FTLFQIKDPIIQMNRVSITKLELINNVIPKPGSNVSLTADVSVKNPNMASFKYSNTTTTL 120
           FT+F++KDP+I+MN V++T LELIN   PKPGSN+SL ADVSVKNPN+ASFKY NTTTTL
Sbjct: 62  FTVFRVKDPVIKMNGVAVTHLELINGTTPKPGSNISLIADVSVKNPNVASFKYKNTTTTL 121

Query: 121 FINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLS--NLNNDVSLGKVRLRSFSRIP 180
           +   T++GE RGP+G+AKAR+T+RMN+++DI+ DR+L+  NL  DV+ G + + S+SRI 
Sbjct: 122 YYYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGTLTMSSYSRIG 181

Query: 181 GKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           G+V +L+ I ++V VKMNC+  +NI S++I++QKCKRK+ +
Sbjct: 182 GRVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCKRKVDL 220

BLAST of CSPI06G00820 vs. NCBI nr
Match: gi|694426930|ref|XP_009341124.1| (PREDICTED: uncharacterized protein LOC103933187 [Pyrus x bretschneideri])

HSP 1 Score: 242.7 bits (618), Expect = 6.2e-61
Identity = 125/222 (56.31%), Positives = 176/222 (79.28%), Query Frame = 1

Query: 1   MVDKDQAQPLTPATLNRLSSDSGETRLHLKRIQRKRFIKCCSFIVALLMIPTIVIIIILM 60
           MVD++Q +PL PA ++  SSD+ E   HLK+++R++FIKCC  I A+++I  +VIII L 
Sbjct: 1   MVDREQVRPLAPAAIHP-SSDADEAAFHLKKVRRRKFIKCCGCITAVILIQAVVIII-LA 60

Query: 61  FTLFQIKDPIIQMNRVSITKLELINN-VIPKPGSNVSLTADVSVKNPNMASFKYSNTTTT 120
           FT+F++K+P I+MN+V+IT+LELINN   PKPGSN+SL ADVSVKNPN ASFKY+NTTTT
Sbjct: 61  FTVFRVKEPKIKMNKVTITRLELINNNTAPKPGSNISLIADVSVKNPNFASFKYTNTTTT 120

Query: 121 LFINETVIGEVRGPSGKAKARQTVRMNVTIDIVADRVLS--NLNNDVSLGKVRLRSFSRI 180
           L+ +  V+GE  G  G AKAR+T+RMN+T+D++ DR+ S  NL  D + G + + S+SRI
Sbjct: 121 LYYHGVVVGEAHGAPGHAKARRTMRMNITVDMITDRLTSDPNLRADFNSGLMTMSSYSRI 180

Query: 181 PGKVKLLHFIGRNVVVKMNCTFVINIFSKSIEDQKCKRKMKM 220
           PG+VKLL+ I ++V+VKMNCT  +NI S++I++QKCKRK+K+
Sbjct: 181 PGRVKLLNIIKKHVIVKMNCTMTVNISSQAIQEQKCKRKVKL 220

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
Y1465_ARATH  1.2e-08  25.45         Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana GN=At1g640...

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KD33_CUCSA  9.2e-112  99.54         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G006820 PE=4 SV=1
A0A061E0Q4_THECC  3.3e-61   56.11         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putat...
A0A059BZB0_EUCGR  1.5e-58   54.75         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E00095 PE=4 SV=1
M5X4C9_PRUPE      1.0e-57   57.08         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023497mg PE=4 SV=1
B9RDY9_RICCO      2.5e-56   52.04         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1616750 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT2G46150.1  2.6e-41  41.96         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT3G54200.1  3.2e-23  33.68         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT4G23610.1  8.7e-13  28.16         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT3G05975.1  4.8e-11  24.59         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G64065.1  6.9e-10  25.45         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|778709203|ref|XP_011656360.1|  1.3e-111  99.54         PREDICTED: uncharacterized protein LOC105435724 [Cucumis sativus]
gi|659116614|ref|XP_008458164.1|  5.7e-107  95.89         PREDICTED: uncharacterized protein LOC103497685 [Cucumis melo]
gi|470142034|ref|XP_004306727.1|  2.5e-62   57.01         PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca subsp. vesca]
gi|590721704|ref|XP_007051691.1|  4.7e-61   56.11         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putat...
gi|694426930|ref|XP_009341124.1|  6.2e-61   56.31         PREDICTED: uncharacterized protein LOC103933187 [Pyrus x bretschneideri]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
IPR013783  Ig-like_fold

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0005975      carbohydrate metabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0046872      metal ion binding
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI06G00820.1  CSPI06G00820.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                              Source   Source Term       Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168           LEA_2                                                         coord: 101..198, score: 2.1
IPR013783  Immunoglobulin-like fold                     GENE3D   G3DSA:2.60.40.10  -                                                             coord: 93..150, score: 1.
None       No IPR available                             PANTHER  PTHR31852         FAMILY NOT NAMED                                              coord: 3..219, score: 6.5
None       No IPR available                             PANTHER  PTHR31852:SF6     LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 3..219, score: 6.5
None       No IPR available                             unknown  SSF117070         LEA14-like                                                    coord: 63..160, score: 4.5

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                      Block
CSPI06G00820  Cla011406        Watermelon (97103) v1         cpiwmB523
CSPI06G00820  Cla006636        Watermelon (97103) v1         cpiwmB559
CSPI06G00820  Csa5G152140      Cucumber (Chinese Long) v2    cpicuB309
CSPI06G00820  Csa6G006820      Cucumber (Chinese Long) v2    cpicuB314
CSPI06G00820  MELO3C021179     Melon (DHL92) v3.5.1          cpimeB418
CSPI06G00820  MELO3C005720     Melon (DHL92) v3.5.1          cpimeB412
CSPI06G00820  ClCG06G003400    Watermelon (Charleston Gray)  cpiwcgB507
CSPI06G00820  ClCG01G002260    Watermelon (Charleston Gray)  cpiwcgB455
CSPI06G00820  Lsi09G002080     Bottle gourd (USVL1VR-Ls)     cpilsiB408
CSPI06G00820  Lsi09G015980     Bottle gourd (USVL1VR-Ls)     cpilsiB397
CSPI06G00820  MELO3C021179.2   Melon (DHL92) v3.6.1          cpimedB413
CSPI06G00820  MELO3C005720.2   Melon (DHL92) v3.6.1          cpimedB408
CSPI06G00820  CsaV3_5G002330   Cucumber (Chinese Long) v3    cpicucB359
CSPI06G00820  Cla97C06G112770  Watermelon (97103) v2         cpiwmbB499
CSPI06G00820  Cla97C01G002300  Watermelon (97103) v2         cpiwmbB435
CSPI06G00820  Cucsa.303600     Cucumber (Gy14) v1            cgycpiB458
CSPI06G00820  Cucsa.135790     Cucumber (Gy14) v1            cgycpiB198
CSPI06G00820  CmaCh15G012560   Cucurbita maxima (Rimu)       cmacpiB319
CSPI06G00820  CmaCh17G004580   Cucurbita maxima (Rimu)       cmacpiB372
CSPI06G00820  CmaCh02G014190   Cucurbita maxima (Rimu)       cmacpiB647
CSPI06G00820  CmoCh02G014510   Cucurbita moschata (Rifu)     cmocpiB641
CSPI06G00820  CmoCh17G004350   Cucurbita moschata (Rifu)     cmocpiB360
CSPI06G00820  Cp4.1LG12g03950  Cucurbita pepo (Zucchini)     cpecpiB143
CSPI06G00820  CsGy6G000890     Cucumber (Gy14) v2            cgybcpiB288
CSPI06G00820  CsGy5G002240     Cucumber (Gy14) v2            cgybcpiB243
CSPI06G00820  Carg02561        Silver-seed gourd             carcpiB0966
CSPI06G00820  Carg26025        Silver-seed gourd             carcpiB0649

The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism                   Block
CSPI06G00820  CSPI05G04930  Wild cucumber (PI 183967)  cpicpiB182