Csa5G152140 (gene) Cucumber (Chinese Long) v2

NameCsa5G152140
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionLate embryogenesis abundant hydroxyproline-rich glycoprotein; contains IPR004864 (Late embryogenesis abundant protein, LEA-14), IPR013783 (Immunoglobulin-like fold)
LocationChr5 : 4722482 .. 4723262 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGATAAAGAGCAGGTCAAACCCTTGGCTTCCGCCTCCGCCGAGCTCCGGAGTGACGACCATATCTTTCTTCCTCCTCCTGATCCCAAGCTCCACCTCCATAGAAACAAATATATCATGTGCTGTGGCTGCTTCGCCGCTCTCCTCTTAATCCTTGCCGTCATCGGTATCGTCCTCGGCTTCACCGTCTTCCATATCAAAACCCCAGATATTAAAATTGATTCCCTCTCCTTCCCAAATGACACTTTGAGTTCAAGTAAGTTTGCAATGACTCTTTTGAAATAACGAAAAGTACTCAAAATTGACCAGTACGGTTATACATGAACGATTATTTCTCTTCAAAGTCATTCTAAACTCGTATTTTCTTTCATATTTCTCAGATAGTGGGATAATTGTTGTGGCTAGCGTCTCGGTGAGGAACCCTAATGTTGCGTCGTTCAAATACTCGAAAGCTTCGATCGAGATTTACTACCACGACAAGGTCATCGGCGAGGGCGAGACACCACCAGGAGAGGTTAAGGCGAAAGACACGCTAAGGATGAATGTGACGGTAGAGATTGAACCTTGGAAGATGGATGATGCTTCGAGTTTGATAAAGGATTGGAATTCAGGATCTTTGAGTATAAGTAGCTACACGGAAATTCCTGGAAGAGTGAAAATACTTGGCTCCATCAAGAAAAACTATTTGGTGAAAATAAGCTGTTCATTGACTTACAATTCAAAAAGCAAGACGATTCAAGGACAAGATTGTGATCAACGTGTAAGAATCTCTGTTTAA

mRNA sequence

ATGGCTGATAAAGAGCAGGTCAAACCCTTGGCTTCCGCCTCCGCCGAGCTCCGGAGTGACGACCATATCTTTCTTCCTCCTCCTGATCCCAAGCTCCACCTCCATAGAAACAAATATATCATGTGCTGTGGCTGCTTCGCCGCTCTCCTCTTAATCCTTGCCGTCATCGGTATCGTCCTCGGCTTCACCGTCTTCCATATCAAAACCCCAGATATTAAAATTGATTCCCTCTCCTTCCCAAATGACACTTTGAGTTCAAATAGTGGGATAATTGTTGTGGCTAGCGTCTCGGTGAGGAACCCTAATGTTGCGTCGTTCAAATACTCGAAAGCTTCGATCGAGATTTACTACCACGACAAGGTCATCGGCGAGGGCGAGACACCACCAGGAGAGGTTAAGGCGAAAGACACGCTAAGGATGAATGTGACGGTAGAGATTGAACCTTGGAAGATGGATGATGCTTCGAGTTTGATAAAGGATTGGAATTCAGGATCTTTGAGTATAAGTAGCTACACGGAAATTCCTGGAAGAGTGAAAATACTTGGCTCCATCAAGAAAAACTATTTGGTGAAAATAAGCTGTTCATTGACTTACAATTCAAAAAGCAAGACGATTCAAGGACAAGATTGTGATCAACGTGTAAGAATCTCTGTTTAA

Coding sequence (CDS)

ATGGCTGATAAAGAGCAGGTCAAACCCTTGGCTTCCGCCTCCGCCGAGCTCCGGAGTGACGACCATATCTTTCTTCCTCCTCCTGATCCCAAGCTCCACCTCCATAGAAACAAATATATCATGTGCTGTGGCTGCTTCGCCGCTCTCCTCTTAATCCTTGCCGTCATCGGTATCGTCCTCGGCTTCACCGTCTTCCATATCAAAACCCCAGATATTAAAATTGATTCCCTCTCCTTCCCAAATGACACTTTGAGTTCAAATAGTGGGATAATTGTTGTGGCTAGCGTCTCGGTGAGGAACCCTAATGTTGCGTCGTTCAAATACTCGAAAGCTTCGATCGAGATTTACTACCACGACAAGGTCATCGGCGAGGGCGAGACACCACCAGGAGAGGTTAAGGCGAAAGACACGCTAAGGATGAATGTGACGGTAGAGATTGAACCTTGGAAGATGGATGATGCTTCGAGTTTGATAAAGGATTGGAATTCAGGATCTTTGAGTATAAGTAGCTACACGGAAATTCCTGGAAGAGTGAAAATACTTGGCTCCATCAAGAAAAACTATTTGGTGAAAATAAGCTGTTCATTGACTTACAATTCAAAAAGCAAGACGATTCAAGGACAAGATTGTGATCAACGTGTAAGAATCTCTGTTTAA

Protein sequence

MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVLGFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV*
BLAST of Csa5G152140 vs. Swiss-Prot
Match: Y1465_ARATH (Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana GN=At1g64065 PE=2 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 1.8e-12
Identity = 49/171 (28.65%), Postives = 88/171 (51.46%), Query Frame = 1

Query: 49  LLLILAVIGIVLGFTVFHIKTPDIKIDSLSFPNDTLSSNS-----GIIVVASVSVRNPNV 108
           +++I+  + ++L      I  P+I+  S+S  +     NS        +V+ +S+RN N 
Sbjct: 46  IIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNSTNPYFNATLVSDISIRNSNF 105

Query: 109 ASFKYSKASIEIYYHDK-VIGEGETPPGEVKAKDTLRMN-VTVEIEPWKMDDASSLIKDW 168
            +F++  +++ + Y D  V+GE +     V+A  T+R+  V VEI  +++ D   L KD 
Sbjct: 106 GAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLLDTKDLDKDL 165

Query: 169 NSGSLSISSYTEIPGRVKILGSIKKNYLVKI-SCSLTYNSKSKTIQGQDCD 212
             G L + S  E+ GR+K+LG  +K + V + SC++  N   + IQ   C+
Sbjct: 166 RLGFLELRSVAEVRGRIKVLG--RKRWKVSVMSCTMRLNLTGRFIQNLLCE 214

BLAST of Csa5G152140 vs. TrEMBL
Match: A0A0A0KMH1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G152140 PE=4 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 2.6e-119
Identity = 218/218 (100.00%), Postives = 218/218 (100.00%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL
Sbjct: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60

Query: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 120
           GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK
Sbjct: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 120

Query: 121 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 180
           VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI
Sbjct: 121 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 180

Query: 181 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 219
           LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV
Sbjct: 181 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 218

BLAST of Csa5G152140 vs. TrEMBL
Match: M5WS18_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021244mg PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 1.9e-48
Identity = 106/223 (47.53%), Postives = 149/223 (66.82%), Query Frame = 1

Query: 1   MADKEQVKPLASASA-ELRSDDH-IFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGI 60
           MA KEQ KPLA A++  LRSD+  +F+      + L + KY+MCCGC +AL LI+AV  I
Sbjct: 1   MAAKEQGKPLAPANSYHLRSDEEEVFV---SSHIKLCQRKYVMCCGCVSALFLIIAVTAI 60

Query: 61  VLGFTVFHIKTP-----DIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASI 120
           VLGFTVFH+K P     D+ I  L   N  L S++ + ++A VS++NPNVASFKY   + 
Sbjct: 61  VLGFTVFHVKGPRIKMNDVTIQQLEVANGALRSDTNVTLLADVSIKNPNVASFKYGNTTT 120

Query: 121 EIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTE 180
            +YY    +G+G TP G  KA+ T+RMNVTV+I P ++      IK+  SG L++S+YT 
Sbjct: 121 RVYYSGTEVGQGRTPAGVAKARRTMRMNVTVDIVPGEISAVPGFIKEVASGKLTVSTYTR 180

Query: 181 IPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRI 217
           I G+VKIL  + KN +V+++CS+TYN  SK I+G+DC +RV +
Sbjct: 181 IEGKVKIL-MVNKNVVVELNCSMTYNFASKGIEGEDCKRRVSL 219

BLAST of Csa5G152140 vs. TrEMBL
Match: A0A061G639_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS=Theobroma cacao GN=TCM_016356 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 5.6e-45
Identity = 103/216 (47.69%), Postives = 145/216 (67.13%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           MAD+EQVKPLA A+ + RSDD   L     +L L R +YI CCGC AALLLI AV+ +VL
Sbjct: 1   MADREQVKPLAPAAFQTRSDDEEAL---SKQLKLKRRRYIQCCGCVAALLLIQAVVILVL 60

Query: 61  GFTVFHIKTPDIKIDSLS------FPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIE 120
            FTVF I+ P I+++S++      F N +L ++  + ++A VSV+NPNVA+FK++ ++  
Sbjct: 61  FFTVFRIQDPMIRMNSVTIQRLEFFQNGSLRTDVNVTLLADVSVKNPNVAAFKFNNSTTL 120

Query: 121 IYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEI 180
           IYY  +V+GEG    G+ KA+ TLR NVTV+I P K+    SL+ D+ S +L+ISSYT I
Sbjct: 121 IYYGGRVVGEGHHLQGKAKARRTLRRNVTVDIIPEKILAVPSLMSDFASQALNISSYTRI 180

Query: 181 PGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDC 211
            GRV+IL  IKK  +VK +C++TY    +   G+ C
Sbjct: 181 SGRVRILNFIKKKVVVKFNCTMTYRLSGQEFHGESC 213

BLAST of Csa5G152140 vs. TrEMBL
Match: W9SLI7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006687 PE=4 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 8.4e-41
Identity = 95/228 (41.67%), Postives = 139/228 (60.96%), Query Frame = 1

Query: 1   MAD-KEQVKPLASASAELRSDDHIFLPPPDPK-----LHLHRNKYIMCCGCFAALLLILA 60
           MAD KEQVKPLA A    RSD+       D K         RN  +  CGC +A+L+I A
Sbjct: 1   MADRKEQVKPLAPAFYLFRSDEEDNTNNDDNKNKSFFADRRRNSCVKRCGCASAILVIAA 60

Query: 61  VIGIVLGFTVFHIKTPDIKIDSLS------FPNDTLSSNSGIIVVASVSVRNPNVASFKY 120
           V  ++L  TVFH+K P +K+ S++      + N T+ ++  + +VA VSV+NPN ASF+Y
Sbjct: 61  VTMMILAITVFHVKGPIVKMTSVTVDPLQTYANGTIDTDKNVTLVAGVSVKNPNAASFRY 120

Query: 121 SKASIEIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSI 180
           +  +  ++Y    +GEG    G+ KA+ T++MN+TVEI   KM ++  L+KDW SG L+ 
Sbjct: 121 ANTTTTVFYGGAAVGEGWNAAGKAKARRTVKMNLTVEISTAKMLESPGLLKDWGSGELTF 180

Query: 181 SSYTEIPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRI 217
            SYT I GRVKI   +KK  +VK++C+++YN  SK I+ Q C + V +
Sbjct: 181 DSYTRIEGRVKITDVVKKKVVVKLNCTVSYNVSSKGIERQHCKRHVSL 228

BLAST of Csa5G152140 vs. TrEMBL
Match: A0A067K8E9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14326 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 2.5e-40
Identity = 93/221 (42.08%), Postives = 139/221 (62.90%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           M + EQV+PLA A     SDD            + R + I CCGC AA+ LILA++ ++L
Sbjct: 1   MVEGEQVRPLAPARDRTSSDDE---EAAHQLKKIRRRRCIKCCGCIAAVSLILAIVIVIL 60

Query: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGI-------IVVASVSVRNPNVASFKYSKASI 120
            FTVF IK P+I+++ ++     L +N+ I        ++A VSV+NPN+ASFKY+  S 
Sbjct: 61  IFTVFRIKNPNIRLNGITITQLELINNTNIPKPGVNISLIADVSVKNPNIASFKYNNTST 120

Query: 121 EIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTE 180
            ++Y+ +++GE   PPG  KA+ T+RMNVTVEI   K+    +L  +  SG L++SSY++
Sbjct: 121 ALFYYGELVGEARGPPGRAKARRTMRMNVTVEIITDKLISNPNLNTEAGSGLLTMSSYSK 180

Query: 181 IPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRV 215
           IPGRVK+   IKK+  VK++C++T N  S+ IQ Q C ++V
Sbjct: 181 IPGRVKLFHIIKKHATVKMNCTITVNISSQAIQTQKCKRKV 218

BLAST of Csa5G152140 vs. TAIR10
Match: AT2G46150.1 (AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 133.7 bits (335), Expect = 1.4e-31
Identity = 83/223 (37.22%), Postives = 122/223 (54.71%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           MAD E V+PLA A+    SD+           H  RN+ I C  C  A  LIL  I + L
Sbjct: 1   MADSEHVRPLAPATILPVSDESA---SNIKNTHRSRNR-IKCSICVTATSLILTTIVLTL 60

Query: 61  GFTVFHIKTPDIK--------IDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKAS 120
            FTVF +K P IK        +DS++  N      + I ++  VSV+NPN ASFKYS  +
Sbjct: 61  VFTVFRVKDPIIKMNGVMVNGLDSVTGTNQVQLLGTNISMIVDVSVKNPNTASFKYSNTT 120

Query: 121 IEIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWN-SGSLSISSY 180
            +IYY   ++GE    PG+ +   T RMNVTV+I   ++     L ++ + SG +++ SY
Sbjct: 121 TDIYYKGTLVGEAHGLPGKARPHRTSRMNVTVDIMLDRILSDPGLGREISRSGLVNVWSY 180

Query: 181 TEIPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRV 215
           T + G+VKI+G +KK+  VK++C++  N   + IQ  DC +++
Sbjct: 181 TRVGGKVKIMGIVKKHVTVKMNCTMAVNITGQAIQDVDCKKKI 219

BLAST of Csa5G152140 vs. TAIR10
Match: AT4G23610.1 (AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 105.1 bits (261), Expect = 5.4e-23
Identity = 70/222 (31.53%), Postives = 110/222 (49.55%), Query Frame = 1

Query: 3   DKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKY-------IMCCGCFAALLLILAV 62
           +++Q KPLA      RSD     P  + + H  R KY       I+CCG  A+L +++AV
Sbjct: 9   NEDQAKPLAPLFLTTRSDQ----PDEEDQYHHDRTKYVHSQTKLILCCGFIASLTMLIAV 68

Query: 63  IGIVLGFTVFHIKTPDIKIDSLS------FPNDTLSSNSGIIVVASVSVRNPNVASFKYS 122
             IVL  TVFH+ +P++ +DS+S      F N  +++N    V   +S+ NPN A F   
Sbjct: 69  TFIVLSLTVFHLHSPNLTVDSISFNQRFDFVNGKVNTNQNTTVSVEISLHNPNPALFIVK 128

Query: 123 KASIEIYYHD-KVIGEGETPPGEVKAKDTLRMNVTVEIEPWK-MDDASSLIKDWNSGSLS 182
             ++  Y+ +  V+GE       + AK T++MN+T EI   K +     L++D N   + 
Sbjct: 129 NVNVSFYHGELVVVGESIRRSETIPAKRTVKMNLTAEIVKTKLLASLPGLMEDLNGRGVD 188

Query: 183 ISSYTEIPGRVKILGSIKKNYLVKISCSL---TYNSKSKTIQ 207
           + S  E+ GRVK +   +K   ++  C +   T N  + T Q
Sbjct: 189 LKSSVEVRGRVKKMKIFRKTVHLQTDCFMKMTTNNFLTPTFQ 226

BLAST of Csa5G152140 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 91.3 bits (225), Expect = 8.1e-19
Identity = 55/214 (25.70%), Postives = 98/214 (45.79%), Query Frame = 1

Query: 25  LPPPDPKLH--------------LHRNKYIMCCGCFAALLLIL-AVIGIVLGFTVFHIKT 84
           LPPP P                 L R +    C CF  LL++L A++ ++L FT+F  K 
Sbjct: 22  LPPPKPNASSMETQSANTGTAKKLRRKRNCKICICFTILLILLIAIVIVILAFTLFKPKR 81

Query: 85  PDIKIDSLSFPNDTLSSNSGIIVV-------ASVSVRNPNVASFKYSKASIEIYYHDKVI 144
           P   IDS++      S N  ++ V         +S++NPN   F Y  +S  + Y  +VI
Sbjct: 82  PTTTIDSVTVDRLQASVNPLLLKVLLNLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVI 141

Query: 145 GEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKILG 204
           GE   P   + A+ T+ +N+T+ +   ++   + L+ D  +G + ++++ ++ G+V +L 
Sbjct: 142 GEAPLPANRIAARKTVPLNITLTLMADRLLSETQLLSDVMAGVIPLNTFVKVTGKVTVLK 201

Query: 205 SIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRI 217
             K       SC L+ +   + +  Q C    ++
Sbjct: 202 IFKIKVQSSSSCDLSISVSDRNVTSQHCKYSTKL 235

BLAST of Csa5G152140 vs. TAIR10
Match: AT1G64065.1 (AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 74.3 bits (181), Expect = 1.0e-13
Identity = 49/171 (28.65%), Postives = 88/171 (51.46%), Query Frame = 1

Query: 49  LLLILAVIGIVLGFTVFHIKTPDIKIDSLSFPNDTLSSNS-----GIIVVASVSVRNPNV 108
           +++I+  + ++L      I  P+I+  S+S  +     NS        +V+ +S+RN N 
Sbjct: 46  IIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGNSTNPYFNATLVSDISIRNSNF 105

Query: 109 ASFKYSKASIEIYYHDK-VIGEGETPPGEVKAKDTLRMN-VTVEIEPWKMDDASSLIKDW 168
            +F++  +++ + Y D  V+GE +     V+A  T+R+  V VEI  +++ D   L KD 
Sbjct: 106 GAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLLDTKDLDKDL 165

Query: 169 NSGSLSISSYTEIPGRVKILGSIKKNYLVKI-SCSLTYNSKSKTIQGQDCD 212
             G L + S  E+ GR+K+LG  +K + V + SC++  N   + IQ   C+
Sbjct: 166 RLGFLELRSVAEVRGRIKVLG--RKRWKVSVMSCTMRLNLTGRFIQNLLCE 214

BLAST of Csa5G152140 vs. TAIR10
Match: AT4G23930.1 (AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 73.9 bits (180), Expect = 1.3e-13
Identity = 45/177 (25.42%), Postives = 83/177 (46.89%), Query Frame = 1

Query: 43  CGCFAALLLILAVIGIVLGFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASV-SVRNP 102
           C      ++ L +  + +  TVF  + P+I + S+  P+ +++++S     +   +VRNP
Sbjct: 11  CAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVANSSVSFTFSQFSAVRNP 70

Query: 103 NVASFKYSKASIEIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASS----- 162
           N A+F +    I+++Y+   IG    P GE+++  T RM  T  ++ + +  ASS     
Sbjct: 71  NRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAASSSQISA 130

Query: 163 ---LIKDWNSGSLSISSYTEIPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDC 211
                 D +  ++ I S  E+ GRV++LG        K +C +  +S   +I    C
Sbjct: 131 AQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSIVAVRC 187

BLAST of Csa5G152140 vs. NCBI nr
Match: gi|700194872|gb|KGN50049.1| (hypothetical protein Csa_5G152140 [Cucumis sativus])

HSP 1 Score: 436.0 bits (1120), Expect = 3.8e-119
Identity = 218/218 (100.00%), Postives = 218/218 (100.00%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL
Sbjct: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60

Query: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 120
           GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK
Sbjct: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 120

Query: 121 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 180
           VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI
Sbjct: 121 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 180

Query: 181 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 219
           LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV
Sbjct: 181 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 218

BLAST of Csa5G152140 vs. NCBI nr
Match: gi|778708243|ref|XP_004143964.2| (PREDICTED: uncharacterized protein LOC101212153 [Cucumis sativus])

HSP 1 Score: 436.0 bits (1120), Expect = 3.8e-119
Identity = 218/218 (100.00%), Postives = 218/218 (100.00%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL
Sbjct: 233 MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 292

Query: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 120
           GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK
Sbjct: 293 GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 352

Query: 121 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 180
           VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI
Sbjct: 353 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 412

Query: 181 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 219
           LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV
Sbjct: 413 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 450

BLAST of Csa5G152140 vs. NCBI nr
Match: gi|659073969|ref|XP_008437350.1| (PREDICTED: uncharacterized protein LOC103482794 [Cucumis melo])

HSP 1 Score: 403.7 bits (1036), Expect = 2.1e-109
Identity = 202/218 (92.66%), Postives = 214/218 (98.17%), Query Frame = 1

Query: 1   MADKEQVKPLASASAELRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVL 60
           MADKEQVKPLASA+AELRSDDHIFLPPP PKL+L+RNKYI CCGCF+ALLLILAVIGIVL
Sbjct: 1   MADKEQVKPLASATAELRSDDHIFLPPP-PKLNLYRNKYIKCCGCFSALLLILAVIGIVL 60

Query: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYYHDK 120
           GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKAS +IYYH+K
Sbjct: 61  GFTVFHIKTPDIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASTKIYYHNK 120

Query: 121 VIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGRVKI 180
           VIGEGETPPGEVKAKDTL+MNVTV+IEPWK+DDASSLIKDWNSG+LSISSYTEIPGRVK+
Sbjct: 121 VIGEGETPPGEVKAKDTLKMNVTVKIEPWKIDDASSLIKDWNSGALSISSYTEIPGRVKL 180

Query: 181 LGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRISV 219
           LG+IKKNYLVKISCSLTYNSKSKTIQ QDCDQRVRISV
Sbjct: 181 LGAIKKNYLVKISCSLTYNSKSKTIQRQDCDQRVRISV 217

BLAST of Csa5G152140 vs. NCBI nr
Match: gi|595859288|ref|XP_007210919.1| (hypothetical protein PRUPE_ppa021244mg [Prunus persica])

HSP 1 Score: 200.7 bits (509), Expect = 2.7e-48
Identity = 106/223 (47.53%), Postives = 149/223 (66.82%), Query Frame = 1

Query: 1   MADKEQVKPLASASA-ELRSDDH-IFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGI 60
           MA KEQ KPLA A++  LRSD+  +F+      + L + KY+MCCGC +AL LI+AV  I
Sbjct: 1   MAAKEQGKPLAPANSYHLRSDEEEVFV---SSHIKLCQRKYVMCCGCVSALFLIIAVTAI 60

Query: 61  VLGFTVFHIKTP-----DIKIDSLSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASI 120
           VLGFTVFH+K P     D+ I  L   N  L S++ + ++A VS++NPNVASFKY   + 
Sbjct: 61  VLGFTVFHVKGPRIKMNDVTIQQLEVANGALRSDTNVTLLADVSIKNPNVASFKYGNTTT 120

Query: 121 EIYYHDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTE 180
            +YY    +G+G TP G  KA+ T+RMNVTV+I P ++      IK+  SG L++S+YT 
Sbjct: 121 RVYYSGTEVGQGRTPAGVAKARRTMRMNVTVDIVPGEISAVPGFIKEVASGKLTVSTYTR 180

Query: 181 IPGRVKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRI 217
           I G+VKIL  + KN +V+++CS+TYN  SK I+G+DC +RV +
Sbjct: 181 IEGKVKIL-MVNKNVVVELNCSMTYNFASKGIEGEDCKRRVSL 219

BLAST of Csa5G152140 vs. NCBI nr
Match: gi|658036772|ref|XP_008353940.1| (PREDICTED: uncharacterized protein LOC103417542 [Malus domestica])

HSP 1 Score: 194.5 bits (493), Expect = 1.9e-46
Identity = 104/219 (47.49%), Postives = 148/219 (67.58%), Query Frame = 1

Query: 4   KEQVKPLASASAE-LRSDDHIFLPPPDPKLHLHRNKYIMCCGCFAALLLILAVIGIVLGF 63
           KEQVKPLA A++  LRSD+          + L + KY+ CCGC +A+ LI+AV  IVLGF
Sbjct: 5   KEQVKPLALANSHYLRSDEE---EVASLHIKLKQRKYVHCCGCVSAVFLIIAVTAIVLGF 64

Query: 64  TVFHIKTPDIKIDS-----LSFPNDTLSSNSGIIVVASVSVRNPNVASFKYSKASIEIYY 123
           TVFH+K P IK++      L F N  L +++ I ++A VSV+NPNVASFKYS A+  +YY
Sbjct: 65  TVFHVKDPKIKMNKVTVQRLEFANGNLRTDTNITLLADVSVKNPNVASFKYSNATTLVYY 124

Query: 124 HDKVIGEGETPPGEVKAKDTLRMNVTVEIEPWKMDDASSLIKDWNSGSLSISSYTEIPGR 183
               +GEG TP G  KA+ T RMNVTV+I P K+     L+++  SG L++++YT I G+
Sbjct: 125 SGTEVGEGRTPAGVAKARRTSRMNVTVDIVPGKILGVPGLMREVASGELTMTTYTRIQGK 184

Query: 184 VKILGSIKKNYLVKISCSLTYNSKSKTIQGQDCDQRVRI 217
           VK++  +KKN +V+++C++ YN  S  IQG+DC +RVR+
Sbjct: 185 VKVV-MVKKNVVVELNCTVRYNFSSGEIQGKDCKRRVRL 219

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1465_ARATH1.8e-1228.65Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana GN=At1g640... [more]
Match NameE-valueIdentityDescription
A0A0A0KMH1_CUCSA2.6e-119100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_5G152140 PE=4 SV=1[more]
M5WS18_PRUPE1.9e-4847.53Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021244mg PE=4 SV=1[more]
A0A061G639_THECC5.6e-4547.69Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS... [more]
W9SLI7_9ROSA8.4e-4141.67Uncharacterized protein OS=Morus notabilis GN=L484_006687 PE=4 SV=1[more]
A0A067K8E9_JATCU2.5e-4042.08Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14326 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G46150.11.4e-3137.22 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT4G23610.15.4e-2331.53 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT3G54200.18.1e-1925.70 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT1G64065.11.0e-1328.65 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT4G23930.11.3e-1325.42 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
Match NameE-valueIdentityDescription
gi|700194872|gb|KGN50049.1|3.8e-119100.00hypothetical protein Csa_5G152140 [Cucumis sativus][more]
gi|778708243|ref|XP_004143964.2|3.8e-119100.00PREDICTED: uncharacterized protein LOC101212153 [Cucumis sativus][more]
gi|659073969|ref|XP_008437350.1|2.1e-10992.66PREDICTED: uncharacterized protein LOC103482794 [Cucumis melo][more]
gi|595859288|ref|XP_007210919.1|2.7e-4847.53hypothetical protein PRUPE_ppa021244mg [Prunus persica][more]
gi|658036772|ref|XP_008353940.1|1.9e-4647.49PREDICTED: uncharacterized protein LOC103417542 [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
IPR013783Ig-like_fold
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005886 plasma membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0046872 metal ion binding
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa5G152140.1Csa5G152140.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA-14PFAMPF03168LEA_2coord: 96..195
score: 2.0
IPR013783Immunoglobulin-like foldGENE3DG3DSA:2.60.40.10coord: 87..174
score: 3.
NoneNo IPR availablePANTHERPTHR31852FAMILY NOT NAMEDcoord: 1..218
score: 2.0
NoneNo IPR availablePANTHERPTHR31852:SF23SUBFAMILY NOT NAMEDcoord: 1..218
score: 2.0
NoneNo IPR availableunknownSSF117070LEA14-likecoord: 51..153
score: 6.02

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Csa5G152140Csa6G006820Cucumber (Chinese Long) v2cucuB164