Csa1G132720 (gene) Cucumber (Chinese Long) v2

Name: Csa1G132720
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Putative harpin-induced protein 1; contains IPR004864 (Late embryogenesis abundant protein, LEA-14)
Location: Chr1 : 9464033 .. 9465096 (+)
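The span covered by the location above can be checked with a short script. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; the function name `parse_location` is illustrative only.

```python
import re

def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 9464033 .. 9465096 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

If the coordinates were instead 0-based and half-open, the length would be `end - start` without the added 1.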
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GCAAGTTTTATTAGTTAAATCATTCCACATCCAAACTTCCAAAACCTCTCTTATATATTAATCCATTCTCAGTTCCTTCTTTTTGTAGTACTCTTCTTAAAGCAAAGCCTCTTCAATCAAAGTTGGCTTCTTTTCCTTAGCAGAGAACTATCCATCCTTGAGCATGGCAGGTCCTCCGCAGCCACTGTCACGATCTGGCCCCTCAAGGATATTGCGTTTTGTCATAATTTTCTTAGTGGCATTGATCATACTCGTTGGCCTTGCCGTGCTCATTATCTGGCTAACTGTTAGGCCGAAACGACTGAGCTACACGGTGGAAAGCGCTGAGGTCCATAACTTCGACATGACCGACACCCAACTCAATGCATCCTTTAGTTTTGGGGTAAGAGCATATAATCCCAATAAACGAGTCTCGGTTTACTATGATTCCATCACTGCCACAGTTGGGTTCGGTGATCAAGACTTGTCTTTCGGCGTGCTCAGTCCTTTCTACCAACCTCACAAAAACGAGCAATGGTTGAACATCCACCTCAACGCTCAAAACTTTCTATTGCACGACTCTGTGTCGAAGGAATTGGCACTTGAAAGGTCAGCTGGAGAGATGGATTTGGATCTTTGGATCAAGGCAAGAATTAGGTTTAAGGTTGGGGTATGGAAGTCCGCGCATAGGACGCTTCGAATCCGGTGTTCGCCGGTGATTGTCTACTTGTCTAAATCCAAGACTTTCAAGAAGACTACTTGCTTTACAGAAGTCTAAGGTTATGGAATTTGGTTTTTTGCTTTTATCAAAAGTGTTTTTCTTTTTTCCTTCTTACTTTCTGAAAATTGTATTGTTGGTGTGGTTGTTGGGTTAATCTTTCAAGTGTTTAATTTGTGAGCTGTGATTGTTGCTTGGGTTTTTAATATGTATATGCTCTTTTTGATCACTTTTTTTTGTTCAGTACAAATGTCAGTCCATTCTTTAACAAATAAGTGCAACCATTTGTAATTTGATTTGATTTAAATAATTGAAATTTTTTATGTATACTTGTATTGTTTTCTATATTGCATAAACGAAGTTGGGA

mRNA sequence

ATGGCAGGTCCTCCGCAGCCACTGTCACGATCTGGCCCCTCAAGGATATTGCGTTTTGTCATAATTTTCTTAGTGGCATTGATCATACTCGTTGGCCTTGCCGTGCTCATTATCTGGCTAACTGTTAGGCCGAAACGACTGAGCTACACGGTGGAAAGCGCTGAGGTCCATAACTTCGACATGACCGACACCCAACTCAATGCATCCTTTAGTTTTGGGGTAAGAGCATATAATCCCAATAAACGAGTCTCGGTTTACTATGATTCCATCACTGCCACAGTTGGGTTCGGTGATCAAGACTTGTCTTTCGGCGTGCTCAGTCCTTTCTACCAACCTCACAAAAACGAGCAATGGTTGAACATCCACCTCAACGCTCAAAACTTTCTATTGCACGACTCTGTGTCGAAGGAATTGGCACTTGAAAGGTCAGCTGGAGAGATGGATTTGGATCTTTGGATCAAGGCAAGAATTAGGTTTAAGGTTGGGGTATGGAAGTCCGCGCATAGGACGCTTCGAATCCGGTGTTCGCCGGTGATTGTCTACTTGTCTAAATCCAAGACTTTCAAGAAGACTACTTGCTTTACAGAAGTCTAA

Coding sequence (CDS)

ATGGCAGGTCCTCCGCAGCCACTGTCACGATCTGGCCCCTCAAGGATATTGCGTTTTGTCATAATTTTCTTAGTGGCATTGATCATACTCGTTGGCCTTGCCGTGCTCATTATCTGGCTAACTGTTAGGCCGAAACGACTGAGCTACACGGTGGAAAGCGCTGAGGTCCATAACTTCGACATGACCGACACCCAACTCAATGCATCCTTTAGTTTTGGGGTAAGAGCATATAATCCCAATAAACGAGTCTCGGTTTACTATGATTCCATCACTGCCACAGTTGGGTTCGGTGATCAAGACTTGTCTTTCGGCGTGCTCAGTCCTTTCTACCAACCTCACAAAAACGAGCAATGGTTGAACATCCACCTCAACGCTCAAAACTTTCTATTGCACGACTCTGTGTCGAAGGAATTGGCACTTGAAAGGTCAGCTGGAGAGATGGATTTGGATCTTTGGATCAAGGCAAGAATTAGGTTTAAGGTTGGGGTATGGAAGTCCGCGCATAGGACGCTTCGAATCCGGTGTTCGCCGGTGATTGTCTACTTGTCTAAATCCAAGACTTTCAAGAAGACTACTTGCTTTACAGAAGTCTAA

Protein sequence

MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLNIHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKTFKKTTCFTEV*
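The relationship between the coding sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under the assumption that the standard nuclear genetic code applies; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS above are used as inputs.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons and last four codons of the CDS listed above.
head = translate("ATGGCAGGTCCTCCGCAGCCA")
tail = translate("ACAGAAGTCTAA")
print(head, tail)
```

The two fragments translate to the start (`MAGPPQP`) and end (`TEV*`) of the protein sequence shown above, where the trailing `*` corresponds to the TAA stop codon.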
BLAST of Csa1G132720 vs. Swiss-Prot
Match: Y1816_ARATH (Uncharacterized protein At1g08160 OS=Arabidopsis thaliana GN=At1g08160 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 3.9e-30
Identity = 73/199 (36.68%), Positives = 120/199 (60.30%), Query Frame = 1

Query: 6   QPLSRSGPSRILRFVIIFLVALIIL---VGLAVLIIWLTVRPKRLSYTVESAEVHNFDM- 65
           QP ++  P R +  V+  +VAL++L   VGLA+LI +LT+RPKRL YTVE+A V  F + 
Sbjct: 23  QPRAQPLPGRRMNPVLCIIVALVLLGLLVGLAILITYLTLRPKRLIYTVEAASVQEFAIG 82

Query: 66  -TDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 125
             D  +NA FS+ +++YNP K VSV Y S+  +    +Q ++   +SPF Q  KNE  + 
Sbjct: 83  NNDDHINAKFSYVIKSYNPEKHVSVRYHSMRISTAHHNQSVAHKNISPFKQRPKNETRIE 142

Query: 126 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 185
             L + N  L    +++L  E+S G ++++++I AR+ +K  +++S  RTL+  C+PV++
Sbjct: 143 TQLVSHNVALSKFNARDLRAEKSKGTIEMEVYITARVSYKTWIFRSRRRTLKAVCTPVMI 202

Query: 186 YLSKSKT--FKKTTCFTEV 198
            ++ S    F++  C T +
Sbjct: 203 NVTSSSLDGFQRVLCKTRL 221
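The percentages in an HSP summary line follow directly from the counts given beside them. The sketch below recomputes them from the line text; the regular expression and the function name `recompute` are illustrative assumptions rather than part of any BLAST tool, and the pattern accepts any spelling of the second label that begins with "Pos".

```python
import re

# Matches e.g. "Identity = 73/199 (36.68%), Positives = 120/199 (60.30%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)

def recompute(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

stats = recompute(
    "Identity = 73/199 (36.68%), Positives = 120/199 (60.30%), Query Frame = 1"
)
print(stats)
```

The denominator is the alignment length including gaps, which is why it can exceed the number of query residues covered by the HSP.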

BLAST of Csa1G132720 vs. Swiss-Prot
Match: YLS9_ARATH (Protein YLS9 OS=Arabidopsis thaliana GN=YLS9 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 4.8e-20
Identity = 63/192 (32.81%), Positives = 98/192 (51.04%), Query Frame = 1

Query: 4   PPQPLS--RSGPSR-----ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEV 63
           PP P    R G  R     +L   +  +++LI+++G+A LI WL VRP+ + + V  A +
Sbjct: 18  PPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHVTDASL 77

Query: 64  HNFDMT--DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHK 123
             FD T  D  L  + +  V   NPNKR+ +YYD I A   +  +  S   L+PFYQ HK
Sbjct: 78  TRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTPFYQGHK 137

Query: 124 NEQWLNIHLNAQNFLLHDS-VSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRI 183
           N   L      QN ++ ++  S+ L  ER +G  ++++  + R+RFK+G  K      ++
Sbjct: 138 NTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRRIKPKV 197

Query: 184 RCSPVIVYLSKS 186
            C  + + LS S
Sbjct: 198 DCDDLRLPLSTS 209

BLAST of Csa1G132720 vs. Swiss-Prot
Match: NHL3_ARATH (NDR1/HIN1-Like protein 3 OS=Arabidopsis thaliana GN=NHL3 PE=1 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 2.9e-17
Identity = 62/188 (32.98%), Positives = 95/188 (50.53%), Query Frame = 1

Query: 16  ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMTDT---QLNASFSF 75
           IL  +   L+ + +L+G+A LIIWL  RP  + + V  A++  F +  T   + N   +F
Sbjct: 44  ILSVIFNILITIAVLLGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNF 103

Query: 76  GVRAYNPNKRVSVYYDSITATVGFGDQDLSFGV---LSPFYQPHKNEQWLNIHLNAQNFL 135
            +R  NPN+R+ VYYD I     +GDQ   FG+   +S FYQ HKN   +   L  Q  +
Sbjct: 104 TIR--NPNRRIGVYYDEIEVRGYYGDQ--RFGMSNNISKFYQGHKNTTVVGTKLVGQQLV 163

Query: 136 LHD-SVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT- 194
           L D    K+L  + ++    +D  ++ +IRFK G+ KS     +I+C   +   S S + 
Sbjct: 164 LLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSG 223

BLAST of Csa1G132720 vs. TrEMBL
Match: A0A0A0LVK3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G132720 PE=4 SV=1)

HSP 1 Score: 396.0 bits (1016), Expect = 2.7e-107
Identity = 197/197 (100.00%), Positives = 197/197 (100.00%), Query Frame = 1

Query: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60
           MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD
Sbjct: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60

Query: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120
           MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN
Sbjct: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120

Query: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180
           IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 181 YLSKSKTFKKTTCFTEV 198
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of Csa1G132720 vs. TrEMBL
Match: W9S3V4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_011767 PE=4 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 1.9e-55
Identity = 104/195 (53.33%), Positives = 147/195 (75.38%), Query Frame = 1

Query: 6   QPLSRSGPSR---ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMT 65
           +P   + P R   ILR++ +F +ALI+LVG+AVL+IWL VRPKRL Y+VE A +HNF++ 
Sbjct: 17  EPTREANPQRKPHILRWIAMFFLALIVLVGIAVLVIWLVVRPKRLVYSVEDASIHNFNIN 76

Query: 66  DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLNIH 125
           +  LNASF F VR+YNPN +VS+YYD I + V + DQ L++ ++ PF+QPHKN   L + 
Sbjct: 77  NNHLNASFDFVVRSYNPNSKVSIYYDKIESRVEYDDQTLAYNMVEPFFQPHKNVTRLELK 136

Query: 126 LNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYL 185
           L AQ+  L  S+  +L LE+S+GE++L++W+KARIRFKVG WKS+HRTL+I CSPV+V+ 
Sbjct: 137 LAAQSVPLVGSIPADLRLEKSSGEIELNVWLKARIRFKVGAWKSSHRTLKIFCSPVLVHF 196

Query: 186 SKSKTFKKTTCFTEV 198
           S+SK F++T C  E+
Sbjct: 197 SRSKNFERTVCDVEL 211

BLAST of Csa1G132720 vs. TrEMBL
Match: K7M1R4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_13G248600 PE=4 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 9.3e-55
Identity = 109/203 (53.69%), Positives = 151/203 (74.38%), Query Frame = 1

Query: 1   MAGPP-QPLSRSGP----SRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAE 60
           MA PP Q  SR+      S +LR + IF++ALIILVG+AV+IIWL ++PKRL YTVE+A 
Sbjct: 1   MAHPPTQSQSRAANKPKRSNLLRCIAIFILALIILVGIAVIIIWLVLKPKRLEYTVENAA 60

Query: 61  VHNFDMTDTQ-LNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHK 120
           +HNF++TD   L A+F F +R+YNPN RVS+YYD++  +V + DQ L+   + PF+Q HK
Sbjct: 61  IHNFNLTDANHLYANFDFTIRSYNPNSRVSIYYDTVEVSVRYEDQTLATNAVQPFFQSHK 120

Query: 121 NEQWLNIHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIR 180
           N   L++ L AQ   L+DSV K+L LERS+G+++LD+W++ARIRFKVGVWKS HR L+I 
Sbjct: 121 NVTRLHVGLTAQTVALYDSVPKDLRLERSSGDIELDVWMRARIRFKVGVWKSKHRVLKIF 180

Query: 181 CSPVIVYLSKSKTFKKTTCFTEV 198
           CSPV+V+ SK K+F++  C  E+
Sbjct: 181 CSPVLVHFSKGKSFERAPCDVEL 203

BLAST of Csa1G132720 vs. TrEMBL
Match: A0A0R4J5D0_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_15G065400 PE=4 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 1.0e-53
Identity = 102/203 (50.25%), Positives = 149/203 (73.40%), Query Frame = 1

Query: 1   MAGPPQPLSRSGP-----SRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAE 60
           MA PP   + +       S +L ++ +F+VALIILVG+AV+IIWL ++PKRL Y+VE+A 
Sbjct: 1   MAHPPSQSNSTAANKPKRSNLLHYIAMFIVALIILVGIAVIIIWLVLKPKRLEYSVENAA 60

Query: 61  VHNFDMTDTQ-LNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHK 120
           +HNF++TD   L A+F F +R+YNPN R+S+YYD++  +V + DQ L+   + PF+Q HK
Sbjct: 61  IHNFNLTDANHLYANFDFTIRSYNPNSRISIYYDTVEVSVRYEDQTLATNAVQPFFQSHK 120

Query: 121 NEQWLNIHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIR 180
           N   L++ L AQ+  L++SV K+L LERS+G+++LD+W++ARIRFKVG WKS HR LRI 
Sbjct: 121 NVTRLHVALTAQSVALYESVPKDLRLERSSGDIELDVWVRARIRFKVGAWKSRHRVLRIF 180

Query: 181 CSPVIVYLSKSKTFKKTTCFTEV 198
           CSPV+V+ SK K+F++  C  E+
Sbjct: 181 CSPVLVHFSKGKSFERAPCEVEL 203

BLAST of Csa1G132720 vs. TrEMBL
Match: A0A0B2RMU0_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_005671 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 9.6e-52
Identity = 96/177 (54.24%), Positives = 138/177 (77.97%), Query Frame = 1

Query: 22  IFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMTDTQ-LNASFSFGVRAYNPN 81
           +F+VALIILVG+AV+IIWL ++PKRL Y+VE+A +HNF++TD   L A+F F +R+YNPN
Sbjct: 1   MFIVALIILVGIAVIIIWLVLKPKRLEYSVENAAIHNFNLTDANHLYANFDFTIRSYNPN 60

Query: 82  KRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLNIHLNAQNFLLHDSVSKELAL 141
            R+S+YYD++  +V + DQ L+   + PF+Q HKN   L++ L AQ+  L++SV K+L L
Sbjct: 61  SRISIYYDTVEVSVRYEDQTLATNAVQPFFQSHKNVTRLHVALTAQSVALYESVPKDLRL 120

Query: 142 ERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKTFKKTTCFTEV 198
           ERS+G+++LD+W++ARIRFKVG WKS HR LRI CSPV+V+ SK K+F++  C  E+
Sbjct: 121 ERSSGDIELDVWVRARIRFKVGAWKSRHRVLRIFCSPVLVHFSKGKSFERAPCEVEL 177

BLAST of Csa1G132720 vs. TAIR10
Match: AT5G22870.1 (AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 163.3 bits (412), Expect = 1.5e-40
Identity = 80/195 (41.03%), Positives = 126/195 (64.62%), Query Frame = 1

Query: 4   PPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMT- 63
           P QPL R  PS I  ++ + ++ LI +  +  LI WL  +PK+L YTVE+A V NF++T 
Sbjct: 16  PAQPLRR--PSLIC-YIFLVILTLIFMAAVGFLITWLETKPKKLRYTVENASVQNFNLTN 75

Query: 64  DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLNIH 123
           D  ++A+F F ++++NPN R+SVYY S+   V F DQ L+F  + PF+QP  N + ++  
Sbjct: 76  DNHMSATFQFTIQSHNPNHRISVYYSSVEIFVKFKDQTLAFDTVEPFHQPRMNVKQIDET 135

Query: 124 LNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYL 183
           L A+N  +  S  K+L  + S G++  ++++KAR+RFKVG+WKS+HRT +I+CS V V L
Sbjct: 136 LIAENVAVSKSNGKDLRSQNSLGKIGFEVFVKARVRFKVGIWKSSHRTAKIKCSHVTVSL 195

Query: 184 SKSKTFKKTTCFTEV 198
           S+    + ++C  ++
Sbjct: 196 SQPNKSQNSSCDADI 207

BLAST of Csa1G132720 vs. TAIR10
Match: AT1G08160.1 (AT1G08160.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 132.9 bits (333), Expect = 2.2e-31
Identity = 73/199 (36.68%), Positives = 120/199 (60.30%), Query Frame = 1

Query: 6   QPLSRSGPSRILRFVIIFLVALIIL---VGLAVLIIWLTVRPKRLSYTVESAEVHNFDM- 65
           QP ++  P R +  V+  +VAL++L   VGLA+LI +LT+RPKRL YTVE+A V  F + 
Sbjct: 23  QPRAQPLPGRRMNPVLCIIVALVLLGLLVGLAILITYLTLRPKRLIYTVEAASVQEFAIG 82

Query: 66  -TDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 125
             D  +NA FS+ +++YNP K VSV Y S+  +    +Q ++   +SPF Q  KNE  + 
Sbjct: 83  NNDDHINAKFSYVIKSYNPEKHVSVRYHSMRISTAHHNQSVAHKNISPFKQRPKNETRIE 142

Query: 126 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 185
             L + N  L    +++L  E+S G ++++++I AR+ +K  +++S  RTL+  C+PV++
Sbjct: 143 TQLVSHNVALSKFNARDLRAEKSKGTIEMEVYITARVSYKTWIFRSRRRTLKAVCTPVMI 202

Query: 186 YLSKSKT--FKKTTCFTEV 198
            ++ S    F++  C T +
Sbjct: 203 NVTSSSLDGFQRVLCKTRL 221

BLAST of Csa1G132720 vs. TAIR10
Match: AT2G35980.1 (AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 99.4 bits (246), Expect = 2.7e-21
Identity = 63/192 (32.81%), Positives = 98/192 (51.04%), Query Frame = 1

Query: 4   PPQPLS--RSGPSR-----ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEV 63
           PP P    R G  R     +L   +  +++LI+++G+A LI WL VRP+ + + V  A +
Sbjct: 18  PPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHVTDASL 77

Query: 64  HNFDMT--DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHK 123
             FD T  D  L  + +  V   NPNKR+ +YYD I A   +  +  S   L+PFYQ HK
Sbjct: 78  TRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTPFYQGHK 137

Query: 124 NEQWLNIHLNAQNFLLHDS-VSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRI 183
           N   L      QN ++ ++  S+ L  ER +G  ++++  + R+RFK+G  K      ++
Sbjct: 138 NTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRRIKPKV 197

Query: 184 RCSPVIVYLSKS 186
            C  + + LS S
Sbjct: 198 DCDDLRLPLSTS 209

BLAST of Csa1G132720 vs. TAIR10
Match: AT5G06320.1 (AT5G06320.1 NDR1/HIN1-like 3)

HSP 1 Score: 90.1 bits (222), Expect = 1.6e-18
Identity = 62/188 (32.98%), Positives = 95/188 (50.53%), Query Frame = 1

Query: 16  ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMTDT---QLNASFSF 75
           IL  +   L+ + +L+G+A LIIWL  RP  + + V  A++  F +  T   + N   +F
Sbjct: 44  ILSVIFNILITIAVLLGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNF 103

Query: 76  GVRAYNPNKRVSVYYDSITATVGFGDQDLSFGV---LSPFYQPHKNEQWLNIHLNAQNFL 135
            +R  NPN+R+ VYYD I     +GDQ   FG+   +S FYQ HKN   +   L  Q  +
Sbjct: 104 TIR--NPNRRIGVYYDEIEVRGYYGDQ--RFGMSNNISKFYQGHKNTTVVGTKLVGQQLV 163

Query: 136 LHD-SVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT- 194
           L D    K+L  + ++    +D  ++ +IRFK G+ KS     +I+C   +   S S + 
Sbjct: 164 LLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSG 223

BLAST of Csa1G132720 vs. TAIR10
Match: AT3G11650.1 (AT3G11650.1 NDR1/HIN1-like 2)

HSP 1 Score: 87.4 bits (215), Expect = 1.1e-17
Identity = 51/174 (29.31%), Positives = 84/174 (48.28%), Query Frame = 1

Query: 16  ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMT-DTQLNASFSFGV 75
           IL  +   L+A+ +++G+A LI+WL  RP  + + V  A ++ F    +  L+ S     
Sbjct: 51  ILSLICNILIAVAVILGVAALILWLIFRPNAVKFYVADANLNRFSFDPNNNLHYSLDLNF 110

Query: 76  RAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLNIHLNAQNF-LLHDS 135
              NPN+RV VYYD  + +  +GDQ      +S FYQ HKN   +   +  QN  +L D 
Sbjct: 111 TIRNPNQRVGVYYDEFSVSGYYGDQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVLGDG 170

Query: 136 VSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT 188
              +L  +  +G   ++  ++  +RFK    KS     +I+C  + + L  S +
Sbjct: 171 ARTDLKDDEKSGIYRINAKLRLSVRFKFWFIKSWKLKPKIKCDDLKIPLGSSNS 224

BLAST of Csa1G132720 vs. NCBI nr
Match: gi|778659278|ref|XP_011654115.1| (PREDICTED: uncharacterized protein At1g08160 [Cucumis sativus])

HSP 1 Score: 396.0 bits (1016), Expect = 3.9e-107
Identity = 197/197 (100.00%), Positives = 197/197 (100.00%), Query Frame = 1

Query: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60
           MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD
Sbjct: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60

Query: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120
           MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN
Sbjct: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120

Query: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180
           IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 181 YLSKSKTFKKTTCFTEV 198
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of Csa1G132720 vs. NCBI nr
Match: gi|659072246|ref|XP_008464346.1| (PREDICTED: uncharacterized protein At1g08160 [Cucumis melo])

HSP 1 Score: 385.6 bits (989), Expect = 5.3e-104
Identity = 190/197 (96.45%), Positives = 196/197 (99.49%), Query Frame = 1

Query: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60
           MAGPPQP SR+GPSRILRFVIIFLVALIILVGLAVLIIWLT+RPKRLSYTVESAEVHNFD
Sbjct: 1   MAGPPQPPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNFD 60

Query: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120
           MT+TQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDL+FGVLSPFYQPHK+EQWLN
Sbjct: 61  MTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWLN 120

Query: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180
           IHLNAQNFLLHDSVSK+LALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 181 YLSKSKTFKKTTCFTEV 198
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of Csa1G132720 vs. NCBI nr
Match: gi|703150687|ref|XP_010109925.1| (hypothetical protein L484_011767 [Morus notabilis])

HSP 1 Score: 223.8 bits (569), Expect = 2.7e-55
Identity = 104/195 (53.33%), Positives = 147/195 (75.38%), Query Frame = 1

Query: 6   QPLSRSGPSR---ILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFDMT 65
           +P   + P R   ILR++ +F +ALI+LVG+AVL+IWL VRPKRL Y+VE A +HNF++ 
Sbjct: 17  EPTREANPQRKPHILRWIAMFFLALIVLVGIAVLVIWLVVRPKRLVYSVEDASIHNFNIN 76

Query: 66  DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLNIH 125
           +  LNASF F VR+YNPN +VS+YYD I + V + DQ L++ ++ PF+QPHKN   L + 
Sbjct: 77  NNHLNASFDFVVRSYNPNSKVSIYYDKIESRVEYDDQTLAYNMVEPFFQPHKNVTRLELK 136

Query: 126 LNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYL 185
           L AQ+  L  S+  +L LE+S+GE++L++W+KARIRFKVG WKS+HRTL+I CSPV+V+ 
Sbjct: 137 LAAQSVPLVGSIPADLRLEKSSGEIELNVWLKARIRFKVGAWKSSHRTLKIFCSPVLVHF 196

Query: 186 SKSKTFKKTTCFTEV 198
           S+SK F++T C  E+
Sbjct: 197 SRSKNFERTVCDVEL 211

BLAST of Csa1G132720 vs. NCBI nr
Match: gi|356546690|ref|XP_003541756.1| (PREDICTED: uncharacterized protein At1g08160-like [Glycine max])

HSP 1 Score: 221.5 bits (563), Expect = 1.3e-54
Identity = 109/203 (53.69%), Positives = 151/203 (74.38%), Query Frame = 1

Query: 1   MAGPP-QPLSRSGP----SRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAE 60
           MA PP Q  SR+      S +LR + IF++ALIILVG+AV+IIWL ++PKRL YTVE+A 
Sbjct: 1   MAHPPTQSQSRAANKPKRSNLLRCIAIFILALIILVGIAVIIIWLVLKPKRLEYTVENAA 60

Query: 61  VHNFDMTDTQ-LNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHK 120
           +HNF++TD   L A+F F +R+YNPN RVS+YYD++  +V + DQ L+   + PF+Q HK
Sbjct: 61  IHNFNLTDANHLYANFDFTIRSYNPNSRVSIYYDTVEVSVRYEDQTLATNAVQPFFQSHK 120

Query: 121 NEQWLNIHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIR 180
           N   L++ L AQ   L+DSV K+L LERS+G+++LD+W++ARIRFKVGVWKS HR L+I 
Sbjct: 121 NVTRLHVGLTAQTVALYDSVPKDLRLERSSGDIELDVWMRARIRFKVGVWKSKHRVLKIF 180

Query: 181 CSPVIVYLSKSKTFKKTTCFTEV 198
           CSPV+V+ SK K+F++  C  E+
Sbjct: 181 CSPVLVHFSKGKSFERAPCDVEL 203

BLAST of Csa1G132720 vs. NCBI nr
Match: gi|356554941|ref|XP_003545799.1| (PREDICTED: uncharacterized protein At1g08160 [Glycine max])

HSP 1 Score: 218.0 bits (554), Expect = 1.5e-53
Identity = 102/203 (50.25%), Positives = 149/203 (73.40%), Query Frame = 1

Query: 1   MAGPPQPLSRSGP-----SRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAE 60
           MA PP   + +       S +L ++ +F+VALIILVG+AV+IIWL ++PKRL Y+VE+A 
Sbjct: 1   MAHPPSQSNSTAANKPKRSNLLHYIAMFIVALIILVGIAVIIIWLVLKPKRLEYSVENAA 60

Query: 61  VHNFDMTDTQ-LNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHK 120
           +HNF++TD   L A+F F +R+YNPN R+S+YYD++  +V + DQ L+   + PF+Q HK
Sbjct: 61  IHNFNLTDANHLYANFDFTIRSYNPNSRISIYYDTVEVSVRYEDQTLATNAVQPFFQSHK 120

Query: 121 NEQWLNIHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIR 180
           N   L++ L AQ+  L++SV K+L LERS+G+++LD+W++ARIRFKVG WKS HR LRI 
Sbjct: 121 NVTRLHVALTAQSVALYESVPKDLRLERSSGDIELDVWVRARIRFKVGAWKSRHRVLRIF 180

Query: 181 CSPVIVYLSKSKTFKKTTCFTEV 198
           CSPV+V+ SK K+F++  C  E+
Sbjct: 181 CSPVLVHFSKGKSFERAPCEVEL 203

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
Y1816_ARATH   3.9e-30    36.68           Uncharacterized protein At1g08160 OS=Arabidopsis thaliana GN=At1g08160 PE=2 SV=1
YLS9_ARATH    4.8e-20    32.81           Protein YLS9 OS=Arabidopsis thaliana GN=YLS9 PE=2 SV=1
NHL3_ARATH    2.9e-17    32.98           NDR1/HIN1-Like protein 3 OS=Arabidopsis thaliana GN=NHL3 PE=1 SV=1
Match Name         E-value     Identity (%)    Description
A0A0A0LVK3_CUCSA   2.7e-107    100.00          Uncharacterized protein OS=Cucumis sativus GN=Csa_1G132720 PE=4 SV=1
W9S3V4_9ROSA       1.9e-55     53.33           Uncharacterized protein OS=Morus notabilis GN=L484_011767 PE=4 SV=1
K7M1R4_SOYBN       9.3e-55     53.69           Uncharacterized protein OS=Glycine max GN=GLYMA_13G248600 PE=4 SV=1
A0A0R4J5D0_SOYBN   1.0e-53     50.25           Uncharacterized protein OS=Glycine max GN=GLYMA_15G065400 PE=4 SV=1
A0A0B2RMU0_GLYSO   9.6e-52     54.24           Uncharacterized protein OS=Glycine soja GN=glysoja_005671 PE=4 SV=1
Match Name    E-value    Identity (%)    Description
AT5G22870.1   1.5e-40    41.03           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G08160.1   2.2e-31    36.68           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT2G35980.1   2.7e-21    32.81           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G06320.1   1.6e-18    32.98           NDR1/HIN1-like 3
AT3G11650.1   1.1e-17    29.31           NDR1/HIN1-like 2
Match Name                         E-value     Identity (%)    Description
gi|778659278|ref|XP_011654115.1|   3.9e-107    100.00          PREDICTED: uncharacterized protein At1g08160 [Cucumis sativus]
gi|659072246|ref|XP_008464346.1|   5.3e-104    96.45           PREDICTED: uncharacterized protein At1g08160 [Cucumis melo]
gi|703150687|ref|XP_010109925.1|   2.7e-55     53.33           hypothetical protein L484_011767 [Morus notabilis]
gi|356546690|ref|XP_003541756.1|   1.3e-54     53.69           PREDICTED: uncharacterized protein At1g08160-like [Glycine max]
gi|356554941|ref|XP_003545799.1|   1.5e-53     50.25           PREDICTED: uncharacterized protein At1g08160 [Glycine max]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004864   LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function
This gene is associated with the following unigenes:
Unigene Name   Analysis Name                         Sequence type in Unigene
CU096220       cucumber EST collection version 3.0   transcribed_cluster
CU098665       cucumber EST collection version 3.0   transcribed_cluster
CU123397       cucumber EST collection version 3.0   transcribed_cluster
CU129543       cucumber EST collection version 3.0   transcribed_cluster
CU142257       cucumber EST collection version 3.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa1G132720.1   Csa1G132720.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
CU129543       CU129543      transcribed_cluster
CU096220       CU096220      transcribed_cluster
CU098665       CU098665      transcribed_cluster
CU123397       CU123397      transcribed_cluster
CU142257       CU142257      transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term    IPR Description                               Source    Source Term   Source Description   Alignment
IPR004864   Late embryogenesis abundant protein, LEA-14   PFAM      PF03168       LEA_2                coord: 74..175; score: 1.1
None        No IPR available                              PANTHER   PTHR31852     FAMILY NOT NAMED     coord: 5..193; score: 3.1