CSPI04G16880 (gene) Wild cucumber (PI 183967)

Name: CSPI04G16880
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Location: Chr4 : 14257046 .. 14258124 (-)
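The location field can be turned into a span length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` and `span_length` are hypothetical helper names introduced here for illustration.

```python
import re

def parse_location(text):
    """Parse a value such as 'Chr4 : 14257046 .. 14258124 (-)'.

    Returns (chromosome, start, end, strand).
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr4 : 14257046 .. 14258124 (-)")
print(chrom, strand, span_length(start, end))  # prints: Chr4 - 1079
```

Under that assumption the feature spans 1079 bases on the minus strand of Chr4.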
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GGCGTAATAAATATTAATGGTTGCTAAAAGATGAAAGAAGAAGAAGAAGAAGGAAATGGAGAAACACTTTTCTCCATCTACCTGTTTCCTACTCCAAAATTAATTTCAATTAGTTGAGAGTTAGAATAATCCCCCAATTTTTCAAATGTCTCCATAAATTCCCAATTTCTCACTATTCTTCTCCCAAAATCTGCCATGGAAATCGCTTCTTCCTCCTCCTCTTTAATCAAAGACCCCAAATCCACTCAATCCACCGCCGCCGCCGCCACCCGTTCCCGGAAACGCCGCAACACCTGCATCGGAATCTCCATCGCCATCCTCCTCCTTCTCATCATCATAATCATCATTCTCGCCTTCACAGTCTTCAAAGCCAAACGCCCAATCACCACCGTCAATTCCGTTGCCCTAGCCGATCTCGACGTGTCGCTAAACCTAGCCGGAGTCTCCGTCGACATCAACGTCACTCTAATCGCCGACATCGCAATCACGAACCCTAACAAGGTCGGATTCAGCTACAAGAACAGCACCGCGTTTTTGAATTACAGAGGGGAATTGGTCGGAGAAGCGCCGATTATGGCTGGAAAGATCGACGCGGGGGAGAGGAAGGAGATGAATATCACGCTGACGATTATGGCAGATCGATTACTGAAGACGACGACGGTTTTTACGGATGCGGTGGCGGGATCTATGCCGTTGAATACGTATACGAGGATTTCAGGTAAGGTGAAGATTTTGGGGATTTTTAATATTCATGTGGTTTCAAGTACGTCTTGTGATTTCAATGTGGATATATCGGAGAGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGATTGATTGATTGATAGATACAGCTTCTTCTTGTTTTGTTCTTAGTAAATTATGAAAGAGATTTGTAGGTTGAATTAGGGATGTTTTATAGAGGGTTGTAATTGATTGATTTGGTACTTAGTTACTATTCTTGTACTAAACTTCCCTCTCTCAATGGTTGTTATTTTCTTAATGTTAAATTACAAGTTTAATCCTAAATGGTGAGCATCAATACCATCGGTAATCCATTTTTTAACCA

mRNA sequence

ATGGAAATCGCTTCTTCCTCCTCCTCTTTAATCAAAGACCCCAAATCCACTCAATCCACCGCCGCCGCCGCCACCCGTTCCCGGAAACGCCGCAACACCTGCATCGGAATCTCCATCGCCATCCTCCTCCTTCTCATCATCATAATCATCATTCTCGCCTTCACAGTCTTCAAAGCCAAACGCCCAATCACCACCGTCAATTCCGTTGCCCTAGCCGATCTCGACGTGTCGCTAAACCTAGCCGGAGTCTCCGTCGACATCAACGTCACTCTAATCGCCGACATCGCAATCACGAACCCTAACAAGGTCGGATTCAGCTACAAGAACAGCACCGCGTTTTTGAATTACAGAGGGGAATTGGTCGGAGAAGCGCCGATTATGGCTGGAAAGATCGACGCGGGGGAGAGGAAGGAGATGAATATCACGCTGACGATTATGGCAGATCGATTACTGAAGACGACGACGGTTTTTACGGATGCGGTGGCGGGATCTATGCCGTTGAATACGTATACGAGGATTTCAGGTAAGGTGAAGATTTTGGGGATTTTTAATATTCATGTGGTTTCAAGTACGTCTTGTGATTTCAATGTGGATATATCGGAGAGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGA

Coding sequence (CDS)

ATGGAAATCGCTTCTTCCTCCTCCTCTTTAATCAAAGACCCCAAATCCACTCAATCCACCGCCGCCGCCGCCACCCGTTCCCGGAAACGCCGCAACACCTGCATCGGAATCTCCATCGCCATCCTCCTCCTTCTCATCATCATAATCATCATTCTCGCCTTCACAGTCTTCAAAGCCAAACGCCCAATCACCACCGTCAATTCCGTTGCCCTAGCCGATCTCGACGTGTCGCTAAACCTAGCCGGAGTCTCCGTCGACATCAACGTCACTCTAATCGCCGACATCGCAATCACGAACCCTAACAAGGTCGGATTCAGCTACAAGAACAGCACCGCGTTTTTGAATTACAGAGGGGAATTGGTCGGAGAAGCGCCGATTATGGCTGGAAAGATCGACGCGGGGGAGAGGAAGGAGATGAATATCACGCTGACGATTATGGCAGATCGATTACTGAAGACGACGACGGTTTTTACGGATGCGGTGGCGGGATCTATGCCGTTGAATACGTATACGAGGATTTCAGGTAAGGTGAAGATTTTGGGGATTTTTAATATTCATGTGGTTTCAAGTACGTCTTGTGATTTCAATGTGGATATATCGGAGAGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGA
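The coding sequence above can be checked against the protein shown in the BLAST query lines by translating it with the standard genetic code. The following is a minimal sketch in plain Python; `CODON_TABLE` and `translate` are hypothetical names, and using the standard nuclear code is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGAAATCGCTTCTTCCTCCTCCTCTTTA"))  # prints: MEIASSSSSL
print(translate("CATACTAAGATCTGA"))                 # prints: HTKI
```

Both fragments agree with the start (`MEIASSSSSL...`) and end (`...HTKI`) of the query protein in the alignments that follow.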
BLAST of CSPI04G16880 vs. Swiss-Prot
Match: Y1465_ARATH (Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana GN=At1g64065 PE=2 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 7.3e-06
Identity = 46/178 (25.84%), Positives = 88/178 (49.44%), Query Frame = 1

Query: 34  CIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALADLDVSLNLAGVSVDINVTLIA 93
           C+  S+ I++++  + +IL+    +  +P     S++  DL    N    +   N TL++
Sbjct: 39  CLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGN--STNPYFNATLVS 98

Query: 94  DIAITNPNKVGFSYKNSTAFLNYRGE-LVGEAPIMAGKIDAGERKEM-NITLTIMADRLL 153
           DI+I N N   F +++ST  + Y    +VGE  I   +++A +   +  + + I + RLL
Sbjct: 99  DISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLL 158

Query: 154 KTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSCDFNVDISERKIGDQQC 210
            T  +  D   G + L +   + G++K+LG      VS  SC   ++++ R I +  C
Sbjct: 159 DTKDLDKDLRLGFLELRSVAEVRGRIKVLG-RKRWKVSVMSCTMRLNLTGRFIQNLLC 213
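The percentages in an HSP summary line are the identical and positive column counts divided by the alignment length. The sketch below recomputes them from the line itself; `parse_hsp` is a hypothetical helper, and the pattern deliberately accepts either spelling of the "Positives" label.

```python
import re

HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "identity_pct": 100.0 * identical / length,
        "positives_pct": 100.0 * positives / length,
    }

hsp = parse_hsp("Identity = 46/178 (25.84%), Positives = 88/178 (49.44%), Query Frame = 1")
print(f"{hsp['identity_pct']:.2f} {hsp['positives_pct']:.2f}")  # prints: 25.84 49.44
```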

BLAST of CSPI04G16880 vs. TrEMBL
Match: A0A0A0L094_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G337360 PE=4 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 1.1e-106
Identity = 215/215 (100.00%), Positives = 215/215 (100.00%), Query Frame = 1

Query: 1   MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK 60
           MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK
Sbjct: 1   MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK 60

Query: 61  RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL 120
           RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL
Sbjct: 61  RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL 120

Query: 121 VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL 180
           VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL
Sbjct: 121 VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL 180

Query: 181 GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 216
           GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI
Sbjct: 181 GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 215

BLAST of CSPI04G16880 vs. TrEMBL
Match: A0A0L9U3N9_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g067800 PE=4 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 1.5e-50
Identity = 103/189 (54.50%), Positives = 141/189 (74.60%), Query Frame = 1

Query: 27  SRKRRNTCIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALADLDVSLNLAGVSVD 86
           S K R  C+ I   +++ L+++I+ILA TVFKAK P T V+S  L D  +SL++A + VD
Sbjct: 5   SGKGRKVCL-IVTGVVIALVLVIVILALTVFKAKHPTTVVDSTKLEDFHMSLDIARLRVD 64

Query: 87  INVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDAGERKEMNITLTIM 146
           +NVTL  D+++ NPNKVGF Y +STA LNYRG+L+GE PI AG+I +GE K  N+TLTIM
Sbjct: 65  LNVTLSTDVSVKNPNKVGFKYSDSTAHLNYRGQLIGEVPIPAGEISSGETKGFNLTLTIM 124

Query: 147 ADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSCDFNVDISERKIGD 206
           ADRLL  + +F+D  +G++PLNT+ RISGKV ILG   +HVVSSTSCD  +++S R +G+
Sbjct: 125 ADRLLSNSQLFSDVTSGTLPLNTFVRISGKVNILGFIKVHVVSSTSCDVAINLSNRTVGN 184

Query: 207 QQCNYHTKI 216
           Q+C Y TK+
Sbjct: 185 QECQYRTKL 192

BLAST of CSPI04G16880 vs. TrEMBL
Match: W9SMP2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022768 PE=4 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 3.4e-50
Identity = 119/224 (53.12%), Positives = 156/224 (69.64%), Query Frame = 1

Query: 1   MEIASSSSSLIKDPKST------QSTAAAATRSRKRRN---TCIGISIAILLLLIIIIII 60
           ME++SSS    K    T       S AAAA RS  RR     CI  ++A +L ++ I +I
Sbjct: 1   MEVSSSSPPPSKKGPPTPTKPSLSSAAAAAGRSPNRRRRLCVCISATVAAVLAIVFIAVI 60

Query: 61  LAFTVFKAKRPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNST 120
           L+ TVFK KRPITTVN V+L DL  SLN+A + +D+NVT+  D+++ NPNKV F Y N+T
Sbjct: 61  LSQTVFKPKRPITTVNDVSLKDL--SLNVARLGIDLNVTVGVDLSVKNPNKVDFRYGNTT 120

Query: 121 AFLNYRGELVGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYT 180
           + L+YRGE VGEA I  G+I AGE   MN+TLT+MADRLL  + V++D +AG +P N+YT
Sbjct: 121 SVLSYRGEQVGEAAIPGGEISAGETVPMNVTLTVMADRLLSRSQVYSDFLAGDVPFNSYT 180

Query: 181 RISGKVKILGIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 216
           RI+G+V ILGIF IHVVS TSC+F VD+S R + DQ+C Y TK+
Sbjct: 181 RIAGRVTILGIFKIHVVSVTSCEFAVDVSNRTVSDQRCTYKTKL 222

BLAST of CSPI04G16880 vs. TrEMBL
Match: B9IGY4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0016s14900g PE=4 SV=2)

HSP 1 Score: 206.1 bits (523), Expect = 4.4e-50
Identity = 102/215 (47.44%), Positives = 156/215 (72.56%), Query Frame = 1

Query: 1   MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK 60
           M++ SS ++ +K         A + +  KRRN C+G++ A++L + ++++IL  TVFK K
Sbjct: 1   MDVESSKAAAMK---------AESPKKHKRRNICLGVTAAVILFIFLLLLILGLTVFKPK 60

Query: 61  RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL 120
           +P TTV+S +++D+ VS ++A + VD+NV+L  D++I NPNKV   YKNS+AFLNYRG++
Sbjct: 61  QPTTTVDSTSISDMKVSFDIARLRVDVNVSLDVDLSIKNPNKVSVKYKNSSAFLNYRGQV 120

Query: 121 VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL 180
           VGEAPI AGKI A + + +N+T+T+MADRLL  +  F+D +AG++P NT T+ISGK  + 
Sbjct: 121 VGEAPIPAGKILADKTQPINVTVTLMADRLLSDSQFFSDVMAGTIPFNTLTKISGKASVF 180

Query: 181 GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 216
            +FN+H+ S++SCD  V +S R IGDQ+C Y TK+
Sbjct: 181 NLFNVHITSTSSCDLLVFVSNRTIGDQKCKYKTKL 206

BLAST of CSPI04G16880 vs. TrEMBL
Match: A0A061EK34_THECC (Harpin-induced 1, putative OS=Theobroma cacao GN=TCM_019926 PE=4 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 4.4e-50
Identity = 114/216 (52.78%), Positives = 152/216 (70.37%), Query Frame = 1

Query: 1   MEIASSSSSLIKDPKSTQSTAAAATRSRKR-RNTCIGISIAILLLLIIIIIILAFTVFKA 60
           ME+ SS S+  K      S+AA A R R++ RN C  + +A+LL +I++I+ILAFTVFKA
Sbjct: 1   MEVESSGSTNAKSMDERASSAARALRRRRKCRNICFAV-MAVLLFIIVLIVILAFTVFKA 60

Query: 61  KRPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGE 120
           KRP+TT++SV+LA+L  SL+L  + V +N +L  D++I NPNKV F Y +S+A LNYRG+
Sbjct: 61  KRPVTTIDSVSLANLKFSLDLVRLQVLLNASLDVDLSIKNPNKVAFKYTDSSAQLNYRGQ 120

Query: 121 LVGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKI 180
            VGE PI AGK+ A     MN+TLT+MADRLL  +  F+D   G +PLN + RI GKV +
Sbjct: 121 QVGEVPIPAGKMPADATVPMNLTLTLMADRLLSDSQFFSDVSGGELPLNAFARIPGKVNL 180

Query: 181 LGIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 216
           L +F IHVVSSTSCDF V +S   +GDQ C Y TK+
Sbjct: 181 LNLFKIHVVSSTSCDFTVFLSNSTVGDQDCKYKTKL 215

BLAST of CSPI04G16880 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 182.2 bits (461), Expect = 3.4e-46
Identity = 96/202 (47.52%), Positives = 144/202 (71.29%), Query Frame = 1

Query: 17  TQST-AAAATRSRKRRNT--CIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALAD 76
           TQS     A + R++RN   CI  +I ++LL+ I+I+ILAFT+FK KRP TT++SV +  
Sbjct: 34  TQSANTGTAKKLRRKRNCKICICFTILLILLIAIVIVILAFTLFKPKRPTTTIDSVTVDR 93

Query: 77  LDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDA 136
           L  S+N   + V +N+TL  D+++ NPN++GFSY +S+A LNYRG+++GEAP+ A +I A
Sbjct: 94  LQASVNPLLLKVLLNLTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAA 153

Query: 137 GERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSC 196
            +   +NITLT+MADRLL  T + +D +AG +PLNT+ +++GKV +L IF I V SS+SC
Sbjct: 154 RKTVPLNITLTLMADRLLSETQLLSDVMAGVIPLNTFVKVTGKVTVLKIFKIKVQSSSSC 213

Query: 197 DFNVDISERKIGDQQCNYHTKI 216
           D ++ +S+R +  Q C Y TK+
Sbjct: 214 DLSISVSDRNVTSQHCKYSTKL 235

BLAST of CSPI04G16880 vs. TAIR10
Match: AT2G46150.1 (AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 100.9 bits (250), Expect = 1.0e-21
Identity = 69/199 (34.67%), Positives = 105/199 (52.76%), Query Frame = 1

Query: 14  PKSTQSTAAAATRSRKRRNTCIGISI-AILLLLIIIIIILAFTVFKAKRPITTVNSVALA 73
           P S +S +      R R      I + A  L+L  I++ L FTVF+ K PI  +N V + 
Sbjct: 17  PVSDESASNIKNTHRSRNRIKCSICVTATSLILTTIVLTLVFTVFRVKDPIIKMNGVMVN 76

Query: 74  DLDVSLNLAGVSV-DINVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKI 133
            LD       V +   N+++I D+++ NPN   F Y N+T  + Y+G LVGEA  + GK 
Sbjct: 77  GLDSVTGTNQVQLLGTNISMIVDVSVKNPNTASFKYSNTTTDIYYKGTLVGEAHGLPGKA 136

Query: 134 DAGERKEMNITLTIMADRLLKTTTVFTD-AVAGSMPLNTYTRISGKVKILGIFNIHVVSS 193
                  MN+T+ IM DR+L    +  + + +G + + +YTR+ GKVKI+GI   HV   
Sbjct: 137 RPHRTSRMNVTVDIMLDRILSDPGLGREISRSGLVNVWSYTRVGGKVKIMGIVKKHVTVK 196

Query: 194 TSCDFNVDISERKIGDQQC 210
            +C   V+I+ + I D  C
Sbjct: 197 MNCTMAVNITGQAIQDVDC 215

BLAST of CSPI04G16880 vs. TAIR10
Match: AT1G64450.1 (AT1G64450.1 Glycine-rich protein family)

HSP 1 Score: 62.0 bits (149), Expect = 5.2e-10
Identity = 45/121 (37.19%), Positives = 67/121 (55.37%), Query Frame = 1

Query: 26  RSRKRRNTC-IGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALADLDVSLNLAGVS 85
           RS  R N     ++   LL+L+++++++ FTVFK K P  +VN+V L    VS N A   
Sbjct: 9   RSSGRTNLASCAVATVFLLILLVVLLVVYFTVFKPKDPKISVNAVQLPSFAVSNNTA--- 68

Query: 86  VDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDAGERKEMNITLT 145
              N +    +A+ NPN+  FS+ +S+  L Y G  VG   I AGKID+G  + M  T T
Sbjct: 69  ---NFSFSQYVAVRNPNRAVFSHYDSSIQLLYSGNQVGFMFIPAGKIDSGRIQYMAATFT 123

BLAST of CSPI04G16880 vs. TAIR10
Match: AT4G23930.1 (AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 57.8 bits (138), Expect = 9.8e-09
Identity = 45/184 (24.46%), Positives = 87/184 (47.28%), Query Frame = 1

Query: 38  SIAILLLLIIIIIILAFTV----FKAKRPITTVNSVALADLDVSLNLAGVSVDINVTLIA 97
           S A+  L I+ +II A TV    F+ + P  +V SV +    V+      +  ++ T   
Sbjct: 10  SCAVATLFIVFLIIAALTVYLTVFRPRDPEISVTSVKVPSFSVA------NSSVSFTFSQ 69

Query: 98  DIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDAGERKEMNITLTIMADRLLKT 157
             A+ NPN+  FS+ N+   L Y G  +G   + AG+I++G  K M  T ++ +  L   
Sbjct: 70  FSAVRNPNRAAFSHYNNVIQLFYYGNRIGYTFVPAGEIESGRTKRMLATFSVQSFPLAAA 129

Query: 158 TT--------VFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSCDFNVDISERKIG 210
           ++          +D    ++ + +   ++G+V++LG+F   + +  +C   +  S+  I 
Sbjct: 130 SSSQISAAQFQNSDRSGSTVEIESKLEMAGRVRVLGLFTHRIAAKCNCRIAISSSDGSIV 187

BLAST of CSPI04G16880 vs. TAIR10
Match: AT1G64065.1 (AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 52.4 bits (124), Expect = 4.1e-07
Identity = 46/178 (25.84%), Positives = 88/178 (49.44%), Query Frame = 1

Query: 34  CIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALADLDVSLNLAGVSVDINVTLIA 93
           C+  S+ I++++  + +IL+    +  +P     S++  DL    N    +   N TL++
Sbjct: 39  CLVYSLTIIVIIFALCLILSSIFLRISKPEIETRSISTRDLRSGGN--STNPYFNATLVS 98

Query: 94  DIAITNPNKVGFSYKNSTAFLNYRGE-LVGEAPIMAGKIDAGERKEM-NITLTIMADRLL 153
           DI+I N N   F +++ST  + Y    +VGE  I   +++A +   +  + + I + RLL
Sbjct: 99  DISIRNSNFGAFEFEDSTLRVVYADHGVVGETKIEGRRVEAHKTVRITGVVVEIGSFRLL 158

Query: 154 KTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSCDFNVDISERKIGDQQC 210
            T  +  D   G + L +   + G++K+LG      VS  SC   ++++ R I +  C
Sbjct: 159 DTKDLDKDLRLGFLELRSVAEVRGRIKVLG-RKRWKVSVMSCTMRLNLTGRFIQNLLC 213

BLAST of CSPI04G16880 vs. NCBI nr
Match: gi|449449825|ref|XP_004142665.1| (PREDICTED: uncharacterized protein LOC101208230 [Cucumis sativus])

HSP 1 Score: 394.0 bits (1011), Expect = 1.6e-106
Identity = 215/215 (100.00%), Positives = 215/215 (100.00%), Query Frame = 1

Query: 1   MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK 60
           MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK
Sbjct: 1   MEIASSSSSLIKDPKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAK 60

Query: 61  RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL 120
           RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL
Sbjct: 61  RPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGEL 120

Query: 121 VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL 180
           VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL
Sbjct: 121 VGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKIL 180

Query: 181 GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 216
           GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI
Sbjct: 181 GIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 215

BLAST of CSPI04G16880 vs. NCBI nr
Match: gi|659126678|ref|XP_008463309.1| (PREDICTED: uncharacterized protein LOC103501497 [Cucumis melo])

HSP 1 Score: 370.2 bits (949), Expect = 2.5e-99
Identity = 204/216 (94.44%), Positives = 208/216 (96.30%), Query Frame = 1

Query: 1   MEIASSSSSLIKDPKSTQS-TAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKA 60
           MEIASSSSS IKDPKSTQS  AAAA RSRKRRNTCIGISIAILLLLII+IIILAFTVFKA
Sbjct: 1   MEIASSSSSSIKDPKSTQSAAAAAAARSRKRRNTCIGISIAILLLLIILIIILAFTVFKA 60

Query: 61  KRPITTVNSVALADLDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGE 120
           KRPITTVNSVALADLDVSLNLA VSVDINVTLIA IAITNPNKVGFSYKNSTAFLNYRGE
Sbjct: 61  KRPITTVNSVALADLDVSLNLARVSVDINVTLIAGIAITNPNKVGFSYKNSTAFLNYRGE 120

Query: 121 LVGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKI 180
           LVGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVF+D VAGSMPLNTY RISGKVKI
Sbjct: 121 LVGEAPIMAGKIDAGERKEMNITLTIMADRLLKTTTVFSDVVAGSMPLNTYARISGKVKI 180

Query: 181 LGIFNIHVVSSTSCDFNVDISERKIGDQQCNYHTKI 216
           LGIFNIHVVS+TSCDFNVDISERK+GDQQCNYHTKI
Sbjct: 181 LGIFNIHVVSTTSCDFNVDISERKVGDQQCNYHTKI 216

BLAST of CSPI04G16880 vs. NCBI nr
Match: gi|1009159365|ref|XP_015897772.1| (PREDICTED: uncharacterized protein LOC107431396 [Ziziphus jujuba])

HSP 1 Score: 219.9 bits (559), Expect = 4.2e-54
Identity = 112/202 (55.45%), Positives = 156/202 (77.23%), Query Frame = 1

Query: 14  PKSTQSTAAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALAD 73
           P S + T +   R+R R   C+G+ +A++++  +II+IL+ TVFK KRP+TT+++V+LAD
Sbjct: 2   PSSPKPTESGRRRNRSRA-ICLGV-MAVVVVAAVIIVILSLTVFKPKRPVTTIDAVSLAD 61

Query: 74  LDVSLNLAGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDA 133
           +DVSLN+A ++VD+NVTL  D+++ NPNKVGF Y +STAFLNYRG+ VGEA I AG I +
Sbjct: 62  MDVSLNVAKLAVDLNVTLDVDLSVRNPNKVGFKYADSTAFLNYRGQTVGEASIPAGGISS 121

Query: 134 GERKEMNITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSC 193
            E K MN+TLT+MADRLL T+ +++D +AG++P N  TRISGKV ILG+  +HVVS+TSC
Sbjct: 122 DETKPMNLTLTVMADRLLSTSQIYSDVLAGTVPFNARTRISGKVSILGVVKVHVVSTTSC 181

Query: 194 DFNVDISERKIGDQQCNYHTKI 216
           DFNV +S R +G Q C Y TK+
Sbjct: 182 DFNVFVSNRSVGGQTCQYKTKL 201

BLAST of CSPI04G16880 vs. NCBI nr
Match: gi|694390601|ref|XP_009370863.1| (PREDICTED: uncharacterized protein LOC103960171 [Pyrus x bretschneideri])

HSP 1 Score: 215.7 bits (548), Expect = 8.0e-53
Identity = 111/195 (56.92%), Positives = 147/195 (75.38%), Query Frame = 1

Query: 21  AAAATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALADLDVSLNL 80
           A+  +RSR  RN C+  + A++L+  I+++IL  TVFKAK P TTVNSV L DLD++LN+
Sbjct: 9   ASPPSRSRTCRNVCLAAT-AVVLVATIVLVILCLTVFKAKDPTTTVNSVVLKDLDLALNI 68

Query: 81  AGVSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDAGERKEMN 140
             +SVD+N+TL  D+++ NPNKVGF YKNSTAFLNYRG  VGEA I +GKI A   K MN
Sbjct: 69  PRLSVDVNLTLGVDLSVNNPNKVGFKYKNSTAFLNYRGTNVGEAQIGSGKIFADRTKSMN 128

Query: 141 ITLTIMADRLLKTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSCDFNVDIS 200
           +TLTIMADRLL  + +F+D VAG++PLNT T++SG+  +LGIF IHVVS++SCDF+VD+ 
Sbjct: 129 VTLTIMADRLLGKSELFSDVVAGTLPLNTLTKVSGEASVLGIFKIHVVSTSSCDFSVDVG 188

Query: 201 ERKIGDQQCNYHTKI 216
              +G Q C Y  K+
Sbjct: 189 NITVGQQHCTYKIKL 202

BLAST of CSPI04G16880 vs. NCBI nr
Match: gi|657992299|ref|XP_008388400.1| (PREDICTED: uncharacterized protein LOC103450790 [Malus domestica])

HSP 1 Score: 214.2 bits (544), Expect = 2.3e-52
Identity = 113/194 (58.25%), Positives = 147/194 (75.77%), Query Frame = 1

Query: 23  AATRSRKRRNTCIGISIAILLLLIIIIIILAFTVFKAKRPITTVNSVALADLDVSLNLAG 82
           A+ R R  RN C+  + A + +  I+++IL  TVFKAK P TTVNS  L DLDVSLN+  
Sbjct: 9   ASPRRRTCRNVCLAAT-AXVFVATIVLVILCLTVFKAKNPTTTVNSAVLKDLDVSLNIPR 68

Query: 83  VSVDINVTLIADIAITNPNKVGFSYKNSTAFLNYRGELVGEAPIMAGKIDAGERKEMNIT 142
           VSVD N+TL  D+++ NPNKVGF YKNSTA LNYRG  VGEA I +GKI A + K MN+T
Sbjct: 69  VSVDXNLTLGVDLSVKNPNKVGFKYKNSTASLNYRGTQVGEAQIGSGKISADQTKPMNVT 128

Query: 143 LTIMADRLL-KTTTVFTDAVAGSMPLNTYTRISGKVKILGIFNIHVVSSTSCDFNVDISE 202
           LTIMADRLL K++ +F+D  AG++PLNT+T+ISGKV +LGIF IHVVS++SCDF++D+  
Sbjct: 129 LTIMADRLLGKSSELFSDVRAGTLPLNTFTKISGKVIVLGIFKIHVVSTSSCDFSIDVGN 188

Query: 203 RKIGDQQCNYHTKI 216
           R +G Q+C + TK+
Sbjct: 189 RTVGQQRCTHKTKL 201

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
Y1465_ARATH  7.3e-06  25.84         Late embryogenesis abundant protein At1g64065 OS=Arabidopsis thaliana GN=At1g640...

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L094_CUCSA  1.1e-106  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_4G337360 PE=4 SV=1
A0A0L9U3N9_PHAAN  1.5e-50   54.50         Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g067800 PE=4 SV=1
W9SMP2_9ROSA      3.4e-50   53.13         Uncharacterized protein OS=Morus notabilis GN=L484_022768 PE=4 SV=1
B9IGY4_POPTR      4.4e-50   47.44         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0016s14900g PE=4 SV=2
A0A061EK34_THECC  4.4e-50   52.78         Harpin-induced 1, putative OS=Theobroma cacao GN=TCM_019926 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT3G54200.1  3.4e-46  47.52         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT2G46150.1  1.0e-21  34.67         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G64450.1  5.2e-10  37.19         Glycine-rich protein family
AT4G23930.1  9.8e-09  24.46         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G64065.1  4.1e-07  25.84         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|449449825|ref|XP_004142665.1|   1.6e-106  100.00        PREDICTED: uncharacterized protein LOC101208230 [Cucumis sativus]
gi|659126678|ref|XP_008463309.1|   2.5e-99   94.44         PREDICTED: uncharacterized protein LOC103501497 [Cucumis melo]
gi|1009159365|ref|XP_015897772.1|  4.2e-54   55.45         PREDICTED: uncharacterized protein LOC107431396 [Ziziphus jujuba]
gi|694390601|ref|XP_009370863.1|   8.0e-53   56.92         PREDICTED: uncharacterized protein LOC103960171 [Pyrus x bretschneideri]
gi|657992299|ref|XP_008388400.1|   2.3e-52   58.25         PREDICTED: uncharacterized protein LOC103450790 [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G16880.1  CSPI04G16880.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                              Source   Source Term     Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168         LEA_2                                                         coord: 95..188, score: 9.2
None       No IPR available                             PANTHER  PTHR31852       FAMILY NOT NAMED                                              coord: 28..215, score: 4.4
None       No IPR available                             PANTHER  PTHR31852:SF43  LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 28..215, score: 4.4
None       No IPR available                             unknown  SSF117070       LEA14-like                                                    coord: 85..155, score: 9.4