Csa4G269730 (gene) Cucumber (Chinese Long) v2

NameCsa4G269730
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionHarpin-induced protein-related-like; contains IPR004864 (Late embryogenesis abundant protein, LEA-14)
LocationChr4 : 10696387 .. 10697198 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCCCTTCTCATTCCTCCATTTCTGTTCTATACTGTGTTTGATCTTCCATGGCCGACCGTGTTCACCCCACTCTTAACCCTGATTCCGCTTCTAATAATCCGCTCAAAAGCCCTCCTTCTGCTACCTACGTCATCCAAATCCCCAAAGACCAAGTCTACCGAATTCCTCCTCCTGAAAACGCCGCCCGCTTCAACCTCTACACTCGCCACCACCACCGCCCCTCCCCCTGCCGCCGTTTCCTCTGCTTCATTCTCCTACTCCTCCTCCTCTCGGCCATCACTTCCGCTCTCGTCTTCCTAATCCTCCAACCTGACCTTCCTCGCTTCTCCATTCTCGCTGTATCCATCAGCAGAATCAAACCAAACACCACCTCATTCTCTCCCCAATTCAATGTCACAATACGAGCAGAAAACCACAACAAAAATATCGGAATCTACTACGAGAAAAACAGCACCGTCTCCATGAATTTATCGGATGTGATGCTGTGCGAAGGTGCATTGCCGTTGTTGTACCAGCCACCGAGGAACGTGACGGTAATGAGAGTAAAGGTGAAAGGATCTGGTATCAGATTATCGAGTAGTACGGGTAAAGCATTTGAGGATTGGGAGAAGGAAGGGAAAACACTGAGAATGAAGGTGGATGTGAGAGGTCCAATGAAGATGAAGTTGTACTGGATGGAGATGAGATGGAGGATTAGGGCAAAAGTAACATGCAAGATATTGGTGAAGAAGGAAATGGGGAAGACGAAAGTGATGGAGGAGAAGTGTGATCATAGCATGAAGCTGTGGTAGAAAAACACTATCTGTACCCAT

mRNA sequence

ATGGCCGACCGTGTTCACCCCACTCTTAACCCTGATTCCGCTTCTAATAATCCGCTCAAAAGCCCTCCTTCTGCTACCTACGTCATCCAAATCCCCAAAGACCAAGTCTACCGAATTCCTCCTCCTGAAAACGCCGCCCGCTTCAACCTCTACACTCGCCACCACCACCGCCCCTCCCCCTGCCGCCGTTTCCTCTGCTTCATTCTCCTACTCCTCCTCCTCTCGGCCATCACTTCCGCTCTCGTCTTCCTAATCCTCCAACCTGACCTTCCTCGCTTCTCCATTCTCGCTGTATCCATCAGCAGAATCAAACCAAACACCACCTCATTCTCTCCCCAATTCAATGTCACAATACGAGCAGAAAACCACAACAAAAATATCGGAATCTACTACGAGAAAAACAGCACCGTCTCCATGAATTTATCGGATGTGATGCTGTGCGAAGGTGCATTGCCGTTGTTGTACCAGCCACCGAGGAACGTGACGGTAATGAGAGTAAAGGTGAAAGGATCTGGTATCAGATTATCGAGTAGTACGGGTAAAGCATTTGAGGATTGGGAGAAGGAAGGGAAAACACTGAGAATGAAGGTGGATGTGAGAGGTCCAATGAAGATGAAGTTGTACTGGATGGAGATGAGATGGAGGATTAGGGCAAAAGTAACATGCAAGATATTGGTGAAGAAGGAAATGGGGAAGACGAAAGTGATGGAGGAGAAGTGTGATCATAGCATGAAGCTGTGGTAG

Coding sequence (CDS)

ATGGCCGACCGTGTTCACCCCACTCTTAACCCTGATTCCGCTTCTAATAATCCGCTCAAAAGCCCTCCTTCTGCTACCTACGTCATCCAAATCCCCAAAGACCAAGTCTACCGAATTCCTCCTCCTGAAAACGCCGCCCGCTTCAACCTCTACACTCGCCACCACCACCGCCCCTCCCCCTGCCGCCGTTTCCTCTGCTTCATTCTCCTACTCCTCCTCCTCTCGGCCATCACTTCCGCTCTCGTCTTCCTAATCCTCCAACCTGACCTTCCTCGCTTCTCCATTCTCGCTGTATCCATCAGCAGAATCAAACCAAACACCACCTCATTCTCTCCCCAATTCAATGTCACAATACGAGCAGAAAACCACAACAAAAATATCGGAATCTACTACGAGAAAAACAGCACCGTCTCCATGAATTTATCGGATGTGATGCTGTGCGAAGGTGCATTGCCGTTGTTGTACCAGCCACCGAGGAACGTGACGGTAATGAGAGTAAAGGTGAAAGGATCTGGTATCAGATTATCGAGTAGTACGGGTAAAGCATTTGAGGATTGGGAGAAGGAAGGGAAAACACTGAGAATGAAGGTGGATGTGAGAGGTCCAATGAAGATGAAGTTGTACTGGATGGAGATGAGATGGAGGATTAGGGCAAAAGTAACATGCAAGATATTGGTGAAGAAGGAAATGGGGAAGACGAAAGTGATGGAGGAGAAGTGTGATCATAGCATGAAGCTGTGGTAG

Protein sequence

MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSPCRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKCDHSMKLW*
BLAST of Csa4G269730 vs. Swiss-Prot
Match: NHL12_ARATH (NDR1/HIN1-like protein 12 OS=Arabidopsis thaliana GN=NHL12 PE=2 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 5.8e-07
Identity = 47/169 (27.81%), Postives = 77/169 (45.56%), Query Frame = 1

Query: 61  CRRFLCFILLLLLLSAITSALVFLILQPDLPRF-----SILAVSISRIKPNTTSFSPQFN 120
           C   + FI+++L    IT  LV++ILQP  PRF     ++ A ++S+  PN    +  F 
Sbjct: 22  CGVIIGFIIIVL----ITIFLVWIILQPTKPRFILQDATVYAFNLSQ--PNL--LTSNFQ 81

Query: 121 VTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRL 180
           +TI + N N  IGIYY++    +   +  +    A+P  YQ  +   V    V G+ + +
Sbjct: 82  ITIASRNRNSRIGIYYDRLHVYATYRNQQITLRTAIPPTYQGHKEDNVWSPFVYGNSVPI 141

Query: 181 SSSTGKAFEDWEKEG-KTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCK 224
           +     A  D +  G  TL ++ D R           +RW++   +T K
Sbjct: 142 APFNAVALGDEQNRGFVTLIIRADGR-----------VRWKVGTLITGK 171

BLAST of Csa4G269730 vs. TrEMBL
Match: A0A0A0L0D0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G269730 PE=4 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 9.0e-140
Identity = 247/247 (100.00%), Postives = 247/247 (100.00%), Query Frame = 1

Query: 1   MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP 60
           MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP
Sbjct: 1   MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP 60

Query: 61  CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA 120
           CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA
Sbjct: 61  CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA 120

Query: 121 ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG 180
           ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG
Sbjct: 121 ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG 180

Query: 181 KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC 240
           KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC
Sbjct: 181 KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC 240

Query: 241 DHSMKLW 248
           DHSMKLW
Sbjct: 241 DHSMKLW 247

BLAST of Csa4G269730 vs. TrEMBL
Match: A0A0A0LX01_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G601000 PE=4 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 3.3e-49
Identity = 109/260 (41.92%), Postives = 168/260 (64.62%), Query Frame = 1

Query: 1   MADRVHPTLN-PDSASNNPLK------SPPSATYVIQIPKDQVYRIPPPENAARFNLYTR 60
           MADRVHPT + P  ++++ L       SPP  TYVIQ+PKDQ+YR+PPPENA RF LYTR
Sbjct: 1   MADRVHPTADSPRPSTSSTLSDTTKPSSPPPGTYVIQLPKDQIYRLPPPENAHRFKLYTR 60

Query: 61  H-HHRPSPCRR----FLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTT 120
             H R + CR      L  + +L++L  IT A+ + +++P  P +SI A+SIS +   T+
Sbjct: 61  QSHRRRNRCRSCLFCLLAILAILIILLGITLAVFYFVVRPKSPNYSIDAISISGLNNLTS 120

Query: 121 S-FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVK 180
           S  SP FN+++RA+N NK IGIYY   S+V +  S+  L EG LP  +QP +NV+V+R  
Sbjct: 121 SAISPVFNLSVRADNPNKKIGIYYLTGSSVRIYSSNEKLSEGVLPDFFQPSKNVSVLRAV 180

Query: 181 VKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVK 240
           V+G+G+ LSS       +W K+ + + +KV++  P+K+K+  ++  W+I+ KV C + V 
Sbjct: 181 VRGAGVNLSSGAKNEIIEWVKQ-RAVLLKVEIGVPIKVKIGSVK-SWKIKVKVNCDVTVD 240

Query: 241 KEMGKTKVMEEKCDHSMKLW 248
           +     K++++ CD+S+K+W
Sbjct: 241 ELTAAAKIVKKNCDYSVKIW 258

BLAST of Csa4G269730 vs. TrEMBL
Match: M5W0N4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010043mg PE=4 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 5.6e-49
Identity = 108/268 (40.30%), Postives = 156/268 (58.21%), Query Frame = 1

Query: 1   MADRVHPTLNPDSASNNPLKS------------PPSATYVIQIPKDQVYRIPPPENAARF 60
           MADRVHP  +P      PL              PP  TYVIQIPKDQVYR+PPPENA+R+
Sbjct: 1   MADRVHPRDSPFHTETTPLSLSRPSPPDSEKPVPPPGTYVIQIPKDQVYRVPPPENASRY 60

Query: 61  NLYTRHHHRPSPCRRFLCFILLLL----LLSAITSALVFLILQPDLPRFSILAVSISRIK 120
             YT    R S C    C+ L LL     LSA  + + +L+++P+ P +S+ +++     
Sbjct: 61  QSYTHRKTRRSSCHCCCCWFLGLLAAIVFLSAAAAGIFYLVVRPEAPNYSVESIAFKGFN 120

Query: 121 PNTTS-----FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPR 180
             TTS      SP+ +VT+RA+N NK IGIYYE+ S+V +  SD+ LC+G LP  YQP +
Sbjct: 121 LTTTSSPPSAISPEIHVTVRAQNPNKKIGIYYERESSVKLFYSDIKLCDGVLPAFYQPSK 180

Query: 181 NVTVMRVKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAK 240
           NVT  R  + GSGI L+S+  K   D +K+GK + +++D+R P+++K+  ++  W I  K
Sbjct: 181 NVTEFRTALTGSGIELTSAVQKGLVDAQKQGK-VPLELDLRAPVRIKVGPIK-TWTITVK 240

Query: 241 VTCKILVKKEMGKTKVMEEKCDHSMKLW 248
           V C + V K      ++   CD+S+  W
Sbjct: 241 VACHLTVNKLTADANIVSRDCDYSVDPW 266

BLAST of Csa4G269730 vs. TrEMBL
Match: A0A087GRC4_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA6G240400 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.1e-48
Identity = 109/262 (41.60%), Postives = 162/262 (61.83%), Query Frame = 1

Query: 1   MADRVHPTLNPD---------SASNNPLK-SPPSATYVIQIPKDQVYRIPPPENAARFNL 60
           MA+RV+P  +P          S+   P K +PP +TYVIQ+PKDQ+YRIPPPENA R   
Sbjct: 1   MAERVYPADSPPESGQFSGNFSSGEFPRKPAPPPSTYVIQVPKDQIYRIPPPENAHRLQY 60

Query: 61  YTRHHHRPSPCRRFLCFIL----LLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPN 120
            +R     S CR  +C +L    ++L+L+ I+ A+++L+  P+ PR+S+   S+S I   
Sbjct: 61  LSRKKVNRSRCRCCICSVLATLFIVLVLAGISLAVLYLVFHPEAPRYSVEGFSVSGINLT 120

Query: 121 TTS-FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMR 180
           ++S  SP+FNVT+R+ N N  IGIYYEK S+V +  +DV LC GALP  YQPP NVTV++
Sbjct: 121 SSSPISPKFNVTVRSRNGNGKIGIYYEKGSSVDVFYNDVDLCNGALPAFYQPPNNVTVVK 180

Query: 181 VKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKIL 240
             + GS I+L+++T K      +  KTL  K+ +R P+K+KL  ++  W +  KV C + 
Sbjct: 181 TALTGSTIQLTTATRKEM----RNEKTLPFKLKIRAPVKIKLGSVK-TWTLTVKVNCDVT 240

Query: 241 VKKEMGKTKVMEEKCDHSMKLW 248
           V K    ++++  KC H + LW
Sbjct: 241 VDKLTAPSRIVSRKCSHDVDLW 257

BLAST of Csa4G269730 vs. TrEMBL
Match: D7LFJ6_ARALL (Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_481588 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.1e-48
Identity = 109/262 (41.60%), Postives = 163/262 (62.21%), Query Frame = 1

Query: 1   MADRVHPTLNPD---------SASNNPLK-SPPSATYVIQIPKDQVYRIPPPENAARFNL 60
           MA+RV+PT +P          S+   P K +PP ATYVIQ+PKDQ+YRIPPPENA R   
Sbjct: 1   MAERVYPTDSPPQSGQFSGNFSSGEFPRKPTPPPATYVIQVPKDQIYRIPPPENAHRLQQ 60

Query: 61  YTRHHHRPSPCR----RFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPN 120
            +R  +  S CR     FL  I +L++L+ I+ A+++LI +P+ P++SI   ++S I  N
Sbjct: 61  LSRKKNNRSTCRCCFCSFLAAIFILIVLAGISLAILYLIYRPEAPKYSIEGFTVSGINLN 120

Query: 121 TTS-FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMR 180
           +TS  SP FNVT+R+ N N  IG+YYEK S+V +  +DV LC G +P+ YQP +NVTV+R
Sbjct: 121 STSPISPNFNVTVRSRNGNGKIGVYYEKESSVDVYYNDVDLCNGVMPVFYQPAKNVTVVR 180

Query: 181 VKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKIL 240
           + + GS I+L+S   K   + E   KTL  K+ ++ P+K+K+  ++  W +   V C + 
Sbjct: 181 LALSGSKIQLTSGMRKEMRN-EVSKKTLPFKLKIKAPVKIKVGSVK-TWSMIVNVECDVT 240

Query: 241 VKKEMGKTKVMEEKCDHSMKLW 248
           V K    ++++  KC H + LW
Sbjct: 241 VDKLTAPSRIVSRKCSHDVDLW 260

BLAST of Csa4G269730 vs. TAIR10
Match: AT2G27080.1 (AT2G27080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 191.4 bits (485), Expect = 6.5e-49
Identity = 104/262 (39.69%), Postives = 158/262 (60.31%), Query Frame = 1

Query: 1   MADRVHPTLNPD---------SASNNPLK-SPPSATYVIQIPKDQVYRIPPPENAARFNL 60
           MA+RV+P  +P          S+   P K +PP +TYVIQ+PKDQ+YRIPPPENA RF  
Sbjct: 1   MAERVYPADSPPQSGQFSGNFSSGEFPKKPAPPPSTYVIQVPKDQIYRIPPPENAHRFEQ 60

Query: 61  YTRHHHRPSPCR----RFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPN 120
            +R     S CR     FL  + +L++L+ I+ A+++LI +P+ P++SI   S+S I  N
Sbjct: 61  LSRKKTNRSNCRCCFCSFLAAVFILIVLAGISFAVLYLIYRPEAPKYSIEGFSVSGINLN 120

Query: 121 TTS-FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMR 180
           +TS  SP FNVT+R+ N N  IG+YYEK S+V +  +DV +  G +P+ YQP +NVTV++
Sbjct: 121 STSPISPSFNVTVRSRNGNGKIGVYYEKESSVDVYYNDVDISNGVMPVFYQPAKNVTVVK 180

Query: 181 VKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKIL 240
           + + GS I+L+S   K   + E   KT+  K+ ++ P+K+K +     W +   V C + 
Sbjct: 181 LVLSGSKIQLTSGMRKEMRN-EVSKKTVPFKLKIKAPVKIK-FGSVKTWTMIVNVDCDVT 240

Query: 241 VKKEMGKTKVMEEKCDHSMKLW 248
           V K    ++++  KC H + LW
Sbjct: 241 VDKLTAPSRIVSRKCSHDVDLW 260

BLAST of Csa4G269730 vs. TAIR10
Match: AT5G21130.1 (AT5G21130.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 151.4 bits (381), Expect = 7.5e-37
Identity = 85/254 (33.46%), Postives = 136/254 (53.54%), Query Frame = 1

Query: 6   HPTLNPDSASNNPLK--------SPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHR 65
           HP+L+ + +S++            PP  TYVI++PKDQ+YR+PPPENA R+   +R    
Sbjct: 29  HPSLDTNDSSSSRYSVDSQKSRIGPPPGTYVIKLPKDQIYRVPPPENAHRYEYLSRRKTN 88

Query: 66  PSPCRRFLCF----ILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTS-FSP 125
            S CRR LC+    +L++++L+AI     +L+ QP  P+FS+  VS++ I   ++S FSP
Sbjct: 89  KSCCRRCLCYSLSALLIIIVLAAIAFGFFYLVYQPHKPQFSVSGVSVTGINLTSSSPFSP 148

Query: 126 QFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSG 185
              + +R++N    +G+ YEK +   +  +   L  G      QP  NVTV+   +KGS 
Sbjct: 149 VIRIKLRSQNVKGKLGLIYEKGNEADVFFNGTKLGNGEFTAFKQPAGNVTVIVTVLKGSS 208

Query: 186 IRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGK 245
           ++L SS+ K   + +K+GK +   + ++ P+K K+      W +   V CKI V K    
Sbjct: 209 VKLKSSSRKELTESQKKGK-VPFGLRIKAPVKFKV-GSVTTWTMTITVDCKITVDKLTAS 268

Query: 246 TKVMEEKCDHSMKL 247
             V  E C+  + L
Sbjct: 269 ATVKTENCETGLSL 280

BLAST of Csa4G269730 vs. TAIR10
Match: AT1G54540.1 (AT1G54540.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 94.7 bits (234), Expect = 8.3e-20
Identity = 65/247 (26.32%), Postives = 118/247 (47.77%), Query Frame = 1

Query: 4   RVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSPCRR 63
           ++HP L  ++         P  T ++ + +     IPPP   ++           + C +
Sbjct: 6   KIHPVLQMEANKTKTTTPAPGKTVLLPVQRP----IPPPVIPSK---------NRNMCCK 65

Query: 64  FLCFILLLLLLS----AITSALVFLILQPDLPRFSILAVSISRIKPNTT-SFSPQFNVTI 123
             C++L LL+++    AI  A+V+ +  P LP + + ++ ++ +  N   S S +F V I
Sbjct: 66  IFCWVLSLLVIALIALAIAVAVVYFVFHPKLPSYEVNSLRVTNLGINLDLSLSAEFKVEI 125

Query: 124 RAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSS 183
            A N N+ IGIYYEK   + +      LCEG +P  YQ  RNVT + V + G   +  ++
Sbjct: 126 TARNPNEKIGIYYEKGGHIGVWYDKTKLCEGPIPRFYQGHRNVTKLNVALTGRA-QYGNT 185

Query: 184 TGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEE 243
              A +  ++ G+ + + + V  P+ +KL  ++M+ +IR   +CK++V        +  +
Sbjct: 186 VLAALQQQQQTGR-VPLDLKVNAPVAIKLGNLKMK-KIRILGSCKLVVDSLSTNNNINIK 236

Query: 244 KCDHSMK 246
             D S K
Sbjct: 246 ASDCSFK 236

BLAST of Csa4G269730 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 90.5 bits (223), Expect = 1.6e-18
Identity = 67/251 (26.69%), Postives = 121/251 (48.21%), Query Frame = 1

Query: 4   RVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARF-NLYTRHHHRPSPCR 63
           +++P  +P++A+  P  +P       +       ++P  +   RF  L      R   CR
Sbjct: 6   KIYPVQDPEAATARPT-APLVPRGSSRSEHGDPSKVPLNQRPQRFVPLAPPKKRRSCCCR 65

Query: 64  RF---LCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNT-TSFSPQFNVTI 123
            F    CF+LLL++    +  +++L+ +P LP +SI  + ++R   N  +S +  FNVTI
Sbjct: 66  CFCYTFCFLLLLVVAVGASIGILYLVFKPKLPDYSIDRLQLTRFALNQDSSLTTAFNVTI 125

Query: 124 RAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSS 183
            A+N N+ IGIYYE  S +++   +  L  G+LP  YQ   N TV+ V++ G   + +S 
Sbjct: 126 TAKNPNEKIGIYYEDGSKITVWYMEHQLSNGSLPKFYQGHENTTVIYVEMTGQ-TQNASG 185

Query: 184 TGKAFEDWEKEGKTLRMKVDVRGPMKM---KLYWMEMRWRIRAKVTCKILVKKEMGKTKV 243
                E+ ++    + +++ V  P+++   KL   E+R+ +R  V    L    +   K+
Sbjct: 186 LRTTLEEQQQRTGNIPLRIRVNQPVRVKFGKLKLFEVRFLVRCGVFVDSLATNNV--IKI 245

Query: 244 MEEKCDHSMKL 247
               C   ++L
Sbjct: 246 QSSSCKFRLRL 252

BLAST of Csa4G269730 vs. TAIR10
Match: AT2G22180.1 (AT2G22180.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 84.0 bits (206), Expect = 1.5e-16
Identity = 66/254 (25.98%), Postives = 112/254 (44.09%), Query Frame = 1

Query: 9   LNPDSASNNPLKSPPSA-----TYVIQIPKDQVYRIPPPENAARFNLYTRH---HHRPSP 68
           L+P   +++P   PP +     TYV+Q+P+DQVY  PPPE+A      +++   + +   
Sbjct: 57  LSPLPTTSSPPLPPPDSIPELETYVVQVPRDQVYWTPPPEHAKYVEKRSKNPEKNKKKGC 116

Query: 69  CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA 128
            +R L F ++L++   +  A++ ++     P   + AV    + P+       F VT+RA
Sbjct: 117 SKRLLWFFIILVIFGFLLGAIILILHFAFNPTLPVFAVERLTVNPS------NFEVTLRA 176

Query: 129 ENHNKNIGIYY--EKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSS 188
           EN   N+G+ Y  EKN  VS+   +  L  G  P L Q       + VK+ G     S+ 
Sbjct: 177 ENPTSNMGVRYMMEKNGVVSLTYKNKSLGSGKFPGLSQAASGSDKVNVKLNG-----STK 236

Query: 189 TGKAFEDWEKEGKTLRMKVDVR-----GPMKMKLYWMEMRWRIRAKVTCKILVK--KEMG 246
                    K+   L + ++++     GP+K               VTC + VK   +  
Sbjct: 237 NAVVQPRGSKQPVVLMLNMELKAEYEAGPVKRNK---------EVVVTCDVKVKGLLDAK 290

BLAST of Csa4G269730 vs. NCBI nr
Match: gi|449463891|ref|XP_004149664.1| (PREDICTED: uncharacterized protein LOC101202799 [Cucumis sativus])

HSP 1 Score: 504.2 bits (1297), Expect = 1.3e-139
Identity = 247/247 (100.00%), Postives = 247/247 (100.00%), Query Frame = 1

Query: 1   MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP 60
           MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP
Sbjct: 1   MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP 60

Query: 61  CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA 120
           CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA
Sbjct: 61  CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA 120

Query: 121 ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG 180
           ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG
Sbjct: 121 ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG 180

Query: 181 KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC 240
           KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC
Sbjct: 181 KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC 240

Query: 241 DHSMKLW 248
           DHSMKLW
Sbjct: 241 DHSMKLW 247

BLAST of Csa4G269730 vs. NCBI nr
Match: gi|659097879|ref|XP_008449864.1| (PREDICTED: uncharacterized protein LOC103491613 [Cucumis melo])

HSP 1 Score: 432.2 bits (1110), Expect = 6.2e-118
Identity = 214/247 (86.64%), Postives = 229/247 (92.71%), Query Frame = 1

Query: 1   MADRVHPTLNPDSASNNPLKSPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHHHRPSP 60
           MADRVHPT+NPDSASNNP K PPSATYVIQIPKDQVYRIPPPENAARFNLYTRH+ RPS 
Sbjct: 1   MADRVHPTVNPDSASNNP-KGPPSATYVIQIPKDQVYRIPPPENAARFNLYTRHNQRPSS 60

Query: 61  CRRFLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTTSFSPQFNVTIRA 120
           CRRFLCFILLLL LSAITSAL FLILQPDLPR+SILAVSISRIK NTTS SPQ NVTIRA
Sbjct: 61  CRRFLCFILLLLFLSAITSALFFLILQPDLPRYSILAVSISRIKSNTTSISPQLNVTIRA 120

Query: 121 ENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVKVKGSGIRLSSSTG 180
           ENHNK IGIYYEKNS VS++LSDVMLCEGALPLLYQPP NVTV+ VK+KGSGIRLSSSTG
Sbjct: 121 ENHNKKIGIYYEKNSIVSVSLSDVMLCEGALPLLYQPPSNVTVIAVKMKGSGIRLSSSTG 180

Query: 181 KAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVKKEMGKTKVMEEKC 240
           KA +DWEKEG+ +R+KVDVR P++MKLYWM++RWRIR KVTCKILVKK MGKTKVMEEKC
Sbjct: 181 KALKDWEKEGR-VRLKVDVRAPIEMKLYWMKVRWRIRGKVTCKILVKKVMGKTKVMEEKC 240

Query: 241 DHSMKLW 248
           DHSMKLW
Sbjct: 241 DHSMKLW 245

BLAST of Csa4G269730 vs. NCBI nr
Match: gi|449452811|ref|XP_004144152.1| (PREDICTED: protein YLS9-like [Cucumis sativus])

HSP 1 Score: 203.4 bits (516), Expect = 4.7e-49
Identity = 109/260 (41.92%), Postives = 168/260 (64.62%), Query Frame = 1

Query: 1   MADRVHPTLN-PDSASNNPLK------SPPSATYVIQIPKDQVYRIPPPENAARFNLYTR 60
           MADRVHPT + P  ++++ L       SPP  TYVIQ+PKDQ+YR+PPPENA RF LYTR
Sbjct: 1   MADRVHPTADSPRPSTSSTLSDTTKPSSPPPGTYVIQLPKDQIYRLPPPENAHRFKLYTR 60

Query: 61  H-HHRPSPCRR----FLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTT 120
             H R + CR      L  + +L++L  IT A+ + +++P  P +SI A+SIS +   T+
Sbjct: 61  QSHRRRNRCRSCLFCLLAILAILIILLGITLAVFYFVVRPKSPNYSIDAISISGLNNLTS 120

Query: 121 S-FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVMRVK 180
           S  SP FN+++RA+N NK IGIYY   S+V +  S+  L EG LP  +QP +NV+V+R  
Sbjct: 121 SAISPVFNLSVRADNPNKKIGIYYLTGSSVRIYSSNEKLSEGVLPDFFQPSKNVSVLRAV 180

Query: 181 VKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKILVK 240
           V+G+G+ LSS       +W K+ + + +KV++  P+K+K+  ++  W+I+ KV C + V 
Sbjct: 181 VRGAGVNLSSGAKNEIIEWVKQ-RAVLLKVEIGVPIKVKIGSVK-SWKIKVKVNCDVTVD 240

Query: 241 KEMGKTKVMEEKCDHSMKLW 248
           +     K++++ CD+S+K+W
Sbjct: 241 ELTAAAKIVKKNCDYSVKIW 258

BLAST of Csa4G269730 vs. NCBI nr
Match: gi|595827610|ref|XP_007205722.1| (hypothetical protein PRUPE_ppa010043mg [Prunus persica])

HSP 1 Score: 202.6 bits (514), Expect = 8.0e-49
Identity = 108/268 (40.30%), Postives = 156/268 (58.21%), Query Frame = 1

Query: 1   MADRVHPTLNPDSASNNPLKS------------PPSATYVIQIPKDQVYRIPPPENAARF 60
           MADRVHP  +P      PL              PP  TYVIQIPKDQVYR+PPPENA+R+
Sbjct: 1   MADRVHPRDSPFHTETTPLSLSRPSPPDSEKPVPPPGTYVIQIPKDQVYRVPPPENASRY 60

Query: 61  NLYTRHHHRPSPCRRFLCFILLLL----LLSAITSALVFLILQPDLPRFSILAVSISRIK 120
             YT    R S C    C+ L LL     LSA  + + +L+++P+ P +S+ +++     
Sbjct: 61  QSYTHRKTRRSSCHCCCCWFLGLLAAIVFLSAAAAGIFYLVVRPEAPNYSVESIAFKGFN 120

Query: 121 PNTTS-----FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPR 180
             TTS      SP+ +VT+RA+N NK IGIYYE+ S+V +  SD+ LC+G LP  YQP +
Sbjct: 121 LTTTSSPPSAISPEIHVTVRAQNPNKKIGIYYERESSVKLFYSDIKLCDGVLPAFYQPSK 180

Query: 181 NVTVMRVKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAK 240
           NVT  R  + GSGI L+S+  K   D +K+GK + +++D+R P+++K+  ++  W I  K
Sbjct: 181 NVTEFRTALTGSGIELTSAVQKGLVDAQKQGK-VPLELDLRAPVRIKVGPIK-TWTITVK 240

Query: 241 VTCKILVKKEMGKTKVMEEKCDHSMKLW 248
           V C + V K      ++   CD+S+  W
Sbjct: 241 VACHLTVNKLTADANIVSRDCDYSVDPW 266

BLAST of Csa4G269730 vs. NCBI nr
Match: gi|659100419|ref|XP_008451090.1| (PREDICTED: protein YLS9-like [Cucumis melo])

HSP 1 Score: 201.8 bits (512), Expect = 1.4e-48
Identity = 110/263 (41.83%), Postives = 171/263 (65.02%), Query Frame = 1

Query: 1   MADRVHPTLN-PDSASNNPLK------SPPSATYVIQIPKDQVYRIPPPENAARFNLYTR 60
           MADRVHPT++ P  ++++ L       SPP  TYVIQ+PKDQ+YR+PPPENA RF LYTR
Sbjct: 1   MADRVHPTVDSPRPSTSSTLSDTTKPPSPPPGTYVIQLPKDQIYRVPPPENAHRFQLYTR 60

Query: 61  HHHRP-SPCRR----FLCFILLLLLLSAITSALVFLILQPDLPRFSILAVSISRIKPNTT 120
            + R  +PCR      L  ++LL++L  IT A+ +L+++P  P +SI A+S+S +   T+
Sbjct: 61  QNRRRRNPCRSCLFCLLAILILLIILLGITVAVFYLVVRPKSPNYSIDAISVSGLNLLTS 120

Query: 121 S----FSPQFNVTIRAENHNKNIGIYYEKNSTVSMNLSDVMLCEGALPLLYQPPRNVTVM 180
           S     SP FN+T+RA+N NK IGIYY   S+V +  S+  L EG LP  +QP +NV+V+
Sbjct: 121 SSSSAISPLFNLTVRADNPNKKIGIYYLTGSSVRIYFSNEKLSEGVLPDFFQPAKNVSVL 180

Query: 181 RVKVKGSGIRLSSSTGKAFEDWEKEGKTLRMKVDVRGPMKMKLYWMEMRWRIRAKVTCKI 240
           R  V+G+G+ LSS       +  K+ + + +KV++  P+K+K+  ++  W++R KV C +
Sbjct: 181 RSVVRGTGVNLSSGAKNGLIESVKQ-RVVVLKVEIGVPIKVKVGAVK-SWKMRVKVNCDV 240

Query: 241 LVKKEMGKTKVMEEKCDHSMKLW 248
            V +     K++++ CD+S+K+W
Sbjct: 241 TVDELTTAAKIVKKNCDYSVKIW 261

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NHL12_ARATH5.8e-0727.81NDR1/HIN1-like protein 12 OS=Arabidopsis thaliana GN=NHL12 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L0D0_CUCSA9.0e-140100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_4G269730 PE=4 SV=1[more]
A0A0A0LX01_CUCSA3.3e-4941.92Uncharacterized protein OS=Cucumis sativus GN=Csa_1G601000 PE=4 SV=1[more]
M5W0N4_PRUPE5.6e-4940.30Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010043mg PE=4 SV=1[more]
A0A087GRC4_ARAAL2.1e-4841.60Uncharacterized protein OS=Arabis alpina GN=AALP_AA6G240400 PE=4 SV=1[more]
D7LFJ6_ARALL2.1e-4841.60Putative uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA... [more]
Match NameE-valueIdentityDescription
AT2G27080.16.5e-4939.69 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT5G21130.17.5e-3733.46 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT1G54540.18.3e-2026.32 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT1G65690.11.6e-1826.69 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f... [more]
AT2G22180.11.5e-1625.98 hydroxyproline-rich glycoprotein family protein[more]
Match NameE-valueIdentityDescription
gi|449463891|ref|XP_004149664.1|1.3e-139100.00PREDICTED: uncharacterized protein LOC101202799 [Cucumis sativus][more]
gi|659097879|ref|XP_008449864.1|6.2e-11886.64PREDICTED: uncharacterized protein LOC103491613 [Cucumis melo][more]
gi|449452811|ref|XP_004144152.1|4.7e-4941.92PREDICTED: protein YLS9-like [Cucumis sativus][more]
gi|595827610|ref|XP_007205722.1|8.0e-4940.30hypothetical protein PRUPE_ppa010043mg [Prunus persica][more]
gi|659100419|ref|XP_008451090.1|1.4e-4841.83PREDICTED: protein YLS9-like [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004864LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa4G269730.1Csa4G269730.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004864Late embryogenesis abundant protein, LEA-14PFAMPF03168LEA_2coord: 117..222
score: 1.
NoneNo IPR availablePANTHERPTHR31852FAMILY NOT NAMEDcoord: 22..247
score: 2.1
NoneNo IPR availablePANTHERPTHR31852:SF0LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEINcoord: 22..247
score: 2.1

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Csa4G269730Carg02772Silver-seed gourdcarcuB0935
The following gene(s) are paralogous to this gene:

None