Csa7G047380 (gene) Cucumber (Chinese Long) v2

Name: Csa7G047380
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein; contains IPR004864 (Late embryogenesis abundant protein, LEA-14)
Location: Chr7 : 2895166 .. 2896098 (+)
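
The location above can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which would give a span of 933 bp.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr7 : 2895166 .. 2896098 (+)'."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr7 : 2895166 .. 2896098 (+)")
print(chrom, strand, span_length(start, end))
```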
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AGAAAAAAAAATACAACGAACAAACAGACAACAGCCTCTGTTTTTCCCCTTTCCCCTCATTTTGTTTAAACAAAGCTATGCCCATCAAACGGTCAACGCCCTTCAACTTCATTTTCAAATATATCTCTTCTCCTTTTTCCCCATTCTCATGGAAGCTCCACCTAAACCAAAATATGTTATGCTTTCCGATGACCACCACCAAGCCAGCCTCCGCCCTCCCCCCTATCGCCGCAACGTCCCTCGTTACCATTCCAAAGCCAACGGCGGAGGTGGTGGCGGTGTTGGTTGTTGTCTAAAATGCATCTGTTGTTGTTATTGTTTTATCTTCTTCCTTATCTTTGCTCTATTCGGCCTCGGCTATTTCCTTTTCTATTATTACAACCCCCAAGTCCCTTCTTACAAAGTATCTGATTTCAGTGTCCATGCCTTCAATGTCAAATCCGACTTCAGTTTATACACTGAATTCATCGTTATCGTTAAAGCCGATAACCCAAATGCAAACATTGGTTTCGTTTATGGGAAAGATAGCTCTGTTTCTGTTATGTACTCTAAATCGGAGCTCTGTTCCGGGCAGATTCCAAATTTCCGGCAGCCGTCGAAGAATGTAACGGATATAAGTATTTTGTTGAGTGGGAATAGTGAATTTGGAAGTGGGTTACAAGAAGCATTGATGCAGAACAGACATAGTGGGAAAATTCCGTTGCTTGTTAAGGTGAAAGTGCCGGTGACGGTGGTGATTGGGAGCTTGTCGTTGAAGAAGGTCAATGTGTTTGTTAATTGTTCATTGGTGGTTGATAAGTTGTCGCCGAATAAGAAAGTTGAGATTTTGTCGAGTAATTATACTTATGGTGCTTCTTTGTGAATCTAAAGCTTTACCCATAGTTTTGTTGCTCTTTCGTTTTTGTGCTATTTAATGGGAATTCCTTTTTTTTT

mRNA sequence

ATGGAAGCTCCACCTAAACCAAAATATGTTATGCTTTCCGATGACCACCACCAAGCCAGCCTCCGCCCTCCCCCCTATCGCCGCAACGTCCCTCGTTACCATTCCAAAGCCAACGGCGGAGGTGGTGGCGGTGTTGGTTGTTGTCTAAAATGCATCTGTTGTTGTTATTGTTTTATCTTCTTCCTTATCTTTGCTCTATTCGGCCTCGGCTATTTCCTTTTCTATTATTACAACCCCCAAGTCCCTTCTTACAAAGTATCTGATTTCAGTGTCCATGCCTTCAATGTCAAATCCGACTTCAGTTTATACACTGAATTCATCGTTATCGTTAAAGCCGATAACCCAAATGCAAACATTGGTTTCGTTTATGGGAAAGATAGCTCTGTTTCTGTTATGTACTCTAAATCGGAGCTCTGTTCCGGGCAGATTCCAAATTTCCGGCAGCCGTCGAAGAATGTAACGGATATAAGTATTTTGTTGAGTGGGAATAGTGAATTTGGAAGTGGGTTACAAGAAGCATTGATGCAGAACAGACATAGTGGGAAAATTCCGTTGCTTGTTAAGGTGAAAGTGCCGGTGACGGTGGTGATTGGGAGCTTGTCGTTGAAGAAGGTCAATGTGTTTGTTAATTGTTCATTGGTGGTTGATAAGTTGTCGCCGAATAAGAAAGTTGAGATTTTGTCGAGTAATTATACTTATGGTGCTTCTTTGTGA

Coding sequence (CDS)

ATGGAAGCTCCACCTAAACCAAAATATGTTATGCTTTCCGATGACCACCACCAAGCCAGCCTCCGCCCTCCCCCCTATCGCCGCAACGTCCCTCGTTACCATTCCAAAGCCAACGGCGGAGGTGGTGGCGGTGTTGGTTGTTGTCTAAAATGCATCTGTTGTTGTTATTGTTTTATCTTCTTCCTTATCTTTGCTCTATTCGGCCTCGGCTATTTCCTTTTCTATTATTACAACCCCCAAGTCCCTTCTTACAAAGTATCTGATTTCAGTGTCCATGCCTTCAATGTCAAATCCGACTTCAGTTTATACACTGAATTCATCGTTATCGTTAAAGCCGATAACCCAAATGCAAACATTGGTTTCGTTTATGGGAAAGATAGCTCTGTTTCTGTTATGTACTCTAAATCGGAGCTCTGTTCCGGGCAGATTCCAAATTTCCGGCAGCCGTCGAAGAATGTAACGGATATAAGTATTTTGTTGAGTGGGAATAGTGAATTTGGAAGTGGGTTACAAGAAGCATTGATGCAGAACAGACATAGTGGGAAAATTCCGTTGCTTGTTAAGGTGAAAGTGCCGGTGACGGTGGTGATTGGGAGCTTGTCGTTGAAGAAGGTCAATGTGTTTGTTAATTGTTCATTGGTGGTTGATAAGTTGTCGCCGAATAAGAAAGTTGAGATTTTGTCGAGTAATTATACTTATGGTGCTTCTTTGTGA

Protein sequence

MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL*
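
The relationship between the coding sequence and the protein sequence above can be checked by translation. The following is a minimal sketch that assumes the standard genetic code (an assumption; the record does not state which translation table was used). The `translate` helper is hypothetical, and only the first ten and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGAAGCTCCACCTAAACCAAAATATGTT"))
print(translate("GGTGCTTCTTTGTGA"))
```

Under that assumption the two fragments translate to MEAPPKPKYV and GASL*, which agree with the start and end of the protein sequence shown above.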
BLAST of Csa7G047380 vs. Swiss-Prot
Match: NHL3_ARATH (NDR1/HIN1-Like protein 3 OS=Arabidopsis thaliana GN=NHL3 PE=1 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 6.2e-06
Identity = 48/196 (24.49%), Positives = 76/196 (38.78%), Query Frame = 1

Query: 31  PRYHSKANGGGGGGVGCCLKCICCCYCFIFFLIF-------ALFGLGYFLFY-YYNPQVP 90
           P+  S ++G  GGG GC   C+ CC C I  +IF        L G+   + +  + P   
Sbjct: 16  PKKVSHSHGRRGGGCGCLGDCLGCCGCCILSVIFNILITIAVLLGIAALIIWLIFRPNAI 75

Query: 91  SYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGFVYGKDSSVSVMYSKSEL-CSG 150
            + V+D  +  F +    +L     +     NPN  IG VY  +  V   Y       S 
Sbjct: 76  KFHVTDAKLTEFTLDPTNNLRYNLDLNFTIRNPNRRIG-VYYDEIEVRGYYGDQRFGMSN 135

Query: 151 QIPNFRQPSKNVTDISILLSGNS--EFGSGLQEALMQNRHSGKIPLLVKVKVPVTVVIGS 210
            I  F Q  KN T +   L G        G ++ L ++ +S    +  K+++ +    G 
Sbjct: 136 NISKFYQGHKNTTVVGTKLVGQQLVLLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGL 195

Query: 211 LSLKKVNVFVNCSLVV 216
           +   +    + C L V
Sbjct: 196 IKSWRFKPKIKCDLKV 210
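
The percentages in an HSP summary line are the match counts divided by the alignment length. As a small sketch (the `hsp_fractions` helper is hypothetical and not part of any BLAST tool), the fractions can be pulled out of the summary line and recomputed:

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for 'Label = n/m' fields."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        n, d = int(num), int(den)
        stats[label] = (n, d, round(100 * n / d, 2))
    return stats

line = "Identity = 48/196 (24.49%), Positives = 76/196 (38.78%), Query Frame = 1"
for label, (n, d, pct) in hsp_fractions(line).items():
    print(f"{label}: {n}/{d} = {pct}%")
```

For the NHL3_ARATH hit this reproduces 24.49% identity and 38.78% positives over 196 aligned columns.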

BLAST of Csa7G047380 vs. TrEMBL
Match: A0A0A0K2B4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G047380 PE=4 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 4.9e-135
Identity = 237/237 (100.00%), Positives = 237/237 (100.00%), Query Frame = 1

Query: 1   MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIF 60
           MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIF
Sbjct: 1   MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIF 60

Query: 61  FLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIG 120
           FLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIG
Sbjct: 61  FLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIG 120

Query: 121 FVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHS 180
           FVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHS
Sbjct: 121 FVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHS 180

Query: 181 GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL 238
           GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL
Sbjct: 181 GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL 237

BLAST of Csa7G047380 vs. TrEMBL
Match: B9RW35_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1175710 PE=4 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 7.4e-75
Identity = 142/239 (59.41%), Positives = 174/239 (72.80%), Query Frame = 1

Query: 1   MEAPP--KPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCF 60
           ME PP  +PKYVML+ +H   +LRPPP RRN+PRYHS  NG  GG    CL+C+CCC+CF
Sbjct: 1   MENPPPYQPKYVMLNSNH-ATNLRPPPQRRNIPRYHSNNNGKSGGNG--CLRCLCCCFCF 60

Query: 61  IFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNAN 120
              L   L    + L+    P++P Y V  F VHAFNV+ DFSLYTEF+V VK+DNPN +
Sbjct: 61  WLLLFIFLAAALFALYSALQPEIPHYNVDRFDVHAFNVQPDFSLYTEFVVTVKSDNPNMH 120

Query: 121 IGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNR 180
           IGF YGK+SSV V Y  S LCSG IP F QP  N++ I I+L G SEFGSGLQEALMQNR
Sbjct: 121 IGFDYGKESSVVVTYRDSPLCSGSIPTFHQPHHNISLIPIVLKGKSEFGSGLQEALMQNR 180

Query: 181 HSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL 238
           ++G+IPLLV+VK PV++V+  L L++V V +NCSLVVD LSPNKK +ILSS+Y YG  L
Sbjct: 181 NTGRIPLLVEVKAPVSIVVQELPLRQVTVLINCSLVVDNLSPNKKAKILSSSYQYGVEL 236

BLAST of Csa7G047380 vs. TrEMBL
Match: A0A059A096_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K00959 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 8.2e-74
Identity = 139/232 (59.91%), Positives = 173/232 (74.57%), Query Frame = 1

Query: 1   MEAPPK--PKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCF 60
           ME PP   PKYVML D   Q S+RPPPYRRN+PRYHS  +  GGG   CC++CICC  C 
Sbjct: 1   MEPPPPYPPKYVMLQDS--QGSIRPPPYRRNIPRYHSNHHKSGGGS--CCMRCICCFCCS 60

Query: 61  IFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNAN 120
           +F LIF +  L ++ +  + P+VPSY V  F+ +AFN++ DFSLYTEF+V VKADNPN+N
Sbjct: 61  LFILIFVVATLAFYFYAVFQPRVPSYTVDSFATNAFNMQPDFSLYTEFVVTVKADNPNSN 120

Query: 121 IGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNR 180
           IGF YGKDSSV V Y  S LC+GQ+P F Q  KN+T I +LL G SEFGSGLQEALM+NR
Sbjct: 121 IGFNYGKDSSVMVAYQDSTLCNGQLPAFHQGHKNITMIKVLLKGKSEFGSGLQEALMENR 180

Query: 181 HSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSN 231
           H+G+IPL V VKVPV +VIG+  L++  V V C+LVVD LSPNKK +I+S++
Sbjct: 181 HTGRIPLNVIVKVPVGIVIGTFPLRRFTVRVICALVVDNLSPNKKTQIVSNS 228

BLAST of Csa7G047380 vs. TrEMBL
Match: U5FEV5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s13360g PE=4 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 8.8e-68
Identity = 126/220 (57.27%), Positives = 163/220 (74.09%), Query Frame = 1

Query: 14  DDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIFFLIFALFGLGYFL 73
           ++ + +S+RPPP RRN+PRYHS  +   G    CCLKC+CCC+CF   +I  L  L   L
Sbjct: 3   NNSNSSSVRPPPQRRNIPRYHSNHHHSHGH---CCLKCVCCCFCFSIVVIIVLASLLSVL 62

Query: 74  FYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGFVYGKDSSVSVMY 133
           +   +P++P Y +  F V+AFN+  DFSLYTEF+V+VKA+NPN  I F YGKDSSV V Y
Sbjct: 63  YVTLDPKMPQYNIESFEVNAFNMAPDFSLYTEFVVVVKANNPNKEIAFTYGKDSSVVVAY 122

Query: 134 SKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHSGKIPLLVKVKVPV 193
           S S LCSG++P F QP +N T I ++L+G SEFGSGLQEALM NR +G+IPLLV VK P+
Sbjct: 123 SDSTLCSGKLPAFHQPFENTTMIRVVLTGKSEFGSGLQEALMDNRETGRIPLLVIVKAPI 182

Query: 194 TVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTY 234
           +V++ SL+L++V V VNCSLVVD L+PNK+V ILSS YTY
Sbjct: 183 SVMVKSLALRQVMVNVNCSLVVDNLAPNKRVRILSSTYTY 219

BLAST of Csa7G047380 vs. TrEMBL
Match: B9I5K6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s13940g PE=4 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 2.2e-66
Identity = 129/231 (55.84%), Positives = 163/231 (70.56%), Query Frame = 1

Query: 6   KPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIFFLIFA 65
           +P YVML+ ++  +S+RPPP RRN+PRY S  +   GG    CLKC+C C+CF+  +I  
Sbjct: 8   QPNYVMLNYNN-SSSVRPPPQRRNIPRYQSNHHHSHGG----CLKCVCFCFCFLIVMIIL 67

Query: 66  LFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGFVYGK 125
           L  +  F++   NP++P Y V+ F V+AFN+  DFSLYTEF V VKA+NPN  I F+YGK
Sbjct: 68  LASVIAFIYMTLNPKMPEYNVASFDVNAFNMAPDFSLYTEFAVTVKANNPNTGISFIYGK 127

Query: 126 DSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHSGKIPL 185
           +SSV V YS S LCSG++P F QP  N T I ++L G SEFGSGLQE LM NR +GKIPL
Sbjct: 128 ESSVVVAYSDSTLCSGKLPAFHQPGVNTTMIQVVLKGKSEFGSGLQEVLMDNRETGKIPL 187

Query: 186 LVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGAS 237
           LV V  PV+VV+ S  L++V V VNCSLVVD LSPNK+V ILSS Y Y  +
Sbjct: 188 LVMVNAPVSVVLKSFPLREVIVNVNCSLVVDNLSPNKRVRILSSEYAYAVN 233

BLAST of Csa7G047380 vs. TAIR10
Match: AT1G54540.1 (AT1G54540.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 141.4 bits (355), Expect = 7.4e-34
Identity = 67/190 (35.26%), Positives = 109/190 (57.37%), Query Frame = 1

Query: 48  CLKCICCCYCFIFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFI 107
           C K  C     +   + AL      +++ ++P++PSY+V+   V    +  D SL  EF 
Sbjct: 50  CCKIFCWVLSLLVIALIALAIAVAVVYFVFHPKLPSYEVNSLRVTNLGINLDLSLSAEFK 109

Query: 108 VIVKADNPNANIGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFG 167
           V + A NPN  IG  Y K   + V Y K++LC G IP F Q  +NVT +++ L+G +++G
Sbjct: 110 VEITARNPNEKIGIYYEKGGHIGVWYDKTKLCEGPIPRFYQGHRNVTKLNVALTGRAQYG 169

Query: 168 SGLQEALMQNRHSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEIL 227
           + +  AL Q + +G++PL +KV  PV + +G+L +KK+ +  +C LVVD LS N  + I 
Sbjct: 170 NTVLAALQQQQQTGRVPLDLKVNAPVAIKLGNLKMKKIRILGSCKLVVDSLSTNNNINIK 229

Query: 228 SSNYTYGASL 238
           +S+ ++ A L
Sbjct: 230 ASDCSFKAKL 239

BLAST of Csa7G047380 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 126.7 bits (317), Expect = 1.9e-29
Identity = 64/185 (34.59%), Positives = 99/185 (53.51%), Query Frame = 1

Query: 47  CCLKCICCCYCFIFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEF 106
           CC +C C  +CF+  L+ A+      L+  + P++P Y +    +  F +  D SL T F
Sbjct: 61  CCCRCFCYTFCFLLLLVVAVGASIGILYLVFKPKLPDYSIDRLQLTRFALNQDSSLTTAF 120

Query: 107 IVIVKADNPNANIGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEF 166
            V + A NPN  IG  Y   S ++V Y + +L +G +P F Q  +N T I + ++G ++ 
Sbjct: 121 NVTITAKNPNEKIGIYYEDGSKITVWYMEHQLSNGSLPKFYQGHENTTVIYVEMTGQTQN 180

Query: 167 GSGLQEAL-MQNRHSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVE 226
            SGL+  L  Q + +G IPL ++V  PV V  G L L +V   V C + VD L+ N  ++
Sbjct: 181 ASGLRTTLEEQQQRTGNIPLRIRVNQPVRVKFGKLKLFEVRFLVRCGVFVDSLATNNVIK 240

Query: 227 ILSSN 231
           I SS+
Sbjct: 241 IQSSS 245

BLAST of Csa7G047380 vs. TAIR10
Match: AT5G36970.1 (AT5G36970.1 NDR1/HIN1-like 25)

HSP 1 Score: 122.5 bits (306), Expect = 3.6e-28
Identity = 63/192 (32.81%), Positives = 97/192 (50.52%), Query Frame = 1

Query: 43  GGVGCCLKCICCCYCFIFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSL 102
           G   C  +C+C     +F LI  +  +   L+  + P+ P Y +    +  F +  D SL
Sbjct: 53  GSRSCWCRCVCYTLLVLFLLIVIVGAIVGILYLVFRPKFPDYNIDRLQLTRFQLNQDLSL 112

Query: 103 YTEFIVIVKADNPNANIGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSG 162
            T F V + A NPN  IG  Y   S +SV+Y ++ + +G +P F Q  +N T I + ++G
Sbjct: 113 STAFNVTITAKNPNEKIGIYYEDGSKISVLYMQTRISNGSLPKFYQGHENTTIILVEMTG 172

Query: 163 NSEFGSGLQEALM-QNRHSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPN 222
            ++  + L   L  Q R +G IPL ++V  PV + +G L L KV   V C + VD L+ N
Sbjct: 173 FTQNATSLMTTLQEQQRLTGSIPLRIRVTQPVRIKLGKLKLMKVRFLVRCGVSVDSLAAN 232

Query: 223 KKVEILSSNYTY 234
             + + SSN  Y
Sbjct: 233 SVIRVRSSNCKY 244

BLAST of Csa7G047380 vs. TAIR10
Match: AT2G27080.1 (AT2G27080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 114.0 bits (284), Expect = 1.3e-25
Identity = 65/224 (29.02%), Positives = 107/224 (47.77%), Query Frame = 1

Query: 3   APPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIFFL 62
           APP   YV+         + PP       +   K             +C  C +    F+
Sbjct: 31  APPPSTYVIQVPKDQIYRIPPPENAHRFEQLSRKKTNRSN------CRCCFCSFLAAVFI 90

Query: 63  IFALFGLGYFLFYY-YNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGF 122
           +  L G+ + + Y  Y P+ P Y +  FSV   N+ S   +   F V V++ N N  IG 
Sbjct: 91  LIVLAGISFAVLYLIYRPEAPKYSIEGFSVSGINLNSTSPISPSFNVTVRSRNGNGKIGV 150

Query: 123 VYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNS-EFGSGLQEALMQNRHS 182
            Y K+SSV V Y+  ++ +G +P F QP+KNVT + ++LSG+  +  SG+++ +      
Sbjct: 151 YYEKESSVDVYYNDVDISNGVMPVFYQPAKNVTVVKLVLSGSKIQLTSGMRKEMRNEVSK 210

Query: 183 GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKV 225
             +P  +K+K PV +  GS+    + V V+C + VDKL+   ++
Sbjct: 211 KTVPFKLKIKAPVKIKFGSVKTWTMIVNVDCDVTVDKLTAPSRI 248

BLAST of Csa7G047380 vs. TAIR10
Match: AT1G17620.1 (AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 67.4 bits (163), Expect = 1.4e-11
Identity = 53/206 (25.73%), Positives = 85/206 (41.26%), Query Frame = 1

Query: 17  HQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIFFLIFALFGLGYFLFYY 76
           ++ + RPP  RR     H++         GCC +C C     I  L+  +      ++  
Sbjct: 38  NRPAYRPPAGRRRTS--HTR---------GCCCRCCCWTIFVIILLLLIVAAASAVVYLI 97

Query: 77  YNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGFVYGKDSSVSVMYSKS 136
           Y PQ PS+ VS+  +   N  S   L T   + V A NPN N+GF+Y  D +   +Y  S
Sbjct: 98  YRPQRPSFTVSELKISTLNFTSAVRLTTAISLSVIARNPNKNVGFIY--DVTDITLYKAS 157

Query: 137 E-------LCSGQIPNFRQPSKNVTDISILLSGN----SEFGSGLQEALMQNRHSGKIPL 196
                   +  G I  F    KN T +   +        E  +G  +  ++ + +  I +
Sbjct: 158 TGGDDDVVIGKGTIAAFSHGKKNTTTLRSTIGSPPDELDEISAGKLKGDLKAKKAVAIKI 217

Query: 197 LVKVKVPVTVVIGSLSLKKVNVFVNC 212
           ++  KV V   +G+L   K  + V C
Sbjct: 218 VLNSKVKVK--MGALKTPKSGIRVTC 228

BLAST of Csa7G047380 vs. NCBI nr
Match: gi|778724056|ref|XP_011658744.1| (PREDICTED: uncharacterized protein LOC105436081 [Cucumis sativus])

HSP 1 Score: 488.4 bits (1256), Expect = 7.0e-135
Identity = 237/237 (100.00%), Positives = 237/237 (100.00%), Query Frame = 1

Query: 1   MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIF 60
           MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIF
Sbjct: 1   MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIF 60

Query: 61  FLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIG 120
           FLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIG
Sbjct: 61  FLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIG 120

Query: 121 FVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHS 180
           FVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHS
Sbjct: 121 FVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHS 180

Query: 181 GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL 238
           GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL
Sbjct: 181 GKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL 237

BLAST of Csa7G047380 vs. NCBI nr
Match: gi|659110639|ref|XP_008455332.1| (PREDICTED: uncharacterized protein LOC103495521 [Cucumis melo])

HSP 1 Score: 446.4 bits (1147), Expect = 3.1e-122
Identity = 217/237 (91.56%), Positives = 227/237 (95.78%), Query Frame = 1

Query: 1   MEAPPKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKAN-GGGGGGVGCCLKCICCCYCFI 60
           MEAPPKPKYVMLSD+HHQ SLRPPPYRRNVPRY SKA+ GGGGGGVGCCLKCICC YCFI
Sbjct: 1   MEAPPKPKYVMLSDNHHQTSLRPPPYRRNVPRYQSKAHGGGGGGGVGCCLKCICCFYCFI 60

Query: 61  FFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANI 120
           FFLIFALFG GYFL+YYY+PQ+PSYKVS+FSVHAFNVK DFSLYTEFIVIVKADNPN NI
Sbjct: 61  FFLIFALFGFGYFLYYYYDPQIPSYKVSNFSVHAFNVKPDFSLYTEFIVIVKADNPNQNI 120

Query: 121 GFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRH 180
           GF+YGK+SSVSVMYSKSELCSG+IPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRH
Sbjct: 121 GFIYGKNSSVSVMYSKSELCSGKIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRH 180

Query: 181 SGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGAS 237
           SGKIPLLV+VKVPVTVVIGSLSLKKVNV VNCSLVVD LSPNKKV ILSSNYTYGAS
Sbjct: 181 SGKIPLLVEVKVPVTVVIGSLSLKKVNVLVNCSLVVDNLSPNKKVGILSSNYTYGAS 237

BLAST of Csa7G047380 vs. NCBI nr
Match: gi|255553827|ref|XP_002517954.1| (PREDICTED: uncharacterized protein LOC8262102 [Ricinus communis])

HSP 1 Score: 288.5 bits (737), Expect = 1.1e-74
Identity = 142/239 (59.41%), Positives = 174/239 (72.80%), Query Frame = 1

Query: 1   MEAPP--KPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCF 60
           ME PP  +PKYVML+ +H   +LRPPP RRN+PRYHS  NG  GG    CL+C+CCC+CF
Sbjct: 1   MENPPPYQPKYVMLNSNH-ATNLRPPPQRRNIPRYHSNNNGKSGGNG--CLRCLCCCFCF 60

Query: 61  IFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNAN 120
              L   L    + L+    P++P Y V  F VHAFNV+ DFSLYTEF+V VK+DNPN +
Sbjct: 61  WLLLFIFLAAALFALYSALQPEIPHYNVDRFDVHAFNVQPDFSLYTEFVVTVKSDNPNMH 120

Query: 121 IGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNR 180
           IGF YGK+SSV V Y  S LCSG IP F QP  N++ I I+L G SEFGSGLQEALMQNR
Sbjct: 121 IGFDYGKESSVVVTYRDSPLCSGSIPTFHQPHHNISLIPIVLKGKSEFGSGLQEALMQNR 180

Query: 181 HSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNYTYGASL 238
           ++G+IPLLV+VK PV++V+  L L++V V +NCSLVVD LSPNKK +ILSS+Y YG  L
Sbjct: 181 NTGRIPLLVEVKAPVSIVVQELPLRQVTVLINCSLVVDNLSPNKKAKILSSSYQYGVEL 236

BLAST of Csa7G047380 vs. NCBI nr
Match: gi|702501996|ref|XP_010038643.1| (PREDICTED: uncharacterized protein LOC104427219 [Eucalyptus grandis])

HSP 1 Score: 285.0 bits (728), Expect = 1.2e-73
Identity = 139/232 (59.91%), Positives = 173/232 (74.57%), Query Frame = 1

Query: 1   MEAPPK--PKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCF 60
           ME PP   PKYVML D   Q S+RPPPYRRN+PRYHS  +  GGG   CC++CICC  C 
Sbjct: 1   MEPPPPYPPKYVMLQDS--QGSIRPPPYRRNIPRYHSNHHKSGGGS--CCMRCICCFCCS 60

Query: 61  IFFLIFALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNAN 120
           +F LIF +  L ++ +  + P+VPSY V  F+ +AFN++ DFSLYTEF+V VKADNPN+N
Sbjct: 61  LFILIFVVATLAFYFYAVFQPRVPSYTVDSFATNAFNMQPDFSLYTEFVVTVKADNPNSN 120

Query: 121 IGFVYGKDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNR 180
           IGF YGKDSSV V Y  S LC+GQ+P F Q  KN+T I +LL G SEFGSGLQEALM+NR
Sbjct: 121 IGFNYGKDSSVMVAYQDSTLCNGQLPAFHQGHKNITMIKVLLKGKSEFGSGLQEALMENR 180

Query: 181 HSGKIPLLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSN 231
           H+G+IPL V VKVPV +VIG+  L++  V V C+LVVD LSPNKK +I+S++
Sbjct: 181 HTGRIPLNVIVKVPVGIVIGTFPLRRFTVRVICALVVDNLSPNKKTQIVSNS 228

BLAST of Csa7G047380 vs. NCBI nr
Match: gi|731409878|ref|XP_010657352.1| (PREDICTED: uncharacterized protein LOC104880896 [Vitis vinifera])

HSP 1 Score: 281.2 bits (718), Expect = 1.7e-72
Identity = 133/227 (58.59%), Positives = 171/227 (75.33%), Query Frame = 1

Query: 5   PKPKYVMLSDDHHQASLRPPPYRRNVPRYHSKANGGGGGGVGCCLKCICCCYCFIFFLIF 64
           P  KY ML     Q+SL PPPYRRNVPRYHS  +  GGG    CLKCICCCYCF+  LIF
Sbjct: 6   PPQKYAMLEQ---QSSLHPPPYRRNVPRYHSGHHKSGGG----CLKCICCCYCFLIILIF 65

Query: 65  ALFGLGYFLFYYYNPQVPSYKVSDFSVHAFNVKSDFSLYTEFIVIVKADNPNANIGFVYG 124
            L G+ ++ +  + P+VPSY+V    V AF+++ DFSL TEF+V VKADNPN +IGF+YG
Sbjct: 66  LLAGITFYFYTVFQPKVPSYQVEHLDVKAFDMQMDFSLNTEFLVTVKADNPNQHIGFIYG 125

Query: 125 KDSSVSVMYSKSELCSGQIPNFRQPSKNVTDISILLSGNSEFGSGLQEALMQNRHSGKIP 184
           KDSS  VMYS S+LCSG++P F+Q  KN+T + +++ G SEFGSGLQ+AL++NR +GKIP
Sbjct: 126 KDSSAIVMYSDSQLCSGRLPAFQQGPKNITLMKVVMKGKSEFGSGLQQALIENRENGKIP 185

Query: 185 LLVKVKVPVTVVIGSLSLKKVNVFVNCSLVVDKLSPNKKVEILSSNY 232
           LL+KV VPV VV+GS+ +++  V VNCSLV+D L+P KKV ILS+ Y
Sbjct: 186 LLIKVVVPVRVVVGSVQMRQFKVLVNCSLVIDNLAPKKKVRILSTKY 225

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
NHL3_ARATH    6.2e-06    24.49           NDR1/HIN1-Like protein 3 OS=Arabidopsis thaliana GN=NHL3 PE=1 SV=1

Match Name          E-value     Identity (%)    Description
A0A0A0K2B4_CUCSA    4.9e-135    100.00          Uncharacterized protein OS=Cucumis sativus GN=Csa_7G047380 PE=4 SV=1
B9RW35_RICCO        7.4e-75     59.41           Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1175710 PE=4 SV=1
A0A059A096_EUCGR    8.2e-74     59.91           Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_K00959 PE=4 SV=1
U5FEV5_POPTR        8.8e-68     57.27           Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s13360g PE=4 SV=1
B9I5K6_POPTR        2.2e-66     55.84           Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s13940g PE=4 SV=1

Match Name     E-value    Identity (%)    Description
AT1G54540.1    7.4e-34    35.26           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G65690.1    1.9e-29    34.59           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G36970.1    3.6e-28    32.81           NDR1/HIN1-like 25
AT2G27080.1    1.3e-25    29.02           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G17620.1    1.4e-11    25.73           Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

Match Name                          E-value     Identity (%)    Description
gi|778724056|ref|XP_011658744.1|    7.0e-135    100.00          PREDICTED: uncharacterized protein LOC105436081 [Cucumis sativus]
gi|659110639|ref|XP_008455332.1|    3.1e-122    91.56           PREDICTED: uncharacterized protein LOC103495521 [Cucumis melo]
gi|255553827|ref|XP_002517954.1|    1.1e-74     59.41           PREDICTED: uncharacterized protein LOC8262102 [Ricinus communis]
gi|702501996|ref|XP_010038643.1|    1.2e-73     59.91           PREDICTED: uncharacterized protein LOC104427219 [Eucalyptus grandis]
gi|731409878|ref|XP_010657352.1|    1.7e-72     58.59           PREDICTED: uncharacterized protein LOC104880896 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR004864    LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0015031        protein transport
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0016021        integral component of membrane
molecular_function    GO:0003674        molecular_function
This gene is associated with the following unigenes:
Unigene Name    Analysis Name                          Sequence type in Unigene
CU135011        cucumber EST collection version 3.0    transcribed_cluster
CU174992        cucumber EST collection version 3.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Csa7G047380.1    Csa7G047380.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
CU135011        CU135011       transcribed_cluster
CU174992        CU174992       transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term     IPR Description                                Source     Source Term       Source Description     Alignment
IPR004864    Late embryogenesis abundant protein, LEA-14    PFAM       PF03168           LEA_2                  coord: 110..211; score: 1.
None         No IPR available                               PANTHER    PTHR31852         FAMILY NOT NAMED       coord: 20..225; score: 1.7
None         No IPR available                               PANTHER    PTHR31852:SF26    SUBFAMILY NOT NAMED    coord: 20..225; score: 1.7

The following gene(s) are paralogous to this gene:

None