Cp4.1LG03g16230 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g16230
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative
Location: Cp4.1LG03 : 13344038 .. 13344775 (-)
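The location field above can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its parts.

    Assumption: START and END are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("Cp4.1LG03 : 13344038 .. 13344775 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 738 bp on the minus strand, which is a whole number of codons (738 = 3 x 246).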
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGAGCCGCCATTGAAGCCTGTACTCCAGAAGCCTCCGGGGTATAAAGATCCAAATCTCACTGCTCCGCCAGTGCCTAGACCACCGGTGAGGAAACTGATTCTACCGTCGCCGCTAAGCCAGAATAACAAGAAGCGACGGAGCTGCTGCAGGAGATTCTGTTGCTTCCTCTGCCTTTTCGTCGTGATCCTGATCGTCGTGATTGTTGCCGCCGGTGGCTTTTTATACCTCTGGTTTGAGCCAAAACTTCCTGTGTTTCATCTCCAATCATTCCGGATCTCCAAATTCAACGTCACCGACAAAGCGGACGGATCGTATTTAAACGCCAAAACAATCGGTCGCATTGAAATCAAGAATCCTAATGAGAAGTTATCACTAAATTACCGCGATATCGAGGTCGTTACCGCCGCCGGAGAAGGTACTCGAATTGAATTAGGATCCACAATCGTGCCGAGCTTTATTCAATCGAAGGAGAACACGACGAGTTGGAAGATCGAAACAATGGTCAGCAACGAAATCGTCGACGGCGGAGCGGGGCGGAAGCTGAACTCCGGCAACAGAACCGGAGAGCTAGTGGTGATCGTCGAAGCACGTACGAAAATCGGATTCGTGGTGAATGGGCGGAGAATGCCGCGGGTGAAGATCGAAGTGGCGTGCGGTGGCGTGAGCTTGAAACGTCTCGACGGAGGCAACACGCCGAAATGCTCCATTCATTTGCCCAGATGGTTTCTGTAA

mRNA sequence

ATGGCAGAGCCGCCATTGAAGCCTGTACTCCAGAAGCCTCCGGGGTATAAAGATCCAAATCTCACTGCTCCGCCAGTGCCTAGACCACCGGTGAGGAAACTGATTCTACCGTCGCCGCTAAGCCAGAATAACAAGAAGCGACGGAGCTGCTGCAGGAGATTCTGTTGCTTCCTCTGCCTTTTCGTCGTGATCCTGATCGTCGTGATTGTTGCCGCCGGTGGCTTTTTATACCTCTGGTTTGAGCCAAAACTTCCTGTGTTTCATCTCCAATCATTCCGGATCTCCAAATTCAACGTCACCGACAAAGCGGACGGATCGTATTTAAACGCCAAAACAATCGGTCGCATTGAAATCAAGAATCCTAATGAGAAGTTATCACTAAATTACCGCGATATCGAGGTCGTTACCGCCGCCGGAGAAGGTACTCGAATTGAATTAGGATCCACAATCGTGCCGAGCTTTATTCAATCGAAGGAGAACACGACGAGTTGGAAGATCGAAACAATGGTCAGCAACGAAATCGTCGACGGCGGAGCGGGGCGGAAGCTGAACTCCGGCAACAGAACCGGAGAGCTAGTGGTGATCGTCGAAGCACGTACGAAAATCGGATTCGTGGTGAATGGGCGGAGAATGCCGCGGGTGAAGATCGAAGTGGCGTGCGGTGGCGTGAGCTTGAAACGTCTCGACGGAGGCAACACGCCGAAATGCTCCATTCATTTGCCCAGATGGTTTCTGTAA

Coding sequence (CDS)

ATGGCAGAGCCGCCATTGAAGCCTGTACTCCAGAAGCCTCCGGGGTATAAAGATCCAAATCTCACTGCTCCGCCAGTGCCTAGACCACCGGTGAGGAAACTGATTCTACCGTCGCCGCTAAGCCAGAATAACAAGAAGCGACGGAGCTGCTGCAGGAGATTCTGTTGCTTCCTCTGCCTTTTCGTCGTGATCCTGATCGTCGTGATTGTTGCCGCCGGTGGCTTTTTATACCTCTGGTTTGAGCCAAAACTTCCTGTGTTTCATCTCCAATCATTCCGGATCTCCAAATTCAACGTCACCGACAAAGCGGACGGATCGTATTTAAACGCCAAAACAATCGGTCGCATTGAAATCAAGAATCCTAATGAGAAGTTATCACTAAATTACCGCGATATCGAGGTCGTTACCGCCGCCGGAGAAGGTACTCGAATTGAATTAGGATCCACAATCGTGCCGAGCTTTATTCAATCGAAGGAGAACACGACGAGTTGGAAGATCGAAACAATGGTCAGCAACGAAATCGTCGACGGCGGAGCGGGGCGGAAGCTGAACTCCGGCAACAGAACCGGAGAGCTAGTGGTGATCGTCGAAGCACGTACGAAAATCGGATTCGTGGTGAATGGGCGGAGAATGCCGCGGGTGAAGATCGAAGTGGCGTGCGGTGGCGTGAGCTTGAAACGTCTCGACGGAGGCAACACGCCGAAATGCTCCATTCATTTGCCCAGATGGTTTCTGTAA

Protein sequence

MAEPPLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLCLFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIKNPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGAGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIHLPRWFL
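The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 27 and last 12 nucleotides of the CDS shown above; `translate` is a hypothetical helper written for illustration.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

n_term = translate("ATGGCAGAGCCGCCATTGAAGCCTGTA")  # first 9 codons of the CDS
c_term = translate("TGGTTTCTGTAA")                 # last 4 codons of the CDS
print(n_term, c_term)
```

The two fragments translate to the start (MAEPPLKPV) and end (WFL followed by the stop codon) of the protein sequence listed above.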
BLAST of Cp4.1LG03g16230 vs. TrEMBL
Match: A0A0A0L3Z7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G120370 PE=4 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 2.1e-109
Identity = 208/246 (84.55%), Positives = 216/246 (87.80%), Query Frame = 1

Query: 1   MAE-PPLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           MAE PPLKP+LQKPPG+KDPN  A PVPRPP RKLILPSPLSQ NKKRRSC RR CCF C
Sbjct: 1   MAEPPPLKPILQKPPGFKDPNHIALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFC 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           L V+ILIV I+A GG LYLWFEPKLPV HLQSFRISKFNVTDK+DGSYLNAKTIGRIEIK
Sbjct: 61  LLVLILIVAILAVGGVLYLWFEPKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGA 180
           NPN KLSLNY DIEV  AAGEGTR ELGS IVPSFIQS+ENTTS KIETMVSNE VD GA
Sbjct: 121 NPNSKLSLNYGDIEVQIAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVSNETVDDGA 180

Query: 181 GRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIH 240
           GR LNSGNRTGELVV VEARTKIGFVV+GRRMP VKIEV+CG VSLKRLD GN PKCSIH
Sbjct: 181 GRNLNSGNRTGELVVNVEARTKIGFVVDGRRMPPVKIEVSCGSVSLKRLDRGNVPKCSIH 240

Query: 241 LPRWFL 246
           L RWFL
Sbjct: 241 LRRWFL 246
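The percentages in each HSP statistics line are simply the match count divided by the alignment length. The sketch below parses such a line and recomputes the values; the regular expression and the function names are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fields such as 'Identity = 208/246 (84.55%)'.
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(hits), int(length), float(pct))
        for label, hits, length, pct in STAT.findall(line)
    }

def percent(hits, length):
    """Recompute a percentage to two decimals from the raw counts."""
    return round(100.0 * hits / length, 2)

line = "Identity = 208/246 (84.55%), Positives = 216/246 (87.80%), Query Frame = 1"
stats = parse_hsp_stats(line)
for label, (hits, length, reported) in stats.items():
    print(label, reported, percent(hits, length))
```

The "Query Frame" field is skipped because it carries no ratio. The denominator is the alignment length including gap columns, which is why it can differ from the length of either sequence.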

BLAST of Cp4.1LG03g16230 vs. TrEMBL
Match: W9SAG5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 1.4e-65
Identity = 131/246 (53.25%), Positives = 171/246 (69.51%), Query Frame = 1

Query: 1   MAEPPLKPV-LQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           MAE PLKP  LQKPPGY+DP     PV RPP RK +LP+      K+RR+ CR  CCF+ 
Sbjct: 1   MAEQPLKPPPLQKPPGYRDPAAPGKPVARPPQRKPVLPASFHPR-KRRRNWCRTCCCFVF 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           +F+++L + +  AGG  YLWFEPKLPVFHLQS RI +FNVT K DG+YL+A T+ RIE+K
Sbjct: 61  VFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAGTVTRIEVK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGA 180
           NPN KL L Y    V  + GE    ELG   +  F Q KENTTS K+ET V N++VD G 
Sbjct: 121 NPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVKNQLVDDGL 180

Query: 181 GRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIH 240
           G++L SG ++ +LVV +EA+T +G++V G ++  V++ V CGGVSLK+LD G+ PKCSI 
Sbjct: 181 GKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSGDMPKCSID 240

Query: 241 LPRWFL 246
           L +W +
Sbjct: 241 LLKWVI 245

BLAST of Cp4.1LG03g16230 vs. TrEMBL
Match: A0A061DTS6_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS=Theobroma cacao GN=TCM_005206 PE=4 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 1.5e-62
Identity = 130/245 (53.06%), Positives = 169/245 (68.98%), Query Frame = 1

Query: 1   MAEPPLKPVLQKPPGYKDPNLTA-PPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           M EPPLKPVLQKPPGYKDP+  A  P  RPP RK +LP P     K+R  CCR  CC  C
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFRPPPRKPVLP-PSFHPKKRRGGCCRVCCCCFC 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           +F +ILI++++  G   YLWF+PKLP FH+QS RIS+FNVT+K DG+YL+A+T  R+E+K
Sbjct: 61  IFFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQTTTRLEVK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGE-GTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGG 180
           NPN K++  Y + EV  + GE G   ELG+T V  F   K+NTTS K+ET V N++VD G
Sbjct: 121 NPNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVINKLVDDG 180

Query: 181 AGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSI 240
            G +L +  R+  L V VEARTKIG  V G ++  V + V C G++LKRLDGG+ PKC I
Sbjct: 181 VGTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGGDMPKCVI 240

Query: 241 HLPRW 244
           ++ +W
Sbjct: 241 NMLKW 244

BLAST of Cp4.1LG03g16230 vs. TrEMBL
Match: A0A0D2QQD3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 1.6e-56
Identity = 121/245 (49.39%), Positives = 166/245 (67.76%), Query Frame = 1

Query: 1   MAEPPLKPVLQKPPGYKDPNLTAPPVP-RPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           M+EPP+KPVLQKPPGYKDPN  A     RPP RK +LP P     K++ S  R  CC  C
Sbjct: 1   MSEPPVKPVLQKPPGYKDPNSPAGQRRFRPPPRKPVLP-PSFHPKKRKTSYGRACCCCFC 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           +F +I +++I+  G   YLWF+P+LP FH+QSFRIS+FNVT + DG+YL+A+T  R+E+K
Sbjct: 61  IFFLIFLLLILICGAVFYLWFDPQLPGFHIQSFRISRFNVTKRPDGTYLDARTTTRLEVK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGE-GTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGG 180
           NPN K++  Y D EV  + GE G   ELG+T VP+F   ++NT S ++ET+ SN++V   
Sbjct: 121 NPNGKMTYYYGDTEVEISFGEGGYETELGTTTVPAFTMLEKNTRSLRVETIASNKLVVDE 180

Query: 181 AGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSI 240
            G KL +  R+  L V VEARTK+G  V G ++  V + V C G+S K+LDGG+ PKC I
Sbjct: 181 VGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGGDMPKCVI 240

Query: 241 HLPRW 244
           ++ +W
Sbjct: 241 NMLKW 244

BLAST of Cp4.1LG03g16230 vs. TrEMBL
Match: A0A0B0NJM7_GOSAR (D-alanine--poly (Phosphoribitol) ligase subunit 1 OS=Gossypium arboreum GN=F383_18367 PE=4 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 2.7e-56
Identity = 121/245 (49.39%), Positives = 164/245 (66.94%), Query Frame = 1

Query: 1   MAEPPLKPVLQKPPGYKDPNLTAPPVP-RPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           M+EPP+KPVLQKPPGYKDP+  A     RPP RK +LP P     K++ S  R  CC  C
Sbjct: 1   MSEPPVKPVLQKPPGYKDPSSPAGQRRFRPPPRKPVLP-PSFHPKKRKTSYGRACCCCFC 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           +F +I +++I+  G   YLWF+PKLP FH+QSFRIS+FNVT + DG+YL+A+T  R+E+K
Sbjct: 61  IFFLIFLLLILICGAVFYLWFDPKLPGFHIQSFRISRFNVTKRPDGTYLDARTTTRLEVK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGE-GTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGG 180
           NPN K+   Y D EV  + GE G   ELG+T VP+F   ++NT S ++ET  SN++V   
Sbjct: 121 NPNRKMIYYYGDTEVEVSLGEGGYETELGTTTVPAFTMLEKNTRSLRVETKASNKLVVDE 180

Query: 181 AGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSI 240
            G KL +  R+  L V VEARTK+G  V G ++  V + V C G+S K+LDGG+ PKC I
Sbjct: 181 VGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGGDMPKCVI 240

Query: 241 HLPRW 244
           ++ +W
Sbjct: 241 NMLKW 244

BLAST of Cp4.1LG03g16230 vs. TAIR10
Match: AT2G46300.1 (AT2G46300.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 198.4 bits (503), Expect = 5.3e-51
Identity = 102/249 (40.96%), Positives = 157/249 (63.05%), Query Frame = 1

Query: 1   MAEPPLKPVLQKPPGYKDPNLTAPPVPRPPVR----KLILPSPLSQN-NKKRRSCCRRFC 60
           MA+  + PVLQKPPGY+DPN+++PP P PP++    +  +P P S    KKRRSCCR  C
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPPPPPIQQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 61  CFLCLFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGR 120
           C +C+ +V+ I +++      YLWF+PKLP F L SFR+  F + D  DG+ L+A  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 121 IEIKNPNEKLSLNYRDIEVVTAAGEGT-RIELGSTIVPSFIQSKENTTSWKIETMVSNEI 180
           +E+KNPN KL   Y +  V  + G G     +G T +  F Q  +N+TS K+ET V N++
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 181 VDGGAGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTP 240
           V+ G  ++L +  ++ +LV+ V A+TK+G  V G ++  + + + CGGVSL +LD  ++P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLD-TDSP 240

Query: 241 KCSIHLPRW 244
           KC ++  +W
Sbjct: 241 KCILNTLKW 248

BLAST of Cp4.1LG03g16230 vs. TAIR10
Match: AT1G01453.2 (AT1G01453.2 unknown protein)

HSP 1 Score: 149.8 bits (377), Expect = 2.1e-36
Identity = 99/254 (38.98%), Positives = 147/254 (57.87%), Query Frame = 1

Query: 2   AEPPLKPVLQKPPGYKDPNL--TAPP--VPRPPVRKLILPSPLSQ-NNKKRRSCCRRFCC 61
           AE PL+P LQKPPG++D     +APP      P R+   P P+   + K+R S CR FCC
Sbjct: 16  AEKPLQPALQKPPGFRDQQNQPSAPPSGTATLPRRR---PRPIHPADKKRRCSFCRVFCC 75

Query: 62  FLC-LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTD-KADG--SYLNAKT 121
            +C LF VIL+++++A   F +LW+ PKLPV  L SF+IS FN +D K+D   S+L+A T
Sbjct: 76  CVCILFAVILLLILIAVAVF-FLWYSPKLPVVRLASFKISNFNFSDGKSDDGWSFLSADT 135

Query: 122 IGRIEIKNPNEKLSLNYRDIEVVTAAGE-GTRIELGSTIVPSFIQSKENTTSWKIETMVS 181
              ++ +NPN KL+  Y D +V    GE      L ST V  FI+   N T+  + T V 
Sbjct: 136 TSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIVPTTVR 195

Query: 182 NEIVDGGAGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGG 241
              VD    ++L    ++ +L+V V A+TK+G  V  R++  V + + CGGV L+ LD  
Sbjct: 196 KRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQTLD-S 255

Query: 242 NTPKCSIHLPRWFL 246
              +C+I + +W++
Sbjct: 256 KMAQCTIKMLKWYV 264

BLAST of Cp4.1LG03g16230 vs. TAIR10
Match: AT4G01110.1 (AT4G01110.1 unknown protein)

HSP 1 Score: 142.1 bits (357), Expect = 4.5e-34
Identity = 90/257 (35.02%), Positives = 140/257 (54.47%), Query Frame = 1

Query: 3   EPPLKPVLQKPPGYKDPNLTAPPVP------------RPPVRKLILPSPLSQNNKKRRSC 62
           E  LKPVLQKPPGY++ + + P  P            RPP  K  +P+      K++ S 
Sbjct: 4   ETLLKPVLQKPPGYRELH-SQPQTPLGSSSSSSSMLRRPP--KHAIPAAFYPTKKRQWSR 63

Query: 63  CRRFCCFLCLFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADG---SY 122
           CR FCC +C+ V I+I++++      +L++ P+LPV  L SFR+S FN +    G   S 
Sbjct: 64  CRVFCCCVCITVAIVILLLILTVSVFFLYYSPRLPVVRLSSFRVSNFNFSGGKAGDGLSQ 123

Query: 123 LNAKTIGRIEIKNPNEKLSLNYRDIEVVTAAGEGT-RIELGSTIVPSFIQSKENTTSWKI 182
           L A+   R++ +NPN KL   Y +++V  + GE      LGST V  F++   N T   +
Sbjct: 124 LTAEATARLDFRNPNGKLRYYYGNVDVAVSVGEDDFETSLGSTKVKGFVEKPGNRTVVIV 183

Query: 183 ETMVSNEIVDGGAGRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLK 242
              V  + VD    ++L +  ++ +LVV V A+TK+G  V  R++  V + ++CGGV L+
Sbjct: 184 PIKVKKQQVDDPTVKRLRADMKSKKLVVKVMAKTKVGLGVGRRKIVTVGVTISCGGVRLQ 243

Query: 243 RLDGGNTPKCSIHLPRW 244
            LD     KC+I + +W
Sbjct: 244 TLD-SKMSKCTIKMLKW 256

BLAST of Cp4.1LG03g16230 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 75.9 bits (185), Expect = 3.9e-14
Identity = 64/217 (29.49%), Positives = 100/217 (46.08%), Query Frame = 1

Query: 5   PLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLCLFVVI 64
           PL P       + DP+    P+ + P R + L  P     KKRRSCC R  C+   F+++
Sbjct: 23  PLVPRGSSRSEHGDPSKV--PLNQRPQRFVPLAPP-----KKRRSCCCRCFCYTFCFLLL 82

Query: 65  LIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIKNPNEK 124
           L+V + A+ G LYL F+PKLP + +   ++++F +    D S   A  +  I  KNPNEK
Sbjct: 83  LVVAVGASIGILYLVFKPKLPDYSIDRLQLTRFALNQ--DSSLTTAFNV-TITAKNPNEK 142

Query: 125 LSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGAGRKLN 184
           + + Y D   +T      ++  GS  +P F Q  ENTT   +E     +   G       
Sbjct: 143 IGIYYEDGSKITVWYMEHQLSNGS--LPKFYQGHENTTVIYVEMTGQTQNASGLRTTLEE 202

Query: 185 SGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACG 222
              RTG + + +     +       ++  V+  V CG
Sbjct: 203 QQQRTGNIPLRIRVNQPVRVKFGKLKLFEVRFLVRCG 227

BLAST of Cp4.1LG03g16230 vs. TAIR10
Match: AT5G36970.1 (AT5G36970.1 NDR1/HIN1-like 25)

HSP 1 Score: 71.6 bits (174), Expect = 7.4e-13
Identity = 62/212 (29.25%), Positives = 96/212 (45.28%), Query Frame = 1

Query: 19  PNLTAPPVPRPPVR-------KLILPSPLSQNNKKR--RSCCRRFCCFLCLFVVILIVVI 78
           P+ TAP VPR   R       K    +PL    +K+  RSC  R  C+  L + +LIV++
Sbjct: 17  PHPTAPLVPRGSSRSEHGDPTKTQQAAPLDPPREKKGSRSCWCRCVCYTLLVLFLLIVIV 76

Query: 79  VAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIKNPNEKLSLNY 138
            A  G LYL F PK P +++   ++++F +    D S   A  +  I  KNPNEK+ + Y
Sbjct: 77  GAIVGILYLVFRPKFPDYNIDRLQLTRFQLNQ--DLSLSTAFNV-TITAKNPNEKIGIYY 136

Query: 139 RDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGAGRKLNSGNRT 198
            D   ++     TRI  GS  +P F Q  ENTT   +E     +               T
Sbjct: 137 EDGSKISVLYMQTRISNGS--LPKFYQGHENTTIILVEMTGFTQNATSLMTTLQEQQRLT 196

Query: 199 GELVVIVEARTKIGFVVNGRRMPRVKIEVACG 222
           G + + +     +   +   ++ +V+  V CG
Sbjct: 197 GSIPLRIRVTQPVRIKLGKLKLMKVRFLVRCG 223

BLAST of Cp4.1LG03g16230 vs. NCBI nr
Match: gi|659075079|ref|XP_008437954.1| (PREDICTED: protein YLS9-like isoform X1 [Cucumis melo])

HSP 1 Score: 403.7 bits (1036), Expect = 2.3e-109
Identity = 209/246 (84.96%), Positives = 216/246 (87.80%), Query Frame = 1

Query: 1   MAEP-PLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           MAEP PLKP+LQKPPG+KDPN TA PVPRPP RKLILPSPLSQ NKKRRSC RR CCF C
Sbjct: 69  MAEPTPLKPILQKPPGFKDPNHTALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFC 128

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           LFV+ILIVVI+A GG LYLWFE KLPV HLQSFRISKFNVTDK+DGSYLNAKTIGRIEIK
Sbjct: 129 LFVLILIVVILAVGGVLYLWFERKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIK 188

Query: 121 NPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGA 180
           NPN KLSLNY DIEV  AAGEGTR ELGS IVPSFIQS+ENTTS KIETMV+NE VD GA
Sbjct: 189 NPNSKLSLNYGDIEVQVAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVNNETVDDGA 248

Query: 181 GRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIH 240
           GR LNSGNRTGELVV VEARTKIGFVV GRRMP VKIEV CG VSLKRLD GN PKCSIH
Sbjct: 249 GRYLNSGNRTGELVVNVEARTKIGFVVEGRRMPPVKIEVTCGSVSLKRLDRGNVPKCSIH 308

Query: 241 LPRWFL 246
           L RWFL
Sbjct: 309 LRRWFL 314

BLAST of Cp4.1LG03g16230 vs. NCBI nr
Match: gi|659075081|ref|XP_008437955.1| (PREDICTED: protein YLS9-like isoform X2 [Cucumis melo])

HSP 1 Score: 403.7 bits (1036), Expect = 2.3e-109
Identity = 209/246 (84.96%), Positives = 216/246 (87.80%), Query Frame = 1

Query: 1   MAEP-PLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           MAEP PLKP+LQKPPG+KDPN TA PVPRPP RKLILPSPLSQ NKKRRSC RR CCF C
Sbjct: 1   MAEPTPLKPILQKPPGFKDPNHTALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFC 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           LFV+ILIVVI+A GG LYLWFE KLPV HLQSFRISKFNVTDK+DGSYLNAKTIGRIEIK
Sbjct: 61  LFVLILIVVILAVGGVLYLWFERKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGA 180
           NPN KLSLNY DIEV  AAGEGTR ELGS IVPSFIQS+ENTTS KIETMV+NE VD GA
Sbjct: 121 NPNSKLSLNYGDIEVQVAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVNNETVDDGA 180

Query: 181 GRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIH 240
           GR LNSGNRTGELVV VEARTKIGFVV GRRMP VKIEV CG VSLKRLD GN PKCSIH
Sbjct: 181 GRYLNSGNRTGELVVNVEARTKIGFVVEGRRMPPVKIEVTCGSVSLKRLDRGNVPKCSIH 240

Query: 241 LPRWFL 246
           L RWFL
Sbjct: 241 LRRWFL 246

BLAST of Cp4.1LG03g16230 vs. NCBI nr
Match: gi|778677027|ref|XP_011650715.1| (PREDICTED: uncharacterized protein LOC101214208 [Cucumis sativus])

HSP 1 Score: 403.3 bits (1035), Expect = 3.1e-109
Identity = 208/246 (84.55%), Positives = 216/246 (87.80%), Query Frame = 1

Query: 1   MAE-PPLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           MAE PPLKP+LQKPPG+KDPN  A PVPRPP RKLILPSPLSQ NKKRRSC RR CCF C
Sbjct: 1   MAEPPPLKPILQKPPGFKDPNHIALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFC 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           L V+ILIV I+A GG LYLWFEPKLPV HLQSFRISKFNVTDK+DGSYLNAKTIGRIEIK
Sbjct: 61  LLVLILIVAILAVGGVLYLWFEPKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGA 180
           NPN KLSLNY DIEV  AAGEGTR ELGS IVPSFIQS+ENTTS KIETMVSNE VD GA
Sbjct: 121 NPNSKLSLNYGDIEVQIAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVSNETVDDGA 180

Query: 181 GRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIH 240
           GR LNSGNRTGELVV VEARTKIGFVV+GRRMP VKIEV+CG VSLKRLD GN PKCSIH
Sbjct: 181 GRNLNSGNRTGELVVNVEARTKIGFVVDGRRMPPVKIEVSCGSVSLKRLDRGNVPKCSIH 240

Query: 241 LPRWFL 246
           L RWFL
Sbjct: 241 LRRWFL 246

BLAST of Cp4.1LG03g16230 vs. NCBI nr
Match: gi|702333839|ref|XP_010055051.1| (PREDICTED: protein YLS9-like [Eucalyptus grandis])

HSP 1 Score: 258.1 bits (658), Expect = 1.6e-65
Identity = 125/243 (51.44%), Positives = 169/243 (69.55%), Query Frame = 1

Query: 1   MAEPPLKPVLQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLCL 60
           MAEPP KP+LQKPPGY+DP++     P  P RK ++P P     KKRRSCCR  CC LC+
Sbjct: 1   MAEPPQKPMLQKPPGYRDPSVVVQQPPTQPYRKPVMP-PSMYPRKKRRSCCRSCCCCLCV 60

Query: 61  FVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIKN 120
            + +++ V++ AG   YLWF PK+PVFHLQSFRI +FNVT K DG+YL A+T+ R+E+KN
Sbjct: 61  LIFLILCVLILAGALSYLWFGPKIPVFHLQSFRIPRFNVTAKPDGTYLKAQTVLRVEVKN 120

Query: 121 PNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGAG 180
           PN+KL L Y   +V  + G G  IELGS  +P F Q K+N TS K+ T V +E+V+ GAG
Sbjct: 121 PNQKLGLYYGGTDVDISLGRGGGIELGSDSLPGFTQGKKNVTSLKVTTEVRDELVEDGAG 180

Query: 181 RKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIHL 240
            +L SG R+  LVV V+ RT +G ++ G ++ RV++ V CG V++K ++GG  PKC I+L
Sbjct: 181 AELRSGYRSKSLVVKVKVRTSVGAIIQGWKVGRVRVNVECGEVAMKEVEGGEMPKCKINL 240

Query: 241 PRW 244
            RW
Sbjct: 241 LRW 242

BLAST of Cp4.1LG03g16230 vs. NCBI nr
Match: gi|703148826|ref|XP_010109444.1| (hypothetical protein L484_003064 [Morus notabilis])

HSP 1 Score: 257.7 bits (657), Expect = 2.1e-65
Identity = 131/246 (53.25%), Positives = 171/246 (69.51%), Query Frame = 1

Query: 1   MAEPPLKPV-LQKPPGYKDPNLTAPPVPRPPVRKLILPSPLSQNNKKRRSCCRRFCCFLC 60
           MAE PLKP  LQKPPGY+DP     PV RPP RK +LP+      K+RR+ CR  CCF+ 
Sbjct: 1   MAEQPLKPPPLQKPPGYRDPAAPGKPVARPPQRKPVLPASFHPR-KRRRNWCRTCCCFVF 60

Query: 61  LFVVILIVVIVAAGGFLYLWFEPKLPVFHLQSFRISKFNVTDKADGSYLNAKTIGRIEIK 120
           +F+++L + +  AGG  YLWFEPKLPVFHLQS RI +FNVT K DG+YL+A T+ RIE+K
Sbjct: 61  VFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAGTVTRIEVK 120

Query: 121 NPNEKLSLNYRDIEVVTAAGEGTRIELGSTIVPSFIQSKENTTSWKIETMVSNEIVDGGA 180
           NPN KL L Y    V  + GE    ELG   +  F Q KENTTS K+ET V N++VD G 
Sbjct: 121 NPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVKNQLVDDGL 180

Query: 181 GRKLNSGNRTGELVVIVEARTKIGFVVNGRRMPRVKIEVACGGVSLKRLDGGNTPKCSIH 240
           G++L SG ++ +LVV +EA+T +G++V G ++  V++ V CGGVSLK+LD G+ PKCSI 
Sbjct: 181 GKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSGDMPKCSID 240

Query: 241 LPRWFL 246
           L +W +
Sbjct: 241 LLKWVI 245

The following BLAST results are available for this feature:
BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L3Z7_CUCSA  2.1e-109  84.55         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G120370 PE=4 SV=1
W9SAG5_9ROSA      1.4e-65   53.25         Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1
A0A061DTS6_THECC  1.5e-62   53.06         Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS...
A0A0D2QQD3_GOSRA  1.6e-56   49.39         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1
A0A0B0NJM7_GOSAR  2.7e-56   49.39         D-alanine--poly (Phosphoribitol) ligase subunit 1 OS=Gossypium arboreum GN=F383_...

BLAST vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT2G46300.1  5.3e-51  40.96         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G01453.2  2.1e-36  38.98         unknown protein
AT4G01110.1  4.5e-34  35.02         unknown protein
AT1G65690.1  3.9e-14  29.49         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT5G36970.1  7.4e-13  29.25         NDR1/HIN1-like 25

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659075079|ref|XP_008437954.1|  2.3e-109  84.96         PREDICTED: protein YLS9-like isoform X1 [Cucumis melo]
gi|659075081|ref|XP_008437955.1|  2.3e-109  84.96         PREDICTED: protein YLS9-like isoform X2 [Cucumis melo]
gi|778677027|ref|XP_011650715.1|  3.1e-109  84.55         PREDICTED: uncharacterized protein LOC101214208 [Cucumis sativus]
gi|702333839|ref|XP_010055051.1|  1.6e-65   51.44         PREDICTED: protein YLS9-like [Eucalyptus grandis]
gi|703148826|ref|XP_010109444.1|  2.1e-65   53.25         hypothetical protein L484_003064 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004864  LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG03g16230.1  Cp4.1LG03g16230.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                              Source   Source Term     Source Description                                            Alignment
IPR004864  Late embryogenesis abundant protein, LEA-14  PFAM     PF03168         LEA_2                                                         coord: 116..212; score: 8.
None       No IPR available                             PANTHER  PTHR31234       FAMILY NOT NAMED                                              coord: 1..244; score: 4.9
None       No IPR available                             PANTHER  PTHR31234:SF12  LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 1..244; score: 4.9
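The "coord" values in the InterPro annotations can be used to cut a domain out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions (the record does not say so explicitly) and demonstrates the slicing on a toy string rather than the real protein; both helper names are hypothetical.

```python
def parse_coord(text):
    """Turn a string such as 'coord: 116..212' into (116, 212)."""
    span = text.split(":", 1)[1].strip()
    start, end = span.split("..")
    return int(start), int(end)

def slice_1based(seq, start, end):
    """Extract residues start..end, assuming 1-based inclusive positions."""
    return seq[start - 1:end]

start, end = parse_coord("coord: 116..212")
domain_length = end - start + 1

# Toy sequence, used only to show the indexing convention.
toy = "ABCDEFGHIJ"
print(domain_length, slice_1based(toy, 3, 5))
```

Under that assumption the PF03168 (LEA_2) match covers 97 residues, and passing the full protein sequence to `slice_1based` with the same coordinates would return that region.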

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG03g16230  Cp4.1LG08g01150  Cucurbita pepo (Zucchini)  cpecpeB482