Cp4.1LG08g01150 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g01150
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: D-alanine--poly (Phosphoribitol) ligase subunit 1
Location: Cp4.1LG08 : 3731773 .. 3733019 (-)
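
The location field packs a sequence ID, a start, an end, and a strand into one string. A minimal sketch of pulling those apart, assuming the coordinates are 1-based and inclusive (a common convention for such records, but not stated on this page); the helper name `parse_location` is hypothetical and not part of any database API:

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    genomic span is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG08 : 3731773 .. 3733019 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1247 bp on the minus strand of Cp4.1LG08.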
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACGTAGGAGACGAGTGATAAGTATGTACGCTCGGTTAGCCATATGATTTTCTCTATTTATGTGAAAAAAAAAGAGAGAGTTGGGACAAAAATGAGAATGAGAGGTTGTTAGGCCATATGATTTTACCAAATTGGATTTTGTGCTACGCAACGCAACCAATCCAAACGCCTCGCATCCAAATTTCTGTCAGTTCGTTCTCATCGCAAGAGCGGATTTCCGGCAGCAATTCGTTCAACTAAGGTAATCTTCTTCTTCTTCTTCTTCTTCTTCTTTGATTTGTTCATCTCTGCGTGTGAATTTGAGTTCAGTTCTCCTGTGACATGGAATTACGTTCTGTTCTTCTTTCCGGAGGAAACGATTTCGATTTCCTAATATTTAATGCGATTCATGAACTCGACTAATTAACTGAATAGATTCCTTTTTCTTTGTCTCGATTTCTGCATTGATTTTGAATTGACTGTAATTAAATCAGATTTCCGTGAAGGAGTTCGTTAATTTCAGAGAATATGACAGAGCCGCCATTGAAACCTGTTCTCCAGAAGCCTCCGGGGTATAAAGAACTGAATCTCGCTGCTCAGCCGGTGGTGAGACCGTCGGCGAGGAAATTGATTCTTCCTTCGCCGATGAACCAGAAGAACAAGAAGCGACGGAGCTTCTGCCGTGGATGCTGTTGTTTCCTCTGCTTTTTCCTTCTGATTCTGATCATTGTGGCTGTCGCCGCTGGTGGAGTTTTATACCTCTGGTTTGAGCCAAAACTCCCTGTCTTTCATCTCCAATTATTCCGAATCTTGAAATTGAACGTCACCGACAAATCGGACGGCTCGTATCTAAACGCGAGAACCCTCGGTCGTATCGAAATCAAGAACCCTAACTCGAAGTTATCACTGAATTACGGCGATATCGAGATCAAAATCGCCGCTGGAGAAGGAACTCAAACTGAATTAGGATCAACGATTTTGCGAAGCTTTGTTCAATCTAATGAGAACAAGACGAGTTTGAAGATCGAAACTATAGTCAGCAATGAAACAGTCGACAGCGGAGCGGGCTGGAAGTTGATCTCCGGCAACCGAACCGGCGAGTTAATAGTGAACGTCGGAGCAGAGACGAAAATCGGATATATGGTGAATGGTCGGAGAATGCCGCCGGTGAAGATCGAAGTGACGTGCGGAGGCGTGAGGTTGAAAAGTCTCAACAGAGGCGACATGCCGAAATGCTCCATCCATTTGCGCCGATGGTTTCTGTAA

mRNA sequence

ATGACGCCATATGATTTTACCAAATTGGATTTTGTGCTACGCAACGCAACCAATCCAAACGCCTCGCATCCAAATTTCTGTCAGTTCGTTCTCATCGCAAGAGCGGATTTCCGGCAGCAATTCGTTCAACTAAGGAGTTCGTTAATTTCAGAGAATATGACAGAGCCGCCATTGAAACCTGTTCTCCAGAAGCCTCCGGGGTATAAAGAACTGAATCTCGCTGCTCAGCCGGTGGTGAGACCGTCGGCGAGGAAATTGATTCTTCCTTCGCCGATGAACCAGAAGAACAAGAAGCGACGGAGCTTCTGCCGTGGATGCTGTTGTTTCCTCTGCTTTTTCCTTCTGATTCTGATCATTGTGGCTGTCGCCGCTGGTGGAGTTTTATACCTCTGGTTTGAGCCAAAACTCCCTGTCTTTCATCTCCAATTATTCCGAATCTTGAAATTGAACGTCACCGACAAATCGGACGGCTCGTATCTAAACGCGAGAACCCTCGGTCGTATCGAAATCAAGAACCCTAACTCGAAGTTATCACTGAATTACGGCGATATCGAGATCAAAATCGCCGCTGGAGAAGGAACTCAAACTGAATTAGGATCAACGATTTTGCGAAGCTTTGTTCAATCTAATGAGAACAAGACGAGTTTGAAGATCGAAACTATAGTCAGCAATGAAACAGTCGACAGCGGAGCGGGCTGGAAGTTGATCTCCGGCAACCGAACCGGCGAGTTAATAGTGAACGTCGGAGCAGAGACGAAAATCGGATATATGGTGAATGGTCGGAGAATGCCGCCGGTGAAGATCGAAGTGACGTGCGGAGGCGTGAGGTTGAAAAGTCTCAACAGAGGCGACATGCCGAAATGCTCCATCCATTTGCGCCGATGGTTTCTGTAA

Coding sequence (CDS)

ATGACGCCATATGATTTTACCAAATTGGATTTTGTGCTACGCAACGCAACCAATCCAAACGCCTCGCATCCAAATTTCTGTCAGTTCGTTCTCATCGCAAGAGCGGATTTCCGGCAGCAATTCGTTCAACTAAGGAGTTCGTTAATTTCAGAGAATATGACAGAGCCGCCATTGAAACCTGTTCTCCAGAAGCCTCCGGGGTATAAAGAACTGAATCTCGCTGCTCAGCCGGTGGTGAGACCGTCGGCGAGGAAATTGATTCTTCCTTCGCCGATGAACCAGAAGAACAAGAAGCGACGGAGCTTCTGCCGTGGATGCTGTTGTTTCCTCTGCTTTTTCCTTCTGATTCTGATCATTGTGGCTGTCGCCGCTGGTGGAGTTTTATACCTCTGGTTTGAGCCAAAACTCCCTGTCTTTCATCTCCAATTATTCCGAATCTTGAAATTGAACGTCACCGACAAATCGGACGGCTCGTATCTAAACGCGAGAACCCTCGGTCGTATCGAAATCAAGAACCCTAACTCGAAGTTATCACTGAATTACGGCGATATCGAGATCAAAATCGCCGCTGGAGAAGGAACTCAAACTGAATTAGGATCAACGATTTTGCGAAGCTTTGTTCAATCTAATGAGAACAAGACGAGTTTGAAGATCGAAACTATAGTCAGCAATGAAACAGTCGACAGCGGAGCGGGCTGGAAGTTGATCTCCGGCAACCGAACCGGCGAGTTAATAGTGAACGTCGGAGCAGAGACGAAAATCGGATATATGGTGAATGGTCGGAGAATGCCGCCGGTGAAGATCGAAGTGACGTGCGGAGGCGTGAGGTTGAAAAGTCTCAACAGAGGCGACATGCCGAAATGCTCCATCCATTTGCGCCGATGGTTTCTGTAA

Protein sequence

MTPYDFTKLDFVLRNATNPNASHPNFCQFVLIARADFRQQFVQLRSSLISENMTEPPLKPVLQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLCFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIKNPNSKLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSIHLRRWFL
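
The protein sequence above is the conceptual translation of the CDS. A minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is an illustrative helper, not a library function:

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGACGCCATATGATTTTACC"))
print(translate("TGGTTTCTGTAA"))
```

The two fragments translate to `MTPYDFT` and `WFL`, which agree with the start and end of the listed protein.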
BLAST of Cp4.1LG08g01150 vs. TrEMBL
Match: A0A0A0L3Z7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G120370 PE=4 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 2.0e-101
Identity = 186/242 (76.86%), Positives = 210/242 (86.78%), Query Frame = 1

Query: 56  PPLKPVLQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLCFFLL 115
           PPLKP+LQKPPG+K+ N  A PV RP ARKLILPSP++QKNKKRRS  R CCCF C  +L
Sbjct: 5   PPLKPILQKPPGFKDPNHIALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFCLLVL 64

Query: 116 ILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIKNPNS 175
           ILI+  +A GGVLYLWFEPKLPV HLQ FRI K NVTDKSDGSYLNA+T+GRIEIKNPNS
Sbjct: 65  ILIVAILAVGGVLYLWFEPKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIKNPNS 124

Query: 176 KLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGAGWKL 235
           KLSLNYGDIE++IAAGEGT+TELGS I+ SF+QS EN TSLKIET+VSNETVD GAG  L
Sbjct: 125 KLSLNYGDIEVQIAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVSNETVDDGAGRNL 184

Query: 236 ISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSIHLRRW 295
            SGNRTGEL+VNV A TKIG++V+GRRMPPVKIEV+CG V LK L+RG++PKCSIHLRRW
Sbjct: 185 NSGNRTGELVVNVEARTKIGFVVDGRRMPPVKIEVSCGSVSLKRLDRGNVPKCSIHLRRW 244

Query: 296 FL 298
           FL
Sbjct: 245 FL 246
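
The percentages in an HSP header are the match counts divided by the alignment length. A short sketch that re-derives them from the header line of the hit above; the helper name `hsp_percentages` is hypothetical, and the pattern accepts either spelling of the "Positives" label:

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from an HSP header line."""
    match = re.search(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)", header)
    if match is None:
        raise ValueError("not an HSP header line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100.0 * ident / ident_len, 2),
            round(100.0 * pos / pos_len, 2))

line = "Identity = 186/242 (76.86%), Positives = 210/242 (86.78%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit the recomputed values, 76.86 and 86.78, match the percentages printed in the header.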

BLAST of Cp4.1LG08g01150 vs. TrEMBL
Match: W9SAG5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 8.7e-65
Identity = 127/246 (51.63%), Positives = 168/246 (68.29%), Query Frame = 1

Query: 53  MTEPPLKPV-LQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLC 112
           M E PLKP  LQKPPGY++     +PV RP  RK +LP+  + + K+RR++CR CCCF+ 
Sbjct: 1   MAEQPLKPPPLQKPPGYRDPAAPGKPVARPPQRKPVLPASFHPR-KRRRNWCRTCCCFVF 60

Query: 113 FFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIK 172
            FLL+L +    AGG+ YLWFEPKLPVFHLQ  RI + NVT K DG+YL+A T+ RIE+K
Sbjct: 61  VFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAGTVTRIEVK 120

Query: 173 NPNSKLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGA 232
           NPN KL L YG   ++++ GE    ELG   L  F Q  EN TSLK+ET V N+ VD G 
Sbjct: 121 NPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVKNQLVDDGL 180

Query: 233 GWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSIH 292
           G +L SG ++ +L+V + A+T +GY+V G ++  V++ V CGGV LK L+ GDMPKCSI 
Sbjct: 181 GKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSGDMPKCSID 240

Query: 293 LRRWFL 298
           L +W +
Sbjct: 241 LLKWVI 245
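
Each HSP score line gives a bit score followed by the raw alignment score in parentheses, for example "255.4 bits (651)". The two are related by the Karlin-Altschul normalisation S' = (lambda * S - ln K) / ln 2. The sketch below assumes lambda = 0.267 and K = 0.041, values commonly reported for gapped BLOSUM62 searches with gap costs 11/1; the page does not say which scoring parameters were used, so treat both constants as assumptions:

```python
import math

LAMBDA = 0.267  # assumed Karlin-Altschul lambda (gapped BLOSUM62, gap costs 11/1)
K = 0.041       # assumed Karlin-Altschul K for the same scoring system

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)

for raw in (967, 651, 150):
    print(raw, round(bit_score(raw), 1))
```

With these assumed constants the raw scores 967, 651 and 150 give 377.1, 255.4 and 62.4 bits, consistent with the values printed for the corresponding hits on this page.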

BLAST of Cp4.1LG08g01150 vs. TrEMBL
Match: A0A061DTS6_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS=Theobroma cacao GN=TCM_005206 PE=4 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 5.5e-59
Identity = 122/245 (49.80%), Positives = 165/245 (67.35%), Query Frame = 1

Query: 53  MTEPPLKPVLQKPPGYKELNL-AAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLC 112
           M EPPLKPVLQKPPGYK+ +  A +P  RP  RK +LP   + K K+R   CR CCC  C
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFRPPPRKPVLPPSFHPK-KRRGGCCRVCCCCFC 60

Query: 113 FFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIK 172
            F LILI++ +  G V YLWF+PKLP FH+Q  RI + NVT+K DG+YL+A+T  R+E+K
Sbjct: 61  IFFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQTTTRLEVK 120

Query: 173 NPNSKLSLNYGDIEIKIAAGE-GTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSG 232
           NPN+K++  YG+ E+ ++ GE G +TELG+T +  F    +N TSLK+ET V N+ VD G
Sbjct: 121 NPNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVINKLVDDG 180

Query: 233 AGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSI 292
            G +L +  R+  L V+V A TKIG  V G ++  V + V C G+ LK L+ GDMPKC I
Sbjct: 181 VGTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGGDMPKCVI 240

Query: 293 HLRRW 296
           ++ +W
Sbjct: 241 NMLKW 244

BLAST of Cp4.1LG08g01150 vs. TrEMBL
Match: A0A0D2QQD3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 5.6e-56
Identity = 118/245 (48.16%), Positives = 163/245 (66.53%), Query Frame = 1

Query: 53  MTEPPLKPVLQKPPGYKELNL-AAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLC 112
           M+EPP+KPVLQKPPGYK+ N  A Q   RP  RK +LP   + K K++ S+ R CCC  C
Sbjct: 1   MSEPPVKPVLQKPPGYKDPNSPAGQRRFRPPPRKPVLPPSFHPK-KRKTSYGRACCCCFC 60

Query: 113 FFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIK 172
            F LI +++ +  G V YLWF+P+LP FH+Q FRI + NVT + DG+YL+ART  R+E+K
Sbjct: 61  IFFLIFLLLILICGAVFYLWFDPQLPGFHIQSFRISRFNVTKRPDGTYLDARTTTRLEVK 120

Query: 173 NPNSKLSLNYGDIEIKIAAGE-GTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSG 232
           NPN K++  YGD E++I+ GE G +TELG+T + +F    +N  SL++ETI SN+ V   
Sbjct: 121 NPNGKMTYYYGDTEVEISFGEGGYETELGTTTVPAFTMLEKNTRSLRVETIASNKLVVDE 180

Query: 233 AGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSI 292
            G KL +  R+  L VNV A TK+G  V G ++  V + V C G+  K L+ GDMPKC I
Sbjct: 181 VGNKLRARYRSKSLPVNVEARTKVGVGVAGLKIGMVGVTVKCDGMSKKQLDGGDMPKCVI 240

Query: 293 HLRRW 296
           ++ +W
Sbjct: 241 NMLKW 244

BLAST of Cp4.1LG08g01150 vs. TrEMBL
Match: A0A0D2SZI7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G123300 PE=4 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 2.8e-55
Identity = 115/246 (46.75%), Positives = 161/246 (65.45%), Query Frame = 1

Query: 53  MTEPPLKPVLQKPPGYKELNL--AAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFL 112
           M+EPPL PVLQKPPGY++ NL  AAQ   RP  RK +LP   N + ++  SFCR CCC+L
Sbjct: 1   MSEPPLIPVLQKPPGYRDPNLPAAAQSGFRPPPRKPVLPPSFNPR-RRNHSFCRVCCCWL 60

Query: 113 CFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEI 172
           C F+L+L+++AV    V Y+WF+PK PVF ++ FR  + NVT++ DG+YL+A T  R+E+
Sbjct: 61  CIFVLVLVLLAVIGVLVFYIWFDPKFPVFRIRSFRTTRFNVTERPDGTYLDATTTTRLEM 120

Query: 173 KNPNSKLSLNYGDIEIKIAAGE-GTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDS 232
           KNPN K++  YG  E+ ++ GE G +T +G+T +  F    ++  SLK+ET VSN  VD 
Sbjct: 121 KNPNVKITYYYGKTEVGVSVGEGGDETPVGTTAVPGFTMWKQSTMSLKVETKVSNTLVDD 180

Query: 233 GAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCS 292
             G +L S  R   L VNV A TK+G  V G ++  V + V C G+ +K L+ GDMPKC 
Sbjct: 181 WVGKRLRSRYRNKILAVNVEARTKVGVSVTGLKIGKVAVTVKCDGITMKELDGGDMPKCV 240

Query: 293 IHLRRW 296
           I + +W
Sbjct: 241 IDMLKW 245

BLAST of Cp4.1LG08g01150 vs. TAIR10
Match: AT2G46300.1 (AT2G46300.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 183.7 bits (465), Expect = 1.6e-46
Identity = 95/249 (38.15%), Positives = 150/249 (60.24%), Query Frame = 1

Query: 53  MTEPPLKPVLQKPPGYKELNLAAQPVVRPSAR----KLILPSPMNQK-NKKRRSFCRGCC 112
           M +  + PVLQKPPGY++ N+++ P   P  +    +  +P P + +  KKRRS CR CC
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPPPPPIQQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 113 CFLCFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGR 172
           C +C  L++ I + +    V YLWF+PKLP F L  FR+    + D  DG+ L+A  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 173 IEIKNPNSKLSLNYGDIEIKIAAGEGT-QTELGSTILRSFVQSNENKTSLKIETIVSNET 232
           +E+KNPNSKL   YG+  + ++ G G  +T +G T +  F Q  +N TS+K+ET V N+ 
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 233 VDSGAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMP 292
           V+ G   +L +  ++ +L++NV A+TK+G  V G ++  + + + CGGV L  L+  D P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDT-DSP 240

Query: 293 KCSIHLRRW 296
           KC ++  +W
Sbjct: 241 KCILNTLKW 248

BLAST of Cp4.1LG08g01150 vs. TAIR10
Match: AT1G01453.2 (AT1G01453.2 unknown protein)

HSP 1 Score: 156.0 bits (393), Expect = 3.6e-38
Identity = 91/255 (35.69%), Positives = 148/255 (58.04%), Query Frame = 1

Query: 51  ENMTEPPLKPVLQKPPGYKELNLAAQPVVRPSARKLI---LPSPMNQKNKKRR-SFCRGC 110
           E   E PL+P LQKPPG+++     QP   PS    +    P P++  +KKRR SFCR  
Sbjct: 13  EMAAEKPLQPALQKPPGFRDQQ--NQPSAPPSGTATLPRRRPRPIHPADKKRRCSFCRVF 72

Query: 111 CCFLCFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTD-KSDG--SYLNAR 170
           CC +C    +++++ + A  V +LW+ PKLPV  L  F+I   N +D KSD   S+L+A 
Sbjct: 73  CCCVCILFAVILLLILIAVAVFFLWYSPKLPVVRLASFKISNFNFSDGKSDDGWSFLSAD 132

Query: 171 TLGRIEIKNPNSKLSLNYGDIEIKIAAGE-GTQTELGSTILRSFVQSNENKTSLKIETIV 230
           T   ++ +NPN KL+  YGD ++ +  GE   +T L ST ++ F++   N+T++ + T V
Sbjct: 133 TTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIVPTTV 192

Query: 231 SNETVDSGAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNR 290
               VD     +L    ++ +L+V V A+TK+G  V  R++  V + + CGGV L++L+ 
Sbjct: 193 RKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQTLD- 252

Query: 291 GDMPKCSIHLRRWFL 298
             M +C+I + +W++
Sbjct: 253 SKMAQCTIKMLKWYV 264

BLAST of Cp4.1LG08g01150 vs. TAIR10
Match: AT4G01110.1 (AT4G01110.1 unknown protein)

HSP 1 Score: 146.4 bits (368), Expect = 2.9e-35
Identity = 86/254 (33.86%), Positives = 142/254 (55.91%), Query Frame = 1

Query: 55  EPPLKPVLQKPPGYKELNLAAQPVVRPSAR---------KLILPSPMNQKNKKRRSFCRG 114
           E  LKPVLQKPPGY+EL+   Q  +  S+          K  +P+      K++ S CR 
Sbjct: 4   ETLLKPVLQKPPGYRELHSQPQTPLGSSSSSSSMLRRPPKHAIPAAFYPTKKRQWSRCRV 63

Query: 115 CCCFLCFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDG---SYLNA 174
            CC +C  + I+I++ +    V +L++ P+LPV  L  FR+   N +    G   S L A
Sbjct: 64  FCCCVCITVAIVILLLILTVSVFFLYYSPRLPVVRLSSFRVSNFNFSGGKAGDGLSQLTA 123

Query: 175 RTLGRIEIKNPNSKLSLNYGDIEIKIAAGEGT-QTELGSTILRSFVQSNENKTSLKIETI 234
               R++ +NPN KL   YG++++ ++ GE   +T LGST ++ FV+   N+T + +   
Sbjct: 124 EATARLDFRNPNGKLRYYYGNVDVAVSVGEDDFETSLGSTKVKGFVEKPGNRTVVIVPIK 183

Query: 235 VSNETVDSGAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLN 294
           V  + VD     +L +  ++ +L+V V A+TK+G  V  R++  V + ++CGGVRL++L+
Sbjct: 184 VKKQQVDDPTVKRLRADMKSKKLVVKVMAKTKVGLGVGRRKIVTVGVTISCGGVRLQTLD 243

Query: 295 RGDMPKCSIHLRRW 296
              M KC+I + +W
Sbjct: 244 -SKMSKCTIKMLKW 256

BLAST of Cp4.1LG08g01150 vs. TAIR10
Match: AT1G65690.1 (AT1G65690.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 62.4 bits (150), Expect = 5.5e-10
Identity = 50/177 (28.25%), Positives = 80/177 (45.20%), Query Frame = 1

Query: 97  KKRRSFCRGCCCFLCFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSD 156
           KKRRS C  C C+   FLL+L++   A+ G+LYL F+PKLP + +   ++ +  +    D
Sbjct: 56  KKRRSCCCRCFCYTFCFLLLLVVAVGASIGILYLVFKPKLPDYSIDRLQLTRFAL--NQD 115

Query: 157 GSYLNARTLGRIEIKNPNSKLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSL 216
            S   A  +  I  KNPN K+ + Y D   KI      + +L +  L  F Q +EN T +
Sbjct: 116 SSLTTAFNV-TITAKNPNEKIGIYYED-GSKITVWY-MEHQLSNGSLPKFYQGHENTTVI 175

Query: 217 KIETIVSNETVDSGAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCG 274
            +E     +              RTG + + +     +       ++  V+  V CG
Sbjct: 176 YVEMTGQTQNASGLRTTLEEQQQRTGNIPLRIRVNQPVRVKFGKLKLFEVRFLVRCG 227

BLAST of Cp4.1LG08g01150 vs. TAIR10
Match: AT1G17620.1 (AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 62.0 bits (149), Expect = 7.1e-10
Identity = 52/217 (23.96%), Positives = 95/217 (43.78%), Query Frame = 1

Query: 65  PPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLCFFLLILIIVAVAA 124
           P    +L  A +P  RP A +        ++    R  C  CCC+  F +++L+++  AA
Sbjct: 28  PANKAQLYNANRPAYRPPAGR--------RRTSHTRGCCCRCCCWTIFVIILLLLIVAAA 87

Query: 125 GGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIKNPNSKLSLNYGDI 184
             V+YL + P+ P F +   +I  LN T  S      A +L  I  +NPN  +   Y   
Sbjct: 88  SAVVYLIYRPQRPSFTVSELKISTLNFT--SAVRLTTAISLSVI-ARNPNKNVGFIYDVT 147

Query: 185 EI---KIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGAGWKLISGNRT 244
           +I   K + G      +G   + +F    +N T+L+       + +D  +  KL    + 
Sbjct: 148 DITLYKASTGGDDDVVIGKGTIAAFSHGKKNTTTLRSTIGSPPDELDEISAGKLKGDLKA 207

Query: 245 GELI-VNVGAETKIGYMVNGRRMPPVKIEVTCGGVRL 278
            + + + +   +K+   +   + P   I VTC G+++
Sbjct: 208 KKAVAIKIVLNSKVKVKMGALKTPKSGIRVTCEGIKV 233

BLAST of Cp4.1LG08g01150 vs. NCBI nr
Match: gi|659075079|ref|XP_008437954.1| (PREDICTED: protein YLS9-like isoform X1 [Cucumis melo])

HSP 1 Score: 383.3 bits (983), Expect = 4.0e-103
Identity = 192/253 (75.89%), Positives = 217/253 (85.77%), Query Frame = 1

Query: 46  SSLISENMTEP-PLKPVLQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCR 105
           S + SENM EP PLKP+LQKPPG+K+ N  A PV RP ARKLILPSP++QKNKKRRS  R
Sbjct: 62  SPISSENMAEPTPLKPILQKPPGFKDPNHTALPVPRPPARKLILPSPLSQKNKKRRSCWR 121

Query: 106 GCCCFLCFFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNART 165
            CCCF C F+LILI+V +A GGVLYLWFE KLPV HLQ FRI K NVTDKSDGSYLNA+T
Sbjct: 122 RCCCFFCLFVLILIVVILAVGGVLYLWFERKLPVVHLQSFRISKFNVTDKSDGSYLNAKT 181

Query: 166 LGRIEIKNPNSKLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSN 225
           +GRIEIKNPNSKLSLNYGDIE+++AAGEGT+TELGS I+ SF+QS EN TSLKIET+V+N
Sbjct: 182 IGRIEIKNPNSKLSLNYGDIEVQVAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVNN 241

Query: 226 ETVDSGAGWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGD 285
           ETVD GAG  L SGNRTGEL+VNV A TKIG++V GRRMPPVKIEVTCG V LK L+RG+
Sbjct: 242 ETVDDGAGRYLNSGNRTGELVVNVEARTKIGFVVEGRRMPPVKIEVTCGSVSLKRLDRGN 301

Query: 286 MPKCSIHLRRWFL 298
           +PKCSIHLRRWFL
Sbjct: 302 VPKCSIHLRRWFL 314

BLAST of Cp4.1LG08g01150 vs. NCBI nr
Match: gi|778677027|ref|XP_011650715.1| (PREDICTED: uncharacterized protein LOC101214208 [Cucumis sativus])

HSP 1 Score: 377.1 bits (967), Expect = 2.8e-101
Identity = 186/242 (76.86%), Positives = 210/242 (86.78%), Query Frame = 1

Query: 56  PPLKPVLQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLCFFLL 115
           PPLKP+LQKPPG+K+ N  A PV RP ARKLILPSP++QKNKKRRS  R CCCF C  +L
Sbjct: 5   PPLKPILQKPPGFKDPNHIALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFCLLVL 64

Query: 116 ILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIKNPNS 175
           ILI+  +A GGVLYLWFEPKLPV HLQ FRI K NVTDKSDGSYLNA+T+GRIEIKNPNS
Sbjct: 65  ILIVAILAVGGVLYLWFEPKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIKNPNS 124

Query: 176 KLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGAGWKL 235
           KLSLNYGDIE++IAAGEGT+TELGS I+ SF+QS EN TSLKIET+VSNETVD GAG  L
Sbjct: 125 KLSLNYGDIEVQIAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVSNETVDDGAGRNL 184

Query: 236 ISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSIHLRRW 295
            SGNRTGEL+VNV A TKIG++V+GRRMPPVKIEV+CG V LK L+RG++PKCSIHLRRW
Sbjct: 185 NSGNRTGELVVNVEARTKIGFVVDGRRMPPVKIEVSCGSVSLKRLDRGNVPKCSIHLRRW 244

Query: 296 FL 298
           FL
Sbjct: 245 FL 246

BLAST of Cp4.1LG08g01150 vs. NCBI nr
Match: gi|659075081|ref|XP_008437955.1| (PREDICTED: protein YLS9-like isoform X2 [Cucumis melo])

HSP 1 Score: 375.9 bits (964), Expect = 6.3e-101
Identity = 188/246 (76.42%), Positives = 212/246 (86.18%), Query Frame = 1

Query: 53  MTEP-PLKPVLQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLC 112
           M EP PLKP+LQKPPG+K+ N  A PV RP ARKLILPSP++QKNKKRRS  R CCCF C
Sbjct: 1   MAEPTPLKPILQKPPGFKDPNHTALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFC 60

Query: 113 FFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIK 172
            F+LILI+V +A GGVLYLWFE KLPV HLQ FRI K NVTDKSDGSYLNA+T+GRIEIK
Sbjct: 61  LFVLILIVVILAVGGVLYLWFERKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIK 120

Query: 173 NPNSKLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGA 232
           NPNSKLSLNYGDIE+++AAGEGT+TELGS I+ SF+QS EN TSLKIET+V+NETVD GA
Sbjct: 121 NPNSKLSLNYGDIEVQVAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVNNETVDDGA 180

Query: 233 GWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSIH 292
           G  L SGNRTGEL+VNV A TKIG++V GRRMPPVKIEVTCG V LK L+RG++PKCSIH
Sbjct: 181 GRYLNSGNRTGELVVNVEARTKIGFVVEGRRMPPVKIEVTCGSVSLKRLDRGNVPKCSIH 240

Query: 293 LRRWFL 298
           LRRWFL
Sbjct: 241 LRRWFL 246

BLAST of Cp4.1LG08g01150 vs. NCBI nr
Match: gi|703148826|ref|XP_010109444.1| (hypothetical protein L484_003064 [Morus notabilis])

HSP 1 Score: 255.4 bits (651), Expect = 1.2e-64
Identity = 127/246 (51.63%), Positives = 168/246 (68.29%), Query Frame = 1

Query: 53  MTEPPLKPV-LQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLC 112
           M E PLKP  LQKPPGY++     +PV RP  RK +LP+  + + K+RR++CR CCCF+ 
Sbjct: 1   MAEQPLKPPPLQKPPGYRDPAAPGKPVARPPQRKPVLPASFHPR-KRRRNWCRTCCCFVF 60

Query: 113 FFLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIK 172
            FLL+L +    AGG+ YLWFEPKLPVFHLQ  RI + NVT K DG+YL+A T+ RIE+K
Sbjct: 61  VFLLLLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAGTVTRIEVK 120

Query: 173 NPNSKLSLNYGDIEIKIAAGEGTQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGA 232
           NPN KL L YG   ++++ GE    ELG   L  F Q  EN TSLK+ET V N+ VD G 
Sbjct: 121 NPNGKLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVKNQLVDDGL 180

Query: 233 GWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLNRGDMPKCSIH 292
           G +L SG ++ +L+V + A+T +GY+V G ++  V++ V CGGV LK L+ GDMPKCSI 
Sbjct: 181 GKRLKSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSGDMPKCSID 240

Query: 293 LRRWFL 298
           L +W +
Sbjct: 241 LLKWVI 245

BLAST of Cp4.1LG08g01150 vs. NCBI nr
Match: gi|1009142616|ref|XP_015888818.1| (PREDICTED: protein YLS9 [Ziziphus jujuba])

HSP 1 Score: 240.0 bits (611), Expect = 5.4e-60
Identity = 122/245 (49.80%), Positives = 166/245 (67.76%), Query Frame = 1

Query: 53  MTEPPLKPVLQKPPGYKELNLAAQPVVRPSARKLILPSPMNQKNKKRRSFCRGCCCFLCF 112
           M EPP+KP LQKPPGY++ +   +PV RP  RK  LP P  +  +KRRS CR CCCFLCF
Sbjct: 1   MREPPMKPALQKPPGYRDPSAPGKPVARPPPRKPTLP-PSFRTKRKRRSCCRTCCCFLCF 60

Query: 113 FLLILIIVAVAAGGVLYLWFEPKLPVFHLQLFRILKLNVTDKSDGSYLNARTLGRIEIKN 172
           F++IL I+ +  GGV YLWF PK+P FHLQ +RI +  VT K+D +YL+ART+ RIE+KN
Sbjct: 61  FIVILTIIVLVVGGVSYLWFSPKIPTFHLQSYRIPEFKVTVKTDATYLDARTVIRIEVKN 120

Query: 173 PNSKLSLNYGDIEIKIAAGEG-TQTELGSTILRSFVQSNENKTSLKIETIVSNETVDSGA 232
           PN+KL + YG  +I    G+G ++TELG + +  F Q  +N TSLKIE+   N  +D   
Sbjct: 121 PNTKLKVYYGRTQINAIVGKGESETELGQSEVAGFTQGIKNVTSLKIESSTKNRLIDDKD 180

Query: 233 GWKLISGNRTGELIVNVGAETKIGYMVNGRRMPPVKIEVTCGGVRLKSLN-RGDMPKCSI 292
           G KL SG +T  L V V A T +GY+V   R+  +++ V+CGG+  KSL+  G+MPKC++
Sbjct: 181 GRKLKSGYKTKNLEVRVKARTSLGYVVGRWRIGALRVTVSCGGMTFKSLDGGGEMPKCTV 240

Query: 293 HLRRW 296
           +  RW
Sbjct: 241 NFLRW 244

The following BLAST results are available for this feature:
vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0L3Z7_CUCSA  2.0e-101  76.86         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G120370 PE=4 SV=1
W9SAG5_9ROSA      8.7e-65   51.63         Uncharacterized protein OS=Morus notabilis GN=L484_003064 PE=4 SV=1
A0A061DTS6_THECC  5.5e-59   49.80         Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative OS...
A0A0D2QQD3_GOSRA  5.6e-56   48.16         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G105400 PE=4 SV=1
A0A0D2SZI7_GOSRA  2.8e-55   46.75         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G123300 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT2G46300.1  1.6e-46  38.15         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G01453.2  3.6e-38  35.69         unknown protein
AT4G01110.1  2.9e-35  33.86         unknown protein
AT1G65690.1  5.5e-10  28.25         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT1G17620.1  7.1e-10  23.96         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

vs. NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|659075079|ref|XP_008437954.1|   4.0e-103  75.89         PREDICTED: protein YLS9-like isoform X1 [Cucumis melo]
gi|778677027|ref|XP_011650715.1|   2.8e-101  76.86         PREDICTED: uncharacterized protein LOC101214208 [Cucumis sativus]
gi|659075081|ref|XP_008437955.1|   6.3e-101  76.42         PREDICTED: protein YLS9-like isoform X2 [Cucumis melo]
gi|703148826|ref|XP_010109444.1|   1.2e-64   51.63         hypothetical protein L484_003064 [Morus notabilis]
gi|1009142616|ref|XP_015888818.1|  5.4e-60   49.80         PREDICTED: protein YLS9 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0016567      protein ubiquitination
biological_process  GO:0006511      ubiquitin-dependent protein catabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0016874      ligase activity
molecular_function  GO:0004842      ubiquitin-protein transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g01150.1  Cp4.1LG08g01150.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term     Source Description                                            Alignment
None      No IPR available  PANTHER  PTHR31234       FAMILY NOT NAMED                                              coord: 52..292, score: 2.9
None      No IPR available  PANTHER  PTHR31234:SF12  LATE EMBRYOGENESIS ABUNDANT HYDROXYPROLINE-RICH GLYCOPROTEIN  coord: 52..292, score: 2.9

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG08g01150  Cp4.1LG03g16230  Cucurbita pepo (Zucchini)  cpecpeB482
The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG08g01150  Cucumber (Chinese Long) v3    cpecucB1058
Cp4.1LG08g01150  Cucumber (Chinese Long) v3    cpecucB1071
Cp4.1LG08g01150  Wax gourd                     cpewgoB1098
Cp4.1LG08g01150  Cucurbita pepo (Zucchini)     cpecpeB254
Cp4.1LG08g01150  Cucurbita maxima (Rimu)       cmacpeB016
Cp4.1LG08g01150  Cucurbita moschata (Rifu)     cmocpeB796
Cp4.1LG08g01150  Watermelon (Charleston Gray)  cpewcgB787
Cp4.1LG08g01150  Cucumber (Gy14) v2            cgybcpeB704
Cp4.1LG08g01150  Melon (DHL92) v3.6.1          cpemedB929