CmaCh16G004430 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G004430
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: epidermis-specific secreted glycoprotein EP1-like
Location: Cma_Chr16: 2211987 .. 2213303 (+)
RNA-Seq Expression: CmaCh16G004430
Synteny: CmaCh16G004430
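As a quick consistency check on the Location field, the sketch below computes the feature length from the 1-based, inclusive coordinates and compares it with the length implied by the protein. The function name feature_length is hypothetical. The 438-residue protein length is taken from the InterPro alignment coordinates (1..438) on this page, and the comparison assumes the gene is a single coding exon with no UTRs annotated.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates from the Location field: Cma_Chr16: 2211987 .. 2213303 (+)
gene_len = feature_length(2211987, 2213303)

# Assumption: 438 residues (InterPro coord 1..438) plus one stop codon,
# three nucleotides per codon.
expected_cds_len = (438 + 1) * 3

print(gene_len, expected_cds_len)  # prints "1317 1317"
```

If the two numbers agree, the genomic span is fully accounted for by the coding sequence, which is consistent with an intronless gene model.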
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGATCGCCATTGTTGACGCCTTTTCTGATCTCTTTCTTTTTCCTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTTGGTGAATTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTATCATTCTACAACACGACGCCGAACGCTTACACGCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCACCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAGGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTTGTCGGCCAATCGCTCCGTCTCGGCGGGCCAACGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTTTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGAGACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCGACTTGGGAGAACGAATGCCAATGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCAGGATGGAGCAAGACCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTATGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG

mRNA sequence

ATGAGATCGCCATTGTTGACGCCTTTTCTGATCTCTTTCTTTTTCCTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTTGGTGAATTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTATCATTCTACAACACGACGCCGAACGCTTACACGCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCACCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAGGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTTGTCGGCCAATCGCTCCGTCTCGGCGGGCCAACGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTTTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGAGACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCGACTTGGGAGAACGAATGCCAATGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCAGGATGGAGCAAGACCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTATGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG

Coding sequence (CDS)

ATGAGATCGCCATTGTTGACGCCTTTTCTGATCTCTTTCTTTTTCCTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTTGGTGAATTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTATCATTCTACAACACGACGCCGAACGCTTACACGCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCACCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAGGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTTGTCGGCCAATCGCTCCGTCTCGGCGGGCCAACGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTTTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGAGACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCGACTTGGGAGAACGAATGCCAATGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCAGGATGGAGCAAGACCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTATGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG

Protein sequence

MRSPLLTPFLISFFFLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPFQLSFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFIKTPNK
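
The protein sequence is the conceptual translation of the coding sequence. The sketch below illustrates this with the standard genetic code on two short fragments copied from the start and the end of the CDS above. The names CODON_TABLE and translate are hypothetical, and the standard code (rather than an organelle-specific table) is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 12 nt of the CDS shown above.
print(translate("ATGAGATCGCCATTGTTG"))  # MRSPLL
print(translate("CCCAATAAGTAG"))        # PNK
```

The two outputs match the first six and last three residues of the protein sequence listed above.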
Homology
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match: Q39688 (Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=1 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 2.0e-122
Identity = 223/379 (58.84%), Positives = 272/379 (71.77%), Query Frame = 0

Query: 6   LTPFLISFFFLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPFQLSFY 65
           LT  ++ FF     F   LVPANETFKFVNEGE G++I EY G YRPL  F  PFQL FY
Sbjct: 7   LTLTILLFFIQRIDFCHTLVPANETFKFVNEGELGQYISEYFGDYRPLDPFTSPFQLCFY 66

Query: 66  NTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQ 125
           N TP A+TLALR+ +RR+ES +RWVWEANRG PV ENAT +   +GNLVLA ++G V WQ
Sbjct: 67  NQTPTAFTLALRMGLRRTESLMRWVWEANRGNPVDENATLTFGPDGNLVLARSNGQVAWQ 126

Query: 126 SNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASE 185
           ++TANKGVVG ++LP+GNMVL+DS GKFLWQSFD+PTDTLLVGQSL++G  TKLVSRAS 
Sbjct: 127 TSTANKGVVGLKILPNGNMVLYDSKGKFLWQSFDTPTDTLLVGQSLKMGAVTKLVSRASP 186

Query: 186 VMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKG-VLANITLNAAVDPDQ 245
             NVNGPYSLVME K L LYYK   SPKP+RYYS +    + K   L N+T     + DQ
Sbjct: 187 GENVNGPYSLVMEPKGLHLYYKPTTSPKPIRYYSFSLFTKLNKNESLQNVTFEFENENDQ 246

Query: 246 GFATELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISF 305
           GFA  L+L Y T ++  G  IL R KYN+TL+FLRL IDGN+++ TYNDKVD+G  E+++
Sbjct: 247 GFAFLLSLKYGTSNSLGGASILNRIKYNTTLSFLRLEIDGNVKIYTYNDKVDYGAWEVTY 306

Query: 306 TLFDR------------DSTWENECQWPERCGQFGLCEDNQCVACPTENG-LAGWSKTCA 365
           TLF +              +  +ECQ P++CG FGLCE++QCV CPT +G +  WSKTC 
Sbjct: 307 TLFLKAPPPLFQVSLAATESESSECQLPKKCGNFGLCEESQCVGCPTSSGPVLAWSKTCE 366

Query: 366 PKKVSSCDPKSFHYYKLVG 371
           P K+SSC PK FHY KL G
Sbjct: 367 PPKLSSCGPKDFHYNKLGG 385
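
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the Q39688 hit above (the function name percent is hypothetical):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts from the HSP header: Identity = 223/379, Positives = 272/379
print(percent(223, 379))  # 58.84
print(percent(272, 379))  # 71.77
```

The denominator is the number of alignment columns including gaps, so it can exceed the length of the aligned query region.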

BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match: Q9ZVA5 (EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.2e-119
Identity = 219/430 (50.93%), Positives = 287/430 (66.74%), Query Frame = 0

Query: 16  LFFSFSL------ALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTT 75
           LFF+ S+      A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTT
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68

Query: 76  PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNT 135
            NAYTLALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG VVWQ+NT
Sbjct: 69  QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128

Query: 136 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMN 195
           ANKGVVG ++L +GNMV++DSNGKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188

Query: 196 VNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFAT 255
            NGPYSLVME K L LYY +  +PKP+ YY       + +  L ++T  A  D D  +  
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQ--LQSMTFQAVEDADTTWGL 248

Query: 256 ELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
            +    ++GS  +    L+RPK+N+TL+FLRL  DGN+R+ +Y+        ++++T F 
Sbjct: 249 HME-GVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308

Query: 316 RDSTWEN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYK 375
            D+T  N EC+ PE C  FGLC+  QC ACP++ GL GW +TC    ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368

Query: 376 LVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
           + G D F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL K  ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428

Query: 436 THLGFIKTPN 438
           + + ++K PN
Sbjct: 429 SLVAYVKAPN 434

BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match: Q9ZVA4 (EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 3.9e-118
Identity = 211/424 (49.76%), Positives = 284/424 (66.98%), Query Frame = 0

Query: 16  LFFSFSLALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTTPNAYTL 75
           +F   S A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTTPNAYTL
Sbjct: 15  IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74

Query: 76  ALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVV 135
           ALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75  ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134

Query: 136 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPYS 195
           G ++L +GNMV++DS+GKF+WQSFDSPTDTLLVGQSL+L G TKLVSR S  +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194

Query: 196 LVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNY 255
           LVME K L LYY +  +PKP+ Y+       + +    ++T  A  D D  +   +    
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQ--FQSMTFQAVEDSDTTWGLVME-GV 254

Query: 256 ETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSTW 315
           ++GS  +    L+RPK+N+TL+F+RL  DGN+R+ +Y+        ++++T F + D+  
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314

Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVGVDH 375
            +EC+ PE C  FGLC+  QC ACP++ GL GW +TC    ++SCDPK+FHY+K+ G D 
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374

Query: 376 FLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
           F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL +  +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434

Query: 436 KTPN 438
           K PN
Sbjct: 435 KAPN 434

BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match: Q9ZVA2 (EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 7.6e-106
Identity = 209/440 (47.50%), Positives = 273/440 (62.05%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
           S  +A VP  + F+ VNEGEFGE+I EYD +YR +     S F  PFQL FYNTTP+AY 
Sbjct: 18  SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77

Query: 79  LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
           LALR+ +RR ES +RW+W+ANR  PV ENAT SL  NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78  LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
            GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G  KLVSR S+    +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197

Query: 199 SLVMERKALSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANITLNAA----VDPDQ 258
           S+V+++K L++Y     +P      P   +  T   +V +    N+T  +A    ++P  
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257

Query: 259 GFATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGP 318
             AT    N    +     SGG  L   K  YN T+++LRLG DG+L+  +Y     +  
Sbjct: 258 QPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATYLK 317

Query: 319 SEISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV---- 378
            E SF+ F   + +  +C  P  CG +G C+   C ACPT  GL GWS  CAP K     
Sbjct: 318 WEESFSFF--STYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQFC 377

Query: 379 SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWV 435
           S    K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C +
Sbjct: 378 SGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLL 437

BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match: Q9ZVA1 (EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 1.3e-97
Identity = 194/439 (44.19%), Positives = 261/439 (59.45%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
           S  +A VP  + F+ +NE  +  +I EYD +YR L     + F  PFQL FYNTTP+AY 
Sbjct: 18  SVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIPFQLMFYNTTPSAYV 77

Query: 79  LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
           LALR+  RR  S  RW+W+ANR  PV +N+T S   NGNLVLAE +G V WQ+NTANKGV
Sbjct: 78  LALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELNGQVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
            GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G  KLVSR S++   +GPY
Sbjct: 138 TGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKLVSRTSDMNGSDGPY 197

Query: 199 SLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLA----NITLNAA----VDPDQG 258
           S+V++ K L++Y     +P     ++  D        +     N+T  +A    ++P   
Sbjct: 198 SMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLTEPSAYELLLEPAPQ 257

Query: 259 FATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGPS 318
            AT    N    +     SGG  L   K  YN T+++LRLG DG+L+  +Y     +   
Sbjct: 258 PATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAFSYFPAATYLEW 317

Query: 319 EISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV----S 378
           E +F  F   + +  +C  P  CG +G C+   CV CPT  GL  WS  CAP K     S
Sbjct: 318 EETFAFF--SNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAWSDKCAPPKTTQFCS 377

Query: 379 SCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVA 435
               K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C +A
Sbjct: 378 GGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLLA 437

BLAST of CmaCh16G004430 vs. TAIR 10
Match: AT1G78860.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 431.4 bits (1108), Expect = 8.6e-121
Identity = 219/430 (50.93%), Positives = 287/430 (66.74%), Query Frame = 0

Query: 16  LFFSFSL------ALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTT 75
           LFF+ S+      A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTT
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68

Query: 76  PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNT 135
            NAYTLALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG VVWQ+NT
Sbjct: 69  QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128

Query: 136 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMN 195
           ANKGVVG ++L +GNMV++DSNGKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188

Query: 196 VNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFAT 255
            NGPYSLVME K L LYY +  +PKP+ YY       + +  L ++T  A  D D  +  
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQ--LQSMTFQAVEDADTTWGL 248

Query: 256 ELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
            +    ++GS  +    L+RPK+N+TL+FLRL  DGN+R+ +Y+        ++++T F 
Sbjct: 249 HME-GVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308

Query: 316 RDSTWEN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYK 375
            D+T  N EC+ PE C  FGLC+  QC ACP++ GL GW +TC    ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368

Query: 376 LVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
           + G D F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL K  ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428

Query: 436 THLGFIKTPN 438
           + + ++K PN
Sbjct: 429 SLVAYVKAPN 434

BLAST of CmaCh16G004430 vs. TAIR 10
Match: AT1G78850.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 426.4 bits (1095), Expect = 2.8e-119
Identity = 211/424 (49.76%), Positives = 284/424 (66.98%), Query Frame = 0

Query: 16  LFFSFSLALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTTPNAYTL 75
           +F   S A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTTPNAYTL
Sbjct: 15  IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74

Query: 76  ALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVV 135
           ALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75  ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134

Query: 136 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPYS 195
           G ++L +GNMV++DS+GKF+WQSFDSPTDTLLVGQSL+L G TKLVSR S  +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194

Query: 196 LVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNY 255
           LVME K L LYY +  +PKP+ Y+       + +    ++T  A  D D  +   +    
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQ--FQSMTFQAVEDSDTTWGLVME-GV 254

Query: 256 ETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSTW 315
           ++GS  +    L+RPK+N+TL+F+RL  DGN+R+ +Y+        ++++T F + D+  
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314

Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVGVDH 375
            +EC+ PE C  FGLC+  QC ACP++ GL GW +TC    ++SCDPK+FHY+K+ G D 
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374

Query: 376 FLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
           F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL +  +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434

Query: 436 KTPN 438
           K PN
Sbjct: 435 KAPN 434

BLAST of CmaCh16G004430 vs. TAIR 10
Match: AT1G16905.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 392.9 bits (1008), Expect = 3.4e-109
Identity = 208/427 (48.71%), Positives = 270/427 (63.23%), Query Frame = 0

Query: 15  FLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPFQLSFYNTTPNAYTL 74
           FL  S     VP  E F+F+N G+FGE  +EY  +YR L + R  F+L F+NTTPNA+TL
Sbjct: 14  FLLISLVRPQVPPMEQFRFLNNGDFGESTVEYGASYRDLGVIRNQFRLCFFNTTPNAFTL 73

Query: 75  ALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVV 134
           A+ +    S+S IRWVW+AN  +PV+E A+ S    GNLVLA+ DG VVWQ+ T NKGV+
Sbjct: 74  AIGMGTGSSDSIIRWVWQANPQKPVQEEASLSFGPEGNLVLAQPDGRVVWQTMTENKGVI 133

Query: 135 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRL-GGPTKLVSRASEVMNVNGPY 194
           G  +  +GN+VLFD  G  +WQSF+ PTDTLLVGQSL L G   KLVSR       NG Y
Sbjct: 134 GLTMNENGNLVLFDDGGWPVWQSFEFPTDTLLVGQSLTLDGSKNKLVSRN------NGSY 193

Query: 195 SLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLN 254
           SL++E   L L    P S      Y       +    + + TL +A   DQG  T+L L 
Sbjct: 194 SLILEPDRLVLNRLIPRSNNKSLVYH-----IIEGRFIPSATLYSA--KDQGTTTQLGL- 253

Query: 255 YETGSTESGGP---ILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFDRD 314
             T       P    L RP++N++ +FLRL  DGNLR+ +++ KV +   E++F LF+ D
Sbjct: 254 -ATPGLRPEFPYKHFLARPRFNASQSFLRLDADGNLRIYSFDSKVTFLAWEVTFELFNHD 313

Query: 315 STWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVG 374
           +   NEC  P +CG FG+CEDNQCVACP   GL GWSK C PKKV SCDPKSFHYY+L G
Sbjct: 314 N--NNECWLPSKCGAFGICEDNQCVACPLGVGLMGWSKACKPKKVKSCDPKSFHYYRLGG 373

Query: 375 VDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHL 434
           V+HF+TKYN G   +G+ +C   C+ DCKCLGYF+      CW++ EL TL+KV++S  +
Sbjct: 374 VEHFMTKYNVGLA-LGESKCRGLCSGDCKCLGYFFDKSSFKCWISYELGTLVKVSDSRKV 422

Query: 435 GFIKTPN 438
            +IKTPN
Sbjct: 434 AYIKTPN 422

BLAST of CmaCh16G004430 vs. TAIR 10
Match: AT1G78830.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 385.6 bits (989), Expect = 5.4e-107
Identity = 209/440 (47.50%), Positives = 273/440 (62.05%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
           S  +A VP  + F+ VNEGEFGE+I EYD +YR +     S F  PFQL FYNTTP+AY 
Sbjct: 18  SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77

Query: 79  LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
           LALR+ +RR ES +RW+W+ANR  PV ENAT SL  NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78  LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
            GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G  KLVSR S+    +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197

Query: 199 SLVMERKALSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANITLNAA----VDPDQ 258
           S+V+++K L++Y     +P      P   +  T   +V +    N+T  +A    ++P  
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257

Query: 259 GFATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGP 318
             AT    N    +     SGG  L   K  YN T+++LRLG DG+L+  +Y     +  
Sbjct: 258 QPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATYLK 317

Query: 319 SEISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV---- 378
            E SF+ F   + +  +C  P  CG +G C+   C ACPT  GL GWS  CAP K     
Sbjct: 318 WEESFSFF--STYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQFC 377

Query: 379 SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWV 435
           S    K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C +
Sbjct: 378 SGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLL 437

BLAST of CmaCh16G004430 vs. TAIR 10
Match: AT1G78820.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 358.2 bits (918), Expect = 9.3e-99
Identity = 194/439 (44.19%), Positives = 261/439 (59.45%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
           S  +A VP  + F+ +NE  +  +I EYD +YR L     + F  PFQL FYNTTP+AY 
Sbjct: 18  SVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIPFQLMFYNTTPSAYV 77

Query: 79  LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
           LALR+  RR  S  RW+W+ANR  PV +N+T S   NGNLVLAE +G V WQ+NTANKGV
Sbjct: 78  LALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELNGQVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
            GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G  KLVSR S++   +GPY
Sbjct: 138 TGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKLVSRTSDMNGSDGPY 197

Query: 199 SLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLA----NITLNAA----VDPDQG 258
           S+V++ K L++Y     +P     ++  D        +     N+T  +A    ++P   
Sbjct: 198 SMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLTEPSAYELLLEPAPQ 257

Query: 259 FATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGPS 318
            AT    N    +     SGG  L   K  YN T+++LRLG DG+L+  +Y     +   
Sbjct: 258 PATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAFSYFPAATYLEW 317

Query: 319 EISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV----S 378
           E +F  F   + +  +C  P  CG +G C+   CV CPT  GL  WS  CAP K     S
Sbjct: 318 EETFAFF--SNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAWSDKCAPPKTTQFCS 377

Query: 379 SCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVA 435
               K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C +A
Sbjct: 378 GGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLLA 437

The following BLAST results are available for this feature:
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match Name   E-value    Identity (%)  Description
Q39688       2.0e-122   58.84         Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=...
Q9ZVA5       1.2e-119   50.93         EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1
Q9ZVA4       3.9e-118   49.76         EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1
Q9ZVA2       7.6e-106   47.50         EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1
Q9ZVA1       1.3e-97    44.19         EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1

BLAST of CmaCh16G004430 vs. TAIR 10
Match Name   E-value    Identity (%)  Description
AT1G78860.1  8.6e-121   50.93         D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78850.1  2.8e-119   49.76         D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G16905.1  3.4e-109   48.71         Curculin-like (mannose-binding) lectin family protein
AT1G78830.1  5.4e-107   47.50         Curculin-like (mannose-binding) lectin family protein
AT1G78820.1  9.3e-99    44.19         D-mannose binding lectin protein with Apple-like carbohydrate-binding domain

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source       Source Term     Source Description                           Alignment
IPR001480  Bulb-type lectin domain              SMART        SM00108         blect_4                                      coord: 44..161; e-value: 4.2E-32; score: 122.6
IPR001480  Bulb-type lectin domain              PFAM         PF01453         B_lectin                                     coord: 90..174; e-value: 1.1E-18; score: 67.6
IPR001480  Bulb-type lectin domain              PROSITE      PS50927         BULB_LECTIN                                  coord: 41..159; score: 14.320436
IPR001480  Bulb-type lectin domain              CDD          cd00028         B_lectin                                     coord: 45..161; e-value: 6.79137E-33; score: 118.569
IPR036426  Bulb-type lectin domain superfamily  GENE3D       2.90.10.10      -                                            coord: 55..160; e-value: 5.3E-16; score: 60.7
IPR036426  Bulb-type lectin domain superfamily  SUPERFAMILY  51110           alpha-D-mannose-specific plant lectins       coord: 70..208
IPR035446  S-locus-specific glycoprotein/EP1    PIRSF        PIRSF002686     SLG                                          coord: 1..438; e-value: 3.0E-149; score: 495.3
None       No IPR available                     PANTHER      PTHR32444:SF64  SECRETED GLYCOPROTEIN EP1, PUTATIVE-RELATED  coord: 13..438
None       No IPR available                     PANTHER      PTHR32444       FAMILY NOT NAMED                             coord: 13..438
None       No IPR available                     CDD          cd01098         PAN_AP_plant                                 coord: 356..436; e-value: 2.91187E-10; score: 54.7498

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh16G004430.1  CmaCh16G004430.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0110165      cellular anatomical entity