CmoCh16G004790 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G004790
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: epidermis-specific secreted glycoprotein EP1-like
Location: Cmo_Chr16: 2292896 .. 2294212 (+)
RNA-Seq Expression: CmoCh16G004790
Synteny: CmoCh16G004790
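
The Location entry can be cross-checked against the sequences listed further down. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common annotation convention, but an assumption here); the function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(2292896, 2294212)
print(length)       # 1317
print(length // 3)  # 439 codons, if the whole span is coding
```

Under that assumption the span is 1317 bp, which is 439 codons: consistent with a 438-residue protein followed by one stop codon.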
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGATCGCCATTGTTGACGCCTCTTCTGATCTCTTTCTTTTTCTTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTCGGTGATTTCGCCGTCGAGTACGGCGGCACCTACAGAGTTCTCAGCATCTTCAGATTTCCATTTCAGTTAGCTTTCTACAACACGACGCCGAACGCTTACACGCTTGCTCTTAGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCGCCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAAGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTAGTCGGCCAATCGCTCCGTCTCGGCGGGCCAATGAAGCTAGTAAGCCGCGCATCGGAGGAAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGTCCTATCTCTGTACTACAAAAGCCCTAACTCTCCGAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGACACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCAAGTTGGGAGAACGAATGCCAGTGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCGGGATGGAGCAAGAGCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTTTGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG

mRNA sequence

ATGAGATCGCCATTGTTGACGCCTCTTCTGATCTCTTTCTTTTTCTTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTCGGTGATTTCGCCGTCGAGTACGGCGGCACCTACAGAGTTCTCAGCATCTTCAGATTTCCATTTCAGTTAGCTTTCTACAACACGACGCCGAACGCTTACACGCTTGCTCTTAGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCGCCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAAGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTAGTCGGCCAATCGCTCCGTCTCGGCGGGCCAATGAAGCTAGTAAGCCGCGCATCGGAGGAAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGTCCTATCTCTGTACTACAAAAGCCCTAACTCTCCGAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGACACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCAAGTTGGGAGAACGAATGCCAGTGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCGGGATGGAGCAAGAGCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTTTGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG

Coding sequence (CDS)

ATGAGATCGCCATTGTTGACGCCTCTTCTGATCTCTTTCTTTTTCTTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTCGGTGATTTCGCCGTCGAGTACGGCGGCACCTACAGAGTTCTCAGCATCTTCAGATTTCCATTTCAGTTAGCTTTCTACAACACGACGCCGAACGCTTACACGCTTGCTCTTAGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCGCCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAAGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTAGTCGGCCAATCGCTCCGTCTCGGCGGGCCAATGAAGCTAGTAAGCCGCGCATCGGAGGAAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGTCCTATCTCTGTACTACAAAAGCCCTAACTCTCCGAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGACACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCAAGTTGGGAGAACGAATGCCAGTGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCGGGATGGAGCAAGAGCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTTTGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG

Protein sequence

MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPFQLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFIKTPNK
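
The relationship between the coding sequence and the protein sequence above can be checked by translation. The sketch below assumes the standard genetic code (an assumption; the record does not state which table was used) and uses a hypothetical helper named `translate`. It is an illustration, not the pipeline that produced this annotation.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 18 nt of the CDS shown above.
print(translate("ATGAGATCGCCATTGTTGACGCCT"))  # MRSPLLTP
print(translate("AAAACACCCAATAAGTAG"))        # KTPNK
```

Both fragments agree with the start (MRSPLLTP) and end (KTPNK) of the listed protein sequence.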
Homology
BLAST of CmoCh16G004790 vs. ExPASy Swiss-Prot
Match: Q39688 (Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=1 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 5.4e-120
Identity = 220/379 (58.05%), Positives = 268/379 (70.71%), Query Frame = 0

Query: 6   LTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPFQLAFY 65
           LT  ++ FF     F   LVPANETFKFVNEGE G +  EY G YR L  F  PFQL FY
Sbjct: 7   LTLTILLFFIQRIDFCHTLVPANETFKFVNEGELGQYISEYFGDYRPLDPFTSPFQLCFY 66

Query: 66  NTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQ 125
           N TP A+TLALR+ +RR+ES +RWVWEANRG PV ENAT +   +GNLVLA ++G V WQ
Sbjct: 67  NQTPTAFTLALRMGLRRTESLMRWVWEANRGNPVDENATLTFGPDGNLVLARSNGQVAWQ 126

Query: 126 SNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASE 185
           ++TANKGVVG ++LP+GNMVL+DS GKFLWQSFD+PTDTLLVGQSL++G   KLVSRAS 
Sbjct: 127 TSTANKGVVGLKILPNGNMVLYDSKGKFLWQSFDTPTDTLLVGQSLKMGAVTKLVSRASP 186

Query: 186 EMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKG-VLANITLNAAVDPDQ 245
             NVNGPYSLVME K L LYYK   SPKP+RYYS +    + K   L N+T     + DQ
Sbjct: 187 GENVNGPYSLVMEPKGLHLYYKPTTSPKPIRYYSFSLFTKLNKNESLQNVTFEFENENDQ 246

Query: 246 GFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISF 305
           GFA  L+L Y T ++  G  IL R KYN+TL+FLRL IDGN+++ TYNDKVD+G  E+++
Sbjct: 247 GFAFLLSLKYGTSNSLGGASILNRIKYNTTLSFLRLEIDGNVKIYTYNDKVDYGAWEVTY 306

Query: 306 TLFDR------------DSSWENECQWPERCGQFGLCEDNQCVACPTENG-LAGWSKSCA 365
           TLF +              S  +ECQ P++CG FGLCE++QCV CPT +G +  WSK+C 
Sbjct: 307 TLFLKAPPPLFQVSLAATESESSECQLPKKCGNFGLCEESQCVGCPTSSGPVLAWSKTCE 366

Query: 366 PKKVSSCDPKSFHYYKLVG 371
           P K+SSC PK FHY KL G
Sbjct: 367 PPKLSSCGPKDFHYNKLGG 385

BLAST of CmoCh16G004790 vs. ExPASy Swiss-Prot
Match: Q9ZVA5 (EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.2e-119
Identity = 218/430 (50.70%), Positives = 287/430 (66.74%), Query Frame = 0

Query: 16  LFFSFSL------ALVPANETFKFVNEGEFGDFA-VEYGGTYRVLSIFRFPFQLAFYNTT 75
           LFF+ S+      A VP ++ F+ VNEG + D++ +EY    R    F   F+L FYNTT
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68

Query: 76  PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNT 135
            NAYTLALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG VVWQ+NT
Sbjct: 69  QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128

Query: 136 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMN 195
           ANKGVVG ++L +GNMV++DSNGKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188

Query: 196 VNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFAT 255
            NGPYSLVME K L LYY +  +PKP+ YY       + +  L ++T  A  D D  +  
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQ--LQSMTFQAVEDADTTWGL 248

Query: 256 ELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
            +    D+GS  +    L+RPK+N+TL+FLRL  DGN+R+ +Y+        ++++T F 
Sbjct: 249 HME-GVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308

Query: 316 RDSSWEN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYK 375
            D++  N EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368

Query: 376 LVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
           + G D F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL K  ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428

Query: 436 THLGFIKTPN 438
           + + ++K PN
Sbjct: 429 SLVAYVKAPN 434

BLAST of CmoCh16G004790 vs. ExPASy Swiss-Prot
Match: Q9ZVA4 (EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 3.0e-118
Identity = 210/424 (49.53%), Positives = 283/424 (66.75%), Query Frame = 0

Query: 16  LFFSFSLALVPANETFKFVNEGEFGDFA-VEYGGTYRVLSIFRFPFQLAFYNTTPNAYTL 75
           +F   S A VP ++ F+ VNEG + D++ +EY    R    F   F+L FYNTTPNAYTL
Sbjct: 15  IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74

Query: 76  ALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGVV 135
           ALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75  ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134

Query: 136 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMNVNGPYS 195
           G ++L +GNMV++DS+GKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194

Query: 196 LVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNY 255
           LVME K L LYY +  +PKP+ Y+       + +    ++T  A  D D  +   +    
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQ--FQSMTFQAVEDSDTTWGLVME-GV 254

Query: 256 DTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSSW 315
           D+GS  +    L+RPK+N+TL+F+RL  DGN+R+ +Y+        ++++T F + D+  
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314

Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYKLVGVDH 375
            +EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K+ G D 
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374

Query: 376 FLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
           F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL +  +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434

Query: 436 KTPN 438
           K PN
Sbjct: 435 KAPN 434

BLAST of CmoCh16G004790 vs. ExPASy Swiss-Prot
Match: Q9ZVA2 (EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 3.2e-104
Identity = 203/440 (46.14%), Positives = 266/440 (60.45%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVL-----SIFRFPFQLAFYNTTPNAYT 78
           S  +A VP  + F+ VNEGEFG++  EY  +YR +     S F  PFQL FYNTTP+AY 
Sbjct: 18  SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77

Query: 79  LALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGV 138
           LALR+ +RR ES +RW+W+ANR  PV ENAT SL  NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78  LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMNVNGPY 198
            GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G  KLVSR S+    +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197

Query: 199 SLVMERKVLSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANIT--------LNAAV 258
           S+V+++K L++Y     +P      P   +  T   +V +    N+T        L  A 
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257

Query: 259 DPDQGFATELTLNYDTGSTESGGPI-LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGP 318
            P         L         GG + L +  YN T+++LRLG DG+L+  +Y     +  
Sbjct: 258 QPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATYLK 317

Query: 319 SEISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKV---- 378
            E SF+ F   + +  +C  P  CG +G C+   C ACPT  GL GWS  CAP K     
Sbjct: 318 WEESFSFF--STYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQFC 377

Query: 379 SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWV 435
           S    K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C +
Sbjct: 378 SGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLL 437

BLAST of CmoCh16G004790 vs. ExPASy Swiss-Prot
Match: Q9ZVA1 (EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 3.6e-95
Identity = 192/453 (42.38%), Positives = 259/453 (57.17%), Query Frame = 0

Query: 5   LLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVL-----SIFRFP 64
           L+T L IS      S  +A VP  + F+ +NE  +  +  EY  +YR L     + F  P
Sbjct: 8   LITALAIS----TVSVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIP 67

Query: 65  FQLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEAD 124
           FQL FYNTTP+AY LALR+  RR  S  RW+W+ANR  PV +N+T S   NGNLVLAE +
Sbjct: 68  FQLMFYNTTPSAYVLALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELN 127

Query: 125 GTVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKL 184
           G V WQ+NTANKGV GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G  KL
Sbjct: 128 GQVKWQTNTANKGVTGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKL 187

Query: 185 VSRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDML------------SVR 244
           VSR S+    +GPYS+V++ K L++Y     +P     ++  D              ++ 
Sbjct: 188 VSRTSDMNGSDGPYSMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLT 247

Query: 245 KGVLANITLNAAVDPDQGFATELTLNYDTGSTESGGPI-LTRPKYNSTLTFLRLGIDGNL 304
           +     + L  A  P         L         GG + L +  YN T+++LRLG DG+L
Sbjct: 248 EPSAYELLLEPAPQPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSL 307

Query: 305 RLITYNDKVDWGPSEISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGW 364
           +  +Y     +   E +F  F   + +  +C  P  CG +G C+   CV CPT  GL  W
Sbjct: 308 KAFSYFPAATYLEWEETFAFF--SNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAW 367

Query: 365 SKSCAPKKV----SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCL 424
           S  CAP K     S    K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCL
Sbjct: 368 SDKCAPPKTTQFCSGGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCL 427

Query: 425 GYFYQTKGSLCWVANELKTLIKVANSTHLGFIK 435
           GYFY+ K   C +A  L TLIK AN++ + +IK
Sbjct: 428 GYFYKEKDKKCLLAPLLGTLIKDANTSSVAYIK 454

BLAST of CmoCh16G004790 vs. ExPASy TrEMBL
Match: A0A6J1ETQ8 (epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 GN=LOC111437646 PE=4 SV=1)

HSP 1 Score: 905.2 bits (2338), Expect = 1.1e-259
Identity = 438/438 (100.00%), Positives = 438/438 (100.00%), Query Frame = 0

Query: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60
           MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF
Sbjct: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG
Sbjct: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV
Sbjct: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180

Query: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240
           SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV
Sbjct: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240

Query: 241 DPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300
           DPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS
Sbjct: 241 DPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300

Query: 301 EISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360
           EISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP
Sbjct: 301 EISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360

Query: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420
           KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT
Sbjct: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420

Query: 421 LIKVANSTHLGFIKTPNK 439
           LIKVANSTHLGFIKTPNK
Sbjct: 421 LIKVANSTHLGFIKTPNK 438

BLAST of CmoCh16G004790 vs. ExPASy TrEMBL
Match: A0A6J1JA71 (epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita maxima OX=3661 GN=LOC111482665 PE=4 SV=1)

HSP 1 Score: 881.7 bits (2277), Expect = 1.3e-252
Identity = 424/438 (96.80%), Positives = 430/438 (98.17%), Query Frame = 0

Query: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60
           MRSPLLTP LISFFFLFFSFSLALVPANETFKFVNEGEFG+F +EY GTYR LSIFRFPF
Sbjct: 1   MRSPLLTPFLISFFFLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QL+FYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLS NGNLVLAEADG
Sbjct: 61  QLSFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGP KLV
Sbjct: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLV 180

Query: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240
           SRASE MNVNGPYSLVMERK LSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV
Sbjct: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240

Query: 241 DPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300
           DPDQGFATELTLNY+TGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS
Sbjct: 241 DPDQGFATELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300

Query: 301 EISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360
           EISFTLFDRDS+WENECQWPERCGQFGLCEDNQCVACPTENGLAGWSK+CAPKKVSSCDP
Sbjct: 301 EISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDP 360

Query: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420
           KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT
Sbjct: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420

Query: 421 LIKVANSTHLGFIKTPNK 439
           LIKVANSTHLGFIKTPNK
Sbjct: 421 LIKVANSTHLGFIKTPNK 438

BLAST of CmoCh16G004790 vs. ExPASy TrEMBL
Match: A0A6J1ETZ5 (epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 GN=LOC111437696 PE=4 SV=1)

HSP 1 Score: 855.1 bits (2208), Expect = 1.3e-244
Identity = 415/438 (94.75%), Positives = 422/438 (96.35%), Query Frame = 0

Query: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60
           MRS LLTPLLISFFFLFFSFSLALVPANETFKFVNEG+FG+F +EY GTYR LSIFRFPF
Sbjct: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QLAFYNTTPNAYTLALRISIRRSES IRWVWEANRGRPVRENATFSLSANGNLVLAEADG
Sbjct: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQSNTANKGVVGFELLPSGNMVL DSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV
Sbjct: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180

Query: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240
           SRASE MNVNGPYSLVMERK LSLYYKSPNSPKPMRYYSSTDMLSVRKGVLAN+TL A V
Sbjct: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADV 240

Query: 241 DPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300
           DPDQGFATEL LN DTGS +SGG +LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS
Sbjct: 241 DPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300

Query: 301 EISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360
           EISFTLFDRDS+ ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP
Sbjct: 301 EISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360

Query: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420
           KSFHYYKLVGVDH LTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT
Sbjct: 361 KSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420

Query: 421 LIKVANSTHLGFIKTPNK 439
           LIKVANSTHLGFIKTPNK
Sbjct: 421 LIKVANSTHLGFIKTPNK 438

BLAST of CmoCh16G004790 vs. ExPASy TrEMBL
Match: A0A0A0L0B9 (Bulb-type lectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G082380 PE=4 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 4.1e-211
Identity = 359/442 (81.22%), Positives = 395/442 (89.37%), Query Frame = 0

Query: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60
           MR PLLTPLL+SFFF FFS S A+VP+NETFKFVNEG+FGDFAVEY GTYR LSI   PF
Sbjct: 1   MRPPLLTPLLLSFFF-FFSLSFAIVPSNETFKFVNEGDFGDFAVEYDGTYRPLSISNSPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QL FYNTTPNAYTLALR++I RSESA RWVWEANRGRPVRENAT SL ++GNLVLAEADG
Sbjct: 61  QLMFYNTTPNAYTLALRMAILRSESAKRWVWEANRGRPVRENATLSLGSDGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQ+NTANKGVV  +LLP+GNMVL DSNGKF+WQSFDSPTDTLLVGQSLR+GG  KLV
Sbjct: 121 TVVWQTNTANKGVVKLDLLPNGNMVLLDSNGKFVWQSFDSPTDTLLVGQSLRIGGVTKLV 180

Query: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYS-STDMLSVRKGVLANITLNAA 240
           SRASE++NVNGPYS VMER  +SLYYKSPNSPKPMRY++ S++  +++KG LA +TL A 
Sbjct: 181 SRASEKLNVNGPYSFVMERNAMSLYYKSPNSPKPMRYFAGSSNWFTIQKGSLARVTLRAE 240

Query: 241 VDPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGP 300
           VDPDQGFATELTLNY+   TE+GGPIL+RPKYNSTLTFLRLGIDGNLRL TYNDKVDW P
Sbjct: 241 VDPDQGFATELTLNYEVAGTENGGPILSRPKYNSTLTFLRLGIDGNLRLFTYNDKVDWSP 300

Query: 301 SEISFTLFDRD---SSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVS 360
           SEI+FTLFDR+    + E+ECQWPERCGQFGLCE+NQCVACPTE GL GWSK+C  KKVS
Sbjct: 301 SEITFTLFDREFNTGNTESECQWPERCGQFGLCEENQCVACPTEKGLLGWSKTCMAKKVS 360

Query: 361 SCDPKSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVAN 420
           SCDPKSFHYYK+ GVDHFLTKYNKGEG + QK+CEKKCNLDCKCLGYFYQTKGSLCWVAN
Sbjct: 361 SCDPKSFHYYKVEGVDHFLTKYNKGEG-LRQKDCEKKCNLDCKCLGYFYQTKGSLCWVAN 420

Query: 421 ELKTLIKVANSTHLGFIKTPNK 439
           ELKTLIKV NSTHLGFIKTPNK
Sbjct: 421 ELKTLIKVDNSTHLGFIKTPNK 440

BLAST of CmoCh16G004790 vs. ExPASy TrEMBL
Match: A0A5A7V4R1 (Epidermis-specific secreted glycoprotein EP1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G001550 PE=4 SV=1)

HSP 1 Score: 739.2 bits (1907), Expect = 1.0e-209
Identity = 358/442 (81.00%), Positives = 393/442 (88.91%), Query Frame = 0

Query: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60
           MR PLLTPLL+SFFF F S S A+VP NETFKFVNEG+FGDFAVEY GTYR LSI   PF
Sbjct: 1   MRPPLLTPLLLSFFF-FSSLSFAIVPPNETFKFVNEGDFGDFAVEYDGTYRPLSISNSPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QL FYNTTPNAYTLALR++I RSESA RWVWEANRGRPVRENAT SL ++GNLVLAEADG
Sbjct: 61  QLMFYNTTPNAYTLALRMAILRSESAKRWVWEANRGRPVRENATLSLGSDGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQ+NTANKGVV  +LLP+GNMVL DSNGKF+WQSFDSPTDTLLVGQSLRLGG  KLV
Sbjct: 121 TVVWQTNTANKGVVKLDLLPNGNMVLLDSNGKFVWQSFDSPTDTLLVGQSLRLGGVTKLV 180

Query: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYS-STDMLSVRKGVLANITLNAA 240
           SRASE++NVNGPYS VMERK +SLYYKSPNSPKPMRY++ S++  +++KG LA +TL A 
Sbjct: 181 SRASEKLNVNGPYSFVMERKAVSLYYKSPNSPKPMRYFAGSSNWFTIQKGTLARVTLRAE 240

Query: 241 VDPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGP 300
           VDP QGFATELTLNY+   TE+GGPIL+RPKYNSTLTFLRLGIDGNLRL TYND+VDW P
Sbjct: 241 VDPGQGFATELTLNYEVAGTENGGPILSRPKYNSTLTFLRLGIDGNLRLFTYNDQVDWSP 300

Query: 301 SEISFTLFDRD---SSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVS 360
           SEI+FTLFDR+    + E+ECQWPERCGQFGLCE+NQCVACPTE GL GWSK+C  KKVS
Sbjct: 301 SEITFTLFDREFNTGNTESECQWPERCGQFGLCEENQCVACPTEKGLLGWSKTCMAKKVS 360

Query: 361 SCDPKSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVAN 420
           SCDPKSFHYYKL GVDHFLTK+NKGEG + QK+CEKKCNLDCKCLGYFYQTKGSLCWVAN
Sbjct: 361 SCDPKSFHYYKLEGVDHFLTKFNKGEG-LSQKDCEKKCNLDCKCLGYFYQTKGSLCWVAN 420

Query: 421 ELKTLIKVANSTHLGFIKTPNK 439
           ELKTLIKV NSTHLGFIKTPNK
Sbjct: 421 ELKTLIKVDNSTHLGFIKTPNK 440

BLAST of CmoCh16G004790 vs. TAIR 10
Match: AT1G78860.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 431.4 bits (1108), Expect = 8.6e-121
Identity = 218/430 (50.70%), Positives = 287/430 (66.74%), Query Frame = 0

Query: 16  LFFSFSL------ALVPANETFKFVNEGEFGDFA-VEYGGTYRVLSIFRFPFQLAFYNTT 75
           LFF+ S+      A VP ++ F+ VNEG + D++ +EY    R    F   F+L FYNTT
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68

Query: 76  PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNT 135
            NAYTLALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG VVWQ+NT
Sbjct: 69  QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128

Query: 136 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMN 195
           ANKGVVG ++L +GNMV++DSNGKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188

Query: 196 VNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFAT 255
            NGPYSLVME K L LYY +  +PKP+ YY       + +  L ++T  A  D D  +  
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQ--LQSMTFQAVEDADTTWGL 248

Query: 256 ELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
            +    D+GS  +    L+RPK+N+TL+FLRL  DGN+R+ +Y+        ++++T F 
Sbjct: 249 HME-GVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308

Query: 316 RDSSWEN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYK 375
            D++  N EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368

Query: 376 LVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
           + G D F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL K  ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428

Query: 436 THLGFIKTPN 438
           + + ++K PN
Sbjct: 429 SLVAYVKAPN 434

BLAST of CmoCh16G004790 vs. TAIR 10
Match: AT1G78850.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 426.8 bits (1096), Expect = 2.1e-119
Identity = 210/424 (49.53%), Positives = 283/424 (66.75%), Query Frame = 0

Query: 16  LFFSFSLALVPANETFKFVNEGEFGDFA-VEYGGTYRVLSIFRFPFQLAFYNTTPNAYTL 75
           +F   S A VP ++ F+ VNEG + D++ +EY    R    F   F+L FYNTTPNAYTL
Sbjct: 15  IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74

Query: 76  ALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGVV 135
           ALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75  ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134

Query: 136 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMNVNGPYS 195
           G ++L +GNMV++DS+GKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194

Query: 196 LVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNY 255
           LVME K L LYY +  +PKP+ Y+       + +    ++T  A  D D  +   +    
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQ--FQSMTFQAVEDSDTTWGLVME-GV 254

Query: 256 DTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSSW 315
           D+GS  +    L+RPK+N+TL+F+RL  DGN+R+ +Y+        ++++T F + D+  
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314

Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYKLVGVDH 375
            +EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K+ G D 
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374

Query: 376 FLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
           F+TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL +  +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434

Query: 436 KTPN 438
           K PN
Sbjct: 435 KAPN 434

BLAST of CmoCh16G004790 vs. TAIR 10
Match: AT1G16905.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 393.3 bits (1009), Expect = 2.6e-109
Identity = 210/433 (48.50%), Positives = 276/433 (63.74%), Query Frame = 0

Query: 9   LLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPFQLAFYNTT 68
           L++   FL  S     VP  E F+F+N G+FG+  VEYG +YR L + R  F+L F+NTT
Sbjct: 8   LILLSLFLLISLVRPQVPPMEQFRFLNNGDFGESTVEYGASYRDLGVIRNQFRLCFFNTT 67

Query: 69  PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNT 128
           PNA+TLA+ +    S+S IRWVW+AN  +PV+E A+ S    GNLVLA+ DG VVWQ+ T
Sbjct: 68  PNAFTLAIGMGTGSSDSIIRWVWQANPQKPVQEEASLSFGPEGNLVLAQPDGRVVWQTMT 127

Query: 129 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRL-GGPMKLVSRASEEM 188
            NKGV+G  +  +GN+VLFD  G  +WQSF+ PTDTLLVGQSL L G   KLVSR     
Sbjct: 128 ENKGVIGLTMNENGNLVLFDDGGWPVWQSFEFPTDTLLVGQSLTLDGSKNKLVSRN---- 187

Query: 189 NVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFA 248
             NG YSL++E   L L    P S      Y       +    + + TL +A   DQG  
Sbjct: 188 --NGSYSLILEPDRLVLNRLIPRSNNKSLVYH-----IIEGRFIPSATLYSA--KDQGTT 247

Query: 249 TELTLNYDTGSTESGGP---ILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISF 308
           T+L L   T       P    L RP++N++ +FLRL  DGNLR+ +++ KV +   E++F
Sbjct: 248 TQLGL--ATPGLRPEFPYKHFLARPRFNASQSFLRLDADGNLRIYSFDSKVTFLAWEVTF 307

Query: 309 TLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFH 368
            LF+ D++  NEC  P +CG FG+CEDNQCVACP   GL GWSK+C PKKV SCDPKSFH
Sbjct: 308 ELFNHDNN--NECWLPSKCGAFGICEDNQCVACPLGVGLMGWSKACKPKKVKSCDPKSFH 367

Query: 369 YYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKV 428
           YY+L GV+HF+TKYN G   +G+ +C   C+ DCKCLGYF+      CW++ EL TL+KV
Sbjct: 368 YYRLGGVEHFMTKYNVGLA-LGESKCRGLCSGDCKCLGYFFDKSSFKCWISYELGTLVKV 422

Query: 429 ANSTHLGFIKTPN 438
           ++S  + +IKTPN
Sbjct: 428 SDSRKVAYIKTPN 422

BLAST of CmoCh16G004790 vs. TAIR 10
Match: AT1G78830.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 380.2 bits (975), Expect = 2.3e-105
Identity = 203/440 (46.14%), Positives = 266/440 (60.45%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVL-----SIFRFPFQLAFYNTTPNAYT 78
           S  +A VP  + F+ VNEGEFG++  EY  +YR +     S F  PFQL FYNTTP+AY 
Sbjct: 18  SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77

Query: 79  LALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGV 138
           LALR+ +RR ES +RW+W+ANR  PV ENAT SL  NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78  LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEEMNVNGPY 198
            GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G  KLVSR S+    +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197

Query: 199 SLVMERKVLSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANIT--------LNAAV 258
           S+V+++K L++Y     +P      P   +  T   +V +    N+T        L  A 
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257

Query: 259 DPDQGFATELTLNYDTGSTESGGPI-LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGP 318
            P         L         GG + L +  YN T+++LRLG DG+L+  +Y     +  
Sbjct: 258 QPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATYLK 317

Query: 319 SEISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKV---- 378
            E SF+ F   + +  +C  P  CG +G C+   C ACPT  GL GWS  CAP K     
Sbjct: 318 WEESFSFF--STYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQFC 377

Query: 379 SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWV 435
           S    K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C +
Sbjct: 378 SGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLL 437

BLAST of CmoCh16G004790 vs. TAIR 10
Match: AT1G78820.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 350.1 bits (897), Expect = 2.5e-96
Identity = 192/453 (42.38%), Positives = 259/453 (57.17%), Query Frame = 0

Query: 5   LLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVL-----SIFRFP 64
           L+T L IS      S  +A VP  + F+ +NE  +  +  EY  +YR L     + F  P
Sbjct: 8   LITALAIS----TVSVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIP 67

Query: 65  FQLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEAD 124
           FQL FYNTTP+AY LALR+  RR  S  RW+W+ANR  PV +N+T S   NGNLVLAE +
Sbjct: 68  FQLMFYNTTPSAYVLALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELN 127

Query: 125 GTVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKL 184
           G V WQ+NTANKGV GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G  KL
Sbjct: 128 GQVKWQTNTANKGVTGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKL 187

Query: 185 VSRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDML------------SVR 244
           VSR S+    +GPYS+V++ K L++Y     +P     ++  D              ++ 
Sbjct: 188 VSRTSDMNGSDGPYSMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLT 247

Query: 245 KGVLANITLNAAVDPDQGFATELTLNYDTGSTESGGPI-LTRPKYNSTLTFLRLGIDGNL 304
           +     + L  A  P         L         GG + L +  YN T+++LRLG DG+L
Sbjct: 248 EPSAYELLLEPAPQPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSL 307

Query: 305 RLITYNDKVDWGPSEISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGW 364
           +  +Y     +   E +F  F   + +  +C  P  CG +G C+   CV CPT  GL  W
Sbjct: 308 KAFSYFPAATYLEWEETFAFF--SNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAW 367

Query: 365 SKSCAPKKV----SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCL 424
           S  CAP K     S    K+ +YYK+VGV+HF   Y N G+GP    +C+ KC+ DCKCL
Sbjct: 368 SDKCAPPKTTQFCSGGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCL 427

Query: 425 GYFYQTKGSLCWVANELKTLIKVANSTHLGFIK 435
           GYFY+ K   C +A  L TLIK AN++ + +IK
Sbjct: 428 GYFYKEKDKKCLLAPLLGTLIKDANTSSVAYIK 454
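
The identity and positive-match percentages in the HSP headers above are ratios of matching positions to alignment length. As a minimal sketch, assuming the header lines keep the `Label = n/m (p%)` layout shown above, the percentages can be re-derived like this. The names `parse_hsp_fractions` and `percent` are hypothetical helpers written for illustration, not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 203/440 (46.14%)".
# Fields without an n/m fraction (e.g. "Query Frame = 0") are ignored.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_fractions(header: str) -> dict:
    """Return {label: (matches, length, reported_percent)} for one HSP header line."""
    return {
        label: (int(n), int(m), float(pct))
        for label, n, m, pct in FRACTION.findall(header)
    }


def percent(matches: int, length: int) -> float:
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)


# Header text copied from the AT1G78830.1 HSP above (illustrative input only).
header = "Identity = 203/440 (46.14%), Positives = 266/440 (60.45%), Query Frame = 0"
for label, (n, m, reported) in parse_hsp_fractions(header).items():
    print(label, percent(n, m), reported)
```

Comparing the recomputed value with the reported one is a quick sanity check when alignment summaries are copied between tools or tables.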

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
Q39688       5.4e-120  58.05     Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=...
Q9ZVA5       1.2e-119  50.70     EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1
Q9ZVA4       3.0e-118  49.53     EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1
Q9ZVA2       3.2e-104  46.14     EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1
Q9ZVA1       3.6e-95   42.38     EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1

Match Name   E-value   Identity  Description
A0A6J1ETQ8   1.1e-259  100.00    epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 ...
A0A6J1JA71   1.3e-252  96.80     epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita maxima OX=3661 GN...
A0A6J1ETZ5   1.3e-244  94.75     epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 ...
A0A0A0L0B9   4.1e-211  81.22     Bulb-type lectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G0...
A0A5A7V4R1   1.0e-209  81.00     Epidermis-specific secreted glycoprotein EP1-like OS=Cucumis melo var. makuwa OX...

Match Name   E-value   Identity  Description
AT1G78860.1  8.6e-121  50.70     D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78850.1  2.1e-119  49.53     D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G16905.1  2.6e-109  48.50     Curculin-like (mannose-binding) lectin family protein
AT1G78830.1  2.3e-105  46.14     Curculin-like (mannose-binding) lectin family protein
AT1G78820.1  2.5e-96   42.38     D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source       Source Term     Source Description                           Alignment
IPR001480  Bulb-type lectin domain              SMART        SM00108         blect_4                                      coord: 44..161; e-value: 5.0E-33; score: 125.7
IPR001480  Bulb-type lectin domain              PFAM         PF01453         B_lectin                                     coord: 90..173; e-value: 2.1E-18; score: 66.7
IPR001480  Bulb-type lectin domain              PROSITE      PS50927         BULB_LECTIN                                  coord: 41..159; score: 14.320436
IPR001480  Bulb-type lectin domain              CDD          cd00028         B_lectin                                     coord: 44..161; e-value: 1.94756E-32; score: 117.028
IPR035446  S-locus-specific glycoprotein/EP1    PIRSF        PIRSF002686     SLG                                          coord: 1..438; e-value: 1.0E-148; score: 493.6
IPR036426  Bulb-type lectin domain superfamily  GENE3D       2.90.10.10      -                                            coord: 59..159; e-value: 9.2E-16; score: 59.9
IPR036426  Bulb-type lectin domain superfamily  SUPERFAMILY  51110           alpha-D-mannose-specific plant lectins       coord: 70..208
None       No IPR available                     PANTHER      PTHR32444:SF64  SECRETED GLYCOPROTEIN EP1, PUTATIVE-RELATED  coord: 13..438
None       No IPR available                     PANTHER      PTHR32444       FAMILY NOT NAMED                             coord: 13..438
None       No IPR available                     CDD          cd01098         PAN_AP_plant                                 coord: 356..436; e-value: 3.40436E-10; score: 54.3646

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G004790.1  CmoCh16G004790.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0110165      cellular anatomical entity