CmoCh16G004780 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G004780
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: epidermis-specific secreted glycoprotein EP1-like
Location: Cmo_Chr16: 2288991 .. 2290307 (+)
RNA-Seq Expression: CmoCh16G004780
Synteny: CmoCh16G004780
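The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (a common genome-browser convention, not stated on this page) and takes the protein length of 438 residues from the 438/438 self-hit reported in the Homology section. The helper name feature_length is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates from the Location field: Cmo_Chr16: 2288991 .. 2290307 (+)
gene_length = feature_length(2288991, 2290307)

# Assumption: protein length taken from the 438/438 self-hit in the BLAST section.
protein_length = 438
expected_cds_length = (protein_length + 1) * 3  # +1 codon for the stop

print(gene_length, expected_cds_length)  # prints: 1317 1317
```

Under these assumptions the gene span equals the expected CDS length, which would be consistent with a single-exon gene.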
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGATCGACATTGTTGACGCCTCTTCTGATCTCTTTCTTTTTCTTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGACTTTGGTGAGTTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTAGCTTTCTACAACACGACGCCGAACGCTTACACTCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGATGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCGCCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAAGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTAGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTAGTCGGCCAATCTCTCCGTCTCGGCGGGCCAATGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTGTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATCTCACACTAATCGCCGACGTAGATCCAGATCAAGGATTCGCCACCGAATTAATACTGAACTCCGACACCGGCTCGCCTCAGAGCGGCGGTGCAGTTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCAACTCGGGAGAACGAATGCCAGTGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCGGGATGGAGCAAGAGCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATGTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGACTGCAAATGTTTGGGATACTTTTACCAAACCAAAGGGTCGCTTTGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACGCCCAATAAGTAG

mRNA sequence

ATGAGATCGACATTGTTGACGCCTCTTCTGATCTCTTTCTTTTTCTTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGACTTTGGTGAGTTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTAGCTTTCTACAACACGACGCCGAACGCTTACACTCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGATGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCGCCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAAGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTAGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTAGTCGGCCAATCTCTCCGTCTCGGCGGGCCAATGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTGTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATCTCACACTAATCGCCGACGTAGATCCAGATCAAGGATTCGCCACCGAATTAATACTGAACTCCGACACCGGCTCGCCTCAGAGCGGCGGTGCAGTTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCAACTCGGGAGAACGAATGCCAGTGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCGGGATGGAGCAAGAGCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATGTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGACTGCAAATGTTTGGGATACTTTTACCAAACCAAAGGGTCGCTTTGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACGCCCAATAAGTAG

Coding sequence (CDS)

ATGAGATCGACATTGTTGACGCCTCTTCTGATCTCTTTCTTTTTCTTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGACTTTGGTGAGTTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTAGCTTTCTACAACACGACGCCGAACGCTTACACTCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGATGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCGCCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAAGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTAGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTAGTCGGCCAATCTCTCCGTCTCGGCGGGCCAATGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTGTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATCTCACACTAATCGCCGACGTAGATCCAGATCAAGGATTCGCCACCGAATTAATACTGAACTCCGACACCGGCTCGCCTCAGAGCGGCGGTGCAGTTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCAACTCGGGAGAACGAATGCCAGTGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCGGGATGGAGCAAGAGCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATGTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGACTGCAAATGTTTGGGATACTTTTACCAAACCAAAGGGTCGCTTTGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACGCCCAATAAGTAG

Protein sequence

MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPFQLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADVDPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFIKTPNK
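The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies; the function name translate and the choice to stop at the first stop codon are illustrative, not part of this page. Only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGAGATCGACATTGTTGACGCCTCTTCTG"))  # MRSTLLTPLL
print(translate("ACGCCCAATAAGTAG"))                 # TPNK
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above.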
Homology
BLAST of CmoCh16G004780 vs. ExPASy Swiss-Prot
Match: Q39688 (Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=1 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 7.1e-120
Identity = 218/379 (57.52%), Positives = 270/379 (71.24%), Query Frame = 0

Query: 6   LTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPFQLAFY 65
           LT  ++ FF     F   LVPANETFKFVNEG+ G++I EY G YRPL  F  PFQL FY
Sbjct: 7   LTLTILLFFIQRIDFCHTLVPANETFKFVNEGELGQYISEYFGDYRPLDPFTSPFQLCFY 66

Query: 66  NTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQ 125
           N TP A+TLALR+ +RR+ES++RWVWEANRG PV ENAT +   +GNLVLA ++G V WQ
Sbjct: 67  NQTPTAFTLALRMGLRRTESLMRWVWEANRGNPVDENATLTFGPDGNLVLARSNGQVAWQ 126

Query: 126 SNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASE 185
           ++TANKGVVG ++LP+GNMVL DS GKFLWQSFD+PTDTLLVGQSL++G   KLVSRAS 
Sbjct: 127 TSTANKGVVGLKILPNGNMVLYDSKGKFLWQSFDTPTDTLLVGQSLKMGAVTKLVSRASP 186

Query: 186 VMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKG-VLANLTLIADVDPDQ 245
             NVNGPYSLVME K L LYYK   SPKP+RYYS +    + K   L N+T   + + DQ
Sbjct: 187 GENVNGPYSLVMEPKGLHLYYKPTTSPKPIRYYSFSLFTKLNKNESLQNVTFEFENENDQ 246

Query: 246 GFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISF 305
           GFA  L L   T +   G ++L R KYN+TL+FLRL IDGN+++ TYNDKVD+G  E+++
Sbjct: 247 GFAFLLSLKYGTSNSLGGASILNRIKYNTTLSFLRLEIDGNVKIYTYNDKVDYGAWEVTY 306

Query: 306 TLFDR------------DSTRENECQWPERCGQFGLCEDNQCVACPTENG-LAGWSKSCA 365
           TLF +              +  +ECQ P++CG FGLCE++QCV CPT +G +  WSK+C 
Sbjct: 307 TLFLKAPPPLFQVSLAATESESSECQLPKKCGNFGLCEESQCVGCPTSSGPVLAWSKTCE 366

Query: 366 PKKVSSCDPKSFHYYKLVG 371
           P K+SSC PK FHY KL G
Sbjct: 367 PPKLSSCGPKDFHYNKLGG 385
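The percentages in an HSP summary line are the identical and positive counts divided by the alignment length. The sketch below parses such a line and recomputes both values. The function name parse_hsp_stats and the returned dictionary keys are hypothetical, and the label for the second statistic is matched loosely so that spelling variants of "Positives" are also accepted.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    m = _STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported_pct": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 218/379 (57.52%), Positives = 270/379 (71.24%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 57.52 71.24
```

For the Q39688 hit, 218 identical positions over an alignment length of 379 give 57.52%, matching the reported value.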

BLAST of CmoCh16G004780 vs. ExPASy Swiss-Prot
Match: Q9ZVA5 (EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1)

HSP 1 Score: 426.4 bits (1095), Expect = 3.9e-118
Identity = 217/430 (50.47%), Positives = 282/430 (65.58%), Query Frame = 0

Query: 16  LFFSFSL------ALVPANETFKFVNEGDFGEFI-IEYDGTYRPLSIFRFPFQLAFYNTT 75
           LFF+ S+      A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTT
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68

Query: 76  PNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNT 135
            NAYTLALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG VVWQ+NT
Sbjct: 69  QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128

Query: 136 ANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMN 195
           ANKGVVG ++L +GNMV+ DSNGKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188

Query: 196 VNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADVDPDQGFAT 255
            NGPYSLVME K L LYY +  +PKP+ YY       + +        + D D   G   
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQLQSMTFQAVEDADTTWGLHM 248

Query: 256 ELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
           E +   D+GS  +    L+RPK+N+TL+FLRL  DGN+R+ +Y+        ++++T F 
Sbjct: 249 EGV---DSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308

Query: 316 RDSTREN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYK 375
            D+T  N EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368

Query: 376 LVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
           + G D  +TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL K  ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428

Query: 436 THLGFIKTPN 438
           + + ++K PN
Sbjct: 429 SLVAYVKAPN 434

BLAST of CmoCh16G004780 vs. ExPASy Swiss-Prot
Match: Q9ZVA4 (EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 9.6e-117
Identity = 209/424 (49.29%), Positives = 279/424 (65.80%), Query Frame = 0

Query: 16  LFFSFSLALVPANETFKFVNEGDFGEFI-IEYDGTYRPLSIFRFPFQLAFYNTTPNAYTL 75
           +F   S A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTTPNAYTL
Sbjct: 15  IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74

Query: 76  ALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGVV 135
           ALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75  ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134

Query: 136 GFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMNVNGPYS 195
           G ++L +GNMV+ DS+GKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194

Query: 196 LVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADVDPDQGFATELILNS 255
           LVME K L LYY +  +PKP+ Y+       + +        + D D   G   E +   
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQFQSMTFQAVEDSDTTWGLVMEGV--- 254

Query: 256 DTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSTR 315
           D+GS  +    L+RPK+N+TL+F+RL  DGN+R+ +Y+        ++++T F + D+  
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314

Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYKLVGVDH 375
            +EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K+ G D 
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374

Query: 376 VLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
            +TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL +  +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434

Query: 436 KTPN 438
           K PN
Sbjct: 435 KAPN 434

BLAST of CmoCh16G004780 vs. ExPASy Swiss-Prot
Match: Q9ZVA2 (EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 1.9e-104
Identity = 208/442 (47.06%), Positives = 272/442 (61.54%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPL-----SIFRFPFQLAFYNTTPNAYT 78
           S  +A VP  + F+ VNEG+FGE+I EYD +YR +     S F  PFQL FYNTTP+AY 
Sbjct: 18  SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77

Query: 79  LALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGV 138
           LALR+ +RR ES +RW+W+ANR  PV ENAT SL  NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78  LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMNVNGPY 198
            GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G  KLVSR S+    +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197

Query: 199 SLVMERKALSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANLT------LIADVDP 258
           S+V+++K L++Y     +P      P   +  T   +V +    NLT      L+ +  P
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257

Query: 259 ----DQGFATELILNSDTGSPQSGGAV-LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDW 318
               + G    L+     GS   GG + L +  YN T+++LRLG DG+L+  +Y     +
Sbjct: 258 QPATNPGNNRRLLQVRPIGS--GGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATY 317

Query: 319 GPSEISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKV-- 378
              E SF+ F     R  +C  P  CG +G C+   C ACPT  GL GWS  CAP K   
Sbjct: 318 LKWEESFSFFSTYFVR--QCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQ 377

Query: 379 --SSCDPKSFHYYKLVGVDHVLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLC 435
             S    K+ +YYK+VGV+H    Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C
Sbjct: 378 FCSGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKC 437

BLAST of CmoCh16G004780 vs. ExPASy Swiss-Prot
Match: Q9ZVA1 (EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1)

HSP 1 Score: 354.8 bits (909), Expect = 1.4e-96
Identity = 199/455 (43.74%), Positives = 266/455 (58.46%), Query Frame = 0

Query: 5   LLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPL-----SIFRFP 64
           L+T L IS      S  +A VP  + F+ +NE  +  +I EYD +YR L     + F  P
Sbjct: 8   LITALAIS----TVSVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIP 67

Query: 65  FQLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEAD 124
           FQL FYNTTP+AY LALR+  RR  S  RW+W+ANR  PV +N+T S   NGNLVLAE +
Sbjct: 68  FQLMFYNTTPSAYVLALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELN 127

Query: 125 GTVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKL 184
           G V WQ+NTANKGV GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G  KL
Sbjct: 128 GQVKWQTNTANKGVTGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKL 187

Query: 185 VSRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLA----NLT 244
           VSR S++   +GPYS+V++ K L++Y     +P     ++  D        +     NLT
Sbjct: 188 VSRTSDMNGSDGPYSMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLT 247

Query: 245 ------LIADVDP----DQGFATELILNSDTGSPQSGGAV-LTRPKYNSTLTFLRLGIDG 304
                 L+ +  P    + G    L+     GS   GG + L +  YN T+++LRLG DG
Sbjct: 248 EPSAYELLLEPAPQPATNPGNNRRLLQVRPIGS--GGGTLNLNKINYNGTISYLRLGSDG 307

Query: 305 NLRLITYNDKVDWGPSEISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLA 364
           +L+  +Y     +   E +F  F     R  +C  P  CG +G C+   CV CPT  GL 
Sbjct: 308 SLKAFSYFPAATYLEWEETFAFFSNYFVR--QCGLPTFCGDYGYCDRGMCVGCPTPKGLL 367

Query: 365 GWSKSCAPKKV----SSCDPKSFHYYKLVGVDHVLTKY-NKGEGPMGQKECEKKCNLDCK 424
            WS  CAP K     S    K+ +YYK+VGV+H    Y N G+GP    +C+ KC+ DCK
Sbjct: 368 AWSDKCAPPKTTQFCSGGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCK 427

Query: 425 CLGYFYQTKGSLCWVANELKTLIKVANSTHLGFIK 435
           CLGYFY+ K   C +A  L TLIK AN++ + +IK
Sbjct: 428 CLGYFYKEKDKKCLLAPLLGTLIKDANTSSVAYIK 454

BLAST of CmoCh16G004780 vs. ExPASy TrEMBL
Match: A0A6J1ETZ5 (epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 GN=LOC111437696 PE=4 SV=1)

HSP 1 Score: 899.4 bits (2323), Expect = 5.8e-258
Identity = 438/438 (100.00%), Positives = 438/438 (100.00%), Query Frame = 0

Query: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60
           MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF
Sbjct: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG
Sbjct: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV
Sbjct: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180

Query: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADV 240
           SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADV
Sbjct: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADV 240

Query: 241 DPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300
           DPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS
Sbjct: 241 DPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300

Query: 301 EISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360
           EISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP
Sbjct: 301 EISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360

Query: 361 KSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420
           KSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT
Sbjct: 361 KSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420

Query: 421 LIKVANSTHLGFIKTPNK 439
           LIKVANSTHLGFIKTPNK
Sbjct: 421 LIKVANSTHLGFIKTPNK 438

BLAST of CmoCh16G004780 vs. ExPASy TrEMBL
Match: A0A6J1JA71 (epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita maxima OX=3661 GN=LOC111482665 PE=4 SV=1)

HSP 1 Score: 860.1 bits (2221), Expect = 3.9e-246
Identity = 417/438 (95.21%), Positives = 424/438 (96.80%), Query Frame = 0

Query: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60
           MRS LLTP LISFFFLFFSFSLALVPANETFKFVNEG+FGEFIIEYDGTYRPLSIFRFPF
Sbjct: 1   MRSPLLTPFLISFFFLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QL+FYNTTPNAYTLALRISIRRSES IRWVWEANRGRPVRENATFSLS NGNLVLAEADG
Sbjct: 61  QLSFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQSNTANKGVVGFELLPSGNMVL DSNGKFLWQSFDSPTDTLLVGQSLRLGGP KLV
Sbjct: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLV 180

Query: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADV 240
           SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLAN+TL A V
Sbjct: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240

Query: 241 DPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300
           DPDQGFATEL LN +TGS +SGG +LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS
Sbjct: 241 DPDQGFATELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300

Query: 301 EISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360
           EISFTLFDRDST ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSK+CAPKKVSSCDP
Sbjct: 301 EISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDP 360

Query: 361 KSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420
           KSFHYYKLVGVDH LTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT
Sbjct: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420

Query: 421 LIKVANSTHLGFIKTPNK 439
           LIKVANSTHLGFIKTPNK
Sbjct: 421 LIKVANSTHLGFIKTPNK 438

BLAST of CmoCh16G004780 vs. ExPASy TrEMBL
Match: A0A6J1ETQ8 (epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 GN=LOC111437646 PE=4 SV=1)

HSP 1 Score: 854.4 bits (2206), Expect = 2.1e-244
Identity = 415/438 (94.75%), Positives = 422/438 (96.35%), Query Frame = 0

Query: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60
           MRS LLTPLLISFFFLFFSFSLALVPANETFKFVNEG+FG+F +EY GTYR LSIFRFPF
Sbjct: 1   MRSPLLTPLLISFFFLFFSFSLALVPANETFKFVNEGEFGDFAVEYGGTYRVLSIFRFPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QLAFYNTTPNAYTLALRISIRRSES IRWVWEANRGRPVRENATFSLSANGNLVLAEADG
Sbjct: 61  QLAFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQSNTANKGVVGFELLPSGNMVL DSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV
Sbjct: 121 TVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180

Query: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADV 240
           SRASE MNVNGPYSLVMERK LSLYYKSPNSPKPMRYYSSTDMLSVRKGVLAN+TL A V
Sbjct: 181 SRASEEMNVNGPYSLVMERKVLSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAV 240

Query: 241 DPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300
           DPDQGFATEL LN DTGS +SGG +LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS
Sbjct: 241 DPDQGFATELTLNYDTGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPS 300

Query: 301 EISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360
           EISFTLFDRDS+ ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP
Sbjct: 301 EISFTLFDRDSSWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDP 360

Query: 361 KSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420
           KSFHYYKLVGVDH LTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT
Sbjct: 361 KSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKT 420

Query: 421 LIKVANSTHLGFIKTPNK 439
           LIKVANSTHLGFIKTPNK
Sbjct: 421 LIKVANSTHLGFIKTPNK 438

BLAST of CmoCh16G004780 vs. ExPASy TrEMBL
Match: A0A0A0L0B9 (Bulb-type lectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G082380 PE=4 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 3.0e-206
Identity = 352/442 (79.64%), Positives = 390/442 (88.24%), Query Frame = 0

Query: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60
           MR  LLTPLL+SFFF FFS S A+VP+NETFKFVNEGDFG+F +EYDGTYRPLSI   PF
Sbjct: 1   MRPPLLTPLLLSFFF-FFSLSFAIVPSNETFKFVNEGDFGDFAVEYDGTYRPLSISNSPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QL FYNTTPNAYTLALR++I RSES  RWVWEANRGRPVRENAT SL ++GNLVLAEADG
Sbjct: 61  QLMFYNTTPNAYTLALRMAILRSESAKRWVWEANRGRPVRENATLSLGSDGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQ+NTANKGVV  +LLP+GNMVLLDSNGKF+WQSFDSPTDTLLVGQSLR+GG  KLV
Sbjct: 121 TVVWQTNTANKGVVKLDLLPNGNMVLLDSNGKFVWQSFDSPTDTLLVGQSLRIGGVTKLV 180

Query: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYS-STDMLSVRKGVLANLTLIAD 240
           SRASE +NVNGPYS VMER A+SLYYKSPNSPKPMRY++ S++  +++KG LA +TL A+
Sbjct: 181 SRASEKLNVNGPYSFVMERNAMSLYYKSPNSPKPMRYFAGSSNWFTIQKGSLARVTLRAE 240

Query: 241 VDPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGP 300
           VDPDQGFATEL LN +    ++GG +L+RPKYNSTLTFLRLGIDGNLRL TYNDKVDW P
Sbjct: 241 VDPDQGFATELTLNYEVAGTENGGPILSRPKYNSTLTFLRLGIDGNLRLFTYNDKVDWSP 300

Query: 301 SEISFTLFDRD---STRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVS 360
           SEI+FTLFDR+      E+ECQWPERCGQFGLCE+NQCVACPTE GL GWSK+C  KKVS
Sbjct: 301 SEITFTLFDREFNTGNTESECQWPERCGQFGLCEENQCVACPTEKGLLGWSKTCMAKKVS 360

Query: 361 SCDPKSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVAN 420
           SCDPKSFHYYK+ GVDH LTKYNKGEG + QK+CEKKCNLDCKCLGYFYQTKGSLCWVAN
Sbjct: 361 SCDPKSFHYYKVEGVDHFLTKYNKGEG-LRQKDCEKKCNLDCKCLGYFYQTKGSLCWVAN 420

Query: 421 ELKTLIKVANSTHLGFIKTPNK 439
           ELKTLIKV NSTHLGFIKTPNK
Sbjct: 421 ELKTLIKVDNSTHLGFIKTPNK 440

BLAST of CmoCh16G004780 vs. ExPASy TrEMBL
Match: A0A5A7V4R1 (Epidermis-specific secreted glycoprotein EP1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G001550 PE=4 SV=1)

HSP 1 Score: 723.0 bits (1865), Expect = 7.4e-205
Identity = 351/442 (79.41%), Positives = 388/442 (87.78%), Query Frame = 0

Query: 1   MRSTLLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPF 60
           MR  LLTPLL+SFFF F S S A+VP NETFKFVNEGDFG+F +EYDGTYRPLSI   PF
Sbjct: 1   MRPPLLTPLLLSFFF-FSSLSFAIVPPNETFKFVNEGDFGDFAVEYDGTYRPLSISNSPF 60

Query: 61  QLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADG 120
           QL FYNTTPNAYTLALR++I RSES  RWVWEANRGRPVRENAT SL ++GNLVLAEADG
Sbjct: 61  QLMFYNTTPNAYTLALRMAILRSESAKRWVWEANRGRPVRENATLSLGSDGNLVLAEADG 120

Query: 121 TVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLV 180
           TVVWQ+NTANKGVV  +LLP+GNMVLLDSNGKF+WQSFDSPTDTLLVGQSLRLGG  KLV
Sbjct: 121 TVVWQTNTANKGVVKLDLLPNGNMVLLDSNGKFVWQSFDSPTDTLLVGQSLRLGGVTKLV 180

Query: 181 SRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYS-STDMLSVRKGVLANLTLIAD 240
           SRASE +NVNGPYS VMERKA+SLYYKSPNSPKPMRY++ S++  +++KG LA +TL A+
Sbjct: 181 SRASEKLNVNGPYSFVMERKAVSLYYKSPNSPKPMRYFAGSSNWFTIQKGTLARVTLRAE 240

Query: 241 VDPDQGFATELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGP 300
           VDP QGFATEL LN +    ++GG +L+RPKYNSTLTFLRLGIDGNLRL TYND+VDW P
Sbjct: 241 VDPGQGFATELTLNYEVAGTENGGPILSRPKYNSTLTFLRLGIDGNLRLFTYNDQVDWSP 300

Query: 301 SEISFTLFDRD---STRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVS 360
           SEI+FTLFDR+      E+ECQWPERCGQFGLCE+NQCVACPTE GL GWSK+C  KKVS
Sbjct: 301 SEITFTLFDREFNTGNTESECQWPERCGQFGLCEENQCVACPTEKGLLGWSKTCMAKKVS 360

Query: 361 SCDPKSFHYYKLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVAN 420
           SCDPKSFHYYKL GVDH LTK+NKGEG + QK+CEKKCNLDCKCLGYFYQTKGSLCWVAN
Sbjct: 361 SCDPKSFHYYKLEGVDHFLTKFNKGEG-LSQKDCEKKCNLDCKCLGYFYQTKGSLCWVAN 420

Query: 421 ELKTLIKVANSTHLGFIKTPNK 439
           ELKTLIKV NSTHLGFIKTPNK
Sbjct: 421 ELKTLIKVDNSTHLGFIKTPNK 440

BLAST of CmoCh16G004780 vs. TAIR 10
Match: AT1G78860.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 426.4 bits (1095), Expect = 2.8e-119
Identity = 217/430 (50.47%), Positives = 282/430 (65.58%), Query Frame = 0

Query: 16  LFFSFSL------ALVPANETFKFVNEGDFGEFI-IEYDGTYRPLSIFRFPFQLAFYNTT 75
           LFF+ S+      A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTT
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68

Query: 76  PNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNT 135
            NAYTLALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG VVWQ+NT
Sbjct: 69  QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128

Query: 136 ANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMN 195
           ANKGVVG ++L +GNMV+ DSNGKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188

Query: 196 VNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADVDPDQGFAT 255
            NGPYSLVME K L LYY +  +PKP+ YY       + +        + D D   G   
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQLQSMTFQAVEDADTTWGLHM 248

Query: 256 ELILNSDTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
           E +   D+GS  +    L+RPK+N+TL+FLRL  DGN+R+ +Y+        ++++T F 
Sbjct: 249 EGV---DSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308

Query: 316 RDSTREN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYK 375
            D+T  N EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368

Query: 376 LVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
           + G D  +TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL K  ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428

Query: 436 THLGFIKTPN 438
           + + ++K PN
Sbjct: 429 SLVAYVKAPN 434

BLAST of CmoCh16G004780 vs. TAIR 10
Match: AT1G78850.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 421.8 bits (1083), Expect = 6.8e-118
Identity = 209/424 (49.29%), Positives = 279/424 (65.80%), Query Frame = 0

Query: 16  LFFSFSLALVPANETFKFVNEGDFGEFI-IEYDGTYRPLSIFRFPFQLAFYNTTPNAYTL 75
           +F   S A VP ++ F+ VNEG + ++  IEY+   R    F   F+L FYNTTPNAYTL
Sbjct: 15  IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74

Query: 76  ALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGVV 135
           ALRI  R  ES +RWVWEANRG PV+ENAT +   +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75  ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134

Query: 136 GFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMNVNGPYS 195
           G ++L +GNMV+ DS+GKF+WQSFDSPTDTLLVGQSL+L G  KLVSR S  +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194

Query: 196 LVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADVDPDQGFATELILNS 255
           LVME K L LYY +  +PKP+ Y+       + +        + D D   G   E +   
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQFQSMTFQAVEDSDTTWGLVMEGV--- 254

Query: 256 DTGSPQSGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSTR 315
           D+GS  +    L+RPK+N+TL+F+RL  DGN+R+ +Y+        ++++T F + D+  
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314

Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYYKLVGVDH 375
            +EC+ PE C  FGLC+  QC ACP++ GL GW ++C    ++SCDPK+FHY+K+ G D 
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374

Query: 376 VLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
            +TKYN G     +  C  KC  DCKCLG+FY  K S CW+  ELKTL +  +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434

Query: 436 KTPN 438
           K PN
Sbjct: 435 KAPN 434

BLAST of CmoCh16G004780 vs. TAIR 10
Match: AT1G16905.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 389.4 bits (999), Expect = 3.8e-108
Identity = 205/431 (47.56%), Positives = 271/431 (62.88%), Query Frame = 0

Query: 9   LLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPLSIFRFPFQLAFYNTT 68
           L++   FL  S     VP  E F+F+N GDFGE  +EY  +YR L + R  F+L F+NTT
Sbjct: 8   LILLSLFLLISLVRPQVPPMEQFRFLNNGDFGESTVEYGASYRDLGVIRNQFRLCFFNTT 67

Query: 69  PNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNT 128
           PNA+TLA+ +    S+S+IRWVW+AN  +PV+E A+ S    GNLVLA+ DG VVWQ+ T
Sbjct: 68  PNAFTLAIGMGTGSSDSIIRWVWQANPQKPVQEEASLSFGPEGNLVLAQPDGRVVWQTMT 127

Query: 129 ANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRL-GGPMKLVSRASEVM 188
            NKGV+G  +  +GN+VL D  G  +WQSF+ PTDTLLVGQSL L G   KLVSR     
Sbjct: 128 ENKGVIGLTMNENGNLVLFDDGGWPVWQSFEFPTDTLLVGQSLTLDGSKNKLVSRN---- 187

Query: 189 NVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANLTLIADVDPDQGFA 248
             NG YSL++E   L L    P S      Y   +   +    L +         DQG  
Sbjct: 188 --NGSYSLILEPDRLVLNRLIPRSNNKSLVYHIIEGRFIPSATLYSA-------KDQGTT 247

Query: 249 TELILNSDTGSPQ-SGGAVLTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTL 308
           T+L L +    P+      L RP++N++ +FLRL  DGNLR+ +++ KV +   E++F L
Sbjct: 248 TQLGLATPGLRPEFPYKHFLARPRFNASQSFLRLDADGNLRIYSFDSKVTFLAWEVTFEL 307

Query: 309 FDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKVSSCDPKSFHYY 368
           F+ D+   NEC  P +CG FG+CEDNQCVACP   GL GWSK+C PKKV SCDPKSFHYY
Sbjct: 308 FNHDN--NNECWLPSKCGAFGICEDNQCVACPLGVGLMGWSKACKPKKVKSCDPKSFHYY 367

Query: 369 KLVGVDHVLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVAN 428
           +L GV+H +TKYN G   +G+ +C   C+ DCKCLGYF+      CW++ EL TL+KV++
Sbjct: 368 RLGGVEHFMTKYNVGLA-LGESKCRGLCSGDCKCLGYFFDKSSFKCWISYELGTLVKVSD 422

Query: 429 STHLGFIKTPN 438
           S  + +IKTPN
Sbjct: 428 SRKVAYIKTPN 422

BLAST of CmoCh16G004780 vs. TAIR 10
Match: AT1G78830.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 380.9 bits (977), Expect = 1.3e-105
Identity = 208/442 (47.06%), Positives = 272/442 (61.54%), Query Frame = 0

Query: 19  SFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPL-----SIFRFPFQLAFYNTTPNAYT 78
           S  +A VP  + F+ VNEG+FGE+I EYD +YR +     S F  PFQL FYNTTP+AY 
Sbjct: 18  SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77

Query: 79  LALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEADGTVVWQSNTANKGV 138
           LALR+ +RR ES +RW+W+ANR  PV ENAT SL  NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78  LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137

Query: 139 VGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKLVSRASEVMNVNGPY 198
            GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G  KLVSR S+    +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197

Query: 199 SLVMERKALSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANLT------LIADVDP 258
           S+V+++K L++Y     +P      P   +  T   +V +    NLT      L+ +  P
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257

Query: 259 ----DQGFATELILNSDTGSPQSGGAV-LTRPKYNSTLTFLRLGIDGNLRLITYNDKVDW 318
               + G    L+     GS   GG + L +  YN T+++LRLG DG+L+  +Y     +
Sbjct: 258 QPATNPGNNRRLLQVRPIGS--GGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATY 317

Query: 319 GPSEISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKSCAPKKV-- 378
              E SF+ F     R  +C  P  CG +G C+   C ACPT  GL GWS  CAP K   
Sbjct: 318 LKWEESFSFFSTYFVR--QCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQ 377

Query: 379 --SSCDPKSFHYYKLVGVDHVLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLC 435
             S    K+ +YYK+VGV+H    Y N G+GP    +C+ KC+ DCKCLGYFY+ K   C
Sbjct: 378 FCSGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKC 437

BLAST of CmoCh16G004780 vs. TAIR 10
Match: AT1G78820.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 354.8 bits (909), Expect = 1.0e-97
Identity = 199/455 (43.74%), Positives = 266/455 (58.46%), Query Frame = 0

Query: 5   LLTPLLISFFFLFFSFSLALVPANETFKFVNEGDFGEFIIEYDGTYRPL-----SIFRFP 64
           L+T L IS      S  +A VP  + F+ +NE  +  +I EYD +YR L     + F  P
Sbjct: 8   LITALAIS----TVSVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIP 67

Query: 65  FQLAFYNTTPNAYTLALRISIRRSESMIRWVWEANRGRPVRENATFSLSANGNLVLAEAD 124
           FQL FYNTTP+AY LALR+  RR  S  RW+W+ANR  PV +N+T S   NGNLVLAE +
Sbjct: 68  FQLMFYNTTPSAYVLALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELN 127

Query: 125 GTVVWQSNTANKGVVGFELLPSGNMVLLDSNGKFLWQSFDSPTDTLLVGQSLRLGGPMKL 184
           G V WQ+NTANKGV GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G  KL
Sbjct: 128 GQVKWQTNTANKGVTGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKL 187

Query: 185 VSRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLA----NLT 244
           VSR S++   +GPYS+V++ K L++Y     +P     ++  D        +     NLT
Sbjct: 188 VSRTSDMNGSDGPYSMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLT 247

Query: 245 ------LIADVDP----DQGFATELILNSDTGSPQSGGAV-LTRPKYNSTLTFLRLGIDG 304
                 L+ +  P    + G    L+     GS   GG + L +  YN T+++LRLG DG
Sbjct: 248 EPSAYELLLEPAPQPATNPGNNRRLLQVRPIGS--GGGTLNLNKINYNGTISYLRLGSDG 307

Query: 305 NLRLITYNDKVDWGPSEISFTLFDRDSTRENECQWPERCGQFGLCEDNQCVACPTENGLA 364
           +L+  +Y     +   E +F  F     R  +C  P  CG +G C+   CV CPT  GL 
Sbjct: 308 SLKAFSYFPAATYLEWEETFAFFSNYFVR--QCGLPTFCGDYGYCDRGMCVGCPTPKGLL 367

Query: 365 GWSKSCAPKKV----SSCDPKSFHYYKLVGVDHVLTKY-NKGEGPMGQKECEKKCNLDCK 424
            WS  CAP K     S    K+ +YYK+VGV+H    Y N G+GP    +C+ KC+ DCK
Sbjct: 368 AWSDKCAPPKTTQFCSGGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCK 427

Query: 425 CLGYFYQTKGSLCWVANELKTLIKVANSTHLGFIK 435
           CLGYFY+ K   C +A  L TLIK AN++ + +IK
Sbjct: 428 CLGYFYKEKDKKCLLAPLLGTLIKDANTSSVAYIK 454
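
The percentages in each HSP summary line follow directly from the raw counts (matches divided by alignment length). The following is a minimal Python sketch that recomputes them from such a line; the function name hsp_percentages and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Pattern for the "Identity = a/b (p%), Positives = c/d (q%)" summary line.
# The spelling "Postives", seen in some exports, is accepted as well.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 199/455 (43.74%), Positives = 266/455 (58.46%), Query Frame = 0"
print(hsp_percentages(line))  # (43.74, 58.46)
```

Recomputing the values this way is a quick consistency check on a copied or exported alignment header.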

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
Q39688      7.1e-120  57.52         Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=...
Q9ZVA5      3.9e-118  50.47         EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1
Q9ZVA4      9.6e-117  49.29         EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1
Q9ZVA2      1.9e-104  47.06         EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1
Q9ZVA1      1.4e-96   43.74         EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1

Match Name  E-value   Identity (%)  Description
A0A6J1ETZ5  5.8e-258  100.00        epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 ...
A0A6J1JA71  3.9e-246  95.21         epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita maxima OX=3661 GN...
A0A6J1ETQ8  2.1e-244  94.75         epidermis-specific secreted glycoprotein EP1-like OS=Cucurbita moschata OX=3662 ...
A0A0A0L0B9  3.0e-206  79.64         Bulb-type lectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G0...
A0A5A7V4R1  7.4e-205  79.41         Epidermis-specific secreted glycoprotein EP1-like OS=Cucumis melo var. makuwa OX...

Match Name   E-value   Identity (%)  Description
AT1G78860.1  2.8e-119  50.47         D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78850.1  6.8e-118  49.29         D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G16905.1  3.8e-108  47.56         Curculin-like (mannose-binding) lectin family protein
AT1G78830.1  1.3e-105  47.06         Curculin-like (mannose-binding) lectin family protein
AT1G78820.1  1.0e-97   43.74         D-mannose binding lectin protein with Apple-like carbohydrate-binding domain

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source       Source Term     Source Description                           Alignment
IPR001480  Bulb-type lectin domain              SMART        SM00108         blect_4                                      coord: 44..161; e-value: 1.9E-31; score: 120.4
IPR001480  Bulb-type lectin domain              PFAM         PF01453         B_lectin                                     coord: 90..173; e-value: 1.4E-18; score: 67.3
IPR001480  Bulb-type lectin domain              PROSITE      PS50927         BULB_LECTIN                                  coord: 41..159; score: 14.115835
IPR001480  Bulb-type lectin domain              CDD          cd00028         B_lectin                                     coord: 45..161; e-value: 2.22723E-32; score: 117.028
IPR035446  S-locus-specific glycoprotein/EP1    PIRSF        PIRSF002686     SLG                                          coord: 1..438; e-value: 1.6E-147; score: 489.7
IPR036426  Bulb-type lectin domain superfamily  GENE3D       2.90.10.10      -                                            coord: 40..160; e-value: 1.3E-15; score: 59.5
IPR036426  Bulb-type lectin domain superfamily  SUPERFAMILY  51110           alpha-D-mannose-specific plant lectins       coord: 69..208
None       No IPR available                     PANTHER      PTHR32444:SF64  SECRETED GLYCOPROTEIN EP1, PUTATIVE-RELATED  coord: 13..438
None       No IPR available                     PANTHER      PTHR32444       FAMILY NOT NAMED                             coord: 13..438
None       No IPR available                     CDD          cd01098         PAN_AP_plant                                 coord: 356..436; e-value: 2.94045E-10; score: 54.7498
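
Each "coord: start..end" entry gives the span of a domain hit on the 438-residue polypeptide. The following is a minimal Python sketch that turns such an entry into a span length, assuming the coordinates are 1-based and inclusive (the usual convention for protein features, but not stated on this page); the helper name span_length is hypothetical.

```python
import re

# Matches the "coord: start..end" entries of the InterPro alignment column.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def span_length(entry):
    """Length in residues of a hit, assuming 1-based inclusive coordinates."""
    m = COORD.search(entry)
    if m is None:
        raise ValueError("no 'coord: start..end' entry found")
    start, end = map(int, m.groups())
    return end - start + 1

# Three of the hits listed above, copied verbatim.
hits = {
    "SMART SM00108": "coord: 44..161",
    "PFAM PF01453": "coord: 90..173",
    "CDD cd01098": "coord: 356..436",
}
for name, entry in hits.items():
    print(name, span_length(entry))
```

Under that assumption the SMART bulb-type lectin hit spans 118 residues, the Pfam hit 84, and the C-terminal PAN_AP_plant hit 81.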

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G004780.1  CmoCh16G004780.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016020      membrane