Cp4.1LG06g06550 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG06g06550
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Bulb-type lectin domain-containing protein
Location: Cp4.1LG06: 4049263 .. 4050621 (-)
RNA-Seq Expression: Cp4.1LG06g06550
Synteny: Cp4.1LG06g06550
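
As a quick sanity check on the Location field above, the span length can be computed from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field: Cp4.1LG06: 4049263 .. 4050621 (-)
length = feature_length(4049263, 4050621)

# Split the span into whole codons; a non-zero remainder would suggest that
# the span is not a single uninterrupted coding region.
codons, remainder = divmod(length, 3)
print(length, codons, remainder)
```

Under that assumption the span is 1359 nt, or 453 codons with no remainder, which is consistent with a 452-residue protein plus a stop codon.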
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGACACGTTTTCATCTTCTTCTTCCCCCTCTCTGTTTCCTGCTCTTCACTGTTCTTCTCGCCGCCATAGCCACAGAAGCTCAAGTTCCTGCAAATGCCACCTTCCATTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGCGTAATTCGAAACGACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTTTTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGTTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGTGAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTCGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGAATCAAAATGCTCCCTAATGGAAACTTAATCCTCCACGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCGGCAGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTTTAGATCGAACAGGGCTCACGATGTTTCTTTCCCACGACGGTCAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGTAAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGGCCAATTAGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTATAATGCAACGTACTCGTTTCTCCGGCTTAGCCACGACGGGAACTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTTTCGAGCTATTTCATAAGGGAATGTGCTCTGCCGAGCAAATGTGGGGCTTACGGCTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGATGTGCGCCGCCGAAGACGCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATCGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGTGAAGGGCCGATTAAGGTTGGGGATTGCAGAGCCAAATGTGATAGAGATTGCAAGTGTTTAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATATTAATTCGTCGTCGGTGGGTTATATTAAGTACTCGATTTAG

mRNA sequence

ATGGCGACACGTTTTCATCTTCTTCTTCCCCCTCTCTGTTTCCTGCTCTTCACTGTTCTTCTCGCCGCCATAGCCACAGAAGCTCAAGTTCCTGCAAATGCCACCTTCCATTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGCGTAATTCGAAACGACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTTTTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGTTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGTGAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTCGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGAATCAAAATGCTCCCTAATGGAAACTTAATCCTCCACGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCGGCAGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTTTAGATCGAACAGGGCTCACGATGTTTCTTTCCCACGACGGTCAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGTAAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGGCCAATTAGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTATAATGCAACGTACTCGTTTCTCCGGCTTAGCCACGACGGGAACTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTTTCGAGCTATTTCATAAGGGAATGTGCTCTGCCGAGCAAATGTGGGGCTTACGGCTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGATGTGCGCCGCCGAAGACGCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATCGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGTGAAGGGCCGATTAAGGTTGGGGATTGCAGAGCCAAATGTGATAGAGATTGCAAGTGTTTAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATATTAATTCGTCGTCGGTGGGTTATATTAAGTACTCGATTTAG

Coding sequence (CDS)

ATGGCGACACGTTTTCATCTTCTTCTTCCCCCTCTCTGTTTCCTGCTCTTCACTGTTCTTCTCGCCGCCATAGCCACAGAAGCTCAAGTTCCTGCAAATGCCACCTTCCATTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGCGTAATTCGAAACGACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTTTTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGTTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGTGAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTCGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGAATCAAAATGCTCCCTAATGGAAACTTAATCCTCCACGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCGGCAGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTTTAGATCGAACAGGGCTCACGATGTTTCTTTCCCACGACGGTCAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGTAAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGGCCAATTAGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTATAATGCAACGTACTCGTTTCTCCGGCTTAGCCACGACGGGAACTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTTTCGAGCTATTTCATAAGGGAATGTGCTCTGCCGAGCAAATGTGGGGCTTACGGCTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGATGTGCGCCGCCGAAGACGCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATCGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGTGAAGGGCCGATTAAGGTTGGGGATTGCAGAGCCAAATGTGATAGAGATTGCAAGTGTTTAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATATTAATTCGTCGTCGGTGGGTTATATTAAGTACTCGATTTAG

Protein sequence

MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI
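
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch that assumes the standard genetic code; `translate` is a hypothetical helper, and only the first 21 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGGCGACACGTTTTCATCTT"  # first 7 codons of the CDS above
cds_end = "TACTCGATTTAG"             # last 4 codons, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first seven residues (MATRFHL) and the last three residues (YSI) of the protein sequence listed above. Passing the full CDS string to the same function should reproduce the whole polypeptide, provided the standard code applies.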
Homology
BLAST of Cp4.1LG06g06550 vs. ExPASy Swiss-Prot
Match: Q9ZVA2 (EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 4.7e-167
Identity = 285/452 (63.05%), Positives = 338/452 (74.78%), Query Frame = 0

Query: 14  FLLFTVLLAAIATE----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFP 73
           F +   L  AIAT     AQVP    F  VN+GEFG+ I EYDASYR I +   +F+T P
Sbjct: 4   FAILVTLALAIATVSVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSP 63

Query: 74  FRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVD 133
           F+L FYNTTP ++I A+R G+  DES MRW+WDANRN+PV ENATL+ GR+GN VLA+ D
Sbjct: 64  FQLLFYNTTPSAYILALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEAD 123

Query: 134 GRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKL 193
           GRV WQTNT N+GVTG ++LPNGN++LHDKNGKF+WQSFD+PTDTLL GQS+++   NKL
Sbjct: 124 GRVKWQTNTANKGVTGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKL 183

Query: 194 ISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT 253
           +SR S+ +GSDGPYS+VLD+ GLTM+++  G  L YGGWP  D    VTFA   E DN T
Sbjct: 184 VSRTSDSNGSDGPYSMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTREFDNLT 243

Query: 254 ---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNL 313
              AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+L
Sbjct: 244 EPSAYELLLEPAPQPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSL 303

Query: 314 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 373
           KA++Y+   +YLKWEESF+FFS+YF+R+C LPS CG YGYC+RGMC ACP+PKGLLGWS+
Sbjct: 304 KAYSYFPAATYLKWEESFSFFSTYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSD 363

Query: 374 RCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGF 433
           +CAPPKT   CSG KGK   YYKIVGVEHF  PY  DG+GP  V DC+AKCDRDCKCLG+
Sbjct: 364 KCAPPKTTQFCSGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGY 423

Query: 434 IYKEYSSKCLRVPLLGTLIKDINSSSVGYIKY 451
            YKE   KCL  PLLGTLIKD N+SSV YIKY
Sbjct: 424 FYKEKDKKCLLAPLLGTLIKDANTSSVAYIKY 455

BLAST of Cp4.1LG06g06550 vs. ExPASy Swiss-Prot
Match: Q9ZVA1 (EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 2.8e-159
Identity = 269/450 (59.78%), Positives = 332/450 (73.78%), Query Frame = 0

Query: 14  FLLFTVLLAAIAT--EAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFPFR 73
           +LL T L  +  +   AQVP    F  +N+  +   I EYDASYR + +    F+T PF+
Sbjct: 6   YLLITALAISTVSVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIPFQ 65

Query: 74  LCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR 133
           L FYNTTP +++ A+R G   D S  RW+WDANRN+PV +N+TL+FGR+GN VLA+++G+
Sbjct: 66  LMFYNTTPSAYVLALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELNGQ 125

Query: 134 VVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLIS 193
           V WQTNT N+GVTG ++LPNGN++LHDK+GKF+WQSFD+PTDTLLVGQS+++   NKL+S
Sbjct: 126 VKWQTNTANKGVTGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKLVS 185

Query: 194 RKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT-- 253
           R S+++GSDGPYS+VLD  GLTM+++  G  L YGGW   D    VTFA   E DN T  
Sbjct: 186 RTSDMNGSDGPYSMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLTEP 245

Query: 254 -AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNLKA 313
            AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+LKA
Sbjct: 246 SAYELLLEPAPQPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKA 305

Query: 314 FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERC 373
           F+Y+   +YL+WEE+FAFFS+YF+R+C LP+ CG YGYC+RGMCV CP+PKGLL WS++C
Sbjct: 306 FSYFPAATYLEWEETFAFFSNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAWSDKC 365

Query: 374 APPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 433
           APPKT   CSGGKGK   YYKIVGVEHF  PY  DG+GP  V DC+AKCDRDCKCLG+ Y
Sbjct: 366 APPKTTQFCSGGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFY 425

Query: 434 KEYSSKCLRVPLLGTLIKDINSSSVGYIKY 451
           KE   KCL  PLLGTLIKD N+SSV YIKY
Sbjct: 426 KEKDKKCLLAPLLGTLIKDANTSSVAYIKY 455

BLAST of Cp4.1LG06g06550 vs. ExPASy Swiss-Prot
Match: Q9ZVA4 (EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 3.0e-89
Identity = 183/450 (40.67%), Positives = 252/450 (56.00%), Query Frame = 0

Query: 15  LLFTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNDVYTFYTFP--FR 74
           L FT+ +  I ++A+VP +  F  VN+G + D   IEY+        DV  F  F   FR
Sbjct: 9   LCFTLSIFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNP-------DVRGFVPFSDNFR 68

Query: 75  LCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR 134
           LCFYNTTP+++  A+R G    ES +RWVW+ANR  PV+ENATLTFG DGN VLA+ DGR
Sbjct: 69  LCFYNTTPNAYTLALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGR 128

Query: 135 VVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLIS 194
           +VWQTNT N+G  GIK+L NGN++++D +GKF+WQSFD PTDTLLVGQS+++  R KL+S
Sbjct: 129 LVWQTNTANKGAVGIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVS 188

Query: 195 RKSEIDGSDGPYSLVLDRTGLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPEND 254
           R S    ++GPYSLV++   L ++ + +          Y  +        +TF A  ++D
Sbjct: 189 RLSPSVNTNGPYSLVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQFQSMTFQAVEDSD 248

Query: 255 NATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKA 314
                               L +  + SG   N    L++  +NAT SF+RL  DGN++ 
Sbjct: 249 TTWG----------------LVMEGVDSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRV 308

Query: 315 FTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWS 374
           ++Y    +   W+ ++  F++       EC +P  C  +G C +G C ACPS KGLLGW 
Sbjct: 309 WSYSTLATSTAWDVTYTAFTNADTDGNDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWD 368

Query: 375 ERCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFI 434
           E C  P    C      F Y+KI G + F+  Y  +G        C  KC RDCKCLGF 
Sbjct: 369 ETCKSPSLASCD--PKTFHYFKIEGADSFMTKY--NGGSSTTESACGDKCTRDCKCLGFF 428

Query: 435 YKEYSSKCLRVPLLGTLIKDINSSSVGYIK 450
           Y   SS+C     L TL +  +SS V Y+K
Sbjct: 429 YNRKSSRCWLGYELKTLTRTGDSSLVAYVK 431

BLAST of Cp4.1LG06g06550 vs. ExPASy Swiss-Prot
Match: Q9ZVA5 (EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1)

HSP 1 Score: 328.6 bits (841), Expect = 1.1e-88
Identity = 183/445 (41.12%), Positives = 249/445 (55.96%), Query Frame = 0

Query: 15  LLFTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNDVYTFYTFP--FR 74
           L FT+ +  +  +A+VP +  F  VN+G + D   IEY+        DV  F  F   FR
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNP-------DVRGFVPFSDNFR 68

Query: 75  LCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR 134
           LCFYNTT +++  A+R G    ES +RWVW+ANR  PV+ENATLTFG DGN VLA+ DGR
Sbjct: 69  LCFYNTTQNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGR 128

Query: 135 VVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLIS 194
           VVWQTNT N+GV GIK+L NGN++++D NGKF+WQSFD PTDTLLVGQS+++  +NKL+S
Sbjct: 129 VVWQTNTANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVS 188

Query: 195 RKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAY 254
           R S    ++GPYSLV++   L ++ + +      G            +  E     A   
Sbjct: 189 RLSPSVNANGPYSLVMEAKKLVLYYTTNKTPKPIG-----------YYEYEFFTKIAQLQ 248

Query: 255 ELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYD 314
            +     +D      L +  + SG   N    L++  +NAT SFLRL  DGN++ ++Y  
Sbjct: 249 SMTFQAVEDADTTWGLHMEGVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYST 308

Query: 315 KVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAP 374
             +   W+ ++  F++       EC +P  C  +G C +G C ACPS  GLLGW E C  
Sbjct: 309 LATSTAWDVTYTAFTNDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKI 368

Query: 375 PKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYS 434
           P    C      F Y+KI G + F+  Y  +G        C  KC RDCKCLGF Y   S
Sbjct: 369 PSLASCD--PKTFHYFKIEGADSFMTKY--NGGSTTTESACGDKCTRDCKCLGFFYNRKS 428

Query: 435 SKCLRVPLLGTLIKDINSSSVGYIK 450
           S+C     L TL K  ++S V Y+K
Sbjct: 429 SRCWLGYELKTLTKTGDTSLVAYVK 431

BLAST of Cp4.1LG06g06550 vs. ExPASy Swiss-Prot
Match: Q39688 (Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=1 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 2.9e-84
Identity = 179/407 (43.98%), Positives = 229/407 (56.27%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MA  F L L  L F +  +          VPAN TF FVN+GE G  I EY   YR +  
Sbjct: 1   MARFFPLTLTILLFFIQRIDFC----HTLVPANETFKFVNEGELGQYISEYFGDYRPLDP 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
                +T PF+LCFYN TP +F  A+R G+   ESLMRWVW+ANR +PV ENATLTFG D
Sbjct: 61  -----FTSPFQLCFYNQTPTAFTLALRMGLRRTESLMRWVWEANRGNPVDENATLTFGPD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GN VLA  +G+V WQT+T N+GV G+K+LPNGN++L+D  GKF+WQSFD PTDTLLVGQS
Sbjct: 121 GNLVLARSNGQVAWQTSTANKGVVGLKILPNGNMVLYDSKGKFLWQSFDTPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFL--SHDGQLLTYGGWP------GTD 240
           +++G+  KL+SR S  +  +GPYSLV++  GL ++   +   + + Y  +         +
Sbjct: 181 LKMGAVTKLVSRASPGENVNGPYSLVMEPKGLHLYYKPTTSPKPIRYYSFSLFTKLNKNE 240

Query: 241 HGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFL 300
               VTF  E END   A+ L L                   GGA  LN++ YN T SFL
Sbjct: 241 SLQNVTFEFENENDQGFAFLLSLKYGTSN-----------SLGGASILNRIKYNTTLSFL 300

Query: 301 RLSHDGNLKAFTYYDKVSYLKWEESFAFF--------------SSYFIRECALPSKCGAY 360
           RL  DGN+K +TY DKV Y  WE ++  F              +     EC LP KCG +
Sbjct: 301 RLEIDGNVKIYTYNDKVDYGAWEVTYTLFLKAPPPLFQVSLAATESESSECQLPKKCGNF 360

Query: 361 GYCNRGMCVACPSPKG-LLGWSERCAPPKTPPCSGGKGKFGYYKIVG 385
           G C    CV CP+  G +L WS+ C PPK   C  G   F Y K+ G
Sbjct: 361 GLCEESQCVGCPTSSGPVLAWSKTCEPPKLSSC--GPKDFHYNKLGG 385

BLAST of Cp4.1LG06g06550 vs. NCBI nr
Match: XP_023535213.1 (EP1-like glycoprotein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 937 bits (2421), Expect = 0.0
Identity = 452/452 (100.00%), Positives = 452/452 (100.00%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN
Sbjct: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
           DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD
Sbjct: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS
Sbjct: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240
           IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA
Sbjct: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240

Query: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300
           AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
Sbjct: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300

Query: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360
           KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE
Sbjct: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360

Query: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420
           RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY
Sbjct: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420

Query: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI
Sbjct: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452

BLAST of Cp4.1LG06g06550 vs. NCBI nr
Match: KAG6591915.1 (EP1-like glycoprotein 2, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024788.1 EP1-like glycoprotein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 446/452 (98.67%), Positives = 448/452 (99.12%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MATRFHLLLPPLCFLL TVLLAAIAT+AQVPANATFHFVNQGEFGDRIIEYDASYRVIRN
Sbjct: 1   MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
            VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD
Sbjct: 61  HVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS
Sbjct: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240
           IRIG RNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA
Sbjct: 181 IRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240

Query: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300
           AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
Sbjct: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300

Query: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360
           KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE
Sbjct: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360

Query: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420
            CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY
Sbjct: 361 SCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420

Query: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           KEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Sbjct: 421 KEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI 452

BLAST of Cp4.1LG06g06550 vs. NCBI nr
Match: XP_022937366.1 (EP1-like glycoprotein 2 [Cucurbita moschata])

HSP 1 Score: 919 bits (2376), Expect = 0.0
Identity = 444/452 (98.23%), Positives = 445/452 (98.45%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MAT FHLLLPPLCFLL TVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN
Sbjct: 1   MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
            VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD
Sbjct: 61  HVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS
Sbjct: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240
           IRIG RNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA
Sbjct: 181 IRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240

Query: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300
           AEPENDNATAYELLLLVNQDTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRLSHDGNL
Sbjct: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL 300

Query: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360
           KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE
Sbjct: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360

Query: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420
            CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKC GFIY
Sbjct: 361 SCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCSGFIY 420

Query: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           KEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Sbjct: 421 KEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI 452

BLAST of Cp4.1LG06g06550 vs. NCBI nr
Match: XP_022976498.1 (EP1-like glycoprotein 2 [Cucurbita maxima])

HSP 1 Score: 915 bits (2364), Expect = 0.0
Identity = 441/452 (97.57%), Positives = 445/452 (98.45%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MATRFHLLL PLCFL+FTVLLAAIAT+AQVPANATFHF+NQGEFGDRIIEYDASYRVIRN
Sbjct: 1   MATRFHLLLRPLCFLVFTVLLAAIATQAQVPANATFHFINQGEFGDRIIEYDASYRVIRN 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
           DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD
Sbjct: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS
Sbjct: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240
           IRIG R KLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA
Sbjct: 181 IRIGGRYKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240

Query: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300
           AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRS  ALNLNKLNYNATYSFLRLSHDGNL
Sbjct: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSARALNLNKLNYNATYSFLRLSHDGNL 300

Query: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360
           KAFTYY KVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE
Sbjct: 301 KAFTYYAKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360

Query: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420
            CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY
Sbjct: 361 SCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420

Query: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           KEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Sbjct: 421 KEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI 452

BLAST of Cp4.1LG06g06550 vs. NCBI nr
Match: XP_038896945.1 (EP1-like glycoprotein 2 [Benincasa hispida])

HSP 1 Score: 867 bits (2241), Expect = 0.0
Identity = 413/441 (93.65%), Positives = 428/441 (97.05%), Query Frame = 0

Query: 13  CFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFPFRL 72
           CFLLFT+LLAA+ATEAQVPAN TFHF+NQGEFGDRIIEYDASYRVIRNDVYTFYTFPFRL
Sbjct: 13  CFLLFTILLAAMATEAQVPANETFHFINQGEFGDRIIEYDASYRVIRNDVYTFYTFPFRL 72

Query: 73  CFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGRV 132
           CFYNTTPDSFIFAIRAGIP DESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR+
Sbjct: 73  CFYNTTPDSFIFAIRAGIPRDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGRI 132

Query: 133 VWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLISR 192
           VWQTNTKNRGVTGIKMLPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQS+RIG RNKLISR
Sbjct: 133 VWQTNTKNRGVTGIKMLPNGNLVLHDKNGKFIWQSFDYPTDTLLVGQSLRIGGRNKLISR 192

Query: 193 KSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYE 252
           KSEIDGSDGPYSLVLDRTGLTMFLSH GQLLTYGGWP TD  +RVTF+ EPEN+NATAYE
Sbjct: 193 KSEIDGSDGPYSLVLDRTGLTMFLSHSGQLLTYGGWPDTDQINRVTFSVEPENENATAYE 252

Query: 253 LLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSYL 312
           LLLL+N+DTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRL HDGNLKAFTYYD  SYL
Sbjct: 253 LLLLLNRDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLGHDGNLKAFTYYDGTSYL 312

Query: 313 KWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAPPKTPPCSG 372
           KWEESFAFFSSYFIRECALPSKCGAYGYC+RGMCVACPSPKGLLGWSE CAPPKTPPCSG
Sbjct: 313 KWEESFAFFSSYFIRECALPSKCGAYGYCSRGMCVACPSPKGLLGWSESCAPPKTPPCSG 372

Query: 373 G-KGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVP 432
           G KGK+GYYKIVGVEHFLNPYK+DGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVP
Sbjct: 373 GGKGKYGYYKIVGVEHFLNPYKDDGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVP 432

Query: 433 LLGTLIKDINSSSVGYIKYSI 452
           LLGTLIKDINSSSVGYIKYS+
Sbjct: 433 LLGTLIKDINSSSVGYIKYSL 453

BLAST of Cp4.1LG06g06550 vs. ExPASy TrEMBL
Match: A0A6J1FA56 (EP1-like glycoprotein 2 OS=Cucurbita moschata OX=3662 GN=LOC111443673 PE=4 SV=1)

HSP 1 Score: 919 bits (2376), Expect = 0.0
Identity = 444/452 (98.23%), Positives = 445/452 (98.45%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MAT FHLLLPPLCFLL TVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN
Sbjct: 1   MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
            VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD
Sbjct: 61  HVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS
Sbjct: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240
           IRIG RNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA
Sbjct: 181 IRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240

Query: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300
           AEPENDNATAYELLLLVNQDTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRLSHDGNL
Sbjct: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL 300

Query: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360
           KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE
Sbjct: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360

Query: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420
            CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKC GFIY
Sbjct: 361 SCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCSGFIY 420

Query: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           KEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Sbjct: 421 KEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI 452

BLAST of Cp4.1LG06g06550 vs. ExPASy TrEMBL
Match: A0A6J1IMC1 (EP1-like glycoprotein 2 OS=Cucurbita maxima OX=3661 GN=LOC111476879 PE=4 SV=1)

HSP 1 Score: 915 bits (2364), Expect = 0.0
Identity = 441/452 (97.57%), Positives = 445/452 (98.45%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN 60
           MATRFHLLL PLCFL+FTVLLAAIAT+AQVPANATFHF+NQGEFGDRIIEYDASYRVIRN
Sbjct: 1   MATRFHLLLRPLCFLVFTVLLAAIATQAQVPANATFHFINQGEFGDRIIEYDASYRVIRN 60

Query: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120
           DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD
Sbjct: 61  DVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRD 120

Query: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180
           GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS
Sbjct: 121 GNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQS 180

Query: 181 IRIGSRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240
           IRIG R KLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA
Sbjct: 181 IRIGGRYKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFA 240

Query: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL 300
           AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRS  ALNLNKLNYNATYSFLRLSHDGNL
Sbjct: 241 AEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSARALNLNKLNYNATYSFLRLSHDGNL 300

Query: 301 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360
           KAFTYY KVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE
Sbjct: 301 KAFTYYAKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 360

Query: 361 RCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420
            CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY
Sbjct: 361 SCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 420

Query: 421 KEYSSKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           KEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Sbjct: 421 KEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI 452

BLAST of Cp4.1LG06g06550 vs. ExPASy TrEMBL
Match: A0A0A0L3A7 (Bulb-type lectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G627220 PE=4 SV=1)

HSP 1 Score: 823 bits (2127), Expect = 5.14e-300
Identity = 399/448 (89.06%), Positives = 417/448 (93.08%), Query Frame = 0

Query: 6   HLL-LPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYT 65
           HLL LP LCF L T+L AAIAT+AQVPAN TFHF+NQGEFGDRIIEYDASYRVIRN+VYT
Sbjct: 4   HLLPLPHLCFFLSTILFAAIATKAQVPANETFHFINQGEFGDRIIEYDASYRVIRNNVYT 63

Query: 66  FYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFV 125
           FYTFPFRLCFYNTTPDSFIFAIRAGIP DESLMRWVWDANRNDPVRENATLTFG DGNFV
Sbjct: 64  FYTFPFRLCFYNTTPDSFIFAIRAGIPRDESLMRWVWDANRNDPVRENATLTFGTDGNFV 123

Query: 126 LADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIG 185
           LADVDGR+VWQTNTKN+GVTGIKMLPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQS+RIG
Sbjct: 124 LADVDGRIVWQTNTKNKGVTGIKMLPNGNLVLHDKNGKFIWQSFDYPTDTLLVGQSLRIG 183

Query: 186 SRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPE 245
            RNKLISRKSEIDGSDGPYSL+L RTGLTMFL++ GQ LTYGGW  TD  S VTF  EPE
Sbjct: 184 GRNKLISRKSEIDGSDGPYSLILSRTGLTMFLTYSGQRLTYGGWGDTDLNS-VTFTVEPE 243

Query: 246 NDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFT 305
           N+NATAYELLL +N+DT RRRLLQVRPIRSGGALNLNKLNYNATYSFLRL  DGNL+AFT
Sbjct: 244 NENATAYELLLSLNRDTQRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLGADGNLRAFT 303

Query: 306 YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAP 365
           YYD  SYLKWEESFAFFSSYFIREC LPSKCGAYGYC+RGMCV CPSPKGLLGWSERCAP
Sbjct: 304 YYDGTSYLKWEESFAFFSSYFIRECGLPSKCGAYGYCSRGMCVGCPSPKGLLGWSERCAP 363

Query: 366 PKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYS 425
           PKTP C GGK KFGYYKIVGVEHFLNPYK DGEGP+KVGDCRAKCDRDCKCLGFIYKEYS
Sbjct: 364 PKTPAC-GGKEKFGYYKIVGVEHFLNPYKNDGEGPMKVGDCRAKCDRDCKCLGFIYKEYS 423

Query: 426 SKCLRVPLLGTLIKDINSSSVGYIKYSI 452
           SKCLRVPLLGTLIKDINSSSVGYIKYS+
Sbjct: 424 SKCLRVPLLGTLIKDINSSSVGYIKYSL 449

BLAST of Cp4.1LG06g06550 vs. ExPASy TrEMBL
Match: F6H2N4 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_19s0014g01360 PE=4 SV=1)

HSP 1 Score: 715 bits (1846), Expect = 3.48e-257
Identity = 335/434 (77.19%), Positives = 379/434 (87.33%), Query Frame = 0

Query: 25  ATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFPFRLCFYNTTPDSFIF 84
           A  A VPAN TF FVNQGEFGDRIIEYDASYRVIRNDVYTF+TFPFRLCFYNTTPD++IF
Sbjct: 18  AALALVPANQTFKFVNQGEFGDRIIEYDASYRVIRNDVYTFFTFPFRLCFYNTTPDNYIF 77

Query: 85  AIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVT 144
           AIRAG+P DESLMRWVWDANRN+P  EN+TLTFGRDGNFVLA+ DGRVVWQTNT N+GVT
Sbjct: 78  AIRAGVPGDESLMRWVWDANRNNPAHENSTLTFGRDGNFVLAEADGRVVWQTNTANKGVT 137

Query: 145 GIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLISRKSEIDGSDGPYS 204
           GIK+LPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQ +RI  RNKL+SR SE+DGSDG YS
Sbjct: 138 GIKLLPNGNLVLHDKNGKFIWQSFDYPTDTLLVGQLLRIKGRNKLVSRVSEMDGSDGKYS 197

Query: 205 LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-- 264
           LV D+ GLTM++++ G+LL YGGWPG D G+ V+F A PENDNATA+EL+L   ++T   
Sbjct: 198 LVFDKKGLTMYINNSGKLLQYGGWPGDDFGNIVSFEAIPENDNATAFELVLSAYEETTPT 257

Query: 265 -----RRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSYLKWEES 324
                RRRLLQVRPI SGG  NLNKLNYNATYSFLRLSHDGNL+A+TYYD+VSYLKW+E+
Sbjct: 258 PPPPGRRRLLQVRPISSGGQRNLNKLNYNATYSFLRLSHDGNLRAYTYYDQVSYLKWDET 317

Query: 325 FAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAPPKTPPCSGGKGKF 384
           FAFFSSYFIRECALPSKCG++G CN+GMCVACPSPKGLLGWSE CAPP+ PPC GG  K 
Sbjct: 318 FAFFSSYFIRECALPSKCGSFGLCNKGMCVACPSPKGLLGWSESCAPPRLPPCKGGAAKV 377

Query: 385 GYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLI 444
            YYKI+GVE+FLNPY +DG+GP+KV +CR +C RDCKCLGFIYKE +SKCL  PLL TLI
Sbjct: 378 DYYKIIGVENFLNPYLDDGKGPMKVEECRERCSRDCKCLGFIYKEDTSKCLLAPLLATLI 437

Query: 445 KDINSSSVGYIKYS 451
           KD N++SVGYIKYS
Sbjct: 438 KDENATSVGYIKYS 451
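
The identity and positives figures in an HSP header can be checked against the alignment itself. The sketch below is a minimal illustration, not part of the database output. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap). The function names `midline_counts` and `percent` are hypothetical.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from one BLAST midline string.

    Assumed convention: letters are identical residues, '+' is a
    conservative substitution, spaces are mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Last midline segment of the F6H2N4 alignment shown above.
segment = "KD N++SVGYIKYS"
ident, pos = midline_counts(segment)
print(ident, pos, len(segment))

# Header totals for the whole HSP: 335 identities and 379 positives over 434 columns.
print(percent(335, 434), percent(379, 434))
```

Summing the per-segment counts over every midline of an HSP should give the totals in its header, provided the leading indentation of each midline is removed first.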

BLAST of Cp4.1LG06g06550 vs. ExPASy TrEMBL
Match: A0A438E5D3 (EP1-like glycoprotein 2 OS=Vitis vinifera OX=29760 GN=VvCHDh000637_2 PE=4 SV=1)

HSP 1 Score: 715 bits (1846), Expect = 3.48e-257
Identity = 335/434 (77.19%), Positives = 379/434 (87.33%), Query Frame = 0

Query: 25  ATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFPFRLCFYNTTPDSFIF 84
           A  A VPAN TF FVNQGEFGDRIIEYDASYRVIRNDVYTF+TFPFRLCFYNTTPD++IF
Sbjct: 18  AALALVPANQTFKFVNQGEFGDRIIEYDASYRVIRNDVYTFFTFPFRLCFYNTTPDNYIF 77

Query: 85  AIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVT 144
           AIRAG+P DESLMRWVWDANRN+P  EN+TLTFGRDGNFVLA+ DGRVVWQTNT N+GVT
Sbjct: 78  AIRAGVPGDESLMRWVWDANRNNPAHENSTLTFGRDGNFVLAEADGRVVWQTNTANKGVT 137

Query: 145 GIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLISRKSEIDGSDGPYS 204
           GIK+LPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQ +RI  RNKL+SR SE+DGSDG YS
Sbjct: 138 GIKLLPNGNLVLHDKNGKFIWQSFDYPTDTLLVGQLLRIKGRNKLVSRVSEMDGSDGKYS 197

Query: 205 LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-- 264
           LV D+ GLTM++++ G+LL YGGWPG D G+ V+F A PENDNATA+EL+L   ++T   
Sbjct: 198 LVFDKKGLTMYINNSGKLLQYGGWPGDDFGTIVSFEAIPENDNATAFELVLSAYEETTPT 257

Query: 265 -----RRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSYLKWEES 324
                RRRLLQVRPI SGG  NLNKLNYNATYSFLRLSHDGNL+A+TYYD+VSYLKW+E+
Sbjct: 258 PPPPGRRRLLQVRPISSGGQRNLNKLNYNATYSFLRLSHDGNLRAYTYYDQVSYLKWDET 317

Query: 325 FAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAPPKTPPCSGGKGKF 384
           FAFFSSYFIRECALPSKCG++G CN+GMCVACPSPKGLLGWSE CAPP+ PPC GG  K 
Sbjct: 318 FAFFSSYFIRECALPSKCGSFGLCNKGMCVACPSPKGLLGWSESCAPPRLPPCKGGAAKV 377

Query: 385 GYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLI 444
            YYKI+GVE+FLNPY +DG+GP+KV +CR +C RDCKCLGFIYKE +SKCL  PLL TLI
Sbjct: 378 DYYKIIGVENFLNPYLDDGKGPMKVEECRERCSRDCKCLGFIYKEDTSKCLLAPLLATLI 437

Query: 445 KDINSSSVGYIKYS 451
           KD N++SVGYIKYS
Sbjct: 438 KDENATSVGYIKYS 451

BLAST of Cp4.1LG06g06550 vs. TAIR 10
Match: AT1G78830.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 589.0 bits (1517), Expect = 3.3e-168
Identity = 285/452 (63.05%), Positives = 338/452 (74.78%), Query Frame = 0

Query: 14  FLLFTVLLAAIATE----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFP 73
           F +   L  AIAT     AQVP    F  VN+GEFG+ I EYDASYR I +   +F+T P
Sbjct: 4   FAILVTLALAIATVSVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSP 63

Query: 74  FRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVD 133
           F+L FYNTTP ++I A+R G+  DES MRW+WDANRN+PV ENATL+ GR+GN VLA+ D
Sbjct: 64  FQLLFYNTTPSAYILALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEAD 123

Query: 134 GRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKL 193
           GRV WQTNT N+GVTG ++LPNGN++LHDKNGKF+WQSFD+PTDTLL GQS+++   NKL
Sbjct: 124 GRVKWQTNTANKGVTGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKL 183

Query: 194 ISRKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT 253
           +SR S+ +GSDGPYS+VLD+ GLTM+++  G  L YGGWP  D    VTFA   E DN T
Sbjct: 184 VSRTSDSNGSDGPYSMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTREFDNLT 243

Query: 254 ---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNL 313
              AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+L
Sbjct: 244 EPSAYELLLEPAPQPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSL 303

Query: 314 KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE 373
           KA++Y+   +YLKWEESF+FFS+YF+R+C LPS CG YGYC+RGMC ACP+PKGLLGWS+
Sbjct: 304 KAYSYFPAATYLKWEESFSFFSTYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSD 363

Query: 374 RCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGF 433
           +CAPPKT   CSG KGK   YYKIVGVEHF  PY  DG+GP  V DC+AKCDRDCKCLG+
Sbjct: 364 KCAPPKTTQFCSGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGY 423

Query: 434 IYKEYSSKCLRVPLLGTLIKDINSSSVGYIKY 451
            YKE   KCL  PLLGTLIKD N+SSV YIKY
Sbjct: 424 FYKEKDKKCLLAPLLGTLIKDANTSSVAYIKY 455

BLAST of Cp4.1LG06g06550 vs. TAIR 10
Match: AT1G78820.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 563.1 bits (1450), Expect = 2.0e-160
Identity = 269/450 (59.78%), Positives = 332/450 (73.78%), Query Frame = 0

Query: 14  FLLFTVLLAAIAT--EAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNDVYTFYTFPFR 73
           +LL T L  +  +   AQVP    F  +N+  +   I EYDASYR + +    F+T PF+
Sbjct: 6   YLLITALAISTVSVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIPFQ 65

Query: 74  LCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR 133
           L FYNTTP +++ A+R G   D S  RW+WDANRN+PV +N+TL+FGR+GN VLA+++G+
Sbjct: 66  LMFYNTTPSAYVLALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELNGQ 125

Query: 134 VVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLIS 193
           V WQTNT N+GVTG ++LPNGN++LHDK+GKF+WQSFD+PTDTLLVGQS+++   NKL+S
Sbjct: 126 VKWQTNTANKGVTGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKLVS 185

Query: 194 RKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT-- 253
           R S+++GSDGPYS+VLD  GLTM+++  G  L YGGW   D    VTFA   E DN T  
Sbjct: 186 RTSDMNGSDGPYSMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLTEP 245

Query: 254 -AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNLKA 313
            AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+LKA
Sbjct: 246 SAYELLLEPAPQPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKA 305

Query: 314 FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERC 373
           F+Y+   +YL+WEE+FAFFS+YF+R+C LP+ CG YGYC+RGMCV CP+PKGLL WS++C
Sbjct: 306 FSYFPAATYLEWEETFAFFSNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAWSDKC 365

Query: 374 APPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIY 433
           APPKT   CSGGKGK   YYKIVGVEHF  PY  DG+GP  V DC+AKCDRDCKCLG+ Y
Sbjct: 366 APPKTTQFCSGGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFY 425

Query: 434 KEYSSKCLRVPLLGTLIKDINSSSVGYIKY 451
           KE   KCL  PLLGTLIKD N+SSV YIKY
Sbjct: 426 KEKDKKCLLAPLLGTLIKDANTSSVAYIKY 455

BLAST of Cp4.1LG06g06550 vs. TAIR 10
Match: AT1G78850.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 330.5 bits (846), Expect = 2.1e-90
Identity = 183/450 (40.67%), Positives = 252/450 (56.00%), Query Frame = 0

Query: 15  LLFTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNDVYTFYTFP--FR 74
           L FT+ +  I ++A+VP +  F  VN+G + D   IEY+        DV  F  F   FR
Sbjct: 9   LCFTLSIFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNP-------DVRGFVPFSDNFR 68

Query: 75  LCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR 134
           LCFYNTTP+++  A+R G    ES +RWVW+ANR  PV+ENATLTFG DGN VLA+ DGR
Sbjct: 69  LCFYNTTPNAYTLALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGR 128

Query: 135 VVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLIS 194
           +VWQTNT N+G  GIK+L NGN++++D +GKF+WQSFD PTDTLLVGQS+++  R KL+S
Sbjct: 129 LVWQTNTANKGAVGIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVS 188

Query: 195 RKSEIDGSDGPYSLVLDRTGLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPEND 254
           R S    ++GPYSLV++   L ++ + +          Y  +        +TF A  ++D
Sbjct: 189 RLSPSVNTNGPYSLVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQFQSMTFQAVEDSD 248

Query: 255 NATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKA 314
                               L +  + SG   N    L++  +NAT SF+RL  DGN++ 
Sbjct: 249 TTWG----------------LVMEGVDSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRV 308

Query: 315 FTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWS 374
           ++Y    +   W+ ++  F++       EC +P  C  +G C +G C ACPS KGLLGW 
Sbjct: 309 WSYSTLATSTAWDVTYTAFTNADTDGNDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWD 368

Query: 375 ERCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFI 434
           E C  P    C      F Y+KI G + F+  Y  +G        C  KC RDCKCLGF 
Sbjct: 369 ETCKSPSLASCD--PKTFHYFKIEGADSFMTKY--NGGSSTTESACGDKCTRDCKCLGFF 428

Query: 435 YKEYSSKCLRVPLLGTLIKDINSSSVGYIK 450
           Y   SS+C     L TL +  +SS V Y+K
Sbjct: 429 YNRKSSRCWLGYELKTLTRTGDSSLVAYVK 431

BLAST of Cp4.1LG06g06550 vs. TAIR 10
Match: AT1G78860.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )

HSP 1 Score: 328.6 bits (841), Expect = 8.1e-90
Identity = 183/445 (41.12%), Positives = 249/445 (55.96%), Query Frame = 0

Query: 15  LLFTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNDVYTFYTFP--FR 74
           L FT+ +  +  +A+VP +  F  VN+G + D   IEY+        DV  F  F   FR
Sbjct: 9   LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNP-------DVRGFVPFSDNFR 68

Query: 75  LCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFGRDGNFVLADVDGR 134
           LCFYNTT +++  A+R G    ES +RWVW+ANR  PV+ENATLTFG DGN VLA+ DGR
Sbjct: 69  LCFYNTTQNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGR 128

Query: 135 VVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGSRNKLIS 194
           VVWQTNT N+GV GIK+L NGN++++D NGKF+WQSFD PTDTLLVGQS+++  +NKL+S
Sbjct: 129 VVWQTNTANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVS 188

Query: 195 RKSEIDGSDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAY 254
           R S    ++GPYSLV++   L ++ + +      G            +  E     A   
Sbjct: 189 RLSPSVNANGPYSLVMEAKKLVLYYTTNKTPKPIG-----------YYEYEFFTKIAQLQ 248

Query: 255 ELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYD 314
            +     +D      L +  + SG   N    L++  +NAT SFLRL  DGN++ ++Y  
Sbjct: 249 SMTFQAVEDADTTWGLHMEGVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYST 308

Query: 315 KVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSERCAP 374
             +   W+ ++  F++       EC +P  C  +G C +G C ACPS  GLLGW E C  
Sbjct: 309 LATSTAWDVTYTAFTNDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKI 368

Query: 375 PKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYS 434
           P    C      F Y+KI G + F+  Y  +G        C  KC RDCKCLGF Y   S
Sbjct: 369 PSLASCD--PKTFHYFKIEGADSFMTKY--NGGSTTTESACGDKCTRDCKCLGFFYNRKS 428

Query: 435 SKCLRVPLLGTLIKDINSSSVGYIK 450
           S+C     L TL K  ++S V Y+K
Sbjct: 429 SRCWLGYELKTLTKTGDTSLVAYVK 431

BLAST of Cp4.1LG06g06550 vs. TAIR 10
Match: AT1G16905.1 (Curculin-like (mannose-binding) lectin family protein )

HSP 1 Score: 328.2 bits (840), Expect = 1.1e-89
Identity = 196/461 (42.52%), Positives = 264/461 (57.27%), Query Frame = 0

Query: 1   MATRFHLLLPPLCFLLFTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYR---V 60
           MA   H+L+    FLL +++        QVP    F F+N G+FG+  +EY ASYR   V
Sbjct: 1   MALASHILILLSLFLLISLV------RPQVPPMEQFRFLNNGDFGESTVEYGASYRDLGV 60

Query: 61  IRNDVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTF 120
           IRN         FRLCF+NTTP++F  AI  G  + +S++RWVW AN   PV+E A+L+F
Sbjct: 61  IRNQ--------FRLCFFNTTPNAFTLAIGMGTGSSDSIIRWVWQANPQKPVQEEASLSF 120

Query: 121 GRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLV 180
           G +GN VLA  DGRVVWQT T+N+GV G+ M  NGNL+L D  G  +WQSF++PTDTLLV
Sbjct: 121 GPEGNLVLAQPDGRVVWQTMTENKGVIGLTMNENGNLVLFDDGGWPVWQSFEFPTDTLLV 180

Query: 181 GQSIRI-GSRNKLISRKSEIDGSDGPYSLVL--DRTGLTMFLSH-DGQLLTYGGWPGTDH 240
           GQS+ + GS+NKL+SR      ++G YSL+L  DR  L   +   + + L Y    G   
Sbjct: 181 GQSLTLDGSKNKLVSR------NNGSYSLILEPDRLVLNRLIPRSNNKSLVYHIIEGRFI 240

Query: 241 GSRVTFAAEPENDNATAYELLLL---VNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYS 300
            S   ++A+   D  T  +L L    +  + P +  L  RP             +NA+ S
Sbjct: 241 PSATLYSAK---DQGTTTQLGLATPGLRPEFPYKHFL-ARP------------RFNASQS 300

Query: 301 FLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACP 360
           FLRL  DGNL+ +++  KV++L WE +F  F+     EC LPSKCGA+G C    CVACP
Sbjct: 301 FLRLDADGNLRIYSFDSKVTFLAWEVTFELFNHDNNNECWLPSKCGAFGICEDNQCVACP 360

Query: 361 SPKGLLGWSERCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGD--CRAK 420
              GL+GWS+ C P K   C      F YY++ GVEHF+  Y       + +G+  CR  
Sbjct: 361 LGVGLMGWSKACKPKKVKSCD--PKSFHYYRLGGVEHFMTKYNVG----LALGESKCRGL 419

Query: 421 CDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDINSSSVGYIK 450
           C  DCKCLG+ + + S KC     LGTL+K  +S  V YIK
Sbjct: 421 CSGDCKCLGYFFDKSSFKCWISYELGTLVKVSDSRKVAYIK 419

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9ZVA2	4.7e-167	63.05	EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1
Q9ZVA1	2.8e-159	59.78	EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1
Q9ZVA4	3.0e-89	40.67	EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1
Q9ZVA5	1.1e-88	41.12	EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1
Q39688	2.9e-84	43.98	Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=...

Match Name	E-value	Identity	Description
XP_023535213.1	0.0	100.00	EP1-like glycoprotein 2 [Cucurbita pepo subsp. pepo]
KAG6591915.1	0.0	98.67	EP1-like glycoprotein 2, partial [Cucurbita argyrosperma subsp. sororia] >KAG702...
XP_022937366.1	0.0	98.23	EP1-like glycoprotein 2 [Cucurbita moschata]
XP_022976498.1	0.0	97.57	EP1-like glycoprotein 2 [Cucurbita maxima]
XP_038896945.1	0.0	93.65	EP1-like glycoprotein 2 [Benincasa hispida]

Match Name	E-value	Identity	Description
A0A6J1FA56	0.0	98.23	EP1-like glycoprotein 2 OS=Cucurbita moschata OX=3662 GN=LOC111443673 PE=4 SV=1
A0A6J1IMC1	0.0	97.57	EP1-like glycoprotein 2 OS=Cucurbita maxima OX=3661 GN=LOC111476879 PE=4 SV=1
A0A0A0L3A7	5.14e-300	89.06	Bulb-type lectin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G6...
F6H2N4	3.48e-257	77.19	Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_19s0014g01360 PE=4 SV=...
A0A438E5D3	3.48e-257	77.19	EP1-like glycoprotein 2 OS=Vitis vinifera OX=29760 GN=VvCHDh000637_2 PE=4 SV=1

Match Name	E-value	Identity	Description
AT1G78830.1	3.3e-168	63.05	Curculin-like (mannose-binding) lectin family protein
AT1G78820.1	2.0e-160	59.78	D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78850.1	2.1e-90	40.67	D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78860.1	8.1e-90	41.12	D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G16905.1	1.1e-89	42.52	Curculin-like (mannose-binding) lectin family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001480	Bulb-type lectin domain	SMART	SM00108	blect_4	coord: 49..171; e-value: 2.6E-26; score: 103.4
IPR001480	Bulb-type lectin domain	PFAM	PF01453	B_lectin	coord: 100..188; e-value: 7.4E-20; score: 71.4
IPR001480	Bulb-type lectin domain	PROSITE	PS50927	BULB_LECTIN	coord: 48..169; score: 14.256498
IPR001480	Bulb-type lectin domain	CDD	cd00028	B_lectin	coord: 68..171; e-value: 2.13561E-30; score: 112.02
IPR035446	S-locus-specific glycoprotein/EP1	PIRSF	PIRSF002686	SLG	coord: 1..452; e-value: 5.7E-135; score: 448.3
IPR036426	Bulb-type lectin domain superfamily	GENE3D	2.90.10.10	-	coord: 70..168; e-value: 4.7E-16; score: 60.9
IPR036426	Bulb-type lectin domain superfamily	SUPERFAMILY	51110	alpha-D-mannose-specific plant lectins	coord: 95..221
None	No IPR available	PANTHER	PTHR32444	FAMILY NOT NAMED	coord: 13..451
None	No IPR available	PANTHER	PTHR32444:SF58	CURCULIN-LIKE (MANNOSE-BINDING) LECTIN FAMILY PROTEIN-RELATED	coord: 13..451
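
Each InterPro row above carries a `coord: start..end` field. As a minimal sketch, the snippet below extracts those coordinates and works out how many residues a match spans. It assumes the coordinates are 1-based, inclusive residue positions on the protein, which this page does not state explicitly. The helper names `parse_coord` and `span_length` are hypothetical.

```python
import re

# Matches fields such as "coord: 49..171" anywhere in a row of text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Return (start, end) from the first 'coord: start..end' field in text."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coord field found in: {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start: int, end: int) -> int:
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


start, end = parse_coord("SMART SM00108 blect_4 coord: 49..171")
print(start, end, span_length(start, end))
```

Under that assumption, the PIRSF match at `coord: 1..452` covers 452 residues, which agrees with the last query position (452) shown in the alignments above.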

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG06g06550.1	Cp4.1LG06g06550.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
cellular_component	GO:0110165	cellular anatomical entity
molecular_function	GO:0030246	carbohydrate binding