Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAGATCGCCATTGTTGACGCCTTTTCTGATCTCTTTCTTTTTCCTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTTGGTGAATTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTATCATTCTACAACACGACGCCGAACGCTTACACGCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCACCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAGGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTTGTCGGCCAATCGCTCCGTCTCGGCGGGCCAACGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTTTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGAGACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCGACTTGGGAGAACGAATGCCAATGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCAGGATGGAGCAAGACCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTATGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG
mRNA sequence
ATGAGATCGCCATTGTTGACGCCTTTTCTGATCTCTTTCTTTTTCCTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTTGGTGAATTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTATCATTCTACAACACGACGCCGAACGCTTACACGCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCACCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAGGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTTGTCGGCCAATCGCTCCGTCTCGGCGGGCCAACGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTTTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGAGACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCGACTTGGGAGAACGAATGCCAATGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCAGGATGGAGCAAGACCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTATGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG
Coding sequence (CDS)
ATGAGATCGCCATTGTTGACGCCTTTTCTGATCTCTTTCTTTTTCCTGTTCTTTTCTTTCTCTCTTGCTCTCGTGCCTGCTAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTTGGTGAATTCATCATCGAGTACGACGGCACCTACAGACCTCTCAGCATCTTCAGATTTCCATTTCAGTTATCATTCTACAACACGACGCCGAACGCTTACACGCTCGCTCTTCGAATTTCGATTCGTCGCTCCGAATCGGCGATACGGTGGGTTTGGGAGGCCAATCGCGGCCGTCCGGTGCGCGAAAATGCTACATTCTCTCTCTCCACCAACGGAAACCTAGTTCTCGCTGAAGCCGACGGCACTGTTGTATGGCAATCGAACACAGCCAACAAGGGCGTTGTTGGATTCGAATTGCTTCCGAGTGGCAACATGGTGTTGTTCGACTCCAATGGCAAATTCCTCTGGCAGAGCTTCGATTCGCCGACGGACACACTCCTTGTCGGCCAATCGCTCCGTCTCGGCGGGCCAACGAAGCTAGTAAGCCGCGCATCGGAGGTAATGAACGTCAATGGACCTTACAGCCTTGTAATGGAACGAAAAGCCCTATCTCTTTACTACAAAAGCCCTAACTCTCCAAAACCGATGCGGTACTACTCATCCACAGACATGCTCTCAGTCCGGAAAGGCGTTCTCGCAAATATCACACTAAACGCCGCCGTAGATCCAGATCAAGGATTCGCCACCGAATTAACACTGAACTACGAGACCGGCTCGACTGAGAGCGGCGGTCCAATTCTAACCCGGCCGAAGTACAACAGCACACTAACATTCCTCCGATTAGGAATCGACGGCAACCTCCGCCTGATCACATACAACGACAAAGTCGATTGGGGCCCGTCGGAGATTTCGTTCACACTCTTCGATAGGGATTCGACTTGGGAGAACGAATGCCAATGGCCAGAGCGGTGCGGACAGTTCGGGTTGTGCGAGGACAACCAGTGCGTAGCCTGTCCAACAGAGAACGGACTGGCAGGATGGAGCAAGACCTGCGCGCCGAAGAAGGTAAGTTCCTGCGATCCCAAAAGCTTCCATTACTATAAACTAGTCGGTGTGGATCATTTCTTGACAAAGTACAACAAAGGAGAAGGGCCAATGGGACAGAAGGAGTGTGAGAAGAAGTGCAATTTGGATTGCAAATGTTTGGGATATTTTTACCAAACCAAAGGGTCGCTATGTTGGGTTGCAAATGAGCTGAAAACTTTGATAAAAGTGGCCAATTCCACTCATTTGGGCTTCATCAAAACACCCAATAAGTAG
Protein sequence
MRSPLLTPFLISFFFLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPFQLSFYNTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFIKTPNK
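The protein sequence above corresponds to a frame-0 translation of the CDS. The following minimal sketch shows how that relationship could be checked. It assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the variable `cds_prefix` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), laid out in the
# conventional TCAG codon order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGAGATCGCCATTGTTGACGCCTTTTCTG"
print(translate(cds_prefix))  # MRSPLLTPFL
```

The printed residues match the first ten residues of the protein sequence (MRSPLLTPFL). Passing the full CDS string to `translate` would allow the same comparison over the whole record.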
Homology
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match:
Q39688 (Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=1 SV=1)
HSP 1 Score: 440.7 bits (1132), Expect = 2.0e-122
Identity = 223/379 (58.84%), Positives = 272/379 (71.77%), Query Frame = 0
Query: 6 LTPFLISFFFLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPFQLSFY 65
LT ++ FF F LVPANETFKFVNEGE G++I EY G YRPL F PFQL FY
Sbjct: 7 LTLTILLFFIQRIDFCHTLVPANETFKFVNEGELGQYISEYFGDYRPLDPFTSPFQLCFY 66
Query: 66 NTTPNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQ 125
N TP A+TLALR+ +RR+ES +RWVWEANRG PV ENAT + +GNLVLA ++G V WQ
Sbjct: 67 NQTPTAFTLALRMGLRRTESLMRWVWEANRGNPVDENATLTFGPDGNLVLARSNGQVAWQ 126
Query: 126 SNTANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASE 185
++TANKGVVG ++LP+GNMVL+DS GKFLWQSFD+PTDTLLVGQSL++G TKLVSRAS
Sbjct: 127 TSTANKGVVGLKILPNGNMVLYDSKGKFLWQSFDTPTDTLLVGQSLKMGAVTKLVSRASP 186
Query: 186 VMNVNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKG-VLANITLNAAVDPDQ 245
NVNGPYSLVME K L LYYK SPKP+RYYS + + K L N+T + DQ
Sbjct: 187 GENVNGPYSLVMEPKGLHLYYKPTTSPKPIRYYSFSLFTKLNKNESLQNVTFEFENENDQ 246
Query: 246 GFATELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISF 305
GFA L+L Y T ++ G IL R KYN+TL+FLRL IDGN+++ TYNDKVD+G E+++
Sbjct: 247 GFAFLLSLKYGTSNSLGGASILNRIKYNTTLSFLRLEIDGNVKIYTYNDKVDYGAWEVTY 306
Query: 306 TLFDR------------DSTWENECQWPERCGQFGLCEDNQCVACPTENG-LAGWSKTCA 365
TLF + + +ECQ P++CG FGLCE++QCV CPT +G + WSKTC
Sbjct: 307 TLFLKAPPPLFQVSLAATESESSECQLPKKCGNFGLCEESQCVGCPTSSGPVLAWSKTCE 366
Query: 366 PKKVSSCDPKSFHYYKLVG 371
P K+SSC PK FHY KL G
Sbjct: 367 PPKLSSCGPKDFHYNKLGG 385
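The identity and positives percentages in each HSP header are ratios of matching (or similar) positions to the alignment length, for example 223/379 for the hit above. A small sketch of how these values could be recomputed from a header line follows; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches "Identity = a/b ... <Pos...> = c/d"; the second label is matched
# loosely so that spelling variants of "Positives" are accepted.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


header = "Identity = 223/379 (58.84%), Positives = 272/379 (71.77%), Query Frame = 0"
print(hsp_percentages(header))  # (58.84, 71.77)
```

Recomputing from the counts rather than reading the bracketed percentages is a simple way to catch transcription errors in a copied report.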
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match:
Q9ZVA5 (EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1)
HSP 1 Score: 431.4 bits (1108), Expect = 1.2e-119
Identity = 219/430 (50.93%), Positives = 287/430 (66.74%), Query Frame = 0
Query: 16 LFFSFSL------ALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTT 75
LFF+ S+ A VP ++ F+ VNEG + ++ IEY+ R F F+L FYNTT
Sbjct: 9 LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68
Query: 76 PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNT 135
NAYTLALRI R ES +RWVWEANRG PV+ENAT + +GNLVLAEADG VVWQ+NT
Sbjct: 69 QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128
Query: 136 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMN 195
ANKGVVG ++L +GNMV++DSNGKF+WQSFDSPTDTLLVGQSL+L G KLVSR S +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188
Query: 196 VNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFAT 255
NGPYSLVME K L LYY + +PKP+ YY + + L ++T A D D +
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQ--LQSMTFQAVEDADTTWGL 248
Query: 256 ELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
+ ++GS + L+RPK+N+TL+FLRL DGN+R+ +Y+ ++++T F
Sbjct: 249 HME-GVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308
Query: 316 RDSTWEN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYK 375
D+T N EC+ PE C FGLC+ QC ACP++ GL GW +TC ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368
Query: 376 LVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
+ G D F+TKYN G + C KC DCKCLG+FY K S CW+ ELKTL K ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428
Query: 436 THLGFIKTPN 438
+ + ++K PN
Sbjct: 429 SLVAYVKAPN 434
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match:
Q9ZVA4 (EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1)
HSP 1 Score: 426.4 bits (1095), Expect = 3.9e-118
Identity = 211/424 (49.76%), Positives = 284/424 (66.98%), Query Frame = 0
Query: 16 LFFSFSLALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTTPNAYTL 75
+F S A VP ++ F+ VNEG + ++ IEY+ R F F+L FYNTTPNAYTL
Sbjct: 15 IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74
Query: 76 ALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVV 135
ALRI R ES +RWVWEANRG PV+ENAT + +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75 ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134
Query: 136 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPYS 195
G ++L +GNMV++DS+GKF+WQSFDSPTDTLLVGQSL+L G TKLVSR S +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194
Query: 196 LVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNY 255
LVME K L LYY + +PKP+ Y+ + + ++T A D D + +
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQ--FQSMTFQAVEDSDTTWGLVME-GV 254
Query: 256 ETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSTW 315
++GS + L+RPK+N+TL+F+RL DGN+R+ +Y+ ++++T F + D+
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314
Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVGVDH 375
+EC+ PE C FGLC+ QC ACP++ GL GW +TC ++SCDPK+FHY+K+ G D
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374
Query: 376 FLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
F+TKYN G + C KC DCKCLG+FY K S CW+ ELKTL + +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434
Query: 436 KTPN 438
K PN
Sbjct: 435 KAPN 434
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match:
Q9ZVA2 (EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1)
HSP 1 Score: 385.6 bits (989), Expect = 7.6e-106
Identity = 209/440 (47.50%), Positives = 273/440 (62.05%), Query Frame = 0
Query: 19 SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
S +A VP + F+ VNEGEFGE+I EYD +YR + S F PFQL FYNTTP+AY
Sbjct: 18 SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77
Query: 79 LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
LALR+ +RR ES +RW+W+ANR PV ENAT SL NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78 LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137
Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G KLVSR S+ +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197
Query: 199 SLVMERKALSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANITLNAA----VDPDQ 258
S+V+++K L++Y +P P + T +V + N+T +A ++P
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257
Query: 259 GFATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGP 318
AT N + SGG L K YN T+++LRLG DG+L+ +Y +
Sbjct: 258 QPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATYLK 317
Query: 319 SEISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV---- 378
E SF+ F + + +C P CG +G C+ C ACPT GL GWS CAP K
Sbjct: 318 WEESFSFF--STYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQFC 377
Query: 379 SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWV 435
S K+ +YYK+VGV+HF Y N G+GP +C+ KC+ DCKCLGYFY+ K C +
Sbjct: 378 SGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLL 437
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match:
Q9ZVA1 (EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1)
HSP 1 Score: 358.2 bits (918), Expect = 1.3e-97
Identity = 194/439 (44.19%), Positives = 261/439 (59.45%), Query Frame = 0
Query: 19 SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
S +A VP + F+ +NE + +I EYD +YR L + F PFQL FYNTTP+AY
Sbjct: 18 SVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIPFQLMFYNTTPSAYV 77
Query: 79 LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
LALR+ RR S RW+W+ANR PV +N+T S NGNLVLAE +G V WQ+NTANKGV
Sbjct: 78 LALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELNGQVKWQTNTANKGV 137
Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G KLVSR S++ +GPY
Sbjct: 138 TGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKLVSRTSDMNGSDGPY 197
Query: 199 SLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLA----NITLNAA----VDPDQG 258
S+V++ K L++Y +P ++ D + N+T +A ++P
Sbjct: 198 SMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLTEPSAYELLLEPAPQ 257
Query: 259 FATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGPS 318
AT N + SGG L K YN T+++LRLG DG+L+ +Y +
Sbjct: 258 PATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAFSYFPAATYLEW 317
Query: 319 EISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV----S 378
E +F F + + +C P CG +G C+ CV CPT GL WS CAP K S
Sbjct: 318 EETFAFF--SNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAWSDKCAPPKTTQFCS 377
Query: 379 SCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVA 435
K+ +YYK+VGV+HF Y N G+GP +C+ KC+ DCKCLGYFY+ K C +A
Sbjct: 378 GGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLLA 437
BLAST of CmaCh16G004430 vs. TAIR 10
Match:
AT1G78860.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )
HSP 1 Score: 431.4 bits (1108), Expect = 8.6e-121
Identity = 219/430 (50.93%), Positives = 287/430 (66.74%), Query Frame = 0
Query: 16 LFFSFSL------ALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTT 75
LFF+ S+ A VP ++ F+ VNEG + ++ IEY+ R F F+L FYNTT
Sbjct: 9 LFFTLSIFLVGAQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTT 68
Query: 76 PNAYTLALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNT 135
NAYTLALRI R ES +RWVWEANRG PV+ENAT + +GNLVLAEADG VVWQ+NT
Sbjct: 69 QNAYTLALRIGNRAQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRVVWQTNT 128
Query: 136 ANKGVVGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMN 195
ANKGVVG ++L +GNMV++DSNGKF+WQSFDSPTDTLLVGQSL+L G KLVSR S +N
Sbjct: 129 ANKGVVGIKILENGNMVIYDSNGKFVWQSFDSPTDTLLVGQSLKLNGQNKLVSRLSPSVN 188
Query: 196 VNGPYSLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFAT 255
NGPYSLVME K L LYY + +PKP+ YY + + L ++T A D D +
Sbjct: 189 ANGPYSLVMEAKKLVLYYTTNKTPKPIGYYEYEFFTKIAQ--LQSMTFQAVEDADTTWGL 248
Query: 256 ELTLNYETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFD 315
+ ++GS + L+RPK+N+TL+FLRL DGN+R+ +Y+ ++++T F
Sbjct: 249 HME-GVDSGSQFNVSTFLSRPKHNATLSFLRLESDGNIRVWSYSTLATSTAWDVTYTAFT 308
Query: 316 RDSTWEN-ECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYK 375
D+T N EC+ PE C FGLC+ QC ACP++ GL GW +TC ++SCDPK+FHY+K
Sbjct: 309 NDNTDGNDECRIPEHCLGFGLCKKGQCNACPSDIGLLGWDETCKIPSLASCDPKTFHYFK 368
Query: 376 LVGVDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANS 435
+ G D F+TKYN G + C KC DCKCLG+FY K S CW+ ELKTL K ++
Sbjct: 369 IEGADSFMTKYN-GGSTTTESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTKTGDT 428
Query: 436 THLGFIKTPN 438
+ + ++K PN
Sbjct: 429 SLVAYVKAPN 434
BLAST of CmaCh16G004430 vs. TAIR 10
Match:
AT1G78850.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )
HSP 1 Score: 426.4 bits (1095), Expect = 2.8e-119
Identity = 211/424 (49.76%), Positives = 284/424 (66.98%), Query Frame = 0
Query: 16 LFFSFSLALVPANETFKFVNEGEFGEFI-IEYDGTYRPLSIFRFPFQLSFYNTTPNAYTL 75
+F S A VP ++ F+ VNEG + ++ IEY+ R F F+L FYNTTPNAYTL
Sbjct: 15 IFLIGSQAKVPVDDQFRVVNEGGYTDYSPIEYNPDVRGFVPFSDNFRLCFYNTTPNAYTL 74
Query: 76 ALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVV 135
ALRI R ES +RWVWEANRG PV+ENAT + +GNLVLAEADG +VWQ+NTANKG V
Sbjct: 75 ALRIGNRVQESTLRWVWEANRGSPVKENATLTFGEDGNLVLAEADGRLVWQTNTANKGAV 134
Query: 136 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPYS 195
G ++L +GNMV++DS+GKF+WQSFDSPTDTLLVGQSL+L G TKLVSR S +N NGPYS
Sbjct: 135 GIKILENGNMVIYDSSGKFVWQSFDSPTDTLLVGQSLKLNGRTKLVSRLSPSVNTNGPYS 194
Query: 196 LVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLNY 255
LVME K L LYY + +PKP+ Y+ + + ++T A D D + +
Sbjct: 195 LVMEAKKLVLYYTTNKTPKPIAYFEYEFFTKITQ--FQSMTFQAVEDSDTTWGLVME-GV 254
Query: 256 ETGSTESGGPILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLF-DRDSTW 315
++GS + L+RPK+N+TL+F+RL DGN+R+ +Y+ ++++T F + D+
Sbjct: 255 DSGSKFNVSTFLSRPKHNATLSFIRLESDGNIRVWSYSTLATSTAWDVTYTAFTNADTDG 314
Query: 316 ENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVGVDH 375
+EC+ PE C FGLC+ QC ACP++ GL GW +TC ++SCDPK+FHY+K+ G D
Sbjct: 315 NDECRIPEHCLGFGLCKKGQCNACPSDKGLLGWDETCKSPSLASCDPKTFHYFKIEGADS 374
Query: 376 FLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHLGFI 435
F+TKYN G + C KC DCKCLG+FY K S CW+ ELKTL + +S+ + ++
Sbjct: 375 FMTKYNGGSSTT-ESACGDKCTRDCKCLGFFYNRKSSRCWLGYELKTLTRTGDSSLVAYV 434
Query: 436 KTPN 438
K PN
Sbjct: 435 KAPN 434
BLAST of CmaCh16G004430 vs. TAIR 10
Match:
AT1G16905.1 (Curculin-like (mannose-binding) lectin family protein )
HSP 1 Score: 392.9 bits (1008), Expect = 3.4e-109
Identity = 208/427 (48.71%), Positives = 270/427 (63.23%), Query Frame = 0
Query: 15 FLFFSFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPLSIFRFPFQLSFYNTTPNAYTL 74
FL S VP E F+F+N G+FGE +EY +YR L + R F+L F+NTTPNA+TL
Sbjct: 14 FLLISLVRPQVPPMEQFRFLNNGDFGESTVEYGASYRDLGVIRNQFRLCFFNTTPNAFTL 73
Query: 75 ALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGVV 134
A+ + S+S IRWVW+AN +PV+E A+ S GNLVLA+ DG VVWQ+ T NKGV+
Sbjct: 74 AIGMGTGSSDSIIRWVWQANPQKPVQEEASLSFGPEGNLVLAQPDGRVVWQTMTENKGVI 133
Query: 135 GFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRL-GGPTKLVSRASEVMNVNGPY 194
G + +GN+VLFD G +WQSF+ PTDTLLVGQSL L G KLVSR NG Y
Sbjct: 134 GLTMNENGNLVLFDDGGWPVWQSFEFPTDTLLVGQSLTLDGSKNKLVSRN------NGSY 193
Query: 195 SLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLANITLNAAVDPDQGFATELTLN 254
SL++E L L P S Y + + + TL +A DQG T+L L
Sbjct: 194 SLILEPDRLVLNRLIPRSNNKSLVYH-----IIEGRFIPSATLYSA--KDQGTTTQLGL- 253
Query: 255 YETGSTESGGP---ILTRPKYNSTLTFLRLGIDGNLRLITYNDKVDWGPSEISFTLFDRD 314
T P L RP++N++ +FLRL DGNLR+ +++ KV + E++F LF+ D
Sbjct: 254 -ATPGLRPEFPYKHFLARPRFNASQSFLRLDADGNLRIYSFDSKVTFLAWEVTFELFNHD 313
Query: 315 STWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKVSSCDPKSFHYYKLVG 374
+ NEC P +CG FG+CEDNQCVACP GL GWSK C PKKV SCDPKSFHYY+L G
Sbjct: 314 N--NNECWLPSKCGAFGICEDNQCVACPLGVGLMGWSKACKPKKVKSCDPKSFHYYRLGG 373
Query: 375 VDHFLTKYNKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVANELKTLIKVANSTHL 434
V+HF+TKYN G +G+ +C C+ DCKCLGYF+ CW++ EL TL+KV++S +
Sbjct: 374 VEHFMTKYNVGLA-LGESKCRGLCSGDCKCLGYFFDKSSFKCWISYELGTLVKVSDSRKV 422
Query: 435 GFIKTPN 438
+IKTPN
Sbjct: 434 AYIKTPN 422
BLAST of CmaCh16G004430 vs. TAIR 10
Match:
AT1G78830.1 (Curculin-like (mannose-binding) lectin family protein )
HSP 1 Score: 385.6 bits (989), Expect = 5.4e-107
Identity = 209/440 (47.50%), Postives = 273/440 (62.05%), Query Frame = 0
Query: 19 SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
S +A VP + F+ VNEGEFGE+I EYD +YR + S F PFQL FYNTTP+AY
Sbjct: 18 SVVIAQVPPEKQFRVVNEGEFGEYITEYDASYRFIESSNQSFFTSPFQLLFYNTTPSAYI 77
Query: 79 LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
LALR+ +RR ES +RW+W+ANR PV ENAT SL NGNLVLAEADG V WQ+NTANKGV
Sbjct: 78 LALRVGLRRDESTMRWIWDANRNNPVGENATLSLGRNGNLVLAEADGRVKWQTNTANKGV 137
Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
GF++LP+GN+VL D NGKF+WQSFD PTDTLL GQSL++ G KLVSR S+ +GPY
Sbjct: 138 TGFQILPNGNIVLHDKNGKFVWQSFDHPTDTLLTGQSLKVNGVNKLVSRTSDSNGSDGPY 197
Query: 199 SLVMERKALSLYYKSPNSPK-----PMRYYSSTDMLSVRKGVLANITLNAA----VDPDQ 258
S+V+++K L++Y +P P + T +V + N+T +A ++P
Sbjct: 198 SMVLDKKGLTMYVNKTGTPLVYGGWPDHDFRGTVTFAVTR-EFDNLTEPSAYELLLEPAP 257
Query: 259 GFATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGP 318
AT N + SGG L K YN T+++LRLG DG+L+ +Y +
Sbjct: 258 QPATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAYSYFPAATYLK 317
Query: 319 SEISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV---- 378
E SF+ F + + +C P CG +G C+ C ACPT GL GWS CAP K
Sbjct: 318 WEESFSFF--STYFVRQCGLPSFCGDYGYCDRGMCNACPTPKGLLGWSDKCAPPKTTQFC 377
Query: 379 SSCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWV 435
S K+ +YYK+VGV+HF Y N G+GP +C+ KC+ DCKCLGYFY+ K C +
Sbjct: 378 SGVKGKTVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLL 437
BLAST of CmaCh16G004430 vs. TAIR 10
Match:
AT1G78820.1 (D-mannose binding lectin protein with Apple-like carbohydrate-binding domain )
HSP 1 Score: 358.2 bits (918), Expect = 9.3e-99
Identity = 194/439 (44.19%), Postives = 261/439 (59.45%), Query Frame = 0
Query: 19 SFSLALVPANETFKFVNEGEFGEFIIEYDGTYRPL-----SIFRFPFQLSFYNTTPNAYT 78
S +A VP + F+ +NE + +I EYD +YR L + F PFQL FYNTTP+AY
Sbjct: 18 SVVMAQVPPEKQFRVLNEPGYAPYITEYDASYRFLNSPNQNFFTIPFQLMFYNTTPSAYV 77
Query: 79 LALRISIRRSESAIRWVWEANRGRPVRENATFSLSTNGNLVLAEADGTVVWQSNTANKGV 138
LALR+ RR S RW+W+ANR PV +N+T S NGNLVLAE +G V WQ+NTANKGV
Sbjct: 78 LALRVGTRRDMSFTRWIWDANRNNPVGDNSTLSFGRNGNLVLAELNGQVKWQTNTANKGV 137
Query: 139 VGFELLPSGNMVLFDSNGKFLWQSFDSPTDTLLVGQSLRLGGPTKLVSRASEVMNVNGPY 198
GF++LP+GNMVL D +GKF+WQSFD PTDTLLVGQSL++ G KLVSR S++ +GPY
Sbjct: 138 TGFQILPNGNMVLHDKHGKFVWQSFDHPTDTLLVGQSLKVNGVNKLVSRTSDMNGSDGPY 197
Query: 199 SLVMERKALSLYYKSPNSPKPMRYYSSTDMLSVRKGVLA----NITLNAA----VDPDQG 258
S+V++ K L++Y +P ++ D + N+T +A ++P
Sbjct: 198 SMVLDNKGLTMYVNKTGTPLVYGGWTDHDFRGTVTFAVTREFDNLTEPSAYELLLEPAPQ 257
Query: 259 FATELTLN---YETGSTESGGPILTRPK--YNSTLTFLRLGIDGNLRLITYNDKVDWGPS 318
AT N + SGG L K YN T+++LRLG DG+L+ +Y +
Sbjct: 258 PATNPGNNRRLLQVRPIGSGGGTLNLNKINYNGTISYLRLGSDGSLKAFSYFPAATYLEW 317
Query: 319 EISFTLFDRDSTWENECQWPERCGQFGLCEDNQCVACPTENGLAGWSKTCAPKKV----S 378
E +F F + + +C P CG +G C+ CV CPT GL WS CAP K S
Sbjct: 318 EETFAFF--SNYFVRQCGLPTFCGDYGYCDRGMCVGCPTPKGLLAWSDKCAPPKTTQFCS 377
Query: 379 SCDPKSFHYYKLVGVDHFLTKY-NKGEGPMGQKECEKKCNLDCKCLGYFYQTKGSLCWVA 435
K+ +YYK+VGV+HF Y N G+GP +C+ KC+ DCKCLGYFY+ K C +A
Sbjct: 378 GGKGKAVNYYKIVGVEHFTGPYVNDGQGPTSVNDCKAKCDRDCKCLGYFYKEKDKKCLLA 437
The following BLAST results are available for this feature:
BLAST of CmaCh16G004430 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q39688 | 2.0e-122 | 58.84 | Epidermis-specific secreted glycoprotein EP1 OS=Daucus carota OX=4039 GN=EP1 PE=...
Q9ZVA5 | 1.2e-119 | 50.93 | EP1-like glycoprotein 4 OS=Arabidopsis thaliana OX=3702 GN=At1g78860 PE=3 SV=1
Q9ZVA4 | 3.9e-118 | 49.76 | EP1-like glycoprotein 3 OS=Arabidopsis thaliana OX=3702 GN=At1g78850 PE=1 SV=1
Q9ZVA2 | 7.6e-106 | 47.50 | EP1-like glycoprotein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g78830 PE=1 SV=1
Q9ZVA1 | 1.3e-97 | 44.19 | EP1-like glycoprotein 1 OS=Arabidopsis thaliana OX=3702 GN=At1g78820 PE=2 SV=1
BLAST of CmaCh16G004430 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G78860.1 | 8.6e-121 | 50.93 | D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78850.1 | 2.8e-119 | 49.76 | D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G16905.1 | 3.4e-109 | 48.71 | Curculin-like (mannose-binding) lectin family protein
AT1G78830.1 | 5.4e-107 | 47.50 | Curculin-like (mannose-binding) lectin family protein
AT1G78820.1 | 9.3e-99 | 44.19 | D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
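The pipe-delimited summary rows above can be loaded into records for sorting or filtering. The following is a minimal sketch that assumes the first four columns are match name, E-value, percent identity and description, and ignores any further columns. The function name `parse_hits` and the dictionary keys are hypothetical.

```python
def parse_hits(table_text: str) -> list[dict]:
    """Parse pipe-delimited BLAST summary rows into a list of dicts."""
    hits = []
    for line in table_text.strip().splitlines():
        fields = [field.strip() for field in line.split("|")]
        # Skip header rows and anything that is not a four-column data row.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, e_value, identity, description = fields[:4]
        hits.append({
            "name": name,
            "e_value": float(e_value),
            "identity": float(identity),
            "description": description,
        })
    return hits


table = """
Match Name | E-value | Identity | Description
AT1G78860.1 | 8.6e-121 | 50.93 | D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G78850.1 | 2.8e-119 | 49.76 | D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
AT1G16905.1 | 3.4e-109 | 48.71 | Curculin-like (mannose-binding) lectin family protein
AT1G78830.1 | 5.4e-107 | 47.50 | Curculin-like (mannose-binding) lectin family protein
AT1G78820.1 | 9.3e-99 | 44.19 | D-mannose binding lectin protein with Apple-like carbohydrate-binding domain
"""

hits = parse_hits(table)
best = min(hits, key=lambda hit: hit["e_value"])
print(best["name"], best["e_value"], best["identity"])
```

Converting the E-values to floats lets the hits be ranked numerically, so the lowest E-value (here AT1G78860.1) is found without relying on the row order of the report.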