BLAST of CmoCh01G003500 vs. Swiss-Prot
Match:
PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)
HSP 1 Score: 221.9 bits (564), Expect = 1.1e-56
Identity = 133/376 (35.37%), Positives = 197/376 (52.39%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ + MV+DTGS+LSW++C+ + N V F+P+ SS++S +PC+S C+ R
Sbjct: 79 VGTPPQNISMVIDTGSELSWLRCNRSSNPNPVNN----FDPTRSSSYSPIPCSSPTCRTR 138
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSF----TASPIAIGCV------- 120
DF +P SCD + CH + YAD + +EGNL E F F S + GC+
Sbjct: 139 TRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSD 198
Query: 121 -KPSAENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVN 180
+ + G+LGMN G LSFISQ KFSYC+ G D G LGD+ F ++
Sbjct: 199 PEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISG--TDDFPGFLLLGDS----NFTWLT 258
Query: 181 MLTFPE----SQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSE 240
L + S P D++AYT+ + GI++ L I +V PD TG+GQTM+DSG++
Sbjct: 259 PLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQ 318
Query: 241 LTYLVDEAYNNVRAEIVRLVGPMM----KKGYEYASVADMCF---DGAMAAAAGRRIGEM 300
T+L+ Y +R+ + ++ + + D+C+ + + R+ +
Sbjct: 319 FTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTV 378
Query: 301 WFQFENGVEILVGKGEGL------LTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWV 348
FE G EI V G+ L LTV V C G S +G E+ +IG+ HQQNMW+
Sbjct: 379 SLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 438
BLAST of CmoCh01G003500 vs. Swiss-Prot
Match:
NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)
HSP 1 Score: 151.4 bits (381), Expect = 1.9e-35
Identity = 113/356 (31.74%), Positives = 167/356 (46.91%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP Q ++DTGS L W QC + FNP SS+FS LPC+S LC+
Sbjct: 101 IGTPAQPFSAIMDTGSDLIWTQCQPCT--QCFNQSTPIFNPQGSSSFSTLPCSSQLCQA- 160
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSF---TASPIAIGCVKPS----- 120
+ PT + C Y+Y Y DG+ +G++ TE +F + I GC + +
Sbjct: 161 ---LSSPTCSN--NFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQ 220
Query: 121 AENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNMLTF 180
G++GM G LS SQ ++KFSYC+ S + L LG NS N T
Sbjct: 221 GNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLL-LGSLANSVTAGSPNT-TL 280
Query: 181 PESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPT-GSGQTMIDSGSELTYLVDE 240
+S P Y + + G+ +G+ +L I P+ F + G+G +IDSG+ LTY V+
Sbjct: 281 IQSSQIPTF----YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNN 340
Query: 241 AYNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGK 300
AY +VR E + + + G +S D+CF + + +I F+ G L
Sbjct: 341 AYQSVRQEFISQINLPVVNG--SSSGFDLCFQ-TPSDPSNLQIPTFVMHFDGG--DLELP 400
Query: 301 GEGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVC 348
E G+ C+ +G S + ++ GN+ QQNM V YD N V F A C
Sbjct: 401 SENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
BLAST of CmoCh01G003500 vs. Swiss-Prot
Match:
NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)
HSP 1 Score: 151.0 bits (380), Expect = 2.5e-35
Identity = 117/359 (32.59%), Positives = 168/359 (46.80%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP ++DTGS L W QC S FNP SS+FS LPC S C+
Sbjct: 102 IGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPI--FNPQDSSSFSTLPCESQYCQ-- 161
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTASP---IAIGCVKPS----- 120
LP+ C Y+Y Y DG+ +G + TE F+F S IA GC + +
Sbjct: 162 ----DLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQ 221
Query: 121 AENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNMLTF 180
G++GM G LS SQ + +FSYC+ S + L LG + SG + T
Sbjct: 222 GNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTL-ALG-SAASGVPEGSPSTTL 281
Query: 181 PESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEA 240
S +P Y + ++GI +G L I + F+ G+G +IDSG+ LTYL +A
Sbjct: 282 IHSSLNPTY----YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDA 341
Query: 241 YNNVRAEIVRLVGPMMKKGYEYASVADMCF----DGAMAAAAGRRIGEMWFQFENGVEIL 300
YN V + + E +S CF DG+ ++ E+ QF+ GV +
Sbjct: 342 YNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTV-----QVPEISMQFDGGV-LN 401
Query: 301 VGKGEGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVC 348
+G+ L++ E GV C+ +G S +LG ++ GN+ QQ V YDL N V F C
Sbjct: 402 LGEQNILISPAE-GVICLAMGSSSQLGI--SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
BLAST of CmoCh01G003500 vs. Swiss-Prot
Match:
ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)
HSP 1 Score: 142.9 bits (359), Expect = 6.7e-33
Identity = 102/355 (28.73%), Positives = 165/355 (46.48%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP + M +VLDTGS ++WIQC + + FNP+ SST+ L C++ C
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCAD--CYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTASP----IAIGCVKPS---- 120
L TS C Y Y DG+ G L T+ +F S +A+GC +
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287
Query: 121 AENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNMLTF 180
G+LG+ G LS +Q K + FSYC+ R + L + G T
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGD------ATA 347
Query: 181 PESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEA 240
P +N +D Y + + G +G ++ + A+F D +GSG ++D G+ +T L +A
Sbjct: 348 PLLRNK-KIDTFYY-VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQA 407
Query: 241 YNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKG 300
YN++R ++L +KKG S+ D C+D ++ + ++ + F F G + +
Sbjct: 408 YNSLRDAFLKLT-VNLKKGSSSISLFDTCYD--FSSLSTVKVPTVAFHFTGGKSLDLPAK 467
Query: 301 EGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVC 348
L+ V + G C + + ++IGNV QQ + YDL+ +G G C
Sbjct: 468 NYLIPVDDSGTFCFAFAPT---SSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
BLAST of CmoCh01G003500 vs. Swiss-Prot
Match:
APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)
HSP 1 Score: 134.4 bits (337), Expect = 2.4e-30
Identity = 109/360 (30.28%), Positives = 174/360 (48.33%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP + + MVLDTGS + W+QC S F+P S T++ +PC+S C+ R
Sbjct: 148 VGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI--FDPRKSKTYATIPCSSPHCR-R 207
Query: 61 IPDFTLPTSCDPRRH-CHYSYFYADGTLAEGNLVTEKFSF---TASPIAIGCVKPS---- 120
+ C+ RR C Y Y DG+ G+ TE +F +A+GC +
Sbjct: 208 LDS----AGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLF 267
Query: 121 AENRGILGMNTGHLSFISQAK---ISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNM 180
G+LG+ G LSF Q KFSYC+ RS S G+ S ++ +
Sbjct: 268 VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPL 327
Query: 181 LTFPESQNSPNLDKLAYTLPMKGIRIGAVQLK-ISPAVFKPDPTGSGQTMIDSGSELTYL 240
L ++P LD Y + + GI +G ++ ++ ++FK D G+G +IDSG+ +T L
Sbjct: 328 L------SNPKLDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRL 387
Query: 241 VDEAYNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEIL 300
+ AY +R + R+ +K+ ++ S+ D CFD ++ ++ + F G ++
Sbjct: 388 IRPAYIAMR-DAFRVGAKTLKRAPDF-SLFDTCFD--LSNMNEVKVPTVVLHF-RGADVS 447
Query: 301 VGKGEGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
+ L+ V G C +G +G S +IGN+ QQ V YDLA+ RVGF C+
Sbjct: 448 LPATNYLIPVDTNGKFCFAF--AGTMGGLS-IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
BLAST of CmoCh01G003500 vs. TrEMBL
Match:
A0A061EL58_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=3 SV=1)
HSP 1 Score: 458.8 bits (1179), Expect = 6.1e-126
Identity = 234/355 (65.92%), Positives = 265/355 (74.65%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ MVLDTGSQLSWIQCH V K P F+PSLSS+FS LPC LCKPR
Sbjct: 92 IGTPPQTQQMVLDTGSQLSWIQCHKKVARKPPPPPTS-FDPSLSSSFSVLPCTHPLCKPR 151
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAENR 120
IPDFTLPTSCD R CHYSYFYADGTLAEGNLV EKF+F+ S P+ +GC ++E++
Sbjct: 152 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPPLILGCATDTSEDK 211
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGRSRS---DLTGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G LSF SQAKISKFSYCVP R TG FYLG+NP+S F+YVN++ FP
Sbjct: 212 GILGMNLGRLSFASQAKISKFSYCVPTRRTQPGFSPTGSFYLGENPSSRGFQYVNLMIFP 271
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
ES PN+D LAYTLPM+GIRIGA +L I +VF+PD GSGQTMIDSGSE TYLVD+AY
Sbjct: 272 ESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMIDSGSEFTYLVDDAY 331
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
N VR E+VRLVGP +KKGY Y VADMCFDG GR IG+M +FE GVEI V K E
Sbjct: 332 NKVREEVVRLVGPRIKKGYVYGGVADMCFDG-NPIEIGRLIGDMVLEFEKGVEITVEK-E 391
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
+L VE GV C+GIGRS LG SN+IGN HQQN+WVEYDL N+RVGFG A CS
Sbjct: 392 RVLADVEGGVHCLGIGRSSMLGAASNIIGNFHQQNLWVEYDLVNRRVGFGKADCS 443
BLAST of CmoCh01G003500 vs. TrEMBL
Match:
A0A067JPK4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22524 PE=3 SV=1)
HSP 1 Score: 458.4 bits (1178), Expect = 8.0e-126
Identity = 233/355 (65.63%), Positives = 269/355 (75.77%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ MVLDTGSQLSWIQCH K P F+PSLSS+FS LPCN LCKPR
Sbjct: 9 IGTPPQTQQMVLDTGSQLSWIQCHKKAPRKL--PPTTSFDPSLSSSFSVLPCNHPLCKPR 68
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAENR 120
IPDFTLPT+CD R CHYSYFYADGTLAEG+LV EKF+F+ + P+ +GC + S +++
Sbjct: 69 IPDFTLPTTCDQNRLCHYSYFYADGTLAEGSLVREKFTFSNTQSTPPLILGCAEDSGDDK 128
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGR-SRSDL--TGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G SF SQAKISKFSYCVP R +R+ L TGLFYLGDNPNSG F Y+N+LTF
Sbjct: 129 GILGMNLGRRSFASQAKISKFSYCVPTRGNRAGLSPTGLFYLGDNPNSGGFHYINLLTFT 188
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
SQ SPNLD LAYT+PM+GIRIG +L I +VF+PDP+GSGQTM+DSGSE TYLVDEAY
Sbjct: 189 PSQRSPNLDPLAYTVPMQGIRIGNTRLNIPASVFRPDPSGSGQTMVDSGSEFTYLVDEAY 248
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
N VR EIVR+ G +KK Y Y V+DMCFDG GR IG M F+FE GVEI+V + E
Sbjct: 249 NKVREEIVRVAGTKLKKNYVYGGVSDMCFDG-NPVEIGRLIGNMVFEFEKGVEIVVDR-E 308
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
+L V GV CVGIGRS LG SN+IGN HQQN+WVE+DLAN+RVGFG A CS
Sbjct: 309 RVLANVGNGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDLANRRVGFGKADCS 359
BLAST of CmoCh01G003500 vs. TrEMBL
Match:
B9T2R1_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 PE=3 SV=1)
HSP 1 Score: 456.4 bits (1173), Expect = 3.0e-125
Identity = 228/355 (64.23%), Positives = 269/355 (75.77%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ MVLDTGSQLSWIQCH K P F+PSLSS+FS LPCN LCKPR
Sbjct: 86 IGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTS-FDPSLSSSFSVLPCNHPLCKPR 145
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAENR 120
IPDFTLPT+CD R CHYSYFYADGT AEG+LV EK +F++S P+ +GC + S + +
Sbjct: 146 IPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEASTDEK 205
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGR-SRSDL--TGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G SF SQAKISKFSYCVP R +R+ L TG FYLG+NPNSG+F+Y+N+LTF
Sbjct: 206 GILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFT 265
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
SQ SPNLD LAYT+PM+GIR+G +L IS +F+PDP+G+GQT+IDSGSE TYLVDEAY
Sbjct: 266 PSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAY 325
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
N VR E+VRLVGP +KKGY Y V+DMCFDG GR IG M F+FE GVEI++ K
Sbjct: 326 NKVREEVVRLVGPKLKKGYVYGGVSDMCFDG-NPMEIGRLIGNMVFEFEKGVEIVIDKWR 385
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
+L V GV C+GIGRS LG SN+IGN HQQN+WVEYDLAN+R+G G A CS
Sbjct: 386 -VLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKADCS 437
BLAST of CmoCh01G003500 vs. TrEMBL
Match:
W9SFW9_9ROSA (Aspartic proteinase nepenthesin-2 OS=Morus notabilis GN=L484_000286 PE=3 SV=1)
HSP 1 Score: 455.3 bits (1170), Expect = 6.8e-125
Identity = 232/359 (64.62%), Positives = 264/359 (73.54%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ MVLDTGSQLSWIQC K P F+PSLSSTFS LPC+ +CKPR
Sbjct: 94 IGTPPQTQQMVLDTGSQLSWIQCDKKAP-KVAPPPTASFDPSLSSTFSVLPCSHPVCKPR 153
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAENR 120
IPDFTLPTSCD R CHYSYFYADGT AEGNLV EKF+F+ S P +GC K ++++
Sbjct: 154 IPDFTLPTSCDQNRLCHYSYFYADGTFAEGNLVREKFTFSRSVTTPPFILGCAKDPSDSQ 213
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGRSRSDL-----TGLFYLGDNPNSGKFKYVNMLT 180
GILGMN G LSF SQAKI+KFSYCVP R R TG FYLG+NPNS FKYVN+LT
Sbjct: 214 GILGMNLGRLSFASQAKINKFSYCVPTRGRQTKSGSLPTGSFYLGNNPNSRWFKYVNLLT 273
Query: 181 FPESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDE 240
F +SQ PNLD LA+TLPM+GIRIGA +L I VF+PD +GSGQTMIDSGSE T+LVDE
Sbjct: 274 FRQSQRMPNLDPLAFTLPMQGIRIGARRLNIPATVFRPDSSGSGQTMIDSGSEFTFLVDE 333
Query: 241 AYNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGK 300
AYN VR EIVRLVGP +KKGY Y VADMCF G A A GR +G+M F+FE GVEI+ K
Sbjct: 334 AYNKVREEIVRLVGPRIKKGYVYGGVADMCFQGTDAVAIGRLVGDMAFEFEKGVEIVAPK 393
Query: 301 GEGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCSGL 351
E +L V GV C+ IGRS LG SN+IGN HQQN+WVE+DL +RVGFG A CS L
Sbjct: 394 -ERILADVGGGVHCLAIGRSNMLGAASNIIGNFHQQNIWVEFDLVGRRVGFGKADCSRL 450
BLAST of CmoCh01G003500 vs. TrEMBL
Match:
Q9FGI3_ARATH (AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1)
HSP 1 Score: 453.8 bits (1166), Expect = 2.0e-124
Identity = 226/354 (63.84%), Positives = 264/354 (74.58%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP Q ++VLDTGSQLSWIQCH K + P F+PSLSS+FS LPC+ LCKPR
Sbjct: 86 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 145
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSF----TASPIAIGCVKPSAENR 120
IPDFTLPTSCD R CHYSYFYADGT AEGNLV EKF+F T P+ +GC K S + +
Sbjct: 146 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKESTDEK 205
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGRSRSD---LTGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G LSFISQAKISKFSYC+P RS TG FYLGDNPNS FKYV++LTFP
Sbjct: 206 GILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFP 265
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
+SQ PNLD LAYT+P++GIRIG +L I +VF+PD GSGQTM+DSGSE T+LVD AY
Sbjct: 266 QSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAY 325
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
+ V+ EIVRLVG +KKGY Y S ADMCFDG + GR IG++ F+F GVEILV K +
Sbjct: 326 DKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEK-Q 385
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVC 348
LL V G+ CVGIGRS LG SN+IGNVHQQN+WVE+D+ N+RVGF A C
Sbjct: 386 SLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAEC 438
BLAST of CmoCh01G003500 vs. TAIR10
Match:
AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 453.8 bits (1166), Expect = 1.0e-127
Identity = 226/354 (63.84%), Positives = 264/354 (74.58%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP Q ++VLDTGSQLSWIQCH K + P F+PSLSS+FS LPC+ LCKPR
Sbjct: 86 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 145
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSF----TASPIAIGCVKPSAENR 120
IPDFTLPTSCD R CHYSYFYADGT AEGNLV EKF+F T P+ +GC K S + +
Sbjct: 146 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKESTDEK 205
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGRSRSD---LTGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G LSFISQAKISKFSYC+P RS TG FYLGDNPNS FKYV++LTFP
Sbjct: 206 GILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFP 265
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
+SQ PNLD LAYT+P++GIRIG +L I +VF+PD GSGQTM+DSGSE T+LVD AY
Sbjct: 266 QSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAY 325
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
+ V+ EIVRLVG +KKGY Y S ADMCFDG + GR IG++ F+F GVEILV K +
Sbjct: 326 DKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEK-Q 385
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVC 348
LL V G+ CVGIGRS LG SN+IGNVHQQN+WVE+D+ N+RVGF A C
Sbjct: 386 SLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAEC 438
BLAST of CmoCh01G003500 vs. TAIR10
Match:
AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 440.3 bits (1131), Expect = 1.1e-123
Identity = 226/356 (63.48%), Positives = 264/356 (74.16%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKW-FNPSLSSTFSFLPCNSSLCKP 60
+GTPPQ MVLDTGSQLSWIQCH K + P+ K F+PSLSS+FS LPC+ LCKP
Sbjct: 78 IGTPPQAQQMVLDTGSQLSWIQCHR----KKLPPKPKTSFDPSLSSSFSTLPCSHPLCKP 137
Query: 61 RIPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAEN 120
RIPDFTLPTSCD R CHYSYFYADGT AEGNLV EK +F+ + P+ +GC S+++
Sbjct: 138 RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSDD 197
Query: 121 RGILGMNTGHLSFISQAKISKFSYCVPGRSRSD---LTGLFYLGDNPNSGKFKYVNMLTF 180
RGILGMN G LSF+SQAKISKFSYC+P +S TG FYLGDNPNS FKYV++LTF
Sbjct: 198 RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTF 257
Query: 181 PESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEA 240
PESQ PNLD LAYT+PM GIR G +L IS +VF+PD GSGQTM+DSGSE T+LVD A
Sbjct: 258 PESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAA 317
Query: 241 YNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKG 300
Y+ VRAEI+ VG +KKGY Y ADMCFDG +A R IG++ F F GVEILV K
Sbjct: 318 YDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIP-RLIGDLVFVFTRGVEILVPK- 377
Query: 301 EGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
E +L V G+ CVGIGRS LG SN+IGNVHQQN+WVE+D+ N+RVGF A CS
Sbjct: 378 ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCS 427
BLAST of CmoCh01G003500 vs. TAIR10
Match:
AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 234.6 bits (597), Expect = 9.5e-62
Identity = 141/371 (38.01%), Positives = 203/371 (54.72%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+G PPQ + MVLDTGS+LSW+ C + N SV FNP SST+S +PC+S +C+ R
Sbjct: 71 VGDPPQNISMVLDTGSELSWLHCKKSPNLGSV------FNPVSSSTYSPVPCSSPICRTR 130
Query: 61 IPDFTLPTSCDPRRH-CHYSYFYADGTLAEGNLVTEKF---SFTASPIAIGCV------- 120
D +P SCDP+ H CH + YAD T EGNL E F S T GC+
Sbjct: 131 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSN 190
Query: 121 -KPSAENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNS--GKFKY 180
+ A++ G++GMN G LSF++Q SKFSYC+ G SD +G LGD S G +Y
Sbjct: 191 SEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISG---SDSSGFLLLGDASYSWLGPIQY 250
Query: 181 VNMLTFPESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELT 240
++ +S P D++AYT+ ++GIR+G+ L + +VF PD TG+GQTM+DSG++ T
Sbjct: 251 TPLVL--QSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFT 310
Query: 241 YLVDEAYNNVRAEIVRLVGPMMK----KGYEYASVADMCFDGAMAAAAGRRIGEMWFQFE 300
+L+ Y ++ E + +++ + + D+C+ M
Sbjct: 311 FLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMF 370
Query: 301 NGVEILVGKGEGLLTVV-------EKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDL 347
G E+ V G+ LL V ++ V C G S LG E+ +IG+ HQQN+W+E+DL
Sbjct: 371 RGAEMSV-SGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDL 429
BLAST of CmoCh01G003500 vs. TAIR10
Match:
AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 221.9 bits (564), Expect = 6.4e-58
Identity = 133/376 (35.37%), Positives = 197/376 (52.39%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ + MV+DTGS+LSW++C+ + N V F+P+ SS++S +PC+S C+ R
Sbjct: 79 VGTPPQNISMVIDTGSELSWLRCNRSSNPNPVNN----FDPTRSSSYSPIPCSSPTCRTR 138
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSF----TASPIAIGCV------- 120
DF +P SCD + CH + YAD + +EGNL E F F S + GC+
Sbjct: 139 TRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSD 198
Query: 121 -KPSAENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVN 180
+ + G+LGMN G LSFISQ KFSYC+ G D G LGD+ F ++
Sbjct: 199 PEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISG--TDDFPGFLLLGDS----NFTWLT 258
Query: 181 MLTFPE----SQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSE 240
L + S P D++AYT+ + GI++ L I +V PD TG+GQTM+DSG++
Sbjct: 259 PLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQ 318
Query: 241 LTYLVDEAYNNVRAEIVRLVGPMM----KKGYEYASVADMCF---DGAMAAAAGRRIGEM 300
T+L+ Y +R+ + ++ + + D+C+ + + R+ +
Sbjct: 319 FTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTV 378
Query: 301 WFQFENGVEILVGKGEGL------LTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWV 348
FE G EI V G+ L LTV V C G S +G E+ +IG+ HQQNMW+
Sbjct: 379 SLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 438
BLAST of CmoCh01G003500 vs. TAIR10
Match:
AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 143.7 bits (361), Expect = 2.2e-34
Identity = 119/364 (32.69%), Positives = 171/364 (46.98%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTP + MVLDTGS + W+QC + F+P S TF+ +PC S LC+ R
Sbjct: 141 VGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAI--FDPKKSKTFATVPCGSRLCR-R 200
Query: 61 IPDFTLPTSCDPRRH--CHYSYFYADGTLAEGNLVTEKFSF---TASPIAIGCVKPS--- 120
+ D + C RR C Y Y DG+ EG+ TE +F + +GC +
Sbjct: 201 LDD---SSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGL 260
Query: 121 -AENRGILGMNTGHLSFISQAK---ISKFSYCVPGRSRSDLTGLFYLGDNPNS----GKF 180
G+LG+ G LSF SQ K KFSYC+ R+ S + P S G
Sbjct: 261 FVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSS------SKPPSTIVFGNA 320
Query: 181 KYVNMLTFPESQNSPNLDKLAYTLPMKGIRIGAVQLK-ISPAVFKPDPTGSGQTMIDSGS 240
F +P LD Y L + GI +G ++ +S + FK D TG+G +IDSG+
Sbjct: 321 AVPKTSVFTPLLTNPKLDTF-YYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGT 380
Query: 241 ELTYLVDEAYNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFEN 300
+T L AY +R + RL +K+ Y S+ D CFD ++ ++ + F F
Sbjct: 381 SVTRLTQPAYVALR-DAFRLGATKLKRAPSY-SLFDTCFD--LSGMTTVKVPTVVFHFGG 440
Query: 301 GVEILVGKGEGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFG 348
G E+ + L+ V +G C +G +G+ S +IGN+ QQ V YDL RVGF
Sbjct: 441 G-EVSLPASNYLIPVNTEGRFCFAF--AGTMGSLS-IIGNIQQQGFRVAYDLVGSRVGFL 483
BLAST of CmoCh01G003500 vs. NCBI nr
Match:
gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])
HSP 1 Score: 537.3 bits (1383), Expect = 1.9e-149
Identity = 264/359 (73.54%), Positives = 292/359 (81.34%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSV----KPRFKWFNPSLSSTFSFLPCNSSL 60
+GTPPQ D+VLDTGSQLSWIQCH K + KP+ F+PSLSS+FS LPCN +
Sbjct: 73 IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLPCNHPI 132
Query: 61 CKPRIPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPS 120
CKPRIPDFTLPTSCD R CHYSYFYADGTLAEGNLV EKF+F+ S P+ +GC + S
Sbjct: 133 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGS 192
Query: 121 AENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNMLTF 180
ENRGILGMN G LSFISQAKISKFSYCVP R+ S+ TGLFYLGDNPNS KFKYV MLTF
Sbjct: 193 TENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKYVTMLTF 252
Query: 181 PESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEA 240
PESQ+SPNLD LAYTLPMK I+I +L I PA FKPD GSGQTMIDSGS+LTYLVDEA
Sbjct: 253 PESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEA 312
Query: 241 YNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKG 300
Y V+ E+VRLVG MMKKGY YA+VADMCFD + GRRIG+M F+F+NGVEI VG+G
Sbjct: 313 YEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRG 372
Query: 301 EGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCSGLK 352
EG+LT VEKGVKCVGIGRSGRLG SN+IG VHQQNMWVEYDLANKRVGFGGA CS LK
Sbjct: 373 EGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK 431
BLAST of CmoCh01G003500 vs. NCBI nr
Match:
gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])
HSP 1 Score: 533.5 bits (1373), Expect = 2.8e-148
Identity = 263/359 (73.26%), Positives = 290/359 (80.78%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSV----KPRFKWFNPSLSSTFSFLPCNSSL 60
+GTPPQ D+VLDTGSQLSWIQCH K + KP+ F+PSLSS+FS LPCN +
Sbjct: 72 IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLPCNHPI 131
Query: 61 CKPRIPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPS 120
CKPRIPDFTLPTSCD R CHYSYFYADGTLAEGNLV EKF+F+ S P+ +GC + S
Sbjct: 132 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQAS 191
Query: 121 AENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNMLTF 180
ENRGILGMN G LSFISQAKISKFSYCVP R+ S+ TGLFYLGDNPNS KFKYV MLTF
Sbjct: 192 TENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKYVTMLTF 251
Query: 181 PESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEA 240
PESQ+SPNLD LAYTLPMK I+I +L I PA FKPD GSGQTMIDSGS+LTYLVDEA
Sbjct: 252 PESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEA 311
Query: 241 YNNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKG 300
Y V+ E+VRLVG MMKKGY YA VADMCFD + A GRRIG + F+F+NGVEI VG+G
Sbjct: 312 YEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRG 371
Query: 301 EGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCSGLK 352
EG+LT VEKGVKCVGIGRS RLG SN+IG VHQQNMWVEYDLANKRVGFGGA CS LK
Sbjct: 372 EGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK 430
BLAST of CmoCh01G003500 vs. NCBI nr
Match:
gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])
HSP 1 Score: 531.9 bits (1369), Expect = 8.2e-148
Identity = 261/358 (72.91%), Positives = 290/358 (81.01%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSV---KPRFKWFNPSLSSTFSFLPCNSSLC 60
+GTPPQ D+VLDTGSQLSWIQCH V K KP+ F+PSLSS+FS LPCN +C
Sbjct: 72 IGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPSLSSSFSLLPCNHPIC 131
Query: 61 KPRIPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSA 120
KPRIPDFTLPTSCD R CHYSYFYADGTLAEGNLV EKFS + S P+ +GC + S
Sbjct: 132 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLSTPPVILGCAQAST 191
Query: 121 ENRGILGMNTGHLSFISQAKISKFSYCVPGRSRSDLTGLFYLGDNPNSGKFKYVNMLTFP 180
ENRGILGMN G LSFISQAKISKFSYCVP R+ S+ TGLFYLGDNPNS +FKYV MLTFP
Sbjct: 192 ENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPNSSRFKYVTMLTFP 251
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
ESQ+SPNLD LAYTLPMKGI+I +L ISPA FKPD GSGQTMIDSGS+LTYLVDEAY
Sbjct: 252 ESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMIDSGSDLTYLVDEAY 311
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
V+ E+VRLVG MKKGY YA+VADMCFD + A GRRIG + F+F+NGVEILVG+GE
Sbjct: 312 EKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFEFDNGVEILVGRGE 371
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCSGLK 352
G+LT VEKGVKCVG GRS RLG SN+IG VHQQNMWVEYDL N+R+GFGGA CS LK
Sbjct: 372 GVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRIGFGGAECSRLK 429
BLAST of CmoCh01G003500 vs. NCBI nr
Match:
gi|1009128861|ref|XP_015881464.1| (PREDICTED: aspartic proteinase PCS1 [Ziziphus jujuba])
HSP 1 Score: 459.9 bits (1182), Expect = 4.0e-126
Identity = 230/356 (64.61%), Positives = 273/356 (76.69%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ MVLDTGSQLSWIQCH V P F+PSLSSTFS LPC +CKPR
Sbjct: 92 IGTPPQTQQMVLDTGSQLSWIQCHK--KAPRVPPPTASFDPSLSSTFSVLPCTHPICKPR 151
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAENR 120
+PDFTLPT CDP R CHYSYFYADGTLAEGNLV EKF+F+ S P+A+GC K ++++
Sbjct: 152 VPDFTLPTDCDPNRLCHYSYFYADGTLAEGNLVREKFAFSTSVSTPPLALGCAKDPSDSK 211
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGRS--RSDL-TGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G LSF SQA+I+KFSYC+P R R L TG FYLG+NPNSG FKY+++LTFP
Sbjct: 212 GILGMNLGRLSFASQARITKFSYCIPTRRNLRGSLPTGSFYLGNNPNSGGFKYIDLLTFP 271
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
+SQ PNLD LAYT+ M+GIRIG +L I P VF+PD +GSGQTMIDSGSE T+LVDEAY
Sbjct: 272 QSQRMPNLDPLAYTVAMQGIRIGTKKLNIPPTVFRPDASGSGQTMIDSGSEFTFLVDEAY 331
Query: 241 NNVRAEIVRLVGPMMKKGYEYA-SVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKG 300
N VR EIVRLVGP +KKGY Y+ VADMCFDG + GR +G+M F+F+ GVEI+V +
Sbjct: 332 NKVREEIVRLVGPRIKKGYVYSGGVADMCFDGNV-MEIGRLVGDMAFEFDKGVEIVVPRD 391
Query: 301 EGLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
+ +L V GV+C+ IGRS LG SN+IGN HQQN+WVE+DLAN+RVGFG A CS
Sbjct: 392 Q-MLADVGGGVRCLAIGRSSMLGAASNIIGNFHQQNLWVEFDLANRRVGFGKADCS 443
BLAST of CmoCh01G003500 vs. NCBI nr
Match:
gi|590648249|ref|XP_007032118.1| (Eukaryotic aspartyl protease family protein [Theobroma cacao])
HSP 1 Score: 458.8 bits (1179), Expect = 8.8e-126
Identity = 234/355 (65.92%), Positives = 265/355 (74.65%), Query Frame = 1
Query: 1 MGTPPQLMDMVLDTGSQLSWIQCHGTVNGKSVKPRFKWFNPSLSSTFSFLPCNSSLCKPR 60
+GTPPQ MVLDTGSQLSWIQCH V K P F+PSLSS+FS LPC LCKPR
Sbjct: 92 IGTPPQTQQMVLDTGSQLSWIQCHKKVARKPPPPPTS-FDPSLSSSFSVLPCTHPLCKPR 151
Query: 61 IPDFTLPTSCDPRRHCHYSYFYADGTLAEGNLVTEKFSFTAS----PIAIGCVKPSAENR 120
IPDFTLPTSCD R CHYSYFYADGTLAEGNLV EKF+F+ S P+ +GC ++E++
Sbjct: 152 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSRSQSTPPLILGCATDTSEDK 211
Query: 121 GILGMNTGHLSFISQAKISKFSYCVPGRSRS---DLTGLFYLGDNPNSGKFKYVNMLTFP 180
GILGMN G LSF SQAKISKFSYCVP R TG FYLG+NP+S F+YVN++ FP
Sbjct: 212 GILGMNLGRLSFASQAKISKFSYCVPTRRTQPGFSPTGSFYLGENPSSRGFQYVNLMIFP 271
Query: 181 ESQNSPNLDKLAYTLPMKGIRIGAVQLKISPAVFKPDPTGSGQTMIDSGSELTYLVDEAY 240
ES PN+D LAYTLPM+GIRIGA +L I +VF+PD GSGQTMIDSGSE TYLVD+AY
Sbjct: 272 ESGTRPNMDPLAYTLPMQGIRIGAKKLPIPTSVFRPDAGGSGQTMIDSGSEFTYLVDDAY 331
Query: 241 NNVRAEIVRLVGPMMKKGYEYASVADMCFDGAMAAAAGRRIGEMWFQFENGVEILVGKGE 300
N VR E+VRLVGP +KKGY Y VADMCFDG GR IG+M +FE GVEI V K E
Sbjct: 332 NKVREEVVRLVGPRIKKGYVYGGVADMCFDG-NPIEIGRLIGDMVLEFEKGVEITVEK-E 391
Query: 301 GLLTVVEKGVKCVGIGRSGRLGTESNMIGNVHQQNMWVEYDLANKRVGFGGAVCS 349
+L VE GV C+GIGRS LG SN+IGN HQQN+WVEYDL N+RVGFG A CS
Sbjct: 392 RVLADVEGGVHCLGIGRSSMLGAASNIIGNFHQQNLWVEYDLVNRRVGFGKADCS 443
The following BLAST results are available for this feature:
Swiss-Prot
Match Name | E-value | Identity (%) | Description
PCS1L_ARATH | 1.1e-56 | 35.37 | Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
NEP1_NEPGR | 1.9e-35 | 31.74 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR | 2.5e-35 | 32.59 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
ASPG1_ARATH | 6.7e-33 | 28.73 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
APF2_ARATH | 2.4e-30 | 30.28 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
TrEMBL
Match Name | E-value | Identity (%) | Description
A0A061EL58_THECC | 6.1e-126 | 65.92 | Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_017459 PE=...
A0A067JPK4_JATCU | 8.0e-126 | 65.63 | Uncharacterized protein OS=Jatropha curcas GN=JCGZ_22524 PE=3 SV=1
B9T2R1_RICCO | 3.0e-125 | 64.23 | Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 ...
W9SFW9_9ROSA | 6.8e-125 | 64.62 | Aspartic proteinase nepenthesin-2 OS=Morus notabilis GN=L484_000286 PE=3 SV=1
Q9FGI3_ARATH | 2.0e-124 | 63.84 | AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1
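The Identity column in the tables above is the percentage printed on each hit's statistics line (identical residues divided by alignment length). As a rough sketch of that arithmetic, the Python below parses one such line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen for illustration; they are not part of any BLAST tool or of this database.

```python
import re

# Matches a per-hit statistics line from this report, for example:
#   "Identity = 133/376 (35.37%), Positives = 197/376 (52.39%), Query Frame = 1"
# The optional "i" tolerates the label being spelled without it.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }
```

Applied to the PCS1L_ARATH statistics line, the recomputed identity agrees with the 35.37 shown in the summary table.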