Homology
BLAST of Moc01g23660 vs. NCBI nr
Match:
XP_022979057.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])
HSP 1 Score: 714.9 bits (1844), Expect = 4.4e-202
Identity = 354/466 (75.97%), Postives = 394/466 (84.55%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
MEF PI FL SI LLLS SSSSS +TLPLTAFPS PWKN+ +L SAS+ RA HL
Sbjct: 1 MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
Query: 61 KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
K + KS+ + AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61 KTPKTKSNTSI--QNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120
Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
S CSFPNV+ ATI KFIPKLSSSARI+GC NRKC+WIF PN++S CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTC 180
Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSVLSVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM 240
Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
LKRFS+CL RQFDDSPVSSPLVLD +SGD+ TN LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYY 300
Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360
Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG LALPPANY ALV ++GVVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTM 420
Query: 421 LTD--DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+TD +GG GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVNFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461
BLAST of Moc01g23660 vs. NCBI nr
Match:
XP_023543736.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 713.4 bits (1840), Expect = 1.3e-201
Identity = 353/466 (75.75%), Postives = 394/466 (84.55%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
MEF PIPFL SI LLLS SSSSS +TLPLT FPS PWKN+ +L SAS+ RA HL
Sbjct: 1 MEFFPIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFTHPWKNIKHLVSASLTRAQHL 60
Query: 61 KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
K R KS+ + AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61 KTPRIKSNTSI--QNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120
Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN++S CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKSLCRSCSPRSRKCSDTC 180
Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240
Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
LKRFS+CL RQFDDSPVSSPLVLD S+SG++ N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300
Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360
Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG LALPPANY ALV ++GVVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTM 420
Query: 421 LTD--DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+TD +GG GGGPAII GAFQQQN+LV+YDLAKDRIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKDRIGFRKQRC 461
BLAST of Moc01g23660 vs. NCBI nr
Match:
XP_022925946.1 (probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 706.1 bits (1821), Expect = 2.0e-199
Identity = 349/466 (74.89%), Postives = 393/466 (84.33%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
MEF IPFL SI LLLS SSSSS +TLPLT FPS PWKN+ +L SAS+ RA HL
Sbjct: 1 MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60
Query: 61 KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
K R KS+ + AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61 KTPRTKSNTSI--QNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120
Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTC 180
Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240
Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
LKRFS+CL RQFDDSPVSSPLVLD S+SG++ N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300
Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360
Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG LALPP+NY ALVA++ VVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTM 420
Query: 421 LTD--DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+TD +GG GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461
BLAST of Moc01g23660 vs. NCBI nr
Match:
KAG7034471.1 (Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 705.3 bits (1819), Expect = 3.5e-199
Identity = 349/466 (74.89%), Postives = 393/466 (84.33%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
MEF IPFL SI LLLS SSSSS +TLPLT FPS PWKN+ +L SAS+ RA HL
Sbjct: 1 MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60
Query: 61 K-NRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
K R KS+ + AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61 KIPRTKSNTSI--QNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120
Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTC 180
Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240
Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
LKRFS+CL RQFDDSPVSSPLVLD S+SG++ N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300
Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360
Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG LALPP+NY ALVA++ VVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTM 420
Query: 421 LTD--DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+TD +GG GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461
BLAST of Moc01g23660 vs. NCBI nr
Match:
XP_011657732.1 (probable aspartyl protease At4g16563 [Cucumis sativus] >KGN48299.1 hypothetical protein Csa_004059 [Cucumis sativus])
HSP 1 Score: 699.9 bits (1805), Expect = 1.5e-197
Identity = 348/464 (75.00%), Postives = 392/464 (84.48%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN 60
MEFLPIPFLFSIFLLL TSSSSS T LPLT FPS DP+K +N L SAS+ RA HLK
Sbjct: 1 MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60
Query: 61 RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRC 120
S+ ++ S L PRSYGAYSVS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y CSRC
Sbjct: 61 PQSKSNTSIQNVS-LFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRC 120
Query: 121 SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY 180
SFP V+ ATI+KF+PKLSSS ++VGC N KCAWIF PN++SRCRNC SR CSD+CPGY
Sbjct: 121 SFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGY 180
Query: 181 GIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLK 240
G+QYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLK
Sbjct: 181 GLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLK 240
Query: 241 RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL 300
RFS+CL SR FDDSPVSSPLVLD GS+S ++ T IY+PFRENPS S+AAFREYYYLSL
Sbjct: 241 RFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSL 300
Query: 301 RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRA 360
RRILIGGKPVKFPYKYL PDSTGNGG IIDSGSTFT LDKPIFEA+A+E EKQL+KYPRA
Sbjct: 301 RRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRA 360
Query: 361 TGVEARSGLRPCFNVSK-EKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLT 420
VEA+SGLRPCFN+ K E++ EFP++VLKFKGG +L+L NY A+V + GVVC+TM+T
Sbjct: 361 KDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMT 420
Query: 421 DD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
D+ VGG GGGPAIILGAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 DEAVVGG---GGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 460
BLAST of Moc01g23660 vs. ExPASy Swiss-Prot
Match:
Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)
HSP 1 Score: 196.1 bits (497), Expect = 9.0e-49
Identity = 158/501 (31.54%), Postives = 226/501 (45.11%), Query Frame = 0
Query: 4 LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKS 63
L P L + LSTS SS L L S+R +SA R HH + + +
Sbjct: 25 LSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSR-----------SSARFRRHHHKQQQQQL 84
Query: 64 SDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFP 123
S P S G+ Y +S+ G+ +S DTGS LVWFPC + C C
Sbjct: 85 S----------LPISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 144
Query: 124 NVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCS 183
+ + + LSSSA V C + C+ S NC T + S
Sbjct: 145 PLPPSPPS----SLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 204
Query: 184 DACPGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLP 243
CP + YG G L S++L LP V +F GC+ ++ +P G+ GFGRG SLP
Sbjct: 205 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 264
Query: 244 SQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN--------- 303
+Q+ + FSYCL S FD V P L G + G T+
Sbjct: 265 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 324
Query: 304 --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIID 363
N +++ ENP +Y +SL+ I IG + + P D G GG ++D
Sbjct: 325 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 384
Query: 364 SGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGLRPCFNVSKEKTVEFPELVLK 423
SG+TFT+L + +V EEF+ ++ + + RA VE SG+ PC+ ++ +TV+ P LVL
Sbjct: 385 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 444
Query: 424 FKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQ 461
F G + LP NYF + G + C+ ML + ++ GG ILG +QQ
Sbjct: 445 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCL-MLMNGGDESELRGGTGAILGNYQQ 491
BLAST of Moc01g23660 vs. ExPASy Swiss-Prot
Match:
Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)
HSP 1 Score: 153.7 bits (387), Expect = 5.1e-36
Identity = 118/390 (30.26%), Postives = 170/390 (43.59%), Query Frame = 0
Query: 80 GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSS 139
G Y +++ GTP Q S + DTGS L+W C C S P N P+ SSS
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFN--------PQGSSS 152
Query: 140 ARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGL-TAGFLLSETLD 199
+ C ++ C + P CS+ Y YG G T G + +ETL
Sbjct: 153 FSTLPCSSQLCQALSSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLT 212
Query: 200 LPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSP 259
+P+ GC AG+VG GRGP SLPSQ+ + +FSYC+ +P
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM-------TP 272
Query: 260 V--SSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKF- 319
+ S+P L GS + SP + +YY++L + +G +
Sbjct: 273 IGSSTPSNLLLGSLANSVTAG----SP--NTTLIQSSQIPTFYYITLNGLSVGSTRLPID 332
Query: 320 PYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPC 379
P + + G GG IIDSG+T T +++V +EF Q I P G + SG C
Sbjct: 333 PSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNG--SSSGFDLC 392
Query: 380 FNV-SKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGP 439
F S ++ P V+ F GG +L LP NYF + +G++C+ M + G
Sbjct: 393 FQTPSDPSNLQIPTFVMHFDGG-DLELPSENYF-ISPSNGLICLAMGSSSQG-------- 434
Query: 440 AIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
I G QQQN+LV YD + F +C
Sbjct: 453 MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
BLAST of Moc01g23660 vs. ExPASy Swiss-Prot
Match:
Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)
HSP 1 Score: 152.5 bits (384), Expect = 1.1e-35
Identity = 132/417 (31.65%), Postives = 188/417 (45.08%), Query Frame = 0
Query: 58 KNRNKSSDFVHKSKSALTPRSY---GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARY 117
+ R +S + + +S S + Y G Y +++ GTP + S + DTGS L+W C
Sbjct: 69 ERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP-- 128
Query: 118 LCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSD 177
C++C + F P+ SSS + C ++ C + S C++
Sbjct: 129 -CTQCF-----SQPTPIFNPQDSSSFSTLPCESQYCQDL--------------PSETCNN 188
Query: 178 ACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGP 237
Y YG G T G++ +ET VP+ GC AG++G G GP
Sbjct: 189 NECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 248
Query: 238 QSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKS-----GDTNTNGLIYSPFRENP 297
SLPSQ+ + +FSYC+ S SSP L GS + G +T LI+S NP
Sbjct: 249 LSLPSQLGVGQFSYCMTS-----YGSSSPSTLALGSAASGVPEGSPSTT-LIHSSL--NP 308
Query: 298 SASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEA 357
+ YYY++L+ I +GG + P G GG IIDSG+T T L + + A
Sbjct: 309 T--------YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNA 368
Query: 358 VAEEFEKQLIKYPRATGVEARSGLRPCF-NVSKEKTVEFPELVLKFKGGLELALPPANYF 417
VA+ F Q I P T E+ SGL CF S TV+ PE+ ++F GG+ L L N
Sbjct: 369 VAQAFTDQ-INLP--TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNIL 428
Query: 418 ALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
AE GV+C+ M + G I G QQQ V YDL + F +C
Sbjct: 429 ISPAE-GVICLAMGSSSQLG-------ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
BLAST of Moc01g23660 vs. ExPASy Swiss-Prot
Match:
Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)
HSP 1 Score: 147.1 bits (370), Expect = 4.8e-34
Identity = 137/468 (29.27%), Postives = 194/468 (41.45%), Query Frame = 0
Query: 18 TSSSSSITLPL---TAFPSTRAPDPW---------KNLNYLAS-ASIIRAHHLKNRNKSS 77
+ SSSSITL L A S + PD + + +A+ A+ I ++ + +
Sbjct: 66 SESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPG 125
Query: 78 DFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNV 137
F S L+ S G Y +G GTP + + V DTGS +VW C C RC
Sbjct: 126 GFSSSVVSGLSQGS-GEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP---CRRC----- 185
Query: 138 NTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYG 197
+ + F P+ S + + C + C + +R + C Y + YG
Sbjct: 186 YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCL------------YQVSYG 245
Query: 198 SG-LTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQ-------PAGIVGFGRGPQSLPSQM 257
G T G +ETL RV +GC H AG++G G+G S P Q
Sbjct: 246 DGSFTVGDFSTETLTFRRNRVKGVALGCG----HDNEGLFVGAAGLLGLGKGKLSFPGQT 305
Query: 258 RLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFRE 317
+ +FSYCL R S S P + FG N + R P S+
Sbjct: 306 GHRFNQKFSYCLVDR----SASSKPSSVVFG--------NAAVSRIARFTPLLSNPKLDT 365
Query: 318 YYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQ 377
+YY+ L I +GG V D GNGG IIDSG++ T L +P + A+ + F
Sbjct: 366 FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVG 425
Query: 378 LIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVV 437
RA S CF++S V+ P +VL F+G +++LP NY V +G
Sbjct: 426 AKTLKRAPDF---SLFDTCFDLSNMNEVKVPTVVLHFRGA-DVSLPATNYLIPVDTNGKF 484
Query: 438 CMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
C +GG I+G QQQ V YDLA R+GF C
Sbjct: 486 CFA-FAGTMGG-------LSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
BLAST of Moc01g23660 vs. ExPASy Swiss-Prot
Match:
Q8S9J6 (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)
HSP 1 Score: 144.1 bits (362), Expect = 4.1e-33
Identity = 131/431 (30.39%), Postives = 182/431 (42.23%), Query Frame = 0
Query: 46 LASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA------YSVSIGFGTPPQNLSFVF 105
L A + H ++ ++D V +SKS P G+ Y V++G GTP +LS +F
Sbjct: 90 LDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIF 149
Query: 106 DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVR 165
DTGS L W C C R + F P S+S V C + C +
Sbjct: 150 DTGSDLTWTQCQP---CVRTCYDQKEPI----FNPSKSTSYYNVSCSSAACGSL------ 209
Query: 166 SRCRNCTPNSRNCSDACPGYGIQYG-SGLTAGFLLSETLDLPDKRVPD-FLVGCSVLS-- 225
+ T N+ +CS + YGIQYG + GFL E L + V D GC +
Sbjct: 210 ---SSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQG 269
Query: 226 -VHQPAGIVGFGRGPQSLPSQMRL---KRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTN 285
AG++G GR S PSQ K FSYCL S S L FGS
Sbjct: 270 LFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL------PSSASYTGHLTFGSAG---- 329
Query: 286 TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSG 345
I + P ++ +Y L++ I +GG+ + P S G +IDSG
Sbjct: 330 ----ISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSG 389
Query: 346 STFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKG 405
+ T L + A+ F+ ++ KYP +GV S L CF++S KTV P++ F G
Sbjct: 390 TVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSFSG 449
Query: 406 GLELALPPANYFALVAESGVVCMTML--TDDVGGEKVGGGPAIILGAFQQQNILVEYDLA 461
G + L F V + VC+ +DD A I G QQQ + V YD A
Sbjct: 450 GAVVELGSKGIF-YVFKISQVCLAFAGNSDD--------SNAAIFGNVQQQTLEVVYDGA 473
BLAST of Moc01g23660 vs. ExPASy TrEMBL
Match:
A0A6J1IMR7 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813 PE=3 SV=1)
HSP 1 Score: 714.9 bits (1844), Expect = 2.1e-202
Identity = 354/466 (75.97%), Postives = 394/466 (84.55%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
MEF PI FL SI LLLS SSSSS +TLPLTAFPS PWKN+ +L SAS+ RA HL
Sbjct: 1 MEFFPIQFLLSIVLLLSASSSSSSITVTLPLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60
Query: 61 KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
K + KS+ + AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61 KTPKTKSNTSI--QNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120
Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
S CSFPNV+ ATI KFIPKLSSSARI+GC NRKC+WIF PN++S CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFGPNLKSSCRSCSPRSRKCSDTC 180
Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSVLSVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM 240
Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
LKRFS+CL RQFDDSPVSSPLVLD +SGD+ TN LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYY 300
Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360
Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG LALPPANY ALV ++GVVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTM 420
Query: 421 LTD--DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+TD +GG GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVNFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461
BLAST of Moc01g23660 vs. ExPASy TrEMBL
Match:
A0A6J1EDJ0 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111433208 PE=3 SV=1)
HSP 1 Score: 706.1 bits (1821), Expect = 9.9e-200
Identity = 349/466 (74.89%), Postives = 393/466 (84.33%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHL 60
MEF IPFL SI LLLS SSSSS +TLPLT FPS PWKN+ +L SAS+ RA HL
Sbjct: 1 MEFFLIPFLLSIVLLLSASSSSSSTTVTLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60
Query: 61 KN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLC 120
K R KS+ + AL PRSYGAYS+S+ FGTPPQ+LS VFDTGSSLVWFPCTA Y C
Sbjct: 61 KTPRTKSNTSI--QNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRC 120
Query: 121 SRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDAC 180
S CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD C
Sbjct: 121 SNCSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTC 180
Query: 181 PGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQM 240
PGYGIQYGSG TAGFLLSETLD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM
Sbjct: 181 PGYGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQM 240
Query: 241 RLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY 300
LKRFS+CL RQFDDSPVSSPLVLD S+SG++ N LIY+PFRENPS S+AAFREYYY
Sbjct: 241 GLKRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYY 300
Query: 301 LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKY 360
L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KY
Sbjct: 301 LTLRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKY 360
Query: 361 PRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTM 420
PRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG LALPP+NY ALVA++ VVC+TM
Sbjct: 361 PRAKGVEAESGLRPCFDISKEESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTM 420
Query: 421 LTD--DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+TD +GG GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Sbjct: 421 ITDVTFLGG---GGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRC 461
BLAST of Moc01g23660 vs. ExPASy TrEMBL
Match:
A0A0A0KHK2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G454470 PE=3 SV=1)
HSP 1 Score: 699.9 bits (1805), Expect = 7.1e-198
Identity = 348/464 (75.00%), Postives = 392/464 (84.48%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN 60
MEFLPIPFLFSIFLLL TSSSSS T LPLT FPS DP+K +N L SAS+ RA HLK
Sbjct: 1 MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60
Query: 61 RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRC 120
S+ ++ S L PRSYGAYSVS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y CSRC
Sbjct: 61 PQSKSNTSIQNVS-LFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRC 120
Query: 121 SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY 180
SFP V+ ATI+KF+PKLSSS ++VGC N KCAWIF PN++SRCRNC SR CSD+CPGY
Sbjct: 121 SFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGY 180
Query: 181 GIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLK 240
G+QYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLK
Sbjct: 181 GLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLK 240
Query: 241 RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL 300
RFS+CL SR FDDSPVSSPLVLD GS+S ++ T IY+PFRENPS S+AAFREYYYLSL
Sbjct: 241 RFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSL 300
Query: 301 RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRA 360
RRILIGGKPVKFPYKYL PDSTGNGG IIDSGSTFT LDKPIFEA+A+E EKQL+KYPRA
Sbjct: 301 RRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRA 360
Query: 361 TGVEARSGLRPCFNVSK-EKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLT 420
VEA+SGLRPCFN+ K E++ EFP++VLKFKGG +L+L NY A+V + GVVC+TM+T
Sbjct: 361 KDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMT 420
Query: 421 DD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
D+ VGG GGGPAIILGAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 DEAVVGG---GGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 460
BLAST of Moc01g23660 vs. ExPASy TrEMBL
Match:
A0A5A7SGF9 (Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold541G00670 PE=3 SV=1)
HSP 1 Score: 692.6 bits (1786), Expect = 1.1e-195
Identity = 340/462 (73.59%), Postives = 389/462 (84.20%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNR 60
MEFLPIPFLFSIFLLL TSSSSSITLPL FPS DP K +N+L SAS+ RA HLK+
Sbjct: 1 MEFLPIPFLFSIFLLLPTSSSSSITLPLATFPSIPFTDPLKTINHLLSASLSRAQHLKSP 60
Query: 61 NKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCS 120
S+ ++ S L PRSYGAY+VS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y C+ CS
Sbjct: 61 QSKSNTSTENVS-LFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAHCS 120
Query: 121 FPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYG 180
FP+V+ ATI+KF+PKLSSS +IVGC N KCAWIF PN++SRCRNC P SR CSD+CPGYG
Sbjct: 121 FPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPGYG 180
Query: 181 IQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKR 240
IQYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKR
Sbjct: 181 IQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240
Query: 241 FSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR 300
FS+CL R FDDSPVSSPLVLD G +S ++ T IY+PF+ENPS S+ AFREYYYLSLR
Sbjct: 241 FSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLSLR 300
Query: 301 RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRAT 360
RILIGGKPVKFPYKYL PDSTG GG IIDSGSTFT LDKPIFEA+A E EKQL+KYPRA
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPRAK 360
Query: 361 GVEARSGLRPCFNVSK-EKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTD 420
+EA++GLRPCFN+SK E++ EFPE+ LKFKGG +L+LPP NY +V ++ VVC+TM+T+
Sbjct: 361 DIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMMTN 420
Query: 421 -DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+V G VGGGPAII GAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 AEVVG--VGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKC 459
BLAST of Moc01g23660 vs. ExPASy TrEMBL
Match:
A0A1S3CHV2 (aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 PE=3 SV=1)
HSP 1 Score: 692.6 bits (1786), Expect = 1.1e-195
Identity = 340/462 (73.59%), Postives = 389/462 (84.20%), Query Frame = 0
Query: 1 MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNR 60
MEFLPIPFLFSIFLLL TSSSSSITLPL FPS DP K +N+L SAS+ RA HLK+
Sbjct: 1 MEFLPIPFLFSIFLLLPTSSSSSITLPLATFPSIPFTDPLKTINHLLSASLSRAQHLKSP 60
Query: 61 NKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCS 120
S+ ++ S L PRSYGAY+VS+ FGTPPQNLSF+FDTGSSLVWFPCTA Y C+ CS
Sbjct: 61 QSKSNTSTENVS-LFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAHCS 120
Query: 121 FPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYG 180
FP+V+ ATI+KF+PKLSSS +IVGC N KCAWIF PN++SRCRNC P SR CSD+CPGYG
Sbjct: 121 FPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPGYG 180
Query: 181 IQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKR 240
IQYGSG TAG LLSETLDL +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKR
Sbjct: 181 IQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240
Query: 241 FSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR 300
FS+CL R FDDSPVSSPLVLD G +S ++ T IY+PF+ENPS S+ AFREYYYLSLR
Sbjct: 241 FSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLSLR 300
Query: 301 RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRAT 360
RILIGGKPVKFPYKYL PDSTG GG IIDSGSTFT LDKPIFEA+A E EKQL+KYPRA
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPRAK 360
Query: 361 GVEARSGLRPCFNVSK-EKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTD 420
+EA++GLRPCFN+SK E++ EFPE+ LKFKGG +L+LPP NY +V ++ VVC+TM+T+
Sbjct: 361 DIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMMTN 420
Query: 421 -DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
+V G VGGGPAII GAFQQQN+LVEYDLAK RIGFRKQ+C
Sbjct: 421 AEVVG--VGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKC 459
BLAST of Moc01g23660 vs. TAIR 10
Match:
AT3G52500.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 496.5 bits (1277), Expect = 2.3e-140
Identity = 248/462 (53.68%), Postives = 323/462 (69.91%), Query Frame = 0
Query: 13 FLLLSTSSSSSITLPLTAFP-STRAP-DPWKNLNYLASASIIRAHHLKNRNK---SSDFV 72
F L+ S S++ LPL+ F S ++P DP+ +L LA +SI RAH LK+ D +
Sbjct: 8 FFLIFLSVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDAL 67
Query: 73 HKS--------KSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRC 132
+ KS L+ +SYG YSVS+ FGTP Q + FVFDTGSSLVW PCT+RYLCS C
Sbjct: 68 SSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGC 127
Query: 133 SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY 192
F ++ I +FIPK SSS++I+GC + KC +++ PNV +CR C PN+RNC+ CP Y
Sbjct: 128 DFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPNTRNCTVGCPPY 187
Query: 193 GIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLK 252
+QYG G TAG L++E LD PD VPDF+VGCS++S QPAGI GFGRGP SLPSQM LK
Sbjct: 188 ILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLK 247
Query: 253 RFSYCLASRQFDDSPVSSPLVLDFGS-KSGDTNTNGLIYSPFRENPSASDAAFREYYYLS 312
RFS+CL SR+FDD+ V++ L LD GS + + T GL Y+PFR+NP+ S+ AF EYYYL+
Sbjct: 248 RFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLN 307
Query: 313 LRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPR 372
LRRI +G K VK PYKYL+P + G+GG+I+DSGSTFT +++P+FE VAEEF Q+ Y R
Sbjct: 308 LRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTR 367
Query: 373 ATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLT 432
+E +GL PCFN+S + V PEL+ +FKGG +L LP +NYF V + VC+T+++
Sbjct: 368 EKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVS 427
Query: 433 DDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
D G GPAIILG+FQQQN LVEYDL DR GF K++C
Sbjct: 428 DKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
BLAST of Moc01g23660 vs. TAIR 10
Match:
AT5G45120.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 201.8 bits (512), Expect = 1.2e-51
Identity = 138/406 (33.99%), Postives = 196/406 (48.28%), Query Frame = 0
Query: 82 YSVSIGFGTPPQNLSFVFDTGSSLVWFPC-TARYLCSRC-SFPNVNTATITKFIPKLSSS 141
Y +++ GTPPQ + DTGS L W PC + C C N + + + F P SS+
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142
Query: 142 ARIVGCGNRKCAWI------FDPNVRSRCRNCTPNSRNCSDACPGYGIQYG-SGLTAGFL 201
+ C + C I FDP + C C CP + YG GL +G L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202
Query: 202 LSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL--KRFSYCLASRQF 261
+ L + VP F GC + +P GI GFGRG SLPSQ+ K FS+C +F
Sbjct: 203 TRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKF 262
Query: 262 DDSP-VSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGK-- 321
++P +SSPL+L + S + T+ L ++P P + YY+ L I IG
Sbjct: 263 VNNPNISSPLILGASALSINL-TDSLQFTPMLNTP-----MYPNSYYIGLESITIGTNIT 322
Query: 322 PVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSG 381
P + P DS GNGG ++DSG+T+T L +P + + + I YPRAT E+R+G
Sbjct: 323 PTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQ-STITYPRATETESRTG 382
Query: 382 LRPCFNV----------SKEKTVEFPELVLKFKGGLELALPPAN-YFALVAES-GVVCMT 441
C+ V + + FP + F L LP N ++A+ A S G V
Sbjct: 383 FDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQC 442
Query: 442 MLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRCV 462
+L ++ E GPA + G+FQQQN+ V YDL K+RIGF+ CV
Sbjct: 443 LLFQNM--EDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479
BLAST of Moc01g23660 vs. TAIR 10
Match:
AT4G16563.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 196.1 bits (497), Expect = 6.4e-50
Identity = 158/501 (31.54%), Postives = 226/501 (45.11%), Query Frame = 0
Query: 4 LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKS 63
L P L + LSTS SS L L S+R +SA R HH + + +
Sbjct: 25 LSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSR-----------SSARFRRHHHKQQQQQL 84
Query: 64 SDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFP 123
S P S G+ Y +S+ G+ +S DTGS LVWFPC + C C
Sbjct: 85 S----------LPISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESK 144
Query: 124 NVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCS 183
+ + + LSSSA V C + C+ S NC T + S
Sbjct: 145 PLPPSPPS----SLSSSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSS 204
Query: 184 DACPGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLP 243
CP + YG G L S++L LP V +F GC+ ++ +P G+ GFGRG SLP
Sbjct: 205 YPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLP 264
Query: 244 SQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN--------- 303
+Q+ + FSYCL S FD V P L G + G T+
Sbjct: 265 AQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEK 324
Query: 304 --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIID 363
N +++ ENP +Y +SL+ I IG + + P D G GG ++D
Sbjct: 325 KKKNEFVFTEMLENPK-----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVD 384
Query: 364 SGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGLRPCFNVSKEKTVEFPELVLK 423
SG+TFT+L + +V EEF+ ++ + + RA VE SG+ PC+ ++ +TV+ P LVL
Sbjct: 385 SGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN--QTVKVPALVLH 444
Query: 424 FKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQ 461
F G + LP NYF + G + C+ ML + ++ GG ILG +QQ
Sbjct: 445 FAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCL-MLMNGGDESELRGGTGAILGNYQQ 491
BLAST of Moc01g23660 vs. TAIR 10
Match:
AT3G61820.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 168.7 bits (426), Expect = 1.1e-41
Identity = 139/432 (32.18%), Postives = 189/432 (43.75%), Query Frame = 0
Query: 41 KNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFD 100
K++ LA+ S R + + F S L+ S G Y + +G GTP N+ V D
Sbjct: 95 KSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGS-GEYFMRLGVGTPATNVYMVLD 154
Query: 101 TGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS 160
TGS +VW C+ C C T F PK S + V CG+R C + D S
Sbjct: 155 TGSDVVWLQCSP---CKAC-----YNQTDAIFDPKKSKTFATVPCGSRLCRRLDD---SS 214
Query: 161 RCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQ- 220
C T S+ C Y + YG G T G +ETL RV +GC H
Sbjct: 215 EC--VTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCG----HDN 274
Query: 221 ------PAGIVGFGRGPQSLPSQMRLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGD 280
AG++G GRG S PSQ + + +FSYCL R S P + FG+ +
Sbjct: 275 EGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVP 334
Query: 281 TNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTII 340
+ +++P NP +YYL L I +GG V D+TGNGG II
Sbjct: 335 KTS---VFTPLLTNPKLD-----TFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 394
Query: 341 DSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLK 400
DSG++ T L +P + A+ + F K RA + S CF++S TV+ P +V
Sbjct: 395 DSGTSVTRLTQPAYVALRDAFRLGATKLKRA---PSYSLFDTCFDLSGMTTVKVPTVVFH 454
Query: 401 FKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDL 460
F GG E++LP +NY V G C G I+G QQQ V YDL
Sbjct: 455 FGGG-EVSLPASNYLIPVNTEGRFCFAFAGT--------MGSLSIIGNIQQQGFRVAYDL 483
BLAST of Moc01g23660 vs. TAIR 10
Match:
AT1G25510.1 (Eukaryotic aspartyl protease family protein )
HSP 1 Score: 165.6 bits (418), Expect = 9.3e-41
Identity = 125/391 (31.97%), Postives = 169/391 (43.22%), Query Frame = 0
Query: 75 TPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIP 134
T + G Y +G G P + + V DTGS + W CT C+ C T F P
Sbjct: 141 TTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP---CADCYH-----QTEPIFEP 200
Query: 135 KLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLL 194
SSS + C +C + S CRN T C Y + YG G T G
Sbjct: 201 SSSSSYEPLSCDTPQC----NALEVSECRNAT-----CL-----YEVSYGDGSYTVGDFA 260
Query: 195 SETLDLPDKRVPDFLVGCSVLS---VHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQF 254
+ETL + V + VGC + AG++G G G +LPSQ+ FSYCL R
Sbjct: 261 TETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDS 320
Query: 255 DDSPVSSPLVLDFG-SKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPV 314
D S +DFG S S D + +P N +YYL L I +GG+ +
Sbjct: 321 D-----SASTVDFGTSLSPDA-----VVAPLLRNHQLD-----TFYYLGLTGISVGGELL 380
Query: 315 KFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLR 374
+ P D +G+GG IIDSG+ T L I+ ++ + F K + +A GV +
Sbjct: 381 QIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGV---AMFD 440
Query: 375 PCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGG 434
C+N+S + TVE P + F GG LALP NY V G C+
Sbjct: 441 TCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPT--------AS 483
Query: 435 PAIILGAFQQQNILVEYDLAKDRIGFRKQRC 461
I+G QQQ V +DLA IGF +C
Sbjct: 501 SLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022979057.1 | 4.4e-202 | 75.97 | probable aspartyl protease At4g16563 [Cucurbita maxima] | [more] |
XP_023543736.1 | 1.3e-201 | 75.75 | probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo] | [more] |
XP_022925946.1 | 2.0e-199 | 74.89 | probable aspartyl protease At4g16563 [Cucurbita moschata] >KAG6604319.1 Aspartic... | [more] |
KAG7034471.1 | 3.5e-199 | 74.89 | Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_011657732.1 | 1.5e-197 | 75.00 | probable aspartyl protease At4g16563 [Cucumis sativus] >KGN48299.1 hypothetical ... | [more] |
Match Name | E-value | Identity | Description | |
Q940R4 | 9.0e-49 | 31.54 | Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... | [more] |
Q766C3 | 5.1e-36 | 30.26 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... | [more] |
Q766C2 | 1.1e-35 | 31.65 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... | [more] |
Q9LNJ3 | 4.8e-34 | 29.27 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... | [more] |
Q8S9J6 | 4.1e-33 | 30.39 | Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1IMR7 | 2.1e-202 | 75.97 | probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111478813... | [more] |
A0A6J1EDJ0 | 9.9e-200 | 74.89 | probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114332... | [more] |
A0A0A0KHK2 | 7.1e-198 | 75.00 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G45447... | [more] |
A0A5A7SGF9 | 1.1e-195 | 73.59 | Aspartic proteinase nepenthesin-2-like OS=Cucumis melo var. makuwa OX=1194695 GN... | [more] |
A0A1S3CHV2 | 1.1e-195 | 73.59 | aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 P... | [more] |