BLAST of CmaCh20G009810 vs. Swiss-Prot
Match:
ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2)
HSP 1 Score: 149.1 bits (375), Expect = 1.7e-34
Identity = 116/404 (28.71%), Postives = 188/404 (46.53%), Query Frame = 1
Query: 61 RRLRGFPNSNNRSNARMRLYDDLLLNG--------YYTTRLWIGTPPQKFALIVDTGSTV 120
+ L F + + R ++RM DL L G Y T++ +G+PP+++ + VDTGS +
Sbjct: 38 KNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97
Query: 121 TYVPCSTCELCGKHQD-----PKFDPELSSTYQPVKCNSD-CT--CDNDGVQ----CVYE 180
++ C C C + FD SST + V C+ D C+ +D Q C Y
Sbjct: 98 LWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYH 157
Query: 181 RQYAEMSTSSGVLGDDVISFGN-----QSALVPQRAVFGCENEETGDLYS--QRADGIMG 240
YA+ STS G D+++ ++ + Q VFGC ++++G L + DG+MG
Sbjct: 158 IVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMG 217
Query: 241 LGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYY 300
G + S++ QL G FS C + GGG +G + P + + +Y
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVK-GGGIFAVGVVDSPK--VKTTPMVPNQMHY 277
Query: 301 NVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKI 360
NV L + V G L L S+ G+++DSGTT +Y P+ + I+ K+
Sbjct: 278 NVMLMGMDVDGTSLDLPRSIVRNG-GTIVDSGTTLAYFPKVLYDSLIETIL--ARQPVKL 337
Query: 361 GGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLG 420
+ F+ CF S + + + FP V F++ KL++ P +YLF + YC G
Sbjct: 338 HIVEETFQ--CF----SFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLF--TLEEELYCFG 397
Query: 421 IFENG----NNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCS 434
G + LLG +++ N LV+YD ++ IG+ NCS
Sbjct: 398 WQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
BLAST of CmaCh20G009810 vs. Swiss-Prot
Match:
ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)
HSP 1 Score: 147.9 bits (372), Expect = 3.8e-34
Identity = 107/361 (29.64%), Postives = 165/361 (45.71%), Query Frame = 1
Query: 86 NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
+G Y R+ +G+PP+ +++D+GS + +V C C+LC K DP FDP S +Y V C
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187
Query: 146 SDCTCD---NDGVQ---CVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEE 205
S CD N G C YE Y + S + G L + ++F + V + GC +
Sbjct: 188 SS-VCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF---AKTVVRNVAMGCGHRN 247
Query: 206 TGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY--GGMDIGGGAMVLGGISPP 265
G A G++G+G G +S V QL G +F C G D G++V G + P
Sbjct: 248 RGMFIG--AAGLLGIGGGSMSFVGQL--SGQTGGAFGYCLVSRGTD-STGSLVFGREALP 307
Query: 266 --SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD----GRYGSVLDSGTTYSYL 325
+ + +P +Y V LK + V G ++PL VFD G G V+D+GT + L
Sbjct: 308 VGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRL 367
Query: 326 PQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQK 385
P A+ F++ + +L + G + DTC+ +G +S PTV F G
Sbjct: 368 PTAAYVAFRDGFKSQTANLPRASG--VSIFDTCYDLSGF----VSVRVPTVSFYFTEGPV 427
Query: 386 LSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTN 433
L+L N+L G YC + +++G I V +D + +GF
Sbjct: 428 LTLPARNFLMPVDD-SGTYCFAFAASPTG--LSIIGNIQQEGIQVSFDGANGFVGFGPNV 470
BLAST of CmaCh20G009810 vs. Swiss-Prot
Match:
ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)
HSP 1 Score: 147.9 bits (372), Expect = 3.8e-34
Identity = 105/363 (28.93%), Postives = 172/363 (47.38%), Query Frame = 1
Query: 86 NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
+G Y +R+ +GTP ++ L++DTGS V ++ C C C + DP F+P SSTY+ + C+
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCS 218
Query: 146 S-DCTCDNDGV----QCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEET 205
+ C+ +C+Y+ Y + S + G L D ++FGN + GC ++
Sbjct: 219 APQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKI--NNVALGCGHDNE 278
Query: 206 GDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMV------LGGI 265
G L++ A G++GLG G LSI +Q+ SFS C D G + + LGG
Sbjct: 279 G-LFTGAA-GLLGLGGGVLSITNQMKA-----TSFSYCLVDRDSGKSSSLDFNSVQLGGG 338
Query: 266 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD----GRYGSVLDSGTTYSY 325
+ ++ + +Y V L V G+K+ L ++FD G G +LD GT +
Sbjct: 339 DATAPLL---RNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 398
Query: 326 LPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKT-FPTVDLIFDNG 385
L +A+ ++A + +LKK G + DTC+ D + LS PTV F G
Sbjct: 399 LQTQAYNSLRDAFLKLTVNLKK-GSSSISLFDTCY-----DFSSLSTVKVPTVAFHFTGG 458
Query: 386 QKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWK 433
+ L L +NYL G +C ++ +++G + + T + YD + IG
Sbjct: 459 KSLDLPAKNYLIPVDD-SGTFCFAFAPTSSS--LSIIGNVQQQGTRITYDLSKNVIGLSG 500
BLAST of CmaCh20G009810 vs. Swiss-Prot
Match:
NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)
HSP 1 Score: 142.5 bits (358), Expect = 1.6e-32
Identity = 111/369 (30.08%), Postives = 169/369 (45.80%), Query Frame = 1
Query: 86 NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
+G Y L IGTP Q F+ I+DTGS + + C C C P F+P+ SS++ + C+
Sbjct: 92 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 146 SDC-------TCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENE 205
S TC N+ C Y Y + S + G +G + ++FG+ S +P FGC
Sbjct: 152 SQLCQALSSPTCSNN--FCQYTYGYGDGSETQGSMGTETLTFGSVS--IP-NITFGCGEN 211
Query: 206 ETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGA---MVLGGI- 265
G A G++G+G G LS+ QL D Y IG ++LG +
Sbjct: 212 NQGFGQGNGA-GLVGMGRGPLSLPSQL-------DVTKFSYCMTPIGSSTPSNLLLGSLA 271
Query: 266 ------SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF-----DGRYGSVLD 325
SP + +I S P +Y + L + V +LP++PS F +G G ++D
Sbjct: 272 NSVTAGSPNTTLIQSSQIPT---FYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIID 331
Query: 326 SGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVD 385
SGTT +Y A+ + ++ + +L + G F D CF SD + L PT
Sbjct: 332 SGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSGF-DLCFQ-TPSDPSNLQ--IPTFV 391
Query: 386 LIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHS 433
+ FD G L L ENY S +G CL + ++ ++ G I +N LV+YD +S
Sbjct: 392 MHFDGGD-LELPSENYFI--SPSNGLICLAM--GSSSQGMSIFGNIQQQNMLVVYDTGNS 434
BLAST of CmaCh20G009810 vs. Swiss-Prot
Match:
NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)
HSP 1 Score: 142.1 bits (357), Expect = 2.1e-32
Identity = 117/370 (31.62%), Postives = 177/370 (47.84%), Query Frame = 1
Query: 86 NGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCN 145
+G Y + IGTP F+ I+DTGS + + C C C P F+P+ SS++ + C
Sbjct: 93 DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCE 152
Query: 146 SD-C------TCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENE 205
S C TC+N+ +C Y Y + ST+ G + + +F +++ VP A FGC +
Sbjct: 153 SQYCQDLPSETCNNN--ECQYTYGYGDGSTTQGYMATETFTF--ETSSVPNIA-FGCGED 212
Query: 206 ETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLC---YGG-----MDIGGGAMV 265
G A G++G+G G LS+ QL GV FS C YG + +G A
Sbjct: 213 NQGFGQGNGA-GLIGMGWGPLSLPSQL---GV--GQFSYCMTSYGSSSPSTLALGSAASG 272
Query: 266 LGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF----DGRYGSVLDSGT 325
+ SP + +I S +P YY + L+ I V G L + S F DG G ++DSGT
Sbjct: 273 VPEGSPSTTLIHSSLNPT---YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGT 332
Query: 326 TYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFK--DTCFSGAGSDAAELSKTFPTVDL 385
T +YLPQ+A+ NA+ A + D + TCF SD + + P + +
Sbjct: 333 TLTYLPQDAY----NAVAQAFTDQINLPTVDESSSGLSTCFQ-QPSDGSTVQ--VPEISM 392
Query: 386 IFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQ--TTLLGGIIVRNTLVMYDREH 433
FD G L+L +N L S G CL + G++ Q ++ G I + T V+YD ++
Sbjct: 393 QFDGG-VLNLGEQNILI--SPAEGVICLAM---GSSSQLGISIFGNIQQQETQVLYDLQN 435
BLAST of CmaCh20G009810 vs. TrEMBL
Match:
A0A0A0LJB9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G277070 PE=3 SV=1)
HSP 1 Score: 1073.5 bits (2775), Expect = 6.9e-311
Identity = 526/640 (82.19%), Postives = 569/640 (88.91%), Query Frame = 1
Query: 1 MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
MA++P L+ A+LLH LSADPIS NPLL+PSHRAMVLPLY SSPNSSK IS PH
Sbjct: 1 MAKSPFLVAAILLHIF------LSADPISPNPLLSPSHRAMVLPLYLSSPNSSKFISNPH 60
Query: 61 RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
RRLR FP S+N SNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61 RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120
Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
E CG+HQDPKFDPE SSTY+P+KCN DC CD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180
Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240
Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL +FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGA 300
Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
VLDSGTTY+YLP EAF FK+AIM+ +HSLKKI GPDPNFKD CFSGAGSDAAELS FP
Sbjct: 301 VLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 360
Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
TVD++F+NGQKLSL PENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420
Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
+SKIGFWKTNCSELWERL ISD+NA PSVS SHD+D APASAPSE PH IP ++QI
Sbjct: 421 ANSKIGFWKTNCSELWERLRISDDNADGPSVSTKSHDSDIAPASAPSERPHYTIPGELQI 480
Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
GRITF ILLN SY LEPHIT LSDHIAQELNVSHSQV +LNFTMRGN SLIQLAILP G
Sbjct: 481 GRITFAILLNKSYTDLEPHITELSDHIAQELNVSHSQVIILNFTMRGNDSLIQLAILPYG 540
Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
SSE FSHATA TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGL I+V
Sbjct: 541 SSEIFSHATANTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLVIVV 600
Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
ILGLSA+G WF+ R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFVLRSRQQAINSYKPVNAAVPEQELQPL 633
BLAST of CmaCh20G009810 vs. TrEMBL
Match:
A0A061DHD4_THECC (Aspartyl protease family protein OS=Theobroma cacao GN=TCM_000732 PE=3 SV=1)
HSP 1 Score: 850.9 bits (2197), Expect = 1.0e-243
Identity = 420/628 (66.88%), Postives = 501/628 (79.78%), Query Frame = 1
Query: 21 FTLS-ADPISSNPLLTP-----SHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSN 80
F LS ++P +S PLL P + AM+LPL+ NSS+ S R L + ++ N
Sbjct: 19 FLLSRSNPSTSTPLLLPPPHHGARPAMILPLFPFPKNSSRTFSHSGRHLLRSDSHSSHPN 78
Query: 81 ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 140
ARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+TCE CG+HQDPKF P+
Sbjct: 79 ARMRLYDDLLLNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCATCEQCGRHQDPKFQPD 138
Query: 141 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 200
LSSTYQPVKCN DC+CD D VQC YERQYAEMS+SSGVLG+D+ISFGNQS LVPQRAVFG
Sbjct: 139 LSSTYQPVKCNLDCSCDTDRVQCTYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAVFG 198
Query: 201 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 260
CENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCYGGMDIGGGAMVLGGI
Sbjct: 199 CENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGI 258
Query: 261 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 320
S P +M+FSYSDP RSPYYN+DLK IHVAGK+LPL P+VFD +YG+VLDSGTTY+YLP+
Sbjct: 259 SSPPDMVFSYSDPERSPYYNIDLKAIHVAGKQLPLNPNVFDVKYGTVLDSGTTYAYLPEA 318
Query: 321 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 380
AF FKNAI+ L SLK+I GPDPN+ D CFSGA SD +ELSK FPTV+++FDN QKL L
Sbjct: 319 AFAAFKNAIIKELTSLKQIRGPDPNYNDICFSGASSDVSELSKIFPTVEMVFDNQQKLLL 378
Query: 381 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 440
APENYLFRHSKV G YCLGIF N D TTLLGGIIVRNTLV YDREH KIGFWKTNCSE
Sbjct: 379 APENYLFRHSKVRGGYCLGIFPN-EKDPTTLLGGIIVRNTLVTYDREHLKIGFWKTNCSE 438
Query: 441 LWERLHISDENAHAPSVSNTSHDT--DTAPASAPSESPHDMIPEDIQIGRITFDILLNIS 500
LWERL I+ + +PS S+ ++ ++ P SAP S H IP +IQIG IT D+ L+I
Sbjct: 439 LWERLRINGAPSPSPSSSSGKDNSTVESPPTSAPDGSSHYAIPGEIQIGEITLDMSLSID 498
Query: 501 YKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATT 560
Y +L+PHI L++ IA+EL+V+ SQV LL+FT GN SL+ AI+P+GS+ + S+ A +
Sbjct: 499 YSYLKPHINELAEFIAKELDVNASQVHLLDFTSEGNSSLVTWAIVPSGSATYISNVAAIS 558
Query: 561 IISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVW 620
IIS + EH ++LP +G+YQ+++W VEP + ++ W++ Y++V LAIM+T+I+GLSA G W
Sbjct: 559 IISQLAEHRVRLPDTFGNYQLVQWKVEPSVQQTWWQQHYLVVLLAIMITIIVGLSASGGW 618
Query: 621 FIWRRRQQAFHSYKPVNAAAPEQELQTL 641
IWRRRQQA YKPV+ A EQELQ L
Sbjct: 619 IIWRRRQQALKLYKPVDGAVSEQELQPL 645
BLAST of CmaCh20G009810 vs. TrEMBL
Match:
A0A0D2QM13_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G056000 PE=3 SV=1)
HSP 1 Score: 850.1 bits (2195), Expect = 1.7e-243
Identity = 421/641 (65.68%), Postives = 507/641 (79.10%), Query Frame = 1
Query: 6 NLLLAVLLHFLHLTHFTLSADPISSNP--LLTPSHR----AMVLPLYRSSPNSSKLISKP 65
NL + ++ FL F LS S++P LL P H AMVLPL+ SS NSS+
Sbjct: 7 NLAVGTVVFFLL---FLLSQSNPSTSPPRLLPPPHHGARPAMVLPLFPSSKNSSRTFLHS 66
Query: 66 HRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCST 125
HR L + ++ NARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+T
Sbjct: 67 HRHLLRSDSHSSHPNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCAT 126
Query: 126 CELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVIS 185
CE CG+HQDPKF P+LSSTYQPVKCN DC CD+D VQC+YERQYAEMS+SSGVLG+D+IS
Sbjct: 127 CEQCGRHQDPKFQPDLSSTYQPVKCNLDCNCDSDRVQCIYERQYAEMSSSSGVLGEDIIS 186
Query: 186 FGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 245
FGNQS LVPQRAVFGCENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCY
Sbjct: 187 FGNQSELVPQRAVFGCENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCY 246
Query: 246 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYG 305
GGMDIGGGAMVLGGIS PS+M+FSY+DPVRSPYY++ LKEIHVAGK+L L PSVFDG+YG
Sbjct: 247 GGMDIGGGAMVLGGISAPSDMVFSYADPVRSPYYSIGLKEIHVAGKQLSLNPSVFDGKYG 306
Query: 306 SVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTF 365
+VLDSGTTY+YLP+ AF FK AI+ L+ LK+I GPDPN+ D CFS A SD +ELSKTF
Sbjct: 307 TVLDSGTTYAYLPEPAFLAFKEAILKELNGLKQIRGPDPNYNDICFSTASSDVSELSKTF 366
Query: 366 PTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYD 425
PTV+++F + QKL L+PENYLFRHSKVHGAYCLGIF+N D TTLLGGIIVRNTLV YD
Sbjct: 367 PTVEMVFGDQQKLLLSPENYLFRHSKVHGAYCLGIFQN-EKDPTTLLGGIIVRNTLVTYD 426
Query: 426 REHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQ 485
REHSKIGFWKTNCSELWERLHI+ + PS S + T++ +A SPH P IQ
Sbjct: 427 REHSKIGFWKTNCSELWERLHITGALSPTPSSSGKGNSTESPTTTASDGSPHYDFPGKIQ 486
Query: 486 IGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPN 545
IG+I D+ L+ ++ +L+P I L++ IA+EL+V+ SQV LLNFT GN SL++LAI+P+
Sbjct: 487 IGKIILDMSLSTNHSYLKPQINKLTEFIAKELDVNASQVHLLNFTSEGNSSLVRLAIVPS 546
Query: 546 GSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIM 605
SS + TA IIS + EH +KLP +G+YQ+++W VEP ++ W R Y++V +A++
Sbjct: 547 DSSTYIYKETARNIISRLAEHRVKLPDTFGNYQLVQWKVEPSTKQTWWGRNYMVVVVALI 606
Query: 606 VTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
+ +++GLS GVW +WRR+QQ +SYKPV AAAPEQELQ L
Sbjct: 607 IIVVIGLSVYGVWGMWRRKQQTVNSYKPVGAAAPEQELQPL 643
BLAST of CmaCh20G009810 vs. TrEMBL
Match:
G7JCS6_MEDTR (Eukaryotic aspartyl protease family protein OS=Medicago truncatula GN=MTR_4g095270 PE=3 SV=2)
HSP 1 Score: 842.4 bits (2175), Expect = 3.6e-241
Identity = 418/641 (65.21%), Postives = 501/641 (78.16%), Query Frame = 1
Query: 1 MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
MAR L+ +L+ LH+TH T++ D L H AM+LPLY ++PNSS P
Sbjct: 1 MARPLTHLILILI--LHITH-TIAGD----TAFLRNRHHAMILPLYLTTPNSSTSALDPR 60
Query: 61 RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
R+L G S NARMRL+DDLLLNGYYTTRLWIGTPPQ FALIVDTGSTVTYVPCSTC
Sbjct: 61 RQLHG-SESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC 120
Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
E CG+HQDPKF P+LSSTYQPVKC DC CDND +QCVYERQYAEMSTSSGVLG+DV+SF
Sbjct: 121 EQCGRHQDPKFQPDLSSTYQPVKCTLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSF 180
Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
GNQS L PQRAVFGCEN ETGDLYSQ ADGIMGLG GDLSI+DQLV+K V++DSFSLCYG
Sbjct: 181 GNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYG 240
Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
GMD+GGGAMVLGGISPPS+M+F+ SDPVRSPYYN+DLKEIHVAGK+LPL PSVFDG++GS
Sbjct: 241 GMDVGGGAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGS 300
Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
VLDSGTTY+YLP+EAF FK AI+ L S +I GPDPN+ D CFSGAG D ++LSKTFP
Sbjct: 301 VLDSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFP 360
Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
VD+IF NG K SL+PENY+FRHSKV GAYCLGIF+NG D TTLLGGI+VRNTLV+YDR
Sbjct: 361 VVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNG-KDPTTLLGGIVVRNTLVLYDR 420
Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTA-PASAPSESPHDMIPEDIQ 480
E +KIGFWKTNC+ELWERL IS P + ++ T + P+ APS S H++ + Q
Sbjct: 421 EQTKIGFWKTNCAELWERLQISSAPPPMPPNTEATNSTKSVDPSVAPSVSQHNIPRGEFQ 480
Query: 481 IGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPN 540
I +IT + NISY ++P +T L+ IA ELNV+ SQ+ LLNFT GN SL + AI P
Sbjct: 481 IAQITIAVSFNISYDDMKPRLTELAGLIAHELNVNTSQIHLLNFTSSGNDSLSRWAITPR 540
Query: 541 GSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIM 600
+++FS++TA II + EH M+LP +GSY++I WNV P R+ W+R Y++VGLA++
Sbjct: 541 PYADYFSNSTAMNIIGRLAEHRMQLPDAFGSYKLIDWNVMPPSKRNWWQRYYMIVGLAVL 600
Query: 601 VTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
+T +LGLS G +FIW+RR+Q+ HSYKPV+ A PEQELQ L
Sbjct: 601 LTSLLGLSIFG-FFIWKRRRQSAHSYKPVDVAVPEQELQPL 631
BLAST of CmaCh20G009810 vs. TrEMBL
Match:
V4SQ94_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025144mg PE=3 SV=1)
HSP 1 Score: 840.1 bits (2169), Expect = 1.8e-240
Identity = 417/643 (64.85%), Postives = 510/643 (79.32%), Query Frame = 1
Query: 1 MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHR--AMVLPLYRSSPNSSKLISK 60
MAR LL ++ F+++ + ++P +S + AMVLPLY S PN S+ IS
Sbjct: 1 MARASIPLLTTIVAFVYV----IQSNPATSTATILHGRTRPAMVLPLYLSQPNISRSISI 60
Query: 61 PHRRL-RGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPC 120
R L R PNS+ NARMRLYDDLLLNGYYTTRLWIGTPPQ FALIVDTGSTVTYVPC
Sbjct: 61 SRRHLQRSHPNSH--PNARMRLYDDLLLNGYYTTRLWIGTPPQTFALIVDTGSTVTYVPC 120
Query: 121 STCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDV 180
+TCE CG HQDPKF+P+LSSTYQPVKCN C CD + QCVYER+YAEMS+SSGVLG+D+
Sbjct: 121 ATCEHCGDHQDPKFEPDLSSTYQPVKCNLYCNCDRERAQCVYERKYAEMSSSSGVLGEDI 180
Query: 181 ISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSL 240
ISFGN+S L PQRAVFGCEN ETGDLYSQ ADGI+GLG GDLS+VDQLVEKGVI+DSFSL
Sbjct: 181 ISFGNESDLKPQRAVFGCENVETGDLYSQHADGIIGLGRGDLSVVDQLVEKGVISDSFSL 240
Query: 241 CYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGR 300
CYGGMD+GGGAMVLGGISPP +M+F++SDPVRSPYYN+DLK IHVAGK LPL P VFDG+
Sbjct: 241 CYGGMDVGGGAMVLGGISPPKDMVFTHSDPVRSPYYNIDLKVIHVAGKPLPLNPKVFDGK 300
Query: 301 YGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSK 360
+G+VLDSGTTY+YLP+ AF FK+AIM+ L SLK+I GPDPN+ D CFSGA SD ++LS
Sbjct: 301 HGTVLDSGTTYAYLPEAAFLAFKDAIMSELQSLKQIRGPDPNYNDICFSGAPSDVSQLSD 360
Query: 361 TFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVM 420
TFP V++ F NGQKL L+PENYLFRHSKV GAYCLGIF+NG D TTLLGGIIVRNTLVM
Sbjct: 361 TFPAVEMAFGNGQKLLLSPENYLFRHSKVRGAYCLGIFQNG-RDPTTLLGGIIVRNTLVM 420
Query: 421 YDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPED 480
YDREHSKIGFWKTNCSELWERLHI+ + PS +S +++ +PSE P+ ++P D
Sbjct: 421 YDREHSKIGFWKTNCSELWERLHITGALSPIPS---SSEGKNSSTDLSPSEPPNYVLPGD 480
Query: 481 IQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAIL 540
+QIGRITFD+ L+I+Y L PHI L+D IAQEL+V+ SQV LLNF +GN+S I A+
Sbjct: 481 LQIGRITFDMFLSINYSDLRPHIPELADSIAQELDVNTSQVHLLNFMSKGNNSFIAWAVF 540
Query: 541 PNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLA 600
P+GS+ + S+ATA IIS + EH + +P +G+Y++++WN+EP + R+ W+ +++V LA
Sbjct: 541 PSGSANYISNATALRIISRLAEHRVHIPDTFGNYKLLQWNIEPQVKRTWWQEHFLMVVLA 600
Query: 601 IMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
I + +++GLS G+ FI RRR Q+ +SYKPV+AA PEQELQ L
Sbjct: 601 ITIMMVVGLSVFGILFILRRRHQSVNSYKPVDAALPEQELQPL 633
BLAST of CmaCh20G009810 vs. TAIR10
Match:
AT3G50050.1 (AT3G50050.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 714.9 bits (1844), Expect = 4.4e-206
Identity = 357/607 (58.81%), Postives = 451/607 (74.30%), Query Frame = 1
Query: 37 SHRAMVLPLYRSSPNSS-KLISKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWI 96
S R MV PL+ S PNSS + IS PHR+L +S + ++RMRLYDDLL+NGYYTTRLWI
Sbjct: 41 SRRPMVFPLFLSQPNSSSRSISIPHRKLHK-SDSKSLPHSRMRLYDDLLINGYYTTRLWI 100
Query: 97 GTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGV 156
GTPPQ FALIVD+GSTVTYVPCS CE CGKHQDPKF PE+SSTYQPVKCN DC CD+D
Sbjct: 101 GTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMDCNCDDDRE 160
Query: 157 QCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLG 216
QCVYER+YAE S+S GVLG+D+ISFGN+S L PQRAVFGCE ETGDLYSQRADGI+GLG
Sbjct: 161 QCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLG 220
Query: 217 SGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNV 276
GDLS+VDQLV+KG+I++SF LCYGGMD+GGG+M+LGG PS+M+F+ SDP RSPYYN+
Sbjct: 221 QGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNI 280
Query: 277 DLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGG 336
DL I VAGK+L L VFDG +G+VLDSGTTY+YLP AF F+ A+M + +LK+I G
Sbjct: 281 DLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDG 340
Query: 337 PDPNFKDTCFSGAGSD-AAELSKTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGI 396
PDPNFKDTCF A S+ +ELSK FP+V+++F +GQ L+PENY+FRHSKVHGAYCLG+
Sbjct: 341 PDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGV 400
Query: 397 FENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNT 456
F NG D TTLLGGI+VRNTLV+YDRE+SK+GFW+TNCSEL +RLHI A SN
Sbjct: 401 FPNG-KDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSDRLHIDGAPPPATLPSND 460
Query: 457 SHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVS 516
S+ PS + + Q+G+I DI L ++ +L+P I LS ++EL+V
Sbjct: 461 SN---------PSHNSSSNLSGVTQVGQINLDIQLTVNSSYLKPRIEDLSKIFSKELDVK 520
Query: 517 HSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVI 576
SQV L N T +GN SL+++ +LP S +FS+ TAT I+S H +KLP +G+YQ++
Sbjct: 521 SSQVSLSNLTSKGNESLVRMVVLPPEPSTWFSNVTATNIVSRFTNHQIKLPEIFGNYQLV 580
Query: 577 RWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVN-AAAP 636
+ +EP R+ + + +G+ + +I+GLSA G W IW+R+Q + YKPV+ A
Sbjct: 581 NYKLEPPRKRTNNNIVVIAIGI---IAVIVGLSAYGAWLIWKRKQTSI-PYKPVDEAIVA 632
Query: 637 EQELQTL 641
EQELQ +
Sbjct: 641 EQELQPI 632
BLAST of CmaCh20G009810 vs. TAIR10
Match:
AT5G43100.1 (AT5G43100.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 705.3 bits (1819), Expect = 3.5e-203
Identity = 350/626 (55.91%), Postives = 455/626 (72.68%), Query Frame = 1
Query: 16 LHLTHFTLSADPISSNPLLTPSHRAMVLPL-YRSSPNSSKLISKPHRRLRGFPNSNNRSN 75
L L FT + I L T M+ PL Y S P ++ RRL + + N
Sbjct: 6 LLLLLFTTTTISIFFFDLTTADESPMIFPLSYSSLPPRPRVEDFRRRRL----HQSQLPN 65
Query: 76 ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 135
A M+LYDDLL NGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC+ CGKHQDPKF PE
Sbjct: 66 AHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPE 125
Query: 136 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 195
LS++YQ +KCN DC CD++G CVYER+YAEMS+SSGVL +D+ISFGN+S L PQRAVFG
Sbjct: 126 LSTSYQALKCNPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFG 185
Query: 196 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 255
CENEETGDL+SQRADGIMGLG G LS+VDQLV+KGVI D FSLCYGGM++GGGAMVLG I
Sbjct: 186 CENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKI 245
Query: 256 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 315
SPP M+FS+SDP RSPYYN+DLK++HVAGK L L P VF+G++G+VLDSGTTY+Y P+E
Sbjct: 246 SPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKE 305
Query: 316 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 375
AF K+A++ + SLK+I GPDPN+ D CFSGAG D AE+ FP + + F NGQKL L
Sbjct: 306 AFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLIL 365
Query: 376 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 435
+PENYLFRH+KV GAYCLGIF + D TTLLGGI+VRNTLV YDRE+ K+GF KTNCS+
Sbjct: 366 SPENYLFRHTKVRGAYCLGIFP--DRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSD 425
Query: 436 LWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDILLNISYK 495
+W RL + A +S + ++ +P+ A SESP +P ++G ITF++ ++++
Sbjct: 426 IWRRLAAPESPAPTSPISQ-NKSSNISPSPATSESPTSHLPGVFRVGVITFEVSISVNNS 485
Query: 496 HLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATTII 555
L+P + ++D IA EL++ +QVRLLNF+ GN ++ + P SSE+ S+ TA I+
Sbjct: 486 SLKPKFSEIADFIAHELDIQSAQVRLLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIM 545
Query: 556 SLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVWFI 615
L+ E+ ++LP ++GSY+++ W E +S W++ + V M++L++ + + +
Sbjct: 546 LLLKENRLRLPGQFGSYKLLEWKAEQKKKQSWWEKHLLGVVGGAMISLLVTSVMIKLALV 605
Query: 616 WRRRQQAFHSYKPVNAAAPEQELQTL 641
WRRR+Q +Y+PVNAA EQELQ L
Sbjct: 606 WRRRKQEEATYEPVNAAIKEQELQPL 624
BLAST of CmaCh20G009810 vs. TAIR10
Match:
AT5G22850.1 (AT5G22850.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 201.4 bits (511), Expect = 1.6e-51
Identity = 142/418 (33.97%), Postives = 210/418 (50.24%), Query Frame = 1
Query: 82 DLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPK-----FDPELS 141
D + G Y T+L +GTPP+ F + VDTGS V +V C++C C + + FDP S
Sbjct: 74 DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133
Query: 142 STYQPVKC----------NSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGN--QS 201
T P+ C +SD C C Y QY + S +SG DV+ F S
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193
Query: 202 ALVPQR---AVFGCENEETGDLY-SQRA-DGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 261
+LVP VFGC +TGDL S RA DGI G G +S++ QL +G+ FS C
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 262 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVF--DGR 321
G + GGG +VLG I P+ M+F+ P P+YNV+L I V G+ LP+ PSVF
Sbjct: 254 KGENGGGGILVLGEIVEPN-MVFTPLVP-SQPHYNVNLLSISVNGQALPINPSVFSTSNG 313
Query: 322 YGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSK 381
G+++D+GTT +YL + A+ PF AI NA+ + P + + C+ S +
Sbjct: 314 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR---PVVSKGNQCYVITTS----VGD 373
Query: 382 TFPTVDLIFDNGQKLSLAPENYLFRHSKVHG--AYCLGIFENGNNDQTTLLGGIIVRNTL 441
FP V L F G + L P++YL + + V G +C+G F+ N T+LG +++++ +
Sbjct: 374 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG-FQRIQNQGITILGDLVLKDKI 433
Query: 442 VMYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDM 474
+YD +IG+ +CS + N A S S S + S + +P +
Sbjct: 434 FVYDLVGQRIGWANYDCS--------TSVNVSATSSSGRSEYVNAGQFSENAAAPQKL 473
BLAST of CmaCh20G009810 vs. TAIR10
Match:
AT1G08210.1 (AT1G08210.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 199.5 bits (506), Expect = 6.2e-51
Identity = 152/456 (33.33%), Postives = 226/456 (49.56%), Query Frame = 1
Query: 7 LLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRS--SPNSSKLISKPHRRLR 66
++ AVLL L T +D + L P + + L R+ S +L+ P +
Sbjct: 11 IIAAVLL--LAATTLACGSDAVLKLERLIPPNHELGLTELRAFDSARHGRLLQSPVGGVV 70
Query: 67 GFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCG 126
FP D L G Y T++ +GTPP++F + +DTGS V +V C++C C
Sbjct: 71 NFPVDGA---------SDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCP 130
Query: 127 KHQDPK-----FDPELSSTYQPVKCN-----------SDCTCDNDGVQCVYERQYAEMST 186
K + + FDP +SS+ V C+ S C+ +N C Y +Y + S
Sbjct: 131 KTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNN---LCSYSFKYGDGSG 190
Query: 187 SSGVLGDDVISFGN--QSALVPQRA---VFGCENEETGDLYSQR--ADGIMGLGSGDLSI 246
+SG D +SF S L + VFGC N ++GDL R DGI GLG G LS+
Sbjct: 191 TSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSV 250
Query: 247 VDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRS-PYYNVDLKEI 306
+ QL +G+ FS C G GGG MVLG I P + Y+ V S P+YNV+L+ I
Sbjct: 251 ISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTV---YTPLVPSQPHYNVNLQSI 310
Query: 307 HVAGKKLPLEPSVFD--GRYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDP 366
V G+ LP++PSVF G+++D+GTT +YLP EA+ PF A+ NA + + G P
Sbjct: 311 AVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANA---VSQYGRPIT 370
Query: 367 NFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSLAPENYL-FRHSKVHGAYCLGIFEN 426
CF D FP V L F G + L P YL S +C+G F+
Sbjct: 371 YESYQCFEITAGDV----DVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIG-FQR 430
Query: 427 GNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCS 434
++ + T+LG +++++ +V+YD +IG+ + +CS
Sbjct: 431 MSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
BLAST of CmaCh20G009810 vs. TAIR10
Match:
AT2G36670.1 (AT2G36670.1 Eukaryotic aspartyl protease family protein)
HSP 1 Score: 182.2 bits (461), Expect = 1.0e-45
Identity = 127/377 (33.69%), Postives = 198/377 (52.52%), Query Frame = 1
Query: 89 YTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPK-----FDPELSSTYQPVK 148
Y T++ +G+PP +F + +DTGS + +V CS+C C FD S T V
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 149 CNSDCTCDN----------DGVQCVYERQYAEMSTSSG-----------VLGDDVISFGN 208
C SD C + + QC Y +Y + S +SG +LG+ +++ N
Sbjct: 165 C-SDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA--N 224
Query: 209 QSALVPQRAVFGCENEETGDLYS--QRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 268
SA + VFGC ++GDL + DGI G G G LS+V QL +G+ FS C
Sbjct: 225 SSAPI----VFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLK 284
Query: 269 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFD--GRY 328
G GGG VLG I P M++S P P+YN++L I V G+ LPL+ +VF+
Sbjct: 285 GDGSGGGVFVLGEILVPG-MVYSPLVP-SQPHYNLNLLSIGVNGQMLPLDAAVFEASNTR 344
Query: 329 GSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKT 388
G+++D+GTT +YL +EA+ F NAI N S+ ++ P + + C+ + S +S
Sbjct: 345 GTIVDTGTTLTYLVKEAYDLFLNAISN---SVSQLVTPIISNGEQCYLVSTS----ISDM 404
Query: 389 FPTVDLIFDNGQKLSLAPENYLFRHSKVHGA--YCLGIFENGNNDQTTLLGGIIVRNTLV 434
FP+V L F G + L P++YLF + GA +C+G F+ +Q T+LG +++++ +
Sbjct: 405 FPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG-FQKAPEEQ-TILGDLVLKDKVF 463
BLAST of CmaCh20G009810 vs. NCBI nr
Match:
gi|659115870|ref|XP_008457780.1| (PREDICTED: aspartic proteinase CDR1-like [Cucumis melo])
HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 531/640 (82.97%), Postives = 573/640 (89.53%), Query Frame = 1
Query: 1 MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
MA++P LLL +L HF LSADPIS NPL+TPSHRAMVLPLY SS NSSK IS PH
Sbjct: 1 MAKSPFLLLPAIL-----LHFFLSADPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPH 60
Query: 61 RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
R LR FP S+NRSNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61 RHLRQFPTSDNRSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120
Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
E CG+HQDPKFDPE SSTY+P+KCN DCTCD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180
Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240
Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL S+FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSSIFDGRYGT 300
Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
VLDSGTTY+YLP EAFG FK+AIM+ LHSLKKI GPDPNFKD CFSGAGSDAAELS FP
Sbjct: 301 VLDSGTTYAYLPAEAFGAFKDAIMDELHSLKKIDGPDPNFKDICFSGAGSDAAELSNIFP 360
Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
TVD++F+NGQKLSLAPENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLAPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420
Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
HSKIGFWKTNCSELWERL SD+NAHAPS+S SH +D APASAP ESPH IP ++QI
Sbjct: 421 AHSKIGFWKTNCSELWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQI 480
Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
GRITF+ILLN SY LEPHIT LSDHIAQELNVSHSQV LLNFTMRGN SLI+LAI+P G
Sbjct: 481 GRITFEILLNKSYTDLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYG 540
Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
SSE FSHAT TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGLAI+V
Sbjct: 541 SSEIFSHATVNTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIV 600
Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
ILGLSA+G WFI R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFILRSRQQAINSYKPVNAAVPEQELQPL 634
BLAST of CmaCh20G009810 vs. NCBI nr
Match:
gi|778669864|ref|XP_011649314.1| (PREDICTED: aspartic proteinase CDR1 [Cucumis sativus])
HSP 1 Score: 1073.5 bits (2775), Expect = 1.0e-310
Identity = 526/640 (82.19%), Postives = 569/640 (88.91%), Query Frame = 1
Query: 1 MARTPNLLLAVLLHFLHLTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPH 60
MA++P L+ A+LLH LSADPIS NPLL+PSHRAMVLPLY SSPNSSK IS PH
Sbjct: 1 MAKSPFLVAAILLHIF------LSADPISPNPLLSPSHRAMVLPLYLSSPNSSKFISNPH 60
Query: 61 RRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTC 120
RRLR FP S+N SNARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPCSTC
Sbjct: 61 RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 120
Query: 121 ELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISF 180
E CG+HQDPKFDPE SSTY+P+KCN DC CD+DGVQCVYERQYAEMSTSSGVLG+DVISF
Sbjct: 121 EQCGRHQDPKFDPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISF 180
Query: 181 GNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYG 240
GNQS L+PQRAVFGCEN ETGDL+SQRADGIMGLG+GDLS+VDQLVEKG INDSFSLCYG
Sbjct: 181 GNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYG 240
Query: 241 GMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGS 300
GMDIGGGAMVLGGISPPS+MIF+YSDPVRSPYYNVDLKEIHVAGKKLPL +FDGRYG+
Sbjct: 241 GMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGA 300
Query: 301 VLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFP 360
VLDSGTTY+YLP EAF FK+AIM+ +HSLKKI GPDPNFKD CFSGAGSDAAELS FP
Sbjct: 301 VLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFP 360
Query: 361 TVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDR 420
TVD++F+NGQKLSL PENY FRHSKVHGAYCLGIFENG NDQTTLLGGI+VRNTLVMYDR
Sbjct: 361 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-NDQTTLLGGIVVRNTLVMYDR 420
Query: 421 EHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQI 480
+SKIGFWKTNCSELWERL ISD+NA PSVS SHD+D APASAPSE PH IP ++QI
Sbjct: 421 ANSKIGFWKTNCSELWERLRISDDNADGPSVSTKSHDSDIAPASAPSERPHYTIPGELQI 480
Query: 481 GRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNG 540
GRITF ILLN SY LEPHIT LSDHIAQELNVSHSQV +LNFTMRGN SLIQLAILP G
Sbjct: 481 GRITFAILLNKSYTDLEPHITELSDHIAQELNVSHSQVIILNFTMRGNDSLIQLAILPYG 540
Query: 541 SSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMV 600
SSE FSHATA TIIS IVEHHM+LPP +GSYQV+RWNVEP M+RS+WKRLYVLVGL I+V
Sbjct: 541 SSEIFSHATANTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLVIVV 600
Query: 601 TLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
ILGLSA+G WF+ R RQQA +SYKPVNAA PEQELQ L
Sbjct: 601 IFILGLSALGAWFVLRSRQQAINSYKPVNAAVPEQELQPL 633
BLAST of CmaCh20G009810 vs. NCBI nr
Match:
gi|590705429|ref|XP_007047435.1| (Aspartyl protease family protein [Theobroma cacao])
HSP 1 Score: 850.9 bits (2197), Expect = 1.4e-243
Identity = 420/628 (66.88%), Postives = 501/628 (79.78%), Query Frame = 1
Query: 21 FTLS-ADPISSNPLLTP-----SHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSN 80
F LS ++P +S PLL P + AM+LPL+ NSS+ S R L + ++ N
Sbjct: 19 FLLSRSNPSTSTPLLLPPPHHGARPAMILPLFPFPKNSSRTFSHSGRHLLRSDSHSSHPN 78
Query: 81 ARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPE 140
ARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+TCE CG+HQDPKF P+
Sbjct: 79 ARMRLYDDLLLNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCATCEQCGRHQDPKFQPD 138
Query: 141 LSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFG 200
LSSTYQPVKCN DC+CD D VQC YERQYAEMS+SSGVLG+D+ISFGNQS LVPQRAVFG
Sbjct: 139 LSSTYQPVKCNLDCSCDTDRVQCTYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAVFG 198
Query: 201 CENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGI 260
CENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCYGGMDIGGGAMVLGGI
Sbjct: 199 CENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGI 258
Query: 261 SPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQE 320
S P +M+FSYSDP RSPYYN+DLK IHVAGK+LPL P+VFD +YG+VLDSGTTY+YLP+
Sbjct: 259 SSPPDMVFSYSDPERSPYYNIDLKAIHVAGKQLPLNPNVFDVKYGTVLDSGTTYAYLPEA 318
Query: 321 AFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTFPTVDLIFDNGQKLSL 380
AF FKNAI+ L SLK+I GPDPN+ D CFSGA SD +ELSK FPTV+++FDN QKL L
Sbjct: 319 AFAAFKNAIIKELTSLKQIRGPDPNYNDICFSGASSDVSELSKIFPTVEMVFDNQQKLLL 378
Query: 381 APENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGFWKTNCSE 440
APENYLFRHSKV G YCLGIF N D TTLLGGIIVRNTLV YDREH KIGFWKTNCSE
Sbjct: 379 APENYLFRHSKVRGGYCLGIFPN-EKDPTTLLGGIIVRNTLVTYDREHLKIGFWKTNCSE 438
Query: 441 LWERLHISDENAHAPSVSNTSHDT--DTAPASAPSESPHDMIPEDIQIGRITFDILLNIS 500
LWERL I+ + +PS S+ ++ ++ P SAP S H IP +IQIG IT D+ L+I
Sbjct: 439 LWERLRINGAPSPSPSSSSGKDNSTVESPPTSAPDGSSHYAIPGEIQIGEITLDMSLSID 498
Query: 501 YKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSHATATT 560
Y +L+PHI L++ IA+EL+V+ SQV LL+FT GN SL+ AI+P+GS+ + S+ A +
Sbjct: 499 YSYLKPHINELAEFIAKELDVNASQVHLLDFTSEGNSSLVTWAIVPSGSATYISNVAAIS 558
Query: 561 IISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLSAVGVW 620
IIS + EH ++LP +G+YQ+++W VEP + ++ W++ Y++V LAIM+T+I+GLSA G W
Sbjct: 559 IISQLAEHRVRLPDTFGNYQLVQWKVEPSVQQTWWQQHYLVVLLAIMITIIVGLSASGGW 618
Query: 621 FIWRRRQQAFHSYKPVNAAAPEQELQTL 641
IWRRRQQA YKPV+ A EQELQ L
Sbjct: 619 IIWRRRQQALKLYKPVDGAVSEQELQPL 645
BLAST of CmaCh20G009810 vs. NCBI nr
Match:
gi|823184460|ref|XP_012489205.1| (PREDICTED: aspartic proteinase-like protein 2 [Gossypium raimondii])
HSP 1 Score: 850.1 bits (2195), Expect = 2.5e-243
Identity = 421/641 (65.68%), Postives = 507/641 (79.10%), Query Frame = 1
Query: 6 NLLLAVLLHFLHLTHFTLSADPISSNP--LLTPSHR----AMVLPLYRSSPNSSKLISKP 65
NL + ++ FL F LS S++P LL P H AMVLPL+ SS NSS+
Sbjct: 7 NLAVGTVVFFLL---FLLSQSNPSTSPPRLLPPPHHGARPAMVLPLFPSSKNSSRTFLHS 66
Query: 66 HRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCST 125
HR L + ++ NARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVPC+T
Sbjct: 67 HRHLLRSDSHSSHPNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCAT 126
Query: 126 CELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVIS 185
CE CG+HQDPKF P+LSSTYQPVKCN DC CD+D VQC+YERQYAEMS+SSGVLG+D+IS
Sbjct: 127 CEQCGRHQDPKFQPDLSSTYQPVKCNLDCNCDSDRVQCIYERQYAEMSSSSGVLGEDIIS 186
Query: 186 FGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCY 245
FGNQS LVPQRAVFGCENEETGDLYSQ ADGIMGLG GDLS+VDQLVEKGVI+DSFSLCY
Sbjct: 187 FGNQSELVPQRAVFGCENEETGDLYSQHADGIMGLGRGDLSVVDQLVEKGVISDSFSLCY 246
Query: 246 GGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYG 305
GGMDIGGGAMVLGGIS PS+M+FSY+DPVRSPYY++ LKEIHVAGK+L L PSVFDG+YG
Sbjct: 247 GGMDIGGGAMVLGGISAPSDMVFSYADPVRSPYYSIGLKEIHVAGKQLSLNPSVFDGKYG 306
Query: 306 SVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELSKTF 365
+VLDSGTTY+YLP+ AF FK AI+ L+ LK+I GPDPN+ D CFS A SD +ELSKTF
Sbjct: 307 TVLDSGTTYAYLPEPAFLAFKEAILKELNGLKQIRGPDPNYNDICFSTASSDVSELSKTF 366
Query: 366 PTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYD 425
PTV+++F + QKL L+PENYLFRHSKVHGAYCLGIF+N D TTLLGGIIVRNTLV YD
Sbjct: 367 PTVEMVFGDQQKLLLSPENYLFRHSKVHGAYCLGIFQN-EKDPTTLLGGIIVRNTLVTYD 426
Query: 426 REHSKIGFWKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQ 485
REHSKIGFWKTNCSELWERLHI+ + PS S + T++ +A SPH P IQ
Sbjct: 427 REHSKIGFWKTNCSELWERLHITGALSPTPSSSGKGNSTESPTTTASDGSPHYDFPGKIQ 486
Query: 486 IGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPN 545
IG+I D+ L+ ++ +L+P I L++ IA+EL+V+ SQV LLNFT GN SL++LAI+P+
Sbjct: 487 IGKIILDMSLSTNHSYLKPQINKLTEFIAKELDVNASQVHLLNFTSEGNSSLVRLAIVPS 546
Query: 546 GSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIM 605
SS + TA IIS + EH +KLP +G+YQ+++W VEP ++ W R Y++V +A++
Sbjct: 547 DSSTYIYKETARNIISRLAEHRVKLPDTFGNYQLVQWKVEPSTKQTWWGRNYMVVVVALI 606
Query: 606 VTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
+ +++GLS GVW +WRR+QQ +SYKPV AAAPEQELQ L
Sbjct: 607 IIVVIGLSVYGVWGMWRRKQQTVNSYKPVGAAAPEQELQPL 643
BLAST of CmaCh20G009810 vs. NCBI nr
Match:
gi|802645015|ref|XP_012079339.1| (PREDICTED: aspartic proteinase-like protein 2 [Jatropha curcas])
HSP 1 Score: 845.9 bits (2184), Expect = 4.7e-242
Identity = 418/645 (64.81%), Postives = 507/645 (78.60%), Query Frame = 1
Query: 1 MARTPNLLLAVLLHFLHLTHFTLSADPISSNP----LLTPSHRAMVLPLYRSSPNSSKLI 60
MA TP L+ F + D +S+N LL + A++LPL+ S NSSK +
Sbjct: 1 MASTPIQLIIFFYFFFFQLDAAIVLD-VSANSTTTVLLGGATPALILPLFLSPSNSSKQL 60
Query: 61 SKPHRRLRGFPNSNNRSNARMRLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVP 120
S P R L G N++ R NARMRLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGSTVTYVP
Sbjct: 61 SNPPRHLLG-SNASARPNARMRLYDDLLLNGYYTTRLWIGTPPQRFALIVDTGSTVTYVP 120
Query: 121 CSTCELCGKHQDPKFDPELSSTYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDD 180
CSTCE CG HQDPKF PELSSTYQP+KCN DC CD++ QC+Y+R+YAEMSTSSGVL +D
Sbjct: 121 CSTCEQCGNHQDPKFQPELSSTYQPLKCNPDCNCDDEREQCIYDRRYAEMSTSSGVLAED 180
Query: 181 VISFGNQSALVPQRAVFGCENEETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFS 240
ISFGNQS L PQRAVFGCEN ETGDLYSQ ADGIMGLGSGDLSIVDQLVEKGVI+DSFS
Sbjct: 181 FISFGNQSELEPQRAVFGCENVETGDLYSQHADGIMGLGSGDLSIVDQLVEKGVISDSFS 240
Query: 241 LCYGGMDIGGGAMVLGGISPPSEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDG 300
LCYGGM+IGGGAMVLG +SPPS M+F+YSDPVRS YYN+DL+EIHVAGK+LPLEP VFD
Sbjct: 241 LCYGGMNIGGGAMVLGSLSPPSGMVFTYSDPVRSQYYNIDLREIHVAGKRLPLEPGVFDR 300
Query: 301 RYGSVLDSGTTYSYLPQEAFGPFKNAIMNALHSLKKIGGPDPNFKDTCFSGAGSDAAELS 360
++G++LDSGTTY+YLP+ F FK+AIM LHSLK+I GPDPN+ D CFSGAGS+ ++LS
Sbjct: 301 KHGTILDSGTTYAYLPEAVFKAFKDAIMKELHSLKQIRGPDPNYNDICFSGAGSEVSQLS 360
Query: 361 KTFPTVDLIFDNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLV 420
FPTVD+IF++GQK SL+PENYLFRH+KV GAYCLGIF NG D TTLLGGIIVRNTLV
Sbjct: 361 NAFPTVDMIFEHGQKWSLSPENYLFRHTKVPGAYCLGIFPNG-KDPTTLLGGIIVRNTLV 420
Query: 421 MYDREHSKIGFWKTNCSELWERLHISDENAHAPSVSN-TSHDTDTAPASAPSESPHDMIP 480
MYDRE+SK+GFWKTNCSELWERLHI+ A PS SN T+ + P APS+ H ++P
Sbjct: 421 MYDRENSKVGFWKTNCSELWERLHITSAAAPLPSDSNGTNITVEIPPTLAPSDQLHYVLP 480
Query: 481 EDIQIGRITFDILLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLA 540
+++QIG+ITF++ L +Y HL+ H T L IAQ+L V+ SQV LL +GN SLI
Sbjct: 481 DELQIGQITFEMSLKANYSHLKIHATELIGFIAQQLGVNSSQVHLLKLASKGNDSLIGWT 540
Query: 541 ILPNGSSEFFSHATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVG 600
I+P+GS++ S+ATA +IIS + EHH++LP +GSY+++ W +EP +R+ W++ Y+ G
Sbjct: 541 IVPSGSADHISNATALSIISRVAEHHIQLPDTFGSYRLVHWKIEPPANRTWWQQHYLFAG 600
Query: 601 LAIMVTLILGLSAVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 641
L +++ LILGLSA G+ FIWR R+Q F +Y+PVN A PEQELQ L
Sbjct: 601 LVVIIVLILGLSASGLLFIWRCREQTFSAYRPVNTAVPEQELQPL 642
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
ASPL2_ARATH | 1.7e-34 | 28.71 | Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=... | [more] |
ASPG2_ARATH | 3.8e-34 | 29.64 | Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... | [more] |
ASPG1_ARATH | 3.8e-34 | 28.93 | Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... | [more] |
NEP1_NEPGR | 1.6e-32 | 30.08 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1 | [more] |
NEP2_NEPGR | 2.1e-32 | 31.62 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LJB9_CUCSA | 6.9e-311 | 82.19 | Uncharacterized protein OS=Cucumis sativus GN=Csa_2G277070 PE=3 SV=1 | [more] |
A0A061DHD4_THECC | 1.0e-243 | 66.88 | Aspartyl protease family protein OS=Theobroma cacao GN=TCM_000732 PE=3 SV=1 | [more] |
A0A0D2QM13_GOSRA | 1.7e-243 | 65.68 | Uncharacterized protein OS=Gossypium raimondii GN=B456_007G056000 PE=3 SV=1 | [more] |
G7JCS6_MEDTR | 3.6e-241 | 65.21 | Eukaryotic aspartyl protease family protein OS=Medicago truncatula GN=MTR_4g0952... | [more] |
V4SQ94_9ROSI | 1.8e-240 | 64.85 | Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025144mg PE=3 SV=1 | [more] |