Cp4.1LG01g20080 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g20080
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Eukaryotic aspartyl protease family protein isoform 1
Location: Cp4.1LG01 : 17114427 .. 17118516 (+)
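The location field packs the sequence name, start, end, and strand into one string. The sketch below splits it apart and derives the feature length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'seqid : start .. end (strand)' string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    on the page), so length = end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return seqid, start, end, strand, end - start + 1

seqid, start, end, strand, length = parse_location(
    "Cp4.1LG01 : 17114427 .. 17118516 (+)"
)
print(seqid, start, end, strand, length)
```

Under that assumption the feature spans 4090 bases on the plus strand of Cp4.1LG01.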
The following sequences are available for this feature:

Gene sequence (with intron)

CAAGGAAACGCGCACCGTACGCACAAACAGCGCATTATAGCACTCTCACAGTCTCACCCAATTTGATTTATCGTCGTCACCATTCATGGTTCACGCTTCAATCTCTACTCCGTTTGCAGAAGGCTCGGTTCCTTGCCTTTTCCCCAAATCCGGACGCTCTCTCGCCGTTGTAGAAGTTCATTCTTACGAAATCTGATCTTCATCTTCAGTTTCCATGGCGAATTTCTTGGTAGTTTTGCTCTTTGTAGCTTGTTTCTTGGTGGACAGTTCTGTTGCGCTTAGGCTCTCGTCGAGGCTCGTTCATCGATTCTCCGATGAGGCGAAAGCGCTTTGGAAGTCGAGGAACGGTAATGCGTCTGGAAAATTTTGGCCGAGGAGGAATAGCTTGAAGTATTTTGAAACGCTTAAGGATTATGACTTGAAGAGGCGGAGACTGAAGATCGGATCGAAGTACGAGGTGATTTTTCCTTCTGAAGGAAACGAGGTCGTGTTCTTTGGGAACGAATTTGATTGGTTAGTATGCTTGAGCTTAACTGTTCCAGGACGTGATCGCTTTATAGACTAGCTAACTTCTAAGTTCTACCAATTGTGTAGTTTTCTTTACTTTCACAGCAACGATAATGTTTAATTTTATCAGCATTTCCTTCACCATTAAAAATTTAAAGCAGCATGGAGGGAGTATTTTCTTTACCTTGTTTGTTAGAGCGATTAAAGAATTCAGTGGAACTAGTTGTCCAGGTTACATTACACATGGATTGATATAGGAACACCGAGTGTTTCGTTTCTCGTTGCATTGGATGCTGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTCATTATAGCTCACTGGTATGACTGCAATCACCTACATTAATTTCTTCATTTCTTTAATACCTCTTTTAACAAATTCCAAGTCGTATTTGGGATGTGAGCTGCTGAACTAGTTCATGAACTGGATAACTAGTCTCCTTTTCCTCCAAATTCAAATAAGAAGTTAAATTACTGTGCAAGAATCCTAATAATCGAATCATAAAATTGAGGTTTACTGATGTGGATATCGAAACATTCATATGTTCCGTATCCTTCAAAGTTGAAACCTATCATTAGTCTGTTGTAAAGCGACAATCTCTCTTTCTATCAGTTAAATTTGCCAAATAACTTGCGAAATGTTACTTCTAAAGAGAGATCTGTATTCAAAATCCTTCCACTCAGATATATCATTAACGGAATAAAAGAAAAAAGAACATAAAGACGAACTAATGTACATCAAAAGTAAGCTCTTATCTAAATCTATACTGATAATTAAATGGACACCGCATATATATACGATTGTTTTCAATTAATTACCTCTTTCTTTTATGTTCCGATCTATCTTAGTATGTCTCATTACTTGTCTAGTAGTCAAATAAGAACTGTGACAGGTTTTTCAGTTAATTTCACCAATTGACTGGCAGGTGCTCTCTTCTGTTGGGGCTGTCTATGATGCAGGATAGGGATCTGAGTGCGTACAATCCAGCTTTATCAAACACCAGCCAGTACCTTTCCTGCAGTCATCAATTATGTGCTTGGAGCACAACTTGCAAAAGTCCTGATGATCCCTGCAGTTATAAAAGAGATTATTACACAGATAATACATCAACTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGGTACACAACGTCTTTTGCAGGCCTCAGTTGTATTAGGGTGGGCTCTTACTGCGTCACATACTTTGATGCTTTTCTTTGATGTTTTCTTCTTCTTTTACAATGTTGAAGTTGAGATATTGGAATGCCACCTGAATCTTTGCCTGTGATCTCCCCCTCTCCCTCTTGATGCAGCTGTGGTAGGAAACAGAGTGGTTACTATTTGGATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCGAAAGCAGG
ATTGGTTAGAAATACATTCTCGCTTTGTTTTGATAATAATGGTTCTGGGAGAATCCTCTTTGGGGACAATGGTCCTGCCACCCAGCAAACAACACAATTTTTGCCATTATTTGGTGAATTGTATGTGATCTAGCAACTAAATTATTAGCTAAGCCAATTCTAACCTTTATTTTCTCTTGAAGTTATCTATTATCACCAACACCATTAAGCCATCCATTCCTTGTCTCAGTGATGCCTATTTTGTCGAGGTGGAGTCCTTTTGTGTTGGGAGTTCCTGTCTGCAGAAAAGTGGATTCCACGCATTGGTGGACAGTGGCTCGTCTTTTACATATCTTCCTACAGAAATCTATAAAAAGATAGTCTTTGAGGTGATACTGGAATATAATGATGTGTGATATTTCAATTTTGTTTGGCCTAAAGTCTCGAATTACTTAATGCAGTTTGACAAACAAGTAAAATTAAATGCTACCAGGATAATTCTCCAAGAATTTCCCTGGAATTACTGCTATAATTCCAGGTAGGTGCATGAAAAAGGTAAAATAAATATTGTATTTTCCTCATTTTTTAAGGGTTGACAGACAGTTCTTTTATTGTGGAACAGTTCGCTGGAGTCCTCTTATATTCCTAGTATGAAACTCGTGTTTCCTCTGAATCAAAGCTTTATACATGATCCTGTGTATACCCTCCCTGACAGCCAAGTAAGTTGGACATCTTGCTCTTTCTTCCCGAGTAATTTTACCCAAATGATTATTCCCAGTGGGCCCTTAAAAATATTGATTTCTTTCCTCAATTTGTTTGTTTGGGACTGGGCAAGCTAGAAATGTTTTTGAACACAGTTTGAGGACACGAAGTCATGTTTGGGCAAGTGCTGAAATATTTGCTCCCTTCTCTGGCTTTTCTTTTATTGTTATGGATGGAGCCTATTGCATTCAAAATACCCCTTTTCTAGAATGTCAACAACTTGACTTTAAAGGACTTCATATCCTTTTTGCCTCGGTGGTTACGATTGATCTGTTTTGGTCGAAGTATGAATTTTATAAAATTTAATGTGCAGGGATATAAACTGTTTTGTTTAACCTTAGAAGAGACAGATGATGATTACGGTGTAATTGGACGTAAGTAGGACTTTTAATTCTGTGGTTGTATATGCTACTTTTTCAATGATTTTCTATAATATAGTTCCTTTTCTTCTCCTACCACACGATTACACATAGGAGCTTGAATTCGTAAATGCATTTAATGATAAAAACTTGTGATATCTTCTTCCATTTGTGCTACAGAAAACTTGATGGTAGGTTACCGGCTGGTTTTTGACAGAGAAAATCTTCAGTTGGGTTGGTCGAAGTCCAAATGTAAGCATGTTTTCATTATTATTTACACTTTGAATTTTCTTCTAACAGTGTGCAGTAGCTGCTCTTCCATCACTCGAAAGATTATCTGGAAGAAAGAGTTTGGTTAAGGCAAAAATGAAATCTTGCTTTTTGAATTTTCTCAGCAACTTCTGATTTCAAGGATATTCAATTCTTCCTGTGTCGAGATTTAAGTTTCTGTGCATTGCTTATATACTTCTGGCCTTTTTATTTATTTATTTTAAATCAGCATGGAAAAAGAACTTATATTGTGAGGTTTGAGCATGATTCAATTCCCTTTTCTTCATTCGACATTTATCAGTGTATAAAATTGCAGGCCTAGATATCAACCACGGCGAAGCAGACCATGCCAAACCACCTTCAAATGACGGATCACCAACTGCATTACCAACCGATGGACATCTAAGCCCTCCAAATAGGCAAGAAATTGCACCCACTGCTGCTAGGGCGTTTTCCAAATCGTCCCTAACTGCACCCCATTTTTCTCTCTTCAGCTGCTGTTGTTTGAGGTTATTCTTGTTGCTTTTTGAGTTTGTTGAGTCCATGCTGTAAATATTGCCTTTTCCGACTGGCATTTCAATTACTTAATAATTTACCCCTTCCTGTAAGTTGCTGTATCATATAGGAAAA
AATAAGTTACATTTTTTTTACTGGGATTTGAGTATTATATAGACGTCTCTTTCAATTCAAGCATCAGTGCAAGTGCCAGTTCTACGTGTA

mRNA sequence

CAAGGAAACGCGCACCGTACGCACAAACAGCGCATTATAGCACTCTCACAGTCTCACCCAATTTGATTTATCGTCGTCACCATTCATGGTTCACGCTTCAATCTCTACTCCGTTTGCAGAAGGCTCGGTTCCTTGCCTTTTCCCCAAATCCGGACGCTCTCTCGCCGTTGTAGAAGTTCATTCTTACGAAATCTGATCTTCATCTTCAGTTTCCATGGCGAATTTCTTGGTAGTTTTGCTCTTTGTAGCTTGTTTCTTGGTGGACAGTTCTGTTGCGCTTAGGCTCTCGTCGAGGCTCGTTCATCGATTCTCCGATGAGGCGAAAGCGCTTTGGAAGTCGAGGAACGGTAATGCGTCTGGAAAATTTTGGCCGAGGAGGAATAGCTTGAAGTATTTTGAAACGCTTAAGGATTATGACTTGAAGAGGCGGAGACTGAAGATCGGATCGAAGTACGAGGTGATTTTTCCTTCTGAAGGAAACGAGGTCGTGTTCTTTGGGAACGAATTTGATTGGTTACATTACACATGGATTGATATAGGAACACCGAGTGTTTCGTTTCTCGTTGCATTGGATGCTGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTCATTATAGCTCACTGGATAGGGATCTGAGTGCGTACAATCCAGCTTTATCAAACACCAGCCAGTACCTTTCCTGCAGTCATCAATTATGTGCTTGGAGCACAACTTGCAAAAGTCCTGATGATCCCTGCAGTTATAAAAGAGATTATTACACAGATAATACATCAACTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGGTACACAACGTCTTTTGCAGGCCTCAGTTGTATTAGGCTGTGGTAGGAAACAGAGTGGTTACTATTTGGATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCGAAAGCAGGATTGGTTAGAAATACATTCTCGCTTTGTTTTGATAATAATGGTTCTGGGAGAATCCTCTTTGGGGACAATGGTCCTGCCACCCAGCAAACAACACAATTTTTGCCATTATTTGGTGAATTTGATGCCTATTTTGTCGAGGTGGAGTCCTTTTGTGTTGGGAGTTCCTGTCTGCAGAAAAGTGGATTCCACGCATTGGTGGACAGTGGCTCGTCTTTTACATATCTTCCTACAGAAATCTATAAAAAGATAGTCTTTGAGTTTGACAAACAAGTAAAATTAAATGCTACCAGGATAATTCTCCAAGAATTTCCCTGGAATTACTGCTATAATTCCAGTTCGCTGGAGTCCTCTTATATTCCTAGTATGAAACTCGTGTTTCCTCTGAATCAAAGCTTTATACATGATCCTGTGTATACCCTCCCTGACAGCCAAGGATATAAACTGTTTTGTTTAACCTTAGAAGAGACAGATGATGATTACGGTGTAATTGGACAAAACTTGATGGTAGGTTACCGGCTGGTTTTTGACAGAGAAAATCTTCAGTTGGGTTGGTCGAAGTCCAAATGCCTAGATATCAACCACGGCGAAGCAGACCATGCCAAACCACCTTCAAATGACGGATCACCAACTGCATTACCAACCGATGGACATCTAAGCCCTCCAAATAGGCAAGAAATTGCACCCACTGCTGCTAGGGCGTTTTCCAAATCGTCCCTAACTGCACCCCATTTTTCTCTCTTCAGCTGCTGTTGTTTGAGGTTATTCTTGTTGCTTTTTGAGTTTGTTGAGTCCATGCTGTAAATATTGCCTTTTCCGACTGGCATTTCAATTACTTAATAATTTACCCCTTCCTGTAAGTTGCTGTATCATATAGGAAAAAATAAGTTACATTTTTTTTACTGGGATTTGAGTATTATATAGACGTCTCTTTCAATTCAAGCATCAGTGCAAGTGCCAGTTCTACGTGTA

Coding sequence (CDS)

ATGGCGAATTTCTTGGTAGTTTTGCTCTTTGTAGCTTGTTTCTTGGTGGACAGTTCTGTTGCGCTTAGGCTCTCGTCGAGGCTCGTTCATCGATTCTCCGATGAGGCGAAAGCGCTTTGGAAGTCGAGGAACGGTAATGCGTCTGGAAAATTTTGGCCGAGGAGGAATAGCTTGAAGTATTTTGAAACGCTTAAGGATTATGACTTGAAGAGGCGGAGACTGAAGATCGGATCGAAGTACGAGGTGATTTTTCCTTCTGAAGGAAACGAGGTCGTGTTCTTTGGGAACGAATTTGATTGGTTACATTACACATGGATTGATATAGGAACACCGAGTGTTTCGTTTCTCGTTGCATTGGATGCTGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTCATTATAGCTCACTGGATAGGGATCTGAGTGCGTACAATCCAGCTTTATCAAACACCAGCCAGTACCTTTCCTGCAGTCATCAATTATGTGCTTGGAGCACAACTTGCAAAAGTCCTGATGATCCCTGCAGTTATAAAAGAGATTATTACACAGATAATACATCAACTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGGTACACAACGTCTTTTGCAGGCCTCAGTTGTATTAGGCTGTGGTAGGAAACAGAGTGGTTACTATTTGGATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCGAAAGCAGGATTGGTTAGAAATACATTCTCGCTTTGTTTTGATAATAATGGTTCTGGGAGAATCCTCTTTGGGGACAATGGTCCTGCCACCCAGCAAACAACACAATTTTTGCCATTATTTGGTGAATTTGATGCCTATTTTGTCGAGGTGGAGTCCTTTTGTGTTGGGAGTTCCTGTCTGCAGAAAAGTGGATTCCACGCATTGGTGGACAGTGGCTCGTCTTTTACATATCTTCCTACAGAAATCTATAAAAAGATAGTCTTTGAGTTTGACAAACAAGTAAAATTAAATGCTACCAGGATAATTCTCCAAGAATTTCCCTGGAATTACTGCTATAATTCCAGTTCGCTGGAGTCCTCTTATATTCCTAGTATGAAACTCGTGTTTCCTCTGAATCAAAGCTTTATACATGATCCTGTGTATACCCTCCCTGACAGCCAAGGATATAAACTGTTTTGTTTAACCTTAGAAGAGACAGATGATGATTACGGTGTAATTGGACAAAACTTGATGGTAGGTTACCGGCTGGTTTTTGACAGAGAAAATCTTCAGTTGGGTTGGTCGAAGTCCAAATGCCTAGATATCAACCACGGCGAAGCAGACCATGCCAAACCACCTTCAAATGACGGATCACCAACTGCATTACCAACCGATGGACATCTAAGCCCTCCAAATAGGCAAGAAATTGCACCCACTGCTGCTAGGGCGTTTTCCAAATCGTCCCTAACTGCACCCCATTTTTCTCTCTTCAGCTGCTGTTGTTTGAGGTTATTCTTGTTGCTTTTTGAGTTTGTTGAGTCCATGCTGTAA

Protein sequence

MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLLFEFVESML
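
The protein sequence above is the conceptual translation of the coding sequence. A minimal sketch of that step follows, assuming the standard nuclear genetic code; `translate` is an illustrative helper written here, not the tool the database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1, codon by codon.

    Trailing bases that do not fill a codon are ignored; unknown
    codons (for example those containing N) become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)

# First five codons and last three codons of the CDS listed above.
print(translate("ATGGCGAATTTCTTG"))
print(translate("ATGCTGTAA"))
```

The first call yields the N-terminal residues MANFL and the second yields ML followed by the stop symbol, matching the start and end of the listed protein. Passing the full CDS should reproduce the whole protein plus a trailing `*`.
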
BLAST of Cp4.1LG01g20080 vs. Swiss-Prot
Match: ASPL1_ARATH (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana GN=At5g10080 PE=1 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 7.5e-129
Identity = 249/534 (46.63%), Positives = 347/534 (64.98%), Query Frame = 1

Query: 8   LLFVACFLV-DSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKD 67
           LLF   FL  + ++A   SSRL+HRFSDE +A  K+ + + S    P + SL+Y+  L +
Sbjct: 8   LLFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDS---LPNKQSLEYYRLLAE 67

Query: 68  YDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLL 127
            D +R+R+ +G+K + + PSEG++ +  GN+F WLHYTWIDIGTPSVSFLVALD GS+LL
Sbjct: 68  SDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLL 127

Query: 128 WVPCDCIQCAPLSASHYSSL-DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCS 187
           W+PC+C+QCAPL++++YSSL  +DL+ YNP+ S+TS+   CSH+LC  ++ C+SP + C 
Sbjct: 128 WIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCP 187

Query: 188 YKRDYYTDNTSTSGFMIEDKLHLASFSKH---GTQRLLQASVVLGCGRKQSGYYLDGAAP 247
           Y  +Y + NTS+SG ++ED LHL   + +        ++A VV+GCG+KQSG YLDG AP
Sbjct: 188 YTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAP 247

Query: 248 DGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL-FG 307
           DG+MGLGP  ISVP+ L+KAGL+RN+FSLCFD   SGRI FGD GP+ QQ+T FL L   
Sbjct: 248 DGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNN 307

Query: 308 EFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRI 367
           ++  Y V VE+ C+G+SCL+++ F   +DSG SFTYLP EIY+K+  E D+ +  NAT  
Sbjct: 308 KYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHI--NATSK 367

Query: 368 ILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEET- 427
             +   W YCY SS+     +P++KL F  N +F IH P++    SQG   FCL +  + 
Sbjct: 368 NFEGVSWEYCYESSA--EPKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSG 427

Query: 428 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKP----PSNDGSPTAL 487
            +  G IGQN M GYR+VFDREN++LGWS SKC      + D  +P    P +  SP  L
Sbjct: 428 QEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKC------QEDKIEPPQASPGSTSSPNPL 487

Query: 488 PTDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLLFEFVESML 530
           PTD   S          A +  SK+  ++  +S  S   L   LLL  ++ S++
Sbjct: 488 PTDEQQSRGGHAVSPAIAGKTPSKTPSSSSSYSFSSIMRLFNSLLLLHWLASLM 528
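
The percentages in each HSP summary line are the matched counts divided by the alignment length (gaps included). A small sketch that recomputes them from the raw fractions is shown below; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the `label = n/m` layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Return {label: percent} for every 'label = n/m' pair in the line.

    Percentages are recomputed from the counts and rounded to two
    decimals, rather than read from the parenthesised values.
    """
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        stats[label] = round(100 * int(num) / int(den), 2)
    return stats

summary = "Identity = 249/534 (46.63%), Positives = 347/534 (64.98%), Query Frame = 1"
print(parse_hsp_stats(summary))
```

For the ASPL1_ARATH hit this gives 46.63 for identity and 64.98 for positives, agreeing with the reported values. The same function can be applied to the summary line of any other hit on the page.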

BLAST of Cp4.1LG01g20080 vs. Swiss-Prot
Match: APF1_ARATH (Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 3.1e-74
Identity = 186/493 (37.73%), Positives = 265/493 (53.75%), Query Frame = 1

Query: 30  HRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKDYD--LKRRRLKIGSKYEVIFPSE 89
           HRFSD+         G   G   P R+S KY+  +   D  ++ RRL    +  V F S+
Sbjct: 39  HRFSDQVV-------GVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SD 98

Query: 90  GNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCA-PLSASHYSSL 149
           GNE V   +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   SSL
Sbjct: 99  GNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL 158

Query: 150 DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCSYKRDYYTDNTSTSGFMIEDKL 209
           D  L+ Y+P  S+TS  + C+  LC     C SP+  C Y+  Y ++ TS++G ++ED L
Sbjct: 159 D--LNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVL 218

Query: 210 HLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVR 269
           HL S  K  + + + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LAK G+  
Sbjct: 219 HLVSNDK--SSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAA 278

Query: 270 NTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYFVEVESFCVGSSCLQKSGFH 329
           N+FS+CF N+G+GRI FGD G   Q+ T  L +      Y + V    VG +      F 
Sbjct: 279 NSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNTGDLE-FD 338

Query: 330 ALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSY-IPSM 389
           A+ DSG+SFTYL    Y  I   F+        +    E P+ YCY  S  + S+  P++
Sbjct: 339 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 398

Query: 390 KLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQ 449
            L      S+ ++ P+  +P  +   ++CL + +  +D  +IGQN M GYR+VFDRE L 
Sbjct: 399 NLTMKGGSSYPVYHPLVVIP-MKDTDVYCLAIMKI-EDISIIGQNFMTGYRVVFDREKLI 458

Query: 450 LGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGHLSP-----PNRQEIAPTAARAFSK 509
           LGW +S C     GE      PSN  S +A P      P     P+++    T + A+S 
Sbjct: 459 LGWKESDCYT---GETSARTLPSNRSSSSARPPASSFDPEATNIPSQRPNTSTTSAAYSL 511

Query: 510 S-SLTAPHFSLFS 512
           S SL+   FS+ +
Sbjct: 519 SISLSLFFFSILA 511

BLAST of Cp4.1LG01g20080 vs. Swiss-Prot
Match: ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2)

HSP 1 Score: 124.8 bits (312), Expect = 2.8e-27
Identity = 120/429 (27.97%), Positives = 201/429 (46.85%), Query Frame = 1

Query: 47  ASGKFWPRRNSLKYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWI 106
           A  KF  ++ +L++F   K +D +R    + S   +  P  G+  V    +   L++T I
Sbjct: 29  AQHKFAGKKKNLEHF---KSHDTRRHSRMLAS---IDLPLGGDSRV----DSVGLYFTKI 88

Query: 107 DIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLS 166
            +G+P   + V +D GSD+LW+ C  C +C        ++L+  LS ++   S+TS+ + 
Sbjct: 89  KLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASSTSKKVG 148

Query: 167 CSHQLCAW---STTCKSPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQA 226
           C    C++   S +C+ P   CSY    Y D +++ G  I D L L   +       L  
Sbjct: 149 CDDDFCSFISQSDSCQ-PALGCSY-HIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQ 208

Query: 227 SVVLGCGRKQSGYYLDG-AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDN-NGSGR 286
            VV GCG  QSG   +G +A DGVMG G  N SV + LA  G  +  FS C DN  G G 
Sbjct: 209 EVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI 268

Query: 287 ILFGDNGPATQQTTQFLPLFGEFDAYF----VEVESFCVGSSCLQKSGFHALVDSGSSFT 346
              G       +TT  +P    ++       V+  S  +  S ++  G   +VDSG++  
Sbjct: 269 FAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDSGTTLA 328

Query: 347 YLPTEIYKKIVFEF--DKQVKLNATRIILQEFPWNYCYNSSSLESSYIP-------SMKL 406
           Y P  +Y  ++      + VKL+      Q F +     S++++ ++ P       S+KL
Sbjct: 329 YFPKVLYDSLIETILARQPVKLHIVEETFQCFSF-----STNVDEAFPPVSFEFEDSVKL 388

Query: 407 -VFPLNQSFIHDPVYTLPDSQ---GYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENL 453
            V+P      HD ++TL +     G++   LT +E  +   ++G  ++    +V+D +N 
Sbjct: 389 TVYP------HDYLFTLEEELYCFGWQAGGLTTDERSEVI-LLGDLVLSNKLVVYDLDNE 426

BLAST of Cp4.1LG01g20080 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 1.0e-24
Identity = 102/363 (28.10%), Positives = 163/363 (44.90%), Query Frame = 1

Query: 106 IDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLS 165
           I IGTP     +  D GSDL W      QC P   S YS  +     +NP+ S+T Q +S
Sbjct: 136 IGIGTPKHDLSLVFDTGSDLTWT-----QCEPCLGSCYSQKE---PKFNPSSSSTYQNVS 195

Query: 166 CSHQLCAWSTTCKSPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVV 225
           CS  +C  + +C + +  C Y    Y D + T GF+ ++K  L       T   +   V 
Sbjct: 196 CSSPMCEDAESCSASN--CVYSI-VYGDKSFTQGFLAKEKFTL-------TNSDVLEDVY 255

Query: 226 LGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLC---FDNNGSGRIL 285
            GCG    G + DG A  G++GLGPG +S+P          N FS C   F +N +G + 
Sbjct: 256 FGCGENNQGLF-DGVA--GLLGLGPGKLSLPAQTTTT--YNNIFSYCLPSFTSNSTGHLT 315

Query: 286 FGDNGPATQQTTQFLPL--FGEFDAYFVEVESFCVGSS--CLQKSGFH---ALVDSGSSF 345
           FG  G    ++ +F P+  F     Y +++    VG     +  + F    A++DSG+ F
Sbjct: 316 FGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 375

Query: 346 TYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF 405
           T LPT++Y ++   F +  K+++ +       ++ CY+ + L++   P++        SF
Sbjct: 376 TRLPTKVYAELRSVFKE--KMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAF------SF 435

Query: 406 IHDPVYTLPDSQGYKL------FCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQLGWSK 453
               V  L D  G  L       CL     DD   + G        +V+D    ++G++ 
Sbjct: 436 AGSTVVEL-DGSGISLPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAP 464

BLAST of Cp4.1LG01g20080 vs. Swiss-Prot
Match: APCB1_ARATH (Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 1.0e-24
Identity = 114/414 (27.54%), Positives = 178/414 (43.00%), Query Frame = 1

Query: 83  IFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVS--FLVALDAGSDLLWVPCD--CIQCAPL 142
           IFP  GN         D L+YT I +G P     + + +D GS+L W+ CD  C  CA  
Sbjct: 190 IFPVGGNVYP------DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKG 249

Query: 143 SASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCSYKRDYYTDNTSTS 202
           +   Y     +L   + A     Q     +QL      C      C Y+ +Y  D++ + 
Sbjct: 250 ANQLYKPRKDNLVRSSEAFCVEVQ----RNQLTEHCENCHQ----CDYEIEY-ADHSYSM 309

Query: 203 GFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAP-DGVMGLGPGNISVPT 262
           G + +DK HL    K     L ++ +V GCG  Q G  L+     DG++GL    IS+P+
Sbjct: 310 GVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPS 369

Query: 263 LLAKAGLVRNTFSLCF--DNNGSGRILFGDNGPATQQTTQFLPLFGE--FDAYFVEVESF 322
            LA  G++ N    C   D NG G I  G +   +   T ++P+  +   DAY ++V   
Sbjct: 370 QLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMT-WVPMLHDSRLDAYQMQVTKM 429

Query: 323 CVGSSCLQKSGFHA-----LVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQE--- 382
             G   L   G +      L D+GSS+TY P + Y ++V    +   L  TR    E   
Sbjct: 430 SYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLP 489

Query: 383 FPW----NYCYNS-SSLESSYIP-----SMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLT 442
             W    N+ ++S S ++  + P       K +    +  I    Y +  ++G    CL 
Sbjct: 490 ICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGN--VCLG 549

Query: 443 LEET----DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKP 466
           + +     D    ++G   M G+ +V+D    ++GW KS C  +   E DH  P
Sbjct: 550 ILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC--VRPREIDHNVP 579

BLAST of Cp4.1LG01g20080 vs. TrEMBL
Match: A0A0A0KN37_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G407090 PE=3 SV=1)

HSP 1 Score: 833.2 bits (2151), Expect = 1.8e-238
Identity = 411/524 (78.44%), Positives = 453/524 (86.45%), Query Frame = 1

Query: 1   MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRN-GNASGKFWPRRNSLK 60
           MAN  ++LLF+A   V+ S+AL LS  LVHRFSDEAK+LW+SR  GN S KFWP  NSLK
Sbjct: 1   MANCALLLLFIASLFVNCSLALTLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLK 60

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L DYDLKRRRL IGSKY+V+FPSEG++V+FFGNEF+WLHYTWID+GTPSV FLVAL
Sbjct: 61  YFQMLMDYDLKRRRLNIGSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           D GSDLLWVPCDCIQCAPLSA++YS LDRDLS YNPALS+TS++L C HQLCAWSTTCKS
Sbjct: 121 DVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKS 180

Query: 181 PDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 240
            +DPC+YKRDYY+DNTSTSGFMIEDKL L SFSKHGT  LLQASVV GCGRKQSG YLDG
Sbjct: 181 ANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDG 240

Query: 241 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL 300
           AAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGSGRILFGD+GPATQQTTQFLPL
Sbjct: 241 AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPL 300

Query: 301 FGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNAT 360
           FGEF AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK+NAT
Sbjct: 301 FGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNAT 360

Query: 361 RIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEET 420
           RI+L+E PWNYCYN S+L S  IPSM+LVFPLNQ FIHDPVY LP +QGYK+FCLTLEET
Sbjct: 361 RIVLRELPWNYCYNISTLVSFNIPSMQLVFPLNQIFIHDPVYVLPANQGYKVFCLTLEET 420

Query: 421 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDG---SPTALP 480
           D+DYGVIGQNLMVGYR+VFDRENL+LGWSKSKCLDIN    +HAKPPSN+G   SP ALP
Sbjct: 421 DEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALP 480

Query: 481 TDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLL 521
                 P NRQ IAPTAAR  SKSSL+A HFS      L  FL+
Sbjct: 481 ------PTNRQAIAPTAARTSSKSSLSASHFSPLLLLLLAAFLV 518

BLAST of Cp4.1LG01g20080 vs. TrEMBL
Match: A0A061FFI5_THECC (Eukaryotic aspartyl protease family protein isoform 1 OS=Theobroma cacao GN=TCM_034973 PE=3 SV=1)

HSP 1 Score: 642.5 bits (1656), Expect = 4.5e-181
Identity = 320/519 (61.66%), Positives = 391/519 (75.34%), Query Frame = 1

Query: 6   VVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKF--WPRRNSLKYFET 65
           V L+     L+  S AL  SSRL+HRFSDEAKALW +RNGNA      WP+RNSL+Y E 
Sbjct: 6   VFLVLSVWLLLGGSAALTFSSRLIHRFSDEAKALWTARNGNAGNGVVSWPKRNSLEYLEL 65

Query: 66  LKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGS 125
           L   DLKR+R+K+GS+Y ++FPS+G+E +FFGNEFDWLHYTWIDIGTP+VSFLVALDAGS
Sbjct: 66  LIGNDLKRQRMKLGSQYPLLFPSQGSETLFFGNEFDWLHYTWIDIGTPNVSFLVALDAGS 125

Query: 126 DLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDP 185
           DLLWVPCDCIQCAPLSAS+Y+SLD+DLS Y+P+LS++S+ LSCSH LC  S+ CK P+DP
Sbjct: 126 DLLWVPCDCIQCAPLSASYYNSLDKDLSEYSPSLSSSSKNLSCSHLLCESSSYCKGPNDP 185

Query: 186 CSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPD 245
           C Y  +Y +DNTSTSG+++EDKLHL SFS H  +  LQASVV+GCGRKQSG YLDGAAPD
Sbjct: 186 CPYIIEYDSDNTSTSGYLVEDKLHLKSFSGHSEESSLQASVVIGCGRKQSGGYLDGAAPD 245

Query: 246 GVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEF 305
           G+MGLGPGNISVP+LLAKAGL++N+FS+C D NGSGRI FGD G ATQQ+T FLP+ G++
Sbjct: 246 GLMGLGPGNISVPSLLAKAGLIQNSFSICLDENGSGRIYFGDKGLATQQSTPFLPIGGKY 305

Query: 306 DAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIIL 365
           + YFV VE  CVGSSCL+KSGF ALVDSG+SFTYLP EIY K+V EFDKQV  NA RI  
Sbjct: 306 EKYFVRVEHLCVGSSCLEKSGFSALVDSGTSFTYLPPEIYDKVVLEFDKQV--NARRISN 365

Query: 366 QEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEETDDD 425
           QE  W YCYN SS E   IPSM+L F +NQSF IH+ +Y+    +G+ +FCLT+    DD
Sbjct: 366 QEDFWKYCYNVSSQEPFKIPSMRLKFAINQSFEIHNHIYSYTGIEGFTVFCLTVLRGKDD 425

Query: 426 YGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGHLS 485
           +G+IGQN M G+ +VFDRENL+LGWS S C D+N   + H  PP +  SP  LPT+   +
Sbjct: 426 FGIIGQNFMTGHEIVFDRENLKLGWSHSSCQDVNDKSSVHLAPPPSGESPIPLPTNEQQN 485

Query: 486 PPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLL 522
             N Q + P  A   S +   A    + +  CL   LL+
Sbjct: 486 TNNTQAVTPAVAGRASTNPSAASSLQIPTLLCLMASLLI 522

BLAST of Cp4.1LG01g20080 vs. TrEMBL
Match: B9GUJ4_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s09270g PE=3 SV=1)

HSP 1 Score: 632.1 bits (1629), Expect = 6.1e-178
Identity = 313/521 (60.08%), Positives = 398/521 (76.39%), Query Frame = 1

Query: 5   LVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETL 64
           LVV++ + C   ++S+ L  SS+L+HRFSDEAK++  SR GNASG  WP+R S +YF+ L
Sbjct: 9   LVVVVVLLCCQFEASIGLTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLL 68

Query: 65  KDYDLKRRRLKIGS-KYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGS 124
              DLKR+R+K+GS K +++FPS+G++ +FFGNE DWLHYTWIDIGTP+VSFLVALDAGS
Sbjct: 69  LGNDLKRQRMKLGSQKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGS 128

Query: 125 DLLWVPCDCIQCAPLSASHYS-SLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDD 184
           DLLWVPCDCIQCAPLSAS+Y+ SLDRDLS Y+P+LS+TS++LSC HQLC W + CK+P D
Sbjct: 129 DLLWVPCDCIQCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKD 188

Query: 185 PCSYKRDYYT-DNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAA 244
           PC Y  +Y   +NT+++GF++EDKLHLAS   H  +++LQASVVLGCGRKQ G + DGAA
Sbjct: 189 PCPYIFNYDDFENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAA 248

Query: 245 PDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFG 304
           PDGVMGLGPG+ISVP+LLAKAGL++N FSLCFD N SGRILFGD G A+QQ+T FLP+ G
Sbjct: 249 PDGVMGLGPGDISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQG 308

Query: 305 EFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRI 364
            + AYFV VES+CVG+SCL++SGF ALVDSGSSFTYLP+E+Y ++V EFDKQV  NA RI
Sbjct: 309 TYVAYFVGVESYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQV--NAKRI 368

Query: 365 ILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEETD 424
             Q+  W+YCYN+SS E   IP+++L FP NQ+F +H+P Y++P  QG+ +FCL+L+ TD
Sbjct: 369 SFQDGLWDYCYNASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTD 428

Query: 425 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGH 484
             YG+IGQN M+GYR+VFD ENL+LGWS S C D +     H  PP ++ SP  LPT+  
Sbjct: 429 GSYGIIGQNFMIGYRMVFDIENLKLGWSNSSCQDTSDSADVHLAPPPDNKSPNPLPTNEQ 488

Query: 485 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLL 522
            S P    +AP  A   S  S  A     F    + L LLL
Sbjct: 489 QSIPRTPSVAPAVAGRTSSESSAASLVIPFLHLMISLLLLL 527

BLAST of Cp4.1LG01g20080 vs. TrEMBL
Match: A0A067K9F2_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11999 PE=3 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 1.5e-176
Identity = 304/509 (59.72%), Positives = 390/509 (76.62%), Query Frame = 1

Query: 1   MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNG-NASGKFWPRRNSLK 60
           MAN LV+++   CFL + S+ L  SS+L+HRFSDEAKALW SRNG N S   WP+++S +
Sbjct: 4   MANRLVLVVVCFCFLFEGSIGLTFSSKLIHRFSDEAKALWISRNGGNMSSDLWPKKHSFE 63

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L   DLKR+R+K+GS+ +++ P  G++  FFGNE DWLHYTWIDIGTP+VSFLVAL
Sbjct: 64  YFQLLLVNDLKRQRMKLGSQNQLLIPEHGSQTFFFGNELDWLHYTWIDIGTPNVSFLVAL 123

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           DAGSDLLWVPC+CIQCAPLSAS+YSSLDRDLS Y P+LS+TS++L CSHQLC   + CK+
Sbjct: 124 DAGSDLLWVPCECIQCAPLSASYYSSLDRDLSEYRPSLSSTSKHLPCSHQLCELGSNCKN 183

Query: 181 PDDPCSYKRDYY-TDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLD 240
             +PC Y  +Y  T NTS+SG++IEDKLHLAS S++ T+R +QASV++GCGRKQSG YLD
Sbjct: 184 LKEPCPYIANYADTGNTSSSGYLIEDKLHLASVSENETRRRVQASVIIGCGRKQSGGYLD 243

Query: 241 GAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLP 300
           GAAPDGVMGLGPG+ISVP+ LAKAGL++ +FSLCF+ N SGRILFGD   A+Q++T  L 
Sbjct: 244 GAAPDGVMGLGPGSISVPSFLAKAGLIQKSFSLCFNENDSGRILFGDQVHASQKSTPLLS 303

Query: 301 LFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNA 360
           + G +  YFVEVES+CVG SCL++SGF ALVD+G+SFT+LP ++Y KIV EFDKQV  NA
Sbjct: 304 IEGNYVTYFVEVESYCVGHSCLKQSGFKALVDTGTSFTFLPRKVYNKIVLEFDKQV--NA 363

Query: 361 TRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLE 420
            RI  Q  PW+YCYN+SS E   IP+M++ FP+NQSF +H P Y++P +Q + +FCLTL+
Sbjct: 364 ERISSQGGPWDYCYNTSSRELVNIPAMQIQFPMNQSFLVHKPTYSVPQNQEFTIFCLTLQ 423

Query: 421 ETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPT 480
           +TD+DYG+IG + + GYR+VFD ENL+ GWS S C DI+        PP +D SP  LPT
Sbjct: 424 QTDEDYGIIGHDFLTGYRVVFDMENLKFGWSSSNCQDISDNTEVRVAPPPDDKSPNPLPT 483

Query: 481 DGHLSPPNRQEIAPTAARAFSKSSLTAPH 507
           +   S PN+  +AP  A   S    +A H
Sbjct: 484 NEQQSVPNKNAVAPALAGRTSSKPSSASH 510

BLAST of Cp4.1LG01g20080 vs. TrEMBL
Match: B9RJF1_RICCO (Aspartic proteinase nepenthesin-2, putative OS=Ricinus communis GN=RCOM_1033780 PE=3 SV=1)

HSP 1 Score: 614.0 bits (1582), Expect = 1.7e-172
Identity = 305/505 (60.40%), Positives = 380/505 (75.25%), Query Frame = 1

Query: 8   LLFVACF--LVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLK 67
           LLFV CF  L + S+ L  SS+L+HRFS+EAK+L  S N N S + WP +NS +Y + L 
Sbjct: 6   LLFVICFCFLSNHSIGLTFSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLLL 65

Query: 68  DYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDL 127
           D DLKR+++K+G++ +++FPS G+   F+GN+ DWLHYTWIDIGTP+VSFLVALDAGSDL
Sbjct: 66  DNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSDL 125

Query: 128 LWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCS 187
            WVPCDCIQCAPLSAS Y  LDRDLS Y P+LS TS++LSC+HQLC   + CK+  DPC 
Sbjct: 126 SWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCP 185

Query: 188 YKRDYYTDNTSTSGFMIEDKLHLASFS--KHGTQRLLQASVVLGCGRKQSGYYLDGAAPD 247
           Y  DY   NTS+SGF++ED LHLAS S   + TQ+ +QASV+LGCGRKQ+G YLDGAAPD
Sbjct: 186 YIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPD 245

Query: 248 GVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEF 307
           GVMGLGPG+ISVP+LLAKAGL+R +FSLCFD NGSG ILFGD G  +Q++T  LP  G +
Sbjct: 246 GVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNY 305

Query: 308 DAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIIL 367
           DAY +EVES+CVG+SCL++SGF ALVDSG+SFTYLP ++Y KIV EFDKQV  NA RI  
Sbjct: 306 DAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQV--NAQRISS 365

Query: 368 QEFPWNYCYNSSSLESSYIPSMKLVFPLNQS-FIHDPVYTLPDSQGYKLFCLTLEETDDD 427
           Q  PWNYCYN+SS +   +P+M+L F +NQS  IH+  Y +P +Q + +FCLTL+ TD +
Sbjct: 366 QGGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLN 425

Query: 428 YGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGHLS 487
           YG+IGQN M GYR+VFD ENL+LGWS S C DI+        P  ND SP  LPT+   S
Sbjct: 426 YGIIGQNYMTGYRVVFDMENLKLGWSSSNCKDISDETEVTLAPSPNDQSPNPLPTNEQQS 485

Query: 488 PPNRQEIAP-TAARAFSKSSLTAPH 507
            PN+Q +AP  A R  SK S+ + H
Sbjct: 486 VPNKQGVAPAVAGRTSSKHSVASQH 508

BLAST of Cp4.1LG01g20080 vs. TAIR10
Match: AT5G10080.1 (AT5G10080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 462.2 bits (1188), Expect = 4.2e-130
Identity = 249/534 (46.63%), Positives = 347/534 (64.98%), Query Frame = 1

Query: 8   LLFVACFLV-DSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKD 67
           LLF   FL  + ++A   SSRL+HRFSDE +A  K+ + + S    P + SL+Y+  L +
Sbjct: 8   LLFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDS---LPNKQSLEYYRLLAE 67

Query: 68  YDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLL 127
            D +R+R+ +G+K + + PSEG++ +  GN+F WLHYTWIDIGTPSVSFLVALD GS+LL
Sbjct: 68  SDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLL 127

Query: 128 WVPCDCIQCAPLSASHYSSL-DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCS 187
           W+PC+C+QCAPL++++YSSL  +DL+ YNP+ S+TS+   CSH+LC  ++ C+SP + C 
Sbjct: 128 WIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCP 187

Query: 188 YKRDYYTDNTSTSGFMIEDKLHLASFSKH---GTQRLLQASVVLGCGRKQSGYYLDGAAP 247
           Y  +Y + NTS+SG ++ED LHL   + +        ++A VV+GCG+KQSG YLDG AP
Sbjct: 188 YTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAP 247

Query: 248 DGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL-FG 307
           DG+MGLGP  ISVP+ L+KAGL+RN+FSLCFD   SGRI FGD GP+ QQ+T FL L   
Sbjct: 248 DGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNN 307

Query: 308 EFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRI 367
           ++  Y V VE+ C+G+SCL+++ F   +DSG SFTYLP EIY+K+  E D+ +  NAT  
Sbjct: 308 KYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHI--NATSK 367

Query: 368 ILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEET- 427
             +   W YCY SS+     +P++KL F  N +F IH P++    SQG   FCL +  + 
Sbjct: 368 NFEGVSWEYCYESSA--EPKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSG 427

Query: 428 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKP----PSNDGSPTAL 487
            +  G IGQN M GYR+VFDREN++LGWS SKC      + D  +P    P +  SP  L
Sbjct: 428 QEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKC------QEDKIEPPQASPGSTSSPNPL 487

Query: 488 PTDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLLFEFVESML 530
           PTD   S          A +  SK+  ++  +S  S   L   LLL  ++ S++
Sbjct: 488 PTDEQQSRGGHAVSPAIAGKTPSKTPSSSSSYSFSSIMRLFNSLLLLHWLASLM 528

BLAST of Cp4.1LG01g20080 vs. TAIR10
Match: AT4G35880.1 (AT4G35880.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 284.6 bits (727), Expect = 1.2e-76
Identity = 179/459 (39.00%), Positives = 263/459 (57.30%), Query Frame = 1

Query: 4   FLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFET 63
           FL+ +L +  F   S      +  + HRFSDE K  W    G  + KF P + S +YF  
Sbjct: 11  FLIPILMLLSF--GSCNGRIFTFEMHHRFSDEVKQ-WSDSTGRFA-KF-PPKGSFEYFNA 70

Query: 64  L--KDYDLKRRRLKIG---SKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVA 123
           L  +D+ ++ RRL      S+  + F S+GN      +   +LHYT + +GTP + F+VA
Sbjct: 71  LVLRDWLIRGRRLSESESESESSLTF-SDGNSTSRISS-LGFLHYTTVKLGTPGMRFMVA 130

Query: 124 LDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCK 183
           LD GSDL WVPCDC +CAP   + Y+S + +LS YNP +S T++ ++C++ LCA    C 
Sbjct: 131 LDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCL 190

Query: 184 SPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLD 243
                C Y   Y +  TSTSG ++ED +HL +  K+  +  ++A V  GCG+ QSG +LD
Sbjct: 191 GTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLD 250

Query: 244 GAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLP 303
            AAP+G+ GLG   ISVP++LA+ GLV ++FS+CF ++G GRI FGD G + Q+ T F  
Sbjct: 251 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-N 310

Query: 304 LFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNA 363
           L      Y + V    VG++ +    F AL D+G+SFTYL   +Y  +   F  Q + + 
Sbjct: 311 LNPSHPNYNITVTRVRVGTTLIDDE-FTALFDTGTSFTYLVDPMYTTVSESFHSQAQ-DK 370

Query: 364 TRIILQEFPWNYCYN-SSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTL 423
                   P+ YCY+ S+   +S IPS+ L    N  F I+DP+  +  ++G  ++CL +
Sbjct: 371 RHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAI 430

Query: 424 EETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDI 456
            ++  +  +IGQN M GYR+VFDRE L L W K  C DI
Sbjct: 431 VKS-SELNIIGQNYMTGYRVVFDREKLVLAWKKFDCYDI 454

BLAST of Cp4.1LG01g20080 vs. TAIR10
Match: AT2G17760.1 (AT2G17760.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 280.8 bits (717), Expect = 1.7e-75
Identity = 186/493 (37.73%), Positives = 265/493 (53.75%), Query Frame = 1

Query: 30  HRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKDYD--LKRRRLKIGSKYEVIFPSE 89
           HRFSD+         G   G   P R+S KY+  +   D  ++ RRL    +  V F S+
Sbjct: 39  HRFSDQVV-------GVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SD 98

Query: 90  GNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCA-PLSASHYSSL 149
           GNE V   +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   SSL
Sbjct: 99  GNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL 158

Query: 150 DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCSYKRDYYTDNTSTSGFMIEDKL 209
           D  L+ Y+P  S+TS  + C+  LC     C SP+  C Y+  Y ++ TS++G ++ED L
Sbjct: 159 D--LNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVL 218

Query: 210 HLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVR 269
           HL S  K  + + + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LAK G+  
Sbjct: 219 HLVSNDK--SSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAA 278

Query: 270 NTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYFVEVESFCVGSSCLQKSGFH 329
           N+FS+CF N+G+GRI FGD G   Q+ T  L +      Y + V    VG +      F 
Sbjct: 279 NSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNTGDLE-FD 338

Query: 330 ALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSY-IPSM 389
           A+ DSG+SFTYL    Y  I   F+        +    E P+ YCY  S  + S+  P++
Sbjct: 339 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 398

Query: 390 KLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQ 449
            L      S+ ++ P+  +P  +   ++CL + +  +D  +IGQN M GYR+VFDRE L 
Sbjct: 399 NLTMKGGSSYPVYHPLVVIP-MKDTDVYCLAIMKI-EDISIIGQNFMTGYRVVFDREKLI 458

Query: 450 LGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGHLSP-----PNRQEIAPTAARAFSK 509
           LGW +S C     GE      PSN  S +A P      P     P+++    T + A+S 
Sbjct: 459 LGWKESDCYT---GETSARTLPSNRSSSSARPPASSFDPEATNIPSQRPNTSTTSAAYSL 511

Query: 510 S-SLTAPHFSLFS 512
           S SL+   FS+ +
Sbjct: 519 SISLSLFFFSILA 511

BLAST of Cp4.1LG01g20080 vs. TAIR10
Match: AT3G51330.1 (AT3G51330.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 238.8 bits (608), Expect = 7.6e-63
Identity = 178/536 (33.21%), Positives = 266/536 (49.63%), Query Frame = 1

Query: 4   FLVVLLFVACFLVDSSVAL-RLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFE 63
           F+++ L V C+ ++   A  + S  + H FSD  K               P + SL+YF+
Sbjct: 8   FVLLSLLVVCWGLERCEASGKFSFEVHHMFSDRVK------QSLGLDDLVPEKGSLEYFK 67

Query: 64  TLKDYD--LKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 123
            L   D  ++ R L   ++   I    GN  +   +   +LHY  + +GTP+  FLVALD
Sbjct: 68  VLAQRDRLIRGRGLASNNEETPITFMRGNRTISI-DLLGFLHYANVSVGTPATWFLVALD 127

Query: 124 AGSDLLWVPCDCIQCAPLSASHYS-SLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 183
            GSDL W+PC+C             S  R L+ Y+P  S+TS  + CS   C  S+ C S
Sbjct: 128 TGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSS 187

Query: 184 PDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 243
           P   C Y+  Y + +T T+G + ED LHL +    G + + +A++ LGCG+ Q+G+    
Sbjct: 188 PASSCPYQIQYLSKDTFTTGTLFEDVLHLVT-EDEGLEPV-KANITLGCGKNQTGFLQSS 247

Query: 244 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDN--NGSGRILFGDNGPATQQTTQFL 303
           AA +G++GLG  + SVP++LAKA +  N+FS+CF N  +  GRI FGD G   Q  T  L
Sbjct: 248 AAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLL 307

Query: 304 PLFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLN 363
           P       Y V V    VG   +      AL D+G+SFT+L    Y  I   FD  V  +
Sbjct: 308 PT-EPSPTYAVSVTEVSVGGDAVGVQ-LLALFDTGTSFTHLLEPEYGLITKAFDDHV-TD 367

Query: 364 ATRIILQEFPWNYCYNSSSLESSYI-PSMKLVFP-LNQSFIHDPVYTLPDSQGYKLFCL- 423
             R I  E P+ +CY+ S  +++ + P + + F   +Q F+ +P++ + +     ++CL 
Sbjct: 368 KRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLG 427

Query: 424 TLEETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGS--- 483
            L+  D    +IGQN M GYR+VFDRE + LGW +S C +    E+    PP  +     
Sbjct: 428 ILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDCFEDESLESTTPPPPETEAPSPS 487

Query: 484 -----PTALPTDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLLF 523
                P+ LP     +PP   +I P  +   S +   A    L S   L L LL F
Sbjct: 488 ASTPLPSLLPPPAAATPP---QIDPRNSTRNSGTGTAANLVPLASQLLLLLPLLAF 528

BLAST of Cp4.1LG01g20080 vs. TAIR10
Match: AT3G51360.1 (AT3G51360.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 234.6 bits (597), Expect = 1.4e-61
Identity = 158/473 (33.40%), Positives = 236/473 (49.89%), Query Frame = 1

Query: 18  SSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKDYDLKRRRLKIG 77
           SSV+  LS  + HRFS++ K +         G   P   SL Y++ L   D  R+     
Sbjct: 16  SSVSGSLSFEIHHRFSEQVKTV-------LGGHGLPEMGSLDYYKALVHRDRGRQLTSNN 75

Query: 78  SKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAP 137
           +    I  ++GN       E  +LHY  + IGTP+  FLVALD GSDL W+PC+C     
Sbjct: 76  NNQTTISFAQGNST----EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCV 135

Query: 138 LSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCSYKRDYYTDNTST 197
            S          L+ YNP+ S +S  ++C+  LCA    C SP   C Y+  Y +  + +
Sbjct: 136 RSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKS 195

Query: 198 SGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPT 257
           +G ++ED +H+++  + G  R   A +  GC   Q G + +  A +G+MGL   +I+VP 
Sbjct: 196 TGVLVEDVIHMST--EEGEAR--DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPN 255

Query: 258 LLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYF--VEVESFCV 317
           +L KAG+  ++FS+CF  NG G I FGD G + Q  T   PL G     F  V +  F V
Sbjct: 256 MLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLET---PLSGTISPMFYDVSITKFKV 315

Query: 318 GSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYN-S 377
           G   +    F A  DSG++ T+L    Y  +   F   V        +   P+ +CY  +
Sbjct: 316 GKVTVDTE-FTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDS-PFEFCYIIT 375

Query: 378 SSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQG-YKLFCLT-LEETDDDYGVIGQNLM 437
           S+ +   +PS+        ++ +  P+     S G ++++CL  L++ + D+ +IGQN M
Sbjct: 376 STSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFM 435

Query: 438 VGYRLVFDRENLQLGWSKSKCLDIN--HGEADHAKPPSNDGSPTALPTDGHLS 483
             YR+V DRE   LGW KS C D N   G    AKPPS   +PT+ P   +LS
Sbjct: 436 TNYRIVHDRERRILGWKKSNCNDTNGFTGPTALAKPPSM--APTSSPRTINLS 465

BLAST of Cp4.1LG01g20080 vs. NCBI nr
Match: gi|659121083|ref|XP_008460494.1| (PREDICTED: aspartic proteinase-like protein 1 [Cucumis melo])

HSP 1 Score: 845.1 bits (2182), Expect = 6.6e-242
Identity = 415/524 (79.20%), Positives = 456/524 (87.02%), Query Frame = 1

Query: 1   MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKS-RNGNASGKFWPRRNSLK 60
           MAN  ++LL +AC  VD S+ L LS +LVHRFSDEAK+LWKS R GN S KFWP RNSLK
Sbjct: 1   MANCSLLLLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWKSTRTGNVSAKFWPPRNSLK 60

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L DYDLKRRRLKIGSKY+++FPSEG++V+FFGNEF+WLHYTWIDIGTP V FLVAL
Sbjct: 61  YFQMLLDYDLKRRRLKIGSKYDMLFPSEGSQVIFFGNEFNWLHYTWIDIGTPRVPFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           D GSDLLWVPCDC+QCAPLSAS+YS LDRDLS YNPALS+TS++L C HQLCAWSTTCKS
Sbjct: 121 DVGSDLLWVPCDCVQCAPLSASYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKS 180

Query: 181 PDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 240
           P+DPC+YKRDYY+DNTSTSG+MIEDKLHL SFSKHGT  LLQASVVLGCGRKQSG YLDG
Sbjct: 181 PNDPCTYKRDYYSDNTSTSGYMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG 240

Query: 241 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL 300
           AAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGSGRI+FGD+GPATQQTTQFLPL
Sbjct: 241 AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRIIFGDDGPATQQTTQFLPL 300

Query: 301 FGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNAT 360
           FGEF AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK NAT
Sbjct: 301 FGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNAT 360

Query: 361 RIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEET 420
           RI+LQE PWNYCYN S+L S  IPSMKLVFPLNQ FIHDPVY LP +QGYK+FCLTLEET
Sbjct: 361 RIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQIFIHDPVYILPANQGYKVFCLTLEET 420

Query: 421 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDG---SPTALP 480
           D+DYGVIGQNLMVGYR+VFDRENL+LGWSKSKCLDIN    +HAKPPSN+G   SP ALP
Sbjct: 421 DEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALP 480

Query: 481 TDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLL 521
                 P NRQ IAPTAAR  SKSSL+A +FS      L  FL+
Sbjct: 481 ------PTNRQAIAPTAARTSSKSSLSASYFSPLLLLLLAAFLV 518

BLAST of Cp4.1LG01g20080 vs. NCBI nr
Match: gi|449445106|ref|XP_004140314.1| (PREDICTED: aspartic proteinase-like protein 1 [Cucumis sativus])

HSP 1 Score: 833.2 bits (2151), Expect = 2.6e-238
Identity = 411/524 (78.44%), Positives = 453/524 (86.45%), Query Frame = 1

Query: 1   MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRN-GNASGKFWPRRNSLK 60
           MAN  ++LLF+A   V+ S+AL LS  LVHRFSDEAK+LW+SR  GN S KFWP  NSLK
Sbjct: 1   MANCALLLLFIASLFVNCSLALTLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLK 60

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L DYDLKRRRL IGSKY+V+FPSEG++V+FFGNEF+WLHYTWID+GTPSV FLVAL
Sbjct: 61  YFQMLMDYDLKRRRLNIGSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           D GSDLLWVPCDCIQCAPLSA++YS LDRDLS YNPALS+TS++L C HQLCAWSTTCKS
Sbjct: 121 DVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKS 180

Query: 181 PDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 240
            +DPC+YKRDYY+DNTSTSGFMIEDKL L SFSKHGT  LLQASVV GCGRKQSG YLDG
Sbjct: 181 ANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDG 240

Query: 241 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL 300
           AAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGSGRILFGD+GPATQQTTQFLPL
Sbjct: 241 AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPL 300

Query: 301 FGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNAT 360
           FGEF AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK+NAT
Sbjct: 301 FGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNAT 360

Query: 361 RIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEET 420
           RI+L+E PWNYCYN S+L S  IPSM+LVFPLNQ FIHDPVY LP +QGYK+FCLTLEET
Sbjct: 361 RIVLRELPWNYCYNISTLVSFNIPSMQLVFPLNQIFIHDPVYVLPANQGYKVFCLTLEET 420

Query: 421 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDG---SPTALP 480
           D+DYGVIGQNLMVGYR+VFDRENL+LGWSKSKCLDIN    +HAKPPSN+G   SP ALP
Sbjct: 421 DEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALP 480

Query: 481 TDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLL 521
                 P NRQ IAPTAAR  SKSSL+A HFS      L  FL+
Sbjct: 481 ------PTNRQAIAPTAARTSSKSSLSASHFSPLLLLLLAAFLV 518

BLAST of Cp4.1LG01g20080 vs. NCBI nr
Match: gi|1009136476|ref|XP_015885545.1| (PREDICTED: aspartic proteinase-like protein 1 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 657.1 bits (1694), Expect = 2.5e-185
Identity = 335/528 (63.45%), Positives = 409/528 (77.46%), Query Frame = 1

Query: 1   MANFLVVLLFV-ACFLVDSSVALRLSSRLVHRFSDEAKALW-KSRNGNASGKFWPRRNSL 60
           MAN  + L F+ AC LV+ SVAL  S++L+HRFSDEAKAL   SR GN S K WP+RNSL
Sbjct: 1   MANRALALFFIMACLLVNGSVALTFSTKLIHRFSDEAKALLGSSRGGNFSAKLWPKRNSL 60

Query: 61  KYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVA 120
           +YF  L   D+ R+R+K+GSKY+++FPSEG+E +FFGN+FDWLHYTWIDIGTP+VSFLVA
Sbjct: 61  EYFRLLLRGDVNRQRIKLGSKYDLLFPSEGSETLFFGNDFDWLHYTWIDIGTPNVSFLVA 120

Query: 121 LDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCK 180
           LDAGSDLLWVPCDCIQCAPLSAS+Y++LDRD+S Y+P+LS+TS+ L CSHQLC  ST CK
Sbjct: 121 LDAGSDLLWVPCDCIQCAPLSASYYNTLDRDMSEYSPSLSSTSKQLPCSHQLCKLSTNCK 180

Query: 181 SPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLD 240
            P DPC Y  +Y T+NTS+SGF+IEDKLHLAS S H     +QASV+LGCGRKQSG YL+
Sbjct: 181 GPKDPCHYTAEYNTENTSSSGFLIEDKLHLASSSGHAKPTYVQASVILGCGRKQSGGYLE 240

Query: 241 GAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLP 300
           GAAPDGVMGLGPG IS+P+LLAKAGLV+N+FSLCFD+NGSGR+LFGD G  T Q T FLP
Sbjct: 241 GAAPDGVMGLGPGEISIPSLLAKAGLVQNSFSLCFDDNGSGRLLFGDEGVVTHQYTPFLP 300

Query: 301 LFGEFD-AYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLN 360
           + G+++ AY V VES+CVGSSCLQ++GF ALVDSGSSFTY+P++ YK I+ EFDKQ  +N
Sbjct: 301 IAGKYNIAYSVGVESYCVGSSCLQETGFQALVDSGSSFTYVPSKAYKTIILEFDKQ--MN 360

Query: 361 ATRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTL 420
           ATRI  Q++PW YCYN SSLE   IP+MKL+F  NQSF I +PV +   +Q + +FCLTL
Sbjct: 361 ATRISHQQYPWTYCYNVSSLELPNIPTMKLIFTSNQSFLIQNPVLSDSANQEFVIFCLTL 420

Query: 421 EETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALP 480
            +T+D+YG+IGQN M GYR+VFDRENL+LGWSKS C DI+ G+  +  PPS DGSP  LP
Sbjct: 421 LQTEDEYGIIGQNYMTGYRMVFDRENLRLGWSKSNCDDISKGKGLNPAPPS-DGSPGTLP 480

Query: 481 TDGHLSPPNRQEIAPT--AARAFSKSSLTAPHFSL-FSCCCLRLFLLL 522
                S  N Q   PT  A    S SS+    + + +  C L  FLLL
Sbjct: 481 ASEQQSTANTQAAEPTAMAVTTTSTSSVAVLSYQIPYRFCILTSFLLL 525

BLAST of Cp4.1LG01g20080 vs. NCBI nr
Match: gi|1009136474|ref|XP_015885544.1| (PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 648.7 bits (1672), Expect = 9.0e-183
Identity = 335/539 (62.15%), Positives = 409/539 (75.88%), Query Frame = 1

Query: 1   MANFLVVLLFV-ACFLVDSSVALRLSSRLVHRFSDEAKALW-KSRNGNASGKFWPRRNSL 60
           MAN  + L F+ AC LV+ SVAL  S++L+HRFSDEAKAL   SR GN S K WP+RNSL
Sbjct: 1   MANRALALFFIMACLLVNGSVALTFSTKLIHRFSDEAKALLGSSRGGNFSAKLWPKRNSL 60

Query: 61  KYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVA 120
           +YF  L   D+ R+R+K+GSKY+++FPSEG+E +FFGN+FDWLHYTWIDIGTP+VSFLVA
Sbjct: 61  EYFRLLLRGDVNRQRIKLGSKYDLLFPSEGSETLFFGNDFDWLHYTWIDIGTPNVSFLVA 120

Query: 121 LDAGSDLLWVPCDCIQCAPLSASHYSSL-----------DRDLSAYNPALSNTSQYLSCS 180
           LDAGSDLLWVPCDCIQCAPLSAS+Y++L           DRD+S Y+P+LS+TS+ L CS
Sbjct: 121 LDAGSDLLWVPCDCIQCAPLSASYYNTLVLPCIWAPMMQDRDMSEYSPSLSSTSKQLPCS 180

Query: 181 HQLCAWSTTCKSPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLG 240
           HQLC  ST CK P DPC Y  +Y T+NTS+SGF+IEDKLHLAS S H     +QASV+LG
Sbjct: 181 HQLCKLSTNCKGPKDPCHYTAEYNTENTSSSGFLIEDKLHLASSSGHAKPTYVQASVILG 240

Query: 241 CGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNG 300
           CGRKQSG YL+GAAPDGVMGLGPG IS+P+LLAKAGLV+N+FSLCFD+NGSGR+LFGD G
Sbjct: 241 CGRKQSGGYLEGAAPDGVMGLGPGEISIPSLLAKAGLVQNSFSLCFDDNGSGRLLFGDEG 300

Query: 301 PATQQTTQFLPLFGEFD-AYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKI 360
             T Q T FLP+ G+++ AY V VES+CVGSSCLQ++GF ALVDSGSSFTY+P++ YK I
Sbjct: 301 VVTHQYTPFLPIAGKYNIAYSVGVESYCVGSSCLQETGFQALVDSGSSFTYVPSKAYKTI 360

Query: 361 VFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPD 420
           + EFDKQ  +NATRI  Q++PW YCYN SSLE   IP+MKL+F  NQSF I +PV +   
Sbjct: 361 ILEFDKQ--MNATRISHQQYPWTYCYNVSSLELPNIPTMKLIFTSNQSFLIQNPVLSDSA 420

Query: 421 SQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKP 480
           +Q + +FCLTL +T+D+YG+IGQN M GYR+VFDRENL+LGWSKS C DI+ G+  +  P
Sbjct: 421 NQEFVIFCLTLLQTEDEYGIIGQNYMTGYRMVFDRENLRLGWSKSNCDDISKGKGLNPAP 480

Query: 481 PSNDGSPTALPTDGHLSPPNRQEIAPT--AARAFSKSSLTAPHFSL-FSCCCLRLFLLL 522
           PS DGSP  LP     S  N Q   PT  A    S SS+    + + +  C L  FLLL
Sbjct: 481 PS-DGSPGTLPASEQQSTANTQAAEPTAMAVTTTSTSSVAVLSYQIPYRFCILTSFLLL 536

BLAST of Cp4.1LG01g20080 vs. NCBI nr
Match: gi|1009140387|ref|XP_015887624.1| (PREDICTED: aspartic proteinase-like protein 1 [Ziziphus jujuba])

HSP 1 Score: 647.5 bits (1669), Expect = 2.0e-182
Identity = 322/491 (65.58%), Positives = 392/491 (79.84%), Query Frame = 1

Query: 1   MANFLVVLLFV-ACFLVDSSVALRLSSRLVHRFSDEAKA-LWKSRNGNASGKFWPRRNSL 60
           MAN  + L F+ A  LV+ SVAL  S++L+HRFSDEAKA L   R GN S K WP+RNSL
Sbjct: 1   MANRALALFFIMAWLLVNGSVALTFSTKLIHRFSDEAKAWLGSIRGGNFSAKLWPKRNSL 60

Query: 61  KYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVA 120
           +YF  L   D+ R+R+K+GSKY+++FPSEGNE +FFGN+FDWLHYTWIDIGTP+VSFLVA
Sbjct: 61  EYFRLLLRGDVNRQRIKLGSKYDLLFPSEGNETLFFGNDFDWLHYTWIDIGTPNVSFLVA 120

Query: 121 LDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCK 180
           LDAGSDLLWVPCDCIQCAPLSAS+Y++LDRD+S Y+P+LS+TS+ L CSHQLC  ST CK
Sbjct: 121 LDAGSDLLWVPCDCIQCAPLSASYYNTLDRDMSEYSPSLSSTSKQLPCSHQLCKLSTNCK 180

Query: 181 SPDDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLD 240
            P DPC Y  +Y T+NTS+SGF+IEDKLHLAS S H     +QASV+LGCGRKQSG YL+
Sbjct: 181 GPKDPCHYTAEYNTENTSSSGFLIEDKLHLASSSGHAKPTYVQASVILGCGRKQSGGYLE 240

Query: 241 GAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLP 300
           GAAPDGVMGLGPG IS+P+LLAKAGLV+N+FSLCFD+NGSGR+LFGD G  T Q T FLP
Sbjct: 241 GAAPDGVMGLGPGEISIPSLLAKAGLVQNSFSLCFDDNGSGRLLFGDEGVVTHQYTPFLP 300

Query: 301 LFGEFD-AYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLN 360
           + G+++ AY V VES+CVGSSCLQ++GF ALVDSGSSFTY+P++ YK I+ EFDKQ  +N
Sbjct: 301 IAGKYNIAYSVGVESYCVGSSCLQETGFQALVDSGSSFTYVPSKAYKTIILEFDKQ--MN 360

Query: 361 ATRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTL 420
           ATRI  Q++PW YCYN SSLE   IP+MKL+F  NQSF I +PV +   +Q + +FCLTL
Sbjct: 361 ATRISPQQYPWTYCYNVSSLELPNIPTMKLIFTSNQSFLIQNPVLSDSANQEFVIFCLTL 420

Query: 421 EETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALP 480
            +T+D+YG+IGQN M+GYR+VFDRENL+LGWSKS C DI+ G+  +  PPS DGSP  LP
Sbjct: 421 LQTEDEYGIIGQNYMMGYRMVFDRENLRLGWSKSNCDDISKGKGLNPAPPS-DGSPGTLP 480

Query: 481 TDGHLSPPNRQ 488
                S  N Q
Sbjct: 481 ASEQQSTANTQ 488
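
The percentages in each HSP header follow directly from the residue counts, for example Identity = 179/459 gives 39.00%. The Python sketch below shows one way to parse such a header line and recompute the percentages. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Pattern for an HSP statistics line as printed in the reports above.
# "Posi?tives" accepts the label with or without the second "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check when these reports are copied or re-extracted.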

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
ASPL1_ARATH  7.5e-129  46.63     Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana GN=At5g10080 PE=1 SV=...
APF1_ARATH   3.1e-74   37.73     Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1
ASPL2_ARATH  2.8e-27   27.97     Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=...
AED1_ARATH   1.0e-24   28.10     Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1
APCB1_ARATH  1.0e-24   27.54     Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1
Match Name        E-value   Identity  Description
A0A0A0KN37_CUCSA  1.8e-238  78.44     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G407090 PE=3 SV=1
A0A061FFI5_THECC  4.5e-181  61.66     Eukaryotic aspartyl protease family protein isoform 1 OS=Theobroma cacao GN=TCM_...
B9GUJ4_POPTR      6.1e-178  60.08     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s09270g PE=3 SV=1
A0A067K9F2_JATCU  1.5e-176  59.72     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11999 PE=3 SV=1
B9RJF1_RICCO      1.7e-172  60.40     Aspartic proteinase nepenthesin-2, putative OS=Ricinus communis GN=RCOM_1033780 ...
Match Name   E-value   Identity  Description
AT5G10080.1  4.2e-130  46.63     Eukaryotic aspartyl protease family protein
AT4G35880.1  1.2e-76   39.00     Eukaryotic aspartyl protease family protein
AT2G17760.1  1.7e-75   37.73     Eukaryotic aspartyl protease family protein
AT3G51330.1  7.6e-63   33.21     Eukaryotic aspartyl protease family protein
AT3G51360.1  1.4e-61   33.40     Eukaryotic aspartyl protease family protein
Match Name                         E-value   Identity  Description
gi|659121083|ref|XP_008460494.1|   6.6e-242  79.20     PREDICTED: aspartic proteinase-like protein 1 [Cucumis melo]
gi|449445106|ref|XP_004140314.1|   2.6e-238  78.44     PREDICTED: aspartic proteinase-like protein 1 [Cucumis sativus]
gi|1009136476|ref|XP_015885545.1|  2.5e-185  63.45     PREDICTED: aspartic proteinase-like protein 1 isoform X2 [Ziziphus jujuba]
gi|1009136474|ref|XP_015885544.1|  9.0e-183  62.15     PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Ziziphus jujuba]
gi|1009140387|ref|XP_015887624.1|  2.0e-182  65.58     PREDICTED: aspartic proteinase-like protein 1 [Ziziphus jujuba]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: INTERPRO
Term       Definition
IPR021109  Peptidase_aspartic_dom_sf
IPR001969  Aspartic_peptidase_AS
IPR001461  Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g20080.1  Cp4.1LG01g20080.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                  Source   Source Term       Source Description   Alignment
IPR001461  Aspartic peptidase A1 family     PRINTS   PR00792           PEPSIN               coord: 424..439, score: 7.6E-7; coord: 327..338, score: 7.6E-7; coord: 108..128, score: 7.
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES   coord: 5..460, score: 2.4E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE         coord: 327..338, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10                       coord: 97..285, score: 2.9E-38; coord: 291..453, score: 1.9
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases       coord: 97..454, score: 3.98
None       No IPR available                 PANTHER  PTHR13683:SF339   SUBFAMILY NOT NAMED  coord: 5..460, score: 2.4E