Cp4.1LG01g17730 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g17730
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Eukaryotic aspartyl protease family protein
Location: Cp4.1LG01 : 12987200 .. 12988591 (+)
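
The location field can be split into its parts and used to derive the feature length. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of any database API, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(text):
    """Parse 'CHROM : START .. END (STRAND)' into a dict.

    Assumption: coordinates are 1-based and inclusive.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span, so add 1 to the coordinate difference.
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG01 : 12987200 .. 12988591 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Cp4.1LG01 + 1392
```

Under that assumption the feature spans 1392 bases on the plus strand.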
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGTGTGTTTCCCAATTCCTCCATCTCACCTTCTTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCTCATCCCTCACGGATCCCTCCAATGCCATTCCCTCTCAATACCTGAAGTTCCCATTACTCCACACTAACCCCTTCTCCTCCCCTTCTCAAGCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCCTCCCACCGCCACAGCCCCACCCTCAAGTCCCCCCTCATCTCCGGCGCCTCCACCGGCTCTGGTCAGTACTTCGTCAATCTCCACCTCGGCACCCCTCCTCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTCAAATGCTCCCCCTGCCGCAACTGCTCCCACCACCCTCCCTCCTCTGCCTTCTTCCCCCGCCACTCATCCTCCTTCTCCCCTTTCCACTGCTTCGACCCCCACTGCCGCCTCCTTCCCCACCCCCCTTCTCACCGCTGCAACCACACCCACCTCCACTCCCCCTGCCCCTTCCTCTACTCCTACGCCGACTCCTCCCTCTCCTCCGGCTTCTTCTCCAAAGATGTCACCACCTTCAACACCTTCTCCGGGACCCACACCCAGACCCGTCTCAACGACCTTTCCTTTGGATGTGGCTTTCGGATCTCGGGTCCTAGCGTTTCGGGCGCCCGATTCAATGGAGCACGTGGAGTCATGGGATTGGGCAGAGGCCCGATCTCCTTCTCCTCCCAACTCGGCCGCCGATTTGGCAACACTTTTTCTTATTGTCTTATGGATTATACACTCTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGGCGGCCTCCGTAGCCTACCGGTGACGAACGCCTCAAAAATCAGCTACACCCCCTTGCAGATTAACCCTCTGTCCCCGACATTCTACTACATTGTCGTGAAGAGCATCACCGTGGACGGCGTGAAATTGCCCATCAATCCCAACGTGTGGGCCATCGACGAGCAAGGCAACGGCGGCACGGTGGTGGATTCCGGCACGACATTGACGTATCTAGCTGAGGCAGCATACGAGGAAGTGTTGAAGGCCATGAGACGGCGAGTGAAGCTGCCGAGAGCGCTCCAGTTGAGTCCAGGGTTCGATCTGTGCGTGAACGCATCGGGCGAGTCGCGGATGCGGAGTCTGCCGCAACTAAGATTCCGGGTAGGCGGCGGAGGTGTATTTGCGCCGCCGGCAAGGAACTATTTTGTGGAGACAGAGGAGGGGGTGATGTGCTTGGCGATCCGACCCGTGGATTCGGGAAATGGGTTTTCGGTGATTGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGATAGAGACAAGTCGAGGATGGGGTTCTCAAGGCGTGGCTGTGGGCTTCCTTGA

mRNA sequence

ATGCCGTGTGTTTCCCAATTCCTCCATCTCACCTTCTTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCTCATCCCTCACGGATCCCTCCAATGCCATTCCCTCTCAATACCTGAAGTTCCCATTACTCCACACTAACCCCTTCTCCTCCCCTTCTCAAGCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCCTCCCACCGCCACAGCCCCACCCTCAAGTCCCCCCTCATCTCCGGCGCCTCCACCGGCTCTGGTCAGTACTTCGTCAATCTCCACCTCGGCACCCCTCCTCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTCAAATGCTCCCCCTGCCGCAACTGCTCCCACCACCCTCCCTCCTCTGCCTTCTTCCCCCGCCACTCATCCTCCTTCTCCCCTTTCCACTGCTTCGACCCCCACTGCCGCCTCCTTCCCCACCCCCCTTCTCACCGCTGCAACCACACCCACCTCCACTCCCCCTGCCCCTTCCTCTACTCCTACGCCGACTCCTCCCTCTCCTCCGGCTTCTTCTCCAAAGATGTCACCACCTTCAACACCTTCTCCGGGACCCACACCCAGACCCGTCTCAACGACCTTTCCTTTGGATGTGGCTTTCGGATCTCGGGTCCTAGCGTTTCGGGCGCCCGATTCAATGGAGCACGTGGAGTCATGGGATTGGGCAGAGGCCCGATCTCCTTCTCCTCCCAACTCGGCCGCCGATTTGGCAACACTTTTTCTTATTGTCTTATGGATTATACACTCTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGGCGGCCTCCGTAGCCTACCGGTGACGAACGCCTCAAAAATCAGCTACACCCCCTTGCAGATTAACCCTCTGTCCCCGACATTCTACTACATTGTCGTGAAGAGCATCACCGTGGACGGCGTGAAATTGCCCATCAATCCCAACGTGTGGGCCATCGACGAGCAAGGCAACGGCGGCACGGTGGTGGATTCCGGCACGACATTGACGTATCTAGCTGAGGCAGCATACGAGGAAGTGTTGAAGGCCATGAGACGGCGAGTGAAGCTGCCGAGAGCGCTCCAGTTGAGTCCAGGGTTCGATCTGTGCGTGAACGCATCGGGCGAGTCGCGGATGCGGAGTCTGCCGCAACTAAGATTCCGGGTAGGCGGCGGAGGTGTATTTGCGCCGCCGGCAAGGAACTATTTTGTGGAGACAGAGGAGGGGGTGATGTGCTTGGCGATCCGACCCGTGGATTCGGGAAATGGGTTTTCGGTGATTGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGATAGAGACAAGTCGAGGATGGGGTTCTCAAGGCGTGGCTGTGGGCTTCCTTGA

Coding sequence (CDS)

ATGCCGTGTGTTTCCCAATTCCTCCATCTCACCTTCTTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCTCATCCCTCACGGATCCCTCCAATGCCATTCCCTCTCAATACCTGAAGTTCCCATTACTCCACACTAACCCCTTCTCCTCCCCTTCTCAAGCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCCTCCCACCGCCACAGCCCCACCCTCAAGTCCCCCCTCATCTCCGGCGCCTCCACCGGCTCTGGTCAGTACTTCGTCAATCTCCACCTCGGCACCCCTCCTCAAAGCCTCCTCCTCGTCGCCGATACCGGCAGCGACCTCGTCTGGGTCAAATGCTCCCCCTGCCGCAACTGCTCCCACCACCCTCCCTCCTCTGCCTTCTTCCCCCGCCACTCATCCTCCTTCTCCCCTTTCCACTGCTTCGACCCCCACTGCCGCCTCCTTCCCCACCCCCCTTCTCACCGCTGCAACCACACCCACCTCCACTCCCCCTGCCCCTTCCTCTACTCCTACGCCGACTCCTCCCTCTCCTCCGGCTTCTTCTCCAAAGATGTCACCACCTTCAACACCTTCTCCGGGACCCACACCCAGACCCGTCTCAACGACCTTTCCTTTGGATGTGGCTTTCGGATCTCGGGTCCTAGCGTTTCGGGCGCCCGATTCAATGGAGCACGTGGAGTCATGGGATTGGGCAGAGGCCCGATCTCCTTCTCCTCCCAACTCGGCCGCCGATTTGGCAACACTTTTTCTTATTGTCTTATGGATTATACACTCTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGGCGGCCTCCGTAGCCTACCGGTGACGAACGCCTCAAAAATCAGCTACACCCCCTTGCAGATTAACCCTCTGTCCCCGACATTCTACTACATTGTCGTGAAGAGCATCACCGTGGACGGCGTGAAATTGCCCATCAATCCCAACGTGTGGGCCATCGACGAGCAAGGCAACGGCGGCACGGTGGTGGATTCCGGCACGACATTGACGTATCTAGCTGAGGCAGCATACGAGGAAGTGTTGAAGGCCATGAGACGGCGAGTGAAGCTGCCGAGAGCGCTCCAGTTGAGTCCAGGGTTCGATCTGTGCGTGAACGCATCGGGCGAGTCGCGGATGCGGAGTCTGCCGCAACTAAGATTCCGGGTAGGCGGCGGAGGTGTATTTGCGCCGCCGGCAAGGAACTATTTTGTGGAGACAGAGGAGGGGGTGATGTGCTTGGCGATCCGACCCGTGGATTCGGGAAATGGGTTTTCGGTGATTGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGATAGAGACAAGTCGAGGATGGGGTTCTCAAGGCGTGGCTGTGGGCTTCCTTGA

Protein sequence

MPCVSQFLHLTFFLLLLLLLLLLSSLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP
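
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first and last codons of the CDS can be translated and compared with the ends of the protein. The `translate` function and the constant names are hypothetical and written for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown or ambiguous codons are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last six codons of the CDS shown above.
print(translate("ATGCCGTGTGTTTCCCAATTC"))  # MPCVSQF
print(translate("GGCTGTGGGCTTCCTTGA"))     # GCGLP
```

Both fragments agree with the start (`MPCVSQF`) and end (`GCGLP`) of the listed protein sequence.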
BLAST of Cp4.1LG01g17730 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 9.7e-56
Identity = 156/461 (33.84%), Positives = 232/461 (50.33%), Query Frame = 1

Query: 21  LLLSSLTDPSNAIPSQYLKFPLLHTNPFSS---PSQALSS----DTHRLSLLFS------ 80
           LL S     S++  S  +   L H +  SS   P +  SS    D+ R+  + +      
Sbjct: 55  LLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIP 114

Query: 81  --SHRHSPT---LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCR 140
             +  H+P      S ++SG S GSG+YF  L +GTP + + +V DTGSD+VW++C+PCR
Sbjct: 115 GRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR 174

Query: 141 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSL 200
            C +      F PR S +++   C  PHCR L    S  CN       C +  SY D S 
Sbjct: 175 RC-YSQSDPIFDPRKSKTYATIPCSSPHCRRL---DSAGCNTR--RKTCLYQVSYGDGSF 234

Query: 201 SSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPIS 260
           + G FS +  TF        + R+  ++ GCG    G       F GA G++GLG+G +S
Sbjct: 235 TVGDFSTETLTFR-------RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLS 294

Query: 261 FSSQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPT 320
           F  Q G RF   FSYCL+D + S  P+S +     +  +         +TPL  NP   T
Sbjct: 295 FPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDT 354

Query: 321 FYYIVVKSITVDGVKLP-INPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRR 380
           FYY+ +  I+V G ++P +  +++ +D+ GNGG ++DSGT++T L   AY  +  A R  
Sbjct: 355 FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVG 414

Query: 381 VK-LPRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETE-EGVMC 440
            K L RA   S  FD C + S  + ++ +P +     G  V + PA NY +  +  G  C
Sbjct: 415 AKTLKRAPDFSL-FDTCFDLSNMNEVK-VPTVVLHFRGADV-SLPATNYLIPVDTNGKFC 474

Query: 441 LAIRPVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGC 461
            A     +  G S+IGN+ QQGF + +D   SR+GF+  GC
Sbjct: 475 FAF--AGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
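
The percentages in an HSP summary line follow directly from the match counts and the alignment length. The sketch below re-derives them for the alignment above; `parse_hsp_summary` is a hypothetical helper written for this page's line layout, not a function from any BLAST library, and it accepts either spelling of the positives label.

```python
import re


def parse_hsp_summary(line):
    """Extract identity and positive counts from an HSP summary line.

    Returns the raw counts plus percentages recomputed from them.
    """
    pattern = (
        r"Identity\s*=\s*(\d+)/(\d+).*?"
        r"Posi?tives\s*=\s*(\d+)/(\d+)"
    )
    match = re.search(pattern, line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, positive, length2 = map(int, match.groups())
    if length != length2:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        "pct_identity": round(100 * identical / length, 2),
        "pct_positive": round(100 * positive / length, 2),
    }


summary = parse_hsp_summary(
    "Identity = 156/461 (33.84%), Positives = 232/461 (50.33%), Query Frame = 1"
)
print(summary["pct_identity"], summary["pct_positive"])  # 33.84 50.33
```

The recomputed values match the percentages printed in the report, which is a quick sanity check when copying these lines into a table.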

BLAST of Cp4.1LG01g17730 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 2.9e-52
Identity = 135/409 (33.01%), Positives = 206/409 (50.37%), Query Frame = 1

Query: 53  QALSSDTHRLSLLFSSHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSD 112
           +A+     R+  + +  + S  +++P+ +G     G+Y +N+ +GTP  S   + DTGSD
Sbjct: 63  RAIKRGERRMRSINAMLQSSSGIETPVYAG----DGEYLMNVAIGTPDSSFSAIMDTGSD 122

Query: 113 LVWVKCSPCRNCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCP 172
           L+W +C PC  C   P +  F P+ SSSFS   C   +C+ LP   S  CN+      C 
Sbjct: 123 LIWTQCEPCTQCFSQP-TPIFNPQDSSSFSTLPCESQYCQDLP---SETCNNNE----CQ 182

Query: 173 FLYSYADSSLSSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARG 232
           + Y Y D S + G+ + +  TF T S       + +++FGCG    G      + NGA G
Sbjct: 183 YTYGYGDGSTTQGYMATETFTFETSS-------VPNIAFGCGEDNQG----FGQGNGA-G 242

Query: 233 VMGLGRGPISFSSQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYT 292
           ++G+G GP+S  SQLG      FSYC+  Y  S P T  L +G     +P  + S    T
Sbjct: 243 LIGMGWGPLSLPSQLGV---GQFSYCMTSYGSSSPST--LALGSAASGVPEGSPS----T 302

Query: 293 PLQINPLSPTFYYIVVKSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYE 352
            L  + L+PT+YYI ++ ITV G  L I  + + + + G GG ++DSGTTLTYL + AY 
Sbjct: 303 TLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN 362

Query: 353 EVLKAMRRRVKLPRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVE 412
            V +A   ++ LP   + S G   C     +     +P++  +   GGV     +N  + 
Sbjct: 363 AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF-DGGVLNLGEQNILIS 422

Query: 413 TEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCG 462
             EGV+CLA+    S  G S+ GN+ QQ   + +D     + F    CG
Sbjct: 423 PAEGVICLAMGS-SSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 436

BLAST of Cp4.1LG01g17730 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 8.5e-52
Identity = 130/377 (34.48%), Positives = 192/377 (50.93%), Query Frame = 1

Query: 86  GSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPPSSAFFPRHSSSFSPFH 145
           G G+Y +NL +GTP Q    + DTGSDL+W +C PC  C  +  +  F P+ SSSFS   
Sbjct: 91  GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC-FNQSTPIFNPQGSSSFSTLP 150

Query: 146 CFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSKDVTTFNTFSGTHTQTR 205
           C    C+ L  P       T  ++ C + Y Y D S + G    +  TF + S       
Sbjct: 151 CSSQLCQALSSP-------TCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVS------- 210

Query: 206 LNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGRRFGNTFSYCLMDYTLS 265
           + +++FGCG    G      + NGA G++G+GRGP+S  SQL       FSYC+     S
Sbjct: 211 IPNITFGCGENNQG----FGQGNGA-GLVGMGRGPLSLPSQLD---VTKFSYCMTPIGSS 270

Query: 266 PPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVDGVKLPINPNVW 325
            P  S L++G    S+    A   + T +Q + + PTFYYI +  ++V   +LPI+P+ +
Sbjct: 271 TP--SNLLLGSLANSV---TAGSPNTTLIQSSQI-PTFYYITLNGLSVGSTRLPIDPSAF 330

Query: 326 AID-EQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPGFDLCVNASGES 385
           A++   G GG ++DSGTTLTY    AY+ V +    ++ LP     S GFDLC     + 
Sbjct: 331 ALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDP 390

Query: 386 RMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLL 445
               +P       GG +   P+ NYF+    G++CLA+    S  G S+ GN+ QQ  L+
Sbjct: 391 SNLQIPTFVMHFDGGDL-ELPSENYFISPSNGLICLAMG--SSSQGMSIFGNIQQQNMLV 435

Query: 446 EFDRDKSRMGFSRRGCG 462
            +D   S + F+   CG
Sbjct: 451 VYDTGNSVVSFASAQCG 435

BLAST of Cp4.1LG01g17730 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 2.0e-48
Identity = 129/386 (33.42%), Positives = 195/386 (50.52%), Query Frame = 1

Query: 77  SPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPPSSAFFPR 136
           S ++SG   GSG+YFV + +G+PP+   +V D+GSD+VWV+C PC+ C +      F P 
Sbjct: 118 SDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC-YKQSDPVFDPA 177

Query: 137 HSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSKDVTTFNT 196
            S S++   C    C  + +   H          C +   Y D S + G  + +  TF  
Sbjct: 178 KSGSYTGVSCGSSVCDRIENSGCH-------SGGCRYEVMYGDGSYTKGTLALETLTF-- 237

Query: 197 FSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGRRFGNTFS 256
                 +T + +++ GCG R  G       F GA G++G+G G +SF  QL  + G  F 
Sbjct: 238 -----AKTVVRNVAMGCGHRNRG------MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFG 297

Query: 257 YCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVDGV 316
           YCL+  +     T  L+   G  +LPV      S+ PL  NP +P+FYY+ +K + V GV
Sbjct: 298 YCLV--SRGTDSTGSLVF--GREALPV----GASWVPLVRNPRAPSFYYVGLKGLGVGGV 357

Query: 317 KLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMR-RRVKLPRALQLSPGFD 376
           ++P+   V+ + E G+GG V+D+GT +T L  AAY       + +   LPRA  +S  FD
Sbjct: 358 RIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FD 417

Query: 377 LCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEE-GVMCLAIRPVDSGNGFSVI 436
            C + SG   +R +P + F    G V   PARN+ +  ++ G  C A     S  G S+I
Sbjct: 418 TCYDLSGFVSVR-VPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAF--AASPTGLSII 470

Query: 437 GNLMQQGFLLEFDRDKSRMGFSRRGC 461
           GN+ Q+G  + FD     +GF    C
Sbjct: 478 GNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Cp4.1LG01g17730 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 1.3e-47
Identity = 125/391 (31.97%), Positives = 195/391 (49.87%), Query Frame = 1

Query: 75  LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPPSSAFF 134
           L +P++SGAS GSG+YF  + +GTP + + LV DTGSD+ W++C PC +C +      F 
Sbjct: 147 LTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADC-YQQSDPVFN 206

Query: 135 PRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSKDVTTF 194
           P  SS++    C  P C LL    +  C      + C +  SY D S + G  + D  TF
Sbjct: 207 PTSSSTYKSLTCSAPQCSLL---ETSACR----SNKCLYQVSYGDGSFTVGELATDTVTF 266

Query: 195 NTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGRRFGNT 254
              SG     ++N+++ GCG    G       F GA G++GLG G +S ++Q+      +
Sbjct: 267 GN-SG-----KINNVALGCGHDNEG------LFTGAAGLLGLGGGVLSITNQMK---ATS 326

Query: 255 FSYCLMDYTLSPPPT---SYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSI 314
           FSYCL+D       +   + + +GGG  + P+    KI            TFYY+ +   
Sbjct: 327 FSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKID-----------TFYYVGLSGF 386

Query: 315 TVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKA-MRRRVKLPRALQL 374
           +V G K+ +   ++ +D  G+GG ++D GT +T L   AY  +  A ++  V L +    
Sbjct: 387 SVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSS 446

Query: 375 SPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEE-GVMCLAIRPVDSGN 434
              FD C + S  S ++ +P + F   GG     PA+NY +  ++ G  C A  P  S  
Sbjct: 447 ISLFDTCYDFSSLSTVK-VPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS-- 500

Query: 435 GFSVIGNLMQQGFLLEFDRDKSRMGFSRRGC 461
             S+IGN+ QQG  + +D  K+ +G S   C
Sbjct: 507 SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Cp4.1LG01g17730 vs. TrEMBL
Match: A0A0A0KNH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 3.7e-211
Identity = 368/455 (80.88%), Positives = 402/455 (88.35%), Query Frame = 1

Query: 12  FFLLLLLLLLLLSSLTDPSN---AIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSS 71
           FFLL+LL    L+ L +P+    A P+ +LK PLLH  PFSSPSQ+LSSDTHRLSLLFS 
Sbjct: 9   FFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS- 68

Query: 72  HRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHP 131
            R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLVADTGSDLVWVKCS CRNCSHHP
Sbjct: 69  -RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHP 128

Query: 132 PSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFS 191
           PSSAF PRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SLSSGFFS
Sbjct: 129 PSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFS 188

Query: 192 KDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLG 251
           K+ TT  + SG+  +  L  LSFGCGFRISGPSVSGA+FNGARGVMGLGRG ISFSSQLG
Sbjct: 189 KETTTLKSLSGS--EIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 248

Query: 252 RRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVV 311
           RRFGN FSYCLMDYTLSPPPTS+LMIGGGL SLP+TNA+KISYTPLQINPLSPTFYYI +
Sbjct: 249 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITI 308

Query: 312 KSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRAL 371
            SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRVKLP A 
Sbjct: 309 HSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 368

Query: 372 QLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSG 431
           +L+PGFDLCVNASGESR  SLP+LRFR+GGG VFAPP RNYF+ETEEGVMCLAIR V+SG
Sbjct: 369 ELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESG 428

Query: 432 NGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           NGFSVIGNLMQQGFLLEFD+++SR+GF+RRGCGLP
Sbjct: 429 NGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of Cp4.1LG01g17730 vs. TrEMBL
Match: A0A067K2U7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1)

HSP 1 Score: 586.3 bits (1510), Expect = 3.4e-164
Identity = 296/454 (65.20%), Positives = 356/454 (78.41%), Query Frame = 1

Query: 13  FLLLLLLLLLLSSL---TDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSH 72
           FLLLLLL  L  S+   T  ++    +YLK PLLH  PF SP+QAL  D  RLSLL   H
Sbjct: 9   FLLLLLLTDLCYSISLRTTVNSTATKEYLKLPLLHRTPFKSPAQALPFDIRRLSLL---H 68

Query: 73  RHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPP 132
           R   +LKSP+ISGASTGSGQYFV+L LG+P Q+LLLVADTGSDLVWVKCS C+NCS++ P
Sbjct: 69  RQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLVWVKCSACKNCSNYSP 128

Query: 133 SSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSK 192
            SAF  RHSS+FS  HCF+  CRL+PHP  + CN T LHSPC + YSYAD S +SGFFSK
Sbjct: 129 GSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYEYSYADGSSTSGFFSK 188

Query: 193 DVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGR 252
           + TT NT +G   + +L +L+FGCGFRISGPS++GA F GA GV+GLGR PISFSSQLGR
Sbjct: 189 ETTTLNTSAGR--EKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGLGRAPISFSSQLGR 248

Query: 253 RFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVK 312
           RFGN FSYCLMDYTLSPPPTSYLMIGG   S  V+    +++TPL +N LSPTFYYI +K
Sbjct: 249 RFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSA-VSRKRILNFTPLLVNSLSPTFYYIGIK 308

Query: 313 SITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQ 372
           S++VDGVKLPINP+VW+ID+ GNGGT++DSGTTLT+L E AY E+L A++RRVKLP   +
Sbjct: 309 SVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSAIKRRVKLPGPGE 368

Query: 373 LSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 432
           L+PGFDLCVN SG  R    P++   + G  VF+PP RNYF++T EGV CLAI+PV+SG+
Sbjct: 369 LTPGFDLCVNVSG-VRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVKCLAIQPVNSGS 428

Query: 433 GFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           GFSVIGNLMQQG+LLEFDRD+SR+GF+R GC LP
Sbjct: 429 GFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455

BLAST of Cp4.1LG01g17730 vs. TrEMBL
Match: B9SEI2_RICCO (Basic 7S globulin 2 small subunit, putative OS=Ricinus communis GN=RCOM_0705030 PE=3 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 8.3e-163
Identity = 285/459 (62.09%), Positives = 356/459 (77.56%), Query Frame = 1

Query: 10  LTFFLLLLLLLLLLSSLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSH 69
           L FF  LL+ L   SS    +    ++YLK PLLH  PF+SPS+AL+ D +R   L   H
Sbjct: 4   LLFFFFLLITLCPSSSAAANTT---TEYLKLPLLHKTPFTSPSEALAFDINRRLSLLHHH 63

Query: 70  RHSP-----TLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNC 129
           RH       + +SP+ISGAS+GSGQYFV+L +GTPPQ+LLLVADTGSDL+WVKCSPCRNC
Sbjct: 64  RHQQQHKQNSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNC 123

Query: 130 SHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSS 189
           SH  P SAFF RHS+++S  HC+ P C+L+PHP  + CN T LHSPC + Y+YADSS ++
Sbjct: 124 SHRSPGSAFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTT 183

Query: 190 GFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFS 249
           GFFSK+  T NT +G     +LN LSFGCGFRISGPS++GA F GA+GVMGLGR PISFS
Sbjct: 184 GFFSKEALTLNTSTG--KVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFS 243

Query: 250 SQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFY 309
           SQLGRRFG+ FSYCLMDYTLSPPPTS+L IGG  +++ V+    +S+TPL INPLSPTFY
Sbjct: 244 SQLGRRFGSKFSYCLMDYTLSPPPTSFLTIGGA-QNVAVSKKGIMSFTPLLINPLSPTFY 303

Query: 310 YIVVKSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKL 369
           YI +K + V+GVKLPINP+VW+ID+ GNGGT++DSGTTLT++ E AY E+LKA ++RVKL
Sbjct: 304 YIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKL 363

Query: 370 PRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRP 429
           P   + +PGFDLC+N SG +R  +LP++ F + GG VF+PP RNYF+ET + + CLA++P
Sbjct: 364 PSPAEPTPGFDLCMNVSGVTR-PALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQP 423

Query: 430 VDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           V    GFSV+GNLMQQGFLLEFDRDKSR+GF+RRGC LP
Sbjct: 424 VSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455

BLAST of Cp4.1LG01g17730 vs. TrEMBL
Match: F6HF17_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=1)

HSP 1 Score: 578.9 bits (1491), Expect = 5.3e-162
Identity = 290/460 (63.04%), Positives = 351/460 (76.30%), Query Frame = 1

Query: 10  LTFFLLLLLLLLLLSSLTDPSNAIPS------QYLKFPLLHTNPFSSPSQALSSDTHRLS 69
           L F  L  LLLLL+   TD  NA+P       +YLK  LLH  PF++PSQALS D+HRLS
Sbjct: 3   LPFSSLFSLLLLLIFFFTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLS 62

Query: 70  LLFSSHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRN 129
             FS+     +LKSP++SGASTGSGQYFV+L LGTPPQ LLLVADTGSDLVWVKCS CRN
Sbjct: 63  FFFSALHTPQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRN 122

Query: 130 CSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLS 189
           C+ H P SAF  RHS++FSP HC+D  C+L+P P  HRCNH  LHSPC + YSY D S +
Sbjct: 123 CTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKT 182

Query: 190 SGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISF 249
           SGFFSK+ TT NT SG   + +L  ++FGC FRISGPSVSGA FNGA GVMGLGRGPIS 
Sbjct: 183 SGFFSKETTTLNTSSG--REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISL 242

Query: 250 SSQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTF 309
           SSQLG RFGN FSYCLMD+ +SP PTSYL+IG     +      ++ +TPL INPLSPTF
Sbjct: 243 SSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDV-APGKRRMRFTPLHINPLSPTF 302

Query: 310 YYIVVKSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVK 369
           YYI ++S++VDG+KLPINP+VWA+DE GNGGT+VDSGTTLT+L E AY ++L  ++RRV+
Sbjct: 303 YYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVR 362

Query: 370 LPRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIR 429
           LP   + +PGFDLCVN S E     LP+L F++GG  VF+PP RNYFV+T+E V CLA++
Sbjct: 363 LPSPAEPTPGFDLCVNVS-EIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQ 422

Query: 430 PVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
            V + +GFSVIGNLMQQGFLLEFD+D++R+GFSR GC LP
Sbjct: 423 AVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458

BLAST of Cp4.1LG01g17730 vs. TrEMBL
Match: Q9LI73_ARATH (Aspartyl protease family protein OS=Arabidopsis thaliana GN=At3g25700 PE=2 SV=1)

HSP 1 Score: 570.9 bits (1470), Expect = 1.5e-159
Identity = 288/456 (63.16%), Positives = 347/456 (76.10%), Query Frame = 1

Query: 10  LTFFLLLLLLLLLLSSLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSH 69
           L FFL   L L LL      + +  ++YLK PLL  +PF SP+QAL+ DT RL  L    
Sbjct: 4   LIFFLCSFLSLFLLPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRR 63

Query: 70  RHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPP 129
           +  P +KSP++SGA++GSGQYFV+L +G PPQSLLL+ADTGSDLVWVKCS CRNCSHH P
Sbjct: 64  KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 123

Query: 130 SSAFFPRHSSSFSPFHCFDPHCRLLPHPP-SHRCNHTHLHSPCPFLYSYADSSLSSGFFS 189
           ++ FFPRHSS+FSP HC+DP CRL+P P  +  CNHT +HS C + Y YAD SL+SG F+
Sbjct: 124 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFA 183

Query: 190 KDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLG 249
           ++ T+  T SG   + RL  ++FGCGFRISG SVSG  FNGA GVMGLGRGPISF+SQLG
Sbjct: 184 RETTSLKTSSG--KEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLG 243

Query: 250 RRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVV 309
           RRFGN FSYCLMDYTLSPPPTSYL+IG G   +     SK+ +TPL  NPLSPTFYY+ +
Sbjct: 244 RRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKL 303

Query: 310 KSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRAL 369
           KS+ V+G KL I+P++W ID+ GNGGTVVDSGTTL +LAE AY  V+ A+RRRVKLP A 
Sbjct: 304 KSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 363

Query: 370 QLSPGFDLCVNASGESR-MRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDS 429
            L+PGFDLCVN SG ++  + LP+L+F   GG VF PP RNYF+ETEE + CLAI+ VD 
Sbjct: 364 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDP 423

Query: 430 GNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
             GFSVIGNLMQQGFL EFDRD+SR+GFSRRGC LP
Sbjct: 424 KVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of Cp4.1LG01g17730 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 570.9 bits (1470), Expect = 7.4e-163
Identity = 288/456 (63.16%), Positives = 347/456 (76.10%), Query Frame = 1

Query: 10  LTFFLLLLLLLLLLSSLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSH 69
           L FFL   L L LL      + +  ++YLK PLL  +PF SP+QAL+ DT RL  L    
Sbjct: 4   LIFFLCSFLSLFLLPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRR 63

Query: 70  RHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPP 129
           +  P +KSP++SGA++GSGQYFV+L +G PPQSLLL+ADTGSDLVWVKCS CRNCSHH P
Sbjct: 64  KPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSP 123

Query: 130 SSAFFPRHSSSFSPFHCFDPHCRLLPHPP-SHRCNHTHLHSPCPFLYSYADSSLSSGFFS 189
           ++ FFPRHSS+FSP HC+DP CRL+P P  +  CNHT +HS C + Y YAD SL+SG F+
Sbjct: 124 ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFA 183

Query: 190 KDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLG 249
           ++ T+  T SG   + RL  ++FGCGFRISG SVSG  FNGA GVMGLGRGPISF+SQLG
Sbjct: 184 RETTSLKTSSG--KEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLG 243

Query: 250 RRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVV 309
           RRFGN FSYCLMDYTLSPPPTSYL+IG G   +     SK+ +TPL  NPLSPTFYY+ +
Sbjct: 244 RRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKL 303

Query: 310 KSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRAL 369
           KS+ V+G KL I+P++W ID+ GNGGTVVDSGTTL +LAE AY  V+ A+RRRVKLP A 
Sbjct: 304 KSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD 363

Query: 370 QLSPGFDLCVNASGESR-MRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDS 429
            L+PGFDLCVN SG ++  + LP+L+F   GG VF PP RNYF+ETEE + CLAI+ VD 
Sbjct: 364 ALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDP 423

Query: 430 GNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
             GFSVIGNLMQQGFL EFDRD+SR+GFSRRGC LP
Sbjct: 424 KVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of Cp4.1LG01g17730 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 246.5 bits (628), Expect = 3.2e-65
Identity = 156/408 (38.24%), Positives = 225/408 (55.15%), Query Frame = 1

Query: 59  THRLSLLFSSHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKC 118
           T  +SL+ +       L + L SG + GSG+YF+++ +GTPP+   L+ DTGSDL W++C
Sbjct: 129 TSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC 188

Query: 119 SPCRNCSHHPPSSAFF-PRHSSSFSPFHCFDPHCRLLPHP-PSHRCNHTHLHSPCPFLYS 178
            PC +C H   +  F+ P+ S+SF    C DP C L+  P P  +C   +    CP+ Y 
Sbjct: 189 LPCYDCFHQ--NGMFYDPKTSASFKNITCNDPRCSLISSPDPPVQCESDN--QSCPYFYW 248

Query: 179 YADSSLSSGFFSKDVTTFN--TFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVM 238
           Y D S ++G F+ +  T N  T  G  ++ ++ ++ FGCG    G       F+GA G++
Sbjct: 249 YGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRG------LFSGASGLL 308

Query: 239 GLGRGPISFSSQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPL 298
           GLGRGP+SFSSQL   +G++FSYCL+D   +   +S L+ G     L  TN +  S+   
Sbjct: 309 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNG 368

Query: 299 QINPLSPTFYYIVVKSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEV 358
           + N +  TFYYI +KSI V G  L I    W I   G+GGT++DSGTTL+Y AE AYE +
Sbjct: 369 KENSVE-TFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEII 428

Query: 359 LKAMRRRVKLPRAL-QLSPGFDLCVNASG-ESRMRSLPQLRFRVGGGGVFAPPARNYFVE 418
                 ++K    + +  P  D C N SG E     LP+L      G V+  PA N F+ 
Sbjct: 429 KNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIW 488

Query: 419 TEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGC 461
             E ++CLAI        FS+IGN  QQ F + +D  +SR+GF+   C
Sbjct: 489 LSEDLVCLAILGTPKST-FSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of Cp4.1LG01g17730 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 233.4 bits (594), Expect = 2.8e-61
Identity = 146/391 (37.34%), Positives = 213/391 (54.48%), Query Frame = 1

Query: 75  LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPPSSAFF 134
           L + L SG + GSG+YF+++ +G+PP+   L+ DTGSDL W++C PC +C     + AF+
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQ--NGAFY 214

Query: 135 -PRHSSSFSPFHCFDPHCRLLPHP-PSHRCNHTHLHSPCPFLYSYADSSLSSGFFSKDVT 194
            P+ S+S+    C D  C L+  P P   C   +    CP+ Y Y DSS ++G F+ +  
Sbjct: 215 DPKASASYKNITCNDQRCNLVSSPDPPMPCKSDN--QSCPYYYWYGDSSNTTGDFAVETF 274

Query: 195 TFN--TFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGRR 254
           T N  T  G+     + ++ FGCG    G       F+GA G++GLGRGP+SFSSQL   
Sbjct: 275 TVNLTTNGGSSELYNVENMMFGCGHWNRG------LFHGAAGLLGLGRGPLSFSSQLQSL 334

Query: 255 FGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKS 314
           +G++FSYCL+D       +S L+ G     L   N +  S+   + N L  TFYY+ +KS
Sbjct: 335 YGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKEN-LVDTFYYVQIKS 394

Query: 315 ITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQL 374
           I V G  L I    W I   G GGT++DSGTTL+Y AE AYE +   +  + K    +  
Sbjct: 395 ILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYR 454

Query: 375 S-PGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 434
             P  D C N SG   ++ LP+L      G V+  P  N F+   E ++CLA+      +
Sbjct: 455 DFPILDPCFNVSGIHNVQ-LPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPK-S 514

Query: 435 GFSVIGNLMQQGFLLEFDRDKSRMGFSRRGC 461
            FS+IGN  QQ F + +D  +SR+G++   C
Sbjct: 515 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532

BLAST of Cp4.1LG01g17730 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 223.0 bits (567), Expect = 3.8e-58
Identity = 165/465 (35.48%), Positives = 238/465 (51.18%), Query Frame = 1

Query: 25  SLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSHR-------------H 84
           SLTD S +  +  L   L H +  SS S A  +D   L L   S R              
Sbjct: 48  SLTDESLSESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGR 107

Query: 85  SPTLKSP---------LISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCR 144
           + T ++P         +ISG S GSG+YF+ L +GTP  ++ +V DTGSD+VW++CSPC+
Sbjct: 108 NATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCK 167

Query: 145 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSL 204
            C ++   + F P+ S +F+   C    CR L    S  C  T     C +  SY D S 
Sbjct: 168 AC-YNQTDAIFDPKKSKTFATVPCGSRLCRRL--DDSSEC-VTRRSKTCLYQVSYGDGSF 227

Query: 205 SSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPIS 264
           + G FS +  TF+         R++ +  GCG    G       F GA G++GLGRG +S
Sbjct: 228 TEGDFSTETLTFH-------GARVDHVPLGCGHDNEG------LFVGAAGLLGLGRGGLS 287

Query: 265 FSSQLGRRFGNTFSYCLMDYT----LSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINP 324
           F SQ   R+   FSYCL+D T     S PP++ +    G  ++P T+     +TPL  NP
Sbjct: 288 FPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVF---GNAAVPKTSV----FTPLLTNP 347

Query: 325 LSPTFYYIVVKSITVDGVKLP-INPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKA 384
              TFYY+ +  I+V G ++P ++ + + +D  GNGG ++DSGT++T L + AY  +  A
Sbjct: 348 KLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDA 407

Query: 385 MR-RRVKLPRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETE-E 444
            R    KL RA   S  FD C + SG + ++ +P + F  GGG V + PA NY +    E
Sbjct: 408 FRLGATKLKRAPSYSL-FDTCFDLSGMTTVK-VPTVVFHFGGGEV-SLPASNYLIPVNTE 467

Query: 445 GVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGC 461
           G  C A     +    S+IGN+ QQGF + +D   SR+GF  R C
Sbjct: 468 GRFCFAF--AGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Cp4.1LG01g17730 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 219.2 bits (557), Expect = 5.4e-57
Identity = 156/461 (33.84%), Positives = 232/461 (50.33%), Query Frame = 1

Query: 21  LLLSSLTDPSNAIPSQYLKFPLLHTNPFSS---PSQALSS----DTHRLSLLFS------ 80
           LL S     S++  S  +   L H +  SS   P +  SS    D+ R+  + +      
Sbjct: 55  LLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIP 114

Query: 81  --SHRHSPT---LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCR 140
             +  H+P      S ++SG S GSG+YF  L +GTP + + +V DTGSD+VW++C+PCR
Sbjct: 115 GRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCR 174

Query: 141 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSL 200
            C +      F PR S +++   C  PHCR L    S  CN       C +  SY D S 
Sbjct: 175 RC-YSQSDPIFDPRKSKTYATIPCSSPHCRRL---DSAGCNTR--RKTCLYQVSYGDGSF 234

Query: 201 SSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPIS 260
           + G FS +  TF        + R+  ++ GCG    G       F GA G++GLG+G +S
Sbjct: 235 TVGDFSTETLTFR-------RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLS 294

Query: 261 FSSQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPT 320
           F  Q G RF   FSYCL+D + S  P+S +     +  +         +TPL  NP   T
Sbjct: 295 FPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDT 354

Query: 321 FYYIVVKSITVDGVKLP-INPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRR 380
           FYY+ +  I+V G ++P +  +++ +D+ GNGG ++DSGT++T L   AY  +  A R  
Sbjct: 355 FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVG 414

Query: 381 VK-LPRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETE-EGVMC 440
            K L RA   S  FD C + S  + ++ +P +     G  V + PA NY +  +  G  C
Sbjct: 415 AKTLKRAPDFSL-FDTCFDLSNMNEVK-VPTVVLHFRGADV-SLPATNYLIPVDTNGKFC 474

Query: 441 LAIRPVDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGC 461
            A     +  G S+IGN+ QQGF + +D   SR+GF+  GC
Sbjct: 475 FAF--AGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Cp4.1LG01g17730 vs. NCBI nr
Match: gi|449451908|ref|XP_004143702.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus])

HSP 1 Score: 742.3 bits (1915), Expect = 5.2e-211
Identity = 368/455 (80.88%), Positives = 402/455 (88.35%), Query Frame = 1

Query: 12  FFLLLLLLLLLLSSLTDPSN---AIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSS 71
           FFLL+LL    L+ L +P+    A P+ +LK PLLH  PFSSPSQ+LSSDTHRLSLLFS 
Sbjct: 9   FFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS- 68

Query: 72  HRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHP 131
            R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLVADTGSDLVWVKCS CRNCSHHP
Sbjct: 69  -RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHP 128

Query: 132 PSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFS 191
           PSSAF PRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SLSSGFFS
Sbjct: 129 PSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFS 188

Query: 192 KDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLG 251
           K+ TT  + SG+  +  L  LSFGCGFRISGPSVSGA+FNGARGVMGLGRG ISFSSQLG
Sbjct: 189 KETTTLKSLSGS--EIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 248

Query: 252 RRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVV 311
           RRFGN FSYCLMDYTLSPPPTS+LMIGGGL SLP+TNA+KISYTPLQINPLSPTFYYI +
Sbjct: 249 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITI 308

Query: 312 KSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRAL 371
            SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRVKLP A 
Sbjct: 309 HSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 368

Query: 372 QLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSG 431
           +L+PGFDLCVNASGESR  SLP+LRFR+GGG VFAPP RNYF+ETEEGVMCLAIR V+SG
Sbjct: 369 ELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESG 428

Query: 432 NGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           NGFSVIGNLMQQGFLLEFD+++SR+GF+RRGCGLP
Sbjct: 429 NGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of Cp4.1LG01g17730 vs. NCBI nr
Match: gi|659073000|ref|XP_008467208.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 741.1 bits (1912), Expect = 1.2e-210
Identity = 368/455 (80.88%), Positives = 401/455 (88.13%), Query Frame = 1

Query: 12  FFLLLLLLLLLLSSLTDPSN---AIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSS 71
           FFLL+ L    L+ L++P+    A  + +LK PLLH  PFSSPSQ+LSSDTHRLSLLFS 
Sbjct: 9   FFLLIPLFFFFLTHLSNPNATAVAAAADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS- 68

Query: 72  HRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHP 131
            R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLVADTGSDLVWVKCS CRNCSHHP
Sbjct: 69  -RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHP 128

Query: 132 PSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFS 191
           PSSAFFPRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SLSSGFFS
Sbjct: 129 PSSAFFPRHSSSFSPFHCFDPHCRLLPHAPPHHCNHTLLHSPCRFLYSYADGSLSSGFFS 188

Query: 192 KDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLG 251
           K+ TT  T SG+  +  L  LSFGCGFRISGPSVSGA+FNGARGVMGLGRG ISFSSQLG
Sbjct: 189 KETTTLKTLSGS--EIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLG 248

Query: 252 RRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVV 311
           RRFGN FSYCLMDYTLSPPPTS+LMIGGGL SLPV NA+KISYTPLQINPLSPTFYYI +
Sbjct: 249 RRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPVNNATKISYTPLQINPLSPTFYYITI 308

Query: 312 KSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRAL 371
            SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRVKLP A 
Sbjct: 309 NSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAA 368

Query: 372 QLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSG 431
           +L+PGFDLCVNASGESR  SLP+LRFR+GGG VFAPP RNYF+ETEEGVMCLAIR V+SG
Sbjct: 369 ELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESG 428

Query: 432 NGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           NGFSVIGNLMQQGFLLEFD+++SR+GF+RRGCGLP
Sbjct: 429 NGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of Cp4.1LG01g17730 vs. NCBI nr
Match: gi|802680767|ref|XP_012082020.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas])

HSP 1 Score: 586.3 bits (1510), Expect = 4.8e-164
Identity = 296/454 (65.20%), Positives = 356/454 (78.41%), Query Frame = 1

Query: 13  FLLLLLLLLLLSSL---TDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSH 72
           FLLLLLL  L  S+   T  ++    +YLK PLLH  PF SP+QAL  D  RLSLL   H
Sbjct: 9   FLLLLLLTDLCYSISLRTTVNSTATKEYLKLPLLHRTPFKSPAQALPFDIRRLSLL---H 68

Query: 73  RHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPP 132
           R   +LKSP+ISGASTGSGQYFV+L LG+P Q+LLLVADTGSDLVWVKCS C+NCS++ P
Sbjct: 69  RQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLVWVKCSACKNCSNYSP 128

Query: 133 SSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSK 192
            SAF  RHSS+FS  HCF+  CRL+PHP  + CN T LHSPC + YSYAD S +SGFFSK
Sbjct: 129 GSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYEYSYADGSSTSGFFSK 188

Query: 193 DVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGR 252
           + TT NT +G   + +L +L+FGCGFRISGPS++GA F GA GV+GLGR PISFSSQLGR
Sbjct: 189 ETTTLNTSAGR--EKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGLGRAPISFSSQLGR 248

Query: 253 RFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVK 312
           RFGN FSYCLMDYTLSPPPTSYLMIGG   S  V+    +++TPL +N LSPTFYYI +K
Sbjct: 249 RFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSA-VSRKRILNFTPLLVNSLSPTFYYIGIK 308

Query: 313 SITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQ 372
           S++VDGVKLPINP+VW+ID+ GNGGT++DSGTTLT+L E AY E+L A++RRVKLP   +
Sbjct: 309 SVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSAIKRRVKLPGPGE 368

Query: 373 LSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 432
           L+PGFDLCVN SG  R    P++   + G  VF+PP RNYF++T EGV CLAI+PV+SG+
Sbjct: 369 LTPGFDLCVNVSG-VRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVKCLAIQPVNSGS 428

Query: 433 GFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           GFSVIGNLMQQG+LLEFDRD+SR+GF+R GC LP
Sbjct: 429 GFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455

BLAST of Cp4.1LG01g17730 vs. NCBI nr
Match: gi|1009178127|ref|XP_015870354.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Ziziphus jujuba])

HSP 1 Score: 583.6 bits (1503), Expect = 3.1e-163
Identity = 293/449 (65.26%), Positives = 348/449 (77.51%), Query Frame = 1

Query: 15  LLLLLLLLLSSLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSHRHSPT 74
           L  L L +  ++ DP N +   YLK PLLH  P +SPS+ L  DT+RLS+L    R   +
Sbjct: 9   LFSLFLFVFVNVCDPQN-VTVPYLKLPLLHKTPLASPSKLLYYDTYRLSVL----RGRQS 68

Query: 75  LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHPPSSAFF 134
            KSP++SGASTGSGQYFV+LHLGTPPQ LLLVADTGSDLVWV+CS C+NC++H P SAF 
Sbjct: 69  FKSPVVSGASTGSGQYFVDLHLGTPPQRLLLVADTGSDLVWVRCSSCKNCTNHDPRSAFL 128

Query: 135 PRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSSGFFSKDVTTF 194
            RHSS+FSP HC++  CRL+PHP  + CN T LHSPC + YSYAD S++SGFFSK+ TT 
Sbjct: 129 ARHSSTFSPHHCYESSCRLVPHPKPNPCNRTRLHSPCRYEYSYADGSVTSGFFSKETTTL 188

Query: 195 NTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFSSQLGRRFGNT 254
           NT SG   + +L  L+FGCGFR SGPSVSG  FNGA GVMGLGRGPISFSSQLGRR GN 
Sbjct: 189 NTSSG--KEAKLRSLAFGCGFRNSGPSVSGPSFNGANGVMGLGRGPISFSSQLGRRLGNK 248

Query: 255 FSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVD 314
           FSYCLMDYTL+P PTSYLMIGG    + V+  S++S+TPLQ N  SPTFYYI +KS  V 
Sbjct: 249 FSYCLMDYTLAPTPTSYLMIGGEQNEV-VSKGSRMSFTPLQTNRFSPTFYYIGIKSAFVG 308

Query: 315 GVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPGF 374
           G KLPI+P VW+IDE GNGGTV+DSGTTLT+L E AY  VL A RR+VK+P A +L+PGF
Sbjct: 309 GAKLPISPTVWSIDESGNGGTVIDSGTTLTFLPELAYRVVLAAFRRQVKIPSAAELTPGF 368

Query: 375 DLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFSVI 434
           DLC+N SG SR  SLP+L F++ G  VFAPP RNYF++T +GV CLAI+PV+SG G SVI
Sbjct: 369 DLCLNVSGVSR-PSLPRLSFKLVGNSVFAPPPRNYFIDTADGVKCLAIQPVNSGEGISVI 428

Query: 435 GNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           GNLMQQGFLL FD+DK R+GFSRRGC +P
Sbjct: 429 GNLMQQGFLLVFDKDKLRLGFSRRGCAVP 448

BLAST of Cp4.1LG01g17730 vs. NCBI nr
Match: gi|255566835|ref|XP_002524401.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis])

HSP 1 Score: 581.6 bits (1498), Expect = 1.2e-162
Identity = 285/459 (62.09%), Positives = 356/459 (77.56%), Query Frame = 1

Query: 10  LTFFLLLLLLLLLLSSLTDPSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSSH 69
           L FF  LL+ L   SS    +    ++YLK PLLH  PF+SPS+AL+ D +R   L   H
Sbjct: 4   LLFFFFLLITLCPSSSAAANTT---TEYLKLPLLHKTPFTSPSEALAFDINRRLSLLHHH 63

Query: 70  RHSP-----TLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVADTGSDLVWVKCSPCRNC 129
           RH       + +SP+ISGAS+GSGQYFV+L +GTPPQ+LLLVADTGSDL+WVKCSPCRNC
Sbjct: 64  RHQQQHKQNSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNC 123

Query: 130 SHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCPFLYSYADSSLSS 189
           SH  P SAFF RHS+++S  HC+ P C+L+PHP  + CN T LHSPC + Y+YADSS ++
Sbjct: 124 SHRSPGSAFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTT 183

Query: 190 GFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFNGARGVMGLGRGPISFS 249
           GFFSK+  T NT +G     +LN LSFGCGFRISGPS++GA F GA+GVMGLGR PISFS
Sbjct: 184 GFFSKEALTLNTSTG--KVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFS 243

Query: 250 SQLGRRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFY 309
           SQLGRRFG+ FSYCLMDYTLSPPPTS+L IGG  +++ V+    +S+TPL INPLSPTFY
Sbjct: 244 SQLGRRFGSKFSYCLMDYTLSPPPTSFLTIGGA-QNVAVSKKGIMSFTPLLINPLSPTFY 303

Query: 310 YIVVKSITVDGVKLPINPNVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKL 369
           YI +K + V+GVKLPINP+VW+ID+ GNGGT++DSGTTLT++ E AY E+LKA ++RVKL
Sbjct: 304 YIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKL 363

Query: 370 PRALQLSPGFDLCVNASGESRMRSLPQLRFRVGGGGVFAPPARNYFVETEEGVMCLAIRP 429
           P   + +PGFDLC+N SG +R  +LP++ F + GG VF+PP RNYF+ET + + CLA++P
Sbjct: 364 PSPAEPTPGFDLCMNVSGVTR-PALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQP 423

Query: 430 VDSGNGFSVIGNLMQQGFLLEFDRDKSRMGFSRRGCGLP 464
           V    GFSV+GNLMQQGFLLEFDRDKSR+GF+RRGC LP
Sbjct: 424 VSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455
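
The percentages in each HSP summary line above follow directly from the residue counts (for example, 156 identical positions out of 461 aligned gives 33.84%). A minimal Python sketch of that arithmetic is shown below; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration, and the pattern assumes the summary-line layout used on this page (it also accepts the "Postives" spelling that some exports of this page contain).

```python
import re

# Pattern for the HSP summary line as it appears on this page (an assumption
# about layout, not a general BLAST parser). "Posi?tives" matches both
# "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract residue counts from one HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, ident_pct, positives, pos_len, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positives_pct": round(100 * int(positives) / int(pos_len), 2),
        # Percentages as printed in the line, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 156/461 (33.84%), Positives = 232/461 (50.33%), Query Frame = 1"
    print(parse_hsp_stats(line))
```

Comparing the recomputed and reported values is a quick consistency check when copying hit statistics from a page like this into a spreadsheet or script.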

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
APF2_ARATH    9.7e-56   33.84         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP2_NEPGR    2.9e-52   33.01         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
NEP1_NEPGR    8.5e-52   34.48         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
ASPG2_ARATH   2.0e-48   33.42         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
ASPG1_ARATH   1.3e-47   31.97         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
Match Name         E-value    Identity (%)  Description
A0A0A0KNH6_CUCSA   3.7e-211   80.88         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1
A0A067K2U7_JATCU   3.4e-164   65.20         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1
B9SEI2_RICCO       8.3e-163   62.09         Basic 7S globulin 2 small subunit, putative OS=Ricinus communis GN=RCOM_0705030 ...
F6HF17_VITVI       5.3e-162   63.04         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=...
Q9LI73_ARATH       1.5e-159   63.16         Aspartyl protease family protein OS=Arabidopsis thaliana GN=At3g25700 PE=2 SV=1
Match Name    E-value    Identity (%)  Description
AT3G25700.1   7.4e-163   63.16         Eukaryotic aspartyl protease family protein
AT2G42980.1   3.2e-65    38.24         Eukaryotic aspartyl protease family protein
AT3G59080.1   2.8e-61    37.34         Eukaryotic aspartyl protease family protein
AT3G61820.1   3.8e-58    35.48         Eukaryotic aspartyl protease family protein
AT1G01300.1   5.4e-57    33.84         Eukaryotic aspartyl protease family protein
Match Name                          E-value    Identity (%)  Description
gi|449451908|ref|XP_004143702.1|    5.2e-211   80.88         PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus]
gi|659073000|ref|XP_008467208.1|    1.2e-210   80.88         PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]
gi|802680767|ref|XP_012082020.1|    4.8e-164   65.20         PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas]
gi|1009178127|ref|XP_015870354.1|   3.1e-163   65.26         PREDICTED: aspartic proteinase nepenthesin-2 [Ziziphus jujuba]
gi|255566835|ref|XP_002524401.1|    1.2e-162   62.09         PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term         Definition
GO:0006508   proteolysis
Vocabulary: Molecular Function
Term         Definition
GO:0004190   aspartic-type endopeptidase activity
Vocabulary: INTERPRO
Term         Definition
IPR021109    Peptidase_aspartic_dom_sf
IPR001969    Aspartic_peptidase_AS
IPR001461    Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g17730.1   Cp4.1LG01g17730.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                   Source    Source Term        Source Description                Alignment
IPR001461   Aspartic peptidase A1 family      PRINTS    PR00792            PEPSIN                            coord: 432..447 score: 1.8E-7; coord: 96..116 score: 1.8E-7; coord: 335..346 score: 1.
IPR001461   Aspartic peptidase A1 family      PANTHER   PTHR13683          ASPARTYL PROTEASES                coord: 7..461 score: 3.4E
IPR001969   Aspartic peptidase, active site   PROSITE   PS00141            ASP_PROTEASE                      coord: 105..116 scor
IPR021109   Aspartic peptidase domain         GENE3D    G3DSA:2.40.70.10                                     coord: 78..275 score: 1.4E-38; coord: 285..460 score: 9.1
IPR021109   Aspartic peptidase domain         unknown   SSF50630           Acid proteases                    coord: 84..460 score: 8.67
None        No IPR available                  PANTHER   PTHR13683:SF354    ASPARTYL PROTEASE FAMILY PROTEIN  coord: 7..461 score: 3.4E

The following gene(s) are orthologous to this gene:
Gene              Orthologue       Organism                    Block
Cp4.1LG01g17730   CmoCh04G021150   Cucurbita moschata (Rifu)   cmocpeB685
Cp4.1LG01g17730   Carg27159        Silver-seed gourd           carcpeB0708
The following gene(s) are paralogous to this gene:

None