Cp4.1LG05g10050 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG05g10050
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: aspartic proteinase NANA, chloroplast-like
Location: Cp4.1LG05: 6577056 .. 6578803 (-)
RNA-Seq Expression: Cp4.1LG05g10050
Synteny: Cp4.1LG05g10050
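
The span covered by the Location field can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (as in GFF-style annotation); the page itself does not state the convention, and the function name is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field; "(-)" marks the reverse strand
# and does not change the length of the span.
gene_length = feature_length(6577056, 6578803)
print(gene_length)  # 1748
```

Under a 0-based, half-open convention the same numbers would give 1747 bp instead, so the convention matters when comparing against the sequence lengths listed below.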
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCCGCCATTCTCATCTTCCTCCTCGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGCCGATTTCTCATTCTTTAATCCTTTTCTTCGTCTTCGTCTTCTTCTCTCCTATCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCTCAAACAAGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTAGATCTGATACACCGTCACCATCCGGAAGTGGTTAAAAGGCTTGATGACGAAATTAAGGTGGATAGTGTCGAGGATCGCATCAGGGATATTCGCTATCACGATCAAAACCGTCTCCGATCCATATCCGCCAAGCTGAATTGGACAAAAGTTGTGGAGAATGCGGAGGAGAAAGAGAAGGAGGTCTCGGGTTCGAATCTACCTCCACAGTCGCAGACGCCAATAGGATTGAAAACATACCCCGGCGCTGATTTCGGTAGCGGTGAATTTTTCGTGCAATTGAAAGTCGGAACACCGCCGCAGACGTTCACACTGATTGCAGATACCGGAAGTGACCTATTGTGGACGAAATGCAGATTCCGGCGGTGCAGGGGAGATTGCAGCAACCTCTCTCCGATGCATAAGATGCGTAACAAAATGAGAGGGAGATTCAGATACGCGCTTTATGCGAATCAGTCGTCTTCTTTCTCCCCAATCCCTTGTTCCTCCAGGCAGTGTATCGATGATTTCCCTGATCTCGGCGGCCAACCCGATTGTCCAACCCCTAACACCCCCTGTTCCTATACCTACAGGTATTAATCATTATTATTATTTTTTTATTTTTTTATTTTTTTAAATAAAATTTTTGGTGGGGACCATTATAACNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTACAGCTACACAGGTGGGGAGCGTGCGAGCGGAATATTCGCAAACGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATTCGGCTGCACAGAAGAAGTCGAACTCACCAACTTCATGAAGGGAGCCGATGGCCTCATTGGCTTAGGCTCAAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACATCGGCGGCGGCTTCTCCTACTGCCTCGCCGATCACAACCGCAACACAACCGCCATTAGCTACTTCGTCTTCGGCACCCCTTCCCCCAAGACCTTCTCCGCCACCACCTCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCAATACAGCTGCTACTACGGTGTCCAACTGATCGGAATCTCCGTCGACGACCAGATGCTTAAGATCCCCCGTCACGTCTGGAACATCAAATCCGGGTGCGGTACCATCTTGGACACCGGCACCAGCCTGACGTTGCTGACGGCACCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCGAAGATCGCGAAATTCGGAAGAATGGAAAAGCAGAGGAACTTCGAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTATGTCGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACGTCGTTTCGGCGGCAACCCAATGTAGCTGTGTTGCCATAAGTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACTTTTGGCAATTTGATTTACTCAAGGAATCCGTCACTTTTGCTCCCTCCGATTGCGCCTAG

mRNA sequence

TCCGCCATTCTCATCTTCCTCCTCGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGCCGATTTCTCATTCTTTAATCCTTTTCTTCGTCTTCGTCTTCTTCTCTCCTATCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCTCAAACAAGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTAGATCTGATACACCGTCACCATCCGGAAGTGGTTAAAAGGCTTGATGACGAAATTAAGGTGGATAGTGTCGAGGATCGCATCAGGGATATTCGCTATCACGATCAAAACCGTCTCCGATCCATATCCGCCAAGCTGAATTGGACAAAAGTTGTGGAGAATGCGGAGGAGAAAGAGAAGGAGGTCTCGGGTTCGAATCTACCTCCACAGTCGCAGACGCCAATAGGATTGAAAACATACCCCGGCGCTGATTTCGGTAGCGGTGAATTTTTCGTGCAATTGAAAGTCGGAACACCGCCGCAGACGTTCACACTGATTGCAGATACCGGAAGTGACCTATTGTGGACGAAATGCAGATTCCGGCGGTGCAGGGGAGATTGCAGCAACCTCTCTCCGATGCATAAGATGCGTAACAAAATGAGAGGGAGATTCAGATACGCGCTTTATGCGAATCAGTCGTCTTCTTTCTCCCCAATCCCTTGTTCCTCCAGGCAGTGTATCGATGATTTCCCTGATCTCGGCGGCCAACCCGATTGTCCAACCCCTAACACCCCCTGTTCCTATACCTACAGCTACACAGGTGGGGAGCGTGCGAGCGGAATATTCGCAAACGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATTCGGCTGCACAGAAGAAGTCGAACTCACCAACTTCATGAAGGGAGCCGATGGCCTCATTGGCTTAGGCTCAAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACATCGGCGGCGGCTTCTCCTACTGCCTCGCCGATCACAACCGCAACACAACCGCCATTAGCTACTTCGTCTTCGGCACCCCTTCCCCCAAGACCTTCTCCGCCACCACCTCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCAATACAGCTGCTACTACGGTGTCCAACTGATCGGAATCTCCGTCGACGACCAGATGCTTAAGATCCCCCGTCACGTCTGGAACATCAAATCCGGGTGCGGTACCATCTTGGACACCGGCACCAGCCTGACGTTGCTGACGGCACCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCGAAGATCGCGAAATTCGGAAGAATGGAAAAGCAGAGGAACTTCGAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTATGTCGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACGTCGTTTCGGCGGCAACCCAATGTAGCTGTGTTGCCATAAGTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACTTTTGGCAATTTGATTTACTCAAGGAATCCGTCACTTTTGCTCCCTCCGATTGCGCCTAG

Coding sequence (CDS)

TCCGCCATTCTCATCTTCCTCCTCGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGCCGATTTCTCATTCTTTAATCCTTTTCTTCGTCTTCGTCTTCTTCTCTCCTATCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCTCAAACAAGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTAGATCTGATACACCGTCACCATCCGGAAGTGGTTAAAAGGCTTGATGACGAAATTAAGGTGGATAGTGTCGAGGATCGCATCAGGGATATTCGCTATCACGATCAAAACCGTCTCCGATCCATATCCGCCAAGCTGAATTGGACAAAAGTTGTGGAGAATGCGGAGGAGAAAGAGAAGGAGGTCTCGGGTTCGAATCTACCTCCACAGTCGCAGACGCCAATAGGATTGAAAACATACCCCGGCGCTGATTTCGGTAGCGGTGAATTTTTCGTGCAATTGAAAGTCGGAACACCGCCGCAGACGTTCACACTGATTGCAGATACCGGAAGTGACCTATTGTGGACGAAATGCAGATTCCGGCGGTGCAGGGGAGATTGCAGCAACCTCTCTCCGATGCATAAGATGCGTAACAAAATGAGAGGGAGATTCAGATACGCGCTTTATGCGAATCAGTCGTCTTCTTTCTCCCCAATCCCTTGTTCCTCCAGGCAGTGTATCGATGATTTCCCTGATCTCGGCGGCCAACCCGATTGTCCAACCCCTAACACCCCCTGTTCCTATACCTACAGCTACACAGGTGGGGAGCGTGCGAGCGGAATATTCGCAAACGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATTCGGCTGCACAGAAGAAGTCGAACTCACCAACTTCATGAAGGGAGCCGATGGCCTCATTGGCTTAGGCTCAAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACATCGGCGGCGGCTTCTCCTACTGCCTCGCCGATCACAACCGCAACACAACCGCCATTAGCTACTTCGTCTTCGGCACCCCTTCCCCCAAGACCTTCTCCGCCACCACCTCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCAATACAGCTGCTACTACGGTGTCCAACTGATCGGAATCTCCGTCGACGACCAGATGCTTAAGATCCCCCGTCACGTCTGGAACATCAAATCCGGGTGCGGTACCATCTTGGACACCGGCACCAGCCTGACGTTGCTGACGGCACCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCGAAGATCGCGAAATTCGGAAGAATGGAAAAGCAGAGGAACTTCGAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTATGTCGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACGTCGTTTCGGCGGCAACCCAATGTAGCTGTGTTGCCATAAGTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACTTTTGGCAATTTGATTTACTCAAGGAATCCGTCACTTTTGCTCCCTCCGATTGCGCCTAG

Protein sequence

SAILIFLLASSVFSDRNGAMSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA
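
As a quick consistency check between the coding sequence and the protein sequence above, the first codons can be translated with the standard genetic code. This is a minimal sketch: the codon table is the standard nuclear code, `translate` is an illustrative helper rather than anything used by the database, and only the first 69 bases of the listed CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; unknown codons (e.g. containing N) become X."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 69 bases (23 codons) of the CDS listed above.
cds_prefix = "TCCGCCATTCTCATCTTCCTCCTCGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGCCGATT"
peptide = translate(cds_prefix)
print(peptide)                   # SAILIFLLASSVFSDRNGAMSPI
print(peptide.index("M") + 1)    # 20
```

The first methionine falls at residue 20 of the listed protein, which matches the query start position (20) of the alignments against NCBI nr further down the page.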
Homology
BLAST of Cp4.1LG05g10050 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 5.2e-72
Identity = 174/484 (35.95%), Positives = 248/484 (51.24%), Query Frame = 0

Query: 58  ANNEEQEFVRLDLIHRHHPEVVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTK 117
           A++ +   VRL L HR           +  +     RI D+   DQ R   IS K N   
Sbjct: 41  ADSMKDTSVRLKLAHR-----------DTLLPKPLSRIEDVIGADQKRHSLISRKRN--- 100

Query: 118 VVENAEEKEKEVSGSNLPPQSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTG 177
                               S   + +    G D+G+ ++F +++VGTP + F ++ DTG
Sbjct: 101 --------------------STVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTG 160

Query: 178 SDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDF 237
           S+L W  CR+ R RG  +                R    A++S SF  + C ++ C  D 
Sbjct: 161 SELTWVNCRY-RARGKDN----------------RRVFRADESKSFKTVGCLTQTCKVDL 220

Query: 238 PDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEV 297
            +L     CPTP+TPCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   
Sbjct: 221 MNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSF 280

Query: 298 ELTNFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTP-SP 357
              +F +GADG++GL  S +SF    A +  G  FSYCL DH  N    +Y +FG+  S 
Sbjct: 281 TGQSF-QGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRST 340

Query: 358 KT-FSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTI 417
           KT F  TT     P   T++        +Y + +IGIS+   ML IP  VW+  SG GTI
Sbjct: 341 KTAFRRTT-----PLDLTRI------PPFYAINVIGISLGYDMLDIPSQVWDATSGGGTI 400

Query: 418 LDTGTSLTLLTAPAHDAVIEAMAPKIAKFGRMEKQR-NFELCFNDTE-WNFGMSPKLGFH 477
           LD+GTSLTLL   A+  V+  +A  + +  R++ +    E CF+ T  +N    P+L FH
Sbjct: 401 LDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFH 460

Query: 478 FEGGAVFEPPDRSYVVSAATQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFA 537
            +GGA FEP  +SY+V AA    C+   S   P+ N++GNI+QQ Y W+FDL+  +++FA
Sbjct: 461 LKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFA 460
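
The percentages in the HSP header of this hit follow directly from the reported counts: each is the number of matching (or positively scoring) columns divided by the alignment length. A minimal sketch of that arithmetic, with a hypothetical helper name:

```python
def percent(count: int, aligned_length: int) -> float:
    """Share of alignment columns, as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * count / aligned_length, 2)


# Counts taken from the HSP header of the Q9LTW4 hit (alignment length 484).
identity = percent(174, 484)
positives = percent(248, 484)
print(identity, positives)  # 35.95 51.24
```

The alignment length (484) includes gap columns, so these values describe the aligned region only, not the full 538-residue query.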

BLAST of Cp4.1LG05g10050 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 1.6e-36
Identity = 133/442 (30.09%), Positives = 201/442 (45.48%), Query Frame = 0

Query: 106 LRSISAKLNWTK--VVENAEEK-EKEVSGSNLPPQSQTPIGLKTYPGADFGSGEFFVQLK 165
           L  + +  N TK  +++ A ++ E+ +   N   QS + I    Y     G GE+ + + 
Sbjct: 46  LEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYA----GDGEYLMNVA 105

Query: 166 VGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRYALYANQSSS 225
           +GTP  +F+ I DTGSDL+WT+C    C    S  +P+   ++              SSS
Sbjct: 106 IGTPDSSFSAIMDTGSDLIWTQC--EPCTQCFSQPTPIFNPQD--------------SSS 165

Query: 226 FSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGK 285
           FS +PC S+ C D        P     N  C YTY Y  G    G  A ET T   ++  
Sbjct: 166 FSTLPCESQYCQD-------LPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSS-- 225

Query: 286 EKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFVYKAAENNIG-GGFSYCLADHNR 345
              + +I FGC E+ +      GA GLIG+G    S       + +G G FSYC+     
Sbjct: 226 ---VPNIAFGCGEDNQGFGQGNGA-GLIGMGWGPLSL-----PSQLGVGQFSYCMTS--- 285

Query: 346 NTTAISYFVFGTPSPKTF---SATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQM 405
                    +G+ SP T    SA +  P G P+TT L        YY + L GI+V    
Sbjct: 286 ---------YGSSSPSTLALGSAASGVPEGSPSTT-LIHSSLNPTYYYITLQGITVGGDN 345

Query: 406 LKIPRHVWNIKSG--CGTILDTGTSLTLLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELC 465
           L IP   + ++     G I+D+GT+LT L   A++AV +A   +I      E       C
Sbjct: 346 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTC 405

Query: 466 FND-TEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVAISSLPFPSINILGNII 525
           F   ++ +    P++   F+GG V    +++ ++S A    C+A+ S     I+I GNI 
Sbjct: 406 FQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQ 435

Query: 526 QQTYFWQFDLLKESVTFAPSDC 538
           QQ     +DL   +V+F P+ C
Sbjct: 466 QQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cp4.1LG05g10050 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 4.6e-36
Identity = 113/385 (29.35%), Positives = 175/385 (45.45%), Query Frame = 0

Query: 154 SGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRY 213
           SGE+ + + +GTPP     IADTGSDLLWT+C    C    + + P+   +         
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC--APCDDCYTQVDPLFDPKT-------- 146

Query: 214 ALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANET 273
                 SS++  + CSS QC      L  Q  C T +  CSY+ SY       G  A +T
Sbjct: 147 ------SSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206

Query: 274 VTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFVYKAAENNIGGGFS 333
           +T+  ++ +  QLK+I+ GC        F K   G++GLG    S + K   ++I G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHN-NAGTFNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266

Query: 334 YCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGIS 393
           YCL          S   FGT +  + S   S+P+   A+ + F        Y + L  IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETF--------YYLTLKSIS 326

Query: 394 VDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAPKIAKFGRMEKQRNF 453
           V  + ++          G   I+D+GT+LTLL    +  + +A+A  I    + + Q   
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGL 386

Query: 454 ELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVAISSLPFPSINILGN 513
            LC++ T       P +  HF+G  V      ++ V  +    C A      PS +I GN
Sbjct: 387 SLCYSAT--GDLKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRG--SPSFSIYGN 435

Query: 514 IIQQTYFWQFDLLKESVTFAPSDCA 539
           + Q  +   +D + ++V+F P+DCA
Sbjct: 447 VAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of Cp4.1LG05g10050 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 9.0e-32
Identity = 121/401 (30.17%), Positives = 173/401 (43.14%), Query Frame = 0

Query: 141 PIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPM 200
           P G++T   A  G GE+ + L +GTP Q F+ I DTGSDL+WT+C  + C    +  +P+
Sbjct: 81  PSGVETSVYA--GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQC--QPCTQCFNQSTPI 140

Query: 201 HKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSYT 260
              +               SSSFS +PCSS+ C          P C   N  C YTY Y 
Sbjct: 141 FNPQG--------------SSSFSTLPCSSQLC-----QALSSPTC--SNNFCQYTYGYG 200

Query: 261 GGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFV 320
            G    G    ET+T    +     + +I FGC E  +      GA GL+G+G    S  
Sbjct: 201 DGSETQGSMGTETLTFGSVS-----IPNITFGCGENNQGFGQGNGA-GLVGMGRGPLSLP 260

Query: 321 YKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQ 380
            +         FSYC+     +T   S  + G       S   S   G P TT L    Q
Sbjct: 261 SQLDVTK----FSYCMTPIGSSTP--SNLLLG-------SLANSVTAGSPNTT-LIQSSQ 320

Query: 381 YSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGT---ILDTGTSLTLLTAPAHDAVIEAM 440
              +Y + L G+SV    L I    + + S  GT   I+D+GT+LT     A+ +V +  
Sbjct: 321 IPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEF 380

Query: 441 APKIAKFGRMEKQRNFELCFNDTEWNFGMS-PKLGFHFEGGAVFEPPDRSYVVSAATQCS 500
             +I           F+LCF        +  P    HF+GG + E P  +Y +S +    
Sbjct: 381 ISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDL-ELPSENYFISPSNGLI 434

Query: 501 CVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDC 538
           C+A+ S     ++I GNI QQ     +D     V+FA + C
Sbjct: 441 CLAMGS-SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Cp4.1LG05g10050 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 2.0e-31
Identity = 116/395 (29.37%), Positives = 170/395 (43.04%), Query Frame = 0

Query: 149 GADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMR 208
           G   GSGE+F +L VGTP +   ++ DTGSD++W +C    CR   S   P+   R    
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPR---- 193

Query: 209 GRFRYALYANQSSSFSPIPCSSRQC--IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERAS 268
                     +S +++ IPCSS  C  +D          C T    C Y  SY  G    
Sbjct: 194 ----------KSKTYATIPCSSPHCRRLD-------SAGCNTRRKTCLYQVSYGDGSFTV 253

Query: 269 GIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFVYKAAEN 328
           G F+ ET+T R       ++K +  GC  + E      GA GL+GLG    SF  +   +
Sbjct: 254 GDFSTETLTFR-----RNRVKGVALGCGHDNE--GLFVGAAGLLGLGKGKLSFPGQTG-H 313

Query: 329 NIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYG 388
                FSYCL D + ++          PS   F     S I     T L +  +   +Y 
Sbjct: 314 RFNQKFSYCLVDRSASS---------KPSSVVFGNAAVSRIA--RFTPLLSNPKLDTFYY 373

Query: 389 VQLIGISVDDQMLK-IPRHVWNIK--SGCGTILDTGTSLTLLTAPAHDAVIEAMAPKIAK 448
           V L+GISV    +  +   ++ +      G I+D+GTS+T L  PA+ A+ +A       
Sbjct: 374 VGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKT 433

Query: 449 FGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVAISSL 508
             R      F+ CF+ +  N    P +  HF G  V   P  +Y++   T        + 
Sbjct: 434 LKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAFAG 485

Query: 509 PFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 539
               ++I+GNI QQ +   +DL    V FAP  CA
Sbjct: 494 TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Cp4.1LG05g10050 vs. NCBI nr
Match: XP_023532727.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1063 bits (2749), Expect = 0.0
Identity = 519/519 (100.00%), Positives = 519/519 (100.00%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 79
           MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV
Sbjct: 1   MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 60

Query: 80  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQ 139
           KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQ
Sbjct: 61  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQ 120

Query: 140 TPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSP 199
           TPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSP
Sbjct: 121 TPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSP 180

Query: 200 MHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSY 259
           MHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSY
Sbjct: 181 MHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSY 240

Query: 260 TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSF 319
           TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSF
Sbjct: 241 TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSF 300

Query: 320 VYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGG 379
           VYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGG
Sbjct: 301 VYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGG 360

Query: 380 QYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAP 439
           QYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAP
Sbjct: 361 QYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAP 420

Query: 440 KIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVA 499
           KIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVA
Sbjct: 421 KIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVA 480

Query: 500 ISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
           ISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA
Sbjct: 481 ISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 519

BLAST of Cp4.1LG05g10050 vs. NCBI nr
Match: XP_022947059.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita moschata])

HSP 1 Score: 1013 bits (2620), Expect = 0.0
Identity = 495/521 (95.01%), Positives = 510/521 (97.89%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVF--FSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE 79
           MSPISH LIL FVFVF  FSP+TVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE
Sbjct: 1   MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE 60

Query: 80  VVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQ 139
           VVKR+DDEIKVDSVEDRI+DIRYHDQNRLR+ISA LNWTKVVENAEEKEKEVSGSNL   
Sbjct: 61  VVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL--- 120

Query: 140 SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNL 199
           SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCS+L
Sbjct: 121 SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHL 180

Query: 200 SPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTY 259
           SPMHKMRNKMRGRFRYALYANQSSSFSPIPCSS+QCIDDFPDLGGQPDCPTPNTPCSYTY
Sbjct: 181 SPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTY 240

Query: 260 SYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIY 319
           SYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVE+T+FMKGADGLIGLGSSIY
Sbjct: 241 SYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY 300

Query: 320 SFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFT 379
           SFVYKAAENNIGGGFSYCLADH+RNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFT
Sbjct: 301 SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFT 360

Query: 380 GGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAM 439
           GGQYSCYYGVQLIGISVDDQ+L IPRHVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAM
Sbjct: 361 GGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAM 420

Query: 440 APKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSC 499
           APKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSY+VSA+ QCSC
Sbjct: 421 APKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSC 480

Query: 500 VAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
           +AI+SLPFPSINILGNIIQQTYFWQFDLLK SVTFAPSDCA
Sbjct: 481 IAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA 518

BLAST of Cp4.1LG05g10050 vs. NCBI nr
Match: KAG6605363.1 (Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1008 bits (2605), Expect = 0.0
Identity = 491/519 (94.61%), Positives = 507/519 (97.69%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 79
           MSPISH LILFFV  FFSP+TVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV
Sbjct: 1   MSPISHLLILFFV--FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 60

Query: 80  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQ 139
           KR+DDEIKVD+VEDRI+DIRYHDQNRLR+ISA LNWTKVVENAEEKEKEVSGSNL   SQ
Sbjct: 61  KRIDDEIKVDTVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL---SQ 120

Query: 140 TPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSP 199
           TPIGLK YPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCS+LSP
Sbjct: 121 TPIGLKIYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSP 180

Query: 200 MHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSY 259
           MHKMRNKMRGRFRYALYANQSSSFSPIPCSS+QCIDDFPDLGGQPDCPTPNTPCSYTYSY
Sbjct: 181 MHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSY 240

Query: 260 TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSF 319
           TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVE+TNFMKGADGLIGLGSSIYSF
Sbjct: 241 TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTNFMKGADGLIGLGSSIYSF 300

Query: 320 VYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGG 379
           VYKAAENNIGGGFSYCLADH+RN TAISYFVFGTPSPKTFSATTSSPIGPP+TTKLFTGG
Sbjct: 301 VYKAAENNIGGGFSYCLADHHRNITAISYFVFGTPSPKTFSATTSSPIGPPSTTKLFTGG 360

Query: 380 QYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAP 439
           QYSCYYGVQLIGISVDDQ+L IPRHVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAMAP
Sbjct: 361 QYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAP 420

Query: 440 KIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVA 499
           KIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSY+VSA+ QCSC+A
Sbjct: 421 KIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIA 480

Query: 500 ISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
           I+SLPFPSINILGNIIQQTYFWQFDLLK SVTFAPSDCA
Sbjct: 481 ITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA 514

BLAST of Cp4.1LG05g10050 vs. NCBI nr
Match: XP_023007158.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita maxima])

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 449/525 (85.52%), Positives = 480/525 (91.43%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 79
           MS ISH LILFFV  FFSP+TVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV
Sbjct: 1   MSSISHLLILFFVVFFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 60

Query: 80  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQ 139
           KRL DEIKVD +EDRI+DIRYHDQ+RLR+ISA LNWTKVVENAEEK KE SGSN PP SQ
Sbjct: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKVKEASGSNHPPHSQ 120

Query: 140 TPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSP 199
           TPI LKTYPGADFGS EFFVQLKVGTPPQ FT+IADTGSDLLWT+CR+RRCRGDCSN SP
Sbjct: 121 TPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSP 180

Query: 200 MHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSY 259
           +HKMRNKMR RF YALYANQSSSFSPIPCSS+QCI DF +LGGQPDCPTPNTPCSYTYSY
Sbjct: 181 IHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSY 240

Query: 260 TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSF 319
             G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIYSF
Sbjct: 241 LSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSF 300

Query: 320 VYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGG 379
           VYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTFSA+TSSPIGPPATTKLFTGG
Sbjct: 301 VYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTGG 360

Query: 380 QYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAP 439
           +YSCYYGVQL GISVD Q+L IP HVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAMAP
Sbjct: 361 RYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAP 420

Query: 440 KIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAAT 499
           KI KFGRMEK      ++NF+LCFNDTEWNFGM PKLGFHFE GAVFEPPDRSY+VSA+ 
Sbjct: 421 KIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASY 480

Query: 500 QCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
           QCSC+AI+SLPFPSINILGNIIQQT+ W++DLLK SVTFAPSDCA
Sbjct: 481 QCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 525

BLAST of Cp4.1LG05g10050 vs. NCBI nr
Match: XP_022947824.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita moschata])

HSP 1 Score: 915 bits (2364), Expect = 0.0
Identity = 444/526 (84.41%), Positives = 481/526 (91.44%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 79
           MSPISH LILFFV  FFSP+TVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVV
Sbjct: 1   MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVV 60

Query: 80  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEK-EVSGSNLPPQS 139
           KRL DEIKVD +EDRI+DIRYHDQ+RLR+IS  +NWTKVVENAEEKEK E S SNLPPQS
Sbjct: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQS 120

Query: 140 QTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLS 199
           QTPI LKTYPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCSN S
Sbjct: 121 QTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 180

Query: 200 PMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYS 259
           P+HKMRN+MR RF YALYANQSSSFSPIPCSS+QCI DF +LGGQPDCPTPN+PCSYTYS
Sbjct: 181 PIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYS 240

Query: 260 YTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYS 319
           Y  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIYS
Sbjct: 241 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS 300

Query: 320 FVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTG 379
           FVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTF+A+TSSPIGPPATT+L TG
Sbjct: 301 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITG 360

Query: 380 GQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMA 439
           G+YSCYYGVQL GISVD Q+L IP HVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAMA
Sbjct: 361 GRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 420

Query: 440 PKIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAA 499
           PKI KFGRME+      ++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSY+VSA+
Sbjct: 421 PKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSAS 480

Query: 500 TQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
            QCSC+AI+SLPFPSINILGNIIQQTY WQFDLLK SVTFAPSDCA
Sbjct: 481 YQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 524

BLAST of Cp4.1LG05g10050 vs. ExPASy TrEMBL
Match: A0A6J1G5P4 (aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC111451044 PE=3 SV=1)

HSP 1 Score: 1013 bits (2620), Expect = 0.0
Identity = 495/521 (95.01%), Positives = 510/521 (97.89%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVF--FSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE 79
           MSPISH LIL FVFVF  FSP+TVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE
Sbjct: 1   MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE 60

Query: 80  VVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQ 139
           VVKR+DDEIKVDSVEDRI+DIRYHDQNRLR+ISA LNWTKVVENAEEKEKEVSGSNL   
Sbjct: 61  VVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL--- 120

Query: 140 SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNL 199
           SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCS+L
Sbjct: 121 SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHL 180

Query: 200 SPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTY 259
           SPMHKMRNKMRGRFRYALYANQSSSFSPIPCSS+QCIDDFPDLGGQPDCPTPNTPCSYTY
Sbjct: 181 SPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTY 240

Query: 260 SYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIY 319
           SYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVE+T+FMKGADGLIGLGSSIY
Sbjct: 241 SYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY 300

Query: 320 SFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFT 379
           SFVYKAAENNIGGGFSYCLADH+RNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFT
Sbjct: 301 SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFT 360

Query: 380 GGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAM 439
           GGQYSCYYGVQLIGISVDDQ+L IPRHVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAM
Sbjct: 361 GGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAM 420

Query: 440 APKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSC 499
           APKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSY+VSA+ QCSC
Sbjct: 421 APKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSC 480

Query: 500 VAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
           +AI+SLPFPSINILGNIIQQTYFWQFDLLK SVTFAPSDCA
Sbjct: 481 IAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA 518

BLAST of Cp4.1LG05g10050 vs. ExPASy TrEMBL
Match: A0A6J1L6Y2 (aspartic proteinase NANA, chloroplast-like OS=Cucurbita maxima OX=3661 GN=LOC111499734 PE=3 SV=1)

HSP 1 Score: 926 bits (2392), Expect = 0.0
Identity = 449/525 (85.52%), Positives = 480/525 (91.43%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 79
           MS ISH LILFFV  FFSP+TVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV
Sbjct: 1   MSSISHLLILFFVVFFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 60

Query: 80  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNLPPQSQ 139
           KRL DEIKVD +EDRI+DIRYHDQ+RLR+ISA LNWTKVVENAEEK KE SGSN PP SQ
Sbjct: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKVKEASGSNHPPHSQ 120

Query: 140 TPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSP 199
           TPI LKTYPGADFGS EFFVQLKVGTPPQ FT+IADTGSDLLWT+CR+RRCRGDCSN SP
Sbjct: 121 TPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSP 180

Query: 200 MHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSY 259
           +HKMRNKMR RF YALYANQSSSFSPIPCSS+QCI DF +LGGQPDCPTPNTPCSYTYSY
Sbjct: 181 IHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYSY 240

Query: 260 TGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSF 319
             G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIYSF
Sbjct: 241 LSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYSF 300

Query: 320 VYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGG 379
           VYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTFSA+TSSPIGPPATTKLFTGG
Sbjct: 301 VYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTGG 360

Query: 380 QYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAP 439
           +YSCYYGVQL GISVD Q+L IP HVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAMAP
Sbjct: 361 RYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAP 420

Query: 440 KIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAAT 499
           KI KFGRMEK      ++NF+LCFNDTEWNFGM PKLGFHFE GAVFEPPDRSY+VSA+ 
Sbjct: 421 KIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSASY 480

Query: 500 QCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
           QCSC+AI+SLPFPSINILGNIIQQT+ W++DLLK SVTFAPSDCA
Sbjct: 481 QCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 525

BLAST of Cp4.1LG05g10050 vs. ExPASy TrEMBL
Match: A0A6J1G810 (aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC111451585 PE=3 SV=1)

HSP 1 Score: 915 bits (2364), Expect = 0.0
Identity = 444/526 (84.41%), Positives = 481/526 (91.44%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFFSPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 79
           MSPISH LILFFV  FFSP+TVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVV
Sbjct: 1   MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVV 60

Query: 80  KRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEK-EVSGSNLPPQS 139
           KRL DEIKVD +EDRI+DIRYHDQ+RLR+IS  +NWTKVVENAEEKEK E S SNLPPQS
Sbjct: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQS 120

Query: 140 QTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLS 199
           QTPI LKTYPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCSN S
Sbjct: 121 QTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 180

Query: 200 PMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYS 259
           P+HKMRN+MR RF YALYANQSSSFSPIPCSS+QCI DF +LGGQPDCPTPN+PCSYTYS
Sbjct: 181 PIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYS 240

Query: 260 YTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYS 319
           Y  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIYS
Sbjct: 241 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS 300

Query: 320 FVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTG 379
           FVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTF+A+TSSPIGPPATT+L TG
Sbjct: 301 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITG 360

Query: 380 GQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMA 439
           G+YSCYYGVQL GISVD Q+L IP HVWNIKSGCGTILDTGTSLT+LTAPAHDAVIEAMA
Sbjct: 361 GRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 420

Query: 440 PKIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAA 499
           PKI KFGRME+      ++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSY+VSA+
Sbjct: 421 PKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSAS 480

Query: 500 TQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 538
            QCSC+AI+SLPFPSINILGNIIQQTY WQFDLLK SVTFAPSDCA
Sbjct: 481 YQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 524
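
The identity and positives percentages in each HSP header are ratios over the alignment length (444/526 and 481/526 for the hit above). As a minimal sketch, not part of the database output, the following Python snippet parses such a header line and recomputes both percentages. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header as rendered on this page.
# "Posi?tives" also accepts the "Postives" spelling found in some renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return identity/positives counts and the recomputed percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, positives, pos_len = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": ident_len,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positives / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 444/526 (84.41%), Positives = 481/526 (91.44%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # prints: 84.41 91.44
```

Recomputing from the counts, rather than trusting the printed percentage, gives a quick consistency check on each header.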

BLAST of Cp4.1LG05g10050 vs. ExPASy TrEMBL
Match: A0A0A0KG92 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G134390 PE=3 SV=1)

HSP 1 Score: 528 bits (1361), Expect = 8.88e-181
Identity = 269/532 (50.56%), Positives = 356/532 (66.92%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFF------SPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHR 79
           MSPIS+    FF F+ F      S    A+ D+ N  N     + + +EQE ++ DL+HR
Sbjct: 8   MSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHR 67

Query: 80  HHPEVVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVE-------NAEEKE 139
           HHP+V +++  ++K+  V +R++DI  HD NR RSIS  +N  +V +        A  +E
Sbjct: 68  HHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEE 127

Query: 140 KEVSGSNLPPQSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCR 199
           +    + LPP + TPIG++   GADFGS E+FV+LKVGTP QTF LIADTGSDL W KCR
Sbjct: 128 EVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCR 187

Query: 200 FRRCRGDCSNLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDC 259
           +RRC G+CS+ +  HK +N+ + RFR+A  AN SSSF  + CSS  C +D  DL    +C
Sbjct: 188 YRRCFGNCSS-NVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVREC 247

Query: 260 PTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGA 319
             P +PC Y YSYTGG  A GIFA ET+TV LTNGKEKQL + + GCTE V+ + F  GA
Sbjct: 248 HNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVF-GGA 307

Query: 320 DGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSP 379
           DG++GLG+S YS  YKAAEN  GGGFSYCL DH  +  AISYFV G P+P T ++T+S+ 
Sbjct: 308 DGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAK 367

Query: 380 IGPPAT-TKLFTGGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLL 439
           +    T TKL+ G  YS +YGV LIGIS +  ML IP  VW+I SG GTI+D+GTSLT+L
Sbjct: 368 LPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTIL 427

Query: 440 TAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDR 499
            APA D V+EA+ P++ KF ++E +  F+ CFN++++   M+PKL FHF  G VFEPP +
Sbjct: 428 AAPAFDMVMEALTPRLKKFQQLEIEP-FDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTK 487

Query: 500 SYVVSAATQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDC 537
           SY+VS     SC+   S+PFP+ NI+GNI+QQ + WQFD  K  V FAPS+C
Sbjct: 488 SYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC 536

BLAST of Cp4.1LG05g10050 vs. ExPASy TrEMBL
Match: A0A1S3C2F3 (aspartic proteinase CDR1 OS=Cucumis melo OX=3656 GN=LOC103496268 PE=3 SV=1)

HSP 1 Score: 523 bits (1348), Expect = 6.49e-179
Identity = 269/532 (50.56%), Positives = 357/532 (67.11%), Query Frame = 0

Query: 20  MSPISHSLILFFVFVFF----SPITVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHH 79
           MSPIS+    FF+ +FF    S    A+ D++N  N   + D    EQ+ +R DL+HRHH
Sbjct: 8   MSPISN-FCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDED----EQQTIRFDLLHRHH 67

Query: 80  PEVVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVS----- 139
           P+V ++L+ ++K+  + +R++DI  HD+NR RSIS  +N  ++ +     E E +     
Sbjct: 68  PQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEV 127

Query: 140 --GSNLPPQSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFR 199
              + LPP + TPIG+K   GADFGS E+FVQLKVGTP QTF LIADTGSDL W KCR+R
Sbjct: 128 AKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYR 187

Query: 200 RCRGDCSNLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPT 259
           RC G+CS  +  HK +N+ + RFR+AL ANQSS+F  + CSS  C ++  +L    +C T
Sbjct: 188 RCFGNCSG-NVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDT 247

Query: 260 PNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADG 319
           P +PC Y YSY GG  A GIFA ET+TV LTNGKEKQL++ + GCTE V+  N   GADG
Sbjct: 248 PTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQ-GNVFDGADG 307

Query: 320 LIGLGSSIYSFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIG 379
           ++GLG+S YS  YKAAEN  GGGFSYCL DH  +  A+SYFV G P+P T ++T+S+   
Sbjct: 308 VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAK-- 367

Query: 380 PPAT---TKLFTGGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTILDTGTSLTLL 439
           PPA    TKL+ G  YS +YGV LIGIS D QML IP  VW+   GCGTI+D+GTSLT+L
Sbjct: 368 PPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVL 427

Query: 440 TAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDR 499
             PA D V+E +  ++ +F ++E +  F  CFN++++   M+PKL FHF  G VFEPP +
Sbjct: 428 ATPAFDVVMEVLTSRLKQFQQIEIEP-FNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTK 487

Query: 500 SYVVSAATQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDC 537
           SY+VS     SC+ I S+PFPS+NI+GNI+QQ + WQFD  K  V FA S+C
Sbjct: 488 SYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC 529

BLAST of Cp4.1LG05g10050 vs. TAIR 10
Match: AT3G12700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 273.5 bits (698), Expect = 3.7e-73
Identity = 174/484 (35.95%), Positives = 248/484 (51.24%), Query Frame = 0

Query: 58  ANNEEQEFVRLDLIHRHHPEVVKRLDDEIKVDSVEDRIRDIRYHDQNRLRSISAKLNWTK 117
           A++ +   VRL L HR           +  +     RI D+   DQ R   IS K N   
Sbjct: 41  ADSMKDTSVRLKLAHR-----------DTLLPKPLSRIEDVIGADQKRHSLISRKRN--- 100

Query: 118 VVENAEEKEKEVSGSNLPPQSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTG 177
                               S   + +    G D+G+ ++F +++VGTP + F ++ DTG
Sbjct: 101 --------------------STVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTG 160

Query: 178 SDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDF 237
           S+L W  CR+ R RG  +                R    A++S SF  + C ++ C  D 
Sbjct: 161 SELTWVNCRY-RARGKDN----------------RRVFRADESKSFKTVGCLTQTCKVDL 220

Query: 238 PDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEV 297
            +L     CPTP+TPCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   
Sbjct: 221 MNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSF 280

Query: 298 ELTNFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTP-SP 357
              +F +GADG++GL  S +SF    A +  G  FSYCL DH  N    +Y +FG+  S 
Sbjct: 281 TGQSF-QGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRST 340

Query: 358 KT-FSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQMLKIPRHVWNIKSGCGTI 417
           KT F  TT     P   T++        +Y + +IGIS+   ML IP  VW+  SG GTI
Sbjct: 341 KTAFRRTT-----PLDLTRI------PPFYAINVIGISLGYDMLDIPSQVWDATSGGGTI 400

Query: 418 LDTGTSLTLLTAPAHDAVIEAMAPKIAKFGRMEKQR-NFELCFNDTE-WNFGMSPKLGFH 477
           LD+GTSLTLL   A+  V+  +A  + +  R++ +    E CF+ T  +N    P+L FH
Sbjct: 401 LDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFH 460

Query: 478 FEGGAVFEPPDRSYVVSAATQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFA 537
            +GGA FEP  +SY+V AA    C+   S   P+ N++GNI+QQ Y W+FDL+  +++FA
Sbjct: 461 LKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFA 460

BLAST of Cp4.1LG05g10050 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 206.5 bits (524), Expect = 5.5e-53
Identity = 135/405 (33.33%), Positives = 203/405 (50.12%), Query Frame = 0

Query: 149 GADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMR 208
           GA  GSG++FV L++G PPQ+  LIADTGSDL+W KC    CR +CS+ SP         
Sbjct: 76  GAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC--SACR-NCSHHSP--------- 135

Query: 209 GRFRYALYANQSSSFSPIPCSSRQC-IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASG 268
                  +   SS+FSP  C    C +   PD     +    ++ C Y Y Y  G   SG
Sbjct: 136 ---ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSG 195

Query: 269 IFANETVTVRLTNGKEKQLKDILFGC-----TEEVELTNFMKGADGLIGLGSSIYSFVYK 328
           +FA ET +++ ++GKE +LK + FGC      + V  T+F  GA+G++GLG    SF  +
Sbjct: 196 LFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSF-NGANGVMGLGRGPISFASQ 255

Query: 329 AAENNIGGGFSYCLADHNRNTTAISYFVFGTP----SPKTFSATTSSPIGPPATTKLFTG 388
                 G  FSYCL D+  +    SY + G      S   F+   ++P+ P         
Sbjct: 256 LG-RRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSP--------- 315

Query: 389 GQYSCYYGVQLIGISVDDQMLKIPRHVWNI--KSGCGTILDTGTSLTLLTAPAHDAVIEA 448
                +Y V+L  + V+   L+I   +W I      GT++D+GT+L  L  PA+ +VI A
Sbjct: 316 ----TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAA 375

Query: 449 MAPKIAKFGRMEKQRNFELCFN--DTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQ 508
           +  ++           F+LC N         + P+L F F GGAVF PP R+Y +    Q
Sbjct: 376 VRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ 435

Query: 509 CSCVAISSL-PFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 539
             C+AI S+ P    +++GN++QQ + ++FD  +  + F+   CA
Sbjct: 436 IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450

BLAST of Cp4.1LG05g10050 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 174.9 bits (442), Expect = 1.8e-43
Identity = 141/459 (30.72%), Positives = 209/459 (45.53%), Query Frame = 0

Query: 97  DIRYHDQNRLRSISAKLNWTKVVENAEEKEKEVSGSNL---PPQSQTPIGLKTYPGADFG 156
           D++  D  R++++ A+ N +K  +N + ++K  S  +L   P  S   +      G   G
Sbjct: 97  DLQIQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLG 156

Query: 157 SGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRY 216
           SGE+F+ + VGTPP+ F+LI DTGSDL W +C    C  DC + + M             
Sbjct: 157 SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC--LPCY-DCFHQNGMF------------ 216

Query: 217 ALYANQSSSFSPIPCSSRQC-IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANE 276
                 S+SF  I C+  +C +   PD   Q  C + N  C Y Y Y      +G FA E
Sbjct: 217 -YDPKTSASFKNITCNDPRCSLISSPDPPVQ--CESDNQSCPYFYWYGDRSNTTGDFAVE 276

Query: 277 TVTVRLT----NGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFVYKAAENNI 336
           T TV LT       E ++ +++FGC           GA GL+GLG    SF     ++  
Sbjct: 277 TFTVNLTTTEGGSSEYKVGNMMFGCGHWNR--GLFSGASGLLGLGRGPLSF-SSQLQSLY 336

Query: 337 GGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYS--CYYG 396
           G  FSYCL D N NT   S  +FG           +        T    G + S   +Y 
Sbjct: 337 GHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLN-------FTSFVNGKENSVETFYY 396

Query: 397 VQLIGISVDDQMLKIPRHVWNIKS--GCGTILDTGTSLTLLTAPAHDAVIEAMAPKIAKF 456
           +Q+  I V  + L IP   WNI S    GTI+D+GT+L+    PA++ +    A K+ + 
Sbjct: 397 IQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE- 456

Query: 457 GRMEKQRNFEL---CFN--DTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVA 516
                 R+F +   CFN    E N    P+LG  F  G V+  P  +  +  +    C+A
Sbjct: 457 -NYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLA 516

Query: 517 ISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 539
           I   P  + +I+GN  QQ +   +D  +  + F P+ CA
Sbjct: 517 ILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525

BLAST of Cp4.1LG05g10050 vs. TAIR 10
Match: AT3G59080.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 168.3 bits (425), Expect = 1.7e-41
Identity = 156/531 (29.38%), Positives = 237/531 (44.63%), Query Frame = 0

Query: 35  FFSPITVAVADQSNANNL------KQESDANNEEQEFVRLDLIHRHHPEVVKRLDDEIKV 94
           F +P+    A  S +N+       K+ +     E + V+  L  R      K   + +  
Sbjct: 43  FPNPMRFGSASSSTSNDCGFSSPEKEPTKERTGENKTVKFHLKRRETTTTEKATTNSV-- 102

Query: 95  DSVEDRIRDIRYHDQNRLRSISAKLNWTKVVENAEEKEKEV-----SGSNLPPQSQTPIG 154
             +E +IRD+    Q   + +  K N   V +  ++ +KEV       S++  Q+   + 
Sbjct: 103 --LELQIRDLT-RIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVA 162

Query: 155 LKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKM 214
                G   GSGE+F+ + VG+PP+ F+LI DTGSDL W +C    C  DC         
Sbjct: 163 -TLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQC--LPCY-DCF-------- 222

Query: 215 RNKMRGRFRYALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTP----NTPCSYTYSY 274
             +  G F        S+S+  I C+ ++C     +L   PD P P    N  C Y Y Y
Sbjct: 223 --QQNGAF---YDPKASASYKNITCNDQRC-----NLVSSPDPPMPCKSDNQSCPYYYWY 282

Query: 275 TGGERASGIFANETVTVRL-TNGKEKQL---KDILFGCTEEVELTNFMKGADGLIGLGSS 334
                 +G FA ET TV L TNG   +L   ++++FGC           GA GL+GLG  
Sbjct: 283 GDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNR--GLFHGAAGLLGLGRG 342

Query: 335 IYSFVYKAAENNIGGGFSYCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKL 394
             SF     ++  G  FSYCL D N +T   S  +FG                P      
Sbjct: 343 PLSF-SSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSH--------PNLNFTS 402

Query: 395 FTGGQ---YSCYYGVQLIGISVDDQMLKIPRHVWNIKS--GCGTILDTGTSLTLLTAPAH 454
           F  G+      +Y VQ+  I V  ++L IP   WNI S    GTI+D+GT+L+    PA+
Sbjct: 403 FVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAY 462

Query: 455 DAVIEAMAPKIAKFGRMEKQRNFEL---CFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSY 514
           + +   +A K AK G+    R+F +   CFN +  +    P+LG  F  GAV+  P  + 
Sbjct: 463 EFIKNKIAEK-AK-GKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENS 522

Query: 515 VVSAATQCSCVAISSLPFPSINILGNIIQQTYFWQFDLLKESVTFAPSDCA 539
            +       C+A+   P  + +I+GN  QQ +   +D  +  + +AP+ CA
Sbjct: 523 FIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533

BLAST of Cp4.1LG05g10050 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 154.1 bits (388), Expect = 3.3e-37
Identity = 113/385 (29.35%), Positives = 175/385 (45.45%), Query Frame = 0

Query: 154 SGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSNLSPMHKMRNKMRGRFRY 213
           SGE+ + + +GTPP     IADTGSDLLWT+C    C    + + P+   +         
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC--APCDDCYTQVDPLFDPKT-------- 146

Query: 214 ALYANQSSSFSPIPCSSRQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANET 273
                 SS++  + CSS QC      L  Q  C T +  CSY+ SY       G  A +T
Sbjct: 147 ------SSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206

Query: 274 VTVRLTNGKEKQLKDILFGCTEEVELTNFMKGADGLIGLGSSIYSFVYKAAENNIGGGFS 333
           +T+  ++ +  QLK+I+ GC        F K   G++GLG    S + K   ++I G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHN-NAGTFNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266

Query: 334 YCLADHNRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGIS 393
           YCL          S   FGT +  + S   S+P+   A+ + F        Y + L  IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETF--------YYLTLKSIS 326

Query: 394 VDDQMLKIPRHVWNIKSGCGTILDTGTSLTLLTAPAHDAVIEAMAPKIAKFGRMEKQRNF 453
           V  + ++          G   I+D+GT+LTLL    +  + +A+A  I    + + Q   
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGL 386

Query: 454 ELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYVVSAATQCSCVAISSLPFPSINILGN 513
            LC++ T       P +  HF+G  V      ++ V  +    C A      PS +I GN
Sbjct: 387 SLCYSAT--GDLKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRG--SPSFSIYGN 435

Query: 514 IIQQTYFWQFDLLKESVTFAPSDCA 539
           + Q  +   +D + ++V+F P+DCA
Sbjct: 447 VAQMNFLVGYDTVSKTVSFKPTDCA 435

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9LTW4 | 5.2e-72 | 35.95 | Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE...
Q766C2 | 1.6e-36 | 30.09 | Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q6XBF8 | 4.6e-36 | 29.35 | Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
Q766C3 | 9.0e-32 | 30.17 | Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LNJ3 | 2.0e-31 | 29.37 | Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...

Match Name | E-value | Identity (%) | Description
XP_023532727.1 | 0.0 | 100.00 | aspartic proteinase NANA, chloroplast-like [Cucurbita pepo subsp. pepo]
XP_022947059.1 | 0.0 | 95.01 | aspartic proteinase NANA, chloroplast-like [Cucurbita moschata]
KAG6605363.1 | 0.0 | 94.61 | Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. so...
XP_023007158.1 | 0.0 | 85.52 | aspartic proteinase NANA, chloroplast-like [Cucurbita maxima]
XP_022947824.1 | 0.0 | 84.41 | aspartic proteinase NANA, chloroplast-like [Cucurbita moschata]

Match Name | E-value | Identity (%) | Description
A0A6J1G5P4 | 0.0 | 95.01 | aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC1...
A0A6J1L6Y2 | 0.0 | 85.52 | aspartic proteinase NANA, chloroplast-like OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A6J1G810 | 0.0 | 84.41 | aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC1...
A0A0A0KG92 | 8.88e-181 | 50.56 | Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G13439...
A0A1S3C2F3 | 6.49e-179 | 50.56 | aspartic proteinase CDR1 OS=Cucumis melo OX=3656 GN=LOC103496268 PE=3 SV=1

Match Name | E-value | Identity (%) | Description
AT3G12700.1 | 3.7e-73 | 35.95 | Eukaryotic aspartyl protease family protein
AT3G25700.1 | 5.5e-53 | 33.33 | Eukaryotic aspartyl protease family protein
AT2G42980.1 | 1.8e-43 | 30.72 | Eukaryotic aspartyl protease family protein
AT3G59080.1 | 1.7e-41 | 29.38 | Eukaryotic aspartyl protease family protein
AT5G33340.1 | 3.3e-37 | 29.35 | Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 44..64
None | No IPR available | PANTHER | PTHR47967:SF69 | ASPARTIC PROTEINASE NANA, CHLOROPLAST | coord: 51..537
None | No IPR available | PANTHER | PTHR47967 | OS07G0603500 PROTEIN-RELATED | coord: 51..537
IPR001461 | Aspartic peptidase A1 family | PRINTS | PR00792 | PEPSIN | coord: 509..524, score: 32.64; coord: 414..425, score: 52.39; coord: 163..183, score: 54.9
IPR032799 | Xylanase inhibitor, C-terminal | PFAM | PF14541 | TAXi_C | coord: 385..533, e-value: 4.8E-26, score: 91.5
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 359..538, e-value: 3.7E-33, score: 116.6
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 144..352, e-value: 1.5E-41, score: 144.4
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 149..537
IPR032861 | Xylanase inhibitor, N-terminal | PFAM | PF14543 | TAXi_N | coord: 157..352, e-value: 1.0E-44, score: 152.9
IPR033121 | Peptidase family A1 domain | PROSITE | PS51767 | PEPTIDASE_A1 | coord: 157..533, score: 33.723354
IPR034161 | Pepsin-like domain, plant | CDD | cd05476 | pepsin_A_like_plant | coord: 156..537, e-value: 3.03689E-67, score: 216.36
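
Each alignment entry above gives a residue range such as `coord: 157..352`. Assuming these coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the length of a matched region is end minus start plus one. A minimal sketch, with `span_length` as a hypothetical helper name:

```python
import re

# Captures the start and end of a "coord: start..end" range.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(entry):
    """Length of a 'coord: start..end' range, assuming inclusive 1-based ends."""
    match = COORD.search(entry)
    if match is None:
        raise ValueError(f"no coordinate range in: {entry!r}")
    start, end = map(int, match.groups())
    return end - start + 1


# The two Pfam matches listed in the table (TAXi_N and TAXi_C).
for name, entry in [("TAXi_N", "coord: 157..352"), ("TAXi_C", "coord: 385..533")]:
    print(name, span_length(entry))
# prints:
# TAXi_N 196
# TAXi_C 149
```

If the coordinates turned out to be half-open instead, the `+ 1` would need to be dropped.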

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG05g10050.1 | Cp4.1LG05g10050.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0110165 | cellular anatomical entity
molecular_function | GO:0004190 | aspartic-type endopeptidase activity