CmoCh02G006780 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh02G006780
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
Descriptionaspartic proteinase NANA, chloroplast-like
LocationCmo_Chr02: 4302239 .. 4303967 (-)
RNA-Seq ExpressionCmoCh02G006780
SyntenyCmoCh02G006780
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGCCAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCGAGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGGTATTAATAGTAATTAATATTATTATTATTATTTTTTTAAATAAAATTTTGGTGGGGACCATTATAACAATAAATGGATGTTGGTTGGTTTGTTTAAAAAAAAAATAACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCTCTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAGAACTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAAT

mRNA sequence

ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGCCAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCGAGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCTCTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAGAACTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAAT

Coding sequence (CDS)

ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGCCAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCGAGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCTCTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAG

Protein sequence

MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
Homology
BLAST of CmoCh02G006780 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 2.7e-73
Identity = 156/427 (36.53%), Postives = 234/427 (54.80%), Query Frame = 0

Query: 98  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTG 157
           +E+    ++K  S  +    S   + +    G D+G++++F ++++GTP +KF ++ DTG
Sbjct: 67  IEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTG 126

Query: 158 SDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDF 217
           S+L W  CRYR    D           NR   R      A++S SF  + C ++ C  D 
Sbjct: 127 SELTWVNCRYRARGKD-----------NRRVFR------ADESKSFKTVGCLTQTCKVDL 186

Query: 218 SELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEI 277
             L     CPTP++PCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   
Sbjct: 187 MNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSF 246

Query: 278 TDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPK 337
           T   F  GADG++GL  S +SF    A +  G  FSYCL DHL N    +Y +FG+    
Sbjct: 247 TGQSF-QGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRST 306

Query: 338 TFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD 397
             A   ++P+        +T  R   +Y + + GIS+   +L+IP  VW+  SG GTILD
Sbjct: 307 KTAFRRTTPLD-------LT--RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILD 366

Query: 398 TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKL 457
           +GTSLT+L   A+  V+  +A  + +  R++ +         + CF+ T  +N   LP+L
Sbjct: 367 SGTSLTLLADAAYKQVVTGLARYLVELKRVKPE-----GVPIEYCFSFTSGFNVSKLPQL 426

Query: 458 GFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSV 517
            FH +GGA FEP  +SY+V A+    C+   S   P+ N++GNI+QQ YLW+FDL+  ++
Sbjct: 427 TFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTL 460

Query: 518 TFAPSDC 524
           +FAPS C
Sbjct: 487 SFAPSAC 460

BLAST of CmoCh02G006780 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 4.8e-38
Identity = 136/448 (30.36%), Postives = 205/448 (45.76%), Query Frame = 0

Query: 85  LRAISVHMNWTK--VVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLK 144
           L  +    N TK  +++ A ++ ++   S N   QS + I    Y     G  E+ + + 
Sbjct: 46  LEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYA----GDGEYLMNVA 105

Query: 145 LGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSS 204
           +GTP   F+ I DTGSDL+WT+C    C    S P+PI   ++              SSS
Sbjct: 106 IGTPDSSFSAIMDTGSDLIWTQC--EPCTQCFSQPTPIFNPQD--------------SSS 165

Query: 205 FSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGK 264
           FS +PC S+ C QD       P     N+ C YTY Y  G    G  ATET T   ++  
Sbjct: 166 FSTLPCESQYC-QDL------PSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSS-- 225

Query: 265 EKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVG-GGFSYCLADHLR 324
              + +I +GC E+       +GA GLIG+G    S       + +G G FSYC+     
Sbjct: 226 ---VPNIAFGCGEDNQGFGQGNGA-GLIGMGWGPLSL-----PSQLGVGQFSYCMTS--- 285

Query: 325 NITAISYFVFGTPSPKTFA---ASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQI 384
                    +G+ SP T A   A++  P G P+TT LI       YY + L GI+V G  
Sbjct: 286 ---------YGSSSPSTLALGSAASGVPEGSPSTT-LIHSSLNPTYYYITLQGITVGGDN 345

Query: 385 LNIPPHVWNIKSG--CGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGERE 444
           L IP   + ++     G I+D+GT+LT L   A++AV +A   +I            E  
Sbjct: 346 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI------NLPTVDESS 405

Query: 445 KNFKLCFND-TQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSIN 504
                CF   +  +   +P++   F+GG V    +++ ++S +    C+A+ S     I+
Sbjct: 406 SGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILISPAEGVICLAMGSSSQLGIS 435

Query: 505 ILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           I GNI QQ     +DL   +V+F P+ C
Sbjct: 466 IFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmoCh02G006780 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.2e-33
Identity = 110/391 (28.13%), Postives = 174/391 (44.50%), Query Frame = 0

Query: 134 SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIY 193
           S E+ + + +GTPP     IADTGSDLLWT+C    C    +   P+   +         
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC--APCDDCYTQVDPLFDPKT-------- 146

Query: 194 ALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATET 253
                 SS++  + CSS QC    + L  Q  C T ++ CSY+ SY       G  A +T
Sbjct: 147 ------SSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206

Query: 254 VTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFS 313
           +T+  ++ +  QLK+I+ GC        F     G++GLG    S + K   +++ G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHN-NAGTFNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266

Query: 314 YCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGIS 373
           YCL          S   FGT +  + +   S+P        LI       +Y + L  IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTP--------LIAKASQETFYYLTLKSIS 326

Query: 374 VDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG 433
           V  + +           G   I+D+GT+LT+L    +  + +A+A  I      + + K 
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSI------DAEKKQ 386

Query: 434 EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPS 493
           + +    LC++ T      +P +  HF+G  V      ++ V  S    C A      PS
Sbjct: 387 DPQSGLSLCYSAT--GDLKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRG--SPS 435

Query: 494 INILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
            +I GN+ Q  +L  +D +  +V+F P+DCA
Sbjct: 447 FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmoCh02G006780 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 1.3e-32
Identity = 119/395 (30.13%), Postives = 172/395 (43.54%), Query Frame = 0

Query: 133 GSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFI 192
           G  E+ + L +GTP Q F+ I DTGSDL+WT+C  + C    +  +PI   +        
Sbjct: 91  GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQC--QPCTQCFNQSTPIFNPQG------- 150

Query: 193 YALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATE 252
                  SSSFS +PCSS+ C     +    P C   N+ C YTY Y  G    G   TE
Sbjct: 151 -------SSSFSTLPCSSQLC-----QALSSPTC--SNNFCQYTYGYGDGSETQGSMGTE 210

Query: 253 TVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF 312
           T+T    +     + +I +GC E        +GA GL+G+G    S   +         F
Sbjct: 211 TLTFGSVS-----IPNITFGCGENNQGFGQGNGA-GLVGMGRGPLSLPSQLDVTK----F 270

Query: 313 SYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGI 372
           SYC+       T I      TPS     +  +S       T LI   +   +Y + L G+
Sbjct: 271 SYCM-------TPIG---SSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGL 330

Query: 373 SVDGQILNIPPHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMER 432
           SV    L I P  + + S  GT   I+D+GT+LT     A+ +V      + E   ++  
Sbjct: 331 SVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSV------RQEFISQINL 390

Query: 433 DVKGEREKNFKLCFNDTQWNFGM-LPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITS 492
            V       F LCF        + +P    HF+GG + E P  +Y +S S    C+A+ S
Sbjct: 391 PVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDL-ELPSENYFISPSNGLICLAMGS 434

Query: 493 LPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
                ++I GNI QQ  L  +D     V+FA + C
Sbjct: 451 -SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmoCh02G006780 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 1.5e-31
Identity = 118/399 (29.57%), Postives = 171/399 (42.86%), Query Frame = 0

Query: 129 GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR 188
           G   GS E+F +L +GTP +   M+ DTGSD++W +C    CR   S   PI   R    
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPR---- 193

Query: 189 ERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGI 248
                     +S +++ IPCSS  C +  S       C T    C Y  SY  G   +G 
Sbjct: 194 ----------KSKTYATIPCSSPHCRRLDS-----AGCNTRRKTCLYQVSYGDGSFTVGD 253

Query: 249 FATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNV 308
           F+TET+T R       ++K +  GC  +  +     GA GL+GLG    SF  +   +  
Sbjct: 254 FSTETLTFR-----RNRVKGVALGCGHD--NEGLFVGAAGLLGLGKGKLSFPGQTG-HRF 313

Query: 309 GGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQ 368
              FSYCL D   +           PS   F  +  S I     T L++  +   +Y V 
Sbjct: 314 NQKFSYCLVDRSAS---------SKPSSVVFGNAAVSRIA--RFTPLLSNPKLDTFYYVG 373

Query: 369 LAGISVDG-QILNIPPHVWNIK--SGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFG 428
           L GISV G ++  +   ++ +      G I+D+GTS+T L  PA+ A+ +A     +   
Sbjct: 374 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLK 433

Query: 429 RMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIA 488
           R            F  CF+ +  N   +P +  HF G  V  P     I   +    C A
Sbjct: 434 R------APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFA 485

Query: 489 ITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
                   ++I+GNI QQ +   +DL    V FAP  CA
Sbjct: 494 FAG-TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmoCh02G006780 vs. ExPASy TrEMBL
Match: A0A6J1G810 (aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC111451585 PE=3 SV=1)

HSP 1 Score: 1072.8 bits (2773), Expect = 3.9e-310
Identity = 524/524 (100.00%), Postives = 524/524 (100.00%), Query Frame = 0

Query: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60
           MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR
Sbjct: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60

Query: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120
           LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT
Sbjct: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120

Query: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180
           PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI
Sbjct: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180

Query: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240
           HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL
Sbjct: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240

Query: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300
           SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
Sbjct: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300

Query: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360
           YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR
Sbjct: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360

Query: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420
           YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK
Sbjct: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420

Query: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480
           IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ
Sbjct: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480

Query: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
Sbjct: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 524

BLAST of CmoCh02G006780 vs. ExPASy TrEMBL
Match: A0A6J1L6Y2 (aspartic proteinase NANA, chloroplast-like OS=Cucurbita maxima OX=3661 GN=LOC111499734 PE=3 SV=1)

HSP 1 Score: 1019.2 bits (2634), Expect = 6.0e-294
Identity = 498/526 (94.68%), Postives = 513/526 (97.53%), Query Frame = 0

Query: 1   MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVV 60
           MS ISHLLILFFV  FFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVV
Sbjct: 1   MSSISHLLILFFVVFFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 60

Query: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQS 120
           KRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS H+NWTKVVENAEEK  KEAS SN PP S
Sbjct: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEK-VKEASGSNHPPHS 120

Query: 121 QTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 180
           QTPIALKTYPGADFGSSEFFVQLK+GTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS
Sbjct: 121 QTPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 180

Query: 181 PIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYS 240
           PIHKMRN+MRERF YALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPN+PCSYTYS
Sbjct: 181 PIHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYS 240

Query: 241 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS 300
           YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEE+TDSQFLDGADGLIGLGSSIYS
Sbjct: 241 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYS 300

Query: 301 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITG 360
           FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTF+ASTSSPIGPPATT+L TG
Sbjct: 301 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTG 360

Query: 361 GRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 420
           GRYSCYYGVQL+GISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA
Sbjct: 361 GRYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 420

Query: 421 PKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSAS 480
           PKIEKFGRME+DVKGEREKNFKLCFNDT+WNFGMLPKLGFHFE GAVFEPPDRSYIVSAS
Sbjct: 421 PKIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSAS 480

Query: 481 YQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           YQCSCIAITSLPFPSINILGNIIQQT++W++DLLKGSVTFAPSDCA
Sbjct: 481 YQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 525

BLAST of CmoCh02G006780 vs. ExPASy TrEMBL
Match: A0A6J1G5P4 (aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC111451044 PE=3 SV=1)

HSP 1 Score: 922.2 bits (2382), Expect = 1.0e-264
Identity = 453/528 (85.80%), Postives = 482/528 (91.29%), Query Frame = 0

Query: 1   MSPISHLLIL----FFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPE 60
           MSPISHLLIL     FVFFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPE
Sbjct: 1   MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPE 60

Query: 61  VVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPP 120
           VVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAIS H+NWTKVVENAEEKE KE S SNL  
Sbjct: 61  VVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKE-KEVSGSNL-- 120

Query: 121 QSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSN 180
            SQTPI LKTYPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+
Sbjct: 121 -SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSH 180

Query: 181 PSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYT 240
            SP+HKMRN+MR RF YALYANQSSSFSPIPCSSKQCI DF +LGGQPDCPTPN+PCSYT
Sbjct: 181 LSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYT 240

Query: 241 YSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSI 300
           YSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSI
Sbjct: 241 YSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSI 300

Query: 301 YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLI 360
           YSFVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTF+A+TSSPIGPPATT+L 
Sbjct: 301 YSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLF 360

Query: 361 TGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEA 420
           TGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEA
Sbjct: 361 TGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEA 420

Query: 421 MAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVS 480
           MAPKI KFGRM      E+++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSYIVS
Sbjct: 421 MAPKIAKFGRM------EKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVS 480

Query: 481 ASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           ASYQCSCIAITSLPFPSINILGNIIQQTY WQFDLLKGSVTFAPSDCA
Sbjct: 481 ASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA 518

BLAST of CmoCh02G006780 vs. ExPASy TrEMBL
Match: A0A1S3C2F3 (aspartic proteinase CDR1 OS=Cucumis melo OX=3656 GN=LOC103496268 PE=3 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 4.6e-145
Identity = 274/538 (50.93%), Postives = 360/538 (66.91%), Query Frame = 0

Query: 1   MSPISH-----LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHP 60
           MSPIS+     LL+ F  F S    A+ D++N  N   + D    E++ +R DL+HRHHP
Sbjct: 8   MSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDED----EQQTIRFDLLHRHHP 67

Query: 61  EVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEAS----- 120
           +V ++L+ ++K+  + +R+KDI  HD++R R+IS  MN  K +E+A  + + EA+     
Sbjct: 68  QVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMN-QKQIEDARLRAEAEAATQVEV 127

Query: 121 --SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYR 180
             S+ LPP + TPI +K   GADFGSSE+FVQLK+GTP Q F +IADTGSDL W +CRYR
Sbjct: 128 AKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYR 187

Query: 181 RCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPT 240
           RC G+CS  +  HK +N  ++RF +AL ANQSS+F  + CSS  C  + +EL    +C T
Sbjct: 188 RCFGNCSG-NVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDT 247

Query: 241 PNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADG 300
           P SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL++ + GCT EI      DGADG
Sbjct: 248 PTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCT-EIVQGNVFDGADG 307

Query: 301 LIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIG 360
           ++GLG+S YS  YKAAEN  GGGFSYCL DHL +  A+SYFV G P+P T A+++S+   
Sbjct: 308 VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAK-- 367

Query: 361 PPAT---TRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTML 420
           PPA    T+L  G  YS +YGV L GIS DGQ+LNIPP VW+   GCGTI+D+GTSLT+L
Sbjct: 368 PPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVL 427

Query: 421 TAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAV 480
             PA D V+E +  ++++F ++E +        F  CFN++Q+   M PKL FHF  G V
Sbjct: 428 ATPAFDVVMEVLTSRLKQFQQIEIE-------PFNFCFNNSQYTHDMAPKLRFHFGDGTV 487

Query: 481 FEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           FEPP +SYIVS     SCI I S+PFPS+NI+GNI+QQ +LWQFD  K  V FA S+C
Sbjct: 488 FEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC 529

BLAST of CmoCh02G006780 vs. ExPASy TrEMBL
Match: A0A0A0KG92 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G134390 PE=3 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 9.7e-143
Identity = 270/539 (50.09%), Postives = 356/539 (66.05%), Query Frame = 0

Query: 1   MSPISH--------LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHR 60
           MSPIS+        LL  F  F S    A+ D+ N  N     + + +E+E ++ DL+HR
Sbjct: 8   MSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHR 67

Query: 61  HHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEK-------E 120
           HHP+V +++H ++K+  + +R+KDI  HD +R R+IS  MN  K VE+A  +       E
Sbjct: 68  HHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMN-QKQVEDARLRAEAEAATE 127

Query: 121 KKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRC 180
           ++ A S+ LPP + TPI ++   GADFGSSE+FV+LK+GTP Q F +IADTGSDL W +C
Sbjct: 128 EEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKC 187

Query: 181 RYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD 240
           RYRRC G+CS+ +  HK +N  ++RF +A  AN SSSF  + CSS  C  D ++L    +
Sbjct: 188 RYRRCFGNCSS-NVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVRE 247

Query: 241 CPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDG 300
           C  P SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL + + GCTE +  S F  G
Sbjct: 248 CHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVF-GG 307

Query: 301 ADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSS 360
           ADG++GLG+S YS  YKAAEN  GGGFSYCL DHL +  AISYFV G P+P T A+++S+
Sbjct: 308 ADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSA 367

Query: 361 PIGPPAT-TRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 420
            +    T T+L  G  YS +YGV L GIS +G +LNIP  VW+I SG GTI+D+GTSLT+
Sbjct: 368 KLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTI 427

Query: 421 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 480
           L APA D V+EA+ P+++KF ++E +        F  CFN++Q+   M PKL FHF  G 
Sbjct: 428 LAAPAFDMVMEALTPRLKKFQQLEIE-------PFDFCFNNSQYTHEMAPKLRFHFGDGT 487

Query: 481 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           VFEPP +SYIVS     SCI   S+PFP+ NI+GNI+QQ +LWQFD  K  V FAPS+C
Sbjct: 488 VFEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC 536

BLAST of CmoCh02G006780 vs. NCBI nr
Match: XP_022947824.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita moschata])

HSP 1 Score: 1072.8 bits (2773), Expect = 8.0e-310
Identity = 524/524 (100.00%), Postives = 524/524 (100.00%), Query Frame = 0

Query: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60
           MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR
Sbjct: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60

Query: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120
           LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT
Sbjct: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120

Query: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180
           PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI
Sbjct: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180

Query: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240
           HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL
Sbjct: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240

Query: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300
           SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
Sbjct: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300

Query: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360
           YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR
Sbjct: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360

Query: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420
           YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK
Sbjct: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420

Query: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480
           IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ
Sbjct: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480

Query: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
Sbjct: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 524

BLAST of CmoCh02G006780 vs. NCBI nr
Match: KAG6605377.1 (Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1052.0 bits (2719), Expect = 1.7e-303
Identity = 512/524 (97.71%), Postives = 518/524 (98.85%), Query Frame = 0

Query: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60
           MSPISHLLILFFVF SPLTVAVA+ SNANNPKQESDANNEE+EFVRLDL+HRHHPEVVKR
Sbjct: 1   MSPISHLLILFFVFVSPLTVAVANLSNANNPKQESDANNEEQEFVRLDLVHRHHPEVVKR 60

Query: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120
           LHDEIKVDKMEDRIKDIRYHDQSRLRAISVH+NWTKVVENAEEKEKKEASSSNLPP SQT
Sbjct: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHLNWTKVVENAEEKEKKEASSSNLPPHSQT 120

Query: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180
           PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI
Sbjct: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180

Query: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240
           HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPN PCSYTYSYL
Sbjct: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNYPCSYTYSYL 240

Query: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300
           SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
Sbjct: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300

Query: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360
           YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTF ASTS+PIGPPATTRLITGGR
Sbjct: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFTASTSTPIGPPATTRLITGGR 360

Query: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420
           YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK
Sbjct: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420

Query: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480
           IEKFGRMERD +GEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ
Sbjct: 421 IEKFGRMERDERGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480

Query: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
Sbjct: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 524

BLAST of CmoCh02G006780 vs. NCBI nr
Match: XP_023533886.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1030.0 bits (2662), Expect = 7.0e-297
Identity = 498/524 (95.04%), Postives = 512/524 (97.71%), Query Frame = 0

Query: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60
           MSPISHLLILFFVFFSPLTVA ADQSNANNPKQESDANNEE+E VRLDLIHRHHPEVVKR
Sbjct: 1   MSPISHLLILFFVFFSPLTVAFADQSNANNPKQESDANNEEQEIVRLDLIHRHHPEVVKR 60

Query: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120
           LHDEIKVDKMEDRIKDIRYHDQSRLRAIS H+NWTKVVENAEEKEKKEAS SNLPPQSQ+
Sbjct: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEKEKKEASGSNLPPQSQS 120

Query: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180
           PIALKTYPGAD+GSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI
Sbjct: 121 PIALKTYPGADYGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180

Query: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240
           HKMRNRMR+RFIYALYANQSSSFSPIPCSS+QCIQDFSELGGQPDCPTPNSPCSYTYSYL
Sbjct: 181 HKMRNRMRDRFIYALYANQSSSFSPIPCSSEQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240

Query: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300
           SGD AMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDS+FLDGADGLIGLGSSIYSFV
Sbjct: 241 SGDCAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSKFLDGADGLIGLGSSIYSFV 300

Query: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360
           YKAAENN+GGGFSYCLADH+R+ITAISYFVFGTPSPKTFAASTS+PIGPPATT+LITGGR
Sbjct: 301 YKAAENNIGGGFSYCLADHIRSITAISYFVFGTPSPKTFAASTSTPIGPPATTKLITGGR 360

Query: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420
           YSCYYGVQLAGISVDGQILNIPPHVWNI SGCGTILDTGTSLTMLTAPAHDAVIEAMAPK
Sbjct: 361 YSCYYGVQLAGISVDGQILNIPPHVWNINSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420

Query: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480
           IEKFGRMERDVKGEREKNFKLCFNDT+W FGM PKLGFHFEGG VFEPPDRSY+V AS Q
Sbjct: 421 IEKFGRMERDVKGEREKNFKLCFNDTEWRFGMTPKLGFHFEGGVVFEPPDRSYVVPASEQ 480

Query: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           CSCIAITSLPFPSINILGNIIQQTYLWQFDL KGSVTFAPSDCA
Sbjct: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLHKGSVTFAPSDCA 524

BLAST of CmoCh02G006780 vs. NCBI nr
Match: XP_023007158.1 (aspartic proteinase NANA, chloroplast-like [Cucurbita maxima])

HSP 1 Score: 1019.2 bits (2634), Expect = 1.2e-293
Identity = 498/526 (94.68%), Postives = 513/526 (97.53%), Query Frame = 0

Query: 1   MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVV 60
           MS ISHLLILFFV  FFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVV
Sbjct: 1   MSSISHLLILFFVVFFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVV 60

Query: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQS 120
           KRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS H+NWTKVVENAEEK  KEAS SN PP S
Sbjct: 61  KRLHDEIKVDKMEDRIKDIRYHDQSRLRAISAHLNWTKVVENAEEK-VKEASGSNHPPHS 120

Query: 121 QTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 180
           QTPIALKTYPGADFGSSEFFVQLK+GTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS
Sbjct: 121 QTPIALKTYPGADFGSSEFFVQLKVGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPS 180

Query: 181 PIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYS 240
           PIHKMRN+MRERF YALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPN+PCSYTYS
Sbjct: 181 PIHKMRNKMRERFNYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNTPCSYTYS 240

Query: 241 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS 300
           YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEE+TDSQFLDGADGLIGLGSSIYS
Sbjct: 241 YLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEMTDSQFLDGADGLIGLGSSIYS 300

Query: 301 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITG 360
           FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTF+ASTSSPIGPPATT+L TG
Sbjct: 301 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFSASTSSPIGPPATTKLFTG 360

Query: 361 GRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 420
           GRYSCYYGVQL+GISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA
Sbjct: 361 GRYSCYYGVQLSGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMA 420

Query: 421 PKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSAS 480
           PKIEKFGRME+DVKGEREKNFKLCFNDT+WNFGMLPKLGFHFE GAVFEPPDRSYIVSAS
Sbjct: 421 PKIEKFGRMEKDVKGEREKNFKLCFNDTEWNFGMLPKLGFHFEDGAVFEPPDRSYIVSAS 480

Query: 481 YQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           YQCSCIAITSLPFPSINILGNIIQQT++W++DLLKGSVTFAPSDCA
Sbjct: 481 YQCSCIAITSLPFPSINILGNIIQQTFIWKYDLLKGSVTFAPSDCA 525

BLAST of CmoCh02G006780 vs. NCBI nr
Match: KAG6605363.1 (Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 928.7 bits (2399), Expect = 2.2e-266
Identity = 453/524 (86.45%), Postives = 483/524 (92.18%), Query Frame = 0

Query: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKR 60
           MSPISHLLILFFVFFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKR
Sbjct: 1   MSPISHLLILFFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKR 60

Query: 61  LHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQT 120
           + DEIKVD +EDRIKDIRYHDQ+RLRAIS H+NWTKVVENAEEKE KE S SNL   SQT
Sbjct: 61  IDDEIKVDTVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKE-KEVSGSNL---SQT 120

Query: 121 PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPI 180
           PI LK YPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+
Sbjct: 121 PIGLKIYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPM 180

Query: 181 HKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYL 240
           HKMRN+MR RF YALYANQSSSFSPIPCSSKQCI DF +LGGQPDCPTPN+PCSYTYSY 
Sbjct: 181 HKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYT 240

Query: 241 SGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV 300
            G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIYSFV
Sbjct: 241 GGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTNFMKGADGLIGLGSSIYSFV 300

Query: 301 YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGR 360
           YKAAENN+GGGFSYCLADH RNITAISYFVFGTPSPKTF+A+TSSPIGPP+TT+L TGG+
Sbjct: 301 YKAAENNIGGGFSYCLADHHRNITAISYFVFGTPSPKTFSATTSSPIGPPSTTKLFTGGQ 360

Query: 361 YSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420
           YSCYYGVQL GISVD QILNIP HVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK
Sbjct: 361 YSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPK 420

Query: 421 IEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQ 480
           I KFGRM      E+++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSYIVSASYQ
Sbjct: 421 IAKFGRM------EKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQ 480

Query: 481 CSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           CSCIAITSLPFPSINILGNIIQQTY WQFDLLKGSVTFAPSDCA
Sbjct: 481 CSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA 514

BLAST of CmoCh02G006780 vs. TAIR 10
Match: AT3G12700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 277.7 bits (709), Expect = 1.9e-74
Identity = 156/427 (36.53%), Postives = 234/427 (54.80%), Query Frame = 0

Query: 98  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTG 157
           +E+    ++K  S  +    S   + +    G D+G++++F ++++GTP +KF ++ DTG
Sbjct: 67  IEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTG 126

Query: 158 SDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDF 217
           S+L W  CRYR    D           NR   R      A++S SF  + C ++ C  D 
Sbjct: 127 SELTWVNCRYRARGKD-----------NRRVFR------ADESKSFKTVGCLTQTCKVDL 186

Query: 218 SELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEI 277
             L     CPTP++PCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   
Sbjct: 187 MNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSF 246

Query: 278 TDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPK 337
           T   F  GADG++GL  S +SF    A +  G  FSYCL DHL N    +Y +FG+    
Sbjct: 247 TGQSF-QGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRST 306

Query: 338 TFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD 397
             A   ++P+        +T  R   +Y + + GIS+   +L+IP  VW+  SG GTILD
Sbjct: 307 KTAFRRTTPLD-------LT--RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILD 366

Query: 398 TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKL 457
           +GTSLT+L   A+  V+  +A  + +  R++ +         + CF+ T  +N   LP+L
Sbjct: 367 SGTSLTLLADAAYKQVVTGLARYLVELKRVKPE-----GVPIEYCFSFTSGFNVSKLPQL 426

Query: 458 GFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSV 517
            FH +GGA FEP  +SY+V A+    C+   S   P+ N++GNI+QQ YLW+FDL+  ++
Sbjct: 427 TFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTL 460

Query: 518 TFAPSDC 524
           +FAPS C
Sbjct: 487 SFAPSAC 460

BLAST of CmoCh02G006780 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 208.4 bits (529), Expect = 1.4e-53
Identity = 137/411 (33.33%), Postives = 201/411 (48.91%), Query Frame = 0

Query: 129 GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR 188
           GA  GS ++FV L++G PPQ   +IADTGSDL+W +C    CR +CS+ SP         
Sbjct: 76  GAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKC--SACR-NCSHHSP--------- 135

Query: 189 ERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD-CPTPN-----SPCSYTYSYLSG 248
                  +   SS+FSP  C    C      L  +PD  P  N     S C Y Y Y  G
Sbjct: 136 ---ATVFFPRHSSTFSPAHCYDPVC-----RLVPKPDRAPICNHTRIHSTCHYEYGYADG 195

Query: 249 DRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQF----LDGADGLIGLGSSIYS 308
               G+FA ET +++ ++GKE +LK + +GC   I+         +GA+G++GLG    S
Sbjct: 196 SLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPIS 255

Query: 309 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITG 368
           F  +      G  FSYCL D+  +    SY + G         +    I     T L+T 
Sbjct: 256 FASQLG-RRFGNKFSYCLMDYTLSPPPTSYLIIG---------NGGDGISKLFFTPLLTN 315

Query: 369 GRYSCYYGVQLAGISVDGQILNIPPHVWNI--KSGCGTILDTGTSLTMLTAPAHDAVIEA 428
                +Y V+L  + V+G  L I P +W I      GT++D+GT+L  L  PA+ +VI A
Sbjct: 316 PLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAA 375

Query: 429 MAPKIEKFGRMERDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYI 488
           +        R++  +       F LC N         +LP+L F F GGAVF PP R+Y 
Sbjct: 376 VR------RRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYF 435

Query: 489 VSASYQCSCIAITSL-PFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           +    Q  C+AI S+ P    +++GN++QQ +L++FD  +  + F+   CA
Sbjct: 436 IETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450

BLAST of CmoCh02G006780 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 169.9 bits (429), Expect = 5.6e-42
Identity = 146/518 (28.19%), Postives = 226/518 (43.63%), Query Frame = 0

Query: 23  ADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQ 82
           A  S +N+    S  ++  KE  R  +      +   R+  E K  +    + D++  D 
Sbjct: 52  ASSSTSNDCGFSSKEHDPSKEHTRESV------KPQSRIKQETK--RTTHSVVDLQIQDL 111

Query: 83  SRLRAISVHMNWTKVVENAEEKEK--KEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQ 142
           +R++ +    N +K  +N + ++K   + S    P  S   +      G   GS E+F+ 
Sbjct: 112 TRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMD 171

Query: 143 LKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS 202
           + +GTPP+ F++I DTGSDL W +C    C  DC + + +                   S
Sbjct: 172 VLVGTPPKHFSLILDTGSDLNWLQC--LPCY-DCFHQNGMF-------------YDPKTS 231

Query: 203 SSFSPIPCSSKQCIQDFSELGGQPD----CPTPNSPCSYTYSYLSGDRAMGIFATETVTV 262
           +SF  I C+  +C      L   PD    C + N  C Y Y Y       G FA ET TV
Sbjct: 232 ASFKNITCNDPRC-----SLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV 291

Query: 263 RLT----NGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF 322
            LT       E ++ ++++GC     +     GA GL+GLG    SF     ++  G  F
Sbjct: 292 NLTTTEGGSSEYKVGNMMFGCGH--WNRGLFSGASGLLGLGRGPLSF-SSQLQSLYGHSF 351

Query: 323 SYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYS--CYYGVQLA 382
           SYCL D   N    S  +FG    K     T+        T  + G   S   +Y +Q+ 
Sbjct: 352 SYCLVDRNSNTNVSSKLIFG--EDKDLLNHTNLNF-----TSFVNGKENSVETFYYIQIK 411

Query: 383 GISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRME 442
            I V G+ L+IP   WNI S    GTI+D+GT+L+    PA++ +    A K+++   + 
Sbjct: 412 SILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIF 471

Query: 443 RDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAI 502
           RD           CFN    + N   LP+LG  F  G V+  P  +  +  S    C+AI
Sbjct: 472 RDF-----PVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAI 525

Query: 503 TSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
              P  + +I+GN  QQ +   +D  +  + F P+ CA
Sbjct: 532 LGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525

BLAST of CmoCh02G006780 vs. TAIR 10
Match: AT3G59080.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 162.9 bits (411), Expect = 6.8e-40
Identity = 145/530 (27.36%), Postives = 231/530 (43.58%), Query Frame = 0

Query: 14  FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDR 73
           F +P+    A  S +N+    S      KE    +   + H   +KR           + 
Sbjct: 43  FPNPMRFGSASSSTSNDCGFSSPEKEPTKERTGENKTVKFH---LKRRETTTTEKATTNS 102

Query: 74  IKDIRYHDQSRLRAIS---VHMNWTKVVENAEEKEKKEA-----SSSNLPPQSQTPIALK 133
           + +++  D +R++ +    +  N    V   ++K  KE       +S++  Q+   +A  
Sbjct: 103 VLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVA-T 162

Query: 134 TYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRN 193
              G   GS E+F+ + +G+PP+ F++I DTGSDL W +C    C  DC   +       
Sbjct: 163 LESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQC--LPCY-DCFQQNG------ 222

Query: 194 RMRERFIYALY-ANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTP----NSPCSYTYSYL 253
                   A Y    S+S+  I C+ ++C      L   PD P P    N  C Y Y Y 
Sbjct: 223 --------AFYDPKASASYKNITCNDQRC-----NLVSSPDPPMPCKSDNQSCPYYYWYG 282

Query: 254 SGDRAMGIFATETVTVRL-TNGKEKQL---KDILYGCTEEITDSQFLDGADGLIGLGSSI 313
                 G FA ET TV L TNG   +L   +++++GC     +     GA GL+GLG   
Sbjct: 283 DSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGP 342

Query: 314 YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLI 373
            SF     ++  G  FSYCL D   +    S  +FG         + +          L+
Sbjct: 343 LSF-SSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLV 402

Query: 374 TGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAPAHDAVI 433
                  +Y VQ+  I V G++LNIP   WNI S    GTI+D+GT+L+    PA++ + 
Sbjct: 403 -----DTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIK 462

Query: 434 EAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYI 493
             +A K +    + RD           CFN +  +   LP+LG  F  GAV+  P  +  
Sbjct: 463 NKIAEKAKGKYPVYRDF-----PILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSF 522

Query: 494 VSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           +  +    C+A+   P  + +I+GN  QQ +   +D  +  + +AP+ CA
Sbjct: 523 IWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533

BLAST of CmoCh02G006780 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 146.0 bits (367), Expect = 8.6e-35
Identity = 110/391 (28.13%), Postives = 174/391 (44.50%), Query Frame = 0

Query: 134 SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIY 193
           S E+ + + +GTPP     IADTGSDLLWT+C    C    +   P+   +         
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC--APCDDCYTQVDPLFDPKT-------- 146

Query: 194 ALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATET 253
                 SS++  + CSS QC    + L  Q  C T ++ CSY+ SY       G  A +T
Sbjct: 147 ------SSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206

Query: 254 VTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFS 313
           +T+  ++ +  QLK+I+ GC        F     G++GLG    S + K   +++ G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHN-NAGTFNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266

Query: 314 YCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGIS 373
           YCL          S   FGT +  + +   S+P        LI       +Y + L  IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTP--------LIAKASQETFYYLTLKSIS 326

Query: 374 VDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG 433
           V  + +           G   I+D+GT+LT+L    +  + +A+A  I      + + K 
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSI------DAEKKQ 386

Query: 434 EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPS 493
           + +    LC++ T      +P +  HF+G  V      ++ V  S    C A      PS
Sbjct: 387 DPQSGLSLCYSAT--GDLKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRG--SPS 435

Query: 494 INILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
            +I GN+ Q  +L  +D +  +V+F P+DCA
Sbjct: 447 FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LTW42.7e-7336.53Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE... [more]
Q766C24.8e-3830.36Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q6XBF81.2e-3328.13Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q766C31.3e-3230.13Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LNJ31.5e-3129.57Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A6J1G8103.9e-310100.00aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC1... [more]
A0A6J1L6Y26.0e-29494.68aspartic proteinase NANA, chloroplast-like OS=Cucurbita maxima OX=3661 GN=LOC111... [more]
A0A6J1G5P41.0e-26485.80aspartic proteinase NANA, chloroplast-like OS=Cucurbita moschata OX=3662 GN=LOC1... [more]
A0A1S3C2F34.6e-14550.93aspartic proteinase CDR1 OS=Cucumis melo OX=3656 GN=LOC103496268 PE=3 SV=1[more]
A0A0A0KG929.7e-14350.09Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G13439... [more]
Match NameE-valueIdentityDescription
XP_022947824.18.0e-310100.00aspartic proteinase NANA, chloroplast-like [Cucurbita moschata][more]
KAG6605377.11.7e-30397.71Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. so... [more]
XP_023533886.17.0e-29795.04aspartic proteinase NANA, chloroplast-like [Cucurbita pepo subsp. pepo][more]
XP_023007158.11.2e-29394.68aspartic proteinase NANA, chloroplast-like [Cucurbita maxima][more]
KAG6605363.12.2e-26686.45Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. so... [more]
Match NameE-valueIdentityDescription
AT3G12700.11.9e-7436.53Eukaryotic aspartyl protease family protein [more]
AT3G25700.11.4e-5333.33Eukaryotic aspartyl protease family protein [more]
AT2G42980.15.6e-4228.19Eukaryotic aspartyl protease family protein [more]
AT3G59080.16.8e-4027.36Eukaryotic aspartyl protease family protein [more]
AT5G33340.18.6e-3528.13Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 495..510
score: 31.13
coord: 394..405
score: 51.54
coord: 143..163
score: 53.32
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 349..524
e-value: 2.0E-34
score: 120.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 119..332
e-value: 2.2E-39
score: 137.4
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 129..523
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 365..519
e-value: 4.0E-26
score: 91.7
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 137..332
e-value: 1.1E-41
score: 142.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 101..122
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 28..523
NoneNo IPR availablePANTHERPTHR47967:SF69ASPARTIC PROTEINASE NANA, CHLOROPLASTcoord: 28..523
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 137..519
score: 33.669682
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 136..523
e-value: 3.03061E-65
score: 210.967

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh02G006780.1CmoCh02G006780.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0110165 cellular anatomical entity
molecular_function GO:0004190 aspartic-type endopeptidase activity