CmoCh04G006310 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G006310
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Eukaryotic aspartyl protease family protein, putative
Location: Cmo_Chr04 : 3126628 .. 3127941 (-)
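
The location field can be turned into a feature length with a few lines of code. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and the arithmetic assumes 1-based, inclusive coordinates (GFF-style), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.match(r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (GFF-style).
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr04 : 3126628 .. 3127941 (-)")
length = span_length(start, end)
# Prints the sequence ID, strand, span length, and whether the span is a multiple of 3.
print(seqid, strand, length, length % 3 == 0)
```

Under that assumption the span is 1314 bp, a whole number of codons, which can be compared against the length of the sequences listed for this feature.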
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCCATTTCAATTTTCTTCTGTTTGTTCCTCATTTCCTTCTCCCAAGCAACCGCTCATGGGGGCGTTGGCGGCGGCGGCCATGGCTTCACCACCTCTCTCTTCCACCGCGATTCTTTTCTCTCCCCTCTCTATAACCCATCTCTCTCCCACTATGACCGACTCACTAACGCCTTCCGCCGCTCCTTCTCCCGCTCCGACACCCTCCTCAACCGCGCTGCCGCCGTCTCCATCACCGGCATCCATTCCAGGATCATCCCCGACGACGGCGAGTTCCTAATGTCCATCTCCATCGGAACCCCGCGGGTCAAAATCATGGCCATCGCTGATACCGGCAGCGACCTGACGTGGACCCAGTGCATGCCATGTCACAAATGCTTCAACCAATCATTTCCCATTTTTAATCCACGTCGATCCTTCTCCTACCGCCACGTGTCTTGCACTTCCAATGCTTGCCGCTCTCTCGACGACTATCGCTGTGGGCCCGACAACCGAACCTGCAGCTACGGTTATAGCTACGGAGACCAATCCTTTACGTATGGCGACCTGGCATCTGAGAAAATTACTATTGGATCCTTCAAACTCTACAAGACACTTATTGGATGTGGCCATGTGAACGGCGGCACTTTCAGCGAAGATACCTCAGGAATTATCGGACTCGGCGGCGGTCCTCTCTCTTTAATCTCTCAAATGAGAAAAATCGCCGCCGTCAAACGGCGGTTCTCGTATTGCTTGCCGACATTCTTCAGTGACAAGAATGTAACAGGCAAAATAAGCTTCGGCAAAAAAGCCATTGTTTCAGGGCGAAAAGTCGTTTCTACCCCTCTCGTGTTAAAAGAACCCAATACCTTCTATTACCTAACTCTCGAAGCAATGTCCGTTGCAAACAAGCGGTTTAAGGCCGCGAACAACATGTCAACGGCCGTAGAACAAGGGAATATCCTTATCGATTCCGGTACAACATTGACGATTCTACCCCAGAATTTGTACAAAGGTGTTGCTTCGACATTGGCACATGTTGTTAAAGCGAAGCGAGTGAATGATCCGACTGGAGTTTTGGATCTCTGCTTCGCCGCATGCAGCGTTGATCATTTGAATATTCCAGTCATTACAGCACATTTTGCCGGCAACGCCGACGTGAAATTGTTACCGTTGAATACATTTGCAATGGTGGCTGATAATGTGGCTTGTTTGGCTTTCGTGCCGTCGGCGAACTTTGCCATTTTTGGAAACTTAGCACAGGTGAACTTTTTGGTCGGATACGATCTCGAGCGCAAGAGATTGTCGTTCAAATACAACGTTTGTGCTTAA

mRNA sequence

ATGGCTGCCATTTCAATTTTCTTCTGTTTGTTCCTCATTTCCTTCTCCCAAGCAACCGCTCATGGGGGCGTTGGCGGCGGCGGCCATGGCTTCACCACCTCTCTCTTCCACCGCGATTCTTTTCTCTCCCCTCTCTATAACCCATCTCTCTCCCACTATGACCGACTCACTAACGCCTTCCGCCGCTCCTTCTCCCGCTCCGACACCCTCCTCAACCGCGCTGCCGCCGTCTCCATCACCGGCATCCATTCCAGGATCATCCCCGACGACGGCGAGTTCCTAATGTCCATCTCCATCGGAACCCCGCGGGTCAAAATCATGGCCATCGCTGATACCGGCAGCGACCTGACGTGGACCCAGTGCATGCCATGTCACAAATGCTTCAACCAATCATTTCCCATTTTTAATCCACGTCGATCCTTCTCCTACCGCCACGTGTCTTGCACTTCCAATGCTTGCCGCTCTCTCGACGACTATCGCTGTGGGCCCGACAACCGAACCTGCAGCTACGGTTATAGCTACGGAGACCAATCCTTTACGTATGGCGACCTGGCATCTGAGAAAATTACTATTGGATCCTTCAAACTCTACAAGACACTTATTGGATGTGGCCATGTGAACGGCGGCACTTTCAGCGAAGATACCTCAGGAATTATCGGACTCGGCGGCGGTCCTCTCTCTTTAATCTCTCAAATGAGAAAAATCGCCGCCGTCAAACGGCGGTTCTCGTATTGCTTGCCGACATTCTTCAGTGACAAGAATGTAACAGGCAAAATAAGCTTCGGCAAAAAAGCCATTGTTTCAGGGCGAAAAGTCGTTTCTACCCCTCTCGTGTTAAAAGAACCCAATACCTTCTATTACCTAACTCTCGAAGCAATGTCCGTTGCAAACAAGCGGTTTAAGGCCGCGAACAACATGTCAACGGCCGTAGAACAAGGGAATATCCTTATCGATTCCGGTACAACATTGACGATTCTACCCCAGAATTTGTACAAAGGTGTTGCTTCGACATTGGCACATGTTGTTAAAGCGAAGCGAGTGAATGATCCGACTGGAGTTTTGGATCTCTGCTTCGCCGCATGCAGCGTTGATCATTTGAATATTCCAGTCATTACAGCACATTTTGCCGGCAACGCCGACGTGAAATTGTTACCGTTGAATACATTTGCAATGGTGGCTGATAATGTGGCTTGTTTGGCTTTCGTGCCGTCGGCGAACTTTGCCATTTTTGGAAACTTAGCACAGGTGAACTTTTTGGTCGGATACGATCTCGAGCGCAAGAGATTGTCGTTCAAATACAACGTTTGTGCTTAA

Coding sequence (CDS)

ATGGCTGCCATTTCAATTTTCTTCTGTTTGTTCCTCATTTCCTTCTCCCAAGCAACCGCTCATGGGGGCGTTGGCGGCGGCGGCCATGGCTTCACCACCTCTCTCTTCCACCGCGATTCTTTTCTCTCCCCTCTCTATAACCCATCTCTCTCCCACTATGACCGACTCACTAACGCCTTCCGCCGCTCCTTCTCCCGCTCCGACACCCTCCTCAACCGCGCTGCCGCCGTCTCCATCACCGGCATCCATTCCAGGATCATCCCCGACGACGGCGAGTTCCTAATGTCCATCTCCATCGGAACCCCGCGGGTCAAAATCATGGCCATCGCTGATACCGGCAGCGACCTGACGTGGACCCAGTGCATGCCATGTCACAAATGCTTCAACCAATCATTTCCCATTTTTAATCCACGTCGATCCTTCTCCTACCGCCACGTGTCTTGCACTTCCAATGCTTGCCGCTCTCTCGACGACTATCGCTGTGGGCCCGACAACCGAACCTGCAGCTACGGTTATAGCTACGGAGACCAATCCTTTACGTATGGCGACCTGGCATCTGAGAAAATTACTATTGGATCCTTCAAACTCTACAAGACACTTATTGGATGTGGCCATGTGAACGGCGGCACTTTCAGCGAAGATACCTCAGGAATTATCGGACTCGGCGGCGGTCCTCTCTCTTTAATCTCTCAAATGAGAAAAATCGCCGCCGTCAAACGGCGGTTCTCGTATTGCTTGCCGACATTCTTCAGTGACAAGAATGTAACAGGCAAAATAAGCTTCGGCAAAAAAGCCATTGTTTCAGGGCGAAAAGTCGTTTCTACCCCTCTCGTGTTAAAAGAACCCAATACCTTCTATTACCTAACTCTCGAAGCAATGTCCGTTGCAAACAAGCGGTTTAAGGCCGCGAACAACATGTCAACGGCCGTAGAACAAGGGAATATCCTTATCGATTCCGGTACAACATTGACGATTCTACCCCAGAATTTGTACAAAGGTGTTGCTTCGACATTGGCACATGTTGTTAAAGCGAAGCGAGTGAATGATCCGACTGGAGTTTTGGATCTCTGCTTCGCCGCATGCAGCGTTGATCATTTGAATATTCCAGTCATTACAGCACATTTTGCCGGCAACGCCGACGTGAAATTGTTACCGTTGAATACATTTGCAATGGTGGCTGATAATGTGGCTTGTTTGGCTTTCGTGCCGTCGGCGAACTTTGCCATTTTTGGAAACTTAGCACAGGTGAACTTTTTGGTCGGATACGATCTCGAGCGCAAGAGATTGTCGTTCAAATACAACGTTTGTGCTTAA
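
As a quick consistency check, the CDS can be translated and compared with the query residues shown in the BLAST alignments. This is a minimal sketch assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 48 and last 15 nucleotides of the CDS are used (the tail is assumed to be in frame).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCTGCCATTTCAATTTTCTTCTGTTTGTTCCTCATTTCCTTCTCC"  # first 16 codons
cds_end = "AACGTTTGTGCTTAA"                                    # last 5 codons, incl. stop
print(translate(cds_start))  # MAAISIFFCLFLISFS
print(translate(cds_end))    # NVCA
```

Both fragments agree with the query residues at the start (MAAISIFFCLFLISFS...) and end (...NVCA) of the alignments.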
BLAST of CmoCh04G006310 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 354.0 bits (907), Expect = 2.4e-96
Identity = 195/415 (46.99%), Positives = 266/415 (64.10%), Query Frame = 1

Query: 30  GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIPD 89
           GFT  L HRDS  SP YNP  +   RL NA  RS +R   + +     +       +  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQPQIDLTSN 89

Query: 90  DGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCT 149
            GE+LM++SIGTP   IMAIADTGSDL WTQC PC  C+ Q  P+F+P+ S +Y+ VSC+
Sbjct: 90  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 149

Query: 150 SNACRSLDDY-RCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGS-----FKLYKTLIGC 209
           S+ C +L++   C  ++ TCSY  SYGD S+T G++A + +T+GS      +L   +IGC
Sbjct: 150 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 209

Query: 210 GHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGK 269
           GH N GTF++  SGI+GLGGGP+SLI Q+    ++  +FSYCL    S K+ T KI+FG 
Sbjct: 210 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGT 269

Query: 270 KAIVSGRKVVSTPLVLK-EPNTFYYLTLEAMSVANKRFKAANNMSTAVEQGNILIDSGTT 329
            AIVSG  VVSTPL+ K    TFYYLTL+++SV +K+ + + + S + E GNI+IDSGTT
Sbjct: 270 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSE-GNIIIDSGTT 329

Query: 330 LTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVK 389
           LT+LP   Y  +   +A  + A++  DP   L LC++A     L +PVIT HF G ADVK
Sbjct: 330 LTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHFDG-ADVK 389

Query: 390 LLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
           L   N F  V++++ C AF  S +F+I+GN+AQ+NFLVGYD   K +SFK   CA
Sbjct: 390 LDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
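
The percentages on each HSP summary line are the listed fractions expressed over the alignment length. A small sketch, assuming the `Label = matches/length` layout used on this page (the function name `hsp_fractions` is hypothetical), recomputes them:

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for every 'Label = a/b' field."""
    out = {}
    for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        a, b = int(a), int(b)
        out[label] = (a, b, round(100 * a / b, 2))
    return out

line = "Identity = 195/415 (46.99%), Positives = 266/415 (64.10%), Query Frame = 1"
for label, (matches, length, pct) in hsp_fractions(line).items():
    print(f"{label}: {matches}/{length} = {pct:.2f}%")
```

The pattern is label-agnostic, so it picks up both fraction fields and ignores `Query Frame = 1`, which has no `a/b` value.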

BLAST of CmoCh04G006310 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 5.3e-96
Identity = 206/456 (45.18%), Positives = 282/456 (61.84%), Query Frame = 1

Query: 3   AISIFFCLFLISFSQATAHGGVGGGGH--GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 62
           A  I  C FL  F   T    +   GH   F+  L HRDS LSP+YNP ++  DRL  AF
Sbjct: 2   ATQILLCFFL--FFSVT----LSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAF 61

Query: 63  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 122
            RS SRS    ++   +S T + S +I  DGEF MSI+IGTP +K+ AIADTGSDLTW Q
Sbjct: 62  LRSVSRSRRFNHQ---LSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQ 121

Query: 123 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLD--DYRCGPDNRTCSYGYSYGDQS 182
           C PC +C+ ++ PIF+ ++S +Y+   C S  C++L   +  C   N  C Y YSYGDQS
Sbjct: 122 CKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQS 181

Query: 183 FTYGDLASEKITIGS-----FKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMR 242
           F+ GD+A+E ++I S          T+ GCG+ NGGTF E  SGIIGLGGG LSLISQ+ 
Sbjct: 182 FSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG 241

Query: 243 KIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRK----VVSTPLVLKEPNTFYYLT 302
             +++ ++FSYCL    +  N T  I+ G  +I S       VVSTPLV KEP T+YYLT
Sbjct: 242 --SSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLT 301

Query: 303 LEAMSVANKR-------FKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAH-V 362
           LEA+SV  K+       +   ++   +   GNI+IDSGTTLT+L    +   +S +   V
Sbjct: 302 LEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESV 361

Query: 363 VKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAF 422
             AKRV+DP G+L  CF + S + + +P IT HF G ADV+L P+N F  +++++ CL+ 
Sbjct: 362 TGAKRVSDPQGLLSHCFKSGSAE-IGLPEITVHFTG-ADVRLSPINAFVKLSEDMVCLSM 421

Query: 423 VPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
           VP+   AI+GN AQ++FLVGYDLE + +SF++  C+
Sbjct: 422 VPTTEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of CmoCh04G006310 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 2.8e-57
Identity = 150/413 (36.32%), Positives = 206/413 (49.88%), Query Frame = 1

Query: 30  GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIPD 89
           GF   L H DS        +L+ +  L  A  R   R   L   A     +G+ + +   
Sbjct: 40  GFQIMLEHVDS------GKNLTKFQLLERAIERGSRRLQRL--EAMLNGPSGVETSVYAG 99

Query: 90  DGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCT 149
           DGE+LM++SIGTP     AI DTGSDL WTQC PC +CFNQS PIFNP+ S S+  + C+
Sbjct: 100 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 159

Query: 150 SNACRSLDDYRCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGSFKLYKTLIGCGHVNGG 209
           S  C++L    C   N  C Y Y YGD S T G + +E +T GS  +     GCG  N G
Sbjct: 160 SQLCQALSSPTC--SNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQG 219

Query: 210 TFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKA-IVS 269
               + +G++G+G GPLSL SQ+        +FSYC+    S  +    +  G  A  V+
Sbjct: 220 FGQGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGS--STPSNLLLGSLANSVT 279

Query: 270 GRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF---KAANNMSTAVEQGNILIDSGTTLTI 329
                +T +   +  TFYY+TL  +SV + R     +A  +++    G I+IDSGTTLT 
Sbjct: 280 AGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 339

Query: 330 LPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAACS-VDHLNIPVITAHFAGNADVKLL 389
              N Y+ V       +    VN  +   DLCF   S   +L IP    HF G  D++L 
Sbjct: 340 FVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDG-GDLELP 399

Query: 390 PLNTFAMVADNVACLAFVPSA-NFAIFGNLAQVNFLVGYDLERKRLSFKYNVC 437
             N F   ++ + CLA   S+   +IFGN+ Q N LV YD     +SF    C
Sbjct: 400 SENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmoCh04G006310 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 5.5e-53
Identity = 137/394 (34.77%), Positives = 199/394 (50.51%), Query Frame = 1

Query: 49  SLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMA 108
           +L+ Y+ +  A +R   R  ++   A   S +GI + +   DGE+LM+++IGTP     A
Sbjct: 54  NLTKYELIKRAIKRGERRMRSI--NAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSA 113

Query: 109 IADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTC 168
           I DTGSDL WTQC PC +CF+Q  PIFNP+ S S+  + C S  C+ L    C  +N  C
Sbjct: 114 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETC--NNNEC 173

Query: 169 SYGYSYGDQSFTYGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSL 228
            Y Y YGD S T G +A+E  T  +  +     GCG  N G    + +G+IG+G GPLSL
Sbjct: 174 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 233

Query: 229 ISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPN-TFYY 288
            SQ+        +FSYC+ ++ S    T  +      +  G    ST L+    N T+YY
Sbjct: 234 PSQLG-----VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYY 293

Query: 289 LTLEAMSVANKRFKAANNMSTAVEQ--GNILIDSGTTLTILPQNLYKGVASTLAHVVKAK 348
           +TL+ ++V        ++     +   G ++IDSGTTLT LPQ+ Y  VA      +   
Sbjct: 294 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLP 353

Query: 349 RVNDPTGVLDLCFAACS-VDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPS 408
            V++ +  L  CF   S    + +P I+  F G   + L   N     A+ V CLA   S
Sbjct: 354 TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSS 413

Query: 409 A--NFAIFGNLAQVNFLVGYDLERKRLSFKYNVC 437
           +    +IFGN+ Q    V YDL+   +SF    C
Sbjct: 414 SQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmoCh04G006310 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 5.7e-50
Identity = 137/362 (37.85%), Positives = 186/362 (51.38%), Query Frame = 1

Query: 91  GEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPC-HKCFNQSFPIFNPRRSFSYRHVSCT 150
           G +++++ +GTP+  +  I DTGSDLTWTQC PC   C++Q  PIFNP +S SY +VSC+
Sbjct: 130 GNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCS 189

Query: 151 SNACRSLDDY-----RCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGSFKLYK-TLIGC 210
           S AC SL         C   N  C YG  YGDQSF+ G LA EK T+ +  ++     GC
Sbjct: 190 SAACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGC 249

Query: 211 GHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGK 270
           G  N G F+   +G++GLG   LS  SQ     A  + FSYCLP   S  + TG ++FG 
Sbjct: 250 GENNQGLFT-GVAGLLGLGRDKLSFPSQ--TATAYNKIFSYCLP---SSASYTGHLTFGS 309

Query: 271 KAIVSGRKVVSTPL-VLKEPNTFYYLTLEAMSVANKRFKAANNMSTAVEQGNILIDSGTT 330
             I   R V  TP+  + +  +FY L + A++V  ++       ST       LIDSGT 
Sbjct: 310 AGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIP---STVFSTPGALIDSGTV 369

Query: 331 LTILPQNLYKGVASTLAHVVKAKRVNDPT----GVLDLCFAACSVDHLNIPVITAHFAGN 390
           +T LP   Y  + S+     KAK    PT     +LD CF       + IP +   F+G 
Sbjct: 370 ITRLPPKAYAALRSSF----KAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGG 429

Query: 391 ADVKLLPLNTFAMVADNVACLAFV---PSANFAIFGNLAQVNFLVGYDLERKRLSFKYNV 438
           A V+L     F +   +  CLAF      +N AIFGN+ Q    V YD    R+ F  N 
Sbjct: 430 AVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNG 474

BLAST of CmoCh04G006310 vs. TrEMBL
Match: A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 625.2 bits (1611), Expect = 6.1e-176
Identity = 314/437 (71.85%), Positives = 358/437 (81.92%), Query Frame = 1

Query: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60
           MAAISIFF   L   S+ TAHGG   G HGFTTSLF RDS LSPL+NPSLS YD L +AF
Sbjct: 1   MAAISIFFYFLLFFSSKVTAHGG---GHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAF 60

Query: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120
           RRSFSRS TLL    +VS   I S IIPD GEFLMSI IGTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQ 120

Query: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180
           C+PC +CFNQS PIFNPRRS SYR VSC S+ CRSL+ Y CGPD ++CSYGYSYGD+SFT
Sbjct: 121 CLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFT 180

Query: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 240
           YGDLAS++ITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQMR IA VK 
Sbjct: 181 YGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKP 240

Query: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300
           RFSYCLPTFFS+ N+TG ISFG+KA+VSGR+VVSTPLV + P+TFY+LTLEA+SV  KRF
Sbjct: 241 RFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRF 300

Query: 301 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA 360
           KAAN +S     GNI+IDSGTTLT+LP++LY GV STLA V+KAKRV+DP+G+L+LC++A
Sbjct: 301 KAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSA 360

Query: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420
             VD LNIP+ITAHFAG ADVKLLP+NTFA VADNV CL F P+   AIFGNLAQ+NF V
Sbjct: 361 GQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEV 420

Query: 421 GYDLERKRLSFKYNVCA 438
           GYDL  KRLSF+  +CA
Sbjct: 421 GYDLGNKRLSFEPKLCA 434

BLAST of CmoCh04G006310 vs. TrEMBL
Match: A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 481.9 bits (1239), Expect = 8.4e-133
Identity = 265/446 (59.42%), Positives = 312/446 (69.96%), Query Frame = 1

Query: 2   AAISIFF--CLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNA 61
           A IS+FF   LFLISFSQ T    +  G +GFTTSLFHRDS LSPL   SLSHYDRL NA
Sbjct: 3   ATISLFFHLILFLISFSQTT----IINGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANA 62

Query: 62  FRRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWT 121
           FRRS SRS  LLNRAA     G+ S I P  GE+LMS+SIGTP V  + IADTGSDLTW 
Sbjct: 63  FRRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWA 122

Query: 122 QCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSF 181
           QC+PC KC+ Q  PIFNP +S S+ HV C +  C ++DD  CG     C Y Y+YGD+++
Sbjct: 123 QCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQG-VCDYSYTYGDRTY 182

Query: 182 TYGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVK 241
           + GDL  EKITIGS  + K++IGCGH + G F    SG+IGLGGG LSL+SQM + + + 
Sbjct: 183 SKGDLGFEKITIGSSSV-KSVIGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMSQTSGIS 242

Query: 242 RRFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKR 301
           RRFSYCLPT  S  N  GKI+FG+ A+VSG  VVSTPL+ K   T+YY+TLEA+S+ N+R
Sbjct: 243 RRFSYCLPTLLSHAN--GKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNER 302

Query: 302 FKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCF- 361
             A        +QGN++IDSGTTLTILP+ LY GV S+L  VVKAKRV DP G LDLCF 
Sbjct: 303 HMA------FAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFD 362

Query: 362 ----AACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACL---AFVPSANFAIFG 421
               AA S   L IPVITAHF+G A+V LLP+NTF  VADNV CL   A  P+  F I G
Sbjct: 363 DGINAAAS---LGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIG 422

Query: 422 NLAQVNFLVGYDLERKRLSFKYNVCA 438
           NLAQ NFL+GYDLE KRLSFK  VCA
Sbjct: 423 NLAQANFLIGYDLEAKRLSFKPTVCA 430

BLAST of CmoCh04G006310 vs. TrEMBL
Match: M5WRG3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 7.1e-116
Identity = 226/429 (52.68%), Positives = 286/429 (66.67%), Query Frame = 1

Query: 29  HGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSR-----SDTLLNRAAAVSITGIH 88
           HGFT  L HRDS LSPLYN S+SH DRL NAFRRS +R       T+ + +++++   I 
Sbjct: 31  HGFTADLIHRDSPLSPLYNSSMSHLDRLHNAFRRSVTRVHHFIKPTMTSLSSSLAAPNIQ 90

Query: 89  SRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSY 148
           S IIP  GE+LM++SIGTP V+++ IADTGSDL WTQC PC +CFNQ+ P+F+P++S +Y
Sbjct: 91  SIIIPSAGEYLMNVSIGTPPVEVLGIADTGSDLIWTQCKPCKQCFNQNPPLFDPKKSSTY 150

Query: 149 RHVSCTSNACRSLDDYRCGP----DNRTCSYGYSYGDQSFTYGDLASEKITIGS-----F 208
             + C S++C  L++  CG     D+ TC Y Y YGD+SFT G LA E +T GS      
Sbjct: 151 HSIPCQSSSCTYLEEAACGTLINGDHDTCEYSYRYGDRSFTRGTLALETLTFGSTSGRPT 210

Query: 209 KLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYC-LPTFFSDK 268
            L K + GCGH NGGTF E  SG+IGLGGGPLSLISQ+ K+     +FSYC LPT     
Sbjct: 211 SLPKVVFGCGHENGGTFDESGSGLIGLGGGPLSLISQLTKLTN-GGKFSYCLLPT---AN 270

Query: 269 NVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF------KAANNMS 328
               KISFG   IVSG   VSTPLV K P+TFYYLTLEA+SV  KR             +
Sbjct: 271 TAASKISFGSAGIVSGSGAVSTPLVAKNPDTFYYLTLEAISVGEKRLAYKTKSPDCEKAA 330

Query: 329 TAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAACSVDHLN 388
            A  +GNI+IDSGTTLT+LP   +  + S L   + A+RV+DP G+L LCF + S D + 
Sbjct: 331 VAANEGNIIIDSGTTLTLLPPGFHDDLVSALETAINAERVSDPRGILSLCFKSKS-DDIG 390

Query: 389 IPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERK 437
           +PVIT HF+G ADVKL  LNTFA + D++ C   +PS++ AIFGNLAQ+NFLVGYDLE +
Sbjct: 391 VPVITVHFSGGADVKLQALNTFARMDDDMICFTMIPSSDVAIFGNLAQMNFLVGYDLEER 450

BLAST of CmoCh04G006310 vs. TrEMBL
Match: A0A0A0KX67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055390 PE=3 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.2e-115
Identity = 241/447 (53.91%), Positives = 295/447 (66.00%), Query Frame = 1

Query: 1   MAAISIFF--CLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTN 60
           +A ISIFF   L LISFSQ T    +  G +GFTTSLFHRDS LSPL   SLSHYDRLTN
Sbjct: 2   VATISIFFHLILLLISFSQTT----IINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTN 61

Query: 61  AFRRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTW 120
           AFRRS SRS TLLNRAA      + + + P  GE+LMS+SIGTP V  + +ADTGSDL W
Sbjct: 62  AFRRSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMW 121

Query: 121 TQCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQS 180
            QC+PC KC+ QS PIF+P +S S+ HV C S  C+++DD  CG     C Y Y+YGD++
Sbjct: 122 AQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQG-VCDYSYTYGDRT 181

Query: 181 FTYGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAV 240
           ++ GDL  EKITIGS  + K++IGCGH +GG F    SG+IGLGGG    +         
Sbjct: 182 YSKGDLGFEKITIGSSSV-KSVIGCGHESGGGFGF-ASGVIGLGGGANPPV--------- 241

Query: 241 KRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANK 300
                  LPT  S  N  GKI+FG+ A+VSG  VVSTPL+ K P T+YY+TLEA+S+ N+
Sbjct: 242 -------LPTLLSHAN--GKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNE 301

Query: 301 RFKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCF 360
           R  A+       +QGN++IDSGTTL+ LP+ LY GV S+L  VVKAKRV DP    DLCF
Sbjct: 302 RHMAS------AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF 361

Query: 361 AACSVDHLN------IPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSA---NFAI 420
                D +N      IP+ITA F+G A+V LLP+NTF  VA+NV CL   P++    F I
Sbjct: 362 D----DGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGI 413

Query: 421 FGNLAQVNFLVGYDLERKRLSFKYNVC 437
            GNLA  NFL+GYDLE KRLSFK  VC
Sbjct: 422 IGNLALANFLIGYDLEAKRLSFKPTVC 413

BLAST of CmoCh04G006310 vs. TrEMBL
Match: W9SK79_9ROSA (Putative aspartic protease OS=Morus notabilis GN=L484_022741 PE=3 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 2.8e-112
Identity = 234/454 (51.54%), Positives = 295/454 (64.98%), Query Frame = 1

Query: 3   AISIFFCLFLIS-FSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNP-SLSHYDRLTNAF 62
           ++ +F CL +++ FS  T          GF   L  RDS  SP YNP +  ++DRL +AF
Sbjct: 11  SVVLFGCLIMLNDFSLPTE-----ALTRGFIIDLIQRDSPFSPAYNPLAADNFDRLRSAF 70

Query: 63  RRSFSRSDTLLNRAAAVSITG-------IHSRIIPDDGEFLMSISIGTPRVKIMAIADTG 122
            RSFSR D L      +S +        I S+IIP +GE+LM++S+GTP V ++ IADTG
Sbjct: 71  GRSFSRVDRLYKPTTLLSFSSSSSSSIPIQSKIIPSEGEYLMNVSLGTPPVPVLGIADTG 130

Query: 123 SDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGP----DNRTCS 182
           SDL WTQC PC +CF Q+ P+FNP +S +YR+++C S  C  L +  C         TC 
Sbjct: 131 SDLMWTQCKPCTQCFKQNPPMFNPNKSSTYRNIACESKPCSELLESSCDAAAERGGDTCE 190

Query: 183 YGYSYGDQSFTYGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLI 242
           Y YSYGD SFT G+LAS+ +TIGS  L K + GCG  NGGTF E  SG+IGLGGGPLSL+
Sbjct: 191 YRYSYGDHSFTKGNLASDTLTIGSTSLPKIIFGCGRENGGTFDESGSGLIGLGGGPLSLV 250

Query: 243 SQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLT 302
           SQ+ K  ++  +FSYCL    S+  VT KISFG+  IVSG  VVSTPLV KEPNTFYYLT
Sbjct: 251 SQLGK--SIGGKFSYCLVPLTSEPYVTSKISFGRAGIVSGPSVVSTPLVAKEPNTFYYLT 310

Query: 303 LEAMSVANKR---FKAANNMSTAV--EQGNILIDSGTTLTILPQNLYKGVASTLAHVVKA 362
           LEA+SV  KR   +   +N S A+   +GNI+IDSGTTLT LP   +  + S LA  V A
Sbjct: 311 LEAISVGKKRLVYYHENHNQSKALAGNEGNIIIDSGTTLTFLPVGFHDDLVSALAEAVDA 370

Query: 363 KRVNDPTGVLDLCFAA--CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFV 422
           +RV+DP GVL LCF A   S    + P+ITAHF+G ADV L P+NTFA V D++ C   +
Sbjct: 371 ERVSDPKGVLSLCFRAEKESESLASAPIITAHFSG-ADVVLQPMNTFAKVEDDLFCFTMI 430

Query: 423 PSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVC 437
           PS + AIFGNLAQ+NFLVGYDLE   +SFK   C
Sbjct: 431 PSNDVAIFGNLAQMNFLVGYDLESGIVSFKPTDC 456

BLAST of CmoCh04G006310 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 378.3 bits (970), Expect = 6.6e-105
Identity = 205/413 (49.64%), Positives = 266/413 (64.41%), Query Frame = 1

Query: 30  GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIPD 89
           GFT  L HRDS  SP YN + +   R+ NA RRS   +    N  A  S     S I  +
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDA--SPNSPQSFITSN 84

Query: 90  DGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCT 149
            GE+LM+ISIGTP V I+AIADTGSDL WTQC PC  C+ Q+ P+F+P+ S +YR VSC+
Sbjct: 85  RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCS 144

Query: 150 SNACRSLDDYRCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGS-----FKLYKTLIGCG 209
           S+ CR+L+D  C  D  TCSY  +YGD S+T GD+A + +T+GS       L   +IGCG
Sbjct: 145 SSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCG 204

Query: 210 HVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKK 269
           H N GTF    SGIIGLGGG  SL+SQ+RK  ++  +FSYCL  F S+  +T KI+FG  
Sbjct: 205 HENTGTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSETGLTSKINFGTN 264

Query: 270 AIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRFKAANNMSTAVEQGNILIDSGTTLT 329
            IVSG  VVST +V K+P T+Y+L LEA+SV +K+ +  + +     +GNI+IDSGTTLT
Sbjct: 265 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTI-FGTGEGNIVIDSGTTLT 324

Query: 330 ILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVKLL 389
           +LP N Y  + S +A  +KA+RV DP G+L LC+   S     +P IT HF G  DVKL 
Sbjct: 325 LLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS--SFKVPDITVHFKG-GDVKLG 384

Query: 390 PLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
            LNTF  V+++V+C AF  +    IFGNLAQ+NFLVGYD     +SFK   C+
Sbjct: 385 NLNTFVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429

BLAST of CmoCh04G006310 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 354.0 bits (907), Expect = 1.3e-97
Identity = 195/415 (46.99%), Positives = 266/415 (64.10%), Query Frame = 1

Query: 30  GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIPD 89
           GFT  L HRDS  SP YNP  +   RL NA  RS +R   + +     +       +  +
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR---VFHFTEKDNTPQPQIDLTSN 89

Query: 90  DGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCT 149
            GE+LM++SIGTP   IMAIADTGSDL WTQC PC  C+ Q  P+F+P+ S +Y+ VSC+
Sbjct: 90  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 149

Query: 150 SNACRSLDDY-RCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGS-----FKLYKTLIGC 209
           S+ C +L++   C  ++ TCSY  SYGD S+T G++A + +T+GS      +L   +IGC
Sbjct: 150 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 209

Query: 210 GHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGK 269
           GH N GTF++  SGI+GLGGGP+SLI Q+    ++  +FSYCL    S K+ T KI+FG 
Sbjct: 210 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGT 269

Query: 270 KAIVSGRKVVSTPLVLK-EPNTFYYLTLEAMSVANKRFKAANNMSTAVEQGNILIDSGTT 329
            AIVSG  VVSTPL+ K    TFYYLTL+++SV +K+ + + + S + E GNI+IDSGTT
Sbjct: 270 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSE-GNIIIDSGTT 329

Query: 330 LTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVK 389
           LT+LP   Y  +   +A  + A++  DP   L LC++A     L +PVIT HF G ADVK
Sbjct: 330 LTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHFDG-ADVK 389

Query: 390 LLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
           L   N F  V++++ C AF  S +F+I+GN+AQ+NFLVGYD   K +SFK   CA
Sbjct: 390 LDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmoCh04G006310 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 352.8 bits (904), Expect = 3.0e-97
Identity = 206/456 (45.18%), Positives = 282/456 (61.84%), Query Frame = 1

Query: 3   AISIFFCLFLISFSQATAHGGVGGGGH--GFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 62
           A  I  C FL  F   T    +   GH   F+  L HRDS LSP+YNP ++  DRL  AF
Sbjct: 2   ATQILLCFFL--FFSVT----LSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAF 61

Query: 63  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 122
            RS SRS    ++   +S T + S +I  DGEF MSI+IGTP +K+ AIADTGSDLTW Q
Sbjct: 62  LRSVSRSRRFNHQ---LSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQ 121

Query: 123 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLD--DYRCGPDNRTCSYGYSYGDQS 182
           C PC +C+ ++ PIF+ ++S +Y+   C S  C++L   +  C   N  C Y YSYGDQS
Sbjct: 122 CKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQS 181

Query: 183 FTYGDLASEKITIGS-----FKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMR 242
           F+ GD+A+E ++I S          T+ GCG+ NGGTF E  SGIIGLGGG LSLISQ+ 
Sbjct: 182 FSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG 241

Query: 243 KIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRK----VVSTPLVLKEPNTFYYLT 302
             +++ ++FSYCL    +  N T  I+ G  +I S       VVSTPLV KEP T+YYLT
Sbjct: 242 --SSISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLT 301

Query: 303 LEAMSVANKR-------FKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAH-V 362
           LEA+SV  K+       +   ++   +   GNI+IDSGTTLT+L    +   +S +   V
Sbjct: 302 LEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESV 361

Query: 363 VKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAF 422
             AKRV+DP G+L  CF + S + + +P IT HF G ADV+L P+N F  +++++ CL+ 
Sbjct: 362 TGAKRVSDPQGLLSHCFKSGSAE-IGLPEITVHFTG-ADVRLSPINAFVKLSEDMVCLSM 421

Query: 423 VPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
           VP+   AI+GN AQ++FLVGYDLE + +SF++  C+
Sbjct: 422 VPTTEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of CmoCh04G006310 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 338.6 bits (867), Expect = 5.8e-93
Identity = 196/455 (43.08%), Positives = 269/455 (59.12%), Query Frame = 1

Query: 1   MAAISIFFC-LFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNA 60
           MA  +  +C L  ISF  A+            T  L HRDS  SPLYNP  +  DRL  A
Sbjct: 1   MATKTFLYCSLLAISFFFAS---NSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAA 60

Query: 61  FRRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWT 120
           F RS SRS     +      T + S +I + GE+ MSISIGTP  K+ AIADTGSDLTW 
Sbjct: 61  FLRSISRSRRFTTK------TDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWV 120

Query: 121 QCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYR--CGPDNRTCSYGYSYGDQ 180
           QC PC +C+ Q+ P+F+ ++S +Y+  SC S  C++L ++   C      C Y YSYGD 
Sbjct: 121 QCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDN 180

Query: 181 SFTYGDLASEKITI-----GSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQM 240
           SFT GD+A+E I+I      S     T+ GCG+ NGGTF E  SGIIGLGGGPLSL+SQ+
Sbjct: 181 SFTKGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQL 240

Query: 241 RKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSG----RKVVSTPLVLKEPNTFYYL 300
              +++ ++FSYCL    +  N T  I+ G  +I S        ++TPL+ K+P T+Y+L
Sbjct: 241 G--SSIGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFL 300

Query: 301 TLEAMSVANKRFKAAN-----NMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAH-VV 360
           TLEA++V   +          N  ++   GNI+IDSGTTLT+L    Y    + +   V 
Sbjct: 301 TLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVT 360

Query: 361 KAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFV 420
            AKRV+DP G+L  CF +   + + +P IT HF  NADVKL P+N F  + ++  CL+ +
Sbjct: 361 GAKRVSDPQGLLTHCFKSGDKE-IGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMI 420

Query: 421 PSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
           P+   AI+GN+ Q++FLVGYDLE K +SF+   C+
Sbjct: 421 PTTEVAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of CmoCh04G006310 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 232.6 bits (592), Expect = 4.5e-61
Identity = 162/419 (38.66%), Positives = 219/419 (52.27%), Query Frame = 1

Query: 29  HGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIP 88
           HGFT  L HR S  S           R++N    S   ++T+                  
Sbjct: 28  HGFTMDLIHRRSNAS----------SRVSNTQSGSSPYANTVF----------------- 87

Query: 89  DDGEFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSC 148
           D+  +LM + +GTP  +I AI DTGS++TWTQC+PC  C+ Q+ PIF+P +S +++   C
Sbjct: 88  DNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRC 147

Query: 149 TSNACRSLDDYRCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGS-----FKLYKTLIGC 208
                          D  +C Y   Y D ++T G LA+E IT+ S     F + +T+IGC
Sbjct: 148 ---------------DGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGC 207

Query: 209 GHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGK 268
           GH N   F    SG++GL  GP SLI+QM          SYC    FS +  T KI+FG 
Sbjct: 208 GH-NNSWFKPSFSGMVGLNWGPSSLITQMG--GEYPGLMSYC----FSGQG-TSKINFGA 267

Query: 269 KAIVSGRKVVSTPLVLKEPNT-FYYLTLEAMSVANKRFKAANNMSTAVEQGNILIDSGTT 328
            AIV+G  VVST + +      FYYL L+A+SV N R +       A+E GNI+IDSGTT
Sbjct: 268 NAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE-GNIVIDSGTT 327

Query: 329 LTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVK 388
           LT  P +    V   + HVV A R  DPTG   LC+ + ++D    PVIT HF+G  D+ 
Sbjct: 328 LTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID--IFPVITMHFSGGVDLV 387

Query: 389 LLPLNTFAMVADN--VACLAFVPSA--NFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 438
           L   N + M ++N  V CLA + ++    AIFGN AQ NFLVGYD     +SF    C+
Sbjct: 388 LDKYNMY-MESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

BLAST of CmoCh04G006310 vs. NCBI nr
Match: gi|449462551|ref|XP_004149004.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 625.2 bits (1611), Expect = 8.8e-176
Identity = 314/437 (71.85%), Positives = 358/437 (81.92%), Query Frame = 1

Query: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60
           MAAISIFF   L   S+ TAHGG   G HGFTTSLF RDS LSPL+NPSLS YD L +AF
Sbjct: 1   MAAISIFFYFLLFFSSKVTAHGG---GHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAF 60

Query: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120
           RRSFSRS TLL    +VS   I S IIPD GEFLMSI IGTP V ++AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQ 120

Query: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180
           C+PC +CFNQS PIFNPRRS SYR VSC S+ CRSL+ Y CGPD ++CSYGYSYGD+SFT
Sbjct: 121 CLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFT 180

Query: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 240
           YGDLAS++ITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQMR IA VK 
Sbjct: 181 YGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKP 240

Query: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300
           RFSYCLPTFFS+ N+TG ISFG+KA+VSGR+VVSTPLV + P+TFY+LTLEA+SV  KRF
Sbjct: 241 RFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRF 300

Query: 301 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA 360
           KAAN +S     GNI+IDSGTTLT+LP++LY GV STLA V+KAKRV+DP+G+L+LC++A
Sbjct: 301 KAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSA 360

Query: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420
             VD LNIP+ITAHFAG ADVKLLP+NTFA VADNV CL F P+   AIFGNLAQ+NF V
Sbjct: 361 GQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEV 420

Query: 421 GYDLERKRLSFKYNVCA 438
           GYDL  KRLSF+  +CA
Sbjct: 421 GYDLGNKRLSFEPKLCA 434

BLAST of CmoCh04G006310 vs. NCBI nr
Match: gi|659102472|ref|XP_008452150.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 624.4 bits (1609), Expect = 1.5e-175
Identity = 312/437 (71.40%), Positives = 359/437 (82.15%), Query Frame = 1

Query: 1   MAAISIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 60
           M AISIFF   L   S+ATAHGG   G HGFTTSL+HRDS LSPL+NPSLS YD L  +F
Sbjct: 1   MPAISIFFYFLLFFSSKATAHGG---GHHGFTTSLYHRDSLLSPLHNPSLSRYDSLVESF 60

Query: 61  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 120
           RRSFSRS TLLN   +VS   I S IIPD GEFLMSI IGTPRV  +AIADTGSDLTWTQ
Sbjct: 61  RRSFSRSATLLNHLTSVSTACIRSPIIPDSGEFLMSIFIGTPRVNFIAIADTGSDLTWTQ 120

Query: 121 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 180
           C+PC +CFNQS PIFNPRRS SYR VSC+S+ CRSL+   CG D ++CSYGYSYGD+SFT
Sbjct: 121 CLPCRECFNQSQPIFNPRRSSSYRKVSCSSDTCRSLESSHCGLDLKSCSYGYSYGDRSFT 180

Query: 181 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 240
           YGDLAS+KITIGSFKL KT+IGCGH NGGTF   TSGIIGLGGG LSL+SQM  IA VK 
Sbjct: 181 YGDLASDKITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMSTIAGVKP 240

Query: 241 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 300
           +FSYCLPTFFS++N+TGKISFG+KA+VSGR+VVSTPLV + P+TFY+LTLEA+SV NKRF
Sbjct: 241 QFSYCLPTFFSNENITGKISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGNKRF 300

Query: 301 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA 360
           KAA +MS    QGNI+IDSGTTLT+LP++LY GV STLA V+K KRV+DP+G+L+LC++A
Sbjct: 301 KAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYDGVVSTLARVIKTKRVDDPSGILELCYSA 360

Query: 361 CSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSANFAIFGNLAQVNFLV 420
             ++ LNIP+ITAHF+G ADVKLLP+NTFA VADNV CL   P+ N AIFGNLAQ+NF V
Sbjct: 361 GQLEDLNIPIITAHFSGRADVKLLPVNTFAPVADNVICLTLAPATNVAIFGNLAQINFEV 420

Query: 421 GYDLERKRLSFKYNVCA 438
           GYDL  KRLSFK   CA
Sbjct: 421 GYDLGNKRLSFKPTRCA 434

BLAST of CmoCh04G006310 vs. NCBI nr
Match: gi|778697533|ref|XP_004149005.2| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 481.9 bits (1239), Expect = 1.2e-132
Identity = 265/446 (59.42%), Positives = 312/446 (69.96%), Query Frame = 1

Query: 2   AAISIFF--CLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNA 61
           A IS+FF   LFLISFSQ T    +  G +GFTTSLFHRDS LSPL   SLSHYDRL NA
Sbjct: 3   ATISLFFHLILFLISFSQTT----IINGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANA 62

Query: 62  FRRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWT 121
           FRRS SRS  LLNRAA     G+ S I P  GE+LMS+SIGTP V  + IADTGSDLTW 
Sbjct: 63  FRRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWA 122

Query: 122 QCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSF 181
           QC+PC KC+ Q  PIFNP +S S+ HV C +  C ++DD  CG     C Y Y+YGD+++
Sbjct: 123 QCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQG-VCDYSYTYGDRTY 182

Query: 182 TYGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVK 241
           + GDL  EKITIGS  + K++IGCGH + G F    SG+IGLGGG LSL+SQM + + + 
Sbjct: 183 SKGDLGFEKITIGSSSV-KSVIGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMSQTSGIS 242

Query: 242 RRFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKR 301
           RRFSYCLPT  S  N  GKI+FG+ A+VSG  VVSTPL+ K   T+YY+TLEA+S+ N+R
Sbjct: 243 RRFSYCLPTLLSHAN--GKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNER 302

Query: 302 FKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCF- 361
             A        +QGN++IDSGTTLTILP+ LY GV S+L  VVKAKRV DP G LDLCF 
Sbjct: 303 HMA------FAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFD 362

Query: 362 ----AACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACL---AFVPSANFAIFG 421
               AA S   L IPVITAHF+G A+V LLP+NTF  VADNV CL   A  P+  F I G
Sbjct: 363 DGINAAAS---LGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIG 422

Query: 422 NLAQVNFLVGYDLERKRLSFKYNVCA 438
           NLAQ NFL+GYDLE KRLSFK  VCA
Sbjct: 423 NLAQANFLIGYDLEAKRLSFKPTVCA 430

BLAST of CmoCh04G006310 vs. NCBI nr
Match: gi|659102476|ref|XP_008452153.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 480.7 bits (1236), Expect = 2.7e-132
Identity = 263/445 (59.10%), Positives = 313/445 (70.34%), Query Frame = 1

Query: 3   AISIF--FCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAF 62
           A SIF    LFLISFSQ T    +  G +GFTTSLFHRDS LSPL   SLSHYDRL+NAF
Sbjct: 2   AASIFCRLILFLISFSQTT----IINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLSNAF 61

Query: 63  RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ 122
           RRS SRS  LLNRAA     G+ S I P  GE+LMS+SIGTP V  + +ADTGSDLTW Q
Sbjct: 62  RRSLSRSAALLNRAATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTWAQ 121

Query: 123 CMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFT 182
           C+PC KCF QS PIFNP +S S+ HV C S  C+++DD  CG     C Y Y+YGDQ++T
Sbjct: 122 CLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQG-VCDYSYTYGDQTYT 181

Query: 183 YGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKR 242
            GDL  EKITIGS  + K++IGCGH +GG F    SG+IGLGGG LSL+SQM + + + R
Sbjct: 182 KGDLGLEKITIGSSSV-KSVIGCGHESGGGFGF-ASGVIGLGGGQLSLVSQMSQTSGISR 241

Query: 243 RFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKRF 302
           RFSYCLPT  S  N  GKI+FG+ A+VSG  VVSTPL+ K+P T+YY+TLEA+S+ N+R 
Sbjct: 242 RFSYCLPTLLSHAN--GKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERH 301

Query: 303 KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCF-- 362
            A+       +QGN++IDSGTTLT+LP+ LY GV S+L  VVKAKRV DP    DLCF  
Sbjct: 302 MAS------AKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDD 361

Query: 363 ----AACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACL---AFVPSANFAIFG 422
               AA S     IP+ITAHF+G A+V LLP+NTF  VA+NV CL   A  P+  F I G
Sbjct: 362 GINVAASS----GIPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIG 421

Query: 423 NLAQVNFLVGYDLERKRLSFKYNVC 437
           NLAQ NFL+GYDLE KRLSFK  VC
Sbjct: 422 NLAQANFLIGYDLEAKRLSFKPTVC 427

BLAST of CmoCh04G006310 vs. NCBI nr
Match: gi|659102474|ref|XP_008452152.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 468.8 bits (1205), Expect = 1.1e-128
Identity = 257/446 (57.62%), Positives = 308/446 (69.06%), Query Frame = 1

Query: 2   AAISIFFCLFL--ISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNA 61
           A ISIFF LFL  ISFSQ T    +  G +GFTTSLFHRDS LSPL   +LSHYDRL+NA
Sbjct: 3   ATISIFFLLFLLLISFSQTT----IINGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNA 62

Query: 62  FRRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWT 121
           FRRS SRS  LLNR A     G+ S I P  GE+LM +SIGTP V  + + DTGSDLTW 
Sbjct: 63  FRRSLSRSAALLNRTATSGAVGLQSPIAPGSGEYLMYVSIGTPPVDYIGMIDTGSDLTWA 122

Query: 122 QCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSF 181
           QC+PC KCF Q  PIFNP +S S+ HV C S  C+++DD  CG     C Y Y+YGDQ++
Sbjct: 123 QCLPCRKCFLQLRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQG-VCDYSYTYGDQTY 182

Query: 182 TYGDLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVK 241
           T GDL  EKITIGS  + K++IGCGH +GG F    SG+IGLGGG LSL+SQM + + + 
Sbjct: 183 TKGDLGFEKITIGSSSV-KSVIGCGHESGGGFG-FASGVIGLGGGQLSLVSQMSQTSGIS 242

Query: 242 RRFSYCLPTFFSDKNVTGKISFGKKAIVSGRKVVSTPLVLKEPNTFYYLTLEAMSVANKR 301
           RRFSYCLP      N  GKI+F + A+VSG  VVSTPL+ K+P T+YY+TLEA+S+ N+R
Sbjct: 243 RRFSYCLPPLLGHAN--GKINFAQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNER 302

Query: 302 FKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCF- 361
             A      + +QGN++IDSGTTLT+LP+ LY GV S+L  VVKAKRV DP    DLCF 
Sbjct: 303 HMA------SAKQGNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFD 362

Query: 362 -----AACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACL---AFVPSANFAIF 421
                AA S     IP+ITAHF+G A+V LLP+NTF  VA+NV CL   A  P+  F I 
Sbjct: 363 DGINVAASS----GIPIITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGII 422

Query: 422 GNLAQVNFLVGYDLERKRLSFKYNVC 437
           GNLAQ NFL+GYDLE KRLSFK  VC
Sbjct: 423 GNLAQANFLIGYDLEAKRLSFKPTVC 429
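
Each hit above opens with two summary lines: a score/expect line and an identity line. The following is a minimal Python sketch, not part of the original record, showing one way to read those two lines and recompute the identity percentage from the match counts. The function name `parse_hsp` and the dictionary keys are illustrative assumptions; the regular expressions only assume the layout seen in this record.

```python
import re

# Patterns follow the layout of the HSP summary lines in this record.
# They are an assumption about this page's format, not a general BLAST parser.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
HSP_IDENTITY = re.compile(r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp(score_line, identity_line):
    """Extract score, expect and identity figures from one HSP header."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENTITY.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not look like an HSP summary")
    identical, aligned = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "identical": identical,
        "aligned_length": aligned,
        "reported_pct": float(i.group(3)),
        # Identity percentage is identical positions over aligned length.
        "computed_pct": round(100.0 * identical / aligned, 2),
    }


# Summary lines of the top NCBI nr hit (XP_004149004.1).
hit = parse_hsp(
    "HSP 1 Score: 625.2 bits (1611), Expect = 8.8e-176",
    "Identity = 314/437 (71.85%), Positives = 358/437 (81.92%), Query Frame = 1",
)
print(hit["computed_pct"], hit["reported_pct"])
```

The recomputed value should agree with the percentage printed in the record, which makes this a quick consistency check when summary lines are copied by hand.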

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
CDR1_ARATH    2.4e-96   46.99     Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
ASPR1_ARATH   5.3e-96   45.18     Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S...
NEP1_NEPGR    2.8e-57   36.32     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR    5.5e-53   34.77     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
ASPA_ARATH    5.7e-50   37.85     Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ...

Match Name         E-value    Identity  Description
A0A0A0KZZ3_CUCSA   6.1e-176   71.85     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1
A0A0A0KV20_CUCSA   8.4e-133   59.42     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1
M5WRG3_PRUPE       7.1e-116   52.68     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1
A0A0A0KX67_CUCSA   1.2e-115   53.91     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055390 PE=3 SV=1
W9SK79_9ROSA       2.8e-112   51.54     Putative aspartic protease OS=Morus notabilis GN=L484_022741 PE=3 SV=1

Match Name    E-value    Identity  Description
AT1G64830.1   6.6e-105   49.64     Eukaryotic aspartyl protease family protein
AT5G33340.1   1.3e-97    46.99     Eukaryotic aspartyl protease family protein
AT2G35615.1   3.0e-97    45.18     Eukaryotic aspartyl protease family protein
AT1G31450.1   5.8e-93    43.08     Eukaryotic aspartyl protease family protein
AT2G28010.1   4.5e-61    38.66     Eukaryotic aspartyl protease family protein

Match Name                         E-value    Identity  Description
gi|449462551|ref|XP_004149004.1|   8.8e-176   71.85     PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|659102472|ref|XP_008452150.1|   1.5e-175   71.40     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|778697533|ref|XP_004149005.2|   1.2e-132   59.42     PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|659102476|ref|XP_008452153.1|   2.7e-132   59.10     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|659102474|ref|XP_008452152.1|   1.1e-128   57.62     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR001969   Aspartic_peptidase_AS
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0004190   aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term         Definition
GO:0006508   proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh04G006310.1   CmoCh04G006310.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description                   Source    Source Term        Source Description                 Alignment
IPR001461   Aspartic peptidase A1 family      PANTHER   PTHR13683          ASPARTYL PROTEASES                 coord: 4..432, score: 3.8E
IPR001969   Aspartic peptidase, active site   PROSITE   PS00141            ASP_PROTEASE                       coord: 108..119, score: -; coord: 315..326, scor
IPR021109   Aspartic peptidase domain         GENE3D    G3DSA:2.40.70.10                                      coord: 90..263, score: 1.0E-36; coord: 268..436, score: 9.6
IPR021109   Aspartic peptidase domain         unknown   SSF50630           Acid proteases                     coord: 88..436, score: 1.8
None        No IPR available                  PANTHER   PTHR13683:SF298    ASPARTIC PROTEINASE CDR1-RELATED   coord: 4..432, score: 3.8E
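
The PROSITE PS00141 rows above give two coordinate ranges on the protein (108..119 and 315..326). As a hedged illustration that is not part of the original record, the sketch below slices those ranges out of the query sequence as it appears in the BLAST alignments. It assumes the coordinates are 1-based and inclusive; the helper name `slice_1based` is hypothetical, and only the two 60-residue query segments that cover the ranges (copied from the Query lines numbered 61..120 and 301..360) are included.

```python
def slice_1based(segment, segment_start, start, end):
    """Return residues start..end (1-based, inclusive).

    `segment` is a stretch of the protein whose first residue carries the
    number `segment_start`, as printed on a BLAST "Query:" line.
    """
    segment_end = segment_start + len(segment) - 1
    if start > end or start < segment_start or end > segment_end:
        raise ValueError("requested range is not covered by the segment")
    # Shift to 0-based, half-open indices for Python slicing.
    return segment[start - segment_start : end - segment_start + 1]


# Query residues 61..120 and 301..360, copied from the alignments above.
QUERY_61_120 = "RRSFSRSDTLLNRAAAVSITGIHSRIIPDDGEFLMSISIGTPRVKIMAIADTGSDLTWTQ"
QUERY_301_360 = "KAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFAA"

site_1 = slice_1based(QUERY_61_120, 61, 108, 119)
site_2 = slice_1based(QUERY_301_360, 301, 315, 326)
print(site_1)  # AIADTGSDLTWT
print(site_2)  # ILIDSGTTLTIL
```

Under that coordinate assumption, the two windows contain Asp-Thr-Gly and Asp-Ser-Gly respectively, the kind of motif expected at the catalytic aspartates of an aspartic peptidase.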

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None