CmoCh04G015950.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh04G015950.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionEukaryotic aspartyl protease family protein
LocationCmo_Chr04 : 8139050 .. 8140507 (+)
Sequence length1458
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCACCGCCTTTTTTCTCTTCCTTCTTCCCCTTCTTTACGCCTCTTTTTCCGGCGTTCTCTCCCGGTCCTTGCCTCACGCCCCTCAAACCTCCGTGCTCGATGTCGACGCTTCCATTCACACCACTCGCCAGGTCTTTGCATTTCAGCCTCAATCTCCAATGCGAGATGAAAGTACCGTCTCGGACTCTTCTTCCTTGACTCTGCTGTTGAATTCCAGGGTTTCCATTATGAAAACCTCGCACACTGACTACAAATCCCTCACGTTATCCAGACTCTACCGCGACTCTGCTCGAGTCAGATCTTTGACTGCAAGGATGGATCTAGCCATTCGAGGTATTACTGGAGCGGATCTTGAACCTCTCTGGAATGGTGGTGGTTCGCAATTTGGGGCGGAGGATTTTGAGAGTCCGATTGTCTCAGGCGCGAGTCAGGGGAGCGGTGAGTACTTCTCCCGAGTTGGAATCGGTAGGCCGCCGAGTCCGGTTTACATGGTGCTCGATACTGGTAGTGATGTAAGTTGGGTACAATGCGCGCCTTGTGCTGATTGCTACGAGCAAACCGATCCAATTTTTGAGCCTACTTCCTCTGCTTCTTTCACGTCTCTGTCCTGCCAAACACAGCAATGTAAATCGCTCGATGTTTCTGAGTGCCGGAATGGTACTTGTCTCTACGAGGTCTCTTATGGCGATGGTTCTTACACCGTCGGCGATTTCGTTACTGAAACTGTTACTCTCGGCTCGACTTCTCTCACAAATATCGCTCTAGGCTGTGGCCATAACAATGAGGGTTTGTTCATCGGCGCCGCCGGTTTGCTCGGACTAGGAGGCGGCTCGCTCTCGTTCCCTTCGCAGCTTAATGCCTCGTCTTTTTCGTACTGTCTTGTGGACCGTGACTCTGAATCCACCTCGACTCTCGATTTCAACTCACCGATTCCTCCCGATGCCGTAACAGCGCCGCTGCACCGGAACCCTAATTTGGACACGTTTTTCTACCTCGGCATGACAGGGATGAGCGTCGGAGGTGAAATTCTTCCGATTCCCGAGACGTCGTTCCAAATGAGCCAAGACGGAAACGGCGGCATCATCATTGACTCCGGCACCGCCGTGACGCGGTTGCAGACCACCGCTTATAACTTGTTGCGCGACGCGTTCGTTAAGAAGACGCACGATTTGCAGTCCGCACGTGGCGTGGCGTTGTTTGATACTTGTTACGACTTGTCGTCGAAGTCGAGAGTCGAAGTGCCGACGGTGTCATTTCACTTCGCGGACGGGAAAGAGTTGCCGTTGCCGGCAAAAAATTACCTGATACCGGTTGACTCGGAAGGAACATTTTGTTTCGCCTTTGCCCCTACCGATTCAACATTGTCAATACTCGGAAACGCACAGCAGCAAGGGACACGTGTCAGTTTCGACCTCGCTAATTCGCTCGTTGGGTTCTCCTCCAACAAATGCTAA

mRNA sequence

ATGGCCACCGCCTTTTTTCTCTTCCTTCTTCCCCTTCTTTACGCCTCTTTTTCCGGCGTTCTCTCCCGGTCCTTGCCTCACGCCCCTCAAACCTCCGTGCTCGATGTCGACGCTTCCATTCACACCACTCGCCAGGTCTTTGCATTTCAGCCTCAATCTCCAATGCGAGATGAAAGTACCGTCTCGGACTCTTCTTCCTTGACTCTGCTGTTGAATTCCAGGGTTTCCATTATGAAAACCTCGCACACTGACTACAAATCCCTCACGTTATCCAGACTCTACCGCGACTCTGCTCGAGTCAGATCTTTGACTGCAAGGATGGATCTAGCCATTCGAGGTATTACTGGAGCGGATCTTGAACCTCTCTGGAATGGTGGTGGTTCGCAATTTGGGGCGGAGGATTTTGAGAGTCCGATTGTCTCAGGCGCGAGTCAGGGGAGCGGTGAGTACTTCTCCCGAGTTGGAATCGGTAGGCCGCCGAGTCCGGTTTACATGGTGCTCGATACTGGTAGTGATGTAAGTTGGGTACAATGCGCGCCTTGTGCTGATTGCTACGAGCAAACCGATCCAATTTTTGAGCCTACTTCCTCTGCTTCTTTCACGTCTCTGTCCTGCCAAACACAGCAATGTAAATCGCTCGATGTTTCTGAGTGCCGGAATGGTACTTGTCTCTACGAGGTCTCTTATGGCGATGGTTCTTACACCGTCGGCGATTTCGTTACTGAAACTGTTACTCTCGGCTCGACTTCTCTCACAAATATCGCTCTAGGCTGTGGCCATAACAATGAGGGTTTGTTCATCGGCGCCGCCGGTTTGCTCGGACTAGGAGGCGGCTCGCTCTCGTTCCCTTCGCAGCTTAATGCCTCGTCTTTTTCGTACTGTCTTGTGGACCGTGACTCTGAATCCACCTCGACTCTCGATTTCAACTCACCGATTCCTCCCGATGCCGTAACAGCGCCGCTGCACCGGAACCCTAATTTGGACACGTTTTTCTACCTCGGCATGACAGGGATGAGCGTCGGAGGTGAAATTCTTCCGATTCCCGAGACGTCGTTCCAAATGAGCCAAGACGGAAACGGCGGCATCATCATTGACTCCGGCACCGCCGTGACGCGGTTGCAGACCACCGCTTATAACTTGTTGCGCGACGCGTTCGTTAAGAAGACGCACGATTTGCAGTCCGCACGTGGCGTGGCGTTGTTTGATACTTGTTACGACTTGTCGTCGAAGTCGAGAGTCGAAGTGCCGACGGTGTCATTTCACTTCGCGGACGGGAAAGAGTTGCCGTTGCCGGCAAAAAATTACCTGATACCGGTTGACTCGGAAGGAACATTTTGTTTCGCCTTTGCCCCTACCGATTCAACATTGTCAATACTCGGAAACGCACAGCAGCAAGGGACACGTGTCAGTTTCGACCTCGCTAATTCGCTCGTTGGGTTCTCCTCCAACAAATGCTAA

Coding sequence (CDS)

ATGGCCACCGCCTTTTTTCTCTTCCTTCTTCCCCTTCTTTACGCCTCTTTTTCCGGCGTTCTCTCCCGGTCCTTGCCTCACGCCCCTCAAACCTCCGTGCTCGATGTCGACGCTTCCATTCACACCACTCGCCAGGTCTTTGCATTTCAGCCTCAATCTCCAATGCGAGATGAAAGTACCGTCTCGGACTCTTCTTCCTTGACTCTGCTGTTGAATTCCAGGGTTTCCATTATGAAAACCTCGCACACTGACTACAAATCCCTCACGTTATCCAGACTCTACCGCGACTCTGCTCGAGTCAGATCTTTGACTGCAAGGATGGATCTAGCCATTCGAGGTATTACTGGAGCGGATCTTGAACCTCTCTGGAATGGTGGTGGTTCGCAATTTGGGGCGGAGGATTTTGAGAGTCCGATTGTCTCAGGCGCGAGTCAGGGGAGCGGTGAGTACTTCTCCCGAGTTGGAATCGGTAGGCCGCCGAGTCCGGTTTACATGGTGCTCGATACTGGTAGTGATGTAAGTTGGGTACAATGCGCGCCTTGTGCTGATTGCTACGAGCAAACCGATCCAATTTTTGAGCCTACTTCCTCTGCTTCTTTCACGTCTCTGTCCTGCCAAACACAGCAATGTAAATCGCTCGATGTTTCTGAGTGCCGGAATGGTACTTGTCTCTACGAGGTCTCTTATGGCGATGGTTCTTACACCGTCGGCGATTTCGTTACTGAAACTGTTACTCTCGGCTCGACTTCTCTCACAAATATCGCTCTAGGCTGTGGCCATAACAATGAGGGTTTGTTCATCGGCGCCGCCGGTTTGCTCGGACTAGGAGGCGGCTCGCTCTCGTTCCCTTCGCAGCTTAATGCCTCGTCTTTTTCGTACTGTCTTGTGGACCGTGACTCTGAATCCACCTCGACTCTCGATTTCAACTCACCGATTCCTCCCGATGCCGTAACAGCGCCGCTGCACCGGAACCCTAATTTGGACACGTTTTTCTACCTCGGCATGACAGGGATGAGCGTCGGAGGTGAAATTCTTCCGATTCCCGAGACGTCGTTCCAAATGAGCCAAGACGGAAACGGCGGCATCATCATTGACTCCGGCACCGCCGTGACGCGGTTGCAGACCACCGCTTATAACTTGTTGCGCGACGCGTTCGTTAAGAAGACGCACGATTTGCAGTCCGCACGTGGCGTGGCGTTGTTTGATACTTGTTACGACTTGTCGTCGAAGTCGAGAGTCGAAGTGCCGACGGTGTCATTTCACTTCGCGGACGGGAAAGAGTTGCCGTTGCCGGCAAAAAATTACCTGATACCGGTTGACTCGGAAGGAACATTTTGTTTCGCCTTTGCCCCTACCGATTCAACATTGTCAATACTCGGAAACGCACAGCAGCAAGGGACACGTGTCAGTTTCGACCTCGCTAATTCGCTCGTTGGGTTCTCCTCCAACAAATGCTAA
BLAST of CmoCh04G015950.1 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 538.9 bits (1387), Expect = 5.8e-152
Identity = 274/501 (54.69%), Postives = 364/501 (72.65%), Query Frame = 1

Query: 1   MATAFFLFLLPL----LYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQ---- 60
           MA   FL LL +    L+ + +   SRSL   P+T+VLDV +S+  T+ + +  P     
Sbjct: 1   MAFPRFLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSL 60

Query: 61  -----SPMRDESTVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARM 120
                  + D    + SS L+L L+SR + + + H DYKSLTLSRL RDS+RV  + A++
Sbjct: 61  TTTKPESLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKI 120

Query: 121 DLAIRGITGADLEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVL 180
             A+ G+  +DL+P++N   +++  ED  +P+VSGASQGSGEYFSR+G+G P   +Y+VL
Sbjct: 121 RFAVEGVDRSDLKPVYNED-TRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVL 180

Query: 181 DTGSDVSWVQCAPCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEV 240
           DTGSDV+W+QC PCADCY+Q+DP+F PTSS+++ SL+C   QC  L+ S CR+  CLY+V
Sbjct: 181 DTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQV 240

Query: 241 SYGDGSYTVGDFVTETVTLG-STSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQL 300
           SYGDGS+TVG+  T+TVT G S  + N+ALGCGH+NEGLF GAAGLLGLGGG LS  +Q+
Sbjct: 241 SYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM 300

Query: 301 NASSFSYCLVDRDSESTSTLDFNS-PIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEIL 360
            A+SFSYCLVDRDS  +S+LDFNS  +     TAPL RN  +DTF+Y+G++G SVGGE +
Sbjct: 301 KATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKV 360

Query: 361 PIPETSFQMSQDGNGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDL-QSARGVALFDTC 420
            +P+  F +   G+GG+I+D GTAVTRLQT AYN LRDAF+K T +L + +  ++LFDTC
Sbjct: 361 VLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTC 420

Query: 421 YDLSSKSRVEVPTVSFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQ 480
           YD SS S V+VPTV+FHF  GK L LPAKNYLIPVD  GTFCFAFAPT S+LSI+GN QQ
Sbjct: 421 YDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQ 480

Query: 481 QGTRVSFDLANSLVGFSSNKC 486
           QGTR+++DL+ +++G S NKC
Sbjct: 481 QGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh04G015950.1 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 7.6e-112
Identity = 220/485 (45.36%), Postives = 298/485 (61.44%), Query Frame = 1

Query: 5   FFLFLLPLLYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSPMRDESTVSDS 64
           FF FL   L+ S S  +S      P   ++DV     T          +   DES    S
Sbjct: 6   FFFFLHLHLHLSSSSSIS-----FPDFQIIDVLQPPLTVTATLPDFNNTHFSDES----S 65

Query: 65  SSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWN 124
           S  TL L  R      ++ ++     +R+ RD+ RV ++  R+   +  I  +D      
Sbjct: 66  SKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKV--IPSSD------ 125

Query: 125 GGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADC 184
              S++   DF S IVSG  QGSGEYF R+G+G PP   YMV+D+GSD+ WVQC PC  C
Sbjct: 126 ---SRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC 185

Query: 185 YEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 244
           Y+Q+DP+F+P  S S+T +SC +  C  ++ S C +G C YEV YGDGSYT G    ET+
Sbjct: 186 YKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETL 245

Query: 245 TLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSE 304
           T   T + N+A+GCGH N G+FIGAAGLLG+GGGS+SF  QL+     +F YCLV R ++
Sbjct: 246 TFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTD 305

Query: 305 STSTLDF-NSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNG 364
           ST +L F    +P  A   PL RNP   +F+Y+G+ G+ VGG  +P+P+  F +++ G+G
Sbjct: 306 STGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDG 365

Query: 365 GIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSF 424
           G+++D+GTAVTRL T AY   RD F  +T +L  A GV++FDTCYDLS    V VPTVSF
Sbjct: 366 GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 425

Query: 425 HFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGF 484
           +F +G  L LPA+N+L+PVD  GT+CFAFA + + LSI+GN QQ+G +VSFD AN  VGF
Sbjct: 426 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 470

Query: 485 SSNKC 486
             N C
Sbjct: 486 GPNVC 470

BLAST of CmoCh04G015950.1 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 394.8 bits (1013), Expect = 1.3e-108
Identity = 224/443 (50.56%), Postives = 285/443 (64.33%), Query Frame = 1

Query: 52  QSPMRDESTVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSL-TARMDLA 111
           +S     S    SSS+TL L+   ++      D   L  SRL RDS RV+S+ T    + 
Sbjct: 57  ESEFESGSDSESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSIATLAAQIP 116

Query: 112 IRGITGADLEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTG 171
            R +T A        GG       F S +VSG SQGSGEYF+R+G+G P   VYMVLDTG
Sbjct: 117 GRNVTHAP-----RPGG-------FSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTG 176

Query: 172 SDVSWVQCAPCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSEC--RNGTCLYEVS 231
           SD+ W+QCAPC  CY Q+DPIF+P  S ++ ++ C +  C+ LD + C  R  TCLY+VS
Sbjct: 177 SDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVS 236

Query: 232 YGDGSYTVGDFVTETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN- 291
           YGDGS+TVGDF TET+T     +  +ALGCGH+NEGLF+GAAGLLGLG G LSFP Q   
Sbjct: 237 YGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGH 296

Query: 292 --ASSFSYCLVDRDSES--TSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGE 351
                FSYCLVDR + S  +S +  N+ +   A   PL  NP LDTF+Y+G+ G+SVGG 
Sbjct: 297 RFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGT 356

Query: 352 ILP-IPETSFQMSQDGNGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFD 411
            +P +  + F++ Q GNGG+IIDSGT+VTRL   AY  +RDAF      L+ A   +LFD
Sbjct: 357 RVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFD 416

Query: 412 TCYDLSSKSRVEVPTVSFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNA 471
           TC+DLS+ + V+VPTV  HF  G ++ LPA NYLIPVD+ G FCFAFA T   LSI+GN 
Sbjct: 417 TCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNI 476

Query: 472 QQQGTRVSFDLANSLVGFSSNKC 486
           QQQG RV +DLA+S VGF+   C
Sbjct: 477 QQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CmoCh04G015950.1 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 4.2e-70
Identity = 144/355 (40.56%), Postives = 199/355 (56.06%), Query Frame = 1

Query: 137 SPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPTS 196
           S + +    G GEY   + IG P  P   ++DTGSD+ W QC PC  C+ Q+ PIF P  
Sbjct: 82  SGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQG 141

Query: 197 SASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLTNIAL 256
           S+SF++L C +Q C++L    C N  C Y   YGDGS T G   TET+T GS S+ NI  
Sbjct: 142 SSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITF 201

Query: 257 GCGHNNEGLFIG-AAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTSTLDFNS---PI 316
           GCG NN+G   G  AGL+G+G G LS PSQL+ + FSYC+    S + S L   S    +
Sbjct: 202 GCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSV 261

Query: 317 PPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQM-SQDGNGGIIIDSGTAVT 376
              +    L ++  + TF+Y+ + G+SVG   LPI  ++F + S +G GGIIIDSGT +T
Sbjct: 262 TAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLT 321

Query: 377 RLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDL-SSKSRVEVPTVSFHFADGKELPL 436
                AY  +R  F+ + +        + FD C+   S  S +++PT   HF DG +L L
Sbjct: 322 YFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHF-DGGDLEL 381

Query: 437 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNKC 486
           P++NY I   S G  C A   +   +SI GN QQQ   V +D  NS+V F+S +C
Sbjct: 382 PSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmoCh04G015950.1 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 3.0e-68
Identity = 147/356 (41.29%), Postives = 198/356 (55.62%), Query Frame = 1

Query: 136 ESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPT 195
           E+P+ +G     GEY   V IG P S    ++DTGSD+ W QC PC  C+ Q  PIF P 
Sbjct: 86  ETPVYAG----DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQ 145

Query: 196 SSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLTNIA 255
            S+SF++L C++Q C+ L    C N  C Y   YGDGS T G   TET T  ++S+ NIA
Sbjct: 146 DSSSFSTLPCESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIA 205

Query: 256 LGCGHNNEGLFIG-AAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTSTLDFNSP--- 315
            GCG +N+G   G  AGL+G+G G LS PSQL    FSYC+    S S STL   S    
Sbjct: 206 FGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASG 265

Query: 316 IPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIIIDSGTAVT 375
           +P  + +  L  +    T++Y+ + G++VGG+ L IP ++FQ+  DG GG+IIDSGT +T
Sbjct: 266 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 325

Query: 376 RLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDL-SSKSRVEVPTVSFHFADGKELPL 435
            L   AYN +  AF  + +        +   TC+   S  S V+VP +S  F DG  L L
Sbjct: 326 YLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF-DGGVLNL 385

Query: 436 PAKNYLIPVDSEGTFCFAFAPTDST-LSILGNAQQQGTRVSFDLANSLVGFSSNKC 486
             +N LI   +EG  C A   +    +SI GN QQQ T+V +DL N  V F   +C
Sbjct: 386 GEQNILIS-PAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmoCh04G015950.1 vs. TrEMBL
Match: A0A0A0KUG1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G608450 PE=3 SV=1)

HSP 1 Score: 837.4 bits (2162), Expect = 8.7e-240
Identity = 423/481 (87.94%), Postives = 447/481 (92.93%), Query Frame = 1

Query: 7   LFLLPLLYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSPMRDESTVSDSSS 66
           LFLL LL++S S    R+L   P TSVLDV ASI  T+QVFA +P+S   DE+TVSD SS
Sbjct: 6   LFLLSLLFSSLSAFHCRTLHPTPTTSVLDVAASIQRTQQVFAVEPKSSTPDETTVSDPSS 65

Query: 67  LTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWNGG 126
           L+L LNSR+S+MK SH+DYKSLTLSRL RDSARVRSLTAR+DLAIRGITG DLEPL NGG
Sbjct: 66  LSLQLNSRISVMKASHSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGG 125

Query: 127 G--SQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADC 186
           G  SQFG EDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCA+C
Sbjct: 126 GGGSQFGTEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAEC 185

Query: 187 YEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 246
           YEQTDPIFEPTSSASFTSLSC+T+QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV
Sbjct: 186 YEQTDPIFEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 245

Query: 247 TLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTS 306
           TLGSTSL NIA+GCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDS+STS
Sbjct: 246 TLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTS 305

Query: 307 TLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIII 366
           TLDFNSPI PDAVTAPLHRNPNLDTFFYLG+TGMSVGG +LPIPETSFQMS+DGNGGII+
Sbjct: 306 TLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIV 365

Query: 367 DSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHFAD 426
           DSGTAVTRLQTT YN+LRDAFVK THDLQ+ARGVALFDTCYDLSSKSRVEVPTVSFHFA+
Sbjct: 366 DSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFAN 425

Query: 427 GKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNK 486
           G ELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRV FDLANSLVGFS NK
Sbjct: 426 GNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNK 485

BLAST of CmoCh04G015950.1 vs. TrEMBL
Match: B9HWK9_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0010s13830g PE=3 SV=1)

HSP 1 Score: 684.9 bits (1766), Expect = 7.3e-194
Identity = 341/487 (70.02%), Postives = 403/487 (82.75%), Query Frame = 1

Query: 1   MATAFFLFLLPLLYASFSGVLSRSL-PHAPQTSVLDVDASIHTTRQVFAFQPQ-SPMRDE 60
           M   F++F   L +AS     SR L PH  +T+VLDV ASI  T+ +F+  P+ SP   +
Sbjct: 1   MGLLFYVFF-SLFFASPPVSCSRILTPHPSETTVLDVAASIQRTKNIFSSGPKMSPFNQQ 60

Query: 61  STVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGAD 120
              + SS LT+ L SR SI KT+HT YKSLTLSRL RDSARV+SL  R+DLAI  I+ +D
Sbjct: 61  EKETTSSELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSD 120

Query: 121 LEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC 180
           L+PL     S+F  ED +SPI+SG SQGSGEYFSRVGIG+PPS  Y++LDTGSDV+WVQC
Sbjct: 121 LKPLETD--SEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQC 180

Query: 181 APCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGD 240
           APCADCY+Q DPIFEP SSASF++LSC T+QC+SLDVSECRN TCLYEVSYGDGSYTVGD
Sbjct: 181 APCADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGD 240

Query: 241 FVTETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDR 300
           FVTET+TLGS  + N+A+GCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NA+SFSYCLVDR
Sbjct: 241 FVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDR 300

Query: 301 DSESTSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDG 360
           DSES STL+FNS +PP+AV+APL RN +LDTF+Y+G+TG+SVGGE++ IPE++FQ+ + G
Sbjct: 301 DSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESG 360

Query: 361 NGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTV 420
           NGG+I+DSGTA+TRLQT  YN LRDAFVK+T DL S  G+ALFDTCYDLSSK  VEVPTV
Sbjct: 361 NGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTV 420

Query: 421 SFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLV 480
           SFHF DGKELPLPAKNYL+P+DSEGTFCFAFAPT S+LSI+GN QQQGTRV +DL N LV
Sbjct: 421 SFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLV 480

Query: 481 GFSSNKC 486
           GF  NKC
Sbjct: 481 GFVPNKC 484

BLAST of CmoCh04G015950.1 vs. TrEMBL
Match: V4SSZ0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031364mg PE=3 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 8.9e-192
Identity = 345/484 (71.28%), Postives = 403/484 (83.26%), Query Frame = 1

Query: 5   FFLFLLPLLYASFSGVLSRSLPHAP---QTSVLDVDASIHTTRQVFAFQPQSPMRDESTV 64
           F +    LL+AS     SR+ PHA     T+ LDV ASI  T + F+F P++  +   + 
Sbjct: 5   FHVLSAALLFASSPFGDSRTTPHASISVTTTTLDVSASIQNTLKPFSFDPRTTPQSLIS- 64

Query: 65  SDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEP 124
           S SSSL L L+SR S+ +TSH DYKSLTL+RL RDSARVRSL AR+DLAIRGI  +DL+P
Sbjct: 65  SSSSSLALQLHSRTSVQRTSHNDYKSLTLARLERDSARVRSLAARLDLAIRGIATSDLKP 124

Query: 125 LWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC 184
           L    GS+F AE+ +SPIVSG+SQGSGEYFSRVGIG+PPS VYMVLDTGSDV+W+QCAPC
Sbjct: 125 L--DSGSEFEAEEIQSPIVSGSSQGSGEYFSRVGIGKPPSQVYMVLDTGSDVNWLQCAPC 184

Query: 185 ADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVT 244
           ADCY+Q DPIFEPTSS+S++ L+C T+QC+SLD SECRN TCLYEVSYGDGSYTVGDFVT
Sbjct: 185 ADCYQQADPIFEPTSSSSYSPLTCNTKQCQSLDESECRNNTCLYEVSYGDGSYTVGDFVT 244

Query: 245 ETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSE 304
           ETVTLGS S+ NIA+GCGHNNEGLF+GAAGLLGLGGG LSFPSQ+NAS+FSYCLVDRDS 
Sbjct: 245 ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGLLSFPSQINASTFSYCLVDRDSG 304

Query: 305 STSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGG 364
           STSTL+F+S +PP+AVTAPL RN  LDTF+YLG+TG+SVGG++LPI ET+F++ + GNGG
Sbjct: 305 STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 364

Query: 365 IIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFH 424
           II+DSGTAVTRLQT  YN LRDAFV+ T  L    GVALFDTCYD SS+S VEVPTVSFH
Sbjct: 365 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 424

Query: 425 FADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFS 484
           F +GK LPLPAKN+LIPVDS GTFCFAFAPT S+LSI+GN QQQGTRVSF+L NSLVGF+
Sbjct: 425 FPEGKVLPLPAKNFLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLVGFT 484

Query: 485 SNKC 486
            NKC
Sbjct: 485 PNKC 485

BLAST of CmoCh04G015950.1 vs. TrEMBL
Match: A0A067K2W2_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18299 PE=3 SV=1)

HSP 1 Score: 676.4 bits (1744), Expect = 2.6e-191
Identity = 337/476 (70.80%), Postives = 394/476 (82.77%), Query Frame = 1

Query: 13  LYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQS--PMRDESTVSDSSS-LTL 72
           L  +F    SRSL H+  T +LDV ASI  T+ +F+   ++  P   +   S SSS +T+
Sbjct: 11  LLLTFPFAYSRSLSHSSTTIILDVKASIQKTKDIFSTDAKTTMPFNQQGKGSSSSSWVTM 70

Query: 73  LLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWNGGGSQ 132
            L+SR SI KTSHTDYKSLTL+RL RDSARVRSLT R+DL I+G + +DL+PL  G   +
Sbjct: 71  ELHSRNSIQKTSHTDYKSLTLARLQRDSARVRSLTTRLDLVIQGFSTSDLKPL--GSDLE 130

Query: 133 FGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADCYEQTD 192
           F AED + PIVSG SQGSGEYFSRVGIG+PPS VY+VLDTGSDV+W+QCAPCADCY+Q D
Sbjct: 131 FKAEDLQGPIVSGTSQGSGEYFSRVGIGKPPSSVYLVLDTGSDVNWLQCAPCADCYQQAD 190

Query: 193 PIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST 252
           PIFEP SS S++ L+C +++CKSLDVSECRNG+CLYEVSYGDGSYTVGD+VTET+TLGS 
Sbjct: 191 PIFEPASSTSYSPLTCDSKECKSLDVSECRNGSCLYEVSYGDGSYTVGDYVTETITLGSA 250

Query: 253 SLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTSTLDFN 312
           S+ N+A+GCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NA+SFSYCLVDRDS+S STL+FN
Sbjct: 251 SVENVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSDSASTLEFN 310

Query: 313 SPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIIIDSGTA 372
           SPI P AVTAPL RN  LDTF+Y+GMTG+SVGGE+L IPE++F++ + GNGGII+DSGTA
Sbjct: 311 SPILPSAVTAPLLRNHELDTFYYIGMTGLSVGGELLSIPESAFKIDESGNGGIIVDSGTA 370

Query: 373 VTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHFADGKELP 432
           +TRLQT  YN LRDAFVK T  L S   VALFDTCYDLSSK  VEVP +SFHF DGK LP
Sbjct: 371 ITRLQTDVYNSLRDAFVKGTEGLPSTNSVALFDTCYDLSSKYSVEVPALSFHFPDGKVLP 430

Query: 433 LPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNKC 486
           LPAKNYLIPVDS+GTFCFAFAPT S LSI+GN QQQGTRVSFDLANS +GF  NKC
Sbjct: 431 LPAKNYLIPVDSDGTFCFAFAPTASALSIIGNVQQQGTRVSFDLANSRIGFEPNKC 484

BLAST of CmoCh04G015950.1 vs. TrEMBL
Match: A0A061EAV1_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_011849 PE=3 SV=1)

HSP 1 Score: 670.2 bits (1728), Expect = 1.8e-189
Identity = 338/471 (71.76%), Postives = 393/471 (83.44%), Query Frame = 1

Query: 22  SRSLP--HAPQTSVLDVDASIHTTRQVFAFQPQ-----SPMRDESTVSDSSSLTLLLNSR 81
           SRSLP  H P T+VLDV  ++  TR VF+F P      SP+    + S SS L+L + SR
Sbjct: 22  SRSLPQSHLP-TTVLDVAEALEKTRNVFSFDPTKKPAFSPVDQSLSASSSSLLSLQVYSR 81

Query: 82  VSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWNGGGSQFGAED 141
            S+ K+SH DYKSLTLSRL RDS RVRSLT R+DLA+ GI+ +DLEPL    GS+F AE+
Sbjct: 82  ASVHKSSHLDYKSLTLSRLKRDSGRVRSLTTRLDLAVNGISRSDLEPL--DIGSEFSAEE 141

Query: 142 FESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEP 201
            E PIVSG+SQGSGEYFSRVGIG+PPS VYMVLDTGSDV+WVQCAPCADCY+Q DPIFEP
Sbjct: 142 MEGPIVSGSSQGSGEYFSRVGIGKPPSQVYMVLDTGSDVNWVQCAPCADCYQQADPIFEP 201

Query: 202 TSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLTNI 261
           +SS++++ LSC+TQQCK LD SECRN TCLYEVSYGDGSYTVGDFVTET+TLGS S+ N+
Sbjct: 202 SSSSTYSPLSCETQQCKYLDTSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSASVDNV 261

Query: 262 ALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTSTLDFNSPIPP 321
           A+GCGHNNEGLF+GAAGLLGLGGG LSF SQLNASSFSYCLVDRDS+S STL+F+S +PP
Sbjct: 262 AIGCGHNNEGLFVGAAGLLGLGGGPLSFSSQLNASSFSYCLVDRDSDSASTLEFDSALPP 321

Query: 322 DAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIIIDSGTAVTRLQ 381
           +AV APL RN  LDTF+YLG+TG+SVGGE+LPIP+++FQM + GNGG IIDSGTAVTRLQ
Sbjct: 322 NAVKAPLLRNHQLDTFYYLGLTGISVGGELLPIPQSAFQMDESGNGGTIIDSGTAVTRLQ 381

Query: 382 TTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHFADGKELPLPAKN 441
           +  Y++LRDAFVK T +L S   VALFDTCYDLS +S V+VPTVSFHF +G+ LPLPAKN
Sbjct: 382 SDTYDILRDAFVKGTKNLPSTDSVALFDTCYDLSKRSSVDVPTVSFHFPEGQVLPLPAKN 441

Query: 442 YLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNKC 486
           YLIPVDSEGTFCFAFAPT S+LSI+GN QQQGTRV FDL NSLV F  +KC
Sbjct: 442 YLIPVDSEGTFCFAFAPTSSSLSIIGNVQQQGTRVGFDLGNSLVEFVPDKC 489

BLAST of CmoCh04G015950.1 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 627.5 bits (1617), Expect = 7.0e-180
Identity = 317/483 (65.63%), Postives = 388/483 (80.33%), Query Frame = 1

Query: 5   FFLFLLPLLYASFSGVLSRSLPHAPQT--SVLDVDASIHTTRQVFAFQPQSPMRDESTVS 64
           FF+F L     S S V SR LP    T  S+L+V  SIH T+   +F+     ++E T S
Sbjct: 9   FFIFFL----TSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQ--QEEQTHS 68

Query: 65  DSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPL 124
            SSS +L L+SRVS+  T H+DYKSLTL+RL RD+ARV+SL  R+DLAI  I+ ADL+P+
Sbjct: 69  ASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPI 128

Query: 125 WNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCA 184
                ++   +D E+P++SG +QGSGEYF+RVGIG+P   VYMVLDTGSDV+W+QC PCA
Sbjct: 129 STMYTTE--EQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCA 188

Query: 185 DCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTE 244
           DCY QT+PIFEP+SS+S+  LSC T QC +L+VSECRN TCLYEVSYGDGSYTVGDF TE
Sbjct: 189 DCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATE 248

Query: 245 TVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSES 304
           T+T+GST + N+A+GCGH+NEGLF+GAAGLLGLGGG L+ PSQLN +SFSYCLVDRDS+S
Sbjct: 249 TLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDS 308

Query: 305 TSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGI 364
            ST+DF + + PDAV APL RN  LDTF+YLG+TG+SVGGE+L IP++SF+M + G+GGI
Sbjct: 309 ASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI 368

Query: 365 IIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHF 424
           IIDSGTAVTRLQT  YN LRD+FVK T DL+ A GVA+FDTCY+LS+K+ VEVPTV+FHF
Sbjct: 369 IIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHF 428

Query: 425 ADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSS 484
             GK L LPAKNY+IPVDS GTFC AFAPT S+L+I+GN QQQGTRV+FDLANSL+GFSS
Sbjct: 429 PGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSS 483

Query: 485 NKC 486
           NKC
Sbjct: 489 NKC 483

BLAST of CmoCh04G015950.1 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 538.9 bits (1387), Expect = 3.3e-153
Identity = 274/501 (54.69%), Postives = 364/501 (72.65%), Query Frame = 1

Query: 1   MATAFFLFLLPL----LYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQ---- 60
           MA   FL LL +    L+ + +   SRSL   P+T+VLDV +S+  T+ + +  P     
Sbjct: 1   MAFPRFLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSL 60

Query: 61  -----SPMRDESTVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARM 120
                  + D    + SS L+L L+SR + + + H DYKSLTLSRL RDS+RV  + A++
Sbjct: 61  TTTKPESLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKI 120

Query: 121 DLAIRGITGADLEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVL 180
             A+ G+  +DL+P++N   +++  ED  +P+VSGASQGSGEYFSR+G+G P   +Y+VL
Sbjct: 121 RFAVEGVDRSDLKPVYNED-TRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVL 180

Query: 181 DTGSDVSWVQCAPCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEV 240
           DTGSDV+W+QC PCADCY+Q+DP+F PTSS+++ SL+C   QC  L+ S CR+  CLY+V
Sbjct: 181 DTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQV 240

Query: 241 SYGDGSYTVGDFVTETVTLG-STSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQL 300
           SYGDGS+TVG+  T+TVT G S  + N+ALGCGH+NEGLF GAAGLLGLGGG LS  +Q+
Sbjct: 241 SYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM 300

Query: 301 NASSFSYCLVDRDSESTSTLDFNS-PIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEIL 360
            A+SFSYCLVDRDS  +S+LDFNS  +     TAPL RN  +DTF+Y+G++G SVGGE +
Sbjct: 301 KATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKV 360

Query: 361 PIPETSFQMSQDGNGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDL-QSARGVALFDTC 420
            +P+  F +   G+GG+I+D GTAVTRLQT AYN LRDAF+K T +L + +  ++LFDTC
Sbjct: 361 VLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTC 420

Query: 421 YDLSSKSRVEVPTVSFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQ 480
           YD SS S V+VPTV+FHF  GK L LPAKNYLIPVD  GTFCFAFAPT S+LSI+GN QQ
Sbjct: 421 YDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQ 480

Query: 481 QGTRVSFDLANSLVGFSSNKC 486
           QGTR+++DL+ +++G S NKC
Sbjct: 481 QGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh04G015950.1 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 405.6 bits (1041), Expect = 4.3e-113
Identity = 220/485 (45.36%), Postives = 298/485 (61.44%), Query Frame = 1

Query: 5   FFLFLLPLLYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSPMRDESTVSDS 64
           FF FL   L+ S S  +S      P   ++DV     T          +   DES    S
Sbjct: 6   FFFFLHLHLHLSSSSSIS-----FPDFQIIDVLQPPLTVTATLPDFNNTHFSDES----S 65

Query: 65  SSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWN 124
           S  TL L  R      ++ ++     +R+ RD+ RV ++  R+   +  I  +D      
Sbjct: 66  SKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKV--IPSSD------ 125

Query: 125 GGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADC 184
              S++   DF S IVSG  QGSGEYF R+G+G PP   YMV+D+GSD+ WVQC PC  C
Sbjct: 126 ---SRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC 185

Query: 185 YEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 244
           Y+Q+DP+F+P  S S+T +SC +  C  ++ S C +G C YEV YGDGSYT G    ET+
Sbjct: 186 YKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETL 245

Query: 245 TLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSE 304
           T   T + N+A+GCGH N G+FIGAAGLLG+GGGS+SF  QL+     +F YCLV R ++
Sbjct: 246 TFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTD 305

Query: 305 STSTLDF-NSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNG 364
           ST +L F    +P  A   PL RNP   +F+Y+G+ G+ VGG  +P+P+  F +++ G+G
Sbjct: 306 STGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDG 365

Query: 365 GIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSF 424
           G+++D+GTAVTRL T AY   RD F  +T +L  A GV++FDTCYDLS    V VPTVSF
Sbjct: 366 GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 425

Query: 425 HFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGF 484
           +F +G  L LPA+N+L+PVD  GT+CFAFA + + LSI+GN QQ+G +VSFD AN  VGF
Sbjct: 426 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 470

Query: 485 SSNKC 486
             N C
Sbjct: 486 GPNVC 470

BLAST of CmoCh04G015950.1 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 402.5 bits (1033), Expect = 3.6e-112
Identity = 226/445 (50.79%), Postives = 289/445 (64.94%), Query Frame = 1

Query: 55  MRDESTVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGI 114
           + DES    ++SL++ L+   ++   S      L   RL RDS RV+S+T+   ++  G 
Sbjct: 49  LTDESLSESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVST-GR 108

Query: 115 TGADLEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVS 174
                 P   GG        F   ++SG SQGSGEYF R+G+G P + VYMVLDTGSDV 
Sbjct: 109 NATKRTPRTAGG--------FSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVV 168

Query: 175 WVQCAPCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSL-DVSEC---RNGTCLYEVSYG 234
           W+QC+PC  CY QTD IF+P  S +F ++ C ++ C+ L D SEC   R+ TCLY+VSYG
Sbjct: 169 WLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYG 228

Query: 235 DGSYTVGDFVTETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN--- 294
           DGS+T GDF TET+T     + ++ LGCGH+NEGLF+GAAGLLGLG G LSFPSQ     
Sbjct: 229 DGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRY 288

Query: 295 ASSFSYCLVDRDSEST-----STLDF-NSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVG 354
              FSYCLVDR S  +     ST+ F N+ +P  +V  PL  NP LDTF+YL + G+SVG
Sbjct: 289 NGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVG 348

Query: 355 GEILP-IPETSFQMSQDGNGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVAL 414
           G  +P + E+ F++   GNGG+IIDSGT+VTRL   AY  LRDAF      L+ A   +L
Sbjct: 349 GSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSL 408

Query: 415 FDTCYDLSSKSRVEVPTVSFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILG 474
           FDTC+DLS  + V+VPTV FHF  G E+ LPA NYLIPV++EG FCFAFA T  +LSI+G
Sbjct: 409 FDTCFDLSGMTTVKVPTVVFHFGGG-EVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIG 468

Query: 475 NAQQQGTRVSFDLANSLVGFSSNKC 486
           N QQQG RV++DL  S VGF S  C
Sbjct: 469 NIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CmoCh04G015950.1 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 394.8 bits (1013), Expect = 7.6e-110
Identity = 224/443 (50.56%), Postives = 285/443 (64.33%), Query Frame = 1

Query: 52  QSPMRDESTVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSL-TARMDLA 111
           +S     S    SSS+TL L+   ++      D   L  SRL RDS RV+S+ T    + 
Sbjct: 57  ESEFESGSDSESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSIATLAAQIP 116

Query: 112 IRGITGADLEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTG 171
            R +T A        GG       F S +VSG SQGSGEYF+R+G+G P   VYMVLDTG
Sbjct: 117 GRNVTHAP-----RPGG-------FSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTG 176

Query: 172 SDVSWVQCAPCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSEC--RNGTCLYEVS 231
           SD+ W+QCAPC  CY Q+DPIF+P  S ++ ++ C +  C+ LD + C  R  TCLY+VS
Sbjct: 177 SDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVS 236

Query: 232 YGDGSYTVGDFVTETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN- 291
           YGDGS+TVGDF TET+T     +  +ALGCGH+NEGLF+GAAGLLGLG G LSFP Q   
Sbjct: 237 YGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGH 296

Query: 292 --ASSFSYCLVDRDSES--TSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGE 351
                FSYCLVDR + S  +S +  N+ +   A   PL  NP LDTF+Y+G+ G+SVGG 
Sbjct: 297 RFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGT 356

Query: 352 ILP-IPETSFQMSQDGNGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFD 411
            +P +  + F++ Q GNGG+IIDSGT+VTRL   AY  +RDAF      L+ A   +LFD
Sbjct: 357 RVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFD 416

Query: 412 TCYDLSSKSRVEVPTVSFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNA 471
           TC+DLS+ + V+VPTV  HF  G ++ LPA NYLIPVD+ G FCFAFA T   LSI+GN 
Sbjct: 417 TCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNI 476

Query: 472 QQQGTRVSFDLANSLVGFSSNKC 486
           QQQG RV +DLA+S VGF+   C
Sbjct: 477 QQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CmoCh04G015950.1 vs. NCBI nr
Match: gi|659091469|ref|XP_008446567.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 844.3 bits (2180), Expect = 1.0e-241
Identity = 424/481 (88.15%), Postives = 451/481 (93.76%), Query Frame = 1

Query: 7   LFLLPLLYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSPMRDESTVSDSSS 66
           LFLL LL++S S  L R+LP  P+TSVLDV ASI  T+Q+FA +P+S   DE TVSDSSS
Sbjct: 6   LFLLSLLFSSLSAFLCRTLPPTPRTSVLDVAASIQRTQQIFAMEPKSSTPDEITVSDSSS 65

Query: 67  LTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWNG- 126
           L+L LNSR+S+MKTSH+DYKSLTLSRL RDSARV+SLTAR+DLAIRGITG DLEPL NG 
Sbjct: 66  LSLQLNSRISVMKTSHSDYKSLTLSRLKRDSARVKSLTARIDLAIRGITGTDLEPLGNGD 125

Query: 127 -GGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADC 186
            GGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCA+C
Sbjct: 126 GGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAEC 185

Query: 187 YEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 246
           YEQTDPIFEPTSSASFTSLSC+T+QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV
Sbjct: 186 YEQTDPIFEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 245

Query: 247 TLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTS 306
           TLGSTSL NIA+GCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDS+STS
Sbjct: 246 TLGSTSLRNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTS 305

Query: 307 TLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIII 366
           TLDFNSPI PDAVTAPLHRNPNLDTFFYLG+TGMSVGG +LPIPETSFQMS+DGNGGII+
Sbjct: 306 TLDFNSPISPDAVTAPLHRNPNLDTFFYLGLTGMSVGGTVLPIPETSFQMSEDGNGGIIV 365

Query: 367 DSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHFAD 426
           DSGTAVTRLQTT YN+LRD+FVK THDLQSARGVALFDTCYDLSSKS VEVPTVSFHFA+
Sbjct: 366 DSGTAVTRLQTTVYNVLRDSFVKSTHDLQSARGVALFDTCYDLSSKSSVEVPTVSFHFAN 425

Query: 427 GKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNK 486
           G ELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRV FDL+NSLVGFS NK
Sbjct: 426 GNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLSNSLVGFSPNK 485

BLAST of CmoCh04G015950.1 vs. NCBI nr
Match: gi|449434646|ref|XP_004135107.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis sativus])

HSP 1 Score: 837.4 bits (2162), Expect = 1.3e-239
Identity = 423/481 (87.94%), Postives = 447/481 (92.93%), Query Frame = 1

Query: 7   LFLLPLLYASFSGVLSRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSPMRDESTVSDSSS 66
           LFLL LL++S S    R+L   P TSVLDV ASI  T+QVFA +P+S   DE+TVSD SS
Sbjct: 6   LFLLSLLFSSLSAFHCRTLHPTPTTSVLDVAASIQRTQQVFAVEPKSSTPDETTVSDPSS 65

Query: 67  LTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWNGG 126
           L+L LNSR+S+MK SH+DYKSLTLSRL RDSARVRSLTAR+DLAIRGITG DLEPL NGG
Sbjct: 66  LSLQLNSRISVMKASHSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGG 125

Query: 127 G--SQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADC 186
           G  SQFG EDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCA+C
Sbjct: 126 GGGSQFGTEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAEC 185

Query: 187 YEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 246
           YEQTDPIFEPTSSASFTSLSC+T+QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV
Sbjct: 186 YEQTDPIFEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETV 245

Query: 247 TLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTS 306
           TLGSTSL NIA+GCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDS+STS
Sbjct: 246 TLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTS 305

Query: 307 TLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIII 366
           TLDFNSPI PDAVTAPLHRNPNLDTFFYLG+TGMSVGG +LPIPETSFQMS+DGNGGII+
Sbjct: 306 TLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIV 365

Query: 367 DSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHFAD 426
           DSGTAVTRLQTT YN+LRDAFVK THDLQ+ARGVALFDTCYDLSSKSRVEVPTVSFHFA+
Sbjct: 366 DSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFAN 425

Query: 427 GKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNK 486
           G ELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRV FDLANSLVGFS NK
Sbjct: 426 GNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNK 485

BLAST of CmoCh04G015950.1 vs. NCBI nr
Match: gi|1009155353|ref|XP_015895667.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Ziziphus jujuba])

HSP 1 Score: 700.3 bits (1806), Expect = 2.4e-198
Identity = 350/485 (72.16%), Postives = 410/485 (84.54%), Query Frame = 1

Query: 6   FLFLLPLLYASFSGVL-SRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSP----MRDEST 65
           FLF +   + S SG++ SR+L    +T+VLDV A    T    +   +       +++S 
Sbjct: 3   FLFYILFFFFSSSGIVHSRNLLGNSKTTVLDVAALTQETINALSLDSKPTEAFNQQEQSF 62

Query: 66  VSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLE 125
            + SSSL+L L+SR+SI + SH DYKSLTL+RL RDSARV+S+T R+DLA+ GIT +DL+
Sbjct: 63  PASSSSLSLQLHSRISIHRPSHGDYKSLTLARLERDSARVKSITTRVDLALGGITHSDLK 122

Query: 126 PLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAP 185
           P+  G G +FGAED + PIVSG SQGSGEYFSRVGIG PPS VYMVLDTGSDV+WVQCAP
Sbjct: 123 PVDTGKGLEFGAEDIQGPIVSGTSQGSGEYFSRVGIGNPPSQVYMVLDTGSDVNWVQCAP 182

Query: 186 CADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFV 245
           CADCY+Q DPIF+PTSS++++ LSCQTQQCKSLD SECRNG+CLYEVSYGDGSYTVGDFV
Sbjct: 183 CADCYQQADPIFQPTSSSTYSPLSCQTQQCKSLDESECRNGSCLYEVSYGDGSYTVGDFV 242

Query: 246 TETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDS 305
           TET+TLGS S+  +A+GCGHNNEGLFIGAAGL+GLGGGSLSFPSQ+NA+SFSYCLVDRDS
Sbjct: 243 TETITLGSASVNGVAIGCGHNNEGLFIGAAGLMGLGGGSLSFPSQINATSFSYCLVDRDS 302

Query: 306 ESTSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNG 365
           +S STL+F+SP+P +AVTAPLHRNP LDTF+YLGM G+SVGG++LPI E+SFQ+++DGNG
Sbjct: 303 DSASTLEFDSPLPRNAVTAPLHRNPQLDTFYYLGMKGLSVGGQLLPISESSFQLTEDGNG 362

Query: 366 GIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSF 425
           GII+DSGTAVTRLQT  YN+LRDAFVK T  L SA GVALFDTCYDLSSKS VEVPT+SF
Sbjct: 363 GIIVDSGTAVTRLQTDTYNVLRDAFVKGTKHLPSANGVALFDTCYDLSSKSSVEVPTLSF 422

Query: 426 HFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGF 485
           HF DGKELPLPAKNYLIPVDS GTFCFAFAPT S+LSI+GN QQQGTRV FDLANSLVGF
Sbjct: 423 HFPDGKELPLPAKNYLIPVDSAGTFCFAFAPTSSSLSIIGNVQQQGTRVGFDLANSLVGF 482

BLAST of CmoCh04G015950.1 vs. NCBI nr
Match: gi|224111722|ref|XP_002315953.1| (aspartyl protease family protein [Populus trichocarpa])

HSP 1 Score: 684.9 bits (1766), Expect = 1.0e-193
Identity = 341/487 (70.02%), Postives = 403/487 (82.75%), Query Frame = 1

Query: 1   MATAFFLFLLPLLYASFSGVLSRSL-PHAPQTSVLDVDASIHTTRQVFAFQPQ-SPMRDE 60
           M   F++F   L +AS     SR L PH  +T+VLDV ASI  T+ +F+  P+ SP   +
Sbjct: 1   MGLLFYVFF-SLFFASPPVSCSRILTPHPSETTVLDVAASIQRTKNIFSSGPKMSPFNQQ 60

Query: 61  STVSDSSSLTLLLNSRVSIMKTSHTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGAD 120
              + SS LT+ L SR SI KT+HT YKSLTLSRL RDSARV+SL  R+DLAI  I+ +D
Sbjct: 61  EKETTSSELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSD 120

Query: 121 LEPLWNGGGSQFGAEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC 180
           L+PL     S+F  ED +SPI+SG SQGSGEYFSRVGIG+PPS  Y++LDTGSDV+WVQC
Sbjct: 121 LKPLETD--SEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQC 180

Query: 181 APCADCYEQTDPIFEPTSSASFTSLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGD 240
           APCADCY+Q DPIFEP SSASF++LSC T+QC+SLDVSECRN TCLYEVSYGDGSYTVGD
Sbjct: 181 APCADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGD 240

Query: 241 FVTETVTLGSTSLTNIALGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDR 300
           FVTET+TLGS  + N+A+GCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NA+SFSYCLVDR
Sbjct: 241 FVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDR 300

Query: 301 DSESTSTLDFNSPIPPDAVTAPLHRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDG 360
           DSES STL+FNS +PP+AV+APL RN +LDTF+Y+G+TG+SVGGE++ IPE++FQ+ + G
Sbjct: 301 DSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESG 360

Query: 361 NGGIIIDSGTAVTRLQTTAYNLLRDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTV 420
           NGG+I+DSGTA+TRLQT  YN LRDAFVK+T DL S  G+ALFDTCYDLSSK  VEVPTV
Sbjct: 361 NGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTV 420

Query: 421 SFHFADGKELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLV 480
           SFHF DGKELPLPAKNYL+P+DSEGTFCFAFAPT S+LSI+GN QQQGTRV +DL N LV
Sbjct: 421 SFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLV 480

Query: 481 GFSSNKC 486
           GF  NKC
Sbjct: 481 GFVPNKC 484

BLAST of CmoCh04G015950.1 vs. NCBI nr
Match: gi|694417006|ref|XP_009336599.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Pyrus x bretschneideri])

HSP 1 Score: 681.4 bits (1757), Expect = 1.2e-192
Identity = 333/464 (71.77%), Postives = 388/464 (83.62%), Query Frame = 1

Query: 22  SRSLPHAPQTSVLDVDASIHTTRQVFAFQPQSPMRDESTVSDSSSLTLLLNSRVSIMKTS 81
           SRS P   +T+VLDV ASI TT +  + +  S      +  D SSL++ L+SR+S+ K S
Sbjct: 25  SRSSPLTSKTTVLDVAASIRTTLRALSSEDTSRTAQALSQQDHSSLSVPLHSRISLHKPS 84

Query: 82  HTDYKSLTLSRLYRDSARVRSLTARMDLAIRGITGADLEPLWNGGGSQFGAEDFESPIVS 141
           H+DYKSLTL+RL RDSARVRSLT R+DLA+RG+  +DL+P+  G G Q  A+ FE PI+S
Sbjct: 85  HSDYKSLTLARLERDSARVRSLTTRLDLAVRGVATSDLKPVETGSGLQLDADGFEGPIIS 144

Query: 142 GASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCADCYEQTDPIFEPTSSASFT 201
           G SQGSGEYFSRVGIG+PPS  Y+VLDTGSD+SWVQCAPCADCY+Q DPIFEP SS SF+
Sbjct: 145 GTSQGSGEYFSRVGIGKPPSQAYVVLDTGSDISWVQCAPCADCYQQADPIFEPASSTSFS 204

Query: 202 SLSCQTQQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLTNIALGCGHN 261
            LSC++QQCKSLDV ECRNGTCLYEV+YGDGSYTVGDFVTET+++G  S   IA+GCGH 
Sbjct: 205 PLSCESQQCKSLDVFECRNGTCLYEVAYGDGSYTVGDFVTETISIGGASAKEIAIGCGHT 264

Query: 262 NEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSESTSTLDFNSPIPPDAVTAPL 321
           NEGLF+GAAGLLGLGGGSLSFPSQLNA+S SYCLVDRDS+S STLDFNSP+ P+AVTAPL
Sbjct: 265 NEGLFVGAAGLLGLGGGSLSFPSQLNATSLSYCLVDRDSDSASTLDFNSPLRPNAVTAPL 324

Query: 322 HRNPNLDTFFYLGMTGMSVGGEILPIPETSFQMSQDGNGGIIIDSGTAVTRLQTTAYNLL 381
            RN  LDTF+YLG+TG+SVGG +LPIPE++FQ+   GNGGIIIDSGTAVTRLQT  YN+L
Sbjct: 325 RRNSQLDTFYYLGLTGLSVGGSLLPIPESAFQIDGSGNGGIIIDSGTAVTRLQTDTYNVL 384

Query: 382 RDAFVKKTHDLQSARGVALFDTCYDLSSKSRVEVPTVSFHFADGKELPLPAKNYLIPVDS 441
           RDAF+K T DL   +G ALFD CYDLSS+ RVEVPTVSFHFADGK LPLPAKN+LIPVDS
Sbjct: 385 RDAFMKGTKDLPFTKGPALFDACYDLSSRKRVEVPTVSFHFADGKVLPLPAKNFLIPVDS 444

Query: 442 EGTFCFAFAPTDSTLSILGNAQQQGTRVSFDLANSLVGFSSNKC 486
           +GTFCFAFAPT S+LSI+GN QQQGTRV FDL NS+VGFS N+C
Sbjct: 445 DGTFCFAFAPTPSSLSIIGNVQQQGTRVGFDLVNSVVGFSLNQC 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASPG1_ARATH5.8e-15254.69Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
ASPG2_ARATH7.6e-11245.36Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
APF2_ARATH1.3e-10850.56Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
NEP1_NEPGR4.2e-7040.56Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR3.0e-6841.29Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KUG1_CUCSA8.7e-24087.94Uncharacterized protein OS=Cucumis sativus GN=Csa_5G608450 PE=3 SV=1[more]
B9HWK9_POPTR7.3e-19470.02Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0010s13830g PE=... [more]
V4SSZ0_9ROSI8.9e-19271.28Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031364mg PE=3 SV=1[more]
A0A067K2W2_JATCU2.6e-19170.80Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18299 PE=3 SV=1[more]
A0A061EAV1_THECC1.8e-18971.76Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_011849 PE=... [more]
Match NameE-valueIdentityDescription
AT1G25510.17.0e-18065.63 Eukaryotic aspartyl protease family protein[more]
AT3G18490.13.3e-15354.69 Eukaryotic aspartyl protease family protein[more]
AT3G20015.14.3e-11345.36 Eukaryotic aspartyl protease family protein[more]
AT3G61820.13.6e-11250.79 Eukaryotic aspartyl protease family protein[more]
AT1G01300.17.6e-11050.56 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659091469|ref|XP_008446567.1|1.0e-24188.15PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo][more]
gi|449434646|ref|XP_004135107.1|1.3e-23987.94PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis sativus][more]
gi|1009155353|ref|XP_015895667.1|2.4e-19872.16PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Ziziphus jujuba][more]
gi|224111722|ref|XP_002315953.1|1.0e-19370.02aspartyl protease family protein [Populus trichocarpa][more]
gi|694417006|ref|XP_009336599.1|1.2e-19271.77PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Pyrus x bretschneider... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh04G015950CmoCh04G015950gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh04G015950.1CmoCh04G015950.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh04G015950.1.exon.1CmoCh04G015950.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh04G015950.1.CDS.1CmoCh04G015950.1.CDS.1CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 57..119
score: 2.0E-260coord: 4..21
score: 2.0E-260coord: 137..485
score: 2.0E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 165..176
score: -coord: 362..373
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 137..304
score: 3.2E-34coord: 315..485
score: 5.6
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 144..485
score: 1.09
NoneNo IPR availablePANTHERPTHR13683:SF274ASPARTYL PROTEASE FAMILY PROTEINcoord: 57..119
score: 2.0E-260coord: 4..21
score: 2.0E-260coord: 137..485
score: 2.0E