CmoCh18G008610 (gene) Cucurbita moschata (Rifu)

NameCmoCh18G008610
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionEukaryotic aspartyl protease family protein
LocationCmo_Chr18 : 9934686 .. 9937125 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACACTCTAATTCAAATCAAAAGCCATGAACACAAAACCCTCCACTTTCCTCCTCTCTCTCCTCCTCCTCCTCCGCCTCCTCGCCGCCGCTTCCGGCGGCTCCCTCCGCGTCTTCCACATCTCCATCCATTCCAAGCCGCTGTCATGGGCCGACGGAGTACTCCAAATGCAGGCCAAGGACGAAGCTCGTCTCCAGCTACTCTCCTCCCTAGTCGCCCGCCGCTCCGACGCGCCCATCGCCTCCGGCCGCCAGGTGATCCAGAGCCCGTCGTACGTGGTCCGGGCAAACATCGGAACCCCTGCCCAGACGCTGCTATTGGCCCTGGATACCAGCAATGATGCCACGTGGATTCCGTGTTCCGGCTGCGTGGGCTGCCCGGGAACCTCCGTTTTCTCCTCTGATCAATCCTCCTCCTTCCGGCCGCTCCCCTGCCAGTCCCCGCAATGCAGCCAGGTAGGGCCCACTCCATTTTCTTTAATTATAAATAAATAAATAAATTCATGATATTTTAAGGGCATTATTGTCAGAAAAACAGTTCTTATTGGATTTCCGTACTAATTAAAAAAAAAAGTCTTTTTATTTTCATATACTATTGGGTATCTCTATAATGTCTGTCTGTCAGTCACCGACTGACTGTTCAGTCACGGACCAACAAATTTTTCCAATTCCCTGTCCACACTTGCACGTGATCTCTTGTTTTTTTTTTTCTAATTAATAAATATATTATTTATTTTTGTACTAATAATAATCTTACCATTTAAAAATAAAAGATAAAATACACTTATTGTAATTTTTTAATAATTTTATTTAATATTATTGAATTTTTTAAACATTTTTTTATAGAAATTTTAATTAATTTAATGTACTCTTAAAAGTTTTAATAATATTCTAAACTTATTTAAATAAAAAATATTAATATTTTATTTAAAACTATTTGACTTTTAGAAAAAAAAATTGGAGGGTATTTTTGAAAATTTAAGTATATATTTTAAACCAATTACAAAATTTAAGAGGTTTTTTATTAATTTAATTTTTTTTTTTTTACTTTTAACCAAATCATAAATTTAGGTTATGAATTGTCAAAATTAAAATTTGGAATTTTAGTATGAAATAAGATTATATTTAAATTTATAATTAATTTTAGAATTTGAAAAAATATTTATGACTCACAAAATTTCTTTTAACGTTAATCTCAGGTACCGAACCCCAGTTGCGGTACCAGCGCGTGCGGTTTCAACATGACATACGGCAGCTCGACGGTGGCGGCGGATTTGGTTCAAGACAACATAACGCTGGCTAGGGACTCGGTTCCGGCGTTCACATTCGGGTGCATCACGAAGGCCACCGGGAGCTCAGTGCCGCCGCAGGGGCTATTGGGGCTGGGCCGTGGCTCATTGTCGCTCTTGTCGCAGACCCAGAGTCTGTACCAGTCCACATTCTCCTACTGCCTTCCCAGCTTCAAATCGGCTAGCTTTTCTGGGTCGTTGCGGCTCGGGCCAGTGGCTCAGCCTAAGAGGATTAAGTACACACCGCTCCTCCGCAACCCGAGACGGTCGTCGTTGTATTATGTCAATTTGATCGGGATTAGAGTTGGTAGTAAAGTCGTCGACATTCCTTCTAGTGCTTTCGCGTTTAACACCGCCACGGGCTCCGGCACCATCATTGATTCAGGTAAGATTGTTTGTCGGGAATCCTAGATTAACTAATTTAAGGAATGATCATATATTTATAAATAAATCATACATTTAAAAAATAAAAAACAAAATTACGATAGCTTATGATCATATTATTAAATGTAAGTTTAATTTTGATTGGTATAATTATTCAGGAACGACGTTCACAAGACTAGTGGCGCCGGCGTACACGGCAGTGAGGGACGAGTTCCAGCGAAGAGTAGGCAACGCAACCGTCTCATCTCTTGGTGGCTTCGACACTTGCTACACCGTCCCCATCGTCTCCCCCACCATAACCTTCATGTTCGACGCAATGAACGTCACTCTCCCACCGGACAACTTCCTCATCCACAGCACCGCCGGATCCACCACCTGCCTTGCCATGGCCTCGGCGCCCGACAATGTCAACTCTGTCCTCAACGTTATTGCCAACATGCAACAACAAAACCACCGTATCGTCTTCGACATTCCCAATTCCCGTGTCGGTGTCGCCCGCGAACCGTGCTCTTAGAACGTGTCGTCTCCGATTCCGGTGATGTTTTACTTTCGGAGGTCTTAGGATAAGAGTGTCCGGTCATGGTTTGAGTTGGTTATGGGAAGGGGGTAAAATAGTAATTTTGAATTGGTGTTGAATTGAAAAGAGTGTTTTTATGGTTGTGTTATTGTGTTTCTTTTTATTTATTATTAGTTTGACATTTTAATAAAAATATTTAATTTTAAATTTATTTTTCAAAGACCATTAATATAATTATGATGTTTTATTCGAA

mRNA sequence

ACACTCTAATTCAAATCAAAAGCCATGAACACAAAACCCTCCACTTTCCTCCTCTCTCTCCTCCTCCTCCTCCGCCTCCTCGCCGCCGCTTCCGGCGGCTCCCTCCGCGTCTTCCACATCTCCATCCATTCCAAGCCGCTGTCATGGGCCGACGGAGTACTCCAAATGCAGGCCAAGGACGAAGCTCGTCTCCAGCTACTCTCCTCCCTAGTCGCCCGCCGCTCCGACGCGCCCATCGCCTCCGGCCGCCAGGTGATCCAGAGCCCGTCGTACGTGGTCCGGGCAAACATCGGAACCCCTGCCCAGACGCTGCTATTGGCCCTGGATACCAGCAATGATGCCACGTGGATTCCGTGTTCCGGCTGCGTGGGCTGCCCGGGAACCTCCGTTTTCTCCTCTGATCAATCCTCCTCCTTCCGGCCGCTCCCCTGCCAGTCCCCGCAATGCAGCCAGGTACCGAACCCCAGTTGCGGTACCAGCGCGTGCGGTTTCAACATGACATACGGCAGCTCGACGGTGGCGGCGGATTTGGTTCAAGACAACATAACGCTGGCTAGGGACTCGGTTCCGGCGTTCACATTCGGGTGCATCACGAAGGCCACCGGGAGCTCAGTGCCGCCGCAGGGGCTATTGGGGCTGGGCCGTGGCTCATTGTCGCTCTTGTCGCAGACCCAGAGTCTGTACCAGTCCACATTCTCCTACTGCCTTCCCAGCTTCAAATCGGCTAGCTTTTCTGGGTCGTTGCGGCTCGGGCCAGTGGCTCAGCCTAAGAGGATTAAGTACACACCGCTCCTCCGCAACCCGAGACGGTCGTCGTTGTATTATGTCAATTTGATCGGGATTAGAGTTGGTAGTAAAGTCGTCGACATTCCTTCTAGTGCTTTCGCGTTTAACACCGCCACGGGCTCCGGCACCATCATTGATTCAGGAACGACGTTCACAAGACTAGTGGCGCCGGCGTACACGGCAGTGAGGGACGAGTTCCAGCGAAGAGTAGGCAACGCAACCGTCTCATCTCTTGGTGGCTTCGACACTTGCTACACCGTCCCCATCGTCTCCCCCACCATAACCTTCATGTTCGACGCAATGAACGTCACTCTCCCACCGGACAACTTCCTCATCCACAGCACCGCCGGATCCACCACCTGCCTTGCCATGGCCTCGGCGCCCGACAATGTCAACTCTGTCCTCAACGTTATTGCCAACATGCAACAACAAAACCACCGTATCGTCTTCGACATTCCCAATTCCCGTGTCGGTGTCGCCCGCGAACCGTGCTCTTAGAACGTGTCGTCTCCGATTCCGGTGATGTTTTACTTTCGGAGGTCTTAGGATAAGAGTGTCCGGTCATGGTTTGAGTTGGTTATGGGAAGGGGGTAAAATAGTAATTTTGAATTGGTGTTGAATTGAAAAGAGTGTTTTTATGGTTGTGTTATTGTGTTTCTTTTTATTTATTATTAGTTTGACATTTTAATAAAAATATTTAATTTTAAATTTATTTTTCAAAGACCATTAATATAATTATGATGTTTTATTCGAA

Coding sequence (CDS)

ATGAACACAAAACCCTCCACTTTCCTCCTCTCTCTCCTCCTCCTCCTCCGCCTCCTCGCCGCCGCTTCCGGCGGCTCCCTCCGCGTCTTCCACATCTCCATCCATTCCAAGCCGCTGTCATGGGCCGACGGAGTACTCCAAATGCAGGCCAAGGACGAAGCTCGTCTCCAGCTACTCTCCTCCCTAGTCGCCCGCCGCTCCGACGCGCCCATCGCCTCCGGCCGCCAGGTGATCCAGAGCCCGTCGTACGTGGTCCGGGCAAACATCGGAACCCCTGCCCAGACGCTGCTATTGGCCCTGGATACCAGCAATGATGCCACGTGGATTCCGTGTTCCGGCTGCGTGGGCTGCCCGGGAACCTCCGTTTTCTCCTCTGATCAATCCTCCTCCTTCCGGCCGCTCCCCTGCCAGTCCCCGCAATGCAGCCAGGTACCGAACCCCAGTTGCGGTACCAGCGCGTGCGGTTTCAACATGACATACGGCAGCTCGACGGTGGCGGCGGATTTGGTTCAAGACAACATAACGCTGGCTAGGGACTCGGTTCCGGCGTTCACATTCGGGTGCATCACGAAGGCCACCGGGAGCTCAGTGCCGCCGCAGGGGCTATTGGGGCTGGGCCGTGGCTCATTGTCGCTCTTGTCGCAGACCCAGAGTCTGTACCAGTCCACATTCTCCTACTGCCTTCCCAGCTTCAAATCGGCTAGCTTTTCTGGGTCGTTGCGGCTCGGGCCAGTGGCTCAGCCTAAGAGGATTAAGTACACACCGCTCCTCCGCAACCCGAGACGGTCGTCGTTGTATTATGTCAATTTGATCGGGATTAGAGTTGGTAGTAAAGTCGTCGACATTCCTTCTAGTGCTTTCGCGTTTAACACCGCCACGGGCTCCGGCACCATCATTGATTCAGGAACGACGTTCACAAGACTAGTGGCGCCGGCGTACACGGCAGTGAGGGACGAGTTCCAGCGAAGAGTAGGCAACGCAACCGTCTCATCTCTTGGTGGCTTCGACACTTGCTACACCGTCCCCATCGTCTCCCCCACCATAACCTTCATGTTCGACGCAATGAACGTCACTCTCCCACCGGACAACTTCCTCATCCACAGCACCGCCGGATCCACCACCTGCCTTGCCATGGCCTCGGCGCCCGACAATGTCAACTCTGTCCTCAACGTTATTGCCAACATGCAACAACAAAACCACCGTATCGTCTTCGACATTCCCAATTCCCGTGTCGGTGTCGCCCGCGAACCGTGCTCTTAG
BLAST of CmoCh18G008610 vs. Swiss-Prot
Match: AED3_ARATH (Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 6.8e-109
Identity = 207/389 (53.21%), Postives = 274/389 (70.44%), Query Frame = 1

Query: 43  DGVLQMQAKDEARLQLLSSLVARR---SDAPIASGRQVIQSPSYVVRANIGTPAQTLLLA 102
           D VL M + D  RL  LSSLVA +   +  P+ASG Q +   +YVVRA +GTP Q + + 
Sbjct: 62  DTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQ-LHIGNYVVRAKLGTPPQLMFMV 121

Query: 103 LDTSNDATWIPCSGCVGCPGTSV-FSSDQSSSFRPLPCQSPQCSQVPNPSCGTSA----- 162
           LDTSNDA W+PCSGC GC   S  F+++ SS++  + C + QC+Q    +C +S+     
Sbjct: 122 LDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSV 181

Query: 163 CGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSL 222
           C FN +YG  S+ +A LVQD +TLA D +P F+FGCI  A+G+S+PPQGL+GLGRG +SL
Sbjct: 182 CSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSL 241

Query: 223 LSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIG 282
           +SQT SLY   FSYCLPSF+S  FSGSL+LG + QPK I+YTPLLRNPRR SLYYVNL G
Sbjct: 242 VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTG 301

Query: 283 IRVGSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVGNATVSSL 342
           + VGS  V +      F+  +G+GTIIDSGT  TR   P Y A+RDEF+++V  ++ S+L
Sbjct: 302 VSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTL 361

Query: 343 GGFDTCYTV--PIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLN 402
           G FDTC++     V+P IT    ++++ LP +N LIHS+AG+ TCL+MA    N N+VLN
Sbjct: 362 GAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 421

Query: 403 VIANMQQQNHRIVFDIPNSRVGVAREPCS 420
           VIAN+QQQN RI+FD+PNSR+G+A EPC+
Sbjct: 422 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CmoCh18G008610 vs. Swiss-Prot
Match: AP25_ORYSJ (Aspartyl protease 25 OS=Oryza sativa subsp. japonica GN=AP25 PE=2 SV=1)

HSP 1 Score: 369.8 bits (948), Expect = 4.0e-101
Identity = 212/439 (48.29%), Postives = 283/439 (64.46%), Query Frame = 1

Query: 6   STFLLSLLLLLRLLAAASGGSLRVFHISIHSKPLSWADGVLQMQAKDEARLQLLSSLVAR 65
           +T +  LLLLL    AA+   L V+H ++H    S  + ++ +   D+ARL  LSS  A 
Sbjct: 4   TTTIPLLLLLLAATVAAAAAELSVYH-NVHPSSPSPLESIIALARDDDARLLFLSSKAAT 63

Query: 66  R--SDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTSVF 125
              S AP+ASG+     PSYVVRA +G+P+Q LLLALDTS DATW  CS C  CP +S+F
Sbjct: 64  AGVSSAPVASGQA---PPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLF 123

Query: 126 SSDQSSSFRPLPCQSPQC-----SQVPNPSCGTSA---------CGFNMTYGSSTVAADL 185
           +   SSS+  LPC S  C        P P  G  A         C F+  +  ++  A L
Sbjct: 124 APANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAAL 183

Query: 186 VQDNITLARDSVPAFTFGCITKATG--SSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYC 245
             D + L +D++P +TFGC++  TG  +++P QGLLGLGRG ++LLSQ  SLY   FSYC
Sbjct: 184 ASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYC 243

Query: 246 LPSFKSASFSGSLRLGPVA-QPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSA 305
           LPS++S  FSGSLRLG    QP+ ++YTP+LRNP RSSLYYVN+ G+ VG   V +P+ +
Sbjct: 244 LPSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGS 303

Query: 306 FAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVGNAT-VSSLGGFDTCYTVPIV- 365
           FAF+ ATG+GT++DSGT  TR  AP Y A+R+EF+R+V   +  +SLG FDTC+    V 
Sbjct: 304 FAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVA 363

Query: 366 ---SPTITFMFD-AMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNH 420
              +P +T   D  +++ LP +N LIHS+A    CLAMA AP NVNSV+NVIAN+QQQN 
Sbjct: 364 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 423

BLAST of CmoCh18G008610 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 6.5e-51
Identity = 126/353 (35.69%), Postives = 182/353 (51.56%), Query Frame = 1

Query: 80  SPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTS--VFSSDQSSSFRPLPCQ 139
           S  Y  R  +GTPA+ + + LDT +D  W+ C+ C  C   S  +F   +S ++  +PC 
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198

Query: 140 SPQCSQVPNPSCGT--SACGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKATG 199
           SP C ++ +  C T    C + ++YG  S    D   + +T  R+ V     GC     G
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEG 258

Query: 200 SSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYT 259
             V   GLLGLG+G LS   QT   +   FSYCL    ++S   S+  G  A  +  ++T
Sbjct: 259 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 318

Query: 260 PLLRNPRRSSLYYVNLIGIRV-GSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAY 319
           PLL NP+  + YYV L+GI V G++V  + +S F  +     G IIDSGT+ TRL+ PAY
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378

Query: 320 TAVRDEFQRRVGNATVS---SLGGFDTCYTV----PIVSPTITFMFDAMNVTLPPDNFLI 379
            A+RD F  RVG  T+        FDTC+ +     +  PT+   F   +V+LP  N+LI
Sbjct: 379 IAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLI 438

Query: 380 HSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPCS 420
                   C A A         L++I N+QQQ  R+V+D+ +SRVG A   C+
Sbjct: 439 PVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmoCh18G008610 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 3.4e-44
Identity = 116/348 (33.33%), Postives = 167/348 (47.99%), Query Frame = 1

Query: 80  SPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTS--VFSSDQSSSFRPLPCQ 139
           S  Y VR  +G+P +   + +D+ +D  W+ C  C  C   S  VF   +S S+  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 140 SPQCSQVPNPSCGTSACGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKATGSS 199
           S  C ++ N  C +  C + + YG  S     L  + +T A+  V     GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 200 VPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPL 259
           +   GLLG+G GS+S + Q        F YCL S +    +GSL  G  A P    + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 260 LRNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAV 319
           +RNPR  S YYV L G+ VG   + +P   F        G ++D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 367

Query: 320 RDEFQRRVGN-ATVSSLGGFDTCYT----VPIVSPTITFMF-DAMNVTLPPDNFLIHSTA 379
           RD F+ +  N    S +  FDTCY     V +  PT++F F +   +TLP  NFL+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 380 GSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPC 419
             T C A A++P      L++I N+QQ+  ++ FD  N  VG     C
Sbjct: 428 SGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmoCh18G008610 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 1.0e-43
Identity = 126/349 (36.10%), Postives = 178/349 (51.00%), Query Frame = 1

Query: 83  YVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGC--PGTSVFSSDQSSSFRPLPCQSPQ 142
           Y++  +IGTPAQ     +DT +D  W  C  C  C    T +F+   SSSF  LPC S  
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 143 CSQVPNPSCGTSACGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKATG-SSVP 202
           C  + +P+C  + C +   YG  S     +  + +T    S+P  TFGC     G     
Sbjct: 155 CQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGN 214

Query: 203 PQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRI--KYTPL 262
             GL+G+GRG LSL SQ   L  + FSYC+    S++ S +L LG +A         T L
Sbjct: 215 GAGLVGMGRGPLSLPSQ---LDVTKFSYCMTPIGSSTPS-NLLLGSLANSVTAGSPNTTL 274

Query: 263 LRNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAFNTATGS-GTIIDSGTTFTRLVAPAYTA 322
           +++ +  + YY+ L G+ VGS  + I  SAFA N+  G+ G IIDSGTT T  V  AY +
Sbjct: 275 IQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQS 334

Query: 323 VRDEFQRRVGNATVS-SLGGFDTCYTVP-----IVSPTITFMFDAMNVTLPPDNFLIHST 382
           VR EF  ++    V+ S  GFD C+  P     +  PT    FD  ++ LP +N+ I S 
Sbjct: 335 VRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFI-SP 394

Query: 383 AGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPC 419
           +    CLAM S+       +++  N+QQQN  +V+D  NS V  A   C
Sbjct: 395 SNGLICLAMGSSSQG----MSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmoCh18G008610 vs. TrEMBL
Match: A0A0A0KTR5_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_5G623870 PE=3 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 8.5e-191
Identity = 356/441 (80.73%), Postives = 381/441 (86.39%), Query Frame = 1

Query: 1   MNTKPSTFLLSLLLLLRLL---------------AAASGGSLRVFHISIH------SKPL 60
           MN   S FLL LLLLL L                AA    +L+VFHI         SKPL
Sbjct: 1   MNNTKSPFLL-LLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPL 60

Query: 61  SWADGVLQMQAKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLA 120
           SWAD VLQMQAKD+ARLQ LSSLVARRS  PIAS RQ+IQSP++VVRA IGTPAQTLLLA
Sbjct: 61  SWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLA 120

Query: 121 LDTSNDATWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMT 180
           LDTSNDA WIPCSGC+GCP T+VFSSD+SSSFRPLPCQSPQC+QVPNPSC  SACGFN+T
Sbjct: 121 LDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLT 180

Query: 181 YGSSTVAADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSL 240
           YGSSTVAADLVQDN+TLA DSVP++TFGCI KATGSSVPPQGLLGLGRG LSLL Q+QSL
Sbjct: 181 YGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSL 240

Query: 241 YQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKV 300
           YQSTFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLI IRVG K+
Sbjct: 241 YQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKI 300

Query: 301 VDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVG-NATVSSLGGFDTC 360
           VDIP SA AFN+ATG+GT+IDSGTTFTRLVAPAYTAVRDEF+RRVG N TVSSLGGFDTC
Sbjct: 301 VDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTC 360

Query: 361 YTVPIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQ 420
           YTVPI+SPTITFMF  MNVTLPPDNFLIHSTAGSTTCLAMA+APDNVNSVLNVIA+MQQQ
Sbjct: 361 YTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQ 420

BLAST of CmoCh18G008610 vs. TrEMBL
Match: B9RG92_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1452350 PE=3 SV=1)

HSP 1 Score: 612.8 bits (1579), Expect = 3.0e-172
Identity = 303/402 (75.37%), Postives = 351/402 (87.31%), Query Frame = 1

Query: 24  GGSLRVFHISIH------SKPLSWADGVLQMQAKDEARLQLLSSLVARRSDAPIASGRQV 83
           G +L+VFH+         SKPL W + VLQMQAKD+ARLQ LSSLVAR+S  PIASGRQ+
Sbjct: 31  GSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAKDQARLQFLSSLVARKSVVPIASGRQI 90

Query: 84  IQSPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQ 143
           +QSP+Y+VRA IGTPAQT+LLA+DTSNDA WIPCSGCVGC  T VF++ +S++F+ + C+
Sbjct: 91  VQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSST-VFNNVKSTTFKTVGCE 150

Query: 144 SPQCSQVPNPSCGTSACGFNMTYGSSTVAADLVQDNITLARDSVPAFTFGCITKATGSSV 203
           +PQC QVPN  CG SAC FNMTYGSS++AA+L QD +TLA DS+P++TFGC+T+ATGSS+
Sbjct: 151 APQCKQVPNSKCGGSACAFNMTYGSSSIAANLSQDVVTLATDSIPSYTFGCLTEATGSSI 210

Query: 204 PPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLL 263
           PPQGLLGLGRG +SLLSQTQ+LYQSTFSYCLPSF+S +FSGSLRLGPV QPKRIK TPLL
Sbjct: 211 PPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLL 270

Query: 264 RNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVR 323
           +NPRRSSLYYVNL+ IRVG +VVDIP SA AFN  TG+GTI DSGT FTRLVAPAYTAVR
Sbjct: 271 KNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVR 330

Query: 324 DEFQRRVGNATVSSLGGFDTCYTVPIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLA 383
           D F++RVGNATV+SLGGFDTCYT PIV+PTITFMF  MNVTLPPDN LIHSTA S TCLA
Sbjct: 331 DAFRKRVGNATVTSLGGFDTCYTSPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLA 390

Query: 384 MASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPCS 420
           MA+APDNVNSVLNVIANMQQQNHRI+FD+PNSR+GVAREPC+
Sbjct: 391 MAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431

BLAST of CmoCh18G008610 vs. TrEMBL
Match: A0A067L7Q3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24539 PE=3 SV=1)

HSP 1 Score: 601.3 bits (1549), Expect = 9.1e-169
Identity = 313/430 (72.79%), Postives = 354/430 (82.33%), Query Frame = 1

Query: 7   TFLLSLLLLLRLLAAA-----------SGGSLRVFHISIH------SKPLSWADGVLQMQ 66
           T LLSL  L   LA              G +L+VFH+         SKPLSW + VLQMQ
Sbjct: 3   THLLSLAFLFFSLAQGLHLNPKCSSQDQGSTLQVFHVYSPCSPFRPSKPLSWEESVLQMQ 62

Query: 67  AKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDATWI 126
           AKD+ARLQ LSSLVA RS  PIASGRQ+IQSP+Y+VRA IGTPAQTLLLA+DTSNDA WI
Sbjct: 63  AKDQARLQFLSSLVAGRSFVPIASGRQIIQSPTYIVRAKIGTPAQTLLLAVDTSNDAAWI 122

Query: 127 PCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMTYGSSTVAADL 186
           PCSGC GC  T VF S +S+SF+ + C +PQC QVPNP+C  SAC FN TYGSS++AA+L
Sbjct: 123 PCSGCDGCSST-VFDSVKSTSFQTVGCGAPQCKQVPNPTCSGSACTFNTTYGSSSIAANL 182

Query: 187 VQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLP 246
            QD ++LA DSVP +TFGCI KATGSSVPPQGLLGLGRG LSLLSQTQ+LYQSTFSYCLP
Sbjct: 183 SQDTVSLATDSVPGYTFGCIAKATGSSVPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLP 242

Query: 247 SFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAF 306
           SF+S +FSG+LRLGP  QPKRIK TPLLRNPRRSSLYYVNL+ IRVG +VVDIP SA AF
Sbjct: 243 SFRSLNFSGTLRLGPNGQPKRIKTTPLLRNPRRSSLYYVNLVAIRVGRRVVDIPPSALAF 302

Query: 307 NTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVGNATVSSLGGFDTCYTVPIVSPTIT 366
           N  TG+GTI DSGT FTRLV PAYTAVRD F++RVGNATV+SLGGFDTCY+VPIV+PTIT
Sbjct: 303 NPTTGAGTIFDSGTVFTRLVTPAYTAVRDAFRKRVGNATVTSLGGFDTCYSVPIVAPTIT 362

Query: 367 FMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNS 420
           FMF  MNVTLPP+N LIHSTAGST+CLA+A+APDNVNSVLNVIANMQQQNHRI+FD+PNS
Sbjct: 363 FMFSGMNVTLPPENLLIHSTAGSTSCLAIAAAPDNVNSVLNVIANMQQQNHRILFDVPNS 422

BLAST of CmoCh18G008610 vs. TrEMBL
Match: I1JW44_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_04G136000 PE=3 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 5.9e-168
Identity = 308/438 (70.32%), Postives = 356/438 (81.28%), Query Frame = 1

Query: 4   KPSTFLLSLLLLLRLLAAASG-----------GSLRVFHISIHS------KPLSWADGVL 63
           K + F LS L L  L +   G            +L VFH+          KPLSWA+ VL
Sbjct: 2   KTTLFSLSPLFLFLLFSLVEGLTPKCDTQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVL 61

Query: 64  QMQAKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDA 123
           Q+QAKD+ARLQ L+S+VA RS  PIASGRQ+IQSP+Y+VRA IG+P QTLLLA+DTSNDA
Sbjct: 62  QLQAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDA 121

Query: 124 TWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMTYGSSTVA 183
            WIPC+ C GC  T +F+ ++S++F+ + C SPQC+QVPNPSCGTSAC FN+TYGSS++A
Sbjct: 122 AWIPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIA 181

Query: 184 ADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSY 243
           A++VQD +TLA D +P +TFGC+ K TG+S PPQGLLGLGRG LSLLSQTQ+LYQSTFSY
Sbjct: 182 ANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 241

Query: 244 CLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSA 303
           CLPSFKS +FSGSLRLGPVAQP RIKYTPLL+NPRRSSLYYVNL+ IRVG KVVDIP  A
Sbjct: 242 CLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEA 301

Query: 304 FAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRV-----GNATVSSLGGFDTCYTV 363
            AFN ATG+GT+ DSGT FTRLVAPAYTAVRDEFQRRV      N TV+SLGGFDTCYTV
Sbjct: 302 LAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTV 361

Query: 364 PIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 420
           PIV+PTITFMF  MNVTLP DN LIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR
Sbjct: 362 PIVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 421

BLAST of CmoCh18G008610 vs. TrEMBL
Match: A0A0B2PGF3_GLYSO (Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_045617 PE=3 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 5.9e-168
Identity = 308/438 (70.32%), Postives = 356/438 (81.28%), Query Frame = 1

Query: 4   KPSTFLLSLLLLLRLLAAASG-----------GSLRVFHISIHS------KPLSWADGVL 63
           K + F LS L L  L +   G            +L VFH+          KPLSWA+ VL
Sbjct: 2   KTTLFSLSPLFLFLLFSLVEGLTPKCDTQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVL 61

Query: 64  QMQAKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDA 123
           Q+QAKD+ARLQ L+S+VA RS  PIASGRQ+IQSP+Y+VRA IG+P QTLLLA+DTSNDA
Sbjct: 62  QLQAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDA 121

Query: 124 TWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMTYGSSTVA 183
            WIPC+ C GC  T +F+ ++S++F+ + C SPQC+QVPNPSCGTSAC FN+TYGSS++A
Sbjct: 122 AWIPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIA 181

Query: 184 ADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSY 243
           A++VQD +TLA D +P +TFGC+ K TG+S PPQGLLGLGRG LSLLSQTQ+LYQSTFSY
Sbjct: 182 ANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 241

Query: 244 CLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSA 303
           CLPSFKS +FSGSLRLGPVAQP RIKYTPLL+NPRRSSLYYVNL+ IRVG KVVDIP  A
Sbjct: 242 CLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEA 301

Query: 304 FAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRV-----GNATVSSLGGFDTCYTV 363
            AFN ATG+GT+ DSGT FTRLVAPAYTAVRDEFQRRV      N TV+SLGGFDTCYTV
Sbjct: 302 LAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTV 361

Query: 364 PIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 420
           PIV+PTITFMF  MNVTLP DN LIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR
Sbjct: 362 PIVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 421

BLAST of CmoCh18G008610 vs. TAIR10
Match: AT5G07030.1 (AT5G07030.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 531.9 bits (1369), Expect = 3.4e-151
Identity = 264/406 (65.02%), Postives = 323/406 (79.56%), Query Frame = 1

Query: 24  GGSLRVFHISI------HSKPLSWADGVLQMQAKDEARLQLLSSLVARRSDAPIASGRQV 83
           G +LR+FHI         S PLSW   VLQ  A+D+ARLQ LSSLVA RS  PIASGRQ+
Sbjct: 50  GSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYLSSLVAGRSVVPIASGRQM 109

Query: 84  IQSPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQ 143
           +QS +Y+V+A IGTPAQ LLLA+DTS+D  WIPCSGCVGCP  + FS  +S+SF+ + C 
Sbjct: 110 LQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCS 169

Query: 144 SPQCSQVPNPSCGTSACGFNMTYGSSTVAADLVQDNITLARDSVPAFTFGCITKATGSSV 203
           +PQC QVPNP+CG  AC FN+TYGSS++AA+L QD I LA D + AFTFGC+ K  G   
Sbjct: 170 APQCKQVPNPTCGARACSFNLTYGSSSIAANLSQDTIRLAADPIKAFTFGCVNKVAGGGT 229

Query: 204 --PPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTP 263
             PPQGLLGLGRG LSL+SQ QS+Y+STFSYCLPSF+S +FSGSLRLGP +QP+R+KYT 
Sbjct: 230 IPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQ 289

Query: 264 LLRNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTA 323
           LLRNPRRSSLYYVNL+ IRVG KVVD+P +A AFN +TG+GTI DSGT +TRL  P Y A
Sbjct: 290 LLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEA 349

Query: 324 VRDEFQRRV--GNATVSSLGGFDTCYTVPIVSPTITFMFDAMNVTLPPDNFLIHSTAGST 383
           VR+EF++RV    A V+SLGGFDTCY+  +  PTITFMF  +N+T+P DN ++HSTAGST
Sbjct: 350 VRNEFRKRVKPTTAVVTSLGGFDTCYSGQVKVPTITFMFKGVNMTMPADNLMLHSTAGST 409

Query: 384 TCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPCS 420
           +CLAMA+AP+NVNSV+NVIA+MQQQNHR++ D+PN R+G+ARE CS
Sbjct: 410 SCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455

BLAST of CmoCh18G008610 vs. TAIR10
Match: AT3G54400.1 (AT3G54400.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 531.6 bits (1368), Expect = 4.5e-151
Identity = 275/425 (64.71%), Postives = 336/425 (79.06%), Query Frame = 1

Query: 9   LLSLLLLLRLLAAAS--------GGSLRVFHISIHSKP----LSWADGVLQMQAKDEARL 68
           LL LL+ L +L + S           LRVFHI+    P    +SWAD +LQ    D+AR 
Sbjct: 5   LLILLISLLILKSESINCNEKSHSSDLRVFHINSLCSPFKTSVSWADTLLQ----DKARF 64

Query: 69  QLLSSLVA-RRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCV 128
             LSSL   R+S  PIASGR ++QSP+Y+VRANIGTPAQ +L+ALDTSNDA WIPCSGCV
Sbjct: 65  LYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCV 124

Query: 129 GCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTS-ACGFNMTYGSSTVAADLVQDNI 188
           GC  + +F   +SSS R L C++PQC Q PNPSC  S +CGFNMTYG ST+ A L QD +
Sbjct: 125 GCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAYLTQDTL 184

Query: 189 TLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSA 248
           TLA D +P +TFGCI KA+G+S+P QGL+GLGRG LSL+SQ+Q+LYQSTFSYCLP+ KS+
Sbjct: 185 TLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSS 244

Query: 249 SFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAFNTATG 308
           +FSGSLRLGP  QP RIK TPLL+NPRRSSLYYVNL+GIRVG+K+VDIP+SA AF+ ATG
Sbjct: 245 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 304

Query: 309 SGTIIDSGTTFTRLVAPAYTAVRDEFQRRVGNATVSSLGGFDTCYTVPIVSPTITFMFDA 368
           +GTI DSGT +TRLV PAY AVR+EF+RRV NA  +SLGGFDTCY+  +V P++TFMF  
Sbjct: 305 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVVFPSVTFMFAG 364

Query: 369 MNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVA 420
           MNVTLPPDN LIHS+AG+ +CLAMA+AP NVNSVLNVIA+MQQQNHR++ D+PNSR+G++
Sbjct: 365 MNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGIS 424

BLAST of CmoCh18G008610 vs. TAIR10
Match: AT1G09750.1 (AT1G09750.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 395.6 bits (1015), Expect = 3.8e-110
Identity = 207/389 (53.21%), Postives = 274/389 (70.44%), Query Frame = 1

Query: 43  DGVLQMQAKDEARLQLLSSLVARR---SDAPIASGRQVIQSPSYVVRANIGTPAQTLLLA 102
           D VL M + D  RL  LSSLVA +   +  P+ASG Q +   +YVVRA +GTP Q + + 
Sbjct: 62  DTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQ-LHIGNYVVRAKLGTPPQLMFMV 121

Query: 103 LDTSNDATWIPCSGCVGCPGTSV-FSSDQSSSFRPLPCQSPQCSQVPNPSCGTSA----- 162
           LDTSNDA W+PCSGC GC   S  F+++ SS++  + C + QC+Q    +C +S+     
Sbjct: 122 LDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSV 181

Query: 163 CGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSL 222
           C FN +YG  S+ +A LVQD +TLA D +P F+FGCI  A+G+S+PPQGL+GLGRG +SL
Sbjct: 182 CSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSL 241

Query: 223 LSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIG 282
           +SQT SLY   FSYCLPSF+S  FSGSL+LG + QPK I+YTPLLRNPRR SLYYVNL G
Sbjct: 242 VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTG 301

Query: 283 IRVGSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVGNATVSSL 342
           + VGS  V +      F+  +G+GTIIDSGT  TR   P Y A+RDEF+++V  ++ S+L
Sbjct: 302 VSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTL 361

Query: 343 GGFDTCYTV--PIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLN 402
           G FDTC++     V+P IT    ++++ LP +N LIHS+AG+ TCL+MA    N N+VLN
Sbjct: 362 GAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 421

Query: 403 VIANMQQQNHRIVFDIPNSRVGVAREPCS 420
           VIAN+QQQN RI+FD+PNSR+G+A EPC+
Sbjct: 422 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of CmoCh18G008610 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 203.0 bits (515), Expect = 3.7e-52
Identity = 126/353 (35.69%), Postives = 182/353 (51.56%), Query Frame = 1

Query: 80  SPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTS--VFSSDQSSSFRPLPCQ 139
           S  Y  R  +GTPA+ + + LDT +D  W+ C+ C  C   S  +F   +S ++  +PC 
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198

Query: 140 SPQCSQVPNPSCGT--SACGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKATG 199
           SP C ++ +  C T    C + ++YG  S    D   + +T  R+ V     GC     G
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEG 258

Query: 200 SSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYT 259
             V   GLLGLG+G LS   QT   +   FSYCL    ++S   S+  G  A  +  ++T
Sbjct: 259 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 318

Query: 260 PLLRNPRRSSLYYVNLIGIRV-GSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAY 319
           PLL NP+  + YYV L+GI V G++V  + +S F  +     G IIDSGT+ TRL+ PAY
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378

Query: 320 TAVRDEFQRRVGNATVS---SLGGFDTCYTV----PIVSPTITFMFDAMNVTLPPDNFLI 379
            A+RD F  RVG  T+        FDTC+ +     +  PT+   F   +V+LP  N+LI
Sbjct: 379 IAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLI 438

Query: 380 HSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPCS 420
                   C A A         L++I N+QQQ  R+V+D+ +SRVG A   C+
Sbjct: 439 PVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmoCh18G008610 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 191.0 bits (484), Expect = 1.4e-48
Identity = 129/358 (36.03%), Postives = 178/358 (49.72%), Query Frame = 1

Query: 80  SPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTS--VFSSDQSSSFRPLPCQ 139
           S  Y +R  +GTPA  + + LDT +D  W+ CS C  C   +  +F   +S +F  +PC 
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCG 191

Query: 140 SPQCSQVPNPS-CGT---SACGFNMTYGS-STVAADLVQDNITLARDSVPAFTFGCITKA 199
           S  C ++ + S C T     C + ++YG  S    D   + +T     V     GC    
Sbjct: 192 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDN 251

Query: 200 TGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSG----SLRLGPVAQP 259
            G  V   GLLGLGRG LS  SQT++ Y   FSYCL    S+  S     ++  G  A P
Sbjct: 252 EGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVP 311

Query: 260 KRIKYTPLLRNPRRSSLYYVNLIGIRVG-SKVVDIPSSAFAFNTATGSGTIIDSGTTFTR 319
           K   +TPLL NP+  + YY+ L+GI VG S+V  +  S F  +     G IIDSGT+ TR
Sbjct: 312 KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTR 371

Query: 320 LVAPAYTAVRDEFQRRVGNATVS---SLGGFDTCYTV----PIVSPTITFMFDAMNVTLP 379
           L  PAY A+RD F  R+G   +    S   FDTC+ +     +  PT+ F F    V+LP
Sbjct: 372 LTQPAYVALRDAF--RLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLP 431

Query: 380 PDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPC 419
             N+LI        C A A    +    L++I N+QQQ  R+ +D+  SRVG     C
Sbjct: 432 ASNYLIPVNTEGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CmoCh18G008610 vs. NCBI nr
Match: gi|659092233|ref|XP_008446966.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo])

HSP 1 Score: 675.2 bits (1741), Expect = 7.1e-191
Identity = 357/443 (80.59%), Postives = 383/443 (86.46%), Query Frame = 1

Query: 1   MNTKPSTFL-LSLLLLLRLLAAAS----------------GGSLRVFHISIH------SK 60
           MN   S FL L LLLLL LL + S                  +L+VFHI         SK
Sbjct: 1   MNNTKSPFLPLPLLLLLLLLFSISTAKSHIPLNCNPADDRSSTLKVFHIFSPCSPFRPSK 60

Query: 61  PLSWADGVLQMQAKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLL 120
           PLSWAD VLQMQAKD+ARLQ LSSLVARRS  PIAS RQ+IQSP++VVRA IGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARRSVVPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 LALDTSNDATWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFN 180
           LALDTSNDA WIPCSGCVGCP T+VFSSD+SSSFRPLPCQSPQC+QVPNPSC  +ACGFN
Sbjct: 121 LALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGTACGFN 180

Query: 181 MTYGSSTVAADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQ 240
           +TYGSSTVAADLVQDN+TLA DSVPA+TFGCI KATGSSVPPQGLLGLGRG LSLL Q+Q
Sbjct: 181 LTYGSSTVAADLVQDNVTLATDSVPAYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGS 300
           SLY+STFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLI IRVG 
Sbjct: 241 SLYRSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 KVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVG-NATVSSLGGFD 360
           ++VDIP SA AFN+ATG+GTIIDSGTTFTRLVAPAYTAVRDEF+RRVG N TVSSLGGFD
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFD 360

Query: 361 TCYTVPIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQ 420
           TCYTVPI+SPTITFMF  MNVTLPPDNFLIHSTAGSTTCLAMA+APDNVNSVLNVIA+MQ
Sbjct: 361 TCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQ 420

BLAST of CmoCh18G008610 vs. NCBI nr
Match: gi|449449334|ref|XP_004142420.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 674.5 bits (1739), Expect = 1.2e-190
Identity = 356/441 (80.73%), Postives = 381/441 (86.39%), Query Frame = 1

Query: 1   MNTKPSTFLLSLLLLLRLL---------------AAASGGSLRVFHISIH------SKPL 60
           MN   S FLL LLLLL L                AA    +L+VFHI         SKPL
Sbjct: 1   MNNTKSPFLL-LLLLLHLFSISTAKSHIPSNCNPAADRSSTLQVFHIFSPCSPFRPSKPL 60

Query: 61  SWADGVLQMQAKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLA 120
           SWAD VLQMQAKD+ARLQ LSSLVARRS  PIAS RQ+IQSP++VVRA IGTPAQTLLLA
Sbjct: 61  SWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLLA 120

Query: 121 LDTSNDATWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMT 180
           LDTSNDA WIPCSGC+GCP T+VFSSD+SSSFRPLPCQSPQC+QVPNPSC  SACGFN+T
Sbjct: 121 LDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLT 180

Query: 181 YGSSTVAADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSL 240
           YGSSTVAADLVQDN+TLA DSVP++TFGCI KATGSSVPPQGLLGLGRG LSLL Q+QSL
Sbjct: 181 YGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSL 240

Query: 241 YQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKV 300
           YQSTFSYCLPSFKS +FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLI IRVG K+
Sbjct: 241 YQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKI 300

Query: 301 VDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVG-NATVSSLGGFDTC 360
           VDIP SA AFN+ATG+GT+IDSGTTFTRLVAPAYTAVRDEF+RRVG N TVSSLGGFDTC
Sbjct: 301 VDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTC 360

Query: 361 YTVPIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQ 420
           YTVPI+SPTITFMF  MNVTLPPDNFLIHSTAGSTTCLAMA+APDNVNSVLNVIA+MQQQ
Sbjct: 361 YTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQ 420

BLAST of CmoCh18G008610 vs. NCBI nr
Match: gi|255543963|ref|XP_002513044.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Ricinus communis])

HSP 1 Score: 612.8 bits (1579), Expect = 4.3e-172
Identity = 303/402 (75.37%), Postives = 351/402 (87.31%), Query Frame = 1

Query: 24  GGSLRVFHISIH------SKPLSWADGVLQMQAKDEARLQLLSSLVARRSDAPIASGRQV 83
           G +L+VFH+         SKPL W + VLQMQAKD+ARLQ LSSLVAR+S  PIASGRQ+
Sbjct: 31  GSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAKDQARLQFLSSLVARKSVVPIASGRQI 90

Query: 84  IQSPSYVVRANIGTPAQTLLLALDTSNDATWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQ 143
           +QSP+Y+VRA IGTPAQT+LLA+DTSNDA WIPCSGCVGC  T VF++ +S++F+ + C+
Sbjct: 91  VQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSST-VFNNVKSTTFKTVGCE 150

Query: 144 SPQCSQVPNPSCGTSACGFNMTYGSSTVAADLVQDNITLARDSVPAFTFGCITKATGSSV 203
           +PQC QVPN  CG SAC FNMTYGSS++AA+L QD +TLA DS+P++TFGC+T+ATGSS+
Sbjct: 151 APQCKQVPNSKCGGSACAFNMTYGSSSIAANLSQDVVTLATDSIPSYTFGCLTEATGSSI 210

Query: 204 PPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLPSFKSASFSGSLRLGPVAQPKRIKYTPLL 263
           PPQGLLGLGRG +SLLSQTQ+LYQSTFSYCLPSF+S +FSGSLRLGPV QPKRIK TPLL
Sbjct: 211 PPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLL 270

Query: 264 RNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAFNTATGSGTIIDSGTTFTRLVAPAYTAVR 323
           +NPRRSSLYYVNL+ IRVG +VVDIP SA AFN  TG+GTI DSGT FTRLVAPAYTAVR
Sbjct: 271 KNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVR 330

Query: 324 DEFQRRVGNATVSSLGGFDTCYTVPIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLA 383
           D F++RVGNATV+SLGGFDTCYT PIV+PTITFMF  MNVTLPPDN LIHSTA S TCLA
Sbjct: 331 DAFRKRVGNATVTSLGGFDTCYTSPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLA 390

Query: 384 MASAPDNVNSVLNVIANMQQQNHRIVFDIPNSRVGVAREPCS 420
           MA+APDNVNSVLNVIANMQQQNHRI+FD+PNSR+GVAREPC+
Sbjct: 391 MAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431

BLAST of CmoCh18G008610 vs. NCBI nr
Match: gi|802574128|ref|XP_012068673.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas])

HSP 1 Score: 601.3 bits (1549), Expect = 1.3e-168
Identity = 313/430 (72.79%), Postives = 354/430 (82.33%), Query Frame = 1

Query: 7   TFLLSLLLLLRLLAAA-----------SGGSLRVFHISIH------SKPLSWADGVLQMQ 66
           T LLSL  L   LA              G +L+VFH+         SKPLSW + VLQMQ
Sbjct: 3   THLLSLAFLFFSLAQGLHLNPKCSSQDQGSTLQVFHVYSPCSPFRPSKPLSWEESVLQMQ 62

Query: 67  AKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDATWI 126
           AKD+ARLQ LSSLVA RS  PIASGRQ+IQSP+Y+VRA IGTPAQTLLLA+DTSNDA WI
Sbjct: 63  AKDQARLQFLSSLVAGRSFVPIASGRQIIQSPTYIVRAKIGTPAQTLLLAVDTSNDAAWI 122

Query: 127 PCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMTYGSSTVAADL 186
           PCSGC GC  T VF S +S+SF+ + C +PQC QVPNP+C  SAC FN TYGSS++AA+L
Sbjct: 123 PCSGCDGCSST-VFDSVKSTSFQTVGCGAPQCKQVPNPTCSGSACTFNTTYGSSSIAANL 182

Query: 187 VQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSYCLP 246
            QD ++LA DSVP +TFGCI KATGSSVPPQGLLGLGRG LSLLSQTQ+LYQSTFSYCLP
Sbjct: 183 SQDTVSLATDSVPGYTFGCIAKATGSSVPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLP 242

Query: 247 SFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSAFAF 306
           SF+S +FSG+LRLGP  QPKRIK TPLLRNPRRSSLYYVNL+ IRVG +VVDIP SA AF
Sbjct: 243 SFRSLNFSGTLRLGPNGQPKRIKTTPLLRNPRRSSLYYVNLVAIRVGRRVVDIPPSALAF 302

Query: 307 NTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRVGNATVSSLGGFDTCYTVPIVSPTIT 366
           N  TG+GTI DSGT FTRLV PAYTAVRD F++RVGNATV+SLGGFDTCY+VPIV+PTIT
Sbjct: 303 NPTTGAGTIFDSGTVFTRLVTPAYTAVRDAFRKRVGNATVTSLGGFDTCYSVPIVAPTIT 362

Query: 367 FMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRIVFDIPNS 420
           FMF  MNVTLPP+N LIHSTAGST+CLA+A+APDNVNSVLNVIANMQQQNHRI+FD+PNS
Sbjct: 363 FMFSGMNVTLPPENLLIHSTAGSTSCLAIAAAPDNVNSVLNVIANMQQQNHRILFDVPNS 422

BLAST of CmoCh18G008610 vs. NCBI nr
Match: gi|356508308|ref|XP_003522900.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Glycine max])

HSP 1 Score: 598.6 bits (1542), Expect = 8.5e-168
Identity = 308/438 (70.32%), Postives = 356/438 (81.28%), Query Frame = 1

Query: 4   KPSTFLLSLLLLLRLLAAASG-----------GSLRVFHISIHS------KPLSWADGVL 63
           K + F LS L L  L +   G            +L VFH+          KPLSWA+ VL
Sbjct: 2   KTTLFSLSPLFLFLLFSLVEGLTPKCDTQDHGSTLEVFHVFSPCSPFRPPKPLSWAESVL 61

Query: 64  QMQAKDEARLQLLSSLVARRSDAPIASGRQVIQSPSYVVRANIGTPAQTLLLALDTSNDA 123
           Q+QAKD+ARLQ L+S+VA RS  PIASGRQ+IQSP+Y+VRA IG+P QTLLLA+DTSNDA
Sbjct: 62  QLQAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDA 121

Query: 124 TWIPCSGCVGCPGTSVFSSDQSSSFRPLPCQSPQCSQVPNPSCGTSACGFNMTYGSSTVA 183
            WIPC+ C GC  T +F+ ++S++F+ + C SPQC+QVPNPSCGTSAC FN+TYGSS++A
Sbjct: 122 AWIPCTACDGCTST-LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIA 181

Query: 184 ADLVQDNITLARDSVPAFTFGCITKATGSSVPPQGLLGLGRGSLSLLSQTQSLYQSTFSY 243
           A++VQD +TLA D +P +TFGC+ K TG+S PPQGLLGLGRG LSLLSQTQ+LYQSTFSY
Sbjct: 182 ANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 241

Query: 244 CLPSFKSASFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIGIRVGSKVVDIPSSA 303
           CLPSFKS +FSGSLRLGPVAQP RIKYTPLL+NPRRSSLYYVNL+ IRVG KVVDIP  A
Sbjct: 242 CLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEA 301

Query: 304 FAFNTATGSGTIIDSGTTFTRLVAPAYTAVRDEFQRRV-----GNATVSSLGGFDTCYTV 363
            AFN ATG+GT+ DSGT FTRLVAPAYTAVRDEFQRRV      N TV+SLGGFDTCYTV
Sbjct: 302 LAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTV 361

Query: 364 PIVSPTITFMFDAMNVTLPPDNFLIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 420
           PIV+PTITFMF  MNVTLP DN LIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR
Sbjct: 362 PIVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 421

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AED3_ARATH6.8e-10953.21Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1[more]
AP25_ORYSJ4.0e-10148.29Aspartyl protease 25 OS=Oryza sativa subsp. japonica GN=AP25 PE=2 SV=1[more]
APF2_ARATH6.5e-5135.69Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
ASPG2_ARATH3.4e-4433.33Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
NEP1_NEPGR1.0e-4336.10Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KTR5_CUCSA8.5e-19180.73Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_5G623870 PE=3 SV=1[more]
B9RG92_RICCO3.0e-17275.37Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1452350 ... [more]
A0A067L7Q3_JATCU9.1e-16972.79Uncharacterized protein OS=Jatropha curcas GN=JCGZ_24539 PE=3 SV=1[more]
I1JW44_SOYBN5.9e-16870.32Uncharacterized protein OS=Glycine max GN=GLYMA_04G136000 PE=3 SV=1[more]
A0A0B2PGF3_GLYSO5.9e-16870.32Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_045617 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G07030.13.4e-15165.02 Eukaryotic aspartyl protease family protein[more]
AT3G54400.14.5e-15164.71 Eukaryotic aspartyl protease family protein[more]
AT1G09750.13.8e-11053.21 Eukaryotic aspartyl protease family protein[more]
AT1G01300.13.7e-5235.69 Eukaryotic aspartyl protease family protein[more]
AT3G61820.11.4e-4836.03 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659092233|ref|XP_008446966.1|7.1e-19180.59PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo][more]
gi|449449334|ref|XP_004142420.1|1.2e-19080.73PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
gi|255543963|ref|XP_002513044.1|4.3e-17275.37PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Ricinus communis][more]
gi|802574128|ref|XP_012068673.1|1.3e-16872.79PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Jatropha curcas][more]
gi|356508308|ref|XP_003522900.1|8.5e-16870.32PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Glycine max][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0048046 apoplast
cellular_component GO:0009507 chloroplast
cellular_component GO:0009505 plant-type cell wall
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh18G008610.1CmoCh18G008610.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 1..419
score: 3.0E
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 245..419
score: 1.0E-42coord: 80..243
score: 4.5
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 81..418
score: 9.75
NoneNo IPR availablePANTHERPTHR13683:SF258ASPARTYL PROTEASE FAMILY PROTEIN-RELATEDcoord: 1..419
score: 3.0E