ClCG08G005600 (gene) Watermelon (Charleston Gray)

NameClCG08G005600
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionEukaryotic aspartyl protease family protein LENGTH=437
LocationCG_Chr08 : 17655722 .. 17657023 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCAATGGCAAATTGTTGCTTCTTGTTACTGCTTTTGTTGCCAGTTCTCCATTCCCAGCCAGCTTTCAGTGAAAGTTCACTAAAACCTGGATTTCATTTTGATCTTACACATGTTGATGCTACCCAAAACTTCACCAAATTCGAAATGCTCCGACGAGGAATTGAGCGAGACAAGACCCGGATACAAAATTTCAAAAAGATGACTGTAAACTTCGAAGTTCGAATGCCACTCGTTGACCAACAAGGGAGCTATCTGATGAACTTGTCATTTGGAACACCTCCTGTTTCGTTCTCTGCAATTTTCGATACTGGCAGTGACTTGATTTGGACACAGTGCAATCCATGCTTGGAATGTTTTGGACAACCAACCCCCGTGTATGATCCTGCACAATCGTCTTCCTTCACCAATGCTACTCGCTCCGCTGCATTATGCCAGGCTTTGACCAATTCAAGTTCCAGCAATGGTTGTGAGTACTCATATGGTTATGGAGATGGATCTTTCACCATAGGTTATTTGGCGTTCGAGACCTTGACGCTTGGGGAAGCAAACCAGCAAGCTTCTACACCAGACATAGCTTTTGGGTGCAGGATAAGGAACTACGTAAACGGCTTGACGCAGGGTGCTGGCATAGTAGGGTTCAGCCGAGGACCATTGTCGCTTCTTTCGCAGCTCCATATTCGAAAGTTCTCTTACTGTTTAAGTCCAAATGGAACTGGGTGTCTGGCCTTTGGATCATCTTCAACAAGCTTTGACAACACAACAAACCAAGCCGTCAAGAACACGCCACTGATACAAAATCCATCCAATCCATCTTACTACTATCTATCTCTACAAGGAATCACAGTTGGCCAAAAGTTTTTGCCCGTCCCATCCTCATGGTTTGAACTAAATGCTGATGGCAGCGGCGGCGTGATCATAGATTCTGACACATCGATCACGTATATAACAGAGGATGCCTTTGATGTGCTGAAGCCAGTTTTTACTTCACAGACAAACCTCCCGGTGATAAATTCAGCAAGTATTGGTCTTGACCTCTGCTTTGAGCTACCTTCCCCAAACAATAGTAAACTTGATGTGCCTGATTTGATATTTCGTTTTGAGGGTCTCGACTTGAAGCTTCCAGTCGAGAACTATATGGTCGTCAATGAGGAGGTAGGGATTGTATGCTTGGCAATGGGGGCTGCAGGAGGTTTGTCGATTTTCGGCAGCATGCAACATCAAAATATGTTGGTTCTTCATGATCTCAAGAAAAAAGTCATATCATTCATTCCCACACAATGCTCTCGGATCAATTACTAG

mRNA sequence

ATGCCAATGGCAAATTGTTGCTTCTTGTTACTGCTTTTGTTGCCAGTTCTCCATTCCCAGCCAGCTTTCAGTGAAAGTTCACTAAAACCTGGATTTCATTTTGATCTTACACATGTTGATGCTACCCAAAACTTCACCAAATTCGAAATGCTCCGACGAGGAATTGAGCGAGACAAGACCCGGATACAAAATTTCAAAAAGATGACTGTAAACTTCGAAGTTCGAATGCCACTCGTTGACCAACAAGGGAGCTATCTGATGAACTTGTCATTTGGAACACCTCCTGTTTCGTTCTCTGCAATTTTCGATACTGGCAGTGACTTGATTTGGACACAGTGCAATCCATGCTTGGAATGTTTTGGACAACCAACCCCCGTGTATGATCCTGCACAATCGTCTTCCTTCACCAATGCTACTCGCTCCGCTGCATTATGCCAGGCTTTGACCAATTCAAGTTCCAGCAATGGTTGTGAGTACTCATATGGTTATGGAGATGGATCTTTCACCATAGGTTATTTGGCGTTCGAGACCTTGACGCTTGGGGAAGCAAACCAGCAAGCTTCTACACCAGACATAGCTTTTGGGTGCAGGATAAGGAACTACGTAAACGGCTTGACGCAGGGTGCTGGCATAGTAGGGTTCAGCCGAGGACCATTGTCGCTTCTTTCGCAGCTCCATATTCGAAAGTTCTCTTACTGTTTAAGTCCAAATGGAACTGGGTGTCTGGCCTTTGGATCATCTTCAACAAGCTTTGACAACACAACAAACCAAGCCGTCAAGAACACGCCACTGATACAAAATCCATCCAATCCATCTTACTACTATCTATCTCTACAAGGAATCACAGTTGGCCAAAAGTTTTTGCCCGTCCCATCCTCATGGTTTGAACTAAATGCTGATGGCAGCGGCGGCGTGATCATAGATTCTGACACATCGATCACGTATATAACAGAGGATGCCTTTGATGTGCTGAAGCCAGTTTTTACTTCACAGACAAACCTCCCGGTGATAAATTCAGCAAGTATTGGTCTTGACCTCTGCTTTGAGCTACCTTCCCCAAACAATAGTAAACTTGATGTGCCTGATTTGATATTTCGTTTTGAGGGTCTCGACTTGAAGCTTCCAGTCGAGAACTATATGGTCGTCAATGAGGAGGTAGGGATTGTATGCTTGGCAATGGGGGCTGCAGGAGGTTTGTCGATTTTCGGCAGCATGCAACATCAAAATATGTTGGTTCTTCATGATCTCAAGAAAAAAGTCATATCATTCATTCCCACACAATGCTCTCGGATCAATTACTAG

Coding sequence (CDS)

ATGCCAATGGCAAATTGTTGCTTCTTGTTACTGCTTTTGTTGCCAGTTCTCCATTCCCAGCCAGCTTTCAGTGAAAGTTCACTAAAACCTGGATTTCATTTTGATCTTACACATGTTGATGCTACCCAAAACTTCACCAAATTCGAAATGCTCCGACGAGGAATTGAGCGAGACAAGACCCGGATACAAAATTTCAAAAAGATGACTGTAAACTTCGAAGTTCGAATGCCACTCGTTGACCAACAAGGGAGCTATCTGATGAACTTGTCATTTGGAACACCTCCTGTTTCGTTCTCTGCAATTTTCGATACTGGCAGTGACTTGATTTGGACACAGTGCAATCCATGCTTGGAATGTTTTGGACAACCAACCCCCGTGTATGATCCTGCACAATCGTCTTCCTTCACCAATGCTACTCGCTCCGCTGCATTATGCCAGGCTTTGACCAATTCAAGTTCCAGCAATGGTTGTGAGTACTCATATGGTTATGGAGATGGATCTTTCACCATAGGTTATTTGGCGTTCGAGACCTTGACGCTTGGGGAAGCAAACCAGCAAGCTTCTACACCAGACATAGCTTTTGGGTGCAGGATAAGGAACTACGTAAACGGCTTGACGCAGGGTGCTGGCATAGTAGGGTTCAGCCGAGGACCATTGTCGCTTCTTTCGCAGCTCCATATTCGAAAGTTCTCTTACTGTTTAAGTCCAAATGGAACTGGGTGTCTGGCCTTTGGATCATCTTCAACAAGCTTTGACAACACAACAAACCAAGCCGTCAAGAACACGCCACTGATACAAAATCCATCCAATCCATCTTACTACTATCTATCTCTACAAGGAATCACAGTTGGCCAAAAGTTTTTGCCCGTCCCATCCTCATGGTTTGAACTAAATGCTGATGGCAGCGGCGGCGTGATCATAGATTCTGACACATCGATCACGTATATAACAGAGGATGCCTTTGATGTGCTGAAGCCAGTTTTTACTTCACAGACAAACCTCCCGGTGATAAATTCAGCAAGTATTGGTCTTGACCTCTGCTTTGAGCTACCTTCCCCAAACAATAGTAAACTTGATGTGCCTGATTTGATATTTCGTTTTGAGGGTCTCGACTTGAAGCTTCCAGTCGAGAACTATATGGTCGTCAATGAGGAGGTAGGGATTGTATGCTTGGCAATGGGGGCTGCAGGAGGTTTGTCGATTTTCGGCAGCATGCAACATCAAAATATGTTGGTTCTTCATGATCTCAAGAAAAAAGTCATATCATTCATTCCCACACAATGCTCTCGGATCAATTACTAG

Protein sequence

MPMANCCFLLLLLLPVLHSQPAFSESSLKPGFHFDLTHVDATQNFTKFEMLRRGIERDKTRIQNFKKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIGYLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSYCLSPNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQNMLVLHDLKKKVISFIPTQCSRINY
BLAST of ClCG08G005600 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 1.2e-97
Identity = 200/437 (45.77%), Postives = 265/437 (60.64%), Query Frame = 1

Query: 8   FLLLLLLPVLHSQPAFSESSLK---------PGFHFDLTHVDATQNFTKFEMLRRGIERD 67
           FLL L +  +   P  S S             GF   L HVD+ +N TKF++L R IER 
Sbjct: 8   FLLALSIVYIFVAPTHSTSRTALNHRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERG 67

Query: 68  KTRIQNFKKMTVNFE-VRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCL 127
             R+Q  + M      V   +    G YLMNLS GTP   FSAI DTGSDLIWTQC PC 
Sbjct: 68  SRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCT 127

Query: 128 ECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNG-CEYSYGYGDGSFTIGYLAFE 187
           +CF Q TP+++P  SSSF+    S+ LCQAL++ + SN  C+Y+YGYGDGS T G +  E
Sbjct: 128 QCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTE 187

Query: 188 TLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSYCLSP 247
           TLT G      S P+I FGC   N   G   GAG+VG  RGPLSL SQL + KFSYC++P
Sbjct: 188 TLTFG----SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP 247

Query: 248 NGTGC---LAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPVPSS 307
            G+     L  GS +    N+      NT LIQ+   P++YY++L G++VG   LP+  S
Sbjct: 248 IGSSTPSNLLLGSLA----NSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPS 307

Query: 308 WFELNA-DGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFELPS 367
            F LN+ +G+GG+IIDS T++TY   +A+  ++  F SQ NLPV+N +S G DLCF+ PS
Sbjct: 308 AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPS 367

Query: 368 PNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAG-GLSIFGSMQHQNML 427
            + S L +P  +  F+G DL+LP ENY  ++   G++CLAMG++  G+SIFG++Q QNML
Sbjct: 368 -DPSNLQIPTFVMHFDGGDLELPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNML 427

Query: 428 VLHDLKKKVISFIPTQC 429
           V++D    V+SF   QC
Sbjct: 428 VVYDTGNSVVSFASAQC 434

BLAST of ClCG08G005600 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 1.5e-95
Identity = 182/407 (44.72%), Postives = 263/407 (64.62%), Query Frame = 1

Query: 29  KPGFHFDLTHVDATQNFTKFEMLRRGIERDKTRIQNFKKMTVNFE-VRMPLVDQQGSYLM 88
           +PG   DL  VD+ +N TK+E+++R I+R + R+++   M  +   +  P+    G YLM
Sbjct: 39  QPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLM 98

Query: 89  NLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQA 148
           N++ GTP  SFSAI DTGSDLIWTQC PC +CF QPTP+++P  SSSF+     +  CQ 
Sbjct: 99  NVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQD 158

Query: 149 LTNSSSSNG-CEYSYGYGDGSFTIGYLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLT 208
           L + + +N  C+Y+YGYGDGS T GY+A ET T     + +S P+IAFGC   N   G  
Sbjct: 159 LPSETCNNNECQYTYGYGDGSTTQGYMATETFTF----ETSSVPNIAFGCGEDNQGFGQG 218

Query: 209 QGAGIVGFSRGPLSLLSQLHIRKFSYCLSPNGTGC---LAFGSSSTSFDNTTNQAVKNTP 268
            GAG++G   GPLSL SQL + +FSYC++  G+     LA GS+++       +   +T 
Sbjct: 219 NGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVP----EGSPSTT 278

Query: 269 LIQNPSNPSYYYLSLQGITVGQKFLPVPSSWFELNADGSGGVIIDSDTSITYITEDAFDV 328
           LI +  NP+YYY++LQGITVG   L +PSS F+L  DG+GG+IIDS T++TY+ +DA++ 
Sbjct: 279 LIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNA 338

Query: 329 LKPVFTSQTNLPVINSASIGLDLCFELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVN 388
           +   FT Q NLP ++ +S GL  CF+ PS + S + VP++  +F+G  L L  +N ++  
Sbjct: 339 VAQAFTDQINLPTVDESSSGLSTCFQQPS-DGSTVQVPEISMQFDGGVLNLGEQNILISP 398

Query: 389 EEVGIVCLAMGAAG--GLSIFGSMQHQNMLVLHDLKKKVISFIPTQC 429
            E G++CLAMG++   G+SIFG++Q Q   VL+DL+   +SF+PTQC
Sbjct: 399 AE-GVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of ClCG08G005600 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 240.4 bits (612), Expect = 3.8e-62
Identity = 156/445 (35.06%), Postives = 227/445 (51.01%), Query Frame = 1

Query: 8   FLLLLLLPVLHSQPAFSESSLKP--GFHFDLTHVDATQN------FTKFEMLRRGIERDK 67
           F  +LL   L S    S ++ KP  GF  DL H D+ ++       T  + LR  I R  
Sbjct: 5   FSSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSV 64

Query: 68  TRIQNFKKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLEC 127
            R+ +F +     + ++ L    G YLMN+S GTPP    AI DTGSDL+WTQC PC +C
Sbjct: 65  NRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDC 124

Query: 128 FGQPTPVYDPAQSSSFTNATRSAALCQALTN----SSSSNGCEYSYGYGDGSFTIGYLAF 187
           + Q  P++DP  SS++ + + S++ C AL N    S++ N C YS  YGD S+T G +A 
Sbjct: 125 YTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAV 184

Query: 188 ETLTLGEAN-QQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIR---KFS 247
           +TLTLG ++ +     +I  GC   N      +G+GIVG   GP+SL+ QL      KFS
Sbjct: 185 DTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFS 244

Query: 248 YCLSP-----NGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQK 307
           YCL P     + T  + FG+++      +   V +TPLI   S  ++YYL+L+ I+VG K
Sbjct: 245 YCLVPLTSKKDQTSKINFGTNAI----VSGSGVVSTPLIAKASQETFYYLTLKSISVGSK 304

Query: 308 FLPVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDL 367
            +    S  E      G +IIDS T++T +  + +  L+    S  +         GL L
Sbjct: 305 QIQYSGSDSE---SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL 364

Query: 368 CFELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQ 427
           C+         L VP +   F+G D+KL   N  V   E  +VC A   +   SI+G++ 
Sbjct: 365 CYSA----TGDLKVPVITMHFDGADVKLDSSNAFVQVSE-DLVCFAFRGSPSFSIYGNVA 424

Query: 428 HQNMLVLHDLKKKVISFIPTQCSRI 432
             N LV +D   K +SF PT C+++
Sbjct: 425 QMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of ClCG08G005600 vs. Swiss-Prot
Match: AP37_ORYSJ (Aspartyl protease 37 OS=Oryza sativa subsp. japonica GN=AP37 PE=3 SV=2)

HSP 1 Score: 229.2 bits (583), Expect = 8.7e-59
Identity = 157/471 (33.33%), Postives = 232/471 (49.26%), Query Frame = 1

Query: 10  LLLLLPVLHSQPAFSESSLKP-GFHFDLTHVDATQ----NFTKFEMLRRGIERDKTRIQN 69
           +LLLL  L + PA   S   P  F  +L  VDA+     N T+ E+LRR I+R + R+  
Sbjct: 5   VLLLLLALAALPA---SCAPPRSFRLELASVDASAADAANLTEHELLRRAIQRSRYRLAG 64

Query: 70  F----------KKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCN 129
                      +K  V      P++   G YL+ L  GTPP  F+A  DT SDLIWTQC 
Sbjct: 65  IGMARGEAASARKAVV---AETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ 124

Query: 130 PCLECFGQPTPVYDPAQSSSFTNATRSAALCQAL----TNSSSSNGCEYSYGYGDGSFTI 189
           PC  C+ Q  P+++P  SS++     S+  C  L            C+Y+Y Y   + T 
Sbjct: 125 PCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTE 184

Query: 190 GYLAFETLTLGEANQQASTPDIAFGCRIRNYVNG-LTQGAGIVGFSRGPLSLLSQLHIRK 249
           G LA + L +GE     +   +AFGC   +       Q +G+VG  RGPLSL+SQL +R+
Sbjct: 185 GTLAVDKLVIGE----DAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRR 244

Query: 250 FSYCLSPNGT---GCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQK 309
           F+YCL P  +   G L  G+ + +  N TN+     P+ ++P  PSYYYL+L G+ +G +
Sbjct: 245 FAYCLPPPASRIPGKLVLGADADAARNATNRIA--VPMRRDPRYPSYYYLNLDGLLIGDR 304

Query: 310 FL--------------------PVPSSWFELNADGSG---GVIIDSDTSITYITEDAFDV 369
            +                    P PS      A G     G+IID  ++IT++    +D 
Sbjct: 305 AMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDE 364

Query: 370 LKPVFTSQTNLPVINSASIGLDLCFELP-SPNNSKLDVPDLIFRFEGLDLKLPVENYMVV 429
           L      +  LP    +S+GLDLCF LP      ++ VP +   F+G  L+L        
Sbjct: 365 LVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAE 424

Query: 430 NEEVGIVCLAMG--AAGGLSIFGSMQHQNMLVLHDLKKKVISFIPTQCSRI 432
           + E G++CL +G   AG +SI G+ Q QNM VL++L++  ++F+ + C  +
Sbjct: 425 DRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPCGAL 463

BLAST of ClCG08G005600 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 7.7e-55
Identity = 142/446 (31.84%), Postives = 228/446 (51.12%), Query Frame = 1

Query: 11  LLLLPVLHSQPAFSESSLKPGFHFDLTHVDA------TQNFTKFEMLRRGIERDKTRIQN 70
           +LL   L      S S     F  +L H D+          T  + L     R  +R + 
Sbjct: 5   ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64

Query: 71  FKKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFGQPT 130
           F       +++  L+   G + M+++ GTPP+   AI DTGSDL W QC PC +C+ +  
Sbjct: 65  FNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENG 124

Query: 131 PVYDPAQSSSFTNATRSAALCQALTNS-----SSSNGCEYSYGYGDGSFTIGYLAFETLT 190
           P++D  +SS++ +    +  CQAL+++      S+N C+Y Y YGD SF+ G +A ET++
Sbjct: 125 PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVS 184

Query: 191 LGEAN-QQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLH---IRKFSYCLS 250
           +  A+    S P   FGC   N       G+GI+G   G LSL+SQL     +KFSYCLS
Sbjct: 185 IDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLS 244

Query: 251 -----PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPV 310
                 NGT  +  G++S     + +  V +TPL+ +    +YYYL+L+ I+VG+K +P 
Sbjct: 245 HKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTYYYLTLEAISVGKKKIPY 304

Query: 311 PSSWFELNADG-----SGGVIIDSDTSITYITEDAFDVL-KPVFTSQTNLPVINSASIGL 370
             S +  N DG     SG +IIDS T++T +    FD     V  S T    ++     L
Sbjct: 305 TGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLL 364

Query: 371 DLCFELPSPNNSKLDVPDLIFRFEGLDLKL-PVENYMVVNEEVGIVCLAMGAAGGLSIFG 430
             CF+     ++++ +P++   F G D++L P+  ++ ++E+  +VCL+M     ++I+G
Sbjct: 365 SHCFK---SGSAEIGLPEITVHFTGADVRLSPINAFVKLSED--MVCLSMVPTTEVAIYG 424

BLAST of ClCG08G005600 vs. TrEMBL
Match: F6H0S5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g03090 PE=3 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.6e-120
Identity = 226/442 (51.13%), Postives = 294/442 (66.52%), Query Frame = 1

Query: 2   PMANCCFLLLLL----LPVLHS--QPAFSESSLKPGFHFDLTHVDATQNFTKFEMLRRGI 61
           P+ +  FLLL+L    LP   S   P F    L+ GF   L H+DA +NFT+ ++++RGI
Sbjct: 7   PLPSLPFLLLILTVLGLPRSSSTLMPGFRRQQLETGFQVGLRHIDAGRNFTRLQLIQRGI 66

Query: 62  ERDKTRIQNFKKMTVNFE---VRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQ 121
            R + R+Q    M    E    + P+    G +++NL  GTPPV F AI DTGSDLIWTQ
Sbjct: 67  NRGRQRLQRMSGMATTAERNGFQAPVHVGDGEFVVNLMIGTPPVPFPAIMDTGSDLIWTQ 126

Query: 122 CNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIGY 181
           CNPC  CF Q TPV++P +SS+F+N + S+ LC+ +  S     CEY Y YGD S T G+
Sbjct: 127 CNPCKLCFQQSTPVFNPKRSSTFSNISCSSKLCKGVKPSKCDKSCEYRYTYGDESSTEGF 186

Query: 182 LAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSY 241
           +A +T+T GE  ++ S P I FGC + N   G+ Q AG++G  RG LSL+SQL  +KFSY
Sbjct: 187 MAMDTITFGELPKRVSIPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLGTQKFSY 246

Query: 242 CLSP---NGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLP 301
           CL+    N T  L FGS   ++ N     +  TPLIQNP  PSYYYL+L+GITVG   LP
Sbjct: 247 CLTSIHENKTSSLLFGS--LAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLP 306

Query: 302 VPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFE 361
           +P   F+L  DGSGG+I+DS T+ITY+ EDAFDVLK  F SQT L V NS++ GLDLCF 
Sbjct: 307 IPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFH 366

Query: 362 LPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQN 421
           LP  N +++ VP LIF F+GLDL LPVENYMV + E+G++CLA+ A G LSIFG++Q QN
Sbjct: 367 LPVKNAAEVKVPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDATGSLSIFGNIQQQN 426

Query: 422 MLVLHDLKKKVISFIPTQCSRI 432
           MLVLHDLKK  +S +PTQC ++
Sbjct: 427 MLVLHDLKKSTLSLVPTQCDKV 446

BLAST of ClCG08G005600 vs. TrEMBL
Match: B9GH97_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0001s31370g PE=3 SV=2)

HSP 1 Score: 437.2 bits (1123), Expect = 2.4e-119
Identity = 221/440 (50.23%), Postives = 302/440 (68.64%), Query Frame = 1

Query: 3   MANCCFLLLLLLPVLHSQPAFSES-------SLKPGFHFDLTHVDATQNFTKFEMLRRGI 62
           M + CF+L L +  +   PAFS S        ++ GF   L HVD+ +N TK E +R G+
Sbjct: 4   MTSLCFVLALAMFTIFFSPAFSTSRRALEHPKMQKGFRVRLKHVDSGKNLTKLERIRHGV 63

Query: 63  ERDKTRIQNFKKMTV----NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWT 122
           +R + R+Q  + M +    + E+  P++   G +LM L+ GTPP ++SAI DTGSDLIWT
Sbjct: 64  KRGRNRLQRLQAMALVASSSSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWT 123

Query: 123 QCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIG 182
           QC PC +CF Q TP++DP +SSSF+  + S+ LC+AL  SS +NGCEY Y YGD S T G
Sbjct: 124 QCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNGCEYLYSYGDYSSTQG 183

Query: 183 YLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFS 242
            LA ETLT G    +AS P +AFGC   N  +G +QGAG+VG  RGPLSL+SQL   KFS
Sbjct: 184 ILASETLTFG----KASVPHVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFS 243

Query: 243 YCLS---PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFL 302
           YCL+      T  L  GS ++   N ++ A+K TPLI +P++PS+YYLSL+GI+VG   L
Sbjct: 244 YCLTTVDDTKTSTLLMGSLASV--NASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRL 303

Query: 303 PVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCF 362
           P+  S F L  DGSGG+IIDS T+ITY+ E AF+++   FT++ NLPV +S S GLD+CF
Sbjct: 304 PIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCF 363

Query: 363 ELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQ 422
            LPS  ++ ++VP L+F F+G DL+LP ENYM+ +  +G+ CLAMG++ G+SIFG++Q Q
Sbjct: 364 TLPS-GSTNIEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQ 423

Query: 423 NMLVLHDLKKKVISFIPTQC 429
           NMLVLHDL+K+ +SF+PTQC
Sbjct: 424 NMLVLHDLEKETLSFLPTQC 436

BLAST of ClCG08G005600 vs. TrEMBL
Match: B9N2J4_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0019s01600g PE=3 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.6e-118
Identity = 214/418 (51.20%), Postives = 290/418 (69.38%), Query Frame = 1

Query: 19  SQPAFSESSLKPGFHFDLTHVDATQNFTKFEMLRRGIERDKTRIQNFKKMTV----NFEV 78
           S+       ++ GF   L HVD+ +N TKFE ++ G++R + R+Q FK M +    N E+
Sbjct: 27  SRRVLEHPKVQNGFRAKLKHVDSGKNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEI 86

Query: 79  RMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFGQPTPVYDPAQSSS 138
             P++   G +LM L+ GTPP ++SAI DTGSDLIWTQC PC +CF QPTP++DP +SSS
Sbjct: 87  DAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSS 146

Query: 139 FTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIGYLAFETLTLGEANQQASTPDIAF 198
           F+  + S+ LC+AL  S+ S+GCEY YGYGD S T G LA ETLT G    + S P++AF
Sbjct: 147 FSKLSCSSKLCEALPQSTCSDGCEYLYGYGDYSSTQGMLASETLTFG----KVSVPEVAF 206

Query: 199 GCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSYCL-SPNGTGCLAFGSSSTSFDN 258
           GC   N  +G +QG+G+VG  RGPLSL+SQL   KFSYCL S + T        S +   
Sbjct: 207 GCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASVK 266

Query: 259 TTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPVPSSWFELNADGSGGVIIDSDTSI 318
            ++  +K TPLIQN + PS+YYLSL+GI+VG   LP+  S F L  DGSGG+IIDS T+I
Sbjct: 267 ASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTI 326

Query: 319 TYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFELPSPNNSKLDVPDLIFRFEGLDLK 378
           TY+ + AFD++   FTSQ NLPV NS S GL++CF LPS  ++ ++VP L+F F+G DL+
Sbjct: 327 TYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPS-GSTDIEVPKLVFHFDGADLE 386

Query: 379 LPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQNMLVLHDLKKKVISFIPTQCSRI 432
           LP ENYM+ +  +G+ CLAMG++ G+SIFG++Q QNMLVLHDL+K+ +SF+PTQC  +
Sbjct: 387 LPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439

BLAST of ClCG08G005600 vs. TrEMBL
Match: B9SA95_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1697100 PE=3 SV=1)

HSP 1 Score: 430.6 bits (1106), Expect = 2.2e-117
Identity = 221/438 (50.46%), Postives = 297/438 (67.81%), Query Frame = 1

Query: 9   LLLLLLPVLHSQPAFSES--------SLKPGFHFDLTHVDATQNFTKFEMLRRGIERDKT 68
           LL LL+  L   PAFS S         LK GF   L HVD+ +N TKF+ ++ GI+R   
Sbjct: 12  LLSLLILSLSVYPAFSTSRRALSYPAQLKNGFRITLKHVDSDKNLTKFQRIQHGIKRANH 71

Query: 69  RIQNFKKMTV----NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPC 128
           R++    M +    N E+  P++   G +LMNL+ GTPP ++SAI DTGSDLIWTQC PC
Sbjct: 72  RLERLNAMVLAASSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPC 131

Query: 129 LECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIGYLAFE 188
            +CF QP+P++DP +SSSF+  + S+ LC+AL  SS S+ CEY Y YGD S T G +A E
Sbjct: 132 TQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCSDSCEYLYTYGDYSSTQGTMATE 191

Query: 189 TLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSYCLS- 248
           T T G    + S P++ FGC   N  +G TQG+G+VG  RGPLSL+SQL   KFSYCL+ 
Sbjct: 192 TFTFG----KVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTS 251

Query: 249 --PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPVPSS 308
                T  L  GS ++   N T+ A++ TPLIQNP  PS+YYLSL+GI+VG   LP+  S
Sbjct: 252 IDDTKTSTLLMGSLASV--NGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKES 311

Query: 309 WFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFELPSP 368
            F+L  DG+GG+IIDS T+ITY+ E AFD++K  FTSQ  LPV NS + GL+LC+ LPS 
Sbjct: 312 TFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPS- 371

Query: 369 NNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQNMLVL 428
           + S+L+VP L+  F G DL+LP ENYM+ +  +G++CLAMG++GG+SIFG++Q QNM V 
Sbjct: 372 DTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVS 431

Query: 429 HDLKKKVISFIPTQCSRI 432
           HDL+K+ +SF+PT C ++
Sbjct: 432 HDLEKETLSFLPTNCGQL 442

BLAST of ClCG08G005600 vs. TrEMBL
Match: A0A059B9W8_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H05072 PE=3 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 2.7e-115
Identity = 226/440 (51.36%), Postives = 300/440 (68.18%), Query Frame = 1

Query: 8   FLLLLLLPVLHSQPAFSES--------SLKPGFHFDLTHVDATQNFTKFEMLRRGIERDK 67
           FL+L +   L S P+FS S        + + GF   L HVD  +NFTKFE L+R ++R K
Sbjct: 12  FLVLAIFASLAS-PSFSTSRRALGSKEAKQIGFRVTLKHVDHGKNFTKFERLQRAMKRGK 71

Query: 68  TRIQNFKKMTV----NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNP 127
           +R+Q    M +    + E+  P+    G +LM LS GTP  SFSAI DTGSDLIWTQC P
Sbjct: 72  SRLQRLNAMVLAAGDSTELASPIHAGNGEFLMQLSIGTPADSFSAIVDTGSDLIWTQCKP 131

Query: 128 CLECFGQPTPVYDPAQSSSFTNATRSAALCQAL-TNSSSSNGCEYSYGYGDGSFTIGYLA 187
           C +CF Q TP++DP +SS+F+    S+ LC+AL T+S  ++GCEY Y YGD S T G LA
Sbjct: 132 CTQCFDQSTPIFDPKKSSTFSKLGCSSQLCEALPTSSCGTDGCEYLYTYGDYSSTQGILA 191

Query: 188 FETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSYCL 247
           ++T T  ++    S P + FGC   N  +G  QGAG+VG  RGPLSL+SQL + KFSYCL
Sbjct: 192 YDTFTFADS---VSVPKVGFGCGEDNEGSGFDQGAGLVGLGRGPLSLVSQLGVPKFSYCL 251

Query: 248 SP---NGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPVP 307
           +      T  L  GS +TS  N + +A+K TPLI+NP  PS+YYLSL+GI+VG   LP+ 
Sbjct: 252 TSIDDTATSKLLLGSEATS-GNLSTKAMKTTPLIKNPLQPSFYYLSLEGISVGDTLLPIK 311

Query: 308 SSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFELP 367
            S F L +DGSGGVIIDS T+ITYI E AFD++K  F SQT L V +S S GLDLCF+LP
Sbjct: 312 KSTFALQSDGSGGVIIDSGTTITYIEESAFDLVKKEFKSQTKLTVDDSGSAGLDLCFKLP 371

Query: 368 SPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQNML 427
           S ++S+++VP LIF FEG DL LP ENYM+ +  VG+VCLAMG++ G+SIFG++Q Q+ +
Sbjct: 372 S-DSSQVEVPKLIFHFEGADLDLPGENYMIADSTVGLVCLAMGSSSGMSIFGNVQQQDTM 431

Query: 428 VLHDLKKKVISFIPTQCSRI 432
           V+HDL K+ +SF+PT+C ++
Sbjct: 432 VIHDLAKETLSFLPTKCDKL 445

BLAST of ClCG08G005600 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 384.0 bits (985), Expect = 1.2e-106
Identity = 210/452 (46.46%), Postives = 278/452 (61.50%), Query Frame = 1

Query: 8   FLLLL--LLPVLHSQPAFSESSL-----KPGFHFDLTHVDATQNFTKFEMLRRGIERDKT 67
           FL+L   L+ V  S+ +  + +L     + GF   L HVD+ +N TK + ++RGI R   
Sbjct: 14  FLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKIQKIQRGINRGFH 73

Query: 68  RIQNFKKMTV---------NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWT 127
           R+     + V            ++ P     G +LM LS G P V +SAI DTGSDLIWT
Sbjct: 74  RLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWT 133

Query: 128 QCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSS---NGCEYSYGYGDGSF 187
           QC PC ECF QPTP++DP +SSS++    S+ LC AL  S+ +   + CEY Y YGD S 
Sbjct: 134 QCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSS 193

Query: 188 TIGYLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIR 247
           T G LA ET T  + N   S   I FGC + N  +G +QG+G+VG  RGPLSL+SQL   
Sbjct: 194 TRGLLATETFTFEDEN---SISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET 253

Query: 248 KFSYCLSP----NGTGCLAFGSSSTSFDNTTN-----QAVKNTPLIQNPSNPSYYYLSLQ 307
           KFSYCL+       +  L  GS ++   N T      +  K   L++NP  PS+YYL LQ
Sbjct: 254 KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQ 313

Query: 308 GITVGQKFLPVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINS 367
           GITVG K L V  S FEL  DG+GG+IIDS T+ITY+ E AF VLK  FTS+ +LPV +S
Sbjct: 314 GITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDS 373

Query: 368 ASIGLDLCFELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGL 427
            S GLDLCF+LP    + + VP +IF F+G DL+LP ENYMV +   G++CLAMG++ G+
Sbjct: 374 GSTGLDLCFKLPDAAKN-IAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM 433

Query: 428 SIFGSMQHQNMLVLHDLKKKVISFIPTQCSRI 432
           SIFG++Q QN  VLHDL+K+ +SF+PT+C ++
Sbjct: 434 SIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of ClCG08G005600 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 240.4 bits (612), Expect = 2.1e-63
Identity = 156/445 (35.06%), Postives = 227/445 (51.01%), Query Frame = 1

Query: 8   FLLLLLLPVLHSQPAFSESSLKP--GFHFDLTHVDATQN------FTKFEMLRRGIERDK 67
           F  +LL   L S    S ++ KP  GF  DL H D+ ++       T  + LR  I R  
Sbjct: 5   FSSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSV 64

Query: 68  TRIQNFKKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLEC 127
            R+ +F +     + ++ L    G YLMN+S GTPP    AI DTGSDL+WTQC PC +C
Sbjct: 65  NRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDC 124

Query: 128 FGQPTPVYDPAQSSSFTNATRSAALCQALTN----SSSSNGCEYSYGYGDGSFTIGYLAF 187
           + Q  P++DP  SS++ + + S++ C AL N    S++ N C YS  YGD S+T G +A 
Sbjct: 125 YTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAV 184

Query: 188 ETLTLGEAN-QQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIR---KFS 247
           +TLTLG ++ +     +I  GC   N      +G+GIVG   GP+SL+ QL      KFS
Sbjct: 185 DTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFS 244

Query: 248 YCLSP-----NGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQK 307
           YCL P     + T  + FG+++      +   V +TPLI   S  ++YYL+L+ I+VG K
Sbjct: 245 YCLVPLTSKKDQTSKINFGTNAI----VSGSGVVSTPLIAKASQETFYYLTLKSISVGSK 304

Query: 308 FLPVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDL 367
            +    S  E      G +IIDS T++T +  + +  L+    S  +         GL L
Sbjct: 305 QIQYSGSDSE---SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL 364

Query: 368 CFELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQ 427
           C+         L VP +   F+G D+KL   N  V   E  +VC A   +   SI+G++ 
Sbjct: 365 CYSA----TGDLKVPVITMHFDGADVKLDSSNAFVQVSE-DLVCFAFRGSPSFSIYGNVA 424

Query: 428 HQNMLVLHDLKKKVISFIPTQCSRI 432
             N LV +D   K +SF PT C+++
Sbjct: 425 QMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of ClCG08G005600 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 222.2 bits (565), Expect = 6.0e-58
Identity = 150/446 (33.63%), Postives = 234/446 (52.47%), Query Frame = 1

Query: 3   MANCCFLLLLLLPVLHSQPAFSESSLKPGFHFDLTHVDATQN------FTKFEMLRRGIE 62
           MA+  F  LL L +L +  A+     K GF  DL H D+ ++       T  + +R  I 
Sbjct: 1   MASLIFATLLSLLLLSNVNAYP----KDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIR 60

Query: 63  RDKTRIQNFKKMTVNFEVRMPLV-DQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNP 122
           R       F     +       +   +G YLMN+S GTPPV   AI DTGSDLIWTQCNP
Sbjct: 61  RSARSTLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNP 120

Query: 123 CLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSS---NGCEYSYGYGDGSFTIGY 182
           C +C+ Q +P++DP +SS++   + S++ C+AL ++S S   N C Y+  YGD S+T G 
Sbjct: 121 CEDCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGD 180

Query: 183 LAFETLTLGEANQQ-ASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIR--- 242
           +A +T+T+G + ++  S  ++  GC   N       G+GI+G   G  SL+SQL      
Sbjct: 181 VAVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSING 240

Query: 243 KFSYCLSP--NGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQK 302
           KFSYCL P  + TG  +  +  T+   + +  V  + + ++P+  +YY+L+L+ I+VG K
Sbjct: 241 KFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPA--TYYFLNLEAISVGSK 300

Query: 303 FLPVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDL 362
            +   S+ F     G G ++IDS T++T +  + +  L+ V  S      +      L L
Sbjct: 301 KIQFTSTIF---GTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSL 360

Query: 363 CFELPSPNNSKLDVPDLIFRFEGLDLKL-PVENYMVVNEEVGIVCLAMGAAGGLSIFGSM 422
           C+     ++S   VPD+   F+G D+KL  +  ++ V+E+V   C A  A   L+IFG++
Sbjct: 361 CYR----DSSSFKVPDITVHFKGGDVKLGNLNTFVAVSEDVS--CFAFAANEQLTIFGNL 420

Query: 423 QHQNMLVLHDLKKKVISFIPTQCSRI 432
              N LV +D     +SF  T CS++
Sbjct: 421 AQMNFLVGYDTVSGTVSFKKTDCSQM 431

BLAST of ClCG08G005600 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 216.5 bits (550), Expect = 3.3e-56
Identity = 134/387 (34.63%), Postives = 203/387 (52.45%), Query Frame = 1

Query: 62  IQNFKKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFG 121
           I   ++ T   +++  L+   G Y M++S GTPP    AI DTGSDL W QC PC +C+ 
Sbjct: 62  ISRSRRFTTKTDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYK 121

Query: 122 QPTPVYDPAQSSSFTNATRSAALCQALTN-----SSSSNGCEYSYGYGDGSFTIGYLAFE 181
           Q +P++D  +SS++   +  +  CQAL+        S + C+Y Y YGD SFT G +A E
Sbjct: 122 QNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATE 181

Query: 182 TLTL-GEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHI---RKFSY 241
           T+++   +    S P   FGC   N       G+GI+G   GPLSL+SQL     +KFSY
Sbjct: 182 TISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSY 241

Query: 242 CLS-----PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKF 301
           CLS      NGT  +  G++S   + + + A   TPLIQ     +YY+L+L+ +TVG+  
Sbjct: 242 CLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE-TYYFLTLEAVTVGKTK 301

Query: 302 LPVPSSWFELNADGS---GGVIIDSDTSITYITEDAFDVL-KPVFTSQTNLPVINSASIG 361
           LP     + LN   S   G +IIDS T++T +    +D     V  S T    ++     
Sbjct: 302 LPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGL 361

Query: 362 LDLCFELPSPNNSKLDVPDLIFRFEGLDLKL-PVENYMVVNEEVGIVCLAMGAAGGLSIF 421
           L  CF+     + ++ +P +   F   D+KL P+  ++ +NE+   VCL+M     ++I+
Sbjct: 362 LTHCFK---SGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDT--VCLSMIPTTEVAIY 421

Query: 422 GSMQHQNMLVLHDLKKKVISFIPTQCS 430
           G+M   + LV +DL+ K +SF    CS
Sbjct: 422 GNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of ClCG08G005600 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 216.1 bits (549), Expect = 4.3e-56
Identity = 142/446 (31.84%), Postives = 228/446 (51.12%), Query Frame = 1

Query: 11  LLLLPVLHSQPAFSESSLKPGFHFDLTHVDA------TQNFTKFEMLRRGIERDKTRIQN 70
           +LL   L      S S     F  +L H D+          T  + L     R  +R + 
Sbjct: 5   ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64

Query: 71  FKKMTVNFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFGQPT 130
           F       +++  L+   G + M+++ GTPP+   AI DTGSDL W QC PC +C+ +  
Sbjct: 65  FNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENG 124

Query: 131 PVYDPAQSSSFTNATRSAALCQALTNS-----SSSNGCEYSYGYGDGSFTIGYLAFETLT 190
           P++D  +SS++ +    +  CQAL+++      S+N C+Y Y YGD SF+ G +A ET++
Sbjct: 125 PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVS 184

Query: 191 LGEAN-QQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLH---IRKFSYCLS 250
           +  A+    S P   FGC   N       G+GI+G   G LSL+SQL     +KFSYCLS
Sbjct: 185 IDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLS 244

Query: 251 -----PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPV 310
                 NGT  +  G++S     + +  V +TPL+ +    +YYYL+L+ I+VG+K +P 
Sbjct: 245 HKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTYYYLTLEAISVGKKKIPY 304

Query: 311 PSSWFELNADG-----SGGVIIDSDTSITYITEDAFDVL-KPVFTSQTNLPVINSASIGL 370
             S +  N DG     SG +IIDS T++T +    FD     V  S T    ++     L
Sbjct: 305 TGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLL 364

Query: 371 DLCFELPSPNNSKLDVPDLIFRFEGLDLKL-PVENYMVVNEEVGIVCLAMGAAGGLSIFG 430
             CF+     ++++ +P++   F G D++L P+  ++ ++E+  +VCL+M     ++I+G
Sbjct: 365 SHCFK---SGSAEIGLPEITVHFTGADVRLSPINAFVKLSED--MVCLSMVPTTEVAIYG 424

BLAST of ClCG08G005600 vs. NCBI nr
Match: gi|743904624|ref|XP_011045692.1| (PREDICTED: aspartic proteinase nepenthesin-1-like [Populus euphratica])

HSP 1 Score: 443.0 bits (1138), Expect = 6.1e-121
Identity = 225/440 (51.14%), Postives = 304/440 (69.09%), Query Frame = 1

Query: 3   MANCCFLLLLLLPVLHSQPAFSES-------SLKPGFHFDLTHVDATQNFTKFEMLRRGI 62
           M + CF+L L +  +   PAFS S        ++ GF   L HVD+ +N TK E +R G+
Sbjct: 4   MTSLCFVLALAMFTIFFSPAFSTSRRALEHPEMQKGFRVRLKHVDSGKNLTKLERIRHGV 63

Query: 63  ERDKTRIQNFKKMTV----NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWT 122
           +R + R+Q  K M +    + E+  P++   G +LM L+ GTPP ++SAI DTGSDLIWT
Sbjct: 64  KRGRNRLQRLKTMALVASSSSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWT 123

Query: 123 QCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIG 182
           QC PC +CF Q TP++DP +SSSF+  + S+ LC+AL  SS SNGCEY Y YGD S T G
Sbjct: 124 QCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCSNGCEYLYSYGDYSSTQG 183

Query: 183 YLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFS 242
            LA ETLT G    +AS P++AFGC   N  +G +QGAG+VG  RGPLSL+SQL   KFS
Sbjct: 184 ILASETLTFG----KASVPNVAFGCGANNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFS 243

Query: 243 YCLS---PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFL 302
           YCL+      T  L  GS ++   N ++ A+K TPLI++P++PS+YYLSL+GI+VG   L
Sbjct: 244 YCLTSVDDAKTSTLLMGSLASV--NASSSAIKTTPLIRSPAHPSFYYLSLEGISVGDTRL 303

Query: 303 PVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCF 362
           P+  S F L  DGSGG+IIDS T+ITY+ E AF+++   FTSQ NLPV +S S GLD+CF
Sbjct: 304 PIKKSTFSLQDDGSGGLIIDSGTTITYLEERAFNLVAKEFTSQINLPVDSSGSTGLDVCF 363

Query: 363 ELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQ 422
            LPS  ++ ++VP L+F F+G DL+LP ENYM+ +  +G+ CLAMG++ G+SIFG++Q Q
Sbjct: 364 TLPS-GSTNIEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQ 423

Query: 423 NMLVLHDLKKKVISFIPTQC 429
           NMLVLHDL+K+ +SF+PTQC
Sbjct: 424 NMLVLHDLEKETLSFLPTQC 436

BLAST of ClCG08G005600 vs. NCBI nr
Match: gi|731428103|ref|XP_010664223.1| (PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera])

HSP 1 Score: 441.0 bits (1133), Expect = 2.3e-120
Identity = 226/442 (51.13%), Postives = 294/442 (66.52%), Query Frame = 1

Query: 2   PMANCCFLLLLL----LPVLHS--QPAFSESSLKPGFHFDLTHVDATQNFTKFEMLRRGI 61
           P+ +  FLLL+L    LP   S   P F    L+ GF   L H+DA +NFT+ ++++RGI
Sbjct: 7   PLPSLPFLLLILTVLGLPRSSSTLMPGFRRQQLETGFQVGLRHIDAGRNFTRLQLIQRGI 66

Query: 62  ERDKTRIQNFKKMTVNFE---VRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQ 121
            R + R+Q    M    E    + P+    G +++NL  GTPPV F AI DTGSDLIWTQ
Sbjct: 67  NRGRQRLQRMSGMATTAERNGFQAPVHVGDGEFVVNLMIGTPPVPFPAIMDTGSDLIWTQ 126

Query: 122 CNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIGY 181
           CNPC  CF Q TPV++P +SS+F+N + S+ LC+ +  S     CEY Y YGD S T G+
Sbjct: 127 CNPCKLCFQQSTPVFNPKRSSTFSNISCSSKLCKGVKPSKCDKSCEYRYTYGDESSTEGF 186

Query: 182 LAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSY 241
           +A +T+T GE  ++ S P I FGC + N   G+ Q AG++G  RG LSL+SQL  +KFSY
Sbjct: 187 MAMDTITFGELPKRVSIPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLGTQKFSY 246

Query: 242 CLSP---NGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLP 301
           CL+    N T  L FGS   ++ N     +  TPLIQNP  PSYYYL+L+GITVG   LP
Sbjct: 247 CLTSIHENKTSSLLFGS--LAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLP 306

Query: 302 VPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFE 361
           +P   F+L  DGSGG+I+DS T+ITY+ EDAFDVLK  F SQT L V NS++ GLDLCF 
Sbjct: 307 IPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFH 366

Query: 362 LPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQN 421
           LP  N +++ VP LIF F+GLDL LPVENYMV + E+G++CLA+ A G LSIFG++Q QN
Sbjct: 367 LPVKNAAEVKVPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDATGSLSIFGNIQQQN 426

Query: 422 MLVLHDLKKKVISFIPTQCSRI 432
           MLVLHDLKK  +S +PTQC ++
Sbjct: 427 MLVLHDLKKSTLSLVPTQCDKV 446

BLAST of ClCG08G005600 vs. NCBI nr
Match: gi|566152021|ref|XP_002300215.2| (aspartyl protease family protein [Populus trichocarpa])

HSP 1 Score: 437.2 bits (1123), Expect = 3.4e-119
Identity = 221/440 (50.23%), Postives = 302/440 (68.64%), Query Frame = 1

Query: 3   MANCCFLLLLLLPVLHSQPAFSES-------SLKPGFHFDLTHVDATQNFTKFEMLRRGI 62
           M + CF+L L +  +   PAFS S        ++ GF   L HVD+ +N TK E +R G+
Sbjct: 4   MTSLCFVLALAMFTIFFSPAFSTSRRALEHPKMQKGFRVRLKHVDSGKNLTKLERIRHGV 63

Query: 63  ERDKTRIQNFKKMTV----NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWT 122
           +R + R+Q  + M +    + E+  P++   G +LM L+ GTPP ++SAI DTGSDLIWT
Sbjct: 64  KRGRNRLQRLQAMALVASSSSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWT 123

Query: 123 QCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIG 182
           QC PC +CF Q TP++DP +SSSF+  + S+ LC+AL  SS +NGCEY Y YGD S T G
Sbjct: 124 QCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNGCEYLYSYGDYSSTQG 183

Query: 183 YLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFS 242
            LA ETLT G    +AS P +AFGC   N  +G +QGAG+VG  RGPLSL+SQL   KFS
Sbjct: 184 ILASETLTFG----KASVPHVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFS 243

Query: 243 YCLS---PNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFL 302
           YCL+      T  L  GS ++   N ++ A+K TPLI +P++PS+YYLSL+GI+VG   L
Sbjct: 244 YCLTTVDDTKTSTLLMGSLASV--NASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRL 303

Query: 303 PVPSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCF 362
           P+  S F L  DGSGG+IIDS T+ITY+ E AF+++   FT++ NLPV +S S GLD+CF
Sbjct: 304 PIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCF 363

Query: 363 ELPSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQ 422
            LPS  ++ ++VP L+F F+G DL+LP ENYM+ +  +G+ CLAMG++ G+SIFG++Q Q
Sbjct: 364 TLPS-GSTNIEVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQ 423

Query: 423 NMLVLHDLKKKVISFIPTQC 429
           NMLVLHDL+K+ +SF+PTQC
Sbjct: 424 NMLVLHDLEKETLSFLPTQC 436

BLAST of ClCG08G005600 vs. NCBI nr
Match: gi|743905840|ref|XP_011046330.1| (PREDICTED: aspartic proteinase nepenthesin-1-like [Populus euphratica])

HSP 1 Score: 435.6 bits (1119), Expect = 9.8e-119
Identity = 220/441 (49.89%), Postives = 300/441 (68.03%), Query Frame = 1

Query: 3   MANCCFLLLLLLPVLHSQPAFSES-------SLKPGFHFDLTHVDATQNFTKFEMLRRGI 62
           M++  F++ L +  L    AFS S         + GF   L HVD+ +N TKFE ++ G+
Sbjct: 4   MSSLSFVVALAIFALVFSHAFSTSRRVLEHPKAQNGFRVKLKHVDSGKNLTKFERIQHGV 63

Query: 63  ERDKTRIQNFKKMTV----NFEVRMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWT 122
           +R + R+Q FK M +    N E+  P++   G +LMNL+ GTPP ++SAI DTGSDLIWT
Sbjct: 64  KRGRHRLQRFKAMALVASSNSEIDAPVLPGNGEFLMNLAIGTPPATYSAIMDTGSDLIWT 123

Query: 123 QCNPCLECFGQPTPVYDPAQSSSFTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIG 182
           QC PC +CF QPTP++DP +SSSF+  + S+ LC+AL  S+ S+GCEY YGYGD S T G
Sbjct: 124 QCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDGCEYLYGYGDYSSTQG 183

Query: 183 YLAFETLTLGEANQQASTPDIAFGCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFS 242
            LA ETLT G    + S P +AFGC   N  +G +QG+G+VG  RGPLSL+SQL   KFS
Sbjct: 184 ILASETLTFG----KVSVPKVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFS 243

Query: 243 YCL-SPNGTGCLAFGSSSTSFDNTTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPV 302
           YCL S + T        S +    ++  +K+TPLIQN + PS+YYLSL+GI+VG   LP+
Sbjct: 244 YCLTSVDDTKASTLLMGSLASVKASDSEIKSTPLIQNSAQPSFYYLSLEGISVGDTSLPI 303

Query: 303 PSSWFELNADGSGGVIIDSDTSITYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFEL 362
             S F L  DGSGG+IIDS T+ITY+ + AFD++   FTSQ NLPV NS + GL++CF L
Sbjct: 304 KKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVSKEFTSQMNLPVDNSGATGLEVCFTL 363

Query: 363 PSPNNSKLDVPDLIFRFEGLDLKLPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQNM 422
           PS  ++ ++VP L+F F+G DL+LP ENYM+ +  +G+ CLAMG++ G+SIFG++Q QNM
Sbjct: 364 PS-GSTDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNM 423

Query: 423 LVLHDLKKKVISFIPTQCSRI 432
           LVLHDL+K+ +SF+P QC  +
Sbjct: 424 LVLHDLEKETLSFLPAQCDEL 439

BLAST of ClCG08G005600 vs. NCBI nr
Match: gi|566222317|ref|XP_006370905.1| (aspartyl protease family protein [Populus trichocarpa])

HSP 1 Score: 433.7 bits (1114), Expect = 3.7e-118
Identity = 214/418 (51.20%), Postives = 290/418 (69.38%), Query Frame = 1

Query: 19  SQPAFSESSLKPGFHFDLTHVDATQNFTKFEMLRRGIERDKTRIQNFKKMTV----NFEV 78
           S+       ++ GF   L HVD+ +N TKFE ++ G++R + R+Q FK M +    N E+
Sbjct: 27  SRRVLEHPKVQNGFRAKLKHVDSGKNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEI 86

Query: 79  RMPLVDQQGSYLMNLSFGTPPVSFSAIFDTGSDLIWTQCNPCLECFGQPTPVYDPAQSSS 138
             P++   G +LM L+ GTPP ++SAI DTGSDLIWTQC PC +CF QPTP++DP +SSS
Sbjct: 87  DAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSS 146

Query: 139 FTNATRSAALCQALTNSSSSNGCEYSYGYGDGSFTIGYLAFETLTLGEANQQASTPDIAF 198
           F+  + S+ LC+AL  S+ S+GCEY YGYGD S T G LA ETLT G    + S P++AF
Sbjct: 147 FSKLSCSSKLCEALPQSTCSDGCEYLYGYGDYSSTQGMLASETLTFG----KVSVPEVAF 206

Query: 199 GCRIRNYVNGLTQGAGIVGFSRGPLSLLSQLHIRKFSYCL-SPNGTGCLAFGSSSTSFDN 258
           GC   N  +G +QG+G+VG  RGPLSL+SQL   KFSYCL S + T        S +   
Sbjct: 207 GCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASVK 266

Query: 259 TTNQAVKNTPLIQNPSNPSYYYLSLQGITVGQKFLPVPSSWFELNADGSGGVIIDSDTSI 318
            ++  +K TPLIQN + PS+YYLSL+GI+VG   LP+  S F L  DGSGG+IIDS T+I
Sbjct: 267 ASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTI 326

Query: 319 TYITEDAFDVLKPVFTSQTNLPVINSASIGLDLCFELPSPNNSKLDVPDLIFRFEGLDLK 378
           TY+ + AFD++   FTSQ NLPV NS S GL++CF LPS  ++ ++VP L+F F+G DL+
Sbjct: 327 TYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPS-GSTDIEVPKLVFHFDGADLE 386

Query: 379 LPVENYMVVNEEVGIVCLAMGAAGGLSIFGSMQHQNMLVLHDLKKKVISFIPTQCSRI 432
           LP ENYM+ +  +G+ CLAMG++ G+SIFG++Q QNMLVLHDL+K+ +SF+PTQC  +
Sbjct: 387 LPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NEP1_NEPGR1.2e-9745.77Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR1.5e-9544.72Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
CDR1_ARATH3.8e-6235.06Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1[more]
AP37_ORYSJ8.7e-5933.33Aspartyl protease 37 OS=Oryza sativa subsp. japonica GN=AP37 PE=3 SV=2[more]
ASPR1_ARATH7.7e-5531.84Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S... [more]
Match NameE-valueIdentityDescription
F6H0S5_VITVI1.6e-12051.13Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g03090 PE=3 SV=... [more]
B9GH97_POPTR2.4e-11950.23Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0001s31370g PE=... [more]
B9N2J4_POPTR2.6e-11851.20Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0019s01600g PE=... [more]
B9SA95_RICCO2.2e-11750.46Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1697100 ... [more]
A0A059B9W8_EUCGR2.7e-11551.36Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H05072 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G03200.11.2e-10646.46 Eukaryotic aspartyl protease family protein[more]
AT5G33340.12.1e-6335.06 Eukaryotic aspartyl protease family protein[more]
AT1G64830.16.0e-5833.63 Eukaryotic aspartyl protease family protein[more]
AT1G31450.13.3e-5634.63 Eukaryotic aspartyl protease family protein[more]
AT2G35615.14.3e-5631.84 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|743904624|ref|XP_011045692.1|6.1e-12151.14PREDICTED: aspartic proteinase nepenthesin-1-like [Populus euphratica][more]
gi|731428103|ref|XP_010664223.1|2.3e-12051.13PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera][more]
gi|566152021|ref|XP_002300215.2|3.4e-11950.23aspartyl protease family protein [Populus trichocarpa][more]
gi|743905840|ref|XP_011046330.1|9.8e-11949.89PREDICTED: aspartic proteinase nepenthesin-1-like [Populus euphratica][more]
gi|566222317|ref|XP_006370905.1|3.7e-11851.20aspartyl protease family protein [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0044260 cellular macromolecule metabolic process
biological_process GO:0006508 proteolysis
biological_process GO:0044699 single-organism process
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG08G005600.1ClCG08G005600.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 1..432
score: 1.2E
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 80..246
score: 5.1E-33coord: 255..430
score: 4.0
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 80..430
score: 1.56
NoneNo IPR availablePANTHERPTHR13683:SF324SUBFAMILY NOT NAMEDcoord: 1..432
score: 1.2E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
ClCG08G005600Cla013946Watermelon (97103) v1wcgwmB400
ClCG08G005600Cla97C08G148380Watermelon (97103) v2wcgwmbB336
ClCG08G005600Bhi04G001526Wax gourdwcgwgoB624
ClCG08G005600Lsi08G004290Bottle gourd (USVL1VR-Ls)lsiwcgB439
The following gene(s) are paralogous to this gene:

None