ClCG02G018840 (gene) Watermelon (Charleston Gray)

NameClCG02G018840
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionEukaryotic aspartyl protease family protein LENGTH=461
LocationCG_Chr02 : 33596902 .. 33598677 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAACGCATTGGGCGGCCATGACCAAGAAACTGTAAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGATCGGATCAAGGATATTCACGATCACGACCTCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGGCAGCGAAGGATCCGATACTTCCACCGACGTCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGAACACCGCCTCAGACGTTCATGTTGATCGTGGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAAGAGCCGAAACGAACGGAAAATGAGATTTAGAAATGCGTTTTTGGCAAATTATTCCTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGACCTCGCAGATCTGTTCTCAATTGGGGAATGCAAAACCCCAACTAGCCCTTGTATCTATGATTACAGGTAGGTACGTAAGATTGATTCACCAAAAACTAAATTCCTTACCCCTCAACTCAACTGACTTATTTAGAGTTGGAGGGGGCGAGGTGCGGTGCAACCCGACTACACCTAAGTAACTTAGGTGAAGTATAAAATAAACTAACAACCGAAGGTGTAAGTTTTTATTTATTTATTTGATTATTATTATTATTAAATTTTGGAATGGAAACAGCTACGCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGACCTAACAAACGGAAAAGAAAAACAGCTCCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTTGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCCCCTACTCTTTTACCTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTCGGCGCCCCTTCCCCCTCCGCTTCCGCTGCTGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGTCGATCTCATTGCAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTTCGATATGGTCATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAGTACACCCATGAAATGGCCCCGAAGATCCGATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTATTGGGTTCGTTTCTATGCCTTTTCCGGCCTACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCTCTGAATGCGTCTAAAAACTTCTTTCAATTTCTTCATCATCATCTTCTTCTTC

mRNA sequence

ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAACGCATTGGGCGGCCATGACCAAGAAACTGTAAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGATCGGATCAAGGATATTCACGATCACGACCTCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGGCAGCGAAGGATCCGATACTTCCACCGACGTCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGAACACCGCCTCAGACGTTCATGTTGATCGTGGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAAGAGCCGAAACGAACGGAAAATGAGATTTAGAAATGCGTTTTTGGCAAATTATTCCTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGACCTCGCAGATCTGTTCTCAATTGGGGAATGCAAAACCCCAACTAGCCCTTGTATCTATGATTACAGCTACGCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGACCTAACAAACGGAAAAGAAAAACAGCTCCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTTGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCCCCTACTCTTTTACCTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTCGGCGCCCCTTCCCCCTCCGCTTCCGCTGCTGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGTCGATCTCATTGCAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTTCGATATGGTCATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAGTACACCCATGAAATGGCCCCGAAGATCCGATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTATTGGGTTCGTTTCTATGCCTTTTCCGGCCTACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCTCTGAATGCGTCTAAAAACTTCTTTCAATTTCTTCATCATCATCTTCTTCTTC

Coding sequence (CDS)

ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAACGCATTGGGCGGCCATGACCAAGAAACTGTAAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGATCGGATCAAGGATATTCACGATCACGACCTCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGGCAGCGAAGGATCCGATACTTCCACCGACGTCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGAACACCGCCTCAGACGTTCATGTTGATCGTGGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAAGAGCCGAAACGAACGGAAAATGAGATTTAGAAATGCGTTTTTGGCAAATTATTCCTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGACCTCGCAGATCTGTTCTCAATTGGGGAATGCAAAACCCCAACTAGCCCTTGTATCTATGATTACAGCTACGCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGACCTAACAAACGGAAAAGAAAAACAGCTCCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTTGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCCCCTACTCTTTTACCTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTCGGCGCCCCTTCCCCCTCCGCTTCCGCTGCTGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGTCGATCTCATTGCAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTTCGATATGGTCATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAGTACACCCATGAAATGGCCCCGAAGATCCGATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTATTGGGTTCGTTTCTATGCCTTTTCCGGCCTACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCTCTGAATGCGTCTAA

Protein sequence

MLGYRKPMSPISHFCFFFLFFFLSVHNALGGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
BLAST of ClCG02G018840 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 3.6e-35
Identity = 112/396 (28.28%), Postives = 180/396 (45.45%), Query Frame = 1

Query: 119 MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNE 178
           ++SG+  GS EYF ++ VGTP +   L++DTGSD+ WI+C       +C  +++      
Sbjct: 151 VVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQC---EPCADCYQQSD------ 210

Query: 179 RKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSYAGGASAK 238
                   F    SS++K++ CS+  C+     L     C+  ++ C+Y  SY  G+   
Sbjct: 211 ------PVFNPTSSSTYKSLTCSAPQCS-----LLETSACR--SNKCLYQVSYGDGSFTV 270

Query: 239 GIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENA 298
           G  A +T+T     G   +++N  +GC    +G +F GA G++GLG    S T       
Sbjct: 271 GELATDTVTF----GNSGKINNVALGCGHDNEG-LFTGAAGLLGLGGGVLSIT----NQM 330

Query: 299 NGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYG 358
               FSYCLVD  S               S+S   +SV   G      L      ++FY 
Sbjct: 331 KATSFSYCLVDRDS-------------GKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYY 390

Query: 359 VDLIAISADGVMLNIPPRVWDINS--GGGTILDSGTSLTMLAAPAFDMVMEA---LTPKL 418
           V L   S  G  + +P  ++D+++   GG ILD GT++T L   A++ + +A   LT  L
Sbjct: 391 VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL 450

Query: 419 KHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEY-ISCIGFV 478
           K      +  F  C++ S  +    P + FHF  G     P K+Y++   +    C  F 
Sbjct: 451 KKGSS-SISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFA 500

Query: 479 SMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
                + +IIGN+ QQ     +D  +  +G + ++C
Sbjct: 511 PTS-SSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of ClCG02G018840 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 4.6e-35
Identity = 124/408 (30.39%), Postives = 177/408 (43.38%), Query Frame = 1

Query: 111 SPTPIGLK--MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCR-YRRCIGNC 170
           +P P G    ++SG   GS EYF +L VGTP +   +++DTGSD+ W++C   RRC    
Sbjct: 121 APRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQS 180

Query: 171 SSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIY 230
                              F    S ++ TI CSS  C      L S G C T    C+Y
Sbjct: 181 DP----------------IFDPRKSKTYATIPCSSPHCRR----LDSAG-CNTRRKTCLY 240

Query: 231 DYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSP 290
             SY  G+   G F+ ETLT      +  ++    +GC    +G +F GA G++GLG   
Sbjct: 241 QVSYGDGSFTVGDFSTETLTF-----RRNRVKGVALGCGHDNEG-LFVGAAGLLGLGKGK 300

Query: 291 YSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKL 350
            SF  +     N   FSYCLVD              A S  +S    +   S    FT L
Sbjct: 301 LSFPGQTGHRFN-QKFSYCLVDR------------SASSKPSSVVFGNAAVSRIARFTPL 360

Query: 351 FVGDPYNSFYGVDLIAISADGVML-NIPPRVWDIN--SGGGTILDSGTSLTMLAAPAFDM 410
                 ++FY V L+ IS  G  +  +   ++ ++    GG I+DSGTS+T L  PA+  
Sbjct: 361 LSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIA 420

Query: 411 VMEALTPKLKHFEQI-EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVV--- 470
           + +A     K  ++  +   F  CF+ S       P +  HF  G     P  +Y++   
Sbjct: 421 MRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVD 480

Query: 471 SAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           + G++  C  F        +IIGNI QQ     +D    +VGFAP  C
Sbjct: 481 TNGKF--CFAFAG-TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of ClCG02G018840 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.5e-33
Identity = 111/399 (27.82%), Postives = 177/399 (44.36%), Query Frame = 1

Query: 116 GLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKS 175
           G  ++SG D GS EYFV++ VG+PP+   +++D+GSD+ W++C+          K  +K 
Sbjct: 117 GSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ--------PCKLCYKQ 176

Query: 176 RNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSYAGGA 235
            +         F    S S+  + C S+ C     D      C   +  C Y+  Y  G+
Sbjct: 177 SDP-------VFDPAKSGSYTGVSCGSSVC-----DRIENSGCH--SGGCRYEVMYGDGS 236

Query: 236 SAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAA 295
             KG  A+ETLT   T      + N  +GC    +G +F GA G++G+G    SF  + +
Sbjct: 237 YTKGTLALETLTFAKT-----VVRNVAMGCGHRNRG-MFIGAAGLLGIGGGSMSFVGQLS 296

Query: 296 ENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS 355
               GG F YCLV              G  S  +       +P G  ++  L       S
Sbjct: 297 -GQTGGAFGYCLVSR------------GTDSTGSLVFGREALPVG-ASWVPLVRNPRAPS 356

Query: 356 FYGVDLIAISADGVMLNIPPRVWDI--NSGGGTILDSGTSLTMLAAPAFDMVMEALTPKL 415
           FY V L  +   GV + +P  V+D+     GG ++D+GT++T L   A+    +    + 
Sbjct: 357 FYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQT 416

Query: 416 KHFEQIE-VEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVV---SAGEYISCI 475
            +  +   V  F  C++ S +     P + F+F +G +   P +++++    +G Y  C 
Sbjct: 417 ANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTY--CF 470

Query: 476 GFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
            F + P    +IIGNI Q+     FD     VGF P+ C
Sbjct: 477 AFAASP-TGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of ClCG02G018840 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 9.7e-33
Identity = 109/383 (28.46%), Postives = 167/383 (43.60%), Query Frame = 1

Query: 127 SSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNA 186
           S EY + + +GTPP   M I DTGSDL W +C       +C ++ +              
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAP---CDDCYTQVDP------------L 146

Query: 187 FLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETL 246
           F    SS++K + CSS+ CT     L +   C T  + C Y  SY   +  KG  A++TL
Sbjct: 147 FDPKTSSTYKDVSCSSSQCTA----LENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTL 206

Query: 247 TVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYC 306
           T+  ++ +  QL N IIGC  +  G       G++GLG  P S   +  ++ + G FSYC
Sbjct: 207 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSID-GKFSYC 266

Query: 307 LVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISA 366
           LV   S    TS    G          +++V    +  T L       +FY + L +IS 
Sbjct: 267 LVPLTSKKDQTSKINFG---------TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISV 326

Query: 367 DGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKL-KHFEQIEVEPFK 426
               +       + +S G  I+DSGT+LT+L    +  + +A+   +    +Q       
Sbjct: 327 GSKQIQYSGSDSE-SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 386

Query: 427 FCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNI 486
            C+  S       P I  HF DG   +    +  V   E + C  F     P+++I GN+
Sbjct: 387 LCY--SATGDLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCFAFRGS--PSFSIYGNV 434

Query: 487 LQQNHLWQFDFFRRKVGFAPSEC 509
            Q N L  +D   + V F P++C
Sbjct: 447 AQMNFLVGYDTVSKTVSFKPTDC 434

BLAST of ClCG02G018840 vs. Swiss-Prot
Match: AED3_ARATH (Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1)

HSP 1 Score: 139.4 bits (350), Expect = 1.1e-31
Identity = 109/404 (26.98%), Postives = 168/404 (41.58%), Query Frame = 1

Query: 112 PTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKA 171
           P P  + + SG+      Y V+ K+GTPPQ   +++DT +D  W+ C        CS  +
Sbjct: 86  PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPC------SGCSGCS 145

Query: 172 NHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSY 231
           N  +          +F  N SS++ T+ CS+  CT   A   +        S C ++ SY
Sbjct: 146 NAST----------SFNTNSSSTYSTVSCSTAQCTQ--ARGLTCPSSSPQPSVCSFNQSY 205

Query: 232 AGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFT 291
            G +S       +TLT+         + N   GC  S  G       G++GLG  P S  
Sbjct: 206 GGDSSFSASLVQDTLTL-----APDVIPNFSFGCINSASGNSL-PPQGLMGLGRGPMSLV 265

Query: 292 YKAAENANGGGFSYCLVDHLSHHTATS--YFILGAPSPSASAAASSVVPSGNMTFTKLFV 351
            +   +   G FSYCL    S + + S    +LG P               ++ +T L  
Sbjct: 266 SQTT-SLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPK--------------SIRYTPLLR 325

Query: 352 GDPYNSFYGVDLIAISADGVMLNIPP--RVWDINSGGGTILDSGTSLTMLAAPAFDMVME 411
                S Y V+L  +S   V + + P    +D NSG GTI+DSGT +T  A P ++ + +
Sbjct: 326 NPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRD 385

Query: 412 ALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYIS 471
               ++       +  F  CF  S     +APKI  H     +  P   + + S+   ++
Sbjct: 386 EFRKQVNVSSFSTLGAFDTCF--SADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLT 445

Query: 472 CIGFVSMPFPA---YNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           C+    +   A    N+I N+ QQN    FD    ++G AP  C
Sbjct: 446 CLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448

BLAST of ClCG02G018840 vs. TrEMBL
Match: A0A0A0KG92_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134390 PE=3 SV=1)

HSP 1 Score: 817.8 bits (2111), Expect = 7.5e-234
Identity = 393/537 (73.18%), Postives = 455/537 (84.73%), Query Frame = 1

Query: 1   MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETV 60
           MLGYRKPMSPIS+FCFFF    LFFFLS  +    ALG  D               QE +
Sbjct: 1   MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEII 60

Query: 61  KLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAE 120
           K DLLHRHHPQVAEK+HG+MK++D+++R+KDIH+HD  R+++IS S+N+KQ+E+ +L+AE
Sbjct: 61  KFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAE 120

Query: 121 AEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSD 180
           AE     E AK  ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLI DTGSD
Sbjct: 121 AEAATEEEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSD 180

Query: 181 LTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADL 240
           LTW+KCRYRRC GNCSS  NHKS+NE+K RFR+AFLAN+SSSFKT+ CSST CT DLADL
Sbjct: 181 LTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADL 240

Query: 241 FSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGR 300
           F++ EC  PTSPC+YDYSY GGASAKGIFA ETLTV LTNGKEKQLHNSIIGCTESVQG 
Sbjct: 241 FAVRECHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS 300

Query: 301 IFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAA 360
           +FGGADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+
Sbjct: 301 VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSAS 360

Query: 361 ASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGT 420
            SS      MT+TKL+VGDPY+SFYGVDLI ISA+G+MLNIP RVWDINSGGGTI+DSGT
Sbjct: 361 TSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGT 420

Query: 421 SLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQ 480
           SLT+LAAPAFDMVMEALTP+LK F+Q+E+EPF FCFNNSQYTHEMAPK+RFHFGDGT+F+
Sbjct: 421 SLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFE 480

Query: 481 PPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV 510
           PP KSY+VS G++ISCIGFVSMPFPA NIIGNILQQNHLWQFDF +R+VGFAPSEC+
Sbjct: 481 PPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI 537

BLAST of ClCG02G018840 vs. TrEMBL
Match: F6H9S0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0085g01110 PE=3 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 5.6e-112
Identity = 217/479 (45.30%), Postives = 300/479 (62.63%), Query Frame = 1

Query: 32  HDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE 91
           ++ +T++L+L+HRH PQV  +    ++      R+K++   D  R   I   L   QI  
Sbjct: 36  YNSDTMRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQIPR 95

Query: 92  KLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS 151
           + KA+  + ++        S   I + M   +DYG  +Y V  KVGTP Q FML+ DTGS
Sbjct: 96  R-KAKEVLSSSSGR----GSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGS 155

Query: 152 DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLAD 211
           DLTW+ C+Y     NCS+      R  R++R +  F AN SSSFKTI C +  C  +L D
Sbjct: 156 DLTWMSCKYHCRSRNCSN------RKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMD 215

Query: 212 LFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQG 271
           LFS+  C TP +PC YDY Y+ G++A G FA ET+TV+L  G++ +LHN +IGC+ES QG
Sbjct: 216 LFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQG 275

Query: 272 RIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASA 331
           + F  ADGV+GLG S YSF  KAAE   GG FSYCLVDHLSH   ++Y   G       +
Sbjct: 276 QSFQAADGVMGLGYSKYSFAIKAAEKF-GGKFSYCLVDHLSHKNVSNYLTFG-------S 335

Query: 332 AASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG 391
           + S      NMT+T+L +G   NSFY V+++ IS  G ML IP  VWD+   GGTILDSG
Sbjct: 336 SRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSG 395

Query: 392 TSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFNNSQYTHEMAPKIRFHFGDGT 451
           +SLT L  PA+  VM AL   L  F ++E++  P ++CFN++ +   + P++ FHF DG 
Sbjct: 396 SSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGA 455

Query: 452 MFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
            F+PPVKSYV+SA + + C+GFVS+ +P  +++GNI+QQNHLW+FD   +K+GFAPS C
Sbjct: 456 EFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 488

BLAST of ClCG02G018840 vs. TrEMBL
Match: A5BLS9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_015630 PE=3 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 9.5e-112
Identity = 217/474 (45.78%), Postives = 297/474 (62.66%), Query Frame = 1

Query: 37  VKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAE 96
           ++L+L+HRH PQV  +    ++      R+K++   D  R   I   L   QI  + KA+
Sbjct: 1   MRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQIPRR-KAK 60

Query: 97  AEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWI 156
             + ++        S   I + M   +DYG  +YFV  KVGTP Q FML+ DTGSDLTW+
Sbjct: 61  EVLSSSSGR----GSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWM 120

Query: 157 KCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIG 216
            C+Y     NCS+      R  R++R +  F AN SSSFKTI C +  C  +L DLFS+ 
Sbjct: 121 SCKYHCRSRNCSN------RKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLT 180

Query: 217 ECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGG 276
            C TP +PC YDY Y+ G++A G FA ET+TV+L  G++ +LHN +IGC+ES QG+ F  
Sbjct: 181 NCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQA 240

Query: 277 ADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSV 336
           ADGV+GLG S YSF  KAAE   GG FSYCLVDHLSH   ++Y   G       ++ S  
Sbjct: 241 ADGVMGLGYSKYSFAIKAAEKF-GGKFSYCLVDHLSHKNVSNYLTFG-------SSRSKE 300

Query: 337 VPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTM 396
               NMT+T+L +G   NSFY V+++ IS  G ML IP  VWD+   GGTILDSG+SLT 
Sbjct: 301 ALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTF 360

Query: 397 LAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPP 456
           L  PA+  VM AL   L  F ++E++  P ++CFN++ +   + P++ FHF DG  F+PP
Sbjct: 361 LTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPP 420

Query: 457 VKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           VKSYV+SA + + C+GFVS+ +P  +++GNI+QQNHLW+FD   +K+GFAPS C
Sbjct: 421 VKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448

BLAST of ClCG02G018840 vs. TrEMBL
Match: A0A0B0NTS3_GOSAR (Asparticase nepenthesin-1 OS=Gossypium arboreum GN=F383_00615 PE=3 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 3.4e-101
Identity = 206/479 (43.01%), Postives = 286/479 (59.71%), Query Frame = 1

Query: 32  HDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE 91
           HD  ++ L+L+HRH PQ       N      + R+ D+  HD+ R+  +S   +R++ +E
Sbjct: 31  HDSNSITLELIHRHAPQFT-----NNNPITQHQRLVDLLYHDIIRHGIMS---HRRRAKE 90

Query: 92  KLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS 151
           +           DP+        I + + SG D+G  +Y    KVGTP Q F LIVDTGS
Sbjct: 91  E-----------DPLT-----ASIKMPLASGRDFGIGQYITSFKVGTPSQKFWLIVDTGS 150

Query: 152 DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLAD 211
           DLTWI+CRYR   G+ S  +  K R  RK      F A  SSSF  + C S  C  +L +
Sbjct: 151 DLTWIRCRYRCSRGDRSCTS--KGRINRK----RVFHAPLSSSFNPVPCFSEMCKVELMN 210

Query: 212 LFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQG 271
           LFS+  C TP +PC YDY Y+ G++A G+FA ET++  LTNG++ +LHN +IGCT+S QG
Sbjct: 211 LFSLTTCPTPITPCAYDYRYSDGSAAMGVFANETVSAGLTNGRKTRLHNVLIGCTDSFQG 270

Query: 272 RIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASA 331
                 DG++GL  + YSF   AA    GG FSYCLVDHLSH  AT+Y I G        
Sbjct: 271 PTLQNVDGIMGLANTKYSFATNAAATF-GGKFSYCLVDHLSHLNATNYIIFG-------T 330

Query: 332 AASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG 391
             + V  SGN   TKL + D   SFY V++I IS    ML IP +VWD + GGGTI+DSG
Sbjct: 331 NRNQVKVSGNTRHTKLEL-DAIPSFYAVNVIGISVGNKMLEIPMQVWDASEGGGTIIDSG 390

Query: 392 TSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFNNSQYTHEMAPKIRFHFGDGT 451
           TSLT LA PA+  VMEAL   +  +++++++  P ++CFN++ +   + PK+  HF DG 
Sbjct: 391 TSLTFLADPAYQAVMEALKVSVSKYQRVKLDGVPMEYCFNSTGFNGSLVPKLIIHFDDGA 450

Query: 452 MFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
            F+P   SYV++A   + C+GF+   FPA ++IGNI+QQN+LW+FD   +++ FAPS C
Sbjct: 451 RFEPHWNSYVIAAAAEVRCLGFLPARFPALSVIGNIMQQNYLWEFDLKGKRLVFAPSSC 470

BLAST of ClCG02G018840 vs. TrEMBL
Match: A0A022QK09_ERYGU (Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a004950mg PE=3 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 5.8e-101
Identity = 209/489 (42.74%), Postives = 289/489 (59.10%), Query Frame = 1

Query: 37  VKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAE 96
           VKL+L+HRHH Q   +   N+  + L +R++ +   D  R + IS  +   Q        
Sbjct: 38  VKLELIHRHHLQGERR---NVAAQPL-ERLRQLVHSDAVRLRGISLKVMLIQGGAG-PVR 97

Query: 97  AEVEAAKDPILPPTSPTPIG---------------LKMISGSDYGSSEYFVQLKVGTPPQ 156
             V    D  +P ++    G               L + SG+D+G+ +YFVQ +VG+P Q
Sbjct: 98  RRVSETDDAFIPASTNGGGGGGSNNKEQFSNVSGQLPISSGADFGTGQYFVQFRVGSPAQ 157

Query: 157 TFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCS 216
             +LI DTGSDLTW+ C+YR C G         S N+R++     F A+ SSSF+T+ CS
Sbjct: 158 KVVLIADTGSDLTWMNCKYR-CRGGGGGGCRRNS-NKRRL-----FWADRSSSFRTVPCS 217

Query: 217 STTCTTDLADLFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNS 276
           STTCT DLA+LFS+  C +P SPC YDY Y+ G++A+G+F  ET+T+ LTNG++ +LHN 
Sbjct: 218 STTCTNDLANLFSLTRCPSPISPCAYDYRYSDGSAAQGLFGNETVTLSLTNGRKTRLHNV 277

Query: 277 IIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFI 336
           +IGC+ S  G  F  ADGVIGLG S YS   KA+ N   G FSYCLVDHLS    +SY  
Sbjct: 278 LIGCSISSSGPTFQSADGVIGLGYSNYSLAVKAS-NLFRGIFSYCLVDHLSPKNISSYLT 337

Query: 337 LGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDIN 396
            G          S+   +  M +T L + D  N FY V +  IS  G ML+IP  VWD+ 
Sbjct: 338 FG----------SAKQQTDTMHYTALIL-DVINPFYAVSMNGISIGGSMLDIPAEVWDVK 397

Query: 397 SGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQ--IEVEPFKFCFNNSQYTHEMAP 456
             GG ILDSGTSLT L  PA+  VM ALT  L  FE+  ++V P ++CFN++ +   + P
Sbjct: 398 GSGGVILDSGTSLTSLVGPAYRPVMAALTASLSGFEKLGLDVGPLEYCFNSTGFVESVVP 457

Query: 457 KIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRR 509
           ++ FHFGDG  F+PPVKSYV+ A   + C+GFV   +P  +++GNI+QQN+ W+FD   +
Sbjct: 458 RLVFHFGDGARFEPPVKSYVIDAAPGVKCLGFVGGAWPGVSVVGNIMQQNYFWEFDLVNK 502

BLAST of ClCG02G018840 vs. TAIR10
Match: AT3G12700.1 (AT3G12700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 329.7 bits (844), Expect = 3.1e-90
Identity = 187/449 (41.65%), Postives = 256/449 (57.02%), Query Frame = 1

Query: 65  RIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGS- 124
           RI+D+   D KR+  IS   N                           + +G+KM  GS 
Sbjct: 66  RIEDVIGADQKRHSLISRKRN---------------------------STVGVKMDLGSG 125

Query: 125 -DYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMR 184
            DYG+++YF +++VGTP + F ++VDTGS+LTW+ CRYR        K N          
Sbjct: 126 IDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYR-----ARGKDN---------- 185

Query: 185 FRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSYAGGASAKGIFA 244
            R  F A+ S SFKT+ C + TC  DL +LFS+  C TP++PC YDY YA G++A+G+FA
Sbjct: 186 -RRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFA 245

Query: 245 IETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGG 304
            ET+TV LTNG+  +L   +IGC+ S  G+ F GADGV+GL  S +SFT   A +  G  
Sbjct: 246 KETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFT-STATSLYGAK 305

Query: 305 FSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLI 364
           FSYCLVDHLS+   ++Y I G+ S S   A     P   +  T++        FY +++I
Sbjct: 306 FSYCLVDHLSNKNVSNYLIFGS-SRSTKTAFRRTTP---LDLTRI------PPFYAINVI 365

Query: 365 AISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE 424
            IS    ML+IP +VWD  SGGGTILDSGTSLT+LA  A+  V+  L   L   ++++ E
Sbjct: 366 GISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPE 425

Query: 425 --PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAY 484
             P ++CF+  S +     P++ FH   G  F+P  KSY+V A   + C+GFVS   PA 
Sbjct: 426 GVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPAT 460

Query: 485 NIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           N+IGNI+QQN+LW+FD     + FAPS C
Sbjct: 486 NVIGNIMQQNYLWEFDLMASTLSFAPSAC 460

BLAST of ClCG02G018840 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 226.9 bits (577), Expect = 2.9e-59
Identity = 142/402 (35.32%), Postives = 205/402 (51.00%), Query Frame = 1

Query: 119 MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNE 178
           ++SG+  GS +YFV L++G PPQ+ +LI DTGSDL W+KC   R   NCS    H S   
Sbjct: 73  VVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACR---NCS----HHSP-- 132

Query: 179 RKMRFRNAFLANYSSSFKTIHCSSTTC-TTDLADLFSIGECKTPTSPCIYDYSYAGGASA 238
                   F   +SS+F   HC    C      D   I       S C Y+Y YA G+  
Sbjct: 133 -----ATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLT 192

Query: 239 KGIFAIETLTVDLTNGKEKQLHNSIIGC-----TESVQGRIFGGADGVIGLGTSPYSFTY 298
            G+FA ET ++  ++GKE +L +   GC      +SV G  F GA+GV+GLG  P SF  
Sbjct: 193 SGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFAS 252

Query: 299 KAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDP 358
           +      G  FSYCL+D+      TSY I+G      S           + FT L     
Sbjct: 253 QLGRRF-GNKFSYCLMDYTLSPPPTSYLIIGNGGDGIS----------KLFFTPLLTNPL 312

Query: 359 YNSFYGVDLIAISADGVMLNIPPRVWDI--NSGGGTILDSGTSLTMLAAPAFDMVMEALT 418
             +FY V L ++  +G  L I P +W+I  +  GGT++DSGT+L  LA PA+  V+ A+ 
Sbjct: 313 SPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVR 372

Query: 419 PKLKHFEQIEVEP-FKFCFNNSQYT--HEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYIS 478
            ++K      + P F  C N S  T   ++ P+++F F  G +F PP ++Y +   E I 
Sbjct: 373 RRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ 432

Query: 479 CIGFVSM-PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           C+   S+ P   +++IGN++QQ  L++FD  R ++GF+   C
Sbjct: 433 CLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449

BLAST of ClCG02G018840 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 200.7 bits (509), Expect = 2.2e-51
Identity = 145/482 (30.08%), Postives = 230/482 (47.72%), Query Frame = 1

Query: 44  RHHPQVAEKLHGNMKVED--LNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEA 103
           + H + + K    +K E       + D+   DL R +T+    N+ + ++  K   ++ +
Sbjct: 71  KEHTRESVKPQSRIKQETKRTTHSVVDLQIQDLTRIKTLHARFNKSKKQKNEKVRKKITS 130

Query: 104 AKDPI-LPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKC-R 163
               +  P  SP  +   + SG   GS EYF+ + VGTPP+ F LI+DTGSDL W++C  
Sbjct: 131 DISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLP 190

Query: 164 YRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECK 223
              C        + K+                S+SFK I C+   C+  ++      +C+
Sbjct: 191 CYDCFHQNGMFYDPKT----------------SASFKNITCNDPRCSL-ISSPDPPVQCE 250

Query: 224 TPTSPCIYDYSYAGGASAKGIFAIETLTVDLT----NGKEKQLHNSIIGCTESVQGRIFG 283
           +    C Y Y Y   ++  G FA+ET TV+LT       E ++ N + GC    +G +F 
Sbjct: 251 SDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRG-LFS 310

Query: 284 GADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASS 343
           GA G++GLG  P SF+    ++  G  FSYCLVD  S+   +S  I G            
Sbjct: 311 GASGLLGLGRGPLSFS-SQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGED--------KD 370

Query: 344 VVPSGNMTFTKLFVGDPYNS---FYGVDLIAISADGVMLNIPPRVWDINS--GGGTILDS 403
           ++   N+ FT  FV    NS   FY + + +I   G  L+IP   W+I+S   GGTI+DS
Sbjct: 371 LLNHTNLNFTS-FVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDS 430

Query: 404 GTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNS--QYTHEMAPKIRFHFG 463
           GT+L+  A PA++++      K+K    I  +      CFN S  +  +   P++   F 
Sbjct: 431 GTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFV 490

Query: 464 DGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPS 509
           DGT++  P ++  +   E + C+  +  P   ++IIGN  QQN    +D  R ++GF P+
Sbjct: 491 DGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPT 524

BLAST of ClCG02G018840 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 199.9 bits (507), Expect = 3.8e-51
Identity = 151/491 (30.75%), Postives = 237/491 (48.27%), Query Frame = 1

Query: 33  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQ 92
           + +TVK  L  R      EK   N    +++ DL  RI+ +H   L++    + S  +K+
Sbjct: 76  ENKTVKFHL-KRRETTTTEKATTNSVLELQIRDLT-RIQTLHKRVLEKNNQNTVSQKQKK 135

Query: 93  IEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVD 152
            ++++     V ++ +        T     + SG   GS EYF+ + VG+PP+ F LI+D
Sbjct: 136 NDKEVVTTTPVASSVEEQAGQLVAT-----LESGMTLGSGEYFMDVLVGSPPKHFSLILD 195

Query: 153 TGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFL-ANYSSSFKTIHCSSTTCTT 212
           TGSDL WI+C    C  +C  +               AF     S+S+K I C+   C  
Sbjct: 196 TGSDLNWIQC--LPCY-DCFQQ-------------NGAFYDPKASASYKNITCNDQRCNL 255

Query: 213 DLADLFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDL-TNGKEKQLH---NSII 272
            ++       CK+    C Y Y Y   ++  G FA+ET TV+L TNG   +L+   N + 
Sbjct: 256 -VSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMF 315

Query: 273 GCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILG 332
           GC    +G +F GA G++GLG  P SF+    ++  G  FSYCLVD  S    +S  I G
Sbjct: 316 GCGHWNRG-LFHGAAGLLGLGRGPLSFS-SQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 375

Query: 333 APSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDIN 392
                       ++   N+ FT    G  +  ++FY V + +I   G +LNIP   W+I+
Sbjct: 376 ED--------KDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNIS 435

Query: 393 S--GGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEM 452
           S   GGTI+DSGT+L+  A PA++ +   +  K K    +  +      CFN S   +  
Sbjct: 436 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQ 495

Query: 453 APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFF 509
            P++   F DG ++  P ++  +   E + C+  +  P  A++IIGN  QQN    +D  
Sbjct: 496 LPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTK 532

BLAST of ClCG02G018840 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 156.4 bits (394), Expect = 4.8e-38
Identity = 126/400 (31.50%), Postives = 175/400 (43.75%), Query Frame = 1

Query: 119 MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNE 178
           +ISG   GS EYF++L VGTP     +++DTGSD+ W++C        C +  N      
Sbjct: 124 VISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQC------SPCKACYNQTDA-- 183

Query: 179 RKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTS-PCIYDYSYAGGASA 238
                   F    S +F T+ C S  C      L    EC T  S  C+Y  SY  G+  
Sbjct: 184 -------IFDPKKSKTFATVPCGSRLCRR----LDDSSECVTRRSKTCLYQVSYGDGSFT 243

Query: 239 KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAEN 298
           +G F+ ETLT         ++ +  +GC    +G +F GA G++GLG    SF     +N
Sbjct: 244 EGDFSTETLTF-----HGARVDHVPLGCGHDNEG-LFVGAAGLLGLGRGGLSFP-SQTKN 303

Query: 299 ANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFY 358
              G FSYCLVD  S  +++         PS     ++ VP  ++ FT L      ++FY
Sbjct: 304 RYNGKFSYCLVDRTSSGSSSK-------PPSTIVFGNAAVPKTSV-FTPLLTNPKLDTFY 363

Query: 359 GVDLIAISADGVMLNIPPRV------WDINSGGGTILDSGTSLTMLAAPAFDMVMEAL-- 418
            + L+ IS  G  +   P V       D    GG I+DSGTS+T L  PA+  + +A   
Sbjct: 364 YLQLLGISVGGSRV---PGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRL 423

Query: 419 -TPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISC 478
              KLK      +  F  CF+ S  T    P + FHFG G +  P     +    E   C
Sbjct: 424 GATKLKRAPSYSL--FDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFC 483

Query: 479 IGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
             F      + +IIGNI QQ     +D    +VGF    C
Sbjct: 484 FAFAG-TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of ClCG02G018840 vs. NCBI nr
Match: gi|778713001|ref|XP_004140022.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 817.8 bits (2111), Expect = 1.1e-233
Identity = 393/537 (73.18%), Postives = 455/537 (84.73%), Query Frame = 1

Query: 1   MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETV 60
           MLGYRKPMSPIS+FCFFF    LFFFLS  +    ALG  D               QE +
Sbjct: 1   MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEII 60

Query: 61  KLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAE 120
           K DLLHRHHPQVAEK+HG+MK++D+++R+KDIH+HD  R+++IS S+N+KQ+E+ +L+AE
Sbjct: 61  KFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAE 120

Query: 121 AEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSD 180
           AE     E AK  ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLI DTGSD
Sbjct: 121 AEAATEEEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSD 180

Query: 181 LTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADL 240
           LTW+KCRYRRC GNCSS  NHKS+NE+K RFR+AFLAN+SSSFKT+ CSST CT DLADL
Sbjct: 181 LTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADL 240

Query: 241 FSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGR 300
           F++ EC  PTSPC+YDYSY GGASAKGIFA ETLTV LTNGKEKQLHNSIIGCTESVQG 
Sbjct: 241 FAVRECHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS 300

Query: 301 IFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAA 360
           +FGGADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+
Sbjct: 301 VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSAS 360

Query: 361 ASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGT 420
            SS      MT+TKL+VGDPY+SFYGVDLI ISA+G+MLNIP RVWDINSGGGTI+DSGT
Sbjct: 361 TSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGT 420

Query: 421 SLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQ 480
           SLT+LAAPAFDMVMEALTP+LK F+Q+E+EPF FCFNNSQYTHEMAPK+RFHFGDGT+F+
Sbjct: 421 SLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFE 480

Query: 481 PPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV 510
           PP KSY+VS G++ISCIGFVSMPFPA NIIGNILQQNHLWQFDF +R+VGFAPSEC+
Sbjct: 481 PPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI 537

BLAST of ClCG02G018840 vs. NCBI nr
Match: gi|659112547|ref|XP_008456273.1| (PREDICTED: aspartic proteinase CDR1 [Cucumis melo])

HSP 1 Score: 790.4 bits (2040), Expect = 1.8e-225
Identity = 380/530 (71.70%), Postives = 444/530 (83.77%), Query Frame = 1

Query: 1   MLGYRKPMSPISHFCFFFLF-FFLSVHN----ALGGH-----------DQETVKLDLLHR 60
           MLGYRKPMSPIS+FCFFFL  FFLS  +    ALG             +Q+T++ DLLHR
Sbjct: 1   MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHR 60

Query: 61  HHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----V 120
           HHPQV+EKL+G+MK++DL++R+KDIH+HD  R+++IS S+N+KQIE+ +L+AEAE    V
Sbjct: 61  HHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQV 120

Query: 121 EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCR 180
           E AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGSDLTW+KCR
Sbjct: 121 EVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR 180

Query: 181 YRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECK 240
           YRRC GNCS   NHKS+NE+K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC 
Sbjct: 181 YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECD 240

Query: 241 TPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADG 300
           TPTSPC+YDYSYAGGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F GADG
Sbjct: 241 TPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG 300

Query: 301 VIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPS 360
           V+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  P 
Sbjct: 301 VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPP 360

Query: 361 GNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAA 420
             M++TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA 
Sbjct: 361 AKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLAT 420

Query: 421 PAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYV 480
           PAFD+VME LT +LK F+QIE+EPF FCFNNSQYTH+MAPK+RFHFGDGT+F+PP KSY+
Sbjct: 421 PAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYI 480

Query: 481 VSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV 510
           VS GE+ISCIG VSMPFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Sbjct: 481 VSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI 530

BLAST of ClCG02G018840 vs. NCBI nr
Match: gi|731434480|ref|XP_002265771.3| (PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera])

HSP 1 Score: 412.9 bits (1060), Expect = 8.0e-112
Identity = 217/479 (45.30%), Postives = 300/479 (62.63%), Query Frame = 1

Query: 32  HDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE 91
           ++ +T++L+L+HRH PQV  +    ++      R+K++   D  R   I   L   QI  
Sbjct: 36  YNSDTMRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQIPR 95

Query: 92  KLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS 151
           + KA+  + ++        S   I + M   +DYG  +Y V  KVGTP Q FML+ DTGS
Sbjct: 96  R-KAKEVLSSSSGR----GSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGS 155

Query: 152 DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLAD 211
           DLTW+ C+Y     NCS+      R  R++R +  F AN SSSFKTI C +  C  +L D
Sbjct: 156 DLTWMSCKYHCRSRNCSN------RKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMD 215

Query: 212 LFSIGECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQG 271
           LFS+  C TP +PC YDY Y+ G++A G FA ET+TV+L  G++ +LHN +IGC+ES QG
Sbjct: 216 LFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQG 275

Query: 272 RIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASA 331
           + F  ADGV+GLG S YSF  KAAE   GG FSYCLVDHLSH   ++Y   G       +
Sbjct: 276 QSFQAADGVMGLGYSKYSFAIKAAEKF-GGKFSYCLVDHLSHKNVSNYLTFG-------S 335

Query: 332 AASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG 391
           + S      NMT+T+L +G   NSFY V+++ IS  G ML IP  VWD+   GGTILDSG
Sbjct: 336 SRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSG 395

Query: 392 TSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFNNSQYTHEMAPKIRFHFGDGT 451
           +SLT L  PA+  VM AL   L  F ++E++  P ++CFN++ +   + P++ FHF DG 
Sbjct: 396 SSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGA 455

Query: 452 MFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
            F+PPVKSYV+SA + + C+GFVS+ +P  +++GNI+QQNHLW+FD   +K+GFAPS C
Sbjct: 456 EFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 488

BLAST of ClCG02G018840 vs. NCBI nr
Match: gi|147814824|emb|CAN65806.1| (hypothetical protein VITISV_015630 [Vitis vinifera])

HSP 1 Score: 412.1 bits (1058), Expect = 1.4e-111
Identity = 217/474 (45.78%), Postives = 297/474 (62.66%), Query Frame = 1

Query: 37  VKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAE 96
           ++L+L+HRH PQV  +    ++      R+K++   D  R   I   L   QI  + KA+
Sbjct: 1   MRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQIPRR-KAK 60

Query: 97  AEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWI 156
             + ++        S   I + M   +DYG  +YFV  KVGTP Q FML+ DTGSDLTW+
Sbjct: 61  EVLSSSSGR----GSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWM 120

Query: 157 KCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIG 216
            C+Y     NCS+      R  R++R +  F AN SSSFKTI C +  C  +L DLFS+ 
Sbjct: 121 SCKYHCRSRNCSN------RKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLT 180

Query: 217 ECKTPTSPCIYDYSYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGG 276
            C TP +PC YDY Y+ G++A G FA ET+TV+L  G++ +LHN +IGC+ES QG+ F  
Sbjct: 181 NCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQA 240

Query: 277 ADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSV 336
           ADGV+GLG S YSF  KAAE   GG FSYCLVDHLSH   ++Y   G       ++ S  
Sbjct: 241 ADGVMGLGYSKYSFAIKAAEKF-GGKFSYCLVDHLSHKNVSNYLTFG-------SSRSKE 300

Query: 337 VPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTM 396
               NMT+T+L +G   NSFY V+++ IS  G ML IP  VWD+   GGTILDSG+SLT 
Sbjct: 301 ALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTF 360

Query: 397 LAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPP 456
           L  PA+  VM AL   L  F ++E++  P ++CFN++ +   + P++ FHF DG  F+PP
Sbjct: 361 LTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPP 420

Query: 457 VKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           VKSYV+SA + + C+GFVS+ +P  +++GNI+QQNHLW+FD   +K+GFAPS C
Sbjct: 421 VKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448

BLAST of ClCG02G018840 vs. NCBI nr
Match: gi|297736090|emb|CBI24128.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 395.6 bits (1015), Expect = 1.3e-106
Identity = 196/392 (50.00%), Postives = 260/392 (66.33%), Query Frame = 1

Query: 119 MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNE 178
           M   +DYG  +Y V  KVGTP Q FML+ DTGSDLTW+ C+Y     NCS+      R  
Sbjct: 1   MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSN------RKA 60

Query: 179 RKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYSYAGGASAK 238
           R++R +  F AN SSSFKTI C +  C  +L DLFS+  C TP +PC YDY Y+ G++A 
Sbjct: 61  RRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL 120

Query: 239 GIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENA 298
           G FA ET+TV+L  G++ +LHN +IGC+ES QG+ F  ADGV+GLG S YSF  KAAE  
Sbjct: 121 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 180

Query: 299 NGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYG 358
            GG FSYCLVDHLSH   ++Y   G       ++ S      NMT+T+L +G   NSFY 
Sbjct: 181 -GGKFSYCLVDHLSHKNVSNYLTFG-------SSRSKEALLNNMTYTELVLG-MVNSFYA 240

Query: 359 VDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQ 418
           V+++ IS  G ML IP  VWD+   GGTILDSG+SLT L  PA+  VM AL   L  F +
Sbjct: 241 VNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRK 300

Query: 419 IEVE--PFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPF 478
           +E++  P ++CFN++ +   + P++ FHF DG  F+PPVKSYV+SA + + C+GFVS+ +
Sbjct: 301 VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAW 360

Query: 479 PAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC 509
           P  +++GNI+QQNHLW+FD   +K+GFAPS C
Sbjct: 361 PGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 377

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ASPG1_ARATH3.6e-3528.28Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
APF2_ARATH4.6e-3530.39Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
ASPG2_ARATH1.5e-3327.82Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
CDR1_ARATH9.7e-3328.46Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1[more]
AED3_ARATH1.1e-3126.98Aspartyl protease AED3 OS=Arabidopsis thaliana GN=AED3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KG92_CUCSA7.5e-23473.18Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134390 PE=3 SV=1[more]
F6H9S0_VITVI5.6e-11245.30Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0085g01110 PE=3 SV=... [more]
A5BLS9_VITVI9.5e-11245.78Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_015630 PE=3 SV=1[more]
A0A0B0NTS3_GOSAR3.4e-10143.01Asparticase nepenthesin-1 OS=Gossypium arboreum GN=F383_00615 PE=3 SV=1[more]
A0A022QK09_ERYGU5.8e-10142.74Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a004950mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G12700.13.1e-9041.65 Eukaryotic aspartyl protease family protein[more]
AT3G25700.12.9e-5935.32 Eukaryotic aspartyl protease family protein[more]
AT2G42980.12.2e-5130.08 Eukaryotic aspartyl protease family protein[more]
AT3G59080.13.8e-5130.75 Eukaryotic aspartyl protease family protein[more]
AT3G61820.14.8e-3831.50 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|778713001|ref|XP_004140022.2|1.1e-23373.18PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
gi|659112547|ref|XP_008456273.1|1.8e-22571.70PREDICTED: aspartic proteinase CDR1 [Cucumis melo][more]
gi|731434480|ref|XP_002265771.3|8.0e-11245.30PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera][more]
gi|147814824|emb|CAN65806.1|1.4e-11145.78hypothetical protein VITISV_015630 [Vitis vinifera][more]
gi|297736090|emb|CBI24128.3|1.3e-10650.00unnamed protein product [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0044238 primary metabolic process
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG02G018840.1ClCG02G018840.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 136..156
score: 3.6E-8coord: 480..495
score: 3.6E-8coord: 386..397
score: 3.
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 340..508
score: 1.2E-146coord: 114..324
score: 1.2E-146coord: 33..96
score: 1.2E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 145..156
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 339..508
score: 1.4E-35coord: 192..323
score: 4.5E-35coord: 124..160
score: 4.5
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 122..508
score: 4.68
NoneNo IPR availablePANTHERPTHR13683:SF280ASPARTYL PROTEASE FAMILY PROTEINcoord: 33..96
score: 1.2E-146coord: 114..324
score: 1.2E-146coord: 340..508
score: 1.2E