Cla002325 (gene) Watermelon (97103) v1

Name: Cla002325
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Aspartyl protease family protein (AHRD V1 **-- Q9LI73_ARATH); contains InterPro domain(s) IPR001461 Peptidase A1
Location: Chr1 : 4730607 .. 4731182 (+)
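The location string can be turned into a feature length and a 0-based, half-open interval (the convention used by BED files). The sketch below assumes the coordinates shown on this page are 1-based and inclusive, which is usual for this kind of record but is not stated here; the names `Location`, `parse_location`, `span_length` and `to_bed` are illustrative only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr1 : 4730607 .. 4731182 (+)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return Location(chrom, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Number of bases covered by an inclusive interval."""
    return loc.end - loc.start + 1


def to_bed(loc: Location) -> tuple:
    """Convert to a 0-based, half-open (BED-style) interval."""
    return (loc.chrom, loc.start - 1, loc.end, loc.strand)


loc = parse_location("Chr1 : 4730607 .. 4731182 (+)")
print(span_length(loc))  # 576
print(to_bed(loc))       # ('Chr1', 4730606, 4731182, '+')
```

Under that assumption the gene spans 576 bp, which is consistent with 191 codons plus a stop codon and no intron.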
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATCGGCGGCGGCCTCCGCAGCCTTCCCGTGAACAATGCCACCAGAATCAGCTACACCCCCTTGCAGATTAACCCTCTCTCCCCCACATTCTACTACATTGGCATCCACAGCATCACCGTCGACGGCGTGAAATTACCCATCAACCCCGCCGTATGGGCAATCGACGAGCAGGGAAACGGCGGTACGGTGGTCGATTCAGGGACGACATTGACCTACCTGGCGAAGGCGGCGTACGAGGAGGTGCTGAAGGCGGTAAGACAGCGAGTGAAACTACCAAATGCAGCTGAGTTGACTCCGGGGTTCGATCTATGCGTGAACGCGTCAGGGGAGTCACGGCGGCCGAGTCTGCCGCGACTGAGATTGGAACTGAAGGGTGGGGCGGTGTTTGCGCCGCCGCCGAGGAACTATTTTCTGGAAACGGAGGAGAGAGTGATGTGCTTGGCGATCCGAGCGGTGGAATCCGGTAATGGGTTTTCGGTGATTGGGAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAGGGAGGCGTCGAGGCTGGGTTTTTCAAGGCGGGGTTGTGGCCTTCCATGA

mRNA sequence

ATGATCGGCGGCGGCCTCCGCAGCCTTCCCGTGAACAATGCCACCAGAATCAGCTACACCCCCTTGCAGATTAACCCTCTCTCCCCCACATTCTACTACATTGGCATCCACAGCATCACCGTCGACGGCGTGAAATTACCCATCAACCCCGCCGTATGGGCAATCGACGAGCAGGGAAACGGCGGTACGGTGGTCGATTCAGGGACGACATTGACCTACCTGGCGAAGGCGGCGTACGAGGAGGTGCTGAAGGCGGTAAGACAGCGAGTGAAACTACCAAATGCAGCTGAGTTGACTCCGGGGTTCGATCTATGCGTGAACGCGTCAGGGGAGTCACGGCGGCCGAGTCTGCCGCGACTGAGATTGGAACTGAAGGGTGGGGCGGTGTTTGCGCCGCCGCCGAGGAACTATTTTCTGGAAACGGAGGAGAGAGTGATGTGCTTGGCGATCCGAGCGGTGGAATCCGGTAATGGGTTTTCGGTGATTGGGAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAGGGAGGCGTCGAGGCTGGGTTTTTCAAGGCGGGGTTGTGGCCTTCCATGA

Coding sequence (CDS)

ATGATCGGCGGCGGCCTCCGCAGCCTTCCCGTGAACAATGCCACCAGAATCAGCTACACCCCCTTGCAGATTAACCCTCTCTCCCCCACATTCTACTACATTGGCATCCACAGCATCACCGTCGACGGCGTGAAATTACCCATCAACCCCGCCGTATGGGCAATCGACGAGCAGGGAAACGGCGGTACGGTGGTCGATTCAGGGACGACATTGACCTACCTGGCGAAGGCGGCGTACGAGGAGGTGCTGAAGGCGGTAAGACAGCGAGTGAAACTACCAAATGCAGCTGAGTTGACTCCGGGGTTCGATCTATGCGTGAACGCGTCAGGGGAGTCACGGCGGCCGAGTCTGCCGCGACTGAGATTGGAACTGAAGGGTGGGGCGGTGTTTGCGCCGCCGCCGAGGAACTATTTTCTGGAAACGGAGGAGAGAGTGATGTGCTTGGCGATCCGAGCGGTGGAATCCGGTAATGGGTTTTCGGTGATTGGGAATCTGATGCAGCAAGGATTCTTGTTGGAGTTCGATAGGGAGGCGTCGAGGCTGGGTTTTTCAAGGCGGGGTTGTGGCCTTCCATGA

Protein sequence

MIGGGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGCGLP
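
The protein sequence above can be cross-checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch in plain Python, not the pipeline used to build this record; `translate` and `CODON_TABLE` are illustrative names, and only the first and last few codons of the CDS are used so the example stays short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with the bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGATCGGCGGCGGCCTCCGCAGCCTTCCC"))  # MIGGGLRSLP
print(translate("GGCCTTCCATGA"))                    # GLP*
```

Passing the full CDS string to `translate` and dropping the trailing `*` should give the protein sequence shown in this record.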
BLAST of Cla002325 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 7.6e-23
Identity = 56/162 (34.57%), Positives = 86/162 (53.09%), Query Frame = 1

Query: 29  PTFYYIGIHSITVDGVKLPINPAVWAIDEQ-GNGGTVVDSGTTLTYLAKAAYEEVLKAVR 88
           PTFYYI ++ ++V   +LPI+P+ +A++   G GG ++DSGTTLTY    AY+ V +   
Sbjct: 277 PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFI 336

Query: 89  QRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNYFLETEERVMC 148
            ++ LP     + GFDLC     +     +P   +   GG +   P  NYF+     ++C
Sbjct: 337 SQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDL-ELPSENYFISPSNGLIC 396

Query: 149 LAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGCG 190
           LA+ +  S  G S+ GN+ QQ  L+ +D   S + F+   CG
Sbjct: 397 LAMGS--SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435
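
The percentages in an HSP header follow directly from the residue counts: identical (or positive-scoring) positions divided by the alignment length, which includes gap columns. Below is a small sketch of extracting and recomputing them from a header line; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts both "Positives" and the misspelling "Postives".

```python
import re

# Matches e.g. "Identity = 56/162 (34.57%), Positives = 86/162 (53.09%)".
# "Posi?tives" also accepts the misspelt label "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the report."""
    return round(100.0 * count / total, 2)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP header."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, m.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": percent(identical, ident_len),
        "positives_pct": percent(positive, pos_len),
    }


header = "Identity = 56/162 (34.57%), Positives = 86/162 (53.09%), Query Frame = 1"
print(parse_hsp_stats(header))
```

For the NEP1_NEPGR hit this reproduces the printed values: 56/162 gives 34.57% and 86/162 gives 53.09%.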

BLAST of Cla002325 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 3.5e-20
Identity = 60/172 (34.88%), Positives = 91/172 (52.91%), Query Frame = 1

Query: 19  YTPLQINPLSPTFYYIGIHSITVDGVKLP-INPAVWAIDEQGNGGTVVDSGTTLTYLAKA 78
           +TPL  NP   TFYY+G+  I+V G ++P +  +++ +D+ GNGG ++DSGT++T L + 
Sbjct: 317 FTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRP 376

Query: 79  AYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNY 138
           AY  +  A R   K    A     FD C + S       +P + L  +G  V + P  NY
Sbjct: 377 AYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLS-NMNEVKVPTVVLHFRGADV-SLPATNY 436

Query: 139 FLETEER-VMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGC 189
            +  +     C A     +  G S+IGN+ QQGF + +D  +SR+GF+  GC
Sbjct: 437 LIPVDTNGKFCFAFAG--TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Cla002325 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 6.0e-20
Identity = 56/170 (32.94%), Positives = 85/170 (50.00%), Query Frame = 1

Query: 20  TPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLAKAAY 79
           T L  + L+PT+YYI +  ITV G  L I  + + + + G GG ++DSGTTLTYL + AY
Sbjct: 269 TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAY 328

Query: 80  EEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNYFL 139
             V +A   ++ LP   E + G   C     +     +P + ++  GG V     +N  +
Sbjct: 329 NAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILI 388

Query: 140 ETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGCG 190
              E V+CLA+ +  S  G S+ GN+ QQ   + +D +   + F    CG
Sbjct: 389 SPAEGVICLAMGS-SSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 436

BLAST of Cla002325 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 1.0e-19
Identity = 63/186 (33.87%), Positives = 93/186 (50.00%), Query Frame = 1

Query: 5   GLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTV 64
           G  +LPV      S+ PL  NP +P+FYY+G+  + V GV++P+   V+ + E G+GG V
Sbjct: 293 GREALPVG----ASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVV 352

Query: 65  VDSGTTLTYLAKAAYEEVLKAVR-QRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLE 124
           +D+GT +T L  AAY       + Q   LP A+ ++  FD C + SG      +P +   
Sbjct: 353 MDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FDTCYDLSG-FVSVRVPTVSFY 412

Query: 125 LKGGAVFAPPPRNYFLETEER-VMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLG 184
              G V   P RN+ +  ++    C A  A  S  G S+IGN+ Q+G  + FD     +G
Sbjct: 413 FTEGPVLTLPARNFLMPVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGANGFVG 470

Query: 185 FSRRGC 189
           F    C
Sbjct: 473 FGPNVC 470

BLAST of Cla002325 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 5.8e-15
Identity = 51/170 (30.00%), Positives = 79/170 (46.47%), Query Frame = 1

Query: 21  PLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLAKAAYE 80
           PL  N    TFYY+G+   +V G K+ +  A++ +D  G+GG ++D GT +T L   AY 
Sbjct: 334 PLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 393

Query: 81  EVLKA-VRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNYFL 140
            +  A ++  V L   +     FD C + S  S    +P +     GG     P +NY +
Sbjct: 394 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLS-TVKVPTVAFHFTGGKSLDLPAKNYLI 453

Query: 141 ETEER-VMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGC 189
             ++    C A     S    S+IGN+ QQG  + +D   + +G S   C
Sbjct: 454 PVDDSGTFCFAFAPTSS--SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Cla002325 vs. TrEMBL
Match: A0A0A0KNH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 1.5e-94
Identity = 173/191 (90.58%), Positives = 180/191 (94.24%), Query Frame = 1

Query: 1   MIGGGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGN 60
           MIGGGL SLP+ NAT+ISYTPLQINPLSPTFYYI IHSIT+DGVKLPINPAVW IDEQGN
Sbjct: 269 MIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGN 328

Query: 61  GGTVVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRL 120
           GGTVVDSGTTLTYL K AYEEVLK+VR+RVKLPNAAELTPGFDLCVNASGESRRPSLPRL
Sbjct: 329 GGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRL 388

Query: 121 RLELKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASR 180
           R  L GGAVFAPPPRNYFLETEE VMCLAIRAVESGNGFSVIGNLMQQGFLLEFD+E SR
Sbjct: 389 RFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESR 448

Query: 181 LGFSRRGCGLP 192
           LGF+RRGCGLP
Sbjct: 449 LGFTRRGCGLP 459

BLAST of Cla002325 vs. TrEMBL
Match: A0A067K2U7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 1.2e-67
Identity = 124/188 (65.96%), Positives = 154/188 (81.91%), Query Frame = 1

Query: 4   GGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGT 63
           GG ++  V+    +++TPL +N LSPTFYYIGI S++VDGVKLPINP+VW+ID+ GNGGT
Sbjct: 269 GGHQNSAVSRKRILNFTPLLVNSLSPTFYYIGIKSVSVDGVKLPINPSVWSIDDLGNGGT 328

Query: 64  VVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLE 123
           ++DSGTTLT+L + AY E+L A+++RVKLP   ELTPGFDLCVN SG  RRP  PR+ LE
Sbjct: 329 IIDSGTTLTFLVEPAYREILSAIKRRVKLPGPGELTPGFDLCVNVSG-VRRPVFPRMSLE 388

Query: 124 LKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGF 183
           L G +VF+PPPRNYF++T E V CLAI+ V SG+GFSVIGNLMQQG+LLEFDR+ SRLGF
Sbjct: 389 LAGNSVFSPPPRNYFIDTSEGVKCLAIQPVNSGSGFSVIGNLMQQGYLLEFDRDRSRLGF 448

Query: 184 SRRGCGLP 192
           +R GC LP
Sbjct: 449 ARSGCALP 455

BLAST of Cla002325 vs. TrEMBL
Match: B9SEI2_RICCO (Basic 7S globulin 2 small subunit, putative OS=Ricinus communis GN=RCOM_0705030 PE=3 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 8.6e-66
Identity = 119/188 (63.30%), Positives = 154/188 (81.91%), Query Frame = 1

Query: 4   GGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGT 63
           GG +++ V+    +S+TPL INPLSPTFYYI I  + V+GVKLPINP+VW+ID+ GNGGT
Sbjct: 269 GGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGT 328

Query: 64  VVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLE 123
           ++DSGTTLT++ + AY E+LKA ++RVKLP+ AE TPGFDLC+N SG + RP+LPR+   
Sbjct: 329 IIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVT-RPALPRMSFN 388

Query: 124 LKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGF 183
           L GG+VF+PPPRNYF+ET +++ CLA++ V    GFSV+GNLMQQGFLLEFDR+ SRLGF
Sbjct: 389 LAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGF 448

Query: 184 SRRGCGLP 192
           +RRGC LP
Sbjct: 449 TRRGCALP 455

BLAST of Cla002325 vs. TrEMBL
Match: V4KZL9_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10004188mg PE=3 SV=1)

HSP 1 Score: 254.6 bits (649), Expect = 9.5e-65
Identity = 125/190 (65.79%), Positives = 153/190 (80.53%), Query Frame = 1

Query: 3   GGGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGG 62
           GGG+RS  V+   ++S+TPL  NPLSPTFYY+ + SI V+G KL I+P+VW ID+ GNGG
Sbjct: 269 GGGVRSDAVS---KLSFTPLLTNPLSPTFYYVRLKSIFVNGAKLRIDPSVWEIDDSGNGG 328

Query: 63  TVVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESR-RPSLPRLR 122
           TVVDSGTTL +LA+ AY  V+ AVR+R++LP AAE+TPGFDLCVN SG S+    +PRL+
Sbjct: 329 TVVDSGTTLAFLAEPAYRSVIAAVRRRIRLPIAAEVTPGFDLCVNISGVSKPEKIMPRLK 388

Query: 123 LELKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRL 182
            EL GGA+F PPPRNYF+ETEE++ CLAI++V    GFSVIGNLMQQGFL EFDR+ SRL
Sbjct: 389 FELAGGALFVPPPRNYFIETEEQIQCLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRL 448

Query: 183 GFSRRGCGLP 192
           GFSRRGC LP
Sbjct: 449 GFSRRGCALP 455

BLAST of Cla002325 vs. TrEMBL
Match: F6HF17_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 6.2e-64
Identity = 117/176 (66.48%), Positives = 148/176 (84.09%), Query Frame = 1

Query: 16  RISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLA 75
           R+ +TPL INPLSPTFYYIGI S++VDG+KLPINP+VWA+DE GNGGT+VDSGTTLT+L 
Sbjct: 284 RMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLP 343

Query: 76  KAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPR 135
           + AY ++L  +++RV+LP+ AE TPGFDLCVN S E   P LP+L  +L G +VF+PPPR
Sbjct: 344 EPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVS-EIEHPRLPKLSFKLGGDSVFSPPPR 403

Query: 136 NYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGCGLP 192
           NYF++T+E V CLA++AV + +GFSVIGNLMQQGFLLEFD++ +RLGFSR GC LP
Sbjct: 404 NYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458

BLAST of Cla002325 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 245.4 bits (625), Expect = 2.9e-65
Identity = 117/178 (65.73%), Positives = 141/178 (79.21%), Query Frame = 1

Query: 15  TRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYL 74
           +++ +TPL  NPLSPTFYY+ + S+ V+G KL I+P++W ID+ GNGGTVVDSGTTL +L
Sbjct: 275 SKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFL 334

Query: 75  AKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESR-RPSLPRLRLELKGGAVFAPP 134
           A+ AY  V+ AVR+RVKLP A  LTPGFDLCVN SG ++    LPRL+ E  GGAVF PP
Sbjct: 335 AEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPP 394

Query: 135 PRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGCGLP 192
           PRNYF+ETEE++ CLAI++V+   GFSVIGNLMQQGFL EFDR+ SRLGFSRRGC LP
Sbjct: 395 PRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of Cla002325 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 107.8 bits (268), Expect = 7.3e-24
Identity = 66/162 (40.74%), Positives = 84/162 (51.85%), Query Frame = 1

Query: 30  TFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLAKAAYEEVLKAVRQR 89
           TFYYI I SI V G  L I    W I   G+GGT++DSGTTL+Y A+ AYE +     ++
Sbjct: 365 TFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEK 424

Query: 90  VK--LPNAAELTPGFDLCVNASG-ESRRPSLPRLRLELKGGAVFAPPPRNYFLETEERVM 149
           +K   P   +  P  D C N SG E     LP L +    G V+  P  N F+   E ++
Sbjct: 425 MKENYPIFRDF-PVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLV 484

Query: 150 CLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGC 189
           CLAI        FS+IGN  QQ F + +D + SRLGF+   C
Sbjct: 485 CLAILGTPKST-FSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of Cla002325 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 104.4 bits (259), Expect = 8.1e-23
Identity = 64/164 (39.02%), Positives = 85/164 (51.83%), Query Frame = 1

Query: 27  LSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLAKAAYEEVLKAV 86
           L  TFYY+ I SI V G  L I    W I   G GGT++DSGTTL+Y A+ AYE +   +
Sbjct: 372 LVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKI 431

Query: 87  RQRV--KLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNYFLETEER 146
            ++   K P   +  P  D C N SG      LP L +    GAV+  P  N F+   E 
Sbjct: 432 AEKAKGKYPVYRDF-PILDPCFNVSG-IHNVQLPELGIAFADGAVWNFPTENSFIWLNED 491

Query: 147 VMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGC 189
           ++CLA+      + FS+IGN  QQ F + +D + SRLG++   C
Sbjct: 492 LVCLAMLGTPK-SAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532

BLAST of Cla002325 vs. TAIR10
Match: AT2G03200.1 (AT2G03200.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 100.5 bits (249), Expect = 1.2e-21
Identity = 52/165 (31.52%), Positives = 83/165 (50.30%), Query Frame = 1

Query: 25  NPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGTVVDSGTTLTYLAKAAYEEVLK 84
           NP  P+FYY+ +  ITV   +L +  + + + E G GG ++DSGTT+TYL + A++ + +
Sbjct: 298 NPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKE 357

Query: 85  AVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNYFLETEER 144
               R+ LP     + G DLC      ++  ++P++    KG  +  P       ++   
Sbjct: 358 EFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTG 417

Query: 145 VMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGCG 190
           V+CL   A+ S NG S+ GN+ QQ F +  D E   + F    CG
Sbjct: 418 VLCL---AMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECG 459

BLAST of Cla002325 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 99.8 bits (247), Expect = 2.0e-21
Identity = 60/172 (34.88%), Positives = 91/172 (52.91%), Query Frame = 1

Query: 19  YTPLQINPLSPTFYYIGIHSITVDGVKLP-INPAVWAIDEQGNGGTVVDSGTTLTYLAKA 78
           +TPL  NP   TFYY+G+  I+V G ++P +  +++ +D+ GNGG ++DSGT++T L + 
Sbjct: 317 FTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRP 376

Query: 79  AYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLELKGGAVFAPPPRNY 138
           AY  +  A R   K    A     FD C + S       +P + L  +G  V + P  NY
Sbjct: 377 AYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLS-NMNEVKVPTVVLHFRGADV-SLPATNY 436

Query: 139 FLETEER-VMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGFSRRGC 189
            +  +     C A     +  G S+IGN+ QQGF + +D  +SR+GF+  GC
Sbjct: 437 LIPVDTNGKFCFAFAG--TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Cla002325 vs. NCBI nr
Match: gi|659073000|ref|XP_008467208.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 354.4 bits (908), Expect = 1.3e-94
Identity = 174/191 (91.10%), Positives = 181/191 (94.76%), Query Frame = 1

Query: 1   MIGGGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGN 60
           MIGGGL SLPVNNAT+ISYTPLQINPLSPTFYYI I+SIT+DGVKLPINPAVW IDEQGN
Sbjct: 269 MIGGGLHSLPVNNATKISYTPLQINPLSPTFYYITINSITIDGVKLPINPAVWEIDEQGN 328

Query: 61  GGTVVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRL 120
           GGTVVDSGTTLTYL K AYEEVLK+VR+RVKLPNAAELTPGFDLCVNASGESRRPSLPRL
Sbjct: 329 GGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRL 388

Query: 121 RLELKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASR 180
           R  L GGAVFAPPPRNYFLETEE VMCLAIRAVESGNGFSVIGNLMQQGFLLEFD+E SR
Sbjct: 389 RFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESR 448

Query: 181 LGFSRRGCGLP 192
           LGF+RRGCGLP
Sbjct: 449 LGFTRRGCGLP 459

BLAST of Cla002325 vs. NCBI nr
Match: gi|449451908|ref|XP_004143702.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus])

HSP 1 Score: 353.6 bits (906), Expect = 2.2e-94
Identity = 173/191 (90.58%), Positives = 180/191 (94.24%), Query Frame = 1

Query: 1   MIGGGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGN 60
           MIGGGL SLP+ NAT+ISYTPLQINPLSPTFYYI IHSIT+DGVKLPINPAVW IDEQGN
Sbjct: 269 MIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGN 328

Query: 61  GGTVVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRL 120
           GGTVVDSGTTLTYL K AYEEVLK+VR+RVKLPNAAELTPGFDLCVNASGESRRPSLPRL
Sbjct: 329 GGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRL 388

Query: 121 RLELKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASR 180
           R  L GGAVFAPPPRNYFLETEE VMCLAIRAVESGNGFSVIGNLMQQGFLLEFD+E SR
Sbjct: 389 RFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESR 448

Query: 181 LGFSRRGCGLP 192
           LGF+RRGCGLP
Sbjct: 449 LGFTRRGCGLP 459

BLAST of Cla002325 vs. NCBI nr
Match: gi|802680767|ref|XP_012082020.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas])

HSP 1 Score: 264.2 bits (674), Expect = 1.7e-67
Identity = 124/188 (65.96%), Positives = 154/188 (81.91%), Query Frame = 1

Query: 4   GGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGT 63
           GG ++  V+    +++TPL +N LSPTFYYIGI S++VDGVKLPINP+VW+ID+ GNGGT
Sbjct: 269 GGHQNSAVSRKRILNFTPLLVNSLSPTFYYIGIKSVSVDGVKLPINPSVWSIDDLGNGGT 328

Query: 64  VVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLE 123
           ++DSGTTLT+L + AY E+L A+++RVKLP   ELTPGFDLCVN SG  RRP  PR+ LE
Sbjct: 329 IIDSGTTLTFLVEPAYREILSAIKRRVKLPGPGELTPGFDLCVNVSG-VRRPVFPRMSLE 388

Query: 124 LKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGF 183
           L G +VF+PPPRNYF++T E V CLAI+ V SG+GFSVIGNLMQQG+LLEFDR+ SRLGF
Sbjct: 389 LAGNSVFSPPPRNYFIDTSEGVKCLAIQPVNSGSGFSVIGNLMQQGYLLEFDRDRSRLGF 448

Query: 184 SRRGCGLP 192
           +R GC LP
Sbjct: 449 ARSGCALP 455

BLAST of Cla002325 vs. NCBI nr
Match: gi|255566835|ref|XP_002524401.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis])

HSP 1 Score: 258.1 bits (658), Expect = 1.2e-65
Identity = 119/188 (63.30%), Positives = 154/188 (81.91%), Query Frame = 1

Query: 4   GGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGGT 63
           GG +++ V+    +S+TPL INPLSPTFYYI I  + V+GVKLPINP+VW+ID+ GNGGT
Sbjct: 269 GGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGT 328

Query: 64  VVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRLE 123
           ++DSGTTLT++ + AY E+LKA ++RVKLP+ AE TPGFDLC+N SG + RP+LPR+   
Sbjct: 329 IIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVT-RPALPRMSFN 388

Query: 124 LKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRLGF 183
           L GG+VF+PPPRNYF+ET +++ CLA++ V    GFSV+GNLMQQGFLLEFDR+ SRLGF
Sbjct: 389 LAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGF 448

Query: 184 SRRGCGLP 192
           +RRGC LP
Sbjct: 449 TRRGCALP 455

BLAST of Cla002325 vs. NCBI nr
Match: gi|567142517|ref|XP_006395632.1| (hypothetical protein EUTSA_v10004188mg [Eutrema salsugineum])

HSP 1 Score: 254.6 bits (649), Expect = 1.4e-64
Identity = 125/190 (65.79%), Positives = 153/190 (80.53%), Query Frame = 1

Query: 3   GGGLRSLPVNNATRISYTPLQINPLSPTFYYIGIHSITVDGVKLPINPAVWAIDEQGNGG 62
           GGG+RS  V+   ++S+TPL  NPLSPTFYY+ + SI V+G KL I+P+VW ID+ GNGG
Sbjct: 269 GGGVRSDAVS---KLSFTPLLTNPLSPTFYYVRLKSIFVNGAKLRIDPSVWEIDDSGNGG 328

Query: 63  TVVDSGTTLTYLAKAAYEEVLKAVRQRVKLPNAAELTPGFDLCVNASGESR-RPSLPRLR 122
           TVVDSGTTL +LA+ AY  V+ AVR+R++LP AAE+TPGFDLCVN SG S+    +PRL+
Sbjct: 329 TVVDSGTTLAFLAEPAYRSVIAAVRRRIRLPIAAEVTPGFDLCVNISGVSKPEKIMPRLK 388

Query: 123 LELKGGAVFAPPPRNYFLETEERVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDREASRL 182
            EL GGA+F PPPRNYF+ETEE++ CLAI++V    GFSVIGNLMQQGFL EFDR+ SRL
Sbjct: 389 FELAGGALFVPPPRNYFIETEEQIQCLAIQSVNPKVGFSVIGNLMQQGFLFEFDRDRSRL 448

Query: 183 GFSRRGCGLP 192
           GFSRRGC LP
Sbjct: 449 GFSRRGCALP 455

The following BLAST results are available for this feature:

Database: Swiss-Prot
Match Name   E-value  Identity (%)  Description
NEP1_NEPGR   7.6e-23  34.57         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
APF2_ARATH   3.5e-20  34.88         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP2_NEPGR   6.0e-20  32.94         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
ASPG2_ARATH  1.0e-19  33.87         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
ASPG1_ARATH  5.8e-15  30.00         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...

Database: TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KNH6_CUCSA  1.5e-94  90.58         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1
A0A067K2U7_JATCU  1.2e-67  65.96         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1
B9SEI2_RICCO      8.6e-66  63.30         Basic 7S globulin 2 small subunit, putative OS=Ricinus communis GN=RCOM_0705030 ...
V4KZL9_EUTSA      9.5e-65  65.79         Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10004188mg PE=3 SV=1
F6HF17_VITVI      6.2e-64  66.48         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=...

Database: TAIR10
Match Name   E-value  Identity (%)  Description
AT3G25700.1  2.9e-65  65.73         Eukaryotic aspartyl protease family protein
AT2G42980.1  7.3e-24  40.74         Eukaryotic aspartyl protease family protein
AT3G59080.1  8.1e-23  39.02         Eukaryotic aspartyl protease family protein
AT2G03200.1  1.2e-21  31.52         Eukaryotic aspartyl protease family protein
AT1G01300.1  2.0e-21  34.88         Eukaryotic aspartyl protease family protein

Database: NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|659073000|ref|XP_008467208.1|  1.3e-94  91.10         PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]
gi|449451908|ref|XP_004143702.1|  2.2e-94  90.58         PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus]
gi|802680767|ref|XP_012082020.1|  1.7e-67  65.96         PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas]
gi|255566835|ref|XP_002524401.1|  1.2e-65  63.30         PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis]
gi|567142517|ref|XP_006395632.1|  1.4e-64  65.79         hypothetical protein EUTSA_v10004188mg [Eutrema salsugineum]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
biological_process  GO:0030163      protein catabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005576      extracellular region
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0004190      aspartic-type endopeptidase activity
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0008233      peptidase activity
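
The GO assignments above are whitespace-separated rows of category, accession and term name, so they can be grouped by ontology category with a few lines of Python. This is only an illustrative sketch: `group_go_terms` is a hypothetical helper, and the rows are copied from this record rather than fetched from any database.

```python
from collections import defaultdict

# Rows copied from the GO assignments listed for Cla002325.
GO_ROWS = """\
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005576 extracellular region
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0008233 peptidase activity
"""


def group_go_terms(text: str) -> dict:
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        # Split on any run of whitespace; the term name may contain spaces.
        parts = line.split(maxsplit=2)
        if len(parts) != 3 or not parts[1].startswith("GO:"):
            continue  # skip header or malformed rows
        category, accession, name = parts
        grouped[category].append((accession, name))
    return dict(grouped)


grouped = group_go_terms(GO_ROWS)
for category, terms in grouped.items():
    print(category, [accession for accession, _ in terms])
```

Splitting with `maxsplit=2` keeps multi-word term names such as "integral component of membrane" intact, and works whether the columns are separated by one space or by aligned padding.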

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla002325     Cla002325.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description               Source   Source Term       Source Description               Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES               coord: 1..189; score: 2.1E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10  -                                coord: 12..188; score: 3.0
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                   coord: 4..188; score: 6.07
None       No IPR available              PANTHER  PTHR13683:SF354   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 1..189; score: 2.1E

The following gene(s) are paralogous to this gene:

None