Tan0022648 (gene) Snake gourd v1

Overview
NameTan0022648
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEukaryotic aspartyl protease family protein
LocationLG01: 23201356 .. 23203382 (-)
RNA-Seq ExpressionTan0022648
SyntenyTan0022648
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTATTGTACAAGGAATCACACAACCAAAGTTGTAATTGAACCAAAGAAGAAAGAGGAAGAAAGAAGCCCCTAGTTCCCATGGCCACCACTCCTCAAACCCATACAAAAACCCATTTCCCCCTTCTAACCAGAACCTTCTTCTTCTTCTTCCTCAATCCTCACCTCTTTATATTCCCAACTTTCTCTCTCTAACTTCACTCCAAAAACCCCATCTCTTTCTGAACAGCCCTTCAATGGCTTCCCCTGTTTTTCTCCTCTTCCTCCTCTGTATTCTCCTTTCTTCCTCTGTTTTCTCTTCAGAAATTCTCCTTCTACCTCTCTCCCACTCCTTATCATCCTCATCAGATTTCAACAACACCCACAACCTCCTCAAATCTACTGCTGCCCGCTCCGTCGCCCGCTTCCACCACCGCCGCCGTACCCACCGCCGCAGCCACCTCTCTCTCCCACTCTCTCCAGGTGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGGCTCACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGCGAAGGCAAACCAAAAGTTCAATCCCCTTTGCCCAAAATCTCAAAACAGAAATCAGTTTCTTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGCGGCTCCCTCTCCTCCTCCCACCTCTGTGCAATTTCCCGATGCCCACTTGAATCCATTGAAATTTCTGAGTGTTCTTCCTTTTCTTGTCCGCCTTTCTATTATGCTTATGGCGATGGGAGTTTAATTGCTCAGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCCGCACCGGCACCGGCGATTAGTGTTCGGAATTTTACTTTTGGATGTGCCCACACGGCGCTCGGTGAGCCGGTCGGGGTCGCCGGGTTCGGTCGGGGAACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAATTGGGGAACCGGTTTTCTTATTGTTTGGTTTCTCATTCGTTTGCGGCGGACCGGGTTCGCCGCCCGAGTCCGCTGATTCTGGGGCGGTACTACGGCGGCGAGACGGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGAAGCATCCTTACTATTACTCGGTTGGGTTGGCGGGAATTTCGGTCGGGTCGGTGAGAATTCCGGCGCCGGAGTTTTTGAAACGGGTGGATGAGGGGGGCAGCGGCGGCGTTGTAGTGGATTCCGGTACTACTTTCACTATGCTGCCGGCCGGTTTGTATGACTCGGTGGTGGGTCAGTTTGAGAATCGGACCGGGCAAGTTGCGACCCGGGCGAGCCGTATTGAAGAAAATACCGGGTTGAGCCCTTGTTATTACTATGATAACTCAATTGAAGTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAAATCCAGTGTGATGCTTCCTAGGAAGAATTATTTTTACGAGTTTTTGGATGGTGGAGATGGGACGGGGAGGAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGATGAGCTGGCAGGTGGGCCCGGAGCCACGCTGGGGAACTATCAACAACAGGGTTTTGAAGTGGTCTATGATTTAGAGAAGAACCGGGTCGGTTTTGCCCGGCGGCAGTGTTCGACGCTTTGGGACAGCTTGAACCGGAGTTAGTATGAACCGTGGGCCCGGTCGAGGACGTGAAGGTTGACAATTGAATGGTTTTGACTTGGGACTGTGCCAATGGTCAACGCTTTTGTGGTAAATAAGTTATTTTGACATTTGATGGGGTCTTTTTTGTAAATTCTTGTGAGCACTTCACTTCTTGCTTCACTGCTATTTCTAATAGTTAAAATTTGTATATGAAAAGTGTTCAAAATATATAATAAACAAAAAAGGGAAAAGATTATTTAATGGCTTATTGATTTATTAGTTCTGTTTTTAATGCATGAAGCAAGAAATGAAGTGCTTGCTACATTGGATGAATTTTTTTAAACTTTTTTTTTTTTTTTTTTTAAGGTT

mRNA sequence

TTATTGTACAAGGAATCACACAACCAAAGTTGTAATTGAACCAAAGAAGAAAGAGGAAGAAAGAAGCCCCTAGTTCCCATGGCCACCACTCCTCAAACCCATACAAAAACCCATTTCCCCCTTCTAACCAGAACCTTCTTCTTCTTCTTCCTCAATCCTCACCTCTTTATATTCCCAACTTTCTCTCTCTAACTTCACTCCAAAAACCCCATCTCTTTCTGAACAGCCCTTCAATGGCTTCCCCTGTTTTTCTCCTCTTCCTCCTCTGTATTCTCCTTTCTTCCTCTGTTTTCTCTTCAGAAATTCTCCTTCTACCTCTCTCCCACTCCTTATCATCCTCATCAGATTTCAACAACACCCACAACCTCCTCAAATCTACTGCTGCCCGCTCCGTCGCCCGCTTCCACCACCGCCGCCGTACCCACCGCCGCAGCCACCTCTCTCTCCCACTCTCTCCAGGTGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGGCTCACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGCGAAGGCAAACCAAAAGTTCAATCCCCTTTGCCCAAAATCTCAAAACAGAAATCAGTTTCTTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGCGGCTCCCTCTCCTCCTCCCACCTCTGTGCAATTTCCCGATGCCCACTTGAATCCATTGAAATTTCTGAGTGTTCTTCCTTTTCTTGTCCGCCTTTCTATTATGCTTATGGCGATGGGAGTTTAATTGCTCAGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCCGCACCGGCACCGGCGATTAGTGTTCGGAATTTTACTTTTGGATGTGCCCACACGGCGCTCGGTGAGCCGGTCGGGGTCGCCGGGTTCGGTCGGGGAACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAATTGGGGAACCGGTTTTCTTATTGTTTGGTTTCTCATTCGTTTGCGGCGGACCGGGTTCGCCGCCCGAGTCCGCTGATTCTGGGGCGGTACTACGGCGGCGAGACGGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGAAGCATCCTTACTATTACTCGGTTGGGTTGGCGGGAATTTCGGTCGGGTCGGTGAGAATTCCGGCGCCGGAGTTTTTGAAACGGGTGGATGAGGGGGGCAGCGGCGGCGTTGTAGTGGATTCCGGTACTACTTTCACTATGCTGCCGGCCGGTTTGTATGACTCGGTGGTGGGTCAGTTTGAGAATCGGACCGGGCAAGTTGCGACCCGGGCGAGCCGTATTGAAGAAAATACCGGGTTGAGCCCTTGTTATTACTATGATAACTCAATTGAAGTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAAATCCAGTGTGATGCTTCCTAGGAAGAATTATTTTTACGAGTTTTTGGATGGTGGAGATGGGACGGGGAGGAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGATGAGCTGGCAGGTGGGCCCGGAGCCACGCTGGGGAACTATCAACAACAGGGTTTTGAAGTGGTCTATGATTTAGAGAAGAACCGGGTCGGTTTTGCCCGGCGGCAGTGTTCGACGCTTTGGGACAGCTTGAACCGGAGTTAGTATGAACCGTGGGCCCGGTCGAGGACGTGAAGGTTGACAATTGAATGGTTTTGACTTGGGACTGTGCCAATGGTCAACGCTTTTGTGGTAAATAAGTTATTTTGACATTTGATGGGGTCTTTTTTGTAAATTCTTGTGAGCACTTCACTTCTTGCTTCACTGCTATTTCTAATAGTTAAAATTTGTATATGAAAAGTGTTCAAAATATATAATAAACAAAAAAGGGAAAAGATTATTTAATGGCTTATTGATTTATTAGTTCTGTTTTTAATGCATGAAGCAAGAAATGAAGTGCTTGCTACATTGGATGAATTTTTTTAAACTTTTTTTTTTTTTTTTTTTAAGGTT

Coding sequence (CDS)

ATGGCTTCCCCTGTTTTTCTCCTCTTCCTCCTCTGTATTCTCCTTTCTTCCTCTGTTTTCTCTTCAGAAATTCTCCTTCTACCTCTCTCCCACTCCTTATCATCCTCATCAGATTTCAACAACACCCACAACCTCCTCAAATCTACTGCTGCCCGCTCCGTCGCCCGCTTCCACCACCGCCGCCGTACCCACCGCCGCAGCCACCTCTCTCTCCCACTCTCTCCAGGTGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGGCTCACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCATTTGAATGTATTCTCTGCGAAGGCAAACCAAAAGTTCAATCCCCTTTGCCCAAAATCTCAAAACAGAAATCAGTTTCTTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGCGGCTCCCTCTCCTCCTCCCACCTCTGTGCAATTTCCCGATGCCCACTTGAATCCATTGAAATTTCTGAGTGTTCTTCCTTTTCTTGTCCGCCTTTCTATTATGCTTATGGCGATGGGAGTTTAATTGCTCAGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCCGCACCGGCACCGGCGATTAGTGTTCGGAATTTTACTTTTGGATGTGCCCACACGGCGCTCGGTGAGCCGGTCGGGGTCGCCGGGTTCGGTCGGGGAACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAATTGGGGAACCGGTTTTCTTATTGTTTGGTTTCTCATTCGTTTGCGGCGGACCGGGTTCGCCGCCCGAGTCCGCTGATTCTGGGGCGGTACTACGGCGGCGAGACGGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGAAGCATCCTTACTATTACTCGGTTGGGTTGGCGGGAATTTCGGTCGGGTCGGTGAGAATTCCGGCGCCGGAGTTTTTGAAACGGGTGGATGAGGGGGGCAGCGGCGGCGTTGTAGTGGATTCCGGTACTACTTTCACTATGCTGCCGGCCGGTTTGTATGACTCGGTGGTGGGTCAGTTTGAGAATCGGACCGGGCAAGTTGCGACCCGGGCGAGCCGTATTGAAGAAAATACCGGGTTGAGCCCTTGTTATTACTATGATAACTCAATTGAAGTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAAATCCAGTGTGATGCTTCCTAGGAAGAATTATTTTTACGAGTTTTTGGATGGTGGAGATGGGACGGGGAGGAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGATGAGCTGGCAGGTGGGCCCGGAGCCACGCTGGGGAACTATCAACAACAGGGTTTTGAAGTGGTCTATGATTTAGAGAAGAACCGGGTCGGTTTTGCCCGGCGGCAGTGTTCGACGCTTTGGGACAGCTTGAACCGGAGTTAG

Protein sequence

MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
Homology
BLAST of Tan0022648 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 587.8 bits (1514), Expect = 1.1e-166
Identity = 293/477 (61.43%), Postives = 355/477 (74.42%), Query Frame = 0

Query: 25  LLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFN 84
           LLL LSHSLS+S   ++  +LLKS+++RS ARF       ++  LSLP+S G DY +S +
Sbjct: 29  LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88

Query: 85  LGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQ-KSVSCSAAACSA 144
           +GS +  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S    +VSCS+ +CSA
Sbjct: 89  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148

Query: 145 AHGGSLSSSHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP 204
           AH  SL SS LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSL   
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208

Query: 205 APAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSF 264
              P++SV NFTFGCAHT L EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF
Sbjct: 209 ---PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 268

Query: 265 AADRVRRPSPLILGRYYG--------------------GETEFVYTSLLENPKHPYYYSV 324
            +DRVRRPSPLILGR+                       + EFV+T +LENPKHPY+YSV
Sbjct: 269 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 328

Query: 325 GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVA 384
            L GIS+G   IPAP  L+R+D+ G GGVVVDSGTTFTMLPA  Y+SVV +F++R G+V 
Sbjct: 329 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 388

Query: 385 TRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKR 444
            RA R+E ++G+SPCYY + +++VP +VLHF G +SSV LPR+NYFYEF+DGGDG   KR
Sbjct: 389 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 448

Query: 445 KVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 479
           K+GCLMLMNGGDE EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Sbjct: 449 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of Tan0022648 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 2.7e-35
Identity = 121/405 (29.88%), Postives = 178/405 (43.95%), Query Frame = 0

Query: 77  GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSC 136
           G+Y ++ ++G+ A   S  MDTGSDL+W  C P  C  C          P  + Q S S 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152

Query: 137 SAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGS-LIAQLYRDS 196
           S   C         SS LC       +++    CS+  C  + Y YGDGS     +  ++
Sbjct: 153 STLPC---------SSQLC-------QALSSPTCSNNFC-QYTYGYGDGSETQGSMGTET 212

Query: 197 LSLPAPAPAPAISVRNFTFGCAHT----ALGEPVGVAGFGRGTLSMPSQLATFSPQLGNR 256
           L+        ++S+ N TFGC         G   G+ G GRG LS+PSQL         +
Sbjct: 213 LTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------TK 272

Query: 257 FSYCLVSHSFAADRVRRPSPLILGRYYGGETE-FVYTSLLENPKHPYYYSVGLAGISVGS 316
           FSYC+     +      PS L+LG      T     T+L+++ + P +Y + L G+SVGS
Sbjct: 273 FSYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 332

Query: 317 VRIPA-PEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEE 376
            R+P  P         G+GG+++DSGTT T      Y SV  +F ++        S    
Sbjct: 333 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS---- 392

Query: 377 NTGLSPCYYY---DNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCL 436
           ++G   C+      +++++P  V+HF G    + LP +NYF                G +
Sbjct: 393 SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENYFI-----------SPSNGLI 434

Query: 437 MLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 472
            L  G     +     +  GN QQQ   VVYD   + V FA  QC
Sbjct: 453 CLAMGSSSQGM-----SIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Tan0022648 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 1.0e-34
Identity = 133/410 (32.44%), Postives = 185/410 (45.12%), Query Frame = 0

Query: 73  LSPG-GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQ 132
           LS G G+Y     +G+ A  + + +DTGSD+VW  C+P  C  C  +       P    +
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 133 KSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-IAQ 192
           KS + +   CS+ H   L S+  C   R          C       +  +YGDGS  +  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 193 LYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQ 252
              ++L+           V+    GC H   G  VG A   G G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---GHR 314

Query: 253 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGIS 312
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   +Y VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGIS 374

Query: 313 VGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASR 372
           VG  R+P     L ++D+ G+GGV++DSGT+ T L    Y ++   F  R G  A    R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 373 IEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVG 432
             + +    C+     N ++VP VVLHF G  + V LP  NY                  
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP--------------- 485

Query: 433 CLMLMNGGDEDELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQCS 473
             +  NG      AG  G  + +GN QQQGF VVYDL  +RVGFA   C+
Sbjct: 495 --VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Tan0022648 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 149.1 bits (375), Expect = 1.3e-34
Identity = 121/415 (29.16%), Postives = 176/415 (42.41%), Query Frame = 0

Query: 70  SLPLSPG-----GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSP 129
           S+P++ G     G+Y +   LG+    + + +DT +D VW PCS      C G     + 
Sbjct: 90  SVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG-----CSGCSNASTS 149

Query: 130 L--PKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAY 189
                 S   +VSCS A C+ A G             CP  S + S CS      F  +Y
Sbjct: 150 FNTNSSSTYSTVSCSTAQCTQARG-----------LTCPSSSPQPSVCS------FNQSY 209

Query: 190 -GDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGE---PVGVAGFGRGTLSMP 249
            GD S  A L +D+L+L     AP + + NF+FGC ++A G    P G+ G GRG +S+ 
Sbjct: 210 GGDSSFSASLVQDTLTL-----APDV-IPNFSFGCINSASGNSLPPQGLMGLGRGPMSLV 269

Query: 250 SQLATFSPQLGNRFSYCLVS-HSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPY 309
           SQ  +        FSYCL S  SF          L LG   G      YT LL NP+ P 
Sbjct: 270 SQTTSL---YSGVFSYCLPSFRSFYFS-----GSLKLG-LLGQPKSIRYTPLLRNPRRPS 329

Query: 310 YYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRT 369
            Y V L G+SVGSV++P        D     G ++DSGT  T     +Y+++  +F  + 
Sbjct: 330 LYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQ- 389

Query: 370 GQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGT 429
                  S          C+  DN    P++ LH       + LP +N           T
Sbjct: 390 ----VNVSSFSTLGAFDTCFSADNENVAPKITLHMT--SLDLKLPMEN-----------T 449

Query: 430 GRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 473
                 G L  ++     + A      + N QQQ   +++D+  +R+G A   C+
Sbjct: 450 LIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of Tan0022648 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 147.9 bits (372), Expect = 2.9e-34
Identity = 120/401 (29.93%), Postives = 172/401 (42.89%), Query Frame = 0

Query: 77  GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSC 136
           G+Y     +G+ A ++ L +DTGSD+ W  C P  C  C  +          S  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 137 SAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-IAQLYRDS 196
           SA  CS                      +E S C S  C  +  +YGDGS  + +L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279

Query: 197 LSLPAPAPAPAISVRNFTFGCAHTALG---EPVGVAGFGRGTLSMPSQLATFSPQLGNRF 256
           ++        +  + N   GC H   G      G+ G G G LS+ +Q+   S      F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339

Query: 257 SYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVR 316
           SYCLV            + + LG   GG+       LL N K   +Y VGL+G SVG  +
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLG---GGDAT---APLLRNKKIDTFYYVGLSGFSVGGEK 399

Query: 317 IPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTG 376
           +  P+ +  VD  GSGGV++D GT  T L    Y+S+   F   T  +   +S I   + 
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI---SL 459

Query: 377 LSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMN 436
              CY +   ++++VP V  HF G K S+ LP KNY     D G          C     
Sbjct: 460 FDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFAP 500

Query: 437 GGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 472
                 +       +GN QQQG  + YDL KN +G +  +C
Sbjct: 520 TSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Tan0022648 vs. NCBI nr
Match: XP_023553227.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 897.1 bits (2317), Expect = 6.5e-257
Identity = 444/483 (91.93%), Postives = 465/483 (96.27%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MASPVF LFLLC LLSS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHR
Sbjct: 1   MASPVF-LFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 VQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYY 180
           +QSPLPKI+ +KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIAQLYRDSLSLPAPAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMP 240
           AYGDGSLI +LYRDSLSLPAPAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY 300
           SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTSLLENPKHPY+
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYF 300

Query: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTG 360
           YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 QVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTG 420
           +VA+RASRIEENTGLSPCYYY+NS+EVPRVVLHFVGEKSSV+LPRKNYFYEFLDGGDG  
Sbjct: 361 RVASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVE 420

Query: 421 RKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of Tan0022648 vs. NCBI nr
Match: KAG6577689.1 (putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 894.8 bits (2311), Expect = 3.2e-256
Identity = 443/483 (91.72%), Postives = 463/483 (95.86%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MASPVF LFLLC L+SS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHR
Sbjct: 1   MASPVF-LFLLCFLISSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 VQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYY 180
           +QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIAQLYRDSLSLPAPAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMP 240
           AYGDGSLI +LYRDSLSLPAPAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY 300
           SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYF 300

Query: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTG 360
           YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 QVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTG 420
           +VA+RASRIEENTGLSPCY Y+ S+EVPRVVLHFVGEKSSV LPRKNYFYEFLDGGDG G
Sbjct: 361 RVASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of Tan0022648 vs. NCBI nr
Match: XP_022923540.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 894.0 bits (2309), Expect = 5.5e-256
Identity = 443/483 (91.72%), Postives = 462/483 (95.65%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MASPVF LFLLC L SS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHR
Sbjct: 1   MASPVF-LFLLCFLFSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 VQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYY 180
           +QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIAQLYRDSLSLPAPAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMP 240
           AYGDGSLI +LYRDSLSLPAPAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY 300
           SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYF 300

Query: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTG 360
           YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 QVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTG 420
           +VA+RASRIEENTGLSPCY Y+ S+EVPRVVLHFVGEKSSV LPRKNYFYEFLDGGDG G
Sbjct: 361 RVASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of Tan0022648 vs. NCBI nr
Match: XP_023007805.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 894.0 bits (2309), Expect = 5.5e-256
Identity = 440/481 (91.48%), Postives = 461/481 (95.84%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MASPVF LFLLC LL S VFSS+ILLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHR
Sbjct: 1   MASPVF-LFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 VQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYY 180
           +QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQ 240
           AYGDGSLI +LYRDSLSLPAPAP+PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP Q
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQ 240

Query: 241 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYS 300
           LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+YS
Sbjct: 241 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYS 300

Query: 301 VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQV 360
           VGLAGISVGSV IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+V
Sbjct: 301 VGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 360

Query: 361 ATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRK 420
           A+RAS+IEENTGLSPCYYY+ S+EVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDG GRK
Sbjct: 361 ASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRK 420

Query: 421 RKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNR 480
            KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNR
Sbjct: 421 IKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNR 480

Query: 481 S 482
           S
Sbjct: 481 S 480

BLAST of Tan0022648 vs. NCBI nr
Match: XP_038905814.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 886.3 bits (2289), Expect = 1.2e-253
Identity = 438/482 (90.87%), Postives = 460/482 (95.44%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHH 60
           MAS VF+L LLC LLSS VFSS++LLLPLSHSLSSS SDFNNTHNLLKSTAARS ARFHH
Sbjct: 1   MASSVFVL-LLCFLLSSPVFSSQLLLLPLSHSLSSSISDFNNTHNLLKSTAARSSARFHH 60

Query: 61  RRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKP 120
           RRRT   +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSDLVWFPCSPFECILCEGKP
Sbjct: 61  RRRTQHHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKP 120

Query: 121 KVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFY 180
           KVQSPLPKIS  KSVSCSA ACSAAHGGSLS+SHLCAIS+CPLESIEISECSSFSCPPFY
Sbjct: 121 KVQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFY 180

Query: 181 YAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPS 240
           YAYGDGSLIA+LYRDSLSLPAPAP+PAI+VRNFTFGCAHTALGEPVGVAGFGRGTLSMPS
Sbjct: 181 YAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTALGEPVGVAGFGRGTLSMPS 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYY 300
           QLATFSPQLGNRFSYCLVSHSFAA+RVRRPSPLILGRYYGGETEF+YTSLLENPKHPY+Y
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAAERVRRPSPLILGRYYGGETEFIYTSLLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQ 360
           SVGL GISVG++ IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLYDSVV  FENRTG+
Sbjct: 301 SVGLTGISVGNMMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAAFENRTGR 360

Query: 361 VATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGR 420
           VA RA RIEENTGLSPCYYY+NS+EVPRVVLHFVGEKSSV+LP+KNYFYEFLDGGDG G+
Sbjct: 361 VANRARRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVLLPKKNYFYEFLDGGDGVGK 420

Query: 421 KRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLN 480
           KRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDL KNRVGFARRQCSTLWDSLN
Sbjct: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLAKNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 481

BLAST of Tan0022648 vs. ExPASy TrEMBL
Match: A0A6J1L3Z9 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303 PE=3 SV=1)

HSP 1 Score: 894.0 bits (2309), Expect = 2.7e-256
Identity = 440/481 (91.48%), Postives = 461/481 (95.84%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MASPVF LFLLC LL S VFSS+ILLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHR
Sbjct: 1   MASPVF-LFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 VQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYY 180
           +QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQ 240
           AYGDGSLI +LYRDSLSLPAPAP+PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP Q
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQ 240

Query: 241 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYS 300
           LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+YS
Sbjct: 241 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYS 300

Query: 301 VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQV 360
           VGLAGISVGSV IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+V
Sbjct: 301 VGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 360

Query: 361 ATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRK 420
           A+RAS+IEENTGLSPCYYY+ S+EVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDG GRK
Sbjct: 361 ASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRK 420

Query: 421 RKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNR 480
            KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNR
Sbjct: 421 IKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNR 480

Query: 481 S 482
           S
Sbjct: 481 S 480

BLAST of Tan0022648 vs. ExPASy TrEMBL
Match: A0A6J1EC44 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111431201 PE=3 SV=1)

HSP 1 Score: 894.0 bits (2309), Expect = 2.7e-256
Identity = 443/483 (91.72%), Postives = 462/483 (95.65%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MASPVF LFLLC L SS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHR
Sbjct: 1   MASPVF-LFLLCFLFSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 VQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYY 180
           +QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIAQLYRDSLSLPAPAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMP 240
           AYGDGSLI +LYRDSLSLPAPAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY 300
           SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYF 300

Query: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTG 360
           YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 QVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTG 420
           +VA+RASRIEENTGLSPCY Y+ S+EVPRVVLHFVGEKSSV LPRKNYFYEFLDGGDG G
Sbjct: 361 RVASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of Tan0022648 vs. ExPASy TrEMBL
Match: A0A0A0L5I7 (Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1)

HSP 1 Score: 865.9 bits (2236), Expect = 7.8e-248
Identity = 429/481 (89.19%), Postives = 453/481 (94.18%), Query Frame = 0

Query: 3   SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRR 62
           SPVF +FLLC LLSS VFSS+I LLPLSHSLSSS SDFNNTHNLLKSTA RS ARFH   
Sbjct: 4   SPVF-IFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR-- 63

Query: 63  RTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKV 122
             HR +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSDLVWFPCSPFECILCEGKPK+
Sbjct: 64  --HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKI 123

Query: 123 QSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYA 182
           QSPLPKI+  KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIEISECSSFSCPPFYYA
Sbjct: 124 QSPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYA 183

Query: 183 YGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQL 242
           YGDGSL+A+LYRDSLSLP PAP+P I+VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQL
Sbjct: 184 YGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 243

Query: 243 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSV 302
           ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY GETEF+YTSLLENPKHPY+YSV
Sbjct: 244 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSV 303

Query: 303 GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVA 362
           GLAGISVG++RIPAPEFL +VDEGGSGGVVVDSGTTFTMLPAGLY+SVV +FENRTG+VA
Sbjct: 304 GLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVA 363

Query: 363 TRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDG-TGRK 422
            RA RIEENTGLSPCYYY+NS+ VPRVVLHFVGEKS+V+LPRKNYFYEFLDGGDG  GRK
Sbjct: 364 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 423

Query: 423 RKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNR 482
           RKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNR
Sbjct: 424 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNR 479

BLAST of Tan0022648 vs. ExPASy TrEMBL
Match: A0A1S3BK28 (aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 SV=1)

HSP 1 Score: 861.7 bits (2225), Expect = 1.5e-246
Identity = 428/483 (88.61%), Postives = 453/483 (93.79%), Query Frame = 0

Query: 3   SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRR 62
           SPVF +FLLC LLSS VFSS+I LLPLSHSLSSS SDFN+THNLLKSTA RS ARFH   
Sbjct: 4   SPVF-IFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR-- 63

Query: 63  RTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKV 122
             HR +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSDLVWFPCSPFECILCEGKPK+
Sbjct: 64  --HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKI 123

Query: 123 QSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYA 182
           QSPLPKIS  KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIEISECSSFSCPPFYYA
Sbjct: 124 QSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYA 183

Query: 183 YGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQL 242
           YGDGSL+A+LYRDSLSLP PAP+P I+VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQL
Sbjct: 184 YGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQL 243

Query: 243 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSV 302
           ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+ GETEF+YTSLLENPKHPY+YSV
Sbjct: 244 ATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV 303

Query: 303 GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVA 362
           GLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY+SVV +FENRTG+VA
Sbjct: 304 GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVA 363

Query: 363 TRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDG---TG 422
            RA RIEENTGLSPCYYY+NS+ VPRVVLHFVGEKSSV+LPRKNYFYEFLDGGDG    G
Sbjct: 364 NRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVG 423

Query: 423 RKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 482
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+L
Sbjct: 424 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL 481

BLAST of Tan0022648 vs. ExPASy TrEMBL
Match: A0A5D3CP11 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00280 PE=3 SV=1)

HSP 1 Score: 857.8 bits (2215), Expect = 2.1e-245
Identity = 429/485 (88.45%), Postives = 453/485 (93.40%), Query Frame = 0

Query: 3   SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRR 62
           SPVF +FLLC LLSS VFSS+I LLPLSHSLSSS SDFN+THNLLKSTA RS ARFH   
Sbjct: 4   SPVF-IFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR-- 63

Query: 63  RTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKV 122
             HR +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSDLVWFPCSPFECILCEGKPK+
Sbjct: 64  --HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKI 123

Query: 123 QSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYA 182
           QSPLPKIS  KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIEISECSSFSCPPFYYA
Sbjct: 124 QSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYA 183

Query: 183 YGDGSLIAQLYRDSLSLPAPAPAPA--ISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPS 242
           YGDGSL+A+LYRDSLSLP PAPAP+  I+VRNFTFGCAHT LGEPVGVAGFGRG LSMPS
Sbjct: 184 YGDGSLVARLYRDSLSLPTPAPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPS 243

Query: 243 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYY 302
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+ GETEF+YTSLLENPKHPY+Y
Sbjct: 244 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFY 303

Query: 303 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQ 362
           SVGLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY+SVV +FENRTG+
Sbjct: 304 SVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGK 363

Query: 363 VATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDG--- 422
           VA RA RIEENTGLSPCYYY NS+ VPRVVLHFVGEKSSV+LPRKNYFYEFLDGGDG   
Sbjct: 364 VANRARRIEENTGLSPCYYYQNSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVE 423

Query: 423 TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD 482
            GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD
Sbjct: 424 VGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD 483

BLAST of Tan0022648 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 587.8 bits (1514), Expect = 7.9e-168
Identity = 293/477 (61.43%), Postives = 355/477 (74.42%), Query Frame = 0

Query: 25  LLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFN 84
           LLL LSHSLS+S   ++  +LLKS+++RS ARF       ++  LSLP+S G DY +S +
Sbjct: 29  LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88

Query: 85  LGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQ-KSVSCSAAACSA 144
           +GS +  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S    +VSCS+ +CSA
Sbjct: 89  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148

Query: 145 AHGGSLSSSHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP 204
           AH  SL SS LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSL   
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208

Query: 205 APAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSF 264
              P++SV NFTFGCAHT L EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF
Sbjct: 209 ---PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSF 268

Query: 265 AADRVRRPSPLILGRYYG--------------------GETEFVYTSLLENPKHPYYYSV 324
            +DRVRRPSPLILGR+                       + EFV+T +LENPKHPY+YSV
Sbjct: 269 DSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSV 328

Query: 325 GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVA 384
            L GIS+G   IPAP  L+R+D+ G GGVVVDSGTTFTMLPA  Y+SVV +F++R G+V 
Sbjct: 329 SLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVH 388

Query: 385 TRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKR 444
            RA R+E ++G+SPCYY + +++VP +VLHF G +SSV LPR+NYFYEF+DGGDG   KR
Sbjct: 389 ERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 448

Query: 445 KVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 479
           K+GCLMLMNGGDE EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Sbjct: 449 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of Tan0022648 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 198.0 bits (502), Expect = 1.8e-50
Identity = 161/499 (32.26%), Postives = 234/499 (46.89%), Query Frame = 0

Query: 5   VFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTH 64
           V  LFLL  LL ++   ++        + SSSS       L KS+ +    +   + R  
Sbjct: 7   VLFLFLLITLLLNTTNKTQ---ARQHKNPSSSSSSFLVLTLTKSSVSLPTPKSQTQERIK 66

Query: 65  R-RSHLSLPLSP----GGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCS--PFECILCEG 124
           +  S + + + P       Y ++ N+G+    + +Y+DTGSDL W PC    F+CI C  
Sbjct: 67  KPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYD 126

Query: 125 -------KPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISEC 184
                   P V SPL   +  +  SC+++ C   H  S +    CA++ C +  +  S C
Sbjct: 127 LKNNDLKSPSVFSPLHSSTSFRD-SCASSFCVEIH-SSDNPFDPCAVAGCSVSMLLKSTC 186

Query: 185 SSFSCPPFYYAYGDGSLIAQ-LYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAG 244
               CP F Y YG+G LI+  L RD L       A    V  F+FGC  +   EP+G+AG
Sbjct: 187 VR-PCPSFAYTYGEGGLISGILTRDILK------ARTRDVPRFSFGCVTSTYREPIGIAG 246

Query: 245 FGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGR---YYGGETEFVY 304
           FGRG LS+PSQL      L   FS+C +   F  +     SPLILG             +
Sbjct: 247 FGRGLLSLPSQLGF----LEKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQF 306

Query: 305 TSLLENPKHPYYYSVGLAGISVGSVRIP--APEFLKRVDEGGSGGVVVDSGTTFTMLPAG 364
           T +L  P +P  Y +GL  I++G+   P   P  L++ D  G+GG++VDSGTT+T LP  
Sbjct: 307 TPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEP 366

Query: 365 LYDSVVGQFENRTGQVATRASRIEENTGLSPCYYY------------DNSIEVPRVVLHF 424
            Y  ++   ++       RA+  E  TG   CY              D  +  P +  HF
Sbjct: 367 FYSQLLTTLQSTI--TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHF 426

Query: 425 VGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQG 472
           +   ++++LP+ N FY      DG+     V CL+  N  D D    GP    G++QQQ 
Sbjct: 427 L-NNATLLLPQGNSFYAMSAPSDGS----VVQCLLFQNMEDGDY---GPAGVFGSFQQQN 478

BLAST of Tan0022648 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 178.7 bits (452), Expect = 1.1e-44
Identity = 152/506 (30.04%), Postives = 229/506 (45.26%), Query Frame = 0

Query: 1   MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHR 60
           MAS +F  FL+ + + S+V   ++ L P SHS  S  D    +  L+  A  S+AR H  
Sbjct: 1   MASSIFFFFLIFLSVVSAV---KLPLSPFSHSDQSPKD---PYLSLRRLAESSIARAHKL 60

Query: 61  RR---------------THRRSHLSLPLSPG--GDYTLSFNLGSEAHKISLYMDTGSDLV 120
           +                T   + +  PLS    G Y++S + G+ +  I    DTGS LV
Sbjct: 61  KHGTSIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLV 120

Query: 121 WFPC-SPFECILCEGKPKVQSPLPKI-----SKQKSVSCSAAACSAAHGGSLSSSHLCAI 180
           W PC S + C  C+      + +P+      S  K + C +  C   +G ++        
Sbjct: 121 WLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV-------- 180

Query: 181 SRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCA 240
            +C         C +  CPP+   YG GS    L  + L        P ++V +F  GC+
Sbjct: 181 -QCRGCDPNTRNC-TVGCPPYILQYGLGSTAGVLITEKLDF------PDLTVPDFVVGCS 240

Query: 241 HTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY 300
             +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  
Sbjct: 241 IISTRQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSG 300

Query: 301 Y--GGETE-FVYTSLLENPK-----HPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGG 360
           +  G +T    YT   +NP         YY + L  I VG   +  P         G GG
Sbjct: 301 HNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGG 360

Query: 361 VVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYD--NSIEVPR 420
            +VDSG+TFT +   +++ V  +F ++     TR   +E+ TGL PC+       + VP 
Sbjct: 361 SIVDSGSTFTFMERPVFELVAEEFASQMSNY-TREKDLEKETGLGPCFNISGKGDVTVPE 420

Query: 421 VVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAG-GPGATLG 473
           ++  F G  + + LP  NYF  F+   D         CL +++    +   G GP   LG
Sbjct: 421 LIFEFKG-GAKLELPLSNYF-TFVGNTDTV-------CLTVVSDKTVNPSGGTGPAIILG 468

BLAST of Tan0022648 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 150.2 bits (378), Expect = 4.2e-36
Identity = 126/421 (29.93%), Postives = 182/421 (43.23%), Query Frame = 0

Query: 63  THRRSHLSLPLSPG-----GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEG 122
           T     +  PL  G     G+Y     +G  A ++ + +DTGSD+ W  C+P  C  C  
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP--CADCYH 186

Query: 123 KPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPP 182
           + +        S  + +SC    C+A                     +E+SEC + +C  
Sbjct: 187 QTEPIFEPSSSSSYEPLSCDTPQCNA---------------------LEVSECRNATC-L 246

Query: 183 FYYAYGDGS-LIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRG 242
           +  +YGDGS  +     ++L++ +        V+N   GC H+  G  VG A   G G G
Sbjct: 247 YEVSYGDGSYTVGDFATETLTIGSTL------VQNVAVGCGHSNEGLFVGAAGLLGLGGG 306

Query: 243 TLSMPSQLATFSPQLGNRFSYCLVSH-SFAADRVRRPSPLILGRYYGGETEFVYTSLLEN 302
            L++PSQL T S      FSYCLV   S +A  V   + L          + V   LL N
Sbjct: 307 LLALPSQLNTTS------FSYCLVDRDSDSASTVDFGTSL--------SPDAVVAPLLRN 366

Query: 303 PKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQ 362
            +   +Y +GL GISVG   +  P+    +DE GSGG+++DSGT  T L   +Y+S+   
Sbjct: 367 HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDS 426

Query: 363 FENRTGQVATRASRIEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEF 422
           F   T  +   A     +T    CY      ++EVP V  HF G K  + LP KNY    
Sbjct: 427 FVKGTLDLEKAAGVAMFDT----CYNLSAKTTVEVPTVAFHFPGGK-MLALPAKNYMIPV 483

Query: 423 LDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ 472
                       VG   L        L     A +GN QQQG  V +DL  + +GF+  +
Sbjct: 487 ----------DSVGTFCLAFAPTASSL-----AIIGNVQQQGTRVTFDLANSLIGFSSNK 483

BLAST of Tan0022648 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 149.4 bits (376), Expect = 7.2e-36
Identity = 133/410 (32.44%), Postives = 185/410 (45.12%), Query Frame = 0

Query: 73  LSPG-GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQ 132
           LS G G+Y     +G+ A  + + +DTGSD+VW  C+P  C  C  +       P    +
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 133 KSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL-IAQ 192
           KS + +   CS+ H   L S+  C   R          C       +  +YGDGS  +  
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 193 LYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQ 252
              ++L+           V+    GC H   G  VG A   G G+G LS P Q      +
Sbjct: 255 FSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---GHR 314

Query: 253 LGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGIS 312
              +FSYCLV  S ++    +PS ++ G          +T LL NPK   +Y VGL GIS
Sbjct: 315 FNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGIS 374

Query: 313 VGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASR 372
           VG  R+P     L ++D+ G+GGV++DSGT+ T L    Y ++   F  R G  A    R
Sbjct: 375 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTLKR 434

Query: 373 IEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVG 432
             + +    C+     N ++VP VVLHF G  + V LP  NY                  
Sbjct: 435 APDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP--------------- 485

Query: 433 CLMLMNGGDEDELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQCS 473
             +  NG      AG  G  + +GN QQQGF VVYDL  +RVGFA   C+
Sbjct: 495 --VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q940R41.1e-16661.43Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q766C32.7e-3529.88Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LNJ31.0e-3432.44Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
O044961.3e-3429.16Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1[more]
Q9LS402.9e-3429.93Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
XP_023553227.16.5e-25791.93probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
KAG6577689.13.2e-25691.72putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022923540.15.5e-25691.72probable aspartyl protease At4g16563 [Cucurbita moschata][more]
XP_023007805.15.5e-25691.48probable aspartyl protease At4g16563 [Cucurbita maxima][more]
XP_038905814.11.2e-25390.87probable aspartyl protease At4g16563 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1L3Z92.7e-25691.48probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303... [more]
A0A6J1EC442.7e-25691.72probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114312... [more]
A0A0A0L5I77.8e-24889.19Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1[more]
A0A1S3BK281.5e-24688.61aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 S... [more]
A0A5D3CP112.1e-24588.45Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
Match NameE-valueIdentityDescription
AT4G16563.17.9e-16861.43Eukaryotic aspartyl protease family protein [more]
AT5G45120.11.8e-5032.26Eukaryotic aspartyl protease family protein [more]
AT3G52500.11.1e-4430.04Eukaryotic aspartyl protease family protein [more]
AT1G25510.14.2e-3629.93Eukaryotic aspartyl protease family protein [more]
AT1G01300.17.2e-3632.44Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 79..262
e-value: 5.8E-30
score: 104.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 58..264
e-value: 9.6E-35
score: 122.2
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 278..477
e-value: 1.6E-47
score: 163.5
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 74..476
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 299..467
e-value: 8.2E-27
score: 94.0
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 1..476
NoneNo IPR availablePANTHERPTHR47967:SF26BNAA01G17170D PROTEINcoord: 1..476
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 330..341
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 79..467
score: 30.825125
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 79..471
e-value: 3.14703E-69
score: 219.827

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0022648.1Tan0022648.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity