Tan0022026 (gene) Snake gourd v1

Overview
NameTan0022026
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAspartyl protease family protein
LocationLG06: 3618881 .. 3622783 (-)
RNA-Seq ExpressionTan0022026
SyntenyTan0022026
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGATTGCAAAGCATTCCCATTTTCTCCTTCTTCTTCCTTTTCTCCTCCTCTTTGTCGTCGATGCTCATTCGAGCTCGGTTGACGTCATTAATGGCGACCACGAGAAGCTTCTTCTTAATCTTCAGAAACTTCCATGGAAGCAGCTGGAAAAGGCAGCAACTAGGTGCATCTTTCAGAAGCCAAGTGAGTGTTTAAAAACCCCTCTTCTAAAATGGAGAAAAAATGTTTCAATAAAATTTAATTATGTGTTGATTGTTATTAATTATCTTTGGGATCTGTTTTTTTCCCTTTTTTTTTGTTGAATGTGGCTTTTTTTGCTTTGCTTTTGCTAGGATCTAGGATTTGGGCCTTTTTTTTTTTTTTTTGCTTTAAAAGATGAACCCTTCATCATTGTTTTTTTTTTTCTTTATAAGTTAAAAGGGGTTTAATTTCTAATAACTCTAATCATTTTTTTGTGCTTTTGGATGATGTGATTTTTTTTTTGAAAAGGATGTGAATCTCTTTTTAAAAGGGTGTTTTTTGATTAATACATCGCACTATTAGCTCAAATTACATTTGGTTATCCTTGAAGAGCAATTTGTTTTGAGCCAAAAAAACAAAATATCAATTTTTTTTTGTGGACTTCTCTCACTCTATCTTCTCTCTCTTCTGCTGTTTAGAATTTCACTGTTTTTTTTTTTTTTGGAAACTATATGCACCAATTAAATAATTGATCCTTTTCTTTTCTCCCAGTTTTTTTCCCTTTTCTCCTGGTAATTCATCAGTAATTTTTAAGAAATATGGTGTAGGGAATAGGGTTGGATTAAACAATATTTCAGTCAGGTAATTTATTTGAGTTTAGATTGATTAATGTTCTTTATTTTATTTATTGGTGATTTAAAAATTTTAAAAGAGCTCTGGAAACTGAAATGAAAATTAAATATATTTAAATGAATATCAATCCAAGTAAGTTTATAATTCAATCTAATCCTACTTAAGGGATTGGAGTTCGGTTATATTATAAACGTGTTTGAGTCATTGATTTTTATCAATCATTAAATTTTGTGAATCTTTTAATTGTGGTCCTCTTGTTTTTTTTTTAGAACTCCACTCAGATTAGTCGGTTTTCTTTTTAAAGAAAATATCAATTTATTTAAGTAGGGATTCGAATCACTCACCTTTGGAGCATATATACTTTATCAGTTAGGTTGGACTTAAATTGGCTCTGCTATTTTTACCATCTTAGTTCTTAGAATTTAAGAATTTTATTTTATTAGATTTAAGGATTTTACTTTATTAGAATTATAAAGTACATGGAATAATATAAATAAATAAATATAAGCAGATAAAAGATTTCAAATCAAAGACTATTAAAATGTTATATCAAAGTATATTTCTTCTTATAAATAGTTTCTTCTATATATAGAATGATGAATACAAATAAAAAAAAATGCTGTCAGCAATGTTGATTTAATAAAGAGAAACTGCATATTATAAATATTCAACAATGTGAGAAAAATCCCAAATTGCCCTCGATTGGAGAGTTGGGTTGGAATGTAACTTCTTTTTCATGCTTTACATTACGGGTTATTTCAGTAATTAAGAAAAATGAGAGGGATATGCAAGCAAAAGCCTCTTTGTTTCTCCATATAACTGTCTAATTTAAGTATTATCCTTTTTTTTTTTTTTTTGAGTTCCAACTATTTTAAGTATTGTTATTACATGTAGACATCGTTATAGAGGTAGGAATCTAGGATACTCTAAATTATTTTTTGTGTGTTTCTTACAAATAGTTTCCTATACTTTTGTGTTTTCCCTCCCGCCCTCACAATCACTATGGAAAAAAATTACTACACATATAATAATAATACACTTAGAATAGGCCAGATTTTTCAAATCATAGGAATATAAATGTGACTAAAATTATATTTAGTATTTCCCTTATTAAGAAAAAACTAAAGTCCATACTGATAAGAACGAATTGCTCAAGCTTATTTATTTATTTATTTATTTTTATGGTAATTGCTCAAGCTTATTAAAGCTTACATAAATACTTCAATAACCTAATGTCCTAAGCTATGCTTAGATTAAGCATAATTCTTTCACAAATACAAAAAATATATATATATATATCTAGAACATGAGAGTTTAGATTAAAAAAATTATAAAAATTATCAAATGTGGTACCATATGTCACCGTCACTATTAGATATGAGATGAGTCCCTAACTATTAGGGATTATTTGTGGCGTGAGTTATAATAATTTGTGAAATACTTTACCTTCTAAATACAAACTAATTCTAAACACCGACTATTATAATTATAGATCAATCAACTCTATGTGTTAAACAATATTATATAATTCTACAAATTATTATAACTTACCAGGGCAGCACTTGAGTGTTTGAGTGTTGTTTAGGCAGTACTCTCATCTATTCACCACACATTTTTTTTCTATTTTTTAAATAAAAAAATATATGTAGCCAGAGGGCTGCACCATATGGCAGCACCCAAGGGCCAAGTATTTCATGTTTTTCATGTCTCAACGTTTATTTTTCTAATTTTTGTATTCCATCCTAGCGAAGCAAAATTGTTCCAACCACATCGAATAAATTTATTTGGGTTTGAATTTTTCAGGGGTGGAGAAGGGAACAACGATATTGGAAATGAAAGAAAGAGATTACTGTTCAGGCAAAGTCAGGGACTGGGATAAGAATCTCCAAAACCGCCTATTCCTGGATGCCATTCACGTCGAATCCCTCCAATCCCGGTTCAAATCCGCCATATTTCCCGGCCAGGCCCACCAAATCTCCGACTCCCAAATCCCCTTATCCCCCGGCACCCGCCTCCAAACCCTCAACTACATTGTCACCGTCGGCATCGGCCGCCAGAACGTAACCCTCATCGTCGACACCGGCAGCGATCTCACGTGGGTCCAGTGCCGCCCTTGCCGTCTCTGTTACAGCCAACAAGAACCCCTTTTCGATCCCTCAAATTCCTCTTCATTCCTCTCTCTTCCTTGCAATTCCACCACCTGTTCCGATCTTCAACCCGCAACCGGAAGTTCCGATGTTTGTGGAAATGGGAATTCAAATACCTGTGCTTATGAGATTAACTATGGCGATGGGTCTTATTCCCGAGGAGACCTTGGATTTGAGACGCTGAATTTGGGGAAAACATCGATTGAGAAGTTTGTATTTGGGTGTGGTCGGAATAACAAGGGGTTGTTTGGCGGAACTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTCGTTTCTCAAACTTCCTCTGTTTTTGGTGGGATTTTTTCTTACTGTTTGCCTACCGCTGGACTTGGATCTTCAGGTTCTTTAACAATGGGGGGTGGGGATTTTTCGAATTTCAGAAACGTTTCACCAATTTCTTACACGAGAATGGTTCCAAATCCTCAGATGTCGAATTTTTACATTCTGAATTTGACTGGAATTAGCGTTGGTGGGGTGAAATTGGATGTGCCGCGTTTGGCTTCGAATAATGGGGTTTTGAGTTTAATCGATTCTGGGACTGTGATTACCAGGTTGGCTCCATCGATTTACAGAGCTTTGAAAGTGGAATTCGAGAAGCAATTTTCTGGGTTCCAAACAGCGCCTGGGTTTTCGATTTTGAACACTTGTTTTAATCTTACTGGGTTGAAAGAAGTCAATATTCCGACTCTGAAATTTTATTTTGAAGGTGATGCAGAGCTGACTGTGGATGTTGAAGGGATTTTTTACTTTGTCAAAACTGATGATTCTCAGATCTGTTTGGCCTTTGCGAGTTTGGCTTCTGAAGATCAGATTGGGATTATTGGGAATTATCAGCAGAAGAATCAGAGGGTTATTTATAATTCCAAGGAATCGAAGGTGGGTTTTGCAGCAGAGCCTTGTAGTTTCTAG

mRNA sequence

ATGGAGATTGCAAAGCATTCCCATTTTCTCCTTCTTCTTCCTTTTCTCCTCCTCTTTGTCGTCGATGCTCATTCGAGCTCGGTTGACGTCATTAATGGCGACCACGAGAAGCTTCTTCTTAATCTTCAGAAACTTCCATGGAAGCAGCTGGAAAAGGCAGCAACTAGGTGCATCTTTCAGAAGCCAAGGGTGGAGAAGGGAACAACGATATTGGAAATGAAAGAAAGAGATTACTGTTCAGGCAAAGTCAGGGACTGGGATAAGAATCTCCAAAACCGCCTATTCCTGGATGCCATTCACGTCGAATCCCTCCAATCCCGGTTCAAATCCGCCATATTTCCCGGCCAGGCCCACCAAATCTCCGACTCCCAAATCCCCTTATCCCCCGGCACCCGCCTCCAAACCCTCAACTACATTGTCACCGTCGGCATCGGCCGCCAGAACGTAACCCTCATCGTCGACACCGGCAGCGATCTCACGTGGGTCCAGTGCCGCCCTTGCCGTCTCTGTTACAGCCAACAAGAACCCCTTTTCGATCCCTCAAATTCCTCTTCATTCCTCTCTCTTCCTTGCAATTCCACCACCTGTTCCGATCTTCAACCCGCAACCGGAAGTTCCGATGTTTGTGGAAATGGGAATTCAAATACCTGTGCTTATGAGATTAACTATGGCGATGGGTCTTATTCCCGAGGAGACCTTGGATTTGAGACGCTGAATTTGGGGAAAACATCGATTGAGAAGTTTGTATTTGGGTGTGGTCGGAATAACAAGGGGTTGTTTGGCGGAACTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTCGTTTCTCAAACTTCCTCTGTTTTTGGTGGGATTTTTTCTTACTGTTTGCCTACCGCTGGACTTGGATCTTCAGGTTCTTTAACAATGGGGGGTGGGGATTTTTCGAATTTCAGAAACGTTTCACCAATTTCTTACACGAGAATGGTTCCAAATCCTCAGATGTCGAATTTTTACATTCTGAATTTGACTGGAATTAGCGTTGGTGGGGTGAAATTGGATGTGCCGCGTTTGGCTTCGAATAATGGGGTTTTGAGTTTAATCGATTCTGGGACTGTGATTACCAGGTTGGCTCCATCGATTTACAGAGCTTTGAAAGTGGAATTCGAGAAGCAATTTTCTGGGTTCCAAACAGCGCCTGGGTTTTCGATTTTGAACACTTGTTTTAATCTTACTGGGTTGAAAGAAGTCAATATTCCGACTCTGAAATTTTATTTTGAAGGTGATGCAGAGCTGACTGTGGATGTTGAAGGGATTTTTTACTTTGTCAAAACTGATGATTCTCAGATCTGTTTGGCCTTTGCGAGTTTGGCTTCTGAAGATCAGATTGGGATTATTGGGAATTATCAGCAGAAGAATCAGAGGGTTATTTATAATTCCAAGGAATCGAAGGTGGGTTTTGCAGCAGAGCCTTGTAGTTTCTAG

Coding sequence (CDS)

ATGGAGATTGCAAAGCATTCCCATTTTCTCCTTCTTCTTCCTTTTCTCCTCCTCTTTGTCGTCGATGCTCATTCGAGCTCGGTTGACGTCATTAATGGCGACCACGAGAAGCTTCTTCTTAATCTTCAGAAACTTCCATGGAAGCAGCTGGAAAAGGCAGCAACTAGGTGCATCTTTCAGAAGCCAAGGGTGGAGAAGGGAACAACGATATTGGAAATGAAAGAAAGAGATTACTGTTCAGGCAAAGTCAGGGACTGGGATAAGAATCTCCAAAACCGCCTATTCCTGGATGCCATTCACGTCGAATCCCTCCAATCCCGGTTCAAATCCGCCATATTTCCCGGCCAGGCCCACCAAATCTCCGACTCCCAAATCCCCTTATCCCCCGGCACCCGCCTCCAAACCCTCAACTACATTGTCACCGTCGGCATCGGCCGCCAGAACGTAACCCTCATCGTCGACACCGGCAGCGATCTCACGTGGGTCCAGTGCCGCCCTTGCCGTCTCTGTTACAGCCAACAAGAACCCCTTTTCGATCCCTCAAATTCCTCTTCATTCCTCTCTCTTCCTTGCAATTCCACCACCTGTTCCGATCTTCAACCCGCAACCGGAAGTTCCGATGTTTGTGGAAATGGGAATTCAAATACCTGTGCTTATGAGATTAACTATGGCGATGGGTCTTATTCCCGAGGAGACCTTGGATTTGAGACGCTGAATTTGGGGAAAACATCGATTGAGAAGTTTGTATTTGGGTGTGGTCGGAATAACAAGGGGTTGTTTGGCGGAACTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTCGTTTCTCAAACTTCCTCTGTTTTTGGTGGGATTTTTTCTTACTGTTTGCCTACCGCTGGACTTGGATCTTCAGGTTCTTTAACAATGGGGGGTGGGGATTTTTCGAATTTCAGAAACGTTTCACCAATTTCTTACACGAGAATGGTTCCAAATCCTCAGATGTCGAATTTTTACATTCTGAATTTGACTGGAATTAGCGTTGGTGGGGTGAAATTGGATGTGCCGCGTTTGGCTTCGAATAATGGGGTTTTGAGTTTAATCGATTCTGGGACTGTGATTACCAGGTTGGCTCCATCGATTTACAGAGCTTTGAAAGTGGAATTCGAGAAGCAATTTTCTGGGTTCCAAACAGCGCCTGGGTTTTCGATTTTGAACACTTGTTTTAATCTTACTGGGTTGAAAGAAGTCAATATTCCGACTCTGAAATTTTATTTTGAAGGTGATGCAGAGCTGACTGTGGATGTTGAAGGGATTTTTTACTTTGTCAAAACTGATGATTCTCAGATCTGTTTGGCCTTTGCGAGTTTGGCTTCTGAAGATCAGATTGGGATTATTGGGAATTATCAGCAGAAGAATCAGAGGGTTATTTATAATTCCAAGGAATCGAAGGTGGGTTTTGCAGCAGAGCCTTGTAGTTTCTAG

Protein sequence

MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKLLLNLQKLPWKQLEKAATRCIFQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETLNLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKESKVGFAAEPCSF
Homology
BLAST of Tan0022026 vs. ExPASy Swiss-Prot
Match: Q8S9J6 (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 4.7e-80
Identity = 171/400 (42.75%), Postives = 235/400 (58.75%), Query Frame = 0

Query: 94  LFLDAIHVESLQSRFKSAIFPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIG--RQNVTL 153
           L LD   V S+ S+    +      +   + +P   G+ L + NYIVTVG+G  + +++L
Sbjct: 88  LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 147

Query: 154 IVDTGSDLTWVQCRPC-RLCYSQQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCG 213
           I DTGSDLTW QC+PC R CY Q+EP+F+PS S+S+ ++ C+S  C  L  ATG++  C 
Sbjct: 148 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 207

Query: 214 NGNSNTCAYEINYGDGSYSRGDLGFETLNLGKTSI-EKFVFGCGRNNKGLFGGTSGLMGL 273
             N   C Y I YGD S+S G L  E   L  + + +   FGCG NN+GLF G +GL+GL
Sbjct: 208 ASN---CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGL 267

Query: 274 ARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQ 333
            R +LS  SQT++ +  IFSYCLP++    +G LT G    S     +PIS         
Sbjct: 268 GRDKLSFPSQTATAYNKIFSYCLPSSA-SYTGHLTFGSAGISRSVKFTPISTI-----TD 327

Query: 334 MSNFYILNLTGISVGGVKLDVP-RLASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQF 393
            ++FY LN+  I+VGG KL +P  + S  G  +LIDSGTVITRL P  Y AL+  F+ + 
Sbjct: 328 GTSFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKM 387

Query: 394 SGFQTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLA 453
           S + T  G SIL+TCF+L+G K V IP + F F G A + +  +GIFY  K   SQ+CLA
Sbjct: 388 SKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKI--SQVCLA 447

Query: 454 FASLASEDQIGIIGNYQQKNQRVIYNSKESKVGFAAEPCS 489
           FA  + +    I GN QQ+   V+Y+    +VGFA   CS
Sbjct: 448 FAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of Tan0022026 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 5.4e-68
Identity = 154/396 (38.89%), Postives = 225/396 (56.82%), Query Frame = 0

Query: 97  DAIHVESLQSRFKSAIFPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIG--RQNVTLIVD 156
           D   VES+ S+  S     +  +   +++P   G  L + NYIVT+GIG  + +++L+ D
Sbjct: 92  DQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFD 151

Query: 157 TGSDLTWVQCRPC-RLCYSQQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGN 216
           TGSDLTW QC PC   CYSQ+EP F+PS+SS++ ++ C+S  C D +  + S+       
Sbjct: 152 TGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASN------- 211

Query: 217 SNTCAYEINYGDGSYSRGDLGFETLNLGKTSI-EKFVFGCGRNNKGLFGGTSGLMGLARS 276
              C Y I YGD S+++G L  E   L  + + E   FGCG NN+GLF G +GL+GL   
Sbjct: 212 ---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPG 271

Query: 277 ELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSN 336
           +LSL +QT++ +  IFSYCLP+    S+G LT G    S     +PIS       P   N
Sbjct: 272 KLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPIS-----SFPSAFN 331

Query: 337 FYILNLTGISVGGVKLDV-PRLASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGF 396
            Y +++ GISVG  +L + P   S  G  ++IDSGTV TRL   +Y  L+  F+++ S +
Sbjct: 332 -YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLPTKVYAELRSVFKEKMSSY 391

Query: 397 QTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFAS 456
           ++  G+ + +TC++ TGL  V  PT+ F F G   + +D  GI   +K   SQ+CLAFA 
Sbjct: 392 KSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKI--SQVCLAFA- 451

Query: 457 LASEDQIGIIGNYQQKNQRVIYNSKESKVGFAAEPC 488
             ++D   I GN QQ    V+Y+    +VGFA   C
Sbjct: 452 -GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Tan0022026 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.9e-65
Identity = 147/433 (33.95%), Postives = 223/433 (51.50%), Query Frame = 0

Query: 71  LEMKERD-YCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPG--QAHQISDSQIPL 130
           L +  RD + S   R+    L  R+  D   V ++  R    + P     ++++D    +
Sbjct: 61  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDI 120

Query: 131 SPGTRLQTLNYIVTVGIGR--QNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLFDPSNSSS 190
             G    +  Y V +G+G   ++  +++D+GSD+ WVQC+PC+LCY Q +P+FDP+ S S
Sbjct: 121 VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGS 180

Query: 191 FLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETLNLGKTSI 250
           +  + C S+ C  ++     +  C +G    C YE+ YGDGSY++G L  ETL   KT +
Sbjct: 181 YTGVSCGSSVCDRIE-----NSGCHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTVV 240

Query: 251 EKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTM 310
                GCG  N+G+F G +GL+G+    +S V Q S   GG F YCL + G  S+GSL  
Sbjct: 241 RNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVF 300

Query: 311 GGGDFSNFRNVSPI--SYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNGVLSL- 370
           G       R   P+  S+  +V NP+  +FY + L G+ VGGV++ +P     +GV  L 
Sbjct: 301 G-------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLP-----DGVFDLT 360

Query: 371 --------IDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNI 430
                   +D+GT +TRL  + Y A +  F+ Q +    A G SI +TC++L+G   V +
Sbjct: 361 ETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRV 420

Query: 431 PTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYN 488
           PT+ FYF     LT+     F     D    C AFA  AS   + IIGN QQ+  +V ++
Sbjct: 421 PTVSFYFTEGPVLTLPARN-FLMPVDDSGTYCFAFA--ASPTGLSIIGNIQQEGIQVSFD 470

BLAST of Tan0022026 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 232.6 bits (592), Expect = 9.2e-60
Identity = 135/363 (37.19%), Postives = 198/363 (54.55%), Query Frame = 0

Query: 138 YIVTVGIG--RQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLFDPSNSSSFLSLPCNSTT 197
           Y   +G+G   + V +++DTGSD+ W+QC PCR CYSQ +P+FDP  S ++ ++PC+S  
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPH 201

Query: 198 CSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETLNLGKTSIEKFVFGCGRN 257
           C  L  A      C N    TC Y+++YGDGS++ GD   ETL   +  ++    GCG +
Sbjct: 202 CRRLDSAG-----C-NTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHD 261

Query: 258 NKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTMGGGDFSNFRN 317
           N+GLF G +GL+GL + +LS   QT   F   FSYCL      S  S  +    F N   
Sbjct: 262 NEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVV----FGNAAV 321

Query: 318 VSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLAS---------NNGVLSLIDSG 377
                +T ++ NP++  FY + L GISVGG +  VP + +         N GV  +IDSG
Sbjct: 322 SRIARFTPLLSNPKLDTFYYVGLLGISVGGTR--VPGVTASLFKLDQIGNGGV--IIDSG 381

Query: 378 TVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGDAE 437
           T +TRL    Y A++  F       + AP FS+ +TCF+L+ + EV +PT+  +F G   
Sbjct: 382 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG--- 441

Query: 438 LTVDVEGIFYFVKTD-DSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKESKVGFAAE 489
             V +    Y +  D + + C AFA   +   + IIGN QQ+  RV+Y+   S+VGFA  
Sbjct: 442 ADVSLPATNYLIPVDTNGKFCFAFA--GTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPG 485

BLAST of Tan0022026 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 2.4e-55
Identity = 140/445 (31.46%), Postives = 229/445 (51.46%), Query Frame = 0

Query: 71  LEMKERD-YCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAI------------FPGQA 130
           LE+  RD + + + +D+     +RL  D+  V  + ++ + A+                 
Sbjct: 82  LELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTR 141

Query: 131 HQISDSQIPLSPGTRLQTLNYIVTVGIG--RQNVTLIVDTGSDLTWVQCRPCRLCYSQQE 190
           +Q  D   P+  G    +  Y   +G+G   + + L++DTGSD+ W+QC PC  CY Q +
Sbjct: 142 YQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSD 201

Query: 191 PLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGF 250
           P+F+P++SS++ SL C++  CS L+     +  C    SN C Y+++YGDGS++ G+L  
Sbjct: 202 PVFNPTSSSTYKSLTCSAPQCSLLE-----TSAC---RSNKCLYQVSYGDGSFTVGELAT 261

Query: 251 ETLNLGKT-SIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPT 310
           +T+  G +  I     GCG +N+GLF G +GL+GL    LS+ +Q  +     FSYCL  
Sbjct: 262 DTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVD 321

Query: 311 AGLGSSGSL-----TMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLD 370
              G S SL      +GGGD          +   ++ N ++  FY + L+G SVGG K+ 
Sbjct: 322 RDSGKSSSLDFNSVQLGGGD----------ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV 381

Query: 371 VPRL-----ASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQT-APGFSILNTC 430
           +P       AS +G + ++D GT +TRL    Y +L+  F K     +  +   S+ +TC
Sbjct: 382 LPDAIFDVDASGSGGV-ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTC 441

Query: 431 FNLTGLKEVNIPTLKFYFEGDAELTVDVEGIFYFVKTDDS-QICLAFASLASEDQIGIIG 488
           ++ + L  V +PT+ F+F G   L  D+    Y +  DDS   C AFA  +S   + IIG
Sbjct: 442 YDFSSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPTSS--SLSIIG 500

BLAST of Tan0022026 vs. NCBI nr
Match: KAG7014194.1 (Aspartyl protease family protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 766.1 bits (1977), Expect = 1.8e-217
Identity = 391/492 (79.47%), Postives = 419/492 (85.16%), Query Frame = 0

Query: 1   MEIAKH-SHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRC 60
           MEI+K    FLLLL  LLLF VD   S  D INGD EKL  LL+LQKLPWKQ E+A   C
Sbjct: 1   MEISKSLCFFLLLLLLLLLFFVDQARS--DAINGDSEKLHRLLHLQKLPWKQQEEAVINC 60

Query: 61  IFQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQA 120
           IFQKPRV +G T LEMKERDYCSGKV DW KNLQNRL  DAIHV+SLQSR KSAIF G  
Sbjct: 61  IFQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDT 120

Query: 121 HQISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPL 180
           HQISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPL
Sbjct: 121 HQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPL 180

Query: 181 FDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFET 240
           FDPSNSSSFLSL CNS TC  L PATG+S +CG GNS++C YEINYGDGSYSRG+LGFE 
Sbjct: 181 FDPSNSSSFLSLSCNSPTCLALPPATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFER 240

Query: 241 LNLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGL 300
           LNLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G 
Sbjct: 241 LNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGA 300

Query: 301 GSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNN 360
           G+SGSLTMGGGDFSNFRNVSPISYTRMV NPQM NFY LNLTGI++GGV L V    SNN
Sbjct: 301 GASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGV----SNN 360

Query: 361 GVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTL 420
           G LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+
Sbjct: 361 GALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTV 420

Query: 421 KFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKE 480
           KFYFEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKE
Sbjct: 421 KFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKE 480

Query: 481 SKVGFAAEPCSF 490
           S VGFAAEPC F
Sbjct: 481 STVGFAAEPCGF 486

BLAST of Tan0022026 vs. NCBI nr
Match: KAG6575642.1 (Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 765.4 bits (1975), Expect = 3.0e-217
Identity = 386/491 (78.62%), Postives = 416/491 (84.73%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRCI 60
           MEI+K   F LLL  LLL +     +  D INGD EKL  LL+LQKLPWKQ E+A   CI
Sbjct: 1   MEISKSLCFFLLLLLLLLLLFFVDQARSDAINGDSEKLHRLLHLQKLPWKQQEEAVINCI 60

Query: 61  FQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAH 120
           FQKPRV +G T LEMKERDYCSGKV DW KNLQNRL  DAIHV+SLQSR KSAIF G  H
Sbjct: 61  FQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTH 120

Query: 121 QISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLF 180
           QISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPLF
Sbjct: 121 QISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLF 180

Query: 181 DPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETL 240
           DPSNSSSFLSL CNS TC  L PATG+S +CG GNS++C YEINYGDGSYSRG+LGFE L
Sbjct: 181 DPSNSSSFLSLSCNSPTCLALPPATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERL 240

Query: 241 NLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLG 300
           NLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G G
Sbjct: 241 NLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAG 300

Query: 301 SSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNG 360
           +SGSLTMGGGDFSNFRNVSPISYTRMV NPQM NFY LNLTGI++GGV L V    SNNG
Sbjct: 301 ASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGV----SNNG 360

Query: 361 VLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLK 420
            LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+K
Sbjct: 361 ALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVK 420

Query: 421 FYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKES 480
           FYFEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKES
Sbjct: 421 FYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKES 480

Query: 481 KVGFAAEPCSF 490
            VGFAAEPC F
Sbjct: 481 TVGFAAEPCGF 487

BLAST of Tan0022026 vs. NCBI nr
Match: XP_022991886.1 (aspartyl protease family protein At5g10770 [Cucurbita maxima])

HSP 1 Score: 765.0 bits (1974), Expect = 3.9e-217
Identity = 386/491 (78.62%), Postives = 417/491 (84.93%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRCI 60
           MEI+K   F LLL  LLLF VD   S  D INGD EKL  LL+LQKLPWKQ E+A   CI
Sbjct: 1   MEISKSLCFFLLLLLLLLFFVDEARS--DAINGDSEKLHRLLHLQKLPWKQQEEAVVNCI 60

Query: 61  FQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAH 120
           FQKPRV +G T LEMKERDYCSGKV DW  NLQNRL  DAI ++SLQSR KSAIF G  H
Sbjct: 61  FQKPRVREGITTLEMKERDYCSGKVTDWQNNLQNRLIFDAIRLQSLQSRIKSAIFSGDTH 120

Query: 121 QISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLF 180
           QISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPLF
Sbjct: 121 QISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLF 180

Query: 181 DPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETL 240
           DPSNSSSFLSL CNS TC  L PATG+S +CGNGNS++C YEINYGDGSYSRG+LGFE L
Sbjct: 181 DPSNSSSFLSLSCNSPTCLALPPATGNSGLCGNGNSSSCGYEINYGDGSYSRGELGFERL 240

Query: 241 NLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLG 300
           NLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G G
Sbjct: 241 NLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAG 300

Query: 301 SSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNG 360
           +SGSLTMGGGDFSN+RNVSPISYTRMV NPQM NFY LNLTGI++GGV L VP    NNG
Sbjct: 301 ASGSLTMGGGDFSNYRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVP----NNG 360

Query: 361 VLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLK 420
            LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+K
Sbjct: 361 ALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVK 420

Query: 421 FYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKES 480
           F+FEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKES
Sbjct: 421 FFFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKES 480

Query: 481 KVGFAAEPCSF 490
            VGFAAEPC F
Sbjct: 481 TVGFAAEPCGF 485

BLAST of Tan0022026 vs. NCBI nr
Match: XP_022953875.1 (aspartyl protease family protein At5g10770 [Cucurbita moschata])

HSP 1 Score: 762.3 bits (1967), Expect = 2.6e-216
Identity = 388/491 (79.02%), Postives = 418/491 (85.13%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRCI 60
           MEI+K   F LLL  LLLF VD   S  D INGD EKL  LL+LQK PWKQ E+A   CI
Sbjct: 1   MEISKSLCFFLLL--LLLFFVDQARS--DAINGDSEKLHRLLHLQKRPWKQQEEAVINCI 60

Query: 61  FQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAH 120
           FQKPRV +G T LEMKE+DYCSG+V DW KNLQNRL  DAIHV+SLQSR KSAIF G  H
Sbjct: 61  FQKPRVREGITTLEMKEKDYCSGEVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTH 120

Query: 121 QISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLF 180
           QISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPLF
Sbjct: 121 QISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLF 180

Query: 181 DPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETL 240
           DPSNSSSFLSL CNS TC  L PATG+S +CGNGNS++C YEINYGDGSYSRG+LGFE L
Sbjct: 181 DPSNSSSFLSLSCNSPTCLALPPATGNSGLCGNGNSSSCGYEINYGDGSYSRGELGFERL 240

Query: 241 NLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLG 300
           NLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G G
Sbjct: 241 NLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAG 300

Query: 301 SSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNG 360
           +SGSLTMGGGDFSNFRNVSPISYTRMV NPQM NFY LNLTGI++GGV L V    SNNG
Sbjct: 301 ASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGV----SNNG 360

Query: 361 VLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLK 420
            LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+K
Sbjct: 361 ALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVK 420

Query: 421 FYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKES 480
           FYFEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKES
Sbjct: 421 FYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKES 480

Query: 481 KVGFAAEPCSF 490
            VGFAAEPC F
Sbjct: 481 TVGFAAEPCGF 483

BLAST of Tan0022026 vs. NCBI nr
Match: XP_023547381.1 (aspartyl protease family protein At5g10770 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 761.9 bits (1966), Expect = 3.3e-216
Identity = 386/491 (78.62%), Postives = 417/491 (84.93%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRCI 60
           MEI+K   F LLL  LL FV  A S++   INGD EKL  LL+LQKLPWKQ E+A   CI
Sbjct: 1   MEISKSLCFFLLLLLLLFFVDQARSAA---INGDSEKLHRLLHLQKLPWKQQEEAVINCI 60

Query: 61  FQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAH 120
           FQKPRV +G T LEMKERDYCSGKV DW KNLQNRL  DAIHV+SLQSR KSAIF G  H
Sbjct: 61  FQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTH 120

Query: 121 QISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLF 180
           QISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPLF
Sbjct: 121 QISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLF 180

Query: 181 DPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETL 240
           DPSNSSSFLSL CNS TC  L  ATG+S +CG GNS++C YEINYGDGSYSRG+LGFE L
Sbjct: 181 DPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERL 240

Query: 241 NLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLG 300
           NLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G G
Sbjct: 241 NLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAG 300

Query: 301 SSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNG 360
           +SGSLTMGGGDFSNFRNVSPISYTRMV NPQM NFY LNLTGI++GGV L VP    NNG
Sbjct: 301 ASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVP----NNG 360

Query: 361 VLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLK 420
            LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+K
Sbjct: 361 ALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVK 420

Query: 421 FYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKES 480
           FYFEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKES
Sbjct: 421 FYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKES 480

Query: 481 KVGFAAEPCSF 490
            +GFAAEPC F
Sbjct: 481 TLGFAAEPCGF 484

BLAST of Tan0022026 vs. ExPASy TrEMBL
Match: A0A6J1JS16 (aspartyl protease family protein At5g10770 OS=Cucurbita maxima OX=3661 GN=LOC111488391 PE=3 SV=1)

HSP 1 Score: 765.0 bits (1974), Expect = 1.9e-217
Identity = 386/491 (78.62%), Postives = 417/491 (84.93%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRCI 60
           MEI+K   F LLL  LLLF VD   S  D INGD EKL  LL+LQKLPWKQ E+A   CI
Sbjct: 1   MEISKSLCFFLLLLLLLLFFVDEARS--DAINGDSEKLHRLLHLQKLPWKQQEEAVVNCI 60

Query: 61  FQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAH 120
           FQKPRV +G T LEMKERDYCSGKV DW  NLQNRL  DAI ++SLQSR KSAIF G  H
Sbjct: 61  FQKPRVREGITTLEMKERDYCSGKVTDWQNNLQNRLIFDAIRLQSLQSRIKSAIFSGDTH 120

Query: 121 QISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLF 180
           QISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPLF
Sbjct: 121 QISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLF 180

Query: 181 DPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETL 240
           DPSNSSSFLSL CNS TC  L PATG+S +CGNGNS++C YEINYGDGSYSRG+LGFE L
Sbjct: 181 DPSNSSSFLSLSCNSPTCLALPPATGNSGLCGNGNSSSCGYEINYGDGSYSRGELGFERL 240

Query: 241 NLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLG 300
           NLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G G
Sbjct: 241 NLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAG 300

Query: 301 SSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNG 360
           +SGSLTMGGGDFSN+RNVSPISYTRMV NPQM NFY LNLTGI++GGV L VP    NNG
Sbjct: 301 ASGSLTMGGGDFSNYRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVP----NNG 360

Query: 361 VLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLK 420
            LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+K
Sbjct: 361 ALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVK 420

Query: 421 FYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKES 480
           F+FEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKES
Sbjct: 421 FFFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKES 480

Query: 481 KVGFAAEPCSF 490
            VGFAAEPC F
Sbjct: 481 TVGFAAEPCGF 485

BLAST of Tan0022026 vs. ExPASy TrEMBL
Match: A0A6J1GQV9 (aspartyl protease family protein At5g10770 OS=Cucurbita moschata OX=3662 GN=LOC111456281 PE=3 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 1.2e-216
Identity = 388/491 (79.02%), Postives = 418/491 (85.13%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLLFVVDAHSSSVDVINGDHEKL--LLNLQKLPWKQLEKAATRCI 60
           MEI+K   F LLL  LLLF VD   S  D INGD EKL  LL+LQK PWKQ E+A   CI
Sbjct: 1   MEISKSLCFFLLL--LLLFFVDQARS--DAINGDSEKLHRLLHLQKRPWKQQEEAVINCI 60

Query: 61  FQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAH 120
           FQKPRV +G T LEMKE+DYCSG+V DW KNLQNRL  DAIHV+SLQSR KSAIF G  H
Sbjct: 61  FQKPRVREGITTLEMKEKDYCSGEVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTH 120

Query: 121 QISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLF 180
           QISDSQIPLS GTRLQTLNYIVTV +G ++ TLIVDTGSDLTWVQCRPCRLCY+QQEPLF
Sbjct: 121 QISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLF 180

Query: 181 DPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETL 240
           DPSNSSSFLSL CNS TC  L PATG+S +CGNGNS++C YEINYGDGSYSRG+LGFE L
Sbjct: 181 DPSNSSSFLSLSCNSPTCLALPPATGNSGLCGNGNSSSCGYEINYGDGSYSRGELGFERL 240

Query: 241 NLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLG 300
           NLG+  I+ F+FGCGRNNKGLFGG SGLMGL RS+LSLVSQTSSVF GIFSYCLP+ G G
Sbjct: 241 NLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAG 300

Query: 301 SSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNG 360
           +SGSLTMGGGDFSNFRNVSPISYTRMV NPQM NFY LNLTGI++GGV L V    SNNG
Sbjct: 301 ASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGV----SNNG 360

Query: 361 VLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLK 420
            LSLIDSGTVITRL PSIYRA K EFEKQFSGFQTAPGFSILNTCFNLTG KEVNIPT+K
Sbjct: 361 ALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVK 420

Query: 421 FYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKES 480
           FYFEG+AE+TVDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YNSKES
Sbjct: 421 FYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKES 480

Query: 481 KVGFAAEPCSF 490
            VGFAAEPC F
Sbjct: 481 TVGFAAEPCGF 483

BLAST of Tan0022026 vs. ExPASy TrEMBL
Match: A0A0A0K8J2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G431320 PE=3 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 5.7e-214
Identity = 377/497 (75.86%), Postives = 421/497 (84.71%), Query Frame = 0

Query: 1   MEIAKHSHF------LLLLPFLLLFVVDAHSSSVDVINGD-HEKLLLNL-QKLPWKQLEK 60
           MEI+K  HF      LLLLP LL   VDA SSS ++ NGD HEK LL L Q  PWK+  +
Sbjct: 1   MEISKSLHFPLSLLLLLLLP-LLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60

Query: 61  AATRCIFQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAI 120
           A   CIFQKP++ KG T LEMK+RDYCSGK+ DW+K  QNR+ LDAI+V SL S FKSAI
Sbjct: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120

Query: 121 FPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYS 180
           FPGQ HQ+SDSQIP+S G RLQTLNYIVTVGIG QN TLIVDTGSDLTWVQC PCRLCY+
Sbjct: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180

Query: 181 QQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGD 240
           QQEPLF+PSNSSSFLSLPCNS TC  LQP  GSS +C N NS +C Y+I+YGDGSYSRG+
Sbjct: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240

Query: 241 LGFETLNLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCL 300
           LGFE L LGKT I+ F+FGCGRNNKGLFGG SGLMGLARSELSLVSQTSS+FG +FSYCL
Sbjct: 241 LGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300

Query: 301 PTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPR 360
           PT G+GSSGSLT+GG DFSNF+N+SPISYTRM+ NPQMSNFY LNLTGIS+GGV L+VPR
Sbjct: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360

Query: 361 LASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEV 420
           L+SN GVLSL+DSGTVITRL+PSIY+A K EFEKQFSG++T PGFSILNTCFNLTG +EV
Sbjct: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420

Query: 421 NIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVI 480
           NIPT+KF FEG+AE+ VDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRVI
Sbjct: 421 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480

Query: 481 YNSKESKVGFAAEPCSF 490
           YNSKESKVGFA EPCSF
Sbjct: 481 YNSKESKVGFAGEPCSF 496

BLAST of Tan0022026 vs. ExPASy TrEMBL
Match: A0A5A7UYY6 (Aspartyl protease family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold339G001610 PE=3 SV=1)

HSP 1 Score: 742.7 bits (1916), Expect = 1.0e-210
Identity = 371/495 (74.95%), Postives = 418/495 (84.44%), Query Frame = 0

Query: 1   MEIAKHSHFLLLLPFLLL----FVVDAHSSSVDVING-DHEKLLLNL-QKLPWKQLEKAA 60
           ME++K  HF L L FLLL     +VDA SSS  V NG +HEK LL L Q  PWK+  +A 
Sbjct: 3   MEVSKSLHFPLSLLFLLLPLLSIIVDARSSSFGVGNGSNHEKGLLQLFQNFPWKEHGEAV 62

Query: 61  TRCIFQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFP 120
             CIFQKP++ KG T LEMK+RDYCSGK+ D +K  QNR+ LDAI+V SL S  KSAIFP
Sbjct: 63  VNCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSAIFP 122

Query: 121 GQAHQISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQ 180
           GQ HQ+SDSQIP+S G RLQTLNYIVTVGIG QN TLIVDTGSDLTWVQC PCRLCY+QQ
Sbjct: 123 GQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQ 182

Query: 181 EPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLG 240
           EPLF+PSNSSSFLSLPC+S TC  LQP  GSS +C N NS +C Y+I+YGDGSYSRG+LG
Sbjct: 183 EPLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELG 242

Query: 241 FETLNLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPT 300
           +E L LGKT I+ F+FGCGRNNKGLFGG SGLMGLARSELSLVSQTSSVFG IFSYCLPT
Sbjct: 243 YEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYCLPT 302

Query: 301 AGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLA 360
            G+GSSGSLT+GG DFS+F+N+SPISYTRM+ NPQMSNFY LNLTGIS+GGV L+VPRL+
Sbjct: 303 TGVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS 362

Query: 361 SNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNI 420
           SN GVLSL+DSGTVITRL+PSIY+A K EFEKQFSG++T PGFSILNTCFNLTG +EVNI
Sbjct: 363 SNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNI 422

Query: 421 PTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYN 480
           PT+KF FEG+AE+ VDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV+YN
Sbjct: 423 PTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVVYN 482

Query: 481 SKESKVGFAAEPCSF 490
           SKESKVGFA EPCSF
Sbjct: 483 SKESKVGFAGEPCSF 497

BLAST of Tan0022026 vs. ExPASy TrEMBL
Match: A0A1S3CDQ0 (aspartyl protease family protein At5g10770 OS=Cucumis melo OX=3656 GN=LOC103499859 PE=3 SV=1)

HSP 1 Score: 734.9 bits (1896), Expect = 2.1e-208
Identity = 371/498 (74.50%), Postives = 417/498 (83.73%), Query Frame = 0

Query: 1   MEIAKHSH------FLLLLPFLLLFVVDAHSSSVDVINGD--HEKLLLNL-QKLPWKQLE 60
           ME++K  H      FLLLLP LL  +VDA SS   V NG   HEK LL L Q  PWK+  
Sbjct: 3   MEVSKSLHFPLSLLFLLLLP-LLFIIVDARSS---VGNGGNYHEKGLLQLFQNFPWKEHG 62

Query: 61  KAATRCIFQKPRVEKGTTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSA 120
           +A   CIFQKP++ KG T LEMK+RDYCSGK+ D +K  QNR+ LDAI+V SL S  KSA
Sbjct: 63  EAVVNCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSA 122

Query: 121 IFPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCY 180
           IFPGQ HQ+SDSQIP+S G RLQTLNYIVTVGIG QN TLIVDTGSDLTWVQC PCRLCY
Sbjct: 123 IFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCY 182

Query: 181 SQQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRG 240
           +QQEPLF+PSNSSSFLSLPC+S TC  LQP  GSS +C N NS +C Y+I+YGDGSYSRG
Sbjct: 183 NQQEPLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRG 242

Query: 241 DLGFETLNLGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYC 300
           +LG+E L LGKT I+ F+FGCGRNNKGLFGG SGLMGLARSELSLVSQTSSVFG IFSYC
Sbjct: 243 ELGYEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYC 302

Query: 301 LPTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVP 360
           LPT G+GSSGSLT+GG DFS+F+N+SPISYTRM+ NPQMSNFY LNLTGIS+GGV L+VP
Sbjct: 303 LPTTGVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP 362

Query: 361 RLASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKE 420
           RL+SN GVLSL+DSGTVITRL+PSIY+A K EFEKQFSG++T PGFSILNTCFNLTG +E
Sbjct: 363 RLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEE 422

Query: 421 VNIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRV 480
           VNIPT+KF FEG+AE+ VDVEG+FYFVK+D SQICLAFASL  EDQ  IIGNYQQKNQRV
Sbjct: 423 VNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV 482

Query: 481 IYNSKESKVGFAAEPCSF 490
           +YNSKESKVGFA EPCSF
Sbjct: 483 VYNSKESKVGFAGEPCSF 496

BLAST of Tan0022026 vs. TAIR 10
Match: AT1G79720.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 502.3 bits (1292), Expect = 4.4e-142
Identity = 262/483 (54.24%), Postives = 345/483 (71.43%), Query Frame = 0

Query: 10  LLLLPFLLLFVVDAHSSSVDVINGDHEKLLLNLQKLPW--KQLEKAATRCIFQKPRVEKG 69
           L L P LL+F+         V++G  EK +L++    W  K+  +A+T C  +     + 
Sbjct: 9   LSLAPLLLVFLFLLSC----VVHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGKGRE 68

Query: 70  TTILEMKERDYCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPGQAHQISDSQIPL 129
           +T LEMK R+ CSGK  D  K ++  L LD I V+SLQ + K+         +S++QIPL
Sbjct: 69  STTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPL 128

Query: 130 SPGTRLQTLNYIVTVGIGRQNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLFDPSNSSSFL 189
           + G +L++LNYIVTV +G +N++LIVDTGSDLTWVQC+PCR CY+QQ PL+DPS SSS+ 
Sbjct: 129 TSGIKLESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYK 188

Query: 190 SLPCNSTTCSDLQPATGSSDVCGNGN---SNTCAYEINYGDGSYSRGDLGFETLNLGKTS 249
           ++ CNS+TC DL  AT +S  CG  N      C Y ++YGDGSY+RGDL  E++ LG T 
Sbjct: 189 TVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK 248

Query: 250 IEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLT 309
           +E FVFGCGRNNKGLFGG+SGLMGL RS +SLVSQT   F G+FSYCLP+   G+SGSL+
Sbjct: 249 LENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLS 308

Query: 310 MGGGDFSNFRNVSPISYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNGVLSLID 369
             G D S + N + +SYT +V NPQ+ +FYILNLTG S+GGV+L     +S+ G   LID
Sbjct: 309 F-GNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK----SSSFGRGILID 368

Query: 370 SGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGD 429
           SGTVITRL PSIY+A+K+EF KQFSGF TAPG+SIL+TCFNLT  ++++IP +K  F+G+
Sbjct: 369 SGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGN 428

Query: 430 AELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYNSKESKVGFAA 488
           AEL VDV G+FYFVK D S +CLA ASL+ E+++GIIGNYQQKNQRVIY++ + ++G   
Sbjct: 429 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVG 482

BLAST of Tan0022026 vs. TAIR 10
Match: AT5G10770.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 300.1 bits (767), Expect = 3.3e-81
Identity = 171/400 (42.75%), Postives = 235/400 (58.75%), Query Frame = 0

Query: 94  LFLDAIHVESLQSRFKSAIFPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIG--RQNVTL 153
           L LD   V S+ S+    +      +   + +P   G+ L + NYIVTVG+G  + +++L
Sbjct: 88  LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 147

Query: 154 IVDTGSDLTWVQCRPC-RLCYSQQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCG 213
           I DTGSDLTW QC+PC R CY Q+EP+F+PS S+S+ ++ C+S  C  L  ATG++  C 
Sbjct: 148 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 207

Query: 214 NGNSNTCAYEINYGDGSYSRGDLGFETLNLGKTSI-EKFVFGCGRNNKGLFGGTSGLMGL 273
             N   C Y I YGD S+S G L  E   L  + + +   FGCG NN+GLF G +GL+GL
Sbjct: 208 ASN---CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGL 267

Query: 274 ARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQ 333
            R +LS  SQT++ +  IFSYCLP++    +G LT G    S     +PIS         
Sbjct: 268 GRDKLSFPSQTATAYNKIFSYCLPSSA-SYTGHLTFGSAGISRSVKFTPISTI-----TD 327

Query: 334 MSNFYILNLTGISVGGVKLDVP-RLASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQF 393
            ++FY LN+  I+VGG KL +P  + S  G  +LIDSGTVITRL P  Y AL+  F+ + 
Sbjct: 328 GTSFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKM 387

Query: 394 SGFQTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLA 453
           S + T  G SIL+TCF+L+G K V IP + F F G A + +  +GIFY  K   SQ+CLA
Sbjct: 388 SKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKI--SQVCLA 447

Query: 454 FASLASEDQIGIIGNYQQKNQRVIYNSKESKVGFAAEPCS 489
           FA  + +    I GN QQ+   V+Y+    +VGFA   CS
Sbjct: 448 FAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of Tan0022026 vs. TAIR 10
Match: AT5G10760.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 260.0 bits (663), Expect = 3.8e-69
Identity = 154/396 (38.89%), Postives = 225/396 (56.82%), Query Frame = 0

Query: 97  DAIHVESLQSRFKSAIFPGQAHQISDSQIPLSPGTRLQTLNYIVTVGIG--RQNVTLIVD 156
           D   VES+ S+  S     +  +   +++P   G  L + NYIVT+GIG  + +++L+ D
Sbjct: 92  DQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFD 151

Query: 157 TGSDLTWVQCRPC-RLCYSQQEPLFDPSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGN 216
           TGSDLTW QC PC   CYSQ+EP F+PS+SS++ ++ C+S  C D +  + S+       
Sbjct: 152 TGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASN------- 211

Query: 217 SNTCAYEINYGDGSYSRGDLGFETLNLGKTSI-EKFVFGCGRNNKGLFGGTSGLMGLARS 276
              C Y I YGD S+++G L  E   L  + + E   FGCG NN+GLF G +GL+GL   
Sbjct: 212 ---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPG 271

Query: 277 ELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTMGGGDFSNFRNVSPISYTRMVPNPQMSN 336
           +LSL +QT++ +  IFSYCLP+    S+G LT G    S     +PIS       P   N
Sbjct: 272 KLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPIS-----SFPSAFN 331

Query: 337 FYILNLTGISVGGVKLDV-PRLASNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGF 396
            Y +++ GISVG  +L + P   S  G  ++IDSGTV TRL   +Y  L+  F+++ S +
Sbjct: 332 -YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLPTKVYAELRSVFKEKMSSY 391

Query: 397 QTAPGFSILNTCFNLTGLKEVNIPTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFAS 456
           ++  G+ + +TC++ TGL  V  PT+ F F G   + +D  GI   +K   SQ+CLAFA 
Sbjct: 392 KSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKI--SQVCLAFA- 451

Query: 457 LASEDQIGIIGNYQQKNQRVIYNSKESKVGFAAEPC 488
             ++D   I GN QQ    V+Y+    +VGFA   C
Sbjct: 452 -GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Tan0022026 vs. TAIR 10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 251.5 bits (641), Expect = 1.4e-66
Identity = 147/433 (33.95%), Postives = 223/433 (51.50%), Query Frame = 0

Query: 71  LEMKERD-YCSGKVRDWDKNLQNRLFLDAIHVESLQSRFKSAIFPG--QAHQISDSQIPL 130
           L +  RD + S   R+    L  R+  D   V ++  R    + P     ++++D    +
Sbjct: 61  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDI 120

Query: 131 SPGTRLQTLNYIVTVGIGR--QNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLFDPSNSSS 190
             G    +  Y V +G+G   ++  +++D+GSD+ WVQC+PC+LCY Q +P+FDP+ S S
Sbjct: 121 VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGS 180

Query: 191 FLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETLNLGKTSI 250
           +  + C S+ C  ++     +  C +G    C YE+ YGDGSY++G L  ETL   KT +
Sbjct: 181 YTGVSCGSSVCDRIE-----NSGCHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTVV 240

Query: 251 EKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLGSSGSLTM 310
                GCG  N+G+F G +GL+G+    +S V Q S   GG F YCL + G  S+GSL  
Sbjct: 241 RNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVF 300

Query: 311 GGGDFSNFRNVSPI--SYTRMVPNPQMSNFYILNLTGISVGGVKLDVPRLASNNGVLSL- 370
           G       R   P+  S+  +V NP+  +FY + L G+ VGGV++ +P     +GV  L 
Sbjct: 301 G-------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLP-----DGVFDLT 360

Query: 371 --------IDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVNI 430
                   +D+GT +TRL  + Y A +  F+ Q +    A G SI +TC++L+G   V +
Sbjct: 361 ETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRV 420

Query: 431 PTLKFYFEGDAELTVDVEGIFYFVKTDDSQICLAFASLASEDQIGIIGNYQQKNQRVIYN 488
           PT+ FYF     LT+     F     D    C AFA  AS   + IIGN QQ+  +V ++
Sbjct: 421 PTVSFYFTEGPVLTLPARN-FLMPVDDSGTYCFAFA--ASPTGLSIIGNIQQEGIQVSFD 470

BLAST of Tan0022026 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 243.0 bits (619), Expect = 4.9e-64
Identity = 144/375 (38.40%), Postives = 212/375 (56.53%), Query Frame = 0

Query: 122 DSQIPLSPGTRLQTLNYIVTVGIGR--QNVTLIVDTGSDLTWVQCRPCRLCYSQQEPLFD 181
           D + PL  GT   +  Y   VGIG+  + V +++DTGSD+ W+QC PC  CY Q EP+F+
Sbjct: 132 DIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFE 191

Query: 182 PSNSSSFLSLPCNSTTCSDLQPATGSSDVCGNGNSNTCAYEINYGDGSYSRGDLGFETLN 241
           PS+SSS+  L C++  C+ L+ +      C N    TC YE++YGDGSY+ GD   ETL 
Sbjct: 192 PSSSSSYEPLSCDTPQCNALEVSE-----CRNA---TCLYEVSYGDGSYTVGDFATETLT 251

Query: 242 LGKTSIEKFVFGCGRNNKGLFGGTSGLMGLARSELSLVSQTSSVFGGIFSYCLPTAGLGS 301
           +G T ++    GCG +N+GLF G +GL+GL    L+L SQ ++     FSYCL      S
Sbjct: 252 IGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDS 311

Query: 302 SGSLTMGGGDFSNFRNVSPISYTR-MVPNPQMSNFYILNLTGISVGGVKLDVPRLA---- 361
           + ++  G        ++SP +    ++ N Q+  FY L LTGISVGG  L +P+ +    
Sbjct: 312 ASTVDFG-------TSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMD 371

Query: 362 -SNNGVLSLIDSGTVITRLAPSIYRALKVEFEKQFSGFQTAPGFSILNTCFNLTGLKEVN 421
            S +G + +IDSGT +TRL   IY +L+  F K     + A G ++ +TC+NL+    V 
Sbjct: 372 ESGSGGI-IIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVE 431

Query: 422 IPTLKFYFEGDAELTVDVEGIFYFVKTDD-SQICLAFASLASEDQIGIIGNYQQKNQRVI 481
           +PT+ F+F G   L +  +   Y +  D     CLAFA  AS   + IIGN QQ+  RV 
Sbjct: 432 VPTVAFHFPGGKMLALPAKN--YMIPVDSVGTFCLAFAPTAS--SLAIIGNVQQQGTRVT 483

Query: 482 YNSKESKVGFAAEPC 488
           ++   S +GF++  C
Sbjct: 492 FDLANSLIGFSSNKC 483

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8S9J64.7e-8042.75Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At... [more]
Q9LEW35.4e-6838.89Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1[more]
Q9LHE31.9e-6533.95Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q9LNJ39.2e-6037.19Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q9LS402.4e-5531.46Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
KAG7014194.11.8e-21779.47Aspartyl protease family protein [Cucurbita argyrosperma subsp. argyrosperma][more]
KAG6575642.13.0e-21778.62Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. sororia... [more]
XP_022991886.13.9e-21778.62aspartyl protease family protein At5g10770 [Cucurbita maxima][more]
XP_022953875.12.6e-21679.02aspartyl protease family protein At5g10770 [Cucurbita moschata][more]
XP_023547381.13.3e-21678.62aspartyl protease family protein At5g10770 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1JS161.9e-21778.62aspartyl protease family protein At5g10770 OS=Cucurbita maxima OX=3661 GN=LOC111... [more]
A0A6J1GQV91.2e-21679.02aspartyl protease family protein At5g10770 OS=Cucurbita moschata OX=3662 GN=LOC1... [more]
A0A0A0K8J25.7e-21475.86Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G43132... [more]
A0A5A7UYY61.0e-21074.95Aspartyl protease family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27... [more]
A0A1S3CDQ02.1e-20874.50aspartyl protease family protein At5g10770 OS=Cucumis melo OX=3656 GN=LOC1034998... [more]
Match NameE-valueIdentityDescription
AT1G79720.14.4e-14254.24Eukaryotic aspartyl protease family protein [more]
AT5G10770.13.3e-8142.75Eukaryotic aspartyl protease family protein [more]
AT5G10760.13.8e-6938.89Eukaryotic aspartyl protease family protein [more]
AT3G20015.11.4e-6633.95Eukaryotic aspartyl protease family protein [more]
AT1G25510.14.9e-6438.40Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 142..162
score: 42.68
coord: 361..372
score: 30.84
coord: 301..314
score: 32.44
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 16..488
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 310..489
e-value: 3.0E-42
score: 146.4
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 116..306
e-value: 2.4E-48
score: 166.7
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 132..487
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 138..307
e-value: 2.2E-49
score: 168.1
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 333..483
e-value: 2.9E-27
score: 95.4
NoneNo IPR availablePANTHERPTHR13683:SF827SUBFAMILY NOT NAMEDcoord: 16..488
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 151..162
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 138..483
score: 39.895504
IPR033873CND41-likeCDDcd05472cnd41_likecoord: 137..487
e-value: 4.217E-131
score: 380.078

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0022026.1Tan0022026.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity