Tan0003138 (gene) Snake gourd v1

Overview
NameTan0003138
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionaspartyl protease family protein 2
LocationLG06: 35073820 .. 35075600 (-)
RNA-Seq ExpressionTan0003138
SyntenyTan0003138
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCATTCCCATTTCCCACTCCTAAAAGTACCGCATTGTCCGCTCAATGCGCTCATCATCTCTTTGTCTCTTTTTTCTTTTTCATTATAATTAACTCTCTTCTCTCCCAAAAACCACAGTCAATCTCTCTCTCTCTCTCTCTCAATGGAATCACCACCAACAAATGTCCTATTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTTTTCTTTCGCCGCCGCATCGGAGTTCCAAATCCTAACTCTCCGCCCTCTTCCGAATCCCTCTCGCCCTCCCTTACTAGAACCCCAGTTCCACTCCGAGGAAGAAGCCCTCAAATCCACCGCCGGCGTCACCCTTGAGCTCCATCATTTGGACTCACTCTCCCTCAACAAAACCCCCACCGATCTCTTCAACCTCCGGCTCCACCGTGACGCCCTCCGCGTCCAGTCTCTGACCTCTCTGGCCGGCGCAAGGAGCCGGAACCCACTCCCACGCGCCGGTTTCAGCAGCTCCGTCATCTCCGGCCTCGCCCAAGGCAGCGGCGAGTACTTCACACGCCTCGGCGTTGGAACCCCTCCTAGATACCTCTACATGGTCCTCGACACTGGAAGCGACATTGTTTGGCTCCAATGCTCCCCTTGCCGCAAATGCTACTCCCAATCCGATCCCATTTTCAACCCCTTTAAATCCAAATCCTTCGCCGGAATCCCCTGCTCTTCCCCTCTCTGCCGCCGTCTCGACTCCTCCGGCTGCTCCACCCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGACGGCTCCTTCACCACCGGCGACTTCGCCACCGAAACCCTCACCTTTCGTGGCAATCAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATCAAGGCCTCTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGGCCGTGGCCGCTTGTCTTTCCCTTCCCAAACCGGAATCCGGTTCAACCACAAATTCTCTTATTGTTTGGTGGACCGGTCCGCTTCCTCCAAACCCTCCTCCATGGTTTTCGGCGATGCGGCGATTTCCCGGCTCGCCCGGTTCACTCCTCTGATTCGGAACCCCAAATTGGAAACGTTTTATTACGTCGAACTCATCGGAATCAGCGTCGGCGGAGTCCGAGTCCGCGGCGTCTCCGCCGCTCTCTTCAAGCTCGATCCGGCCGGCAACGGCGGCGTCATCATCGACTCGGGTACCTCGGTAACCCGATTGACCCGACCCGCTTACACGGCTCTTCGCGACGCGTTCCGGGCCGGAGCGGCCCATTTGAAAAAGGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACGACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGACGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCCGGCGACGAATTATTTGATTCCGGTGGACGACAGTGGGAGCTTTTGCTTTGCGTTTGCGGGTACTATGTCCGGGTTGTCGATTATTGGGAACATTCAACAGCAGGGGTTCCGGGTTGTGTACGACTTGGCGGGTTCTCGGATCGGGTTTGCTCCACGTGGGTGCACGTGATCTCTGACCCAGTGATGGGATTTCTGTTTGTTTAGGGACAAAGAAGGAATAACAAAGAAATGGAAAATTAAAAGAAAAATAGGATTTTTCGTGATGACATTGTCTTCTAGTTCTATTTAAGGTTTGTTTTTTGGTGTATTGTATTTATTATCTATTGAAAGTCATTAAAGCCTCTAACTTGGAGGTGATTTGGTTTGTTTCAAAGGTAAAGATTTGGGC

mRNA sequence

CCCATTCCCATTTCCCACTCCTAAAAGTACCGCATTGTCCGCTCAATGCGCTCATCATCTCTTTGTCTCTTTTTTCTTTTTCATTATAATTAACTCTCTTCTCTCCCAAAAACCACAGTCAATCTCTCTCTCTCTCTCTCTCAATGGAATCACCACCAACAAATGTCCTATTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTTTTCTTTCGCCGCCGCATCGGAGTTCCAAATCCTAACTCTCCGCCCTCTTCCGAATCCCTCTCGCCCTCCCTTACTAGAACCCCAGTTCCACTCCGAGGAAGAAGCCCTCAAATCCACCGCCGGCGTCACCCTTGAGCTCCATCATTTGGACTCACTCTCCCTCAACAAAACCCCCACCGATCTCTTCAACCTCCGGCTCCACCGTGACGCCCTCCGCGTCCAGTCTCTGACCTCTCTGGCCGGCGCAAGGAGCCGGAACCCACTCCCACGCGCCGGTTTCAGCAGCTCCGTCATCTCCGGCCTCGCCCAAGGCAGCGGCGAGTACTTCACACGCCTCGGCGTTGGAACCCCTCCTAGATACCTCTACATGGTCCTCGACACTGGAAGCGACATTGTTTGGCTCCAATGCTCCCCTTGCCGCAAATGCTACTCCCAATCCGATCCCATTTTCAACCCCTTTAAATCCAAATCCTTCGCCGGAATCCCCTGCTCTTCCCCTCTCTGCCGCCGTCTCGACTCCTCCGGCTGCTCCACCCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGACGGCTCCTTCACCACCGGCGACTTCGCCACCGAAACCCTCACCTTTCGTGGCAATCAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATCAAGGCCTCTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGGCCGTGGCCGCTTGTCTTTCCCTTCCCAAACCGGAATCCGGTTCAACCACAAATTCTCTTATTGTTTGGTGGACCGGTCCGCTTCCTCCAAACCCTCCTCCATGGTTTTCGGCGATGCGGCGATTTCCCGGCTCGCCCGGTTCACTCCTCTGATTCGGAACCCCAAATTGGAAACGTTTTATTACGTCGAACTCATCGGAATCAGCGTCGGCGGAGTCCGAGTCCGCGGCGTCTCCGCCGCTCTCTTCAAGCTCGATCCGGCCGGCAACGGCGGCGTCATCATCGACTCGGGTACCTCGGTAACCCGATTGACCCGACCCGCTTACACGGCTCTTCGCGACGCGTTCCGGGCCGGAGCGGCCCATTTGAAAAAGGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACGACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGACGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCCGGCGACGAATTATTTGATTCCGGTGGACGACAGTGGGAGCTTTTGCTTTGCGTTTGCGGGTACTATGTCCGGGTTGTCGATTATTGGGAACATTCAACAGCAGGGGTTCCGGGTTGTGTACGACTTGGCGGGTTCTCGGATCGGGTTTGCTCCACGTGGGTGCACGTGATCTCTGACCCAGTGATGGGATTTCTGTTTGTTTAGGGACAAAGAAGGAATAACAAAGAAATGGAAAATTAAAAGAAAAATAGGATTTTTCGTGATGACATTGTCTTCTAGTTCTATTTAAGGTTTGTTTTTTGGTGTATTGTATTTATTATCTATTGAAAGTCATTAAAGCCTCTAACTTGGAGGTGATTTGGTTTGTTTCAAAGGTAAAGATTTGGGC

Coding sequence (CDS)

ATGGAATCACCACCAACAAATGTCCTATTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTTTTCTTTCGCCGCCGCATCGGAGTTCCAAATCCTAACTCTCCGCCCTCTTCCGAATCCCTCTCGCCCTCCCTTACTAGAACCCCAGTTCCACTCCGAGGAAGAAGCCCTCAAATCCACCGCCGGCGTCACCCTTGAGCTCCATCATTTGGACTCACTCTCCCTCAACAAAACCCCCACCGATCTCTTCAACCTCCGGCTCCACCGTGACGCCCTCCGCGTCCAGTCTCTGACCTCTCTGGCCGGCGCAAGGAGCCGGAACCCACTCCCACGCGCCGGTTTCAGCAGCTCCGTCATCTCCGGCCTCGCCCAAGGCAGCGGCGAGTACTTCACACGCCTCGGCGTTGGAACCCCTCCTAGATACCTCTACATGGTCCTCGACACTGGAAGCGACATTGTTTGGCTCCAATGCTCCCCTTGCCGCAAATGCTACTCCCAATCCGATCCCATTTTCAACCCCTTTAAATCCAAATCCTTCGCCGGAATCCCCTGCTCTTCCCCTCTCTGCCGCCGTCTCGACTCCTCCGGCTGCTCCACCCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGACGGCTCCTTCACCACCGGCGACTTCGCCACCGAAACCCTCACCTTTCGTGGCAATCAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATCAAGGCCTCTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGGCCGTGGCCGCTTGTCTTTCCCTTCCCAAACCGGAATCCGGTTCAACCACAAATTCTCTTATTGTTTGGTGGACCGGTCCGCTTCCTCCAAACCCTCCTCCATGGTTTTCGGCGATGCGGCGATTTCCCGGCTCGCCCGGTTCACTCCTCTGATTCGGAACCCCAAATTGGAAACGTTTTATTACGTCGAACTCATCGGAATCAGCGTCGGCGGAGTCCGAGTCCGCGGCGTCTCCGCCGCTCTCTTCAAGCTCGATCCGGCCGGCAACGGCGGCGTCATCATCGACTCGGGTACCTCGGTAACCCGATTGACCCGACCCGCTTACACGGCTCTTCGCGACGCGTTCCGGGCCGGAGCGGCCCATTTGAAAAAGGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACGACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGACGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCCGGCGACGAATTATTTGATTCCGGTGGACGACAGTGGGAGCTTTTGCTTTGCGTTTGCGGGTACTATGTCCGGGTTGTCGATTATTGGGAACATTCAACAGCAGGGGTTCCGGGTTGTGTACGACTTGGCGGGTTCTCGGATCGGGTTTGCTCCACGTGGGTGCACGTGA

Protein sequence

MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Homology
BLAST of Tan0003138 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 658.7 bits (1698), Expect = 5.0e-188
Identity = 337/478 (70.50%), Postives = 390/478 (81.59%), Query Frame = 0

Query: 9   LFFFFFFFFFLFFSFAAASEFQIL--------TLRPL---PNPSRPPLLEPQFHSEEEAL 68
           L F   FFF    SF++   FQ L           P+   P+     LLE +F S  ++ 
Sbjct: 8   LLFSLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESGSDS- 67

Query: 69  KSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAG---ARSRNPLPR-AG 128
           +S++ +TL L H+D+LS NKTP +LF+ RL RD+ RV+S+ +LA     R+    PR  G
Sbjct: 68  ESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGG 127

Query: 129 FSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNP 188
           FSSSV+SGL+QGSGEYFTRLGVGTP RY+YMVLDTGSDIVWLQC+PCR+CYSQSDPIF+P
Sbjct: 128 FSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDP 187

Query: 189 FKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIA 248
            KSK++A IPCSSP CRRLDS+GC+TRR TCLYQVSYGDGSFT GDF+TETLTFR N++ 
Sbjct: 188 RKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVK 247

Query: 249 KVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVF 308
            VALGCGHDN+GLFVGAAGLLGLG+G+LSFP QTG RFN KFSYCLVDRSASSKPSS+VF
Sbjct: 248 GVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVF 307

Query: 309 GDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDS 368
           G+AA+SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV GV+A+LFKLD  GNGGVIIDS
Sbjct: 308 GNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDS 367

Query: 369 GTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGAD 428
           GTSVTRL RPAY A+RDAFR GA  LK+ P+FSLFDTC+DLS  + VKVPTVVLHFRGAD
Sbjct: 368 GTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGAD 427

Query: 429 MSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
           +SLPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP GC
Sbjct: 428 VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Tan0003138 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 404.8 bits (1039), Expect = 1.3e-111
Identity = 226/453 (49.89%), Postives = 287/453 (63.36%), Query Frame = 0

Query: 38  PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQ 97
           P  S     +P+  S+     S++ ++LELH  D+   S +K    L   RL RD+ RV 
Sbjct: 55  PTRSSLTTTKPESLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVA 114

Query: 98  SLT-----SLAGARSRNPLP---------RAGFSSSVISGLAQGSGEYFTRLGVGTPPRY 157
            +      ++ G    +  P             ++ V+SG +QGSGEYF+R+GVGTP + 
Sbjct: 115 GIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKE 174

Query: 158 LYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRR 217
           +Y+VLDTGSD+ W+QC PC  CY QSDP+FNP  S ++  + CS+P C  L++S C  R 
Sbjct: 175 MYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSAC--RS 234

Query: 218 HTCLYQVSYGDGSFTTGDFATETLTF-RGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGR 277
           + CLYQVSYGDGSFT G+ AT+T+TF    +I  VALGCGHDN+GLF GAAGLLGLG G 
Sbjct: 235 NKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGV 294

Query: 278 LSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVE 337
           LS  +Q        FSYCLVDR  S K SS+ F    +       PL+RN K++TFYYV 
Sbjct: 295 LSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVG 354

Query: 338 LIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLK 397
           L G SVGG +V  +  A+F +D +G+GGVI+D GT+VTRL   AY +LRDAF     +LK
Sbjct: 355 LSGFSVGGEKV-VLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLK 414

Query: 398 KG-PEFSLFDTCYDLSGQSAVKVPTVVLHFRGA-DMSLPATNYLIPVDDSGSFCFAFAGT 457
           KG    SLFDTCYD S  S VKVPTV  HF G   + LPA NYLIPVDDSG+FCFAFA T
Sbjct: 415 KGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPT 474

Query: 458 MSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
            S LSIIGN+QQQG R+ YDL+ + IG +   C
Sbjct: 475 SSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Tan0003138 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 5.7e-107
Identity = 214/476 (44.96%), Postives = 298/476 (62.61%), Query Frame = 0

Query: 8   VLFFFFFFFFFLFFSFAAAS-----EFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAG 67
           +L   FFFF  L    +++S     +FQI+ +  L  P       P F++   + +S++ 
Sbjct: 1   MLLPLFFFFLHLHLHLSSSSSISFPDFQIIDV--LQPPLTVTATLPDFNNTHFSDESSSK 60

Query: 68  VTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSLAGAR----SRNPLPRAGFSS 127
            TL L H D       +      + R+ RD  RV ++      +    S +      F S
Sbjct: 61  YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 120

Query: 128 SVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKS 187
            ++SG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PC+ CY QSDP+F+P KS
Sbjct: 121 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 180

Query: 188 KSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVA 247
            S+ G+ C S +C R+++SGC +    C Y+V YGDGS+T G  A ETLTF    +  VA
Sbjct: 181 GSYTGVSCGSSVCDRIENSGCHS--GGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 240

Query: 248 LGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDA 307
           +GCGH N+G+F+GAAGLLG+G G +SF  Q   +    F YCLV R   S   S+VFG  
Sbjct: 241 MGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGRE 300

Query: 308 AISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTS 367
           A+   A + PL+RNP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+
Sbjct: 301 ALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-PLPDGVFDLTETGDGGVVMDTGTA 360

Query: 368 VTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHF-RGADMS 427
           VTRL   AY A RD F++  A+L +    S+FDTCYDLSG  +V+VPTV  +F  G  ++
Sbjct: 361 VTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLT 420

Query: 428 LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
           LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +GF P  C
Sbjct: 421 LPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Tan0003138 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 4.4e-75
Identity = 161/389 (41.39%), Postives = 214/389 (55.01%), Query Frame = 0

Query: 87  LHRDALRVQSLTSLAGARSRNPLPRAGFSS-SVISGLAQGSGEYFTRLGVGTPPRYLYMV 146
           + RD  RV+S+ S     S N +  A  +     SG+  GSG Y   +G+GTP   L +V
Sbjct: 89  IRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLV 148

Query: 147 LDTGSDIVWLQCSPC-RKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTC 206
            DTGSD+ W QC PC   CYSQ +P FNP  S ++  + CSSP+C   D+  CS     C
Sbjct: 149 FDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCE--DAESCSA--SNC 208

Query: 207 LYQVSYGDGSFTTGDFATETLTFRGNQIAK-VALGCGHDNQGLFVGAAGLLGLGRGRLSF 266
           +Y + YGD SFT G  A E  T   + + + V  GCG +NQGLF G AGLLGLG G+LS 
Sbjct: 209 VYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSL 268

Query: 267 PSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIG 326
           P+QT   +N+ FSYCL   +++S    + FG A IS   +FTP+   P     Y +++IG
Sbjct: 269 PAQTTTTYNNIFSYCLPSFTSNS-TGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIG 328

Query: 327 ISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGP 386
           ISVG   +  ++   F  +     G IIDSGT  TRL    Y  LR  F+   +  K   
Sbjct: 329 ISVGDKEL-AITPNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTS 388

Query: 387 EFSLFDTCYDLSGQSAVKVPTVVLHFRGAD-MSLPATNYLIPVDDSGSFCFAFAGTMSGL 446
            + LFDTCYD +G   V  PT+   F G+  + L  +   +P+  S   C AFAG     
Sbjct: 389 GYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKIS-QVCLAFAGNDDLP 448

Query: 447 SIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
           +I GN+QQ    VVYD+AG R+GFAP GC
Sbjct: 449 AIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Tan0003138 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 4.4e-75
Identity = 178/455 (39.12%), Postives = 239/455 (52.53%), Query Frame = 0

Query: 21  FSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPT 80
           +SF  A     + + P  + SR  L     +   EA     G  + L H+DS   N T  
Sbjct: 6   YSFLLALSIVYIFVAPTHSTSRTAL-----NHRHEA--KVTGFQIMLEHVDS-GKNLTKF 65

Query: 81  DLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPR 140
            L    + R + R+Q L ++    S       G  +SV +    G GEY   L +GTP +
Sbjct: 66  QLLERAIERGSRRLQRLEAMLNGPS-------GVETSVYA----GDGEYLMNLSIGTPAQ 125

Query: 141 YLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR 200
               ++DTGSD++W QC PC +C++QS PIFNP  S SF+ +PCSS LC+ L S  CS  
Sbjct: 126 PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCS-- 185

Query: 201 RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVG-AAGLLGLGRG 260
            + C Y   YGDGS T G   TETLTF    I  +  GCG +NQG   G  AGL+G+GRG
Sbjct: 186 NNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRG 245

Query: 261 RLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF--TPLIRNPKLETFY 320
            LS PSQ  +    KFSYC+     SS PS+++ G  A S  A    T LI++ ++ TFY
Sbjct: 246 PLSLPSQLDVT---KFSYCMTP-IGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFY 305

Query: 321 YVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAA 380
           Y+ L G+SVG  R+    +A       G GG+IIDSGT++T     AY ++R  F +   
Sbjct: 306 YITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQIN 365

Query: 381 HLKKGPEFSLFDTCYDL-SGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFA 440
                   S FD C+   S  S +++PT V+HF G D+ LP+ NY I    +G  C A  
Sbjct: 366 LPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFIS-PSNGLICLAMG 425

Query: 441 GTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
            +  G+SI GNIQQQ   VVYD   S + FA   C
Sbjct: 426 SSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Tan0003138 vs. NCBI nr
Match: KAG7025313.1 (Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 840.5 bits (2170), Expect = 7.1e-240
Identity = 421/472 (89.19%), Postives = 443/472 (93.86%), Query Frame = 0

Query: 1   MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST 60
           MESPP  +LFFFFFFF  +    +AASEFQ LTLR LP PS  P  +PQF + +E L+ST
Sbjct: 1   MESPPRYLLFFFFFFFAAV---ASAASEFQTLTLRRLPIPSPLPFPQPQFDT-QETLEST 60

Query: 61  AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS 120
           A +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL   RSR PL RAGFSSSVIS
Sbjct: 61  AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVIS 120

Query: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180
           GLAQGSGEYFTRLGVGTPPRY++MVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFA
Sbjct: 121 GLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180

Query: 181 GIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCG 240
           GIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCG
Sbjct: 181 GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCG 240

Query: 241 HDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300
           HDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Sbjct: 241 HDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300

Query: 301 LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRL 360
           LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRL
Sbjct: 301 LARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRL 360

Query: 361 TRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATN 420
           TRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATN
Sbjct: 361 TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATN 420

Query: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 468

BLAST of Tan0003138 vs. NCBI nr
Match: XP_022959948.1 (aspartyl protease family protein 2 [Cucurbita moschata])

HSP 1 Score: 839.7 bits (2168), Expect = 1.2e-239
Identity = 422/472 (89.41%), Postives = 443/472 (93.86%), Query Frame = 0

Query: 1   MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST 60
           MESPP  +LFFFFFFF  +    +AASEFQ LTLR LP PS  P  +PQF + +E L+ST
Sbjct: 1   MESPPRYLLFFFFFFFAAV---ASAASEFQTLTLRRLPIPSPLPFPQPQFDT-QETLEST 60

Query: 61  AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS 120
           A +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL  ARSR PL RAGFSSSVIS
Sbjct: 61  AALTVELHHLDSLSPNKTPSDLFNLRLHRDALRVDSLTSLTAARSRTPLRRAGFSSSVIS 120

Query: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180
           GLAQGSGEYFTRLGVGTP RY+YMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFA
Sbjct: 121 GLAQGSGEYFTRLGVGTPSRYIYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180

Query: 181 GIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCG 240
           GIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCG
Sbjct: 181 GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCG 240

Query: 241 HDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300
           HDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Sbjct: 241 HDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300

Query: 301 LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRL 360
           LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRL
Sbjct: 301 LARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRL 360

Query: 361 TRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATN 420
           TRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATN
Sbjct: 361 TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATN 420

Query: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 468

BLAST of Tan0003138 vs. NCBI nr
Match: XP_023514169.1 (aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 839.0 bits (2166), Expect = 2.1e-239
Identity = 421/472 (89.19%), Postives = 441/472 (93.43%), Query Frame = 0

Query: 1   MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST 60
           MESPP  +LFFFFFF        +AASEFQ LTLR LP PS  P  +PQF + +E L+ST
Sbjct: 1   MESPPRYLLFFFFFFAAVA----SAASEFQTLTLRRLPIPSPLPFPQPQFDT-QETLEST 60

Query: 61  AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS 120
           A +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL  ARSR PL RAGFSSSVIS
Sbjct: 61  AALTVELHHLDSLSPNKTPSDLFNLRLHRDALRVDSLTSLTAARSRTPLRRAGFSSSVIS 120

Query: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180
           GLAQGSGEYFTRLGVGTPPRY+YMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFA
Sbjct: 121 GLAQGSGEYFTRLGVGTPPRYIYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180

Query: 181 GIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCG 240
           GIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCG
Sbjct: 181 GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCG 240

Query: 241 HDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300
           HDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Sbjct: 241 HDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300

Query: 301 LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRL 360
           LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRL
Sbjct: 301 LARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRL 360

Query: 361 TRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATN 420
           TRPAYTALRDAFR GA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATN
Sbjct: 361 TRPAYTALRDAFRVGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATN 420

Query: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 467

BLAST of Tan0003138 vs. NCBI nr
Match: XP_023005015.1 (aspartyl protease family protein 2 [Cucurbita maxima])

HSP 1 Score: 834.7 bits (2155), Expect = 3.9e-238
Identity = 420/472 (88.98%), Postives = 439/472 (93.01%), Query Frame = 0

Query: 1   MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST 60
           MESPP N+LFFFFF         +AASEFQ LTLR LP PS     + QF + +E L+ST
Sbjct: 1   MESPPRNLLFFFFFXAAVA----SAASEFQTLTLRRLPIPSPLSFPQSQFDT-QETLEST 60

Query: 61  AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS 120
           A +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL  ARSR PL RAGFSSSVIS
Sbjct: 61  AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAARSRTPLRRAGFSSSVIS 120

Query: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180
           GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFA
Sbjct: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180

Query: 181 GIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCG 240
           GIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCG
Sbjct: 181 GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCG 240

Query: 241 HDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300
           HDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Sbjct: 241 HDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300

Query: 301 LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRL 360
           LARFTPLI NPKLETFYYVELIG SVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRL
Sbjct: 301 LARFTPLILNPKLETFYYVELIGFSVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRL 360

Query: 361 TRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATN 420
           TRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATN
Sbjct: 361 TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATN 420

Query: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 467

BLAST of Tan0003138 vs. NCBI nr
Match: KAG6592908.1 (Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 809.7 bits (2090), Expect = 1.3e-230
Identity = 400/444 (90.09%), Postives = 420/444 (94.59%), Query Frame = 0

Query: 37  LPNPSRPP--------LLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLH 96
           +PNP  PP        L +PQF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLH
Sbjct: 66  VPNPYSPPSSNSLSPSLPQPQFDT-QETLESTAALTVELHHLDSLSTNKTPSDLFNLRLH 125

Query: 97  RDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDT 156
           RDALRV SLTSL  ARSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTPPRY++MVLDT
Sbjct: 126 RDALRVDSLTSLTAARSRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDT 185

Query: 157 GSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQV 216
           GSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TRRHTCLYQV
Sbjct: 186 GSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQV 245

Query: 217 SYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTG 276
           SYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG
Sbjct: 246 SYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTG 305

Query: 277 IRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGG 336
           +RFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLI NPKLETFYYVELIGISVGG
Sbjct: 306 LRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGISVGG 365

Query: 337 VRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLF 396
           VRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLF
Sbjct: 366 VRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLF 425

Query: 397 DTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNI 456
           DTCYDLSGQSAVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNI
Sbjct: 426 DTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNI 485

Query: 457 QQQGFRVVYDLAGSRIGFAPRGCT 473
           QQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 486 QQQGFRVVYDLAGSRIGFAPRGCT 508

BLAST of Tan0003138 vs. ExPASy TrEMBL
Match: A0A6J1H7E0 (aspartyl protease family protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111460848 PE=3 SV=1)

HSP 1 Score: 839.7 bits (2168), Expect = 5.9e-240
Identity = 422/472 (89.41%), Postives = 443/472 (93.86%), Query Frame = 0

Query: 1   MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST 60
           MESPP  +LFFFFFFF  +    +AASEFQ LTLR LP PS  P  +PQF + +E L+ST
Sbjct: 1   MESPPRYLLFFFFFFFAAV---ASAASEFQTLTLRRLPIPSPLPFPQPQFDT-QETLEST 60

Query: 61  AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS 120
           A +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL  ARSR PL RAGFSSSVIS
Sbjct: 61  AALTVELHHLDSLSPNKTPSDLFNLRLHRDALRVDSLTSLTAARSRTPLRRAGFSSSVIS 120

Query: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180
           GLAQGSGEYFTRLGVGTP RY+YMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFA
Sbjct: 121 GLAQGSGEYFTRLGVGTPSRYIYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180

Query: 181 GIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCG 240
           GIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCG
Sbjct: 181 GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCG 240

Query: 241 HDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300
           HDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Sbjct: 241 HDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300

Query: 301 LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRL 360
           LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRL
Sbjct: 301 LARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRL 360

Query: 361 TRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATN 420
           TRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATN
Sbjct: 361 TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATN 420

Query: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 468

BLAST of Tan0003138 vs. ExPASy TrEMBL
Match: A0A6J1KW81 (aspartyl protease family protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111498134 PE=3 SV=1)

HSP 1 Score: 834.7 bits (2155), Expect = 1.9e-238
Identity = 420/472 (88.98%), Postives = 439/472 (93.01%), Query Frame = 0

Query: 1   MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST 60
           MESPP N+LFFFFF         +AASEFQ LTLR LP PS     + QF + +E L+ST
Sbjct: 1   MESPPRNLLFFFFFXAAVA----SAASEFQTLTLRRLPIPSPLSFPQSQFDT-QETLEST 60

Query: 61  AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS 120
           A +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL  ARSR PL RAGFSSSVIS
Sbjct: 61  AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAARSRTPLRRAGFSSSVIS 120

Query: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180
           GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFA
Sbjct: 121 GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA 180

Query: 181 GIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCG 240
           GIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCG
Sbjct: 181 GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCG 240

Query: 241 HDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300
           HDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Sbjct: 241 HDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISR 300

Query: 301 LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRL 360
           LARFTPLI NPKLETFYYVELIG SVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRL
Sbjct: 301 LARFTPLILNPKLETFYYVELIGFSVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRL 360

Query: 361 TRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATN 420
           TRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATN
Sbjct: 361 TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATN 420

Query: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 421 YLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 467

BLAST of Tan0003138 vs. ExPASy TrEMBL
Match: A0A5A7U8Z2 (Aspartyl protease family protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold673G001170 PE=3 SV=1)

HSP 1 Score: 784.3 bits (2024), Expect = 2.9e-223
Identity = 397/460 (86.30%), Postives = 418/460 (90.87%), Query Frame = 0

Query: 15  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDS 74
           +    +F  +AASEFQ LTLR LP PS  PL       + E+L+S+  A +TL+LHHLDS
Sbjct: 9   YLLLFYFISSAASEFQTLTLRSLPTPSPLPLF-----PDSESLQSSPAAALTLDLHHLDS 68

Query: 75  LSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGEYFTR 134
           LSLNKTPTDLFNLRLHRDALRV +LTS A           GFSSSVISGLAQGSGEYFTR
Sbjct: 69  LSLNKTPTDLFNLRLHRDALRVHALTSRAA---------PGFSSSVISGLAQGSGEYFTR 128

Query: 135 LGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL 194
           LGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRL
Sbjct: 129 LGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRL 188

Query: 195 DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAG 254
           DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGH N+GLFVGAAG
Sbjct: 189 DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAG 248

Query: 255 LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK 314
           LLGLGRGRLSFPSQTGIRFN KFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
Sbjct: 249 LLGLGRGRLSFPSQTGIRFNRKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK 308

Query: 315 LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAF 374
           L+TFYYVELIGISVGGVRVRGV  +LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAF
Sbjct: 309 LDTFYYVELIGISVGGVRVRGVYPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAF 368

Query: 375 RAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFC 434
           RAGA HLK+GPEFSLFDTCYDLSGQS+VKVPTVVLHFRGADM LPATNYLIPVD++GSFC
Sbjct: 369 RAGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGADMPLPATNYLIPVDENGSFC 428

Query: 435 FAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           FAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 429 FAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 454

BLAST of Tan0003138 vs. ExPASy TrEMBL
Match: A0A0A0K4G2 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G329350 PE=3 SV=1)

HSP 1 Score: 781.6 bits (2017), Expect = 1.9e-222
Identity = 394/460 (85.65%), Postives = 417/460 (90.65%), Query Frame = 0

Query: 15  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDS 74
           +    FF   AASEFQ LTLR LP PS  PL       + ++L+S+  A +TL+LHHLDS
Sbjct: 9   YLLLFFFISTAASEFQTLTLRSLPTPSPLPLF-----PDSQSLQSSPDAPLTLDLHHLDS 68

Query: 75  LSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGEYFTR 134
           LSLNKTPTDLFNLRLHRD LRV +L S A          AGFSSSV+SGL+QGSGEYFTR
Sbjct: 69  LSLNKTPTDLFNLRLHRDTLRVHALNSRA----------AGFSSSVVSGLSQGSGEYFTR 128

Query: 135 LGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL 194
           LGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRL
Sbjct: 129 LGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRL 188

Query: 195 DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAG 254
           DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGH N+GLFVGAAG
Sbjct: 189 DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAG 248

Query: 255 LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK 314
           LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
Sbjct: 249 LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK 308

Query: 315 LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAF 374
           L+TFYYV LIGISVGGVRVRGVS +LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAF
Sbjct: 309 LDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAF 368

Query: 375 RAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFC 434
           R GA HLK+GPEFSLFDTCYDLSGQS+VKVPTVVLHFRGADM+LPATNYLIPVD++GSFC
Sbjct: 369 RVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGADMALPATNYLIPVDENGSFC 428

Query: 435 FAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           FAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 429 FAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 453

BLAST of Tan0003138 vs. ExPASy TrEMBL
Match: A0A1S3CHC4 (aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103500464 PE=3 SV=1)

HSP 1 Score: 781.6 bits (2017), Expect = 1.9e-222
Identity = 398/460 (86.52%), Postives = 419/460 (91.09%), Query Frame = 0

Query: 15  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDS 74
           +    +F  +AASEFQ LTLR LP PS P  L P    + E+L+S+  A +TL+LHHLDS
Sbjct: 9   YLLLFYFISSAASEFQTLTLRSLPTPS-PLSLFP----DSESLQSSPAAALTLDLHHLDS 68

Query: 75  LSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGEYFTR 134
           LSLNKTPTDLFNLRLHRDALRV +LTS A           GFSSSVISGLAQGSGEYFTR
Sbjct: 69  LSLNKTPTDLFNLRLHRDALRVHALTSRAA---------PGFSSSVISGLAQGSGEYFTR 128

Query: 135 LGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL 194
           LGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRL
Sbjct: 129 LGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRL 188

Query: 195 DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAG 254
           DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGH N+GLFVGAAG
Sbjct: 189 DSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAG 248

Query: 255 LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK 314
           LLGLGRGRLSFPSQTGIRFN KFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
Sbjct: 249 LLGLGRGRLSFPSQTGIRFNRKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK 308

Query: 315 LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAF 374
           L+TFYYVELIGISVGGVRVRGV  +LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAF
Sbjct: 309 LDTFYYVELIGISVGGVRVRGVYPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAF 368

Query: 375 RAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFC 434
           RAGA HLK+GPEFSLFDTCYDLSGQS+VKVPTVVLHFRGADM LPATNYLIPVD++GSFC
Sbjct: 369 RAGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGADMPLPATNYLIPVDENGSFC 428

Query: 435 FAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 473
           FAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Sbjct: 429 FAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 454

BLAST of Tan0003138 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 658.7 bits (1698), Expect = 3.6e-189
Identity = 337/478 (70.50%), Postives = 390/478 (81.59%), Query Frame = 0

Query: 9   LFFFFFFFFFLFFSFAAASEFQIL--------TLRPL---PNPSRPPLLEPQFHSEEEAL 68
           L F   FFF    SF++   FQ L           P+   P+     LLE +F S  ++ 
Sbjct: 8   LLFSLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQPDSDSESLLESEFESGSDS- 67

Query: 69  KSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAG---ARSRNPLPR-AG 128
           +S++ +TL L H+D+LS NKTP +LF+ RL RD+ RV+S+ +LA     R+    PR  G
Sbjct: 68  ESSSSITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGG 127

Query: 129 FSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNP 188
           FSSSV+SGL+QGSGEYFTRLGVGTP RY+YMVLDTGSDIVWLQC+PCR+CYSQSDPIF+P
Sbjct: 128 FSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDP 187

Query: 189 FKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIA 248
            KSK++A IPCSSP CRRLDS+GC+TRR TCLYQVSYGDGSFT GDF+TETLTFR N++ 
Sbjct: 188 RKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVK 247

Query: 249 KVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVF 308
            VALGCGHDN+GLFVGAAGLLGLG+G+LSFP QTG RFN KFSYCLVDRSASSKPSS+VF
Sbjct: 248 GVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVF 307

Query: 309 GDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDS 368
           G+AA+SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV GV+A+LFKLD  GNGGVIIDS
Sbjct: 308 GNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDS 367

Query: 369 GTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGAD 428
           GTSVTRL RPAY A+RDAFR GA  LK+ P+FSLFDTC+DLS  + VKVPTVVLHFRGAD
Sbjct: 368 GTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGAD 427

Query: 429 MSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
           +SLPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP GC
Sbjct: 428 VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Tan0003138 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 592.0 bits (1525), Expect = 4.1e-169
Identity = 305/473 (64.48%), Postives = 365/473 (77.17%), Query Frame = 0

Query: 13  FFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDS 72
           F  F  LFF+ +A+S++Q L +  LP+ +     E +  ++E   +ST  +++ L H+D+
Sbjct: 11  FSVFAVLFFTSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTSLSVHLSHVDA 70

Query: 73  LS--LNKTPTDLFNLRLHRDALRVQSLTSLA------GARSRNPLPRAGFSSSVISGLAQ 132
           LS   + +P DLFNLRL RD+LRV+S+TSLA       A  R P    GFS +VISGL+Q
Sbjct: 71  LSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQ 130

Query: 133 GSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPC 192
           GSGEYF RLGVGTP   +YMVLDTGSD+VWLQCSPC+ CY+Q+D IF+P KSK+FA +PC
Sbjct: 131 GSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPC 190

Query: 193 SSPLCRRL-DSSGCSTRR-HTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHD 252
            S LCRRL DSS C TRR  TCLYQVSYGDGSFT GDF+TETLTF G ++  V LGCGHD
Sbjct: 191 GSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHD 250

Query: 253 NQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDR----SASSKPSSMVFGDAAI 312
           N+GLFVGAAGLLGLGRG LSFPSQT  R+N KFSYCLVDR    S+S  PS++VFG+AA+
Sbjct: 251 NEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAV 310

Query: 313 SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVT 372
            + + FTPL+ NPKL+TFYY++L+GISVGG RV GVS + FKLD  GNGGVIIDSGTSVT
Sbjct: 311 PKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVT 370

Query: 373 RLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPA 432
           RLT+PAY ALRDAFR GA  LK+ P +SLFDTC+DLSG + VKVPTVV HF G ++SLPA
Sbjct: 371 RLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPA 430

Query: 433 TNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
           +NYLIPV+  G FCFAFAGTM  LSIIGNIQQQGFRV YDL GSR+GF  R C
Sbjct: 431 SNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Tan0003138 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 404.8 bits (1039), Expect = 9.3e-113
Identity = 226/453 (49.89%), Postives = 287/453 (63.36%), Query Frame = 0

Query: 38  PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQ 97
           P  S     +P+  S+     S++ ++LELH  D+   S +K    L   RL RD+ RV 
Sbjct: 55  PTRSSLTTTKPESLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVA 114

Query: 98  SLT-----SLAGARSRNPLP---------RAGFSSSVISGLAQGSGEYFTRLGVGTPPRY 157
            +      ++ G    +  P             ++ V+SG +QGSGEYF+R+GVGTP + 
Sbjct: 115 GIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKE 174

Query: 158 LYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRR 217
           +Y+VLDTGSD+ W+QC PC  CY QSDP+FNP  S ++  + CS+P C  L++S C  R 
Sbjct: 175 MYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSAC--RS 234

Query: 218 HTCLYQVSYGDGSFTTGDFATETLTF-RGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGR 277
           + CLYQVSYGDGSFT G+ AT+T+TF    +I  VALGCGHDN+GLF GAAGLLGLG G 
Sbjct: 235 NKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGV 294

Query: 278 LSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVE 337
           LS  +Q        FSYCLVDR  S K SS+ F    +       PL+RN K++TFYYV 
Sbjct: 295 LSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVG 354

Query: 338 LIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLK 397
           L G SVGG +V  +  A+F +D +G+GGVI+D GT+VTRL   AY +LRDAF     +LK
Sbjct: 355 LSGFSVGGEKV-VLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLK 414

Query: 398 KG-PEFSLFDTCYDLSGQSAVKVPTVVLHFRGA-DMSLPATNYLIPVDDSGSFCFAFAGT 457
           KG    SLFDTCYD S  S VKVPTV  HF G   + LPA NYLIPVDDSG+FCFAFA T
Sbjct: 415 KGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPT 474

Query: 458 MSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
            S LSIIGN+QQQG R+ YDL+ + IG +   C
Sbjct: 475 SSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Tan0003138 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 390.6 bits (1002), Expect = 1.8e-108
Identity = 227/491 (46.23%), Postives = 298/491 (60.69%), Query Frame = 0

Query: 13  FFFFFFLFFSFAAASEFQILTLRPLPNPSRPPL----LEPQFH------------SEEEA 72
           + FFFF+FF  + +S F     R LP  S        +    H             EE+ 
Sbjct: 5   YSFFFFIFFLTSHSSVFS----RILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQT 64

Query: 73  LKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRN------- 132
             +++  +L+LH   S+  + +     L   RL+RD  RV+SL +       N       
Sbjct: 65  HSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLK 124

Query: 133 ------PLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCR 192
                         + +ISG  QGSGEYFTR+G+G P R +YMVLDTGSD+ WLQC+PC 
Sbjct: 125 PISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCA 184

Query: 193 KCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFA 252
            CY Q++PIF P  S S+  + C +P C  L+ S C  R  TCLY+VSYGDGS+T GDFA
Sbjct: 185 DCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC--RNATCLYEVSYGDGSYTVGDFA 244

Query: 253 TETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD 312
           TETLT     +  VA+GCGH N+GLFVGAAGLLGLG G L+ PSQ        FSYCLVD
Sbjct: 245 TETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLN---TTSFSYCLVD 304

Query: 313 RSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKL 372
           R + S  S++ FG  ++S  A   PL+RN +L+TFYY+ L GISVGG  ++ +  + F++
Sbjct: 305 RDSDS-ASTVDFG-TSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQ-IPQSSFEM 364

Query: 373 DPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVK 432
           D +G+GG+IIDSGT+VTRL    Y +LRD+F  G   L+K    ++FDTCY+LS ++ V+
Sbjct: 365 DESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVE 424

Query: 433 VPTVVLHFRGADM-SLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 472
           VPTV  HF G  M +LPA NY+IPVD  G+FC AFA T S L+IIGN+QQQG RV +DLA
Sbjct: 425 VPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLA 483

BLAST of Tan0003138 vs. TAIR 10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 389.4 bits (999), Expect = 4.0e-108
Identity = 214/476 (44.96%), Postives = 298/476 (62.61%), Query Frame = 0

Query: 8   VLFFFFFFFFFLFFSFAAAS-----EFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAG 67
           +L   FFFF  L    +++S     +FQI+ +  L  P       P F++   + +S++ 
Sbjct: 1   MLLPLFFFFLHLHLHLSSSSSISFPDFQIIDV--LQPPLTVTATLPDFNNTHFSDESSSK 60

Query: 68  VTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSLAGAR----SRNPLPRAGFSS 127
            TL L H D       +      + R+ RD  RV ++      +    S +      F S
Sbjct: 61  YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 120

Query: 128 SVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKS 187
            ++SG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PC+ CY QSDP+F+P KS
Sbjct: 121 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 180

Query: 188 KSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVA 247
            S+ G+ C S +C R+++SGC +    C Y+V YGDGS+T G  A ETLTF    +  VA
Sbjct: 181 GSYTGVSCGSSVCDRIENSGCHS--GGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 240

Query: 248 LGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDA 307
           +GCGH N+G+F+GAAGLLG+G G +SF  Q   +    F YCLV R   S   S+VFG  
Sbjct: 241 MGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGRE 300

Query: 308 AISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTS 367
           A+   A + PL+RNP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+
Sbjct: 301 ALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI-PLPDGVFDLTETGDGGVVMDTGTA 360

Query: 368 VTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHF-RGADMS 427
           VTRL   AY A RD F++  A+L +    S+FDTCYDLSG  +V+VPTV  +F  G  ++
Sbjct: 361 VTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLT 420

Query: 428 LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 472
           LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +GF P  C
Sbjct: 421 LPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LNJ35.0e-18870.50Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q9LS401.3e-11149.89Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q9LHE35.7e-10744.96Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q9LEW34.4e-7541.39Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1[more]
Q766C34.4e-7539.12Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Match NameE-valueIdentityDescription
KAG7025313.17.1e-24089.19Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. argyr... [more]
XP_022959948.11.2e-23989.41aspartyl protease family protein 2 [Cucurbita moschata][more]
XP_023514169.12.1e-23989.19aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo][more]
XP_023005015.13.9e-23888.98aspartyl protease family protein 2 [Cucurbita maxima][more]
KAG6592908.11.3e-23090.09Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. soror... [more]
Match NameE-valueIdentityDescription
A0A6J1H7E05.9e-24089.41aspartyl protease family protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111460848... [more]
A0A6J1KW811.9e-23888.98aspartyl protease family protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111498134 P... [more]
A0A5A7U8Z22.9e-22386.30Aspartyl protease family protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C... [more]
A0A0A0K4G21.9e-22285.65Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G32935... [more]
A0A1S3CHC41.9e-22286.52aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103500464 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT1G01300.13.6e-18970.50Eukaryotic aspartyl protease family protein [more]
AT3G61820.14.1e-16964.48Eukaryotic aspartyl protease family protein [more]
AT3G18490.19.3e-11349.89Eukaryotic aspartyl protease family protein [more]
AT1G25510.11.8e-10846.23Eukaryotic aspartyl protease family protein [more]
AT3G20015.14.0e-10844.96Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 349..360
score: 41.93
coord: 135..155
score: 41.49
coord: 443..458
score: 26.83
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 24..471
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 292..472
e-value: 2.0E-53
score: 182.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 114..291
e-value: 2.5E-53
score: 183.0
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 122..471
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 129..295
e-value: 4.8E-56
score: 189.8
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 316..467
e-value: 1.2E-35
score: 122.6
NoneNo IPR availablePANTHERPTHR13683:SF761ASPARTYL PROTEASE FAMILY PROTEIN 2-LIKEcoord: 24..471
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 144..155
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 349..360
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 129..467
score: 46.890614
IPR033873CND41-likeCDDcd05472cnd41_likecoord: 128..471
e-value: 3.04073E-140
score: 402.805

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0003138.1Tan0003138.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity