Tan0003209 (gene) Snake gourd v1

Overview
NameTan0003209
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCysteine protease
LocationLG05: 13296905 .. 13299659 (-)
RNA-Seq ExpressionTan0003209
SyntenyTan0003209
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCTGTAGAGTCGTAGTCATATTTCTTCATTCTTCTTCTTCTCCATTTTTTTCACCTTCAATTCTCAGCGGTGAAAATGGTTAAGCGTCTCTCTCTCTTCGTCGCTCTCTCTCTCCTCGCCGTATTGGCGATCGGAGGCGACTTTCTCTCCGGCGAATCGGACGGAGATTTTATTATTAGGCAGGTTGTCGACGACGGAGGAAGTAACGGCGACGATCTGCTGCTTGGAGCCGAGCACCACTTTTCGCTCTTTAAGCGGAAGTTTCGGAAGTCGTACGCCTCCCAGGAGGAGCATGATCACCGGTTCAGGGTCTTCAAAGCGAACCTGAGGCGAGCTCAGCGCCACCAGGCCCTTGATCCATCTGCTACTCATGGCGTCACTCAGTTTTCTGATTTGACTCCATCGGAGTTCCGACGATCGTTTCTAGGTCTTAGAGGTCGCCGTCTTGGACTCCCTGTGGATGCTAACAAGGCCCCTATTCTTCCTACTGATGGCCTTCCGACCGATTTCGATTGGAGAGATCATGGAGCTGTCACGGAAGTCAAGAATCAGGTTCGCGTTTCTTCTTTGTCCTCTCTTAATTTGCTTTTACCATTTTGTTTCCTCCTTAAATTTTTACTTGGGGAGAAAGTCGCCTTCATCTTATTTTGGTATGGAATTGAATGCTTTCCGATTCTCCTTCGTATCTTTTATATATTTAACATGGCTATTTCGTTTAGTTTGCAACTTCTAACTGTCTGTTATTTGTATTCCTCATGTTGATGATGACTTCCCATTAATTATTTGACCAAAGATGACTTGCATTTATTAAACCGAGGTTTTGATTATTGTGATTGTTTCTGTTCTTATTTTCTTATAAGGGTTCTTGTGGGTCGTGCTGGAGTTTCAGTACAACCGGTGCTCTTGAAGGCGCTAACTTTCTTGCTACCGGCAAACTTGTTAGCCTTAGCGAACAACAGCTTGTTGACTGCGATCACGAGGTTTGGATTCTAATTTTCACTTGTCTTTGACTTAATTTTGCAGACTTTACGATTAGATAATGGTCGTTTATGATTTTGTATTGGGTGCCATATCTTCCTCTTTGTACGTGATTCCAGAGTTTTAGGAAGCAAACTACCTGTTTTTGCGTTTGTTGATGATCTATGATTCCCGTTTATGTTGATGGTTGCCCTCTATTAGATTTAGATGCTACGTATTAATTTGTTACTAACTTACGGCTATTGTATAATATGGAGACAATTGGTAATTATTTGCATCTGTTTAGTGTTGGCTGCCAATAAATGATCTGTACTAGAAAATTTTGCAATCTAGAAAAGGATTCAAGTGCAAGATTTGCCTTGCAGGTTGCTTGAAGTATTTGATAAATGATAAACTTTTTGGTACAAGGGATGAATGATTTGGTTCGTTTCAATTTGTGGAATTTTAGGAAATTTAATTATGTATGTGATGTTGGTTAATGTTTATGATTGTATATAGTTTTTCTGTATCTAATTAAAAACGAAGATAAAAAAAGGATCTGCTCATTAATTGTCATTTTAACTCGTTATTACATATTTTACATTCATGTCCATCTGAACATTAAAAATATGCACTTGAACTTTATATCCTGACTTTTGTTCAGTTATATCTACATTGGAACTTCATTAGGAAAGGCGTTTCTTTTCACTTTCATGGAAATTAACGCACTTCTAAACTTTTGCCAGTGCGATCCAGAAGAAAAAGGTTCCTGTGATTCTGGCTGCGGTGGGGGGCTAATGAATAGTGCATTCGAGTACACATTGAAAGCTGGTGGACTTATGAAAGAGAAAGACTACCCATACACTGGTACAGATCGTGGAACCTGCAAATTTGACAAGAGCAAAATTGCTGCATCAGTAGCCAACTTTAGTGTTATCTCTCTTGATGAAGAACAAATTGCTGCTAATCTCGTGAAAAATGGTCCTCTTGCAAGTAATTACATTCCCAACCAAGCTTTCTCAAATCCTATTCTTTTCTTTTGTTTGTAAAACCTATTATAGTTGCTGACTGCTCAATTTCTCTGGAATGCAGTTGCTATCAATGCCGTATTCATGCAGACATATGTTGGAGGGGTTTCTTGCCCTTACATTTGCTCAAAGCATTTGGATCATGGAGTTCTGTTGGTTGGTTATGGATCTGCTGGCTATGCTCCCATCCGATTGAAAGAGAAACCTTACTGGATCATTAAGAACTCCTGGGGAGCCAACTGGGGGGAGAATGGATACTACAAAATCTGCCGGGGTCGCAATATCTGTGGTGTCGATTCTATGGTGTCGACAGTCGCTGCAGTTCATACCGCCTCGAACTAGTTTATGAACTCAAAACTGCTGGAGATTGCTCAGCTACAATTCCTGTATATATTGCAATATTATCACTCTTCTGGCAGGAAGTTGGTAACTGCCATTTCAAGCATAGGACGTCTCAGTAGAATTTGAACTCTCTGCTTTTATGTTGGTTTAGAAACTTGAAAAAGTAGCTTATATATATATGCATATTTATATTTAGTTCAAGCCGATTGCTTGTAAGCAAAAAACAAGAATCTGTTTCTGATTTGTGGTTGGCTCACCCACTCAACTTTGATGCGTATGAAAAATTCTCCCAATGTCAGTGTACAACATGATAGGGAAATATTTCTGATGTGTTATCTCGATCTTTTCTGATGCTAAACGTCAACATGATAATTTAAAATAGTCCAGGGATTGTCTTAATGTTTATTTAAGTAACACAATTTGGAAG

mRNA sequence

CTCTGTAGAGTCGTAGTCATATTTCTTCATTCTTCTTCTTCTCCATTTTTTTCACCTTCAATTCTCAGCGGTGAAAATGGTTAAGCGTCTCTCTCTCTTCGTCGCTCTCTCTCTCCTCGCCGTATTGGCGATCGGAGGCGACTTTCTCTCCGGCGAATCGGACGGAGATTTTATTATTAGGCAGGTTGTCGACGACGGAGGAAGTAACGGCGACGATCTGCTGCTTGGAGCCGAGCACCACTTTTCGCTCTTTAAGCGGAAGTTTCGGAAGTCGTACGCCTCCCAGGAGGAGCATGATCACCGGTTCAGGGTCTTCAAAGCGAACCTGAGGCGAGCTCAGCGCCACCAGGCCCTTGATCCATCTGCTACTCATGGCGTCACTCAGTTTTCTGATTTGACTCCATCGGAGTTCCGACGATCGTTTCTAGGTCTTAGAGGTCGCCGTCTTGGACTCCCTGTGGATGCTAACAAGGCCCCTATTCTTCCTACTGATGGCCTTCCGACCGATTTCGATTGGAGAGATCATGGAGCTGTCACGGAAGTCAAGAATCAGGGTTCTTGTGGGTCGTGCTGGAGTTTCAGTACAACCGGTGCTCTTGAAGGCGCTAACTTTCTTGCTACCGGCAAACTTGTTAGCCTTAGCGAACAACAGCTTGTTGACTGCGATCACGAGTGCGATCCAGAAGAAAAAGGTTCCTGTGATTCTGGCTGCGGTGGGGGGCTAATGAATAGTGCATTCGAGTACACATTGAAAGCTGGTGGACTTATGAAAGAGAAAGACTACCCATACACTGGTACAGATCGTGGAACCTGCAAATTTGACAAGAGCAAAATTGCTGCATCAGTAGCCAACTTTAGTGTTATCTCTCTTGATGAAGAACAAATTGCTGCTAATCTCGTGAAAAATGGTCCTCTTGCAATTGCTATCAATGCCGTATTCATGCAGACATATGTTGGAGGGGTTTCTTGCCCTTACATTTGCTCAAAGCATTTGGATCATGGAGTTCTGTTGGTTGGTTATGGATCTGCTGGCTATGCTCCCATCCGATTGAAAGAGAAACCTTACTGGATCATTAAGAACTCCTGGGGAGCCAACTGGGGGGAGAATGGATACTACAAAATCTGCCGGGGTCGCAATATCTGTGGTGTCGATTCTATGGTGTCGACAGTCGCTGCAGTTCATACCGCCTCGAACTAGTTTATGAACTCAAAACTGCTGGAGATTGCTCAGCTACAATTCCTGTATATATTGCAATATTATCACTCTTCTGGCAGGAAGTTGGTAACTGCCATTTCAAGCATAGGACGTCTCAGTAGAATTTGAACTCTCTGCTTTTATGTTGGTTTAGAAACTTGAAAAAGTAGCTTATATATATATGCATATTTATATTTAGTTCAAGCCGATTGCTTGTAAGCAAAAAACAAGAATCTGTTTCTGATTTGTGGTTGGCTCACCCACTCAACTTTGATGCGTATGAAAAATTCTCCCAATGTCAGTGTACAACATGATAGGGAAATATTTCTGATGTGTTATCTCGATCTTTTCTGATGCTAAACGTCAACATGATAATTTAAAATAGTCCAGGGATTGTCTTAATGTTTATTTAAGTAACACAATTTGGAAG

Coding sequence (CDS)

ATGGTTAAGCGTCTCTCTCTCTTCGTCGCTCTCTCTCTCCTCGCCGTATTGGCGATCGGAGGCGACTTTCTCTCCGGCGAATCGGACGGAGATTTTATTATTAGGCAGGTTGTCGACGACGGAGGAAGTAACGGCGACGATCTGCTGCTTGGAGCCGAGCACCACTTTTCGCTCTTTAAGCGGAAGTTTCGGAAGTCGTACGCCTCCCAGGAGGAGCATGATCACCGGTTCAGGGTCTTCAAAGCGAACCTGAGGCGAGCTCAGCGCCACCAGGCCCTTGATCCATCTGCTACTCATGGCGTCACTCAGTTTTCTGATTTGACTCCATCGGAGTTCCGACGATCGTTTCTAGGTCTTAGAGGTCGCCGTCTTGGACTCCCTGTGGATGCTAACAAGGCCCCTATTCTTCCTACTGATGGCCTTCCGACCGATTTCGATTGGAGAGATCATGGAGCTGTCACGGAAGTCAAGAATCAGGGTTCTTGTGGGTCGTGCTGGAGTTTCAGTACAACCGGTGCTCTTGAAGGCGCTAACTTTCTTGCTACCGGCAAACTTGTTAGCCTTAGCGAACAACAGCTTGTTGACTGCGATCACGAGTGCGATCCAGAAGAAAAAGGTTCCTGTGATTCTGGCTGCGGTGGGGGGCTAATGAATAGTGCATTCGAGTACACATTGAAAGCTGGTGGACTTATGAAAGAGAAAGACTACCCATACACTGGTACAGATCGTGGAACCTGCAAATTTGACAAGAGCAAAATTGCTGCATCAGTAGCCAACTTTAGTGTTATCTCTCTTGATGAAGAACAAATTGCTGCTAATCTCGTGAAAAATGGTCCTCTTGCAATTGCTATCAATGCCGTATTCATGCAGACATATGTTGGAGGGGTTTCTTGCCCTTACATTTGCTCAAAGCATTTGGATCATGGAGTTCTGTTGGTTGGTTATGGATCTGCTGGCTATGCTCCCATCCGATTGAAAGAGAAACCTTACTGGATCATTAAGAACTCCTGGGGAGCCAACTGGGGGGAGAATGGATACTACAAAATCTGCCGGGGTCGCAATATCTGTGGTGTCGATTCTATGGTGTCGACAGTCGCTGCAGTTCATACCGCCTCGAACTAG

Protein sequence

MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNICGVDSMVSTVAAVHTASN
Homology
BLAST of Tan0003209 vs. ExPASy Swiss-Prot
Match: P43296 (Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1)

HSP 1 Score: 562.4 bits (1448), Expect = 3.9e-159
Identity = 277/364 (76.10%), Postives = 305/364 (83.79%), Query Frame = 0

Query: 4   RLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFKRKF 63
           RL L+ ++ +L+   +           D +IRQVV  GG+  +  +L +E HFSLFKRKF
Sbjct: 3   RLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVV--GGA--EPQVLTSEDHFSLFKRKF 62

Query: 64  RKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRR 123
            K YAS EEHD+RF VFKANLRRA+RHQ LDPSATHGVTQFSDLT SEFR+  LG+R   
Sbjct: 63  GKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS-G 122

Query: 124 LGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATG 183
             LP DANKAPILPT+ LP DFDWRDHGAVT VKNQGSCGSCWSFS TGALEGANFLATG
Sbjct: 123 FKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATG 182

Query: 184 KLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDR 243
           KLVSLSEQQLVDCDHECDPEE  SCDSGC GGLMNSAFEYTLK GGLMKE+DYPYTG D 
Sbjct: 183 KLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDG 242

Query: 244 GTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICS 303
            TCK DKSKI ASV+NFSVIS+DEEQIAANLVKNGPLA+AINA +MQTY+GGVSCPYIC+
Sbjct: 243 KTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICT 302

Query: 304 KHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNICGVDSMVS 363
           + L+HGVLLVGYG+AGYAP R KEKPYWIIKNSWG  WGENG+YKIC+GRNICGVDSMVS
Sbjct: 303 RRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVS 361

Query: 364 TVAA 368
           TVAA
Sbjct: 363 TVAA 361

BLAST of Tan0003209 vs. ExPASy Swiss-Prot
Match: Q9SUL1 (Probable cysteine protease RD19C OS=Arabidopsis thaliana OX=3702 GN=RD19C PE=2 SV=1)

HSP 1 Score: 550.1 bits (1416), Expect = 2.0e-155
Identity = 267/367 (72.75%), Postives = 305/367 (83.11%), Query Frame = 0

Query: 8   FVALSLLAVLAIGGDFLSGESDGDFI--IRQVVDDGGSNGDDLLLGAEHHFSLFKRKFRK 67
           F+  + L   ++G   +SGE    F+  IRQVV +     D+ LL AEHHF+LFK K+ K
Sbjct: 8   FLIAATLLAGSLGSTVISGEVTDGFVNPIRQVVPE---ENDEQLLNAEHHFTLFKSKYEK 67

Query: 68  SYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRRLG 127
           +YA+Q EHDHRFRVFKANLRRA+R+Q LDPSA HGVTQFSDLTP EFRR FLGL+ R   
Sbjct: 68  TYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFR 127

Query: 128 LPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKL 187
           LP D   APILPT  LPT+FDWR+ GAVT VKNQG CGSCWSFS  GALEGA+FLAT +L
Sbjct: 128 LPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKEL 187

Query: 188 VSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGT 247
           VSLSEQQLVDCDHECDP +  SCDSGC GGLMN+AFEY LKAGGLMKE+DYPYTG D   
Sbjct: 188 VSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTA 247

Query: 248 CKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKH 307
           CKFDKSKI ASV+NFSV+S DE+QIAANLV++GPLAIAINA++MQTY+GGVSCPY+CSK 
Sbjct: 248 CKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKS 307

Query: 308 LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRG-RNICGVDSMVST 367
            DHGVLLVG+GS+GYAPIRLKEKPYWIIKNSWGA WGE+GYYKICRG  N+CG+D+MVST
Sbjct: 308 QDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVST 367

Query: 368 VAAVHTA 372
           VAAVHT+
Sbjct: 368 VAAVHTS 371

BLAST of Tan0003209 vs. ExPASy Swiss-Prot
Match: P43295 (Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 SV=2)

HSP 1 Score: 545.4 bits (1404), Expect = 4.9e-154
Identity = 260/339 (76.70%), Postives = 295/339 (87.02%), Query Frame = 0

Query: 29  DGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFKRKFRKSYASQEEHDHRFRVFKANLRRAQ 88
           D D +IRQVVD+     +  +L +E HF+LFK+KF K Y S EEH +RF VFKANL RA 
Sbjct: 25  DEDVLIRQVVDE----TEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAM 84

Query: 89  RHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRRLGLPVDANKAPILPTDGLPTDFDWR 148
           RHQ +DPSA HGVTQFSDLT SEFRR  LG++G    LP DAN+APILPT  LP +FDWR
Sbjct: 85  RHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG-GFKLPKDANQAPILPTQNLPEEFDWR 144

Query: 149 DHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSC 208
           D GAVT VKNQGSCGSCWSFSTTGALEGA+FLATGKLVSLSEQQLVDCDHECDPEE+GSC
Sbjct: 145 DRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSC 204

Query: 209 DSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGTCKFDKSKIAASVANFSVISLDEE 268
           DSGC GGLMNSAFEYTLK GGLM+EKDYPYTGTD G+CK D+SKI ASV+NFSV+S++E+
Sbjct: 205 DSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINED 264

Query: 269 QIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKHLDHGVLLVGYGSAGYAPIRLKEK 328
           QIAANL+KNGPLA+AINA +MQTY+GGVSCPYICS+ L+HGVLLVGYGSAG++  RLKEK
Sbjct: 265 QIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEK 324

Query: 329 PYWIIKNSWGANWGENGYYKICRGRNICGVDSMVSTVAA 368
           PYWIIKNSWG +WGENG+YKIC+GRNICGVDS+VSTVAA
Sbjct: 325 PYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA 358

BLAST of Tan0003209 vs. ExPASy Swiss-Prot
Match: P25804 (Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1)

HSP 1 Score: 530.8 bits (1366), Expect = 1.3e-149
Identity = 257/360 (71.39%), Postives = 300/360 (83.33%), Query Frame = 0

Query: 12  SLLAVLAIGGDFLSGESDGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFKRKFRKSYASQE 71
           +L    A+        ++ DFIIRQVVD    N +D LL AEHHF+ FK KF KSYA++E
Sbjct: 8   ALFLFAAVATAVTDDTNNDDFIIRQVVD----NEEDHLLNAEHHFTSFKSKFSKSYATKE 67

Query: 72  EHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRRLGLPVDAN 131
           EHD+RF VFK+NL +A+ HQ  DP+A HG+T+FSDLT SEFRR FLGL+ +RL LP  A 
Sbjct: 68  EHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLK-KRLRLPAHAQ 127

Query: 132 KAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQ 191
           KAPILPT  LP DFDWR+ GAVT VK+QGSCGSCW+FSTTGALEGA++LATGKLVSLSEQ
Sbjct: 128 KAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQ 187

Query: 192 QLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGTCKFDKS 251
           QLVDCDH CDPE+ GSCDSGC GGLMN+AFEY L++GG+++EKDY YTG D G+CKFDKS
Sbjct: 188 QLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRD-GSCKFDKS 247

Query: 252 KIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSK-HLDHGV 311
           K+ ASV+NFSV++LDE+QIAANLVKNGPLA+AINA +MQTY+ GVSCPY+C+K  LDHGV
Sbjct: 248 KVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYVCAKSRLDHGV 307

Query: 312 LLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNICGVDSMVSTVAAVHT 371
           LLVG+G   YAPIRLKEKPYWIIKNSWG NWGE GYYKICRGRN+CGVDSMVSTVAA  +
Sbjct: 308 LLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVSTVAAAQS 361

BLAST of Tan0003209 vs. ExPASy Swiss-Prot
Match: Q10716 (Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 1.3e-143
Identity = 257/378 (67.99%), Postives = 294/378 (77.78%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFK 60
           M  R+ L ++L+  A +A   D        D +IRQVV  G  N  DL L AE HF  F 
Sbjct: 1   MAHRVLLLLSLASAAAVAAAVD------AEDPLIRQVVPGGDDN--DLELNAESHFLSFV 60

Query: 61  RKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLR 120
           ++F KSY   +EH +R  VFK NLRRA+RHQ LDPSA HGVT+FSDLTP+EFRR++LGLR
Sbjct: 61  QRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLR 120

Query: 121 GRRLG----LPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
             R      L   A++AP+LPTDGLP DFDWRDHGAV  VKNQGSCGSCWSFS +GALEG
Sbjct: 121 KSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           A++LATGKL  LSEQQ VDCDHECD  E  SCDSGC GGLM +AF Y  KAGGL  EKDY
Sbjct: 181 AHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTG+D G CKFDKSKI ASV NFSV+S+DE QI+ANL+K+GPLAI INA +MQTY+GGV
Sbjct: 241 PYTGSD-GKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRG---R 360
           SCPYIC +HLDHGVLLVGYG++G+APIRLK+KPYWIIKNSWG NWGENGYYKICRG   R
Sbjct: 301 SCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVR 360

Query: 361 NICGVDSMVSTVAAVHTA 372
           N CGVDSMVSTV+AVH +
Sbjct: 361 NKCGVDSMVSTVSAVHAS 369

BLAST of Tan0003209 vs. NCBI nr
Match: ARA15252.1 (cysteine proteinase 2 [Citrullus lanatus])

HSP 1 Score: 721.1 bits (1860), Expect = 5.0e-204
Identity = 353/377 (93.63%), Postives = 359/377 (95.23%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDD----GGSNGDDLLLGAEHHF 60
           MVK  SL VALSLLAV AIG +  SGESDGDF+IRQVVDD    GGSNGDDLLLGAEHHF
Sbjct: 1   MVKCFSLIVALSLLAVSAIGAEVFSGESDGDFVIRQVVDDGGVNGGSNGDDLLLGAEHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           SLFKRKF KSYAS+EEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SLFKRKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLRGRRLGLP DANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG
Sbjct: 121 LGLRGRRLGLPADANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEK SCDSGC GGLMNSAFEYTLKAGGLMKE DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMKENDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLAIAINAVFMQTY+GGV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAIAINAVFMQTYIGGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICSKHLDHGVLLVGYGS  YAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSKHLDHGVLLVGYGSGAYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. NCBI nr
Match: XP_038901827.1 (cysteine protease RD19A-like [Benincasa hispida])

HSP 1 Score: 709.5 bits (1830), Expect = 1.5e-200
Identity = 348/377 (92.31%), Postives = 356/377 (94.43%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDD----GGSNGDDLLLGAEHHF 60
           MVK  SL VALSLLAV AIG +  SGESDGDFIIRQVVDD    GGSNGDDLLLGAEHHF
Sbjct: 1   MVKCFSLIVALSLLAVAAIGAEVFSGESDGDFIIRQVVDDGVVNGGSNGDDLLLGAEHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           SLFKRKF KSYAS+EEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTP EFR SF
Sbjct: 61  SLFKRKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPLEFRSSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLRGRRL LP DANKAP+LPTD LPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG
Sbjct: 121 LGLRGRRLRLPADANKAPVLPTDDLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEK SCDSGC GGLMNSAFEYTLKAGGLMKEKDY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGGLMNSAFEYTLKAGGLMKEKDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTY+ GV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYIRGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICSKHLDHGVLLVGYGS GYAPIR+K+KPYWIIKNSWGANWGENGYYKICRGRN+C
Sbjct: 301 SCPYICSKHLDHGVLLVGYGSDGYAPIRMKDKPYWIIKNSWGANWGENGYYKICRGRNVC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. NCBI nr
Match: XP_004150061.1 (cysteine protease RD19A [Cucumis sativus] >KGN61836.1 hypothetical protein Csa_006176 [Cucumis sativus])

HSP 1 Score: 703.7 bits (1815), Expect = 8.2e-199
Identity = 342/377 (90.72%), Postives = 357/377 (94.69%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDG----GSNGDDLLLGAEHHF 60
           MVK  SL V LSLLA  AIG + +SGESDGDFIIRQVVDDG    GSNGDDLLLGA+HHF
Sbjct: 1   MVKCFSLIVVLSLLAASAIGSEVISGESDGDFIIRQVVDDGGVNEGSNGDDLLLGADHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           S+FK+KF KSYAS+EEHDHRFRVFKANL+RAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLR RRLGLP DANKAPILPTDGLPTDFDWRD GAV+EVKNQGSCGSCWSFS TGALEG
Sbjct: 121 LGLRSRRLGLPADANKAPILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGC GGLMNSAFEYTLK+GGLMKE+DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLA+AINAVFMQTY+ GV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIKGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICSKHLDHGVLLVGYGS GYAPIRLK+KPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. NCBI nr
Match: XP_023006920.1 (cysteine protease RD19A-like [Cucurbita maxima])

HSP 1 Score: 703.0 bits (1813), Expect = 1.4e-198
Identity = 343/378 (90.74%), Postives = 356/378 (94.18%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDD-----GGSNGDDLLLGAEHH 60
           MVK  SLFVALSLLAV AIGGD  SG+  GD IIRQVVDD     GGSNGDDLLLGAEHH
Sbjct: 1   MVKCFSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHH 60

Query: 61  FSLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRS 120
           FS+FK+KF KSYAS+EEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRR 
Sbjct: 61  FSVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRL 120

Query: 121 FLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALE 180
           FLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFS+TGALE
Sbjct: 121 FLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALE 180

Query: 181 GANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKD 240
           GANFL++G+LVSLSEQQLVDCDHECDPEEKGSCD+GC GGLMNSAFEYTLKAGGLMKEKD
Sbjct: 181 GANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKD 240

Query: 241 YPYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGG 300
           YPYTGTDRG CKFDK+KIAASVANFSVISLDEEQIAANLVKNGPLAI INAVFMQTY+GG
Sbjct: 241 YPYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGG 300

Query: 301 VSCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNI 360
           VSCPYICSKHLDHGVLLVGYGSA YAPIR+KEKPYWIIKNSWGA WGENGYYK+CRGRNI
Sbjct: 301 VSCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNI 360

Query: 361 CGVDSMVSTVAAVHTASN 374
           CGVDSMVSTVAAVH  SN
Sbjct: 361 CGVDSMVSTVAAVHITSN 378

BLAST of Tan0003209 vs. NCBI nr
Match: XP_008460996.1 (PREDICTED: cysteine proteinase RD19a-like [Cucumis melo] >KAA0045619.1 cysteine proteinase RD19a-like [Cucumis melo var. makuwa] >TYK02636.1 cysteine proteinase RD19a-like [Cucumis melo var. makuwa])

HSP 1 Score: 702.6 bits (1812), Expect = 1.8e-198
Identity = 341/377 (90.45%), Postives = 357/377 (94.69%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGG----SNGDDLLLGAEHHF 60
           MVK  SL +ALSLLA  AIG + +SGESDGDFIIRQVVDDGG    SNGDDLLLGAEHHF
Sbjct: 1   MVKCFSLIIALSLLAASAIGKEVISGESDGDFIIRQVVDDGGVNGASNGDDLLLGAEHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           S+FK+KF KSYAS+EEHDHRFRVFKANL+RAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLR RRLGLP DANKAPILPTDGLP DFDWRD GAVTEVKNQGSCGSCWSFS TGALEG
Sbjct: 121 LGLRSRRLGLPEDANKAPILPTDGLPADFDWRDKGAVTEVKNQGSCGSCWSFSATGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGC GGLMNSAFEYTLK+GGLMKE+DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLA+AINAVFMQTY+ GV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIRGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICS+HLDHGVLLVGYGS GYAPIR+K+KPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSRHLDHGVLLVGYGSEGYAPIRMKDKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. ExPASy TrEMBL
Match: A0A7R6B0V1 (Cysteine proteinase 2 OS=Citrullus lanatus OX=3654 PE=2 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 2.4e-204
Identity = 353/377 (93.63%), Postives = 359/377 (95.23%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDD----GGSNGDDLLLGAEHHF 60
           MVK  SL VALSLLAV AIG +  SGESDGDF+IRQVVDD    GGSNGDDLLLGAEHHF
Sbjct: 1   MVKCFSLIVALSLLAVSAIGAEVFSGESDGDFVIRQVVDDGGVNGGSNGDDLLLGAEHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           SLFKRKF KSYAS+EEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SLFKRKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLRGRRLGLP DANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG
Sbjct: 121 LGLRGRRLGLPADANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEK SCDSGC GGLMNSAFEYTLKAGGLMKE DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCNGGLMNSAFEYTLKAGGLMKENDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLAIAINAVFMQTY+GGV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAIAINAVFMQTYIGGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICSKHLDHGVLLVGYGS  YAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSKHLDHGVLLVGYGSGAYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. ExPASy TrEMBL
Match: A0A0A0LPD0 (Cysteine protease OS=Cucumis sativus OX=3659 GN=Csa_2G249900 PE=3 SV=1)

HSP 1 Score: 703.7 bits (1815), Expect = 4.0e-199
Identity = 342/377 (90.72%), Postives = 357/377 (94.69%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDG----GSNGDDLLLGAEHHF 60
           MVK  SL V LSLLA  AIG + +SGESDGDFIIRQVVDDG    GSNGDDLLLGA+HHF
Sbjct: 1   MVKCFSLIVVLSLLAASAIGSEVISGESDGDFIIRQVVDDGGVNEGSNGDDLLLGADHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           S+FK+KF KSYAS+EEHDHRFRVFKANL+RAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLR RRLGLP DANKAPILPTDGLPTDFDWRD GAV+EVKNQGSCGSCWSFS TGALEG
Sbjct: 121 LGLRSRRLGLPADANKAPILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGC GGLMNSAFEYTLK+GGLMKE+DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLA+AINAVFMQTY+ GV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIKGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICSKHLDHGVLLVGYGS GYAPIRLK+KPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. ExPASy TrEMBL
Match: A0A6J1L1J1 (cysteine protease RD19A-like OS=Cucurbita maxima OX=3661 GN=LOC111499563 PE=3 SV=1)

HSP 1 Score: 703.0 bits (1813), Expect = 6.8e-199
Identity = 343/378 (90.74%), Postives = 356/378 (94.18%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDD-----GGSNGDDLLLGAEHH 60
           MVK  SLFVALSLLAV AIGGD  SG+  GD IIRQVVDD     GGSNGDDLLLGAEHH
Sbjct: 1   MVKCFSLFVALSLLAVSAIGGDHFSGDGYGDSIIRQVVDDGGVNGGGSNGDDLLLGAEHH 60

Query: 61  FSLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRS 120
           FS+FK+KF KSYAS+EEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRR 
Sbjct: 61  FSVFKQKFGKSYASKEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRL 120

Query: 121 FLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALE 180
           FLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFS+TGALE
Sbjct: 121 FLGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSSTGALE 180

Query: 181 GANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKD 240
           GANFL++G+LVSLSEQQLVDCDHECDPEEKGSCD+GC GGLMNSAFEYTLKAGGLMKEKD
Sbjct: 181 GANFLSSGELVSLSEQQLVDCDHECDPEEKGSCDAGCSGGLMNSAFEYTLKAGGLMKEKD 240

Query: 241 YPYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGG 300
           YPYTGTDRG CKFDK+KIAASVANFSVISLDEEQIAANLVKNGPLAI INAVFMQTY+GG
Sbjct: 241 YPYTGTDRGACKFDKTKIAASVANFSVISLDEEQIAANLVKNGPLAIGINAVFMQTYIGG 300

Query: 301 VSCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNI 360
           VSCPYICSKHLDHGVLLVGYGSA YAPIR+KEKPYWIIKNSWGA WGENGYYK+CRGRNI
Sbjct: 301 VSCPYICSKHLDHGVLLVGYGSAAYAPIRMKEKPYWIIKNSWGAKWGENGYYKLCRGRNI 360

Query: 361 CGVDSMVSTVAAVHTASN 374
           CGVDSMVSTVAAVH  SN
Sbjct: 361 CGVDSMVSTVAAVHITSN 378

BLAST of Tan0003209 vs. ExPASy TrEMBL
Match: A0A5D3BWN5 (Cysteine proteinase RD19a-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold280G00440 PE=3 SV=1)

HSP 1 Score: 702.6 bits (1812), Expect = 8.9e-199
Identity = 341/377 (90.45%), Postives = 357/377 (94.69%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGG----SNGDDLLLGAEHHF 60
           MVK  SL +ALSLLA  AIG + +SGESDGDFIIRQVVDDGG    SNGDDLLLGAEHHF
Sbjct: 1   MVKCFSLIIALSLLAASAIGKEVISGESDGDFIIRQVVDDGGVNGASNGDDLLLGAEHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           S+FK+KF KSYAS+EEHDHRFRVFKANL+RAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLR RRLGLP DANKAPILPTDGLP DFDWRD GAVTEVKNQGSCGSCWSFS TGALEG
Sbjct: 121 LGLRSRRLGLPEDANKAPILPTDGLPADFDWRDKGAVTEVKNQGSCGSCWSFSATGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGC GGLMNSAFEYTLK+GGLMKE+DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLA+AINAVFMQTY+ GV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIRGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICS+HLDHGVLLVGYGS GYAPIR+K+KPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSRHLDHGVLLVGYGSEGYAPIRMKDKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. ExPASy TrEMBL
Match: A0A1S3CE63 (cysteine proteinase RD19a-like OS=Cucumis melo OX=3656 GN=LOC103499710 PE=3 SV=1)

HSP 1 Score: 702.6 bits (1812), Expect = 8.9e-199
Identity = 341/377 (90.45%), Postives = 357/377 (94.69%), Query Frame = 0

Query: 1   MVKRLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGG----SNGDDLLLGAEHHF 60
           MVK  SL +ALSLLA  AIG + +SGESDGDFIIRQVVDDGG    SNGDDLLLGAEHHF
Sbjct: 1   MVKCFSLIIALSLLAASAIGKEVISGESDGDFIIRQVVDDGGVNGASNGDDLLLGAEHHF 60

Query: 61  SLFKRKFRKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120
           S+FK+KF KSYAS+EEHDHRFRVFKANL+RAQRHQALDPSATHGVTQFSDLTPSEFRRSF
Sbjct: 61  SVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSF 120

Query: 121 LGLRGRRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEG 180
           LGLR RRLGLP DANKAPILPTDGLP DFDWRD GAVTEVKNQGSCGSCWSFS TGALEG
Sbjct: 121 LGLRSRRLGLPEDANKAPILPTDGLPADFDWRDKGAVTEVKNQGSCGSCWSFSATGALEG 180

Query: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDY 240
           ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGC GGLMNSAFEYTLK+GGLMKE+DY
Sbjct: 181 ANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQDY 240

Query: 241 PYTGTDRGTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGV 300
           PYTGTDRGTCKFDKSKIAASVANFSV+SLDEEQIAANLVKNGPLA+AINAVFMQTY+ GV
Sbjct: 241 PYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYIRGV 300

Query: 301 SCPYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNIC 360
           SCPYICS+HLDHGVLLVGYGS GYAPIR+K+KPYWIIKNSWGANWGENGYYKICRGRNIC
Sbjct: 301 SCPYICSRHLDHGVLLVGYGSEGYAPIRMKDKPYWIIKNSWGANWGENGYYKICRGRNIC 360

Query: 361 GVDSMVSTVAAVHTASN 374
           GVDSMVSTVAAVHTA+N
Sbjct: 361 GVDSMVSTVAAVHTAAN 377

BLAST of Tan0003209 vs. TAIR 10
Match: AT4G39090.1 (Papain family cysteine protease )

HSP 1 Score: 562.4 bits (1448), Expect = 2.8e-160
Identity = 277/364 (76.10%), Postives = 305/364 (83.79%), Query Frame = 0

Query: 4   RLSLFVALSLLAVLAIGGDFLSGESDGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFKRKF 63
           RL L+ ++ +L+   +           D +IRQVV  GG+  +  +L +E HFSLFKRKF
Sbjct: 3   RLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVV--GGA--EPQVLTSEDHFSLFKRKF 62

Query: 64  RKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRR 123
            K YAS EEHD+RF VFKANLRRA+RHQ LDPSATHGVTQFSDLT SEFR+  LG+R   
Sbjct: 63  GKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS-G 122

Query: 124 LGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATG 183
             LP DANKAPILPT+ LP DFDWRDHGAVT VKNQGSCGSCWSFS TGALEGANFLATG
Sbjct: 123 FKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATG 182

Query: 184 KLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDR 243
           KLVSLSEQQLVDCDHECDPEE  SCDSGC GGLMNSAFEYTLK GGLMKE+DYPYTG D 
Sbjct: 183 KLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDG 242

Query: 244 GTCKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICS 303
            TCK DKSKI ASV+NFSVIS+DEEQIAANLVKNGPLA+AINA +MQTY+GGVSCPYIC+
Sbjct: 243 KTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICT 302

Query: 304 KHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNICGVDSMVS 363
           + L+HGVLLVGYG+AGYAP R KEKPYWIIKNSWG  WGENG+YKIC+GRNICGVDSMVS
Sbjct: 303 RRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVS 361

Query: 364 TVAA 368
           TVAA
Sbjct: 363 TVAA 361

BLAST of Tan0003209 vs. TAIR 10
Match: AT4G16190.1 (Papain family cysteine protease )

HSP 1 Score: 550.1 bits (1416), Expect = 1.4e-156
Identity = 267/367 (72.75%), Postives = 305/367 (83.11%), Query Frame = 0

Query: 8   FVALSLLAVLAIGGDFLSGESDGDFI--IRQVVDDGGSNGDDLLLGAEHHFSLFKRKFRK 67
           F+  + L   ++G   +SGE    F+  IRQVV +     D+ LL AEHHF+LFK K+ K
Sbjct: 8   FLIAATLLAGSLGSTVISGEVTDGFVNPIRQVVPE---ENDEQLLNAEHHFTLFKSKYEK 67

Query: 68  SYASQEEHDHRFRVFKANLRRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRRLG 127
           +YA+Q EHDHRFRVFKANLRRA+R+Q LDPSA HGVTQFSDLTP EFRR FLGL+ R   
Sbjct: 68  TYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFR 127

Query: 128 LPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKL 187
           LP D   APILPT  LPT+FDWR+ GAVT VKNQG CGSCWSFS  GALEGA+FLAT +L
Sbjct: 128 LPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKEL 187

Query: 188 VSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGT 247
           VSLSEQQLVDCDHECDP +  SCDSGC GGLMN+AFEY LKAGGLMKE+DYPYTG D   
Sbjct: 188 VSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTA 247

Query: 248 CKFDKSKIAASVANFSVISLDEEQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKH 307
           CKFDKSKI ASV+NFSV+S DE+QIAANLV++GPLAIAINA++MQTY+GGVSCPY+CSK 
Sbjct: 248 CKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKS 307

Query: 308 LDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRG-RNICGVDSMVST 367
            DHGVLLVG+GS+GYAPIRLKEKPYWIIKNSWGA WGE+GYYKICRG  N+CG+D+MVST
Sbjct: 308 QDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVST 367

Query: 368 VAAVHTA 372
           VAAVHT+
Sbjct: 368 VAAVHTS 371

BLAST of Tan0003209 vs. TAIR 10
Match: AT2G21430.1 (Papain family cysteine protease )

HSP 1 Score: 545.4 bits (1404), Expect = 3.5e-155
Identity = 260/339 (76.70%), Postives = 295/339 (87.02%), Query Frame = 0

Query: 29  DGDFIIRQVVDDGGSNGDDLLLGAEHHFSLFKRKFRKSYASQEEHDHRFRVFKANLRRAQ 88
           D D +IRQVVD+     +  +L +E HF+LFK+KF K Y S EEH +RF VFKANL RA 
Sbjct: 25  DEDVLIRQVVDE----TEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAM 84

Query: 89  RHQALDPSATHGVTQFSDLTPSEFRRSFLGLRGRRLGLPVDANKAPILPTDGLPTDFDWR 148
           RHQ +DPSA HGVTQFSDLT SEFRR  LG++G    LP DAN+APILPT  LP +FDWR
Sbjct: 85  RHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG-GFKLPKDANQAPILPTQNLPEEFDWR 144

Query: 149 DHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSC 208
           D GAVT VKNQGSCGSCWSFSTTGALEGA+FLATGKLVSLSEQQLVDCDHECDPEE+GSC
Sbjct: 145 DRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSC 204

Query: 209 DSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGTCKFDKSKIAASVANFSVISLDEE 268
           DSGC GGLMNSAFEYTLK GGLM+EKDYPYTGTD G+CK D+SKI ASV+NFSV+S++E+
Sbjct: 205 DSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINED 264

Query: 269 QIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKHLDHGVLLVGYGSAGYAPIRLKEK 328
           QIAANL+KNGPLA+AINA +MQTY+GGVSCPYICS+ L+HGVLLVGYGSAG++  RLKEK
Sbjct: 265 QIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEK 324

Query: 329 PYWIIKNSWGANWGENGYYKICRGRNICGVDSMVSTVAA 368
           PYWIIKNSWG +WGENG+YKIC+GRNICGVDS+VSTVAA
Sbjct: 325 PYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA 358

BLAST of Tan0003209 vs. TAIR 10
Match: AT3G54940.2 (Papain family cysteine protease )

HSP 1 Score: 432.2 bits (1110), Expect = 4.3e-121
Identity = 209/340 (61.47%), Postives = 260/340 (76.47%), Query Frame = 0

Query: 31  DFIIRQVVDDGGSNGDDLL-LGAEHHFSLFKRKFRKSYASQEEHDHRFRVFKANLRRAQR 90
           D  IRQV  D      +LL    E  F LF   + K+Y+++EE+ HR  +F  N+ +A  
Sbjct: 25  DLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAE 84

Query: 91  HQALDPSATHGVTQFSDLTPSEFRRSFLGLR--GRRLGLPVDANKAPILPTDGLPTDFDW 150
           HQ +DPSA HGVTQFSDLT  EF+R + G+   G   G  V A +AP++  DGLP DFDW
Sbjct: 85  HQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGA-EAPMVEVDGLPEDFDW 144

Query: 151 RDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGS 210
           R+ G VTEVKNQG+CGSCW+FSTTGA EGA+F++TGKL+SLSEQQLVDCD  CDP++K +
Sbjct: 145 REKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKA 204

Query: 211 CDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGTDRGTCKFDKSKIAASVANFSVISLDE 270
           CD+GCGGGLM +A+EY ++AGGL +E+ YPYTG  RG CKFD  K+A  V NF+ I LDE
Sbjct: 205 CDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTG-KRGHCKFDPEKVAVRVLNFTTIPLDE 264

Query: 271 EQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSK-HLDHGVLLVGYGSAGYAPIRLK 330
            QIAANLV++GPLA+ +NAVFMQTY+GGVSCP ICSK +++HGVLLVGYGS G++ +RL 
Sbjct: 265 NQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLS 324

Query: 331 EKPYWIIKNSWGANWGENGYYKICRGRNICGVDSMVSTVA 367
            KPYWIIKNSWG  WGENGYYK+CRG +ICG++SMVS VA
Sbjct: 325 NKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVA 362

BLAST of Tan0003209 vs. TAIR 10
Match: AT3G19390.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 234.2 bits (596), Expect = 1.7e-61
Identity = 132/311 (42.44%), Postives = 181/311 (58.20%), Query Frame = 0

Query: 64  RKSYASQEEHDHRFRVFKANLRRAQRHQALDPSATH--GVTQFSDLTPSEFRRSFLGLRG 123
           RK+Y    E + RF +FK NL+  + H ++ P+ T+  G+T+F+DLT  EFR  +L  + 
Sbjct: 51  RKNYNGLGEKERRFEIFKDNLKFVEEHSSI-PNRTYEVGLTRFADLTNDEFRAIYLRSKM 110

Query: 124 RRLGLPVDANKAPILPTDGLPTDFDWRDHGAVTEVKNQGSCGSCWSFSTTGALEGANFLA 183
            R  +PV   K      D LP   DWR  GAV  VK+QGSCGSCW+FS  GA+EG N + 
Sbjct: 111 ERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIK 170

Query: 184 TGKLVSLSEQQLVDCDHECDPEEKGSCDSGCGGGLMNSAFEYTLKAGGLMKEKDYPYTGT 243
           TG+L+SLSEQ+LVDCD         S + GCGGGLM+ AF++ ++ GG+  E+DYPY  T
Sbjct: 171 TGELISLSEQELVDCD--------TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIAT 230

Query: 244 DRGTCKFDKSKI-AASVANFSVISLDEEQIAANLVKNGPLAIAINA--VFMQTYVGGVSC 303
           D   C  DK      ++  +  +  ++E+     + N P+++AI A     Q Y  GV  
Sbjct: 231 DVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFT 290

Query: 304 PYICSKHLDHGVLLVGYGSAGYAPIRLKEKPYWIIKNSWGANWGENGYYKICRGRNI--- 363
              C   LDHGV+ VGYGS G        + YWI++NSWG+NWGE+GY+K+   RNI   
Sbjct: 291 G-TCGTSLDHGVVAVGYGSEG-------GQDYWIVRNSWGSNWGESGYFKL--ERNIKES 342

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P432963.9e-15976.10Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1[more]
Q9SUL12.0e-15572.75Probable cysteine protease RD19C OS=Arabidopsis thaliana OX=3702 GN=RD19C PE=2 S... [more]
P432954.9e-15476.70Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 S... [more]
P258041.3e-14971.39Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1[more]
Q107161.3e-14367.99Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
ARA15252.15.0e-20493.63cysteine proteinase 2 [Citrullus lanatus][more]
XP_038901827.11.5e-20092.31cysteine protease RD19A-like [Benincasa hispida][more]
XP_004150061.18.2e-19990.72cysteine protease RD19A [Cucumis sativus] >KGN61836.1 hypothetical protein Csa_0... [more]
XP_023006920.11.4e-19890.74cysteine protease RD19A-like [Cucurbita maxima][more]
XP_008460996.11.8e-19890.45PREDICTED: cysteine proteinase RD19a-like [Cucumis melo] >KAA0045619.1 cysteine ... [more]
Match NameE-valueIdentityDescription
A0A7R6B0V12.4e-20493.63Cysteine proteinase 2 OS=Citrullus lanatus OX=3654 PE=2 SV=1[more]
A0A0A0LPD04.0e-19990.72Cysteine protease OS=Cucumis sativus OX=3659 GN=Csa_2G249900 PE=3 SV=1[more]
A0A6J1L1J16.8e-19990.74cysteine protease RD19A-like OS=Cucurbita maxima OX=3661 GN=LOC111499563 PE=3 SV... [more]
A0A5D3BWN58.9e-19990.45Cysteine proteinase RD19a-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... [more]
A0A1S3CE638.9e-19990.45cysteine proteinase RD19a-like OS=Cucumis melo OX=3656 GN=LOC103499710 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G39090.12.8e-16076.10Papain family cysteine protease [more]
AT4G16190.11.4e-15672.75Papain family cysteine protease [more]
AT2G21430.13.5e-15576.70Papain family cysteine protease [more]
AT3G54940.24.3e-12161.47Papain family cysteine protease [more]
AT3G19390.11.7e-6142.44Granulin repeat cysteine protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 330..336
score: 75.14
coord: 159..174
score: 64.12
coord: 308..318
score: 58.8
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 141..366
e-value: 1.5E-110
score: 383.2
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 141..364
e-value: 3.5E-74
score: 249.6
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 56..112
e-value: 5.9E-23
score: 92.3
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 56..112
e-value: 2.0E-12
score: 47.3
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 42..366
e-value: 3.1E-99
score: 334.8
NoneNo IPR availablePANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 31..367
NoneNo IPR availablePANTHERPTHR12411:SF783CYSTEINE PROTEASE RD19C-RELATEDcoord: 31..367
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 306..316
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 330..349
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 159..170
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 142..365
e-value: 9.82402E-98
score: 287.598
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 54..365

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0003209.1Tan0003209.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005615 extracellular space
cellular_component GO:0005764 lysosome
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity