Tan0001049 (gene) Snake gourd v1

Overview
NameTan0001049
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
LocationLG02: 67780010 .. 67781259 (+)
RNA-Seq ExpressionTan0001049
SyntenyTan0001049
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAAAGAATCGATTGTCTAAGGAGTATGAGTTGGGTGTTGAAACATTCATTGAATTTAGTTTACACCATGCGAATATGGTTCCAACTTCATTCGATGTCCATGTTTGAAATGTGGGAATCGTGTCTCCAGTGATGTTGCAACCATTAGAAGTCACTTGTATGTCAATGGTATGGATCAAAGTTATAAGACATGGTATTGGCACGGTGAAGACTTGGAAGCAGAGTATGTAAGTGATGGAGTTAGGGATTTCATAAATGAAAAATATGTTGATGATAATGACTTGTATAATGTGATTGACATGGTTCAAGCTGCTCATGCTCAATTTGATATGTTATTTCGTGTCCGTTGGAATGATATTTGTTATATATTCATATTTTATTACACTTATAGGTATCTATACAAACTTATTGAAAAGTCAAATGAATTAAGTTTGTACAAGTTTTTGGATGCTAGTACAATTTCATTATCTAGATCATCGTCAGAATCACGAGCCCAACTTTTGATTGCACGATTGGGTGGAATGGATCCAAATCAGATACTCATTTTTCCTTATAATTCTGGGTATGTCATGTTCGTATACATTGATTACTTCTGTCTGTGTGAGTTATTGATCATGTGTATTATCCTAATTGGTAGAAATCATTGGACTTTGATCATCATTGACCACATCAAGAGCGTTGCATTTTGTATGGACCCCTTAAAAAATCGACTTCATGAGGACATCATTGTTGTAGTCGGCATGTAAGTTGAACTCATCCTATATTTTAGAATTCAACCCTAATTAGATTATTTGAATAGTAATAGACTCTTATATTCATCCACTTGTAGGGCATTCAAAGTTGTAAAAAAAAGAAAACCTATTTGGAAGTCTGTTAAGGTAGGTTATTTAATAAATAACAACTATTATTCAACATACGTATATTCGTTATTGAGACATTGATGTTTTGATGTTATACGATAGTGCCCCAAACAACCAGGTGTCGTAGAATGCGATTATTATGTTATGCGTTTTATGCGTGAGATAATCTACCAAAAAAATACTCCAATCATTGATCTAGTACGTATATTCCTTAAACATTACAATGATGGATATAATAAAAGTAAGTTTCTGGTTTCCAATTGTTCATTTCTGTTTTTTTCTTGTTTACAGATGAAAGATGCACCGTATACTTATACACAAAGTGATATCGACATTGTGAGAATTGAGTGGGCGGAGTTTGTAGGAACGCACATATTATTTGTATAA

mRNA sequence

ATGAAAAAGAATCGATTGGCATTCAAAGTTGTAAAAAAAAGAAAACCTATTTGGAAGTCTGTTAAGTGCCCCAAACAACCAGGTGTCGTAGAATGCGATTATTATGTTATGCGTTTTATGCGTGAGATAATCTACCAAAAAAATACTCCAATCATTGATCTAATGAAAGATGCACCGTATACTTATACACAAAGTGATATCGACATTGTGAGAATTGAGTGGGCGGAGTTTGTAGGAACGCACATATTATTTGTATAA

Coding sequence (CDS)

ATGAAAAAGAATCGATTGGCATTCAAAGTTGTAAAAAAAAGAAAACCTATTTGGAAGTCTGTTAAGTGCCCCAAACAACCAGGTGTCGTAGAATGCGATTATTATGTTATGCGTTTTATGCGTGAGATAATCTACCAAAAAAATACTCCAATCATTGATCTAATGAAAGATGCACCGTATACTTATACACAAAGTGATATCGACATTGTGAGAATTGAGTGGGCGGAGTTTGTAGGAACGCACATATTATTTGTATAA

Protein sequence

MKKNRLAFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSDIDIVRIEWAEFVGTHILFV
Homology
BLAST of Tan0001049 vs. NCBI nr
Match: KAA0067083.1 (uncharacterized protein E6C27_scaffold38G001360 [Cucumis melo var. makuwa] >TYK26318.1 uncharacterized protein E5676_scaffold14G00990 [Cucumis melo var. makuwa])

HSP 1 Score: 114.4 bits (285), Expect = 4.9e-22
Identity = 50/76 (65.79%), Postives = 61/76 (80.26%), Query Frame = 0

Query: 7   AFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSD 66
           +F ++KK+KP W+ VKCPKQ G VEC YYVMRFMR+II   +T IID+MKD+P TYTQ D
Sbjct: 445 SFNIMKKKKPNWRIVKCPKQSGKVECGYYVMRFMRDIILSMSTSIIDIMKDSPRTYTQDD 504

Query: 67  IDIVRIEWAEFVGTHI 83
           ID +R EWAEFVG H+
Sbjct: 505 IDCIRSEWAEFVGKHV 520

BLAST of Tan0001049 vs. NCBI nr
Match: TYK30805.1 (transposase [Cucumis melo var. makuwa])

HSP 1 Score: 112.1 bits (279), Expect = 2.4e-21
Identity = 48/76 (63.16%), Postives = 60/76 (78.95%), Query Frame = 0

Query: 7   AFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSD 66
           +F ++ K+KP W+ VKCPKQ G+VEC YYVMRFMR+II   +T II +MKD+P TYTQ D
Sbjct: 737 SFNIMNKKKPNWRVVKCPKQSGLVECGYYVMRFMRDIIMSASTSIIQIMKDSPRTYTQDD 796

Query: 67  IDIVRIEWAEFVGTHI 83
           ID +R EWAEFVG H+
Sbjct: 797 IDCIRFEWAEFVGKHV 812

BLAST of Tan0001049 vs. NCBI nr
Match: KAA0054150.1 (uncharacterized protein E6C27_scaffold131G00730 [Cucumis melo var. makuwa])

HSP 1 Score: 112.1 bits (279), Expect = 2.4e-21
Identity = 48/77 (62.34%), Postives = 63/77 (81.82%), Query Frame = 0

Query: 6   LAFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQS 65
           ++F ++KK+KP W+ +KCPK+ GVVEC YYVMRFMR+II   +T IID+MKD+P TYTQ 
Sbjct: 196 MSFNIMKKKKPNWRVMKCPKRSGVVECGYYVMRFMRDIILSTSTFIIDMMKDSPRTYTQD 255

Query: 66  DIDIVRIEWAEFVGTHI 83
           DID +R +WAEFVG H+
Sbjct: 256 DIDCIRSKWAEFVGKHV 272

BLAST of Tan0001049 vs. NCBI nr
Match: KAA0043076.1 (transposase [Cucumis melo var. makuwa])

HSP 1 Score: 112.1 bits (279), Expect = 2.4e-21
Identity = 48/76 (63.16%), Postives = 60/76 (78.95%), Query Frame = 0

Query: 7   AFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSD 66
           +F ++ K+KP W+ VKCPKQ G+VEC YYVMRFMR+II   +T II +MKD+P TYTQ D
Sbjct: 637 SFNIMNKKKPNWRVVKCPKQSGLVECGYYVMRFMRDIIMSASTSIIQIMKDSPRTYTQDD 696

Query: 67  IDIVRIEWAEFVGTHI 83
           ID +R EWAEFVG H+
Sbjct: 697 IDCIRFEWAEFVGKHV 712

BLAST of Tan0001049 vs. NCBI nr
Match: KAA0031799.1 (uncharacterized protein E6C27_scaffold848G00410 [Cucumis melo var. makuwa])

HSP 1 Score: 111.7 bits (278), Expect = 3.1e-21
Identity = 49/80 (61.25%), Postives = 60/80 (75.00%), Query Frame = 0

Query: 3   KNRLAFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTY 62
           K   +F ++ K+KP W+ VKCPKQ GVVEC YYVMRFMR+II   +T II +MKD+P  Y
Sbjct: 243 KQMWSFNIMNKKKPAWRVVKCPKQSGVVECGYYVMRFMRDIIMSTSTSIIQIMKDSPRAY 302

Query: 63  TQSDIDIVRIEWAEFVGTHI 83
           TQ DID +R EWAEFVG H+
Sbjct: 303 TQDDIDCIRSEWAEFVGKHV 322

BLAST of Tan0001049 vs. ExPASy TrEMBL
Match: A0A5D3DS26 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold14G00990 PE=3 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 2.4e-22
Identity = 50/76 (65.79%), Postives = 61/76 (80.26%), Query Frame = 0

Query: 7   AFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSD 66
           +F ++KK+KP W+ VKCPKQ G VEC YYVMRFMR+II   +T IID+MKD+P TYTQ D
Sbjct: 445 SFNIMKKKKPNWRIVKCPKQSGKVECGYYVMRFMRDIILSMSTSIIDIMKDSPRTYTQDD 504

Query: 67  IDIVRIEWAEFVGTHI 83
           ID +R EWAEFVG H+
Sbjct: 505 IDCIRSEWAEFVGKHV 520

BLAST of Tan0001049 vs. ExPASy TrEMBL
Match: A0A5A7UFX9 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold131G00730 PE=3 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 1.2e-21
Identity = 48/77 (62.34%), Postives = 63/77 (81.82%), Query Frame = 0

Query: 6   LAFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQS 65
           ++F ++KK+KP W+ +KCPK+ GVVEC YYVMRFMR+II   +T IID+MKD+P TYTQ 
Sbjct: 196 MSFNIMKKKKPNWRVMKCPKRSGVVECGYYVMRFMRDIILSTSTFIIDMMKDSPRTYTQD 255

Query: 66  DIDIVRIEWAEFVGTHI 83
           DID +R +WAEFVG H+
Sbjct: 256 DIDCIRSKWAEFVGKHV 272

BLAST of Tan0001049 vs. ExPASy TrEMBL
Match: A0A5A7TM34 (Transposase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold158G00130 PE=3 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 1.2e-21
Identity = 48/76 (63.16%), Postives = 60/76 (78.95%), Query Frame = 0

Query: 7   AFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSD 66
           +F ++ K+KP W+ VKCPKQ G+VEC YYVMRFMR+II   +T II +MKD+P TYTQ D
Sbjct: 637 SFNIMNKKKPNWRVVKCPKQSGLVECGYYVMRFMRDIIMSASTSIIQIMKDSPRTYTQDD 696

Query: 67  IDIVRIEWAEFVGTHI 83
           ID +R EWAEFVG H+
Sbjct: 697 IDCIRFEWAEFVGKHV 712

BLAST of Tan0001049 vs. ExPASy TrEMBL
Match: A0A5D3E670 (Transposase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold267G00320 PE=3 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 1.2e-21
Identity = 48/76 (63.16%), Postives = 60/76 (78.95%), Query Frame = 0

Query: 7   AFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTYTQSD 66
           +F ++ K+KP W+ VKCPKQ G+VEC YYVMRFMR+II   +T II +MKD+P TYTQ D
Sbjct: 737 SFNIMNKKKPNWRVVKCPKQSGLVECGYYVMRFMRDIIMSASTSIIQIMKDSPRTYTQDD 796

Query: 67  IDIVRIEWAEFVGTHI 83
           ID +R EWAEFVG H+
Sbjct: 797 IDCIRFEWAEFVGKHV 812

BLAST of Tan0001049 vs. ExPASy TrEMBL
Match: A0A5A7SR84 (Transpos_assoc domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00410 PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 1.5e-21
Identity = 49/80 (61.25%), Postives = 60/80 (75.00%), Query Frame = 0

Query: 3   KNRLAFKVVKKRKPIWKSVKCPKQPGVVECDYYVMRFMREIIYQKNTPIIDLMKDAPYTY 62
           K   +F ++ K+KP W+ VKCPKQ GVVEC YYVMRFMR+II   +T II +MKD+P  Y
Sbjct: 243 KQMWSFNIMNKKKPAWRVVKCPKQSGVVECGYYVMRFMRDIIMSTSTSIIQIMKDSPRAY 302

Query: 63  TQSDIDIVRIEWAEFVGTHI 83
           TQ DID +R EWAEFVG H+
Sbjct: 303 TQDDIDCIRSEWAEFVGKHV 322

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAA0067083.14.9e-2265.79uncharacterized protein E6C27_scaffold38G001360 [Cucumis melo var. makuwa] >TYK2... [more]
TYK30805.12.4e-2163.16transposase [Cucumis melo var. makuwa][more]
KAA0054150.12.4e-2162.34uncharacterized protein E6C27_scaffold131G00730 [Cucumis melo var. makuwa][more]
KAA0043076.12.4e-2163.16transposase [Cucumis melo var. makuwa][more]
KAA0031799.13.1e-2161.25uncharacterized protein E6C27_scaffold848G00410 [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
A0A5D3DS262.4e-2265.79ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A5A7UFX91.2e-2162.34ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A5A7TM341.2e-2163.16Transposase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold158G00130 PE... [more]
A0A5D3E6701.2e-2163.16Transposase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold267G00320 PE... [more]
A0A5A7SR841.5e-2161.25Transpos_assoc domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 11..78

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0001049.1Tan0001049.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity