Tan0003442 (gene) Snake gourd v1

Overview
Name: Tan0003442
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ULP_PROTEASE domain-containing protein
Location: LG02: 22463173 .. 22464162 (-)
RNA-Seq Expression: Tan0003442
Synteny: Tan0003442
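
The Location field names the linkage group, a start and end coordinate, and the strand in parentheses. The following is a minimal sketch of reading that field and computing the span of the feature. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'LG02: 22463173 .. 22464162 (-)'.

    Returns (seqid, start, end, strand). The pattern is an assumption
    based on how this record prints its Location field.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (not stated in the record).
    return end - start + 1

seqid, start, end, strand = parse_location("LG02: 22463173 .. 22464162 (-)")
print(seqid, strand, span_length(start, end))  # LG02 - 990
```

Under that assumption the gene spans 990 bp on the minus strand of LG02.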
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTCGAGAGTGATGAGAAAGAAAAAGTAAATGAAGTGCTTTTAGGGAAAGACAATGTGAAGGTATTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTTGTATTCCTTTTCCGATGAAGGGGAATATGGAGACGAGTCTTTACCAAGCTCAGAGTTGTTTTGTTGCTTGGCCTCGCAATCTCGTTATTATTTCTAAAAATGACAAGGTACGTATGCTTCTAAATAAAATTGTTTAAATCTAGTTATGTATAATGCACATTGAATTTTAATTATTATATTGCGTTCTTTATAGGATTCTAAACAAAAGAACCAACTAAACATAGTTTTCCAAGTCATACGATTTGTCTCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAATATGGTAAAGATAATCTAGTAGTTGTGCCCATAAGTGATAGAATATTTGGAAAAGGAGAAACTCTTTATCTTATGCCAGAAGATATCATGAAATTTTGTGCAGTGATAGAGATATCAAACACATGCATGTTAGTCTACATTGTGTAAGTAAATAACTTTTAATAATATTAAAAATTACTTATACTTATTAAATTATATATAGTCTGATGTATCTGTGACAAAATATGATGCAGGTTTCTTTGAAAATATTTTCAAGAGACATAAAGAATAAACATGTTTAGAGTATAAATTCGAATGACATTGCATCATGCTTTTGGTACTCTAGAAGATCGAGCAAGAAAAGTGGCGCATGTTTTTTCGCAAGTGAAACCACAACAAATAGTACTGATTTCATATAATCATGAGTAAGAGTCTCAATAATTCACTTGATTAGATTCAAGACTACTTTATTTTGATAAGAATAATTACATATATATTGGTTGGTTTTGATAAACGTCATTAGGTATTGTGCGTGGTGGATGTATTTGTGAATAATGATTATCTTTTTTATTCCCTACATCCTAGCATGGCAGACGACCTCCAAAACATTGTGAATACGTAA

mRNA sequence

ATGGTCGAGAGTGATGAGAAAGAAAAAGTAAATGAAGTGCTTTTAGGGAAAGACAATGTGAAGGTATTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTTGTATTCCTTTTCCGATGAAGGGGAATATGGAGACGAGTCTTTACCAAGCTCAGAGTTGTTTTGTTGCTTGGCCTCGCAATCTCGTTATTATTTCTAAAAATGACAAGGATTCTAAACAAAAGAACCAACTAAACATAGTTTTCCAAGTCATACGATTTGTCTCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAATATGGTAAAGATAATCTAGTAGTTGTGCCCATAAGTGATAGAATATTTGGAAAAGGAGAAACTCTTTATCTTATGCCAGAAGATATCATGAAATTTTGTGCAGTGATAGAGATATCAAACACATGCATGTTAGTCTACATTGTATTGTGCGTGGTGGATGTATTTGTGAATAATGATTATCTTTTTTATTCCCTACATCCTAGCATGGCAGACGACCTCCAAAACATTGTGAATACGTAA

Coding sequence (CDS)

ATGGTCGAGAGTGATGAGAAAGAAAAAGTAAATGAAGTGCTTTTAGGGAAAGACAATGTGAAGGTATTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTTGTATTCCTTTTCCGATGAAGGGGAATATGGAGACGAGTCTTTACCAAGCTCAGAGTTGTTTTGTTGCTTGGCCTCGCAATCTCGTTATTATTTCTAAAAATGACAAGGATTCTAAACAAAAGAACCAACTAAACATAGTTTTCCAAGTCATACGATTTGTCTCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAATATGGTAAAGATAATCTAGTAGTTGTGCCCATAAGTGATAGAATATTTGGAAAAGGAGAAACTCTTTATCTTATGCCAGAAGATATCATGAAATTTTGTGCAGTGATAGAGATATCAAACACATGCATGTTAGTCTACATTGTATTGTGCGTGGTGGATGTATTTGTGAATAATGATTATCTTTTTTATTCCCTACATCCTAGCATGGCAGACGACCTCCAAAACATTGTGAATACGTAA

Protein sequence

MVESDEKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDKDSKQKNQLNIVFQVIRFVSSCIKILYRYAEKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIVLCVVDVFVNNDYLFYSLHPSMADDLQNIVNT
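
The protein sequence corresponds to the CDS read codon by codon up to the stop codon. As a rough illustration, the sketch below translates a DNA string with the standard genetic code. The function name `translate` is hypothetical, and use of the standard code (rather than an organelle code) is an assumption for this nuclear gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGTCGAGAGTGATGAGAAAGAAAAAGTA"))  # MVESDEKEKV
```

The printed residues match the start of the protein sequence above; passing the full CDS string to the same function would be one way to check the rest.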
Homology
BLAST of Tan0003442 vs. NCBI nr
Match: KAE8649224.1 (hypothetical protein Csa_014966 [Cucumis sativus])

HSP 1 Score: 95.1 bits (235), Expect = 6.5e-16
Identity = 58/151 (38.41%), Positives = 91/151 (60.26%), Query Frame = 0

Query: 1   MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   ++ + LG DN++V +DVI VE  ++ +P P+KG +ET L QA   FVAWP
Sbjct: 390 MFESDVQCPTIHGIPLGADNIRVTVDVIMVE--DVALPIPLKGEIET-LNQAIGNFVAWP 449

Query: 61  RNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYA-EKYGKDNLVVVPISDRIF 120
           R LVI+++  K  S    +          V   IK+L RYA       +++ + +++ IF
Sbjct: 450 RKLVILTQEKKAPSMAATESTTQSSKYTDVHVTIKLLNRYAIHTMQVKDMIQINLNEHIF 509

Query: 121 GKGETLYLMPEDIMKFCAVIEISNTCMLVYI 149
           GK +T+YL P+DI+++C + EI  +C+L YI
Sbjct: 510 GKEKTIYLRPDDIIQYCGMTEIGYSCILTYI 537

BLAST of Tan0003442 vs. NCBI nr
Match: XP_031740251.1 (uncharacterized protein LOC101213947 [Cucumis sativus])

HSP 1 Score: 95.1 bits (235), Expect = 6.5e-16
Identity = 58/151 (38.41%), Positives = 91/151 (60.26%), Query Frame = 0

Query: 1   MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   ++ + LG DN++V +DVI VE  ++ +P P+KG +ET L QA   FVAWP
Sbjct: 394 MFESDVQCPTIHGIPLGADNIRVTVDVIMVE--DVALPIPLKGEIET-LNQAIGNFVAWP 453

Query: 61  RNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYA-EKYGKDNLVVVPISDRIF 120
           R LVI+++  K  S    +          V   IK+L RYA       +++ + +++ IF
Sbjct: 454 RKLVILTQEKKAPSMAATESTTQSSKYTDVHVTIKLLNRYAIHTMQVKDMIQINLNEHIF 513

Query: 121 GKGETLYLMPEDIMKFCAVIEISNTCMLVYI 149
           GK +T+YL P+DI+++C + EI  +C+L YI
Sbjct: 514 GKEKTIYLRPDDIIQYCGMTEIGYSCILTYI 541

BLAST of Tan0003442 vs. NCBI nr
Match: XP_038895921.1 (uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] >XP_038895924.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] >XP_038895927.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida])

HSP 1 Score: 92.4 bits (228), Expect = 4.2e-15
Identity = 57/152 (37.50%), Positives = 90/152 (59.21%), Query Frame = 0

Query: 1   MVESDEK-EKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   +NE+ LG DNV+  +D++  E  ++ +P P K  ++T L QA   FVAWP
Sbjct: 375 MFESDAQCPSINEIPLGPDNVRAMVDIVMGE--DVALPIPQKDKIKT-LDQAIGNFVAWP 434

Query: 61  RNLVIISKNDKDSKQKNQLNIVFQVIRF--VSSCIKILYRYA-EKYGKDNLVVVPISDRI 120
           R LVI +K  K        +I  Q  ++  V   IK+L RYA      D+++ + +S++I
Sbjct: 435 RKLVITTKEKKAPSPTTSKSIA-QSSKYTDVHVTIKLLNRYAMHSMQVDDMIQINLSEQI 494

Query: 121 FGKGETLYLMPEDIMKFCAVIEISNTCMLVYI 149
            GK +T+YL  +DI+++C + EI  +C+L YI
Sbjct: 495 LGKEKTIYLQRDDIIQYCGMAEIGYSCILAYI 522

BLAST of Tan0003442 vs. NCBI nr
Match: XP_008451868.1 (PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo] >XP_008451869.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo] >XP_016901189.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo] >KAA0040180.1 uncharacterized protein E6C27_scaffold118G00160 [Cucumis melo var. makuwa] >TYK16450.1 uncharacterized protein E5676_scaffold21G002480 [Cucumis melo var. makuwa])

HSP 1 Score: 92.4 bits (228), Expect = 4.2e-15
Identity = 65/190 (34.21%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 1   MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWP
Sbjct: 390 MFESDVQCPTIHGIPLGAENIRVTVDIAMVE--DVALPIPLKGDIET-LNQAIGNFVAWP 449

Query: 61  RNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYA-EKYGKDNLVVVPISDRIF 120
           R LVI++K  K  S   ++          V   IK+L RYA +    ++++ + +S+ IF
Sbjct: 450 RKLVIVTKEKKAPSLTASESTTQSSKYTDVHVTIKLLNRYAMQTMQVEDIIQISLSEHIF 509

Query: 121 GKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PS 180
           GK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S
Sbjct: 510 GKEKTIYLRRDDIIQYCGMTEIGYSCILTYIACLWNVCESEITKRFVLVDQATISSHIKS 569

BLAST of Tan0003442 vs. NCBI nr
Match: XP_038895930.1 (uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida])

HSP 1 Score: 92.4 bits (228), Expect = 4.2e-15
Identity = 57/152 (37.50%), Positives = 90/152 (59.21%), Query Frame = 0

Query: 1   MVESDEK-EKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   +NE+ LG DNV+  +D++  E  ++ +P P K  ++T L QA   FVAWP
Sbjct: 375 MFESDAQCPSINEIPLGPDNVRAMVDIVMGE--DVALPIPQKDKIKT-LDQAIGNFVAWP 434

Query: 61  RNLVIISKNDKDSKQKNQLNIVFQVIRF--VSSCIKILYRYA-EKYGKDNLVVVPISDRI 120
           R LVI +K  K        +I  Q  ++  V   IK+L RYA      D+++ + +S++I
Sbjct: 435 RKLVITTKEKKAPSPTTSKSIA-QSSKYTDVHVTIKLLNRYAMHSMQVDDMIQINLSEQI 494

Query: 121 FGKGETLYLMPEDIMKFCAVIEISNTCMLVYI 149
            GK +T+YL  +DI+++C + EI  +C+L YI
Sbjct: 495 LGKEKTIYLQRDDIIQYCGMAEIGYSCILAYI 522

BLAST of Tan0003442 vs. ExPASy TrEMBL
Match: A0A1S3BRX5 (uncharacterized protein LOC103493028 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493028 PE=3 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 2.0e-15
Identity = 65/190 (34.21%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 1   MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWP
Sbjct: 390 MFESDVQCPTIHGIPLGAENIRVTVDIAMVE--DVALPIPLKGDIET-LNQAIGNFVAWP 449

Query: 61  RNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYA-EKYGKDNLVVVPISDRIF 120
           R LVI++K  K  S   ++          V   IK+L RYA +    ++++ + +S+ IF
Sbjct: 450 RKLVIVTKEKKAPSLTASESTTQSSKYTDVHVTIKLLNRYAMQTMQVEDIIQISLSEHIF 509

Query: 121 GKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PS 180
           GK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S
Sbjct: 510 GKEKTIYLRRDDIIQYCGMTEIGYSCILTYIACLWNVCESEITKRFVLVDQATISSHIKS 569

BLAST of Tan0003442 vs. ExPASy TrEMBL
Match: A0A5D3CYL9 (ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G002480 PE=3 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 2.0e-15
Identity = 65/190 (34.21%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 1   MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWP
Sbjct: 390 MFESDVQCPTIHGIPLGAENIRVTVDIAMVE--DVALPIPLKGDIET-LNQAIGNFVAWP 449

Query: 61  RNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYA-EKYGKDNLVVVPISDRIF 120
           R LVI++K  K  S   ++          V   IK+L RYA +    ++++ + +S+ IF
Sbjct: 450 RKLVIVTKEKKAPSLTASESTTQSSKYTDVHVTIKLLNRYAMQTMQVEDIIQISLSEHIF 509

Query: 121 GKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PS 180
           GK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S
Sbjct: 510 GKEKTIYLRRDDIIQYCGMTEIGYSCILTYIACLWNVCESEITKRFVLVDQATISSHIKS 569

BLAST of Tan0003442 vs. ExPASy TrEMBL
Match: A0A1S4DZN2 (uncharacterized protein LOC103493028 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103493028 PE=4 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 2.0e-15
Identity = 65/190 (34.21%), Positives = 109/190 (57.37%), Query Frame = 0

Query: 1   MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWP 60
           M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWP
Sbjct: 390 MFESDVQCPTIHGIPLGAENIRVTVDIAMVE--DVALPIPLKGDIET-LNQAIGNFVAWP 449

Query: 61  RNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYA-EKYGKDNLVVVPISDRIF 120
           R LVI++K  K  S   ++          V   IK+L RYA +    ++++ + +S+ IF
Sbjct: 450 RKLVIVTKEKKAPSLTASESTTQSSKYTDVHVTIKLLNRYAMQTMQVEDIIQISLSEHIF 509

Query: 121 GKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PS 180
           GK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S
Sbjct: 510 GKEKTIYLRRDDIIQYCGMTEIGYSCILTYIACLWNVCESEITKRFVLVDQATISSHIKS 569

BLAST of Tan0003442 vs. ExPASy TrEMBL
Match: A0A6J1C2V2 (uncharacterized protein LOC111007859 isoform X4 OS=Momordica charantia OX=3673 GN=LOC111007859 PE=3 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.9e-13
Identity = 55/141 (39.01%), Positives = 79/141 (56.03%), Query Frame = 0

Query: 10  VNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKND 69
           V+ V LG DNV+V +D++  E     IP P++G +ET L Q    FVAWPR LVI+S+  
Sbjct: 354 VHGVPLGVDNVRVMVDIVIDEYAT--IPIPVRGEIET-LNQTIGGFVAWPRRLVILSEEK 413

Query: 70  K-DSKQKNQLNIVFQVIRFVSSCIKILYRYAE-KYGKDNLVVVPISDRIFGKGETLYLMP 129
              S + +Q          V   IK+L RY       ++ V + +S  IFGK + +YL  
Sbjct: 414 NISSSRTSQTRTQLSKHTDVHVSIKLLNRYVMLSMQHEDTVEINLSKDIFGKEKNIYLTR 473

Query: 130 EDIMKFCAVIEISNTCMLVYI 149
            DIM++C +IEI  +C+L YI
Sbjct: 474 NDIMQYCTMIEIGYSCILTYI 491

BLAST of Tan0003442 vs. ExPASy TrEMBL
Match: A0A6J1C398 (uncharacterized protein LOC111007859 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111007859 PE=4 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.9e-13
Identity = 55/141 (39.01%), Positives = 79/141 (56.03%), Query Frame = 0

Query: 10  VNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKND 69
           V+ V LG DNV+V +D++  E     IP P++G +ET L Q    FVAWPR LVI+S+  
Sbjct: 354 VHGVPLGVDNVRVMVDIVIDEYAT--IPIPVRGEIET-LNQTIGGFVAWPRRLVILSEEK 413

Query: 70  K-DSKQKNQLNIVFQVIRFVSSCIKILYRYAE-KYGKDNLVVVPISDRIFGKGETLYLMP 129
              S + +Q          V   IK+L RY       ++ V + +S  IFGK + +YL  
Sbjct: 414 NISSSRTSQTRTQLSKHTDVHVSIKLLNRYVMLSMQHEDTVEINLSKDIFGKEKNIYLTR 473

Query: 130 EDIMKFCAVIEISNTCMLVYI 149
            DIM++C +IEI  +C+L YI
Sbjct: 474 NDIMQYCTMIEIGYSCILTYI 491

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)    Description
KAE8649224.1      6.5e-16    38.41           hypothetical protein Csa_014966 [Cucumis sativus]
XP_031740251.1    6.5e-16    38.41           uncharacterized protein LOC101213947 [Cucumis sativus]
XP_038895921.1    4.2e-15    37.50           uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] >XP_03889592...
XP_008451868.1    4.2e-15    34.21           PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo] >XP_00...
XP_038895930.1    4.2e-15    37.50           uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]

ExPASy TrEMBL
Match Name        E-value    Identity (%)    Description
A0A1S3BRX5        2.0e-15    34.21           uncharacterized protein LOC103493028 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3CYL9        2.0e-15    34.21           ULP_PROTEASE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S4DZN2        2.0e-15    34.21           uncharacterized protein LOC103493028 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1C2V2        1.9e-13    39.01           uncharacterized protein LOC111007859 isoform X4 OS=Momordica charantia OX=3673 G...
A0A6J1C398        1.9e-13    39.01           uncharacterized protein LOC111007859 isoform X3 OS=Momordica charantia OX=3673 G...

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0003442.1    Tan0003442.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
cellular_component    GO:0110165        cellular anatomical entity