Tan0006546 (gene) Snake gourd v1

Overview
NameTan0006546
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotran_gag_3 domain-containing protein
LocationLG02: 6967086 .. 6967589 (-)
RNA-Seq ExpressionTan0006546
SyntenyTan0006546
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTAACACCCAATCTTACGAAAGTCCTTCCTTGTTGACCGAAACATCGAACTTCAGTGCACGCTCTCTAAACCAGCTGCTAAATAAGGTTACAACAATTAAGCTTGATCGTGGAAATTTCCTGTCATGGAAAAACCTAACCCTACCCATTCTTCGCAGCTACAAGTTGGAAGGCTTCCTACTTGGGACAAAATTGTGTCCACCAATGTTCGTTTCCCACCTTGAAAGTCAACAAAATGCTAGAATTATGGCTTCCGCTGAAGGAGCATCAAGTTTTGGAGTATCAAGTTCAGGAGTAGAGGGCATTCTGAATCCGGAGTATGAATATTGGATGACTATCGATCAGCTTGTTTTGAGTTGGTTGTACAATTCTATGACACCAGAAGTAGCCACTCAAGTGATGGCCTATGGGAGCTCAAAAGATCTGTGGGGAGCTATACAAACACTATTTGGATTGCAATCAAGGGTTGAAGAAGATTACTTAAGACAAGTGTTTCAATAG

mRNA sequence

ATGGCTAACACCCAATCTTACGAAAGTCCTTCCTTGTTGACCGAAACATCGAACTTCAGTGCACGCTCTCTAAACCAGCTGCTAAATAAGGTTACAACAATTAAGCTTGATCGTGGAAATTTCCTGTCATGGAAAAACCTAACCCTACCCATTCTTCGCAGCTACAAGTTGGAAGGCTTCCTACTTGGGACAAAATTGTGTCCACCAATGTTCGTTTCCCACCTTGAAAGTCAACAAAATGCTAGAATTATGGCTTCCGCTGAAGGAGCATCAAGTTTTGGAGTATCAAGTTCAGGAGTAGAGGGCATTCTGAATCCGGAGTATGAATATTGGATGACTATCGATCAGCTTGTTTTGAGTTGGTTGTACAATTCTATGACACCAGAAGTAGCCACTCAAGTGATGGCCTATGGGAGCTCAAAAGATCTGTGGGGAGCTATACAAACACTATTTGGATTGCAATCAAGGGTTGAAGAAGATTACTTAAGACAAGTGTTTCAATAG

Coding sequence (CDS)

ATGGCTAACACCCAATCTTACGAAAGTCCTTCCTTGTTGACCGAAACATCGAACTTCAGTGCACGCTCTCTAAACCAGCTGCTAAATAAGGTTACAACAATTAAGCTTGATCGTGGAAATTTCCTGTCATGGAAAAACCTAACCCTACCCATTCTTCGCAGCTACAAGTTGGAAGGCTTCCTACTTGGGACAAAATTGTGTCCACCAATGTTCGTTTCCCACCTTGAAAGTCAACAAAATGCTAGAATTATGGCTTCCGCTGAAGGAGCATCAAGTTTTGGAGTATCAAGTTCAGGAGTAGAGGGCATTCTGAATCCGGAGTATGAATATTGGATGACTATCGATCAGCTTGTTTTGAGTTGGTTGTACAATTCTATGACACCAGAAGTAGCCACTCAAGTGATGGCCTATGGGAGCTCAAAAGATCTGTGGGGAGCTATACAAACACTATTTGGATTGCAATCAAGGGTTGAAGAAGATTACTTAAGACAAGTGTTTCAATAG

Protein sequence

MANTQSYESPSLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCPPMFVSHLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTPEVATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ
Homology
BLAST of Tan0006546 vs. NCBI nr
Match: XP_038887015.1 (uncharacterized protein LOC120077182 [Benincasa hispida])

HSP 1 Score: 165.2 bits (417), Expect = 4.7e-37
Identity = 88/158 (55.70%), Postives = 108/158 (68.35%), Query Frame = 0

Query: 11  SLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCPPM 70
           S + E++ FS   LNQLLN++T  KL+RG F+ WK L LPILR YKLE  L GTK+C PM
Sbjct: 9   SSIGESTKFSTPPLNQLLNQITNTKLERGYFMLWKTLALPILRGYKLERHLSGTKICSPM 68

Query: 71  F-VSHLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTPE 130
           F VS + S +     + A G    G SS+  E   NP YE W+T DQL+L W YNSMTPE
Sbjct: 69  FTVSTIPSIETGDFGSQASG----GASSATGERTFNPLYEVWVTADQLLLGWPYNSMTPE 128

Query: 131 VATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           VA Q+M + S+K LW AI +LFG+QSR E+DYLRQVFQ
Sbjct: 129 VAVQLMGHESAKSLWDAIHSLFGVQSRAEKDYLRQVFQ 162

BLAST of Tan0006546 vs. NCBI nr
Match: KAA0067279.1 (uncharacterized protein E6C27_scaffold418G001000 [Cucumis melo var. makuwa])

HSP 1 Score: 156.0 bits (393), Expect = 2.9e-34
Identity = 81/154 (52.60%), Postives = 104/154 (67.53%), Query Frame = 0

Query: 14  TETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCPPMFVS 73
           T T++F+   LNQ+LN++TTIKLDRGN+L WK L LPIL+SYKL   L G   C P  + 
Sbjct: 12  TPTTSFTNPLLNQILNQLTTIKLDRGNYLLWKTLALPILKSYKLNSHLFGESPCLPKIIM 71

Query: 74  HLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTPEVATQ 133
            L +Q N  I+ +A   S    SSS     +NP+YE W+T D L+L WLYNSMTPEV  Q
Sbjct: 72  -LTTQPNESIVENAGEPSQETSSSSTAVVTVNPKYERWITTDLLLLGWLYNSMTPEVTIQ 131

Query: 134 VMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           +M + ++KDLW A Q LFG+QSR +ED+L Q FQ
Sbjct: 132 LMGFTNAKDLWEATQDLFGIQSRAKEDFLHQTFQ 164

BLAST of Tan0006546 vs. NCBI nr
Match: XP_016900937.1 (PREDICTED: uncharacterized protein LOC107991116 [Cucumis melo])

HSP 1 Score: 156.0 bits (393), Expect = 2.9e-34
Identity = 81/154 (52.60%), Postives = 104/154 (67.53%), Query Frame = 0

Query: 14  TETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCPPMFVS 73
           T T++F+   LNQ+LN++TTIKLDRGN+L WK L LPIL+SYKL   L G   C P  + 
Sbjct: 87  TPTTSFTNPLLNQILNQLTTIKLDRGNYLLWKTLALPILKSYKLNSHLFGESPCLPKIIM 146

Query: 74  HLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTPEVATQ 133
            L +Q N  I+ +A   S    SSS     +NP+YE W+T D L+L WLYNSMTPEV  Q
Sbjct: 147 -LTTQPNESIVENAGEPSQETSSSSTAVVTVNPKYERWITTDLLLLGWLYNSMTPEVTIQ 206

Query: 134 VMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           +M + ++KDLW A Q LFG+QSR +ED+L Q FQ
Sbjct: 207 LMGFTNAKDLWEATQDLFGIQSRAKEDFLHQTFQ 239

BLAST of Tan0006546 vs. NCBI nr
Match: XP_022151683.1 (uncharacterized protein LOC111019598 [Momordica charantia])

HSP 1 Score: 154.8 bits (390), Expect = 6.4e-34
Identity = 78/159 (49.06%), Postives = 105/159 (66.04%), Query Frame = 0

Query: 9   SPSLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCP 68
           +P  +   + F++  LNQLLN++T+IK+DRGNFL W+NL LPILRSYKL  +L G K CP
Sbjct: 12  TPPAVVSGAVFTSPPLNQLLNQITSIKMDRGNFLLWQNLALPILRSYKLFDYLTGDKPCP 71

Query: 69  PMFVSHLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTP 128
           P  +   ++  N             G +SS     LNP YE W+ +D+L+L WLYNSM  
Sbjct: 72  PTHLVPTDTPTNIE-----------GSTSSQSSPTLNPTYEAWIVVDKLLLGWLYNSMAA 131

Query: 129 EVATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           +VA QVM + +S++LW A+Q LFG+QSR E DYL+QVFQ
Sbjct: 132 DVAMQVMGFSTSRELWTAVQELFGVQSRAEVDYLKQVFQ 159

BLAST of Tan0006546 vs. NCBI nr
Match: XP_016902203.1 (PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo])

HSP 1 Score: 153.7 bits (387), Expect = 1.4e-33
Identity = 84/168 (50.00%), Postives = 111/168 (66.07%), Query Frame = 0

Query: 1   MANTQSYESPSLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGF 60
           MAN Q   +P  L+ ++ FS   LNQ+LN++TT+KLDR N+L WK L LPIL+ YKLEG 
Sbjct: 1   MANAQPTAAPPSLS-SAGFSNPPLNQILNQLTTVKLDRKNYLLWKTLALPILKDYKLEGH 60

Query: 61  LLGTKLCPPMFVSHLESQQNARIMASAEGA-SSFGVSSSGVEGILNPEYEYWMTIDQLVL 120
           L     CP  FV    S  ++    + EGA ++ G SSS    I+NP +E W+T D L+L
Sbjct: 61  LTAETPCPSHFVL---SASSSNTTVTEEGADATIGASSSITPRIVNPLFEQWVTTDLLLL 120

Query: 121 SWLYNSMTPEVATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
            WLYNSMTP+VA Q+M + + +DLW A Q  FG+QSR EED+LRQ+ Q
Sbjct: 121 GWLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQ 164

BLAST of Tan0006546 vs. ExPASy TrEMBL
Match: A0A5A7VPY0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold418G001000 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 1.4e-34
Identity = 81/154 (52.60%), Postives = 104/154 (67.53%), Query Frame = 0

Query: 14  TETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCPPMFVS 73
           T T++F+   LNQ+LN++TTIKLDRGN+L WK L LPIL+SYKL   L G   C P  + 
Sbjct: 12  TPTTSFTNPLLNQILNQLTTIKLDRGNYLLWKTLALPILKSYKLNSHLFGESPCLPKIIM 71

Query: 74  HLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTPEVATQ 133
            L +Q N  I+ +A   S    SSS     +NP+YE W+T D L+L WLYNSMTPEV  Q
Sbjct: 72  -LTTQPNESIVENAGEPSQETSSSSTAVVTVNPKYERWITTDLLLLGWLYNSMTPEVTIQ 131

Query: 134 VMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           +M + ++KDLW A Q LFG+QSR +ED+L Q FQ
Sbjct: 132 LMGFTNAKDLWEATQDLFGIQSRAKEDFLHQTFQ 164

BLAST of Tan0006546 vs. ExPASy TrEMBL
Match: A0A1S4DY80 (uncharacterized protein LOC107991116 OS=Cucumis melo OX=3656 GN=LOC107991116 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 1.4e-34
Identity = 81/154 (52.60%), Postives = 104/154 (67.53%), Query Frame = 0

Query: 14  TETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCPPMFVS 73
           T T++F+   LNQ+LN++TTIKLDRGN+L WK L LPIL+SYKL   L G   C P  + 
Sbjct: 87  TPTTSFTNPLLNQILNQLTTIKLDRGNYLLWKTLALPILKSYKLNSHLFGESPCLPKIIM 146

Query: 74  HLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTPEVATQ 133
            L +Q N  I+ +A   S    SSS     +NP+YE W+T D L+L WLYNSMTPEV  Q
Sbjct: 147 -LTTQPNESIVENAGEPSQETSSSSTAVVTVNPKYERWITTDLLLLGWLYNSMTPEVTIQ 206

Query: 134 VMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           +M + ++KDLW A Q LFG+QSR +ED+L Q FQ
Sbjct: 207 LMGFTNAKDLWEATQDLFGIQSRAKEDFLHQTFQ 239

BLAST of Tan0006546 vs. ExPASy TrEMBL
Match: A0A6J1DCW4 (uncharacterized protein LOC111019598 OS=Momordica charantia OX=3673 GN=LOC111019598 PE=4 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 3.1e-34
Identity = 78/159 (49.06%), Postives = 105/159 (66.04%), Query Frame = 0

Query: 9   SPSLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGFLLGTKLCP 68
           +P  +   + F++  LNQLLN++T+IK+DRGNFL W+NL LPILRSYKL  +L G K CP
Sbjct: 12  TPPAVVSGAVFTSPPLNQLLNQITSIKMDRGNFLLWQNLALPILRSYKLFDYLTGDKPCP 71

Query: 69  PMFVSHLESQQNARIMASAEGASSFGVSSSGVEGILNPEYEYWMTIDQLVLSWLYNSMTP 128
           P  +   ++  N             G +SS     LNP YE W+ +D+L+L WLYNSM  
Sbjct: 72  PTHLVPTDTPTNIE-----------GSTSSQSSPTLNPTYEAWIVVDKLLLGWLYNSMAA 131

Query: 129 EVATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
           +VA QVM + +S++LW A+Q LFG+QSR E DYL+QVFQ
Sbjct: 132 DVAMQVMGFSTSRELWTAVQELFGVQSRAEVDYLKQVFQ 159

BLAST of Tan0006546 vs. ExPASy TrEMBL
Match: A0A1S4E1U9 (uncharacterized protein LOC107991581 isoform X4 OS=Cucumis melo OX=3656 GN=LOC107991581 PE=4 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 6.9e-34
Identity = 84/168 (50.00%), Postives = 111/168 (66.07%), Query Frame = 0

Query: 1   MANTQSYESPSLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGF 60
           MAN Q   +P  L+ ++ FS   LNQ+LN++TT+KLDR N+L WK L LPIL+ YKLEG 
Sbjct: 1   MANAQPTAAPPSLS-SAGFSNPPLNQILNQLTTVKLDRKNYLLWKTLALPILKDYKLEGH 60

Query: 61  LLGTKLCPPMFVSHLESQQNARIMASAEGA-SSFGVSSSGVEGILNPEYEYWMTIDQLVL 120
           L     CP  FV    S  ++    + EGA ++ G SSS    I+NP +E W+T D L+L
Sbjct: 61  LTAETPCPSHFVL---SASSSNTTVTEEGADATIGASSSITPRIVNPLFEQWVTTDLLLL 120

Query: 121 SWLYNSMTPEVATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
            WLYNSMTP+VA Q+M + + +DLW A Q  FG+QSR EED+LRQ+ Q
Sbjct: 121 GWLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQ 164

BLAST of Tan0006546 vs. ExPASy TrEMBL
Match: A0A1S4E1U6 (uncharacterized protein LOC107991581 isoform X1 OS=Cucumis melo OX=3656 GN=LOC107991581 PE=4 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 6.9e-34
Identity = 84/168 (50.00%), Postives = 111/168 (66.07%), Query Frame = 0

Query: 1   MANTQSYESPSLLTETSNFSARSLNQLLNKVTTIKLDRGNFLSWKNLTLPILRSYKLEGF 60
           MAN Q   +P  L+ ++ FS   LNQ+LN++TT+KLDR N+L WK L LPIL+ YKLEG 
Sbjct: 1   MANAQPTAAPPSLS-SAGFSNPPLNQILNQLTTVKLDRKNYLLWKTLALPILKDYKLEGH 60

Query: 61  LLGTKLCPPMFVSHLESQQNARIMASAEGA-SSFGVSSSGVEGILNPEYEYWMTIDQLVL 120
           L     CP  FV    S  ++    + EGA ++ G SSS    I+NP +E W+T D L+L
Sbjct: 61  LTAETPCPSHFVL---SASSSNTTVTEEGADATIGASSSITPRIVNPLFEQWVTTDLLLL 120

Query: 121 SWLYNSMTPEVATQVMAYGSSKDLWGAIQTLFGLQSRVEEDYLRQVFQ 168
            WLYNSMTP+VA Q+M + + +DLW A Q  FG+QSR EED+LRQ+ Q
Sbjct: 121 GWLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQ 164

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038887015.14.7e-3755.70uncharacterized protein LOC120077182 [Benincasa hispida][more]
KAA0067279.12.9e-3452.60uncharacterized protein E6C27_scaffold418G001000 [Cucumis melo var. makuwa][more]
XP_016900937.12.9e-3452.60PREDICTED: uncharacterized protein LOC107991116 [Cucumis melo][more]
XP_022151683.16.4e-3449.06uncharacterized protein LOC111019598 [Momordica charantia][more]
XP_016902203.11.4e-3350.00PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo][more]
Match NameE-valueIdentityDescription
A0A5A7VPY01.4e-3452.60Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A1S4DY801.4e-3452.60uncharacterized protein LOC107991116 OS=Cucumis melo OX=3656 GN=LOC107991116 PE=... [more]
A0A6J1DCW43.1e-3449.06uncharacterized protein LOC111019598 OS=Momordica charantia OX=3673 GN=LOC111019... [more]
A0A1S4E1U96.9e-3450.00uncharacterized protein LOC107991581 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S4E1U66.9e-3450.00uncharacterized protein LOC107991581 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR37610:SF39GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATEDcoord: 23..164
NoneNo IPR availablePANTHERPTHR37610FAMILY NOT NAMEDcoord: 23..164

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0006546.1Tan0006546.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0044237 cellular metabolic process