Tan0008154 (gene) Snake gourd v1

Overview
Name: Tan0008154
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Location: LG01: 104967199 .. 104969428 (+)
RNA-Seq Expression: Tan0008154
Synteny: Tan0008154
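
The Location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for genome feature pages, but not stated on this page); the names `LOCATION_RE`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Pattern for strings such as "LG01: 104967199 .. 104969428 (+)".
LOCATION_RE = re.compile(
    r"^(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)$"
)


def parse_location(text: str) -> dict:
    """Split a location string into seqid, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return {
        "seqid": match.group("seqid"),
        "start": int(match.group("start")),
        "end": int(match.group("end")),
        "strand": match.group("strand"),
    }


def span_length(location: dict) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return location["end"] - location["start"] + 1


loc = parse_location("LG01: 104967199 .. 104969428 (+)")
print(loc["seqid"], loc["strand"], span_length(loc))
```

If the coordinates were instead 0-based half-open, the `+ 1` in `span_length` would have to be dropped.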
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACTTGGCCACCAAATTTCTTCGCACTGTAACCAATGCCACCAACAACCAAACCTTAATCAACGTTTGTTTGATTATTTCGTTTGGGGTGCTGAGCGCAAGGTCAATCAAGCAGCAGAGGGAAATTGAAGCTCTGGAGGCCGAGAAAGATTCACTTCTCAATTCCAATAAAGCGTTGAAGAAAACCATGTGGGATTGGAAGCAACAGCTATTCGCCGAAGCCTCAACCGAGTCCGCTTTGGTTCCTCTCGCCAGGATCAAAGCTATCTACGGCGAAGCTCCGATATCCCCTTCCGGTAATTTACGCTTTCCTTTTCTCTTCACTCTTGTTTTTTCTGAATGTTTAGTAAAATTTTGTCGAAATGTTAGTTATTATTTGAAGGAAATGGTACCGTGCTTGATTCATATACACTCGTTTTAGAATTTGAGTGTCGTCATCGAGAATTATGGCTCATTCCAAGCATTTCAAGCTCCCTACTCGTAGTTGGAAAATTATTCGGTTCCGTTACTACTTCATCTTTCAGTGTTGGCCTGATGTTTCTCATCATCTAATTCTGAAGTCGAGTGTTATCATAATTAATAACCGTGTTCTTGGTTGTCATGAATTCTAATTTGAATCTTACTTTCGTGGGCTAGTATTTTTTTTTCTTTTTTTTTCCAGTACCTCCGAAATAATGAGCATTGAATTGTATAATGGGATAGACCATGTCATTGAGCTTAACGGTCCTATGCTTATTAAGTTCATGTTCCTGCACAACTGATGCAGAGGTCTGTCATTATGGAAGAAGAACTTGAGGATAGTACATTCTGCATAAAACTTTACCGAAGACATTGTTATTGAGCTGGGATTCATATATGAGAAGGCGACGCTCAAAAAGTTTTGATGCAAAGGAATTAATTTCTCCTATTGGGACTAGGAAGTTTAAAAAGGGCACTCCAAGTGAAGTTAATTACAAAGGCAGAATAGTCGCAGAAAGAGTTGGAAGGATAAAATAAAAAAGAACATTGAAATTTTGTTCCCAATATCTATCCCTATTGTTAGAAATCTGTTAATTTCTTTCTAGCCGTAGGCCTCTACTCCTAGTCCTTGACTGCATTCTTCCATAAGATTTTTGTCTTTTCTTTTCTTTGAAGTTGGTTCCACACAGAATCAGCATCAGACCATCCTTAGTCAAGAAATCAAAATCCAACTCAAAGTTGATTAGAGGAAAAGAGCTTCAAAAACTTTCTGCTTCAAAAGCTTCGAAGTAAATGGTTTTGTGTCTTGTTTGAATTCCTACATAATGAGCAGTTGAGCACCAATTTGGTTGTAACTTGTAAGATCATCAATGGACATCTTCTTTGTAGTCTCTCCGTTGTGTTTAGCCCCCTCAAGACACACCAACCAGCAGAAACACTCCAACTTTTATTTTGGTTCTTAACCTTTCCAAATCGAGTGGTCAGTTACTAGATAGCAAGGAAATTTTGATAGACCTGTGATTTTTACTAATAGAAATGCATTACTTCTGTCAACCTTGTGATAATATTACCCTGTGGACGATGACTGTAGTATGCATTTCACAATTTGTGGGGAGATAAAGCTCGAGAATTGGCGGTCATGTTAACATGACTTGATTGAGCCCAAGTAGACTGGAAGATAAAAACAGCATGAGTGGATTCTTTTAAAAAAACAATTCAAGTGCTTGGTTTGATGCAATTTTTTTATAATCTTTTTTGCTTGTTGAATTCATGCAGAACAATGTTCAGAATCTATAATCTATTACTTGAGAGTTTTAAGTCTAGCTTGGATTTATTGTAGTATCTCAATTGAGAGTTGAGAGTAGTTCAAATTTCCCTCATATAATTCCGGGTGAGGAGGTTTGGGGGGTGACTAGGTTTAATGTCTCTCTTTAGGCTTCGGTTACTGGGCCTTTTTGTAATAAACCTTTGTCCGGTTCATTTGGATTGGAGTCCTTTTCTGTAGTTTATTTTGGGCTCCTATTATTTCATTTTTCTCCC
GTTTCTTAGAAAATAAATAAATAAAATTCACTCTCTTATGCATATTACTTAGTTCAGATGTTTTTATATATCAAGTCAACTTTAGCTTATTTGAGATCTGAGAAGATGGCCATTATTGTATAAGCTAGCTTCTACAGATTTATACACTGATCCACTTTTTATTTCATGCAGGAGCAGGAAATGCCGTAATCGAAGATGCAAATTCACAAGGCTCCAAACTTATGGTTTAA

mRNA sequence

ATGGACTTGGCCACCAAATTTCTTCGCACTGTAACCAATGCCACCAACAACCAAACCTTAATCAACGTTTGTTTGATTATTTCGTTTGGGGTGCTGAGCGCAAGGTCAATCAAGCAGCAGAGGGAAATTGAAGCTCTGGAGGCCGAGAAAGATTCACTTCTCAATTCCAATAAAGCGTTGAAGAAAACCATGTGGGATTGGAAGCAACAGCTATTCGCCGAAGCCTCAACCGAGTCCGCTTTGGTTCCTCTCGCCAGGATCAAAGCTATCTACGGCGAAGCTCCGATATCCCCTTCCGGAGCAGGAAATGCCGTAATCGAAGATGCAAATTCACAAGGCTCCAAACTTATGGTTTAA

Coding sequence (CDS)

ATGGACTTGGCCACCAAATTTCTTCGCACTGTAACCAATGCCACCAACAACCAAACCTTAATCAACGTTTGTTTGATTATTTCGTTTGGGGTGCTGAGCGCAAGGTCAATCAAGCAGCAGAGGGAAATTGAAGCTCTGGAGGCCGAGAAAGATTCACTTCTCAATTCCAATAAAGCGTTGAAGAAAACCATGTGGGATTGGAAGCAACAGCTATTCGCCGAAGCCTCAACCGAGTCCGCTTTGGTTCCTCTCGCCAGGATCAAAGCTATCTACGGCGAAGCTCCGATATCCCCTTCCGGAGCAGGAAATGCCGTAATCGAAGATGCAAATTCACAAGGCTCCAAACTTATGGTTTAA

Protein sequence

MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKALKKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV
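
The protein sequence above is consistent with translating the CDS under the standard genetic code. The sketch below illustrates that relationship on the first and last few codons of the CDS only; `CODON_TABLE` and `translate` are illustrative names, and the approach is an assumption about how one might check the record, not a description of the database's own pipeline.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGACTTGGCCACCAAATTTCTTCGCACT"
cds_end = "AAACTTATGGTTTAA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full 357-nt CDS to `translate` in the same way should give the 118-residue protein listed above.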
Homology
BLAST of Tan0008154 vs. NCBI nr
Match: XP_038897052.1 (uncharacterized protein LOC120085226 [Benincasa hispida])

HSP 1 Score: 203.8 bits (517), Expect = 8.4e-49
Identity = 106/118 (89.83%), Positives = 111/118 (94.07%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLA+KFLRTVTNATNN TLINVCL++SFG LSARSIKQQREIEALEAEK SLLNSNKAL
Sbjct: 1   MDLASKFLRTVTNATNNNTLINVCLVLSFGALSARSIKQQREIEALEAEKVSLLNSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           KKTMWDWKQQLFAEAST+SALVPLARIKAIYGEAPISPSG G+   EDANSQGSKLMV
Sbjct: 61  KKTMWDWKQQLFAEASTDSALVPLARIKAIYGEAPISPSGVGHVATEDANSQGSKLMV 118

BLAST of Tan0008154 vs. NCBI nr
Match: XP_022937146.1 (uncharacterized protein LOC111443531 [Cucurbita moschata] >XP_023536544.1 uncharacterized protein LOC111797682 [Cucurbita pepo subsp. pepo] >KAG7024766.1 hypothetical protein SDJN02_13584 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 199.5 bits (506), Expect = 1.6e-47
Identity = 103/118 (87.29%), Positives = 111/118 (94.07%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLATKFLRTVT+ATNN TLINVCL+ISFG LSARSIKQQREIEALEAEKDSLLNSNK+L
Sbjct: 1   MDLATKFLRTVTSATNNNTLINVCLVISFGALSARSIKQQREIEALEAEKDSLLNSNKSL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           KKTMWDWKQQL+++AST+SALVPLARIKAIYGEAP+SPSGA  A   DANSQGSKLMV
Sbjct: 61  KKTMWDWKQQLYSDASTDSALVPLARIKAIYGEAPVSPSGAEQAATGDANSQGSKLMV 118

BLAST of Tan0008154 vs. NCBI nr
Match: XP_022976887.1 (uncharacterized protein LOC111477117 [Cucurbita maxima])

HSP 1 Score: 198.4 bits (503), Expect = 3.5e-47
Identity = 102/118 (86.44%), Positives = 110/118 (93.22%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLA+KFLRT+TNATNN TLINVCL+ISFG LSARSIKQQREIEALEAEKDSLLNSNKAL
Sbjct: 1   MDLASKFLRTLTNATNNNTLINVCLVISFGALSARSIKQQREIEALEAEKDSLLNSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           KKTMWDWKQQL++EAST+SAL+PLARIKAIY EAP+SPSGA  A   DANSQGSKLMV
Sbjct: 61  KKTMWDWKQQLYSEASTDSALIPLARIKAIYSEAPVSPSGAEQAATGDANSQGSKLMV 118

BLAST of Tan0008154 vs. NCBI nr
Match: KAG6591893.1 (Acetyltransferase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 198.0 bits (502), Expect = 4.6e-47
Identity = 102/117 (87.18%), Positives = 110/117 (94.02%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLATKFLRTVT+ATNN TLINVCL+ISFG LSARSIKQQREIEALEAEKDSLLNSNK+L
Sbjct: 113 MDLATKFLRTVTSATNNNTLINVCLVISFGALSARSIKQQREIEALEAEKDSLLNSNKSL 172

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLM 118
           KKTMWDWKQQL+++AST+SALVPLARIKAIYGEAP+SPSGA  A   DANSQGSKLM
Sbjct: 173 KKTMWDWKQQLYSDASTDSALVPLARIKAIYGEAPVSPSGAEQAATGDANSQGSKLM 229

BLAST of Tan0008154 vs. NCBI nr
Match: XP_022140674.1 (uncharacterized protein LOC111011272 [Momordica charantia])

HSP 1 Score: 194.9 bits (494), Expect = 3.9e-46
Identity = 101/119 (84.87%), Positives = 112/119 (94.12%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MD+ATKFLRT+TNA+NN+TLINVCL++SFG LSARSIKQQREIEALEAEKDSLLNSNKAL
Sbjct: 1   MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPI-SPSGAGNAVIEDANSQGSKLMV 119
           KK+MWDWKQQLFAEAS+ESALVPLARIKAIYGE PI SP+GAG+A  ED NSQGSK +V
Sbjct: 61  KKSMWDWKQQLFAEASSESALVPLARIKAIYGEVPISSPTGAGHAATEDENSQGSKFVV 119

BLAST of Tan0008154 vs. ExPASy TrEMBL
Match: A0A6J1F9J6 (uncharacterized protein LOC111443531 OS=Cucurbita moschata OX=3662 GN=LOC111443531 PE=4 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 7.7e-48
Identity = 103/118 (87.29%), Positives = 111/118 (94.07%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLATKFLRTVT+ATNN TLINVCL+ISFG LSARSIKQQREIEALEAEKDSLLNSNK+L
Sbjct: 1   MDLATKFLRTVTSATNNNTLINVCLVISFGALSARSIKQQREIEALEAEKDSLLNSNKSL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           KKTMWDWKQQL+++AST+SALVPLARIKAIYGEAP+SPSGA  A   DANSQGSKLMV
Sbjct: 61  KKTMWDWKQQLYSDASTDSALVPLARIKAIYGEAPVSPSGAEQAATGDANSQGSKLMV 118

BLAST of Tan0008154 vs. ExPASy TrEMBL
Match: A0A6J1IGY3 (uncharacterized protein LOC111477117 OS=Cucurbita maxima OX=3661 GN=LOC111477117 PE=4 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 1.7e-47
Identity = 102/118 (86.44%), Positives = 110/118 (93.22%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLA+KFLRT+TNATNN TLINVCL+ISFG LSARSIKQQREIEALEAEKDSLLNSNKAL
Sbjct: 1   MDLASKFLRTLTNATNNNTLINVCLVISFGALSARSIKQQREIEALEAEKDSLLNSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           KKTMWDWKQQL++EAST+SAL+PLARIKAIY EAP+SPSGA  A   DANSQGSKLMV
Sbjct: 61  KKTMWDWKQQLYSEASTDSALIPLARIKAIYSEAPVSPSGAEQAATGDANSQGSKLMV 118

BLAST of Tan0008154 vs. ExPASy TrEMBL
Match: A0A6J1CGB8 (uncharacterized protein LOC111011272 OS=Momordica charantia OX=3673 GN=LOC111011272 PE=4 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 1.9e-46
Identity = 101/119 (84.87%), Positives = 112/119 (94.12%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MD+ATKFLRT+TNA+NN+TLINVCL++SFG LSARSIKQQREIEALEAEKDSLLNSNKAL
Sbjct: 1   MDMATKFLRTLTNASNNKTLINVCLVVSFGALSARSIKQQREIEALEAEKDSLLNSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPI-SPSGAGNAVIEDANSQGSKLMV 119
           KK+MWDWKQQLFAEAS+ESALVPLARIKAIYGE PI SP+GAG+A  ED NSQGSK +V
Sbjct: 61  KKSMWDWKQQLFAEASSESALVPLARIKAIYGEVPISSPTGAGHAATEDENSQGSKFVV 119

BLAST of Tan0008154 vs. ExPASy TrEMBL
Match: A0A1S3B9R7 (uncharacterized protein LOC103487583 OS=Cucumis melo OX=3656 GN=LOC103487583 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.7e-42
Identity = 95/118 (80.51%), Positives = 105/118 (88.98%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MDLA+KFLR ++N  N  TLINVCL+ SF  LSARSIKQ+R+IEALEAEK+SLL+SNKAL
Sbjct: 1   MDLASKFLRILSNDNNKNTLINVCLVFSFAALSARSIKQERQIEALEAEKNSLLDSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           KKTMWDWKQQLFAEAST+SALVPLARIKAIYGEAPISPSGA NA  EDA S+ SKLMV
Sbjct: 61  KKTMWDWKQQLFAEASTQSALVPLARIKAIYGEAPISPSGAVNAATEDATSRSSKLMV 118

BLAST of Tan0008154 vs. ExPASy TrEMBL
Match: A0A0A0L380 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G364020 PE=4 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 2.1e-37
Identity = 83/100 (83.00%), Positives = 93/100 (93.00%), Query Frame = 0

Query: 1   MDLATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKAL 60
           MD A+KFLR++ NATN  T+INVCL++SF  L+ARSIKQ+R+IEALE EK+SLLNSNKAL
Sbjct: 1   MDSASKFLRSLANATNKNTVINVCLVVSFAALTARSIKQERQIEALETEKNSLLNSNKAL 60

Query: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSG 101
           KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSG
Sbjct: 61  KKTMWDWKQQLFAEASTESALVPLARIKAIYGEAPISPSG 100

BLAST of Tan0008154 vs. TAIR 10
Match: AT1G48200.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 30 Blast hits to 30 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 105.9 bits (263), Expect = 2.2e-23
Identity = 56/118 (47.46%), Positives = 80/118 (67.80%), Query Frame = 0

Query: 3   LATKFLRTVTNATNNQTLINVCLIISFGVLSARSIKQQREIEALEAEKDSLLNSNKALKK 62
           +A K    ++ A NN  +IN CL +SF VL  RS KQQ+ +EAL  +K+SL  SNKA+K 
Sbjct: 1   MANKIAMFLSEAMNNNAVINTCLGVSFVVLGLRSDKQQKYVEALAEQKESLFKSNKAMKL 60

Query: 63  TMWDWKQQLFAEAST--ESALVPLARIKAIYGEAPISPSGAGNAVIEDANSQGSKLMV 119
           TMW+WKQQLFAEA++   +A+VPL+ +KAIYGE   + + +G+   ED+     K+M+
Sbjct: 61  TMWEWKQQLFAEAASAGNAAVVPLSTLKAIYGEVTTTTNQSGDTAKEDSKVSTPKIMI 118
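
The Identity and Positives figures in the HSP headers above can be recomputed from the alignment midlines. The sketch below assumes the usual BLASTP midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); `summarize_midline` and `percent` are hypothetical helper names introduced only for this illustration.

```python
from typing import Tuple


def summarize_midline(midline: str) -> Tuple[int, int]:
    """Return (identities, positives) for one BLAST protein midline.

    Assumes a letter means an identical residue, '+' a conservative
    substitution, and a space a mismatch or gap.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, aligned_length: int) -> float:
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


# Midline for query residues 1-60 of the XP_038897052.1 alignment above.
block = "MDLA+KFLRTVTNATNN TLINVCL++SFG LSARSIKQQREIEALEAEK SLLNSNKAL"
identities, positives = summarize_midline(block)
print(identities, positives, len(block))

# Whole-HSP counts as reported in the header for the same hit.
print(percent(106, 118), percent(111, 118))
```

Summing the per-block counts over both blocks of an HSP gives the totals shown in its header line.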

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038897052.1  8.4e-49  89.83         uncharacterized protein LOC120085226 [Benincasa hispida]
XP_022937146.1  1.6e-47  87.29         uncharacterized protein LOC111443531 [Cucurbita moschata] >XP_023536544.1 unchar...
XP_022976887.1  3.5e-47  86.44         uncharacterized protein LOC111477117 [Cucurbita maxima]
KAG6591893.1    4.6e-47  87.18         Acetyltransferase, partial [Cucurbita argyrosperma subsp. sororia]
XP_022140674.1  3.9e-46  84.87         uncharacterized protein LOC111011272 [Momordica charantia]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1F9J6      7.7e-48  87.29         uncharacterized protein LOC111443531 OS=Cucurbita moschata OX=3662 GN=LOC1114435...
A0A6J1IGY3      1.7e-47  86.44         uncharacterized protein LOC111477117 OS=Cucurbita maxima OX=3661 GN=LOC111477117...
A0A6J1CGB8      1.9e-46  84.87         uncharacterized protein LOC111011272 OS=Momordica charantia OX=3673 GN=LOC111011...
A0A1S3B9R7      1.7e-42  80.51         uncharacterized protein LOC103487583 OS=Cucumis melo OX=3656 GN=LOC103487583 PE=...
A0A0A0L380      2.1e-37  83.00         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G364020 PE=4 SV=1

TAIR 10
Match Name      E-value  Identity (%)  Description
AT1G48200.1     2.2e-23  47.46         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term  Source Description    Alignment
None      No IPR available  COILS    Coil         Coil                  coord: 36..63
None      No IPR available  PANTHER  PTHR38355    OS06G0149500 PROTEIN  coord: 1..118

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0008154.1  Tan0008154.1  mRNA