Tan0018702 (gene) Snake gourd v1

Overview
Name: Tan0018702
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Multidrug resistance protein
Location: LG01: 15737568 .. 15741054 (-)
RNA-Seq Expression: Tan0018702
Synteny: Tan0018702
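
The Location field can be split into linkage group, start, end, and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'LG01: 15737568 .. 15741054 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("LG01: 15737568 .. 15741054 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3487 bp on the minus strand of LG01.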
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TACACTTGCAGGTGGTACAGCAAATGGCTTTAGTGTTTCTGACAAAAGTTATGGGTTCCTCTCAACAGGTTATTTGAAGAGGAGAGATATATGAAGAATAGTTGTGGTCTACATCACATTCTTTCCTACCTTTGCTCTTTTTTGCCACCAACCAGTTTTCTTTCTACTCTTGAATCATATGGATGTGCTTGGTACAGCATAGCTGTAAGGTGAGCAAAACATTATTCTCTTCTGCTACTCTTTATCATGTCGTTTTATCTTTAGATTCTTTCTCCAGAGGAATAATTTTCTCCCATTCTTGTTTTTTTTTTCCCCTCTTCTTTTTGATAGAAAATGGCTACACAAACAAGACATGCCTTCACTTTGGGCTAGCTTTTGTTTTGCTTAAATTAATATCAATATCATGACTGCTTTTTGTATTCTCTTTCCTTTCATCTGCTTTCATGGTGTTTTTATATGAGATTGTGTGAGATATCCCTTCCCTCCCAATGATGGCATCTGAAGGGACATTGAATATGAACTGGGTTAATGTTTTGATGATTTCAAAATGTATTTTTGAGCTAAAAGGTCTGTAAAAACTTGAGAAACTATTCAACATGGCTGTCGTTTTGCGATCTTTAGATAATCATCTAATGGTAATGGTGCAGCGGTCCATTCCAACAATTTATCTTGAGAAATTATCCATGTCAGGCATTTTAACTATCATGGCTTAGCAACCACTTCTGATAATTTCTCCATAAAAACTATTATCGCAGAACGATTAATTATATCAAAACACATAGTTTCTCTTTGTTTCTCTTCTACTATGTCAGTAAACATGGATGAACAATCATTATGGAGGTGAAAAATAATGGTTCTGGGGGCTGGGGTTAGTGCATATAATTATTTGATTCAAATGTTATCTTGGAAATTTAGGGGATGATCTGAAATCTGAATAGGCAGGCTGTAAAATCTCCTCCAACTTTCTAAACACTTTTAATTAGCTCATTCATGGTAACTGGGTTTAGACTAAAATTCTTCATCTAAATGAAATTTGTTTCTTTTAAAGGGTGGAAACTTTTTGAAATTACCAATAAGCCATTGTTATTTTGGCTCCCCAGACAGGGAGGGCCAATCTTTTGCTTTCTTTTAGTTTGTTGCTTCAGACAAAGTATAGTATTCACTAAGCTAGGCATCCTCATTTAAATGGAAAAAGGTGTAGCAATCAAATTATAATTAACCCTCTTCCCATTTATTTTATTTCTCTATAGTGGAAAGTGCGAGTTGGGTTGGCTGCTTTCTTTATTAAATAATGCTTTGAGAAAAATAGATGTCTTGTTTTTTCATGACAGGTTAGCTATGGAATTGGAAGTATTGGATGGAAACTATGCTTCAAAGCCCAAATCAAAAGTTTAGATACTCTTAAGAAACTGATTAGTTTATAGCAGATGGTTTGCATCATTTTCGATATCACATCATCAGAATAACTAACTGCCATCACAGAGGATCTTAGACATCTTCTTCCTCTACCCTTGACTTCTGTAAGTGCTAGCATTTTGTTTATTATTCTCTTCTTCTCATTCTGGTTCTAATACTTGTGCGTGTAACCCACCCACCCCCAGCGACAAGGGGAGGGCATTGAAGCATGTACCCTTGATATCTTAATCATTGAAGCATGCACAAATAGGAAGGAAAAAAAAAACATTGGTAGTTCCTTTTTCTGCAGTTATTTAGCGGAGGGTCTGAAAGATAAAACAAAGACCACCCCCCCAACCCTCAACCTGACTCCATAGTTTCTCTGTGGACCGGTGGGGGGGTTGTTTCATTTTCTTTTTTCTAGTATTTATGAAAGTTTTGGAATTTTCTAATTTGCCCCCTCTTTCTCTATTACAATTGACTTTATAAGATTTTTGCCCCCCTTTTAACAAAATTTGTGGCTCTGCCCCTGGAGATGAGATTCAAATAAGAATGCTGAGAAAATCAGTTTCTTTTCAGGGCTAGGGGTCATAAGGAATACATTC
TTATGTAGAAGCCATTTTCATCTGCATCATTTTTGCTCCAAGCATTCAGTTGAGGAAGATATAAACCACCACCTCAAACTTAAAGGTACAGGCTGCAGAGCTTGCAAAATATCAGAAGAGGTATGAGTGAGGTTGGCTCTGAAAGATCAAAGTCATGGAACATATACACGACTCCAGACCAGAGCCCATCATCTCAAACAGGCATTGGTCAAGAAGCTCCATGGAAAAACTTTGGGTCCTCCATGAATGCCATTTCCTTTGGCTTTGTTGCCACTGCAATCTTGATCTCAATGTTCCTTATCATGGCCATCTTTGAGCATCTGTTTCGACCAACTTCTCCTTTCTCCTCATCTGATGAAGTGACCAACAACTCCGCAGATTCAGGTCCAGTCGAGAAATTCGCAAGTCCAAATACGGTACATTTCGACCTGGGTTGTTTACCTTTACATTTGGCTTGTGGGATTTGGATTATCATCCTTTGTTTAATATCATTAAACGTCTTCAGTGCTTTCCCCTCCCCTTAAAACAAGTGTTTCAGTTACTGAGGCTTTAAACTACTTCATATCATTATATATCAAGCCCCCTTGAACAGATTTGTTTTGAGACCAGTAGGAGAAAACAACTTAAAGCCAACTCTTCTTTCTTAATTCTAGAATAAGAAAAGACTGATTGTACAACTTGTTTCTTCACCATGAGTTTGATTAAACCTGTCTTTTAAAATACAAGTTTCACAAGTGGCTTCATAAACTTTTTCTCTTGACAAAACTCATCTTGCTCAAAACAAAAACAATTATAATTTCCTAACTAGTTTTGTTTCTACTATATTTCAAAAATAAGTTCTCAAAACTGCAATATTAGACCAAAAAAACCTCGAAAAGAAGGTGTAGTTCGATGTTTGAGTCGATTAAAAGAAAAACTATTTTCTCTTGTCTTCTTCAAGAACATTATCTTTTATCATAACAGTGACAAGACACAGACATCACCACAAACCAGTACCAATCATTACACAATACATGCTTTTCTTTTAAGGTCAAGTCATTGAATTGGACATTGGGAATGATATGTTTTTGTTTATTGAGTGTTGTTTTTCTGTGGCAGGTGCCAACATCATATGCATCTGATTACTCAGTGTTGATGCCAGGTCAGCACATCCCCACCTTCATTGCCCAGCCAGCTCCTCTGCCATGCCAAAGAGAGAGGATTTATTGGCCATCTCATGACCATAATTTTTCAGGTCCTTAAAACAAATATGATATTTCTTGCCTCCATTGAAAGCAGCTGCTGCATTTAGGAAAAACTGTCAGCTTTGCTTCTTCTCAGAATAATAATTCTTGTGCTTCTTCTTCCCCCATAGGGAGATTTGCTCAAATGCTGATGTTTTTTTGGTCTTGGTTTCTGATCTGTAAATTGATTATAAAGAAGGCACAAGTATTCACACAAGTTTACTGGTTTGATAAAGATTTAAGGTGCAAATAGATATTGTCCCA

mRNA sequence

TACACTTGCAGGTGGTACAGCAAATGGCTTTAGTGTTTCTGACAAAAGTTATGGGTTCCTCTCAACAGGTTATTTGAAGAGGAGAGATATATGAAGAATAGTTGTGGTCTACATCACATTCTTTCCTACCTTTGCTCTTTTTTGCCACCAACCAGTTTTCTTTCTACTCTTGAATCATATGGATGTGCTTGGTACAGCATAGCTGTAAGGTGAGCAAAACATTATTCTCTTCTGCTACTCTTTATCATGTCGTTTTATCTTTAGATTCTTTCTCCAGAGGAATAATTTTCTCCCATTCTTGTTTTTTTTTTCCCCTCTTCTTTTTGATAGAAAATGGCTACACAAACAAGACATGCCTTCACTTTGGGCTAGCTTTTGTTTTGCTTAAATTAATATCAATATCATGACTGCTTTTTGTATTCTCTTTCCTTTCATCTGCTTTCATGGTGTTTTTATATGAGATTGTGTGAGATATCCCTTCCCTCCCAATGATGGCATCTGAAGGGACATTGAATATGAACTGGGTTAATGTTTTGATGATTTCAAAATGTATTTTTGAGCTAAAAGGTCTGTAAAAACTTGAGAAACTATTCAACATGGCTGTCGTTTTGCGATCTTTAGATAATCATCTAATGGTAATGGTGCAGCGGTCCATTCCAACAATTTATCTTGAGAAATTATCCATGTCAGGCATTTTAACTATCATGGCTTAGCAACCACTTCTGATAATTTCTCCATAAAAACTATTATCGCAGAACGATTAATTATATCAAAACACATAGTTTCTCTTTGTTTCTCTTCTACTATGTCAGTAAACATGGATGAACAATCATTATGGAGGTGAAAAATAATGGTTCTGGGGGCTGGGGTTAGTGCATATAATTATTTGATTCAAATGTTATCTTGGAAATTTAGGGGATGATCTGAAATCTGAATAGGCAGGCTGTAAAATCTCCTCCAACTTTCTAAACACTTTTAATTAGCTCATTCATGGTAACTGGGTTTAGACTAAAATTCTTCATCTAAATGAAATTTGTTTCTTTTAAAGGGTGGAAACTTTTTGAAATTACCAATAAGCCATTGTTATTTTGGCTCCCCAGACAGGGAGGGCCAATCTTTTGCTTTCTTTTAGTTTGTTGCTTCAGACAAAGTATAGTATTCACTAAGCTAGGCATCCTCATTTAAATGGAAAAAGGTGTAGCAATCAAATTATAATTAACCCTCTTCCCATTTATTTTATTTCTCTATAGTGGAAAGTGCGAGTTGGGTTGGCTGCTTTCTTTATTAAATAATGCTTTGAGAAAAATAGATGTCTTGTTTTTTCATGACAGGTTAGCTATGGAATTGGAAGTATTGGATGGAAACTATGCTTCAAAGCCCAAATCAAAAGTTTAGATACTCTTAAGAAACTGATTAGTTTATAGCAGATGGTTTGCATCATTTTCGATATCACATCATCAGAATAACTAACTGCCATCACAGAGGATCTTAGACATCTTCTTCCTCTACCCTTGACTTCTGGCTAGGGGTCATAAGGAATACATTCTTATGTAGAAGCCATTTTCATCTGCATCATTTTTGCTCCAAGCATTCAGTTGAGGAAGATATAAACCACCACCTCAAACTTAAAGGTACAGGCTGCAGAGCTTGCAAAATATCAGAAGAGGTATGAGTGAGGTTGGCTCTGAAAGATCAAAGTCATGGAACATATACACGACTCCAGACCAGAGCCCATCATCTCAAACAGGCATTGGTCAAGAAGCTCCATGGAAAAACTTTGGGTCCTCCATGAATGCCATTTCCTTTGGCTTTGTTGCCACTGCAATCTTGATCTCAATGTTCCTTATCATGGCCATCTTTGAGCATCTGTTTCGACCAACTTCTCCTTTCTCCTCATCTGATGAAGTGACCAACAACTCCGCAGATTCAGGTCCAGTCGAGAAATTCGCAAGTCCAAATACGGTGCCAACATCATATGCATCTGATTACTCAGTGTTGAT
GCCAGGTCAGCACATCCCCACCTTCATTGCCCAGCCAGCTCCTCTGCCATGCCAAAGAGAGAGGATTTATTGGCCATCTCATGACCATAATTTTTCAGGTCCTTAAAACAAATATGATATTTCTTGCCTCCATTGAAAGCAGCTGCTGCATTTAGGAAAAACTGTCAGCTTTGCTTCTTCTCAGAATAATAATTCTTGTGCTTCTTCTTCCCCCATAGGGAGATTTGCTCAAATGCTGATGTTTTTTTGGTCTTGGTTTCTGATCTGTAAATTGATTATAAAGAAGGCACAAGTATTCACACAAGTTTACTGGTTTGATAAAGATTTAAGGTGCAAATAGATATTGTCCCA

Coding sequence (CDS)

ATGAGTGAGGTTGGCTCTGAAAGATCAAAGTCATGGAACATATACACGACTCCAGACCAGAGCCCATCATCTCAAACAGGCATTGGTCAAGAAGCTCCATGGAAAAACTTTGGGTCCTCCATGAATGCCATTTCCTTTGGCTTTGTTGCCACTGCAATCTTGATCTCAATGTTCCTTATCATGGCCATCTTTGAGCATCTGTTTCGACCAACTTCTCCTTTCTCCTCATCTGATGAAGTGACCAACAACTCCGCAGATTCAGGTCCAGTCGAGAAATTCGCAAGTCCAAATACGGTGCCAACATCATATGCATCTGATTACTCAGTGTTGATGCCAGGTCAGCACATCCCCACCTTCATTGCCCAGCCAGCTCCTCTGCCATGCCAAAGAGAGAGGATTTATTGGCCATCTCATGACCATAATTTTTCAGGTCCTTAA

Protein sequence

MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLIMAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFIAQPAPLPCQRERIYWPSHDHNFSGP
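
The protein sequence is the translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first ten codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS listed above.
print(translate("ATGAGTGAGGTTGGCTCTGAAAGATCAAAG"))  # MSEVGSERSK
```

Passing the full CDS to the same function should give the 145-residue protein, with the terminal TAA read as the stop codon.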
Homology
BLAST of Tan0018702 vs. NCBI nr
Match: XP_022983836.1 (uncharacterized protein LOC111482330 [Cucurbita maxima] >XP_022983844.1 uncharacterized protein LOC111482330 [Cucurbita maxima] >XP_022983852.1 uncharacterized protein LOC111482330 [Cucurbita maxima] >KAG7031361.1 hypothetical protein SDJN02_05401 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 277.7 bits (709), Expect = 5.7e-71
Identity = 136/145 (93.79%), Positives = 139/145 (95.86%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRPTS FSSS EVTNN A SGPVEKFASPNTVPTSYA+D+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPTSSFSSSGEVTNNFAQSGPVEKFASPNTVPTSYAADFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE  YWPSHDHNFSGP
Sbjct: 121 AQPAPLPCQREGTYWPSHDHNFSGP 145
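
The percentages on each HSP summary line are the matched-residue counts divided by the alignment length. The following is a minimal sketch that recomputes them from the fractions; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the line layout shown in this report.

```python
import re

def parse_hsp_stats(line):
    """Recompute identity and positives percentages from 'n/len' fractions."""
    # 'Posi?tives' also accepts the variant spelling 'Postives'.
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    stats = {}
    for label, num, den in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100 * int(num) / int(den), 2)
    return stats

line = "Identity = 136/145 (93.79%), Positives = 139/145 (95.86%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 93.79, 'positives': 95.86}
```

Here 136 identical positions over 145 aligned columns give 93.79%, and 139 conservative-or-identical positions give 95.86%.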

BLAST of Tan0018702 vs. NCBI nr
Match: KAG6600722.1 (hypothetical protein SDJN03_05955, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 277.7 bits (709), Expect = 5.7e-71
Identity = 136/145 (93.79%), Positives = 139/145 (95.86%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI
Sbjct: 86  MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 145

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRPTS FSSS EVTNN A SGPVEKFASPNTVPTSYA+D+SVLMPGQHIPTFI
Sbjct: 146 MAIFEHLFRPTSSFSSSGEVTNNFAQSGPVEKFASPNTVPTSYAADFSVLMPGQHIPTFI 205

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE  YWPSHDHNFSGP
Sbjct: 206 AQPAPLPCQREGTYWPSHDHNFSGP 230

BLAST of Tan0018702 vs. NCBI nr
Match: XP_004133716.1 (uncharacterized protein LOC101206733 isoform X1 [Cucumis sativus] >KGN56252.1 hypothetical protein Csa_010216 [Cucumis sativus])

HSP 1 Score: 275.8 bits (704), Expect = 2.1e-70
Identity = 134/145 (92.41%), Positives = 140/145 (96.55%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFL+
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLV 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRP+SPFSSSDEVTNNS++S P EKFASPNTV TSYASD+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPSSPFSSSDEVTNNSSESTPAEKFASPNTVSTSYASDFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE IYWPSH HNFSGP
Sbjct: 121 AQPAPLPCQREGIYWPSHRHNFSGP 145

BLAST of Tan0018702 vs. NCBI nr
Match: XP_023540824.1 (uncharacterized protein LOC111801082 [Cucurbita pepo subsp. pepo] >XP_023540834.1 uncharacterized protein LOC111801082 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 274.2 bits (700), Expect = 6.3e-70
Identity = 134/145 (92.41%), Positives = 138/145 (95.17%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQ GIGQEAPWKNFGSSMNAISFGFVATAILISMFLI
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQRGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRPTS FSSS EVTNN A SGPVEKF+SPNTVPTSYA+D+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPTSSFSSSGEVTNNFAQSGPVEKFSSPNTVPTSYAADFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE  YWPSHDHNFSGP
Sbjct: 121 AQPAPLPCQREGTYWPSHDHNFSGP 145

BLAST of Tan0018702 vs. NCBI nr
Match: XP_022942533.1 (uncharacterized protein LOC111447541 [Cucurbita moschata] >XP_022942534.1 uncharacterized protein LOC111447541 [Cucurbita moschata])

HSP 1 Score: 274.2 bits (700), Expect = 6.3e-70
Identity = 134/145 (92.41%), Positives = 138/145 (95.17%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQ+APWKNFGSSMNAISFGFVATAILISMFLI
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQDAPWKNFGSSMNAISFGFVATAILISMFLI 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRPTS FSSS EVTNN A SGPVEKFASPNTVPTSYA+D+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPTSSFSSSGEVTNNFAQSGPVEKFASPNTVPTSYAADFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE  YWPSHDHNF GP
Sbjct: 121 AQPAPLPCQREGTYWPSHDHNFLGP 145

BLAST of Tan0018702 vs. ExPASy TrEMBL
Match: A0A6J1J0H1 (uncharacterized protein LOC111482330 OS=Cucurbita maxima OX=3661 GN=LOC111482330 PE=4 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 2.7e-71
Identity = 136/145 (93.79%), Positives = 139/145 (95.86%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRPTS FSSS EVTNN A SGPVEKFASPNTVPTSYA+D+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPTSSFSSSGEVTNNFAQSGPVEKFASPNTVPTSYAADFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE  YWPSHDHNFSGP
Sbjct: 121 AQPAPLPCQREGTYWPSHDHNFSGP 145

BLAST of Tan0018702 vs. ExPASy TrEMBL
Match: A0A0A0L6N7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G110010 PE=4 SV=1)

HSP 1 Score: 275.8 bits (704), Expect = 1.0e-70
Identity = 134/145 (92.41%), Positives = 140/145 (96.55%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFL+
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLV 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRP+SPFSSSDEVTNNS++S P EKFASPNTV TSYASD+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPSSPFSSSDEVTNNSSESTPAEKFASPNTVSTSYASDFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE IYWPSH HNFSGP
Sbjct: 121 AQPAPLPCQREGIYWPSHRHNFSGP 145

BLAST of Tan0018702 vs. ExPASy TrEMBL
Match: A0A6J1FV27 (uncharacterized protein LOC111447541 OS=Cucurbita moschata OX=3662 GN=LOC111447541 PE=4 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 3.0e-70
Identity = 134/145 (92.41%), Positives = 138/145 (95.17%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQ+APWKNFGSSMNAISFGFVATAILISMFLI
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQDAPWKNFGSSMNAISFGFVATAILISMFLI 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRPTS FSSS EVTNN A SGPVEKFASPNTVPTSYA+D+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPTSSFSSSGEVTNNFAQSGPVEKFASPNTVPTSYAADFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE  YWPSHDHNF GP
Sbjct: 121 AQPAPLPCQREGTYWPSHDHNFLGP 145

BLAST of Tan0018702 vs. ExPASy TrEMBL
Match: A0A1S4DZ08 (uncharacterized protein LOC103493322 OS=Cucumis melo OX=3656 GN=LOC103493322 PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.6e-69
Identity = 134/145 (92.41%), Positives = 139/145 (95.86%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFL+
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLV 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRP+SPFSSSDEVTNNS +S P EKFASPNTV TSYASD+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPSSPFSSSDEVTNNS-ESTPAEKFASPNTVSTSYASDFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE IYWPSH HNFSGP
Sbjct: 121 AQPAPLPCQREGIYWPSHHHNFSGP 144

BLAST of Tan0018702 vs. ExPASy TrEMBL
Match: A0A5D3BTG7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold18G00690 PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.6e-69
Identity = 134/145 (92.41%), Positives = 139/145 (95.86%), Query Frame = 0

Query: 1   MSEVGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLI 60
           MSE+GSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFL+
Sbjct: 1   MSEIGSERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLV 60

Query: 61  MAIFEHLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFI 120
           MAIFEHLFRP+SPFSSSDEVTNNS +S P EKFASPNTV TSYASD+SVLMPGQHIPTFI
Sbjct: 61  MAIFEHLFRPSSPFSSSDEVTNNS-ESTPAEKFASPNTVSTSYASDFSVLMPGQHIPTFI 120

Query: 121 AQPAPLPCQRERIYWPSHDHNFSGP 146
           AQPAPLPCQRE IYWPSH HNFSGP
Sbjct: 121 AQPAPLPCQREGIYWPSHHHNFSGP 144

BLAST of Tan0018702 vs. TAIR 10
Match: AT1G09812.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G58007.2); Has 93 Blast hits to 93 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 93; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 116.3 bits (290), Expect = 2.0e-26
Identity = 71/132 (53.79%), Positives = 88/132 (66.67%), Query Frame = 0

Query: 8   RSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLIMAIFEHL 67
           +S SW+IY +P +  S       E PW++  +SMNAISFGFVATAILISMFLIMAIFEHL
Sbjct: 3   KSSSWSIY-SPREGDS-------EGPWRS-STSMNAISFGFVATAILISMFLIMAIFEHL 62

Query: 68  FRP-TSPFSSSDEVTNNSAD-SGPVEKFA-SPNTVPTSYASDYSVLMPGQHIPTFIAQPA 127
           FRP  S F S  ++     D S   +K A   + VP S   D SV+MPG+ +P+ IA PA
Sbjct: 63  FRPENSSFDSPHQIRQRQRDGSSQFQKLADQASMVPVSTVVDVSVVMPGEKLPSHIALPA 122

Query: 128 PLPCQRERIYWP 137
           PLPC+RE I+WP
Sbjct: 123 PLPCRREGIHWP 125

BLAST of Tan0018702 vs. TAIR 10
Match: AT1G58007.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G09812.1); Has 91 Blast hits to 91 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 91; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 105.9 bits (263), Expect = 2.7e-23
Identity = 66/133 (49.62%), Positives = 80/133 (60.15%), Query Frame = 0

Query: 6   SERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLIMAIFE 65
           S   ++W+IY T D S                GSSMNAISFGFVATAILI MF+IMAI E
Sbjct: 4   SRSIRTWSIYRTKDTS----------------GSSMNAISFGFVATAILILMFIIMAILE 63

Query: 66  HLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFIAQPAP 125
           HLFR  S  SSS +V     DS   +K A   ++     SD SV+MPG  +P+++A PAP
Sbjct: 64  HLFR--SDHSSSYDVD----DSSQFQKLAEKASMVPVTTSDVSVVMPGDKLPSYVALPAP 114

Query: 126 LPCQRERIYWPSH 139
            PC+RE I WPSH
Sbjct: 124 FPCRREGIRWPSH 114

BLAST of Tan0018702 vs. TAIR 10
Match: AT1G58007.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G09812.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 105.9 bits (263), Expect = 2.7e-23
Identity = 66/133 (49.62%), Positives = 80/133 (60.15%), Query Frame = 0

Query: 6   SERSKSWNIYTTPDQSPSSQTGIGQEAPWKNFGSSMNAISFGFVATAILISMFLIMAIFE 65
           S   ++W+IY T D S                GSSMNAISFGFVATAILI MF+IMAI E
Sbjct: 4   SRSIRTWSIYRTKDTS----------------GSSMNAISFGFVATAILILMFIIMAILE 63

Query: 66  HLFRPTSPFSSSDEVTNNSADSGPVEKFASPNTVPTSYASDYSVLMPGQHIPTFIAQPAP 125
           HLFR  S  SSS +V     DS   +K A   ++     SD SV+MPG  +P+++A PAP
Sbjct: 64  HLFR--SDHSSSYDVD----DSSQFQKLAEKASMVPVTTSDVSVVMPGDKLPSYVALPAP 114

Query: 126 LPCQRERIYWPSH 139
            PC+RE I WPSH
Sbjct: 124 FPCRREGIRWPSH 114

BLAST of Tan0018702 vs. TAIR 10
Match: AT1G11120.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 10 plant structures; EXPRESSED DURING: 4 anthesis, F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G28170.1); Has 94 Blast hits to 94 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 97.8 bits (242), Expect = 7.5e-21
Identity = 58/124 (46.77%), Positives = 76/124 (61.29%), Query Frame = 0

Query: 30  QEAPWKN-FGSSMNAISFGFVATAILISMFLIMAIFEHLFRPTSPFSSSDEVTNN----S 89
           +E PWK+ F  S+NA+SFGFVATAILISMFL+MAIFE L R T+  +++ + +++     
Sbjct: 26  REEPWKSQFDDSVNAVSFGFVATAILISMFLVMAIFERLIRTTTTSTTNSDSSSSRVLPG 85

Query: 90  ADS-----GPVEKFASPNTVPTSYASDYSVLMPGQHIPTFIAQPAPLPCQRERIYWPSHD 144
            DS     G   K    +   T Y++  SVLMPG  IPTFIA PAP+PC  + I    H 
Sbjct: 86  MDSRVGFNGAATKLGYQSPKMTVYSNGVSVLMPGDDIPTFIAHPAPVPCPPQNISQSQHQ 145

BLAST of Tan0018702 vs. TAIR 10
Match: AT1G11120.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G09812.1); Has 270 Blast hits to 255 proteins in 62 species: Archae - 0; Bacteria - 2; Metazoa - 93; Fungi - 14; Plants - 126; Viruses - 0; Other Eukaryotes - 35 (source: NCBI BLink). )

HSP 1 Score: 86.7 bits (213), Expect = 1.7e-17
Identity = 61/150 (40.67%), Positives = 79/150 (52.67%), Query Frame = 0

Query: 30  QEAPWKN-FGSSMNAISFGFVATAILISMFLIMAIFEHLFRPTSPFSSSDEVTNN----S 89
           +E PWK+ F  S+NA+SFGFVATAILISMFL+MAIFE L R T+  +++ + +++     
Sbjct: 26  REEPWKSQFDDSVNAVSFGFVATAILISMFLVMAIFERLIRTTTTSTTNSDSSSSRVLPG 85

Query: 90  ADS-----GPVEKF------ASPNTV--------------------PTSYASDYSVLMPG 144
            DS     G   K       AS N +                     T Y++  SVLMPG
Sbjct: 86  MDSRVGFNGAATKLGYQSPKASNNRIVFSFSFNILKSELFIVIILRMTVYSNGVSVLMPG 145

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name | E-value | Identity | Description
XP_022983836.1 | 5.7e-71 | 93.79 | uncharacterized protein LOC111482330 [Cucurbita maxima] >XP_022983844.1 uncharac...
KAG6600722.1 | 5.7e-71 | 93.79 | hypothetical protein SDJN03_05955, partial [Cucurbita argyrosperma subsp. sorori...
XP_004133716.1 | 2.1e-70 | 92.41 | uncharacterized protein LOC101206733 isoform X1 [Cucumis sativus] >KGN56252.1 hy...
XP_023540824.1 | 6.3e-70 | 92.41 | uncharacterized protein LOC111801082 [Cucurbita pepo subsp. pepo] >XP_023540834....
XP_022942533.1 | 6.3e-70 | 92.41 | uncharacterized protein LOC111447541 [Cucurbita moschata] >XP_022942534.1 unchar...

vs. ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1J0H1 | 2.7e-71 | 93.79 | uncharacterized protein LOC111482330 OS=Cucurbita maxima OX=3661 GN=LOC111482330...
A0A0A0L6N7 | 1.0e-70 | 92.41 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G110010 PE=4 SV=1
A0A6J1FV27 | 3.0e-70 | 92.41 | uncharacterized protein LOC111447541 OS=Cucurbita moschata OX=3662 GN=LOC1114475...
A0A1S4DZ08 | 2.6e-69 | 92.41 | uncharacterized protein LOC103493322 OS=Cucumis melo OX=3656 GN=LOC103493322 PE=...
A0A5D3BTG7 | 2.6e-69 | 92.41 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

vs. TAIR 10
Match Name | E-value | Identity | Description
AT1G09812.1 | 2.0e-26 | 53.79 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G58007.1 | 2.7e-23 | 49.62 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G58007.2 | 2.7e-23 | 49.62 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G11120.2 | 7.5e-21 | 46.77 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G11120.1 | 1.7e-17 | 40.67 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 73..97
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..31
None | No IPR available | PANTHER | PTHR33728 | CTTNBP 2 AMINO-TERMINAL-LIKE PROTEIN | coord: 1..143
None | No IPR available | PANTHER | PTHR33728:SF3 | MULTIDRUG RESISTANCE PROTEIN | coord: 1..143

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0018702.1 | Tan0018702.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane