Tan0002506 (gene) Snake gourd v1

Overview
NameTan0002506
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein CDI-like
LocationLG07: 6607360 .. 6609435 (-)
RNA-Seq ExpressionTan0002506
SyntenyTan0002506
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCGTCTTCCATTGCAAGGAGCGTACCCGCTCATTTCTCTCTCCCTTGTTCTTCTCTCTCGCAGTCGATTTAGAACGCTCCCTAAATTTCTTGGACCAAGATTCTCTTCTCAAATCTACAAATATCAGGACCAAAGCCGATTCAATTTGACTGTTTTCTTCCTCCTTCAAGATCTACATTTGCTTCCTTCTTGGTAAACTTCTCGCCTTCATTTCTCCCCTCTTTTCTGTCCGCAGTTCAATCAGATCTGATGCATTTTCATTGTTTTTGTTACATATCTTTGAATTTTCTTGTTCGTCTTCTCGAACGATCACTCGTTACGATTTGAATCTCACAAGCTTATGATATATCTATGGGTTTTTATTTTTAATTGAGACTTCGGAAATGTGTCTGTGTTTGTGTTGTTTCTAATTCGAAATCTTTAAGGTTGTGGGTTTATTTTGTATGGCTCTAATTGAATCTCTCTGTTTCTTTTAGTTTGCACCCTTGAAGATTACTGTTACTCTGGGATTTTGCTTTTTTTTTCTTGTGTTGAGAATGTATGTGACCTATGGTATATGGGGTCTTCTAATAAAGAGAAATAATGCTAGAACGGGAAGTCTTGTGTTAGAATCCTGCAAGTTGTTTATTTGGAATTGATTTTATTTGTTTGATTTTGTGAATTGGAAATCTGGGTCCAGGGATGGTGATTTGATACTTTATGGAATTTTAATGTTTGTGGTTGTTAATGGTTTCTGTGTTTTCGTTTAGGTTTTATTGTTTGAGTTTCTATATCCTTTTTTTATCATTTGGTGGCTAAAATTGAGTTGAATTACTGCAATTTTAGCATTGGAGAGTGTTGAATCTGAAGATTTAGTGCCATAACCATCGGCTTCCAATGGGTTCTTGTAATGGAGAGACTCATCCTGCTGTTGGAGATGTGGAGCAACCATTTAGGGTCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCTTATGAAGTCTGTCGCCATTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATTCCAATCAAGCAGGCAGATCTGAGAAAGAATAGTGTCTATTGGCGTGAGAGAGGAAAATTCGAAAGCACCGAGTTCTCGTTTTCCCGGTTCTTAACTCCGTATTTGGCGAATTATGAAGGATGGGCAATGTTTGTGGACTGTGATTTTCTGTATCTGGCTGACATTAAGGAACTAAGGGACTTGATTGACAATAAGTATGCAGTTATGTGTGTCCACCATGATTACACACCAAAAGAAACTACAAAAATGGATGGTGCAGTTCAAACTGTGTACCCAAGGAAGAACTGGTCTTCAATGGTTTTGTACAACTGTGGCCATCCAAAGAACAAGGTCTTGACACCTGAGGCTGTCAACACCCAAACTGGTGCTTTTCTTCATAGGTTCCAATGGCTTGAAGATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAGGGCCATAACAAGAGTGTGGAGGGTGATTCAACCACTCTTCCTAAAGCAATCCACTACACTCGTGGTGGGCCGTGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATCAGAAGGAGGCCAAGAAGAAATCTGAAGAGTAGATGAGATAAAGCAGCGCATGAAAGTAAGTTCTGCTTGAGGTTTAATGGTGGGATTGAAATTCTTTAGACTTTTGCTTAATATTCTTTAGATTGATTTGGTTGAATTGTAGAGTTTATTTTGGAATATCTGATCGTGCATGAAGTTCTTGAAAATGGCGCTGCTGTGTATTTGTAGTAGTTTAGTCCTAATATTGCAGCAACATGTATTATTTTATAGCAGTACTATTTTGCTATCCTTCTATAAATTTGCGTGGGCACTTGTACCTAGCTCAAGTGGTCAGACATTAGTGGTATATTTTCTAATTATTCACTAAGTTATAGGTCGGAATCTCCTTCCTCATGTATTATGTAATGTAATATCGTATAAGAAAAAAACTATGCGACGAATGTAAAATTTTAGCTTGTAATTGCAATACTCAGTTTATTAATTGAGAAGA

mRNA sequence

CCCGTCTTCCATTGCAAGGAGCGTACCCGCTCATTTCTCTCTCCCTTGTTCTTCTCTCTCGCAGTCGATTTAGAACGCTCCCTAAATTTCTTGGACCAAGATTCTCTTCTCAAATCTACAAATATCAGGACCAAAGCCGATTCAATTTGACTGTTTTCTTCCTCCTTCAAGATCTACATTTGCTTCCTTCTTGCATTGGAGAGTGTTGAATCTGAAGATTTAGTGCCATAACCATCGGCTTCCAATGGGTTCTTGTAATGGAGAGACTCATCCTGCTGTTGGAGATGTGGAGCAACCATTTAGGGTCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCTTATGAAGTCTGTCGCCATTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATTCCAATCAAGCAGGCAGATCTGAGAAAGAATAGTGTCTATTGGCGTGAGAGAGGAAAATTCGAAAGCACCGAGTTCTCGTTTTCCCGGTTCTTAACTCCGTATTTGGCGAATTATGAAGGATGGGCAATGTTTGTGGACTGTGATTTTCTGTATCTGGCTGACATTAAGGAACTAAGGGACTTGATTGACAATAAGTATGCAGTTATGTGTGTCCACCATGATTACACACCAAAAGAAACTACAAAAATGGATGGTGCAGTTCAAACTGTGTACCCAAGGAAGAACTGGTCTTCAATGGTTTTGTACAACTGTGGCCATCCAAAGAACAAGGTCTTGACACCTGAGGCTGTCAACACCCAAACTGGTGCTTTTCTTCATAGGTTCCAATGGCTTGAAGATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAGGGCCATAACAAGAGTGTGGAGGGTGATTCAACCACTCTTCCTAAAGCAATCCACTACACTCGTGGTGGGCCGTGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATCAGAAGGAGGCCAAGAAGAAATCTGAAGAGTAGATGAGATAAAGCAGCGCATGAAAGTAAGTTCTGCTTGAGGTTTAATGGTGGGATTGAAATTCTTTAGACTTTTGCTTAATATTCTTTAGATTGATTTGGTTGAATTGTAGAGTTTATTTTGGAATATCTGATCGTGCATGAAGTTCTTGAAAATGGCGCTGCTGTGTATTTGTAGTAGTTTAGTCCTAATATTGCAGCAACATGTATTATTTTATAGCAGTACTATTTTGCTATCCTTCTATAAATTTGCGTGGGCACTTGTACCTAGCTCAAGTGGTCAGACATTAGTGGTATATTTTCTAATTATTCACTAAGTTATAGGTCGGAATCTCCTTCCTCATGTATTATGTAATGTAATATCGTATAAGAAAAAAACTATGCGACGAATGTAAAATTTTAGCTTGTAATTGCAATACTCAGTTTATTAATTGAGAAGA

Coding sequence (CDS)

ATGGGTTCTTGTAATGGAGAGACTCATCCTGCTGTTGGAGATGTGGAGCAACCATTTAGGGTCTTTGTGGGCTATGATGTTCGTGAAGATCTTGCTTATGAAGTCTGTCGCCATTCCATCTTGAAGCGATCTTCAATCCCTGTTGAGATCATTCCAATCAAGCAGGCAGATCTGAGAAAGAATAGTGTCTATTGGCGTGAGAGAGGAAAATTCGAAAGCACCGAGTTCTCGTTTTCCCGGTTCTTAACTCCGTATTTGGCGAATTATGAAGGATGGGCAATGTTTGTGGACTGTGATTTTCTGTATCTGGCTGACATTAAGGAACTAAGGGACTTGATTGACAATAAGTATGCAGTTATGTGTGTCCACCATGATTACACACCAAAAGAAACTACAAAAATGGATGGTGCAGTTCAAACTGTGTACCCAAGGAAGAACTGGTCTTCAATGGTTTTGTACAACTGTGGCCATCCAAAGAACAAGGTCTTGACACCTGAGGCTGTCAACACCCAAACTGGTGCTTTTCTTCATAGGTTCCAATGGCTTGAAGATGATGAAATTGGGTCAGTCCCATTTGTTTGGAACTTTCTTGAGGGCCATAACAAGAGTGTGGAGGGTGATTCAACCACTCTTCCTAAAGCAATCCACTACACTCGTGGTGGGCCGTGGTTTGAAGCTTGGAAGAATTGTGAATTTGCAGATCTCTGGCTGAAAGAAATGGAGGAGTATCAGAAGGAGGCCAAGAAGAAATCTGAAGAGTAG

Protein sequence

MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRKNSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVMCVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQWLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEMEEYQKEAKKKSEE
Homology
BLAST of Tan0002506 vs. ExPASy Swiss-Prot
Match: Q9XIP8 (Protein CDI OS=Arabidopsis thaliana OX=3702 GN=CDI PE=2 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 2.0e-122
Identity = 198/251 (78.88%), Postives = 221/251 (88.05%), Query Frame = 0

Query: 2   GSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRKN 61
           G    ET       ++PFR+FVGYD REDLAY+VC HSI KRSSIPVEI PI Q+DLRK 
Sbjct: 6   GDVKSETCNNGSSEKKPFRIFVGYDPREDLAYQVCHHSITKRSSIPVEITPIIQSDLRKK 65

Query: 62  SVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVMC 121
            +YWRERG+ ESTEFSFSRFLTP+L++Y+GWAMFVDCDFLYLADIKEL DLID+KYA+MC
Sbjct: 66  GLYWRERGQLESTEFSFSRFLTPHLSDYQGWAMFVDCDFLYLADIKELTDLIDDKYAIMC 125

Query: 122 VHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQW 181
           V HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PE VNTQTGAFLHRFQW
Sbjct: 126 VQHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKTLSPEIVNTQTGAFLHRFQW 185

Query: 182 LEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEME 241
           LED+EIGS+PFVWNFLEGHN+ VE D TT PKA+HYTRGGPWF+AWK+CEFADLWL EME
Sbjct: 186 LEDEEIGSIPFVWNFLEGHNRVVEKDPTTQPKAVHYTRGGPWFDAWKDCEFADLWLNEME 245

Query: 242 EYQKEAKKKSE 253
           EY KE KK+++
Sbjct: 246 EYNKENKKEAD 256

BLAST of Tan0002506 vs. NCBI nr
Match: XP_038886869.1 (protein CDI [Benincasa hispida] >XP_038886870.1 protein CDI [Benincasa hispida])

HSP 1 Score: 519.6 bits (1337), Expect = 1.5e-143
Identity = 241/253 (95.26%), Postives = 248/253 (98.02%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCNGE HPAVGDVEQPFR+FVGYDVREDLAYEVCR+SI+KRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSCNGENHPAVGDVEQPFRIFVGYDVREDLAYEVCRYSIMKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           + VYWRERG+FESTEFSFSRFLTPYLANY+GWAMFVDCDFLYLADIKELRDLIDNK+AVM
Sbjct: 61  DGVYWRERGQFESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLIDNKFAVM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQ
Sbjct: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGSVPFVWNFLEGHNKSVEGD TTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEY KEAKKKSEE
Sbjct: 241 EEY-KEAKKKSEE 252

BLAST of Tan0002506 vs. NCBI nr
Match: XP_022958670.1 (protein CDI-like [Cucurbita moschata] >KAG6605714.1 Protein CDI, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 517.7 bits (1332), Expect = 5.7e-143
Identity = 238/253 (94.07%), Postives = 245/253 (96.84%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCNGETH AV  +EQPFR+FVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSCNGETHSAVEGLEQPFRIFVGYDVSEDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTPYLANY+GWAMFVDCDFLYLADIKELRDLIDNKYA+M
Sbjct: 61  NGVYWRERGQLESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLIDNKYAIM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ
Sbjct: 121 CVHHDYAPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGS+PFVWNFLEGHNKSVEGD +TLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSIPFVWNFLEGHNKSVEGDLSTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEAKKKSEE
Sbjct: 241 EEYQKEAKKKSEE 253

BLAST of Tan0002506 vs. NCBI nr
Match: XP_023533470.1 (protein CDI-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 516.2 bits (1328), Expect = 1.6e-142
Identity = 237/253 (93.68%), Postives = 244/253 (96.44%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCNGETH AV  +EQPFR+FVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSCNGETHSAVEGLEQPFRIFVGYDVSEDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N  YWRERG+ ESTEFSFSRFLTPYLANY+GWAMFVDCDFLYLADIKELRDLIDNKYA+M
Sbjct: 61  NGAYWRERGQLESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLIDNKYAIM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ
Sbjct: 121 CVHHDYAPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGS+PFVWNFLEGHNKSVEGD +TLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSIPFVWNFLEGHNKSVEGDLSTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEAKKKSEE
Sbjct: 241 EEYQKEAKKKSEE 253

BLAST of Tan0002506 vs. NCBI nr
Match: KAG7011466.1 (Protein CDI, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 512.3 bits (1318), Expect = 2.4e-141
Identity = 237/253 (93.68%), Postives = 243/253 (96.05%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCN E  PAVG+VEQPF++FVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLR 
Sbjct: 66  MGSCNEENLPAVGEVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRT 125

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNK+AVM
Sbjct: 126 NGVYWRERGQLESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKFAVM 185

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPE VNTQTGAFLHRFQ
Sbjct: 186 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKLLTPETVNTQTGAFLHRFQ 245

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGSVPFVWNFLEGHNKSVEGD TTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 246 WLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 305

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEA KKS E
Sbjct: 306 EEYQKEAVKKSAE 318

BLAST of Tan0002506 vs. NCBI nr
Match: XP_022995211.1 (protein CDI-like [Cucurbita maxima])

HSP 1 Score: 512.3 bits (1318), Expect = 2.4e-141
Identity = 236/253 (93.28%), Postives = 243/253 (96.05%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCNGETH AV  +EQPFR+FVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSCNGETHSAVEGLEQPFRIFVGYDVSEDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTP LANY+GWAMFVDCDFLYLADIKELRDLIDNKYA+M
Sbjct: 61  NGVYWRERGQLESTEFSFSRFLTPSLANYKGWAMFVDCDFLYLADIKELRDLIDNKYAIM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ
Sbjct: 121 CVHHDYAPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGS+PFVWNFLEGHNKSVEGD +TLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSIPFVWNFLEGHNKSVEGDLSTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEAK KSEE
Sbjct: 241 EEYQKEAKNKSEE 253

BLAST of Tan0002506 vs. ExPASy TrEMBL
Match: A0A6J1H454 (protein CDI-like OS=Cucurbita moschata OX=3662 GN=LOC111459824 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 2.7e-143
Identity = 238/253 (94.07%), Postives = 245/253 (96.84%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCNGETH AV  +EQPFR+FVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSCNGETHSAVEGLEQPFRIFVGYDVSEDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTPYLANY+GWAMFVDCDFLYLADIKELRDLIDNKYA+M
Sbjct: 61  NGVYWRERGQLESTEFSFSRFLTPYLANYKGWAMFVDCDFLYLADIKELRDLIDNKYAIM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ
Sbjct: 121 CVHHDYAPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGS+PFVWNFLEGHNKSVEGD +TLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSIPFVWNFLEGHNKSVEGDLSTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEAKKKSEE
Sbjct: 241 EEYQKEAKKKSEE 253

BLAST of Tan0002506 vs. ExPASy TrEMBL
Match: A0A6J1EY80 (protein CDI-like OS=Cucurbita moschata OX=3662 GN=LOC111439568 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.2e-141
Identity = 237/253 (93.68%), Postives = 243/253 (96.05%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCN E  PAVG+VEQPF++FVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLR 
Sbjct: 1   MGSCNEENLPAVGEVEQPFKIFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRT 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNK+AVM
Sbjct: 61  NGVYWRERGQLESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKFAVM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPE VNTQTGAFLHRFQ
Sbjct: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKLLTPETVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGSVPFVWNFLEGHNKSVEGD TTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEA KKS E
Sbjct: 241 EEYQKEAVKKSAE 253

BLAST of Tan0002506 vs. ExPASy TrEMBL
Match: A0A6J1K1D2 (protein CDI-like OS=Cucurbita maxima OX=3661 GN=LOC111490823 PE=4 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.2e-141
Identity = 236/253 (93.28%), Postives = 243/253 (96.05%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCNGETH AV  +EQPFR+FVGYDV EDLAYEVCRHSILKRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSCNGETHSAVEGLEQPFRIFVGYDVSEDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTP LANY+GWAMFVDCDFLYLADIKELRDLIDNKYA+M
Sbjct: 61  NGVYWRERGQLESTEFSFSRFLTPSLANYKGWAMFVDCDFLYLADIKELRDLIDNKYAIM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDY PKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ
Sbjct: 121 CVHHDYAPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGS+PFVWNFLEGHNKSVEGD +TLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSIPFVWNFLEGHNKSVEGDLSTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEAK KSEE
Sbjct: 241 EEYQKEAKNKSEE 253

BLAST of Tan0002506 vs. ExPASy TrEMBL
Match: A0A6J1I857 (protein CDI-like OS=Cucurbita maxima OX=3661 GN=LOC111471810 PE=4 SV=1)

HSP 1 Score: 508.8 bits (1309), Expect = 1.3e-140
Identity = 235/253 (92.89%), Postives = 242/253 (95.65%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGSCN E  PAVG+VEQPF++FVGYDVREDLA+EVCRHSILKRSSIPVEIIPIKQADLR 
Sbjct: 1   MGSCNEENLPAVGEVEQPFKIFVGYDVREDLAFEVCRHSILKRSSIPVEIIPIKQADLRT 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNK+AVM
Sbjct: 61  NGVYWRERGQLESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKFAVM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK+LTPE VNTQTGAFLHRFQ
Sbjct: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKLLTPETVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLEDDEIGSVPFVWNFLEGHN SVEGD TTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDDEIGSVPFVWNFLEGHNNSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEYQKEA KKS E
Sbjct: 241 EEYQKEAVKKSAE 253

BLAST of Tan0002506 vs. ExPASy TrEMBL
Match: A0A0A0LKJ4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G915190 PE=4 SV=1)

HSP 1 Score: 506.5 bits (1303), Expect = 6.3e-140
Identity = 236/253 (93.28%), Postives = 244/253 (96.44%), Query Frame = 0

Query: 1   MGSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRK 60
           MGS NGE  PAVGD EQPFR+FVGYDVREDLAY+VCRHSILKRSSIPVEIIPIKQADLRK
Sbjct: 1   MGSSNGENRPAVGD-EQPFRIFVGYDVREDLAYQVCRHSILKRSSIPVEIIPIKQADLRK 60

Query: 61  NSVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVM 120
           N VYWRERG+ ESTEFSFSRFLTPYLAN++GWAMFVDCDFLYLADIKELRDLIDNK+AVM
Sbjct: 61  NGVYWRERGQTESTEFSFSRFLTPYLANFKGWAMFVDCDFLYLADIKELRDLIDNKFAVM 120

Query: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQ 180
           CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPE VNTQTGAFLHRFQ
Sbjct: 121 CVHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEIVNTQTGAFLHRFQ 180

Query: 181 WLEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240
           WLED+EIGSVPFVWNFLEGHNKSVEGD TTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM
Sbjct: 181 WLEDNEIGSVPFVWNFLEGHNKSVEGDLTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEM 240

Query: 241 EEYQKEAKKKSEE 254
           EEY KEA+KKSEE
Sbjct: 241 EEYNKEAEKKSEE 252

BLAST of Tan0002506 vs. TAIR 10
Match: AT1G64980.1 (Nucleotide-diphospho-sugar transferases superfamily protein )

HSP 1 Score: 439.9 bits (1130), Expect = 1.4e-123
Identity = 198/251 (78.88%), Postives = 221/251 (88.05%), Query Frame = 0

Query: 2   GSCNGETHPAVGDVEQPFRVFVGYDVREDLAYEVCRHSILKRSSIPVEIIPIKQADLRKN 61
           G    ET       ++PFR+FVGYD REDLAY+VC HSI KRSSIPVEI PI Q+DLRK 
Sbjct: 6   GDVKSETCNNGSSEKKPFRIFVGYDPREDLAYQVCHHSITKRSSIPVEITPIIQSDLRKK 65

Query: 62  SVYWRERGKFESTEFSFSRFLTPYLANYEGWAMFVDCDFLYLADIKELRDLIDNKYAVMC 121
            +YWRERG+ ESTEFSFSRFLTP+L++Y+GWAMFVDCDFLYLADIKEL DLID+KYA+MC
Sbjct: 66  GLYWRERGQLESTEFSFSRFLTPHLSDYQGWAMFVDCDFLYLADIKELTDLIDDKYAIMC 125

Query: 122 VHHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKVLTPEAVNTQTGAFLHRFQW 181
           V HDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNK L+PE VNTQTGAFLHRFQW
Sbjct: 126 VQHDYTPKETTKMDGAVQTVYPRKNWSSMVLYNCGHPKNKTLSPEIVNTQTGAFLHRFQW 185

Query: 182 LEDDEIGSVPFVWNFLEGHNKSVEGDSTTLPKAIHYTRGGPWFEAWKNCEFADLWLKEME 241
           LED+EIGS+PFVWNFLEGHN+ VE D TT PKA+HYTRGGPWF+AWK+CEFADLWL EME
Sbjct: 186 LEDEEIGSIPFVWNFLEGHNRVVEKDPTTQPKAVHYTRGGPWFDAWKDCEFADLWLNEME 245

Query: 242 EYQKEAKKKSE 253
           EY KE KK+++
Sbjct: 246 EYNKENKKEAD 256

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9XIP82.0e-12278.88Protein CDI OS=Arabidopsis thaliana OX=3702 GN=CDI PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_038886869.11.5e-14395.26protein CDI [Benincasa hispida] >XP_038886870.1 protein CDI [Benincasa hispida][more]
XP_022958670.15.7e-14394.07protein CDI-like [Cucurbita moschata] >KAG6605714.1 Protein CDI, partial [Cucurb... [more]
XP_023533470.11.6e-14293.68protein CDI-like [Cucurbita pepo subsp. pepo][more]
KAG7011466.12.4e-14193.68Protein CDI, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022995211.12.4e-14193.28protein CDI-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1H4542.7e-14394.07protein CDI-like OS=Cucurbita moschata OX=3662 GN=LOC111459824 PE=4 SV=1[more]
A0A6J1EY801.2e-14193.68protein CDI-like OS=Cucurbita moschata OX=3662 GN=LOC111439568 PE=4 SV=1[more]
A0A6J1K1D21.2e-14193.28protein CDI-like OS=Cucurbita maxima OX=3661 GN=LOC111490823 PE=4 SV=1[more]
A0A6J1I8571.3e-14092.89protein CDI-like OS=Cucurbita maxima OX=3661 GN=LOC111471810 PE=4 SV=1[more]
A0A0A0LKJ46.3e-14093.28Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G915190 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G64980.11.4e-12378.88Nucleotide-diphospho-sugar transferases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 233..253
NoneNo IPR availablePANTHERPTHR35105:SF5SUBFAMILY NOT NAMEDcoord: 10..252
NoneNo IPR availablePANTHERPTHR35105EXPRESSED PROTEINcoord: 10..252
IPR029044Nucleotide-diphospho-sugar transferasesGENE3D3.90.550.10Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain Acoord: 16..252
e-value: 1.8E-8
score: 35.8
IPR029044Nucleotide-diphospho-sugar transferasesSUPERFAMILY53448Nucleotide-diphospho-sugar transferasescoord: 29..243

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002506.1Tan0002506.1mRNA