Sed0009033 (gene) Chayote v1

Overview
Name: Sed0009033
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Fe2OG dioxygenase domain-containing protein
Location: LG03: 23238069 .. 23240753 (+)
RNA-Seq Expression: Sed0009033
Synteny: Sed0009033
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTAATTTGGTGGAAATTGTTACGCCTCCTCGGCCAACTCGGCGGCGACCGAAACTTGCGGATTAAGCTCCCAAAATCGTATTCCTCCTCCGACGATTCCATTTCCGACTCTTGGGTTCGTTCCCAAATTTTGAAACCTATAAACAATCTCTGTACAACCCATTATTCCCAATGTTGTTGATCCGTACAATTCCCGTCTCGCCCTCGCCGTCGTTGAATCTTCTTCATCGGCTTTTGCTCGCTCAGTCTCGATTTCAGCCCATGGATTCGTTTGCCAGTTCAGCAAATTCCCGCGTGAGCGTTTAAATTTTAATAATCGACGAAGAATTTGAGATTTATAGTCGTTTTGTTTTTCTTACGAGTTATGAATGTATGGCAGTTAATTTAGAAACTGTTAATTCCTATTACAGAATCTAGCACTAATACTTCCTTTTTACACTCTTCTTTGTTGAACTGCTTAATTGAATTTTGAATTCCAATTAATCTCCAGGAAATACCTGATCTGCCTTGTTGTGGTAGTTCTTGTGGCGCCAACTTGCATGGTAGAGATCATAATTCAAATGTGATAATGATAGGAACAGTTCCTGTGAATCTGAATCACAAGGTCAGTAAACAGACATCTTTGTCTCGGTTGCCTGTTGATAAAAGTGATGATTTCGAGTCGAGAAGAGATCAAAAGGGGATTTCTCCAAATGTACCCAGTTCTTACTATGATTTTCCACCTGTTTCTCCTTCCAAAAGAAGAAACCGAATCGATTTAGGATTTGAAAGAAGTTTGAAGAGTAATACAAGAACAACTCATGTGGATGAATCATCCTTGCCTAATCAATTTGGAAAGAAAAATGGATCGTATTTTCCTTATAACTGCTGGCCTGTGGATATCGATTCCAAAAGTTATCTATTTACTGACAATTTGCATCCCTTTGAACCATTTGATATATGTTCTTCAAAAAGAAGAGGTAAAGCAAAATCCGGAGGTCATTGGCAGGTTAAAGACAATGGGAAAGTTATGGAGCATGCTGTAGAAGCTAAAAATAATACAGTGTTGAGGCCTGGAATGGTTTTATTGAAGCACTACATTAGTCTACATAAACAGGTACCTTCTTCCTTAGATTGTTGATATTTATATGAACTTAGGTTCACCACTTTTTTTGAATGGGAAGAAGTATTTTGTTGCCATGAAGTCCCATATTGGGATTTTGGACTATAATCTGTCTCTAACAAGTGAAACAACTTGACTTTTGGTAATTATAAAAATTCAATGTTTATCTATTGTTAAAAAGGTGCCCTGTAGGTTCAACTTTTTGCTGCTTTACATTGCTTGATATGAAAGAGTAGAATTTATGAGTTGGCCTAGCGACCAAGAGGGTCTTATACAACTTAGAGGTTATGGAGTTTATCAGATTATCATGCTGATATTTAGTTATTACTTAGACATTTAGCATTCCAACTGTATAGCTTAGTTGCTTATACTCTAGTTTTTTCATACTTGTACCCTGTTCATAATTACTATGGTCAGTGAACAGTGTACTATTTAAAGGAGTTTTGCCTACTCTGAATAATATATGAGATTATATCTAAAAGAGTTCAATTTATGGTAACCACCTACCTAGAAATTAAATTCTTACCAGTCTTCTTGAAAACCAAATATTGTAGAGTTTGGTGATTTGTCTCATGTAAATAGCCGAGCTGCGTATAAGCTTACCAGGGCACTCATGGTTATCAAAAGAAAATGTTGTTCTCTTTATCCAAGTTGTAAGTACGTATCAAAGAATTCTCTTGTTGTAGGTCAGTTTAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCCATGGGGGTTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTCAGATGATGTGCCTTGGATTGGATTGGGATCCTCAAGCAAGGAAATATGAACAAAACCGGGCTGTTGATGGTAATAAACCACCAAAAATACCTCCTGAATTCGCAGTTCTGGTTAAAGAAGCA
CTTAAATGTGCACATGCCTTGATCAAGAACAACGGCAATACAAATAACGTAGAAGACACACTTCCATCAATGTCTCCTGATATATGCATTGTGAATTTCTACTCGACAAGTGGAAGACTGGGTTTGCATCAGGTTTGTGCTTGCCTTTAGTATCACAAATATAATCAAATGAATTCAAGTGCAGTACAATGATTCAGTTTTTCGAGCACAATGCTGATTCCATACCGACATGTTCTTTAGTCCATTTTTAATTCACATGATAAAGGTTTTCTAACCAGTAGAGTATCTGCAGGATCGTGATGAAAGCAAAGAGAGTCTCGTTAACGGACTACCGGTCGTCTCGTTTTCTTTAGGCAATTCAGCAGAATTCTTGTATGGAGATCGAAGAGATGTCGATATTGCAGAGAAGATTGTACTGGAATCAGGCGATGTTCTAATATTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCAACGCCTAAGTTGTTGCTTGATCATACGGGTCTTCGACCTGGGCGTCTAAATCTTACCTTTAGAAAGTATTAGAACATAGATGTGTCTGTTTTTAGGTCTCGTTTGTCATGTACACATGAAATGGCTGTTGTATTTCTTCATCACATATTTATCAAAATGAAATATCGCAAGTCTAAAATGTAGGCCCCG

mRNA sequence

TTAATTTGGTGGAAATTGTTACGCCTCCTCGGCCAACTCGGCGGCGACCGAAACTTGCGGATTAAGCTCCCAAAATCGTATTCCTCCTCCGACGATTCCATTTCCGACTCTTGGGTTCGTTCCCAAATTTTGAAACCTATAAACAATCTCTGTACAACCCATTATTCCCAATGTTGTTGATCCGTACAATTCCCGTCTCGCCCTCGCCGTCGTTGAATCTTCTTCATCGGCTTTTGCTCGCTCAGTCTCGATTTCAGCCCATGGATTCGTTTGCCAGTTCAGCAAATTCCCGCGAAATACCTGATCTGCCTTGTTGTGGTAGTTCTTGTGGCGCCAACTTGCATGGTAGAGATCATAATTCAAATGTGATAATGATAGGAACAGTTCCTGTGAATCTGAATCACAAGGTCAGTAAACAGACATCTTTGTCTCGGTTGCCTGTTGATAAAAGTGATGATTTCGAGTCGAGAAGAGATCAAAAGGGGATTTCTCCAAATGTACCCAGTTCTTACTATGATTTTCCACCTGTTTCTCCTTCCAAAAGAAGAAACCGAATCGATTTAGGATTTGAAAGAAGTTTGAAGAGTAATACAAGAACAACTCATGTGGATGAATCATCCTTGCCTAATCAATTTGGAAAGAAAAATGGATCGTATTTTCCTTATAACTGCTGGCCTGTGGATATCGATTCCAAAAGTTATCTATTTACTGACAATTTGCATCCCTTTGAACCATTTGATATATGTTCTTCAAAAAGAAGAGGTAAAGCAAAATCCGGAGGTCATTGGCAGGTTAAAGACAATGGGAAAGTTATGGAGCATGCTGTAGAAGCTAAAAATAATACAGTGTTGAGGCCTGGAATGGTTTTATTGAAGCACTACATTAGTCTACATAAACAGGTCAGTTTAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCCATGGGGGTTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTCAGATGATGTGCCTTGGATTGGATTGGGATCCTCAAGCAAGGAAATATGAACAAAACCGGGCTGTTGATGGTAATAAACCACCAAAAATACCTCCTGAATTCGCAGTTCTGGTTAAAGAAGCACTTAAATGTGCACATGCCTTGATCAAGAACAACGGCAATACAAATAACGTAGAAGACACACTTCCATCAATGTCTCCTGATATATGCATTGTGAATTTCTACTCGACAAGTGGAAGACTGGGTTTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTCGTTAACGGACTACCGGTCGTCTCGTTTTCTTTAGGCAATTCAGCAGAATTCTTGTATGGAGATCGAAGAGATGTCGATATTGCAGAGAAGATTGTACTGGAATCAGGCGATGTTCTAATATTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCAACGCCTAAGTTGTTGCTTGATCATACGGGTCTTCGACCTGGGCGTCTAAATCTTACCTTTAGAAAGTATTAGAACATAGATGTGTCTGTTTTTAGGTCTCGTTTGTCATGTACACATGAAATGGCTGTTGTATTTCTTCATCACATATTTATCAAAATGAAATATCGCAAGTCTAAAATGTAGGCCCCG

Coding sequence (CDS)

ATGTTGTTGATCCGTACAATTCCCGTCTCGCCCTCGCCGTCGTTGAATCTTCTTCATCGGCTTTTGCTCGCTCAGTCTCGATTTCAGCCCATGGATTCGTTTGCCAGTTCAGCAAATTCCCGCGAAATACCTGATCTGCCTTGTTGTGGTAGTTCTTGTGGCGCCAACTTGCATGGTAGAGATCATAATTCAAATGTGATAATGATAGGAACAGTTCCTGTGAATCTGAATCACAAGGTCAGTAAACAGACATCTTTGTCTCGGTTGCCTGTTGATAAAAGTGATGATTTCGAGTCGAGAAGAGATCAAAAGGGGATTTCTCCAAATGTACCCAGTTCTTACTATGATTTTCCACCTGTTTCTCCTTCCAAAAGAAGAAACCGAATCGATTTAGGATTTGAAAGAAGTTTGAAGAGTAATACAAGAACAACTCATGTGGATGAATCATCCTTGCCTAATCAATTTGGAAAGAAAAATGGATCGTATTTTCCTTATAACTGCTGGCCTGTGGATATCGATTCCAAAAGTTATCTATTTACTGACAATTTGCATCCCTTTGAACCATTTGATATATGTTCTTCAAAAAGAAGAGGTAAAGCAAAATCCGGAGGTCATTGGCAGGTTAAAGACAATGGGAAAGTTATGGAGCATGCTGTAGAAGCTAAAAATAATACAGTGTTGAGGCCTGGAATGGTTTTATTGAAGCACTACATTAGTCTACATAAACAGGTCAGTTTAGTGAAAACTTGTCAAAAGCTTGGTCTTGGCCCATGGGGGTTTTACCAGCCTGGTTATAAAGATGGTGCAAAACTCCGGCTTCAGATGATGTGCCTTGGATTGGATTGGGATCCTCAAGCAAGGAAATATGAACAAAACCGGGCTGTTGATGGTAATAAACCACCAAAAATACCTCCTGAATTCGCAGTTCTGGTTAAAGAAGCACTTAAATGTGCACATGCCTTGATCAAGAACAACGGCAATACAAATAACGTAGAAGACACACTTCCATCAATGTCTCCTGATATATGCATTGTGAATTTCTACTCGACAAGTGGAAGACTGGGTTTGCATCAGGATCGTGATGAAAGCAAAGAGAGTCTCGTTAACGGACTACCGGTCGTCTCGTTTTCTTTAGGCAATTCAGCAGAATTCTTGTATGGAGATCGAAGAGATGTCGATATTGCAGAGAAGATTGTACTGGAATCAGGCGATGTTCTAATATTTGGTGGAGAATCTAGGCATATATTTCATGGAGTATCTTCAATCATACCTAAATCAACGCCTAAGTTGTTGCTTGATCATACGGGTCTTCGACCTGGGCGTCTAAATCTTACCTTTAGAAAGTATTAG

Protein sequence

MLLIRTIPVSPSPSLNLLHRLLLAQSRFQPMDSFASSANSREIPDLPCCGSSCGANLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSYYDFPPVSPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY
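
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the correspondence can be checked in Python. The helper name `translate` and the short fragments used below are illustrative and not part of the database record.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Hypothetical helper: unknown codons (e.g. containing N) become 'X',
    and a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTTGTTGATCCGTACAATTCCC"))  # MLLIRTIP
# Last six codons of the CDS, including the TAG stop.
print(translate("ACCTTTAGAAAGTATTAG"))  # TFRKY
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, provided the record uses the standard code.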
Homology
BLAST of Sed0009033 vs. NCBI nr
Match: KAG6595567.1 (hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 609.4 bits (1570), Expect = 2.5e-170
Identity = 323/461 (70.07%), Positives = 358/461 (77.66%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN- 60
           MLLIRT+P S  P  NLL RLL A+SR   FQ MDSF SSA    +PD  C GSSCG N 
Sbjct: 1   MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRMDSFGSSA----LPDSSCYGSSCGGNE 60

Query: 61  --LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSY 120
             LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQKGI  N+PSSY
Sbjct: 61  ECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSY 120

Query: 121 YD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPV 180
           +D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K             
Sbjct: 121 HDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNE-PFSFKKHRSP--------- 180

Query: 181 DIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLR 240
           DI SK+ L T NL P E FDIC  +RRGK+K    WQ KD    KVMEHA EA N  V+R
Sbjct: 181 DIGSKNSLATANLPPIESFDICFPERRGKSKHRYSWQSKDRDTMKVMEHADEATNGIVMR 240

Query: 241 PGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK 300
           PGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Sbjct: 241 PGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRK 300

Query: 301 YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFY 360
           Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFY
Sbjct: 301 YARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFY 360

Query: 361 STSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIF 420
           STSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD+RDVD A KI+LESGDVLIF
Sbjct: 361 STSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIF 420

Query: 421 GGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           GGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Sbjct: 421 GGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY 447
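
The percentages in each HSP summary line are the identity and positive counts divided by the alignment length (323/461 and 358/461 for the hit above). A minimal Python sketch of parsing such a line follows. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the summary-line layout shown on this page.

```python
import re

# Matches the HSP summary line as printed on this page. "Posi?tives" also
# accepts the "Postives" spelling that appears in some exported BLAST output.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP summary line."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    identical, length, positives, _ = map(int, match.groups())
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 323/461 (70.07%), Positives = 358/461 (77.66%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 70.07 77.66
```

Recomputing the percentages from the raw counts is a cheap consistency check when the alignments are copied between tools.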

BLAST of Sed0009033 vs. NCBI nr
Match: XP_022924913.1 (uncharacterized protein LOC111432318 [Cucurbita moschata])

HSP 1 Score: 607.8 bits (1566), Expect = 7.4e-170
Identity = 322/461 (69.85%), Positives = 358/461 (77.66%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN- 60
           MLLIRT+P S  P  NLL RLL A+SR   FQ +DSF SSA    +PD  C GSSCG N 
Sbjct: 1   MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSA----LPDSSCYGSSCGGNE 60

Query: 61  --LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSY 120
             LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQKGI  N+PSSY
Sbjct: 61  ECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSY 120

Query: 121 YD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPV 180
           +D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K             
Sbjct: 121 HDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNE-PFSFKKHRSP--------- 180

Query: 181 DIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLR 240
           DI SK+ L T NL P E FDIC  +RRGK+K    WQ KD    KVMEHA EA N  V+R
Sbjct: 181 DIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMR 240

Query: 241 PGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK 300
           PGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Sbjct: 241 PGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRK 300

Query: 301 YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFY 360
           Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFY
Sbjct: 301 YARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFY 360

Query: 361 STSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIF 420
           STSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD+RDVD A KI+LESGDVLIF
Sbjct: 361 STSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIF 420

Query: 421 GGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           GGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Sbjct: 421 GGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY 447

BLAST of Sed0009033 vs. NCBI nr
Match: XP_023517205.1 (uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 606.7 bits (1563), Expect = 1.6e-169
Identity = 321/461 (69.63%), Positives = 358/461 (77.66%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN- 60
           M LIRT+P S  P  NLL RLL A+SR   FQ MDSF SSA    +PD  C GSSCG N 
Sbjct: 1   MSLIRTVPASLPPRSNLLRRLLFAESRLLQFQRMDSFGSSA----LPDSSCYGSSCGGNE 60

Query: 61  --LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSY 120
             LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQK I  N+PSSY
Sbjct: 61  ECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKRIPANIPSSY 120

Query: 121 YD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPV 180
           +D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K   +         
Sbjct: 121 HDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNE-PFSFNKHRSA--------- 180

Query: 181 DIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLR 240
           DI SK+ L T NL P E FDIC  +RRGK+K    WQ KD    KVMEHA EA N  V+R
Sbjct: 181 DIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMR 240

Query: 241 PGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK 300
           PGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Sbjct: 241 PGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRK 300

Query: 301 YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFY 360
           Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFY
Sbjct: 301 YARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFY 360

Query: 361 STSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIF 420
           STSGRLGLHQDRDES+ESLV+GLPVVSFSLGNSAEFLYGD+RDVD A KI+LESGDVLIF
Sbjct: 361 STSGRLGLHQDRDESRESLVSGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIF 420

Query: 421 GGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           GGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Sbjct: 421 GGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY 447

BLAST of Sed0009033 vs. NCBI nr
Match: XP_022966314.1 (uncharacterized protein LOC111466008 [Cucurbita maxima])

HSP 1 Score: 576.6 bits (1485), Expect = 1.8e-160
Identity = 308/461 (66.81%), Positives = 346/461 (75.05%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN- 60
           M LIRT+P S  P  NLL +LL A+SR   FQ MDSF SSA    +P+  C GSS G N 
Sbjct: 1   MSLIRTVPASLPPWSNLLRQLLFAESRLLQFQRMDSFGSSA----LPNSSCYGSSSGGNE 60

Query: 61  --LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSY 120
             LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQKGI  N+PS Y
Sbjct: 61  ECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSKY 120

Query: 121 YD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPV 180
           +D  FPPV    +KRR+RID G ER LK++T ++ +  +  P  F K   +         
Sbjct: 121 HDDEFPPVPRQNTKRRSRIDSGSERRLKNSTSSSQMKRNE-PFSFNKHRSA--------- 180

Query: 181 DIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLR 240
           DI SK+ L T NL P E FDIC  +RRGK+K    WQ KD    KVMEH  EA N  V+R
Sbjct: 181 DIGSKNSLATANLPPIESFDICFPERRGKSKPRCSWQYKDRDTMKVMEHVDEATNGIVMR 240

Query: 241 PGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK 300
           PGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Sbjct: 241 PGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRK 300

Query: 301 YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFY 360
           Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+ N +ED LP+MSPDICIVNFY
Sbjct: 301 YARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDMNKIEDILPTMSPDICIVNFY 360

Query: 361 STSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIF 420
           ST GRLGLHQDRDES+ESLV+GLPVVSFSLGNSA FLYGD R+VD A KI+LESGDVLIF
Sbjct: 361 STIGRLGLHQDRDESRESLVSGLPVVSFSLGNSAVFLYGDERNVDKAGKIILESGDVLIF 420

Query: 421 GGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           GGESRHIFHGVSSIIPKS PK LLDHTG RPG LNLTFRKY
Sbjct: 421 GGESRHIFHGVSSIIPKSMPKFLLDHTGFRPGCLNLTFRKY 447

BLAST of Sed0009033 vs. NCBI nr
Match: KAG7027547.1 (alkB, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 575.5 bits (1482), Expect = 4.1e-160
Identity = 295/416 (70.91%), Positives = 329/416 (79.09%), Query Frame = 0

Query: 43  IPDLPCCGSSCGAN---LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFES 102
           +PD  C GSSCG N   LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ 
Sbjct: 72  LPDSSCYGSSCGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKL 131

Query: 103 RRDQKGISPNVPSSYYD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQF 162
           R DQKGI  N+PSSY+D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F
Sbjct: 132 RSDQKGIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNE-PFSF 191

Query: 163 GKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--K 222
            K             DI SK+ + T NL P E FDIC  +RRGK+K    WQ KD    K
Sbjct: 192 KKHRSP---------DIGSKNSIATANLPPIESFDICFPERRGKSKPRYSWQSKDRNTMK 251

Query: 223 VMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRL 282
           VMEHA EA N  V+RPGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRL
Sbjct: 252 VMEHADEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRL 311

Query: 283 QMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVED 342
           QMMCLGLDWDPQ RKY + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED
Sbjct: 312 QMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIED 371

Query: 343 TLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVD 402
            LP+MSPDICIVNFYSTSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD+RDVD
Sbjct: 372 ILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVD 431

Query: 403 IAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
            A KI+LESGDVLIFGGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Sbjct: 432 KAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY 477

BLAST of Sed0009033 vs. ExPASy Swiss-Prot
Match: P0CAT7 (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides (strain ATCC 19089 / CB15) OX=190650 GN=alkB PE=3 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.3e-15
Identity = 66/197 (33.50%), Positives = 87/197 (44.16%), Query Frame = 0

Query: 257 PWGFYQPGYKDGAKLRLQMMCLG-LDWDPQAR--KYEQNRAVDGNKPPKIPPEFAVLVKE 316
           P+  Y+  Y  G  + + M  LG L W   AR  +Y       G   P +PP        
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRPWPDMPP-------- 112

Query: 317 ALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPV 376
           AL     ++ +           P   PD C+VN Y    R+GLHQDRDE+        PV
Sbjct: 113 ALLDLWTVLGD-----------PETPPDSCLVNLYRDGARMGLHQDRDEADPR----FPV 172

Query: 377 VSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLD 436
           +S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S       
Sbjct: 173 LSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGS------- 216

Query: 437 HTGLRP--GRLNLTFRK 449
            + L P  GR+NLT R+
Sbjct: 233 -SSLVPGGGRINLTLRR 216

BLAST of Sed0009033 vs. ExPASy Swiss-Prot
Match: B8GWW6 (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides (strain NA1000 / CB15N) OX=565050 GN=alkB PE=3 SV=2)

HSP 1 Score: 85.9 bits (211), Expect = 1.3e-15
Identity = 66/197 (33.50%), Positives = 87/197 (44.16%), Query Frame = 0

Query: 257 PWGFYQPGYKDGAKLRLQMMCLG-LDWDPQAR--KYEQNRAVDGNKPPKIPPEFAVLVKE 316
           P+  Y+  Y  G  + + M  LG L W   AR  +Y       G   P +PP        
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRPWPDMPP-------- 112

Query: 317 ALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPV 376
           AL     ++ +           P   PD C+VN Y    R+GLHQDRDE+        PV
Sbjct: 113 ALLDLWTVLGD-----------PETPPDSCLVNLYRDGARMGLHQDRDEADPR----FPV 172

Query: 377 VSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLD 436
           +S SLG++A F  G     D    + L SGDV    G +R  FHGV  I+P S       
Sbjct: 173 LSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGS------- 216

Query: 437 HTGLRP--GRLNLTFRK 449
            + L P  GR+NLT R+
Sbjct: 233 -SSLVPGGGRINLTLRR 216

BLAST of Sed0009033 vs. ExPASy Swiss-Prot
Match: P05050 (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) OX=83333 GN=alkB PE=1 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 1.4e-14
Identity = 53/152 (34.87%), Positives = 75/152 (49.34%), Query Frame = 0

Query: 298 NKP-PKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGL 357
           NKP P +P  F  L + A   A                 P   PD C++N Y+   +L L
Sbjct: 86  NKPWPAMPQSFHNLCQRAATAA---------------GYPDFQPDACLINRYAPGAKLSL 145

Query: 358 HQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIF 417
           HQD+DE         P+VS SLG  A F +G  +  D  ++++LE GDV+++GGESR  +
Sbjct: 146 HQDKDEPDLR----APIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFY 205

Query: 418 HGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRK 449
           HG+  +     P L +D       R NLTFR+
Sbjct: 206 HGIQPLKAGFHP-LTID------CRYNLTFRQ 211

BLAST of Sed0009033 vs. ExPASy Swiss-Prot
Match: P37462 (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) OX=99287 GN=alkB PE=3 SV=2)

HSP 1 Score: 82.4 bits (202), Expect = 1.4e-14
Identity = 45/112 (40.18%), Positives = 63/112 (56.25%), Query Frame = 0

Query: 337 SMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAE 396
           S  PD C++N Y+   +L LHQD+DE         P+VS SLG  A F +G  R  D  +
Sbjct: 111 SFQPDACLINRYAPGAKLSLHQDKDEPDLR----APIVSVSLGVPAVFQFGGLRRSDPIQ 170

Query: 397 KIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRK 449
           +I+LE GD++++GGESR  +HG+  +     P      TG    R NLTFR+
Sbjct: 171 RILLEHGDIVVWGGESRLFYHGIQPLKAGFHPM-----TG--EFRYNLTFRQ 211

BLAST of Sed0009033 vs. ExPASy Swiss-Prot
Match: Q54N08 (Alpha-ketoglutarate-dependent dioxygenase alkB OS=Dictyostelium discoideum OX=44689 GN=alkB PE=2 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.2e-08
Identity = 34/90 (37.78%), Positives = 51/90 (56.67%), Query Frame = 0

Query: 345 VNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYG-DRRDVDIAEKIVLESG 404
           VNFYS    +G H D  E +       P++S S G++A FL G + RD+     + + SG
Sbjct: 266 VNFYSEDSIMGGHLDDAEQEME----KPIISISFGSTAVFLMGAETRDI-APVPLFIRSG 325

Query: 405 DVLIFGGESRHIFHGVSSIIPKSTPKLLLD 434
           D++I GG SR+ +HGV+ I+  S    L+D
Sbjct: 326 DIVIMGGRSRYCYHGVAKIVENSFDLGLID 350

BLAST of Sed0009033 vs. ExPASy TrEMBL
Match: A0A6J1EDT3 (uncharacterized protein LOC111432318 OS=Cucurbita moschata OX=3662 GN=LOC111432318 PE=4 SV=1)

HSP 1 Score: 607.8 bits (1566), Expect = 3.6e-170
Identity = 322/461 (69.85%), Positives = 358/461 (77.66%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN- 60
           MLLIRT+P S  P  NLL RLL A+SR   FQ +DSF SSA    +PD  C GSSCG N 
Sbjct: 1   MLLIRTVPASLPPWSNLLRRLLFAESRLLQFQRVDSFGSSA----LPDSSCYGSSCGGNE 60

Query: 61  --LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSY 120
             LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQKGI  N+PSSY
Sbjct: 61  ECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSSY 120

Query: 121 YD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPV 180
           +D  FPPV    +KRR+RIDLG ER LK++T ++ ++ +  P  F K             
Sbjct: 121 HDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNE-PFSFKKHRSP--------- 180

Query: 181 DIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLR 240
           DI SK+ L T NL P E FDIC  +RRGK+K    WQ KD    KVMEHA EA N  V+R
Sbjct: 181 DIGSKNSLATANLPPIESFDICFPERRGKSKPRYSWQSKDRDTMKVMEHADEATNGIVMR 240

Query: 241 PGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK 300
           PGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Sbjct: 241 PGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRK 300

Query: 301 YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFY 360
           Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+TNN+ED LP+MSPDICIVNFY
Sbjct: 301 YARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDTNNIEDILPTMSPDICIVNFY 360

Query: 361 STSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIF 420
           STSGRLGLHQDRDES+ESLV GLPVVSFSLGNSAEFLYGD+RDVD A KI+LESGDVLIF
Sbjct: 361 STSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLYGDQRDVDKAGKIILESGDVLIF 420

Query: 421 GGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           GGESRHIFHGVSSIIPKSTPK LLDHTGLRPGRLNLTFRKY
Sbjct: 421 GGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTFRKY 447

BLAST of Sed0009033 vs. ExPASy TrEMBL
Match: A0A6J1HTF0 (uncharacterized protein LOC111466008 OS=Cucurbita maxima OX=3661 GN=LOC111466008 PE=4 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 8.8e-161
Identity = 308/461 (66.81%), Positives = 346/461 (75.05%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR---FQPMDSFASSANSREIPDLPCCGSSCGAN- 60
           M LIRT+P S  P  NLL +LL A+SR   FQ MDSF SSA    +P+  C GSS G N 
Sbjct: 1   MSLIRTVPASLPPWSNLLRQLLFAESRLLQFQRMDSFGSSA----LPNSSCYGSSSGGNE 60

Query: 61  --LHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPNVPSSY 120
             LH RDHNSNVIMIG +PVNLN K ++Q SLSRL V K DDF+ R DQKGI  N+PS Y
Sbjct: 61  ECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQKGIPANIPSKY 120

Query: 121 YD--FPPV--SPSKRRNRIDLGFERSLKSNTRTTHVDESSLPNQFGKKNGSYFPYNCWPV 180
           +D  FPPV    +KRR+RID G ER LK++T ++ +  +  P  F K   +         
Sbjct: 121 HDDEFPPVPRQNTKRRSRIDSGSERRLKNSTSSSQMKRNE-PFSFNKHRSA--------- 180

Query: 181 DIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVKDNG--KVMEHAVEAKNNTVLR 240
           DI SK+ L T NL P E FDIC  +RRGK+K    WQ KD    KVMEH  EA N  V+R
Sbjct: 181 DIGSKNSLATANLPPIESFDICFPERRGKSKPRCSWQYKDRDTMKVMEHVDEATNGIVMR 240

Query: 241 PGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARK 300
           PGMVLLKHYI LH+QV++VKT QKLGLGP GFYQPGYKDGAKLRLQMMCLGLDWDPQ RK
Sbjct: 241 PGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYKDGAKLRLQMMCLGLDWDPQTRK 300

Query: 301 YEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFY 360
           Y + R  DGNKPP +PPEFA+LV +AL  AHALIKNNG+ N +ED LP+MSPDICIVNFY
Sbjct: 301 YARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNGDMNKIEDILPTMSPDICIVNFY 360

Query: 361 STSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIF 420
           ST GRLGLHQDRDES+ESLV+GLPVVSFSLGNSA FLYGD R+VD A KI+LESGDVLIF
Sbjct: 361 STIGRLGLHQDRDESRESLVSGLPVVSFSLGNSAVFLYGDERNVDKAGKIILESGDVLIF 420

Query: 421 GGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           GGESRHIFHGVSSIIPKS PK LLDHTG RPG LNLTFRKY
Sbjct: 421 GGESRHIFHGVSSIIPKSMPKFLLDHTGFRPGCLNLTFRKY 447

BLAST of Sed0009033 vs. ExPASy TrEMBL
Match: A0A6J1CQI1 (uncharacterized protein LOC111013827 OS=Momordica charantia OX=3673 GN=LOC111013827 PE=4 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 8.3e-151
Identity = 305/585 (52.14%), Positives = 356/585 (60.85%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQSR-----------FQPMDSFASSANSREIPDLPCC 60
           M +IRT+P+SP    N LHRLL A SR           F+ MDS  +SA S         
Sbjct: 1   MSIIRTVPISPLS--NQLHRLLFASSRFPGGRSSRLLQFRRMDSIVASAISH-------- 60

Query: 61  GSSCGANLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQKGISPN 120
             +   N H R H+S+++M+G +PV LN K  ++ S S   V+KSDDFE  R++K    N
Sbjct: 61  NGALTENSHNRGHSSDMVMVGEIPVYLNRKRYEEESSSPWSVNKSDDFELGRERKRTPAN 120

Query: 121 VPSSYYD----------------------------------------------------- 180
           VP+SY+D                                                     
Sbjct: 121 VPNSYHDDKFQPVSRQNNKTRSRMDLGLERSINTSSFQVEGSPLLNNNSQQNESSLPKQF 180

Query: 181 ----------------------------------------------------------FP 240
                                                                     F 
Sbjct: 181 GKKNEPFCIQKCQYMDIDYKNSLVTDNLHPFEQFDIRPYERRSNAKPGAHWQVKGRDEFQ 240

Query: 241 PVS--PSKRRNRIDLGFERSLKSNT----------RTTHVDESSLPNQFGKKNGSYFPYN 300
           PVS   +KRRNR+DLGF+RS  +++            + +DESS PNQFGKKN  ++   
Sbjct: 241 PVSRQNTKRRNRVDLGFQRSNNTSSFQVEGFSLLNNNSQLDESSQPNQFGKKNEPFYVQK 300

Query: 301 CWPVDIDSKSYLFTDNLHPFEPFDICSSKRRGKAKSGGHWQVK--DNGKVMEHAVEAKNN 360
           C  +DI SK+ L  DNLHPFEPFDIC  +RRG AK G HWQ K  D  KVMEH  EA N 
Sbjct: 301 CQSMDIGSKNSLVMDNLHPFEPFDICPHERRGNAKPGAHWQFKGRDTVKVMEHVAEASNY 360

Query: 361 TVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDP 420
            VLRPGMVLLK+YI+LH+QV++VKTCQ+LG+GP GFY+PGYKDGAKLRLQMMCLGLDWDP
Sbjct: 361 RVLRPGMVLLKNYITLHEQVNIVKTCQELGVGPGGFYRPGYKDGAKLRLQMMCLGLDWDP 420

Query: 421 QARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICI 450
           Q RKY   RAVDG+KPP+IPP+FA+LV EALK AHALIKN  NT NVE  LPSMSPDICI
Sbjct: 421 QTRKYGDKRAVDGDKPPEIPPKFAILVTEALKDAHALIKNKCNTGNVESILPSMSPDICI 480

BLAST of Sed0009033 vs. ExPASy TrEMBL
Match: A0A1S4E4K6 (uncharacterized protein LOC103502183 OS=Cucumis melo OX=3656 GN=LOC103502183 PE=4 SV=1)

HSP 1 Score: 526.2 bits (1354), Expect = 1.4e-145
Identity = 292/481 (60.71%), Positives = 338/481 (70.27%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQS-----------RFQPMDSFASSANSREIPDLPCC 60
           M  IRT+P+ PSPS N L RLL   S           +FQ MDSF+SSANS   PD  C 
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCR 60

Query: 61  GSSCGA-----NLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQK 120
           G+SCG      +L  RD+ S+VI +G+  V+LN K  +  SL+ L   K D  E   D+ 
Sbjct: 61  GNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKF 120

Query: 121 GISPNVPSSYY--DFPPVS-PSKRRNRIDLGFERSLKSNTRTTHVD------------ES 180
           GIS N P SY+  +F PVS  + RRNRIDLG +R LKSN R+  V+            ES
Sbjct: 121 GISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYES 180

Query: 181 SLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFE-PFDICSSKRRGKAKSGGHWQV 240
           SLP  FGKKN  +F      +DI SK  + TD+  PFE PFDIC     G  K    W+V
Sbjct: 181 SLPIHFGKKNEVFFSKR-QSLDIGSKESVVTDHSLPFEPPFDIC-FPGGGNVKHRNFWRV 240

Query: 241 KDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDG 300
           KD+G V       K+  +LRPGMVLLKHYI+  +Q+++VKTCQKLGLGP GFYQP YKDG
Sbjct: 241 KDSGTV-------KDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDG 300

Query: 301 AKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNT 360
           AKLRL+MMCLGLDWDPQ R+Y+  R VDGNKPP IPP F+ LVK ALK AHA IKN  N 
Sbjct: 301 AKLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNI 360

Query: 361 NNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD 420
           +NVED LPSMSPDICI NFY+TSGRLGLHQDRDESKESL +GLPVVSFS+GN+AEFLYGD
Sbjct: 361 SNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGD 420

Query: 421 RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRK 450
           +RDV+ AEK+ LESGDVLIFGGESRH+FHGVSSIIPKSTPK LL HTGLRPGRLNLTFRK
Sbjct: 421 KRDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRK 472

BLAST of Sed0009033 vs. ExPASy TrEMBL
Match: A0A0A0KY56 (Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G329550 PE=4 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 4.7e-138
Identity = 278/481 (57.80%), Positives = 322/481 (66.94%), Query Frame = 0

Query: 1   MLLIRTIPVSPSPSLNLLHRLLLAQS-----------RFQPMDSFASSANSREIPDLPCC 60
           M  IRT+P+ PSPS N L RLL   S           +FQPMDSF++SANS  +PD  CC
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60

Query: 61  GSSCGA-----NLHGRDHNSNVIMIGTVPVNLNHKVSKQTSLSRLPVDKSDDFESRRDQK 120
           GSSCG      +LH RD++S+VI +G++PV+LN K  +                      
Sbjct: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKERE---------------------- 120

Query: 121 GISPNVPSSY-YD--FPPVSPSKRRNRIDLGFERSLKSNTRTTHVD------------ES 180
                 P SY YD   P    + RR+RIDLG +R LKSN R+  V+            +S
Sbjct: 121 ------PKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKS 180

Query: 181 SLPNQFGKKNGSYFPYNCWPVDIDSKSYLFTDNLHPFE-PFDICSSKRRGKAKSGGHWQV 240
           SLP  FGKKN   F      +D   K  + TDN  PFE PFDIC     G  K    + V
Sbjct: 181 SLPIHFGKKN-EVFVSKLQSLDTGPKESVVTDNSLPFEPPFDIC-LPGGGNVKHRNIYVV 240

Query: 241 KDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDG 300
           K+ G V       K+  +LRPGMVLLKHYI+  +Q+++VKTCQ LG+GP GFYQPGYKDG
Sbjct: 241 KEGGTV-------KDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDG 300

Query: 301 AKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNT 360
           AKLRL+MMCLGLDWDPQ R+YE  R VDGNKPP IPP+F  LVK ALK AHA IKNN N 
Sbjct: 301 AKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNI 360

Query: 361 NNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGD 420
           +NVE+ LPSMSPDICI NFY+T GRLGLHQDRDESKESL  GLPVVSFS+GN+AEFLYGD
Sbjct: 361 SNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGD 420

Query: 421 RRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRK 450
           +R+VD AE + LESGDVLIFGGESRHIFHGVSSIIPKSTPK LL HTGLRPGRLNLTFRK
Sbjct: 421 KRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 444

BLAST of Sed0009033 vs. TAIR 10
Match: AT5G01780.1 (2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 308.1 bits (788), Expect = 1.1e-83
Identity = 156/263 (59.32%), Positives = 193/263 (73.38%), Query Frame = 0

Query: 188 PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKN-NTVLRPGMVLLKHYISLHKQVSL 247
           PFDICSS       S   W + D  +     VE  N + V+RPGMVLLK +++   QV +
Sbjct: 129 PFDICSSVLERNDTSIKDWILAD--ETNRETVEVSNKHKVIRPGMVLLKDFLTPDIQVDI 188

Query: 248 VKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPE 307
           VKTC++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQ  KY +N  +D +K P+IP  
Sbjct: 189 VKTCRELGVKPTGFYQPGYSVGSKLHLQMMCLGRNWDPQT-KYRKNTDID-SKAPEIPVT 248

Query: 308 FAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKES 367
           F VLV++A++ AHALI     T + E  LP MSPDICIVNFYS +GRLGLHQDRDES+ES
Sbjct: 249 FNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVNFYSETGRLGLHQDRDESEES 308

Query: 368 LVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKS 427
           +  GLP+VSFS+G+SAEFLYG++RDV+ A+ ++LESGDVLIFGGESR IFHGV SIIP S
Sbjct: 309 IARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVLIFGGESRMIFHGVKSIIPNS 368

Query: 428 TPKLLLDHTGLRPGRLNLTFRKY 450
            P  LL+ + LR GRLNLTFR +
Sbjct: 369 APMSLLNESKLRTGRLNLTFRHF 387
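
The percentages in the HSP header above follow from the residue counts: 156 identical and 193 positive positions over 263 alignment columns. A minimal Python sketch of that arithmetic (the helper name `percent` is illustrative and not part of any BLAST tool):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the AT5G01780.1 HSP header
identity = percent(156, 263)
positives = percent(193, 263)
print(identity, positives)
```

The same function applies to the other HSPs on this page, for example 137/232 for AT3G14160.1.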

BLAST of Sed0009033 vs. TAIR 10
Match: AT5G01780.2 (2-oxoglutarate-dependent dioxygenase family protein )

HSP 1 Score: 308.1 bits (788), Expect = 1.1e-83
Identity = 156/263 (59.32%), Positives = 193/263 (73.38%), Query Frame = 0

Query: 188 PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKN-NTVLRPGMVLLKHYISLHKQVSL 247
           PFDICSS       S   W + D  +     VE  N + V+RPGMVLLK +++   QV +
Sbjct: 184 PFDICSSVLERNDTSIKDWILAD--ETNRETVEVSNKHKVIRPGMVLLKDFLTPDIQVDI 243

Query: 248 VKTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPE 307
           VKTC++LG+ P GFYQPGY  G+KL LQMMCLG +WDPQ  KY +N  +D +K P+IP  
Sbjct: 244 VKTCRELGVKPTGFYQPGYSVGSKLHLQMMCLGRNWDPQT-KYRKNTDID-SKAPEIPVT 303

Query: 308 FAVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQDRDESKES 367
           F VLV++A++ AHALI     T + E  LP MSPDICIVNFYS +GRLGLHQDRDES+ES
Sbjct: 304 FNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVNFYSETGRLGLHQDRDESEES 363

Query: 368 LVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVLIFGGESRHIFHGVSSIIPKS 427
           +  GLP+VSFS+G+SAEFLYG++RDV+ A+ ++LESGDVLIFGGESR IFHGV SIIP S
Sbjct: 364 IARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVLIFGGESRMIFHGVKSIIPNS 423

Query: 428 TPKLLLDHTGLRPGRLNLTFRKY 450
            P  LL+ + LR GRLNLTFR +
Sbjct: 424 APMSLLNESKLRTGRLNLTFRHF 442

BLAST of Sed0009033 vs. TAIR 10
Match: AT3G14160.1 (2-oxoglutarate-dependent dioxygenase family protein )

HSP 1 Score: 295.4 bits (755), Expect = 7.6e-80
Identity = 137/232 (59.05%), Positives = 175/232 (75.43%), Query Frame = 0

Query: 218 AVEAKNNTVLRPGMVLLKHYISLHKQVSLVKTCQKLGLGPWGFYQPGYKDGAKLRLQMMC 277
           A +  + TV+RPGMVLLK+Y+S++ QV +V  C++LGLG  GFYQPGY+D AKL L+MMC
Sbjct: 224 AAKGYSGTVIRPGMVLLKNYLSINDQVMIVNKCRRLGLGEGGFYQPGYRDEAKLHLKMMC 283

Query: 278 LGLDWDPQARKYEQNRAVDGNKPPKIPPEFAVLVKEALKCAHALIKNNGNTNNVEDTLPS 337
           LG +WDP+  +Y + R  DG+  P+IP EF   V++A+K + +L  +N       D +P 
Sbjct: 284 LGKNWDPETSRYGETRPFDGSTAPRIPAEFNQFVEKAVKESQSLAASNSKQTKGGDEIPF 343

Query: 338 MSPDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEK 397
           M PDICIVNFYS++GRLGLHQD+DES+ S+  GLPVVSFS+G+SAEFLYGD+RD D AE 
Sbjct: 344 MLPDICIVNFYSSTGRLGLHQDKDESENSIRKGLPVVSFSIGDSAEFLYGDQRDEDKAET 403

Query: 398 IVLESGDVLIFGGESRHIFHGVSSIIPKSTPKLLLDHTGLRPGRLNLTFRKY 450
           + LESGDVL+FGG SR +FHGV SI   + PK LL  T LRPGRLNLTFR+Y
Sbjct: 404 LTLESGDVLLFGGRSRKVFHGVRSIRKDTAPKALLQETSLRPGRLNLTFRQY 455

BLAST of Sed0009033 vs. TAIR 10
Match: AT3G14140.1 (2-oxoglutarate-dependent dioxygenase family protein )

HSP 1 Score: 250.4 bits (638), Expect = 2.8e-66
Identity = 124/264 (46.97%), Positives = 175/264 (66.29%), Query Frame = 0

Query: 188 PFDICSSKRRGKAKSGGHWQVKDNGKVMEHAVEAKNNTVLRPGMVLLKHYISLHKQVSLV 247
           PFDI   K+  + K        +  +  + A +  +  V+RPGMVLLK+Y+S++ QV +V
Sbjct: 203 PFDIFLKKKVMRLKP----SFLELNREKKKAAKGFSGIVIRPGMVLLKNYLSINNQVMIV 262

Query: 248 KTCQKLGLGPWGFYQPGYKDGAKLRLQMMCLGLDWDPQARKYEQNRAVDGNKPPKIPPEF 307
             C++LGLG  GFYQPG++DG  L L+MMCLG +WD Q R+Y + R +DG+ PP+IP EF
Sbjct: 263 NKCRQLGLGEGGFYQPGFQDGGLLHLKMMCLGKNWDCQTRRYGEIRPIDGSVPPRIPVEF 322

Query: 308 AVLVKEALKCAHALIKNNGNTNNVEDTLPSMSPDICIVNFYSTSGRLGLHQ--------- 367
           + LV++A+K + +L+  N N     D +P + PDIC+VNFY+++G+LGLHQ         
Sbjct: 323 SQLVEKAIKESKSLVATNSNETKGGDEIPLLLPDICVVNFYTSTGKLGLHQVSVYDKTSF 382

Query: 368 ------------DRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIVLESGDVL 427
                       D+ ESK+SL  GLP+VSFS+G+SAEFLYGD++DVD A+ ++LESGDVL
Sbjct: 383 DFLKYKGGYLNTDKGESKKSLRKGLPIVSFSIGDSAEFLYGDQKDVDKADTLILESGDVL 442

Query: 428 IFGGESRHIFHGVSSIIPKSTPKL 431
           IFG  SR++FHGV SI     P+L
Sbjct: 443 IFGERSRNVFHGVRSIRKILPPRL 462

BLAST of Sed0009033 vs. TAIR 10
Match: AT1G11780.1 (oxidoreductase, 2OG-Fe(II) oxygenase family protein )

HSP 1 Score: 60.5 bits (145), Expect = 4.1e-09
Identity = 33/83 (39.76%), Positives = 44/83 (53.01%), Query Frame = 0

Query: 340 PDICIVNFYSTSGRLGLHQDRDESKESLVNGLPVVSFSLGNSAEFLYGDRRDVDIAEKIV 399
           P+  IVN++     LG H D  E+  S     P+VS SLG  A FL G +   D    + 
Sbjct: 226 PEGAIVNYFGIGDTLGGHLDDMEADWS----KPIVSMSLGCKAIFLLGGKSKDDPPHAMY 285

Query: 400 LESGDVLIFGGESRHIFHGVSSI 423
           L SGDV++  GE+R  FHG+  I
Sbjct: 286 LRSGDVVLMAGEARECFHGIPRI 304

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KAG6595567.1    2.5e-170  70.07     hypothetical protein SDJN03_12120, partial [Cucurbita argyrosperma subsp. sorori...
XP_022924913.1  7.4e-170  69.85     uncharacterized protein LOC111432318 [Cucurbita moschata]
XP_023517205.1  1.6e-169  69.63     uncharacterized protein LOC111781040 [Cucurbita pepo subsp. pepo]
XP_022966314.1  1.8e-160  66.81     uncharacterized protein LOC111466008 [Cucurbita maxima]
KAG7027547.1    4.1e-160  70.91     alkB, partial [Cucurbita argyrosperma subsp. argyrosperma]
Match Name      E-value   Identity  Description
P0CAT7          1.3e-15   33.50     Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides...
B8GWW6          1.3e-15   33.50     Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides...
P05050          1.4e-14   34.87     Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) ...
P37462          1.4e-14   40.18     Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain...
Q54N08          1.2e-08   37.78     Alpha-ketoglutarate-dependent dioxygenase alkB OS=Dictyostelium discoideum OX=44...
Match Name      E-value   Identity  Description
A0A6J1EDT3      3.6e-170  69.85     uncharacterized protein LOC111432318 OS=Cucurbita moschata OX=3662 GN=LOC1114323...
A0A6J1HTF0      8.8e-161  66.81     uncharacterized protein LOC111466008 OS=Cucurbita maxima OX=3661 GN=LOC111466008...
A0A6J1CQI1      8.3e-151  52.14     uncharacterized protein LOC111013827 OS=Momordica charantia OX=3673 GN=LOC111013...
A0A1S4E4K6      1.4e-145  60.71     uncharacterized protein LOC103502183 OS=Cucumis melo OX=3656 GN=LOC103502183 PE=...
A0A0A0KY56      4.7e-138  57.80     Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G...
Match Name      E-value   Identity  Description
AT5G01780.1     1.1e-83   59.32     2-oxoglutarate-dependent dioxygenase family protein
AT5G01780.2     1.1e-83   59.32     2-oxoglutarate-dependent dioxygenase family protein
AT3G14160.1     7.6e-80   59.05     2-oxoglutarate-dependent dioxygenase family protein
AT3G14140.1     2.8e-66   46.97     2-oxoglutarate-dependent dioxygenase family protein
AT1G11780.1     4.1e-09   39.76     oxidoreductase, 2OG-Fe(II) oxygenase family protein
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                                  Source       Source Term    Source Description                         Alignment
IPR037151  Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily  GENE3D       2.60.120.590   -                                          coord: 157..382; e-value: 2.2E-52; score: 179.7
IPR027450  Alpha-ketoglutarate-dependent dioxygenase AlkB-like              PFAM         PF13532        2OG-FeII_Oxy_2                             coord: 163..380; e-value: 3.3E-43; score: 148.1
None       No IPR available                                                 MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 26..62
None       No IPR available                                                 PANTHER      PTHR16557:SF9  2OG-FE(II) OXYGENASE FAMILY PROTEIN        coord: 18..382
None       No IPR available                                                 SUPERFAMILY  51197          Clavaminate synthase-like                  coord: 160..381
IPR004574  Alkylated DNA repair protein AlkB                                PANTHER      PTHR16557      ALKYLATED DNA REPAIR PROTEIN ALKB-RELATED  coord: 18..382
IPR005123  Oxoglutarate/iron-dependent dioxygenase                          PROSITE      PS51471        FE2OG_OXY                                  coord: 272..382; score: 9.233233
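
The `coord:` values in the InterPro table give the start and end of each hit on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page), the length covered by each hit can be computed as follows; `span_length` and the dictionary labels are hypothetical names chosen for illustration:

```python
def span_length(coord: str) -> int:
    """Length of a 'start..end' range, assuming 1-based inclusive coordinates."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1

# Coordinates copied from three of the InterPro hits listed above
hits = {
    "GENE3D 2.60.120.590": "157..382",
    "PFAM PF13532": "163..380",
    "PROSITE PS51471": "272..382",
}
lengths = {name: span_length(coord) for name, coord in hits.items()}
print(lengths)
```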

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Sed0009033.1   Sed0009033.1   mRNA
Sed0009033.2   Sed0009033.2   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006974 cellular response to DNA damage stimulus
biological_process GO:0006259 DNA metabolic process
biological_process GO:0006281 DNA repair
biological_process GO:0043412 macromolecule modification
biological_process GO:0070989 oxidative demethylation
biological_process GO:0035513 oxidative RNA demethylation
biological_process GO:0035552 oxidative single-stranded DNA demethylation
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005622 intracellular anatomical structure
cellular_component GO:0005634 nucleus
molecular_function GO:0016706 2-oxoglutarate-dependent dioxygenase activity
molecular_function GO:0140640 catalytic activity, acting on a nucleic acid
molecular_function GO:0032451 demethylase activity
molecular_function GO:0008198 ferrous iron binding
molecular_function GO:0035516 oxidative DNA demethylase activity
molecular_function GO:0035515 oxidative RNA demethylase activity
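
Each row of the GO table has a category, an accession, and a free-text term name, in that order. A minimal Python sketch of grouping such rows by category, using a few rows copied from the table; the function name `group_by_category` and the split-on-first-two-spaces parsing are assumptions made for illustration:

```python
from collections import defaultdict

# A few rows copied from the GO table above
rows = [
    "biological_process GO:0006281 DNA repair",
    "biological_process GO:0035513 oxidative RNA demethylation",
    "cellular_component GO:0005634 nucleus",
    "molecular_function GO:0008198 ferrous iron binding",
]

def group_by_category(lines):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in lines:
        # The term name may contain spaces, so split only on the first two
        category, accession, name = line.split(" ", 2)
        grouped[category].append((accession, name))
    return dict(grouped)

terms = group_by_category(rows)
print(terms["biological_process"])
```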