Lsi07G006230 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi07G006230
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Alpha-ketoglutarate-dependent dioxygenase AlkB
Location: chr07 : 6673107 .. 6676662 (-)
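The location value can be split into chromosome, start, end and strand programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Pattern for a location value such as "chr07 : 6673107 .. 6676662 (-)".
# Whitespace around the separators is treated as optional.
LOCATION = re.compile(r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location value."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError("unrecognised location: %r" % text)
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location("chr07 : 6673107 .. 6676662 (-)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = end - start + 1
print(chrom, start, end, strand, span_length)
```

Under that assumption the gene spans 3556 bases on the minus strand of chr07.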
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CGCAATGAAACGGCCTAAAACCTAGGCAAAGTCTTACTATTGACGCCACACGGACTATTCACATCCCTTTCAAAAAGGGTATAAAGTTTAAACCAACAAAAAGGATTAAAACCCAGCCCTCGCTGCATCAAAACCTCTCTTAATCCCTCCATCTTCTCTCGATCGCTCCTCCGATTCAAGGTTTGTTCGTCTTTCAACCCGTCTTGTTCTCTTCAAGAGTTTCTTGCCTTACGATTGTTGCAATCACGATTTTATGTTTTTTTTTTCCTGCTACAATGTAATTAGCAATTAGCAATTTCGTTTATAATCTCACTCTGTTACTCGATTTATCTCTTTTCTTTTTATTTGAAGTTCCTGATTAAACTGTCTGTTTCTGACTGTCATTCATGTTAAGAATCTAATTGTTCTCGTCGATGTTACAGATTTGTCACCGAAATTTATCGAACGCCTTACAAACAATTCAGCCCCTTTCGGCATTCAATTCCCCCCCGATCAATCTTTTCTGTTTCCAGTTCCCTCCTCAAGATCATGAATGACGGTGGCCCTAGATATGCTGGAAGAGGCCATCCGAATAACAGAGGACGATCTCCTCGCTCTGCCGATAATTTCATTTATCGTCCCCGTCATGTAATGTAACTTTCTTCAAGCTTCGCAATGTTTACAAATTTTACGCATATTGGCTATAGTCTTTAGTTTTTTCATTCTTCGTTTATTAATATGTGTTTATTCGTGTCTATAAAAAGATGGAGTTAACAGTTCACTGAGTCCGTTTCAGTTATTTATTTATTTATTTTTCTCTTCATGAAGAAGAATTGATGATATTCCTGACACAATTGGAGATTTCATGGATCTTGGGAATGACACGAGGACTTAACTTGTTTATGGAATTCTGAAACTTGTAGATGTTTAATTTTGTTTGCAAGTTTATGAAAATACAATGTATGATAGGAATAGGTTGTATCAGTTAGTAAGGGAGTCGAATTGAAGGTGCTGGAGTTTATATTTGTACTTTCATGCTTACTTCTAATCTTTTCACATTCTAACAAAAAGTGCAGGCTGGAGGTGAGGGGCCCTCTGTTGCTGGTTCTTCTAACACTGGTAGTGGAGCTTTCCGAGCGAGAAGTTCTCACCAGATGTCATCAAGGAATTTCAGGAAGCCTGTTGGACAGAAACAGGCCTCGAGTTCTGAACAGTGGCAGTGGCGGCCTTTAAATAGTGGGAAAGATGCATCTCCTTGTGCTGTTGATCTTCAGCTGGAACATAATTCAACTGATGATATGAGCAATACCTCTAAAAAATTATTGGGATCTATTGCATCAAATTCTGATAGCAATCAAATGAGCAATACCTCTGAGCAACTATTGGGATCTATTGCTTCAAATTCTGATAGCAATCAAATGAGCAATACCTCTGAACAACTATTGGGATCTGTTGCATCAAATTCTGATTGCATCGAACTCCCGCCCTCTTCTGCTGAAAATGTCTTTAAGAGTTTGCATTCTGCTGTAGAAAGAATTCAGATTCGGGAACCTACAGCGGTAGGGGGAAGTTGTAGTGATTCATTTCCTTATGATAATTGCAACAGATCAGATGCTGTTGGACAGGAACTCAAGGTTCAACTTACCTTGGAATCCTGTGTGAAAGATGAGAGTTCCACCATAAAACTAAGGGAAAGTAATAATGTTTCTGACTCTACGGACTCAAAAGACAAGAAGCCTTCAGTGAACCTCGATCCTTTTGATATATGCCCTCTAAAGTCTGGAGTTGTCATGCTGAATCCTTCTTTATTGGCTAAGAACAGAGAAAAGAGGAATGAGATGAAGCGTGCAATGGAGGGAAATAGTGGAATTGTGTTAAGATCAGGAATGGTTCATTTGAAGAGTGGCATTTCCCTTAGGGATCAGGTATACATTGATTTTTCTCTCTCACCTGCATAAATGATGCATATCAAAATTTTGTTATTGTAGAAACAGCCTCAGACATAACTGAATCAGTTCTTG
AGTTAAAGATCTTTCCATGTATCTCTAGGAGGGAAAGCTCGCTCTCCAAGGATAGGAAAGCGCTGTATCATTTTATTTTTCTTGAACCAAGCTTTCATTGAGAAAAACGGAATATGGAAAAGAAAAATAAAAGAAAAAGCCTCACAAAACGAGTCCAAATGCTAAGCACAAATTGACTAAGATCTAGATGATAGTTACAAAAATCCTGAGGAAAGATGATTTATTATCAAGCAAGAAATATATCTACAGGATGAGTTGTAGCTTCCACTATGGTCTGTGTTTTCCTTTCATTCTAGGCTCACCATATATGTGTTGGATGGTTTTCTTTATGTTTGATTTGAACCACACTATTATTTTTAGCTTTGTTGAGGGGGTTTTCAAAACTCTGAATCTTCCTTGCTTTCCTTACTGTTGTTTTTCTTTCTCTGTTATGCTCGCTGCAGGTAAAGATAGTAAAAAGATGTCGGGATCTTGGTGTTGGACCTGGAGGATTTTACCAACCTGGTTACCGTGAAGGAGGAAAGCTACACTTGAAAATGATGTGCCTAGGTAAAAATTGGGATCCTGACAGTAGTACATATGGGGATGTTCGTCCATTTGATGATACAAAACCACCGAATCTACCCGATGAATTTCATCAACTGGTTGAAAAGGCAATCAAAGATTCTTATGCCATTATGGGGAAAGATTCAACAATAAAAAATCCTGAACGTGTACTTCCAGGGATGAAGCCAAACATCTGTATTGTAAACTTCTACTCACAAAATGGACGATTAGATCTTCATCAGGTCTGCAATAATGGCTCCTATGATCCACTTTTTGGATTGTCCAACATTTGCATAATTCTTCCATCATAATCTTTATTTCATTTTCTTCCCATTAGGATCGAGATGAAAGTCAAGAAAGTCTCGATAAAGGATTGCCTGTTGTCTCCTTCTCCATTGGTGACTCTGCAGAATTCCTATTTGGTGATCGGAGTGACGTTGATCAAGCAGAGAAAGTTACTTTGGAATCCGGAGATATCTTGATATTTGGTGGGAAATCAAGACATGTTTTCCATGGGGTGACTGCAATTCATCCAAACACTGCTCCAAAAGCACTTTTAGAATCAACAAATCTTCGTCCAGGTCGTTTGAATCTTACTTTCCGTCAGTACTGAAAGCTTTTTTTTTTTTCTCATCTAGGGAGATATCCATGAAACAACGCCCTTATATATAATACAGTTCCATGTTTCCCTCTAACAACGCCCTTATAATACAGTTCCATGTTTCCCTCTAGTGCCTTCGGGAAGTTGGTTTTGTTTTACTTAACACATTTCAAAAGTTTCGTCGTAATAGATTCTTGATGCTGTACAGAGGTAGATATTAGTTGTAATTTATTATCCGATGGGAAATTTTATGGCTTTGCAAGAACTTTCTCATATTGTGCAGTTGTAGTCTTTTGCATCACAGTGAAGAGTGGTTGTGAAGGAATTTTTGTTGGGAGTTCTGATCTCTGGAAGGAGTTTCTTCTCTTGTGTTAGTTACCATGGAATCACCCAATTGACTGCCATTTTG

mRNA sequence

CGCAATGAAACGGCCTAAAACCTAGGCAAAGTCTTACTATTGACGCCACACGGACTATTCACATCCCTTTCAAAAAGGGTATAAAGTTTAAACCAACAAAAAGGATTAAAACCCAGCCCTCGCTGCATCAAAACCTCTCTTAATCCCTCCATCTTCTCTCGATCGCTCCTCCGATTCAAGATTTGTCACCGAAATTTATCGAACGCCTTACAAACAATTCAGCCCCTTTCGGCATTCAATTCCCCCCCGATCAATCTTTTCTGTTTCCAGTTCCCTCCTCAAGATCATGAATGACGGTGGCCCTAGATATGCTGGAAGAGGCCATCCGAATAACAGAGGACGATCTCCTCGCTCTGCCGATAATTTCATTTATCGTCCCCGTCATGCTGGAGGTGAGGGGCCCTCTGTTGCTGGTTCTTCTAACACTGGTAGTGGAGCTTTCCGAGCGAGAAGTTCTCACCAGATGTCATCAAGGAATTTCAGGAAGCCTGTTGGACAGAAACAGGCCTCGAGTTCTGAACAGTGGCAGTGGCGGCCTTTAAATAGTGGGAAAGATGCATCTCCTTGTGCTGTTGATCTTCAGCTGGAACATAATTCAACTGATGATATGAGCAATACCTCTAAAAAATTATTGGGATCTATTGCATCAAATTCTGATAGCAATCAAATGAGCAATACCTCTGAGCAACTATTGGGATCTATTGCTTCAAATTCTGATAGCAATCAAATGAGCAATACCTCTGAACAACTATTGGGATCTGTTGCATCAAATTCTGATTGCATCGAACTCCCGCCCTCTTCTGCTGAAAATGTCTTTAAGAGTTTGCATTCTGCTGTAGAAAGAATTCAGATTCGGGAACCTACAGCGGTAGGGGGAAGTTGTAGTGATTCATTTCCTTATGATAATTGCAACAGATCAGATGCTGTTGGACAGGAACTCAAGGTTCAACTTACCTTGGAATCCTGTGTGAAAGATGAGAGTTCCACCATAAAACTAAGGGAAAGTAATAATGTTTCTGACTCTACGGACTCAAAAGACAAGAAGCCTTCAGTGAACCTCGATCCTTTTGATATATGCCCTCTAAAGTCTGGAGTTGTCATGCTGAATCCTTCTTTATTGGCTAAGAACAGAGAAAAGAGGAATGAGATGAAGCGTGCAATGGAGGGAAATAGTGGAATTGTGTTAAGATCAGGAATGGTTCATTTGAAGAGTGGCATTTCCCTTAGGGATCAGGTAAAGATAGTAAAAAGATGTCGGGATCTTGGTGTTGGACCTGGAGGATTTTACCAACCTGGTTACCGTGAAGGAGGAAAGCTACACTTGAAAATGATGTGCCTAGGTAAAAATTGGGATCCTGACAGTAGTACATATGGGGATGTTCGTCCATTTGATGATACAAAACCACCGAATCTACCCGATGAATTTCATCAACTGGTTGAAAAGGCAATCAAAGATTCTTATGCCATTATGGGGAAAGATTCAACAATAAAAAATCCTGAACGTGTACTTCCAGGGATGAAGCCAAACATCTGTATTGTAAACTTCTACTCACAAAATGGACGATTAGATCTTCATCAGGATCGAGATGAAAGTCAAGAAAGTCTCGATAAAGGATTGCCTGTTGTCTCCTTCTCCATTGGTGACTCTGCAGAATTCCTATTTGGTGATCGGAGTGACGTTGATCAAGCAGAGAAAGTTACTTTGGAATCCGGAGATATCTTGATATTTGGTGGGAAATCAAGACATGTTTTCCATGGGGTGACTGCAATTCATCCAAACACTGCTCCAAAAGCACTTTTAGAATCAACAAATCTTCGTCCAGGTCGTTTGAATCTTACTTTCCGTCAGTACTGAAAGCTTTTTTTTTTTTCTCATCTAGGGAGATATCCATGAAACAACGCCCTTATATATAATACAGTTCCATGTTTCCCTCTAACAACGCCCTTATAATACAGTTCCATGTTTCCCTCTAGTGCCTTCGGGAAGTTGGTTTTGTTTT
ACTTAACACATTTCAAAAGTTTCGTCGTAATAGATTCTTGATGCTGTACAGAGGTAGATATTAGTTGTAATTTATTATCCGATGGGAAATTTTATGGCTTTGCAAGAACTTTCTCATATTGTGCAGTTGTAGTCTTTTGCATCACAGTGAAGAGTGGTTGTGAAGGAATTTTTGTTGGGAGTTCTGATCTCTGGAAGGAGTTTCTTCTCTTGTGTTAGTTACCATGGAATCACCCAATTGACTGCCATTTTG

Coding sequence (CDS)

ATGAATGACGGTGGCCCTAGATATGCTGGAAGAGGCCATCCGAATAACAGAGGACGATCTCCTCGCTCTGCCGATAATTTCATTTATCGTCCCCGTCATGCTGGAGGTGAGGGGCCCTCTGTTGCTGGTTCTTCTAACACTGGTAGTGGAGCTTTCCGAGCGAGAAGTTCTCACCAGATGTCATCAAGGAATTTCAGGAAGCCTGTTGGACAGAAACAGGCCTCGAGTTCTGAACAGTGGCAGTGGCGGCCTTTAAATAGTGGGAAAGATGCATCTCCTTGTGCTGTTGATCTTCAGCTGGAACATAATTCAACTGATGATATGAGCAATACCTCTAAAAAATTATTGGGATCTATTGCATCAAATTCTGATAGCAATCAAATGAGCAATACCTCTGAGCAACTATTGGGATCTATTGCTTCAAATTCTGATAGCAATCAAATGAGCAATACCTCTGAACAACTATTGGGATCTGTTGCATCAAATTCTGATTGCATCGAACTCCCGCCCTCTTCTGCTGAAAATGTCTTTAAGAGTTTGCATTCTGCTGTAGAAAGAATTCAGATTCGGGAACCTACAGCGGTAGGGGGAAGTTGTAGTGATTCATTTCCTTATGATAATTGCAACAGATCAGATGCTGTTGGACAGGAACTCAAGGTTCAACTTACCTTGGAATCCTGTGTGAAAGATGAGAGTTCCACCATAAAACTAAGGGAAAGTAATAATGTTTCTGACTCTACGGACTCAAAAGACAAGAAGCCTTCAGTGAACCTCGATCCTTTTGATATATGCCCTCTAAAGTCTGGAGTTGTCATGCTGAATCCTTCTTTATTGGCTAAGAACAGAGAAAAGAGGAATGAGATGAAGCGTGCAATGGAGGGAAATAGTGGAATTGTGTTAAGATCAGGAATGGTTCATTTGAAGAGTGGCATTTCCCTTAGGGATCAGGTAAAGATAGTAAAAAGATGTCGGGATCTTGGTGTTGGACCTGGAGGATTTTACCAACCTGGTTACCGTGAAGGAGGAAAGCTACACTTGAAAATGATGTGCCTAGGTAAAAATTGGGATCCTGACAGTAGTACATATGGGGATGTTCGTCCATTTGATGATACAAAACCACCGAATCTACCCGATGAATTTCATCAACTGGTTGAAAAGGCAATCAAAGATTCTTATGCCATTATGGGGAAAGATTCAACAATAAAAAATCCTGAACGTGTACTTCCAGGGATGAAGCCAAACATCTGTATTGTAAACTTCTACTCACAAAATGGACGATTAGATCTTCATCAGGATCGAGATGAAAGTCAAGAAAGTCTCGATAAAGGATTGCCTGTTGTCTCCTTCTCCATTGGTGACTCTGCAGAATTCCTATTTGGTGATCGGAGTGACGTTGATCAAGCAGAGAAAGTTACTTTGGAATCCGGAGATATCTTGATATTTGGTGGGAAATCAAGACATGTTTTCCATGGGGTGACTGCAATTCATCCAAACACTGCTCCAAAAGCACTTTTAGAATCAACAAATCTTCGTCCAGGTCGTTTGAATCTTACTTTCCGTCAGTACTGA

Protein sequence

MNDGGPRYAGRGHPNNRGRSPRSADNFIYRPRHAGGEGPSVAGSSNTGSGAFRARSSHQMSSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPCAVDLQLEHNSTDDMSNTSKKLLGSIASNSDSNQMSNTSEQLLGSIASNSDSNQMSNTSEQLLGSVASNSDCIELPPSSAENVFKSLHSAVERIQIREPTAVGGSCSDSFPYDNCNRSDAVGQELKVQLTLESCVKDESSTIKLRESNNVSDSTDSKDKKPSVNLDPFDICPLKSGVVMLNPSLLAKNREKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQY
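
The protein sequence above corresponds to a translation of the coding sequence. The sketch below shows how such a translation can be reproduced; it assumes the standard genetic code and is an illustration only, not the pipeline that produced this record. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS listed above.
print(translate("ATGAATGACGGTGGCCCTAGATATGCTGGAAGAGGC"))  # MNDGGPRYAGRG
```

The first twelve residues produced this way match the start of the protein sequence, and the final codons of the CDS (CAG TAC TGA) give Q, Y and a stop.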
BLAST of Lsi07G006230 vs. Swiss-Prot
Match: ALKB_CAUCN (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus (strain NA1000 / CB15N) GN=alkB PE=3 SV=2)

HSP 1 Score: 83.6 bits (205), Expect = 7.1e-15
Identity = 64/195 (32.82%), Positives = 89/195 (45.64%), Query Frame = 1

Query: 330 PGGFYQPGYREGGKLHLKMMCLGK-NWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAI 389
           P   Y+  Y  G  + + M  LG   W  D+  Y  V    +T  P  PD     +  A+
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRP-WPD-----MPPAL 112

Query: 390 KDSYAIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVS 449
            D + ++G   T            P+ C+VN Y    R+ LHQDRDE+    D   PV+S
Sbjct: 113 LDLWTVLGDPET-----------PPDSCLVNLYRDGARMGLHQDRDEA----DPRFPVLS 172

Query: 450 FSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLEST 509
            S+GD+A F  G  +  D    + L SGD+    G +R  FHGV  I P         S+
Sbjct: 173 ISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPG--------SS 216

Query: 510 NLRP--GRLNLTFRQ 522
           +L P  GR+NLT R+
Sbjct: 233 SLVPGGGRINLTLRR 216

BLAST of Lsi07G006230 vs. Swiss-Prot
Match: ALKB_CAUCR (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus (strain ATCC 19089 / CB15) GN=alkB PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 7.1e-15
Identity = 64/195 (32.82%), Positives = 89/195 (45.64%), Query Frame = 1

Query: 330 PGGFYQPGYREGGKLHLKMMCLGK-NWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAI 389
           P   Y+  Y  G  + + M  LG   W  D+  Y  V    +T  P  PD     +  A+
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRP-WPD-----MPPAL 112

Query: 390 KDSYAIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVS 449
            D + ++G   T            P+ C+VN Y    R+ LHQDRDE+    D   PV+S
Sbjct: 113 LDLWTVLGDPET-----------PPDSCLVNLYRDGARMGLHQDRDEA----DPRFPVLS 172

Query: 450 FSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLEST 509
            S+GD+A F  G  +  D    + L SGD+    G +R  FHGV  I P         S+
Sbjct: 173 ISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPG--------SS 216

Query: 510 NLRP--GRLNLTFRQ 522
           +L P  GR+NLT R+
Sbjct: 233 SLVPGGGRINLTLRR 216

BLAST of Lsi07G006230 vs. Swiss-Prot
Match: ALKBH_SCHPO (Alpha-ketoglutarate-dependent dioxygenase abh1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=abh1 PE=2 SV=3)

HSP 1 Score: 82.8 bits (203), Expect = 1.2e-14
Identity = 58/181 (32.04%), Positives = 80/181 (44.20%), Query Frame = 1

Query: 346 LKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPE 405
           L+ + LG+ +D  +  Y D      +K P  P +    VEK +K+S   +          
Sbjct: 140 LRWVTLGEQYDWTTKEYPD-----PSKSPGFPKDLGDFVEKVVKESTDFLH--------- 199

Query: 406 RVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDV 465
                 K    IVNFYS    L  H D  ES+E L   LP++S S+G    +L G  S  
Sbjct: 200 -----WKAEAAIVNFYSPGDTLSAHID--ESEEDLT--LPLISLSMGLDCIYLIGTESRS 259

Query: 466 DQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRPG-----RLNLTFR 522
           ++   + L SGD++I  G SR  FH V  I PN+ P  LL       G     R+N   R
Sbjct: 260 EKPSALRLHSGDVVIMTGTSRKAFHAVPKIIPNSTPNYLLTGNKAWDGWISRKRVNFNVR 297

BLAST of Lsi07G006230 vs. Swiss-Prot
Match: ALKB_ECOLI (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) GN=alkB PE=1 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 1.2e-14
Identity = 49/160 (30.63%), Positives = 76/160 (47.50%), Query Frame = 1

Query: 362 YGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNFY 421
           Y  + P  +   P +P  FH L ++A   +                 P  +P+ C++N Y
Sbjct: 78  YSPIDPQTNKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPDACLINRY 137

Query: 422 SQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILIF 481
           +   +L LHQD+DE     D   P+VS S+G  A F FG     D  +++ LE GD++++
Sbjct: 138 APGAKLSLHQDKDEP----DLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVW 197

Query: 482 GGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQ 522
           GG+SR  +HG+  +     P  +         R NLTFRQ
Sbjct: 198 GGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211

BLAST of Lsi07G006230 vs. Swiss-Prot
Match: ALKB_SALTY (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=alkB PE=3 SV=2)

HSP 1 Score: 79.3 bits (194), Expect = 1.3e-13
Identity = 57/176 (32.39%), Positives = 77/176 (43.75%), Query Frame = 1

Query: 351 LGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKA-IKDSYAIMGKDSTIKNPERVLP 410
           LG   D     Y    P  D   P LP  F  +  +A I   YA                
Sbjct: 67  LGWTTDRHGYCYAVRDPLTDKPWPALPLSFASVCRQAAIAAGYA---------------- 126

Query: 411 GMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAE 470
             +P+ C++N Y+   +L LHQD+DE     D   P+VS S+G  A F FG     D  +
Sbjct: 127 SFQPDACLINRYAPGAKLSLHQDKDEP----DLRAPIVSVSLGVPAVFQFGGLRRSDPIQ 186

Query: 471 KVTLESGDILIFGGKSRHVFHGV----TAIHPNTAPKALLESTNLRPGRLNLTFRQ 522
           ++ LE GDI+++GG+SR  +HG+       HP T              R NLTFRQ
Sbjct: 187 RILLEHGDIVVWGGESRLFYHGIQPLKAGFHPMTG-----------EFRYNLTFRQ 211

BLAST of Lsi07G006230 vs. TrEMBL
Match: A0A0A0LC72_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G651830 PE=4 SV=1)

HSP 1 Score: 823.9 bits (2127), Expect = 1.1e-235
Identity = 423/522 (81.03%), Positives = 449/522 (86.02%), Query Frame = 1

Query: 1   MNDGGPRYAGRGHPNNRGRSPRSADNFIYRPRHAGGEGPSVAGSSNTGSGAFRARSSHQM 60
           MNDGGPRYAGRGHPNNRGRSPRSAD+F YRPRHAGGEG SVAGSSN GSGAFR RSSHQM
Sbjct: 53  MNDGGPRYAGRGHPNNRGRSPRSADHFTYRPRHAGGEGSSVAGSSNPGSGAFRGRSSHQM 112

Query: 61  SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPCAVDLQLEHNSTDDMSNTSKKLLGSIA 120
           SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASP AVDLQL+HNSTDDMSN +        
Sbjct: 113 SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPGAVDLQLQHNSTDDMSNNN-------- 172

Query: 121 SNSDSNQMSNTSEQLLGSIASNSDSNQMSNTSEQLLGSVASNSDCIELPPSSAENVFKSL 180
                       +QLL SI                    ASNSDCIEL  SSA+NV KSL
Sbjct: 173 ------------KQLLESI--------------------ASNSDCIELSSSSAQNVSKSL 232

Query: 181 HSAVERIQIREPTAVGGSCSDSFPYDNCNRSDAVGQELKVQLTLESCVKDESSTIKLRES 240
           HSAVERI ++ PTAV GS  DSFPYDNCNRSD VGQELKVQ +L+SC KDES TI+L +S
Sbjct: 233 HSAVERIHVQGPTAVCGSYGDSFPYDNCNRSDVVGQELKVQPSLKSCAKDESFTIQLGKS 292

Query: 241 NNVSDSTDSKDKKPSVNLDPFDICPLKSGVVMLNPSLLAKNREKRNEMKRAMEGNSGIVL 300
           N+V +STDSKDKKPSV+LD FDICP K+G VMLNPSLLA NREKRNEM+RAMEGN+GIVL
Sbjct: 293 NDVFNSTDSKDKKPSVDLDSFDICPPKTGGVMLNPSLLAMNREKRNEMRRAMEGNNGIVL 352

Query: 301 RSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 360
           R GMVHLK GIS+RDQ KIVK+CRDLG+G GGFYQPGYREGGKLHLKMMCLGKNWDPDSS
Sbjct: 353 RPGMVHLKGGISVRDQAKIVKKCRDLGIGAGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 412

Query: 361 TYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNF 420
           TYGD+RPFDDTKPPNLPDEF+QLVEKAIKDSYAIM +DSTIKNPERVLP MKP+ICIVNF
Sbjct: 413 TYGDIRPFDDTKPPNLPDEFYQLVEKAIKDSYAIMAEDSTIKNPERVLPWMKPDICIVNF 472

Query: 421 YSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 480
           YSQNGRL LHQDRDESQESLDKGLPV+SFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI
Sbjct: 473 YSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 532

Query: 481 FGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQY 523
           FGGKSRHVFHGVTAIH NTAPKALLE+TNLRPGRLNLTFRQY
Sbjct: 533 FGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY 534

BLAST of Lsi07G006230 vs. TrEMBL
Match: A0A061GD55_THECC (2-oxoglutarate-dependent dioxygenase family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_029204 PE=4 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 2.3e-108
Identity = 200/301 (66.45%), Positives = 238/301 (79.07%), Query Frame = 1

Query: 226 SCVKDESSTIKLRESNNVSDST---DSKDKKPSVNLDPFDICPLKSGV-VMLNPSLLAKN 285
           SC++DES   +  +  +  +S    DS   +  V +DPFDIC  K+G  VML PSLL KN
Sbjct: 161 SCLQDESEPSESSQKMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKN 220

Query: 286 REKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREG 345
           REKRNE+KR+MEG +GIVLRSGMV LK  +SL DQVKIVK CR+LG G GGFYQPGYR+G
Sbjct: 221 REKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDG 280

Query: 346 GKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTI 405
            KLHLKMMCLGKNWDP++  Y D+RP D   PP++P EF+ LVEKAIKDS+A++ + +  
Sbjct: 281 AKLHLKMMCLGKNWDPETGNYEDLRPIDCAVPPHIPREFYLLVEKAIKDSHALLQQKAIA 340

Query: 406 KNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGD 465
            + E +LP M PNICIVNFYS +GRL LHQDRDES ESL K LPVVSFSIGDSAEFL+GD
Sbjct: 341 SHVEDILPWMSPNICIVNFYSASGRLGLHQDRDESPESLHKRLPVVSFSIGDSAEFLYGD 400

Query: 466 RSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQ 523
           + DVD+AEKV LESGD+LIFGG SRH+FHGVTAI  NTAP+AL++ TNLRPGRLNLTFR+
Sbjct: 401 QRDVDKAEKVELESGDVLIFGGNSRHIFHGVTAIKQNTAPRALVDETNLRPGRLNLTFRE 460

BLAST of Lsi07G006230 vs. TrEMBL
Match: W9RP27_9ROSA (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Morus notabilis GN=L484_005039 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 6.3e-103
Identity = 185/310 (59.68%), Positives = 232/310 (74.84%), Query Frame = 1

Query: 217 ELKVQLTLESCVK----DESSTIKLRESNNVSDSTDSKDKKPSVNLDPFDICPLKSGVVM 276
           ++++   +E+C+     DE   + ++     +     ++ + S   +PFDIC  K+  V 
Sbjct: 99  QIEISQQVENCLPPKHGDEDVPLSIKSLKTPASDRKFENSESSEVFEPFDICLPKTSAVK 158

Query: 277 LNPSLLAKNREKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGG 336
           L PSLLA NRE+RNE KR  EG +G +LR GMV LKS IS+  Q KIVKRCR LG+GPGG
Sbjct: 159 LKPSLLATNRERRNETKRTTEGLNGRILRPGMVLLKSYISISTQTKIVKRCRHLGLGPGG 218

Query: 337 FYQPGYREGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSY 396
           FYQPGYR+G KLHL MMCLGKNWDP +S YGD RP D  KPP +P EF++LV KAI+DS+
Sbjct: 219 FYQPGYRDGAKLHLNMMCLGKNWDPQTSKYGDYRPTDGAKPPPIPKEFYELVMKAIEDSH 278

Query: 397 AIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIG 456
            ++ K+S   N E++LP M P+IC+VNFYS NGRL LHQDRDES ES+ KGLPVVSFSIG
Sbjct: 279 VLIRKESEAGNAEQILPRMTPDICLVNFYSTNGRLGLHQDRDESHESIRKGLPVVSFSIG 338

Query: 457 DSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRP 516
           D+A+F +GD+ DVD A++V LESGD+LIFGG +R+VFHGVT IH NTAPK LLE T+LRP
Sbjct: 339 DAADFKYGDQRDVDTAKEVMLESGDVLIFGGDARYVFHGVTTIHTNTAPKTLLEQTDLRP 398

Query: 517 GRLNLTFRQY 523
           GR+NLTFRQY
Sbjct: 399 GRVNLTFRQY 408

BLAST of Lsi07G006230 vs. TrEMBL
Match: A0A0D2RCG1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G235600 PE=4 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 1.6e-101
Identity = 188/295 (63.73%), Positives = 224/295 (75.93%), Query Frame = 1

Query: 230 DESSTIKLRESNNVSDSTDSKDK-KPSVNLDPFDICPLKSGV-VMLNPSLLAKNREKRNE 289
           D+S   + R     S   D  ++ K    ++PFDIC  K G  VML PSLL KNREKRNE
Sbjct: 132 DKSEPSQERSPPQSSSGIDDLNQVKCQAVIEPFDICLPKIGTPVMLKPSLLVKNREKRNE 191

Query: 290 MKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKLHLK 349
           +KR+ EG  G VLR GMV LK  +SL DQVKIV+ CR LG+G GGFYQPGYR+G KLHLK
Sbjct: 192 IKRSAEGQIGNVLRPGMVLLKKYLSLGDQVKIVRACRALGLGSGGFYQPGYRDGAKLHLK 251

Query: 350 MMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERV 409
           MMCLGKNWDP++  YGD+RP D   PP +P EF QLVEK IKDS++++   +   + E +
Sbjct: 252 MMCLGKNWDPETGNYGDLRPIDRAVPPGIPREFFQLVEKVIKDSHSLVQLKTKASHVEHI 311

Query: 410 LPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQ 469
           LP MKPNICIVNFYS +GRL LHQD+DES ESL KGLPV+SFSIGD+AEFL+ D+ +VD+
Sbjct: 312 LPSMKPNICIVNFYSASGRLGLHQDKDESPESLHKGLPVISFSIGDAAEFLYSDQREVDK 371

Query: 470 AEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQY 523
           AEKV LESGD+L+FGG SRH+FHGVTAI   TAP +LLE TNLRPGRLNLTFR+Y
Sbjct: 372 AEKVELESGDVLVFGGSSRHIFHGVTAIKQKTAPGSLLEETNLRPGRLNLTFREY 426

BLAST of Lsi07G006230 vs. TrEMBL
Match: M5WUR8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016882mg PE=4 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 7.8e-101
Identity = 185/314 (58.92%), Positives = 235/314 (74.84%), Query Frame = 1

Query: 215 GQELKVQLTLESCVKDESSTIKLRESNNVSDSTDSKDKKPSVNLD-----PFDICPLKSG 274
           G ++ V+L++       S   ++      SD T S +     N +      FD+CP K+G
Sbjct: 85  GVDISVRLSMNHSRNLHSGVERMHIGETPSDMTGSVNTTGHRNPELSEHSAFDLCPTKAG 144

Query: 275 -VVMLNPSLLAKNREKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGV 334
             V L   LL +NRE+RNEMKR+ME  +G VL+ GMV LK  +S  +Q+ IVK CRDLG+
Sbjct: 145 GCVTLKVPLLVQNRERRNEMKRSMEKQNGSVLQPGMVLLKGYLSPSEQINIVKLCRDLGL 204

Query: 335 GPGGFYQPGYREGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAI 394
           GPGGFY+PGYR+G KL+LKMMCLGKNWDP++S+YGD RPFD  KPP++P EF +LV+ AI
Sbjct: 205 GPGGFYKPGYRDGAKLYLKMMCLGKNWDPETSSYGDHRPFDGAKPPSIPVEFFRLVKSAI 264

Query: 395 KDSYAIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVS 454
           ++S++++ KDS + N E +LP M P+IC+VNFYS +GRL LHQD DES+ SL KGLPVVS
Sbjct: 265 EESHSLIRKDSKVSNSESILPWMSPDICLVNFYSSSGRLGLHQDCDESERSLHKGLPVVS 324

Query: 455 FSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLEST 514
           FSIGD+AEFL+GD+ DV++A KV LESGD+LIFGGKSRHVFHGV +I PNTAP  LLE T
Sbjct: 325 FSIGDTAEFLYGDQRDVERANKVLLESGDVLIFGGKSRHVFHGVASIQPNTAPMTLLEKT 384

Query: 515 NLRPGRLNLTFRQY 523
           N+RPGRLNLTFRQY
Sbjct: 385 NIRPGRLNLTFRQY 398

BLAST of Lsi07G006230 vs. TAIR10
Match: AT3G14160.1 (AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 335.1 bits (858), Expect = 7.7e-92
Identity = 175/316 (55.38%), Positives = 224/316 (70.89%), Query Frame = 1

Query: 207 NCNRSDAVGQELKVQLTLESCVKDESSTIKLRESNNVSDSTDSKDKKPSVNLDPFDICPL 266
           +C  S +     KV+L   S V+D+ S  K   + N S+ + ++          FDI   
Sbjct: 157 SCQESVSSTVVQKVEL---SSVEDQKSAPKADGAGNSSNESSTRH---------FDIFLE 216

Query: 267 KSGVVMLNPSLLAKNREKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDL 326
           K G+V L P+LL  +REK    K+A +G SG V+R GMV LK+ +S+ DQV IV +CR L
Sbjct: 217 KKGIV-LKPNLLVLSREK----KKAAKGYSGTVIRPGMVLLKNYLSINDQVMIVNKCRRL 276

Query: 327 GVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEK 386
           G+G GGFYQPGYR+  KLHLKMMCLGKNWDP++S YG+ RPFD +  P +P EF+Q VEK
Sbjct: 277 GLGEGGFYQPGYRDEAKLHLKMMCLGKNWDPETSRYGETRPFDGSTAPRIPAEFNQFVEK 336

Query: 387 AIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPV 446
           A+K+S ++   +S        +P M P+ICIVNFYS  GRL LHQD+DES+ S+ KGLPV
Sbjct: 337 AVKESQSLAASNSKQTKGGDEIPFMLPDICIVNFYSSTGRLGLHQDKDESENSIRKGLPV 396

Query: 447 VSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLE 506
           VSFSIGDSAEFL+GD+ D D+AE +TLESGD+L+FGG+SR VFHGV +I  +TAPKALL+
Sbjct: 397 VSFSIGDSAEFLYGDQRDEDKAETLTLESGDVLLFGGRSRKVFHGVRSIRKDTAPKALLQ 455

Query: 507 STNLRPGRLNLTFRQY 523
            T+LRPGRLNLTFRQY
Sbjct: 457 ETSLRPGRLNLTFRQY 455

BLAST of Lsi07G006230 vs. TAIR10
Match: AT5G01780.2 (AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 283.5 bits (724), Expect = 2.7e-76
Identity = 151/302 (50.00%), Positives = 204/302 (67.55%), Query Frame = 1

Query: 226 SCVKDESSTIKLRESNNVSDST-DSKDKKPSVNLDP--FDICP--LKSGVVMLNPSLLAK 285
           S    +S  +K+R+  N  +S   S+D+ P    DP  FDIC   L+     +   +LA 
Sbjct: 147 SSKSSQSQNLKIRKVRNHRNSGFKSRDQSPQRIKDPPPFDICSSVLERNDTSIKDWILAD 206

Query: 286 NREKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYRE 345
              +          N   V+R GMV LK  ++   QV IVK CR+LGV P GFYQPGY  
Sbjct: 207 ETNRET----VEVSNKHKVIRPGMVLLKDFLTPDIQVDIVKTCRELGVKPTGFYQPGYSV 266

Query: 346 GGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDST 405
           G KLHL+MMCLG+NWDP +    +     D+K P +P  F+ LVEKAI++++A++ ++S 
Sbjct: 267 GSKLHLQMMCLGRNWDPQTKYRKNTDI--DSKAPEIPVTFNVLVEKAIREAHALIDRESG 326

Query: 406 IKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFG 465
            ++ ER+LP M P+ICIVNFYS+ GRL LHQDRDES+ES+ +GLP+VSFSIGDSAEFL+G
Sbjct: 327 TEDAERILPVMSPDICIVNFYSETGRLGLHQDRDESEESIARGLPIVSFSIGDSAEFLYG 386

Query: 466 DRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFR 523
           ++ DV++A+ V LESGD+LIFGG+SR +FHGV +I PN+AP +LL  + LR GRLNLTFR
Sbjct: 387 EKRDVEEAQGVILESGDVLIFGGESRMIFHGVKSIIPNSAPMSLLNESKLRTGRLNLTFR 442

BLAST of Lsi07G006230 vs. TAIR10
Match: AT3G14140.1 (AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 268.1 bits (684), Expect = 1.2e-71
Identity = 142/299 (47.49%), Positives = 194/299 (64.88%), Query Frame = 1

Query: 226 SCVKDESSTIKLRESNNVSDSTDSKDKKPSVNLD-PFDICPLKSGVVMLNPSLLAKNREK 285
           S   D++  +   E++ ++   D      + +   PFDI  LK  V+ L PS L  NREK
Sbjct: 168 SSTSDKNVELSSVENHKIAPKADGPGNSSNESSSSPFDIF-LKKKVMRLKPSFLELNREK 227

Query: 286 RNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKL 345
               K+A +G SGIV+R GMV LK+ +S+ +QV IV +CR LG+G GGFYQPG+++GG L
Sbjct: 228 ----KKAAKGFSGIVIRPGMVLLKNYLSINNQVMIVNKCRQLGLGEGGFYQPGFQDGGLL 287

Query: 346 HLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNP 405
           HLKMMCLGKNWD  +  YG++RP D + PP +P EF QLVEKAIK+S +++  +S     
Sbjct: 288 HLKMMCLGKNWDCQTRRYGEIRPIDGSVPPRIPVEFSQLVEKAIKESKSLVATNSNETKG 347

Query: 406 ERVLPGMKPNICIVNFYSQNGRLDLHQ---------------------DRDESQESLDKG 465
              +P + P+IC+VNFY+  G+L LHQ                     D+ ES++SL KG
Sbjct: 348 GDEIPLLLPDICVVNFYTSTGKLGLHQVSVYDKTSFDFLKYKGGYLNTDKGESKKSLRKG 407

Query: 466 LPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPK 503
           LP+VSFSIGDSAEFL+GD+ DVD+A+ + LESGD+LIFG +SR+VFHGV +I     P+
Sbjct: 408 LPIVSFSIGDSAEFLYGDQKDVDKADTLILESGDVLIFGERSRNVFHGVRSIRKILPPR 461

BLAST of Lsi07G006230 vs. TAIR10
Match: AT1G11780.1 (AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 59.3 bits (142), Expect = 8.1e-09
Identity = 46/150 (30.67%), Positives = 63/150 (42.00%), Query Frame = 1

Query: 346 LKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPE 405
           L+   LG  +D     Y    P +     N+PD   QL     K   AI   D     PE
Sbjct: 177 LRWSTLGLQFDWSKRNYDVSLPHN-----NIPDALCQLA----KTHAAIAMPDGEEFRPE 236

Query: 406 RVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDV 465
                      IVN++     L  H D  E+    D   P+VS S+G  A FL G +S  
Sbjct: 237 GA---------IVNYFGIGDTLGGHLDDMEA----DWSKPIVSMSLGCKAIFLLGGKSKD 296

Query: 466 DQAEKVTLESGDILIFGGKSRHVFHGVTAI 496
           D    + L SGD+++  G++R  FHG+  I
Sbjct: 297 DPPHAMYLRSGDVVLMAGEARECFHGIPRI 304

BLAST of Lsi07G006230 vs. NCBI nr
Match: gi|659127991|ref|XP_008463993.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103501985 [Cucumis melo])

HSP 1 Score: 865.1 bits (2234), Expect = 6.0e-248
Identity = 445/522 (85.25%), Positives = 464/522 (88.89%), Query Frame = 1

Query: 1   MNDGGPRYAGRGHPNNRGRSPRSADNFIYRPRHAGGEGPSVAGSSNTGSGAFRARSSHQM 60
           MNDGGPRYAGRGHPNNRGRSPRSADNF YRPRHAGGEG SVAGSSN GSGAFR RSSHQM
Sbjct: 1   MNDGGPRYAGRGHPNNRGRSPRSADNFTYRPRHAGGEGSSVAGSSNPGSGAFRGRSSHQM 60

Query: 61  SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPCAVDLQLEHNSTDDMSNTSKKLLGSIA 120
           SSRNFRKPVGQKQASSSEQWQWRPLNS KDASP AVDL L+HNSTDDMSNT+K       
Sbjct: 61  SSRNFRKPVGQKQASSSEQWQWRPLNSRKDASPGAVDLHLQHNSTDDMSNTNK------- 120

Query: 121 SNSDSNQMSNTSEQLLGSIASNSDSNQMSNTSEQLLGSVASNSDCIELPPSSAENVFKSL 180
                        QLLGSIASNSDSNQ+SNTSEQ LG +ASNSDCIEL  SSA+NV KSL
Sbjct: 121 -------------QLLGSIASNSDSNQISNTSEQRLGPIASNSDCIELLSSSAQNVSKSL 180

Query: 181 HSAVERIQIREPTAVGGSCSDSFPYDNCNRSDAVGQELKVQLTLESCVKDESSTIKLRES 240
           HSAVERIQI+ PTAV G+CSDSFPYDN N SD VGQELKVQ TLESC KD+SSTIKL ES
Sbjct: 181 HSAVERIQIQGPTAVCGNCSDSFPYDNRNGSDVVGQELKVQPTLESCAKDKSSTIKLGES 240

Query: 241 NNVSDSTDSKDKKPSVNLDPFDICPLKSGVVMLNPSLLAKNREKRNEMKRAMEGNSGIVL 300
           NNV +S DSKDKKPSV+LD FDICP K+G V LNPSLLA NR KRNEMKRAM+GN+GIVL
Sbjct: 241 NNVFNSMDSKDKKPSVDLDTFDICPPKTGGVTLNPSLLAMNRXKRNEMKRAMDGNNGIVL 300

Query: 301 RSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 360
           R GMVHLK  ISLRDQ KIVK+CRDLG+G GGFYQPGYREGGKLHLKMMCLGKNWDPDSS
Sbjct: 301 RPGMVHLKGSISLRDQAKIVKKCRDLGIGAGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 360

Query: 361 TYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNF 420
           TYGDVRPFDDTKPPNLPDEF+QLVEKAIKDSYAI+ KDSTIKNPERVLP MKPNICIVNF
Sbjct: 361 TYGDVRPFDDTKPPNLPDEFYQLVEKAIKDSYAIIAKDSTIKNPERVLPWMKPNICIVNF 420

Query: 421 YSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 480
           YSQNGRL LHQDRDESQESLDKGLPV+SFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI
Sbjct: 421 YSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 480

Query: 481 FGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQY 523
           FGGKSRHVFHGVTAIH  TAPKALLE+TNLRPGRLNLTFRQY
Sbjct: 481 FGGKSRHVFHGVTAIHSKTAPKALLEATNLRPGRLNLTFRQY 502

BLAST of Lsi07G006230 vs. NCBI nr
Match: gi|778682576|ref|XP_011651736.1| (PREDICTED: uncharacterized protein LOC101205291 [Cucumis sativus])

HSP 1 Score: 823.9 bits (2127), Expect = 1.5e-235
Identity = 423/522 (81.03%), Positives = 449/522 (86.02%), Query Frame = 1

Query: 1   MNDGGPRYAGRGHPNNRGRSPRSADNFIYRPRHAGGEGPSVAGSSNTGSGAFRARSSHQM 60
           MNDGGPRYAGRGHPNNRGRSPRSAD+F YRPRHAGGEG SVAGSSN GSGAFR RSSHQM
Sbjct: 1   MNDGGPRYAGRGHPNNRGRSPRSADHFTYRPRHAGGEGSSVAGSSNPGSGAFRGRSSHQM 60

Query: 61  SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPCAVDLQLEHNSTDDMSNTSKKLLGSIA 120
           SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASP AVDLQL+HNSTDDMSN +        
Sbjct: 61  SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPGAVDLQLQHNSTDDMSNNN-------- 120

Query: 121 SNSDSNQMSNTSEQLLGSIASNSDSNQMSNTSEQLLGSVASNSDCIELPPSSAENVFKSL 180
                       +QLL SI                    ASNSDCIEL  SSA+NV KSL
Sbjct: 121 ------------KQLLESI--------------------ASNSDCIELSSSSAQNVSKSL 180

Query: 181 HSAVERIQIREPTAVGGSCSDSFPYDNCNRSDAVGQELKVQLTLESCVKDESSTIKLRES 240
           HSAVERI ++ PTAV GS  DSFPYDNCNRSD VGQELKVQ +L+SC KDES TI+L +S
Sbjct: 181 HSAVERIHVQGPTAVCGSYGDSFPYDNCNRSDVVGQELKVQPSLKSCAKDESFTIQLGKS 240

Query: 241 NNVSDSTDSKDKKPSVNLDPFDICPLKSGVVMLNPSLLAKNREKRNEMKRAMEGNSGIVL 300
           N+V +STDSKDKKPSV+LD FDICP K+G VMLNPSLLA NREKRNEM+RAMEGN+GIVL
Sbjct: 241 NDVFNSTDSKDKKPSVDLDSFDICPPKTGGVMLNPSLLAMNREKRNEMRRAMEGNNGIVL 300

Query: 301 RSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 360
           R GMVHLK GIS+RDQ KIVK+CRDLG+G GGFYQPGYREGGKLHLKMMCLGKNWDPDSS
Sbjct: 301 RPGMVHLKGGISVRDQAKIVKKCRDLGIGAGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 360

Query: 361 TYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNF 420
           TYGD+RPFDDTKPPNLPDEF+QLVEKAIKDSYAIM +DSTIKNPERVLP MKP+ICIVNF
Sbjct: 361 TYGDIRPFDDTKPPNLPDEFYQLVEKAIKDSYAIMAEDSTIKNPERVLPWMKPDICIVNF 420

Query: 421 YSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 480
           YSQNGRL LHQDRDESQESLDKGLPV+SFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI
Sbjct: 421 YSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 480

Query: 481 FGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQY 523
           FGGKSRHVFHGVTAIH NTAPKALLE+TNLRPGRLNLTFRQY
Sbjct: 481 FGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY 482

BLAST of Lsi07G006230 vs. NCBI nr
Match: gi|700203366|gb|KGN58499.1| (hypothetical protein Csa_3G651830 [Cucumis sativus])

HSP 1 Score: 823.9 bits (2127), Expect = 1.5e-235
Identity = 423/522 (81.03%), Positives = 449/522 (86.02%), Query Frame = 1

Query: 1   MNDGGPRYAGRGHPNNRGRSPRSADNFIYRPRHAGGEGPSVAGSSNTGSGAFRARSSHQM 60
           MNDGGPRYAGRGHPNNRGRSPRSAD+F YRPRHAGGEG SVAGSSN GSGAFR RSSHQM
Sbjct: 53  MNDGGPRYAGRGHPNNRGRSPRSADHFTYRPRHAGGEGSSVAGSSNPGSGAFRGRSSHQM 112

Query: 61  SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPCAVDLQLEHNSTDDMSNTSKKLLGSIA 120
           SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASP AVDLQL+HNSTDDMSN +        
Sbjct: 113 SSRNFRKPVGQKQASSSEQWQWRPLNSGKDASPGAVDLQLQHNSTDDMSNNN-------- 172

Query: 121 SNSDSNQMSNTSEQLLGSIASNSDSNQMSNTSEQLLGSVASNSDCIELPPSSAENVFKSL 180
                       +QLL SI                    ASNSDCIEL  SSA+NV KSL
Sbjct: 173 ------------KQLLESI--------------------ASNSDCIELSSSSAQNVSKSL 232

Query: 181 HSAVERIQIREPTAVGGSCSDSFPYDNCNRSDAVGQELKVQLTLESCVKDESSTIKLRES 240
           HSAVERI ++ PTAV GS  DSFPYDNCNRSD VGQELKVQ +L+SC KDES TI+L +S
Sbjct: 233 HSAVERIHVQGPTAVCGSYGDSFPYDNCNRSDVVGQELKVQPSLKSCAKDESFTIQLGKS 292

Query: 241 NNVSDSTDSKDKKPSVNLDPFDICPLKSGVVMLNPSLLAKNREKRNEMKRAMEGNSGIVL 300
           N+V +STDSKDKKPSV+LD FDICP K+G VMLNPSLLA NREKRNEM+RAMEGN+GIVL
Sbjct: 293 NDVFNSTDSKDKKPSVDLDSFDICPPKTGGVMLNPSLLAMNREKRNEMRRAMEGNNGIVL 352

Query: 301 RSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 360
           R GMVHLK GIS+RDQ KIVK+CRDLG+G GGFYQPGYREGGKLHLKMMCLGKNWDPDSS
Sbjct: 353 RPGMVHLKGGISVRDQAKIVKKCRDLGIGAGGFYQPGYREGGKLHLKMMCLGKNWDPDSS 412

Query: 361 TYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNF 420
           TYGD+RPFDDTKPPNLPDEF+QLVEKAIKDSYAIM +DSTIKNPERVLP MKP+ICIVNF
Sbjct: 413 TYGDIRPFDDTKPPNLPDEFYQLVEKAIKDSYAIMAEDSTIKNPERVLPWMKPDICIVNF 472

Query: 421 YSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 480
           YSQNGRL LHQDRDESQESLDKGLPV+SFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI
Sbjct: 473 YSQNGRLGLHQDRDESQESLDKGLPVISFSIGDSAEFLFGDRSDVDQAEKVTLESGDILI 532

Query: 481 FGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQY 523
           FGGKSRHVFHGVTAIH NTAPKALLE+TNLRPGRLNLTFRQY
Sbjct: 533 FGGKSRHVFHGVTAIHSNTAPKALLEATNLRPGRLNLTFRQY 534

BLAST of Lsi07G006230 vs. NCBI nr
Match: gi|590621107|ref|XP_007024711.1| (2-oxoglutarate-dependent dioxygenase family protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 401.0 bits (1029), Expect = 3.2e-108
Identity = 200/301 (66.45%), Positives = 238/301 (79.07%), Query Frame = 1

Query: 226 SCVKDESSTIKLRESNNVSDST---DSKDKKPSVNLDPFDICPLKSGV-VMLNPSLLAKN 285
           SC++DES   +  +  +  +S    DS   +  V +DPFDIC  K+G  VML PSLL KN
Sbjct: 161 SCLQDESEPSESSQKMSPQNSAGFGDSVHTECQVVVDPFDICLSKAGTPVMLKPSLLVKN 220

Query: 286 REKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRDQVKIVKRCRDLGVGPGGFYQPGYREG 345
           REKRNE+KR+MEG +GIVLRSGMV LK  +SL DQVKIVK CR+LG G GGFYQPGYR+G
Sbjct: 221 REKRNEIKRSMEGQNGIVLRSGMVLLKKYLSLSDQVKIVKACRELGFGSGGFYQPGYRDG 280

Query: 346 GKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPNLPDEFHQLVEKAIKDSYAIMGKDSTI 405
            KLHLKMMCLGKNWDP++  Y D+RP D   PP++P EF+ LVEKAIKDS+A++ + +  
Sbjct: 281 AKLHLKMMCLGKNWDPETGNYEDLRPIDCAVPPHIPREFYLLVEKAIKDSHALLQQKAIA 340

Query: 406 KNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDESQESLDKGLPVVSFSIGDSAEFLFGD 465
            + E +LP M PNICIVNFYS +GRL LHQDRDES ESL K LPVVSFSIGDSAEFL+GD
Sbjct: 341 SHVEDILPWMSPNICIVNFYSASGRLGLHQDRDESPESLHKRLPVVSFSIGDSAEFLYGD 400

Query: 466 RSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAIHPNTAPKALLESTNLRPGRLNLTFRQ 523
           + DVD+AEKV LESGD+LIFGG SRH+FHGVTAI  NTAP+AL++ TNLRPGRLNLTFR+
Sbjct: 401 QRDVDKAEKVELESGDVLIFGGNSRHIFHGVTAIKQNTAPRALVDETNLRPGRLNLTFRE 460

BLAST of Lsi07G006230 vs. NCBI nr
Match: gi|1009111586|ref|XP_015901656.1| (PREDICTED: uncharacterized protein LOC107434680 [Ziziphus jujuba])

HSP 1 Score: 399.1 bits (1024), Expect = 1.2e-107
Identity = 188/267 (70.41%), Positives = 226/267 (84.64%), Query Frame = 1

Query: 257 NLDPFDICPLKS-GVVMLNPSLLAKNREKRNEMKRAMEGNSGIVLRSGMVHLKSGISLRD 316
           +++PFDICP K+   ++L PSLLA+NRE+RN+ K   E  +  +L SGMV LKS I+L D
Sbjct: 150 SVEPFDICPPKTTSPIVLKPSLLAQNRERRNQTKSTTERKNPSILGSGMVLLKSCITLND 209

Query: 317 QVKIVKRCRDLGVGPGGFYQPGYREGGKLHLKMMCLGKNWDPDSSTYGDVRPFDDTKPPN 376
           QV+IVK+CR LG+GPGGFYQPGYR+G KLHLKMMCLGKNWDP +  Y D RP DD KPP+
Sbjct: 210 QVEIVKKCRALGLGPGGFYQPGYRDGAKLHLKMMCLGKNWDPGTGKYEDRRPIDDAKPPD 269

Query: 377 LPDEFHQLVEKAIKDSYAIMGKDSTIKNPERVLPGMKPNICIVNFYSQNGRLDLHQDRDE 436
           LP EF++LVEKAIKDS++++ K+    N E++LP M PNICIVNFYS+NGRL LHQDRDE
Sbjct: 270 LPVEFNRLVEKAIKDSHSLIEKEDKAGNAEKILPWMSPNICIVNFYSENGRLGLHQDRDE 329

Query: 437 SQESLDKGLPVVSFSIGDSAEFLFGDRSDVDQAEKVTLESGDILIFGGKSRHVFHGVTAI 496
           S+ESL+ GLPVVSFSIGD+ EFL+GD+ D+D+A+KV LESGD+LIFGGKSRHVFHGVTAI
Sbjct: 330 SRESLELGLPVVSFSIGDAGEFLYGDQRDIDEAQKVVLESGDVLIFGGKSRHVFHGVTAI 389

Query: 497 HPNTAPKALLESTNLRPGRLNLTFRQY 523
           H NTAPKALLE+TNLRPGRLNLTFRQY
Sbjct: 390 HKNTAPKALLEATNLRPGRLNLTFRQY 416
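
The percentages on the HSP summary lines above are the identical and positive counts divided by the alignment length. A minimal sketch that parses those lines and recomputes the percentages follows; the function names `parse_hsp_summary` and `recomputed_pct` are hypothetical helpers, not part of any BLAST tool, and the regular expression assumes the line layout shown on this page.

```python
import re

# Pattern for the "Identity = a/b (p%), <Positives label> = c/d (q%)" line.
# "Pos\w+" tolerates spelling variants of the second label.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and reported percentages from one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Summary lines for the three HSPs listed above.
lines = [
    "Identity = 423/522 (81.03%), Positives = 449/522 (86.02%), Query Frame = 1",
    "Identity = 200/301 (66.45%), Positives = 238/301 (79.07%), Query Frame = 1",
    "Identity = 188/267 (70.41%), Positives = 226/267 (84.64%), Query Frame = 1",
]

for line in lines:
    hsp = parse_hsp_summary(line)
    print(
        hsp["aligned_length"],
        recomputed_pct(hsp["identical"], hsp["aligned_length"]),
        recomputed_pct(hsp["positives"], hsp["aligned_length"]),
    )
```

The alignment length counts gap columns as well as residues, which is why it can differ from the span of query or subject coordinates.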

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
ALKB_CAUCN   7.1e-15  32.82     Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus...
ALKB_CAUCR   7.1e-15  32.82     Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus...
ALKBH_SCHPO  1.2e-14  32.04     Alpha-ketoglutarate-dependent dioxygenase abh1 OS=Schizosaccharomyces pombe (str...
ALKB_ECOLI   1.2e-14  30.63     Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) ...
ALKB_SALTY   1.3e-13  32.39     Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain...

Match Name        E-value   Identity  Description
A0A0A0LC72_CUCSA  1.1e-235  81.03     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G651830 PE=4 SV=1
A0A061GD55_THECC  2.3e-108  66.45     2-oxoglutarate-dependent dioxygenase family protein, putative isoform 1 OS=Theob...
W9RP27_9ROSA      6.3e-103  59.68     Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Morus notabilis GN=L484_005039...
A0A0D2RCG1_GOSRA  1.6e-101  63.73     Uncharacterized protein OS=Gossypium raimondii GN=B456_010G235600 PE=4 SV=1
M5WUR8_PRUPE      7.8e-101  58.92     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016882mg PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G14160.1  7.7e-92  55.38     2-oxoglutarate-dependent dioxygenase family protein
AT5G01780.2  2.7e-76  50.00     2-oxoglutarate-dependent dioxygenase family protein
AT3G14140.1  1.2e-71  47.49     2-oxoglutarate-dependent dioxygenase family protein
AT1G11780.1  8.1e-09  30.67     oxidoreductase, 2OG-Fe(II) oxygenase family protein

Match Name                         E-value   Identity  Description
gi|659127991|ref|XP_008463993.1|   6.0e-248  85.25     PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103501985 [Cucumis me...
gi|778682576|ref|XP_011651736.1|   1.5e-235  81.03     PREDICTED: uncharacterized protein LOC101205291 [Cucumis sativus]
gi|700203366|gb|KGN58499.1|        1.5e-235  81.03     hypothetical protein Csa_3G651830 [Cucumis sativus]
gi|590621107|ref|XP_007024711.1|   3.2e-108  66.45     2-oxoglutarate-dependent dioxygenase family protein, putative isoform 1 [Theobro...
gi|1009111586|ref|XP_015901656.1|  1.2e-107  70.41     PREDICTED: uncharacterized protein LOC107434680 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity
Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process
GO:0006281  DNA repair
Vocabulary: INTERPRO
Term        Definition
IPR027450   AlkB-like
IPR005123   Oxoglu/Fe-dep_dioxygenase
IPR004574   Alkb

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006281      DNA repair
biological_process  GO:0055114      oxidation-reduction process
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0051213      dioxygenase activity
molecular_function  GO:0016706      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
molecular_function  GO:0016491      oxidoreductase activity
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi07G006230.1  Lsi07G006230.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description                                      Source   Source Term         Source Description                                           Alignment
IPR004574  Alkylated DNA repair protein AlkB                    PANTHER  PTHR16557           ALKYLATED DNA REPAIR PROTEIN ALKB-RELATED                    coord: 239..522; score: 1.0E
IPR005123  Oxoglutarate/iron-dependent dioxygenase              PROFILE  PS51471             FE2OG_OXY                                                    coord: 412..522; score: 10
IPR027450  Alpha-ketoglutarate-dependent dioxygenase AlkB-like  GENE3D   G3DSA:2.60.120.590                                                               coord: 299..522; score: 2.1
IPR027450  Alpha-ketoglutarate-dependent dioxygenase AlkB-like  PFAM     PF13532             2OG-FeII_Oxy_2                                               coord: 303..520; score: 2.3
None       No IPR available                                     PANTHER  PTHR16557:SF4       2-OXOGLUTARATE-DEPENDENT DIOXYGENASE FAMILY PROTEIN-RELATED  coord: 239..522; score: 1.0E
None       No IPR available                                     unknown  SSF51197            Clavaminate synthase-like                                    coord: 300..522; score: 3.71
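
The `coord:` ranges in the InterPro table can be compared directly to see where the signature matches overlap. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein, which is an assumption and is not stated on this page; `parse_coord` is a hypothetical helper, and the dictionary simply restates the source terms and ranges listed above.

```python
import re

# Source term -> alignment range, as listed in the InterPro table.
# PTHR16557:SF4 has the same range as PTHR16557 and is omitted here.
matches = {
    "PTHR16557": "coord: 239..522",
    "PS51471": "coord: 412..522",
    "G3DSA:2.60.120.590": "coord: 299..522",
    "PF13532": "coord: 303..520",
    "SSF51197": "coord: 300..522",
}


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    found = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if found is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(found.group(1)), int(found.group(2))


spans = {term: parse_coord(text) for term, text in matches.items()}

# Length of each match, assuming inclusive 1-based coordinates.
lengths = {term: end - start + 1 for term, (start, end) in spans.items()}

# Region covered by every match: latest start to earliest end.
shared_start = max(start for start, _ in spans.values())
shared_end = min(end for _, end in spans.values())

for term, (start, end) in spans.items():
    print(f"{term}: {start}..{end} ({lengths[term]} residues)")
print(f"shared by all matches: {shared_start}..{shared_end}")
```

Under that assumption, all signatures fall in the C-terminal part of the protein, and the region common to every match is bounded by the PROFILE start and the PFAM end.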