Cucsa.130370 (gene) Cucumber (Gy14) v1

Name: Cucsa.130370
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Alpha-ketoglutarate-dependent dioxygenase AlkB
Location: scaffold01029 : 359713 .. 361655 (-)
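The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this; it is the usual convention for GFF-style annotations). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

# Pattern for a location string such as "scaffold01029 : 359713 .. 361655 (-)".
# The layout is taken from this page; other records may differ.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into sequence id, start, end, strand and span length."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: both endpoints are included in the feature.
        "length": end - start + 1,
    }


loc = parse_location("scaffold01029 : 359713 .. 361655 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 1943 bases on the minus strand of scaffold01029.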
The following sequences are available for this feature:

Gene sequence (with intron)

GCACTACCTGACTCTTCATGTTGTGGTAGTTCTTATGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGAGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGGTATTCTCTTGCTTCAATTGTCAACACTTATATGAACTAGAAATGGGAATTGGCATTTTGTTCTCTAGAATACCCATCTTGGGATGATGGACATTAACCTACTTCACTTTTGCTAATTATAAAATTTCCAATGTTGCCTATTGTTAAGAGGTGCCCTGTAGTTTCATCTCTTTGCTGATATTACTCCTTGTTGCCTTACATTGCTTAAATGACGGGATGGTGTTTTTATATCTTATATTTTTCCGTATGGAAGTTAGGGTTTAGTAATTGAGGATTCATTATTCTTGTTGCAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGTTAGTGCTTGTCCTTTGTATCACAATTATATTTAAATTACGGCAGTTTTTTCTTCAAGTGTAATGCTAATTCCAACTCATGTTCTTTGGCAATTAGCCATTTTTATTCACACGATAAATAAATGCTTACATTATAAGTTTAGTTTCTCAACAATGAAGTTTGTGTTAATTTTGTTCATAAACTTTCAAGAATGTCTCTTAACTTTCAATCATGTGCATCACAGTTCCCTAGACTTTTAAGTTTGTCAAGCAGATTCACAGAATATAAAAAATCTGAAAGTTCATAGACTAAATAGATTTACCAAAGCAACATAGCCTATGAATGTGTTAACGTCCATAAGCTCCTTGGTTGGAATCTCCCACCCCATTGTACTAAAAATATATCAAATAAACCTAAATCTTGAAGTTTAGGGACTAAATTTGTAATTTAACCTCCCAAACTTTTTCTAACAAGTTCAATTTGTTGTGCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGACGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCGAAATCGACACCTAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAA

mRNA sequence

GCACTACCTGACTCTTCATGTTGTGGTAGTTCTTATGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGAGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGACGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCGAAATCGACACCTAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAA

Coding sequence (CDS)

GCACTACCTGACTCTTCATGTTGTGGTAGTTCTTATGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGAGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGACGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATACAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCGAAATCGACACCTAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAA

Protein sequence

ALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY*
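The protein sequence above can be reproduced from the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code and reading frame 1; the function name `translate` is illustrative and not taken from any particular library. Only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First eight codons and last three codons of the CDS listed above.
print(translate("GCACTACCTGACTCTTCATGTTGT"))
print(translate("AAGTATTAA"))
```

The first call yields the opening residues ALPDSSCC and the second yields KY followed by the stop symbol `*`, matching the start and end of the listed protein. Note that the listed CDS begins with GCA rather than ATG, so the translated protein starts with alanine.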
BLAST of Cucsa.130370 vs. Swiss-Prot
Match: ALKB_CAUCN (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus (strain NA1000 / CB15N) GN=alkB PE=3 SV=2)

HSP 1 Score: 83.6 bits (205), Expect = 5.4e-15
Identity = 67/197 (34.01%), Positives = 87/197 (44.16%), Query Frame = 1

Query: 200 PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKR 259
           P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRPWPDMPP-------- 112

Query: 260 ALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPV 319
           AL D    + +           P   PD C+ N Y    R+GLHQDRDE+        PV
Sbjct: 113 ALLDLWTVLGD-----------PETPPDSCLVNLYRDGARMGLHQDRDEADPRF----PV 172

Query: 320 VSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLH 379
           +S S+G+TA F  G     D    + L SGDV    G +R  FHGV  I+P S       
Sbjct: 173 LSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGS------- 216

Query: 380 HTGLRP--GRLNLTFRK 392
            + L P  GR+NLT R+
Sbjct: 233 -SSLVPGGGRINLTLRR 216
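The percentages in each HSP header are the match counts divided by the alignment length. As a rough sketch, the header line can be parsed and the percentages recomputed as follows; the function name `hsp_percentages` is hypothetical, and the pattern is written against the header layout used on this page, tolerating spelling variants of the positives label.

```python
import re

# Matches e.g. "Identity = 67/197 (34.01%), Positives = 87/197 (44.16%), ..."
# "Pos\w*" accepts spelling variants of the positives label.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(header_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(header_line)
    if match is None:
        raise ValueError("not an HSP identity line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 67/197 (34.01%), Positives = 87/197 (44.16%), Query Frame = 1"
print(hsp_percentages(line))
```

For the Caulobacter crescentus hit this gives 34.01 and 44.16, in agreement with the values printed in the header.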

BLAST of Cucsa.130370 vs. Swiss-Prot
Match: ALKB_CAUCR (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus (strain ATCC 19089 / CB15) GN=alkB PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 5.4e-15
Identity = 67/197 (34.01%), Positives = 87/197 (44.16%), Query Frame = 1

Query: 200 PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKR 259
           P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRPWPDMPP-------- 112

Query: 260 ALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPV 319
           AL D    + +           P   PD C+ N Y    R+GLHQDRDE+        PV
Sbjct: 113 ALLDLWTVLGD-----------PETPPDSCLVNLYRDGARMGLHQDRDEADPRF----PV 172

Query: 320 VSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLH 379
           +S S+G+TA F  G     D    + L SGDV    G +R  FHGV  I+P S       
Sbjct: 173 LSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGS------- 216

Query: 380 HTGLRP--GRLNLTFRK 392
            + L P  GR+NLT R+
Sbjct: 233 -SSLVPGGGRINLTLRR 216

BLAST of Cucsa.130370 vs. Swiss-Prot
Match: ALKBH_SCHPO (Alpha-ketoglutarate-dependent dioxygenase abh1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=abh1 PE=2 SV=3)

HSP 1 Score: 78.2 bits (191), Expect = 2.3e-13
Identity = 65/240 (27.08%), Positives = 102/240 (42.50%), Query Frame = 1

Query: 172 PGMVLLKHYITPREQINIVKTCQ-----------------NLGIGPGGFYQPGYK-DGAK 231
           PG+++LK+Y++   Q+ ++K+                    L +G    ++  Y  DG  
Sbjct: 60  PGLLILKNYVSSELQMQLLKSIMFTQIQDPENKTNLSPFYQLPLGNDSIWRRYYNGDGES 119

Query: 232 L------------------RLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVK 291
           +                  +LR + LG  +D  T+ Y      D +K P  P      V+
Sbjct: 120 IIDGLGETKPLTVDRLVHKKLRWVTLGEQYDWTTKEYP-----DPSKSPGFPKDLGDFVE 179

Query: 292 RALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLP 351
           + +K++  F+                  +  I NFY+    L  H D  ES+E L   LP
Sbjct: 180 KVVKESTDFLH--------------WKAEAAIVNFYSPGDTLSAHID--ESEEDLT--LP 239

Query: 352 VVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL 376
           ++S S+G    +L G +   +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Sbjct: 240 LISLSMGLDCIYLIGTESRSEKPSALRLHSGDVVIMTGTSRKAFHAVPKIIPNSTPNYLL 276

BLAST of Cucsa.130370 vs. Swiss-Prot
Match: ALKB_SALTY (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=alkB PE=3 SV=2)

HSP 1 Score: 72.0 bits (175), Expect = 1.6e-11
Identity = 42/112 (37.50%), Positives = 57/112 (50.89%), Query Frame = 1

Query: 280 SMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAE 339
           S  PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  +
Sbjct: 111 SFQPDACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGVPAVFQFGGLRRSDPIQ 170

Query: 340 MVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 392
            + LE GD++++GGESR  +HG+        P     H      R NLTFR+
Sbjct: 171 RILLEHGDIVVWGGESRLFYHGIQ-------PLKAGFHPMTGEFRYNLTFRQ 211

BLAST of Cucsa.130370 vs. Swiss-Prot
Match: ALKB_ECOLI (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) GN=alkB PE=1 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 2.1e-11
Identity = 55/167 (32.93%), Positives = 74/167 (44.31%), Query Frame = 1

Query: 226 DPQTRRYENKRVVDGNKP-PDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPD 285
           DPQT           NKP P +P  F  L +RA   A                 P   PD
Sbjct: 82  DPQT-----------NKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPD 141

Query: 286 ICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELE 345
            C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  +  D  + + LE
Sbjct: 142 ACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 201

Query: 346 SGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 392
            GDV+++GGESR  +HG+  +     P  +         R NLTFR+
Sbjct: 202 HGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211

BLAST of Cucsa.130370 vs. TrEMBL
Match: A0A0A0KY56_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G329550 PE=4 SV=1)

HSP 1 Score: 806.2 bits (2081), Expect = 1.7e-230
Identity = 389/392 (99.23%), Positives = 389/392 (99.23%), Query Frame = 1

Query: 1   ALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHR 60
           ALPDSSCCGSS GCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHR
Sbjct: 53  ALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHR 112

Query: 61  QNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL 120
           QNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL
Sbjct: 113 QNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL 172

Query: 121 DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHY 180
           DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHY
Sbjct: 173 DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHY 232

Query: 181 ITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDG 240
           ITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDG
Sbjct: 233 ITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDG 292

Query: 241 NKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLH 300
           NKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLH
Sbjct: 293 NKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLH 352

Query: 301 QDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFH 360
           QDRDESKESLW GLPVVSFSVGN AEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFH
Sbjct: 353 QDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFH 412

Query: 361 GVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 393
           GVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
Sbjct: 413 GVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 444

BLAST of Cucsa.130370 vs. TrEMBL
Match: A0A061EI26_THECC (2-oxoglutarate-dependent dioxygenase family protein isoform 2 OS=Theobroma cacao GN=TCM_019764 PE=4 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 2.3e-97
Identity = 197/343 (57.43%), Positives = 240/343 (69.97%), Query Frame = 1

Query: 55  SLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKN-EVF 114
           +L     N RR+R D G +   K+   S   E     N    + K SLP HFGKK   ++
Sbjct: 32  TLKGRNSNNRRTRSDSGFEPRHKAVDSS---EHKGIANSLSLQDKCSLPSHFGKKVVNIY 91

Query: 115 VSKLQSLDTGPKESVVTDNS-----LPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYR 174
           V K  S ++  K+ V T N+     LP    FDICLP       R  + ++        +
Sbjct: 92  VPKSVSGESKSKDVVGTKNTDFSEGLPKVERFDICLPT------RRAFGIQ--------K 151

Query: 175 LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQ 234
           +LRPGMVLLK YI+  EQINIVKTCQ LG+GPGGFY+PGYKDGAKLRL MMCLGL+WDPQ
Sbjct: 152 VLRPGMVLLKRYISLCEQINIVKTCQTLGVGPGGFYRPGYKDGAKLRLHMMCLGLNWDPQ 211

Query: 235 TRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIA 294
           TR+Y+ +  +D  +PP+IP +F  LV+RA++DAH  IK N  + NVE++LPSMSPDICI 
Sbjct: 212 TRKYDKRHPIDDCEPPNIPCEFCLLVRRAIQDAHCLIKKNYIVGNVEDVLPSMSPDICII 271

Query: 295 NFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV 354
           NFYTT GRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLYGD+R+ DKAE V L+SGDV
Sbjct: 272 NFYTTNGRLGLHQDRDESRESLHKGLPVVSFSIGNSAEFLYGDQRDEDKAEKVVLDSGDV 331

Query: 355 LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 392
           LIFGGESR +FHGV SIIP + P+ LL  TGLR GRLNLTFR+
Sbjct: 332 LIFGGESRMVFHGVPSIIPNTAPQALLAETGLRRGRLNLTFRQ 357

BLAST of Cucsa.130370 vs. TrEMBL
Match: A0A061EHN0_THECC (2-oxoglutarate-dependent dioxygenase family protein isoform 1 OS=Theobroma cacao GN=TCM_019764 PE=4 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 2.3e-97
Identity = 197/343 (57.43%), Positives = 240/343 (69.97%), Query Frame = 1

Query: 55  SLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKN-EVF 114
           +L     N RR+R D G +   K+   S   E     N    + K SLP HFGKK   ++
Sbjct: 107 TLKGRNSNNRRTRSDSGFEPRHKAVDSS---EHKGIANSLSLQDKCSLPSHFGKKVVNIY 166

Query: 115 VSKLQSLDTGPKESVVTDNS-----LPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYR 174
           V K  S ++  K+ V T N+     LP    FDICLP       R  + ++        +
Sbjct: 167 VPKSVSGESKSKDVVGTKNTDFSEGLPKVERFDICLPT------RRAFGIQ--------K 226

Query: 175 LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQ 234
           +LRPGMVLLK YI+  EQINIVKTCQ LG+GPGGFY+PGYKDGAKLRL MMCLGL+WDPQ
Sbjct: 227 VLRPGMVLLKRYISLCEQINIVKTCQTLGVGPGGFYRPGYKDGAKLRLHMMCLGLNWDPQ 286

Query: 235 TRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIA 294
           TR+Y+ +  +D  +PP+IP +F  LV+RA++DAH  IK N  + NVE++LPSMSPDICI 
Sbjct: 287 TRKYDKRHPIDDCEPPNIPCEFCLLVRRAIQDAHCLIKKNYIVGNVEDVLPSMSPDICII 346

Query: 295 NFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV 354
           NFYTT GRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLYGD+R+ DKAE V L+SGDV
Sbjct: 347 NFYTTNGRLGLHQDRDESRESLHKGLPVVSFSIGNSAEFLYGDQRDEDKAEKVVLDSGDV 406

Query: 355 LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 392
           LIFGGESR +FHGV SIIP + P+ LL  TGLR GRLNLTFR+
Sbjct: 407 LIFGGESRMVFHGVPSIIPNTAPQALLAETGLRRGRLNLTFRQ 432

BLAST of Cucsa.130370 vs. TrEMBL
Match: M5WB46_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006591mg PE=4 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 4.1e-94
Identity = 183/315 (58.10%), Positives = 223/315 (70.79%), Query Frame = 1

Query: 99  KSSLPIHFGKKNEVFVS-KLQSLDTGPKESVVTDNS-----LPFEPPFDICLPGGGNVKH 158
           K  LP  FG K + F + K  S     K    + NS       +  PFDICL G  + + 
Sbjct: 90  KYCLPTEFGNKRKHFSAVKPHSEPRNMKYGCASKNSDCSKGFHYNEPFDICLSGSRSYEL 149

Query: 159 RNIYVVK-EGGTVKDYR--------------LLRPGMVLLKHYITPREQINIVKTCQNLG 218
           +  Y    E    +D+               +LRPGMVLLKHY+T  EQ+ IVK C+ LG
Sbjct: 150 KASYARNMENQNEEDHMVEFTNPEALNSTNLILRPGMVLLKHYVTHTEQVEIVKKCRQLG 209

Query: 219 IGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRA 278
           +GPGGFYQPGYKDGAKLRL+MMCLG DWDP+TR+Y ++R +DG +PP IP +F+ LVKRA
Sbjct: 210 LGPGGFYQPGYKDGAKLRLQMMCLGHDWDPETRKYGSRRTIDGTQPPGIPHEFSLLVKRA 269

Query: 279 LKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVV 338
           +++AHA IK    +S+VEEILPS+SPDICIANFYTT GRLGLHQDRDES++SL  GLPVV
Sbjct: 270 IEEAHAHIKEELRVSSVEEILPSISPDICIANFYTTSGRLGLHQDRDESEKSLREGLPVV 329

Query: 339 SFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHH 393
           S S+G++A+FLYGD+R++ KAE V LESGDVLIFGG SRHIFHGV+SIIP S P  LL  
Sbjct: 330 SISIGDSADFLYGDQRDIGKAESVVLESGDVLIFGGRSRHIFHGVTSIIPDSAPMNLLEE 389

BLAST of Cucsa.130370 vs. TrEMBL
Match: A0A151RSG9_CAJCA (Alpha-ketoglutarate-dependent dioxygenase alkB isogeny OS=Cajanus cajan GN=KK1_032956 PE=4 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 2.5e-91
Identity = 182/348 (52.30%), Positives = 231/348 (66.38%), Query Frame = 1

Query: 60  RQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEV-FVSKLQ 119
           R++ R++RI+LG     + N      +     N +    +S    H  KK +  F     
Sbjct: 20  RKSKRKTRINLGLSHASEENCSVPHTKEFSTSNLTSYHDESPQDSHSWKKKDASFKRPYY 79

Query: 120 SLDTGPKESVVTDN-----SLPFEPPFDICL------PGGGNVKHRN---IYVVKEGGTV 179
           +  T    SVV  N       P   PFDIC       P  G   HR    + +  +   +
Sbjct: 80  NSPTNSNYSVVGHNLDSPVGTPKFKPFDICFQRKRNSPSIGATSHRESNEMGIEMQEEEI 139

Query: 180 KDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLD 239
           ++  LL PGMVLLK++IT  EQ+ IVK C+ LG+GPGGFYQPGY  GAKLRL+MMCLG D
Sbjct: 140 QEEILLGPGMVLLKNFITHDEQVEIVKVCRELGVGPGGFYQPGYASGAKLRLKMMCLGKD 199

Query: 240 WDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPD 299
           WDPQT +Y  KRV+DG++PP IP  F+ LV R++K+AH+ IK  C + NVE+ LPSM+PD
Sbjct: 200 WDPQTYKYGKKRVIDGSEPPSIPNYFSELVSRSIKEAHSLIKKECRVWNVEDELPSMTPD 259

Query: 300 ICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELE 359
           ICI NFYT  G+LGLHQDRDES+ESL  GLPVVSFS+G++AEFLYGD+R+V+ AE V LE
Sbjct: 260 ICIVNFYTNNGKLGLHQDRDESRESLRKGLPVVSFSIGDSAEFLYGDQRDVEMAESVLLE 319

Query: 360 SGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 393
           SGDVLIFGGESRH+FHGVSS++P S P+ LL  + L PGRLNLTFR+Y
Sbjct: 320 SGDVLIFGGESRHVFHGVSSVLPNSAPEKLLRDSCLIPGRLNLTFRQY 367

BLAST of Cucsa.130370 vs. TAIR10
Match: AT5G01780.2 (AT5G01780.2 2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 288.9 bits (738), Expect = 4.8e-78
Identity = 147/266 (55.26%), Positives = 190/266 (71.43%), Query Frame = 1

Query: 138 PPFDICLPGGGNVKHRNIYVVKEGGTVKD-----------YRLLRPGMVLLKHYITPREQ 197
           PPFDIC     +V  RN   +K+     +           ++++RPGMVLLK ++TP  Q
Sbjct: 183 PPFDIC----SSVLERNDTSIKDWILADETNRETVEVSNKHKVIRPGMVLLKDFLTPDIQ 242

Query: 198 INIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDI 257
           ++IVKTC+ LG+ P GFYQPGY  G+KL L+MMCLG +WDPQT+  +N  +   +K P+I
Sbjct: 243 VDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMCLGRNWDPQTKYRKNTDI--DSKAPEI 302

Query: 258 PPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDES 317
           P  F  LV++A+++AHA I       + E ILP MSPDICI NFY+  GRLGLHQDRDES
Sbjct: 303 PVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVNFYSETGRLGLHQDRDES 362

Query: 318 KESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSII 377
           +ES+  GLP+VSFS+G++AEFLYG+KR+V++A+ V LESGDVLIFGGESR IFHGV SII
Sbjct: 363 EESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVLIFGGESRMIFHGVKSII 422

Query: 378 PKSTPKFLLHHTGLRPGRLNLTFRKY 393
           P S P  LL+ + LR GRLNLTFR +
Sbjct: 423 PNSAPMSLLNESKLRTGRLNLTFRHF 442

BLAST of Cucsa.130370 vs. TAIR10
Match: AT3G14160.1 (AT3G14160.1 2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 284.3 bits (726), Expect = 1.2e-76
Identity = 156/330 (47.27%), Positives = 214/330 (64.85%), Query Frame = 1

Query: 70  LGSKRDLKSNARSY--QVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKES 129
           + S+ + K  A+ Y   V R+  +  SCQE  SS  +      +V +S ++   + PK  
Sbjct: 131 VSSECEDKDGAKMYCDLVNRVNDVTLSCQESVSSTVVQ-----KVELSSVEDQKSAPKAD 190

Query: 130 VVTDNSLPFEPP-FDICLPGGGNVKHRNIYVV--KEGGTVKDYR--LLRPGMVLLKHYIT 189
              ++S       FDI L   G V   N+ V+  ++    K Y   ++RPGMVLLK+Y++
Sbjct: 191 GAGNSSNESSTRHFDIFLEKKGIVLKPNLLVLSREKKKAAKGYSGTVIRPGMVLLKNYLS 250

Query: 190 PREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNK 249
             +Q+ IV  C+ LG+G GGFYQPGY+D AKL L+MMCLG +WDP+T RY   R  DG+ 
Sbjct: 251 INDQVMIVNKCRRLGLGEGGFYQPGYRDEAKLHLKMMCLGKNWDPETSRYGETRPFDGST 310

Query: 250 PPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQD 309
            P IP +F   V++A+K++ +   +N   +   + +P M PDICI NFY++ GRLGLHQD
Sbjct: 311 APRIPAEFNQFVEKAVKESQSLAASNSKQTKGGDEIPFMLPDICIVNFYSSTGRLGLHQD 370

Query: 310 RDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGV 369
           +DES+ S+  GLPVVSFS+G++AEFLYGD+R+ DKAE + LESGDVL+FGG SR +FHGV
Sbjct: 371 KDESENSIRKGLPVVSFSIGDSAEFLYGDQRDEDKAETLTLESGDVLLFGGRSRKVFHGV 430

Query: 370 SSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 393
            SI   + PK LL  T LRPGRLNLTFR+Y
Sbjct: 431 RSIRKDTAPKALLQETSLRPGRLNLTFRQY 455

BLAST of Cucsa.130370 vs. TAIR10
Match: AT3G14140.1 (AT3G14140.1 2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 229.9 bits (585), Expect = 2.6e-60
Identity = 111/218 (50.92%), Positives = 154/218 (70.64%), Query Frame = 1

Query: 169 LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQ 228
           ++RPGMVLLK+Y++   Q+ IV  C+ LG+G GGFYQPG++DG  L L+MMCLG +WD Q
Sbjct: 237 VIRPGMVLLKNYLSINNQVMIVNKCRQLGLGEGGFYQPGFQDGGLLHLKMMCLGKNWDCQ 296

Query: 229 TRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIA 288
           TRRY   R +DG+ PP IP +F+ LV++A+K++ + +  N N +   + +P + PDIC+ 
Sbjct: 297 TRRYGEIRPIDGSVPPRIPVEFSQLVEKAIKESKSLVATNSNETKGGDEIPLLLPDICVV 356

Query: 289 NFYTTRGRLGLHQ---------------------DRDESKESLWTGLPVVSFSVGNTAEF 348
           NFYT+ G+LGLHQ                     D+ ESK+SL  GLP+VSFS+G++AEF
Sbjct: 357 NFYTSTGKLGLHQVSVYDKTSFDFLKYKGGYLNTDKGESKKSLRKGLPIVSFSIGDSAEF 416

Query: 349 LYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSI 366
           LYGD+++VDKA+ + LESGDVLIFG  SR++FHGV SI
Sbjct: 417 LYGDQKDVDKADTLILESGDVLIFGERSRNVFHGVRSI 454

BLAST of Cucsa.130370 vs. TAIR10
Match: AT1G11780.1 (AT1G11780.1 oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 56.6 bits (135), Expect = 4.0e-08
Identity = 50/151 (33.11%), Positives = 68/151 (45.03%), Query Frame = 1

Query: 215 RLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNV 274
           +LR   LGL +D   R Y+    +  N  PD   Q        L   HA I     + + 
Sbjct: 176 KLRWSTLGLQFDWSKRNYDVS--LPHNNIPDALCQ--------LAKTHAAIA----MPDG 235

Query: 275 EEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRN 334
           EE      P+  I N++     LG H D     E+ W+  P+VS S+G  A FL G K  
Sbjct: 236 EEF----RPEGAIVNYFGIGDTLGGHLD---DMEADWSK-PIVSMSLGCKAIFLLGGKSK 295

Query: 335 VDKAEMVELESGDVLIFGGESRHIFHGVSSI 366
            D    + L SGDV++  GE+R  FHG+  I
Sbjct: 296 DDPPHAMYLRSGDVVLMAGEARECFHGIPRI 304

BLAST of Cucsa.130370 vs. NCBI nr
Match: gi|449464420|ref|XP_004149927.1| (PREDICTED: uncharacterized protein LOC101210053 [Cucumis sativus])

HSP 1 Score: 806.2 bits (2081), Expect = 2.5e-230
Identity = 389/392 (99.23%), Positives = 389/392 (99.23%), Query Frame = 1

Query: 1   ALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHR 60
           ALPDSSCCGSS GCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHR
Sbjct: 53  ALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHR 112

Query: 61  QNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL 120
           QNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL
Sbjct: 113 QNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSL 172

Query: 121 DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHY 180
           DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHY
Sbjct: 173 DTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHY 232

Query: 181 ITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDG 240
           ITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDG
Sbjct: 233 ITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDG 292

Query: 241 NKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLH 300
           NKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLH
Sbjct: 293 NKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLH 352

Query: 301 QDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFH 360
           QDRDESKESLW GLPVVSFSVGN AEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFH
Sbjct: 353 QDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFH 412

Query: 361 GVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 393
           GVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY
Sbjct: 413 GVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 444

BLAST of Cucsa.130370 vs. NCBI nr
Match: gi|659128544|ref|XP_008464254.1| (PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo])

HSP 1 Score: 694.1 bits (1790), Expect = 1.4e-196
Identity = 344/420 (81.90%), Positives = 362/420 (86.19%), Query Frame = 1

Query: 1   ALPDSSCCGSSYGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKS----------- 60
           A PDSSC G+S GCGRDKEHL DRDN SDVI +GS  VHLNPKEREPKS           
Sbjct: 53  APPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDY 112

Query: 61  -----------------YNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLN 120
                            Y+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF N
Sbjct: 113 VEVGSDKFGISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFN 172

Query: 121 DSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKH 180
           D CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNVKH
Sbjct: 173 DYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKH 232

Query: 181 RNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGA 240
           RN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGA
Sbjct: 233 RNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGA 292

Query: 241 KLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNIS 300
           KLRLRMMCLGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNIS
Sbjct: 293 KLRLRMMCLGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNIS 352

Query: 301 NVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDK 360
           NVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL +GLPVVSFSVGNTAEFLYGDK
Sbjct: 353 NVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDK 412

Query: 361 RNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 393
           R+V+KAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Sbjct: 413 RDVEKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY 472

BLAST of Cucsa.130370 vs. NCBI nr
Match: gi|590654056|ref|XP_007033595.1| (2-oxoglutarate-dependent dioxygenase family protein isoform 2 [Theobroma cacao])

HSP 1 Score: 364.0 bits (933), Expect = 3.3e-97
Identity = 197/343 (57.43%), Positives = 240/343 (69.97%), Query Frame = 1

Query: 55  SLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKN-EVF 114
           +L     N RR+R D G +   K+   S   E     N    + K SLP HFGKK   ++
Sbjct: 32  TLKGRNSNNRRTRSDSGFEPRHKAVDSS---EHKGIANSLSLQDKCSLPSHFGKKVVNIY 91

Query: 115 VSKLQSLDTGPKESVVTDNS-----LPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYR 174
           V K  S ++  K+ V T N+     LP    FDICLP       R  + ++        +
Sbjct: 92  VPKSVSGESKSKDVVGTKNTDFSEGLPKVERFDICLPT------RRAFGIQ--------K 151

Query: 175 LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQ 234
           +LRPGMVLLK YI+  EQINIVKTCQ LG+GPGGFY+PGYKDGAKLRL MMCLGL+WDPQ
Sbjct: 152 VLRPGMVLLKRYISLCEQINIVKTCQTLGVGPGGFYRPGYKDGAKLRLHMMCLGLNWDPQ 211

Query: 235 TRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIA 294
           TR+Y+ +  +D  +PP+IP +F  LV+RA++DAH  IK N  + NVE++LPSMSPDICI 
Sbjct: 212 TRKYDKRHPIDDCEPPNIPCEFCLLVRRAIQDAHCLIKKNYIVGNVEDVLPSMSPDICII 271

Query: 295 NFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV 354
           NFYTT GRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLYGD+R+ DKAE V L+SGDV
Sbjct: 272 NFYTTNGRLGLHQDRDESRESLHKGLPVVSFSIGNSAEFLYGDQRDEDKAEKVVLDSGDV 331

Query: 355 LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 392
           LIFGGESR +FHGV SIIP + P+ LL  TGLR GRLNLTFR+
Sbjct: 332 LIFGGESRMVFHGVPSIIPNTAPQALLAETGLRRGRLNLTFRQ 357

BLAST of Cucsa.130370 vs. NCBI nr
Match: gi|590654052|ref|XP_007033594.1| (2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 364.0 bits (933), Expect = 3.3e-97
Identity = 197/343 (57.43%), Positives = 240/343 (69.97%), Query Frame = 1

Query: 55  SLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKN-EVF 114
           +L     N RR+R D G +   K+   S   E     N    + K SLP HFGKK   ++
Sbjct: 107 TLKGRNSNNRRTRSDSGFEPRHKAVDSS---EHKGIANSLSLQDKCSLPSHFGKKVVNIY 166

Query: 115 VSKLQSLDTGPKESVVTDNS-----LPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYR 174
           V K  S ++  K+ V T N+     LP    FDICLP       R  + ++        +
Sbjct: 167 VPKSVSGESKSKDVVGTKNTDFSEGLPKVERFDICLPT------RRAFGIQ--------K 226

Query: 175 LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQ 234
           +LRPGMVLLK YI+  EQINIVKTCQ LG+GPGGFY+PGYKDGAKLRL MMCLGL+WDPQ
Sbjct: 227 VLRPGMVLLKRYISLCEQINIVKTCQTLGVGPGGFYRPGYKDGAKLRLHMMCLGLNWDPQ 286

Query: 235 TRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIA 294
           TR+Y+ +  +D  +PP+IP +F  LV+RA++DAH  IK N  + NVE++LPSMSPDICI 
Sbjct: 287 TRKYDKRHPIDDCEPPNIPCEFCLLVRRAIQDAHCLIKKNYIVGNVEDVLPSMSPDICII 346

Query: 295 NFYTTRGRLGLHQDRDESKESLWTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDV 354
           NFYTT GRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLYGD+R+ DKAE V L+SGDV
Sbjct: 347 NFYTTNGRLGLHQDRDESRESLHKGLPVVSFSIGNSAEFLYGDQRDEDKAEKVVLDSGDV 406

Query: 355 LIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 392
           LIFGGESR +FHGV SIIP + P+ LL  TGLR GRLNLTFR+
Sbjct: 407 LIFGGESRMVFHGVPSIIPNTAPQALLAETGLRRGRLNLTFRQ 432

BLAST of Cucsa.130370 vs. NCBI nr
Match: gi|502169184|ref|XP_004514555.1| (PREDICTED: uncharacterized protein LOC101492962 [Cicer arietinum])

HSP 1 Score: 356.3 bits (913), Expect = 6.9e-95
Identity = 180/262 (68.70%), Positives = 204/262 (77.86%), Query Frame = 1

Query: 141 DICLPGGGNV----------KHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIV 200
           DIC  G  N              N   ++EGG + D R+LRPGMVLLKH++T  EQ+ IV
Sbjct: 222 DICFHGKRNFDLIGSPLLEQNMENCSEMQEGG-IND-RILRPGMVLLKHHLTHDEQVEIV 281

Query: 201 KTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQF 260
           K C+NLG+GPGGFYQPGY DGAKLRL MMCLG+DWDPQTR+Y  KRVVDG+KPP IP  F
Sbjct: 282 KNCRNLGLGPGGFYQPGYADGAKLRLTMMCLGMDWDPQTRKYGYKRVVDGSKPPSIPNFF 341

Query: 261 TFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESL 320
           + LV RAL++AH  I   C IS VE+ILPSM+PDICI NFYTTRGRLGLHQDRDES+ESL
Sbjct: 342 SKLVIRALQEAHRLINQECEISYVEDILPSMTPDICIVNFYTTRGRLGLHQDRDESRESL 401

Query: 321 WTGLPVVSFSVGNTAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKST 380
             GLPVVSFSVG+TAEFLYGD RN++KAE   LESGDVLIFGGESRH+FHG+SSIIP S 
Sbjct: 402 QKGLPVVSFSVGDTAEFLYGDNRNIEKAENALLESGDVLIFGGESRHVFHGISSIIPNSA 461

Query: 381 PKFLLHHTGLRPGRLNLTFRKY 393
           P  LLH T L PGRLNLTFR+Y
Sbjct: 462 PNELLHDTCLCPGRLNLTFRQY 481

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ALKB_CAUCN5.4e-1534.01Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus... [more]
ALKB_CAUCR5.4e-1534.01Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter crescentus... [more]
ALKBH_SCHPO2.3e-1327.08Alpha-ketoglutarate-dependent dioxygenase abh1 OS=Schizosaccharomyces pombe (str... [more]
ALKB_SALTY1.6e-1137.50Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain... [more]
ALKB_ECOLI2.1e-1132.93Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) ... [more]
Match NameE-valueIdentityDescription
A0A0A0KY56_CUCSA1.7e-23099.23Uncharacterized protein OS=Cucumis sativus GN=Csa_4G329550 PE=4 SV=1[more]
A0A061EI26_THECC2.3e-9757.432-oxoglutarate-dependent dioxygenase family protein isoform 2 OS=Theobroma cacao... [more]
A0A061EHN0_THECC2.3e-9757.432-oxoglutarate-dependent dioxygenase family protein isoform 1 OS=Theobroma cacao... [more]
M5WB46_PRUPE4.1e-9458.10Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006591mg PE=4 SV=1[more]
A0A151RSG9_CAJCA2.5e-9152.30Alpha-ketoglutarate-dependent dioxygenase alkB isogeny OS=Cajanus cajan GN=KK1_0... [more]

Match Name   E-value  Identity  Description
AT5G01780.2  4.8e-78  55.26     2-oxoglutarate-dependent dioxygenase family protein
AT3G14160.1  1.2e-76  47.27     2-oxoglutarate-dependent dioxygenase family protein
AT3G14140.1  2.6e-60  50.92     2-oxoglutarate-dependent dioxygenase family protein
AT1G11780.1  4.0e-08  33.11     oxidoreductase, 2OG-Fe(II) oxygenase family protein

Match Name                        E-value   Identity  Description
gi|449464420|ref|XP_004149927.1|  2.5e-230  99.23     PREDICTED: uncharacterized protein LOC101210053 [Cucumis sativus]
gi|659128544|ref|XP_008464254.1|  1.4e-196  81.90     PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo]
gi|590654056|ref|XP_007033595.1|  3.3e-97   57.43     2-oxoglutarate-dependent dioxygenase family protein isoform 2 [Theobroma cacao]
gi|590654052|ref|XP_007033594.1|  3.3e-97   57.43     2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Theobroma cacao]
gi|502169184|ref|XP_004514555.1|  6.9e-95   68.70     PREDICTED: uncharacterized protein LOC101492962 [Cicer arietinum]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR004574  Alkb
IPR005123  Oxoglu/Fe-dep_dioxygenase
IPR027450  AlkB-like

Vocabulary: Biological Process
Term        Definition
GO:0006281  DNA repair
GO:0055114  oxidation-reduction process

Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006281      DNA repair
biological_process  GO:0055114      oxidation-reduction process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016706      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
molecular_function  GO:0016491      oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.130370.1  Cucsa.130370.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term         Source Description                                           Alignment
IPR004574  Alkylated DNA repair protein AlkB                    PANTHER  PTHR16557           ALKYLATED DNA REPAIR PROTEIN ALKB-RELATED                    coord: 139..392; score: 1.5E
IPR005123  Oxoglutarate/iron-dependent dioxygenase              PROFILE  PS51471             FE2OG_OXY                                                    coord: 282..392; score: 8
IPR027450  Alpha-ketoglutarate-dependent dioxygenase AlkB-like  GENE3D   G3DSA:2.60.120.590                                                               coord: 171..391; score: 2.8
IPR027450  Alpha-ketoglutarate-dependent dioxygenase AlkB-like  PFAM     PF13532             2OG-FeII_Oxy_2                                               coord: 173..390; score: 8.6
None       No IPR available                                     PANTHER  PTHR16557:SF4       2-OXOGLUTARATE-DEPENDENT DIOXYGENASE FAMILY PROTEIN-RELATED  coord: 139..392; score: 1.5E
None       No IPR available                                     unknown  SSF51197            Clavaminate synthase-like                                    coord: 170..391; score: 7.69
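
The "coord" ranges above can be compared directly to see how the individual signatures overlap. The Python below is a rough sketch that computes each span's length and the region shared by all of them. It assumes the coordinates are 1-based, inclusive positions on the protein, which this page does not state. The helper name `parse_coord` is hypothetical.

```python
def parse_coord(text):
    """Split a 'start..end' string into a pair of integers."""
    start, end = text.split("..")
    return int(start), int(end)


# Coordinates copied from the InterPro annotation rows above.
coords = {
    "PTHR16557": "139..392",
    "PS51471": "282..392",
    "G3DSA:2.60.120.590": "171..391",
    "PF13532": "173..390",
    "SSF51197": "170..391",
}

spans = {name: parse_coord(text) for name, text in coords.items()}

# Assumption: inclusive coordinates, so length is end - start + 1.
lengths = {name: end - start + 1 for name, (start, end) in spans.items()}

# Region covered by every signature: latest start to earliest end.
shared = (max(s for s, _ in spans.values()), min(e for _, e in spans.values()))

for name, length in lengths.items():
    print(f"{name}: {length} positions")
print(f"shared region: {shared[0]}..{shared[1]}")
```

Under the inclusive assumption, the shared region is bounded by the PROFILE start (282) and the PFAM end (390).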

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                      Block
Cucsa.130370  Cucumber (Gy14) v1            cgycgyB078
Cucsa.130370  Cucurbita maxima (Rimu)       cgycmaB0356
Cucsa.130370  Cucurbita maxima (Rimu)       cgycmaB0353
Cucsa.130370  Cucurbita maxima (Rimu)       cgycmaB0354
Cucsa.130370  Cucurbita maxima (Rimu)       cgycmaB0355
Cucsa.130370  Cucurbita moschata (Rifu)     cgycmoB0347
Cucsa.130370  Cucurbita moschata (Rifu)     cgycmoB0348
Cucsa.130370  Cucurbita moschata (Rifu)     cgycmoB0349
Cucsa.130370  Cucurbita moschata (Rifu)     cgycmoB0350
Cucsa.130370  Cucurbita moschata (Rifu)     cgycmoB0351
Cucsa.130370  Wild cucumber (PI 183967)     cgycpiB182
Cucsa.130370  Wild cucumber (PI 183967)     cgycpiB184
Cucsa.130370  Cucumber (Chinese Long) v2    cgycuB179
Cucsa.130370  Cucumber (Chinese Long) v2    cgycuB181
Cucsa.130370  Melon (DHL92) v3.5.1          cgymeB209
Cucsa.130370  Watermelon (Charleston Gray)  cgywcgB212
Cucsa.130370  Watermelon (Charleston Gray)  cgywcgB213
Cucsa.130370  Watermelon (97103) v1         cgywmB217
Cucsa.130370  Cucurbita pepo (Zucchini)     cgycpeB0343
Cucsa.130370  Cucurbita pepo (Zucchini)     cgycpeB0344
Cucsa.130370  Cucurbita pepo (Zucchini)     cgycpeB0347
Cucsa.130370  Cucurbita pepo (Zucchini)     cgycpeB0348
Cucsa.130370  Bottle gourd (USVL1VR-Ls)     cgylsiB203
Cucsa.130370  Bottle gourd (USVL1VR-Ls)     cgylsiB204
Cucsa.130370  Melon (DHL92) v3.6.1          cgymedB206
Cucsa.130370  Silver-seed gourd             carcgyB0018
Cucsa.130370  Silver-seed gourd             carcgyB0190
Cucsa.130370  Silver-seed gourd             carcgyB0306
Cucsa.130370  Silver-seed gourd             carcgyB0702
Cucsa.130370  Silver-seed gourd             carcgyB1114
Cucsa.130370  Cucumber (Chinese Long) v3    cgycucB184
Cucsa.130370  Cucumber (Chinese Long) v3    cgycucB187
Cucsa.130370  Watermelon (97103) v2         cgywmbB209
Cucsa.130370  Watermelon (97103) v2         cgywmbB210
Cucsa.130370  Wax gourd                     cgywgoB246
Cucsa.130370  Wax gourd                     cgywgoB247