CsaV3_4G026990 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_4G026990
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: Fe2OG dioxygenase domain-containing protein
Location: chr4: 16084244 .. 16087261 (-)
RNA-Seq Expression: CsaV3_4G026990
Synteny: CsaV3_4G026990
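
The location string above can be split into chromosome, coordinates, and strand programmatically. The sketch below is illustrative only: the `Location` type and `parse_location` helper are hypothetical names, not part of any database API, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Genomic interval as shown in the Overview (hypothetical helper type)."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "chr4: 16084244 .. 16087261 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Parse a 'chrom: start .. end (strand)' string into a Location."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("chr4: 16084244 .. 16087261 (-)")
print(loc.chrom, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3018 bp on the minus strand of chr4.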
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAGGGTTTAAAACGATAAACTTGTTCAAAATTTACAAGTTTAATTGAGTCAAATTTACAAGTTTAATTGAGTCAAATAAATTATTAGATACATTCCAAGTTAAAGTTAACCTTTTTTCTAACACATTCAATTTCAAAGTTGTCCAAAAATAAAATAAAAATAAAAATTGCAATACATTTTAGTTTTATAATTTCAATGAATATGACCATCAATTTAGCCTCTTTATTTCAGCACAAGAAATGATCACACACACAGCTTTGAAAATGGCCGAAATCCTGACTCCTTCCTCTTCCTCTTCCTCTTCCCGGTGCAGTCTCCTCGGCCAACGCGGCGGCCACGACCGAAACTTGCGGATTAATTCTACAATATCCTCATTATGTCATTCCCATCCTCCTCCGACCCTTCCATTTCCTTCTTCTTTTCCGATTTTATCATCATTCATCCTCAATTTCTGAACCCTATAAAGTATCTCTCTAATATCCAATTTTCCCAATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCTGCTTCTTCATTTCCCTGCCTGCGCGGCTTTCGTTTGCTTCAATTTCAACCAATGGATTCGTTTTCCACTTCAGCAAATAGCCATGTGAGGAATACTGAGATTATTATTCACTGTTAATTACTCATTGCTCTTATACTCAATCTTACTACTGTATTTCTATTAATTTGATTTTGATTTCAAGTCTATTTCGTTATTCTTCAGGCATTACCTGACTCTTCATGTTGTGGTAGTTCTTGTGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGGGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGGTATTCTCTTGCTTCAATTGTCAACACTTATATGAACTAGAAATGGGAATTGGCATTTTGTTCTCTAGAATACCCATCTTGGGATGATGGACATTAACCTACTTCACTTTTGCTAATTATAAAATTTCCAATGTTGCCTATTGTTAAGAGGTGCCCTGTAGTTTCATCTCTTTGCTGATATTACTCCTTGTTGCCTTACATTGCTTAAATGACGGGATGGTGTTTTTATATCTTATATTTTTCCGTATGGAAGTTAGGGTTTAGTAATTGAGGATTCATTATTCTTGTTGCAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGTTAGTGCTTGTCCTTTGTATCACAATTATATTTAAAT
GAATCTGAATGCCGTAAGGTGGTTTTTTCTTCAAGTGTAATGCTAATTCCAACTCATGTTCTTTGGCAATTAGCCATTTTTATTCACACGATAAATAAATGCTTACATTATAAGTTTAGTTTCTCAACAATGAAGTTTGTGTTAATTTTGTTCATAAACTTTCAAGAATGTCTCTTAACTTTCAATCATGTGCATCACAGTTTCCTAGACTTTTAAGTTAGTTTTCTGACTATTAGATAGAAATTTGAACCTTATGTCTAACATCGTCACGTGGTGTGTCAAGCAGATTCACAGAATATAAAAAATCTGAAAGTTCATAGACTGAATAGATTTACCAAAGCAACATAGCCTATGAATGTGTTAACGTCCATAAGCTCCTTGGTTGGAATCTCCCACCCCATTGTACTAAAAATATATCAAATAAACCTAAATCTTGAAGTTTAGGGACTAAATTTGTAATTTAACCTCCCAAACTTTTTCTAACAAGTTCAATTTGTTGTGCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGAGGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATGCAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCAAAATCGACACCCAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAAAACACTACCTCCATGTTTATGCTATACATCTGAATCGGTGTTATTCATTTGATGTTCATTTATGGAATCGTGTAAATCTATAATGTAAGTATTGTCTGTTTCTGTTTCATTTACTTTGGATGTTACTTTGTTCAGTACTTTTCAGTTGCCGTACGTGAATGATACAAAGTTTTAATTTCAAGTTTGTATACTTTATTTTATCTCAAATTCTCAGTACTATATTTGACATGCTCAATAT

mRNA sequence

ATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCTGCTTCTTCATTTCCCTGCCTGCGCGGCTTTCGTTTGCTTCAATTTCAACCAATGGATTCGTTTTCCACTTCAGCAAATAGCCATGCATTACCTGACTCTTCATGTTGTGGTAGTTCTTGTGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGGGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGAGGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATGCAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCAAAATCGACACCCAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAA

Coding sequence (CDS)

ATGTTTTTCATCCGTACACTTCCCCTTCCCCCATCGCCCTCCTCCAATCAACTTCGTCGCCTTCTATTCCCTGCTTCTTCATTTCCCTGCCTGCGCGGCTTTCGTTTGCTTCAATTTCAACCAATGGATTCGTTTTCCACTTCAGCAAATAGCCATGCATTACCTGACTCTTCATGTTGTGGTAGTTCTTGTGGTTGTGGGAGAGACAAGGAACATTTGCATGACAGAGATAATAGTTCAGATGTCATACATGTGGGAAGCATTCCTGTGCATCTAAATCCCAAGGAACGTGAACCCAAATCTTATAATTATGATGAGTCTCTACCTGTTCATAGACAAAATACTAGAAGAAGCCGGATAGATTTAGGGTCCAAAAGAGATTTGAAGAGTAATGCAAGATCATATCAAGTAGAGAGGCTTGAATTTTTGAACGATTCTTGTCAGGAGTATAAATCATCTCTTCCTATTCATTTTGGGAAGAAAAATGAAGTGTTTGTCTCAAAGCTCCAGTCCCTTGATACCGGTCCCAAAGAATCTGTAGTTACGGACAATTCACTTCCCTTTGAACCACCATTTGATATTTGTTTACCTGGGGGAGGTAATGTGAAACATAGAAATATTTATGTTGTTAAAGAGGGTGGCACTGTGAAAGATTATAGACTGTTGAGGCCTGGAATGGTTTTACTGAAGCACTACATCACTCCACGTGAACAGATCAATATAGTGAAAACTTGTCAAAATCTTGGTATTGGCCCAGGGGGATTTTACCAGCCTGGTTATAAAGATGGAGCAAAACTTAGGCTTCGTATGATGTGTCTTGGATTGGACTGGGATCCTCAAACAAGAAGGTATGAAAACAAACGGGTTGTGGATGGTAATAAACCACCAGATATACCTCCTCAATTTACATTTCTTGTTAAACGTGCACTTAAAGATGCACATGCCTTCATCAAGAACAACTGCAATATAAGTAATGTAGAAGAAATTCTTCCGTCAATGTCTCCAGACATATGCATTGCGAACTTCTACACAACGAGGGGAAGATTGGGTCTGCATCAGGACCGTGATGAAAGCAAAGAGAGTCTTTGGAGGGGACTACCGGTTGTTTCCTTTTCTGTAGGCAATGCAGCAGAATTCTTGTATGGAGATAAAAGAAATGTGGATAAAGCAGAGATGGTTGAACTGGAATCAGGTGATGTTCTAATTTTTGGTGGCGAATCTAGACATATATTCCATGGAGTATCTTCAATCATACCAAAATCGACACCCAAGTTTTTGCTTCATCATACTGGTCTGCGTCCCGGCCGTCTTAATCTTACCTTTAGAAAGTATTAA

Protein sequence

MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY*
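
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last three codons of the CDS listed above.
cds_start = "ATGTTTTTCATCCGTACA"
cds_end = "AAGTATTAA"
print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS string to `translate` should reproduce the protein sequence, including the trailing `*` for the stop codon, provided the standard-code assumption holds.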
Homology
BLAST of CsaV3_4G026990 vs. NCBI nr
Match: XP_004149927.1 (uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus] >KGN54433.1 hypothetical protein Csa_012762 [Cucumis sativus])

HSP 1 Score: 925.6 bits (2391), Expect = 1.6e-265
Identity = 444/444 (100.00%), Positives = 444/444 (100.00%), Query Frame = 0

Query: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60
           MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60

Query: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI 120
           GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI
Sbjct: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI 120

Query: 121 DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV 180
           DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV
Sbjct: 121 DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV 180

Query: 181 VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN 240
           VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN
Sbjct: 181 VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN 240

Query: 241 IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP 300
           IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP
Sbjct: 241 IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP 300

Query: 301 QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKE 360
           QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKE
Sbjct: 301 QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKE 360

Query: 361 SLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPK 420
           SLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPK
Sbjct: 361 SLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPK 420

Query: 421 STPKFLLHHTGLRPGRLNLTFRKY 445
           STPKFLLHHTGLRPGRLNLTFRKY
Sbjct: 421 STPKFLLHHTGLRPGRLNLTFRKY 444
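
The percentages in each HSP summary line are the identical and positive match counts divided by the alignment length. The sketch below recomputes them from a summary line; `parse_hsp_stats` is a hypothetical helper written for this page's line layout, not part of BLAST, and it accepts the label spelled either "Positives" or "Postives".

```python
import re

# Matches e.g. "Identity = 391/472 (82.84%), Postives = 409/472 (86.65%)".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract match counts and recompute percentages from an HSP line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, positives, length_pos = map(int, match.groups())
    if length != length_pos:
        raise ValueError("identity and positives report different lengths")
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 391/472 (82.84%), Postives = 409/472 (86.65%), Query Frame = 0"
)
print(stats)
```

Note that the alignment length (472 in this example) includes gap columns, so it can exceed the length of the 444-residue query.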

BLAST of CsaV3_4G026990 vs. NCBI nr
Match: XP_016903166.1 (PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo] >XP_016903167.1 PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo])

HSP 1 Score: 793.1 bits (2047), Expect = 1.2e-225
Identity = 391/472 (82.84%), Positives = 409/472 (86.65%), Query Frame = 0

Query: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60
           MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC 
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCR 60

Query: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKER----------------------- 120
           G+SCGCGRDKEHL DRDN SDVI +GS  VHLNPKER                       
Sbjct: 61  GNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKF 120

Query: 121 -----EPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKS 180
                EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF ND CQEY+S
Sbjct: 121 GISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYES 180

Query: 181 SLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKE 240
           SLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+
Sbjct: 181 SLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKD 240

Query: 241 GGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMC 300
            GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGAKLRLRMMC
Sbjct: 241 SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC 300

Query: 301 LGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPS 360
           LGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPS
Sbjct: 301 LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPS 360

Query: 361 MSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEM 420
           MSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFSVGN AEFLYGDKR+V+KAE 
Sbjct: 361 MSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEK 420

Query: 421 VELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 445
           VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Sbjct: 421 VELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY 472

BLAST of CsaV3_4G026990 vs. NCBI nr
Match: KAA0050407.1 (2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 713.8 bits (1841), Expect = 9.5e-202
Identity = 350/422 (82.94%), Positives = 364/422 (86.26%), Query Frame = 0

Query: 51  SHALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKER------------- 110
           S A PDSSC G+SCGCGRDKEHL DRDN SDVI VGS  VHLNPKER             
Sbjct: 213 STAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKC 272

Query: 111 ---------------EPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEF 170
                          EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF
Sbjct: 273 DYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEF 332

Query: 171 LNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNV 230
           LND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNV
Sbjct: 333 LNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNV 392

Query: 231 KHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKD 290
           KHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKD
Sbjct: 393 KHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKD 452

Query: 291 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN 350
           GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Sbjct: 453 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN 512

Query: 351 ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYG 410
           ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFSVGN AEFLYG
Sbjct: 513 ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYG 572

Query: 411 DKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFR 445
           DKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFR
Sbjct: 573 DKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFR 632

BLAST of CsaV3_4G026990 vs. NCBI nr
Match: TYJ97997.1 (2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var. makuwa])

HSP 1 Score: 711.1 bits (1834), Expect = 6.1e-201
Identity = 349/422 (82.70%), Positives = 363/422 (86.02%), Query Frame = 0

Query: 51  SHALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKER------------- 110
           S A PDSSC G+SCGCGRDKEHL DRDN SDVI VGS  VHLNPKER             
Sbjct: 213 STAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKC 272

Query: 111 ---------------EPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEF 170
                          EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF
Sbjct: 273 DYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEF 332

Query: 171 LNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNV 230
           LND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+S PFEPPFDIC PGGGNV
Sbjct: 333 LNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSPPFEPPFDICFPGGGNV 392

Query: 231 KHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKD 290
           KHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKD
Sbjct: 393 KHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKD 452

Query: 291 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN 350
           GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Sbjct: 453 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN 512

Query: 351 ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYG 410
           ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFSVGN AEFLYG
Sbjct: 513 ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYG 572

Query: 411 DKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFR 445
           DKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFR
Sbjct: 573 DKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFR 632

BLAST of CsaV3_4G026990 vs. NCBI nr
Match: XP_031739557.1 (uncharacterized protein LOC101210053 isoform X2 [Cucumis sativus])

HSP 1 Score: 591.3 bits (1523), Expect = 7.1e-165
Identity = 283/284 (99.65%), Positives = 284/284 (100.00%), Query Frame = 0

Query: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60
           MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60

Query: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI 120
           GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI
Sbjct: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI 120

Query: 121 DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV 180
           DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV
Sbjct: 121 DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV 180

Query: 181 VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN 240
           VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN
Sbjct: 181 VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN 240

Query: 241 IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRY 285
           IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRR+
Sbjct: 241 IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRH 284

BLAST of CsaV3_4G026990 vs. ExPASy Swiss-Prot
Match: P0CAT7 (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides (strain ATCC 19089 / CB15) OX=190650 GN=alkB PE=3 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 5.2e-17
Identity = 66/197 (33.50%), Positives = 86/197 (43.65%), Query Frame = 0

Query: 252 PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKR 311
           P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRPWPDMPP-------- 112

Query: 312 ALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPV 371
           AL D    + +           P   PD C+ N Y    R+GLHQDRDE+        PV
Sbjct: 113 ALLDLWTVLGD-----------PETPPDSCLVNLYRDGARMGLHQDRDEADPR----FPV 172

Query: 372 VSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLH 431
           +S S+G+ A F  G     D    + L SGDV    G +R  FHGV  I+P S       
Sbjct: 173 LSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGS------- 216

Query: 432 HTGLRP--GRLNLTFRK 444
            + L P  GR+NLT R+
Sbjct: 233 -SSLVPGGGRINLTLRR 216

BLAST of CsaV3_4G026990 vs. ExPASy Swiss-Prot
Match: B8GWW6 (Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides (strain NA1000 / CB15N) OX=565050 GN=alkB PE=3 SV=2)

HSP 1 Score: 90.5 bits (223), Expect = 5.2e-17
Identity = 66/197 (33.50%), Positives = 86/197 (43.65%), Query Frame = 0

Query: 252 PGGFYQPGYKDGAKLRLRMMCLG-LDWDPQTR--RYENKRVVDGNKPPDIPPQFTFLVKR 311
           P   Y+  Y  G  + + M  LG L W    R  RY ++    G   PD+PP        
Sbjct: 53  PFSNYRTAY--GKPMSVAMTALGSLGWTSDARGYRYVDRHPETGRPWPDMPP-------- 112

Query: 312 ALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPV 371
           AL D    + +           P   PD C+ N Y    R+GLHQDRDE+        PV
Sbjct: 113 ALLDLWTVLGD-----------PETPPDSCLVNLYRDGARMGLHQDRDEADPR----FPV 172

Query: 372 VSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLH 431
           +S S+G+ A F  G     D    + L SGDV    G +R  FHGV  I+P S       
Sbjct: 173 LSISLGDTAVFRIGGVNRKDPTRSLRLASGDVCRLLGPARLAFHGVDRILPGS------- 216

Query: 432 HTGLRP--GRLNLTFRK 444
            + L P  GR+NLT R+
Sbjct: 233 -SSLVPGGGRINLTLRR 216

BLAST of CsaV3_4G026990 vs. ExPASy Swiss-Prot
Match: O60066 (Alpha-ketoglutarate-dependent dioxygenase abh1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=abh1 PE=2 SV=3)

HSP 1 Score: 88.2 bits (217), Expect = 2.6e-16
Identity = 65/240 (27.08%), Positives = 102/240 (42.50%), Query Frame = 0

Query: 224 PGMVLLKHYITPREQINIVKTCQ-----------------NLGIGPGGFYQPGYK-DGAK 283
           PG+++LK+Y++   Q+ ++K+                    L +G    ++  Y  DG  
Sbjct: 60  PGLLILKNYVSSELQMQLLKSIMFTQIQDPENKTNLSPFYQLPLGNDSIWRRYYNGDGES 119

Query: 284 L------------------RLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVK 343
           +                  +LR + LG  +D  T+ Y      D +K P  P      V+
Sbjct: 120 IIDGLGETKPLTVDRLVHKKLRWVTLGEQYDWTTKEYP-----DPSKSPGFPKDLGDFVE 179

Query: 344 RALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLP 403
           + +K++  F+                  +  I NFY+    L  H   DES+E L   LP
Sbjct: 180 KVVKESTDFL--------------HWKAEAAIVNFYSPGDTLSAH--IDESEEDL--TLP 239

Query: 404 VVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL 428
           ++S S+G    +L G +   +K   + L SGDV+I  G SR  FH V  IIP STP +LL
Sbjct: 240 LISLSMGLDCIYLIGTESRSEKPSALRLHSGDVVIMTGTSRKAFHAVPKIIPNSTPNYLL 276

BLAST of CsaV3_4G026990 vs. ExPASy Swiss-Prot
Match: P05050 (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) OX=83333 GN=alkB PE=1 SV=1)

HSP 1 Score: 77.8 bits (190), Expect = 3.5e-13
Identity = 55/167 (32.93%), Positives = 74/167 (44.31%), Query Frame = 0

Query: 278 DPQTRRYENKRVVDGNKP-PDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPD 337
           DPQT           NKP P +P  F  L +RA   A                 P   PD
Sbjct: 82  DPQT-----------NKPWPAMPQSFHNLCQRAATAAG---------------YPDFQPD 141

Query: 338 ICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELE 397
            C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  +  D  + + LE
Sbjct: 142 ACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLE 201

Query: 398 SGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 444
            GDV+++GGESR  +HG+  +     P  +         R NLTFR+
Sbjct: 202 HGDVVVWGGESRLFYHGIQPLKAGFHPLTI-------DCRYNLTFRQ 211

BLAST of CsaV3_4G026990 vs. ExPASy Swiss-Prot
Match: P37462 (Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) OX=99287 GN=alkB PE=3 SV=2)

HSP 1 Score: 76.6 bits (187), Expect = 7.7e-13
Identity = 42/112 (37.50%), Positives = 57/112 (50.89%), Query Frame = 0

Query: 332 SMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAE 391
           S  PD C+ N Y    +L LHQD+DE         P+VS S+G  A F +G  R  D  +
Sbjct: 111 SFQPDACLINRYAPGAKLSLHQDKDEPD----LRAPIVSVSLGVPAVFQFGGLRRSDPIQ 170

Query: 392 MVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRK 444
            + LE GD++++GGESR  +HG+        P     H      R NLTFR+
Sbjct: 171 RILLEHGDIVVWGGESRLFYHGIQ-------PLKAGFHPMTGEFRYNLTFRQ 211

BLAST of CsaV3_4G026990 vs. ExPASy TrEMBL
Match: A0A0A0KY56 (Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G329550 PE=4 SV=1)

HSP 1 Score: 925.6 bits (2391), Expect = 7.7e-266
Identity = 444/444 (100.00%), Positives = 444/444 (100.00%), Query Frame = 0

Query: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60
           MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60

Query: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI 120
           GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI
Sbjct: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKEREPKSYNYDESLPVHRQNTRRSRI 120

Query: 121 DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV 180
           DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV
Sbjct: 121 DLGSKRDLKSNARSYQVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESV 180

Query: 181 VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN 240
           VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN
Sbjct: 181 VTDNSLPFEPPFDICLPGGGNVKHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQIN 240

Query: 241 IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP 300
           IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP
Sbjct: 241 IVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP 300

Query: 301 QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKE 360
           QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKE
Sbjct: 301 QFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKE 360

Query: 361 SLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPK 420
           SLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPK
Sbjct: 361 SLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPK 420

Query: 421 STPKFLLHHTGLRPGRLNLTFRKY 445
           STPKFLLHHTGLRPGRLNLTFRKY
Sbjct: 421 STPKFLLHHTGLRPGRLNLTFRKY 444

BLAST of CsaV3_4G026990 vs. ExPASy TrEMBL
Match: A0A1S4E4K6 (uncharacterized protein LOC103502183 OS=Cucumis melo OX=3656 GN=LOC103502183 PE=4 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 6.0e-226
Identity = 391/472 (82.84%), Positives = 409/472 (86.65%), Query Frame = 0

Query: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60
           MFFIRTLPLPPSPSSNQLRRLLFPASSFP  RGF LLQFQ MDSFS+SANSHA PDSSC 
Sbjct: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPGARGFSLLQFQRMDSFSSSANSHAPPDSSCR 60

Query: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKER----------------------- 120
           G+SCGCGRDKEHL DRDN SDVI +GS  VHLNPKER                       
Sbjct: 61  GNSCGCGRDKEHLRDRDNCSDVIFLGSFRVHLNPKEREPKSLTPLSAKKCDYVEVGSDKF 120

Query: 121 -----EPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYKS 180
                EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF ND CQEY+S
Sbjct: 121 GISSNEPKSYHYDEFLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEFFNDYCQEYES 180

Query: 181 SLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNVKHRNIYVVKE 240
           SLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNVKHRN + VK+
Sbjct: 181 SLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNVKHRNFWRVKD 240

Query: 241 GGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMC 300
            GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQP YKDGAKLRLRMMC
Sbjct: 241 SGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPSYKDGAKLRLRMMC 300

Query: 301 LGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPS 360
           LGLDWDPQTRRY+NKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CNISNVE+ILPS
Sbjct: 301 LGLDWDPQTRRYKNKRVVDGNKPPDIPPPFSFLVKSALKDAHAFIKNKCNISNVEDILPS 360

Query: 361 MSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEM 420
           MSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFSVGN AEFLYGDKR+V+KAE 
Sbjct: 361 MSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYGDKRDVEKAEK 420

Query: 421 VELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 445
           VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFRKY
Sbjct: 421 VELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFRKY 472

BLAST of CsaV3_4G026990 vs. ExPASy TrEMBL
Match: A0A5A7U7Q2 (2-oxoglutarate-dependent dioxygenase family protein isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1166G00230 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 4.6e-202
Identity = 350/422 (82.94%), Positives = 364/422 (86.26%), Query Frame = 0

Query: 51  SHALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKER------------- 110
           S A PDSSC G+SCGCGRDKEHL DRDN SDVI VGS  VHLNPKER             
Sbjct: 213 STAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKC 272

Query: 111 ---------------EPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEF 170
                          EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF
Sbjct: 273 DYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEF 332

Query: 171 LNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNV 230
           LND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+SLPFEPPFDIC PGGGNV
Sbjct: 333 LNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSLPFEPPFDICFPGGGNV 392

Query: 231 KHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKD 290
           KHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKD
Sbjct: 393 KHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKD 452

Query: 291 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN 350
           GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Sbjct: 453 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN 512

Query: 351 ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYG 410
           ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFSVGN AEFLYG
Sbjct: 513 ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYG 572

Query: 411 DKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFR 445
           DKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFR
Sbjct: 573 DKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFR 632

BLAST of CsaV3_4G026990 vs. ExPASy TrEMBL
Match: A0A5D3BFV0 (2-oxoglutarate-dependent dioxygenase family protein isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G00200 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 3.0e-201
Identity = 349/422 (82.70%), Positives = 363/422 (86.02%), Query Frame = 0

Query: 51  SHALPDSSCCGSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKER------------- 110
           S A PDSSC G+SCGCGRDKEHL DRDN SDVI VGS  VHLNPKER             
Sbjct: 213 STAPPDSSCRGNSCGCGRDKEHLRDRDNCSDVIFVGSFRVHLNPKEREPKSLTPLSVKKC 272

Query: 111 ---------------EPKSYNYDESLPVHRQNTRRSRIDLGSKRDLKSNARSYQVERLEF 170
                          EPKSY+YDE LPV RQNTRR+RIDLGSKRDLKSNARS+QVER EF
Sbjct: 273 DYVEVGSDKFGISSNEPKSYHYDECLPVSRQNTRRNRIDLGSKRDLKSNARSFQVERHEF 332

Query: 171 LNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLPGGGNV 230
           LND CQEY+SSLPIHFGKKNEVF SK QSLD G KESVVTD+S PFEPPFDIC PGGGNV
Sbjct: 333 LNDYCQEYESSLPIHFGKKNEVFFSKRQSLDIGSKESVVTDHSPPFEPPFDICFPGGGNV 392

Query: 231 KHRNIYVVKEGGTVKDYRLLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKD 290
           KHRN + VK+ GTVKDYRLLRPGMVLLKHYITP EQINIVKTCQ LG+GPGGFYQPGYKD
Sbjct: 393 KHRNFWRVKDSGTVKDYRLLRPGMVLLKHYITPPEQINIVKTCQKLGLGPGGFYQPGYKD 452

Query: 291 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCN 350
           GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPP F+FLVK ALKDAHAFIKN CN
Sbjct: 453 GAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPPFSFLVKNALKDAHAFIKNKCN 512

Query: 351 ISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYG 410
           ISNVE+ILPSMSPDICIANFYTT GRLGLHQDRDESKESL  GLPVVSFSVGN AEFLYG
Sbjct: 513 ISNVEDILPSMSPDICIANFYTTSGRLGLHQDRDESKESLSSGLPVVSFSVGNTAEFLYG 572

Query: 411 DKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTFR 445
           DKR+VDKAE VELESGDVLIFGGESRH+FHGVSSIIPKSTPKFLL+HTGLRPGRLNLTFR
Sbjct: 573 DKRDVDKAEKVELESGDVLIFGGESRHVFHGVSSIIPKSTPKFLLYHTGLRPGRLNLTFR 632

BLAST of CsaV3_4G026990 vs. ExPASy TrEMBL
Match: A0A6J1EDT3 (uncharacterized protein LOC111432318 OS=Cucurbita moschata OX=3662 GN=LOC111432318 PE=4 SV=1)

HSP 1 Score: 502.3 bits (1292), Expect = 2.1e-138
Identity = 284/483 (58.80%), Positives = 326/483 (67.49%), Query Frame = 0

Query: 1   MFFIRTLPLPPSPSSNQLRRLLFPASSFPCLRGFRLLQFQPMDSFSTSANSHALPDSSCC 60
           M  IRT+P    P SN LRRLLF  S        RLLQFQ +DSF +S    ALPDSSC 
Sbjct: 1   MLLIRTVPASLPPWSNLLRRLLFAES--------RLLQFQRVDSFGSS----ALPDSSCY 60

Query: 61  GSSCGCGRDKEHLHDRDNSSDVIHVGSIPVHLNPKERE---------------------- 120
           GSS  CG ++E LH+RD++S+VI +G IPV+LN K  E                      
Sbjct: 61  GSS--CGGNEECLHNRDHNSNVIMIGEIPVNLNRKGNEQESLSRLSVGKCDDFKLRSDQK 120

Query: 121 ------PKSYNYDESLPVHRQNT-RRSRIDLGSKRDLKSNARSYQVERLEFLNDSCQEYK 180
                 P SY+ DE  PV RQNT RRSRIDLGS+R LK++  S Q+ER E          
Sbjct: 121 GIPANIPSSYHDDEFPPVPRQNTKRRSRIDLGSERRLKNSTSSSQMERNE---------- 180

Query: 181 SSLPIHFGKKNEVFVSKLQSLDTGPKESVVTDNSLPFEPPFDICLP-GGGNVKHRNIYVV 240
              P  F         K +S D G K S+ T N  P E  FDIC P   G  K R  +  
Sbjct: 181 ---PFSF--------KKHRSPDIGSKNSLATANLPPIE-SFDICFPERRGKSKPRYSWQS 240

Query: 241 KEGGTVKDYR---------LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYK 300
           K+  T+K            ++RPGMVLLKHYI   EQ+NIVKT Q LG+GPGGFYQPGYK
Sbjct: 241 KDRDTMKVMEHADEATNGIVMRPGMVLLKHYIPLHEQVNIVKTIQKLGLGPGGFYQPGYK 300

Query: 301 DGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNC 360
           DGAKLRL+MMCLGLDWDPQTR+Y  KRV DGNKPPD+PP+F  LV +AL DAHA IKNN 
Sbjct: 301 DGAKLRLQMMCLGLDWDPQTRKYARKRVADGNKPPDLPPEFAILVGKALNDAHALIKNNG 360

Query: 361 NISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLY 420
           + +N+E+ILP+MSPDICI NFY+T GRLGLHQDRDES+ESL  GLPVVSFS+GN+AEFLY
Sbjct: 361 DTNNIEDILPTMSPDICIVNFYSTSGRLGLHQDRDESRESLVGGLPVVSFSLGNSAEFLY 420

Query: 421 GDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLHHTGLRPGRLNLTF 445
           GD+R+VDKA  + LESGDVLIFGGESRHIFHGVSSIIPKSTPKFLL HTGLRPGRLNLTF
Sbjct: 421 GDQRDVDKAGKIILESGDVLIFGGESRHIFHGVSSIIPKSTPKFLLDHTGLRPGRLNLTF 447

BLAST of CsaV3_4G026990 vs. TAIR 10
Match: AT5G01780.1 (2-oxoglutarate-dependent dioxygenase family protein )

HSP 1 Score: 303.9 bits (777), Expect = 2.1e-82
Identity = 150/266 (56.39%), Positives = 193/266 (72.56%), Query Frame = 0

Query: 190 PPFDICLPGGGNVKHRNIYVVKE--------GGTVK---DYRLLRPGMVLLKHYITPREQ 249
           PPFDIC     +V  RN   +K+          TV+    ++++RPGMVLLK ++TP  Q
Sbjct: 128 PPFDIC----SSVLERNDTSIKDWILADETNRETVEVSNKHKVIRPGMVLLKDFLTPDIQ 187

Query: 250 INIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDI 309
           ++IVKTC+ LG+ P GFYQPGY  G+KL L+MMCLG +WDPQT+  +N  +   +K P+I
Sbjct: 188 VDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMCLGRNWDPQTKYRKNTDI--DSKAPEI 247

Query: 310 PPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDES 369
           P  F  LV++A+++AHA I       + E ILP MSPDICI NFY+  GRLGLHQDRDES
Sbjct: 248 PVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVNFYSETGRLGLHQDRDES 307

Query: 370 KESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSII 429
           +ES+ RGLP+VSFS+G++AEFLYG+KR+V++A+ V LESGDVLIFGGESR IFHGV SII
Sbjct: 308 EESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVLIFGGESRMIFHGVKSII 367

Query: 430 PKSTPKFLLHHTGLRPGRLNLTFRKY 445
           P S P  LL+ + LR GRLNLTFR +
Sbjct: 368 PNSAPMSLLNESKLRTGRLNLTFRHF 387

BLAST of CsaV3_4G026990 vs. TAIR 10
Match: AT5G01780.2 (2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 303.9 bits (777), Expect = 2.1e-82
Identity = 150/266 (56.39%), Positives = 193/266 (72.56%), Query Frame = 0

Query: 190 PPFDICLPGGGNVKHRNIYVVKE--------GGTVK---DYRLLRPGMVLLKHYITPREQ 249
           PPFDIC     +V  RN   +K+          TV+    ++++RPGMVLLK ++TP  Q
Sbjct: 183 PPFDIC----SSVLERNDTSIKDWILADETNRETVEVSNKHKVIRPGMVLLKDFLTPDIQ 242

Query: 250 INIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNKPPDI 309
           ++IVKTC+ LG+ P GFYQPGY  G+KL L+MMCLG +WDPQT+  +N  +   +K P+I
Sbjct: 243 VDIVKTCRELGVKPTGFYQPGYSVGSKLHLQMMCLGRNWDPQTKYRKNTDI--DSKAPEI 302

Query: 310 PPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQDRDES 369
           P  F  LV++A+++AHA I       + E ILP MSPDICI NFY+  GRLGLHQDRDES
Sbjct: 303 PVTFNVLVEKAIREAHALIDRESGTEDAERILPVMSPDICIVNFYSETGRLGLHQDRDES 362

Query: 370 KESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSII 429
           +ES+ RGLP+VSFS+G++AEFLYG+KR+V++A+ V LESGDVLIFGGESR IFHGV SII
Sbjct: 363 EESIARGLPIVSFSIGDSAEFLYGEKRDVEEAQGVILESGDVLIFGGESRMIFHGVKSII 422

Query: 430 PKSTPKFLLHHTGLRPGRLNLTFRKY 445
           P S P  LL+ + LR GRLNLTFR +
Sbjct: 423 PNSAPMSLLNESKLRTGRLNLTFRHF 442

BLAST of CsaV3_4G026990 vs. TAIR 10
Match: AT3G14160.1 (2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 298.9 bits (764), Expect = 6.8e-81
Identity = 156/330 (47.27%), Positives = 215/330 (65.15%), Query Frame = 0

Query: 122 LGSKRDLKSNARSY--QVERLEFLNDSCQEYKSSLPIHFGKKNEVFVSKLQSLDTGPKES 181
           + S+ + K  A+ Y   V R+  +  SCQE  SS  +      +V +S ++   + PK  
Sbjct: 131 VSSECEDKDGAKMYCDLVNRVNDVTLSCQESVSSTVV-----QKVELSSVEDQKSAPKAD 190

Query: 182 VVTDNSLPFEP-PFDICLPGGGNVKHRNIYVV--KEGGTVKDY--RLLRPGMVLLKHYIT 241
              ++S       FDI L   G V   N+ V+  ++    K Y   ++RPGMVLLK+Y++
Sbjct: 191 GAGNSSNESSTRHFDIFLEKKGIVLKPNLLVLSREKKKAAKGYSGTVIRPGMVLLKNYLS 250

Query: 242 PREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQTRRYENKRVVDGNK 301
             +Q+ IV  C+ LG+G GGFYQPGY+D AKL L+MMCLG +WDP+T RY   R  DG+ 
Sbjct: 251 INDQVMIVNKCRRLGLGEGGFYQPGYRDEAKLHLKMMCLGKNWDPETSRYGETRPFDGST 310

Query: 302 PPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIANFYTTRGRLGLHQD 361
            P IP +F   V++A+K++ +   +N   +   + +P M PDICI NFY++ GRLGLHQD
Sbjct: 311 APRIPAEFNQFVEKAVKESQSLAASNSKQTKGGDEIPFMLPDICIVNFYSSTGRLGLHQD 370

Query: 362 RDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGV 421
           +DES+ S+ +GLPVVSFS+G++AEFLYGD+R+ DKAE + LESGDVL+FGG SR +FHGV
Sbjct: 371 KDESENSIRKGLPVVSFSIGDSAEFLYGDQRDEDKAETLTLESGDVLLFGGRSRKVFHGV 430

Query: 422 SSIIPKSTPKFLLHHTGLRPGRLNLTFRKY 445
            SI   + PK LL  T LRPGRLNLTFR+Y
Sbjct: 431 RSIRKDTAPKALLQETSLRPGRLNLTFRQY 455

BLAST of CsaV3_4G026990 vs. TAIR 10
Match: AT3G14140.1 (2-oxoglutarate-dependent dioxygenase family protein)

HSP 1 Score: 241.1 bits (614), Expect = 1.7e-63
Identity = 111/218 (50.92%), Positives = 155/218 (71.10%), Query Frame = 0

Query: 221 LLRPGMVLLKHYITPREQINIVKTCQNLGIGPGGFYQPGYKDGAKLRLRMMCLGLDWDPQ 280
           ++RPGMVLLK+Y++   Q+ IV  C+ LG+G GGFYQPG++DG  L L+MMCLG +WD Q
Sbjct: 237 VIRPGMVLLKNYLSINNQVMIVNKCRQLGLGEGGFYQPGFQDGGLLHLKMMCLGKNWDCQ 296

Query: 281 TRRYENKRVVDGNKPPDIPPQFTFLVKRALKDAHAFIKNNCNISNVEEILPSMSPDICIA 340
           TRRY   R +DG+ PP IP +F+ LV++A+K++ + +  N N +   + +P + PDIC+ 
Sbjct: 297 TRRYGEIRPIDGSVPPRIPVEFSQLVEKAIKESKSLVATNSNETKGGDEIPLLLPDICVV 356

Query: 341 NFYTTRGRLGLHQ---------------------DRDESKESLWRGLPVVSFSVGNAAEF 400
           NFYT+ G+LGLHQ                     D+ ESK+SL +GLP+VSFS+G++AEF
Sbjct: 357 NFYTSTGKLGLHQVSVYDKTSFDFLKYKGGYLNTDKGESKKSLRKGLPIVSFSIGDSAEF 416

Query: 401 LYGDKRNVDKAEMVELESGDVLIFGGESRHIFHGVSSI 418
           LYGD+++VDKA+ + LESGDVLIFG  SR++FHGV SI
Sbjct: 417 LYGDQKDVDKADTLILESGDVLIFGERSRNVFHGVRSI 454

BLAST of CsaV3_4G026990 vs. TAIR 10
Match: AT1G11780.1 (oxidoreductase, 2OG-Fe(II) oxygenase family protein)

HSP 1 Score: 57.8 bits (138), Expect = 2.6e-08
Identity = 32/83 (38.55%), Positives = 43/83 (51.81%), Query Frame = 0

Query: 335 PDICIANFYTTRGRLGLHQDRDESKESLWRGLPVVSFSVGNAAEFLYGDKRNVDKAEMVE 394
           P+  I N++     LG H D     E+ W   P+VS S+G  A FL G K   D    + 
Sbjct: 226 PEGAIVNYFGIGDTLGGHLD---DMEADW-SKPIVSMSLGCKAIFLLGGKSKDDPPHAMY 285

Query: 395 LESGDVLIFGGESRHIFHGVSSI 418
           L SGDV++  GE+R  FHG+  I
Sbjct: 286 LRSGDVVLMAGEARECFHGIPRI 304
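
Each HSP summary line above reports identity and positives as a count over the alignment length, followed by a percentage. The following is a minimal Python sketch, not part of the original analysis, that parses such a line and recomputes the percentages as a sanity check. The function name `parse_hsp_stats` and the returned dictionary layout are hypothetical; the pattern accepts both "Positives" and the "Postives" spelling that some BLAST viewers emit.

```python
import re

# Matches the identity/positives part of an HSP summary line.
# "Posi?tives" tolerates the misspelling "Postives" seen in some viewers.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper for illustration: returns the raw counts, the
    percentages recomputed from those counts, and the percentages as
    printed in the report.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "positives": (int(pos), int(pos_len)),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_pct": (float(ident_pct), float(pos_pct)),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 150/266 (56.39%), Positives = 193/266 (72.56%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with `reported_pct` is a quick way to catch copy or extraction errors in the counts.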

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_004149927.1 | 1.6e-265 | 100.00 | uncharacterized protein LOC101210053 isoform X1 [Cucumis sativus] >KGN54433.1 hy...
XP_016903166.1 | 1.2e-225 | 82.84 | PREDICTED: uncharacterized protein LOC103502183 [Cucumis melo] >XP_016903167.1 P...
KAA0050407.1 | 9.5e-202 | 82.94 | 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var....
TYJ97997.1 | 6.1e-201 | 82.70 | 2-oxoglutarate-dependent dioxygenase family protein isoform 1 [Cucumis melo var....
XP_031739557.1 | 7.1e-165 | 99.65 | uncharacterized protein LOC101210053 isoform X2 [Cucumis sativus]
Match Name | E-value | Identity (%) | Description
P0CAT7 | 5.2e-17 | 33.50 | Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides...
B8GWW6 | 5.2e-17 | 33.50 | Alpha-ketoglutarate-dependent dioxygenase AlkB homolog OS=Caulobacter vibrioides...
O60066 | 2.6e-16 | 27.08 | Alpha-ketoglutarate-dependent dioxygenase abh1 OS=Schizosaccharomyces pombe (str...
P05050 | 3.5e-13 | 32.93 | Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Escherichia coli (strain K12) ...
P37462 | 7.7e-13 | 37.50 | Alpha-ketoglutarate-dependent dioxygenase AlkB OS=Salmonella typhimurium (strain...
Match Name | E-value | Identity (%) | Description
A0A0A0KY56 | 7.7e-266 | 100.00 | Fe2OG dioxygenase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G...
A0A1S4E4K6 | 6.0e-226 | 82.84 | uncharacterized protein LOC103502183 OS=Cucumis melo OX=3656 GN=LOC103502183 PE=...
A0A5A7U7Q2 | 4.6e-202 | 82.94 | 2-oxoglutarate-dependent dioxygenase family protein isoform 1 OS=Cucumis melo va...
A0A5D3BFV0 | 3.0e-201 | 82.70 | 2-oxoglutarate-dependent dioxygenase family protein isoform 1 OS=Cucumis melo va...
A0A6J1EDT3 | 2.1e-138 | 58.80 | uncharacterized protein LOC111432318 OS=Cucurbita moschata OX=3662 GN=LOC1114323...
Match Name | E-value | Identity (%) | Description
AT5G01780.1 | 2.1e-82 | 56.39 | 2-oxoglutarate-dependent dioxygenase family protein
AT5G01780.2 | 2.1e-82 | 56.39 | 2-oxoglutarate-dependent dioxygenase family protein
AT3G14160.1 | 6.8e-81 | 47.27 | 2-oxoglutarate-dependent dioxygenase family protein
AT3G14140.1 | 1.7e-63 | 50.92 | 2-oxoglutarate-dependent dioxygenase family protein
AT1G11780.1 | 2.6e-08 | 38.55 | oxidoreductase, 2OG-Fe(II) oxygenase family protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR037151 | Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily | GENE3D | 2.60.120.590 | - | coord: 219..444; e-value: 1.6E-50; score: 173.6
IPR027450 | Alpha-ketoglutarate-dependent dioxygenase AlkB-like | PFAM | PF13532 | 2OG-FeII_Oxy_2 | coord: 225..442; e-value: 3.0E-42; score: 145.0
IPR004574 | Alkylated DNA repair protein AlkB | PANTHER | PTHR16557 | ALKYLATED DNA REPAIR PROTEIN ALKB-RELATED | coord: 1..444
None | No IPR available | PANTHER | PTHR16557:SF9 | 2OG-FE(II) OXYGENASE FAMILY PROTEIN | coord: 1..444
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 222..443
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 334..444; score: 8.843623
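
The "coord:" ranges in the InterPro annotations can be compared directly to see where the domain-level predictions agree. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive on the protein sequence, which the page does not state, and the names `hits`, `span_length`, and `shared_region` are hypothetical.

```python
# Residue ranges copied from the "coord:" fields of the InterPro annotations.
# Assumption: coordinates are 1-based and inclusive.
hits = {
    "GENE3D 2.60.120.590": (219, 444),
    "PFAM PF13532": (225, 442),
    "SUPERFAMILY 51197": (222, 443),
    "PROSITE PS51471": (334, 444),
}


def span_length(start, end):
    """Number of residues covered by an inclusive range."""
    return end - start + 1


def shared_region(ranges):
    """Return the inclusive range covered by every hit, or None if there is none."""
    ranges = list(ranges)
    start = max(s for s, _ in ranges)
    end = min(e for _, e in ranges)
    return (start, end) if start <= end else None


if __name__ == "__main__":
    for name, (start, end) in hits.items():
        print(f"{name}: {start}..{end} ({span_length(start, end)} residues)")
    print("Region shared by all hits:", shared_region(hits.values()))
```

The two whole-protein PANTHER family hits (1..444) are left out because they would not narrow the shared region.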

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsaV3_4G026990.1 | CsaV3_4G026990.1 | mRNA