Cp4.1LG00g02340 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG00g02340
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
LocationCp4.1LG00 : 7028689 .. 7041417 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGGATAACAAGTTTAGACGTCCACGCAATCGATTGGGTCCCACATGTACACGTCACGTCCAGAACTAACGAAACAGAGCCCTAACCCTAACCTGCAACTTTCACCATCTTCGTCTTCTTCCTTTTCATAACCCTAATTTCAATCCACCTCTCACACTCTCACACTCTCACCCGCAATAACATGCTTCTGTAATTCTTACGTCACACGCATTTGACCTCCGCAGATGCGAAGCTATGTCTCTGGAAGCTTCGCTTGAACGACGAAAGCAGTCCCAAGCTCCTGTGACTGGCAATGGAAATGGCGTCGTCTCGTCCAGCGCACCTTCCTTCTCAACTCACAGGCTTCGTCTCCAGCCAAAGGAAGATCACAAGTCGGAGACCTACGAGGACCTGCAATTGGAATTTAGCCCCCTCCTCTTTAGTATGCTGGAAAGGCACTTGCCTCCGAGCATGCTCAATGTGGCACGCGACCTTAAGCTTCAGTACATGAGGGACATTCTACTCCGATATGCTCCAGAGGGTGAACGCAACCGAGTAAGCATTTTCAATTGATTATCTCTATATCTGTGATAAATTTCAATACTTGTTTGCTTGGTTGGCTGGTAAATAGAGGAGGAGGAGGAGGAGGAGGAGGAATAGTGATAGATTTTATTCTACACTTTTTTATAGGTTGAAATCCAATGATTAGGCAATGCCGCGTGATCTTATCAGTTGTTAGTTGGAAGTCGTTTGTAGGATAAGCATTAAGATTTTTGTTTTTCTTGCTATGCGATGGTGGATCCGAAGGCGGGAACATGGTTAGTTGAATATGCAGGTTTTTCATTCGTCTGATGGAAGCTAAAACCGCATGTTGGTAAAATTTACGATTAGAACTGCGAATTAGACATTTATGTTAAGATGTCAAGTAAGAAATGAGGATTGAATTTGCCAATTTGCTAATTTTCCTTTGATGGTAGTGAGTCAACGATCTTGATGTTTGAAGCAAATTTGATGAAAGCAAGGAGATATCCATGTTTATACCTTATTACTCGTTGGGAATGCGTTCTGTTGGTCTTGAGGAGCTATAATGGATTTCAGCTTATGAATTTTGGAGGATTTGCATTCAAGTTGCAGTCTGGGAAGTCAAGGTCTATTCTTGTATCTTGTTCCTATATGTGATTGCTTCCATGTTTCTCGGAAGTATTTCCAGAACAGTGGATTCAGTTTTATTTAGACAAAAAAGGAGAAAGTATGTAAAGTATGAAGAAAGATAAATATTCCTTCCAAGAATCGAAAACCTCAATTGAGTTTCCAAAGAGGACTCTATGAATAGTAGCAGGAGAATCATTGCTAAAAGTTAAAAAGTGAACTTCAAATGGATGCTACAAAGCAGATCCAATCCTAAAACTCATTCAATGATCTAATGTTCTATGTTCTAACCATAGGGAACATTTGAACTTGGACTCCGGCAAGCTCCCCATGTTCTAACCAACTATTCTGTCTTTGCTCCGTATGCTATTTTATTTGGGTGCTCTGTGTTTTGGACGTTTTGATGTGTTACATGCTGGCAAGCCGAGTTCACTTATGCGCTTGCCTCACATGAACCACAATTTTAAAAATATTTTTTTGGGTGAACACTCTAATGTCTTTGTTCTTTTTCTTTTACTATTATAGAATTTTCTTAGTTTTTTTTAGCTTATTTGGTCAATCAGAAAGGTGATTTGAACTACATGTTCTTGGGGGTTAGCCAGGGAAACTACCTTTGAATAGTTATTGAAAGAGGAGTCAATCTATTACTTATTAGTTGCATTCCAATCACCTTTTTGTCACCTTCTTGATGCATTTTGTGCGTAAGAAAAGAAAACTCCAGGGTTTTTCTTGAGTGTATTTTCGTGAAATATTCTTTTCTCATTCATTATATCCTGAAAATAACGAAGAGTATCGAGGTAACTTTCCAATTAAATGCATCTCTGTGCCTAATAATGTATAGTTTATTGATTTGATTTAAAAATCACTGTTTTGAAAGCTGCGTTGTTATAGAAAAAGAATATAAATGCCGTATTATGACTTAACTACCATGGGTTAGTGATTAAAATGGCAAGCTATAAATTTTGATGTCCTTACGGTGATGAACACAAACTGTTGTAGTTATGCACCTTGGATATAAATATCCTTAAGTTTTCTCTTTCCTAAGACTAATATTGTAAGGTCAAGTTATAATATACGATTATCAAGGTGTGTAAATTGGTCTAAACACTTGAATTATGGCCCCAATTTTAATATCACATTGTGTTGTAAATGTAATTTACCTTGCATTTACAAAAATTATCATATATGACTTTTGTACTTAATGCATCTCGCATTTTCATCCAAGCAAGAAGTTAGTTTTCTTAAATTATGTTTGAGCCTTGAAAACATGGCCTGCTTATGCATGACAAATTTAGTAGTTCATCGTATTCATAGGCTTAAAATAGTTTTCTTTTTTATGTATATAATCTGAAAACTTTTTAATATATTATGTGCCTCATTCATATCTGCTCCTAGTTTTGAGGCTTAGGAGCTTTTCTACCTTGGTTTGCATTGTCCATATAAAAATATTGGCATTGAAAAAAGCTGAATGATAGAGCATCCTTTCAGTTGCTCTGCCAACCAGATGGTTTGTCATTCTTTACATTGTTATATCCGGTTTCAGCAGATTTCTTGAAGCCGATGGGTGTGTAAGTGGACATCAAAGAAGTTCTATGTATAATTTAACCATCTTTCAATAATGGCTATCTCCTTTATGTCTTTCCACTCATTACATGTGGTTACCCGTATGATCCATGTAACCTTTTGGCATTACCTTCTAGCTAAAAGCAGGAACACCTTAGTTTGGATAGTGAGTCCGTATCTGACTCTTGGATGTTTCGACACTTGTTGGGCACACATTGGATACTTATGAGCACAATAGTTGTGTTAGACACTAGTTGTACAAAGTCAATATAGGTCAAACATTTGTTAGACAAGTATGAACACTTGTTAAGTATATATTAAATAGACATTCGATAATAAATTTTGAGAGCGAAATACATAAAACTCATATTTTTAATCATATAAATGCATAAACTTATGGATTTTGAATTTTCTTCTTGTATCGGAATGACATGTTACAGTGCCTGTGTCAGGTCATATCTATGTTTTGTGTCAGCATCCATGCTCCGTAGGCTTCTAGGAAGATACATTTTTCATGCTGGTGCAAAGACATAAATGCTAGCTTATTTGTTTTCCTGTACTTTTATGGACATATATGCCTTCTGTGATATGCATTTTGATGGGCTATAAAGATTGCTACTTCAAATTGTTGCTAATTTTGAATTCCACACACTCAAGAAAGAAAGAAAGAAAAAGAAAAAAAGAAAAAGAAAAAGAATGTTGTAGCCAATTTTAAAGTGTAACTGTTGGGTGATTTGAAGGCTGATTAATTTTTTTTCTTGTTCCCTTCTGGATCCTACTGCGTGTAGTGTGGACATATGTTATTGGAATTGGTGAAACATGAAAAGCAAAATATTTTATGTTAAGGTTAATTTAATTATATCATCGGCCAATGGAAGTTGTGATATTTCTACATATTTTTATTGTTGTAATGCAAAAAATGCTACTTTTCTCATCTGAAATGTTCTCTTATGTAAACTTTTATGTATTTGAACTATCAGGTTCAGAGGCATAGAGAATACCGACAAAAGATAATATCAAATTATCAGGTAAATGCAAGTCTAGCTAAGTATGTATTCGAGTTAGACACTCTTAGATTGGTGTGAGAATTTATGGAATAGTTTTTCTATGCTTCAAAATTGTTTAAATTAAGGTTTACAGTCAATTGGTTAGTATGTCCAATTCTTTGGGTAAATCTAATGGCATAAAGTTTACTTGATAAATTGGAATGTATGACTGAGTATTTTCAAGGACCACTTTCCAGGGTTTGTCCTAAATTCCTATGTCACTGGGTTTTAGAAGAAGAAAAAAAGTAATGTGGTTTTTTTTTTTTAGCATTAATCTTCAGTAATAAAATTCATAGAAATTTGATGAGATCTTGTGTAATGATCCTCAGCTTTCATTTAGATGATTCCATGGATAGTTACTTTCAGTACTTCCGTATTATATTCCCAGGCTAAGTTGTCCTACTATTTGCAGCCATTACACAGGGAGCTTTACAGCATGCATGCTGCAAACTTCTTTGTCCCTTCTTTTCTCAAAGCTATCAATGAGAATTCAGAGGAGAGCTTTAGACGCATCATGTCTGAACCCTCTCCAGGAATATATAAATTTGAAATGCTTCAGCCACAATTTTGCGAGAAGCTATTATCTGAGGTACCATAGTTTAAATTTATTGGCATTTTTTTTACTTTCGAGAGATTTGGTTTCTGTGTAATACTATTCTCCTCTTAATTTTCAGGTGGAAAGTTTTGAAAGATGGGTTCACGAGACAAAATTCAGAATCATGCGACCAAACACAATGAACAAATATGGTGCTGTTCTTGATGATTTTGGTTTGGAGACCATGCTTGATAAGTTGATGGATGATTTTATCCGTCCTATATCCAGAGGTAATACATTAAATTGCTCCTTGAAAGTTATTTTTGTGTAAAATGCGAGACATTTTCCCTCATGTTCTTTTTTGGTATTATGTAGTTTTCTTTTCAGAAGTTGGAGGAGGCACATTGGACTCTCATCACGGCTTTGTTGTAGAATATGGAATTGATAGAGACGTCGAACTTGGTAGGTTTTAATAGTAACTACTTTATTTTCTTGCTTTCCATTCCTGTCAATCCTATTTTTACTAGTTCTTTTGTTTAATACGAAATTGATGAAGATATGGAGCATGGTAGTACTTTCAGTAGCTTTTTCTACGGTCACCGATCTTATTTTTTGTTTAAAGAAACTGAACTTTCATTGAACAAGTGAATGGATACAAGATATTATAAACAAACAAGCAGGAAAGAAAAGGGAGCACAAACACTAACTACAAAACAGAGATCCATTCTAAAAGACCAAGACGAAGACCCTAATCACAAAAAGGTCAAGTGATCGATGCCCACAAGGAAGCATTATTCAAGAGCACTTCCTCCAAGAAAGACCATCCATTTCTATGTGCACTCCTGAAACACCAAACGAACACCTCCAACAACCCTATTGGTAGTTAGCAAATTGGCACTCCAATAGTTAATGAGTCAGATCCTGCCTCCAATGCACTATTGGGGAAACAAACTAGATAAGAGTGTTTTGGAATGCGATCCAAAGTTGCCTTCCAAGTTAAACTTTCTACGATAAAGATTTCTCTTGCTTAGGAATATTAACCTTCCAGAGAGAGGCAAAAAATAGAAGCCTCAGGAGCTAACAAGGCAAAAGAGAGAGACAAAGAATGAAAGTAAAATTGTCTGGAAAAACTCCTTGAAGAATCAGGGGTCCAAACCCTAACATCTCTCCTCTCGAGCATAATACATTGATCTCGAGGGATAAATAGAAGACTCGCCACATCAAAGAAAGATCAGAGGAAGACAACGTGAGATCATCAAATGCTACCTCTTTCCTGAAAGGAAGTACAGATGGGACATGCACAAAGGTTTATCACTTACCGAAAGGTCCTCCATAAATAGACCTTTGAGCCTTCCCCCACGGTGCACCTAATAAACTAAGAAAACAAAAGAAAACAAGAAGTGATAACTTTCCAAAGGTTTTTACTAAGGCTTCTCAATTCCCCACTCGAACAACACTCAAAAGGGTGAGAGGCATATCTACTCACAATCACTTTATGCTACAGAGTTTCAAATTTCGTTAACCACTTAGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANACCAAATGGGATCCTCCTCCTTCATGAACCCCTCTACGCTAGGTTCCTCATCAATCTTTCAAGCATCTTGCTCACAATGACTGGGTTCCTAAACAAGTATAAGAAGCTAGGTGTCCCACTTAAGACAGCCTGGATTAGAGTAAGTCTCCCCTCTTTGGAGAAAAAAGCCTTTTCTAACTAGATAACTATTTCTAAATTTTTTCTAGAATGGGATTCGAGAACACTTGGCTTTTAGAATTACCTCCCAATGGGAGCTCTAAATAAGTAGAAGGGAATTGGCTAACCTCACACTCTACTGCTACCACCCACTACGCTATCCCAAAAATGGTACACTTCCCCCGTTAATCTTAAGACCAGAAATGGCGTCAAAGAATGATACAGATCTGTTAAGATTAACAAAAGAGCTCTCCAATAATACCCCTTTCATAAAACTTAGAGAAAACCTTCCACAAGTCCTCTTTAATGTTACTCAAACAATCCTGAGAAGGCCATGGGGAATCCGTCAGGACTAGGAGACTTTGGCCTATAATAGCTAAACAACTAAACAACTTTAGTTTAAGTTTTATATAATTTTATTTTATCAAGGATGCATACATGGCCATAAACCCATGATTTTTCTTTTATACTTGCACATATCATTTATGTACTTTTAAGTACACTACAATGTATAAGCATCAATTTTTAGTCCATGATGGATAAATCTCCACCTCCTCGAAGAAAGCCTTCATGTAAAATTGTAGCTAAACTCCTACCCAATTGCTTGAAAAAGGTTTGAGTTGCAAGTGGCTTTTGTAGCAGGGAGACAGATTCTTGATTCGGCCCTATATAGCCAATGGAGCTATTAAGGATTAGAGAAGCAAAAAGAAATGAGTGATTTTGAAAAAGTCTACAATCATGTGGACTGGACTTTCTTGGATTAGGTTTTGGAAAAGAAGGGGCTTGGGTACAAATGAAGGATGTAGATGTGGAGTTATATGAGAAAGGTGAGTTATTATTTTCTCATTAACGGTAGCCCAAAGGGTAACTTTTGGCCTCAAGAGGTCGTAGACGAAGAGATCCTTTATTTTTTTGTTTGTTGAATGTGCTAAGCAGGCTTGTTGGTAAAAGTGTTTTTTCCAAAATCATTGAGCTGGTTGAGGTTGTCAAGGAGAACGTTGGAGTGATATCCCTATTTATTATTGCTTCATATTTAGAGCTTTGTACACGGTTTGTCAGAATATTGAGAAACTTTTGCACAATTTGTTTGGAAAAGGGTGGAAGAAGGAAAAAGGTCGCACTTGGTTGGGTGGGAGGTCATTGAGATGCTGGCTTATTTGAGGTGGAGGGATTAAAAATTGAAAATTTAAGCACTTGTAACAAAGCTCTGTTGGCGAAATGGCTTTGGCATTTTACCCTTTAACCCAACTCTCTCTTGCATAGAACTATTGCTAGCAAGTTTGGTCCTCATTTATTTAAGTGGCTGTCAGGTGTGGTTAAAGACACTTAACGAGAATTTATGGAAGGGTCTCTTTTGAGCTCTCATCTTTCTCTCACTTGTTCCATTGTCCGGTGGGAGAGGGGAAGAAAATATATTTTTGGGAAGAATAGTTGGTGGATGATAGACTCCTTTTGTAACAACCCAAGTCCATCGCTAGCAGATATTGTCCTCTTTGGACTTTACCTTTCGGGCTTCCCCTCCAGGGTTTTAAAATGCATTTGTTAGGGAGAGGTTTCCGCACCCTTATAAAGAATGCTTCGTTCTCCTCCCCAATCGATCTGGGATCTTACAATCCACCCCCCTTCTAGGCCCAACATCCTCACTGGCACTCGTTCCCTTTCGATGTGGGATCCTCCAATCCACCCCCCTTCGAGGACCAGCGTTCTTGTTGGCACATTGCCTCGTGTCCACCCCCATTAAGGGTTCAGCCTCCTTGCTAGCACATCGCTCGGTGTCTGGCCTTGATACCATTTGTAACAACCAAAGCCCACCGCTAGCAGATATTGTCTTCTTTGGGCTTTCCTTTTCGGGCTTCCCCTCAAGATTTTTAAAACGCGTCTGCTATGGAGAGGTTTCCATACTCTTATAAAGAATGTTTCGTTCTCCTCCCCAACCGATGTGAGATATCACATCTTTTCTCCATTTCTCTTGTCTCTATCAGTTGTTTTCTTTTCAAAATTGTGTGATGTCTAATTGTTTGGTCTGGTCGGGGAGCTCGGTTTCCTTTTTTGTTTGGGTTTTGTCTCTCTTTGATAGATAGGGAAGCGATAGATGTCTCTTTCTTTGCCTTGATTGGATAGGTCATTTTTAGGCTTGGGAGAAGAGATGTTCAAGTGTGGAGTCTCCGCTCTTCTAAAGGTTTTTCTTGCCGATCTTTTTTTTCTGTAGTTTGTTGAACCCCTCTCACTTACATGAGTCGGTTTATTCTACTCTATGAAGGATTAAAATTTTAAGGAAGGTGAAGTTCTTTACATGACAAGTTGTACACGACCGAGTTGATCCTAAAGATTGGGTTTCAAAAAGAAAAAAAACCCTATTAGTTGGTTCATTTATTTTTTTCTTTCTTTGTCATAAGGCTGAGGATTATGTGGATCACATTTTCCGAAGATCTTTTCTTCCAGGCATTTGGCTTTTCACTAGCTCGACATAGGAACACTAGGGATATGATGAGGGAGTTCCTCTATCTGCCTTTTCATGAAAAGAGCTTGTTTTTATGACTTGTTGGGGTGTGTACTATTTTGTGCAATCTTTGGGGTGAGTGGGATAACAGAGTATTTAGAGGGTTGGAGAGGTATCCTAGTCATGTAGGGTCCCTCGTTAGATTTCATGTTCGTTTTTGGACTTCGATTTCAAATTTTTTAGTAATTATTCTTTTAAGCGTTGTTGTATAGCTGGAGGCCTCTTCTTTAATGTGCTTCCTTTTTGTGCGCTTAGTTTTTTGTGTGCCCTTGTATTCTTTCATTTTTTTCCTCTATGAAAGTTGTTATTATAAAAAAAAAATCCATCCTTATACATTTGAAAATTTTAATTTCATCACTATAATTAGGTTATCCCAAAAGTTATTATTGTTAACTACTGGGTGACAAATATAATGGAAAGTTAATGAAGCATTAATATAGTGCTATGTGGACTTACTTGACAACACTATACTTGCGTGACAAAGAGGTGATCAACTCTGTTTTCAATGATAAATAACAAAAAAATGCCAATGGAGCTAATTAACTATAGAGCAGACATTAGAACTTCTAGAAAGTATGAGGAAAAAATTAATACACGTGAAACTATAGGAACAAAAGTGTGATAATTTAACCTTACTTTTATATTTACTTAATTTTATTTTGCTCACTGCAATTTTCTTTATAGACATATTTTGCTCACTTAATTTTATTGACATTAAACTTTAAAAGATCAACTATAGTTTGTTCTTTCTCTTTGCTCACTGCTCAGGACTTTTCTTTTTGGCTGAAATTTCATTTGTGGTCCTTTTGGTATATTTCCCTATTTCTATCTTTAGTTACTGTAAACTAAAAATTTATGACATCTGTTCTCTTGTTTTGTTGCTTGTTTTTCTTCTTTTCCAGAGGGTTTCATATTCATCTGTTAGAGTCGAGCTATTAATGATTTTTCCTTAAAAATGATTTTTACTGTAGGTTTTCATGTGGATGACTCGGAAGTCACGTTGAATGTTTGCTTGGGTAAACAATTTTCTGGTGGTGAACTATTCTTTCGTGGCATCCGATGTGACAAACATGTTAATACGGAGACTCAATCAGAGGTATGTTAATCTTAAAGAAAAATCCTTCTTACACCTTTATTTTTGTATTTCTGTTCGTTCTTTGATTGTGAATCATTAAATTTTCTTGTGGACTCTACAACCGTTAGTTCCTTTTGAATCTGCAATGGAATAGATAGAAGCTTATTTAACAATTAATACAATTGAAGAGCTTTTGGATAATTAATACGTTCTTAAATTTATGTAATTAGAATATTATCATTGAAAAGTTGGACATTTATGCGTATGGGTACACGAAAGCAAGATTACTTTCAATTTTTTTTTTCATGATGATGCAAGTGTGTTAATATACATTACAAGATTATTAATTTGTAGTCAGATTGTTGGCCCTTCTGTTAAGTAGTGTTCCGTAGGGGTGCAATTTACCAAAATGGTCTCTTTATTAGGAAATCTTTGACTATTTCCACGTTCCTGGGCATGCGGTTCTTCATCGTGGTCGTCATCGGCATGGTGCTAGAGCCACAACATCTGGGCGTCGGGTCAACTTACTTTTGTGGTGCAGAAGGTAATTTCTTACAAGGTCCTTGACCTTTCGATCATCGCTTTAATTATCCTAGTAAAAGAAAAAGGATGTGATATTTAAGGTGTGTTTGGGAGTATTTATAAAACAAGTGCTTCTAGTAAAAGTATTTTTAGTAAAAGTATTTTTAGTAAAAGTATTTTTAGTAAAAACATTTTAATTATAACTAATTTACTAAAAAGTGATTAATAACTGATCTTTTTAATGTTTGGTTTTATATTCATAAAAGTGTTTTTGATTTCTTTGTAACCAAGTCTTAGCACACTTTACAAATGCAACACTTTAAAGTGACCCAAATCATTTTTTTCACGTCTTTTTAGTTACTTTGATATGACAAAAGTGATTTTGGCTATTCTAAAAGCAACTCCCAAATATGTGGTGTTTTTTGACTTTTGTCTAAAAGGCATCTTTTGTTACTTTGAAATGGGGCGTGTTTGAGTGGGAAACGTAGTCGTATTATGTACAGTTATAGATAATATATTGTGCTCACTTGTTTTCTTCTTTTCCTTACCGTTTCCCCCCCTAAATCTTGCTTGTTACAGTTCTGTATTCAGAGAGTTGAAAAAATACCAGAAAGATTTCTCCAGCTGGTGTGGAGAGTGCCAACGTGAGAAGAGAGAAAGGCAGCTCATTTCAATCGATGCAACAAAGCAGGTAGGTACCATTCTGTCCAAGAGAGTTGCGTTCTACTTGTTTTTACCTTAAATTCAATCTGGCCTATCTCACATCTCCAGGAGTTACTTAAAAGGGAAGTAAAATCTCCTCCTTGAGCCTGCTATGTTGAAAACTATGGTCGAGGAAGAGCTGTAAATTTTAACCAAATACGAAGGATGTTCATCAAATGAAGTCTCTCTGCTGTATTAATGCTTTCGCCATGTGCACAGGTAATGGATGTCCTGTATTATTTATTGTGCTGCATTACATTTTGCAGGTGTAGCTGTACATTTGAAATGTTGTTGATAGGTGGAATTAATTTAACCAATTAGAAATTCGTTTGTGCCGAATTGGAGTGAATTAACTTGTATCTTTAGCCCCTACTTTCTATATATAGAATTGAATATTCTGTTCATAGTGCAGCAATTTTATTACACTGCTCATCAAGTGGGGGACGGCAATGAGATTGGATTAATGTGGTGAAACGAATTATGGTATTTTCTTTCATATTTGGTATGGCATACAAGTATATGTGTTGACAACTTTTTGAACTTTTGATATCATATATGGTTTAATTGACATAGCTCCCGATTGATTGACTTCGTGGCCTATTGATTTCTTGGATAGTTGGTGGTCGTCCTCTCTTTCCTTTTATCTCAATTGCACTGTTACTATTCTCCGAATCTTACATTAACGATGAGAAAGCCTCTGAAATCATATCAATCTTGTGTTGTTATGGTAGAGTCTTAATAGAGATGGGAAATTAAAGATCTTCCTTAATTATTGAAACTATGACCAAGTTTATTTTCACGACTGTTATCTCCTTTTGTTTTTTAAATTCGTTTTAGAAAGTTCAGTATTTACGTATCATTCATGCTATAACACTATTATCTACAAATTTATGATCTTATGAGCAAGAATGAACTCGCTGGCTCTTAAACAGTTAAATTCAAGTTAATAGCATTATCATGTTTTGTATCGTTTCTTGTAGATTGGCCAAATAAAACTACTCTCCCACAACCCCTAGAGTTTTCATTGAAGTTTTGTGAACCTCTTACAGGGTAACCCAGTATTTCAGTTCATAATCGATTTGGAAGATTTTATTCTTTTCTTTTTAATTTAAAGGGAAGTGAACAGCCCAAACCAAACACCATAGTGCCTCTCACACGGTTCGAACTTTGTTTCCATCTCTTATTCAAATTGTTCGAATCTTCCAGCCTTGAGCCTATATAACTCCCACAAAATTTGAGGTAAATCGTAATGGGTTTCCTTCAGCATGTTCGGGTATGAGGAATGAGTGGGTTTCGTGTTCAAAATTTGCAATTCAAGGTGTGTTATGGTAGTATTTCAGGCTTTAGGGGCTTCAAAGTATTGGATGTGTGTTCTATATGTTATTTTCTCTTAATTTTCTTTTTCTTGTTCGTAATCGAACAAATGTCGGTCGGGATTTATGCCAGGGTGCACCGGGGGTGGACGATAAACGAAGGTCTGGTAGAGTGGGGACTCTGTTTTGTGTTGAGGAAGACCCTATTTCCTCGGTGGAATATCACCTTGGCCCTATGGTACTGAAATGAGAAGTTAATTTTGTTTTTTTTTCTCATATAGTCAAAATGGGGATCGAGAAAACACAGTCCATTAGCCTTGGTGGAATTTCACCTAAATATGACTTGGTCAATGAGTCTTCTACGCTGGAAATGCGTTTGTTAGTATTGATTTTATTTATTATATGAAGGTGACATGACCATGGTAGGATAAATAGACTCCAATCTAAGCATCTTTTTGACCATTTTTCTAGATACATCCTTCATTCTTAATTGTCAACCCAATTGCATTCTTGGGGGAGAAAATGGGAGAAGACCTTATGTCATGAATTTTGACCTTTTGGGATTATTAGAAAGAGAGAATTTGATCTCATAACTAGA

mRNA sequence

AGGATAACAAGTTTAGACGTCCACGCAATCGATTGGGTCCCACATGTACACGTCACGTCCAGAACTAACGAAACAGAGCCCTAACCCTAACCTGCAACTTTCACCATCTTCGTCTTCTTCCTTTTCATAACCCTAATTTCAATCCACCTCTCACACTCTCACACTCTCACCCGCAATAACATGCTTCTGTAATTCTTACGTCACACGCATTTGACCTCCGCAGATGCGAAGCTATGTCTCTGGAAGCTTCGCTTGAACGACGAAAGCAGTCCCAAGCTCCTGTGACTGGCAATGGAAATGGCGTCGTCTCGTCCAGCGCACCTTCCTTCTCAACTCACAGGCTTCGTCTCCAGCCAAAGGAAGATCACAAGTCGGAGACCTACGAGGACCTGCAATTGGAATTTAGCCCCCTCCTCTTTAGTATGCTGGAAAGGCACTTGCCTCCGAGCATGCTCAATGTGGCACGCGACCTTAAGCTTCAGTACATGAGGGACATTCTACTCCGATATGCTCCAGAGGGTGAACGCAACCGAGTTCAGAGGCATAGAGAATACCGACAAAAGATAATATCAAATTATCAGGCTAAGTTGTCCTACTATTTGCAGCCATTACACAGGGAGCTTTACAGCATGCATGCTGCAAACTTCTTTGTCCCTTCTTTTCTCAAAGCTATCAATGAGAATTCAGAGGAGAGCTTTAGACGCATCATGTCTGAACCCTCTCCAGGAATATATAAATTTGAAATGCTTCAGCCACAATTTTGCGAGAAGCTATTATCTGAGGTGGAAAGTTTTGAAAGATGGGTTCACGAGACAAAATTCAGAATCATGCGACCAAACACAATGAACAAATATGGTGCTGTTCTTGATGATTTTGGTTTGGAGACCATGCTTGATAAGTTGATGGATGATTTTATCCGTCCTATATCCAGAGTTTTCTTTTCAGAAGTTGGAGGAGGCACATTGGACTCTCATCACGGCTTTGTTGTAGAATATGGAATTGATAGAGACGTCGAACTTGGTTTTCATGTGGATGACTCGGAAGTCACGTTGAATGTTTGCTTGGGTAAACAATTTTCTGGTGGTGAACTATTCTTTCGTGGCATCCGATGTGACAAACATGTTAATACGGAGACTCAATCAGAGGAAATCTTTGACTATTTCCACGTTCCTGGGCATGCGGTTCTTCATCGTGGTCGTCATCGGCATGGTGCTAGAGCCACAACATCTGGGCGTCGGGTCAACTTACTTTTGTGGTGCAGAAGTTCTGTATTCAGAGAGTTGAAAAAATACCAGAAAGATTTCTCCAGCTGGTGTGGAGAGTGCCAACGTGAGAAGAGAGAAAGGCAGCTCATTTCAATCGATGCAACAAAGCAGGAGTTACTTAAAAGGGAAGTAAAATCTCCTCCTTGAGCCTGCTATGTTGAAAACTATGGTCGAGGAAGAGCTGTAAATTTTAACCAAATACGAAGGATGTTCATCAAATGAAGTCTCTCTGCTGTATTAATGCTTTCGCCATGTGCACAGCAATTTTATTACACTGCTCATCAAGTGGGGGACGGCAATGAGATTGGATTAATGTGGTGAAACGAATTATGGTATTTTCTTTCATATTTGGGTGCACCGGGGGTGGACGATAAACGAAGGTCTGGTAGAGTGGGGACTCTGTTTTGTGTTGAGGAAGACCCTATTTCCTCGGTGGAATATCACCTTGGCCCTATGGTACTGAAATGAGAAGTTAATTTTGTTTTTTTTTCTCATATAGTCAAAATGGGGATCGAGAAAACACAGTCCATTAGCCTTGGTGGAATTTCACCTAAATATGACTTGGTCAATGAGTCTTCTACGCTGGAAATGCGTTTGTTAGTATTGATTTTATTTATTATATGAAGGTGACATGACCATGGTAGGATAAATAGACTCCAATCTAAGCATCTTTTTGACCATTTTTCTAGATACATCCTTCATTCTTAATTGTCAACCCAATTGCATTCTTGGGGGAGAAAATGGGAGAAGACCTTATGTCATGAATTTTGACCTTTTGGGATTATTAGAAAGAGAGAATTTGATCTCATAACTAGA

Coding sequence (CDS)

ATGTCTCTGGAAGCTTCGCTTGAACGACGAAAGCAGTCCCAAGCTCCTGTGACTGGCAATGGAAATGGCGTCGTCTCGTCCAGCGCACCTTCCTTCTCAACTCACAGGCTTCGTCTCCAGCCAAAGGAAGATCACAAGTCGGAGACCTACGAGGACCTGCAATTGGAATTTAGCCCCCTCCTCTTTAGTATGCTGGAAAGGCACTTGCCTCCGAGCATGCTCAATGTGGCACGCGACCTTAAGCTTCAGTACATGAGGGACATTCTACTCCGATATGCTCCAGAGGGTGAACGCAACCGAGTTCAGAGGCATAGAGAATACCGACAAAAGATAATATCAAATTATCAGGCTAAGTTGTCCTACTATTTGCAGCCATTACACAGGGAGCTTTACAGCATGCATGCTGCAAACTTCTTTGTCCCTTCTTTTCTCAAAGCTATCAATGAGAATTCAGAGGAGAGCTTTAGACGCATCATGTCTGAACCCTCTCCAGGAATATATAAATTTGAAATGCTTCAGCCACAATTTTGCGAGAAGCTATTATCTGAGGTGGAAAGTTTTGAAAGATGGGTTCACGAGACAAAATTCAGAATCATGCGACCAAACACAATGAACAAATATGGTGCTGTTCTTGATGATTTTGGTTTGGAGACCATGCTTGATAAGTTGATGGATGATTTTATCCGTCCTATATCCAGAGTTTTCTTTTCAGAAGTTGGAGGAGGCACATTGGACTCTCATCACGGCTTTGTTGTAGAATATGGAATTGATAGAGACGTCGAACTTGGTTTTCATGTGGATGACTCGGAAGTCACGTTGAATGTTTGCTTGGGTAAACAATTTTCTGGTGGTGAACTATTCTTTCGTGGCATCCGATGTGACAAACATGTTAATACGGAGACTCAATCAGAGGAAATCTTTGACTATTTCCACGTTCCTGGGCATGCGGTTCTTCATCGTGGTCGTCATCGGCATGGTGCTAGAGCCACAACATCTGGGCGTCGGGTCAACTTACTTTTGTGGTGCAGAAGTTCTGTATTCAGAGAGTTGAAAAAATACCAGAAAGATTTCTCCAGCTGGTGTGGAGAGTGCCAACGTGAGAAGAGAGAAAGGCAGCTCATTTCAATCGATGCAACAAAGCAGGAGTTACTTAAAAGGGAAGTAAAATCTCCTCCTTGA

Protein sequence

MSLEASLERRKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREKRERQLISIDATKQELLKREVKSPP
BLAST of Cp4.1LG00g02340 vs. Swiss-Prot
Match: Y1295_ARATH (Uncharacterized PKHD-type hydroxylase At1g22950 OS=Arabidopsis thaliana GN=At1g22950 PE=2 SV=2)

HSP 1 Score: 465.7 bits (1197), Expect = 5.0e-130
Identity = 225/388 (57.99%), Postives = 293/388 (75.52%), Query Frame = 1

Query: 1   MSLEASLER--RKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFS 60
           M+L++S ++  ++Q Q P   +GNG            +LR  P E+H+ E YEDL L++S
Sbjct: 10  MALDSSGKQPEQQQQQQPRASSGNGEARL--------KLRRTPNEEHEPENYEDLPLDYS 69

Query: 61  PLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAK 120
           P LF+ LER+LP  +LN  R  K  +MRD+LLRY+P+ ER RV RH+EYR KI+S+YQ  
Sbjct: 70  PSLFTSLERYLPEQLLNSTRIDKASFMRDLLLRYSPDTERVRVLRHKEYRDKIMSSYQR- 129

Query: 121 LSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 180
                  LH E+Y++  ++FF PSFL A +  SE +FR  M E  PGI+ FEM +PQFCE
Sbjct: 130 -------LHGEIYTLDPSSFFAPSFLGAFSRKSEPNFRSSMVESYPGIFTFEMFKPQFCE 189

Query: 181 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSE 240
            LL+EVE  E+WV++++  IMRPNTMN +G VLDDFG ++ML KL+DDFI PI++V F E
Sbjct: 190 MLLAEVEHMEKWVYDSRSTIMRPNTMNNFGVVLDDFGFDSMLQKLVDDFISPIAQVLFPE 249

Query: 241 VGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 300
           V G +LDSHHG++VEYG DRDV+LGFHVDDSEV+LNVCLGKQFSGGEL+FRG+RCDKHVN
Sbjct: 250 VCGTSLDSHHGYIVEYGKDRDVDLGFHVDDSEVSLNVCLGKQFSGGELYFRGVRCDKHVN 309

Query: 301 TETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 360
           +++  +E++DY HVPGHA+LHRGRHRHGARATTSG R NL+LWCRSS FRE+K YQ+DFS
Sbjct: 310 SDSTEKEVYDYSHVPGHAILHRGRHRHGARATTSGHRANLILWCRSSTFREMKNYQRDFS 369

Query: 361 SWCGECQREKRERQLISIDATKQELLKR 387
            WCG C+ +K+ RQ  SI+ATK+ L ++
Sbjct: 370 GWCGGCKLDKQRRQRDSINATKEILARK 381

BLAST of Cp4.1LG00g02340 vs. Swiss-Prot
Match: OGFD2_XENTR (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Xenopus tropicalis GN=ogfod2 PE=2 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 9.1e-39
Identity = 93/258 (36.05%), Postives = 148/258 (57.36%), Query Frame = 1

Query: 98  RNRVQRHREYRQKIISNYQAKLSYYLQPLHRELYSMHAANFFVPSFLKAIN------ENS 157
           +  V+R R+  ++ +   + ++S + +PL+ E+Y +  + F    FL A+        N 
Sbjct: 63  KKEVERRRKLGEESLHR-RREISLHYKPLYPEVYVLQES-FLAAEFLTAVKYSKSPQANV 122

Query: 158 EESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVL 217
           E     + S     IY+  +  P+FC KL+ E+E+FER    +     RPNTMN YG +L
Sbjct: 123 EGLLHHLHSITDKRIYRLPVFIPEFCAKLVEELENFER----SDLPKGRPNTMNNYGILL 182

Query: 218 DDFG-LETMLDKLMDDFIRPISRVFFSEVGGGTLDSHHGFVVEYGIDRDVELGFHVDDSE 277
           ++ G ++ +   L + +I P++ + F + GGG LDSH  FVV+Y +  D++L  H D++E
Sbjct: 183 NELGFVDALTAPLCEKYIEPLTSLLFPDWGGGCLDSHRAFVVKYALQEDLDLSCHYDNAE 242

Query: 278 VTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFDYFHVPGHAVLHRGRHRHGARAT 337
           VTLNV LGK+F+ G L+F  ++ +  VN  T +E      H+ G  +LHRG+H HGA   
Sbjct: 243 VTLNVSLGKEFTDGNLYFSDMK-EVPVNERTYAE----VEHITGQGILHRGQHVHGALPI 302

Query: 338 TSGRRVNLLLWCRSSVFR 349
           +SG R NL+LW R+S  R
Sbjct: 303 SSGERWNLILWMRASDVR 309

BLAST of Cp4.1LG00g02340 vs. Swiss-Prot
Match: OGFD2_DANRE (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Danio rerio GN=ogfod2 PE=2 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 3.2e-36
Identity = 91/281 (32.38%), Postives = 149/281 (53.02%), Query Frame = 1

Query: 75  NVARDLKLQ---YMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLSYYLQPLHRELY 134
           N+ R L  +     RD++ +   E ER +  + +   +  +      +     PLH+ +Y
Sbjct: 38  NILRSLGCESESQFRDVIGKIQAEIERRQNHKLKSTERAAV------IKEIYTPLHQHVY 97

Query: 135 SMHAANFFVPSFLKAIN------ENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVE 194
            +  + F  P  L+ +        N +   + I +E +  +++F++ + +FC+ LL E+E
Sbjct: 98  HLQES-FLAPELLEMVKYCASSEANVQGLLKLIQTEAASRVFRFQVFRKEFCKDLLEELE 157

Query: 195 SFERWVHETKFRIMRPNTMNKYGAVLDDFGL-ETMLDKLMDDFIRPISRVFFSEVGGGTL 254
            FE    ++     RPNTMN YG VL++ G  E  +  L + ++RP++ + +S+ GG  L
Sbjct: 158 HFE----QSDAPKGRPNTMNNYGIVLNELGFDEGFITPLREVYLRPLTALLYSDCGGNCL 217

Query: 255 DSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSE 314
           DSH  FVV+Y +  D+ L +H D+SEVTLNV LGK F+ G LFF  +R            
Sbjct: 218 DSHKAFVVKYDMHEDLNLSYHYDNSEVTLNVSLGKDFTEGNLFFGDMR-----QVPLSET 277

Query: 315 EIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSS 346
           E  +  H     +LHRG+H HGA + +SG R NL++W R+S
Sbjct: 278 ECVEVEHRVTEGLLHRGQHMHGALSISSGTRWNLIIWMRAS 302

BLAST of Cp4.1LG00g02340 vs. Swiss-Prot
Match: OGFD2_HUMAN (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Homo sapiens GN=OGFOD2 PE=2 SV=2)

HSP 1 Score: 149.8 bits (377), Expect = 6.1e-35
Identity = 83/203 (40.89%), Postives = 115/203 (56.65%), Query Frame = 1

Query: 166 IYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGL-ETMLDKLM 225
           IY+  +    FC+ LL E+E FE    ++     RPNTMN YG +L + GL E ++  L 
Sbjct: 139 IYRVPVFTAPFCQALLEELEHFE----QSDMPKGRPNTMNNYGVLLHELGLDEPLMTPLR 198

Query: 226 DDFIRPISRVFFSEVGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG 285
           + F++P+  + + + GGG LDSH  FVV+Y   +D+ELG H D++E+TLNV LGK F+GG
Sbjct: 199 ERFLQPLMALLYPDCGGGRLDSHRAFVVKYAPGQDLELGCHYDNAELTLNVALGKVFTGG 258

Query: 286 ELFFRGIRCDKHVNTETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRS 345
            L+F G+         T   E  +  HV G  VLHRG   HGAR   +G R NL++W R+
Sbjct: 259 ALYFGGL-----FQAPTALTEPLEVEHVVGQGVLHRGGQLHGARPLGTGERWNLVVWLRA 318

Query: 346 SVFRELKKYQKDFSSWCGECQRE 368
           S  R         +S C  C RE
Sbjct: 319 SAVR---------NSLCPMCCRE 323

BLAST of Cp4.1LG00g02340 vs. Swiss-Prot
Match: OGFD2_MOUSE (2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Mus musculus GN=Ogfod2 PE=2 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 1.1e-33
Identity = 88/261 (33.72%), Postives = 139/261 (53.26%), Query Frame = 1

Query: 96  GERNRVQRHREYRQKIISNYQAKLSYYLQPLHRELYSMHAANFFVPSFLKAINENS---- 155
           G R R+ +    R+ +I++     SY+  P   E+YS        P F+ A   ++    
Sbjct: 68  GRRRRLGQESAVRKALIAS-----SYH--PARPEVYSSLQDAALAPEFMAAAEYSTSPGA 127

Query: 156 --EESFRRIMS-EPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYG 215
             E   +R+ +      IY+  +   +FC+ LL E+E FE    ++     RPNTMN +G
Sbjct: 128 DLEGLLQRLETVSEEKRIYRVPVFSAKFCQTLLEELEHFE----QSDMPKGRPNTMNNHG 187

Query: 216 AVLDDFGLET-MLDKLMDDFIRPISRVFFSEVGGGTLDSHHGFVVEYGIDRDVELGFHVD 275
            ++ + GL+  ++  L + F+ P+  + + + GGG LDSH  FVV+Y + +D++LG H D
Sbjct: 188 VLMYELGLDDPLVTPLRERFLLPLMALLYPDYGGGYLDSHRAFVVKYALGQDLDLGCHYD 247

Query: 276 DSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFDYFHVPGHAVLHRGRHRHGA 335
           ++E+TLNV LGK F+GG L+F G+            +E  +  HV G  +LHRG   HGA
Sbjct: 248 NAELTLNVALGKDFTGGALYFGGL-----FQAPAALKETLEVEHVVGSGILHRGGQLHGA 307

Query: 336 RATTSGRRVNLLLWCRSSVFR 349
           R    G R NL++W R+S  R
Sbjct: 308 RPLCKGERWNLVVWLRASAVR 312

BLAST of Cp4.1LG00g02340 vs. TrEMBL
Match: A0A0A0KPJ6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G576670 PE=4 SV=1)

HSP 1 Score: 740.0 bits (1909), Expect = 1.5e-210
Identity = 367/392 (93.62%), Postives = 375/392 (95.66%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPL 60
           MSLEASLERRKQ QAP TGNGNGVVS +  S STHRLRLQPKEDHKSE+YEDLQLEFSP+
Sbjct: 1   MSLEASLERRKQPQAPGTGNGNGVVSPTPQSLSTHRLRLQPKEDHKSESYEDLQLEFSPV 60

Query: 61  LFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLS 120
           LFSMLERHLPP+MLNVAR++KLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQ    
Sbjct: 61  LFSMLERHLPPNMLNVAREVKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQ---- 120

Query: 121 YYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180
               PLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL
Sbjct: 121 ----PLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180

Query: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVG 240
           LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFF EVG
Sbjct: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVG 240

Query: 241 GGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300
           G TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE
Sbjct: 241 GATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300

Query: 301 TQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360
           TQSEEIFDY HVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW
Sbjct: 301 TQSEEIFDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360

Query: 361 CGECQREKRERQLISIDATKQELLKREVKSPP 393
           CGECQREKRERQL+SIDATKQELL+REVKSPP
Sbjct: 361 CGECQREKRERQLLSIDATKQELLRREVKSPP 384

BLAST of Cp4.1LG00g02340 vs. TrEMBL
Match: A0A061GER7_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_026929 PE=4 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 9.4e-176
Identity = 311/393 (79.13%), Postives = 343/393 (87.28%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTG-NGNGVVSSSAPSFST-HRLRLQPKEDHKSETYEDLQLEFS 60
           MS + + +  +Q   P  G NGNGV  +  PS +T HRLRL P  +HK ETYE LQLEFS
Sbjct: 1   MSFDLTRKEPQQPTPPSAGCNGNGV--AVLPSMATAHRLRLNPNTEHKPETYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAK 120
           PLLFS LER+LPP ML+++RD KL YMRDI+LRY+PEGER RVQRHREYRQKIIS+YQ  
Sbjct: 61  PLLFSSLERYLPPPMLSLSRDSKLNYMRDIILRYSPEGERTRVQRHREYRQKIISHYQ-- 120

Query: 121 LSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 180
                 PLHRELY+MHA+NFFVPSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE
Sbjct: 121 ------PLHRELYAMHASNFFVPSFLKAINENKEESFRSIMAEPTLGVFTFEMLQPHFCE 180

Query: 181 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSE 240
            LLSEVE+FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETMLDKLM+DFIRPIS+VFFS+
Sbjct: 181 LLLSEVENFEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEDFIRPISKVFFSD 240

Query: 241 VGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 300
           VGG TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVN
Sbjct: 241 VGGSTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVN 300

Query: 301 TETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 360
           TETQS+EI DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFS
Sbjct: 301 TETQSDEILDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFS 360

Query: 361 SWCGECQREKRERQLISIDATKQELLKREVKSP 392
           SWCGECQREK+ERQ +SI ATKQELLKRE K P
Sbjct: 361 SWCGECQREKKERQRVSIAATKQELLKREGKPP 383

BLAST of Cp4.1LG00g02340 vs. TrEMBL
Match: A0A061G6N2_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_026929 PE=4 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 4.8e-172
Identity = 303/383 (79.11%), Postives = 335/383 (87.47%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTG-NGNGVVSSSAPSFST-HRLRLQPKEDHKSETYEDLQLEFS 60
           MS + + +  +Q   P  G NGNGV  +  PS +T HRLRL P  +HK ETYE LQLEFS
Sbjct: 1   MSFDLTRKEPQQPTPPSAGCNGNGV--AVLPSMATAHRLRLNPNTEHKPETYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAK 120
           PLLFS LER+LPP ML+++RD KL YMRDI+LRY+PEGER RVQRHREYRQKIIS+YQ  
Sbjct: 61  PLLFSSLERYLPPPMLSLSRDSKLNYMRDIILRYSPEGERTRVQRHREYRQKIISHYQ-- 120

Query: 121 LSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 180
                 PLHRELY+MHA+NFFVPSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE
Sbjct: 121 ------PLHRELYAMHASNFFVPSFLKAINENKEESFRSIMAEPTLGVFTFEMLQPHFCE 180

Query: 181 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSE 240
            LLSEVE+FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETMLDKLM+DFIRPIS+VFFS+
Sbjct: 181 LLLSEVENFEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEDFIRPISKVFFSD 240

Query: 241 VGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 300
           VGG TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVN
Sbjct: 241 VGGSTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVN 300

Query: 301 TETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 360
           TETQS+EI DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFS
Sbjct: 301 TETQSDEILDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFS 360

Query: 361 SWCGECQREKRERQLISIDATKQ 382
           SWCGECQREK+ERQ +SI ATKQ
Sbjct: 361 SWCGECQREKKERQRVSIAATKQ 373

BLAST of Cp4.1LG00g02340 vs. TrEMBL
Match: A0A0D2NB24_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G118000 PE=4 SV=1)

HSP 1 Score: 610.1 bits (1572), Expect = 1.8e-171
Identity = 302/390 (77.44%), Postives = 337/390 (86.41%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTG-NGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSP 60
           MSLE + +  +Q   P  G NGNG+    + + +THRLRL P  +HK E+YE L LEFSP
Sbjct: 1   MSLEVTRKENQQPTPPTGGHNGNGMALLQSMA-TTHRLRLNPNTEHKPESYEGLHLEFSP 60

Query: 61  LLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKL 120
           LLFS LER+LPP ML+ +RD KL YMRDI+LRY+PEGER RVQ+ REYRQKIIS+YQ   
Sbjct: 61  LLFSSLERYLPPPMLSHSRDSKLHYMRDIILRYSPEGERTRVQKQREYRQKIISHYQ--- 120

Query: 121 SYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEK 180
                PLHRELY+MHA+NFF PSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE 
Sbjct: 121 -----PLHRELYAMHASNFFAPSFLKAINENKEESFRSIMAEPTQGVFTFEMLQPHFCEL 180

Query: 181 LLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEV 240
           LLSEVE+FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETML KLM+DFIRPIS+VFFS+V
Sbjct: 181 LLSEVENFEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLGKLMEDFIRPISKVFFSDV 240

Query: 241 GGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNT 300
           GG TLDSHHGFVVEYGI+RDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVNT
Sbjct: 241 GGSTLDSHHGFVVEYGINRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVNT 300

Query: 301 ETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSS 360
           ETQS+EI DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFSS
Sbjct: 301 ETQSDEILDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFSS 360

Query: 361 WCGECQREKRERQLISIDATKQELLKREVK 390
           WCGECQREK+ERQ +SI ATKQELLKRE K
Sbjct: 361 WCGECQREKKERQRVSIAATKQELLKREGK 381

BLAST of Cp4.1LG00g02340 vs. TrEMBL
Match: A0A0B0MGW6_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_22241 PE=4 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 1.6e-170
Identity = 301/391 (76.98%), Postives = 335/391 (85.68%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTG-NGNGV-VSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFS 60
           MS + S +  +Q   P TG NGN V V  S  + + +RLRL P  +HK E YE LQLEFS
Sbjct: 1   MSFDVSRKEAQQPAPPSTGHNGNAVAVLPSMSTVTANRLRLNPNTEHKPENYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAK 120
           PLLFS LER+LPP ML++ RD KL YMRDI+LRY+P+GER RVQRHREYRQKIIS+YQ  
Sbjct: 61  PLLFSSLERYLPPPMLSLPRDSKLHYMRDIILRYSPDGERIRVQRHREYRQKIISHYQ-- 120

Query: 121 LSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 180
                 PLHRELY+MHA+NFF PSFLKAINEN EE FR IM+EP+ G++ FEM QP+FCE
Sbjct: 121 ------PLHRELYTMHASNFFAPSFLKAINENKEEGFRSIMAEPTLGVFTFEMFQPRFCE 180

Query: 181 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSE 240
            LLSEVE+FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETML KLM+DFIRPIS+VFFS+
Sbjct: 181 LLLSEVENFEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLGKLMEDFIRPISKVFFSD 240

Query: 241 VGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 300
           VGG TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVN
Sbjct: 241 VGGSTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVN 300

Query: 301 TETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 360
           TETQS+EI DY HVPG AVLHRGRHRHGARATTSG+R NLLLWCRSSVFREL+KYQKDFS
Sbjct: 301 TETQSDEILDYSHVPGRAVLHRGRHRHGARATTSGQRFNLLLWCRSSVFRELRKYQKDFS 360

Query: 361 SWCGECQREKRERQLISIDATKQELLKREVK 390
            WCGECQREK+ERQ +SI ATKQELLKRE K
Sbjct: 361 IWCGECQREKKERQRVSISATKQELLKREGK 383

BLAST of Cp4.1LG00g02340 vs. TAIR10
Match: AT3G18210.1 (AT3G18210.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 516.2 bits (1328), Expect = 1.8e-146
Identity = 255/395 (64.56%), Postives = 312/395 (78.99%), Query Frame = 1

Query: 6   SLERRKQSQAPVTGN--GNGVVS-----SSAPS--------FSTHRLRLQPKEDHKSETY 65
           S E+R+ SQ   T    GNG ++     S+AP+         S  RLRL P  +H+ ++Y
Sbjct: 2   SSEQREGSQETTTTTVEGNGTIAGQNSHSAAPTTLRATSTMVSCQRLRLNPNNEHRPDSY 61

Query: 66  EDLQLEFSPLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQK 125
           EDLQL+F   ++S LE++LPP+ML   RD K+++M DI+LR+ P GER+R QRH +YR K
Sbjct: 62  EDLQLDFPNSVYSSLEKYLPPNMLVSNRDEKIKFMTDIMLRHLPHGERSRAQRHSDYRLK 121

Query: 126 IISNYQAKLSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFE 185
           I +NYQ        PLH+ELY++     FVP+FLKAINEN+EESFR I+SEPSPG++ F+
Sbjct: 122 ITTNYQ--------PLHKELYTLVPTVCFVPAFLKAINENTEESFRNIISEPSPGVFVFD 181

Query: 186 MLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRP 245
           MLQP FCE +L+E+++FERWV ETKFRIMRPNTMNKYGAVLDDFGL+TMLDKLM+ FIRP
Sbjct: 182 MLQPSFCEMMLAEIDNFERWVGETKFRIMRPNTMNKYGAVLDDFGLDTMLDKLMEGFIRP 241

Query: 246 ISRVFFSEVGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRG 305
           IS+VFFS+VGG TLDSHHGFVVEYG DRDV+LGFHVDDSEVTLNVCLG QF GGELFFRG
Sbjct: 242 ISKVFFSDVGGATLDSHHGFVVEYGKDRDVDLGFHVDDSEVTLNVCLGNQFVGGELFFRG 301

Query: 306 IRCDKHVNTETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFREL 365
            RC+KHVNT T+++E +DY H+PG AVLHRGRHRHGARATT G RVN+LLWCRSSVFREL
Sbjct: 302 TRCEKHVNTATKADETYDYCHIPGQAVLHRGRHRHGARATTCGHRVNMLLWCRSSVFREL 361

Query: 366 KKYQKDFSSWCGECQREKRERQLISIDATKQELLK 386
           K + KDFSSWCGEC  EKR+ ++ SIDA +++L K
Sbjct: 362 KTHHKDFSSWCGECFCEKRDEKVRSIDALRKKLFK 388

BLAST of Cp4.1LG00g02340 vs. TAIR10
Match: AT1G22950.1 (AT1G22950.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 465.7 bits (1197), Expect = 2.8e-131
Identity = 225/388 (57.99%), Postives = 293/388 (75.52%), Query Frame = 1

Query: 1   MSLEASLER--RKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFS 60
           M+L++S ++  ++Q Q P   +GNG            +LR  P E+H+ E YEDL L++S
Sbjct: 10  MALDSSGKQPEQQQQQQPRASSGNGEARL--------KLRRTPNEEHEPENYEDLPLDYS 69

Query: 61  PLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAK 120
           P LF+ LER+LP  +LN  R  K  +MRD+LLRY+P+ ER RV RH+EYR KI+S+YQ  
Sbjct: 70  PSLFTSLERYLPEQLLNSTRIDKASFMRDLLLRYSPDTERVRVLRHKEYRDKIMSSYQR- 129

Query: 121 LSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 180
                  LH E+Y++  ++FF PSFL A +  SE +FR  M E  PGI+ FEM +PQFCE
Sbjct: 130 -------LHGEIYTLDPSSFFAPSFLGAFSRKSEPNFRSSMVESYPGIFTFEMFKPQFCE 189

Query: 181 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSE 240
            LL+EVE  E+WV++++  IMRPNTMN +G VLDDFG ++ML KL+DDFI PI++V F E
Sbjct: 190 MLLAEVEHMEKWVYDSRSTIMRPNTMNNFGVVLDDFGFDSMLQKLVDDFISPIAQVLFPE 249

Query: 241 VGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 300
           V G +LDSHHG++VEYG DRDV+LGFHVDDSEV+LNVCLGKQFSGGEL+FRG+RCDKHVN
Sbjct: 250 VCGTSLDSHHGYIVEYGKDRDVDLGFHVDDSEVSLNVCLGKQFSGGELYFRGVRCDKHVN 309

Query: 301 TETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 360
           +++  +E++DY HVPGHA+LHRGRHRHGARATTSG R NL+LWCRSS FRE+K YQ+DFS
Sbjct: 310 SDSTEKEVYDYSHVPGHAILHRGRHRHGARATTSGHRANLILWCRSSTFREMKNYQRDFS 369

Query: 361 SWCGECQREKRERQLISIDATKQELLKR 387
            WCG C+ +K+ RQ  SI+ATK+ L ++
Sbjct: 370 GWCGGCKLDKQRRQRDSINATKEILARK 381

BLAST of Cp4.1LG00g02340 vs. TAIR10
Match: AT1G48740.2 (AT1G48740.2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 368.6 bits (945), Expect = 4.7e-102
Identity = 186/375 (49.60%), Postives = 250/375 (66.67%), Query Frame = 1

Query: 13  SQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPLLFSMLERHLPPS 72
           SQ PVT       +++    +  RL   P  +H S+ Y DL+LE+S  + S LE++LPP 
Sbjct: 14  SQPPVT-----TAAATTEEKAIARLSPFPNMEHISDNYGDLELEYSSAMLSSLEKYLPPE 73

Query: 73  MLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLSYYLQPLHRELYS 132
           ML   R+ K ++M DIL +Y    E ++ +  + Y QKI SNYQ        PL RELY+
Sbjct: 74  MLTATREEKAKFMSDILRKYISRDECSKAKWCKNYWQKIKSNYQ--------PLSRELYN 133

Query: 133 MHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERWVH 192
                F +PSF KAI+EN++ESFRRI+SEP PG+  F+M QP F +KL+ EVE+  +WVH
Sbjct: 134 FDPELFLLPSFRKAISENTKESFRRIISEPFPGVLVFQMFQPDFIQKLIVEVENIGKWVH 193

Query: 193 ETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVGGGTLDSHHGFVV 252
           ET F I RP  M+KYG    DFGL+ ML +LM++F+ PI +VFF E  G   DSHHG+ +
Sbjct: 194 ETNFPIRRPYHMSKYGVAFVDFGLDIMLQQLMEEFLFPICKVFFPEECGAMFDSHHGYYI 253

Query: 253 EYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFDYFHV 312
           E G DRD  LG+H+DDSE+TLNVC+ KQF GGE+ F G RC +H  T+ + EE+F Y H 
Sbjct: 254 ENGEDRDPPLGYHLDDSEITLNVCVRKQFEGGEISFIGTRCLRHKRTDVKPEEVFHYCHS 313

Query: 313 PGHAVLHRGRHRHGARATT-SGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREKRER 372
           PG A+LHRGRHRHG RA T S  R N++L CR+S+FRE++KY+KDF  WC EC  EK+E+
Sbjct: 314 PGQAILHRGRHRHGPRANTPSCSRANMILCCRNSLFREMEKYEKDFPEWCNECAHEKKEK 373

Query: 373 QLISIDATKQELLKR 387
           +  S+DA ++ + KR
Sbjct: 374 ESQSLDAKRKVIKKR 375

BLAST of Cp4.1LG00g02340 vs. TAIR10
Match: AT5G43660.1 (AT5G43660.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 362.5 bits (929), Expect = 3.4e-100
Identity = 182/378 (48.15%), Postives = 247/378 (65.34%), Query Frame = 1

Query: 11  KQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPLLFSMLERHLP 70
           +  Q PVT         + P     RL L P  +H S+ YEDL+LEFS  +   LER+LP
Sbjct: 34  RDHQPPVTTTAAATTKMAIP-----RLSLLPNNEHNSDNYEDLELEFSSSVLRSLERYLP 93

Query: 71  PSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLSYYLQPLHREL 130
           P +L   R+ K ++M DIL +Y    E  +  R + YR+ I+SNYQ        P  REL
Sbjct: 94  PEILTANREEKAKFMSDILHKYISREECAKAIRFKNYREWIMSNYQ--------PRFREL 153

Query: 131 YSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKLLSEVESFERW 190
           Y +   +  +P F KA+ EN+EESFRRIM EP PG+Y F+M QP F +KLL EVE+  +W
Sbjct: 154 YKLDPESLLLPCFRKAVRENTEESFRRIMFEPFPGVYVFKMFQPDFFQKLLVEVENMRKW 213

Query: 191 VHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVGGGTLDSHHGF 250
           +HE K  I +PN  +KYG VLDDFG++ ML  L++DFI PI +VFF +V G   D+ HGF
Sbjct: 214 LHEAKLMIRKPNNKSKYGVVLDDFGMDIMLKPLVEDFIFPICKVFFPQVCGTMFDTQHGF 273

Query: 251 VVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTETQSEEIFDYF 310
           V+E   DRD ELGFHV++S++TLNVCL KQ  GGE+ F G RC+KH+    + EEIF+Y 
Sbjct: 274 VIENCEDRDAELGFHVENSDITLNVCLSKQSEGGEILFTGTRCNKHLKAGPKPEEIFEYC 333

Query: 311 HVPGHAVLHRGRHRHGARAT-TSGRRVNLLLWCRSSVFRELKKYQKDFSSWCGECQREKR 370
           H PG A+LH G H HGA+A  TS  R N++LWC +S+FRE++ Y  +F  WCG+C REK+
Sbjct: 334 HEPGQAILHLGCHSHGAKAAITSCSRANMILWCINSLFREMQTYDNEFRDWCGQCAREKK 393

Query: 371 ERQLISIDATKQELLKRE 388
           E++  S+ A+K+++ KR+
Sbjct: 394 EKKSQSL-ASKRKVKKRK 397

BLAST of Cp4.1LG00g02340 vs. TAIR10
Match: AT1G48700.1 (AT1G48700.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 338.6 bits (867), Expect = 5.2e-93
Identity = 157/293 (53.58%), Postives = 213/293 (72.70%), Query Frame = 1

Query: 100 RVQRHREYRQKIISNYQAKLSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIM 159
           + +R + YRQ+IISNYQ        P  + LY +    F +PSF KAI+EN+EESFRRI+
Sbjct: 3   QAKRRKTYRQEIISNYQ--------PRFKGLYKLDPKLFLLPSFRKAISENTEESFRRII 62

Query: 160 SEPSPGIYKFEMLQPQFCEKLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETM 219
           SEP PG++ F+M QP F EKLL EVE+F +W +ET F I RP+  +KYG VLDDFGL+ M
Sbjct: 63  SEPFPGVFVFKMFQPDFSEKLLLEVENFRKWANETNFTIRRPDNTSKYGVVLDDFGLDIM 122

Query: 220 LDKLMDDFIRPISRVFFSEVGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGK 279
           L +LMDDFI PI +VFF EV G   DSH+GF +E G DRD ++GFHV+DS++TLNVCL K
Sbjct: 123 LKQLMDDFIFPICKVFFPEVCGTMFDSHYGFFIENGEDRDADVGFHVEDSDITLNVCLSK 182

Query: 280 QFSGGELFFRGIRCDKHVNTETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLL 339
           Q  GGE+ F G RC+KH++ + + EE FDY H+PG A+LHRG H HGARAT SGRR N++
Sbjct: 183 QGEGGEILFAGARCNKHMDIDPKPEEYFDYCHIPGQAILHRGCHVHGARATASGRRANMI 242

Query: 340 LWCRSSVFRELKKYQKDFSSWCGECQREKRERQLISIDATKQELLKREVKSPP 393
           LWC++S+FRE++ Y+ +FS WCG+C  E++E +   +   ++E+ + E ++ P
Sbjct: 243 LWCQNSLFREMQTYEPEFSDWCGQCVHEEKENKSQILAVKRKEMFRIESEAEP 287

BLAST of Cp4.1LG00g02340 vs. NCBI nr
Match: gi|449458771|ref|XP_004147120.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950 [Cucumis sativus])

HSP 1 Score: 740.0 bits (1909), Expect = 2.2e-210
Identity = 367/392 (93.62%), Postives = 375/392 (95.66%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPL 60
           MSLEASLERRKQ QAP TGNGNGVVS +  S STHRLRLQPKEDHKSE+YEDLQLEFSP+
Sbjct: 1   MSLEASLERRKQPQAPGTGNGNGVVSPTPQSLSTHRLRLQPKEDHKSESYEDLQLEFSPV 60

Query: 61  LFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLS 120
           LFSMLERHLPP+MLNVAR++KLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQ    
Sbjct: 61  LFSMLERHLPPNMLNVAREVKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQ---- 120

Query: 121 YYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180
               PLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL
Sbjct: 121 ----PLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180

Query: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVG 240
           LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFF EVG
Sbjct: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVG 240

Query: 241 GGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300
           G TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE
Sbjct: 241 GATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300

Query: 301 TQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360
           TQSEEIFDY HVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW
Sbjct: 301 TQSEEIFDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360

Query: 361 CGECQREKRERQLISIDATKQELLKREVKSPP 393
           CGECQREKRERQL+SIDATKQELL+REVKSPP
Sbjct: 361 CGECQREKRERQLLSIDATKQELLRREVKSPP 384

BLAST of Cp4.1LG00g02340 vs. NCBI nr
Match: gi|659072926|ref|XP_008467170.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Cucumis melo])

HSP 1 Score: 739.6 bits (1908), Expect = 2.9e-210
Identity = 366/392 (93.37%), Postives = 375/392 (95.66%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPL 60
           MSLEASLERRKQ QAP TGNGNGVVS +  S STHRLRLQPKEDHKSE+YEDLQLEFSP+
Sbjct: 1   MSLEASLERRKQPQAPGTGNGNGVVSPTPQSLSTHRLRLQPKEDHKSESYEDLQLEFSPV 60

Query: 61  LFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLS 120
           LFSMLERHLPP+MLNVAR++KLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQ    
Sbjct: 61  LFSMLERHLPPNMLNVAREVKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQ---- 120

Query: 121 YYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180
               PLHRELYSMHAANFFVPSFLKA+NENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL
Sbjct: 121 ----PLHRELYSMHAANFFVPSFLKAVNENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180

Query: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVG 240
           LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFF EVG
Sbjct: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFPEVG 240

Query: 241 GGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300
           G TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE
Sbjct: 241 GATLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300

Query: 301 TQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360
           TQSEEIFDY HVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW
Sbjct: 301 TQSEEIFDYLHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360

Query: 361 CGECQREKRERQLISIDATKQELLKREVKSPP 393
           CGECQREKRERQL+SIDATKQELL+REVKSPP
Sbjct: 361 CGECQREKRERQLLSIDATKQELLRREVKSPP 384

BLAST of Cp4.1LG00g02340 vs. NCBI nr
Match: gi|1009120068|ref|XP_015876721.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Ziziphus jujuba])

HSP 1 Score: 628.2 bits (1619), Expect = 9.3e-177
Identity = 309/391 (79.03%), Postives = 344/391 (87.98%), Query Frame = 1

Query: 1   MSLEASLERRKQSQ-APVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSP 60
           MS++ SL+RRKQ   A   GNGNGVV      ++ +RLRL P +DHK + YEDLQLEFSP
Sbjct: 1   MSVDGSLDRRKQPPPAQSAGNGNGVVQPGVGQYAANRLRLNPNKDHKPDNYEDLQLEFSP 60

Query: 61  LLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKL 120
           LLFS LE++LPP+ML V+RD+KLQYMR ILLRY+PEGER RVQRHREYRQKIISNYQ   
Sbjct: 61  LLFSSLEQYLPPTMLKVSRDVKLQYMRHILLRYSPEGERLRVQRHREYRQKIISNYQ--- 120

Query: 121 SYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEK 180
                PL+RELY+MHAANFFVPSFLKA+++N+EESFR IM EP+PGIY FEMLQP FCE 
Sbjct: 121 -----PLYRELYTMHAANFFVPSFLKALSDNTEESFRNIMVEPAPGIYAFEMLQPNFCEM 180

Query: 181 LLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEV 240
           LL+EVE+FERWVHETKFRIMRPNTMNKYGAVLDDFGLETML+KL+DDFIRPISRVFF EV
Sbjct: 181 LLTEVENFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLEKLLDDFIRPISRVFFPEV 240

Query: 241 GGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNT 300
           GG TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLG+QFSGGELFFRG+RCDKHVN+
Sbjct: 241 GGSTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGRQFSGGELFFRGVRCDKHVNS 300

Query: 301 ETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSS 360
           ETQSEEI DY H  G AVLHRGRHRHGARATT+GRRVNLLLWCRSSV+REL+KYQKD SS
Sbjct: 301 ETQSEEILDYSHALGRAVLHRGRHRHGARATTAGRRVNLLLWCRSSVYRELRKYQKDCSS 360

Query: 361 WCGECQREKRERQLISIDATKQELLKREVKS 391
           WCGECQREK+ERQ +SI ATK ELLKR+ K+
Sbjct: 361 WCGECQREKKERQRLSIAATKMELLKRDGKA 383

BLAST of Cp4.1LG00g02340 vs. NCBI nr
Match: gi|590614320|ref|XP_007022907.1| (2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 624.4 bits (1609), Expect = 1.3e-175
Identity = 311/393 (79.13%), Postives = 343/393 (87.28%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTG-NGNGVVSSSAPSFST-HRLRLQPKEDHKSETYEDLQLEFS 60
           MS + + +  +Q   P  G NGNGV  +  PS +T HRLRL P  +HK ETYE LQLEFS
Sbjct: 1   MSFDLTRKEPQQPTPPSAGCNGNGV--AVLPSMATAHRLRLNPNTEHKPETYEGLQLEFS 60

Query: 61  PLLFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAK 120
           PLLFS LER+LPP ML+++RD KL YMRDI+LRY+PEGER RVQRHREYRQKIIS+YQ  
Sbjct: 61  PLLFSSLERYLPPPMLSLSRDSKLNYMRDIILRYSPEGERTRVQRHREYRQKIISHYQ-- 120

Query: 121 LSYYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCE 180
                 PLHRELY+MHA+NFFVPSFLKAINEN EESFR IM+EP+ G++ FEMLQP FCE
Sbjct: 121 ------PLHRELYAMHASNFFVPSFLKAINENKEESFRSIMAEPTLGVFTFEMLQPHFCE 180

Query: 181 KLLSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSE 240
            LLSEVE+FE+WVHETKFRIMRPNTMNK+GAVLDDFGLETMLDKLM+DFIRPIS+VFFS+
Sbjct: 181 LLLSEVENFEKWVHETKFRIMRPNTMNKFGAVLDDFGLETMLDKLMEDFIRPISKVFFSD 240

Query: 241 VGGGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVN 300
           VGG TLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGG+LFFRG+RCDKHVN
Sbjct: 241 VGGSTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGDLFFRGVRCDKHVN 300

Query: 301 TETQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFS 360
           TETQS+EI DY HVPG AVLHRGRHRHGARATTSG RVNLLLWCRSSVFREL+KYQKDFS
Sbjct: 301 TETQSDEILDYSHVPGRAVLHRGRHRHGARATTSGHRVNLLLWCRSSVFRELRKYQKDFS 360

Query: 361 SWCGECQREKRERQLISIDATKQELLKREVKSP 392
           SWCGECQREK+ERQ +SI ATKQELLKRE K P
Sbjct: 361 SWCGECQREKKERQRVSIAATKQELLKREGKPP 383

BLAST of Cp4.1LG00g02340 vs. NCBI nr
Match: gi|720075726|ref|XP_010279401.1| (PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Nelumbo nucifera])

HSP 1 Score: 621.7 bits (1602), Expect = 8.7e-175
Identity = 304/390 (77.95%), Postives = 340/390 (87.18%), Query Frame = 1

Query: 1   MSLEASLERRKQSQAPVTGNGNGVVSSSAPSFSTHRLRLQPKEDHKSETYEDLQLEFSPL 60
           MS + S+ RR++SQ    GNGNGVV+SS P +++HRLRL P  DHK E Y+DLQLEFSP 
Sbjct: 1   MSCDGSVGRREESQTG-NGNGNGVVASSRPLYASHRLRLNPNTDHKPENYDDLQLEFSPS 60

Query: 61  LFSMLERHLPPSMLNVARDLKLQYMRDILLRYAPEGERNRVQRHREYRQKIISNYQAKLS 120
           +FS LER+LPPSMLNV+RD K+QYM++IL RY PEGER RVQRHREYRQKIISNY     
Sbjct: 61  VFSSLERYLPPSMLNVSRDAKVQYMKEILSRYLPEGERTRVQRHREYRQKIISNY----- 120

Query: 121 YYLQPLHRELYSMHAANFFVPSFLKAINENSEESFRRIMSEPSPGIYKFEMLQPQFCEKL 180
              QPLHRELY++H   FFVPSF+KAI+EN+EES R I+SEPSPG+Y FEMLQP+FCE L
Sbjct: 121 ---QPLHRELYTIHPTTFFVPSFIKAISENTEESLRSIISEPSPGVYTFEMLQPRFCELL 180

Query: 181 LSEVESFERWVHETKFRIMRPNTMNKYGAVLDDFGLETMLDKLMDDFIRPISRVFFSEVG 240
           LSEVE+FE+WV E KFRIMRPNTMNK+GAVLDDFGLETMLDKLMDDF+RPIS+VFF+EVG
Sbjct: 181 LSEVENFEKWVREAKFRIMRPNTMNKFGAVLDDFGLETMLDKLMDDFLRPISKVFFAEVG 240

Query: 241 GGTLDSHHGFVVEYGIDRDVELGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300
           G TLDSHHGFVVEYG DRDV+LGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE
Sbjct: 241 GSTLDSHHGFVVEYGKDRDVDLGFHVDDSEVTLNVCLGKQFSGGELFFRGIRCDKHVNTE 300

Query: 301 TQSEEIFDYFHVPGHAVLHRGRHRHGARATTSGRRVNLLLWCRSSVFRELKKYQKDFSSW 360
           TQ EEI DY HVPG AVLHRGRHRHGARATTSG R+NLLLWCRSSVFRELKKYQKDFSSW
Sbjct: 301 TQPEEILDYSHVPGQAVLHRGRHRHGARATTSGHRINLLLWCRSSVFRELKKYQKDFSSW 360

Query: 361 CGECQREKRERQLISIDATKQELLKREVKS 391
           CGECQREK+ERQ  S+ A+K EL +RE +S
Sbjct: 361 CGECQREKKERQRQSVAASKLELFRREGES 381

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y1295_ARATH5.0e-13057.99Uncharacterized PKHD-type hydroxylase At1g22950 OS=Arabidopsis thaliana GN=At1g2... [more]
OGFD2_XENTR9.1e-3936.052-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Xenop... [more]
OGFD2_DANRE3.2e-3632.382-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Danio... [more]
OGFD2_HUMAN6.1e-3540.892-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Homo ... [more]
OGFD2_MOUSE1.1e-3333.722-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 OS=Mus m... [more]
Match NameE-valueIdentityDescription
A0A0A0KPJ6_CUCSA1.5e-21093.62Uncharacterized protein OS=Cucumis sativus GN=Csa_5G576670 PE=4 SV=1[more]
A0A061GER7_THECC9.4e-17679.132-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 OS=T... [more]
A0A061G6N2_THECC4.8e-17279.112-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 2 OS=T... [more]
A0A0D2NB24_GOSRA1.8e-17177.44Uncharacterized protein OS=Gossypium raimondii GN=B456_005G118000 PE=4 SV=1[more]
A0A0B0MGW6_GOSAR1.6e-17076.98Uncharacterized protein OS=Gossypium arboreum GN=F383_22241 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G18210.11.8e-14664.56 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G22950.12.8e-13157.99 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G48740.24.7e-10249.60 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT5G43660.13.4e-10048.15 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT1G48700.15.2e-9353.58 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
Match NameE-valueIdentityDescription
gi|449458771|ref|XP_004147120.1|2.2e-21093.62PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950 [Cucumis sativus][more]
gi|659072926|ref|XP_008467170.1|2.9e-21093.37PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Cucumis melo][more]
gi|1009120068|ref|XP_015876721.1|9.3e-17779.03PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Ziziphus jujuba... [more]
gi|590614320|ref|XP_007022907.1|1.3e-17579.132-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [The... [more]
gi|720075726|ref|XP_010279401.1|8.7e-17577.95PREDICTED: uncharacterized PKHD-type hydroxylase At1g22950-like [Nelumbo nucifer... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0031418L-ascorbic acid binding
GO:0005506iron ion binding
GO:0016705oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0016491oxidoreductase activity
Vocabulary: Biological Process
TermDefinition
GO:0055114oxidation-reduction process
Vocabulary: INTERPRO
TermDefinition
IPR006620Pro_4_hyd_alph
IPR005123Oxoglu/Fe-dep_dioxygenase
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006554 lysine catabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0019538 protein metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0008475 procollagen-lysine 5-dioxygenase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG00g02340.1Cp4.1LG00g02340.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005123Oxoglutarate/iron-dependent dioxygenasePROFILEPS51471FE2OG_OXYcoord: 245..344
score: 10
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 164..343
score: 9.1
NoneNo IPR availablePANTHERPTHR24014FAMILY NOT NAMEDcoord: 33..387
score: 4.8E
NoneNo IPR availablePANTHERPTHR24014:SF42-OXOGLUTARATE AND IRON-DEPENDENT OXYGENASE DOMAIN-CONTAINING PROTEIN 2coord: 33..387
score: 4.8E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG00g02340Cucurbita moschata (Rifu)cmocpeB080