Lsi04G010940 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi04G010940
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Locationchr04 : 13194082 .. 13205762 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTACCCTTTTTAATTTTAATTCATCGTCATAAATTTCCGATGCATGATTAAATGTGCAAATTCAACCGCTTCTCTTCCTCTTCATTTTCTCTCCCCTTCTTCTAGTTCTAATGGCTTCCTCACTCGATCATAATCAGCTACCAAACATCCACGGCGGCGCCACCGCAGCTCCTCCACCTACTCCCTCCTCACAAACCAACCACCTCTCCACCTCCGCCGCCGCCGATGCTCTCTCCAGGCTCCTCCATCGCCTACCGCCCAATCTCTCTCTCCCTACTCGCCGCTCCCCCTCCGTTAGGTCACCGCCGACGATTTCCTTCTCAGAATCTCCCAATCCTGACCTTCTCGACCGCCTTCTCTCCGCCGCCTCTGAACTCGGCTTCTTCCAACTCACCGATCATAAAATCTCTTCTCATCTCGCTCTCTCCGCCGAGTCGGAATCCGCCGCTCTGTTCGACCTTCCGGCGGAGAAGAAGGAGTCTTTATTTCCGAAAAACTGGCCTCTCGGATTTTATGGCGATGGAGACGAAGAATCGAACGGACTCGGTGAATCGTTCTGCTTCGATTCGAGACCGCGTTTCTCCGATTTGGCCGAAATTTCTCTACATTCTTTGGAGGAGTTCGTTCTTGAAATGGAGAGCCTCGGATTGAAGATCGTCGAGTTTCTGTTTCGTGCGATTGGCTTCGAGAATCCAATCGGTGAAGATCGTACCGGATTCCGGTCGCTAATATGGATATCGGATGGTTGTCCGAGCACGGAACCGGCAATGGCCGGTGGATTTTACCCGTACATTGTTGGATTGCAGTATCAGAGCAAAAACCAGAGGTGTTCTCTGTTAGGGGATTCCGGCTTGGTTGCGGCGGCGGATTCTGTGACGGTCTCCATCGGCGATATTGCGCAAGTAAGTTATCTAATCCATACGCTGTCGTTTTGTTGATATTATTTTTTGAAAGGATTTGGATAGATAATGAGATAATTACGTAATTTAATTATTATTATTATTTTTTAAAGTACAAATGGGGAAGGGATTTGAACCACGAGTACATAGATGCCCATTGAGCTAATAATTAATTAATTAATTAATGGTCAAGTATTTTTCATAATTAATAATTATTTTTAAAATGTCACAATTAATAATTCAAAAGATGTTGAATTGAGGCATCGGTCACGTGTGTTGGTGACAAATATGCTTGTTTCACAAATCCAGACAAGCATGGCAACATCTTTCTAGGGGAAAATTGAGAAAATAAGATAGTTTTTCAAAATATTTATAGAAATATTGCAGTTTTGAAAATATTTGTAAAAATATTGTAGTTTTGTGATGCGATGGAAGGTATTGTAGTTTTGTGATTAGGTGAAAGGTGTGATGTATTTAATTGGATTGGGCAAATGGTTTTTTTTTTCAAATTTTTAAATGGTCGAACACTCGACTATTCTCTATCTTTCTTTACAAATTTTCAGAAGGTCGATATCAATATAAAAATTTAATGTTTTTCATTCATTAAAAATAATAATAAAAAAATCAATGTGTCTCACAAGCGGACGTCATCGATTTCGAGTTGGTTGTCGTCATCGACTTTGAACCCTAGGCTGCTCTTGTTCCTCTCGATCTTCTGGTGGCTCTGGTACTTCAGCGCTTCATAGTATAAATATTCGTTATGCTCTCCACGTCCGCGCCTATGTCCTTGCATGTATGAAAACGATAGACCGGCCTCTAAATAATAATATCCTGAACCAAATATTGGTATCGTCATCATCATACTAGCATAATCAGCCTGACTATGCGTCTGCGTCATAGGAGACAATTCTTCCACATTGTGGCCAACCTCATCGAACTCATCTTCTTCTACTCGAGCATTCCCCCTACGCCTAAAAACTAGTGGAGATGCAATTGGTATATCGTACATATATTGTATCTTCTCCATGTAGCTCTTGTTATTATTACATACAACAATTACTTGGTTTGTCCATGTAGCTCTTGTTATTATTACATACAACAATTACTTGGTTTGGATCATTTGCAACATCACGAAGGCCACTGTTGCTTGATCTCTTTGAATTATAAAACAATTAAATAATATTTGAAATAAATTCAAATTTAGAATCATTAAACAATATTTAATTATATATCAGATGACCCACAACTGCTCTAGATGAGTAATGTATCGCCTTGTTATATTATTATACCAATTGATATAATCTTGAGTTGCATTCCTATCGAATTACCTAATTAAAACCTCTGTTGCAATAAACCTTCTACGATAGTACCATCACACTACCAAATGTGCAACTTTTTCGAACCAATCTGCGGTTCGCAAGTCGATGTCTTGCTCAGTATTACGCTCTGGTGGGACATCCTACTAGAAACCAAATTGTTTGATTACCTTGTTGGGGAAATGTCACTCACCAATATAAAAACATATGAGAGGACTTATCATTCGCCATATGTTTGACCGTTTGTACAGAAATTGGGCAAAGTGTGCATATCAGCTTTGTAGGGCTCCTAAATAATCTGAAACATAACGTAGTATATTAAAAATATTAAAGAAAAGTACATAAAAAATAGGTAGAAAAAATTCAATTGGATTCAGTATTCATGTACTTGATCGGGTTGAAGCAGATCCAACATGTATCTATATTGACTCACAACATGTGTAGTTGATCTAGTTACACAAAATTGGTCTCTCCATCTAAATTATTAAATCAATATTTTAGATTTTAAATTAACTATATCAGTTATATAAAAAATAATCTGTAAATTAAATTAAATTAAATTAATATATCGAGAATCGTATACTTGTCCAGCTAATTGATGCTCATTAACATGTCGAAGTAGGGGAGCCATTGTAGGAAATCGTTTCCATGCCTATAATTGCAGAAGTATCAGAGACCTAGTGATTTCACGAACTTTTAGGTTTTGTTGCTTTGCATAGCTACTTATACAACCATGTCAAACACGCTCCACTCCAAGAATATCATCCCGCTTCAAGAAGATTGACTACTAATGGCAGGAACATTAAGTGCACGTAGTGACTTGATATGTCAGAAAACAAACTTCCACCCATCATCTGTAATATGTATGCTCGTGTATATCTCATGACAACTTCCTTGTCTGCATCATCATCAAGCCCTGGAAATTGTGTCCCTAACCATGTTAAACTTAATCTTGATCCTTTGATTTTATCAACAGGTGGGATTACACTGACTAACAACTGACAAATATTCAACCAATCATTCTACATTGCTCCGGTAACAGGCTCGTCGTCAACAAGTAACACAAACAATACTTCTATGTCTTGTAGGGTGATAGTGCACCCTTCAATAGGCACATGAAACGTATGCGTCTCTGACATCCATCTCTCAACTAGTGTAGTGATGAGATGTCAATCTAACTGAATGAATCCCAATCTGGCAACTCCATAAAATCCTAATGTACGAAGTAGTGGTAGTATTTTATGGTTGAGTGGGATGGTGTGATGAACAACCGCCTCTCCACGTTTACAATATATCTCACATTTAGTACGATCTTGTCATACAACTGATGATCGATGAATATATTGATCGTATAAAACATAAGAATCAATCAGTCTTGGGTTTAAAGTCATGATCTAAAAGAAAAATATATATTTATTTCTCAAAGCTTGATTACACAATAAAATTATTAATTACACAATAAAAATATTAATTATATGATAGGAATATTAACAATAAAAATTCCTAATTGCTTGGTGCCGGTCTTTGTAATGAAGGACACTTCCTCCTATTGTGATCTAACTAATTGCAAAATCCACACCGTACATAACTACTGGTTTCTGACCAGTCCATCTCATTGTGGTACAATGAACTCTTTGGTTTACCAGGCTTCCTCAACAACGATGAGTCGGCATATAAGACAGACATATTTGGATGTATGATCCAACAATCTTCGTGTCGTATAGATTGAAATTGAGGAGAGTAGCATGCAGCGTATGTTGATAGTTTATAGTAGTCCTAAATCAAATCTTCATGGTTCATGTTAAACCGTGAGCAAACCGCCATAAAATGCGAGTATGGGATGCCAAATGCTTGCCACTTGTTACATGAACAGTACCGCTCATACCCATTTAGCCTCAGTGTTTGAATATTTCCTCCTTTAGCATTGAAGCTTTGACGTCTGGTTTTCACGTGAAACGCACCTTCATATCGATCATATGACTATACTTCATGTTTACTGGATCTTGCACCCATTTAAAAAAAATTTCGTGTGCGTACCTAAATGTTTACTCACTTAGTATTACAAATATATTCGAAACGAAATTGTTGATTACTAAAAAAAGAAAAAAATACTACCTGGTATATTTATCTCGATGCTCTAAAGCAGCTCTTATTTCTTCTCTTCTTATTTCTTCTCTTCTTTTTTCAAAATAATTTACACATCTGAAAAATGTTACTTGAGCGAGAGCGTTTATCGGCAACATTTGAGCTCATTCGAGGACTCCATTGATGCACTCGGATAGATTTGTCGTCATCCACCCGTATCTGACACCTCCATCATGTGCTTGAGTCCATTGCTCAATACCAATGTTGTCAAAAAAAACTCAAACAACTCTGATTTATTCTTCTAATGTCCTCAATCGCTTTATTTAATTTGCGAATTTGAAATTGACATCCGGCCCGATAAACATAATTATTTAATGGATTGAATTTATACTTTTATTGAAGTTGCTGACAATATGTCGTAAGCAGTATCGATGATGACATTTTGGTCTCGTCTAACCATTTTCTAGATTGTTCATTGCTGATATAATGCTAGCATGTCGATCTGAAATGCTGACATTATCTCAACAATGAACAATCCAGAAATTGGTTGCAAATATTTCCCAAACTACAATATTTGATTTCAGATCAACATGCTAAAATTATCTCAGCAATGACCAATCTAGAAAATGGTTGGACGGGACCAAAACGTCATTGTCGATACTCCTTACGACATATTGTCAGCAACTTCAATAAAAAAAGTATAAATTCAATCCATTAAAGAATTATGTTTATCGGGCCAGATGTTAATTTCAAATTCAGAAGTTAAATAAAGCGATTGAGGACATTAGAAGAATAAATCAGAGTTGTTTGAGTTTTTTTTTACAACATTGGTATTGAGTAATGGACTCAAACACATAATAGAGGTGTCAGATACGGGTGGATGACGACAAATCTATCCGAGTGCATGAATGGAGTCTTGAAAGGAGCTCAAATGTTGTCGATAAACGCTCTCGCTCAAGTAACATTTTTCAAATGCGTAAATTATTTTGAAAAAATAAGAGCTGCTTTAGAGCATCGAGATAAATATACGAGTTAATATTTTTTTTCTTTTTTTTAGTAATCAATAATTTGTTTCGAATATATTTGTAATACTAAGGGAGTAAACATTTAGGTACGCACATGAAATTTTTTTGATATGGGCTACAAGATCCAGTAAACATGAAGTACAGTCATATGATCGATATGAAGGTGTGTTTCATGTGAAAATCGGCAACCTCAATGCTAAAGGAGGAAATATTCAAACACTGAGGCTAAATGTGTCTGAGCGGTACTGTTCATGTGACAAGTGGCAAGCATTTGGCATCTCATGCTCGCATTTTATGGCGGTTTGCTCACGGTTTAACATGAACTATGAAGATTTCATTTAGGACTACTATAAACTATCAACATACGCTGCATGCTACTCTCCTCAATTTCAATCTATACGGCACGAAGATTGCTGGATCATACATCCAAATATGTCTGTCTTATATGCCGACTCATCGTTGTTGAGGAAGCCTGGTAAACCAAAGAGTTCATTGTACCACAATGAGATGGACTGGTCAGAATCGAGTAGTTATGTACGGTGTGGATTTTGCAATCAGTTAGATCACAATTGGAGGAAGTGTCTTTCATTACAAAGACCGGCACCAGACAATTAGAAATTTTTATTGTTAATATTCCTATCATGTAATTAATAATTTTATTGTGTAATCAAGCTTTGAGAAATAAATATATTTTTTTCTTTTAGATCATGACTTTAAACTCAGGACTGATTGATTCTTATGTTTTATACGATCTATAAATGTGAGATATATTGTAGACGTCGAGAGGCGGTTGTTCATCATACCATCCCACTCAACCATAAAATACTACTACTACTTCGTACATTAGGATTGTATGGAGTTGCCAGATTGGGATTCATTCAGTTAGATTGACATTTCATCACTGCACTAGTTGAGAGATGGATGTCAGAGACGCATACGTTTCATGTTCCTATTGAAGGGTGCACTATCACCCTACAAGACATAGAAGTATTGTTTGTGTTACTTGTTGATAGCGAGCCTGTTACCGGAGCAATGCAGAATGATTGGTTGAATATTTACTAGTCGTTACTTGGTGTAATCCCACCTGCTGATAAAATTAAAGGATCAAGATTAAGTTTAACATGGTTAGAGACACAATTTCCTGGGTTTGATGATGATGCAGATAAGGAAGTTGTCATGAGATATACACGAGCATACATATTACAGATGATGAGTGGAAATTTGTTTTCTGACTAATCAAGTCACTACGTGCACTTAATGTTTCTGTCATTGTTAGCCAATCTTCATGAAGCGGAACAATATTCTTGGAGCGGAGCGTGTTTGGCATGACTGTATAGACAACTGTGCAAAGCAACAAAACTTAAAGTTCGTGAAATTGCTAGGTCTTTGATACTTATGCAATTATGAGTTTGGGAAAGATTTCCAACAATAGTTCTTCAACTCCGACATGTCAATGAGCATCAATAACTGGACAAGTATACGGTTCTCGATATGTTAATTTAATTTAATTTACAAATTATTTTTTATATGACTGATTTAGTTAATTTAAAATCTAAAATATTGATTTAATAATATAGGTGAAGAGACCAATTTTGTATAACTATATCAGCTACATATGTTGTGAGTCAATATAGATACATGTTGGATCTGCTTCAACCTGATTAGGTACATGAATACTTAATCCAATTGAATTTTTATACATATTTTTTATGTACTTTTCTTTAATATTTTTAATATACTACGTTATGTTTCAGATTATTTGGGAGCCCTACAAAGTTGATATGCACACTTTGTCAAATTTCTGTACCAACGGTCAAGACATATAGCGGACGATAAGTCCTCTCATATGTTTTCATATTGGTGAGTGGCATTTCTCCAATAGGGTGATAAGACAATTTGGTTTCCAATAGGATGTCCCACGAGAGTGTAATATTGAATCCTTGCTACATGACATCGACCTGAGAACTGCAGATTGGTCTGAAAAAGTTGCAAATTTGGTAGTGCGATGGTACTATCGTAGAAGGTTTATTGCAATGAGGATTTCAATTGGGCAATTCGATAGGGATGAAACTCAATATTATATCAATTGATATAATAATATAACAAGACGCTACATTACTCGTCTAGGAGCAGTTGTGGGTCATCTTGTTTAATGATTTTAAATTTGAATTTATTTCAAATATTATCTAATTGTTTTATTATTCACAGAGATCGAGCAATAGTAGCTTTCGTGATGTTGCAAATGATCCAAACCAGGTCCTTACTGTATGTAATAATAACGAGAGCTACATGGAGGAGATACATTATATGTACAATATACTCATTGCACCTCCACCAGCTCCTAGGCGTAGGGGGCATGCTCAAATAGAAGAAGATGAGTTCGATGATGTTGGTCGCAATGTGGAAGAATTGCCTCCTATGACGCAGACACATAGCCAGGTTGATTATGCTAGTCCGATGATGATGATGCCAACATTTGGTTCAGGATATTATGATTTAAAGGTCGGTCCATCGTCTTCATACATGCAAGAACATGAGTGCAGACGTGGAGAGCATAATAAATATTTATACTATGAAGCGCCTGCAGAGGTACTAGAGCCACCAGAGAGGATCAAGAGCAACCTAGAGTTCAAAGTCGACGACGACAACCAACTCGAAATCGACGACATCCGCCTTGTGGGACACATTGATTTTTTTTTTATTATTATTTTTAATGAATGAAAAACATTAAATTGTTATATTGATATGAACATTTTAATTTACATTTCAGTATTTTTAAATTATGAATCATAGTTGTAATTTGTAAGAAAAGAGATTGAGAATTTATTATTTACTATAAGTCTATAATGTCTCTTACGACAAAGTTAAAATAGGGATAAATTTTGTATTAAAAAAAAAGGAAAAGAGAGCATTAGAAGAAAAAAAAGGGATGAACACTCGACCACTTGAAAATTTGAAAAAAAAAAATAACCACTTCACTCAACCCAATTAAATACATCACACCTCCCATTCCATCACAAAACTACAATATTTTTGCAAATATTTTCAAAACTACGATATTTCTACAAATATTTTGAAAAACTACCATATTTTCTCAATTTTTCCTTGGTTGTGATAGATGTTGATAGAAGTCTATCAGTGATAAACTTTTATCATTTCTATCATTGATAGATGTTGATAGAAGTCTATCAATGTCTGTCAATATTCTCTTTTGCTATAAATAGTTTTACATTTTTTTTATAGGTGAAAATTTTCCTTTAAGGGTGTATTACCTCGTTATTCGCATTAACTTACATTTCTCATCTCATATTTGTATTAACTCCCTCAATTACTCCCCACCTATAATAATTACCAACTCAACTCAATTATGCGCCCCAAACGCCCCCTAAGAAAATATAACATTTTTTTTATTCTTTTAATGGATGTGGCTCCATTTAAAAAATTAACTGAAGTTTGATTGTGGTTCTAAACTTAACGGTTCCATATTTTCTTATGTTAAAATTCGGATTCAATCACCCTGAAATATGAGTTTGGAACGGTGGCATCGTGCACTACAATTTTTTCCTCCATCAATTGGGTCGAATTAGACTTTAAGCATTCAATATATATATATATATATATATATATATATATATATATATATATATATATATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATATATATATATATATATATATATATATATATATATATATATATATATATATATTTGGAAAGACGGTAAGGATCTGTTTGGATTGACTTTCTAAGAGGATGTTTGAAGTAAGGATTGGAATAGTGAGTAGTAAGGAGTTGTGAACTCCTTGAGCCCACAGTGTAAGGAGTTGATAAGATATAAAATTGATGTTTTCTAACCGTGGGACCCACCAACTCTTTGGACTAAACAAGGAGTGGAGTTCATAAAGTTACTTTCCTACCCCTACTTTTACTTTCCTACCCCTACTTCCCAACTCCTTGAGTTTTCAAATTTGACATGTTTGGTCACTATTTAAATTTGATATACTCAAACTTTATAGGTGTGGAGTAACGGCAAACTCAAGAAAGTGAGAGGGAGACCGGTGCCAACGATAGCGAGCATAGGAAATACCAACAACACCAACTCATGCATAATCTCATTATCGCTACTCATTACTCTTCCTGTTGACAGTCAAGTGTCTCCCCTTCTTCAACCAACTAATGAAAATGCAAATGTGGAACAATTCAGAAGTGTCGGCGATAGAAAGGAAAAAGAAGAAGACGAGGACAACAACAGTGAGGCAAAGGAACAAGGCAGTGTTTCATTCATTTAGCTTTGAAGAGTACGCATGGAGAGCGTATCACGGTTGATGCTTTCTGCTTAAAGATCCACTTGACAGATATCAAATTTGAATCGATCGATGGATGGATTGAATTGCCATTACTTTTCTATTTAAAGCACGACTCTGATGGATGAAGCAGCGACATGTGAGATTGTTTAATATTATGTTATTTATATTATGTTCATTTCCTATATTACAAGTCCATTGGACCTTTTAGCCTTTGTATCTTGACTAATATTTAATGTTGTTTTACAGCCTCAATTTAGTAAGGGTGAAATCGTAATTCTATACCTTCTCCATTATTCGACCCTCTCTTTATTTTGTATGATGGTATCACAGCACTTTATTTTGTATTGATGGTTGTGAGGTTGCAATTGATTACAGACCAAAAATCAATTAAATACAGTTTGGTGAAATCTTCA

mRNA sequence

TCTACCCTTTTTAATTTTAATTCATCGTCATAAATTTCCGATGCATGATTAAATGTGCAAATTCAACCGCTTCTCTTCCTCTTCATTTTCTCTCCCCTTCTTCTAGTTCTAATGGCTTCCTCACTCGATCATAATCAGCTACCAAACATCCACGGCGGCGCCACCGCAGCTCCTCCACCTACTCCCTCCTCACAAACCAACCACCTCTCCACCTCCGCCGCCGCCGATGCTCTCTCCAGGCTCCTCCATCGCCTACCGCCCAATCTCTCTCTCCCTACTCGCCGCTCCCCCTCCGTTAGGTCACCGCCGACGATTTCCTTCTCAGAATCTCCCAATCCTGACCTTCTCGACCGCCTTCTCTCCGCCGCCTCTGAACTCGGCTTCTTCCAACTCACCGATCATAAAATCTCTTCTCATCTCGCTCTCTCCGCCGAGTCGGAATCCGCCGCTCTGTTCGACCTTCCGGCGGAGAAGAAGGAGTCTTTATTTCCGAAAAACTGGCCTCTCGGATTTTATGGCGATGGAGACGAAGAATCGAACGGACTCGGTGAATCGTTCTGCTTCGATTCGAGACCGCGTTTCTCCGATTTGGCCGAAATTTCTCTACATTCTTTGGAGGAGTTCGTTCTTGAAATGGAGAGCCTCGGATTGAAGATCGTCGAGTTTCTGTTTCGTGCGATTGGCTTCGAGAATCCAATCGGTGAAGATCGTACCGGATTCCGGTCGCTAATATGGATATCGGATGGTTGTCCGAGCACGGAACCGGCAATGGCCGGTGGATTTTACCCGTACATTGTTGGATTGCAGTATCAGAGCAAAAACCAGAGGTGTTCTCTGTTAGGGGATTCCGGCTTGGTTGCGGCGGCGGATTCTGTGACGGTCTCCATCGGCGATATTGCGCAAGTGTGGAGTAACGGCAAACTCAAGAAAGTGAGAGGGAGACCGGTGCCAACGATAGCGAGCATAGGAAATACCAACAACACCAACTCATGCATAATCTCATTATCGCTACTCATTACTCTTCCTGTTGACAGTCAAGTGTCTCCCCTTCTTCAACCAACTAATGAAAATGCAAATGTGGAACAATTCAGAAGTGTCGGCGATAGAAAGGAAAAAGAAGAAGACGAGGACAACAACAGTGAGGCAAAGGAACAAGGCAGTGTTTCATTCATTTAGCTTTGAAGAGTACGCATGGAGAGCGTATCACGGTTGATGCTTTCTGCTTAAAGATCCACTTGACAGATATCAAATTTGAATCGATCGATGGATGGATTGAATTGCCATTACTTTTCTATTTAAAGCACGACTCTGATGGATGAAGCAGCGACATGTGAGATTGTTTAATATTATGTTATTTATATTATGTTCATTTCCTATATTACAAGTCCATTGGACCTTTTAGCCTTTGTATCTTGACTAATATTTAATGTTGTTTTACAGCCTCAATTTAGTAAGGGTGAAATCGTAATTCTATACCTTCTCCATTATTCGACCCTCTCTTTATTTTGTATGATGGTATCACAGCACTTTATTTTGTATTGATGGTTGTGAGGTTGCAATTGATTACAGACCAAAAATCAATTAAATACAGTTTGGTGAAATCTTCA

Coding sequence (CDS)

ATGGCTTCCTCACTCGATCATAATCAGCTACCAAACATCCACGGCGGCGCCACCGCAGCTCCTCCACCTACTCCCTCCTCACAAACCAACCACCTCTCCACCTCCGCCGCCGCCGATGCTCTCTCCAGGCTCCTCCATCGCCTACCGCCCAATCTCTCTCTCCCTACTCGCCGCTCCCCCTCCGTTAGGTCACCGCCGACGATTTCCTTCTCAGAATCTCCCAATCCTGACCTTCTCGACCGCCTTCTCTCCGCCGCCTCTGAACTCGGCTTCTTCCAACTCACCGATCATAAAATCTCTTCTCATCTCGCTCTCTCCGCCGAGTCGGAATCCGCCGCTCTGTTCGACCTTCCGGCGGAGAAGAAGGAGTCTTTATTTCCGAAAAACTGGCCTCTCGGATTTTATGGCGATGGAGACGAAGAATCGAACGGACTCGGTGAATCGTTCTGCTTCGATTCGAGACCGCGTTTCTCCGATTTGGCCGAAATTTCTCTACATTCTTTGGAGGAGTTCGTTCTTGAAATGGAGAGCCTCGGATTGAAGATCGTCGAGTTTCTGTTTCGTGCGATTGGCTTCGAGAATCCAATCGGTGAAGATCGTACCGGATTCCGGTCGCTAATATGGATATCGGATGGTTGTCCGAGCACGGAACCGGCAATGGCCGGTGGATTTTACCCGTACATTGTTGGATTGCAGTATCAGAGCAAAAACCAGAGGTGTTCTCTGTTAGGGGATTCCGGCTTGGTTGCGGCGGCGGATTCTGTGACGGTCTCCATCGGCGATATTGCGCAAGTGTGGAGTAACGGCAAACTCAAGAAAGTGAGAGGGAGACCGGTGCCAACGATAGCGAGCATAGGAAATACCAACAACACCAACTCATGCATAATCTCATTATCGCTACTCATTACTCTTCCTGTTGACAGTCAAGTGTCTCCCCTTCTTCAACCAACTAATGAAAATGCAAATGTGGAACAATTCAGAAGTGTCGGCGATAGAAAGGAAAAAGAAGAAGACGAGGACAACAACAGTGAGGCAAAGGAACAAGGCAGTGTTTCATTCATTTAG

Protein sequence

MASSLDHNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALSRLLHRLPPNLSLPTRRSPSVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESLGLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQRCSLLGDSGLVAAADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTNSCIISLSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSEAKEQGSVSFI
BLAST of Lsi04G010940 vs. TrEMBL
Match: A0A0A0KP52_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G208470 PE=4 SV=1)

HSP 1 Score: 463.0 bits (1190), Expect = 3.3e-127
Identity = 237/268 (88.43%), Postives = 248/268 (92.54%), Query Frame = 1

Query: 1   MASSLDHNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALSRLLHRLPPNLSLPTRRSP 60
           MASSLDHNQLPNIH GATAAPPPTPSSQTNHLSTSAAADALS+LLHRLPPNLSLPTRRSP
Sbjct: 1   MASSLDHNQLPNIHAGATAAPPPTPSSQTNHLSTSAAADALSKLLHRLPPNLSLPTRRSP 60

Query: 61  SVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAE 120
           SV  PPTISFSESPNPDLL+RLLSAASELGFFQLTDHKISSHLALSAESESA LF+LPAE
Sbjct: 61  SVIPPPTISFSESPNPDLLNRLLSAASELGFFQLTDHKISSHLALSAESESAPLFNLPAE 120

Query: 121 KKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESLGL 180
           KKESLFPKNWPLGF GDGDEES+G GES CFDSR   SD  EIS HSL +FVLEMESLGL
Sbjct: 121 KKESLFPKNWPLGFKGDGDEESDGSGESLCFDSRNCLSDSPEISFHSLTDFVLEMESLGL 180

Query: 181 KIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQRC 240
           KIVEFLFRAIGFENPIGEDRTGFRSL+WIS+GC STEPAMAGGFYPYI+GLQYQS+NQ+C
Sbjct: 181 KIVEFLFRAIGFENPIGEDRTGFRSLVWISEGCRSTEPAMAGGFYPYIIGLQYQSRNQKC 240

Query: 241 SLLGDSGLV---AAADSVTVSIGDIAQV 266
           SLLGDSG V   AAADSV VSIGDIAQ+
Sbjct: 241 SLLGDSGWVAAAAAADSVMVSIGDIAQL 268

BLAST of Lsi04G010940 vs. TrEMBL
Match: A0A061FKB2_THECC (2-oxoglutarate and Fe(II)-dependent oxygenase domain-containing protein, putative OS=Theobroma cacao GN=TCM_042485 PE=4 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 1.3e-86
Identity = 195/360 (54.17%), Postives = 252/360 (70.00%), Query Frame = 1

Query: 1   MASSLDHNQ--LPNIHGGATAAPPPTPSSQTNH--LSTSAAADALSRLLHRLPPNLSLPT 60
           MASS    Q  LPN++G ATAAPPPTPS Q NH  +++SA ADALS+LLHRLPP LSLPT
Sbjct: 1   MASSTHKQQQHLPNLYG-ATAAPPPTPSGQPNHHLVTSSATADALSKLLHRLPPTLSLPT 60

Query: 61  RRS-PSVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALF 120
            RS PS  SP T+SFS+   P+L D LLS+ S++GFFQLT H +SS LA SAE+ES +LF
Sbjct: 61  HRSAPSTASPRTVSFSD---PNLKDLLLSSGSKVGFFQLTSHDVSSQLANSAETESLSLF 120

Query: 121 DLPAEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEM 180
           +LP E+KES FPKNWPLGF  D +E  +  GESFC D+    ++L  +SL SL EF   +
Sbjct: 121 ELPKEQKESCFPKNWPLGFDADEEEGGDEKGESFCLDATCS-TELTNLSLSSLREFTRAL 180

Query: 181 ESLGLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQS 240
           E LGLKI++ L  A+GFENPIGED T F SL+WI +G    +   +GGFYP+++GLQYQ 
Sbjct: 181 EKLGLKIIDTLANAVGFENPIGEDPTRFCSLMWILEGLHGDDKP-SGGFYPFVIGLQYQI 240

Query: 241 KNQRCSLLGDSGLVAAA---DSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTN 300
           + Q+ SLL +SG V+ +   DS+ V++GDIAQVWSNGKL+KVRGRPV      GN    N
Sbjct: 241 RCQQYSLLSESGWVSVSPEVDSIMVTLGDIAQVWSNGKLRKVRGRPVAACLDDGN----N 300

Query: 301 SCIISLSLLITLPVDSQVSPLLQPT---NENANVEQFRSVGDRKEKEEDEDNNSEAKEQG 350
           S  +S+SLL+TLP+DSQV+PLL      +ENA+ ++ R          D++  +E K++G
Sbjct: 301 SRHVSMSLLLTLPMDSQVAPLLTKVIADDENASDDEIR----------DDEIGTEGKKEG 340

BLAST of Lsi04G010940 vs. TrEMBL
Match: A0A151U1H0_CAJCA (Gibberellin 2-beta-dioxygenase 2 OS=Cajanus cajan GN=KK1_005679 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 1.6e-86
Identity = 190/349 (54.44%), Postives = 248/349 (71.06%), Query Frame = 1

Query: 1   MASSLDHNQL-PNIHGGATAAPPPTPSSQTNHL-STSAAADALSRLLHRLPPNLSLPTRR 60
           MASS    QL PN++GGAT+APPPTP++Q N L STS AADALSRLLHRLPP LSLPTR 
Sbjct: 1   MASSTHKQQLLPNLYGGATSAPPPTPAAQPNSLLSTSDAADALSRLLHRLPPTLSLPTR- 60

Query: 61  SPSVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLP 120
                SP + + +  P+  L D +LS  S+LG+ QLTDH + S LA SAESE+ ALFDL 
Sbjct: 61  ---CASPSSAAATCPPSLSLNDDVLSCVSQLGYAQLTDHSVPSGLANSAESEALALFDLS 120

Query: 121 AEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESL 180
            ++KE+LFPKNWPLG+   GDEE  GL ESF  DS  R+++ +E++L SL E  LE+E L
Sbjct: 121 RDRKEALFPKNWPLGY---GDEEDEGLAESFRLDS-ARWTESSELALASLRELALELEKL 180

Query: 181 GLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQ 240
           GLKIV+ L + +GFENP+G D T FRSL+W+S+    ++P ++GGFYP++VGLQ+Q +NQ
Sbjct: 181 GLKIVDGLTKELGFENPLGHDPTRFRSLMWVSECVRGSKPDLSGGFYPFVVGLQFQIRNQ 240

Query: 241 RCSLLGDSGLVAA---ADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTNSCI 300
           + S+L DSG V+     DS+ V++GDIAQVWSNGKLKKVRGRPV   A++G  N+T    
Sbjct: 241 KYSMLSDSGWVSVLPHVDSILVTVGDIAQVWSNGKLKKVRGRPV---ATVGEENDTR--C 300

Query: 301 ISLSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSE 345
           I++SLLITLP +S+V+PLL   ++             KE+ E+E NN E
Sbjct: 301 ITMSLLITLPTESRVAPLLPNKDQT------------KEESEEESNNGE 324

BLAST of Lsi04G010940 vs. TrEMBL
Match: A0A0B0PL18_GOSAR (Gibberellin 2-beta-dioxygenase 1-like protein OS=Gossypium arboreum GN=F383_07507 PE=4 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 2.5e-82
Identity = 178/320 (55.62%), Postives = 229/320 (71.56%), Query Frame = 1

Query: 7   HNQLPNIHGGATAAPPPTPSSQTNH--LSTSAAADALSRLLHRLPPNLSLPTRRSPSV-- 66
           H  LPN++G ATAAPPPTPS+Q NH  +++SAAADALS LLHRLPP LSLP RRS +   
Sbjct: 11  HQHLPNLYG-ATAAPPPTPSAQPNHHTVTSSAAADALSNLLHRLPPTLSLPKRRSSTSAT 70

Query: 67  -RSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAEK 126
            R+ PT+SFS+   P+    L+S+ SELGF QLT+H I S LA SAE+ES +LF+L  ++
Sbjct: 71  SRALPTLSFSD---PNFNHLLVSSGSELGFLQLTNHDIPSQLANSAETESLSLFELTRDQ 130

Query: 127 KESLFPKNWPLGFYGDGDEES----NGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMES 186
           KES FPKNWPLGF  D D+E     +G GESFC D+       A++SL SL EF   +E 
Sbjct: 131 KESCFPKNWPLGFDADDDDEEEEDGDGKGESFCLDTECSTETTADLSLTSLREFTRALEK 190

Query: 187 LGLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKN 246
           LGLKI++ L  A+GF+N IGED T FRSL+WIS+G        +GGFYPY++GLQYQ + 
Sbjct: 191 LGLKIIDKLADAMGFDNSIGEDPTRFRSLMWISEGLHGDHDKPSGGFYPYVIGLQYQIRC 250

Query: 247 QRCSLLGDSGLVAAA---DSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTNSC 306
           Q+ SLL DSG V+ +   DS+ V++GDIAQVWSNGKL KVRGRP+   A +G+ N +   
Sbjct: 251 QKYSLLKDSGSVSVSPQVDSIMVTLGDIAQVWSNGKLSKVRGRPM---ACLGDGNKSR-- 310

Query: 307 IISLSLLITLPVDSQVSPLL 315
            +S+SLL+TLP +S+V+PLL
Sbjct: 311 FVSMSLLVTLPCNSRVTPLL 321

BLAST of Lsi04G010940 vs. TrEMBL
Match: G7K254_MEDTR (2-deoxymugineic-acid 2-dioxygenase-like protein, putative OS=Medicago truncatula GN=MTR_5g082850 PE=4 SV=1)

HSP 1 Score: 310.8 bits (795), Expect = 2.1e-81
Identity = 185/362 (51.10%), Postives = 246/362 (67.96%), Query Frame = 1

Query: 1   MASSLDHNQL--PNIHGG-ATAAPPPTPSSQT---NHLSTSAAADALSRLLHRLPPNLSL 60
           MASS   +Q+   N+HGG +T+APPPTPS+ T   NHLSTS AADALSRLLHRLPPNLSL
Sbjct: 1   MASSTHTHQILPQNLHGGGSTSAPPPTPSTTTTNNNHLSTSTAADALSRLLHRLPPNLSL 60

Query: 61  PT--RRSPSVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESA 120
           PT  R SP+  SPP++SFS S  P+   +L+S+ S+LGF QLTDH +SS LA  AESES 
Sbjct: 61  PTIRRSSPTTTSPPSLSFS-SLTPE---KLISSISQLGFIQLTDHSVSSKLANLAESESL 120

Query: 121 ALFDLPAEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFS-DLAEISLHSLEEF 180
            LF+L  ++KES FP+NWP G+ GD D +   L ESF F      S +  +I L SL EF
Sbjct: 121 KLFNLSHDQKESFFPQNWPFGYEGDNDNDEEKLVESFRFQFDSLCSTESNQIKLESLSEF 180

Query: 181 VLEMESLGLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGL 240
              +E LGL I++ L   +G ENP+G+D   F S++W+S+  P  +P   GGFYP+IVGL
Sbjct: 181 ACALEKLGLNIIDVLMNGLGVENPVGDDSNRFSSIMWVSECLPGNKPGSMGGFYPFIVGL 240

Query: 241 QYQSKNQRCSLLGDSG----LVAAADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGN 300
           QYQ + Q+ SLL DSG    ++   DS+ V++GD+AQVWSNGKLKKVRGRP+  +A++G+
Sbjct: 241 QYQIRCQKYSLLSDSGGWVSVLPHVDSILVTVGDVAQVWSNGKLKKVRGRPI--MAALGD 300

Query: 301 TNNTNSCIISLSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSEAKE 350
            N+ +SC I++SLLITLP++S V+PLL              +G++ + E+D DN+ E   
Sbjct: 301 END-SSC-ITMSLLITLPLESNVAPLL-------------PIGNKNKVEDDIDNDEEENN 341

BLAST of Lsi04G010940 vs. TAIR10
Match: AT3G11150.1 (AT3G11150.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 229.9 bits (585), Expect = 2.4e-60
Identity = 151/358 (42.18%), Postives = 216/358 (60.34%), Query Frame = 1

Query: 2   ASSLDHNQLPNIHGGATAAPPPTPSSQTNHLSTSA----AADALSRLLHRLPPNLSLPTR 61
           ++ L+H+Q    HGGATAAPPPTPS   +HL+T++    A DALS L HRLPP LSLP R
Sbjct: 5   SAHLNHHQPLIKHGGATAAPPPTPSC--SHLNTTSKSETAVDALSSLFHRLPPLLSLPNR 64

Query: 62  RSPSVRSPPTISFSESPNPDLLDRLLSAASELGFFQL----TDHKISSHLALSAESESAA 121
           RS  + S P +S S     +  D L+SA ++ G+FQL    +D      +A +AES+S +
Sbjct: 65  RS--IPSLPMVSLSAGDRLEW-DDLISAVTDFGYFQLINDDSDILFPPGIAEAAESDSLS 124

Query: 122 LFDLPAEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVL 181
           L +L  EKKES FPK WPLG+  D +  S      FC D+    +D +E++L SL EF  
Sbjct: 125 LLELSEEKKESSFPKKWPLGYEADAETPS------FCLDADCS-TDSSELNLSSLREFTR 184

Query: 182 EMESLGLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQY 241
            +E +GLK VE L  A+GF    G D T F +L+W++ G P  EP +  GFYP++V LQY
Sbjct: 185 TLEKVGLKTVEMLANALGF----GYDSTRFNTLMWVNQGVPDDEPEVTNGFYPFVVCLQY 244

Query: 242 QSKNQRCSLLGDSGLVAA---ADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNN 301
           Q + Q+  LL +SG V+     DSV V++GDIAQVW NG++K+V+ RPV      G  + 
Sbjct: 245 QIREQKYCLLTESGWVSVLPRVDSVLVTLGDIAQVWRNGEVKRVKYRPV---LCSGQKDG 304

Query: 302 TNSCIISLSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSEAKEQ 349
              C ++++L++TLP+DS VS L    ++    E++        +EE+ED  + + E+
Sbjct: 305 PVKC-VTMTLMLTLPMDSMVSSLKDMISDGDKEEEY-------AEEEEEDGGARSDER 335

BLAST of Lsi04G010940 vs. TAIR10
Match: AT4G21200.1 (AT4G21200.1 gibberellin 2-oxidase 8)

HSP 1 Score: 49.3 bits (116), Expect = 5.7e-06
Identity = 51/224 (22.77%), Postives = 96/224 (42.86%), Query Frame = 1

Query: 68  ISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAEKKESLFP 127
           I  +E       + +  A+ E GFFQ+ +H IS  +      E   +F  P +KK     
Sbjct: 51  IDGAEEEREKCKEAIARASREWGFFQVINHGISMDVLEKMRQEQIRVFREPFDKKSK--S 110

Query: 128 KNWPLGFYGDGDEESNGLGE-SFCFDSRPRFSDLAE----ISLHS-LEEFVLEMESLGLK 187
           + +  G Y  G   +  + + S+        +D+++     +L S +E+F  E E+L   
Sbjct: 111 EKFSAGSYRWGTPSATSIRQLSWSEAFHVPMTDISDNKDFTTLSSTMEKFASESEALAYM 170

Query: 188 IVEFLFRAIGFENPIGEDRTGFRSL-IWISDGCPSTEPAMAGGFYPY----IVGLQYQSK 247
           + E L    G  +   ++     +  + ++   P  +P+   G  P+     + + YQ +
Sbjct: 171 LAEVLAEKSGQNSSFFKENCVRNTCYLRMNRYPPCPKPSEVYGLMPHTDSDFLTILYQDQ 230

Query: 248 NQRCSLLGDSGLVAAADS---VTVSIGDIAQVWSNGKLKKVRGR 278
                L+ D+  +A   +   + ++IGD+ Q WSNG  K V  R
Sbjct: 231 VGGLQLIKDNRWIAVKPNPKALIINIGDLFQAWSNGMYKSVEHR 272

BLAST of Lsi04G010940 vs. NCBI nr
Match: gi|659130522|ref|XP_008465218.1| (PREDICTED: 1-aminocyclopropane-1-carboxylate oxidase [Cucumis melo])

HSP 1 Score: 561.2 bits (1445), Expect = 1.3e-156
Identity = 297/351 (84.62%), Postives = 318/351 (90.60%), Query Frame = 1

Query: 1   MASSLDHNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALSRLLHRLPPNLSLPTRRSP 60
           MASSL+HNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALS+LLHRLPPNLSLPTRRSP
Sbjct: 1   MASSLNHNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALSKLLHRLPPNLSLPTRRSP 60

Query: 61  SVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAE 120
           SV +PPTISFSESPNPDLL+ LLSAASELGFFQLTDHKISSHLALSAESESAALF+L AE
Sbjct: 61  SVIAPPTISFSESPNPDLLNHLLSAASELGFFQLTDHKISSHLALSAESESAALFNLSAE 120

Query: 121 KKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESLGL 180
           KKESLFPKNWPLGF GDGDEES+G G+S CFDSR  FSD AEISLHSL EFVLEMESLGL
Sbjct: 121 KKESLFPKNWPLGFKGDGDEESDGSGDSLCFDSRLCFSDSAEISLHSLTEFVLEMESLGL 180

Query: 181 KIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQRC 240
           KIVEFLFRAIGFENPIGEDRTGFRSL+WIS+GC  TEPAMAGGFYPYIVGLQYQS+NQRC
Sbjct: 181 KIVEFLFRAIGFENPIGEDRTGFRSLVWISEGCRGTEPAMAGGFYPYIVGLQYQSRNQRC 240

Query: 241 SLLGDSGLV---AAADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTNSCIIS 300
           SLLGDSG V   AAADSV VSIGD+AQVWSNGKLKK+RGRPVP  +S+ NT++TNS  IS
Sbjct: 241 SLLGDSGWVAAAAAADSVMVSIGDVAQVWSNGKLKKMRGRPVPMASSLANTSSTNSRTIS 300

Query: 301 LSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSEAKEQ 349
           LSLLITLPVD+QVSPLL  TNEN   EQF SV D+ +KEED DN+ E KE+
Sbjct: 301 LSLLITLPVDTQVSPLLS-TNENEKEEQFASVRDKMKKEEDADND-EGKEK 349

BLAST of Lsi04G010940 vs. NCBI nr
Match: gi|778701311|ref|XP_011654997.1| (PREDICTED: gibberellin 2-beta-dioxygenase 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 557.4 bits (1435), Expect = 1.8e-155
Identity = 294/351 (83.76%), Postives = 315/351 (89.74%), Query Frame = 1

Query: 1   MASSLDHNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALSRLLHRLPPNLSLPTRRSP 60
           MASSLDHNQLPNIH GATAAPPPTPSSQTNHLSTSAAADALS+LLHRLPPNLSLPTRRSP
Sbjct: 1   MASSLDHNQLPNIHAGATAAPPPTPSSQTNHLSTSAAADALSKLLHRLPPNLSLPTRRSP 60

Query: 61  SVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAE 120
           SV  PPTISFSESPNPDLL+RLLSAASELGFFQLTDHKISSHLALSAESESA LF+LPAE
Sbjct: 61  SVIPPPTISFSESPNPDLLNRLLSAASELGFFQLTDHKISSHLALSAESESAPLFNLPAE 120

Query: 121 KKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESLGL 180
           KKESLFPKNWPLGF GDGDEES+G GES CFDSR   SD  EIS HSL +FVLEMESLGL
Sbjct: 121 KKESLFPKNWPLGFKGDGDEESDGSGESLCFDSRNCLSDSPEISFHSLTDFVLEMESLGL 180

Query: 181 KIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQRC 240
           KIVEFLFRAIGFENPIGEDRTGFRSL+WIS+GC STEPAMAGGFYPYI+GLQYQS+NQ+C
Sbjct: 181 KIVEFLFRAIGFENPIGEDRTGFRSLVWISEGCRSTEPAMAGGFYPYIIGLQYQSRNQKC 240

Query: 241 SLLGDSGLV---AAADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTNSCIIS 300
           SLLGDSG V   AAADSV VSIGDIAQVWSNGKLKK+RGRPVP  +S+ NT++TNS  IS
Sbjct: 241 SLLGDSGWVAAAAAADSVMVSIGDIAQVWSNGKLKKMRGRPVPMASSVANTSSTNSRTIS 300

Query: 301 LSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSEAKEQ 349
           LSLLITLPVD+QVSPLL  TNENAN EQF    D+KE+E+D D + E KE+
Sbjct: 301 LSLLITLPVDTQVSPLLLSTNENANEEQF----DKKEREDDGD-SGEGKEK 346

BLAST of Lsi04G010940 vs. NCBI nr
Match: gi|778701315|ref|XP_011654998.1| (PREDICTED: 2'-deoxymugineic-acid 2'-dioxygenase isoform X2 [Cucumis sativus])

HSP 1 Score: 463.0 bits (1190), Expect = 4.7e-127
Identity = 237/268 (88.43%), Postives = 248/268 (92.54%), Query Frame = 1

Query: 1   MASSLDHNQLPNIHGGATAAPPPTPSSQTNHLSTSAAADALSRLLHRLPPNLSLPTRRSP 60
           MASSLDHNQLPNIH GATAAPPPTPSSQTNHLSTSAAADALS+LLHRLPPNLSLPTRRSP
Sbjct: 1   MASSLDHNQLPNIHAGATAAPPPTPSSQTNHLSTSAAADALSKLLHRLPPNLSLPTRRSP 60

Query: 61  SVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLPAE 120
           SV  PPTISFSESPNPDLL+RLLSAASELGFFQLTDHKISSHLALSAESESA LF+LPAE
Sbjct: 61  SVIPPPTISFSESPNPDLLNRLLSAASELGFFQLTDHKISSHLALSAESESAPLFNLPAE 120

Query: 121 KKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESLGL 180
           KKESLFPKNWPLGF GDGDEES+G GES CFDSR   SD  EIS HSL +FVLEMESLGL
Sbjct: 121 KKESLFPKNWPLGFKGDGDEESDGSGESLCFDSRNCLSDSPEISFHSLTDFVLEMESLGL 180

Query: 181 KIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQRC 240
           KIVEFLFRAIGFENPIGEDRTGFRSL+WIS+GC STEPAMAGGFYPYI+GLQYQS+NQ+C
Sbjct: 181 KIVEFLFRAIGFENPIGEDRTGFRSLVWISEGCRSTEPAMAGGFYPYIIGLQYQSRNQKC 240

Query: 241 SLLGDSGLV---AAADSVTVSIGDIAQV 266
           SLLGDSG V   AAADSV VSIGDIAQ+
Sbjct: 241 SLLGDSGWVAAAAAADSVMVSIGDIAQL 268

BLAST of Lsi04G010940 vs. NCBI nr
Match: gi|590561888|ref|XP_007008940.1| (2-oxoglutarate and Fe(II)-dependent oxygenase domain-containing protein, putative [Theobroma cacao])

HSP 1 Score: 328.2 bits (840), Expect = 1.8e-86
Identity = 195/360 (54.17%), Postives = 252/360 (70.00%), Query Frame = 1

Query: 1   MASSLDHNQ--LPNIHGGATAAPPPTPSSQTNH--LSTSAAADALSRLLHRLPPNLSLPT 60
           MASS    Q  LPN++G ATAAPPPTPS Q NH  +++SA ADALS+LLHRLPP LSLPT
Sbjct: 1   MASSTHKQQQHLPNLYG-ATAAPPPTPSGQPNHHLVTSSATADALSKLLHRLPPTLSLPT 60

Query: 61  RRS-PSVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALF 120
            RS PS  SP T+SFS+   P+L D LLS+ S++GFFQLT H +SS LA SAE+ES +LF
Sbjct: 61  HRSAPSTASPRTVSFSD---PNLKDLLLSSGSKVGFFQLTSHDVSSQLANSAETESLSLF 120

Query: 121 DLPAEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEM 180
           +LP E+KES FPKNWPLGF  D +E  +  GESFC D+    ++L  +SL SL EF   +
Sbjct: 121 ELPKEQKESCFPKNWPLGFDADEEEGGDEKGESFCLDATCS-TELTNLSLSSLREFTRAL 180

Query: 181 ESLGLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQS 240
           E LGLKI++ L  A+GFENPIGED T F SL+WI +G    +   +GGFYP+++GLQYQ 
Sbjct: 181 EKLGLKIIDTLANAVGFENPIGEDPTRFCSLMWILEGLHGDDKP-SGGFYPFVIGLQYQI 240

Query: 241 KNQRCSLLGDSGLVAAA---DSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTN 300
           + Q+ SLL +SG V+ +   DS+ V++GDIAQVWSNGKL+KVRGRPV      GN    N
Sbjct: 241 RCQQYSLLSESGWVSVSPEVDSIMVTLGDIAQVWSNGKLRKVRGRPVAACLDDGN----N 300

Query: 301 SCIISLSLLITLPVDSQVSPLLQPT---NENANVEQFRSVGDRKEKEEDEDNNSEAKEQG 350
           S  +S+SLL+TLP+DSQV+PLL      +ENA+ ++ R          D++  +E K++G
Sbjct: 301 SRHVSMSLLLTLPMDSQVAPLLTKVIADDENASDDEIR----------DDEIGTEGKKEG 340

BLAST of Lsi04G010940 vs. NCBI nr
Match: gi|1012361885|gb|KYP73068.1| (Gibberellin 2-beta-dioxygenase 2 [Cajanus cajan])

HSP 1 Score: 327.8 bits (839), Expect = 2.4e-86
Identity = 190/349 (54.44%), Postives = 248/349 (71.06%), Query Frame = 1

Query: 1   MASSLDHNQL-PNIHGGATAAPPPTPSSQTNHL-STSAAADALSRLLHRLPPNLSLPTRR 60
           MASS    QL PN++GGAT+APPPTP++Q N L STS AADALSRLLHRLPP LSLPTR 
Sbjct: 1   MASSTHKQQLLPNLYGGATSAPPPTPAAQPNSLLSTSDAADALSRLLHRLPPTLSLPTR- 60

Query: 61  SPSVRSPPTISFSESPNPDLLDRLLSAASELGFFQLTDHKISSHLALSAESESAALFDLP 120
                SP + + +  P+  L D +LS  S+LG+ QLTDH + S LA SAESE+ ALFDL 
Sbjct: 61  ---CASPSSAAATCPPSLSLNDDVLSCVSQLGYAQLTDHSVPSGLANSAESEALALFDLS 120

Query: 121 AEKKESLFPKNWPLGFYGDGDEESNGLGESFCFDSRPRFSDLAEISLHSLEEFVLEMESL 180
            ++KE+LFPKNWPLG+   GDEE  GL ESF  DS  R+++ +E++L SL E  LE+E L
Sbjct: 121 RDRKEALFPKNWPLGY---GDEEDEGLAESFRLDS-ARWTESSELALASLRELALELEKL 180

Query: 181 GLKIVEFLFRAIGFENPIGEDRTGFRSLIWISDGCPSTEPAMAGGFYPYIVGLQYQSKNQ 240
           GLKIV+ L + +GFENP+G D T FRSL+W+S+    ++P ++GGFYP++VGLQ+Q +NQ
Sbjct: 181 GLKIVDGLTKELGFENPLGHDPTRFRSLMWVSECVRGSKPDLSGGFYPFVVGLQFQIRNQ 240

Query: 241 RCSLLGDSGLVAA---ADSVTVSIGDIAQVWSNGKLKKVRGRPVPTIASIGNTNNTNSCI 300
           + S+L DSG V+     DS+ V++GDIAQVWSNGKLKKVRGRPV   A++G  N+T    
Sbjct: 241 KYSMLSDSGWVSVLPHVDSILVTVGDIAQVWSNGKLKKVRGRPV---ATVGEENDTR--C 300

Query: 301 ISLSLLITLPVDSQVSPLLQPTNENANVEQFRSVGDRKEKEEDEDNNSE 345
           I++SLLITLP +S+V+PLL   ++             KE+ E+E NN E
Sbjct: 301 ITMSLLITLPTESRVAPLLPNKDQT------------KEESEEESNNGE 324

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KP52_CUCSA3.3e-12788.43Uncharacterized protein OS=Cucumis sativus GN=Csa_5G208470 PE=4 SV=1[more]
A0A061FKB2_THECC1.3e-8654.172-oxoglutarate and Fe(II)-dependent oxygenase domain-containing protein, putativ... [more]
A0A151U1H0_CAJCA1.6e-8654.44Gibberellin 2-beta-dioxygenase 2 OS=Cajanus cajan GN=KK1_005679 PE=4 SV=1[more]
A0A0B0PL18_GOSAR2.5e-8255.63Gibberellin 2-beta-dioxygenase 1-like protein OS=Gossypium arboreum GN=F383_0750... [more]
G7K254_MEDTR2.1e-8151.102-deoxymugineic-acid 2-dioxygenase-like protein, putative OS=Medicago truncatula... [more]
Match NameE-valueIdentityDescription
AT3G11150.12.4e-6042.18 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot... [more]
AT4G21200.15.7e-0622.77 gibberellin 2-oxidase 8[more]
Match NameE-valueIdentityDescription
gi|659130522|ref|XP_008465218.1|1.3e-15684.62PREDICTED: 1-aminocyclopropane-1-carboxylate oxidase [Cucumis melo][more]
gi|778701311|ref|XP_011654997.1|1.8e-15583.76PREDICTED: gibberellin 2-beta-dioxygenase 2 isoform X1 [Cucumis sativus][more]
gi|778701315|ref|XP_011654998.1|4.7e-12788.43PREDICTED: 2'-deoxymugineic-acid 2'-dioxygenase isoform X2 [Cucumis sativus][more]
gi|590561888|ref|XP_007008940.1|1.8e-8654.172-oxoglutarate and Fe(II)-dependent oxygenase domain-containing protein, putativ... [more]
gi|1012361885|gb|KYP73068.1|2.4e-8654.44Gibberellin 2-beta-dioxygenase 2 [Cajanus cajan][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR027443IPNS-like
IPR026992DIOX_N
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0016491 oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi04G010940.1Lsi04G010940.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026992Non-haem dioxygenase N-terminal domainPFAMPF14226DIOX_Ncoord: 66..142
score: 3.8
IPR027443Isopenicillin N synthase-likeGENE3DG3DSA:2.60.120.330coord: 56..194
score: 1.5
NoneNo IPR availablePANTHERPTHR34945FAMILY NOT NAMEDcoord: 3..354
score: 9.7E
NoneNo IPR availablePANTHERPTHR34945:SF12-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEINcoord: 3..354
score: 9.7E
NoneNo IPR availableunknownSSF51197Clavaminate synthase-likecoord: 50..280
score: 2.47