CmoCh12G011670 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh12G011670
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionAT-hook motif nuclear-localized protein
LocationCmo_Chr12: 10558067 .. 10567660 (-)
RNA-Seq ExpressionCmoCh12G011670
SyntenyCmoCh12G011670
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTATGGATAAATGATAATATGTACACAAAAAAGGAAAAAAAAGTTATATTTATTTTATTTATTTATTTTAATTTTCCAAAACATACGCTCTCTCTCTCATCTCAAAAATGGAACCCAATGACAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAAAGTCCCACCACATCGCCGACCAATGGCCTTCTACCCTCCACCCACCACCTCTCCTCCGCCGACGCCACCACCCATGTCCTTTACCCTCACTCGGTTCCCTCCGCCGCCGTCTCCTCCTCTCCTCTCGAGCCCGGTCGCCGGAAGAGAGGTCGCCCGCGGAAGTACGGCACGCCGGAGGAGGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCAAGGCCAAGAAGGACCTCGTTTCTTCCTCTTCTCTTAATGCCGTTTCCGCTTCTTCGAAGAAATCTCAGTTGGCTGCACTTGGTGGGTTGTCGTGTTGGAATTTGTTGTTAATTTCTTTTCTCTTGTTGGGATTTTTGGGGATTTTTCTAGGGTTCTGGAATTCGTTGTGGTTTTGGGGATTTCTGGGTTTTGGAATTGGGTTTTCTTTGTTTTAACCCCAAATCTCAAAAGCCCTTAATTCCCTTTTGGTCTTTTAGCAAAAGATTTAAAGAGATGCTTTACCATAATTATTCAATTTTGTATGTATGCTTGTAATGGTTTCATGTTCTAAAATAACATGTTTCCTGTCTTGGCAATTGTTTCTTTCTTTTTTATTTTCATTTACTTTTCGTTTTTGTTTCACTTTTGACTATGTGCTCGAATACTTACTTGGAGTGTATTTTTCTCACTGGACATGTAGGTAATGCAGGCCAAGGTTTTGCGCCACAGGTTATTGATGTGGCAGCTGGTGAGGTAATGCATTCTCCATAGTTTCCAAAGATTTGATTCCATTTATACTGATGTACTTGGAAGTGCATGTGATGACGTGCAGGCATTTGCATAAACGCATACAATGATATAGCTAGAAATTGTGGGTCATATACAATTAATGGATTTGAAATGCTAGCTTTAAAGAATTAGCAAACTAAGTCACCACTTCTCAATATGGGGCATAACCTATGCTGTCCTGGTATTGATATATCATCTAGTATGATGGTGCTGCGATAGGACAGCAAGACCATTAAGGATCTCATAAAAATTTCTATGATCTTGAATGCCCTTTTCTGTTGAATTCTAGTTTGTATCTGGGTTCCGAGTTTTTAAAATGTATGGTTATAGTCGATAGCTTCTTAGTTTAGTTTCCTGAGTCTTTTCTTTCAAAATAGTTTATGTTCTAGTAAACTACAGACTTGAGAGATCCCACGTCAGTTAGGGAGGAGAACGAAGCATTTTTTATAAGGGTGTGGAAACCTCTCCCTAGTTTACAACTTTTAAAAACCTTGAGGAAAAGCTCGAAAGGAAAAGCCCAAAGTGGACAATATCTGCTAGCGGTGGGCTTGGGCTATTACAAATGGTACAAGAGCCGGACACCGGGTGTTGTCACAGCGGGAGGCTGAGTCCACCTCGAAGGGAAGGACATGAGGTGGTGTGCTAGCAAGGATGTTGGGCCTCGAAGGGGGGTGGATTTGGGGGTCCCACATCGATTAGAGAAGTGAACGAGTGCCAACGAGGACGCTAGGCCCTAAAGAGGGGTGAATTGTGAGATCCCACATTTGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCCCTCTAGTAGACGCGTTTGAAAAACCTTGAGGGAAAGCCTGAAAAGGAAAGTCCAAAGTGGACAATATCTGCCAGTGGTGGGCTTGGGCTGTTATGGACCTATTGTCTCTACCTATGATGTTTTTATGTTCTTTGAAGGATTAAATGTTTAATGCGTTTAATTAAAAGGAACGTGTCAGCCTGAAGCACTAGTTGGTTAAGTTTTCATACCGTGAATGGGGCGTAGTGTCCACTTATAGAGGTTCTGCAGATGGTTTTCTGTTTTTTCTTCCTCTGTTTCTTTGTTCTCCTTTTTTTATTTACTACTTACGATCCAGTGGCGACGCACTTGAATTTAAATCCATGACAATCATTGTTGAGATTTCTTTCTCTTGATAAGGAACATAATATTTCATTGATAAGACGAAATTACAAAAAAAGTGAAATTCCCTTTCCGAATAAAGGGACAAAAATTGTTGAGACTTTGGTATCGAGAATCTTCGAGCTACCGCTTGGTCCCATGAATTTTCTCATCTTAATAATGAAATGGAACTAGTTCTAATTTTGGTTCTATTGTACATAAAATGATTGGGTTGTATTAGAGTGCAGAAAAAGTAAGAAGAATAAGATAATACTCTTGGTGCAGGTTATTCTTTAACTTCTTGTCAGGGAGTAACATAAAACAAGATTATCTCTTGGCTTCTTAATAGACATGGTAATTTGGAATATTGGAGGGATTGAAAAAATAATTACTCAGGTACATTATTAAAAAAATATATTTCTTCTCTCGTTCCATTTCTTTTATGCCAAGTCTCAATTAAAAGGTGTCTTGCAGATGAGGACGGTTTCTTAGTACCCCTCTCACATACCGGCACACATGTTCGTGTACTTATGCACACACAAAGGAAGTAATAGATTTATGTTCTTGAACATGTGTAAGGTATGTGTATCACATGGTTTTTGGTAGAATTCACATATATGAGTAAGAGAGTGCCAGGCTAATCTGCAACATCCACTGTGGTTATTTTCGTTTTCGGAGGGAATAATATGCTGCATACAAACAGGGAAACTTGAAAGTCCGAATTGGAATAAGGGATAAAAATCAAATTTTAAAAGATTATATTTACGATGTCAGTTCGTTTACAGTCTTCTCCTGAAAGGAAGGTTTTGTATTTGTTTTGTGGTTATCTACTTTGGTCCTATTGTGGAGCTTCATTTCTGCGTGTTGATGTGATCTCTATGTGATTATAGCTACTTTTTTTGTATCAACTACTCATTCTTTTGTTTCTCTGTGGATTTGTATTAATTTAGCTGGTTGAGAAGCCCGTTTTAAAAAAGCATCTCTTGTTTTCCCCCTCGTTCGTGTCGATAAGTTTTCTGTCCAGTAACAAGGAGAAGAGAGTTCAGGCAGGTTTCTGATTTGGTTTTAGGAAAGTGGAGCTGTGGTGGGCTGGCGGCTTAAGTTCATGGGAAACCTTAATGATAGAGAGATTGAAGGGTGTATGGCTATCTGGAGCAAGGTCCTTTACTCCGAGTGATTCTAGGATGTGGTTATACTTAAAGGTTGGAACATTTTTTTCGTCAAGTCATTGTTCAACAACCTGGCTGCTAATGATTTTATCATCTCCCAAAGTTCTTCTGCCAAGCAATCTTGAATTTTGAAGGTCAGGAGAAGGTTTATTAGGTGTTGTGACTCTTTTTCTTGGAGGCATCAATAACGGATCTATTCCGGAAGAGGAATCATTATACTCTCTTTCGGGTTGCAATCCTAGTGTAAAGTTGAGCAGAGATGCAAATCACTTGTTTCAGTATATCACTTGTATTTGGTGAACAGTGTATTTCTATTGCTCGTAGGTGGAACCTCTTCGGTGAGTTTGGTAGGTGGGTCGTGAGATCCCACATCGGTTGGAGAGGGGAACCAAACATTCCTTATAAGAGTGTGGAAACCTCTCCCTAACAGACATGTTTTAAAACTTTGAGGGGAAGCTCAGAAGGGAAACCCCAAAGAGGATAATACTGCTAGCGGTGGGCTTGGGCTGTCACACAGGTGGTCCTTCCTAGAGATGCCCAAGACTTCATTCTCAACATGTTATGTAGCTATCCTTTTCGGGGCAAGGCGAGAGAATTATGAGAATCTCCAAGGAGATGATATGGGGTGTTTCTTGATCGACTATTTGTCATTGGTGTGTTCTTCATCAAGTTTTTGTTTTCGTTTTTGTGAGTAGCAAGCCTTTTGTTGAGAAAGTGAAGGAATAGTAAGGATGATAAGAAACCCACCCAAATCAAAGGATATTACAGAACCCACATGCAGATGAATTTCTTAATTCTATCCCGTCTTCCATAGCTGATTGTCACATTGTGTACAAAAATCACATTGTTGATTTCTGCTTCGTACTGGTTTACTTATGTATACTCTATATTATATGCTTGATGAGCTATGGCTGTATCCCTTGTCCTCTTTACCCTCCTAACGAACACTCCGGTCATATTGAATAGGACGTGGGCCAGAAAATTATGCTGTTTATGCAACAATGTAAGCGGGAAATCTGTATCCTTTCCGCATCTGGTTTGATCTCCAATGCATCTCTCCGTCAGCCGGCCTCATCTGGTGGCAATGTTACGTATGAGGTATTTTTGTTCTCCCTATTTGATATCGTAGCTCTTCCTCCTTGGTGTGTGGAAGAAACGATAACAAGAATGATGCTTTGTAGTAGCAATCGTGGATGATTTTATCGAGTTGTTTATATATTTTGTGGGCTGTTTCGGAAAATGGACAAAAGATAACTGGCTACTCATTTGGATGATGATGTCTGTTTTTTTCCTTCCAGAAGTTTGCTTATGAATGCATTATTATAACTATTTGCATTGTAATGTTACTATTAGGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCCATATCATAGGAGGGGGAGTTGGTGGACCGTTGATGGCTGCTGGACCCGTGCAGGTACCGTAACCAACAGATTGTTTTTCTCGTCTTTTCACTAAGAAGTTCAGTCAATGTTTGAATGTTTTATTGCATAAAGAGTTTTAGATGTATTATTCTTTTGCTTAGTTCATTCTTGTACGGATCCGTCCATTTCAGGTTATTGTTGGTACTTTCATAACCGACCCGACAAAGGAAGCTGGTGGTGGGATTAAAGGCGATACATCTGCTGGCAAGTTGTCCTCACCCACTGGTGGGACATCGATGTCAGGTCTACGCTATGGTTCGAACATCGACTTGGGAGGTAATCAAGTCAGGGGAAACAATGAGCACCAAGGTATCGGGGAGAGTCATTTCTTGCTTCAACCCCGGGGAGTGAACCTGACATCTCCTCGATCCAACTGGAGAACAGGTCTGGATGGCACCACCACTGCTTATGATTTTACAGGTACACAGGCACACTCAATCCGTTCATCGTTTCCTTTTTCCGAACTCGTGTTTTTATGAGGTTGGGAACTTCTTTGTGCAGGAAGAACAGACCATTCTCCCGAAAACGGAGATTACGATCAGATTCCTGAATGAGAGTTAACATACGAGATGAGACAAGACGTGACTGCAAAGTATGTTGTAGATAAATGTACAATATCAACCGCTGCAACGCCAGCAATCTTCGCTTCACTGTGTTAGCCTAATTCTGTGGTTGTAGTATGCACCATACTGTAAAGGTGATAGCAAACTCTTCAGAAGTTCTTATCCTTCCATCCATCCACACCTTTCATGGTGTACTTGAATGTCTAATTCTAATTCTCCTGTTTTTCCCTCTTTAGATGAACAAGTTCGGTGTAGTTGTTACAGTTTGCAATCTGCAATTTGATCATATAAACAAACCCCTAGGGTGATTTAGAATCAAATCTCACTTTTCTATTGCATTATATTTGGTTTGAACATTTTATGGGAGTGTTTGGCCAACCATCTTGAAGTTCGTTGGTCGTTGGTCGGTGGGGTTTGATCTATATACGTTATAACATGGTGAGTAAATAACTTCACCTATCAGCTTTAATAGTATTCACGGTTCCATCCGACCAACTTCATTAGAGTCCACGGTTGCAAGTTTTCATCTTGCAAAGGTGTGGAAGCTGGTTAAAGAGAGGGACGAAACATTATAAAGGTGTGGAAGCTTCTTTTTAATAGACGCGGTTTAAAATTGTGAAGTTGATGACGATATGTAACGATCTAAAGTGGACAATATCTCTTGTATCAGTTATTTCGGTTAGATTTCTGTTAATTGTGATATGGGTTCACACCATCTTTTATCTTTATCTTCTTATTTACATAAAGAGCATGTTAGCTGTCGTTACCTTAACAGTTATGTCTACATCTCCCAAAACCAGAAAAATCCGAAGCCCATTAATTCAGTTCCTCGTTCTTCGATCTCCCTGTGGGCGCGCACAAATCCGGATTACCCACCAAAAATTTCCCCCCAATTTCCTGCATGGCCAAACCTCGCCCATACCCAGTTATATATATTAATCTTTGTTTCAATTTCAGAAGTAGCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGGTCTTTAATGGCGGTGTTGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAGTCATGGCCTTGCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGGCTCGCTTCCGTCCAGCATAGAGCCGTTCATAGGTACTTGGCAATGCTTATTAGCGGAATCAAATGATTTTGATTTTGATTTTTGATCTTGGAGCCCAACCCCCATTTTTTGTTGCAGTGATGGATTGGGGAAGAGAGAGGATCAATGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTTGTTTATCACAATTTCTTGGTAAGTTCTCAGAATTTTGGATTCCTTTTGGCATTTTGTTTGGTTTTTGTCATGTTTTTGGTTGAAATGCTTGCTGATATGAATGATTTTGGACAGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAACACCTTACATGAAAAAATCAACTGTGGTTGATATAAAAACTGGCAAGATTAAAGATAGCAGGTCTGTATGTGTCATTTTCATTTTTGTCATCAATAGAATTTGTCAATCCTGACTATGAACTAACGTTGGTGGGGCTTACAATGTCATGTCCGCAGGACGCGCACCAGTTCCGGGATGTTTCTGAATAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGGTAATGATATCCAAATTTGCATGTTACCTTGGCCAATTTTTTCATGCATCTTCTAATGATTCTTATCTATAAGGGCTTCAAAACATGTTGGAATCATTGTTTTTGAAATCTTGTTTCTATGTGTTAGTCTTGTCTGGTGGAACATGATAGTGATTTGATTGACGACCTTACTTAAACAAGTGTCAATAACATCACGTATAGTTAGACTAGCCAAATTGTGAGATCCCACCTTGGTTGTAGAGGGAAACAAAGCATTTCTTATAAGAGTGTGGAAACATATTCCTAGTAAGCACGTTTTAAAACTGTGAGGCTAACGATGATATGTAATGGGCCAAAACAAACAACATCTACTAGCGGTGGGCTTGAATTGTTACAAATAGTATCAGAGCCACTTATTGGACGGTTCGAAGCACTGTTTGGAATAAGCCAGTTATTTTAAAACCGTGAGGATGATGGTGATGTGCAACCGTCCAAAGCGGACGATATCAGTTAGCGGAGGACTTGGGCTGTTACAAATGGTATCAGAGCCACATATCGGGCGATGTGCTAGCGAGGAGGCTGGGTCCCAAGGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGAGAACAAAACTTCCTTATAAGGGTGTAGAAACTTCTCCCTAATAGACGTTTTAAAATTGTGAGGCTGATGGCGATATGTAACGAGTCAGAGCAGATAATATAGCAGTGGGCTTGGGCCGATGAAAAATGAGCTAATTTTCTCGTGGGTTTGTTCTCCATTCTAGTTTTGAGGCCCAGACCAAGCAGGACTTTGGTGAACACTATTCATTTGCTTACTGATGCTTTGTAGACATCACTGACATTCTTGGTTTCAATACATTTTGGTATTAATGTTGATATTCACTATTCCAAACAGAGCATGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGCCAGAAGTATGATGCTCACTATGATTTCATTTCTGAGGAGTTCATCCGAAAAGGAGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTAAGATTCAAATTTTACATCTCTCAAACTCATCTGCCCCTCTCTACCTCTCTCTCTAAAACTGTGACTGATACGAATGCAGGTCAGACGTCGAAGAAGGGGGTGAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCCGGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTATCTGTAAAACCAAAGATGGGTGATGCATTATTGTTCTGGACCCTGAGGCCTAATAATACCTTAGATCCTACAAGTTTGCATGGTTAGTGATATCTCCTGCACCAATTCTAATGAACAATACATATGCCTTTGCTCAAGGATGGTGAAGTAGAATTAATTTGACATTCTTGTGATATGTAGGTGCTTGCCCTGTCATAAGAGGGAACAAGTGGTCATGTACAAAGTGGATGCATGTTAATGAATACTAATTCAAACGTACGTTATCGACTAATTTTAGTAACAATGTTCCTACATTATATCTACCCACATATATACTTAGGATTCGAATCAAGGTTTAAGATTCTGATTTAGTACCTGCTGGCCTCGATATGTCTCCTGTGCGACATCACATTACCTACTTGTTGTCGTAGACTAATTCTTACAAACCAACACGGGTCTTTTCAACATATTTTGTTTTCACTCCTATGCTTTCATCATCCAACTTAGAATTGTTCCAAGCTAAGAACGCTTAATTTGGAAGTTTTTTTTTATTAAGCCACTGAAAAGGAACGGAACGTGCGCCTTGTTGGTATAGATAGTAACTTCTATTCTTTTTTAATCTGTCTCAATCACTTTTATACGAATATGGTCTTGTCTTGTCATTTGTGTGCTTTCTTGTTACTTGGGTGTCATAATTTTTAGAGGTGGGTTTGAAATAATAGCTACAAATCTAGTGTTTTATTAGCAATATATGACATGATTTTAAGTGAGATCATTTCCAAATTGCAGGGTGGAATGAAAGGCATACATCTTAACTGGCTACTCATTTGGATGAAGATGTCTGTTTTTTTCCTTCCCAAAGTTTGCTTATGAATGCATTATTGTAACTATTTGCAATGTAATGTTACTATTAGGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAGCTGACACTGGAGGAAAGACGGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCTATATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCCGCTGGACCCGTGCAGGTACCCTAACCAACAAATTGTTTTTCTAGTATTTTCCCTAAGAAGTTCAGTCGATGTCTGAATGTTTTATTGCATAAAGAGTTTTAGATGTATTATTCTTTTGCTTAGTTCATTCTTGTATGGACCCATCCAAATCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAATCTTCGCGTCACTGTGTTAGCTAA

mRNA sequence

ATTATGGATAAATGATAATATGTACACAAAAAAGGAAAAAAAAGTTATATTTATTTTATTTATTTATTTTAATTTTCCAAAACATACGCTCTCTCTCTCATCTCAAAAATGGAACCCAATGACAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAAAGTCCCACCACATCGCCGACCAATGGCCTTCTACCCTCCACCCACCACCTCTCCTCCGCCGACGCCACCACCCATGTCCTTTACCCTCACTCGGTTCCCTCCGCCGCCGTCTCCTCCTCTCCTCTCGAGCCCGGTCGCCGGAAGAGAGGTCGCCCGCGGAAGTACGGCACGCCGGAGGAGGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCAAGGCCAAGAAGGACCTCGTTTCTTCCTCTTCTCTTAATGCCGTTTCCGCTTCTTCGAAGAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACAGGTTATTGATGTGGCAGCTGGTGAGGAAAGTGGAGCTGTGGTGGGCTGGCGGCTTAAGTTCATGGGAAACCTTAATGATAGAGAGATTGAAGGGTGTATGGCTATCTGGAGCAAGGACGTGGGCCAGAAAATTATGCTGTTTATGCAACAATGTAAGCGGGAAATCTGTATCCTTTCCGCATCTGGTTTGATCTCCAATGCATCTCTCCGTCAGCCGGCCTCATCTGGTGGCAATGTTACGTATGAGGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCCATATCATAGGAGGGGGAGTTGGTGGACCGTTGATGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTCATAACCGACCCGACAAAGGAAGCTGGTGGTGGGATTAAAGGCGATACATCTGCTGGCAAGTTGTCCTCACCCACTGGTGGGACATCGATGTCAGGTCTACGCTATGGTTCGAACATCGACTTGGGAGGTAATCAAGTCAGGGGAAACAATGAGCACCAAGGTATCGGGGAGAGTCATTTCTTGCTTCAACCCCGGGGAGTGAACCTGACATCTCCTCGATCCAACTGGAGAACAGGTCTGGATGGCACCACCACTGCTTATGATTTTACAGAAGTAGCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGGTCTTTAATGGCGGTGTTGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAGTCATGGCCTTGCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGGCTCGCTTCCGTCCAGCATAGAGCCGTTCATAGTGATGGATTGGGGAAGAGAGAGGATCAATGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTTGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAACACCTTACATGAAAAAATCAACTGTGGTTGATATAAAAACTGGCAAGATTAAAGATAGCAGGACGCGCACCAGTTCCGGGATGTTTCTGAATAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGAGCATGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGCCAGAAGTATGATGCTCACTATGATTTCATTTCTGAGGAGTTCATCCGAAAAGGAGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTCAGACGTCGAAGAAGGGGGTGAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCCGGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTATCTGTAAAACCAAAGATGGGTGATGCATTATTGTTCTGGACCCTGAGGCCTAATAATACCTTAGATCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAGTGGTCATGTACAAAGTGGATGCATGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAGCTGACACTGGAGGAAAGACGGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCTATATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCCGCTGGACCCGTGCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAATCTTCGCGTCACTGTGTTAGCTAA

Coding sequence (CDS)

ATGGAACCCAATGACAACCAGCTCAGCTCCTACTTCCACCACCATCAACACCACCATCAAAGTCCCACCACATCGCCGACCAATGGCCTTCTACCCTCCACCCACCACCTCTCCTCCGCCGACGCCACCACCCATGTCCTTTACCCTCACTCGGTTCCCTCCGCCGCCGTCTCCTCCTCTCCTCTCGAGCCCGGTCGCCGGAAGAGAGGTCGCCCGCGGAAGTACGGCACGCCGGAGGAGGCTTTAGCGGCTAAGAAAGCTGCTACGGCGTCGTCTCACTCTTCCTCCAAGGCCAAGAAGGACCTCGTTTCTTCCTCTTCTCTTAATGCCGTTTCCGCTTCTTCGAAGAAATCTCAGTTGGCTGCACTTGGTAATGCAGGCCAAGGTTTTGCGCCACAGGTTATTGATGTGGCAGCTGGTGAGGAAAGTGGAGCTGTGGTGGGCTGGCGGCTTAAGTTCATGGGAAACCTTAATGATAGAGAGATTGAAGGGTGTATGGCTATCTGGAGCAAGGACGTGGGCCAGAAAATTATGCTGTTTATGCAACAATGTAAGCGGGAAATCTGTATCCTTTCCGCATCTGGTTTGATCTCCAATGCATCTCTCCGTCAGCCGGCCTCATCTGGTGGCAATGTTACGTATGAGGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAACTGACATTGGAGGAAAGACTGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCCATATCATAGGAGGGGGAGTTGGTGGACCGTTGATGGCTGCTGGACCCGTGCAGGTTATTGTTGGTACTTTCATAACCGACCCGACAAAGGAAGCTGGTGGTGGGATTAAAGGCGATACATCTGCTGGCAAGTTGTCCTCACCCACTGGTGGGACATCGATGTCAGGTCTACGCTATGGTTCGAACATCGACTTGGGAGGTAATCAAGTCAGGGGAAACAATGAGCACCAAGGTATCGGGGAGAGTCATTTCTTGCTTCAACCCCGGGGAGTGAACCTGACATCTCCTCGATCCAACTGGAGAACAGGTCTGGATGGCACCACCACTGCTTATGATTTTACAGAAGTAGCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGGTCTTTAATGGCGGTGTTGAAAGGGAAATACATCAAGTTTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAGTCATGGCCTTGCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGGCTCGCTTCCGTCCAGCATAGAGCCGTTCATAGTGATGGATTGGGGAAGAGAGAGGATCAATGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTTGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAACACCTTACATGAAAAAATCAACTGTGGTTGATATAAAAACTGGCAAGATTAAAGATAGCAGGACGCGCACCAGTTCCGGGATGTTTCTGAATAGAGGGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGAGCATGGAGAAGCACTTCAAATTCTGCACTATGAAGTCGGCCAGAAGTATGATGCTCACTATGATTTCATTTCTGAGGAGTTCATCCGAAAAGGAGGCCAAAGAATAGCCACTCTTCTCATGTATCTGTCAGACGTCGAAGAAGGGGGTGAGACAGTGTTCCCAGCAGCCGAAGGGAACTTCAGCTCTTTGCCCGGGTGGAATGAACTGTCTGAATGTGGTAAAGGTGGACTATCTGTAAAACCAAAGATGGGTGATGCATTATTGTTCTGGACCCTGAGGCCTAATAATACCTTAGATCCTACAAGTTTGCATGGTGCTTGCCCTGTCATAAGAGGGAACAAGTGGTCATGTACAAAGTGGATGCATGGCCGTTTCGAGATTGTTTCGTTATGTGGATCTTATGTACGAGCTGACACTGGAGGAAAGACGGGTGGTCTTAGCGTATGTTTGTCGAGTGCTGATGGCTATATCATAGGAGGGGGAGTTGGTGGACCGTTGAAGGCCGCTGGACCCGTGCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAATCTTCGCGTCACTGTGTTAGCTAA

Protein sequence

MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLFMQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGTSMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYDFTEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMHGRFEIVSLCGSYVRADTGGKTGGLSVCLSSADGYIIGGGVGGPLKAAGPVQVNIQDETRQDVTAQPLQRQQSSRHCVS
Homology
BLAST of CmoCh12G011670 vs. ExPASy Swiss-Prot
Match: Q9LN20 (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 4.6e-103
Identity = 186/286 (65.03%), Postives = 222/286 (77.62%), Query Frame = 0

Query: 386 KGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHR 445
           K ++ +FQ RKWST  L  + M  +L + + M +AF  FS P  +  S+ +      +  
Sbjct: 3   KLRHSRFQARKWSTLML-VLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRAA 62

Query: 446 AVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKI 505
              S+GLGKR DQW E +SWEPRAFVYHNFLSKEEC YLISLA P+M KSTVVD +TGK 
Sbjct: 63  TERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKS 122

Query: 506 KDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFI 565
           KDSR RTSSG FL RG++KI+  IEKRIAD+TFIP +HGE LQ+LHYE GQKY+ HYD+ 
Sbjct: 123 KDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYF 182

Query: 566 SEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPK 625
            +EF  + GGQR+AT+LMYLSDVEEGGETVFPAA  NFSS+P +NELSECGK GLSVKP+
Sbjct: 183 VDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPR 242

Query: 626 MGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH-GRFEI 669
           MGDALLFW++RP+ TLDPTSLHG CPVIRGNKWS TKWMH G ++I
Sbjct: 243 MGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYKI 287

BLAST of CmoCh12G011670 vs. ExPASy Swiss-Prot
Match: F4JZ24 (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 5.1e-94
Identity = 167/263 (63.50%), Postives = 205/263 (77.95%), Query Frame = 0

Query: 402 LSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGL-GKREDQWVE 461
           +S  V+ +LLA GI          P+ ++ S+  + L S+  + +   G    + ++WVE
Sbjct: 27  MSTFVILILLAFGI-------LSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERWVE 86

Query: 462 FISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRG 521
            ISWEPRA VYHNFL+KEEC YLI LA P+M+KSTVVD KTGK  DSR RTSSG FL RG
Sbjct: 87  IISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARG 146

Query: 522 QNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATL 581
           ++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ HYD+  +E+  R GGQRIAT+
Sbjct: 147 RDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATV 206

Query: 582 LMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTL 641
           LMYLSDVEEGGETVFPAA+GN+S++P WNELSECGKGGLSVKPKMGDALLFW++ P+ TL
Sbjct: 207 LMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDATL 266

Query: 642 DPTSLHGACPVIRGNKWSCTKWM 663
           DP+SLHG C VI+GNKWS TKW+
Sbjct: 267 DPSSLHGGCAVIKGNKWSSTKWL 282

BLAST of CmoCh12G011670 vs. ExPASy Swiss-Prot
Match: F4JNU8 (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 6.4e-89
Identity = 165/283 (58.30%), Postives = 211/283 (74.56%), Query Frame = 0

Query: 386 KGKYIKFQGRK-WSTFKLSKIVMAL---LLALGISMFIAFRFFSPTESSHSNLLHRLASV 445
           K K ++ + RK +ST   + +V+ L   L+ +G+ +F +    + T S   +L   + ++
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVILILVGLGIF-SLPSTNKTSSMPMDLTTIVQTI 63

Query: 446 QHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKT 505
           Q R    D      D+W+E ISWEPRAFVYHNFL+ EEC +LISLA P M KS VVD+KT
Sbjct: 64  QERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKT 123

Query: 506 GKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY 565
           GK  DSR RTSSG FLNRG ++IV  IE RI+DFTFIP E+GE LQ+LHYEVGQ+Y+ H+
Sbjct: 124 GKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHH 183

Query: 566 DFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSV 625
           D+  +EF +RKGGQRIAT+LMYLSDV+EGGETVFPAA+GN S +P W+ELS+CGK GLSV
Sbjct: 184 DYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSV 243

Query: 626 KPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH 664
            PK  DALLFW+++P+ +LDP+SLHG CPVI+GNKWS TKW H
Sbjct: 244 LPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFH 285

BLAST of CmoCh12G011670 vs. ExPASy Swiss-Prot
Match: Q24JN5 (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 2.1e-87
Identity = 161/285 (56.49%), Postives = 208/285 (72.98%), Query Frame = 0

Query: 382 MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLAS 441
           MA    +++++Q RK  +       + +LL + I + +     S P  + +S+  + L +
Sbjct: 1   MASKSKQHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTN 60

Query: 442 VQHRAVHSDGLGK-REDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDI 501
           +  ++  S G  +   ++WVE ISWEPRA VYHNFL+ EEC +LISLA P M KSTVVD 
Sbjct: 61  IVRKSETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDE 120

Query: 502 KTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDA 561
           KTG  KDSR RTSSG FL RG +++V  IEKRI+DFTFIPVE+GE LQ+LHY+VGQKY+ 
Sbjct: 121 KTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEP 180

Query: 562 HYDFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGL 621
           HYD+  +EF  + GGQRIAT+LMYLSDV++GGETVFPAA GN S++P WNELS+CGK GL
Sbjct: 181 HYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGL 240

Query: 622 SVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH 664
           SV PK  DALLFW +RP+ +LDP+SLHG CPV++GNKWS TKW H
Sbjct: 241 SVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFH 285

BLAST of CmoCh12G011670 vs. ExPASy Swiss-Prot
Match: A1L4X7 (AT-hook motif nuclear-localized protein 14 OS=Arabidopsis thaliana OX=3702 GN=AHL14 PE=1 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 3.1e-67
Identity = 176/375 (46.93%), Postives = 235/375 (62.67%), Query Frame = 0

Query: 9   SSYFHHH-QHHHQSPTT------------SPTNGLL---PSTHHLSSADATTHVLYPHSV 68
           S YFHH  QHHH  PTT            S  NGL    P   H  +  +++  +YPHSV
Sbjct: 33  SPYFHHQLQHHHHLPTTVATTASTGNAVPSSNNGLFPPQPQPQHQPNDGSSSLAVYPHSV 92

Query: 69  PSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDL--VSSSSLNA 128
           PS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A+++S SS+K +++L  V+  +++ 
Sbjct: 93  PSSAV-TAPMEPVKRKRGRPRKYVTPEQALAAKKLASSASSSSAKQRRELAAVTGGTVST 152

Query: 129 VSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWS 188
            S SSKKSQL ++G  GQ F P ++++A GE                             
Sbjct: 153 NSGSSKKSQLGSVGKTGQCFTPHIVNIAPGE----------------------------- 212

Query: 189 KDVGQKIMLFMQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRT 248
            DV QKIM+F  Q K E+C+LSASG ISNASLRQPA SGGN+ YEG++EI+SL GSY+RT
Sbjct: 213 -DVVQKIMMFANQSKHELCVLSASGTISNASLRQPAPSGGNLPYEGQYEILSLSGSYIRT 272

Query: 249 DIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKE-AGGGIKGD-- 308
           + GGK+GGLSV LS++DG IIGG +G  L AAGPVQVI+GTF  D  K+ AG G KGD  
Sbjct: 273 EQGGKSGGLSVSLSASDGQIIGGAIGSHLTAAGPVQVILGTFQLDRKKDAAGSGGKGDAS 332

Query: 309 TSAGKLSSPTGGTSMSGLRYGSNID-LGGNQVRGNNE------HQ-GI-GESHFLLQ-PR 352
            S  +L+SP     + G+ +   ++  G N +RGN+E      HQ G+ G  HF++Q P+
Sbjct: 333 NSGSRLTSPVSSGQLLGMGFPPGMESTGRNPMRGNDEQHDHHHHQAGLGGPHHFMMQAPQ 376

BLAST of CmoCh12G011670 vs. ExPASy TrEMBL
Match: A0A6J1FHE6 (AT-hook motif nuclear-localized protein OS=Cucurbita moschata OX=3662 GN=LOC111444170 PE=4 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 4.4e-173
Identity = 332/362 (91.71%), Postives = 332/362 (91.71%), Query Frame = 0

Query: 1   MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSS 60
           MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSS
Sbjct: 1   MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSS 60

Query: 61  PLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQL 120
           PLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQL
Sbjct: 61  PLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQL 120

Query: 121 AALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLF 180
           AALGNAGQGFAPQVIDVAAGE                              DVGQKIMLF
Sbjct: 121 AALGNAGQGFAPQVIDVAAGE------------------------------DVGQKIMLF 180

Query: 181 MQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLS 240
           MQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLS
Sbjct: 181 MQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLS 240

Query: 241 VCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT 300
           VCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT
Sbjct: 241 VCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT 300

Query: 301 SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYD 360
           SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYD
Sbjct: 301 SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYD 332

Query: 361 FT 363
           FT
Sbjct: 361 FT 332

BLAST of CmoCh12G011670 vs. ExPASy TrEMBL
Match: A0A6J1FI70 (probable prolyl 4-hydroxylase 3 OS=Cucurbita moschata OX=3662 GN=LOC111444169 PE=4 SV=1)

HSP 1 Score: 614.4 bits (1583), Expect = 6.3e-172
Identity = 301/302 (99.67%), Postives = 302/302 (100.00%), Query Frame = 0

Query: 362 TEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAF 421
           +EVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAF
Sbjct: 85  SEVAAYRNRGLILLKLLRSLMAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAF 144

Query: 422 RFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL 481
           RFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL
Sbjct: 145 RFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECL 204

Query: 482 YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVE 541
           YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVE
Sbjct: 205 YLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVE 264

Query: 542 HGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNF 601
           HGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNF
Sbjct: 265 HGEALQILHYEVGQKYDAHYDFISEEFIRKGGQRIATLLMYLSDVEEGGETVFPAAEGNF 324

Query: 602 SSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW 661
           SSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW
Sbjct: 325 SSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKW 384

Query: 662 MH 664
           MH
Sbjct: 385 MH 386

BLAST of CmoCh12G011670 vs. ExPASy TrEMBL
Match: A0A6J1HRZ8 (AT-hook motif nuclear-localized protein OS=Cucurbita maxima OX=3661 GN=LOC111465589 PE=4 SV=1)

HSP 1 Score: 611.7 bits (1576), Expect = 4.1e-171
Identity = 328/362 (90.61%), Postives = 330/362 (91.16%), Query Frame = 0

Query: 1   MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSS 60
           MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSS
Sbjct: 1   MEPNDNQLSSYFHHHQHHHQSPTTSPTNGLLPSTHHLSSADATTHVLYPHSVPSAAVSSS 60

Query: 61  PLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQL 120
           PLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQL
Sbjct: 61  PLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDLVSSSSLNAVSASSKKSQL 120

Query: 121 AALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWSKDVGQKIMLF 180
           AALGNAGQGFAPQVIDVAAGE                              DVGQKIMLF
Sbjct: 121 AALGNAGQGFAPQVIDVAAGE------------------------------DVGQKIMLF 180

Query: 181 MQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLS 240
           MQQCKREICILSASGL+SNASLRQP+SSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLS
Sbjct: 181 MQQCKREICILSASGLVSNASLRQPSSSGGNVTYEGRFEIVSLCGSYVRTDIGGKTGGLS 240

Query: 241 VCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKEAGGGIKGDTSAGKLSSPTGGT 300
           VCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFI DPTKEAGGGIKGDTSAGKLSSPTGGT
Sbjct: 241 VCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFIIDPTKEAGGGIKGDTSAGKLSSPTGGT 300

Query: 301 SMSGLRYGSNIDLGGNQVRGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYD 360
           SMSGLRYGSNIDLGGNQV GNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYD
Sbjct: 301 SMSGLRYGSNIDLGGNQVGGNNEHQGIGESHFLLQPRGVNLTSPRSNWRTGLDGTTTAYD 332

Query: 361 FT 363
           FT
Sbjct: 361 FT 332

BLAST of CmoCh12G011670 vs. ExPASy TrEMBL
Match: A0A6J1HMV0 (probable prolyl 4-hydroxylase 3 OS=Cucurbita maxima OX=3661 GN=LOC111465625 PE=4 SV=1)

HSP 1 Score: 530.8 bits (1366), Expect = 9.2e-147
Identity = 255/283 (90.11%), Postives = 268/283 (94.70%), Query Frame = 0

Query: 382 MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASV 441
           MAVLKGKYIKFQGRKWSTFKLSKI+M  LLALG+SMFIAFRFFSP ESSHS LLHRLASV
Sbjct: 1   MAVLKGKYIKFQGRKWSTFKLSKIIMVFLLALGVSMFIAFRFFSPPESSHSELLHRLASV 60

Query: 442 QHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKT 501
           QH AVHSDGLGKR DQWVEFISWEPRAFVYHNFLSKEECLYLISLA PYM+KSTVVD KT
Sbjct: 61  QHSAVHSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMEKSTVVDSKT 120

Query: 502 GKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY 561
           GK  DSR RTSSGMFL+RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY
Sbjct: 121 GKSVDSRARTSSGMFLHRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY 180

Query: 562 DFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSV 621
           DF ++EF I+ GGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNE SECGKGGLS+
Sbjct: 181 DFFADEFNIKHGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSL 240

Query: 622 KPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH 664
           KPKMGDALLFW++RP+NTLDPTS+HG+CPVIRGNKWSCTKWMH
Sbjct: 241 KPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMH 283

BLAST of CmoCh12G011670 vs. ExPASy TrEMBL
Match: A0A6J1FI75 (probable prolyl 4-hydroxylase 3 OS=Cucurbita moschata OX=3662 GN=LOC111444173 PE=4 SV=1)

HSP 1 Score: 526.9 bits (1356), Expect = 1.3e-145
Identity = 253/283 (89.40%), Postives = 266/283 (93.99%), Query Frame = 0

Query: 382 MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASV 441
           MAV KGKYIKFQGRKWSTFKLSKI+M  +LALG+SMFIAFRFFSP ESSHS LLHRLASV
Sbjct: 1   MAVSKGKYIKFQGRKWSTFKLSKIIMVFVLALGVSMFIAFRFFSPPESSHSELLHRLASV 60

Query: 442 QHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKT 501
           QH AVHSDGLGKREDQWVE ISWEPRAFVYHNFLSKEECLYLISLA PYM+KSTVVDIKT
Sbjct: 61  QHSAVHSDGLGKREDQWVEIISWEPRAFVYHNFLSKEECLYLISLAKPYMEKSTVVDIKT 120

Query: 502 GKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY 561
           GK  DSR RTSSGMFL RGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY
Sbjct: 121 GKNIDSRARTSSGMFLRRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY 180

Query: 562 DFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSV 621
           DF ++EF I+ GGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNE SECGKGGLS+
Sbjct: 181 DFFADEFNIKHGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNERSECGKGGLSL 240

Query: 622 KPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH 664
            PKMGDALLFW++RP+NTLDPTS+HG+CPVIRGNKWSCTKWMH
Sbjct: 241 NPKMGDALLFWSMRPDNTLDPTSMHGSCPVIRGNKWSCTKWMH 283

BLAST of CmoCh12G011670 vs. TAIR 10
Match: AT1G20270.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 377.1 bits (967), Expect = 3.3e-104
Identity = 186/286 (65.03%), Postives = 222/286 (77.62%), Query Frame = 0

Query: 386 KGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHR 445
           K ++ +FQ RKWST  L  + M  +L + + M +AF  FS P  +  S+ +      +  
Sbjct: 3   KLRHSRFQARKWSTLML-VLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRAA 62

Query: 446 AVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKI 505
              S+GLGKR DQW E +SWEPRAFVYHNFLSKEEC YLISLA P+M KSTVVD +TGK 
Sbjct: 63  TERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKS 122

Query: 506 KDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFI 565
           KDSR RTSSG FL RG++KI+  IEKRIAD+TFIP +HGE LQ+LHYE GQKY+ HYD+ 
Sbjct: 123 KDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYF 182

Query: 566 SEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPK 625
            +EF  + GGQR+AT+LMYLSDVEEGGETVFPAA  NFSS+P +NELSECGK GLSVKP+
Sbjct: 183 VDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPR 242

Query: 626 MGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH-GRFEI 669
           MGDALLFW++RP+ TLDPTSLHG CPVIRGNKWS TKWMH G ++I
Sbjct: 243 MGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYKI 287

BLAST of CmoCh12G011670 vs. TAIR 10
Match: AT5G66060.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 347.1 bits (889), Expect = 3.6e-95
Identity = 167/263 (63.50%), Postives = 205/263 (77.95%), Query Frame = 0

Query: 402 LSKIVMALLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGL-GKREDQWVE 461
           +S  V+ +LLA GI          P+ ++ S+  + L S+  + +   G    + ++WVE
Sbjct: 27  MSTFVILILLAFGI-------LSVPSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERWVE 86

Query: 462 FISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKTGKIKDSRTRTSSGMFLNRG 521
            ISWEPRA VYHNFL+KEEC YLI LA P+M+KSTVVD KTGK  DSR RTSSG FL RG
Sbjct: 87  IISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARG 146

Query: 522 QNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHYDFISEEF-IRKGGQRIATL 581
           ++K +  IEKRI+DFTFIPVEHGE LQ+LHYE+GQKY+ HYD+  +E+  R GGQRIAT+
Sbjct: 147 RDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQRIATV 206

Query: 582 LMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSVKPKMGDALLFWTLRPNNTL 641
           LMYLSDVEEGGETVFPAA+GN+S++P WNELSECGKGGLSVKPKMGDALLFW++ P+ TL
Sbjct: 207 LMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMTPDATL 266

Query: 642 DPTSLHGACPVIRGNKWSCTKWM 663
           DP+SLHG C VI+GNKWS TKW+
Sbjct: 267 DPSSLHGGCAVIKGNKWSSTKWL 282

BLAST of CmoCh12G011670 vs. TAIR 10
Match: AT4G35810.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 330.1 bits (845), Expect = 4.6e-90
Identity = 165/283 (58.30%), Postives = 211/283 (74.56%), Query Frame = 0

Query: 386 KGKYIKFQGRK-WSTFKLSKIVMAL---LLALGISMFIAFRFFSPTESSHSNLLHRLASV 445
           K K ++ + RK +ST   + +V+ L   L+ +G+ +F +    + T S   +L   + ++
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVILILVGLGIF-SLPSTNKTSSMPMDLTTIVQTI 63

Query: 446 QHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDIKT 505
           Q R    D      D+W+E ISWEPRAFVYHNFL+ EEC +LISLA P M KS VVD+KT
Sbjct: 64  QERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKT 123

Query: 506 GKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDAHY 565
           GK  DSR RTSSG FLNRG ++IV  IE RI+DFTFIP E+GE LQ+LHYEVGQ+Y+ H+
Sbjct: 124 GKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHH 183

Query: 566 DFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGLSV 625
           D+  +EF +RKGGQRIAT+LMYLSDV+EGGETVFPAA+GN S +P W+ELS+CGK GLSV
Sbjct: 184 DYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSV 243

Query: 626 KPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH 664
            PK  DALLFW+++P+ +LDP+SLHG CPVI+GNKWS TKW H
Sbjct: 244 LPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFH 285

BLAST of CmoCh12G011670 vs. TAIR 10
Match: AT2G17720.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 325.1 bits (832), Expect = 1.5e-88
Identity = 161/285 (56.49%), Postives = 208/285 (72.98%), Query Frame = 0

Query: 382 MAVLKGKYIKFQGRKWSTFKLSKIVMALLLALGISMFIAFRFFS-PTESSHSNLLHRLAS 441
           MA    +++++Q RK  +       + +LL + I + +     S P  + +S+  + L +
Sbjct: 1   MASKSKQHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTN 60

Query: 442 VQHRAVHSDGLGK-REDQWVEFISWEPRAFVYHNFLSKEECLYLISLATPYMKKSTVVDI 501
           +  ++  S G  +   ++WVE ISWEPRA VYHNFL+ EEC +LISLA P M KSTVVD 
Sbjct: 61  IVRKSETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDE 120

Query: 502 KTGKIKDSRTRTSSGMFLNRGQNKIVSNIEKRIADFTFIPVEHGEALQILHYEVGQKYDA 561
           KTG  KDSR RTSSG FL RG +++V  IEKRI+DFTFIPVE+GE LQ+LHY+VGQKY+ 
Sbjct: 121 KTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEP 180

Query: 562 HYDFISEEF-IRKGGQRIATLLMYLSDVEEGGETVFPAAEGNFSSLPGWNELSECGKGGL 621
           HYD+  +EF  + GGQRIAT+LMYLSDV++GGETVFPAA GN S++P WNELS+CGK GL
Sbjct: 181 HYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGL 240

Query: 622 SVKPKMGDALLFWTLRPNNTLDPTSLHGACPVIRGNKWSCTKWMH 664
           SV PK  DALLFW +RP+ +LDP+SLHG CPV++GNKWS TKW H
Sbjct: 241 SVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFH 285

BLAST of CmoCh12G011670 vs. TAIR 10
Match: AT3G04590.2 (AT hook motif DNA-binding family protein )

HSP 1 Score: 258.1 bits (658), Expect = 2.2e-68
Identity = 176/375 (46.93%), Postives = 235/375 (62.67%), Query Frame = 0

Query: 9   SSYFHHH-QHHHQSPTT------------SPTNGLL---PSTHHLSSADATTHVLYPHSV 68
           S YFHH  QHHH  PTT            S  NGL    P   H  +  +++  +YPHSV
Sbjct: 33  SPYFHHQLQHHHHLPTTVATTASTGNAVPSSNNGLFPPQPQPQHQPNDGSSSLAVYPHSV 92

Query: 69  PSAAVSSSPLEPGRRKRGRPRKYGTPEEALAAKKAATASSHSSSKAKKDL--VSSSSLNA 128
           PS+AV ++P+EP +RKRGRPRKY TPE+ALAAKK A+++S SS+K +++L  V+  +++ 
Sbjct: 93  PSSAV-TAPMEPVKRKRGRPRKYVTPEQALAAKKLASSASSSSAKQRRELAAVTGGTVST 152

Query: 129 VSASSKKSQLAALGNAGQGFAPQVIDVAAGEESGAVVGWRLKFMGNLNDREIEGCMAIWS 188
            S SSKKSQL ++G  GQ F P ++++A GE                             
Sbjct: 153 NSGSSKKSQLGSVGKTGQCFTPHIVNIAPGE----------------------------- 212

Query: 189 KDVGQKIMLFMQQCKREICILSASGLISNASLRQPASSGGNVTYEGRFEIVSLCGSYVRT 248
            DV QKIM+F  Q K E+C+LSASG ISNASLRQPA SGGN+ YEG++EI+SL GSY+RT
Sbjct: 213 -DVVQKIMMFANQSKHELCVLSASGTISNASLRQPAPSGGNLPYEGQYEILSLSGSYIRT 272

Query: 249 DIGGKTGGLSVCLSSADGHIIGGGVGGPLMAAGPVQVIVGTFITDPTKE-AGGGIKGD-- 308
           + GGK+GGLSV LS++DG IIGG +G  L AAGPVQVI+GTF  D  K+ AG G KGD  
Sbjct: 273 EQGGKSGGLSVSLSASDGQIIGGAIGSHLTAAGPVQVILGTFQLDRKKDAAGSGGKGDAS 332

Query: 309 TSAGKLSSPTGGTSMSGLRYGSNID-LGGNQVRGNNE------HQ-GI-GESHFLLQ-PR 352
            S  +L+SP     + G+ +   ++  G N +RGN+E      HQ G+ G  HF++Q P+
Sbjct: 333 NSGSRLTSPVSSGQLLGMGFPPGMESTGRNPMRGNDEQHDHHHHQAGLGGPHHFMMQAPQ 376

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LN204.6e-10365.03Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana OX=3702 GN=P4H3 PE=2 SV=... [more]
F4JZ245.1e-9463.50Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana OX=3702 GN=P4H10 PE=3 S... [more]
F4JNU86.4e-8958.30Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana OX=3702 GN=P4H8 PE=3 SV=... [more]
Q24JN52.1e-8756.49Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana OX=3702 GN=P4H5 PE=2 SV=1[more]
A1L4X73.1e-6746.93AT-hook motif nuclear-localized protein 14 OS=Arabidopsis thaliana OX=3702 GN=AH... [more]
Match NameE-valueIdentityDescription
A0A6J1FHE64.4e-17391.71AT-hook motif nuclear-localized protein OS=Cucurbita moschata OX=3662 GN=LOC1114... [more]
A0A6J1FI706.3e-17299.67probable prolyl 4-hydroxylase 3 OS=Cucurbita moschata OX=3662 GN=LOC111444169 PE... [more]
A0A6J1HRZ84.1e-17190.61AT-hook motif nuclear-localized protein OS=Cucurbita maxima OX=3661 GN=LOC111465... [more]
A0A6J1HMV09.2e-14790.11probable prolyl 4-hydroxylase 3 OS=Cucurbita maxima OX=3661 GN=LOC111465625 PE=4... [more]
A0A6J1FI751.3e-14589.40probable prolyl 4-hydroxylase 3 OS=Cucurbita moschata OX=3662 GN=LOC111444173 PE... [more]
Match NameE-valueIdentityDescription
AT1G20270.13.3e-10465.032-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT5G66060.13.6e-9563.502-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT4G35810.14.6e-9058.302-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT2G17720.11.5e-8856.492-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
AT3G04590.22.2e-6846.93AT hook motif DNA-binding family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 739..741
NoneNo IPR availableGENE3D3.30.1330.80coord: 170..282
e-value: 3.8E-19
score: 70.9
NoneNo IPR availableGENE3D2.60.120.620q2cbj1_9rhob like domaincoord: 459..663
e-value: 1.4E-74
score: 252.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 280..304
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 53..103
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 21..36
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 290..304
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..36
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 88..103
NoneNo IPR availablePANTHERPTHR31500:SF68AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN 14coord: 664..715
NoneNo IPR availablePANTHERPTHR31500:SF68AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN 14coord: 1..363
NoneNo IPR availableSUPERFAMILY117856AF0104/ALDC/Ptd012-likecoord: 659..721
NoneNo IPR availableSUPERFAMILY117856AF0104/ALDC/Ptd012-likecoord: 171..274
IPR006620Prolyl 4-hydroxylase, alpha subunitSMARTSM00702p4hccoord: 466..663
e-value: 1.7E-55
score: 200.4
IPR044862Prolyl 4-hydroxylase alpha subunit, Fe(2+) 2OG dioxygenase domainPFAMPF136402OG-FeII_Oxy_3coord: 546..663
e-value: 1.2E-18
score: 67.7
IPR005175PPC domainPFAMPF03479PCCcoord: 654..718
e-value: 6.3E-8
score: 33.0
coord: 171..272
e-value: 9.6E-19
score: 67.8
IPR005175PPC domainPROSITEPS51742PPCcoord: 155..296
score: 25.339685
IPR005175PPC domainCDDcd11378DUF296coord: 172..272
e-value: 6.9668E-15
score: 69.5366
IPR039605AT-hook motif nuclear-localized proteinPANTHERPTHR31500AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN 9coord: 664..715
IPR039605AT-hook motif nuclear-localized proteinPANTHERPTHR31500AT-HOOK MOTIF NUCLEAR-LOCALIZED PROTEIN 9coord: 1..363
IPR005123Oxoglutarate/iron-dependent dioxygenasePROSITEPS51471FE2OG_OXYcoord: 542..664
score: 11.675326

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh12G011670.1CmoCh12G011670.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005634 nucleus
molecular_function GO:0051213 dioxygenase activity
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0003680 minor groove of adenine-thymine-rich DNA binding
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen