HG10012396 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10012396
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUMP-CMP kinase
LocationChr01: 20759794 .. 20770111 (+)
RNA-Seq ExpressionHG10012396
SyntenyHG10012396
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAAATGGAAAAAAATGGGTGCAGGGGAAGGTGGCGCTGGTGACGGGCGGGGACTCGGGCATAGGGCGGGCAGTGTGTCATTGTTTCGCTTTAGAGGGCGCAACCGTGGCCTTCACCTACGTCAAGGCCCAGGAAGACAAAGACGCCAACGACACTATTGAAATGATAAAGAAGGCCAAATCCAGCGCAGCCAAGGACCCATTAGCCATACCGGCGGACTTGGGGTTCGATGAAAACTGCAAGAGGGTGGTGGACGAGGTGGTCAAAGCCTACGGTCACATCGACATTTTGATCAACAACGCCGCCGAGCAGTACAAATCCACTTCTGTTGAAGACATCGACGAGGACAGACTCCTCAGAGTGTTTCGAACCAACATTTTCTCCTACTTCTTCACCACCAGGTAAATCAATTAGTAATCACAACACAATTGGAGATTTCATAATAATAATAATGGGTGTTTTGTGTTTTATGTCAGGCATGCATTGAAGCATATGAAGGAAGGGAGCTCCATAATCAACACCACCTCAGTGAATGCCTATAAAGGCAATGCTAAGCTGCTTGATTACACTGCCACAAAGGGGGCGATTGTGTCGTTTACCAGAGGCCTAGCACTGCAGCTAGCCACCAAAGGGATAAGGGTTAACGGCGTGGCGCCGGGGCCGATATGGACGCCGTTGATTCCGGCCTCCTTTGATGAGGAAGAGACAGCTAATTTTGGGTCTCAGGTGCCAATGAAGCGAGCTGGGCAACCTATTGAAGTGGCTCCCTCATATGTCTTCCTTGCCTGTAATGTTGATTCCTCTTATATTACTGGCCAAGTCCTTCACCCTAATGGTAAGCTTTTCAAAATAAGAATTAAAAAAGGGAAAAAAAAAAAACTACATTACTCAATTGAAAATTCAATAACTAAATTGTTTTGTTTTGTTTTCATTTATTCATGTCTTTTGGAATTTGTTTTTATTTGTTTTTATTTTTTTGGTCTTCAGGTGGGACTGTAGTGAATGCTTGAGGAGGGGTCTGTTACGTGGGCTTGCCATGGAGGAGGTTGAGAGCTGGAGAAGAACGATATGGTTGTATAGACTTTTGTTAGTGGTCTGGGAAAGATGATGTAAATAATGTAAAATATCCGAACTTCAAATAAAGTTTGTCCTAGTTTTATATTACTTTAAATTTTATATCTAATAATGGAAAATCCTGTTGCAGGTTGGAATTGTAACAAACATACATGTTGTATATAACATATGAATTGGTTAGCTTCTTTAATATTGTTTCTTCTTTACCATTGATATGAAATCTTGGAAATGAGGCCACATTCCCACTAATCACACACAGGAAATTTGCATAATGAGTAAATTTTTAGATCCAGTCTAGAGTTTTCTTCATATATTCGAGGCCGGGACATTGTATGATAATTAGATTATGATCGATTCAGTCTGACTGATGTTCCCCTTCCATCTCAAGAGCTAATAATCATAAAATTGTTACACAGTATTATTGGGTAAATTGATAACATACCATATGTGCTTACTGTGTTGTCAATTCCAGATGGCTTTCCATGAATTATCTTTTCACCTTTGAAAGGCCCATAATTTGTTCAACATGTCAAGTTTTTTTTCTCCAAAACTTGACCATCCTTGGTGCGCCCTGTCCACGCTTACAGAATAGGATAAAGCAATTAGAGCAGCTGGAGTTTTAAGGGTATCATCATTCCTAATACAAATGGATTTGATCTCGTGAGAAAATAAATTAACCCAGAAATTTTAAGTGAATTATTTTGATTGACAAAGTAACAAGACATTAAACTGCAGAACACATTCTGAATTAAGATTCCATTCATAATGACTAGAAAAATAACAGAAACTAACAGTATTTTTTTTACTATTTTTTTTTAAATAGTTACACATTTTTTTATTTGTGAAAAATTAATTTTTTTTTTAATATAAATCATGTTAGTAAATAAATTAATAATAGTTTATTAGATAACTACAAATTTGTAAAGCATATAGTATCAACTACATTTTAAACAATTATTTAATTAAATTAACAATATTAATTTAATATTTTTAGAAGAAAATATGGGCATGAACCTGCTGCACAAAGATATAAATTAAGTTTGTTCCAGCACATATGCCAAATTTTTTTAAAAGGTATAAAAAAATATAGATTATGGTACATATTTTGAAAATTTAAATACTAAAATAGCGTTGGTACTAAAGGGAGACATTGTATATTTAGTGGACAAAATTTAATGATTCAATGTGACACAACATCGCTGAAGGTGAATAGAATAATGTTATTAGAATAACTGCAGGTAGAACAAGCATATTAACCCTCTTTGCGACTCTATTTTATCTCGTGAATTTATCTTCTTATGCTATCTATTTGCTATGTCATATCAAATAAGTTAATATTTGTTTCTACACCAAAGTCATCTAGCTTTTGTTTTTAAACAAAAGTCATCTAGTCAATATTAATGTAAAATTAAATACATTAAATACTCTTACCTTTCAAAGTTCAACGGCTTGTTGATTTTATCTCTACCATTTTAAAATTGCATAGTGATAAAATCGAACAACTTCAAAGTGGATAGTAAAAGTTTAAATCCATTTTTGCGATTTCTATAGTAGAATTTTTCCTTTTATTTTGTGTGATTAAAATTTAAAACAAAAATGTTAGTATGTACGGGGAGAAAATGGAAAATAACCTAATCCCATTCCCAAAATCTAAGTCCTTCTGAAGCTGATCCGTCCCTTCCGCGAGATCACAACGTGGATCCCACTCTATGTATCTTTCATGGCGTTCCTGTCCGCTTCATTTATGATCAACCTCCCAAGGAGACCTTGACAGCAGGTTGCAGCTACGTGGGCACCATCGATTTTCAACAACGCACCCTAGGTGTTCGACGAAATATCTCATGTGAAAAATTGAATCCCATGAACGATCAGAATCACATCAACTAGAATACAACTTCTGGAAGACAGCCGGCGATCAGAAGATGTGGAGACGAGTAGCTTCAGTCTCTCATTTCACGTTTGCCCATAAGTCCATTGTCAACAATAAGGTATTTACATCTACTTTGATCTCTCGAGCCAGCCAGTAATGGGCGGCTGCTGTCAATTCGTTGATTACTATTTACGTTAATAAACATGTATATTGAGAGGGTAGAAATGTTCTAACCCTTCTTCACTTTCTTCCGGGAACAACTTATTTCATTGATGTGTTTGGGTTTTGATTAGAAGAGTAATGCCTACTTTGTACGCTGAAAATGGGTAGGTCTTACAACCATACGCATTCTTCTTTCGAGTTAAAAGCCTTTGAAATCATTCACTCTCGATCTTAGCGAACTGGAAGTCTTTCTTGTAACCTTGGTAGCTTTTGGTTGGTTTGAATTTCATATTCTCCCATTCTTTCGTAACATCAATTTTCTTGGCTAAAAGAACCTCTCGCAATTGAGAACCGGAAAGAGACAAGCTTATCTTTTCTTGATTATGTATGGCATGAAAATATTTCCTTATATGTTGAGTTAAAGCTTAAACAATATTGGCTTCTCCAAGGTCATATTTGCTTTGTGGGCCAGACTTTTTTCGGCAATGTAGAAAGTTGGATTAAGTATATACGAGATGGATTTCAGCAATAACAGGAGAATGAATATTCGTGTGAGGAGTCTGGTTGCACAATCTATCGCAGGCCATGTCTGTGAGGATTTGACGTTGCACAGTCTAGAACTTAGCAGATTATAAATGACTGGTTTGTGATTGATTAGATTAAAATGCTAATTCAGATGAAATTGGGGAATTTTTAGAGAAGGTGTGGACGACGCATAGATGTCACGTCAATGAAACCTAGGATCTTGCAGAAGTGAAATCTCTTGATACCATATGCTGGAATGAAGTTCGTCTGACTTCGGAACAAGCAGGGTATTAACAACCTTGACACCTTTGTCATATTCTTCGTTTCCTCCATATTTTTGGAAGTTTATGAATTCTTTGAATTCTATTGTTCACCATACTCCATTTCTCTAGTTTCATGAAATACGTCTCACTGAAACTTGTGCTGTTAAATTGCCCCATATAAATTGTGCTCACTTTTTGTCTCTCAAACTTTATAACCAGTTTCACCTCTTTTATTCCTGATACCAATGCAGGATTTGTGTAAATTCAAGTTTTGGGAAGCATTTACCACTGAAATACCCAGAAAGGTAATATATTCATTTTTTATCTCGTAGTCAGCTTAACTTTCTAGGTGTTTAACTGCTCAGATATTTGTAGTTTCTTTATGTTTGCTTTAATTTTGTATCAACTGTAAAGCATGCACCTCAAGTTCTGTAGTTTTCATTATCTGTGGAAATGAACATTCAAACCTATTATCTACCTGAGTCCTGAGATGAGTAGAAAAGCAAGGTGCAATATGAATTTGCTTCGCCCCTCACTATTGTCTCCGGAATTGTCATATCATGTCACAGTTTCTCTCTTTATCAAGTGTAACTGTGTGTCAGTAATATTTTTTCATCTTTATGATGTGAAGAACTGAAAATTAGTGGTCACACGTTGTAGCATCTACTCCCACATGCGAGATTTAAATGCACGTGTGTGTTTATTTATGTCATTCTGTCTAATACATTTGGTGTAAAGTTTTAGATTTTTTTAATTTGAGTGCAGGAAAAAGGTACCTTTCAAAGAGATAAAACACCATTCATAACTTTTGTATTAGGTAACGTTCACTGCTTTTTCCTCCTTTCCATTTTTGGTGACAATGAAGGAAGCTGTATTTATGTTTGTGGTCAATTATATATAGTTTTTCATAATATAGAAGCCTAGAAGGTGACTAAGAAGCCAAGGGAGAACATAAATTTATCTGTTGAAGTTCTTTTCCTCAAAGGTTTATGGCTTGAAGTATCCAAAAGTAAAATGTCATGGCTCTAATAATGACATGTGATCCTTGCTTAATCAATTTTAAACTTAAACCTCCATGTTTGTTTCTCAGTGGGGGTTGTTCAATATTCCAATTATGTGTATTGATCTTTGTCTTCGTATGAAGTTGGATGAGGAGCTGAAAGAAATAGGGGTGTAATAGGTGTCTTAAGAGTTTCTTAATTCATTGTCATCACAGCTCCTTGTTAGTCCACCATTTATTTGCGCATGTTGTTAGTTAAAAATTTTATATAACTAGTGGTGGAACTATTGATTTTTTTTTCGTTGAGAGGTTAATAAATTCTTGTATGATGTAAATAAATATTTAAGAAAATAAAAATCCATAATTTAGAATAGCTAATAGTTTGAAGATGTTAATTGCATATAGTTCTAAACCTATATTCAAGAGAATGTTTAGTAGTTGCTGATAAATTTAGTACTCATTACTCAACACCTGCACACCTGCTAATGAGAGAATATTTTCTTATTATATTAGAAAAATAAATAAAATATTCAAGGATACTGTAAATGTCAAATTCTCATGCCCACAGTACTGTCGAAGTTGTATGATTAATCATTAAACAAAATGGGACGTCAGGTAAGATGTATGTGAATATAAACTAGTGTCGTATAGTAATGCAATAATAATCCTTTGAGGTTTCATATTTGTTCACCTTCTCTATCTTGAGCTCTCATGGATTTGTTTGAATAAATGGTTCTAGTAAGTAACGAAAAACAACAAAACGAGGAGAACAAAAGATGGAAAGAAGTGAAAATTCCACGAAGGATCAAGTGCTTCAGTTGTGTTTTATGCATTGAAGTGGAAACATAATAAGTAAAACTAACCACTATGCCACTAACTTTAAGATTCTCCCTTAATCAACCCCACTAAGCCACTAGGCCAGTTCCTCCTAATTAAGGCACTACCCTTAATAAAAGCCACTAACATTAATTTAAAGAAAAAGACCACTTACTTGCGAATTAAGTTTGCACTAATTTTTAACAGGAAAGATAGAAGAAAAGAAGTCATTAATATTCAAATGATATGACCTAGATGCTTATATGAATATACTAAAAGTGGCAATATTATCAGGTGGTCCTGGAAGTGGTAAAGGTACACAATGTATGAAGATAGTTGATAACTTTGGATTTACACACTTGAGTGCTGGAGATCTATTAAGGAGGGAGATAGCTTCTAATAGTGCAGATGGGTAATTGTGCTTCGTTTGCCTTGTCGAGAAAACTTTCATATCCATCTAGCCTCTGATGATGTCTGTTGATTGCGAATTTCTATTTTCAGTACCATGATTCTCAACACAATTAAGGAAGGAAAAATAGTTCCTTCAGAATTGACGATCAGACTTATTCAAAAGGAAATGGAATCCAGTGACAATTATAAGTTTCTCATTGATGGGTTTCCACGGAGTGAGGAAAATCGGATAGCATTCGAACAAATTGTGAGTTTCTTCTATACTCTTTCCATTCAGTTCTTTATTTCCTGTGGTGAAAATGTTTTTCCTTTAAGAACTTTATGAAACAGATTCCACATTTTTTCAATTTAACATATATGCTTCAGATACCTGCTTGGTGTATAATACAATTCTGTCATTCATGATCCAGATCGGGGCAGAACCAGACATCGTACTCTTCTTTGACTGTCCAGAAGATGAGATGGTGAAGCGGGTGCTCAATCGTAATGAGGTAGCTGACATGTGGTCGATCTTTCCTGCATCCAAATCGTTATCATCATTAGTATCTCAAAATGGCATATATTGCAGGGAAGAGTTGATGACAACATTGACACAATCAAGAAAAGGCTGAAAGTTTTTGCCGCATTAAATCTTCCTGTAGTCAAATATTATCTGGAGAAGGGAAAGCTTTACAAGGTAATTTTGGTTGAGTTTCGAACTCAAATCCTTTTAGTGTTGGAGCCTTGCTGCTATTAAATGGATGCTATCATTGATTGACTTGCCTTAACCACGTTTTTTTATTATTCTTTCCTTCTTGTGCAAACTTCTTGTTTGAGCTTATCGTATACAAATTTATACCTTCATTAACCGGAAAAATGTTGCAGATAAACGCAGTTGGAACAGTGGATGAAATATACAAACAAGTTTATCCAGTTTTTGCATCATTCAATTTTGAGGTATTTGGTTTCTGTTTCCACTTTTAGTTCTTCTGTTTTCCACAAAAGTATTTTGTATATGATTGGCTTGCATGTTTTAGCAGTAAAAACCTTTTAAACTCGAACCTTACAGAATCCCATTTGCTCCAATCATCCGTGCTTTCAGTTATGACATCAACTCAGTGCTGTAGTCACTAGTCAGTTGTTGATCAGCTTCCAAATATTGTTTAATCACTTTTATAAGCCATTAATTTTGTTCAGCAGCGGGTAAGAGCATGAAAAGTGTTGATTAATCGCCTTATGTCATGGATAACATGGAGCATGATCCCGGTTTGTGCTTGGAGTTACTCTTCATTTCAGTTGAAGATCCTTGTGAGTTCATTTTCCTGATTACTGTTTTTTCAAGAATGTAATATAACTGAACGTCAAGTACAGCATTGGGAATTTCTTATTGGTCCAAAAAGTGATCTCATTAATCATTCCCCTCCCCCCCCCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAAATCCTGTTGGCTCATATTCTTGATATCAACAGGTGGCATATGTTCCCCAACTTCAGATGGAGGGAACTCTACACACAGAACTAATAAATATGCTTAGGAATGAGCACACATTCTCAAAAAATTGATAAAATGTTGATATTAATGGTGTTACTGGCAAAAATATAAGCTAAGTTAAAGGATGCTGAGACATGGTTTGGGTTTGTTCATTAGAAGTGTGGGAAAGGAATAGAAAGAATAACTCGTTAGAGGTGGGTGGCAAATGAGTCTAGTTGAACTAATTAAAGACATACTTGTTTGAATCTTCCCCCCCCCCCCCCCCCTTTTTGTTTTCTTTACTCTTTAATTTTACCTAATTTATTTCGAACCTCAAACCTTAGACTTTACATAACCAATGATTTACCATTCTACTAGTTAGGTAAATAATAAACTCTGCTAGGTAAATAACAAACTCTGCTACTACTACATAATATTATATATACATAACGAAAAAAATAAAATAAACTCAGTTTAATCTTTAGAATTACTAAGTTTATAATATGTTTCAAGAGTCAAGCATGTGACCAATAAATGCAAAAGGGAAAAATATGAAAAAAGCGGGTTAAATTACGAATCTCTAAAGTGTCTATTATGTTTTTTGAACTTGAATAATATAATATTTAAAGAATAATATTAATTTTTTGTTTTAAAATTTTTTTGAGTAATATTTGGGTGCGGGTATCCCTTCAATCATAAAAGAATTTTTTTTTGTTTTTTGCAGCTATTTTAAAAAATGTAATATTTGGGAGATGCATCCATGTTTAAACGTCTCATTTTCATCCCTCACTAAAAAAGAATCAAAATATTTCTAAAAAGGTATAAAAAGAATTTGTGATATGATAAATTATGATTGGACAAAAATAGCCGGAAGCACCCAATTATTTGTCTTTTAAAAAATGATTTGAACCCAATTTTGTTGGCATAAAATTACGATAGGGTGGTGGGTGTGGCCTGCGAGAACGTATGAAACAAACCCCAAATGAGCGCTTCGGAACCGCCATTTTCCCTCTTCTTCTCCAACGGTTTGTTTGCCCAATGCTGATTTCTCTGCAACTGTCAATTTCACCCACAACTTCGCTTCTTCTCTTCTGTTCTTCTAAGCCCAAGAAATCCAAGAAAGAGAGAAGGAAACTTCTTCAGCAAAAACTTATTCGCATAAACAAAGCCAAAGAAACCACTGATCTCTCCTTCCCTAAATCCTCATCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAAACCAAAATTCAAGCCCTTGATGCTCTTCTCACCGACCTTGAAGCCTCCGTCGACAATGGCCTCCTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAAGTTTGCTACCAATTGCGAGCCATCCACCATGGTATTCGGCTTCATCGCCTAATACCCACCAATTTTTTACGGAGAAATGTGGGTGTTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAATGCACACCAGGTGTTCGATGAAATGTGTAAACGTAATATCTCTGCTTTTGCTTGGAATTCTCTTATTTCTGGATATGCTGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTTGATTCAAATCGGAGAGGCGGTGCACCGCCATGTCGTTCGTTCAGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTCTAGTTGATATGTATTCCAAATGTGGTAGCATTGTGAGGGCTAGAAAAGTTTTTGATCAGATTGTCTGTAAGGATACAGTTTCTTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCGGATTCGGTTGCTTTGTCCACCATTCTTTCTAACATTTCATCATTGAAATTCAAGTTACACATTCATGGATGGGTGATTCGGCATGGAGTGGAATGGAATTTGTCCATTGCTAACTCCTTGATTGTCATGTATGCCAATTGTGGTAAGATTAACAGAGCAAAATGGTTGTTCCAGCAGATGCCTCAAAAAGACATAGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATACCCCAGAAGCTTTGACATATTTTGAAGTGATGGAAAGCCTTGGTGTTTTGCCAGACTCTGTAACATTTGTGTCATTGTTGTCAACTTGTGCCCATCTGGGCTTATTGAAGGAAGGGGGAAAGTTGTATTCTTTGATGAAGGGGAAGTATGGAATAAGACCAACCATAGAACATTATGCTTGTATGGTGAATCTTTATGGGAGAGCAGGGCTGATTGAAGAAGCTTATAGAATCATAACGAAAGCAATGAAGATCGAGGCAGGTCCGACTGTATGGGGGGCGCTGTTGTATGCGTGTTATCTCCACAGCAATGTAGATATCGCCGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCGGATAATGAACTCAATTTTGAGCTTCTGATGAAGGTTTATGGCAATGCCGGGAGATTGGAAGACGAGAAGAGAGTGAAATTAATGATGGCGGAACGAGGACTGGATTCGTAGTGTCGATAATATGTGAAATCTATTCACGCACATGGATGAATGATGAATTCATCTGGATAGATAACTGGTTTATTGTGAAGTTAATTTGA

mRNA sequence

ATGAAAAATGGAAAAAAATGGGTGCAGGGGAAGGTGGCGCTGGTGACGGGCGGGGACTCGGGCATAGGGCGGGCAGTGTGTCATTGTTTCGCTTTAGAGGGCGCAACCGTGGCCTTCACCTACGTCAAGGCCCAGGAAGACAAAGACGCCAACGACACTATTGAAATGATAAAGAAGGCCAAATCCAGCGCAGCCAAGGACCCATTAGCCATACCGGCGGACTTGGGGTTCGATGAAAACTGCAAGAGGGTGGTGGACGAGGTGGTCAAAGCCTACGGTCACATCGACATTTTGATCAACAACGCCGCCGAGCAGTACAAATCCACTTCTGTTGAAGACATCGACGAGGACAGACTCCTCAGAGTGTTTCGAACCAACATTTTCTCCTACTTCTTCACCACCAGGCATGCATTGAAGCATATGAAGGAAGGGAGCTCCATAATCAACACCACCTCAGTGAATGCCTATAAAGGCAATGCTAAGCTGCTTGATTACACTGCCACAAAGGGGGCGATTGTGTCGTTTACCAGAGGCCTAGCACTGCAGCTAGCCACCAAAGGGATAAGGGTTAACGGCGTGGCGCCGGGGCCGATATGGACGCCGTTGATTCCGGCCTCCTTTGATGAGGAAGAGACAGCTAATTTTGGGTCTCAGGTGCCAATGAAGCGAGCTGGGCAACCTATTGAAGTGGCTCCCTCATATGTCTTCCTTGCCTGTAATGTTGATTCCTCTTATATTACTGGCCAAGTCCTTCACCCTAATGGTGGTCCTGGAAGTGGTAAAGGTACACAATGTATGAAGATAGTTGATAACTTTGGATTTACACACTTGAGTGCTGGAGATCTATTAAGGAGGGAGATAGCTTCTAATAGTGCAGATGGTACCATGATTCTCAACACAATTAAGGAAGGAAAAATAGTTCCTTCAGAATTGACGATCAGACTTATTCAAAAGGAAATGGAATCCAGTGACAATTATAAGTTTCTCATTGATGGGTTTCCACGGAGTGAGGAAAATCGGATAGCATTCGAACAAATTATCGGGGCAGAACCAGACATCGTACTCTTCTTTGACTGTCCAGAAGATGAGATGGTGAAGCGGGTGCTCAATCGTAATGAGGGAAGAGTTGATGACAACATTGACACAATCAAGAAAAGGCTGAAAGTTTTTGCCGCATTAAATCTTCCTGTAGTCAAATATTATCTGGAGAAGGGAAAGCTTTACAAGATAAACGCAGTTGGAACAGTGGATGAAATATACAAACAAGTTTATCCAGTTTTTGCATCATTCAATTTTGAGGGTGGTGGGTGTGGCCTGCGAGAACGTATGAAACAAACCCCAAATGAGCGCTTCGGAACCGCCATTTTCCCTCTTCTTCTCCAACGGTTTGTTTGCCCAATGCTGATTTCTCTGCAACTGTCAATTTCACCCACAACTTCGCTTCTTCTCTTCTGTTCTTCTAAGCCCAAGAAATCCAAGAAAGAGAGAAGGAAACTTCTTCAGCAAAAACTTATTCGCATAAACAAAGCCAAAGAAACCACTGATCTCTCCTTCCCTAAATCCTCATCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAAACCAAAATTCAAGCCCTTGATGCTCTTCTCACCGACCTTGAAGCCTCCGTCGACAATGGCCTCCTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAAGTTTGCTACCAATTGCGAGCCATCCACCATGGTATTCGGCTTCATCGCCTAATACCCACCAATTTTTTACGGAGAAATGTGGGTGTTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAATGCACACCAGGTGTTCGATGAAATGTGTAAACGTAATATCTCTGCTTTTGCTTGGAATTCTCTTATTTCTGGATATGCTGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTTGATTCAAATCGGAGAGGCGGTGCACCGCCATGTCGTTCGTTCAGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTCTAGTTGATATGTATTCCAAATGTGGTAGCATTGTGAGGGCTAGAAAAGTTTTTGATCAGATTGTCTGTAAGGATACAGTTTCTTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCGGATTCGGTTGCTTTGTCCACCATTCTTTCTAACATTTCATCATTGAAATTCAAGTTACACATTCATGGATGGGTGATTCGGCATGGAGTGGAATGGAATTTGTCCATTGCTAACTCCTTGATTGTCATGTATGCCAATTGTGGTTTATGGCAATGCCGGGAGATTGGAAGACGAGAAGAGAGTGAAATTAATGATGGCGGAACGAGGACTGGATTCGTAGTGTCGATAATATGTGAAATCTATTCACGCACATGGATGAATGATGAATTCATCTGGATAGATAACTGGTTTATTGTGAAGTTAATTTGA

Coding sequence (CDS)

ATGAAAAATGGAAAAAAATGGGTGCAGGGGAAGGTGGCGCTGGTGACGGGCGGGGACTCGGGCATAGGGCGGGCAGTGTGTCATTGTTTCGCTTTAGAGGGCGCAACCGTGGCCTTCACCTACGTCAAGGCCCAGGAAGACAAAGACGCCAACGACACTATTGAAATGATAAAGAAGGCCAAATCCAGCGCAGCCAAGGACCCATTAGCCATACCGGCGGACTTGGGGTTCGATGAAAACTGCAAGAGGGTGGTGGACGAGGTGGTCAAAGCCTACGGTCACATCGACATTTTGATCAACAACGCCGCCGAGCAGTACAAATCCACTTCTGTTGAAGACATCGACGAGGACAGACTCCTCAGAGTGTTTCGAACCAACATTTTCTCCTACTTCTTCACCACCAGGCATGCATTGAAGCATATGAAGGAAGGGAGCTCCATAATCAACACCACCTCAGTGAATGCCTATAAAGGCAATGCTAAGCTGCTTGATTACACTGCCACAAAGGGGGCGATTGTGTCGTTTACCAGAGGCCTAGCACTGCAGCTAGCCACCAAAGGGATAAGGGTTAACGGCGTGGCGCCGGGGCCGATATGGACGCCGTTGATTCCGGCCTCCTTTGATGAGGAAGAGACAGCTAATTTTGGGTCTCAGGTGCCAATGAAGCGAGCTGGGCAACCTATTGAAGTGGCTCCCTCATATGTCTTCCTTGCCTGTAATGTTGATTCCTCTTATATTACTGGCCAAGTCCTTCACCCTAATGGTGGTCCTGGAAGTGGTAAAGGTACACAATGTATGAAGATAGTTGATAACTTTGGATTTACACACTTGAGTGCTGGAGATCTATTAAGGAGGGAGATAGCTTCTAATAGTGCAGATGGTACCATGATTCTCAACACAATTAAGGAAGGAAAAATAGTTCCTTCAGAATTGACGATCAGACTTATTCAAAAGGAAATGGAATCCAGTGACAATTATAAGTTTCTCATTGATGGGTTTCCACGGAGTGAGGAAAATCGGATAGCATTCGAACAAATTATCGGGGCAGAACCAGACATCGTACTCTTCTTTGACTGTCCAGAAGATGAGATGGTGAAGCGGGTGCTCAATCGTAATGAGGGAAGAGTTGATGACAACATTGACACAATCAAGAAAAGGCTGAAAGTTTTTGCCGCATTAAATCTTCCTGTAGTCAAATATTATCTGGAGAAGGGAAAGCTTTACAAGATAAACGCAGTTGGAACAGTGGATGAAATATACAAACAAGTTTATCCAGTTTTTGCATCATTCAATTTTGAGGGTGGTGGGTGTGGCCTGCGAGAACGTATGAAACAAACCCCAAATGAGCGCTTCGGAACCGCCATTTTCCCTCTTCTTCTCCAACGGTTTGTTTGCCCAATGCTGATTTCTCTGCAACTGTCAATTTCACCCACAACTTCGCTTCTTCTCTTCTGTTCTTCTAAGCCCAAGAAATCCAAGAAAGAGAGAAGGAAACTTCTTCAGCAAAAACTTATTCGCATAAACAAAGCCAAAGAAACCACTGATCTCTCCTTCCCTAAATCCTCATCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAAACCAAAATTCAAGCCCTTGATGCTCTTCTCACCGACCTTGAAGCCTCCGTCGACAATGGCCTCCTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAAGTTTGCTACCAATTGCGAGCCATCCACCATGGTATTCGGCTTCATCGCCTAATACCCACCAATTTTTTACGGAGAAATGTGGGTGTTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAATGCACACCAGGTGTTCGATGAAATGTGTAAACGTAATATCTCTGCTTTTGCTTGGAATTCTCTTATTTCTGGATATGCTGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTTGATTCAAATCGGAGAGGCGGTGCACCGCCATGTCGTTCGTTCAGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTCTAGTTGATATGTATTCCAAATGTGGTAGCATTGTGAGGGCTAGAAAAGTTTTTGATCAGATTGTCTGTAAGGATACAGTTTCTTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCGGATTCGGTTGCTTTGTCCACCATTCTTTCTAACATTTCATCATTGAAATTCAAGTTACACATTCATGGATGGGTGATTCGGCATGGAGTGGAATGGAATTTGTCCATTGCTAACTCCTTGATTGTCATGTATGCCAATTGTGGTTTATGGCAATGCCGGGAGATTGGAAGACGAGAAGAGAGTGAAATTAATGATGGCGGAACGAGGACTGGATTCGTAGTGTCGATAATATGTGAAATCTATTCACGCACATGGATGAATGATGAATTCATCTGGATAGATAACTGGTTTATTGTGAAGTTAATTTGA

Protein sequence

MKNGKKWVQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKSSAAKDPLAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNIFSYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKGIRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYITGQVLHPNGGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIRLIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEGRVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEGGGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKKERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVDNGLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMENAHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVEWNLSIANSLIVMYANCGLWQCREIGRREESEINDGGTRTGFVVSIICEIYSRTWMNDEFIWIDNWFIVKLI
Homology
BLAST of HG10012396 vs. NCBI nr
Match: KAG7030831.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 927.9 bits (2397), Expect = 6.2e-266
Identity = 466/557 (83.66%), Postives = 504/557 (90.48%), Query Frame = 0

Query: 254 NGGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTI 313
           +GGPGSGKGTQCMKIV+NFGFTHLSAGD+LRREIASNSADGTMIL+TIKEGKIVPSELT+
Sbjct: 3   SGGPGSGKGTQCMKIVENFGFTHLSAGDILRREIASNSADGTMILDTIKEGKIVPSELTV 62

Query: 314 RLIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNE 373
           +LIQKEMESSDNYKFLIDGFPRSE+NRIAFEQIIGAEPDIVLFFDCPEDEM+KRVLNRN+
Sbjct: 63  KLIQKEMESSDNYKFLIDGFPRSEDNRIAFEQIIGAEPDIVLFFDCPEDEMMKRVLNRNQ 122

Query: 374 GRVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFE 433
           GRVDDN+DTIKKRLKVF+ALNLPVVKYYLE+GKLYKINAVGTVDEIYKQVYP+FA FNFE
Sbjct: 123 GRVDDNVDTIKKRLKVFSALNLPVVKYYLERGKLYKINAVGTVDEIYKQVYPLFAQFNFE 182

Query: 434 GGGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSK 493
                                   + +Q+    MLISL+ SISP TSL LFCSS PKKSK
Sbjct: 183 S----------------IADHCLIIFVQQ---QMLISLRFSISPITSLRLFCSSGPKKSK 242

Query: 494 KERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVD 553
           KERRKLLQ+KLIRI+KAKE T L FPKSSSTPLLIH KPF Q+KIQALDA+L DLEAS+ 
Sbjct: 243 KERRKLLQEKLIRISKAKEATRLPFPKSSSTPLLIHHKPFSQSKIQALDAVLNDLEASLH 302

Query: 554 NGLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMEN 613
           NG+ ID EIFSSLLE CYQLRA+ HGIR+HRLIPTNFLRRNVGVSSKLLRLYASFGYME+
Sbjct: 303 NGVPIDAEIFSSLLETCYQLRALDHGIRIHRLIPTNFLRRNVGVSSKLLRLYASFGYMED 362

Query: 614 AHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKAC 673
           AHQVFDEMC+RN+SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKAC
Sbjct: 363 AHQVFDEMCQRNLSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKAC 422

Query: 674 GGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWN 733
           GGIG I++GEAVHRHVVRSGFAGD+FVLNALVDMY+KCG I+RARKVFDQIV KDTVSWN
Sbjct: 423 GGIGSIRVGEAVHRHVVRSGFAGDIFVLNALVDMYAKCGDIMRARKVFDQIVSKDTVSWN 482

Query: 734 SMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVE 793
           SMLTGYTRHGLLLEAL+ FDQMIQEGYEPDSVALST++SNISS KFKLHIHGW IRHG+E
Sbjct: 483 SMLTGYTRHGLLLEALNTFDQMIQEGYEPDSVALSTMISNISSSKFKLHIHGWAIRHGIE 540

Query: 794 WNLSIANSLIVMYANCG 811
           WNLSIANSLI MYAN G
Sbjct: 543 WNLSIANSLIAMYANSG 540

BLAST of HG10012396 vs. NCBI nr
Match: KAE8650431.1 (hypothetical protein Csa_011770 [Cucumis sativus])

HSP 1 Score: 758.4 bits (1957), Expect = 6.5e-215
Identity = 399/511 (78.08%), Postives = 412/511 (80.63%), Query Frame = 0

Query: 10  GKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKA-KSSAAKDP 69
           GKVALVTGGDSGIGRAVC+CFALEGA VAFTYVK QEDKDA DTIEMIKKA KSSA KDP
Sbjct: 58  GKVALVTGGDSGIGRAVCYCFALEGAIVAFTYVKGQEDKDAKDTIEMIKKATKSSAVKDP 117

Query: 70  LAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNIF 129
           LAIPADLGFDENCKRVVDEVVKAYG IDILINNAAEQYKS+SVEDIDE+RLLRVFRTNIF
Sbjct: 118 LAIPADLGFDENCKRVVDEVVKAYGRIDILINNAAEQYKSSSVEDIDEERLLRVFRTNIF 177

Query: 130 SYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKGI 189
           SYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYT+TKGAIV+FTRGLALQLA KGI
Sbjct: 178 SYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTSTKGAIVAFTRGLALQLANKGI 237

Query: 190 RVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYITG 249
           RVNGVAPGPIWTPLIPASFDEEETA+FGSQVPMKRAGQPIEVAPSYVFLACN DSSYITG
Sbjct: 238 RVNGVAPGPIWTPLIPASFDEEETASFGSQVPMKRAGQPIEVAPSYVFLACNADSSYITG 297

Query: 250 QVLHPN------------------------------------------------------ 309
           QVLHPN                                                      
Sbjct: 298 QVLHPNGRDRKPVRNPAADSHQLDYNFWNTPRDQKMWRRAVSVSHFTFAHKSIAHNKDVC 357

Query: 310 --------------------------------GGPGSGKGTQCMKIVDNFGFTHLSAGDL 369
                                           GGPGSGKGTQCMKIV+NFGFTHLSAGDL
Sbjct: 358 KLKFWETFTTETPMKEKGTFQRDKTPFITFVLGGPGSGKGTQCMKIVENFGFTHLSAGDL 417

Query: 370 LRREIASNSADGTMILNTIKEGKIVPSELTIRLIQKEMESSDNYKFLIDGFPRSEENRIA 429
           LRREIASNSADGTMILNTIKEGKIVPSELT+RLIQKEMESSDNYKFLIDGFPRSEENRIA
Sbjct: 418 LRREIASNSADGTMILNTIKEGKIVPSELTVRLIQKEMESSDNYKFLIDGFPRSEENRIA 477

Query: 430 FEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEGRVDDNIDTIKKRLKVFAALNLPVVKYYL 434
           FEQI+G EPD+VLFFDCPEDEMVKRVLNRN+GRVDDNI TIKKRLKVF ALNLPVVKYY+
Sbjct: 478 FEQIMGVEPDVVLFFDCPEDEMVKRVLNRNQGRVDDNIVTIKKRLKVFDALNLPVVKYYM 537

BLAST of HG10012396 vs. NCBI nr
Match: KAF3433128.1 (hypothetical protein FNV43_RR24230 [Rhamnella rubrinervis])

HSP 1 Score: 750.0 bits (1935), Expect = 2.3e-212
Identity = 384/576 (66.67%), Postives = 455/576 (78.99%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC KIV+ FG THLSAGDLLRREI SNSA G++ILNTIKEGKIVPSE+TI+
Sbjct: 51  GGPGSGKGTQCAKIVETFGLTHLSAGDLLRREITSNSAYGSLILNTIKEGKIVPSEVTIK 110

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQ+EMES ++ KFLIDGFPRSEENRIAFEQIIGAEP++VLFFDCPE+EMVKRVLNRN+G
Sbjct: 111 LIQREMESCNSSKFLIDGFPRSEENRIAFEQIIGAEPNVVLFFDCPEEEMVKRVLNRNQG 170

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDTIKKRLKVF ALN PV+ YY +KGKLYKINAVGT DEI++QV+P+FA+     
Sbjct: 171 RVDDNIDTIKKRLKVFEALNRPVINYYSQKGKLYKINAVGTEDEIFEQVHPIFAA----- 230

Query: 435 GGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKK 494
             C L   M   P++                 M+I+L       T++ + CSSK  KS+K
Sbjct: 231 --CELMNVMSSRPSQ--------------CGNMVITLPALSFYATNVAIHCSSKSNKSRK 290

Query: 495 ERRKLLQQKLIRINKAKETTDLS----FPKSSSTPLLIHPKPFFQTKIQALDALLTDLEA 554
                  QK +  NK+K  T+ +    + K S TPLLI  KP FQTK+QAL+A++ DLE 
Sbjct: 291 -------QKQMHQNKSKSKTNTTSVRPYAKPSPTPLLIRQKPTFQTKLQALEAVIKDLEK 350

Query: 555 SVDNGLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGY 614
           S++NG+ +D +IFSSLLE CY+L AIH G+R+HRLIP N LRRNVG+SSKLLRLYAS GY
Sbjct: 351 SIENGIDVDTDIFSSLLETCYRLEAIHCGMRIHRLIPANLLRRNVGLSSKLLRLYASCGY 410

Query: 615 MENAHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVL 674
           ++ AH+VFD+M  RN SAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD FTFPRVL
Sbjct: 411 VDKAHEVFDQMSNRNASAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDRFTFPRVL 470

Query: 675 KACGGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTV 734
           KACGG+G+I IGEAVHR++VR G+  D FVLNALVDMY+KCG IV+ARKVF QI  +D+V
Sbjct: 471 KACGGVGVIHIGEAVHRNIVRLGYYDDGFVLNALVDMYAKCGDIVKARKVFHQISSRDSV 530

Query: 735 SWNSMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRH 794
           SWNSMLTGY RHGL +EALDIF QM+Q+GY PDSVALSTILS++SSLK  + IHGW IRH
Sbjct: 531 SWNSMLTGYIRHGLSVEALDIFCQMLQQGYRPDSVALSTILSDVSSLKLGVQIHGWAIRH 590

Query: 795 GVEWNLSIANSLIVMYANCG-----LWQCREIGRRE 822
           G+EWNLSIANSL+ MY++ G      W  +E+  R+
Sbjct: 591 GIEWNLSIANSLVDMYSSHGKVVRARWLFKEMPERD 598

BLAST of HG10012396 vs. NCBI nr
Match: EOY22925.1 (Tetratricopeptide repeat-like superfamily protein [Theobroma cacao])

HSP 1 Score: 750.0 bits (1935), Expect = 2.3e-212
Identity = 383/556 (68.88%), Postives = 445/556 (80.04%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC+KIV+ FGFTHLSAGDLLR+EI SNSADG MILNTIKEG+IVPSE+T++
Sbjct: 57  GGPGSGKGTQCIKIVETFGFTHLSAGDLLRQEITSNSADGAMILNTIKEGRIVPSEVTVK 116

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKEMES+DN+KFLIDGFPRSEENRIAFE+IIGAEP+IVLFFDCPE+EMVKRVLNRN+G
Sbjct: 117 LIQKEMESNDNHKFLIDGFPRSEENRIAFERIIGAEPNIVLFFDCPEEEMVKRVLNRNQG 176

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDT++KRLKVF ALNLPV+ YY ++GKLY INAVGTVDEI++QV PVF +     
Sbjct: 177 RVDDNIDTVRKRLKVFEALNLPVINYYSQRGKLYTINAVGTVDEIFEQVLPVFTASEL-- 236

Query: 435 GGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKK 494
                           F  +I P L       M+  LQ       S  L CSSK KKS  
Sbjct: 237 ---------------TFKISIPPFLSALLSLTMVALLQPPSFHFVSWTLSCSSKSKKS-- 296

Query: 495 ERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVDN 554
           E++K L++K I  +K   +T L F KSS TPLLI+ KPF QTK+QALDA++ DLEASV N
Sbjct: 297 EKQKQLKRKQIHQSK---STALPFRKSSPTPLLINHKPFTQTKLQALDAVVKDLEASVKN 356

Query: 555 GLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMENA 614
           G+ I  EIFSSLLE CYQL++I  GI++H L+P   LR+N G+SSKLLRLYAS G++E+A
Sbjct: 357 GMNITSEIFSSLLETCYQLKSIDQGIKIHNLVPKTLLRKNTGISSKLLRLYASCGHIESA 416

Query: 615 HQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACG 674
           HQVFDEM KRN SAF WNSLISGYAELG YEDALA+YFQMEEEGVEPD +TFPR LKAC 
Sbjct: 417 HQVFDEMSKRNESAFPWNSLISGYAELGQYEDALAIYFQMEEEGVEPDRYTFPRALKACA 476

Query: 675 GIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWNS 734
           GIGLIQIGEAVHR VVR GF  D FVLNAL+DMY+KCG IV+AR+VFD I CKDTVSWNS
Sbjct: 477 GIGLIQIGEAVHRDVVRKGFGNDGFVLNALIDMYAKCGDIVKARRVFDNIACKDTVSWNS 536

Query: 735 MLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVEW 794
           MLTGY RHGLL+EAL++F  MI+EGYEPD VA+STILS + SLK  L IHGW++R G EW
Sbjct: 537 MLTGYIRHGLLVEALEVFRGMIREGYEPDPVAMSTILSGVWSLKIALQIHGWILRRGNEW 590

Query: 795 NLSIANSLIVMYANCG 811
           NLS+ N+LIV+Y+N G
Sbjct: 597 NLSVVNALIVVYSNHG 590

BLAST of HG10012396 vs. NCBI nr
Match: XP_021285603.1 (pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Herrania umbratica])

HSP 1 Score: 748.4 bits (1931), Expect = 6.7e-212
Identity = 382/572 (66.78%), Postives = 455/572 (79.55%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC+KIV+ FGFTHLSAGDLLRREIASNSADG MILNTIKEGKIVPSE+T++
Sbjct: 57  GGPGSGKGTQCIKIVETFGFTHLSAGDLLRREIASNSADGAMILNTIKEGKIVPSEVTVK 116

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKEMES+DN+KFLIDGFPRSEENRIAFE+IIGAEP+IVLFFDCPE+EMVKRVLNRN+G
Sbjct: 117 LIQKEMESNDNHKFLIDGFPRSEENRIAFERIIGAEPNIVLFFDCPEEEMVKRVLNRNQG 176

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDT++KRLKVF ALNLPV+ YY ++GKLY INAVGTV+EI++QV PVF +     
Sbjct: 177 RVDDNIDTVRKRLKVFEALNLPVINYYSQRGKLYTINAVGTVNEIFEQVLPVFTASE--- 236

Query: 435 GGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKK 494
               L  ++   P         P L    +  +L    L +    SL L CSS  KKS+K
Sbjct: 237 ----LTFKISSPP-------FLPALPSTTMVALLRPPSLHL---VSLTLRCSSTSKKSEK 296

Query: 495 ERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVDN 554
           ++    Q KL +I+++  T  L F KSS TPLLI+ KPF QTK+QALDA++ DLEA+V N
Sbjct: 297 QK----QLKLKQIHQSNSTA-LPFRKSSPTPLLINHKPFTQTKLQALDAVVKDLEATVKN 356

Query: 555 GLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMENA 614
           G+ I  EIFSSLLE CYQL++I HGI++H L+P   LR+N G+SSKL+RLYAS G++E+A
Sbjct: 357 GMNITSEIFSSLLETCYQLKSIDHGIKIHNLVPKTLLRKNTGISSKLVRLYASCGHIESA 416

Query: 615 HQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACG 674
           HQVFDEM KRN SAF WNSLISGYAELG YEDALALYFQMEEEGVEPD +TFPR LKAC 
Sbjct: 417 HQVFDEMSKRNESAFPWNSLISGYAELGQYEDALALYFQMEEEGVEPDRYTFPRALKACA 476

Query: 675 GIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWNS 734
           G+GLIQIGEAVHR +VR GF  D FVLNALVDMY+KCG +V+AR+VFD I CKDTVSWNS
Sbjct: 477 GLGLIQIGEAVHRDLVRKGFGNDGFVLNALVDMYAKCGDVVKARRVFDNIACKDTVSWNS 536

Query: 735 MLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVEW 794
           MLTGY RHGLL+EA ++F  MI+EGYEPD VA+STILS + SLK  L IHGW++R G+EW
Sbjct: 537 MLTGYIRHGLLVEASEVFRGMIREGYEPDPVAISTILSGVWSLKIVLQIHGWILRRGIEW 596

Query: 795 NLSIANSLIVMYANCG-----LWQCREIGRRE 822
           NLS+ N+L+V+Y+N G      W  R++  R+
Sbjct: 597 NLSVVNALVVVYSNHGKLDRASWLFRQMPERD 606

BLAST of HG10012396 vs. ExPASy Swiss-Prot
Match: Q9FZ42 (NADPH-dependent aldehyde reductase 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ChlADR1 PE=1 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 7.2e-116
Identity = 205/249 (82.33%), Postives = 230/249 (92.37%), Query Frame = 0

Query: 8   VQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKSSAAKD 67
           ++GKVAL+TGGDSGIGRAV +CFA EGATVAFTYVK QE+KDA +T++M+K+ K+S +K+
Sbjct: 35  LRGKVALITGGDSGIGRAVGYCFASEGATVAFTYVKGQEEKDAQETLQMLKEVKTSDSKE 94

Query: 68  PLAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNI 127
           P+AIP DLGFDENCKRVVDEVV A+G ID+LINNAAEQY+S+++E+IDE RL RVFRTNI
Sbjct: 95  PIAIPTDLGFDENCKRVVDEVVNAFGRIDVLINNAAEQYESSTIEEIDEPRLERVFRTNI 154

Query: 128 FSYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKG 187
           FSYFF TRHALKHMKEGSSIINTTSVNAYKGNA LLDYTATKGAIV+FTRGLALQLA KG
Sbjct: 155 FSYFFLTRHALKHMKEGSSIINTTSVNAYKGNASLLDYTATKGAIVAFTRGLALQLAEKG 214

Query: 188 IRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYIT 247
           IRVNGVAPGPIWTPLIPASF+EE+  NFGS+VPMKRAGQPIEVAPSYVFLACN  SSY T
Sbjct: 215 IRVNGVAPGPIWTPLIPASFNEEKIKNFGSEVPMKRAGQPIEVAPSYVFLACNHCSSYFT 274

Query: 248 GQVLHPNGG 257
           GQVLHPNGG
Sbjct: 275 GQVLHPNGG 283

BLAST of HG10012396 vs. ExPASy Swiss-Prot
Match: Q5KTS5 (Glucose and ribitol dehydrogenase OS=Daucus carota OX=4039 GN=CAISE5 PE=2 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 8.2e-112
Identity = 196/249 (78.71%), Postives = 230/249 (92.37%), Query Frame = 0

Query: 8   VQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKSSAAKD 67
           +QGKVALVTGGDSGIGR+VC+ FALEGATVAFT+VK  EDKDAN+T+E+++KAKSS AKD
Sbjct: 39  LQGKVALVTGGDSGIGRSVCYHFALEGATVAFTFVKGHEDKDANETLELLRKAKSSDAKD 98

Query: 68  PLAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNI 127
           P+AI ADLGFD+NCK+VVD+VV A+G ID+L+NNAAEQYK+++VEDIDE+RL RVFRTNI
Sbjct: 99  PIAIAADLGFDDNCKKVVDQVVNAFGSIDVLVNNAAEQYKASTVEDIDEERLERVFRTNI 158

Query: 128 FSYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKG 187
           F+YFF  RHALKHM+EGS+IINTTS+NAYKGNAKLLDYTATKGAIV+FTRGL+LQL +KG
Sbjct: 159 FAYFFMARHALKHMREGSTIINTTSINAYKGNAKLLDYTATKGAIVAFTRGLSLQLISKG 218

Query: 188 IRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYIT 247
           IRVNGVAPGP+WTPLIP+SFDEEE   FGS+VPMKRAGQP E+A +YVFLA + DSSY +
Sbjct: 219 IRVNGVAPGPVWTPLIPSSFDEEEVKQFGSEVPMKRAGQPYEIATAYVFLA-SCDSSYYS 278

Query: 248 GQVLHPNGG 257
           GQVLHPNGG
Sbjct: 279 GQVLHPNGG 286

BLAST of HG10012396 vs. ExPASy Swiss-Prot
Match: Q9SB36 (Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E53 PE=3 SV=1)

HSP 1 Score: 402.9 bits (1034), Expect = 9.1e-111
Identity = 203/335 (60.60%), Postives = 249/335 (74.33%), Query Frame = 0

Query: 477 PTTSLLLFCSSKPKKSKKERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQT 536
           P+ S     SS  KK  +  ++L Q +  + N     T LSF K S TPLLI  +   +T
Sbjct: 9   PSFSYPSVSSSSMKKKPRHHQQLKQHRQNQYNN-NGFTSLSFTKPSPTPLLIEKQSIHRT 68

Query: 537 KIQALDALLTDLEASVDNGL-LIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNV 596
           +++ALD+++TDLE S   G+ L +PEIF+SLLE CY LRAI HG+R+H LIP   LR N+
Sbjct: 69  QLEALDSVITDLETSAQKGISLTEPEIFASLLETCYSLRAIDHGVRVHHLIPPYLLRNNL 128

Query: 597 GVSSKLLRLYASFGYMENAHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQME 656
           G+SSKL+RLYAS GY E AH+VFD M KR+ S FAWNSLISGYAELG YEDA+ALYFQM 
Sbjct: 129 GISSKLVRLYASCGYAEVAHEVFDRMSKRDSSPFAWNSLISGYAELGQYEDAMALYFQMA 188

Query: 657 EEGVEPDHFTFPRVLKACGGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIV 716
           E+GV+PD FTFPRVLKACGGIG +QIGEA+HR +V+ GF  DV+VLNALV MY+KCG IV
Sbjct: 189 EDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKEGFGYDVYVLNALVVMYAKCGDIV 248

Query: 717 RARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNIS 776
           +AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M+Q G EPD VA+S++L+ + 
Sbjct: 249 KARNVFDMIPHKDYVSWNSMLTGYLHHGLLHEALDIFRLMVQNGIEPDKVAISSVLARVL 308

Query: 777 SLKFKLHIHGWVIRHGVEWNLSIANSLIVMYANCG 811
           S K    +HGWVIR G+EW LS+AN+LIV+Y+  G
Sbjct: 309 SFKHGRQLHGWVIRRGMEWELSVANALIVLYSKRG 342

BLAST of HG10012396 vs. ExPASy Swiss-Prot
Match: Q9MA93 (Glucose and ribitol dehydrogenase homolog 2 OS=Arabidopsis thaliana OX=3702 GN=At3g05260 PE=2 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 7.0e-103
Identity = 184/249 (73.90%), Postives = 214/249 (85.94%), Query Frame = 0

Query: 8   VQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKSSAAKD 67
           + GKVALVTGGDSGIG+AVCHC+ALEGA+VAFTYVK +EDKDA +T+ ++ + K+  AK+
Sbjct: 37  LHGKVALVTGGDSGIGKAVCHCYALEGASVAFTYVKGREDKDAEETLRLLHEVKTREAKE 96

Query: 68  PLAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNI 127
           P+ I  DLGF+ENCKRVV+EVV ++G ID+L+N AAEQ++  S+EDIDE RL RVFRTNI
Sbjct: 97  PIMIATDLGFEENCKRVVEEVVNSFGRIDVLVNCAAEQHE-VSIEDIDEARLERVFRTNI 156

Query: 128 FSYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKG 187
           FS FF  ++ALKHMKEGSSIINTTSV AY GN+ LL+YTATKGAIVSFTRGLALQLA KG
Sbjct: 157 FSQFFLVKYALKHMKEGSSIINTTSVVAYAGNSSLLEYTATKGAIVSFTRGLALQLAPKG 216

Query: 188 IRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYIT 247
           IRVNGVAPGP+WTPLIPASF EE    FGS+ PMKRA QP+EVAPSYVFLACN  SSY T
Sbjct: 217 IRVNGVAPGPVWTPLIPASFSEEAIKQFGSETPMKRAAQPVEVAPSYVFLACNHCSSYYT 276

Query: 248 GQVLHPNGG 257
           GQ+LHPNGG
Sbjct: 277 GQILHPNGG 284

BLAST of HG10012396 vs. ExPASy Swiss-Prot
Match: Q75KH3 (Glucose and ribitol dehydrogenase homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0140800 PE=2 SV=2)

HSP 1 Score: 350.9 bits (899), Expect = 4.1e-95
Identity = 180/259 (69.50%), Postives = 215/259 (83.01%), Query Frame = 0

Query: 8   VQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKS-SAAK 67
           ++ KVA+VTGGDSGIGRAVC CFALEGATVAFTYVK QE+KDA +T+  ++  ++ + AK
Sbjct: 38  LKDKVAIVTGGDSGIGRAVCLCFALEGATVAFTYVKGQEEKDAEETLRALRDIRARTGAK 97

Query: 68  DPLAIPADLGFDENCKRVVDEVVKAY-GHIDILINNAAEQYKSTSVEDIDEDRLLRVFRT 127
           DP+AIPADLG+D+NC++VVDEV  AY G IDIL+NNAAEQY+  S+ DI ED L RVFRT
Sbjct: 98  DPMAIPADLGYDDNCRKVVDEVAGAYGGAIDILVNNAAEQYERPSITDITEDDLERVFRT 157

Query: 128 NIFSYFFTTRHALKHMKE--------GSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTR 187
           NIFSYFF ++HA+K M++        G SIINT+S+NAYKGN  LLDYTATKGAIV+FTR
Sbjct: 158 NIFSYFFMSKHAVKRMRDRRGGAGAGGCSIINTSSINAYKGNKTLLDYTATKGAIVAFTR 217

Query: 188 GLALQLATKGIRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFL 247
            LALQLA +GIRVNGVAPGPIWTPLIPASF EE+   FGSQVPM RAGQP EVAPS+VFL
Sbjct: 218 ALALQLAEEGIRVNGVAPGPIWTPLIPASFAEEKVRQFGSQVPMGRAGQPSEVAPSFVFL 277

Query: 248 ACNVDSSYITGQVLHPNGG 257
           A + D+SY++GQ+LH NGG
Sbjct: 278 ASD-DASYMSGQMLHVNGG 295

BLAST of HG10012396 vs. ExPASy TrEMBL
Match: A0A061FZF0 (UMP-CMP kinase OS=Theobroma cacao OX=3641 GN=TCM_014953 PE=3 SV=1)

HSP 1 Score: 750.0 bits (1935), Expect = 1.1e-212
Identity = 383/556 (68.88%), Postives = 445/556 (80.04%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC+KIV+ FGFTHLSAGDLLR+EI SNSADG MILNTIKEG+IVPSE+T++
Sbjct: 57  GGPGSGKGTQCIKIVETFGFTHLSAGDLLRQEITSNSADGAMILNTIKEGRIVPSEVTVK 116

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKEMES+DN+KFLIDGFPRSEENRIAFE+IIGAEP+IVLFFDCPE+EMVKRVLNRN+G
Sbjct: 117 LIQKEMESNDNHKFLIDGFPRSEENRIAFERIIGAEPNIVLFFDCPEEEMVKRVLNRNQG 176

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDT++KRLKVF ALNLPV+ YY ++GKLY INAVGTVDEI++QV PVF +     
Sbjct: 177 RVDDNIDTVRKRLKVFEALNLPVINYYSQRGKLYTINAVGTVDEIFEQVLPVFTASEL-- 236

Query: 435 GGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKK 494
                           F  +I P L       M+  LQ       S  L CSSK KKS  
Sbjct: 237 ---------------TFKISIPPFLSALLSLTMVALLQPPSFHFVSWTLSCSSKSKKS-- 296

Query: 495 ERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVDN 554
           E++K L++K I  +K   +T L F KSS TPLLI+ KPF QTK+QALDA++ DLEASV N
Sbjct: 297 EKQKQLKRKQIHQSK---STALPFRKSSPTPLLINHKPFTQTKLQALDAVVKDLEASVKN 356

Query: 555 GLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMENA 614
           G+ I  EIFSSLLE CYQL++I  GI++H L+P   LR+N G+SSKLLRLYAS G++E+A
Sbjct: 357 GMNITSEIFSSLLETCYQLKSIDQGIKIHNLVPKTLLRKNTGISSKLLRLYASCGHIESA 416

Query: 615 HQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACG 674
           HQVFDEM KRN SAF WNSLISGYAELG YEDALA+YFQMEEEGVEPD +TFPR LKAC 
Sbjct: 417 HQVFDEMSKRNESAFPWNSLISGYAELGQYEDALAIYFQMEEEGVEPDRYTFPRALKACA 476

Query: 675 GIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWNS 734
           GIGLIQIGEAVHR VVR GF  D FVLNAL+DMY+KCG IV+AR+VFD I CKDTVSWNS
Sbjct: 477 GIGLIQIGEAVHRDVVRKGFGNDGFVLNALIDMYAKCGDIVKARRVFDNIACKDTVSWNS 536

Query: 735 MLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVEW 794
           MLTGY RHGLL+EAL++F  MI+EGYEPD VA+STILS + SLK  L IHGW++R G EW
Sbjct: 537 MLTGYIRHGLLVEALEVFRGMIREGYEPDPVAMSTILSGVWSLKIALQIHGWILRRGNEW 590

Query: 795 NLSIANSLIVMYANCG 811
           NLS+ N+LIV+Y+N G
Sbjct: 597 NLSVVNALIVVYSNHG 590

BLAST of HG10012396 vs. ExPASy TrEMBL
Match: A0A6J1AF96 (UMP-CMP kinase OS=Herrania umbratica OX=108875 GN=LOC110417541 PE=3 SV=1)

HSP 1 Score: 748.4 bits (1931), Expect = 3.3e-212
Identity = 382/572 (66.78%), Postives = 455/572 (79.55%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC+KIV+ FGFTHLSAGDLLRREIASNSADG MILNTIKEGKIVPSE+T++
Sbjct: 57  GGPGSGKGTQCIKIVETFGFTHLSAGDLLRREIASNSADGAMILNTIKEGKIVPSEVTVK 116

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKEMES+DN+KFLIDGFPRSEENRIAFE+IIGAEP+IVLFFDCPE+EMVKRVLNRN+G
Sbjct: 117 LIQKEMESNDNHKFLIDGFPRSEENRIAFERIIGAEPNIVLFFDCPEEEMVKRVLNRNQG 176

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDT++KRLKVF ALNLPV+ YY ++GKLY INAVGTV+EI++QV PVF +     
Sbjct: 177 RVDDNIDTVRKRLKVFEALNLPVINYYSQRGKLYTINAVGTVNEIFEQVLPVFTASE--- 236

Query: 435 GGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKK 494
               L  ++   P         P L    +  +L    L +    SL L CSS  KKS+K
Sbjct: 237 ----LTFKISSPP-------FLPALPSTTMVALLRPPSLHL---VSLTLRCSSTSKKSEK 296

Query: 495 ERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVDN 554
           ++    Q KL +I+++  T  L F KSS TPLLI+ KPF QTK+QALDA++ DLEA+V N
Sbjct: 297 QK----QLKLKQIHQSNSTA-LPFRKSSPTPLLINHKPFTQTKLQALDAVVKDLEATVKN 356

Query: 555 GLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMENA 614
           G+ I  EIFSSLLE CYQL++I HGI++H L+P   LR+N G+SSKL+RLYAS G++E+A
Sbjct: 357 GMNITSEIFSSLLETCYQLKSIDHGIKIHNLVPKTLLRKNTGISSKLVRLYASCGHIESA 416

Query: 615 HQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACG 674
           HQVFDEM KRN SAF WNSLISGYAELG YEDALALYFQMEEEGVEPD +TFPR LKAC 
Sbjct: 417 HQVFDEMSKRNESAFPWNSLISGYAELGQYEDALALYFQMEEEGVEPDRYTFPRALKACA 476

Query: 675 GIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWNS 734
           G+GLIQIGEAVHR +VR GF  D FVLNALVDMY+KCG +V+AR+VFD I CKDTVSWNS
Sbjct: 477 GLGLIQIGEAVHRDLVRKGFGNDGFVLNALVDMYAKCGDVVKARRVFDNIACKDTVSWNS 536

Query: 735 MLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVEW 794
           MLTGY RHGLL+EA ++F  MI+EGYEPD VA+STILS + SLK  L IHGW++R G+EW
Sbjct: 537 MLTGYIRHGLLVEASEVFRGMIREGYEPDPVAISTILSGVWSLKIVLQIHGWILRRGIEW 596

Query: 795 NLSIANSLIVMYANCG-----LWQCREIGRRE 822
           NLS+ N+L+V+Y+N G      W  R++  R+
Sbjct: 597 NLSVVNALVVVYSNHGKLDRASWLFRQMPERD 606

BLAST of HG10012396 vs. ExPASy TrEMBL
Match: A0A1R3JTM4 (Adenylate kinase OS=Corchorus olitorius OX=93759 GN=COLO4_14147 PE=3 SV=1)

HSP 1 Score: 728.8 bits (1880), Expect = 2.7e-206
Identity = 380/621 (61.19%), Postives = 455/621 (73.27%), Query Frame = 0

Query: 203 IPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYITGQVLHPN-------- 262
           + +S      ++F  QV +     PI +  +      N+  S  TG     N        
Sbjct: 8   LSSSLISSSNSSFLGQVLVSSFSAPIFIKKNQTAYRFNIWESLTTGISQQANGAVGSKER 67

Query: 263 --------GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKI 322
                   GGPGSGKGTQC+KIV+ FGF HLSAGDLLRREIA N+ADG MIL+TIKEGKI
Sbjct: 68  TPFITFVLGGPGSGKGTQCIKIVETFGFKHLSAGDLLRREIACNTADGAMILDTIKEGKI 127

Query: 323 VPSELTIRLIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVK 382
           VPSE+T++LIQKE+ESSDN+K LIDGFPRSEENRIAFE+IIGAEP+IVLFFDCPE+EMVK
Sbjct: 128 VPSEVTVKLIQKEIESSDNHKILIDGFPRSEENRIAFEKIIGAEPNIVLFFDCPEEEMVK 187

Query: 383 RVLNRNEGRVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPV 442
           RVL+RN+GRVDDNIDTI+KRLKVF ALNLPV+ YY ++GKLY INAVGTVDEI++QV PV
Sbjct: 188 RVLSRNQGRVDDNIDTIRKRLKVFEALNLPVINYYSQRGKLYTINAVGTVDEIFEQVRPV 247

Query: 443 FASFNFEGGGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCS 502
           F SF                                   P +++L      +T+L   CS
Sbjct: 248 FNSFESN--------------------------------PKMVALLSPSFHSTTLTFHCS 307

Query: 503 SKPKKSKKERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLT 562
           SK KK+K  RR+ L +K  ++ K+K      FP+SS TPLLI+ KPF QT++QALDA++ 
Sbjct: 308 SKGKKNK--RREQLNRK--QLQKSKRIA-FPFPESSPTPLLINYKPFTQTRLQALDAVVQ 367

Query: 563 DLEASVDNGLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYA 622
           DLEASV+ G+ ID EIF+SLLE CYQL +I HGI++H LIP   LRRN G+SSKLLRLYA
Sbjct: 368 DLEASVEKGIKIDTEIFASLLETCYQLNSIDHGIKVHSLIPKTLLRRNTGISSKLLRLYA 427

Query: 623 SFGYMENAHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF 682
           S G++E+AHQVFDEM KRN SAF WNSLISGYAELG YEDALALYFQMEEEGVEPD FTF
Sbjct: 428 SCGHIESAHQVFDEMYKRNESAFPWNSLISGYAELGQYEDALALYFQMEEEGVEPDRFTF 487

Query: 683 PRVLKACGGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVC 742
           PR LKAC GIG+IQIGEAVHR VVR GF  D FVLNAL DMY+KCG IV+AR+VFD I+ 
Sbjct: 488 PRALKACAGIGMIQIGEAVHRDVVRKGFGNDGFVLNALCDMYAKCGDIVKARRVFDSIIY 547

Query: 743 KDTVSWNSMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGW 802
           KD VSWNSMLT Y RHGLL EAL++F  MI+EG++PD VA+ST+LS  SSLK    IHGW
Sbjct: 548 KDMVSWNSMLTSYIRHGLLFEALEVFRGMIEEGFDPDPVAISTVLSGFSSLKIAAQIHGW 591

Query: 803 VIRHGVEWNLSIANSLIVMYA 808
           V+R G+EWNLS+ N+L+++Y+
Sbjct: 608 VLRRGIEWNLSVVNALVLVYS 591

BLAST of HG10012396 vs. ExPASy TrEMBL
Match: A0A1R3ITD5 (Adenylate kinase OS=Corchorus capsularis OX=210143 GN=CCACVL1_09972 PE=3 SV=1)

HSP 1 Score: 722.6 bits (1864), Expect = 1.9e-204
Identity = 368/553 (66.55%), Postives = 434/553 (78.48%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC+KIV+ FGF HLSAGDLLRREIA N+ADG MIL+TIKEGKIVPSE+T++
Sbjct: 42  GGPGSGKGTQCIKIVETFGFKHLSAGDLLRREIACNTADGAMILDTIKEGKIVPSEVTVK 101

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKE+ESSDN+K LIDGFPRSEENRIAFE+I+GAEP+IVLFFDCPE+EMVKRVL+RN+G
Sbjct: 102 LIQKEIESSDNHKILIDGFPRSEENRIAFEKIVGAEPNIVLFFDCPEEEMVKRVLSRNQG 161

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDTI+KRLKVF ALNLPV+ YY ++GKLY INAVGTVDEI++QV PVF SF  + 
Sbjct: 162 RVDDNIDTIRKRLKVFEALNLPVINYYSQRGKLYTINAVGTVDEIFEQVRPVFNSFESD- 221

Query: 435 GGCGLRERMKQTPNERFGTAIFPLLLQRFVCPMLISLQLSISPTTSLLLFCSSKPKKSKK 494
                                          P +++L      +T L   CSSK KK+KK
Sbjct: 222 -------------------------------PKMVALLSPSFHSTKLTFHCSSKSKKNKK 281

Query: 495 ERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALLTDLEASVDN 554
             R+ L +K  ++ K+K      +P+SS TPLLI+ KPF QT++QALDA++ DLEASV  
Sbjct: 282 --REQLNRK--QLQKSKRIA-FPYPESSPTPLLINHKPFTQTRLQALDAVVQDLEASVKK 341

Query: 555 GLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLYASFGYMENA 614
           G+ ID EIF+SLLE CYQL +I HGI++H LIP   LRRN G+SSKLLRLYAS G++E+A
Sbjct: 342 GINIDTEIFASLLETCYQLNSIDHGIKVHSLIPKTMLRRNTGISSKLLRLYASSGHIESA 401

Query: 615 HQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACG 674
           HQVFDEM KRN SAF WNSLISGYAELG YEDALALYFQMEEEGV PD FTFPR LKAC 
Sbjct: 402 HQVFDEMYKRNESAFPWNSLISGYAELGQYEDALALYFQMEEEGVGPDRFTFPRALKACA 461

Query: 675 GIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIVCKDTVSWNS 734
           GIG+IQIGEAVHR VVR GF  D FVLNAL DMY+KCG IV+AR+VFD IV KD VSWNS
Sbjct: 462 GIGMIQIGEAVHRDVVRKGFGNDGFVLNALCDMYAKCGDIVKARRVFDSIVYKDMVSWNS 521

Query: 735 MLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHGWVIRHGVEW 794
           MLT Y RHGLL EAL++F  MIQEG++PD +A+ST+LS  SSLK    IHGWV+R G+EW
Sbjct: 522 MLTSYIRHGLLFEALEVFRGMIQEGFDPDPIAMSTVLSGFSSLKIAAQIHGWVLRRGIEW 557

Query: 795 NLSIANSLIVMYA 808
           NLS+ N+LI++Y+
Sbjct: 582 NLSVVNALILVYS 557

BLAST of HG10012396 vs. ExPASy TrEMBL
Match: A0A2H5NMG4 (UMP-CMP kinase OS=Citrus unshiu OX=55188 GN=CUMW_058940 PE=3 SV=1)

HSP 1 Score: 700.3 bits (1806), Expect = 1.0e-197
Identity = 363/565 (64.25%), Postives = 435/565 (76.99%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC KIV N+G THLSAG+LLRREIASNS  GT ILNTIKEGKIVPSE+T+ 
Sbjct: 55  GGPGSGKGTQCAKIVKNYGLTHLSAGELLRREIASNSEYGTTILNTIKEGKIVPSEVTVS 114

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKEMESSD+ KFLIDGFPRSEENR AFE+I+GAEPDIVLFFDCPE+EMV RVLNRNEG
Sbjct: 115 LIQKEMESSDSKKFLIDGFPRSEENRAAFERIMGAEPDIVLFFDCPEEEMVNRVLNRNEG 174

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           RVDDNIDT++KRL+VF ALNLPV+ YY  +GKLY INAVGTVDEI++QV  VFA+     
Sbjct: 175 RVDDNIDTVRKRLQVFKALNLPVINYYARRGKLYTINAVGTVDEIFEQVRAVFAALKPSL 234

Query: 435 G--GCGLRERMKQTPN-ERFGTAIFPLLLQRFVCP------MLISLQLSISPTTSLLLFC 494
           G    G+   M   P     G  I   L + FV         + +  LS   T+ +++ C
Sbjct: 235 GLDIPGMESEMLWRPAFTSPGMTISLSLFKHFVSESKLVHCCISTTVLSSFHTSLVIIHC 294

Query: 495 SSKPKKSKKERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQTKIQALDALL 554
            SK K+S+K+RR+  QQ    I++ + TT  S+PKSS TPLL + K F +TK+QALD+++
Sbjct: 295 GSKNKRSRKQRRQKQQQ----ISRNRITTFSSYPKSSPTPLLTNQKAFPKTKLQALDSII 354

Query: 555 TDLEASVDNGLLIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNVGVSSKLLRLY 614
            DLE+SV NG+ +  E F+SLLE CYQL+A+ HGI+LHRLIPTN LR+N G+SSKLLRLY
Sbjct: 355 QDLESSVQNGITVQTETFASLLETCYQLKAVEHGIKLHRLIPTNLLRKNKGISSKLLRLY 414

Query: 615 ASFGYMENAHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFT 674
           A+FG ++ AHQVFD+M  R   AF WNSLISGYAELG YEDA+ALYFQMEEEGVEPD FT
Sbjct: 415 ATFGLIDEAHQVFDQMSNRTAFAFPWNSLISGYAELGEYEDAIALYFQMEEEGVEPDQFT 474

Query: 675 FPRVLKACGGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIVRARKVFDQIV 734
           FPRVLKAC G+GLI++GE VH   VR GF  D FVLNALVDMY+KCG IV+AR VFD+I 
Sbjct: 475 FPRVLKACAGLGLIRVGEKVHLDAVRFGFGFDGFVLNALVDMYAKCGDIVKARTVFDRIG 534

Query: 735 CKDTVSWNSMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNISSLKFKLHIHG 794
            KD +S+NSMLTGY  HGLL+EA DIF  MI  G++PD VA+S+IL+N S L+    +HG
Sbjct: 535 NKDLISYNSMLTGYIHHGLLVEAFDIFRGMILNGFDPDPVAISSILANASLLRIGAQVHG 594

Query: 795 WVIRHGVEWNLSIANSLIVMYANCG 811
           WV+R GVEW+L IANSLIV+Y+  G
Sbjct: 595 WVLRRGVEWDLCIANSLIVVYSKDG 615

BLAST of HG10012396 vs. TAIR 10
Match: AT1G54870.1 (NAD(P)-binding Rossmann-fold superfamily protein )

HSP 1 Score: 419.9 bits (1078), Expect = 5.1e-117
Identity = 205/249 (82.33%), Postives = 230/249 (92.37%), Query Frame = 0

Query: 8   VQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKSSAAKD 67
           ++GKVAL+TGGDSGIGRAV +CFA EGATVAFTYVK QE+KDA +T++M+K+ K+S +K+
Sbjct: 82  LRGKVALITGGDSGIGRAVGYCFASEGATVAFTYVKGQEEKDAQETLQMLKEVKTSDSKE 141

Query: 68  PLAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNI 127
           P+AIP DLGFDENCKRVVDEVV A+G ID+LINNAAEQY+S+++E+IDE RL RVFRTNI
Sbjct: 142 PIAIPTDLGFDENCKRVVDEVVNAFGRIDVLINNAAEQYESSTIEEIDEPRLERVFRTNI 201

Query: 128 FSYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKG 187
           FSYFF TRHALKHMKEGSSIINTTSVNAYKGNA LLDYTATKGAIV+FTRGLALQLA KG
Sbjct: 202 FSYFFLTRHALKHMKEGSSIINTTSVNAYKGNASLLDYTATKGAIVAFTRGLALQLAEKG 261

Query: 188 IRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYIT 247
           IRVNGVAPGPIWTPLIPASF+EE+  NFGS+VPMKRAGQPIEVAPSYVFLACN  SSY T
Sbjct: 262 IRVNGVAPGPIWTPLIPASFNEEKIKNFGSEVPMKRAGQPIEVAPSYVFLACNHCSSYFT 321

Query: 248 GQVLHPNGG 257
           GQVLHPNGG
Sbjct: 322 GQVLHPNGG 330

BLAST of HG10012396 vs. TAIR 10
Match: AT4G25270.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 402.9 bits (1034), Expect = 6.5e-112
Identity = 203/335 (60.60%), Postives = 249/335 (74.33%), Query Frame = 0

Query: 477 PTTSLLLFCSSKPKKSKKERRKLLQQKLIRINKAKETTDLSFPKSSSTPLLIHPKPFFQT 536
           P+ S     SS  KK  +  ++L Q +  + N     T LSF K S TPLLI  +   +T
Sbjct: 9   PSFSYPSVSSSSMKKKPRHHQQLKQHRQNQYNN-NGFTSLSFTKPSPTPLLIEKQSIHRT 68

Query: 537 KIQALDALLTDLEASVDNGL-LIDPEIFSSLLEVCYQLRAIHHGIRLHRLIPTNFLRRNV 596
           +++ALD+++TDLE S   G+ L +PEIF+SLLE CY LRAI HG+R+H LIP   LR N+
Sbjct: 69  QLEALDSVITDLETSAQKGISLTEPEIFASLLETCYSLRAIDHGVRVHHLIPPYLLRNNL 128

Query: 597 GVSSKLLRLYASFGYMENAHQVFDEMCKRNISAFAWNSLISGYAELGLYEDALALYFQME 656
           G+SSKL+RLYAS GY E AH+VFD M KR+ S FAWNSLISGYAELG YEDA+ALYFQM 
Sbjct: 129 GISSKLVRLYASCGYAEVAHEVFDRMSKRDSSPFAWNSLISGYAELGQYEDAMALYFQMA 188

Query: 657 EEGVEPDHFTFPRVLKACGGIGLIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGSIV 716
           E+GV+PD FTFPRVLKACGGIG +QIGEA+HR +V+ GF  DV+VLNALV MY+KCG IV
Sbjct: 189 EDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKEGFGYDVYVLNALVVMYAKCGDIV 248

Query: 717 RARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIQEGYEPDSVALSTILSNIS 776
           +AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M+Q G EPD VA+S++L+ + 
Sbjct: 249 KARNVFDMIPHKDYVSWNSMLTGYLHHGLLHEALDIFRLMVQNGIEPDKVAISSVLARVL 308

Query: 777 SLKFKLHIHGWVIRHGVEWNLSIANSLIVMYANCG 811
           S K    +HGWVIR G+EW LS+AN+LIV+Y+  G
Sbjct: 309 SFKHGRQLHGWVIRRGMEWELSVANALIVLYSKRG 342

BLAST of HG10012396 vs. TAIR 10
Match: AT3G05260.1 (NAD(P)-binding Rossmann-fold superfamily protein )

HSP 1 Score: 376.7 bits (966), Expect = 5.0e-104
Identity = 184/249 (73.90%), Postives = 214/249 (85.94%), Query Frame = 0

Query: 8   VQGKVALVTGGDSGIGRAVCHCFALEGATVAFTYVKAQEDKDANDTIEMIKKAKSSAAKD 67
           + GKVALVTGGDSGIG+AVCHC+ALEGA+VAFTYVK +EDKDA +T+ ++ + K+  AK+
Sbjct: 37  LHGKVALVTGGDSGIGKAVCHCYALEGASVAFTYVKGREDKDAEETLRLLHEVKTREAKE 96

Query: 68  PLAIPADLGFDENCKRVVDEVVKAYGHIDILINNAAEQYKSTSVEDIDEDRLLRVFRTNI 127
           P+ I  DLGF+ENCKRVV+EVV ++G ID+L+N AAEQ++  S+EDIDE RL RVFRTNI
Sbjct: 97  PIMIATDLGFEENCKRVVEEVVNSFGRIDVLVNCAAEQHE-VSIEDIDEARLERVFRTNI 156

Query: 128 FSYFFTTRHALKHMKEGSSIINTTSVNAYKGNAKLLDYTATKGAIVSFTRGLALQLATKG 187
           FS FF  ++ALKHMKEGSSIINTTSV AY GN+ LL+YTATKGAIVSFTRGLALQLA KG
Sbjct: 157 FSQFFLVKYALKHMKEGSSIINTTSVVAYAGNSSLLEYTATKGAIVSFTRGLALQLAPKG 216

Query: 188 IRVNGVAPGPIWTPLIPASFDEEETANFGSQVPMKRAGQPIEVAPSYVFLACNVDSSYIT 247
           IRVNGVAPGP+WTPLIPASF EE    FGS+ PMKRA QP+EVAPSYVFLACN  SSY T
Sbjct: 217 IRVNGVAPGPVWTPLIPASFSEEAIKQFGSETPMKRAAQPVEVAPSYVFLACNHCSSYYT 276

Query: 248 GQVLHPNGG 257
           GQ+LHPNGG
Sbjct: 277 GQILHPNGG 284

BLAST of HG10012396 vs. TAIR 10
Match: AT4G25280.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 282.3 bits (721), Expect = 1.3e-75
Identity = 136/201 (67.66%), Postives = 164/201 (81.59%), Query Frame = 0

Query: 255 GGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTMILNTIKEGKIVPSELTIR 314
           GGPGSGKGTQC KIV+ FG  HLSAGDLLRREIA ++ +G MILN IK+GKIVPSE+T++
Sbjct: 50  GGPGSGKGTQCEKIVETFGLQHLSAGDLLRREIAMHTENGAMILNLIKDGKIVPSEVTVK 109

Query: 315 LIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLFFDCPEDEMVKRVLNRNEG 374
           LIQKE+ESSDN KFLIDGFPR+EENR+AFE+II A+PD+VLFFDCPE+EMVKRVLNRN+G
Sbjct: 110 LIQKELESSDNRKFLIDGFPRTEENRVAFERIIRADPDVVLFFDCPEEEMVKRVLNRNQG 169

Query: 375 RVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTVDEIYKQVYPVFASFNFEG 434
           R+DDNI T+KKRLK+F ALN PV+ YY  KGKLY INAVGTVD+I++ V P+F SF    
Sbjct: 170 RIDDNITTMKKRLKIFNALNRPVIDYYKNKGKLYTINAVGTVDDIFQHVLPIFNSFE--- 229

Query: 435 GGCGLRERMKQTPNERFGTAI 456
               L+E     P    G+++
Sbjct: 230 ---QLKESSHVNPQSHLGSSL 244

BLAST of HG10012396 vs. TAIR 10
Match: AT5G26667.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 231.5 bits (589), Expect = 2.6e-60
Identity = 113/192 (58.85%), Postives = 149/192 (77.60%), Query Frame = 0

Query: 240 NVDSSYITGQ---VLHPNGGPGSGKGTQCMKIVDNFGFTHLSAGDLLRREIASNSADGTM 299
           +VD++  +G+   V+   GGPGSGKGTQC  IV+++G+THLSAGDLLR EI S S +GTM
Sbjct: 3   SVDAANGSGKKPTVIFVLGGPGSGKGTQCAYIVEHYGYTHLSAGDLLRAEIKSGSENGTM 62

Query: 300 ILNTIKEGKIVPSELTIRLIQKEMESSDNYKFLIDGFPRSEENRIAFEQIIGAEPDIVLF 359
           I N IKEGKIVPSE+TI+L+QK ++ + N KFLIDGFPR+EENR AFE++   EP  VLF
Sbjct: 63  IQNMIKEGKIVPSEVTIKLLQKAIQENGNDKFLIDGFPRNEENRAAFEKVTEIEPKFVLF 122

Query: 360 FDCPEDEMVKRVLNRNEGRVDDNIDTIKKRLKVFAALNLPVVKYYLEKGKLYKINAVGTV 419
           FDCPE+EM KR+L RN+GR DDNI+TI+KR KVF   +LPV+ YY  KGK+ KINA   +
Sbjct: 123 FDCPEEEMEKRLLGRNQGREDDNIETIRKRFKVFLESSLPVIHYYEAKGKVRKINAAKPI 182

Query: 420 DEIYKQVYPVFA 429
           + ++++V  +F+
Sbjct: 183 EAVFEEVKAIFS 194

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7030831.16.2e-26683.66Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
KAE8650431.16.5e-21578.08hypothetical protein Csa_011770 [Cucumis sativus][more]
KAF3433128.12.3e-21266.67hypothetical protein FNV43_RR24230 [Rhamnella rubrinervis][more]
EOY22925.12.3e-21268.88Tetratricopeptide repeat-like superfamily protein [Theobroma cacao][more]
XP_021285603.16.7e-21266.78pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Herrania u... [more]
Match NameE-valueIdentityDescription
Q9FZ427.2e-11682.33NADPH-dependent aldehyde reductase 1, chloroplastic OS=Arabidopsis thaliana OX=3... [more]
Q5KTS58.2e-11278.71Glucose and ribitol dehydrogenase OS=Daucus carota OX=4039 GN=CAISE5 PE=2 SV=1[more]
Q9SB369.1e-11160.60Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidop... [more]
Q9MA937.0e-10373.90Glucose and ribitol dehydrogenase homolog 2 OS=Arabidopsis thaliana OX=3702 GN=A... [more]
Q75KH34.1e-9569.50Glucose and ribitol dehydrogenase homolog OS=Oryza sativa subsp. japonica OX=399... [more]
Match NameE-valueIdentityDescription
A0A061FZF01.1e-21268.88UMP-CMP kinase OS=Theobroma cacao OX=3641 GN=TCM_014953 PE=3 SV=1[more]
A0A6J1AF963.3e-21266.78UMP-CMP kinase OS=Herrania umbratica OX=108875 GN=LOC110417541 PE=3 SV=1[more]
A0A1R3JTM42.7e-20661.19Adenylate kinase OS=Corchorus olitorius OX=93759 GN=COLO4_14147 PE=3 SV=1[more]
A0A1R3ITD51.9e-20466.55Adenylate kinase OS=Corchorus capsularis OX=210143 GN=CCACVL1_09972 PE=3 SV=1[more]
A0A2H5NMG41.0e-19764.25UMP-CMP kinase OS=Citrus unshiu OX=55188 GN=CUMW_058940 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G54870.15.1e-11782.33NAD(P)-binding Rossmann-fold superfamily protein [more]
AT4G25270.16.5e-11260.60Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G05260.15.0e-10473.90NAD(P)-binding Rossmann-fold superfamily protein [more]
AT4G25280.11.3e-7567.66P-loop containing nucleoside triphosphate hydrolases superfamily protein [more]
AT5G26667.12.6e-6058.85P-loop containing nucleoside triphosphate hydrolases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002347Short-chain dehydrogenase/reductase SDRPRINTSPR00081GDHRDHcoord: 220..240
score: 36.35
coord: 12..29
score: 50.2
coord: 93..104
score: 51.06
coord: 139..155
score: 27.74
coord: 165..184
score: 40.69
coord: 186..203
score: 48.87
IPR002347Short-chain dehydrogenase/reductase SDRPRINTSPR00080SDRFAMILYcoord: 145..153
score: 43.67
coord: 165..184
score: 39.56
coord: 93..104
score: 49.11
NoneNo IPR availablePFAMPF13561adh_short_C2coord: 20..256
e-value: 1.5E-54
score: 185.0
NoneNo IPR availablePFAMPF00406ADKcoord: 255..405
e-value: 6.8E-44
score: 149.6
NoneNo IPR availableGENE3D3.40.50.720coord: 1..255
e-value: 1.4E-77
score: 262.6
NoneNo IPR availablePIRSRPIRSR629511-2PIRSR629511-2coord: 7..257
e-value: 4.2E-27
score: 92.8
NoneNo IPR availablePIRSRPIRSR000095-1PIRSR000095-1coord: 14..211
e-value: 2.4E-11
score: 41.2
NoneNo IPR availablePIRSRPIRSR000094-3PIRSR000094-3coord: 8..256
e-value: 2.9E-16
score: 57.2
NoneNo IPR availablePANTHERPTHR47925:SF105SUBFAMILY NOT NAMEDcoord: 503..810
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 503..810
NoneNo IPR availableCDDcd05355SDR_c1coord: 10..256
e-value: 9.2078E-149
score: 436.723
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 681..781
e-value: 2.8E-23
score: 84.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 537..680
e-value: 3.1E-25
score: 91.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 701..726
e-value: 0.1
score: 12.9
coord: 601..626
e-value: 0.016
score: 15.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 630..662
e-value: 4.5E-9
score: 33.9
coord: 730..764
e-value: 7.1E-9
score: 33.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 727..772
e-value: 6.0E-10
score: 39.2
coord: 628..673
e-value: 7.0E-10
score: 39.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 728..762
score: 13.570147
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 627..661
score: 12.912469
IPR006266UMP-CMP kinaseTIGRFAMTIGR01359TIGR01359coord: 255..427
e-value: 5.3E-71
score: 236.3
IPR027417P-loop containing nucleoside triphosphate hydrolaseGENE3D3.40.50.300coord: 256..430
e-value: 1.5E-54
score: 186.6
IPR027417P-loop containing nucleoside triphosphate hydrolaseSUPERFAMILY52540P-loop containing nucleoside triphosphate hydrolasescoord: 255..430
IPR020904Short-chain dehydrogenase/reductase, conserved sitePROSITEPS00061ADH_SHORTcoord: 152..180
IPR033690Adenylate kinase, conserved sitePROSITEPS00113ADENYLATE_KINASEcoord: 328..339
IPR000850Adenylate kinase/UMP-CMP kinaseHAMAPMF_00235Adenylate_kinase_Adkcoord: 255..428
score: 35.664444
IPR000850Adenylate kinase/UMP-CMP kinaseCDDcd01428ADKcoord: 255..419
e-value: 3.37549E-60
score: 200.926
IPR036291NAD(P)-binding domain superfamilySUPERFAMILY51735NAD(P)-binding Rossmann-fold domainscoord: 9..256

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10012396.1HG10012396.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006207 'de novo' pyrimidine nucleobase biosynthetic process
biological_process GO:0046940 nucleoside monophosphate phosphorylation
biological_process GO:0016310 phosphorylation
biological_process GO:0006221 pyrimidine nucleotide biosynthetic process
biological_process GO:0043170 macromolecule metabolic process
biological_process GO:0006139 nucleobase-containing compound metabolic process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus
molecular_function GO:0033862 UMP kinase activity
molecular_function GO:0019205 nucleobase-containing compound kinase activity
molecular_function GO:0004127 cytidylate kinase activity
molecular_function GO:0036431 dCMP kinase activity
molecular_function GO:0004672 protein kinase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0036430 CMP kinase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0009041 uridylate kinase activity