HG10020520 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10020520
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA polymerase epsilon subunit
LocationChr05: 252025 .. 264535 (-)
RNA-Seq ExpressionHG10020520
SyntenyHG10020520
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACACATTAACTCTGAGGAGAAAGGTACAGAGGAAAGCTAAAATCAGAGGGTTGTACAGCATCAAACTAGAAGCTCTTCATGAGATTGTTTCCTTCGTTTCTCGCTTCCATGGCTGTGAAGATGATGCTATAGAGCTTGTTCTCGACAATCTTCATGATGAATCTTGTACTACTATGTTCCTTTTTCTTTCATACTACAGTTATATCTTCTCTGAAATTCACCTCATTCCTTTTCATGGACAATTTCTTTGTTTTCTTCTATTACGGCAGTGAAATCTTCGATATTAGATAAGGATGCGGTGCATCGGGTTGTAAACATTATGCTTGCAGCTGATGAAGTGGGAGAAGAAAGTCCCACTACCGTTACCAGTACATCTGCTCTTTGCATCATCAACGCTTTTTATATCTCCAAGTTTCGTTATGATCCCATCAAAAAGATCTTCCATCTGTATGTTTTTCTCCCTTTTATATTTATCTGAATAATTGTTGGACAATTGTAGTATGAAAGCCACCAATTGGCCAATCTGAGCATAGTTGGTTAAGCATATACACTTGACCAAGAGGTCAGGGCAAAATCTCTCATTCCCATATGTTGTTGAATACATGAACCGAAGAAAACTACTAGTTGGACTTTGGGATAATTAGACGGGATAAGTGATGGAACTCATTGCAAAAAAGGAAGATGTAATGGGTTTTAACTGAGAACTTATCATCTTCATCAGTTTAAACATTGCTTTTCTCTGCACTTGAAACCAGTTGTTCTTGAACCACGCATATATATTTTTTTAGACACACTGGAAGCCTTCCAATTCATGGGGATGCTCCCGCTAAAGCTGCTCTATATAGAGATAGGTTTCTTCTGCTTTCCCAAAGGCTCTCCCGAGATCAGCATTTCTCCAAACCCGCTTTTGACATTGGCATGTCTCATTTCGGAAGTTGTGAGGTACTCCATTTCTCTTGACCCATGAACTGAGCTTGTTAGCCTCTACTATTGAACATGAGCATTTACACTAATTTCTTTTGTGATATGTTTCCCAGATTTCTCCTATTCAATCTCTGGTGGGGCAAACAGGGAGAAAATGGGTCATGGGAGTGATCTCTCAAATGGAGGACGGCCACTTTTACTTGGAGGACCTTACTGCGTCTGTGGAAATTAATTTATCTAATGCTATATCCTTTACTGGCATCTTCTATGATGTGGAAAAATTTAAACTTGTTTGATATTTGAAGATGACATGCACTGCAGTTTTCATTAATAATCAGCACAAGATAACTACAGGTCTATTTACAGAGAACACCATTGTTGTAGCAGAAGGAGAGATGCTCGTGGAGGGTATTTTTCAGGTTTGTGCCACGTTTGCCTTTAAGTTCTGTATAATTTGGTGGTTGGAATTTTCCCCTATTGTTCTATAAGGCATTTCTAACATCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCTCTCCCGCGTTTGTAGAGAGAACTTTCTTTCTTTAATACATATGATAGCGTTGCCATTAATTATGCTATTTTCTCTACGGATCATCCAGTTTTCCTATTGGCTATTGCTAGTATGCACTTTGCTTTACGATTTTTTCATATTTTATTATATTATCATGCTTGGTTACTTTTTAAAAAATGTCTTTGCACGTATGCCTATATTCTTTTCAACAAAACTTTGCAGTTATGGCACTTGTACTGTATAGCGGCATTTTGTTTATAATGTAACTAAAATGGCCTGAATTAAATTCCTTCCCGACTCAAATATTTTCTGAAGATTTTACGTGCTTATGAGATGATTTTCATACAATGTTACGGGATGGGTCATAGTTATGTTTGGTCCTGGTCAGAGTTTTGCATGCCTAAGTTATATTTTACGTAGTTCTCAATTGGTAAGTGGCTAACACCATCTTGTAGTTAGTTATTTAATATTTAGTTAGGTATCCAGAAGTTAAAAGTTTACTTTTGAGAAATCTGAAACCTCATCGTTCTAAACATAGTTTTATTTTTTTGGGTGAGTTCTGCCCACACTCTTCTTTAAAGGTTTTGAATCTTTGATAACATGAGGTTGACATTTAAGAACAAAACAGCTGAGGGAGAAGATGCAAGTGTCCTGCCCTCATCCAAAGGATATTTACTTCTCAAAGTTGGATTTGCATGTGGGATTTCATTCTTGTGGTTTACTTTCTTATTTGCTACCAAGCACAAGGATACCTTCCTTCTCAAGCGTTTTGGAAAATGTCCCGAACTTGTTTCATAGAAATATTTCAAATACTTTTCTTCATAGTAATAACGTGCTTTTCCTGGGCCTCTATTTTGTTAAGGTTGTTACATGTGGATTTCCTCCACTAGAGGAAAGGGACAAGTCTCTTAAATTGCTAGCAGGCCAGGACTTCTTTGGAGGTGGTGCTCTACCTAAAGAGGAGAATGTATCCTCTTTCTTTTTGTCTATAAAAAGGATTTCTAAATAAGCATTGGAAAGGAGGGCTATAGCTGACATTTTATTATGCTGAATTTTGGCTATTAGAAGGGTGTATACAACAACTTGTTTCCCTTAATTATCAATAGCTCAGGCTTGCAGATCTGGAGAAGAAAGCTGTTAATGATATGTTTGTCATACTCTCTGACATTTGGTTGGATAGTGAAGAGGTGATACATAACCTCTTTATATTTCACGTTTATTGTGTGCCTGTGAAGCTTGGTGGCCTTATCAAAATTTGTTCATGTGATAGGCCATGGGAAAACTGGAGACCATACTTGATGGTTTCGAGAATGTTGAAATGGTTCCTTCCTTATTTGTTTTGATGGGAAATTTTTGTTCCCGTCCATGTAATCTTGCCTTTAATTCTTTCTCAAGTCTCAGGTAAGTATTATCACAATTGTTTTCTGCGTATTGACATTGCTGTTATTGACATGAGTTCAGAGCAATAATGAGTTAGAATGTGCAGACTACAGTTTGGTAAGTTGGGAAAAATGATAGCAGCTCATCCACGATTAAAAGAGCATAGCAAGTTTCTTTTTATTCCTGGTCCCGATGATGCAGGTTTGTGTCGTTATTTGTTAAGAAGCTCCTAGCCTTGAGTTTAACTGTGCATGAGATTTTCTAAATTTTATTTACATGGAGTTTATAGGACCATCAACAGTTCTACCTAGGTGTGCTTTGCCGAAGTATTTAACTGAAGAGCTCCAGATGCACGTCCCAAATGCCATTTTCTCAAGTAACCCTTGCAGGTGGATACTTTGATTGTTGACTTATAGCATTAATGGTCTTTCTAGTCTTAACTGTAGTATTATCCTGCCTGAGGGGCTTCACTTCTACAGCTTACTCGAGAAACAAATCATATTTTTTCCATTTGGAATAAGCAAACAGAACTGTATTTAATAGATTTAAAGGCTAGTGTTAATGGTCTGGATTTAGTTATTCAGTTTTGTTAACTTTATTGAATTTAAGAAATAAAAAATAATAATAAAGTTCATTTATTAACAAGTGAGAACAAAGCCTATGATGGACAAAATAAATGGTTCTTCAAAAGAGCTTCCGATTTTTTACTTAATTAAAAGAAATAAACCGATTTTAAGTGGGAATTATGCTCTCTTTGGTGAATGGTATGTTACTCCATGGTGGTGGATAATTGTCATATTTGAATGCAGGGTCAGATTCTACACCCAGGAAATTGTATTTTTCCGACAAGACCTGCTTTATAGAATGCGCCGTTCCTGTCTCATACCACCTTCGACAGAAGAAACGAGTGATCCTTTTGAGCATGTGATATATTTTTTTTTCCTTTTATATATCTGTTATTTTTTCTAGTTATCAATTTATTGATTCTAGCCCTTTTGGATATTTATATTTGGAGGCCCTAAGGTTTCATGATTCTCTTGCAGCTTGTTGCGACCATAACTCATCAAAGTCATCTCTGCCCTCTTCCTCTGGTTATTCAACCAATTATCTGGAATTACGATCACTGTCTTCATCTATATCCTACTCCTCATGTGGTACTTACCCATATCTCAATCTAGTGTTTTAATCAATCTATTTATACATGCAAAGTTGGATATCTTCTCTTTCGGCACCTGAAGACAGATAATGAACGAGGCTTCTAATGGATGATAATGTTCTGCTGCGATTGAACCTCATGGCTACATGAACTTACAGATTATTATATCAATCTTTATTTGGTTTGTGCAGATAGTTTTGGGGGATAGAAGCAAGCAACAAGCATTCAAATACACAGGAATCACTTGCTTTAATCCTGGTTCCTTCACAAATGATAGCACTTTTGTGGCATATCGTCCTTGCAATCAAGAAGTTGAACTGTCTGCCTTGTAAATTATTACAAAGTTTAGGCCAGATTCAATGCGGGCACTCCTTATGCGGCATAACAATTCATGTCGAATTTTGAAAGAGAAATCCTATAGTATCATTCTTTATCTTGGGAGTTGCATTGTGCCCACCAAGAATTACCACTGGCTAGAATTTCATTCTATGCTGAAGAAGAGTATTCGAAACTTGGTGTAGAATCAGGAATCTGTTTAGATGTACAGTTGTAAATCTGAGTAAAATATTCTTTAGTTTGTATACTTTTAACGGTGGATGGTGTATTGATAAACAACTAACCTAAAAATTCTCCCACATGAGACTCGTTCAGGGTTGTATGTTATAATGTTACAAGAAGATCATGGATACAGGTTAACCGAAAGGAGCGTTAGAAGAGAAGAATGAGGGAAATGTTTTCAACATCCAACTTTGGTGAGTGTGGTTGTCCATACTTTTTGAGAGTTGCTGAAGTGTGTAGAACAGGTAGTTTAGTTGGGTATCCGTTTGATAAGACTTGTCAAACGGATAGGTCTGGTTAGTTGGGGCCATACACATTGCCCGTTGCCCACACAATTAATTCTTTCTTTCATGGAAGGTAAGAAGTTTAATTTTAAGGACCCTCGTCCTTTTTCTTTTCTTTTTTTTTTTTCTTCTCTTTTTGGATCCAAGTCCAACTACACATTCCATTCGCTTAGGATTTAAAAGTGGGGAAGGAAGGGAAAGTGGAAGCTTCCATCTGAGTGGGATAAGTCCATTTTTTTGTTGTTAGAATGATTGCAAACTTGACTTGTTTAACTAATTAAATATCATATTTGGAGATCAAAGCCAACCGTAACCACGTTTCTTTTCCCTTCATGCGCACACAGAAAATTAACTACCCCTATTTTTGTGCAACCCACTCGAGTGTCTAGTCACTAGTTGTCAGATAAGAAAAAAATTTTATTAAAAAAATATTTTTTTTTATATTTTTTTCTCCTTTGTTATGGAAGAAGATGCATGAGTTGCACTCAAGTATTACTCGTGGTATTACAATGTGCCTCATAATTCTAAATTCTCTATTTTCATAAACGAATGTTAGATACCATTTTTATCAATATATATTTTTTATTTTTAAAACAAGTTAAAAATGGGAAAATTATTATATAGAAAAAATATTAAACTATATATAAATATAAAAAAATTTCACTATCTATCACGGTCTATTGTGGAATTTTTCTATATTGGTAAATATTTTGGTTCATTTTGCTATATTTAAAAACAATCCTTAAAAATATGAAAAATTATTATAAACAGAAAAAATATCAACCTTTTTACAAACATATAAAAATTTCATTGTCTATGAGACTGCGATTGAATTGTATTGTGATCTATCGCTAATAGACAGTAAATTTTTCTATATTTGTAAATAGTTTGTCTCATTTTTCTATATTTGAAAACAGTTCTAAAAGTATCTTTATTCTTTTTATCCTTTCAAAAGTTGCAAAATGTTTGAGTAGAAGGGGGTAGCCTAAGATTATTTAAACACAAATTTGAATTTATTGATACTTATGAAAGAACTTCTTTTAAGTAATAGAACAAAATAGAGTCAGGAATCAAACTTTTAACTGAATATAGAAAGTACTGGCCAAACAGTTCAGCTATCCATATTTCAACAAAAGTAGTAAATAATGTGGAGTTTTGAGTAATACTAAAGGATTATTGTGGATATTTCTGCCCAATCAATCTAATCCAATGTGGGTTTTATTTTTCTGTTGCATCAAATTTGAAAGTGACAAAATGGTGATGTGATATATATATATATTATTTTTGTATTGCAAAACGAAACACTTAAGGCGAAAGAAAAGTAGTCTGAAAAACTACAATATTTGAGTTGATCAGTGGCATCTTTTTTACTATTTTATGTAATTTTTGGATGTAGCTGTGCTTTTGTGTATTGTTTTCAATGGCGGCATTCAATAATATAATTTATAGAATGATGGGTAGGTTAATTACCATTTTATCTCTGAATCAGCTGAGTGAACCTCTAGGCTCTAGGGATATTCCTGTAAATAAAGACTACGAACAGTGCTACATTTTACATTAGGTAGATTTTATGGTCAGTTATAGTTATAAAATAGACTGCAAGCTTTCCCTAAAAAGTGTGAAAAATTACACAAAAATGGAGTGTTGATGTTCCATAGTTAGTTTAGTTTTCATTATATATATATACATACATATATATATATATACATACATATATATATATATACATACATATATATATATATATATATATATATTTCCCTGTCTTTTACCTAATTATATTCCCACAATCACAATCACAAAAACCTCTCAACTAAAAGTGAGATGCACTCCAAGTTTCAATTCTCAGTTCGGTTTTGTGCTTAGTGATTTTTATCAACTAATTAAGAAATTTGAAACTATACTTAATTAATAATAATTACCTTGGTTAATTTGTTTGCCTTAATGAAGGATGGAGACTGCTGTAAACATTATTATTATAACCAAACTAATTAAAACTAACTTTGGCTGCCAAAGCAATATTTCATTTGATGATGGAGATTGTTTGTTAACTGCAGGCTTGCTTGCTAAATTATATGGTTTTGGTTTGGTAGCCATTTTCCAACTACCTCTTCTCTCTCGCCCTTCTATATCTCATTTATATAATTTTATAATTTAATTAATTACGAACTCAAAGCCATCATATCTCTTTTTTTTTTTATTTTAATCAGTTTATTTAATTAGGTAATTATAAATATTTCGCATTTACATAATGAAATTAAAGATCAATGTACAAATTGTGAACAGAAACTTCCAGAAGAGTGGTAGTAGGTGAAACAGTAGGTGTGAGGAGAAGTTGTGTGCTTTGTGTTTGCGATGATGCACATAATAATTAAATTATATAACACATTAATTATAATAGAGAAGATCAGAGAAATGTTAATTAAAGTAAATTAGTTAAAATTAGCCAGCAAATTTGAACACATAGTTGTACTGTTAATGGGAACTACATTTACAAGATATCTTATAATAAGCTAAAAGGTGGTTGAAAGATATACACCACTTCACACTTCTCTCCTTAATTTACCCTTCCTTCTTCCTTCTCACACACTATATTTTTTGTCCTACCTCAGAGAATGCTACCCCAACTCTCAAAAATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATATATATATATATATATATTATATATATATATATATATATATATATATATAGTAATTTTAAAAATAGAAAAACAAGGGAAATATTTATGCAAAATAGCAAAATTTTTATATAGTTGTAATAGACGCTTATAGAAGTCTATCAGGGTCTACCAGTGATAGACTTTATCAATCACTGATAGAAGTTTATCAATATCTATAAGTGTTTTTTTGTTTTTTTTGCTATTTTCCATAAATAGTTTGTCATTTTTTCTATATGTGAAAATTTCTCGTACGTTTTTCACTTGAATATAGTTTATCTAAAAGCAAGAGAGAAAATGGTTTTATTTGTGACTGCTAATATTTGTAATAGAAATTGAAAAAGGAGGAAACAACAGAGAGTTGGGTGTGGGGTTGATTATGGTATGAAGGGAACGGCTCTGATGGCTAAGAATTAAAGCCAAAGTCTTACCTAATGTATATATTATTATTAATTATATATGGGTTTATAGAACATATATAACTTGTAAGAATTTACCTCTAAAATAATATATTATATATTATATTGTAGTTAGACAATATAGCTCACTTTGTGTCGTAAGTATAAGTTTAGTGTTTGTTTGTTTTAGCTTTGCTTTTACTTTATGCATTGAAGCATTCAAAAGCCTTTCTCTAGTTAGGGAGCAGAGGCGAAGCTGAAACCATCATCCTAACAGAATCTGAAACTGTTATGTACAATACAAAATTCGTACACCCTAAATTTCTGTGTTTTTGTACGACCATAAAAAATAAATATCAAATTTAAATAAATAACCTTTCTCATTTTTGAACCGGAAAAGCCACCATAAACAATAATTCGCCTCTCAAATTATTAAGAATGTTGATGACAAATTTTAGATGGGGAAAACACACATAATCTTTACCTAACAACTATAAATTTATAATTTACGGAATAGAAGACCTTAAAAAAAAAAAAAAAAAAAGACTCAGATGCCAAAGTTAGAGATCAAAATTGTAGGTAGAGAAGATGTAGTGTTTCCTAGGGTTCAGAGTTGCCTAAGGATAATTAGATAATGCCTAGAAGTGTGCCCGATTTCATGTTGGGTTAGAGTTGGCTTGAGGCCGAGCATCCTCCTTAGGCCATATTCCGGTCCTAAAACGATATCTTGAACGTATGCCCCAGTTGGGTGATTGAGACTGAGGCCGAGCACACTGGCTTGGGCTAAGGTTCCAAACTAGGCCCTTTTCTTAGGCCCATTTTGGTCTTTGCCCGTATCATGCCCTCTTCTTGACTTTGTCGTTCCTATTTCTGACTAAATGTCAGTTATACTTTTTGGTCCAAAATCACTCATAACGAAGAACATAACAGCAAAATATGTAACAATTTATTGTGTAAATAAAAGTAATATGTAACTTCTTTCAAATATATATATATGTATATGAGAACGTGGATTAATATCTTAAATGCTAAAGTCTTTTTTTTTTTAACAAGAAACTAGAGATCAAGGGAGCCGTAGGCACACCATAATATCTCAACTAGGTAAACATATCCATAGCACCTTCTTCACGCCCTAGTTCCAAAAGCGATACATTGATCAAAACAAAAAATACATACGAGATGCATTGAACATTCTTAGCACCCTTATCCATGACGCTCAAGTCTTTTCAATTTGGCTGGAATATATAGTTTAGTTGTCTAAAGTTGTGTATTATCATAATGCAGGCACAAAATACCAATATATTTGAGATATTATTTAATAGTGAACTATATCATAATAAGTAGGGTTCAACCTAAGCGCTTGTTAATGTATATATCTTTCATCAAGTGATGGAAGATTTTTAGAAAGGTTCAAATTTTTACTTTCAACAATACAAATGCCCACCCAGAGTACTAATATGATAATTGTGAATGCAAAATTTGGATCATGTTTAGTAACTACTTCGTTATTTATTTCTTTTTTTTTTAATTTAAGTCTACAAACACTTCTTTCTACCTTTAATACTTTTTATCCATATTTTCAAAAACTAAGTCAATTTTTTTTAATTAAAAAAAGTAGTTTTTAAAAACTTGTGTTTGTTTAAACGGGGTCTTAACGGTACACATTCAACATTGATTCATCCTATATGAGTTTAAAAGATCATTAAGAAAAAAAATTGTTGTTGAAACTAACCTTTTGAAAAACCAAATGATCAAATCAACCCATCTAACTAAGTTGATATCAAAATCGAGAAAATTAAACCAACCAACTTTTTGAATGACAGAAATAATATGGAAAAAACGATATGAAGCAACTAAATTGATTTTGTGAAATTGATGTATTGCTCAAGATCAAACCAAGTATATATATTCTTCTCAATTTTTTGGTTTAAAGATAATTTCATCAGCATATATAGATCGAAAATATTTGAGTTGAAAACCGAAAGAAAAGAACACGTGTATTTGAATTGAAGGAAATGGGTAGTTAAGTTTGGAGGTTTAGAAAGTTAGTTAGGTTTGAAAAGTAAAGGCAACCTGAATCTGCTTTCCATCCATAACCATATAATTAAGCATCTAGAGACCTGAACAATAATATATAATAATAATCTTTATTTAACATTTTTTTTCTTTTTAAATTTATATTTAGAGAATGGAATATTGATGTAATTTGAATAAGTGAGAGAATAAAGGTAGGTAGGGTAATTAGGAGAAGAAAGGAAAGGAAATTAAAAAAGAAAGAGAAGAGAGAATTAATGAATTATATAAGAAAGACGTTTATAAGTAGAGGTTGTAGGTTAGAAACTTAGGAAATAAAGACGTTGAAAGGAGAACCAAAATGAGCTAGACTACTGTATGAAGCAAACGGCTGCTCTGTCACACTTACATTGGATTATTACTAATTTATAAAGATATATATATATATATATATATATATATATATATACACACACACACACATATATATATTTGGGAAATTAGTTTGTTTTGTTATTATTTGTAAAATCAAAATAATTAGATCGGATTAGGGTTTTAGGATAATGTGCAGCAGAGGGCATTGGCGGCCTGCTGAAGACGAGAAGCTTCGAGAGCTGGTTGAGCGTTACGGACCTCATAATTGGAACGCCATAGCTCAAAAGCTTGAAGGAAGATCCGGGAAGAGCTGCCGGTTGAGATGGTTCAATCAATTGGATCCACGAATCAACAGAAGCCCTTTCACGGAGGAGGAAGAAGAGAGGCTGATGGCCTCACATAGGGTTCATGGGAATAGATGGGCCATCATCGCAAGGCTTTTCCCTGGCCGCACCGACAACGCCGTCAAGAACCACTGGCACGTCATCATGGCTCGTAGGTCTAGAATTAATAGATCCAAAACCCAAACCCAAACTCAAACTCCATCTTCTTTTCACAATAATCGCCTTCTTCTCTCTTCTTTTCTCCAACTCCATACATATCATCCCACTTCTAATTCATTCATCCCCAAACCACCACCTGGTAATTAATTTTATTCTATTCTTTCTTTTTTTATATGTAATTATAATTATATATATATTATGTTTCCTTATAATTATATATATATTATGTGTATGCAGATGATAGAGTACATACGAATTCCATTCATTATTACGACTTTCTCCACATAAATACGGAGTCCATCAATGGAAGTGAAGTGATAGACAACTCAAGAAAAGAGGAGGATTATGAAGAGGTCAATGATCAAGAAGCCGCCGACACCGCCGGCCCGGTACTTCCCTTCATTGATTTCTTTTCTGCCTCCCAAACCTCCAACAACAACTCCATCTCATCCTAACATATATATATATATATATACATGCATATTATTAACCTCCTTTATCTTACATTTCATCTAACTCATTTGGGGCATCATCAAATCACACCCCGGCTACATATATACATACATGTAATCCATAAATATATTATAATTGTCATTTTGTACACACAATTTCAAACTTACATCACTTTTAGTATTTCTATTGTTTCTAAACTTTCTTGTGTCAATCCCATGCATGTTTTTCTTCTCTGCATGGTTTTAGTCCCACCTTAATTACTCCCTATCATTTTACACTTCATACTTAACTTCTCTCTCCATAATTGTTAAAATTGGATGACATATTTATATATAGAAATTGTGAGATGAAATTAAACTAAAAAGACGTTATTACTTTTCGAATATTACTTCCAACTAGAAAAAATAATAAAATCAAATAAGTGAGATATCAAATTTGAAATGAGAGCAACCTTAAGTTAGCATGAAGAAAGAAAAAGAGACAGCAGTTGTAACTCAAGTGGTTAGATGGATGATAAAGGCCTGAAAAGAGAGGAAAATCTTCGGGATCAGAAAAATTAGTTATTGAATTCAAACATATATTTATTTACCCTATAATATATTGAGTAATAGAAAACAATAGAATTCTATCGCAGTCTATCACTAATAGATTATAAATTTTTTCTATATTTATAAATAGTTTAGCTCATTTTTTTATATTTAAAAATAACCCTATAATATATTGGAACTAAAAGTGAAATATTCTATAAACAAAAATAGATTTGGAGCTCCAATATGGTTTCTAATTCTGTATATTGAAAAATGTACATTTTAGCAAGAGTTTAGGGACTTGTTTAAATAGTTATAAAGCTGTACATAATTAGGTCTTTTTTTTTTTTTTTGTTGGTAGTCCAGTTACTTATTAAGACTTGCGTTCCTTCTTTCTAACTTCTTGCCAAGTGTAATAATTACTAATTAATCCAGCAAGTAAGTTTAACACAATTGAAGAAAAGTAATGATTAGATGTAGCCTAGATTGCACATAATCCACATGCAGTTTTTTTTGTACATAATAGAAAACAAATCATTTAAAATAAATTGTCACAAGACAAATTTTTATTGGGTGCAGCCTGTTTTCCATTGAAGAATTCATTGTGTATCTAATTGACATTGAACATATCTTGTTAGAAACTAGAAAGTAAAAACGATAACTCACCGTGCGGAGTTCAAGAAGAATTGATCTTACAGTAG

mRNA sequence

ATGGACACATTAACTCTGAGGAGAAAGGTACAGAGGAAAGCTAAAATCAGAGGGTTGTACAGCATCAAACTAGAAGCTCTTCATGAGATTGTTTCCTTCGTTTCTCGCTTCCATGGCTGTGAAGATGATGCTATAGAGCTTGTTCTCGACAATCTTCATGATGAATCTTTGAAATCTTCGATATTAGATAAGGATGCGGTGCATCGGGTTGTAAACATTATGCTTGCAGCTGATGAAGTGGGAGAAGAAAGTCCCACTACCGTTACCAGTACATCTGCTCTTTGCATCATCAACGCTTTTTATATCTCCAAGTTTCGTTATGATCCCATCAAAAAGATCTTCCATCTACACACTGGAAGCCTTCCAATTCATGGGGATGCTCCCGCTAAAGCTGCTCTATATAGAGATAGGTTTCTTCTGCTTTCCCAAAGGCTCTCCCGAGATCAGCATTTCTCCAAACCCGCTTTTGACATTGGCATGTCTCATTTCGGAAGTTGTGAGATGACATGCACTGCAGTTTTCATTAATAATCAGCACAAGATAACTACAGGTCTATTTACAGAGAACACCATTGTTGTAGCAGAAGGAGAGATGCTCGTGGAGGGTATTTTTCAGGTTGTTACATGTGGATTTCCTCCACTAGAGGAAAGGGACAAGTCTCTTAAATTGCTAGCAGGCCAGGACTTCTTTGGAGGTGGTGCTCTACCTAAAGAGGAGAATCTCAGGCTTGCAGATCTGGAGAAGAAAGCTGTTAATGATATGTTTGTCATACTCTCTGACATTTGGTTGGATAGTGAAGAGGCCATGGGAAAACTGGAGACCATACTTGATGGTTTCGAGAATGTTGAAATGGTTCCTTCCTTATTTGTTTTGATGGGAAATTTTTGTTCCCGTCCATGTAATCTTGCCTTTAATTCTTTCTCAAGTCTCAGACTACAGTTTGGTAAGTTGGGAAAAATGATAGCAGCTCATCCACGATTAAAAGAGCATAGCAAGTTTCTTTTTATTCCTGGTCCCGATGATGCAGGACCATCAACAGTTCTACCTAGGTGTGCTTTGCCGAAGTATTTAACTGAAGAGCTCCAGATGCACGTCCCAAATGCCATTTTCTCAAACGAGAAGCTTCGAGAGCTGGTTGAGCGTTACGGACCTCATAATTGGAACGCCATAGCTCAAAAGCTTGAAGGAAGATCCGGGAAGAGCTGCCGGTTGAGATGGTTCAATCAATTGGATCCACGAATCAACAGAAGCCCTTTCACGGAGGAGGAAGAAGAGAGGCTGATGGCCTCACATAGGGTTCATGGGAATAGATGGGCCATCATCGCAAGGCTTTTCCCTGGCCGCACCGACAACGCCGTCAAGAACCACTGGCACGTCATCATGGCTCGTAGGTCTAGAATTAATAGATCCAAAACCCAAACCCAAACTCAAACTCCATCTTCTTTTCACAATAATCGCCTTCTTCTCTCTTCTTTTCTCCAACTCCATACATATCATCCCACTTCTAATTCATTCATCCCCAAACCACCACCTGATGATAGAGTACATACGAATTCCATTCATTATTACGACTTTCTCCACATAAATACGGAGTCCATCAATGGAAGTGAAGTGATAGACAACTCAAGAAAAGAGGAGGATTATGAAGAGGTCAATGATCAAGAAGCCGCCGACACCGCCGGCCCGAAACTAGAAAGTAAAAACGATAACTCACCGTGCGGAGTTCAAGAAGAATTGATCTTACAGTAG

Coding sequence (CDS)

ATGGACACATTAACTCTGAGGAGAAAGGTACAGAGGAAAGCTAAAATCAGAGGGTTGTACAGCATCAAACTAGAAGCTCTTCATGAGATTGTTTCCTTCGTTTCTCGCTTCCATGGCTGTGAAGATGATGCTATAGAGCTTGTTCTCGACAATCTTCATGATGAATCTTTGAAATCTTCGATATTAGATAAGGATGCGGTGCATCGGGTTGTAAACATTATGCTTGCAGCTGATGAAGTGGGAGAAGAAAGTCCCACTACCGTTACCAGTACATCTGCTCTTTGCATCATCAACGCTTTTTATATCTCCAAGTTTCGTTATGATCCCATCAAAAAGATCTTCCATCTACACACTGGAAGCCTTCCAATTCATGGGGATGCTCCCGCTAAAGCTGCTCTATATAGAGATAGGTTTCTTCTGCTTTCCCAAAGGCTCTCCCGAGATCAGCATTTCTCCAAACCCGCTTTTGACATTGGCATGTCTCATTTCGGAAGTTGTGAGATGACATGCACTGCAGTTTTCATTAATAATCAGCACAAGATAACTACAGGTCTATTTACAGAGAACACCATTGTTGTAGCAGAAGGAGAGATGCTCGTGGAGGGTATTTTTCAGGTTGTTACATGTGGATTTCCTCCACTAGAGGAAAGGGACAAGTCTCTTAAATTGCTAGCAGGCCAGGACTTCTTTGGAGGTGGTGCTCTACCTAAAGAGGAGAATCTCAGGCTTGCAGATCTGGAGAAGAAAGCTGTTAATGATATGTTTGTCATACTCTCTGACATTTGGTTGGATAGTGAAGAGGCCATGGGAAAACTGGAGACCATACTTGATGGTTTCGAGAATGTTGAAATGGTTCCTTCCTTATTTGTTTTGATGGGAAATTTTTGTTCCCGTCCATGTAATCTTGCCTTTAATTCTTTCTCAAGTCTCAGACTACAGTTTGGTAAGTTGGGAAAAATGATAGCAGCTCATCCACGATTAAAAGAGCATAGCAAGTTTCTTTTTATTCCTGGTCCCGATGATGCAGGACCATCAACAGTTCTACCTAGGTGTGCTTTGCCGAAGTATTTAACTGAAGAGCTCCAGATGCACGTCCCAAATGCCATTTTCTCAAACGAGAAGCTTCGAGAGCTGGTTGAGCGTTACGGACCTCATAATTGGAACGCCATAGCTCAAAAGCTTGAAGGAAGATCCGGGAAGAGCTGCCGGTTGAGATGGTTCAATCAATTGGATCCACGAATCAACAGAAGCCCTTTCACGGAGGAGGAAGAAGAGAGGCTGATGGCCTCACATAGGGTTCATGGGAATAGATGGGCCATCATCGCAAGGCTTTTCCCTGGCCGCACCGACAACGCCGTCAAGAACCACTGGCACGTCATCATGGCTCGTAGGTCTAGAATTAATAGATCCAAAACCCAAACCCAAACTCAAACTCCATCTTCTTTTCACAATAATCGCCTTCTTCTCTCTTCTTTTCTCCAACTCCATACATATCATCCCACTTCTAATTCATTCATCCCCAAACCACCACCTGATGATAGAGTACATACGAATTCCATTCATTATTACGACTTTCTCCACATAAATACGGAGTCCATCAATGGAAGTGAAGTGATAGACAACTCAAGAAAAGAGGAGGATTATGAAGAGGTCAATGATCAAGAAGCCGCCGACACCGCCGGCCCGAAACTAGAAAGTAAAAACGATAACTCACCGTGCGGAGTTCAAGAAGAATTGATCTTACAGTAG

Protein sequence

MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSSILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGSLPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMTCTAVFINNQHKITTGLFTENTIVVAEGEMLVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEAMGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLKEHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASHRVHGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSKTQTQTQTPSSFHNNRLLLSSFLQLHTYHPTSNSFIPKPPPDDRVHTNSIHYYDFLHINTESINGSEVIDNSRKEEDYEEVNDQEAADTAGPKLESKNDNSPCGVQEELILQ
Homology
BLAST of HG10020520 vs. NCBI nr
Match: XP_038894268.1 (DNA polymerase epsilon subunit B [Benincasa hispida])

HSP 1 Score: 669.8 bits (1727), Expect = 2.1e-188
Identity = 345/408 (84.56%), Postives = 359/408 (87.99%), Query Frame = 0

Query: 1   MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
           MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHGCEDDAI+LVLDNLHDESLKSS
Sbjct: 1   MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGCEDDAIDLVLDNLHDESLKSS 60

Query: 61  ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
           ILDKDAVHRVV+IMLAADEVGEESP T+TSTSALCII+AF ISKFRYDPIKKIF+LHTG 
Sbjct: 61  ILDKDAVHRVVSIMLAADEVGEESPNTITSTSALCIIDAFDISKFRYDPIKKIFYLHTGK 120

Query: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
           LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFD GMSHFGSCE++           
Sbjct: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDTGMSHFGSCEISPIQSLVGQTGR 180

Query: 181 --------------------CTAVFIN-NQHKITTGLFTENTIVVAEGEMLVEGIFQVVT 240
                                 +V IN +  KITTGLFTENTIVVAEGEMLVEGIFQVVT
Sbjct: 181 KWVMGVISQMEDGHFYLEDLTASVEINLSNAKITTGLFTENTIVVAEGEMLVEGIFQVVT 240

Query: 241 CGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEA 300
           CGFPPLEERDKSLKLLAGQDFFGGGAL KEE LRLADLEKKAVNDMFVILSDIWLDSEEA
Sbjct: 241 CGFPPLEERDKSLKLLAGQDFFGGGALSKEETLRLADLEKKAVNDMFVILSDIWLDSEEA 300

Query: 301 MGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLK 360
           MGKLETILDGFENVE+VPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLG+MIAAHPRL 
Sbjct: 301 MGKLETILDGFENVEVVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGRMIAAHPRLI 360

Query: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           EHSKFLFIPGPDDAGPSTVLPRCALPK+LTEELQMHVPNAIFS+   R
Sbjct: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKFLTEELQMHVPNAIFSSNPCR 408

BLAST of HG10020520 vs. NCBI nr
Match: TYJ98844.1 (DNA polymerase epsilon subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 642.5 bits (1656), Expect = 3.5e-180
Identity = 330/418 (78.95%), Postives = 350/418 (83.73%), Query Frame = 0

Query: 1    MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
            MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHG EDDAIELVLD+LH+ESLKS 
Sbjct: 613  MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGFEDDAIELVLDDLHEESLKSP 672

Query: 61   ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
            IL+KD VHRV++ +LAA+E  E SP T+TST+ALCII+AF ISKFRYDPIKKIFH HTG 
Sbjct: 673  ILEKDGVHRVISKLLAAEEAEEASPNTITSTTALCIIDAFDISKFRYDPIKKIFHRHTGD 732

Query: 121  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
            LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE++           
Sbjct: 733  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEISPIQSLVGQTGR 792

Query: 181  --------------------CTAVFIN-----------NQHKITTGLFTENTIVVAEGEM 240
                                  +V IN             HKITTGLFTENTI+VAEGEM
Sbjct: 793  KWVMGVISQMEDGHFYLEDLTASVEINLSNAISFTGNFYDHKITTGLFTENTIIVAEGEM 852

Query: 241  LVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVIL 300
            LVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGG LPKEE LRLADLEKKAVNDMFVIL
Sbjct: 853  LVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGVLPKEETLRLADLEKKAVNDMFVIL 912

Query: 301  SDIWLDSEEAMGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLG 360
            SDIWLDSEEAMGKLETILDGFENVE+VPSLFVLMGNFCS PCN+AFNSFSSLRLQFGKLG
Sbjct: 913  SDIWLDSEEAMGKLETILDGFENVEVVPSLFVLMGNFCSHPCNIAFNSFSSLRLQFGKLG 972

Query: 361  KMIAAHPRLKEHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
            KMIAAHPRL +HSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQ+HVPNA FS+   R
Sbjct: 973  KMIAAHPRLNKHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQVHVPNATFSSNPCR 1030

BLAST of HG10020520 vs. NCBI nr
Match: XP_016899382.1 (PREDICTED: LOW QUALITY PROTEIN: DNA polymerase epsilon subunit 2 [Cucumis melo])

HSP 1 Score: 641.7 bits (1654), Expect = 6.0e-180
Identity = 328/408 (80.39%), Postives = 349/408 (85.54%), Query Frame = 0

Query: 1   MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
           MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHG EDDAIELVLD+LH+ESLKS 
Sbjct: 1   MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGFEDDAIELVLDDLHEESLKSP 60

Query: 61  ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
           IL+KD VHRV++ +LAA+E  E SP T+TST+ALCII+AF ISKFRYDPIKKIFH HTG 
Sbjct: 61  ILEKDGVHRVISKLLAAEEAEEASPNTITSTTALCIIDAFDISKFRYDPIKKIFHRHTGD 120

Query: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
           LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE++           
Sbjct: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEISPIQSLVGQTGR 180

Query: 181 --------------------CTAVFIN-NQHKITTGLFTENTIVVAEGEMLVEGIFQVVT 240
                                 +V IN +  KITTGLFTENTI+VAEGEMLVEGIFQVVT
Sbjct: 181 KWVMGVISQMEDGHFYLEDLTASVEINLSNAKITTGLFTENTIIVAEGEMLVEGIFQVVT 240

Query: 241 CGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEA 300
           CGFPPLEERDKSLKLLAGQDFFGGG LPKEE LRLADLEKKAVNDMFVILSDIWLDSEEA
Sbjct: 241 CGFPPLEERDKSLKLLAGQDFFGGGVLPKEETLRLADLEKKAVNDMFVILSDIWLDSEEA 300

Query: 301 MGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLK 360
           MGKLETILDGFENVE+VPSLFVLMGNFCS PCN+AFNSFSSLRLQFGKLGKMIAAHPRL 
Sbjct: 301 MGKLETILDGFENVEVVPSLFVLMGNFCSHPCNIAFNSFSSLRLQFGKLGKMIAAHPRLN 360

Query: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           +HSKF FIPGPDDAGPSTVLPRCALPKYLTEELQ+HVPNA FS+   R
Sbjct: 361 KHSKFXFIPGPDDAGPSTVLPRCALPKYLTEELQVHVPNATFSSNPCR 408

BLAST of HG10020520 vs. NCBI nr
Match: KAA0036041.1 (DNA polymerase epsilon subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 640.6 bits (1651), Expect = 1.3e-179
Identity = 327/416 (78.61%), Postives = 347/416 (83.41%), Query Frame = 0

Query: 1    MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
            MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHG EDDAIELVLD+LH+ESLKS 
Sbjct: 614  MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGFEDDAIELVLDDLHEESLKSP 673

Query: 61   ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
            IL+KD VHRV++ +LAA+E  E SP T+TST+ALCII+AF ISKFRYDPIKKIFH HTG 
Sbjct: 674  ILEKDGVHRVISKLLAAEEAEEASPNTITSTTALCIIDAFDISKFRYDPIKKIFHRHTGD 733

Query: 121  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE------------- 180
            LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE             
Sbjct: 734  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEISPIQSLVGQTGR 793

Query: 181  ---------------------------MTCTAVFINNQHKITTGLFTENTIVVAEGEMLV 240
                                       ++    F  N + ITTGLFTENTI+VAEGEMLV
Sbjct: 794  KWVMGVISQMEDGHFYLEDLTASVEINLSNAISFTGNFYDITTGLFTENTIIVAEGEMLV 853

Query: 241  EGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSD 300
            EGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGG LPKEE LRLADLEKKAVNDMFVILSD
Sbjct: 854  EGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGVLPKEETLRLADLEKKAVNDMFVILSD 913

Query: 301  IWLDSEEAMGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKM 360
            IWLDSEEAMGKLETILDGFENVE+VPSLFVLMGNFCS PCN+AFNSFSSLRLQFGKLGKM
Sbjct: 914  IWLDSEEAMGKLETILDGFENVEVVPSLFVLMGNFCSHPCNIAFNSFSSLRLQFGKLGKM 973

Query: 361  IAAHPRLKEHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
            IAAHPRL +HSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQ+HVPNA FS+   R
Sbjct: 974  IAAHPRLNKHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQVHVPNATFSSNPCR 1029

BLAST of HG10020520 vs. NCBI nr
Match: KAG7037068.1 (DNA polymerase epsilon subunit B [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 640.2 bits (1650), Expect = 1.7e-179
Identity = 326/401 (81.30%), Postives = 349/401 (87.03%), Query Frame = 0

Query: 1   MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
           MD  TLRRKVQRK+KIRG YSIKL+AL EIVSF SRF GCEDDAI+LVLD+LHDESL+SS
Sbjct: 1   MDESTLRRKVQRKSKIRG-YSIKLDALAEIVSFASRFDGCEDDAIDLVLDHLHDESLQSS 60

Query: 61  ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
           I+DKDAVHRVV+I+LAADE GEE P T TSTSALCII+AF ISK+RYDPIKKIF++HTGS
Sbjct: 61  IIDKDAVHRVVSILLAADEAGEECPDTSTSTSALCIIDAFDISKYRYDPIKKIFYMHTGS 120

Query: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMTCTAVFINN--- 180
           LPIHGDA AKAALYRDRFLLLSQRLSRDQHFSKPAFD GMSHFGSCE++     +     
Sbjct: 121 LPIHGDASAKAALYRDRFLLLSQRLSRDQHFSKPAFDSGMSHFGSCEISPIQSLVGQTGR 180

Query: 181 ----------------------QHKITTGLFTENTIVVAEGEMLVEGIFQVVTCGFPPLE 240
                                  HKITTGLFTENTIVVAEGEMLVEGIF+V+TCGFPPLE
Sbjct: 181 KWVMGVISQMEDGHFFLEDLTASHKITTGLFTENTIVVAEGEMLVEGIFKVITCGFPPLE 240

Query: 241 ERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEAMGKLETI 300
           +RDKSLKLLAGQDFFGGG+L KEE LRLADLEK+AVNDMFVILSDIWLDSEEAMGKLETI
Sbjct: 241 DRDKSLKLLAGQDFFGGGSLTKEETLRLADLEKRAVNDMFVILSDIWLDSEEAMGKLETI 300

Query: 301 LDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLKEHSKFLF 360
           LDGFENVE+VPSLFVLMGNFCSRPCNLAFNS+SSLRLQFGKLGKMIAA PRLKEHSKFLF
Sbjct: 301 LDGFENVEIVPSLFVLMGNFCSRPCNLAFNSYSSLRLQFGKLGKMIAARPRLKEHSKFLF 360

Query: 361 IPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           IPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFS+   R
Sbjct: 361 IPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSSNPCR 400

BLAST of HG10020520 vs. ExPASy Swiss-Prot
Match: Q500V9 (DNA polymerase epsilon subunit B OS=Arabidopsis thaliana OX=3702 GN=DPB2 PE=1 SV=1)

HSP 1 Score: 449.1 bits (1154), Expect = 7.5e-125
Identity = 238/403 (59.06%), Postives = 295/403 (73.20%), Query Frame = 0

Query: 7   RRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCED-DAIELVLDNLHDESLKSSILDKD 66
           R+K+Q+K K RG Y++K +AL EI+ F  +F   +D +AI+L+LDNL  E+ KSS +D +
Sbjct: 8   RKKIQKKFKNRG-YNLKFDALDEILVFADQFPDDDDGEAIDLLLDNL-QETHKSSTVDAE 67

Query: 67  AVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGSLPIHG 126
           +V  ++N +L A    EE PT  TS S+L II+AF + KF YD +KK F+ HT SLPIHG
Sbjct: 68  SVRGLINRLLGAHNAPEE-PT--TSASSLAIIDAFLVPKFGYDSVKKKFNEHTSSLPIHG 127

Query: 127 DAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMTCTAVFIN--------- 186
           +A AK ALYR+RF+LLSQR+SR +HFS+PAFD  MS F + E++     I+         
Sbjct: 128 EASAKTALYRERFMLLSQRVSRAEHFSRPAFDAEMSQFENNEISSIQSLISQRGRKWVMG 187

Query: 187 -----------------------NQHKITTGLFTENTIVVAEGEMLVEGIFQVVTCGFPP 246
                                  ++ KITTG FTENTI++AEGEM V GIFQV+TCGFPP
Sbjct: 188 VISQLEDGHFYLEDLSASVEIDLSKAKITTGFFTENTIILAEGEMQVNGIFQVITCGFPP 247

Query: 247 LEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEAMGKLE 306
           LE+RDK+LK  +  DFFGGG L KEE ++LADLE++AVND FVILSDIWLD EE M KLE
Sbjct: 248 LEDRDKTLKAHSEYDFFGGGTLTKEEMIKLADLERQAVNDTFVILSDIWLDDEEVMRKLE 307

Query: 307 TILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLKEHSKF 366
           T+LDGFE+VE VPSLFV MGNFCSRPCNL+F S+SSLR QFGKLG+MI  HPRLKE+S+F
Sbjct: 308 TVLDGFESVETVPSLFVFMGNFCSRPCNLSFGSYSSLREQFGKLGRMIGNHPRLKENSRF 367

Query: 367 LFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           LFIPGP+DAGPSTVLPRCALPKYLTEEL+  +PNAIFS+   R
Sbjct: 368 LFIPGPEDAGPSTVLPRCALPKYLTEELRNIIPNAIFSSNPCR 405

BLAST of HG10020520 vs. ExPASy Swiss-Prot
Match: Q6R0C4 (Transcription factor MYB52 OS=Arabidopsis thaliana OX=3702 GN=MYB52 PE=2 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 4.3e-48
Identity = 112/209 (53.59%), Postives = 132/209 (63.16%), Query Frame = 0

Query: 372 NEKLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASH 431
           +EKLRELVE++GPHNWNAIAQKL GRSGKSCRLRWFNQLDPRINR+PFTEEEEERL+ASH
Sbjct: 13  DEKLRELVEQFGPHNWNAIAQKLSGRSGKSCRLRWFNQLDPRINRNPFTEEEEERLLASH 72

Query: 432 RVHGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSK-------------------- 491
           R+HGNRW++IAR FPGRTDNAVKNHWHVIMARR R  RSK                    
Sbjct: 73  RIHGNRWSVIARFFPGRTDNAVKNHWHVIMARRGR-ERSKLRPRGLGHDGTVAATGMIGN 132

Query: 492 -----------TQTQTQTPSSFH--NNRLLLSSFLQLHTYHPTSNSFIPKPPPDDRVHTN 548
                      T T    P  F   N+  +L  FL        S + I +   D      
Sbjct: 133 YKDCDKERRLATTTAINFPYQFSHINHFQVLKEFLTGKIGFRNSTTPIQEGAIDQT--KR 192

BLAST of HG10020520 vs. ExPASy Swiss-Prot
Match: Q9FX36 (Transcription factor MYB54 OS=Arabidopsis thaliana OX=3702 GN=MYB54 PE=1 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 9.6e-48
Identity = 110/203 (54.19%), Postives = 134/203 (66.01%), Query Frame = 0

Query: 372 NEKLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASH 431
           +EKL++LVE+YGPHNWNAIA KL GRSGKSCRLRWFNQLDPRINR+PFTEEEEERL+A+H
Sbjct: 14  DEKLKDLVEQYGPHNWNAIALKLPGRSGKSCRLRWFNQLDPRINRNPFTEEEEERLLAAH 73

Query: 432 RVHGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSKTQTQTQTPSS---FHNNRLL 491
           R+HGNRW+IIARLFPGRTDNAVKNHWHVIMARR+R   SK +    T SS     + +++
Sbjct: 74  RIHGNRWSIIARLFPGRTDNAVKNHWHVIMARRTR-QTSKPRLLPSTTSSSSLMASEQIM 133

Query: 492 LSSFLQLHTYHPTSNSFIPKPPPDDRV-------HTNSIH-------------------- 541
           +SS    H Y   S+    K  P D +       H N +H                    
Sbjct: 134 MSSGGYNHNY---SSDDRKKIFPADFINFPYKFSHINHLHFLKEFFTGKIALNHKANQSK 193

BLAST of HG10020520 vs. ExPASy Swiss-Prot
Match: Q9LQX5 (Transcription factor MYB117 OS=Arabidopsis thaliana OX=3702 GN=MYB117 PE=2 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 1.0e-41
Identity = 84/150 (56.00%), Postives = 109/150 (72.67%), Query Frame = 0

Query: 374 KLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASHRV 433
           KL+ELV  YGP NWN IA+KL+GRSGKSCRLRWFNQLDPRINR  FTEEEEERLM +HR+
Sbjct: 108 KLKELVSIYGPQNWNLIAEKLQGRSGKSCRLRWFNQLDPRINRRAFTEEEEERLMQAHRL 167

Query: 434 HGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSKTQTQTQTPSSFHNNRLLLSSFL 493
           +GN+WA+IARLFPGRTDN+VKNHWHV+MAR+ R          +  S++   +L+ ++ L
Sbjct: 168 YGNKWAMIARLFPGRTDNSVKNHWHVVMARKYR----------EHSSAYRRRKLMSNNPL 227

Query: 494 QLHTYHPTSNSFIPKPPPDDRVHTNSIHYY 524
           + H     +N+  P P P+     ++ HY+
Sbjct: 228 KPH----LTNNHHPNPNPNYHSFISTNHYF 243

BLAST of HG10020520 vs. ExPASy Swiss-Prot
Match: Q5NBM8 (Transcription factor CSA OS=Oryza sativa subsp. japonica OX=39947 GN=CSA PE=2 SV=2)

HSP 1 Score: 172.2 bits (435), Expect = 1.8e-41
Identity = 79/112 (70.54%), Postives = 93/112 (83.04%), Query Frame = 0

Query: 374 KLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASHRV 433
           KL++LV +YGP NWN IA+KL+GRSGKSCRLRWFNQLDPRINR  FTEEEEERLMA+HR 
Sbjct: 59  KLKDLVAQYGPQNWNLIAEKLDGRSGKSCRLRWFNQLDPRINRRAFTEEEEERLMAAHRA 118

Query: 434 HGNRWAIIARLFPGRTDNAVKNHWHVIMARRSR-----INRSKTQTQTQTPS 481
           +GN+WA+IARLFPGRTDNAVKNHWHV+MARR R       R K  + + +P+
Sbjct: 119 YGNKWALIARLFPGRTDNAVKNHWHVLMARRHREQSGAFRRRKPSSSSASPA 170

BLAST of HG10020520 vs. ExPASy TrEMBL
Match: A0A5D3BIF9 (Glutamate decarboxylase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G00380 PE=3 SV=1)

HSP 1 Score: 642.5 bits (1656), Expect = 1.7e-180
Identity = 330/418 (78.95%), Postives = 350/418 (83.73%), Query Frame = 0

Query: 1    MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
            MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHG EDDAIELVLD+LH+ESLKS 
Sbjct: 613  MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGFEDDAIELVLDDLHEESLKSP 672

Query: 61   ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
            IL+KD VHRV++ +LAA+E  E SP T+TST+ALCII+AF ISKFRYDPIKKIFH HTG 
Sbjct: 673  ILEKDGVHRVISKLLAAEEAEEASPNTITSTTALCIIDAFDISKFRYDPIKKIFHRHTGD 732

Query: 121  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
            LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE++           
Sbjct: 733  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEISPIQSLVGQTGR 792

Query: 181  --------------------CTAVFIN-----------NQHKITTGLFTENTIVVAEGEM 240
                                  +V IN             HKITTGLFTENTI+VAEGEM
Sbjct: 793  KWVMGVISQMEDGHFYLEDLTASVEINLSNAISFTGNFYDHKITTGLFTENTIIVAEGEM 852

Query: 241  LVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVIL 300
            LVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGG LPKEE LRLADLEKKAVNDMFVIL
Sbjct: 853  LVEGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGVLPKEETLRLADLEKKAVNDMFVIL 912

Query: 301  SDIWLDSEEAMGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLG 360
            SDIWLDSEEAMGKLETILDGFENVE+VPSLFVLMGNFCS PCN+AFNSFSSLRLQFGKLG
Sbjct: 913  SDIWLDSEEAMGKLETILDGFENVEVVPSLFVLMGNFCSHPCNIAFNSFSSLRLQFGKLG 972

Query: 361  KMIAAHPRLKEHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
            KMIAAHPRL +HSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQ+HVPNA FS+   R
Sbjct: 973  KMIAAHPRLNKHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQVHVPNATFSSNPCR 1030

BLAST of HG10020520 vs. ExPASy TrEMBL
Match: A0A1S4DTQ5 (DNA polymerase epsilon subunit OS=Cucumis melo OX=3656 GN=LOC103501415 PE=3 SV=1)

HSP 1 Score: 641.7 bits (1654), Expect = 2.9e-180
Identity = 328/408 (80.39%), Postives = 349/408 (85.54%), Query Frame = 0

Query: 1   MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
           MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHG EDDAIELVLD+LH+ESLKS 
Sbjct: 1   MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGFEDDAIELVLDDLHEESLKSP 60

Query: 61  ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
           IL+KD VHRV++ +LAA+E  E SP T+TST+ALCII+AF ISKFRYDPIKKIFH HTG 
Sbjct: 61  ILEKDGVHRVISKLLAAEEAEEASPNTITSTTALCIIDAFDISKFRYDPIKKIFHRHTGD 120

Query: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
           LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE++           
Sbjct: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEISPIQSLVGQTGR 180

Query: 181 --------------------CTAVFIN-NQHKITTGLFTENTIVVAEGEMLVEGIFQVVT 240
                                 +V IN +  KITTGLFTENTI+VAEGEMLVEGIFQVVT
Sbjct: 181 KWVMGVISQMEDGHFYLEDLTASVEINLSNAKITTGLFTENTIIVAEGEMLVEGIFQVVT 240

Query: 241 CGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEA 300
           CGFPPLEERDKSLKLLAGQDFFGGG LPKEE LRLADLEKKAVNDMFVILSDIWLDSEEA
Sbjct: 241 CGFPPLEERDKSLKLLAGQDFFGGGVLPKEETLRLADLEKKAVNDMFVILSDIWLDSEEA 300

Query: 301 MGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLK 360
           MGKLETILDGFENVE+VPSLFVLMGNFCS PCN+AFNSFSSLRLQFGKLGKMIAAHPRL 
Sbjct: 301 MGKLETILDGFENVEVVPSLFVLMGNFCSHPCNIAFNSFSSLRLQFGKLGKMIAAHPRLN 360

Query: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           +HSKF FIPGPDDAGPSTVLPRCALPKYLTEELQ+HVPNA FS+   R
Sbjct: 361 KHSKFXFIPGPDDAGPSTVLPRCALPKYLTEELQVHVPNATFSSNPCR 408

BLAST of HG10020520 vs. ExPASy TrEMBL
Match: A0A5A7T342 (Glutamate decarboxylase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold112G00330 PE=3 SV=1)

HSP 1 Score: 640.6 bits (1651), Expect = 6.5e-180
Identity = 327/416 (78.61%), Postives = 347/416 (83.41%), Query Frame = 0

Query: 1    MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
            MD LTLR+KVQRKAKIRGLYSIKLEAL EIVSFVSRFHG EDDAIELVLD+LH+ESLKS 
Sbjct: 614  MDALTLRKKVQRKAKIRGLYSIKLEALDEIVSFVSRFHGFEDDAIELVLDDLHEESLKSP 673

Query: 61   ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
            IL+KD VHRV++ +LAA+E  E SP T+TST+ALCII+AF ISKFRYDPIKKIFH HTG 
Sbjct: 674  ILEKDGVHRVISKLLAAEEAEEASPNTITSTTALCIIDAFDISKFRYDPIKKIFHRHTGD 733

Query: 121  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE------------- 180
            LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCE             
Sbjct: 734  LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEISPIQSLVGQTGR 793

Query: 181  ---------------------------MTCTAVFINNQHKITTGLFTENTIVVAEGEMLV 240
                                       ++    F  N + ITTGLFTENTI+VAEGEMLV
Sbjct: 794  KWVMGVISQMEDGHFYLEDLTASVEINLSNAISFTGNFYDITTGLFTENTIIVAEGEMLV 853

Query: 241  EGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSD 300
            EGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGG LPKEE LRLADLEKKAVNDMFVILSD
Sbjct: 854  EGIFQVVTCGFPPLEERDKSLKLLAGQDFFGGGVLPKEETLRLADLEKKAVNDMFVILSD 913

Query: 301  IWLDSEEAMGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKM 360
            IWLDSEEAMGKLETILDGFENVE+VPSLFVLMGNFCS PCN+AFNSFSSLRLQFGKLGKM
Sbjct: 914  IWLDSEEAMGKLETILDGFENVEVVPSLFVLMGNFCSHPCNIAFNSFSSLRLQFGKLGKM 973

Query: 361  IAAHPRLKEHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
            IAAHPRL +HSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQ+HVPNA FS+   R
Sbjct: 974  IAAHPRLNKHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQVHVPNATFSSNPCR 1029

BLAST of HG10020520 vs. ExPASy TrEMBL
Match: A0A6J1GCV3 (DNA polymerase epsilon subunit OS=Cucurbita moschata OX=3662 GN=LOC111452793 PE=3 SV=1)

HSP 1 Score: 635.6 bits (1638), Expect = 2.1e-178
Identity = 328/408 (80.39%), Postives = 352/408 (86.27%), Query Frame = 0

Query: 1   MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
           MD  TLRRKVQRK+KIRG YSIKL+AL EIVSF SRF GCEDDAI+LVLD+LHDESL+SS
Sbjct: 1   MDESTLRRKVQRKSKIRG-YSIKLDALAEIVSFASRFDGCEDDAIDLVLDHLHDESLQSS 60

Query: 61  ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
           I+DKDAVHRVV+I+LAADE GEE P T TSTSALCII+AF ISK+RYDPIKKIF++HTGS
Sbjct: 61  IIDKDAVHRVVSILLAADEAGEECPDTSTSTSALCIIDAFDISKYRYDPIKKIFYMHTGS 120

Query: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
           LPIHGDA AKAALYRDRFLLLSQRLSRDQHFSKPAFD GMSHFGSCE++           
Sbjct: 121 LPIHGDASAKAALYRDRFLLLSQRLSRDQHFSKPAFDSGMSHFGSCEISPIQSLVGQTGR 180

Query: 181 --------------------CTAVFIN-NQHKITTGLFTENTIVVAEGEMLVEGIFQVVT 240
                                 +V IN +  KITTGLFTENTIVVAEGEMLVEGIF+V+T
Sbjct: 181 KWVMGVISQMEDGHFFLEDLTASVEINLSSAKITTGLFTENTIVVAEGEMLVEGIFKVIT 240

Query: 241 CGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEA 300
           CGFPPLE+RDKSLKLLAGQDFFGGG+L KEE LRLADLEK+AVNDMFVILSDIWLDSEEA
Sbjct: 241 CGFPPLEDRDKSLKLLAGQDFFGGGSLTKEETLRLADLEKRAVNDMFVILSDIWLDSEEA 300

Query: 301 MGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLK 360
           MGKLETILDGFENVE+VPSLFVLMGNFCSRPCNLAFNS+SSLRLQFGKLGKMIAA PRLK
Sbjct: 301 MGKLETILDGFENVEIVPSLFVLMGNFCSRPCNLAFNSYSSLRLQFGKLGKMIAARPRLK 360

Query: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFS+   R
Sbjct: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSSNPCR 407

BLAST of HG10020520 vs. ExPASy TrEMBL
Match: A0A6J1KGF0 (DNA polymerase epsilon subunit OS=Cucurbita maxima OX=3661 GN=LOC111492993 PE=3 SV=1)

HSP 1 Score: 633.6 bits (1633), Expect = 7.9e-178
Identity = 325/408 (79.66%), Postives = 352/408 (86.27%), Query Frame = 0

Query: 1   MDTLTLRRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCEDDAIELVLDNLHDESLKSS 60
           MD  TLRRKVQRK+KIRG YSIKL+AL EIVSF SRF GCEDDAI+LVLD+LHDESL+SS
Sbjct: 1   MDESTLRRKVQRKSKIRG-YSIKLDALAEIVSFASRFDGCEDDAIDLVLDHLHDESLQSS 60

Query: 61  ILDKDAVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGS 120
           I+DKDAVHRVV+I+LAADE GEE P T TSTSALCII+AF ISK+RYDPIKKIF++HTGS
Sbjct: 61  IIDKDAVHRVVSILLAADEAGEECPDTSTSTSALCIIDAFDISKYRYDPIKKIFYMHTGS 120

Query: 121 LPIHGDAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMT----------- 180
           LPIHGDA AKAALYRDRFLLLSQRLSRDQHFSKPAFD GMSHFGSCE++           
Sbjct: 121 LPIHGDASAKAALYRDRFLLLSQRLSRDQHFSKPAFDSGMSHFGSCEISPIQSLVGQTGR 180

Query: 181 --------------------CTAVFIN-NQHKITTGLFTENTIVVAEGEMLVEGIFQVVT 240
                                 +V IN +  KITTGLFTENT+VVAEGEMLVEGIF+V+T
Sbjct: 181 KWVMGVISQMEDGHFYLEDLTASVEINLSSAKITTGLFTENTVVVAEGEMLVEGIFKVIT 240

Query: 241 CGFPPLEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEA 300
           CGFPPLE+RDKSLKLLAGQDFFGGG+L KEE LRLADLEK+A+NDMFVILSDIWLDSEEA
Sbjct: 241 CGFPPLEDRDKSLKLLAGQDFFGGGSLTKEETLRLADLEKRAINDMFVILSDIWLDSEEA 300

Query: 301 MGKLETILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLK 360
           MGKLETILDGFENVE+VPSLFVLMGNFCSRPCNLAFNS+SSLRLQFGKLGKMIA+ PRLK
Sbjct: 301 MGKLETILDGFENVEIVPSLFVLMGNFCSRPCNLAFNSYSSLRLQFGKLGKMIASRPRLK 360

Query: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFS+   R
Sbjct: 361 EHSKFLFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSSNPCR 407

BLAST of HG10020520 vs. TAIR 10
Match: AT5G22110.1 (DNA polymerase epsilon subunit B2 )

HSP 1 Score: 449.1 bits (1154), Expect = 5.3e-126
Identity = 238/403 (59.06%), Postives = 295/403 (73.20%), Query Frame = 0

Query: 7   RRKVQRKAKIRGLYSIKLEALHEIVSFVSRFHGCED-DAIELVLDNLHDESLKSSILDKD 66
           R+K+Q+K K RG Y++K +AL EI+ F  +F   +D +AI+L+LDNL  E+ KSS +D +
Sbjct: 8   RKKIQKKFKNRG-YNLKFDALDEILVFADQFPDDDDGEAIDLLLDNL-QETHKSSTVDAE 67

Query: 67  AVHRVVNIMLAADEVGEESPTTVTSTSALCIINAFYISKFRYDPIKKIFHLHTGSLPIHG 126
           +V  ++N +L A    EE PT  TS S+L II+AF + KF YD +KK F+ HT SLPIHG
Sbjct: 68  SVRGLINRLLGAHNAPEE-PT--TSASSLAIIDAFLVPKFGYDSVKKKFNEHTSSLPIHG 127

Query: 127 DAPAKAALYRDRFLLLSQRLSRDQHFSKPAFDIGMSHFGSCEMTCTAVFIN--------- 186
           +A AK ALYR+RF+LLSQR+SR +HFS+PAFD  MS F + E++     I+         
Sbjct: 128 EASAKTALYRERFMLLSQRVSRAEHFSRPAFDAEMSQFENNEISSIQSLISQRGRKWVMG 187

Query: 187 -----------------------NQHKITTGLFTENTIVVAEGEMLVEGIFQVVTCGFPP 246
                                  ++ KITTG FTENTI++AEGEM V GIFQV+TCGFPP
Sbjct: 188 VISQLEDGHFYLEDLSASVEIDLSKAKITTGFFTENTIILAEGEMQVNGIFQVITCGFPP 247

Query: 247 LEERDKSLKLLAGQDFFGGGALPKEENLRLADLEKKAVNDMFVILSDIWLDSEEAMGKLE 306
           LE+RDK+LK  +  DFFGGG L KEE ++LADLE++AVND FVILSDIWLD EE M KLE
Sbjct: 248 LEDRDKTLKAHSEYDFFGGGTLTKEEMIKLADLERQAVNDTFVILSDIWLDDEEVMRKLE 307

Query: 307 TILDGFENVEMVPSLFVLMGNFCSRPCNLAFNSFSSLRLQFGKLGKMIAAHPRLKEHSKF 366
           T+LDGFE+VE VPSLFV MGNFCSRPCNL+F S+SSLR QFGKLG+MI  HPRLKE+S+F
Sbjct: 308 TVLDGFESVETVPSLFVFMGNFCSRPCNLSFGSYSSLREQFGKLGRMIGNHPRLKENSRF 367

Query: 367 LFIPGPDDAGPSTVLPRCALPKYLTEELQMHVPNAIFSNEKLR 377
           LFIPGP+DAGPSTVLPRCALPKYLTEEL+  +PNAIFS+   R
Sbjct: 368 LFIPGPEDAGPSTVLPRCALPKYLTEELRNIIPNAIFSSNPCR 405

BLAST of HG10020520 vs. TAIR 10
Match: AT1G17950.1 (myb domain protein 52 )

HSP 1 Score: 194.1 bits (492), Expect = 3.1e-49
Identity = 112/209 (53.59%), Postives = 132/209 (63.16%), Query Frame = 0

Query: 372 NEKLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASH 431
           +EKLRELVE++GPHNWNAIAQKL GRSGKSCRLRWFNQLDPRINR+PFTEEEEERL+ASH
Sbjct: 13  DEKLRELVEQFGPHNWNAIAQKLSGRSGKSCRLRWFNQLDPRINRNPFTEEEEERLLASH 72

Query: 432 RVHGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSK-------------------- 491
           R+HGNRW++IAR FPGRTDNAVKNHWHVIMARR R  RSK                    
Sbjct: 73  RIHGNRWSVIARFFPGRTDNAVKNHWHVIMARRGR-ERSKLRPRGLGHDGTVAATGMIGN 132

Query: 492 -----------TQTQTQTPSSFH--NNRLLLSSFLQLHTYHPTSNSFIPKPPPDDRVHTN 548
                      T T    P  F   N+  +L  FL        S + I +   D      
Sbjct: 133 YKDCDKERRLATTTAINFPYQFSHINHFQVLKEFLTGKIGFRNSTTPIQEGAIDQT--KR 192

BLAST of HG10020520 vs. TAIR 10
Match: AT1G73410.1 (myb domain protein 54 )

HSP 1 Score: 193.0 bits (489), Expect = 6.9e-49
Identity = 110/203 (54.19%), Postives = 134/203 (66.01%), Query Frame = 0

Query: 372 NEKLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASH 431
           +EKL++LVE+YGPHNWNAIA KL GRSGKSCRLRWFNQLDPRINR+PFTEEEEERL+A+H
Sbjct: 14  DEKLKDLVEQYGPHNWNAIALKLPGRSGKSCRLRWFNQLDPRINRNPFTEEEEERLLAAH 73

Query: 432 RVHGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSKTQTQTQTPSS---FHNNRLL 491
           R+HGNRW+IIARLFPGRTDNAVKNHWHVIMARR+R   SK +    T SS     + +++
Sbjct: 74  RIHGNRWSIIARLFPGRTDNAVKNHWHVIMARRTR-QTSKPRLLPSTTSSSSLMASEQIM 133

Query: 492 LSSFLQLHTYHPTSNSFIPKPPPDDRV-------HTNSIH-------------------- 541
           +SS    H Y   S+    K  P D +       H N +H                    
Sbjct: 134 MSSGGYNHNY---SSDDRKKIFPADFINFPYKFSHINHLHFLKEFFTGKIALNHKANQSK 193

BLAST of HG10020520 vs. TAIR 10
Match: AT1G26780.1 (myb domain protein 117 )

HSP 1 Score: 172.9 bits (437), Expect = 7.3e-43
Identity = 84/150 (56.00%), Postives = 109/150 (72.67%), Query Frame = 0

Query: 374 KLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASHRV 433
           KL+ELV  YGP NWN IA+KL+GRSGKSCRLRWFNQLDPRINR  FTEEEEERLM +HR+
Sbjct: 108 KLKELVSIYGPQNWNLIAEKLQGRSGKSCRLRWFNQLDPRINRRAFTEEEEERLMQAHRL 167

Query: 434 HGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSKTQTQTQTPSSFHNNRLLLSSFL 493
           +GN+WA+IARLFPGRTDN+VKNHWHV+MAR+ R          +  S++   +L+ ++ L
Sbjct: 168 YGNKWAMIARLFPGRTDNSVKNHWHVVMARKYR----------EHSSAYRRRKLMSNNPL 227

Query: 494 QLHTYHPTSNSFIPKPPPDDRVHTNSIHYY 524
           + H     +N+  P P P+     ++ HY+
Sbjct: 228 KPH----LTNNHHPNPNPNYHSFISTNHYF 243

BLAST of HG10020520 vs. TAIR 10
Match: AT1G26780.2 (myb domain protein 117 )

HSP 1 Score: 172.9 bits (437), Expect = 7.3e-43
Identity = 84/150 (56.00%), Postives = 109/150 (72.67%), Query Frame = 0

Query: 374 KLRELVERYGPHNWNAIAQKLEGRSGKSCRLRWFNQLDPRINRSPFTEEEEERLMASHRV 433
           KL+ELV  YGP NWN IA+KL+GRSGKSCRLRWFNQLDPRINR  FTEEEEERLM +HR+
Sbjct: 108 KLKELVSIYGPQNWNLIAEKLQGRSGKSCRLRWFNQLDPRINRRAFTEEEEERLMQAHRL 167

Query: 434 HGNRWAIIARLFPGRTDNAVKNHWHVIMARRSRINRSKTQTQTQTPSSFHNNRLLLSSFL 493
           +GN+WA+IARLFPGRTDN+VKNHWHV+MAR+ R          +  S++   +L+ ++ L
Sbjct: 168 YGNKWAMIARLFPGRTDNSVKNHWHVVMARKYR----------EHSSAYRRRKLMSNNPL 227

Query: 494 QLHTYHPTSNSFIPKPPPDDRVHTNSIHYY 524
           + H     +N+  P P P+     ++ HY+
Sbjct: 228 KPH----LTNNHHPNPNPNYHSFISTNHYF 243

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894268.12.1e-18884.56DNA polymerase epsilon subunit B [Benincasa hispida][more]
TYJ98844.13.5e-18078.95DNA polymerase epsilon subunit 2 [Cucumis melo var. makuwa][more]
XP_016899382.16.0e-18080.39PREDICTED: LOW QUALITY PROTEIN: DNA polymerase epsilon subunit 2 [Cucumis melo][more]
KAA0036041.11.3e-17978.61DNA polymerase epsilon subunit 2 [Cucumis melo var. makuwa][more]
KAG7037068.11.7e-17981.30DNA polymerase epsilon subunit B [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
Q500V97.5e-12559.06DNA polymerase epsilon subunit B OS=Arabidopsis thaliana OX=3702 GN=DPB2 PE=1 SV... [more]
Q6R0C44.3e-4853.59Transcription factor MYB52 OS=Arabidopsis thaliana OX=3702 GN=MYB52 PE=2 SV=1[more]
Q9FX369.6e-4854.19Transcription factor MYB54 OS=Arabidopsis thaliana OX=3702 GN=MYB54 PE=1 SV=1[more]
Q9LQX51.0e-4156.00Transcription factor MYB117 OS=Arabidopsis thaliana OX=3702 GN=MYB117 PE=2 SV=1[more]
Q5NBM81.8e-4170.54Transcription factor CSA OS=Oryza sativa subsp. japonica OX=39947 GN=CSA PE=2 SV... [more]
Match NameE-valueIdentityDescription
A0A5D3BIF91.7e-18078.95Glutamate decarboxylase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S4DTQ52.9e-18080.39DNA polymerase epsilon subunit OS=Cucumis melo OX=3656 GN=LOC103501415 PE=3 SV=1[more]
A0A5A7T3426.5e-18078.61Glutamate decarboxylase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A6J1GCV32.1e-17880.39DNA polymerase epsilon subunit OS=Cucurbita moschata OX=3662 GN=LOC111452793 PE=... [more]
A0A6J1KGF07.9e-17879.66DNA polymerase epsilon subunit OS=Cucurbita maxima OX=3661 GN=LOC111492993 PE=3 ... [more]
Match NameE-valueIdentityDescription
AT5G22110.15.3e-12659.06DNA polymerase epsilon subunit B2 [more]
AT1G17950.13.1e-4953.59myb domain protein 52 [more]
AT1G73410.16.9e-4954.19myb domain protein 54 [more]
AT1G26780.17.3e-4356.00myb domain protein 117 [more]
AT1G26780.27.3e-4356.00myb domain protein 117 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001005SANT/Myb domainSMARTSM00717santcoord: 415..463
e-value: 9.5E-14
score: 61.7
coord: 366..412
e-value: 1.0E-5
score: 35.0
IPR001005SANT/Myb domainPROSITEPS50090MYB_LIKEcoord: 372..410
score: 8.675871
IPR001005SANT/Myb domainPROSITEPS50090MYB_LIKEcoord: 411..461
score: 9.802274
IPR001005SANT/Myb domainCDDcd00167SANTcoord: 373..408
e-value: 3.81869E-11
score: 56.0446
IPR001005SANT/Myb domainCDDcd00167SANTcoord: 418..458
e-value: 3.56605E-11
score: 56.4298
NoneNo IPR availableGENE3D1.10.8.60coord: 1..73
e-value: 1.1E-7
score: 33.8
NoneNo IPR availableGENE3D1.10.10.60coord: 417..486
e-value: 4.8E-18
score: 66.9
NoneNo IPR availableGENE3D1.10.10.60coord: 371..415
e-value: 3.8E-18
score: 67.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 537..582
IPR007185DNA polymerase alpha/delta/epsilon, subunit BPFAMPF04042DNA_pol_E_Bcoord: 255..370
e-value: 6.0E-19
score: 68.2
IPR017930Myb domainPFAMPF00249Myb_DNA-bindingcoord: 373..409
e-value: 5.1E-11
score: 42.6
coord: 416..459
e-value: 2.5E-13
score: 50.0
IPR017930Myb domainPROSITEPS51294HTH_MYBcoord: 411..465
score: 25.520588
IPR017930Myb domainPROSITEPS51294HTH_MYBcoord: 373..410
score: 13.79203
IPR016266DNA polymerase epsilon, subunit BPANTHERPTHR12708DNA POLYMERASE EPSILON SUBUNIT Bcoord: 7..169
coord: 174..373
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 373..457

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10020520.1HG10020520.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006261 DNA-dependent DNA replication
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0006536 glutamate metabolic process
biological_process GO:0051781 positive regulation of cell division
biological_process GO:0006260 DNA replication
cellular_component GO:0008622 epsilon DNA polymerase complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0070182 DNA polymerase binding
molecular_function GO:0004351 glutamate decarboxylase activity
molecular_function GO:0030170 pyridoxal phosphate binding