HG10002100 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10002100
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description2-(3-amino-3-carboxypropyl)histidine synthase subunit 2
LocationChr11: 3370855 .. 3384498 (-)
RNA-Seq ExpressionHG10002100
SyntenyHG10002100
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTTGGAATCCTACTACGAAATTTCCCGTACCGCCGACTTCATTCACACCCGGAACTTCACCAGAGTTGCATTGCAGGTTAAATTCCCAAATACTGGAGAATGAAACGATTCTCACTTCTTTCTCTGATCCTTATTTACTAACGACAGAAAGTTTTGTTTTTTTTCCTATTTAAAACTTGAATTTGATTATCTTTCTGAGTACCTTGACAGTCTTCCAAACAGAATTTTAAGCTAGACCTTTATTTTTTTGAACTCTCTGAAGTTTATTTTTGACGTTTTCCCCATCTTTTTCTTCATTTTCTTTTGAAAATTCTAGTTCCCTGATGAGCTCTTGAAGGATTCAACTAGAGTGGTGAGGGCTTTAAAGGATAGGCTTCGTGTGCTTGATGAATCTGATACAAATCACAATGACAGCAAAAATGAGGTTAGGTTGTTCATAATGGCTGATACAACTTATGGTAGTTGTTGTGTTGATGAGGTTGGAGCAGCTCACGCTAATGCTGATTGTGTCATACACTATGGACATACCTGTCTGAGCCCGTGAGTACCAATCTACTATGGCATGATGCTTTCTACGCAAGGTTTTTGTTTTCTTTTCACCTTGCAGCCCCCTTGGAAATCAGCATCCTAAGTATTGTATCGTTTTTGCAAATTTGTGCAAGTGTTTGTGCTGTTGTTGATTTTTTGCATTTTTGAGTGAGAAACTACAGCTAAAAAAACTTGTAATGTAACCCCTGGGAAGTGGATTGCAGATGTGTTATACGTCATAATTCATATTTAATGGTTGGTGCTATGAAATTATTCTTTGCATTATCAGCTAATGCTTCAAAAGTCTGAAATATAATTGGTGTGGTATAAAAAAACTTAGATATTTACTGCAATACTGTTGGCCTTTGAAGTAGCATCATTAAATAAAGTATTTATGTTATCAATCATCACGTAAGAGGCATCTACATGGTTATGCTAACCTTTTGGATGATTGAGCATTCACCATTCCTCGTGATTGAGCAAATGAGGTTCTTACTTGACTGATCATAAGTGCACCTTGGTAAAAGTGGAGAAGTATTTGGATGTGACTCTACTAAGGTGTTTGTATGCTTGGCTTTCCTTATGATATGGAGTTTTGGATGAAAGATTTTTTCTTGGAATGTATTTTACTTATTCTTCCTTGTTCAATATTCTTGGTTGATTTCTTAGTTTTAACTTTAATTGGAATATTAATATTTATATCCCTCCTGAAAGATTGGTAGATCTACTAATGGATAATAAAGGTGTAACCTACAGTCTATGCTATTATTTAAATGCTCTCATTTTTCAGTCTTTAGCATCTGGCTTATGTGATTTGTCACAGTGATTGCTGGTTTACTATTAAGAGTTTTACAAATCACTTCTGGTGTTTTTCAGGACAACAACTCTTCCAGCACTTTTTGTTTTTGGAAAGGATTCAATCGATGTATCCACTTGTGCCAAAACTTTATCTCAGTACAGTTTGTCCAATGGAAAGCCTGTTTTGGTATGCATTTGTATTCTTAGATCAATGTACTGCTTTTTTGTTTTTTTTTGTTTTTTTTTTAAAATGACAATATGACATCCATTATGAAGAATGATGGAGATAACAGAAAAAACGAAGCCAAAAACATAGACCCTAGCACCCAAAACATTAAGACTCCCATAAGCTTGGGATCTTCAGTTATATGTGCTTACAATAATTCACATCATTATACTCTAGATTTTAGAATGAAGCATTTGGGAAAAAAGTAATGTAGCTGCAAAGTTTCCTTTTACTACCGTATTCCTTCGTGCACCTTTCTGTACTCTATAAGTCAGAGAATGAATCTCTCAAAAAGTATATTCCATAAGACTTGAGTTACATATATATAGGCGAATAAGAAATCCTAATCTAGACAATGTGAAATTACACTTAAGAACAATATACATAGATAATTACAATTAAGGACGTATTCTAACACTCCCCCTCAAGCTGGAGCAAATATATCAATCATGCCCAGCTTGTTACATAGATAATTTATTCTTGCTCCATTCACAGCTTTGGTAAGAATATCTCCCAACTGTTCTCCAGTCTTCACATATCCTGTAGATATCACTCATTCTTGTATTTTCTCACGAATGAAATGACAGTCCACTTCAATATGTTTAGTTCGTTCATGAAACACTGGGTTGGATGCAATGTGCATAGCAGCTTGATTATCACACCATAATTTAGCTGGCACGGTAACGCTAAAGCCCATCTCAAATAAAAGTTGATGTATCCACATTATCTCACACACAGATTGTGCCATAGCTCTATATTCTGACTCCGCACTCGAGCGTGAAACTACATTCTGTTTCTTACTTTTCCACGACACTAAGTTTCCTCCAACAAAAACACAATATCCAGTCGTCGATCTCCTATCTTCTCGAGATCCAGCCCAATCTGCATCTGAAAAACACTAACCCTTGTGTGGCCATGATCTTTATATAAGATCCCACGTCCAGGTGTAGCTTTCAGATAACATAAAATTTGTTCTACTGCAGCCCAATGATCTACTGTAGGAGAAGACATAAACTGACTCACTATACTCACAACATAAGCAATGTCTGGTCGTGTCACTGTTAAATAGTTCAACTTCCCGACTAGTCTTCTATATCTCTCAGGGTCTTTAAACAATTCACCTTCTTTGACAAGTAGCAAATTTGGTATCATTGGAGTACTACATGGTTTGACTCCTAGTTTTCCAGTCTCAGACAACAAATCAAGTACATATTTTCGTTGTGATAGATAAATACCTTTCTTGCTCCTCATTACTTCAATACCCAAGAAGTATTTTAACTCTCCCAAATCTTTAGTATGAAACTGACCCTGAAGAAAAGTTTTAAGAGAGGATATACCTGATACATCATTGCCAGTGATAACAATATCATCAACATACACAACTAGCAAAGTAATACCATGCTCTGATCGTCGATAAAATACAGAATGATCAGATGTACTTTTCTTCATACCAAAACATTCAAGGGCTTGACTAAACTTACCAAACCATGCATGAGGGCTCTGTTTCAATCCATACAAAGATTTTCGAAGTCGACACACTTTACCACTCTCCCCCTGAGCAACAAACCCAGGTGGTTGCTCCATATAAACTTCTTCTTGAAGATCACCATGTAGAAAAACATTCTTAATATCAAGTTGATGCAAAGACCACTTGTGAGTAGCAGTCATGGAAAGAAACAATCGAATGGAAGTCAATTTGGCAACAGGAGAAAAAGTATCAGAATAATCAGTCCCATAGATTTGGGCATAACCCTTGGCAACAAGGCGAGCTTTCAGACGAGCCACTGTGCCGTCAGGATTGAATTTGATAGCAAACACCCATTTACACCCAATTGCCTTCTTGCCTGCCGGCGGAGACACTAAATCCCAAGTACCATTGTCATCCAAAGCAGTCATCTCCTCAATCATTGCACTACGCCACCCAGGATGAGATAAAACTTCATGAACAGAGTTAGGAATAGAGGTGGAATCAAAAGATGTAAGAAATGAATAAGTGGGAGGAGACAACTGGTTATACGAAACAAATGAAGCAATAGGACAAGTACAAGTGCGCTTACCTTTCCGAAGGGCAATAGGAAGATCATCACTCGATCCTGGATCAGATGTCGAAGAAGGTAGAGGTGGAGGACATGTTCCTGTGGGTTGTTGAGGAGGCCGTCTGGAGAAGACTCGAGTAACAGGTGGAGGAGACGAAAGAGAGACAGAAGACGACGTAGGAAGAGACGGTGTAGGAACGGTAACTTTGGTAATCTCATAAATCAAAAGATCATCATCCTCCCCTTGACAAGAACTTGACGGTGAAGAACTAAAGGGTATATCTTCAAAAAACGTAACATCAAGAGATACAAGATATCTGTTCAGATTAAGAGAATAACAACGATAACCCTTTTGAACACGCGAATAGCCTAAGAATATGCATTTCAACGATTTTGGATCTAACTTAGTACGATGGGGACTAACATCCCGAACAAAACAGACACACCCAAATATTTTAGGAGCAATAGGAAACAAAGGTTTTGTAGGAAACAAAACACGATAGGGAATCTCACCGCTGAGAACAGAAGAGGGCATTCGATTAATCAGGAAGCAAGCGGTGGAAACAGCATCCGCCCAAAAATGCTTAGGGACATGCATTTGAAAGGATAGGGCTCGAGCTGTTTCAAGTAGGTGTCGGTTTTTTCTTTCAGCAACTCCATTTTGAGATGGAGTGTCAGGACACGAGGATTGATGAATGATGTCATTATCACGTAAGTAAGACCCAAGTAGATGAGAAAAATATTCACCAGCATTATCAGTGCACAGGATTTTGATAGAGACATCAAATTGAGTTCGAATTTCAGCATGAAAAGCACAAAAATGAGATAGTAATTCAGAACGATTTTTCATTAAATATAACCAAGTCACACGAGTATAATCGTCAACAAAAGTAACAAAATAGTGAAAACCAGTTTGGGACACAACAGGACACAGACCCCCAATATCAGAGTGAACTAACTCAAATGGAGCACTAGCTCGTTTATGGACTCTAGGACTAGAGCTAAGACGATGAAACTTAGCAAACTGACACGAATCACAATTTAAAGAAGACAAAGAACGAAATTCGGGATAAAGTTTCTTCAACACAACAATGATGGATGACCTAAACGACAATGAACTTCAAACGGAGATGTAACTCCACGACAAGCCACAACTTTCGGGATTTGGTGGTCGAAAATGTAAAGGCCTCCTGACTCATGTCCTCTACCAATAATCTTCTTCGTCACATGATCCTGAAACAAGCAATAACCAGGAAAGAACGAGACAAAACAATTAAGGTCACGAGTAAGTTGGCTAATAGAGATTAAATTAAAAGACAATTGAGGCAAATGTAACACAGAAGACAAAGAGAGGGAAGAAGTAAGATTAACGGTGCCGGAGCCAAGGACAGAAGAGGTGGAGCCATCTGCCAAGGTAACAGATGGGGATGAGGTAGGGGACAAAGGACTAGAAAACAAGTTAGAATTACCTGTCATATGAGCAGTGGCACCAGAGTCTATGACCCATTTGGTAGATGATGTAAGGAGGCATTTTGTGTTACCTGTCTCAGCAATGGTGGCAATAGGAGTCGATGAGGAAGATGCCTGTAGGGATTCCTGGAATACTTGGAATTTAGCAAAGTCTTCAGCAGAGATGGTAACAGACTTCTCAGGCATATCAGTGGTGGAAGCTATCTGAATGCTGAGATCTCTGAGTCTTATACAGCAGCTTTCGACAATCACGTTTCACATGGCTTGGCTTACGACAATAATGACAGACGATCTCCTGAGAATCTGATTTTCGACTATCATAGGTAGGCTTCTGAAAATGGTTAGGCATCCCTGAAGGAGCTCGGGGTTGATCATTCTTGCTTATGAGAGCACTGCTCGGTTGAGACACAGACGAGCCAGATTGAGAACTCTCAATACGAAGAACACGACTAAAGGTGTCGTCTAGCGATGGAATGTCAGAATTAGATAGAATCTGTGTTTTAGCCATGCCAAATTCAGGGACAGCCCATTCAAAAATATCATAACAGCCATCTTTTCTCGTTGAGCTTGCTGAACTTTGATATCAGGGCTGAAAGGTAATAATAAAGCAAGTTCAGCGGCTATCTTCTTAAATCTCATAAAATAGCTTGTAATAGATTCTGCCTTCTGTTCGGCTCGAAAAAACTGCATGCAAACATCAAACATTCGATGGACGTGTTCTTTCCCCGAATAAAGAAATTCCAAGAACTCTAAAAGTTCCTTCACCGAATTACAATGATTCACCAAGCCAACGACTTCACTCTCAATGGAGTTCTTAATTTGAAGATAAAGACGAGCATCATCGCGAAGCCAACCCTTTTTCTTATCATCTTTCGGTGGATCCTCAGTTACATGATCATCCATATCAGTACTCAGTAGGTAAAACTGAATCGTTCTTCGCCACTCATAGTAATTAGATCCATTTAACTTGTGTTCGGTGATCTTGGATGATACGGGAAGTACATTAGACACCACCATATGTGTCGGTTCAGCCATGACAACTTGTAGATATCGAAAATAGGGTAGATGCAACAAAAAAACAGTGAACCAAGACCCGGAATAGATAAAACAGCCCCAAAAAGGACCAAAATAGTCACAAACCCTAAGCGCAAAAAGGGTATTTTCGAAATAACCTCACACTCGAGACGATTTTGATGGAACCAAAGGTACAGACAGGTCAGAAACTCAAAACCGAGACGAACCCAACTGACGGACGCGGAACCAGATGTCCCACGTGCCTCCACGCGCCGGCACGTGACGGAGGAGTCCGAAGGTATAGGCGGCGCGTGAGCCTCACGCGCGGCAGTTTCCGGCGACTGACGAAGACAGATGGCTTCGGAAGGTGGCGGCGCTTCTTCTGAGGTGGGTGGTGTCGACAAACGGCCACTCGAAATCGATGACCAAAATACCAAACCCTAACTCTATGAGATTGTTGCCGTCAAACGAAAGTTCGAAAGTTCGAAACCCTAGAGGGCTTTGATACCATGTACTCTATAAGTCAGAGAATGAATCTCTCAAAAAGTATATTCCATAAGACTTGAGTTACATATATATAGGCGAATAAGAAACCCTAATCTGGACAATGTGAAATTACACTTAAGGACAATATACATAGATAATTACAATTAAGGACGTATTCTAACACTTTCATGCTCAATCCCTTTGAAGATGGGTTTTCTGTTCCTATCTCTTGATACTACATGACCTCTGTATTAAAATTGACTGTGATTTGAAAAGCGTATAGGTGCACTTTCTAAAAAAATGTATTGAAGGTGCCTTGCTTCTTTCAATGAAGCACATTGAGGCGCAAGCCTCATAGGAAATGCACCAGCCTTAGCACATGATGCCTTCTTTTAAAAATTATTTTATTATTTTTATTAAGAAACTACAAAATATTCTTAAAAGATGCTTATCTCCCCTCAATCTTCTACAAAAGCTCTAATTTCTTAAAATTTTTTCGTTGCCTTCCTATTCATGATACTCCTAGTGCATATGTTTTTTAATTTTTTGTATACAAAAAATGTATTTTCATTTTCCTATTGTGTGCCTCCGAAAAAAAAGAAAAAAGAAATTGTCCATCTTTGTGCTTAAGCCTCAAAAGACTATTGCACTATGTTTCTTTTTTAGGAAAAAAAAATGAAAGAATACACATACGTACAAAAAAAGAAAAGCCCACAAGAAGAAATTAGACTAAAAGAAGGAGCTTCAGTCAAGAGAAATCTAAGCAAATAATTACAAAAGAGCCTAGAAACAGAGACTCTAACAAGAGACTAGACATTACCAAATGAACTCTCACGCCTTCCGAAGATTTTGTTGTTTCTCTCACCCCAAAACCTCCACAAGGTTTTTCCGTGTGCCTCTTGCTTTAAAAGACACTAAATAGTAACCCTTTATATAATGATTAGGTAACTATAAGTTTTTGTGTGATTCTTGGCTGTCAGTTAATGGCTTAATACAAAGGGTAATAGAGTTGAACTGCTACATGCCCGCTTATTTGAGGTCCTTGTCCCCTTAGGGCTGTTCTTATCTTTCAGTTTCATGGAAATACATCACAGAAAACAGAGCCCTATGGAAGAGGGCGACAGCTTCAAAGTACTGCAGGACCATCTTTGATACGTAGGAACTTTTGGCATCCTCCCAAAATCACAAAGGCCCTTGGAAAGCCATTTTAAATACAAGAATTTGGTCCTATTGGTCCTATTAAAAGCTATATGGGTGCCTGGAAAAGTTCTTCCATACACTTCTGGCATGATAGATGGGGGCTCTATCAAATAACTGGAACACCCTACTTTTAATTTCCTTCATTTTGTGAGAACACTCTGTTGAAAAGAATGACCATCTCGCGACAAATCAGCAGAGCGTGCGTACGTTTACACTTGACAGAAAACAACCGTTCTTTGAATGACAAAGGAAGGATCGTCAAGGTTTTTGGAGATTTGATCTTGTTACACCACTTTCTTGGTGTAGATTGTACTTCCCTTTAGATCCTTGAAGTCGAAGGATTTTTTTTTTTGTAATTCTCCATGGCTCTGGATTTGGCCCCTTTTTTTTGGGTAATATTTCATACCGTCCAAGAAATATATATTAAATATTGATTTTCTTTTTCCAGTTCCCTGTGCTGGTAGTGACTTGTGGGTATGGTATAGGAGTTTCATCTCAAGTTGCCTTTTGGGATCCTAGTCTTTTTCTTGTTCACAATTTGTTCGTTTAGAGGATTGAGCATGATGGCCAAAAAGATTTACTATGTCAGTCATTTGAATTTTGAAGTACCATAGTGATTACTTAACGGTCTGCCTGCAAATTATAGATAACAGAATTTGATATATGATTCCATATTATGTCCAGTACTGGCCTCAAGCTTATGGAAATCAAACTACTATAGTTCATTCACTGTTTGTGTGTATGTTTCTGTCAGTATTTACATATCAAATATCAGATGCACAAATAAAAGAATTTGAACTTTGAATTGTAGGAATTCTCTTATGCAAAAATGTGAAGCTATTCACCTTTTTTTTAACTAAAAACTACTATTCTTATTGTTTATCGAGAAACAAAGCTATTAATTTTTTAATTATATCATTATTCAGTTGGATTATTTACTCCGACTAAGGTTCATTTTTTGTAGTTGCTGTATATTCACTTTGAAGCTTATACCCCAAAATGTAGTAACAATGTACCTTCAAGATGTTTCTATAGGAGTGTGAAAGTTTATCTAACTTCTTCAACAGGTTCTTTTTGGGTTGGAATATGCACATTCAATGGATGATATTAGACAAACACTATTAGATTCATTCCAGACAAGCGGTTTGGGGTCAGAATTAGAAGTTCATTTTGCGGATGTTAAATGTTCAAGTTTGGATCCGACCTCGCATCATGATAATGTTGCGAAAGGTGGAGAACAAGTTTTCAGTGCTGAAGGTGGGAATTCTGAGAACATGGCTGGAGCTAGACATCACATTGGAGGCTTGTTTTGGGAGTTACCTAGAGAGCGAAGAATGGAGGATTGCTCACTTTTCTGGATTGGTTCTGAAAATTCAGCCTTCGCAAATGTGGTTCTAACATTCAATGGCTGTGAGATAGGTGCGTGGATTTCAATGTTCTACATTGAGATTATCTTATTAGCGACAATGCTAAATATTATTCTGTTGCCATTTATTCTTATGACAACATGCTCTCTATCTGATGACCCTTGGAAGTGGCTCCAGCCTCCAGCAATCTTGTTCAACCTTAAATTTTAAACTCACAGCTTGACTTAACTTCATTGTTATTGAATTTAGTCAGATATGATGCAAAAGAAAGTTGCTTAGTTACTGATGTATCTAAGCAGAGAAGGATCCTCAAGCGCAGGTAACTTTGGCAATGCACATTCATTATTTATATTTGTACACTTTTAACTGATTTTTGTCGATCTATTTGTAAGACAGCATGCCTTCTTTTGTTGCAATTACATTCTCCCACTATACAGTGCATTTCTTTTTGTTTTCCTTTTTGGGACAAAAACTGGATACAACAATCATCTTTTGTTTTATTTATTTTATCTTATTATGTTTCAAGCAGATATTACTTGGTGGAGAAGGCAAAGGATGCTGGCATTGTTGGAATTTTGGTTGGTACTCTTGGTTTAGGTATGTATGACGATGAAAGGAGGTTTTTGGGCATTGTACTCAGATGGTCAATGTTTGGACTATAATGTCGTATAACATATTATGGGATCAAAGACTAAATTTCTTAGCCCCATTATCTGTAATTGGAAAGTTCTTTTGTTTCTTTTAGTTTTTTCCATCTCAATTTGTGTTTCTTCCTTTTATTTATTTTAGTAAGAAACAAATAAAATATATGGAAGAAGGTGAATGCCCCATTCAAGACAGTTTCAAGAGGTCTCTCAGTATGTTTTCAGCCATAAAAGTAGAATATTCACAGAAAGGTTTAGAAAGATTGGTCCAAGAGGGGGACATGTCATCCCAAACTTCAATCAGGGAAGTACCTTTTATTTTTCCTCCCAACCAAACTTTACGAGAGGTGTTCATGATACCGTTTATTCAGAGAGTAGGCTATAAGAGGAAAAGGGAGAGAGAAATTGAAGTGAGGGAATAGTTGGGAATTGTTGTGGAAAGATTCAGACCTCACGAAAGTTTGAAGAGTTTTTATATCCTTTTCTTTCCACGTTGATTGTGAACTCATTGGCAATTGCTGCTGGAAGCTGCTATTATGCTATACATCGACGATCAAACTGTTCTCAGGAGATCTTTCCTGTTATGATATTTGATTAAAAAGTATTTGTGACTTATGTGGTGCTCGTTCCTCCACTTATTTTGGAAGTGTTAATTTTTAGATTAGACTGTTAGATTGTTGAGCATTGAAGTCGTAATATTTTGGATATATGCCATCATCTGAATCTGTTTACACTTTACAGTTTATTGGATTTTGCTTTGTTGAGGTCCCAGAGGAATATTATTCTACCCTACTGTTTTGTGGATGTTAGATCACGTCTTAATTTGGTGATGCTTTATGCAGCTGGTTACCTCCATATCATTCATCAGATGAAAGAGCTGATCACTGGAGCTGGGAAGAAGGCTTATACACTTGTTATGGGAAAACCCAATCCTGCAAAACTTGCCAACTTCCCGGAGGTATTTATCCGATTCATATGACTTAATTTGGGTTTTTTTCCATTGCTTTACTGGTAGTCTAATTCCTTAGCATTTATTCCAGTGTGGTGTATTCATATATGTTTCTTGCGCCCAAACTGCACTTATGGATAGCAAAGAGTATCTTGCTCCAGTTATTACTCCATTTGAAGCCACACTAGCTTTCAGCAGGTACCTCTAATGTCTGTTTCAATTATCTTCACTTTGTTGAACATCATACCGTTTGAGCTGTCTTAGTTCTGTAAACTCAGGACCAAAGTAGATTCAAGTCTTCATGAAGTGAAGGTGGAAACGTACTTCTATTTTTTATTTTCTTTTGGGTTTTCTTCCAATCATTATAGTAATATCTATATGGCCTTCTTTCGATGACATTCCATGAAATTGTCCAGCTGTTCAAGCATGTATAAGAAATATGCTAAGCTCACCTGCACTAGAAGACGTTATAATAAGGGGATGGAATTGCTTGGTGGATTTGGTTGGGAGCTGTCGGTGGGTCAGAATAGAGACTACGAAGTGTAGTTGTTGAAGAATTCTGATCTTTTATTTTATTCTTTTTTTTTTTTTTTTCATGTGAATCCCTATTTGTTAATCTTGCTGTTTTTCAGCGAGATCATTACACAGCAATTTAAAGATTGTTTGGAGTGTTAACTGCCTAAATGATGAAGATTAAGAATTTCCTTTGGTCTCTGTTTTTGTGAGGGGTGGACACCTGGGATAAGATTTAGAAAAGACTGTGTGTGCCATGTCACCGTGGTGGTGCGTATGTAAGGGGGATAAGGAAGACGACAACTTGTTTTACTATCATTTAATTGCGAAGTATCCTCTCAGGTAATTGTCCTCACTATGACCCGCCATTATAGGTAGTAAATATGGTCTTGACCACTTGAGGTGAAACGGCTTGGTTCAGAGAGACAGTTGTTGAAGCCCATCTAATCTATCTTTCTTTTTCTTTCTTTTTTTATTTATTCTTTTCCTGTTTTGGTTCAGCTAGGTTGGTGGGATACCTTGATGCTTTTAAGGTTGTGAATGTGAACGCAGATTATAGAACATGCTGGGACCAGGCAATTGGCAGATTGCGAGCCCCTTTGTTGCTTAAATTTCTTAGCTTTCTACTTCACTAGTGTGGCAATATCAGATAGATAGCATACTTGCTAGTTGCCATTAACCCTTGAGCATGTAATTTGAAACTTTGAAGGAAGCTCAATGAGGTGGATAACCCCATGTCTAAAAAGGTTGAAATAGCCAATGCTGGGGCTCGGTGAGAAAAGGTGCCTGCTATAGATTTTTCCAGTCTGATTAGCAAAGAGAGATGATTTTTTATTTATTTATTTATTTATTTTAATTTGAAACAAAAAGTTGGAAGATGATTTGGGATTATGCAAGTGATACAAAAACTTGACCCTACTAGACTCGTTAACACTAGGAGTACCCAATTGGATTTCTCTCGGACTAAGATAATTTGAGGTGAAAGTATTTGAGCCTTCTAGGAAGCGTGTAAAATCAAGAGAGCGTATAGGTTGTGTTAACTTAATCTCACTGTTTAAAGACAATCCATCTTGAGGAAGAGAATAAGAGGTTCCGAAAGTATGATGTTGCTGCATCTGTTTGGAGAAAGAGCCATCGTTCGCTTTTTTCTTTGAAGCTTGTCTTGGGTGTTGAATTTTCTTTATATCTGTTTTCTGTGCTTGATTTGATGTTAGCAGGAAATTTTGCATTAAATCATTTTCAAGTTTGTCATTGCCAAATTTAATCCTCTTATATTTCTTATATTGAATTTTTGTATCATTGATAGCCAAACTCTCTTCTTTCTGTAAAACTACAACTCCAAAATCCTATTGTATGAATATGCTGTTCTATTTTCCGATAAACTGGTGAACCTATGCAAACATCTAAAGTTGACTGGAGGAGAGATGCCTATGATTATTTCAATATACGATCAAACGGAGAAAAAACTATTTCCTTCAGATGGAAAATTTATAGTGGATATCTTAGCTCCTATGTCTAAATTTCATTGAGATCTTCATTGAAGAATGGCAATTTTAATTGTCTAGAGGAAGTCAATGGACGGGAGCGTATGTTATGGAATTTCAAGATTTGATTGATTTCTTAACACCTAAAGAAGGAAACCAATCAGATGAAGCTCGGTATTCTTTCTTGAAAGGTGGATATGTTGAAGACTGGGATTCCCAAGGTAATGACTCTTCAAGTTGTGCTTCGTTTTTCTGCTGCAAGTGCTTTTTTTTTTTTTAGATAAAATAAAATAAAATCAAATCAAACATAAAGAGATTGATCATTACTTTTTTTTCCCCTCCCCCAGAGTACTTTCATGTGAGAAACTTATTCAGGTGGTTGAAGTCAACAGTCTTATACAACAGGAAGGAGTTTTGGATTTTTTGTATATATTTTGTAGATGTCAACACTGGAGAACATTGATGCTTTCTAAATTATCAAAACCAAAAAAAAATTACATTCTGGTAAAATAGACTGGAAATCCAGAAGCCATCTTAAAACTCATTTCATTATTTTTACTGAAATACAGAATTACTGGTTACATGGCTTTTAGTTACAATGTGTATGACTTCACTGCATTAACTTGAAGCATCACAGTTCATACTGTCAAATTGCTCTCTCAAACCTTTTATCTGTTTTTCCAAGCAGAAAATACTGAGGAAGAAAATGGAGCTACTGCTCTAGTAACTGCAACAGAGAAGGCTCTCCAGTTGCGAAATAATCGAAACTCGCTTATTGAAGGAACTGCCAGATCTGGAGCAGAATTTTTTGCAGCTCGTTCCTTTCAAGGCTTAGATATATACCATGGCAGTTCCGAGCCAGAGCCATATGTGATAGGGAGGAGTGGCAGGGCGTCGGGGTATCAAGACGAGAAAAACAGGTAA

mRNA sequence

ATGGAGTTGGAATCCTACTACGAAATTTCCCGTACCGCCGACTTCATTCACACCCGGAACTTCACCAGAGTTGCATTGCAGTTCCCTGATGAGCTCTTGAAGGATTCAACTAGAGTGGTGAGGGCTTTAAAGGATAGGCTTCGTGTGCTTGATGAATCTGATACAAATCACAATGACAGCAAAAATGAGGTTAGGTTGTTCATAATGGCTGATACAACTTATGGTAGTTGTTGTGTTGATGAGGTTGGAGCAGCTCACGCTAATGCTGATTGTGTCATACACTATGGACATACCTGTCTGAGCCCGACAACAACTCTTCCAGCACTTTTTGTTTTTGGAAAGGATTCAATCGATGTATCCACTTGTGCCAAAACTTTATCTCAGTACAGTTTGTCCAATGGAAAGCCTGTTTTGGTTCTTTTTGGGTTGGAATATGCACATTCAATGGATGATATTAGACAAACACTATTAGATTCATTCCAGACAAGCGGTTTGGGGTCAGAATTAGAAGTTCATTTTGCGGATGTTAAATGTTCAAGTTTGGATCCGACCTCGCATCATGATAATGTTGCGAAAGGTGGAGAACAAGTTTTCAGTGCTGAAGGTGGGAATTCTGAGAACATGGCTGGAGCTAGACATCACATTGGAGGCTTGTTTTGGGAGTTACCTAGAGAGCGAAGAATGGAGGATTGCTCACTTTTCTGGATTGGTTCTGAAAATTCAGCCTTCGCAAATGTGGTTCTAACATTCAATGGCTGTGAGATAGTCAGATATGATGCAAAAGAAAGTTGCTTAGTTACTGATGTATCTAAGCAGAGAAGGATCCTCAAGCGCAGCAGATATTACTTGGTGGAGAAGGCAAAGGATGCTGGCATTGTTGGAATTTTGGTTGGTACTCTTGGTTTAGGTATGTATGACGATGAAAGGAGTTTATTGGATTTTGCTTTGTTGAGGTCCCAGAGGAATATTATTCTACCCTACTGTTTTGTGGATGTTAGATCACGTCTTAATTTGGTGATGCTTTATGCAGCTGGTTACCTCCATATCATTCATCAGATGAAAGAGCTGATCACTGGAGCTGGGAAGAAGGCTTATACACTTGTTATGGGAAAACCCAATCCTGCAAAACTTGCCAACTTCCCGGAGTGTGGTGTATTCATATATGTTTCTTGCGCCCAAACTGCACTTATGGATAGCAAAGAGTATCTTGCTCCAGTTATTACTCCATTTGAAGCCACACTAGCTTTCAGCAGAGGAAGTCAATGGACGGGAGCGTATGTTATGGAATTTCAAGATTTGATTGATTTCTTAACACCTAAAGAAGGAAACCAATCAGATGAAGCTCGGTATTCTTTCTTGAAAGGTGGATATGTTGAAGACTGGGATTCCCAAGAAAATACTGAGGAAGAAAATGGAGCTACTGCTCTAGTAACTGCAACAGAGAAGGCTCTCCAGTTGCGAAATAATCGAAACTCGCTTATTGAAGGAACTGCCAGATCTGGAGCAGAATTTTTTGCAGCTCGTTCCTTTCAAGGCTTAGATATATACCATGGCAGTTCCGAGCCAGAGCCATATGTGATAGGGAGGAGTGGCAGGGCGTCGGGGTATCAAGACGAGAAAAACAGGTAA

Coding sequence (CDS)

ATGGAGTTGGAATCCTACTACGAAATTTCCCGTACCGCCGACTTCATTCACACCCGGAACTTCACCAGAGTTGCATTGCAGTTCCCTGATGAGCTCTTGAAGGATTCAACTAGAGTGGTGAGGGCTTTAAAGGATAGGCTTCGTGTGCTTGATGAATCTGATACAAATCACAATGACAGCAAAAATGAGGTTAGGTTGTTCATAATGGCTGATACAACTTATGGTAGTTGTTGTGTTGATGAGGTTGGAGCAGCTCACGCTAATGCTGATTGTGTCATACACTATGGACATACCTGTCTGAGCCCGACAACAACTCTTCCAGCACTTTTTGTTTTTGGAAAGGATTCAATCGATGTATCCACTTGTGCCAAAACTTTATCTCAGTACAGTTTGTCCAATGGAAAGCCTGTTTTGGTTCTTTTTGGGTTGGAATATGCACATTCAATGGATGATATTAGACAAACACTATTAGATTCATTCCAGACAAGCGGTTTGGGGTCAGAATTAGAAGTTCATTTTGCGGATGTTAAATGTTCAAGTTTGGATCCGACCTCGCATCATGATAATGTTGCGAAAGGTGGAGAACAAGTTTTCAGTGCTGAAGGTGGGAATTCTGAGAACATGGCTGGAGCTAGACATCACATTGGAGGCTTGTTTTGGGAGTTACCTAGAGAGCGAAGAATGGAGGATTGCTCACTTTTCTGGATTGGTTCTGAAAATTCAGCCTTCGCAAATGTGGTTCTAACATTCAATGGCTGTGAGATAGTCAGATATGATGCAAAAGAAAGTTGCTTAGTTACTGATGTATCTAAGCAGAGAAGGATCCTCAAGCGCAGCAGATATTACTTGGTGGAGAAGGCAAAGGATGCTGGCATTGTTGGAATTTTGGTTGGTACTCTTGGTTTAGGTATGTATGACGATGAAAGGAGTTTATTGGATTTTGCTTTGTTGAGGTCCCAGAGGAATATTATTCTACCCTACTGTTTTGTGGATGTTAGATCACGTCTTAATTTGGTGATGCTTTATGCAGCTGGTTACCTCCATATCATTCATCAGATGAAAGAGCTGATCACTGGAGCTGGGAAGAAGGCTTATACACTTGTTATGGGAAAACCCAATCCTGCAAAACTTGCCAACTTCCCGGAGTGTGGTGTATTCATATATGTTTCTTGCGCCCAAACTGCACTTATGGATAGCAAAGAGTATCTTGCTCCAGTTATTACTCCATTTGAAGCCACACTAGCTTTCAGCAGAGGAAGTCAATGGACGGGAGCGTATGTTATGGAATTTCAAGATTTGATTGATTTCTTAACACCTAAAGAAGGAAACCAATCAGATGAAGCTCGGTATTCTTTCTTGAAAGGTGGATATGTTGAAGACTGGGATTCCCAAGAAAATACTGAGGAAGAAAATGGAGCTACTGCTCTAGTAACTGCAACAGAGAAGGCTCTCCAGTTGCGAAATAATCGAAACTCGCTTATTGAAGGAACTGCCAGATCTGGAGCAGAATTTTTTGCAGCTCGTTCCTTTCAAGGCTTAGATATATACCATGGCAGTTCCGAGCCAGAGCCATATGTGATAGGGAGGAGTGGCAGGGCGTCGGGGTATCAAGACGAGAAAAACAGGTAA

Protein sequence

MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDSKNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVSTCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSSLDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSENSAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTLGLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGAGKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGSQWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTATEKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEKNR
Homology
BLAST of HG10002100 vs. NCBI nr
Match: XP_038877054.1 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 isoform X2 [Benincasa hispida])

HSP 1 Score: 925.6 bits (2391), Expect = 1.9e-265
Identity = 470/542 (86.72%), Postives = 488/542 (90.04%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           MELESYYEISRTADFIH+RNFTRVALQFPDELLKDSTRVVRALKDRLRVL ESDTNH+D 
Sbjct: 1   MELESYYEISRTADFIHSRNFTRVALQFPDELLKDSTRVVRALKDRLRVLHESDTNHSDI 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           +NEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS
Sbjct: 61  ENEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLSNGKPVLVLFGLEYAHSM DIRQTLLDSFQTS LG ELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMFDIRQTLLDSFQTSSLGPELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+SHH+NV KGGEQV + E G+SEN+AGARHHIGGLFWELPRERRMEDCSLFWIGSEN
Sbjct: 181 LDPSSHHENVVKGGEQVANDESGSSENIAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKR RYYLVEKAKDAGIVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKR-RYYLVEKAKDAGIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           +WTGAY+MEFQDL+DF TPKEGNQSDEARYSFL+GGYVEDW+SQENTEEENGA ALV+AT
Sbjct: 421 RWTGAYIMEFQDLVDFSTPKEGNQSDEARYSFLQGGYVEDWNSQENTEEENGARALVSAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           EK+LQLR+NRNSLIEGTARSGAEFFAARSFQGLDI +GSSEPEPYVIGRSG+ASGYQDEK
Sbjct: 481 EKSLQLRDNRNSLIEGTARSGAEFFAARSFQGLDINNGSSEPEPYVIGRSGKASGYQDEK 500

Query: 541 NR 543
           NR
Sbjct: 541 NR 500

BLAST of HG10002100 vs. NCBI nr
Match: KAA0044552.1 (diphthamide biosynthesis protein 2 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 910.6 bits (2352), Expect = 6.4e-261
Identity = 462/542 (85.24%), Postives = 481/542 (88.75%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           KNEVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV+
Sbjct: 61  KNEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVT 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMFDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ + +NV KGGEQ  +AE G+SEN+AGARHHIGGLFWELP+ERRMEDCSLFWIGS+N
Sbjct: 181 LDPSPYQENVVKGGEQASNAESGSSENIAGARHHIGGLFWELPKERRMEDCSLFWIGSDN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKRSRYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKRSRYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTG YVMEFQDLID  TPKE NQSDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGVYVMEFQDLIDCSTPKEANQSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           +KALQLR+NRN LIEGTARSGAEFFA+RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 QKALQLRDNRNLLIEGTARSGAEFFASRSFQGLDINNGSFEPEPYVIGRSGKASGYQDEK 501

Query: 541 NR 543
           NR
Sbjct: 541 NR 501

BLAST of HG10002100 vs. NCBI nr
Match: XP_038877053.1 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 isoform X1 [Benincasa hispida])

HSP 1 Score: 908.3 bits (2346), Expect = 3.2e-260
Identity = 470/576 (81.60%), Postives = 488/576 (84.72%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQ--------------------------------- 60
           MELESYYEISRTADFIH+RNFTRVALQ                                 
Sbjct: 1   MELESYYEISRTADFIHSRNFTRVALQVKFPNTGGRNYSHFFLCSLLTNDRKFSCVLRNL 60

Query: 61  -FPDELLKDSTRVVRALKDRLRVLDESDTNHNDSKNEVRLFIMADTTYGSCCVDEVGAAH 120
            FPDELLKDSTRVVRALKDRLRVL ESDTNH+D +NEVRLFIMADTTYGSCCVDEVGAAH
Sbjct: 61  LFPDELLKDSTRVVRALKDRLRVLHESDTNHSDIENEVRLFIMADTTYGSCCVDEVGAAH 120

Query: 121 ANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVSTCAKTLSQYSLSNGKPVLVLFGLEYA 180
           ANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVSTCAKTLSQYSLSNGKPVLVLFGLEYA
Sbjct: 121 ANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVSTCAKTLSQYSLSNGKPVLVLFGLEYA 180

Query: 181 HSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSSLDPTSHHDNVAKGGEQVFSAEGGNSE 240
           HSM DIRQTLLDSFQTS LG ELEVHFADVKCSSLDP+SHH+NV KGGEQV + E G+SE
Sbjct: 181 HSMFDIRQTLLDSFQTSSLGPELEVHFADVKCSSLDPSSHHENVVKGGEQVANDESGSSE 240

Query: 241 NMAGARHHIGGLFWELPRERRMEDCSLFWIGSENSAFANVVLTFNGCEIVRYDAKESCLV 300
           N+AGARHHIGGLFWELPRERRMEDCSLFWIGSENSAFANVVLTFNGCEIVRYDAKESCLV
Sbjct: 241 NIAGARHHIGGLFWELPRERRMEDCSLFWIGSENSAFANVVLTFNGCEIVRYDAKESCLV 300

Query: 301 TDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTLGLGMYDDERSLLDFALLRSQRNIILP 360
           TDVSKQRRILKR RYYLVEKAKDAGIVGILVGTLGL                        
Sbjct: 301 TDVSKQRRILKR-RYYLVEKAKDAGIVGILVGTLGL------------------------ 360

Query: 361 YCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGAGKKAYTLVMGKPNPAKLANFPECGVF 420
                            AGYLHIIHQMKELITGAGKKAYTLVMGKPNPAKLANFPECGVF
Sbjct: 361 -----------------AGYLHIIHQMKELITGAGKKAYTLVMGKPNPAKLANFPECGVF 420

Query: 421 IYVSCAQTALMDSKEYLAPVITPFEATLAFSRGSQWTGAYVMEFQDLIDFLTPKEGNQSD 480
           IYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS+WTGAY+MEFQDL+DF TPKEGNQSD
Sbjct: 421 IYVSCAQTALMDSKEYLAPVITPFEATLAFSRGSRWTGAYIMEFQDLVDFSTPKEGNQSD 480

Query: 481 EARYSFLKGGYVEDWDSQENTEEENGATALVTATEKALQLRNNRNSLIEGTARSGAEFFA 540
           EARYSFL+GGYVEDW+SQENTEEENGA ALV+ATEK+LQLR+NRNSLIEGTARSGAEFFA
Sbjct: 481 EARYSFLQGGYVEDWNSQENTEEENGARALVSATEKSLQLRDNRNSLIEGTARSGAEFFA 534

Query: 541 ARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEKNR 543
           ARSFQGLDI +GSSEPEPYVIGRSG+ASGYQDEKNR
Sbjct: 541 ARSFQGLDINNGSSEPEPYVIGRSGKASGYQDEKNR 534

BLAST of HG10002100 vs. NCBI nr
Match: XP_004152127.1 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 [Cucumis sativus] >KGN53024.1 hypothetical protein Csa_014391 [Cucumis sativus])

HSP 1 Score: 907.5 bits (2344), Expect = 5.4e-260
Identity = 462/542 (85.24%), Postives = 482/542 (88.93%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           K+EVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV 
Sbjct: 61  KSEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVP 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMLDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ HH+NV KGGEQ  +AE G+SE++AGARHHIGGLFWELP+ERR++DCSLFWIGSEN
Sbjct: 181 LDPSPHHENVVKGGEQASNAESGSSESIAGARHHIGGLFWELPKERRVKDCSLFWIGSEN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKR RYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKR-RYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTGAYVMEFQDLIDF TP+E N+SDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGAYVMEFQDLIDFSTPEEANRSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           EKALQLR+NRNSLIEGTARSGAEFFA RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 EKALQLRDNRNSLIEGTARSGAEFFATRSFQGLDINNGSLEPEPYVIGRSGKASGYQDEK 500

Query: 541 NR 543
           NR
Sbjct: 541 NR 500

BLAST of HG10002100 vs. NCBI nr
Match: XP_008454038.1 (PREDICTED: diphthamide biosynthesis protein 2 isoform X1 [Cucumis melo])

HSP 1 Score: 904.4 bits (2336), Expect = 4.6e-259
Identity = 461/542 (85.06%), Postives = 480/542 (88.56%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           KNEVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV+
Sbjct: 61  KNEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVT 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMFDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ + +NV KGGEQ  +AE G+SEN+AGARHHIGGLFWELP+ERRMEDCSLFWIGS+N
Sbjct: 181 LDPSPYQENVVKGGEQASNAESGSSENIAGARHHIGGLFWELPKERRMEDCSLFWIGSDN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKR RYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKR-RYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTG YVMEFQDLID  TPKE NQSDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGVYVMEFQDLIDCSTPKEANQSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           +KALQLR+NRN LIEGTARSGAEFFA+RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 QKALQLRDNRNLLIEGTARSGAEFFASRSFQGLDINNGSFEPEPYVIGRSGKASGYQDEK 500

Query: 541 NR 543
           NR
Sbjct: 541 NR 500

BLAST of HG10002100 vs. ExPASy Swiss-Prot
Match: A4QN59 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Danio rerio OX=7955 GN=dph2 PE=2 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.6e-71
Identity = 180/547 (32.91%), Postives = 265/547 (48.45%), Query Frame = 0

Query: 7   YEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDSKNEVRL 66
           Y+   T  FI + +F +VALQFPDELL D+ RV   ++D               K + + 
Sbjct: 33  YQTPETCRFITSNHFKKVALQFPDELLPDAVRVSAEIED---------------KTKAKT 92

Query: 67  FIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVSTCAKTL 126
           +I+ DT+YGSCCVDEV A H  ADC++HYG +CLSP   LP L+VFGK  IDV  CA + 
Sbjct: 93  YILGDTSYGSCCVDEVAAEHVGADCIVHYGSSCLSPCRRLPLLYVFGKRPIDVHQCASSF 152

Query: 127 SQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQ---TSGLGSELEVHFADVKCSSLDP 186
            +   +    ++VLF + Y+H++DD+R  L D +     S L ++       ++ S +D 
Sbjct: 153 KELYPNLQSHIIVLFDVTYSHAIDDLRTLLCDVYPNVVVSRLKTDHSCGAELIQDSCVDL 212

Query: 187 TSHHDNVA-KGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSENSA 246
            S+ D V  K G Q    EG                       + + D S+F+IG E   
Sbjct: 213 QSNDDGVIFKFGRQFRIKEG-----------------------QTVNDYSIFYIGQEGLT 272

Query: 247 FANVVLTFNGCEIVRYDAKESC-LVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTLG 306
             N ++++N C    ++ + S   V  V   + ++K  RYY +E+AKDA +VGILVGTLG
Sbjct: 273 LTNFMMSWNNCVFSSFNPETSTGRVESVQINKALMK--RYYAIERAKDASVVGILVGTLG 332

Query: 307 LGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGAG 366
           +                                         A YL II Q+K+ I  AG
Sbjct: 333 V-----------------------------------------ANYLIIIEQLKDTIQRAG 392

Query: 367 KKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGSQ 426
           KK+Y   MGK N  KLANF E  +++ V+C + +L+DS E+  PV+TPFE  LA ++  +
Sbjct: 393 KKSYMFAMGKINVPKLANFLEIDIYVLVACPENSLLDSSEFYRPVVTPFEMELACNKHRE 452

Query: 427 WTGAYVMEFQDL-------IDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGAT 486
           WTG YV +F++L       + F  P +    +E     L  G +    +  +    N  T
Sbjct: 453 WTGEYVTDFRELLPGGSSHVGFPEPSQSATEEETTDVSLITGALRSCSTNSSEMMHNSET 489

Query: 487 ALVTATEKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRAS 542
           +       +L LRN   +L      + A F A RS+QGL+   G +     V G+ G A 
Sbjct: 513 S-------SLVLRN--QTLTVANTNAAASFLAGRSWQGLEPKLGQTPVVKAVKGQRGIAI 489

BLAST of HG10002100 vs. ExPASy Swiss-Prot
Match: Q5ZKI2 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Gallus gallus OX=9031 GN=DPH2 PE=2 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 5.2e-64
Identity = 165/542 (30.44%), Postives = 245/542 (45.20%), Query Frame = 0

Query: 4   ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDSKNE 63
           + +YE+ R A F+    F +VALQFPD LL D+  V   +++                  
Sbjct: 29  DEFYEVDRAAAFVRDGGFRKVALQFPDALLADAAAVAARMEE---------------VTG 88

Query: 64  VRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVSTCA 123
             ++++ DTTYGSCCVDEV A H +A  V+HYG  CLSP   LP L VFG+  +DV  CA
Sbjct: 89  AEMYVLGDTTYGSCCVDEVAAEHVSAGAVVHYGPACLSPCRKLPVLHVFGRQPLDVGRCA 148

Query: 124 KTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSSLDP 183
           +   +        V+VL  + YAH+M ++ + L   +          + F++V C     
Sbjct: 149 EVFRELYPERQSRVVVLSDVVYAHAMGELEKQLCHEYP--------NIIFSEVVCGD--- 208

Query: 184 TSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSENSAF 243
                                S  + G     G  F  +     ++DCS+F++G+E  A 
Sbjct: 209 -------------------APSPTLPGEVRQFGRRF-HMEAAEELQDCSMFYVGAEGLAL 268

Query: 244 ANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTLGLG 303
            + +LT+N      +D        +     R L R R YLVE+A+DA +VGILVGTLG+ 
Sbjct: 269 TSFMLTWNRFPFSSFDPATGHGRRETLNVNRALMR-RLYLVERARDAHVVGILVGTLGV- 328

Query: 304 MYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGAGKK 363
                                                   AGYL ++  + +L+  AGK+
Sbjct: 329 ----------------------------------------AGYLDVLEHLHQLVRRAGKR 388

Query: 364 AYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGSQWT 423
           +YTL +GKPNPAKLANF E  +F+ V+CAQ +L+DS E+  P++TP+E  LA +   +WT
Sbjct: 389 SYTLSVGKPNPAKLANFLEVDIFVLVACAQNSLLDSSEFYRPIVTPYELELACNPAREWT 448

Query: 424 GAYVMEFQDLID------FLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALV 483
           G Y+ +F+DL+        L P           S + G        +          A  
Sbjct: 449 GNYLTDFRDLLPGACAHIELPPAVPAAEAIPDVSLITG--------EMRATHLCDPLAPQ 472

Query: 484 TATEKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQ 540
             +   L  R+   +L E      A F  +RS++GL+   G +     V GR G A  Y+
Sbjct: 509 PPSSTTLACRDQTRALAE--MSPAATFLESRSWRGLEQQLGKTAVSKAVQGRRGIAIAYE 472

BLAST of HG10002100 vs. ExPASy Swiss-Prot
Match: A7SKJ3 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Nematostella vectensis OX=45351 GN=dph2 PE=3 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 7.6e-63
Identity = 174/554 (31.41%), Postives = 267/554 (48.19%), Query Frame = 0

Query: 7   YEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDR--LRVLDESDTNHN------ 66
           YEI R+   I +  F +VALQFPD LL DS  V R ++ R   +    +DT++       
Sbjct: 44  YEIERSVQVITSSGFNKVALQFPDSLLADSASVARLIEQRASCKAFILADTSYGRHPPLP 103

Query: 67  DSKNEVRLF-------IMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFV 126
            +K  ++L+       ++A   + SCCVDE+ A HA+A+ +IHYG  CLS T  LP L+V
Sbjct: 104 ANKTFLKLYNVRICFKLLAFLHFSSCCVDEIAAEHADAELIIHYGQACLSQTKRLPVLYV 163

Query: 127 FGKDSIDVSTCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEV 186
           FGK+ I+V  C++   Q     G  VLV + + Y H +  + + L      S L   + +
Sbjct: 164 FGKNPINVIECSQHFRQLYPDTGIRVLVFYDVVYNHCIGALDEAL------SPLYPNMTI 223

Query: 187 HFADVKCSSLDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDC 246
                +    +PTS   +       V     G   N    R    G  + L     + D 
Sbjct: 224 STIAPEGLPSEPTSQQSSRNPQASDV----EGMEVNQCNRRF---GRDFTLAANSSISDY 283

Query: 247 SLFWIGSENSAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRI----LKRSRYYLVEKA 306
            +F+IG ++    N+++T+N C+   YD      +T+ S++  +        RY+++++A
Sbjct: 284 QIFYIGEQSLTLRNLMMTYNKCQFSTYDP-----ITNESRRETLNVNKALMKRYHMIQRA 343

Query: 307 KDAGIVGILVGTLGLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYL 366
           KDA IVGI+VGTLG+                                         A YL
Sbjct: 344 KDAQIVGIVVGTLGV-----------------------------------------ADYL 403

Query: 367 HIIHQMKELITGAGKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVI 426
            II ++K+++  AGKK+Y  VMGK N AKLANF E  VF+ VSC + +L+DSKE+  PV+
Sbjct: 404 KIIERLKKVLAIAGKKSYVFVMGKLNVAKLANFLEIDVFVLVSCPENSLIDSKEFYKPVV 463

Query: 427 TPFEATLAFSRGSQWTGAYVMEFQDLI--DFLTPKEGNQSDEARYSFLKGGYVEDWDSQE 486
           TP+E  +A  R  +WTG YV +F +L+    +     + +D++ Y  L  G ++      
Sbjct: 464 TPYEMEIACLRTQEWTGDYVTDFHELLPGKSINTFWTDDNDDSPYISLITGKMQHNYKSS 523

Query: 487 NTEEENGATALVTATEKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPY 540
             E    +T+LV         + N+ + +       AEF A+RS+QGL    G +     
Sbjct: 524 AKEAGETSTSLV---------QRNQETTLATQQPLTAEFLASRSWQGLQQNLGDTPVTTA 529

BLAST of HG10002100 vs. ExPASy Swiss-Prot
Match: Q10206 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=dph2 PE=3 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 6.4e-62
Identity = 159/552 (28.80%), Postives = 263/552 (47.64%), Query Frame = 0

Query: 2   ELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDSK 61
           +L   YEI+RT DFI + N+  VALQFPDE L DS +V   L + +              
Sbjct: 32  DLVEVYEINRTVDFIKSGNYNSVALQFPDEHLADSGKVASILTNLV-------------- 91

Query: 62  NEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVST 121
            E  + I+ADT YGSCCVDEV A H +AD ++HYG  CLSPT+ LP L+VFG+  I++  
Sbjct: 92  -EANVQILADTNYGSCCVDEVAAEHMSADAIVHYGRACLSPTSRLPVLYVFGRLPINLHK 151

Query: 122 CAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSSL 181
             K L   ++   + +L++    + ++ D I    L S +T G  +  E H  +     +
Sbjct: 152 LEKCL---TIPLDQNILLVSDTRWYYAQDSI----LKSLKTLGYQNVYESHLKE----RI 211

Query: 182 DPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSENS 241
           +P                    N E  A   + I G  + LP+   ++D +L +IG ++ 
Sbjct: 212 EP--------------------NLEE-ASTSYTIPGRTYSLPKSLSLQDMTLLYIGPDSP 271

Query: 242 AFANVVLTFNGC--EIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGT 301
             ++++++      + + +D   + +V + S     L+R RY LV++ +DAG++GI++GT
Sbjct: 272 TLSSILMSHYSLVNQFLSFDPLSNKIVEESSFTGAKLRR-RYALVQRCRDAGVIGIVIGT 331

Query: 302 LGLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITG 361
           LG+                                           YLH+++Q++++I  
Sbjct: 332 LGVHR-----------------------------------------YLHVLNQLRKMILN 391

Query: 362 AGKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRG 421
           AGKK Y L +GK NPAKLANF E   F+ ++C + +L+DSKE+  P++TPFE   A S  
Sbjct: 392 AGKKPYMLAVGKLNPAKLANFQEIECFVLIACGENSLIDSKEFYRPIVTPFELVKALSSD 451

Query: 422 SQWTGAYVMEFQDLIDFLTPKEG--------NQSDEARYSFLKGGYVEDWDSQE-NTEEE 481
             W   +++ F +++     K+          +S E  +S + G +V     +  +   E
Sbjct: 452 MSWNNDFILSFDEVLKLSEGKQSKEPSEVLTEESAEPHFSLITGKFVNSTPMRHLDVTLE 494

Query: 482 NGATALVTATEKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRS 541
                   ++  +++ R  R+  + G     A F  ++S+ GLD       P     G+S
Sbjct: 512 TADAKNNDSSSASIEKRGMRSLAVNGVYSPAAAFLQSKSWSGLDSVDEGEGPSKLYEGQS 494

Query: 542 GRASGYQDEKNR 543
           G A GY  E ++
Sbjct: 572 GIAKGYVGEGSK 494

BLAST of HG10002100 vs. ExPASy Swiss-Prot
Match: Q6DE00 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Xenopus laevis OX=8355 GN=dph2 PE=2 SV=1)

HSP 1 Score: 228.4 bits (581), Expect = 1.9e-58
Identity = 163/544 (29.96%), Postives = 255/544 (46.88%), Query Frame = 0

Query: 2   ELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDSK 61
           +L  +YEI +T +FI      +VALQFPD+LL DS +V R L++                
Sbjct: 32  DLGEFYEIEKTVEFIQRNAAQKVALQFPDDLLLDSVKVARKLEE---------------A 91

Query: 62  NEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVST 121
              + +I+ DT+YGSCCVDEV A H  A+ ++HYG  CLSP   LP  +VFG+ ++++  
Sbjct: 92  TGAKTYILGDTSYGSCCVDEVAAEHVKANVLVHYGRACLSPCCRLPVSYVFGRKAVNMDL 151

Query: 122 CAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSSL 181
           CA+    +       V+VL  + Y H++ ++ + +  ++          V F+  K +S 
Sbjct: 152 CAEAFLSHYRDTESHVVVLSDVVYDHALGELAKRIRSAYP--------NVIFS--KLTSC 211

Query: 182 DPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSENS 241
             T+  D + K G + FS +                  W        E   +F++G E S
Sbjct: 212 GETASPDEIVKFGRR-FSPD---------------LRLWP-------ESYGIFYVGGEGS 271

Query: 242 AFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTLG 301
              N++LT+  C    ++       T+     R L   R+YL+E+A+DA + GILVGTLG
Sbjct: 272 TLNNLMLTWPRCSFFSFNPFTGEGRTEGLHVNRAL-MIRFYLIERARDAHVFGILVGTLG 331

Query: 302 LGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGAG 361
           +                                         + YL  +  +K +I  AG
Sbjct: 332 V-----------------------------------------SDYLSALKHLKNIIHLAG 391

Query: 362 KKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGSQ 421
           KK+Y   +GK NPAKLANFPE  VF+ V+C + +L+DS E+  PV+TP E  +A +   +
Sbjct: 392 KKSYMFSVGKLNPAKLANFPEIDVFVLVACPENSLLDSSEFYKPVVTPDEMEIACNPARE 451

Query: 422 WTGAYVMEFQDLID----FLTPKEGNQSD--EARYSFLKGGYVEDWDSQENTEEENGATA 481
           W G  +  F++L+     ++   E + SD      S + G       +   T E++  T+
Sbjct: 452 WHGYCITNFRELLPGGSAYVEFPETDPSDAHHTDVSLITGNLRSSHLTVAETLEKDSDTS 475

Query: 482 LVTATEKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASG 540
           LV         RN++ +L +    S A + A+RS+QGLD   G +     V GR G A  
Sbjct: 512 LVQ--------RNSKTALAQ--MSSAASYLASRSWQGLDKALGQTPVVKAVEGRKGIAIA 475

BLAST of HG10002100 vs. ExPASy TrEMBL
Match: A0A5A7TMM8 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G003010 PE=3 SV=1)

HSP 1 Score: 910.6 bits (2352), Expect = 3.1e-261
Identity = 462/542 (85.24%), Postives = 481/542 (88.75%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           KNEVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV+
Sbjct: 61  KNEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVT 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMFDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ + +NV KGGEQ  +AE G+SEN+AGARHHIGGLFWELP+ERRMEDCSLFWIGS+N
Sbjct: 181 LDPSPYQENVVKGGEQASNAESGSSENIAGARHHIGGLFWELPKERRMEDCSLFWIGSDN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKRSRYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKRSRYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTG YVMEFQDLID  TPKE NQSDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGVYVMEFQDLIDCSTPKEANQSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           +KALQLR+NRN LIEGTARSGAEFFA+RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 QKALQLRDNRNLLIEGTARSGAEFFASRSFQGLDINNGSFEPEPYVIGRSGKASGYQDEK 501

Query: 541 NR 543
           NR
Sbjct: 541 NR 501

BLAST of HG10002100 vs. ExPASy TrEMBL
Match: A0A0A0KW05 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_4G011710 PE=3 SV=1)

HSP 1 Score: 907.5 bits (2344), Expect = 2.6e-260
Identity = 462/542 (85.24%), Postives = 482/542 (88.93%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           K+EVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV 
Sbjct: 61  KSEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVP 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMLDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ HH+NV KGGEQ  +AE G+SE++AGARHHIGGLFWELP+ERR++DCSLFWIGSEN
Sbjct: 181 LDPSPHHENVVKGGEQASNAESGSSESIAGARHHIGGLFWELPKERRVKDCSLFWIGSEN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKR RYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKR-RYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTGAYVMEFQDLIDF TP+E N+SDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGAYVMEFQDLIDFSTPEEANRSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           EKALQLR+NRNSLIEGTARSGAEFFA RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 EKALQLRDNRNSLIEGTARSGAEFFATRSFQGLDINNGSLEPEPYVIGRSGKASGYQDEK 500

Query: 541 NR 543
           NR
Sbjct: 541 NR 500

BLAST of HG10002100 vs. ExPASy TrEMBL
Match: A0A5D3D327 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold130G001750 PE=3 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 2.2e-259
Identity = 461/542 (85.06%), Postives = 480/542 (88.56%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           KNEVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV+
Sbjct: 61  KNEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVT 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMFDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ + +NV KGGEQ  +AE G+SEN+AGARHHIGGLFWELP+ERRMEDCSLFWIGS+N
Sbjct: 181 LDPSPYQENVVKGGEQASNAESGSSENIAGARHHIGGLFWELPKERRMEDCSLFWIGSDN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKR RYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKR-RYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTG YVMEFQDLID  TPKE NQSDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGVYVMEFQDLIDCSTPKEANQSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           +KALQLR+NRN LIEGTARSGAEFFA+RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 QKALQLRDNRNLLIEGTARSGAEFFASRSFQGLDINNGSFEPEPYVIGRSGKASGYQDEK 500

Query: 541 NR 543
           NR
Sbjct: 541 NR 500

BLAST of HG10002100 vs. ExPASy TrEMBL
Match: A0A1S3BYG9 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis melo OX=3656 GN=LOC103494584 PE=3 SV=1)

HSP 1 Score: 904.4 bits (2336), Expect = 2.2e-259
Identity = 461/542 (85.06%), Postives = 480/542 (88.56%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           ME ESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNH+DS
Sbjct: 1   MEFESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHSDS 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           KNEVRLF+MADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDV+
Sbjct: 61  KNEVRLFVMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVT 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLSQYSLS+GKPVLVLFGLEYAHSM DI+Q LLDSFQTS LGSELEVHFADVKCSS
Sbjct: 121 TCAKTLSQYSLSSGKPVLVLFGLEYAHSMFDIKQRLLDSFQTSSLGSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+ + +NV KGGEQ  +AE G+SEN+AGARHHIGGLFWELP+ERRMEDCSLFWIGS+N
Sbjct: 181 LDPSPYQENVVKGGEQASNAESGSSENIAGARHHIGGLFWELPKERRMEDCSLFWIGSDN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKES LVTDVSKQRRILKR RYYLVEKAKDA IVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESWLVTDVSKQRRILKR-RYYLVEKAKDAAIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS
Sbjct: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTG YVMEFQDLID  TPKE NQSDEARYSFL+GGYVEDWDSQENTEEENGA ALVTAT
Sbjct: 421 QWTGVYVMEFQDLIDCSTPKEANQSDEARYSFLQGGYVEDWDSQENTEEENGAHALVTAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           +KALQLR+NRN LIEGTARSGAEFFA+RSFQGLDI +GS EPEPYVIGRSG+ASGYQDEK
Sbjct: 481 QKALQLRDNRNLLIEGTARSGAEFFASRSFQGLDINNGSFEPEPYVIGRSGKASGYQDEK 500

Query: 541 NR 543
           NR
Sbjct: 541 NR 500

BLAST of HG10002100 vs. ExPASy TrEMBL
Match: A0A6J1EXA1 (2-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111439125 PE=3 SV=1)

HSP 1 Score: 864.8 bits (2233), Expect = 2.0e-247
Identity = 442/541 (81.70%), Postives = 471/541 (87.06%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           MELESYYEISRTAD+IHTRNFTRVALQFPDELLKDSTRVV AL+ RL  L  +DTN +D 
Sbjct: 1   MELESYYEISRTADYIHTRNFTRVALQFPDELLKDSTRVVSALRKRLCAL--NDTNQSDG 60

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
           +N+VRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPA FVFGKD IDVS
Sbjct: 61  ENKVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPAFFVFGKDPIDVS 120

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           TCAKTLS YS SNGKPVLVLFGLEYAHSMDDIRQ LLDSFQTSGL SELEVHFADVKCSS
Sbjct: 121 TCAKTLSHYSSSNGKPVLVLFGLEYAHSMDDIRQALLDSFQTSGLTSELEVHFADVKCSS 180

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGARHHIGGLFWELPRERRMEDCSLFWIGSEN 240
           LDP+SH++NVA+GGEQV +AE  +SEN+AG RHHIG LFW+L RERRMEDCS+FWIGSEN
Sbjct: 181 LDPSSHNENVARGGEQVTNAESESSENIAGTRHHIGSLFWDLHRERRMEDCSIFWIGSEN 240

Query: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGTL 300
           SAFANVVLTFNGCEIVRYDAKESCLVTDVS+QRRILKR RYYLVEKAKDAGIVGILVGTL
Sbjct: 241 SAFANVVLTFNGCEIVRYDAKESCLVTDVSQQRRILKR-RYYLVEKAKDAGIVGILVGTL 300

Query: 301 GLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITGA 360
           GL                                         AGYLHIIHQMKELITGA
Sbjct: 301 GL-----------------------------------------AGYLHIIHQMKELITGA 360

Query: 361 GKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRGS 420
           GKKAYTLVMG+PNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAF+RGS
Sbjct: 361 GKKAYTLVMGRPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFTRGS 420

Query: 421 QWTGAYVMEFQDLIDFLTPKEGNQSDEARYSFLKGGYVEDWDSQENTEEENGATALVTAT 480
           QWTGAY+MEFQDLI+F  P EGN+SDEARYSFL+G YVED+DSQEN EEEN A ALV+AT
Sbjct: 421 QWTGAYIMEFQDLINFPIPNEGNRSDEARYSFLQGRYVEDYDSQENVEEENEACALVSAT 480

Query: 481 EKALQLRNNRNSLIEGTARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSGRASGYQDEK 540
           EKALQ+R+NRNSLIEGTARSGAEFFAARSFQGLDI +GSS+PEPY+IGRSGRASGYQDEK
Sbjct: 481 EKALQIRDNRNSLIEGTARSGAEFFAARSFQGLDIQNGSSQPEPYLIGRSGRASGYQDEK 497

Query: 541 N 542
           N
Sbjct: 541 N 497

BLAST of HG10002100 vs. TAIR 10
Match: AT3G59630.1 (diphthamide synthesis DPH2 family protein )

HSP 1 Score: 535.4 bits (1378), Expect = 5.2e-152
Identity = 293/548 (53.47%), Postives = 380/548 (69.34%), Query Frame = 0

Query: 1   MELESYYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDS 60
           +E ES YEI+RTA+FI +++FTR+ALQFPDELLKDST+VV ALK + R+L +        
Sbjct: 3   LEFESKYEINRTAEFIISKSFTRIALQFPDELLKDSTKVVSALKSKTRLLTD-------- 62

Query: 61  KNEVRLFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSPTTTLPALFVFGKDSIDVS 120
             EVR F+MADTTYGSCC+DEVGA H +++CV+HYG TCLSPT+ LPA FVFGK SI+VS
Sbjct: 63  -REVRFFVMADTTYGSCCIDEVGALHIDSECVVHYGQTCLSPTSVLPAFFVFGKASINVS 122

Query: 121 TCAKTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTLLDSFQTSGLGSELEVHFADVKCSS 180
           +C K L  Y+  + KP+++L+GLEYAH +  IR+ L      S   S+L V  A+V CS 
Sbjct: 123 SCVKHLIDYASKSDKPIMILYGLEYAHVIPQIREEL----GLSKTDSQLSV--ANVLCSF 182

Query: 181 LDPTSHHDNVAKGGEQVFSAEGGNSENMAGAR-HHIGGLFWELPRERRMEDCSLFWIGSE 240
           + P+       +       +E  +S++++ +R + +GGL W+LP   ++ED  LFWIGS+
Sbjct: 183 ISPSKDPRESMEHPRPY--SESDSSDSLSSSRSYRLGGLTWDLPEGSKIEDYLLFWIGSD 242

Query: 241 NSAFANVVLTFNGCEIVRYDAKESCLVTDVSKQRRILKRSRYYLVEKAKDAGIVGILVGT 300
           +SAFANVVLTFNGC+IVRYDA+E  LVT+  +QRRILKR RYYLVEKAKDA I+GILVGT
Sbjct: 243 SSAFANVVLTFNGCDIVRYDAEEDSLVTEFYQQRRILKR-RYYLVEKAKDANIIGILVGT 302

Query: 301 LGLGMYDDERSLLDFALLRSQRNIILPYCFVDVRSRLNLVMLYAAGYLHIIHQMKELITG 360
           LG+                                         AGYLH+IH M+ LI+ 
Sbjct: 303 LGV-----------------------------------------AGYLHMIHHMQALISA 362

Query: 361 AGKKAYTLVMGKPNPAKLANFPECGVFIYVSCAQTALMDSKEYLAPVITPFEATLAFSRG 420
           AGKK+Y L MG+PNPAKLANFPEC VFIY+SCAQTAL+DSKE+++PVITPFEA LAFSRG
Sbjct: 363 AGKKSYILAMGRPNPAKLANFPECDVFIYISCAQTALLDSKEFMSPVITPFEANLAFSRG 422

Query: 421 SQWTGAYVMEFQDLIDFLTPKEGNQ--SDEARYSFLKGGYVEDW---DSQENTEEENGAT 480
           S+WTGAY+M FQD+I+ +  +      S+E R+SF +GGYVED    D  +N EE+ G T
Sbjct: 423 SEWTGAYLMHFQDVINSVKSESEAHIGSEEPRFSFFQGGYVEDHKTNDQAKNGEEDTGET 482

Query: 481 -ALVTATEKALQLR-NNRNSLIEGT-ARSGAEFFAARSFQGLDIYHGSSEPEPYVIGRSG 540
             LV A EKALQLR N+ NSL + T A+SG E+F  R ++GL+I   ++ PEPY++GRSG
Sbjct: 483 MTLVQAAEKALQLRGNDHNSLTKQTAAKSGPEYFLNRVYRGLEINSENTLPEPYIVGRSG 491

BLAST of HG10002100 vs. TAIR 10
Match: AT5G62030.1 (diphthamide synthesis DPH2 family protein )

HSP 1 Score: 75.5 bits (184), Expect = 1.5e-13
Identity = 46/153 (30.07%), Postives = 75/153 (49.02%), Query Frame = 0

Query: 6   YYEISRTADFIHTRNFTRVALQFPDELLKDSTRVVRALKDRLRVLDESDTNHNDSKNEVR 65
           ++E+ +    I T N  R+ALQ P+ LL             +  L  SD           
Sbjct: 48  HFEVHKCVWRIKTSNAKRIALQLPEGLL-------------MYALTLSDI-FTSFAGASH 107

Query: 66  LFIMADTTYGSCCVDEVGAAHANADCVIHYGHTCLSP--TTTLPALFVFGKDSIDVSTCA 125
            F++ D TYG+CCVD+  A    AD +IHYGH+CL P  +T +P L+VF +  IDV    
Sbjct: 108 CFVLGDVTYGACCVDDFSACALGADLLIHYGHSCLVPIDSTKIPCLYVFVEIQIDVKCLL 167

Query: 126 KTLSQYSLSNGKPVLVLFGLEYAHSMDDIRQTL 157
            T+     S+ K +++   +++  ++  ++  L
Sbjct: 168 NTIHLNLASDVKNIILAGTIQFTSAIRAVKPEL 186

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038877054.11.9e-26586.722-(3-amino-3-carboxypropyl)histidine synthase subunit 2 isoform X2 [Benincasa hi... [more]
KAA0044552.16.4e-26185.24diphthamide biosynthesis protein 2 isoform X1 [Cucumis melo var. makuwa][more]
XP_038877053.13.2e-26081.602-(3-amino-3-carboxypropyl)histidine synthase subunit 2 isoform X1 [Benincasa hi... [more]
XP_004152127.15.4e-26085.242-(3-amino-3-carboxypropyl)histidine synthase subunit 2 [Cucumis sativus] >KGN53... [more]
XP_008454038.14.6e-25985.06PREDICTED: diphthamide biosynthesis protein 2 isoform X1 [Cucumis melo][more]
Match NameE-valueIdentityDescription
A4QN592.6e-7132.912-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Danio rerio OX=7955 G... [more]
Q5ZKI25.2e-6430.442-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Gallus gallus OX=9031... [more]
A7SKJ37.6e-6331.412-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Nematostella vectensi... [more]
Q102066.4e-6228.802-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Schizosaccharomyces p... [more]
Q6DE001.9e-5829.962-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Xenopus laevis OX=835... [more]
Match NameE-valueIdentityDescription
A0A5A7TMM83.1e-26185.242-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis melo var. mak... [more]
A0A0A0KW052.6e-26085.242-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis sativus OX=36... [more]
A0A5D3D3272.2e-25985.062-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis melo var. mak... [more]
A0A1S3BYG92.2e-25985.062-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucumis melo OX=3656 ... [more]
A0A6J1EXA12.0e-24781.702-(3-amino-3-carboxypropyl)histidine synthase subunit 2 OS=Cucurbita moschata OX... [more]
Match NameE-valueIdentityDescription
AT3G59630.15.2e-15253.47diphthamide synthesis DPH2 family protein [more]
AT5G62030.11.5e-1330.07diphthamide synthesis DPH2 family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR042263Diphthamide synthesis DPH1/DPH2, domain 1GENE3D3.40.50.11840Diphthamide synthesis DPH1/DPH2 domain 1coord: 9..116
e-value: 5.1E-34
score: 118.2
IPR016435Diphthamide synthesis DPH1/DPH2TIGRFAMTIGR00322TIGR00322coord: 341..416
e-value: 2.1E-26
score: 91.0
coord: 6..305
e-value: 1.3E-70
score: 236.3
IPR016435Diphthamide synthesis DPH1/DPH2PFAMPF01866Diphthamide_syncoord: 28..302
e-value: 1.3E-48
score: 166.0
coord: 341..426
e-value: 1.3E-23
score: 83.9
IPR016435Diphthamide synthesis DPH1/DPH2SFLDSFLDS00032Radical_SAM_3-amino-3-carboxycoord: 2..540
e-value: 0.0
score: 411.9
IPR016435Diphthamide synthesis DPH1/DPH2PANTHERPTHR10762DIPHTHAMIDE BIOSYNTHESIS PROTEINcoord: 1..305
coord: 343..541
IPR042265Diphthamide synthesis DPH1/DPH2, domain 3GENE3D3.40.50.11860Diphthamide synthesis DPH1/DPH2 domain 3coord: 274..416
e-value: 3.9E-28
score: 99.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 519..542
IPR010014Diphthamide synthesis DHP2PANTHERPTHR10762:SF22-(3-AMINO-3-CARBOXYPROPYL)HISTIDINE SYNTHASE SUBUNIT 2coord: 1..305
coord: 343..541

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10002100.1HG10002100.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0017183 peptidyl-diphthamide biosynthetic process from peptidyl-histidine
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0090560 2-(3-amino-3-carboxypropyl)histidine synthase activity