HG10002324 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10002324
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: dentin sialophosphoprotein isoform X1
Location: Chr11: 5566195 .. 5580084 (-)
RNA-Seq Expression: HG10002324
Synteny: HG10002324
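The location value above can be split into its parts programmatically. The following is a minimal sketch, not part of the database's own tooling: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr11: 5566195 .. 5580084 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.search(r"(\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr11: 5566195 .. 5580084 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 13890 bp on the minus strand of Chr11.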
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATTAAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTAAGTTTTCTACAATACTTGTTCTTTAGTTAGTCATTCACATCATGTGGTAATGAAGATTTTTATGCTCTTGTGTATTTTTATTCTGTAATGTGCGAGTGTGGTATTTTTTGGATTATTTGAACTTGTGTTAAGTACTTCAACTATGCTGTCGGATATGTGCACAAGTAGATTGTACAGATTATGCTTTTGTTCACTATTTTTGGATTTCAATGGGGGCAGTTACTTTGACACTAAGTTGATAGGTTTTATTTGCTTTTGACACTAAGTTGATGGGTTTTATTTGCTTTGTGCAACTGCTACTTCATTACATATATTCTGGTCTTTATGGTCCTTATTCAGTGGAATTTGTACTCAATTTTAGTATTGCTTAATTATTTATTCTTTTGTAACATTATTGTGTTTACATTTTTTATTGGGTACCAGTTAGCTGTATGCTGAATTGGTGAATACGAATACTGTAAGGTCAAGGATTAGATTACCTAATTGAGTAGCGACATGGAAAACCGTGTAAATAAGTATTTGTTCGACAAATATAAGTAGGAATACTATAATCAGATTATATCATCAGTGCTTTTACATGCCTTTGATGCATATATGGTTTTGTAGAAGGGTATACTGGAGTTCGTGCCCCCCTGAAAATTGGGGCATGGCCATTAAAACTTTTTCAAGCTGTTGATCTACACTCGATAATACTTTCTGCAGTTATGGCCTAAAATTTGAGCTTATTTTTCTCTCTCATTTCCGTAGACCTCATAGTTTAATGATTCAAGGAAGTGCTGCCACTGTTGAGGCCTTGGTCTTAGTCTTACAATGTGCACAACATGGTTATAAAGATTATGCAATTTATGATGGATGTGGAGTAAATTTTAGTAGGAAGTAAGTAAGAGAGACAAGTTGAAAGGGTTAACTCCAATTCTCTCTCTGCTTATTGGAGAGCTTTATCTTGATAAAGTTACTATCTCAAGGATCTGATTAAATTTCCTATCGATTGACCTGGATCATATTCTCTGGTGTTGTGAGTTCGTGAGAACTGTGTGGGATTTTTTCTTTCAAACGTTCAGGTTCTCGCTTGCTCGGTGTAGGGTTTCTAGTGATATGGTCGGGGATTTCCTCCTCTATTTGCATTTTCGTGAGAAGGAGCAGTTCTTATGGATGGTCGGGGTGTGTGCTGTTCTTTGGGTTATGTGGGAGGAGCGGAGTCTCCTTTTTAGAGGGAATTACTTTCTCTTTTGGTGGGCTTGGTTTTTGTTTGCCCTATATTACTTCATTTTTAATGAAAGTTGTTTCTATAAAAAAAAAAGGAGTTATTTATTTATTTTACTCTTATATTTTAAAATATTACTTTAGGCTGATAAATTCTTAGTGTAATTTGAAAAGATGTCCAGTAGGATGGCTAGTATCCCTGAAATTTTAAAGCATGATACCAATAGGAGGTAGACTAAGTGGCCTATTGTGATTCTAGTAGGAAAACATACGGAGGGGTGGGCAGGCTTTCTACAATTATATAAAAAGGCTAACAACCTGGAATTTCCCTACCATCAGTGGTTCTTTTAATCAAGTACTCCATAATTTGTGGGTGCCCAAATACTAGATGTCATATGAACAAGTTAAAAGTAGTCGGATTTAGGAAA
AAATAAAAATATATTTTCCTCTAAATTCTTCCAAAATCTCCAGACCAAGAAAGAGAAAATGTTGATTTTTGCTGATGGTCTGGATTGGGATTGATAGAGAAGGAAAGGCCTTTCTTGATAAAGAATAGGTTCTAAAGATTACGAAAAATCAGGGATGAAATAGTTGGGTCCAAATGACTTTACCATTTGAAATCTTAAAAAATTGTTGGAATGCCATCAAAAGTGAACTCATGGTAGTGTTCCAAGGTTCTTGTGGGAGTCGGGTGTTTCAAATGAATTGTGGAGTCTTGAACGCTAGAGTTTTGAGTATTAAGAGCATTGCATCTTTCCAATCCTCATGGAGGTTGGGATCTCCATTGACGTCTGATATTCCAAACCTTCCTAGATTTGGAAATATGAGGTTGCTTCTTCAGGATTTCATTTTGTCACTTCATTAGAACAGAATGGATGGACCTATTAGGATGGCTAGGGCCTAGGCAAGGGTTGGTGCTGAAATAGAAAAATTGTAAGGCCTTGTTTTGATTGCATATGCTAGGTTTCGGTTATGTTGTTGAGCCCTGTAAGATCTACTCTTTTTTCTTGATATCCCGATTCTTCTGTTTGTGTGAAAATGGGGAGGAGTACATCCTGCACATTATGTACAAAGTGCTATCCTGATTTCACTGTTTTTCAGTTTATCAATGAATTTGTGGTACAAAAATGTCTCTCTTTTGAGCAAACTTAAAAATCATAGTTTAGGGATAAATTGTATGTCCACGAACTTTCAGGGACCAAAAGTGATTATTAACCTTTAAATTATTCTCCACTCCTGAAGCTTTTTCATGTGCATTTTCTTCCTATTGATAAGAAGGGTCTAAAAGTCTCGAGGAACTGCCCCTATTTCACCGCCCCCTACTGTGTTAAAGTTATAATATGCACATGAATCTAACATATTCTCATTTCTCATCATATCAATGTTTCCTGTCAGTCAATGCTATAATCGTCCATGCACCTTAAGATCTAATTTCTGCATCTATGTTTGAAGCTATTTTATGAGGTCTTGGCGATTTAAGATTGTTGTAATTGAACTCAAATTTGACATCAGGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGGTATGCAATCCTGTATTGAGGAGTTTCTTGATTCTCTACTTTTGGCAATTAGCACATGTGACCTAACTGCATTCTGCCCTTTTCAATTTATAGATTATCTAGACTTTTATTCATTTTTATATTTTTATAAAACTAAGTGGGGCTAGTTTATGTGCTTATCTTAATTCTCACACACTTGATAGTTTTGGCACCATTACATGAACTATAATTTGTAACTGCAAAGAAAATACGTTAGCAATATTAGCACTAACATATATGTAATTTTACCTAGTAAGTCTATTAGAGTTGTGTAAATTGAGGAGATTTCTTAGTGTATTTTGGGAGACACCAAACCTCTCAAAGAACTTGGGAGCTTGTACTTCTTTTGTAAATTTTAATGCAAATCAAATTGTGTCTTTCAACTTCTTATGAAATTGGTGTACGCTTTAAACTGACAAACCTCGTCAATAGGCCTGGTCCTATCCGAAATTATTCCCCAAAGATGTCCTCCCCGAAGTTCCTATTCACTCTATTAATACAACAACTTAGCTCTAAGAAACTTACCAAATAAGCCAATAACATATCCTTACTAATATTCTTAACCAAATGGTGATAAGATTTTATTGAGACATTTTTAAAAAAAAAAATTTAAAAATTTTTTTTTTTTTTAAAAAAATCATTGGTGTGTTAACAATAAGGTACAAAAGCCG
AACAAAGTCAAGGAGTTTGAGAAAAGCTTGTTGATTAGCTATCATCATAAAATTAAGAATATTACAAAAAGATTAGAATGGAGGCTCAGGTTCCCCTCTCAGTTTGCAAGGTTTAAGGCCTTAGGCTGTCACCTTTTTGTCTGGTTAATATATCTATTGTTTCTTAAAAAAAAAAAAGGAAAAAGAAAAAATGGTTTAAGGCTGGGTGTGGAAAATATTTGCTTTTTGGAAGATAGTTGAATGATCAGGTTTGATTCCTTGATTATTTATGCTGTTCTGTGCACAAAAGAAGCCTCAATTTCTGATTGTGAGGATAGTAGCAATCAAGCTTGGGACTTGGGTCTTATTAGAGGTGTTTTTGATAGAGAGCTTCACTTGGGTGACCCTCATGGAAAAGATTAGCTCGTTTCATCTAGGGAGGGAGTGGGTTGAATTTGTGGGTCTGGAAGATTAGGGGAGATATTCTAGTAAGTTGGCCTTTCAAAACATCCCCACAAAACCCGCCTTTAGGGGAGAACAATAGGGTTTTTAGAGGTATGGAGAGGGCCTCTAGTGAGATTTGGGCCCTTGTTCGTTATCATGTTTCCTTGTGCGCTTCGATTTCGAAGATTTTTTGTAACTATTCTATTGGTGTTATTTTGCATAGTAGGACTTCCTTTCTTTTGAGGGTGCTCCCCTTTTTGTAGGTTGGTTTTTTGGATGCCTGTGTATTCTTTCATTTTTCTCGATGAAAGTGGTTATTTTTATATTTAAAAAAAAACCCCATAAAATCCACTAACACCAAGGTTAACATTTCCCTGACTAGCAAGGGATCAAACCGAATCGAACTGAAAAATTTAAAAATGTATATACGAAACCGAACTGAACCGATTTATAGATTCGAACCGAACCTAATTGCAAATGTGGTTCGGTTCGATTTTTAGTTTTGCTCTTATATATATATATTGGTTTTTTGTTTTGTTTCCTTTTCTAAAACCAAAGAGAAAGAGAAAGAGATTATAGTTTGGGTTTGAAGCATGTAAAACAAAAGTAAAGGCAAATGAGACAATGAGCAATATGTAATGAAAGTTAAATTTCAATGCCGAACTATTAATTAATTACAAATCCCATTTCCCCACTCTTTTCTTTTCAAACATTAAGAAGCCGCTCTCACCTAGATCTACCATTTTCCTTAAGAATTAGGCTTCAAATCAAATGGAATAGCATGTTAATATAATGGTGCAAGCGACAAAGACCTCTTGGTCTGACGCTGACCTTCAGAGAGCCGAGAGAGAGAGAGAGACGAGACGAGAGATGGAGGGAGATGAGGGATCATGAGAGAGAGACTGAGAGAGGGAGACGAAGTGAAGCCGAGAGAGAGGGAGGGAGAAGAAGGTCGAACATTGAAGGATGGCTGATGGAGGGTCGCACGTTGAAGGATGGTCGAAGGTCGACTGAGAGACGATACAGGGGCTGGCGACAGAGCACTAAGAGAGGAAGAGGGCTGGGTGGTGACAATCTTTAAGTTGAATAACCCTAAGTTGCAACCCAATAAAATGGTTCGGTTTTAAATTTGGTTTTCGAAAATGAAATGAACCAACCGAATCATTTGGTTTCAAAATTCCTTTAAACCGAACTAAAACACAATTTTCAATTTCAATGATAATTCGGTTTTGTAAAAGGGTTCGGTTTTAGCGATTCAGTGTACACCCCTACTGATTAGTATGATTTGGAAGCACAAATGCCCCAAGAAAGTCAAGATCTTCCTAAAGCCTTTGGCTTTTAGGAGCCTTAACATGGATGATAGACTTCAAAGGAAGTTCAGGAATTGGTCTTTTTCCTTTGAGATGCAGGTTATGTTTGGATGGTGGGAACCGAGGAGAACCTTGAACACAAACACCTCTTCAACCATTGTGACTTTGCTTATAGATCGTGGAGTTACTTCTTTGGCATTTTGGGCATCTTTGCTTGGGCCTAGGAGGGTAGATGATTGGATTATGAAAGGCTTAAATGCTTGAAAT
TTGACGGAAAGAGCCAAAATTCTTGGGAGTTGTGCTTTTCGAGCTCTTTTGTGGCATATTTGGAACGAAAGGAATGTGAGGTCTTTCGAAGATAACTCTTTTTGTCTTAATTTATTTTGCGATCTTCTACAAAATACAGTGTCTTGGTGGATATCTCTATGTTCAAAATTCTTTTGTAATTACAACCTCTAGATGATTATTAACGATTGGAAAGCTCTCTGTGTAGAGTTTGGGGAGGGTGTTCCCTTTGCCCCCGACCCTTAGGGTGTTCTTGGTTTTTTCTTTTTTTTTCTTTTTTTTTTATTTTTTTTTATAATTTTAAAAGTATTATTAATATTTTTATATCAATATATTCTCTCGTATCTTTAAAGAAAAAGAAAGGAAGCCCGCCCAAGAAGAGGCCGCAAACTTATTCCCACATGAATGAAATGCCTTTTAAAATCATTTTGTTTCTAATATAACATAACTACAGATGACTTTCAAGCTTTTTTTCTATTTAATTAACTTTCGACATGAAACTATAGCGTTTTCTTGTTGGCTAGTGGAATGATCTTAAATTAAAATTTTATTTGGGTTGTGGATGTTTCACTGTTCTTCTAAGCCTTGATTTAGTTTTATTTGTAAGTTTAATGATAATATTACAATCAACTTCTTTTTTAATATTCGTCCTCCCTCTCAAACCAAAATACAGAGGAAAATAAAGCTTTTGATGACAGTTTGTGAAACCTTTTACTTTTTTGAGTATATAACTTCTAAACTATAGAGAGATTTTGGTTTTTGTTATAGAGCTATCCGTGATTGTGTTTGGTACTATCTTTACACATTCATGCACATATTTTCTGTTTCTTGCTGCAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGGTATGCATTTTTTTAAGGATTTGTTTAGTGGTGACTGGATAATTTATGTTATTTTATCATAGGCATGAATAGGAGGTTGATTCATTCTTATATCTGGTAAAGTTGTCTTATCAGCCTGCTTTAATTACTATTTTCTTTTTCTTCATCTTTTTTTTTGTTGTTGTTCTTATTGTTGCTGTTACTGTTATCTGATTGTAATACTGTAAAAACTGAGCCTTCTATGTGGAGATCAAGCCTCTACTAATACCTTGCTTGCTTTGCTTGCAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGGTGATTGTCTGTTTACACATATTGAGTTGTTCATTTTCTTTTTGCAAGTTTTTATGCTTCGTTCTTCCTCTTCTGGCATAAAACTTTTTCTCCCTCTTTGAGTGTTTTCTCTCTCTGACATATACTGTTTCTCCTGATGGACATTAATGTCTATTAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTAGCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGTACACAAGTAAAACTTCTACATTAGGAACGTAGGATGTTAATTTTCATCTATCTTAATTGAAGTATTGTCTATTGATTCCTAATATGAAAATAATGTTTGATGTTTCCTGTTCATTGTATGTTTGTTCCCTTTAACAAATTTATCAGTTCTGTTCAAAACATATGGGTTTTACTATTGTCTACTTCTCTCTCAATGTTAACTCTTCTGTCACTCCTTCTCCCTTTCTTCCCCTCCCTCTTTCTTGTAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCT
GTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGTTAGTGTTTATTCCTTCCTTTTTTTCCCCTCTTTATATATATATATAGACAGATAGATAAGAAACAATGTAGAATTGAACAAAAAGAGAAAGAATAGCCTAGGGGTCGGGGGTAGAGAGAACTTCACCCGAAAGAACTATAAGAAAGCTTTCTCGTTATTCACAATTGTGGGAAGGTTGTGGTTACACAAGAATTTCCTGAATATTATTTAGTAAATCTCTTACTACTCCCTACATATGTTGTCTTGAATATTACTTCAATTAAAAAGATCCAACAAATGAACTATCAATTGAAACAAGGAGAAAATGACAGGCGTGAATTGAACTCTGTTCTTTGAACTCATAATAAATTGACTGCTCCTGAAATCTTCCGATGTTGTTGTAGTTTTGAATTCTGATATCTTCCATGCATGCTGAAGTTATTGTTCTTATATTCTTGTTTGGTCTTGGCTATTAGATTGCAACCTACCAAGCTCCGGGGAGATATTGTTTGAAATCAGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGGTACTTCAACATTTAAAGGGCTGATTTCATGTGGTTGACTTGAACAGTTATTTGCAAAATCTTAAATCAGGTTGTTTGTTCCTGTTACCATTTCTGATGTTGTTGGTCAGACAGCGTCAACTGTCTCATTTTTTGAGAATATTACATTATTACATACTGGATCCAAACCTACAACCTATAGAGGGAGTAGAAATGCTTCAAAAGACCCTCACATGTCTCATTTTTTTGAGAATATTACATTACATACTGGATTCGAAATTATAATCTCTTAGAGGAAGTAGAAATGCTTCAACACAACCTTTAATGTCTCATTTTTTTTAGAATATTACATTACATGCTGGATTCAGACTTATAACCTCTTAGAGGAAGTAGAAATGCCTCAACTGTCTTGTAGTCATTGTTTGGTTTTATTGATTATCAATAAACTATATTTTATGAGAAAAGAAAGCACTAAGCTTACTGCAAGGATTTATAATAAACAAAAATGACTTCATGCAAACATAATGTTTAATGGTTAAGACATTTCTACCTCCTCTAAGAGATTGCAGGTTCAAACCTCCACGTGTTTGTAATATTGTCTATACAAAAAATAATAGAAAAACAAAATAACTTCATGAAAAGAGTAATTACAGTGGAGATTGGATCAAGGATCAAACCAAAAGAAAGCTCAACCCAGTGATTTACTAGTCGGTGTATTTGATATATATGGATTGCTTGATTTTCATGTAGGAGTCTAATTCATGTCTTGTACTTTTACAGCTCTCCTTTGATCAGCCATCACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATT
GATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAATTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGTGGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGTGTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAGAATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTTAACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGAAGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATCCAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTAAGTTTAATCTATAACTCATTCAAGTTCTTTTACTTTTGTTATTAATGTCTGTGATAACTGATGAAAAGATATACTTATTATTAGGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGTAATAGAACTTTTCATTAATGTCCGATCCTTATCCATTCCTACACATGTTGCAGTCTGCCTGACTTAATATCCGGTTATAATGTTTAAAGAAGTTTGATGATTTATTTGTGTGTTCTGGTATGTTGTTCCTGTAAAGAAAACATTATTAAGTTTTTTTAGCATCGATTGATATATGTAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAGAATCCTATCGGCTGTGTTC
AACGGTATGGTCACCTATCTTAGTATCTTTACTTCAAGTTTGCAGTTGATAGTATTAGTAGTTTCTCACTTTCCAGTGTATGTCTATGGGACTCTGGATTAATTTGACATTAGTAATTTCCCTTGCACAAAATCCCAATGTTTACTCTGTGGCAAAGCTCTGAGTTCTGATATTTCATTCAGCTAGTGCTCGTTGTAGTTTAGGTTTGTATGATCTTTTGGTTCATATTGAAAAATTGGTGTTCCAGAAGGGGGCAGAGGGAGAGAGGGAACTCGTGATTGGAAGTAAAAGGTTTAGGATTGTATAGGATTAAGGCTGAGAGTAAATTCAGAGGCAGCCAGGCATGGTTGACGGAATTGGTAGAGGCAATACTTTCTCTATCTCTGTTGATTGGATAACCTCATGTGTTCAGGTTATTTGAATCTGGATTAGATGGCTCACATGGCTTGTACTTGATGTAGGATTAATATTTTTTGTATGCCTTGATCTGTATGCACATCAATTCTTTTCCCCCTTGTTATACACAAATTTTCTTACATAATGAAGTTGACTCAAACTAAGATTAATTTGCAGAGGCACAAGAGGTTGAAAAAAATATTCATTGTACTCCACGAAGAGCTGAAGGTGATTAACATTAACTAACTTATCTACATTACCTTCTTGCTTTCTTGTAGTTCTAATTTTTATCTCTAGTTCGATTTTCTTTATATGTACTATATTGTTGCAGCATATAAAGGAAAGGATTAGAGATTTTGCACGAACTTGTACAAAAGATTAAGACTAGATGTGCCTCTCTTCGAGCAGTCGAACGTCCTTTAAACAACCATTTTCAACTCAAGGTAGGTTACTAAAAGATCGAGGTGGATGCAAGAACTGCATCAATGAATTCCCGTGTATAGAAAATTAGTCAATTTCTTGCTACCTTCTTCCCAACCCCAATATAGGAACATGTAAATTTTTTTGCTCTCTCCCTCTCTTAAAAAAATGCATTTTCTTTTGTTTAGCTAAGTAGAGAGGTTGGACTTTTGTTTTAGTTTAGCCTGACCTCTTCCTGTATAGACTTGTGAGGCTTATTTTAAGTTTGGAAAATATAATCTATTTTATAGGTTGTACAAATTTTGAAGGTGTAAAAATGTTCCCTATAACTGTGAATATGCTAATGTTATTGGCAATCATGGGAAGTTTTCTTGGAGGGGAATTCTATAGTGAAGTAATGAGAAGTTTCTTCTCTGGGACAATGCAGGCTCTGTTGAATTGGTTAAATTATTACTGTGTAGGTTTTTACATCATTGATAGAGCACAATTTTTTCTTTAAATTAAAAATAATTTGAACCCTCCTTGAAAATGAAATGATAAATGGATGGTACTCCTTGTAAGATATAGAATTGATGACACGTTTAACTTGTGGAATAGATAATATATTCTAAATGGTAATAACATTAAACAGCTGAAATGTTTTCGAACAGTGGTGGCATATGTGGACATGTGACAACTGACAAATCGAATAGATGACACTATCGTATATTACCATCTTTTTCTAGGTAGAATCGATGACCATTTTTTTAAGTGAAAGGTAGCGATGATAAATTGGAGTATGTTAAGAGTTAAAAATCGTGTAAAATATTGAGGTTCTCATTCAATATCTATTTGATTCTTCAAGTTTGAAAATAAATGGTCTTAAATGTGTTTTGACTAACAGAAATATGACTTAGCAATGAGATTGGTGATTTATTTGCTAATTGAAAAATTATATTAATTTAATATAATCACTTAGTATTATGTGCTATCTTTCTCCTTCCTCTCTTTTCCTTTACCCTTCATTTTTTCTTCCAGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA

mRNA sequence

ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATTAAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTAGCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATCACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAA
GTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAATTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGTGGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGTGTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAGAATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTTAACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGAAGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATCCAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAGAATCCTATCGGCTGTGTTCAACGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA

Coding sequence (CDS)

ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATTAAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTAGCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATCACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAA
GTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAATTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGTGGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGTGTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAGAATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTTAACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGAAGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATCCAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAGAATCCTATCGGCTGTGTTCAACGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA

Protein sequence

MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQDRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDIRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDGQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCSTPKASSPTNLGKHDGGWRV
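The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, and the use of the standard nuclear code for this gene is an assumption. Only short fragments copied from the start and end of the CDS above are translated here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS above
print(translate("ATGTATGGCGGCTCATCCAAGCTC"))
# Last five codons of the CDS above, including the TGA stop
print(translate("GGATGGAGAGTTTGA"))
```

The two fragments translate to `MYGGSSKL` and `GWRV`, matching the first and last residues of the protein sequence listed above.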
Homology
BLAST of HG10002324 vs. NCBI nr
Match: XP_038883601.1 (dentin sialophosphoprotein isoform X1 [Benincasa hispida])

HSP 1 Score: 2007.6 bits (5200), Expect = 0.0e+00
Identity = 1083/1200 (90.25%), Positives = 1115/1200 (92.92%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG+SKLGRPG GAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGG  SV+NPRNRTTTA
Sbjct: 1    MYGGASKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGPVSVSNPRNRTTTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
            TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS
Sbjct: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
            NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240

Query: 241  ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
            ELSQ                   DRLSSSPIPSPPEQSGAPVS FGSANTTKTHVI EDI
Sbjct: 241  ELSQVGPPKSTYKPGISSLPASKDRLSSSPIPSPPEQSGAPVSHFGSANTTKTHVITEDI 300

Query: 301  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
            RPRLPAK+N+AASSEKEI TKAAKGVLETPGQEGNSGAK TDLQGMLYNLL ENPKGMSL
Sbjct: 301  RPRLPAKVNAAASSEKEISTKAAKGVLETPGQEGNSGAKTTDLQGMLYNLLLENPKGMSL 360

Query: 361  KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
            KALEKAVGDKIPNAVKKIEPIIKK++         + +  GVELEGSKKP+SEGESSPLI
Sbjct: 361  KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKS--GVELEGSKKPSSEGESSPLI 420

Query: 421  SHHQT-PVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
            SHHQ  PVHEDLPDQITAPELQLEAR GIELEEKVETSQANK+SNFLEKNG+QQHSPDLF
Sbjct: 421  SHHQNPPVHEDLPDQITAPELQLEARSGIELEEKVETSQANKKSNFLEKNGIQQHSPDLF 480

Query: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
            AEKKGSENSE QAASSSDNESDSDSESDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSES
Sbjct: 481  AEKKGSENSERQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSES 540

Query: 541  DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
            + PSNSKEGSDEDVDIMTSDDDKESKHKLQA  QGFSTSPAAWKSPDGGA QIIDDEKED
Sbjct: 541  EVPSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIIDDEKED 600

Query: 601  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
            GQ SDAIDIE DSSDDEPDAKIDD S L I EGGR VEEPRSFSPYPDEFQERQNFIGSL
Sbjct: 601  GQESDAIDIENDSSDDEPDAKIDDRSFLPI-EGGRLVEEPRSFSPYPDEFQERQNFIGSL 660

Query: 661  FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
            FEDR+NT++DS RHEQSDSTG+ISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVS  
Sbjct: 661  FEDRDNTVVDSGRHEQSDSTGQISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVS-- 720

Query: 721  WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
                         SK  RDSVRNPTSQVTNKGE+KGNSDFRPKKG+KETV EKNSSDVSQ
Sbjct: 721  -------------SKHGRDSVRNPTSQVTNKGEVKGNSDFRPKKGHKETVSEKNSSDVSQ 780

Query: 781  AGWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEG 840
            AGWRPHDQS GGVRAVDTAAR DKHGDIGRGTKHTEK GHANENFH+FKDTF+GNAENEG
Sbjct: 781  AGWRPHDQS-GGVRAVDTAARTDKHGDIGRGTKHTEKSGHANENFHMFKDTFHGNAENEG 840

Query: 841  TKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVS 900
            TKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNR+S
Sbjct: 841  TKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRIS 900

Query: 901  ANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSS 960
            ANRSPVNGKGR LQRELSDLELGELR+PFPEE+RGKKKFERNNSLKQLENKE+TTDIW S
Sbjct: 901  ANRSPVNGKGRILQRELSDLELGELRDPFPEESRGKKKFERNNSLKQLENKESTTDIWGS 960

Query: 961  DLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQ 1020
            DL++GKSNLK S+EYGKRS PHVSTKFPSNPEGSNKKK SEHIVEDSTRLN RSLQSHPQ
Sbjct: 961  DLSRGKSNLKTSLEYGKRSPPHVSTKFPSNPEGSNKKKTSEHIVEDSTRLNQRSLQSHPQ 1020

Query: 1021 YNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRL 1080
            YNSRVDHVEVDKS+ ANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRL
Sbjct: 1021 YNSRVDHVEVDKSIAANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRL 1080

Query: 1081 APNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYK 1140
            APNPITEVT+ALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQY+
Sbjct: 1081 APNPITEVTEALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYE 1140

Query: 1141 EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCS 1181
            EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCS
Sbjct: 1141 EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCS 1181
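The percentages in an HSP summary line follow directly from the match counts over the alignment length. As a rough sketch (the function name `hsp_fractions` is hypothetical, and the parsing assumes the `label = n/d` layout shown in the summary lines on this page), they can be recomputed like this:

```python
import re

def hsp_fractions(line):
    """Return (label, matches, alignment_length, percent) for each
    'label = n/d' field found in a BLAST HSP summary line."""
    return [
        (label, int(n), int(d), round(100 * int(n) / int(d), 2))
        for label, n, d in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    ]

summary = "Identity = 1083/1200 (90.25%), Positives = 1115/1200 (92.92%), Query Frame = 0"
for label, n, d, pct in hsp_fractions(summary):
    print(f"{label}: {n}/{d} = {pct}%")
```

For the Benincasa hispida hit this gives 90.25% identity and 92.92% positives over 1200 aligned columns, in agreement with the reported values. The denominator is the alignment length including gap columns, not the length of either protein.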

BLAST of HG10002324 vs. NCBI nr
Match: XP_008447590.1 (PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo])

HSP 1 Score: 1984.1 bits (5139), Expect = 0.0e+00
Identity = 1070/1200 (89.17%), Positives = 1097/1200 (91.42%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKLGRPG GAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTA
Sbjct: 1    MYGGPSKLGRPGGGAGRGHGGKRPHSSFPLPPSHRPSGRLSLGGGAAGSASNPRNRTTTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
            TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDA A NSS
Sbjct: 61   TTSEASQSTEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDAIANNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
            NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181  NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240

Query: 241  ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
            ELSQ                   DRLSSSPIP PPEQ G PVSQFGSANT KTHVIAEDI
Sbjct: 241  ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGGPVSQFGSANTNKTHVIAEDI 300

Query: 301  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
            RPR+PAKIN AAS+EKEI T A KGVLETPGQEGNSGAKPTDLQGMLYNLL ENPKGMSL
Sbjct: 301  RPRVPAKINPAASNEKEILTIAPKGVLETPGQEGNSGAKPTDLQGMLYNLLLENPKGMSL 360

Query: 361  KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
            KALEKAVGDKIPNAVKKIEPIIKK++         + +  GVELEGSKKPTSEGESSPL+
Sbjct: 361  KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKS--GVELEGSKKPTSEGESSPLV 420

Query: 421  SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
            SHHQT VHEDLPDQI APELQLEA  GI+LEEKVETSQANKESNFLEKNG+QQ  PD FA
Sbjct: 421  SHHQTSVHEDLPDQINAPELQLEAGCGIDLEEKVETSQANKESNFLEKNGIQQ--PDPFA 480

Query: 481  EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
            EKKGSENSEGQAASSSDN SDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481  EKKGSENSEGQAASSSDNVSDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540

Query: 541  APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
             PSNS+EGSDEDVDIMTSDDDKESKHKLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Sbjct: 541  GPSNSQEGSDEDVDIMTSDDDKESKHKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600

Query: 601  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
            Q  DAIDIEKDSSDDEPDAK+D  SLL  EE GRPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601  QEYDAIDIEKDSSDDEPDAKVDGRSLLPTEEVGRPVEEPRSFSPYPDEFQERQNFIGSLF 660

Query: 661  EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
            EDREN + DSARHEQSDSTGRISKGKSKRSSDLECLEEK+DHTKRLKSESLAQQPVSGNW
Sbjct: 661  EDRENNVADSARHEQSDSTGRISKGKSKRSSDLECLEEKADHTKRLKSESLAQQPVSGNW 720

Query: 721  GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
            GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDV QA
Sbjct: 721  GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVPQA 780

Query: 781  GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
            GWRPHDQS GGVRAVDTA RADKHGDIGRGTKH EK GHANENFHVFKDTFYGNA+NEGT
Sbjct: 781  GWRPHDQS-GGVRAVDTATRADKHGDIGRGTKHIEKSGHANENFHVFKDTFYGNADNEGT 840

Query: 841  KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
            KEKKVSKNSRSGGPGDK IQPFDSH SKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841  KEKKVSKNSRSGGPGDKHIQPFDSHQSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900

Query: 901  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
            NRSPVNGKGR LQRE SDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901  NRSPVNGKGRILQREPSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWGSD 960

Query: 961  LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
            LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEH+VEDS RLN+RSL SH QY
Sbjct: 961  LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHMVEDSNRLNNRSLLSHSQY 1020

Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
            NSR+DH EVDKSVD NV+PNQG GPEG  ESNRKASVGISQLND KREQLPSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSVDGNVRPNQGNGPEGYVESNRKASVGISQLNDTKREQLPSKKGSKRQA 1080

Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
            PNPITEVTD LKNP+SAE ENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPISAEHENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140

Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
            YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1195

BLAST of HG10002324 vs. NCBI nr
Match: XP_004146856.1 (dentin sialophosphoprotein isoform X1 [Cucumis sativus] >KGN59835.1 hypothetical protein Csa_002066 [Cucumis sativus])

HSP 1 Score: 1983.0 bits (5136), Expect = 0.0e+00
Identity = 1071/1200 (89.25%), Positives = 1097/1200 (91.42%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKLGRPG GAGRG  GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTA
Sbjct: 1    MYGGPSKLGRPGGGAGRGHAGKRPHSSFPLPPSHRPSGRLSLGGGAAGSVSNPRNRTTTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
            TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDANA NSS
Sbjct: 61   TTSEASQSAEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDANANNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
            NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181  NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240

Query: 241  ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
            ELSQ                   DRLSSSPIP PPEQ GAPVSQFGSANT+KTHVIAEDI
Sbjct: 241  ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDI 300

Query: 301  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
            RPR+PAKIN AAS+EKEIPT A KGVLETPGQEGNSG KPTDLQGMLYNLL ENPKGMSL
Sbjct: 301  RPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSL 360

Query: 361  KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
            KALEKAVGDKIPNAVKKIEPIIKK++        ++ +  GV LEGSKKPTSEGESSPLI
Sbjct: 361  KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKS--GVGLEGSKKPTSEGESSPLI 420

Query: 421  SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
            SHHQT VHEDLPDQ  APELQLEAR G++LEEKVETSQANKESNFLE NG+QQ  PD FA
Sbjct: 421  SHHQTSVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQ--PDPFA 480

Query: 481  EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
            EKK SENSEGQAASSSDNESDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481  EKKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540

Query: 541  APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
             PSNS+EGSD DVDIMTSDDDKESK KLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Sbjct: 541  GPSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600

Query: 601  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
            Q  DAIDIEKDSSDDEPDAKID  SLL  EEG RPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601  QEYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLF 660

Query: 661  EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
            EDREN ++DSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW
Sbjct: 661  EDRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720

Query: 721  GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
            GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDVSQA
Sbjct: 721  GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQA 780

Query: 781  GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
            GWRPHDQS  GVRAVDTA RADKHGDIGRGTKHTEK GHANENFHVFKDTFYGN +NEGT
Sbjct: 781  GWRPHDQS--GVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGT 840

Query: 841  KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
            KEKKVSKNSRSGGPGDKQIQP DSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841  KEKKVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900

Query: 901  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
            NRSPVNGKGR LQRE SDLELGELREPF EEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901  NRSPVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSD 960

Query: 961  LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
            LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDS R+N+RSL SH QY
Sbjct: 961  LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQY 1020

Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
            NSR+DH EVDKS D NVKPNQG GPEG  ESNRKASVGISQLND KREQ PSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQA 1080

Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
            PNPITEVTD LKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140

Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
            YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1194
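
The percentages in an HSP summary line appear to be the match counts divided by the alignment length (gaps included). The short sketch below recomputes them for the XP_004146856.1 hit above, using the counts 1071, 1097 and 1200 copied from its summary line. The helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP summary line for XP_004146856.1.
identity = percent(1071, 1200)   # identical residues / alignment length
positives = percent(1097, 1200)  # identical + conservative substitutions / alignment length

print(f"identity={identity:.2f}% positives={positives:.2f}%")
# prints: identity=89.25% positives=91.42%
```

The same calculation reproduces the figures reported for the other hits, for example 1022/1217 for XP_023524752.1.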

BLAST of HG10002324 vs. NCBI nr
Match: XP_023524752.1 (dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1875.1 bits (4856), Expect = 0.0e+00
Identity = 1022/1217 (83.98%), Positives = 1084/1217 (89.07%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A
Sbjct: 1    MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
              SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61   AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
            NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240

Query: 241  NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
            NELSQ                   +RLSSSP+PSPPEQSGAP+SQFGSAN TKTH  AED
Sbjct: 241  NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300

Query: 301  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
            I+PR PAKIN+AASSEK+IPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301  IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360

Query: 361  LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
            LKALEKAVGDKIPN+VKKIEPIIKK++         + +   VE+EGSKKP+SEGESSPL
Sbjct: 361  LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVEVEGSKKPSSEGESSPL 420

Query: 421  ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
            +SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421  VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480

Query: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
            AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540

Query: 541  DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
            DAPSNSKEGSDEDVDIMTSDDDKE K+KLQA VQGFS SPAAWKSPDGGA   IDDEKED
Sbjct: 541  DAPSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600

Query: 601  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
            G  SDAIDIEKDSSDDEP+AKIDD SL    EGGRPVEE RS SPYPDEFQERQNFIGSL
Sbjct: 601  GHESDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSL 660

Query: 661  FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
            FEDRENT++DSARHEQSDST R+SKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661  FEDRENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720

Query: 721  WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
            WG QLQS  NLSPSKLNRDS RNPTSQVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721  WGAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780

Query: 781  AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
            A WRPHDQS  GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781  ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840

Query: 841  GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
            G  EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ FSSSQMGYSPRDNNNR+
Sbjct: 841  GMNEKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRI 900

Query: 901  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
            SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901  SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960

Query: 961  SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSH- 1020
            S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR  QSH 
Sbjct: 961  SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020

Query: 1021 --PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
              PQY+SRVDHVEV+K VDANVKPNQGIGPE CGESNRKASVGISQL+DMKREQLPSKKG
Sbjct: 1021 QGPQYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKG 1080

Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
            SKR APN ITEVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140

Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1194
            SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESY 1200

BLAST of HG10002324 vs. NCBI nr
Match: XP_023524753.1 (dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1874.0 bits (4853), Expect = 0.0e+00
Identity = 1019/1205 (84.56%), Positives = 1079/1205 (89.54%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A
Sbjct: 1    MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
              SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61   AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
            NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240

Query: 241  NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
            NELSQ                   +RLSSSP+PSPPEQSGAP+SQFGSAN TKTH  AED
Sbjct: 241  NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300

Query: 301  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
            I+PR PAKIN+AASSEK+IPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301  IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360

Query: 361  LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
            LKALEKAVGDKIPN+VKKIEPIIKK++         + +   VE+EGSKKP+SEGESSPL
Sbjct: 361  LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVEVEGSKKPSSEGESSPL 420

Query: 421  ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
            +SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421  VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480

Query: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
            AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540

Query: 541  DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
            DAPSNSKEGSDEDVDIMTSDDDKE K+KLQA VQGFS SPAAWKSPDGGA   IDDEKED
Sbjct: 541  DAPSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600

Query: 601  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
            G  SDAIDIEKDSSDDEP+AKIDD SL    EGGRPVEE RS SPYPDEFQERQNFIGSL
Sbjct: 601  GHESDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSL 660

Query: 661  FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
            FEDRENT++DSARHEQSDST R+SKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661  FEDRENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720

Query: 721  WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
            WG QLQS  NLSPSKLNRDS RNPTSQVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721  WGAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780

Query: 781  AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
            A WRPHDQS  GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781  ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840

Query: 841  GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
            G  EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ FSSSQMGYSPRDNNNR+
Sbjct: 841  GMNEKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRI 900

Query: 901  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
            SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901  SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960

Query: 961  SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSH- 1020
            S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR  QSH 
Sbjct: 961  SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020

Query: 1021 --PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
              PQY+SRVDHVEV+K VDANVKPNQGIGPE CGESNRKASVGISQL+DMKREQLPSKKG
Sbjct: 1021 QGPQYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKG 1080

Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
            SKR APN ITEVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140

Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1182
            SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESY 1200

BLAST of HG10002324 vs. ExPASy TrEMBL
Match: A0A1S3BIQ1 (dentin sialophosphoprotein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490006 PE=4 SV=1)

HSP 1 Score: 1984.1 bits (5139), Expect = 0.0e+00
Identity = 1070/1200 (89.17%), Positives = 1097/1200 (91.42%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKLGRPG GAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTA
Sbjct: 1    MYGGPSKLGRPGGGAGRGHGGKRPHSSFPLPPSHRPSGRLSLGGGAAGSASNPRNRTTTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
            TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDA A NSS
Sbjct: 61   TTSEASQSTEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDAIANNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
            NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181  NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240

Query: 241  ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
            ELSQ                   DRLSSSPIP PPEQ G PVSQFGSANT KTHVIAEDI
Sbjct: 241  ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGGPVSQFGSANTNKTHVIAEDI 300

Query: 301  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
            RPR+PAKIN AAS+EKEI T A KGVLETPGQEGNSGAKPTDLQGMLYNLL ENPKGMSL
Sbjct: 301  RPRVPAKINPAASNEKEILTIAPKGVLETPGQEGNSGAKPTDLQGMLYNLLLENPKGMSL 360

Query: 361  KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
            KALEKAVGDKIPNAVKKIEPIIKK++         + +  GVELEGSKKPTSEGESSPL+
Sbjct: 361  KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKS--GVELEGSKKPTSEGESSPLV 420

Query: 421  SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
            SHHQT VHEDLPDQI APELQLEA  GI+LEEKVETSQANKESNFLEKNG+QQ  PD FA
Sbjct: 421  SHHQTSVHEDLPDQINAPELQLEAGCGIDLEEKVETSQANKESNFLEKNGIQQ--PDPFA 480

Query: 481  EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
            EKKGSENSEGQAASSSDN SDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481  EKKGSENSEGQAASSSDNVSDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540

Query: 541  APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
             PSNS+EGSDEDVDIMTSDDDKESKHKLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Sbjct: 541  GPSNSQEGSDEDVDIMTSDDDKESKHKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600

Query: 601  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
            Q  DAIDIEKDSSDDEPDAK+D  SLL  EE GRPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601  QEYDAIDIEKDSSDDEPDAKVDGRSLLPTEEVGRPVEEPRSFSPYPDEFQERQNFIGSLF 660

Query: 661  EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
            EDREN + DSARHEQSDSTGRISKGKSKRSSDLECLEEK+DHTKRLKSESLAQQPVSGNW
Sbjct: 661  EDRENNVADSARHEQSDSTGRISKGKSKRSSDLECLEEKADHTKRLKSESLAQQPVSGNW 720

Query: 721  GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
            GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDV QA
Sbjct: 721  GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVPQA 780

Query: 781  GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
            GWRPHDQS GGVRAVDTA RADKHGDIGRGTKH EK GHANENFHVFKDTFYGNA+NEGT
Sbjct: 781  GWRPHDQS-GGVRAVDTATRADKHGDIGRGTKHIEKSGHANENFHVFKDTFYGNADNEGT 840

Query: 841  KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
            KEKKVSKNSRSGGPGDK IQPFDSH SKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841  KEKKVSKNSRSGGPGDKHIQPFDSHQSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900

Query: 901  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
            NRSPVNGKGR LQRE SDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901  NRSPVNGKGRILQREPSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWGSD 960

Query: 961  LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
            LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEH+VEDS RLN+RSL SH QY
Sbjct: 961  LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHMVEDSNRLNNRSLLSHSQY 1020

Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
            NSR+DH EVDKSVD NV+PNQG GPEG  ESNRKASVGISQLND KREQLPSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSVDGNVRPNQGNGPEGYVESNRKASVGISQLNDTKREQLPSKKGSKRQA 1080

Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
            PNPITEVTD LKNP+SAE ENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPISAEHENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140

Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
            YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1195

BLAST of HG10002324 vs. ExPASy TrEMBL
Match: A0A0A0LCU6 (Occludin_ELL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G849910 PE=4 SV=1)

HSP 1 Score: 1983.0 bits (5136), Expect = 0.0e+00
Identity = 1071/1200 (89.25%), Positives = 1097/1200 (91.42%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKLGRPG GAGRG  GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTA
Sbjct: 1    MYGGPSKLGRPGGGAGRGHAGKRPHSSFPLPPSHRPSGRLSLGGGAAGSVSNPRNRTTTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
            TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDANA NSS
Sbjct: 61   TTSEASQSAEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDANANNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
            NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181  NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240

Query: 241  ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
            ELSQ                   DRLSSSPIP PPEQ GAPVSQFGSANT+KTHVIAEDI
Sbjct: 241  ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDI 300

Query: 301  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
            RPR+PAKIN AAS+EKEIPT A KGVLETPGQEGNSG KPTDLQGMLYNLL ENPKGMSL
Sbjct: 301  RPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSL 360

Query: 361  KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
            KALEKAVGDKIPNAVKKIEPIIKK++        ++ +  GV LEGSKKPTSEGESSPLI
Sbjct: 361  KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKS--GVGLEGSKKPTSEGESSPLI 420

Query: 421  SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
            SHHQT VHEDLPDQ  APELQLEAR G++LEEKVETSQANKESNFLE NG+QQ  PD FA
Sbjct: 421  SHHQTSVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQ--PDPFA 480

Query: 481  EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
            EKK SENSEGQAASSSDNESDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481  EKKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540

Query: 541  APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
             PSNS+EGSD DVDIMTSDDDKESK KLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Sbjct: 541  GPSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600

Query: 601  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
            Q  DAIDIEKDSSDDEPDAKID  SLL  EEG RPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601  QEYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLF 660

Query: 661  EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
            EDREN ++DSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW
Sbjct: 661  EDRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720

Query: 721  GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
            GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDVSQA
Sbjct: 721  GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQA 780

Query: 781  GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
            GWRPHDQS  GVRAVDTA RADKHGDIGRGTKHTEK GHANENFHVFKDTFYGN +NEGT
Sbjct: 781  GWRPHDQS--GVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGT 840

Query: 841  KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
            KEKKVSKNSRSGGPGDKQIQP DSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841  KEKKVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900

Query: 901  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
            NRSPVNGKGR LQRE SDLELGELREPF EEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901  NRSPVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSD 960

Query: 961  LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
            LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDS R+N+RSL SH QY
Sbjct: 961  LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQY 1020

Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
            NSR+DH EVDKS D NVKPNQG GPEG  ESNRKASVGISQLND KREQ PSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQA 1080

Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
            PNPITEVTD LKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140

Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
            YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1194

BLAST of HG10002324 vs. ExPASy TrEMBL
Match: A0A6J1K6B7 (dentin sialophosphoprotein isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)

HSP 1 Score: 1861.3 bits (4820), Expect = 0.0e+00
Identity = 1018/1217 (83.65%), Positives = 1079/1217 (88.66%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1    MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
              SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61   AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
            NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181  NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240

Query: 241  NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
            NELSQ                   +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Sbjct: 241  NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300

Query: 301  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
            I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301  IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360

Query: 361  LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
            LKALEKAVGDKIPN+VKKIEPIIKK++         + +   VELEGSKKP+SEGESSPL
Sbjct: 361  LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVELEGSKKPSSEGESSPL 420

Query: 421  ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
            +SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421  VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480

Query: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
            AEKKGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481  AEKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540

Query: 541  DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
            DAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA   IDDEKED
Sbjct: 541  DAPSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600

Query: 601  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
            G  SDAIDIEKDSSDDEP+AKIDD SL    EGGR VEE RS SPYPDEFQERQNFIGSL
Sbjct: 601  GHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSL 660

Query: 661  FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
            FEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661  FEDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720

Query: 721  WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
            WGVQLQS  NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721  WGVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780

Query: 781  AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
            A WRPHDQS  GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781  ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840

Query: 841  GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
               EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Sbjct: 841  EMNEKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRI 900

Query: 901  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
            SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901  SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960

Query: 961  SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHP 1020
            S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR  QSHP
Sbjct: 961  SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020

Query: 1021 ---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
               QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKG
Sbjct: 1021 QGSQYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKG 1080

Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
            SKR APN I EVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140

Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1194
            SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESY 1200

BLAST of HG10002324 vs. ExPASy TrEMBL
Match: A0A6J1KF98 (dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)

HSP 1 Score: 1861.3 bits (4820), Expect = 0.0e+00
Identity = 1018/1217 (83.65%), Positives = 1079/1217 (88.66%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1    MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
              SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61   AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
            NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181  NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240

Query: 241  NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
            NELSQ                   +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Sbjct: 241  NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300

Query: 301  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
            I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301  IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360

Query: 361  LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
            LKALEKAVGDKIPN+VKKIEPIIKK++         + +   VELEGSKKP+SEGESSPL
Sbjct: 361  LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVELEGSKKPSSEGESSPL 420

Query: 421  ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
            +SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421  VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480

Query: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
            AEKKGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481  AEKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540

Query: 541  DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
            DAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA   IDDEKED
Sbjct: 541  DAPSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600

Query: 601  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
            G  SDAIDIEKDSSDDEP+AKIDD SL    EGGR VEE RS SPYPDEFQERQNFIGSL
Sbjct: 601  GHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSL 660

Query: 661  FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
            FEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661  FEDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720

Query: 721  WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
            WGVQLQS  NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721  WGVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780

Query: 781  AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
            A WRPHDQS  GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781  ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840

Query: 841  GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
               EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Sbjct: 841  EMNEKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRI 900

Query: 901  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
            SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901  SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960

Query: 961  SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHP 1020
            S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR  QSHP
Sbjct: 961  SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020

Query: 1021 ---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
               QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKG
Sbjct: 1021 QGSQYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKG 1080

Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
            SKR APN I EVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140

Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1194
            SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESY 1200

BLAST of HG10002324 vs. ExPASy TrEMBL
Match: A0A6J1KCU5 (dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)

HSP 1 Score: 1860.1 bits (4817), Expect = 0.0e+00
Identity = 1015/1205 (84.23%), Positives = 1074/1205 (89.13%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
            MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1    MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60

Query: 61   TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
              SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61   AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120

Query: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
            GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121  GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180

Query: 181  NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
            NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181  NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240

Query: 241  NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
            NELSQ                   +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Sbjct: 241  NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300

Query: 301  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
            I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301  IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360

Query: 361  LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
            LKALEKAVGDKIPN+VKKIEPIIKK++         + +   VELEGSKKP+SEGESSPL
Sbjct: 361  LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVELEGSKKPSSEGESSPL 420

Query: 421  ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
            +SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421  VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480

Query: 481  AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
            AEKKGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481  AEKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540

Query: 541  DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
            DAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA   IDDEKED
Sbjct: 541  DAPSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600

Query: 601  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
            G  SDAIDIEKDSSDDEP+AKIDD SL    EGGR VEE RS SPYPDEFQERQNFIGSL
Sbjct: 601  GHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSL 660

Query: 661  FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
            FEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661  FEDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720

Query: 721  WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
            WGVQLQS  NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721  WGVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780

Query: 781  AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
            A WRPHDQS  GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781  ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840

Query: 841  GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
               EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Sbjct: 841  EMNEKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRI 900

Query: 901  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
            SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901  SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960

Query: 961  SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHP 1020
            S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR  QSHP
Sbjct: 961  SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020

Query: 1021 ---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
               QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKG
Sbjct: 1021 QGSQYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKG 1080

Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
            SKR APN I EVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140

Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1182
            SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESY 1200

BLAST of HG10002324 vs. TAIR 10
Match: AT3G21290.1 (dentin sialophosphoprotein-related)

HSP 1 Score: 540.4 bits (1391), Expect = 3.6e-153
Identity = 490/1265 (38.74%), Positives = 668/1265 (52.81%), Query Frame = 0

Query: 1    MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPS--GRLSLGGGGAGSVANPRNRTT 60
            M+ GSSK G  G   G G+G  R  +SFP P +  PS  GR+S GGGG GS A PR R+ 
Sbjct: 1    MFKGSSKRGGRGGSGGGGSGPSRNRNSFPPPTNRHPSPIGRMSSGGGGGGSAA-PRQRSN 60

Query: 61   T------ATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKF 120
            +      A+T+ + ++VEE F+LV   +  AF MIIRL+PDL+DEIKRVEAQGG  +IKF
Sbjct: 61   STSVKAAASTTVSSRTVEETFNLVPRESSSAFGMIIRLSPDLVDEIKRVEAQGGAAKIKF 120

Query: 121  DANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQ 180
            DA   NS+ N+I+VGGKEF+FTWS E G+LCDIYEE +SGEDG+GLLIE+G AWRKLNV 
Sbjct: 121  DAFPNNSTENIINVGGKEFKFTWSGEKGELCDIYEEHQSGEDGNGLLIEAGCAWRKLNVL 180

Query: 181  RILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKK 240
            R LDESTT+H+K  S EAE+R+KSR+AIVL+PGNPS+    KQLA AE +PWR   K KK
Sbjct: 181  RTLDESTTSHMKMRSVEAEQRTKSRKAIVLDPGNPSV---TKQLAHAEGSPWRMSNKQKK 240

Query: 241  EPPFKKQK--------------------NELSQDRLSSSPIPSPPEQSGAPVSQFGSANT 300
            EPP KK+K                        ++RLS+SP PSP  Q   P   +G  N 
Sbjct: 241  EPPPKKRKVDPPPVPVGGPKPSFRPGASTPTMKNRLSASPGPSPSNQYNTP--PYGIGNM 300

Query: 301  TKTHVIAEDIRP-RLPAKINSAASSEKEIPTKAAKGVL-ETPGQEGNSGAKPTDLQGMLY 360
             KTH   E++ P +   ++N     EKE P+     VL +T G+E  +  K  DLQ +L 
Sbjct: 301  AKTHAANENVTPVQTKGRVNMI---EKE-PSAWKNNVLRDTSGREAINVNKEIDLQSLLV 360

Query: 361  NLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSK 420
            ++L E P  MSLKALEKAVGDK+PN  KKIEPI+K++++     + +       ELE  K
Sbjct: 361  DILKEAP--MSLKALEKAVGDKVPNPAKKIEPILKRIANFQAPRYFLKPE---AELESYK 420

Query: 421  KPTSEGESSPLISHHQ-TPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNF-- 480
            K + +  SSP   H Q  PV E   DQ+  P        G    EK    + N E +   
Sbjct: 421  KHSPDSGSSP--EHQQLLPVTECSRDQLPVP--------GRNNTEKFSLCEQNGEGSLDC 480

Query: 481  -----------LEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSD 540
                        E   ++ HSP +F E+K SEN E QA SS    SDSDS+SD+SDSGSD
Sbjct: 481  LPVHLVEQLSTQENVDIEHHSPGIFHEEKRSENREAQARSS----SDSDSDSDNSDSGSD 540

Query: 541  SGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQ- 600
                    S+S  GS SGSSSDSE  A SNSK+GSDEDVDIM SD D+E     Q+  Q 
Sbjct: 541  --------SKSAAGSDSGSSSDSE--ASSNSKDGSDEDVDIM-SDGDREPLLTTQSLEQD 600

Query: 601  -----GFSTSPAAWKSPDGGAAQIIDDEKE----DGQGSDAIDIEKDSSDD----EPDAK 660
                 G  +S    +  +  A  I   + +    DG GSD +D+E +SSD+    + D K
Sbjct: 601  AIDLPGHGSSAVEIEGHNSDAVDIDGHDSDAVDIDGHGSDTVDVEGNSSDEGHGSDADRK 660

Query: 661  IDDSSLLRIE---------EGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSA 720
             +  +  ++E          G   +     F+   D  +ERQNFIG LF+D ENT  ++ 
Sbjct: 661  KNSDNNWKMETTTGTSPTANGEVGISGQEHFTSGHDNLRERQNFIGQLFDDTENTTKNNF 720

Query: 721  RHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLS 780
            ++++ D + R+ K +++++ D E   +KS H K  KS+S  Q        V   S H   
Sbjct: 721  KNDKRDISERLGKDQNQKALDFEHYSQKSAHEKNRKSQSCNQLS-----AVSKDSQH--- 780

Query: 781  PSKLNRDS-VRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGG 840
             S+L  D+ +RN ++  T            P +G  ++  EK++                
Sbjct: 781  -SELKYDAELRNASASQT----------IDPLRGLLKSSIEKSNRH-------------- 840

Query: 841  GVRAVDTAARADKHGDIGRGTKHTEKGGH------ANENFHVFKDTFYGNAENEGTKEKK 900
                     +++KH D     + ++KG H      ++ +   F+D    N  ++   + K
Sbjct: 841  --------GKSNKHSDALGNVRKSDKGDHFPLEMLSSRSGKAFRD----NQRDDVHLKNK 900

Query: 901  VSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSANRSP 960
              +N + G    +   P ++   KP E+ G  KD +  S   +G SP D+     A    
Sbjct: 901  FPRNKKDGESAIRPSLPTETSDRKPDELDGSDKDPKNVSGLSIGSSPLDSQRTYLAKLP- 960

Query: 961  VNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKG 1020
              G G  LQ+++S+LELGEL EP  E+    K  E   S +Q   K +T++    D +K 
Sbjct: 961  -KGNGPVLQKQVSELELGELPEPLGEDT-ALKPIEEKTSFRQSNLKPSTSEKLGIDSDKR 1020

Query: 1021 KSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQYNSRV 1080
            +S    S    K+++P      P    GSN     EH+VEDS R    +LQSH Q  +  
Sbjct: 1021 RSKKSDS----KKAAP------PHTVNGSN-----EHVVEDSERSQKWALQSHGQNLTGT 1080

Query: 1081 DHVEVD-----------KSVDANVKPNQGIGPEGCGESNRKASV--GISQLNDMKREQLP 1140
            D  E+            KS   + +   G   EG GE+N+K  V    S+     R    
Sbjct: 1081 D-TEISSQNKNLEDAAYKSRQKDSRARVGNSVEGYGETNKKTPVVKHGSKRASTSRSSRE 1140

Query: 1141 SKKGSKRLAPNPITEVTDALKNP-VSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKG 1177
            SK+ S     N I    DA   P  S  RE    K+  S  +E+S SY KYEK  PE KG
Sbjct: 1141 SKRHSS--VSNSINGHKDATSIPGGSVVRE----KQMTSFGEEDS-SYLKYEKASPELKG 1154
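
The identity and positives percentages in the HSP headers above are the match counts divided by the aligned length (gaps included). The following is a minimal sketch that recomputes them from the counts shown in those headers; the function name `percent` and the dictionary layout are illustrative choices, not part of the BLAST output.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the two HSP headers above (hypothetical container layout).
hsps = {
    "A0A6J1KCU5": {"identical": 1015, "positive": 1074, "length": 1205},
    "AT3G21290.1": {"identical": 490, "positive": 668, "length": 1265},
}

for name, h in hsps.items():
    identity = percent(h["identical"], h["length"])
    positives = percent(h["positive"], h["length"])
    print(f"{name}: identity {identity}%, positives {positives}%")
```

The recomputed values agree with the reported 84.23% / 89.13% and 38.74% / 52.81%.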

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038883601.1  0.0e+00   90.25         dentin sialophosphoprotein isoform X1 [Benincasa hispida]
XP_008447590.1  0.0e+00   89.17         PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo]
XP_004146856.1  0.0e+00   89.25         dentin sialophosphoprotein isoform X1 [Cucumis sativus] >KGN59835.1 hypothetical...
XP_023524752.1  0.0e+00   83.98         dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo]
XP_023524753.1  0.0e+00   84.56         dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity (%)  Description

Match Name      E-value   Identity (%)  Description
A0A1S3BIQ1      0.0e+00   89.17         dentin sialophosphoprotein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490006 PE...
A0A0A0LCU6      0.0e+00   89.25         Occludin_ELL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G84991...
A0A6J1K6B7      0.0e+00   83.65         dentin sialophosphoprotein isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC11149271...
A0A6J1KF98      0.0e+00   83.65         dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC11149271...
A0A6J1KCU5      0.0e+00   84.23         dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC11149271...

Match Name      E-value   Identity (%)  Description
AT3G21290.1     3.6e-153  38.74         dentin sialophosphoprotein-related

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description           Source       Source Term  Source Description                       Alignment
IPR010844  Occludin homology domain  PFAM         PF07303      Occludin_ELL                             coord: 1111..1180; e-value: 7.7E-10; score: 39.4
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 736..751
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 894..927
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 244..268
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 816..831
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 778..804
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 975..992
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 425..439
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 51..69
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 224..243
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 647..689
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 571..618
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 1077..1104
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 647..1104
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 958..974
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 387..635
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 490..523
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 185..268
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 185..199
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 690..734
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 524..549
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 928..947
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 860..891
None       No IPR available          MOBIDB_LITE  mobidb-lite  disorder_prediction                      coord: 1..69
None       No IPR available          PANTHER      PTHR38372    DENTIN SIALOPHOSPHOPROTEIN-LIKE PROTEIN  coord: 7..1181
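
Many of the mobidb-lite coordinate ranges listed above overlap or nest inside one another. As a rough illustration, the sketch below merges them into non-redundant regions and counts the residues they cover. It assumes the coordinates are 1-based and inclusive; the names `merge_intervals` and `mobidb_coords` are hypothetical and not part of the InterPro output.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous region when the new interval touches or overlaps it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the annotation rows above.
mobidb_coords = [
    (736, 751), (894, 927), (244, 268), (816, 831), (778, 804), (975, 992),
    (425, 439), (51, 69), (224, 243), (647, 689), (571, 618), (1077, 1104),
    (647, 1104), (958, 974), (387, 635), (490, 523), (185, 268), (185, 199),
    (690, 734), (524, 549), (928, 947), (860, 891), (1, 69),
]

disordered = merge_intervals(mobidb_coords)
total_residues = sum(end - start + 1 for start, end in disordered)
print(disordered, total_residues)
```

Under that assumption, the listed ranges collapse to four regions (1..69, 185..268, 387..635, 647..1104) covering 860 residues.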

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10002324.1  HG10002324.1  mRNA