Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATTAAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTAAGTTTTCTACAATACTTGTTCTTTAGTTAGTCATTCACATCATGTGGTAATGAAGATTTTTATGCTCTTGTGTATTTTTATTCTGTAATGTGCGAGTGTGGTATTTTTTGGATTATTTGAACTTGTGTTAAGTACTTCAACTATGCTGTCGGATATGTGCACAAGTAGATTGTACAGATTATGCTTTTGTTCACTATTTTTGGATTTCAATGGGGGCAGTTACTTTGACACTAAGTTGATAGGTTTTATTTGCTTTTGACACTAAGTTGATGGGTTTTATTTGCTTTGTGCAACTGCTACTTCATTACATATATTCTGGTCTTTATGGTCCTTATTCAGTGGAATTTGTACTCAATTTTAGTATTGCTTAATTATTTATTCTTTTGTAACATTATTGTGTTTACATTTTTTATTGGGTACCAGTTAGCTGTATGCTGAATTGGTGAATACGAATACTGTAAGGTCAAGGATTAGATTACCTAATTGAGTAGCGACATGGAAAACCGTGTAAATAAGTATTTGTTCGACAAATATAAGTAGGAATACTATAATCAGATTATATCATCAGTGCTTTTACATGCCTTTGATGCATATATGGTTTTGTAGAAGGGTATACTGGAGTTCGTGCCCCCCTGAAAATTGGGGCATGGCCATTAAAACTTTTTCAAGCTGTTGATCTACACTCGATAATACTTTCTGCAGTTATGGCCTAAAATTTGAGCTTATTTTTCTCTCTCATTTCCGTAGACCTCATAGTTTAATGATTCAAGGAAGTGCTGCCACTGTTGAGGCCTTGGTCTTAGTCTTACAATGTGCACAACATGGTTATAAAGATTATGCAATTTATGATGGATGTGGAGTAAATTTTAGTAGGAAGTAAGTAAGAGAGACAAGTTGAAAGGGTTAACTCCAATTCTCTCTCTGCTTATTGGAGAGCTTTATCTTGATAAAGTTACTATCTCAAGGATCTGATTAAATTTCCTATCGATTGACCTGGATCATATTCTCTGGTGTTGTGAGTTCGTGAGAACTGTGTGGGATTTTTTCTTTCAAACGTTCAGGTTCTCGCTTGCTCGGTGTAGGGTTTCTAGTGATATGGTCGGGGATTTCCTCCTCTATTTGCATTTTCGTGAGAAGGAGCAGTTCTTATGGATGGTCGGGGTGTGTGCTGTTCTTTGGGTTATGTGGGAGGAGCGGAGTCTCCTTTTTAGAGGGAATTACTTTCTCTTTTGGTGGGCTTGGTTTTTGTTTGCCCTATATTACTTCATTTTTAATGAAAGTTGTTTCTATAAAAAAAAAAGGAGTTATTTATTTATTTTACTCTTATATTTTAAAATATTACTTTAGGCTGATAAATTCTTAGTGTAATTTGAAAAGATGTCCAGTAGGATGGCTAGTATCCCTGAAATTTTAAAGCATGATACCAATAGGAGGTAGACTAAGTGGCCTATTGTGATTCTAGTAGGAAAACATACGGAGGGGTGGGCAGGCTTTCTACAATTATATAAAAAGGCTAACAACCTGGAATTTCCCTACCATCAGTGGTTCTTTTAATCAAGTACTCCATAATTTGTGGGTGCCCAAATACTAGATGTCATATGAACAAGTTAAAAGTAGTCGGATTTAGGAAAAAATAAAAATATATTTTCCTCTAAATTCTTCCAAAATCTCCAGACCAAGAAAGAGAAAATGTTGATTTTTGCTGATGGTCTGGATTGGGATTGATAGAGAAGGAAAGGCCTTTCTTGATAAAGAATAGGTTCTAAAGATTACGAAAAATCAGGGATGAAATAGTTGGGTCCAAATGACTTTACCATTTGAAATCTTAAAAAATTGTTGGAATGCCATCAAAAGTGAACTCATGGTAGTGTTCCAAGGTTCTTGTGGGAGTCGGGTGTTTCAAATGAATTGTGGAGTCTTGAACGCTAGAGTTTTGAGTATTAAGAGCATTGCATCTTTCCAATCCTCATGGAGGTTGGGATCTCCATTGACGTCTGATATTCCAAACCTTCCTAGATTTGGAAATATGAGGTTGCTTCTTCAGGATTTCATTTTGTCACTTCATTAGAACAGAATGGATGGACCTATTAGGATGGCTAGGGCCTAGGCAAGGGTTGGTGCTGAAATAGAAAAATTGTAAGGCCTTGTTTTGATTGCATATGCTAGGTTTCGGTTATGTTGTTGAGCCCTGTAAGATCTACTCTTTTTTCTTGATATCCCGATTCTTCTGTTTGTGTGAAAATGGGGAGGAGTACATCCTGCACATTATGTACAAAGTGCTATCCTGATTTCACTGTTTTTCAGTTTATCAATGAATTTGTGGTACAAAAATGTCTCTCTTTTGAGCAAACTTAAAAATCATAGTTTAGGGATAAATTGTATGTCCACGAACTTTCAGGGACCAAAAGTGATTATTAACCTTTAAATTATTCTCCACTCCTGAAGCTTTTTCATGTGCATTTTCTTCCTATTGATAAGAAGGGTCTAAAAGTCTCGAGGAACTGCCCCTATTTCACCGCCCCCTACTGTGTTAAAGTTATAATATGCACATGAATCTAACATATTCTCATTTCTCATCATATCAATGTTTCCTGTCAGTCAATGCTATAATCGTCCATGCACCTTAAGATCTAATTTCTGCATCTATGTTTGAAGCTATTTTATGAGGTCTTGGCGATTTAAGATTGTTGTAATTGAACTCAAATTTGACATCAGGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGGTATGCAATCCTGTATTGAGGAGTTTCTTGATTCTCTACTTTTGGCAATTAGCACATGTGACCTAACTGCATTCTGCCCTTTTCAATTTATAGATTATCTAGACTTTTATTCATTTTTATATTTTTATAAAACTAAGTGGGGCTAGTTTATGTGCTTATCTTAATTCTCACACACTTGATAGTTTTGGCACCATTACATGAACTATAATTTGTAACTGCAAAGAAAATACGTTAGCAATATTAGCACTAACATATATGTAATTTTACCTAGTAAGTCTATTAGAGTTGTGTAAATTGAGGAGATTTCTTAGTGTATTTTGGGAGACACCAAACCTCTCAAAGAACTTGGGAGCTTGTACTTCTTTTGTAAATTTTAATGCAAATCAAATTGTGTCTTTCAACTTCTTATGAAATTGGTGTACGCTTTAAACTGACAAACCTCGTCAATAGGCCTGGTCCTATCCGAAATTATTCCCCAAAGATGTCCTCCCCGAAGTTCCTATTCACTCTATTAATACAACAACTTAGCTCTAAGAAACTTACCAAATAAGCCAATAACATATCCTTACTAATATTCTTAACCAAATGGTGATAAGATTTTATTGAGACATTTTTAAAAAAAAAAATTTAAAAATTTTTTTTTTTTTTAAAAAAATCATTGGTGTGTTAACAATAAGGTACAAAAGCCGAACAAAGTCAAGGAGTTTGAGAAAAGCTTGTTGATTAGCTATCATCATAAAATTAAGAATATTACAAAAAGATTAGAATGGAGGCTCAGGTTCCCCTCTCAGTTTGCAAGGTTTAAGGCCTTAGGCTGTCACCTTTTTGTCTGGTTAATATATCTATTGTTTCTTAAAAAAAAAAAAGGAAAAAGAAAAAATGGTTTAAGGCTGGGTGTGGAAAATATTTGCTTTTTGGAAGATAGTTGAATGATCAGGTTTGATTCCTTGATTATTTATGCTGTTCTGTGCACAAAAGAAGCCTCAATTTCTGATTGTGAGGATAGTAGCAATCAAGCTTGGGACTTGGGTCTTATTAGAGGTGTTTTTGATAGAGAGCTTCACTTGGGTGACCCTCATGGAAAAGATTAGCTCGTTTCATCTAGGGAGGGAGTGGGTTGAATTTGTGGGTCTGGAAGATTAGGGGAGATATTCTAGTAAGTTGGCCTTTCAAAACATCCCCACAAAACCCGCCTTTAGGGGAGAACAATAGGGTTTTTAGAGGTATGGAGAGGGCCTCTAGTGAGATTTGGGCCCTTGTTCGTTATCATGTTTCCTTGTGCGCTTCGATTTCGAAGATTTTTTGTAACTATTCTATTGGTGTTATTTTGCATAGTAGGACTTCCTTTCTTTTGAGGGTGCTCCCCTTTTTGTAGGTTGGTTTTTTGGATGCCTGTGTATTCTTTCATTTTTCTCGATGAAAGTGGTTATTTTTATATTTAAAAAAAAACCCCATAAAATCCACTAACACCAAGGTTAACATTTCCCTGACTAGCAAGGGATCAAACCGAATCGAACTGAAAAATTTAAAAATGTATATACGAAACCGAACTGAACCGATTTATAGATTCGAACCGAACCTAATTGCAAATGTGGTTCGGTTCGATTTTTAGTTTTGCTCTTATATATATATATTGGTTTTTTGTTTTGTTTCCTTTTCTAAAACCAAAGAGAAAGAGAAAGAGATTATAGTTTGGGTTTGAAGCATGTAAAACAAAAGTAAAGGCAAATGAGACAATGAGCAATATGTAATGAAAGTTAAATTTCAATGCCGAACTATTAATTAATTACAAATCCCATTTCCCCACTCTTTTCTTTTCAAACATTAAGAAGCCGCTCTCACCTAGATCTACCATTTTCCTTAAGAATTAGGCTTCAAATCAAATGGAATAGCATGTTAATATAATGGTGCAAGCGACAAAGACCTCTTGGTCTGACGCTGACCTTCAGAGAGCCGAGAGAGAGAGAGAGACGAGACGAGAGATGGAGGGAGATGAGGGATCATGAGAGAGAGACTGAGAGAGGGAGACGAAGTGAAGCCGAGAGAGAGGGAGGGAGAAGAAGGTCGAACATTGAAGGATGGCTGATGGAGGGTCGCACGTTGAAGGATGGTCGAAGGTCGACTGAGAGACGATACAGGGGCTGGCGACAGAGCACTAAGAGAGGAAGAGGGCTGGGTGGTGACAATCTTTAAGTTGAATAACCCTAAGTTGCAACCCAATAAAATGGTTCGGTTTTAAATTTGGTTTTCGAAAATGAAATGAACCAACCGAATCATTTGGTTTCAAAATTCCTTTAAACCGAACTAAAACACAATTTTCAATTTCAATGATAATTCGGTTTTGTAAAAGGGTTCGGTTTTAGCGATTCAGTGTACACCCCTACTGATTAGTATGATTTGGAAGCACAAATGCCCCAAGAAAGTCAAGATCTTCCTAAAGCCTTTGGCTTTTAGGAGCCTTAACATGGATGATAGACTTCAAAGGAAGTTCAGGAATTGGTCTTTTTCCTTTGAGATGCAGGTTATGTTTGGATGGTGGGAACCGAGGAGAACCTTGAACACAAACACCTCTTCAACCATTGTGACTTTGCTTATAGATCGTGGAGTTACTTCTTTGGCATTTTGGGCATCTTTGCTTGGGCCTAGGAGGGTAGATGATTGGATTATGAAAGGCTTAAATGCTTGAAATTTGACGGAAAGAGCCAAAATTCTTGGGAGTTGTGCTTTTCGAGCTCTTTTGTGGCATATTTGGAACGAAAGGAATGTGAGGTCTTTCGAAGATAACTCTTTTTGTCTTAATTTATTTTGCGATCTTCTACAAAATACAGTGTCTTGGTGGATATCTCTATGTTCAAAATTCTTTTGTAATTACAACCTCTAGATGATTATTAACGATTGGAAAGCTCTCTGTGTAGAGTTTGGGGAGGGTGTTCCCTTTGCCCCCGACCCTTAGGGTGTTCTTGGTTTTTTCTTTTTTTTTCTTTTTTTTTTATTTTTTTTTATAATTTTAAAAGTATTATTAATATTTTTATATCAATATATTCTCTCGTATCTTTAAAGAAAAAGAAAGGAAGCCCGCCCAAGAAGAGGCCGCAAACTTATTCCCACATGAATGAAATGCCTTTTAAAATCATTTTGTTTCTAATATAACATAACTACAGATGACTTTCAAGCTTTTTTTCTATTTAATTAACTTTCGACATGAAACTATAGCGTTTTCTTGTTGGCTAGTGGAATGATCTTAAATTAAAATTTTATTTGGGTTGTGGATGTTTCACTGTTCTTCTAAGCCTTGATTTAGTTTTATTTGTAAGTTTAATGATAATATTACAATCAACTTCTTTTTTAATATTCGTCCTCCCTCTCAAACCAAAATACAGAGGAAAATAAAGCTTTTGATGACAGTTTGTGAAACCTTTTACTTTTTTGAGTATATAACTTCTAAACTATAGAGAGATTTTGGTTTTTGTTATAGAGCTATCCGTGATTGTGTTTGGTACTATCTTTACACATTCATGCACATATTTTCTGTTTCTTGCTGCAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGGTATGCATTTTTTTAAGGATTTGTTTAGTGGTGACTGGATAATTTATGTTATTTTATCATAGGCATGAATAGGAGGTTGATTCATTCTTATATCTGGTAAAGTTGTCTTATCAGCCTGCTTTAATTACTATTTTCTTTTTCTTCATCTTTTTTTTTGTTGTTGTTCTTATTGTTGCTGTTACTGTTATCTGATTGTAATACTGTAAAAACTGAGCCTTCTATGTGGAGATCAAGCCTCTACTAATACCTTGCTTGCTTTGCTTGCAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGGTGATTGTCTGTTTACACATATTGAGTTGTTCATTTTCTTTTTGCAAGTTTTTATGCTTCGTTCTTCCTCTTCTGGCATAAAACTTTTTCTCCCTCTTTGAGTGTTTTCTCTCTCTGACATATACTGTTTCTCCTGATGGACATTAATGTCTATTAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTAGCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGTACACAAGTAAAACTTCTACATTAGGAACGTAGGATGTTAATTTTCATCTATCTTAATTGAAGTATTGTCTATTGATTCCTAATATGAAAATAATGTTTGATGTTTCCTGTTCATTGTATGTTTGTTCCCTTTAACAAATTTATCAGTTCTGTTCAAAACATATGGGTTTTACTATTGTCTACTTCTCTCTCAATGTTAACTCTTCTGTCACTCCTTCTCCCTTTCTTCCCCTCCCTCTTTCTTGTAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGTTAGTGTTTATTCCTTCCTTTTTTTCCCCTCTTTATATATATATATAGACAGATAGATAAGAAACAATGTAGAATTGAACAAAAAGAGAAAGAATAGCCTAGGGGTCGGGGGTAGAGAGAACTTCACCCGAAAGAACTATAAGAAAGCTTTCTCGTTATTCACAATTGTGGGAAGGTTGTGGTTACACAAGAATTTCCTGAATATTATTTAGTAAATCTCTTACTACTCCCTACATATGTTGTCTTGAATATTACTTCAATTAAAAAGATCCAACAAATGAACTATCAATTGAAACAAGGAGAAAATGACAGGCGTGAATTGAACTCTGTTCTTTGAACTCATAATAAATTGACTGCTCCTGAAATCTTCCGATGTTGTTGTAGTTTTGAATTCTGATATCTTCCATGCATGCTGAAGTTATTGTTCTTATATTCTTGTTTGGTCTTGGCTATTAGATTGCAACCTACCAAGCTCCGGGGAGATATTGTTTGAAATCAGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGGTACTTCAACATTTAAAGGGCTGATTTCATGTGGTTGACTTGAACAGTTATTTGCAAAATCTTAAATCAGGTTGTTTGTTCCTGTTACCATTTCTGATGTTGTTGGTCAGACAGCGTCAACTGTCTCATTTTTTGAGAATATTACATTATTACATACTGGATCCAAACCTACAACCTATAGAGGGAGTAGAAATGCTTCAAAAGACCCTCACATGTCTCATTTTTTTGAGAATATTACATTACATACTGGATTCGAAATTATAATCTCTTAGAGGAAGTAGAAATGCTTCAACACAACCTTTAATGTCTCATTTTTTTTAGAATATTACATTACATGCTGGATTCAGACTTATAACCTCTTAGAGGAAGTAGAAATGCCTCAACTGTCTTGTAGTCATTGTTTGGTTTTATTGATTATCAATAAACTATATTTTATGAGAAAAGAAAGCACTAAGCTTACTGCAAGGATTTATAATAAACAAAAATGACTTCATGCAAACATAATGTTTAATGGTTAAGACATTTCTACCTCCTCTAAGAGATTGCAGGTTCAAACCTCCACGTGTTTGTAATATTGTCTATACAAAAAATAATAGAAAAACAAAATAACTTCATGAAAAGAGTAATTACAGTGGAGATTGGATCAAGGATCAAACCAAAAGAAAGCTCAACCCAGTGATTTACTAGTCGGTGTATTTGATATATATGGATTGCTTGATTTTCATGTAGGAGTCTAATTCATGTCTTGTACTTTTACAGCTCTCCTTTGATCAGCCATCACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAATTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGTGGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGTGTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAGAATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTTAACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGAAGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATCCAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTAAGTTTAATCTATAACTCATTCAAGTTCTTTTACTTTTGTTATTAATGTCTGTGATAACTGATGAAAAGATATACTTATTATTAGGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGTAATAGAACTTTTCATTAATGTCCGATCCTTATCCATTCCTACACATGTTGCAGTCTGCCTGACTTAATATCCGGTTATAATGTTTAAAGAAGTTTGATGATTTATTTGTGTGTTCTGGTATGTTGTTCCTGTAAAGAAAACATTATTAAGTTTTTTTAGCATCGATTGATATATGTAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAGAATCCTATCGGCTGTGTTCAACGGTATGGTCACCTATCTTAGTATCTTTACTTCAAGTTTGCAGTTGATAGTATTAGTAGTTTCTCACTTTCCAGTGTATGTCTATGGGACTCTGGATTAATTTGACATTAGTAATTTCCCTTGCACAAAATCCCAATGTTTACTCTGTGGCAAAGCTCTGAGTTCTGATATTTCATTCAGCTAGTGCTCGTTGTAGTTTAGGTTTGTATGATCTTTTGGTTCATATTGAAAAATTGGTGTTCCAGAAGGGGGCAGAGGGAGAGAGGGAACTCGTGATTGGAAGTAAAAGGTTTAGGATTGTATAGGATTAAGGCTGAGAGTAAATTCAGAGGCAGCCAGGCATGGTTGACGGAATTGGTAGAGGCAATACTTTCTCTATCTCTGTTGATTGGATAACCTCATGTGTTCAGGTTATTTGAATCTGGATTAGATGGCTCACATGGCTTGTACTTGATGTAGGATTAATATTTTTTGTATGCCTTGATCTGTATGCACATCAATTCTTTTCCCCCTTGTTATACACAAATTTTCTTACATAATGAAGTTGACTCAAACTAAGATTAATTTGCAGAGGCACAAGAGGTTGAAAAAAATATTCATTGTACTCCACGAAGAGCTGAAGGTGATTAACATTAACTAACTTATCTACATTACCTTCTTGCTTTCTTGTAGTTCTAATTTTTATCTCTAGTTCGATTTTCTTTATATGTACTATATTGTTGCAGCATATAAAGGAAAGGATTAGAGATTTTGCACGAACTTGTACAAAAGATTAAGACTAGATGTGCCTCTCTTCGAGCAGTCGAACGTCCTTTAAACAACCATTTTCAACTCAAGGTAGGTTACTAAAAGATCGAGGTGGATGCAAGAACTGCATCAATGAATTCCCGTGTATAGAAAATTAGTCAATTTCTTGCTACCTTCTTCCCAACCCCAATATAGGAACATGTAAATTTTTTTGCTCTCTCCCTCTCTTAAAAAAATGCATTTTCTTTTGTTTAGCTAAGTAGAGAGGTTGGACTTTTGTTTTAGTTTAGCCTGACCTCTTCCTGTATAGACTTGTGAGGCTTATTTTAAGTTTGGAAAATATAATCTATTTTATAGGTTGTACAAATTTTGAAGGTGTAAAAATGTTCCCTATAACTGTGAATATGCTAATGTTATTGGCAATCATGGGAAGTTTTCTTGGAGGGGAATTCTATAGTGAAGTAATGAGAAGTTTCTTCTCTGGGACAATGCAGGCTCTGTTGAATTGGTTAAATTATTACTGTGTAGGTTTTTACATCATTGATAGAGCACAATTTTTTCTTTAAATTAAAAATAATTTGAACCCTCCTTGAAAATGAAATGATAAATGGATGGTACTCCTTGTAAGATATAGAATTGATGACACGTTTAACTTGTGGAATAGATAATATATTCTAAATGGTAATAACATTAAACAGCTGAAATGTTTTCGAACAGTGGTGGCATATGTGGACATGTGACAACTGACAAATCGAATAGATGACACTATCGTATATTACCATCTTTTTCTAGGTAGAATCGATGACCATTTTTTTAAGTGAAAGGTAGCGATGATAAATTGGAGTATGTTAAGAGTTAAAAATCGTGTAAAATATTGAGGTTCTCATTCAATATCTATTTGATTCTTCAAGTTTGAAAATAAATGGTCTTAAATGTGTTTTGACTAACAGAAATATGACTTAGCAATGAGATTGGTGATTTATTTGCTAATTGAAAAATTATATTAATTTAATATAATCACTTAGTATTATGTGCTATCTTTCTCCTTCCTCTCTTTTCCTTTACCCTTCATTTTTTCTTCCAGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA
mRNA sequence
ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATTAAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTAGCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATCACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAATTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGTGGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGTGTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAGAATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTTAACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGAAGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATCCAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAGAATCCTATCGGCTGTGTTCAACGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA
Coding sequence (CDS)
ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTCCGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCAGTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATTAAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGAAGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTAAGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAAGCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGAGCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTAGCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAATTTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAGTGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATCACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCTAACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTCTGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGAGTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAAGCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAATTGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTTCACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACAGGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGTTTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAATTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGTGGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGTGTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTGACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGTGCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAAGAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAGAATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTTAACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGAAGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATCCAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCCAAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAACAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAGAATCCTATCGGCTGTGTTCAACGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA
Protein sequence
MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQDRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDIRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDGQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCSTPKASSPTNLGKHDGGWRV
Homology
BLAST of HG10002324 vs. NCBI nr
Match:
XP_038883601.1 (dentin sialophosphoprotein isoform X1 [Benincasa hispida])
HSP 1 Score: 2007.6 bits (5200), Expect = 0.0e+00
Identity = 1083/1200 (90.25%), Postives = 1115/1200 (92.92%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG+SKLGRPG GAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGG SV+NPRNRTTTA
Sbjct: 1 MYGGASKLGRPGGGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGPVSVSNPRNRTTTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS
Sbjct: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 241 ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
ELSQ DRLSSSPIPSPPEQSGAPVS FGSANTTKTHVI EDI
Sbjct: 241 ELSQVGPPKSTYKPGISSLPASKDRLSSSPIPSPPEQSGAPVSHFGSANTTKTHVITEDI 300
Query: 301 RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
RPRLPAK+N+AASSEKEI TKAAKGVLETPGQEGNSGAK TDLQGMLYNLL ENPKGMSL
Sbjct: 301 RPRLPAKVNAAASSEKEISTKAAKGVLETPGQEGNSGAKTTDLQGMLYNLLLENPKGMSL 360
Query: 361 KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
KALEKAVGDKIPNAVKKIEPIIKK++ + + GVELEGSKKP+SEGESSPLI
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKS--GVELEGSKKPSSEGESSPLI 420
Query: 421 SHHQT-PVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
SHHQ PVHEDLPDQITAPELQLEAR GIELEEKVETSQANK+SNFLEKNG+QQHSPDLF
Sbjct: 421 SHHQNPPVHEDLPDQITAPELQLEARSGIELEEKVETSQANKKSNFLEKNGIQQHSPDLF 480
Query: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
AEKKGSENSE QAASSSDNESDSDSESDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSES
Sbjct: 481 AEKKGSENSERQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSES 540
Query: 541 DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
+ PSNSKEGSDEDVDIMTSDDDKESKHKLQA QGFSTSPAAWKSPDGGA QIIDDEKED
Sbjct: 541 EVPSNSKEGSDEDVDIMTSDDDKESKHKLQASAQGFSTSPAAWKSPDGGAVQIIDDEKED 600
Query: 601 GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
GQ SDAIDIE DSSDDEPDAKIDD S L I EGGR VEEPRSFSPYPDEFQERQNFIGSL
Sbjct: 601 GQESDAIDIENDSSDDEPDAKIDDRSFLPI-EGGRLVEEPRSFSPYPDEFQERQNFIGSL 660
Query: 661 FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
FEDR+NT++DS RHEQSDSTG+ISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVS
Sbjct: 661 FEDRDNTVVDSGRHEQSDSTGQISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVS-- 720
Query: 721 WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
SK RDSVRNPTSQVTNKGE+KGNSDFRPKKG+KETV EKNSSDVSQ
Sbjct: 721 -------------SKHGRDSVRNPTSQVTNKGEVKGNSDFRPKKGHKETVSEKNSSDVSQ 780
Query: 781 AGWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEG 840
AGWRPHDQS GGVRAVDTAAR DKHGDIGRGTKHTEK GHANENFH+FKDTF+GNAENEG
Sbjct: 781 AGWRPHDQS-GGVRAVDTAARTDKHGDIGRGTKHTEKSGHANENFHMFKDTFHGNAENEG 840
Query: 841 TKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVS 900
TKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNR+S
Sbjct: 841 TKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRIS 900
Query: 901 ANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSS 960
ANRSPVNGKGR LQRELSDLELGELR+PFPEE+RGKKKFERNNSLKQLENKE+TTDIW S
Sbjct: 901 ANRSPVNGKGRILQRELSDLELGELRDPFPEESRGKKKFERNNSLKQLENKESTTDIWGS 960
Query: 961 DLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQ 1020
DL++GKSNLK S+EYGKRS PHVSTKFPSNPEGSNKKK SEHIVEDSTRLN RSLQSHPQ
Sbjct: 961 DLSRGKSNLKTSLEYGKRSPPHVSTKFPSNPEGSNKKKTSEHIVEDSTRLNQRSLQSHPQ 1020
Query: 1021 YNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRL 1080
YNSRVDHVEVDKS+ ANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRL
Sbjct: 1021 YNSRVDHVEVDKSIAANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRL 1080
Query: 1081 APNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYK 1140
APNPITEVT+ALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQY+
Sbjct: 1081 APNPITEVTEALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYE 1140
Query: 1141 EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCS 1181
EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCS
Sbjct: 1141 EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCS 1181
BLAST of HG10002324 vs. NCBI nr
Match:
XP_008447590.1 (PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo])
HSP 1 Score: 1984.1 bits (5139), Expect = 0.0e+00
Identity = 1070/1200 (89.17%), Postives = 1097/1200 (91.42%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKLGRPG GAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHGGKRPHSSFPLPPSHRPSGRLSLGGGAAGSASNPRNRTTTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDA A NSS
Sbjct: 61 TTSEASQSTEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDAIANNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 241 ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
ELSQ DRLSSSPIP PPEQ G PVSQFGSANT KTHVIAEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGGPVSQFGSANTNKTHVIAEDI 300
Query: 301 RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
RPR+PAKIN AAS+EKEI T A KGVLETPGQEGNSGAKPTDLQGMLYNLL ENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEILTIAPKGVLETPGQEGNSGAKPTDLQGMLYNLLLENPKGMSL 360
Query: 361 KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
KALEKAVGDKIPNAVKKIEPIIKK++ + + GVELEGSKKPTSEGESSPL+
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKS--GVELEGSKKPTSEGESSPLV 420
Query: 421 SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
SHHQT VHEDLPDQI APELQLEA GI+LEEKVETSQANKESNFLEKNG+QQ PD FA
Sbjct: 421 SHHQTSVHEDLPDQINAPELQLEAGCGIDLEEKVETSQANKESNFLEKNGIQQ--PDPFA 480
Query: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
EKKGSENSEGQAASSSDN SDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGQAASSSDNVSDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540
Query: 541 APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
PSNS+EGSDEDVDIMTSDDDKESKHKLQA VQGFSTSPAAWKSPDGG QIIDDEKEDG
Sbjct: 541 GPSNSQEGSDEDVDIMTSDDDKESKHKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600
Query: 601 QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Q DAIDIEKDSSDDEPDAK+D SLL EE GRPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601 QEYDAIDIEKDSSDDEPDAKVDGRSLLPTEEVGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Query: 661 EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
EDREN + DSARHEQSDSTGRISKGKSKRSSDLECLEEK+DHTKRLKSESLAQQPVSGNW
Sbjct: 661 EDRENNVADSARHEQSDSTGRISKGKSKRSSDLECLEEKADHTKRLKSESLAQQPVSGNW 720
Query: 721 GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDV QA
Sbjct: 721 GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVPQA 780
Query: 781 GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
GWRPHDQS GGVRAVDTA RADKHGDIGRGTKH EK GHANENFHVFKDTFYGNA+NEGT
Sbjct: 781 GWRPHDQS-GGVRAVDTATRADKHGDIGRGTKHIEKSGHANENFHVFKDTFYGNADNEGT 840
Query: 841 KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
KEKKVSKNSRSGGPGDK IQPFDSH SKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841 KEKKVSKNSRSGGPGDKHIQPFDSHQSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900
Query: 901 NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
NRSPVNGKGR LQRE SDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901 NRSPVNGKGRILQREPSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWGSD 960
Query: 961 LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEH+VEDS RLN+RSL SH QY
Sbjct: 961 LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHMVEDSNRLNNRSLLSHSQY 1020
Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
NSR+DH EVDKSVD NV+PNQG GPEG ESNRKASVGISQLND KREQLPSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSVDGNVRPNQGNGPEGYVESNRKASVGISQLNDTKREQLPSKKGSKRQA 1080
Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
PNPITEVTD LKNP+SAE ENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPISAEHENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1195
BLAST of HG10002324 vs. NCBI nr
Match:
XP_004146856.1 (dentin sialophosphoprotein isoform X1 [Cucumis sativus] >KGN59835.1 hypothetical protein Csa_002066 [Cucumis sativus])
HSP 1 Score: 1983.0 bits (5136), Expect = 0.0e+00
Identity = 1071/1200 (89.25%), Postives = 1097/1200 (91.42%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKLGRPG GAGRG GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHAGKRPHSSFPLPPSHRPSGRLSLGGGAAGSVSNPRNRTTTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDANA NSS
Sbjct: 61 TTSEASQSAEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDANANNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 241 ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
ELSQ DRLSSSPIP PPEQ GAPVSQFGSANT+KTHVIAEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDI 300
Query: 301 RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
RPR+PAKIN AAS+EKEIPT A KGVLETPGQEGNSG KPTDLQGMLYNLL ENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSL 360
Query: 361 KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
KALEKAVGDKIPNAVKKIEPIIKK++ ++ + GV LEGSKKPTSEGESSPLI
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKS--GVGLEGSKKPTSEGESSPLI 420
Query: 421 SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
SHHQT VHEDLPDQ APELQLEAR G++LEEKVETSQANKESNFLE NG+QQ PD FA
Sbjct: 421 SHHQTSVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQ--PDPFA 480
Query: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
EKK SENSEGQAASSSDNESDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481 EKKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540
Query: 541 APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
PSNS+EGSD DVDIMTSDDDKESK KLQA VQGFSTSPAAWKSPDGG QIIDDEKEDG
Sbjct: 541 GPSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600
Query: 601 QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Q DAIDIEKDSSDDEPDAKID SLL EEG RPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601 QEYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Query: 661 EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
EDREN ++DSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW
Sbjct: 661 EDRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
Query: 721 GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDVSQA
Sbjct: 721 GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQA 780
Query: 781 GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
GWRPHDQS GVRAVDTA RADKHGDIGRGTKHTEK GHANENFHVFKDTFYGN +NEGT
Sbjct: 781 GWRPHDQS--GVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGT 840
Query: 841 KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
KEKKVSKNSRSGGPGDKQIQP DSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841 KEKKVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900
Query: 901 NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
NRSPVNGKGR LQRE SDLELGELREPF EEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901 NRSPVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSD 960
Query: 961 LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDS R+N+RSL SH QY
Sbjct: 961 LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQY 1020
Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
NSR+DH EVDKS D NVKPNQG GPEG ESNRKASVGISQLND KREQ PSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQA 1080
Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
PNPITEVTD LKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1194
BLAST of HG10002324 vs. NCBI nr
Match:
XP_023524752.1 (dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1875.1 bits (4856), Expect = 0.0e+00
Identity = 1022/1217 (83.98%), Postives = 1084/1217 (89.07%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
NELSQ +RLSSSP+PSPPEQSGAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300
Query: 301 IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
I+PR PAKIN+AASSEK+IPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
LKALEKAVGDKIPN+VKKIEPIIKK++ + + VE+EGSKKP+SEGESSPL
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVEVEGSKKPSSEGESSPL 420
Query: 421 ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
+SH QTPVHED DQ PE QLEAR IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421 VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480
Query: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540
Query: 541 DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
DAPSNSKEGSDEDVDIMTSDDDKE K+KLQA VQGFS SPAAWKSPDGGA IDDEKED
Sbjct: 541 DAPSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600
Query: 601 GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
G SDAIDIEKDSSDDEP+AKIDD SL EGGRPVEE RS SPYPDEFQERQNFIGSL
Sbjct: 601 GHESDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSL 660
Query: 661 FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
FEDRENT++DSARHEQSDST R+SKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661 FEDRENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720
Query: 721 WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
WG QLQS NLSPSKLNRDS RNPTSQVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721 WGAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780
Query: 781 AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
A WRPHDQS GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781 ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840
Query: 841 GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
G EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ FSSSQMGYSPRDNNNR+
Sbjct: 841 GMNEKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRI 900
Query: 901 SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901 SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960
Query: 961 SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSH- 1020
S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR QSH
Sbjct: 961 SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020
Query: 1021 --PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
PQY+SRVDHVEV+K VDANVKPNQGIGPE CGESNRKASVGISQL+DMKREQLPSKKG
Sbjct: 1021 QGPQYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKG 1080
Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
SKR APN ITEVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140
Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1194
SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESY 1200
BLAST of HG10002324 vs. NCBI nr
Match:
XP_023524753.1 (dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1874.0 bits (4853), Expect = 0.0e+00
Identity = 1019/1205 (84.56%), Postives = 1079/1205 (89.54%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSAA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
NELSQ +RLSSSP+PSPPEQSGAP+SQFGSAN TKTH AED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQSGAPISQFGSANPTKTHCTAED 300
Query: 301 IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
I+PR PAKIN+AASSEK+IPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKDIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
LKALEKAVGDKIPN+VKKIEPIIKK++ + + VE+EGSKKP+SEGESSPL
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVEVEGSKKPSSEGESSPL 420
Query: 421 ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
+SH QTPVHED DQ PE QLEAR IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421 VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480
Query: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540
Query: 541 DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
DAPSNSKEGSDEDVDIMTSDDDKE K+KLQA VQGFS SPAAWKSPDGGA IDDEKED
Sbjct: 541 DAPSNSKEGSDEDVDIMTSDDDKEPKNKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600
Query: 601 GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
G SDAIDIEKDSSDDEP+AKIDD SL EGGRPVEE RS SPYPDEFQERQNFIGSL
Sbjct: 601 GHESDAIDIEKDSSDDEPEAKIDDRSLPPTVEGGRPVEESRSLSPYPDEFQERQNFIGSL 660
Query: 661 FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
FEDRENT++DSARHEQSDST R+SKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661 FEDRENTVVDSARHEQSDSTDRMSKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720
Query: 721 WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
WG QLQS NLSPSKLNRDS RNPTSQVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721 WGAQLQSSRNLSPSKLNRDSARNPTSQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780
Query: 781 AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
A WRPHDQS GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781 ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840
Query: 841 GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
G EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ FSSSQMGYSPRDNNNR+
Sbjct: 841 GMNEKKVSRNSRSGGPGDKQIQPCDSHLSKPGDIVGKFKDGKTFSSSQMGYSPRDNNNRI 900
Query: 901 SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901 SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960
Query: 961 SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSH- 1020
S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR QSH
Sbjct: 961 SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020
Query: 1021 --PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
PQY+SRVDHVEV+K VDANVKPNQGIGPE CGESNRKASVGISQL+DMKREQLPSKKG
Sbjct: 1021 QGPQYSSRVDHVEVEKPVDANVKPNQGIGPESCGESNRKASVGISQLHDMKREQLPSKKG 1080
Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
SKR APN ITEVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQITEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140
Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1182
SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSSRGQDSDKYFNLLGQLKESY 1200
BLAST of HG10002324 vs. ExPASy TrEMBL
Match:
A0A1S3BIQ1 (dentin sialophosphoprotein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490006 PE=4 SV=1)
HSP 1 Score: 1984.1 bits (5139), Expect = 0.0e+00
Identity = 1070/1200 (89.17%), Postives = 1097/1200 (91.42%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKLGRPG GAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHGGKRPHSSFPLPPSHRPSGRLSLGGGAAGSASNPRNRTTTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDA A NSS
Sbjct: 61 TTSEASQSTEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDAIANNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 241 ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
ELSQ DRLSSSPIP PPEQ G PVSQFGSANT KTHVIAEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGGPVSQFGSANTNKTHVIAEDI 300
Query: 301 RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
RPR+PAKIN AAS+EKEI T A KGVLETPGQEGNSGAKPTDLQGMLYNLL ENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEILTIAPKGVLETPGQEGNSGAKPTDLQGMLYNLLLENPKGMSL 360
Query: 361 KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
KALEKAVGDKIPNAVKKIEPIIKK++ + + GVELEGSKKPTSEGESSPL+
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYCLKS--GVELEGSKKPTSEGESSPLV 420
Query: 421 SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
SHHQT VHEDLPDQI APELQLEA GI+LEEKVETSQANKESNFLEKNG+QQ PD FA
Sbjct: 421 SHHQTSVHEDLPDQINAPELQLEAGCGIDLEEKVETSQANKESNFLEKNGIQQ--PDPFA 480
Query: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
EKKGSENSEGQAASSSDN SDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481 EKKGSENSEGQAASSSDNVSDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540
Query: 541 APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
PSNS+EGSDEDVDIMTSDDDKESKHKLQA VQGFSTSPAAWKSPDGG QIIDDEKEDG
Sbjct: 541 GPSNSQEGSDEDVDIMTSDDDKESKHKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600
Query: 601 QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Q DAIDIEKDSSDDEPDAK+D SLL EE GRPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601 QEYDAIDIEKDSSDDEPDAKVDGRSLLPTEEVGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Query: 661 EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
EDREN + DSARHEQSDSTGRISKGKSKRSSDLECLEEK+DHTKRLKSESLAQQPVSGNW
Sbjct: 661 EDRENNVADSARHEQSDSTGRISKGKSKRSSDLECLEEKADHTKRLKSESLAQQPVSGNW 720
Query: 721 GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDV QA
Sbjct: 721 GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVPQA 780
Query: 781 GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
GWRPHDQS GGVRAVDTA RADKHGDIGRGTKH EK GHANENFHVFKDTFYGNA+NEGT
Sbjct: 781 GWRPHDQS-GGVRAVDTATRADKHGDIGRGTKHIEKSGHANENFHVFKDTFYGNADNEGT 840
Query: 841 KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
KEKKVSKNSRSGGPGDK IQPFDSH SKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841 KEKKVSKNSRSGGPGDKHIQPFDSHQSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900
Query: 901 NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
NRSPVNGKGR LQRE SDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901 NRSPVNGKGRILQREPSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWGSD 960
Query: 961 LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEH+VEDS RLN+RSL SH QY
Sbjct: 961 LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHMVEDSNRLNNRSLLSHSQY 1020
Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
NSR+DH EVDKSVD NV+PNQG GPEG ESNRKASVGISQLND KREQLPSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSVDGNVRPNQGNGPEGYVESNRKASVGISQLNDTKREQLPSKKGSKRQA 1080
Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
PNPITEVTD LKNP+SAE ENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPISAEHENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1195
BLAST of HG10002324 vs. ExPASy TrEMBL
Match:
A0A0A0LCU6 (Occludin_ELL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G849910 PE=4 SV=1)
HSP 1 Score: 1983.0 bits (5136), Expect = 0.0e+00
Identity = 1071/1200 (89.25%), Postives = 1097/1200 (91.42%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKLGRPG GAGRG GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTA
Sbjct: 1 MYGGPSKLGRPGGGAGRGHAGKRPHSSFPLPPSHRPSGRLSLGGGAAGSVSNPRNRTTTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
TTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKRVEAQGGTPRIKFDANA NSS
Sbjct: 61 TTSEASQSAEENFSLVTGNNPLAFGMIIRLAPDLIDEIKRVEAQGGTPRIKFDANANNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSRERGDSCEIYEERKSGEDGSGLLIESGNCWRKLNVHRVLDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
NHVKKLSEEAER+SKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN
Sbjct: 181 NHVKKLSEEAERKSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKN 240
Query: 241 ELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI 300
ELSQ DRLSSSPIP PPEQ GAPVSQFGSANT+KTHVIAEDI
Sbjct: 241 ELSQVGPPKSTYKPGMPSLPASKDRLSSSPIPLPPEQFGAPVSQFGSANTSKTHVIAEDI 300
Query: 301 RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSL 360
RPR+PAKIN AAS+EKEIPT A KGVLETPGQEGNSG KPTDLQGMLYNLL ENPKGMSL
Sbjct: 301 RPRVPAKINPAASNEKEIPTIAPKGVLETPGQEGNSGTKPTDLQGMLYNLLLENPKGMSL 360
Query: 361 KALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLI 420
KALEKAVGDKIPNAVKKIEPIIKK++ ++ + GV LEGSKKPTSEGESSPLI
Sbjct: 361 KALEKAVGDKIPNAVKKIEPIIKKIATYQAPGRYLLKS--GVGLEGSKKPTSEGESSPLI 420
Query: 421 SHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFA 480
SHHQT VHEDLPDQ APELQLEAR G++LEEKVETSQANKESNFLE NG+QQ PD FA
Sbjct: 421 SHHQTSVHEDLPDQTNAPELQLEARCGMDLEEKVETSQANKESNFLETNGIQQ--PDPFA 480
Query: 481 EKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESD 540
EKK SENSEGQAASSSDNESDSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD
Sbjct: 481 EKKSSENSEGQAASSSDNESDSDSDSDSSDSGSDSGNHSRSRSRSPVGSGSGSSSDSESD 540
Query: 541 APSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG 600
PSNS+EGSD DVDIMTSDDDKESK KLQA VQGFSTSPAAWKSPDGG QIIDDEKEDG
Sbjct: 541 GPSNSQEGSDVDVDIMTSDDDKESKQKLQASVQGFSTSPAAWKSPDGGPVQIIDDEKEDG 600
Query: 601 QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Q DAIDIEKDSSDDEPDAKID SLL EEG RPVEEPRSFSPYPDEFQERQNFIGSLF
Sbjct: 601 QEYDAIDIEKDSSDDEPDAKIDGRSLLPTEEGVRPVEEPRSFSPYPDEFQERQNFIGSLF 660
Query: 661 EDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
EDREN ++DSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW
Sbjct: 661 EDRENNVVDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNW 720
Query: 721 GVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQA 780
GVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDVSQA
Sbjct: 721 GVQLQSPRNLSPSKLNRDSVRNLTSQVTNKGEIKGNSDFRPKKGNKETVSEKNSSDVSQA 780
Query: 781 GWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGT 840
GWRPHDQS GVRAVDTA RADKHGDIGRGTKHTEK GHANENFHVFKDTFYGN +NEGT
Sbjct: 781 GWRPHDQS--GVRAVDTATRADKHGDIGRGTKHTEKSGHANENFHVFKDTFYGNPDNEGT 840
Query: 841 KEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA 900
KEKKVSKNSRSGGPGDKQIQP DSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Sbjct: 841 KEKKVSKNSRSGGPGDKQIQPLDSHHSKPGEIVGKFKDGQTFSSSQMGYSPRDNNNRVSA 900
Query: 901 NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSD 960
NRSPVNGKGR LQRE SDLELGELREPF EEARGKKKFERNNSLKQLENKENTTDIW SD
Sbjct: 901 NRSPVNGKGRILQREPSDLELGELREPFHEEARGKKKFERNNSLKQLENKENTTDIWGSD 960
Query: 961 LNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQY 1020
LNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDS R+N+RSL SH QY
Sbjct: 961 LNKGKSNLKASLEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSNRINNRSLLSHSQY 1020
Query: 1021 NSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLA 1080
NSR+DH EVDKS D NVKPNQG GPEG ESNRKASVGISQLND KREQ PSKKGSKR A
Sbjct: 1021 NSRIDHAEVDKSADGNVKPNQGNGPEGYVESNRKASVGISQLNDTKREQPPSKKGSKRQA 1080
Query: 1081 PNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKE 1140
PNPITEVTD LKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKE
Sbjct: 1081 PNPITEVTDGLKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPELKGAIKDFSQYKE 1140
Query: 1141 YVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST 1182
YVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Sbjct: 1141 YVQEYHDKYESYLSLNKILESYRTEFCKLGKELDSARGQDSEKYFNVLGQLKESYRLCST 1194
BLAST of HG10002324 vs. ExPASy TrEMBL
Match:
A0A6J1K6B7 (dentin sialophosphoprotein isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 1861.3 bits (4820), Expect = 0.0e+00
Identity = 1018/1217 (83.65%), Postives = 1079/1217 (88.66%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
NELSQ +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 301 IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
LKALEKAVGDKIPN+VKKIEPIIKK++ + + VELEGSKKP+SEGESSPL
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVELEGSKKPSSEGESSPL 420
Query: 421 ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
+SH QTPVHED DQ PE QLEAR IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421 VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480
Query: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
AEKKGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481 AEKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540
Query: 541 DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
DAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA IDDEKED
Sbjct: 541 DAPSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600
Query: 601 GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
G SDAIDIEKDSSDDEP+AKIDD SL EGGR VEE RS SPYPDEFQERQNFIGSL
Sbjct: 601 GHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSL 660
Query: 661 FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
FEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661 FEDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720
Query: 721 WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
WGVQLQS NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721 WGVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780
Query: 781 AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
A WRPHDQS GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781 ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840
Query: 841 GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Sbjct: 841 EMNEKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRI 900
Query: 901 SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901 SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960
Query: 961 SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHP 1020
S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR QSHP
Sbjct: 961 SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020
Query: 1021 ---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKG
Sbjct: 1021 QGSQYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKG 1080
Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
SKR APN I EVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140
Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1194
SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESY 1200
BLAST of HG10002324 vs. ExPASy TrEMBL
Match:
A0A6J1KF98 (dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 1861.3 bits (4820), Expect = 0.0e+00
Identity = 1018/1217 (83.65%), Postives = 1079/1217 (88.66%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
NELSQ +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 301 IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
LKALEKAVGDKIPN+VKKIEPIIKK++ + + VELEGSKKP+SEGESSPL
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVELEGSKKPSSEGESSPL 420
Query: 421 ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
+SH QTPVHED DQ PE QLEAR IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421 VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480
Query: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
AEKKGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481 AEKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540
Query: 541 DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
DAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA IDDEKED
Sbjct: 541 DAPSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600
Query: 601 GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
G SDAIDIEKDSSDDEP+AKIDD SL EGGR VEE RS SPYPDEFQERQNFIGSL
Sbjct: 601 GHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSL 660
Query: 661 FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
FEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661 FEDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720
Query: 721 WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
WGVQLQS NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721 WGVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780
Query: 781 AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
A WRPHDQS GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781 ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840
Query: 841 GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Sbjct: 841 EMNEKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRI 900
Query: 901 SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901 SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960
Query: 961 SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHP 1020
S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR QSHP
Sbjct: 961 SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020
Query: 1021 ---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKG
Sbjct: 1021 QGSQYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKG 1080
Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
SKR APN I EVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140
Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1194
SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESY 1200
BLAST of HG10002324 vs. ExPASy TrEMBL
Match:
A0A6J1KCU5 (dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111492713 PE=4 SV=1)
HSP 1 Score: 1860.1 bits (4817), Expect = 0.0e+00
Identity = 1015/1205 (84.23%), Postives = 1074/1205 (89.13%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTA 60
MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA
Sbjct: 1 MYGGPSKLARAGGGAGRGASGKRPPSSFPLPPAHRPSGRLSLGGGGAGSGANPRNRTSTA 60
Query: 61 TTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKFDANAKNSS 120
SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKRVE+QGGTPRIKFDANAKNSS
Sbjct: 61 AKSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIEEIKRVESQGGTPRIKFDANAKNSS 120
Query: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTT 180
GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTT
Sbjct: 121 GNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLVESGNAWRKVNVQRILDESTT 180
Query: 181 NHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK 240
NHVKKLSEEAERRSKSRRAIVLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQK
Sbjct: 181 NHVKKLSEEAERRSKSRRAIVLEPGNLSMKNQIKQLAAAESNPWRMHYKNKKEPPFKKQK 240
Query: 241 NELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED 300
NELSQ +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Sbjct: 241 NELSQVGPPKSTFKPGMSSVPASKERLSSSPVPSPPEQFGAPISQFGSANPTKTHCIAED 300
Query: 301 IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMS 360
I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMS
Sbjct: 301 IKPRQPAKINAAASSEKEIPTKAAKGVLEAPGQEVNAGAKPTDLQGMLYNLLLDNPKGMS 360
Query: 361 LKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPL 420
LKALEKAVGDKIPN+VKKIEPIIKK++ + + VELEGSKKP+SEGESSPL
Sbjct: 361 LKALEKAVGDKIPNSVKKIEPIIKKIATYQAPGRYCLKS--EVELEGSKKPSSEGESSPL 420
Query: 421 ISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLF 480
+SH QTPVHED DQ PE QLEAR IELEEKVETSQANKESNFLEKNG+QQ+SPD F
Sbjct: 421 VSHQQTPVHEDFHDQ-PVPESQLEARHVIELEEKVETSQANKESNFLEKNGIQQNSPDPF 480
Query: 481 AEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSES 540
AEKKGSENSEG+AA+SSDNESDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSES
Sbjct: 481 AEKKGSENSEGEAATSSDNESDSDSESDSSDSGSDSGNRSRSRSRSPVGSGSGSSSDSES 540
Query: 541 DAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED 600
DAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA IDDEKED
Sbjct: 541 DAPSNSKEGSDEDVDIMTSDDDKEPKHKLQASVQGFSASPAAWKSPDGGAVLNIDDEKED 600
Query: 601 GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSL 660
G SDAIDIEKDSSDDEP+AKIDD SL EGGR VEE RS SPYPDEFQERQNFIGSL
Sbjct: 601 GHESDAIDIEKDSSDDEPEAKIDDRSLPPTGEGGRTVEESRSLSPYPDEFQERQNFIGSL 660
Query: 661 FEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGN 720
FEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE + HTKRLK ES +QQPVSGN
Sbjct: 661 FEDRENTVVDSGRHEQSDSTDRISKGKSKRSSELECFEENAVHTKRLKLESSSQQPVSGN 720
Query: 721 WGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQ 780
WGVQLQS NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQ
Sbjct: 721 WGVQLQSSRNLSPSKLNRDSARNPTNQVTNKGELKGNSDFRPKMGNKEIVSEKNCSDVSQ 780
Query: 781 AGWRPHDQSGGGVRAVDTAARADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENE 840
A WRPHDQS GVRAVDTA R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE
Sbjct: 781 ASWRPHDQS--GVRAVDTAVRPDKHGESIGRGGKHSEKGGHANESFHAYKDRFYGNAENE 840
Query: 841 GTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV 900
EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Sbjct: 841 EMNEKKVSRNSRSGGPGDKQIQPSDSHLSKPGDIVGKFKDGKTFPSSQMGYSPRDNNNRI 900
Query: 901 SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWS 960
SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWS
Sbjct: 901 SADRSPVNGKGRILQREHSDLELGELREPFPEEALGKKKFERNNSSKQLENKEHTSDIWS 960
Query: 961 SDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHP 1020
S+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK SEH VED TR+NHR QSHP
Sbjct: 961 SELNKGKSNLKASLDNGKRSSPHISTKFPSNPEGSNKKKISEHKVEDLTRINHRPPQSHP 1020
Query: 1021 ---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKG 1080
QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKG
Sbjct: 1021 QGSQYSSRVDHVEVEKPVDANVKLNQGIGPESCGESNRKASVGMSQLHDMKREQLPSKKG 1080
Query: 1081 SKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDF 1140
SKR APN I EVTDALKNP+SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDF
Sbjct: 1081 SKRQAPNQIIEVTDALKNPISAEHENSDLKRRDSSSDENSCSYSKYEKDEPELKGAIKDF 1140
Query: 1141 SQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY 1182
SQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Sbjct: 1141 SQYKEYVQEYRDKYECYLSLNKILESYRAEFCKLGKELDSARGQDSDKYFNLLGQLKESY 1200
BLAST of HG10002324 vs. TAIR 10
Match:
AT3G21290.1 (dentin sialophosphoprotein-related )
HSP 1 Score: 540.4 bits (1391), Expect = 3.6e-153
Identity = 490/1265 (38.74%), Postives = 668/1265 (52.81%), Query Frame = 0
Query: 1 MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPS--GRLSLGGGGAGSVANPRNRTT 60
M+ GSSK G G G G+G R +SFP P + PS GR+S GGGG GS A PR R+
Sbjct: 1 MFKGSSKRGGRGGSGGGGSGPSRNRNSFPPPTNRHPSPIGRMSSGGGGGGSAA-PRQRSN 60
Query: 61 T------ATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRIKF 120
+ A+T+ + ++VEE F+LV + AF MIIRL+PDL+DEIKRVEAQGG +IKF
Sbjct: 61 STSVKAAASTTVSSRTVEETFNLVPRESSSAFGMIIRLSPDLVDEIKRVEAQGGAAKIKF 120
Query: 121 DANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQ 180
DA NS+ N+I+VGGKEF+FTWS E G+LCDIYEE +SGEDG+GLLIE+G AWRKLNV
Sbjct: 121 DAFPNNSTENIINVGGKEFKFTWSGEKGELCDIYEEHQSGEDGNGLLIEAGCAWRKLNVL 180
Query: 181 RILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKK 240
R LDESTT+H+K S EAE+R+KSR+AIVL+PGNPS+ KQLA AE +PWR K KK
Sbjct: 181 RTLDESTTSHMKMRSVEAEQRTKSRKAIVLDPGNPSV---TKQLAHAEGSPWRMSNKQKK 240
Query: 241 EPPFKKQK--------------------NELSQDRLSSSPIPSPPEQSGAPVSQFGSANT 300
EPP KK+K ++RLS+SP PSP Q P +G N
Sbjct: 241 EPPPKKRKVDPPPVPVGGPKPSFRPGASTPTMKNRLSASPGPSPSNQYNTP--PYGIGNM 300
Query: 301 TKTHVIAEDIRP-RLPAKINSAASSEKEIPTKAAKGVL-ETPGQEGNSGAKPTDLQGMLY 360
KTH E++ P + ++N EKE P+ VL +T G+E + K DLQ +L
Sbjct: 301 AKTHAANENVTPVQTKGRVNMI---EKE-PSAWKNNVLRDTSGREAINVNKEIDLQSLLV 360
Query: 361 NLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSK 420
++L E P MSLKALEKAVGDK+PN KKIEPI+K++++ + + ELE K
Sbjct: 361 DILKEAP--MSLKALEKAVGDKVPNPAKKIEPILKRIANFQAPRYFLKPE---AELESYK 420
Query: 421 KPTSEGESSPLISHHQ-TPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNF-- 480
K + + SSP H Q PV E DQ+ P G EK + N E +
Sbjct: 421 KHSPDSGSSP--EHQQLLPVTECSRDQLPVP--------GRNNTEKFSLCEQNGEGSLDC 480
Query: 481 -----------LEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSD 540
E ++ HSP +F E+K SEN E QA SS SDSDS+SD+SDSGSD
Sbjct: 481 LPVHLVEQLSTQENVDIEHHSPGIFHEEKRSENREAQARSS----SDSDSDSDNSDSGSD 540
Query: 541 SGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQ- 600
S+S GS SGSSSDSE A SNSK+GSDEDVDIM SD D+E Q+ Q
Sbjct: 541 --------SKSAAGSDSGSSSDSE--ASSNSKDGSDEDVDIM-SDGDREPLLTTQSLEQD 600
Query: 601 -----GFSTSPAAWKSPDGGAAQIIDDEKE----DGQGSDAIDIEKDSSDD----EPDAK 660
G +S + + A I + + DG GSD +D+E +SSD+ + D K
Sbjct: 601 AIDLPGHGSSAVEIEGHNSDAVDIDGHDSDAVDIDGHGSDTVDVEGNSSDEGHGSDADRK 660
Query: 661 IDDSSLLRIE---------EGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSA 720
+ + ++E G + F+ D +ERQNFIG LF+D ENT ++
Sbjct: 661 KNSDNNWKMETTTGTSPTANGEVGISGQEHFTSGHDNLRERQNFIGQLFDDTENTTKNNF 720
Query: 721 RHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLS 780
++++ D + R+ K +++++ D E +KS H K KS+S Q V S H
Sbjct: 721 KNDKRDISERLGKDQNQKALDFEHYSQKSAHEKNRKSQSCNQLS-----AVSKDSQH--- 780
Query: 781 PSKLNRDS-VRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGG 840
S+L D+ +RN ++ T P +G ++ EK++
Sbjct: 781 -SELKYDAELRNASASQT----------IDPLRGLLKSSIEKSNRH-------------- 840
Query: 841 GVRAVDTAARADKHGDIGRGTKHTEKGGH------ANENFHVFKDTFYGNAENEGTKEKK 900
+++KH D + ++KG H ++ + F+D N ++ + K
Sbjct: 841 --------GKSNKHSDALGNVRKSDKGDHFPLEMLSSRSGKAFRD----NQRDDVHLKNK 900
Query: 901 VSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSANRSP 960
+N + G + P ++ KP E+ G KD + S +G SP D+ A
Sbjct: 901 FPRNKKDGESAIRPSLPTETSDRKPDELDGSDKDPKNVSGLSIGSSPLDSQRTYLAKLP- 960
Query: 961 VNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKG 1020
G G LQ+++S+LELGEL EP E+ K E S +Q K +T++ D +K
Sbjct: 961 -KGNGPVLQKQVSELELGELPEPLGEDT-ALKPIEEKTSFRQSNLKPSTSEKLGIDSDKR 1020
Query: 1021 KSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQYNSRV 1080
+S S K+++P P GSN EH+VEDS R +LQSH Q +
Sbjct: 1021 RSKKSDS----KKAAP------PHTVNGSN-----EHVVEDSERSQKWALQSHGQNLTGT 1080
Query: 1081 DHVEVD-----------KSVDANVKPNQGIGPEGCGESNRKASV--GISQLNDMKREQLP 1140
D E+ KS + + G EG GE+N+K V S+ R
Sbjct: 1081 D-TEISSQNKNLEDAAYKSRQKDSRARVGNSVEGYGETNKKTPVVKHGSKRASTSRSSRE 1140
Query: 1141 SKKGSKRLAPNPITEVTDALKNP-VSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKG 1177
SK+ S N I DA P S RE K+ S +E+S SY KYEK PE KG
Sbjct: 1141 SKRHSS--VSNSINGHKDATSIPGGSVVRE----KQMTSFGEEDS-SYLKYEKASPELKG 1154
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038883601.1 | 0.0e+00 | 90.25 | dentin sialophosphoprotein isoform X1 [Benincasa hispida] | [more] |
XP_008447590.1 | 0.0e+00 | 89.17 | PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo] | [more] |
XP_004146856.1 | 0.0e+00 | 89.25 | dentin sialophosphoprotein isoform X1 [Cucumis sativus] >KGN59835.1 hypothetical... | [more] |
XP_023524752.1 | 0.0e+00 | 83.98 | dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_023524753.1 | 0.0e+00 | 84.56 | dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A1S3BIQ1 | 0.0e+00 | 89.17 | dentin sialophosphoprotein isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490006 PE... | [more] |
A0A0A0LCU6 | 0.0e+00 | 89.25 | Occludin_ELL domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G84991... | [more] |
A0A6J1K6B7 | 0.0e+00 | 83.65 | dentin sialophosphoprotein isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
A0A6J1KF98 | 0.0e+00 | 83.65 | dentin sialophosphoprotein isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
A0A6J1KCU5 | 0.0e+00 | 84.23 | dentin sialophosphoprotein isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC11149271... | [more] |
Match Name | E-value | Identity | Description | |
AT3G21290.1 | 3.6e-153 | 38.74 | dentin sialophosphoprotein-related | [more] |