HG10005552 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10005552
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionnucleolar complex protein 4 homolog
LocationChr07: 3579513 .. 3592599 (-)
RNA-Seq ExpressionHG10005552
SyntenyHG10005552
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTCCATTCCCTCAAACAAGAAGAAGACGAAGAATCACAAGCTTTCAGACGTCAAAACCCTAGGCCTCCAACTTCTCTCATCTCGAGCTCACATCAACAACCTCGCTTTGCTTCTCACCTTCGTTTCTCCTTCTTCTCCTCCTCCCTATGTCCTTGAAGCCCTCCTCTCTCTTCAGTCCTTCTTCATCACCAACCTTCCCTCCCTTCCTTCATCCTCCTCCAAGCCTGCCGCCGCCGGAGACGACGTTCAGGTCGACGCCGAATTCATTTACCGAACCTGGCTCCGTTCCAAGTTCGATGAACTCGTTAAGTCGCTCATTGATGTGGCGGTTTCTTCCGAATGCGACGACACTCTCAAGGTGGTGGAATTTCGTGGCTTTTGATTGGTTGTGTTGTTTGTTACCTGTATTGGTTTGTGTTTGGTGATTTAGTTTAATTATGTGTTTTTGTTTGTTCCGTTGTTGTGCGTGGTTTAGGAGATTGTGTTGGATGCGATTATGGAGTTTGTTAAAGTTGGTAACAAGGGGAAATTTCATTCTGCTGTATACCACAGGTTTCTACAGAGTATCGTAAGTTGCTTTGTTATCGTTAATTACTAGATTTTTGCTGTGTTTCTGATTGTGCTAGAGCTGTGCATTTAACCTTTTCTCTTACATGTAGGCGCATTCTTTGACTCCAGTTGATACTCTGATAGCCTTGCTTGTGAAAAAGTACTTCAATCACCTCGATGTCCGGTATGTTTTTCTTCTTCTTCTTTCCCCTTTAACATCCTGTGCCTACATGTCATTCGAAAGTTTATTTTTCTCTAGCATAATGTTCAAAATATTTTCTTCTTGTGCACTCACATGACTGGGATTTCATTTTAACGCTTATTCCTGTCCTCTTTAGCTTTGTATATTGCATCATTTTTCCTACATTGTTGATTCCTATGCCATAAAAGAATGCACGTCTTATCATATCTCAACATTTTTGTCGAATTCTTGAGAGACATGCTTTAGAATGCCTATGAGATCCATTTTAACCTGCTTATCTCTTGTGCAGTTATTTTACATATATTAGCATTAAAGAACTTGACAAGACTTTCAAGGCTGAGTACATGTCTGGTATTTTCTCCCACCTATTCTTGACATTTTGTTATTTTTATTTTCTGAATGTTCCGTTCAATGAAACTATCTTCTTCTTGTAGGTGATAAAAATGGTAGGATTAATGGCGATGATGGTGGTCACTCAAGAGAAGGGTAAATGTTATTTGATCTCAAATCATTGTCCTAAAACACTGCTTCTCTTCAACATTTTTATAAGTCATTATTCTTCTATATATATGCTTGGGATTTTCTCCTTTTTTGAAAACGGAGTATATTAATACTGTATTTACCCCTACGTATATAAGATGAGGGAGGGAGTGAACCTACTTTGTCTCAGGGAATAGGGTAAAAACTCCATTGTCTACCGGGAGGGGAACACGTCATATATTTTTTTTAGCTCCAAAAATTAAGAGTGGGGATTCAGACGCCTGAGATTTAAGAACCATTGGCACATTAGTTTGTTAATTTAATAGAGTGTAATTTCTTCAATTCTTTGTCAGGCCATATGATATTTAGATCACAACGAATATATTCTTGACCACCTTAAACCACAATTTTGTGCTTTGCATGATCCCCATTGAATAATAATTTAAAGGCACTAGAATTATGAAGTCTTTTAAAAAATTGGTCCATCACTGGTGACGTTTATATTGAATACATGGAACTTACCTTTTTCCATGCAGAGTGGAGTTCATTCACATTGTGCACTCTATCTTATCCTCCATTCCCCCTTTGGAAAACTCAAATGAATCTGACTACACTTTGTGGGTTGAATCAGGTAATCTCTACAGATTTCGTTTTTGAGCTTATAGATGTTTGTTTTTTTTATACACGTCTTGAATTCTTAACCAGTTGAATTTAATGTCTCTAACTTTCTCAAAGGGTGTTTTGATGGAATTTTTTTGAGACATAAACAAACTTTTTTAGTTTCATATTGACATTTAATTTTGTTTGTTTATTAAAATGGCATCATTAACTCCAATTTAAGATAAATAATTTCTACTATGGCCAAGGTTGGCCAAAGTTTCAATTTTATACTTGGGCCAATTTGAGTGATTGGATTTGTAGGTTATTCCTTCCTTTTATAAACAAATTCACACCTGGAATCTTTGTTTTTTATGTTTAAACCCTCATGGAGAAGTGCTCTAGATTGTTGTTAAGAGACCCTCCAATCAAATGACTAATAAAAAGACACTTTTGTCCAGCTGGAGAAGTTTTAGCATTTATAGTCTGAACATTTTTTTATCCAGCAGGAGAAGTTTTAGTATTTTGAGTCTAACCACTTTTTATCCCGCTAGAGAAGTTTTTTTGTAATCTCTATGGATGGCATGCCCTTTTGTATATTTCAATTGTTACCTGTCCAGCAAAAAAATGTAGTAAAAAGATCATTCAAGAACTGCTGATCAAGGTTCTAAATTTACATTTTTATGAATAAAGAGTCATGGCATTGAGCATTGCAATTATCTTCTGTGTTTACGGTCAGCACTCCTTTTATGGATCAGGGTTTTTGCTTGAGTTATTATTTTTCCAATTGAAGGTGATGACAAAGTGCTTTCTGACAATCAAGAAGCAAAGCAGCTTAAGATGAAGAAAAATGATGAAGAGGTCCTTACAACTAGTTCTTTCTTTATCCATTTTTATTCTCTCTTTTGGGTATTGGATTTGGCTGCAATGAATTCTCTCATTGTTTCTTCTTTTAAAGAATGTTATTAACACCCTTCAGGTCTTATCTGCATCAAAGATTGTTAGAAGAATGAAACTAAAATTTTCAAAAGCATGGATTTCATTTCTTAAGTTACCACTTCCAATTGATGTGTACAAGGAGGTAACCTAACCACTCTGCTCTTTATTTTTGGCTCTGTATAGAAGCACTTGTTATTACGAAATTGGTCTGTATATCATGGTTTTGCCACCTGTTATAATTGACCTGACTACTCAAGAAGTCAATATTGTTTCTGGAAACAATTTTACATCTCATTTTACCTATGATGAAGTTATTTTATTTTGTGGGGAGCGCACAATGCAGTTATGCTTGATCTGGATGGACAATTTATATCCATACAATTTGCTGCTGTTTGGAAACAGTGTTAAAAGTTGCTCCCCACTGTTTCCAATTTGTGTTCTAATATAAATATTTGGACAGTAATTCTTTTACGTTGGTTTTGGTTGAGTTATTTTTAATAGTTACCTTTAGCTTGAGAAGAAATTTTCTAGTGTTCTTAATTATACTAGAAACACAAGTGACGTACAGTAGGCTCAAGAACATGGTACGCCATGAAATAGTGTAAGAATTCCCGTTTCTTTGGGTAACATTTTCATTATAAAAAAAAAACCTAAAAGGCTTATTTTCTGAAAACCACACCAGCAGTAACTATTTTTTAGCTTTATTTTGTCTGGAGAGTAGACCTATAAAGCAAATCCTGCTGGTCTTTTGCAAACTTCTTGTGATTGATCCCCATCTAATTTAGGTAAGTAATTGTATTGTTATTTTTCTCGCTGACTTTATTATCATAGTTTCTCTCTGTGTTTCTCGACGATACCTCTTGTATCATTGTTCCATAGTGTTGTCTTTTTCTTCCATTAATTATTCTTGATAAATACATGGGTTGGTTATTCAGAAAATCTTCTATGATTGTCTGTGTAACAATCTACCCGTCCCCATTGTCTTCACATTGTAATTCTTTTTATCCCTTTTTTGGGCTTTGTTGTCATCTAATATCATTTTGCTCATCCTTTTGCCACTCAGGTTCTTGTAATTCTTGATCAGGAAGTCATTCCTTATCTTTCTAAACCAATCATATTATGGTTAGTGATAATATGGCTTGTATATTCTCTCTTATAAATAAAGCTTGTATTTCTCATTTAACACTTTCGTTACATAATAAAATGCAGCGACTTCTTAACAAAATCCTATGATATTGGTGGCGTCGTCAGTGTTATGGCTCTTAGCAGCCTCTTCCTCCTAATGACAAAATATGGTTTAGAGTATCCAAACTTCTATGAAAAACTGTATGCTCTATTGGTTCCTTCAATATTCATGGCAAAACACCGGGCCAAATTTTTTCAGGTAAGTTGAGTGGTCTGTCATTTTTTTTTTTTAAATTATCTTTTTCAGAAATTATTTTTTAGGAGTATGGATAGCAATTTCAAGTATCTCTTGGAATTTTAAATGCTCTTCAATCTACGGAGTATAGTTATATCCTAATTTAAAGACTATTTGGTTGCCTATCTCACCTAGTTTGAGTTAATTCCCATAGATAATACAATGTAAAGGGAAGGGTTTTTGGCATGCTTTTCTTGGAGTCTTTGGATTGATAGAGATCTCGAAAGAGGTTTGGGATACTGTTAAATTTAATATATCCTCTTTGCGGATGTCAATTTCTAGGTTATTTATCAGTTAGGTTATTTATTATTATTATTAAATATTGGAGTCTGTTTGTACATGGCTAGAGTTTTGACTCCTTTGGCAAGCCGGTTTTTTACATGCACTTGTGTATTTTTTTTCAATTGTCTTACTGAAAACTTGGTTTCTCATAAAAAAATAATTGTTGCAGATTGAGAAGATGATCAAATTTCGAAGCTACTCCTGTTTATTTATTTGTATTATATTATTTTGTACATAGGACACGTATCCATATCCTAGGGGATCATATGTCTACAACGGTGAAAACCAAGTTGTATTATGACTAATGACTTTTCTAAATGTATGGTTCAATTTCGGTGATTCCTTAACATTTTTATACCATTCTTGGCAGCTTCTTGATTCCTGCTTGAAGTCGCCACTTCTTCCAGCATACTTGGCTGCTGCTTTTGCAAAGAAATTGAGTAGGCTATCACTTGTTGTTCCTCCTTCAGGAGCACTTGTCATTATAGCTCTTATTCACAATCTCTTACGAAGACATCCCTCAATCAACTGTTTGGTTCACCGGGTAACTAATTTTCTTCCTTTGATTAGGTACATTATGTAGTTTTCCAAATAAGATGTTTAGTTAATGGTTTACTTTTTATACATATAGGTTCTTTTTCTTAATTTGTAAAGCCATTTTATATGAGCAGCTAGCTTTATTACAATAAGATAACTTTCTTAACCAGAAAGTAGGTGGTCTGCAAGGATCCTCAATAGATAACTATCGCCATAAATAAGTGCCTTAACCAGAAAACATGATCGTTCGGTGGATTTGGATTTCTATTTGAAAACTGTCCTTGACTTTAGGATATTGGTTATGATTGAAGTTAGCACTGTTTTTAATCTTATTGTTATAGATAGATTCTACAACCTGCATCAACTTCTCTTGATTTCTTTTGGATGTAATGTATGTATGATGTGAGCAGGAAAATGTTAGCGAGAGTAAGAATGATGATTCGACAAGTGAAGAGGTTGCTAAAGGCACAGATGCTTCAGAGGTTGATGCTGATACACCCAACATGAAGCCAGGCATCGACCATTTTAACTACGAGGAAACTGATCCTATTAAATCTAGTGCCTTGAGTAAGTGTTGAATTCACTTAATGAATTGTCTAAACCATTCCTTGGAGATTGTTATGATTAATGAATTGTAGTTTACCTTTCTAGCATCTTTTTATTTATGAATCTCCAGTCTTGGTGCACTAGATTGGTGGTTTCTTTTTCTTTTTTCGTTTTCTTTTTATCCTTTTGACTTTGTAAATTGGTGGTATTTTTTCTCTGCTTTCTTATATTCTTTCATCATGTGTTTGAAAATTTATTCCATGCCATGTGGATTAATTAGTATTCCTGTGGTGAGCTGTTAATATCACCATCACTTGTCAAAATTGCTTTCATGGTTTACTCAAGATTTTTTAATTCATATAACCATCATGGGTTGGCCTAGTGGTAGTGGGAACATCAAGAAAAGGCCAAAGGGCTAAGAGATAATGAGTTCAATCCATGGTCGCCACTTACCTAGGATTTAATACCCTACGAGTTTCCTCGATACCCAAATATTGTAGGGTTAGGCGGGTTGTCCCATGAGATTAGTTGAGGGGCACGTAAGCTGGCCCCGGACACTCACGGATATCATAAAAAAAAAAAAAAGATCTTTTAACTCGTATTTAAGTCTTGTCAGATTTAATGGCTGAAAATACCGTTAACTACGTGTCATGTGGTTGTTTGTTATGCAATAGTCCATGTTTGCTCAATGTTTAGATTATTCTGTTAGGTACTAAATATTTAGTTATAAGAGGTATAAGGGTAATTAGATAAATAGGTAGTTATTGAACAAAGGTAATTATTGGATGTTATAAATAGAGGGAGGGTAAGTGAGTGAGGGGATGTTATTCTATTAAGTGATCTTGTTCTTGTTAGGAAGTATCCTAACATGTTCCTTCTTGAGTTAATCCATGACATCGATGCTACTGGTGGATGCTTTCTTTTTATTTTGTGGTGGTTTAATATTTGATCCTTATTAACAAGAATTGTTGGTACAGAAAGCTCACTTTGGGAAATTGATAGTCTTCGACACCATTATTGTCCTCCTGTTTCAAGGTATGCAGTTTAATATTGATCCAGAAACTTCTATTAGTTTTGGATTACATACTGTAAACGAGTCCTTATGCTCTTGTAGGTTAGTTTTGTCGCTTGAGAATGATCTGACTGTGAGATCGAAAACAACTGAAATGGATGTTAAAGATTTTGTTGCCGGTTCATATGCTACAATACTCGGGCAAGAGGTAAGTCTCACTGGTTCAGTAATTCCGCTATAATTAGTTAGCTATTTTATTTATTCTTATAAGGTTGCAAATTGCCTTTTTTGAAATTGTAATGGTTATATTATTGACTAACTATTATTCCATAGAAAGAGTGGATGTTTTCTCCTTTGTTTGGGATGAATCTCCCAGCTATTACGACTTTTGATATCGTCTCTGAAATTCATTCGTTGCATCAAATTGGATAAAATGAAATTCGGATTTTCTTTTCATTCTGAATGGTATCTTTGGTTGCAGTTGAGAAAGAAAATGAAGCGAGTCCCTCTGGCATTCTACCAAGCAGCCCCTACTGCCTTGTTCTCGGAGTCTGATTTCGCTGGCTGGAGTTTCGATTATAAACACAGTGAGAAGAATATTGATGGTAGCGATCATCTTTCGGCTAAAAGACAGCGCGTAGAGAGCTCGTAACTCAGGTTTCTTCACCTCCAAATTCACATTGGGACTCTGTTTCCTTCCTCCAGTGAACTATTCCAGGAAGCATTGAAATATGGTAGAAAGACGAATTGCTTATATTACATCTCGCTTAGAGTTTATAAGAATCGCCAGACCGTCCTAGGTTAACGGTCGGAAGATTGTGAATCCTTGAAGGCATTTTATCACTCAAGAGTGGCAGGTGAATTTTTGCAGCCTTTTCATGCGCCATTCGTAATTGAACTACAAAGGGGAGAGGTAACATTTCCTTTATACCATTTTTGCTTTATTTATTCAGGTTACCCATTGGCGTCTTTAAGGTTATTAGCCAATTGTACACCTTTTTTTTTTAATTTTTTTTTTTGTAGTTTCTAATACTATTTTTATATTCCAACTTGAAGAAACTGACTATATTTGACTTAGAAATTGTACTTTAGAAATTTAATGAATTTAGCTTCAGTGGTGAGTTGAAAAACTTGATTATAGCTCTCATGAGCTGGGTTTTGATTAGTTTTGAAATGGTGCAAAGTTGGACTGCAATGAAACGATTTGAATCCCTTATTATTTTCATGCATTTTTATGAATTTGGCTCGCTTAACTAAATAGTTTTTCAATTTTTTTAATTAAAATATCATTTAGAAATGTCTAAACTTGGTTCGTGCTTCTAATCAATTTTAAAATCGGTCTGAAGTTATTTTTCATTTTATTTTCTATACAATAATTGCAGGGATGATAAATCACTTGTATATCATTGAGTTTCATTCCCTTTGACAAATTTAAGTTGGTTAGGGACAATATTTTAGTAAGTTAAATAATGAATTAATTATGGTTTATTTATAGTACAATAACTAAAGTTAGATGCTTAGAACGATCTTTTTTTTTTTCCAAGTACAATGGAGGGTGGAAAGCTTCAAACTACATATCTTGTGGTTGATAATACATACTATCTATCAGCAATGCAAGTGATAAAATTCTATTGCGATCTACGATCTATTATAGTAAAATAGTTTGACTTATTTTTCTATATTTAAGAACGCTAAAGATTGAAATGAATATTTAATAAAAAATTGAAGTTTATTGAAGTTAAAAGTGTAATTTTTAAACCTAGGGACTAAAAAAAGAACTTAAAAATTGTAATATTTTGAAACTTAAAGACTACATTAAAATCAATTTCAAAACTTAAGGCTTAAAAAAGGAAAATTTTGAAATTTAAGATTAAATAGAAATTAGATTTAGAACTTAGAGACAAAATTGTATTTTTTTTTCAATTATATATTAGATATCGTAAGGGAACAATGTACTTGGTTTTGAAAATATATAAATATAAAGAAATATAGAACCAAGCAATTCCATAACACCAAGTTTATCTTCCATAATTTTTTTGCGCCAAACGAAATTGTTCAAATAGCTAACCAATCTTAAATAATTTTAATTAAACATCGTTCATAAAATCATAACAACAATCTTTCTATAATTATTTTCAAAACTACCTCATTCAACAAATGTTCAAATAGCTAACCAAGTACATACTTTTTTTGCGCCAAACGCAATTGTTTGGAGAAAGATAGTAGCTTAGAAACAACAAATTATTATTAAAGAAATATATAGAAATATTATTATTATTATTGTTGTTGACAATATGTAGCGTGAAAAAATTGAAATTCAGAATTCAAGGTCAATATAGCACTATGCAAAAATTGACACCTAAAAAATCGACACAAATCTAAGCTTTAAAAATTATAATTCCATGATTTTAACACAATCCGTAATATTTAATTTTCTTCCCTTTCTTATTCTGCTTTGGGCTATCCAATTTATCCATCCATTTAAAAAATGCAACCCAAATAGTTTTGGCCACGTTTGTATTTGATTTTTTATTTTTGATTTTTGAAAATTAAGCCTATAAACTCTTCAACCACCTCTAAATTTCTTGTTTTTTATCTACATGATTTTACCATTCTTTTTGAAATTTAGCTAAGATTGCAATTCTATTATTTAAGTAAGATACTAATCATTGTAAGAAATTTGGAGAAAATATGCTTAATTTTCAAAAACCAAAAATAAAAAATAAAAGGGTTACCAGAGGGGCTTCACAAATCTACTCTCCAAATGCCAAAGAGGCCCAAAGAAAAAAAAAAAATTCACTAGGTAGTTCAACAATTGCAAGCAAGAGAAGATCAAATCTACCTTTGAGTCATTACTGAGCATAGCTCGTATTAACAAATCCTTGATGCTTCATCTTCTTGTTGGATTTTTGGTTTGATCAACTCTTCCATTTTCCGGCTAATCGTGCGGAAAAATGCATGTGGAAGACATCAAATGTTCTTGAATTGTTTGATTATCTCATATTCGCTAGGCTTCTGGGACCAGTATAGTATCAAAGATGAATTCAACTACAAAAGAGGATCAAGAATCAGACTCATTTTTCAATATGAAAATACCACATATACTACAGTATTATCAAAGCACTTTTATCATGTACAAAGTCACTCTATACATGCTTTGAATCATTAAAAAACAAATTTTAATGGTAAGGAAATCGTGTTTAAAAATGAAAAATTAAACATTAAAGTTTCTTATTAAAGGCATGTTTGCAAGTTATTTTGAACATGACAGAAGAAATTTTAATAATATCAAAATCGCTTTCAAATATGCCTTTGATCCAAAAAGAAAAAAAAGAAAAAAAAAAGTCAAATGGCTGCAAATCCAACAAAGTTTAACCATCCCTCTGACAACAGATGACCACTAGATGGCCAAACCAACAGAATTTTTTAGACATAAATTGCTTAATATACTTGTCATGAATTGAGGAGAAAGTTCAGAACAACTAAATTGAAGTATCAAACAAATTGTCAGTCAGTATAAATTGAGAAGAGATGACAGGTCTGGTAAAGGCCATCACAGACAGATATAATGGAAAAGCAAGACTTGCAGCTGTTTTTTATCATGTTCAATATTAAGTATATATTTGAAGTGATTTCAAATCCAGAGATTTTCCATACTCGAAAATCTCTTCCCTCTTTCGGAGTTACCTATTTGAAGGCTTACAAATAGCTTTGAAAAAGAATTTCTCTAACGAGAGAGATAATAATCTTCATCTTTTATAAAAAAGTCCACTGAGAGGTAGTGACTATATTTAAACCACTCAACCGTAATTGTAAAACGGTGGGGGGGAAGCCTCTGTAGTGGTAAATGAACACTTCAAAGTCAGAAGGACCACACATACTACTTCAAGATAGTGCTTCATTAAAAATTGGATAAAAGCCAAACAAAAGGTCCATTACAATGAAAGCCATTAGGATAAGAAAATATACAACACATGATCTTAGCCAAAAGGCCGAGAAACGTACACAATGAAAATGCGACAATAGAAGCAAGCATTGCACCCGGAAGAATGGAATCATTAACATTAAAGGTTGAACTCAAAATTTTCTAGCCTTCTAAGAGATAATCGTTTCTTATCCAAAAAAAAAAAACATTTTCTAGAGATAACACTTACCAAAACCCGTCTTACTTTCAAATATACAAATTACAAGGTTCTTTTAATTCGCTAATTTGATACTATTCTACTTACGAGTTAAGCAGTACACATTCATCAATAAAGTTCAATATACAATGTGCTGAAAGTAAGATCTACTAACCTTCCTCTTTCCGAATGTTAGGTTGGTCAACCAAGAGTTGCTGTAAGGAGCTGAGATCTTGCGCTCTATAATTCCTAACAATTCAAGATACACATATTTTCAGAGGGGAAGAAATGGGCCATAGTAATAACATACTTTGGAGAGGGGGGGGGGGGGGGGGGAAGTAGATGATAGTTGAAGGCCATTACTAGAAAAGGATGTTTCATATGCAACTATCAGAAGAAAATTGAAGCTCCAAAACCAAGTGCTTTTTTTCCTTACATTACCAACTTCATTAATCTTCTAATTAAAAGCAGAACATCCCATCAAATTTCTCAACTGAGAAAGAAATTACCTAGCTTTCCAGAAAAGTTCTGGAAGTGCTGAGAAGATAGATTCTTTAAGATTGTACCTAAAGAAATTGAAACAAGTTCATTAAGAAGCCAAACTATACATCCACTCCAAAACTAGGCAAATGAGATCTAGAAGCCAACATAGTTGATCTAAATTCGACTTTCATAGGTTAGTTGGTTTCCTCCTACCATGATCAGGAACCGAATTTTCAAATGAAAGAGTATACGAGGGCGTAAGAAACTAACCCCACAAAAAGGAGCCCCAAAACTAACTAAGAAACAAACTCCAATCCAAACCTAACATGATCAGCACCAAACCTAACATGACCAGTTTTTAATTTGCACTGCATACACACATTTGCTTTCTTGACATAAAAAGAGATGAGAAAAATTCTTACTCAGAGCGGTGCATGTCGCAGAGAAGCATTAACCCATTCACGCAATCTTCAAGGCTGGGTTTCAAACCAATTTTTTGTTGTTGTTGTTTTATTGATGGCTGATTGGAACCACCTTTGACTAATTGCTTCCCATCACGATAAATCTTCTCCATTGAACGGACAATATTCCCAAGTTCTTCCCTGAGATCTTTATTCAGATTCTTCATTCATGAAAACAAGATAGCATGAATATTTTAAAGGACTATTTGGACTTAAGAAACACAAACATGGATACAACACGACATAGACATGGTTGTGCCATTTTTTTAAAATCTAGGACATAGGCACGGCGATGACAATTTTATTAAAATTTACATTTTTGAATATATATATATCATTTTTATACAAGAAAGAAATTCAAGATCAATGAGTTTATGTATTCATATGCTTAAAAGATTAGTATGATGTATTTTGCTTTCAAATTTTATTGTTTTTTTTATTATATATATGTTTATTTAATATACTAAACAAGTGTTCGATGCATGTCTAACAAATGTTAAACCTACTTTGATTTTGTATAACTAGTATCATATTAACAAGTTTTAGATACGTGTCCAATAAGTGTTGTAGTGTACAAATATTTGACACATGTAGGACATAAACACACTAGCCAAACTAAAGTGTATGTGCTTCTTAGATTTGGACTGACTTCATGATCGTCAATTTTTTTTAGCTCAACAACATGTTCAGGTGAAAATTCAAATTTCTTTCCTCTTAGTCATAAAGGATATATAACTTAACCAGTTGACGGTGCTCATATTGTCAAGTTCATGATCAGCTATTATTTTTGTTTATGATTTTTCTTCTGGCATATCCTACAAATCACAATGTAAAGGCAAGAAAAGAAACAGAAAGGAAAACAAACTTACATTGTTTTCTTAATAGACATGATACTAAGTTGCAAGGATTCCATCTGCTTCCCCAATAACGCATCTACAATGCCATCCATGCCAGCCAGACTTCCAAAATTCTTAGGATCTTGAAGCATCTGACAATAAATTTTCAAAGATTCTAATCTTTAAACTAAAAACAAATACAAATACGTGTGATTAAAATCAGAACTGAGACTGACCTGCAGCCTCCCGATGATGGAGGAAGCATTTTGGAACTGCGAAATTAAACGAACTTGAAGTTCATCCCAGCGATCCATTTCATCTTTGAACTTTCTAAATCTTTGCTGATACTTTTTCACCATGGCCTCCATAGATTGGAATTTCAGACTACGAGTAGTCAAGCTGACGCGCCTCGTGGATTAGAGATTGTCACAACACCGCCGTTACTGAGAAGACAGCAGGGGAATCGCGCCGCCGTAGGTAGATTTGGGCGTCGTGGTTCGTGGATAGAGCACCACCGCTACAGGAAGGAGCTTGTGCCGCTGTGGGTTGGGCTTCACAGGCCGGTGTTAGTCGTCGCTGGCCTGAGCTGCTATGGGAAGAGAAATTGCCAAAATTAA

mRNA sequence

ATGGCGTCCATTCCCTCAAACAAGAAGAAGACGAAGAATCACAAGCTTTCAGACGTCAAAACCCTAGGCCTCCAACTTCTCTCATCTCGAGCTCACATCAACAACCTCGCTTTGCTTCTCACCTTCGTTTCTCCTTCTTCTCCTCCTCCCTATGTCCTTGAAGCCCTCCTCTCTCTTCAGTCCTTCTTCATCACCAACCTTCCCTCCCTTCCTTCATCCTCCTCCAAGCCTGCCGCCGCCGGAGACGACGTTCAGGTCGACGCCGAATTCATTTACCGAACCTGGCTCCGTTCCAAGTTCGATGAACTCGTTAAGTCGCTCATTGATGTGGCGGTTTCTTCCGAATGCGACGACACTCTCAAGGAGATTGTGTTGGATGCGATTATGGAGTTTGTTAAAGTTGGTAACAAGGGGAAATTTCATTCTGCTGTATACCACAGGTTTCTACAGAGTATCGCGCATTCTTTGACTCCAGTTGATACTCTGATAGCCTTGCTTGTGAAAAAGTACTTCAATCACCTCGATGTCCGTTATTTTACATATATTAGCATTAAAGAACTTGACAAGACTTTCAAGGCTGAGTACATGTCTGGTGATAAAAATGGTAGGATTAATGGCGATGATGGTGGTCACTCAAGAGAAGGAGTGGAGTTCATTCACATTGTGCACTCTATCTTATCCTCCATTCCCCCTTTGGAAAACTCAAATGAATCTGACTACACTTTGTGGGTTGAATCAGGTGATGACAAAGTGCTTTCTGACAATCAAGAAGCAAAGCAGCTTAAGATGAAGAAAAATGATGAAGAGAATGTTATTAACACCCTTCAGGTCTTATCTGCATCAAAGATTGTTAGAAGAATGAAACTAAAATTTTCAAAAGCATGGATTTCATTTCTTAAGTTACCACTTCCAATTGATGTGTACAAGGAGGTTCTTGTAATTCTTGATCAGGAAGTCATTCCTTATCTTTCTAAACCAATCATATTATGCGACTTCTTAACAAAATCCTATGATATTGGTGGCGTCGTCAGTGTTATGGCTCTTAGCAGCCTCTTCCTCCTAATGACAAAATATGGTTTAGAGTATCCAAACTTCTATGAAAAACTGTATGCTCTATTGGTTCCTTCAATATTCATGGCAAAACACCGGGCCAAATTTTTTCAGCTTCTTGATTCCTGCTTGAAGTCGCCACTTCTTCCAGCATACTTGGCTGCTGCTTTTGCAAAGAAATTGAGTAGGCTATCACTTGTTGTTCCTCCTTCAGGAGCACTTGTCATTATAGCTCTTATTCACAATCTCTTACGAAGACATCCCTCAATCAACTGTTTGGTTCACCGGGAAAATGTTAGCGAGAGTAAGAATGATGATTCGACAAGTGAAGAGGTTGCTAAAGGCACAGATGCTTCAGAGGTTGATGCTGATACACCCAACATGAAGCCAGGCATCGACCATTTTAACTACGAGGAAACTGATCCTATTAAATCTAGTGCCTTGAAAAGCTCACTTTGGGAAATTGATAGTCTTCGACACCATTATTGTCCTCCTGTTTCAAGGTTAGTTTTGTCGCTTGAGAATGATCTGACTGTGAGATCGAAAACAACTGAAATGGATGTTAAAGATTTTGTTGCCGGTTCATATGCTACAATACTCGGGCAAGAGTTGAGAAAGAAAATGAAGCGAGTCCCTCTGGCATTCTACCAAGCAGCCCCTACTGCCTTGTTCTCGGAGTCTGATTTCGCTGGCTGGAGTTTCGATTATAAACACAGTGAGAAGAATATTGATGGTAGCGATCATCTTTCGGCTAAAAGACAGCGCGTAGAGAGCTCATTGGAATTTCAGACTACGAGTAGTCAAGCTGACGCGCCTCGTGGATTAGAGATTGTCACAACACCGCCGTTACTGAGAAGACAGCAGGGGAATCGCGCCGCCGTAGGTAGATTTGGGCGTCGTGGTTCGTGGATAGAGCACCACCGCTACAGGAAGGAGCTTGTGCCGCTGTGGGTTGGGCTTCACAGGCCGGTGTTAGTCGTCGCTGGCCTGAGCTGCTATGGGAAGAGAAATTGCCAAAATTAA

Coding sequence (CDS)

ATGGCGTCCATTCCCTCAAACAAGAAGAAGACGAAGAATCACAAGCTTTCAGACGTCAAAACCCTAGGCCTCCAACTTCTCTCATCTCGAGCTCACATCAACAACCTCGCTTTGCTTCTCACCTTCGTTTCTCCTTCTTCTCCTCCTCCCTATGTCCTTGAAGCCCTCCTCTCTCTTCAGTCCTTCTTCATCACCAACCTTCCCTCCCTTCCTTCATCCTCCTCCAAGCCTGCCGCCGCCGGAGACGACGTTCAGGTCGACGCCGAATTCATTTACCGAACCTGGCTCCGTTCCAAGTTCGATGAACTCGTTAAGTCGCTCATTGATGTGGCGGTTTCTTCCGAATGCGACGACACTCTCAAGGAGATTGTGTTGGATGCGATTATGGAGTTTGTTAAAGTTGGTAACAAGGGGAAATTTCATTCTGCTGTATACCACAGGTTTCTACAGAGTATCGCGCATTCTTTGACTCCAGTTGATACTCTGATAGCCTTGCTTGTGAAAAAGTACTTCAATCACCTCGATGTCCGTTATTTTACATATATTAGCATTAAAGAACTTGACAAGACTTTCAAGGCTGAGTACATGTCTGGTGATAAAAATGGTAGGATTAATGGCGATGATGGTGGTCACTCAAGAGAAGGAGTGGAGTTCATTCACATTGTGCACTCTATCTTATCCTCCATTCCCCCTTTGGAAAACTCAAATGAATCTGACTACACTTTGTGGGTTGAATCAGGTGATGACAAAGTGCTTTCTGACAATCAAGAAGCAAAGCAGCTTAAGATGAAGAAAAATGATGAAGAGAATGTTATTAACACCCTTCAGGTCTTATCTGCATCAAAGATTGTTAGAAGAATGAAACTAAAATTTTCAAAAGCATGGATTTCATTTCTTAAGTTACCACTTCCAATTGATGTGTACAAGGAGGTTCTTGTAATTCTTGATCAGGAAGTCATTCCTTATCTTTCTAAACCAATCATATTATGCGACTTCTTAACAAAATCCTATGATATTGGTGGCGTCGTCAGTGTTATGGCTCTTAGCAGCCTCTTCCTCCTAATGACAAAATATGGTTTAGAGTATCCAAACTTCTATGAAAAACTGTATGCTCTATTGGTTCCTTCAATATTCATGGCAAAACACCGGGCCAAATTTTTTCAGCTTCTTGATTCCTGCTTGAAGTCGCCACTTCTTCCAGCATACTTGGCTGCTGCTTTTGCAAAGAAATTGAGTAGGCTATCACTTGTTGTTCCTCCTTCAGGAGCACTTGTCATTATAGCTCTTATTCACAATCTCTTACGAAGACATCCCTCAATCAACTGTTTGGTTCACCGGGAAAATGTTAGCGAGAGTAAGAATGATGATTCGACAAGTGAAGAGGTTGCTAAAGGCACAGATGCTTCAGAGGTTGATGCTGATACACCCAACATGAAGCCAGGCATCGACCATTTTAACTACGAGGAAACTGATCCTATTAAATCTAGTGCCTTGAAAAGCTCACTTTGGGAAATTGATAGTCTTCGACACCATTATTGTCCTCCTGTTTCAAGGTTAGTTTTGTCGCTTGAGAATGATCTGACTGTGAGATCGAAAACAACTGAAATGGATGTTAAAGATTTTGTTGCCGGTTCATATGCTACAATACTCGGGCAAGAGTTGAGAAAGAAAATGAAGCGAGTCCCTCTGGCATTCTACCAAGCAGCCCCTACTGCCTTGTTCTCGGAGTCTGATTTCGCTGGCTGGAGTTTCGATTATAAACACAGTGAGAAGAATATTGATGGTAGCGATCATCTTTCGGCTAAAAGACAGCGCGTAGAGAGCTCATTGGAATTTCAGACTACGAGTAGTCAAGCTGACGCGCCTCGTGGATTAGAGATTGTCACAACACCGCCGTTACTGAGAAGACAGCAGGGGAATCGCGCCGCCGTAGGTAGATTTGGGCGTCGTGGTTCGTGGATAGAGCACCACCGCTACAGGAAGGAGCTTGTGCCGCTGTGGGTTGGGCTTCACAGGCCGGTGTTAGTCGTCGCTGGCCTGAGCTGCTATGGGAAGAGAAATTGCCAAAATTAA

Protein sequence

MASIPSNKKKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPPLENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEVDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEKNIDGSDHLSAKRQRVESSLEFQTTSSQADAPRGLEIVTTPPLLRRQQGNRAAVGRFGRRGSWIEHHRYRKELVPLWVGLHRPVLVVAGLSCYGKRNCQN
Homology
BLAST of HG10005552 vs. NCBI nr
Match: XP_038887732.1 (protein NUCLEOLAR COMPLEX ASSOCIATED 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 1089.3 bits (2816), Expect = 0.0e+00
Identity = 572/614 (93.16%), Postives = 587/614 (95.60%), Query Frame = 0

Query: 1   MASIPSN----KKKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVLEAL 60
           MASIPSN    KKK K+HKLSD+KTLGLQLLSSRAHINNL LLLT+VSPSSPPPYVLEAL
Sbjct: 1   MASIPSNNHNDKKKKKSHKLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPPYVLEAL 60

Query: 61  LSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSEC 120
           LSLQSFFITNLPSLPSSS KPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSEC
Sbjct: 61  LSLQSFFITNLPSLPSSSFKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSEC 120

Query: 121 DDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDV 180
           DDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDV
Sbjct: 121 DDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDV 180

Query: 181 RYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPPLENSN 240
           RYFTYISIKEL KTFKAEYMSGD+N RINGDDGGHSREGVEFIHIVHSILSSIPPLENSN
Sbjct: 181 RYFTYISIKELAKTFKAEYMSGDRNVRINGDDGGHSREGVEFIHIVHSILSSIPPLENSN 240

Query: 241 ESDYTLWVESGDD-KVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSKAW 300
           ESDYT+WVESGDD   LSDNQEAKQLKMKKNDEE       VLSASKIVRRMKLKFSKAW
Sbjct: 241 ESDYTMWVESGDDNNALSDNQEAKQLKMKKNDEE-------VLSASKIVRRMKLKFSKAW 300

Query: 301 ISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFLLM 360
           ISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFL KSYDIGGV+SVMALSSLFLLM
Sbjct: 301 ISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLIKSYDIGGVISVMALSSLFLLM 360

Query: 361 TKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLS 420
           TKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLS
Sbjct: 361 TKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLS 420

Query: 421 LVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEVDADT 480
           LVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDS SEEV KG DASEVDADT
Sbjct: 421 LVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSKSEEVVKGADASEVDADT 480

Query: 481 PNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE 540
           PNMKPGIDHFNYEETDPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE
Sbjct: 481 PNMKPGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE 540

Query: 541 MDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEKNIDG 600
           +DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQ  PT LFSESDFAGWSF+++HSEKNIDG
Sbjct: 541 IDVKDFVAGSYSTILGQELKKKLKRVPLAFYQTPPTTLFSESDFAGWSFNHEHSEKNIDG 600

Query: 601 SDHLSAKRQRVESS 610
           SDHLSAKRQR+ SS
Sbjct: 601 SDHLSAKRQRIGSS 607

BLAST of HG10005552 vs. NCBI nr
Match: XP_038887733.1 (protein NUCLEOLAR COMPLEX ASSOCIATED 4 isoform X2 [Benincasa hispida])

HSP 1 Score: 1077.4 bits (2785), Expect = 0.0e+00
Identity = 566/614 (92.18%), Postives = 581/614 (94.63%), Query Frame = 0

Query: 1   MASIPSN----KKKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVLEAL 60
           MASIPSN    KKK K+HKLSD+KTLGLQLLSSRAHINNL LLLT+VSPSSPPPYVLEAL
Sbjct: 1   MASIPSNNHNDKKKKKSHKLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPPYVLEAL 60

Query: 61  LSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSEC 120
           LSLQSFFITNLPSLPSSS KPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSEC
Sbjct: 61  LSLQSFFITNLPSLPSSSFKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSEC 120

Query: 121 DDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDV 180
           DDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDV
Sbjct: 121 DDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDV 180

Query: 181 RYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPPLENSN 240
           RYFTYISIKEL KTFKAEYMSGD+N RINGDDGGHSREGVEFIHIVHSILSSIPPLENSN
Sbjct: 181 RYFTYISIKELAKTFKAEYMSGDRNVRINGDDGGHSREGVEFIHIVHSILSSIPPLENSN 240

Query: 241 ESDYTLWVESGDD-KVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSKAW 300
           ESDYT+WVESGDD   LSDNQEAKQLKMKKNDEE             IVRRMKLKFSKAW
Sbjct: 241 ESDYTMWVESGDDNNALSDNQEAKQLKMKKNDEE-------------IVRRMKLKFSKAW 300

Query: 301 ISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFLLM 360
           ISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFL KSYDIGGV+SVMALSSLFLLM
Sbjct: 301 ISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLIKSYDIGGVISVMALSSLFLLM 360

Query: 361 TKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLS 420
           TKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLS
Sbjct: 361 TKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLS 420

Query: 421 LVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEVDADT 480
           LVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDS SEEV KG DASEVDADT
Sbjct: 421 LVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSKSEEVVKGADASEVDADT 480

Query: 481 PNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE 540
           PNMKPGIDHFNYEETDPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE
Sbjct: 481 PNMKPGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE 540

Query: 541 MDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEKNIDG 600
           +DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQ  PT LFSESDFAGWSF+++HSEKNIDG
Sbjct: 541 IDVKDFVAGSYSTILGQELKKKLKRVPLAFYQTPPTTLFSESDFAGWSFNHEHSEKNIDG 600

Query: 601 SDHLSAKRQRVESS 610
           SDHLSAKRQR+ SS
Sbjct: 601 SDHLSAKRQRIGSS 601

BLAST of HG10005552 vs. NCBI nr
Match: XP_008447831.1 (PREDICTED: nucleolar complex protein 4 homolog [Cucumis melo])

HSP 1 Score: 1037.7 bits (2682), Expect = 4.4e-299
Identity = 555/620 (89.52%), Postives = 575/620 (92.74%), Query Frame = 0

Query: 1   MASIPSN-------KKKTKN---HKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPP 60
           MASIPSN       KKKTKN   H LSD+KTLGLQLLSSRAHINNL LLLTFVSPSSPPP
Sbjct: 1   MASIPSNDHNEKMKKKKTKNEKTHSLSDLKTLGLQLLSSRAHINNLPLLLTFVSPSSPPP 60

Query: 61  YVLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDV 120
           YVLEALLSLQSFFITNLP+LP SSSKP  AGDDVQVDAEFIYRTWLRSKFDELVKSLIDV
Sbjct: 61  YVLEALLSLQSFFITNLPTLP-SSSKPPLAGDDVQVDAEFIYRTWLRSKFDELVKSLIDV 120

Query: 121 AVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKY 180
           AVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIA S TPVDTLIALLVKKY
Sbjct: 121 AVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIARSSTPVDTLIALLVKKY 180

Query: 181 FNHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIP 240
           F++LDVRYFTYISIKEL K FKAEYMS        GD GGHS+EGVEFIHIVHSI+SSIP
Sbjct: 181 FHYLDVRYFTYISIKELAKIFKAEYMS--------GDGGGHSKEGVEFIHIVHSIISSIP 240

Query: 241 PLENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLK 300
           PLENSN+SDYT+WVESGD+KVLSD+QEAKQLKMKKNDEE       VL+ASKIVRRMKLK
Sbjct: 241 PLENSNQSDYTMWVESGDNKVLSDDQEAKQLKMKKNDEE-------VLTASKIVRRMKLK 300

Query: 301 FSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSS 360
           FSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIIL DFLTKSYDIGGV+SVMALSS
Sbjct: 301 FSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILSDFLTKSYDIGGVISVMALSS 360

Query: 361 LFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKK 420
           LFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKK
Sbjct: 361 LFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKK 420

Query: 421 LSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASE 480
           LSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENV ESKNDDSTSEE AKGTDASE
Sbjct: 421 LSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVGESKNDDSTSEEAAKGTDASE 480

Query: 481 VDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVR 540
           VDADTP MKPGIDHFNYEETDPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDLTVR
Sbjct: 481 VDADTPKMKPGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVR 540

Query: 541 SKTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSE 600
           SKTTE+DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQA PT LFSESDF GWSFDY+HS+
Sbjct: 541 SKTTEIDVKDFVAGSYSTILGQELKKKLKRVPLAFYQAPPTTLFSESDFVGWSFDYEHSD 600

Query: 601 K-NIDGSDHLSAKRQRVESS 610
           K NIDGSDHLSAKRQ + SS
Sbjct: 601 KNNIDGSDHLSAKRQCIGSS 604

BLAST of HG10005552 vs. NCBI nr
Match: TYK23279.1 (nucleolar complex protein 4-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 1036.9 bits (2680), Expect = 7.6e-299
Identity = 555/619 (89.66%), Postives = 574/619 (92.73%), Query Frame = 0

Query: 1   MASIPSN-----KKKT----KNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPY 60
           MASIPSN     KKKT    K H LSD+KTLGLQLLSSRAHINNL LLLTFVSPSSPPPY
Sbjct: 1   MASIPSNDHNEKKKKTTKNEKTHSLSDLKTLGLQLLSSRAHINNLPLLLTFVSPSSPPPY 60

Query: 61  VLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA 120
           VLEALLSLQSFFITNLPSLP SSSKP  AGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA
Sbjct: 61  VLEALLSLQSFFITNLPSLP-SSSKPPLAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA 120

Query: 121 VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYF 180
           VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIA S TPVDTLIALLVKKYF
Sbjct: 121 VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIARSSTPVDTLIALLVKKYF 180

Query: 181 NHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPP 240
           ++LDVRYFTYISIKEL K FKAEYMS        GD GGHS+EGVEFIHIVHSI+SSIPP
Sbjct: 181 HYLDVRYFTYISIKELAKIFKAEYMS--------GDGGGHSKEGVEFIHIVHSIISSIPP 240

Query: 241 LENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKF 300
           LENSN+SDYT+WVESGD+KVLSD+QEAKQLKMKKNDEE       VL+ASKIVRRMKLKF
Sbjct: 241 LENSNQSDYTMWVESGDNKVLSDDQEAKQLKMKKNDEE-------VLTASKIVRRMKLKF 300

Query: 301 SKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSL 360
           SKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIIL DFLTKSYDIGGV+SVMALSSL
Sbjct: 301 SKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILSDFLTKSYDIGGVISVMALSSL 360

Query: 361 FLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKL 420
           FLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKL
Sbjct: 361 FLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKL 420

Query: 421 SRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEV 480
           SRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENV ESKNDDSTSEE AKGTDASEV
Sbjct: 421 SRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVGESKNDDSTSEEAAKGTDASEV 480

Query: 481 DADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRS 540
           DADTP MKPGIDHFNYEETDPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRS
Sbjct: 481 DADTPKMKPGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRS 540

Query: 541 KTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEK 600
           KTTE+DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQA PT LFSESDF GWSFDY+HS+K
Sbjct: 541 KTTEIDVKDFVAGSYSTILGQELKKKLKRVPLAFYQAPPTTLFSESDFVGWSFDYEHSDK 600

Query: 601 -NIDGSDHLSAKRQRVESS 610
            NIDGSDHLSAKRQ + SS
Sbjct: 601 NNIDGSDHLSAKRQCIGSS 603

BLAST of HG10005552 vs. NCBI nr
Match: KAG7011815.1 (Nucleolar complex protein 4-like B, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1034.6 bits (2674), Expect = 3.8e-298
Identity = 546/616 (88.64%), Postives = 577/616 (93.67%), Query Frame = 0

Query: 1   MASIPS-------NKKKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVL 60
           MAS+PS        KK  KNHKLSD+KTLGLQLLSSRAHINNL LLLTFVSPS PP YVL
Sbjct: 1   MASLPSLNQSEKKKKKNEKNHKLSDLKTLGLQLLSSRAHINNLPLLLTFVSPSKPPQYVL 60

Query: 61  EALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVS 120
           EALLSLQSFFIT LPSLP SSSKPAA   DVQ DAE IYRTWLRSKFDELVKSLIDVAVS
Sbjct: 61  EALLSLQSFFITVLPSLP-SSSKPAA---DVQDDAELIYRTWLRSKFDELVKSLIDVAVS 120

Query: 121 SECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNH 180
           SECDDTLKEIVLDAIMEFVKVGN+GKFHSAVYHRFLQSIAHS TPV+TLIALLVKKYFNH
Sbjct: 121 SECDDTLKEIVLDAIMEFVKVGNRGKFHSAVYHRFLQSIAHSSTPVNTLIALLVKKYFNH 180

Query: 181 LDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPPLE 240
           LDVRYFTYISI++L +TF+AEYMSGD++ RI+GDDG HSREGVEFIHIVHSI+SSIPPLE
Sbjct: 181 LDVRYFTYISIEKLTRTFEAEYMSGDRSVRIDGDDGDHSREGVEFIHIVHSIISSIPPLE 240

Query: 241 NSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSK 300
           NSN+SDYT+WVESGD+KVLSDNQEAKQLKM+KNDEE       VL+ASKIVR+MK KF+K
Sbjct: 241 NSNQSDYTMWVESGDNKVLSDNQEAKQLKMRKNDEE-------VLTASKIVRKMKPKFTK 300

Query: 301 AWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFL 360
           AWISFL+LPLPIDVYKEVLVILDQEVIPYLS PIILCDFLTKSYD+GGVVSVMALSSLFL
Sbjct: 301 AWISFLRLPLPIDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYDMGGVVSVMALSSLFL 360

Query: 361 LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR 420
           LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR
Sbjct: 361 LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR 420

Query: 421 LSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEVDA 480
           LSLVVPPSGAL+IIALIHNLLRRHPSINCLVHRENVSESKNDDSTS+EVAKGTDASEV+A
Sbjct: 421 LSLVVPPSGALIIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSKEVAKGTDASEVEA 480

Query: 481 DTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKT 540
           DTPNMKPGID FNYEETDPIKSSAL+SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKT
Sbjct: 481 DTPNMKPGIDRFNYEETDPIKSSALRSSLWEIDCLRHHYCPPVSRLVLSLENDLTVRSKT 540

Query: 541 TEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEKNI 600
           TE+DVKDFVAGSYATILGQEL+KKMKRVPLAFYQA PT+LFSESDF GWSF+++HSEKNI
Sbjct: 541 TEIDVKDFVAGSYATILGQELKKKMKRVPLAFYQAIPTSLFSESDFPGWSFNHEHSEKNI 600

Query: 601 DGSDHLSAKRQRVESS 610
           DG DHL AKRQRVESS
Sbjct: 601 DGCDHLPAKRQRVESS 605

BLAST of HG10005552 vs. ExPASy Swiss-Prot
Match: F4IMH3 (Protein NUCLEOLAR COMPLEX ASSOCIATED 4 OS=Arabidopsis thaliana OX=3702 GN=NOC4 PE=2 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 5.7e-172
Identity = 333/595 (55.97%), Postives = 431/595 (72.44%), Query Frame = 0

Query: 1   MASIPSNK-KKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVLEALLSL 60
           MASI S K KK + + L ++K+LG  LL+SR+HINNL LLLTFVSP SPP +V+E+LLSL
Sbjct: 1   MASILSKKQKKNEKYTLKELKSLGHDLLTSRSHINNLPLLLTFVSPESPPQFVVESLLSL 60

Query: 61  QSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSECDDT 120
           QSFF   L  LP +SS P++   +   D E +++ WLRSKFDE VK L+DV VS + +D+
Sbjct: 61  QSFFTPLLSQLPPTSSSPSSTKTE---DPEVVFKAWLRSKFDEFVKLLLDVLVSQQSEDS 120

Query: 121 LKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDVRYF 180
           L+ IVL  +MEFVK+ N G+FHS++YHR L +I HS   ++  + +L  KYF ++DVRYF
Sbjct: 121 LRGIVLGTLMEFVKLLNAGRFHSSIYHRLLDAIIHSEVDIEIFLDILTSKYFKYIDVRYF 180

Query: 181 TYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEF-IHIVHSILSSIPPLENSNE- 240
           TYIS+++  KT +A  +S D+    N +    S+E +E  +  ++ +LS IPP E   E 
Sbjct: 181 TYISMEKFVKTLEAS-VSADRTVIENNEAESDSKESLELSVRKIYQVLSQIPPPEKQAEK 240

Query: 241 SDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSKAWIS 300
           S + +W  SG D+ +S+    K+ K +K D         +LS + I +RMKLKF+KAWIS
Sbjct: 241 SQHEMW--SGSDESISEKPTDKKKKTEKGDS-------TLLSPATISKRMKLKFTKAWIS 300

Query: 301 FLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFLLMTK 360
           FL+LPLPIDVYKEVL  +   VIP+LS P +LCDFLTKSYDIGGVVSVMALSSLF+LMT+
Sbjct: 301 FLRLPLPIDVYKEVLASIHLTVIPHLSNPTMLCDFLTKSYDIGGVVSVMALSSLFILMTQ 360

Query: 361 YGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLV 420
           +GLEYP FYEKLYALLVPS+F+AKHRAKF QLLD+CLKS +LPAYLAA+F KKLSRLSL 
Sbjct: 361 HGLEYPFFYEKLYALLVPSVFVAKHRAKFLQLLDACLKSSMLPAYLAASFTKKLSRLSLS 420

Query: 421 VPPSGALVIIALIHNLLRRHPSINCLVHR--ENVSESKNDDSTSEEVAKGTDASEVDADT 480
           +PP+G+LVI ALI+NLLRR+P+IN LV    EN  E+  +     E    T         
Sbjct: 421 IPPAGSLVITALIYNLLRRNPTINHLVQEIVENADEANTEAGEHNESQPKT--------I 480

Query: 481 PNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE 540
              K GID+FN +E+DP KS ALKSSLWEID+LRHHYCPPVSR + SLE +LT+RSKTTE
Sbjct: 481 KKRKLGIDYFNNQESDPKKSGALKSSLWEIDTLRHHYCPPVSRFISSLETNLTIRSKTTE 540

Query: 541 MDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSE 591
           M ++DF +GSYATI G E+R+++K+VPLAFY+  PT+LF++SDF GW+F     E
Sbjct: 541 MKIEDFCSGSYATIFGDEIRRRVKQVPLAFYKTVPTSLFADSDFPGWTFTIPQEE 574

BLAST of HG10005552 vs. ExPASy Swiss-Prot
Match: Q6NU91 (Nucleolar complex protein 4 homolog B OS=Xenopus laevis OX=8355 GN=noc4l-b PE=2 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 6.7e-56
Identity = 166/546 (30.40%), Postives = 272/546 (49.82%), Query Frame = 0

Query: 52  VLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA 111
           + E LL  +  +I +LP+   S     +A D         Y+ W+R++++  V  L+D+ 
Sbjct: 66  LFEVLLEKRELYIGDLPAEDDSPPDTCSAEDK--------YKMWMRNRYNSCVSCLLDLL 125

Query: 112 VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVY---HRF----LQSIAHSLTPVD---T 171
             S    +++E+VL  +M+F+++  K    ++ +   +RF    L+ +  +L   +   T
Sbjct: 126 QYSSF--SVQELVLCTLMKFIQLEGKFPLENSEWRDSYRFPRELLKFVVDNLLQEEADCT 185

Query: 172 LIALLVKKYFNHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHI 231
           L+    ++Y  + DVRY+T     E     +       KN ++             F   
Sbjct: 186 LLITRFQEYLEYDDVRYYTMTVTTECVSRIQ------QKNKQVLPP---------VFQTN 245

Query: 232 VHSILSSI-PPLENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSA 291
           V  +LSSI  P+E S   ++          +++ N+  ++ K  K               
Sbjct: 246 VFCLLSSINMPVEESTLGNF----------LVTKNENHEEWKPSK--------------- 305

Query: 292 SKIVRRMKLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIG 351
              ++  K  F + W+SFLK  L + +YK+VL+IL + ++P++SKP ++ DFLT +YD+G
Sbjct: 306 ---LKEQKRVFERVWMSFLKHQLSVSLYKKVLLILHESILPHMSKPSLMIDFLTAAYDVG 365

Query: 352 GVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLP 411
           G +S++AL+ LF+L+ ++ LEYP+FY+KLY+LL PS+F  K+RA+FF L +  L S  LP
Sbjct: 366 GAISLLALNGLFILIHQHNLEYPDFYKKLYSLLEPSVFHVKYRARFFHLANLFLSSTHLP 425

Query: 412 AYLAAAFAKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSE 471
            YL AAFAK+L+RL+L  PP   L+II  I NL+RRHP+   L+HR +  +   D     
Sbjct: 426 VYLVAAFAKRLARLALTAPPQVLLMIIPFICNLIRRHPACRVLIHRPSAGDLVTDP---- 485

Query: 472 EVAKGTDASEVDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLV 531
                                   +  EE DP KS AL+S LWE++ L+ HY   V R  
Sbjct: 486 ------------------------YIMEEQDPAKSQALESCLWELEVLQQHYHGDVVRAA 525

Query: 532 LSLENDLTVRSKTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFA 587
             +   L+ +    E DV   +  S   +  +E++KK K VPL  Y+     L  +SD  
Sbjct: 546 NVISRALSAQ----ESDVSGLLEMSSCELFDKEMKKKFKSVPLE-YEPVRGLLGLKSDIT 525

BLAST of HG10005552 vs. ExPASy Swiss-Prot
Match: Q6NRQ2 (Nucleolar complex protein 4 homolog A OS=Xenopus laevis OX=8355 GN=noc4l-a PE=2 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 1.1e-50
Identity = 164/551 (29.76%), Postives = 268/551 (48.64%), Query Frame = 0

Query: 52  VLEALLSLQSFFITNLP----SLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSL 111
           + E +L  +  +I +LP    +LP + S            AE  Y+ W+R +++     +
Sbjct: 66  LFEVMLEKRELYIGDLPAENGTLPDTYS------------AEDKYKMWMRHRYNSCAACI 125

Query: 112 IDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVY---HRF----LQSIAHSLTPVD 171
           +D+   S   +  +E+ L  +M+F+++  K    ++ +   +RF    L+ +  +L   +
Sbjct: 126 LDLLQHSSFSN--QELALCTLMKFIQLEGKFPLENSEWKDSYRFPRELLKFVIDNLLQEE 185

Query: 172 ---TLIALLVKKYFNHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVE 231
              TL+    ++Y  + DVRY+T     +     +       KN  +             
Sbjct: 186 ADCTLLITRFQEYLEYDDVRYYTMTVTNDCVSRVQ------QKNKLVLPP---------V 245

Query: 232 FIHIVHSILSSIP-PLENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQ 291
           F   V  +LSSI  P+E S   ++ +           +N++ K  K+K +          
Sbjct: 246 FQTNVFCLLSSINIPVEESALGNFLVTKN-------VNNEDWKPSKLKDH---------- 305

Query: 292 VLSASKIVRRMKLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKS 351
                      K  F + W+ FLK  L + +YK+VL+IL + ++P++SKP ++ DFLT +
Sbjct: 306 -----------KRVFERVWMIFLKHQLSVSLYKKVLLILHESILPHMSKPTLMIDFLTAA 365

Query: 352 YDIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKS 411
           YD+GG +S++AL+ LF+L+ ++ LEYP+FY+KLY+LL PSIF  K+RA+FF L +  L S
Sbjct: 366 YDVGGAISLLALNGLFILIHQHNLEYPDFYKKLYSLLEPSIFHVKYRARFFHLANMFLSS 425

Query: 412 PLLPAYLAAAFAKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDD 471
             LP YL AAFAK+L+RL+L  PP   L+II  I NL+RRHP+   L+HR +  +   D 
Sbjct: 426 THLPVYLVAAFAKRLARLALTAPPQVLLMIIPFICNLIRRHPACRVLIHRPSAGDLATDP 485

Query: 472 STSEEVAKGTDASEVDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPV 531
                                       +  EE DP KS AL+SSLWE++ L+ HY   V
Sbjct: 486 ----------------------------YIMEEQDPAKSQALESSLWELEVLQQHYHGDV 526

Query: 532 SRLVLSLENDLTVRSKTTEMDVKDFVAGSYATILGQEL-RKKMKRVPLAFYQAAPTALFS 587
            R    +   L+ +    E D+   +  S   +  +E+ +KK K VPL  Y+     L  
Sbjct: 546 VRAANVISRPLSAQ----ESDISGLLEISSCELYDKEMKKKKFKSVPLE-YEPVRGLLGL 526

BLAST of HG10005552 vs. ExPASy Swiss-Prot
Match: Q8BHY2 (Nucleolar complex protein 4 homolog OS=Mus musculus OX=10090 GN=Noc4l PE=2 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 1.9e-50
Identity = 117/286 (40.91%), Postives = 171/286 (59.79%), Query Frame = 0

Query: 284 VRRMKLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVV 343
           ++  K  F + W+ FLK  LP+ +YK+VLV +   ++P+L++P ++ DFLT + D+GG +
Sbjct: 245 LKEHKKAFQEMWLGFLKHKLPLSLYKKVLVAMHDSILPHLAQPTLMIDFLTSACDVGGAI 304

Query: 344 SVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYL 403
           S++AL+ LF+L+ K+ LEYP+FY+KLY LL PSIF  K+RA+FF L D  L S  LPAYL
Sbjct: 305 SLLALNGLFILIHKHNLEYPDFYQKLYGLLDPSIFHVKYRARFFHLADLFLSSSHLPAYL 364

Query: 404 AAAFAKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVA 463
            AAFAK+L+RL+L  PP   L+++ LI NLLRRHP+   +VHR                 
Sbjct: 365 VAAFAKRLARLALTAPPEALLMVLPLICNLLRRHPACRVMVHR----------------- 424

Query: 464 KGTDASEVDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSL 523
                 E+DAD          ++  E DP +S AL+S LWE+ +L+ HY P VS+    +
Sbjct: 425 --PQGPELDADP---------YDPTEKDPARSRALESCLWELQTLQQHYHPEVSKAASVI 484

Query: 524 ENDLTVRSKTTEMDVKDFVAGSYATILGQELRKKM-KRVPLAFYQA 569
              L+V     E+ +   +  +   I  Q+L+KKM + VPL F  A
Sbjct: 485 NQVLSV----PEVSIAPLLELTAYEIFEQDLKKKMPESVPLEFIPA 498

BLAST of HG10005552 vs. ExPASy Swiss-Prot
Match: Q5ZJC7 (Nucleolar complex protein 4 homolog OS=Gallus gallus OX=9031 GN=NOC4L PE=2 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 2.5e-50
Identity = 115/280 (41.07%), Postives = 168/280 (60.00%), Query Frame = 0

Query: 288 KLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMA 347
           K  F + W++FLK  LP  +YK+VLVIL   ++PY+++P ++ DFLT +Y +GG +S++A
Sbjct: 241 KQAFERMWLTFLKHQLPSGLYKKVLVILHDSILPYMNEPTLMIDFLTVAYGVGGAISLLA 300

Query: 348 LSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAF 407
           L+ LF+L+ ++ LEYP+FY+KLY+LL PSI+  K+RA+FF L D  L S  LPAYL AAF
Sbjct: 301 LNGLFILIHQHNLEYPDFYKKLYSLLDPSIYHVKYRARFFHLADLFLSSSHLPAYLVAAF 360

Query: 408 AKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTD 467
            K+LSRL+L  PP   L++I  I NL RRHP+   L+HR N  +               D
Sbjct: 361 IKRLSRLALTAPPEALLMVIPFICNLFRRHPACKVLMHRPNGPQ---------------D 420

Query: 468 ASEVDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDL 527
            SE            D +  E+ +P +S AL+SSLWE+ SL++HY P V++    L   L
Sbjct: 421 LSE------------DPYIMEQEEPSESRALESSLWELQSLQNHYHPDVAQAAAILNQSL 480

Query: 528 TVRSKTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQ 568
           +      E D+   +  S + +  +E++K    VPL F Q
Sbjct: 481 S----EIEDDISGLLELSASELFDKEIKKTSANVPLEFEQ 489

BLAST of HG10005552 vs. ExPASy TrEMBL
Match: A0A1S3BIC0 (nucleolar complex protein 4 homolog OS=Cucumis melo OX=3656 GN=LOC103490206 PE=3 SV=1)

HSP 1 Score: 1037.7 bits (2682), Expect = 2.2e-299
Identity = 555/620 (89.52%), Postives = 575/620 (92.74%), Query Frame = 0

Query: 1   MASIPSN-------KKKTKN---HKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPP 60
           MASIPSN       KKKTKN   H LSD+KTLGLQLLSSRAHINNL LLLTFVSPSSPPP
Sbjct: 1   MASIPSNDHNEKMKKKKTKNEKTHSLSDLKTLGLQLLSSRAHINNLPLLLTFVSPSSPPP 60

Query: 61  YVLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDV 120
           YVLEALLSLQSFFITNLP+LP SSSKP  AGDDVQVDAEFIYRTWLRSKFDELVKSLIDV
Sbjct: 61  YVLEALLSLQSFFITNLPTLP-SSSKPPLAGDDVQVDAEFIYRTWLRSKFDELVKSLIDV 120

Query: 121 AVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKY 180
           AVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIA S TPVDTLIALLVKKY
Sbjct: 121 AVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIARSSTPVDTLIALLVKKY 180

Query: 181 FNHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIP 240
           F++LDVRYFTYISIKEL K FKAEYMS        GD GGHS+EGVEFIHIVHSI+SSIP
Sbjct: 181 FHYLDVRYFTYISIKELAKIFKAEYMS--------GDGGGHSKEGVEFIHIVHSIISSIP 240

Query: 241 PLENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLK 300
           PLENSN+SDYT+WVESGD+KVLSD+QEAKQLKMKKNDEE       VL+ASKIVRRMKLK
Sbjct: 241 PLENSNQSDYTMWVESGDNKVLSDDQEAKQLKMKKNDEE-------VLTASKIVRRMKLK 300

Query: 301 FSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSS 360
           FSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIIL DFLTKSYDIGGV+SVMALSS
Sbjct: 301 FSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILSDFLTKSYDIGGVISVMALSS 360

Query: 361 LFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKK 420
           LFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKK
Sbjct: 361 LFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKK 420

Query: 421 LSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASE 480
           LSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENV ESKNDDSTSEE AKGTDASE
Sbjct: 421 LSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVGESKNDDSTSEEAAKGTDASE 480

Query: 481 VDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVR 540
           VDADTP MKPGIDHFNYEETDPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDLTVR
Sbjct: 481 VDADTPKMKPGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVR 540

Query: 541 SKTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSE 600
           SKTTE+DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQA PT LFSESDF GWSFDY+HS+
Sbjct: 541 SKTTEIDVKDFVAGSYSTILGQELKKKLKRVPLAFYQAPPTTLFSESDFVGWSFDYEHSD 600

Query: 601 K-NIDGSDHLSAKRQRVESS 610
           K NIDGSDHLSAKRQ + SS
Sbjct: 601 KNNIDGSDHLSAKRQCIGSS 604

BLAST of HG10005552 vs. ExPASy TrEMBL
Match: A0A5D3DJ82 (Nucleolar complex protein 4-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G003360 PE=3 SV=1)

HSP 1 Score: 1036.9 bits (2680), Expect = 3.7e-299
Identity = 555/619 (89.66%), Postives = 574/619 (92.73%), Query Frame = 0

Query: 1   MASIPSN-----KKKT----KNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPY 60
           MASIPSN     KKKT    K H LSD+KTLGLQLLSSRAHINNL LLLTFVSPSSPPPY
Sbjct: 1   MASIPSNDHNEKKKKTTKNEKTHSLSDLKTLGLQLLSSRAHINNLPLLLTFVSPSSPPPY 60

Query: 61  VLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA 120
           VLEALLSLQSFFITNLPSLP SSSKP  AGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA
Sbjct: 61  VLEALLSLQSFFITNLPSLP-SSSKPPLAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVA 120

Query: 121 VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYF 180
           VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIA S TPVDTLIALLVKKYF
Sbjct: 121 VSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIARSSTPVDTLIALLVKKYF 180

Query: 181 NHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPP 240
           ++LDVRYFTYISIKEL K FKAEYMS        GD GGHS+EGVEFIHIVHSI+SSIPP
Sbjct: 181 HYLDVRYFTYISIKELAKIFKAEYMS--------GDGGGHSKEGVEFIHIVHSIISSIPP 240

Query: 241 LENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKF 300
           LENSN+SDYT+WVESGD+KVLSD+QEAKQLKMKKNDEE       VL+ASKIVRRMKLKF
Sbjct: 241 LENSNQSDYTMWVESGDNKVLSDDQEAKQLKMKKNDEE-------VLTASKIVRRMKLKF 300

Query: 301 SKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSL 360
           SKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIIL DFLTKSYDIGGV+SVMALSSL
Sbjct: 301 SKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILSDFLTKSYDIGGVISVMALSSL 360

Query: 361 FLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKL 420
           FLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKL
Sbjct: 361 FLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKL 420

Query: 421 SRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEV 480
           SRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENV ESKNDDSTSEE AKGTDASEV
Sbjct: 421 SRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVGESKNDDSTSEEAAKGTDASEV 480

Query: 481 DADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRS 540
           DADTP MKPGIDHFNYEETDPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRS
Sbjct: 481 DADTPKMKPGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRS 540

Query: 541 KTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEK 600
           KTTE+DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQA PT LFSESDF GWSFDY+HS+K
Sbjct: 541 KTTEIDVKDFVAGSYSTILGQELKKKLKRVPLAFYQAPPTTLFSESDFVGWSFDYEHSDK 600

Query: 601 -NIDGSDHLSAKRQRVESS 610
            NIDGSDHLSAKRQ + SS
Sbjct: 601 NNIDGSDHLSAKRQCIGSS 603

BLAST of HG10005552 vs. ExPASy TrEMBL
Match: A0A6J1HZT8 (nucleolar complex protein 4 homolog B OS=Cucurbita maxima OX=3661 GN=LOC111468149 PE=3 SV=1)

HSP 1 Score: 1033.1 bits (2670), Expect = 5.3e-298
Identity = 546/616 (88.64%), Postives = 576/616 (93.51%), Query Frame = 0

Query: 1   MASIPS-------NKKKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVL 60
           MAS+PS        KK  KNHKLSD+KTLGLQLLSS+AHINNL LLLTFVSPS PP YVL
Sbjct: 1   MASLPSLNQNEKKKKKNEKNHKLSDLKTLGLQLLSSQAHINNLPLLLTFVSPSKPPQYVL 60

Query: 61  EALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVS 120
           EALLSLQSFFIT LPSLP SSSKPAA   DVQ DAE IYRTWLRSKFDELVKSLIDVAVS
Sbjct: 61  EALLSLQSFFITVLPSLP-SSSKPAA---DVQDDAELIYRTWLRSKFDELVKSLIDVAVS 120

Query: 121 SECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNH 180
           SECDDTLKEIVLDAIMEFVKVGN+GKFHSAVYHRFLQSIAHS  PV+TLIALLVKKYFNH
Sbjct: 121 SECDDTLKEIVLDAIMEFVKVGNRGKFHSAVYHRFLQSIAHSSAPVNTLIALLVKKYFNH 180

Query: 181 LDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPPLE 240
           LDVRYFTYISI++L +TF+AEYMSGD++ RINGDDG HSREGVEFIHIVHSI+SSIPPLE
Sbjct: 181 LDVRYFTYISIEKLTRTFEAEYMSGDRSVRINGDDGDHSREGVEFIHIVHSIISSIPPLE 240

Query: 241 NSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSK 300
           NSN+SDYT+WVESGDDKVLSDNQEAKQLKM+KNDEE       VLSAS+IVR+MK KF+K
Sbjct: 241 NSNQSDYTMWVESGDDKVLSDNQEAKQLKMRKNDEE-------VLSASRIVRKMKPKFTK 300

Query: 301 AWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFL 360
           AWISFL+LPLPIDVYKEVLVILDQEVIPYLS PIILCDFLTKSYD+GGVVSVMALSSLFL
Sbjct: 301 AWISFLRLPLPIDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYDMGGVVSVMALSSLFL 360

Query: 361 LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR 420
           LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR
Sbjct: 361 LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR 420

Query: 421 LSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEVDA 480
           LSLVVPPSGAL+IIALIHNLLRRHPSINCLVHRENVSESKNDDSTS+EVAKGTDASEV+A
Sbjct: 421 LSLVVPPSGALIIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSKEVAKGTDASEVEA 480

Query: 481 DTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKT 540
           DT NMKPGID FNYEETDPIKSSAL+SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKT
Sbjct: 481 DTLNMKPGIDRFNYEETDPIKSSALRSSLWEIDCLRHHYCPPVSRLVLSLENDLTVRSKT 540

Query: 541 TEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEKNI 600
           TE+DVKDFVAGSYATILGQEL+KKMKRVPLAFYQA PT+LFSESDF GWSF+++HSEKNI
Sbjct: 541 TEIDVKDFVAGSYATILGQELKKKMKRVPLAFYQAIPTSLFSESDFPGWSFNHEHSEKNI 600

Query: 601 DGSDHLSAKRQRVESS 610
           DGSDHL AKRQRVESS
Sbjct: 601 DGSDHLPAKRQRVESS 605

BLAST of HG10005552 vs. ExPASy TrEMBL
Match: A0A6J1GJR4 (nucleolar complex protein 4 homolog OS=Cucurbita moschata OX=3662 GN=LOC111454922 PE=3 SV=1)

HSP 1 Score: 1028.5 bits (2658), Expect = 1.3e-296
Identity = 542/616 (87.99%), Postives = 576/616 (93.51%), Query Frame = 0

Query: 1   MASIPS-------NKKKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVL 60
           MAS+PS        KK  KNHKLSD+KTLGLQLLSSRAHINNL LLLTF+SPS PP YVL
Sbjct: 1   MASVPSLNQSEKKKKKNEKNHKLSDLKTLGLQLLSSRAHINNLPLLLTFLSPSKPPQYVL 60

Query: 61  EALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVS 120
           EALLSLQSFFIT LPSLP SSSKPAA   DVQ DAE IYRTWLRSKFDELVKSLIDVAVS
Sbjct: 61  EALLSLQSFFITVLPSLP-SSSKPAA---DVQDDAELIYRTWLRSKFDELVKSLIDVAVS 120

Query: 121 SECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNH 180
           SECDDTLKEIVLDAIMEFVKVGN+GKFHSA+YHRFLQSIAHS TPV+TLIALLVKKYFNH
Sbjct: 121 SECDDTLKEIVLDAIMEFVKVGNRGKFHSALYHRFLQSIAHSSTPVNTLIALLVKKYFNH 180

Query: 181 LDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILSSIPPLE 240
           LDVRYFTYISI++L +TF+AEYM GD++ RI+GDDG HSR+GVEFIHIVHSI+SSIPPLE
Sbjct: 181 LDVRYFTYISIEKLTRTFEAEYMFGDRSVRIDGDDGDHSRKGVEFIHIVHSIISSIPPLE 240

Query: 241 NSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSK 300
           NSN+SDYT+WVESGD+KVLSDNQEAKQLKM+KNDEE       VL+ASKIVR+MK KF+K
Sbjct: 241 NSNQSDYTMWVESGDNKVLSDNQEAKQLKMRKNDEE-------VLTASKIVRKMKPKFTK 300

Query: 301 AWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFL 360
           AWISFL+LPLPIDVYKEVLVILDQEVIPYLS PIILCDFLTKSYD+GGVVSVMALSSLFL
Sbjct: 301 AWISFLRLPLPIDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYDMGGVVSVMALSSLFL 360

Query: 361 LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR 420
           LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR
Sbjct: 361 LMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSR 420

Query: 421 LSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTDASEVDA 480
           LSLVVPPSGAL+IIALIHNLLRRHPSINCLVHRENVSESKNDDSTS+EVAKGTDASEV+A
Sbjct: 421 LSLVVPPSGALIIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSKEVAKGTDASEVEA 480

Query: 481 DTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKT 540
           DTPNMKPGID FNYEETDPIKSSAL+SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKT
Sbjct: 481 DTPNMKPGIDRFNYEETDPIKSSALRSSLWEIDCLRHHYCPPVSRLVLSLENDLTVRSKT 540

Query: 541 TEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSEKNI 600
           TE+DVKDFVAGSYATILGQEL+KKMKRVPLAFYQA PT+LFS SDF GWSF+++HSEKNI
Sbjct: 541 TEIDVKDFVAGSYATILGQELKKKMKRVPLAFYQAIPTSLFSASDFPGWSFNHEHSEKNI 600

Query: 601 DGSDHLSAKRQRVESS 610
           DGSDHL AKRQRVESS
Sbjct: 601 DGSDHLPAKRQRVESS 605

BLAST of HG10005552 vs. ExPASy TrEMBL
Match: A0A0A0K685 (CBF domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G025180 PE=3 SV=1)

HSP 1 Score: 1027.7 bits (2656), Expect = 2.2e-296
Identity = 552/622 (88.75%), Postives = 571/622 (91.80%), Query Frame = 0

Query: 1   MASIPSN------KKKTK-------NHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSS 60
           MASIPSN      KKKTK        H LSD+KTLGLQLLSSRAHINNL LLLTFVSPSS
Sbjct: 1   MASIPSNNHNEKEKKKTKTKTKNENTHSLSDLKTLGLQLLSSRAHINNLPLLLTFVSPSS 60

Query: 61  PPPYVLEALLSLQSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSL 120
           PPPYVLEALLSLQSFFITNLPSLP SSSKP  AGDDVQVDAEFIYRTWLRSKFDELVKSL
Sbjct: 61  PPPYVLEALLSLQSFFITNLPSLP-SSSKPPPAGDDVQVDAEFIYRTWLRSKFDELVKSL 120

Query: 121 IDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLV 180
           IDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHS TPVDTLIALLV
Sbjct: 121 IDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSSTPVDTLIALLV 180

Query: 181 KKYFNHLDVRYFTYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEFIHIVHSILS 240
           KKYF++LDVRYFTYISIKEL KTFKAEYMSGD         GGHS+EGVEFIHIVHSI+S
Sbjct: 181 KKYFHYLDVRYFTYISIKELAKTFKAEYMSGDV--------GGHSKEGVEFIHIVHSIIS 240

Query: 241 SIPPLENSNESDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRM 300
           SIPPLENSN+SDYT+WVESGD+KVLSD+QEAKQLKMKKNDEE       VL++SKIVRRM
Sbjct: 241 SIPPLENSNQSDYTMWVESGDNKVLSDDQEAKQLKMKKNDEE-------VLTSSKIVRRM 300

Query: 301 KLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMA 360
           KLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIIL DFLTKSYDIGGV+SVMA
Sbjct: 301 KLKFSKAWISFLKLPLPIDVYKEVLVILDQEVIPYLSKPIILSDFLTKSYDIGGVISVMA 360

Query: 361 LSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAF 420
           LSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAF
Sbjct: 361 LSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAF 420

Query: 421 AKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDDSTSEEVAKGTD 480
           AKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKND+STSEE AKGT 
Sbjct: 421 AKKLSRLSLVVPPSGALVIIALIHNLLRRHPSINCLVHRENVSESKNDNSTSEEAAKGT- 480

Query: 481 ASEVDADTPNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDL 540
               DADTP MKPGIDHFNYEE DPIKSSAL+SSLWEIDSLRHHYCPPVSRLVLSLENDL
Sbjct: 481 ----DADTPKMKPGIDHFNYEEADPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDL 540

Query: 541 TVRSKTTEMDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYK 600
           TVRSKTTE+DVKDFVAGSY+TILGQEL+KK+KRVPLAFYQA PT LFSESDFAGWSFD +
Sbjct: 541 TVRSKTTEIDVKDFVAGSYSTILGQELKKKLKRVPLAFYQAPPTTLFSESDFAGWSFDNE 600

Query: 601 HSEKNIDGSDHLSAKRQRVESS 610
           HSEKNID SDHLSAKRQRV SS
Sbjct: 601 HSEKNIDSSDHLSAKRQRVGSS 601

BLAST of HG10005552 vs. TAIR 10
Match: AT2G17250.1 (CCAAT-binding factor )

HSP 1 Score: 605.9 bits (1561), Expect = 4.0e-173
Identity = 333/595 (55.97%), Postives = 431/595 (72.44%), Query Frame = 0

Query: 1   MASIPSNK-KKTKNHKLSDVKTLGLQLLSSRAHINNLALLLTFVSPSSPPPYVLEALLSL 60
           MASI S K KK + + L ++K+LG  LL+SR+HINNL LLLTFVSP SPP +V+E+LLSL
Sbjct: 1   MASILSKKQKKNEKYTLKELKSLGHDLLTSRSHINNLPLLLTFVSPESPPQFVVESLLSL 60

Query: 61  QSFFITNLPSLPSSSSKPAAAGDDVQVDAEFIYRTWLRSKFDELVKSLIDVAVSSECDDT 120
           QSFF   L  LP +SS P++   +   D E +++ WLRSKFDE VK L+DV VS + +D+
Sbjct: 61  QSFFTPLLSQLPPTSSSPSSTKTE---DPEVVFKAWLRSKFDEFVKLLLDVLVSQQSEDS 120

Query: 121 LKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSIAHSLTPVDTLIALLVKKYFNHLDVRYF 180
           L+ IVL  +MEFVK+ N G+FHS++YHR L +I HS   ++  + +L  KYF ++DVRYF
Sbjct: 121 LRGIVLGTLMEFVKLLNAGRFHSSIYHRLLDAIIHSEVDIEIFLDILTSKYFKYIDVRYF 180

Query: 181 TYISIKELDKTFKAEYMSGDKNGRINGDDGGHSREGVEF-IHIVHSILSSIPPLENSNE- 240
           TYIS+++  KT +A  +S D+    N +    S+E +E  +  ++ +LS IPP E   E 
Sbjct: 181 TYISMEKFVKTLEAS-VSADRTVIENNEAESDSKESLELSVRKIYQVLSQIPPPEKQAEK 240

Query: 241 SDYTLWVESGDDKVLSDNQEAKQLKMKKNDEENVINTLQVLSASKIVRRMKLKFSKAWIS 300
           S + +W  SG D+ +S+    K+ K +K D         +LS + I +RMKLKF+KAWIS
Sbjct: 241 SQHEMW--SGSDESISEKPTDKKKKTEKGDS-------TLLSPATISKRMKLKFTKAWIS 300

Query: 301 FLKLPLPIDVYKEVLVILDQEVIPYLSKPIILCDFLTKSYDIGGVVSVMALSSLFLLMTK 360
           FL+LPLPIDVYKEVL  +   VIP+LS P +LCDFLTKSYDIGGVVSVMALSSLF+LMT+
Sbjct: 301 FLRLPLPIDVYKEVLASIHLTVIPHLSNPTMLCDFLTKSYDIGGVVSVMALSSLFILMTQ 360

Query: 361 YGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLV 420
           +GLEYP FYEKLYALLVPS+F+AKHRAKF QLLD+CLKS +LPAYLAA+F KKLSRLSL 
Sbjct: 361 HGLEYPFFYEKLYALLVPSVFVAKHRAKFLQLLDACLKSSMLPAYLAASFTKKLSRLSLS 420

Query: 421 VPPSGALVIIALIHNLLRRHPSINCLVHR--ENVSESKNDDSTSEEVAKGTDASEVDADT 480
           +PP+G+LVI ALI+NLLRR+P+IN LV    EN  E+  +     E    T         
Sbjct: 421 IPPAGSLVITALIYNLLRRNPTINHLVQEIVENADEANTEAGEHNESQPKT--------I 480

Query: 481 PNMKPGIDHFNYEETDPIKSSALKSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTE 540
              K GID+FN +E+DP KS ALKSSLWEID+LRHHYCPPVSR + SLE +LT+RSKTTE
Sbjct: 481 KKRKLGIDYFNNQESDPKKSGALKSSLWEIDTLRHHYCPPVSRFISSLETNLTIRSKTTE 540

Query: 541 MDVKDFVAGSYATILGQELRKKMKRVPLAFYQAAPTALFSESDFAGWSFDYKHSE 591
           M ++DF +GSYATI G E+R+++K+VPLAFY+  PT+LF++SDF GW+F     E
Sbjct: 541 MKIEDFCSGSYATIFGDEIRRRVKQVPLAFYKTVPTSLFADSDFPGWTFTIPQEE 574

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038887732.10.0e+0093.16protein NUCLEOLAR COMPLEX ASSOCIATED 4 isoform X1 [Benincasa hispida][more]
XP_038887733.10.0e+0092.18protein NUCLEOLAR COMPLEX ASSOCIATED 4 isoform X2 [Benincasa hispida][more]
XP_008447831.14.4e-29989.52PREDICTED: nucleolar complex protein 4 homolog [Cucumis melo][more]
TYK23279.17.6e-29989.66nucleolar complex protein 4-like protein [Cucumis melo var. makuwa][more]
KAG7011815.13.8e-29888.64Nucleolar complex protein 4-like B, partial [Cucurbita argyrosperma subsp. argyr... [more]
Match NameE-valueIdentityDescription
F4IMH35.7e-17255.97Protein NUCLEOLAR COMPLEX ASSOCIATED 4 OS=Arabidopsis thaliana OX=3702 GN=NOC4 P... [more]
Q6NU916.7e-5630.40Nucleolar complex protein 4 homolog B OS=Xenopus laevis OX=8355 GN=noc4l-b PE=2 ... [more]
Q6NRQ21.1e-5029.76Nucleolar complex protein 4 homolog A OS=Xenopus laevis OX=8355 GN=noc4l-a PE=2 ... [more]
Q8BHY21.9e-5040.91Nucleolar complex protein 4 homolog OS=Mus musculus OX=10090 GN=Noc4l PE=2 SV=1[more]
Q5ZJC72.5e-5041.07Nucleolar complex protein 4 homolog OS=Gallus gallus OX=9031 GN=NOC4L PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3BIC02.2e-29989.52nucleolar complex protein 4 homolog OS=Cucumis melo OX=3656 GN=LOC103490206 PE=3... [more]
A0A5D3DJ823.7e-29989.66Nucleolar complex protein 4-like protein OS=Cucumis melo var. makuwa OX=1194695 ... [more]
A0A6J1HZT85.3e-29888.64nucleolar complex protein 4 homolog B OS=Cucurbita maxima OX=3661 GN=LOC11146814... [more]
A0A6J1GJR41.3e-29687.99nucleolar complex protein 4 homolog OS=Cucurbita moschata OX=3662 GN=LOC11145492... [more]
A0A0A0K6852.2e-29688.75CBF domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G025180 PE=3 SV... [more]
Match NameE-valueIdentityDescription
AT2G17250.14.0e-17355.97CCAAT-binding factor [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 258..278
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 451..468
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 451..481
NoneNo IPR availablePANTHERPTHR12455:SF1BNAC09G09250D PROTEINcoord: 1..589
IPR005612CCAAT-binding factorPFAMPF03914CBFcoord: 344..519
e-value: 1.8E-34
score: 119.2
IPR027193Nucleolar complex protein 4PANTHERPTHR12455NUCLEOLAR COMPLEX PROTEIN 4coord: 1..589

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10005552.1HG10005552.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0006364 rRNA processing
biological_process GO:0042254 ribosome biogenesis
cellular_component GO:0016020 membrane
cellular_component GO:0030692 Noc4p-Nop14p complex
cellular_component GO:0005730 nucleolus
cellular_component GO:0005654 nucleoplasm
cellular_component GO:0032040 small-subunit processome