HG10016220 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10016220
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: AAA domain-containing protein
Location: Chr03: 3592118 .. 3602373 (-)
RNA-Seq Expression: HG10016220
Synteny: HG10016220
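
The location field above can be read mechanically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr03: 3592118 .. 3602373 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr03: 3592118 .. 3602373 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 10256 bp on the minus strand of Chr03.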
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATTCCCTTCGAAAATCCCCAATCTCAGCCTCTCTCCCGGCCCACCGGCGCCAGAAGAAGCGGCAGCTTCCCCACCTCCCCTGAGTTCGAGTTCTGGATGGTTCGAAACCCCTCTTTCCCTCAGCCCAATCTTCTTTCTGCCGACGAGCTCTTCGTTGATGGCGTTCTTCTTCCCCTTCACCTTGTATCCAACCACTCCCCATCTCAGTCCACTGACCCTAACCAGAAATCTGACCTGGAACCTCCTCCCTCCGAACCCGATCCCAGCGACGGCCCCAAATTGACGCCCAATTCGGCGGATTCGGGTTCTTCGTTAACCTCGTCGAAGCGGTGGAGCATTTTCAAGAAGAGCGAGAAGAAGAACGTCACGGGTAATCAGGAGGATCGAGACAAGGAGAAGAAGAAAGAGAAGAAGACTGGGAATGGGTCTACATCGGCCGAGTTGAATATCAATATTTGGCCCTTTTCGCGTAGTAGATCCGCTGGGAATGCTTTCACTAGGCCTAAAATGTTCCCTGGCGGCCAACCCGGATCCCGGAAGGTCAACAGTGCGCCGTGTTCTCGCAGCAACTCCGCCGGCGAATCCAAGTCCAGGAAGTGGCCGAGCAGCCCGAGCCGCGCTGGCGTCCATCTCGGCCGGAGTAGCCCAGTTTGGCAGGTCCGCCGCGGCGGATCCGCTCCCAAAACCTCCGAAAACCTCTCTCGCAATGCCGAAAAACCCGCCCGGAAAGAACCCACGGACGCGCACCGGAGCAAGGCTGTAGCCGCCTCCTCCTCCGCCTCCAGAGTTAGAGTTTTGAATTTGAATGTCCCCATGTGTATTGGGTACAGGAACCATTTGAGCTGTAGAAGCGATGAGACCAGTGCACTTGGGGTTATTGGCAGCGGCGGCGGTGGAAGCAGCGGCGGCAGCCATGGTTACGACAACAATGGAGACGGCGGCAATATCAGTAATCCTGGAAACTCAAGTAGTACTGCCAATCTCTTTAGCATAAGAAGCCTTTTTACTAAGAAAGTGCATTAACAACCTTGTCATAAAATGAATTTACCGCTTTCACATGTATTATTGTATAGGAATCTCTCCTAGTCTAAACTTTTTGCATCTCTATGTAATTAACCAAAAGTATTAGCCATCTGAGTTCTGACTGTACTTAATTAATCAGCTCTGTACAGAGTTTTGTTGGCTGTTGCCTTAATTTCAATCAATTTTTCTCAGAAATTAACTCCCCACATTTGAATTGATCTCTTTGTTCCTAATAATTTCGTCTCACTGATGATTTGTTGGGTTGAATGAATGCCATATTCTGAAAACTTTTTCATGTCAAGTCTTTATGGAGTTTTCTTTTTCTTCTTTTCTTGTTTCTTCATCATTATGAGGACCACCAATTGTCATCCTGTTGGAGACTTGCTCTAGAAGATGATAGATTTTCGCTGATATTTTTACAGAGAAATCTTTTGTCTTGTAATCCATGGCTGCTTCTGATTTACTGGGGAAAAGAATAGACCAGTGTATCTTTTATCCTATCTACCTTTTCAAACCATGCCCATAGAATTCTGTACTGAACTGTAAAACTGTAATTCTGGCTTGTTGCCCATACAAACCAATGTAATCTTGTTATTGTTTTTTTTTTCTCTGGCTATTATTTTCACCATCCATACATTAGTTTCCAGGACATTATCCTGTTCAAGTTTTCCCCAAGTTTTGTTTTCCATAAGAACAAGACAGTTTCAATCACTAGATGTCCTTAGTACAGAGTGAGAGAGCTGGATTAACTCTTTGGTGAATAACCAGTTAAAAAAGCTTGAGTTCTTTGTGATATGATTTCTTTTTAGTACAATAGAGGTAACGGGATTTAAACAACCACAGTCTTCTTGGTTGTTAACACATTCGTATATTCGTTGAGCTATGCTTGGTTTGTCATGATGTGATTTTTATTTCAAAATAAATATAGATAAATTGTTGATTATGACTTTATTGTAATTGTAAGATTTTGT
CCATTTTATCCAATTGGATGTTGGGAGAGTGGGACGAGAATGAGATAATTCAAAGGTTGGATTAGCTTAAGTTTTAATTATGGCTTTCTGACTTTGAGTTTGTTTATCTTTTATATTTGTGGTTGACACTGATTAACAAAATGTATAATGACACCTTTGTTTTTTCTTGAATTTGGATAAGGTGGTGTCAATTTAGATGAAGTGGTATGACTTGACACTAAAATGAATTGACAATTGACACTTGAAATGTTGGAGGCCATGTGAACAAAATTTGAAGAAAATTAAGCCTCCAACGGTTTAATTTCCTACTTAAACGATCTTTCAAATCTAATTCGAGCCTATTTTTTTTAGTAGATTCCAACCACAAACTATATTGCCAATTGAACTATGCTTGCTTCGACAATTCAACCGTAATTGCTTTGACTATTGATTTATATAATTCATTAGATTGACACAACATACTATTTTGTCCAAAATAAAATGTCACTGGATAATTCACAATTGAGTAAAGCTAGGGTGAAAATATAGGGATAATATATGAACTTTAGAATGTTTTGGTACAATTAATAAGCTATTATCTTTTTTAAGTTTGACTAATTGGTTATTTTATATGATGAAATATCAATTTTGTTTAAGTTATAATCCGGATTTTGCTGCACCGTCCAGGTGTTTGACTGAAGTTGAGATTTGTGGAATTTGGAAGTTGAAATGGTGTTTGGTCGGCTGACTTGAAATGGTCTGGTTTCTTCAAAAATAAGATCCATGTCTTACGGAATGCGCTTTTTATTTATTTATTTATTTATTTTCTCTTTGTTTTTCTTGAGGAAAAGAAAAAGGAACCATAAATTGATTTCGTGAGCAGCAAATGATGCATTTGGTCAAGTGTCATTACTAATTTAAAATGCAATGGTCAATGGATTAAGTTGTATATTATTCAAGTGTTTCATATATATTTTGAAACTAGACTTTAATTTAAAAAGATCTTCCATGTGTGTTTAGAAATGTATCTATAAGTCAACCACCAAATGTTTAGTTGAATTTAAAATATCATATTATTGAGAGACAATTACAAATATAGTAATTAGATCCAAAGTATTGGCCTATATAACACAATATAAAAGAATTTGTAGATATAACAAAATTGGGCACCCAACATAGGATTACTCCAAGCTAAGCACACATAATTTTGGATTTTGGAGTTCTTATGATTGAGTCACCAAAAAGAAAAGTGCACCTTATTGGTATATGTAGTAACTTTCAATTTTTTTAAGTATTTCTTAACGATACTTTCATATCCTTAGGATCCTTCTTATTTAGATGTGATCTTGGTTCATTCATGTCTCCCTCCTAAACTCAAGTCGTTACATCCTTAATATTGTTGATCGAGTTGGGGGAAATGGTTTGGGGGAACAATTTCTCGTTATGACTTCTTCAATTTGTTTTTTGTTTTTTTTTTTGTTAGACGACAAGTAGGGTGAGAAGTTCGAACTTTTAATCTCAAAATTGACGGTACAAATTTATGTCAGTTGACCTATACTTGCGTTAGCAATTTCAATAAATAAAAAATTGATCTAATTGAGTGAACGAGAAAAGAAGAAACAAAAGCAGCCGAACTTTTGCATTTATGGACTGTGAGTTGGGCTCAACATTTCGAGCCCAAGCAGAGGCTCTTGGCCCAGCCTACGTTTTGGGAGAAGTGAAGTGGTGAACCCTTATATTAGGGAGAGAGAGAAGGAAAGAATAAAGAAAAAGAAAAAATACGAACCCGTTCCGGGTTCTCACCAATCACCACTTCGAATCTCCGACTTGTCGACCCGTCTGCAGAAGGCAGTGTCCAATCTCTTGTTCGTCGCTGTGTTTGATCGACGTTTCGCCGGTCAGCAAATAGCCTAACTTTGAGCAGGGACATAGAGTTACGGTATAATCTATAGCAGCGAAGATTTCATGTCGGTTCTGGAGCGGATTGCGAGGACCGCCCGAGCGTGGCGAATACTTACGTC
ATTCAAACTCAAGGATTCTTCGAGCGCATCCCGGAGTTTATATCTGAGTAAGTTTCTCTTCTTATTGCTGGTAATCATCGTTTCTAAACTACAGTTAATCACGGTGTTTCTGTCTGAATTCTAGTTTTACTGATACTTTGTGTTACGTCACGCGTATATATATATCGGAGTTGCTTATTTTCTGTTTGACATGGATATGTAATCTTTGTTCCAAATTATAGTTTCTATAACGGACTGAGCCGTGCTGTTCTCTTTTTTATGTCGATGGTTATTTGTTGTTAAGTGGGCGGGAGCTATGGGTTGCTTGATTCGCTGTTCTTTTGAATTATAACAATGTTCTTCAGCCGTTGATATTTTATATTAGAAATGAAACTTCATTTGTTTAAAACGAACGTTGATATTTTATATTAGAAATGAAACTTCATTTGTTTAAAACGAATTGGTTTGGCAAGTGAAGAAGGCTTATCATATGTATATGCAGGACATTTTCGGCATCAGCAAGAGCGTCTAAAACCAGGATCGCTGCCTAATGGACGTGACCCTTGCTTTCTCTATCCAGCTATCATTGCTGGTATGTTTGGTGTTGGGGCAATGGAAATAGCGTATGCAGAAGCTGAAGAGGTTCGTCCTTGATGATTAATTCCTTGGAGAGTCGTAGACTGTTTCTGATATTTTTCCCAAGATATCTGATGGTTCATAATTGAAATTCAGTTTTTTATCTTACTTGATCCTAATTATATATCGGTCATTTCTTTATCTGGTCAATTTTTTTCTGAATTGTTTGAATTGATTTAGTGATGACTTTAGCTATCTGCAGCCTTCTGTCACTCCACCACCACCAAAGGATCTCTCTACCCATGCTGATATGGAGGAGATTGCTAAGAAAGAGAGACAGCGCATAACTGAACAGTTAACCAGGAACAAAGGAACGAAGTATAGTGCTTGTCCGCGATTTACTGTTGGAGTTAAGGGGCAAAAGGTACTGTCTATTTGTTCTCTTTTTGACTCATCGGTTTTGTAGTTGCTCTATAATCAAATATTTAGATCCATGCTGAGTATTTGAGCCAATAATTATCTTTGGAAACAGTAAATGTACCATCTGAAAATGAAGATTGGTTGAACAACCAGTGCTTCAATGTGCTGTTTTTCTTTATTTGTTACCAGTTTGAGTTTGTTACTTGTGAAACATGTATCGTAAATATATGTTCGGATAAAATATATGATGTAGGTACTTCAATAGGTCTGAGATGCAAGCTTTGTATTTTATAATTAGCTTACTAAACAATTCTTTAGTTCCAAATAAAAGGGCACGCTCGGTATTTACCCTTTAGTTACCGTTGAGCGTCCTTCCTCCTGTATAAATAAAGCATGTTTATGAATTAATAAAGGCATTACTCCACTTTTGTTAAAGATTCAATTAGTCTCCAAGATTGGATTCACCAATATAATTTTTATATTTAACACTGTAAGATATATGCAATTGGGATTATGTTTTAATTCTCTAGAATGAGATGCAGGTCAGTATCAAATTCCAAGTTCCTTCCAGTTGTGAAGTTTCACATCTGATTGCAAATCTTGTATCAAATCTTGGACTGAAGGTTGAAGAAAGTGCTGGTGGTTCAGATATGTTGTTGCGTGCATGGGACAGGTACATGCACCTGTATATTTGAATTTCCATGAATCATTTAGTTGCAATTATTCAACAGTTGTTGACAACGATGTCAAGCAATTTTCAGTTTTGAGGACGAAGGGATTTTACATTTGTTTTTTTGTTTCTTCTTTTCTAAAAAATAGTTATAGACATTGTTTGCTGCCTTCATTTGTTTTTCTTTTTTTCATTTTTCTTACATTGCCCATTATGAAGTTGTTAATTGTACCCTGTTATATATTGAATATTGCAGTCCAGTTGCTTGGCAACTAACTCTTACTCGACCTAAAACTCAGAGAGAAGCTGGAGGAAACAAGGGAAACCCAATAGAAATGGATGCTGAGGATGGAGA
TCTAAGCGTCCTGATATTTCATTCACTCGTTACCTCAGATAAAACTGTCAGTGCTAATTTTAAATAAAACATCATTTACAAATTATGGATTCATTTTTATTAATTAACTTATTCTAGGAGCAAAAATTTAAATAGAAACTTTTCATAATGGCTATTGCACATTGCTACAGGAAATTGAATTCATAAAGCAGGGAAGCTTGAGCACCAAAGAGCTTGATTCTTTGGTTTCTGTTTTACAATTAGCCGGTGGAAGGTTGGGAGAAAGTAGATCCTTTGAAAGAAAATCGAGGGAAGGGTCTACACAAATGCCATCTTCCGAAAAATCTATATCTAGTCTTGAAGCAATGGGTGTGAAAGTTTATGGACTTGACGAGCCTCATGAATATTCCTCTAAAAATGAGATATCCTGGGATAATATTGCTGGTTATGATCAACAGAAACGGTAGGCTCTGCTGTCTATTGCATACTTTTGTAGTTCTCATTAAATATAGGACATATATATGGATAGGCATTGACGATGAACCAGTACCACCCTGTAATGCTTCAATAAATTTTGACTTAAAATTGGAACCTTATATAAGGTAAAGAAGAAAGGCAGTATGCAAAATGGATTATATGGTTGCTCTTCTCTTAAAGTTCTGTGAATGTTTTAAAATTAGTCTGCAATACTAAATTTTTTTCCTACAAGGATGAAATAATGAAAAGAGATTAATGATCAGACGATACAAACTCCACAAGGGGTGAATAAGAAAAGCAATAAATAAAAACGATTACATAAATAAATGTCATATAAAGAAAAAATCAGCAAAGGGCTTGGAAATTTAAGCGTGCAAAATCAAACTGATCATACTAAGGGAGATGGAGATGCTTGTCTGAAAAAATTCTCTGATTCCTTTCCATCCACAACGCTGATAAAAGAGTTTGATCATATTTAAGCATAGTAACTTCGATTTTGAAGCCAAAAAATAACCAAAAATCACCTGAAGAACATTACCCCTTTAACACAATTGAGAAAATCCACCTTGTGTATGGCAGAGAGAGAAGCTTGAACAAACACTTCAACGAATAGGAGCAACAAAAGAAGATATGGACAAGATAATCTCCATTCTCTTGGCTAAGAGGATAGTTTGCAATGGCTTATCCGTTTCATTTTATTACTTTCTGATTTCTATGAATTCGTGAAGGTTTCATTAAAGATGATATAATCTATGGTCTCTGCTTCATGTTTATTGTTTGTAGATATAGATCATCTACGTTTGTTACGAGAATAATGTTTTTCTCATTTCATTTGCTCATAATCTTATTGTAGTGAAATAGAAGACACAATACTGATGACTCTACATAATCCAGAATTATTTGATGATATTGCTCATGGGACTCGGCGCAAGTTTGAATCAAATAGACCTCGAGCTGTTCTCTTTGAAGGGCCTCCAGGTTTGTTACTGAATTGAGAATGTCTGTGAATATGATAATATTTCAGGAAATGACCTTCTTTACTAGGTTATGCATATTCCTTATCTTGGCCTGTTTTTTGTGGGCTCAAACTCTCGTTTTCTCGTCATATAGTGCCAACTGCCCATAGAAATATGAGAGAATGAAAATCAACTAAATAATTTTATGGAACTTGCAAATATTTCTTTTTTTTATTGTTTCTTCTGTCAGGGTTAAGAAACTACCATTCATTTGGTATTAATATTAACCATGCTATATGTTATAGATGTTACATTTAGTTTAAAAACCAATGTTAAGTTTACTTTTGTTTCTTTTTGTTTGCTTTACTATTTGAAGTTATGTAAACTATCCAGAATGATAGTCCTTTAAGGAAAACATGCAATTTATTAAATTTCAATTCTTCCACTGATTTTTCATTAGCGAAATATATCTTCAAAGAGGATACGAGATCTTCCAAATCTTATAGGGAGAACATCACAAGTGCTTTAGTATTTTCATTTTTTTTCATCGATATTATATCTCAAATCCCTTCTCTCTGTGTCCACACT
CTATCTTTTTCGGACTGAATGCACCCTGGATTAATCGACTTGTTTTTCTTCATGCCCAAGTAAAAATAGGATTTGCTGCTCTTAAAATAGGGTGACACGTGCTACATTAGTAGAATACAATTATCCACTTCCGAATATCCTTTTGAGGTTTCTTATACAAAATAATAGTAATAATAATAATCTACTACTTTCGGTCCAGGATGCGAAATTTTCTTGTACTTGGTTTTTTAACAAAAAAAAACTACCACATTTTAAAACCTTGCTTGCCATTGGCATTGAGATTCAGCAGAGTTGTGAGATCCCTGTAATAGTTTTTTGCCTTTCTAGTTTAGAAATTTTGGATAAGTTATGAGCGCAGCTTACTTGTAGCATCATTGGTTCAGGTACCGGTAAAACATCGTCTGCTCGTGTAATTGCAAATCAAGCTGTAAGTCATTATATGCTTTTTGGTACTCAACTCGTTAAATTTGTTATCTTTGCTGGTCGCTTTGAGGATGAGATCACTTACCCTATCCTACTGACTACTCTATATACTTTCAACTCAGGGAAAAGTACAGTGTAGTTTAAATTATTGTTTTCATGCATCACTTTGTTCTCAGCGGGTACTCAAATATGTATTGCATTTCATCTAATTTTTTTCAGTCCTAGGAAATGATGTCTGCTTGCTTTGCAACAAAGAGTTTTCTAATCTAATTACGCAAAATGGTCTTGGACCTCATAAGCTGCTGATCTCGTGAATTAAGTATGTGATACGTAGAAGGAACCTAAAAATTTGTACCCTCACGTTCTGTGTCAGGGCGTCCCGTTGGTGTATGTACCACTTGAGGTTATCATGTCGAAGTACTATGGTGAAAGTGAACGGCTATTGGGAAAAGTGTTTTCACTTGCTAATGAACTCAGTACCGGTGCTATCATTTTCCTAGACGAGGTAAAAGACATGGTCATCGCCTGATCATCAACTTCTTTTTTTTAATCAAATTATTTTTATTTTTATTATTATTTACAATTTACATATTCCTTTAGATTTTATTATTTGTGATTTTCCCCCGCTTCTTTTCCTATTGCAGGTTGATTCTTTTGCAATTTCTCGTGACAGTGAAATACATGAAGCCACTCGAAGAGTATTATCAGTGTTATTACGTCAGGTAAAGATGAATTTACATGAATCCATAACTCCAAAAGTCATGTGCCTTCAGGCTTCAGCGCATCTGTGAATCAACTGAACATAAATGTTCGACTCAAAAACATGTGCATGCACACCAAGAGCCTAATTTGGATGGCAGTCAACTGGAATTTCATGTGTCATTTACGTAAAAAAAACGTATCAACAGGTCTTGCTTTTTGAGCAACGTAGTTTAATTTACAATTGCCGTGCAGATCGATGGATTCGAGCAGGACAGGAAAGTGGTTGTAATTGCTGCTACGAATAGAAAGCAAGACCTTGATCCTGCTTTAATTAGGTATGCAGATCTGTACAGTTTTTTTTTTCTAAAAAAAAAATTTATCGACATTGATTGCGATGTGCATTAAAAGAAGAAGAGCGCGCGCACAAAACCGAATGCTAAGTTGCTGGGTTTCTGAACTAATTTCTTTTCCTGAATTTTCGTTTAGTCGGTTTGACTTGATGATCACATTTGGGTTACCTGATGAGCGGAACCGGGAGGAAATAGCAGCTCAGTATGCAAAGCAGCTAACGAAACCTGAATTGAGCGAGTTCGCCAGAAACACAGAAGGGTAAGTTAATCGTAGGCCCGTAACTACATATGATTTCTTTCGGTAGTCTAGGCATTGTTCGTATTTGTCTGCTATTCTGCTAATGGTCCACATACAATACAAGCCTCTCTCACTAGAATATAGTTTTGTCGACTTCTTTGTTTCTAGGGATGTAGTTGTAACAATCTCTAAATCTACTGTCTTTTTGCTTCGCTTCGCATGTATTTTCAGAATGTCTGGAAGGGACATCAGGGATATTTGTCAGCAAGCCGAGCGTTCATGGGCAT
CAAAGGCAAACTACCCGTCCTTTATTTCATCGATCCTAGAGTTCCTTTTTGGTGGCCATAAGTATAACCTCTCTTTCTGGGTTTCTTTTTACAGATAATTCGAGGGAAAGTATCGAAAACTGGAGAACATGGAATACTTCCACCCCTTGAAGAATATATCGAGTGTGCCATGAGCAGGCGCAAGGCTTTGCAAACTATTGATGACCATGAAATCAAGGACTCCAACATCCGCACGAAGAAAACTCAATTAGCTTGA

mRNA sequence

ATGGAGATTCCCTTCGAAAATCCCCAATCTCAGCCTCTCTCCCGGCCCACCGGCGCCAGAAGAAGCGGCAGCTTCCCCACCTCCCCTGAGTTCGAGTTCTGGATGGTTCGAAACCCCTCTTTCCCTCAGCCCAATCTTCTTTCTGCCGACGAGCTCTTCGTTGATGGCGTTCTTCTTCCCCTTCACCTTGTATCCAACCACTCCCCATCTCAGTCCACTGACCCTAACCAGAAATCTGACCTGGAACCTCCTCCCTCCGAACCCGATCCCAGCGACGGCCCCAAATTGACGCCCAATTCGGCGGATTCGGGTTCTTCGTTAACCTCGTCGAAGCGGTGGAGCATTTTCAAGAAGAGCGAGAAGAAGAACGTCACGGGTAATCAGGAGGATCGAGACAAGGAGAAGAAGAAAGAGAAGAAGACTGGGAATGGGTCTACATCGGCCGAGTTGAATATCAATATTTGGCCCTTTTCGCGTAGTAGATCCGCTGGGAATGCTTTCACTAGGCCTAAAATGTTCCCTGGCGGCCAACCCGGATCCCGGAAGGTCAACAGTGCGCCGTGTTCTCGCAGCAACTCCGCCGGCGAATCCAAGTCCAGGAAGTGGCCGAGCAGCCCGAGCCGCGCTGGCGTCCATCTCGGCCGGAGTAGCCCAGTTTGGCAGGTCCGCCGCGGCGGATCCGCTCCCAAAACCTCCGAAAACCTCTCTCGCAATGCCGAAAAACCCGCCCGGAAAGAACCCACGGACGCGCACCGGAGCAAGGCTGTAGCCGCCTCCTCCTCCGCCTCCAGAGTTAGAGTTTTGAATTTGAATGTCCCCATGTGTATTGGCGAAGATTTCATGTCGGTTCTGGAGCGGATTGCGAGGACCGCCCGAGCGTGGCGAATACTTACGTCATTCAAACTCAAGGATTCTTCGAGCGCATCCCGGAGTTTATATCTGAGACATTTTCGGCATCAGCAAGAGCGTCTAAAACCAGGATCGCTGCCTAATGGACGTGACCCTTGCTTTCTCTATCCAGCTATCATTGCTGGTATGTTTGGTGTTGGGGCAATGGAAATAGCGTATGCAGAAGCTGAAGAGCCTTCTGTCACTCCACCACCACCAAAGGATCTCTCTACCCATGCTGATATGGAGGAGATTGCTAAGAAAGAGAGACAGCGCATAACTGAACAGTTAACCAGGAACAAAGGAACGAAGTATAGTGCTTGTCCGCGATTTACTGTTGGAGTTAAGGGGCAAAAGAATGAGATGCAGGTCAGTATCAAATTCCAAGTTCCTTCCAGTTGTGAAGTTTCACATCTGATTGCAAATCTTGTATCAAATCTTGGACTGAAGGTTGAAGAAAGTGCTGGTGGTTCAGATATGTTGTTGCGTGCATGGGACAGTCCAGTTGCTTGGCAACTAACTCTTACTCGACCTAAAACTCAGAGAGAAGCTGGAGGAAACAAGGGAAACCCAATAGAAATGGATGCTGAGGATGGAGATCTAAGCGTCCTGATATTTCATTCACTCGTTACCTCAGATAAAACTGAAATTGAATTCATAAAGCAGGGAAGCTTGAGCACCAAAGAGCTTGATTCTTTGGTTTCTGTTTTACAATTAGCCGGTGGAAGGTTGGGAGAAAGTAGATCCTTTGAAAGAAAATCGAGGGAAGGGTCTACACAAATGCCATCTTCCGAAAAATCTATATCTAGTCTTGAAGCAATGGGTGTGAAAGTTTATGGACTTGACGAGCCTCATGAATATTCCTCTAAAAATGAGATATCCTGGGATAATATTGCTGGTTATGATCAACAGAAACGTGAAATAGAAGACACAATACTGATGACTCTACATAATCCAGAATTATTTGATGATATTGCTCATGGGACTCGGCGCAAGTTTGAATCAAATAGACCTCGAGCTGTTCTCTTTGAAGGGCCTCCAGGTACCGGTAAAACATCGTCTGCTCGTGTAATTGCAAATCAAGCTGGCGTCCCGTTGGTGTATGTACCACT
TGAGGTTATCATGTCGAAGTACTATGGTGAAAGTGAACGGCTATTGGGAAAAGTGTTTTCACTTGCTAATGAACTCAGTACCGGTGCTATCATTTTCCTAGACGAGGTTGATTCTTTTGCAATTTCTCGTGACAGTGAAATACATGAAGCCACTCGAAGAGTATTATCAGTGTTATTACGTCAGATCGATGGATTCGAGCAGGACAGGAAAGTGGTTGTAATTGCTGCTACGAATAGAAAGCAAGACCTTGATCCTGCTTTAATTAGTCGGTTTGACTTGATGATCACATTTGGGTTACCTGATGAGCGGAACCGGGAGGAAATAGCAGCTCAGTATGCAAAGCAGCTAACGAAACCTGAATTGAGCGAGTTCGCCAGAAACACAGAAGGGGATGTAGTTGTAACAATCTCTAAATCTACTGTCTTTTTGCTTCGCTTCGCATGTATTTTCAGAATGTCTGGAAGGGACATCAGGGATATTTGTCAGCAAGCCGAGCGTTCATGGGCATCAAAGGCAAACTACCCGTCCTTTATTTCATCGATCCTAGAGTTCCTTTTTGGTGGCCATAAGTATAACCTCTCTTTCTGGATAATTCGAGGGAAAGTATCGAAAACTGGAGAACATGGAATACTTCCACCCCTTGAAGAATATATCGAGTGTGCCATGAGCAGGCGCAAGGCTTTGCAAACTATTGATGACCATGAAATCAAGGACTCCAACATCCGCACGAAGAAAACTCAATTAGCTTGA

Coding sequence (CDS)

ATGGAGATTCCCTTCGAAAATCCCCAATCTCAGCCTCTCTCCCGGCCCACCGGCGCCAGAAGAAGCGGCAGCTTCCCCACCTCCCCTGAGTTCGAGTTCTGGATGGTTCGAAACCCCTCTTTCCCTCAGCCCAATCTTCTTTCTGCCGACGAGCTCTTCGTTGATGGCGTTCTTCTTCCCCTTCACCTTGTATCCAACCACTCCCCATCTCAGTCCACTGACCCTAACCAGAAATCTGACCTGGAACCTCCTCCCTCCGAACCCGATCCCAGCGACGGCCCCAAATTGACGCCCAATTCGGCGGATTCGGGTTCTTCGTTAACCTCGTCGAAGCGGTGGAGCATTTTCAAGAAGAGCGAGAAGAAGAACGTCACGGGTAATCAGGAGGATCGAGACAAGGAGAAGAAGAAAGAGAAGAAGACTGGGAATGGGTCTACATCGGCCGAGTTGAATATCAATATTTGGCCCTTTTCGCGTAGTAGATCCGCTGGGAATGCTTTCACTAGGCCTAAAATGTTCCCTGGCGGCCAACCCGGATCCCGGAAGGTCAACAGTGCGCCGTGTTCTCGCAGCAACTCCGCCGGCGAATCCAAGTCCAGGAAGTGGCCGAGCAGCCCGAGCCGCGCTGGCGTCCATCTCGGCCGGAGTAGCCCAGTTTGGCAGGTCCGCCGCGGCGGATCCGCTCCCAAAACCTCCGAAAACCTCTCTCGCAATGCCGAAAAACCCGCCCGGAAAGAACCCACGGACGCGCACCGGAGCAAGGCTGTAGCCGCCTCCTCCTCCGCCTCCAGAGTTAGAGTTTTGAATTTGAATGTCCCCATGTGTATTGGCGAAGATTTCATGTCGGTTCTGGAGCGGATTGCGAGGACCGCCCGAGCGTGGCGAATACTTACGTCATTCAAACTCAAGGATTCTTCGAGCGCATCCCGGAGTTTATATCTGAGACATTTTCGGCATCAGCAAGAGCGTCTAAAACCAGGATCGCTGCCTAATGGACGTGACCCTTGCTTTCTCTATCCAGCTATCATTGCTGGTATGTTTGGTGTTGGGGCAATGGAAATAGCGTATGCAGAAGCTGAAGAGCCTTCTGTCACTCCACCACCACCAAAGGATCTCTCTACCCATGCTGATATGGAGGAGATTGCTAAGAAAGAGAGACAGCGCATAACTGAACAGTTAACCAGGAACAAAGGAACGAAGTATAGTGCTTGTCCGCGATTTACTGTTGGAGTTAAGGGGCAAAAGAATGAGATGCAGGTCAGTATCAAATTCCAAGTTCCTTCCAGTTGTGAAGTTTCACATCTGATTGCAAATCTTGTATCAAATCTTGGACTGAAGGTTGAAGAAAGTGCTGGTGGTTCAGATATGTTGTTGCGTGCATGGGACAGTCCAGTTGCTTGGCAACTAACTCTTACTCGACCTAAAACTCAGAGAGAAGCTGGAGGAAACAAGGGAAACCCAATAGAAATGGATGCTGAGGATGGAGATCTAAGCGTCCTGATATTTCATTCACTCGTTACCTCAGATAAAACTGAAATTGAATTCATAAAGCAGGGAAGCTTGAGCACCAAAGAGCTTGATTCTTTGGTTTCTGTTTTACAATTAGCCGGTGGAAGGTTGGGAGAAAGTAGATCCTTTGAAAGAAAATCGAGGGAAGGGTCTACACAAATGCCATCTTCCGAAAAATCTATATCTAGTCTTGAAGCAATGGGTGTGAAAGTTTATGGACTTGACGAGCCTCATGAATATTCCTCTAAAAATGAGATATCCTGGGATAATATTGCTGGTTATGATCAACAGAAACGTGAAATAGAAGACACAATACTGATGACTCTACATAATCCAGAATTATTTGATGATATTGCTCATGGGACTCGGCGCAAGTTTGAATCAAATAGACCTCGAGCTGTTCTCTTTGAAGGGCCTCCAGGTACCGGTAAAACATCGTCTGCTCGTGTAATTGCAAATCAAGCTGGCGTCCCGTTGGTGTATGTACCACT
TGAGGTTATCATGTCGAAGTACTATGGTGAAAGTGAACGGCTATTGGGAAAAGTGTTTTCACTTGCTAATGAACTCAGTACCGGTGCTATCATTTTCCTAGACGAGGTTGATTCTTTTGCAATTTCTCGTGACAGTGAAATACATGAAGCCACTCGAAGAGTATTATCAGTGTTATTACGTCAGATCGATGGATTCGAGCAGGACAGGAAAGTGGTTGTAATTGCTGCTACGAATAGAAAGCAAGACCTTGATCCTGCTTTAATTAGTCGGTTTGACTTGATGATCACATTTGGGTTACCTGATGAGCGGAACCGGGAGGAAATAGCAGCTCAGTATGCAAAGCAGCTAACGAAACCTGAATTGAGCGAGTTCGCCAGAAACACAGAAGGGGATGTAGTTGTAACAATCTCTAAATCTACTGTCTTTTTGCTTCGCTTCGCATGTATTTTCAGAATGTCTGGAAGGGACATCAGGGATATTTGTCAGCAAGCCGAGCGTTCATGGGCATCAAAGGCAAACTACCCGTCCTTTATTTCATCGATCCTAGAGTTCCTTTTTGGTGGCCATAAGTATAACCTCTCTTTCTGGATAATTCGAGGGAAAGTATCGAAAACTGGAGAACATGGAATACTTCCACCCCTTGAAGAATATATCGAGTGTGCCATGAGCAGGCGCAAGGCTTTGCAAACTATTGATGACCATGAAATCAAGGACTCCAACATCCGCACGAAGAAAACTCAATTAGCTTGA

Protein sequence

MEIPFENPQSQPLSRPTGARRSGSFPTSPEFEFWMVRNPSFPQPNLLSADELFVDGVLLPLHLVSNHSPSQSTDPNQKSDLEPPPSEPDPSDGPKLTPNSADSGSSLTSSKRWSIFKKSEKKNVTGNQEDRDKEKKKEKKTGNGSTSAELNINIWPFSRSRSAGNAFTRPKMFPGGQPGSRKVNSAPCSRSNSAGESKSRKWPSSPSRAGVHLGRSSPVWQVRRGGSAPKTSENLSRNAEKPARKEPTDAHRSKAVAASSSASRVRVLNLNVPMCIGEDFMSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYPAIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTKYSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRAWDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGSLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRMSGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILPPLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA
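
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch, assuming the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Unknown codons (for example those containing N) become 'X';
    a trailing partial codon is ignored. Stop codons are shown as '*'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGAGATTCCCTTCGAAAATCCC"))  # start of the protein
print(translate("CAATTAGCTTGA"))              # end of the protein, with stop
```

The two fragments translate to `MEIPFENP` and `QLA*`, which agree with the start and end of the protein sequence shown above.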
Homology
BLAST of HG10016220 vs. NCBI nr
Match: XP_038882484.1 (26S proteasome subunit RPT4 isoform X1 [Benincasa hispida])

HSP 1 Score: 1069.7 bits (2765), Expect = 1.4e-308
Identity = 565/636 (88.84%), Positives = 572/636 (89.94%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQ E LKPGSL NGR+PCFLYP
Sbjct: 1   MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQPEHLKPGSLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AIIAGMFGVGAMEIAYAEAEE SVTPPPPKDLSTHADMEEIAKKERQRI EQLTRNKGTK
Sbjct: 61  AIIAGMFGVGAMEIAYAEAEESSVTPPPPKDLSTHADMEEIAKKERQRIIEQLTRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRF VGVKG K    VSIKFQVPSSCEVSHL+ANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFMVGVKGPK----VSIKFQVPSSCEVSHLLANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS 520
           WDSPVAWQLTLTRPKTQRE+G NKGN +EMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS
Sbjct: 181 WDSPVAWQLTLTRPKTQRESGENKGNSLEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS 240

Query: 521 LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 580
           LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSE SISSLEAMGVKVYGLDEP
Sbjct: 241 LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSENSISSLEAMGVKVYGLDEP 300

Query: 581 HEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE 640
            EYSSK+EISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE
Sbjct: 301 GEYSSKSEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE 360

Query: 641 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 700
           GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL
Sbjct: 361 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 420

Query: 701 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL 760
           DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL
Sbjct: 421 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL 480

Query: 761 MITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRMS 820
           MITFGLPDERNREEIAAQYAKQLTKPEL+EFARNTEG                     MS
Sbjct: 481 MITFGLPDERNREEIAAQYAKQLTKPELNEFARNTEG---------------------MS 540

Query: 821 GRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILPP 880
           GRDIRDICQQAERSWASK                         IIRGKVSKTGEHGILPP
Sbjct: 541 GRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHGILPP 586

Query: 881 LEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           LEEYIECAMSRRKALQTID+HEIKD N RTKKTQLA
Sbjct: 601 LEEYIECAMSRRKALQTIDNHEIKDPNFRTKKTQLA 586
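
The percentages on each HSP summary line are simply the match counts divided by the alignment length. Below is a minimal sketch that re-derives them from a summary line; the function name `check_fractions` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some BLAST report renderings use.

```python
import re

# Matches e.g. "Identity = 565/636 (88.84%)"; "Posi?tives" tolerates a misspelt label.
FRACTION = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_fractions(line):
    """Return {label: (recomputed_percent, reported_percent)} for one summary line."""
    results = {}
    for label, num, den, pct in FRACTION.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        results[key] = (round(100 * int(num) / int(den), 2), float(pct))
    return results

line = "Identity = 565/636 (88.84%), Positives = 572/636 (89.94%), Query Frame = 0"
for label, (computed, reported) in check_fractions(line).items():
    print(label, computed, reported)
```

For the hit above, both recomputed values match the reported ones to two decimal places.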

BLAST of HG10016220 vs. NCBI nr
Match: XP_008440044.1 (PREDICTED: katanin p60 ATPase-containing subunit A1 isoform X3 [Cucumis melo])

HSP 1 Score: 1029.2 bits (2660), Expect = 2.1e-296
Identity = 541/636 (85.06%), Positives = 557/636 (87.58%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSV  RI+RTARAWRIL SFKLKDS SASRS YLR F HQ E LKPG L NGR+PCFLYP
Sbjct: 1   MSVFNRISRTARAWRILASFKLKDSPSASRSFYLRQFGHQPEHLKPGLLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPPKDLSTHADME++AKKER RI EQL RNKGTK
Sbjct: 61  AIVAGMLGVGAMEIAYAEAEESAATPPPPKDLSTHADMEDVAKKERLRIIEQLNRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS 520
           WDSPVAWQLTL+RPK+QR AG NKGN IEMDAEDGDL+VLIFHSLVTSDKTEIEFIKQGS
Sbjct: 181 WDSPVAWQLTLSRPKSQRVAGENKGNSIEMDAEDGDLTVLIFHSLVTSDKTEIEFIKQGS 240

Query: 521 LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 580
           LSTKELDSLVSVLQLAGGRLGESRS ERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP
Sbjct: 241 LSTKELDSLVSVLQLAGGRLGESRSSERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 300

Query: 581 HEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE 640
           H  S++NEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRR+FESNRPRAVLFE
Sbjct: 301 HVNSTENEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRRFESNRPRAVLFE 360

Query: 641 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 700
           GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL
Sbjct: 361 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 420

Query: 701 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL 760
           DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPALISRFDL
Sbjct: 421 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPALISRFDL 480

Query: 761 MITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRMS 820
           MITFGLPDERNREEIA QYAKQLTKPEL EFARNT+G                     MS
Sbjct: 481 MITFGLPDERNREEIAVQYAKQLTKPELKEFARNTDG---------------------MS 540

Query: 821 GRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILPP 880
           GRDIRDICQQAERSWASK                         IIRGKVSKTGEHGILPP
Sbjct: 541 GRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHGILPP 586

Query: 881 LEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           +EEYIECAM RRKALQTIDDHEIKD NIRTKKTQLA
Sbjct: 601 IEEYIECAMKRRKALQTIDDHEIKDPNIRTKKTQLA 586
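
Each HSP line reports a bit score followed by the raw score in parentheses. The two are related by the Karlin-Altschul normalisation S' = (λS − ln K) / ln 2. The sketch below assumes the usual gapped BLASTP defaults (BLOSUM62, gap open 11, gap extend 1, for which λ ≈ 0.267 and K ≈ 0.041); the page does not state which parameters were used, so these values are an assumption, and `bit_score` is a hypothetical helper.

```python
import math

# Assumed gapped Karlin-Altschul parameters for BLOSUM62 with gap costs 11/1.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score):
    """Convert a raw alignment score to a bit score: (lambda*S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)

# Raw scores taken from the HSP lines in this report.
for raw in (2765, 2660, 349):
    print(raw, round(bit_score(raw), 1))
```

With these assumed parameters the raw scores 2765, 2660 and 349 give bit scores that round to the reported 1069.7, 1029.2 and 139.0, which suggests, but does not prove, that the default scoring system was used.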

BLAST of HG10016220 vs. NCBI nr
Match: XP_004134780.1 (ATPase family AAA domain-containing protein 1-B isoform X1 [Cucumis sativus] >KGN49077.1 hypothetical protein Csa_003807 [Cucumis sativus])

HSP 1 Score: 1023.8 bits (2646), Expect = 8.8e-295
Identity = 538/636 (84.59%), Positives = 562/636 (88.36%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSVL RI+RTARAWRIL SFKLK+SSSAS+S YLRHF HQ E  KPGSL NGR+PCFLYP
Sbjct: 1   MSVLNRISRTARAWRILASFKLKESSSASQSFYLRHFGHQPEHFKPGSLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPP+DLSTHADME+IAKKER RITEQL RNKGTK
Sbjct: 61  AIVAGMVGVGAMEIAYAEAEESTSTPPPPRDLSTHADMEDIAKKERLRITEQLKRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEE+AGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEETAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS 520
           WDSPVAWQLTL+RPK+Q+EAG NKGN IEMDA+DGDL+VLIFHSL+TSDKTEIEFIKQGS
Sbjct: 181 WDSPVAWQLTLSRPKSQKEAGENKGNSIEMDADDGDLTVLIFHSLITSDKTEIEFIKQGS 240

Query: 521 LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 580
           LSTKELDSLVSVLQLAGGRLGESRSFERKS+E STQMPSSEKSISSLEAMGVKVYGLD P
Sbjct: 241 LSTKELDSLVSVLQLAGGRLGESRSFERKSKEESTQMPSSEKSISSLEAMGVKVYGLDGP 300

Query: 581 HEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE 640
           H  S+KNEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRRKFESN+PRAVLFE
Sbjct: 301 HLNSTKNEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRKFESNKPRAVLFE 360

Query: 641 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 700
           GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLAN+LSTGAIIFL
Sbjct: 361 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANDLSTGAIIFL 420

Query: 701 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL 760
           DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPALISRFD+
Sbjct: 421 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPALISRFDM 480

Query: 761 MITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRMS 820
           MITFGLPDERNREEIAAQYAKQLT+PEL EFARNTEG                     MS
Sbjct: 481 MITFGLPDERNREEIAAQYAKQLTQPELKEFARNTEG---------------------MS 540

Query: 821 GRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILPP 880
           GRDIRDICQQAERSWASK                         IIRGKVSKTGEHGILPP
Sbjct: 541 GRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHGILPP 585

Query: 881 LEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           LEEYIECAM+RRKALQTIDDHEIKD N RTKKTQLA
Sbjct: 601 LEEYIECAMNRRKALQTIDDHEIKDPN-RTKKTQLA 585

BLAST of HG10016220 vs. NCBI nr
Match: XP_008440043.1 (PREDICTED: probable 26S protease regulatory subunit 10B isoform X2 [Cucumis melo])

HSP 1 Score: 1023.5 bits (2645), Expect = 1.2e-294
Identity = 541/640 (84.53%), Positives = 557/640 (87.03%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSV  RI+RTARAWRIL SFKLKDS SASRS YLR F HQ E LKPG L NGR+PCFLYP
Sbjct: 1   MSVFNRISRTARAWRILASFKLKDSPSASRSFYLRQFGHQPEHLKPGLLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPPKDLSTHADME++AKKER RI EQL RNKGTK
Sbjct: 61  AIVAGMLGVGAMEIAYAEAEESAATPPPPKDLSTHADMEDVAKKERLRIIEQLNRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKT----EIEFI 520
           WDSPVAWQLTL+RPK+QR AG NKGN IEMDAEDGDL+VLIFHSLVTSDKT    EIEFI
Sbjct: 181 WDSPVAWQLTLSRPKSQRVAGENKGNSIEMDAEDGDLTVLIFHSLVTSDKTVSANEIEFI 240

Query: 521 KQGSLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYG 580
           KQGSLSTKELDSLVSVLQLAGGRLGESRS ERKSREGSTQMPSSEKSISSLEAMGVKVYG
Sbjct: 241 KQGSLSTKELDSLVSVLQLAGGRLGESRSSERKSREGSTQMPSSEKSISSLEAMGVKVYG 300

Query: 581 LDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRA 640
           LDEPH  S++NEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRR+FESNRPRA
Sbjct: 301 LDEPHVNSTENEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRRFESNRPRA 360

Query: 641 VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA 700
           VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA
Sbjct: 361 VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA 420

Query: 701 IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALIS 760
           IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPALIS
Sbjct: 421 IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPALIS 480

Query: 761 RFDLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACI 820
           RFDLMITFGLPDERNREEIA QYAKQLTKPEL EFARNT+G                   
Sbjct: 481 RFDLMITFGLPDERNREEIAVQYAKQLTKPELKEFARNTDG------------------- 540

Query: 821 FRMSGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHG 880
             MSGRDIRDICQQAERSWASK                         IIRGKVSKTGEHG
Sbjct: 541 --MSGRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHG 590

Query: 881 ILPPLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           ILPP+EEYIECAM RRKALQTIDDHEIKD NIRTKKTQLA
Sbjct: 601 ILPPIEEYIECAMKRRKALQTIDDHEIKDPNIRTKKTQLA 590

BLAST of HG10016220 vs. NCBI nr
Match: XP_008440042.1 (PREDICTED: katanin p60 ATPase-containing subunit A1 isoform X1 [Cucumis melo])

HSP 1 Score: 1022.7 bits (2643), Expect = 2.0e-294
Identity = 541/642 (84.27%), Positives = 557/642 (86.76%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSV  RI+RTARAWRIL SFKLKDS SASRS YLR F HQ E LKPG L NGR+PCFLYP
Sbjct: 1   MSVFNRISRTARAWRILASFKLKDSPSASRSFYLRQFGHQPEHLKPGLLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPPKDLSTHADME++AKKER RI EQL RNKGTK
Sbjct: 61  AIVAGMLGVGAMEIAYAEAEESAATPPPPKDLSTHADMEDVAKKERLRIIEQLNRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKT------EIE 520
           WDSPVAWQLTL+RPK+QR AG NKGN IEMDAEDGDL+VLIFHSLVTSDKT      EIE
Sbjct: 181 WDSPVAWQLTLSRPKSQRVAGENKGNSIEMDAEDGDLTVLIFHSLVTSDKTVSANVKEIE 240

Query: 521 FIKQGSLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKV 580
           FIKQGSLSTKELDSLVSVLQLAGGRLGESRS ERKSREGSTQMPSSEKSISSLEAMGVKV
Sbjct: 241 FIKQGSLSTKELDSLVSVLQLAGGRLGESRSSERKSREGSTQMPSSEKSISSLEAMGVKV 300

Query: 581 YGLDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRP 640
           YGLDEPH  S++NEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRR+FESNRP
Sbjct: 301 YGLDEPHVNSTENEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRRFESNRP 360

Query: 641 RAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELST 700
           RAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELST
Sbjct: 361 RAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELST 420

Query: 701 GAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL 760
           GAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPAL
Sbjct: 421 GAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPAL 480

Query: 761 ISRFDLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFA 820
           ISRFDLMITFGLPDERNREEIA QYAKQLTKPEL EFARNT+G                 
Sbjct: 481 ISRFDLMITFGLPDERNREEIAVQYAKQLTKPELKEFARNTDG----------------- 540

Query: 821 CIFRMSGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGE 880
               MSGRDIRDICQQAERSWASK                         IIRGKVSKTGE
Sbjct: 541 ----MSGRDIRDICQQAERSWASK-------------------------IIRGKVSKTGE 592

Query: 881 HGILPPLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           HGILPP+EEYIECAM RRKALQTIDDHEIKD NIRTKKTQLA
Sbjct: 601 HGILPPIEEYIECAMKRRKALQTIDDHEIKDPNIRTKKTQLA 592

BLAST of HG10016220 vs. ExPASy Swiss-Prot
Match: P25694 (Cell division control protein 48 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=CDC48 PE=1 SV=3)

HSP 1 Score: 139.0 bits (349), Expect = 2.6e-31
Identity = 90/233 (38.63%), Positives = 122/233 (52.36%), Query Frame = 0

Query: 577 LDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRA 636
           ++   E ++ NE+ +D+I G  +Q  +I + + + L +P+LF  I            PR 
Sbjct: 199 INREDEENNMNEVGYDDIGGCRKQMAQIREMVELPLRHPQLFKAIG--------IKPPRG 258

Query: 637 VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA 696
           VL  GPPGTGKT  AR +AN+ G     +    +MSK  GESE  L K F  A E +  A
Sbjct: 259 VLMYGPPGTGKTLMARAVANETGAFFFLINGPEVMSKMAGESESNLRKAFEEA-EKNAPA 318

Query: 697 IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL-- 756
           IIF+DE+DS A  RD    E  RRV+S LL  +DG +    VVVIAATNR   +DPAL  
Sbjct: 319 IIFIDEIDSIAPKRDKTNGEVERRVVSQLLTLMDGMKARSNVVVIAATNRPNSIDPALRR 378

Query: 757 ISRFDLMITFGLPDERNREEIAAQYAKQL---TKPELSEFARNTEGDVVVTIS 805
             RFD  +  G+PD   R E+   + K +      +L   A  T G V   I+
Sbjct: 379 FGRFDREVDIGIPDATGRLEVLRIHTKNMKLADDVDLEALAAETHGYVGADIA 422

BLAST of HG10016220 vs. ExPASy Swiss-Prot
Match: Q5AWS6 (Cell division control protein 48 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) OX=227321 GN=cdc48 PE=1 SV=2)

HSP 1 Score: 138.7 bits (348), Expect = 3.4e-31
Identity = 89/223 (39.91%), Positives = 120/223 (53.81%), Query Frame = 0

Query: 582 EYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEG 641
           E ++ NE+ +D+I G  +Q  +I + + + L +P+LF  I            PR +L  G
Sbjct: 215 EENNLNEVGYDDIGGCRKQMAQIRELVELPLRHPQLFKSIG--------IKPPRGILMYG 274

Query: 642 PPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLD 701
           PPGTGKT  AR +AN+ G     +    IMSK  GESE  L K F  A E ++ AIIF+D
Sbjct: 275 PPGTGKTLMARAVANETGAFFFLINGPEIMSKMAGESESNLRKAFEEA-EKNSPAIIFID 334

Query: 702 EVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL--ISRFD 761
           E+DS A  R+    E  RRV+S LL  +DG +    VVV+AATNR   +DPAL    RFD
Sbjct: 335 EIDSIAPKREKTNGEVERRVVSQLLTLMDGMKARSNVVVMAATNRPNSIDPALRRFGRFD 394

Query: 762 LMITFGLPDERNREEIAAQYAKQLTKPE---LSEFARNTEGDV 800
             +  G+PD   R EI + + K +   E   L   A  T G V
Sbjct: 395 REVDIGIPDPTGRLEILSIHTKNMKLGEDVDLETIAAETHGYV 428
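The identity count in the summary line is the number of aligned columns where query and subject carry the same residue. As a rough illustration, the sketch below counts identities and gap columns for the final row of the Q5AWS6 alignment above; `alignment_counts` is a hypothetical helper, and the two strings are copied from that row.

```python
def alignment_counts(query, subject):
    """Count identical columns and gap columns in a pairwise alignment row.

    Both strings must have the same length, with '-' marking a gap.
    Returns (identities, gap_columns, aligned_length).
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    identities = 0
    gap_columns = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            gap_columns += 1
        elif q == s:
            identities += 1
    return identities, gap_columns, len(query)


# Last row of the HG10016220 vs. Q5AWS6 alignment (query 762 onward).
QUERY_ROW = "LMITFGLPDERNREEIAAQYAKQLTKPE---LSEFARNTEGDV"
SBJCT_ROW = "REVDIGIPDPTGRLEILSIHTKNMKLGEDVDLETIAAETHGYV"

identities, gap_columns, aligned_length = alignment_counts(QUERY_ROW, SBJCT_ROW)
print(identities, gap_columns, aligned_length)
```

Summing these per-row counts over all rows of an HSP should reproduce the numerator and denominator of the reported identity fraction, provided the rows were copied without truncation.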

BLAST of HG10016220 vs. ExPASy Swiss-Prot
Match: O28972 (Cell division cycle protein 48 homolog AF_1297 OS=Archaeoglobus fulgidus (strain ATCC 49558 / VC-16 / DSM 4304 / JCM 9628 / NBRC 100126) OX=224325 GN=AF_1297 PE=3 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 4.4e-31
Identity = 84/217 (38.71%), Positives = 124/217 (57.14%), Query Frame = 0

Query: 588 EISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEGPPGTGK 647
           ++++++I G  ++ R + + I + L +PELF        ++     P+ VL  GPPGTGK
Sbjct: 178 DVTYEDIGGLKRELRLVREMIELPLKHPELF--------QRLGIEPPKGVLLYGPPGTGK 237

Query: 648 TSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLDEVDSFA 707
           T  A+ +AN+     + +    IMSKYYGESE+ L ++F  A E +  +IIF+DE+DS A
Sbjct: 238 TLIAKAVANEVDAHFIPISGPEIMSKYYGESEQRLREIFEEAKE-NAPSIIFIDEIDSIA 297

Query: 708 ISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL--ISRFDLMITFG 767
             R+    E  RRV++ LL  +DG E    V+VIAATNR   +DPAL    RFD  I  G
Sbjct: 298 PKREEVTGEVERRVVAQLLALMDGLEARGDVIVIAATNRPDAIDPALRRPGRFDREIEIG 357

Query: 768 LPDERNREEIAAQYAKQLTKPE---LSEFARNTEGDV 800
           +PD+  R+EI   + +++   E   L E A  T G V
Sbjct: 358 VPDKEGRKEILEIHTRKMPLAEDVDLEELAELTNGFV 385

BLAST of HG10016220 vs. ExPASy Swiss-Prot
Match: Q9P3A7 (Cell division cycle protein 48 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=cdc48 PE=1 SV=2)

HSP 1 Score: 137.1 bits (344), Expect = 9.9e-31
Identity = 89/228 (39.04%), Positives = 121/228 (53.07%), Query Frame = 0

Query: 577 LDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRA 636
           ++   E SS  E+ +D+I G  +Q  +I + + + L +P+LF  I            PR 
Sbjct: 209 INREDEESSLAEVGYDDIGGCRRQMAQIRELVELPLRHPQLFKSIG--------IKPPRG 268

Query: 637 VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA 696
           +L  GPPGTGKT  AR +AN+ G     +    IMSK  GESE  L K F  A E ++ A
Sbjct: 269 ILMYGPPGTGKTLMARAVANETGAFFFLINGPEIMSKMAGESESNLRKAFEEA-EKNSPA 328

Query: 697 IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL-- 756
           IIF+DE+DS A  R+    E  RRV+S LL  +DG +    VVV+AATNR   +DPAL  
Sbjct: 329 IIFIDEIDSIAPKREKTNGEVERRVVSQLLTLMDGMKARSNVVVMAATNRPNSIDPALRR 388

Query: 757 ISRFDLMITFGLPDERNREEIAAQYAKQL---TKPELSEFARNTEGDV 800
             RFD  +  G+PD   R EI   + K +      +L + A  T G V
Sbjct: 389 FGRFDREVDVGIPDPTGRLEILRIHTKNMKLADDVDLEQIAAETHGYV 427

BLAST of HG10016220 vs. ExPASy Swiss-Prot
Match: O26824 (Proteasome-activating nucleotidase OS=Methanothermobacter thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM 10044 / NBRC 100330 / Delta H) OX=187420 GN=pan PE=3 SV=1)

HSP 1 Score: 136.7 bits (343), Expect = 1.3e-30
Identity = 98/322 (30.43%), Positives = 155/322 (48.14%), Query Frame = 0

Query: 543 SRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEPHEYSSKNEISWDNIAGYDQQKR 602
           SR  +RK  E   ++  ++++ S ++ +  +   +    E   K ++S++ I G ++Q R
Sbjct: 102 SRFIDRKQLEPGARVALNQQTFSIVDVLPSEKDPVVTGMEVEEKPDVSYEQIGGLEEQVR 161

Query: 603 EIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEGPPGTGKTSSARVIANQAGVPL 662
           E+++T+ + L  PELF+        K     P+ VL  GPPGTGKT  A+ +A++     
Sbjct: 162 EVKETVELPLKKPELFE--------KIGIEPPKGVLLYGPPGTGKTLLAKAVAHETNATF 221

Query: 663 VYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLDEVDSFAISR---DSEIHEATR 722
           + +     + KY GE  RL+  VF LA E S  +IIF+DE+D+ A  R    +      +
Sbjct: 222 IKIVASEFVRKYIGEGARLVRGVFELAKEKSP-SIIFIDEIDAVAAKRLKSSTSGDREVQ 281

Query: 723 RVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALI--SRFDLMITFGLPDERNREEIAA 782
           R L  LL ++DGFE    V ++AATNR   LDPAL+   RFD  I   LP+E  R EI  
Sbjct: 282 RTLMQLLAELDGFESRGNVGIVAATNRPDILDPALLRPGRFDRFIEVPLPNEDGRREILK 341

Query: 783 QYAKQLTKPE---LSEFARNTEGDVVVTISKSTVFLLRFACIFRMSGRDIRDICQQA--- 842
            +   +   E   +   AR T+G                      SG D++ IC +A   
Sbjct: 342 IHTSGMALAEEVDIELLARITDG---------------------ASGADLKAICTEAGMF 393

Query: 843 ----ERSWASKANYPSFISSIL 850
               ER   + A++   +  I+
Sbjct: 402 AIRDERDEVTMADFMDAVDKIM 393

BLAST of HG10016220 vs. ExPASy TrEMBL
Match: A0A1S3B059 (katanin p60 ATPase-containing subunit A1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103484638 PE=3 SV=1)

HSP 1 Score: 1029.2 bits (2660), Expect = 1.0e-296
Identity = 541/636 (85.06%), Positives = 557/636 (87.58%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSV  RI+RTARAWRIL SFKLKDS SASRS YLR F HQ E LKPG L NGR+PCFLYP
Sbjct: 1   MSVFNRISRTARAWRILASFKLKDSPSASRSFYLRQFGHQPEHLKPGLLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPPKDLSTHADME++AKKER RI EQL RNKGTK
Sbjct: 61  AIVAGMLGVGAMEIAYAEAEESAATPPPPKDLSTHADMEDVAKKERLRIIEQLNRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS 520
           WDSPVAWQLTL+RPK+QR AG NKGN IEMDAEDGDL+VLIFHSLVTSDKTEIEFIKQGS
Sbjct: 181 WDSPVAWQLTLSRPKSQRVAGENKGNSIEMDAEDGDLTVLIFHSLVTSDKTEIEFIKQGS 240

Query: 521 LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 580
           LSTKELDSLVSVLQLAGGRLGESRS ERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP
Sbjct: 241 LSTKELDSLVSVLQLAGGRLGESRSSERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 300

Query: 581 HEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE 640
           H  S++NEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRR+FESNRPRAVLFE
Sbjct: 301 HVNSTENEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRRFESNRPRAVLFE 360

Query: 641 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 700
           GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL
Sbjct: 361 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 420

Query: 701 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL 760
           DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPALISRFDL
Sbjct: 421 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPALISRFDL 480

Query: 761 MITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRMS 820
           MITFGLPDERNREEIA QYAKQLTKPEL EFARNT+G                     MS
Sbjct: 481 MITFGLPDERNREEIAVQYAKQLTKPELKEFARNTDG---------------------MS 540

Query: 821 GRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILPP 880
           GRDIRDICQQAERSWASK                         IIRGKVSKTGEHGILPP
Sbjct: 541 GRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHGILPP 586

Query: 881 LEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           +EEYIECAM RRKALQTIDDHEIKD NIRTKKTQLA
Sbjct: 601 IEEYIECAMKRRKALQTIDDHEIKDPNIRTKKTQLA 586

BLAST of HG10016220 vs. ExPASy TrEMBL
Match: A0A0A0KL52 (AAA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G512920 PE=3 SV=1)

HSP 1 Score: 1023.8 bits (2646), Expect = 4.3e-295
Identity = 538/636 (84.59%), Positives = 562/636 (88.36%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSVL RI+RTARAWRIL SFKLK+SSSAS+S YLRHF HQ E  KPGSL NGR+PCFLYP
Sbjct: 1   MSVLNRISRTARAWRILASFKLKESSSASQSFYLRHFGHQPEHFKPGSLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPP+DLSTHADME+IAKKER RITEQL RNKGTK
Sbjct: 61  AIVAGMVGVGAMEIAYAEAEESTSTPPPPRDLSTHADMEDIAKKERLRITEQLKRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEE+AGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEETAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQGS 520
           WDSPVAWQLTL+RPK+Q+EAG NKGN IEMDA+DGDL+VLIFHSL+TSDKTEIEFIKQGS
Sbjct: 181 WDSPVAWQLTLSRPKSQKEAGENKGNSIEMDADDGDLTVLIFHSLITSDKTEIEFIKQGS 240

Query: 521 LSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDEP 580
           LSTKELDSLVSVLQLAGGRLGESRSFERKS+E STQMPSSEKSISSLEAMGVKVYGLD P
Sbjct: 241 LSTKELDSLVSVLQLAGGRLGESRSFERKSKEESTQMPSSEKSISSLEAMGVKVYGLDGP 300

Query: 581 HEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFE 640
           H  S+KNEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRRKFESN+PRAVLFE
Sbjct: 301 HLNSTKNEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRKFESNKPRAVLFE 360

Query: 641 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFL 700
           GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLAN+LSTGAIIFL
Sbjct: 361 GPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANDLSTGAIIFL 420

Query: 701 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFDL 760
           DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPALISRFD+
Sbjct: 421 DEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPALISRFDM 480

Query: 761 MITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRMS 820
           MITFGLPDERNREEIAAQYAKQLT+PEL EFARNTEG                     MS
Sbjct: 481 MITFGLPDERNREEIAAQYAKQLTQPELKEFARNTEG---------------------MS 540

Query: 821 GRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILPP 880
           GRDIRDICQQAERSWASK                         IIRGKVSKTGEHGILPP
Sbjct: 541 GRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHGILPP 585

Query: 881 LEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           LEEYIECAM+RRKALQTIDDHEIKD N RTKKTQLA
Sbjct: 601 LEEYIECAMNRRKALQTIDDHEIKDPN-RTKKTQLA 585

BLAST of HG10016220 vs. ExPASy TrEMBL
Match: A0A1S3B076 (probable 26S protease regulatory subunit 10B isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484638 PE=3 SV=1)

HSP 1 Score: 1023.5 bits (2645), Expect = 5.6e-295
Identity = 541/640 (84.53%), Positives = 557/640 (87.03%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSV  RI+RTARAWRIL SFKLKDS SASRS YLR F HQ E LKPG L NGR+PCFLYP
Sbjct: 1   MSVFNRISRTARAWRILASFKLKDSPSASRSFYLRQFGHQPEHLKPGLLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPPKDLSTHADME++AKKER RI EQL RNKGTK
Sbjct: 61  AIVAGMLGVGAMEIAYAEAEESAATPPPPKDLSTHADMEDVAKKERLRIIEQLNRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKT----EIEFI 520
           WDSPVAWQLTL+RPK+QR AG NKGN IEMDAEDGDL+VLIFHSLVTSDKT    EIEFI
Sbjct: 181 WDSPVAWQLTLSRPKSQRVAGENKGNSIEMDAEDGDLTVLIFHSLVTSDKTVSANEIEFI 240

Query: 521 KQGSLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYG 580
           KQGSLSTKELDSLVSVLQLAGGRLGESRS ERKSREGSTQMPSSEKSISSLEAMGVKVYG
Sbjct: 241 KQGSLSTKELDSLVSVLQLAGGRLGESRSSERKSREGSTQMPSSEKSISSLEAMGVKVYG 300

Query: 581 LDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRA 640
           LDEPH  S++NEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRR+FESNRPRA
Sbjct: 301 LDEPHVNSTENEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRRFESNRPRA 360

Query: 641 VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA 700
           VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA
Sbjct: 361 VLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGA 420

Query: 701 IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALIS 760
           IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPALIS
Sbjct: 421 IIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPALIS 480

Query: 761 RFDLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACI 820
           RFDLMITFGLPDERNREEIA QYAKQLTKPEL EFARNT+G                   
Sbjct: 481 RFDLMITFGLPDERNREEIAVQYAKQLTKPELKEFARNTDG------------------- 540

Query: 821 FRMSGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHG 880
             MSGRDIRDICQQAERSWASK                         IIRGKVSKTGEHG
Sbjct: 541 --MSGRDIRDICQQAERSWASK-------------------------IIRGKVSKTGEHG 590

Query: 881 ILPPLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           ILPP+EEYIECAM RRKALQTIDDHEIKD NIRTKKTQLA
Sbjct: 601 ILPPIEEYIECAMKRRKALQTIDDHEIKDPNIRTKKTQLA 590

BLAST of HG10016220 vs. ExPASy TrEMBL
Match: A0A1S3AZS0 (katanin p60 ATPase-containing subunit A1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484638 PE=3 SV=1)

HSP 1 Score: 1022.7 bits (2643), Expect = 9.5e-295
Identity = 541/642 (84.27%), Positives = 557/642 (86.76%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSV  RI+RTARAWRIL SFKLKDS SASRS YLR F HQ E LKPG L NGR+PCFLYP
Sbjct: 1   MSVFNRISRTARAWRILASFKLKDSPSASRSFYLRQFGHQPEHLKPGLLSNGREPCFLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVTPPPPKDLSTHADMEEIAKKERQRITEQLTRNKGTK 400
           AI+AGM GVGAMEIAYAEAEE + TPPPPKDLSTHADME++AKKER RI EQL RNKGTK
Sbjct: 61  AIVAGMLGVGAMEIAYAEAEESAATPPPPKDLSTHADMEDVAKKERLRIIEQLNRNKGTK 120

Query: 401 YSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 460
           Y ACPRFTVGVKGQK    VSIKFQVP SCEVSHLIANLVSNLGLKVEESAGGSDMLLRA
Sbjct: 121 YGACPRFTVGVKGQK----VSIKFQVPPSCEVSHLIANLVSNLGLKVEESAGGSDMLLRA 180

Query: 461 WDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKT------EIE 520
           WDSPVAWQLTL+RPK+QR AG NKGN IEMDAEDGDL+VLIFHSLVTSDKT      EIE
Sbjct: 181 WDSPVAWQLTLSRPKSQRVAGENKGNSIEMDAEDGDLTVLIFHSLVTSDKTVSANVKEIE 240

Query: 521 FIKQGSLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKV 580
           FIKQGSLSTKELDSLVSVLQLAGGRLGESRS ERKSREGSTQMPSSEKSISSLEAMGVKV
Sbjct: 241 FIKQGSLSTKELDSLVSVLQLAGGRLGESRSSERKSREGSTQMPSSEKSISSLEAMGVKV 300

Query: 581 YGLDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRP 640
           YGLDEPH  S++NEISWDNIAGYDQQKREIED+ILMTLHNPELFDDIAHGTRR+FESNRP
Sbjct: 301 YGLDEPHVNSTENEISWDNIAGYDQQKREIEDSILMTLHNPELFDDIAHGTRRRFESNRP 360

Query: 641 RAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELST 700
           RAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELST
Sbjct: 361 RAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELST 420

Query: 701 GAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL 760
           GAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKV+VIAATNRKQDLDPAL
Sbjct: 421 GAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVIVIAATNRKQDLDPAL 480

Query: 761 ISRFDLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFA 820
           ISRFDLMITFGLPDERNREEIA QYAKQLTKPEL EFARNT+G                 
Sbjct: 481 ISRFDLMITFGLPDERNREEIAVQYAKQLTKPELKEFARNTDG----------------- 540

Query: 821 CIFRMSGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGE 880
               MSGRDIRDICQQAERSWASK                         IIRGKVSKTGE
Sbjct: 541 ----MSGRDIRDICQQAERSWASK-------------------------IIRGKVSKTGE 592

Query: 881 HGILPPLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           HGILPP+EEYIECAM RRKALQTIDDHEIKD NIRTKKTQLA
Sbjct: 601 HGILPPIEEYIECAMKRRKALQTIDDHEIKDPNIRTKKTQLA 592

BLAST of HG10016220 vs. ExPASy TrEMBL
Match: A0A6J1IQS3 (spermatogenesis-associated protein 5 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111477922 PE=3 SV=1)

HSP 1 Score: 1019.2 bits (2634), Expect = 1.1e-293
Identity = 533/637 (83.67%), Positives = 561/637 (88.07%), Query Frame = 0

Query: 281 MSVLERIARTARAWRILTSFKLKDSSSASRSLYLRHFRHQQERLKPGSLPNGRDPCFLYP 340
           MSVLERIART +AWRI+ SFKLKDSSS S SLYLRHFRHQ E LKPG L NGR+PC LYP
Sbjct: 1   MSVLERIARTVKAWRIIKSFKLKDSSSVSPSLYLRHFRHQPEHLKPGFLSNGREPCLLYP 60

Query: 341 AIIAGMFGVGAMEIAYAEAEEPSVT-PPPPKDLSTHADMEEIAKKERQRITEQLTRNKGT 400
           AI+A MFGVGAMEIAYAEAEE +VT PPPPKDLSTHAD+EEIAKKERQRI EQL RNKG 
Sbjct: 61  AIVASMFGVGAMEIAYAEAEESAVTPPPPPKDLSTHADLEEIAKKERQRINEQLIRNKGM 120

Query: 401 KYSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANLVSNLGLKVEESAGGSDMLLR 460
           KY ACPRFTVGVKGQK    VSIKFQVPS+CEVSHL+ANLVSNLGLKVEE AGGSDMLLR
Sbjct: 121 KYGACPRFTVGVKGQK----VSIKFQVPSNCEVSHLVANLVSNLGLKVEERAGGSDMLLR 180

Query: 461 AWDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSVLIFHSLVTSDKTEIEFIKQG 520
           AW SPVAWQL+LTRPKTQREAGGN+GN +E+DA DGDLSVLIFHSLV+SDKTEIEF+KQG
Sbjct: 181 AWASPVAWQLSLTRPKTQREAGGNEGNSLEIDAGDGDLSVLIFHSLVSSDKTEIEFLKQG 240

Query: 521 SLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPSSEKSISSLEAMGVKVYGLDE 580
           SLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQ+P  EKSISSLE+MGVKVYGLDE
Sbjct: 241 SLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQLPDPEKSISSLESMGVKVYGLDE 300

Query: 581 PHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLF 640
           PH  SSKNEISWDNIAGYDQQK EIEDTIL+TLHNPELFD+IAHGTRRKFESNRPRAVLF
Sbjct: 301 PHVNSSKNEISWDNIAGYDQQKSEIEDTILLTLHNPELFDEIAHGTRRKFESNRPRAVLF 360

Query: 641 EGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIF 700
           EGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLG+VFSLANELSTGAIIF
Sbjct: 361 EGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGQVFSLANELSTGAIIF 420

Query: 701 LDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFD 760
           LDEVDSFAI+RDSE+HEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFD
Sbjct: 421 LDEVDSFAIARDSEMHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALISRFD 480

Query: 761 LMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFRM 820
           LMITFGLPDERNR+EIAAQYAKQLTKPEL+EFA+NTEG                     M
Sbjct: 481 LMITFGLPDERNRQEIAAQYAKQLTKPELNEFAKNTEG---------------------M 540

Query: 821 SGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYNLSFWIIRGKVSKTGEHGILP 880
           SGRDIRDICQQAERSWASK                         I+RGKVSKTGEHGILP
Sbjct: 541 SGRDIRDICQQAERSWASK-------------------------ILRGKVSKTGEHGILP 587

Query: 881 PLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQLA 917
           PL+EYIECAMSRRKALQT+DDH+IKDSNIRTKKTQ A
Sbjct: 601 PLQEYIECAMSRRKALQTVDDHKIKDSNIRTKKTQFA 587

BLAST of HG10016220 vs. TAIR 10
Match: AT4G04180.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 611.3 bits (1575), Expect = 1.3e-174
Identity = 346/656 (52.74%), Positives = 450/656 (68.60%), Query Frame = 0

Query: 281 MSVLERIARTARAWR-ILTSFKLKDSSSASRSLYLRHFRHQ----------QERLKPGSL 340
           M++L+++ R +RAWR  L+  K    S + R+  LR   H            + L+  S 
Sbjct: 1   MAILDKLFRASRAWRGSLSHSKSMVPSQSPRARELRRCFHHGNFEQSNSKVNQVLRSCST 60

Query: 341 PNGRDPCFLYPAIIAGMFGVGAMEIAYAEAEE----PSVTPPPPKDLSTHADME------ 400
            N      + PA++  +F VG + +AYAE++E     S  P  P D ++ A ++      
Sbjct: 61  LNDSPYFSMAPAVLGALFSVGVIGVAYAESDEANNDKSSAPIDPNDETSSAPIDPPPNYV 120

Query: 401 EIAKKERQRITEQLTRNKGTKYSACPRFTVGVKGQKNEMQVSIKFQVPSSCEVSHLIANL 460
           +IAKKER R+ E+L ++KGT+Y + PRF V V+GQK    +++KFQVPS+CEV+ LI+N+
Sbjct: 121 DIAKKERARV-EELIQSKGTQYGSYPRFNVAVRGQK----ITLKFQVPSTCEVAQLISNI 180

Query: 461 VSNLGLKVEESAGGSDMLLRAWDSPVAWQLTLTRPKTQREAGGNKGNPIEMDAEDGDLSV 520
            S LG+KV +  GGSDMLLRAWD+PVAWQ+TL   + +++ G ++      D  D DL +
Sbjct: 181 GSQLGVKVSDRTGGSDMLLRAWDNPVAWQITLRSVENKKKLGESE------DDSDDDLCI 240

Query: 521 LIFHSLVTSDKTEIEFIKQGSLSTKELDSLVSVLQLAGGRLGESRSFERKSREGSTQMPS 580
           LIF SL+TSDK E+EFIK+GSL+T+EL++ VS L +AG + G+++        GST+  S
Sbjct: 241 LIFGSLLTSDKVEVEFIKKGSLTTEELEAFVSALGVAGTKAGQNKG---SGSRGSTRDSS 300

Query: 581 SEKSISSLEAMGVKVYGLDEPHEYSSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFD 640
           ++KSIS LE+MGV++YG+++P    S +EISWDNIAGYDQQKREIEDTILM LH+PE++D
Sbjct: 301 TDKSISQLESMGVRIYGVNKPLGDDSMDEISWDNIAGYDQQKREIEDTILMALHSPEVYD 360

Query: 641 DIAHGTRRKFESNRPRAVLFEGPPGTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESE 700
           DI  GTR KFESNRPRAVLFEGPPGTGKTS ARVIANQAG+PL+YVPLE +MSKYYGESE
Sbjct: 361 DIVRGTRSKFESNRPRAVLFEGPPGTGKTSCARVIANQAGIPLLYVPLEAVMSKYYGESE 420

Query: 701 RLLGKVFSLANELSTGAIIFLDEVDSFAISRDSEIHEATRRVLSVLLRQIDGFEQDRKVV 760
           RLLG VFS ANEL  GAIIFLDE+D+FAISRDSE+HEATRRVLSVLLRQIDGFEQ++KVV
Sbjct: 421 RLLGAVFSQANELPDGAIIFLDEIDAFAISRDSEMHEATRRVLSVLLRQIDGFEQEKKVV 480

Query: 761 VIAATNRKQDLDPALISRFDLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDV 820
           VIAATNRKQDLDPALISRFD MI F LPD + R+EI AQYAKQL+KPEL + A+ TE   
Sbjct: 481 VIAATNRKQDLDPALISRFDSMIMFDLPDLQTRQEIIAQYAKQLSKPELVQLAQATEA-- 540

Query: 821 VVTISKSTVFLLRFACIFRMSGRDIRDICQQAERSWASKANYPSFISSILEFLFGGHKYN 880
                              MSGRDIRD+CQ AER+WASK                 + Y 
Sbjct: 541 -------------------MSGRDIRDVCQGAERTWASKII---------------NLYI 600

Query: 881 LSFWIIRGKVSKTGEHGILPPLEEYIECAMSRRKALQTIDDHEIKDSNIRTKKTQL 916
           +   I R K     +   LPP++EY+E A +RRK+L+++ + + +    R+KK  L
Sbjct: 601 VGQLIRRAKAGGEEQKITLPPIQEYLESAEARRKSLRSVTEQKEQKFAARSKKPLL 606

BLAST of HG10016220 vs. TAIR 10
Match: AT4G22190.1 (unknown protein; Has 283 Blast hits to 154 proteins in 44 species: Archae - 0; Bacteria - 2; Metazoa - 24; Fungi - 12; Plants - 48; Viruses - 0; Other Eukaryotes - 197 (source: NCBI BLink). )

HSP 1 Score: 214.9 bits (546), Expect = 2.6e-55
Identity = 158/290 (54.48%), Positives = 186/290 (64.14%), Query Frame = 0

Query: 9   QSQPLSR---PTGA--RRSGSFPTSPEFEFWMVRNPSFPQ--PNLLSADELFVDGVLLPL 68
           +S+PL     P G+  RRS      PEFEFW + N SFPQ   +LLSADELF DGVLLPL
Sbjct: 56  RSKPLPETLSPCGSQRRRSSCDSNPPEFEFWRLTNSSFPQADSDLLSADELFHDGVLLPL 115

Query: 69  HLVSNHSPSQSTDPNQKSDLEPPPSEPDPSDGPKLTPNSAD----SGSSLTS----SKRW 128
            L+S  S  QS DPN  ++ +P PS   PS G  +T   +D     GS LT     SKRW
Sbjct: 116 DLLSVKSELQS-DPN-IAECDPDPS---PSTGSLITEQKSDLEPGLGSELTRETTVSKRW 175

Query: 129 -SIFKKSEKKNVTGNQEDRDKEKKKEKKTGNGSTS-----AELNINIWPFSRSRSAGNAF 188
             IF+KSE K   G +E   + KK++KKTG+G +S     AELNINIWPFSRSRSAGN  
Sbjct: 176 RDIFRKSETK-PPGKKEKVKENKKEKKKTGSGPSSGSGSGAELNINIWPFSRSRSAGNNV 235

Query: 189 TRPKMFPGGQPGSRKVNSAPCSRSNSAGESKSRKWPSSPSRAGVHLGRSSPVWQVRRGGS 248
           TRP+M   G P +RKV+SAPCSRSNS GESKSRKWPSSPSR GVHLGR+SPVWQVRRGG 
Sbjct: 236 TRPRM-SFGAPTTRKVSSAPCSRSNSTGESKSRKWPSSPSRNGVHLGRNSPVWQVRRGGG 295

Query: 249 APKTSENLSRNAEKPARKEPTDAHRSKAVAASSSASRVRVLNLNVPMCIG 278
           AP              ++E  +  + K V  S+ A   +VLNLNVPMCIG
Sbjct: 296 APVGKTIPEPMGRVVGKREIPETRKGKTVIESNKA---KVLNLNVPMCIG 335

BLAST of HG10016220 vs. TAIR 10
Match: AT5G03340.1 (ATPase, AAA-type, CDC48 protein )

HSP 1 Score: 130.2 bits (326), Expect = 8.6e-30
Identity = 80/218 (36.70%), Positives = 117/218 (53.67%), Query Frame = 0

Query: 587 NEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEGPPGTG 646
           +E+ +D++ G  +Q  +I + + + L +P+LF  I            P+ +L  GPPG+G
Sbjct: 202 DEVGYDDVGGVRKQMAQIRELVELPLRHPQLFKSIG--------VKPPKGILLYGPPGSG 261

Query: 647 KTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLDEVDSF 706
           KT  AR +AN+ G     +    IMSK  GESE  L K F  A E +  +IIF+DE+DS 
Sbjct: 262 KTLIARAVANETGAFFFCINGPEIMSKLAGESESNLRKAFEEA-EKNAPSIIFIDEIDSI 321

Query: 707 AISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL--ISRFDLMITF 766
           A  R+    E  RR++S LL  +DG +    V+V+ ATNR   +DPAL    RFD  I  
Sbjct: 322 APKREKTNGEVERRIVSQLLTLMDGLKSRAHVIVMGATNRPNSIDPALRRFGRFDREIDI 381

Query: 767 GLPDERNREEIAAQYAKQLTKPE---LSEFARNTEGDV 800
           G+PDE  R E+   + K +   E   L   +++T G V
Sbjct: 382 GVPDEIGRLEVLRIHTKNMKLAEDVDLERISKDTHGYV 410

BLAST of HG10016220 vs. TAIR 10
Match: AT5G58290.1 (regulatory particle triple-A ATPase 3 )

HSP 1 Score: 129.4 bits (324), Expect = 1.5e-29
Identity = 93/263 (35.36%), Positives = 133/263 (50.57%), Query Frame = 0

Query: 584 SSKNEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEGPP 643
           S K ++S+++I G D QK+EI + + + L + EL+  I          + PR VL  GPP
Sbjct: 147 SEKPDVSYNDIGGCDIQKQEIREAVELPLTHHELYKQIG--------IDPPRGVLLYGPP 206

Query: 644 GTGKTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLDEV 703
           GTGKT  A+ +AN      + V     + KY GE  R++  VF LA E +  AIIF+DEV
Sbjct: 207 GTGKTMLAKAVANHTTAAFIRVVGSEFVQKYLGEGPRMVRDVFRLAKE-NAPAIIFIDEV 266

Query: 704 DSFAISR---DSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPALI--SRF 763
           D+ A +R    +      +R+L  LL Q+DGF+Q   V VI ATNR   LDPAL+   R 
Sbjct: 267 DAIATARFDAQTGADREVQRILMELLNQMDGFDQTVNVKVIMATNRADTLDPALLRPGRL 326

Query: 764 DLMITFGLPDERNREEIAAQYAKQLTKPELSEFARNTEGDVVVTISKSTVFLLRFACIFR 823
           D  I F LPD R +  +   +    +K  LS+     E D+   +S+            +
Sbjct: 327 DRKIEFPLPDRRQKRLV---FQVCTSKMNLSD-----EVDLEDYVSRPD----------K 382

Query: 824 MSGRDIRDICQQAERSWASKANY 842
           +S  +I  ICQ+A      K  Y
Sbjct: 387 ISAAEIAAICQEAGMHAVRKNRY 382

BLAST of HG10016220 vs. TAIR 10
Match: AT3G09840.1 (cell division cycle 48 )

HSP 1 Score: 129.0 bits (323), Expect = 1.9e-29
Identity = 79/218 (36.24%), Positives = 117/218 (53.67%), Query Frame = 0

Query: 587 NEISWDNIAGYDQQKREIEDTILMTLHNPELFDDIAHGTRRKFESNRPRAVLFEGPPGTG 646
           +++ +D++ G  +Q  +I + + + L +P+LF  I            P+ +L  GPPG+G
Sbjct: 202 DDVGYDDVGGVRKQMAQIRELVELPLRHPQLFKSIG--------VKPPKGILLYGPPGSG 261

Query: 647 KTSSARVIANQAGVPLVYVPLEVIMSKYYGESERLLGKVFSLANELSTGAIIFLDEVDSF 706
           KT  AR +AN+ G     +    IMSK  GESE  L K F  A E +  +IIF+DE+DS 
Sbjct: 262 KTLIARAVANETGAFFFCINGPEIMSKLAGESESNLRKAFEEA-EKNAPSIIFIDEIDSI 321

Query: 707 AISRDSEIHEATRRVLSVLLRQIDGFEQDRKVVVIAATNRKQDLDPAL--ISRFDLMITF 766
           A  R+    E  RR++S LL  +DG +    V+V+ ATNR   +DPAL    RFD  I  
Sbjct: 322 APKREKTNGEVERRIVSQLLTLMDGLKSRAHVIVMGATNRPNSIDPALRRFGRFDREIDI 381

Query: 767 GLPDERNREEIAAQYAKQLTKPE---LSEFARNTEGDV 800
           G+PDE  R E+   + K +   E   L   +++T G V
Sbjct: 382 GVPDEIGRLEVLRIHTKNMKLAEDVDLERISKDTHGYV 410

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038882484.1	1.4e-308	88.84	26S proteasome subunit RPT4 isoform X1 [Benincasa hispida]
XP_008440044.1	2.1e-296	85.06	PREDICTED: katanin p60 ATPase-containing subunit A1 isoform X3 [Cucumis melo]
XP_004134780.1	8.8e-295	84.59	ATPase family AAA domain-containing protein 1-B isoform X1 [Cucumis sativus] >KG...
XP_008440043.1	1.2e-294	84.53	PREDICTED: probable 26S protease regulatory subunit 10B isoform X2 [Cucumis melo...
XP_008440042.1	2.0e-294	84.27	PREDICTED: katanin p60 ATPase-containing subunit A1 isoform X1 [Cucumis melo]

Match Name	E-value	Identity	Description
P25694	2.6e-31	38.63	Cell division control protein 48 OS=Saccharomyces cerevisiae (strain ATCC 204508...
Q5AWS6	3.4e-31	39.91	Cell division control protein 48 OS=Emericella nidulans (strain FGSC A4 / ATCC 3...
O28972	4.4e-31	38.71	Cell division cycle protein 48 homolog AF_1297 OS=Archaeoglobus fulgidus (strain...
Q9P3A7	9.9e-31	39.04	Cell division cycle protein 48 OS=Schizosaccharomyces pombe (strain 972 / ATCC 2...
O26824	1.3e-30	30.43	Proteasome-activating nucleotidase OS=Methanothermobacter thermautotrophicus (st...

Match Name	E-value	Identity	Description
A0A1S3B059	1.0e-296	85.06	katanin p60 ATPase-containing subunit A1 isoform X3 OS=Cucumis melo OX=3656 GN=L...
A0A0A0KL52	4.3e-295	84.59	AAA domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G512920 PE=3 SV...
A0A1S3B076	5.6e-295	84.53	probable 26S protease regulatory subunit 10B isoform X2 OS=Cucumis melo OX=3656 ...
A0A1S3AZS0	9.5e-295	84.27	katanin p60 ATPase-containing subunit A1 isoform X1 OS=Cucumis melo OX=3656 GN=L...
A0A6J1IQS3	1.1e-293	83.67	spermatogenesis-associated protein 5 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

Match Name	E-value	Identity	Description
AT4G04180.1	1.3e-174	52.74	P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT4G22190.1	2.6e-55	54.48	unknown protein; Has 283 Blast hits to 154 proteins in 44 species: Archae - 0; B...
AT5G03340.1	8.6e-30	36.70	ATPase, AAA-type, CDC48 protein
AT5G58290.1	1.5e-29	35.36	regulatory particle triple-A ATPase 3
AT3G09840.1	1.9e-29	36.24	cell division cycle 48
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003593	AAA+ ATPase domain	SMART	SM00382	AAA_5	coord: 633..768; e-value: 5.3E-16; score: 69.2
IPR027417	P-loop containing nucleoside triphosphate hydrolase	GENE3D	3.40.50.300	-	coord: 580..787; e-value: 8.0E-60; score: 203.7
IPR027417	P-loop containing nucleoside triphosphate hydrolase	SUPERFAMILY	52540	P-loop containing nucleoside triphosphate hydrolases	coord: 588..834
IPR003959	ATPase, AAA-type, core	PFAM	PF00004	AAA	coord: 637..765; e-value: 6.8E-35; score: 120.4
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 237..252
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 96..113
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 7..24
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..29
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 64..258
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 64..80
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 182..204
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 543..564
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 114..143
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 144..165
None	No IPR available	PANTHER	PTHR23073:SF82	P-LOOP CONTAINING NUCLEOSIDE TRIPHOSPHATE HYDROLASES SUPERFAMILY PROTEIN	coord: 818..898; coord: 348..802
None	No IPR available	PANTHER	PTHR23073	26S PROTEASOME REGULATORY SUBUNIT	coord: 818..898; coord: 348..802
None	No IPR available	CDD	cd00009	AAA	coord: 630..766; e-value: 7.70193E-23; score: 93.7499
IPR003960	ATPase, AAA-type, conserved site	PROSITE	PS00674	AAA	coord: 738..757
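Several of the InterPro rows describe the same AAA ATPase region at slightly different boundaries. The sketch below parses the `coord:` fields of five rows and checks which spans nest inside which. It assumes the coordinates are 1-based and inclusive, which this record does not state; `ALIGNMENTS`, `parse_span`, `span_length`, and `contains` are illustrative names only.

```python
import re

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

# Coordinate strings copied from the InterPro rows above.
# The dictionary keys are illustrative labels (source plus source term).
ALIGNMENTS = {
    "SMART SM00382": "coord: 633..768",
    "GENE3D 3.40.50.300": "coord: 580..787",
    "PFAM PF00004": "coord: 637..765",
    "CDD cd00009": "coord: 630..766",
    "PROSITE PS00674": "coord: 738..757",
}


def parse_span(text):
    """Return (start, end) from a 'coord: start..end' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Length of a span, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


SPANS = {name: parse_span(text) for name, text in ALIGNMENTS.items()}

for name, span in SPANS.items():
    print(name, span, span_length(span))

# The PROSITE conserved site should fall inside the Pfam AAA core,
# which in turn should fall inside the broader Gene3D P-loop fold.
print(contains(SPANS["PFAM PF00004"], SPANS["PROSITE PS00674"]))
print(contains(SPANS["GENE3D 3.40.50.300"], SPANS["PFAM PF00004"]))
```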

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10016220.1	HG10016220.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:1901800	positive regulation of proteasomal protein catabolic process
biological_process	GO:0045899	positive regulation of RNA polymerase II transcription preinitiation complex assembly
biological_process	GO:0030433	ubiquitin-dependent ERAD pathway
cellular_component	GO:0031597	cytosolic proteasome complex
cellular_component	GO:0008540	proteasome regulatory particle, base subcomplex
molecular_function	GO:0005524	ATP binding
molecular_function	GO:0016887	ATP hydrolysis activity
molecular_function	GO:0008233	peptidase activity
molecular_function	GO:0036402	proteasome-activating activity
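For downstream use, the GO rows are often grouped by category. A minimal sketch follows, assuming each row is whitespace-separated with the category first, the accession second, and the term name as the remainder; `GO_ROWS` and `group_go_terms` are hypothetical names.

```python
from collections import defaultdict

# GO rows copied from the table above (header row omitted).
GO_ROWS = """\
biological_process GO:1901800 positive regulation of proteasomal protein catabolic process
biological_process GO:0045899 positive regulation of RNA polymerase II transcription preinitiation complex assembly
biological_process GO:0030433 ubiquitin-dependent ERAD pathway
cellular_component GO:0031597 cytosolic proteasome complex
cellular_component GO:0008540 proteasome regulatory particle, base subcomplex
molecular_function GO:0005524 ATP binding
molecular_function GO:0016887 ATP hydrolysis activity
molecular_function GO:0008233 peptidase activity
molecular_function GO:0036402 proteasome-activating activity
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two runs of whitespace only, so the term
        # name keeps its internal spaces.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)


GO_BY_CATEGORY = group_go_terms(GO_ROWS)
for category, terms in GO_BY_CATEGORY.items():
    print(category, len(terms))
```

Because the split is on any whitespace, the same function works whether the rows are space-separated or tab-separated.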