HG10002437 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10002437
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProteasome subunit alpha
LocationChr11: 6746965 .. 6757425 (+)
RNA-Seq ExpressionHG10002437
SyntenyHG10002437
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTCGTGGAAGTGGAGGCGGATACGATCGTCACATCACAATTTTCTCTCCCGAAGGCCGTCTATTTCAAGTTGGTACGTATATTTTCTCTCACAATACCTTCATCGTATCTTACTTCTTGCTGTCGCAGTGATTTGTTGGCCTTTTAGATTCTTTTTTTTTTTTGTGGCGTTCATGGTTTTTGAGAGAGATTTTTGAATGATTACTTCGCGTTGCTCTTCGAAATGAGCTCGACTCCCTTTCGAATTGCTAACGAATTTGGGGGATGGAAGTACAGGATGGATCGAATTTGATCGCGACAAGTTAATTGCAACTCTTGTATGTGGAATGGAGTTCAATTTGAGATGGCTAGAGTCTTTGGATTGAATTATAGCCATTTGTTTGCGTTCTTCTTTTTCTAGTGAGAAATGCGAATTCGAAGCTCATTTTGTTTAAGAGACTCAAACTCTACAACTACAGAATATTTGAGCTGTTTCCTTTCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTGAAGTTTTACATGGAGCGTCTTAATCCATCTTGCTCTTTGAATTGTCTCTATACTCCATTATGTATTTCTTTTTCATTGTTTATGCACAGTTATACCTAGGTTTGGGAGATTTGGATGTTGTCAATTTCCACTAGACCAAACGTTTAAGCTATTAACTTAGGATACATTCATTTATTTATTTATATTTTTGGCTCGTAAATGAGTTCATTTCATGATTTCTAATCCATGGAAACCTGGAACTTTGTTCTATGATTCCATGTTAAATAACCCTACGTTTCGTCAAAGCCTATGCAGTTTAGTATAGATAAATTTCATCATTCATTATATTCTCAGTAGATGTCAATCTAATCTTTCTCTAGTGATCTTTCAGTCAGTAAGGTTAACCTATCAGTTTAGCGATCATAGTGTGTTCTCATAACTCTAGTTGAATTTTCAGTTTTTATTTTTGTAATGGAAAACAGAATATGCATTTAAGGCTGTCAAGGCTGCTGGAATTACTTCGATTGGTGTTCAGGGTAAAAATTCAGTTTGTGTCGTGACCCAAAAGAAGGTTCCGGTGAGTACAATTGGTTTGACTCTGAGAGGACTTTGCATCTCTATTAGTAATATAATCTATGGGCGATGCTGAATGTGAATTTGAAAATTCATTGCAGGATAAGCTTCTGGATCAATCAAGTGTCTCACATCTTTTCCCTATAACAAAGTACCTAGGGTTGTTAGCAACAGGCATGACAGGTATTTCTCTTGGCCTTTTCAAATTTACTTACAGATATGTTGTATTTATAGGAGGCGACTTATTTGTTCATCATTCCAGCTACGCAACTGAAATTAGTTAAGGCATGCAGCATATGCTATATAGGATTGCATCTAGCTAATGACGAATCATGATGTGGCATGGTATTGCGATGTACAGATATGATCCCTATACATGGTGTTGCAATAGCAGTACCCTTTCCTAGATATTGCTTAGTTATTTAGTTGAGAAATTTAACAGAATCAGCTGTTCTTCCATCCAGAAGAAAAAATTAAACAAACTGATATGGTTAATTTGGACACATGTATGACTTTAAACTATTTGTTGATACACACTTTATATTCACACTCGACTATTAATTTGTTTTTCAGTCAAATGTATTAATTGACCCTAGATTTTAGTGCTTGTCTCATTGGCGGTCTTTCCATGATTTTTTGGGACGAGTGGTCACTGATATGATTTTTATTCTGTTTTTAGCTGATGCCAGAACCTTAGTTCAGCAAGCGAGGAATGAAGCAGCCGAGTTTCGTTTTAGATATGGATATGAGATGCCGGTGGATTTATTAGCTAAATGGTATCATATTTGTTTCATGCTATACCAAGACATTATTGTTTTAAGTATATACCACACAAGCAAACAATTTGTTCAACCATATGAATTATGGGTTGTAGAGCAAAGAATAGCTCTACATGTTGCTTACTGACCTTAGTTTATCATTCCATTATCCATAATGTTGGTCTAGATATTGTCAATTTCTGCACAATCACAACATAGCAGCTTTTTGCATGCATCTCTTTTCCATCATGGCTGTTGCTTTCAAATATCTTCCGCTGACATACTGATATACATTATGTAGTAATGGCAATTATGAGTAATTATGTACCATGGGAATTAAACCTTTGGCCTCTTTGACCTTCTTGTGCTGGATTGTTGGTTGGCAGGATAGCTGATAAATCACAAATCTATACTCAGCATGCTTATATGAGATCACTTGGAGTAGGTATGTTTTCCCCAGGTTTTGGTGTTGCATTATATTTTAAATTTCAACTTTTGTTATGGACAATAAAAGTGATCTATCTTTTTTAATCTTTTGCTCCTTTCATTCGTCTGCATCTAGTTGCCATGGTTTTAGGCATTGATGATGAGTTTGGACCTCGCCTCTACAAATGTGATCCAGCTGGTCATTTCTTTGGTCACAAGGTTTATATAGTTAATCTATTTACTCTTATGGTCAATATTTTAGGAAGGCATATCATATTGAAATATTTACATTTTTCTATTCCAGCTGCCAACGTGGAATAGTTAATTATCAATCATTGTGAGATTAAGTTTCTTCTACTCTAACTACCGCCAGTTCTCTGTGAGCTGAAAACATTTTATAAGCATCAAATTAATGTCAATGTATTGTTTCTTTATAGGTCCTCTTTTTCTTTTCTTTTTTTTTTTTTTTTTAATTTTTTAATAATAAAAAAAAGGAATAAAGTCAATGTATTGTTGCACACTCTTGCTCCAACTGCAATAGGCGCTGAAGCTTGTTATTACTTTCTCATTTATATATAATACAATTATAGCATCAGACATCATGAGTGTACCTTCGCTGGTCTCTCAAAGATATTCATGTATAGAGTGCATCTTTTTAGTTTCCTTCTTGAATCTCATTGAAAATTTCCGTTGGTTTTTTTGAATTTGGTATTTCATCACTATGCTTGAGACGTTTTTTTTTTCCCAATGTGTAGGCTACGAGTGCTGGTTTAAAAGAGCAGGAATCAATCAATTTCTTGGAAAAGAAAATGAAGAACGATCCAGAGTTCACCTATGAAGAGACAGTACAGGTCGATGTGCTTTCTTTCAAACTTTACATGACACACAGCAAAACCTAAAAATTATAATGAGTTAGGCTTTTTACATGCGCATGGATGCTTGTCTTTACTTTTGTGTATATCCCTCCTCCAATATTGTCGTATTTCCATTTGCAGACTGCAATTTCAGCCCTCCAATCGGTTCTGCAGGAGGACTTCAAGGCAAATGAGATTGAGGTTCTGATACTTTTACTTGTCTTCTTCTTCTTCTTCTTCTTTCTTTCTTTCTTTTTCTTATTTTTTTTTTTAAAATAATTATACTAGAGTGGTAATGTGACGGGAGTTCTATACCACTATCGATTTATTAATGAAGGGTTTGATATTTGAATTGGGGAGAAGTCTTCTGTTTCCTTTAATGTCAAGGAAATGCCAAAAAGACGTTGGAAATGTTAAAAAAAATAAAAAGGTTACAATGGTTGACCATATAAATACAAATACAATACAAATAATAATGATAAAGAACTCAGTTGCGGAGTCGTTAGTGGTTTAAAACTAATCTAATCAATTTTGACAAGCTTGTATGATTTTTGCGATATATTTTTTCATATCTTGCTTTAACTCCTAGATCTAAACCATAGTTTAAGGTAATTTTTACTAATTTGAACGTGAACTGTTTCACTGATTTCTTCTATTAGCTCCAATTATCACTCTGACTTTGTAAGATCGTATAAAGGACATATATTAATTACTTGGCATTCCAAATATTTCTCAGTAGTAGGCTAGTAAATTGTTGTACAAACATTAGAGTAAACTATTGTTGAATATTTTGACACGAACAGGTTGGAGTGGTGAGGCAAGAAAATCCTGTGTTTAGAGTGTTGACGACCGAGGAGATCGACGAACATTTAACTGCAATAAGTGAACGTGACTGAATGGAGAGGCATGTTGGAGGCCAAACCATGAGGAGGAATGGTATCGCTTAAATGTTGTTGAAGGCTTTGTAGTATCAGATCATTGCTTGTTTAGAGCATCAAAAAAGGCACACTCACATTATCTGGAAGAAATGCTGCCTGCCTTCTTCCCCTTCCGCCCCCTTTTCCTTTACTTTGCTTCCCATTGTAGCTGTAGATGATCTTCTGAACACAATGTCCGTTCTCCAAGACACAATTGTACAACCTCTAGAACAGATCCTATTTAAGTAAAAGTCGTAACACAATACCCGTTCTGCTGTTGTTGTATTTTTTATGTTTTGGATTTCTTTTTAATATCTGTTGCGTTGCCCATCCAATATAATTATTATTATGCCCCCAGAATTTTATTTATCATATTCTTTTATGTTTGTCACTATTTAAAATACATATTTCTTCTATCTAAAGGTAAGTATAGTTTATTGAAAGGTAATTGTTATATACTGTTTTTCTTGAAGTTAAAAATTTAAATCTTTACGATCCACTTCTCTTCAGAAGAAGGAAAAGAAAAATATATTTCATTGCTACCAATATGTTCCTTGAATATATGCATAATTTTTTTAAAATATTATCGATCAAACATACCATGATCATGTGGACCATTTGGAAACTTGGGAATACATGCAACTTTTTTTTTTTTAATCATCCATTAGTTTGGATTAATCATTTCTTAAGAAGGCAAATTTAGACATGTTCTTCTACTTTGGTTTTTTTTTTTTCTTTTTTTGAAATCATAGACATATTCTTCTACTTACGAAAATAAATTTCTTAAAAAGACATTTTTTTGACACATTCTTCTATTTATGATAATAATAGCATATGGATGCAAATAATTTTGTAGTATTATATATATAAGGAAACCATATTTACATTGAACTAAAAAGAAATTAAGAGGCCATCTGGGACAATAACTAGAAGTCGGGTGGAGTTGGGACACCAAATATAATAGTTATGCATATAATTATTTATAACCTATAATTAAGTATAGGAAATACTATTTCAAAATCCTCTTGCTACTTACTATTTTTTATTCATCCTCTTTTTTCTTCTTTACTATAGTGTTTACTATTTCTTATTCAAACAAAAATAGTTTGTACCTCAAACACAAACATTGTAATCCACGAACTATAATAATGACTAACTATGTTAATCAGTGCCTCAAACATTTTCGATATTTACAATCATAAGCTTATTACATACCTTCGAATGGATGGGTTATAATCATGAGTATAAATATATTATTGCATGTCTTAATATTCGGGTTTGTGTTTTTTGGATGAAGAATGGGGGCAGATTTAGTATAGGCCCCCTCTCAACTTTATGCTTTTTATATAATATATATAAAAAAATTAATAAAATATCAATACTATTAATTAATCTAGTGGTAAATTTGTCTTTCATCCCTCTTTAGACCTAGGTTTAATTTCCATTTTTTACATTAAATTTAAAACTCACGTAAAAAAAAAAGAAAAAAATATATAGTTTTGTAGCACTGAATTTACTAGAAACGTTCTCCTGTAAAGTCGGTTCCATTGATAATGGAATTGCCGCGTGTAATTTTAGAAAAAAAGGAAGAAAAAATTGGCGCGTGTAATTGAGAAAATATAGTTACGAAATTAAGCCTAAGAAAGTACCAATTTTGCTTCGTCGCGAATCAATTAAAACTGAACTGCTGTGGAGAAGCAGCGATCGAAACCCCAAAAACCTCAATTCTTCTACAGAGCATAAACTCGGAAATCGTTGGCGACGCCTATTCTTTTCTTACGAACTCCGGCGCACAGTCGTAGCTTTTCGGGTACGCTTTTTTTTCTTGTCTTTTGTTCCTTCAATGTTCTTCAATTTCGCTAGTTGATTCCAGAATTTTCGATCCTTTTTCGTGTTCTCTGTTTCGTACTGGACAGTTGTGTAATTGAACTCTCCTCATATATGCTCTTTCTGCTAGACTTTCATACGTCTTCGCGGTTTCTGTTTGGATATAGGCATTATGTTGGTTGATTGAACTCATACTTTTGGATTGCATCGATAATATGGGAAGAGAAGAGAGCATGTTCTTGGGTAATGGGTATCCATTTTTAGGAAATTTTACTCCTACTGGATGCTAATTATTGTTCGAAAGCAGAATAGAAGTTGAAAGCTGTAGTTTTGAACTTTTGTATCATCGACTAAACCTGTTGCATGTGTGTGTCAGGGGACTGAGAAGTTTCTGAAATTTAGGGGCTTTTCTTCTTTAAACAGATTATTTTACCTTGAATTTTACTGCTTTCTTGCTCTAGGGATTACAAGTTCATGGCAACTGCAACTATGGCCACGGCTGCTGGTGCAGCTGCACTTCTATATTATACATTGAACCGTAAGTTGCATTCAAGTGGAGATGACGATGATGGGGATGTAGATGGTAATGATGCTCCCAGTCATGCTCTTCTGGGAGGTGATCGAGTTTCTCATAGACTGATTCAAGCGCCTGCTACATGGCTTGAAACAATTTCAACATTATCTGAGACCTTAAGGTTTACATACTCGGAGACACTTGGAAAGTGGCCCATTGGGGATTTGGCATTTGGAATTAATTTTCTCTTAAAGAGACAGGTATGGAGCTTTTTGATAGTTGTAGATTTTTAAGGGATTAATGTGTAGCCCATTGTATGTTTTAAGCTGGGTTGGTTCTTGCTTTCACTGTCTCCCTTCAAAACTGAAGGTACTGATATCGATCTGTTTCCTTAAAGGTGTTACTACTTTAAAGTGTGAGAGTGAAAGTTTTATCTCTCTTTCACTTTGGGTTCACGTGATGTAGTGGTCTTCACTTCATTTGATTTTTCATTCTCCTAACATGCATAAAAACCTGTATGATGTGTAATTATGCTCTTGTAGAATTTTATTGTTGTACCTTTCATATATCCACCCAAAAGAAAGGTCTAAATTCTTCTATTAGTGGCTTCTGAACAGATTTTTTTTTTTATATAATCTAATTTTGGTCTATTTATATGCATTTTTGGAAGAAGAAAAAAGGTGAACTGGTTATATTATCTTTGTATACCCAGGGTCTTGGAGTTTTTGGTTTTGAGGGAAAAGAGAGGATGGTTTATTGATTTTCACTTGGAAGGCTTATTTTATGGTGGTTGGCAAGGCCTACTTTATTGGGTTTGGAGAAAGACTTTTGAAAAAATGGAGTTTCAACTTCCCGAAATTAGGAGTTCATTCTTCAAAATTTTCACGTTTCTTCTAGTGGGTAGAGTATGGTGGTTTTATCATGTATTTATTCATAGGATTATTCAGTAACGTTAGTCTGGTTTCATTGGTGGTTTTTAATATATGTATTATGGAGGAAGGTAGAAATGATCAATTCCATATGAAAGTCTCGATTGGGATGTAGTTTGTAGAATTTGGAGGTTGGAAATTTTTTTTTTAGGGTTGAAGTAGTAGTTCTAAAGGGGATGCTGTTGTAATTGTCTTGAAGGGATTTTTCTCAGGTACAATGTTCTAATTTTAGTACTTTAAAGGAAGATTTTTTTCTTTCTTTACTACAACTTTTCAGACATCAATAGGAAGTTCTTGTTTTCTTTAAATAGAAAAAAAAAAAACCCTCAATAGAAGATGATAATGTTTGATCTGAACAATCTAGGTTAATTGCTAATGCTGCTACATATCTTATCGGTTTGGCCTTTGATGGGATGTGCTTATTGTGCAGGGAAACTTACATGTAGGCAGTGTCTTTGGTAATGAAGACAGTATTCAGCTTAAGGGGACTGAAATGATTACTGAGCTGAAATACCTCTTGCACTTGTTGACTTTGTGTTGGCATTTCTCAAAAAAACCATTTCCGCTGTTTCTAGAAGAAACTGGTTTTTCCAAGGAGAATATTCTCCTTCAGGAACCAAAAGCAGGAGTAAGTCTTGATGAACTTCAGATAAAAATTGAAATAGATATTCTACATTATACGCTCTCCCTCTCATTTTTGTTCAATTTATTATTGACATTTATCATATTTATCCATGACTGCAGATTTTGAAGCCTGCTTTTACCATTATAGTTGACCATAATACAAAATGTATTCTCCTGTTGATTCGTGGAACACACAGTATCAAAGACACTCTTACAGCTGCCACTGGAGCTGTGGTGCCATTCCATCATAGTGTGGTGCATGAGGGAGGGGTTAGCAATTTGGTTTTAGGATATGCACATTGTGGAATGGTTGCAGCTGCTCGTTGGATTGCGAAGTTATCTACTCCTTGTCTTCTGAAAGCGCTTGGTCAATATTCTGGTTATAATATTAAGGTTTGACATTTCTTCTTAGTTTACTGGGATTCTTTTTAGCAGATAAGAAGTGTAAAGGAAAAAAAAGATCGATGGAATAATCATTTGTTGGTTTCTAATAGGTTGTGGGGCACTCTTTGGGTGGAGGAACAGCTGCACTTCTAACTTATATTTTAAGAGAGCAGAAGGAATTGTCTATTACTTCTTGTGTTACATTTGCCCCAGGTCTGTATTATTTTTGCATGGTTTGTGTGTGTGTGTTTTAAAAAATTACATTATACTTGACATGATGTTCTGACTTTTCTTTCTTTATGAACTGATCATGTTACTGAATATCTATAACCTCTGCAGCTGCTTGTATGACATGGGAACTAGCAGAATCAGGCAATGAATTTATCACTTCTGTGATTAATGGAGCTGATTTGGTGCCCACTTTCTCAGCTGCTTCAGTAGATGACTTGCGTGGTGAGGTACGTTTGAAGTTCATCTAATTAACAAGCAATTTATTTGCTTCAACTGATTACTCTATAAATTACGTGTATTTTTTTCTGAAAATTTTCTTATTACATTTTATCTTTTGGTGAAAGAATATTTGAGTTGTTAGAGTTTAGGAATGAACTCAGAAGCACAGACACTTTAGTTTAGCAAGCGTGTCCATGTCCGATACGTGTGTTCTACTCACGATGATGAATGTAATTTAGATGAACAAAATTTAGTTAACTTAGCCACATCCCTCTGATCTTTTGCTAAAGCCAATGAAATTTCTGGTATAGGTGACTGCTTCTGCTTGGGTAAATGATCTGAGAAATCAAATTGAGAGAACTCGAATTCTTAGCACTGTTTACCGTTCTGCTTCGGCATTGGGATCCCGTCTCCCATCCATAGCCAGTGCGAGAGCAAAAGTTGCTGGTGCAGGGGCCATTCTGAGACCAGTCTCTAGTGGCACACAGGTAACATGCATCGTGCACATGATCCTTATTCTGTATATATTAGTAATTTTCTTAGAGCTGGAATTTGACAGAAATTTGAAATGGGCGGGGTGTGGTTCTTTGACTTCTTAATAGTTTACTGAAATGTAGGTTTAGGACCTCGTGATCTTGGTGTCATTAGATGGCAATAACTAATCCGGAGGTTTTTATATATATATATATATATATATATTATTTTTTTATTTTATTTTTTTAATATAAAAGAACAAACACAGTAATCTAAAGATATCAGATACCTAGATTCTCAAACAATTAAATATCGGACATCTAATGATCATTAATATTCTTTTGGGGGGGGGGGGGGGGGGGGGCGTTATGGCAGGTTAGTTCCTGTGACTTGTATATAATAGTTAAATTTATACCTGTTTATGTTTTGTATTTCCGCAGGTTGTGATGAAGAGAGCACAGAGCATGGCTCATGCAGCATGGACACGACCTTCACTCCGTTTATCATCTTGGTCATGCATAGGCCCACGTCGTAGAGCCATGACTTCTCACTCAGTGGCTGAAGAAGATGGAAGTTCACCAAAATCATCTCCAAGAAAAATGGAATCTTGTGAGCCACTTAGATCATCCCCTGAAGAAATTGTGGAAGCTATTGAACTTCCTGAATCCTCAACGACAGCAATGCAATGGACTAATGAAATTGAATGCTCATATTCTGAGGAGATAAATCCTGAGGGTATGACAGATGAGCTTGATGATGATGGTCAGGCCCTCATGGGCCATATTCAAGATGAACAAATGACGGAAGTTGAGTTATGGCAGCAACTCGAGCATGAACTATACGATAGGGGTGAGCCTGATGTTGCCAAGGAAATAAGGGAAGAAGAAGCTGCTGCTATGGCAGAAGTAGGACAGTCTGATAGCTCTACTTCTGGAATAAAGGAAGCACACAGATTTTTTCCTGCTGGGAAGATCATGCACATTATCGATATTCAATCAGATGCCCCTGATTGTGAAAGTGATAGCAGCAGCTCCAGATCCAGCATCTCAGACAACAGCCCCCTGGCAGAGTCTAAGATTGGTATTTTCCTTACATCAAGATCATTGTATAGCAAACTCAGATTGTCTCAGACAATGATAAGTGACCACTACATGCCAGCTTATAGAAGACAGATAGAAAAGTTAATTAAAGAATTGGAGAAAGAAGATTGTTACAATCGTGAAATGGAAAGGTAG

mRNA sequence

ATGAGTCGTGGAAGTGGAGGCGGATACGATCGTCACATCACAATTTTCTCTCCCGAAGGCCGTCTATTTCAAGTTGAATATGCATTTAAGGCTGTCAAGGCTGCTGGAATTACTTCGATTGGTGTTCAGGGTAAAAATTCAGTTTGTGTCGTGACCCAAAAGAAGGTTCCGGATAAGCTTCTGGATCAATCAAGTGTCTCACATCTTTTCCCTATAACAAAGTACCTAGGGTTGTTAGCAACAGGCATGACAGCTGATGCCAGAACCTTAGTTCAGCAAGCGAGGAATGAAGCAGCCGAGTTTCGTTTTAGATATGGATATGAGATGCCGGTGGATTTATTAGCTAAATGGATAGCTGATAAATCACAAATCTATACTCAGCATGCTTATATGAGATCACTTGGAGTAGTTGCCATGGTTTTAGGCATTGATGATGAGTTTGGACCTCGCCTCTACAAATGTGATCCAGCTGGTCATTTCTTTGGTCACAAGGCTACGAGTGCTGGTTTAAAAGAGCAGGAATCAATCAATTTCTTGGAAAAGAAAATGAAGAACGATCCAGAGTTCACCTATGAAGAGACAGTACAGACTGCAATTTCAGCCCTCCAATCGGTTCTGCAGGAGGACTTCAAGGCAAATGAGATTGAGGCATTATGTTGGTTGATTGAACTCATACTTTTGGATTGCATCGATAATATGGGAAGAGAAGAGAGCATGTTCTTGGGTAATGGGGATTACAAGTTCATGGCAACTGCAACTATGGCCACGGCTGCTGGTGCAGCTGCACTTCTATATTATACATTGAACCGTAAGTTGCATTCAAGTGGAGATGACGATGATGGGGATGTAGATGGTAATGATGCTCCCAGTCATGCTCTTCTGGGAGGTGATCGAGTTTCTCATAGACTGATTCAAGCGCCTGCTACATGGCTTGAAACAATTTCAACATTATCTGAGACCTTAAGGTTTACATACTCGGAGACACTTGGAAAGTGGCCCATTGGGGATTTGGCATTTGGAATTAATTTTCTCTTAAAGAGACAGGGAAACTTACATGTAGGCAGTGTCTTTGGTAATGAAGACAGTATTCAGCTTAAGGGGACTGAAATGATTACTGAGCTGAAATACCTCTTGCACTTGTTGACTTTGTGTTGGCATTTCTCAAAAAAACCATTTCCGCTGTTTCTAGAAGAAACTGGTTTTTCCAAGGAGAATATTCTCCTTCAGGAACCAAAAGCAGGAATTTTGAAGCCTGCTTTTACCATTATAGTTGACCATAATACAAAATGTATTCTCCTGTTGATTCGTGGAACACACAGTATCAAAGACACTCTTACAGCTGCCACTGGAGCTGTGGTGCCATTCCATCATAGTGTGGTGCATGAGGGAGGGGTTAGCAATTTGGTTTTAGGATATGCACATTGTGGAATGGTTGCAGCTGCTCGTTGGATTGCGAAGTTATCTACTCCTTGTCTTCTGAAAGCGCTTGGTCAATATTCTGGTTATAATATTAAGGTTGTGGGGCACTCTTTGGGTGGAGGAACAGCTGCACTTCTAACTTATATTTTAAGAGAGCAGAAGGAATTGTCTATTACTTCTTGTGTTACATTTGCCCCAGCTGCTTGTATGACATGGGAACTAGCAGAATCAGGCAATGAATTTATCACTTCTGTGATTAATGGAGCTGATTTGGTGCCCACTTTCTCAGCTGCTTCAGTAGATGACTTGCGTGGTGAGGTTGTGATGAAGAGAGCACAGAGCATGGCTCATGCAGCATGGACACGACCTTCACTCCGTTTATCATCTTGGTCATGCATAGGCCCACGTCGTAGAGCCATGACTTCTCACTCAGTGGCTGAAGAAGATGGAAGTTCACCAAAATCATCTCCAAGAAAAATGGAATCTTGTGAGCCACTTAGATCATCCCCTGAAGAAATTGTGGAAGCTATTGAACTTCCTGAATCCTCAACGACAGCAATGCAATGGACTAATGAAATTGAATGCTCATATTCTGAGGAGATAAATCCTGAGGGTATGACAGATGAGCTTGATGATGATGGTCAGGCCCTCATGGGCCATATTCAAGATGAACAAATGACGGAAGTTGAGTTATGGCAGCAACTCGAGCATGAACTATACGATAGGGGTGAGCCTGATGTTGCCAAGGAAATAAGGGAAGAAGAAGCTGCTGCTATGGCAGAAGTAGGACAGTCTGATAGCTCTACTTCTGGAATAAAGGAAGCACACAGATTTTTTCCTGCTGGGAAGATCATGCACATTATCGATATTCAATCAGATGCCCCTGATTGTGAAAGTGATAGCAGCAGCTCCAGATCCAGCATCTCAGACAACAGCCCCCTGGCAGAGTCTAAGATTGGTATTTTCCTTACATCAAGATCATTGTATAGCAAACTCAGATTGTCTCAGACAATGATAAGTGACCACTACATGCCAGCTTATAGAAGACAGATAGAAAAGTTAATTAAAGAATTGGAGAAAGAAGATTGTTACAATCGTGAAATGGAAAGGTAG

Coding sequence (CDS)

ATGAGTCGTGGAAGTGGAGGCGGATACGATCGTCACATCACAATTTTCTCTCCCGAAGGCCGTCTATTTCAAGTTGAATATGCATTTAAGGCTGTCAAGGCTGCTGGAATTACTTCGATTGGTGTTCAGGGTAAAAATTCAGTTTGTGTCGTGACCCAAAAGAAGGTTCCGGATAAGCTTCTGGATCAATCAAGTGTCTCACATCTTTTCCCTATAACAAAGTACCTAGGGTTGTTAGCAACAGGCATGACAGCTGATGCCAGAACCTTAGTTCAGCAAGCGAGGAATGAAGCAGCCGAGTTTCGTTTTAGATATGGATATGAGATGCCGGTGGATTTATTAGCTAAATGGATAGCTGATAAATCACAAATCTATACTCAGCATGCTTATATGAGATCACTTGGAGTAGTTGCCATGGTTTTAGGCATTGATGATGAGTTTGGACCTCGCCTCTACAAATGTGATCCAGCTGGTCATTTCTTTGGTCACAAGGCTACGAGTGCTGGTTTAAAAGAGCAGGAATCAATCAATTTCTTGGAAAAGAAAATGAAGAACGATCCAGAGTTCACCTATGAAGAGACAGTACAGACTGCAATTTCAGCCCTCCAATCGGTTCTGCAGGAGGACTTCAAGGCAAATGAGATTGAGGCATTATGTTGGTTGATTGAACTCATACTTTTGGATTGCATCGATAATATGGGAAGAGAAGAGAGCATGTTCTTGGGTAATGGGGATTACAAGTTCATGGCAACTGCAACTATGGCCACGGCTGCTGGTGCAGCTGCACTTCTATATTATACATTGAACCGTAAGTTGCATTCAAGTGGAGATGACGATGATGGGGATGTAGATGGTAATGATGCTCCCAGTCATGCTCTTCTGGGAGGTGATCGAGTTTCTCATAGACTGATTCAAGCGCCTGCTACATGGCTTGAAACAATTTCAACATTATCTGAGACCTTAAGGTTTACATACTCGGAGACACTTGGAAAGTGGCCCATTGGGGATTTGGCATTTGGAATTAATTTTCTCTTAAAGAGACAGGGAAACTTACATGTAGGCAGTGTCTTTGGTAATGAAGACAGTATTCAGCTTAAGGGGACTGAAATGATTACTGAGCTGAAATACCTCTTGCACTTGTTGACTTTGTGTTGGCATTTCTCAAAAAAACCATTTCCGCTGTTTCTAGAAGAAACTGGTTTTTCCAAGGAGAATATTCTCCTTCAGGAACCAAAAGCAGGAATTTTGAAGCCTGCTTTTACCATTATAGTTGACCATAATACAAAATGTATTCTCCTGTTGATTCGTGGAACACACAGTATCAAAGACACTCTTACAGCTGCCACTGGAGCTGTGGTGCCATTCCATCATAGTGTGGTGCATGAGGGAGGGGTTAGCAATTTGGTTTTAGGATATGCACATTGTGGAATGGTTGCAGCTGCTCGTTGGATTGCGAAGTTATCTACTCCTTGTCTTCTGAAAGCGCTTGGTCAATATTCTGGTTATAATATTAAGGTTGTGGGGCACTCTTTGGGTGGAGGAACAGCTGCACTTCTAACTTATATTTTAAGAGAGCAGAAGGAATTGTCTATTACTTCTTGTGTTACATTTGCCCCAGCTGCTTGTATGACATGGGAACTAGCAGAATCAGGCAATGAATTTATCACTTCTGTGATTAATGGAGCTGATTTGGTGCCCACTTTCTCAGCTGCTTCAGTAGATGACTTGCGTGGTGAGGTTGTGATGAAGAGAGCACAGAGCATGGCTCATGCAGCATGGACACGACCTTCACTCCGTTTATCATCTTGGTCATGCATAGGCCCACGTCGTAGAGCCATGACTTCTCACTCAGTGGCTGAAGAAGATGGAAGTTCACCAAAATCATCTCCAAGAAAAATGGAATCTTGTGAGCCACTTAGATCATCCCCTGAAGAAATTGTGGAAGCTATTGAACTTCCTGAATCCTCAACGACAGCAATGCAATGGACTAATGAAATTGAATGCTCATATTCTGAGGAGATAAATCCTGAGGGTATGACAGATGAGCTTGATGATGATGGTCAGGCCCTCATGGGCCATATTCAAGATGAACAAATGACGGAAGTTGAGTTATGGCAGCAACTCGAGCATGAACTATACGATAGGGGTGAGCCTGATGTTGCCAAGGAAATAAGGGAAGAAGAAGCTGCTGCTATGGCAGAAGTAGGACAGTCTGATAGCTCTACTTCTGGAATAAAGGAAGCACACAGATTTTTTCCTGCTGGGAAGATCATGCACATTATCGATATTCAATCAGATGCCCCTGATTGTGAAAGTGATAGCAGCAGCTCCAGATCCAGCATCTCAGACAACAGCCCCCTGGCAGAGTCTAAGATTGGTATTTTCCTTACATCAAGATCATTGTATAGCAAACTCAGATTGTCTCAGACAATGATAAGTGACCACTACATGCCAGCTTATAGAAGACAGATAGAAAAGTTAATTAAAGAATTGGAGAAAGAAGATTGTTACAATCGTGAAATGGAAAGGTAG

Protein sequence

MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKLLDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIADKSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLEKKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIEALCWLIELILLDCIDNMGREESMFLGNGDYKFMATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGTEMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNTKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLSTPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELAESGNEFITSVINGADLVPTFSAASVDDLRGEVVMKRAQSMAHAAWTRPSLRLSSWSCIGPRRRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIECSYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEAAAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSPLAESKIGIFLTSRSLYSKLRLSQTMISDHYMPAYRRQIEKLIKELEKEDCYNREMER
Homology
BLAST of HG10002437 vs. NCBI nr
Match: KAG6581770.1 (Proteasome subunit alpha type-6, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1449.1 bits (3750), Expect = 0.0e+00
Identity = 762/911 (83.64%), Postives = 790/911 (86.72%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQSSVSHLFPITKYLGLLATG+TADARTLVQQAR+EAAEFRFRYGYEMPVD+LAKWIAD
Sbjct: 61  LDQSSVSHLFPITKYLGLLATGLTADARTLVQQARSEAAEFRFRYGYEMPVDVLAKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQIYTQHAYMR LGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE
Sbjct: 121 KSQIYTQHAYMRPLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIEALCWLIELILLDCIDNMGREESMF 240
           KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE         +   +    R +   
Sbjct: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIEVGVVRQGNPVFRVLTTEERRKDFD 240

Query: 241 LGNG-------DYKFMATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHAL 300
            G         +YKFMATATMATAAGAAALLYYTLNRKLHSS D DD DV GNDA SHAL
Sbjct: 241 SGTSQYTSELREYKFMATATMATAAGAAALLYYTLNRKLHSSRDQDDSDVAGNDASSHAL 300

Query: 301 LGGDRVSHRLIQAPATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHV 360
           LGGDRVSHRL+QAPATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHV
Sbjct: 301 LGGDRVSHRLVQAPATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHV 360

Query: 361 GSVFGNEDSIQLKGTEMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKA 420
             VFGNEDS QLKG EMI+ELKYLLHLLTLCWHFSKKPFPLFLEETGFS+EN+LLQEPKA
Sbjct: 361 SGVFGNEDSSQLKGAEMISELKYLLHLLTLCWHFSKKPFPLFLEETGFSEENVLLQEPKA 420

Query: 421 GILKPAFTIIVDHNTKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYA 480
           GILKPAFTIIVDHNTKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYA
Sbjct: 421 GILKPAFTIIVDHNTKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYA 480

Query: 481 HCGMVAAARWIAKLSTPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITS 540
           HCGMVAAARWIAKLSTPCLL ALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELS+TS
Sbjct: 481 HCGMVAAARWIAKLSTPCLLSALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSVTS 540

Query: 541 CVTFAPAACMTWELAESGNEFITSVINGADLVPTFSAASVDDLRGE-------------- 600
           CVTFAPAACMTWELAESGN+FITSVINGADLVPTFSAASVDDLR E              
Sbjct: 541 CVTFAPAACMTWELAESGNDFITSVINGADLVPTFSAASVDDLRAEVTASAWVNDLRNQI 600

Query: 601 ---------------------------------------------VVMKRAQSMAHAAWT 660
                                                        VVMKRAQSMA AAWT
Sbjct: 601 ERTRILSTVYRSASALGSRLPSIASARAKVAGAGAILKPVSSGTQVVMKRAQSMAQAAWT 660

Query: 661 RPSLRLSSWSCIGPRRRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPE 720
           RPSL LSSWSCIGPRRRAMTSHS AEE+GSSPKSSPRKMES EPLRSSP+EIVEA ELPE
Sbjct: 661 RPSLHLSSWSCIGPRRRAMTSHSTAEENGSSPKSSPRKMESSEPLRSSPQEIVEATELPE 720

Query: 721 SSTTAMQWTNEIECSYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYD 780
           SSTT +QWTNEIECSYSEEINP+GM DELDDD QALM HIQDEQ+TEVELWQQLEHEL+D
Sbjct: 721 SSTTTIQWTNEIECSYSEEINPDGMRDELDDDDQALMSHIQDEQITEVELWQQLEHELHD 780

Query: 781 RGEPDVAKEIREEEAAAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDI--QSDAPDCES 840
           R E DVAKEIREEEAAAMAEVGQSDS TSG+KEAHRFFPAGKIMHII+I  QSDAPDCES
Sbjct: 781 RSEADVAKEIREEEAAAMAEVGQSDSFTSGMKEAHRFFPAGKIMHIIEIETQSDAPDCES 840

Query: 841 DSSSSRSSISDNSPLAESKIGIFLTSRSLYSKLRLSQTMISDHYMPAYRRQIEKLIKELE 844
           DSSSS SS SD+SP A+S+ GIFLTSRSLYSKLRLSQTMISDHYMPAYRRQIEKL+KELE
Sbjct: 841 DSSSS-SSTSDSSPQAQSRTGIFLTSRSLYSKLRLSQTMISDHYMPAYRRQIEKLVKELE 900

BLAST of HG10002437 vs. NCBI nr
Match: KAA8540233.1 (hypothetical protein F0562_024204 [Nyssa sinensis])

HSP 1 Score: 1110.5 bits (2871), Expect = 0.0e+00
Identity = 601/888 (67.68%), Postives = 671/888 (75.56%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQ+SV+HLFPITKYLGLLATGMTADARTLVQQARNEAAEFRF+YGYEMPVD+LA+WIAD
Sbjct: 61  LDQTSVTHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFKYGYEMPVDVLARWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAMVLGIDDEFGPRL+KCDPAGHFFGHKATSAGLKEQE+INFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMVLGIDDEFGPRLFKCDPAGHFFGHKATSAGLKEQEAINFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIEA----------------------- 240
           KKMKNDP F+YEETVQTAISALQSVLQEDFKANEIE                        
Sbjct: 181 KKMKNDPAFSYEETVQTAISALQSVLQEDFKANEIEVGVVKKENPVFRVLSTEEIDEHLT 240

Query: 241 --LCWLIE-----------------------------LILLDCIDNMGREESMFLGNGDY 300
              CW  E                             L + +    +  +  +F  + D 
Sbjct: 241 AIKCWRFEWREGPFILFYVHRIFQCYEKRFGLKIEIALSIYETYHRLSVKIELFRFS-DR 300

Query: 301 KFMATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQA 360
           K MATATMATAAGAAALLYYTLNRKL S+ + +D D  G++  SHA LG DRVS+RLI+A
Sbjct: 301 KLMATATMATAAGAAALLYYTLNRKLQSTTNTEDDDEIGSNLHSHAPLGIDRVSNRLIRA 360

Query: 361 PATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLK 420
           PATWLETISTLSETLRFTYSETLGKWPIGDLAFGI+FLLKRQGNLHVGSVFG  DS+QLK
Sbjct: 361 PATWLETISTLSETLRFTYSETLGKWPIGDLAFGISFLLKRQGNLHVGSVFGGNDSLQLK 420

Query: 421 GTEMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDH 480
           G+++I+EL+YLL+LLTLCWHFSKKPFPLFLEETG+SKEN+LLQEPKAGILKPAFT +VDH
Sbjct: 421 GSDIISELRYLLNLLTLCWHFSKKPFPLFLEETGYSKENVLLQEPKAGILKPAFTCLVDH 480

Query: 481 NTKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAK 540
            TK  LLLIRGTHSIKDTLTAATGAVVPFHH+VVHEGGVSNLVLGYAHCGMVAAARWIAK
Sbjct: 481 KTKSFLLLIRGTHSIKDTLTAATGAVVPFHHTVVHEGGVSNLVLGYAHCGMVAAARWIAK 540

Query: 541 LSTPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWE 600
           L+TPCL+KAL +Y  Y +K+VGHSLGGGTAALLTY+LREQKELS  +CVTFA AACMTWE
Sbjct: 541 LATPCLIKALSKYPDYKLKIVGHSLGGGTAALLTYVLREQKELSTATCVTFAAAACMTWE 600

Query: 601 LAESGNEFITSVINGADLVPTFSAASVDDLRGE--------------------------- 660
           LAESGN FITSVINGADLVPTFSAASVDDLR E                           
Sbjct: 601 LAESGNGFITSVINGADLVPTFSAASVDDLRAEVTASAWLNDLRNQIERTRILSTVYRSA 660

Query: 661 --------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIG 720
                                           VVMKRAQSMA AAW+RPSL LSSWSC+G
Sbjct: 661 SALGSRLPSIASARARVAGAGAILRPVSNGTQVVMKRAQSMAQAAWSRPSLHLSSWSCMG 720

Query: 721 PRRRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEI---VEAIELPESSTTAMQWTN 767
           PR RA  +H+   E G+S + S  K E+ EPL +SP +    +E IELP SS+  M W +
Sbjct: 721 PRHRANAAHANLSEGGNSTEPSLTKSETSEPLLTSPRKTTSSIEIIELPISSSEGMVWNS 780

BLAST of HG10002437 vs. NCBI nr
Match: XP_038901142.1 (uncharacterized protein LOC120088127 isoform X1 [Benincasa hispida] >XP_038901147.1 uncharacterized protein LOC120088127 isoform X1 [Benincasa hispida])

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 575/654 (87.92%), Postives = 578/654 (88.38%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSSGD DDGDVDGND PSHALLGGDRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSGDQDDGDVDGNDTPSHALLGGDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKEN+LLQEPKAGILKPAFTIIVDHNT
Sbjct: 121 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENVLLQEPKAGILKPAFTIIVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLKAL QYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA
Sbjct: 241 TPCLLKALAQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLRGE                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRGEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSLRLSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILRPVSSGTQVVMKRAQSMAQAAWTRPSLRLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIECS 728
           RRAM SHSVAEEDGSSPK SPRKME CEPLRSSPEEIVEAIE PESSTTAMQWTNEIECS
Sbjct: 421 RRAMASHSVAEEDGSSPKPSPRKMEPCEPLRSSPEEIVEAIEHPESSTTAMQWTNEIECS 480

Query: 729 YSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA 788
           YSEEI PEGMTD LDDDGQALM HIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA
Sbjct: 481 YSEEIIPEGMTDGLDDDGQALMDHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA 540

Query: 789 AAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSPLAE 844
           AAMA VGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSD+P CESDSSS+ SSISDNSPL E
Sbjct: 541 AAMAAVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDSPVCESDSSSA-SSISDNSPLEE 600

BLAST of HG10002437 vs. NCBI nr
Match: XP_008460311.1 (PREDICTED: uncharacterized protein LOC103499170 [Cucumis melo])

HSP 1 Score: 1083.2 bits (2800), Expect = 0.0e+00
Identity = 567/654 (86.70%), Postives = 580/654 (88.69%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSSGD DDGDVDGNDAP+HALLGGDRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSGDQDDGDVDGNDAPTHALLGGDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKG 
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGK 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKEN+LLQEPKAGILKPAFTIIVDHNT
Sbjct: 121 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENVLLQEPKAGILKPAFTIIVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA
Sbjct: 241 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLRGE                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRGEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSLRLSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILRPVSSGTQVVMKRAQSMAAAAWTRPSLRLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIECS 728
           RRAM SHSVAEE GSSPK SPRKMESCEPLRS+PEEIVEAIE  ESSTTAM+W+NEIE S
Sbjct: 421 RRAMASHSVAEEGGSSPKPSPRKMESCEPLRSTPEEIVEAIEPTESSTTAMEWSNEIEYS 480

Query: 729 YSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA 788
           YSEEINPEG+TDELDDDGQ LM +IQDEQMTEVELWQQLEHELYD+GEPDVA+EIREEEA
Sbjct: 481 YSEEINPEGITDELDDDGQTLMSNIQDEQMTEVELWQQLEHELYDKGEPDVAEEIREEEA 540

Query: 789 AAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSPLAE 844
           AAMAEVGQSDSS SGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSS SSIS+NSPLAE
Sbjct: 541 AAMAEVGQSDSSASGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSTSSISENSPLAE 600

BLAST of HG10002437 vs. NCBI nr
Match: XP_004144431.1 (uncharacterized protein LOC101203983 [Cucumis sativus] >KGN58447.1 hypothetical protein Csa_002505 [Cucumis sativus])

HSP 1 Score: 1073.9 bits (2776), Expect = 6.5e-310
Identity = 564/657 (85.84%), Postives = 578/657 (87.98%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSSGD DDGDVDGNDA +HALLGGDRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSGDQDDGDVDGNDASTHALLGGDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKEN+LLQEPKAGILKPAFTI+VDHNT
Sbjct: 121 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENVLLQEPKAGILKPAFTILVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA
Sbjct: 241 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLR E                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRSEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSL LSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILRPVSSGTQVVMKRAQSMAQAAWTRPSLHLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIEL---PESSTTAMQWTNEI 728
           RRAM SHSVAEE GSSPK SPRKMESCEPLRSSPEE VEAIE    PESSTTAMQW+NEI
Sbjct: 421 RRAMASHSVAEEGGSSPKPSPRKMESCEPLRSSPEETVEAIEAIEPPESSTTAMQWSNEI 480

Query: 729 ECSYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIRE 788
           E SYSEEINPEG+TDEL+DDGQ LMG+IQDEQMTEVELWQQLEHELYD+GEPDVA+EIRE
Sbjct: 481 EYSYSEEINPEGITDELEDDGQTLMGNIQDEQMTEVELWQQLEHELYDKGEPDVAEEIRE 540

Query: 789 EEAAAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSP 844
           EEAAAMAEVGQSD+S  GIKEAHRFFPAGKIMH+IDIQSDAPDCESDSSSSRSSIS+NSP
Sbjct: 541 EEAAAMAEVGQSDTSACGIKEAHRFFPAGKIMHMIDIQSDAPDCESDSSSSRSSISENSP 600

BLAST of HG10002437 vs. ExPASy Swiss-Prot
Match: O48551 (Proteasome subunit alpha type-6 OS=Glycine max OX=3847 GN=PAA1 PE=2 SV=2)

HSP 1 Score: 405.6 bits (1041), Expect = 1.4e-111
Identity = 197/216 (91.20%), Postives = 210/216 (97.22%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSGGGY+RHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+S+CVVT KKVPDKL
Sbjct: 1   MSRGSGGGYNRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSICVVTHKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LD +SV+HLFPITKYLGLLATGMTADARTLVQQARNEAAEFRF YGYEMPVD+LAKWIAD
Sbjct: 61  LDNTSVTHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFTYGYEMPVDVLAKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAMVLGIDDE+GP+LYKCDPAGH+FGHKATSAGLK+QE+INFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMVLGIDDEYGPQLYKCDPAGHYFGHKATSAGLKDQEAINFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE 217
           KKMKNDP FTYEETVQTAISALQSVLQEDFKA EIE
Sbjct: 181 KKMKNDPSFTYEETVQTAISALQSVLQEDFKATEIE 216

BLAST of HG10002437 vs. ExPASy Swiss-Prot
Match: Q9XG77 (Proteasome subunit alpha type-6 OS=Nicotiana tabacum OX=4097 GN=PAA1 PE=2 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 1.4e-111
Identity = 198/216 (91.67%), Postives = 212/216 (98.15%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQ+SVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRF+YGYEMPVD+L+KWIAD
Sbjct: 61  LDQTSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFKYGYEMPVDVLSKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAM+LGID+E GP+L+KCDPAGHFFGHKATSAG KEQE+INFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMILGIDEEKGPQLFKCDPAGHFFGHKATSAGSKEQEAINFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE 217
           KKMKNDP F+YEETVQTAISALQSVLQEDFKA+EIE
Sbjct: 181 KKMKNDPAFSYEETVQTAISALQSVLQEDFKASEIE 216

BLAST of HG10002437 vs. ExPASy Swiss-Prot
Match: O81147 (Proteasome subunit alpha type-6-B OS=Arabidopsis thaliana OX=3702 GN=PAA2 PE=1 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 3.7e-109
Identity = 193/216 (89.35%), Postives = 208/216 (96.30%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSG GYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGAGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQSSVSHLFP+TKYLGLLATGMTAD+R+LVQQARNEAAEFRF+YGYEMP D+LAKWIAD
Sbjct: 61  LDQSSVSHLFPVTKYLGLLATGMTADSRSLVQQARNEAAEFRFQYGYEMPADILAKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAMVLGID+E GP LYKCDPAGHF+GHKATSAG+KEQE++NFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMVLGIDEERGPLLYKCDPAGHFYGHKATSAGMKEQEAVNFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE 217
           KKMK +P FTY+ETVQTAISALQSVLQEDFKA EIE
Sbjct: 181 KKMKENPAFTYDETVQTAISALQSVLQEDFKATEIE 216

BLAST of HG10002437 vs. ExPASy Swiss-Prot
Match: O81146 (Proteasome subunit alpha type-6-A OS=Arabidopsis thaliana OX=3702 GN=PAA1 PE=1 SV=2)

HSP 1 Score: 390.2 bits (1001), Expect = 6.0e-107
Identity = 187/216 (86.57%), Postives = 207/216 (95.83%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSG GYDRHITIFSPEGRLFQVEYAFKAVK AGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGAGYDRHITIFSPEGRLFQVEYAFKAVKTAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQSSV+HLFPITKY+GL+ATG+TADAR+LVQQARN+AAEFRF YGYEMPVD+LAKWIAD
Sbjct: 61  LDQSSVTHLFPITKYIGLVATGITADARSLVQQARNQAAEFRFTYGYEMPVDILAKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAMV+G+D+E GP LYKCDPAGHF+GHKATSAG+KEQE++NFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMVMGVDEENGPLLYKCDPAGHFYGHKATSAGMKEQEAVNFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE 217
           KKMK +P FT++ETVQTAISALQSVLQEDFKA EIE
Sbjct: 181 KKMKENPSFTFDETVQTAISALQSVLQEDFKATEIE 216

BLAST of HG10002437 vs. ExPASy Swiss-Prot
Match: Q9LSU3 (Proteasome subunit alpha type-6 OS=Oryza sativa subsp. japonica OX=39947 GN=PAA1 PE=2 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 4.7e-104
Identity = 183/216 (84.72%), Postives = 206/216 (95.37%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRG+G GYDRHITIFSPEGRL+QVEYAFKAVK+AG+TSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGTGAGYDRHITIFSPEGRLYQVEYAFKAVKSAGVTSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LD +SV+HLFPITKY+GLLATG+TADAR+LV QARNEAAEFRF++GYEMPVD+LAKWIAD
Sbjct: 61  LDHTSVTHLFPITKYIGLLATGLTADARSLVYQARNEAAEFRFKWGYEMPVDVLAKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           K+Q+YTQHAYMR LGVVAMVLG D+E   +L+KCDPAGHFFGHKATSAGLKEQE+INFLE
Sbjct: 121 KAQVYTQHAYMRPLGVVAMVLGYDEEKNAQLFKCDPAGHFFGHKATSAGLKEQEAINFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE 217
           KKMK+DP+F+YEETVQ AISALQSVLQEDFKA EIE
Sbjct: 181 KKMKDDPQFSYEETVQIAISALQSVLQEDFKATEIE 216

BLAST of HG10002437 vs. ExPASy TrEMBL
Match: A0A5J5BAP4 (PROTEASOME_ALPHA_1 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_024204 PE=3 SV=1)

HSP 1 Score: 1110.5 bits (2871), Expect = 0.0e+00
Identity = 601/888 (67.68%), Postives = 671/888 (75.56%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQ+SV+HLFPITKYLGLLATGMTADARTLVQQARNEAAEFRF+YGYEMPVD+LA+WIAD
Sbjct: 61  LDQTSVTHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFKYGYEMPVDVLARWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAMVLGIDDEFGPRL+KCDPAGHFFGHKATSAGLKEQE+INFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMVLGIDDEFGPRLFKCDPAGHFFGHKATSAGLKEQEAINFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIEA----------------------- 240
           KKMKNDP F+YEETVQTAISALQSVLQEDFKANEIE                        
Sbjct: 181 KKMKNDPAFSYEETVQTAISALQSVLQEDFKANEIEVGVVKKENPVFRVLSTEEIDEHLT 240

Query: 241 --LCWLIE-----------------------------LILLDCIDNMGREESMFLGNGDY 300
              CW  E                             L + +    +  +  +F  + D 
Sbjct: 241 AIKCWRFEWREGPFILFYVHRIFQCYEKRFGLKIEIALSIYETYHRLSVKIELFRFS-DR 300

Query: 301 KFMATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQA 360
           K MATATMATAAGAAALLYYTLNRKL S+ + +D D  G++  SHA LG DRVS+RLI+A
Sbjct: 301 KLMATATMATAAGAAALLYYTLNRKLQSTTNTEDDDEIGSNLHSHAPLGIDRVSNRLIRA 360

Query: 361 PATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLK 420
           PATWLETISTLSETLRFTYSETLGKWPIGDLAFGI+FLLKRQGNLHVGSVFG  DS+QLK
Sbjct: 361 PATWLETISTLSETLRFTYSETLGKWPIGDLAFGISFLLKRQGNLHVGSVFGGNDSLQLK 420

Query: 421 GTEMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDH 480
           G+++I+EL+YLL+LLTLCWHFSKKPFPLFLEETG+SKEN+LLQEPKAGILKPAFT +VDH
Sbjct: 421 GSDIISELRYLLNLLTLCWHFSKKPFPLFLEETGYSKENVLLQEPKAGILKPAFTCLVDH 480

Query: 481 NTKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAK 540
            TK  LLLIRGTHSIKDTLTAATGAVVPFHH+VVHEGGVSNLVLGYAHCGMVAAARWIAK
Sbjct: 481 KTKSFLLLIRGTHSIKDTLTAATGAVVPFHHTVVHEGGVSNLVLGYAHCGMVAAARWIAK 540

Query: 541 LSTPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWE 600
           L+TPCL+KAL +Y  Y +K+VGHSLGGGTAALLTY+LREQKELS  +CVTFA AACMTWE
Sbjct: 541 LATPCLIKALSKYPDYKLKIVGHSLGGGTAALLTYVLREQKELSTATCVTFAAAACMTWE 600

Query: 601 LAESGNEFITSVINGADLVPTFSAASVDDLRGE--------------------------- 660
           LAESGN FITSVINGADLVPTFSAASVDDLR E                           
Sbjct: 601 LAESGNGFITSVINGADLVPTFSAASVDDLRAEVTASAWLNDLRNQIERTRILSTVYRSA 660

Query: 661 --------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIG 720
                                           VVMKRAQSMA AAW+RPSL LSSWSC+G
Sbjct: 661 SALGSRLPSIASARARVAGAGAILRPVSNGTQVVMKRAQSMAQAAWSRPSLHLSSWSCMG 720

Query: 721 PRRRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEI---VEAIELPESSTTAMQWTN 767
           PR RA  +H+   E G+S + S  K E+ EPL +SP +    +E IELP SS+  M W +
Sbjct: 721 PRHRANAAHANLSEGGNSTEPSLTKSETSEPLLTSPRKTTSSIEIIELPISSSEGMVWNS 780

BLAST of HG10002437 vs. ExPASy TrEMBL
Match: A0A1S3CC71 (uncharacterized protein LOC103499170 OS=Cucumis melo OX=3656 GN=LOC103499170 PE=4 SV=1)

HSP 1 Score: 1083.2 bits (2800), Expect = 0.0e+00
Identity = 567/654 (86.70%), Postives = 580/654 (88.69%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSSGD DDGDVDGNDAP+HALLGGDRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSGDQDDGDVDGNDAPTHALLGGDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKG 
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGK 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKEN+LLQEPKAGILKPAFTIIVDHNT
Sbjct: 121 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENVLLQEPKAGILKPAFTIIVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA
Sbjct: 241 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLRGE                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRGEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSLRLSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILRPVSSGTQVVMKRAQSMAAAAWTRPSLRLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIECS 728
           RRAM SHSVAEE GSSPK SPRKMESCEPLRS+PEEIVEAIE  ESSTTAM+W+NEIE S
Sbjct: 421 RRAMASHSVAEEGGSSPKPSPRKMESCEPLRSTPEEIVEAIEPTESSTTAMEWSNEIEYS 480

Query: 729 YSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA 788
           YSEEINPEG+TDELDDDGQ LM +IQDEQMTEVELWQQLEHELYD+GEPDVA+EIREEEA
Sbjct: 481 YSEEINPEGITDELDDDGQTLMSNIQDEQMTEVELWQQLEHELYDKGEPDVAEEIREEEA 540

Query: 789 AAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSPLAE 844
           AAMAEVGQSDSS SGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSS SSIS+NSPLAE
Sbjct: 541 AAMAEVGQSDSSASGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSTSSISENSPLAE 600

BLAST of HG10002437 vs. ExPASy TrEMBL
Match: A0A0A0LCM6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G644860 PE=4 SV=1)

HSP 1 Score: 1073.9 bits (2776), Expect = 3.1e-310
Identity = 564/657 (85.84%), Postives = 578/657 (87.98%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSSGD DDGDVDGNDA +HALLGGDRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSGDQDDGDVDGNDASTHALLGGDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKEN+LLQEPKAGILKPAFTI+VDHNT
Sbjct: 121 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENVLLQEPKAGILKPAFTILVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA
Sbjct: 241 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLR E                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRSEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSL LSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILRPVSSGTQVVMKRAQSMAQAAWTRPSLHLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIEL---PESSTTAMQWTNEI 728
           RRAM SHSVAEE GSSPK SPRKMESCEPLRSSPEE VEAIE    PESSTTAMQW+NEI
Sbjct: 421 RRAMASHSVAEEGGSSPKPSPRKMESCEPLRSSPEETVEAIEAIEPPESSTTAMQWSNEI 480

Query: 729 ECSYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIRE 788
           E SYSEEINPEG+TDEL+DDGQ LMG+IQDEQMTEVELWQQLEHELYD+GEPDVA+EIRE
Sbjct: 481 EYSYSEEINPEGITDELEDDGQTLMGNIQDEQMTEVELWQQLEHELYDKGEPDVAEEIRE 540

Query: 789 EEAAAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSP 844
           EEAAAMAEVGQSD+S  GIKEAHRFFPAGKIMH+IDIQSDAPDCESDSSSSRSSIS+NSP
Sbjct: 541 EEAAAMAEVGQSDTSACGIKEAHRFFPAGKIMHMIDIQSDAPDCESDSSSSRSSISENSP 600

BLAST of HG10002437 vs. ExPASy TrEMBL
Match: A0A5A7UE47 (Mono-/di-acylglycerol lipase isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold522G00080 PE=4 SV=1)

HSP 1 Score: 1053.1 bits (2722), Expect = 6.0e-304
Identity = 551/635 (86.77%), Postives = 562/635 (88.50%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSSGD DDGDVDGNDAP+HALLGGDRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSGDQDDGDVDGNDAPTHALLGGDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKG 
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGK 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKEN+LLQEPKAGILKPAFTIIVDHNT
Sbjct: 121 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENVLLQEPKAGILKPAFTIIVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA
Sbjct: 241 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLRGE                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRGEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSLRLSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILRPVSSGTQVVMKRAQSMAAAAWTRPSLRLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIECS 728
           RRAM SHSVAEE GSSPK SPRKMESCEPLRS+PEEIVEAIE  ESSTTAM+W+NEIE S
Sbjct: 421 RRAMASHSVAEEGGSSPKPSPRKMESCEPLRSTPEEIVEAIEPTESSTTAMEWSNEIEYS 480

Query: 729 YSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA 788
           YSEEINPEG+TDELDDDGQ LM +IQDEQMTEVELWQQLEHELYD+GEPDVA+EIREEEA
Sbjct: 481 YSEEINPEGITDELDDDGQTLMSNIQDEQMTEVELWQQLEHELYDKGEPDVAEEIREEEA 540

Query: 789 AAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSRSSISDNSPLAE 825
           AAMAEVGQSDSS SGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSS SSIS+NSPLAE
Sbjct: 541 AAMAEVGQSDSSASGIKEAHRFFPAGKIMHIIDIQSDAPDCESDSSSSTSSISENSPLAE 600

BLAST of HG10002437 vs. ExPASy TrEMBL
Match: A0A6J1GYN5 (uncharacterized protein LOC111458044 OS=Cucurbita moschata OX=3662 GN=LOC111458044 PE=4 SV=1)

HSP 1 Score: 1037.3 bits (2681), Expect = 3.4e-299
Identity = 548/656 (83.54%), Postives = 566/656 (86.28%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKLHSS D DD DV GNDA SHALLGGDRVSHRL+QAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKLHSSRDQDDSDVAGNDASSHALLGGDRVSHRLVQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHV  VFGNEDS QLKG 
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVSGVFGNEDSSQLKGA 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           EMI+ELKYLLHLLTLCWHFSKKPFPLFLEETGFS+EN+LLQEPKAGILKPAFTIIVDHNT
Sbjct: 121 EMISELKYLLHLLTLCWHFSKKPFPLFLEETGFSEENVLLQEPKAGILKPAFTIIVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS
Sbjct: 181 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLL ALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELS+TSCVTFAPAACMTWELA
Sbjct: 241 TPCLLSALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSVTSCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           ESGNEFITSVINGADLVPTFSAASVDDLR E                             
Sbjct: 301 ESGNEFITSVINGADLVPTFSAASVDDLRAEVTASAWVNDLRNQIERTRILSTVYRSASA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVMKRAQSMA AAWTRPSL LSSWSCIGPR
Sbjct: 361 LGSRLPSIASARAKVAGAGAILKPVSSGTQVVMKRAQSMAQAAWTRPSLHLSSWSCIGPR 420

Query: 669 RRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIECS 728
           RRAMTSHS AEE+GSSPKSSPRKMES EPLRSSP+EIVEA ELPESSTT +QWTNEIECS
Sbjct: 421 RRAMTSHSTAEENGSSPKSSPRKMESFEPLRSSPQEIVEATELPESSTTTIQWTNEIECS 480

Query: 729 YSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVAKEIREEEA 788
           YSEEINP+GM DELDDD QALM HIQDEQ+TEVELWQQLEHEL+DR E DVAKEIREEEA
Sbjct: 481 YSEEINPDGMRDELDDDDQALMSHIQDEQITEVELWQQLEHELHDRSEADVAKEIREEEA 540

Query: 789 AAMAEVGQSDSSTSGIKEAHRFFPAGKIMHIIDI--QSDAPDCESDSSSSRSSISDNSPL 844
           AAMAEVGQSDS TSG+KEAHRFFPAGKIMHII+I  QSDAPDCESD SSSRSS SD+SP 
Sbjct: 541 AAMAEVGQSDSFTSGMKEAHRFFPAGKIMHIIEIETQSDAPDCESD-SSSRSSTSDSSPQ 600

BLAST of HG10002437 vs. TAIR 10
Match: AT3G14075.1 (Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 )

HSP 1 Score: 686.8 bits (1771), Expect = 2.2e-197
Identity = 392/656 (59.76%), Postives = 457/656 (69.66%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKL  +G  D  D +   + S   L  DRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKL-IAGPSDVDDENSEASASRPSLRIDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFL+KRQG LHV  VFG +DS++L+G+
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLIKRQGLLHVDRVFGGKDSVELRGS 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           E+ TELKYLLHLLTLCWHFSKK FP FLEETGF+KEN+L+ EPKAGILKPAFT++VDHNT
Sbjct: 121 EVATELKYLLHLLTLCWHFSKKSFPFFLEETGFTKENVLIHEPKAGILKPAFTVLVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           K  LLLIRGTHSIKDTLTAATGA+VPFHH+VV+E GVSNLVLGYAHCGMVAAAR IAKL+
Sbjct: 181 KYFLLLIRGTHSIKDTLTAATGAIVPFHHTVVNERGVSNLVLGYAHCGMVAAARCIAKLA 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLK L QY  Y IK+VGHSLGGGTAALLTYI+REQK LS  +CVTFAPAACMTWELA
Sbjct: 241 TPCLLKGLEQYPDYKIKIVGHSLGGGTAALLTYIMREQKMLSTATCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           +SGN+FI SVINGADLVPTFSAA+VDDLR E                             
Sbjct: 301 DSGNDFIVSVINGADLVPTFSAAAVDDLRAEVTASAWLNDLRNQIEHTRILSTVYRSATA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVM+RAQSM     TRP+L +SSWSC+GPR
Sbjct: 361 LGSRLPSMATAKAKVAGAGAMLRPVSSGTQVVMRRAQSML----TRPALSISSWSCMGPR 420

Query: 669 RRAMTSHSVAEED-GSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIEC 728
           RRA  + S++E    +S   S    E+ +PL  + EEI              +W +E EC
Sbjct: 421 RRASATQSISEHQLDTSEAMSQDIPETSDPLLVTDEEITG------------KWKSEAEC 480

Query: 729 SYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRG-----EPDVAKE 788
           S  EE +P     +LD+         ++E+MTE ELWQQLEH+LY        E DVAKE
Sbjct: 481 SNYEETSPRLGATDLDECEDPAEMDTREERMTEAELWQQLEHDLYHDSSEQPEETDVAKE 540

Query: 789 IREEEAAAMAEVGQS--DSSTSGIKEAHRFFPAGKIMHIIDIQSDA--PDCESDSSSSRS 835
           I+EEE A +AE G +  +S T+ +KE+ RF PAGKIMHI+ ++ +A  P+ E D   S  
Sbjct: 541 IKEEEEAVIAEAGVAPPESQTAEMKESRRFLPAGKIMHIVTVRPEAVEPNEEEDEDGSAL 600

BLAST of HG10002437 vs. TAIR 10
Match: AT3G14075.2 (Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 )

HSP 1 Score: 686.8 bits (1771), Expect = 2.2e-197
Identity = 392/656 (59.76%), Postives = 457/656 (69.66%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHSSGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAPA 308
           MATATMATAAGAAALLYYTLNRKL  +G  D  D +   + S   L  DRVSHRLIQAPA
Sbjct: 1   MATATMATAAGAAALLYYTLNRKL-IAGPSDVDDENSEASASRPSLRIDRVSHRLIQAPA 60

Query: 309 TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKGT 368
           TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFL+KRQG LHV  VFG +DS++L+G+
Sbjct: 61  TWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLIKRQGLLHVDRVFGGKDSVELRGS 120

Query: 369 EMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHNT 428
           E+ TELKYLLHLLTLCWHFSKK FP FLEETGF+KEN+L+ EPKAGILKPAFT++VDHNT
Sbjct: 121 EVATELKYLLHLLTLCWHFSKKSFPFFLEETGFTKENVLIHEPKAGILKPAFTVLVDHNT 180

Query: 429 KCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKLS 488
           K  LLLIRGTHSIKDTLTAATGA+VPFHH+VV+E GVSNLVLGYAHCGMVAAAR IAKL+
Sbjct: 181 KYFLLLIRGTHSIKDTLTAATGAIVPFHHTVVNERGVSNLVLGYAHCGMVAAARCIAKLA 240

Query: 489 TPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWELA 548
           TPCLLK L QY  Y IK+VGHSLGGGTAALLTYI+REQK LS  +CVTFAPAACMTWELA
Sbjct: 241 TPCLLKGLEQYPDYKIKIVGHSLGGGTAALLTYIMREQKMLSTATCVTFAPAACMTWELA 300

Query: 549 ESGNEFITSVINGADLVPTFSAASVDDLRGE----------------------------- 608
           +SGN+FI SVINGADLVPTFSAA+VDDLR E                             
Sbjct: 301 DSGNDFIVSVINGADLVPTFSAAAVDDLRAEVTASAWLNDLRNQIEHTRILSTVYRSATA 360

Query: 609 ------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGPR 668
                                         VVM+RAQSM     TRP+L +SSWSC+GPR
Sbjct: 361 LGSRLPSMATAKAKVAGAGAMLRPVSSGTQVVMRRAQSML----TRPALSISSWSCMGPR 420

Query: 669 RRAMTSHSVAEED-GSSPKSSPRKMESCEPLRSSPEEIVEAIELPESSTTAMQWTNEIEC 728
           RRA  + S++E    +S   S    E+ +PL  + EEI              +W +E EC
Sbjct: 421 RRASATQSISEHQLDTSEAMSQDIPETSDPLLVTDEEITG------------KWKSEAEC 480

Query: 729 SYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRG-----EPDVAKE 788
           S  EE +P     +LD+         ++E+MTE ELWQQLEH+LY        E DVAKE
Sbjct: 481 SNYEETSPRLGATDLDECEDPAEMDTREERMTEAELWQQLEHDLYHDSSEQPEETDVAKE 540

Query: 789 IREEEAAAMAEVGQS--DSSTSGIKEAHRFFPAGKIMHIIDIQSDA--PDCESDSSSSRS 835
           I+EEE A +AE G +  +S T+ +KE+ RF PAGKIMHI+ ++ +A  P+ E D   S  
Sbjct: 541 IKEEEEAVIAEAGVAPPESQTAEMKESRRFLPAGKIMHIVTVRPEAVEPNEEEDEDGSAL 600

BLAST of HG10002437 vs. TAIR 10
Match: AT4G16070.1 (Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 )

HSP 1 Score: 521.9 bits (1343), Expect = 9.3e-148
Identity = 319/670 (47.61%), Postives = 402/670 (60.00%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHS-SGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAP 308
           MA   M TA GA  +LY    R + + +G+DD G   G    S    G  R+  R  QAP
Sbjct: 1   MAAGVMVTATGAVVILYLLSRRIVWARNGEDDSGGELGKSGRS----GRRRIVRRPAQAP 60

Query: 309 ATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKG 368
           ATWLETISTLSETLRFTYSETLGKWPI DLAFGIN+L++RQGN    SV+   + I+LKG
Sbjct: 61  ATWLETISTLSETLRFTYSETLGKWPIADLAFGINYLMRRQGNFPTASVYAGSNCIELKG 120

Query: 369 TEMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHN 428
            E+I +L  LL  LTLC  FSKKPF +FLE  G++ E++LLQ+PKAGI++PAFTII D N
Sbjct: 121 PEIIMDLTELLRFLTLCMLFSKKPFAVFLESAGYTHEDVLLQKPKAGIMQPAFTIIRDTN 180

Query: 429 TKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKL 488
           +KCILLLIRGTHSIKDTLTAATGAVVPFHHSV+H+GG+SNLVLGYAHCGMVAAARWIAKL
Sbjct: 181 SKCILLLIRGTHSIKDTLTAATGAVVPFHHSVLHDGGLSNLVLGYAHCGMVAAARWIAKL 240

Query: 489 STPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWEL 548
           S PCLLKAL +   + +++VGHSLGGGTA+LLTYILREQKE +  +C TFAPAACMTW+L
Sbjct: 241 SVPCLLKALDENPSFKVQIVGHSLGGGTASLLTYILREQKEFASATCFTFAPAACMTWDL 300

Query: 549 AESGNEFITSVINGADLVPTFSAASVDDLRGE---------------------------- 608
           AESG  FIT++ING+DLVPTFSA+SVDDLR E                            
Sbjct: 301 AESGKHFITTIINGSDLVPTFSASSVDDLRSEVTSSSWSNDLRDQVEHTRVLSVVYRSAT 360

Query: 609 -------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGP 668
                                          V++KRAQ +A A   +    LSSWSCIGP
Sbjct: 361 AIGSRLPSIASAKAKVAGAGAILRPVSSGTQVMLKRAQDVAQAV-VQTRSTLSSWSCIGP 420

Query: 669 RRRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIEL-------PESSTTAMQ 728
           RRRA++S     +  S     P         RS+   + E + +        E S+++  
Sbjct: 421 RRRAISS-----QLNSKVTDMPEASAIMAERRSTEALLAETVAIDRKGHKRTEHSSSSSS 480

Query: 729 WTNEIECSYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVA 788
            ++  E    EE  P    D++  +  ++     +E +TE ELW +L+ EL  R E +  
Sbjct: 481 ESDRDEPDEEEEEEPLISIDQVIAETSSI-----EEDVTEGELWDELDREL-TRQENERD 540

Query: 789 KEIREEEAAAMAEV--------GQSDSSTS-----------GIKEAHRFFPAGKIMHIID 833
            E  EEEAAA  E+        G  DSST             + E  RF+P GKIMHI+ 
Sbjct: 541 SEAMEEEAAAAKEITEEETVITGGGDSSTGQNQSPVSASSMDLIENQRFYPPGKIMHIVS 600

BLAST of HG10002437 vs. TAIR 10
Match: AT4G16070.2 (Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 )

HSP 1 Score: 497.7 bits (1280), Expect = 1.9e-140
Identity = 312/670 (46.57%), Postives = 394/670 (58.81%), Query Frame = 0

Query: 249 MATATMATAAGAAALLYYTLNRKLHS-SGDDDDGDVDGNDAPSHALLGGDRVSHRLIQAP 308
           MA   M TA GA  +LY    R + + +G+DD G   G    S    G  R+  R  QAP
Sbjct: 1   MAAGVMVTATGAVVILYLLSRRIVWARNGEDDSGGELGKSGRS----GRRRIVRRPAQAP 60

Query: 309 ATWLETISTLSETLRFTYSETLGKWPIGDLAFGINFLLKRQGNLHVGSVFGNEDSIQLKG 368
           ATWLETISTLSETLRFTYSETLGKWPI DLAFGIN+L++RQGN    SV+   + I+LKG
Sbjct: 61  ATWLETISTLSETLRFTYSETLGKWPIADLAFGINYLMRRQGNFPTASVYAGSNCIELKG 120

Query: 369 TEMITELKYLLHLLTLCWHFSKKPFPLFLEETGFSKENILLQEPKAGILKPAFTIIVDHN 428
            E+I +L  LL  LTLC  FSKKPF +FLE  G++ E++LLQ+PKAGI++PAFTII D N
Sbjct: 121 PEIIMDLTELLRFLTLCMLFSKKPFAVFLESAGYTHEDVLLQKPKAGIMQPAFTIIRDTN 180

Query: 429 TKCILLLIRGTHSIKDTLTAATGAVVPFHHSVVHEGGVSNLVLGYAHCGMVAAARWIAKL 488
           +KCILLLIRGTHSIKDTLTAATGAVVPFHHSV+H+GG+SNLVLGYAHCGMVAAARWIAKL
Sbjct: 181 SKCILLLIRGTHSIKDTLTAATGAVVPFHHSVLHDGGLSNLVLGYAHCGMVAAARWIAKL 240

Query: 489 STPCLLKALGQYSGYNIKVVGHSLGGGTAALLTYILREQKELSITSCVTFAPAACMTWEL 548
           S PCLLKAL +   + +++VGHSLGGGTA+LLTYILREQKE +  +C TFAP        
Sbjct: 241 SVPCLLKALDENPSFKVQIVGHSLGGGTASLLTYILREQKEFASATCFTFAP-------- 300

Query: 549 AESGNEFITSVINGADLVPTFSAASVDDLRGE---------------------------- 608
           AESG  FIT++ING+DLVPTFSA+SVDDLR E                            
Sbjct: 301 AESGKHFITTIINGSDLVPTFSASSVDDLRSEVTSSSWSNDLRDQVEHTRVLSVVYRSAT 360

Query: 609 -------------------------------VVMKRAQSMAHAAWTRPSLRLSSWSCIGP 668
                                          V++KRAQ +A A   +    LSSWSCIGP
Sbjct: 361 AIGSRLPSIASAKAKVAGAGAILRPVSSGTQVMLKRAQDVAQAV-VQTRSTLSSWSCIGP 420

Query: 669 RRRAMTSHSVAEEDGSSPKSSPRKMESCEPLRSSPEEIVEAIEL-------PESSTTAMQ 728
           RRRA++S     +  S     P         RS+   + E + +        E S+++  
Sbjct: 421 RRRAISS-----QLNSKVTDMPEASAIMAERRSTEALLAETVAIDRKGHKRTEHSSSSSS 480

Query: 729 WTNEIECSYSEEINPEGMTDELDDDGQALMGHIQDEQMTEVELWQQLEHELYDRGEPDVA 788
            ++  E    EE  P    D++  +  ++     +E +TE ELW +L+ EL  R E +  
Sbjct: 481 ESDRDEPDEEEEEEPLISIDQVIAETSSI-----EEDVTEGELWDELDREL-TRQENERD 540

Query: 789 KEIREEEAAAMAEV--------GQSDSSTS-----------GIKEAHRFFPAGKIMHIID 833
            E  EEEAAA  E+        G  DSST             + E  RF+P GKIMHI+ 
Sbjct: 541 SEAMEEEAAAAKEITEEETVITGGGDSSTGQNQSPVSASSMDLIENQRFYPPGKIMHIVS 600

BLAST of HG10002437 vs. TAIR 10
Match: AT2G05840.1 (20S proteasome subunit PAA2 )

HSP 1 Score: 397.5 bits (1020), Expect = 2.7e-110
Identity = 193/216 (89.35%), Postives = 208/216 (96.30%), Query Frame = 0

Query: 1   MSRGSGGGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVQGKNSVCVVTQKKVPDKL 60
           MSRGSG GYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGV+GK+SVCVVTQKKVPDKL
Sbjct: 1   MSRGSGAGYDRHITIFSPEGRLFQVEYAFKAVKAAGITSIGVRGKDSVCVVTQKKVPDKL 60

Query: 61  LDQSSVSHLFPITKYLGLLATGMTADARTLVQQARNEAAEFRFRYGYEMPVDLLAKWIAD 120
           LDQSSVSHLFP+TKYLGLLATGMTAD+R+LVQQARNEAAEFRF+YGYEMP D+LAKWIAD
Sbjct: 61  LDQSSVSHLFPVTKYLGLLATGMTADSRSLVQQARNEAAEFRFQYGYEMPADILAKWIAD 120

Query: 121 KSQIYTQHAYMRSLGVVAMVLGIDDEFGPRLYKCDPAGHFFGHKATSAGLKEQESINFLE 180
           KSQ+YTQHAYMR LGVVAMVLGID+E GP LYKCDPAGHF+GHKATSAG+KEQE++NFLE
Sbjct: 121 KSQVYTQHAYMRPLGVVAMVLGIDEERGPLLYKCDPAGHFYGHKATSAGMKEQEAVNFLE 180

Query: 181 KKMKNDPEFTYEETVQTAISALQSVLQEDFKANEIE 217
           KKMK +P FTY+ETVQTAISALQSVLQEDFKA EIE
Sbjct: 181 KKMKENPAFTYDETVQTAISALQSVLQEDFKATEIE 216

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6581770.10.0e+0083.64Proteasome subunit alpha type-6, partial [Cucurbita argyrosperma subsp. sororia][more]
KAA8540233.10.0e+0067.68hypothetical protein F0562_024204 [Nyssa sinensis][more]
XP_038901142.10.0e+0087.92uncharacterized protein LOC120088127 isoform X1 [Benincasa hispida] >XP_03890114... [more]
XP_008460311.10.0e+0086.70PREDICTED: uncharacterized protein LOC103499170 [Cucumis melo][more]
XP_004144431.16.5e-31085.84uncharacterized protein LOC101203983 [Cucumis sativus] >KGN58447.1 hypothetical ... [more]
Match NameE-valueIdentityDescription
O485511.4e-11191.20Proteasome subunit alpha type-6 OS=Glycine max OX=3847 GN=PAA1 PE=2 SV=2[more]
Q9XG771.4e-11191.67Proteasome subunit alpha type-6 OS=Nicotiana tabacum OX=4097 GN=PAA1 PE=2 SV=1[more]
O811473.7e-10989.35Proteasome subunit alpha type-6-B OS=Arabidopsis thaliana OX=3702 GN=PAA2 PE=1 S... [more]
O811466.0e-10786.57Proteasome subunit alpha type-6-A OS=Arabidopsis thaliana OX=3702 GN=PAA1 PE=1 S... [more]
Q9LSU34.7e-10484.72Proteasome subunit alpha type-6 OS=Oryza sativa subsp. japonica OX=39947 GN=PAA1... [more]
Match NameE-valueIdentityDescription
A0A5J5BAP40.0e+0067.68PROTEASOME_ALPHA_1 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F056... [more]
A0A1S3CC710.0e+0086.70uncharacterized protein LOC103499170 OS=Cucumis melo OX=3656 GN=LOC103499170 PE=... [more]
A0A0A0LCM63.1e-31085.84Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G644860 PE=4 SV=1[more]
A0A5A7UE476.0e-30486.77Mono-/di-acylglycerol lipase isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A6J1GYN53.4e-29983.54uncharacterized protein LOC111458044 OS=Cucurbita moschata OX=3662 GN=LOC1114580... [more]
Match NameE-valueIdentityDescription
AT3G14075.12.2e-19759.76Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 [more]
AT3G14075.22.2e-19759.76Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 [more]
AT4G16070.19.3e-14847.61Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 [more]
AT4G16070.21.9e-14046.57Mono-/di-acylglycerol lipase, N-terminal;Lipase, class 3 [more]
AT2G05840.12.7e-11089.3520S proteasome subunit PAA2 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000426Proteasome alpha-subunit, N-terminal domainSMARTSM00948Proteasome_A_N_2coord: 9..31
e-value: 1.1E-11
score: 54.8
IPR000426Proteasome alpha-subunit, N-terminal domainPFAMPF10584Proteasome_A_Ncoord: 9..31
e-value: 1.3E-14
score: 53.5
IPR000426Proteasome alpha-subunit, N-terminal domainPROSITEPS00388PROTEASOME_ALPHA_1coord: 9..31
IPR001353Proteasome, subunit alpha/betaPFAMPF00227Proteasomecoord: 36..216
e-value: 4.2E-49
score: 166.6
IPR029058Alpha/Beta hydrolase foldGENE3D3.40.50.1820alpha/beta hydrolasecoord: 397..610
e-value: 2.2E-30
score: 107.7
IPR029058Alpha/Beta hydrolase foldSUPERFAMILY53474alpha/beta-Hydrolasescoord: 387..570
IPR002921Fungal lipase-like domainPFAMPF01764Lipase_3coord: 433..570
e-value: 4.2E-23
score: 81.8
IPR029055Nucleophile aminohydrolases, N-terminalGENE3D3.60.20.10Glutamine Phosphoribosylpyrophosphate, subunit 1, domain 1coord: 1..223
e-value: 1.7E-87
score: 294.8
IPR029055Nucleophile aminohydrolases, N-terminalSUPERFAMILY56235N-terminal nucleophile aminohydrolases (Ntn hydrolases)coord: 8..217
IPR005592Mono-/di-acylglycerol lipase, N-terminalPFAMPF03893Lipase3_Ncoord: 298..378
e-value: 3.0E-19
score: 68.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 625..641
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 611..641
NoneNo IPR availablePANTHERPTHR46023LIPASE CLASS 3 PROTEIN-LIKEcoord: 578..834
NoneNo IPR availablePANTHERPTHR46023:SF6LIPASE CLASS 3 FAMILY PROTEINcoord: 578..834
NoneNo IPR availablePANTHERPTHR46023:SF6LIPASE CLASS 3 FAMILY PROTEINcoord: 249..580
NoneNo IPR availablePANTHERPTHR46023LIPASE CLASS 3 PROTEIN-LIKEcoord: 249..580
NoneNo IPR availableCDDcd00519Lipase_3coord: 424..569
e-value: 4.257E-36
score: 134.526
IPR023332Proteasome alpha-type subunitPROSITEPS51475PROTEASOME_ALPHA_2coord: 24..258
score: 67.602875
IPR034642Proteasome subunit alpha6CDDcd03754proteasome_alpha_type_6coord: 8..216
e-value: 4.17962E-138
score: 406.62

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10002437.1HG10002437.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016042 lipid catabolic process
biological_process GO:0006511 ubiquitin-dependent protein catabolic process
biological_process GO:0006629 lipid metabolic process
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
cellular_component GO:0019773 proteasome core complex, alpha-subunit complex
cellular_component GO:0005839 proteasome core complex