HG10007350 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10007350
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncalpain-type cysteine protease family
LocationChr10: 4026578 .. 4054233 (+)
RNA-Seq ExpressionHG10007350
SyntenyHG10007350
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGGGGACGAGCATAAGGTGGTATTAGCGTGTGTGATATCAGGGTCGCTCTTCTCGGTGCTGGGCTCTGCCTCGTTTTTCATACTCTGGGCTGTGAACTGGCGCCCATGGCGGATTTATAGGTAAATTTTTTGTTTGTTGTGGATATACTGGTCTCTTCTATTTTGAATTTTCCTTGGATGGCACCAGAATTTTATGTAATGCCGTATTAGTTTAGTGGTTCCAGGTTGTAATCAAGAATGTTGTGGTAAGGTAGTGGATGTTAGTGTGGGACATCGGTATTTGATAATGCTTTATACTTCGTACATCATAAAAATTTACATATTCCAGCGTCACCGTTCAATGTGCGGGTCAATTAACTGAATGCTGGTTGTCCTGGGATGGAAAGATTTATAGTAGGGCAAGGAGTCTGGCTCAAACATTTCTTTATGCTCTTACATAGGTTACGGTTGAATTATAGTTCTTATAATTTGAAAGCCCTAAATGTTAAAACTTTTAGGTAAATTCTCTGGTTTTTGAGCAGTTGAATGAAGATAAACGGTAGGCTAATAATGTCTGTTGTTTTGTTTCATCCGTTATAATCAAACATTTTCCCTGTTTTTAAAAAAAGAAATAAAGATCAAACATCCTCCCTGTTTTGAAATGTCTGTTGAAAATTGTTATATCCATTTTTCTAAGCCTGTTTTTTATTGTGTTATGCATGCATGTGTCTAGTGGAAACTGATATGCTCATGTCTTTATGGTGTAGATGAATGTATTTTTTGTCTTTCTTTTGTGCATAGTATATTTCTCAGGTTTCTTATATATATACACGTATATATATACATATTATATTTTCATCAATTTAATGCAAGCACAATTATTTTTCAGTTGGATCTTTGCTAGAAAGTGGCCAAATATCTTGCAAGGACCCCAGCTGGATTTGCTTTGTGGTTTTCTCTCTTTATCTGCATGGATATTAGTTATTTCTCCAATTGCGGTGCTGATCATATGGGGGTGCTGGCTGATTGTAATATTGGGTCGAGACATAACTGGGCTTGCTGTGGTGATGGCTGGCACGGCTCTTCTACTTGCATTTTATTCAATTATGCTATGGTGGAGAACACAATGGCAAAGCTCAAGTATGTAGCTGTGATTGTTCTAATTTTGAAAAGATTATTACTACTCATCCTTCTATCTGGTGAATAAAATTCTTATTAATTTTTTTTTAAATGATATTTGACTTTCTTGAATGGTTAGTGGTAAGATTTTTGTATTCAGCTAGGGAGAGGAGAGAGATTTTACGAGGAGCAAGTAGTCTGGTTGTTGGTTGTTTAACAATTAGCATACTACTAGTTTATTTTAGCGCCGTCACAACTGTCTAGGCTTAGAAGCCTCTTGAAAACCTTACTGTTTTTTGTTTTCTTAGAAGTTGGACAACAAAGTATGGATAGGGTCTTAGTTATTTAATGATCTCTATATCCGTATGCATATGCATGTTTGGATGATTTTAGAGTTTGAAGTGTCTAAAGCCTTTTCCTCAAATGTGTGAAAAAGTTCCCCAATTGACAACTCTATTCATTTGTCCTCTGTGGTAAAATAATAGTATACTTAAGTAGGTGTGGGCATACATTGTCAATCAAGCAGCACAAAGACCAAGGCATGCCCCACAAGCTATAAGAGCCATATGAACTTATCAACCAACGATTTCCTTCATTTCACCTTGAGGAAAAGTTGAAAGTTGGTTATGATGGTATTCGTAGAGTTAGAAATTCGGGTAGGATTGTTGTTTAGCTATCTTGTGAGGTGTGTGGATCGAGAGAAACGGTAGAATTTTTAGAGGGATGGAGTGGCCTTTGGAGGAGGTGTGGGAGCCGGAGGGGGTGATGTTCAGTGACTTCTTATGGGCATCCGTTTGTTGGACTTTTTGTAATTATCAACTTAGTATCATTCTTTTGGATCGGAGCCCTTCTTGGTTGATGGCTTTTAATGGGCTTGATTTTATATGTCCTTGTATATTCTTTCATTATTTTCAATGAAAGCTCGAATTCTTACTAAAAAAGAAAAAAGAACCATGTTCAATCATGCAAGATGCATGTGCGAGATGAATTGCATTAGTGTAATATTGTATAGGTAGGGTAAGTGGGGATGAGGCTAGGGTAGTAAGCAAATATAGCTGATGATGTAAGTGTGAGACAAATTGCATGTGTCATGCACTTTTGTATCAATTTGGGTATGCAATCCCTATGGAAAAGAAAGTATTCCTATTGGAGGAAATTTCCAACCCTCTTATTGATGATTCATTTCACCCTTTCATATAGCAAAAAAGGTATTCAATCTTATGGATGACTTTTAGTAGTTCTTCCATCCCCACCTGTTCCACCTATCCCTTCACTCCTTTGGGATTCCGTATAGCTAGACAACCCAAAAGGAGTTTTTAGCTCTTTTGCATTTGTGAACTTGATCTTCATTTAGCTCTATACTTGTGCAAACTTTTACCATCCATCGCAAATTATAGCTTGGACATTCTATTACCCGTCTGGAAAGCTTTTCTGTAATTCCTTTGACATGGTGTCCTCCCTTTTTATATTATTTTGTTTCATTGATGAAATATTTGTTTCCTCTAGGGAAAAAAAGGAATTTTAGCCTTCCTAAGCTTGGTGAAGAAATGATAGTTGGAGCAAAAAGTTGCCCAATCAAACAGGCAAAAGAAGATTTAGGTAAGAAACAACGTGAAGGTTTTAAGGGTCCATGCATGTTTACCTTGAAAGGTAAGTGTCATGGGCACCATTGCATGTAAATCCAAAGTTTCCCTATCAAACTCTAGCATAAAAATTATGGGCAAGGAAAATGGCGAGAATCATTTTCGAGCAAAGCAGAGTTAATATAGTTTAATGATCTTCCTAAAAGCGAATAGGTAACCATCCTTGGAAAATGCGTTGTGTATTTGGGTCCCTGACCGTTCTAGACGGTTCTTGTGCTACCCCTTCTTTCATATTTTGTCCGTCCCTTCCCTTCTCCTTCTTGTGTAACCTCTACTTTCTCCTTTTTTTTGGTGGGTTGAAATTCCAAAGAAGGTTAAGGTTTTTGCATGGTAGGATTTACATGGAGAGTGAATATCTTGGACCATATTTAAAGGCATTCTTCCTTCCTCCTCTCTCCGGCCTTTCATGAGAAGGGTCGGATTTTATGGCTGGTTGGGGGGTGTGCTTTTTTGTGGGATTTTTGAGGTGAGTGGAATAATTGTTTGTGGAGGGTTGGAAAGGGGTCCAAGTGATGTTTAATTCTTGCTTCAATCTTCTCCTTAATAAAATGCTTATCAACTTCGATATGTTTGTTCTATCATGTAGAACTAGATTATGAGCAATACAGATGGAAACTTTATCGTCACAATAAGTTTGTATAAGTGTAGTCAAAGAAATCTTTAGTTCTTCAAGAATCCTTTTGATCCATATGCCTTCACAAATCCCATGGGCTAAAGCTCTGAACTCTGCTTCAGCACTACTTCTAGTTGCCACATTTTTTTTTTTACTTTTGCAAGTACCAACAAAGGAGCAATAACCAGTGGTAGATCTTCTATTTGTAGCACTTCCTGCCCAGTCTGCATCAATATAAACTTCTATTTGGAAGTGATTATGCCTTTTGAATGGAATACCTTTCCAAGGAGACCATTTCAAATACCTCAAAATTTTGTAGGTGACTTCAAAATAAGTTTGGTTCAGGAGCATGCATAAACTGACTAACCATGTTCACGGCATATGCAATGTCAGGATGTGTATGAGACAGATAAGTTAATCTTCCAACATGTCTTTGATATCATTTCTTGCCCTCTATTTCTTCAGCTTTTGCTACTTGTAGCTTTAGATCAGGTTCAATAGGGGTTTATGCGATCTTGCAACCATGTAAACTTGTTTCTTTAAGTAAGTCAAGGACAATACTTCCTTTGGTTAACATAGATACCCTTCTTTGATCTTGCGAATTCCATTTCTAGGAAATATTTTAACGTTCCTAGAACCTTGATTTTCACAACTCACATGCAAGACTTCTCTTAAGATCAGCTAAACCTACTTCATCATCACCTGTGAAAATAAATCATCTACATAAATAATCAAAATTGATATTTTATTATTCCCCGAGTGTTTATAAAATATAGTATGATCTGCTTGATTATGAAGAAATCCATAGCTGGACACAACTTTTCCAAAACGTTCAAACCAAAGCTTTAGGGGATTTTATAAGGTCATAAAGAAAATTCTTTAATCGGCAAATTTTACTAATGTCGAGTTCCTTTTCAAAACCTGGTGGTAAGCTCTTAAATACCTCGAGATCACCATTCAGAAAGGCGTTTTTTAAATGTCAAGTTGGTGAAGTGACCAGTCTGAATTAACTGCAAGAGACAAAAGAATTCTGATAGAGTTTTTTTTAGCAATAGAGGCAAATGTTCTTGATAATCAATTTTGTATGTCTGAGTGAATCCTTTAGCTACCAGTCTGGCCTTCTATCTCTCAATACTACCATTTGCATTACATTTTATGGTAAACACCCATTTACAACCCGCTGGTTTCTTATTTTTAGGCATATCTACTATTTCCCATGTTTCATTTTTGTTTTAGAGCATTCATCTCCTGCACAACTGCTAATTTCCAATTTGGATCATTTGAAGTCTCATGTATATTCTTTGGAACAAACTAGTCATCTATCCTAGATGTAAAAACTCCGACTATTTGGCAATTTTTGATATGACATGAAGTTTGCCCTAGGATATTTGGTACAACTTCAAATACATTTTCTAATGGCTATGGGAACATCGAGATCAAACACGTCTAGCAACTTTGAATTGTTGAAGGAGAAGGAGTAGGAAGGAGGATTACCAATTTCATTCACTAGAGTCTTAGACGGGTTTCATGAAGGATCAACTATTTGGTCTTGGTTCCTATGATTGAAATTTCTTCTAGTATAAAATTTCTACTCAGGAACTTGGTCAATATGACCATTTTGTTGTATTTCTCCTCCTGACTGAGAAATTTCTACACTTGGCATCGTAGAACTAATGGCATCTGGACAAATAAGATTGGAAAGAGAAATACAAACATCCCAAAAAATTATCTTCCAAATTTGGTTTCTTCCCTTGAAGAAAGTTTTCAGTAAAATAGGATTTAAGATTGAGGATCAAAACATTTTAAACTTTTTTACTTGCAAAGTAACCCAAAAATATGCATTTAACAGTCCTATAATTAAGTTTGGACCGAAAGTGACTAGGTATGTGAACATAAGTAGTACACCCAAAGACTTTGAGAGGTAAGTTAGAATATATGCAGGAGGTTGGAATTTTTTTTTTTTAAGGTAGTCAATATGACTTGTAAAATTCAAGGGTTTACTAGGCATTCTGTTTATTTAATAGGTTGCTGTTAGAATTGGTTCCCCCATAGATATTTTGGAACATGCATGGAAAACGTTATAGCCCATGCCACTTCAAGTAAATGCCTATTTTTTCTTTCAGCAATCCCATTCTATTAAGGAGTACCCGACATGTAGTCTGATGAATAATACCTTTGTCTTTCAAAAATTCACCAAAATATTCATTAAAGTACTCAATTCCATTATCAGAGTGTAAAATGCCAAATTTGGTTTGAAATTGGGTTCCAATCATATTGTAAAAATGTTTAGAAATGTCTTCCACCTCATATATTTTTGGTGTTTTTTTTTTTTTTTTTTTTTTGGCTAAAATGCAACTTTTACAATCAAAAGAAGGACAATCAATTCCCTTGAATAGATTTGGAAATAGATATTTGAGATAAGAAAAACTTGGATGCCCTAGTCTAAGATGCCAAAGTTTAATATTTTTTTTTGACCGAAAGAGAACTAATATTACTGAGACCATAAGTTATTTTATTGCTAAAAGAACCTTCATCGAAGTAGTAGAGGTCATCAAACATCCTAGCACTCCCAATCATCTTCTCTGATTCTTGTTCTGGAAAAATACAGTGAGATTTAAAGAATATAACACGACAATTAGAATCTCTAGAGAATTTAGAAACAAATAAAAGATTGCAGGCTAATTTTGGGACATGTAGAACATACTGTAAGAGATATTTTCAATCAGTTTAATAGTTCCGTTTCCTGCTATAGAAGAGAAACTTTCGTTTGTAATTTTGAGTTTCTCATTGCAATATGATGGAGAATATGAATCAAAGAAAGAAGAAGAACTAGTCATATGATCTGAAGCTTCAGAATCGATAATCTATGGAGAATTATTACCACAAGAGAAGGCCTTAAGATAATAATCTGTTTGTGTCAAAGAAACACAAGACTTATTAGATGATAAATTGGTTGATAATAGTCTTAAGATTTGATGAATTTGCTTTAGTGACATGATAGCAAGTAATGGGAGCCGGAGGGGGCGATGATCAGTATGAATCTTCCAATAGTAGAAATGAATCACTGGTGAGCCTCAAATTGAATAAATAGCCAAATTCAAAGTTGGAAAATTCGAACCATATCAAGTAGACCAAACCTTTTGAACCAAACCATATTGATCGGACCAAACCAAATATCAGTTGAACCAAATCAAGCCAAGAAAAAAAAGGAACCAGTCCACTACTTAAAAATCCCTTTGTGATAGTCACAATACCCACAGCTTAAAAAGCTGAAGATGACAACCTGCTTGCGAACCGGAGGCCTTAGGTGACGACAATGGGTTTGTCAGTGACAGTCTTGAACGACAGGCGACGAATAAATCTATCGGACAAGCGGCGATGGGTCAAACCCACTAGCGGCGGACATACGTGACGGCTATGGGTCGAACCCACTAGCAGCGGACAGACGTGATATCGATGGGTGCCGCAAACTCGTATAGGATGAGGCGGCTCGACGGTGGGCGACCAACTGAAGACTACAAGCGGATCGACGTGGATGGTGAAAAATCTGACGTTGAAACATACGCTTCTTAATGTTGACCACGAACTGAACTGAAGGTAGGAGGCACTGACGCTAGAGGTAGGATGCACCAATTGAGATAGGTGGCTAGGTTTCCAGCGGTAGTGATTGAGATTTTTAGGGTCTTTTCTAGAGATGGCTAGGGTTTTACTGTCTCTGATACCATCTTAAATTTTGAGGGAATAAGATTATACTTTCTTCTTCGATATGTACAATCTTAGATCTTCTTATATAGTAGAAAGACCTCTTACAAATATGGAAATTAATAAAAACAATAAAGGAAACTATTTAACAATTAATATTTATACACACATATAGAAATCCCAACAGTCTCTTCTTTTTCTCAATTCTCAAATGGTGTATCCCACAAGCACCATGGTTGACCTAGGGTCATTGCGGCCAATGAAAAAGGTAAAAGGGCTTACTGACAACGCGTTCAAATCAAGGTGGTCACTGGTCACTGGTCACCTATCTAGGATAGAGTATCCTATAAGTTCTTTTGACAAGAAAATGTAATATCCTGCCGATAAAATACACAAGTCAAGGTGTGTTCAGGCTAGCTTATATACTAACAAATATTAAAAAAGTTAAATTGGTGTGTATCTCATTTCCCATTAGTAAAATGAATTTACTTCTTTAAAAAAACAGTAGAATGAAGTTTTTGCATGAAGTTTGCTTTTGGTAACTTCACTCTACCTTGGTGGTTTTAAATTTCCTGTTTTTAATCCTTAGACTAGAGGAGTCCTTTTTTTTTAAATAAAAAAAAAAAACAAGAAACTGAGCTTTTGTTGAGATAAAGAATAAGTCTTGGAACAATTAGAAAAGTTGTGAAAGTCCATCAATACCTTCGTGGAGTGCTTTACCACATGTTTGCTTTGATTATGTTGAAGTAAATCCTATGTTGTGCTTATGTTGTTGATATCTAATTGTTATTAACAATGCTCTTATTCATGGACAAGTTAATTGGTAAATGGGAAAATGTTTGTAATGATGGAAAGTTCATCAGGCGTTTGAACCTCACTATTGCAATATGTCATACCAGCAAGATGTTGTTTAAAGTGCTTTCTTGCATCATTTGTAGGGCAGTAATCTTTTCTTTAAAAATCTTTCTCGTACATTCATCTTGACATCACCATGTGTTTGTTCCTTGAACAGGGGCTGTTGCTATTCTTCTTCTTTTGGCGGTTGCGCTTCTATGTGCATATGAACTTTGTGCTGTATATGTTACAGCTGGTTCTAGTGCATCTGAGCGTTATTCACCTTCTGGTTTCTTTTTTGGTATATCAGCAATTGCCTTAGCGATCAACATGCTCTTCATTTGTCGGATGGTCTTTAATGGTACTCATCACCACCAAACAATTTCTTCCATTTTTGCACTTTTTTTTTTTTTCTGACAGGAATCAATTTTCTGCATTCTCGTTTATATATTTTTCTCTTGGTTCAGGAAATGGATTAGATGTGGATGAATATGTGCGAAAGGCATATAAGTTTGCATATTCTGATTGTATGGAAGTGGGTCCCTTGGCTTCTTTACCTGAACCACCTGATCCCAATGAATTGTATCCTCGTCAATCTAGCAGGCAAGTTCTCAAAATTCTCACATTATTCTTTCATCTTTATAAAATCAATTAATTATGAAACTTATAATTTTGTGGCCAGTGACCACTATTCTCACTAGTTTTTTGGTGTCCCTATGATGTAACATTGTGGGTGTATTTTTGGGCTCTTTCTTTTTCTATTTAGTATATTCTATTTTCTTCTGGATGCTTGTGTCCTCAATCTTTCATAATTTGATTTCACGTTAAAAGTTATTTTTATCCATTTTTACTATTTTGTTTCTTCTAATAATTTTTGACCCTTTTGATCATCTGCTCAGGATGAACTTAAGTGAACCACGAGGAAAAAAATCCCTCAATTTGGCGTACTGATCCTGTGATGAGAATAGTTTATGTTGTTGTCAATTAACTATTTAATAATATTTTAAATATGGATTTGCTAGTTTGTGATATTATATCCTAATCTTGAAATGTTAACTGTCCAGCGAGCATATCTAATATTGTTATTGATTGAAAACAGGGTTGGAAGACTTCTCTTTTTTTTTTTAGAGAAGAAACATTTCATTGATAGAATGAAATAAGGGGAAAAAACCCCAAAGCACCAAAGGTGTTACATGAATGATTTCCAATTGGAAACTAAAAATGAAAGACTGTAGTTCTTAAAAGGGTGCCTATTTTTGCACCAAAACATGGCGGTAGATAACACTGAATCCAGAAAAAGATCAAAACTTGATGAGAGGTCTTTAAAAAGACGCTTGTTGCGTTCACCCCATAAAGTCCAGAAAAAAGCGCGCAAAATGGCCAGCCAAAGTAATCTTTTGTCCAGGGTTGGAAGACTTTTCATTAGAAACAAACAAGTATCAGGAAGGTTTGAGAGAACCATGTAGTGCAATGTTGTCTTTCTCTTCAGGAGATTAAGGCCTTAGTTGTTAAATGCTAAGTTTAAGTGGCCAAGACCCTCATTTGTATACTTCTGCTTATTGGCAACATCACCTAAACTCTGGTCATCAATTTGGCCATGTTAATACTCAAGCTGAAAGCATATTTGACAGGTTGATGATGTTTTGAGGATTAGCTTTGGGTTTTTTATCCATCAATGAACAAATATTCGATGTTATCAATGAATTGGGGGTGATTAAGATGTATTTTTCCCATACCAATGAGAAAACCTTCCACCAACTAGACCTCAACTTCTTTAGACATTGTCATCCCAAACACAATCTGCTACAGAGAGAGAGAGAAGGGGAAAGTGATAGGAATCTGGTAGAACTTTATGATTCAATTCAAGTAAATGCCCTAGGGCAAGAACAGACGCTTCTTCCTCCCACTCTCGAGAGAAAAACTCAACACTGACATGAAAAACGGAATACCCCCCTCACTTCCTCTTTTGGGTTACATACTCTTCTCTCCTCCTAACTAACTGTTGTCCCCATCTCACTCTTATTGGACATATCTCCCTTCCCACTAACCAACGCTTTTCCCCTTTTACCCCTACGCTCATATTGAAAGATAATCGGTGGCCTAATAGAAAGATGTTGACAAGGTTCTCTAGCGGTGAGATTCCTCTTGGTTTCCCGTTGAGAATGTTGGTGTAATTTGAATAATGGACTCCTTGCTGACCTTTTTGATATGTAATTTGGAAAAGAGGCCTTGCTTTCTCTTCCTTCTCTGCTCATACACTACTTTATTAGTCACATTTGCAAACAACAAGACCTGATTGGGAAGAGGAGTTAATATGTGCTAGAACTCTGGCTAAACAACCATTCATCTTTGGGTTGTTTAGAATTTACGTATCTTTAGGGGAAAAAAAAGAATTTACATATCATTTTTTTTTTGATAGAAAACATAACTTTTCATAGCTAAAATGAAAAGAGACTAATGCTCAAATTACAAGGAGACAAAGCGAAGAGTCAGATTCTTAAAAAGTACAAACCCAAGAGAATCAAACAATACAATGAAGCATAGGCAGAAAAAATAAAGGATCTTCAACTGAACAAATTTGGAGAGAAAATACACTTCAATCATAGAAGAAGAACATCAATAAGAAATGATAAACTTGAAACTTCAAAACGAACGAGCCAATGGAGCACCAAAAACCCAATGCAGATGCAAAATTGACTGAAGCTTCAAGCACAAGAAATGATTATTGGGCCGATGGAAGAAGAAATATTCTTTAAGACTAAATTGCTGACGAAGGCCCATGAAAGCTTCTGATCCATAAAACCAACAGCTGACAAATGAAAAAAAGGGGGAAATAAAAAATTTCCAGAAGGCACCATTTACCAGCAACTTGAAATAGGAAGAACCCCCTTGTTAGTCATTATTATTGTCAAGTATTATAAATTGCAGTCTGGTTGTGTTTTAAGTCTTCTAAATCTCCCTAGAGATCTGTCTGTTCTACTTTGGACCTAGTGGCTTGTCATGCTCATCGTTGCTTAGGTAATGGTAGGGCCATCTCTTTTTGGCACGATTCTTGGTTGAGTTGTGGAGTTATTACTACTGTTTTCCCTCGTCTTTTTCATCTAACTACTCGGCCAACTAGTATCGTGGCTGAAGTTTGGAATTCTGCTCATGCTGCTTGGGTTTGAGTTTTTACCGTAACCTGAAGGATTTGGAAATCATTGAATGGGGATAATCTTTCTCATCTTTATTCCTTGATTGGTCTAAGGAACCTAGATGATTATTGGTCTTGGCGCTAGATCCTTCCAATAACTTCTCAGTTAAATCTCTTATAAAGGCTCTTGTTTGTACACCTACACCAAATCCTTACAATCTTTACTCAGTGATTTGGAATGATACTTATCCAAAGAAGATTAAGATTTTCTTGTGAGAGCTTTGTCTTGGTGCAAAAAACACACTGATCGTCTACGGAGAAGAATGCCTTATTTTCATCTTTCTCCTTCTTGTTATATTATGTGTTTGAAATTCTGAGTCTCTTGGACACTTATTTGCTCATTGCTTCCTTGCATCATATTTTTGGACGACTGTTCTAGAGGCTTTTGGATGGTCTTTGGCCTTACCAAATAACATTTATGATCTTCTATCTTCTGTTTTTGTGGGTCACCCTTTCAACAGTGCAAAAAATGATTATGGCTTGCGATCAACTGAGTCTGTTTTTTGTCTCCCCTTTTGATTATTTTATGGATTCCATTCTTTTTCATGCATTGTATTGCTGTAAACACCCTTTTACTGACTGTAGCCTCTACTTCAATTGCTAATTGACTAACTTTCATGTAATCCACCTTTAGGTGTTTGACCTCTTTATTTCATTTTATCAATGAAATTTGTTTCCCCCCTCACAAAAATTATTATTGTCAGCTGATAGTATAATTGTTCATTTTTTCAGGGCTTCACATCTTGGGCTTCTTTATGTTGGTTCGGTTTTAGTACTTGTTGCCTACTCTATTTTATATGGCCTTACGGCTAAGGAGGCACGCTGGCTTGGTGCCACTACTTCTGCTGCTGTTATCATTCTTGGTATGTTCTAATAACTGCTTCACCTATATTTAACCTCATTTACTCTTGTATTTTCTGACGTTGTAATTCTTTTGGGTACTTGTTTTGTAGATTGGAATGTAGGGGCATGCTTGTACGGGTTCCAGCTGTTAAAAAGTGGCGTTTTAGCACTTTTTGTGGCTGGCATGTCTCGTGTTTTTCTCATTTGTTTTGGAGTTCATTATTGGTTTGTTTCTACCAACCTCATTTGGTTTTCTTATTACTAAACCCGATAACTGTTATCATGGTTCTCATTTTCCTTTTGCATCAGGTATTTGGGACATTGTATAAGTTATGCAGTTGTAGCTTCTGTACTATTGGGTGCTGCTGTGATGCGTCATCTTTCAGCAACAGATCCGTTTGCTGCTAGGAGGGATGCCTTGCAAAGCACCGTGATTCGATTGAGAGAGGGGTTTCGCAGGAAAGAGCCAAATAGTTCATCAAGCTCATCAGATGGTTGTGGTTCAAGTATGAAACGTAGTAGCAGTGTTGAGGCAGGTCACCTTGGTAATGCTGTTGAATCTACTAGCAAGAGTGGGCCAGCAGCACAATGTACAGTTGACGGTAATAATTGGAATGGCGTCCTATGCCGAGCTAGTAGTTCACAAGAGGGGATTAACAGTGATAAGAGCATGGATAGTGGACGTCCAAGTTTGGCTTTGCGGAGTAGTTCTTGTCGTTCTATCATCCAAGAGCCTGATGCAGCAATGTCGTTCGTGGATAAAATTTTTGATCATAATAGTTCCTTGGTGGTTTGTTCTAGTAGTGGGCTTGACAGCCAAGGCTGTGAATCTAGCACGTCAACTTCTGCAAACCAACAAACATTGGATTTGAACTTGGCTTTGGCGTTGCAGGAAAGGTTGAGTGACCCAAGGATTACATCCATGCTGAAGAGAAGCTCTAGACAAGGGGATCGTGAATTGGCTAGTTTACTGCAAAATAAAGGACTGGATCCTAATTTTGCAATGATGTTAAAGGAGAAGAGCCTGGACCCAACTATCCTTGCATTGCTTCAAAGGAGTAGTTTGGATGCAGACAGAGAACACCGAGATAATACTGATATTACCATCATTGATTCAAACAGTGTGGACAACATGCTGCCCAATCAAATTTCTCTATCTGAAGAACTGAGACTTCATGGGCTTGAAAAGTGGCTTCAGTTCTCCAGGCTTGTACTACACAATGTAGCCGGGACCCCAGAACGGGCATGGGTGATCTTTAGTCTTGTCTTCATAATTGAAACAATCATTGTAGCCATATTCCGTCCAAAAACTATTGACATTATAAATGCGAAACACCAACAGGTGAATAGAGCTTTTTTTTACATATACGGAGTACAATGTCATTATTGGATTTTGGAATTACCAAATTTGTGAAATGTAAAGGTAACCTCTGTGTTTTTTCAAACTAAGATTTTTTCTTGTGGTTTCAGTTTGAATTTGGCTTTGCTGTCTTACTACTATCTCCTGTGGTCTGTTCAATTATGGCTTTCCTCCAGTCACTGCAAGCAGAAGAAATGTCAATGACCTCAAAACCTCGGAAGGTATGCATTCCATTGTGAGATTAAATATATGCAGATAAGAAAGTTGTTTTGATTTACATCAATGAAATATTATTTTGGTAGTTACATTGATTTTTAGTAGCCCCTAGCATTTCTTTTCTTATAGGTGAAACAACTCTTTTCTTGTAATGTGGAGGTTCTGTCTAGAGCTTTATTTTCTAATTTCTACTGTGGATATTGAAGGGTTTGTTGGGTAGGCAATTTGAAAACAAAGAATTTAATGAAAACATAGTTGTATTTCATGTTTTTAGATGTTTTTATGTTGGACCCAAAGTTAAAGTCCATATTTTTGTGTTGAACCCAAATTTAAAGCCCAAATCAATTATAAAAGGTTGAACCATTTTCTCTTAAAAAATTTTCCAAGAGTGTCTAGGAAGTGTCCCAAAATGTCCCCGTGTGTCTGGAATTAAAAATAAATAAACAAAAATAGGACACAAAAATTTGCGTGTTGGACACATGTCCGGAGCGTGTCCTAATGTGTCTGACACAGATACTCCACCTAAAATGGTGTGTCCGTACTTCATAGGATGGAACTAAATAAGGGAGCAAACCTATGTGAATTCTGTCTGTTATGCAAACATACATACTGGCCTAAATCCCAGGTGAATCTAAGCCAAGTAAGAAGTGATGTAGTCTGTCCTTCACATATATGGGACTTTGGAAGTATATGGCTTTAGCTTATGTGGGCGAGCATCATTATTTTCTCAGCATGATGAAGGGTGAATAGTGAAATGCATTGTTCAAACCTTTTCTGGCCATAAGTGTCAAGAAATCTCGCATTAAACTATAAGATTGGGCTTTGATAGTAATCCCCTCGAGGCGCAGTCCAGTTGGTTGAGGACTTGAAAGTTTGGTATGTTAACGTGGAGGTATTTGGTTTTGAGGAGCTCCTAAGTTGTTTAGGGTAGGAGGGATCTCATTTTTGGTCTCCTCCTCCTCCTCCTGGACTTCTCTTCTTTCTTCTGAGGATTGCTTTTCTTCCCCTGGCCTCTCTTAATTTTCTTCACTTGGGAAGATTAAGGTGCAAGGAAGGTCAAGTTCTTTGTCTAGCTCGTCCTGTGTGGAAGTGCGGACATCTTATAATGCATGCAAAACCATTCTCCTTTCTTTTTGCGGCTACAGTGGTGTTCCGTTTGTAGAGGTGTGTCAAAGAATCCAGACCATATCCCCTGGGGTTGCAGTTCTGTGCAGTCGATATGGTCTTGATTTTAGGACACTTTGGGAGTTCCGTTGGCTTGGAGTTGGAGCAGAGATTTTAGGGCTTGGATGGAGGAGGTTTTGATGTCTTGTTGTCTCCCCCTATCTCATTTCTCTGGCAGGTGTGCTTTTTTTGCCCTTCTTTGAGGCTTTGACTTGTGAAGAATGAAAGGCTTTTAAGAGGCACTGCAAGATTTGAGGTAGAGTTTTGGGAGGTTGATAGGTTCATACCTCACTTTGGACCTCGGTGAATAAGGCTTTTTGCAACTATCAGCTATGTCTTATCCTTTAAGATGGGAGTCCTTTCCTGTAGCTTTTTGAACTCTTTCTTGTTGGGTAGTTATTTGTTTTTGTATGTCTTTGTACATTCTTTAATTATTTTTAATTAAAGCTCGATTCTTATTAAGAAAATCTTTTAACTGATACTATTTTGCGTGTATAGTTCTATAATTTGGTTCACATTCATCAGGTTTATAATGATATTATTTATTTTGGTCTATGATTTGATTCACATGCATTAAGTTTCTAACGATATTATTTACTTTGGTTTATGTCTCAATATGATCTTTTTCTGCAGTATGGGTTCATCGCTTGGCTACTCAGCACTTCTGTGGGGCTTTTACTTTCTTTCTTGAGGTTAGCAATATCATCAAGCACCAAGTCTTTTTTTTTTTGTCTTTTTGAAAAGCTGTAGAGCTGTAGTGGCCTGGGAAAGTTGGAAACTGGACCTTTTTATTTATTTATTTATACTTCCATATTGATATTCATACTCTGCTGTTACATTGAGGTCCATGTCCATTTTTCTAAGATTTGAATTCTTTCTTTTGAGAGTTTTATAGTTCCTCTGTTTTATTATTATTTCATCTGCACAGTTTCCAATACCTTTCCAGTTTTCATGCTTTTTTTAAAAAACTTTTTTACCTGGAGCCTTTTTTCATCTTCCGTCACTGTGTCATTTTTTGTAGCGAAAGTTTTTGATTTCCTTCCCTATTTTCAGTTTTATTTTGATCCATTGATACTTTTTTTTCCTTGGTAAATGATTGTTAACTATTTGTTTTGCAGCAAGTCATCGGTCCTTTTAGGACTGTCCTTAACAGTTCCACTCATGGTAGCTTGTCTTTCTCTTGCCATTCCTATATGGATCCGAAATGGGTACCAGTTTTGGATTCCTCGAGTACAGTGCATGGGTTCTGCAGGAAACCAACGATCTCTTGCAACAAAGGAGGTTTGTATGTCTGCTTTTTAATTTTTAAGATTTTAAAACTTCTCCCACTTTACTTGTAAAATGAAAGTCTTGGTTTCTAGTTGTAATTTCTGGTCTTGACTCTTGTAATCTTACTGTTACGGGTGGGACTTGTTATTCTTACCATGTTGCAAATCGTTCTCAATCTTTCCAATAGAAGTAGAAGTAGGTTCTTCTGTCAGAGTAATGAAAAGAAAGTTGAAGAAAACATTTTATTATATGAAAGCGACACATTACAGCACGAGCCATCACAGAGAGATTCAGGATAGGAGGAGGCATGGACTTCACAGCTAGTGTTTTGCTTAACTACTTGGAAGGTGTATTCGAATTTCCAAATGGGGTATAATGTTGGCCATGTTGGGTGTTATGGAATATTAGTGGTGCGTGCATAGCTCGTTGATAGGATCATGAGGAATAAAGCAAAATAATATAATTCTTTTAGGCTTGTTTGGGCCCCAACATCATGTGAGTGGGGTGGGCTATTATAGCTCATTCCATGTTTGGGGCTCCAATTATAATAGTTGGCATTTCCAACTATTATAACCTACAATTAACTACAATATTACTATTTCAAATCCCTTTTTGTTACAGTGTTTACTATTTCCTACCCATCCCCATCTTCTTTTTCCCTGTTTACTATTTGTTACAGTGTTTACTATTTCCTACTCATTCTCTTTTTCCCTTTTTGTTACACAACTTATTTCCTACCCTGACTGAAATAGTTTGCACCCTAAACACAGACTATTATAACCCACAAACTATAATAACCAATAACTATAATAACCAACTCAGCACCCCAAACACCCCCTTAATCATTAAACGAAGCATAGAGACTTTGAGCTCAATGCGCTTTTACAAAGGTGTTGCCTCTTGTGGTTCAAAGGAAGCTCAAGTGGCTAAAAGTTAGAGTTTGCGTCGAGGCGACCCTCAAGGTTTAAGCCTCAAGCGTCTTTTAAAACATTGATATGAATAGCCCATAGACGGGACCAATCCTTAAAGTTCTGCCTATCTCCTATACTTCTAACTCCTTTTATTTTTATGGTTAGTCTCTCTAACCAACTTTTCAGGTAATTACAAGCACACCACTATCATAATCCTAATTACATACTATACCACTACTAAAATTCCTAATATAATTCCTGGTTTTAATCATAATTAACTTGAATTAATTTTGATAACTTTGACTCTTCAGCAATCCCAATTTTCTTTACTCAATTTCATGAGTTACATGGGGGATTATACTAGGAATTTTAGTAGTGTAAAGAACAATTCCAGGATGGAAAGAACAATTCTTTGAGACTGATGCCCAAACATAGCAAGAGATGAACCTCTTCCCAACCTCTTAAAATCCTGCCAGTCCTCTCAAGTCAAATGCTTCATAGAATATTTGACAAACCAGCTACCGAAGGACCTAGACCTTTGCCGAATGGTGGCTGCAGAAAAACCGCCTCTATCATAGAACAATAGTCTCTGTTCTTAGCCAAACAAATGCCAAACAGCTCCATAAAGCATTCCCAGATTGATCGGTTTGTCAAGTTATTTCTATGTTTTCAATATCTCCCTGTTTCTATTATTTATTTATTTCTTCTTTTCATATTTTTGAACTCTTCAATCACTGTTCAATGCTGCATCTAATTTCTCTATCATGATGACTTTACAGGGTATTGTTCTTGTGATTTGTATGTCACTGTTTTCTGGATCTGTGATAGCTCTTGGTGCTATAGTGTCTGCCAAGCCTCTTAATGATTTACGCTACAAAGGGTGGACTGGTGATGACAAAAGCTTCTCATCGCCTTATGCCACATCTGTGTACCTTGGCTGGGCTATGGCATCTGCAATTTCCTTAGTTGTCACTGGTGTGCTGCCAATAGTTTCATGGTTCTCAACATACCGTTTTTCCTTCTCTTCTGCTGTTTGTGTTGCCATATTCACAGGTAACATCTTCAGGTTTTCTGTAGTTGGCTTTTTGGATTAGGTTCTTGGAACTAGTGGTTTATATTTTATAGATCTGCCTAAAGTTGGTCTTAATAGTCTCTTTTCCTTTTTTATAGTGGTTCTTGTCATGTTTTGTGGCGCATCATATTTGGAAGTTGTGAAATCAAGAGATGATGGGGTTCCTACAAATGGAGATTTTCTTGCTGCTTTGCTTCCGCTAGTGTGCATCCCTGCTTTGCTCTCTCTTTGCTCTGGACTATATAAATGGTGAGAGCCTATTCCTTGTACAATCAGTGTGGATGGAATTATCCGGCTGAAGTTTGCAGCTAGTTACATTATCCGTATGTTTTCAGGAAAGATGATGGCTGGAGGCTTTCACGAGGTGTTTATGCATTTCTTTTTATTGGGCTCCTCCTGCTACTTGGTGCTATATCAGCTGTTATAGTTGTAATTAAACCTTGGACGGTAAGAGAAGACTTCTAATACCTCTGTTTATGTATACTTCAGCCATCTTCATTTTGCTCACAGCACCTTATTTCCAATAAACAGATTGGGGCAGCATTTCTTCTGGTGCTTCTTATGATTGTACTAGCAATTGGCTCTGTCCACCATTGGGCTTCAAACAATTTTTATTTGACGAGGACCCAAATGTTTCTTGTTTGTTTCCTTGCTTTTCTTTTGGCTTTGGCAGCATTCCTCGTTGGATGGTTTGAAGGTGAGAAGGAATTGTCGATCTTATTTCTTGTTCTTGTGTTTGTTTCTCTTCCCTTTTGAAAAGTTATGAGTTCGGTTACAAAGTAAAAATTCCCCACCCTAATAATGGCTTTCTTTTTCATCCTTCAGGCAAACCATTTGTGGGAGCTTCTGTTGGCTATTTCTTATTCTTATTTCTTCTGGCTGGAAGAGCATTAACTGTGGGTTATACTTGACCTTGTTTAACATGTTTTGCTATTCTTATTTGATTTTGTCAATTGTTAACTAGTAGAATATCATTACTTGTACTCACCTTTTTTTCCCCTTTCTTTGCAGGTTTTACTCTCACCACCAATTGTAGTCTACTCTCCAAGAGTTCTACCTGTATATGTTTATGATGCTCACGCAGATTGTGGGAAGAACGTCAGGTTTGTTTAGCTCTCTGATTTCTTTTGATTCATTGGCAAATGTCTGTTACTTTCTTCATGATATTGTATATGGGGCTGTATTAAAAAAACCATCTGATATTTGCTAATTTATTCTCCATAGTGCTGCGTTTCTTGTGCTTTATGGGATAGCATTAGCAACTGAAGGCTGGGGTGTTGTTGCAAGTTTGCTTATTTATCCACCATTTGCTGGAGCTGCTGTATCAGCAATTACTCTTGTTGTATCCTTTGGTTTTGCTGTCTCTCGTCCTTGTTTAACTCTCAAGGTATGAATTATCCATTATGAAGGCCTCTGTAAAAGAACTAGCTCTTTAATTTGTACTCACTACTTAATACATTATCAATACCTTCAAATGTATGTAGATGATGCAAGATGCTGTTCACTTTCTCAGCAAGGAAACTATTATTCAAGCAATTTCTCGGTCTGCCACCAAGGTGTGTTTGACCACAGTACATATAAAAATTACCATATTTGATGTACATACTACTAGATGTGTTTTCATCACCTTCTATGTAACTGTATTCGTAAGGATTCTTGACTGCAGCTTGACCTTTTATGTAGTCAAATTCTTTTTCTGATGCTCTGTTATGATGTATGGATCATATAATCCCTTATTCACCCAAAATGTTACTTGTTAAAATGTACATTCATGCTCCTAGTGCATGTTTTACTGGGTGCCTTCATTTATTCGCTATTGCAACTTTTATTTTCCTTAGTTTAGATCTATTTCTATTCTGAAATACTGTGCCTGCATCTAATTGGACTAATTTATGTTTACAATCCAGACTAGAAATGCTTTATCTGGAACATATTCAGCTCCACAGAGGTCTGCTAGTTCTGCAGCTCTTCTAGTTGGTGATCCTACCGTCATGCGTGATAGAGCAGGAAATTTTGTGCTTCCAAGGGCAGATGTTATGAAACTTAGAGATCGCTTGAGAAATGAGGAATTAGTTGCCGGATCATTCTTCTGTAGACTGAGATACAGGAGGCCATTTTTTCATGAGACAACTAATGATGTTGACCACAGAAGGCAAATGTGTGCTCATGCTCGTATTTTGGCCTTGGAAGAGGCAATTGATACAGAATGGGTATACATGTGGGACAAATTTGGTGGCTATTTACTGCTTTTGCTTGGTCTGACCGCCAAAGCTGAGCGAGTTCAAGTATGTCCTGTGCTGTTGCATATTTTTGATGAAATATATGTTCACTTCTGTTAACTGGTAGATGGAAACGCTTCTTATCTGGCAGTTGGAAACTGACAGATACATAATATTGATTCCTTTTAATAAATATTATAATAAGAAGCGTTTTATTGGTAAGTTTTAAATTGTTTAAATTATATTGAGCACTTTTGTAAAATTTAATTTGATTTGGTTGATTCTTTTTTGTAGGATGAGGTTCGTTTAAGACTTTTTCTTGATAGCATAGGCTTTTCAGACCTGAGTGCTAAGAAAATAAAAAAGTGGATGCCTGAGGACCGTAGACAGTTTGAAATCATTCAGGAGAGGTAAATAACTTTGTTATTTTTCTTTAAGAATTATGGGAAAACTCTAATTTTCTTGTATACATTGTAGTTATATAAGGGAAAAAGAAATGGAAGAAGAAATCCTAATGCAAAGACGTGAGGAGGAGGGAAGAGGTAAAGAAAGAAGAAAGGCTTTACTGGAGAAAGAAGAGCGTAAGTGGAAGGAAATAGAAGCTTCTCTAATGTCCTCTATTCCAAATGCTGGTGGTAGAGAAGCTGCTGCCATGGCTGCAGCTGTGCGTGCTGTTGGAGGTGATTCTGTTCTTGAGGATTCTTTTGCTCGAGAAAGAGTGTCAAGCATTGCTCGTAGGATTCGAGTAGCCCAGTTAGCTCGTCGTGCACTTCAGGTTGAGTACTTCAGTCTATCGGATGAAATAAGGAATATGACCTTTTTCTTGGGCTGGGCAGTTTAATATTGCCATTCTATGCTTAGATAAAATCTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGGTTCAAAGTACTTTAGATATTTATGAACAAGAAACAATTTTTCATCGATGTAGTGAAAAAAAATAAAAGTTTAAATGATACGAACTCCCAAAAGGAGAGAAGAAAAGATATGAACAACAAATACATCCACATAGAGATAAACTTACAAACAACTATGACCAACATGCCCCTTCAAATACTGAGAAACAAAGGCTACTACAAAACCACTGAAAAGAAACCCAAAAAACGACCACCAAACAAAAGACGTCAAAGAGCAAACTGAAAACTCATTGACGATGCACAAACTAAAAAATTGATGAAAGAAGCTTTTGAAGAACCAACTCCAAGAATCTTGAAAAGAAACTCCCGCCTAACGCTTTAACACCTTCCAACAGGAGAACAAAATAGAGCCGAACAAGTGATATTAAACCGCCATCACATAGATCTTCCAAAAACAACAAACAAGAAGCCAGTCACCCTCTGTAAAGCAACCCCTTTTAAAATACTCCCACGTTCTCTATTGTTCGAAGCAGAAAATCCAACTTTGATTGACGATCTAACTGCCATAACTCCTTAGATTTTTACAAGATGGGATAAAGGGTGAATTTCTTTGAATTGTATTTCACAAACCTTAACTAAAGATGAAAACTTTGAAAGAATGGAGGAAGGTATTATAACTGGCGATAATGGTTGAGACAGTCTAATTATCTTGGGCAGCATCCTTAAATAATGCCTCAAAAGCATCAACAAATCAATCCACAAGATAAGGATCTTCAAACTCTCTTTCTTAAAAAAATGCTTCGAGCTCCATACTAAGACTAATCTCAGAATCTTCATCAAAATCAAAAGCGTGAATACACGTAGAAGGAACATATTCCATAAGCGGAGTAACAACTTTAATTAAGTTAACATTAGATCCAAAGGAGAAATAACTAACTGACTAAAGGACTAATAGGAACAAAGCCTCGATGCGACTCATTTTCTCGTGGCAATATCTCCTGATTGGAAGATGCCAGCAATCTTACATAAGATCTTCCAAAATATCAGCATCAAACTCAGTTTTCGTGAACTAGATAGGGTTTAGGGTTACTGTAGATTAGCCAAAATATTTGTTTTTTAGTCAGAATTCTCTTGCAAGTTGTTTTTTCAATCAAAATATTCTGAATAAGTGATCAAATAGTTATTTTAATGGCTGTTTTCACAATAGGATGTGAGCCTTCTGAGCACAACTTGATGATATGCATCTATCTAGCCTAACAGGCTAAAGCTTCATGCTTTGCTTATTTTATTGCATCAGTGCAATTCCTTGGCTATTTTTGTGGAATGCGTTATCTAGGTTTCTGCCTTCCCTTTTCTATTACCGTGGGTGTGCTGACCATCATATGTTTGACTTATCCAATCCCACATGATTAGTAATTGACACCTTAGTTCCCTCTTTGAAATTGTAATTGTAGTAATATTTTCCCTTGCAAAGAAATTGTGTTGCGCCATTTGGAAAGGAAAATATCCAAAAAGGTGAAATTCTCCCTTTGGGAACCCGGTCACTCCTTGCTTAAGCACCCACGACAAGCTTCAATGAAGATCCCCAAGGCTAAATGTCTCACTATGCTGCTGTATTTTGTGCACAATGATGCCAACTCAAAGGCCCCTTTTCAGCTTGTGTCCCTTCACCCTTGGCTACAATCTTTTGGGATTTTATTCTGAAGGCTTTTGGTTGGTACACTGCCATTCCATTTGAAATTTGGGATTCCTCCTATCCTGCTTATTGGGCATCCCTTCAAGAAGGGGAAAAGAATCCTTTGGCTAAACGTCATTTGGGCTTTTCTGTGGTCCATTTGGTTAGAAAGGAATTCTTGTATCCTTTCTCTCTTCTATCATCGACGTATCTCTTTCTTGGCATAAATAGAATTTCCCCTTTTGTAATTGTAGTTTATCATCTCTTATAGCCCATTGGATTCGTTTTTTGTAATCCCCTCGATAAGAATTTCAGAGTTTCTTGTAACTTCATCATATCAATGAAATGAATGGTTTCCTATCAACAAAAAAAGAAAAGGAAAAAAGGAAAATAACAATAATGATAATATTTACCCTTCGTTTTCATTGTTCTTCTGCAGACGGGAATCCTTGGTGCTGTCTGCGTCCTTGATGATGAGCCAATAGGATGTGGCAAGCACTGTGGTCAGATTGAGGCTAGTTTATGTCAAAGTCGAAAAATCAGCATTTCAATTGCTGCATTGATTCAGCCAGAGTCTGGTCCTGTTTGTCTGTTTGGTACGGAGTATCAGAAGAAGATTTGCTGGGAATTTCTGGTGGCTGGTTCCGAACAAGGCATTGAAGCTGGACAAGTTGGTCTCAGATTGATTACTAAAGGTGATAGACAATCTACTGTAACAAAGGAGTGGAGCATAAGTGCTACCAGTATTGCTGATGGAAGGTCTGTGTCGGTTGTTTTGAGTATTAGACTAAGCAGCTTCTCCCTCTTGCTTCTGCTTTGTTTCATTTTGCGTGAGATTGCTTTCTTTGGTGTAGCTGATGTAAGGCGTTTGTTGTTTGTTTTGTGGACCAGACTGAGCATCCCTTTTTCTCTCTCGTTCCTTGTACTTAGGGTTCTAAGTTTTGTGCTTTCAACACCCTTATCCTGGCTGTTTTGGAGAGTAATCTTTTGGTTCAGCAGAATTCTATTTAATATCTTATCTTTTATTTGCAGGTGGCATATTATAACTATGACCATTGATGCTGATTTGGGAGAAGCAACTTGCTATTTAGATGGTGGGTTTGATGGCTACCAGATTGGGTTGCCATTAAATGTGGGTGATAACATTTGGGAGCAAGGGACAGAAATTTGGGTTGGTGTTAGACCCCCCACAGATGTTGACATATTTGGAAGATCAGATAGTGAAGGAGCTGAGTCCAAGATGCACATTATGGATGTGTTTCTCTGGGGAAGGAGCTTAACTGAAGATGAAATTGCTGCTCTCCACGCAGCTATTAGCTCCACAGACTATAATATGATTGACTTCGCTGAAGATAATTGGGAGTGGGCGGATTCACCATCTAGAGTATGTTCTGTTTCCCTATTAAGATATGAGAGGCAAGCTACCATTCCAGTTGAACCTATTGACTAATGGATGCCTTATTTGTTTTCCTCAGGTTGATGAGTGGGATAGTGATCCCGCAGATGTAGATCTGTATGATAGGGATGATGTGGATTGGGATGGACAGTATTCTAGTGGAAGGAAGCGAAGATTGGAGCGTGATGGAGTAGTAGTTGATGTGGATTCTTTCACAAGGAAGTTTAGGAGGCCTCGTATGGAAACATGTGAGGAAATCAACCAACGGATGCTTTCAGTAGAATTGGCTGTTAAAGAAGCCCTCTCTGCTAGGGGAGAAATGCATTTTACTGATGAGGAGTTTCCTCCAAATGATGAGTCTTTGTATGTGGATCCAAAGAATCCACCTTCTAAACTACAGGTACTTTTTCTGACTGGTTATCTGATTATTACGGCTTTTAATCAGTGTCATATTTTCTTCATAAATTGTTTGTTCTCACTTAGGTTGTGTCCGAGTGGATGAGGCCTCTTGAGTTAATAAAAGAGGGCAGAATAGAATCTCAGCCTTGCTTATTTTCTGAGGCTGCCAATCCATCTGATGTTTGTCAGGTTTGTCTCTGAGAAACCACTAAAATATTTGCCCATTTGGTTTGTTCTTACAAAAGGAACGATCATGTTTGTCTTTTGAGTTTACCAATATTAAGCATTCAATCTCTCCATTTTTGGCTCTGATCTTACTAAACATTAACAAAATCCATGTCTTGTGATAAAATAAAACAGGGACGATTGGGTGATTGTTGGTTCCTAAGTGCTGTTGCGGTGTTAACCGAGGCTTCCAAAATTTCTGAAGTGATTATTACTCCACGTTATAATGATGAAGGGATCTACACAGTTCGCTTCTGTATTCAGGTGAGCTTTTGACTATTATCATTTGGTGCTCATGCATGAACGTGGTGTTGATGAAATAATATGTTATTATATCAGCTGGGCTTCATCATGGTTGGTGAAATTCTTCTATTGTTTCTAATTTATTCACGTTCCTAATCTTTATTGCTAGGTTACTTCTAGGTGGTGGGAGAAAGAGCCTTTCTGTTTGTTTTCAGCCTAGGATCTAGTCTTAAAAAAACCAATAGGGAAAGGCCCTCAATGTCTTGGGTGAATTGTAACTCTTCTAAGCTCTTTAGGTGGTGGTATTTAGCAAGGTGTATATTGTCTGTAACCCTTCTCCTTCTTAAGTTTTAAAGAGAAGAATAATTACGATTGATTAAAAGGGGGAGGAAACCTTCTTACTCTGTCTTTGCTCAAAAGGAGAAGAAACCATTGATCACCTGATGGTTAAGGATGGTCCTTTATTAAAACAGAGTTTGGGCTCTAGATCTGCTTCCCCAAAAAGGTTGAAGACTGGTTGTTTGAGGGCTTGATGGGAAATTTTAGGGGAAAAACAAAGATCCTATGGGGCTGTGGTGTTGAAGTTATTTTATGGTTGACTTGCAAAGGAGCGAAATCTTAGAATCTTTGATGATAGGAGTTTTGAAGATTCTTTTCTCTATAGCATTCAACACATGGCTTCTTGGTCGACAGAATTTTGGTGTAAGGCTTTTGTTTTAGTTCTTCTGTTTGATGAGGAGTTCTCTACCCGCCTTGAGCTGTATTCCTATTTGGAGATATGAATACACATATCTTTCATAAGGGACGAATAATTACTAAAATCTTTACTGAGCGTGCACCAAGAAGCTGTAACAAATAAAACATTGCCTATAAAACATTACAAATCTTTGATCTTATTTCTCTCAAACCATGCGCACCATATAAAACTTGTAATTGTAACTGTATTGAGCCGACTTTTGCCTTGTCTTTGAACGGATGAGAAAGGATAGTAGAGAAGAGATTGTCAGCCACTTTGTAGTTGAAGATAAAAGACCAACCAGAAGAGGTCAAGAATTGTTTTTGATTCAATCTGTACATCTTCCTGTATTGAGTAATAGTATTAGTTTACTAATACATATTCTAGCTCCTAACTAAGTCTTCATTTCTTATTTTTGTAATTTCTGTTTCGATGTATTGTGATTAATATTTTAATTAGGGTATGTTATTCTCTCTTTCTAGAGTGAGTGGGTCCCTGTGGTTGTTGATGATTGGATCCCATGTGAATCACCGGGGAAACCTGCATTTGCTACTAGTAAGAAAGGTAATGAGCTCTGGGTGTCCATATTGGAGAAGGCATATGCTAAGTTACATGGGTCGTACGAGGCATTGGAGGGTGGTCTAGTTCAGGATGCTCTTGTAGATCTTACTGGAGGTGCTGGCGAGGAGATCGACATGAGGAGCGCCCAGGCCCAGATTGATTTAGCTAGTGGTAGGCTATGGTCTCAATTGTTGCGCTTTAAACAAGAGGGATTTTTACTTGGTGCTGGCAGTCCGTCAGGTTCAGATGTGCATATTTCCTCTAGCGGCATTGTGCAAGGACATGCCTATTCGTTGCTACAGGTAAGTTGTTTCTACCTGAGAATCTGATAAATTTGAATCAATTTTCTGATAGAACTTTCACGATAAAGAATGTCCAACGTTTTACTGCATTTTCAGGTAAGAGAGGTTGATGGCCACAAGCTTATTCAGATACGTAATCCATGGGCCAATGAAGTTGAGTGGAATGGCCCTTGGGCTGATACTTCACCTGAATGGACTGATAGGATGAAGCACAAGCTAAAGCATATTCCGCAGGTCTGTCCTTATGGTATTGACTGCCAGATTTTCTGTAAGATGAAAGGACGTACGATGAAAAATGGATGTTAAAACTTAAAGTGTGATCTCAACATAGTAATCGTATGAGACTTGAAATTATAAACTAAAGCATGAATCAAATGTTTGATCTCTGATTCTCAACATAGTAATCATATGAGACTTGATATGAACTAAAAGCACGACACAACTCTTTTTATGTTTTGTATGTGTTTTGCAGTCAAAAGATGGAATATTCTGGATGTCATGGCAAGATTTTCAGATACACTTTCGATCAATATATGTTTGTCGAATCTATCCTCCAGAGATGCGCTACTCCGTGCATGGCCAATGGAGAGGTTATAGTGCTGGTGGCTGTCAAGATTATGATACATGGCATCAAAATCCACAATTTCGATTGAGGGCTTCTGGGCCAGATGCATCATATCCTGTACATGTATTCATAACCTTAACCCAGGTATTCTTCTTTACTTGAATTTGAAATACCTTCTTCCTGGTGGACATTATTGACTTAAAGAATTATACATTTTAAGGGGGTGAGCTTCTCGAGGACTGCTGCTGGTTTTAGGAACTATCAGTCAAGTCATGATTCAATGATGTTCTACATTGGAATGAGAATCTTGAAAACCCGTGGACGTCGAGCAGCTTACAATATTTACCTGCATGAATCCGTTGGGGGCACAGACTATGTCAATTCCCGTGAAATATCATGTGAGATGGTTCTGGAACCTGATCCAAAGGGCTACACAATTGTGCCTACAACGATACACCCAGGCGAAGAAGCACCATTTGTCCTCTCTGTCTTCACTAAAGCATCTATAACCTTGGATGTTTTATAA

mRNA sequence

ATGGAAGGGGACGAGCATAAGGTGGTATTAGCGTGTGTGATATCAGGGTCGCTCTTCTCGGTGCTGGGCTCTGCCTCGTTTTTCATACTCTGGGCTGTGAACTGGCGCCCATGGCGGATTTATAGTTGGATCTTTGCTAGAAAGTGGCCAAATATCTTGCAAGGACCCCAGCTGGATTTGCTTTGTGGTTTTCTCTCTTTATCTGCATGGATATTAGTTATTTCTCCAATTGCGGTGCTGATCATATGGGGGTGCTGGCTGATTGTAATATTGGGTCGAGACATAACTGGGCTTGCTGTGGTGATGGCTGGCACGGCTCTTCTACTTGCATTTTATTCAATTATGCTATGGTGGAGAACACAATGGCAAAGCTCAAGGAAAAAAAGGAATTTTAGCCTTCCTAAGCTTGGTGAAGAAATGATAGTTGGAGCAAAAAGTTGCCCAATCAAACAGGCAAAAGAAGATTTAGGTAAGAAACAACGTGAAGGTTTTAAGGGTCCATGCATGTTTACCTTGAAAGGGGCTGTTGCTATTCTTCTTCTTTTGGCGGTTGCGCTTCTATGTGCATATGAACTTTGTGCTGTATATGTTACAGCTGGTTCTAGTGCATCTGAGCGTTATTCACCTTCTGGTTTCTTTTTTGGTATATCAGCAATTGCCTTAGCGATCAACATGCTCTTCATTTGTCGGATGGTCTTTAATGGAAATGGATTAGATGTGGATGAATATGTGCGAAAGGCATATAAGTTTGCATATTCTGATTGTATGGAAGTGGGTCCCTTGGCTTCTTTACCTGAACCACCTGATCCCAATGAATTGGCTTCACATCTTGGGCTTCTTTATGTTGGTTCGGTTTTAGTACTTGTTGCCTACTCTATTTTATATGGCCTTACGGCTAAGGAGGCACGCTGGCTTGGTGCCACTACTTCTGCTGCTGTTATCATTCTTGATTGGAATGTAGGGGCATGCTTGTACGGGTTCCAGCTGTTAAAAAGTGGCGTTTTAGCACTTTTTGTGGCTGGCATGTCTCGTGTTTTTCTCATTTGTTTTGGAGTTCATTATTGGTATTTGGGACATTGTATAAGTTATGCAGTTGTAGCTTCTGTACTATTGGGTGCTGCTGTGATGCGTCATCTTTCAGCAACAGATCCGTTTGCTGCTAGGAGGGATGCCTTGCAAAGCACCGTGATTCGATTGAGAGAGGGGTTTCGCAGGAAAGAGCCAAATAGTTCATCAAGCTCATCAGATGGTTGTGGTTCAAGTATGAAACGTAGTAGCAGTGTTGAGGCAGGTCACCTTGGTAATGCTGTTGAATCTACTAGCAAGAGTGGGCCAGCAGCACAATGTACAGTTGACGGTAATAATTGGAATGGCGTCCTATGCCGAGCTAGTAGTTCACAAGAGGGGATTAACAGTGATAAGAGCATGGATAGTGGACGTCCAAGTTTGGCTTTGCGGAGTAGTTCTTGTCGTTCTATCATCCAAGAGCCTGATGCAGCAATGTCGTTCGTGGATAAAATTTTTGATCATAATAGTTCCTTGGTGGTTTGTTCTAGTAGTGGGCTTGACAGCCAAGGCTGTGAATCTAGCACGTCAACTTCTGCAAACCAACAAACATTGGATTTGAACTTGGCTTTGGCGTTGCAGGAAAGGTTGAGTGACCCAAGGATTACATCCATGCTGAAGAGAAGCTCTAGACAAGGGGATCGTGAATTGGCTAGTTTACTGCAAAATAAAGGACTGGATCCTAATTTTGCAATGATGTTAAAGGAGAAGAGCCTGGACCCAACTATCCTTGCATTGCTTCAAAGGAGTAGTTTGGATGCAGACAGAGAACACCGAGATAATACTGATATTACCATCATTGATTCAAACAGTGTGGACAACATGCTGCCCAATCAAATTTCTCTATCTGAAGAACTGAGACTTCATGGGCTTGAAAAGTGGCTTCAGTTCTCCAGGCTTGTACTACACAATGTAGCCGGGACCCCAGAACGGGCATGGGTGATCTTTAGTCTTGTCTTCATAATTGAAACAATCATTGTAGCCATATTCCGTCCAAAAACTATTGACATTATAAATGCGAAACACCAACAGTTTGAATTTGGCTTTGCTGTCTTACTACTATCTCCTGTGGTCTGTTCAATTATGGCTTTCCTCCAGTCACTGCAAGCAGAAGAAATGTCAATGACCTCAAAACCTCGGAAGTATGGGTTCATCGCTTGGCTACTCAGCACTTCTGTGGGGCTTTTACTTTCTTTCTTGAGCAAGTCATCGGTCCTTTTAGGACTGTCCTTAACAGTTCCACTCATGGTAGCTTGTCTTTCTCTTGCCATTCCTATATGGATCCGAAATGGGTACCAGTTTTGGATTCCTCGAGTACAGTGCATGGGTTCTGCAGGAAACCAACGATCTCTTGCAACAAAGGAGGGTATTGTTCTTGTGATTTGTATGTCACTGTTTTCTGGATCTGTGATAGCTCTTGGTGCTATAGTGTCTGCCAAGCCTCTTAATGATTTACGCTACAAAGGGTGGACTGGTGATGACAAAAGCTTCTCATCGCCTTATGCCACATCTGTGTACCTTGGCTGGGCTATGGCATCTGCAATTTCCTTAGTTGTCACTGGTGTGCTGCCAATAGTTTCATGGTTCTCAACATACCGTTTTTCCTTCTCTTCTGCTGTTTGTGTTGCCATATTCACAGTGGTTCTTGTCATGTTTTGTGGCGCATCATATTTGGAAGTTGTGAAATCAAGAGATGATGGGGTTCCTACAAATGGAGATTTTCTTGCTGCTTTGCTTCCGCTAGTGTGCATCCCTGCTTTGCTCTCTCTTTGCTCTGGACTATATAAATGGAAAGATGATGGCTGGAGGCTTTCACGAGGTGTTTATGCATTTCTTTTTATTGGGCTCCTCCTGCTACTTGGTGCTATATCAGCTGTTATAGTTGTAATTAAACCTTGGACGATTGGGGCAGCATTTCTTCTGGTGCTTCTTATGATTGTACTAGCAATTGGCTCTGTCCACCATTGGGCTTCAAACAATTTTTATTTGACGAGGACCCAAATGTTTCTTGTTTGTTTCCTTGCTTTTCTTTTGGCTTTGGCAGCATTCCTCGTTGGATGGTTTGAAGGCAAACCATTTGTGGGAGCTTCTGTTGGCTATTTCTTATTCTTATTTCTTCTGGCTGGAAGAGCATTAACTGTTTTACTCTCACCACCAATTGTAGTCTACTCTCCAAGAGTTCTACCTGTATATGTTTATGATGCTCACGCAGATTGTGGGAAGAACGTCAGTGCTGCGTTTCTTGTGCTTTATGGGATAGCATTAGCAACTGAAGGCTGGGGTGTTGTTGCAAGTTTGCTTATTTATCCACCATTTGCTGGAGCTGCTGTATCAGCAATTACTCTTGTTGTATCCTTTGGTTTTGCTGTCTCTCGTCCTTGTTTAACTCTCAAGATGATGCAAGATGCTGTTCACTTTCTCAGCAAGGAAACTATTATTCAAGCAATTTCTCGGTCTGCCACCAAGACTAGAAATGCTTTATCTGGAACATATTCAGCTCCACAGAGGTCTGCTAGTTCTGCAGCTCTTCTAGTTGGTGATCCTACCGTCATGCGTGATAGAGCAGGAAATTTTGTGCTTCCAAGGGCAGATGTTATGAAACTTAGAGATCGCTTGAGAAATGAGGAATTAGTTGCCGGATCATTCTTCTGTAGACTGAGATACAGGAGGCCATTTTTTCATGAGACAACTAATGATGTTGACCACAGAAGGCAAATGTGTGCTCATGCTCGTATTTTGGCCTTGGAAGAGGCAATTGATACAGAATGGGTATACATGTGGGACAAATTTGGTGGCTATTTACTGCTTTTGCTTGGTCTGACCGCCAAAGCTGAGCGAGTTCAAGATGAGGTTCGTTTAAGACTTTTTCTTGATAGCATAGGCTTTTCAGACCTGAGTGCTAAGAAAATAAAAAAGTGGATGCCTGAGGACCGTAGACAGTTTGAAATCATTCAGGAGAGTTATATAAGGGAAAAAGAAATGGAAGAAGAAATCCTAATGCAAAGACGTGAGGAGGAGGGAAGAGGTAAAGAAAGAAGAAAGGCTTTACTGGAGAAAGAAGAGCGTAAGTGGAAGGAAATAGAAGCTTCTCTAATGTCCTCTATTCCAAATGCTGGTGGTAGAGAAGCTGCTGCCATGGCTGCAGCTGTGCGTGCTGTTGGAGGTGATTCTGTTCTTGAGGATTCTTTTGCTCGAGAAAGAGTGTCAAGCATTGCTCGTAGGATTCGAGTAGCCCAGTTAGCTCGTCGTGCACTTCAGACGGGAATCCTTGGTGCTGTCTGCGTCCTTGATGATGAGCCAATAGGATGTGGCAAGCACTGTGGTCAGATTGAGGCTAGTTTATGTCAAAGTCGAAAAATCAGCATTTCAATTGCTGCATTGATTCAGCCAGAGTCTGGTCCTGTTTGTCTGTTTGGTACGGAGTATCAGAAGAAGATTTGCTGGGAATTTCTGGTGGCTGGTTCCGAACAAGGCATTGAAGCTGGACAAGTTGGTCTCAGATTGATTACTAAAGGTGATAGACAATCTACTGTAACAAAGGAGTGGAGCATAAGTGCTACCAGTATTGCTGATGGAAGGTGGCATATTATAACTATGACCATTGATGCTGATTTGGGAGAAGCAACTTGCTATTTAGATGGTGGGTTTGATGGCTACCAGATTGGGTTGCCATTAAATGTGGGTGATAACATTTGGGAGCAAGGGACAGAAATTTGGGTTGGTGTTAGACCCCCCACAGATGTTGACATATTTGGAAGATCAGATAGTGAAGGAGCTGAGTCCAAGATGCACATTATGGATGTGTTTCTCTGGGGAAGGAGCTTAACTGAAGATGAAATTGCTGCTCTCCACGCAGCTATTAGCTCCACAGACTATAATATGATTGACTTCGCTGAAGATAATTGGGAGTGGGCGGATTCACCATCTAGAGTTGATGAGTGGGATAGTGATCCCGCAGATGTAGATCTGTATGATAGGGATGATGTGGATTGGGATGGACAGTATTCTAGTGGAAGGAAGCGAAGATTGGAGCGTGATGGAGTAGTAGTTGATGTGGATTCTTTCACAAGGAAGTTTAGGAGGCCTCGTATGGAAACATGTGAGGAAATCAACCAACGGATGCTTTCAGTAGAATTGGCTGTTAAAGAAGCCCTCTCTGCTAGGGGAGAAATGCATTTTACTGATGAGGAGTTTCCTCCAAATGATGAGTCTTTGTATGTGGATCCAAAGAATCCACCTTCTAAACTACAGGTTGTGTCCGAGTGGATGAGGCCTCTTGAGTTAATAAAAGAGGGCAGAATAGAATCTCAGCCTTGCTTATTTTCTGAGGCTGCCAATCCATCTGATGTTTGTCAGGGACGATTGGGTGATTGTTGGTTCCTAAGTGCTGTTGCGGTGTTAACCGAGGCTTCCAAAATTTCTGAAGTGATTATTACTCCACGTTATAATGATGAAGGGATCTACACAGTTCGCTTCTGTATTCAGAGTGAGTGGGTCCCTGTGGTTGTTGATGATTGGATCCCATGTGAATCACCGGGGAAACCTGCATTTGCTACTAGTAAGAAAGGTAATGAGCTCTGGGTGTCCATATTGGAGAAGGCATATGCTAAGTTACATGGGTCGTACGAGGCATTGGAGGGTGGTCTAGTTCAGGATGCTCTTGTAGATCTTACTGGAGGTGCTGGCGAGGAGATCGACATGAGGAGCGCCCAGGCCCAGATTGATTTAGCTAGTGGTAGGCTATGGTCTCAATTGTTGCGCTTTAAACAAGAGGGATTTTTACTTGGTGCTGGCAGTCCGTCAGGTTCAGATGTGCATATTTCCTCTAGCGGCATTGTGCAAGGACATGCCTATTCGTTGCTACAGGTAAGAGAGGTTGATGGCCACAAGCTTATTCAGATACGTAATCCATGGGCCAATGAAGTTGAGTGGAATGGCCCTTGGGCTGATACTTCACCTGAATGGACTGATAGGATGAAGCACAAGCTAAAGCATATTCCGCAGTCAAAAGATGGAATATTCTGGATGTCATGGCAAGATTTTCAGATACACTTTCGATCAATATATGTTTGTCGAATCTATCCTCCAGAGATGCGCTACTCCGTGCATGGCCAATGGAGAGGTTATAGTGCTGGTGGCTGTCAAGATTATGATACATGGCATCAAAATCCACAATTTCGATTGAGGGCTTCTGGGCCAGATGCATCATATCCTGTACATGTATTCATAACCTTAACCCAGGGGGTGAGCTTCTCGAGGACTGCTGCTGGTTTTAGGAACTATCAGTCAAGTCATGATTCAATGATGTTCTACATTGGAATGAGAATCTTGAAAACCCGTGGACGTCGAGCAGCTTACAATATTTACCTGCATGAATCCGTTGGGGGCACAGACTATGTCAATTCCCGTGAAATATCATGTGAGATGGTTCTGGAACCTGATCCAAAGGGCTACACAATTGTGCCTACAACGATACACCCAGGCGAAGAAGCACCATTTGTCCTCTCTGTCTTCACTAAAGCATCTATAACCTTGGATGTTTTATAA

Coding sequence (CDS)

ATGGAAGGGGACGAGCATAAGGTGGTATTAGCGTGTGTGATATCAGGGTCGCTCTTCTCGGTGCTGGGCTCTGCCTCGTTTTTCATACTCTGGGCTGTGAACTGGCGCCCATGGCGGATTTATAGTTGGATCTTTGCTAGAAAGTGGCCAAATATCTTGCAAGGACCCCAGCTGGATTTGCTTTGTGGTTTTCTCTCTTTATCTGCATGGATATTAGTTATTTCTCCAATTGCGGTGCTGATCATATGGGGGTGCTGGCTGATTGTAATATTGGGTCGAGACATAACTGGGCTTGCTGTGGTGATGGCTGGCACGGCTCTTCTACTTGCATTTTATTCAATTATGCTATGGTGGAGAACACAATGGCAAAGCTCAAGGAAAAAAAGGAATTTTAGCCTTCCTAAGCTTGGTGAAGAAATGATAGTTGGAGCAAAAAGTTGCCCAATCAAACAGGCAAAAGAAGATTTAGGTAAGAAACAACGTGAAGGTTTTAAGGGTCCATGCATGTTTACCTTGAAAGGGGCTGTTGCTATTCTTCTTCTTTTGGCGGTTGCGCTTCTATGTGCATATGAACTTTGTGCTGTATATGTTACAGCTGGTTCTAGTGCATCTGAGCGTTATTCACCTTCTGGTTTCTTTTTTGGTATATCAGCAATTGCCTTAGCGATCAACATGCTCTTCATTTGTCGGATGGTCTTTAATGGAAATGGATTAGATGTGGATGAATATGTGCGAAAGGCATATAAGTTTGCATATTCTGATTGTATGGAAGTGGGTCCCTTGGCTTCTTTACCTGAACCACCTGATCCCAATGAATTGGCTTCACATCTTGGGCTTCTTTATGTTGGTTCGGTTTTAGTACTTGTTGCCTACTCTATTTTATATGGCCTTACGGCTAAGGAGGCACGCTGGCTTGGTGCCACTACTTCTGCTGCTGTTATCATTCTTGATTGGAATGTAGGGGCATGCTTGTACGGGTTCCAGCTGTTAAAAAGTGGCGTTTTAGCACTTTTTGTGGCTGGCATGTCTCGTGTTTTTCTCATTTGTTTTGGAGTTCATTATTGGTATTTGGGACATTGTATAAGTTATGCAGTTGTAGCTTCTGTACTATTGGGTGCTGCTGTGATGCGTCATCTTTCAGCAACAGATCCGTTTGCTGCTAGGAGGGATGCCTTGCAAAGCACCGTGATTCGATTGAGAGAGGGGTTTCGCAGGAAAGAGCCAAATAGTTCATCAAGCTCATCAGATGGTTGTGGTTCAAGTATGAAACGTAGTAGCAGTGTTGAGGCAGGTCACCTTGGTAATGCTGTTGAATCTACTAGCAAGAGTGGGCCAGCAGCACAATGTACAGTTGACGGTAATAATTGGAATGGCGTCCTATGCCGAGCTAGTAGTTCACAAGAGGGGATTAACAGTGATAAGAGCATGGATAGTGGACGTCCAAGTTTGGCTTTGCGGAGTAGTTCTTGTCGTTCTATCATCCAAGAGCCTGATGCAGCAATGTCGTTCGTGGATAAAATTTTTGATCATAATAGTTCCTTGGTGGTTTGTTCTAGTAGTGGGCTTGACAGCCAAGGCTGTGAATCTAGCACGTCAACTTCTGCAAACCAACAAACATTGGATTTGAACTTGGCTTTGGCGTTGCAGGAAAGGTTGAGTGACCCAAGGATTACATCCATGCTGAAGAGAAGCTCTAGACAAGGGGATCGTGAATTGGCTAGTTTACTGCAAAATAAAGGACTGGATCCTAATTTTGCAATGATGTTAAAGGAGAAGAGCCTGGACCCAACTATCCTTGCATTGCTTCAAAGGAGTAGTTTGGATGCAGACAGAGAACACCGAGATAATACTGATATTACCATCATTGATTCAAACAGTGTGGACAACATGCTGCCCAATCAAATTTCTCTATCTGAAGAACTGAGACTTCATGGGCTTGAAAAGTGGCTTCAGTTCTCCAGGCTTGTACTACACAATGTAGCCGGGACCCCAGAACGGGCATGGGTGATCTTTAGTCTTGTCTTCATAATTGAAACAATCATTGTAGCCATATTCCGTCCAAAAACTATTGACATTATAAATGCGAAACACCAACAGTTTGAATTTGGCTTTGCTGTCTTACTACTATCTCCTGTGGTCTGTTCAATTATGGCTTTCCTCCAGTCACTGCAAGCAGAAGAAATGTCAATGACCTCAAAACCTCGGAAGTATGGGTTCATCGCTTGGCTACTCAGCACTTCTGTGGGGCTTTTACTTTCTTTCTTGAGCAAGTCATCGGTCCTTTTAGGACTGTCCTTAACAGTTCCACTCATGGTAGCTTGTCTTTCTCTTGCCATTCCTATATGGATCCGAAATGGGTACCAGTTTTGGATTCCTCGAGTACAGTGCATGGGTTCTGCAGGAAACCAACGATCTCTTGCAACAAAGGAGGGTATTGTTCTTGTGATTTGTATGTCACTGTTTTCTGGATCTGTGATAGCTCTTGGTGCTATAGTGTCTGCCAAGCCTCTTAATGATTTACGCTACAAAGGGTGGACTGGTGATGACAAAAGCTTCTCATCGCCTTATGCCACATCTGTGTACCTTGGCTGGGCTATGGCATCTGCAATTTCCTTAGTTGTCACTGGTGTGCTGCCAATAGTTTCATGGTTCTCAACATACCGTTTTTCCTTCTCTTCTGCTGTTTGTGTTGCCATATTCACAGTGGTTCTTGTCATGTTTTGTGGCGCATCATATTTGGAAGTTGTGAAATCAAGAGATGATGGGGTTCCTACAAATGGAGATTTTCTTGCTGCTTTGCTTCCGCTAGTGTGCATCCCTGCTTTGCTCTCTCTTTGCTCTGGACTATATAAATGGAAAGATGATGGCTGGAGGCTTTCACGAGGTGTTTATGCATTTCTTTTTATTGGGCTCCTCCTGCTACTTGGTGCTATATCAGCTGTTATAGTTGTAATTAAACCTTGGACGATTGGGGCAGCATTTCTTCTGGTGCTTCTTATGATTGTACTAGCAATTGGCTCTGTCCACCATTGGGCTTCAAACAATTTTTATTTGACGAGGACCCAAATGTTTCTTGTTTGTTTCCTTGCTTTTCTTTTGGCTTTGGCAGCATTCCTCGTTGGATGGTTTGAAGGCAAACCATTTGTGGGAGCTTCTGTTGGCTATTTCTTATTCTTATTTCTTCTGGCTGGAAGAGCATTAACTGTTTTACTCTCACCACCAATTGTAGTCTACTCTCCAAGAGTTCTACCTGTATATGTTTATGATGCTCACGCAGATTGTGGGAAGAACGTCAGTGCTGCGTTTCTTGTGCTTTATGGGATAGCATTAGCAACTGAAGGCTGGGGTGTTGTTGCAAGTTTGCTTATTTATCCACCATTTGCTGGAGCTGCTGTATCAGCAATTACTCTTGTTGTATCCTTTGGTTTTGCTGTCTCTCGTCCTTGTTTAACTCTCAAGATGATGCAAGATGCTGTTCACTTTCTCAGCAAGGAAACTATTATTCAAGCAATTTCTCGGTCTGCCACCAAGACTAGAAATGCTTTATCTGGAACATATTCAGCTCCACAGAGGTCTGCTAGTTCTGCAGCTCTTCTAGTTGGTGATCCTACCGTCATGCGTGATAGAGCAGGAAATTTTGTGCTTCCAAGGGCAGATGTTATGAAACTTAGAGATCGCTTGAGAAATGAGGAATTAGTTGCCGGATCATTCTTCTGTAGACTGAGATACAGGAGGCCATTTTTTCATGAGACAACTAATGATGTTGACCACAGAAGGCAAATGTGTGCTCATGCTCGTATTTTGGCCTTGGAAGAGGCAATTGATACAGAATGGGTATACATGTGGGACAAATTTGGTGGCTATTTACTGCTTTTGCTTGGTCTGACCGCCAAAGCTGAGCGAGTTCAAGATGAGGTTCGTTTAAGACTTTTTCTTGATAGCATAGGCTTTTCAGACCTGAGTGCTAAGAAAATAAAAAAGTGGATGCCTGAGGACCGTAGACAGTTTGAAATCATTCAGGAGAGTTATATAAGGGAAAAAGAAATGGAAGAAGAAATCCTAATGCAAAGACGTGAGGAGGAGGGAAGAGGTAAAGAAAGAAGAAAGGCTTTACTGGAGAAAGAAGAGCGTAAGTGGAAGGAAATAGAAGCTTCTCTAATGTCCTCTATTCCAAATGCTGGTGGTAGAGAAGCTGCTGCCATGGCTGCAGCTGTGCGTGCTGTTGGAGGTGATTCTGTTCTTGAGGATTCTTTTGCTCGAGAAAGAGTGTCAAGCATTGCTCGTAGGATTCGAGTAGCCCAGTTAGCTCGTCGTGCACTTCAGACGGGAATCCTTGGTGCTGTCTGCGTCCTTGATGATGAGCCAATAGGATGTGGCAAGCACTGTGGTCAGATTGAGGCTAGTTTATGTCAAAGTCGAAAAATCAGCATTTCAATTGCTGCATTGATTCAGCCAGAGTCTGGTCCTGTTTGTCTGTTTGGTACGGAGTATCAGAAGAAGATTTGCTGGGAATTTCTGGTGGCTGGTTCCGAACAAGGCATTGAAGCTGGACAAGTTGGTCTCAGATTGATTACTAAAGGTGATAGACAATCTACTGTAACAAAGGAGTGGAGCATAAGTGCTACCAGTATTGCTGATGGAAGGTGGCATATTATAACTATGACCATTGATGCTGATTTGGGAGAAGCAACTTGCTATTTAGATGGTGGGTTTGATGGCTACCAGATTGGGTTGCCATTAAATGTGGGTGATAACATTTGGGAGCAAGGGACAGAAATTTGGGTTGGTGTTAGACCCCCCACAGATGTTGACATATTTGGAAGATCAGATAGTGAAGGAGCTGAGTCCAAGATGCACATTATGGATGTGTTTCTCTGGGGAAGGAGCTTAACTGAAGATGAAATTGCTGCTCTCCACGCAGCTATTAGCTCCACAGACTATAATATGATTGACTTCGCTGAAGATAATTGGGAGTGGGCGGATTCACCATCTAGAGTTGATGAGTGGGATAGTGATCCCGCAGATGTAGATCTGTATGATAGGGATGATGTGGATTGGGATGGACAGTATTCTAGTGGAAGGAAGCGAAGATTGGAGCGTGATGGAGTAGTAGTTGATGTGGATTCTTTCACAAGGAAGTTTAGGAGGCCTCGTATGGAAACATGTGAGGAAATCAACCAACGGATGCTTTCAGTAGAATTGGCTGTTAAAGAAGCCCTCTCTGCTAGGGGAGAAATGCATTTTACTGATGAGGAGTTTCCTCCAAATGATGAGTCTTTGTATGTGGATCCAAAGAATCCACCTTCTAAACTACAGGTTGTGTCCGAGTGGATGAGGCCTCTTGAGTTAATAAAAGAGGGCAGAATAGAATCTCAGCCTTGCTTATTTTCTGAGGCTGCCAATCCATCTGATGTTTGTCAGGGACGATTGGGTGATTGTTGGTTCCTAAGTGCTGTTGCGGTGTTAACCGAGGCTTCCAAAATTTCTGAAGTGATTATTACTCCACGTTATAATGATGAAGGGATCTACACAGTTCGCTTCTGTATTCAGAGTGAGTGGGTCCCTGTGGTTGTTGATGATTGGATCCCATGTGAATCACCGGGGAAACCTGCATTTGCTACTAGTAAGAAAGGTAATGAGCTCTGGGTGTCCATATTGGAGAAGGCATATGCTAAGTTACATGGGTCGTACGAGGCATTGGAGGGTGGTCTAGTTCAGGATGCTCTTGTAGATCTTACTGGAGGTGCTGGCGAGGAGATCGACATGAGGAGCGCCCAGGCCCAGATTGATTTAGCTAGTGGTAGGCTATGGTCTCAATTGTTGCGCTTTAAACAAGAGGGATTTTTACTTGGTGCTGGCAGTCCGTCAGGTTCAGATGTGCATATTTCCTCTAGCGGCATTGTGCAAGGACATGCCTATTCGTTGCTACAGGTAAGAGAGGTTGATGGCCACAAGCTTATTCAGATACGTAATCCATGGGCCAATGAAGTTGAGTGGAATGGCCCTTGGGCTGATACTTCACCTGAATGGACTGATAGGATGAAGCACAAGCTAAAGCATATTCCGCAGTCAAAAGATGGAATATTCTGGATGTCATGGCAAGATTTTCAGATACACTTTCGATCAATATATGTTTGTCGAATCTATCCTCCAGAGATGCGCTACTCCGTGCATGGCCAATGGAGAGGTTATAGTGCTGGTGGCTGTCAAGATTATGATACATGGCATCAAAATCCACAATTTCGATTGAGGGCTTCTGGGCCAGATGCATCATATCCTGTACATGTATTCATAACCTTAACCCAGGGGGTGAGCTTCTCGAGGACTGCTGCTGGTTTTAGGAACTATCAGTCAAGTCATGATTCAATGATGTTCTACATTGGAATGAGAATCTTGAAAACCCGTGGACGTCGAGCAGCTTACAATATTTACCTGCATGAATCCGTTGGGGGCACAGACTATGTCAATTCCCGTGAAATATCATGTGAGATGGTTCTGGAACCTGATCCAAAGGGCTACACAATTGTGCCTACAACGATACACCCAGGCGAAGAAGCACCATTTGTCCTCTCTGTCTTCACTAAAGCATCTATAACCTTGGATGTTTTATAA

Protein sequence

MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDLLCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRTQWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILLLLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDVDEYVRKAYKFAYSDCMEVGPLASLPEPPDPNELASHLGLLYVGSVLVLVAYSILYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVHYWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSSSSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSDKSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTSTSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKSLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFSRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVVCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVSAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSSAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWKDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWASNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSPPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVSAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTNDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLDSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLEKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVAQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLFGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIITMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEGAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDPADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQPCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEWVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLTGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAYSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSWQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYPVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTDYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Homology
BLAST of HG10007350 vs. NCBI nr
Match: XP_038879134.1 (calpain-type cysteine protease DEK1 isoform X1 [Benincasa hispida])

HSP 1 Score: 4154.4 bits (10773), Expect = 0.0e+00
Identity = 2131/2210 (96.43%), Postives = 2142/2210 (96.92%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEG EHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGGEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIVVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPA QCT DGNNWNG+LCRA SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPATQCTADGNNWNGILCRAGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KSMDSGRPSLALRSSSCRSIIQEPD AMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST
Sbjct: 481  KSMDSGRPSLALRSSSCRSIIQEPDVAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPT+GDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTSGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFL IGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLCIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGI  AVCVLDDEPIGCGKHCGQIEASLCQSRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGIHDAVCVLDDEPIGCGKHCGQIEASLCQSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKK+CWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKVCWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALHAAISSTD+NMIDFAEDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDFNMIDFAEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+ELIKE RIESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELIKEARIESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKI EVIITPRYNDEGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKICEVIITPRYNDEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. NCBI nr
Match: XP_011660057.1 (calpain-type cysteine protease DEK1 [Cucumis sativus] >KGN66358.1 hypothetical protein Csa_007361 [Cucumis sativus])

HSP 1 Score: 4140.1 bits (10736), Expect = 0.0e+00
Identity = 2119/2210 (95.88%), Postives = 2141/2210 (96.88%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGD HKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDGHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIVVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN VESTSKSGPAAQCTVDGNNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNVVESTSKSGPAAQCTVDGNNWNGVLCRVGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDK FD NSSLVVCSSSGLDSQGCESSTST
Sbjct: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKSFDQNSSLVVCSSSGLDSQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELA+LLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELANLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKT+DIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTVDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSI+AFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSILAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATS YLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSAYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AV VAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVSVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLM+VLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMVVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAM AAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMTAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGILGAVCVLDDEPIGCGKHCGQ+EASLC+SRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQVEASLCRSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALH+AISS+D+NMIDFAEDNWEWADSPSRVD+WDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHSAISSSDFNMIDFAEDNWEWADSPSRVDDWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGV+VDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVIVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGR+ESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELVKEGRLESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP YN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPSYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFK+EGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKREGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. NCBI nr
Match: KAA0055719.1 (calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa])

HSP 1 Score: 4138.2 bits (10731), Expect = 0.0e+00
Identity = 2120/2210 (95.93%), Postives = 2138/2210 (96.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGD HKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDGHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIMVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN VESTSKSGPAAQCTVDGNNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNVVESTSKSGPAAQCTVDGNNWNGVLCRVGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KS+DSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFD NSSLVVCSSSGL+SQGCESSTST
Sbjct: 481  KSLDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDQNSSLVVCSSSGLESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELA+LLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELANLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATS YLGWAMASAISL+VTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSAYLGWAMASAISLIVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AV VAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVSVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLM+VLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMVVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFL+LYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLMLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFL KETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLGKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDP VMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPAVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAM AAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMTAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGILGAVCVLDDEPIGCGKHCGQ+EASLCQSRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQVEASLCQSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITK DRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKSDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALHAAISSTD+NMI FAEDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDFNMIHFAEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGR+ESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELVKEGRLESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP YN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPSYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. NCBI nr
Match: XP_008451014.1 (PREDICTED: calpain-type cysteine protease DEK1 [Cucumis melo] >XP_008451015.1 PREDICTED: calpain-type cysteine protease DEK1 [Cucumis melo])

HSP 1 Score: 4136.3 bits (10726), Expect = 0.0e+00
Identity = 2119/2210 (95.88%), Postives = 2137/2210 (96.70%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGD HKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDGHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIMVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN VESTSKSGPAAQCTVDGNNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNVVESTSKSGPAAQCTVDGNNWNGVLCRVGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KS+DSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFD NSSLVVCSSSGL+SQGCESSTST
Sbjct: 481  KSLDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDQNSSLVVCSSSGLESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELA+LLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELANLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATS YLGWAMASAISL+VTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSAYLGWAMASAISLIVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AV VAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVSVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLM+VLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMVVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFL+LYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLMLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFL KETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLGKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDP VMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPAVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGF DLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFPDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAM AAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMTAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGILGAVCVLDDEPIGCGKHCGQ+EASLCQSRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQVEASLCQSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITK DRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKSDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALHAAISSTD+NMI FAEDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDFNMIHFAEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGR+ESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELVKEGRLESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP YN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPSYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. NCBI nr
Match: XP_022960712.1 (calpain-type cysteine protease DEK1-like [Cucurbita moschata])

HSP 1 Score: 4090.8 bits (10608), Expect = 0.0e+00
Identity = 2085/2210 (94.34%), Postives = 2128/2210 (96.29%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSLSAWILVISPIAVLI+WGCWLIVIL RDI GLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGLLSLSAWILVISPIAVLIVWGCWLIVILDRDIIGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLC+YELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCSYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSV+VLV YSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVVVLVGYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWN+GACLYGFQLLKSGVLALFVAGMSRVFLICFGV+
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNIGACLYGFQLLKSGVLALFVAGMSRVFLICFGVN 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLS+TDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSSTDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN  ESTSKSGPAAQCTVD NNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNGTESTSKSGPAAQCTVDANNWNGVLCRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KSMDSGRPSLALRSSSCRSIIQEPDAAMSF DKIFDH+SSLVVCSSSG +SQGCESSTST
Sbjct: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFTDKIFDHHSSLVVCSSSGPESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITS+LKRSSRQG+REL SLLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSILKRSSRQGERELTSLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREH+DNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHQDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIET IVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV+
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETTIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVI 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSL AEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLG+SLTVPLMVA
Sbjct: 721  CSIMAFLQSLLAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGMSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR++ TKEGIVLVICM LFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTIGTKEGIVLVICMLLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMA+ ISLVVTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMAATISLVVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AVCVAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLS+GVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS
Sbjct: 961  DDGWRLSQGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLP YVYDAHADCGK+VSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPAYVYDAHADCGKDVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKET+IQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETVIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDPT+MRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHE+TN
Sbjct: 1201 SSAALLVGDPTIMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHESTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARIL LEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILGLEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRR+FEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRRFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV+
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVS 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGI GAVCVLDDEPIGCGKHCGQIEAS+CQ+RKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGIHGAVCVLDDEPIGCGKHCGQIEASICQTRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTID DLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTE+WVGVRPPTD+DIFGRSDSEG
Sbjct: 1561 MTIDVDLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEMWVGVRPPTDIDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIA+LHAA+SSTD+NM+DF EDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIASLHAAMSSTDFNMLDFTEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVV+DVDSF RKFRRPR ETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVLDVDSFARKFRRPRTETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGRI+SQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELLKEGRIDSQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVS+LEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSMLEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLG GSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGVGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRT+AGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTSAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. ExPASy Swiss-Prot
Match: Q8RVL2 (Calpain-type cysteine protease DEK1 OS=Arabidopsis thaliana OX=3702 GN=DEK1 PE=1 SV=1)

HSP 1 Score: 3440.2 bits (8919), Expect = 0.0e+00
Identity = 1743/2211 (78.83%), Postives = 1940/2211 (87.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDE  V+LACVISG+LF+V GS SF+ILWAVNWRPWR+YSWIFARKWP +LQGPQLD+
Sbjct: 1    MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSL AWI+V+SPIA+LI WG WLIVIL R I GLA++MAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVA+LL
Sbjct: 121  QWQSSR------------------------------------------------AVALLL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LL VALLCAYELCAVYVTAG+ AS++YSPSGFFFG+SAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGP+A LPEPPDPNEL       ASHLGLLY+GS++VL+AYS+
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLYLGSLVVLLAYSV 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTA+E+RWLG  TSAAVI+LDWN+GACLYGF+LL++ VLALFVAG+SR+FLICFG+H
Sbjct: 301  LYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAGISRLFLICFGIH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISY  VASVL GAAV RHLS TDP AARRDALQSTVIRLREGFRRKE NSSSS
Sbjct: 361  YWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLREGFRRKEQNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSS+KRSSS++AGH G   E+   +  A  CT D       L R  SSQEGINSD
Sbjct: 421  SSDGCGSSIKRSSSIDAGHTGCTNEA---NRTAESCTADN------LTRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMS-FVDKIFDHNSSLVVCSSSGLDSQGCESSTS 540
            KS +SGRPSL LRSSSCRS++QEP+A  S F+DK+ D N++LVVCSSSGLDSQG ESSTS
Sbjct: 481  KSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYESSTS 540

Query: 541  TSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEK 600
             SANQQ LD+NLALA Q++L++PRI S+LK+ +++GD EL +LLQ+KGLDPNFA+MLKEK
Sbjct: 541  NSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVMLKEK 600

Query: 601  SLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQF 660
            +LDPTILALLQRSSLDADR+HRDNTDITIIDSNSVDN LPNQISLSEELRL GLEKWL+ 
Sbjct: 601  NLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKWLKL 660

Query: 661  SRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV 720
            SRL+LH+VAGTPERAW +FSLVFI+ETIIVAIFRPKTI IIN+ HQQFEFGF+VLLLSPV
Sbjct: 661  SRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLLSPV 720

Query: 721  VCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMV 780
            VCSIMAFL+SLQ EEM++TSK RKYGF+AWLLSTSVGL LSFLSKSSVLLG+SLTVPLM 
Sbjct: 721  VCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVPLMA 780

Query: 781  ACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIV 840
            ACLS+A+PIW+ NGYQFW+P++ C   A + RS   K G +L IC+ LF+GSVI+LGAI+
Sbjct: 781  ACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIK-GFILWICVVLFAGSVISLGAII 840

Query: 841  SAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFS 900
            SAKPL+DL+YK ++  + + +SPY +SVYLGWAM+S I+LVVT +LPIVSWF+TYRFS S
Sbjct: 841  SAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFSHS 900

Query: 901  SAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKW 960
            SAVC+ IF+VVLV FCG SYLEVVKSRDD +PT GDFLAALLPL CIPALLSLC G+ KW
Sbjct: 901  SAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMVKW 960

Query: 961  KDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWA 1020
            KDD W LSRGVY F  IGLLLL GAI+AVI V KPWTIG +FLLVL ++V+ IG +H WA
Sbjct: 961  KDDCWILSRGVYVFFSIGLLLLFGAIAAVIAV-KPWTIGVSFLLVLFLMVVTIGVIHLWA 1020

Query: 1021 SNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLS 1080
            SNNFYLTR Q   VCFLA LL LAAFL+GW + K F GASVGYF FL LLAGRAL VLLS
Sbjct: 1021 SNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLLS 1080

Query: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAV 1140
            PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASL+IYPPFAGAAV
Sbjct: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAAV 1140

Query: 1141 SAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRS 1200
            SAITLVV+FGFAVSRPCLTL+MM+ AV FLSK+TI+QAISRSATKTRNALSGTYSAPQRS
Sbjct: 1141 SAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQRS 1200

Query: 1201 ASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETT 1260
            ASSAALLVGDP+ MRD+AGNFVLPR DVMKLRDRLRNEE VAGS F +++ R+ F HE  
Sbjct: 1201 ASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEPP 1260

Query: 1261 NDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320
             +VD+RR MCAHAR+LALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL
Sbjct: 1261 TNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320

Query: 1321 DSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL 1380
            DSIGFSDLSA+KI KW PEDRRQFEIIQESY+REKEMEEE LMQRREEEGRGKERRKALL
Sbjct: 1321 DSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKALL 1380

Query: 1381 EKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV 1440
            EKEERKWKEIEASL+ SIPNAG REAAAMAAA+RAVGGDSVLEDSFARERVS IARRIR 
Sbjct: 1381 EKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIRT 1440

Query: 1441 AQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCL 1500
            AQL RRA QTGI GAVCVLDDEP+  GKHCGQ+++S+CQS+KIS S+ A+IQ +SGPVCL
Sbjct: 1441 AQLERRAQQTGISGAVCVLDDEPMISGKHCGQMDSSVCQSQKISFSVTAMIQSDSGPVCL 1500

Query: 1501 FGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHII 1560
            FGTE+QKK+CWE LVAGSEQGIEAGQVGLRLITKG+RQ+TV +EW I ATSI DGRWH +
Sbjct: 1501 FGTEFQKKVCWEILVAGSEQGIEAGQVGLRLITKGERQTTVAREWYIGATSITDGRWHTV 1560

Query: 1561 TMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSE 1620
            T+TIDAD GEATCY+DGGFDGYQ GLPL++G  IWEQG E+W+GVRPP DVD FGRSDS+
Sbjct: 1561 TITIDADAGEATCYIDGGFDGYQNGLPLSIGSAIWEQGAEVWLGVRPPIDVDAFGRSDSD 1620

Query: 1621 GAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSD 1680
            G ESKMHIMDVFLWG+ L+E+E A+LHAAI   D +MID ++DNW+W DSP RVD WDSD
Sbjct: 1621 GVESKMHIMDVFLWGKCLSEEEAASLHAAIGMADLDMIDLSDDNWQWTDSPPRVDGWDSD 1680

Query: 1681 PADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSV 1740
            PADVDLYDRDDVDWDGQYSSGRKRR  RD  V+ VDSF R+ R+PRMET E+INQRM SV
Sbjct: 1681 PADVDLYDRDDVDWDGQYSSGRKRRSGRD-FVMSVDSFARRHRKPRMETQEDINQRMRSV 1740

Query: 1741 ELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQ 1800
            ELAVKEALSARG+  FTD+EFPPND SL+VD +NPPSKLQVVSEWMRP  ++KE   +S+
Sbjct: 1741 ELAVKEALSARGDKQFTDQEFPPNDRSLFVDTQNPPSKLQVVSEWMRPDSIVKENGSDSR 1800

Query: 1801 PCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSE 1860
            PCLFS  ANPSDVCQGRLGDCWFLSAVAVLTE S+ISEVIITP YN+EGIYTVRFCIQ E
Sbjct: 1801 PCLFSGDANPSDVCQGRLGDCWFLSAVAVLTEVSRISEVIITPEYNEEGIYTVRFCIQGE 1860

Query: 1861 WVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDL 1920
            WVPVV+DDWIPCESPGKPAFATS+K NELWVS++EKAYAKLHGSYEALEGGLVQDALVDL
Sbjct: 1861 WVPVVIDDWIPCESPGKPAFATSRKLNELWVSMVEKAYAKLHGSYEALEGGLVQDALVDL 1920

Query: 1921 TGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHA 1980
            TGGAGEEID+RSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVH+SSSGIVQGHA
Sbjct: 1921 TGGAGEEIDLRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHVSSSGIVQGHA 1980

Query: 1981 YSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMS 2040
            YS+LQVREVDGH+L+QIRNPWANEVEWNGPW+D+SPEWTDRMKHKLKH+PQSK+GIFWMS
Sbjct: 1981 YSVLQVREVDGHRLVQIRNPWANEVEWNGPWSDSSPEWTDRMKHKLKHVPQSKEGIFWMS 2040

Query: 2041 WQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASY 2100
            WQDFQIHFRSIYVCR+YP EMRYSV+GQWRGYSAGGCQDY +WHQNPQFRLRA+G DAS 
Sbjct: 2041 WQDFQIHFRSIYVCRVYPREMRYSVNGQWRGYSAGGCQDYSSWHQNPQFRLRATGSDASL 2100

Query: 2101 PVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGT 2160
            P+HVFITLTQGV FSRT  GFRNYQSSHDS +FYIG+RILKTRGRRAAYNI+LHESVGGT
Sbjct: 2101 PIHVFITLTQGVGFSRTTPGFRNYQSSHDSQLFYIGLRILKTRGRRAAYNIFLHESVGGT 2151

Query: 2161 DYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            DYVNSREISCEMVL+PDPKGYTIVPTTIHPGEEAPFVLSVFTKASI L+ L
Sbjct: 2161 DYVNSREISCEMVLDPDPKGYTIVPTTIHPGEEAPFVLSVFTKASIVLEAL 2151

BLAST of HG10007350 vs. ExPASy Swiss-Prot
Match: Q6ZFZ4 (Calpain-type cysteine protease ADL1 OS=Oryza sativa subsp. japonica OX=39947 GN=ADL1 PE=1 SV=1)

HSP 1 Score: 3147.8 bits (8160), Expect = 0.0e+00
Identity = 1609/2218 (72.54%), Postives = 1851/2218 (83.45%), Query Frame = 0

Query: 1    MEGDEHK-VVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLD 60
            ME +EH+ VVL C I G LF+VLG  SF+ILWAVNWRPWR+YSWI+ARKWP  +QGPQL 
Sbjct: 1    MEEEEHRGVVLVCSICGFLFAVLGPLSFWILWAVNWRPWRLYSWIYARKWPAYVQGPQLS 60

Query: 61   LLCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWR 120
             LC F +L AW++V+SPI VL++WG  LI +L R+I GLAV+M G ALLL+FYSIMLWWR
Sbjct: 61   TLCSFFTLFAWLVVVSPITVLLVWGGILIALLERNIIGLAVIMVGVALLLSFYSIMLWWR 120

Query: 121  TQWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAIL 180
            TQWQSS+                                                AVA L
Sbjct: 121  TQWQSSK------------------------------------------------AVAYL 180

Query: 181  LLLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLD 240
            LLLAV LLCAYE CAVYVT G+SASE  SPSGFFFG+SAI+LAINMLFI +++FNG+G D
Sbjct: 181  LLLAVGLLCAYEFCAVYVTTGASASELNSPSGFFFGVSAISLAINMLFISKILFNGSGFD 240

Query: 241  VDEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYS 300
            VDEYVR+ YKFAYSDC+EV P++  P+PPDP+EL         HLGLLY+ S++VLV YS
Sbjct: 241  VDEYVRRLYKFAYSDCVEVAPVSCSPDPPDPSELYMTKSSRVLHLGLLYLCSLMVLVVYS 300

Query: 301  ILYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGV 360
            ILYGLT+KEARWLGA TS AV+ILDWN+G C + F+LLKS ++ALFVAG SRVFLICFGV
Sbjct: 301  ILYGLTSKEARWLGALTSVAVVILDWNLGLCSFRFELLKSRMIALFVAGTSRVFLICFGV 360

Query: 361  HYWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSS 420
            HYWYLGHCISYA VASVLL AAV   LS ++P  AR DAL+STVI+LREGFRRK   SSS
Sbjct: 361  HYWYLGHCISYAFVASVLLAAAVSCWLSISNPSVARIDALRSTVIKLREGFRRKGQTSSS 420

Query: 421  SSSDGCGSSMKRSS-SVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGV-LCRASSSQEGI 480
            +SSDGCGSS+KRSS SVEAG  GNA +S  +S   + C     NWN V   R++S QEG 
Sbjct: 421  NSSDGCGSSVKRSSGSVEAGPHGNATDSMYRSNSQSDCV----NWNNVPFDRSNSCQEGQ 480

Query: 481  NSDKSMDSGRPSLALRSSSCRS--IIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCE 540
            +SDK++DSGR SLA RS+SC S   +Q+P+ A+   D+  D  +SLVVCSSSGL+SQGCE
Sbjct: 481  SSDKNIDSGRASLAHRSNSCLSAVAVQDPETAVVSADRHGDPTASLVVCSSSGLESQGCE 540

Query: 541  SSTSTSA--NQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFA 600
            SS S +A  NQQ LDLNLA   Q+RL+DPRITSMLKR+   GD ELA+LLQ+KGLDPNF+
Sbjct: 541  SSGSATASGNQQLLDLNLAAIFQDRLNDPRITSMLKRNGGLGDVELANLLQDKGLDPNFS 600

Query: 601  MMLKEKSLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGL 660
             M+K+K +DP ILALLQRSSLDADREH+D+ D+T  DS+ +D  + NQISLSEELR  GL
Sbjct: 601  YMMKDKVMDPRILALLQRSSLDADREHQDDVDVTGTDSDRLDTTIANQISLSEELRRSGL 660

Query: 661  EKWLQFSRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAV 720
            E WL  SRL+ H VAG+P RA+V+F+L+FIIET+ VA+ RPK I +INA H+QFEFGF++
Sbjct: 661  ENWLNLSRLMFHQVAGSPIRAFVVFTLIFIIETVTVAVHRPKPIKVINATHEQFEFGFSI 720

Query: 721  LLLSPVVCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSL 780
            LLLSPVVCSIMAF+ SL AEEM+MTSKPRKYGFIAWLLST VGLLLSFLSKSSV+LGLSL
Sbjct: 721  LLLSPVVCSIMAFIWSLCAEEMTMTSKPRKYGFIAWLLSTCVGLLLSFLSKSSVILGLSL 780

Query: 781  TVPLMVACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVI 840
            TVPLMVACLS AIPIW+RNGY+FWIP  +       +++   KE  +  I +++F+ SVI
Sbjct: 781  TVPLMVACLSFAIPIWMRNGYRFWIPGGELDSRENIRQAPGKKERALFAISITVFTASVI 840

Query: 841  ALGAIVSAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFST 900
             LGAIVSAKPL+ L YKGW  D KSF SPYATS+YLGWA++S I+++ TGV+PIV+WF+T
Sbjct: 841  GLGAIVSAKPLDALGYKGWDADKKSFYSPYATSMYLGWALSSTIAVLATGVIPIVAWFAT 900

Query: 901  YRFSFSSAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLC 960
            YRFS SSA+CV +F  VLV FCG SY  VV SR DGVP   DFLAALLPL+CIPA+ SL 
Sbjct: 901  YRFSPSSAICVGLFATVLVSFCGVSYWGVVNSRQDGVPLKADFLAALLPLLCIPAVFSLF 960

Query: 961  SGLYKWKDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIG 1020
            +G+YKWKDD W++SRGVY F+ +G+LLLLGAISAVIV I+PWT+G A LLV+L +V AIG
Sbjct: 961  TGMYKWKDDDWKISRGVYLFVGMGVLLLLGAISAVIVTIRPWTVGVACLLVILFLVFAIG 1020

Query: 1021 SVHHWASNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRA 1080
             +H+W SNNFYLTRTQM LVC LAFLLALAAFL+G F+ KPFVGAS+GYF FLFLL GRA
Sbjct: 1021 VIHYWTSNNFYLTRTQMLLVCSLAFLLALAAFLMGLFQEKPFVGASIGYFSFLFLLTGRA 1080

Query: 1081 LTVLLSPPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPP 1140
            LTVLLSPPIVVYSPRVLPVYVYDAHAD  KNVS AFL+LYGIALATE WGV+ASL++ PP
Sbjct: 1081 LTVLLSPPIVVYSPRVLPVYVYDAHADSAKNVSYAFLILYGIALATEVWGVIASLILNPP 1140

Query: 1141 FAGAAVSAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTY 1200
            F GAA+SAITLV++F FAVSRPCLTLKM++DAVHFLSK+T++QA+SRSA KTRNA+SGTY
Sbjct: 1141 FIGAAISAITLVIAFSFAVSRPCLTLKMLEDAVHFLSKDTVVQAMSRSANKTRNAISGTY 1200

Query: 1201 SAPQRSASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRP 1260
            SAPQRSASSAALLVGDP +  DRAGNFVLPRADVMKLRDRLRNEE+ AGSFFC +  +  
Sbjct: 1201 SAPQRSASSAALLVGDPAITLDRAGNFVLPRADVMKLRDRLRNEEITAGSFFCGV--KNC 1260

Query: 1261 FFHETTNDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEV 1320
                +  DVD+RR MCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAE++QDEV
Sbjct: 1261 LMIGSPVDVDYRRNMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAEQIQDEV 1320

Query: 1321 RLRLFLDSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKE 1380
            RLRLFLDSIG SDLSAK+IKKWMPEDRR FE+IQESYIREKEMEEE+LMQRREEEG+G+E
Sbjct: 1321 RLRLFLDSIGLSDLSAKEIKKWMPEDRRHFELIQESYIREKEMEEEVLMQRREEEGKGRE 1380

Query: 1381 RRKALLEKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSI 1440
            RRKALLE+EERKWKE+E SL+SSIPNAG R+AAAMAAAVRAVGGDS LEDSFAR+RVSSI
Sbjct: 1381 RRKALLEREERKWKELEISLLSSIPNAGSRDAAAMAAAVRAVGGDSALEDSFARDRVSSI 1440

Query: 1441 ARRIRVAQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPE 1500
            AR IR AQLARRA QTGI   VC+LDDEP   G+HCG+I+  LC+S+K+S SIA ++QP 
Sbjct: 1441 ARHIRKAQLARRAEQTGIPDTVCILDDEPRSTGRHCGEIDLCLCESKKVSFSIAVMVQPV 1500

Query: 1501 SGPVCLFGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIAD 1560
            SGPVCLFGTE+QKK+CWE LVAGSEQG+EAGQVGLRL+TKG+R +TV KEW+I A+SIAD
Sbjct: 1501 SGPVCLFGTEFQKKVCWEILVAGSEQGMEAGQVGLRLVTKGERMTTVAKEWNIGASSIAD 1560

Query: 1561 GRWHIITMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIF 1620
            GRWH++T+TIDADLGEAT ++DG +DGYQ  LPL   + IWE GT+IWVG RPPTD+D F
Sbjct: 1561 GRWHLVTVTIDADLGEATSFIDGVYDGYQNALPLPRNNGIWEPGTDIWVGARPPTDLDAF 1620

Query: 1621 GRSDSEGAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDF-AEDNWEWADSPSR 1680
            GRSDSEG++SKM IMD FLWGR LTEDE+A LH AI S +Y + D  AED W  + S +R
Sbjct: 1621 GRSDSEGSDSKMQIMDAFLWGRCLTEDEVAMLHTAICSAEYGLFDLAAEDAWHGSYS-AR 1680

Query: 1681 VDEWDSDPADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEI 1740
            VD+W+S+ A+ +LYD++DV+WDGQYSSGRKR   RD V +D+DSF R+ R+PR ET EE+
Sbjct: 1681 VDDWESEEANFELYDQEDVEWDGQYSSGRKRH-ARDSVAIDIDSFARRPRKPRFETREEV 1740

Query: 1741 NQRMLSVELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIK 1800
            NQRMLSVE AV+EAL A+GE +FTD+EFPP+D SL+VDP NP  KLQVVSEWMRP ++ K
Sbjct: 1741 NQRMLSVERAVREALIAKGERNFTDQEFPPDDRSLFVDPMNPSLKLQVVSEWMRPSDIAK 1800

Query: 1801 EGRIESQPCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTV 1860
            E  I SQPCLFS + N SDVCQGRLGDCWFLSAVAVLTE ++ISEVIITP YN+EGIYTV
Sbjct: 1801 EVSISSQPCLFSGSVNSSDVCQGRLGDCWFLSAVAVLTEMARISEVIITPEYNEEGIYTV 1860

Query: 1861 RFCIQSEWVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLV 1920
            RFCIQ EWV VVVDDWIPCESPGKPAFATS+K NELWVSILEKAYAKLHGSYEALEGGLV
Sbjct: 1861 RFCIQGEWVAVVVDDWIPCESPGKPAFATSRKQNELWVSILEKAYAKLHGSYEALEGGLV 1920

Query: 1921 QDALVDLTGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSS 1980
            QDALVDLTGGAGEEIDMRS QAQIDLASGRLWSQLL FKQEGFLLGAGSPSGSD HISSS
Sbjct: 1921 QDALVDLTGGAGEEIDMRSPQAQIDLASGRLWSQLLHFKQEGFLLGAGSPSGSDAHISSS 1980

Query: 1981 GIVQGHAYSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSK 2040
            GIVQGHAYS+LQVREVDGHKL+QIRNPWANEVEWNGPW+D+S EWT+RMKHKLKH+PQSK
Sbjct: 1981 GIVQGHAYSILQVREVDGHKLVQIRNPWANEVEWNGPWSDSSQEWTERMKHKLKHVPQSK 2040

Query: 2041 DGIFWMSWQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRA 2100
            +G+FWMSWQDFQIHFRSIYVCR+YPPEMRYSVHGQWRGYSAGGCQDYD+WHQNPQ+RLR 
Sbjct: 2041 NGVFWMSWQDFQIHFRSIYVCRVYPPEMRYSVHGQWRGYSAGGCQDYDSWHQNPQYRLRV 2100

Query: 2101 SGPDASYPVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYL 2160
            +G DA YPVHVFITLTQGV FSR   GFRNYQSSHDS MFYIGMRILKTRG RAAYNIY+
Sbjct: 2101 TGRDALYPVHVFITLTQGVGFSRKTNGFRNYQSSHDSSMFYIGMRILKTRGCRAAYNIYM 2160

Query: 2161 HESVGGTDYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            HESVGGTDYVNSREISCE+VLEP PKGYTIVPTTIHPGEEAPFVLSVFTKA I L+ +
Sbjct: 2161 HESVGGTDYVNSREISCELVLEPYPKGYTIVPTTIHPGEEAPFVLSVFTKAPIKLEAV 2162

BLAST of HG10007350 vs. ExPASy Swiss-Prot
Match: Q8RVL1 (Calpain-type cysteine protease DEK1 OS=Zea mays OX=4577 GN=DEK1 PE=1 SV=2)

HSP 1 Score: 3093.9 bits (8020), Expect = 0.0e+00
Identity = 1570/2216 (70.85%), Postives = 1837/2216 (82.90%), Query Frame = 0

Query: 1    MEGD-EHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLD 60
            MEG+  H VVLAC I G LF+VL   SF++LWAVNWRPWR+YSWI+ARKWP  +QGPQL 
Sbjct: 1    MEGEGHHGVVLACSICGFLFAVLSPFSFWVLWAVNWRPWRLYSWIYARKWPTYVQGPQLS 60

Query: 61   LLCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWR 120
             LC  L+L AW++VISPIAVL++WG  LI ++ R+I GLAV+MAG ALLL+FYSIMLWWR
Sbjct: 61   TLCSLLTLCAWLVVISPIAVLLVWGSVLIALMERNIIGLAVIMAGVALLLSFYSIMLWWR 120

Query: 121  TQWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAIL 180
            TQWQSS                                                 AVA L
Sbjct: 121  TQWQSSE------------------------------------------------AVAYL 180

Query: 181  LLLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLD 240
            LLLAV LLCAY+ CA+YVTAG+SASE  SPSGFFFG+S I+LAINMLFIC+++FN +G D
Sbjct: 181  LLLAVCLLCAYDFCAIYVTAGASASELNSPSGFFFGVSVISLAINMLFICKILFNVSGFD 240

Query: 241  VDEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYS 300
            VDEYVR++YKFAYSDC+EV P++  PEPPDP+EL         HLGLLY+ S+LVLV YS
Sbjct: 241  VDEYVRRSYKFAYSDCVEVAPVSCSPEPPDPSELYMTKSSRVKHLGLLYISSLLVLVGYS 300

Query: 301  ILYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGV 360
            ILYGLT+KEARWLGA TS AV+ILDWN+G C + F+LLKS ++ LFVAG SR FL+ FGV
Sbjct: 301  ILYGLTSKEARWLGALTSVAVVILDWNLGLCSFRFELLKSRMIVLFVAGTSRAFLVSFGV 360

Query: 361  HYWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSS 420
            HYWYLGHCISYA VASVLL AAV   LS ++P  AR DAL+STVI+LREGFRRK  NSSS
Sbjct: 361  HYWYLGHCISYAFVASVLLSAAVSSWLSISNPSVARIDALRSTVIKLREGFRRKGQNSSS 420

Query: 421  SSSDGCGSSMKRSS-SVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGV-LCRASSSQEGI 480
            +SS+GCGSS+KRSS SVEAG  GNA++S  +S   +    DG NW+ +   R++S QEG 
Sbjct: 421  NSSEGCGSSVKRSSGSVEAGQNGNAMDSMYRSNSQS----DGVNWSSIPFDRSNSCQEGR 480

Query: 481  NSDKSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCE-- 540
            +SDK++DS R SLA RS+SC S +Q+ + A+  VD+  D  +SL VCSSSGL+S GCE  
Sbjct: 481  SSDKNIDSARASLAHRSNSCLSAVQDSETAVVSVDRHGDPITSL-VCSSSGLESHGCEPS 540

Query: 541  SSTSTSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMM 600
             S +TS NQQ LDLNLA   Q+RL+DPRI+SMLK++   GD ELA+LLQ+KGLDPNF+ M
Sbjct: 541  GSATTSGNQQLLDLNLAAIFQDRLNDPRISSMLKKNGGLGDVELANLLQDKGLDPNFSYM 600

Query: 601  LKEKSLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEK 660
            LK+K +DP ILALLQRSSLDADREH+D+ D+T  DS+ +D  + NQISLSEELR  GLEK
Sbjct: 601  LKDKVMDPRILALLQRSSLDADREHQDDVDVTATDSDRLDTTIANQISLSEELRRSGLEK 660

Query: 661  WLQFSRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLL 720
            WL  SRL+ H++AG+P RA+++F+++FIIET  VAI+RP+TI +INA H+QFEFGF++LL
Sbjct: 661  WLNISRLIFHHLAGSPIRAFIVFTVMFIIETATVAIYRPETIKVINATHEQFEFGFSILL 720

Query: 721  LSPVVCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTV 780
            LSPVVCSIMAF+ SL+AEEM MTSKP+KYGFIAWLLST VGL LSFLSKSSV+LGLSLTV
Sbjct: 721  LSPVVCSIMAFIWSLRAEEMLMTSKPQKYGFIAWLLSTCVGLFLSFLSKSSVILGLSLTV 780

Query: 781  PLMVACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIAL 840
            PLMVACLS A+PIWIRNGY FWIP  +        ++   KE  + VI +++F+ S+I L
Sbjct: 781  PLMVACLSFAVPIWIRNGYSFWIPGREFANRENVSQAPGEKERALFVITIAVFTASIIGL 840

Query: 841  GAIVSAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYR 900
            GAIVSAKPL+ L YKGW  D  S  SPYATS+YLGWA++S I+++ TG++PIV+WF+TYR
Sbjct: 841  GAIVSAKPLDALGYKGWDADKNSSYSPYATSMYLGWALSSTIAVITTGLIPIVAWFATYR 900

Query: 901  FSFSSAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSG 960
            FS SSA+CV +F  VLV FCGASY  VV SR+DGVP   DFLAALLPL+CIPA  SL +G
Sbjct: 901  FSPSSAICVGLFATVLVSFCGASYWGVVNSREDGVPLKADFLAALLPLLCIPAFFSLFTG 960

Query: 961  LYKWKDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSV 1020
            LYKWKDD W++SRGVY F+ +G+LLL GA++AVIV I+PWT+G A L+ +L +V  IG +
Sbjct: 961  LYKWKDDDWKISRGVYLFVGMGMLLLFGAVAAVIVTIRPWTVGVACLVAILFLVFVIGVI 1020

Query: 1021 HHWASNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALT 1080
            H+W SNNFYLTRTQM LVC +AFLLALAAFL+G F GKPFVGAS+GYF F+FLL GRALT
Sbjct: 1021 HYWTSNNFYLTRTQMLLVCSIAFLLALAAFLMGLFHGKPFVGASIGYFSFIFLLTGRALT 1080

Query: 1081 VLLSPPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFA 1140
            VLLSPPIVVYSPRVLPVYVYDAHAD  KNVS AFL+LYGIALATE WGV+ASL++ PPF 
Sbjct: 1081 VLLSPPIVVYSPRVLPVYVYDAHADSAKNVSYAFLILYGIALATEVWGVIASLIMNPPFV 1140

Query: 1141 GAAVSAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSA 1200
            GA VSA TLV++F FAVSRPCLTLKMM+DAVHFLSK+T++QA+SRSA KTRNA+SGTYSA
Sbjct: 1141 GAGVSATTLVIAFSFAVSRPCLTLKMMEDAVHFLSKDTVVQAMSRSANKTRNAISGTYSA 1200

Query: 1201 PQRSASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFF 1260
            PQRSASSAALLVGDP +  DRAGNFVLPRADVMKLRDRLRNEE+ AGSF C ++      
Sbjct: 1201 PQRSASSAALLVGDPALTLDRAGNFVLPRADVMKLRDRLRNEEIAAGSFLCGVKDCLLIC 1260

Query: 1261 HETTNDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRL 1320
             ++ +++D+RR MCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAE++QDEVRL
Sbjct: 1261 PQSLSNIDYRRNMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAEQIQDEVRL 1320

Query: 1321 RLFLDSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERR 1380
            RLFLDSIG SDLSAK+IKKWMPEDRRQFE+IQESYIREKEMEEE LMQRREEEG+G+ERR
Sbjct: 1321 RLFLDSIGLSDLSAKEIKKWMPEDRRQFELIQESYIREKEMEEEALMQRREEEGKGRERR 1380

Query: 1381 KALLEKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIAR 1440
            +ALLE+EERKWKE+E SL+SSIPN G R+AAAMAAAVRAVGGDS LEDSFAR+RVSSIA 
Sbjct: 1381 RALLEREERKWKELEISLLSSIPNTGSRDAAAMAAAVRAVGGDSALEDSFARDRVSSIAN 1440

Query: 1441 RIRVAQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESG 1500
             IR AQLARRA QTGI G +C+LDDEP   G+HCG+++  LCQS+K+++SIA ++QP SG
Sbjct: 1441 HIRKAQLARRAEQTGIPGTICILDDEPRSTGRHCGELDLCLCQSQKVTLSIAVMVQPVSG 1500

Query: 1501 PVCLFGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGR 1560
            PVCLFG+E+Q K+CWE LVAGSEQG+EAGQVGLRL+TKG+R +TV KEW+I A+SIADGR
Sbjct: 1501 PVCLFGSEFQ-KVCWEILVAGSEQGMEAGQVGLRLVTKGERMTTVAKEWNIGASSIADGR 1560

Query: 1561 WHIITMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGR 1620
            WH++T+T+DADLGEAT ++DG +DGYQ GLPL   + IWE GT+IWVG RPP D+D FGR
Sbjct: 1561 WHLVTVTLDADLGEATSFIDGVYDGYQNGLPLPTDNGIWEPGTDIWVGARPPMDLDAFGR 1620

Query: 1621 SDSEGAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAE-DNWEWADSPSRVD 1680
            SDSEG++SKM IMD FLWGR L+EDE+  LH A+S  +Y   D A  D W  + S +RVD
Sbjct: 1621 SDSEGSDSKMQIMDAFLWGRCLSEDEVTVLHTAMSPAEYGFFDLAPGDAWHGSYS-ARVD 1680

Query: 1681 EWDSDPADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQ 1740
            +W+S+ A  +LYD+ DV+WDGQYSSGRKR +  D V +D+DSF R+ R+PR ET +E+NQ
Sbjct: 1681 DWESEEA-YELYDQGDVEWDGQYSSGRKRPV-HDAVAIDLDSFARRPRKPRFETRDEVNQ 1740

Query: 1741 RMLSVELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEG 1800
            RMLSVE AV++AL A+GE +FTD+EFPP D SL+VDP NPP KLQVVSEWMRP ++ K+ 
Sbjct: 1741 RMLSVERAVRDALIAKGERNFTDQEFPPEDRSLFVDPMNPPLKLQVVSEWMRPSDIAKDI 1800

Query: 1801 RIESQPCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRF 1860
             I  QPCLFS + N SDVCQGRLGDCWFLSAVAVLTE S+ISEVIITP YNDEGIYTVRF
Sbjct: 1801 SISCQPCLFSGSVNSSDVCQGRLGDCWFLSAVAVLTEMSRISEVIITPEYNDEGIYTVRF 1860

Query: 1861 CIQSEWVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQD 1920
            CIQ EWV VVVDDWIPCESPGKPAFATS+K NELWVSILEKAYAKLHGSYEALEGGLVQD
Sbjct: 1861 CIQGEWVAVVVDDWIPCESPGKPAFATSRKQNELWVSILEKAYAKLHGSYEALEGGLVQD 1920

Query: 1921 ALVDLTGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGI 1980
            ALVDLTGGAGEEIDMRS QAQ+DLASGRLWSQLL FKQEGFLLGAGSPSGSD HISSSGI
Sbjct: 1921 ALVDLTGGAGEEIDMRSPQAQLDLASGRLWSQLLHFKQEGFLLGAGSPSGSDAHISSSGI 1980

Query: 1981 VQGHAYSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDG 2040
            VQGHAYS+LQVREVDGHKLIQIRNPWANEVEWNGPW+D+SPEWT+RMKHKL H+PQSK+G
Sbjct: 1981 VQGHAYSILQVREVDGHKLIQIRNPWANEVEWNGPWSDSSPEWTERMKHKLMHVPQSKNG 2040

Query: 2041 IFWMSWQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASG 2100
            +FWMSWQDFQIHFRSIYVCR+YPPEMRYSVHGQWRGY+AGGCQDYD+WHQNPQ+RLR +G
Sbjct: 2041 VFWMSWQDFQIHFRSIYVCRVYPPEMRYSVHGQWRGYNAGGCQDYDSWHQNPQYRLRVTG 2100

Query: 2101 PDASYPVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHE 2160
             DA YPVHVFITLTQGV FSR   GFRNYQSSHDS MFYIGMRILKT+G RAAYNIY+HE
Sbjct: 2101 RDALYPVHVFITLTQGVGFSRKTNGFRNYQSSHDSSMFYIGMRILKTQGCRAAYNIYMHE 2159

Query: 2161 SVGGTDYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            S GGTDYVNSREISCE+VL+P PKGYTIVPTTIHPGEEAPFVLSVF+KASI L+ +
Sbjct: 2161 SAGGTDYVNSREISCELVLDPYPKGYTIVPTTIHPGEEAPFVLSVFSKASIRLEAV 2159

BLAST of HG10007350 vs. ExPASy Swiss-Prot
Match: P34308 (Calpain clp-1 OS=Caenorhabditis elegans OX=6239 GN=clp-1 PE=2 SV=4)

HSP 1 Score: 241.5 bits (615), Expect = 8.9e-62
Identity = 171/493 (34.69%), Postives = 249/493 (50.51%), Query Frame = 0

Query: 1748 FTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQPCLFSEAANPSDVCQ 1807
            F D +F  ND SL+   K PP ++    EW+RP      G I  +P L +E  +  DV Q
Sbjct: 317  FEDPQFLANDSSLFFS-KRPPKRV----EWLRP------GEITREPQLITEGHSRFDVIQ 376

Query: 1808 GRLGDCWFLSAVAVLTEASKISEVIITP----RYNDEGIYTVRFCIQSEWVPVVVDDWIP 1867
            G LGDCW L+A A LT   ++   ++ P      N  GI+  +F    +WV VV+DD +P
Sbjct: 377  GELGDCWLLAAAANLTLKDELFYRVVPPDQSFTENYAGIFHFQFWQYGKWVDVVIDDRLP 436

Query: 1868 CESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLTGGAGEEIDMR 1927
              S G+  +  S   NE W ++LEKAYAKL GSYEAL+GG   +AL D+TGG  E ID++
Sbjct: 437  -TSNGELLYMHSASNNEFWSALLEKAYAKLFGSYEALKGGTTSEALEDMTGGLTEFIDLK 496

Query: 1928 SAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHIS--SSGIVQGHAYSLLQVREV 1987
            +           L   ++R  + G L G    +  +V  +  S+G+V+GHAYS+   R V
Sbjct: 497  NPPR-------NLMQMMMRGFEMGSLFGCSIEADPNVWEAKMSNGLVKGHAYSITGCRIV 556

Query: 1988 DGHK----LIQIRNPWANEVEWNGPWADTSPEW---TDRMKHKLKHIPQSKDGIFWMSWQ 2047
            DG      +++IRNPW NE EWNGPW+D S EW    D +K  +  +    DG FWMS+ 
Sbjct: 557  DGPNGQTCILRIRNPWGNEQEWNGPWSDNSREWRSVPDSVKQDM-GLKFDHDGEFWMSFD 616

Query: 2048 DFQIHFRSIYVCRIYPPEM----------------RYSVH-GQW-RGYSAGGCQDY-DTW 2107
            DF  +F  + +C + P  M                  + H G W R  +AGGC++Y +T+
Sbjct: 617  DFMRNFEKMEICNLGPDVMDEVYQMTGVKAAGMVWAANTHDGAWVRNQTAGGCRNYINTF 676

Query: 2108 HQNPQFRLRASGPDASYPVHVFITLTQGVSFSRTAAGFRNY-QSSHDSMMFYIGMRIL-- 2167
              NPQFR++ +  D      +       V F+      RN  Q   D++   IG  +   
Sbjct: 677  ANNPQFRVQLTDSDPDDDDELCT-----VIFAVLQKYRRNLKQDGLDNVP--IGFAVYDA 736

Query: 2168 -KTRGRRAAYNIYLHES-VGGTDYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVL 2204
               RGR +      ++S +    ++N RE++    + P    Y +VP+T  P EEA F+L
Sbjct: 737  GNNRGRLSKQFFAANKSAMRSAAFINLREMTGRFRVPPG--NYVVVPSTFEPNEEAEFML 780

BLAST of HG10007350 vs. ExPASy Swiss-Prot
Match: Q9VT65 (Calpain-B OS=Drosophila melanogaster OX=7227 GN=CalpB PE=1 SV=2)

HSP 1 Score: 238.0 bits (606), Expect = 9.9e-61
Identity = 157/497 (31.59%), Postives = 251/497 (50.50%), Query Frame = 0

Query: 1735 AVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQPC 1794
            +++++  A G M F D +FP  + SL    +  P +     EW+RP      G I   P 
Sbjct: 248  SLRDSCLANGTM-FEDPDFPATNASLMYSRR--PDRYY---EWLRP------GDIADDPQ 307

Query: 1795 LFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP----RYNDEGIYTVRFCIQ 1854
             F E  +  DV QG LGDCW L+A A LT+ S +   +I P    + N  GI+  +F   
Sbjct: 308  FFVEGYSRFDVQQGELGDCWLLAAAANLTQDSTLFFRVIPPDQDFQENYAGIFHFKFWQY 367

Query: 1855 SEWVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALV 1914
             +WV VV+DD +P  + G+  +  S + NE W ++LEKAYAKLHGSYEAL+GG   +A+ 
Sbjct: 368  GKWVEVVIDDRLPTYN-GELIYMHSTEKNEFWSALLEKAYAKLHGSYEALKGGTTCEAME 427

Query: 1915 DLTGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHI----SSSG 1974
            D TGG  E  D++ A          L+S +++  + G ++G       D H+    +  G
Sbjct: 428  DFTGGVTEWYDIKEAPP-------NLFSIMMKAAERGSMMGCSLE--PDPHVLEAETPQG 487

Query: 1975 IVQGHAYSLLQVREVD--------GHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKL 2034
            +++GHAYS+ +V  +D           +I++RNPW N+ EW+GPW+D+SPEW    +H  
Sbjct: 488  LIRGHAYSITKVCLMDISTPNRQGKLPMIRMRNPWGNDAEWSGPWSDSSPEWRFIPEHTK 547

Query: 2035 KHIPQS--KDGIFWMSWQDFQIHFRSIYVCRIYP----PEMRYSVHGQWR---------- 2094
            + I  +  +DG FWMS+QDF  HF  + +C + P     + ++S   +W           
Sbjct: 548  EEIGLNFDRDGEFWMSFQDFLNHFDRVEICNLSPDSLTEDQQHSSRRKWEMSMFEGEWTS 607

Query: 2095 GYSAGGCQDY-DTWHQNPQFRLRASGP---DASYPVHVFITLTQGVSFSRTAAGFRNYQS 2154
            G +AGGC+++ +T+  NPQ+ +    P   D        + L Q    S+   G      
Sbjct: 608  GVTAGGCRNFLETFWHNPQYIISLEDPDDEDDDGKCTAIVALMQKNRRSKRNVGIDCLTI 667

Query: 2155 SHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTDYVNSREISCEMVLEPDPKGYTIVPT 2196
                 ++++  R ++ + +   +  Y         ++N+RE+     L P    Y IVP+
Sbjct: 668  GF--AIYHLTDRDMQVKPQGLNFFKYRASVARSPHFINTREVCARFKLPPG--HYLIVPS 718

BLAST of HG10007350 vs. ExPASy TrEMBL
Match: A0A0A0LX51 (Calpain catalytic domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G599520 PE=3 SV=1)

HSP 1 Score: 4140.1 bits (10736), Expect = 0.0e+00
Identity = 2119/2210 (95.88%), Postives = 2141/2210 (96.88%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGD HKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDGHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIVVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN VESTSKSGPAAQCTVDGNNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNVVESTSKSGPAAQCTVDGNNWNGVLCRVGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDK FD NSSLVVCSSSGLDSQGCESSTST
Sbjct: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKSFDQNSSLVVCSSSGLDSQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELA+LLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELANLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKT+DIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTVDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSI+AFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSILAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATS YLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSAYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AV VAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVSVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLM+VLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMVVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAM AAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMTAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGILGAVCVLDDEPIGCGKHCGQ+EASLC+SRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQVEASLCRSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALH+AISS+D+NMIDFAEDNWEWADSPSRVD+WDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHSAISSSDFNMIDFAEDNWEWADSPSRVDDWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGV+VDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVIVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGR+ESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELVKEGRLESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP YN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPSYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFK+EGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKREGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. ExPASy TrEMBL
Match: A0A5A7UQG4 (Calpain-type cysteine protease DEK1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold181G001020 PE=3 SV=1)

HSP 1 Score: 4138.2 bits (10731), Expect = 0.0e+00
Identity = 2120/2210 (95.93%), Postives = 2138/2210 (96.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGD HKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDGHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIMVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN VESTSKSGPAAQCTVDGNNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNVVESTSKSGPAAQCTVDGNNWNGVLCRVGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KS+DSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFD NSSLVVCSSSGL+SQGCESSTST
Sbjct: 481  KSLDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDQNSSLVVCSSSGLESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELA+LLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELANLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATS YLGWAMASAISL+VTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSAYLGWAMASAISLIVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AV VAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVSVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLM+VLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMVVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFL+LYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLMLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFL KETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLGKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDP VMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPAVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAM AAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMTAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGILGAVCVLDDEPIGCGKHCGQ+EASLCQSRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQVEASLCQSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITK DRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKSDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALHAAISSTD+NMI FAEDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDFNMIHFAEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGR+ESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELVKEGRLESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP YN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPSYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. ExPASy TrEMBL
Match: A0A1S3BRA0 (calpain-type cysteine protease DEK1 OS=Cucumis melo OX=3656 GN=LOC103492422 PE=3 SV=1)

HSP 1 Score: 4136.3 bits (10726), Expect = 0.0e+00
Identity = 2119/2210 (95.88%), Postives = 2137/2210 (96.70%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGD HKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDGHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCGFLSLSAWILVISPI VLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGFLSLSAWILVISPIMVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSVLVLVAYSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVLVLVAYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN VESTSKSGPAAQCTVDGNNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNVVESTSKSGPAAQCTVDGNNWNGVLCRVGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KS+DSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFD NSSLVVCSSSGL+SQGCESSTST
Sbjct: 481  KSLDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDQNSSLVVCSSSGLESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELA+LLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELANLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA
Sbjct: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR+L TKEGIVLVICMSLFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTLGTKEGIVLVICMSLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATS YLGWAMASAISL+VTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSAYLGWAMASAISLIVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AV VAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVSVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLM+VLAIGSVHHWAS
Sbjct: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMVVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHADCGKNVSAAFL+LYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLMLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFL KETIIQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLGKETIIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDP VMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN
Sbjct: 1201 SSAALLVGDPAVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGF DLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFPDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAM AAVRAVGGDSVLEDSFARERVSSIARRIRVA
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMTAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGILGAVCVLDDEPIGCGKHCGQ+EASLCQSRKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQVEASLCQSRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITK DRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKSDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTIDADLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG
Sbjct: 1561 MTIDADLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIAALHAAISSTD+NMI FAEDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDFNMIHFAEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGR+ESQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELVKEGRLESQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITP YN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPSYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. ExPASy TrEMBL
Match: A0A6J1H873 (calpain-type cysteine protease DEK1-like OS=Cucurbita moschata OX=3662 GN=LOC111461431 PE=3 SV=1)

HSP 1 Score: 4090.8 bits (10608), Expect = 0.0e+00
Identity = 2085/2210 (94.34%), Postives = 2128/2210 (96.29%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSLSAWILVISPIAVLI+WGCWLIVIL RDI GLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGLLSLSAWILVISPIAVLIVWGCWLIVILDRDIIGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLC+YELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCSYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSV+VLV YSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVVVLVGYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWN+GACLYGFQLLKSGVLALFVAGMSRVFLICFGV+
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNIGACLYGFQLLKSGVLALFVAGMSRVFLICFGVN 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLS+TDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSSTDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN  ESTSKSGPAAQCTVD NNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNGTESTSKSGPAAQCTVDANNWNGVLCRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KSMDSGRPSLALRSSSCRSIIQEPDAAMSF DKIFDH+SSLVVCSSSG +SQGCESSTST
Sbjct: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFTDKIFDHHSSLVVCSSSGPESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQTLDLNLALALQERLSDPRITS+LKRSSRQG+REL SLLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQTLDLNLALALQERLSDPRITSILKRSSRQGERELTSLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREH+DNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHQDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIET IVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV+
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETTIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVI 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSL AEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLG+SLTVPLMVA
Sbjct: 721  CSIMAFLQSLLAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGMSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQR++ TKEGIVLVICM LFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRTIGTKEGIVLVICMLLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMA+ ISLVVTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMAATISLVVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AVCVAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLS+GVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS
Sbjct: 961  DDGWRLSQGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLP YVYDAHADCGK+VSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPAYVYDAHADCGKDVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKET+IQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETVIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDPT+MRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHE+TN
Sbjct: 1201 SSAALLVGDPTIMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHESTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARIL LEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILGLEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRR+FEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRRFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV+
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVS 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGI GAVCVLDDEPIGCGKHCGQIEAS+CQ+RKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGIHGAVCVLDDEPIGCGKHCGQIEASICQTRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTID DLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTE+WVGVRPPTD+DIFGRSDSEG
Sbjct: 1561 MTIDVDLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEMWVGVRPPTDIDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIA+LHAA+SSTD+NM+DF EDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIASLHAAMSSTDFNMLDFTEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVV+DVDSF RKFRRPR ETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVLDVDSFARKFRRPRTETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGRI+SQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELLKEGRIDSQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVS+LEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSMLEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLG GSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGVGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRT+AGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTSAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2162

BLAST of HG10007350 vs. ExPASy TrEMBL
Match: A0A6J1JFY5 (calpain-type cysteine protease DEK1-like OS=Cucurbita maxima OX=3661 GN=LOC111485372 PE=3 SV=1)

HSP 1 Score: 4079.6 bits (10579), Expect = 0.0e+00
Identity = 2081/2210 (94.16%), Postives = 2126/2210 (96.20%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            M+GDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL
Sbjct: 1    MKGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSLSAWILVISPIAVLIIWGCWLIVIL RDI GLAVVMAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGLLSLSAWILVISPIAVLIIWGCWLIVILDRDIIGLAVVMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVAILL
Sbjct: 121  QWQSSR------------------------------------------------AVAILL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LLAVALLC+YELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLAVALLCSYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGPLASLPEPPDPNEL       ASHLGLLYVGSV+VLV YSI
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPLASLPEPPDPNELYPRQSSRASHLGLLYVGSVVVLVGYSI 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTAKEARWLGATTSAAVIILDWN+GACLYGFQLLKSGVLALFVAGMSRVFLICFGV+
Sbjct: 301  LYGLTAKEARWLGATTSAAVIILDWNIGACLYGFQLLKSGVLALFVAGMSRVFLICFGVN 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS
Sbjct: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSSMKRSSSVEAGHLGN  ESTSKSGPAAQCTVD NNWNGVLCR  SSQEGINSD
Sbjct: 421  SSDGCGSSMKRSSSVEAGHLGNGTESTSKSGPAAQCTVDANNWNGVLCRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMSFVDKIFDHNSSLVVCSSSGLDSQGCESSTST 540
            KSMDSGRPS ALRSSSCRSIIQEPDAAMSF DK FDH+SSLVVCSSSG +SQGCESSTST
Sbjct: 481  KSMDSGRPSSALRSSSCRSIIQEPDAAMSFTDKNFDHHSSLVVCSSSGPESQGCESSTST 540

Query: 541  SANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEKS 600
            SANQQ LDLNLALALQERLSDPRITS+LKRSSRQG+REL SLLQNKGLDPNFAMMLKEKS
Sbjct: 541  SANQQMLDLNLALALQERLSDPRITSILKRSSRQGERELTSLLQNKGLDPNFAMMLKEKS 600

Query: 601  LDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660
            LDPTILALLQRSSLDADREH+DNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS
Sbjct: 601  LDPTILALLQRSSLDADREHQDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQFS 660

Query: 661  RLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVV 720
            RLVLHNVAGTPERAWVIFSLVFIIET IVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV+
Sbjct: 661  RLVLHNVAGTPERAWVIFSLVFIIETTIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPVI 720

Query: 721  CSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMVA 780
            CSIMAFLQSL AEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLG+SLTVPLMVA
Sbjct: 721  CSIMAFLQSLLAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGMSLTVPLMVA 780

Query: 781  CLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIVS 840
            CLSLAIPIWIRNGYQFWIPRVQCM S+GNQR++ T++GIVLVICM LFSGSVIALGAIVS
Sbjct: 781  CLSLAIPIWIRNGYQFWIPRVQCMSSSGNQRTIGTRKGIVLVICMLLFSGSVIALGAIVS 840

Query: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFSS 900
            AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMA+ ISLVVTGVLPIVSWFSTYRFSFSS
Sbjct: 841  AKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMAATISLVVTGVLPIVSWFSTYRFSFSS 900

Query: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960
            AVCVAIFTVVLVMFCGASYLEVVKSRDD VPTNGDFLAALLPLVCIPALLSLCSGLYKWK
Sbjct: 901  AVCVAIFTVVLVMFCGASYLEVVKSRDDEVPTNGDFLAALLPLVCIPALLSLCSGLYKWK 960

Query: 961  DDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020
            DDGWRLS+GVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS
Sbjct: 961  DDGWRLSQGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWAS 1020

Query: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080
            NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP
Sbjct: 1021 NNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLSP 1080

Query: 1081 PIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140
            PIVVYSPRVLPVYVYDAHAD GKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS
Sbjct: 1081 PIVVYSPRVLPVYVYDAHADSGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAVS 1140

Query: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRSA 1200
            AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKET+IQAISRSATKTRNALSGTYSAPQRSA
Sbjct: 1141 AITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETVIQAISRSATKTRNALSGTYSAPQRSA 1200

Query: 1201 SSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETTN 1260
            SSAALLVGDPT+MRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHE+TN
Sbjct: 1201 SSAALLVGDPTIMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHESTN 1260

Query: 1261 DVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320
            DVDHRRQMCAHARIL LEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD
Sbjct: 1261 DVDHRRQMCAHARILGLEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFLD 1320

Query: 1321 SIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLE 1380
            SIGFSDLSAKKIKKWMPEDRR+FEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL+
Sbjct: 1321 SIGFSDLSAKKIKKWMPEDRRRFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALLK 1380

Query: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVA 1440
            KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV+
Sbjct: 1381 KEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRVS 1440

Query: 1441 QLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCLF 1500
            QLARRALQTGI GAVCVLDDEPIGCGKHCGQIEAS+CQ+RKIS+SIAALIQPESGPVCLF
Sbjct: 1441 QLARRALQTGIHGAVCVLDDEPIGCGKHCGQIEASICQTRKISVSIAALIQPESGPVCLF 1500

Query: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIIT 1560
            GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHI+T
Sbjct: 1501 GTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHIVT 1560

Query: 1561 MTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSEG 1620
            MTID DLGEATCYLDGGFDGYQ GLPLNVGDNIWEQGTE+WVGVRPPTD+DIFGRSDSEG
Sbjct: 1561 MTIDVDLGEATCYLDGGFDGYQTGLPLNVGDNIWEQGTEMWVGVRPPTDIDIFGRSDSEG 1620

Query: 1621 AESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSDP 1680
            AESKMHIMDVFLWGRSLTEDEIA+LHAA+SSTD+NM+DF EDNWEWADSPSRVDEWDSDP
Sbjct: 1621 AESKMHIMDVFLWGRSLTEDEIASLHAAMSSTDFNMLDFTEDNWEWADSPSRVDEWDSDP 1680

Query: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSVE 1740
            ADVDLYDRDDVDWDGQYSSGRKRRLERDGVV+DVDSFTRKFRRPR ETCEEINQRMLSVE
Sbjct: 1681 ADVDLYDRDDVDWDGQYSSGRKRRLERDGVVLDVDSFTRKFRRPRTETCEEINQRMLSVE 1740

Query: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQP 1800
            LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRP+EL+KEGRI+SQP
Sbjct: 1741 LAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPVELLKEGRIDSQP 1800

Query: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSEW 1860
            CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYN+EGIYTVRFCIQSEW
Sbjct: 1801 CLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNEEGIYTVRFCIQSEW 1860

Query: 1861 VPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDLT 1920
            VPVVVDDWIPCESPGKPAFATS+KGNELWVS+LEKAYAKLHGSYEALEGGLVQDALVDLT
Sbjct: 1861 VPVVVDDWIPCESPGKPAFATSRKGNELWVSMLEKAYAKLHGSYEALEGGLVQDALVDLT 1920

Query: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980
            GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY
Sbjct: 1921 GGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHAY 1980

Query: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040
            SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW
Sbjct: 1981 SLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMSW 2040

Query: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100
            QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP
Sbjct: 2041 QDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASYP 2100

Query: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160
            VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD
Sbjct: 2101 VHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGTD 2160

Query: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKAS+TLDVL
Sbjct: 2161 YVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASVTLDVL 2162

BLAST of HG10007350 vs. TAIR 10
Match: AT1G55350.3 (calpain-type cysteine protease family )

HSP 1 Score: 3440.2 bits (8919), Expect = 0.0e+00
Identity = 1743/2211 (78.83%), Postives = 1940/2211 (87.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDE  V+LACVISG+LF+V GS SF+ILWAVNWRPWR+YSWIFARKWP +LQGPQLD+
Sbjct: 1    MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSL AWI+V+SPIA+LI WG WLIVIL R I GLA++MAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVA+LL
Sbjct: 121  QWQSSR------------------------------------------------AVALLL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LL VALLCAYELCAVYVTAG+ AS++YSPSGFFFG+SAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGP+A LPEPPDPNEL       ASHLGLLY+GS++VL+AYS+
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLYLGSLVVLLAYSV 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTA+E+RWLG  TSAAVI+LDWN+GACLYGF+LL++ VLALFVAG+SR+FLICFG+H
Sbjct: 301  LYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAGISRLFLICFGIH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISY  VASVL GAAV RHLS TDP AARRDALQSTVIRLREGFRRKE NSSSS
Sbjct: 361  YWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLREGFRRKEQNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSS+KRSSS++AGH G   E+   +  A  CT D       L R  SSQEGINSD
Sbjct: 421  SSDGCGSSIKRSSSIDAGHTGCTNEA---NRTAESCTADN------LTRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMS-FVDKIFDHNSSLVVCSSSGLDSQGCESSTS 540
            KS +SGRPSL LRSSSCRS++QEP+A  S F+DK+ D N++LVVCSSSGLDSQG ESSTS
Sbjct: 481  KSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYESSTS 540

Query: 541  TSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEK 600
             SANQQ LD+NLALA Q++L++PRI S+LK+ +++GD EL +LLQ+KGLDPNFA+MLKEK
Sbjct: 541  NSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVMLKEK 600

Query: 601  SLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQF 660
            +LDPTILALLQRSSLDADR+HRDNTDITIIDSNSVDN LPNQISLSEELRL GLEKWL+ 
Sbjct: 601  NLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKWLKL 660

Query: 661  SRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV 720
            SRL+LH+VAGTPERAW +FSLVFI+ETIIVAIFRPKTI IIN+ HQQFEFGF+VLLLSPV
Sbjct: 661  SRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLLSPV 720

Query: 721  VCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMV 780
            VCSIMAFL+SLQ EEM++TSK RKYGF+AWLLSTSVGL LSFLSKSSVLLG+SLTVPLM 
Sbjct: 721  VCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVPLMA 780

Query: 781  ACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIV 840
            ACLS+A+PIW+ NGYQFW+P++ C   A + RS   K G +L IC+ LF+GSVI+LGAI+
Sbjct: 781  ACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIK-GFILWICVVLFAGSVISLGAII 840

Query: 841  SAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFS 900
            SAKPL+DL+YK ++  + + +SPY +SVYLGWAM+S I+LVVT +LPIVSWF+TYRFS S
Sbjct: 841  SAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFSHS 900

Query: 901  SAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKW 960
            SAVC+ IF+VVLV FCG SYLEVVKSRDD +PT GDFLAALLPL CIPALLSLC G+ KW
Sbjct: 901  SAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMVKW 960

Query: 961  KDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWA 1020
            KDD W LSRGVY F  IGLLLL GAI+AVI V KPWTIG +FLLVL ++V+ IG +H WA
Sbjct: 961  KDDCWILSRGVYVFFSIGLLLLFGAIAAVIAV-KPWTIGVSFLLVLFLMVVTIGVIHLWA 1020

Query: 1021 SNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLS 1080
            SNNFYLTR Q   VCFLA LL LAAFL+GW + K F GASVGYF FL LLAGRAL VLLS
Sbjct: 1021 SNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLLS 1080

Query: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAV 1140
            PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASL+IYPPFAGAAV
Sbjct: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAAV 1140

Query: 1141 SAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRS 1200
            SAITLVV+FGFAVSRPCLTL+MM+ AV FLSK+TI+QAISRSATKTRNALSGTYSAPQRS
Sbjct: 1141 SAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQRS 1200

Query: 1201 ASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETT 1260
            ASSAALLVGDP+ MRD+AGNFVLPR DVMKLRDRLRNEE VAGS F +++ R+ F HE  
Sbjct: 1201 ASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEPP 1260

Query: 1261 NDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320
             +VD+RR MCAHAR+LALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL
Sbjct: 1261 TNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320

Query: 1321 DSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL 1380
            DSIGFSDLSA+KI KW PEDRRQFEIIQESY+REKEMEEE LMQRREEEGRGKERRKALL
Sbjct: 1321 DSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKALL 1380

Query: 1381 EKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV 1440
            EKEERKWKEIEASL+ SIPNAG REAAAMAAA+RAVGGDSVLEDSFARERVS IARRIR 
Sbjct: 1381 EKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIRT 1440

Query: 1441 AQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCL 1500
            AQL RRA QTGI GAVCVLDDEP+  GKHCGQ+++S+CQS+KIS S+ A+IQ +SGPVCL
Sbjct: 1441 AQLERRAQQTGISGAVCVLDDEPMISGKHCGQMDSSVCQSQKISFSVTAMIQSDSGPVCL 1500

Query: 1501 FGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHII 1560
            FGTE+QKK+CWE LVAGSEQGIEAGQVGLRLITKG+RQ+TV +EW I ATSI DGRWH +
Sbjct: 1501 FGTEFQKKVCWEILVAGSEQGIEAGQVGLRLITKGERQTTVAREWYIGATSITDGRWHTV 1560

Query: 1561 TMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSE 1620
            T+TIDAD GEATCY+DGGFDGYQ GLPL++G  IWEQG E+W+GVRPP DVD FGRSDS+
Sbjct: 1561 TITIDADAGEATCYIDGGFDGYQNGLPLSIGSAIWEQGAEVWLGVRPPIDVDAFGRSDSD 1620

Query: 1621 GAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSD 1680
            G ESKMHIMDVFLWG+ L+E+E A+LHAAI   D +MID ++DNW+W DSP RVD WDSD
Sbjct: 1621 GVESKMHIMDVFLWGKCLSEEEAASLHAAIGMADLDMIDLSDDNWQWTDSPPRVDGWDSD 1680

Query: 1681 PADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSV 1740
            PADVDLYDRDDVDWDGQYSSGRKRR  RD  V+ VDSF R+ R+PRMET E+INQRM SV
Sbjct: 1681 PADVDLYDRDDVDWDGQYSSGRKRRSGRD-FVMSVDSFARRHRKPRMETQEDINQRMRSV 1740

Query: 1741 ELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQ 1800
            ELAVKEALSARG+  FTD+EFPPND SL+VD +NPPSKLQVVSEWMRP  ++KE   +S+
Sbjct: 1741 ELAVKEALSARGDKQFTDQEFPPNDRSLFVDTQNPPSKLQVVSEWMRPDSIVKENGSDSR 1800

Query: 1801 PCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSE 1860
            PCLFS  ANPSDVCQGRLGDCWFLSAVAVLTE S+ISEVIITP YN+EGIYTVRFCIQ E
Sbjct: 1801 PCLFSGDANPSDVCQGRLGDCWFLSAVAVLTEVSRISEVIITPEYNEEGIYTVRFCIQGE 1860

Query: 1861 WVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDL 1920
            WVPVV+DDWIPCESPGKPAFATS+K NELWVS++EKAYAKLHGSYEALEGGLVQDALVDL
Sbjct: 1861 WVPVVIDDWIPCESPGKPAFATSRKLNELWVSMVEKAYAKLHGSYEALEGGLVQDALVDL 1920

Query: 1921 TGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHA 1980
            TGGAGEEID+RSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVH+SSSGIVQGHA
Sbjct: 1921 TGGAGEEIDLRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHVSSSGIVQGHA 1980

Query: 1981 YSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMS 2040
            YS+LQVREVDGH+L+QIRNPWANEVEWNGPW+D+SPEWTDRMKHKLKH+PQSK+GIFWMS
Sbjct: 1981 YSVLQVREVDGHRLVQIRNPWANEVEWNGPWSDSSPEWTDRMKHKLKHVPQSKEGIFWMS 2040

Query: 2041 WQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASY 2100
            WQDFQIHFRSIYVCR+YP EMRYSV+GQWRGYSAGGCQDY +WHQNPQFRLRA+G DAS 
Sbjct: 2041 WQDFQIHFRSIYVCRVYPREMRYSVNGQWRGYSAGGCQDYSSWHQNPQFRLRATGSDASL 2100

Query: 2101 PVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGT 2160
            P+HVFITLTQGV FSRT  GFRNYQSSHDS +FYIG+RILKTRGRRAAYNI+LHESVGGT
Sbjct: 2101 PIHVFITLTQGVGFSRTTPGFRNYQSSHDSQLFYIGLRILKTRGRRAAYNIFLHESVGGT 2151

Query: 2161 DYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            DYVNSREISCEMVL+PDPKGYTIVPTTIHPGEEAPFVLSVFTKASI L+ L
Sbjct: 2161 DYVNSREISCEMVLDPDPKGYTIVPTTIHPGEEAPFVLSVFTKASIVLEAL 2151

BLAST of HG10007350 vs. TAIR 10
Match: AT1G55350.1 (calpain-type cysteine protease family )

HSP 1 Score: 3440.2 bits (8919), Expect = 0.0e+00
Identity = 1743/2211 (78.83%), Postives = 1940/2211 (87.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDE  V+LACVISG+LF+V GS SF+ILWAVNWRPWR+YSWIFARKWP +LQGPQLD+
Sbjct: 1    MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSL AWI+V+SPIA+LI WG WLIVIL R I GLA++MAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVA+LL
Sbjct: 121  QWQSSR------------------------------------------------AVALLL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LL VALLCAYELCAVYVTAG+ AS++YSPSGFFFG+SAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGP+A LPEPPDPNEL       ASHLGLLY+GS++VL+AYS+
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLYLGSLVVLLAYSV 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTA+E+RWLG  TSAAVI+LDWN+GACLYGF+LL++ VLALFVAG+SR+FLICFG+H
Sbjct: 301  LYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAGISRLFLICFGIH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISY  VASVL GAAV RHLS TDP AARRDALQSTVIRLREGFRRKE NSSSS
Sbjct: 361  YWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLREGFRRKEQNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSS+KRSSS++AGH G   E+   +  A  CT D       L R  SSQEGINSD
Sbjct: 421  SSDGCGSSIKRSSSIDAGHTGCTNEA---NRTAESCTADN------LTRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMS-FVDKIFDHNSSLVVCSSSGLDSQGCESSTS 540
            KS +SGRPSL LRSSSCRS++QEP+A  S F+DK+ D N++LVVCSSSGLDSQG ESSTS
Sbjct: 481  KSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYESSTS 540

Query: 541  TSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEK 600
             SANQQ LD+NLALA Q++L++PRI S+LK+ +++GD EL +LLQ+KGLDPNFA+MLKEK
Sbjct: 541  NSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVMLKEK 600

Query: 601  SLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQF 660
            +LDPTILALLQRSSLDADR+HRDNTDITIIDSNSVDN LPNQISLSEELRL GLEKWL+ 
Sbjct: 601  NLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKWLKL 660

Query: 661  SRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV 720
            SRL+LH+VAGTPERAW +FSLVFI+ETIIVAIFRPKTI IIN+ HQQFEFGF+VLLLSPV
Sbjct: 661  SRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLLSPV 720

Query: 721  VCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMV 780
            VCSIMAFL+SLQ EEM++TSK RKYGF+AWLLSTSVGL LSFLSKSSVLLG+SLTVPLM 
Sbjct: 721  VCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVPLMA 780

Query: 781  ACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIV 840
            ACLS+A+PIW+ NGYQFW+P++ C   A + RS   K G +L IC+ LF+GSVI+LGAI+
Sbjct: 781  ACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIK-GFILWICVVLFAGSVISLGAII 840

Query: 841  SAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFS 900
            SAKPL+DL+YK ++  + + +SPY +SVYLGWAM+S I+LVVT +LPIVSWF+TYRFS S
Sbjct: 841  SAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFSHS 900

Query: 901  SAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKW 960
            SAVC+ IF+VVLV FCG SYLEVVKSRDD +PT GDFLAALLPL CIPALLSLC G+ KW
Sbjct: 901  SAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMVKW 960

Query: 961  KDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWA 1020
            KDD W LSRGVY F  IGLLLL GAI+AVI V KPWTIG +FLLVL ++V+ IG +H WA
Sbjct: 961  KDDCWILSRGVYVFFSIGLLLLFGAIAAVIAV-KPWTIGVSFLLVLFLMVVTIGVIHLWA 1020

Query: 1021 SNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLS 1080
            SNNFYLTR Q   VCFLA LL LAAFL+GW + K F GASVGYF FL LLAGRAL VLLS
Sbjct: 1021 SNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLLS 1080

Query: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAV 1140
            PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASL+IYPPFAGAAV
Sbjct: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAAV 1140

Query: 1141 SAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRS 1200
            SAITLVV+FGFAVSRPCLTL+MM+ AV FLSK+TI+QAISRSATKTRNALSGTYSAPQRS
Sbjct: 1141 SAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQRS 1200

Query: 1201 ASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETT 1260
            ASSAALLVGDP+ MRD+AGNFVLPR DVMKLRDRLRNEE VAGS F +++ R+ F HE  
Sbjct: 1201 ASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEPP 1260

Query: 1261 NDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320
             +VD+RR MCAHAR+LALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL
Sbjct: 1261 TNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320

Query: 1321 DSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL 1380
            DSIGFSDLSA+KI KW PEDRRQFEIIQESY+REKEMEEE LMQRREEEGRGKERRKALL
Sbjct: 1321 DSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKALL 1380

Query: 1381 EKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV 1440
            EKEERKWKEIEASL+ SIPNAG REAAAMAAA+RAVGGDSVLEDSFARERVS IARRIR 
Sbjct: 1381 EKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIRT 1440

Query: 1441 AQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCL 1500
            AQL RRA QTGI GAVCVLDDEP+  GKHCGQ+++S+CQS+KIS S+ A+IQ +SGPVCL
Sbjct: 1441 AQLERRAQQTGISGAVCVLDDEPMISGKHCGQMDSSVCQSQKISFSVTAMIQSDSGPVCL 1500

Query: 1501 FGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHII 1560
            FGTE+QKK+CWE LVAGSEQGIEAGQVGLRLITKG+RQ+TV +EW I ATSI DGRWH +
Sbjct: 1501 FGTEFQKKVCWEILVAGSEQGIEAGQVGLRLITKGERQTTVAREWYIGATSITDGRWHTV 1560

Query: 1561 TMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSE 1620
            T+TIDAD GEATCY+DGGFDGYQ GLPL++G  IWEQG E+W+GVRPP DVD FGRSDS+
Sbjct: 1561 TITIDADAGEATCYIDGGFDGYQNGLPLSIGSAIWEQGAEVWLGVRPPIDVDAFGRSDSD 1620

Query: 1621 GAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSD 1680
            G ESKMHIMDVFLWG+ L+E+E A+LHAAI   D +MID ++DNW+W DSP RVD WDSD
Sbjct: 1621 GVESKMHIMDVFLWGKCLSEEEAASLHAAIGMADLDMIDLSDDNWQWTDSPPRVDGWDSD 1680

Query: 1681 PADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSV 1740
            PADVDLYDRDDVDWDGQYSSGRKRR  RD  V+ VDSF R+ R+PRMET E+INQRM SV
Sbjct: 1681 PADVDLYDRDDVDWDGQYSSGRKRRSGRD-FVMSVDSFARRHRKPRMETQEDINQRMRSV 1740

Query: 1741 ELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQ 1800
            ELAVKEALSARG+  FTD+EFPPND SL+VD +NPPSKLQVVSEWMRP  ++KE   +S+
Sbjct: 1741 ELAVKEALSARGDKQFTDQEFPPNDRSLFVDTQNPPSKLQVVSEWMRPDSIVKENGSDSR 1800

Query: 1801 PCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSE 1860
            PCLFS  ANPSDVCQGRLGDCWFLSAVAVLTE S+ISEVIITP YN+EGIYTVRFCIQ E
Sbjct: 1801 PCLFSGDANPSDVCQGRLGDCWFLSAVAVLTEVSRISEVIITPEYNEEGIYTVRFCIQGE 1860

Query: 1861 WVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDL 1920
            WVPVV+DDWIPCESPGKPAFATS+K NELWVS++EKAYAKLHGSYEALEGGLVQDALVDL
Sbjct: 1861 WVPVVIDDWIPCESPGKPAFATSRKLNELWVSMVEKAYAKLHGSYEALEGGLVQDALVDL 1920

Query: 1921 TGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHA 1980
            TGGAGEEID+RSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVH+SSSGIVQGHA
Sbjct: 1921 TGGAGEEIDLRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHVSSSGIVQGHA 1980

Query: 1981 YSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMS 2040
            YS+LQVREVDGH+L+QIRNPWANEVEWNGPW+D+SPEWTDRMKHKLKH+PQSK+GIFWMS
Sbjct: 1981 YSVLQVREVDGHRLVQIRNPWANEVEWNGPWSDSSPEWTDRMKHKLKHVPQSKEGIFWMS 2040

Query: 2041 WQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASY 2100
            WQDFQIHFRSIYVCR+YP EMRYSV+GQWRGYSAGGCQDY +WHQNPQFRLRA+G DAS 
Sbjct: 2041 WQDFQIHFRSIYVCRVYPREMRYSVNGQWRGYSAGGCQDYSSWHQNPQFRLRATGSDASL 2100

Query: 2101 PVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGT 2160
            P+HVFITLTQGV FSRT  GFRNYQSSHDS +FYIG+RILKTRGRRAAYNI+LHESVGGT
Sbjct: 2101 PIHVFITLTQGVGFSRTTPGFRNYQSSHDSQLFYIGLRILKTRGRRAAYNIFLHESVGGT 2151

Query: 2161 DYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            DYVNSREISCEMVL+PDPKGYTIVPTTIHPGEEAPFVLSVFTKASI L+ L
Sbjct: 2161 DYVNSREISCEMVLDPDPKGYTIVPTTIHPGEEAPFVLSVFTKASIVLEAL 2151

BLAST of HG10007350 vs. TAIR 10
Match: AT1G55350.2 (calpain-type cysteine protease family )

HSP 1 Score: 3440.2 bits (8919), Expect = 0.0e+00
Identity = 1743/2211 (78.83%), Postives = 1940/2211 (87.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDE  V+LACVISG+LF+V GS SF+ILWAVNWRPWR+YSWIFARKWP +LQGPQLD+
Sbjct: 1    MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSL AWI+V+SPIA+LI WG WLIVIL R I GLA++MAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVA+LL
Sbjct: 121  QWQSSR------------------------------------------------AVALLL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LL VALLCAYELCAVYVTAG+ AS++YSPSGFFFG+SAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGP+A LPEPPDPNEL       ASHLGLLY+GS++VL+AYS+
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLYLGSLVVLLAYSV 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTA+E+RWLG  TSAAVI+LDWN+GACLYGF+LL++ VLALFVAG+SR+FLICFG+H
Sbjct: 301  LYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAGISRLFLICFGIH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISY  VASVL GAAV RHLS TDP AARRDALQSTVIRLREGFRRKE NSSSS
Sbjct: 361  YWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLREGFRRKEQNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSS+KRSSS++AGH G   E+   +  A  CT D       L R  SSQEGINSD
Sbjct: 421  SSDGCGSSIKRSSSIDAGHTGCTNEA---NRTAESCTADN------LTRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMS-FVDKIFDHNSSLVVCSSSGLDSQGCESSTS 540
            KS +SGRPSL LRSSSCRS++QEP+A  S F+DK+ D N++LVVCSSSGLDSQG ESSTS
Sbjct: 481  KSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYESSTS 540

Query: 541  TSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEK 600
             SANQQ LD+NLALA Q++L++PRI S+LK+ +++GD EL +LLQ+KGLDPNFA+MLKEK
Sbjct: 541  NSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVMLKEK 600

Query: 601  SLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQF 660
            +LDPTILALLQRSSLDADR+HRDNTDITIIDSNSVDN LPNQISLSEELRL GLEKWL+ 
Sbjct: 601  NLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKWLKL 660

Query: 661  SRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV 720
            SRL+LH+VAGTPERAW +FSLVFI+ETIIVAIFRPKTI IIN+ HQQFEFGF+VLLLSPV
Sbjct: 661  SRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLLSPV 720

Query: 721  VCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMV 780
            VCSIMAFL+SLQ EEM++TSK RKYGF+AWLLSTSVGL LSFLSKSSVLLG+SLTVPLM 
Sbjct: 721  VCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVPLMA 780

Query: 781  ACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIV 840
            ACLS+A+PIW+ NGYQFW+P++ C   A + RS   K G +L IC+ LF+GSVI+LGAI+
Sbjct: 781  ACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIK-GFILWICVVLFAGSVISLGAII 840

Query: 841  SAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFS 900
            SAKPL+DL+YK ++  + + +SPY +SVYLGWAM+S I+LVVT +LPIVSWF+TYRFS S
Sbjct: 841  SAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFSHS 900

Query: 901  SAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKW 960
            SAVC+ IF+VVLV FCG SYLEVVKSRDD +PT GDFLAALLPL CIPALLSLC G+ KW
Sbjct: 901  SAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMVKW 960

Query: 961  KDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWA 1020
            KDD W LSRGVY F  IGLLLL GAI+AVI V KPWTIG +FLLVL ++V+ IG +H WA
Sbjct: 961  KDDCWILSRGVYVFFSIGLLLLFGAIAAVIAV-KPWTIGVSFLLVLFLMVVTIGVIHLWA 1020

Query: 1021 SNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLS 1080
            SNNFYLTR Q   VCFLA LL LAAFL+GW + K F GASVGYF FL LLAGRAL VLLS
Sbjct: 1021 SNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLLS 1080

Query: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAV 1140
            PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASL+IYPPFAGAAV
Sbjct: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAAV 1140

Query: 1141 SAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRS 1200
            SAITLVV+FGFAVSRPCLTL+MM+ AV FLSK+TI+QAISRSATKTRNALSGTYSAPQRS
Sbjct: 1141 SAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQRS 1200

Query: 1201 ASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETT 1260
            ASSAALLVGDP+ MRD+AGNFVLPR DVMKLRDRLRNEE VAGS F +++ R+ F HE  
Sbjct: 1201 ASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEPP 1260

Query: 1261 NDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320
             +VD+RR MCAHAR+LALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL
Sbjct: 1261 TNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320

Query: 1321 DSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL 1380
            DSIGFSDLSA+KI KW PEDRRQFEIIQESY+REKEMEEE LMQRREEEGRGKERRKALL
Sbjct: 1321 DSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKALL 1380

Query: 1381 EKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV 1440
            EKEERKWKEIEASL+ SIPNAG REAAAMAAA+RAVGGDSVLEDSFARERVS IARRIR 
Sbjct: 1381 EKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIRT 1440

Query: 1441 AQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCL 1500
            AQL RRA QTGI GAVCVLDDEP+  GKHCGQ+++S+CQS+KIS S+ A+IQ +SGPVCL
Sbjct: 1441 AQLERRAQQTGISGAVCVLDDEPMISGKHCGQMDSSVCQSQKISFSVTAMIQSDSGPVCL 1500

Query: 1501 FGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHII 1560
            FGTE+QKK+CWE LVAGSEQGIEAGQVGLRLITKG+RQ+TV +EW I ATSI DGRWH +
Sbjct: 1501 FGTEFQKKVCWEILVAGSEQGIEAGQVGLRLITKGERQTTVAREWYIGATSITDGRWHTV 1560

Query: 1561 TMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSE 1620
            T+TIDAD GEATCY+DGGFDGYQ GLPL++G  IWEQG E+W+GVRPP DVD FGRSDS+
Sbjct: 1561 TITIDADAGEATCYIDGGFDGYQNGLPLSIGSAIWEQGAEVWLGVRPPIDVDAFGRSDSD 1620

Query: 1621 GAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSD 1680
            G ESKMHIMDVFLWG+ L+E+E A+LHAAI   D +MID ++DNW+W DSP RVD WDSD
Sbjct: 1621 GVESKMHIMDVFLWGKCLSEEEAASLHAAIGMADLDMIDLSDDNWQWTDSPPRVDGWDSD 1680

Query: 1681 PADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSV 1740
            PADVDLYDRDDVDWDGQYSSGRKRR  RD  V+ VDSF R+ R+PRMET E+INQRM SV
Sbjct: 1681 PADVDLYDRDDVDWDGQYSSGRKRRSGRD-FVMSVDSFARRHRKPRMETQEDINQRMRSV 1740

Query: 1741 ELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQ 1800
            ELAVKEALSARG+  FTD+EFPPND SL+VD +NPPSKLQVVSEWMRP  ++KE   +S+
Sbjct: 1741 ELAVKEALSARGDKQFTDQEFPPNDRSLFVDTQNPPSKLQVVSEWMRPDSIVKENGSDSR 1800

Query: 1801 PCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSE 1860
            PCLFS  ANPSDVCQGRLGDCWFLSAVAVLTE S+ISEVIITP YN+EGIYTVRFCIQ E
Sbjct: 1801 PCLFSGDANPSDVCQGRLGDCWFLSAVAVLTEVSRISEVIITPEYNEEGIYTVRFCIQGE 1860

Query: 1861 WVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDL 1920
            WVPVV+DDWIPCESPGKPAFATS+K NELWVS++EKAYAKLHGSYEALEGGLVQDALVDL
Sbjct: 1861 WVPVVIDDWIPCESPGKPAFATSRKLNELWVSMVEKAYAKLHGSYEALEGGLVQDALVDL 1920

Query: 1921 TGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHA 1980
            TGGAGEEID+RSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVH+SSSGIVQGHA
Sbjct: 1921 TGGAGEEIDLRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHVSSSGIVQGHA 1980

Query: 1981 YSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMS 2040
            YS+LQVREVDGH+L+QIRNPWANEVEWNGPW+D+SPEWTDRMKHKLKH+PQSK+GIFWMS
Sbjct: 1981 YSVLQVREVDGHRLVQIRNPWANEVEWNGPWSDSSPEWTDRMKHKLKHVPQSKEGIFWMS 2040

Query: 2041 WQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASY 2100
            WQDFQIHFRSIYVCR+YP EMRYSV+GQWRGYSAGGCQDY +WHQNPQFRLRA+G DAS 
Sbjct: 2041 WQDFQIHFRSIYVCRVYPREMRYSVNGQWRGYSAGGCQDYSSWHQNPQFRLRATGSDASL 2100

Query: 2101 PVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGT 2160
            P+HVFITLTQGV FSRT  GFRNYQSSHDS +FYIG+RILKTRGRRAAYNI+LHESVGGT
Sbjct: 2101 PIHVFITLTQGVGFSRTTPGFRNYQSSHDSQLFYIGLRILKTRGRRAAYNIFLHESVGGT 2151

Query: 2161 DYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            DYVNSREISCEMVL+PDPKGYTIVPTTIHPGEEAPFVLSVFTKASI L+ L
Sbjct: 2161 DYVNSREISCEMVLDPDPKGYTIVPTTIHPGEEAPFVLSVFTKASIVLEAL 2151

BLAST of HG10007350 vs. TAIR 10
Match: AT1G55350.4 (calpain-type cysteine protease family )

HSP 1 Score: 3440.2 bits (8919), Expect = 0.0e+00
Identity = 1743/2211 (78.83%), Postives = 1940/2211 (87.74%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDE  V+LACVISG+LF+V GS SF+ILWAVNWRPWR+YSWIFARKWP +LQGPQLD+
Sbjct: 1    MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSL AWI+V+SPIA+LI WG WLIVIL R I GLA++MAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVA+LL
Sbjct: 121  QWQSSR------------------------------------------------AVALLL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LL VALLCAYELCAVYVTAG+ AS++YSPSGFFFG+SAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGP+A LPEPPDPNEL       ASHLGLLY+GS++VL+AYS+
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLYLGSLVVLLAYSV 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTA+E+RWLG  TSAAVI+LDWN+GACLYGF+LL++ VLALFVAG+SR+FLICFG+H
Sbjct: 301  LYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAGISRLFLICFGIH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISY  VASVL GAAV RHLS TDP AARRDALQSTVIRLREGFRRKE NSSSS
Sbjct: 361  YWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLREGFRRKEQNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSS+KRSSS++AGH G   E+   +  A  CT D       L R  SSQEGINSD
Sbjct: 421  SSDGCGSSIKRSSSIDAGHTGCTNEA---NRTAESCTADN------LTRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMS-FVDKIFDHNSSLVVCSSSGLDSQGCESSTS 540
            KS +SGRPSL LRSSSCRS++QEP+A  S F+DK+ D N++LVVCSSSGLDSQG ESSTS
Sbjct: 481  KSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYESSTS 540

Query: 541  TSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEK 600
             SANQQ LD+NLALA Q++L++PRI S+LK+ +++GD EL +LLQ+KGLDPNFA+MLKEK
Sbjct: 541  NSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVMLKEK 600

Query: 601  SLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQF 660
            +LDPTILALLQRSSLDADR+HRDNTDITIIDSNSVDN LPNQISLSEELRL GLEKWL+ 
Sbjct: 601  NLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKWLKL 660

Query: 661  SRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV 720
            SRL+LH+VAGTPERAW +FSLVFI+ETIIVAIFRPKTI IIN+ HQQFEFGF+VLLLSPV
Sbjct: 661  SRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLLSPV 720

Query: 721  VCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMV 780
            VCSIMAFL+SLQ EEM++TSK RKYGF+AWLLSTSVGL LSFLSKSSVLLG+SLTVPLM 
Sbjct: 721  VCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVPLMA 780

Query: 781  ACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIV 840
            ACLS+A+PIW+ NGYQFW+P++ C   A + RS   K G +L IC+ LF+GSVI+LGAI+
Sbjct: 781  ACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIK-GFILWICVVLFAGSVISLGAII 840

Query: 841  SAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFS 900
            SAKPL+DL+YK ++  + + +SPY +SVYLGWAM+S I+LVVT +LPIVSWF+TYRFS S
Sbjct: 841  SAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFSHS 900

Query: 901  SAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKW 960
            SAVC+ IF+VVLV FCG SYLEVVKSRDD +PT GDFLAALLPL CIPALLSLC G+ KW
Sbjct: 901  SAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMVKW 960

Query: 961  KDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWA 1020
            KDD W LSRGVY F  IGLLLL GAI+AVI V KPWTIG +FLLVL ++V+ IG +H WA
Sbjct: 961  KDDCWILSRGVYVFFSIGLLLLFGAIAAVIAV-KPWTIGVSFLLVLFLMVVTIGVIHLWA 1020

Query: 1021 SNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLS 1080
            SNNFYLTR Q   VCFLA LL LAAFL+GW + K F GASVGYF FL LLAGRAL VLLS
Sbjct: 1021 SNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLLS 1080

Query: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAV 1140
            PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASL+IYPPFAGAAV
Sbjct: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAAV 1140

Query: 1141 SAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRS 1200
            SAITLVV+FGFAVSRPCLTL+MM+ AV FLSK+TI+QAISRSATKTRNALSGTYSAPQRS
Sbjct: 1141 SAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQRS 1200

Query: 1201 ASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETT 1260
            ASSAALLVGDP+ MRD+AGNFVLPR DVMKLRDRLRNEE VAGS F +++ R+ F HE  
Sbjct: 1201 ASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEPP 1260

Query: 1261 NDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320
             +VD+RR MCAHAR+LALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL
Sbjct: 1261 TNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320

Query: 1321 DSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL 1380
            DSIGFSDLSA+KI KW PEDRRQFEIIQESY+REKEMEEE LMQRREEEGRGKERRKALL
Sbjct: 1321 DSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKALL 1380

Query: 1381 EKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV 1440
            EKEERKWKEIEASL+ SIPNAG REAAAMAAA+RAVGGDSVLEDSFARERVS IARRIR 
Sbjct: 1381 EKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIRT 1440

Query: 1441 AQLARRALQTGILGAVCVLDDEPIGCGKHCGQIEASLCQSRKISISIAALIQPESGPVCL 1500
            AQL RRA QTGI GAVCVLDDEP+  GKHCGQ+++S+CQS+KIS S+ A+IQ +SGPVCL
Sbjct: 1441 AQLERRAQQTGISGAVCVLDDEPMISGKHCGQMDSSVCQSQKISFSVTAMIQSDSGPVCL 1500

Query: 1501 FGTEYQKKICWEFLVAGSEQGIEAGQVGLRLITKGDRQSTVTKEWSISATSIADGRWHII 1560
            FGTE+QKK+CWE LVAGSEQGIEAGQVGLRLITKG+RQ+TV +EW I ATSI DGRWH +
Sbjct: 1501 FGTEFQKKVCWEILVAGSEQGIEAGQVGLRLITKGERQTTVAREWYIGATSITDGRWHTV 1560

Query: 1561 TMTIDADLGEATCYLDGGFDGYQIGLPLNVGDNIWEQGTEIWVGVRPPTDVDIFGRSDSE 1620
            T+TIDAD GEATCY+DGGFDGYQ GLPL++G  IWEQG E+W+GVRPP DVD FGRSDS+
Sbjct: 1561 TITIDADAGEATCYIDGGFDGYQNGLPLSIGSAIWEQGAEVWLGVRPPIDVDAFGRSDSD 1620

Query: 1621 GAESKMHIMDVFLWGRSLTEDEIAALHAAISSTDYNMIDFAEDNWEWADSPSRVDEWDSD 1680
            G ESKMHIMDVFLWG+ L+E+E A+LHAAI   D +MID ++DNW+W DSP RVD WDSD
Sbjct: 1621 GVESKMHIMDVFLWGKCLSEEEAASLHAAIGMADLDMIDLSDDNWQWTDSPPRVDGWDSD 1680

Query: 1681 PADVDLYDRDDVDWDGQYSSGRKRRLERDGVVVDVDSFTRKFRRPRMETCEEINQRMLSV 1740
            PADVDLYDRDDVDWDGQYSSGRKRR  RD  V+ VDSF R+ R+PRMET E+INQRM SV
Sbjct: 1681 PADVDLYDRDDVDWDGQYSSGRKRRSGRD-FVMSVDSFARRHRKPRMETQEDINQRMRSV 1740

Query: 1741 ELAVKEALSARGEMHFTDEEFPPNDESLYVDPKNPPSKLQVVSEWMRPLELIKEGRIESQ 1800
            ELAVKEALSARG+  FTD+EFPPND SL+VD +NPPSKLQVVSEWMRP  ++KE   +S+
Sbjct: 1741 ELAVKEALSARGDKQFTDQEFPPNDRSLFVDTQNPPSKLQVVSEWMRPDSIVKENGSDSR 1800

Query: 1801 PCLFSEAANPSDVCQGRLGDCWFLSAVAVLTEASKISEVIITPRYNDEGIYTVRFCIQSE 1860
            PCLFS  ANPSDVCQGRLGDCWFLSAVAVLTE S+ISEVIITP YN+EGIYTVRFCIQ E
Sbjct: 1801 PCLFSGDANPSDVCQGRLGDCWFLSAVAVLTEVSRISEVIITPEYNEEGIYTVRFCIQGE 1860

Query: 1861 WVPVVVDDWIPCESPGKPAFATSKKGNELWVSILEKAYAKLHGSYEALEGGLVQDALVDL 1920
            WVPVV+DDWIPCESPGKPAFATS+K NELWVS++EKAYAKLHGSYEALEGGLVQDALVDL
Sbjct: 1861 WVPVVIDDWIPCESPGKPAFATSRKLNELWVSMVEKAYAKLHGSYEALEGGLVQDALVDL 1920

Query: 1921 TGGAGEEIDMRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHISSSGIVQGHA 1980
            TGGAGEEID+RSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVH+SSSGIVQGHA
Sbjct: 1921 TGGAGEEIDLRSAQAQIDLASGRLWSQLLRFKQEGFLLGAGSPSGSDVHVSSSGIVQGHA 1980

Query: 1981 YSLLQVREVDGHKLIQIRNPWANEVEWNGPWADTSPEWTDRMKHKLKHIPQSKDGIFWMS 2040
            YS+LQVREVDGH+L+QIRNPWANEVEWNGPW+D+SPEWTDRMKHKLKH+PQSK+GIFWMS
Sbjct: 1981 YSVLQVREVDGHRLVQIRNPWANEVEWNGPWSDSSPEWTDRMKHKLKHVPQSKEGIFWMS 2040

Query: 2041 WQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGYSAGGCQDYDTWHQNPQFRLRASGPDASY 2100
            WQDFQIHFRSIYVCR+YP EMRYSV+GQWRGYSAGGCQDY +WHQNPQFRLRA+G DAS 
Sbjct: 2041 WQDFQIHFRSIYVCRVYPREMRYSVNGQWRGYSAGGCQDYSSWHQNPQFRLRATGSDASL 2100

Query: 2101 PVHVFITLTQGVSFSRTAAGFRNYQSSHDSMMFYIGMRILKTRGRRAAYNIYLHESVGGT 2160
            P+HVFITLTQGV FSRT  GFRNYQSSHDS +FYIG+RILKTRGRRAAYNI+LHESVGGT
Sbjct: 2101 PIHVFITLTQGVGFSRTTPGFRNYQSSHDSQLFYIGLRILKTRGRRAAYNIFLHESVGGT 2151

Query: 2161 DYVNSREISCEMVLEPDPKGYTIVPTTIHPGEEAPFVLSVFTKASITLDVL 2204
            DYVNSREISCEMVL+PDPKGYTIVPTTIHPGEEAPFVLSVFTKASI L+ L
Sbjct: 2161 DYVNSREISCEMVLDPDPKGYTIVPTTIHPGEEAPFVLSVFTKASIVLEAL 2151

BLAST of HG10007350 vs. TAIR 10
Match: AT1G55350.5 (calpain-type cysteine protease family )

HSP 1 Score: 3425.6 bits (8881), Expect = 0.0e+00
Identity = 1743/2239 (77.85%), Postives = 1940/2239 (86.65%), Query Frame = 0

Query: 1    MEGDEHKVVLACVISGSLFSVLGSASFFILWAVNWRPWRIYSWIFARKWPNILQGPQLDL 60
            MEGDE  V+LACVISG+LF+V GS SF+ILWAVNWRPWR+YSWIFARKWP +LQGPQLD+
Sbjct: 1    MEGDERGVLLACVISGTLFTVFGSGSFWILWAVNWRPWRLYSWIFARKWPKVLQGPQLDI 60

Query: 61   LCGFLSLSAWILVISPIAVLIIWGCWLIVILGRDITGLAVVMAGTALLLAFYSIMLWWRT 120
            LCG LSL AWI+V+SPIA+LI WG WLIVIL R I GLA++MAGTALLLAFYSIMLWWRT
Sbjct: 61   LCGVLSLFAWIVVVSPIAILIGWGSWLIVILDRHIIGLAIIMAGTALLLAFYSIMLWWRT 120

Query: 121  QWQSSRKKRNFSLPKLGEEMIVGAKSCPIKQAKEDLGKKQREGFKGPCMFTLKGAVAILL 180
            QWQSSR                                                AVA+LL
Sbjct: 121  QWQSSR------------------------------------------------AVALLL 180

Query: 181  LLAVALLCAYELCAVYVTAGSSASERYSPSGFFFGISAIALAINMLFICRMVFNGNGLDV 240
            LL VALLCAYELCAVYVTAG+ AS++YSPSGFFFG+SAIALAINMLFICRMVFNGNGLDV
Sbjct: 181  LLGVALLCAYELCAVYVTAGAHASQQYSPSGFFFGVSAIALAINMLFICRMVFNGNGLDV 240

Query: 241  DEYVRKAYKFAYSDCMEVGPLASLPEPPDPNEL-------ASHLGLLYVGSVLVLVAYSI 300
            DEYVR+AYKFAYSDC+EVGP+A LPEPPDPNEL       ASHLGLLY+GS++VL+AYS+
Sbjct: 241  DEYVRRAYKFAYSDCIEVGPVACLPEPPDPNELYPRQTSRASHLGLLYLGSLVVLLAYSV 300

Query: 301  LYGLTAKEARWLGATTSAAVIILDWNVGACLYGFQLLKSGVLALFVAGMSRVFLICFGVH 360
            LYGLTA+E+RWLG  TSAAVI+LDWN+GACLYGF+LL++ VLALFVAG+SR+FLICFG+H
Sbjct: 301  LYGLTARESRWLGGITSAAVIVLDWNIGACLYGFKLLQNRVLALFVAGISRLFLICFGIH 360

Query: 361  YWYLGHCISYAVVASVLLGAAVMRHLSATDPFAARRDALQSTVIRLREGFRRKEPNSSSS 420
            YWYLGHCISY  VASVL GAAV RHLS TDP AARRDALQSTVIRLREGFRRKE NSSSS
Sbjct: 361  YWYLGHCISYIFVASVLSGAAVSRHLSITDPSAARRDALQSTVIRLREGFRRKEQNSSSS 420

Query: 421  SSDGCGSSMKRSSSVEAGHLGNAVESTSKSGPAAQCTVDGNNWNGVLCRASSSQEGINSD 480
            SSDGCGSS+KRSSS++AGH G   E+   +  A  CT D       L R  SSQEGINSD
Sbjct: 421  SSDGCGSSIKRSSSIDAGHTGCTNEA---NRTAESCTADN------LTRTGSSQEGINSD 480

Query: 481  KSMDSGRPSLALRSSSCRSIIQEPDAAMS-FVDKIFDHNSSLVVCSSSGLDSQGCESSTS 540
            KS +SGRPSL LRSSSCRS++QEP+A  S F+DK+ D N++LVVCSSSGLDSQG ESSTS
Sbjct: 481  KSEESGRPSLGLRSSSCRSVVQEPEAGTSYFMDKVSDQNNTLVVCSSSGLDSQGYESSTS 540

Query: 541  TSANQQTLDLNLALALQERLSDPRITSMLKRSSRQGDRELASLLQNKGLDPNFAMMLKEK 600
             SANQQ LD+NLALA Q++L++PRI S+LK+ +++GD EL +LLQ+KGLDPNFA+MLKEK
Sbjct: 541  NSANQQLLDMNLALAFQDQLNNPRIASILKKKAKEGDLELTNLLQDKGLDPNFAVMLKEK 600

Query: 601  SLDPTILALLQRSSLDADREHRDNTDITIIDSNSVDNMLPNQISLSEELRLHGLEKWLQF 660
            +LDPTILALLQRSSLDADR+HRDNTDITIIDSNSVDN LPNQISLSEELRL GLEKWL+ 
Sbjct: 601  NLDPTILALLQRSSLDADRDHRDNTDITIIDSNSVDNTLPNQISLSEELRLRGLEKWLKL 660

Query: 661  SRLVLHNVAGTPERAWVIFSLVFIIETIIVAIFRPKTIDIINAKHQQFEFGFAVLLLSPV 720
            SRL+LH+VAGTPERAW +FSLVFI+ETIIVAIFRPKTI IIN+ HQQFEFGF+VLLLSPV
Sbjct: 661  SRLLLHHVAGTPERAWGLFSLVFILETIIVAIFRPKTITIINSSHQQFEFGFSVLLLSPV 720

Query: 721  VCSIMAFLQSLQAEEMSMTSKPRKYGFIAWLLSTSVGLLLSFLSKSSVLLGLSLTVPLMV 780
            VCSIMAFL+SLQ EEM++TSK RKYGF+AWLLSTSVGL LSFLSKSSVLLG+SLTVPLM 
Sbjct: 721  VCSIMAFLRSLQVEEMALTSKSRKYGFVAWLLSTSVGLSLSFLSKSSVLLGISLTVPLMA 780

Query: 781  ACLSLAIPIWIRNGYQFWIPRVQCMGSAGNQRSLATKEGIVLVICMSLFSGSVIALGAIV 840
            ACLS+A+PIW+ NGYQFW+P++ C   A + RS   K G +L IC+ LF+GSVI+LGAI+
Sbjct: 781  ACLSIAVPIWMHNGYQFWVPQLSCGDQARDLRSPRIK-GFILWICVVLFAGSVISLGAII 840

Query: 841  SAKPLNDLRYKGWTGDDKSFSSPYATSVYLGWAMASAISLVVTGVLPIVSWFSTYRFSFS 900
            SAKPL+DL+YK ++  + + +SPY +SVYLGWAM+S I+LVVT +LPIVSWF+TYRFS S
Sbjct: 841  SAKPLDDLKYKLFSARENNVTSPYTSSVYLGWAMSSGIALVVTAILPIVSWFATYRFSHS 900

Query: 901  SAVCVAIFTVVLVMFCGASYLEVVKSRDDGVPTNGDFLAALLPLVCIPALLSLCSGLYKW 960
            SAVC+ IF+VVLV FCG SYLEVVKSRDD +PT GDFLAALLPL CIPALLSLC G+ KW
Sbjct: 901  SAVCLMIFSVVLVAFCGTSYLEVVKSRDDQLPTKGDFLAALLPLACIPALLSLCCGMVKW 960

Query: 961  KDDGWRLSRGVYAFLFIGLLLLLGAISAVIVVIKPWTIGAAFLLVLLMIVLAIGSVHHWA 1020
            KDD W LSRGVY F  IGLLLL GAI+AVI V KPWTIG +FLLVL ++V+ IG +H WA
Sbjct: 961  KDDCWILSRGVYVFFSIGLLLLFGAIAAVIAV-KPWTIGVSFLLVLFLMVVTIGVIHLWA 1020

Query: 1021 SNNFYLTRTQMFLVCFLAFLLALAAFLVGWFEGKPFVGASVGYFLFLFLLAGRALTVLLS 1080
            SNNFYLTR Q   VCFLA LL LAAFL+GW + K F GASVGYF FL LLAGRAL VLLS
Sbjct: 1021 SNNFYLTRKQTSFVCFLALLLGLAAFLLGWHQDKAFAGASVGYFTFLSLLAGRALAVLLS 1080

Query: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLLIYPPFAGAAV 1140
            PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASL+IYPPFAGAAV
Sbjct: 1081 PPIVVYSPRVLPVYVYDAHADCGKNVSAAFLVLYGIALATEGWGVVASLIIYPPFAGAAV 1140

Query: 1141 SAITLVVSFGFAVSRPCLTLKMMQDAVHFLSKETIIQAISRSATKTRNALSGTYSAPQRS 1200
            SAITLVV+FGFAVSRPCLTL+MM+ AV FLSK+TI+QAISRSATKTRNALSGTYSAPQRS
Sbjct: 1141 SAITLVVAFGFAVSRPCLTLEMMEVAVRFLSKDTIVQAISRSATKTRNALSGTYSAPQRS 1200

Query: 1201 ASSAALLVGDPTVMRDRAGNFVLPRADVMKLRDRLRNEELVAGSFFCRLRYRRPFFHETT 1260
            ASSAALLVGDP+ MRD+AGNFVLPR DVMKLRDRLRNEE VAGS F +++ R+ F HE  
Sbjct: 1201 ASSAALLVGDPSAMRDKAGNFVLPRDDVMKLRDRLRNEERVAGSIFYKMQCRKGFRHEPP 1260

Query: 1261 NDVDHRRQMCAHARILALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320
             +VD+RR MCAHAR+LALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL
Sbjct: 1261 TNVDYRRDMCAHARVLALEEAIDTEWVYMWDKFGGYLLLLLGLTAKAERVQDEVRLRLFL 1320

Query: 1321 DSIGFSDLSAKKIKKWMPEDRRQFEIIQESYIREKEMEEEILMQRREEEGRGKERRKALL 1380
            DSIGFSDLSA+KI KW PEDRRQFEIIQESY+REKEMEEE LMQRREEEGRGKERRKALL
Sbjct: 1321 DSIGFSDLSARKISKWKPEDRRQFEIIQESYLREKEMEEESLMQRREEEGRGKERRKALL 1380

Query: 1381 EKEERKWKEIEASLMSSIPNAGGREAAAMAAAVRAVGGDSVLEDSFARERVSSIARRIRV 1440
            EKEERKWKEIEASL+ SIPNAG REAAAMAAA+RAVGGDSVLEDSFARERVS IARRIR 
Sbjct: 1381 EKEERKWKEIEASLIPSIPNAGSREAAAMAAAIRAVGGDSVLEDSFARERVSGIARRIRT 1440

Query: 1441 AQLARRA----------------------------LQTGILGAVCVLDDEPIGCGKHCGQ 1500
            AQL RRA                             QTGI GAVCVLDDEP+  GKHCGQ
Sbjct: 1441 AQLERRAQQVKTYFYILQVFFLMMLINGELTKKSYYQTGISGAVCVLDDEPMISGKHCGQ 1500

Query: 1501 IEASLCQSRKISISIAALIQPESGPVCLFGTEYQKKICWEFLVAGSEQGIEAGQVGLRLI 1560
            +++S+CQS+KIS S+ A+IQ +SGPVCLFGTE+QKK+CWE LVAGSEQGIEAGQVGLRLI
Sbjct: 1501 MDSSVCQSQKISFSVTAMIQSDSGPVCLFGTEFQKKVCWEILVAGSEQGIEAGQVGLRLI 1560

Query: 1561 TKGDRQSTVTKEWSISATSIADGRWHIITMTIDADLGEATCYLDGGFDGYQIGLPLNVGD 1620
            TKG+RQ+TV +EW I ATSI DGRWH +T+TIDAD GEATCY+DGGFDGYQ GLPL++G 
Sbjct: 1561 TKGERQTTVAREWYIGATSITDGRWHTVTITIDADAGEATCYIDGGFDGYQNGLPLSIGS 1620

Query: 1621 NIWEQGTEIWVGVRPPTDVDIFGRSDSEGAESKMHIMDVFLWGRSLTEDEIAALHAAISS 1680
             IWEQG E+W+GVRPP DVD FGRSDS+G ESKMHIMDVFLWG+ L+E+E A+LHAAI  
Sbjct: 1621 AIWEQGAEVWLGVRPPIDVDAFGRSDSDGVESKMHIMDVFLWGKCLSEEEAASLHAAIGM 1680

Query: 1681 TDYNMIDFAEDNWEWADSPSRVDEWDSDPADVDLYDRDDVDWDGQYSSGRKRRLERDGVV 1740
             D +MID ++DNW+W DSP RVD WDSDPADVDLYDRDDVDWDGQYSSGRKRR  RD  V
Sbjct: 1681 ADLDMIDLSDDNWQWTDSPPRVDGWDSDPADVDLYDRDDVDWDGQYSSGRKRRSGRD-FV 1740

Query: 1741 VDVDSFTRKFRRPRMETCEEINQRMLSVELAVKEALSARGEMHFTDEEFPPNDESLYVDP 1800
            + VDSF R+ R+PRMET E+INQRM SVELAVKEALSARG+  FTD+EFPPND SL+VD 
Sbjct: 1741 MSVDSFARRHRKPRMETQEDINQRMRSVELAVKEALSARGDKQFTDQEFPPNDRSLFVDT 1800

Query: 1801 KNPPSKLQVVSEWMRPLELIKEGRIESQPCLFSEAANPSDVCQGRLGDCWFLSAVAVLTE 1860
            +NPPSKLQVVSEWMRP  ++KE   +S+PCLFS  ANPSDVCQGRLGDCWFLSAVAVLTE
Sbjct: 1801 QNPPSKLQVVSEWMRPDSIVKENGSDSRPCLFSGDANPSDVCQGRLGDCWFLSAVAVLTE 1860

Query: 1861 ASKISEVIITPRYNDEGIYTVRFCIQSEWVPVVVDDWIPCESPGKPAFATSKKGNELWVS 1920
             S+ISEVIITP YN+EGIYTVRFCIQ EWVPVV+DDWIPCESPGKPAFATS+K NELWVS
Sbjct: 1861 VSRISEVIITPEYNEEGIYTVRFCIQGEWVPVVIDDWIPCESPGKPAFATSRKLNELWVS 1920

Query: 1921 ILEKAYAKLHGSYEALEGGLVQDALVDLTGGAGEEIDMRSAQAQIDLASGRLWSQLLRFK 1980
            ++EKAYAKLHGSYEALEGGLVQDALVDLTGGAGEEID+RSAQAQIDLASGRLWSQLLRFK
Sbjct: 1921 MVEKAYAKLHGSYEALEGGLVQDALVDLTGGAGEEIDLRSAQAQIDLASGRLWSQLLRFK 1980

Query: 1981 QEGFLLGAGSPSGSDVHISSSGIVQGHAYSLLQVREVDGHKLIQIRNPWANEVEWNGPWA 2040
            QEGFLLGAGSPSGSDVH+SSSGIVQGHAYS+LQVREVDGH+L+QIRNPWANEVEWNGPW+
Sbjct: 1981 QEGFLLGAGSPSGSDVHVSSSGIVQGHAYSVLQVREVDGHRLVQIRNPWANEVEWNGPWS 2040

Query: 2041 DTSPEWTDRMKHKLKHIPQSKDGIFWMSWQDFQIHFRSIYVCRIYPPEMRYSVHGQWRGY 2100
            D+SPEWTDRMKHKLKH+PQSK+GIFWMSWQDFQIHFRSIYVCR+YP EMRYSV+GQWRGY
Sbjct: 2041 DSSPEWTDRMKHKLKHVPQSKEGIFWMSWQDFQIHFRSIYVCRVYPREMRYSVNGQWRGY 2100

Query: 2101 SAGGCQDYDTWHQNPQFRLRASGPDASYPVHVFITLTQGVSFSRTAAGFRNYQSSHDSMM 2160
            SAGGCQDY +WHQNPQFRLRA+G DAS P+HVFITLTQGV FSRT  GFRNYQSSHDS +
Sbjct: 2101 SAGGCQDYSSWHQNPQFRLRATGSDASLPIHVFITLTQGVGFSRTTPGFRNYQSSHDSQL 2160

Query: 2161 FYIGMRILKTRGRRAAYNIYLHESVGGTDYVNSREISCEMVLEPDPKGYTIVPTTIHPGE 2204
            FYIG+RILKTRGRRAAYNI+LHESVGGTDYVNSREISCEMVL+PDPKGYTIVPTTIHPGE
Sbjct: 2161 FYIGLRILKTRGRRAAYNIFLHESVGGTDYVNSREISCEMVLDPDPKGYTIVPTTIHPGE 2179

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038879134.10.0e+0096.43calpain-type cysteine protease DEK1 isoform X1 [Benincasa hispida][more]
XP_011660057.10.0e+0095.88calpain-type cysteine protease DEK1 [Cucumis sativus] >KGN66358.1 hypothetical p... [more]
KAA0055719.10.0e+0095.93calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa][more]
XP_008451014.10.0e+0095.88PREDICTED: calpain-type cysteine protease DEK1 [Cucumis melo] >XP_008451015.1 PR... [more]
XP_022960712.10.0e+0094.34calpain-type cysteine protease DEK1-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q8RVL20.0e+0078.83Calpain-type cysteine protease DEK1 OS=Arabidopsis thaliana OX=3702 GN=DEK1 PE=1... [more]
Q6ZFZ40.0e+0072.54Calpain-type cysteine protease ADL1 OS=Oryza sativa subsp. japonica OX=39947 GN=... [more]
Q8RVL10.0e+0070.85Calpain-type cysteine protease DEK1 OS=Zea mays OX=4577 GN=DEK1 PE=1 SV=2[more]
P343088.9e-6234.69Calpain clp-1 OS=Caenorhabditis elegans OX=6239 GN=clp-1 PE=2 SV=4[more]
Q9VT659.9e-6131.59Calpain-B OS=Drosophila melanogaster OX=7227 GN=CalpB PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0LX510.0e+0095.88Calpain catalytic domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G... [more]
A0A5A7UQG40.0e+0095.93Calpain-type cysteine protease DEK1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6... [more]
A0A1S3BRA00.0e+0095.88calpain-type cysteine protease DEK1 OS=Cucumis melo OX=3656 GN=LOC103492422 PE=3... [more]
A0A6J1H8730.0e+0094.34calpain-type cysteine protease DEK1-like OS=Cucurbita moschata OX=3662 GN=LOC111... [more]
A0A6J1JFY50.0e+0094.16calpain-type cysteine protease DEK1-like OS=Cucurbita maxima OX=3661 GN=LOC11148... [more]
Match NameE-valueIdentityDescription
AT1G55350.30.0e+0078.83calpain-type cysteine protease family [more]
AT1G55350.10.0e+0078.83calpain-type cysteine protease family [more]
AT1G55350.20.0e+0078.83calpain-type cysteine protease family [more]
AT1G55350.40.0e+0078.83calpain-type cysteine protease family [more]
AT1G55350.50.0e+0077.85calpain-type cysteine protease family [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR022684Peptidase C2, calpain familyPRINTSPR00704CALPAINcoord: 1895..1922
score: 49.4
coord: 2025..2046
score: 46.82
coord: 2167..2195
score: 37.36
coord: 1807..1823
score: 68.24
coord: 1870..1893
score: 52.64
IPR022683Peptidase C2, calpain, domain IIISMARTSM007202calcoord: 2054..2201
e-value: 3.6E-39
score: 146.1
IPR001300Peptidase C2, calpain, catalytic domainSMARTSM00230cys_prot_2coord: 1732..2057
e-value: 1.4E-90
score: 316.9
IPR001300Peptidase C2, calpain, catalytic domainPFAMPF00648Peptidase_C2coord: 1748..2047
e-value: 3.6E-87
score: 292.1
IPR001300Peptidase C2, calpain, catalytic domainPROSITEPS50203CALPAIN_CATcoord: 1747..2049
score: 62.44796
IPR001300Peptidase C2, calpain, catalytic domainCDDcd00044CysPccoord: 1748..2047
e-value: 1.92641E-111
score: 355.486
NoneNo IPR availableGENE3D2.60.120.200coord: 1471..1640
e-value: 1.0E-10
score: 43.9
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 1925..2054
e-value: 2.7E-36
score: 126.1
NoneNo IPR availableGENE3D2.60.120.380coord: 2055..2203
e-value: 5.7E-31
score: 109.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 408..426
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 404..426
NoneNo IPR availablePANTHERPTHR10183CALPAINcoord: 1226..2183
NoneNo IPR availablePANTHERPTHR10183:SF379CALPAIN-5coord: 1226..2183
IPR022682Peptidase C2, calpain, large subunit, domain IIIPFAMPF01067Calpain_IIIcoord: 2059..2193
e-value: 1.1E-15
score: 58.2
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 1807..1818
IPR033883Calpain subdomain IIICDDcd00214Calpain_IIIcoord: 2053..2200
e-value: 4.06599E-38
score: 138.585
IPR013320Concanavalin A-like lectin/glucanase domain superfamilySUPERFAMILY49899Concanavalin A-like lectins/glucanasescoord: 1478..1660
IPR036213Calpain large subunit, domain III superfamilySUPERFAMILY49758Calpain large subunit, middle domain (domain III)coord: 2055..2200
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 1742..2058

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10007350.1HG10007350.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005737 cytoplasm
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004198 calcium-dependent cysteine-type endopeptidase activity