HG10021331 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021331
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCarboxyl-terminal-processing peptidase 1
LocationChr05: 7748802 .. 7762108 (-)
RNA-Seq ExpressionHG10021331
SyntenyHG10021331
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGAATGTCATCTTCTTCTTCAACTCCTTCTCCCCCTTCCATTTCAACTTTCTTCCCAAGTCTCCGCCATTTCTAACCTTCGTCAACTCTGAGAAGAAGAACTTCACCAATTCCCTTAACTTGGTCGATAAAACACTAATTGGAGCTCTTTCAGGAGTGCTCTCGTTCGGCCTTCTTCTCCATTCCCCTTCATCTGTTGCGTTAGATTATTCGTCTGTGGAATTTTTTTCTTTATCAGCTGATTCTTTGCCGTCTTCTTCGCTTTCCAATTCATCTGCGTCTTGTATTGATGACGAGCTACATGAATTTGGAAGCTCTGAGACTGTGTCCCCGCCGGCGACTAACGAGGATATCGTGCGGGAGGCTTGGGAAATTGTAAATGATAGCTTTCTAGATGCTAGTCGCCATCGCTGGTCTCCTGAAGCCTGGAAGGTATAAAGTATAAACTCCCCTGTGACCTCTCTCATTTGGACAAGATTCTACACTTTTTTTTTCGCCTATCGTGTGTTAGTTTATGAAATTATAGGGTAAATCAACGAATAGTTGAGGTTATTGTGTTAAGCTAGTAATCGGTCATGGAAGGTATTGAGAAATTTATTCGGATTTATGAACTAAATCACTTGTTATCTATCTTAACTCGTATGTCTACGTATATGCATGCAAGTAATTTGAAATTATGCCCAAGTAAATAATTGTTAAGAATGTCATTGCAAGCATGACTTAATGAGCACATCCATTGTTGCTTTCAGCAAAGGCAAGAAGACATTACTAATATTTCGATTCAAACTCGATCAAAGGCTCACAATATCATCCGGAGAATGTTAGCCAGCTTGGGCGATCCTTATACGCGTTTTCTCCCCCCTGCAGAGGTAACACTTCCTTTCACTTTTATACTTTCAGTTGCCGAGTTTTCTTCCTTGTAAAATGTTTGATGCTTCAAGAAAATGGTCAGATTCCAGCAATCAAGTTCCTCATTCGTCAAATGTTTAATTTCTTATTCATATTCTATTTTGCCTTTTGGTTTCTCTTTTGGAAGTTCTCCAAAATGGCGAGGTATGACATGACTGGTATTGGAATAAACCTTAGGGAAGTTCCAGATGATGATGGAGGCATGAAAATAAAGGTACTGGGACTCTTATTAGATGGTCCTGCACATTTGGCTGGCATTAGACAGGTAACTGAAGTACAACTACTTATCTGATTTTTTTTTTAAATGGAAACGAGCCTCATTCATTGAATTAATGAAAAGAGACTTATCTGATTTTAGAACCTGTTCACTTCAATGCTTATCAGAAACACATTTGATTGCTATTCTTCTGCCAAGTATGATCAATGACCTCATCTCTGCAATATATATCGTCTCTGATTATTATCTCAAGCTGAGCTGCCTTTAAAATTTGTATTTGGAAAATTTTACTAAACAATTTCTTTTTTTAGTTCAATAATTGCCTGAGTCAAGATTGAACCATCAACATTCATTGGTGTCTTATCCACTTAGCCATGATTGGAATCGGATTGGCTTGTATTTTAAAATTTATTTTCCCTTTAATTTTATTTGTGCTTATGAATTTCTTGGTTCTTTATAATGCTTGAAGTTCTTCATTTCACATTCACTAATTTGTAAATAATAGTTTTAGGTGCACAATATGCCACCATACTTCTTAAATCTGTTTCTGAAAACAATCTTTCAGATAGTGGATTCATCATGTTAGTTTCATCAGAAAAGATGACAATCAAGTCTTTTCTAGCAGAAAGCTTTCTTATTTCCGCATGTAGTTACTTTCCATGTTACCACTTGCTAATTTGGGATCTTATAGTGTGAAATAGTGGTGTGCACTTTCAATAATTTTTTTCTTCTTCCGTTAGGGAGATGAAATTTTAGCTGTAAATGGAGTGGATGCGAGAGGGAAATCAGCATTTGAAGTATCTTCATTACTACAAGGCCCAAATGAAACGCTTGTGACTGTTAAGGTTGTTATCGCAGATTTGTTTCAATTTGATAAAATGTCTTGTTAACTTCTTTCCTTTTTGTTATGTTTCTTTCAATCATCCAGGATGCATTTTTTATAGGTCAAGCATGGCAACTGTGGACCAGTAGAAAGTATACAAGTCCAAAGACAAGTTCTTGCTCGAACCCCTGTCTTTTACCGTCTAGAGCAAACGGATGTTGCCTCTTCTGTTGGGTACATCCGCCTAAAGGAATTCAATGCATTGGCTAAAAAAGACTTGGTTACTGGTAATTGTCGAATTAATTCAACTGCTCAATGTGTAAAGTTTCTTTTTAACTTATATAACTCCACATATGACTCGTGGATCATAAAAAGTTCACTGTAAAAGTAACATTTTAGGAATTGAGTTCTATCTATTTGAATTTAGGATTTTAAAATAAAGATCCAGTTCCTTTAAAATAGGACAAAGACTTGTATGGATTTGAATTATTATAAGCTGAGACCTGCGAATGTTTCTCTGCTTCTTTCTTTTCCTCCTGAAATATTTCTTCTCCTAAAGTATGGAAGATTTATTTTGGACCTTTGGTTTGGTTCAATTAATTTTACAGAGAGTTAAAATGGTGCACATATATTTTCGTTGAAATTGTGAATTTTTATAGACAAGTTCATCTCTTTCAATTCATCTCTTTATTTTTTCAGTTTCGCCTCAAATTTAACATGAATTTAAGAATCAACTCCATTCAAGTTTATCAAGTGTATTCCATGAAAGCACCATGGTTCCCAAAAACCCCAAAACATGAAAGTACCTCATATATCTCATTACTTTGGAAAGTCAAACTGAAGGAAAGACTAAGGAATACCATTTTAATTTATTGGTTATTGTATTGGGGCCGATAAAACCGAACAAAATAGATCGATGCCCACTACTACTGCAATTTTTTATGAACGTAGATCTGAATTGTTTCAATGCTCCCAAATACATGAACGTTTTCTAATAATAATAATGGCATAATTGTTTCAGCAATGAAGCGTCTTGAGGCCATGGGTGCCTCATATTTCATTTTGGATCTCAGAGATAATCTTGGTGGACTGGTGCAGGTATTCAAAATGGTTGTGTAGATGATTTTTAGTTTACTATTGCCCTCTTGCATGAGGAATATCCTTTTTTGCATGGCTATCTTTACAAAGTACGGTGGTAGCCAGCCAAAAGTTTCCCCATTGTGAATTCTCAAGTAGAAGTAAAATTATTACTTTTTTCTTCATAGACCTCTTAAGCCTCTTTACAAACATATATGTGAAATATGCGTTTGTAGCTTTCTATGTGTATATATGCTGTTGATTGAACTTTTACCAATTTTAACTGATGCTTATTTCAGGCTGGAATTGAAATTGCAAAGCTATTTCTGAATGAAGGTAGCACGGTATATGAATTTATTTCTACTCTCTCCTGACACTTATTTGTAACATTTCCTAGCTATCAAATTAAGGATTTCTGTGCTGTACTCTTAGGGTCTTAGTTCTGACCATCCATGCTCCAAACCCTGGAAGTAATTAGAAAGTATTTTGTTCGGTCGATGAAAAATGGGTAGCAAACGTACATCAGTTCAGGCCAAACTATTTTGTTGATGCTTGATTGCATTCATCACTTGCAGCCAGAGATGTTGCTAGTAGTATTAAAATAAAATGAGCTTACTTCCAGATATTTTTCTTAATAGAAAAATCACCATCTGTGACCAATAATTCGTTTGTTGATACTCTTCAATGTAATGGATTTCTTTAATCTTGCTTATTAGGAAAAGAAAGCTGGTGTGATACTATTTGATTACAGCTCCTTTTCCTGATCTACAGAAAATTATGATATGATGACTTGCTCAACTACCTTATTTGGTATGTTTTTTCCTGGTTGACTATATCCAGGTGATCTATACAGTTGGAAGGGATCCTCAATACCAAAAAACTGTTGTTGCAGACACAGGACCATTAGTTACAGCTCCAGTCGTGGTATCTAATGTTTTGTTATGAACTCTGTTTACTTTAAGCACACATGAAGAACTGAATGTCTTCTCATCTTAGCTGCATTAAGTTTTAGTTTCTATTAACCACTGACTTCTAAACGAGGTAACTCATCACTCTTTTTAATGTGGTTAACTTCGCTAATGAAACCAAATGTCTTTGTGTCTGGTTACTTTGTGTTTTTTTGAATCCTTTAGAAGGTATTTATGGCTCAGTTATATCATAGGGTGGCTTGTAAAAAATGGGTGTTATAAATCTTTCCCATCATGCTACTCTGTTAGTTAAGAACCTTATGTTTGTTTTGATTATAAAGAGAGAGTTGTTACTTGGTTCAGTAATATCAAAAGGAGCCTTGTAACAAATGGATGTGTTAGATCTTTCCTGCCATATGCTGGTCTGTCAGGGATGAACTTCATGCTTGTGTTGATTATGAGGGGAAAAACATGTATTTGTTTCTTTCTACAATGACTTCTTTACAAAATGAGAAAGCATAATGACTGTCCTGACACAAGCAGTCAGTTCAGTTTTAGTGCTTCAATATGTACAGATGAAAGAGAATGCAACTACCTGAAGAAGTCATCGAAAAATGATGAAACAGCGGTTGGAATGAACCTGTTTTAATTAATTCTCTACATATTAAAAAAGAAAAAAAAATGGAAAAAGAGAGTTTAAATAGTACTCATGTTCTTGGCTAAAACGTTCGGTAAGTTACCTACAGACCTTCCTTCATTAAAAACCTATGAAGTCAGCTTTATTGAAACTTCCTTTTTTGAGGAAACGGCTGAAGAAACCTCCTATTCTCAATCTAATTCAGCCCAATATTTGATATATTTTTAAATCATTAATCTCTAGCAGTGTGAATATAATTTGTTTGGTCACGGTTAGTTACATTTTGTTTCTCTCTAAAAGTCGCACTGATTAATAGGCATATGCTTCAATGTCTTGGACTTAATTAGAAAGCTCTCGACTACCATCCATGCAGGTTCTGGTGAACAAAAAAACTGCAAGTGCAAGTGAAATTGTGAGTACCTCGATCAACTAAATAACATATTGGATATTTGGTCACTTATATTTTACTTCCCGAGTTATTCTTACTTCAGGTTGCTTCCTCGCTGCATGATAATTGCAGAGCTGTTCTTGTGGGGGAAAGGACTTATGGCAAGGTTTAATCTATTAATCTCGGCTTTAGCTAGTGTTTTTTCCTATGTTCTTAATTTAACTTGAAATTTTCATATGAAGGAACTCTTTCCAAATGTGCTGCAACGTTTACATGTGGCTTCTTTACATCAATTTTCTTTGATTTTCTTTTCTACTCATCTAGTTTTGGTAGCAATCTGGATTGCCATAAATTGGACTTTAGAAACGACCTTGATATGCTTCATTAAAAGTGATGCTTTACCTTCATTGACAGGGTTTAATTCAGTCTGTTTTTGAACTTCATGACGGGTCTGGTGTTGCAGTCACTGTTGGGAAGTATGTTACCCCAAATCACAAAGACATAAATGGAAATGGAATCGAACCCGACTTCAAAAACTTCCCAGGTAGAGTAAATTACCCTCCAAAGAATGACAGGAATCTGGCCAACCTCACCTAACAAGAATGCCCCTTTATTGCAGCATGGAGTGATGTCACTGAACGTCTCTTACAATGCTCCATACTTCATCAAGGATAACACGGCTTTCCACAGGCGCAGTCGGTCTTCTTTCGCTTGGCTTTTTGCTGCTCTTCTCTCTTTAGAGTTTCTTGACGCAAGATCTCTAGTTTGCCACTCAATTTCATGAAAGCTCAGGGCATGAAAGTGGATCAAAATCAGTTTATCCACCCATGAGTTCATTAGCATTTGGAACACAGCTCAGTGTAGATTCACTACACTGAAAGGTGCAGACAAGAACCATGGAGAAGTTTTGCATCTAATAAGCCCTTCTTTTAAGCAAATAGTGGCATTGGAATTTTTTTTTTTAGTGATTCTCTTGTTGTAAATGGTTACTAGTGACATTAGTCAAATGTTCTGGAACTTCAGATTGTTTGAGAATTCGTCAGCTTGAAGAAATGTGCATTTATACTTTCTCAGTTAGAAACAACTTATTCATGCATTTTCATAGGAACCTTTTTGCTTGGTTTATACTCTTTGGATGTAATAAAGAGACTCTAATCATATTACGAAGTGGGTTGGTGTTTTCCAATTTAATAAATAATAAATAAATAAAGCTCGTCTATTTTCAAATATAAAAAAATGAGTCAAATTATTTATAAATTTAGAAAAATTTTACTATCTATCGTGATAGAATTCTATTACTTAAGCGATGGATCGATTTATTGCTGATAGACAGTAAATTTTTTTAATATTTGTAAATAGTTTGATATTTTTTTATGTTGATAGTAATTTTTCTTTTAATTAGGAAGTGGGTTGGTATTTCAAGTTTGTTCCATAATAATAAAAAAGAAAAATCCGTTAAAAAAAGTTCTCTAACTATTTAATTCACTTTTCAATTTTTAAAAGATTTCTAAAATCACTCTCCATTTTTCAAAAATTCTCTAAAATCATTTAAATTTTTCATAGCTCTCGTACTTCCCAAAAGTTTCCTCCAACTCTTCATCTTCCAATCTCTCCATTCAAAAGATTGAACTACAATTGTATTTCTGGCCTTTTTTTTAAAAAAAAAAAAACATTTTTGGTGTTGGGGCGGTGAGAACATACGCTGTTATATTCCCAGTTGATGACTTATTCATTTCTTTTCTAACATTGAGAACAACGACTTAAATAAAAGTCTTGGGGACAGTGACAACGACTTACATTTGGGGTATGGTAAATCATTTTTTATACATTTGAAGTATATTTCATACACTACGTAAATATCATAGGTTTATATGGCATGGTTTTTTTAGTATAATAATTGCGGGGATGAGGATCAAACTTTCAACCTTTAGAGAGGAAGGTCATGTCAATTACCATTGAGCTAACTTAGATTTATATTTCATAAATGTTGGGTATATTTTGTATGACATAAACATAAGATGTTTATATGGCATGGTCAATTTATTAATTGATCAATGTATGTACCATTACTACAAACTGAAAGAGCATATATGGTTCTGCCCTTGAATATATTTAACATTATAGGAATTGATTATATTATTCAAATGGGAATATGTAGGATTATATTGCTGTGATCTTTATGGCTCTTATTGCGTATATATTTTGCAAATCCACTGCTACACAAATACATTTGTCCTACATCCTCATACCTATTTCATACAAACAACGATGACATCTTAATAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGCATGAATAGGCCGTTTCCTAAGGTTGTATATGATTTATATATATATTTGTTTCATTCCACAATTATATTTCATAAATGTAGAGTATTTTTGGGAATACCATGTCTATAGATATATATATTCTTGTTTCATTCTATATATTTATCTTTTATAAATGTGGAATGTATTTATCTATACAATATCCTATTTCCTTCCATATTTATATTTTGTAATATTGAGGATATATAAGGGTATTGTTATTTGTTTGTTTGGGTTGTATATTTCCTTGTATTTTTTGGGCCTATTTATATCATTTGAGTTGATCAACAATAATAGTTGTCTATTGCATCCACTACCATGCCAATTTGCTCTACAAATTTTCATTTCTACTTCCTACTTCCACTTCTAATCTCTCTTTCTCTATTCTTGTTCTTATGGCGTCCCCTTTTTCTTCTACTTTCCTTATTTTCGTTTTAATATTCGCCGTCTTCTCCACTTCATGGTCGTCGACGATTGGAGTCGGATATATTTCCCTCCTTCTTGAAATTTAGGATCTCGAGAGGGTACATTTAATAATGCAAAATATGCATTTCTACCTAGTTTCATTTCATGTTTATATTTTATAAATGTGAGATGAGGTATATTTTATGAGACATATATTCTTCTTTTCATTCTTCTTCTTCTTCTTCTTTTTTTTTTTTTTTTTTTTTTTAAAAAAGATAATATATTCTTCTTTCATTCCATATTTATATATGTATCTCACAAATTTGTTATTCCTTTTCTTTGTTTTTCATTTTCCCAATTTTCTAATGTATGTTCTGGTCTTTTTGTGACCACTTTACCAATTTTCCGTCGTATAGTCGGTCTCTCTTTTTTATGCTATTCCAATTGTTTTTTAAATATATTTAATTATACCATGTTTTATATATATTTCTGTCTTTTGCAAAAAAAAAAACACGAAAAATACATGTTTTGTAAGCATATGTACATATAGTATATGCATTCTATATCAACTCTATCTAAGGGGGTGTTTGAGAGACTTGGAATATAATCAGGATTACTGAAAAAAAAGACGTGTTGGAATGCATGAGGTGCCGAAGGTGAAATGGAAATGAAATCGTGAAACGTGAAAACACGGTGGAAAAGAGAGTTTGGTTAAAAAATTTCGGAACAGAATGAGTTTAATATTATAAATCGGGCTCAAAACACTCAATTCAAATATGACTTTTGAATTACATTACAATCCAAACTCATTCCAATACGGACCCCCAAACAGCACCAATGCTATATTGTTGGTACGAAAAACTCGCATGCGGGCTGTGGAAGCTCTATCGACCAGCTCCACTGTTCGACCTCGTCGTCTAAGTTGACTTGCATGAGACTATTCACATCAATGTAGCGCCGAGTCGAACACATTCTAATGTCTAAGTTAGTACAATGAGTATTTAGTTCAACTAAACACAAAGAACTAAATGGAGCTATTTATAAGACTCTTCATCTAACACCCGCCTAGCAGTACGTGTAATGTTGGTGATGTTTGGACAATTGACATAATTTCTTCGACTCGATCGGCTGCCTTGAAACCCTATGGTTTGACTTTTCTCTTTTGAGACATACGCAGTTCTTCATACTACTTCTTAATTTTTCTTGAGTTTGGACCTGTTCGTGGACTTTGAACTAAGTCCACCCTCAACAAATATAATAGAATGAAAATTAAAAACTGCAAATTGAACATGCATGTTATACAAACATATTCATAATATCATACAAAATGTTGTTCACGAATGAACAAATTATACATATATGCATAAAATAAAACAACGATTTATTTATTTATTAAAAATGGAGGGAAGTAACCATTGATGGGGAAAAAAAAACAACTGTGAACAAACATTATTTTCATTGGAAGTGGAGTATGGAGAAAATAAGAATGGAAGAGGACAATGCTCAATAATTATATAAGGAATGAGTAATTTGGAAAGCAAAGAAGGTAGGGGATTATGTGGCTGATGTGAAATGGATTAACAAAATGTAGTTACTAAATCAGCTTGGATTCATGTTTTCTTTTTATTTTTTTAGAAAATACATTTTTATTTTGAATCTTATGAAAATGAACTAAAATTAAGTTTAATTAATTCGAGAAAAATACAACTATTAGAATTAGACATAACATAACTCGAAAATTGGTTAACTAGTGGAAATAATGTGAAATGTGGTCTTTTTTTTTCTACGTTAGACTACTATCATCTATCATTAATAAACAATAACATTTTTTTTTTTGCTATATTTAAAAATATTTCCAACAATTTTGTCATTTAAAACAATTACTCTTCAACCAACCAAAGTTGCCCTTCTTAAGGAGGAGATATAAAACAAAAGAAGAAAATAAAGTATTTGAAAGATGACACGTGATATTATTTTAATTCACGATCCCACACAATTCTTAAATGTTACTTATATAACAAATGATGAGTTGAAGTTTTCTAATATATGTGTTTTTTCGTATTTTGAAAAGGCGAATTTGATAATTATTCAAAATTTTATTTCTTTATTTATAATTTACTTTTCAAAATGGTATCTAAATCTTAAAAAATAAAAAACATAAAAATAAAAAACACACTTTCAAAATAGAAGCTTTCATTTGTAAAGTGTATCCAAATGAATTTGGCATACCTAAAAAAATTGAAAACGAAACATAGAAACTATGTATAACTATTTTTTTTTTTTAAAAAAAAAAAGAAACATAAAAAATTATAAAATGTAGAATTATCCAACCGAGTTTTAAAATATCGGTATTTTTTTTTTTAATTAAAAAGATCTGTAAGTGAACCTGTTTCAGTGTTTTGTATAGAAATAATGATTTTTTTTTAAAACAAAAAAACAAACAAAAACATATTTATGGCAATTTATTACCATGGTCCAAGGACAAGGGGATTGGATTTTGGATTTTACAACACTCTTCTATTACAATTTATTAAAAAGAAAGAAAAATCGGCTATGGAAATTTCTCTGTTTTCAATATACCTAAATAACTATGGAAGTTTTCAAGCAACCATATCCAAACAAACATATATTTTTCAACATTTATATATTCTCCATAATTTTATTTTTTTTTTTAAAAAAAAAGAAAAACCCATATTTAAATTAGCCTACTTTTGCATGTGCCTCTGCATATTCTTTATAACAAGGTATCATTTTTATTCCCATTTGAATGTTTGAAGTTTATATTCTTGAATCAATCACTCAGCAAAGGTAGGAGACAACATTCTTCATTACAGCCTCTAAGATGCACCAAAAACATATAATTCTCTCATTTCACACTCCCAACTTTCAATTGACTACTCATGTTGCTATTTCTATTTTCGAAAACCGTTTTCGAAGAATTATGATTGAATGACTATATAATATTCATGTCAATCAACATTCACAAACTTACGATTTTCGATTGAAGACAGAAATAACTTTTATAGCATAGTTGTCTTCTTTATATAATTTTTCTGTTCAAAGAACAGAAAAATTGTTCTGAAAACATAAGCCAGCTCCATATATTGTTATTTTGCCTTTCTTTCTTAGAGTACTTTTTTTGGGTTCTTTCTCTTGCTTGTGGTATCCAACTACTCCATTGGCTGCTTTCTCAAGTTTTCTTATATTCTTTCTCTTCTTGGTGATCCAAGAAAGCTTTGATTCTTAAGTTTAATGAATAAAAATATCTACAGTGAAAAGATAATCAGTCCTGTTCAAGAACAGTTCTGGTCTGGGGCTCTCTCATTTGAGCTATTTTTATTAACGCTTGCCTTCTCAGTCATTATAACTACTAAATCCAAGAAGAAGAAACTAGAGACAATACAGAAGTCAGCAGCAACTTAAAGTGATGGGTTTCAACATTCTGAGAAATTTGGCACTATTTACTCTCTATTCATTCTTACTCTTCTCAGGTAATGCTGGTTTTACCATTCTTTTTCTAGCTATTCCTATGCTCATTGTTAGAGAGGTTTGAAAGTAATAGTGAGAGAATAAGTTTGTATTGAAGTATATTTCTACTGGATATCATATGATGTTAATATGACAATGTCTGCAAACAAAATGACATTTTTATTTATATGAGAAACAAAGGATAACATTTTGAATCTAAGTTAATAGTGTCCATCAAGTAATTACCTTGGAATGGTCGCTGGCTGCGTAGTCTTATCCTAGAGTTTAATCTCTTTTGTAAAACAACATGCGTTGAGCACGCCCGCTAGAGGCCGAAAACAAAAATGACATGGTCAATATGAAAATCATGCAAGTGCTTTCTTCATACTTTACACTCATAGTTTAATTTGGAGATTGACTAAGATTTCTCCCCTTTTCATTTATTTACTTTAGATGTTGTCAATTCCACTAGATATTGAATTTTTTATGCAATTCAAATCCGTGGGGAAGATGTTTTTCTTTGCTGGATAGATCCTTTTTTTCTTGGTTGAGAGAATCCTCACTCTGTCAACTTAATAACTTTGAAAAAAAAGAATCCAAAATGGAAATGAATGTGTGCATCCAGGTTTGGAACTTACTCATATTTCAGTGATTGCAGCAAATTTTAAATAAAGAAGGTGAAAAATTGATGCAGTTTTGATCTCTCCAGGTTCAAGTTTTGCAAAGAAATCACCTATACAAGAAGAAAATCAAGAAAACATACCTCTGCAGAATGATGAAGACAGAACAATCTTCTCACCTTCTGTACATTTCACTCAGTTGGATGACACTACCATTGTTAACCCAACAACACCACCAGGAGGAACTCCAATTTCACCACCAAAGTCTGTACCAAACTTTAACCCCAATGTGAATCCCACCGCAGTGCAGGGGAATTCGGGTGGAGGTTCATGGTGTATTGCAAGCTCTGCAGCTTCCCCAACTGCTCTGCAGGTGGCTCTTGATTACGCTTGTGGCTATGGCGGTGCAGACTGTTCGCCGATTCAGCCAGGTGGGAGCTGCTATGACCCAAACACAGTGAAGGACCATGCCTCTTATGCCTTCAATGACTATTACCAGAAGAATCCAGCGGCTACTAGCTGTGTTTTTGGAGGAACAGCACAACTCGTTAGTACAGACCCAAGTATGGCATTTTCACCTTTCTGATTCATCATTAACCCCTGTTGTCACTTATGGTAGCCAAATTTAAGAGGTTTTTTGAACTACCAAACTTAGCTCAGACAGGTTCTATTGACTACAGGAGTTTCTACTTAATCTGCGCATCCTCCTAACATTTTTTTATCTAATAGAAAGTGACCATACTACAAAGTAACAAGAACTATTTTTTAAATCACAAACTATGCTTTTCTTTATAGGTAATCAATCCATTTGTGTAGTACCTAGTACTGCTACTTTTCATTCAACCTTTTTGTGAATGTGCCATCTCAACTTTATCTCTCTCGTCCTCCTTTCCTAGTAGAGAACCAGAGATGAGGAAACTTTAGTGACAAACTTCCCAAAAAGGTCATATCCTATCATCTAAAGATAAAAACTAAGTAGTGATGCCTAACAAAGTCATAGCTCAAAGGTGAAATAAAATCTCAGAAATTGGTCTGTTGTAATTTTTTTTATTAATAATTGTCTTTTCTATATTATATTGCAGGTAATGGTAACTGTCACTATGCAACATCCGGAGCTGTACCAAGGTAATTCCTTAAGAGCTCAGTACTCATTTTACTAAGAGACTATAAGTTACCCATAAAAGGAAAGAAATCAGTGAAGGAAGCAAAATCAGAGCTAACTAATTTCTTTATGTATTTGTCACAGCCCCACTCCACCAGCAAACCCAAACCCAACACCCCCGGCACCTGTAATCCCAACAATGCCACCACCAGCCACCACAATCCCAACAATGCCACCGCCAGCCACCACGAACCCAACATATACGCCCATAGATCCATCAATTTATGGTGCAGAACCATCAGGCATGCCCAGCTCAGCCACTTCAATATCAAAACGATCGGTGCTGCTCTTGACCATGACTTACCTTTTGGGTTTGCTCGTAGCAAATCATCTGTAA

mRNA sequence

ATGGCGAATGTCATCTTCTTCTTCAACTCCTTCTCCCCCTTCCATTTCAACTTTCTTCCCAAGTCTCCGCCATTTCTAACCTTCGTCAACTCTGAGAAGAAGAACTTCACCAATTCCCTTAACTTGGTCGATAAAACACTAATTGGAGCTCTTTCAGGAGTGCTCTCGTTCGGCCTTCTTCTCCATTCCCCTTCATCTGTTGCGTTAGATTATTCGTCTGTGGAATTTTTTTCTTTATCAGCTGATTCTTTGCCGTCTTCTTCGCTTTCCAATTCATCTGCGTCTTGTATTGATGACGAGCTACATGAATTTGGAAGCTCTGAGACTGTGTCCCCGCCGGCGACTAACGAGGATATCGTGCGGGAGGCTTGGGAAATTGTAAATGATAGCTTTCTAGATGCTAGTCGCCATCGCTGGTCTCCTGAAGCCTGGAAGCAAAGGCAAGAAGACATTACTAATATTTCGATTCAAACTCGATCAAAGGCTCACAATATCATCCGGAGAATGTTAGCCAGCTTGGGCGATCCTTATACGCGTTTTCTCCCCCCTGCAGAGTTCTCCAAAATGGCGAGGTATGACATGACTGGTATTGGAATAAACCTTAGGGAAGTTCCAGATGATGATGGAGGCATGAAAATAAAGGTACTGGGACTCTTATTAGATGGTCCTGCACATTTGGCTGGCATTAGACAGGGAGATGAAATTTTAGCTGTAAATGGAGTGGATGCGAGAGGGAAATCAGCATTTGAAGTATCTTCATTACTACAAGGCCCAAATGAAACGCTTGTGACTGTTAAGGTCAAGCATGGCAACTGTGGACCAGTAGAAAGTATACAAGTCCAAAGACAAGTTCTTGCTCGAACCCCTGTCTTTTACCGTCTAGAGCAAACGGATGTTGCCTCTTCTGTTGGGTACATCCGCCTAAAGGAATTCAATGCATTGGCTAAAAAAGACTTGGTTACTGCAATGAAGCGTCTTGAGGCCATGGGTGCCTCATATTTCATTTTGGATCTCAGAGATAATCTTGGTGGACTGGTGCAGGCTGGAATTGAAATTGCAAAGCTATTTCTGAATGAAGGTAGCACGGTGATCTATACAGTTGGAAGGGATCCTCAATACCAAAAAACTGTTGTTGCAGACACAGGACCATTAGTTACAGCTCCAGTCGTGGTTGCTTCCTCGCTGCATGATAATTGCAGAGCTGTTCTTGTGGGGGAAAGGACTTATGGCAAGGGTTTAATTCAGTCTGTTTTTGAACTTCATGACGGGTCTGGTGTTGCAGTCACTGTTGGGAAGTATGTTACCCCAAATCACAAAGACATAAATGGAAATGGAATCGAACCCGACTTCAAAAACTTCCCAGGTTCAAGTTTTGCAAAGAAATCACCTATACAAGAAGAAAATCAAGAAAACATACCTCTGCAGAATGATGAAGACAGAACAATCTTCTCACCTTCTGTACATTTCACTCAGTTGGATGACACTACCATTGTTAACCCAACAACACCACCAGGAGGAACTCCAATTTCACCACCAAAGTCTGTACCAAACTTTAACCCCAATGTGAATCCCACCGCAGTGCAGGGGAATTCGGGTGGAGGTTCATGGTGTATTGCAAGCTCTGCAGCTTCCCCAACTGCTCTGCAGGTGGCTCTTGATTACGCTTGTGGCTATGGCGGTGCAGACTGTTCGCCGATTCAGCCAGGTGGGAGCTGCTATGACCCAAACACAGTGAAGGACCATGCCTCTTATGCCTTCAATGACTATTACCAGAAGAATCCAGCGGCTACTAGCTGTGTTTTTGGAGGAACAGCACAACTCGTTAGTACAGACCCAAGTAATGGTAACTGTCACTATGCAACATCCGGAGCTGTACCAAGCCCCACTCCACCAGCAAACCCAAACCCAACACCCCCGGCACCTGTAATCCCAACAATGCCACCACCAGCCACCACAATCCCAACAATGCCACCGCCAGCCACCACGAACCCAACATATACGCCCATAGATCCATCAATTTATGGTGCAGAACCATCAGGCATGCCCAGCTCAGCCACTTCAATATCAAAACGATCGGTGCTGCTCTTGACCATGACTTACCTTTTGGGTTTGCTCGTAGCAAATCATCTGTAA

Coding sequence (CDS)

ATGGCGAATGTCATCTTCTTCTTCAACTCCTTCTCCCCCTTCCATTTCAACTTTCTTCCCAAGTCTCCGCCATTTCTAACCTTCGTCAACTCTGAGAAGAAGAACTTCACCAATTCCCTTAACTTGGTCGATAAAACACTAATTGGAGCTCTTTCAGGAGTGCTCTCGTTCGGCCTTCTTCTCCATTCCCCTTCATCTGTTGCGTTAGATTATTCGTCTGTGGAATTTTTTTCTTTATCAGCTGATTCTTTGCCGTCTTCTTCGCTTTCCAATTCATCTGCGTCTTGTATTGATGACGAGCTACATGAATTTGGAAGCTCTGAGACTGTGTCCCCGCCGGCGACTAACGAGGATATCGTGCGGGAGGCTTGGGAAATTGTAAATGATAGCTTTCTAGATGCTAGTCGCCATCGCTGGTCTCCTGAAGCCTGGAAGCAAAGGCAAGAAGACATTACTAATATTTCGATTCAAACTCGATCAAAGGCTCACAATATCATCCGGAGAATGTTAGCCAGCTTGGGCGATCCTTATACGCGTTTTCTCCCCCCTGCAGAGTTCTCCAAAATGGCGAGGTATGACATGACTGGTATTGGAATAAACCTTAGGGAAGTTCCAGATGATGATGGAGGCATGAAAATAAAGGTACTGGGACTCTTATTAGATGGTCCTGCACATTTGGCTGGCATTAGACAGGGAGATGAAATTTTAGCTGTAAATGGAGTGGATGCGAGAGGGAAATCAGCATTTGAAGTATCTTCATTACTACAAGGCCCAAATGAAACGCTTGTGACTGTTAAGGTCAAGCATGGCAACTGTGGACCAGTAGAAAGTATACAAGTCCAAAGACAAGTTCTTGCTCGAACCCCTGTCTTTTACCGTCTAGAGCAAACGGATGTTGCCTCTTCTGTTGGGTACATCCGCCTAAAGGAATTCAATGCATTGGCTAAAAAAGACTTGGTTACTGCAATGAAGCGTCTTGAGGCCATGGGTGCCTCATATTTCATTTTGGATCTCAGAGATAATCTTGGTGGACTGGTGCAGGCTGGAATTGAAATTGCAAAGCTATTTCTGAATGAAGGTAGCACGGTGATCTATACAGTTGGAAGGGATCCTCAATACCAAAAAACTGTTGTTGCAGACACAGGACCATTAGTTACAGCTCCAGTCGTGGTTGCTTCCTCGCTGCATGATAATTGCAGAGCTGTTCTTGTGGGGGAAAGGACTTATGGCAAGGGTTTAATTCAGTCTGTTTTTGAACTTCATGACGGGTCTGGTGTTGCAGTCACTGTTGGGAAGTATGTTACCCCAAATCACAAAGACATAAATGGAAATGGAATCGAACCCGACTTCAAAAACTTCCCAGGTTCAAGTTTTGCAAAGAAATCACCTATACAAGAAGAAAATCAAGAAAACATACCTCTGCAGAATGATGAAGACAGAACAATCTTCTCACCTTCTGTACATTTCACTCAGTTGGATGACACTACCATTGTTAACCCAACAACACCACCAGGAGGAACTCCAATTTCACCACCAAAGTCTGTACCAAACTTTAACCCCAATGTGAATCCCACCGCAGTGCAGGGGAATTCGGGTGGAGGTTCATGGTGTATTGCAAGCTCTGCAGCTTCCCCAACTGCTCTGCAGGTGGCTCTTGATTACGCTTGTGGCTATGGCGGTGCAGACTGTTCGCCGATTCAGCCAGGTGGGAGCTGCTATGACCCAAACACAGTGAAGGACCATGCCTCTTATGCCTTCAATGACTATTACCAGAAGAATCCAGCGGCTACTAGCTGTGTTTTTGGAGGAACAGCACAACTCGTTAGTACAGACCCAAGTAATGGTAACTGTCACTATGCAACATCCGGAGCTGTACCAAGCCCCACTCCACCAGCAAACCCAAACCCAACACCCCCGGCACCTGTAATCCCAACAATGCCACCACCAGCCACCACAATCCCAACAATGCCACCGCCAGCCACCACGAACCCAACATATACGCCCATAGATCCATCAATTTATGGTGCAGAACCATCAGGCATGCCCAGCTCAGCCACTTCAATATCAAAACGATCGGTGCTGCTCTTGACCATGACTTACCTTTTGGGTTTGCTCGTAGCAAATCATCTGTAA

Protein sequence

MANVIFFFNSFSPFHFNFLPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPPATNEDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVVVASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSSFAKKSPIQEENQENIPLQNDEDRTIFSPSVHFTQLDDTTIVNPTTPPGGTPISPPKSVPNFNPNVNPTAVQGNSGGGSWCIASSAASPTALQVALDYACGYGGADCSPIQPGGSCYDPNTVKDHASYAFNDYYQKNPAATSCVFGGTAQLVSTDPSNGNCHYATSGAVPSPTPPANPNPTPPAPVIPTMPPPATTIPTMPPPATTNPTYTPIDPSIYGAEPSGMPSSATSISKRSVLLLTMTYLLGLLVANHL
Homology
BLAST of HG10021331 vs. NCBI nr
Match: XP_038894565.1 (carboxyl-terminal-processing peptidase 1, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 831.2 bits (2146), Expect = 6.5e-237
Identity = 429/473 (90.70%), Postives = 445/473 (94.08%), Query Frame = 0

Query: 1   MANVIFFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSF 60
           MANVIFF +SFSP HF+F   LPK PPFL+ VNSEKKNF+NSLNLVDKTLIGALSGVLSF
Sbjct: 1   MANVIFFSSSFSPSHFHFPSLLPKPPPFLSVVNSEKKNFSNSLNLVDKTLIGALSGVLSF 60

Query: 61  GLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPPATNE 120
           G LLHSPSSVALDYS+V+FFSLSADSLPSS+LS+SSASCI+DELHE GSSETVSPPATNE
Sbjct: 61  GFLLHSPSSVALDYSAVDFFSLSADSLPSSTLSDSSASCIEDELHEIGSSETVSPPATNE 120

Query: 121 DIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPY 180
           DIV+EAWEIVNDS+LDA RHRWSPE+WKQRQEDITNISIQTRSKAHNIIRRMLASLGDPY
Sbjct: 121 DIVQEAWEIVNDSYLDAGRHRWSPESWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPY 180

Query: 181 TRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILA 240
           TRFLPPAEFSKMARYDMTGIGINLREVPDD+GGMKIKVLGLLLDGPAHLAGIRQGDE+LA
Sbjct: 181 TRFLPPAEFSKMARYDMTGIGINLREVPDDNGGMKIKVLGLLLDGPAHLAGIRQGDEVLA 240

Query: 241 VNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQT 300
           VNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQ 
Sbjct: 241 VNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQM 300

Query: 301 DVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFL 360
           D ASSVGYIRLKEFNALAKKDLVTAM RLEAMGASYFILDLRDNLGGLVQAGIEI+KLFL
Sbjct: 301 DAASSVGYIRLKEFNALAKKDLVTAMMRLEAMGASYFILDLRDNLGGLVQAGIEISKLFL 360

Query: 361 NEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAVLV 420
           NEGSTVIYTVGRDPQYQKTVVADTGPLVT+PVV             VASSLHDNCRAVLV
Sbjct: 361 NEGSTVIYTVGRDPQYQKTVVADTGPLVTSPVVVLVNKRTASASEIVASSLHDNCRAVLV 420

Query: 421 GERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSS 458
           GERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFP  S
Sbjct: 421 GERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPAWS 473

BLAST of HG10021331 vs. NCBI nr
Match: XP_008443944.1 (PREDICTED: carboxyl-terminal-processing peptidase 1, chloroplastic [Cucumis melo] >KAA0057097.1 carboxyl-terminal-processing peptidase 1 [Cucumis melo var. makuwa] >TYK20632.1 carboxyl-terminal-processing peptidase 1 [Cucumis melo var. makuwa])

HSP 1 Score: 804.3 bits (2076), Expect = 8.5e-229
Identity = 419/475 (88.21%), Postives = 441/475 (92.84%), Query Frame = 0

Query: 1   MANVI-FFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLS 60
           MA+VI FFF+SFSP H +F   LP  PPF++FVNS+KK+F+NSLNLVDKTL+GALSGVLS
Sbjct: 1   MASVIFFFFSSFSPSHLHFPSLLPMPPPFISFVNSDKKSFSNSLNLVDKTLVGALSGVLS 60

Query: 61  FGLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCID-DELHEFGSSETVSPPAT 120
           FGLLLHSPSSVALD+S+V+FFSLS+DSLPSSSL +SS SC+D D+LHEFGSSET SPPAT
Sbjct: 61  FGLLLHSPSSVALDHSAVDFFSLSSDSLPSSSLFDSSTSCLDEDQLHEFGSSETGSPPAT 120

Query: 121 NEDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD 180
           NEDIVREAWEIVNDSFLDA R+RWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD
Sbjct: 121 NEDIVREAWEIVNDSFLDAGRNRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD 180

Query: 181 PYTRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEI 240
           PYTRFLPPAEFSKMARYDMTGIGINLREVPDD+GGMKIKVLGLLLDGPAHLAG+RQGDEI
Sbjct: 181 PYTRFLPPAEFSKMARYDMTGIGINLREVPDDNGGMKIKVLGLLLDGPAHLAGVRQGDEI 240

Query: 241 LAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE 300
           LAVNGV+A GKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE
Sbjct: 241 LAVNGVEAGGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE 300

Query: 301 QTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL 360
           Q D  SSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL
Sbjct: 301 QMDANSSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL 360

Query: 361 FLNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAV 420
           FLNEGSTVIYTVGRDPQYQKTVVAD GPLV APVV             VASSLHDNC+AV
Sbjct: 361 FLNEGSTVIYTVGRDPQYQKTVVADAGPLVKAPVVVLVNKRTASASEIVASSLHDNCKAV 420

Query: 421 LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSS 458
           LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF+NFP  S
Sbjct: 421 LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQNFPAWS 475

BLAST of HG10021331 vs. NCBI nr
Match: XP_004147402.1 (carboxyl-terminal-processing peptidase 1, chloroplastic isoform X1 [Cucumis sativus] >KGN65570.1 hypothetical protein Csa_019921 [Cucumis sativus])

HSP 1 Score: 797.7 bits (2059), Expect = 8.0e-227
Identity = 417/474 (87.97%), Postives = 436/474 (91.98%), Query Frame = 0

Query: 1   MANVIFFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSF 60
           MA+VIFFF+SFSP H +F   LP  PPF +FVNSEKK+F+NSLNLVDKTLIGA+SGVLSF
Sbjct: 1   MASVIFFFSSFSPSHLHFPSLLPMPPPFPSFVNSEKKSFSNSLNLVDKTLIGAISGVLSF 60

Query: 61  GLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCID-DELHEFGSSETVSPPATN 120
           GLLLHSPSSVALDYS+V+FFSLS+ SLPSSSLS+SSASCID DELHEFGSSETVS PATN
Sbjct: 61  GLLLHSPSSVALDYSAVDFFSLSSHSLPSSSLSDSSASCIDEDELHEFGSSETVSSPATN 120

Query: 121 EDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP 180
           EDIVREAWEIVNDSFLD+ R+RWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP
Sbjct: 121 EDIVREAWEIVNDSFLDSGRNRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP 180

Query: 181 YTRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEIL 240
           YTRFLPPAEFSKMARYDMTGIGINLREVPDD+G MKIKVLGLLLDGPAHLAG+RQGDEI+
Sbjct: 181 YTRFLPPAEFSKMARYDMTGIGINLREVPDDNGVMKIKVLGLLLDGPAHLAGVRQGDEIV 240

Query: 241 AVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQ 300
           AVNGVDA GKSAFEVSSLLQGPNETLVTVKV HGNCGPVESIQVQRQVLARTPVFYRLEQ
Sbjct: 241 AVNGVDAGGKSAFEVSSLLQGPNETLVTVKVMHGNCGPVESIQVQRQVLARTPVFYRLEQ 300

Query: 301 TDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLF 360
            D  SSVGYIRLKEFN LAKKDLVTA KRLEAMGASYFILDLRDNLGGLVQAGIEIAKLF
Sbjct: 301 MDATSSVGYIRLKEFNGLAKKDLVTATKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLF 360

Query: 361 LNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAVL 420
           LNEGSTVIYTVGRDPQYQKTVVAD  PLV APVV             VASSLHDNC+AVL
Sbjct: 361 LNEGSTVIYTVGRDPQYQKTVVADAEPLVKAPVVVLVNKRTASASEIVASSLHDNCKAVL 420

Query: 421 VGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSS 458
           VGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF++FP  S
Sbjct: 421 VGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQSFPAWS 474

BLAST of HG10021331 vs. NCBI nr
Match: XP_022140206.1 (carboxyl-terminal-processing peptidase 1, chloroplastic [Momordica charantia])

HSP 1 Score: 793.5 bits (2048), Expect = 1.5e-225
Identity = 410/474 (86.50%), Postives = 428/474 (90.30%), Query Frame = 0

Query: 1   MANVIFFFNSFSPFHFNFLPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLL 60
           M NVIFFFNS SPFHF  LPK PPF+TF+NS+ +   NS  LVDKTLIGA+SGVLSFGLL
Sbjct: 1   MTNVIFFFNSLSPFHFQ-LPKPPPFITFINSQNRTSANSATLVDKTLIGAVSGVLSFGLL 60

Query: 61  LHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPPATNEDIV 120
            HSP SVALDYSSVE FSLSADSLPSSS S   +SC +DEL EFG+SET S PATNEDIV
Sbjct: 61  FHSPLSVALDYSSVETFSLSADSLPSSSPSTYDSSCNEDELREFGNSETGSSPATNEDIV 120

Query: 121 REAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRF 180
           REAWEIVNDSFLDA RHRWSPEAWKQ+Q+DI NISIQ+RSKAHNIIR+MLASLGDPYTRF
Sbjct: 121 REAWEIVNDSFLDAGRHRWSPEAWKQKQDDIMNISIQSRSKAHNIIRKMLASLGDPYTRF 180

Query: 181 LPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNG 240
           LPPAEFSKMARYDMTGIGINLREVPDD+GG+KIKVLGLLLDGPAH AG+RQGDEILAVNG
Sbjct: 181 LPPAEFSKMARYDMTGIGINLREVPDDNGGIKIKVLGLLLDGPAHSAGVRQGDEILAVNG 240

Query: 241 VDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTDVA 300
           VDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVES+QVQRQVLARTPVFYRLEQ D A
Sbjct: 241 VDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESMQVQRQVLARTPVFYRLEQMDFA 300

Query: 301 SSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEG 360
           SSVGYIRLKEFNALAKKDLVTAMKRLE MGASYFILDLRDNLGGLVQAGIEIAKLFLNEG
Sbjct: 301 SSVGYIRLKEFNALAKKDLVTAMKRLEDMGASYFILDLRDNLGGLVQAGIEIAKLFLNEG 360

Query: 361 STVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAVLVGER 420
           STVIYTVGRDPQYQKTV+ADTGPLVTAPVV             VASSLHDNCRAVLVGER
Sbjct: 361 STVIYTVGRDPQYQKTVIADTGPLVTAPVVVLVNQKTASASEIVASSLHDNCRAVLVGER 420

Query: 421 TYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSSFAKK 462
           TYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF+NFP  S   K
Sbjct: 421 TYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQNFPAWSDVTK 473

BLAST of HG10021331 vs. NCBI nr
Match: XP_023542920.1 (carboxyl-terminal-processing peptidase 1, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 785.8 bits (2028), Expect = 3.1e-223
Identity = 407/470 (86.60%), Postives = 425/470 (90.43%), Query Frame = 0

Query: 1   MANVIFFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSF 60
           M NVIFFF SFS  HF F   +PK PPFL+F++ +KK+ +NSLN VDKTLIGALSGVLSF
Sbjct: 1   MTNVIFFFKSFSSLHFQFPSLVPKPPPFLSFLSLQKKSSSNSLNSVDKTLIGALSGVLSF 60

Query: 61  GLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPPATNE 120
           GLL HSP SVALDYSSVE FSLSADS P    S+SSASC++DEL +FG+SETVSPP TNE
Sbjct: 61  GLLFHSPLSVALDYSSVEIFSLSADSSP----SDSSASCVEDELPDFGNSETVSPPVTNE 120

Query: 121 DIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPY 180
           DIV+EAWEIVNDSFLDA  HRWSPEAWKQRQEDI N+SIQTRSKAHNIIRRMLASLGDPY
Sbjct: 121 DIVQEAWEIVNDSFLDAGHHRWSPEAWKQRQEDIMNMSIQTRSKAHNIIRRMLASLGDPY 180

Query: 181 TRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILA 240
           TRFLPPAEFSKMARYDMTGIGINLREVPDD GGMKIKVLGLLLDGPAHLAGIRQGDE+LA
Sbjct: 181 TRFLPPAEFSKMARYDMTGIGINLREVPDDSGGMKIKVLGLLLDGPAHLAGIRQGDEVLA 240

Query: 241 VNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQT 300
           VNGVDARGKSAFEVSSLLQGPNET VTVKVKHGNCGP ESIQVQRQVL R+PVFYRLEQ 
Sbjct: 241 VNGVDARGKSAFEVSSLLQGPNETQVTVKVKHGNCGPEESIQVQRQVLVRSPVFYRLEQI 300

Query: 301 DVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFL 360
           D ASSVGY+RLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFL
Sbjct: 301 DAASSVGYVRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFL 360

Query: 361 NEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAVLV 420
           NEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV             VASSLHDNCRAVLV
Sbjct: 361 NEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVVVLVNKKTASASEIVASSLHDNCRAVLV 420

Query: 421 GERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFP 455
           GERTYGKGLIQSVFELHD SGVAVTVGKYVTPNHKDINGNGIEPDF+NFP
Sbjct: 421 GERTYGKGLIQSVFELHDRSGVAVTVGKYVTPNHKDINGNGIEPDFQNFP 466

BLAST of HG10021331 vs. ExPASy Swiss-Prot
Match: F4KHG6 (Carboxyl-terminal-processing peptidase 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CTPA1 PE=1 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 1.8e-141
Identity = 274/463 (59.18%), Postives = 340/463 (73.43%), Query Frame = 0

Query: 12  SPFHFNFLPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLLLHSP-SSVAL- 71
           SP    F+P+ PP   F      +++    ++ K++IG L+G LS  L+  SP SSVA  
Sbjct: 16  SPSTPQFIPELPPPSQF------DYSGLTKILKKSVIGTLTGALSLTLVFSSPISSVAAT 75

Query: 72  --DYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPP--ATNEDIVREAWE 131
              Y SV   S S +S   +   ++   C ++E  +    +    P   TNE IV EAWE
Sbjct: 76  NDPYLSVNPPSSSFES-SLNHFDSAPEDCPNEEEADTEIQDDDIEPQLVTNEGIVEEAWE 135

Query: 132 IVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRFLPPAE 191
           IVN +FLD   H W+PE W+++++DI    I++RSKAH +I+ MLASLGD YTRFL P E
Sbjct: 136 IVNGAFLDTRSHSWTPETWQKQKDDILASPIKSRSKAHEVIKNMLASLGDQYTRFLSPDE 195

Query: 192 FSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNGVDARG 251
           FS+M++YD+TGIGINLREV D  G +K+KVLGL+LD  A +AG++QGDEILAVNG+D  G
Sbjct: 196 FSRMSKYDITGIGINLREVSDGGGNVKLKVLGLVLDSAADIAGVKQGDEILAVNGMDVSG 255

Query: 252 KSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTDVAS-SVG 311
           KS+FEVSSLLQGP++T V +KVKHG CGPV+S+++QRQV A+TPV YRLE+ D  + SVG
Sbjct: 256 KSSFEVSSLLQGPSKTFVVLKVKHGKCGPVKSLKIQRQVNAQTPVSYRLEKVDNGTVSVG 315

Query: 312 YIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEGSTVI 371
           YIRLKEFNALA+KDLV AMKRL   GASYF++DLRDNLGGLVQAGIE AKLFL+EG TVI
Sbjct: 316 YIRLKEFNALARKDLVIAMKRLLDKGASYFVMDLRDNLGGLVQAGIETAKLFLDEGDTVI 375

Query: 372 YTVGRDPQYQKTVVADTGPLVTAPV-------------VVASSLHDNCRAVLVGERTYGK 431
           YT GRDP+ QKTVV+D  PL+TAP+             +VAS+LHDNC+AVLVGERTYGK
Sbjct: 376 YTAGRDPEAQKTVVSDKKPLITAPLIVMVNNRTASASEIVASALHDNCKAVLVGERTYGK 435

Query: 432 GLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFP 455
           GLIQSV+EL DGSGV VT+GKYVTPNH DING GIEPDF+N P
Sbjct: 436 GLIQSVYELRDGSGVVVTIGKYVTPNHMDINGGGIEPDFRNLP 471

BLAST of HG10021331 vs. ExPASy Swiss-Prot
Match: P42784 (Carboxyl-terminal-processing protease OS=Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) OX=32049 GN=ctpA PE=3 SV=2)

HSP 1 Score: 234.6 bits (597), Expect = 3.5e-60
Identity = 135/362 (37.29%), Postives = 217/362 (59.94%), Query Frame = 0

Query: 104 FGSSETVSPPATNEDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAH 163
           FG  E        +D++ +AW  V+ +++D +   ++ + W   ++      ++TR +A+
Sbjct: 21  FGPMERAIAFTDEQDLLLQAWRYVSQAYVDET---FNHQNWWLIRQKFLKRPLKTRDEAY 80

Query: 164 NIIRRMLASLGDPYTRFLPPAEFSKM---ARYDMTGIGINLREVPDDDGGMKIKVLGLLL 223
             +  MLA L DPYTR L P ++  +      +++G+G+ +   P+ D    ++V+  L 
Sbjct: 81  EAVGEMLALLDDPYTRLLRPEQYRSLKVSTSGELSGVGLQINVNPEVD---VLEVILPLP 140

Query: 224 DGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQV 283
             PA  AGI   D+ILA++G+D R     E ++ ++G   + V++ VK      V +++V
Sbjct: 141 GSPAEAAGIEAKDQILAIDGIDTRNIGLEEAAARMRGKKGSTVSLTVKSPKTDTVRTVKV 200

Query: 284 QRQVLARTPVFYRLEQTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRD 343
            R  +A  PV+ +L++ +    VGYIRL +F+A AK +++ ++ +L+  GA  ++LDLR+
Sbjct: 201 TRDTIALNPVYDKLDEKN-GEKVGYIRLNQFSANAKTEIIKSLNQLQKQGADRYVLDLRN 260

Query: 344 NLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVVV--------- 403
           N GGL+QAGIEIA+L+L++  T++YTV R   ++ +  A   PL  AP+VV         
Sbjct: 261 NPGGLLQAGIEIARLWLDQ-ETIVYTVNRQGIFE-SYSAVGQPLTDAPLVVLVNQATASA 320

Query: 404 ----ASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIE 450
               A +L DN RA+LVGE+T+GKGLIQS+FEL DG+G+AVTV KY TP H DIN  GI 
Sbjct: 321 SEILAGALQDNGRAMLVGEKTFGKGLIQSLFELPDGAGMAVTVAKYETPLHHDINKLGIM 373

BLAST of HG10021331 vs. ExPASy Swiss-Prot
Match: Q55669 (Carboxyl-terminal-processing protease OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=ctpA PE=3 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 2.3e-59
Identity = 129/348 (37.07%), Postives = 212/348 (60.92%), Query Frame = 0

Query: 117 EDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP 176
           + ++ ++W +VN S+LD +   ++ + W   +E      ++ R + +  I  MLA+L +P
Sbjct: 36  QKLLLQSWRLVNQSYLDET---FNHQNWWLLREKYVKRPLRNREETYTAIEEMLATLDEP 95

Query: 177 YTRFLPPAEFSKM---ARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGD 236
           +TR L P ++  +      +++G+G+ +   P+ +   +++++  L   PA  AG++  D
Sbjct: 96  FTRLLRPEQYGNLQVTTTGELSGVGLQININPETN---QLEIMAPLAGSPAEEAGLQPHD 155

Query: 237 EILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYR 296
           +ILA++GVD +  S  E ++ ++GP  T V++++        +   + RQ+++ +PV  +
Sbjct: 156 QILAIDGVDTQTLSLDEAAARMRGPKNTKVSLEILSAGTEVPQEFTLTRQLISLSPVAAQ 215

Query: 297 LEQTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIA 356
           L+ +    SVGYIRL +F+A A K++  A+ +LE  GA  +ILDLR+N GGL+QAGI+IA
Sbjct: 216 LDDSRPGQSVGYIRLSQFSANAYKEVAHALHQLEEQGADGYILDLRNNPGGLLQAGIDIA 275

Query: 357 KLFLNEGSTVIYTVGRDPQYQKTV----VADTGPLV--------TAPVVVASSLHDNCRA 416
           +L+L E ST++YTV R    +        A   PLV        +A  ++A +L DN RA
Sbjct: 276 RLWLPE-STIVYTVNRQGTQESFTANGEAATDRPLVVLVNQGTASASEILAGALQDNQRA 335

Query: 417 VLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPD 450
            LVGE+T+GKGLIQS+FEL DG+G+AVTV KY TP H DI+  GI PD
Sbjct: 336 TLVGEKTFGKGLIQSLFELSDGAGIAVTVAKYETPQHHDIHKLGIMPD 376

BLAST of HG10021331 vs. ExPASy Swiss-Prot
Match: O23614 (Carboxyl-terminal-processing peptidase 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CTPA2 PE=1 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 2.2e-46
Identity = 127/375 (33.87%), Postives = 205/375 (54.67%), Query Frame = 0

Query: 109 TVSPPA---TNEDIV-REAWEIVNDSFLDASRHRWSPEAW-KQRQEDITNISIQTRSKAH 168
           T SPP+   T E+++  EAW  ++ +++D +   ++ ++W + R+  + N  + TR + +
Sbjct: 119 TDSPPSWGLTEENLLFLEAWRTIDRAYIDKT---FNGQSWFRYRETALRNEPMNTREETY 178

Query: 169 NIIRRMLASLGDPYTRFLPPAEFSKM---ARYDMTGIGINLREVPDDDG-GMKIKVLGLL 228
             I++M+A+L DP+TRFL P +F  +    +  +TG+G+++      DG    + V+   
Sbjct: 179 MAIKKMVATLDDPFTRFLEPGKFKSLRSGTQGAVTGVGLSIGYPTASDGPPAGLVVISAA 238

Query: 229 LDGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQ 288
             GPA+ AGI  GD I  ++       + ++ + +LQGP  + V + ++ G       + 
Sbjct: 239 PGGPANRAGILPGDVIQGIDNTTTETLTIYDAAQMLQGPEGSAVELAIRSG--PETRLLT 298

Query: 289 VQRQVLARTPVFYRLEQTDVASS----VGYIRLKEFNALAKKDLVTAMKRLEAMGASYFI 348
           + R+ ++  PV  RL +   + S    +GYI+L  FN  A   +  A++ L     + F+
Sbjct: 299 LTRERVSVNPVKSRLCELPGSGSNSPKIGYIKLTTFNQNASSAVREAIETLRGNNVNAFV 358

Query: 349 LDLRDNLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQ--YQ---KTVVADTGPL------ 408
           LDLRDN GG    GIEIAK +L++G  V     R  +  Y       +A + PL      
Sbjct: 359 LDLRDNSGGSFPEGIEIAKFWLDKGVIVYICDSRGVRDIYDTDGSNAIATSEPLAVLVNK 418

Query: 409 --VTAPVVVASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDIN 455
              +A  ++A +L DN RA++ GE TYGKG IQSVFEL DGSG+AVTV +Y TP H DI+
Sbjct: 419 GTASASEILAGALKDNKRALVYGEPTYGKGKIQSVFELSDGSGLAVTVARYETPAHTDID 478

BLAST of HG10021331 vs. ExPASy Swiss-Prot
Match: O04073 (C-terminal processing peptidase, chloroplastic OS=Tetradesmus obliquus OX=3088 GN=ctpA PE=1 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 4.2e-45
Identity = 127/381 (33.33%), Postives = 202/381 (53.02%), Query Frame = 0

Query: 105 GSSETVSPPATNEDIV-REAWEIVNDSFLDASRHRWSPEAW-KQRQEDITNISIQTRSKA 164
           G++   +   T+E ++  EAW  V+ +++D S   ++ ++W K R+  +    +  R++ 
Sbjct: 69  GAAALPAQAVTSEQLLFLEAWRAVDRAYVDKS---FNGQSWFKLRETYLKKEPMDRRAQT 128

Query: 165 HNIIRRMLASLGDPYTRFLPPAEFSKMAR---YDMTGIGINLREVPDDDGGMKIKVLGLL 224
           ++ IR++LA L DP+TRFL P+  + + R     +TG+G+ +    D   G  + VL   
Sbjct: 129 YDAIRKLLAVLDDPFTRFLEPSRLAALRRGTAGSVTGVGLEI--TYDGGSGKDVVVLTPA 188

Query: 225 LDGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGP--VES 284
             GPA  AG R GD I+ V+G   +G S ++VS LLQG  ++ V V V H    P    +
Sbjct: 189 PGGPAEKAGARAGDVIVTVDGTAVKGLSLYDVSDLLQGEADSQVEV-VLHAPGAPSNTRT 248

Query: 285 IQVQRQVLARTPVFYRLEQTDVASS---------VGYIRLKEFNALAKKDLVTAMKRLEA 344
           +Q+ RQ +   PV +       A++         +GY+RL  FN+        A   L  
Sbjct: 249 LQLTRQKVTINPVTFTTCSNVAAAALPPGAAKQQLGYVRLATFNSNTTAAAQQAFTELSK 308

Query: 345 MGASYFILDLRDNLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQYQKTVVADTG------ 404
            G +  +LD+R+N GGL  AG+ +A++ ++ G  V+     D Q  + + +  G      
Sbjct: 309 QGVAGLVLDIRNNGGGLFPAGVNVARMLVDRGDLVLIA---DSQGIRDIYSADGNSIDSA 368

Query: 405 -PLV--------TAPVVVASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKY 455
            PLV        +A  V+A +L D+ R ++ GERT+GKGLIQ+V +L DGSGVAVTV +Y
Sbjct: 369 TPLVVLVNRGTASASEVLAGALKDSKRGLIAGERTFGKGLIQTVVDLSDGSGVAVTVARY 428

BLAST of HG10021331 vs. ExPASy TrEMBL
Match: A0A5A7UR40 (Carboxyl-terminal-processing peptidase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G00140 PE=3 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 4.1e-229
Identity = 419/475 (88.21%), Postives = 441/475 (92.84%), Query Frame = 0

Query: 1   MANVI-FFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLS 60
           MA+VI FFF+SFSP H +F   LP  PPF++FVNS+KK+F+NSLNLVDKTL+GALSGVLS
Sbjct: 1   MASVIFFFFSSFSPSHLHFPSLLPMPPPFISFVNSDKKSFSNSLNLVDKTLVGALSGVLS 60

Query: 61  FGLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCID-DELHEFGSSETVSPPAT 120
           FGLLLHSPSSVALD+S+V+FFSLS+DSLPSSSL +SS SC+D D+LHEFGSSET SPPAT
Sbjct: 61  FGLLLHSPSSVALDHSAVDFFSLSSDSLPSSSLFDSSTSCLDEDQLHEFGSSETGSPPAT 120

Query: 121 NEDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD 180
           NEDIVREAWEIVNDSFLDA R+RWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD
Sbjct: 121 NEDIVREAWEIVNDSFLDAGRNRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD 180

Query: 181 PYTRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEI 240
           PYTRFLPPAEFSKMARYDMTGIGINLREVPDD+GGMKIKVLGLLLDGPAHLAG+RQGDEI
Sbjct: 181 PYTRFLPPAEFSKMARYDMTGIGINLREVPDDNGGMKIKVLGLLLDGPAHLAGVRQGDEI 240

Query: 241 LAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE 300
           LAVNGV+A GKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE
Sbjct: 241 LAVNGVEAGGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE 300

Query: 301 QTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL 360
           Q D  SSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL
Sbjct: 301 QMDANSSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL 360

Query: 361 FLNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAV 420
           FLNEGSTVIYTVGRDPQYQKTVVAD GPLV APVV             VASSLHDNC+AV
Sbjct: 361 FLNEGSTVIYTVGRDPQYQKTVVADAGPLVKAPVVVLVNKRTASASEIVASSLHDNCKAV 420

Query: 421 LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSS 458
           LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF+NFP  S
Sbjct: 421 LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQNFPAWS 475

BLAST of HG10021331 vs. ExPASy TrEMBL
Match: A0A1S3B8R9 (carboxyl-terminal-processing peptidase 1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487412 PE=3 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 4.1e-229
Identity = 419/475 (88.21%), Postives = 441/475 (92.84%), Query Frame = 0

Query: 1   MANVI-FFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLS 60
           MA+VI FFF+SFSP H +F   LP  PPF++FVNS+KK+F+NSLNLVDKTL+GALSGVLS
Sbjct: 1   MASVIFFFFSSFSPSHLHFPSLLPMPPPFISFVNSDKKSFSNSLNLVDKTLVGALSGVLS 60

Query: 61  FGLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCID-DELHEFGSSETVSPPAT 120
           FGLLLHSPSSVALD+S+V+FFSLS+DSLPSSSL +SS SC+D D+LHEFGSSET SPPAT
Sbjct: 61  FGLLLHSPSSVALDHSAVDFFSLSSDSLPSSSLFDSSTSCLDEDQLHEFGSSETGSPPAT 120

Query: 121 NEDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD 180
           NEDIVREAWEIVNDSFLDA R+RWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD
Sbjct: 121 NEDIVREAWEIVNDSFLDAGRNRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGD 180

Query: 181 PYTRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEI 240
           PYTRFLPPAEFSKMARYDMTGIGINLREVPDD+GGMKIKVLGLLLDGPAHLAG+RQGDEI
Sbjct: 181 PYTRFLPPAEFSKMARYDMTGIGINLREVPDDNGGMKIKVLGLLLDGPAHLAGVRQGDEI 240

Query: 241 LAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE 300
           LAVNGV+A GKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE
Sbjct: 241 LAVNGVEAGGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLE 300

Query: 301 QTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL 360
           Q D  SSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL
Sbjct: 301 QMDANSSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKL 360

Query: 361 FLNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAV 420
           FLNEGSTVIYTVGRDPQYQKTVVAD GPLV APVV             VASSLHDNC+AV
Sbjct: 361 FLNEGSTVIYTVGRDPQYQKTVVADAGPLVKAPVVVLVNKRTASASEIVASSLHDNCKAV 420

Query: 421 LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSS 458
           LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF+NFP  S
Sbjct: 421 LVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQNFPAWS 475

BLAST of HG10021331 vs. ExPASy TrEMBL
Match: A0A0A0LUI0 (PDZ domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G458990 PE=3 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 3.9e-227
Identity = 417/474 (87.97%), Postives = 436/474 (91.98%), Query Frame = 0

Query: 1   MANVIFFFNSFSPFHFNF---LPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSF 60
           MA+VIFFF+SFSP H +F   LP  PPF +FVNSEKK+F+NSLNLVDKTLIGA+SGVLSF
Sbjct: 1   MASVIFFFSSFSPSHLHFPSLLPMPPPFPSFVNSEKKSFSNSLNLVDKTLIGAISGVLSF 60

Query: 61  GLLLHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCID-DELHEFGSSETVSPPATN 120
           GLLLHSPSSVALDYS+V+FFSLS+ SLPSSSLS+SSASCID DELHEFGSSETVS PATN
Sbjct: 61  GLLLHSPSSVALDYSAVDFFSLSSHSLPSSSLSDSSASCIDEDELHEFGSSETVSSPATN 120

Query: 121 EDIVREAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP 180
           EDIVREAWEIVNDSFLD+ R+RWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP
Sbjct: 121 EDIVREAWEIVNDSFLDSGRNRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDP 180

Query: 181 YTRFLPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEIL 240
           YTRFLPPAEFSKMARYDMTGIGINLREVPDD+G MKIKVLGLLLDGPAHLAG+RQGDEI+
Sbjct: 181 YTRFLPPAEFSKMARYDMTGIGINLREVPDDNGVMKIKVLGLLLDGPAHLAGVRQGDEIV 240

Query: 241 AVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQ 300
           AVNGVDA GKSAFEVSSLLQGPNETLVTVKV HGNCGPVESIQVQRQVLARTPVFYRLEQ
Sbjct: 241 AVNGVDAGGKSAFEVSSLLQGPNETLVTVKVMHGNCGPVESIQVQRQVLARTPVFYRLEQ 300

Query: 301 TDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLF 360
            D  SSVGYIRLKEFN LAKKDLVTA KRLEAMGASYFILDLRDNLGGLVQAGIEIAKLF
Sbjct: 301 MDATSSVGYIRLKEFNGLAKKDLVTATKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLF 360

Query: 361 LNEGSTVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAVL 420
           LNEGSTVIYTVGRDPQYQKTVVAD  PLV APVV             VASSLHDNC+AVL
Sbjct: 361 LNEGSTVIYTVGRDPQYQKTVVADAEPLVKAPVVVLVNKRTASASEIVASSLHDNCKAVL 420

Query: 421 VGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSS 458
           VGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF++FP  S
Sbjct: 421 VGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQSFPAWS 474

BLAST of HG10021331 vs. ExPASy TrEMBL
Match: A0A2N9EJF2 (PDZ domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2825 PE=3 SV=1)

HSP 1 Score: 796.2 bits (2055), Expect = 1.1e-226
Identity = 457/754 (60.61%), Postives = 527/754 (69.89%), Query Frame = 0

Query: 20  PKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLLLHSPSSVALDYSSVEFFSL 79
           P SP  + F  +      NS+N   KT+I ALSG LSFGL+  SP S+AL+   V+    
Sbjct: 10  PLSPILINFTTT---THNNSINWTKKTIITALSGALSFGLVFSSPWSIALESPIVQ---- 69

Query: 80  SADSLPSSSLSNSSASCIDDELHEFGSSETVSPPATNEDIVREAWEIVNDSFLDASRHRW 139
                  S  S SS  C +DE  +   +ET    ATNE IV EAWEIVNDSF+D  RHRW
Sbjct: 70  -------SPPSPSSEYCREDE--QLIKAETGPEVATNEGIVEEAWEIVNDSFIDTGRHRW 129

Query: 140 SPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRFLPPAEFSKMARYDMTGIGI 199
           SP+ W+Q+++DI N SI TRSKAH+II+RMLASLGDPYTRFL P EFSKMARYDM+GIG+
Sbjct: 130 SPQTWQQKKQDILNTSIPTRSKAHDIIKRMLASLGDPYTRFLAPEEFSKMARYDMSGIGL 189

Query: 200 NLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPN 259
           NLREVP+D+GG+K+KVLGLLLDGPA  AG+RQGDE+LAVNGVD RGKSAFEVSSLLQGPN
Sbjct: 190 NLREVPEDNGGVKLKVLGLLLDGPAQSAGVRQGDEVLAVNGVDVRGKSAFEVSSLLQGPN 249

Query: 260 ETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTD-VASSVGYIRLKEFNALAKKD 319
           ET VT+KVKHGNCGP++SI+VQRQ++AR+PVFYRLE+ D   +SVGY+RLKEFNALA+KD
Sbjct: 250 ETFVTIKVKHGNCGPIQSIEVQRQLVARSPVFYRLEKIDNGTTSVGYMRLKEFNALARKD 309

Query: 320 LVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQYQKTVV 379
           LV AMKRL+ MGASYFILDLRDNLGGLVQAGIEI+KLFLNEG TVIYTVGRD QYQ TVV
Sbjct: 310 LVIAMKRLQDMGASYFILDLRDNLGGLVQAGIEISKLFLNEGETVIYTVGRDMQYQNTVV 369

Query: 380 ADTGPLVTAPVV-------------VASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSG 439
           ADT PLVT PVV             VASSLHDNCRAVLVGERT+GKGLIQSVFELHDGSG
Sbjct: 370 ADTTPLVTVPVVVLVNNRTASASEIVASSLHDNCRAVLVGERTFGKGLIQSVFELHDGSG 429

Query: 440 VAVTVGKYVTPNHKDINGNGIEPDFKNFPGSSFAKKSPIQEENQENIPLQNDEDRTIFSP 499
           V VTVGKYVTPNH+DINGNGIEPDF++ PGSS A+K P  E  Q+N  L+N E+  +FS 
Sbjct: 430 VVVTVGKYVTPNHRDINGNGIEPDFRSLPGSSVAEK-PHSEAIQQNKLLRNQENHMLFSS 489

Query: 500 SVHFTQLDDTTIVNPTTPPG---------GTPISPPKSVPNFNPN--------------- 559
           SV  TQ D   + NPTTP            +P SP  + P  NP                
Sbjct: 490 SVSITQHDTIPVFNPTTPGSAIPTPIVNPNSPPSPTSTEPTTNPTPPRITPMTPTSPTTT 549

Query: 560 -------------------------VNPTAVQGNSGGGSWCIASSAASPTALQVALDYAC 619
                                      PT     S GGSWCIA+  A  TALQVA+DYAC
Sbjct: 550 PTTPTMTPTTPMTPTITPTTPTTTPTTPTTTTPGSSGGSWCIANPTAPETALQVAIDYAC 609

Query: 620 GYGGADCSPIQPGGSCYDPNTVKDHASYAFNDYYQKNPAATSCVFGGTAQLVSTDPSNGN 679
           GYGGADCS IQPG SCYDPNT++DHAS+AFN YYQKNPA+TSCVFGGTAQL STDPS GN
Sbjct: 610 GYGGADCSAIQPGASCYDPNTLRDHASFAFNSYYQKNPASTSCVFGGTAQLSSTDPSTGN 669

Query: 680 CHYATSGAVPSPTPPANPNPTPPAPVIPTMPPPATTIPTMPPPATTNPTYTPIDPSIYGA 711
           CH+    +V S TP     P+  +P IPT P  +TT P  PP  T +    P  P  +G 
Sbjct: 670 CHFQ---SVSSTTP-----PSMSSPAIPTPPSTSTTTPLTPPTPTIS---IPSGPGGFG- 729

BLAST of HG10021331 vs. ExPASy TrEMBL
Match: A0A6J1CFB6 (carboxyl-terminal-processing peptidase 1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111010931 PE=3 SV=1)

HSP 1 Score: 793.5 bits (2048), Expect = 7.3e-226
Identity = 410/474 (86.50%), Postives = 428/474 (90.30%), Query Frame = 0

Query: 1   MANVIFFFNSFSPFHFNFLPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLL 60
           M NVIFFFNS SPFHF  LPK PPF+TF+NS+ +   NS  LVDKTLIGA+SGVLSFGLL
Sbjct: 1   MTNVIFFFNSLSPFHFQ-LPKPPPFITFINSQNRTSANSATLVDKTLIGAVSGVLSFGLL 60

Query: 61  LHSPSSVALDYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPPATNEDIV 120
            HSP SVALDYSSVE FSLSADSLPSSS S   +SC +DEL EFG+SET S PATNEDIV
Sbjct: 61  FHSPLSVALDYSSVETFSLSADSLPSSSPSTYDSSCNEDELREFGNSETGSSPATNEDIV 120

Query: 121 REAWEIVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRF 180
           REAWEIVNDSFLDA RHRWSPEAWKQ+Q+DI NISIQ+RSKAHNIIR+MLASLGDPYTRF
Sbjct: 121 REAWEIVNDSFLDAGRHRWSPEAWKQKQDDIMNISIQSRSKAHNIIRKMLASLGDPYTRF 180

Query: 181 LPPAEFSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNG 240
           LPPAEFSKMARYDMTGIGINLREVPDD+GG+KIKVLGLLLDGPAH AG+RQGDEILAVNG
Sbjct: 181 LPPAEFSKMARYDMTGIGINLREVPDDNGGIKIKVLGLLLDGPAHSAGVRQGDEILAVNG 240

Query: 241 VDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTDVA 300
           VDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVES+QVQRQVLARTPVFYRLEQ D A
Sbjct: 241 VDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESMQVQRQVLARTPVFYRLEQMDFA 300

Query: 301 SSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEG 360
           SSVGYIRLKEFNALAKKDLVTAMKRLE MGASYFILDLRDNLGGLVQAGIEIAKLFLNEG
Sbjct: 301 SSVGYIRLKEFNALAKKDLVTAMKRLEDMGASYFILDLRDNLGGLVQAGIEIAKLFLNEG 360

Query: 361 STVIYTVGRDPQYQKTVVADTGPLVTAPVV-------------VASSLHDNCRAVLVGER 420
           STVIYTVGRDPQYQKTV+ADTGPLVTAPVV             VASSLHDNCRAVLVGER
Sbjct: 361 STVIYTVGRDPQYQKTVIADTGPLVTAPVVVLVNQKTASASEIVASSLHDNCRAVLVGER 420

Query: 421 TYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPGSSFAKK 462
           TYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDF+NFP  S   K
Sbjct: 421 TYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFQNFPAWSDVTK 473

BLAST of HG10021331 vs. TAIR 10
Match: AT5G46390.2 (Peptidase S41 family protein )

HSP 1 Score: 504.6 bits (1298), Expect = 1.3e-142
Identity = 274/463 (59.18%), Postives = 340/463 (73.43%), Query Frame = 0

Query: 12  SPFHFNFLPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLLLHSP-SSVAL- 71
           SP    F+P+ PP   F      +++    ++ K++IG L+G LS  L+  SP SSVA  
Sbjct: 16  SPSTPQFIPELPPPSQF------DYSGLTKILKKSVIGTLTGALSLTLVFSSPISSVAAT 75

Query: 72  --DYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPP--ATNEDIVREAWE 131
              Y SV   S S +S   +   ++   C ++E  +    +    P   TNE IV EAWE
Sbjct: 76  NDPYLSVNPPSSSFES-SLNHFDSAPEDCPNEEEADTEIQDDDIEPQLVTNEGIVEEAWE 135

Query: 132 IVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRFLPPAE 191
           IVN +FLD   H W+PE W+++++DI    I++RSKAH +I+ MLASLGD YTRFL P E
Sbjct: 136 IVNGAFLDTRSHSWTPETWQKQKDDILASPIKSRSKAHEVIKNMLASLGDQYTRFLSPDE 195

Query: 192 FSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNGVDARG 251
           FS+M++YD+TGIGINLREV D  G +K+KVLGL+LD  A +AG++QGDEILAVNG+D  G
Sbjct: 196 FSRMSKYDITGIGINLREVSDGGGNVKLKVLGLVLDSAADIAGVKQGDEILAVNGMDVSG 255

Query: 252 KSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTDVAS-SVG 311
           KS+FEVSSLLQGP++T V +KVKHG CGPV+S+++QRQV A+TPV YRLE+ D  + SVG
Sbjct: 256 KSSFEVSSLLQGPSKTFVVLKVKHGKCGPVKSLKIQRQVNAQTPVSYRLEKVDNGTVSVG 315

Query: 312 YIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEGSTVI 371
           YIRLKEFNALA+KDLV AMKRL   GASYF++DLRDNLGGLVQAGIE AKLFL+EG TVI
Sbjct: 316 YIRLKEFNALARKDLVIAMKRLLDKGASYFVMDLRDNLGGLVQAGIETAKLFLDEGDTVI 375

Query: 372 YTVGRDPQYQKTVVADTGPLVTAPV-------------VVASSLHDNCRAVLVGERTYGK 431
           YT GRDP+ QKTVV+D  PL+TAP+             +VAS+LHDNC+AVLVGERTYGK
Sbjct: 376 YTAGRDPEAQKTVVSDKKPLITAPLIVMVNNRTASASEIVASALHDNCKAVLVGERTYGK 435

Query: 432 GLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFP 455
           GLIQSV+EL DGSGV VT+GKYVTPNH DING GIEPDF+N P
Sbjct: 436 GLIQSVYELRDGSGVVVTIGKYVTPNHMDINGGGIEPDFRNLP 471

BLAST of HG10021331 vs. TAIR 10
Match: AT5G46390.1 (Peptidase S41 family protein )

HSP 1 Score: 401.4 bits (1030), Expect = 1.5e-111
Identity = 223/398 (56.03%), Postives = 287/398 (72.11%), Query Frame = 0

Query: 12  SPFHFNFLPKSPPFLTFVNSEKKNFTNSLNLVDKTLIGALSGVLSFGLLLHSP-SSVAL- 71
           SP    F+P+ PP   F      +++    ++ K++IG L+G LS  L+  SP SSVA  
Sbjct: 16  SPSTPQFIPELPPPSQF------DYSGLTKILKKSVIGTLTGALSLTLVFSSPISSVAAT 75

Query: 72  --DYSSVEFFSLSADSLPSSSLSNSSASCIDDELHEFGSSETVSPP--ATNEDIVREAWE 131
              Y SV   S S +S   +   ++   C ++E  +    +    P   TNE IV EAWE
Sbjct: 76  NDPYLSVNPPSSSFES-SLNHFDSAPEDCPNEEEADTEIQDDDIEPQLVTNEGIVEEAWE 135

Query: 132 IVNDSFLDASRHRWSPEAWKQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTRFLPPAE 191
           IVN +FLD   H W+PE W+++++DI    I++RSKAH +I+ MLASLGD YTRFL P E
Sbjct: 136 IVNGAFLDTRSHSWTPETWQKQKDDILASPIKSRSKAHEVIKNMLASLGDQYTRFLSPDE 195

Query: 192 FSKMARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEILAVNGVDARG 251
           FS+M++YD+TGIGINLREV D  G +K+KVLGL+LD  A +AG++QGDEILAVNG+D  G
Sbjct: 196 FSRMSKYDITGIGINLREVSDGGGNVKLKVLGLVLDSAADIAGVKQGDEILAVNGMDVSG 255

Query: 252 KSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQVQRQVLARTPVFYRLEQTDVAS-SVG 311
           KS+FEVSSLLQGP++T V +KVKHG CGPV+S+++QRQV A+TPV YRLE+ D  + SVG
Sbjct: 256 KSSFEVSSLLQGPSKTFVVLKVKHGKCGPVKSLKIQRQVNAQTPVSYRLEKVDNGTVSVG 315

Query: 312 YIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQAGIEIAKLFLNEGSTVI 371
           YIRLKEFNALA+KDLV AMKRL   GASYF++DLRDNLGGLVQAGIE AKLFL+EG TVI
Sbjct: 316 YIRLKEFNALARKDLVIAMKRLLDKGASYFVMDLRDNLGGLVQAGIETAKLFLDEGDTVI 375

Query: 372 YTVGRDPQYQKTVVADTGPLVTAPVVVASSLHDNCRAV 403
           YT GRDP+ QKTVV+D  PL+TAP++V     ++C+ V
Sbjct: 376 YTAGRDPEAQKTVVSDKKPLITAPLIVCD---ESCKPV 403

BLAST of HG10021331 vs. TAIR 10
Match: AT4G17740.1 (Peptidase S41 family protein )

HSP 1 Score: 188.7 bits (478), Expect = 1.6e-47
Identity = 127/375 (33.87%), Postives = 205/375 (54.67%), Query Frame = 0

Query: 109 TVSPPA---TNEDIV-REAWEIVNDSFLDASRHRWSPEAW-KQRQEDITNISIQTRSKAH 168
           T SPP+   T E+++  EAW  ++ +++D +   ++ ++W + R+  + N  + TR + +
Sbjct: 119 TDSPPSWGLTEENLLFLEAWRTIDRAYIDKT---FNGQSWFRYRETALRNEPMNTREETY 178

Query: 169 NIIRRMLASLGDPYTRFLPPAEFSKM---ARYDMTGIGINLREVPDDDG-GMKIKVLGLL 228
             I++M+A+L DP+TRFL P +F  +    +  +TG+G+++      DG    + V+   
Sbjct: 179 MAIKKMVATLDDPFTRFLEPGKFKSLRSGTQGAVTGVGLSIGYPTASDGPPAGLVVISAA 238

Query: 229 LDGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQ 288
             GPA+ AGI  GD I  ++       + ++ + +LQGP  + V + ++ G       + 
Sbjct: 239 PGGPANRAGILPGDVIQGIDNTTTETLTIYDAAQMLQGPEGSAVELAIRSG--PETRLLT 298

Query: 289 VQRQVLARTPVFYRLEQTDVASS----VGYIRLKEFNALAKKDLVTAMKRLEAMGASYFI 348
           + R+ ++  PV  RL +   + S    +GYI+L  FN  A   +  A++ L     + F+
Sbjct: 299 LTRERVSVNPVKSRLCELPGSGSNSPKIGYIKLTTFNQNASSAVREAIETLRGNNVNAFV 358

Query: 349 LDLRDNLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQ--YQ---KTVVADTGPL------ 408
           LDLRDN GG    GIEIAK +L++G  V     R  +  Y       +A + PL      
Sbjct: 359 LDLRDNSGGSFPEGIEIAKFWLDKGVIVYICDSRGVRDIYDTDGSNAIATSEPLAVLVNK 418

Query: 409 --VTAPVVVASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDIN 455
              +A  ++A +L DN RA++ GE TYGKG IQSVFEL DGSG+AVTV +Y TP H DI+
Sbjct: 419 GTASASEILAGALKDNKRALVYGEPTYGKGKIQSVFELSDGSGLAVTVARYETPAHTDID 478

BLAST of HG10021331 vs. TAIR 10
Match: AT4G17740.2 (Peptidase S41 family protein )

HSP 1 Score: 188.7 bits (478), Expect = 1.6e-47
Identity = 127/375 (33.87%), Postives = 205/375 (54.67%), Query Frame = 0

Query: 109 TVSPPA---TNEDIV-REAWEIVNDSFLDASRHRWSPEAW-KQRQEDITNISIQTRSKAH 168
           T SPP+   T E+++  EAW  ++ +++D +   ++ ++W + R+  + N  + TR + +
Sbjct: 109 TDSPPSWGLTEENLLFLEAWRTIDRAYIDKT---FNGQSWFRYRETALRNEPMNTREETY 168

Query: 169 NIIRRMLASLGDPYTRFLPPAEFSKM---ARYDMTGIGINLREVPDDDG-GMKIKVLGLL 228
             I++M+A+L DP+TRFL P +F  +    +  +TG+G+++      DG    + V+   
Sbjct: 169 MAIKKMVATLDDPFTRFLEPGKFKSLRSGTQGAVTGVGLSIGYPTASDGPPAGLVVISAA 228

Query: 229 LDGPAHLAGIRQGDEILAVNGVDARGKSAFEVSSLLQGPNETLVTVKVKHGNCGPVESIQ 288
             GPA+ AGI  GD I  ++       + ++ + +LQGP  + V + ++ G       + 
Sbjct: 229 PGGPANRAGILPGDVIQGIDNTTTETLTIYDAAQMLQGPEGSAVELAIRSG--PETRLLT 288

Query: 289 VQRQVLARTPVFYRLEQTDVASS----VGYIRLKEFNALAKKDLVTAMKRLEAMGASYFI 348
           + R+ ++  PV  RL +   + S    +GYI+L  FN  A   +  A++ L     + F+
Sbjct: 289 LTRERVSVNPVKSRLCELPGSGSNSPKIGYIKLTTFNQNASSAVREAIETLRGNNVNAFV 348

Query: 349 LDLRDNLGGLVQAGIEIAKLFLNEGSTVIYTVGRDPQ--YQ---KTVVADTGPL------ 408
           LDLRDN GG    GIEIAK +L++G  V     R  +  Y       +A + PL      
Sbjct: 349 LDLRDNSGGSFPEGIEIAKFWLDKGVIVYICDSRGVRDIYDTDGSNAIATSEPLAVLVNK 408

Query: 409 --VTAPVVVASSLHDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDIN 455
              +A  ++A +L DN RA++ GE TYGKG IQSVFEL DGSG+AVTV +Y TP H DI+
Sbjct: 409 GTASASEILAGALKDNKRALVYGEPTYGKGKIQSVFELSDGSGLAVTVARYETPAHTDID 468

BLAST of HG10021331 vs. TAIR 10
Match: AT3G57680.1 (Peptidase S41 family protein )

HSP 1 Score: 183.0 bits (463), Expect = 8.6e-46
Identity = 111/379 (29.29%), Postives = 202/379 (53.30%), Query Frame = 0

Query: 122 EAWEIVNDSFLDASRHRWSPEAW--KQRQEDITNISIQTRSKAHNIIRRMLASLGDPYTR 181
           EAW ++ ++F+D +   ++ + W  K +Q  +    +++   A+  ++ ML++LGDP+TR
Sbjct: 123 EAWGLIRETFVDPT---FNHQDWDFKLQQTMVEMFPLRSADAAYGKLKAMLSTLGDPFTR 182

Query: 182 FLPPAEFSKM---ARYDMTGIGINLREVPDDDGGMKIKVLGLLLDGPAHLAGIRQGDEIL 241
            + P E+      +  ++ G+G+ +   P       + V+  +   PA  AGI +G+E++
Sbjct: 183 LITPKEYQSFRIGSDGNLQGVGLFINSEPRTG---HLVVMSCVEGSPADRAGIHEGEELV 242

Query: 242 AVNGVDARGKSAFEVSSLLQGPNETLVTVKVKH----GNCGPVESIQVQRQVLARTPVFY 301
            +NG       +   +  L+G   T VT+K+K+    G    +  +++ R  +  +P+  
Sbjct: 243 EINGEKLDDVDSEAAAQKLRGRVGTFVTIKLKNVNGSGTDSGIREVKLPRDYIKLSPISS 302

Query: 302 RL----EQTDVASSVGYIRLKEFNALAKKDLVTAMKRLEAMGASYFILDLRDNLGGLVQA 361
            +          +  GY++L  F+  A  D+  A+  +E      +ILDLR+N GGLV+A
Sbjct: 303 AIIPHTTPDGRLAKTGYVKLTAFSQTAASDMENAVHEMENQDVQSYILDLRNNPGGLVRA 362

Query: 362 GIEIAKLFLNEGSTVIYTVGRD-------------PQYQKTVVADTGPLVTAPVVVASSL 421
           G+++A+L+L+   T++YT+ R+               +   VV       +A  ++A +L
Sbjct: 363 GLDVAQLWLDGDETLVYTIDREGVTSPINMINGHAVTHDPLVVLVNEGSASASEILAGAL 422

Query: 422 HDNCRAVLVGERTYGKGLIQSVFELHDGSGVAVTVGKYVTPNHKDINGNGIEPDFKNFPG 475
           HDN RA+LVG RT+GKG IQS+ EL+DGS + VTV KY++P+  +I+  GI PD +   G
Sbjct: 423 HDNGRAILVGNRTFGKGKIQSITELNDGSALFVTVAKYLSPSLHEIDQVGIAPDVQCTTG 482

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038894565.16.5e-23790.70carboxyl-terminal-processing peptidase 1, chloroplastic isoform X1 [Benincasa hi... [more]
XP_008443944.18.5e-22988.21PREDICTED: carboxyl-terminal-processing peptidase 1, chloroplastic [Cucumis melo... [more]
XP_004147402.18.0e-22787.97carboxyl-terminal-processing peptidase 1, chloroplastic isoform X1 [Cucumis sati... [more]
XP_022140206.11.5e-22586.50carboxyl-terminal-processing peptidase 1, chloroplastic [Momordica charantia][more]
XP_023542920.13.1e-22386.60carboxyl-terminal-processing peptidase 1, chloroplastic [Cucurbita pepo subsp. p... [more]
Match NameE-valueIdentityDescription
F4KHG61.8e-14159.18Carboxyl-terminal-processing peptidase 1, chloroplastic OS=Arabidopsis thaliana ... [more]
P427843.5e-6037.29Carboxyl-terminal-processing protease OS=Synechococcus sp. (strain ATCC 27264 / ... [more]
Q556692.3e-5937.07Carboxyl-terminal-processing protease OS=Synechocystis sp. (strain PCC 6803 / Ka... [more]
O236142.2e-4633.87Carboxyl-terminal-processing peptidase 2, chloroplastic OS=Arabidopsis thaliana ... [more]
O040734.2e-4533.33C-terminal processing peptidase, chloroplastic OS=Tetradesmus obliquus OX=3088 G... [more]
Match NameE-valueIdentityDescription
A0A5A7UR404.1e-22988.21Carboxyl-terminal-processing peptidase 1 OS=Cucumis melo var. makuwa OX=1194695 ... [more]
A0A1S3B8R94.1e-22988.21carboxyl-terminal-processing peptidase 1, chloroplastic OS=Cucumis melo OX=3656 ... [more]
A0A0A0LUI03.9e-22787.97PDZ domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G458990 PE=3 SV... [more]
A0A2N9EJF21.1e-22660.61PDZ domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2825 PE=3 ... [more]
A0A6J1CFB67.3e-22686.50carboxyl-terminal-processing peptidase 1, chloroplastic OS=Momordica charantia O... [more]
Match NameE-valueIdentityDescription
AT5G46390.21.3e-14259.18Peptidase S41 family protein [more]
AT5G46390.11.5e-11156.03Peptidase S41 family protein [more]
AT4G17740.11.6e-4733.87Peptidase S41 family protein [more]
AT4G17740.21.6e-4733.87Peptidase S41 family protein [more]
AT3G57680.18.6e-4629.29Peptidase S41 family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012946X8 domainSMARTSM00768X8_clscoord: 535..619
e-value: 7.8E-47
score: 171.6
IPR012946X8 domainPFAMPF07983X8coord: 536..605
e-value: 2.1E-20
score: 73.2
IPR005151Tail specific proteaseSMARTSM00245tsp_4coord: 273..453
e-value: 7.5E-57
score: 204.9
IPR005151Tail specific proteasePFAMPF03572Peptidase_S41coord: 303..451
e-value: 3.1E-31
score: 108.2
IPR001478PDZ domainSMARTSM00228pdz_newcoord: 195..271
e-value: 2.4E-13
score: 60.4
IPR001478PDZ domainPROSITEPS50106PDZcoord: 183..270
score: 10.8378
IPR036034PDZ superfamilyGENE3D2.30.42.10coord: 195..287
e-value: 2.1E-82
score: 279.1
IPR036034PDZ superfamilySUPERFAMILY50156PDZ domain-likecoord: 169..285
NoneNo IPR availableGENE3D3.30.750.44coord: 122..428
e-value: 2.1E-82
score: 279.1
NoneNo IPR availableGENE3D3.90.226.10coord: 288..450
e-value: 2.1E-82
score: 279.1
NoneNo IPR availableGENE3D1.20.58.1040coord: 533..625
e-value: 3.9E-10
score: 42.0
NoneNo IPR availablePANTHERPTHR32060:SF22CARBOXYL-TERMINAL-PROCESSING PEPTIDASE 1, CHLOROPLASTICcoord: 89..455
NoneNo IPR availablePANTHERPTHR32060TAIL-SPECIFIC PROTEASEcoord: 89..455
NoneNo IPR availableCDDcd00988PDZ_CTP_proteasecoord: 194..283
e-value: 7.73036E-19
score: 79.5772
IPR041489PDZ domain 6PFAMPF17820PDZ_6coord: 221..268
e-value: 2.7E-10
score: 40.0
IPR004447C-terminal-processing peptidase S41ATIGRFAMTIGR00225TIGR00225coord: 155..458
e-value: 6.0E-66
score: 220.7
IPR004447C-terminal-processing peptidase S41ACDDcd07560Peptidase_S41_CPPcoord: 301..449
e-value: 6.72391E-57
score: 190.701
IPR029045ClpP/crotonase-like domain superfamilySUPERFAMILY52096ClpP/crotonasecoord: 119..451

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021331.1HG10021331.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004175 endopeptidase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008236 serine-type peptidase activity