HG10023023 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023023
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Peptidase_S9 domain-containing protein
Location: Chr05: 30540419 .. 30545772 (+)
RNA-Seq Expression: HG10023023
Synteny: HG10023023
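The Location field in the overview packs chromosome, start, end, and strand into one string. The following is a minimal sketch of how it could be split apart and turned into a span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function name `parse_location` is illustrative only.

```python
import re

# Location string as shown in the overview above.
LOCATION = "Chr05: 30540419 .. 30545772 (+)"

def parse_location(text):
    """Split 'Chr: start .. end (strand)' into (chrom, start, end, strand)."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    m = re.fullmatch(pattern, text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location(LOCATION)

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the gene spans 5,354 bp on the forward strand of Chr05. If the coordinates were 0-based and half-open instead, the span would be `end - start`.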
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTCATCTAGGTTCCGCAACCTTGTTCATCTCAACGCGATCGTTTCTGAAGATGGCGGAGGCGGTGCTAGCGGCGGTGCCGGAGGCGGAGGCTCCAATGGCTCCGTTTCGTCTTCTTCAGCTGTAGTCTCTACTGAAGACGATGGTACTGGAAAAGAGTTTCGTAATTTAGTATTGATTTATTTTATTTTATTTTATCTTATCCTATTTGTTCTCTTGTGGCTGTTTACTTCCGATTGTTTATTTAAATAAGTGCTTTGAAGATGTTTACTTGAACTGGTCAGAGAATTGGTGGGAATGATTTTATTATTCTTAGATTATGAACTTCAGCAAATCATATCGGATTTTATTGCTCTTCTACGATCGTGATTCGTACCTACCCGATGCATTAAGATTTTCATTTCGCGCCTTTTCTCTTGTGGAAGCAATTTACTATTTATCCGTGTGTATATATTTATATACATGTATGCACGTATGTATTTATATTTCTATTACATGAGCTACCTGTATTAAGAAGTGTTTAGTTGAAATTCAGCGTGTTCCCACCCGTGATTTTTAAGCTTTGTAGCTAAGTTTGCTCGTGGCTTGCATAGCGTAATCAAATAATTTCACAATATTGCTCGAACTTCTATTTTATGCTTGCAATGGTGTTTCTGACTCGTGATCCTTTAATTCAGAGAATTCAGTTCTGGGGGTTGGATATCGTCTTCCTCCAGCTGAAATCAGAGACATTGTTGATGCTCCACCGCTTCCCATATTGTCATTCTCGCCATACAGGGATAAAATATTGTTCCTCAAGCGGAGGTCATTGCCTCCAATATCAGAACTTGCAAAACCAGAAGAAAAGTTGGCTGGTATCCGTATTGATGGACAGTGCAATTGTAGAAGTCGAATGTCGTTCTACACTGGAATAGGGATTCATCAATTGATGCCTGATGATTCCCTAGGTCCAGAGAAGGAGGTACATGGCTTACCAGATGGTGCTAAGATCAATTTCATTACCTGGTCACCTGATGGCCGTCATTTATCTTTCAGTGTTCGAGTTGACGAGGAAGATGGCAGTAGCGGTAAGCTTAGAGTTTGGGTTGCTGATGTGGAAACTGGGAAAGCTAGACCTTTGTTTCAGAATACAGACATCTATGTAAATGCAGTTTTTGAGAATTTTGTTTGGGTAAACGATTCTACTTTGTTAGTTTGCACCATTCCCTCCTCTCGTGGAGATCCACCAAAGAAACCTTTGGTTCCTCATGGTCCAAAAGTTCAATCTAATGAGCAGAAGAACATCATCCAAGCTAGAACCTTTCAGGATTTGCTGAAGGACAAATATGATGAGGATTTGTTCGACTACTATGCCACTACCCAGCTTGTTTTGGGTTCATTGGATGGAACAGTTGAGGCATTTGGCACACCAGCTATATATACGTCGCTGGACCCTTCCCCTGATCACAAATATATTTTGATTAGTACTATTCACCGGCCGTATTCTTTTATTGTTCCATGTGGAAGATTTCCTAAAAAGGTAGCTGTGTGGACAACTGACGGCAAGTTTGTTAGGGAGCTTTGTGATTTGCCTCTTGCTGAGGATATCCCCATTGCATTCAACAGTGTGAGAAAGGGGATGCGTTCCATCAATTGGAGAGCAGATAAGCCATCGACACTCTACTGGTGTGCATTTGAGATCCTAATATCATTCTTTTCTGCTCTTTGCATATTTATGGACATCCCTTTTCTATTTTCTTTTTTATTTATATTTAAATTTCTGTGTGTCACGTGTCACCATCTATGGAGGATAGTTAGTTCATGTTACTAATTACTGTTAGCATTTGCACTTGTCTTAATGACTTTTCTAAATTTTTAGGGTGGAAACTCAAGATGGTGGAGATGCCAGAATCGAGGTTTCTCCTCGTGACATTGTTTATACACAATCTGCTGAACCACTGGAAAGTGAACAGCCAGAGATACTGCATAAACTTGATCTTCGTTATGGGTATTTTTTC
CTTCCTCTGCTGTTTATCCTGGTCATTATTTTAGATAGCCGCTTATAAAGGTGAAAATGGCCTTTTACTACGACATTGTTCAAATTTAACTGAACCTTATTTTATTTGTATCGTAATCACATATTCGGTTAAACATGTTCCCTGTATATACAGTACCCTGGAGATCTTGATCTCTGCAAAAACTGTTAGTATGTTGTTCAAGGTGGAAACTGGTCCACCTTCGGGAACTGAAATCATACATATCTTGCCTATGCTTTTCCGTATTAATACTCTTATGACATAATTTCGACAAAGTCTAAAACGTGTCTTTCCAACTTCTGCTTCAAGTATTCCTATATCTTTTCATACCATAACTACTAATTTTCTTTTACAGATCTAACTCAAATCCTCAACCTCGAAACTTTGTTTTAATGTGATTCTCCTGTTTATTGCTGCAGAGGAATATCTTGGTGTGATGACTCACTGGCTCTTGTTTATGAATCTTGGTATAAGACGCGCAAAATACGAACGTGGGTAATCTCTCCTGGTTCTAAAGAGGACAATCCTCGCATTCTATTTGATAGGTCATCAGAAGATGTGTATTCAGACCCTGGGTCACCGATGCTACGGAGGACTCCTCTTGGGACTTACGTAATTGCAAAATTAAAGAAGGATAATTATGAAGGCACATATGTTCTACTCAATGGTAGTGGTGCTACTCCGGAAGGGAACATCCCTTTTATTGATTTATTTGACATGTAAGTGTTCTGATTTAACCATATCTTGTTTAGTCCCACTTGTTAGATGTGCTATTAATGAAACACCGGTTCTGAAAATATTCCTTCAGAAACACAGGCAGCAAAGAAAGAATATGGAAGAGCAACAAAGAAACTTATTATGAGAGTGTTGTGGCTTTAATGTCTGATCAGAAAGAAGGAGATCTAAATATTGATGAGCTGAAATTTTTGACTTCCAAAGAATCCAAAACTGAAAATACTCAGTACTACATTCTGAGGTGGCCTGGTAAGAAAGCAAGTCAAATCACAAAATTTCCTCATCCATATCCACAGCTGGCATCACTGCAGAAAGAGATGGTTAGATACGAGAGAAAAGACGGAGTTCAACTGACAGCCACACTATATCTACCCCCAAACTACGATCCAGCAAAAGATGGCCCTCTTCCCTGCTTGATCTGGTCTTACCCTGGAGAATTCAAAAGCAAAGATGCAGCTGGACAAGTTCGTGGTTCCCCCAATGAGTTTGCTAGTATAGGTCCAACATCTGCTCTTCTTTGGTTGGCTCGCAGGTAGAAGTGCCGCTTCAAGATTATTGCCTTTAAATTATTTTTGGGTCATCGTTATTTATTTATTTTTATTTTTATGAATGGATCATAGTTTATTTTGTTCTGAACATAGTTCTTCAAGTCTTTCAGGTTTGCCATTTTGGCTGGACCAACAATACCTATCATTGGTGAAGGTAACGAGGAGGCAAATGATAGGTAATTGGTAGCATATTTACTACAAAATTGGTTCCTTTTCCCCGTTCCATCCTCAATGTGTAGTTTGTTCCTGAAAGCTTATATTGTTTTTGTCTATATTTTTTTGACATTGATGCAAAAACTAAAGCTTTTACCTGCAACTGGGAAGCATGCTGCAATGAGTAATTCTTGGATGTTTGGATGTGTACTTACTACTTACGTACATTCAAGTTGTCTCTAACTATAGTAAGGTTAAAACTAGAGATAGGTTTTGCATAGCTAAGAAATATCCTATAACTGACAATTAATTTATCGAGCTCCTGTTCTATCATCTCTGTCTAGTATATGTACAGGTCATTGAACTACAAACTACTTCACTGTCGTATAACTGTATAAATTGCTTTATATATCCTCGAAATTGGTTGTATGTACATGCTGTATCCGAGTGTCTTACAATGAACTTGCCATCTTATCCAGAAAAACAATGAACTTACCCAAAAAAAAAAATGTCTCATAATGAAAGTAGTTACATGCAGTTATTGCA
ATATTTGAGAATTGAGATAGTATATATCGTTCTCAAAATTGAGGACCTGAGGTTCTAGACTAATGGACCCTCTTGCAGCACTCTTTACTACTAATCCCCCCCGCCCCCTCCACACGCACACATTCCAATTCATTTTATATCATTTAAGTATATTGCAGTGCACGACTTAGTCTATAGGCCTTTCTCTATACATTTTTATCTATCTCTACCTGTTTGAGTATTTTCATTCAGTGCTTTCTTTGAATAATCGAATTTTGTAAAAAAAAAAAAAAACTAAAACATGCATATTTTACTAGATATGTAGAGCAATTGGTTGCGAGTGCAGAGGCTGCTGTAGAGGAGGTCATTAAACGGGGGGTGAGTGTTGTGTTTTCTTTATAAATATTTCTTGGTGTATAACATAGTTAACCATGTTGATATGCCACCATGACTAGGTTGCTCATCCTAATAAGATTGCTGTTGGTGGACATTCATATGGTGCGTTTATGACTGCAAACCTTCTGGCTCATGCTCCCCATCTTTTTTGCTGTGGAATTGCTCGCTCCGGAGCCTATAACAGAACACTGACCCCTTTTGGCTTTCAGGTAGTTTCATATTTAAATGCTCAATTTTCCTCTAAACATTGACCATACGAAAGTGATATGCCTAAAATTCCCATGCTTAAACGTCTGTTCCAACCTTTCCACACTTTTCCACTCTACCTATCATTTGTTCTGCTAATCTGGGTATACTTTATTGTCATAGAATGAGGATAGAACTCTTTGGGAAGCAACCAACACATATGTAGAGATGAGTCCATTTATATCAGCAAATAAAATCAAGAAGCCAATTTTACTCATTCATGGCGAAGAAGACAACAACCCAGGAACATTACCCATGCAGGTATGGTCTGGAATTTTTAGCTGTTTGGTTCTGTTTCCTTCAACCAATAAATAACGAAATAATAACCTAAGAGAACATCTTAATTTCACCTTGCCGAGCTTTAAGCAATATTATGCCTACAGTTTCATTTTTAACTTGTATTTTGCATGATTTGCAGTCCGATCGATTTTTCAATGCCTTGAAAGGCCATGGAGCATTATGTCGCCTTGTGGTTCTTCCCTTCGAAAGCCATGGTTATTCTTCACGAGAGAGTATCATGCATGTCCTCTGGGAAACTGATCGATGGCTAGAGAAATACGGTTCCTCTAACGCTTCTGATTTAAGTCAAGATGTGGATAAAATTAAAGAGGAAGGCAATGGAACAGCAGATTCCGCAGGGAAAGTTGTTGCTGGTTCTGGAGGCGGTGGCACAGAGAGTCCAAGTCCTGATAATGATGGATTTTACTCTATTCAAAGATCATTGTTGTGGTAA

mRNA sequence

ATGGCCTCATCTAGGTTCCGCAACCTTGTTCATCTCAACGCGATCGTTTCTGAAGATGGCGGAGGCGGTGCTAGCGGCGGTGCCGGAGGCGGAGGCTCCAATGGCTCCGTTTCGTCTTCTTCAGCTGTAGTCTCTACTGAAGACGATGAGAATTCAGTTCTGGGGGTTGGATATCGTCTTCCTCCAGCTGAAATCAGAGACATTGTTGATGCTCCACCGCTTCCCATATTGTCATTCTCGCCATACAGGGATAAAATATTGTTCCTCAAGCGGAGGTCATTGCCTCCAATATCAGAACTTGCAAAACCAGAAGAAAAGTTGGCTGGTATCCGTATTGATGGACAGTGCAATTGTAGAAGTCGAATGTCGTTCTACACTGGAATAGGGATTCATCAATTGATGCCTGATGATTCCCTAGGTCCAGAGAAGGAGGTACATGGCTTACCAGATGGTGCTAAGATCAATTTCATTACCTGGTCACCTGATGGCCGTCATTTATCTTTCAGTGTTCGAGTTGACGAGGAAGATGGCAGTAGCGGTAAGCTTAGAGTTTGGGTTGCTGATGTGGAAACTGGGAAAGCTAGACCTTTGTTTCAGAATACAGACATCTATGTAAATGCAGTTTTTGAGAATTTTGTTTGGGTAAACGATTCTACTTTGTTAGTTTGCACCATTCCCTCCTCTCGTGGAGATCCACCAAAGAAACCTTTGGTTCCTCATGGTCCAAAAGTTCAATCTAATGAGCAGAAGAACATCATCCAAGCTAGAACCTTTCAGGATTTGCTGAAGGACAAATATGATGAGGATTTGTTCGACTACTATGCCACTACCCAGCTTGTTTTGGGTTCATTGGATGGAACAGTTGAGGCATTTGGCACACCAGCTATATATACGTCGCTGGACCCTTCCCCTGATCACAAATATATTTTGATTAGTACTATTCACCGGCCGTATTCTTTTATTGTTCCATGTGGAAGATTTCCTAAAAAGGTAGCTGTGTGGACAACTGACGGCAAGTTTGTTAGGGAGCTTTGTGATTTGCCTCTTGCTGAGGATATCCCCATTGCATTCAACAGTGTGAGAAAGGGGATGCGTTCCATCAATTGGAGAGCAGATAAGCCATCGACACTCTACTGGGTGGAAACTCAAGATGGTGGAGATGCCAGAATCGAGGTTTCTCCTCGTGACATTGTTTATACACAATCTGCTGAACCACTGGAAAGTGAACAGCCAGAGATACTGCATAAACTTGATCTTCGTTATGGAGGAATATCTTGGTGTGATGACTCACTGGCTCTTGTTTATGAATCTTGGTATAAGACGCGCAAAATACGAACGTGGGTAATCTCTCCTGGTTCTAAAGAGGACAATCCTCGCATTCTATTTGATAGGTCATCAGAAGATGTGTATTCAGACCCTGGGTCACCGATGCTACGGAGGACTCCTCTTGGGACTTACGTAATTGCAAAATTAAAGAAGGATAATTATGAAGGCACATATGTTCTACTCAATGGTAGTGGTGCTACTCCGGAAGGGAACATCCCTTTTATTGATTTATTTGACATAAACACAGGCAGCAAAGAAAGAATATGGAAGAGCAACAAAGAAACTTATTATGAGAGTGTTGTGGCTTTAATGTCTGATCAGAAAGAAGGAGATCTAAATATTGATGAGCTGAAATTTTTGACTTCCAAAGAATCCAAAACTGAAAATACTCAGTACTACATTCTGAGGTGGCCTGGTAAGAAAGCAAGTCAAATCACAAAATTTCCTCATCCATATCCACAGCTGGCATCACTGCAGAAAGAGATGGTTAGATACGAGAGAAAAGACGGAGTTCAACTGACAGCCACACTATATCTACCCCCAAACTACGATCCAGCAAAAGATGGCCCTCTTCCCTGCTTGATCTGGTCTTACCCTGGAGAATTCAAAAGCAAAGATGCAGCTGGACAAGTTCGTGGTTCCCCCAATGAGTTTGCTAGTATAGGTCCAACATC
TGCTCTTCTTTGGTTGGCTCGCAGGTTTGCCATTTTGGCTGGACCAACAATACCTATCATTGGTGAAGGTAACGAGGAGGCAAATGATAGATATGTAGAGCAATTGGTTGCGAGTGCAGAGGCTGCTGTAGAGGAGGTCATTAAACGGGGGGTTGCTCATCCTAATAAGATTGCTGTTGGTGGACATTCATATGGTGCGTTTATGACTGCAAACCTTCTGGCTCATGCTCCCCATCTTTTTTGCTGTGGAATTGCTCGCTCCGGAGCCTATAACAGAACACTGACCCCTTTTGGCTTTCAGAATGAGGATAGAACTCTTTGGGAAGCAACCAACACATATGTAGAGATGAGTCCATTTATATCAGCAAATAAAATCAAGAAGCCAATTTTACTCATTCATGGCGAAGAAGACAACAACCCAGGAACATTACCCATGCAGTCCGATCGATTTTTCAATGCCTTGAAAGGCCATGGAGCATTATGTCGCCTTGTGGTTCTTCCCTTCGAAAGCCATGGTTATTCTTCACGAGAGAGTATCATGCATGTCCTCTGGGAAACTGATCGATGGCTAGAGAAATACGGTTCCTCTAACGCTTCTGATTTAAGTCAAGATGTGGATAAAATTAAAGAGGAAGGCAATGGAACAGCAGATTCCGCAGGGAAAGTTGTTGCTGGTTCTGGAGGCGGTGGCACAGAGAGTCCAAGTCCTGATAATGATGGATTTTACTCTATTCAAAGATCATTGTTGTGGTAA

Coding sequence (CDS)

ATGGCCTCATCTAGGTTCCGCAACCTTGTTCATCTCAACGCGATCGTTTCTGAAGATGGCGGAGGCGGTGCTAGCGGCGGTGCCGGAGGCGGAGGCTCCAATGGCTCCGTTTCGTCTTCTTCAGCTGTAGTCTCTACTGAAGACGATGAGAATTCAGTTCTGGGGGTTGGATATCGTCTTCCTCCAGCTGAAATCAGAGACATTGTTGATGCTCCACCGCTTCCCATATTGTCATTCTCGCCATACAGGGATAAAATATTGTTCCTCAAGCGGAGGTCATTGCCTCCAATATCAGAACTTGCAAAACCAGAAGAAAAGTTGGCTGGTATCCGTATTGATGGACAGTGCAATTGTAGAAGTCGAATGTCGTTCTACACTGGAATAGGGATTCATCAATTGATGCCTGATGATTCCCTAGGTCCAGAGAAGGAGGTACATGGCTTACCAGATGGTGCTAAGATCAATTTCATTACCTGGTCACCTGATGGCCGTCATTTATCTTTCAGTGTTCGAGTTGACGAGGAAGATGGCAGTAGCGGTAAGCTTAGAGTTTGGGTTGCTGATGTGGAAACTGGGAAAGCTAGACCTTTGTTTCAGAATACAGACATCTATGTAAATGCAGTTTTTGAGAATTTTGTTTGGGTAAACGATTCTACTTTGTTAGTTTGCACCATTCCCTCCTCTCGTGGAGATCCACCAAAGAAACCTTTGGTTCCTCATGGTCCAAAAGTTCAATCTAATGAGCAGAAGAACATCATCCAAGCTAGAACCTTTCAGGATTTGCTGAAGGACAAATATGATGAGGATTTGTTCGACTACTATGCCACTACCCAGCTTGTTTTGGGTTCATTGGATGGAACAGTTGAGGCATTTGGCACACCAGCTATATATACGTCGCTGGACCCTTCCCCTGATCACAAATATATTTTGATTAGTACTATTCACCGGCCGTATTCTTTTATTGTTCCATGTGGAAGATTTCCTAAAAAGGTAGCTGTGTGGACAACTGACGGCAAGTTTGTTAGGGAGCTTTGTGATTTGCCTCTTGCTGAGGATATCCCCATTGCATTCAACAGTGTGAGAAAGGGGATGCGTTCCATCAATTGGAGAGCAGATAAGCCATCGACACTCTACTGGGTGGAAACTCAAGATGGTGGAGATGCCAGAATCGAGGTTTCTCCTCGTGACATTGTTTATACACAATCTGCTGAACCACTGGAAAGTGAACAGCCAGAGATACTGCATAAACTTGATCTTCGTTATGGAGGAATATCTTGGTGTGATGACTCACTGGCTCTTGTTTATGAATCTTGGTATAAGACGCGCAAAATACGAACGTGGGTAATCTCTCCTGGTTCTAAAGAGGACAATCCTCGCATTCTATTTGATAGGTCATCAGAAGATGTGTATTCAGACCCTGGGTCACCGATGCTACGGAGGACTCCTCTTGGGACTTACGTAATTGCAAAATTAAAGAAGGATAATTATGAAGGCACATATGTTCTACTCAATGGTAGTGGTGCTACTCCGGAAGGGAACATCCCTTTTATTGATTTATTTGACATAAACACAGGCAGCAAAGAAAGAATATGGAAGAGCAACAAAGAAACTTATTATGAGAGTGTTGTGGCTTTAATGTCTGATCAGAAAGAAGGAGATCTAAATATTGATGAGCTGAAATTTTTGACTTCCAAAGAATCCAAAACTGAAAATACTCAGTACTACATTCTGAGGTGGCCTGGTAAGAAAGCAAGTCAAATCACAAAATTTCCTCATCCATATCCACAGCTGGCATCACTGCAGAAAGAGATGGTTAGATACGAGAGAAAAGACGGAGTTCAACTGACAGCCACACTATATCTACCCCCAAACTACGATCCAGCAAAAGATGGCCCTCTTCCCTGCTTGATCTGGTCTTACCCTGGAGAATTCAAAAGCAAAGATGCAGCTGGACAAGTTCGTGGTTCCCCCAATGAGTTTGCTAGTATAGGTCCAACATC
TGCTCTTCTTTGGTTGGCTCGCAGGTTTGCCATTTTGGCTGGACCAACAATACCTATCATTGGTGAAGGTAACGAGGAGGCAAATGATAGATATGTAGAGCAATTGGTTGCGAGTGCAGAGGCTGCTGTAGAGGAGGTCATTAAACGGGGGGTTGCTCATCCTAATAAGATTGCTGTTGGTGGACATTCATATGGTGCGTTTATGACTGCAAACCTTCTGGCTCATGCTCCCCATCTTTTTTGCTGTGGAATTGCTCGCTCCGGAGCCTATAACAGAACACTGACCCCTTTTGGCTTTCAGAATGAGGATAGAACTCTTTGGGAAGCAACCAACACATATGTAGAGATGAGTCCATTTATATCAGCAAATAAAATCAAGAAGCCAATTTTACTCATTCATGGCGAAGAAGACAACAACCCAGGAACATTACCCATGCAGTCCGATCGATTTTTCAATGCCTTGAAAGGCCATGGAGCATTATGTCGCCTTGTGGTTCTTCCCTTCGAAAGCCATGGTTATTCTTCACGAGAGAGTATCATGCATGTCCTCTGGGAAACTGATCGATGGCTAGAGAAATACGGTTCCTCTAACGCTTCTGATTTAAGTCAAGATGTGGATAAAATTAAAGAGGAAGGCAATGGAACAGCAGATTCCGCAGGGAAAGTTGTTGCTGGTTCTGGAGGCGGTGGCACAGAGAGTCCAAGTCCTGATAATGATGGATTTTACTCTATTCAAAGATCATTGTTGTGGTAA

Protein sequence

MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRLPPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRSRMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSGKLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPHGPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSLDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNSVRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLRRTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLASLQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEFASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGYSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTESPSPDNDGFYSIQRSLLW
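The protein sequence above should follow from the coding sequence by reading it three bases at a time. The sketch below illustrates this with the standard genetic code, built from the conventional TCAG ordering. It is an illustration, not the pipeline that produced this annotation, and it uses only the first 36 and last 21 bases of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# Excerpts copied from the CDS above (both are in frame).
cds_start = "ATGGCCTCATCTAGGTTCCGCAACCTTGTTCATCTC"
cds_end = "CAAAGATCATTGTTGTGGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to `MASSRFRNLVHL` and `QRSLLW`, which agree with the first and last residues of the protein sequence listed above. Passing the complete CDS to `translate` is the natural next check.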
Homology
BLAST of HG10023023 vs. NCBI nr
Match: XP_038898053.1 (probable glutamyl endopeptidase, chloroplastic [Benincasa hispida])

HSP 1 Score: 1826.2 bits (4729), Expect = 0.0e+00
Identity = 892/916 (97.38%), Positives = 903/916 (98.58%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSEDGGGGASGGA GGGSNGSVSSSSAVVST+DDENSVLGVGYRL
Sbjct: 54  MASSRFRNLVHLNAIVSEDGGGGASGGAAGGGSNGSVSSSSAVVSTDDDENSVLGVGYRL 113

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS
Sbjct: 114 PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 173

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG
Sbjct: 174 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 233

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQN DIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH
Sbjct: 234 KLRVWVADVETGKARPLFQNADIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 293

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSL 300
           GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSL
Sbjct: 294 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSL 353

Query: 301 DPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNSV 360
           DPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKF+RELCDLPLAEDIPIAFNSV
Sbjct: 354 DPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFIRELCDLPLAEDIPIAFNSV 413

Query: 361 RKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDLR 420
           RKGMRSINWRADKPSTL WVETQDGGDAR+EVSPRDIVYTQSAEPLESEQPEILHKLDLR
Sbjct: 414 RKGMRSINWRADKPSTLCWVETQDGGDARVEVSPRDIVYTQSAEPLESEQPEILHKLDLR 473

Query: 421 YGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLRR 480
           YGGISWCDDSLALVYESWYK RKIRTWVISP SKE+NPRILFDRSSEDVYSDPGSPMLRR
Sbjct: 474 YGGISWCDDSLALVYESWYKMRKIRTWVISPDSKENNPRILFDRSSEDVYSDPGSPMLRR 533

Query: 481 TPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYE 540
           TPLGTYVIAKLKKDNYEGT+VLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYE
Sbjct: 534 TPLGTYVIAKLKKDNYEGTFVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYE 593

Query: 541 SVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLASL 600
           SVVALMSDQ +GDL+IDELKFLTSKESKTENTQYYILRWPGKKA+QITKFPHPYPQLASL
Sbjct: 594 SVVALMSDQIDGDLDIDELKFLTSKESKTENTQYYILRWPGKKATQITKFPHPYPQLASL 653

Query: 601 QKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEF 660
           QKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEF
Sbjct: 654 QKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEF 713

Query: 661 ASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAH 720
           ASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAH
Sbjct: 714 ASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAH 773

Query: 721 PNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTY 780
           P+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTY
Sbjct: 774 PHKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTY 833

Query: 781 VEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGY 840
           VEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGY
Sbjct: 834 VEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGY 893

Query: 841 SSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTES 900
           SSRESIMHVLWETDRWLEKY SSN SDL QDVDK KEEGNG ADSAGKVVAGSGGGGTES
Sbjct: 894 SSRESIMHVLWETDRWLEKYCSSNTSDLGQDVDKSKEEGNGAADSAGKVVAGSGGGGTES 953

Query: 901 PSPDNDGFYSIQRSLL 917
           P PD+ GFYSIQRSLL
Sbjct: 954 PGPDDYGFYSIQRSLL 969
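The percentages in an HSP summary line are the matching-column counts divided by the alignment length. Below is a small sketch that parses such a line and recomputes the percentages, using the summary of the hit above as input. The helper name `parse_fractions` is hypothetical, and the regular expression only targets the `label = n/m (p%)` layout seen on this page.

```python
import re

# Summary line of the HSP above (Benincasa hispida hit).
hsp_line = ("Identity = 892/916 (97.38%), "
            "Positives = 903/916 (98.58%), Query Frame = 0")

def parse_fractions(line):
    """Return {label: (count, alignment_length, reported_percent)}."""
    found = re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line)
    return {label: (int(n), int(d), float(p)) for label, n, d, p in found}

stats = parse_fractions(hsp_line)

# Recompute each percentage from the raw counts and compare with the report.
for label, (count, length, reported) in stats.items():
    recomputed = round(100 * count / length, 2)
    print(label, count, length, reported, recomputed)
```

Both recomputed values agree with the reported ones to two decimal places: 892 of 916 aligned columns are identical residues, and 903 of 916 are identical or conservative substitutions.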

BLAST of HG10023023 vs. NCBI nr
Match: TYK18231.1 (putative glutamyl endopeptidase [Cucumis melo var. makuwa])

HSP 1 Score: 1772.7 bits (4590), Expect = 0.0e+00
Identity = 865/918 (94.23%), Positives = 887/918 (96.62%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSE      +GGAGGGGSNGSVSSSSAV STEDDE+SVLGVGYRL
Sbjct: 1   MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDDEDSVLGVGYRL 60

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 120

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPE EVHGLPDGAKINF+TWSPDGRHL+F+VR+DEE GSSG
Sbjct: 121 RISFYTGIGIHQLMPDDSLGPELEVHGLPDGAKINFVTWSPDGRHLAFTVRIDEEGGSSG 180

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVF NFVWVNDSTLLVCTIPSSRGDPPKKPLVP 
Sbjct: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFNNFVWVNDSTLLVCTIPSSRGDPPKKPLVPR 240

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGTPAIYTS 300
           GPKVQSNEQKNIIQART+QDLLKD YDEDLFDYYAT+QLVLGSL DGTV+ FG PA+YTS
Sbjct: 241 GPKVQSNEQKNIIQARTYQDLLKDTYDEDLFDYYATSQLVLGSLEDGTVKEFGPPAVYTS 300

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           LDPSPDHKYILISTIHRPYSFIVPCGRFP +V VWTTDG FVRELCDLPLAEDIPIAFNS
Sbjct: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPNRVDVWTTDGNFVRELCDLPLAEDIPIAFNS 360

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL
Sbjct: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RYGGI WCDDSLALVYESWYKTRKIRTWVISPGS EDNPR+LFDRSSEDVYSDPGSPM R
Sbjct: 421 RYGGIYWCDDSLALVYESWYKTRKIRTWVISPGSLEDNPRLLFDRSSEDVYSDPGSPMQR 480

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RTPLGTYVIAKLKK+NY+GTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKS+KETYY
Sbjct: 481 RTPLGTYVIAKLKKENYDGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSDKETYY 540

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           ESV+ALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGK ASQIT FPHPYPQLAS
Sbjct: 541 ESVLALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKTASQITNFPHPYPQLAS 600

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           LQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE
Sbjct: 601 LQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAV+EVIKRGVA
Sbjct: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVQEVIKRGVA 720

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT+T
Sbjct: 721 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATST 780

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG
Sbjct: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YSSRESIMHVLWETDRWLEKY SSNASDL QD DK KEEGN  ADSAGKVVAGSGGGGTE
Sbjct: 841 YSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKEEGNAAADSAGKVVAGSGGGGTE 900

Query: 901 SPSPDNDGFYSIQRSLLW 918
           S SPDNDGFYSIQRS LW
Sbjct: 901 SSSPDNDGFYSIQRSSLW 912
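Unlike the first hit, this alignment contains gaps (the `-` characters), which count toward the alignment length but not toward the identities. The sketch below counts identities, gaps, and mismatches column by column for the first row of the alignment above. It is an illustrative recount of one row, not a reimplementation of BLAST scoring, and the function name `column_stats` is hypothetical.

```python
# First alignment row of the hit above, copied from the Query and Sbjct lines.
query = "MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL"
sbjct = "MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDDEDSVLGVGYRL"

def column_stats(q, s):
    """Count identical, gapped, and mismatched columns of an aligned pair."""
    if len(q) != len(s):
        raise ValueError("aligned rows must have equal length")
    identities = gaps = mismatches = 0
    for a, b in zip(q, s):
        if a == "-" or b == "-":
            gaps += 1          # a gap in either row
        elif a == b:
            identities += 1    # identical residues
        else:
            mismatches += 1    # substitution (conservative or not)
    return identities, gaps, mismatches

identities, gaps, mismatches = column_stats(query, sbjct)
print(identities, gaps, mismatches, len(query))
```

For this row the count is 51 identities, 6 gap columns, and 3 substitutions over 60 columns. Summing the same counts over every row of an HSP should give the numerator and denominator of its Identity line.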

BLAST of HG10023023 vs. NCBI nr
Match: XP_008451481.1 (PREDICTED: probable glutamyl endopeptidase, chloroplastic [Cucumis melo])

HSP 1 Score: 1768.1 bits (4578), Expect = 0.0e+00
Identity = 864/917 (94.22%), Positives = 886/917 (96.62%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSE      +GGAGGGGSNGSVSSSSAV STEDDE+SVLGVGYRL
Sbjct: 57  MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDDEDSVLGVGYRL 116

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 117 PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 176

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPE EVHGLPDGAKINF+TWSPDGRHL+F+VR+DEE GSSG
Sbjct: 177 RISFYTGIGIHQLMPDDSLGPELEVHGLPDGAKINFVTWSPDGRHLAFTVRIDEEGGSSG 236

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVF NFVWVNDSTLLVCTIPSSRGDPPKKPLVP 
Sbjct: 237 KLRVWVADVETGKARPLFQNTDIYVNAVFNNFVWVNDSTLLVCTIPSSRGDPPKKPLVPR 296

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGTPAIYTS 300
           GPKVQSNEQKNIIQART+QDLLKD YDEDLFDYYAT+ LVLGSL DGTV+ FG PA+YTS
Sbjct: 297 GPKVQSNEQKNIIQARTYQDLLKDTYDEDLFDYYATSLLVLGSLEDGTVKEFGPPAVYTS 356

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           LDPSPDHKYILISTIHRPYSFIVPCGRFP +V VWTTDGKFVRELCDLPLAEDIPIAFNS
Sbjct: 357 LDPSPDHKYILISTIHRPYSFIVPCGRFPNRVDVWTTDGKFVRELCDLPLAEDIPIAFNS 416

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL
Sbjct: 417 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 476

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RYGGI WCDDSLALVYESWYKTRKIRTWVISPGS EDNPR+LFDRSSEDVYSDPGSPM R
Sbjct: 477 RYGGIYWCDDSLALVYESWYKTRKIRTWVISPGSLEDNPRLLFDRSSEDVYSDPGSPMQR 536

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RTPLGTYVIAKLKK+NY+GTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKS+KETYY
Sbjct: 537 RTPLGTYVIAKLKKENYDGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSDKETYY 596

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           ESV+ALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGK ASQIT FPHPYPQLAS
Sbjct: 597 ESVLALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKTASQITNFPHPYPQLAS 656

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           LQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE
Sbjct: 657 LQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 716

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAV+EVIKRGVA
Sbjct: 717 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVQEVIKRGVA 776

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT+T
Sbjct: 777 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATST 836

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG
Sbjct: 837 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 896

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YSSRESIMHVLWETDRWLEKY SSNASDL QD DK KEEGN  ADSAGKVVAGSGGGGTE
Sbjct: 897 YSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKEEGNAAADSAGKVVAGSGGGGTE 956

Query: 901 SPSPDNDGFYSIQRSLL 917
           S SPDNDGFYSIQRS L
Sbjct: 957 SSSPDNDGFYSIQRSFL 967

BLAST of HG10023023 vs. NCBI nr
Match: KAA0046695.1 (putative glutamyl endopeptidase [Cucumis melo var. makuwa])

HSP 1 Score: 1763.4 bits (4566), Expect = 0.0e+00
Identity = 863/918 (94.01%), Positives = 884/918 (96.30%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSE      +GGAGGGGSNGSVSSSSAV STEDD   VLGVGYRL
Sbjct: 1   MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDD---VLGVGYRL 60

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 120

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPE EVHGLPDGAKINF+TWSPDGRHL+F+VR+DEE GSSG
Sbjct: 121 RISFYTGIGIHQLMPDDSLGPELEVHGLPDGAKINFVTWSPDGRHLAFTVRIDEEGGSSG 180

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVF NFVWVNDSTLLVCTIPSSRGDPPKKPLVP 
Sbjct: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFNNFVWVNDSTLLVCTIPSSRGDPPKKPLVPR 240

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGTPAIYTS 300
           GPKVQSNEQKNIIQART+QDLLKD YDEDLFDYYAT+QLVLGSL DGTV+ FG PA+YTS
Sbjct: 241 GPKVQSNEQKNIIQARTYQDLLKDTYDEDLFDYYATSQLVLGSLEDGTVKEFGPPAVYTS 300

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           LDPSPDHKYILISTIHRPYSFIVPCGRFP +V VWTTDG FVRELCDLPLAEDIPIAFNS
Sbjct: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPNRVDVWTTDGNFVRELCDLPLAEDIPIAFNS 360

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL
Sbjct: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RYGGI WCDDSLALVYESWYKTRKIRTWVISPGS EDNPR+LFDRSSEDVYSDPGSPM R
Sbjct: 421 RYGGIYWCDDSLALVYESWYKTRKIRTWVISPGSLEDNPRLLFDRSSEDVYSDPGSPMQR 480

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RTPLGTYVIAKLKK+NY+GTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKS+KETYY
Sbjct: 481 RTPLGTYVIAKLKKENYDGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSDKETYY 540

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           ESV+ALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGK ASQIT FPHPYPQLAS
Sbjct: 541 ESVLALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKTASQITNFPHPYPQLAS 600

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           LQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE
Sbjct: 601 LQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAV+EVIKRGVA
Sbjct: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVQEVIKRGVA 720

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT+T
Sbjct: 721 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATST 780

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG
Sbjct: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YSSRESIMHVLWETDRWLEKY SSNASDL QD DK KEEGN  ADSAGKVVAGSGGGGTE
Sbjct: 841 YSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKEEGNAAADSAGKVVAGSGGGGTE 900

Query: 901 SPSPDNDGFYSIQRSLLW 918
           S SPDNDGFYSIQRS LW
Sbjct: 901 SSSPDNDGFYSIQRSSLW 909

BLAST of HG10023023 vs. NCBI nr
Match: XP_004135992.1 (probable glutamyl endopeptidase, chloroplastic [Cucumis sativus] >KGN45015.1 hypothetical protein Csa_015806 [Cucumis sativus])

HSP 1 Score: 1750.7 bits (4533), Expect = 0.0e+00
Identity = 853/919 (92.82%), Positives = 887/919 (96.52%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSEDGG     G GGGGSNGSVSSSSAV ST DDE+SVLGVGYRL
Sbjct: 56  MASSRFRNLVHLNAIVSEDGG----SGGGGGGSNGSVSSSSAVASTVDDEDSVLGVGYRL 115

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLP+LSFSPYRDKILFLKRRSLPP++ELAKPEEKLAGIRIDGQCNCRS
Sbjct: 116 PPAEIRDIVDAPPLPLLSFSPYRDKILFLKRRSLPPLAELAKPEEKLAGIRIDGQCNCRS 175

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPEKEV GLP+GAKINF+TWSPDGRHL+F+VRVDE+DGSS 
Sbjct: 176 RISFYTGIGIHQLMPDDSLGPEKEVRGLPNGAKINFVTWSPDGRHLAFTVRVDEDDGSSS 235

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETG+ARPLFQNTDIYVNAVF+NFVWVNDSTLLVCTIP SRGDPPKKPLVP 
Sbjct: 236 KLRVWVADVETGEARPLFQNTDIYVNAVFDNFVWVNDSTLLVCTIPFSRGDPPKKPLVPP 295

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGT--PAIY 300
           GPKVQSNEQKNIIQART+QDLLKD+YD+DLFDYYAT+QLVLGSL DGTV+ FGT  PA+Y
Sbjct: 296 GPKVQSNEQKNIIQARTYQDLLKDEYDKDLFDYYATSQLVLGSLEDGTVKEFGTSPPAVY 355

Query: 301 TSLDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAF 360
           TSLDPSPDHKYILISTIHRPYSFIVPCGRFP +VAVWTTDGKFVR+LCDLPLAEDIPIAF
Sbjct: 356 TSLDPSPDHKYILISTIHRPYSFIVPCGRFPNRVAVWTTDGKFVRDLCDLPLAEDIPIAF 415

Query: 361 NSVRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKL 420
           NSVRKG RSINWRADKPSTLYWVETQDGGDAR+EVSPRDIVYT+SAEPLESEQPEILHKL
Sbjct: 416 NSVRKGKRSINWRADKPSTLYWVETQDGGDARVEVSPRDIVYTESAEPLESEQPEILHKL 475

Query: 421 DLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPM 480
           DLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDN R+LFDRSSEDVYSDPGSPM
Sbjct: 476 DLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNARLLFDRSSEDVYSDPGSPM 535

Query: 481 LRRTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKET 540
           +RRTP GTYVIAKLKK+NY+GTYVLLNG GATPEGNIPFIDLFDINTGSKERIWKS++ET
Sbjct: 536 VRRTPFGTYVIAKLKKENYDGTYVLLNGRGATPEGNIPFIDLFDINTGSKERIWKSDRET 595

Query: 541 YYESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQL 600
           YYESVVALMSDQKEGDLNI+ELKFLTSKESKTENTQYYILRWPGK ASQITKFPHPYPQL
Sbjct: 596 YYESVVALMSDQKEGDLNINELKFLTSKESKTENTQYYILRWPGKTASQITKFPHPYPQL 655

Query: 601 ASLQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSP 660
           ASLQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSP
Sbjct: 656 ASLQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSP 715

Query: 661 NEFASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRG 720
           NEFA IGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLV SAEAAV+EVIKRG
Sbjct: 716 NEFAGIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVGSAEAAVQEVIKRG 775

Query: 721 VAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT 780
           VAHP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT
Sbjct: 776 VAHPSKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT 835

Query: 781 NTYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFES 840
           +TYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFES
Sbjct: 836 STYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFES 895

Query: 841 HGYSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGG 900
           HGYSSRESIMHVLWETDRWLEKY SSNASDL QD DK K+EGNG ADSAGKVVAGSGGG 
Sbjct: 896 HGYSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKQEGNGAADSAGKVVAGSGGGD 955

Query: 901 TESPSPDNDGFYSIQRSLL 917
           TES SPDNDGFYSIQRS L
Sbjct: 956 TESSSPDNDGFYSIQRSFL 970

BLAST of HG10023023 vs. ExPASy Swiss-Prot
Match: Q8VZF3 (Probable glutamyl endopeptidase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=GEP PE=2 SV=2)

HSP 1 Score: 1455.7 bits (3767), Expect = 0.0e+00
Identity = 706/894 (78.97%), Positives = 794/894 (88.81%), Query Frame = 0

Query: 25  SGGA--GGGGSNGSVSSSSAVVSTEDDENSVLGVGYRLPPAEIRDIVDAPPLPILSFSPY 84
           SGGA  GGG SNGS+S+S+   +TEDDE ++ G GYRLPP EIRDIVDAPP+P LSFSP+
Sbjct: 78  SGGAEDGGGTSNGSLSASA--TATEDDELAI-GTGYRLPPPEIRDIVDAPPVPALSFSPH 137

Query: 85  RDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRSRMSFYTGIGIHQLMPDDSLGPE 144
           RDKILFLKRR+LPP+++LA+PEEKLAG+RIDG CN RSRMSFYTG+GIHQL+PD +L PE
Sbjct: 138 RDKILFLKRRALPPLADLARPEEKLAGVRIDGYCNTRSRMSFYTGLGIHQLLPDGTLSPE 197

Query: 145 KEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSGKLRVWVADVETGKARPLFQNTD 204
           KE+ G+PDG KINF+TWS DG+HL+FS+RVD E+G+S K  VWVADVETG ARPLF + D
Sbjct: 198 KEITGIPDGGKINFVTWSNDGKHLAFSIRVD-ENGNSSKPVVWVADVETGVARPLFNSQD 257

Query: 205 IYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPHGPKVQSNEQKNIIQARTFQDLL 264
           I++NA+FE+FVW+++STLLV TIPSSRG+PPKKPLVP GPK  SNE K ++Q RTFQDLL
Sbjct: 258 IFLNAIFESFVWIDNSTLLVSTIPSSRGEPPKKPLVPSGPKTLSNETKTVVQVRTFQDLL 317

Query: 265 KDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSLDPSPDHKYILISTIHRPYSFIV 324
           KD+YD DLFDYYA++QLVL SLDGTV+  G PA+YTSLDPS DHKY+L+S++HRPYSFIV
Sbjct: 318 KDEYDADLFDYYASSQLVLASLDGTVKEVGVPAVYTSLDPSTDHKYLLVSSLHRPYSFIV 377

Query: 325 PCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNSVRKGMRSINWRADKPSTLYWVET 384
           PCGRFPKKV VWTTDG+FVR+LCDLPLAEDIPIA NSVRKGMRSINWRADKPSTL W ET
Sbjct: 378 PCGRFPKKVEVWTTDGRFVRQLCDLPLAEDIPIASNSVRKGMRSINWRADKPSTL-WAET 437

Query: 385 QDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDLRYGGISWCDDSLALVYESWYKTR 444
           QDGGDA++EVSPRDIVY QSAEPL  E+PE+LHKLDLRYGGISWCDD+LALVYESWYKTR
Sbjct: 438 QDGGDAKMEVSPRDIVYMQSAEPLAGEEPEVLHKLDLRYGGISWCDDTLALVYESWYKTR 497

Query: 445 KIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLRRTPLGTYVIAKLKKDNYEGTYVL 504
           + RTWVISPGS + +PRILFDRSSEDVYSDPGS MLRRT  GTYVIAK+KK+N EGTYVL
Sbjct: 498 RTRTWVISPGSNDVSPRILFDRSSEDVYSDPGSTMLRRTDAGTYVIAKIKKENDEGTYVL 557

Query: 505 LNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYESVVALMSDQKEGDLNIDELKFL 564
           LNGSGATP+GN+PF+DLFDINTG+KERIW+S+KE Y+E+VVALMSDQKEGDL ++ELK L
Sbjct: 558 LNGSGATPQGNVPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQKEGDLKMEELKIL 617

Query: 565 TSKESKTENTQYYILRWPGKKASQITKFPHPYPQLASLQKEMVRYERKDGVQLTATLYLP 624
           TSKESKTENTQY +  WP +K  QIT FPHPYPQLASLQKEM+RY+RKDGVQLTATLYLP
Sbjct: 618 TSKESKTENTQYSLQLWPDRKVQQITNFPHPYPQLASLQKEMIRYQRKDGVQLTATLYLP 677

Query: 625 PNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEFASIGPTSALLWLARRFAILAGP 684
           P YDP+KDGPLPCL WSYPGEFKSKDAAGQVRGSPNEFA IG TSALLWLARRFAIL+GP
Sbjct: 678 PGYDPSKDGPLPCLFWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSALLWLARRFAILSGP 737

Query: 685 TIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAHPNKIAVGGHSYGAFMTANLLAH 744
           TIPIIGEG+EEANDRYVEQLVASAEAAVEEV++RGVA  +KIAVGGHSYGAFMTANLLAH
Sbjct: 738 TIPIIGEGDEEANDRYVEQLVASAEAAVEEVVRRGVADRSKIAVGGHSYGAFMTANLLAH 797

Query: 745 APHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFISANKIKKPILLIHGE 804
           APHLF CGIARSGAYNRTLTPFGFQNEDRTLWEATN YVEMSPF+SANKIKKPILLIHGE
Sbjct: 798 APHLFACGIARSGAYNRTLTPFGFQNEDRTLWEATNVYVEMSPFMSANKIKKPILLIHGE 857

Query: 805 EDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGYSSRESIMHVLWETDRWLEKYGS 864
           EDNNPGTL MQSDRFFNALKGHGALCRLVVLP ESHGYS+RESIMHVLWETDRWL+KY  
Sbjct: 858 EDNNPGTLTMQSDRFFNALKGHGALCRLVVLPHESHGYSARESIMHVLWETDRWLQKYCV 917

Query: 865 SNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTESPSPDNDGFYSIQRSLL 917
            N SD     D+ KE     +DSA KV  G+GGG  E    +++    ++RSLL
Sbjct: 918 PNTSDADTSPDQSKE----GSDSADKVSTGTGGGNPE--FGEHEVHSKLRRSLL 960
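
The percentages in each HSP summary line are the ratios of the reported counts to the alignment length (for example, 706/894 gives 78.97%). The following is a minimal sketch, not part of the database output, of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_summary` and the returned keys are hypothetical, chosen only for illustration.

```python
import re

# Matches summary lines such as:
#   Identity = 706/894 (78.97%), Positives = 794/894 (88.81%), Query Frame = 0
# The optional "i" also accepts the misspelled label "Postives",
# which appears in some exports of these reports.
_SUMMARY = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_summary(line):
    """Return recomputed identity/positives percentages for one HSP line.

    Hypothetical helper: the name and return shape are assumptions.
    """
    match = _SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "aligned_length": ident_len,
        "identity_pct": round(100.0 * identical / ident_len, 2),
        "positives_pct": round(100.0 * positive / pos_len, 2),
    }


if __name__ == "__main__":
    example = "Identity = 706/894 (78.97%), Positives = 794/894 (88.81%), Query Frame = 0"
    print(parse_hsp_summary(example))
```

Recomputing the values this way is a quick consistency check when summary lines are copied between reports; it says nothing about the alignment itself.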

BLAST of HG10023023 vs. ExPASy Swiss-Prot
Match: Q10MJ1 (Probable glutamyl endopeptidase, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=GEP PE=2 SV=1)

HSP 1 Score: 1432.5 bits (3707), Expect = 0.0e+00
Identity = 690/917 (75.25%), Positives = 791/917 (86.26%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAV-VSTEDDENSVLGVGYR 60
           M+SS    L H+ A         A+GGA G  S    ++++A  ++ EDD+ S   +GYR
Sbjct: 42  MSSSAASRLSHIVA---------AAGGAAGESSEPPAAAAAASGLAQEDDDLSSAMMGYR 101

Query: 61  LPPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCR 120
           LPP EI+DIVDAPPLP+LSFSP +DKILFLKRR+LPP+S+LAKPEEKLAG+RIDG  N R
Sbjct: 102 LPPKEIQDIVDAPPLPVLSFSPSKDKILFLKRRALPPLSDLAKPEEKLAGVRIDGYSNTR 161

Query: 121 SRMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSS 180
           SRMSFYTGIGIH+LM D +LGPEK VHG P+GA+INF+TWS DGRHLSFSVRVDEED +S
Sbjct: 162 SRMSFYTGIGIHKLMDDGTLGPEKVVHGYPEGARINFVTWSQDGRHLSFSVRVDEEDNTS 221

Query: 181 GKLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVP 240
           GKLR+W+ADVE+G+ARPLF++ +IY+NA+F++FVWVN+STLLVCTIP SRG PP+KP VP
Sbjct: 222 GKLRLWIADVESGEARPLFKSPEIYLNAIFDSFVWVNNSTLLVCTIPLSRGAPPQKPSVP 281

Query: 241 HGPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTS 300
            GPK+QSNE  N++Q RTFQDLLKD+YD DLFDYYAT+QLVL S DGTV+  G PA+YTS
Sbjct: 282 SGPKIQSNETSNVVQVRTFQDLLKDEYDADLFDYYATSQLVLASFDGTVKPIGPPAVYTS 341

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           +DPSPD KY++IS+IHRPYS+IVPCGRFPKKV +WT DG+F+RELCDLPLAEDIPIA +S
Sbjct: 342 IDPSPDDKYLMISSIHRPYSYIVPCGRFPKKVELWTVDGEFIRELCDLPLAEDIPIATSS 401

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKG RSI WR DKP+ LYWVETQDGGDA++EVSPRDIVY ++AEP+  EQPEILHKLDL
Sbjct: 402 VRKGKRSIYWRPDKPAMLYWVETQDGGDAKVEVSPRDIVYMENAEPINGEQPEILHKLDL 461

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RY G SWCD+SLALVYESWYKTRK RTWVISP  K+ +PRILFDRSSEDVYSDPGSPMLR
Sbjct: 462 RYAGTSWCDESLALVYESWYKTRKTRTWVISPDKKDVSPRILFDRSSEDVYSDPGSPMLR 521

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RT +GTYVIAK+KK + E TY+LLNG GATPEGN+PF+DLFDINTGSKERIW+S+KE YY
Sbjct: 522 RTAMGTYVIAKVKKQD-ENTYILLNGMGATPEGNVPFLDLFDINTGSKERIWQSDKEKYY 581

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           E+VVALMSD+ +G+L +++LK LTSKESKTENTQYY+  WP KK  QIT FPHPYPQLAS
Sbjct: 582 ETVVALMSDKTDGELPLEKLKILTSKESKTENTQYYLQIWPEKKQVQITDFPHPYPQLAS 641

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           L KEM+RY+RKDGVQLTATLYLPP YDP++DGPLPCL+WSYPGEFKSKDAAGQVRGSPNE
Sbjct: 642 LYKEMIRYQRKDGVQLTATLYLPPGYDPSQDGPLPCLVWSYPGEFKSKDAAGQVRGSPNE 701

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           F  IG TS LLWLAR FAIL+GPTIPIIGEG+EEANDRYVEQLV SAEAA EEV++RGVA
Sbjct: 702 FPGIGATSPLLWLARGFAILSGPTIPIIGEGDEEANDRYVEQLVTSAEAAAEEVVRRGVA 761

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT
Sbjct: 762 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 821

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPF+SANKIKKPILLIHGE+DNN GTL MQSDRFFNALKGHGAL RLV+LPFESHG
Sbjct: 822 YVEMSPFMSANKIKKPILLIHGEQDNNSGTLTMQSDRFFNALKGHGALSRLVILPFESHG 881

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YS+RESIMHVLWETDRWL+KY  S +S         K + +  AD+  K V+ S GGG  
Sbjct: 882 YSARESIMHVLWETDRWLQKYCLSGSS---------KTDSDSVADTENKTVSAS-GGGAP 938

Query: 901 SPSPDNDGFYSIQRSLL 917
              P+ +GF S+QRSLL
Sbjct: 942 CEGPEAEGFSSMQRSLL 938

BLAST of HG10023023 vs. ExPASy Swiss-Prot
Match: P34422 (Dipeptidyl peptidase family member 6 OS=Caenorhabditis elegans OX=6239 GN=dpf-6 PE=3 SV=2)

HSP 1 Score: 77.8 bits (190), Expect = 7.1e-13
Identity = 86/353 (24.36%), Positives = 146/353 (41.36%), Query Frame = 0

Query: 536 ETYYESVVALMSDQKEGDLN-----IDELKFLTSKESKTENTQYYILRWPGKKASQITKF 595
           ET+ E +  L++ +  G +N     ID   +L +  S  E    Y+ R   KKA ++   
Sbjct: 309 ETFMEDLQYLVNMKPSGSMNIVSMSIDMSTWLVTYSSSDEPYDIYLYRRWNKKA-ELFMS 368

Query: 596 PHPYPQLASLQKEM-VRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDA 655
             P  +  +L K++   +  +D + + A L LPP     K   +P     Y        A
Sbjct: 369 TRPELKKYTLNKQIGFDFRARDEMTIQAYLSLPPQAPLLKSSQVPDGDRPY-ANLGMIPA 428

Query: 656 AGQ-----VRGSPNEFASIGPTSALLWLARR-FAILAGPTIPIIGEG---NEEANDRYVE 715
             Q     V G P      G +    WL  R +++L        G G       N  +  
Sbjct: 429 VPQKMIVLVHGGPKARDHYGFSPMNAWLTNRGYSVLQVNFRGSTGFGKRLTNAGNGEWGR 488

Query: 716 QLVASAEAAVEEVIKRGVAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYN-- 775
           ++      AVE  + +G+A+ +++AV G SYG + T   L   P  F CG+   G  N  
Sbjct: 489 KMHFDILDAVEFAVSKGIANRSEVAVMGGSYGGYETLVALTFTPQTFACGVDIVGPSNLI 548

Query: 776 ---RTLTPF--GFQNE-------DRTLWEATNTYVEMSPFISANKIKKPILLIHGEEDNN 835
              + + P+  GF+ +       D +  E   +    SP   A+++ KPI++I G   N+
Sbjct: 549 SLVQAIPPYWLGFRKDLIKMVGADISDEEGRQSLQSRSPLFFADRVTKPIMIIQGA--ND 608

Query: 836 PGTLPMQSDRFFNALKGHGALCRLVVLPFESHGYSSRESIMHVLWETDRWLEK 860
           P     +SD+F  AL+        ++ P E HG    ++ M      + +L++
Sbjct: 609 PRVKQAESDQFVAALEKKHIPVTYLLYPDEGHGVRKPQNSMEQHGHIETFLQQ 657

BLAST of HG10023023 vs. ExPASy Swiss-Prot
Match: C3J8X2 (Dipeptidyl-peptidase 5 OS=Porphyromonas endodontalis (strain ATCC 35406 / BCRC 14492 / JCM 8526 / NCTC 13058 / HG 370) OX=553175 GN=dpp5 PE=1 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 1.2e-12
Identity = 98/420 (23.33%), Positives = 163/420 (38.81%), Query Frame = 0

Query: 481 TPLGTYVI-AKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIW-KSNKETY 540
           +P G Y+    +++D YE   + L     T        + F+ N   ++  W +  K  Y
Sbjct: 291 SPDGKYMTWCSMERDGYESDLIRLFLLDRTTGEKTYLTEGFEYNV--EQPTWSQDGKSIY 350

Query: 541 YESVVALMSDQKEGDLNIDELKFLT------------------SKESKTENTQYYILRWP 600
           + + V   S   E  L   +++ +T                  +++S    T  Y +   
Sbjct: 351 FIACVEAESHLYELTLKNKKIRRITQGQMDYVGFDLQGTTLVAARQSMLAPTDLYRIDLK 410

Query: 601 GKKASQITK-FPHPYPQLASLQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWS 660
              A+ ITK       QL  ++ E       +G ++   +  P N+D +K    P +++ 
Sbjct: 411 KGTATAITKENESTLAQLGDIRCEKRWMNTTNGEKMLVWVLYPANFDASK--KYPSILYC 470

Query: 661 YPGEFKSKDAAGQVRGSPNEFASIGPTSALLWLARRFAILAGPTIPIIGEG-NEEANDRY 720
             G   +       R +P   A  G    ++ L  R        +P  G+  NE+ +  Y
Sbjct: 471 QGGPQSTISQFWSYRWNPRIMAENG---YIVILPNRHG------VPGFGKAWNEQISGDY 530

Query: 721 VEQLVASAEAAVEEVIKRGVAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYN 780
             Q +     A +E+ K     PN +   G SYG F    L  H    F C IA +G +N
Sbjct: 531 GGQNMRDYLTAADEMKKESYIDPNGMGCVGASYGGFSVYWLAGHHEKRFNCFIAHAGIFN 590

Query: 781 RTLTPFGFQNEDRTL---------WEATNTYVE----MSPFISANKIKKPILLIHGEEDN 840
             L     + E++           WE +N   +     SP +  +K   PIL+IHGE D 
Sbjct: 591 --LEAQYLETEEKWFANWDMGGAPWEKSNATAQRTFATSPHLFVDKWDTPILIIHGERDY 650

Query: 841 NPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGYSSRESIMHVLWE------TDRWLEK 860
               L  Q    F+A + HG    +++ P E+H     ++   VLW+       DRWL+K
Sbjct: 651 R--ILASQGMMAFDAARMHGVPTEMLLYPDENHWVLQPQNA--VLWQRTFFRWLDRWLKK 691

BLAST of HG10023023 vs. ExPASy Swiss-Prot
Match: V5YMB3 (Dipeptidyl aminopeptidase BIII OS=Pseudoxanthomonas mexicana OX=128785 GN=dapb3 PE=1 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 1.0e-11
Identity = 76/314 (24.20%), Positives = 134/314 (42.68%), Query Frame = 0

Query: 582 KKASQITKFPHPYPQLAS---LQKEMVRYERKDGVQLTATLYLPPNYDPAKDG----PLP 641
           + A  +TK     P+L     + +  V    +D   L + L LP + D   DG    P+P
Sbjct: 361 RSAGTLTKLFSARPKLEGKPLVPQWPVEIASRDNKTLVSYLTLPRSADANNDGKADAPVP 420

Query: 642 CLIWSYPGEFKSKDAAGQVRGSPNEFASIGPTSALLWLARR-FAILAGPTIPIIGEGNE- 701
            ++  + G + ++D+ G   G  N+           WLA R +A+L+       G G + 
Sbjct: 421 LVLLVHGGPW-ARDSYGY--GGYNQ-----------WLANRGYAVLSVNFRGSTGFGKDF 480

Query: 702 --EANDRYVEQLVASAEAAVEEVIKRGVAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCG 761
               N  +  ++      AV+  +K+GV   +++A+ G SYG + T   L   P  F CG
Sbjct: 481 TNAGNGEWAGKMHDDLIDAVQWAVKQGVTTQDQVAIMGGSYGGYATLTGLTFTPDAFACG 540

Query: 762 IARSGAYN-----RTLTPFG---FQNEDRTLWE-----ATNTYVEMSPFISANKIKKPIL 821
           +   G  N      T+ P+    F+   + + +           E SP   A++IKKP+L
Sbjct: 541 VDIVGPSNLNTLLSTVPPYWASFFEQLAKRMGDPRTDAGKKWLTERSPLTRADQIKKPLL 600

Query: 822 LIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGYSSRESIMHVLWETDRWL 872
           +  G+  N+P     +SD+   A++        V+ P E HG++  E+       T+ +L
Sbjct: 601 I--GQGANDPRVKQAESDQIVKAMQAKNIPVTYVLFPDEGHGFARPENNKAFNAVTEGFL 658

BLAST of HG10023023 vs. ExPASy TrEMBL
Match: A0A5D3D1V4 (Putative glutamyl endopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold411G001710 PE=4 SV=1)

HSP 1 Score: 1772.7 bits (4590), Expect = 0.0e+00
Identity = 865/918 (94.23%), Positives = 887/918 (96.62%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSE      +GGAGGGGSNGSVSSSSAV STEDDE+SVLGVGYRL
Sbjct: 1   MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDDEDSVLGVGYRL 60

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 120

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPE EVHGLPDGAKINF+TWSPDGRHL+F+VR+DEE GSSG
Sbjct: 121 RISFYTGIGIHQLMPDDSLGPELEVHGLPDGAKINFVTWSPDGRHLAFTVRIDEEGGSSG 180

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVF NFVWVNDSTLLVCTIPSSRGDPPKKPLVP 
Sbjct: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFNNFVWVNDSTLLVCTIPSSRGDPPKKPLVPR 240

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGTPAIYTS 300
           GPKVQSNEQKNIIQART+QDLLKD YDEDLFDYYAT+QLVLGSL DGTV+ FG PA+YTS
Sbjct: 241 GPKVQSNEQKNIIQARTYQDLLKDTYDEDLFDYYATSQLVLGSLEDGTVKEFGPPAVYTS 300

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           LDPSPDHKYILISTIHRPYSFIVPCGRFP +V VWTTDG FVRELCDLPLAEDIPIAFNS
Sbjct: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPNRVDVWTTDGNFVRELCDLPLAEDIPIAFNS 360

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL
Sbjct: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RYGGI WCDDSLALVYESWYKTRKIRTWVISPGS EDNPR+LFDRSSEDVYSDPGSPM R
Sbjct: 421 RYGGIYWCDDSLALVYESWYKTRKIRTWVISPGSLEDNPRLLFDRSSEDVYSDPGSPMQR 480

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RTPLGTYVIAKLKK+NY+GTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKS+KETYY
Sbjct: 481 RTPLGTYVIAKLKKENYDGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSDKETYY 540

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           ESV+ALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGK ASQIT FPHPYPQLAS
Sbjct: 541 ESVLALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKTASQITNFPHPYPQLAS 600

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           LQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE
Sbjct: 601 LQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAV+EVIKRGVA
Sbjct: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVQEVIKRGVA 720

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT+T
Sbjct: 721 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATST 780

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG
Sbjct: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YSSRESIMHVLWETDRWLEKY SSNASDL QD DK KEEGN  ADSAGKVVAGSGGGGTE
Sbjct: 841 YSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKEEGNAAADSAGKVVAGSGGGGTE 900

Query: 901 SPSPDNDGFYSIQRSLLW 918
           S SPDNDGFYSIQRS LW
Sbjct: 901 SSSPDNDGFYSIQRSSLW 912

BLAST of HG10023023 vs. ExPASy TrEMBL
Match: A0A1S3BRJ9 (probable glutamyl endopeptidase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103492756 PE=4 SV=1)

HSP 1 Score: 1768.1 bits (4578), Expect = 0.0e+00
Identity = 864/917 (94.22%), Positives = 886/917 (96.62%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSE      +GGAGGGGSNGSVSSSSAV STEDDE+SVLGVGYRL
Sbjct: 57  MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDDEDSVLGVGYRL 116

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 117 PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 176

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPE EVHGLPDGAKINF+TWSPDGRHL+F+VR+DEE GSSG
Sbjct: 177 RISFYTGIGIHQLMPDDSLGPELEVHGLPDGAKINFVTWSPDGRHLAFTVRIDEEGGSSG 236

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVF NFVWVNDSTLLVCTIPSSRGDPPKKPLVP 
Sbjct: 237 KLRVWVADVETGKARPLFQNTDIYVNAVFNNFVWVNDSTLLVCTIPSSRGDPPKKPLVPR 296

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGTPAIYTS 300
           GPKVQSNEQKNIIQART+QDLLKD YDEDLFDYYAT+ LVLGSL DGTV+ FG PA+YTS
Sbjct: 297 GPKVQSNEQKNIIQARTYQDLLKDTYDEDLFDYYATSLLVLGSLEDGTVKEFGPPAVYTS 356

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           LDPSPDHKYILISTIHRPYSFIVPCGRFP +V VWTTDGKFVRELCDLPLAEDIPIAFNS
Sbjct: 357 LDPSPDHKYILISTIHRPYSFIVPCGRFPNRVDVWTTDGKFVRELCDLPLAEDIPIAFNS 416

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL
Sbjct: 417 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 476

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RYGGI WCDDSLALVYESWYKTRKIRTWVISPGS EDNPR+LFDRSSEDVYSDPGSPM R
Sbjct: 477 RYGGIYWCDDSLALVYESWYKTRKIRTWVISPGSLEDNPRLLFDRSSEDVYSDPGSPMQR 536

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RTPLGTYVIAKLKK+NY+GTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKS+KETYY
Sbjct: 537 RTPLGTYVIAKLKKENYDGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSDKETYY 596

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           ESV+ALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGK ASQIT FPHPYPQLAS
Sbjct: 597 ESVLALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKTASQITNFPHPYPQLAS 656

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           LQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE
Sbjct: 657 LQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 716

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAV+EVIKRGVA
Sbjct: 717 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVQEVIKRGVA 776

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT+T
Sbjct: 777 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATST 836

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG
Sbjct: 837 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 896

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YSSRESIMHVLWETDRWLEKY SSNASDL QD DK KEEGN  ADSAGKVVAGSGGGGTE
Sbjct: 897 YSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKEEGNAAADSAGKVVAGSGGGGTE 956

Query: 901 SPSPDNDGFYSIQRSLL 917
           S SPDNDGFYSIQRS L
Sbjct: 957 SSSPDNDGFYSIQRSFL 967

BLAST of HG10023023 vs. ExPASy TrEMBL
Match: A0A5A7TZ84 (Putative glutamyl endopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold427G00640 PE=4 SV=1)

HSP 1 Score: 1763.4 bits (4566), Expect = 0.0e+00
Identity = 863/918 (94.01%), Positives = 884/918 (96.30%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSE      +GGAGGGGSNGSVSSSSAV STEDD   VLGVGYRL
Sbjct: 1   MASSRFRNLVHLNAIVSE------NGGAGGGGSNGSVSSSSAVASTEDD---VLGVGYRL 60

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 120

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPE EVHGLPDGAKINF+TWSPDGRHL+F+VR+DEE GSSG
Sbjct: 121 RISFYTGIGIHQLMPDDSLGPELEVHGLPDGAKINFVTWSPDGRHLAFTVRIDEEGGSSG 180

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVF NFVWVNDSTLLVCTIPSSRGDPPKKPLVP 
Sbjct: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFNNFVWVNDSTLLVCTIPSSRGDPPKKPLVPR 240

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGTPAIYTS 300
           GPKVQSNEQKNIIQART+QDLLKD YDEDLFDYYAT+QLVLGSL DGTV+ FG PA+YTS
Sbjct: 241 GPKVQSNEQKNIIQARTYQDLLKDTYDEDLFDYYATSQLVLGSLEDGTVKEFGPPAVYTS 300

Query: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNS 360
           LDPSPDHKYILISTIHRPYSFIVPCGRFP +V VWTTDG FVRELCDLPLAEDIPIAFNS
Sbjct: 301 LDPSPDHKYILISTIHRPYSFIVPCGRFPNRVDVWTTDGNFVRELCDLPLAEDIPIAFNS 360

Query: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420
           VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL
Sbjct: 361 VRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDL 420

Query: 421 RYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLR 480
           RYGGI WCDDSLALVYESWYKTRKIRTWVISPGS EDNPR+LFDRSSEDVYSDPGSPM R
Sbjct: 421 RYGGIYWCDDSLALVYESWYKTRKIRTWVISPGSLEDNPRLLFDRSSEDVYSDPGSPMQR 480

Query: 481 RTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYY 540
           RTPLGTYVIAKLKK+NY+GTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKS+KETYY
Sbjct: 481 RTPLGTYVIAKLKKENYDGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSDKETYY 540

Query: 541 ESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLAS 600
           ESV+ALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGK ASQIT FPHPYPQLAS
Sbjct: 541 ESVLALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKTASQITNFPHPYPQLAS 600

Query: 601 LQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660
           LQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE
Sbjct: 601 LQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNE 660

Query: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVA 720
           FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAV+EVIKRGVA
Sbjct: 661 FASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVQEVIKRGVA 720

Query: 721 HPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNT 780
           HP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT+T
Sbjct: 721 HPDKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATST 780

Query: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840
           YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG
Sbjct: 781 YVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHG 840

Query: 841 YSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTE 900
           YSSRESIMHVLWETDRWLEKY SSNASDL QD DK KEEGN  ADSAGKVVAGSGGGGTE
Sbjct: 841 YSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKEEGNAAADSAGKVVAGSGGGGTE 900

Query: 901 SPSPDNDGFYSIQRSLLW 918
           S SPDNDGFYSIQRS LW
Sbjct: 901 SSSPDNDGFYSIQRSSLW 909

BLAST of HG10023023 vs. ExPASy TrEMBL
Match: A0A0A0K5T5 (Peptidase_S9 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G407680 PE=4 SV=1)

HSP 1 Score: 1750.7 bits (4533), Expect = 0.0e+00
Identity = 853/919 (92.82%), Positives = 887/919 (96.52%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLVHLNAIVSEDGG     G GGGGSNGSVSSSSAV ST DDE+SVLGVGYRL
Sbjct: 56  MASSRFRNLVHLNAIVSEDGG----SGGGGGGSNGSVSSSSAVASTVDDEDSVLGVGYRL 115

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PPAEIRDIVDAPPLP+LSFSPYRDKILFLKRRSLPP++ELAKPEEKLAGIRIDGQCNCRS
Sbjct: 116 PPAEIRDIVDAPPLPLLSFSPYRDKILFLKRRSLPPLAELAKPEEKLAGIRIDGQCNCRS 175

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           R+SFYTGIGIHQLMPDDSLGPEKEV GLP+GAKINF+TWSPDGRHL+F+VRVDE+DGSS 
Sbjct: 176 RISFYTGIGIHQLMPDDSLGPEKEVRGLPNGAKINFVTWSPDGRHLAFTVRVDEDDGSSS 235

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETG+ARPLFQNTDIYVNAVF+NFVWVNDSTLLVCTIP SRGDPPKKPLVP 
Sbjct: 236 KLRVWVADVETGEARPLFQNTDIYVNAVFDNFVWVNDSTLLVCTIPFSRGDPPKKPLVPP 295

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSL-DGTVEAFGT--PAIY 300
           GPKVQSNEQKNIIQART+QDLLKD+YD+DLFDYYAT+QLVLGSL DGTV+ FGT  PA+Y
Sbjct: 296 GPKVQSNEQKNIIQARTYQDLLKDEYDKDLFDYYATSQLVLGSLEDGTVKEFGTSPPAVY 355

Query: 301 TSLDPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAF 360
           TSLDPSPDHKYILISTIHRPYSFIVPCGRFP +VAVWTTDGKFVR+LCDLPLAEDIPIAF
Sbjct: 356 TSLDPSPDHKYILISTIHRPYSFIVPCGRFPNRVAVWTTDGKFVRDLCDLPLAEDIPIAF 415

Query: 361 NSVRKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKL 420
           NSVRKG RSINWRADKPSTLYWVETQDGGDAR+EVSPRDIVYT+SAEPLESEQPEILHKL
Sbjct: 416 NSVRKGKRSINWRADKPSTLYWVETQDGGDARVEVSPRDIVYTESAEPLESEQPEILHKL 475

Query: 421 DLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPM 480
           DLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDN R+LFDRSSEDVYSDPGSPM
Sbjct: 476 DLRYGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNARLLFDRSSEDVYSDPGSPM 535

Query: 481 LRRTPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKET 540
           +RRTP GTYVIAKLKK+NY+GTYVLLNG GATPEGNIPFIDLFDINTGSKERIWKS++ET
Sbjct: 536 VRRTPFGTYVIAKLKKENYDGTYVLLNGRGATPEGNIPFIDLFDINTGSKERIWKSDRET 595

Query: 541 YYESVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQL 600
           YYESVVALMSDQKEGDLNI+ELKFLTSKESKTENTQYYILRWPGK ASQITKFPHPYPQL
Sbjct: 596 YYESVVALMSDQKEGDLNINELKFLTSKESKTENTQYYILRWPGKTASQITKFPHPYPQL 655

Query: 601 ASLQKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSP 660
           ASLQKEM+RYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSP
Sbjct: 656 ASLQKEMIRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSP 715

Query: 661 NEFASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRG 720
           NEFA IGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLV SAEAAV+EVIKRG
Sbjct: 716 NEFAGIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVGSAEAAVQEVIKRG 775

Query: 721 VAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT 780
           VAHP+KIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT
Sbjct: 776 VAHPSKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT 835

Query: 781 NTYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFES 840
           +TYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFES
Sbjct: 836 STYVEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFES 895

Query: 841 HGYSSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGG 900
           HGYSSRESIMHVLWETDRWLEKY SSNASDL QD DK K+EGNG ADSAGKVVAGSGGG 
Sbjct: 896 HGYSSRESIMHVLWETDRWLEKYCSSNASDLGQDGDKNKQEGNGAADSAGKVVAGSGGGD 955

Query: 901 TESPSPDNDGFYSIQRSLL 917
           TES SPDNDGFYSIQRS L
Sbjct: 956 TESSSPDNDGFYSIQRSFL 970

BLAST of HG10023023 vs. ExPASy TrEMBL
Match: A0A6J1CY23 (probable glutamyl endopeptidase, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111015856 PE=4 SV=1)

HSP 1 Score: 1744.9 bits (4518), Expect = 0.0e+00
Identity = 844/916 (92.14%), Positives = 886/916 (96.72%), Query Frame = 0

Query: 1   MASSRFRNLVHLNAIVSEDGGGGASGGAGGGGSNGSVSSSSAVVSTEDDENSVLGVGYRL 60
           MASSRFRNLV LNAIVSEDGGG      GGGGSNGSVSSSSA V TEDDE+ VLGVGYRL
Sbjct: 59  MASSRFRNLVPLNAIVSEDGGG------GGGGSNGSVSSSSASVPTEDDESLVLGVGYRL 118

Query: 61  PPAEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRS 120
           PP+EIRDIVDAPPLPILSFSPYRDKILFLKRRSLPP+SELAKPEEKLAGIRIDGQCNCRS
Sbjct: 119 PPSEIRDIVDAPPLPILSFSPYRDKILFLKRRSLPPLSELAKPEEKLAGIRIDGQCNCRS 178

Query: 121 RMSFYTGIGIHQLMPDDSLGPEKEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 180
           RMSFYTGIGIHQLMPDDSLGPEKEV+GLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG
Sbjct: 179 RMSFYTGIGIHQLMPDDSLGPEKEVYGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSG 238

Query: 181 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPH 240
           KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWV+DSTLLVCTIPSSRGDPP+KPLVP+
Sbjct: 239 KLRVWVADVETGKARPLFQNTDIYVNAVFENFVWVDDSTLLVCTIPSSRGDPPRKPLVPY 298

Query: 241 GPKVQSNEQKNIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSL 300
           GPK+QSNEQK IIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTV+ FGTPAIYTSL
Sbjct: 299 GPKIQSNEQKTIIQARTFQDLLKDKYDEDLFDYYATTQLVLGSLDGTVKEFGTPAIYTSL 358

Query: 301 DPSPDHKYILISTIHRPYSFIVPCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNSV 360
           DPSPDH+++L+++IHRPYSFIVPCGRFPK+VAVWTT+GKFVRELCDLPLAEDIPIAFNSV
Sbjct: 359 DPSPDHRHLLVTSIHRPYSFIVPCGRFPKRVAVWTTNGKFVRELCDLPLAEDIPIAFNSV 418

Query: 361 RKGMRSINWRADKPSTLYWVETQDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDLR 420
           RKG+RS++WRADKPSTLYWVETQD GDARIEVSPRDIVYTQSAEP E EQPEILHKLDLR
Sbjct: 419 RKGIRSVSWRADKPSTLYWVETQDDGDARIEVSPRDIVYTQSAEPPEGEQPEILHKLDLR 478

Query: 421 YGGISWCDDSLALVYESWYKTRKIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLRR 480
           YGG+SWCDDSLALVYESWYKTRKIRTWVISPGSK+D PR+LFDRSSEDVYSDPGSPM RR
Sbjct: 479 YGGVSWCDDSLALVYESWYKTRKIRTWVISPGSKDDTPRVLFDRSSEDVYSDPGSPMQRR 538

Query: 481 TPLGTYVIAKLKKDNYEGTYVLLNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYE 540
           TPLGTY+IAKL+K+N EGTYVLLNGSGATPEGNIPFIDLFDI TGSKERIWKS+KETYYE
Sbjct: 539 TPLGTYIIAKLRKENDEGTYVLLNGSGATPEGNIPFIDLFDIKTGSKERIWKSDKETYYE 598

Query: 541 SVVALMSDQKEGDLNIDELKFLTSKESKTENTQYYILRWPGKKASQITKFPHPYPQLASL 600
           SVVALMSD+KEGDLNID+LKFL SKESKTENTQYYILRWP KKA+QITKFPHPYPQLASL
Sbjct: 599 SVVALMSDEKEGDLNIDQLKFLVSKESKTENTQYYILRWPDKKATQITKFPHPYPQLASL 658

Query: 601 QKEMVRYERKDGVQLTATLYLPPNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEF 660
           QKEM+RYERKDGVQLTATLYLPPNYDPAK+GPLPCLIWSYPGEFKSKDAAGQVRGSPNEF
Sbjct: 659 QKEMIRYERKDGVQLTATLYLPPNYDPAKEGPLPCLIWSYPGEFKSKDAAGQVRGSPNEF 718

Query: 661 ASIGPTSALLWLARRFAILAGPTIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAH 720
           ASIGPTSALLWLA RFAILAGPTIPIIGEG+EEANDRYVEQLVASA+AAVEEVI+RGVAH
Sbjct: 719 ASIGPTSALLWLACRFAILAGPTIPIIGEGDEEANDRYVEQLVASAQAAVEEVIRRGVAH 778

Query: 721 PNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTY 780
           PNKIA+GGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEAT TY
Sbjct: 779 PNKIAIGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATKTY 838

Query: 781 VEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGY 840
           VEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGY
Sbjct: 839 VEMSPFISANKIKKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGY 898

Query: 841 SSRESIMHVLWETDRWLEKYGSSNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTES 900
           +SRESIMHVLWETDRWL+KY SSN+SD+ QDVDK KEEGNG ADS GKVV+GSGGGGTES
Sbjct: 899 TSRESIMHVLWETDRWLQKYCSSNSSDVGQDVDKSKEEGNGAADSKGKVVSGSGGGGTES 958

Query: 901 PSPDNDGFYSIQRSLL 917
            + DNDGFYSIQRSLL
Sbjct: 959 SNRDNDGFYSIQRSLL 968

BLAST of HG10023023 vs. TAIR 10
Match: AT2G47390.1 (Prolyl oligopeptidase family protein)

HSP 1 Score: 1463.0 bits (3786), Expect = 0.0e+00
Identity = 707/894 (79.08%), Positives = 795/894 (88.93%), Query Frame = 0

Query: 25  SGGA--GGGGSNGSVSSSSAVVSTEDDENSVLGVGYRLPPAEIRDIVDAPPLPILSFSPY 84
           SGGA  GGG SNGS+S+S+   +TEDDE ++ G GYRLPP EIRDIVDAPP+P LSFSP+
Sbjct: 78  SGGAEDGGGTSNGSLSASA--TATEDDELAI-GTGYRLPPPEIRDIVDAPPVPALSFSPH 137

Query: 85  RDKILFLKRRSLPPISELAKPEEKLAGIRIDGQCNCRSRMSFYTGIGIHQLMPDDSLGPE 144
           RDKILFLKRR+LPP+++LA+PEEKLAG+RIDG CN RSRMSFYTG+GIHQL+PD +L PE
Sbjct: 138 RDKILFLKRRALPPLADLARPEEKLAGVRIDGYCNTRSRMSFYTGLGIHQLLPDGTLSPE 197

Query: 145 KEVHGLPDGAKINFITWSPDGRHLSFSVRVDEEDGSSGKLRVWVADVETGKARPLFQNTD 204
           KE+ G+PDG KINF+TWS DG+HL+FS+RVD E+G+S K  VWVADVETG ARPLF + D
Sbjct: 198 KEITGIPDGGKINFVTWSNDGKHLAFSIRVD-ENGNSSKPVVWVADVETGVARPLFNSQD 257

Query: 205 IYVNAVFENFVWVNDSTLLVCTIPSSRGDPPKKPLVPHGPKVQSNEQKNIIQARTFQDLL 264
           I++NA+FE+FVW+++STLLV TIPSSRG+PPKKPLVP GPK  SNE K ++Q RTFQDLL
Sbjct: 258 IFLNAIFESFVWIDNSTLLVSTIPSSRGEPPKKPLVPSGPKTLSNETKTVVQVRTFQDLL 317

Query: 265 KDKYDEDLFDYYATTQLVLGSLDGTVEAFGTPAIYTSLDPSPDHKYILISTIHRPYSFIV 324
           KD+YD DLFDYYA++QLVL SLDGTV+  G PA+YTSLDPS DHKY+L+S++HRPYSFIV
Sbjct: 318 KDEYDADLFDYYASSQLVLASLDGTVKEVGVPAVYTSLDPSTDHKYLLVSSLHRPYSFIV 377

Query: 325 PCGRFPKKVAVWTTDGKFVRELCDLPLAEDIPIAFNSVRKGMRSINWRADKPSTLYWVET 384
           PCGRFPKKV VWTTDG+FVR+LCDLPLAEDIPIA NSVRKGMRSINWRADKPSTLYW ET
Sbjct: 378 PCGRFPKKVEVWTTDGRFVRQLCDLPLAEDIPIASNSVRKGMRSINWRADKPSTLYWAET 437

Query: 385 QDGGDARIEVSPRDIVYTQSAEPLESEQPEILHKLDLRYGGISWCDDSLALVYESWYKTR 444
           QDGGDA++EVSPRDIVY QSAEPL  E+PE+LHKLDLRYGGISWCDD+LALVYESWYKTR
Sbjct: 438 QDGGDAKMEVSPRDIVYMQSAEPLAGEEPEVLHKLDLRYGGISWCDDTLALVYESWYKTR 497

Query: 445 KIRTWVISPGSKEDNPRILFDRSSEDVYSDPGSPMLRRTPLGTYVIAKLKKDNYEGTYVL 504
           + RTWVISPGS + +PRILFDRSSEDVYSDPGS MLRRT  GTYVIAK+KK+N EGTYVL
Sbjct: 498 RTRTWVISPGSNDVSPRILFDRSSEDVYSDPGSTMLRRTDAGTYVIAKIKKENDEGTYVL 557

Query: 505 LNGSGATPEGNIPFIDLFDINTGSKERIWKSNKETYYESVVALMSDQKEGDLNIDELKFL 564
           LNGSGATP+GN+PF+DLFDINTG+KERIW+S+KE Y+E+VVALMSDQKEGDL ++ELK L
Sbjct: 558 LNGSGATPQGNVPFLDLFDINTGNKERIWESDKEKYFETVVALMSDQKEGDLKMEELKIL 617

Query: 565 TSKESKTENTQYYILRWPGKKASQITKFPHPYPQLASLQKEMVRYERKDGVQLTATLYLP 624
           TSKESKTENTQY +  WP +K  QIT FPHPYPQLASLQKEM+RY+RKDGVQLTATLYLP
Sbjct: 618 TSKESKTENTQYSLQLWPDRKVQQITNFPHPYPQLASLQKEMIRYQRKDGVQLTATLYLP 677

Query: 625 PNYDPAKDGPLPCLIWSYPGEFKSKDAAGQVRGSPNEFASIGPTSALLWLARRFAILAGP 684
           P YDP+KDGPLPCL WSYPGEFKSKDAAGQVRGSPNEFA IG TSALLWLARRFAIL+GP
Sbjct: 678 PGYDPSKDGPLPCLFWSYPGEFKSKDAAGQVRGSPNEFAGIGSTSALLWLARRFAILSGP 737

Query: 685 TIPIIGEGNEEANDRYVEQLVASAEAAVEEVIKRGVAHPNKIAVGGHSYGAFMTANLLAH 744
           TIPIIGEG+EEANDRYVEQLVASAEAAVEEV++RGVA  +KIAVGGHSYGAFMTANLLAH
Sbjct: 738 TIPIIGEGDEEANDRYVEQLVASAEAAVEEVVRRGVADRSKIAVGGHSYGAFMTANLLAH 797

Query: 745 APHLFCCGIARSGAYNRTLTPFGFQNEDRTLWEATNTYVEMSPFISANKIKKPILLIHGE 804
           APHLF CGIARSGAYNRTLTPFGFQNEDRTLWEATN YVEMSPF+SANKIKKPILLIHGE
Sbjct: 798 APHLFACGIARSGAYNRTLTPFGFQNEDRTLWEATNVYVEMSPFMSANKIKKPILLIHGE 857

Query: 805 EDNNPGTLPMQSDRFFNALKGHGALCRLVVLPFESHGYSSRESIMHVLWETDRWLEKYGS 864
           EDNNPGTL MQSDRFFNALKGHGALCRLVVLP ESHGYS+RESIMHVLWETDRWL+KY  
Sbjct: 858 EDNNPGTLTMQSDRFFNALKGHGALCRLVVLPHESHGYSARESIMHVLWETDRWLQKYCV 917

Query: 865 SNASDLSQDVDKIKEEGNGTADSAGKVVAGSGGGGTESPSPDNDGFYSIQRSLL 917
            N SD     D+ KE     +DSA KV  G+GGG  E    +++    ++RSLL
Sbjct: 918 PNTSDADTSPDQSKE----GSDSADKVSTGTGGGNPE--FGEHEVHSKLRRSLL 961

BLAST of HG10023023 vs. TAIR 10
Match: AT5G24260.1 (prolyl oligopeptidase family protein)

HSP 1 Score: 51.6 bits (122), Expect = 3.9e-06
Identity = 36/145 (24.83%), Positives = 67/145 (46.21%), Query Frame = 0

Query: 713 VIKRGVAHPNKIAVGGHSYGAFMTANLLAHAPHLFCCGIARSGAYNRTLTPFGFQNEDRT 772
           +I++G+A P+ I V G SYG +++A LL   P +F C ++ +   +       +  +   
Sbjct: 598 LIEQGLAKPDHIGVYGWSYGGYLSATLLTRYPEIFNCAVSGAPVTSWDGYDSFYTEKYMG 657

Query: 773 LWEATNTYVEMSPFISANKI--KKPILLIHGEEDNNPGTLPMQSDRFFNALKGHGALCRL 832
           L      Y++ S       +  K+ ++L+HG  D N       + R  NAL   G    L
Sbjct: 658 LPTEEERYLKSSVMHHVGNLTDKQKLMLVHGMIDENVHF--RHTARLVNALVEAGKRYEL 717

Query: 833 VVLPFESHGYSSRESIMHV---LWE 853
           ++ P E H    ++  +++   +WE
Sbjct: 718 LIFPDERHMPRKKKDRIYMEQRIWE 740
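
The percentages on each HSP summary line follow directly from the count/length pairs next to them (for example, 707 identical positions over an alignment length of 894). As a minimal sketch, assuming the summary line keeps the layout shown above, the helper below recomputes both percentages so they can be checked against the reported values. The function name `hsp_percentages` is hypothetical and not part of any BLAST tool.

```python
import re

# Matches the count/length pairs on a BLAST HSP summary line.
# "Posi?tives" also accepts the "Postives" spelling that appears in some
# report exports (assumption: no other label variants occur).
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity %, positives %), each rounded to two decimals."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )
```

Applied to the two TAIR 10 hits above, the function returns (79.08, 88.93) for AT2G47390.1 and (24.83, 46.21) for AT5G24260.1, matching the reported figures.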

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038898053.1 | 0.0e+00 | 97.38 | probable glutamyl endopeptidase, chloroplastic [Benincasa hispida]
TYK18231.1 | 0.0e+00 | 94.23 | putative glutamyl endopeptidase [Cucumis melo var. makuwa]
XP_008451481.1 | 0.0e+00 | 94.22 | PREDICTED: probable glutamyl endopeptidase, chloroplastic [Cucumis melo]
KAA0046695.1 | 0.0e+00 | 94.01 | putative glutamyl endopeptidase [Cucumis melo var. makuwa]
XP_004135992.1 | 0.0e+00 | 92.82 | probable glutamyl endopeptidase, chloroplastic [Cucumis sativus] >KGN45015.1 hyp...

Match Name | E-value | Identity | Description
Q8VZF3 | 0.0e+00 | 78.97 | Probable glutamyl endopeptidase, chloroplastic OS=Arabidopsis thaliana OX=3702 G...
Q10MJ1 | 0.0e+00 | 75.25 | Probable glutamyl endopeptidase, chloroplastic OS=Oryza sativa subsp. japonica O...
P34422 | 7.1e-13 | 24.36 | Dipeptidyl peptidase family member 6 OS=Caenorhabditis elegans OX=6239 GN=dpf-6 ...
C3J8X2 | 1.2e-12 | 23.33 | Dipeptidyl-peptidase 5 OS=Porphyromonas endodontalis (strain ATCC 35406 / BCRC 1...
V5YMB3 | 1.0e-11 | 24.20 | Dipeptidyl aminopeptidase BIII OS=Pseudoxanthomonas mexicana OX=128785 GN=dapb3 ...

Match Name | E-value | Identity | Description
A0A5D3D1V4 | 0.0e+00 | 94.23 | Putative glutamyl endopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3BRJ9 | 0.0e+00 | 94.22 | probable glutamyl endopeptidase, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103...
A0A5A7TZ84 | 0.0e+00 | 94.01 | Putative glutamyl endopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_...
A0A0A0K5T5 | 0.0e+00 | 92.82 | Peptidase_S9 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G40768...
A0A6J1CY23 | 0.0e+00 | 92.14 | probable glutamyl endopeptidase, chloroplastic OS=Momordica charantia OX=3673 GN...

Match Name | E-value | Identity | Description
AT2G47390.1 | 0.0e+00 | 79.08 | Prolyl oligopeptidase family protein
AT5G24260.1 | 3.9e-06 | 24.83 | prolyl oligopeptidase family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR029058 | Alpha/Beta hydrolase fold | GENE3D | 3.40.50.1820 | alpha/beta hydrolase | coord: 594..860; e-value: 7.0E-46; score: 158.7
IPR029058 | Alpha/Beta hydrolase fold | SUPERFAMILY | 53474 | alpha/beta-Hydrolases | coord: 602..860
IPR011042 | Six-bladed beta-propeller, TolB-like | GENE3D | 2.120.10.30 | - | coord: 123..239; e-value: 4.6E-7; score: 31.4
IPR001375 | Peptidase S9, prolyl oligopeptidase, catalytic domain | PFAM | PF00326 | Peptidase_S9 | coord: 707..860; e-value: 3.8E-21; score: 75.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 21..48
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 31..48
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 874..909
None | No IPR available | PANTHER | PTHR42776 | SERINE PEPTIDASE S9 FAMILY MEMBER | coord: 2..881
None | No IPR available | PANTHER | PTHR42776:SF19 | GLUTAMYL ENDOPEPTIDASE, CHLOROPLASTIC-RELATED | coord: 2..881
None | No IPR available | SUPERFAMILY | 82171 | DPP6 N-terminal domain-like | coord: 150..535
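
The alignment coordinates reported for the InterPro hits can be converted into span lengths with simple arithmetic. The sketch below assumes the `coord: start..end` values are 1-based, inclusive amino-acid positions on the predicted protein. That convention is an assumption and is not stated on this page. The helper name `coord_span` is hypothetical.

```python
import re

# Matches an alignment coordinate such as "coord: 594..860".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for a 'coord: start..end' string.

    Assumes 1-based inclusive positions, so length = end - start + 1.
    """
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError("no 'coord: start..end' field found")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end position precedes start position")
    return start, end, end - start + 1
```

Under that assumption, the GENE3D alpha/beta hydrolase hit (594..860) spans 267 residues, the Pfam Peptidase_S9 hit (707..860) spans 154, and the SUPERFAMILY DPP6 N-terminal domain-like hit (150..535) spans 386.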

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10023023.1 | HG10023023.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0004252 | serine-type endopeptidase activity
molecular_function | GO:0008236 | serine-type peptidase activity