HG10014100 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10014100
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Peptidase_S9 domain-containing protein
Location: Chr02: 7574207 .. 7580379 (+)
RNA-Seq Expression: HG10014100
Synteny: HG10014100
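The Location field above can be turned into a genomic span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page) and that the field always follows the `chromosome: start .. end (strand)` layout. The names `parse_location` and `span_length` are illustrative only and do not belong to any database API.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr02: 7574207 .. 7580379 (+)'.

    Returns (chromosome, start, end, strand) with integer coordinates.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr02: 7574207 .. 7580379 (+)")
print(chrom, strand, span_length(start, end))
```

If the coordinates were instead 0-based and half-open (as in BED files), the `+ 1` in `span_length` would have to be dropped.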
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCTTAATATTACAATTTGCGATATGCTCTTACATCAAACAGTGTAAAACGGGAAGCAACTCAGCCACATCAAATGATTTTTCAAGAGGTCCAGCGTTAAATCTTATTCTTTATGCATGCGCGTATGCTCAGCCGGGCAACAAGCTCGCTTCAATGAGTCTCTGTGCTCTATTAGGACTTGTTCGCTTTTCTGCTCCATCCTCTTTTCTTATTTCCAATTCCAACGCCTTAAATAGAGTTTTCATCAACCGAGTCTCCACTGGAAGGAAGTTTCGGAGCTACAACACCATGGCTTCATCCATGTCTTCTTCGCCTAACATCAACAAACACGTCTCAGAAGTAGCAGAGCAGGAGCAGCTCCCCAAAATCACTGCGCCGTACGGCTCCTGGAAGTCCCCAATCACTGCTGACGTTGTTACTGGTGCGTCCAAGCGACTTGGTGGTACTGCTGTCGACGGCAATGGACACCTTATCTGGCTCGAATCACGGCCCACCGAATCAGGGTAATGATCTCGATTCATTCCCTCTGTAGTGATAGAAATGGTGAATCTAATTTGGCGGGTTGTTTTCTACTTCGGATTTAGCTCGAAACATTTTGTTGCAATTAATTTGAGATATTCGCTTCGTGTTATTCCGTTTAAAATGGTTCAGGCGGGGAGTGCTTGTTAAGGAATCGGAAAATCCAGGGAACGAGCCTAGTGATATTACTCCAAAGGACTTTTCAGTGCGGAACACGACGCAGGAATACGGCGGCGGTGCATTCACGGTGGCTGGAGACATCGTTGTCTTCTCGAATTACAACGACCAAAGACTTCACAAGCAATCTTTAAATTCAGGTGAACCAACAATCGATTATTCTAGGATTTTATATGTTTAAAATGGAGTGGATGTACCTAGACTTGGTCCTTTTGATTGCTTACTGCCCTGTAAATAAGAATGTGAAGGACTGATTAGTTAAGCTTTCACGCAAGCAGATTCGCCTCCGCAAGCACTAACTCCCGATTACGGTGGACGATCAGTTAGTTATGCAGATGGGGTATTTGATTCTCGTTTTAATCGTTTTATTACCGTCCAGGAAGGTACATTTTTTCCATAGTGTTAAGAATCCATGAATTATGTGTTATCATCATCTTTATCATTGAACTTTTTTATTTAGACGACTTGTATAGTCATGCTCAATTTATTTCGTTTTGGTAGATGGACGTCAAAGTAGCTTGAATCCAATCACCACAATTGTGTCAGTGGAACTTGACGGCAAGGTTATTAATGGTATGTATATTTTCTCTTGCGCGTACAAAATTTGTTGAAAGTTCATACGTAATACGTTGTAGTTGCTGAGGAATTCAAATTATTGACTTCATAATTAGTAACGTAATTTGTGCTAATTTTGTGTATGACTTTCAGAACCGAAGGTCTTAGTTGGAGGAAATGATTTCTATGCCTTCCCACGAGTGGATCCCAAAGGGGAACGGATTGCATGGATTGAATGGGGTCATCCTAACATGCCTTGGGATAAATCTGAGCTCTGGGTTGGCTACCTTTCTGAGAATGGGTTAGTTGAACCATCCTCCTTTGTTGGTTGATCTTACTTTCTGAGAATGGCTACCTTTCTGTTTCCTGTAATATATATCTTCACATCTTAATCTTGTTATTAATTCTAATGTCTGTTGGAATGCATTAAAATTCAGCTAGAAATAAATCTGATCTGTTTGTAATAATAGTTGTCTTTATAGAAGATTTTTATCTCTCTGCCACTCTGAATTGTACATTATTTTTATACCTAATATGGTATAAAACTAAAGTTCTGGATTATGATCTCGATGCTTTTGATGGTCTTGCAGAGAGGTCTACAAACGAGTCTGTGTTGCTGGTGGTGATCCAAAGCTTGTTGAATCTCCTACTGAACCGAAGTGGTCTGCTCAGGGTATGTACATAGCTTTTGAAATGATTAAAGAACCTGTAGTCTTTTGGCGTTGCCCGAGATTGCAGGAGGTAGA
TTTTTTAAAAAAGGCTGTTGATTACTCGGATTGGTTCTAGCATCCCTGTGCCCAAAAGGATGCGCTACAATGTGTAAAACTTCTATGTGAGGCCAACTTAGCGTCATGCATGGTTGATGATGCCATTGTCTTCTGCCTAGATTTTTTCCTGCCTTGGAAGGGCTGTGAATCTTTTACTAATGATTTAATTTTATAGGGTTGTGATTGGATTTCTTAAACCGGCTATCATATTCATGGCTGAACGTAATGGCTGAGGCTATCTTGTTTTTATTTATTTATTTTGGGAATGTGCAGGAGAACTATTCTTTATTACAGATAGACAAAGTGGGTTTTGGAATCTCTATAAATGGGTAAAGTTTTCTTTTTACATTTTCTATTACCTGTATGTTGTTTCAATTGCAGAACTATAGGCATCCGTACTGTAGTTTGTTTTTGAAATCAAATAGGCAAAATGATGGTTCTTTTTATTTCTTAATTTCTGTTTGCTTAAAGTTGATTAATCGTGCATGCATTAATAAATAGATTCTTGTTAATTCATATATTAAATTTCCTAGAGAGTAGAGATTCAACCTTAGGTTTTTGAATTGTGGATTGCAGTATCTACCTGTATTTTGTTACTCTTATTCCATATTATCTGAAAAGGTAACATGTTTAGAAGGTAATTGCGTAAGAATATATATTTTTCTTTGTATTTTAAGGTATGAAGGGGGGCTAAAATATTTGTTTCATTCATGGTTAAATCTTCATAGGATTCAACGAAATTTGGTTGTTAGAAAATGTAAAAGATGTTAATACCGTAGGTATGTGGTCAACTTGGTTATTCCCTTGCAAGCACTTTGTGGTTGGCTACCAAATACTTACCTCCGTATCTTTATGCTTTAACATACAAAAGGAATGCTTAACTGAAGGTGATATCTGGTTCATGGTCAAGATGAACCCCTCATTTTATGGTATGCACTGAGAGGATCTCAGTGTGTTCTCTAACAACTACAAAACTGGAAAAGGGGATGTAAGGATAAATGTGATGACCTCTAAGTGAATACACTTTTTTAGGGTGGATTCTTTAATCATCTCGTGCAGAGTTAATTTTGAAATTGCTCAGGCAAACAAATAATGGAAAGAAAAAACTAGGTTTGGTAACCATTCTATTATTAGATCATTTGATATGGGTTTTTTCACTACTATAGCTTTTCCTCCCTCCTCTTCAATTGGAGAGGGTTTTTGGCACTTTAAAACGTTCATTTGATACGGGCCTCATTTGCTTTAACTAACTACCTAGTTCCATGATATTACTAATGCCAATTGGAAATAATGGCACTTCTAACTCTTGCTTATTATGGAAACAGTTTGAGGGTAACAATGAGGTGGCTCCAATATATTCTTTAAACACTGAGTTTTCCCGACCCTTATGGGTTTTCGGGACAAACTCTTATGAATTCTTAAGGATCGGCGTTGGGAGAAACATCATAGTCTGCAGCTACAGGTGATTGAAGAAGTTCCTATTTTGTATTCTTCGTGCAACATTTTTTCCTCTTATGAATTCAACTGCTTTTCCTGCCATGGTTCCTGCAGACAGCGTGGGCGATCATATCTTGGAGTTCTGGATGAGGCGCAAAGCTCACTATCCTTGCTTGATATCCCATTCACGGATATTGAAAATATTGTATACTTTGAGATCTTTTTATTTTCTGTTTCGCAGATGGCTCCTTCTTTTTTTTTTTTTTTTTTTTTTTTCCAGTGATATTCTCTTATATTAGATGAACACGTTTATGTTTCAGGCTCTGGGAAGTCAATGTATATATGTGGAAGGATCATCGGCACTTCATCCATCATCTATTGCCAAGGTTTTTTCTTCTTTTTTTATTTATTCCAGCCGTATGATGTCTTCTATTTTCTTTGCACTGCTTTTTGCTTTCTATATTGCTAAAATTTCAACATTTGACAATGCCACTTTTCCCATGCAGGTGACCTTGAATGAAAGAACCTTGGAAGTAGTAGGT
TTCTCTATTATCTGGTCATCTTCGCCAGATATTTTGAAATTTAAGTCGTACTTCAGCCTTCCTGAGTTCATTGAATTTCCAACTGAAGTTCCAGGCCAAAAGGCTTATGCTTACTTTTATCCACCGTCCAATCCTATATACCAGGCTAGTCAGGATGAAAAGCCTCCACTGTTGTTGAAAAGCCATGGTATTCAATTACGTTGTCGCGGTCATAAACAGCATAGTTAGGGGGAAATAATTCAAACCATTATCTTGAAATTTTTTAAGACGTTTTGCCTATCAAGTTGGCCTTAATAAAGTGCTGCAAAGTTAGAAATAATTCTTACTGGCATTTATTTCTAGTTATGCCATAGCTCCTGCAAAAATAGATATTTGATAGATATTGAACTACAGGAGGGCCAACTGCTGAAACACGTGGAAATTTAAATCCTAGCATTCAATATTGGACTAGTCGAGGCTGGGGTTATGTTGATGTCAATTATGGTGGTAGCACTGGTATGTTTGCAACCATTTCATGGCTGAATTTCTGTGTATGTACTTGGCTGAGTTGCCTGATACTTGCCAATTAGCTTCTCTATTGAGGAGTATTTATCATAGTTCACATAGTTATTCACTAAGGTATTTATAATTTGGCTATAAAGGTTATGGGAGAGAGTACCGAGAAAGGCTTTTGAGGCAGTGGGGAATTGTTGATGTCAATGACTGCTGCAGTTGTGCAAGATTTTTGGTGGCACCCTCTACAATCTTGCTTTCTTTAGGTTTTCATGATTAATCATTGATACACCGTTCTAACCTTGAGTAATATTAATCCATATTGTTTTCTCCAGGTGGACTCTGGAAAGGTTGATGGAGAACGGTTATGCATCACAGGGGGCTCTGCTGGGGGATATACCACCTTAGCTGCTCTTGCTTTTAGAGATACGTTTAAGGCAGGAGCTTCCTTGTATGGGGTGAGCGTCTAAACCTATGCTAATATGGTCTCTCATAGTCATCCAAGTGACGAGCATAGCTCTCAAACCAAAATTTATGATATAAATAATGCTCATTTTATTTGTTGTCGAACAGTGTTCTGTTATACTGCTTGTATTGAAATAAAAATTAAAGCCTATGGTATAGTATGATTTTAAATTTTACTTTAATCACCCATTACCAAAGCCAAGTAATGGAAGCATCTGGTACTCAGAAGTTTAGATACAACGCCTGTTGGTGATTTTATGGTCCTAGAAGAGAGATTACTACTCTTTATATTCTACTGTTGGTTGTCTACGACCCACTATTATCTTCAATTGTGTTGATGTTATAGATTTTTTCTCTAAGGATTATATGGATTCCTTGTTCATTTAGCCCTTTCCCTTTGGTCAGAGCTCTCTTCTGCTTTCTGTTTCTATATATATATATATATATATATATATATATATATTGTTTGCTAAATTTTGAAACTATATGCCGTGCAACAGTTGATTGATTGTTCCCATGATCTTTGAATAGATAGCTGACTTACGCTTGTTGAGAGCAGATACACACAAGTTTGAATCTCATTATATTGACAATCTTGTTGGTGAGTCAAAGAAACAAATAAAGAATCTTGTATTTAATTGCCTTCTCATTTCAAGAATTTGAACTCTGATCCAATTATAATGGACTGCAGGGAATGAAAAAGATTACTTCGAGAGGTCACCAATCAATTTTGTAGACAAATTTTCTTGCCCTATAATCCTATTCCAGGGATTGGAGGACAAAGTATGCCGATTAATCCATCTTTTTTCTCCTTATAATGTTGACGTAAAGCATGAAACATTATGAACTTTATCTCTTCATGGTAATGAAGGCCTCTGGTTTGATGGGTTTTACACTTTGTATTGAATTTCCAGGTTGTACTACCTAATCAAGCTCGTAAGATATATCATGCATTGAAGGATAAGGGCTTGCCAGTTGCTCTGGTTGAGTATGAAGGAGAACAACATGGTTTCCGCAAGGTACATGCTACTGTATCTTTTGAG
TGATACTTCTGAGTTTTGGCATTGAGGTATAGGAAGCTAAGTTTATGTATATAATACAGGCAGAAAATATCAAATTTACCCTGGAACAACAAATGATGTTCTTTGCACGATCAGTGGGACGTTTCCAAGTTGCAGATGATATTAACCCTATCAAAATTGATAACTTTGACTAG

mRNA sequence

ATGCTCTTAATATTACAATTTGCGATATGCTCTTACATCAAACAGTGTAAAACGGGAAGCAACTCAGCCACATCAAATGATTTTTCAAGAGGTCCAGCGTTAAATCTTATTCTTTATGCATGCGCGTATGCTCAGCCGGGCAACAAGCTCGCTTCAATGAGTCTCTGTGCTCTATTAGGACTTGTTCGCTTTTCTGCTCCATCCTCTTTTCTTATTTCCAATTCCAACGCCTTAAATAGAGTTTTCATCAACCGAGTCTCCACTGGAAGGAAGTTTCGGAGCTACAACACCATGGCTTCATCCATGTCTTCTTCGCCTAACATCAACAAACACGTCTCAGAAGTAGCAGAGCAGGAGCAGCTCCCCAAAATCACTGCGCCGTACGGCTCCTGGAAGTCCCCAATCACTGCTGACGTTGTTACTGGTGCGTCCAAGCGACTTGGTGGTACTGCTGTCGACGGCAATGGACACCTTATCTGGCTCGAATCACGGCCCACCGAATCAGGGCGGGGAGTGCTTGTTAAGGAATCGGAAAATCCAGGGAACGAGCCTAGTGATATTACTCCAAAGGACTTTTCAGTGCGGAACACGACGCAGGAATACGGCGGCGGTGCATTCACGGTGGCTGGAGACATCGTTGTCTTCTCGAATTACAACGACCAAAGACTTCACAAGCAATCTTTAAATTCAGATTCGCCTCCGCAAGCACTAACTCCCGATTACGGTGGACGATCAGTTAGTTATGCAGATGGGGTATTTGATTCTCGTTTTAATCGTTTTATTACCGTCCAGGAAGATGGACGTCAAAGTAGCTTGAATCCAATCACCACAATTGTGTCAGTGGAACTTGACGGCAAGGTTATTAATGAACCGAAGGTCTTAGTTGGAGGAAATGATTTCTATGCCTTCCCACGAGTGGATCCCAAAGGGGAACGGATTGCATGGATTGAATGGGGTCATCCTAACATGCCTTGGGATAAATCTGAGCTCTGGGTTGGCTACCTTTCTGAGAATGGAGAGGTCTACAAACGAGTCTGTGTTGCTGGTGGTGATCCAAAGCTTGTTGAATCTCCTACTGAACCGAAGTGGTCTGCTCAGGGTATGTACATAGCTTTTGAAATGATTAAAGAACCTGTAGTCTTTTGGCGTTGCCCGAGATTGCAGGAGTTTGAGGGTAACAATGAGGTGGCTCCAATATATTCTTTAAACACTGAGTTTTCCCGACCCTTATGGGTTTTCGGGACAAACTCTTATGAATTCTTAAGGATCGGCGTTGGGAGAAACATCATAGTCTGCAGCTACAGACAGCGTGGGCGATCATATCTTGGAGTTCTGGATGAGGCGCAAAGCTCACTATCCTTGCTTGATATCCCATTCACGGATATTGAAAATATTGCTCTGGGAAGTCAATGTATATATGTGGAAGGATCATCGGCACTTCATCCATCATCTATTGCCAAGGTGACCTTGAATGAAAGAACCTTGGAAGTAGTAGGTTTCTCTATTATCTGGTCATCTTCGCCAGATATTTTGAAATTTAAGTCGTACTTCAGCCTTCCTGAGTTCATTGAATTTCCAACTGAAGTTCCAGGCCAAAAGGCTTATGCTTACTTTTATCCACCGTCCAATCCTATATACCAGGCTAGTCAGGATGAAAAGCCTCCACTGTTGTTGAAAAGCCATGGAGGGCCAACTGCTGAAACACGTGGAAATTTAAATCCTAGCATTCAATATTGGACTAGTCGAGGCTGGGGTTATGTTGATGTCAATTATGGTGGTAGCACTGGTATGTTTGCAACCATTTCATGGCTGAATTTCTGTGTGGACTCTGGAAAGGTTGATGGAGAACGGTTATGCATCACAGGGGGCTCTGCTGGGGGATATACCACCTTAGCTGCTCTTGCTTTTAGAGATACGTTTAAGGCAGGAGCTTCCTTGTATGGGATAGCTGACTTACGCTTGTTGAGAGCAGATACACACAAGTTTGAATCTCATTATAT
TGACAATCTTGTTGGGAATGAAAAAGATTACTTCGAGAGGTCACCAATCAATTTTGTAGACAAATTTTCTTGCCCTATAATCCTATTCCAGGGATTGGAGGACAAAGTTGTACTACCTAATCAAGCTCGTAAGATATATCATGCATTGAAGGATAAGGGCTTGCCAGTTGCTCTGGTTGAGTATGAAGGAGAACAACATGGTTTCCGCAAGGCAGAAAATATCAAATTTACCCTGGAACAACAAATGATGTTCTTTGCACGATCAGTGGGACGTTTCCAAGTTGCAGATGATATTAACCCTATCAAAATTGATAACTTTGACTAG

Coding sequence (CDS)

ATGCTCTTAATATTACAATTTGCGATATGCTCTTACATCAAACAGTGTAAAACGGGAAGCAACTCAGCCACATCAAATGATTTTTCAAGAGGTCCAGCGTTAAATCTTATTCTTTATGCATGCGCGTATGCTCAGCCGGGCAACAAGCTCGCTTCAATGAGTCTCTGTGCTCTATTAGGACTTGTTCGCTTTTCTGCTCCATCCTCTTTTCTTATTTCCAATTCCAACGCCTTAAATAGAGTTTTCATCAACCGAGTCTCCACTGGAAGGAAGTTTCGGAGCTACAACACCATGGCTTCATCCATGTCTTCTTCGCCTAACATCAACAAACACGTCTCAGAAGTAGCAGAGCAGGAGCAGCTCCCCAAAATCACTGCGCCGTACGGCTCCTGGAAGTCCCCAATCACTGCTGACGTTGTTACTGGTGCGTCCAAGCGACTTGGTGGTACTGCTGTCGACGGCAATGGACACCTTATCTGGCTCGAATCACGGCCCACCGAATCAGGGCGGGGAGTGCTTGTTAAGGAATCGGAAAATCCAGGGAACGAGCCTAGTGATATTACTCCAAAGGACTTTTCAGTGCGGAACACGACGCAGGAATACGGCGGCGGTGCATTCACGGTGGCTGGAGACATCGTTGTCTTCTCGAATTACAACGACCAAAGACTTCACAAGCAATCTTTAAATTCAGATTCGCCTCCGCAAGCACTAACTCCCGATTACGGTGGACGATCAGTTAGTTATGCAGATGGGGTATTTGATTCTCGTTTTAATCGTTTTATTACCGTCCAGGAAGATGGACGTCAAAGTAGCTTGAATCCAATCACCACAATTGTGTCAGTGGAACTTGACGGCAAGGTTATTAATGAACCGAAGGTCTTAGTTGGAGGAAATGATTTCTATGCCTTCCCACGAGTGGATCCCAAAGGGGAACGGATTGCATGGATTGAATGGGGTCATCCTAACATGCCTTGGGATAAATCTGAGCTCTGGGTTGGCTACCTTTCTGAGAATGGAGAGGTCTACAAACGAGTCTGTGTTGCTGGTGGTGATCCAAAGCTTGTTGAATCTCCTACTGAACCGAAGTGGTCTGCTCAGGGTATGTACATAGCTTTTGAAATGATTAAAGAACCTGTAGTCTTTTGGCGTTGCCCGAGATTGCAGGAGTTTGAGGGTAACAATGAGGTGGCTCCAATATATTCTTTAAACACTGAGTTTTCCCGACCCTTATGGGTTTTCGGGACAAACTCTTATGAATTCTTAAGGATCGGCGTTGGGAGAAACATCATAGTCTGCAGCTACAGACAGCGTGGGCGATCATATCTTGGAGTTCTGGATGAGGCGCAAAGCTCACTATCCTTGCTTGATATCCCATTCACGGATATTGAAAATATTGCTCTGGGAAGTCAATGTATATATGTGGAAGGATCATCGGCACTTCATCCATCATCTATTGCCAAGGTGACCTTGAATGAAAGAACCTTGGAAGTAGTAGGTTTCTCTATTATCTGGTCATCTTCGCCAGATATTTTGAAATTTAAGTCGTACTTCAGCCTTCCTGAGTTCATTGAATTTCCAACTGAAGTTCCAGGCCAAAAGGCTTATGCTTACTTTTATCCACCGTCCAATCCTATATACCAGGCTAGTCAGGATGAAAAGCCTCCACTGTTGTTGAAAAGCCATGGAGGGCCAACTGCTGAAACACGTGGAAATTTAAATCCTAGCATTCAATATTGGACTAGTCGAGGCTGGGGTTATGTTGATGTCAATTATGGTGGTAGCACTGGTATGTTTGCAACCATTTCATGGCTGAATTTCTGTGTGGACTCTGGAAAGGTTGATGGAGAACGGTTATGCATCACAGGGGGCTCTGCTGGGGGATATACCACCTTAGCTGCTCTTGCTTTTAGAGATACGTTTAAGGCAGGAGCTTCCTTGTATGGGATAGCTGACTTACGCTTGTTGAGAGCAGATACACACAAGTTTGAATCTCATTATAT
TGACAATCTTGTTGGGAATGAAAAAGATTACTTCGAGAGGTCACCAATCAATTTTGTAGACAAATTTTCTTGCCCTATAATCCTATTCCAGGGATTGGAGGACAAAGTTGTACTACCTAATCAAGCTCGTAAGATATATCATGCATTGAAGGATAAGGGCTTGCCAGTTGCTCTGGTTGAGTATGAAGGAGAACAACATGGTTTCCGCAAGGCAGAAAATATCAAATTTACCCTGGAACAACAAATGATGTTCTTTGCACGATCAGTGGGACGTTTCCAAGTTGCAGATGATATTAACCCTATCAAAATTGATAACTTTGACTAG

Protein sequence

MLLILQFAICSYIKQCKTGSNSATSNDFSRGPALNLILYACAYAQPGNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSPNINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQSLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGKVINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEFSRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIALGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFPTEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNYGGSTGMFATISWLNFCVDSGKVDGERLCITGGSAGGYTTLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKFSCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFARSVGRFQVADDINPIKIDNFD
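The protein sequence above is the conceptual translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code and a CDS that starts in frame 0. The helper `translate` is illustrative and is not the tool used to build this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCTCTTAATATTACAATTTGCG"))
```

The whitespace stripping matters here because the sequences on this page are wrapped across several lines; the full CDS must be joined before translation.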
Homology
BLAST of HG10014100 vs. NCBI nr
Match: XP_038896994.1 (uncharacterized protein LOC120085177 [Benincasa hispida])

HSP 1 Score: 1304.7 bits (3375), Expect = 0.0e+00
Identity = 653/741 (88.12%), Positives = 676/741 (91.23%), Query Frame = 0

Query: 50  LASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSPNIN 109
           LASMSLCALLG+VRFSAPSS  I+N NALNR  INRVSTG KFR+YNTMASSMSSSPN N
Sbjct: 16  LASMSLCALLGVVRFSAPSSLFITNFNALNRASINRVSTGTKFRAYNTMASSMSSSPNTN 75

Query: 110 KHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTESG 169
           K +SEV   EQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTE+G
Sbjct: 76  KDLSEVV--EQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTEAG 135

Query: 170 RGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQSLN 229
           RGVLVKESENPG+EPSDITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY DQRL+KQ+LN
Sbjct: 136 RGVLVKESENPGDEPSDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYKDQRLYKQALN 195

Query: 230 SDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGKVIN 289
           S SPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLNPITTIVSVELDGK IN
Sbjct: 196 SGSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNPITTIVSVELDGKDIN 255

Query: 290 EPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAG 349
           EPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAG
Sbjct: 256 EPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAG 315

Query: 350 GDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEFSRP 409
           GDPKLVESPTEPKWSAQG    F +      FW     + FE NNEVAPIYSL+ EFSRP
Sbjct: 316 GDPKLVESPTEPKWSAQGE--LFFITDRQSGFWNL--YKWFEANNEVAPIYSLSAEFSRP 375

Query: 410 LWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIALGS 469
           LWVFGTNSYEFLRI VGRNI+VCSYRQ+GRSYLGVLDEAQSSLSLLDIPFTDIENIALGS
Sbjct: 376 LWVFGTNSYEFLRISVGRNILVCSYRQQGRSYLGVLDEAQSSLSLLDIPFTDIENIALGS 435

Query: 470 QCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFPTEV 529
            CIYVEGSSALHPSSIAKVTLNERT EVVGF+IIWSSSPDILK+KSYFSLPEFIEFPTEV
Sbjct: 436 HCIYVEGSSALHPSSIAKVTLNERTWEVVGFTIIWSSSPDILKYKSYFSLPEFIEFPTEV 495

Query: 530 PGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDV 589
           PGQ AYAYFYPPSNP+YQASQDEKPPLLLKSHGGPTAETRG+LNP IQYWTSRGWGYVDV
Sbjct: 496 PGQNAYAYFYPPSNPLYQASQDEKPPLLLKSHGGPTAETRGSLNPGIQYWTSRGWGYVDV 555

Query: 590 NYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYTTLA 649
           NYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYTTLA
Sbjct: 556 NYGGSTGYGREYRERLLRQWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYTTLA 615

Query: 650 ALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKFSCP 709
           ALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYF+RSPINFVDK SCP
Sbjct: 616 ALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFDRSPINFVDKISCP 675

Query: 710 IILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFF 769
           IILFQGLEDKVVLPNQ+RKIY+ALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFF
Sbjct: 676 IILFQGLEDKVVLPNQSRKIYNALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFF 735

Query: 770 ARSVGRFQVADDINPIKIDNF 774
           ARSVG FQVADDINPIKIDNF
Sbjct: 736 ARSVGHFQVADDINPIKIDNF 750
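Each HSP summary line reports its statistics as `matches/alignment length (percent)`. The following is a minimal sketch for parsing such a line and recomputing the percentages, assuming the `name = n/m (p%)` layout used in the reports on this page. The function `parse_hsp_stats` is a hypothetical helper, not part of any BLAST toolkit.

```python
import re

# Matches fragments such as "Identity = 653/741 (88.12%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_stats(line):
    """Return {name: {...}} for every 'name = n/m (p%)' item in the line."""
    stats = {}
    for name, matches, length, pct in STAT.findall(line):
        matches, length = int(matches), int(length)
        stats[name] = {
            "matches": matches,
            "length": length,
            "reported_pct": float(pct),
            # Recomputed from the raw counts, rounded like the report.
            "computed_pct": round(100 * matches / length, 2),
        }
    return stats


line = "Identity = 653/741 (88.12%), Positives = 676/741 (91.23%), Query Frame = 0"
for name, values in parse_hsp_stats(line).items():
    print(name, values)
```

Because the pattern captures whatever label precedes the `=` sign, the same function works on every hit in this section; `Query Frame = 0` is ignored since it has no `n/m` fraction.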

BLAST of HG10014100 vs. NCBI nr
Match: XP_023535387.1 (uncharacterized protein LOC111796842 [Cucurbita pepo subsp. pepo] >XP_023535388.1 uncharacterized protein LOC111796842 [Cucurbita pepo subsp. pepo] >XP_023535389.1 uncharacterized protein LOC111796842 [Cucurbita pepo subsp. pepo] >XP_023535390.1 uncharacterized protein LOC111796842 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1298.1 bits (3358), Expect = 0.0e+00
Identity = 652/745 (87.52%), Positives = 673/745 (90.34%), Query Frame = 0

Query: 47  GNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSP 106
           G++L SMS+CALLG VRF APSS LISN NALNR FINRVS GR FRSYN MASSMSSS 
Sbjct: 7   GDELVSMSVCALLGPVRFFAPSSSLISNFNALNRAFINRVSAGRHFRSYNPMASSMSSSS 66

Query: 107 NINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPT 166
           + NK V EVA  EQL KITAPYGSWKSPITA+VVTGASKRLGGTAVDGNG LIWLESRPT
Sbjct: 67  STNKDVPEVA--EQLAKITAPYGSWKSPITAEVVTGASKRLGGTAVDGNGRLIWLESRPT 126

Query: 167 ESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQ 226
           ESGRGVLVKES NPG+EPSDITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY DQRL+KQ
Sbjct: 127 ESGRGVLVKESNNPGDEPSDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYKDQRLYKQ 186

Query: 227 SLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK 286
           SL SDSPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLN ITTIVSVELDGK
Sbjct: 187 SLISDSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNTITTIVSVELDGK 246

Query: 287 VINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 346
            IN+PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC
Sbjct: 247 DINDPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 306

Query: 347 VAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEF 406
           VAGGDPKLVESPTEPKWSAQG    + +      FW     + FEGNNEVAP+YSLN EF
Sbjct: 307 VAGGDPKLVESPTEPKWSAQGE--LYFITDRQSGFWNL--FKWFEGNNEVAPVYSLNAEF 366

Query: 407 SRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIA 466
           SRPLWVFGTNSYEFLRIG GRN+I+CSYRQRG+SYLGVLDEAQSSLSLLDIPFTDI+NIA
Sbjct: 367 SRPLWVFGTNSYEFLRIGAGRNVILCSYRQRGQSYLGVLDEAQSSLSLLDIPFTDIDNIA 426

Query: 467 LGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFP 526
           LG+ CIYVEGSSALHP SIAKVTLNERTL V GF+IIWSSSPDILKFKSYFSLPEFIEFP
Sbjct: 427 LGNHCIYVEGSSALHPPSIAKVTLNERTLRVEGFTIIWSSSPDILKFKSYFSLPEFIEFP 486

Query: 527 TEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 586
           TEVPGQ AYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY
Sbjct: 487 TEVPGQNAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 546

Query: 587 VDVNYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYT 646
           VDVNYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYT
Sbjct: 547 VDVNYGGSTGYGREYRERLLRQWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYT 606

Query: 647 TLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 706
           TLAALAFRDTFKAGASLYGIADL LLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF
Sbjct: 607 TLAALAFRDTFKAGASLYGIADLSLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 666

Query: 707 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 766
           SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM
Sbjct: 667 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 726

Query: 767 MFFARSVGRFQVADDINPIKIDNFD 775
           MFFARSVGRFQVADDINPIKIDNFD
Sbjct: 727 MFFARSVGRFQVADDINPIKIDNFD 745

BLAST of HG10014100 vs. NCBI nr
Match: KAG6591361.1 (Dipeptidyl aminopeptidase BIII, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1290.4 bits (3338), Expect = 0.0e+00
Identity = 649/745 (87.11%), Positives = 673/745 (90.34%), Query Frame = 0

Query: 47  GNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSP 106
           G++L SMS+CALLG VRF APSS LISN NALNR FINRVSTGR FRSYN MASSMSSS 
Sbjct: 7   GDELVSMSVCALLGPVRFFAPSSSLISNFNALNRAFINRVSTGRHFRSYNPMASSMSSSS 66

Query: 107 NINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPT 166
           + NK V EVA  EQL KITAPYGSWKSPITA+VVTGASKRLGGTAVDGNG LIWLESRPT
Sbjct: 67  STNKDVPEVA--EQLAKITAPYGSWKSPITAEVVTGASKRLGGTAVDGNGRLIWLESRPT 126

Query: 167 ESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQ 226
           ESGRGVLVKES+NPG+EPSDITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY DQRL+KQ
Sbjct: 127 ESGRGVLVKESDNPGDEPSDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYKDQRLYKQ 186

Query: 227 SLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK 286
           SL SDSPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLN ITTIVSVELDGK
Sbjct: 187 SLISDSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNTITTIVSVELDGK 246

Query: 287 VINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 346
            IN+PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC
Sbjct: 247 DINDPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 306

Query: 347 VAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEF 406
           VAGGDPKLVESPTEPKWSAQG    F +      FW     + FE NNEVAP+YSLN EF
Sbjct: 307 VAGGDPKLVESPTEPKWSAQGE--LFFITDRQSGFWNL--FKWFEVNNEVAPVYSLNAEF 366

Query: 407 SRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIA 466
           SRPLWVFGTNSYEFLRIG  RN+I+CSYRQRG+SYLGVLDEAQSSLSLLDIPFTDI+NIA
Sbjct: 367 SRPLWVFGTNSYEFLRIGAERNVILCSYRQRGQSYLGVLDEAQSSLSLLDIPFTDIDNIA 426

Query: 467 LGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFP 526
           LG+ CIYVEGSSALHP SIAKVTLNERTL V GF++IWSSSPDILKFKSYFSLPEFIEFP
Sbjct: 427 LGNHCIYVEGSSALHPPSIAKVTLNERTLGVEGFTVIWSSSPDILKFKSYFSLPEFIEFP 486

Query: 527 TEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 586
           TEVPGQ AYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY
Sbjct: 487 TEVPGQNAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 546

Query: 587 VDVNYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYT 646
           VDVNYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYT
Sbjct: 547 VDVNYGGSTGYGREYRERLLRQWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYT 606

Query: 647 TLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 706
           TLAALAFRDTFKAGASLYGIADL LLRADTHKFESHYIDNLVG+EKDYFERSPINFVDKF
Sbjct: 607 TLAALAFRDTFKAGASLYGIADLSLLRADTHKFESHYIDNLVGDEKDYFERSPINFVDKF 666

Query: 707 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 766
           SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM
Sbjct: 667 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 726

Query: 767 MFFARSVGRFQVADDINPIKIDNFD 775
           MFFARSVGRFQVAD+INPIKIDNFD
Sbjct: 727 MFFARSVGRFQVADNINPIKIDNFD 745

BLAST of HG10014100 vs. NCBI nr
Match: KAG7024237.1 (Dipeptidyl aminopeptidase BIII [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1290.0 bits (3337), Expect = 0.0e+00
Identity = 649/745 (87.11%), Positives = 672/745 (90.20%), Query Frame = 0

Query: 47   GNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSP 106
            G++L SMS+CALLG VRF APSS LISN NALNR FINRVSTGR FRSYN MASSMSSS 
Sbjct: 719  GDELVSMSVCALLGPVRFFAPSSSLISNFNALNRAFINRVSTGRHFRSYNPMASSMSSSS 778

Query: 107  NINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPT 166
            + NK V EVA  EQL KITAPYGSWKSPITA+VVTGASKRLGGTAVDGNG LIWLESRPT
Sbjct: 779  STNKDVPEVA--EQLAKITAPYGSWKSPITAEVVTGASKRLGGTAVDGNGRLIWLESRPT 838

Query: 167  ESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQ 226
            ESGRGVLVKES+NPG+EPSDITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY DQRL+KQ
Sbjct: 839  ESGRGVLVKESDNPGDEPSDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYKDQRLYKQ 898

Query: 227  SLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK 286
            SL SDSPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLN ITTIVSVELDGK
Sbjct: 899  SLISDSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNTITTIVSVELDGK 958

Query: 287  VINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 346
             IN+PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC
Sbjct: 959  DINDPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 1018

Query: 347  VAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEF 406
            VAGGDPKLVESPTEPKWSAQG    F +      FW     + FE NNEVAP+YSLN EF
Sbjct: 1019 VAGGDPKLVESPTEPKWSAQGE--LFFITDRQSGFWNL--FKWFEVNNEVAPVYSLNAEF 1078

Query: 407  SRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIA 466
            SRPLWVFGTNSYE LRIG  RN+I+CSYRQRG+SYLGVLDEAQSSLSLLDIPFTDI+NIA
Sbjct: 1079 SRPLWVFGTNSYELLRIGAERNVILCSYRQRGQSYLGVLDEAQSSLSLLDIPFTDIDNIA 1138

Query: 467  LGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFP 526
            LG+ CIYVEGSSALHP SIAKVTLNERTL V GF++IWSSSPDILKFKSYFSLPEFIEFP
Sbjct: 1139 LGNHCIYVEGSSALHPPSIAKVTLNERTLGVEGFTVIWSSSPDILKFKSYFSLPEFIEFP 1198

Query: 527  TEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 586
            TEVPGQ AYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY
Sbjct: 1199 TEVPGQNAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 1258

Query: 587  VDVNYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYT 646
            VDVNYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYT
Sbjct: 1259 VDVNYGGSTGYGREYRERLLRQWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYT 1318

Query: 647  TLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 706
            TLAALAFRDTFKAGASLYGIADL LLRADTHKFESHYIDNLVG+EKDYFERSPINFVDKF
Sbjct: 1319 TLAALAFRDTFKAGASLYGIADLSLLRADTHKFESHYIDNLVGDEKDYFERSPINFVDKF 1378

Query: 707  SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 766
            SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM
Sbjct: 1379 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 1438

Query: 767  MFFARSVGRFQVADDINPIKIDNFD 775
            MFFARSVGRFQVADDINPIKIDNFD
Sbjct: 1439 MFFARSVGRFQVADDINPIKIDNFD 1457

BLAST of HG10014100 vs. NCBI nr
Match: XP_022936165.1 (uncharacterized protein LOC111442847 [Cucurbita moschata] >XP_022936166.1 uncharacterized protein LOC111442847 [Cucurbita moschata] >XP_022936167.1 uncharacterized protein LOC111442847 [Cucurbita moschata])

HSP 1 Score: 1288.1 bits (3332), Expect = 0.0e+00
Identity = 647/745 (86.85%), Positives = 671/745 (90.07%), Query Frame = 0

Query: 47  GNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSP 106
           G++L SMS+CALLG VRF APSS LISN NALNR FINRVSTGR FRSYN MA+SMSSS 
Sbjct: 7   GDELVSMSVCALLGPVRFFAPSSSLISNFNALNRAFINRVSTGRHFRSYNPMATSMSSSS 66

Query: 107 NINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPT 166
           + NK V EVA  EQL KITAPYGSW SPITA+VVTGASKRLGGTAVDGNG LIWLESRPT
Sbjct: 67  STNKDVPEVA--EQLAKITAPYGSWNSPITAEVVTGASKRLGGTAVDGNGRLIWLESRPT 126

Query: 167 ESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQ 226
           ESGRGVLVKES+NPG+EPSDITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY DQRL+KQ
Sbjct: 127 ESGRGVLVKESDNPGDEPSDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYKDQRLYKQ 186

Query: 227 SLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK 286
           SL SDSPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLN ITTIVSVELDGK
Sbjct: 187 SLISDSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNTITTIVSVELDGK 246

Query: 287 VINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 346
            IN+PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC
Sbjct: 247 DINDPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 306

Query: 347 VAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEF 406
           VAGGDPKLVESPTEPKWSAQG    F +      FW     + FE NNEVAP+YSLN EF
Sbjct: 307 VAGGDPKLVESPTEPKWSAQGE--LFFITDRQSGFWNL--FKWFEVNNEVAPVYSLNAEF 366

Query: 407 SRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIA 466
           SRPLWVFGTNSYEFLRIG  RN+I+CSYRQRG+SYLGVLDEAQSSLSLLDIPFTDI+NIA
Sbjct: 367 SRPLWVFGTNSYEFLRIGAERNVILCSYRQRGQSYLGVLDEAQSSLSLLDIPFTDIDNIA 426

Query: 467 LGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFP 526
           LG+ CIYVEGSSALHP SIAKVTLNER L V GF++IWSSSPDILKFKSYFSLPEFIEFP
Sbjct: 427 LGNHCIYVEGSSALHPPSIAKVTLNERNLGVEGFTVIWSSSPDILKFKSYFSLPEFIEFP 486

Query: 527 TEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 586
           TEVPGQ AYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY
Sbjct: 487 TEVPGQNAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 546

Query: 587 VDVNYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYT 646
           VDVNYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYT
Sbjct: 547 VDVNYGGSTGYGREYRERLLRRWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYT 606

Query: 647 TLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 706
           TLAALAFRDTFKAGASLYGIADL LLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF
Sbjct: 607 TLAALAFRDTFKAGASLYGIADLSLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 666

Query: 707 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 766
           SCPIILFQGL+DKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM
Sbjct: 667 SCPIILFQGLQDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 726

Query: 767 MFFARSVGRFQVADDINPIKIDNFD 775
           MFFARSVGRFQVADDINPIKIDNFD
Sbjct: 727 MFFARSVGRFQVADDINPIKIDNFD 745

BLAST of HG10014100 vs. ExPASy Swiss-Prot
Match: V5YMB3 (Dipeptidyl aminopeptidase BIII OS=Pseudoxanthomonas mexicana OX=128785 GN=dapb3 PE=1 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 3.1e-17
Identity = 66/234 (28.21%), Positives = 101/234 (43.16%), Query Frame = 0

Query: 551 DEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNYGGSTGMFA----------- 610
           D   PL+L  HGGP A          Q+  +RG+  + VN+ GSTG              
Sbjct: 416 DAPVPLVLLVHGGPWARDSYGYGGYNQWLANRGYAVLSVNFRGSTGFGKDFTNAGNGEWA 475

Query: 611 ------TISWLNFCVDSGKVDGERLCITGGSAGGYTTLAALAFR-DTFKAGASLYGIADL 670
                  I  + + V  G    +++ I GGS GGY TL  L F  D F  G  + G ++L
Sbjct: 476 GKMHDDLIDAVQWAVKQGVTTQDQVAIMGGSYGGYATLTGLTFTPDAFACGVDIVGPSNL 535

Query: 671 RLLRADTHKFESHYIDNLV---------GNEKDYFERSPINFVDKFSCPIILFQGLEDKV 730
             L +    + + + + L            +K   ERSP+   D+   P+++ QG  D  
Sbjct: 536 NTLLSTVPPYWASFFEQLAKRMGDPRTDAGKKWLTERSPLTRADQIKKPLLIGQGANDPR 595

Query: 731 VLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFARSVG 758
           V   ++ +I  A++ K +PV  V +  E HGF + EN K        F A+ +G
Sbjct: 596 VKQAESDQIVKAMQAKNIPVTYVLFPDEGHGFARPENNKAFNAVTEGFLAQCLG 649

BLAST of HG10014100 vs. ExPASy Swiss-Prot
Match: P34422 (Dipeptidyl peptidase family member 6 OS=Caenorhabditis elegans OX=6239 GN=dpf-6 PE=3 SV=2)

HSP 1 Score: 92.0 bits (227), Expect = 3.1e-17
Identity = 82/289 (28.37%), Positives = 125/289 (43.25%), Query Frame = 0

Query: 485 IAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEF--PTEVP-GQKAYAYF-YP 544
           + K TLN++    +GF      + D +  ++Y SLP        ++VP G + YA     
Sbjct: 372 LKKYTLNKQ----IGFDF---RARDEMTIQAYLSLPPQAPLLKSSQVPDGDRPYANLGMI 431

Query: 545 PSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNYGGSTGM--- 604
           P+ P           +++  HGGP A      +P   + T+RG+  + VN+ GSTG    
Sbjct: 432 PAVP---------QKMIVLVHGGPKARDHYGFSPMNAWLTNRGYSVLQVNFRGSTGFGKR 491

Query: 605 --------------FATISWLNFCVDSGKVDGERLCITGGSAGGYTTLAALAFR-DTFKA 664
                         F  +  + F V  G  +   + + GGS GGY TL AL F   TF  
Sbjct: 492 LTNAGNGEWGRKMHFDILDAVEFAVSKGIANRSEVAVMGGSYGGYETLVALTFTPQTFAC 551

Query: 665 GASLYGIADLRLL-----------RADTHKFESHYIDNLVGNEKDYFERSPINFVDKFSC 724
           G  + G ++L  L           R D  K     I +  G +     RSP+ F D+ + 
Sbjct: 552 GVDIVGPSNLISLVQAIPPYWLGFRKDLIKMVGADISDEEGRQ-SLQSRSPLFFADRVTK 611

Query: 725 PIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAEN 741
           PI++ QG  D  V   ++ +   AL+ K +PV  + Y  E HG RK +N
Sbjct: 612 PIMIIQGANDPRVKQAESDQFVAALEKKHIPVTYLLYPDEGHGVRKPQN 643

BLAST of HG10014100 vs. ExPASy Swiss-Prot
Match: Q0IXP9 (Acylamino-acid-releasing enzyme 1 OS=Oryza sativa subsp. japonica OX=39947 GN=Os10g0415600 PE=3 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 1.1e-09
Identity = 49/218 (22.48%), Positives = 94/218 (43.12%), Query Frame = 0

Query: 555 PLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNYGGSTG-----------------M 614
           P ++  HGGP      + + S+ +  S+G+  + VNY GS G                 +
Sbjct: 541 PTIVVLHGGPHTVYPSSYSKSLAFLYSQGYNLLVVNYRGSLGFGEEALQSLPGNIGSQDV 600

Query: 615 FATISWLNFCVDSGKVDGERLCITGGSAGGYTTLAALA-FRDTFKAGASLYGIADLRLLR 674
              ++ L+F +  G +D  ++ + GGS GG+ T   +     TF A A+   + +L L+ 
Sbjct: 601 NDVLTALDFVIKKGLIDASKVAVVGGSHGGFLTTHLIGQAPGTFVAAAARNPVCNLSLMV 660

Query: 675 ADTHKFESHYIDNLVGNEK--------------DYFERSPINFVDKFSCPIILFQGLEDK 734
             T   E  +++ + G E                + ++SPI+ + K S P +   G +D 
Sbjct: 661 GTTDIPEWCFVE-IYGKEGKNCFSEYPSFDDLCQFHQKSPISHISKVSTPTLFLLGAQDL 720

Query: 735 VVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAEN 741
            V  +   +    LK+ G+   ++ +  + HG  K ++
Sbjct: 721 RVPVSNGLQYARTLKEMGVETKIIVFPEDMHGLDKPQS 757

BLAST of HG10014100 vs. ExPASy Swiss-Prot
Match: Q6F3I7 (Dipeptidyl aminopeptidase 4 OS=Pseudoxanthomonas mexicana OX=128785 GN=dap4 PE=1 SV=1)

HSP 1 Score: 63.2 bits (152), Expect = 1.5e-08
Identity = 51/157 (32.48%), Positives = 73/157 (46.50%), Query Frame = 0

Query: 601 ISWLNFCVDSGKVDGERLCITGGSAGGYTTLAALAFRD-TFKAGASLYGIADLRLLRADT 660
           I WL        VD  R+ + G S GGY TL  LA  D  +  G +   + D  L   DT
Sbjct: 593 IEWLK---SQAFVDPARIGVYGWSNGGYMTLMLLAKHDEAYACGVAGAPVTDWALY--DT 652

Query: 661 HKFESHYIDNLVGNEKDYFERSPINFVDKFSC-PIILFQGLEDKVVLPNQARKIYHALKD 720
           H +   Y+D    NE  Y E S    VD      ++L  G+ D  VL   + K+   L+ 
Sbjct: 653 H-YTERYMDLPKANEAGYREASVFTHVDGIGAGKLLLIHGMADDNVLFTNSTKLMSELQK 712

Query: 721 KGLPVALVEYEGEQHGFRKAENI-KFTLEQQMMFFAR 755
           +G P  L+ Y G +HG R ++ + ++ L +   FFAR
Sbjct: 713 RGTPFELMTYPGAKHGLRGSDLLHRYRLTED--FFAR 741

BLAST of HG10014100 vs. ExPASy Swiss-Prot
Match: Q8R146 (Acylamino-acid-releasing enzyme OS=Mus musculus OX=10090 GN=Apeh PE=1 SV=3)

HSP 1 Score: 59.7 bits (143), Expect = 1.7e-07
Identity = 59/235 (25.11%), Positives = 97/235 (41.28%), Query Frame = 0

Query: 536 AYFYPPSNPIYQASQDEKPPLLLKSHGGPTAE--TRGNLNPSIQYWTSRGWGYVDVNYGG 595
           A    PSN    +    + P+++  HGGP +   T   L P++      G+  + VNY G
Sbjct: 486 AILLQPSN----SPDKSQVPMVVMPHGGPHSSFVTAWMLFPAM--LCKMGFAVLLVNYRG 545

Query: 596 STGM-------------FATISWLNFCV----DSGKVDGERLCITGGSAGGY-------- 655
           STG                 +  + F V         D  R+ + GGS GG+        
Sbjct: 546 STGFGQDSILSLPGNVGHQDVKDVQFAVQQVLQEEHFDARRVALMGGSHGGFLSCHLIGQ 605

Query: 656 --TTLAALAFRDTFKAGASLYGIADLR--LLRADTHKFESHYIDNLVGNEKDYFERSPIN 715
              T +A   R+      S+ G  D+    +      + + Y+ +L   E +  ++SPI 
Sbjct: 606 YPETYSACIARNPVINIVSMMGTTDIPDWCMVETGFPYSNDYLPDLNVLE-EMLDKSPIK 665

Query: 716 FVDKFSCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAE 740
           ++ +   P++L  G ED+ V   Q  + YHALK + +PV L+ Y    H   + E
Sbjct: 666 YIPQVKTPVLLMLGQEDRRVPFKQGLEYYHALKARNVPVRLLLYPKSTHALSEVE 713

BLAST of HG10014100 vs. ExPASy TrEMBL
Match: A0A6J1F7N8 (uncharacterized protein LOC111442847 OS=Cucurbita moschata OX=3662 GN=LOC111442847 PE=4 SV=1)

HSP 1 Score: 1288.1 bits (3332), Expect = 0.0e+00
Identity = 647/745 (86.85%), Positives = 671/745 (90.07%), Query Frame = 0

Query: 47  GNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSP 106
           G++L SMS+CALLG VRF APSS LISN NALNR FINRVSTGR FRSYN MA+SMSSS 
Sbjct: 7   GDELVSMSVCALLGPVRFFAPSSSLISNFNALNRAFINRVSTGRHFRSYNPMATSMSSSS 66

Query: 107 NINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPT 166
           + NK V EVA  EQL KITAPYGSW SPITA+VVTGASKRLGGTAVDGNG LIWLESRPT
Sbjct: 67  STNKDVPEVA--EQLAKITAPYGSWNSPITAEVVTGASKRLGGTAVDGNGRLIWLESRPT 126

Query: 167 ESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQ 226
           ESGRGVLVKES+NPG+EPSDITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY DQRL+KQ
Sbjct: 127 ESGRGVLVKESDNPGDEPSDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYKDQRLYKQ 186

Query: 227 SLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK 286
           SL SDSPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLN ITTIVSVELDGK
Sbjct: 187 SLISDSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNTITTIVSVELDGK 246

Query: 287 VINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 346
            IN+PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC
Sbjct: 247 DINDPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 306

Query: 347 VAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEF 406
           VAGGDPKLVESPTEPKWSAQG    F +      FW     + FE NNEVAP+YSLN EF
Sbjct: 307 VAGGDPKLVESPTEPKWSAQGE--LFFITDRQSGFWNL--FKWFEVNNEVAPVYSLNAEF 366

Query: 407 SRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIA 466
           SRPLWVFGTNSYEFLRIG  RN+I+CSYRQRG+SYLGVLDEAQSSLSLLDIPFTDI+NIA
Sbjct: 367 SRPLWVFGTNSYEFLRIGAERNVILCSYRQRGQSYLGVLDEAQSSLSLLDIPFTDIDNIA 426

Query: 467 LGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFP 526
           LG+ CIYVEGSSALHP SIAKVTLNER L V GF++IWSSSPDILKFKSYFSLPEFIEFP
Sbjct: 427 LGNHCIYVEGSSALHPPSIAKVTLNERNLGVEGFTVIWSSSPDILKFKSYFSLPEFIEFP 486

Query: 527 TEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 586
           TEVPGQ AYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY
Sbjct: 487 TEVPGQNAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 546

Query: 587 VDVNYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYT 646
           VDVNYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYT
Sbjct: 547 VDVNYGGSTGYGREYRERLLRRWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYT 606

Query: 647 TLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 706
           TLAALAFRDTFKAGASLYGIADL LLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF
Sbjct: 607 TLAALAFRDTFKAGASLYGIADLSLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 666

Query: 707 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 766
           SCPIILFQGL+DKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM
Sbjct: 667 SCPIILFQGLQDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 726

Query: 767 MFFARSVGRFQVADDINPIKIDNFD 775
           MFFARSVGRFQVADDINPIKIDNFD
Sbjct: 727 MFFARSVGRFQVADDINPIKIDNFD 745

BLAST of HG10014100 vs. ExPASy TrEMBL
Match: A0A6J1IFZ3 (uncharacterized protein LOC111474184 OS=Cucurbita maxima OX=3661 GN=LOC111474184 PE=4 SV=1)

HSP 1 Score: 1287.3 bits (3330), Expect = 0.0e+00
Identity = 648/745 (86.98%), Positives = 672/745 (90.20%), Query Frame = 0

Query: 47  GNKLASMSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYNTMASSMSSSP 106
           G++L SMS+CALLG VRF APSS LISN NALNR FINRVS GR FRSYN MASSMSSS 
Sbjct: 7   GDELVSMSVCALLGPVRFFAPSSSLISNFNALNRAFINRVSAGRHFRSYNPMASSMSSSS 66

Query: 107 NINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPT 166
           + NK V EVA  EQL KITAPYGSWKSPITA+VVTGASKRLGGTAVDGNG LIWLESRPT
Sbjct: 67  STNKDVPEVA--EQLAKITAPYGSWKSPITAEVVTGASKRLGGTAVDGNGRLIWLESRPT 126

Query: 167 ESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQ 226
           ESGRGVLVKES+NPG++PSDITPK+FSVRNTTQEYGGGAFTVAGDIV+FSNY DQRL+KQ
Sbjct: 127 ESGRGVLVKESDNPGDDPSDITPKEFSVRNTTQEYGGGAFTVAGDIVIFSNYKDQRLYKQ 186

Query: 227 SLNSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK 286
           SL SDSPPQALTPDYGGRSVSYADGVFDSRFNRFIT+QEDGRQSSLNPITTIVSVELDG 
Sbjct: 187 SLISDSPPQALTPDYGGRSVSYADGVFDSRFNRFITIQEDGRQSSLNPITTIVSVELDGT 246

Query: 287 VINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 346
            IN+PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC
Sbjct: 247 DINDPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVC 306

Query: 347 VAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEF 406
           VAGGDPKLVESPTEPKWSAQG    F +      FW     + FE NNEVAP+YSLN EF
Sbjct: 307 VAGGDPKLVESPTEPKWSAQGE--LFFITDRQSGFWNL--FKWFEVNNEVAPVYSLNAEF 366

Query: 407 SRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIA 466
           SRPLWVFGTNSYEFLRIG GRN+I+CSYRQRG+SYL VLDEAQSSLSLLDIPFTDI+NIA
Sbjct: 367 SRPLWVFGTNSYEFLRIGAGRNVILCSYRQRGQSYLVVLDEAQSSLSLLDIPFTDIDNIA 426

Query: 467 LGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFP 526
           LG+ CIYVEGSSALHPSSIAKVTLNERTL V GF+IIWSSSPDILKFKSYFSLPEFIEFP
Sbjct: 427 LGNHCIYVEGSSALHPSSIAKVTLNERTLGVEGFTIIWSSSPDILKFKSYFSLPEFIEFP 486

Query: 527 TEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 586
           TEVPGQ AYAYFY PSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY
Sbjct: 487 TEVPGQNAYAYFYRPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGY 546

Query: 587 VDVNYGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYT 646
           VDVNYGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYT
Sbjct: 547 VDVNYGGSTGYGREYRERLLRQWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYT 606

Query: 647 TLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 706
           TLAALAFRDTFKAGASLYGIADL LLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF
Sbjct: 607 TLAALAFRDTFKAGASLYGIADLSLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKF 666

Query: 707 SCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 766
           SCPIILFQGLEDKVVLPNQARKIY+ALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM
Sbjct: 667 SCPIILFQGLEDKVVLPNQARKIYNALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQM 726

Query: 767 MFFARSVGRFQVADDINPIKIDNFD 775
           MFFARSVGRFQVADDINPIKIDNFD
Sbjct: 727 MFFARSVGRFQVADDINPIKIDNFD 745

BLAST of HG10014100 vs. ExPASy TrEMBL
Match: A0A0A0L3I1 (Peptidase_S9 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G639060 PE=4 SV=1)

HSP 1 Score: 1238.8 bits (3204), Expect = 0.0e+00
Identity = 624/740 (84.32%), Positives = 654/740 (88.38%), Query Frame = 0

Query: 53  MSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYN-TMASSMSSSPNINKH 112
           MS CALL L RF +PSS  ISN N LNR  IN +ST ++FRSYN TM SSMSSSPN    
Sbjct: 1   MSPCALLRLFRFPSPSSLFISNFNPLNRASINTLSTRKQFRSYNKTMTSSMSSSPNTTND 60

Query: 113 VSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTESGRG 172
             +++  +QLPKITAPYGSW SPITADVVTGASKRLGGTAV  NGHLIWLESRPTESGRG
Sbjct: 61  PPQLS--DQLPKITAPYGSWSSPITADVVTGASKRLGGTAVTANGHLIWLESRPTESGRG 120

Query: 173 VLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQSLNSD 232
           VLVKES   G+EP DITPK+FSVRNTTQEYGGGAFTVAGDIVVFSNY+DQRL+KQSLNSD
Sbjct: 121 VLVKESVKEGDEPCDITPKEFSVRNTTQEYGGGAFTVAGDIVVFSNYSDQRLYKQSLNSD 180

Query: 233 SPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGKVINEP 292
             PQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGK INEP
Sbjct: 181 LSPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGKDINEP 240

Query: 293 KVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGD 352
           KVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGD
Sbjct: 241 KVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGD 300

Query: 353 PKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEFSRPLW 412
           PKLVESPTEPKWSAQG    + +      FW     + FE NNEVAPIYSL+ EFSRPLW
Sbjct: 301 PKLVESPTEPKWSAQGE--LYFITDRQTGFWNL--YKWFEANNEVAPIYSLSAEFSRPLW 360

Query: 413 VFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIALGSQC 472
           VFGTNSY+ L+ G GRNIIVCSYRQRGRSYLGVLDE QSSLSLLDIPFTDIENIALGS C
Sbjct: 361 VFGTNSYDLLKTGDGRNIIVCSYRQRGRSYLGVLDETQSSLSLLDIPFTDIENIALGSDC 420

Query: 473 IYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFPTEVPG 532
           IYVEGSS LHPSSIAKVTLNER+LEVVGF+IIWSSSPDILKFKSYFSLPEFIEFPTEVPG
Sbjct: 421 IYVEGSSGLHPSSIAKVTLNERSLEVVGFTIIWSSSPDILKFKSYFSLPEFIEFPTEVPG 480

Query: 533 QKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNY 592
           Q AYAYFYPPSNP YQAS +EKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNY
Sbjct: 481 QNAYAYFYPPSNPKYQASPNEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNY 540

Query: 593 GGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYTTLAAL 652
           GGSTG                 +    S   F V+SGKVDGE+LCITGGSAGGYTTLAAL
Sbjct: 541 GGSTGYGREYRERLLRQWGIVDVNDCCSCARFLVESGKVDGEQLCITGGSAGGYTTLAAL 600

Query: 653 AFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKFSCPII 712
           AFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYF+RSPINFVDKFSCPII
Sbjct: 601 AFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFDRSPINFVDKFSCPII 660

Query: 713 LFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFAR 772
           LFQGLEDKVVLPNQ+RKIY+ALK+KGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFAR
Sbjct: 661 LFQGLEDKVVLPNQSRKIYNALKEKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFAR 720

Query: 773 SVGRFQVADDINPIKIDNFD 775
           +VGRFQVAD INP+KIDNFD
Sbjct: 721 TVGRFQVADAINPLKIDNFD 734

BLAST of HG10014100 vs. ExPASy TrEMBL
Match: A0A6J1C6I2 (uncharacterized protein LOC111008851 OS=Momordica charantia OX=3673 GN=LOC111008851 PE=4 SV=1)

HSP 1 Score: 1233.0 bits (3189), Expect = 0.0e+00
Identity = 618/741 (83.40%), Positives = 654/741 (88.26%), Query Frame = 0

Query: 53  MSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSYN--TMASSMSSSPNINK 112
           MS+CALLGL RFSAPS  L+SN NALNR FI R ST R++RSY+   MASS+SSS N NK
Sbjct: 1   MSVCALLGLARFSAPSFSLVSNFNALNRTFIKRFSTRRQYRSYSCKPMASSVSSSLNTNK 60

Query: 113 HVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTESGR 172
            +SEVA  EQL KITAPYGSWKSPITADVVTGASKRLGGTAVDGNG LIWLESRP ESGR
Sbjct: 61  DISEVA--EQLEKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGRLIWLESRPAESGR 120

Query: 173 GVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQSLNS 232
           GVLVKESE PG+EPSDITPK+FSVRNTTQEYGG AFTVAGDIVVFSNY DQRL+KQSLN 
Sbjct: 121 GVLVKESEKPGDEPSDITPKEFSVRNTTQEYGGAAFTVAGDIVVFSNYKDQRLYKQSLNP 180

Query: 233 DSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGKVINE 292
           DSPPQALTPD+GG SVSYADGVFD RFNRFIT+QEDGRQSSLNPITT+VSV+LDGK I++
Sbjct: 181 DSPPQALTPDHGGPSVSYADGVFDFRFNRFITIQEDGRQSSLNPITTVVSVKLDGKEIDD 240

Query: 293 PKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGG 352
           PKVLV GNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGG
Sbjct: 241 PKVLVEGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGG 300

Query: 353 DPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEFSRPL 412
           D KLVESPTEPKWSA G    F +      FW     + FE NNEVAP+YSLN EFS+PL
Sbjct: 301 DSKLVESPTEPKWSAHGE--LFFITDRESGFWNL--YKWFEANNEVAPVYSLNAEFSQPL 360

Query: 413 WVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIALGSQ 472
           WVFGTNSYEFL+  VGRN IVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDI+NI LGS 
Sbjct: 361 WVFGTNSYEFLKSSVGRNTIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIDNITLGSH 420

Query: 473 CIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFPTEVP 532
           C+YV GSS  HPSSIAKVTLNE+TLE  GF+IIWSSSPDILK+KSYFSLPEFIEFPTEVP
Sbjct: 421 CLYVVGSSGRHPSSIAKVTLNEKTLEAAGFTIIWSSSPDILKYKSYFSLPEFIEFPTEVP 480

Query: 533 GQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVN 592
           GQ AYAYFYPPSNPIYQA+Q EKPPLLLKSHGGPTAETRG LNPSIQYWTSRGWG+VDVN
Sbjct: 481 GQNAYAYFYPPSNPIYQANQAEKPPLLLKSHGGPTAETRGILNPSIQYWTSRGWGFVDVN 540

Query: 593 YGGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYTTLAA 652
           YGGSTG                 +    S   F VDSGKVDGERLCITGGSAGGYTTLAA
Sbjct: 541 YGGSTGYGREFRERLLRQWGIVDVNDCCSCARFLVDSGKVDGERLCITGGSAGGYTTLAA 600

Query: 653 LAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKFSCPI 712
           LAFRDTFKAGASLYG+ADL +LRA+THKFESHYIDNLVG+EKDYFERSPINFVDKFSCPI
Sbjct: 601 LAFRDTFKAGASLYGVADLSMLRAETHKFESHYIDNLVGSEKDYFERSPINFVDKFSCPI 660

Query: 713 ILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFA 772
           ILFQGLEDKVVLPNQ+RKIYHALK+KGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFA
Sbjct: 661 ILFQGLEDKVVLPNQSRKIYHALKEKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFA 720

Query: 773 RSVGRFQVADDINPIKIDNFD 775
           RSVGRFQVADDINPIKIDNF+
Sbjct: 721 RSVGRFQVADDINPIKIDNFE 735

BLAST of HG10014100 vs. ExPASy TrEMBL
Match: A0A1S3BU10 (uncharacterized protein LOC103493519 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103493519 PE=4 SV=1)

HSP 1 Score: 1227.2 bits (3174), Expect = 0.0e+00
Identity = 618/740 (83.51%), Positives = 648/740 (87.57%), Query Frame = 0

Query: 53  MSLCALLGLVRFSAPSSFLISNSNALNRVFINRVSTGRKFRSY-NTMASSMSSSPNINKH 112
           MS CALL L RF +PSS  ISN N LN   IN +ST ++FRSY  TMASSMSSSPN    
Sbjct: 1   MSPCALLRLFRFPSPSSLFISNFNPLNTASINTLSTRKQFRSYKKTMASSMSSSPN---- 60

Query: 113 VSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESRPTESGRG 172
                  +QLPKITAPYGSW SPITADVVTGASKRLGGTAV  NGHLIWLESRPTESGRG
Sbjct: 61  ----TSNDQLPKITAPYGSWNSPITADVVTGASKRLGGTAVAANGHLIWLESRPTESGRG 120

Query: 173 VLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAGDIVVFSNYNDQRLHKQSLNSD 232
           VLVKES   G+EP DITPK+FSVRNTTQEYGGGAF VAGD VVFSNYNDQRL+KQSLNSD
Sbjct: 121 VLVKESIKEGDEPCDITPKEFSVRNTTQEYGGGAFAVAGDTVVFSNYNDQRLYKQSLNSD 180

Query: 233 SPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVELDGKVINEP 292
           S PQALTPDYGGRSVSYADGVFD RFNRFIT+QEDGRQSSLNPITTIVSVELDGK INEP
Sbjct: 181 SSPQALTPDYGGRSVSYADGVFDFRFNRFITIQEDGRQSSLNPITTIVSVELDGKDINEP 240

Query: 293 KVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGD 352
           KVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGD
Sbjct: 241 KVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVYKRVCVAGGD 300

Query: 353 PKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSLNTEFSRPLW 412
           PKLVESPTEPKWSAQG    + +      FW     + FE NN VAPIYSL+ EFSRPLW
Sbjct: 301 PKLVESPTEPKWSAQGE--LYFITDRQTGFWNL--YKWFEANNVVAPIYSLSAEFSRPLW 360

Query: 413 VFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDIENIALGSQC 472
           VFGTNSY+ L+ G GRNIIVCSYR+RG+SYLGVLDE QSS+SLLDIPFTDIENIALGS C
Sbjct: 361 VFGTNSYDLLKTGDGRNIIVCSYRRRGQSYLGVLDETQSSISLLDIPFTDIENIALGSDC 420

Query: 473 IYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEFIEFPTEVPG 532
           IYVEGSS LHPSSIAKVTLNER+LEVVGF+IIWSSSPDILKFKSYFSLPEFIEFPTEVPG
Sbjct: 421 IYVEGSSGLHPSSIAKVTLNERSLEVVGFTIIWSSSPDILKFKSYFSLPEFIEFPTEVPG 480

Query: 533 QKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSRGWGYVDVNY 592
           Q AYAYFYPPSNP YQAS DEKPPLLLKSHGGPTAETRG+LNPSIQYWTSRGWGYVDVNY
Sbjct: 481 QNAYAYFYPPSNPRYQASPDEKPPLLLKSHGGPTAETRGSLNPSIQYWTSRGWGYVDVNY 540

Query: 593 GGSTG-----------------MFATISWLNFCVDSGKVDGERLCITGGSAGGYTTLAAL 652
           GGSTG                 +    S   F V+SGKVDGE+LCITGGSAGGYTTLAAL
Sbjct: 541 GGSTGYGREYRERLLRRWGIVDVNDCCSCARFLVESGKVDGEQLCITGGSAGGYTTLAAL 600

Query: 653 AFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINFVDKFSCPII 712
           AFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYF+RSPINFVDKFSCPII
Sbjct: 601 AFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFDRSPINFVDKFSCPII 660

Query: 713 LFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFAR 772
           LFQGLEDKVVLPNQ+RKIY+ALK+KGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFAR
Sbjct: 661 LFQGLEDKVVLPNQSRKIYNALKEKGLPVALVEYEGEQHGFRKAENIKFTLEQQMMFFAR 720

Query: 773 SVGRFQVADDINPIKIDNFD 775
           +VGRFQVADDINP+KIDNFD
Sbjct: 721 TVGRFQVADDINPLKIDNFD 728

BLAST of HG10014100 vs. TAIR 10
Match: AT5G36210.1 (alpha/beta-Hydrolases superfamily protein )

HSP 1 Score: 936.8 bits (2420), Expect = 1.1e-272
Identity = 470/749 (62.75%), Positives = 570/749 (76.10%), Query Frame = 0

Query: 50  LASMSLCALLGLVRFS---APSSFLISNSNALNRVFINRVSTGRKF--RSYNTMASSMSS 109
           +A + L +L  LV FS    PSS   +++  L+R F + +    +F  +   + AS  SS
Sbjct: 1   MALLLLTSLNHLVSFSLTRLPSS--SAHNLFLSRSFSSSIRRFNRFSLKPLRSFASMSSS 60

Query: 110 SPNINKHVSEVAEQEQLPKITAPYGSWKSPITADVVTGASKRLGGTAVDGNGHLIWLESR 169
           SP          +  Q P  TAPYGSWKSPITAD+V+GASKRLGGTAVD +G L+ LESR
Sbjct: 61  SP----------DAAQTPLTTAPYGSWKSPITADIVSGASKRLGGTAVDSHGRLVLLESR 120

Query: 170 PTESGRGVLVKESENPGNEPSDITPKDFSVRNTTQEYGGGAFTVAG-DIVVFSNYNDQRL 229
           P ESGRGVLV +    G    DITPKDF+VR  TQEYGGGAF ++  D +VFSNY DQRL
Sbjct: 121 PNESGRGVLVLQ----GETSIDITPKDFAVRTLTQEYGGGAFQISSDDTLVFSNYKDQRL 180

Query: 230 HKQSL-NSDSPPQALTPDYGGRSVSYADGVFDSRFNRFITVQEDGRQSSLNPITTIVSVE 289
           +KQ + + DS P+ +TPDYG  +V+YADGVFDSRFNR++TV+EDGRQ   NPITTIV V 
Sbjct: 181 YKQDITDKDSSPKPITPDYGTPAVTYADGVFDSRFNRYVTVREDGRQDRSNPITTIVEVN 240

Query: 290 LDGKVINEPKVLVGGNDFYAFPRVDPKGERIAWIEWGHPNMPWDKSELWVGYLSENGEVY 349
           L G+ + EPKVLV GNDFYAFPR+DPK ER+AWIEW HPNMPWDK+ELWVGY+SE G + 
Sbjct: 241 LSGETLEEPKVLVSGNDFYAFPRLDPKCERLAWIEWSHPNMPWDKAELWVGYISEGGNID 300

Query: 350 KRVCVAGGDPKLVESPTEPKWSAQGMYIAFEMIKEPVVFWRCPRLQEFEGNNEVAPIYSL 409
           KRVCVAG DPK VESPTEPKWS++G    F +       W   +    E  NEV  +Y L
Sbjct: 301 KRVCVAGCDPKYVESPTEPKWSSRGE--LFFVTDRKNGCWNIHKW--IESTNEVVSVYPL 360

Query: 410 NTEFSRPLWVFGTNSYEFLRIGVGRNIIVCSYRQRGRSYLGVLDEAQSSLSLLDIPFTDI 469
           + EF++PLW+FGTNSYE +     +N+I CSYRQ+G+SYLG++D++Q S SLLDIP TD 
Sbjct: 361 DGEFAKPLWIFGTNSYEIIECSEEKNLIACSYRQKGKSYLGIVDDSQGSCSLLDIPLTDF 420

Query: 470 ENIALGSQCIYVEGSSALHPSSIAKVTLNERTLEVVGFSIIWSSSPDILKFKSYFSLPEF 529
           ++I LG+QC+YVEG+SA+ P S+A+VTL++   + +   I+WSSSPD+LK+K+YFS+PE 
Sbjct: 421 DSITLGNQCLYVEGASAVLPPSVARVTLDQHKTKALSSEIVWSSSPDVLKYKAYFSVPEL 480

Query: 530 IEFPTEVPGQKAYAYFYPPSNPIYQASQDEKPPLLLKSHGGPTAETRGNLNPSIQYWTSR 589
           IEFPTEVPGQ AYAYFYPP+NP+Y AS +EKPPLL+KSHGGPTAE+RG+LN +IQYWTSR
Sbjct: 481 IEFPTEVPGQNAYAYFYPPTNPLYNASMEEKPPLLVKSHGGPTAESRGSLNLNIQYWTSR 540

Query: 590 GWGYVDVNYGGSTGMFATI------SW-----------LNFCVDSGKVDGERLCITGGSA 649
           GW +VDVNYGGSTG            W             + V SGK D +RLCI+GGSA
Sbjct: 541 GWAFVDVNYGGSTGYGREYRERLLRQWGIVDVDDCCGCAKYLVSSGKADVKRLCISGGSA 600

Query: 650 GGYTTLAALAFRDTFKAGASLYGIADLRLLRADTHKFESHYIDNLVGNEKDYFERSPINF 709
           GGYTTLA+LAFRD FKAGASLYG+ADL++L+ + HKFES YIDNLVG+EKD++ERSPINF
Sbjct: 601 GGYTTLASLAFRDVFKAGASLYGVADLKMLKEEGHKFESRYIDNLVGDEKDFYERSPINF 660

Query: 710 VDKFSCPIILFQGLEDKVVLPNQARKIYHALKDKGLPVALVEYEGEQHGFRKAENIKFTL 769
           VDKFSCPIILFQGLEDKVV P+Q+RKIY ALK KGLPVALVEYEGEQHGFRKAENIK+TL
Sbjct: 661 VDKFSCPIILFQGLEDKVVTPDQSRKIYEALKKKGLPVALVEYEGEQHGFRKAENIKYTL 720

Query: 770 EQQMMFFARSVGRFQVADDINPIKIDNFD 775
           EQQM+FFAR VG F+VADDI P+KIDNFD
Sbjct: 721 EQQMVFFARVVGGFKVADDITPLKIDNFD 729

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038896994.1 | 0.0e+00 | 88.12 | uncharacterized protein LOC120085177 [Benincasa hispida]
XP_023535387.1 | 0.0e+00 | 87.52 | uncharacterized protein LOC111796842 [Cucurbita pepo subsp. pepo] >XP_023535388....
KAG6591361.1 | 0.0e+00 | 87.11 | Dipeptidyl aminopeptidase BIII, partial [Cucurbita argyrosperma subsp. sororia]
KAG7024237.1 | 0.0e+00 | 87.11 | Dipeptidyl aminopeptidase BIII [Cucurbita argyrosperma subsp. argyrosperma]
XP_022936165.1 | 0.0e+00 | 86.85 | uncharacterized protein LOC111442847 [Cucurbita moschata] >XP_022936166.1 unchar...
Match Name | E-value | Identity | Description
V5YMB3 | 3.1e-17 | 28.21 | Dipeptidyl aminopeptidase BIII OS=Pseudoxanthomonas mexicana OX=128785 GN=dapb3 ...
P34422 | 3.1e-17 | 28.37 | Dipeptidyl peptidase family member 6 OS=Caenorhabditis elegans OX=6239 GN=dpf-6 ...
Q0IXP9 | 1.1e-09 | 22.48 | Acylamino-acid-releasing enzyme 1 OS=Oryza sativa subsp. japonica OX=39947 GN=Os...
Q6F3I7 | 1.5e-08 | 32.48 | Dipeptidyl aminopeptidase 4 OS=Pseudoxanthomonas mexicana OX=128785 GN=dap4 PE=1...
Q8R146 | 1.7e-07 | 25.11 | Acylamino-acid-releasing enzyme OS=Mus musculus OX=10090 GN=Apeh PE=1 SV=3
Match Name | E-value | Identity | Description
A0A6J1F7N8 | 0.0e+00 | 86.85 | uncharacterized protein LOC111442847 OS=Cucurbita moschata OX=3662 GN=LOC1114428...
A0A6J1IFZ3 | 0.0e+00 | 86.98 | uncharacterized protein LOC111474184 OS=Cucurbita maxima OX=3661 GN=LOC111474184...
A0A0A0L3I1 | 0.0e+00 | 84.32 | Peptidase_S9 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G63906...
A0A6J1C6I2 | 0.0e+00 | 83.40 | uncharacterized protein LOC111008851 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A1S3BU10 | 0.0e+00 | 83.51 | uncharacterized protein LOC103493519 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
Match Name | E-value | Identity | Description
AT5G36210.1 | 1.1e-272 | 62.75 | alpha/beta-Hydrolases superfamily protein
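
Each HSP header in the alignments above reports identity and positives as a fraction plus a percentage (for example, 49/218 and 22.48% for the Q0IXP9 match), and the Identity column of the summary tables repeats that percentage. As a minimal sketch, assuming the header layout shown in this record, the Python below parses such a line and recomputes both percentages from the fractions. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool; the pattern also accepts the "Postives" spelling that some exports of this page use.

```python
import re

# Pattern for the HSP statistics line as it appears in this record.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages as printed in the record.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Percentages recomputed from the fractions, rounded to 2 places.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    header = "Identity = 49/218 (22.48%), Positives = 94/218 (43.12%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

Comparing the recomputed values against the reported ones is a quick consistency check when these records are scraped into a table.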
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001375 | Peptidase S9, prolyl oligopeptidase, catalytic domain | PFAM | PF00326 | Peptidase_S9 | coord: 601..756; e-value: 1.5E-29; score: 103.0
IPR029058 | Alpha/Beta hydrolase fold | GENE3D | 3.40.50.1820 | alpha/beta hydrolase | coord: 513..762; e-value: 7.9E-60; score: 204.7
IPR029058 | Alpha/Beta hydrolase fold | SUPERFAMILY | 53474 | alpha/beta-Hydrolases | coord: 504..754
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 169..190
None | No IPR available | PANTHER | PTHR43056:SF14 | ALPHA/BETA HYDROLASE FOLD PROTEIN-RELATED | coord: 122..774
None | No IPR available | PANTHER | PTHR43056 | PEPTIDASE S9 PROLYL OLIGOPEPTIDASE | coord: 122..774
None | No IPR available | SUPERFAMILY | 82171 | DPP6 N-terminal domain-like | coord: 199..446

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10014100.1 | HG10014100.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0008236 | serine-type peptidase activity