MS005840 (gene) Bitter gourd (TR) v1

Overview
Name: MS005840
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: AB hydrolase-1 domain-containing protein
Location: scaffold254: 1954767 .. 1958525 (+)
RNA-Seq Expression: MS005840
Synteny: MS005840
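
The location string for this feature can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (common for genome browser records, but not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

# Hypothetical helper: splits a location string of the form
# "scaffold254: 1954767 .. 1958525 (+)" into its parts.
LOCATION = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    match = LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    scaffold, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("scaffold254: 1954767 .. 1958525 (+)")
print(location["scaffold"], location["strand"], location["length"])
```

Under that assumption the gene spans 3759 bp on the plus strand of scaffold254.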
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCGCTGCTTCCGCGGCAGCGCCGCCCCGTTTGATAAAGCCACTCCTTTTTTACTTCCACTCCTTCCCATGCCGCGCGCTCCCGTTAGTTCCACTCCCAAACCTTCTCTTCGCCTACGCCTACCATCGCCGGAGCTCGGTTCGTTCATCGGCAGTAATGGCCACCGAGACCAATCCCATTAATGCAGCATCGCCGCCGGATCACGCGACCGGGAAATGGTACTCCGTGCCGGAGCTCCGGCTCCGCGACCACCACTTCACTGTGCCTCTCGATTACTCTCTGGATCAGGATGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTTGGGTAAATTAATCAATCAGTAATCAAGCCTTAATTCGACTTTGATATTACTTTCGACATTTTGATTAATCCCTTTTTTTTTTTTTTTGGGTGCTCCTTTCTTATATATTTGAATGATCCAGGACCAATTATGGAGCATCGATGGTTGAAACTTAGAATTTCGTAGTATAGAGGTGACTAGTGTTGACCACTAGCAGAGTTTTCTCCATTCATTTATTAATTTTGTGGAGTTATGGTGTCAGGGAAGTTATAGATTGGTGTCATTGATGTTAATAGGAATTATGACACAACAACGAAATCAATTAATGGGCATCTAATAGAACAAACTTTGTGTTGCTTAGATAATTTTTTCCTTCATTAATGAACACTTCAACCACATGAATCAATTAACTTAATGGTATGCATGAAATCGTAATGAGAAAAGAACGAGAGTAAGAAAAAAAATATATGATTAGATGTGCTTTTAACATCTTTTGAAATCTCTTTAAATTATGATTAGTTTGTGTTGCTTTATTTGTAAATCTATACCCATTTCTACTGTGATATTCAGTCATACTTTTGTATGAATGATTATTATTTGAAAATTATGGATGCACTCATTGAGTAATAGTGGGGACTTTTATGCGTATGGGCTCTGAAAATATGTGGACATCTTAGTCAGAGTACAAATAATTATCAGGAATTGTGTAGAAGTTTTGAGTATTGCTGAAGCTACTGATTCAATGATTGAGGGTCTCCTTAAATATTGAATGATCTTATTTTAACTTAGATTTATCAGGAATTGTGTAGAATTTCTTTGATAATGATTAGCAATCATCCCTTATTTGAACTTAGATTTATTGTGCCAGTTTTTACATGTACATGGATTGTTGTATGATTTTAGCACTAGTTATCCTTAACATATGCAGAATGTGCATTGCAGTGGGGAAAGAAGAGCAGTCGATGCCATATCTTTTATACTTACAAGGTGGACCGGGATTTGAGAGTCCCCGACCGACCGAAGCAAGTGGATGGATGCAAAAAGCGTGTGAAGAATTTCGTGTTTTGTTGATGGATCAGGCATGACTTTTTGTTGACTACCTTTGGAAAAAGGCATTTGAATTTTGTAATATAAACAACTGTCTTCTTGATTGTTGATCCCGCATATATATACGTGTCATTGCAGCGAGGAACAGGATTATCAACTCCTCTGACTTCATCGTCTATGTCGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAACATTTTCGAGCTGACAACATCGTGAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGGACCTTGGACTATTTTGGGTCAGGTATTTAAATCCACTGTTTATTTTTTAGTTTCGAAATTTTGCTACCAACATGTTGAGCTTACTACTATTTTCTCAAACATGTTGAAGTTATATAAAGCAAATAGCAGACTTTTTGGCTTTGAGATTTTTGAATGTGGCTCTGCTTTATCTAAGTTATTGGCTTTTGTTGTATGGGGTAATTTACACCACTACAAGTTCAAGGAAATTCTCTGTTTCCCTGTGCAGAGCTTTGGTGGTTTTTGTGCAGTTACTTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATAAATGC
ACTGCGGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAACGAAAAGTACTACAAGCGGTATCCTCAGGATGTTGAAATCATCCGTGAAGTTGCGAAATACTTGGCTGAGCATGGTGGCGGGGTGAGGACAGTTGTCTTAAAACTCGAACTAATTGCTTAAAATTCTCTCAGTTCCCTTAGCCATTCCCTGAAAAACCACCCTTTTCCCTTCTGTTTTTGTTTAGGTTGTTCTTCCCTCTGGTGGTATCCTAACCCCCAAGGGGCTGCAAGTTCTTGGTCTTTTTGCTTTAGGATCTAGTACAGGTTTTGAGCGCTTGCATTATCTGTAAGGCCTCTTATTATTATTATTTTTTTTTTTTCTTTTTTCCTTTCTGGTCTCTATCATTTGGACCAGTGAATGGCCACCCAAGATACTGTAATCTAATTACTTTATATTTCCCACCTCAATTATGGCAGCTTTGAGAGAGTTTGGGATCCTGTAATAGTTCCTGGAGCGCCAAAACGAATCAGTTCTTTCTTCCTCCGCGCTGTTAGTATTCTACAGTTTCTATTGCTATTGAATATAACTATAGCTTCATTCTTGAATGAATCTAAATGATTAGACATATTTGCAGTGTGATAACTGGCTCTCACTTGATTCAAATCCTCTATATGTTCTTCTACATGAATCAATATATTGCCAGGTAGCTTTCTTCTTTTTCCTTTTGATTCTCAGAGTAGAACTATGTTTTGGCCCGTCAGAACTTTTTGTAGTGTTAACTTTTTGCCACTTCCTGTGCCGGTTTCCCGAAGGGTGCCTCATCTCGGTGGTCTGCTCAAAGAATAAAGAATGAACTTAAGAATAAGTTCGATGCAAATAACGCCCTGAAAGAAGGATGTCCCATGTTTTTCACAGGCGAGGTAACTGCACTATGCTCAACTGGTTAAAACACTTATATTTGACTAGAAGGTCGTGGAGGTCGTGAGTTTGAATCTCCATCTAAGTTTATAAGGCTATGCTTGACTTTGGGAATTTAAGCATAAAGATATAAAACTCCCATTGTGTATGCTCCATTAATTTGTTACACTTCTGCAGATGGTCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTGAAAGATGCTGCTCGTATATTGGCCGAGAAAGAGGATTGGCCTCCACTATATGACGTTGCCGCTCTTAGAAACAACAAGGTATTCTGTTTCAAGAGCTTCTGCATTTCCATGTTAGCAAAAATGAGCTTGTCTTTCTCCTACATCTTCTGAATGAGTGCGTAAAAATGATCATATTGATAACGTTTTTAACCCACAAAATCTCGCACAACGTTTCATAGAAGCACGTCCATCCGTCACGGGTAGCCTAGTGGCCACTTACTTCTACAAGTTTTCTTGACAACCAAATGTAGTAGAATCAAGTAGTCGTCCTAAGAGACTACTCGAGGTACTTGCAAGTTTGCTAGATAGATATTCGAAGTAAAAAAACCACTAGGATCAGCTTGGGACTCGTCTGTCAACGAAAATTAACATGTTTTCTGGATTCCCCTCACAGGTTCCAGTCGCAGCTGCTGTTTATTACGAAGATATGTTTGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAGATAGCAGGAATAAGGCTGTGGATAACCAATGAATATATGCATTCTGGTCTGCGAGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTC

mRNA sequence

ATGTTCGCTGCTTCCGCGGCAGCGCCGCCCCGTTTGATAAAGCCACTCCTTTTTTACTTCCACTCCTTCCCATGCCGCGCGCTCCCGTTAGTTCCACTCCCAAACCTTCTCTTCGCCTACGCCTACCATCGCCGGAGCTCGGTTCGTTCATCGGCAGTAATGGCCACCGAGACCAATCCCATTAATGCAGCATCGCCGCCGGATCACGCGACCGGGAAATGGTACTCCGTGCCGGAGCTCCGGCTCCGCGACCACCACTTCACTGTGCCTCTCGATTACTCTCTGGATCAGGATGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTTGGTGGGGAAAGAAGAGCAGTCGATGCCATATCTTTTATACTTACAAGGTGGACCGGGATTTGAGAGTCCCCGACCGACCGAAGCAAGTGGATGGATGCAAAAAGCGTGTGAAGAATTTCGTCGAGGAACAGGATTATCAACTCCTCTGACTTCATCGTCTATGTCGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAACATTTTCGAGCTGACAACATCGTGAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGGACCTTGGACTATTTTGGGTCAGAGCTTTGGTGGTTTTTGTGCAGTTACTTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATAAATGCACTGCGGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAACGAAAAGTACTACAAGCGGTATCCTCAGGATGTTGAAATCATCCGTGAAGTTGCGAAATACTTGGCTGAGCATGGTGGCGGGGTTGTTCTTCCCTCTGGTGGTATCCTAACCCCCAAGGGGCTGCAAGTTCTTGGTCTTTTTGCTTTAGGATCTAGTACAGGTTTTGAGCGCTTGCATTATCTCTTTGAGAGAGTTTGGGATCCTGTAATAGTTCCTGGAGCGCCAAAACGAATCAGTTCTTTCTTCCTCCGCGCTTGTGATAACTGGCTCTCACTTGATTCAAATCCTCTATATGTTCTTCTACATGAATCAATATATTGCCAGGGTGCCTCATCTCGGTGGTCTGCTCAAAGAATAAAGAATGAACTTAAGAATAAGTTCGATGCAAATAACGCCCTGAAAGAAGGATGTCCCATGTTTTTCACAGGCGAGATGGTCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTGAAAGATGCTGCTCGTATATTGGCCGAGAAAGAGGATTGGCCTCCACTATATGACGTTGCCGCTCTTAGAAACAACAAGGTTCCAGTCGCAGCTGCTGTTTATTACGAAGATATGTTTGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAGATAGCAGGAATAAGGCTGTGGATAACCAATGAATATATGCATTCTGGTCTGCGAGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTC

Coding sequence (CDS)

ATGTTCGCTGCTTCCGCGGCAGCGCCGCCCCGTTTGATAAAGCCACTCCTTTTTTACTTCCACTCCTTCCCATGCCGCGCGCTCCCGTTAGTTCCACTCCCAAACCTTCTCTTCGCCTACGCCTACCATCGCCGGAGCTCGGTTCGTTCATCGGCAGTAATGGCCACCGAGACCAATCCCATTAATGCAGCATCGCCGCCGGATCACGCGACCGGGAAATGGTACTCCGTGCCGGAGCTCCGGCTCCGCGACCACCACTTCACTGTGCCTCTCGATTACTCTCTGGATCAGGATGCTTCTCCTAAGATCTCCGTTTTTGCGCGGGAAGTTGTTTTGGTGGGGAAAGAAGAGCAGTCGATGCCATATCTTTTATACTTACAAGGTGGACCGGGATTTGAGAGTCCCCGACCGACCGAAGCAAGTGGATGGATGCAAAAAGCGTGTGAAGAATTTCGTCGAGGAACAGGATTATCAACTCCTCTGACTTCATCGTCTATGTCGCAATTTCAAAGTGCAGAGGACTTAGCCAACTACTTGAAACATTTTCGAGCTGACAACATCGTGAATGATGCTGAATTTATTAGGACTCGTCTTGTTCCTGATGCTGGACCTTGGACTATTTTGGGTCAGAGCTTTGGTGGTTTTTGTGCAGTTACTTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATTGGGAATAAATGCACTGCGGATTCTGTATATAGAGCATGCTTTGAAAAGGTTATAATTCAAAACGAAAAGTACTACAAGCGGTATCCTCAGGATGTTGAAATCATCCGTGAAGTTGCGAAATACTTGGCTGAGCATGGTGGCGGGGTTGTTCTTCCCTCTGGTGGTATCCTAACCCCCAAGGGGCTGCAAGTTCTTGGTCTTTTTGCTTTAGGATCTAGTACAGGTTTTGAGCGCTTGCATTATCTCTTTGAGAGAGTTTGGGATCCTGTAATAGTTCCTGGAGCGCCAAAACGAATCAGTTCTTTCTTCCTCCGCGCTTGTGATAACTGGCTCTCACTTGATTCAAATCCTCTATATGTTCTTCTACATGAATCAATATATTGCCAGGGTGCCTCATCTCGGTGGTCTGCTCAAAGAATAAAGAATGAACTTAAGAATAAGTTCGATGCAAATAACGCCCTGAAAGAAGGATGTCCCATGTTTTTCACAGGCGAGATGGTCTTCCCATGGATGTTTGACGAGATTCATGCCTTGAGACCATTGAAAGATGCTGCTCGTATATTGGCCGAGAAAGAGGATTGGCCTCCACTATATGACGTTGCCGCTCTTAGAAACAACAAGGTTCCAGTCGCAGCTGCTGTTTATTACGAAGATATGTTTGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAGATAGCAGGAATAAGGCTGTGGATAACCAATGAATATATGCATTCTGGTCTGCGAGATGCAGGGCCCCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTC

Protein sequence

MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEFRRGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
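
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; the function name `translate` is illustrative. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), built from the conventional
# TCAG ordering of bases at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGTTCGCTGCTTCCGCGGCAGCGCCGCCC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The result, MFAASAAAPP, matches the first ten residues of the protein sequence listed above. Passing the full CDS string to the same function would allow the whole record to be checked in the same way.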
Homology
BLAST of MS005840 vs. NCBI nr
Match: XP_022147514.1 (uncharacterized protein LOC111016418 [Momordica charantia])

HSP 1 Score: 1049.7 bits (2713), Expect = 8.4e-303
Identity = 511/517 (98.84%), Positives = 511/517 (98.84%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP
Sbjct: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
           INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM
Sbjct: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLLYLQGGPGFESPRPTEASGWMQKACEEFR      RGTGLSTPLTSSSMSQFQSAED
Sbjct: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFRVVLMDQRGTGLSTPLTSSSMSQFQSAED 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
Sbjct: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD
Sbjct: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE
Sbjct: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA
Sbjct: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 517
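
The percentages on each HSP statistics line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes them; the helper name `parse_hsp_stats` is hypothetical, and the pattern also tolerates the misspelling "Postives" in case a report uses it.

```python
import re

# Hypothetical helper for one HSP statistics line, e.g.
# "Identity = 511/517 (98.84%), Positives = 511/517 (98.84%), Query Frame = 0"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"unrecognized HSP statistics line: {line!r}")
    identical, length, reported_identity, positives, pos_length, reported_positives = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned_length": int(length),
        # Recompute the percentages from the raw counts.
        "identity_pct": round(100 * int(identical) / int(length), 2),
        "positives_pct": round(100 * int(positives) / int(pos_length), 2),
        # Keep the values printed in the report for comparison.
        "reported_identity_pct": float(reported_identity),
        "reported_positives_pct": float(reported_positives),
    }

stats = parse_hsp_stats(
    "Identity = 511/517 (98.84%), Positives = 511/517 (98.84%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the alignment above, 511 identical positions over an aligned length of 517 gives 98.84%, in agreement with the reported value.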

BLAST of MS005840 vs. NCBI nr
Match: XP_008437982.1 (PREDICTED: proline iminopeptidase [Cucumis melo] >TYK17630.1 proline iminopeptidase [Cucumis melo var. makuwa])

HSP 1 Score: 886.3 bits (2289), Expect = 1.2e-253
Identity = 432/517 (83.56%), Positives = 462/517 (89.36%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFA   AAP     PLL +FHS P R LPL+PLPN  F  A H R SVR SA MA   +P
Sbjct: 1   MFAVRTAAP-----PLLLHFHSLPFRLLPLIPLPN--FLSAAHCRRSVRLSAAMAGILSP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
             A SPP H  G WYSVPELRLRDHHF+VPL+YSLDQ +S +ISVFAREVV VGKE+Q M
Sbjct: 61  -RAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLLYLQGGPGFE  RP+EASGW+QKACEEFR      RGTGLSTPLT SSMSQF+SAED
Sbjct: 121 PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAED 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLA++GGGV+LPSG
Sbjct: 241 GGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLD
Sbjct: 301 GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFDAN A+KEGCP++FTGEM+FPWMFDE
Sbjct: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAMETASQIA
Sbjct: 421 IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLWITNE+MHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF 509

BLAST of MS005840 vs. NCBI nr
Match: XP_004133842.3 (uncharacterized protein LOC101216845 [Cucumis sativus] >KGN56478.1 hypothetical protein Csa_009643 [Cucumis sativus])

HSP 1 Score: 877.1 bits (2265), Expect = 7.4e-251
Identity = 426/517 (82.40%), Positives = 462/517 (89.36%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFAA  AAP     PLL +FHS PCR LPL+PL N  F  A H R SVR SA MA   +P
Sbjct: 1   MFAARTAAP-----PLLLHFHSLPCRVLPLIPLRN--FLSAAHCRRSVRLSAAMAGILSP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
             AASPP H +G WYSVPELRLRDHHF+VPL+YSL+Q +  +ISVFAREVV VGKE+Q M
Sbjct: 61  -RAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLL+LQGGPGFE  RPTEASGW+QKACEEFR      RGTGLSTPLT SSMSQFQS++D
Sbjct: 121 PYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDD 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLAE+GGGV+LPSG
Sbjct: 241 GGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++V G+PKRIS FFL A DNWLSLD
Sbjct: 301 GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHE+IYCQGASSRWSAQRIKNE++NKFDAN A+KEGC ++FTGEM+FPWMFDE
Sbjct: 361 SNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAM+TASQIA
Sbjct: 421 IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLW+TNE+MHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF 509

BLAST of MS005840 vs. NCBI nr
Match: XP_022933365.1 (uncharacterized protein LOC111440690 [Cucurbita moschata])

HSP 1 Score: 862.1 bits (2226), Expect = 2.5e-246
Identity = 417/498 (83.73%), Positives = 447/498 (89.76%), Query Frame = 0

Query: 20  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPE 79
           FHSFP  A  L+PL  LL   A H RSSVRS AVMA  TNP N ASPP+HA G WYSVPE
Sbjct: 6   FHSFPSPARSLIPLTRLL--SAVHCRSSVRSLAVMAA-TNPSNGASPPEHAAGTWYSVPE 65

Query: 80  LRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTE 139
           LRLRDH+F+VPL+YSLD  +SPKISV+AREVV VGKEEQ MPYLLYLQGGPGFE PRPTE
Sbjct: 66  LRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTE 125

Query: 140 ASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEF 199
           ASGW+QKACEEFR      RGTGLSTPL+ SSMSQFQSAEDLA+YLKHFRADNIVNDAEF
Sbjct: 126 ASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEF 185

Query: 200 IRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRAC 259
           IRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRAC
Sbjct: 186 IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRAC 245

Query: 260 FEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS 319
           FEK+IIQNEKYYKRYPQDV+I+ EV KYL E+GGG+ LP GGILTPKGLQ LGL ALGSS
Sbjct: 246 FEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSS 305

Query: 320 TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASS 379
           TGFER+HYLFERVWDP+IVPGAPKRIS FFL A   WLSLDSNPLY L+HESIYCQGASS
Sbjct: 306 TGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASS 365

Query: 380 RWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKED 439
           RWSAQRI NEL+NKFDA  A+KEGCP++FTGEM+FPWMFDEIHAL+P KDAA ILAEKED
Sbjct: 366 RWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKED 425

Query: 440 WPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAG 499
           WPPLYD+AAL+NNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNE+MHSGLRD G
Sbjct: 426 WPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGG 485

Query: 500 PQVLDHLMGLLNGKKPLF 512
           PQVLDHLMGLLNGKKPLF
Sbjct: 486 PQVLDHLMGLLNGKKPLF 499

BLAST of MS005840 vs. NCBI nr
Match: KAG6597118.1 (Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 860.5 bits (2222), Expect = 7.2e-246
Identity = 416/498 (83.53%), Positives = 447/498 (89.76%), Query Frame = 0

Query: 20  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPE 79
           FHSFP  A  L+PL  LL   A H RSSVRS AVMA  TNP N ASPP+HA G WYSVPE
Sbjct: 6   FHSFPSPARSLIPLTRLL--SAVHCRSSVRSLAVMAA-TNPSNGASPPEHAAGTWYSVPE 65

Query: 80  LRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTE 139
           LRLRDH+F+VPL+YSLD  +SPKISV+AREVV VGKEEQ MPYL+YLQGGPGFE PRPTE
Sbjct: 66  LRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPYLVYLQGGPGFECPRPTE 125

Query: 140 ASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEF 199
           ASGW+QKACEEFR      RGTGLSTPL+ SSMSQFQ+AEDLANYLKHFRADNIVNDAEF
Sbjct: 126 ASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQTAEDLANYLKHFRADNIVNDAEF 185

Query: 200 IRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRAC 259
           IRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRAC
Sbjct: 186 IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRAC 245

Query: 260 FEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS 319
           FEK+IIQNEKYYKRYPQDV+I+ EV KYL E+GGGV LP GGILTPKGLQ LGL ALGSS
Sbjct: 246 FEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGVPLPCGGILTPKGLQTLGLSALGSS 305

Query: 320 TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASS 379
           TGFER+HYLFERVWDP+IVPGAPKRIS FFL A   WLSLDSNPLY L+HESIYCQGASS
Sbjct: 306 TGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASS 365

Query: 380 RWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKED 439
           RWSAQRI+NEL+NKFD   A+KEGCP++FTGEM+FPWMFDEIHAL+P KDAA ILAEKED
Sbjct: 366 RWSAQRIRNELENKFDVIRAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKED 425

Query: 440 WPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAG 499
           WPPLYD+AAL+NNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNE+MHSGLRD G
Sbjct: 426 WPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGG 485

Query: 500 PQVLDHLMGLLNGKKPLF 512
           PQVLDHLMGLLNGKKPLF
Sbjct: 486 PQVLDHLMGLLNGKKPLF 499

BLAST of MS005840 vs. ExPASy Swiss-Prot
Match: P46547 (Proline iminopeptidase OS=Aeromonas sobria OX=646 GN=pip PE=1 SV=3)

HSP 1 Score: 366.7 bits (940), Expect = 4.3e-100
Identity = 195/435 (44.83%), Positives = 269/435 (61.84%), Query Frame = 0

Query: 75  YSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFES 134
           Y +  +    H FTVPLD+    D    I++F R +    + +  +P+LLYLQGGPGF +
Sbjct: 7   YVLDGIHCEPHFFTVPLDHQ-QPDDEETITLFGRTLCRKDRLDDELPWLLYLQGGPGFGA 66

Query: 135 PRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIV 194
           PRP+   GW+++A +EFR      RGTG STP+ +  ++     +  A+YL HFRAD+IV
Sbjct: 67  PRPSANGGWIKRALQEFRVLLLDQRGTGHSTPIHAELLAHLNPRQQ-ADYLSHFRADSIV 126

Query: 195 NDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADS 254
            DAE IR +L PD  PW++LGQSFGGFC++TYLS  P  L +V +TGG+ PIG   +AD 
Sbjct: 127 RDAELIREQLSPD-HPWSLLGQSFGGFCSLTYLSLFPDSLHEVYLTGGVAPIGR--SADE 186

Query: 255 VYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLF 314
           VYRA +++V  +N  ++ R+P    I   +A +L  H   V LP+G  LT + LQ  GL 
Sbjct: 187 VYRATYQRVADKNRAFFARFPHAQAIANRLATHLQRH--DVRLPNGQRLTVEQLQQQGL- 246

Query: 315 ALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYC 374
            LG+S  FE L+YL E  +         ++++  FL         ++NP++ +LHE IYC
Sbjct: 247 DLGASGAFEELYYLLEDAF-------IGEKLNPAFLYQVQAMQPFNTNPVFAILHELIYC 306

Query: 375 QGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARIL 434
           +GA+S W+A+R++ E    F A  A  +G    FTGEM+FPWMF++   L PLK+AA +L
Sbjct: 307 EGAASHWAAERVRGE----FPA-LAWAQGKDFAFTGEMIFPWMFEQFRELIPLKEAAHLL 366

Query: 435 AEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSG 494
           AEK DW PLYD   L  NKVPVA AVY EDM+V F  + ET   ++  R WITNEY H+G
Sbjct: 367 AEKADWGPLYDPVQLARNKVPVACAVYAEDMYVEFDYSRETLKGLSNSRAWITNEYEHNG 421

Query: 495 LRDAGPQVLDHLMGL 504
           LR  G Q+LD L+ L
Sbjct: 427 LRVDGEQILDRLIRL 421

BLAST of MS005840 vs. ExPASy Swiss-Prot
Match: A0A1L9WUM2 (Proline iminopeptidase aneH OS=Aspergillus aculeatus (strain ATCC 16872 / CBS 172.66 / WB 5094) OX=690307 GN=aneH PE=3 SV=1)

HSP 1 Score: 244.2 bits (622), Expect = 3.2e-63
Identity = 149/436 (34.17%), Positives = 231/436 (52.98%), Query Frame = 0

Query: 81  RLRDHHFTVPLDYSLDQDASPKISVFAREV-VLVGKEEQSMPYLLYLQGGPGFESPRPTE 140
           R  +  F VPL++S   + +  + +FAR +  ++G ++  +P++LYLQGGPG     P E
Sbjct: 25  RTSEWRFEVPLNHSKPDEGT--VRLFARSIHCVLGVDDPELPWMLYLQGGPGLGCKTPLE 84

Query: 141 ASGWMQKACEE-------FRRGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAE 200
            + W+    E+         RGTG S+P+T+ +++Q    +  A+ LK FRADNIV D E
Sbjct: 85  YA-WLPSILEKGYRVLFLDERGTGQSSPITAKTLAQQGDHKKQADLLKRFRADNIVRDCE 144

Query: 201 FIRTRLVPDA----GPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADS 260
            +R  L  DA      W+++  SFGGFCA++Y+S  P  L +V I GG  P+ N+     
Sbjct: 145 AVRKHLYQDAPADQSKWSVMAASFGGFCAISYVSMFPNSLVEVFIGGGPCPMVNE--PGQ 204

Query: 261 VYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLF 320
           V    F     +NE YYK+YP+DV  ++ + KYL E+    V  S G LTP+  Q LG+ 
Sbjct: 205 VIPRLFAVAARRNEVYYKKYPEDVGRVKRIIKYLKEN---KVALSKGTLTPERFQQLGVM 264

Query: 321 ALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYC 380
            LG   G + +H + +R  + +      K +++  L   +N   +  N +Y LL E +YC
Sbjct: 265 -LGLHGGIDYIHGVVQRTDNDL---DMFKFLTAPTLDLIEN-SGMAHNVIYSLLQEPMYC 324

Query: 381 QGASSRWSAQRIKNELKNKFDANNALKE-GCPMFFTGEMVFPWMFDEIHALRPLKDAARI 440
           QG +  W A + +     K D   +L E    ++FTGE +F  MF+    L+ LK  A +
Sbjct: 325 QGKAGGWCADKCR-----KADPRFSLNERNAQIWFTGEAIFSDMFESYDELKDLKPVAEL 384

Query: 441 LAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHS 500
           LA   DW  LY+ A L  N+VPV  A   EDM+V++ L   TAS++  ++  + N + H 
Sbjct: 385 LARSSDWGQLYNEAQLARNEVPVYVATAVEDMYVSYDLGCHTASKVKNLQQVVNNTWYHD 442

Query: 501 GLRDAGPQVLDHLMGL 504
            +     +V+  L  L
Sbjct: 445 AVETKASEVMPALFAL 442

BLAST of MS005840 vs. ExPASy TrEMBL
Match: A0A6J1D184 (uncharacterized protein LOC111016418 OS=Momordica charantia OX=3673 GN=LOC111016418 PE=3 SV=1)

HSP 1 Score: 1049.7 bits (2713), Expect = 4.1e-303
Identity = 511/517 (98.84%), Positives = 511/517 (98.84%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP
Sbjct: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
           INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM
Sbjct: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLLYLQGGPGFESPRPTEASGWMQKACEEFR      RGTGLSTPLTSSSMSQFQSAED
Sbjct: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFRVVLMDQRGTGLSTPLTSSSMSQFQSAED 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG
Sbjct: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD
Sbjct: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE
Sbjct: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA
Sbjct: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 517

BLAST of MS005840 vs. ExPASy TrEMBL
Match: A0A5D3D1Y5 (Proline iminopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G005010 PE=3 SV=1)

HSP 1 Score: 886.3 bits (2289), Expect = 5.9e-254
Identity = 432/517 (83.56%), Positives = 462/517 (89.36%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFA   AAP     PLL +FHS P R LPL+PLPN  F  A H R SVR SA MA   +P
Sbjct: 1   MFAVRTAAP-----PLLLHFHSLPFRLLPLIPLPN--FLSAAHCRRSVRLSAAMAGILSP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
             A SPP H  G WYSVPELRLRDHHF+VPL+YSLDQ +S +ISVFAREVV VGKE+Q M
Sbjct: 61  -RAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLLYLQGGPGFE  RP+EASGW+QKACEEFR      RGTGLSTPLT SSMSQF+SAED
Sbjct: 121 PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAED 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLA++GGGV+LPSG
Sbjct: 241 GGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLD
Sbjct: 301 GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFDAN A+KEGCP++FTGEM+FPWMFDE
Sbjct: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAMETASQIA
Sbjct: 421 IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLWITNE+MHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF 509

BLAST of MS005840 vs. ExPASy TrEMBL
Match: A0A1S3AUX5 (proline iminopeptidase OS=Cucumis melo OX=3656 GN=LOC103483239 PE=3 SV=1)

HSP 1 Score: 886.3 bits (2289), Expect = 5.9e-254
Identity = 432/517 (83.56%), Positives = 462/517 (89.36%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFA   AAP     PLL +FHS P R LPL+PLPN  F  A H R SVR SA MA   +P
Sbjct: 1   MFAVRTAAP-----PLLLHFHSLPFRLLPLIPLPN--FLSAAHCRRSVRLSAAMAGILSP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
             A SPP H  G WYSVPELRLRDHHF+VPL+YSLDQ +S +ISVFAREVV VGKE+Q M
Sbjct: 61  -RAPSPPVHVAGTWYSVPELRLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLLYLQGGPGFE  RP+EASGW+QKACEEFR      RGTGLSTPLT SSMSQF+SAED
Sbjct: 121 PYLLYLQGGPGFECARPSEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAED 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLA++GGGV+LPSG
Sbjct: 241 GGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++VPGAPKRIS FFL A DNWLSLD
Sbjct: 301 GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHESIYCQGASSRWSAQRIKNE++NKFDAN A+KEGCP++FTGEM+FPWMFDE
Sbjct: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAMETASQIA
Sbjct: 421 IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLWITNE+MHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWITNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF 509

BLAST of MS005840 vs. ExPASy TrEMBL
Match: A0A0A0L423 (AB hydrolase-1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G121050 PE=3 SV=1)

HSP 1 Score: 877.1 bits (2265), Expect = 3.6e-251
Identity = 426/517 (82.40%), Positives = 462/517 (89.36%), Query Frame = 0

Query: 1   MFAASAAAPPRLIKPLLFYFHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNP 60
           MFAA  AAP     PLL +FHS PCR LPL+PL N  F  A H R SVR SA MA   +P
Sbjct: 1   MFAARTAAP-----PLLLHFHSLPCRVLPLIPLRN--FLSAAHCRRSVRLSAAMAGILSP 60

Query: 61  INAASPPDHATGKWYSVPELRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSM 120
             AASPP H +G WYSVPELRLRDHHF+VPL+YSL+Q +  +ISVFAREVV VGKE+Q M
Sbjct: 61  -RAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPM 120

Query: 121 PYLLYLQGGPGFESPRPTEASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAED 180
           PYLL+LQGGPGFE  RPTEASGW+QKACEEFR      RGTGLSTPLT SSMSQFQS++D
Sbjct: 121 PYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDD 180

Query: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLIT 240
           LANYLKHFRADNIVNDAEFIRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLIT
Sbjct: 181 LANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLIT 240

Query: 241 GGIPPIGNKCTADSVYRACFEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSG 300
           GGIPPIGN CTADSVYRACFEKVIIQNEKYYKRYPQD+EI+REV KYLAE+GGGV+LPSG
Sbjct: 241 GGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSG 300

Query: 301 GILTPKGLQVLGLFALGSSTGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLD 360
           GILTPKGLQ LGL ALG+STGFERLHYLFERVWDP++V G+PKRIS FFL A DNWLSLD
Sbjct: 301 GILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLD 360

Query: 361 SNPLYVLLHESIYCQGASSRWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDE 420
           SNPLYVLLHE+IYCQGASSRWSAQRIKNE++NKFDAN A+KEGC ++FTGEM+FPWMFDE
Sbjct: 361 SNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDE 420

Query: 421 IHALRPLKDAARILAEKEDWPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIA 480
           IHALRP KDAA ILA+KEDWPPLYD+AAL+NNKVPVAAAVYYEDMFVNFKLAM+TASQIA
Sbjct: 421 IHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIA 480

Query: 481 GIRLWITNEYMHSGLRDAGPQVLDHLMGLLNGKKPLF 512
           GIRLW+TNE+MHSGLRDAGPQVLDHLMGLLNGKKPLF
Sbjct: 481 GIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF 509

BLAST of MS005840 vs. ExPASy TrEMBL
Match: A0A6J1F4P5 (uncharacterized protein LOC111440690 OS=Cucurbita moschata OX=3662 GN=LOC111440690 PE=3 SV=1)

HSP 1 Score: 862.1 bits (2226), Expect = 1.2e-246
Identity = 417/498 (83.73%), Positives = 447/498 (89.76%), Query Frame = 0

Query: 20  FHSFPCRALPLVPLPNLLFAYAYHRRSSVRSSAVMATETNPINAASPPDHATGKWYSVPE 79
           FHSFP  A  L+PL  LL   A H RSSVRS AVMA  TNP N ASPP+HA G WYSVPE
Sbjct: 6   FHSFPSPARSLIPLTRLL--SAVHCRSSVRSLAVMAA-TNPSNGASPPEHAAGTWYSVPE 65

Query: 80  LRLRDHHFTVPLDYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTE 139
           LRLRDH+F+VPL+YSLD  +SPKISV+AREVV VGKEEQ MPYLLYLQGGPGFE PRPTE
Sbjct: 66  LRLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTE 125

Query: 140 ASGWMQKACEEFR------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEF 199
           ASGW+QKACEEFR      RGTGLSTPL+ SSMSQFQSAEDLA+YLKHFRADNIVNDAEF
Sbjct: 126 ASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEF 185

Query: 200 IRTRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRAC 259
           IRTRLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVLITGGIPPIGN CTADSVYRAC
Sbjct: 186 IRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRAC 245

Query: 260 FEKVIIQNEKYYKRYPQDVEIIREVAKYLAEHGGGVVLPSGGILTPKGLQVLGLFALGSS 319
           FEK+IIQNEKYYKRYPQDV+I+ EV KYL E+GGG+ LP GGILTPKGLQ LGL ALGSS
Sbjct: 246 FEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSS 305

Query: 320 TGFERLHYLFERVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASS 379
           TGFER+HYLFERVWDP+IVPGAPKRIS FFL A   WLSLDSNPLY L+HESIYCQGASS
Sbjct: 306 TGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASS 365

Query: 380 RWSAQRIKNELKNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKED 439
           RWSAQRI NEL+NKFDA  A+KEGCP++FTGEM+FPWMFDEIHAL+P KDAA ILAEKED
Sbjct: 366 RWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKED 425

Query: 440 WPPLYDVAALRNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAG 499
           WPPLYD+AAL+NNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNE+MHSGLRD G
Sbjct: 426 WPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGG 485

Query: 500 PQVLDHLMGLLNGKKPLF 512
           PQVLDHLMGLLNGKKPLF
Sbjct: 486 PQVLDHLMGLLNGKKPLF 499
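
The percentages in each HSP summary line are the matching-column counts divided by the alignment length, for example 417/498 for identity in the hit above. A minimal sketch of recomputing them from the report text follows; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches the HSP summary line of a text BLAST report.
# "Posi?tives" accepts either spelling variant of "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return identity/positive counts and recomputed percentages.

    Hypothetical helper for illustration; raises ValueError if the
    line does not look like an HSP summary line.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, _, positive, pos_len, _ = match.groups()
    identical, ident_len = int(identical), int(ident_len)
    positive, pos_len = int(positive), int(pos_len)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100.0 * identical / ident_len, 2),
        "positive_pct": round(100.0 * positive / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 417/498 (83.73%), Positives = 447/498 (89.76%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentages from the raw counts is a quick consistency check when values are copied between the alignment blocks and the summary tables.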

BLAST of MS005840 vs. TAIR 10
Match: AT3G61540.1 (alpha/beta-Hydrolases superfamily protein )

HSP 1 Score: 719.2 bits (1855), Expect = 2.4e-207
Identity = 346/487 (71.05%), Positives = 399/487 (81.93%), Query Frame = 0

Query: 33  LPNLLFAYAYHRRSSVRSSAVMATETN-PINAASPPDHATGKWYSVPELRLRDHHFTVPL 92
           +P L+  Y   R   V +S   A      +   S  +H TGKW+SVPELRLRDH F VPL
Sbjct: 32  VPGLIDFYRRRRFCRVITSMAEAGSVYVDVAGESKSEHVTGKWFSVPELRLRDHRFIVPL 91

Query: 93  DYSLDQDASPKISVFAREVVLVGKEEQSMPYLLYLQGGPGFESPRPTEASGWMQKACEEF 152
           DYS    +SPKI+VFARE+V VGKEEQ+MPYLLYLQGGPGFE PRP+EASGW+Q+ACEEF
Sbjct: 92  DYS---KSSPKITVFAREIVAVGKEEQAMPYLLYLQGGPGFEGPRPSEASGWIQRACEEF 151

Query: 153 R------RGTGLSTPLTSSSMSQFQSAEDLANYLKHFRADNIVNDAEFIRTRLVPDAGPW 212
           R      RGTGLSTPLT SSM QF+SA++LA+YL HFRADNIV DAEFIR RLVP A PW
Sbjct: 152 RVVLLDQRGTGLSTPLTCSSMLQFKSAKELADYLVHFRADNIVKDAEFIRVRLVPKADPW 211

Query: 213 TILGQSFGGFCAVTYLSFAPQGLKQVLITGGIPPIGNKCTADSVYRACFEKVIIQNEKYY 272
           TILGQSFGGFCA+TYLSFAP+GLKQVLITGGIPPIG  CTAD VY A FE+V  QNEKYY
Sbjct: 212 TILGQSFGGFCALTYLSFAPEGLKQVLITGGIPPIGKACTADDVYEAGFEQVARQNEKYY 271

Query: 273 KRYPQDVEIIREVAKYLAE-HGGGVVLPSGGILTPKGLQVLGLFALGSSTGFERLHYLFE 332
           KR+PQD+EI+RE+  YLAE  GGGV LPSGGILTPKGLQ LGL  LGSSTGFERLHY+ E
Sbjct: 272 KRFPQDIEIVRELVNYLAESEGGGVPLPSGGILTPKGLQTLGLSGLGSSTGFERLHYMLE 331

Query: 333 RVWDPVIVPGAPKRISSFFLRACDNWLSLDSNPLYVLLHESIYCQGASSRWSAQRIKNEL 392
           RVWDP++V GAPK IS FFL A ++W S D+NPLY LLHE+IYC+GASS WSA R++++ 
Sbjct: 332 RVWDPILVTGAPKCISQFFLNAFESWHSFDTNPLYALLHEAIYCEGASSGWSAHRLRDKY 391

Query: 393 KNKFDANNALKEGCPMFFTGEMVFPWMFDEIHALRPLKDAARILAEKEDWPPLYDVAALR 452
           + KFDA  A+KE  P+ FTGEM+FPWMFDEIHAL+P K AA +LA+KEDWPPLYDV  L+
Sbjct: 392 EYKFDAMKAVKESQPVLFTGEMIFPWMFDEIHALKPFKAAADLLAKKEDWPPLYDVPRLQ 451

Query: 453 NNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEYMHSGLRDAGPQVLDHLMGLL 512
           NNKVPVAAAVYYEDM+VNFKL  ETAS I+GIRLW+TNE+MHSGLRDAG Q++DHL+G++
Sbjct: 452 NNKVPVAAAVYYEDMYVNFKLVTETASHISGIRLWVTNEFMHSGLRDAGRQIIDHLLGMI 511

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022147514.1 | 8.4e-303 | 98.84 | uncharacterized protein LOC111016418 [Momordica charantia]
XP_008437982.1 | 1.2e-253 | 83.56 | PREDICTED: proline iminopeptidase [Cucumis melo] >TYK17630.1 proline iminopeptid...
XP_004133842.3 | 7.4e-251 | 82.40 | uncharacterized protein LOC101216845 [Cucumis sativus] >KGN56478.1 hypothetical ...
XP_022933365.1 | 2.5e-246 | 83.73 | uncharacterized protein LOC111440690 [Cucurbita moschata]
KAG6597118.1 | 7.2e-246 | 83.53 | Proline iminopeptidase, partial [Cucurbita argyrosperma subsp. sororia]

Match Name | E-value | Identity | Description
P46547 | 4.3e-100 | 44.83 | Proline iminopeptidase OS=Aeromonas sobria OX=646 GN=pip PE=1 SV=3
A0A1L9WUM2 | 3.2e-63 | 34.17 | Proline iminopeptidase aneH OS=Aspergillus aculeatus (strain ATCC 16872 / CBS 17...

Match Name | E-value | Identity | Description
A0A6J1D184 | 4.1e-303 | 98.84 | uncharacterized protein LOC111016418 OS=Momordica charantia OX=3673 GN=LOC111016...
A0A5D3D1Y5 | 5.9e-254 | 83.56 | Proline iminopeptidase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold4...
A0A1S3AUX5 | 5.9e-254 | 83.56 | proline iminopeptidase OS=Cucumis melo OX=3656 GN=LOC103483239 PE=3 SV=1
A0A0A0L423 | 3.6e-251 | 82.40 | AB hydrolase-1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G121...
A0A6J1F4P5 | 1.2e-246 | 83.73 | uncharacterized protein LOC111440690 OS=Cucurbita moschata OX=3662 GN=LOC1114406...

Match Name | E-value | Identity | Description
AT3G61540.1 | 2.4e-207 | 71.05 | alpha/beta-Hydrolases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000073 | Alpha/beta hydrolase fold-1 | PFAM | PF00561 | Abhydrolase_1 | coord: 153..236; e-value: 4.4E-6; score: 26.5
IPR029058 | Alpha/Beta hydrolase fold | GENE3D | 3.40.50.1820 | alpha/beta hydrolase | coord: 113..501; e-value: 1.1E-9; score: 40.3
IPR029058 | Alpha/Beta hydrolase fold | SUPERFAMILY | 53474 | alpha/beta-Hydrolases | coord: 114..501
None | No IPR available | PANTHER | PTHR43248 | 2-SUCCINYL-6-HYDROXY-2,4-CYCLOHEXADIENE-1-CARBOXYLATE SYNTHASE | coord: 64..511
None | No IPR available | PANTHER | PTHR43248:SF2 | PROLYL AMINOPEPTIDASE-RELATED | coord: 64..511

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MS005840.1 | MS005840.1 | mRNA