CmoCh04G020140 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G020140
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionLysosomal Pro-X carboxypeptidase
LocationCmo_Chr04 : 10745683 .. 10748048 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAGTACAGAATCCCAAGGCTTAGTCCAACCGACAGAGATTCTGAAGCTCTGTCCTCGCCTCTTTCGGATGATTTCAAGACATTTTATTACAACCAAACGTTAGATCACTTCAATTATAGGCCTGAAAGCTACACGACGTTCGCTCATAGATATATAATCAACTTTAAGTATTGGGGCGGCGCAAATTCCAGCGCTCCCATTCTTGCCTACTTGGGAGCTGAAGGTCCACTGGATAATGATTTGAACGTTGTAGGATTCTCGACTGATAATGCTGCTCAGTTTGGGGCTCTTCTTGTTTATATTGAGGTAAGAGCTTTCATGTTCTTGTGTGGTTCATATTTAATGCTTTGGAGGGTTTATATGATTTTGTTTTGTTTTGTTGTTAGCATCGTTATTATGGGAAATCGATACCCTTTGGATCAAGGGAGGTAGCATTGAAGAATGCAAGCACTTTAGGCTATTTCAACTCTGCTCAAGCAATAGCAGATTATGCGGATGTTCTTATACATGTTAAAAAGGAGTTGCATGCTAAAGATTCTCCTGTGATTGTTCTTGGTGGATCATATGGTGGAAGTAAGTGAGTTCTCGTTTAAAATTGTGAGATTTCACGTCGGTTGGAAAGGGGAATGAAGCATTTCTTACAAGGGTGTGGAAACGAATCGCTAGTCGATGTGTTTTAAAACTGTAAGGCTGATGACGATGCGTAACGGGCCAAAACGGACAATATCTACTAGCGATGGCCTTGAGTTGTTCCAAATGGTATCAGAGCTAGACATTGAGTAGCGTGCTAGCGAGGACACTGGGCCTCTAAGAGGGGTGGATTGTGAGATCCCACATCGGGTGGAGAGGGGAACGAAACTTTTCTTATAAGGGCGTGGAAGCCTCTCCCTAGCACACGCGTTTTAAAACCGTGAGGCTAACGACGATATGTAACGGGCCAAAGCAGACAATATGTGCTAGCGATGGCCTCGAGCTGTTGTCAATGGTATCAGAGCTAGACACCGGGTAGTGTGCCAGCAAGGACGCTGGGCCTCCAAGCGGGTGGATTGTGAGATCCCACATCGGTTGGAAAGGGGAACGAAACATTTCTTGTAAGGGTGTGGATGCCTCTTCCTAGCAGACGCGTTTTAAAACCGTGAGGCTAACGACGATACGTAACGGGCCAAAACGGACAATATCTGCTAGTGGTGGCCTTGAGCTGTTCCAAACGGTATCAGAGCCATACACTAGGCAGTGTGTTAGCGAGGACGCTGAGCCACCAAGAGCGGTGGATTGTGAGATCCCACATCAGTTGGAGAGGGGAACGAAACACTTCTTGTAAGGGTGTGGAAGCCTCTCTCTAGCAAACGCGTTTTAATACCGTGAGGCTAACCACGATACGTAACGTGCGAAGCGAACAATATCTGCTAGCGGTTACAAAAATCTTCAAACTCTATATAACAGAACATGTTTTTTTTGTTTTGTTGGAAACAGTGTTGGCTGCATGGTTTCGTCTGAAATATCCTCATGTCGCCCTTGGAGCTCTTGCTTCTTCAGCTCCAATCCTTTACTTCGACAATATCACGCCACATGATGCATACTATTCCATTGCCACTAAGGATTTTAGAGTAAGATCCCTCAACAATAATGCATCCTTTTGAACCCTTTGAACTTTTTAAATACACAAAAATGTGAATGAATTTGCAGGACGTTAGTGAGAGTTGCTATGAAACCATTCGGGATTCCTGGTCCAAGATTGAAACAATTGCTTCCAAGCCTGATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAAGTTGCAGGTATTAGAATTCATGGCTTCTCATATCCAAATTTAATGAAGTTTGTGCTTCAAATGAGTTGTTCTTGTTTTAGTCCTCTGAAGAACTCCGCTCAGCTGGAAGACTTCTTGTGGTCTATGTATACCGTCGCAGCCCAATACGACCACCCACCAAGGTATCCAGTCACTATAATCTGTGATGCCATTGATGGAGCTTCATCAGGAAGTGGAATTGTTGAACGAATTGCTGCAGGCGTGTTTGCTTATAAAGGAAATCTTTCCTGCTACATGAATCAGGCCAGAGATGAAACTGAAACCGATGTGGGATGGAGGTGGCAGGTAATCTTTTTATTACACGAATACCGACGTGGGATGCATTGAACTTCAAAAAGCTTCATTAATTTATTTTTTATTTTTTGACGTTATAAGAAGCTTTTGTGTTGTGTGTGTAACTCAGAGATGCAGTGAAATGGTGATGCCATTAAGCACAGGCAATGATACTATGTTTCCAGCATGTAACTCAGAGATGCAGTGA

mRNA sequence

ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAGTACAGAATCCCAAGGCTTAGTCCAACCGACAGAGATTCTGAAGCTCTGTCCTCGCCTCTTTCGGATGATTTCAAGACATTTTATTACAACCAAACGTTAGATCACTTCAATTATAGGCCTGAAAGCTACACGACGTTCGCTCATAGATATATAATCAACTTTAAGTATTGGGGCGGCGCAAATTCCAGCGCTCCCATTCTTGCCTACTTGGGAGCTGAAGGTCCACTGGATAATGATTTGAACGTTGTAGGATTCTCGACTGATAATGCTGCTCAGTTTGGGGCTCTTCTTGTTTATATTGAGCATCGTTATTATGGGAAATCGATACCCTTTGGATCAAGGGAGGTAGCATTGAAGAATGCAAGCACTTTAGGCTATTTCAACTCTGCTCAAGCAATAGCAGATTATGCGGATGTTCTTATACATGTTAAAAAGGAGTTGCATGCTAAAGATTCTCCTGTGATTGTTCTTGGTGGATCATATGGTGGAATGTTGGCTGCATGGTTTCGTCTGAAATATCCTCATGTCGCCCTTGGAGCTCTTGCTTCTTCAGCTCCAATCCTTTACTTCGACAATATCACGCCACATGATGCATACTATTCCATTGCCACTAAGGATTTTAGAGACGTTAGTGAGAGTTGCTATGAAACCATTCGGGATTCCTGGTCCAAGATTGAAACAATTGCTTCCAAGCCTGATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAAGTTGCAGTCCTCTGAAGAACTCCGCTCAGCTGGAAGACTTCTTGTGGTCTATGTATACCGTCGCAGCCCAATACGACCACCCACCAAGGTATCCAGTCACTATAATCTGTGATGCCATTGATGGAGCTTCATCAGGAAGTGGAATTGTTGAACGAATTGCTGCAGGCGTGTTTGCTTATAAAGGAAATCTTTCCTGCTACATGAATCAGGCCAGAGATGAAACTGAAACCGATGTGGGATGGAGGTGGCAGAGATGCAGTGAAATGGTGATGCCATTAAGCACAGGCAATGATACTATGTTTCCAGCATGTAACTCAGAGATGCAGTGA

Coding sequence (CDS)

ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAGTACAGAATCCCAAGGCTTAGTCCAACCGACAGAGATTCTGAAGCTCTGTCCTCGCCTCTTTCGGATGATTTCAAGACATTTTATTACAACCAAACGTTAGATCACTTCAATTATAGGCCTGAAAGCTACACGACGTTCGCTCATAGATATATAATCAACTTTAAGTATTGGGGCGGCGCAAATTCCAGCGCTCCCATTCTTGCCTACTTGGGAGCTGAAGGTCCACTGGATAATGATTTGAACGTTGTAGGATTCTCGACTGATAATGCTGCTCAGTTTGGGGCTCTTCTTGTTTATATTGAGCATCGTTATTATGGGAAATCGATACCCTTTGGATCAAGGGAGGTAGCATTGAAGAATGCAAGCACTTTAGGCTATTTCAACTCTGCTCAAGCAATAGCAGATTATGCGGATGTTCTTATACATGTTAAAAAGGAGTTGCATGCTAAAGATTCTCCTGTGATTGTTCTTGGTGGATCATATGGTGGAATGTTGGCTGCATGGTTTCGTCTGAAATATCCTCATGTCGCCCTTGGAGCTCTTGCTTCTTCAGCTCCAATCCTTTACTTCGACAATATCACGCCACATGATGCATACTATTCCATTGCCACTAAGGATTTTAGAGACGTTAGTGAGAGTTGCTATGAAACCATTCGGGATTCCTGGTCCAAGATTGAAACAATTGCTTCCAAGCCTGATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAAGTTGCAGTCCTCTGAAGAACTCCGCTCAGCTGGAAGACTTCTTGTGGTCTATGTATACCGTCGCAGCCCAATACGACCACCCACCAAGGTATCCAGTCACTATAATCTGTGATGCCATTGATGGAGCTTCATCAGGAAGTGGAATTGTTGAACGAATTGCTGCAGGCGTGTTTGCTTATAAAGGAAATCTTTCCTGCTACATGAATCAGGCCAGAGATGAAACTGAAACCGATGTGGGATGGAGGTGGCAGAGATGCAGTGAAATGGTGATGCCATTAAGCACAGGCAATGATACTATGTTTCCAGCATGTAACTCAGAGATGCAGTGA
BLAST of CmoCh04G020140 vs. Swiss-Prot
Match: PCP_PONAB (Lysosomal Pro-X carboxypeptidase OS=Pongo abelii GN=PRCP PE=2 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 7.7e-56
Identity = 123/343 (35.86%), Postives = 194/343 (56.56%), Query Frame = 1

Query: 39  LSDDFKTFYYNQTLDHFNYRPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDND 98
           ++ ++   Y+ Q +DHF +   +  TF  RY++  KYW    +   IL Y G EG +   
Sbjct: 44  VAKNYSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWF 103

Query: 99  LNVVGFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADV 158
            N  GF  D A +  A+LV+ EHRYYG+S+PFG      K++  L +  S QA+AD+A++
Sbjct: 104 CNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN--TFKDSRHLNFLTSEQALADFAEL 163

Query: 159 LIHVKKELH-AKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDA 218
           + H+K+ +  A++ PVI +GGSYGGMLAAWFR+KYPH+ +GALA+SAPI  F+++ P   
Sbjct: 164 IKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGV 223

Query: 219 YYSIATKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPL--KNSAQLED 278
           +  I T DFR     C E+IR SW  I  +++   GL  L+     CSPL  ++   L+D
Sbjct: 224 FMKIVTTDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKD 283

Query: 279 FLWSMYTVAAQYDHP---------PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAY 338
           ++   +   A  D+P         P +P+ ++C  +   + S S +++ I   +   + Y
Sbjct: 284 WISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNY 343

Query: 339 KGNLSCY-MNQARDETETDVGWRWQRCSEMVMPLST-GNDTMF 364
            G + C  +++    +   +GW +Q C+E+VMP  T G D MF
Sbjct: 344 SGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMF 380

BLAST of CmoCh04G020140 vs. Swiss-Prot
Match: PCP_HUMAN (Lysosomal Pro-X carboxypeptidase OS=Homo sapiens GN=PRCP PE=1 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 2.3e-55
Identity = 122/343 (35.57%), Postives = 194/343 (56.56%), Query Frame = 1

Query: 39  LSDDFKTFYYNQTLDHFNYRPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDND 98
           ++ ++   Y+ Q +DHF +   +  TF  RY++  KYW    +   IL Y G EG +   
Sbjct: 44  VAKNYSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWF 103

Query: 99  LNVVGFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADV 158
            N  GF  D A +  A+LV+ EHRYYG+S+PFG    + K++  L +  S QA+AD+A++
Sbjct: 104 CNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAEL 163

Query: 159 LIHVKKELH-AKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDA 218
           + H+K+ +  A++ PVI +GGSYGGMLAAWFR+KYPH+ +GALA+SAPI  F+++ P   
Sbjct: 164 IKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGV 223

Query: 219 YYSIATKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPL--KNSAQLED 278
           +  I T DFR     C E+I  SW  I  +++   GL  L+     CSPL  ++   L+D
Sbjct: 224 FMKIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKD 283

Query: 279 FLWSMYTVAAQYDHP---------PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAY 338
           ++   +   A  D+P         P +P+ ++C  +   + S S +++ I   +   + Y
Sbjct: 284 WISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNY 343

Query: 339 KGNLSCY-MNQARDETETDVGWRWQRCSEMVMPLST-GNDTMF 364
            G + C  +++    +   +GW +Q C+E+VMP  T G D MF
Sbjct: 344 SGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMF 380

BLAST of CmoCh04G020140 vs. Swiss-Prot
Match: PCP_MOUSE (Lysosomal Pro-X carboxypeptidase OS=Mus musculus GN=Prcp PE=1 SV=2)

HSP 1 Score: 211.8 bits (538), Expect = 1.2e-53
Identity = 131/379 (34.56%), Postives = 200/379 (52.77%), Query Frame = 1

Query: 8   LLILSACVSATQYRIPRLSPTDRDSEALSSPLSDD-----FKTFYYNQTLDHFNYRPESY 67
           LL+LS  +      IP    T       +SP  D      +   Y+ Q +DHF +     
Sbjct: 6   LLLLSFLLLGAATTIPPRLKTLGSPHLSASPTPDPAVARKYSVLYFEQKVDHFGFA--DM 65

Query: 68  TTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHR 127
            TF  RY++  K+W    +   IL Y G EG +    N  GF  D A +  A+LV+ EHR
Sbjct: 66  RTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHR 125

Query: 128 YYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELH-AKDSPVIVLGGSYG 187
           YYG+S+PFG  + + K++  L +  S QA+AD+A+++ H++K +  A+  PVI +GGSYG
Sbjct: 126 YYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLEKTIPGAQGQPVIAIGGSYG 185

Query: 188 GMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSW 247
           GMLAAWFR+KYPH+ +GALA+SAPI   D + P   +  I T DFR     C E+IR SW
Sbjct: 186 GMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIVTNDFRKSGPYCSESIRKSW 245

Query: 248 SKIETIASKPDGLSILSKEFKSCSPLKNS--AQLEDFLWSMYTVAAQYDHP--------- 307
           + I+ ++    GL  L+     CSPL +     L+ ++   +   A  ++P         
Sbjct: 246 NVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIAETWVNLAMVNYPYACNFLQPL 305

Query: 308 PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAYKGNLSCY-MNQARDETETDVGWRW 364
           P +P+  +C  +   + S + +++ I   +   + Y G  +C  ++Q    +   +GW +
Sbjct: 306 PAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQAACLNISQTTTSSLGSMGWSF 365

BLAST of CmoCh04G020140 vs. Swiss-Prot
Match: PCP_BOVIN (Lysosomal Pro-X carboxypeptidase OS=Bos taurus GN=PRCP PE=2 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 1.4e-52
Identity = 140/387 (36.18%), Postives = 206/387 (53.23%), Query Frame = 1

Query: 7   LLLILSACVSATQYRIPRLSPTDRDSEALSSPLSDDFKTF----------YYNQTLDHFN 66
           LLL+L      T      +SP+ R   +L  P S  F++           Y  Q +DHF 
Sbjct: 6   LLLLLLLIAFLTPGAANPVSPSLRAPSSL--PWSTSFRSRPTITLKYSIRYIQQKVDHFG 65

Query: 67  YRPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALL 126
           +  +   TF  RY+I   YW     S  IL Y G EG +    N  GF  D A +  A+L
Sbjct: 66  FNIDR--TFKQRYLIADNYWKEDGGS--ILFYTGNEGDIIWFCNNTGFMWDIAEEMKAML 125

Query: 127 VYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELH-AKDSPVIV 186
           V+ EHRYYG+S+PFG+   +  ++  L +  + QA+AD+A ++ ++K+ +  A++  VI 
Sbjct: 126 VFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIRYLKRTIPGARNQHVIA 185

Query: 187 LGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYE 246
           LGGSYGGMLAAWFR+KYPH+ +GALASSAPI  F+++ P D +  I T DF     +C E
Sbjct: 186 LGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMKIVTTDFSQSGPNCSE 245

Query: 247 TIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNS---AQLEDFLWSMYTVAAQYDHP-- 306
           +IR SW  I  +A K  GL  LS+    C+PL  S    +L+D++   +   A  D+P  
Sbjct: 246 SIRRSWDAINRLAKKGTGLRWLSEALHLCTPLTKSQDVQRLKDWISETWVNVAMVDYPYE 305

Query: 307 -------PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAYKGNLSCYMNQARDETET 364
                  P +PV ++C     ++   + +V+ I   +   + Y G   C +N +   T +
Sbjct: 306 SNFLQPLPAWPVKVVCQYFKYSNVPDTVMVQNIFQALNVYYNYSGQAKC-LNVSETATSS 365

BLAST of CmoCh04G020140 vs. Swiss-Prot
Match: DPP2_HUMAN (Dipeptidyl peptidase 2 OS=Homo sapiens GN=DPP7 PE=1 SV=3)

HSP 1 Score: 186.4 bits (472), Expect = 5.6e-46
Identity = 114/345 (33.04%), Postives = 181/345 (52.46%), Query Frame = 1

Query: 43  FKTFYYNQTLDHFNYRPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVV 102
           F+  ++ Q LDHFN+      TF  R++++ ++W       PI  Y G EG +    N  
Sbjct: 31  FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 90

Query: 103 GFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHV 162
            F  + AA+ GALLV+ EHRYYGKS+PFG++     +   L      QA+AD+A++L  +
Sbjct: 91  AFVAELAAERGALLVFAEHRYYGKSLPFGAQSTQRGHTELL---TVEQALADFAELLRAL 150

Query: 163 KKELHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIA 222
           +++L A+D+P I  GGSYGGML+A+ R+KYPH+  GALA+SAP+L    +   + ++   
Sbjct: 151 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 210

Query: 223 TKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPL---KNSAQLEDFLWS 282
           T DF   S  C + +R+++ +I+ +  +      +  EF +C PL   K+  QL  F  +
Sbjct: 211 TADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARN 270

Query: 283 MYTVAAQYDHP---------PRYPVTIICDAIDGASSGSGIVERIAAGVFAYKGNLSCY- 342
            +TV A  D+P         P  PV + CD +   +     +  +A  V+   G+  CY 
Sbjct: 271 AFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCYD 330

Query: 343 ----MNQARDETETDVG-----WRWQRCSEMVMPLSTGNDT-MFP 365
                +   D T    G     W +Q C+E+ +  ++ N T MFP
Sbjct: 331 IYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFP 369

BLAST of CmoCh04G020140 vs. TrEMBL
Match: A0A0A0KAY8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149410 PE=4 SV=1)

HSP 1 Score: 620.2 bits (1598), Expect = 1.7e-174
Identity = 300/372 (80.65%), Postives = 331/372 (88.98%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP +L ILS CV+ATQYRIPRLSP  R    ++EA+ S +SDDFKTFYYNQTLDHFNYRP
Sbjct: 12  LPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAIPSSISDDFKTFYYNQTLDHFNYRP 71

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYT F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNAA+F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAARFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VLIH+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLIHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIE I SKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNLSCY    R ETETDVGWRWQRCSEMVMPLST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLSCYNIGPRSETETDVGWRWQRCSEMVMPLSTTN 371

Query: 364 DTMFPACNSEMQ 372
           DTMFP    +++
Sbjct: 372 DTMFPPITFDLK 383

BLAST of CmoCh04G020140 vs. TrEMBL
Match: A0A0A0KDH5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149400 PE=4 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 4.3e-154
Identity = 268/366 (73.22%), Postives = 311/366 (84.97%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP LLL LS  V+A Q+RIPRLSP        S+AL  P SDDFKTFY+NQTLDHFNYRP
Sbjct: 7   LPFLLLFLSNSVTAFQFRIPRLSPIGEKFLHHSKALELPPSDDFKTFYFNQTLDHFNYRP 66

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYTTF  RYIINFKYWGGANSSAPILAYLG E P+D+ +NV+GF TDNA +F ALLVYI
Sbjct: 67  ESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNVIGFMTDNAVKFNALLVYI 126

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKSIPFGSR+ AL+NASTLGYFNSAQA+ADYA +LIHVKKE  AK SPVIV+GGS
Sbjct: 127 EHRYYGKSIPFGSRKEALRNASTLGYFNSAQALADYAAILIHVKKEFSAKYSPVIVIGGS 186

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLA WFRLKYPHVALGALASSAPILYF++ITP + YY I TKDFR+VS++CYE+IR+
Sbjct: 187 YGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVIVTKDFREVSQTCYESIRE 246

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWS+IET+AS+ +GLS+L K FK+CSPL++S QLE++LW MY  AAQY+HP RYPV  IC
Sbjct: 247 SWSEIETVASQSNGLSVLDKVFKTCSPLRSSTQLENYLWFMYASAAQYNHPSRYPVNRIC 306

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
           DAID   S +G + +IAAGVFAY+G LSCY+N+  + TET VGW+WQRCSEMVMP+STGN
Sbjct: 307 DAIDQTYS-NGTLGKIAAGVFAYRGELSCYINEPINTTETTVGWQWQRCSEMVMPISTGN 366

Query: 364 DTMFPA 366
           DTMFP+
Sbjct: 367 DTMFPS 371

BLAST of CmoCh04G020140 vs. TrEMBL
Match: A0A0A0KBK9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149390 PE=4 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 3.8e-150
Identity = 261/374 (69.79%), Postives = 305/374 (81.55%), Query Frame = 1

Query: 4   LPCLLLILSACV--SATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNY 63
           +P LL + S  V  S    R PRLSP        S  L+S   DDFKT+YYNQTLDHFNY
Sbjct: 25  IPLLLFVFSTSVVTSLQHNRFPRLSPVGEKFLHHSRVLNSLPLDDFKTYYYNQTLDHFNY 84

Query: 64  RPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLV 123
           RPESYTTF  RYIINFKYWGG NSSAPI AYLGAE P+D+DL+ +GF TDNA QF ALL+
Sbjct: 85  RPESYTTFPQRYIINFKYWGGPNSSAPIFAYLGAEAPIDDDLDFIGFMTDNAIQFNALLI 144

Query: 124 YIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLG 183
           YIEHRYYGKSIPF SR+ AL NASTLGYFNSAQAIADYA +LIHVKKE HA  SPVIV+G
Sbjct: 145 YIEHRYYGKSIPFRSRDEALGNASTLGYFNSAQAIADYAAILIHVKKEFHANYSPVIVIG 204

Query: 184 GSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETI 243
           GSYGGMLA+WFRLKYPHVALGALASSAPILYFD+ITP D YYS+ TKDFR +SE+CYETI
Sbjct: 205 GSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSVVTKDFRGLSETCYETI 264

Query: 244 RDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTI 303
           + SWS+IET+A +P+GLSIL +EFK+C PL+   +LED+LWSMY  AAQY+HPP+YPVT 
Sbjct: 265 KKSWSEIETVAYQPNGLSILDQEFKTCRPLRGYFELEDYLWSMYASAAQYNHPPKYPVTR 324

Query: 304 ICDAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLST 363
           ICDAIDG  S +G + +IAAGVFA++G++SCY+N+ R+ETETDVGWRWQ CSEMVMP+ +
Sbjct: 325 ICDAIDGTYSVNGTLSKIAAGVFAFRGSVSCYINEPRNETETDVGWRWQSCSEMVMPIGS 384

Query: 364 GNDTMFPACNSEMQ 372
            +D MFP    ++Q
Sbjct: 385 -DDDMFPPSPFDLQ 397

BLAST of CmoCh04G020140 vs. TrEMBL
Match: A5C9W6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018664 PE=4 SV=1)

HSP 1 Score: 521.9 bits (1343), Expect = 6.2e-145
Identity = 252/358 (70.39%), Postives = 297/358 (82.96%), Query Frame = 1

Query: 8   LLILSACVSATQYRIPRLSPTDRDSEALSSPLSDDFKTFYYNQTLDHFNYRPESYTTFAH 67
           L+I   C +AT  ++PRLS   R+SE  S  +SDDF+TF+YNQTLDHFNYRPESY TF  
Sbjct: 19  LIIFPTCATATPSKLPRLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQ 78

Query: 68  RYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHRYYGKS 127
           RY++NFKYWGGAN+SAPI AYLGAE  LD DL  VGF  DNA QF ALLVYIEHRYYG+S
Sbjct: 79  RYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQS 138

Query: 128 IPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGSYGGMLAAW 187
           IPFGSRE ALKNAST GYFNSAQAIADYA+VL ++KK+L A++SPVIV+GGSYGGMLA+W
Sbjct: 139 IPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLASW 198

Query: 188 FRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSWSKIETI 247
           FRLKYPHVALGALASSAPILYFD+ITP + YYSI TKDFR+ SESCY TIR+SWS+I+ +
Sbjct: 199 FRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDRV 258

Query: 248 ASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIICDAIDGASS 307
           AS+P+GLSILSK+F++C+ L  S +L+D+L +MY VAAQY+HPPRYPVT++C  IDGA  
Sbjct: 259 ASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHPPRYPVTVVCGGIDGAPE 318

Query: 308 GSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTG-NDTMFP 365
           GS I+ RI AGV AY+GN SCY N + + TET  GWRWQ CSEMVMP+  G NDTMFP
Sbjct: 319 GSDILSRIFAGVVAYRGNSSCY-NTSVNPTETSEGWRWQTCSEMVMPIGRGDNDTMFP 375

BLAST of CmoCh04G020140 vs. TrEMBL
Match: V4SKU6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028519mg PE=4 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 2.2e-142
Identity = 246/370 (66.49%), Postives = 306/370 (82.70%), Query Frame = 1

Query: 3   LLPCLLLILSACVSATQYRIPRLSPTD----RDSEALSSPLSDDFKTFYYNQTLDHFNYR 62
           LL    +I S  VSAT++ IPRLSPT     ++ E LS+ +S+DF+TFYYNQTLDHFNYR
Sbjct: 11  LLYIFTVISSLQVSATRFNIPRLSPTRGTILQNPEILSATISEDFQTFYYNQTLDHFNYR 70

Query: 63  PESYTTFAHRYIINFKYWGG---ANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGAL 122
           PESY+TF  RY+INFKYWGG   A+++API  YLGAEGPLD D++++GF TDNAA+F AL
Sbjct: 71  PESYSTFQQRYVINFKYWGGGAGADANAPIFVYLGAEGPLDGDISIIGFLTDNAARFNAL 130

Query: 123 LVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIV 182
           LVYIEHRYYGKSIPFGSRE ALKNASTLGYFNSAQA+ DYA++L+++K++ +A+ SPVIV
Sbjct: 131 LVYIEHRYYGKSIPFGSREEALKNASTLGYFNSAQAVTDYAEILLYIKEKFNARHSPVIV 190

Query: 183 LGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYE 242
           +GGSYGGMLA WFRLKYPHVALGALASSAPILYFD+ITP + YYSI T+DFR+ SE+CYE
Sbjct: 191 IGGSYGGMLATWFRLKYPHVALGALASSAPILYFDDITPRNGYYSIVTRDFREASETCYE 250

Query: 243 TIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPV 302
           TI  SW++IE +ASKPDGLSILSK+F++C PLK+S +LED+L  +Y  AAQY+ PP+YPV
Sbjct: 251 TIMKSWAEIEKVASKPDGLSILSKKFRTCKPLKDSYELEDYLELVYADAAQYNQPPKYPV 310

Query: 303 TIICDAIDGASSG--SGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVM 362
            ++C  IDGA+ G  + I+ +I AGV A++GN+SCY+N    E+ET VGWRWQ CSEMV+
Sbjct: 311 NMLCGGIDGAALGTDNDILSKIFAGVVAHEGNMSCYVNPPTIESETTVGWRWQTCSEMVI 370

Query: 363 PLSTGNDTMF 364
           P+ T N TMF
Sbjct: 371 PIGTDNTTMF 380

BLAST of CmoCh04G020140 vs. TAIR10
Match: AT5G22860.1 (AT5G22860.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 382.5 bits (981), Expect = 3.0e-106
Identity = 199/385 (51.69%), Postives = 261/385 (67.79%), Query Frame = 1

Query: 2   MLLPCLLLILSACVSATQYRIP-------RL---SPTDRDSEALSSPLSDD--FKTFYYN 61
           M LP  +LIL    +++ Y IP       RL   S T ++    S+   D+   K +Y+N
Sbjct: 1   MSLPYTILILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPDGSTQKVDESNLKMYYFN 60

Query: 62  QTLDHFNYRPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNA 121
           QTLDHF + PESY TF  RY I+  +WGGA ++APILA+LG E  LD+DL  +GF  DN 
Sbjct: 61  QTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNG 120

Query: 122 AQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAK 181
            +  ALLVYIEHRYYG+++PFGS E ALKNASTLGY N+AQA+ADYA +L+HVK++    
Sbjct: 121 PRLNALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADYAAILLHVKEKYSTN 180

Query: 182 DSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDV 241
            SP+IV+GGSYGGMLAAWFRLKYPH+ALGALASSAP+LYF++  P   YY I TK F++ 
Sbjct: 181 HSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEA 240

Query: 242 SESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDH 301
           SE CY TIR+SW +I+ +A KP+GLSILSK+FK+C+PL  S  ++DFL ++Y  A QY+ 
Sbjct: 241 SERCYNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKDFLDTIYAEAVQYNR 300

Query: 302 PPRYPVTIICDAIDG--ASSGSGIVERIAAGVFAYKGNLSCY-MNQARDETETDVGWRWQ 361
            P + V  +C+AI+    +    +++RI AGV A  GN +CY        T  ++ WRWQ
Sbjct: 301 GPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVALVGNRTCYDTKMFAQPTNNNIAWRWQ 360

Query: 362 RCSEMVMPLS-TGNDTMFPACNSEM 371
            CSE+VMP+     DTMFP     M
Sbjct: 361 SCSEIVMPVGYDKQDTMFPTAPFNM 385

BLAST of CmoCh04G020140 vs. TAIR10
Match: AT5G65760.1 (AT5G65760.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 295.8 bits (756), Expect = 3.7e-80
Identity = 151/335 (45.07%), Postives = 215/335 (64.18%), Query Frame = 1

Query: 43  FKTFYYNQTLDHFNYRPESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVV 102
           ++T +++Q LDHF++       F+ RY+IN  +W GA++  PI  Y G EG ++      
Sbjct: 58  YETKFFSQQLDHFSFA--DLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNS 117

Query: 103 GFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHV 162
           GF  D A +FGALLV+ EHRYYG+S+P+GSRE A KNA+TL Y  + QA+AD+A  +  +
Sbjct: 118 GFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDL 177

Query: 163 KKELHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIA 222
           K+ L A+  PV++ GGSYGGMLAAW RLKYPH+A+GALASSAPIL F+++ P + +Y IA
Sbjct: 178 KRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIA 237

Query: 223 TKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYT 282
           + DF+  S SC+ TI+DSW  I     K +GL  L+K F  C  L ++  L D+L S Y+
Sbjct: 238 SNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYS 297

Query: 283 VAAQYDHP---------PRYPVTIICDAIDGASSGSGIVERIAAGV---FAYKGNLSCYM 342
             A  D+P         P +P+  +C  IDGA S + I++RI AG+   + Y GN+ C+ 
Sbjct: 298 YLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCF- 357

Query: 343 NQARDETETDVGWRWQRCSEMVMPLSTGND-TMFP 365
            +  D+     GW WQ C+EMVMP+S+  + +MFP
Sbjct: 358 -KLDDDPHGLDGWNWQACTEMVMPMSSNQENSMFP 388

BLAST of CmoCh04G020140 vs. TAIR10
Match: AT2G24280.1 (AT2G24280.1 alpha/beta-Hydrolases superfamily protein)

HSP 1 Score: 293.5 bits (750), Expect = 1.8e-79
Identity = 156/382 (40.84%), Postives = 228/382 (59.69%), Query Frame = 1

Query: 6   CLLLILSACVSATQYR---IPRLSPTDRDSEALSSPLSDDFKTFYYNQTLDHFNYRPESY 65
           CL+ +  + V+   Y       LS      +   S     F+T Y+ Q LDHF++ P+SY
Sbjct: 6   CLVFLFFSIVAEATYSPGGFHHLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSY 65

Query: 66  TTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHR 125
             F  +Y+IN ++W       PI  Y G EG +D   +  GF  D A +F ALLV+IEHR
Sbjct: 66  KVFHQKYLINNRFW---RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHR 125

Query: 126 YYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGSYGG 185
           +YG+S PFG +  + K+A TLGY NS QA+ADYA ++  +K+ L ++ SPV+V GGSYGG
Sbjct: 126 FYGESTPFGKK--SHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGG 185

Query: 186 MLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSWS 245
           MLAAWFRLKYPH+ +GALASSAPIL+FDNI P  ++Y   ++DF+D S +C++ I+ SW 
Sbjct: 186 MLAAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWE 245

Query: 246 KIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHP---------PRY 305
           ++E +++  +GL  LSK+F++C  L +     D+L   +   A  ++P         P Y
Sbjct: 246 ELEAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGY 305

Query: 306 PVTIICDAIDGASSGSGIVERIAAGV---FAYKGNLSCY-MNQARDETETDVGWRWQRCS 365
           PV  +C  IDG   GS  ++R  A     + Y G+  C+ M Q  D+   D GW++Q C+
Sbjct: 306 PVEQMCKIIDGFPRGSSNLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLD-GWQYQACT 365

Query: 366 EMVMPLSTGNDTMFPACNSEMQ 372
           EMVMP+S  N +M P   ++ +
Sbjct: 366 EMVMPMSCSNQSMLPPYENDSE 381

BLAST of CmoCh04G020140 vs. TAIR10
Match: AT3G28680.1 (AT3G28680.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 149.8 bits (377), Expect = 3.3e-36
Identity = 75/173 (43.35%), Postives = 116/173 (67.05%), Query Frame = 1

Query: 178 GSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETI 237
           G+   +LAAWF+LKYP++ALGALASSAP+LYF++  P   Y+ I TK F+++S+ C+  I
Sbjct: 18  GAVHKVLAAWFKLKYPYIALGALASSAPLLYFEDTLPKHGYFYIVTKVFKEMSKECHNKI 77

Query: 238 RDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTI 297
             SW +I+ IA+KP+ LSILSK FK C+PL +  +L+ ++  +Y   AQY    ++ V  
Sbjct: 78  HKSWDEIDRIAAKPNSLSILSKNFKLCNPLNDIIELKSYVSYIYARTAQYS-DNQFSVAR 137

Query: 298 ICDAIDGA--SSGSGIVERIAAGVFAYKGNLSCY--MNQARDETETDVGWRWQ 347
           +C+AI+ +  ++ S ++++I AGV A +GN+SCY   + +   T  D  W WQ
Sbjct: 138 LCEAINTSPPNTKSDLLDQIFAGVVASRGNISCYGMSSPSYQMTNDDRAWGWQ 189

BLAST of CmoCh04G020140 vs. TAIR10
Match: AT4G36190.1 (AT4G36190.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 100.9 bits (250), Expect = 1.7e-21
Identity = 100/345 (28.99%), Postives = 161/345 (46.67%), Query Frame = 1

Query: 47  YYNQTLDHFNYRPESYTTFAHRYIINFKYWGGAN-SSAPILAYLGAEGPLDNDLNVVGFS 106
           ++ QTLDH  Y P  +  F  RY   ++Y         PI   +  EGP +   N   + 
Sbjct: 49  WFTQTLDH--YSPSDHRKFRQRY---YEYLDHLRVPDGPIFLMICGEGPCNGITN--NYI 108

Query: 107 TDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKE 166
           +  A +F A +V +EHRYYGKS PF S  +A KN   L Y +S QA++D A    + +  
Sbjct: 109 SVLAKKFDAGIVSLEHRYYGKSSPFKS--LATKN---LKYLSSKQALSDLATFRQYYQDS 168

Query: 167 LHAK-------DSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAY 226
           L+ K       ++P    G SY G L+AWFRLK+PH+  G+LASSA +          A 
Sbjct: 169 LNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVV---------RAV 228

Query: 227 YSIATKDFRDVSES----CYETIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLE 286
           Y     D + ++ES    C   ++++   +E       GL + ++  K+   L N+ +L+
Sbjct: 229 YEFPEFD-QQIAESAGPECETALQETNKLLEL------GLKVNNRAVKA---LFNATELD 288

Query: 287 ---DFLWSMY---TVAAQYDHPPRYPVTIICDAIDGASSGSGIVERIA-------AGVFA 346
              DFL+ +     +A QY +P +  V +    ++   +G  +VE  A        GVF 
Sbjct: 289 VDADFLYLIADAGVMAIQYGNPDKLCVPL----VEAQKNGGDLVEAYAKYVREFCMGVFG 348

Query: 347 YKG---NLSCYMNQARDETETDVGWRWQRCSEMV-MPLSTGNDTM 363
                 +    ++ A      D  W +Q C+E+    ++  ND++
Sbjct: 349 QSSKTYSRKHLLDTAVTLESADRLWWFQVCTEVAYFQVAPANDSI 358

BLAST of CmoCh04G020140 vs. NCBI nr
Match: gi|449456064|ref|XP_004145770.1| (PREDICTED: lysosomal Pro-X carboxypeptidase [Cucumis sativus])

HSP 1 Score: 620.2 bits (1598), Expect = 2.4e-174
Identity = 300/372 (80.65%), Postives = 331/372 (88.98%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP +L ILS CV+ATQYRIPRLSP  R    ++EA+ S +SDDFKTFYYNQTLDHFNYRP
Sbjct: 12  LPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAIPSSISDDFKTFYYNQTLDHFNYRP 71

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYT F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNAA+F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAARFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VLIH+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLIHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIE I SKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNLSCY    R ETETDVGWRWQRCSEMVMPLST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLSCYNIGPRSETETDVGWRWQRCSEMVMPLSTTN 371

Query: 364 DTMFPACNSEMQ 372
           DTMFP    +++
Sbjct: 372 DTMFPPITFDLK 383

BLAST of CmoCh04G020140 vs. NCBI nr
Match: gi|659117557|ref|XP_008458664.1| (PREDICTED: lysosomal Pro-X carboxypeptidase isoform X2 [Cucumis melo])

HSP 1 Score: 618.2 bits (1593), Expect = 9.1e-174
Identity = 296/372 (79.57%), Postives = 333/372 (89.52%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           +P +L ILS CV+ATQYRIPRLSP  R    ++EA+SS +SDDFKTFYYNQ+LDHFNYRP
Sbjct: 12  VPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAISSSISDDFKTFYYNQSLDHFNYRP 71

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYT F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNA +F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAVRFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VL+H+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLLHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIETIASKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIETIASKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNL CY    R++TETDVGWRWQRCSEMVMP+ST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLPCYNIGPRNDTETDVGWRWQRCSEMVMPMSTSN 371

Query: 364 DTMFPACNSEMQ 372
           DTMFP    +++
Sbjct: 372 DTMFPPITFDLR 383

BLAST of CmoCh04G020140 vs. NCBI nr
Match: gi|659117555|ref|XP_008458663.1| (PREDICTED: lysosomal Pro-X carboxypeptidase isoform X1 [Cucumis melo])

HSP 1 Score: 618.2 bits (1593), Expect = 9.1e-174
Identity = 296/372 (79.57%), Postives = 333/372 (89.52%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           +P +L ILS CV+ATQYRIPRLSP  R    ++EA+SS +SDDFKTFYYNQ+LDHFNYRP
Sbjct: 12  VPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAISSSISDDFKTFYYNQSLDHFNYRP 71

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYT F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNA +F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAVRFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VL+H+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLLHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIETIASKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIETIASKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNL CY    R++TETDVGWRWQRCSEMVMP+ST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLPCYNIGPRNDTETDVGWRWQRCSEMVMPMSTSN 371

Query: 364 DTMFPACNSEMQ 372
           DTMFP    +++
Sbjct: 372 DTMFPPITFDLR 383

BLAST of CmoCh04G020140 vs. NCBI nr
Match: gi|659117553|ref|XP_008458662.1| (PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis melo])

HSP 1 Score: 555.1 bits (1429), Expect = 9.5e-155
Identity = 269/365 (73.70%), Postives = 310/365 (84.93%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP LLL LS  V+A Q+RIPRLSP        S+AL  P SDDFKTFY+NQTLDHFNYRP
Sbjct: 11  LPFLLLFLSNSVTAFQFRIPRLSPIGEKFLYHSKALELPPSDDFKTFYFNQTLDHFNYRP 70

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYTTF  RYIINFKYWGGANSSAPILAYLG E P+D+ +N +GF TDNA +F ALLVYI
Sbjct: 71  ESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNAIGFMTDNAVKFNALLVYI 130

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKSIPFGSR+ AL+NASTLGYFNSAQAIADYA +LIHVK E +AK SPVIV+GGS
Sbjct: 131 EHRYYGKSIPFGSRKEALRNASTLGYFNSAQAIADYAAILIHVKNEFNAKYSPVIVIGGS 190

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLA WFRLKYPHVALGALASSAPILYF++ITP + YY   TKDFR+VS++CYETIR+
Sbjct: 191 YGGMLATWFRLKYPHVALGALASSAPILYFNDITPQNGYYVTVTKDFREVSQTCYETIRE 250

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWS+IET+AS+P+GLS+L KEFK+CSPL++S QLE++LW MY  AAQY+HP  YPVT IC
Sbjct: 251 SWSEIETVASQPNGLSVLDKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPSSYPVTRIC 310

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
           DAID   S +G + +IAAGVFAY+GNLSCY+N+  + TET VGW+WQRCSEMVMP+ST N
Sbjct: 311 DAIDRTYS-NGTLGKIAAGVFAYRGNLSCYINEPINTTETTVGWQWQRCSEMVMPISTSN 370

Query: 364 DTMFP 365
           DTMFP
Sbjct: 371 DTMFP 374

BLAST of CmoCh04G020140 vs. NCBI nr
Match: gi|449456174|ref|XP_004145825.1| (PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus])

HSP 1 Score: 552.4 bits (1422), Expect = 6.2e-154
Identity = 268/366 (73.22%), Postives = 311/366 (84.97%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP LLL LS  V+A Q+RIPRLSP        S+AL  P SDDFKTFY+NQTLDHFNYRP
Sbjct: 7   LPFLLLFLSNSVTAFQFRIPRLSPIGEKFLHHSKALELPPSDDFKTFYFNQTLDHFNYRP 66

Query: 64  ESYTTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESYTTF  RYIINFKYWGGANSSAPILAYLG E P+D+ +NV+GF TDNA +F ALLVYI
Sbjct: 67  ESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNVIGFMTDNAVKFNALLVYI 126

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKSIPFGSR+ AL+NASTLGYFNSAQA+ADYA +LIHVKKE  AK SPVIV+GGS
Sbjct: 127 EHRYYGKSIPFGSRKEALRNASTLGYFNSAQALADYAAILIHVKKEFSAKYSPVIVIGGS 186

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLA WFRLKYPHVALGALASSAPILYF++ITP + YY I TKDFR+VS++CYE+IR+
Sbjct: 187 YGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVIVTKDFREVSQTCYESIRE 246

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWS+IET+AS+ +GLS+L K FK+CSPL++S QLE++LW MY  AAQY+HP RYPV  IC
Sbjct: 247 SWSEIETVASQSNGLSVLDKVFKTCSPLRSSTQLENYLWFMYASAAQYNHPSRYPVNRIC 306

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
           DAID   S +G + +IAAGVFAY+G LSCY+N+  + TET VGW+WQRCSEMVMP+STGN
Sbjct: 307 DAIDQTYS-NGTLGKIAAGVFAYRGELSCYINEPINTTETTVGWQWQRCSEMVMPISTGN 366

Query: 364 DTMFPA 366
           DTMFP+
Sbjct: 367 DTMFPS 371

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PCP_PONAB7.7e-5635.86Lysosomal Pro-X carboxypeptidase OS=Pongo abelii GN=PRCP PE=2 SV=1[more]
PCP_HUMAN2.3e-5535.57Lysosomal Pro-X carboxypeptidase OS=Homo sapiens GN=PRCP PE=1 SV=1[more]
PCP_MOUSE1.2e-5334.56Lysosomal Pro-X carboxypeptidase OS=Mus musculus GN=Prcp PE=1 SV=2[more]
PCP_BOVIN1.4e-5236.18Lysosomal Pro-X carboxypeptidase OS=Bos taurus GN=PRCP PE=2 SV=1[more]
DPP2_HUMAN5.6e-4633.04Dipeptidyl peptidase 2 OS=Homo sapiens GN=DPP7 PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A0A0KAY8_CUCSA1.7e-17480.65Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149410 PE=4 SV=1[more]
A0A0A0KDH5_CUCSA4.3e-15473.22Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149400 PE=4 SV=1[more]
A0A0A0KBK9_CUCSA3.8e-15069.79Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149390 PE=4 SV=1[more]
A5C9W6_VITVI6.2e-14570.39Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018664 PE=4 SV=1[more]
V4SKU6_9ROSI2.2e-14266.49Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028519mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G22860.13.0e-10651.69 Serine carboxypeptidase S28 family protein[more]
AT5G65760.13.7e-8045.07 Serine carboxypeptidase S28 family protein[more]
AT2G24280.11.8e-7940.84 alpha/beta-Hydrolases superfamily protein[more]
AT3G28680.13.3e-3643.35 Serine carboxypeptidase S28 family protein[more]
AT4G36190.11.7e-2128.99 Serine carboxypeptidase S28 family protein[more]
Match NameE-valueIdentityDescription
gi|449456064|ref|XP_004145770.1|2.4e-17480.65PREDICTED: lysosomal Pro-X carboxypeptidase [Cucumis sativus][more]
gi|659117557|ref|XP_008458664.1|9.1e-17479.57PREDICTED: lysosomal Pro-X carboxypeptidase isoform X2 [Cucumis melo][more]
gi|659117555|ref|XP_008458663.1|9.1e-17479.57PREDICTED: lysosomal Pro-X carboxypeptidase isoform X1 [Cucumis melo][more]
gi|659117553|ref|XP_008458662.1|9.5e-15573.70PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis melo][more]
gi|449456174|ref|XP_004145825.1|6.2e-15473.22PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR008758Peptidase_S28
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0008236serine-type peptidase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004180 carboxypeptidase activity
molecular_function GO:0008236 serine-type peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G020140.1CmoCh04G020140.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008758Peptidase S28PFAMPF05577Peptidase_S28coord: 50..364
score: 2.0
NoneNo IPR availablePANTHERPTHR11010PROTEASE S28 PRO-X CARBOXYPEPTIDASE-RELATEDcoord: 1..364
score: 1.4E
NoneNo IPR availablePANTHERPTHR11010:SF47PROLYLCARBOXYPEPTIDASE-LIKE PROTEIN-RELATEDcoord: 1..364
score: 1.4E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None