CmoCh04G020170 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G020170
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Lysosomal Pro-X carboxypeptidase, putative (EC 3.4.16.2)
Location: Cmo_Chr04 : 10759651 .. 10763180 (-)
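
The location string can be split into its parts and the span length derived from it. The following is a minimal sketch, not part of the original record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Pattern for the display format used on this page: "<seqid> : <start> .. <end> (<strand>)".
_LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand, and span length."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}


loc = parse_location("Cmo_Chr04 : 10759651 .. 10763180 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Cmo_Chr04 - 3530
```

Under that assumption the feature spans 3530 bp on the minus strand of Cmo_Chr04.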
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAGTACAGAATCCCAAGGCTTAGTCCAACCGACAGAGATTCTGAAGCTCTGTCCTCGCCTCTTTCGGATGATTTCAAGACATTTTATTACAACCAAACGTTAGATCACTTCAATTATAGGCCTGAAAGCTACATGACGTTCGCTCATAGATATATAATCAACTTTAAGTATTGGGGCGGCGCAAATTCCAGCGCTCCCATTCTTGCCTACTTGGGAGCTGAAGGTCCACTGGATAATGATTTGAACGTTGTAGGATTCTCGACTGATAATGCTGCTCAGTTTGGGGCTCTTCTTGTTTATATTGAGGTAAGAGCTTTCATGTTCTTGTGTGGTTCATATTTAATGCTTTGGAGGGTTTATATGATTTTATTTTGTTTTGTTGTTAGCATCGTTATTATGGGAAATCGATACCCTTTGGATCAAGGGAGGTAGCATTGAAGAATGCAAGCACTTTAGGCTATTTCAACTCTGCTCAAGCAATAGCAGATTATGCGGATGTTCTTATACATGTTAAAAAGGAGTTGCATGCTAAAGATTCTCCTGTGATTGTTCTTGGTGGATCATATGGTGGAAGTAAGTGAGTTCTCGTTTAAAATTGTGAGATTTCACGTCGGTTGGAAAGGGGAATGAAGCATTTCTTACAAGGGTGTGGAAACGAATCGCTAGTCGATGTGTTTTAAAACTGTAAGGCTGATGACGATGCGTAACGGGCCAAAACGGACAATATCTACTAGCGATGGCCTTGAGTTGTTCCAAATGGTATCAGAGCTAGACATTGAGTAGCGTGCTAGCGAGGACACTGGGCCTCTAAGAGGGGTGGATTGTGAGATCCCACATCGGGTGGAGAGGGGAACGAAACTTTTCTTATAAGGGCGTGGAAGCCTCTCCCTAGCACACGCGTTTTAAAACCGTGAGGCTAACGACGATATGTAACGGGCCAAAGCAGACAATATGTGCTAGCGATGGCCTCGAGCTGTTGTCAATGGTATCAGAGCTAGACACCGGGTAGTGTGCCAGCAAGGACGCTGGGCCTCCAAGCGGGTGGATTGTGAGATCCCACATCGGTTGGAAAGGGGAACGAAACATTTCTTGTAAGGGTGTGGATGCCTCTTCCTAGCAGACGCGTTTTAAAACCGTGAGGCTAACGACGATACGTAACGGGCCAAAACGGACAATATCTGCTAGTGGTGGCCTTGAGCTGTTCCAAACGGTATCAGAGCCATACACTAGGCAGTGTGTTAGCGAGGACGCTGAGCCACCAAGAGCGGTGGATTGTGAGATCCCACATCAGTTGGAGAGGGGAACGAAACACTTCTTGTAAGGGTGTGGAAGCCTCTCTCTAGCAAACGCGTTTTAATACCGTGAGGCTAACCACGATACGTAACGTGCGAAGCGAACAATATCTGCTAGCGGTTACAAAAATCTTCAAACTCTATATAACAGAACATGTTTTTTTTGTTTTGTTGGAAACAGTGTTGGCTGCATGGTTTCGTCTGAAATATCCTCATGTCGCCCTTGGAGCTCTTGCTTCTTCAGCTCCAATCCTTTACTTCGACAATATCACGCCACATGATGCATACTATTCCATTGCCACTAAGGATTTTAGAGTAAGATCCCTCAACAATAATGCATCCTTTTGAACCCTTTGAACTTTTTAAATACACAAAAATGTGAATGAATTTGCAGGACGTTAGTGAGAGTTGCTATGAAACCATTCGGGATTCCTGGTCCAAGATTGAAACAATTGCTTCCAAGCCTGATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAAGTTGCAGGTATTAGAATTCATGGCTTCTCATATCCAAATTTAATGAAGTTTGTGCTTCAAATGAGTTGTTCTTGTTTTAGTCCTCTGAAGAACTCCGCTCAGCTGGAAGACTTCTTGTGGTCTATGTATACCGTCGCAGCCCAATACGACCACC
CACCAAGGTATCCAGTCACTATAATCTGTGATGCCATTGATGGAGCTTCATCAGGAAGTGGAATTGTTGAACGAATTGCTGCAGGCGTGTTTGCTTATAAAGGAAATCTTTCCTGCTACATGAATCAGGCCAGAGATGAAACTGAAACCGATGTGGGATGGAGGTGGCAGGTAATCTTTTTATTACACGAATACCGACGTGGGATGCATTGAACTTCAAAAAGCTTCATTAATTTATTTTTTATTTTTTGACGTTATAAGAAGCTTTTGTGTTGTGTGTGTAACTCAGAGATGCAGTGAAATGGTGATGCCATTAAGCACAGGCAATGATACTATGTTTCCAGCATACAATTTTGAGCTTGGAAGCTTCATAGATTACTGCAATGAGTTATACGGTGTGTCTCCAAGGCCTCACTGGGTCACCACCTATTATGGAGGCCATGTATGCACTTCAACCACCTCTCTCTTTTTCTTTATTTTCTTTATTGAAATTACTGTAACCGCCCAAATCTACAGTAGATATTGTCTTTCTCTGACTTTCTTTTCCAGGCTTCCCCTCAAAGTTTTTAAAACTTCACCCCCGTAAAGAATGTTTTGTTCTCCTCCCCAACCAATGTGGAATCTCACAATCCACCCCCTCCCCCGGGGCCAGCGTCCTCGTTGGCACTCGTTCCTTTCTTCAATCGATGTGGGACTCCACAATCCACTCCACCCTCGAGGCCCAGCGTCCTTGCTAGCACACCGCCTCGTATCCACCCCCCTTCGGGGTGCATCCGACCATTCTGTTAGTGGATCTCACAATCCACTCCCTCGAGGCCCAGCGTCCTTACTGGCACACCGCCTCGTATCCACCCCCTTCGGGGCCCACCACTAAATGATATCATCCTCTTTGTGTTTCCCCTCAAGGTTTTTTATATTAGAGAGGTTTCCACGCCCTTATAAACAATGTTTTTTTTTTCTCCTCATCAACTGATATAGGATCTTACAATCCACCCCCCTCGGGACCCAGCGTCCTTGTTGGCACACCGCCTCGTGTCCACCCCCTTCAGGGATCAGCCTCCTCGCTGGCACATCACCCAGTGTCTGGCTCTGATACCATTTGTAACAGTCCAACCCCACCGCTAACATATATTATCCTCTTTGAGCTTTCCCTTTCAAGCTTCCCTTTAAGGTTTCTAAAACGCGTCTACTAGGGAGAGGTTTCCGCACGCTCATAAATAATAGTTCGTTCTCCTCCCTAACCGACGTAGAATCTCACAATTTCTTTGTCTCCCTTCAAATTTCAGGACATAAAACTCATCCTTAAGAGATTTGGCAGCAACGTCATTTTCTCCAATGGACTAAGGGACCCTTATAGCAGCTGCGGGTAAACGAAAACCAAAGCTACAACATTTATTGCACATAAAGAACAATCAACTTTACTTCAACTCTCTCTCTCTCTTTTTGTGAATGCAGAGTGTTGCATAACTTATCTGACAGACTCCTTGCACTCCATACGCCTAATGGTTTGATTCAGACTCCTAAATTCTAA

mRNA sequence

ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAGTACAGAATCCCAAGGCTTAGTCCAACCGACAGAGATTCTGAAGCTCTGTCCTCGCCTCTTTCGGATGATTTCAAGACATTTTATTACAACCAAACGTTAGATCACTTCAATTATAGGCCTGAAAGCTACATGACGTTCGCTCATAGATATATAATCAACTTTAAGTATTGGGGCGGCGCAAATTCCAGCGCTCCCATTCTTGCCTACTTGGGAGCTGAAGGTCCACTGGATAATGATTTGAACGTTGTAGGATTCTCGACTGATAATGCTGCTCAGTTTGGGGCTCTTCTTGTTTATATTGAGCATCGTTATTATGGGAAATCGATACCCTTTGGATCAAGGGAGGTAGCATTGAAGAATGCAAGCACTTTAGGCTATTTCAACTCTGCTCAAGCAATAGCAGATTATGCGGATGTTCTTATACATGTTAAAAAGGAGTTGCATGCTAAAGATTCTCCTGTGATTGTTCTTGGTGGATCATATGGTGGAATGTTGGCTGCATGGTTTCGTCTGAAATATCCTCATGTCGCCCTTGGAGCTCTTGCTTCTTCAGCTCCAATCCTTTACTTCGACAATATCACGCCACATGATGCATACTATTCCATTGCCACTAAGGATTTTAGAGACGTTAGTGAGAGTTGCTATGAAACCATTCGGGATTCCTGGTCCAAGATTGAAACAATTGCTTCCAAGCCTGATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAAGTTGCAGTCCTCTGAAGAACTCCGCTCAGCTGGAAGACTTCTTGTGGTCTATGTATACCGTCGCAGCCCAATACGACCACCCACCAAGGTATCCAGTCACTATAATCTGTGATGCCATTGATGGAGCTTCATCAGGAAGTGGAATTGTTGAACGAATTGCTGCAGGCGTGTTTGCTTATAAAGGAAATCTTTCCTGCTACATGAATCAGGCCAGAGATGAAACTGAAACCGATGTGGGATGGAGGTGGCAGAGATGCAGTGAAATGGTGATGCCATTAAGCACAGGCAATGATACTATGTTTCCAGCATACAATTTTGAGCTTGGAAGCTTCATAGATTACTGCAATGAGTTATACGGTGTGTCTCCAAGGCCTCACTGGGTCACCACCTATTATGGAGGCCATGACATAAAACTCATCCTTAAGAGATTTGGCAGCAACGTCATTTTCTCCAATGGACTAAGGGACCCTTATAGCAGCTGCGGACTCCTTGCACTCCATACGCCTAATGGTTTGATTCAGACTCCTAAATTCTAA

Coding sequence (CDS)

ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAGTACAGAATCCCAAGGCTTAGTCCAACCGACAGAGATTCTGAAGCTCTGTCCTCGCCTCTTTCGGATGATTTCAAGACATTTTATTACAACCAAACGTTAGATCACTTCAATTATAGGCCTGAAAGCTACATGACGTTCGCTCATAGATATATAATCAACTTTAAGTATTGGGGCGGCGCAAATTCCAGCGCTCCCATTCTTGCCTACTTGGGAGCTGAAGGTCCACTGGATAATGATTTGAACGTTGTAGGATTCTCGACTGATAATGCTGCTCAGTTTGGGGCTCTTCTTGTTTATATTGAGCATCGTTATTATGGGAAATCGATACCCTTTGGATCAAGGGAGGTAGCATTGAAGAATGCAAGCACTTTAGGCTATTTCAACTCTGCTCAAGCAATAGCAGATTATGCGGATGTTCTTATACATGTTAAAAAGGAGTTGCATGCTAAAGATTCTCCTGTGATTGTTCTTGGTGGATCATATGGTGGAATGTTGGCTGCATGGTTTCGTCTGAAATATCCTCATGTCGCCCTTGGAGCTCTTGCTTCTTCAGCTCCAATCCTTTACTTCGACAATATCACGCCACATGATGCATACTATTCCATTGCCACTAAGGATTTTAGAGACGTTAGTGAGAGTTGCTATGAAACCATTCGGGATTCCTGGTCCAAGATTGAAACAATTGCTTCCAAGCCTGATGGCCTTTCCATTCTTAGCAAAGAGTTCAAAAGTTGCAGTCCTCTGAAGAACTCCGCTCAGCTGGAAGACTTCTTGTGGTCTATGTATACCGTCGCAGCCCAATACGACCACCCACCAAGGTATCCAGTCACTATAATCTGTGATGCCATTGATGGAGCTTCATCAGGAAGTGGAATTGTTGAACGAATTGCTGCAGGCGTGTTTGCTTATAAAGGAAATCTTTCCTGCTACATGAATCAGGCCAGAGATGAAACTGAAACCGATGTGGGATGGAGGTGGCAGAGATGCAGTGAAATGGTGATGCCATTAAGCACAGGCAATGATACTATGTTTCCAGCATACAATTTTGAGCTTGGAAGCTTCATAGATTACTGCAATGAGTTATACGGTGTGTCTCCAAGGCCTCACTGGGTCACCACCTATTATGGAGGCCATGACATAAAACTCATCCTTAAGAGATTTGGCAGCAACGTCATTTTCTCCAATGGACTAAGGGACCCTTATAGCAGCTGCGGACTCCTTGCACTCCATACGCCTAATGGTTTGATTCAGACTCCTAAATTCTAA
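
The BLAST queries below are protein alignments, so the coding sequence above has to be translated first. A minimal sketch of that step follows, assuming the standard genetic code and reading frame 1 (as the "Query Frame = 1" fields suggest); the helper name `translate` is hypothetical and only short excerpts of the CDS are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 57 nt and last 12 nt of the coding sequence listed above.
print(translate("ATGATGCTTCTTCCCTGCTTACTTCTCATCCTTTCAGCCTGTGTTTCTGCAACACAG"))  # MMLLPCLLLILSACVSATQ
print(translate("CCTAAATTCTAA"))  # PKF
```

The first excerpt reproduces the start of the query shown in the alignments (for example "LLILSACVSATQ" from query position 8), which is a quick sanity check that the frame is right.
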
BLAST of CmoCh04G020170 vs. Swiss-Prot
Match: PCP_PONAB (Lysosomal Pro-X carboxypeptidase OS=Pongo abelii GN=PRCP PE=2 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 1.7e-70
Identity = 147/405 (36.30%), Positives = 230/405 (56.79%), Query Frame = 1

Query: 39  LSDDFKTFYYNQTLDHFNYRPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDND 98
           ++ ++   Y+ Q +DHF +   +  TF  RY++  KYW    +   IL Y G EG +   
Sbjct: 44  VAKNYSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWF 103

Query: 99  LNVVGFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADV 158
            N  GF  D A +  A+LV+ EHRYYG+S+PFG      K++  L +  S QA+AD+A++
Sbjct: 104 CNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN--TFKDSRHLNFLTSEQALADFAEL 163

Query: 159 LIHVKKELH-AKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDA 218
           + H+K+ +  A++ PVI +GGSYGGMLAAWFR+KYPH+ +GALA+SAPI  F+++ P   
Sbjct: 164 IKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGV 223

Query: 219 YYSIATKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPL--KNSAQLED 278
           +  I T DFR     C E+IR SW  I  +++   GL  L+     CSPL  ++   L+D
Sbjct: 224 FMKIVTTDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKD 283

Query: 279 FLWSMYTVAAQYDHP---------PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAY 338
           ++   +   A  D+P         P +P+ ++C  +   + S S +++ I   +   + Y
Sbjct: 284 WISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNY 343

Query: 339 KGNLSCY-MNQARDETETDVGWRWQRCSEMVMPLST-GNDTMFPAYNFELGSFIDYCNEL 398
            G + C  +++    +   +GW +Q C+E+VMP  T G D MF  +++ L    D C + 
Sbjct: 344 SGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQ 403

Query: 399 YGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDPYSSCGL 426
           +GV PRP W+TT YGG +I        +N++FSNG  DP+S  G+
Sbjct: 404 WGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGV 437
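
The percentages in each HSP summary line are the matched counts divided by the alignment length (for example 147/405 for identity in the hit above). A small sketch for re-deriving them from a summary line follows; the function name `parse_hsp_stats` is hypothetical, and the label match is deliberately loose so that spelling variants of the second field are tolerated.

```python
import re

# Matches "<label> = <count>/<length> (<percent>%)" fields in a summary line.
_FRACTION_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {field: (count, aligned_length, recomputed_percent)} for one summary line."""
    stats = {}
    for label, num, den, _reported in _FRACTION_RE.findall(line):
        num, den = int(num), int(den)
        # Anything that is not the identity field is treated as the positives field.
        key = "identity" if label.lower().startswith("id") else "positives"
        stats[key] = (num, den, round(100.0 * num / den, 2))
    return stats


stats = parse_hsp_stats(
    "Identity = 147/405 (36.30%), Positives = 230/405 (56.79%), Query Frame = 1"
)
print(stats["identity"])   # (147, 405, 36.3)
print(stats["positives"])  # (230, 405, 56.79)
```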

BLAST of CmoCh04G020170 vs. Swiss-Prot
Match: PCP_HUMAN (Lysosomal Pro-X carboxypeptidase OS=Homo sapiens GN=PRCP PE=1 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 5.0e-70
Identity = 146/405 (36.05%), Positives = 230/405 (56.79%), Query Frame = 1

Query: 39  LSDDFKTFYYNQTLDHFNYRPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDND 98
           ++ ++   Y+ Q +DHF +   +  TF  RY++  KYW    +   IL Y G EG +   
Sbjct: 44  VAKNYSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWF 103

Query: 99  LNVVGFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADV 158
            N  GF  D A +  A+LV+ EHRYYG+S+PFG    + K++  L +  S QA+AD+A++
Sbjct: 104 CNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAEL 163

Query: 159 LIHVKKELH-AKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDA 218
           + H+K+ +  A++ PVI +GGSYGGMLAAWFR+KYPH+ +GALA+SAPI  F+++ P   
Sbjct: 164 IKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGV 223

Query: 219 YYSIATKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPL--KNSAQLED 278
           +  I T DFR     C E+I  SW  I  +++   GL  L+     CSPL  ++   L+D
Sbjct: 224 FMKIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKD 283

Query: 279 FLWSMYTVAAQYDHP---------PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAY 338
           ++   +   A  D+P         P +P+ ++C  +   + S S +++ I   +   + Y
Sbjct: 284 WISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNY 343

Query: 339 KGNLSCY-MNQARDETETDVGWRWQRCSEMVMPLST-GNDTMFPAYNFELGSFIDYCNEL 398
            G + C  +++    +   +GW +Q C+E+VMP  T G D MF  +++ L    D C + 
Sbjct: 344 SGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQ 403

Query: 399 YGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDPYSSCGL 426
           +GV PRP W+TT YGG +I        +N++FSNG  DP+S  G+
Sbjct: 404 WGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGV 437

BLAST of CmoCh04G020170 vs. Swiss-Prot
Match: PCP_MOUSE (Lysosomal Pro-X carboxypeptidase OS=Mus musculus GN=Prcp PE=1 SV=2)

HSP 1 Score: 263.1 bits (671), Expect = 5.5e-69
Identity = 157/441 (35.60%), Positives = 237/441 (53.74%), Query Frame = 1

Query: 8   LLILSACVSATQYRIPRLSPTDRDSEALSSPLSDD-----FKTFYYNQTLDHFNYRPESY 67
           LL+LS  +      IP    T       +SP  D      +   Y+ Q +DHF +     
Sbjct: 6   LLLLSFLLLGAATTIPPRLKTLGSPHLSASPTPDPAVARKYSVLYFEQKVDHFGFA--DM 65

Query: 68  MTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHR 127
            TF  RY++  K+W    +   IL Y G EG +    N  GF  D A +  A+LV+ EHR
Sbjct: 66  RTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHR 125

Query: 128 YYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELH-AKDSPVIVLGGSYG 187
           YYG+S+PFG  + + K++  L +  S QA+AD+A+++ H++K +  A+  PVI +GGSYG
Sbjct: 126 YYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLEKTIPGAQGQPVIAIGGSYG 185

Query: 188 GMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSW 247
           GMLAAWFR+KYPH+ +GALA+SAPI   D + P   +  I T DFR     C E+IR SW
Sbjct: 186 GMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIVTNDFRKSGPYCSESIRKSW 245

Query: 248 SKIETIASKPDGLSILSKEFKSCSPLKNS--AQLEDFLWSMYTVAAQYDHP--------- 307
           + I+ ++    GL  L+     CSPL +     L+ ++   +   A  ++P         
Sbjct: 246 NVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIAETWVNLAMVNYPYACNFLQPL 305

Query: 308 PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAYKGNLSCY-MNQARDETETDVGWRW 367
           P +P+  +C  +   + S + +++ I   +   + Y G  +C  ++Q    +   +GW +
Sbjct: 306 PAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQAACLNISQTTTSSLGSMGWSF 365

Query: 368 QRCSEMVMPLST-GNDTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILK 426
           Q C+EMVMP  T G D MF  + ++L  + + C   +GV PRPHW+TT YGG +I     
Sbjct: 366 QACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWGVKPRPHWMTTMYGGKNIS---- 425

BLAST of CmoCh04G020170 vs. Swiss-Prot
Match: PCP_BOVIN (Lysosomal Pro-X carboxypeptidase OS=Bos taurus GN=PRCP PE=2 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 1.0e-67
Identity = 169/465 (36.34%), Positives = 248/465 (53.33%), Query Frame = 1

Query: 7   LLLILSACVSATQYRIPRLSPTDRDSEALSSPLSDDFKTF----------YYNQTLDHFN 66
           LLL+L      T      +SP+ R   +L  P S  F++           Y  Q +DHF 
Sbjct: 6   LLLLLLLIAFLTPGAANPVSPSLRAPSSL--PWSTSFRSRPTITLKYSIRYIQQKVDHFG 65

Query: 67  YRPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALL 126
           +  +   TF  RY+I   YW     S  IL Y G EG +    N  GF  D A +  A+L
Sbjct: 66  FNIDR--TFKQRYLIADNYWKEDGGS--ILFYTGNEGDIIWFCNNTGFMWDIAEEMKAML 125

Query: 127 VYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELH-AKDSPVIV 186
           V+ EHRYYG+S+PFG+   +  ++  L +  + QA+AD+A ++ ++K+ +  A++  VI 
Sbjct: 126 VFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIRYLKRTIPGARNQHVIA 185

Query: 187 LGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYE 246
           LGGSYGGMLAAWFR+KYPH+ +GALASSAPI  F+++ P D +  I T DF     +C E
Sbjct: 186 LGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMKIVTTDFSQSGPNCSE 245

Query: 247 TIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNS---AQLEDFLWSMYTVAAQYDHP-- 306
           +IR SW  I  +A K  GL  LS+    C+PL  S    +L+D++   +   A  D+P  
Sbjct: 246 SIRRSWDAINRLAKKGTGLRWLSEALHLCTPLTKSQDVQRLKDWISETWVNVAMVDYPYE 305

Query: 307 -------PRYPVTIICDAIDGAS-SGSGIVERIAAGV---FAYKGNLSCYMNQARDETET 366
                  P +PV ++C     ++   + +V+ I   +   + Y G   C +N +   T +
Sbjct: 306 SNFLQPLPAWPVKVVCQYFKYSNVPDTVMVQNIFQALNVYYNYSGQAKC-LNVSETATSS 365

Query: 367 --DVGWRWQRCSEMVMP-LSTGNDTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGG 426
              +GW +Q C+EMVMP  S G D MF  +++ +  + D C + +GV PRP W+ T YGG
Sbjct: 366 LGVLGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWGVRPRPSWIPTMYGG 425

Query: 427 HDIKLILKRFGSNVIFSNGLRDPYSSCG--------LLALHTPNG 434
            +I        +N+IFSNG  DP+S  G        LLA+  PNG
Sbjct: 426 KNIS-----SHTNIIFSNGELDPWSGGGVTKDITDTLLAIVIPNG 456

BLAST of CmoCh04G020170 vs. Swiss-Prot
Match: DPP2_HUMAN (Dipeptidyl peptidase 2 OS=Homo sapiens GN=DPP7 PE=1 SV=3)

HSP 1 Score: 225.7 bits (574), Expect = 9.8e-58
Identity = 137/406 (33.74%), Positives = 213/406 (52.46%), Query Frame = 1

Query: 43  FKTFYYNQTLDHFNYRPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVV 102
           F+  ++ Q LDHFN+      TF  R++++ ++W       PI  Y G EG +    N  
Sbjct: 31  FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 90

Query: 103 GFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHV 162
            F  + AA+ GALLV+ EHRYYGKS+PFG++     +   L      QA+AD+A++L  +
Sbjct: 91  AFVAELAAERGALLVFAEHRYYGKSLPFGAQSTQRGHTELL---TVEQALADFAELLRAL 150

Query: 163 KKELHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIA 222
           +++L A+D+P I  GGSYGGML+A+ R+KYPH+  GALA+SAP+L    +   + ++   
Sbjct: 151 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 210

Query: 223 TKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPL---KNSAQLEDFLWS 282
           T DF   S  C + +R+++ +I+ +  +      +  EF +C PL   K+  QL  F  +
Sbjct: 211 TADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARN 270

Query: 283 MYTVAAQYDHP---------PRYPVTIICDAIDGASSGSGIVERIAAGVFAYKGNLSCY- 342
            +TV A  D+P         P  PV + CD +   +     +  +A  V+   G+  CY 
Sbjct: 271 AFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCYD 330

Query: 343 ----MNQARDETETDVG-----WRWQRCSEMVMPLSTGNDT-MFPAYNFELGSFIDYCNE 402
                +   D T    G     W +Q C+E+ +  ++ N T MFP   F       YC +
Sbjct: 331 IYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCLD 390

Query: 403 LYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDPYSSCGL 426
            +GV PRP W+ T + G D+     R  SN+IFSNG  DP++  G+
Sbjct: 391 TWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGI 425

BLAST of CmoCh04G020170 vs. TrEMBL
Match: A0A0A0KAY8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149410 PE=4 SV=1)

HSP 1 Score: 730.3 bits (1884), Expect = 1.4e-207
Identity = 353/442 (79.86%), Positives = 388/442 (87.78%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP +L ILS CV+ATQYRIPRLSP  R    ++EA+ S +SDDFKTFYYNQTLDHFNYRP
Sbjct: 12  LPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAIPSSISDDFKTFYYNQTLDHFNYRP 71

Query: 64  ESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESY  F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNAA+F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAARFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VLIH+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLIHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIE I SKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNLSCY    R ETETDVGWRWQRCSEMVMPLST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLSCYNIGPRSETETDVGWRWQRCSEMVMPLSTTN 371

Query: 364 DTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDP 423
           DTMFP   F+L SF+DYC +LYGVS RPHWVTTYYGG+DIKLIL+RFGSN+IFSNGLRDP
Sbjct: 372 DTMFPPITFDLKSFVDYCYQLYGVSSRPHWVTTYYGGNDIKLILQRFGSNIIFSNGLRDP 431

Query: 424 YSSCG--------LLALHTPNG 434
           YSS G        LLA+HTP G
Sbjct: 432 YSSGGVLQNLSDSLLAVHTPKG 453

BLAST of CmoCh04G020170 vs. TrEMBL
Match: A0A0A0KDH5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149400 PE=4 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 1.4e-183
Identity = 316/442 (71.49%), Positives = 366/442 (82.81%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP LLL LS  V+A Q+RIPRLSP        S+AL  P SDDFKTFY+NQTLDHFNYRP
Sbjct: 7   LPFLLLFLSNSVTAFQFRIPRLSPIGEKFLHHSKALELPPSDDFKTFYFNQTLDHFNYRP 66

Query: 64  ESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESY TF  RYIINFKYWGGANSSAPILAYLG E P+D+ +NV+GF TDNA +F ALLVYI
Sbjct: 67  ESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNVIGFMTDNAVKFNALLVYI 126

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKSIPFGSR+ AL+NASTLGYFNSAQA+ADYA +LIHVKKE  AK SPVIV+GGS
Sbjct: 127 EHRYYGKSIPFGSRKEALRNASTLGYFNSAQALADYAAILIHVKKEFSAKYSPVIVIGGS 186

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLA WFRLKYPHVALGALASSAPILYF++ITP + YY I TKDFR+VS++CYE+IR+
Sbjct: 187 YGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVIVTKDFREVSQTCYESIRE 246

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWS+IET+AS+ +GLS+L K FK+CSPL++S QLE++LW MY  AAQY+HP RYPV  IC
Sbjct: 247 SWSEIETVASQSNGLSVLDKVFKTCSPLRSSTQLENYLWFMYASAAQYNHPSRYPVNRIC 306

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
           DAID   S +G + +IAAGVFAY+G LSCY+N+  + TET VGW+WQRCSEMVMP+STGN
Sbjct: 307 DAIDQTYS-NGTLGKIAAGVFAYRGELSCYINEPINTTETTVGWQWQRCSEMVMPISTGN 366

Query: 364 DTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDP 423
           DTMFP+  F+  SF  YCN+LYGV+PRPHWVTTYYGGHDI LIL RF SN+IFSNGL+DP
Sbjct: 367 DTMFPSETFDHESFSIYCNQLYGVTPRPHWVTTYYGGHDIHLILHRFASNIIFSNGLKDP 426

Query: 424 YS--------SCGLLALHTPNG 434
           YS        S  LLA++T NG
Sbjct: 427 YSIGGVLHNISDSLLAVYTANG 447

BLAST of CmoCh04G020170 vs. TrEMBL
Match: A0A0A0KBK9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149390 PE=4 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 1.1e-180
Identity = 308/444 (69.37%), Positives = 359/444 (80.86%), Query Frame = 1

Query: 4   LPCLLLILSACV--SATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNY 63
           +P LL + S  V  S    R PRLSP        S  L+S   DDFKT+YYNQTLDHFNY
Sbjct: 25  IPLLLFVFSTSVVTSLQHNRFPRLSPVGEKFLHHSRVLNSLPLDDFKTYYYNQTLDHFNY 84

Query: 64  RPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLV 123
           RPESY TF  RYIINFKYWGG NSSAPI AYLGAE P+D+DL+ +GF TDNA QF ALL+
Sbjct: 85  RPESYTTFPQRYIINFKYWGGPNSSAPIFAYLGAEAPIDDDLDFIGFMTDNAIQFNALLI 144

Query: 124 YIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLG 183
           YIEHRYYGKSIPF SR+ AL NASTLGYFNSAQAIADYA +LIHVKKE HA  SPVIV+G
Sbjct: 145 YIEHRYYGKSIPFRSRDEALGNASTLGYFNSAQAIADYAAILIHVKKEFHANYSPVIVIG 204

Query: 184 GSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETI 243
           GSYGGMLA+WFRLKYPHVALGALASSAPILYFD+ITP D YYS+ TKDFR +SE+CYETI
Sbjct: 205 GSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSVVTKDFRGLSETCYETI 264

Query: 244 RDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTI 303
           + SWS+IET+A +P+GLSIL +EFK+C PL+   +LED+LWSMY  AAQY+HPP+YPVT 
Sbjct: 265 KKSWSEIETVAYQPNGLSILDQEFKTCRPLRGYFELEDYLWSMYASAAQYNHPPKYPVTR 324

Query: 304 ICDAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLST 363
           ICDAIDG  S +G + +IAAGVFA++G++SCY+N+ R+ETETDVGWRWQ CSEMVMP+ +
Sbjct: 325 ICDAIDGTYSVNGTLSKIAAGVFAFRGSVSCYINEPRNETETDVGWRWQSCSEMVMPIGS 384

Query: 364 GNDTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLR 423
            +D MFP   F+L S I+YCN LYGV PRPHW TTYYGGHDI+L+L+RFGSN+IFSNGL+
Sbjct: 385 -DDDMFPPSPFDLQSVINYCNRLYGVPPRPHWATTYYGGHDIRLVLQRFGSNIIFSNGLK 444

Query: 424 DPYSSCG--------LLALHTPNG 434
           DPYS  G        LLA++T NG
Sbjct: 445 DPYSIAGVLHNISDSLLAVYTTNG 467

BLAST of CmoCh04G020170 vs. TrEMBL
Match: A5C9W6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018664 PE=4 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 1.8e-175
Identity = 301/435 (69.20%), Positives = 351/435 (80.69%), Query Frame = 1

Query: 8   LLILSACVSATQYRIPRLSPTDRDSEALSSPLSDDFKTFYYNQTLDHFNYRPESYMTFAH 67
           L+I   C +AT  ++PRLS   R+SE  S  +SDDF+TF+YNQTLDHFNYRPESY TF  
Sbjct: 19  LIIFPTCATATPSKLPRLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQ 78

Query: 68  RYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHRYYGKS 127
           RY++NFKYWGGAN+SAPI AYLGAE  LD DL  VGF  DNA QF ALLVYIEHRYYG+S
Sbjct: 79  RYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQS 138

Query: 128 IPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGSYGGMLAAW 187
           IPFGSRE ALKNAST GYFNSAQAIADYA+VL ++KK+L A++SPVIV+GGSYGGMLA+W
Sbjct: 139 IPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLASW 198

Query: 188 FRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSWSKIETI 247
           FRLKYPHVALGALASSAPILYFD+ITP + YYSI TKDFR+ SESCY TIR+SWS+I+ +
Sbjct: 199 FRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDRV 258

Query: 248 ASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIICDAIDGASS 307
           AS+P+GLSILSK+F++C+ L  S +L+D+L +MY VAAQY+HPPRYPVT++C  IDGA  
Sbjct: 259 ASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHPPRYPVTVVCGGIDGAPE 318

Query: 308 GSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTG-NDTMFPAY 367
           GS I+ RI AGV AY+GN SCY N + + TET  GWRWQ CSEMVMP+  G NDTMFP  
Sbjct: 319 GSDILSRIFAGVVAYRGNSSCY-NTSVNPTETSEGWRWQTCSEMVMPIGRGDNDTMFPPS 378

Query: 368 NFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDPYSSCG-- 427
            F L +FI  C  LY V PRPHW+TTYYGGHDIKLIL RF SN+IFSNGLRDPYSS G  
Sbjct: 379 PFNLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILHRFASNIIFSNGLRDPYSSAGVL 438

Query: 428 ------LLALHTPNG 434
                 +LA+HT NG
Sbjct: 439 KNISHTVLAIHTVNG 452

BLAST of CmoCh04G020170 vs. TrEMBL
Match: F6GW68_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0061g01010 PE=4 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 5.4e-172
Identity = 296/421 (70.31%), Positives = 342/421 (81.24%), Query Frame = 1

Query: 22  IPRLSPTDRDSEALSSPLSDDFKTFYYNQTLDHFNYRPESYMTFAHRYIINFKYWGGANS 81
           I RLS   R+SE  S  +SDDF+TF+YNQTLDHFNYRPESY TF  RY++NFKYWGGAN+
Sbjct: 15  IKRLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQRYVMNFKYWGGANA 74

Query: 82  SAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNAS 141
           SAPI AYLGAE  LD DL  VGF  DNA QF ALLVYIEHRYYG+SIPFGSRE ALKNAS
Sbjct: 75  SAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQSIPFGSREEALKNAS 134

Query: 142 TLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALA 201
           T GYFNSAQAIADYA+VL ++KK+L A++SPVIV+GGSYGGMLA+WFRLKYPHVALGALA
Sbjct: 135 TRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLASWFRLKYPHVALGALA 194

Query: 202 SSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEF 261
           SSAPILYFD+ITP + YYSI TKDFR+ SESCY TIR+SWS+I+ +AS+P+GLSILSK+F
Sbjct: 195 SSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDRVASEPNGLSILSKKF 254

Query: 262 KSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIICDAIDGASSGSGIVERIAAGVFA 321
           ++C+ L  S +L+D+L +MY VAAQY+HPPRYPVT++C  IDGA  GS I+ RI AGV A
Sbjct: 255 RTCAELNKSNELKDYLETMYAVAAQYNHPPRYPVTVVCGGIDGAPEGSDILSRIFAGVVA 314

Query: 322 YKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTG-NDTMFPAYNFELGSFIDYCNEL 381
           Y+GN SCY N + + TET  GWRWQ CSEMVMP+  G NDTMFP   F L +FI  C  L
Sbjct: 315 YRGNSSCY-NTSVNPTETSEGWRWQTCSEMVMPIGRGDNDTMFPPSPFNLTTFIQACTSL 374

Query: 382 YGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDPYSSCG--------LLALHTPN 434
           Y V PRPHW+TTYYGGHDIKLIL RF SN+IFSNGLRDPYSS G        +LA+HT N
Sbjct: 375 YDVPPRPHWITTYYGGHDIKLILHRFASNIIFSNGLRDPYSSAGVLKNISHTVLAIHTVN 434

BLAST of CmoCh04G020170 vs. TAIR10
Match: AT5G22860.1 (AT5G22860.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 469.5 bits (1207), Expect = 2.2e-132
Identity = 239/456 (52.41%), Positives = 314/456 (68.86%), Query Frame = 1

Query: 2   MLLPCLLLILSACVSATQYRIP-------RL---SPTDRDSEALSSPLSDD--FKTFYYN 61
           M LP  +LIL    +++ Y IP       RL   S T ++    S+   D+   K +Y+N
Sbjct: 1   MSLPYTILILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPDGSTQKVDESNLKMYYFN 60

Query: 62  QTLDHFNYRPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNA 121
           QTLDHF + PESYMTF  RY I+  +WGGA ++APILA+LG E  LD+DL  +GF  DN 
Sbjct: 61  QTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNG 120

Query: 122 AQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAK 181
            +  ALLVYIEHRYYG+++PFGS E ALKNASTLGY N+AQA+ADYA +L+HVK++    
Sbjct: 121 PRLNALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADYAAILLHVKEKYSTN 180

Query: 182 DSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDV 241
            SP+IV+GGSYGGMLAAWFRLKYPH+ALGALASSAP+LYF++  P   YY I TK F++ 
Sbjct: 181 HSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEA 240

Query: 242 SESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDH 301
           SE CY TIR+SW +I+ +A KP+GLSILSK+FK+C+PL  S  ++DFL ++Y  A QY+ 
Sbjct: 241 SERCYNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKDFLDTIYAEAVQYNR 300

Query: 302 PPRYPVTIICDAIDG--ASSGSGIVERIAAGVFAYKGNLSCY-MNQARDETETDVGWRWQ 361
            P + V  +C+AI+    +    +++RI AGV A  GN +CY        T  ++ WRWQ
Sbjct: 301 GPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVALVGNRTCYDTKMFAQPTNNNIAWRWQ 360

Query: 362 RCSEMVMPLS-TGNDTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKR 421
            CSE+VMP+     DTMFP   F + S+ID C   +GV+PRPHW+TTY+G  ++KLIL++
Sbjct: 361 SCSEIVMPVGYDKQDTMFPTAPFNMTSYIDGCKSYHGVTPRPHWITTYFGIQEVKLILQK 420

Query: 422 FGSNVIFSNGLRDPYSSCG--------LLALHTPNG 434
           FGSN+IFSNGL DPYS  G        L+A+ T NG
Sbjct: 421 FGSNIIFSNGLSDPYSVGGVLEDISDTLVAITTKNG 456

BLAST of CmoCh04G020170 vs. TAIR10
Match: AT2G24280.1 (AT2G24280.1 alpha/beta-Hydrolases superfamily protein)

HSP 1 Score: 369.0 bits (946), Expect = 4.0e-102
Identity = 189/437 (43.25%), Positives = 270/437 (61.78%), Query Frame = 1

Query: 6   CLLLILSACVSATQYR---IPRLSPTDRDSEALSSPLSDDFKTFYYNQTLDHFNYRPESY 65
           CL+ +  + V+   Y       LS      +   S     F+T Y+ Q LDHF++ P+SY
Sbjct: 6   CLVFLFFSIVAEATYSPGGFHHLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSY 65

Query: 66  MTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYIEHR 125
             F  +Y+IN ++W       PI  Y G EG +D   +  GF  D A +F ALLV+IEHR
Sbjct: 66  KVFHQKYLINNRFW---RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHR 125

Query: 126 YYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGSYGG 185
           +YG+S PFG +  + K+A TLGY NS QA+ADYA ++  +K+ L ++ SPV+V GGSYGG
Sbjct: 126 FYGESTPFGKK--SHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGG 185

Query: 186 MLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRDSWS 245
           MLAAWFRLKYPH+ +GALASSAPIL+FDNI P  ++Y   ++DF+D S +C++ I+ SW 
Sbjct: 186 MLAAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWE 245

Query: 246 KIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHP---------PRY 305
           ++E +++  +GL  LSK+F++C  L +     D+L   +   A  ++P         P Y
Sbjct: 246 ELEAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGY 305

Query: 306 PVTIICDAIDGASSGSGIVERIAAGV---FAYKGNLSCY-MNQARDETETDVGWRWQRCS 365
           PV  +C  IDG   GS  ++R  A     + Y G+  C+ M Q  D+   D GW++Q C+
Sbjct: 306 PVEQMCKIIDGFPRGSSNLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLD-GWQYQACT 365

Query: 366 EMVMPLSTGNDTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSN 425
           EMVMP+S  N +M P Y  +  +F + C   YGV PRPHW+TT +GG  I+ +LKRFGSN
Sbjct: 366 EMVMPMSCSNQSMLPPYENDSEAFQEQCMTRYGVKPRPHWITTEFGGMRIETVLKRFGSN 425

Query: 426 VIFSNGLRDPYSSCGLL 427
           +IFSNG++DP+S  G+L
Sbjct: 426 IIFSNGMQDPWSRGGVL 436

BLAST of CmoCh04G020170 vs. TAIR10
Match: AT5G65760.1 (AT5G65760.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 368.2 bits (944), Expect = 6.9e-102
Identity = 184/397 (46.35%), Positives = 256/397 (64.48%), Query Frame = 1

Query: 43  FKTFYYNQTLDHFNYRPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVV 102
           ++T +++Q LDHF++       F+ RY+IN  +W GA++  PI  Y G EG ++      
Sbjct: 58  YETKFFSQQLDHFSFA--DLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNS 117

Query: 103 GFSTDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHV 162
           GF  D A +FGALLV+ EHRYYG+S+P+GSRE A KNA+TL Y  + QA+AD+A  +  +
Sbjct: 118 GFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDL 177

Query: 163 KKELHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIA 222
           K+ L A+  PV++ GGSYGGMLAAW RLKYPH+A+GALASSAPIL F+++ P + +Y IA
Sbjct: 178 KRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIA 237

Query: 223 TKDFRDVSESCYETIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYT 282
           + DF+  S SC+ TI+DSW  I     K +GL  L+K F  C  L ++  L D+L S Y+
Sbjct: 238 SNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYS 297

Query: 283 VAAQYDHP---------PRYPVTIICDAIDGASSGSGIVERIAAGV---FAYKGNLSCYM 342
             A  D+P         P +P+  +C  IDGA S + I++RI AG+   + Y GN+ C+ 
Sbjct: 298 YLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCF- 357

Query: 343 NQARDETETDVGWRWQRCSEMVMPLSTGND-TMFPAYNFELGSFIDYCNELYGVSPRPHW 402
            +  D+     GW WQ C+EMVMP+S+  + +MFP Y F   S+ + C   + V+PRP W
Sbjct: 358 -KLDDDPHGLDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKW 417

Query: 403 VTTYYGGHDIKLILKRFGSNVIFSNGLRDPYSSCGLL 427
           VTT +GGHDI   LK FGSN+IFSNGL DP+S   +L
Sbjct: 418 VTTEFGGHDIATTLKSFGSNIIFSNGLLDPWSGGSVL 450

BLAST of CmoCh04G020170 vs. TAIR10
Match: AT3G28680.1 (AT3G28680.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 149.8 bits (377), Expect = 3.9e-36
Identity = 75/173 (43.35%), Positives = 116/173 (67.05%), Query Frame = 1

Query: 178 GSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETI 237
           G+   +LAAWF+LKYP++ALGALASSAP+LYF++  P   Y+ I TK F+++S+ C+  I
Sbjct: 18  GAVHKVLAAWFKLKYPYIALGALASSAPLLYFEDTLPKHGYFYIVTKVFKEMSKECHNKI 77

Query: 238 RDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTI 297
             SW +I+ IA+KP+ LSILSK FK C+PL +  +L+ ++  +Y   AQY    ++ V  
Sbjct: 78  HKSWDEIDRIAAKPNSLSILSKNFKLCNPLNDIIELKSYVSYIYARTAQYS-DNQFSVAR 137

Query: 298 ICDAIDGA--SSGSGIVERIAAGVFAYKGNLSCY--MNQARDETETDVGWRWQ 347
           +C+AI+ +  ++ S ++++I AGV A +GN+SCY   + +   T  D  W WQ
Sbjct: 138 LCEAINTSPPNTKSDLLDQIFAGVVASRGNISCYGMSSPSYQMTNDDRAWGWQ 189

BLAST of CmoCh04G020170 vs. TAIR10
Match: AT4G36190.1 (AT4G36190.1 Serine carboxypeptidase S28 family protein)

HSP 1 Score: 121.7 bits (304), Expect = 1.1e-27
Identity = 116/405 (28.64%), Positives = 186/405 (45.93%), Query Frame = 1

Query: 47  YYNQTLDHFNYRPESYMTFAHRYIINFKYWGGAN-SSAPILAYLGAEGPLDNDLNVVGFS 106
           ++ QTLDH  Y P  +  F  RY   ++Y         PI   +  EGP +   N   + 
Sbjct: 49  WFTQTLDH--YSPSDHRKFRQRY---YEYLDHLRVPDGPIFLMICGEGPCNGITN--NYI 108

Query: 107 TDNAAQFGALLVYIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKE 166
           +  A +F A +V +EHRYYGKS PF S  +A KN   L Y +S QA++D A    + +  
Sbjct: 109 SVLAKKFDAGIVSLEHRYYGKSSPFKS--LATKN---LKYLSSKQALSDLATFRQYYQDS 168

Query: 167 LHAK-------DSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAY 226
           L+ K       ++P    G SY G L+AWFRLK+PH+  G+LASSA +          A 
Sbjct: 169 LNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVV---------RAV 228

Query: 227 YSIATKDFRDVSES----CYETIRDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLE 286
           Y     D + ++ES    C   ++++   +E       GL + ++  K+   L N+ +L+
Sbjct: 229 YEFPEFD-QQIAESAGPECETALQETNKLLEL------GLKVNNRAVKA---LFNATELD 288

Query: 287 ---DFLWSMY---TVAAQYDHPPRYPVTIICDAIDGASSGSGIVERIA-------AGVFA 346
              DFL+ +     +A QY +P +  V +    ++   +G  +VE  A        GVF 
Sbjct: 289 VDADFLYLIADAGVMAIQYGNPDKLCVPL----VEAQKNGGDLVEAYAKYVREFCMGVFG 348

Query: 347 YKG---NLSCYMNQARDETETDVGWRWQRCSEMV-MPLSTGNDTMFPAYNFELGSFIDYC 406
                 +    ++ A      D  W +Q C+E+    ++  ND++  ++       +D C
Sbjct: 349 QSSKTYSRKHLLDTAVTLESADRLWWFQVCTEVAYFQVAPANDSI-RSHQINTEYHLDLC 408

Query: 407 NELY--GVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDPY 421
             L+  GV P       YYG   I        + +IF+NG +DP+
Sbjct: 409 KSLFGKGVYPEVDATNLYYGSDKIA------ATKIIFTNGSQDPW 411

BLAST of CmoCh04G020170 vs. NCBI nr
Match: gi|449456064|ref|XP_004145770.1| (PREDICTED: lysosomal Pro-X carboxypeptidase [Cucumis sativus])

HSP 1 Score: 730.3 bits (1884), Expect = 2.0e-207
Identity = 353/442 (79.86%), Positives = 388/442 (87.78%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP +L ILS CV+ATQYRIPRLSP  R    ++EA+ S +SDDFKTFYYNQTLDHFNYRP
Sbjct: 12  LPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAIPSSISDDFKTFYYNQTLDHFNYRP 71

Query: 64  ESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESY  F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNAA+F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAARFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VLIH+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLIHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIE I SKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNLSCY    R ETETDVGWRWQRCSEMVMPLST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLSCYNIGPRSETETDVGWRWQRCSEMVMPLSTTN 371

Query: 364 DTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDP 423
           DTMFP   F+L SF+DYC +LYGVS RPHWVTTYYGG+DIKLIL+RFGSN+IFSNGLRDP
Sbjct: 372 DTMFPPITFDLKSFVDYCYQLYGVSSRPHWVTTYYGGNDIKLILQRFGSNIIFSNGLRDP 431

Query: 424 YSSCG--------LLALHTPNG 434
           YSS G        LLA+HTP G
Sbjct: 432 YSSGGVLQNLSDSLLAVHTPKG 453

BLAST of CmoCh04G020170 vs. NCBI nr
Match: gi|659117555|ref|XP_008458663.1| (PREDICTED: lysosomal Pro-X carboxypeptidase isoform X1 [Cucumis melo])

HSP 1 Score: 730.3 bits (1884), Expect = 2.0e-207
Identity = 351/442 (79.41%), Positives = 391/442 (88.46%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           +P +L ILS CV+ATQYRIPRLSP  R    ++EA+SS +SDDFKTFYYNQ+LDHFNYRP
Sbjct: 12  VPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAISSSISDDFKTFYYNQSLDHFNYRP 71

Query: 64  ESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESY  F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNA +F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAVRFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VL+H+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLLHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIETIASKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIETIASKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNL CY    R++TETDVGWRWQRCSEMVMP+ST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLPCYNIGPRNDTETDVGWRWQRCSEMVMPMSTSN 371

Query: 364 DTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDP 423
           DTMFP   F+L SFIDYC +LYGVSPRPHWVTTYYGG+DIKLIL+RFGSN+IFSNGLRDP
Sbjct: 372 DTMFPPITFDLRSFIDYCYQLYGVSPRPHWVTTYYGGNDIKLILQRFGSNIIFSNGLRDP 431

Query: 424 YSSCG--------LLALHTPNG 434
           YSS G        LLA+HT NG
Sbjct: 432 YSSGGVLQNLSDSLLAVHTLNG 453

BLAST of CmoCh04G020170 vs. NCBI nr
Match: gi|659117557|ref|XP_008458664.1| (PREDICTED: lysosomal Pro-X carboxypeptidase isoform X2 [Cucumis melo])

HSP 1 Score: 723.8 bits (1867), Expect = 1.8e-205
Identity = 344/425 (80.94%), Positives = 383/425 (90.12%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDR----DSEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           +P +L ILS CV+ATQYRIPRLSP  R    ++EA+SS +SDDFKTFYYNQ+LDHFNYRP
Sbjct: 12  VPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAISSSISDDFKTFYYNQSLDHFNYRP 71

Query: 64  ESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESY  F HRYIINFKYWGGANSSAPILAYLGAEGPL+ DLN +GF TDNA +F ALLVYI
Sbjct: 72  ESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAVRFDALLVYI 131

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKS+PFGSRE ALKNASTLGYF+SAQAIADYA VL+H+K++ HAKDSPVIVLGGS
Sbjct: 132 EHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLLHLKQKYHAKDSPVIVLGGS 191

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLAAWFRLKYPHVALGALASSAPILYF++ITPH+ YYSIATKDFR+VSE+CYETIRD
Sbjct: 192 YGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETIRD 251

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWSKIETIASKP+GLSILSKEFK+CSPL +S+QLED+LWSMY  AAQY+HPPRYPVT IC
Sbjct: 252 SWSKIETIASKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTRIC 311

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
             IDGAS GSGI+ ++AAGVFAYKGNL CY    R++TETDVGWRWQRCSEMVMP+ST N
Sbjct: 312 GGIDGASPGSGIISKVAAGVFAYKGNLPCYNIGPRNDTETDVGWRWQRCSEMVMPMSTSN 371

Query: 364 DTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDP 423
           DTMFP   F+L SFIDYC +LYGVSPRPHWVTTYYGG+DIKLIL+RFGSN+IFSNGLRDP
Sbjct: 372 DTMFPPITFDLRSFIDYCYQLYGVSPRPHWVTTYYGGNDIKLILQRFGSNIIFSNGLRDP 431

Query: 424 YSSCG 425
           YSS G
Sbjct: 432 YSSGG 436

BLAST of CmoCh04G020170 vs. NCBI nr
Match: gi|659117559|ref|XP_008458665.1| (PREDICTED: lysosomal Pro-X carboxypeptidase-like isoform X1 [Cucumis melo])

HSP 1 Score: 654.1 bits (1686), Expect = 1.8e-184
Identity = 315/444 (70.95%), Positives = 363/444 (81.76%), Query Frame = 1

Query: 4   LPCLLLILSACV--SATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNY 63
           +P LL +LS  V  S    R PRLSP        S AL S  SDDFKT+YYNQTLDHFNY
Sbjct: 25  IPFLLFVLSTSVVTSLQHNRFPRLSPIGEKFLHHSRALYSLPSDDFKTYYYNQTLDHFNY 84

Query: 64  RPESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLV 123
           RPESY TF  RYIINFKYWGG NSSAPI AYLGAE P+D DLN +GF TDNA QF ALL+
Sbjct: 85  RPESYTTFLQRYIINFKYWGGPNSSAPIFAYLGAEAPIDGDLNFIGFLTDNAIQFNALLI 144

Query: 124 YIEHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLG 183
           YIEHRYYGKSIPF SR+ AL NASTLGYFNSAQAIADYA +LIHVKKE HA  SPVIV+G
Sbjct: 145 YIEHRYYGKSIPFRSRDEALGNASTLGYFNSAQAIADYAAILIHVKKEFHANYSPVIVIG 204

Query: 184 GSYGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETI 243
           GSYGGMLA+WFRLKYPHVALGALASSAPILYFD+ITP D YYS+ TKDFR +SE+CYETI
Sbjct: 205 GSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSVVTKDFRGLSETCYETI 264

Query: 244 RDSWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTI 303
           + SWS+I+T+AS+P+GLSIL +EFK+C PL+   +LED+LWSMY  AAQY+HPP+YPVT 
Sbjct: 265 KKSWSEIKTVASQPNGLSILDQEFKTCRPLRGYFELEDYLWSMYASAAQYNHPPKYPVTR 324

Query: 304 ICDAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLST 363
           ICDAIDG  S +G + +IAAGVFA++G++SCY+N+ R+ETETDVGWRWQ CSEMVMP+S+
Sbjct: 325 ICDAIDGTYSVNGTLSKIAAGVFAFRGSISCYINEPRNETETDVGWRWQSCSEMVMPISS 384

Query: 364 GNDTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLR 423
            +D MFP Y F+L S I+YCN LYGV PRPHW TTYYGGHDI+L+L+RFGSN+IFSNGL+
Sbjct: 385 -DDDMFPPYPFDLQSVINYCNRLYGVPPRPHWATTYYGGHDIRLVLQRFGSNIIFSNGLK 444

Query: 424 DPYSSCG--------LLALHTPNG 434
           DPYS  G        LLA+HT NG
Sbjct: 445 DPYSIAGVLHSISDSLLAVHTTNG 467

BLAST of CmoCh04G020170 vs. NCBI nr
Match: gi|449456174|ref|XP_004145825.1| (PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus])

HSP 1 Score: 650.6 bits (1677), Expect = 2.0e-183
Identity = 316/442 (71.49%), Positives = 366/442 (82.81%), Query Frame = 1

Query: 4   LPCLLLILSACVSATQYRIPRLSPTDRD----SEALSSPLSDDFKTFYYNQTLDHFNYRP 63
           LP LLL LS  V+A Q+RIPRLSP        S+AL  P SDDFKTFY+NQTLDHFNYRP
Sbjct: 7   LPFLLLFLSNSVTAFQFRIPRLSPIGEKFLHHSKALELPPSDDFKTFYFNQTLDHFNYRP 66

Query: 64  ESYMTFAHRYIINFKYWGGANSSAPILAYLGAEGPLDNDLNVVGFSTDNAAQFGALLVYI 123
           ESY TF  RYIINFKYWGGANSSAPILAYLG E P+D+ +NV+GF TDNA +F ALLVYI
Sbjct: 67  ESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNVIGFMTDNAVKFNALLVYI 126

Query: 124 EHRYYGKSIPFGSREVALKNASTLGYFNSAQAIADYADVLIHVKKELHAKDSPVIVLGGS 183
           EHRYYGKSIPFGSR+ AL+NASTLGYFNSAQA+ADYA +LIHVKKE  AK SPVIV+GGS
Sbjct: 127 EHRYYGKSIPFGSRKEALRNASTLGYFNSAQALADYAAILIHVKKEFSAKYSPVIVIGGS 186

Query: 184 YGGMLAAWFRLKYPHVALGALASSAPILYFDNITPHDAYYSIATKDFRDVSESCYETIRD 243
           YGGMLA WFRLKYPHVALGALASSAPILYF++ITP + YY I TKDFR+VS++CYE+IR+
Sbjct: 187 YGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVIVTKDFREVSQTCYESIRE 246

Query: 244 SWSKIETIASKPDGLSILSKEFKSCSPLKNSAQLEDFLWSMYTVAAQYDHPPRYPVTIIC 303
           SWS+IET+AS+ +GLS+L K FK+CSPL++S QLE++LW MY  AAQY+HP RYPV  IC
Sbjct: 247 SWSEIETVASQSNGLSVLDKVFKTCSPLRSSTQLENYLWFMYASAAQYNHPSRYPVNRIC 306

Query: 304 DAIDGASSGSGIVERIAAGVFAYKGNLSCYMNQARDETETDVGWRWQRCSEMVMPLSTGN 363
           DAID   S +G + +IAAGVFAY+G LSCY+N+  + TET VGW+WQRCSEMVMP+STGN
Sbjct: 307 DAIDQTYS-NGTLGKIAAGVFAYRGELSCYINEPINTTETTVGWQWQRCSEMVMPISTGN 366

Query: 364 DTMFPAYNFELGSFIDYCNELYGVSPRPHWVTTYYGGHDIKLILKRFGSNVIFSNGLRDP 423
           DTMFP+  F+  SF  YCN+LYGV+PRPHWVTTYYGGHDI LIL RF SN+IFSNGL+DP
Sbjct: 367 DTMFPSETFDHESFSIYCNQLYGVTPRPHWVTTYYGGHDIHLILHRFASNIIFSNGLKDP 426

Query: 424 YS--------SCGLLALHTPNG 434
           YS        S  LLA++T NG
Sbjct: 427 YSIGGVLHNISDSLLAVYTANG 447

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
PCP_PONAB   1.7e-70  36.30     Lysosomal Pro-X carboxypeptidase OS=Pongo abelii GN=PRCP PE=2 SV=1
PCP_HUMAN   5.0e-70  36.05     Lysosomal Pro-X carboxypeptidase OS=Homo sapiens GN=PRCP PE=1 SV=1
PCP_MOUSE   5.5e-69  35.60     Lysosomal Pro-X carboxypeptidase OS=Mus musculus GN=Prcp PE=1 SV=2
PCP_BOVIN   1.0e-67  36.34     Lysosomal Pro-X carboxypeptidase OS=Bos taurus GN=PRCP PE=2 SV=1
DPP2_HUMAN  9.8e-58  33.74     Dipeptidyl peptidase 2 OS=Homo sapiens GN=DPP7 PE=1 SV=3

Match Name        E-value   Identity  Description
A0A0A0KAY8_CUCSA  1.4e-207  79.86     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149410 PE=4 SV=1
A0A0A0KDH5_CUCSA  1.4e-183  71.49     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149400 PE=4 SV=1
A0A0A0KBK9_CUCSA  1.1e-180  69.37     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G149390 PE=4 SV=1
A5C9W6_VITVI      1.8e-175  69.20     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018664 PE=4 SV=1
F6GW68_VITVI      5.4e-172  70.31     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0061g01010 PE=4 SV=...

Match Name   E-value   Identity  Description
AT5G22860.1  2.2e-132  52.41     Serine carboxypeptidase S28 family protein
AT2G24280.1  4.0e-102  43.25     alpha/beta-Hydrolases superfamily protein
AT5G65760.1  6.9e-102  46.35     Serine carboxypeptidase S28 family protein
AT3G28680.1  3.9e-36   43.35     Serine carboxypeptidase S28 family protein
AT4G36190.1  1.1e-27   28.64     Serine carboxypeptidase S28 family protein

Match Name                        E-value   Identity  Description
gi|449456064|ref|XP_004145770.1|  2.0e-207  79.86     PREDICTED: lysosomal Pro-X carboxypeptidase [Cucumis sativus]
gi|659117555|ref|XP_008458663.1|  2.0e-207  79.41     PREDICTED: lysosomal Pro-X carboxypeptidase isoform X1 [Cucumis melo]
gi|659117557|ref|XP_008458664.1|  1.8e-205  80.94     PREDICTED: lysosomal Pro-X carboxypeptidase isoform X2 [Cucumis melo]
gi|659117559|ref|XP_008458665.1|  1.8e-184  70.95     PREDICTED: lysosomal Pro-X carboxypeptidase-like isoform X1 [Cucumis melo]
gi|449456174|ref|XP_004145825.1|  2.0e-183  71.49     PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR008758  Peptidase_S28
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: Molecular Function
Term        Definition
GO:0008236  serine-type peptidase activity
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008236      serine-type peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G020170.1  CmoCh04G020170.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description   Source   Source Term     Source Description                           Alignment
IPR008758  Peptidase S28     PFAM     PF05577         Peptidase_S28                                coord: 50..425; score: 6.8
None       No IPR available  PANTHER  PTHR11010       PROTEASE S28 PRO-X CARBOXYPEPTIDASE-RELATED  coord: 1..433; score: 8.8E
None       No IPR available  PANTHER  PTHR11010:SF47  PROLYLCARBOXYPEPTIDASE-LIKE PROTEIN-RELATED  coord: 1..433; score: 8.8E

The following gene(s) are orthologous to this gene:
Gene            Orthologue      Organism                 Block
CmoCh04G020170  CmaCh04G019050  Cucurbita maxima (Rimu)  cmacmoB728
The following gene(s) are paralogous to this gene:

None