CmoCh06G010800 (gene) Cucurbita moschata (Rifu)

Name: CmoCh06G010800
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Proline iminopeptidase, putative
Location: Cmo_Chr06 : 8344640 .. 8348886 (-)
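
The Location field above can be split into chromosome, coordinates, and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Cmo_Chr06 : 8344640 .. 8348886 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr06 : 8344640 .. 8348886 (-)")
print(loc)
```

Under that assumption the feature spans 4247 positions on the minus strand of Cmo_Chr06.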
The following sequences are available for this feature:

Gene sequence (with intron)

TTCCGCCATTTTTGATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTGGCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTTTCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGGTTAATCAAATCCTAATTTCACTTTAAATTTACTTTCGATTTGTTGAAAATTTTGAATATCGTAATCTAGTGGCGACTGTTCTCGACCACTTGCATATGTATACTTTTTCTGTTCATTGATTGATTTTGTGGAGTTATGATGTCATTGTTGTTAATTAGAATTAGGTTAAAACAGTGATATCCATGAAGTACTTCTGCAATTTCTATCTTAGTGCAAAGCTTGTGGCGGCTTCCTTCAATAATTTCTTCTTATCCATGTAATACATCACAAATAAGGTAGTTGCTTCTTGTATAAAGAATAAAAAGGTTCGTTTCACATGTTAAATTCTCATTGTAAAAGGCTCCTTCTAGTGTAGGAATCTATCACATTAATAGGATATTATTCGTTAAGTAAATTTATTCGTCATGTCTATCAATTTCAACGGTTCTATTTCTTTCATACTTGTATAAAAAATTATGAAGCTGGAAAACAAGTTGTGTCACTCTATGAAGAACACTCCTCGTGGAGAGTTGCATTCTTTTTCAAGAAATAAGTTCATTGATATATGTAATGTATTAAAAAAATGGAGAAAGAAAGCCTCAAACCATTGGAGTTACATTAACATTCTCCAATTGGTCTGAGAAAAACTGTAAGAGAAGAAGAAGAGAATTTAGTTCTTACCAAGATAAAGCAAGGAAAAGAACACTATCATAAAAATGTTATAGGGTTTCTTAATGGGGTTATGCTTATGGCATTTGGTAAATGAGCAAAATTTTCAAATCTTATCCTTTCTTTTATTTTCTAATTAAAATTTTGAGAAAGTTAGAACTTAATCTCTATGTTTTTAAATGTTAGAAATCATCTCTATGGTTTGATAAAACATTCGTTAATGATCCTTACCACCGGGACTATTTATGTGTGTTTTATCAAACCATAGGGAATAGTTATGAGATTTTAGCTAACTCTAAGGACGAAATCTAGCGATCTTCCTCCAAATCATATGGACTAAATTTTTACTTGTGATTAAAATTGTTGCAGAATGCAATATGGGCTCTAAAAAATTTTCTGAACATCTCAGTCAGAATACAAATAGTTTGCTTGCATCTTCTAGAAGTTGAGTGTTGCTGAAGCTACTGATTCAGGGACTTCTTAAATATTGAATGATATTATAACCAAATCCTTCAAATACTGAAAGCTATTAGGTGATGATTACTTGATTTGCAATCCATCCCTTTCTCGAAAGAATTATTTTGTCAGTTCCTATATGTACCTGGGTTGTTTTTGTTTTTGGGCATTTATGTTTTGGTTAAATCAATTATGATTTTAGTAATTGCGACCCTTAACAAATACAGATTCTGCATTGCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTCTATACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGCGAAGAATTTCGTGTTATATTGATGGACCAGGCATGACTTTTGTCAACTACCTTTGAGCAACAGCATTTGAAATATAAACAACTGTTGTCATGATCTTTTATTCCACTTACATATGTATATCACTGCAGCGGGGAACAGGATTATCGACTCCTTTGTCTCCATCATCCATGTCCCAATTCCAAAGTGCAGAGGACTTGGCCGACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATT
CATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGGTATGCAATTTAGTGTTTCTTTTTTAGTTTCAAATTTTGCAATCAACATGTTTAGCATAGCGCTCTTTACTCGTCCTTACTTCAATAGATGACTTCTAGTCTTTGCAAGCAACACATTGAGTTTAGTACTCTTCGCTCAACATGTTGAACTTACATAAAGCAATAAGCAGACTTGTTAGCTTTGGGATTTCTAAATGTGGCTTTGCTTTATATAAGTTATTGCCTTCTGTTGTGTGGTAATTTAAATCACTGCAAATTCAAAGATAATCTCTTTTTCCCTGTACAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGGTGAGTACATTCGTGATTTATGTCATCATAATGCTATAAAAAACCCGAACGAAATGCTTAAATGTCTCTCAGATCCCTTTACAACTCACTGAAAGAAACACTTATTTCTTTTTGTTTTTGTATAGATTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTAAGGCGTCTTAATTTTTGTCGTTTTGGTTTTATTCTTTTGGATAAATGAATAATCAACTCAAGAAATGTGTGTAGTAATCTAATTACTCTCTTTTTTCCCCCCTCAATCATGTCAGGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTTAATGCTGTTAGTATTCTACAATTTCCATTGGTAGTGATTATTACTAAAACTTCATTCTTGAATGATTCTAACAACTAGACATATTTGCAGATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGGTAACCTCCTTTTTTACCTTTTAGAATTACATATTGGTCGTTTGAACTCTCTATAGAGTGAACTTTTTGCCACTTCGTGTTCCTGTTCCCCAAAGGGCGCCTCGTCTCGCTGGTCTGCTCAAAGAATAATGAATGAACTGGAGAACAAATTCGATGCAACAAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGGTAGGTGCATTATACTCATAAAATGTCATCATGCTGATCGAGTAAATAGCCATCTGAGTTTATTAGGTCATGCCATACCTTTTCAAGATTTATGCTTCTGCAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTATTCTCTCCAACTGCTTCTACATTTCCAAGTAGTGTTTTTTATAATGTTTCTATCCCGAAAATCCCGTCTTTGGGGTACTTTACTGCAAAACATCCTAACTTGCCCGAACGAGTCTGCCATAGGTTCATCTACACCGTGCTCGTAGTTACTACGTTGTTAGTTTAGCCACTCGGTTTTACCATAGCCACTCTGCCGTAGGTTCATCTACACGATCCTCTCCCTTCGTAGTCTCTGTCTCCACCATCACCATCTACACCGATGTCGATTTTACCATAACAACATGTCATGACAAGCATTAGGATTAGGTTGGGACTTCTTTGTTGACAAAAAGTTAAACATCTTTTCGGAATCCCAATCACAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGA
GACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGAGGTTTTTTTCTCTAAACTTTGTTGCTTTTCCTCATGATTATTGGATGCAGTCTTTGCCATGACCTTTTCATTTCCTCCTCAATAAGCTTTATCTCCTCCTCAATAATGTGTGTG

mRNA sequence

TTCCGCCATTTTTGATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTGGCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTTTCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTCTATACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGCGAAGAATTTCGTGTTATATTGATGGACCAGCGGGGAACAGGATTATCGACTCCTTTGTCTCCATCATCCATGTCCCAATTCCAAAGTGCAGAGGACTTGGCCGACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGATTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTTAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGGGCGCCTCGTCTCGCTGGTCTGCTCAAAGAATAATGAATGAACTGGAGAACAAATTCGATGCAACAAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGAGGTTTTTTTCTCTAAACTTTGTTGCTTTTCCTCATGATTATTGGATGCAGTCTTTGCCATGACCTTTTCATTTCCTCCTCAATAAGCTTTATCTCCTCCTCAATAATGTGTGTG

Coding sequence (CDS)

ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTAATTCCACTCACAAGACTTCTTTCCGCCGTCCATTGCCGGAGCTCCGTCCGTTCATTGGCAGTCATGGCCGCCACCAATCCCTCTAATGGAGCATCTCCGCCGGAGCACGCAGCTGGCACCTGGTACTCCGTGCCGGAGCTCCGGCTCCGAGACCATTACTTTTCTGTGCCTCTCAATTACTCTCTAGATCACTCTTCTCCCAAGATCTCCGTTTATGCGCGGGAAGTTGTTTCAGTGGGGAAAGAAGAGCAACCAATGCCATACCTTCTATACTTACAAGGTGGACCTGGATTTGAGTGTCCCCGACCGACTGAAGCAAGTGGATGGATACAAAAAGCATGCGAAGAATTTCGTGTTATATTGATGGACCAGCGGGGAACAGGATTATCGACTCCTTTGTCTCCATCATCCATGTCCCAATTCCAAAGTGCAGAGGACTTGGCCGACTACTTGAAACATTTTCGAGCTGATAACATAGTGAATGATGCTGAATTCATTAGGACTCGTCTTGTTCCTGATGCTGCACCTTGGACCATTTTGGGTCAGAGCTATGGTGGGTTTTGTGCAGTTACGTATTTGAGTTTTGCACCACAAGGATTGAAACAAGTCCTCATAACTGGAGGAATCCCTCCAATAGGGAATGGATGCACGGCAGATTCTGTATATAGAGCATGCTTTGAAAAGATTATTATTCAAAATGAAAAATACTACAAGAGGTATCCTCAGGATGTCAAAATCGTCCATGAAGTTGTGAAATACTTGGAGGAGAATGGAGGCGGGATTCCTCTTCCCTGTGGTGGTATCTTGACACCTAAAGGGCTGCAAACTCTTGGGCTTTCTGCTTTAGGATCTAGTACAGGTTTCGAGCGCATGCACTATTTGTTTGAGAGAGTATGGGATCCTATAATAGTTCCTGGAGCACCAAAACGAATCAGTTATTTCTTCCTTAATGCTATCAGTGGCTGGCTCTCACTTGATTCAAATCCTCTTTATGGTCTCATGCACGAGTCGATATATTGCCAGGGCGCCTCGTCTCGCTGGTCTGCTCAAAGAATAATGAATGAACTGGAGAACAAATTCGATGCAACAAAGGCTGTAAAAGAAGGATGTCCTGTGTATTTCACTGGAGAGATGATCTTCCCGTGGATGTTTGACGAGATTCATGCCTTGAAACCGTTCAAAGACGCCGCTAATATATTGGCCGAGAAGGAGGATTGGCCTCCCCTATATGACATTGCTGCTCTTAAAAATAACAAGGTCCCGGTCGCAGCAGCAGTTTACTACGAAGATATGTACGTAAACTTCAAGCTGGCCATGGAGACAGCTTCCCAAATAGCAGGAATCAGGCTGTGGGTTACTAATGAATTTATGCATTCTGGTCTGCGTGATGGAGGGCCTCAAGTTCTGGATCACTTGATGGGATTGTTAAATGGAAAGAAGCCTTTATTCTGA
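
The coding sequence above can be translated into the protein that serves as the BLAST query. The sketch below assumes the standard genetic code and uses only the Python standard library; `translate` is an illustrative helper, and only the first 48 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the nuclear standard code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 16 codons of the CDS shown above
cds_start = "ATGAAGGCACTCCTCTTTCACTCTTTCCCCTCCCCCGCCCGTTCATTA"
print(translate(cds_start))  # MKALLFHSFPSPARSL
```

The result agrees with the first query residues shown in the alignments that start at position 1 (MKALLFHSFPSPARSL).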
BLAST of CmoCh06G010800 vs. Swiss-Prot
Match: PIP_AERSO (Proline iminopeptidase OS=Aeromonas sobria GN=pip PE=1 SV=3)

HSP 1 Score: 377.5 bits (968), Expect = 2.3e-103
Identity = 187/434 (43.09%), Positives = 271/434 (62.44%), Query Frame = 1

Query: 58  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECP 117
           Y +  +    H+F+VPL++        I+++ R +    + +  +P+LLYLQGGPGF  P
Sbjct: 7   YVLDGIHCEPHFFTVPLDHQQPDDEETITLFGRTLCRKDRLDDELPWLLYLQGGPGFGAP 66

Query: 118 RPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVN 177
           RP+   GWI++A +EFRV+L+DQRGTG STP+    ++     +  ADYL HFRAD+IV 
Sbjct: 67  RPSANGGWIKRALQEFRVLLLDQRGTGHSTPIHAELLAHLNPRQQ-ADYLSHFRADSIVR 126

Query: 178 DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSV 237
           DAE IR +L PD  PW++LGQS+GGFC++TYLS  P  L +V +TGG+ PIG   +AD V
Sbjct: 127 DAELIREQLSPDH-PWSLLGQSFGGFCSLTYLSLFPDSLHEVYLTGGVAPIGR--SADEV 186

Query: 238 YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSA 297
           YRA ++++  +N  ++ R+P    I + +  +L+ +   + LP G  LT + LQ  GL  
Sbjct: 187 YRATYQRVADKNRAFFARFPHAQAIANRLATHLQRHD--VRLPNGQRLTVEQLQQQGLD- 246

Query: 298 LGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQ 357
           LG+S  FE ++YL E  +         ++++  FL  +      ++NP++ ++HE IYC+
Sbjct: 247 LGASGAFEELYYLLEDAF-------IGEKLNPAFLYQVQAMQPFNTNPVFAILHELIYCE 306

Query: 358 GASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILA 417
           GA+S W+A+R+  E         A  +G    FTGEMIFPWMF++   L P K+AA++LA
Sbjct: 307 GAASHWAAERVRGEFP-----ALAWAQGKDFAFTGEMIFPWMFEQFRELIPLKEAAHLLA 366

Query: 418 EKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGL 477
           EK DW PLYD   L  NKVPVA AVY EDMYV F  + ET   ++  R W+TNE+ H+GL
Sbjct: 367 EKADWGPLYDPVQLARNKVPVACAVYAEDMYVEFDYSRETLKGLSNSRAWITNEYEHNGL 421

Query: 478 RDGGPQVLDHLMGL 492
           R  G Q+LD L+ L
Sbjct: 427 RVDGEQILDRLIRL 421
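
The percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them from the header line of the hit above; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts either spelling of the second label.

```python
import re

def parse_hsp_stats(line):
    """Extract identity and positive counts from a BLAST HSP header line."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no identity/positives fields found")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }

header = "Identity = 187/434 (43.09%), Positives = 271/434 (62.44%), Query Frame = 1"
print(parse_hsp_stats(header))
```

For this hit, 187 of 434 aligned columns are identical (43.09%) and 271 of 434 are scored as positive (62.44%).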

BLAST of CmoCh06G010800 vs. TrEMBL
Match: A0A0A0L423_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G121050 PE=4 SV=1)

HSP 1 Score: 894.8 bits (2311), Expect = 4.7e-257
Identity = 428/497 (86.12%), Positives = 457/497 (91.95%), Query Frame = 1

Query: 4   LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPEL 63
           L FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPEL
Sbjct: 13  LHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPEL 72

Query: 64  RLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEA 123
           RLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPMPYLL+LQGGPGFEC RPTEA
Sbjct: 73  RLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEA 132

Query: 124 SGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFI 183
           SGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQS++DLA+YLKHFRADNIVNDAEFI
Sbjct: 133 SGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFI 192

Query: 184 RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF 243
           RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF
Sbjct: 193 RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF 252

Query: 244 EKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST 303
           EK+IIQNEKYYKRYPQD++IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG+ST
Sbjct: 253 EKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTST 312

Query: 304 GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSR 363
           GFER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSR
Sbjct: 313 GFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSR 372

Query: 364 WSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDW 423
           WSAQRI NE+ENKFDA KAVKEGC VYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDW
Sbjct: 373 WSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDW 432

Query: 424 PPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP 483
           PPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GP
Sbjct: 433 PPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGP 492

Query: 484 QVLDHLMGLLNGKKPLF 500
           QVLDHLMGLLNGKKPLF
Sbjct: 493 QVLDHLMGLLNGKKPLF 509

BLAST of CmoCh06G010800 vs. TrEMBL
Match: A0A061DTX8_THECC (Proline iminopeptidase, putative OS=Theobroma cacao GN=TCM_005258 PE=4 SV=1)

HSP 1 Score: 783.5 bits (2022), Expect = 1.5e-223
Identity = 374/503 (74.35%), Positives = 426/503 (84.69%), Query Frame = 1

Query: 5   LFHSFPSPARSL------IPLTRLLSAVHCRSSVRSLAVMA-ATNPSNGASPPEHAAGTW 64
           LF SF S   +L      IP T+LLS    R+S R+L  MA A +   G S PEH AG W
Sbjct: 14  LFFSFSSSLSTLSATLSPIPSTKLLSFRPRRTSFRALTTMAGAKSDCTGYSSPEHVAGNW 73

Query: 65  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECP 124
           YSVP+LRLRDH F VPL+Y    +S KIS++AREVV+ GKEEQ MPYLLYLQGGPGFECP
Sbjct: 74  YSVPDLRLRDHRFMVPLDYKDREASSKISIFAREVVAAGKEEQLMPYLLYLQGGPGFECP 133

Query: 125 RPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVN 184
           RPTE SGWI KACEEFRVILMDQRGTGLSTPL+PSSM Q +SA+ LADYLKHFRAD+IVN
Sbjct: 134 RPTEGSGWILKACEEFRVILMDQRGTGLSTPLTPSSMQQMKSAQSLADYLKHFRADSIVN 193

Query: 185 DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSV 244
           DAEFIR  LVPDA PWT+LGQSYGGFC VTYLSFAPQGLKQVL+TGGIPP+G+GCTAD++
Sbjct: 194 DAEFIRVHLVPDARPWTVLGQSYGGFCGVTYLSFAPQGLKQVLLTGGIPPMGDGCTADAI 253

Query: 245 YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEEN-GGGIPLPCGGILTPKGLQTLGLS 304
           Y ACF ++I QNEKYYKR+PQDV+IV +V+ YL E+ GGG+ LP GGILTP+GLQ LGLS
Sbjct: 254 YSACFGQVIRQNEKYYKRFPQDVEIVRDVITYLAESEGGGVLLPSGGILTPRGLQFLGLS 313

Query: 305 ALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYC 364
            LGSS GFER+HYLFERVWDP++VPGAPKRIS +FLNA   WL+ D+NPLY ++HESIYC
Sbjct: 314 GLGSSAGFERLHYLFERVWDPMLVPGAPKRISSYFLNAYESWLAFDTNPLYAILHESIYC 373

Query: 365 QGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANIL 424
           QGASSRWSA R+  + ++KFDA +A +EG PV  TGEMIFPWMFDE++AL+PFKDAA++L
Sbjct: 374 QGASSRWSAHRVRADHDSKFDAIRAAREGRPVLLTGEMIFPWMFDEVNALRPFKDAAHLL 433

Query: 425 AEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSG 484
           AEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVN KL METASQIAGIRLW+TNE+MHSG
Sbjct: 434 AEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNIKLVMETASQIAGIRLWITNEYMHSG 493

Query: 485 LRDGGPQVLDHLMGLLNGKKPLF 500
           LRDGG QV DHLMG+LNGKKPLF
Sbjct: 494 LRDGGGQVFDHLMGMLNGKKPLF 516

BLAST of CmoCh06G010800 vs. TrEMBL
Match: M5XMD2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004322mg PE=4 SV=1)

HSP 1 Score: 783.1 bits (2021), Expect = 2.0e-223
Identity = 375/507 (73.96%), Positives = 433/507 (85.40%), Query Frame = 1

Query: 1   MKALLFHSFPSPARSL--IPLTRLLSAVHC----RSSVRSLAVMAATNPSNGASPPEHAA 60
           +++LL+   PS + SL  +PL  L++ +HC    + SVR++  M+  N ++G S P+H A
Sbjct: 11  IRSLLYVPSPSLSLSLRTLPLF-LVTKLHCFNCSQRSVRTVTAMSLPNATSGESSPDHVA 70

Query: 61  GTWYSVPELRLRDHYFSVPLNYSLD-HSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPG 120
           G W+SVPELRLRDH F+VPL++S+   +S KIS++AREVVSVGKEEQP+PYLLYLQGGPG
Sbjct: 71  GKWFSVPELRLRDHRFTVPLDHSVGLKASSKISIFAREVVSVGKEEQPLPYLLYLQGGPG 130

Query: 121 FECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRAD 180
           FE PRPTE SGWI+KACEEFRVILMDQRGTGLSTPL+ SSMSQ +S  DLADYLKHFRAD
Sbjct: 131 FEAPRPTEPSGWIRKACEEFRVILMDQRGTGLSTPLTASSMSQLKSEVDLADYLKHFRAD 190

Query: 181 NIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCT 240
           NIVNDAEFIR RLVPDA PWTILGQS+GGFCAVTYLSFAPQGLKQVL+TGGIPPIGNGCT
Sbjct: 191 NIVNDAEFIRVRLVPDAGPWTILGQSFGGFCAVTYLSFAPQGLKQVLLTGGIPPIGNGCT 250

Query: 241 ADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEEN-GGGIPLPCGGILTPKGLQT 300
           AD+VY+ACFE+II QNEKYY+RYPQD+++V EVV YL ++ GGG+ LP GG LTPKGLQ 
Sbjct: 251 ADAVYKACFEQIIHQNEKYYQRYPQDIEVVREVVNYLSKSEGGGVQLPSGGFLTPKGLQI 310

Query: 301 LGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHE 360
           LGL+ LGSS GFER+HY+FER WDPIIVPGA K ISY+FL+A   W S D+NPLY L+HE
Sbjct: 311 LGLTGLGSSAGFERLHYMFERAWDPIIVPGASKEISYYFLDAFDKWSSFDTNPLYALLHE 370

Query: 361 SIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDA 420
            IYCQG SSRW+AQRI  E E KFDA +A KEG P++FTGEMIFPWMFDEIHAL+ FK A
Sbjct: 371 PIYCQGGSSRWAAQRIRAENEGKFDAVRAAKEGRPIFFTGEMIFPWMFDEIHALRKFKGA 430

Query: 421 ANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEF 480
           A+ILAEK+DWPPLYDI AL NNKVPVAAAVYYEDM+VNFKL METASQIAGIRLW+TNEF
Sbjct: 431 AHILAEKKDWPPLYDITALNNNKVPVAAAVYYEDMFVNFKLVMETASQIAGIRLWITNEF 490

Query: 481 MHSGLRDGGPQVLDHLMGLLNGKKPLF 500
           MHSGLRD G QV DHLMG+L+GKKPLF
Sbjct: 491 MHSGLRDAGSQVFDHLMGMLDGKKPLF 516

BLAST of CmoCh06G010800 vs. TrEMBL
Match: D7UCE5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02160 PE=4 SV=1)

HSP 1 Score: 776.9 bits (2005), Expect = 1.4e-221
Identity = 371/500 (74.20%), Positives = 422/500 (84.40%), Query Frame = 1

Query: 4   LLFHSFPSPARSLI--PLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVP 63
           LL     S A S++  P+ + L     R    ++  MA +N S G S  +H AG WYSVP
Sbjct: 12  LLLRFISSTAHSIVLNPIPKPLCFHSSRRLAGAVIAMAGSNSSAGGSSSDHVAGAWYSVP 71

Query: 64  ELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPT 123
           +LRLRDHYF+VPL+YSLD S+ PKIS++AREVVSVGKEEQP+P+LLYLQGGPGFE PRPT
Sbjct: 72  DLRLRDHYFTVPLDYSLDCSTCPKISIFAREVVSVGKEEQPLPFLLYLQGGPGFESPRPT 131

Query: 124 EASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAE 183
           E SGWI KACEE+RV+L+DQRGTGLSTPL+ SSM Q +S EDL +YLKHFRADNIVNDAE
Sbjct: 132 EGSGWISKACEEYRVVLLDQRGTGLSTPLTASSMMQMKSPEDLTNYLKHFRADNIVNDAE 191

Query: 184 FIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRA 243
           FIR  LVPDA PWTILGQS+GGFCAVTYLSFAP+GLKQVL+TGGIPPIG+GCTAD+VY  
Sbjct: 192 FIRVHLVPDAGPWTILGQSFGGFCAVTYLSFAPKGLKQVLLTGGIPPIGSGCTADTVYSV 251

Query: 244 CFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEEN-GGGIPLPCGGILTPKGLQTLGLSALG 303
           CFE+I  QNEKYYKR+PQD++IV EVV +L E+ GGG+PLP GGILTP+GLQ LGLS LG
Sbjct: 252 CFEQIFRQNEKYYKRFPQDIEIVREVVTHLAEHEGGGVPLPSGGILTPRGLQLLGLSCLG 311

Query: 304 SSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGA 363
           SSTGFER+HY+ ERVWDPII+PGA K+ISY+FL A    L  D+NPL+ L+HESIYCQGA
Sbjct: 312 SSTGFERLHYMLERVWDPIIIPGAQKQISYYFLTAYERSLDFDTNPLFALLHESIYCQGA 371

Query: 364 SSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEK 423
           SSRWSA RI  + E KFDA KA KEG PV FTGEMIFPWMF+EIHAL+PFKDAAN+LAEK
Sbjct: 372 SSRWSAHRIRAKDEGKFDAMKAAKEGRPVLFTGEMIFPWMFEEIHALRPFKDAANLLAEK 431

Query: 424 EDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRD 483
           EDWPPLYDI +L NNKVPVAAAVYYEDMYVNFKL METASQIAGIRLW+TNEFMHSGLRD
Sbjct: 432 EDWPPLYDIDSLNNNKVPVAAAVYYEDMYVNFKLVMETASQIAGIRLWITNEFMHSGLRD 491

Query: 484 GGPQVLDHLMGLLNGKKPLF 500
           GG QV DHLMG+L+GKKPLF
Sbjct: 492 GGSQVFDHLMGILSGKKPLF 511

BLAST of CmoCh06G010800 vs. TrEMBL
Match: B9RDY5_RICCO (Proline iminopeptidase, putative OS=Ricinus communis GN=RCOM_1616700 PE=4 SV=1)

HSP 1 Score: 775.4 bits (2001), Expect = 4.2e-221
Identity = 357/468 (76.28%), Positives = 411/468 (87.82%), Query Frame = 1

Query: 34  SLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHS-SPKISVYAREV 93
           S   MA  N S   SPP+H AG WYSVP+LRLRDH F+VPL+YS+DH+ SPKIS++AREV
Sbjct: 46  SFTTMAEANESTAYSPPQHIAGHWYSVPDLRLRDHRFTVPLDYSIDHNASPKISIFAREV 105

Query: 94  VSVGKEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPS 153
           V+VGKEEQ +PYLL+LQGGPGFECPRPTE SGWI KACEEFR+ILMDQRGTGLSTPL+PS
Sbjct: 106 VAVGKEEQLLPYLLFLQGGPGFECPRPTEGSGWINKACEEFRLILMDQRGTGLSTPLTPS 165

Query: 154 SMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFA 213
           SM+Q  SAE++A+Y+K+FRADNIVNDAEFIR RLVPDA PWTILGQSYGGFCAVTYLSFA
Sbjct: 166 SMAQLGSAENMAEYIKYFRADNIVNDAEFIRVRLVPDAEPWTILGQSYGGFCAVTYLSFA 225

Query: 214 PQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEE 273
           P GLKQVL+TGGIPPI NGC+AD+VYRAC+E++I QNEKYYKR+P DV+IV EVV +L E
Sbjct: 226 PHGLKQVLLTGGIPPISNGCSADTVYRACYEQVIRQNEKYYKRFPHDVEIVQEVVNHLAE 285

Query: 274 N-GGGIPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFF 333
           + GGG+PLP GGILTP+GLQ LGLS LGSS GFER+HY+FERVWDPIIVPG+ KR+S++F
Sbjct: 286 SEGGGVPLPSGGILTPRGLQALGLSGLGSSAGFERLHYIFERVWDPIIVPGSRKRVSHYF 345

Query: 334 LNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFT 393
           L A   WL  DSNPLY L+HESIYCQGASS+WSA RIM E   + +A KA KEG PV+FT
Sbjct: 346 LKAFENWLDFDSNPLYALLHESIYCQGASSQWSAHRIMAEDNGQLNAVKAAKEGRPVFFT 405

Query: 394 GEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNF 453
           GEM+FPWMFDEIHALK FK+ A +LAEK+DWPPLYDI  L NNK+PVAAAVYYEDMYVNF
Sbjct: 406 GEMVFPWMFDEIHALKQFKETAQLLAEKKDWPPLYDITMLNNNKIPVAAAVYYEDMYVNF 465

Query: 454 KLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF 500
           ++AMETASQIAGIRLW+TNE+MHSGLRD G +VLDHL+G+LNGKKPLF
Sbjct: 466 RVAMETASQIAGIRLWITNEYMHSGLRDAGGRVLDHLLGMLNGKKPLF 513

BLAST of CmoCh06G010800 vs. TAIR10
Match: AT3G61540.1 (AT3G61540.1 alpha/beta-Hydrolases superfamily protein)

HSP 1 Score: 743.0 bits (1917), Expect = 1.2e-214
Identity = 349/455 (76.70%), Positives = 395/455 (86.81%), Query Frame = 1

Query: 46  GASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYL 105
           G S  EH  G W+SVPELRLRDH F VPL+YS   SSPKI+V+ARE+V+VGKEEQ MPYL
Sbjct: 63  GESKSEHVTGKWFSVPELRLRDHRFIVPLDYS--KSSPKITVFAREIVAVGKEEQAMPYL 122

Query: 106 LYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLAD 165
           LYLQGGPGFE PRP+EASGWIQ+ACEEFRV+L+DQRGTGLSTPL+ SSM QF+SA++LAD
Sbjct: 123 LYLQGGPGFEGPRPSEASGWIQRACEEFRVVLLDQRGTGLSTPLTCSSMLQFKSAKELAD 182

Query: 166 YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGI 225
           YL HFRADNIV DAEFIR RLVP A PWTILGQS+GGFCA+TYLSFAP+GLKQVLITGGI
Sbjct: 183 YLVHFRADNIVKDAEFIRVRLVPKADPWTILGQSFGGFCALTYLSFAPEGLKQVLITGGI 242

Query: 226 PPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEEN-GGGIPLPCGGI 285
           PPIG  CTAD VY A FE++  QNEKYYKR+PQD++IV E+V YL E+ GGG+PLP GGI
Sbjct: 243 PPIGKACTADDVYEAGFEQVARQNEKYYKRFPQDIEIVRELVNYLAESEGGGVPLPSGGI 302

Query: 286 LTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSN 345
           LTPKGLQTLGLS LGSSTGFER+HY+ ERVWDPI+V GAPK IS FFLNA   W S D+N
Sbjct: 303 LTPKGLQTLGLSGLGSSTGFERLHYMLERVWDPILVTGAPKCISQFFLNAFESWHSFDTN 362

Query: 346 PLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIH 405
           PLY L+HE+IYC+GASS WSA R+ ++ E KFDA KAVKE  PV FTGEMIFPWMFDEIH
Sbjct: 363 PLYALLHEAIYCEGASSGWSAHRLRDKYEYKFDAMKAVKESQPVLFTGEMIFPWMFDEIH 422

Query: 406 ALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGI 465
           ALKPFK AA++LA+KEDWPPLYD+  L+NNKVPVAAAVYYEDMYVNFKL  ETAS I+GI
Sbjct: 423 ALKPFKAAADLLAKKEDWPPLYDVPRLQNNKVPVAAAVYYEDMYVNFKLVTETASHISGI 482

Query: 466 RLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF 500
           RLWVTNEFMHSGLRD G Q++DHL+G++NGKKPLF
Sbjct: 483 RLWVTNEFMHSGLRDAGRQIIDHLLGMINGKKPLF 515

BLAST of CmoCh06G010800 vs. NCBI nr
Match: gi|659075131|ref|XP_008437982.1| (PREDICTED: uncharacterized protein LOC103483239 [Cucumis melo])

HSP 1 Score: 905.6 bits (2339), Expect = 3.9e-260
Identity = 434/497 (87.32%), Positives = 458/497 (92.15%), Query Frame = 1

Query: 4   LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPEL 63
           L FHS P     LIPL   LSA HCR SVR  A MA        SPP H AGTWYSVPEL
Sbjct: 13  LHFHSLPFRLLPLIPLPNFLSAAHCRRSVRLSAAMAGILSPRAPSPPVHVAGTWYSVPEL 72

Query: 64  RLRDHYFSVPLNYSLDH-SSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEA 123
           RLRDH+FSVPLNYSLD  SS +ISV+AREVVSVGKE+QPMPYLLYLQGGPGFEC RP+EA
Sbjct: 73  RLRDHHFSVPLNYSLDQGSSTRISVFAREVVSVGKEDQPMPYLLYLQGGPGFECARPSEA 132

Query: 124 SGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFI 183
           SGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQF+SAEDLA+YLKHFRADNIVNDAEFI
Sbjct: 133 SGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFRSAEDLANYLKHFRADNIVNDAEFI 192

Query: 184 RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF 243
           RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF
Sbjct: 193 RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF 252

Query: 244 EKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST 303
           EK+IIQNEKYYKRYPQD++IV EVVKYL +NGGG+ LP GGILTPKGLQTLGLSALG+ST
Sbjct: 253 EKVIIQNEKYYKRYPQDIEIVREVVKYLADNGGGVLLPSGGILTPKGLQTLGLSALGTST 312

Query: 304 GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSR 363
           GFER+HYLFERVWDPI+VPGAPKRIS+FFLNAI  WLSLDSNPLY L+HESIYCQGASSR
Sbjct: 313 GFERLHYLFERVWDPILVPGAPKRISFFFLNAIDNWLSLDSNPLYVLLHESIYCQGASSR 372

Query: 364 WSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDW 423
           WSAQRI NE+ENKFDA KAVKEGCPVYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDW
Sbjct: 373 WSAQRIKNEVENKFDANKAVKEGCPVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDW 432

Query: 424 PPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP 483
           PPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAMETASQIAGIRLW+TNEFMHSGLRD GP
Sbjct: 433 PPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMETASQIAGIRLWITNEFMHSGLRDAGP 492

Query: 484 QVLDHLMGLLNGKKPLF 500
           QVLDHLMGLLNGKKPLF
Sbjct: 493 QVLDHLMGLLNGKKPLF 509

BLAST of CmoCh06G010800 vs. NCBI nr
Match: gi|700201345|gb|KGN56478.1| (hypothetical protein Csa_3G121050 [Cucumis sativus])

HSP 1 Score: 894.8 bits (2311), Expect = 6.8e-257
Identity = 428/497 (86.12%), Positives = 457/497 (91.95%), Query Frame = 1

Query: 4   LLFHSFPSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPEL 63
           L FHS P     LIPL   LSA HCR SVR  A MA       ASPP H +GTWYSVPEL
Sbjct: 13  LHFHSLPCRVLPLIPLRNFLSAAHCRRSVRLSAAMAGILSPRAASPPVHVSGTWYSVPEL 72

Query: 64  RLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEA 123
           RLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVGKE+QPMPYLL+LQGGPGFEC RPTEA
Sbjct: 73  RLRDHHFSVPLNYSLNQASCTRISVFAREVVSVGKEDQPMPYLLFLQGGPGFECARPTEA 132

Query: 124 SGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFI 183
           SGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQFQS++DLA+YLKHFRADNIVNDAEFI
Sbjct: 133 SGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQFQSSDDLANYLKHFRADNIVNDAEFI 192

Query: 184 RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF 243
           RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF
Sbjct: 193 RTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACF 252

Query: 244 EKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGGIPLPCGGILTPKGLQTLGLSALGSST 303
           EK+IIQNEKYYKRYPQD++IV EVVKYL ENGGG+ LP GGILTPKGLQTLGLSALG+ST
Sbjct: 253 EKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGGVLLPSGGILTPKGLQTLGLSALGTST 312

Query: 304 GFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSR 363
           GFER+HYLFERVWDPI+V G+PKRIS+FFLNAI  WLSLDSNPLY L+HE+IYCQGASSR
Sbjct: 313 GFERLHYLFERVWDPILVRGSPKRISFFFLNAIDNWLSLDSNPLYVLLHETIYCQGASSR 372

Query: 364 WSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDW 423
           WSAQRI NE+ENKFDA KAVKEGC VYFTGEMIFPWMFDEIHAL+PFKDAA+ILA+KEDW
Sbjct: 373 WSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIFPWMFDEIHALRPFKDAAHILADKEDW 432

Query: 424 PPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGP 483
           PPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+TASQIAGIRLWVTNEFMHSGLRD GP
Sbjct: 433 PPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMDTASQIAGIRLWVTNEFMHSGLRDAGP 492

Query: 484 QVLDHLMGLLNGKKPLF 500
           QVLDHLMGLLNGKKPLF
Sbjct: 493 QVLDHLMGLLNGKKPLF 509

BLAST of CmoCh06G010800 vs. NCBI nr
Match: gi|778677096|ref|XP_004133842.2| (PREDICTED: uncharacterized protein LOC101216845 [Cucumis sativus])

HSP 1 Score: 865.1 bits (2234), Expect = 5.8e-248
Identity = 409/463 (88.34%), Positives = 438/463 (94.60%), Query Frame = 1

Query: 38  MAATNPSNGASPPEHAAGTWYSVPELRLRDHYFSVPLNYSLDHSS-PKISVYAREVVSVG 97
           MA       ASPP H +GTWYSVPELRLRDH+FSVPLNYSL+ +S  +ISV+AREVVSVG
Sbjct: 1   MAGILSPRAASPPVHVSGTWYSVPELRLRDHHFSVPLNYSLNQASCTRISVFAREVVSVG 60

Query: 98  KEEQPMPYLLYLQGGPGFECPRPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQ 157
           KE+QPMPYLL+LQGGPGFEC RPTEASGWIQKACEEFRVILMDQRGTGLSTPL+PSSMSQ
Sbjct: 61  KEDQPMPYLLFLQGGPGFECARPTEASGWIQKACEEFRVILMDQRGTGLSTPLTPSSMSQ 120

Query: 158 FQSAEDLADYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGL 217
           FQS++DLA+YLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGL
Sbjct: 121 FQSSDDLANYLKHFRADNIVNDAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGL 180

Query: 218 KQVLITGGIPPIGNGCTADSVYRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEENGGG 277
           KQVLITGGIPPIGNGCTADSVYRACFEK+IIQNEKYYKRYPQD++IV EVVKYL ENGGG
Sbjct: 181 KQVLITGGIPPIGNGCTADSVYRACFEKVIIQNEKYYKRYPQDIEIVREVVKYLAENGGG 240

Query: 278 IPLPCGGILTPKGLQTLGLSALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAIS 337
           + LP GGILTPKGLQTLGLSALG+STGFER+HYLFERVWDPI+V G+PKRIS+FFLNAI 
Sbjct: 241 VLLPSGGILTPKGLQTLGLSALGTSTGFERLHYLFERVWDPILVRGSPKRISFFFLNAID 300

Query: 338 GWLSLDSNPLYGLMHESIYCQGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIF 397
            WLSLDSNPLY L+HE+IYCQGASSRWSAQRI NE+ENKFDA KAVKEGC VYFTGEMIF
Sbjct: 301 NWLSLDSNPLYVLLHETIYCQGASSRWSAQRIKNEVENKFDANKAVKEGCAVYFTGEMIF 360

Query: 398 PWMFDEIHALKPFKDAANILAEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAME 457
           PWMFDEIHAL+PFKDAA+ILA+KEDWPPLYDIAALKNNKVPVAAAVYYEDM+VNFKLAM+
Sbjct: 361 PWMFDEIHALRPFKDAAHILADKEDWPPLYDIAALKNNKVPVAAAVYYEDMFVNFKLAMD 420

Query: 458 TASQIAGIRLWVTNEFMHSGLRDGGPQVLDHLMGLLNGKKPLF 500
           TASQIAGIRLWVTNEFMHSGLRD GPQVLDHLMGLLNGKKPLF
Sbjct: 421 TASQIAGIRLWVTNEFMHSGLRDAGPQVLDHLMGLLNGKKPLF 463

BLAST of CmoCh06G010800 vs. NCBI nr
Match: gi|747045237|ref|XP_011093018.1| (PREDICTED: uncharacterized protein LOC105173066 [Sesamum indicum])

HSP 1 Score: 785.0 bits (2026), Expect = 7.6e-224
Identity = 376/492 (76.42%), Positives = 429/492 (87.20%), Query Frame = 1

Query: 10  PSPARSLIPLTRLLSAVHCRSSVRSLAVMAATNPSNGASPPEHAAGTWYSVPELRLRDHY 69
           PSP     PL +LL   +  + + S+A    T  +  AS  +H +  W+SVPELRLRDH 
Sbjct: 28  PSPP----PLFKLLRFHYSIAKITSMA--GTTTAAASASDGKHVSSDWFSVPELRLRDHR 87

Query: 70  FSVPLNYSLDHS-SPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECPRPTEASGWIQK 129
           F+VPL+YSLD S SPKISV+ RE+V+VGKEE  +P+LLYLQGGPGFEC RPTEASGWI K
Sbjct: 88  FTVPLDYSLDDSTSPKISVFVRELVAVGKEELHLPFLLYLQGGPGFECQRPTEASGWISK 147

Query: 130 ACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVNDAEFIRTRLVP 189
           ACEE+RVILMDQRGTGLSTPLSPSSMSQF+SA +LADYLKHFRADNIV DAEFIR RLVP
Sbjct: 148 ACEEYRVILMDQRGTGLSTPLSPSSMSQFKSAMELADYLKHFRADNIVKDAEFIRKRLVP 207

Query: 190 DAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSVYRACFEKIIIQ 249
           D++PWT+LGQSYGGFCAVTYLSFAPQGLKQVL+TGGIPPIG+GCTAD+VYRACFE+++ Q
Sbjct: 208 DSSPWTVLGQSYGGFCAVTYLSFAPQGLKQVLLTGGIPPIGSGCTADAVYRACFEQVMHQ 267

Query: 250 NEKYYKRYPQDVKIVHEVVKYLEEN-GGGIPLPCGGILTPKGLQTLGLSALGSSTGFERM 309
           N KYY+R+P+DV++VHEVVKYL E+ GGG+ LP GGILTP+GLQ LGLS LGSSTGFER+
Sbjct: 268 NGKYYRRFPKDVELVHEVVKYLAESEGGGVALPSGGILTPRGLQLLGLSGLGSSTGFERL 327

Query: 310 HYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYCQGASSRWSAQR 369
           HY+FERVWDPI+VPGAPKRISYFFLNA   WL+ D+NPLY LMHESIYCQGASS WSA R
Sbjct: 328 HYMFERVWDPILVPGAPKRISYFFLNAYERWLAYDTNPLYALMHESIYCQGASSSWSAHR 387

Query: 370 IMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANILAEKEDWPPLYD 429
           I  E E++FDA KAVKEG PV FTGEMIFPW+FDEI AL+PFKDAA++LAEK DWPPLYD
Sbjct: 388 IRAENESQFDAIKAVKEGHPVLFTGEMIFPWLFDEIQALRPFKDAAHLLAEKMDWPPLYD 447

Query: 430 IAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSGLRDGGPQVLDH 489
           +AAL +NKVPVAAAVYYEDMYVNFKL META+QIAGIRLWVTNE+MHSGLRDGG QVLDH
Sbjct: 448 VAALNDNKVPVAAAVYYEDMYVNFKLVMETATQIAGIRLWVTNEYMHSGLRDGGGQVLDH 507

Query: 490 LMGLLNGKKPLF 500
           L+G+LNGKKPLF
Sbjct: 508 LLGMLNGKKPLF 513

BLAST of CmoCh06G010800 vs. NCBI nr
Match: gi|590721738|ref|XP_007051700.1| (Proline iminopeptidase, putative [Theobroma cacao])

HSP 1 Score: 783.5 bits (2022), Expect = 2.2e-223
Identity = 374/503 (74.35%), Positives = 426/503 (84.69%), Query Frame = 1

Query: 5   LFHSFPSPARSL------IPLTRLLSAVHCRSSVRSLAVMA-ATNPSNGASPPEHAAGTW 64
           LF SF S   +L      IP T+LLS    R+S R+L  MA A +   G S PEH AG W
Sbjct: 14  LFFSFSSSLSTLSATLSPIPSTKLLSFRPRRTSFRALTTMAGAKSDCTGYSSPEHVAGNW 73

Query: 65  YSVPELRLRDHYFSVPLNYSLDHSSPKISVYAREVVSVGKEEQPMPYLLYLQGGPGFECP 124
           YSVP+LRLRDH F VPL+Y    +S KIS++AREVV+ GKEEQ MPYLLYLQGGPGFECP
Sbjct: 74  YSVPDLRLRDHRFMVPLDYKDREASSKISIFAREVVAAGKEEQLMPYLLYLQGGPGFECP 133

Query: 125 RPTEASGWIQKACEEFRVILMDQRGTGLSTPLSPSSMSQFQSAEDLADYLKHFRADNIVN 184
           RPTE SGWI KACEEFRVILMDQRGTGLSTPL+PSSM Q +SA+ LADYLKHFRAD+IVN
Sbjct: 134 RPTEGSGWILKACEEFRVILMDQRGTGLSTPLTPSSMQQMKSAQSLADYLKHFRADSIVN 193

Query: 185 DAEFIRTRLVPDAAPWTILGQSYGGFCAVTYLSFAPQGLKQVLITGGIPPIGNGCTADSV 244
           DAEFIR  LVPDA PWT+LGQSYGGFC VTYLSFAPQGLKQVL+TGGIPP+G+GCTAD++
Sbjct: 194 DAEFIRVHLVPDARPWTVLGQSYGGFCGVTYLSFAPQGLKQVLLTGGIPPMGDGCTADAI 253

Query: 245 YRACFEKIIIQNEKYYKRYPQDVKIVHEVVKYLEEN-GGGIPLPCGGILTPKGLQTLGLS 304
           Y ACF ++I QNEKYYKR+PQDV+IV +V+ YL E+ GGG+ LP GGILTP+GLQ LGLS
Sbjct: 254 YSACFGQVIRQNEKYYKRFPQDVEIVRDVITYLAESEGGGVLLPSGGILTPRGLQFLGLS 313

Query: 305 ALGSSTGFERMHYLFERVWDPIIVPGAPKRISYFFLNAISGWLSLDSNPLYGLMHESIYC 364
            LGSS GFER+HYLFERVWDP++VPGAPKRIS +FLNA   WL+ D+NPLY ++HESIYC
Sbjct: 314 GLGSSAGFERLHYLFERVWDPMLVPGAPKRISSYFLNAYESWLAFDTNPLYAILHESIYC 373

Query: 365 QGASSRWSAQRIMNELENKFDATKAVKEGCPVYFTGEMIFPWMFDEIHALKPFKDAANIL 424
           QGASSRWSA R+  + ++KFDA +A +EG PV  TGEMIFPWMFDE++AL+PFKDAA++L
Sbjct: 374 QGASSRWSAHRVRADHDSKFDAIRAAREGRPVLLTGEMIFPWMFDEVNALRPFKDAAHLL 433

Query: 425 AEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNFKLAMETASQIAGIRLWVTNEFMHSG 484
           AEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVN KL METASQIAGIRLW+TNE+MHSG
Sbjct: 434 AEKEDWPPLYDIAALKNNKVPVAAAVYYEDMYVNIKLVMETASQIAGIRLWITNEYMHSG 493

Query: 485 LRDGGPQVLDHLMGLLNGKKPLF 500
           LRDGG QV DHLMG+LNGKKPLF
Sbjct: 494 LRDGGGQVFDHLMGMLNGKKPLF 516
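
The percentages in the HSP summary line follow directly from the residue counts and the aligned length, and the counts can be read off the midline of each alignment block. The sketch below is illustrative only and is not part of the database output. It assumes the usual BLAST midline convention (a letter for an identical residue, `+` for a conservative substitution, a space otherwise), and the helper names `percent` and `midline_counts` are hypothetical.

```python
def percent(count: int, aligned_length: int) -> float:
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


def midline_counts(midline: str) -> tuple[int, int]:
    """Count identities and positives in one BLAST midline segment.

    Letters mark identical residues; '+' marks a conservative substitution.
    Positives include identities, as in the BLAST summary line.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# Counts copied from the Theobroma cacao HSP summary above.
print(percent(374, 503))  # identity percentage
print(percent(426, 503))  # positives percentage

# Midline of the final alignment block (query positions 485..500).
print(midline_counts("LRDGG QV DHLMG+LNGKKPLF"))
```

Summing the per-block counts over all midline segments of an HSP should reproduce the totals in its summary line, provided the midline text is copied without losing internal spaces.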

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PIP_AERSO	2.3e-103	43.09	Proline iminopeptidase OS=Aeromonas sobria GN=pip PE=1 SV=3

Match Name	E-value	Identity	Description
A0A0A0L423_CUCSA	4.7e-257	86.12	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G121050 PE=4 SV=1
A0A061DTX8_THECC	1.5e-223	74.35	Proline iminopeptidase, putative OS=Theobroma cacao GN=TCM_005258 PE=4 SV=1
M5XMD2_PRUPE	2.0e-223	73.96	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004322mg PE=4 SV=1
D7UCE5_VITVI	1.4e-221	74.20	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g02160 PE=4 SV=...
B9RDY5_RICCO	4.2e-221	76.28	Proline iminopeptidase, putative OS=Ricinus communis GN=RCOM_1616700 PE=4 SV=1

Match Name	E-value	Identity	Description
AT3G61540.1	1.2e-214	76.70	alpha/beta-Hydrolases superfamily protein

Match Name	E-value	Identity	Description
gi|659075131|ref|XP_008437982.1|	3.9e-260	87.32	PREDICTED: uncharacterized protein LOC103483239 [Cucumis melo]
gi|700201345|gb|KGN56478.1|	6.8e-257	86.12	hypothetical protein Csa_3G121050 [Cucumis sativus]
gi|778677096|ref|XP_004133842.2|	5.8e-248	88.34	PREDICTED: uncharacterized protein LOC101216845 [Cucumis sativus]
gi|747045237|ref|XP_011093018.1|	7.6e-224	76.42	PREDICTED: uncharacterized protein LOC105173066 [Sesamum indicum]
gi|590721738|ref|XP_007051700.1|	2.2e-223	74.35	Proline iminopeptidase, putative [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000073	AB_hydrolase_1
IPR002410	Peptidase_S33
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
Vocabulary: Molecular Function
Term	Definition
GO:0008233	peptidase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0019344 cysteine biosynthetic process
biological_process GO:0006508 proteolysis
biological_process GO:0008150 biological_process
cellular_component GO:0005829 cytosol
cellular_component GO:0005773 vacuole
cellular_component GO:0005575 cellular_component
molecular_function GO:0004177 aminopeptidase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0008233 peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh06G010800.1	CmoCh06G010800.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000073	Alpha/beta hydrolase fold-1	PFAM	PF00561	Abhydrolase_1	coord: 105..258, score: 3.5
IPR002410	Peptidase S33	PRINTS	PR00793	PROAMNOPTASE	coord: 135..146, score: 5.0E-9; coord: 105..113, score: 5.0E-9; coord: 195..209, score: 5.
None	No IPR available	PANTHER	PTHR10992	ALPHA/BETA HYDROLASE FOLD-CONTAINING PROTEIN	coord: 28..170, score: 1.8E-168; coord: 383..494, score: 1.8E-168; coord: 189..268, score: 1.8E
None	No IPR available	PANTHER	PTHR10992:SF749	SUBFAMILY NOT NAMED	coord: 28..170, score: 1.8E-168; coord: 189..268, score: 1.8E-168; coord: 383..494, score: 1.8E