MC03g0668 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC03g0668
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protease Do-like 5, chloroplastic
Location: MC03: 13621677 .. 13625715 (-)
RNA-Seq Expression: MC03g0668
Synteny: MC03g0668
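
The Location field gives a start and end coordinate on MC03 with a strand sign. As a minimal sketch, assuming the coordinates are 1-based and inclusive (typical of GFF-style annotation, but not stated on this page), the genomic span of the feature can be computed as follows. The function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (MC03, minus strand).
gene_span = span_length(13621677, 13625715)
print(gene_span)  # 4039
```

The strand sign does not change the span; it only indicates that the gene is read from the reverse complement of the reference.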
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAAAATCTATTCGAGAGTAAAAATTGCAATACTTTAGTAGTTCTACTCTCAAAACTAAACCATTTCAGATTAATTAACTTCTTAGATTTACCCTTCTGGATTAAACAACTTTCAGATAAGTTGTGGCGAAAGAGTTAGAGCATAGCTTTAATCTTTCATAGGGATAAGGGAACTTCAAGAACTTCAAAATTTATAATGTTCGTTTGTTTCTAAAGCATAATGTTCAATTGATTAAGGTTTGAATTGCAGTTGCTGAATTTAAAAAGGGAAAATCACGATAAGAATCAAGATCAGAGGAAGTGGAACACCATGGCGTTAGCCTCACTGGGAATTCATCTTCTTCCAATTCCAGCTCCCCCAAATTCTTCCCACAACTCTCTACCCTTCACTTCGCGAAGAGCCCTAGTTTTTGCCCCATCTGCTTTGATGGCTTCCCTCCTCGCTTTCCCTCTTCCCACTCACGCCGCTCTCCCCCAAATACAGGCCCAGGTTCCACAAGAAGAAGATCGAGTCGTCGCTCTCTTTCAGGTCCCTCTCTTTCTCACTCACCATAACTGCAGAATTCTTCAGTTTCAATTGAAATTTAATCTTGCTTCTTTGTTTTCTGTTTTGCTCTGTTGGGTCCCAATTCCAAATGTAGGATGCTTCACCTTCTGTCGTTTACATTAAGGACCTTGAATTAGCTAAGAAACCCCAGAACTCCTCTGAAGAGGCCCTGCTCGTCGAGGATGAGAATGTCAAGGTCAAAGGGACTGGTTCGGGCTTTGTATGGGATAAATTTGGCCATATCGTATGCTCTTCTCTCCTTTAATTCTTACTTGCCTTCATTCTTTGTGACTGTCATTTTCATTTTATTGATTTTGGGTATCGGAAACATTTCTTTTTCTGTTCATTTCACAGAGGAATTTCATTAGGGCTTTGTGATTACTTGTGGCAGGTAACTAATTACCATGTTGTTTCCGCATTGGCTACTGATAACAGTGGATTGCAGCGTTGTAAGGTTTTGTTTATTTGTTACTGTCTTTTGAATTGTTATTTATGTCTTCATAATGGAAGATATATTATCAGTCTCTAGCCACTCCATTTGGCTTGCTTCTGTAGGTAAATTTAGTCGATGCTAAAGGAAATGGAATTTATAGGGAAGCAAAAATTGTAGGTTTTGATCCAGAGTATGATCTAGCTGTTCTCAAGGTATAGAACCAAACTTTTTTTTTTCTTTCCTAATTTTCTTTGAAGCTTCTTGTCTGCTCTGTCGTGTTCAAACTGGGTGCTGTCTACTCAATATAGAAGTTGAAATGATTTTGACATTTACAAAATCCCTCTTGAGATGGTGCCCATATATATTCTGTTTCGTTTTGTTCTAATTTAGCTCGAGACTCAAGGGGAATACCAGATTGAGTGAATTGTGTCTCGGTGATTCTCATCCTGTTTTCCGTTCCTTATGTGATGTGTGAAAGCAGCAGATTGGGAGGAGTGGAAAGTGAACCCTGTGAGCTATGGAGTGATGATGATAACATGTCATTTGTTTTCTATTCCATAAAACAGGCTTCACTGTTTGGACAATCTGTACTGTCAATCTCTTATACTATACCATGATTTGCTGAGTTTGAAGTTGCCATGTAACATCCAGGTGGAACTTGGAGGATGTGAACTAAAGCCCATCGTTCTCGGTACCTCTCGAAATTTACGTGTTGGTCAGAGCTGCTATGCCATTGGCAACCCTTTTGGTTATGAGAAGACACTAACAGCAGGGGTAAGACATCAATAATGTTACTTTCTCCATTTAATTCAGAGGAAACCCTCTTTGGATTTTCTATTTCAAAGCTAAAACTAAATTGGATAGTGAAAATACTTCTCTCTTATAATAAACATATTTCTCATAGGAGATATGATATCTCTCATCTCACACACATTTATGTTTTATTTATCAAAGACTGTGGATGAAGTAAGAGTTTTTCTAAGTGTCACTGTCAAGTTTCCTACTCATAGATTGATGGTG
TTCTTTACCAAGAAACACCTGCTGAGAGTCAGCATAGTGGATTGACTAAATAATCGAGAACTGTAAATCATTCAGGGTTTGAATAGCAGGTGATCAGTGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATTCGGGGGGGTATTCAGACAGATGCTGCTATTAGTTCAGGTTCATGGTTCAACTACCTGAAATCAGATACCGCCATTTCTCAAACTTGTTTTTTCATGCCTTTTCTTCTGCCGCTTTCTAAAGATTTTCTACTTCCCATCATGTATATTGCAAGAGTGAGCCTTTTTTATTCTTGACACTAAAGCGCTGTTAAGCTCCTTGTACATATATGGCAGCTTATCTTGTTCCTTTCTAATCTGGCTTGATTTCATTACTGCCCCAGTCACCCGTAATAATCATCATCATATTTTAACTTTCCCACATTTTAATGCATTAGAGTTTTACAGGGAATTCAGGGGGGCCATTAATTGACTCCTACGGTCATGTAATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGTAAAGGGCCTCAATTTCTTCATCTGCTTAGTTTGTCAGAGAAAAATATGGCTGATTTCTTTGCCTTAAAGCTTAGTAATTATCAAAGCTAACTCCATCTCTCTCTCTCTCTCTTTTATTTGCTACTTGAAAGAACATAAAATAGTAAAGAAAGAAAGAAAGGACTGCGGTTTTTTTGTGCCAATTCTATTCGAAGTTATATTATTCATCTTTGTTTATGCATTGCAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCAATTCCAATAGACACAGTTGTACGTACGGTTCCATACCTTATTGTATACGGAACACCTTACAGTGAGAGATTTTGATAAAGTTTTACAATCTTTTCACAAGGGAACACAACCATTCATGCAACTCCAACTCCTCTCCAAGACCGTGGACTGCACCGTAAAATGACAATTTGCTAACATAAATGAATTTGCTTTCTTCACAAGTGTTATGATTCATATGAATTCTTTCTAGATCTTGGTCTGAGTTGTGAATAATTGTTTATTTATGGAATATTACTCTTCCTTCCCTCTGTTGGGTGGTTCAATCATTGAAAATATTATTGTTGTACAGATTCAAAGAGTTTTTACTCATCTGTTAAAACTTTCTAGCCTTAATTTCTGGAGTGTAGCTTTCCAAACTAGAAACAATTCAGTGTTTCTGTCTCCATCCAAATAGCATGCCATTGTTGAAGATTCCACTTTTATGCACTTGATTCAATGAATAAAGAAAAGGAGAATATATTCTGGTTATCCACAACCTTGTAAGAGAAAATTTTTTTTATCTCCTTATCCCATATTTCTACTGAAAAGAAATAATCATAAACACTCTCTTCAAACAAGTGTGCTAGTCCTTTTTTCTCCTTTCTTAACCCCAGGCTTAGTCTTGATTTCTAGACTGATAGCAGCGCTAATTGTTCCACAATGGTACTATCTACATGACAGGACACACGGCAAGAGTTGGGGATAAGAATAAGGCCGAGGCACCGTTGTTCCATCGGAGAAGGTGAGCTACGTCGAGTACTACGATACTCCACTGCATCTTGTAGAATGATATTTCCTTGCTTGTCAATGCAGTGAAAAGTTCCCAGGAAAAATCTTCCATCTTTAATGCCTATGAGCATTCGGCGAAACAGTAGCTTTCTCACCTTTGCTATGCAATCTAAACTGCCCGGATTAGACTCGACATTGCTCCCAACCTGAACCCTGGGTCCCTCTGATTCTTGTTCCATGTATGGTTCGATTGATCCCTGTAAGTGGGGAAGGTTGTTGAATATAGTTGATGAACAGGACCAAGTATACGATCAGAATTGCAACTTAAATCAGTATGACAACTCAATTAGTGGAAAACAAATGGGACGAAAACCGTCCTAAAGAAATATATGAAGATTTTTTGAAAAAGGGAAAAAATGAAAGCCAGTAAAAGAGAGATTAATGTAAACCATGA
ACTCTTTACTAACATCCAATAGACACATAATACACTCCA

mRNA sequence

CGAAAATCTATTCGAGAGTAAAAATTGCAATACTTTAGTAGTTCTACTCTCAAAACTAAACCATTTCAGATTAATTAACTTCTTAGATTTACCCTTCTGGATTAAACAACTTTCAGATAAGTTGTGGCGAAAGAGTTAGAGCATAGCTTTAATCTTTCATAGGGATAAGGGAACTTCAAGAACTTCAAAATTTATAATGTTCGTTTGTTTCTAAAGCATAATGTTCAATTGATTAAGGTTTGAATTGCAGTTGCTGAATTTAAAAAGGGAAAATCACGATAAGAATCAAGATCAGAGGAAGTGGAACACCATGGCGTTAGCCTCACTGGGAATTCATCTTCTTCCAATTCCAGCTCCCCCAAATTCTTCCCACAACTCTCTACCCTTCACTTCGCGAAGAGCCCTAGTTTTTGCCCCATCTGCTTTGATGGCTTCCCTCCTCGCTTTCCCTCTTCCCACTCACGCCGCTCTCCCCCAAATACAGGCCCAGGTTCCACAAGAAGAAGATCGAGTCGTCGCTCTCTTTCAGGATGCTTCACCTTCTGTCGTTTACATTAAGGACCTTGAATTAGCTAAGAAACCCCAGAACTCCTCTGAAGAGGCCCTGCTCGTCGAGGATGAGAATGTCAAGGTCAAAGGGACTGGTTCGGGCTTTGTATGGGATAAATTTGGCCATATCGTAACTAATTACCATGTTGTTTCCGCATTGGCTACTGATAACAGTGGATTGCAGCGTTGTAAGGTAAATTTAGTCGATGCTAAAGGAAATGGAATTTATAGGGAAGCAAAAATTGTAGGTTTTGATCCAGAGTATGATCTAGCTGTTCTCAAGGTGGAACTTGGAGGATGTGAACTAAAGCCCATCGTTCTCGGTACCTCTCGAAATTTACGTGTTGGTCAGAGCTGCTATGCCATTGGCAACCCTTTTGGTTATGAGAAGACACTAACAGCAGGGGTGATCAGTGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATTCGGGGGGGTATTCAGACAGATGCTGCTATTAGTTCAGGGAATTCAGGGGGGCCATTAATTGACTCCTACGGTCATGTAATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCAATTCCAATAGACACAGTTGTACGTACGGTTCCATACCTTATTGTATACGGAACACCTTACAGTGAGAGATTTTGATAAAGTTTTACAATCTTTTCACAAGGGAACACAACCATTCATGCAACTCCAACTCCTCTCCAAGACCGTGGACTGCACCGTAAAATGACAATTTGCTAACATAAATGAATTTGCTTTCTTCACAAGTGTTATGATTCATATGAATTCTTTCTAGATCTTGGTCTGAGTTGTGAATAATTGTTTATTTATGGAATATTACTCTTCCTTCCCTCTGTTGGGTGGTTCAATCATTGAAAATATTATTGTTGTACAGATTCAAAGAGTTTTTACTCATCTGTTAAAACTTTCTAGCCTTAATTTCTGGAGTGTAGCTTTCCAAACTAGAAACAATTCAGTGTTTCTGTCTCCATCCAAATAGCATGCCATTGTTGAAGATTCCACTTTTATGCACTTGATTCAATGAATAAAGAAAAGGAGAATATATTCTGGTTATCCACAACCTTGTAAGAGAAAATTTTTTTTATCTCCTTATCCCATATTTCTACTGAAAAGAAATAATCATAAACACTCTCTTCAAACAAGTGTGCTAGTCCTTTTTTCTCCTTTCTTAACCCCAGGCTTAGTCTTGATTTCTAGACTGATAGCAGCGCTAATTGTTCCACAATGGTACTATCTACATGACAGGACACACGGCAAGAGTTGGGGATAAGAATAAGGCCGAGGCACCGTTGTTCCATCGGAGAAGGTGAGCTACGTCGAGTACTACGATACTCCACTGCATCTTGTAGAATGATATTTCCTTGCTTGTCAATGCAGTGAAAAGTTCCCAGGAAAA
ATCTTCCATCTTTAATGCCTATGAGCATTCGGCGAAACAGTAGCTTTCTCACCTTTGCTATGCAATCTAAACTGCCCGGATTAGACTCGACATTGCTCCCAACCTGAACCCTGGGTCCCTCTGATTCTTGTTCCATGTATGGTTCGATTGATCCCTGTAAGTGGGGAAGGTTGTTGAATATAGTTGATGAACAGGACCAAGTATACGATCAGAATTGCAACTTAAATCAGTATGACAACTCAATTAGTGGAAAACAAATGGGACGAAAACCGTCCTAAAGAAATATATGAAGATTTTTTGAAAAAGGGAAAAAATGAAAGCCAGTAAAAGAGAGATTAATGTAAACCATGAACTCTTTACTAACATCCAATAGACACATAATACACTCCA

Coding sequence (CDS)

ATGGCGTTAGCCTCACTGGGAATTCATCTTCTTCCAATTCCAGCTCCCCCAAATTCTTCCCACAACTCTCTACCCTTCACTTCGCGAAGAGCCCTAGTTTTTGCCCCATCTGCTTTGATGGCTTCCCTCCTCGCTTTCCCTCTTCCCACTCACGCCGCTCTCCCCCAAATACAGGCCCAGGTTCCACAAGAAGAAGATCGAGTCGTCGCTCTCTTTCAGGATGCTTCACCTTCTGTCGTTTACATTAAGGACCTTGAATTAGCTAAGAAACCCCAGAACTCCTCTGAAGAGGCCCTGCTCGTCGAGGATGAGAATGTCAAGGTCAAAGGGACTGGTTCGGGCTTTGTATGGGATAAATTTGGCCATATCGTAACTAATTACCATGTTGTTTCCGCATTGGCTACTGATAACAGTGGATTGCAGCGTTGTAAGGTAAATTTAGTCGATGCTAAAGGAAATGGAATTTATAGGGAAGCAAAAATTGTAGGTTTTGATCCAGAGTATGATCTAGCTGTTCTCAAGGTGGAACTTGGAGGATGTGAACTAAAGCCCATCGTTCTCGGTACCTCTCGAAATTTACGTGTTGGTCAGAGCTGCTATGCCATTGGCAACCCTTTTGGTTATGAGAAGACACTAACAGCAGGGGTGATCAGTGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATTCGGGGGGGTATTCAGACAGATGCTGCTATTAGTTCAGGGAATTCAGGGGGGCCATTAATTGACTCCTACGGTCATGTAATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTTAATTTTGCAATTCCAATAGACACAGTTGTACGTACGGTTCCATACCTTATTGTATACGGAACACCTTACAGTGAGAGATTTTGA

Protein sequence

MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQVPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF
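
The protein sequence should be the translation of the CDS. The sketch below, assuming the standard genetic code (NCBI translation table 1), translates the first 24 and last 18 nucleotides of the CDS shown above so the result can be compared with the start (MALASLGI) and end (YSERF) of the protein. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCGTTAGCCTCACTGGGAATT"  # first 24 nt of the CDS
cds_end = "TACAGTGAGAGATTTTGA"          # last 18 nt of the CDS, including the stop codon

print(translate(cds_start))  # MALASLGI
print(translate(cds_end))    # YSERF
```

Passing the full CDS string to `translate` would allow the same comparison over the whole protein.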
Homology
BLAST of MC03g0668 vs. ExPASy Swiss-Prot
Match: Q9SEL7 (Protease Do-like 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP5 PE=1 SV=3)

HSP 1 Score: 360.9 bits (925), Expect = 1.4e-98
Identity = 190/295 (64.41%), Positives = 227/295 (76.95%), Query Frame = 0

Query: 19  SSHNSLPFTSRRALVFAPS-ALMASLLA-----FPLPTHAALPQI---QAQVPQEEDRVV 78
           S+H  +    RR ++F  S AL +SLL       P+ +  AL Q    + ++ +EE+R V
Sbjct: 33  SNHVDVIDRRRRIMIFGSSLALTSSLLGSNQQRLPMESAIALEQFKEKEEELEEEEERNV 92

Query: 79  ALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFGHIVTNYHV 138
            LFQ  SPSVVYI+ +EL K    +S   +L ++EN K++GTGSGFVWDK GHIVTNYHV
Sbjct: 93  NLFQKTSPSVVYIEAIELPK----TSSGDILTDEENGKIEGTGSGFVWDKLGHIVTNYHV 152

Query: 139 VSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGT 198
           ++ LATD  GLQRCKV+LVDAKG    +E KIVG DP+ DLAVLK+E  G EL P+VLGT
Sbjct: 153 IAKLATDQFGLQRCKVSLVDAKGTRFSKEGKIVGLDPDNDLAVLKIETEGRELNPVVLGT 212

Query: 199 SRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGG 258
           S +LRVGQSC+AIGNP+GYE TLT GV+SGLGREIPSPNG++I   IQTDA I+SGNSGG
Sbjct: 213 SNDLRVGQSCFAIGNPYGYENTLTIGVVSGLGREIPSPNGKSISEAIQTDADINSGNSGG 272

Query: 259 PLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           PL+DSYGH IGVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Sbjct: 273 PLLDSYGHTIGVNTATFTRKGSGMSSGVNFAIPIDTVVRTVPYLIVYGTAYRDRF 323
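
Each HSP summary line reports identical and positive-scoring positions as a fraction of the alignment length. A minimal sketch of how such a line could be parsed and the percentages recomputed is given below. The regular expression and the function name `parse_hsp_summary` are assumptions made for illustration, and the pattern accepts both the "Positives" and "Postives" spellings of the field label.

```python
import re

# Matches e.g. "Identity = 190/295 (64.41%), Positives = 227/295 (76.95%)"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length_a, positive, length_b = map(int, match.groups())
    return (
        round(100 * identical / length_a, 2),
        round(100 * positive / length_b, 2),
    )


summary = "Identity = 190/295 (64.41%), Positives = 227/295 (76.95%), Query Frame = 0"
print(parse_hsp_summary(summary))  # (64.41, 76.95)
```

Recomputing the percentages from the raw counts is a quick consistency check on the reported values.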

BLAST of MC03g0668 vs. ExPASy Swiss-Prot
Match: Q9LU10 (Protease Do-like 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP8 PE=1 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.0e-49
Identity = 134/297 (45.12%), Positives = 180/297 (60.61%), Query Frame = 0

Query: 8   IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------ 67
           +H L + + P+++   L  +    L F PS  + S LA   P+ A +  +   V      
Sbjct: 54  LHELAVKSVPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPL 113

Query: 68  PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFG 127
              E R+V LF+  + SVV I D+ L  +PQ      + + +      G GSG VWD  G
Sbjct: 114 FPTEGRIVQLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQG 173

Query: 128 HIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 187
           +IVTNYHV+ +AL+ + S G    +VN++ + G     E K+VG D   DLAVLKV+   
Sbjct: 174 YIVTNYHVIGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPE 233

Query: 188 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 247
             LKPI +G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I GGIQTD
Sbjct: 234 TLLKPIKVGQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTD 293

Query: 248 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           AAI+ GNSGGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Sbjct: 294 AAINPGNSGGPLLDSKGNLIGINTAIFTQ--TGTSAGVGFAIPSSTVLKIVPQLIQF 339

BLAST of MC03g0668 vs. ExPASy Swiss-Prot
Match: O22609 (Protease Do-like 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP1 PE=1 SV=2)

HSP 1 Score: 188.0 bits (476), Expect = 1.6e-46
Identity = 124/287 (43.21%), Positives = 173/287 (60.28%), Query Frame = 0

Query: 22  NSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQA----------QVPQEEDRVVAL 81
           ++L FT   A+   P  L+ + +A      AA P +++          ++  +E   V L
Sbjct: 67  DTLHFTPFSAV--KPFFLLCTSVALSFSLFAASPAVESASAFVVSTPKKLQTDELATVRL 126

Query: 82  FQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFGHIVTNYHVVS 141
           FQ+ +PSVVYI +L +        ++A  ++   V  +G+GSGFVWDK GHIVTNYHV+ 
Sbjct: 127 FQENTPSVVYITNLAV-------RQDAFTLDVLEVP-QGSGSGFVWDKQGHIVTNYHVI- 186

Query: 142 ALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSR 201
                  G    +V L D        +AK+VGFD + D+AVL+++    +L+PI +G S 
Sbjct: 187 ------RGASDLRVTLADQ----TTFDAKVVGFDQDKDVAVLRIDAPKNKLRPIPVGVSA 246

Query: 202 NLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS-PNGRAIRGGIQTDAAISSGNSGGP 261
           +L VGQ  +AIGNPFG + TLT GVISGL REI S   GR I+  IQTDAAI+ GNSGGP
Sbjct: 247 DLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGP 306

Query: 262 LIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           L+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Sbjct: 307 LLDSSGTLIGINTAIYS--PSGASSGVGFSIPVDTVGGIVDQLVRFG 330

BLAST of MC03g0668 vs. ExPASy Swiss-Prot
Match: Q2SL36 (Probable periplasmic serine endoprotease DegP-like OS=Hahella chejuensis (strain KCTC 2396) OX=349521 GN=mucD PE=3 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 2.1e-30
Identity = 81/181 (44.75%), Positives = 111/181 (61.33%), Query Frame = 0

Query: 107 KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDP 166
           + + TGSGF+  K G+I+TN HVV       +G     V L+D +       AK++G D 
Sbjct: 87  EAQSTGSGFIVSKDGYILTNNHVV-------AGADEIFVRLMDRR----ELTAKLIGSDE 146

Query: 167 EYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS 226
           + DLAVLKVE    +L  + LG S  L+VG+   AIG+PFG+E T+TAG++S  GR +P+
Sbjct: 147 KSDLAVLKVEAD--DLPVLNLGKSSELKVGEWVVAIGSPFGFEYTVTAGIVSAKGRSLPN 206

Query: 227 PNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 286
            N       IQTD AI+ GNSGGPL +  G V+G+N+  +TR G  M  GV+FAIPID  
Sbjct: 207 ENYVPF---IQTDVAINPGNSGGPLFNLEGEVVGINSQIYTRSGGFM--GVSFAIPIDVA 249

Query: 287 V 288
           +
Sbjct: 267 L 249

BLAST of MC03g0668 vs. ExPASy Swiss-Prot
Match: Q4KGQ4 (Probable periplasmic serine endoprotease DegP-like OS=Pseudomonas fluorescens (strain ATCC BAA-477 / NRRL B-23932 / Pf-5) OX=220664 GN=mucD PE=3 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 4.4e-28
Identity = 73/185 (39.46%), Positives = 109/185 (58.92%), Query Frame = 0

Query: 103 DENVKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIV 162
           D   + +  GSGF+    G+I+TN HV+       +      V L D        +AK++
Sbjct: 91  DRQREAQSLGSGFIISADGYILTNNHVI-------ADADEILVRLADRS----ELKAKLI 150

Query: 163 GFDPEYDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGR 222
           G DP  D+A+LK++  G +L  + LG S++L+ GQ   AIG+PFG++ T+T G++S +GR
Sbjct: 151 GTDPRSDVALLKID--GKDLPVLKLGKSQDLKAGQWVVAIGSPFGFDHTVTQGIVSAIGR 210

Query: 223 EIPSPNGRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIP 282
            +P+ N       IQTD  I+ GNSGGPL +  G V+G+N+  +TR G  M  GV+FAIP
Sbjct: 211 SLPNENYVPF---IQTDVPINPGNSGGPLFNLAGEVVGINSQIYTRSGGFM--GVSFAIP 257

Query: 283 IDTVV 288
           ID  +
Sbjct: 271 IDVAM 257

BLAST of MC03g0668 vs. NCBI nr
Match: XP_022140016.1 (protease Do-like 5, chloroplastic isoform X2 [Momordica charantia])

HSP 1 Score: 599 bits (1545), Expect = 5.96e-216
Identity = 304/304 (100.00%), Positives = 304/304 (100.00%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ
Sbjct: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120
           VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF
Sbjct: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180
           GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180

Query: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240
           ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA
Sbjct: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240

Query: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 304

BLAST of MC03g0668 vs. NCBI nr
Match: XP_022140015.1 (protease Do-like 5, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 561 bits (1447), Expect = 5.56e-201
Identity = 288/293 (98.29%), Positives = 290/293 (98.98%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ
Sbjct: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120
           VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF
Sbjct: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180
           GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180

Query: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240
           ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA
Sbjct: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240

Query: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYL 293
           AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV  T P++
Sbjct: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV-GTQPFM 292

BLAST of MC03g0668 vs. NCBI nr
Match: XP_022927238.1 (protease Do-like 5, chloroplastic isoform X1 [Cucurbita moschata])

HSP 1 Score: 530 bits (1366), Expect = 1.18e-188
Identity = 267/305 (87.54%), Positives = 286/305 (93.77%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MAL SLGI LLP+ +PPNSS  SLPFTSRRA+VFAP+ALMASLLAFP+P+ AALPQ+Q +
Sbjct: 1   MALGSLGIRLLPVSSPPNSSEISLPFTSRRAIVFAPTALMASLLAFPVPSFAALPQLQDE 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVED-ENVKVKGTGSGFVWDK 120
           VPQEEDR+V LFQ+ SPSVVYIK+LE+AKKPQN SEEA+L+ED EN KVKGTGSGF+WDK
Sbjct: 61  VPQEEDRIVGLFQETSPSVVYIKNLEIAKKPQNPSEEAMLIEDDENAKVKGTGSGFIWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 180
           FGHIVTNYHVVSALATDNSGLQRCKVNLVD KGNGI R+AKIVGFDPEYDLAVLKVEL G
Sbjct: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDVKGNGISRDAKIVGFDPEYDLAVLKVELEG 180

Query: 181 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 240
            ELKPIV GTSR+LRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTD
Sbjct: 181 HELKPIVFGTSRSLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAIS+GNSGGPL+D YGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of MC03g0668 vs. NCBI nr
Match: XP_038893976.1 (protease Do-like 5, chloroplastic [Benincasa hispida])

HSP 1 Score: 528 bits (1360), Expect = 7.78e-188
Identity = 266/304 (87.50%), Positives = 281/304 (92.43%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MAL SLGI LLPI +PPNS+ N LP TSRRA+VFAP+ALMASLLAFP+PT+AALPQ+Q  
Sbjct: 1   MALGSLGIRLLPISSPPNSAENPLPITSRRAIVFAPTALMASLLAFPVPTYAALPQLQDD 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120
           +PQEEDR+VALFQ+ SPSVVYIKDLE+AK PQN S E     DEN KVKGTGSGFVWDKF
Sbjct: 61  IPQEEDRIVALFQETSPSVVYIKDLEVAKNPQNPSGE-----DENAKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180
           GHIVTNYHVVSALATDNSGLQRCKVNLVD KGNGIY+EAKIVGFDPEYDLAVLKVEL G 
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180

Query: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240
           ELKPIV GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDA
Sbjct: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240

Query: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV+RTVPYLIVYGTPY
Sbjct: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVIRTVPYLIVYGTPY 299

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 299

BLAST of MC03g0668 vs. NCBI nr
Match: XP_008444456.1 (PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo] >KAA0053993.1 protease Do-like 5 [Cucumis melo var. makuwa] >TYK20678.1 protease Do-like 5 [Cucumis melo var. makuwa])

HSP 1 Score: 528 bits (1360), Expect = 9.69e-188
Identity = 268/305 (87.87%), Positives = 283/305 (92.79%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MAL  LGI  LPIPAPPNSS N LPFTSRRA++F+P+ALMASLLAFPLPT AALPQ+Q  
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-LLVEDENVKVKGTGSGFVWDK 120
           + QEEDR+V+LFQ+ SPSVVYIKDLELAK PQN SEE  +L+ED+NVKVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 180
           FGHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EA IVGFDPEYDLAVLKVEL G
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 240
            ELKPIV GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of MC03g0668 vs. ExPASy TrEMBL
Match: A0A6J1CEE6 (protease Do-like 5, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=LOC111010776 PE=3 SV=1)

HSP 1 Score: 599 bits (1545), Expect = 2.89e-216
Identity = 304/304 (100.00%), Positives = 304/304 (100.00%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ
Sbjct: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120
           VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF
Sbjct: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180
           GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180

Query: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240
           ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA
Sbjct: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240

Query: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 304

BLAST of MC03g0668 vs. ExPASy TrEMBL
Match: A0A6J1CDW0 (protease Do-like 5, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010776 PE=3 SV=1)

HSP 1 Score: 561 bits (1447), Expect = 2.69e-201
Identity = 288/293 (98.29%), Positives = 290/293 (98.98%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ
Sbjct: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120
           VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF
Sbjct: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180
           GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180

Query: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240
           ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA
Sbjct: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240

Query: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYL 293
           AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV  T P++
Sbjct: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV-GTQPFM 292

BLAST of MC03g0668 vs. ExPASy TrEMBL
Match: A0A6J1EKF8 (protease Do-like 5, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111434145 PE=3 SV=1)

HSP 1 Score: 530 bits (1366), Expect = 5.72e-189
Identity = 267/305 (87.54%), Positives = 286/305 (93.77%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MAL SLGI LLP+ +PPNSS  SLPFTSRRA+VFAP+ALMASLLAFP+P+ AALPQ+Q +
Sbjct: 1   MALGSLGIRLLPVSSPPNSSEISLPFTSRRAIVFAPTALMASLLAFPVPSFAALPQLQDE 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVED-ENVKVKGTGSGFVWDK 120
           VPQEEDR+V LFQ+ SPSVVYIK+LE+AKKPQN SEEA+L+ED EN KVKGTGSGF+WDK
Sbjct: 61  VPQEEDRIVGLFQETSPSVVYIKNLEIAKKPQNPSEEAMLIEDDENAKVKGTGSGFIWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 180
           FGHIVTNYHVVSALATDNSGLQRCKVNLVD KGNGI R+AKIVGFDPEYDLAVLKVEL G
Sbjct: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDVKGNGISRDAKIVGFDPEYDLAVLKVELEG 180

Query: 181 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 240
            ELKPIV GTSR+LRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTD
Sbjct: 181 HELKPIVFGTSRSLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAIS+GNSGGPL+D YGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of MC03g0668 vs. ExPASy TrEMBL
Match: A0A5D3DAW0 (Protease Do-like 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G00660 PE=3 SV=1)

HSP 1 Score: 528 bits (1360), Expect = 4.69e-188
Identity = 268/305 (87.87%), Positives = 283/305 (92.79%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MAL  LGI  LPIPAPPNSS N LPFTSRRA++F+P+ALMASLLAFPLPT AALPQ+Q  
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-LLVEDENVKVKGTGSGFVWDK 120
           + QEEDR+V+LFQ+ SPSVVYIKDLELAK PQN SEE  +L+ED+NVKVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 180
           FGHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EA IVGFDPEYDLAVLKVEL G
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 240
            ELKPIV GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of MC03g0668 vs. ExPASy TrEMBL
Match: A0A1S3B9W5 (protease Do-like 5, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487774 PE=3 SV=1)

HSP 1 Score: 528 bits (1360), Expect = 4.69e-188
Identity = 268/305 (87.87%), Positives = 283/305 (92.79%), Query Frame = 0

Query: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60
           MAL  LGI  LPIPAPPNSS N LPFTSRRA++F+P+ALMASLLAFPLPT AALPQ+Q  
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEA-LLVEDENVKVKGTGSGFVWDK 120
           + QEEDR+V+LFQ+ SPSVVYIKDLELAK PQN SEE  +L+ED+NVKVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 180
           FGHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EA IVGFDPEYDLAVLKVEL G
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 240
            ELKPIV GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of MC03g0668 vs. TAIR 10
Match: AT4G18370.1 (DEGP protease 5)

HSP 1 Score: 360.9 bits (925), Expect = 9.9e-100
Identity = 190/295 (64.41%), Positives = 227/295 (76.95%), Query Frame = 0

Query: 19  SSHNSLPFTSRRALVFAPS-ALMASLLA-----FPLPTHAALPQI---QAQVPQEEDRVV 78
           S+H  +    RR ++F  S AL +SLL       P+ +  AL Q    + ++ +EE+R V
Sbjct: 33  SNHVDVIDRRRRIMIFGSSLALTSSLLGSNQQRLPMESAIALEQFKEKEEELEEEEERNV 92

Query: 79  ALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFGHIVTNYHV 138
            LFQ  SPSVVYI+ +EL K    +S   +L ++EN K++GTGSGFVWDK GHIVTNYHV
Sbjct: 93  NLFQKTSPSVVYIEAIELPK----TSSGDILTDEENGKIEGTGSGFVWDKLGHIVTNYHV 152

Query: 139 VSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGT 198
           ++ LATD  GLQRCKV+LVDAKG    +E KIVG DP+ DLAVLK+E  G EL P+VLGT
Sbjct: 153 IAKLATDQFGLQRCKVSLVDAKGTRFSKEGKIVGLDPDNDLAVLKIETEGRELNPVVLGT 212

Query: 199 SRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDAAISSGNSGG 258
           S +LRVGQSC+AIGNP+GYE TLT GV+SGLGREIPSPNG++I   IQTDA I+SGNSGG
Sbjct: 213 SNDLRVGQSCFAIGNPYGYENTLTIGVVSGLGREIPSPNGKSISEAIQTDADINSGNSGG 272

Query: 259 PLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           PL+DSYGH IGVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Sbjct: 273 PLLDSYGHTIGVNTATFTRKGSGMSSGVNFAIPIDTVVRTVPYLIVYGTAYRDRF 323

BLAST of MC03g0668 vs. TAIR 10
Match: AT5G39830.1 (Trypsin family protein with PDZ domain)

HSP 1 Score: 197.6 bits (501), Expect = 1.5e-50
Identity = 134/297 (45.12%), Positives = 180/297 (60.61%), Query Frame = 0

Query: 8   IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------ 67
           +H L + + P+++   L  +    L F PS  + S LA   P+ A +  +   V      
Sbjct: 54  LHELAVKSVPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPL 113

Query: 68  PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFG 127
              E R+V LF+  + SVV I D+ L  +PQ      + + +      G GSG VWD  G
Sbjct: 114 FPTEGRIVQLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQG 173

Query: 128 HIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 187
           +IVTNYHV+ +AL+ + S G    +VN++ + G     E K+VG D   DLAVLKV+   
Sbjct: 174 YIVTNYHVIGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPE 233

Query: 188 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 247
             LKPI +G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I GGIQTD
Sbjct: 234 TLLKPIKVGQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTD 293

Query: 248 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           AAI+ GNSGGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Sbjct: 294 AAINPGNSGGPLLDSKGNLIGINTAIFTQ--TGTSAGVGFAIPSSTVLKIVPQLIQF 339

BLAST of MC03g0668 vs. TAIR 10
Match: AT3G27925.1 (DegP protease 1)

HSP 1 Score: 188.0 bits (476), Expect = 1.2e-47
Identity = 124/287 (43.21%), Positives = 173/287 (60.28%), Query Frame = 0

Query: 22  NSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQA----------QVPQEEDRVVAL 81
           ++L FT   A+   P  L+ + +A      AA P +++          ++  +E   V L
Sbjct: 67  DTLHFTPFSAV--KPFFLLCTSVALSFSLFAASPAVESASAFVVSTPKKLQTDELATVRL 126

Query: 82  FQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFGHIVTNYHVVS 141
           FQ+ +PSVVYI +L +        ++A  ++   V  +G+GSGFVWDK GHIVTNYHV+ 
Sbjct: 127 FQENTPSVVYITNLAV-------RQDAFTLDVLEVP-QGSGSGFVWDKQGHIVTNYHVI- 186

Query: 142 ALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGCELKPIVLGTSR 201
                  G    +V L D        +AK+VGFD + D+AVL+++    +L+PI +G S 
Sbjct: 187 ------RGASDLRVTLADQ----TTFDAKVVGFDQDKDVAVLRIDAPKNKLRPIPVGVSA 246

Query: 202 NLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS-PNGRAIRGGIQTDAAISSGNSGGP 261
           +L VGQ  +AIGNPFG + TLT GVISGL REI S   GR I+  IQTDAAI+ GNSGGP
Sbjct: 247 DLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPGNSGGP 306

Query: 262 LIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           L+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Sbjct: 307 LLDSSGTLIGINTAIYS--PSGASSGVGFSIPVDTVGGIVDQLVRFG 330

BLAST of MC03g0668 vs. TAIR 10
Match: AT5G39830.2 (Trypsin family protein with PDZ domain)

HSP 1 Score: 175.3 bits (443), Expect = 7.7e-44
Identity = 125/297 (42.09%), Positives = 170/297 (57.24%), Query Frame = 0

Query: 8   IHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQV------ 67
           +H L + + P+++   L  +    L F PS  + S LA   P+ A +  +   V      
Sbjct: 54  LHELAVKSVPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPL 113

Query: 68  PQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKFG 127
              E R+V LF+  + SVV I D+ L  +PQ      + + +      G GSG VWD  G
Sbjct: 114 FPTEGRIVQLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQG 173

Query: 128 HIVTNYHVV-SALATDNS-GLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGG 187
           +IVTNYHV+ +AL+ + S G    +VN++ + G     E K+VG D   DLAVLKV+   
Sbjct: 174 YIVTNYHVIGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPE 233

Query: 188 CELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTD 247
             LKPI +G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I GGIQTD
Sbjct: 234 TLLKPIKVGQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTD 293

Query: 248 AAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           AAI+ GNSGGPL+DS G++IG+NTA FT+                TV++ VP LI +
Sbjct: 294 AAINPGNSGGPLLDSKGNLIGINTAIFTQ----------------TVLKIVPQLIQF 325

BLAST of MC03g0668 vs. TAIR 10
Match: AT5G27660.1 (Trypsin family protein with PDZ domain)

HSP 1 Score: 85.9 bits (211), Expect = 6.2e-17
Identity = 62/186 (33.33%), Positives = 95/186 (51.08%), Query Frame = 0

Query: 109 KGTGSGFVWDKFGHIVTNYHVVSALATDNSGLQRCKVNLVDAK-GNGIYREAKIVGFDPE 168
           K  GSG + D  G I+T  HVV     D   ++      VD    +G   E  +V  D +
Sbjct: 146 KSIGSGTIIDADGTILTCAHVV----VDFQNIRHSSKGRVDVTLQDGRTFEGVVVNADLQ 205

Query: 169 YDLAVLKVELGGCELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSP 228
            D+A++K++     L    LG S  LR G    A+G P   + T+TAG++S + R+    
Sbjct: 206 SDIALVKIK-SKTPLPTAKLGFSSKLRPGDWVIAVGCPLSLQNTVTAGIVSCVDRKSSDL 265

Query: 229 N-GRAIRGGIQTDAAISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 288
             G   R  +QTD +I++GNSGGPL++  G VIGVN           + G+ F++PID+V
Sbjct: 266 GLGGKHREYLQTDCSINAGNSGGPLVNLDGEVIGVNIMKVL-----AADGLGFSVPIDSV 321

Query: 289 VRTVPY 293
            + + +
Sbjct: 326 SKIIEH 321

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9SEL7 | 1.4e-98 | 64.41 | Protease Do-like 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP5 PE=1 ...
Q9LU10 | 2.0e-49 | 45.12 | Protease Do-like 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP8 PE=1 ...
O22609 | 1.6e-46 | 43.21 | Protease Do-like 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP1 PE=1 ...
Q2SL36 | 2.1e-30 | 44.75 | Probable periplasmic serine endoprotease DegP-like OS=Hahella chejuensis (strain...
Q4KGQ4 | 4.4e-28 | 39.46 | Probable periplasmic serine endoprotease DegP-like OS=Pseudomonas fluorescens (s...

Match Name | E-value | Identity | Description
XP_022140016.1 | 5.96e-216 | 100.00 | protease Do-like 5, chloroplastic isoform X2 [Momordica charantia]
XP_022140015.1 | 5.56e-201 | 98.29 | protease Do-like 5, chloroplastic isoform X1 [Momordica charantia]
XP_022927238.1 | 1.18e-188 | 87.54 | protease Do-like 5, chloroplastic isoform X1 [Cucurbita moschata]
XP_038893976.1 | 7.78e-188 | 87.50 | protease Do-like 5, chloroplastic [Benincasa hispida]
XP_008444456.1 | 9.69e-188 | 87.87 | PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo] >KAA0053993.1 protea...

Match Name | E-value | Identity | Description
A0A6J1CEE6 | 2.89e-216 | 100.00 | protease Do-like 5, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=L...
A0A6J1CDW0 | 2.69e-201 | 98.29 | protease Do-like 5, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=L...
A0A6J1EKF8 | 5.72e-189 | 87.54 | protease Do-like 5, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LO...
A0A5D3DAW0 | 4.69e-188 | 87.87 | Protease Do-like 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G0...
A0A1S3B9W5 | 4.69e-188 | 87.87 | protease Do-like 5, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487774 PE=3 S...

Match Name | E-value | Identity | Description
AT4G18370.1 | 9.9e-100 | 64.41 | DEGP protease 5
AT5G39830.1 | 1.5e-50 | 45.12 | Trypsin family protein with PDZ domain
AT3G27925.1 | 1.2e-47 | 43.21 | DegP protease 1
AT5G39830.2 | 7.7e-44 | 42.09 | Trypsin family protein with PDZ domain
AT5G27660.1 | 6.2e-17 | 33.33 | Trypsin family protein with PDZ domain

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001940 | Peptidase S1C | PRINTS | PR00834 | PROTEASES2C | coord: 195..219, score: 48.88; coord: 254..271, score: 37.67; coord: 232..249, score: 62.33; coord: 121..133, score: 50.47
IPR043504 | Peptidase S1, PA clan, chymotrypsin-like fold | GENE3D | 2.40.10.10 | - | coord: 186..298, e-value: 1.6E-39, score: 136.6
IPR043504 | Peptidase S1, PA clan, chymotrypsin-like fold | GENE3D | 2.40.10.10 | - | coord: 60..185, e-value: 1.2E-30, score: 107.7
None | No IPR available | PFAM | PF13365 | Trypsin_2 | coord: 112..261, e-value: 1.6E-33, score: 116.8
None | No IPR available | PANTHER | PTHR43343:SF6 | PROTEASE DO-LIKE 5, CHLOROPLASTIC ISOFORM X1 | coord: 18..304
None | No IPR available | PANTHER | PTHR43343 | PEPTIDASE S12 | coord: 18..304
IPR009003 | Peptidase S1, PA clan | SUPERFAMILY | 50494 | Trypsin-like serine proteases | coord: 70..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC03g0668.1 | MC03g0668.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0043231 | intracellular membrane-bounded organelle
molecular_function | GO:0004252 | serine-type endopeptidase activity