CsGy1G004540 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy1G004540
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: protease Do-like 5, chloroplastic
Location: Gy14Chr1: 2897499 .. 2900138 (+)
RNA-Seq Expression: CsGy1G004540
Synteny: CsGy1G004540
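The Location field packs chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed and the genomic span computed follows. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive (as in GFF3), which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (GFF3-style).
    return end - start + 1

chrom, start, end, strand = parse_location("Gy14Chr1: 2897499 .. 2900138 (+)")
length = span_length(start, end)
print(chrom, strand, length)
```

Under the inclusive-coordinate assumption the gene spans 2640 bp on the plus strand of Gy14Chr1.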
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCAACTGGCAAGGAAGAAGATAATGTGATTTTGGTACACCTAGTTGAAGAAGATTGATTGAATCGAAGCTAAGTGGAGAAAACATCATCCTTCTATGGCGTTGGGCTTACTGGGAATTCCTCCTCTTCCATTTCCAGCTCCCCCAAATTCCTCTCAAAATCCCTTACCCTTCACTTCCCGAAGAGCCATTCTGTTTTCCCCAGCTGCTTTACTCCCTTCCCTCCTCGCTTTTCCTCTTCCCACTCACGCCGCTCTTCCCCAACTCCAAGACCACCTTCTACAAGAAGAAGATAGAACCGTTTCCCTTTTCCAGGTCCCTCCTCTCTTTCTCTTACAATCCTTCTTTTTCAATTCAATTTGGGTCTTTCCTTTATCTCTTTTCTTCTGTTGGTTCTTCATTGCAATCTAGGAAACTTCGCCTTCCGTCGTTTATATTAACGACCTTGAATTACCTAAAAACCCTCAAGCCCCTTCTCAACAACCCATGCTTATCGAGGATGATAATCTCAAGGTTAAAGGGACTGGTTCGGGTTTTGTTTGGGATAAATTTGGCCATATCGTAAGACCATCTTCTTTGATTCTTGCCATCTCTTTTTTTTTTCTTTTTTTTTCACCTTTTTTTTGTTCGTTACTTGAAGCATTTCTGCTTGCGACAGGTTACCAATTACCATGTTGTTTCTGCATTGGCTACTGATAATAGTGGATCACAGCGTTGTAAGGTTTTGTTCGTTCGTTTCTGTGTTTTGGATTGTTGTGCATTCTTAGTGGATGATATGCTAGCAGTCCCTAGCTATGATTCCTCGGATTTGGGTTCTTCTGTAGGTAAACTTAGTCGATGTTAAAGGAAACGGAATCTATAAAGAGGCAAAGATTGTTGGTTTTGATCCAGAGTATGATCTAGCTGTTCTTAAGGTAATATAGCAAGCTTGCTTGTATTTTTCATTGTTTCGAGGAGAATGCCAGATTGAGTGAATCGTGTTTTGGTGATCTTCATACCATCTTCTATTCCAAATATAGTTTTGCAAATACGGCAGACTGGACAAGCTATATTGTCAAACTCTTATATTGTACCATTTATTTGCTGAGTTAGAAATTTCCTTCATTCAGGTGGAACTTGAAGGACATGAACTAAAGCCCATAGTCTTCGGTACCTCTCGAAATCTACGTGTTGGTCAAAGCTGCTATGCCATTGGCAACCCTTTTGGGTATGAGAAGACACTAACAGCAGGGGTAAGAAATCAATAGTGTCATTTCCTCCATTTATTTTAAGGAAACCCACTTTGAATTTTCCATTTCAAAGGTAAAACTAAATTCCAAAGTGAAATACTTCTCTCTTACAATAAACGCATTAGTCATAGAAGGTATGGTATCTCTCATCTCACACTTGTATGGTTTAAATAAAGCGAATGAAGTAAGCGTTTCTCCAAGTGTCACAATCTATTGGTGTTGACTTCCAAGTAAAACCGAGTCATATAGCAGATTGACTAAATAATTGAGAACCGTAAATCATTCAGGGTTTGGATAGCAGGTGATCAGCGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATAAGGGGAGCTATTCAGACAGATGCTGCTATTAGTGCAGGTTCATGTTTCAACTACCTGAAATAAGACTTTGCCATTTCAATCACTGACATCTTTTCGTGCTCTTTCTTCTTCCTCTTTTCAAACTTATTTTACTTTCATCATTCACATCCCAAGTGTAAGCCTTATAGTTACTTATTCCACTGATCTTTATTTTTTTATACTATGGAGCAGTTAAATTCCATGTACATTATTTATACCAGCTTATTTTGATTTTTCCATTCCATCTCTGGTTCATTGCTGCCCAAGTCATCCATTATAATAACTATTTCTTAAATTGCCCACATTTTGATGCATAAAACATTTTACAGGAAATTCAGGGGGGCCTTTGGTTGACTCGTACGGTCATGTTATTGGAGTCAACACAGCAACTTTCACTCGCA
AAGGTGAAGGATCTAAATTTCTTCATGTGCCATAGTTTGTCAGAGGAAATATGAAATGCATAGTGTTTCCCGCTTTTGCTTTAAATGTGGCTGATTTTTTTTGGCGGTGCTTATTAGTTATTAATTATCAAAGCTAACTCAACCTTTTATGGCCACTTGAAAGAACAGTGTTCAGAGTAGTAAAGAAAGAAAAAACGAACAAATAGTTGTATTTTTTTTTGTTTGTTTATAGAGTTCAATGTTTGTGTTATTTCATGTTTGGTTATGCATTGCAGGAACTGGAATGTCTTCTGGTGTCAACTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATACCTTATTGTTTATGGAACACCTTACAGTGAGAGATTTTGAACAAAATTTAACTGTCTCAAGGGAACACAGCAAACTCCTCTCCCCAAGACCATATACCACATTGTAAATGACATTTTGCTAATTGAAACGAATTTGCTCTCTCAAGTGTAATATCCTCGAGCATTCAACATCTTGGTCTTGATTGTGAATAGTTATTTGCTGTGTGTTATTCTTCTCTCTTTGTAGGGTAATTGTATTATTGAAGTAGTGTTGATAAAATACAATGGAAAGAGTTTGAATGATGGATGGTTTGCTTAC

mRNA sequence

TCTCAACTGGCAAGGAAGAAGATAATGTGATTTTGGTACACCTAGTTGAAGAAGATTGATTGAATCGAAGCTAAGTGGAGAAAACATCATCCTTCTATGGCGTTGGGCTTACTGGGAATTCCTCCTCTTCCATTTCCAGCTCCCCCAAATTCCTCTCAAAATCCCTTACCCTTCACTTCCCGAAGAGCCATTCTGTTTTCCCCAGCTGCTTTACTCCCTTCCCTCCTCGCTTTTCCTCTTCCCACTCACGCCGCTCTTCCCCAACTCCAAGACCACCTTCTACAAGAAGAAGATAGAACCGTTTCCCTTTTCCAGGAAACTTCGCCTTCCGTCGTTTATATTAACGACCTTGAATTACCTAAAAACCCTCAAGCCCCTTCTCAACAACCCATGCTTATCGAGGATGATAATCTCAAGGTTAAAGGGACTGGTTCGGGTTTTGTTTGGGATAAATTTGGCCATATCGTTACCAATTACCATGTTGTTTCTGCATTGGCTACTGATAATAGTGGATCACAGCGTTGTAAGGTAAACTTAGTCGATGTTAAAGGAAACGGAATCTATAAAGAGGCAAAGATTGTTGGTTTTGATCCAGAGTATGATCTAGCTGTTCTTAAGGTGGAACTTGAAGGACATGAACTAAAGCCCATAGTCTTCGGTACCTCTCGAAATCTACGTGTTGGTCAAAGCTGCTATGCCATTGGCAACCCTTTTGGGTATGAGAAGACACTAACAGCAGGGGTGATCAGCGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATAAGGGGAGCTATTCAGACAGATGCTGCTATTAGTGCAGGAAATTCAGGGGGGCCTTTGGTTGACTCGTACGGTCATGTTATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTCAACTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATACCTTATTGTTTATGGAACACCTTACAGTGAGAGATTTTGAACAAAATTTAACTGTCTCAAGGGAACACAGCAAACTCCTCTCCCCAAGACCATATACCACATTGTAAATGACATTTTGCTAATTGAAACGAATTTGCTCTCTCAAGTGTAATATCCTCGAGCATTCAACATCTTGGTCTTGATTGTGAATAGTTATTTGCTGTGTGTTATTCTTCTCTCTTTGTAGGGTAATTGTATTATTGAAGTAGTGTTGATAAAATACAATGGAAAGAGTTTGAATGATGGATGGTTTGCTTAC

Coding sequence (CDS)

ATGGCGTTGGGCTTACTGGGAATTCCTCCTCTTCCATTTCCAGCTCCCCCAAATTCCTCTCAAAATCCCTTACCCTTCACTTCCCGAAGAGCCATTCTGTTTTCCCCAGCTGCTTTACTCCCTTCCCTCCTCGCTTTTCCTCTTCCCACTCACGCCGCTCTTCCCCAACTCCAAGACCACCTTCTACAAGAAGAAGATAGAACCGTTTCCCTTTTCCAGGAAACTTCGCCTTCCGTCGTTTATATTAACGACCTTGAATTACCTAAAAACCCTCAAGCCCCTTCTCAACAACCCATGCTTATCGAGGATGATAATCTCAAGGTTAAAGGGACTGGTTCGGGTTTTGTTTGGGATAAATTTGGCCATATCGTTACCAATTACCATGTTGTTTCTGCATTGGCTACTGATAATAGTGGATCACAGCGTTGTAAGGTAAACTTAGTCGATGTTAAAGGAAACGGAATCTATAAAGAGGCAAAGATTGTTGGTTTTGATCCAGAGTATGATCTAGCTGTTCTTAAGGTGGAACTTGAAGGACATGAACTAAAGCCCATAGTCTTCGGTACCTCTCGAAATCTACGTGTTGGTCAAAGCTGCTATGCCATTGGCAACCCTTTTGGGTATGAGAAGACACTAACAGCAGGGGTGATCAGCGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATAAGGGGAGCTATTCAGACAGATGCTGCTATTAGTGCAGGAAATTCAGGGGGGCCTTTGGTTGACTCGTACGGTCATGTTATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTCAACTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATACCTTATTGTTTATGGAACACCTTACAGTGAGAGATTTTGA

Protein sequence

MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDHLLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF*
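The protein sequence above is the conceptual translation of the CDS, with `*` marking the stop codon. As a rough sketch of that relationship, the snippet below builds the standard genetic code table and translates the first and last few codons of the CDS. The function name `translate` is hypothetical, and the code assumes the standard nuclear genetic code and an in-frame input whose length is a multiple of three.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First ten and last four codons of the CDS listed above.
head = translate("ATGGCGTTGGGCTTACTGGGAATTCCTCCT")
tail = translate("GAGAGATTTTGA")
print(head, tail)
```

The two fragments come out as `MALGLLGIPP` and `ERF*`, matching the start and end of the protein sequence on this page.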
Homology
BLAST of CsGy1G004540 vs. ExPASy Swiss-Prot
Match: Q9SEL7 (Protease Do-like 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP5 PE=1 SV=3)

HSP 1 Score: 366.3 bits (939), Expect = 3.3e-100
Identity = 189/285 (66.32%), Positives = 223/285 (78.25%), Query Frame = 0

Query: 29  RRAILF-SPAALLPSLLA-----FPLPTHAALPQL---QDHLLQEEDRTVSLFQETSPSV 88
           RR ++F S  AL  SLL       P+ +  AL Q    ++ L +EE+R V+LFQ+TSPSV
Sbjct: 43  RRIMIFGSSLALTSSLLGSNQQRLPMESAIALEQFKEKEEELEEEEERNVNLFQKTSPSV 102

Query: 89  VYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG 148
           VYI  +ELPK     S   +L +++N K++GTGSGFVWDK GHIVTNYHV++ LATD  G
Sbjct: 103 VYIEAIELPKT----SSGDILTDEENGKIEGTGSGFVWDKLGHIVTNYHVIAKLATDQFG 162

Query: 149 SQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSC 208
            QRCKV+LVD KG    KE KIVG DP+ DLAVLK+E EG EL P+V GTS +LRVGQSC
Sbjct: 163 LQRCKVSLVDAKGTRFSKEGKIVGLDPDNDLAVLKIETEGRELNPVVLGTSNDLRVGQSC 222

Query: 209 YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVI 268
           +AIGNP+GYE TLT GV+SGLGREIPSPNG++I  AIQTDA I++GNSGGPL+DSYGH I
Sbjct: 223 FAIGNPYGYENTLTIGVVSGLGREIPSPNGKSISEAIQTDADINSGNSGGPLLDSYGHTI 282

Query: 269 GVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           GVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Sbjct: 283 GVNTATFTRKGSGMSSGVNFAIPIDTVVRTVPYLIVYGTAYRDRF 323
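Each HSP header reports a bit score, a raw score, an E-value, and identity and positive counts over the alignment length. The following sketch shows one way those two header lines could be parsed and the percentages recomputed. The function `parse_hsp` and the regular expressions are illustrative assumptions about this page's text layout, not part of any BLAST tool; the pattern accepts both the "Postives" spelling used in this output and "Positives".

```python
import re

# Patterns assume the two-line HSP header layout shown on this page.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_COUNTS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")

def parse_hsp(score_line, identity_line):
    """Extract scores and recompute identity/positive percentages."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError("score line not recognised")
    stats = {
        "bit_score": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, num, den in HSP_COUNTS.findall(identity_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        stats[key] = round(100 * int(num) / int(den), 2)
    return stats

stats = parse_hsp(
    "HSP 1 Score: 366.3 bits (939), Expect = 3.3e-100",
    "Identity = 189/285 (66.32%), Postives = 223/285 (78.25%), Query Frame = 0",
)
print(stats)
```

For the Q9SEL7 hit above, the recomputed values (66.32% identity, 78.25% positives) agree with the percentages printed in the header.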

BLAST of CsGy1G004540 vs. ExPASy Swiss-Prot
Match: O22609 (Protease Do-like 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP1 PE=1 SV=2)

HSP 1 Score: 196.4 bits (498), Expect = 4.6e-49
Identity = 131/292 (44.86%), Positives = 173/292 (59.25%), Query Frame = 0

Query: 18  NSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQD----------HLLQEEDR 77
           +   + L FT   A+   P  LL + +A      AA P ++            L  +E  
Sbjct: 63  DDDDDTLHFTPFSAV--KPFFLLCTSVALSFSLFAASPAVESASAFVVSTPKKLQTDELA 122

Query: 78  TVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKV-KGTGSGFVWDKFGHIVTN 137
           TV LFQE +PSVVYI +L       A  Q    +  D L+V +G+GSGFVWDK GHIVTN
Sbjct: 123 TVRLFQENTPSVVYITNL-------AVRQDAFTL--DVLEVPQGSGSGFVWDKQGHIVTN 182

Query: 138 YHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIV 197
           YHV+        G+   +V L D        +AK+VGFD + D+AVL+++   ++L+PI 
Sbjct: 183 YHVI-------RGASDLRVTLAD----QTTFDAKVVGFDQDKDVAVLRIDAPKNKLRPIP 242

Query: 198 FGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS-PNGRAIRGAIQTDAAISAG 257
            G S +L VGQ  +AIGNPFG + TLT GVISGL REI S   GR I+  IQTDAAI+ G
Sbjct: 243 VGVSADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPG 302

Query: 258 NSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           NSGGPL+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Sbjct: 303 NSGGPLLDSSGTLIGINTAIYS--PSGASSGVGFSIPVDTVGGIVDQLVRFG 330

BLAST of CsGy1G004540 vs. ExPASy Swiss-Prot
Match: Q9LU10 (Protease Do-like 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP8 PE=1 SV=1)

HSP 1 Score: 195.7 bits (496), Expect = 7.8e-49
Identity = 134/289 (46.37%), Positives = 173/289 (59.86%), Query Frame = 0

Query: 24  LPFTSRRAIL--------FSPAALLPSLLAFPLPTHAALPQLQ------DHLLQEEDRTV 83
           +P T+RR +L        F+P+  L S LA   P+ A +  +         L   E R V
Sbjct: 62  VPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPLFPTEGRIV 121

Query: 84  SLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHV 143
            LF++ + SVV I D+ L   PQ      + I +      G GSG VWD  G+IVTNYHV
Sbjct: 122 QLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQGYIVTNYHV 181

Query: 144 V-SALATDNS-GSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVF 203
           + +AL+ + S G    +VN++   G     E K+VG D   DLAVLKV+     LKPI  
Sbjct: 182 IGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPETLLKPIKV 241

Query: 204 GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNS 263
           G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I G IQTDAAI+ GNS
Sbjct: 242 GQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTDAAINPGNS 301

Query: 264 GGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           GGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Sbjct: 302 GGPLLDSKGNLIGINTAIFTQ--TGTSAGVGFAIPSSTVLKIVPQLIQF 339

BLAST of CsGy1G004540 vs. ExPASy Swiss-Prot
Match: Q2SL36 (Probable periplasmic serine endoprotease DegP-like OS=Hahella chejuensis (strain KCTC 2396) OX=349521 GN=mucD PE=3 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 4.8e-30
Identity = 80/181 (44.20%), Positives = 112/181 (61.88%), Query Frame = 0

Query: 107 KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDP 166
           + + TGSGF+  K G+I+TN HVV       +G+    V L+D +       AK++G D 
Sbjct: 87  EAQSTGSGFIVSKDGYILTNNHVV-------AGADEIFVRLMDRR----ELTAKLIGSDE 146

Query: 167 EYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS 226
           + DLAVLKVE +  +L  +  G S  L+VG+   AIG+PFG+E T+TAG++S  GR +P+
Sbjct: 147 KSDLAVLKVEAD--DLPVLNLGKSSELKVGEWVVAIGSPFGFEYTVTAGIVSAKGRSLPN 206

Query: 227 PNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 286
            N       IQTD AI+ GNSGGPL +  G V+G+N+  +TR G  M  GV+FAIPID  
Sbjct: 207 ENYVPF---IQTDVAINPGNSGGPLFNLEGEVVGINSQIYTRSGGFM--GVSFAIPIDVA 249

Query: 287 V 288
           +
Sbjct: 267 L 249

BLAST of CsGy1G004540 vs. ExPASy Swiss-Prot
Match: F6AA62 (Probable periplasmic serine endoprotease DegP-like OS=Pseudomonas fulva (strain 12-X) OX=743720 GN=Psefu_3239 PE=3 SV=1)

HSP 1 Score: 127.1 bits (318), Expect = 3.4e-28
Identity = 82/211 (38.86%), Positives = 119/211 (56.40%), Query Frame = 0

Query: 87  LPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVN 146
           +P+ P+AP       E  +L     GSGF+  K G+I+TN HVV       + +    V 
Sbjct: 82  IPQQPRAPGGGGRQREAQSL-----GSGFIISKDGYILTNNHVV-------ADADEIIVR 141

Query: 147 LVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPF 206
           L D        EAK++G DP  D+A+LKV  E ++L  +  G S NL+VG+   AIG+PF
Sbjct: 142 LSDRS----ELEAKLIGTDPRSDVALLKV--EANDLPTVKLGNSDNLKVGEWVLAIGSPF 201

Query: 207 GYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATF 266
           G++ ++TAG++S  GR +P+    +    IQTD AI+ GNSGGPL +  G V+G+N+  F
Sbjct: 202 GFDHSVTAGIVSAKGRSLPN---ESYVPFIQTDVAINPGNSGGPLFNLDGEVVGINSQIF 261

Query: 267 TRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           TR G  M  G++FAIP+   +     L   G
Sbjct: 262 TRSGGFM--GLSFAIPMSVAMDVADQLKASG 269

BLAST of CsGy1G004540 vs. NCBI nr
Match: XP_004137559.1 (protease Do-like 5, chloroplastic [Cucumis sativus] >KGN63906.1 hypothetical protein Csa_013941 [Cucumis sativus])

HSP 1 Score: 606 bits (1563), Expect = 1.12e-218
Identity = 304/304 (100.00%), Positives = 304/304 (100.00%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH
Sbjct: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF
Sbjct: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH
Sbjct: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA
Sbjct: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 304

BLAST of CsGy1G004540 vs. NCBI nr
Match: XP_008444456.1 (PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo] >KAA0053993.1 protease Do-like 5 [Cucumis melo var. makuwa] >TYK20678.1 protease Do-like 5 [Cucumis melo var. makuwa])

HSP 1 Score: 566 bits (1459), Expect = 8.28e-203
Identity = 288/305 (94.43%), Positives = 293/305 (96.07%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLP PAPPNSS+NPLPFTSRRAILFSP AL+ SLLAFPLPT AALPQLQD 
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQP-MLIEDDNLKVKGTGSGFVWDK 120
           LLQEEDR VSLFQETSPSVVYI DLEL KNPQ  S++P MLIEDDN+KVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEG 180
           FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEA IVGFDPEYDLAVLKVELEG
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240
           HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of CsGy1G004540 vs. NCBI nr
Match: XP_038893976.1 (protease Do-like 5, chloroplastic [Benincasa hispida])

HSP 1 Score: 530 bits (1365), Expect = 1.40e-188
Identity = 269/304 (88.49%), Positives = 282/304 (92.76%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALG LGI  LP  +PPNS++NPLP TSRRAI+F+P AL+ SLLAFP+PT+AALPQLQD 
Sbjct: 1   MALGSLGIRLLPISSPPNSAENPLPITSRRAIVFAPTALMASLLAFPVPTYAALPQLQDD 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           + QEEDR V+LFQETSPSVVYI DLE+ KNPQ PS      ED+N KVKGTGSGFVWDKF
Sbjct: 61  IPQEEDRIVALFQETSPSVVYIKDLEVAKNPQNPSG-----EDENAKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSG QRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA
Sbjct: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV+RTVPYLIVYGTPY
Sbjct: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVIRTVPYLIVYGTPY 299

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 299

BLAST of CsGy1G004540 vs. NCBI nr
Match: XP_022140016.1 (protease Do-like 5, chloroplastic isoform X2 [Momordica charantia])

HSP 1 Score: 522 bits (1344), Expect = 2.66e-185
Identity = 261/304 (85.86%), Positives = 279/304 (91.78%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MAL  LGI  LP PAPPNSS N LPFTSRRA++F+P+AL+ SLLAFPLPTHAALPQ+Q  
Sbjct: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           + QEEDR V+LFQ+ SPSVVYI DLEL K PQ  S++ +L+ED+N+KVKGTGSGFVWDKF
Sbjct: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EAKIVGFDPEYDLAVLKVEL G 
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIV GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDA
Sbjct: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 304

BLAST of CsGy1G004540 vs. NCBI nr
Match: XP_022927238.1 (protease Do-like 5, chloroplastic isoform X1 [Cucurbita moschata])

HSP 1 Score: 520 bits (1339), Expect = 1.60e-184
Identity = 265/305 (86.89%), Positives = 281/305 (92.13%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALG LGI  LP  +PPNSS+  LPFTSRRAI+F+P AL+ SLLAFP+P+ AALPQLQD 
Sbjct: 1   MALGSLGIRLLPVSSPPNSSEISLPFTSRRAIVFAPTALMASLLAFPVPSFAALPQLQDE 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDD-NLKVKGTGSGFVWDK 120
           + QEEDR V LFQETSPSVVYI +LE+ K PQ PS++ MLIEDD N KVKGTGSGF+WDK
Sbjct: 61  VPQEEDRIVGLFQETSPSVVYIKNLEIAKKPQNPSEEAMLIEDDENAKVKGTGSGFIWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEG 180
           FGHIVTNYHVVSALATDNSG QRCKVNLVDVKGNGI ++AKIVGFDPEYDLAVLKVELEG
Sbjct: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDVKGNGISRDAKIVGFDPEYDLAVLKVELEG 180

Query: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240
           HELKPIVFGTSR+LRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD
Sbjct: 181 HELKPIVFGTSRSLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAISAGNSGGPLVD YGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of CsGy1G004540 vs. ExPASy TrEMBL
Match: A0A0A0LV65 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G025870 PE=3 SV=1)

HSP 1 Score: 606 bits (1563), Expect = 5.42e-219
Identity = 304/304 (100.00%), Positives = 304/304 (100.00%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH
Sbjct: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF
Sbjct: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH
Sbjct: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA
Sbjct: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 304

BLAST of CsGy1G004540 vs. ExPASy TrEMBL
Match: A0A5D3DAW0 (Protease Do-like 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G00660 PE=3 SV=1)

HSP 1 Score: 566 bits (1459), Expect = 4.01e-203
Identity = 288/305 (94.43%), Positives = 293/305 (96.07%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLP PAPPNSS+NPLPFTSRRAILFSP AL+ SLLAFPLPT AALPQLQD 
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQP-MLIEDDNLKVKGTGSGFVWDK 120
           LLQEEDR VSLFQETSPSVVYI DLEL KNPQ  S++P MLIEDDN+KVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEG 180
           FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEA IVGFDPEYDLAVLKVELEG
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240
           HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of CsGy1G004540 vs. ExPASy TrEMBL
Match: A0A1S3B9W5 (protease Do-like 5, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487774 PE=3 SV=1)

HSP 1 Score: 566 bits (1459), Expect = 4.01e-203
Identity = 288/305 (94.43%), Positives = 293/305 (96.07%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLP PAPPNSS+NPLPFTSRRAILFSP AL+ SLLAFPLPT AALPQLQD 
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQP-MLIEDDNLKVKGTGSGFVWDK 120
           LLQEEDR VSLFQETSPSVVYI DLEL KNPQ  S++P MLIEDDN+KVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEG 180
           FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEA IVGFDPEYDLAVLKVELEG
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240
           HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of CsGy1G004540 vs. ExPASy TrEMBL
Match: A0A6J1CEE6 (protease Do-like 5, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=LOC111010776 PE=3 SV=1)

HSP 1 Score: 522 bits (1344), Expect = 1.29e-185
Identity = 261/304 (85.86%), Positives = 279/304 (91.78%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MAL  LGI  LP PAPPNSS N LPFTSRRA++F+P+AL+ SLLAFPLPTHAALPQ+Q  
Sbjct: 1   MALASLGIHLLPIPAPPNSSHNSLPFTSRRALVFAPSALMASLLAFPLPTHAALPQIQAQ 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           + QEEDR V+LFQ+ SPSVVYI DLEL K PQ  S++ +L+ED+N+KVKGTGSGFVWDKF
Sbjct: 61  VPQEEDRVVALFQDASPSVVYIKDLELAKKPQNSSEEALLVEDENVKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSG QRCKVNLVD KGNGIY+EAKIVGFDPEYDLAVLKVEL G 
Sbjct: 121 GHIVTNYHVVSALATDNSGLQRCKVNLVDAKGNGIYREAKIVGFDPEYDLAVLKVELGGC 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIV GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG IQTDA
Sbjct: 181 ELKPIVLGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGGIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AIS+GNSGGPL+DSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISSGNSGGPLIDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 304
           SERF
Sbjct: 301 SERF 304

BLAST of CsGy1G004540 vs. ExPASy TrEMBL
Match: A0A6J1EKF8 (protease Do-like 5, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111434145 PE=3 SV=1)

HSP 1 Score: 520 bits (1339), Expect = 7.73e-185
Identity = 265/305 (86.89%), Positives = 281/305 (92.13%), Query Frame = 0

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALG LGI  LP  +PPNSS+  LPFTSRRAI+F+P AL+ SLLAFP+P+ AALPQLQD 
Sbjct: 1   MALGSLGIRLLPVSSPPNSSEISLPFTSRRAIVFAPTALMASLLAFPVPSFAALPQLQDE 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDD-NLKVKGTGSGFVWDK 120
           + QEEDR V LFQETSPSVVYI +LE+ K PQ PS++ MLIEDD N KVKGTGSGF+WDK
Sbjct: 61  VPQEEDRIVGLFQETSPSVVYIKNLEIAKKPQNPSEEAMLIEDDENAKVKGTGSGFIWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEG 180
           FGHIVTNYHVVSALATDNSG QRCKVNLVDVKGNGI ++AKIVGFDPEYDLAVLKVELEG
Sbjct: 121 FGHIVTNYHVVSALATDNSGLQRCKVNLVDVKGNGISRDAKIVGFDPEYDLAVLKVELEG 180

Query: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240
           HELKPIVFGTSR+LRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD
Sbjct: 181 HELKPIVFGTSRSLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAISAGNSGGPLVD YGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDLYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 304
           YSERF
Sbjct: 301 YSERF 305

BLAST of CsGy1G004540 vs. TAIR 10
Match: AT4G18370.1 (DEGP protease 5 )

HSP 1 Score: 366.3 bits (939), Expect = 2.4e-101
Identity = 189/285 (66.32%), Positives = 223/285 (78.25%), Query Frame = 0

Query: 29  RRAILF-SPAALLPSLLA-----FPLPTHAALPQL---QDHLLQEEDRTVSLFQETSPSV 88
           RR ++F S  AL  SLL       P+ +  AL Q    ++ L +EE+R V+LFQ+TSPSV
Sbjct: 43  RRIMIFGSSLALTSSLLGSNQQRLPMESAIALEQFKEKEEELEEEEERNVNLFQKTSPSV 102

Query: 89  VYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG 148
           VYI  +ELPK     S   +L +++N K++GTGSGFVWDK GHIVTNYHV++ LATD  G
Sbjct: 103 VYIEAIELPKT----SSGDILTDEENGKIEGTGSGFVWDKLGHIVTNYHVIAKLATDQFG 162

Query: 149 SQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSC 208
            QRCKV+LVD KG    KE KIVG DP+ DLAVLK+E EG EL P+V GTS +LRVGQSC
Sbjct: 163 LQRCKVSLVDAKGTRFSKEGKIVGLDPDNDLAVLKIETEGRELNPVVLGTSNDLRVGQSC 222

Query: 209 YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVI 268
           +AIGNP+GYE TLT GV+SGLGREIPSPNG++I  AIQTDA I++GNSGGPL+DSYGH I
Sbjct: 223 FAIGNPYGYENTLTIGVVSGLGREIPSPNGKSISEAIQTDADINSGNSGGPLLDSYGHTI 282

Query: 269 GVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           GVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Sbjct: 283 GVNTATFTRKGSGMSSGVNFAIPIDTVVRTVPYLIVYGTAYRDRF 323

BLAST of CsGy1G004540 vs. TAIR 10
Match: AT3G27925.1 (DegP protease 1 )

HSP 1 Score: 196.4 bits (498), Expect = 3.2e-50
Identity = 131/292 (44.86%), Positives = 173/292 (59.25%), Query Frame = 0

Query: 18  NSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQD----------HLLQEEDR 77
           +   + L FT   A+   P  LL + +A      AA P ++            L  +E  
Sbjct: 63  DDDDDTLHFTPFSAV--KPFFLLCTSVALSFSLFAASPAVESASAFVVSTPKKLQTDELA 122

Query: 78  TVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKV-KGTGSGFVWDKFGHIVTN 137
           TV LFQE +PSVVYI +L       A  Q    +  D L+V +G+GSGFVWDK GHIVTN
Sbjct: 123 TVRLFQENTPSVVYITNL-------AVRQDAFTL--DVLEVPQGSGSGFVWDKQGHIVTN 182

Query: 138 YHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIV 197
           YHV+        G+   +V L D        +AK+VGFD + D+AVL+++   ++L+PI 
Sbjct: 183 YHVI-------RGASDLRVTLAD----QTTFDAKVVGFDQDKDVAVLRIDAPKNKLRPIP 242

Query: 198 FGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS-PNGRAIRGAIQTDAAISAG 257
            G S +L VGQ  +AIGNPFG + TLT GVISGL REI S   GR I+  IQTDAAI+ G
Sbjct: 243 VGVSADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPG 302

Query: 258 NSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           NSGGPL+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Sbjct: 303 NSGGPLLDSSGTLIGINTAIYS--PSGASSGVGFSIPVDTVGGIVDQLVRFG 330

BLAST of CsGy1G004540 vs. TAIR 10
Match: AT5G39830.1 (Trypsin family protein with PDZ domain )

HSP 1 Score: 195.7 bits (496), Expect = 5.5e-50
Identity = 134/289 (46.37%), Positives = 173/289 (59.86%), Query Frame = 0

Query: 24  LPFTSRRAIL--------FSPAALLPSLLAFPLPTHAALPQLQ------DHLLQEEDRTV 83
           +P T+RR +L        F+P+  L S LA   P+ A +  +         L   E R V
Sbjct: 62  VPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPLFPTEGRIV 121

Query: 84  SLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHV 143
            LF++ + SVV I D+ L   PQ      + I +      G GSG VWD  G+IVTNYHV
Sbjct: 122 QLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQGYIVTNYHV 181

Query: 144 V-SALATDNS-GSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVF 203
           + +AL+ + S G    +VN++   G     E K+VG D   DLAVLKV+     LKPI  
Sbjct: 182 IGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPETLLKPIKV 241

Query: 204 GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNS 263
           G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I G IQTDAAI+ GNS
Sbjct: 242 GQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTDAAINPGNS 301

Query: 264 GGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           GGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Sbjct: 302 GGPLLDSKGNLIGINTAIFTQ--TGTSAGVGFAIPSSTVLKIVPQLIQF 339

BLAST of CsGy1G004540 vs. TAIR 10
Match: AT5G39830.2 (Trypsin family protein with PDZ domain )

HSP 1 Score: 173.3 bits (438), Expect = 2.9e-43
Identity = 125/289 (43.25%), Positives = 163/289 (56.40%), Query Frame = 0

Query: 24  LPFTSRRAIL--------FSPAALLPSLLAFPLPTHAALPQLQ------DHLLQEEDRTV 83
           +P T+RR +L        F+P+  L S LA   P+ A +  +         L   E R V
Sbjct: 62  VPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPLFPTEGRIV 121

Query: 84  SLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHV 143
            LF++ + SVV I D+ L   PQ      + I +      G GSG VWD  G+IVTNYHV
Sbjct: 122 QLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQGYIVTNYHV 181

Query: 144 V-SALATDNS-GSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVF 203
           + +AL+ + S G    +VN++   G     E K+VG D   DLAVLKV+     LKPI  
Sbjct: 182 IGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPETLLKPIKV 241

Query: 204 GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNS 263
           G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I G IQTDAAI+ GNS
Sbjct: 242 GQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTDAAINPGNS 301

Query: 264 GGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           GGPL+DS G++IG+NTA FT+                TV++ VP LI +
Sbjct: 302 GGPLLDSKGNLIGINTAIFTQ----------------TVLKIVPQLIQF 325

BLAST of CsGy1G004540 vs. TAIR 10
Match: AT5G27660.1 (Trypsin family protein with PDZ domain )

HSP 1 Score: 87.4 bits (215), Expect = 2.1e-17
Identity = 65/186 (34.95%), Positives = 95/186 (51.08%), Query Frame = 0

Query: 109 KGTGSGFVWDKFGHIVTNYHVVSALAT-DNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPE 168
           K  GSG + D  G I+T  HVV       +S   R  V L D    G   E  +V  D +
Sbjct: 146 KSIGSGTIIDADGTILTCAHVVVDFQNIRHSSKGRVDVTLQD----GRTFEGVVVNADLQ 205

Query: 169 YDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSP 228
            D+A++K++ +   L     G S  LR G    A+G P   + T+TAG++S + R+    
Sbjct: 206 SDIALVKIKSK-TPLPTAKLGFSSKLRPGDWVIAVGCPLSLQNTVTAGIVSCVDRKSSDL 265

Query: 229 N-GRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 288
             G   R  +QTD +I+AGNSGGPLV+  G VIGVN           + G+ F++PID+V
Sbjct: 266 GLGGKHREYLQTDCSINAGNSGGPLVNLDGEVIGVNIMKVL-----AADGLGFSVPIDSV 321

Query: 289 VRTVPY 293
            + + +
Sbjct: 326 SKIIEH 321
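
The percentages on each HSP statistics line follow directly from the counts beside them (matching positions divided by alignment length). A minimal sketch that recomputes them for the two HSPs above is given here; the helper name `recompute` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern tolerates spelling variants of the positives label.

```python
import re

# Statistics lines for the two HSPs shown above (counts copied from the page).
HSP_STATS = {
    "AT5G39830.2": "Identity = 125/289 (43.25%), Positives = 163/289 (56.40%), Query Frame = 0",
    "AT5G27660.1": "Identity = 65/186 (34.95%), Positives = 95/186 (51.08%), Query Frame = 0",
}

# Hypothetical pattern: label, matched positions, alignment length, reported percent.
FIELD = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def recompute(line):
    """Return {label: (reported_pct, recomputed_pct)} for one statistics line."""
    result = {}
    for label, matched, length, reported in FIELD.findall(line):
        recomputed = round(100 * int(matched) / int(length), 2)
        result[label] = (float(reported), recomputed)
    return result


for match_name, line in HSP_STATS.items():
    for label, (reported, recomputed) in recompute(line).items():
        print(f"{match_name} {label}: reported {reported:.2f}%, recomputed {recomputed:.2f}%")
```

The denominator is the alignment length including gap columns (289 and 186 here), not the length of either full protein, so these values describe only the aligned region.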

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SEL7 | 3.3e-100 | 66.32 | Protease Do-like 5, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP5 PE=1 ...
O22609 | 4.6e-49 | 44.86 | Protease Do-like 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP1 PE=1 ...
Q9LU10 | 7.8e-49 | 46.37 | Protease Do-like 8, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DEGP8 PE=1 ...
Q2SL36 | 4.8e-30 | 44.20 | Probable periplasmic serine endoprotease DegP-like OS=Hahella chejuensis (strain...
F6AA62 | 3.4e-28 | 38.86 | Probable periplasmic serine endoprotease DegP-like OS=Pseudomonas fulva (strain ...

Match Name | E-value | Identity (%) | Description
XP_004137559.1 | 1.12e-218 | 100.00 | protease Do-like 5, chloroplastic [Cucumis sativus] >KGN63906.1 hypothetical pro...
XP_008444456.1 | 8.28e-203 | 94.43 | PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo] >KAA0053993.1 protea...
XP_038893976.1 | 1.40e-188 | 88.49 | protease Do-like 5, chloroplastic [Benincasa hispida]
XP_022140016.1 | 2.66e-185 | 85.86 | protease Do-like 5, chloroplastic isoform X2 [Momordica charantia]
XP_022927238.1 | 1.60e-184 | 86.89 | protease Do-like 5, chloroplastic isoform X1 [Cucurbita moschata]

Match Name | E-value | Identity (%) | Description
A0A0A0LV65 | 5.42e-219 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G025870 PE=3 SV=1
A0A5D3DAW0 | 4.01e-203 | 94.43 | Protease Do-like 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G0...
A0A1S3B9W5 | 4.01e-203 | 94.43 | protease Do-like 5, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487774 PE=3 S...
A0A6J1CEE6 | 1.29e-185 | 85.86 | protease Do-like 5, chloroplastic isoform X2 OS=Momordica charantia OX=3673 GN=L...
A0A6J1EKF8 | 7.73e-185 | 86.89 | protease Do-like 5, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LO...

Match Name | E-value | Identity (%) | Description
AT4G18370.1 | 2.4e-101 | 66.32 | DEGP protease 5
AT3G27925.1 | 3.2e-50 | 44.86 | DegP protease 1
AT5G39830.1 | 5.5e-50 | 46.37 | Trypsin family protein with PDZ domain
AT5G39830.2 | 2.9e-43 | 43.25 | Trypsin family protein with PDZ domain
AT5G27660.1 | 2.1e-17 | 34.95 | Trypsin family protein with PDZ domain
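
Hit lists like the TAIR 10 results above are usually filtered by E-value and identity, and splice variants of one locus are collapsed before homologs are compared. The sketch below illustrates this with the five TAIR hits. The cutoffs (`max_evalue=1e-30`, `min_identity=40.0`) and both function names are hypothetical choices for illustration, not thresholds used by this database.

```python
# TAIR 10 hits from the summary above: (match name, E-value, percent identity).
TAIR_HITS = [
    ("AT4G18370.1", 2.4e-101, 66.32),
    ("AT3G27925.1", 3.2e-50, 44.86),
    ("AT5G39830.1", 5.5e-50, 46.37),
    ("AT5G39830.2", 2.9e-43, 43.25),
    ("AT5G27660.1", 2.1e-17, 34.95),
]


def filter_hits(hits, max_evalue=1e-30, min_identity=40.0):
    """Keep hits passing both (assumed) cutoffs, ordered by ascending E-value."""
    kept = [h for h in hits if h[1] <= max_evalue and h[2] >= min_identity]
    return sorted(kept, key=lambda h: h[1])


def best_per_locus(hits):
    """Collapse isoforms (AT5G39830.1 / .2) to the lowest-E-value model per locus."""
    best = {}
    for name, evalue, identity in hits:
        locus = name.split(".")[0]  # drop the gene-model suffix
        if locus not in best or evalue < best[locus][1]:
            best[locus] = (name, evalue, identity)
    return best


for name, evalue, identity in filter_hits(TAIR_HITS):
    print(f"{name}\t{evalue:.1e}\t{identity:.2f}")
```

With these assumed cutoffs, AT5G27660.1 is dropped and AT5G39830 is represented by its `.1` model, leaving AT4G18370.1 (DEGP protease 5) as the top-ranked hit.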
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001940 | Peptidase S1C | PRINTS | PR00834 | PROTEASES2C | coord: 232..249, score: 62.74; coord: 195..219, score: 48.88; coord: 254..271, score: 37.67; coord: 121..133, score: 50.47
IPR043504 | Peptidase S1, PA clan, chymotrypsin-like fold | GENE3D | 2.40.10.10 |  | coord: 60..185, e-value: 8.2E-31, score: 108.2
IPR043504 | Peptidase S1, PA clan, chymotrypsin-like fold | GENE3D | 2.40.10.10 |  | coord: 186..298, e-value: 5.0E-40, score: 138.3
None | No IPR available | PFAM | PF13365 | Trypsin_2 | coord: 112..261, e-value: 6.3E-34, score: 118.1
None | No IPR available | PANTHER | PTHR43343:SF6 | PROTEASE DO-LIKE 5, CHLOROPLASTIC ISOFORM X1 | coord: 18..304
None | No IPR available | PANTHER | PTHR43343 | PEPTIDASE S12 | coord: 18..304
IPR009003 | Peptidase S1, PA clan | SUPERFAMILY | 50494 | Trypsin-like serine proteases | coord: 70..290
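
The `coord: start..end` values above can be turned into residue counts per source. The sketch below merges overlapping or directly adjacent intervals and sums the covered positions. It assumes the coordinates are 1-based and inclusive on the protein sequence; the function names `merge` and `covered` are illustrative only.

```python
# Alignment coordinates from the InterPro annotations above, grouped by source.
# Assumption: coordinates are 1-based, inclusive positions on the protein.
FEATURES = {
    "PRINTS PR00834": [(232, 249), (195, 219), (254, 271), (121, 133)],
    "GENE3D 2.40.10.10": [(60, 185), (186, 298)],
    "PFAM PF13365": [(112, 261)],
    "SUPERFAMILY 50494": [(70, 290)],
    "PANTHER PTHR43343": [(18, 304)],
}


def merge(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous interval instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered(intervals):
    """Number of residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge(intervals))


for source, intervals in FEATURES.items():
    print(f"{source}: {merge(intervals)} covers {covered(intervals)} residues")
```

Under that assumption the two Gene3D segments (60..185 and 186..298) are contiguous and merge into a single 60..298 span, while the four PRINTS motifs stay separate.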

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsGy1G004540.1 | CsGy1G004540.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0043231 | intracellular membrane-bounded organelle
molecular_function | GO:0004252 | serine-type endopeptidase activity