Csa1G025870 (gene) Cucumber (Chinese Long) v2

Name: Csa1G025870
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Serine protease; contains IPR001940 (Peptidase S1C), IPR009003 (Trypsin-like cysteine/serine peptidase domain)
Location: Chr1 : 2900571 .. 2903201 (+)
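
The location string above can be turned into numeric coordinates with a few lines of Python. This is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr1 : 2900571 .. 2903201 (+)'.

    Returns (chromosome, start, end, strand). The function name and the
    accepted layout are assumptions based on this record only.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1 : 2900571 .. 2903201 (+)")

# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
gene_length = end - start + 1
print(chrom, start, end, strand, gene_length)
```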
The following sequences are available for this feature:

Gene sequence (with intron)

AGTAATCTCAACTGGCAAGGAAGAAGATAATGTGATTTTGGTACACCTAGTTGAAGAAGATTGATTGAATCGAAGCTAAGTGGAGAAAACATCATCCTTCTATGGCGTTGGGCTTACTGGGAATTCCTCCTCTTCCATTTCCAGCTCCCCCAAATTCCTCTCAAAATCCCTTACCCTTCACTTCCCGAAGAGCCATTCTGTTTTCCCCAGCTGCTTTACTCCCTTCCCTCCTCGCTTTTCCTCTTCCCACTCACGCCGCTCTTCCCCAACTCCAAGACCACCTTCTACAAGAAGAAGATAGAACCGTTTCCCTTTTCCAGGTCCCTCCTCTCTTTCTCTTACAATCCTTCTTTTTCAATTCAATTTGGGTCTTTCCTTTATCTCTTTTCTTCTGTTGGTTCTTCATTGCAATCTAGGAAACTTCGCCTTCCGTCGTTTATATTAACGACCTTGAATTACCTAAAAACCCTCAAGCCCCTTCTCAACAACCCATGCTTATCGAGGATGATAATCTCAAGGTTAAAGGGACTGGTTCGGGTTTTGTTTGGGATAAATTTGGCCATATCGTAAGACCATCTTCTTTGATTCTTGCCATCTCTTTTTTTTTTCTTTTTTTTTCACCTTTTTTTTGTTCGTTACTTGAAGCATTTCTGCTTGCGACAGGTTACCAATTACCATGTTGTTTCTGCATTGGCTACTGATAATAGTGGATCACAGCGTTGTAAGGTTTTGTTCGTTCGTTTCTGTGTTTTGGATTGTTGTGCATTCTTAGTGGATGATATGCTAGCAGTCCCTAGCTATGATTCCTCGGATTTGGGTTCTTCTGTAGGTAAACTTAGTCGATGTTAAAGGAAACGGAATCTATAAAGAGGCAAAGATTGTTGGTTTTGATCCAGAGTATGATCTAGCTGTTCTTAAGGTAATATAGCAAGCTTGCTTGTATTTTTCATTGTTTCGAGGAGAATGCCAGATTGAGTGAATCGTGTTTTGGTGATCTTCATACCATCTTCTATTCCAAATATAGTTTTGCAAATACGGCAGACTGGACAAGCTATATTGTCAAACTCTTATATTGTACCATTTATTTGCTGAGTTAGAAATTTCCTTCATTCAGGTGGAACTTGAAGGACATGAACTAAAGCCCATAGTCTTCGGTACCTCTCGAAATCTACGTGTTGGTCAAAGCTGCTATGCCATTGGCAACCCTTTTGGGTATGAGAAGACACTAACAGCAGGGGTAAGAAATCAATAGTGTCATTTCCTCCATTTATTTTAAGGAAACCCACTTTGAATTTTCCATTTCAAAGGTAAAACTAAATTCCAAAGTGAAATACTTCTCTCTTACAATAAACGCATTAGTCATAGAAGGTATGGTATCTCTCATCTCACACTTGTATGGTTTAAATAAAGCGAATGAAGTAAGCGTTTCTCCAAGTGTCACAATCTATTGGTGTTGACTTCCAAGTAAAACCGAGTCATATAGCAGATTGACTAAATAATTGAGAACCGTAAATCATTCAGGGTTTGGATAGCAGGTGATCAGCGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATAAGGGGAGCTATTCAGACAGATGCTGCTATTAGTGCAGGTTCATGTTTCAACTACCTGAAATAAGACTTTGCCATTTCAATCACTGACATCTTTTCGTGCTCTTTCTTCTTCCTCTTTTCAAACTTATTTTACTTTCATCATTCACATCCCAAGTGTAAGCCTTATAGTTACTTATTCCACTGATCTTTATTTTTTTATACTATGGAGCAGTTAAATTCCATGTACATTATTTATACCAGCTTATTTTGATTTTTCCATTCCATCTCTGGTTCATTGCTGCCCAAGTCATCCATTATAATAACTATTTCTTAAATTGCCCACATTTTGATGCATAAAACATTTTACAGGAAATTCAGGGGGGCCTTTGGTTGACTCGTACGGTCATGTTATTGGAGTCAACACAGCAACTTTCAC
TCGCAAAGGTGAAGGATCTAAATTTCTTCATGTGCCATAGTTTGTCAGAGGAAATATGAAATGCATAGTGTTTCCCGCTTTTGCTTTAAATGTGGCTGATTTTTTTTGGCGGTGCTTATTAGTTATTAATTATCAAAGCTAACTCAACCTTTTATGGCCACTTGAAAGAACAGTGTTCAGAGTAGTAAAGAAAGAAAAAACGAACAAATAGTTGTATTTTTTTTTGTTTGTTTATAGAGTTCAATGTTTGTGTTATTTCATGTTTGGTTATGCATTGCAGGAACTGGAATGTCTTCTGGTGTCAACTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATACCTTATTGTTTATGGAACACCTTACAGTGAGAGATTTTGAACAAAATTTAACTGTCTCAAGGGAACACAGCAAACTCCTCTCCCCAAGACCATATACCACATTGTAAATGACATTTTGCTAATTGAAACGAATTTGCTCTCTCAAGTGTAATATCCTCGAGCATTCAACATCTTGGTCTTGATTGTGAATAGTTATTTGCTGTGTGTTATTCTTCTCTCTTTGTAGGGTAATTGTATTATTGAAGTAGTGTTGATAAAATACAATGGAAAGAGTTTGAATGATG

mRNA sequence

ATGGCGTTGGGCTTACTGGGAATTCCTCCTCTTCCATTTCCAGCTCCCCCAAATTCCTCTCAAAATCCCTTACCCTTCACTTCCCGAAGAGCCATTCTGTTTTCCCCAGCTGCTTTACTCCCTTCCCTCCTCGCTTTTCCTCTTCCCACTCACGCCGCTCTTCCCCAACTCCAAGACCACCTTCTACAAGAAGAAGATAGAACCGTTTCCCTTTTCCAGGAAACTTCGCCTTCCGTCGTTTATATTAACGACCTTGAATTACCTAAAAACCCTCAAGCCCCTTCTCAACAACCCATGCTTATCGAGGATGATAATCTCAAGGTTAAAGGGACTGGTTCGGGTTTTGTTTGGGATAAATTTGGCCATATCGTTACCAATTACCATGTTGTTTCTGCATTGGCTACTGATAATAGTGGATCACAGCGTTGTAAGGTAAACTTAGTCGATGTTAAAGGAAACGGAATCTATAAAGAGGCAAAGATTGTTGGTTTTGATCCAGAGTATGATCTAGCTGTTCTTAAGGTGGAACTTGAAGGACATGAACTAAAGCCCATAGTCTTCGGTACCTCTCGAAATCTACGTGTTGGTCAAAGCTGCTATGCCATTGGCAACCCTTTTGGGTATGAGAAGACACTAACAGCAGGGGTGATCAGCGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATAAGGGGAGCTATTCAGACAGATGCTGCTATTAGTGCAGGAAATTCAGGGGGGCCTTTGGTTGACTCGTACGGTCATGTTATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTCAACTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATACCTTATTGTTTATGGAACACCTTACAGTGAGAGATTTTGA

Coding sequence (CDS)

ATGGCGTTGGGCTTACTGGGAATTCCTCCTCTTCCATTTCCAGCTCCCCCAAATTCCTCTCAAAATCCCTTACCCTTCACTTCCCGAAGAGCCATTCTGTTTTCCCCAGCTGCTTTACTCCCTTCCCTCCTCGCTTTTCCTCTTCCCACTCACGCCGCTCTTCCCCAACTCCAAGACCACCTTCTACAAGAAGAAGATAGAACCGTTTCCCTTTTCCAGGAAACTTCGCCTTCCGTCGTTTATATTAACGACCTTGAATTACCTAAAAACCCTCAAGCCCCTTCTCAACAACCCATGCTTATCGAGGATGATAATCTCAAGGTTAAAGGGACTGGTTCGGGTTTTGTTTGGGATAAATTTGGCCATATCGTTACCAATTACCATGTTGTTTCTGCATTGGCTACTGATAATAGTGGATCACAGCGTTGTAAGGTAAACTTAGTCGATGTTAAAGGAAACGGAATCTATAAAGAGGCAAAGATTGTTGGTTTTGATCCAGAGTATGATCTAGCTGTTCTTAAGGTGGAACTTGAAGGACATGAACTAAAGCCCATAGTCTTCGGTACCTCTCGAAATCTACGTGTTGGTCAAAGCTGCTATGCCATTGGCAACCCTTTTGGGTATGAGAAGACACTAACAGCAGGGGTGATCAGCGGATTGGGTAGAGAAATTCCATCACCAAATGGAAGGGCCATAAGGGGAGCTATTCAGACAGATGCTGCTATTAGTGCAGGAAATTCAGGGGGGCCTTTGGTTGACTCGTACGGTCATGTTATTGGAGTCAACACAGCAACTTTCACTCGCAAAGGAACTGGAATGTCTTCTGGTGTCAACTTTGCTATACCAATAGACACAGTTGTACGGACTGTGCCATACCTTATTGTTTATGGAACACCTTACAGTGAGAGATTTTGA

Protein sequence

MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDHLLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF*
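
The protein sequence is the translation of the coding sequence under the standard genetic code. The sketch below illustrates this on the first 30 and last 24 nucleotides of the CDS shown above; the helper `translate` is hypothetical and assumes the standard code (NCBI translation table 1) with `*` marking a stop codon.

```python
# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 24 nucleotides of the CDS listed in this record.
cds_start = "ATGGCGTTGGGCTTACTGGGAATTCCTCCT"
cds_end = "ACACCTTACAGTGAGAGATTTTGA"

print(translate(cds_start))  # beginning of the protein
print(translate(cds_end))    # end of the protein, including the stop
```

The two fragments translate to the first ten residues and the last seven residues plus the terminal `*` of the protein sequence above.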
BLAST of Csa1G025870 vs. Swiss-Prot
Match: DEGP5_ARATH (Protease Do-like 5, chloroplastic OS=Arabidopsis thaliana GN=DEGP5 PE=1 SV=3)

HSP 1 Score: 366.3 bits (939), Expect = 3.2e-100
Identity = 189/285 (66.32%), Positives = 221/285 (77.54%), Query Frame = 1

Query: 29  RRAILF-SPAALLPSLLA-----FPLPTHAALPQL---QDHLLQEEDRTVSLFQETSPSV 88
           RR ++F S  AL  SLL       P+ +  AL Q    ++ L +EE+R V+LFQ+TSPSV
Sbjct: 43  RRIMIFGSSLALTSSLLGSNQQRLPMESAIALEQFKEKEEELEEEEERNVNLFQKTSPSV 102

Query: 89  VYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG 148
           VYI  +ELPK     S   +L +++N K++GTGSGFVWDK GHIVTNYHV++ LATD  G
Sbjct: 103 VYIEAIELPKT----SSGDILTDEENGKIEGTGSGFVWDKLGHIVTNYHVIAKLATDQFG 162

Query: 149 SQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSC 208
            QRCKV+LVD KG    KE KIVG DP+ DLAVLK+E EG EL P+V GTS +LRVGQSC
Sbjct: 163 LQRCKVSLVDAKGTRFSKEGKIVGLDPDNDLAVLKIETEGRELNPVVLGTSNDLRVGQSC 222

Query: 209 YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVI 268
           +AIGNP+GYE TLT GV+SGLGREIPSPNG++I  AIQTDA I++GNSGGPL+DSYGH I
Sbjct: 223 FAIGNPYGYENTLTIGVVSGLGREIPSPNGKSISEAIQTDADINSGNSGGPLLDSYGHTI 282

Query: 269 GVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           GVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Sbjct: 283 GVNTATFTRKGSGMSSGVNFAIPIDTVVRTVPYLIVYGTAYRDRF 323
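
The percentages in each HSP statistics line are the match counts divided by the alignment length. A minimal sketch of how such a line could be parsed and the percentages recomputed follows; the function name `parse_hsp_stats` is hypothetical, and the regular expression is only an assumption about the layout used in this record.

```python
import re

def parse_hsp_stats(line):
    """Extract 'Label = hits/total' pairs and recompute each percentage.

    Returns {label: (hits, total, percent rounded to 2 decimals)}.
    Fields without a 'hits/total' fraction (e.g. 'Query Frame = 1')
    are ignored.
    """
    stats = {}
    for label, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[label] = (hits, total, round(100 * hits / total, 2))
    return stats

# Statistics line of the DEGP5_ARATH hit above.
line = "Identity = 189/285 (66.32%), Positives = 221/285 (77.54%), Query Frame = 1"
stats = parse_hsp_stats(line)
for label, (hits, total, percent) in stats.items():
    print(f"{label}: {hits}/{total} = {percent:.2f}%")
```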

BLAST of Csa1G025870 vs. Swiss-Prot
Match: DEGP1_ARATH (Protease Do-like 1, chloroplastic OS=Arabidopsis thaliana GN=DEGP1 PE=1 SV=2)

HSP 1 Score: 197.6 bits (501), Expect = 2.0e-49
Identity = 131/292 (44.86%), Positives = 171/292 (58.56%), Query Frame = 1

Query: 18  NSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQD----------HLLQEEDR 77
           +   + L FT   A+   P  LL + +A      AA P ++            L  +E  
Sbjct: 63  DDDDDTLHFTPFSAV--KPFFLLCTSVALSFSLFAASPAVESASAFVVSTPKKLQTDELA 122

Query: 78  TVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKV-KGTGSGFVWDKFGHIVTN 137
           TV LFQE +PSVVYI +L       A  Q    +  D L+V +G+GSGFVWDK GHIVTN
Sbjct: 123 TVRLFQENTPSVVYITNL-------AVRQDAFTL--DVLEVPQGSGSGFVWDKQGHIVTN 182

Query: 138 YHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIV 197
           YHV+        G+   +V L D        +AK+VGFD + D+AVL+++   ++L+PI 
Sbjct: 183 YHVIR-------GASDLRVTLADQTTF----DAKVVGFDQDKDVAVLRIDAPKNKLRPIP 242

Query: 198 FGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS-PNGRAIRGAIQTDAAISAG 257
            G S +L VGQ  +AIGNPFG + TLT GVISGL REI S   GR I+  IQTDAAI+ G
Sbjct: 243 VGVSADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPG 302

Query: 258 NSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           NSGGPL+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Sbjct: 303 NSGGPLLDSSGTLIGINTAIYS--PSGASSGVGFSIPVDTVGGIVDQLVRFG 330

BLAST of Csa1G025870 vs. Swiss-Prot
Match: DEGP8_ARATH (Protease Do-like 8, chloroplastic OS=Arabidopsis thaliana GN=DEGP8 PE=1 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.0e-49
Identity = 134/289 (46.37%), Positives = 171/289 (59.17%), Query Frame = 1

Query: 24  LPFTSRRAIL--------FSPAALLPSLLAFPLPTHAALPQLQDH------LLQEEDRTV 83
           +P T+RR +L        F+P+  L S LA   P+ A +  +         L   E R V
Sbjct: 62  VPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPLFPTEGRIV 121

Query: 84  SLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHV 143
            LF++ + SVV I D+ L   PQ      + I +      G GSG VWD  G+IVTNYHV
Sbjct: 122 QLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQGYIVTNYHV 181

Query: 144 V-SALATDNS-GSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVF 203
           + +AL+ + S G    +VN++   G     E K+VG D   DLAVLKV+     LKPI  
Sbjct: 182 IGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPETLLKPIKV 241

Query: 204 GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNS 263
           G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I G IQTDAAI+ GNS
Sbjct: 242 GQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTDAAINPGNS 301

Query: 264 GGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           GGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Sbjct: 302 GGPLLDSKGNLIGINTAIFTQ--TGTSAGVGFAIPSSTVLKIVPQLIQF 339

BLAST of Csa1G025870 vs. Swiss-Prot
Match: DEGPL_HAHCH (Probable periplasmic serine endoprotease DegP-like OS=Hahella chejuensis (strain KCTC 2396) GN=mucD PE=3 SV=1)

HSP 1 Score: 132.1 bits (331), Expect = 1.0e-29
Identity = 80/181 (44.20%), Positives = 110/181 (60.77%), Query Frame = 1

Query: 107 KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDP 166
           + + TGSGF+  K G+I+TN HVV+       G+    V L+D +       AK++G D 
Sbjct: 87  EAQSTGSGFIVSKDGYILTNNHVVA-------GADEIFVRLMDRRE----LTAKLIGSDE 146

Query: 167 EYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS 226
           + DLAVLKVE +  +L  +  G S  L+VG+   AIG+PFG+E T+TAG++S  GR +P+
Sbjct: 147 KSDLAVLKVEAD--DLPVLNLGKSSELKVGEWVVAIGSPFGFEYTVTAGIVSAKGRSLPN 206

Query: 227 PNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 286
            N       IQTD AI+ GNSGGPL +  G V+G+N+  +TR G  M  GV+FAIPID  
Sbjct: 207 ENYVPF---IQTDVAINPGNSGGPLFNLEGEVVGINSQIYTRSGGFM--GVSFAIPIDVA 249

Query: 287 V 288
           +
Sbjct: 267 L 249

BLAST of Csa1G025870 vs. Swiss-Prot
Match: DEGPL_PSEF1 (Probable periplasmic serine endoprotease DegP-like OS=Pseudomonas fulva (strain 12-X) GN=Psefu_3239 PE=3 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 4.3e-28
Identity = 82/211 (38.86%), Positives = 117/211 (55.45%), Query Frame = 1

Query: 87  LPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVN 146
           +P+ P+AP       E  +L     GSGF+  K G+I+TN HVV+        +    V 
Sbjct: 82  IPQQPRAPGGGGRQREAQSL-----GSGFIISKDGYILTNNHVVA-------DADEIIVR 141

Query: 147 LVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPF 206
           L D        EAK++G DP  D+A+LKVE   ++L  +  G S NL+VG+   AIG+PF
Sbjct: 142 LSDRSE----LEAKLIGTDPRSDVALLKVE--ANDLPTVKLGNSDNLKVGEWVLAIGSPF 201

Query: 207 GYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATF 266
           G++ ++TAG++S  GR +P+    +    IQTD AI+ GNSGGPL +  G V+G+N+  F
Sbjct: 202 GFDHSVTAGIVSAKGRSLPN---ESYVPFIQTDVAINPGNSGGPLFNLDGEVVGINSQIF 261

Query: 267 TRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           TR G  M  G++FAIP+   +     L   G
Sbjct: 262 TRSGGFM--GLSFAIPMSVAMDVADQLKASG 269

BLAST of Csa1G025870 vs. TrEMBL
Match: A0A0A0LV65_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G025870 PE=4 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 3.8e-172
Identity = 304/304 (100.00%), Positives = 304/304 (100.00%), Query Frame = 1

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH
Sbjct: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF
Sbjct: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH
Sbjct: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA
Sbjct: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 305
           SERF
Sbjct: 301 SERF 304

BLAST of Csa1G025870 vs. TrEMBL
Match: A0A061GEY2_THECC (Protease degS, putative isoform 1 OS=Theobroma cacao GN=TCM_029456 PE=4 SV=1)

HSP 1 Score: 420.6 bits (1080), Expect = 1.6e-114
Identity = 215/304 (70.72%), Positives = 243/304 (79.93%), Query Frame = 1

Query: 10  PLPFPAPPNSSQNP----LPFTSRRAILFSPAALLPSLLAFPLPTHA-----ALPQLQDH 69
           PLP     +SS++     L  T RRAI+      + SLL    P  +     AL Q  + 
Sbjct: 13  PLPTTTTSSSSESSDNKSLVITRRRAIVSGSTVAVASLLQLSNPVSSLYSAIALQQQDEE 72

Query: 70  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 129
           L +EEDR V LFQETSPSVV+I DLEL K P++ SQ+  L ED++ KV+GTGSGF+WDKF
Sbjct: 73  LDEEEDRIVRLFQETSPSVVFIKDLELAKIPKSSSQEVTLAEDEDAKVEGTGSGFIWDKF 132

Query: 130 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 189
           GHIVTNYHVV  LATD SG QRCKV LVD +G   YKE KIVG DP YDLAVLKV++EG+
Sbjct: 133 GHIVTNYHVVDKLATDQSGLQRCKVFLVDARGTSFYKEGKIVGIDPAYDLAVLKVDVEGY 192

Query: 190 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 249
           ELKP+V GTSR+LRVGQSC+AIGNPFGYE TLT GV+SGLGREIPSPNGRAIRGAIQTDA
Sbjct: 193 ELKPVVLGTSRDLRVGQSCFAIGNPFGYENTLTTGVVSGLGREIPSPNGRAIRGAIQTDA 252

Query: 250 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 305
           AI+AGNSGGPL+DSYGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 253 AINAGNSGGPLIDSYGHVIGVNTATFTRKGTGVSSGVNFAIPIDTVVRTVPYLIVYGTPY 312

BLAST of Csa1G025870 vs. TrEMBL
Match: M5X1A6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008782mg PE=4 SV=1)

HSP 1 Score: 418.3 bits (1074), Expect = 8.0e-114
Identity = 217/318 (68.24%), Positives = 247/318 (77.67%), Query Frame = 1

Query: 1   MALGLLG-IPPLPFP-----APPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLP----- 60
           + LG L  I P P P     +  N SQ  L  T RRAI+  P+ ++ SLL F  P     
Sbjct: 2   VVLGSLSQIKPSPIPTNSSSSSSNPSQKGLLLTRRRAIVLGPSVVVASLLHFYNPISQQP 61

Query: 61  -THAALPQLQ--DHLLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNL 120
            +H AL Q Q  D L QEEDR+V LFQETSPSVV+I DLE+ K+ +A      L ED N 
Sbjct: 62  SSHLALAQQQQEDELQQEEDRSVHLFQETSPSVVFIKDLEIDKSLKASPDAVFLSEDGNS 121

Query: 121 KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDP 180
           KV+GTGSGF+WDKFGHIVTNYHVV+ LATD +G QRCKV LVD +GNG Y E KIVG DP
Sbjct: 122 KVEGTGSGFIWDKFGHIVTNYHVVAKLATDQTGLQRCKVYLVDARGNGFYSEGKIVGVDP 181

Query: 181 EYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS 240
            YDLAVLKV++EGHELKP+V GTS  L VGQSC+AIGNP+GYE TLT GV+SGLGREIPS
Sbjct: 182 AYDLAVLKVDVEGHELKPVVLGTSNGLHVGQSCFAIGNPYGYENTLTIGVVSGLGREIPS 241

Query: 241 PNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 300
           P+G+AIRGAIQTDAAI++GNSGGPL+DSYGH+IGVNTATFTRKGTG SSGVNFAIPIDTV
Sbjct: 242 PDGKAIRGAIQTDAAINSGNSGGPLIDSYGHIIGVNTATFTRKGTGASSGVNFAIPIDTV 301

Query: 301 VRTVPYLIVYGTPYSERF 305
           VRTVPYLIVYGTPY +RF
Sbjct: 302 VRTVPYLIVYGTPYRDRF 319

BLAST of Csa1G025870 vs. TrEMBL
Match: A0A067KSL0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04636 PE=4 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 1.5e-112
Identity = 209/293 (71.33%), Positives = 240/293 (81.91%), Query Frame = 1

Query: 18  NSSQNPLPFTSRRAILFSPAALLPSLLAF------PLPTHAALPQLQDHLLQEEDRTVSL 77
           +SS   L  T RRA  F  + +L SLL        PL  + A+ Q +D L +EE R V+L
Sbjct: 27  DSSNKILILTRRRAAAFGSSLVLASLLNLQNLHSKPLSFNYAIAQ-EDELEKEEHRIVNL 86

Query: 78  FQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVS 137
           FQ TSPSVV+I DLEL K+P++ S      ED+N KV+GTGSGF+WDKFGHIVTNYHVV 
Sbjct: 87  FQITSPSVVFIKDLELAKSPESSSFDVAFTEDENAKVEGTGSGFIWDKFGHIVTNYHVVD 146

Query: 138 ALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSR 197
            LATD SG QRCKV LVD  GNG+Y+E KI+GFDP YDLAVLKV++EGHELKP V GTS+
Sbjct: 147 KLATDKSGLQRCKVFLVDAGGNGLYREGKIIGFDPAYDLAVLKVDVEGHELKPAVLGTSK 206

Query: 198 NLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPL 257
           +LRVGQSC+AIGNP+GY+ TLT GV+SGLGREIPSPNGRAIRGAIQTDAAI+AGNSGGPL
Sbjct: 207 DLRVGQSCFAIGNPYGYDNTLTTGVVSGLGREIPSPNGRAIRGAIQTDAAINAGNSGGPL 266

Query: 258 VDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           +DSYGHVIGVNTATFTRKGTG+SSGVNFAIPIDTV+RTVPYLIVYGTPYS+RF
Sbjct: 267 IDSYGHVIGVNTATFTRKGTGISSGVNFAIPIDTVLRTVPYLIVYGTPYSDRF 318

BLAST of Csa1G025870 vs. TrEMBL
Match: A0A0D2VI33_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G045700 PE=4 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 3.3e-112
Identity = 218/314 (69.43%), Positives = 245/314 (78.03%), Query Frame = 1

Query: 5   LLGIPPLPFPAPP---NSSQNP---LPF-TSRRAILFSPAALLPSLLAFPLPTHAALPQL 64
           L  +  LP P PP   +SS  P    PF T RRAI+ +  A + SLL    P  +  P +
Sbjct: 4   LASLHTLPSPLPPTTTSSSSEPSYKTPFITRRRAIISASTAAIASLLHISNPIPSLYPSI 63

Query: 65  -----QDHLLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQ--QPMLIEDDNLKVKG 124
                Q  L QEEDR V LFQETSPSVV+I DLEL K P++ S+  + ++ ED++ KV+G
Sbjct: 64  ALQPQQVELDQEEDRIVRLFQETSPSVVFIEDLELAKIPKSSSKGDRDVVAEDEDAKVEG 123

Query: 125 TGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDL 184
           TGSGF+WDKFGHIVTNYHVV  LATD SG Q CKV L D  G   YKE KIVG DP YDL
Sbjct: 124 TGSGFIWDKFGHIVTNYHVVDKLATDQSGLQSCKVLLADASGTSFYKEGKIVGIDPAYDL 183

Query: 185 AVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGR 244
           AVLKV++EG+ELKPIV GTSR+LRVGQSC+AIGNPFGYE TLT GV+SGLGREIPSPNGR
Sbjct: 184 AVLKVDVEGYELKPIVVGTSRDLRVGQSCFAIGNPFGYENTLTTGVVSGLGREIPSPNGR 243

Query: 245 AIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTV 304
           AIRGAIQTDAAI+AGNSGGPL+DSYGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTV
Sbjct: 244 AIRGAIQTDAAINAGNSGGPLIDSYGHVIGVNTATFTRKGTGISSGVNFAIPIDTVVRTV 303

BLAST of Csa1G025870 vs. TAIR10
Match: AT4G18370.1 (AT4G18370.1 DEGP protease 5)

HSP 1 Score: 366.3 bits (939), Expect = 1.8e-101
Identity = 189/285 (66.32%), Positives = 221/285 (77.54%), Query Frame = 1

Query: 29  RRAILF-SPAALLPSLLA-----FPLPTHAALPQL---QDHLLQEEDRTVSLFQETSPSV 88
           RR ++F S  AL  SLL       P+ +  AL Q    ++ L +EE+R V+LFQ+TSPSV
Sbjct: 43  RRIMIFGSSLALTSSLLGSNQQRLPMESAIALEQFKEKEEELEEEEERNVNLFQKTSPSV 102

Query: 89  VYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSG 148
           VYI  +ELPK     S   +L +++N K++GTGSGFVWDK GHIVTNYHV++ LATD  G
Sbjct: 103 VYIEAIELPKT----SSGDILTDEENGKIEGTGSGFVWDKLGHIVTNYHVIAKLATDQFG 162

Query: 149 SQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVFGTSRNLRVGQSC 208
            QRCKV+LVD KG    KE KIVG DP+ DLAVLK+E EG EL P+V GTS +LRVGQSC
Sbjct: 163 LQRCKVSLVDAKGTRFSKEGKIVGLDPDNDLAVLKIETEGRELNPVVLGTSNDLRVGQSC 222

Query: 209 YAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVI 268
           +AIGNP+GYE TLT GV+SGLGREIPSPNG++I  AIQTDA I++GNSGGPL+DSYGH I
Sbjct: 223 FAIGNPYGYENTLTIGVVSGLGREIPSPNGKSISEAIQTDADINSGNSGGPLLDSYGHTI 282

Query: 269 GVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPYSERF 305
           GVNTATFTRKG+GMSSGVNFAIPIDTVVRTVPYLIVYGT Y +RF
Sbjct: 283 GVNTATFTRKGSGMSSGVNFAIPIDTVVRTVPYLIVYGTAYRDRF 323

BLAST of Csa1G025870 vs. TAIR10
Match: AT3G27925.1 (AT3G27925.1 DegP protease 1)

HSP 1 Score: 197.6 bits (501), Expect = 1.1e-50
Identity = 131/292 (44.86%), Positives = 171/292 (58.56%), Query Frame = 1

Query: 18  NSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQD----------HLLQEEDR 77
           +   + L FT   A+   P  LL + +A      AA P ++            L  +E  
Sbjct: 63  DDDDDTLHFTPFSAV--KPFFLLCTSVALSFSLFAASPAVESASAFVVSTPKKLQTDELA 122

Query: 78  TVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKV-KGTGSGFVWDKFGHIVTN 137
           TV LFQE +PSVVYI +L       A  Q    +  D L+V +G+GSGFVWDK GHIVTN
Sbjct: 123 TVRLFQENTPSVVYITNL-------AVRQDAFTL--DVLEVPQGSGSGFVWDKQGHIVTN 182

Query: 138 YHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIV 197
           YHV+        G+   +V L D        +AK+VGFD + D+AVL+++   ++L+PI 
Sbjct: 183 YHVIR-------GASDLRVTLADQTTF----DAKVVGFDQDKDVAVLRIDAPKNKLRPIP 242

Query: 198 FGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS-PNGRAIRGAIQTDAAISAG 257
            G S +L VGQ  +AIGNPFG + TLT GVISGL REI S   GR I+  IQTDAAI+ G
Sbjct: 243 VGVSADLLVGQKVFAIGNPFGLDHTLTTGVISGLRREISSAATGRPIQDVIQTDAAINPG 302

Query: 258 NSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYG 298
           NSGGPL+DS G +IG+NTA ++   +G SSGV F+IP+DTV   V  L+ +G
Sbjct: 303 NSGGPLLDSSGTLIGINTAIYS--PSGASSGVGFSIPVDTVGGIVDQLVRFG 330

BLAST of Csa1G025870 vs. TAIR10
Match: AT5G39830.1 (AT5G39830.1 Trypsin family protein with PDZ domain)

HSP 1 Score: 197.6 bits (501), Expect = 1.1e-50
Identity = 134/289 (46.37%), Positives = 171/289 (59.17%), Query Frame = 1

Query: 24  LPFTSRRAIL--------FSPAALLPSLLAFPLPTHAALPQLQDH------LLQEEDRTV 83
           +P T+RR +L        F+P+  L S LA   P+ A +  +         L   E R V
Sbjct: 62  VPSTTRRILLTSLFMNLCFNPSRYL-SALALGDPSVATVEDVSPTVFPAGPLFPTEGRIV 121

Query: 84  SLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKFGHIVTNYHV 143
            LF++ + SVV I D+ L   PQ      + I +      G GSG VWD  G+IVTNYHV
Sbjct: 122 QLFEKNTYSVVNIFDVTL--RPQLKMTGVVEIPE------GNGSGVVWDGQGYIVTNYHV 181

Query: 144 V-SALATDNS-GSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGHELKPIVF 203
           + +AL+ + S G    +VN++   G     E K+VG D   DLAVLKV+     LKPI  
Sbjct: 182 IGNALSRNPSPGDVVGRVNILASDGVQKNFEGKLVGADRAKDLAVLKVDAPETLLKPIKV 241

Query: 204 GTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNS 263
           G S +L+VGQ C AIGNPFG++ TLT GVISGL R+I S  G  I G IQTDAAI+ GNS
Sbjct: 242 GQSNSLKVGQQCLAIGNPFGFDHTLTVGVISGLNRDIFSQTGVTIGGGIQTDAAINPGNS 301

Query: 264 GGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVY 297
           GGPL+DS G++IG+NTA FT+  TG S+GV FAIP  TV++ VP LI +
Sbjct: 302 GGPLLDSKGNLIGINTAIFTQ--TGTSAGVGFAIPSSTVLKIVPQLIQF 339

BLAST of Csa1G025870 vs. TAIR10
Match: AT5G27660.1 (AT5G27660.1 Trypsin family protein with PDZ domain)

HSP 1 Score: 86.3 bits (212), Expect = 3.6e-17
Identity = 65/186 (34.95%), Positives = 93/186 (50.00%), Query Frame = 1

Query: 109 KGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQ-RCKVNLVDVKGNGIYKEAKIVGFDPE 168
           K  GSG + D  G I+T  HVV         S+ R  V L D    G   E  +V  D +
Sbjct: 146 KSIGSGTIIDADGTILTCAHVVVDFQNIRHSSKGRVDVTLQD----GRTFEGVVVNADLQ 205

Query: 169 YDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSP 228
            D+A++K++ +   L     G S  LR G    A+G P   + T+TAG++S + R+    
Sbjct: 206 SDIALVKIKSKT-PLPTAKLGFSSKLRPGDWVIAVGCPLSLQNTVTAGIVSCVDRKSSDL 265

Query: 229 N-GRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 288
             G   R  +QTD +I+AGNSGGPLV+  G VIGVN           + G+ F++PID+V
Sbjct: 266 GLGGKHREYLQTDCSINAGNSGGPLVNLDGEVIGVNIMKVL-----AADGLGFSVPIDSV 321

Query: 289 VRTVPY 293
            + + +
Sbjct: 326 SKIIEH 321

BLAST of Csa1G025870 vs. TAIR10
Match: AT1G65630.1 (AT1G65630.1 DegP protease 3)

HSP 1 Score: 54.3 bits (129), Expect = 1.5e-07
Identity = 55/197 (27.92%), Positives = 94/197 (47.72%), Query Frame = 1

Query: 95  SQQPMLIE--DDNLKVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKG 154
           S +P L +     ++ + TGSGFV      I+TN HVV+     N  S + + +     G
Sbjct: 104 SSKPRLFQPWQITMQSESTGSGFVISG-KKILTNAHVVA-----NQTSVKVRKH-----G 163

Query: 155 NGIYKEAKIVGFDPEYDLAVLKVELEG--HELKPIVFGTSRNLRVGQSCYAIGNPFGYEK 214
           +    +AK+     E DLA+L+++ +     + P+  G   +++   + Y +G P G + 
Sbjct: 164 STTKYKAKVQAVGHECDLAILEIDNDKFWEGMNPLELGDIPSMQ--DTVYVVGYPKGGDT 223

Query: 215 -TLTAGVISGLGREIPSPNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRK 274
            +++ GV+S +G    S +G  +  AIQ DAAI+ GNSGGP+      ++G   A    +
Sbjct: 224 ISVSKGVVSRVGPIKYSHSGTELL-AIQIDAAINNGNSGGPV------IMGNKVAGVAFE 280

Query: 275 GTGMSSGVNFAIPIDTV 287
               S  + + IP   +
Sbjct: 284 SLCYSDSIGYIIPTPVI 280

BLAST of Csa1G025870 vs. NCBI nr
Match: gi|449439571|ref|XP_004137559.1| (PREDICTED: protease Do-like 5, chloroplastic [Cucumis sativus])

HSP 1 Score: 612.1 bits (1577), Expect = 5.4e-172
Identity = 304/304 (100.00%), Positives = 304/304 (100.00%), Query Frame = 1

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH
Sbjct: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120
           LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF
Sbjct: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 120

Query: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180
           GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH
Sbjct: 121 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 180

Query: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240
           ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA
Sbjct: 181 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 240

Query: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300
           AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 241 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 300

Query: 301 SERF 305
           SERF
Sbjct: 301 SERF 304

BLAST of Csa1G025870 vs. NCBI nr
Match: gi|659068445|ref|XP_008444456.1| (PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo])

HSP 1 Score: 572.0 bits (1473), Expect = 6.2e-160
Identity = 288/305 (94.43%), Positives = 293/305 (96.07%), Query Frame = 1

Query: 1   MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH 60
           MALGLLGIPPLP PAPPNSS+NPLPFTSRRAILFSP AL+ SLLAFPLPT AALPQLQD 
Sbjct: 1   MALGLLGIPPLPIPAPPNSSENPLPFTSRRAILFSPTALMASLLAFPLPTPAALPQLQDP 60

Query: 61  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQP-MLIEDDNLKVKGTGSGFVWDK 120
           LLQEEDR VSLFQETSPSVVYI DLEL KNPQ  S++P MLIEDDN+KVKGTGSGFVWDK
Sbjct: 61  LLQEEDRIVSLFQETSPSVVYIKDLELAKNPQNRSEEPPMLIEDDNVKVKGTGSGFVWDK 120

Query: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEG 180
           FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEA IVGFDPEYDLAVLKVELEG
Sbjct: 121 FGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEANIVGFDPEYDLAVLKVELEG 180

Query: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240
           HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD
Sbjct: 181 HELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTD 240

Query: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300
           AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP
Sbjct: 241 AAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTP 300

Query: 301 YSERF 305
           YSERF
Sbjct: 301 YSERF 305

BLAST of Csa1G025870 vs. NCBI nr
Match: gi|590622421|ref|XP_007025045.1| (Protease degS, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 420.6 bits (1080), Expect = 2.3e-114
Identity = 215/304 (70.72%), Positives = 243/304 (79.93%), Query Frame = 1

Query: 10  PLPFPAPPNSSQNP----LPFTSRRAILFSPAALLPSLLAFPLPTHA-----ALPQLQDH 69
           PLP     +SS++     L  T RRAI+      + SLL    P  +     AL Q  + 
Sbjct: 13  PLPTTTTSSSSESSDNKSLVITRRRAIVSGSTVAVASLLQLSNPVSSLYSAIALQQQDEE 72

Query: 70  LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF 129
           L +EEDR V LFQETSPSVV+I DLEL K P++ SQ+  L ED++ KV+GTGSGF+WDKF
Sbjct: 73  LDEEEDRIVRLFQETSPSVVFIKDLELAKIPKSSSQEVTLAEDEDAKVEGTGSGFIWDKF 132

Query: 130 GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH 189
           GHIVTNYHVV  LATD SG QRCKV LVD +G   YKE KIVG DP YDLAVLKV++EG+
Sbjct: 133 GHIVTNYHVVDKLATDQSGLQRCKVFLVDARGTSFYKEGKIVGIDPAYDLAVLKVDVEGY 192

Query: 190 ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA 249
           ELKP+V GTSR+LRVGQSC+AIGNPFGYE TLT GV+SGLGREIPSPNGRAIRGAIQTDA
Sbjct: 193 ELKPVVLGTSRDLRVGQSCFAIGNPFGYENTLTTGVVSGLGREIPSPNGRAIRGAIQTDA 252

Query: 250 AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY 305
           AI+AGNSGGPL+DSYGHVIGVNTATFTRKGTG+SSGVNFAIPIDTVVRTVPYLIVYGTPY
Sbjct: 253 AINAGNSGGPLIDSYGHVIGVNTATFTRKGTGVSSGVNFAIPIDTVVRTVPYLIVYGTPY 312

BLAST of Csa1G025870 vs. NCBI nr
Match: gi|595863728|ref|XP_007211641.1| (hypothetical protein PRUPE_ppa008782mg [Prunus persica])

HSP 1 Score: 418.3 bits (1074), Expect = 1.1e-113
Identity = 217/318 (68.24%), Positives = 247/318 (77.67%), Query Frame = 1

Query: 1   MALGLLG-IPPLPFP-----APPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLP----- 60
           + LG L  I P P P     +  N SQ  L  T RRAI+  P+ ++ SLL F  P     
Sbjct: 2   VVLGSLSQIKPSPIPTNSSSSSSNPSQKGLLLTRRRAIVLGPSVVVASLLHFYNPISQQP 61

Query: 61  -THAALPQLQ--DHLLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNL 120
            +H AL Q Q  D L QEEDR+V LFQETSPSVV+I DLE+ K+ +A      L ED N 
Sbjct: 62  SSHLALAQQQQEDELQQEEDRSVHLFQETSPSVVFIKDLEIDKSLKASPDAVFLSEDGNS 121

Query: 121 KVKGTGSGFVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDP 180
           KV+GTGSGF+WDKFGHIVTNYHVV+ LATD +G QRCKV LVD +GNG Y E KIVG DP
Sbjct: 122 KVEGTGSGFIWDKFGHIVTNYHVVAKLATDQTGLQRCKVYLVDARGNGFYSEGKIVGVDP 181

Query: 181 EYDLAVLKVELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPS 240
            YDLAVLKV++EGHELKP+V GTS  L VGQSC+AIGNP+GYE TLT GV+SGLGREIPS
Sbjct: 182 AYDLAVLKVDVEGHELKPVVLGTSNGLHVGQSCFAIGNPYGYENTLTIGVVSGLGREIPS 241

Query: 241 PNGRAIRGAIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTV 300
           P+G+AIRGAIQTDAAI++GNSGGPL+DSYGH+IGVNTATFTRKGTG SSGVNFAIPIDTV
Sbjct: 242 PDGKAIRGAIQTDAAINSGNSGGPLIDSYGHIIGVNTATFTRKGTGASSGVNFAIPIDTV 301

Query: 301 VRTVPYLIVYGTPYSERF 305
           VRTVPYLIVYGTPY +RF
Sbjct: 302 VRTVPYLIVYGTPYRDRF 319

BLAST of Csa1G025870 vs. NCBI nr
Match: gi|645238068|ref|XP_008225503.1| (PREDICTED: protease Do-like 5, chloroplastic [Prunus mume])

HSP 1 Score: 414.8 bits (1065), Expect = 1.3e-112
Identity = 212/310 (68.39%), Positives = 242/310 (78.06%), Query Frame = 1

Query: 8   IPPLPFP-----APPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLP------THAALPQ 67
           I P P P     +  N SQ  +    RRAI+  P+ ++ SLL F  P      +H AL Q
Sbjct: 10  IKPSPIPTNSSSSSSNPSQKGILLKRRRAIVLGPSVVVASLLHFYNPISPQPSSHLALAQ 69

Query: 68  LQ--DHLLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSG 127
            Q  D L Q+EDR+V LFQETSPSVV+I DLE+ K+ +A      L ED N KV+GTGSG
Sbjct: 70  QQQEDELQQQEDRSVHLFQETSPSVVFIKDLEIDKSLKASPDAVFLSEDGNSKVEGTGSG 129

Query: 128 FVWDKFGHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLK 187
           F+WDKFGHIVTNYHVV+ LATD +G QRCKV LVD +GNG Y E KIVG DP YDLAVLK
Sbjct: 130 FIWDKFGHIVTNYHVVAKLATDQTGLQRCKVYLVDARGNGFYSEGKIVGVDPAYDLAVLK 189

Query: 188 VELEGHELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRG 247
           V++EGHELKP+V GTS  L VGQSC+AIGNP+GYE TLT GV+SGLGREIPSP+G+AIRG
Sbjct: 190 VDVEGHELKPVVLGTSNGLHVGQSCFAIGNPYGYENTLTIGVVSGLGREIPSPDGKAIRG 249

Query: 248 AIQTDAAISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLI 305
           AIQTDAAI+AGNSGGPL+DSYGH+IGVNTATFTRKGTG SSGVNFAIPIDTVVRTVPYLI
Sbjct: 250 AIQTDAAINAGNSGGPLIDSYGHIIGVNTATFTRKGTGASSGVNFAIPIDTVVRTVPYLI 309

The following BLAST results are available for this feature:
Swiss-Prot
Match Name	E-value	Identity	Description
DEGP5_ARATH	3.2e-100	66.32	Protease Do-like 5, chloroplastic OS=Arabidopsis thaliana GN=DEGP5 PE=1 SV=3
DEGP1_ARATH	2.0e-49	44.86	Protease Do-like 1, chloroplastic OS=Arabidopsis thaliana GN=DEGP1 PE=1 SV=2
DEGP8_ARATH	2.0e-49	46.37	Protease Do-like 8, chloroplastic OS=Arabidopsis thaliana GN=DEGP8 PE=1 SV=1
DEGPL_HAHCH	1.0e-29	44.20	Probable periplasmic serine endoprotease DegP-like OS=Hahella chejuensis (strain...
DEGPL_PSEF1	4.3e-28	38.86	Probable periplasmic serine endoprotease DegP-like OS=Pseudomonas fulva (strain ...

TrEMBL
Match Name	E-value	Identity	Description
A0A0A0LV65_CUCSA	3.8e-172	100.00	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G025870 PE=4 SV=1
A0A061GEY2_THECC	1.6e-114	70.72	Protease degS, putative isoform 1 OS=Theobroma cacao GN=TCM_029456 PE=4 SV=1
M5X1A6_PRUPE	8.0e-114	68.24	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008782mg PE=4 SV=1
A0A067KSL0_JATCU	1.5e-112	71.33	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_04636 PE=4 SV=1
A0A0D2VI33_GOSRA	3.3e-112	69.43	Uncharacterized protein OS=Gossypium raimondii GN=B456_011G045700 PE=4 SV=1

TAIR10
Match Name	E-value	Identity	Description
AT4G18370.1	1.8e-101	66.32	DEGP protease 5
AT3G27925.1	1.1e-50	44.86	DegP protease 1
AT5G39830.1	1.1e-50	46.37	Trypsin family protein with PDZ domain
AT5G27660.1	3.6e-17	34.95	Trypsin family protein with PDZ domain
AT1G65630.1	1.5e-07	27.92	DegP protease 3

NCBI nr
Match Name	E-value	Identity	Description
gi|449439571|ref|XP_004137559.1|	5.4e-172	100.00	PREDICTED: protease Do-like 5, chloroplastic [Cucumis sativus]
gi|659068445|ref|XP_008444456.1|	6.2e-160	94.43	PREDICTED: protease Do-like 5, chloroplastic [Cucumis melo]
gi|590622421|ref|XP_007025045.1|	2.3e-114	70.72	Protease degS, putative isoform 1 [Theobroma cacao]
gi|595863728|ref|XP_007211641.1|	1.1e-113	68.24	hypothetical protein PRUPE_ppa008782mg [Prunus persica]
gi|645238068|ref|XP_008225503.1|	1.3e-112	68.39	PREDICTED: protease Do-like 5, chloroplastic [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001940	Peptidase_S1C
IPR009003	Peptidase_S1_PA
Vocabulary: Molecular Function
Term	Definition
GO:0004252	serine-type endopeptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0055114	oxidation-reduction process
biological_process	GO:0010206	photosystem II repair
biological_process	GO:0006508	proteolysis
cellular_component	GO:0009543	chloroplast thylakoid lumen
molecular_function	GO:0032440	2-alkenal reductase [NAD(P)] activity
molecular_function	GO:0004252	serine-type endopeptidase activity
This gene is associated with the following unigenes:
Unigene Name	Analysis Name	Sequence type in Unigene
CU138970	cucumber EST collection version 3.0	transcribed_cluster
CU139938	cucumber EST collection version 3.0	transcribed_cluster
CU142683	cucumber EST collection version 3.0	transcribed_cluster
CU163115	cucumber EST collection version 3.0	transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Csa1G025870.1	Csa1G025870.1	mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name	Unique Name	Type
CU163115	CU163115	transcribed_cluster
CU139938	CU139938	transcribed_cluster
CU142683	CU142683	transcribed_cluster
CU138970	CU138970	transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001940	Peptidase S1C	PRINTS	PR00834	PROTEASES2C	coord: 232..249, score: 4.5E-27; coord: 195..219, score: 4.5E-27; coord: 254..271, score: 4.5E-27; coord: 121..133, score: 4.5
IPR009003	Peptidase S1, PA clan	unknown	SSF50494	Trypsin-like serine proteases	coord: 70..290, score: 5.42
None	No IPR available	GENE3D	G3DSA:2.40.10.10		coord: 190..297, score: 4.4E-39; coord: 61..189, score: 2.0
None	No IPR available	PANTHER	PTHR22939	SERINE PROTEASE FAMILY S1C HTRA-RELATED	coord: 3..298, score: 7.9E
None	No IPR available	PANTHER	PTHR22939:SF76	PROTEASE DO-LIKE 5, CHLOROPLASTIC	coord: 3..298, score: 7.9E
None	No IPR available	PFAM	PF13365	Trypsin_2	coord: 112..261, score: 1.4
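
The `coord` ranges in the alignment column can be used to cut the matched region out of the protein sequence. The sketch below does this for the PFAM Trypsin_2 hit (112..261). The helper `extract_region` is hypothetical, and it assumes the coordinates are 1-based, inclusive positions on the protein sequence given earlier in this record.

```python
# Protein sequence from this record, without the trailing '*' stop marker.
PROTEIN = (
    "MALGLLGIPPLPFPAPPNSSQNPLPFTSRRAILFSPAALLPSLLAFPLPTHAALPQLQDH"
    "LLQEEDRTVSLFQETSPSVVYINDLELPKNPQAPSQQPMLIEDDNLKVKGTGSGFVWDKF"
    "GHIVTNYHVVSALATDNSGSQRCKVNLVDVKGNGIYKEAKIVGFDPEYDLAVLKVELEGH"
    "ELKPIVFGTSRNLRVGQSCYAIGNPFGYEKTLTAGVISGLGREIPSPNGRAIRGAIQTDA"
    "AISAGNSGGPLVDSYGHVIGVNTATFTRKGTGMSSGVNFAIPIDTVVRTVPYLIVYGTPY"
    "SERF"
)

def extract_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"coordinates {start}..{end} are out of range")
    # Convert to Python's 0-based, end-exclusive slice.
    return sequence[start - 1:end]

# PFAM PF13365 (Trypsin_2), coord: 112..261 in the table above.
trypsin_2 = extract_region(PROTEIN, 112, 261)
print(len(PROTEIN), len(trypsin_2))
print(trypsin_2)
```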

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None