CsaV3_3G017860 (gene) Cucumber (Chinese Long) v3

NameCsaV3_3G017860
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionaspartic proteinase PCS1
Locationchr3 : 13485208 .. 13486552 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CAACACAATCAACAATGCTTCTAATTCTCTTCTCTCTCTCATTATTCACTCTCTCCTTCTCTCAATCCAATTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCACTGAAAAACCCTCCAATATTACCCCATTATACTACTCTTCCCAGCTTTACGTCAAAAAGCCATCATCCCATGGCCCCTTCAAGCTTCCTTTCAAATACTCCTCCTCTGCCCTCGTCGTCTCTCTTCCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAATTCTCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGGCTCCACCGAAAACAGGGGTATTTTGGGAATGAATCATGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGCCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCGGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGTTAAACATCCCCCCAGCCGCTTTCAAACCGGATGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAGGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATATGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTGACGGTGGAGGTGGGCCGCAGGATTGGCGACATGTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAGGTGGAAAAAGGAGTGAAGTGTGTGGGGATTGGACGGTCAGGAAGGCTTGGGATTGGAAGTAATATAATCGGTACCGTTCATCAACAGAATATGTGGGTGGAGTATGATTTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGATGATGGACAGTAAAGATTTATACACGTGTGTGGGT

mRNA sequence

ATGCTTCTAATTCTCTTCTCTCTCTCATTATTCACTCTCTCCTTCTCTCAATCCAATTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCACTGAAAAACCCTCCAATATTACCCCATTATACTACTCTTCCCAGCTTTACGTCAAAAAGCCATCATCCCATGGCCCCTTCAAGCTTCCTTTCAAATACTCCTCCTCTGCCCTCGTCGTCTCTCTTCCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAATTCTCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGGCTCCACCGAAAACAGGGGTATTTTGGGAATGAATCATGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGCCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCGGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGTTAAACATCCCCCCAGCCGCTTTCAAACCGGATGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAGGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATATGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTGACGGTGGAGGTGGGCCGCAGGATTGGCGACATGTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAGGTGGAAAAAGGAGTGAAGTGTGTGGGGATTGGACGGTCAGGAAGGCTTGGGATTGGAAGTAATATAATCGGTACCGTTCATCAACAGAATATGTGGGTGGAGTATGATTTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGA

Coding sequence (CDS)

ATGCTTCTAATTCTCTTCTCTCTCTCATTATTCACTCTCTCCTTCTCTCAATCCAATTCCCTCTCTCTCCCCTTCCCTCTTTCTCTCACTGAAAAACCCTCCAATATTACCCCATTATACTACTCTTCCCAGCTTTACGTCAAAAAGCCATCATCCCATGGCCCCTTCAAGCTTCCTTTCAAATACTCCTCCTCTGCCCTCGTCGTCTCTCTTCCGATCGGAACGCCGCCACAGCCCACTGACTTGGTTCTAGACACCGGCAGCCAACTCTCTTGGATTCAATGTCATGACAAAAAAGTTAAGAAAAGATTGCCCCCCTTGCCCAAGCCTAAAACCGCCACCTTTGATCCTTCTCTCTCCTCTTCTTTTTCTCTCCTCCCTTGTAATCACCCCATCTGCAAACCCCGAATTCCAGATTTTACTCTTCCCACTTCTTGTGATCAAAATCGCCTCTGCCACTACTCCTACTTCTACGCTGACGGTACCTTGGCTGAGGGAAATCTCGTCAGAGAAAAATTTACCTTCTCTAATTCTCTTTCTACCCCTCCCGTCATCCTCGGTTGCGCTCAAGGCTCCACCGAAAACAGGGGTATTTTGGGAATGAATCATGGACGTTTGTCCTTTATCTCCCAAGCTAAAATCTCCAAATTCTCCTATTGCGTTCCGAGTCGAACCGGGCCTAATCCCACCGGGCTATTCTACCTGGGAGATAACCCCAATTCTTCCAAATTCAAATACGTCACCATGTTGACTTTTCCGGAAAGTCAAAGCTCTCCGAATCTCGACCCACTGGCTTACACTCTCCCTATGAAGGCAATAAAAATAGCCGGAAAACGGTTAAACATCCCCCCAGCCGCTTTCAAACCGGATGCGGGTGGGTCGGGTCAAACCATGATTGACTCCGGTTCGGACCTGACTTATTTAGTGGATGAAGCGTATGAGAAGGTTAAAGAGGAGGTAGTGAGATTAGTGGGTGCGATGATGAAGAAAGGCTACGTATATGCCGCCGTAGCCGACATGTGTTTCGACGCCGGTGTGACGGTGGAGGTGGGCCGCAGGATTGGCGACATGTCGTTTGAGTTTGATAATGGAGTGGAGATTTTCGTGGGGAGAGGAGAAGGGGTTTTGACGGAGGTGGAAAAAGGAGTGAAGTGTGTGGGGATTGGACGGTCAGGAAGGCTTGGGATTGGAAGTAATATAATCGGTACCGTTCATCAACAGAATATGTGGGTGGAGTATGATTTGGCCAATAAGAGAGTAGGGTTTGGTGGAGCTGAGTGTAGCAGATTGAAGTGA

Protein sequence

MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK
BLAST of CsaV3_3G017860 vs. NCBI nr
Match: XP_011651212.1 (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 830.9 bits (2145), Expect = 2.0e-237
Identity = 429/431 (99.54%), Postives = 430/431 (99.77%), Query Frame = 0

Query: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF
Sbjct: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180

Query: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPN 240
           TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTG NPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 431

BLAST of CsaV3_3G017860 vs. NCBI nr
Match: XP_004140731.2 (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 778.9 bits (2010), Expect = 9.2e-222
Identity = 407/431 (94.43%), Postives = 412/431 (95.59%), Query Frame = 0

Query: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFS  XXXXXXXXXXXXXXXXXX   EKPSN  P  YSSQLY K+PSS+G FKLPF
Sbjct: 1   MLLILFSXXXXXXXXXXXXXXXXXXXXXXXEKPSNTIP-SYSSQLYAKRPSSYGSFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS SLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS 180

Query: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPN 240
           TPPVILGCAQ STENRGILGMN GRLSFISQAKISKFSYCVPSRTG NPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYA VADMCFDAGVT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS RLGIGSNIIGTVHQQNMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 430

BLAST of CsaV3_3G017860 vs. NCBI nr
Match: XP_008457122.1 (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 767.3 bits (1980), Expect = 2.8e-218
Identity = 382/431 (88.63%), Postives = 394/431 (91.42%), Query Frame = 0

Query: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFSLS                  SL+EKPSNI+P+ Y SQLY KKPSSHG FKLPF
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPSNISPI-YGSQLYAKKPSSHGSFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD KVKK+LPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KVKKKLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNSLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLS 180

Query: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPN 240
           TPPVILGCAQ STENRGILGMN GRLSFISQAKISKFSYCVP+RTG NPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SS+FKYVTMLTFPESQSSPNLDPLAYTLPMK IKIAGKRLNI PAAFKPDAGGSGQTMID
Sbjct: 241 SSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGA MKKGYVYAAVADMCFDA VT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEI VGRGEGVLTEVEKGVKCVG GRS RLGIGSNIIGTVHQQNMWVEYDL N+R+
Sbjct: 361 FDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRI 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 429

BLAST of CsaV3_3G017860 vs. NCBI nr
Match: XP_022983616.1 (aspartic proteinase PCS1-like [Cucurbita maxima])

HSP 1 Score: 667.9 bits (1722), Expect = 2.3e-188
Identity = 349/431 (80.97%), Postives = 379/431 (87.94%), Query Frame = 0

Query: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLL LF +S   XXXXXXXXXXXXXXX  +++PS ++PL+ S        SS+G  KLPF
Sbjct: 1   MLLSLFYISLLSXXXXXXXXXXXXXXXXXSKRPSPVSPLFSSL-------SSYGSVKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KY S+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH K  KK L    KPKTA+FDPSLS
Sbjct: 61  KY-STALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLL----KPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNS +
Sbjct: 121 SSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRT 180

Query: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPN 240
           TPPVILGCAQ STENRGILGMN GRLSF+SQAKISKFSYCVP RTGP+ TGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           S+ FKY+++LTFP+SQ SPNLDPLAYTLP+K IKIAG RLNI  A FKPD  GSGQTMID
Sbjct: 241 SANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEE+V+LVG +MKKGY YAAVADMCF+ G T EVGRRI DMSFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           F+NGVEI VG+GEGVLTEVEKGVKCVG GRSGRLGI SNIIGTVHQ+N WVEYDLAN+R+
Sbjct: 361 FENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRI 419

Query: 421 GFGGAECSRLK 432
           GFGGA+CSRLK
Sbjct: 421 GFGGADCSRLK 419

BLAST of CsaV3_3G017860 vs. NCBI nr
Match: XP_022934878.1 (aspartic proteinase PCS1-like [Cucurbita moschata])

HSP 1 Score: 661.4 bits (1705), Expect = 2.1e-186
Identity = 324/398 (81.41%), Postives = 353/398 (88.69%), Query Frame = 0

Query: 34  SNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWI 93
           S+++PL+ S        S++G  KLPFKY S+ALVVSLPIGTPPQPTDLVLDTGSQLSWI
Sbjct: 38  SSVSPLFSSL-------SAYGSVKLPFKY-STALVVSLPIGTPPQPTDLVLDTGSQLSWI 97

Query: 94  QCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCH 153
           QCH K  KK L    KPKT +FDPSLSSSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCH
Sbjct: 98  QCHRKVHKKLL----KPKTTSFDPSLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCH 157

Query: 154 YSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAK 213
           YSYFYADGTLAEGNLVREKFTFSNS +TPPVILGCAQ STENRGILGMN GRLSF+SQAK
Sbjct: 158 YSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAK 217

Query: 214 ISKFSYCVPSRTGPNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAI 273
           ISKFSYCVP RTGP+ TGLFYLGDNPNS+ FKY+++LTFP+SQ SPNLDPLAYTLP+K I
Sbjct: 218 ISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGI 277

Query: 274 KIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYV 333
           KIAG RLNI  A FKPD  GSGQTMIDSGSDLTYLVDEAYEKVKEE+V+LVG +MKKGY 
Sbjct: 278 KIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYE 337

Query: 334 YAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGR 393
           YAAVADMCF+ G T EVGRRI DMSFEF+NGVEI VG+GEGVLTEVEKGVKCVG GRSGR
Sbjct: 338 YAAVADMCFNDGETAEVGRRIRDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGR 397

Query: 394 LGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK 432
           LGI SNIIGTVHQ+N W+EYDLAN+R+GFGGA+CSRLK
Sbjct: 398 LGIASNIIGTVHQKNTWIEYDLANRRIGFGGADCSRLK 423

BLAST of CsaV3_3G017860 vs. TAIR10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 542.7 bits (1397), Expect = 2.0e-154
Identity = 265/384 (69.01%), Postives = 311/384 (80.99%), Query Frame = 0

Query: 50  PSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPK 109
           PSS   F+   KY S AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LP    
Sbjct: 64  PSSPYTFRSNIKY-SMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLP---- 123

Query: 110 PKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLV 169
           P T +FDPSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV
Sbjct: 124 PPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLV 183

Query: 170 REKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRT---G 229
           +EKFTFSNS +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G
Sbjct: 184 KEKFTFSNSQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPG 243

Query: 230 PNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAA 289
              TG FYLGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + 
Sbjct: 244 LASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSV 303

Query: 290 FKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGV 349
           F+PDAGGSGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   
Sbjct: 304 FRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNH 363

Query: 350 TVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQ 409
           ++E+GR IGD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQ
Sbjct: 364 SMEIGRLIGDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQ 423

Query: 410 QNMWVEYDLANKRVGFGGAECSRL 431
           QN+WVE+D+ N+RVGF  AEC  L
Sbjct: 424 QNLWVEFDVTNRRVGFSKAECRLL 441

BLAST of CsaV3_3G017860 vs. TAIR10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 517.3 bits (1331), Expect = 9.0e-147
Identity = 261/397 (65.74%), Postives = 307/397 (77.33%), Query Frame = 0

Query: 40  YYSSQLYVKKPSSHGP---FKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH 99
           + +S L  K PS   P   F+  FKY S AL++SLPIGTPPQ   +VLDTGSQLSWIQCH
Sbjct: 43  FTTSLLSRKNPSPSSPPYNFRSRFKY-SMALIISLPIGTPPQAQQMVLDTGSQLSWIQCH 102

Query: 100 DKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSY 159
               +K+LP  PKPKT +FDPSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSY
Sbjct: 103 ----RKKLP--PKPKT-SFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSY 162

Query: 160 FYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAKISK 219
           FYADGT AEGNLV+EK TFSN+  TPP+ILGCA  S+++RGILGMN GRLSF+SQAKISK
Sbjct: 163 FYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSDDRGILGMNRGRLSFVSQAKISK 222

Query: 220 FSYCVP---SRTGPNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAI 279
           FSYC+P   +R G  PTG FYLGDNPNS  FKYV++LTFPESQ  PNLDPLAYT+PM  I
Sbjct: 223 FSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGI 282

Query: 280 KIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYV 339
           +   K+LNI  + F+PDAGGSGQTM+DSGS+ T+LVD AY+KV+ E++  VG  +KKGYV
Sbjct: 283 RFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYV 342

Query: 340 YAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGR 399
           Y   ADMCFD  V + + R IGD+ F F  GVEI V + E VL  V  G+ CVGIGRS  
Sbjct: 343 YGGTADMCFDGNVAM-IPRLIGDLVFVFTRGVEILVPK-ERVLVNVGGGIHCVGIGRSSM 402

Query: 400 LGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRL 431
           LG  SNIIG VHQQN+WVE+D+ N+RVGF  A+CSR+
Sbjct: 403 LGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRV 429

BLAST of CsaV3_3G017860 vs. TAIR10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 235.3 bits (599), Expect = 6.9e-62
Identity = 139/389 (35.73%), Postives = 211/389 (54.24%), Query Frame = 0

Query: 57  KLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFD 116
           KL F+++ + L V+L +G PPQ   +VLDTGS+LSW+ C      K+ P L     + F+
Sbjct: 56  KLSFRHNVT-LTVTLAVGDPPQNISMVLDTGSELSWLHC------KKSPNL----GSVFN 115

Query: 117 PSLSSSFSLLPCNHPICKPRIPDFTLPTSCD-QNRLCHYSYFYADGTLAEGNLVREKFTF 176
           P  SS++S +PC+ PIC+ R  D  +P SCD +  LCH +  YAD T  EGNL  E F  
Sbjct: 116 PVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI 175

Query: 177 SNSLSTPPVILGC--------AQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGP 236
             S++ P  + GC        ++   ++ G++GMN G LSF++Q   SKFSYC+   +G 
Sbjct: 176 -GSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGS 235

Query: 237 NPTGLFYLGDNPNS--SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPA 296
           + +G   LGD   S     +Y  ++   +S   P  D +AYT+ ++ I++  K L++P +
Sbjct: 236 DSSGFLLLGDASYSWLGPIQYTPLVL--QSTPLPYFDRVAYTVQLEGIRVGSKILSLPKS 295

Query: 297 AFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK----KGYVYAAVADMC 356
            F PD  G+GQTM+DSG+  T+L+   Y  +K E +    ++++      +V+    D+C
Sbjct: 296 VFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLC 355

Query: 357 FDAGVTVEVGRRIGDMSFEFDNGVEIFVG------RGEGVLTEVEKGVKCVGIGRSGRLG 416
           +  G T         M      G E+ V       R  G  +E ++ V C   G S  LG
Sbjct: 356 YKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG 415

Query: 417 IGSNIIGTVHQQNMWVEYDLANKRVGFGG 425
           I + +IG  HQQN+W+E+DLA  RVGF G
Sbjct: 416 IEAFVIGHHHQQNVWMEFDLAKSRVGFAG 427

BLAST of CsaV3_3G017860 vs. TAIR10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 235.3 bits (599), Expect = 6.9e-62
Identity = 142/380 (37.37%), Postives = 202/380 (53.16%), Query Frame = 0

Query: 73  IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPI 132
           +GTPPQ   +V+DTGS+LSW++C+           P P    FDP+ SSS+S +PC+ P 
Sbjct: 79  VGTPPQNISMVIDTGSELSWLRCNRSS-------NPNP-VNNFDPTRSSSYSPIPCSSPT 138

Query: 133 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGC---- 192
           C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS +   +I GC    
Sbjct: 139 CRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSV 198

Query: 193 ----AQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPNSSKF 252
                +  T+  G+LGMN G LSFISQ    KFSYC+ S T   P G   LGD    S F
Sbjct: 199 SGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-SGTDDFP-GFLLLGD----SNF 258

Query: 253 KYVTMLTFPE----SQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 312
            ++T L +      S   P  D +AYT+ +  IK+ GK L IP +   PD  G+GQTM+D
Sbjct: 259 TWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVD 318

Query: 313 SGSDLTYLVDEAYEKVKEEVVRLVGAMM----KKGYVYAAVADMCFD-AGVTVEVG--RR 372
           SG+  T+L+   Y  ++   +     ++       +V+    D+C+  + V +  G   R
Sbjct: 319 SGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHR 378

Query: 373 IGDMSFEFDNGVEIFVGRGEGVLTEV------EKGVKCVGIGRSGRLGIGSNIIGTVHQQ 428
           +  +S  F+ G EI V  G+ +L  V         V C   G S  +G+ + +IG  HQQ
Sbjct: 379 LPTVSLVFE-GAEIAVS-GQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQ 438

BLAST of CsaV3_3G017860 vs. TAIR10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 157.1 bits (396), Expect = 2.4e-38
Identity = 120/405 (29.63%), Postives = 181/405 (44.69%), Query Frame = 0

Query: 46  YVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLP 105
           +VK P   G         S    V L IG PPQ   L+ DTGS L W++C   +      
Sbjct: 68  FVKSPVVSGAAS-----GSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS 127

Query: 106 PLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRL---CHYSYFYADGT 165
           P        F P  SS+FS   C  P+C+  +P       C+  R+   CHY Y YADG+
Sbjct: 128 P-----ATVFFPRHSSTFSPAHCYDPVCR-LVPKPDRAPICNHTRIHSTCHYEYGYADGS 187

Query: 166 LAEGNLVRE----KFTFSNSLSTPPVILGC--------AQGSTEN--RGILGMNHGRLSF 225
           L  G   RE    K +         V  GC          G++ N   G++G+  G +SF
Sbjct: 188 LTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISF 247

Query: 226 ISQAKI---SKFSYCVPSRT-GPNPTGLFYLGDNPNS-SKFKYVTMLTFPESQSSPNLDP 285
            SQ      +KFSYC+   T  P PT    +G+  +  SK  +  +LT P       L P
Sbjct: 248 ASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNP-------LSP 307

Query: 286 LAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRL 345
             Y + +K++ + G +L I P+ ++ D  G+G T++DSG+ L +L + AY  V   V R 
Sbjct: 308 TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR 367

Query: 346 VGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGV 405
           V   +          D+C +     +  + +  + FEF  G  +FV        E E+ +
Sbjct: 368 VKLPIADALTPG--FDLCVNVSGVTKPEKILPRLKFEFSGGA-VFVPPPRNYFIETEEQI 427

Query: 406 KCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 429
           +C+ I +S    +G ++IG + QQ    E+D    R+GF    C+
Sbjct: 428 QCLAI-QSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450

BLAST of CsaV3_3G017860 vs. Swiss-Prot
Match: sp|Q9LZL3|PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 235.3 bits (599), Expect = 1.2e-60
Identity = 142/380 (37.37%), Postives = 202/380 (53.16%), Query Frame = 0

Query: 73  IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPI 132
           +GTPPQ   +V+DTGS+LSW++C+           P P    FDP+ SSS+S +PC+ P 
Sbjct: 79  VGTPPQNISMVIDTGSELSWLRCNRSS-------NPNP-VNNFDPTRSSSYSPIPCSSPT 138

Query: 133 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGC---- 192
           C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS +   +I GC    
Sbjct: 139 CRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSV 198

Query: 193 ----AQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPNSSKF 252
                +  T+  G+LGMN G LSFISQ    KFSYC+ S T   P G   LGD    S F
Sbjct: 199 SGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-SGTDDFP-GFLLLGD----SNF 258

Query: 253 KYVTMLTFPE----SQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 312
            ++T L +      S   P  D +AYT+ +  IK+ GK L IP +   PD  G+GQTM+D
Sbjct: 259 TWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVD 318

Query: 313 SGSDLTYLVDEAYEKVKEEVVRLVGAMM----KKGYVYAAVADMCFD-AGVTVEVG--RR 372
           SG+  T+L+   Y  ++   +     ++       +V+    D+C+  + V +  G   R
Sbjct: 319 SGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHR 378

Query: 373 IGDMSFEFDNGVEIFVGRGEGVLTEV------EKGVKCVGIGRSGRLGIGSNIIGTVHQQ 428
           +  +S  F+ G EI V  G+ +L  V         V C   G S  +G+ + +IG  HQQ
Sbjct: 379 LPTVSLVFE-GAEIAVS-GQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQ 438

BLAST of CsaV3_3G017860 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 171.0 bits (432), Expect = 2.9e-41
Identity = 117/367 (31.88%), Postives = 173/367 (47.14%), Query Frame = 0

Query: 68  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLP 127
           +++L IGTP QP   ++DTGS L W QC      +         T  F+P  SSSFS LP
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQC------QPCTQCFNQSTPIFNPQGSSSFSTLP 155

Query: 128 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILG 187
           C+  +C+       L +    N  C Y+Y Y DG+  +G++  E  TF  S+S P +  G
Sbjct: 156 CSSQLCQ------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTF-GSVSIPNITFG 215

Query: 188 CAQ-----GSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPNSS 247
           C +     G     G++GM  G LS  SQ  ++KFSYC+       P+ L  LG   NS 
Sbjct: 216 CGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLL-LGSLANSV 275

Query: 248 KFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDA-GGSGQTMIDS 307
                       SQ      P  Y + +  + +   RL I P+AF  ++  G+G  +IDS
Sbjct: 276 TAGSPNTTLIQSSQI-----PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDS 335

Query: 308 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEF 367
           G+ LTY V+ AY+ V++E +  +   +  G   ++  D+CF    +     +I      F
Sbjct: 336 GTTLTYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQT-PSDPSNLQIPTFVMHF 395

Query: 368 DNG-VEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 427
           D G +E+     E        G+ C+ +G S +   G +I G + QQNM V YD  N  V
Sbjct: 396 DGGDLEL---PSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVV 434

BLAST of CsaV3_3G017860 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.2e-39
Identity = 122/389 (31.36%), Postives = 178/389 (45.76%), Query Frame = 0

Query: 52  SHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPK 111
           S    + P        ++++ IGTP      ++DTGS L W QC         P      
Sbjct: 81  SSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQP------ 140

Query: 112 TATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVRE 171
           T  F+P  SSSFS LPC    C+       LP+    N  C Y+Y Y DG+  +G +  E
Sbjct: 141 TPIFNPQDSSSFSTLPCESQYCQ------DLPSETCNNNECQYTYGYGDGSTTQGYMATE 200

Query: 172 KFTFSNSLSTPPVILGCAQ-----GSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTG 231
            FTF  S S P +  GC +     G     G++GM  G LS  SQ  + +FSYC+ S   
Sbjct: 201 TFTFETS-SVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGS 260

Query: 232 PNPTGLFYLGDNPNSSKFKYVTMLTFPESQSS-----PNLDPLAYTLPMKAIKIAGKRLN 291
            +P+ L  LG   +            PE   S      +L+P  Y + ++ I + G  L 
Sbjct: 261 SSPSTL-ALGSAASG----------VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLG 320

Query: 292 IPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMC 351
           IP + F+    G+G  +IDSG+ LTYL  +AY  V +     +   +      ++    C
Sbjct: 321 IPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTC 380

Query: 352 FDA---GVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGS 411
           F     G TV+V     ++S +FD GV + +G  + +L    +GV C+ +G S +LGI  
Sbjct: 381 FQQPSDGSTVQV----PEISMQFDGGV-LNLGE-QNILISPAEGVICLAMGSSSQLGI-- 435

Query: 412 NIIGTVHQQNMWVEYDLANKRVGFGGAEC 428
           +I G + QQ   V YDL N  V F   +C
Sbjct: 441 SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CsaV3_3G017860 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 1.9e-32
Identity = 100/363 (27.55%), Postives = 163/363 (44.90%), Query Frame = 0

Query: 73  IGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPI 132
           +GTP +   LVLDTGS ++WIQC      +      +     F+P+ SS++  L C+ P 
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQC------EPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 227

Query: 133 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGS 192
           C        L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +
Sbjct: 228 CS------LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 287

Query: 193 ----TENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPNSSKFKYVT 252
               T   G+LG+  G LS  +Q K + FSYC+  R            D+  SS   + +
Sbjct: 288 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDR------------DSGKSSSLDFNS 347

Query: 253 MLTFPESQSSPNLD----PLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSD 312
           +       ++P L        Y + +    + G+++ +P A F  DA GSG  ++D G+ 
Sbjct: 348 VQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 407

Query: 313 LTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNG 372
           +T L  +AY  +++  ++L    +KKG    ++ D C+D      V  ++  ++F F  G
Sbjct: 408 VTRLQTQAYNSLRDAFLKLT-VNLKKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGG 467

Query: 373 VEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGG 428
             + +     ++   + G  C     +       +IIG V QQ   + YDL+   +G  G
Sbjct: 468 KSLDLPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSG 500

BLAST of CsaV3_3G017860 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 1.9e-29
Identity = 107/367 (29.16%), Postives = 170/367 (46.32%), Query Frame = 0

Query: 71  LPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTATFDPSLSSSFSLLPCN 130
           L +GTP +   +VLDTGS + W+QC   ++   +  P+       FDP  S +++ +PC+
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI-------FDPRKSKTYATIPCS 205

Query: 131 HPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCA 190
            P C+ R+      T   + + C Y   Y DG+   G+   E  TF  +     V LGC 
Sbjct: 206 SPHCR-RLDSAGCNT---RRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCG 265

Query: 191 QGS----TENRGILGMNHGRLSFISQAK---ISKFSYCVPSRTGPNPTGLFYLGDNPNSS 250
             +        G+LG+  G+LSF  Q       KFSYC+  R+  +       G+   S 
Sbjct: 266 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 325

Query: 251 KFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRL-NIPPAAFKPDAGGSGQTMIDS 310
             ++  +L      S+P LD   Y + +  I + G R+  +  + FK D  G+G  +IDS
Sbjct: 326 IARFTPLL------SNPKLDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDS 385

Query: 311 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFEF 370
           G+ +T L+  AY  +++     VGA   K     ++ D CFD     EV  ++  +   F
Sbjct: 386 GTSVTRLIRPAYIAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEV--KVPTVVLHF 445

Query: 371 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRVG 429
             G ++ +     ++     G  C     +G +G G +IIG + QQ   V YDLA+ RVG
Sbjct: 446 -RGADVSLPATNYLIPVDTNGKFCFAF--AGTMG-GLSIIGNIQQQGFRVVYDLASSRVG 485

BLAST of CsaV3_3G017860 vs. TrEMBL
Match: tr|A0A1S3C4D2|A0A1S3C4D2_CUCME (aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103496869 PE=3 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 1.8e-218
Identity = 382/431 (88.63%), Postives = 394/431 (91.42%), Query Frame = 0

Query: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFSLS                  SL+EKPSNI+P+ Y SQLY KKPSSHG FKLPF
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPSNISPI-YGSQLYAKKPSSHGSFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD KVKK+LPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KVKKKLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNSLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLS 180

Query: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPN 240
           TPPVILGCAQ STENRGILGMN GRLSFISQAKISKFSYCVP+RTG NPTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPN 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
           SS+FKYVTMLTFPESQSSPNLDPLAYTLPMK IKIAGKRLNI PAAFKPDAGGSGQTMID
Sbjct: 241 SSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGA MKKGYVYAAVADMCFDA VT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFE 360

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEI VGRGEGVLTEVEKGVKCVG GRS RLGIGSNIIGTVHQQNMWVEYDL N+R+
Sbjct: 361 FDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRI 420

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 429

BLAST of CsaV3_3G017860 vs. TrEMBL
Match: tr|A0A0A0LBQ0|A0A0A0LBQ0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G188340 PE=3 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 3.2e-170
Identity = 313/402 (77.86%), Postives = 314/402 (78.11%), Query Frame = 0

Query: 30  TEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQ 89
           TEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQ
Sbjct: 49  TEKPSNITPLYYSSQLYVKKPSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQ 108

Query: 90  LSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQN 149
           LSWIQCHDKKVKKRLPPLPKPKTA+FDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQN
Sbjct: 109 LSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQN 168

Query: 150 RLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFI 209
           RLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTENR             
Sbjct: 169 RLCHYSYFYADGTLAEGNLVREKFTFSNSLSTPPVILGCAQGSTENR------------- 228

Query: 210 SQAKISKFSYCVPSRTGPNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLP 269
                                                                       
Sbjct: 229 ------------------------------------------------------------ 288

Query: 270 MKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK 329
                          AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK
Sbjct: 289 ---------------AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK 348

Query: 330 KGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIG 389
           KGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIG
Sbjct: 349 KGYVYAAVADMCFDAGVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIG 362

Query: 390 RSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK 432
           RSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK
Sbjct: 409 RSGRLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECSRLK 362

BLAST of CsaV3_3G017860 vs. TrEMBL
Match: tr|A0A0A0L6V5|A0A0A0L6V5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G188350 PE=3 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 2.8e-158
Identity = 321/431 (74.48%), Postives = 326/431 (75.64%), Query Frame = 0

Query: 1   MLLILFSLSXXXXXXXXXXXXXXXXXXSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60
           MLLILFS  XXXXXXXXXXXXXXXXXX   EKPSN  P  YSSQLY K+PSS+G FKLPF
Sbjct: 1   MLLILFSXXXXXXXXXXXXXXXXXXXXXXXEKPSNTIP-SYSSQLYAKRPSSYGSFKLPF 60

Query: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLS 120
           KYSS+ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTA+FDPSLS
Sbjct: 61  KYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180
           SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS SLS
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS 180

Query: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGPNPTGLFYLGDNPN 240
           TPPVILGCAQ STENR                                            
Sbjct: 181 TPPVILGCAQASTENR-------------------------------------------- 240

Query: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300
                                                       AAFKPDAGGSGQTMID
Sbjct: 241 --------------------------------------------AAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360
           SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYA VADMCFDAGVT EVGRRIG +SFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 342

Query: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
           FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS RLGIGSNIIGTVHQQNMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 342

Query: 421 GFGGAECSRLK 432
           GFGGAECSRLK
Sbjct: 421 GFGGAECSRLK 342

BLAST of CsaV3_3G017860 vs. TrEMBL
Match: tr|Q9FGI3|Q9FGI3_ARATH (AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana OX=3702 GN=MPA22.8 PE=2 SV=1)

HSP 1 Score: 542.7 bits (1397), Expect = 7.3e-151
Identity = 265/384 (69.01%), Postives = 311/384 (80.99%), Query Frame = 0

Query: 50  PSSHGPFKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPK 109
           PSS   F+   KY S AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LP    
Sbjct: 64  PSSPYTFRSNIKY-SMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLP---- 123

Query: 110 PKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLV 169
           P T +FDPSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV
Sbjct: 124 PPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLV 183

Query: 170 REKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRT---G 229
           +EKFTFSNS +TPP+ILGCA+ ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+   G
Sbjct: 184 KEKFTFSNSQTTPPLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPG 243

Query: 230 PNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAA 289
              TG FYLGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP + 
Sbjct: 244 LASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSV 303

Query: 290 FKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGV 349
           F+PDAGGSGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD   
Sbjct: 304 FRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNH 363

Query: 350 TVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQ 409
           ++E+GR IGD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG VHQ
Sbjct: 364 SMEIGRLIGDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQ 423

Query: 410 QNMWVEYDLANKRVGFGGAECSRL 431
           QN+WVE+D+ N+RVGF  AEC  L
Sbjct: 424 QNLWVEFDVTNRRVGFSKAECRLL 441

BLAST of CsaV3_3G017860 vs. TrEMBL
Match: tr|A0A178UJV4|A0A178UJV4_ARATH (Uncharacterized protein OS=Arabidopsis thaliana OX=3702 GN=AXX17_At5g34710 PE=3 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 1.6e-150
Identity = 266/386 (68.91%), Postives = 311/386 (80.57%), Query Frame = 0

Query: 50  PSSHGP--FKLPFKYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPL 109
           PSS  P  F+   KY S AL++SLPIGTP Q  +LVLDTGSQLSWIQCH KK+KK LP  
Sbjct: 63  PSSSSPYTFRSNVKY-SMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLP-- 122

Query: 110 PKPKTATFDPSLSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGN 169
             P T +FDPSLSSSFS LPC+HP+CKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGN
Sbjct: 123 --PPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGN 182

Query: 170 LVREKFTFSNSLSTPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRT-- 229
           LV+EKFTFSNS +TPP+ILGCA  ST+ +GILGMN GRLSFISQAKISKFSYC+P+R+  
Sbjct: 183 LVKEKFTFSNSQTTPPLILGCANESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNR 242

Query: 230 -GPNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPP 289
            G   TG FYLGDNPNS  FKYV++LTFP+SQ  PNLDPLAYT+P++ I+I  KRLNIP 
Sbjct: 243 PGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPG 302

Query: 290 AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDA 349
           + F+PDAGGSGQTM+DSGS+ T+LVD AY+KVKEE+VRLVG+ +KKGYVY + ADMCFD 
Sbjct: 303 SVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDG 362

Query: 350 GVTVEVGRRIGDMSFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTV 409
             ++E+GR IGD+ FEF  GVEI V + + +L  V  G+ CVGIGRS  LG  SNIIG V
Sbjct: 363 NHSMEIGRLIGDLVFEFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNV 422

Query: 410 HQQNMWVEYDLANKRVGFGGAECSRL 431
           HQQN+WVE+D+ N+RVGF  AEC  L
Sbjct: 423 HQQNLWVEFDVTNRRVGFSKAECRLL 442

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011651212.12.0e-23799.54PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
XP_004140731.29.2e-22294.43PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
XP_008457122.12.8e-21888.63PREDICTED: aspartic proteinase PCS1 [Cucumis melo][more]
XP_022983616.12.3e-18880.97aspartic proteinase PCS1-like [Cucurbita maxima][more]
XP_022934878.12.1e-18681.41aspartic proteinase PCS1-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT5G37540.12.0e-15469.01Eukaryotic aspartyl protease family protein[more]
AT1G66180.19.0e-14765.74Eukaryotic aspartyl protease family protein[more]
AT2G39710.16.9e-6235.73Eukaryotic aspartyl protease family protein[more]
AT5G02190.16.9e-6237.37Eukaryotic aspartyl protease family protein[more]
AT3G25700.12.4e-3829.63Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
sp|Q9LZL3|PCS1L_ARATH1.2e-6037.37Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1[more]
sp|Q766C3|NEP1_NEPGR2.9e-4131.88Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
sp|Q766C2|NEP2_NEPGR1.2e-3931.36Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
sp|Q9LS40|ASPG1_ARATH1.9e-3227.55Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LNJ3|APF2_ARATH1.9e-2929.16Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3C4D2|A0A1S3C4D2_CUCME1.8e-21888.63aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103496869 PE=3 SV=1[more]
tr|A0A0A0LBQ0|A0A0A0LBQ0_CUCSA3.2e-17077.86Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G188340 PE=3 SV=1[more]
tr|A0A0A0L6V5|A0A0A0L6V5_CUCSA2.8e-15874.48Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G188350 PE=3 SV=1[more]
tr|Q9FGI3|Q9FGI3_ARATH7.3e-15169.01AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana OX=3702 GN=MPA22.8 PE=2 SV=1[more]
tr|A0A178UJV4|A0A178UJV4_ARATH1.6e-15068.91Uncharacterized protein OS=Arabidopsis thaliana OX=3702 GN=AXX17_At5g34710 PE=3 ... [more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR034161Pepsin-like_plant
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR032799TAXi_C
IPR021109Peptidase_aspartic_dom_sf
IPR032861TAXi_N
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0015992 proton transport
biological_process GO:0006511 ubiquitin-dependent protein catabolic process
biological_process GO:0006744 ubiquinone biosynthetic process
biological_process GO:0006814 sodium ion transport
biological_process GO:0051788 response to misfolded protein
biological_process GO:0006979 response to oxidative stress
biological_process GO:0080129 proteasome core complex assembly
biological_process GO:0009853 photorespiration
biological_process GO:0006120 mitochondrial electron transport, NADH to ubiquinone
biological_process GO:0009630 gravitropism
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
cellular_component GO:0005747 mitochondrial respiratory chain complex I
molecular_function GO:0016740 transferase activity
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0051537 2 iron, 2 sulfur cluster binding
molecular_function GO:0051539 4 iron, 4 sulfur cluster binding
molecular_function GO:0009055 electron carrier activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0008137 NADH dehydrogenase (ubiquinone) activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_3G017860.1CsaV3_3G017860.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 297..308
score: 28.66
coord: 73..93
score: 47.5
coord: 399..414
score: 25.04
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 4..429
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 69..237
e-value: 4.1E-36
score: 124.8
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 56..234
e-value: 4.4E-34
score: 120.2
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 235..431
e-value: 2.6E-38
score: 133.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 64..429
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 265..423
e-value: 3.5E-31
score: 108.1
NoneNo IPR availablePANTHERPTHR13683:SF327ASPARTYL PROTEASE FAMILY PROTEINcoord: 4..429
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 82..93
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 67..423
score: 32.704
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 68..427
e-value: 4.48302E-73
score: 231.768