CsaV3_4G006960 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_4G006960
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: aspartic proteinase CDR1-like
Location: chr4 : 4716169 .. 4717473 (-)
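
As a quick sanity check on the location above, the span length can be derived from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr4 : 4716169 .. 4717473 (-)'.

    Returns (chromosome, start, end, strand). The layout is taken from
    this record; other pages may format locations differently.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr4 : 4716169 .. 4717473 (-)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
print(chrom, strand, length)  # chr4 - 1305
```

Under that assumption the feature spans 1305 nt, i.e. 435 codons, which would be consistent with a 434-residue protein plus a stop codon.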
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCCATTTCAATCTTCTTCTACTTTCTCCTCTTCTTCTCCTCCAAAGTAACCGCTCATGGCGGTGGCCACCATGGCTTCACTACCTCTCTATTCCGCCGCGATTCTCCTCTCTCCCCTCTCCACAACCCATCTCTCTCCCGCTATGACAGCCTTATCGACGCCTTTCGTCGCTCCTTCTCCCGCTCCGCCACCCTCCTCACCCACCTCACTTCTGTCTCCACTGCATGCATACGTTCTCCCATCATCCCCGACAGCGGCGAGTTTCTAATGTCTATCTTTATCGGAACCCCTCCAGTGAATGTCATAGCCATTGCCGATACTGGCAGCGACCTAACGTGGACTCAATGCTTGCCATGTCGGGAATGCTTCAACCAATCACAGCCTATTTTTAATCCACGTCGGTCATCTTCCTACCGCAAAGTGTCTTGTGCATCGGATACGTGTCGCTCCTTGGAAAGTTACCATTGTGGACCAGACCTTCAAAGCTGCAGCTACGGCTATAGCTATGGAGATCGATCATTTACGTATGGTGACCTAGCATCTGATCAAATTACCATTGGGTCCTTCAAACTCCCCAAGACAGTTATCGGATGCGGTCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCCGGAATCATTGGACTTGGTGGTGGATCTCTCTCTTTGGTGTCTCAAATGAGAACAATTGCCGGCGTCAAACCACGATTCTCATATTGCTTGCCCACTTTCTTTAGCAACGCTAATATCACAGGTACAATAAGCTTTGGCCGAAAGGCTGTCGTTTCAGGGCGTCAAGTCGTTTCTACCCCTCTCGTTCCGAGATCTCCCGATACTTTTTATTTCTTGACTCTTGAGGCAATCTCTGTTGGAAAGAAGCGATTTAAAGCCGCAAACGGCATATCGGCCATGACCAACCATGGGAATATCATTATAGATTCCGGTACGACATTGACTCTTCTACCTCGGAGTCTGTACTACGGCGTCTTTTCGACTTTGGCGAGAGTCATTAAAGCAAAGCGAGTGGACGATCCTTCGGGGATTTTGGAGCTTTGCTATTCTGCGGGGCAAGTTGACGATTTAAATATTCCAATAATCACGGCACATTTTGCAGGTGGTGCCGACGTGAAGTTGCTGCCGGTGAACACATTTGCACCGGTGGCTGATAATGTGACTTGTTTGACTTTTGCACCGGCAACGCAGGTCGCCATTTTTGGAAACTTGGCACAGATAAACTTTGAAGTTGGATATGATCTTGGGAATAAGAGATTATCGTTTGAACCTAAACTTTGTGCTTAG

mRNA sequence

ATGGCTGCCATTTCAATCTTCTTCTACTTTCTCCTCTTCTTCTCCTCCAAAGTAACCGCTCATGGCGGTGGCCACCATGGCTTCACTACCTCTCTATTCCGCCGCGATTCTCCTCTCTCCCCTCTCCACAACCCATCTCTCTCCCGCTATGACAGCCTTATCGACGCCTTTCGTCGCTCCTTCTCCCGCTCCGCCACCCTCCTCACCCACCTCACTTCTGTCTCCACTGCATGCATACGTTCTCCCATCATCCCCGACAGCGGCGAGTTTCTAATGTCTATCTTTATCGGAACCCCTCCAGTGAATGTCATAGCCATTGCCGATACTGGCAGCGACCTAACGTGGACTCAATGCTTGCCATGTCGGGAATGCTTCAACCAATCACAGCCTATTTTTAATCCACGTCGGTCATCTTCCTACCGCAAAGTGTCTTGTGCATCGGATACGTGTCGCTCCTTGGAAAGTTACCATTGTGGACCAGACCTTCAAAGCTGCAGCTACGGCTATAGCTATGGAGATCGATCATTTACGTATGGTGACCTAGCATCTGATCAAATTACCATTGGGTCCTTCAAACTCCCCAAGACAGTTATCGGATGCGGTCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCCGGAATCATTGGACTTGGTGGTGGATCTCTCTCTTTGGTGTCTCAAATGAGAACAATTGCCGGCGTCAAACCACGATTCTCATATTGCTTGCCCACTTTCTTTAGCAACGCTAATATCACAGGTACAATAAGCTTTGGCCGAAAGGCTGTCGTTTCAGGGCGTCAAGTCGTTTCTACCCCTCTCGTTCCGAGATCTCCCGATACTTTTTATTTCTTGACTCTTGAGGCAATCTCTGTTGGAAAGAAGCGATTTAAAGCCGCAAACGGCATATCGGCCATGACCAACCATGGGAATATCATTATAGATTCCGGTACGACATTGACTCTTCTACCTCGGAGTCTGTACTACGGCGTCTTTTCGACTTTGGCGAGAGTCATTAAAGCAAAGCGAGTGGACGATCCTTCGGGGATTTTGGAGCTTTGCTATTCTGCGGGGCAAGTTGACGATTTAAATATTCCAATAATCACGGCACATTTTGCAGGTGGTGCCGACGTGAAGTTGCTGCCGGTGAACACATTTGCACCGGTGGCTGATAATGTGACTTGTTTGACTTTTGCACCGGCAACGCAGGTCGCCATTTTTGGAAACTTGGCACAGATAAACTTTGAAGTTGGATATGATCTTGGGAATAAGAGATTATCGTTTGAACCTAAACTTTGTGCTTAG

Coding sequence (CDS)

ATGGCTGCCATTTCAATCTTCTTCTACTTTCTCCTCTTCTTCTCCTCCAAAGTAACCGCTCATGGCGGTGGCCACCATGGCTTCACTACCTCTCTATTCCGCCGCGATTCTCCTCTCTCCCCTCTCCACAACCCATCTCTCTCCCGCTATGACAGCCTTATCGACGCCTTTCGTCGCTCCTTCTCCCGCTCCGCCACCCTCCTCACCCACCTCACTTCTGTCTCCACTGCATGCATACGTTCTCCCATCATCCCCGACAGCGGCGAGTTTCTAATGTCTATCTTTATCGGAACCCCTCCAGTGAATGTCATAGCCATTGCCGATACTGGCAGCGACCTAACGTGGACTCAATGCTTGCCATGTCGGGAATGCTTCAACCAATCACAGCCTATTTTTAATCCACGTCGGTCATCTTCCTACCGCAAAGTGTCTTGTGCATCGGATACGTGTCGCTCCTTGGAAAGTTACCATTGTGGACCAGACCTTCAAAGCTGCAGCTACGGCTATAGCTATGGAGATCGATCATTTACGTATGGTGACCTAGCATCTGATCAAATTACCATTGGGTCCTTCAAACTCCCCAAGACAGTTATCGGATGCGGTCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCCGGAATCATTGGACTTGGTGGTGGATCTCTCTCTTTGGTGTCTCAAATGAGAACAATTGCCGGCGTCAAACCACGATTCTCATATTGCTTGCCCACTTTCTTTAGCAACGCTAATATCACAGGTACAATAAGCTTTGGCCGAAAGGCTGTCGTTTCAGGGCGTCAAGTCGTTTCTACCCCTCTCGTTCCGAGATCTCCCGATACTTTTTATTTCTTGACTCTTGAGGCAATCTCTGTTGGAAAGAAGCGATTTAAAGCCGCAAACGGCATATCGGCCATGACCAACCATGGGAATATCATTATAGATTCCGGTACGACATTGACTCTTCTACCTCGGAGTCTGTACTACGGCGTCTTTTCGACTTTGGCGAGAGTCATTAAAGCAAAGCGAGTGGACGATCCTTCGGGGATTTTGGAGCTTTGCTATTCTGCGGGGCAAGTTGACGATTTAAATATTCCAATAATCACGGCACATTTTGCAGGTGGTGCCGACGTGAAGTTGCTGCCGGTGAACACATTTGCACCGGTGGCTGATAATGTGACTTGTTTGACTTTTGCACCGGCAACGCAGGTCGCCATTTTTGGAAACTTGGCACAGATAAACTTTGAAGTTGGATATGATCTTGGGAATAAGAGATTATCGTTTGAACCTAAACTTTGTGCTTAG

Protein sequence

MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA
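
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; `translate`, `cds_start` and `cds_end` are illustrative names, and the two fragments are the first 30 and last 21 nucleotides copied from the CDS.

```python
from itertools import product

# Standard genetic code (assumed), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCTGCCATTTCAATCTTCTTCTACTTT"   # first 30 nt of the CDS
cds_end = "GAACCTAAACTTTGTGCTTAG"              # last 21 nt of the CDS

print(translate(cds_start))  # MAAISIFFYF
print(translate(cds_end))    # EPKLCA*
```

The two translated fragments match the first ten and last six residues of the protein sequence, with the final codon TAG acting as the stop.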
BLAST of CsaV3_4G006960 vs. NCBI nr
Match: XP_004149004.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus] >KGN53446.1 hypothetical protein Csa_4G055410 [Cucumis sativus])

HSP 1 Score: 817.4 bits (2110), Expect = 2.3e-233
Identity = 434/434 (100.00%), Positives = 434/434 (100.00%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS
Sbjct: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360
           NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSFEPKLCA
Sbjct: 421 LGNKRLSFEPKLCA 434

BLAST of CsaV3_4G006960 vs. NCBI nr
Match: XP_008452150.1 (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 738.4 bits (1905), Expect = 1.4e-209
Identity = 369/405 (91.11%), Positives = 382/405 (94.32%), Query Frame = 0

Query: 30  TSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGE 89
           TSL+ RDS LSPLHNPSLSRYDSL+++FRRSFSRSATLL HLTSVSTACIRSPIIPDSGE
Sbjct: 30  TSLYHRDSLLSPLHNPSLSRYDSLVESFRRSFSRSATLLNHLTSVSTACIRSPIIPDSGE 89

Query: 90  FLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDT 149
           FLMSIFIGTP VN IAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSC+SDT
Sbjct: 90  FLMSIFIGTPRVNFIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCSSDT 149

Query: 150 CRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFG 209
           CRSLES HCG DL+SCSYGYSYGDRSFTYGDLASD+ITIGSFKLPKTVIGCGHQNGGTFG
Sbjct: 150 CRSLESSHCGLDLKSCSYGYSYGDRSFTYGDLASDKITIGSFKLPKTVIGCGHQNGGTFG 209

Query: 210 GVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQV 269
           GVTSGIIGLGGGSLSLVSQM TIAGVKP+FSYCLPTFFSN NITG ISFGRKAVVSGRQV
Sbjct: 210 GVTSGIIGLGGGSLSLVSQMSTIAGVKPQFSYCLPTFFSNENITGKISFGRKAVVSGRQV 269

Query: 270 VSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYY 329
           VSTPLVPRSPDTFYFLTLEAISVG KRFKAA  +SAMTN GNIIIDSGTTLTLLPRSLY 
Sbjct: 270 VSTPLVPRSPDTFYFLTLEAISVGNKRFKAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYD 329

Query: 330 GVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPV 389
           GV STLARVIK KRVDDPSGILELCYSAGQ++DLNIPIITAHF+G ADVKLLPVNTFAPV
Sbjct: 330 GVVSTLARVIKTKRVDDPSGILELCYSAGQLEDLNIPIITAHFSGRADVKLLPVNTFAPV 389

Query: 390 ADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           ADNV CLT APAT VAIFGNLAQINFEVGYDLGNKRLSF+P  CA
Sbjct: 390 ADNVICLTLAPATNVAIFGNLAQINFEVGYDLGNKRLSFKPTRCA 434
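
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them for the Cucumis melo hit above; `parse_hsp_stats` is a hypothetical helper written for this page's header layout, not a function of any BLAST library, and its pattern accepts any label beginning with `Pos`.

```python
import re

def parse_hsp_stats(line):
    """Recompute identity and positive percentages from an HSP header line."""
    stats = {}
    for label, num, den in re.findall(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)", line):
        key = "identity" if label == "Identity" else "positives"
        # matches / alignment length, as a percentage with two decimals
        stats[key] = round(100 * int(num) / int(den), 2)
    return stats

line = "Identity = 369/405 (91.11%), Positives = 382/405 (94.32%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 91.11, 'positives': 94.32}
```

The recomputed values agree with the percentages printed in the header, so the reported figures are plain ratios over the 405 aligned positions.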

BLAST of CsaV3_4G006960 vs. NCBI nr
Match: XP_023543528.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 602.4 bits (1552), Expect = 1.2e-168
Identity = 295/406 (72.66%), Positives = 340/406 (83.74%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           TTSLF RDS LSPL+NPSLS YD L +AFRRSFSRS TLL    +VST  I S IIPD G
Sbjct: 32  TTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSTTGIHSRIIPDDG 91

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           EFLMSI IGTP V ++AIADTGSDLTWTQC+PC +CFNQS PIFNPRRS SYR VSC S+
Sbjct: 92  EFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSN 151

Query: 149 TCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTF 208
            CRSL+ Y CGPD ++CSYGYSYGD+SFTYGDLAS++IT+GSFKL KTVIGCGH NGGTF
Sbjct: 152 ACRSLDDYRCGPDNRTCSYGYSYGDQSFTYGDLASEKITVGSFKLYKTVIGCGHVNGGTF 211

Query: 209 GGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ 268
            G TSGIIGLGGG LSL+SQMR IA VK RFSYCLPTFFS+ N+TG ISFG+KA+VSGR+
Sbjct: 212 SGDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRK 271

Query: 269 VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLY 328
           V+STPLV + P+TFY++TL+A+SV  KRFKAAN +SA    GNI+IDSGTTLT+LP +LY
Sbjct: 272 VISTPLVLKEPNTFYYVTLKAMSVANKRFKAANNMSAAVERGNILIDSGTTLTILPPNLY 331

Query: 329 YGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAP 388
            GV STLA V+KAKRV+DP+G+L+LC++   VD LNIP+ITAHFAGGADVKLLP+NTFA 
Sbjct: 332 KGVASTLAHVVKAKRVNDPTGVLDLCFATRSVDHLNIPVITAHFAGGADVKLLPLNTFAM 391

Query: 389 VADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           VADNV CL F P+   AIFGNLAQ+NF VGYDL  KRLSF+  +CA
Sbjct: 392 VADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 437

BLAST of CsaV3_4G006960 vs. NCBI nr
Match: XP_022942027.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 597.8 bits (1540), Expect = 2.9e-167
Identity = 295/406 (72.66%), Positives = 338/406 (83.25%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           TTSLF RDS LSPL+NPSLS YD L +AFRRSFSRS TLL    +VS   I S IIPD G
Sbjct: 32  TTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSFSRSDTLLNRAAAVSITGIHSRIIPDDG 91

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           EFLMSI IGTP V ++AIADTGSDLTWTQC+PC +CFNQS PIFNPRRS SYR VSC S+
Sbjct: 92  EFLMSISIGTPRVKIMAIADTGSDLTWTQCMPCHKCFNQSFPIFNPRRSFSYRHVSCTSN 151

Query: 149 TCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTF 208
            CRSL+ Y CGPD ++CSYGYSYGD+SFTYGDLAS++ITIGSFKL KT+IGCGH NGGTF
Sbjct: 152 ACRSLDDYRCGPDNRTCSYGYSYGDQSFTYGDLASEKITIGSFKLYKTLIGCGHVNGGTF 211

Query: 209 GGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ 268
              TSGIIGLGGG LSL+SQMR IA VK RFSYCLPTFFS+ N+TG ISFG+KA+VSGR+
Sbjct: 212 SEDTSGIIGLGGGPLSLISQMRKIAAVKRRFSYCLPTFFSDKNVTGKISFGKKAIVSGRK 271

Query: 269 VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLY 328
           VVSTPLV + P+TFY+LTLEA+SV  KRFKAAN +S     GNI+IDSGTTLT+LP++LY
Sbjct: 272 VVSTPLVLKEPNTFYYLTLEAMSVANKRFKAANNMSTAVEQGNILIDSGTTLTILPQNLY 331

Query: 329 YGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAP 388
            GV STLA V+KAKRV+DP+G+L+LC++A  VD LNIP+ITAHFAG ADVKLLP+NTFA 
Sbjct: 332 KGVASTLAHVVKAKRVNDPTGVLDLCFAACSVDHLNIPVITAHFAGNADVKLLPLNTFAM 391

Query: 389 VADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           VADNV CL F P+   AIFGNLAQ+NF VGYDL  KRLSF+  +CA
Sbjct: 392 VADNVACLAFVPSANFAIFGNLAQVNFLVGYDLERKRLSFKYNVCA 437

BLAST of CsaV3_4G006960 vs. NCBI nr
Match: XP_023552860.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 582.0 bits (1499), Expect = 1.7e-162
Identity = 301/433 (69.52%), Positives = 345/433 (79.68%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MAAISI      XXXXXXXXX       TTS+  RDS LSPLHNPS+S Y+ L  AF RS
Sbjct: 1   MAAISIFFYFLLXXXXXXXXXRGGGNGFTTSIIHRDSLLSPLHNPSVSHYERLTGAFNRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRS TL     +VST  + SP+IPDSGEFL+S+ IGTPPV+  AIADTGSDLTWTQCLP
Sbjct: 61  FSRSTTLTNRAATVSTGGVHSPLIPDSGEFLISLSIGTPPVDFTAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           C +CFNQS PIFNP RSSSY  VSC SDTC S+ S+ CGPDL++C+YGYSYGD+SFTYGD
Sbjct: 121 CVKCFNQSSPIFNPHRSSSYSNVSCTSDTCNSIVSHRCGPDLKTCTYGYSYGDQSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LA ++ITIGSFKL K VIGCGH+NGGTF G TSGI+GLGGG LSLVSQ+ TIA VK +FS
Sbjct: 181 LAYEKITIGSFKLNKVVIGCGHENGGTFLGETSGIVGLGGGPLSLVSQLNTIAAVKRQFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFS+ NITG ISFG +A VSGR+VVSTPLV + P T+YFLTLEA+SV  KRF+ A
Sbjct: 241 YCLPTFFSDGNITGKISFGEEAAVSGRKVVSTPLVQKHPYTYYFLTLEAVSVANKRFEVA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360
           N +S+    GNIIIDSGTTLTLLP +LY G+ STLARV+KAKRV+DPSGILELCY    +
Sbjct: 301 NNMSSAVVEGNIIIDSGTTLTLLPPNLYDGIVSTLARVVKAKRVNDPSGILELCYGVSSI 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           DDLNIPIITAHFAGGA V+L P NTFA V ++V CLT APA + AIFGNLAQ+NF VGYD
Sbjct: 361 DDLNIPIITAHFAGGAAVELQPENTFALVNEDVACLTLAPAKKFAIFGNLAQVNFLVGYD 420

Query: 421 LGNKRLSFEPKLC 434
           L  K +SF+  +C
Sbjct: 421 LEQKTVSFKRTVC 433

BLAST of CsaV3_4G006960 vs. TAIR10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 379.4 bits (973), Expect = 3.0e-105
Identity = 206/411 (50.12%), Positives = 264/411 (64.23%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           T  L  RDSP SP +N + +    + +A RR  S  +TL       S    +S I  + G
Sbjct: 27  TIDLIHRDSPKSPFYNSAETSSQRMRNAIRR--SARSTLQFSNDDASPNSPQSFITSNRG 86

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           E+LM+I IGTPPV ++AIADTGSDL WTQC PC +C+ Q+ P+F+P+ SS+YRKVSC+S 
Sbjct: 87  EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSS 146

Query: 149 TCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGCGHQ 208
            CR+LE   C  D  +CSY  +YGD S+T GD+A D +T+GS       L   +IGCGH+
Sbjct: 147 QCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHE 206

Query: 209 NGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAV 268
           N GTF    SGIIGLGGGS SLVSQ+R    +  +FSYCL  F S   +T  I+FG   +
Sbjct: 207 NTGTFDPAGSGIIGLGGGSTSLVSQLR--KSINGKFSYCLVPFTSETGLTSKINFGTNGI 266

Query: 269 VSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLL 328
           VSG  VVST +V + P T+YFL LEAISVG K+ +  + I   T  GNI+IDSGTTLTLL
Sbjct: 267 VSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFG-TGEGNIVIDSGTTLTLL 326

Query: 329 PRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPV 388
           P + YY + S +A  IKA+RV DP GIL LCY         +P IT HF GG DVKL  +
Sbjct: 327 PSNFYYELESVVASTIKAERVQDPDGILSLCYR--DSSSFKVPDITVHFKGG-DVKLGNL 386

Query: 389 NTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           NTF  V+++V+C  FA   Q+ IFGNLAQ+NF VGYD  +  +SF+   C+
Sbjct: 387 NTFVAVSEDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429

BLAST of CsaV3_4G006960 vs. TAIR10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 345.5 bits (885), Expect = 4.7e-95
Identity = 196/427 (45.90%), Positives = 268/427 (62.76%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           +  L  RDSPLSP++NP ++  D L  AF RS SRS      L+      ++S +I   G
Sbjct: 27  SVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQLSQTD---LQSGLIGADG 86

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           EF MSI IGTPP+ V AIADTGSDLTW QC PC++C+ ++ PIF+ ++SS+Y+   C S 
Sbjct: 87  EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 146

Query: 149 TCRSLESYHCGPDLQS--CSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGCG 208
            C++L S   G D  +  C Y YSYGD+SF+ GD+A++ ++I S        P TV GCG
Sbjct: 147 NCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCG 206

Query: 209 HQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRK 268
           + NGGTF    SGIIGLGGG LSL+SQ+   + +  +FSYCL    +  N T  I+ G  
Sbjct: 207 YNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFSYCLSHKSATTNGTSVINLGTN 266

Query: 269 AVVSGRQ----VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA--------NGISAMTNH 328
           ++ S       VVSTPLV + P T+Y+LTLEAISVGKK+            +GI + T+ 
Sbjct: 267 SIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETS- 326

Query: 329 GNIIIDSGTTLTLLPRSLYYGVFSTLAR--VIKAKRVDDPSGILELCYSAGQVDDLNIPI 388
           GNIIIDSGTTLTLL    ++  FS+     V  AKRV DP G+L  C+ +G   ++ +P 
Sbjct: 327 GNIIIDSGTTLTLLEAG-FFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSA-EIGLPE 386

Query: 389 ITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLS 435
           IT HF  GADV+L P+N F  +++++ CL+  P T+VAI+GN AQ++F VGYDL  + +S
Sbjct: 387 ITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVS 444

BLAST of CsaV3_4G006960 vs. TAIR10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 345.5 bits (885), Expect = 4.7e-95
Identity = 190/414 (45.89%), Positives = 262/414 (63.29%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSV-STACIRSPIIPDS 88
           T  L  RDSP SP +NP  +    L +A  RS +R    + H T   +T   +  +  +S
Sbjct: 32  TADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR----VFHFTEKDNTPQPQIDLTSNS 91

Query: 89  GEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCAS 148
           GE+LM++ IGTPP  ++AIADTGSDL WTQC PC +C+ Q  P+F+P+ SS+Y+ VSC+S
Sbjct: 92  GEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSS 151

Query: 149 DTCRSLESY-HCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGCG 208
             C +LE+   C  +  +CSY  SYGD S+T G++A D +T+GS      +L   +IGCG
Sbjct: 152 SQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCG 211

Query: 209 HQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRK 268
           H N GTF    SGI+GLGGG +SL+ Q+     +  +FSYCL    S  + T  I+FG  
Sbjct: 212 HNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGTN 271

Query: 269 AVVSGRQVVSTPLVPR-SPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTL 328
           A+VSG  VVSTPL+ + S +TFY+LTL++ISVG K+ +  +G  + ++ GNIIIDSGTTL
Sbjct: 272 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQ-YSGSDSESSEGNIIIDSGTTL 331

Query: 329 TLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKL 388
           TLLP   Y  +   +A  I A++  DP   L LCYSA    DL +P+IT HF  GADVKL
Sbjct: 332 TLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHF-DGADVKL 391

Query: 389 LPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
              N F  V++++ C  F  +   +I+GN+AQ+NF VGYD  +K +SF+P  CA
Sbjct: 392 DSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CsaV3_4G006960 vs. TAIR10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 343.2 bits (879), Expect = 2.3e-94
Identity = 195/424 (45.99%), Positives = 256/424 (60.38%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           T  L  RDSP SPL+NP  +  D L  AF RS SRS    T         ++S +I + G
Sbjct: 30  TVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTKTD------LQSGLISNGG 89

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           E+ MSI IGTPP  V AIADTGSDLTW QC PC++C+ Q+ P+F+ ++SS+Y+  SC S 
Sbjct: 90  EYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSK 149

Query: 149 TCRSLESYH--CGPDLQSCSYGYSYGDRSFTYGDLASDQITI-----GSFKLPKTVIGCG 208
           TC++L  +   C      C Y YSYGD SFT GD+A++ I+I      S   P TV GCG
Sbjct: 150 TCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCG 209

Query: 209 HQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRK 268
           + NGGTF    SGIIGLGGG LSLVSQ+ +  G K  FSYCL    +  N T  I+ G  
Sbjct: 210 YNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKK--FSYCLSHTAATTNGTSVINLGTN 269

Query: 269 AVVSG----RQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNH-----GNI 328
           ++ S        ++TPL+ + P+T+YFLTLEA++VGK +     G   +        GNI
Sbjct: 270 SIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNI 329

Query: 329 IIDSGTTLTLLPRSLYYGVFSTLAR--VIKAKRVDDPSGILELCYSAGQVDDLNIPIITA 388
           IIDSGTTLTLL  S +Y  F T     V  AKRV DP G+L  C+ +G   ++ +P IT 
Sbjct: 330 IIDSGTTLTLLD-SGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGD-KEIGLPAITM 389

Query: 389 HFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEP 435
           HF   ADVKL P+N F  + ++  CL+  P T+VAI+GN+ Q++F VGYDL  K +SF+ 
Sbjct: 390 HFT-NADVKLSPINAFVKLNEDTVCLSMIPTTEVAIYGNMVQMDFLVGYDLETKTVSFQR 442

BLAST of CsaV3_4G006960 vs. TAIR10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 228.8 bits (582), Expect = 6.5e-60
Identity = 146/359 (40.67%), Positives = 199/359 (55.43%), Query Frame = 0

Query: 86  DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSC 145
           D+  +LM + +GTPP  + AI DTGS++TWTQCLPC  C+ Q+ PIF+P +SS++++  C
Sbjct: 61  DNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRC 120

Query: 146 ASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGC 205
                          D  SC Y   Y D ++T G LA++ IT+ S     F +P+T+IGC
Sbjct: 121 ---------------DGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGC 180

Query: 206 GHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKP-RFSYCLPTFFSNANITGTISFG 265
           GH N   F    SG++GL  G  SL++QM    G  P   SYC    FS    T  I+FG
Sbjct: 181 GH-NNSWFKPSFSGMVGLNWGPSSLITQM---GGEYPGLMSYC----FSGQG-TSKINFG 240

Query: 266 RKAVVSGRQVVSTPL-VPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGT 325
             A+V+G  VVST + +  +   FY+L L+A+SVG  R +   G +     GNI+IDSGT
Sbjct: 241 ANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETM-GTTFHALEGNIVIDSGT 300

Query: 326 TLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 385
           TLT  P S    V   +  V+ A R  DP+G   LCY++  +D    P+IT HF+GG D+
Sbjct: 301 TLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID--IFPVITMHFSGGVDL 360

Query: 386 KLLPVNTFAPVAD-NVTCLTFA--PATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
            L   N +    +  V CL       TQ AIFGN AQ NF VGYD  +  +SF P  C+
Sbjct: 361 VLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

BLAST of CsaV3_4G006960 vs. Swiss-Prot
Match: sp|Q3EBM5|ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 8.5e-94
Identity = 196/427 (45.90%), Positives = 268/427 (62.76%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           +  L  RDSPLSP++NP ++  D L  AF RS SRS      L+      ++S +I   G
Sbjct: 27  SVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQLSQTD---LQSGLIGADG 86

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           EF MSI IGTPP+ V AIADTGSDLTW QC PC++C+ ++ PIF+ ++SS+Y+   C S 
Sbjct: 87  EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 146

Query: 149 TCRSLESYHCGPDLQS--CSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGCG 208
            C++L S   G D  +  C Y YSYGD+SF+ GD+A++ ++I S        P TV GCG
Sbjct: 147 NCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCG 206

Query: 209 HQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRK 268
           + NGGTF    SGIIGLGGG LSL+SQ+   + +  +FSYCL    +  N T  I+ G  
Sbjct: 207 YNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFSYCLSHKSATTNGTSVINLGTN 266

Query: 269 AVVSGRQ----VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA--------NGISAMTNH 328
           ++ S       VVSTPLV + P T+Y+LTLEAISVGKK+            +GI + T+ 
Sbjct: 267 SIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETS- 326

Query: 329 GNIIIDSGTTLTLLPRSLYYGVFSTLAR--VIKAKRVDDPSGILELCYSAGQVDDLNIPI 388
           GNIIIDSGTTLTLL    ++  FS+     V  AKRV DP G+L  C+ +G   ++ +P 
Sbjct: 327 GNIIIDSGTTLTLLEAG-FFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSA-EIGLPE 386

Query: 389 ITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLS 435
           IT HF  GADV+L P+N F  +++++ CL+  P T+VAI+GN AQ++F VGYDL  + +S
Sbjct: 387 ITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDFLVGYDLETRTVS 444

BLAST of CsaV3_4G006960 vs. Swiss-Prot
Match: sp|Q6XBF8|CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 8.5e-94
Identity = 190/414 (45.89%), Positives = 262/414 (63.29%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSV-STACIRSPIIPDS 88
           T  L  RDSP SP +NP  +    L +A  RS +R    + H T   +T   +  +  +S
Sbjct: 32  TADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR----VFHFTEKDNTPQPQIDLTSNS 91

Query: 89  GEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCAS 148
           GE+LM++ IGTPP  ++AIADTGSDL WTQC PC +C+ Q  P+F+P+ SS+Y+ VSC+S
Sbjct: 92  GEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSS 151

Query: 149 DTCRSLESY-HCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIGCG 208
             C +LE+   C  +  +CSY  SYGD S+T G++A D +T+GS      +L   +IGCG
Sbjct: 152 SQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCG 211

Query: 209 HQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRK 268
           H N GTF    SGI+GLGGG +SL+ Q+     +  +FSYCL    S  + T  I+FG  
Sbjct: 212 HNNAGTFNKKGSGIVGLGGGPVSLIKQLGD--SIDGKFSYCLVPLTSKKDQTSKINFGTN 271

Query: 269 AVVSGRQVVSTPLVPR-SPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTL 328
           A+VSG  VVSTPL+ + S +TFY+LTL++ISVG K+ +  +G  + ++ GNIIIDSGTTL
Sbjct: 272 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQ-YSGSDSESSEGNIIIDSGTTL 331

Query: 329 TLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKL 388
           TLLP   Y  +   +A  I A++  DP   L LCYSA    DL +P+IT HF  GADVKL
Sbjct: 332 TLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLKVPVITMHF-DGADVKL 391

Query: 389 LPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
              N F  V++++ C  F  +   +I+GN+AQ+NF VGYD  +K +SF+P  CA
Sbjct: 392 DSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CsaV3_4G006960 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 225.7 bits (574), Expect = 9.9e-58
Identity = 143/394 (36.29%), Positives = 203/394 (51.52%), Query Frame = 0

Query: 46  SLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIA 105
           +L+++  L  A  R   R   L   L   S   + + +    GE+LM++ IGTP     A
Sbjct: 53  NLTKFQLLERAIERGSRRLQRLEAMLNGPSG--VETSVYAGDGEYLMNLSIGTPAQPFSA 112

Query: 106 IADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSC 165
           I DTGSDL WTQC PC +CFNQS PIFNP+ SSS+  + C+S  C++L S  C  +   C
Sbjct: 113 IMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNF--C 172

Query: 166 SYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSL 225
            Y Y YGD S T G + ++ +T GS  +P    GCG  N G   G  +G++G+G G LSL
Sbjct: 173 QYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSL 232

Query: 226 VSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSP-DTFYF 285
            SQ+        +FSYC+    S+      +     +V +G    +T L+  S   TFY+
Sbjct: 233 PSQLDV-----TKFSYCMTPIGSSTPSNLLLGSLANSVTAGSP--NTTLIQSSQIPTFYY 292

Query: 286 LTLEAISVGKKRF---KAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKA 345
           +TL  +SVG  R     +A  +++    G IIIDSGTTLT    + Y  V       I  
Sbjct: 293 ITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINL 352

Query: 346 KRVDDPSGILELCY-SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAP 405
             V+  S   +LC+ +     +L IP    HF GG D++L   N F   ++ + CL    
Sbjct: 353 PVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICLAMGS 412

Query: 406 ATQ-VAIFGNLAQINFEVGYDLGNKRLSFEPKLC 434
           ++Q ++IFGN+ Q N  V YD GN  +SF    C
Sbjct: 413 SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CsaV3_4G006960 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.4e-56
Identity = 144/394 (36.55%), Positives = 207/394 (52.54%), Query Frame = 0

Query: 46  SLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIA 105
           +L++Y+ +  A +R   R  ++   L S S   I +P+    GE+LM++ IGTP  +  A
Sbjct: 54  NLTKYELIKRAIKRGERRMRSINAMLQSSSG--IETPVYAGDGEYLMNVAIGTPDSSFSA 113

Query: 106 IADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSC 165
           I DTGSDL WTQC PC +CF+Q  PIFNP+ SSS+  + C S  C+ L S  C  +   C
Sbjct: 114 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--EC 173

Query: 166 SYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSL 225
            Y Y YGD S T G +A++  T  +  +P    GCG  N G   G  +G+IG+G G LSL
Sbjct: 174 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 233

Query: 226 VSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPD-TFYF 285
            SQ+    GV  +FSYC+ ++ S++  T  +      V  G    ST L+  S + T+Y+
Sbjct: 234 PSQL----GV-GQFSYCMTSYGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYY 293

Query: 286 LTLEAISVGKKRFKAANGISAMTNH--GNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAK 345
           +TL+ I+VG       +    + +   G +IIDSGTTLT LP+  Y  V       I   
Sbjct: 294 ITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLP 353

Query: 346 RVDDPSGILELCY-SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPA 405
            VD+ S  L  C+        + +P I+  F GG  + L   N     A+ V CL    +
Sbjct: 354 TVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSS 413

Query: 406 TQ--VAIFGNLAQINFEVGYDLGNKRLSFEPKLC 434
           +Q  ++IFGN+ Q   +V YDL N  +SF P  C
Sbjct: 414 SQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CsaV3_4G006960 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 3.0e-54
Identity = 133/368 (36.14%), Positives = 192/368 (52.17%), Query Frame = 0

Query: 76  TACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPR 135
           ++ + S +   SGE+   + +GTP   V  + DTGSD+ W QC PCR C++QS PIF+PR
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 187

Query: 136 RSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPK 195
           +S +Y  + C+S  CR L+S  C    ++C Y  SYGD SFT GD +++ +T    ++  
Sbjct: 188 KSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG 247

Query: 196 TVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGT 255
             +GCGH N G F G  +G++GLG G LS   Q  T      +FSYCL    S ++   +
Sbjct: 248 VALGCGHDNEGLFVG-AAGLLGLGKGKLSFPGQ--TGHRFNQKFSYCL-VDRSASSKPSS 307

Query: 256 ISFGRKAVVSGRQVVSTPLVPRSP-DTFYFLTLEAISVGKKRFKAANGISA------MTN 315
           + FG  AV   R    TPL+     DTFY++ L  ISVG  R     G++A         
Sbjct: 308 VVFGNAAV--SRIARFTPLLSNPKLDTFYYVGLLGISVGGTR---VPGVTASLFKLDQIG 367

Query: 316 HGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPII 375
           +G +IIDSGT++T L R  Y  +        K  +      + + C+    ++++ +P +
Sbjct: 368 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTV 427

Query: 376 TAHFAGGADVKLLPVNTFAPVADN-VTCLTFAPAT-QVAIFGNLAQINFEVGYDLGNKRL 435
             HF  GADV L   N   PV  N   C  FA     ++I GN+ Q  F V YDL + R+
Sbjct: 428 VLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 485

BLAST of CsaV3_4G006960 vs. TrEMBL
Match: tr|A0A0A0KZZ3|A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 1.5e-233
Identity = 434/434 (100.00%), Positives = 434/434 (100.00%), Query Frame = 0

Query: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60
           MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS
Sbjct: 1   MAAISIXXXXXXXXXXXXXXXXXXXXXXTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120
           FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180
           CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240
           LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300
           YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360
           NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420
           DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYD 420

Query: 421 LGNKRLSFEPKLCA 435
           LGNKRLSFEPKLCA
Sbjct: 421 LGNKRLSFEPKLCA 434

BLAST of CsaV3_4G006960 vs. TrEMBL
Match: tr|A0A1S3BT75|A0A1S3BT75_CUCME (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=3 SV=1)

HSP 1 Score: 738.4 bits (1905), Expect = 9.2e-210
Identity = 369/405 (91.11%), Positives = 382/405 (94.32%), Query Frame = 0

Query: 30  TSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSGE 89
           TSL+ RDS LSPLHNPSLSRYDSL+++FRRSFSRSATLL HLTSVSTACIRSPIIPDSGE
Sbjct: 30  TSLYHRDSLLSPLHNPSLSRYDSLVESFRRSFSRSATLLNHLTSVSTACIRSPIIPDSGE 89

Query: 90  FLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDT 149
           FLMSIFIGTP VN IAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSC+SDT
Sbjct: 90  FLMSIFIGTPRVNFIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCSSDT 149

Query: 150 CRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFG 209
           CRSLES HCG DL+SCSYGYSYGDRSFTYGDLASD+ITIGSFKLPKTVIGCGHQNGGTFG
Sbjct: 150 CRSLESSHCGLDLKSCSYGYSYGDRSFTYGDLASDKITIGSFKLPKTVIGCGHQNGGTFG 209

Query: 210 GVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQV 269
           GVTSGIIGLGGGSLSLVSQM TIAGVKP+FSYCLPTFFSN NITG ISFGRKAVVSGRQV
Sbjct: 210 GVTSGIIGLGGGSLSLVSQMSTIAGVKPQFSYCLPTFFSNENITGKISFGRKAVVSGRQV 269

Query: 270 VSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYY 329
           VSTPLVPRSPDTFYFLTLEAISVG KRFKAA  +SAMTN GNIIIDSGTTLTLLPRSLY 
Sbjct: 270 VSTPLVPRSPDTFYFLTLEAISVGNKRFKAAKDMSAMTNQGNIIIDSGTTLTLLPRSLYD 329

Query: 330 GVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPV 389
           GV STLARVIK KRVDDPSGILELCYSAGQ++DLNIPIITAHF+G ADVKLLPVNTFAPV
Sbjct: 330 GVVSTLARVIKTKRVDDPSGILELCYSAGQLEDLNIPIITAHFSGRADVKLLPVNTFAPV 389

Query: 390 ADNVTCLTFAPATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
           ADNV CLT APAT VAIFGNLAQINFEVGYDLGNKRLSF+P  CA
Sbjct: 390 ADNVICLTLAPATNVAIFGNLAQINFEVGYDLGNKRLSFKPTRCA 434
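
In these alignments the middle line encodes the match state of each column: a letter marks an identical residue, `+` marks a conservative (positive-scoring) substitution, and a space marks a mismatch or gap. The following is a minimal sketch, assuming that convention, for counting both from a single middle-line segment; `midline_counts` is a hypothetical helper name.

```python
def midline_counts(midline):
    """Return (identities, positives) for one alignment middle line.

    Letters are identical residues; '+' marks a conservative substitution.
    Positives include identities, as in the HSP header statistics.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Middle line of the last segment (query 390 onward) of the alignment above.
segment = "ADNV CLT APAT VAIFGNLAQINFEVGYDLGNKRLSF+P  CA"
print(midline_counts(segment))  # (39, 40)
```

This covers only the final 45-column segment; summing the counts over every segment of an HSP should reproduce the totals in its header, provided trailing spaces in the middle lines have not been stripped.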

BLAST of CsaV3_4G006960 vs. TrEMBL
Match: tr|A0A1S3BUB0|A0A1S3BUB0_CUCME (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493257 PE=3 SV=1)

HSP 1 Score: 446.8 bits (1148), Expect = 5.5e-122
Identity = 234/410 (57.07%), Positives = 291/410 (70.98%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           TTSLF RDS LSPL   SLS YD L +AFRRS SRSA LL    +     ++SPI P SG
Sbjct: 29  TTSLFHRDSLLSPLEFSSLSHYDRLSNAFRRSLSRSAALLNRAATSGAVGLQSPIAPGSG 88

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           E+LMS+ IGTPPV+ I +ADTGSDLTW QCLPC +CF QS+PIFNP +S+S+  V C S 
Sbjct: 89  EYLMSVSIGTPPVDYIGLADTGSDLTWAQCLPCVKCFKQSRPIFNPLKSTSFSHVPCNSQ 148

Query: 149 TCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTF 208
            C++++  HCG     C Y Y+YGD+++T GDL  ++ITIGS  + K+VIGCGH++GG F
Sbjct: 149 ICQAIDDAHCGVQ-GVCDYSYTYGDQTYTKGDLGLEKITIGSSSV-KSVIGCGHESGGGF 208

Query: 209 GGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ 268
            G  SG+IGLGGG LSLVSQM   +G+  RFSYCLPT  S+AN  G I+FG+ AVVSG  
Sbjct: 209 -GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKINFGQNAVVSGPG 268

Query: 269 VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLY 328
           VVSTPL+ + P T+Y++TLEAIS+G +R  A+         GN+IIDSGTTLT+LP+ LY
Sbjct: 269 VVSTPLISKDPVTYYYITLEAISIGNERHMAS------AKQGNVIIDSGTTLTVLPKELY 328

Query: 329 YGVFSTLARVIKAKRVDDPSGILELCYSAG--QVDDLNIPIITAHFAGGADVKLLPVNTF 388
            GV S+L +V+KAKRV DP    +LC+  G        IPIITAHF+GGA+V LLPVNTF
Sbjct: 329 DGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSGIPIITAHFSGGANVNLLPVNTF 388

Query: 389 APVADNVTCLTF---APATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLC 434
             VA+NV CLT    +P  +  I GNLAQ NF +GYDL  KRLSF+P +C
Sbjct: 389 QKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 427

BLAST of CsaV3_4G006960 vs. TrEMBL
Match: tr|A0A0A0KV20|A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 444.5 bits (1142), Expect = 2.7e-121
Identity = 233/411 (56.69%), Positives = 290/411 (70.56%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           TTSLF RDS LSPL   SLS YD L +AFRRS SRSA LL    +     ++S I P SG
Sbjct: 31  TTSLFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSIGPGSG 90

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           E+LMS+ IGTPPV+ + IADTGSDLTW QCLPC +C+ Q +PIFNP +S+S+  V C + 
Sbjct: 91  EYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQ 150

Query: 149 TCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTF 208
           TC +++  HCG     C Y Y+YGDR+++ GDL  ++ITIGS  + K+VIGCGH + G F
Sbjct: 151 TCHAVDDGHCGVQ-GVCDYSYTYGDRTYSKGDLGFEKITIGSSSV-KSVIGCGHASSGGF 210

Query: 209 GGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ 268
            G  SG+IGLGGG LSLVSQM   +G+  RFSYCLPT  S+AN  G I+FG  AVVSG  
Sbjct: 211 -GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKINFGENAVVSGPG 270

Query: 269 VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLY 328
           VVSTPL+ ++  T+Y++TLEAIS+G +R        A    GN+IIDSGTTLT+LP+ LY
Sbjct: 271 VVSTPLISKNTVTYYYITLEAISIGNERH------MAFAKQGNVIIDSGTTLTILPKELY 330

Query: 329 YGVFSTLARVIKAKRVDDPSGILELCYSAG--QVDDLNIPIITAHFAGGADVKLLPVNTF 388
            GV S+L +V+KAKRV DP G L+LC+  G      L IP+ITAHF+GGA+V LLP+NTF
Sbjct: 331 DGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPINTF 390

Query: 389 APVADNVTCLTF---APATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 435
             VADNV CLT    +P T+  I GNLAQ NF +GYDL  KRLSF+P +CA
Sbjct: 391 RKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430

BLAST of CsaV3_4G006960 vs. TrEMBL
Match: tr|A0A1S3BTZ4|A0A1S3BTZ4_CUCME (probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493256 PE=3 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 1.9e-114
Identity = 224/410 (54.63%), Positives = 279/410 (68.05%), Query Frame = 0

Query: 29  TTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDSG 88
           TTSLF RDS LSPL   +LS YD L +AFRRS SRSA LL    +     ++SPI P SG
Sbjct: 31  TTSLFHRDSLLSPLEFSTLSHYDRLSNAFRRSLSRSAALLNRTATSGAVGLQSPIAPGSG 90

Query: 89  EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
           E+LM + IGTPPV+ I + DTGSDLTW QCLPCR+CF Q +PIFNP +S+S+  V C S 
Sbjct: 91  EYLMYVSIGTPPVDYIGMIDTGSDLTWAQCLPCRKCFLQLRPIFNPLKSTSFSHVPCNSQ 150

Query: 149 TCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTF 208
            C++++  HCG     C Y Y+YGD+          +ITIGS  + K+VIGCGH++GG F
Sbjct: 151 ICQAIDDAHCGVQ-GVCDYSYTYGDQXXXXXXXXXXKITIGSSSV-KSVIGCGHESGGGF 210

Query: 209 GGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQ 268
            G  SG+IGLGGG LSLVSQM   +G+  RFSYCLP    +AN  G I+F + AVVSG  
Sbjct: 211 -GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPPLLGHAN--GKINFAQNAVVSGPG 270

Query: 269 VVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLY 328
           VVSTPL+ + P T+Y++TLEAIS+G +R  A+         GN+IIDSGTTLT+LP+ LY
Sbjct: 271 VVSTPLISKDPVTYYYITLEAISIGNERHMAS------AKQGNVIIDSGTTLTVLPKELY 330

Query: 329 YGVFSTLARVIKAKRVDDPSGILELCYSAG--QVDDLNIPIITAHFAGGADVKLLPVNTF 388
            GV S+L +V+KAKRV DP    +LC+  G        IPIITAHF+GGA+V LLPVNTF
Sbjct: 331 DGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSGIPIITAHFSGGANVNLLPVNTF 390

Query: 389 APVADNVTCLTF---APATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLC 434
             VA+NV CLT    +P  +  I GNLAQ NF +GYDL  KRLSF+P +C
Sbjct: 391 QKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429

The following BLAST results are available for this feature:
Match Name        E-value    Identity  Description
XP_004149004.1    2.3e-233   100.00    PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus] >KGN53446.1 hy...
XP_008452150.1    1.4e-209   91.11     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
XP_023543528.1    1.2e-168   72.66     aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]
XP_022942027.1    2.9e-167   72.66     aspartic proteinase CDR1-like [Cucurbita moschata]
XP_023552860.1    1.7e-162   69.52     aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]

Match Name        E-value    Identity  Description
AT1G64830.1       3.0e-105   50.12     Eukaryotic aspartyl protease family protein
AT2G35615.1       4.7e-95    45.90     Eukaryotic aspartyl protease family protein
AT5G33340.1       4.7e-95    45.89     Eukaryotic aspartyl protease family protein
AT1G31450.1       2.3e-94    45.99     Eukaryotic aspartyl protease family protein
AT2G28010.1       6.5e-60    40.67     Eukaryotic aspartyl protease family protein

Match Name              E-value    Identity  Description
sp|Q3EBM5|ASPR1_ARATH   8.5e-94    45.90     Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561...
sp|Q6XBF8|CDR1_ARATH    8.5e-94    45.89     Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1
sp|Q766C3|NEP1_NEPGR    9.9e-58    36.29     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
sp|Q766C2|NEP2_NEPGR    1.4e-56    36.55     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
sp|Q9LNJ3|APF2_ARATH    3.0e-54    36.14     Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...

Match Name                      E-value    Identity  Description
tr|A0A0A0KZZ3|A0A0A0KZZ3_CUCSA  1.5e-233   100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1
tr|A0A1S3BT75|A0A1S3BT75_CUCME  9.2e-210   91.11     probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493255 PE=...
tr|A0A1S3BUB0|A0A1S3BUB0_CUCME  5.5e-122   57.07     probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493257 PE=...
tr|A0A0A0KV20|A0A0A0KV20_CUCSA  2.7e-121   56.69     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G055400 PE=3 SV=1
tr|A0A1S3BTZ4|A0A1S3BTZ4_CUCME  1.9e-114   54.63     probable aspartic protease At2g35615 OS=Cucumis melo OX=3656 GN=LOC103493256 PE=...

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: INTERPRO
Term        Definition
IPR034161   Pepsin-like_plant
IPR033121   PEPTIDASE_A1
IPR001969   Aspartic_peptidase_AS
IPR001461   Aspartic_peptidase_A1
IPR032799   TAXi_C
IPR032861   TAXi_N
IPR021109   Peptidase_aspartic_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_4G006960.1  CsaV3_4G006960.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                        Source       Source Term       Source Description                Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D       G3DSA:2.40.70.10  -                                 coord: 263..434; e-value: 3.2E-42; score: 146.3
IPR021109  Aspartic peptidase domain superfamily  GENE3D       G3DSA:2.40.70.10  -                                 coord: 68..259; e-value: 1.7E-48; score: 167.3
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  SSF50630          Acid proteases                    coord: 85..433
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543           TAXi_N                            coord: 90..259; e-value: 3.1E-51; score: 174.0
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541           TAXi_C                            coord: 283..429; e-value: 1.5E-25; score: 89.8
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR13683         ASPARTYL PROTEASES                coord: 5..433
None       No IPR available                       PANTHER      PTHR13683:SF524   ASPARTIC PROTEINASE CDR1-RELATED  coord: 5..433
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141           ASP_PROTEASE                      coord: 312..323
IPR033121  Peptidase family A1 domain             PROSITE      PS51767           PEPTIDASE_A1                      coord: 90..429; score: 42.83
IPR034161  Pepsin-like domain, plant              CDD          cd05476           pepsin_A_like_plant               coord: 89..433; e-value: 7.85266E-66; score: 212.893
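
The `coord: start..end` values above are 1-based, inclusive residue positions on the predicted protein. The following is a minimal sketch for turning them into intervals, computing span lengths, and checking whether one annotation lies inside another; the helper names `parse_coord`, `span_length`, and `contains` are hypothetical and not part of InterProScan.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 90..259'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no 'coord: start..end' field found")
    return int(m.group(1)), int(m.group(2))


def span_length(start, end):
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


def contains(outer, inner):
    """True if interval `inner` lies entirely within interval `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


taxi_n = parse_coord("coord: 90..259")        # PF14543, TAXi_N
taxi_c = parse_coord("coord: 283..429")       # PF14541, TAXi_C
active_site = parse_coord("coord: 312..323")  # PS00141, ASP_PROTEASE

print(span_length(*taxi_n))           # 170
print(contains(taxi_c, active_site))  # True
```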

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None