CsGy1G033050 (gene) Cucumber (Gy14) v2

NameCsGy1G033050
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionaspartic proteinase nepenthesin-2
LocationChr1 : 32785405 .. 32786961 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTTCGATATCATCAATCTCCACTGCAACAAAATTCTTGAGCCTATTCCTTCTTCTTGTACATGTCTCAACACAAACCCTAGCAACAAACCCTAAAACCAATTTCCCCAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACATCCCTCCTTACACCCAAAAAAGGCTATAATTTCATTTCAAAGAAGAGAATGAAGGCAATGGATCAGACGGATGGTGATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTAATGTCCTTATCAATAGGGACACCCCCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGATCTCACGTGGGTTCCTTGTGGTAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATATTTCTGGCCCAAGATTGGCCGCCTTTTTGCCTACTCATTCTTCTACTTCTATTCGAGACACGTGTGGTAGTTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCTTAGCTAGCCTTGTGAAGGGCACTTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGAGCAAGTGGGGTTGTAACTGGAAGTTTAACAAGAGATGTTCTTTTTACGCATGGAAATTATAATAACAATAATAACAATAATAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGAGCAACTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGAAGAGGCTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTTTTACCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAATCTTGCCATTTCTTCAAAAGATGAAAATTTGCAATTTACCCCTTTGTTGAAAAGTCCAATGTACCCTAACTATTACTATATTGGGCTTGAGTCAATTACCATTGGGAATGGGGATAATAATTTTAGATTTGGGGTTTCTTTTAAATTGAGAGAGATTGATACAAAGGGTAATGGAGGGATGTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAACTTATTTCTAATCTTGAATTAGTGATAGGTTATCCAAGAGCCAAGCAAGTTGAACTCAATACTGGGTTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTCTTCTTTTGTTGATGACGCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTCAGTGTTGTTTTGCCTCAAGGCAATAACTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGGTTAAATGTTTGTTGTATCAAAGCATGGACGGTGTTGGTGACGATAACGACAGTGACGACAATGGGCCGGCGGGTATTTTCGGAAGCTTTCAACAGCAAAATATAGAGGTCGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGTTTCTGTTGCTGCCAAACAGGGACTTCACAAGAATGTTAGAAGGAATGAAAGTTGA

mRNA sequence

ATGCCTTCGATATCATCAATCTCCACTGCAACAAAATTCTTGAGCCTATTCCTTCTTCTTGTACATGTCTCAACACAAACCCTAGCAACAAACCCTAAAACCAATTTCCCCAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACATCCCTCCTTACACCCAAAAAAGGCTATAATTTCATTTCAAAGAAGAGAATGAAGGCAATGGATCAGACGGATGGTGATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTAATGTCCTTATCAATAGGGACACCCCCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGATCTCACGTGGGTTCCTTGTGGTAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATATTTCTGGCCCAAGATTGGCCGCCTTTTTGCCTACTCATTCTTCTACTTCTATTCGAGACACGTGTGGTAGTTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCTTAGCTAGCCTTGTGAAGGGCACTTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGAGCAAGTGGGGTTGTAACTGGAAGTTTAACAAGAGATGTTCTTTTTACGCATGGAAATTATAATAACAATAATAACAATAATAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGAGCAACTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGAAGAGGCTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTTTTACCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAATCTTGCCATTTCTTCAAAAGATGAAAATTTGCAATTTACCCCTTTGTTGAAAAGTCCAATGTACCCTAACTATTACTATATTGGGCTTGAGTCAATTACCATTGGGAATGGGGATAATAATTTTAGATTTGGGGTTTCTTTTAAATTGAGAGAGATTGATACAAAGGGTAATGGAGGGATGTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAACTTATTTCTAATCTTGAATTAGTGATAGGTTATCCAAGAGCCAAGCAAGTTGAACTCAATACTGGGTTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTCTTCTTTTGTTGATGACGCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTCAGTGTTGTTTTGCCTCAAGGCAATAACTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGGTTAAATGTTTGTTGTATCAAAGCATGGACGGTGTTGGTGACGATAACGACAGTGACGACAATGGGCCGGCGGGTATTTTCGGAAGCTTTCAACAGCAAAATATAGAGGTCGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGTTTCTGTTGCTGCCAAACAGGGACTTCACAAGAATGTTAGAAGGAATGAAAGTTGA

Coding sequence (CDS)

ATGCCTTCGATATCATCAATCTCCACTGCAACAAAATTCTTGAGCCTATTCCTTCTTCTTGTACATGTCTCAACACAAACCCTAGCAACAAACCCTAAAACCAATTTCCCCAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACATCCCTCCTTACACCCAAAAAAGGCTATAATTTCATTTCAAAGAAGAGAATGAAGGCAATGGATCAGACGGATGGTGATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTAATGTCCTTATCAATAGGGACACCCCCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGATCTCACGTGGGTTCCTTGTGGTAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATATTTCTGGCCCAAGATTGGCCGCCTTTTTGCCTACTCATTCTTCTACTTCTATTCGAGACACGTGTGGTAGTTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCTTAGCTAGCCTTGTGAAGGGCACTTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGAGCAAGTGGGGTTGTAACTGGAAGTTTAACAAGAGATGTTCTTTTTACGCATGGAAATTATAATAACAATAATAACAATAATAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGAGCAACTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGAAGAGGCTTACTTTCTCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTTTTACCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAATCTTGCCATTTCTTCAAAAGATGAAAATTTGCAATTTACCCCTTTGTTGAAAAGTCCAATGTACCCTAACTATTACTATATTGGGCTTGAGTCAATTACCATTGGGAATGGGGATAATAATTTTAGATTTGGGGTTTCTTTTAAATTGAGAGAGATTGATACAAAGGGTAATGGAGGGATGTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAACTTATTTCTAATCTTGAATTAGTGATAGGTTATCCAAGAGCCAAGCAAGTTGAACTCAATACTGGGTTTGATCTTTGTTATAAAGTTCCTTGTAAAAACAACAATTCTTCTTTTGTTGATGACGCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTCAGTGTTGTTTTGCCTCAAGGCAATAACTTCTATGCCATGGCTGCTCCAATTAACTCCACTGTGGTTAAATGTTTGTTGTATCAAAGCATGGACGGTGTTGGTGACGATAACGACAGTGACGACAATGGGCCGGCGGGTATTTTCGGAAGCTTTCAACAGCAAAATATAGAGGTCGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGTTTCTGTTGCTGCCAAACAGGGACTTCACAAGAATGTTAGAAGGAATGAAAGTTGA

Protein sequence

MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES
BLAST of CsGy1G033050 vs. NCBI nr
Match: XP_004145478.2 (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus] >KGN66888.1 hypothetical protein Csa_1G704590 [Cucumis sativus])

HSP 1 Score: 1047.0 bits (2706), Expect = 2.2e-302
Identity = 518/518 (100.00%), Postives = 518/518 (100.00%), Query Frame = 0

Query: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60
           MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN
Sbjct: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60

Query: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120
           FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN
Sbjct: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120

Query: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180
           LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS
Sbjct: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180

Query: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG 240
           LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG
Sbjct: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG 240

Query: 241 ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD 300
           ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
Sbjct: 241 ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD 300

Query: 301 ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT 360
           ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT
Sbjct: 301 ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT 360

Query: 361 YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 420
           YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF
Sbjct: 361 YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 420

Query: 421 HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ 480
           HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ
Sbjct: 421 HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ 480

Query: 481 NIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 519
           NIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES
Sbjct: 481 NIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 518

BLAST of CsGy1G033050 vs. NCBI nr
Match: XP_008459091.1 (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 998.8 bits (2581), Expect = 6.8e-288
Identity = 495/520 (95.19%), Postives = 504/520 (96.92%), Query Frame = 0

Query: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60
           MPSISS S ATKFLSLFLLLVH S QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN
Sbjct: 1   MPSISSTSIATKFLSLFLLLVHASKQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60

Query: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120
           FISKKRMKAMDQ DGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN
Sbjct: 61  FISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120

Query: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180
           LSFDCQDCEEYQNNISGP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS
Sbjct: 121 LSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180

Query: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY----XXXXXXXXQIPRFCF 240
           LA+LVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLF HGNY    XXXXXXXX +PRFCF
Sbjct: 181 LATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYXXXXXXXXXXXXXVPRFCF 240

Query: 241 GCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI 300
           GCVGATYREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAI
Sbjct: 241 GCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLAI 300

Query: 301 SSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLID 360
           SSKDENLQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLID
Sbjct: 301 SSKDENLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLID 360

Query: 361 SGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLP 420
           SGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKVPCKNNNSSFVDD+QLP
Sbjct: 361 SGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQLP 420

Query: 421 SITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGS 480
           SITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGS
Sbjct: 421 SITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGS 480

Query: 481 FQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRN 517
           FQQQN++VVYDLEKERLGFQ MDCVSVAA QGLHKNVRRN
Sbjct: 481 FQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKNVRRN 520

BLAST of CsGy1G033050 vs. NCBI nr
Match: XP_023520027.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 827.8 bits (2137), Expect = 2.1e-236
Identity = 411/506 (81.23%), Postives = 449/506 (88.74%), Query Frame = 0

Query: 10  ATKFLSLFLLLVHVS----TQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKK 69
           A  F  L L+LV VS     QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN +S+K
Sbjct: 6   AKSFFVLVLVLVLVSGEAMGQTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRK 65

Query: 70  RMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDC 129
           R+K M+   G+D+VIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGNLSFDC
Sbjct: 66  RIKPMEM--GNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDC 125

Query: 130 QDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLV 189
           QDC+EYQNN+ GP+LAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLA+LV
Sbjct: 126 QDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLV 185

Query: 190 KGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYRE 249
           KGTCPRPCPSF+YTYGASG+V G+LT+DV+F HGN         +IP+FCFGCVGATYRE
Sbjct: 186 KGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGN---SPNSSRKIPKFCFGCVGATYRE 245

Query: 250 PIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQF 309
           PIGIAGFGRGLLSLP+QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK E+L+F
Sbjct: 246 PIGIAGFGRGLLSLPYQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK-EHLKF 305

Query: 310 TPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLP 369
           TPLLKSP YPNYYYIGLESITIGNG+N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLP
Sbjct: 306 TPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLP 365

Query: 370 EPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNN 429
           EPLYSQLISNLE +I YPRAK+ ELNTGFDLCYKVP K NN+ F D+ +LPSITFHFLNN
Sbjct: 366 EPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSITFHFLNN 425

Query: 430 VSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVV 489
           VSVVLPQGN+FYAMAAP NSTVVKCLL+QSMDG GD       GPAGIFGSFQQQN+EVV
Sbjct: 426 VSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVV 485

Query: 490 YDLEKERLGFQPMDCVSVAAKQGLHK 512
           YDLEKERLGF+ MDC SVA  QGLHK
Sbjct: 486 YDLEKERLGFEGMDCASVAVSQGLHK 496

BLAST of CsGy1G033050 vs. NCBI nr
Match: XP_022927421.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 819.3 bits (2115), Expect = 7.3e-234
Identity = 408/511 (79.84%), Postives = 446/511 (87.28%), Query Frame = 0

Query: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60
           M SI++ S     L L L+      QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN
Sbjct: 1   MASIAARSYFVLVLVLVLVSGEAMGQTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYN 60

Query: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120
            +  KR+K M+   G+D+VIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGN
Sbjct: 61  SLLTKRIKPMEM--GNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGN 120

Query: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180
           LSFDCQDC+EYQNN+ GP+LAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCS
Sbjct: 121 LSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCS 180

Query: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG 240
           LA+LVKGTCPRPCPSF+YTYGASG+V G+LT+DV+F HGN         +IP+FCFGCVG
Sbjct: 181 LATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGN---SPNSSRKIPKFCFGCVG 240

Query: 241 ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD 300
           ATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK 
Sbjct: 241 ATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK- 300

Query: 301 ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT 360
           E+L+FTP LKSP YPNYYYIGLESITIGNG+N  RFGVS +LREIDTKGNGG+LIDSGTT
Sbjct: 301 EHLKFTPFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTT 360

Query: 361 YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 420
           YTHLPEPLYSQLISNLE +I YPRAK+ ELNTGFDLCYKVP K NN+ F D+ +LPSITF
Sbjct: 361 YTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSITF 420

Query: 421 HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ 480
           HFLNNVSVVLPQGN+FYAMAAP NSTVVKCLL+QSMDG GD       GPAGIFGSFQQQ
Sbjct: 421 HFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQ 480

Query: 481 NIEVVYDLEKERLGFQPMDCVSVAAKQGLHK 512
           N+EVVYDLEKERLGF+ MDC SVA  QGLHK
Sbjct: 481 NLEVVYDLEKERLGFEAMDCASVAVSQGLHK 496

BLAST of CsGy1G033050 vs. NCBI nr
Match: XP_023000974.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 815.8 bits (2106), Expect = 8.1e-233
Identity = 405/504 (80.36%), Postives = 442/504 (87.70%), Query Frame = 0

Query: 10  ATKFLSLFLLLV--HVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRM 69
           A  F  L L+LV      QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN +S+KR+
Sbjct: 6   ARSFFVLVLVLVSGEAMGQTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRI 65

Query: 70  KAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQD 129
           K M+   GDD+VIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGNLSFDCQD
Sbjct: 66  KPMEM--GDDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQD 125

Query: 130 CEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKG 189
           C+EYQNN+ GP+LAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLA+LVKG
Sbjct: 126 CDEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKG 185

Query: 190 TCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREPI 249
            CPRPCPSF+YTYGASG+V G+LT+D +F HGN         +IP+FCFGCVGATYREPI
Sbjct: 186 ACPRPCPSFSYTYGASGLVIGTLTKDAIFIHGN---SPNSSRKIPKFCFGCVGATYREPI 245

Query: 250 GIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTP 309
           GIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNP FSSPLILGNLAISSK E+L+FTP
Sbjct: 246 GIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPKFSSPLILGNLAISSK-EHLKFTP 305

Query: 310 LLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP 369
           LLKSP YPNYYYIGLESITIGNG+N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEP
Sbjct: 306 LLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEP 365

Query: 370 LYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVS 429
           LYSQLIS LE +I YPRAK+ ELNTGFDLCYKVP K NN+ F D+ +LPSITFHFLNNVS
Sbjct: 366 LYSQLISILESLISYPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSITFHFLNNVS 425

Query: 430 VVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYD 489
           VVLPQGN+FYAMAAP NSTVVKCLL+QSMDG GD       GPAGIFGSFQQQN+EVVYD
Sbjct: 426 VVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVVYD 485

Query: 490 LEKERLGFQPMDCVSVAAKQGLHK 512
           LEKERLGF+ MDC SVA  QGLHK
Sbjct: 486 LEKERLGFEAMDCASVAVSQGLHK 494

BLAST of CsGy1G033050 vs. TAIR10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 581.6 bits (1498), Expect = 4.7e-166
Identity = 284/440 (64.55%), Postives = 342/440 (77.73%), Query Frame = 0

Query: 77  DNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN-I 136
           D V+EPLRE+RDGYL++L+IGTPPQ VQVY+DTGSDLTWVPCGNLSFDC +C + +NN +
Sbjct: 70  DVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDL 129

Query: 137 SGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPS 196
             P  + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPS
Sbjct: 130 KSP--SVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPS 189

Query: 197 FAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREPIGIAGFGRG 256
           FAYTYG  G+++G LTRD+L               +PRF FGCV +TYREPIGIAGFGRG
Sbjct: 190 FAYTYGEGGLISGILTRDIL---------KARTRDVPRFSFGCVTSTYREPIGIAGFGRG 249

Query: 257 LLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDENLQFTPLLKSPMY 316
           LLSLP QLGF  KGFSHCFLPFKF NNPN SSPLILG  A+S +  ++LQFTP+L +PMY
Sbjct: 250 LLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMY 309

Query: 317 PNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLIS 376
           PN YYIGLESITIG   N     V   LR+ D++GNGGML+DSGTTYTHLPEP YSQL++
Sbjct: 310 PNSYYIGLESITIGT--NITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLT 369

Query: 377 NLELVIGYPRAKQVELNTGFDLCYKVPCKNNN-SSFVDDAQL--PSITFHFLNNVSVVLP 436
            L+  I YPRA + E  TGFDLCYKVPC NNN +S  +D  +  PSITFHFLNN +++LP
Sbjct: 370 TLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLP 429

Query: 437 QGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKE 496
           QGN+FYAM+AP + +VV+CLL+Q+M       +  D GPAG+FGSFQQQN++VVYDLEKE
Sbjct: 430 QGNSFYAMSAPSDGSVVQCLLFQNM-------EDGDYGPAGVFGSFQQQNVKVVYDLEKE 489

Query: 497 RLGFQPMDCVSVAAKQGLHK 512
           R+GFQ MDCV  AA  GL++
Sbjct: 490 RIGFQAMDCVLEAASHGLNQ 489

BLAST of CsGy1G033050 vs. TAIR10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 190.7 bits (483), Expect = 2.3e-48
Identity = 142/448 (31.70%), Postives = 195/448 (43.53%), Query Frame = 0

Query: 90  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 149
           YL+SLS+G+    V +Y+DTGSDL W PC    F C  CE      S P   +       
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPC--RPFTCILCE------SKPLPPSXXXXXXX 142

Query: 150 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTC---PRPCPSFAYTYGASGVV 209
                                 D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 XXXXXXXXXXXXXXXXXXLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 210 TGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGF- 269
              L  D L               +  F FGC   T  EPIG+AGFGRG LSLP QL   
Sbjct: 203 VAKLYSDSL---------SLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVH 262

Query: 270 -SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLAISSKDENL------------------ 329
             H G  FS+C +   F S+     SPLILG   +  K++ +                  
Sbjct: 263 SPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRF-VDKKEKRVGXXXXXXXXXXXXXXXXE 322

Query: 330 -QFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYT 389
             FT +L++P +P +Y + L+ I+IG  +          LR ID  G GG+++DSGTT+T
Sbjct: 323 FVFTEMLENPKHPYFYSVSLQGISIGKRN----IPAPAMLRRIDKNGGGGVVVDSGTTFT 382

Query: 390 HLPEPLYSQLISNLELVIG--YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 449
            LP   Y+ ++   +  +G  + RA +VE ++G   CY +             ++P++  
Sbjct: 383 MLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN---------QTVKVPALVL 442

Query: 450 HFL-NNVSVVLPQGNNFYAMA----APINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFG 504
           HF  N  SV LP+ N FY              + CL+  +    G D      G   I G
Sbjct: 443 HFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMN----GGDESELRGGTGAILG 494

BLAST of CsGy1G033050 vs. TAIR10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 179.1 bits (453), Expect = 7.0e-45
Identity = 133/421 (31.59%), Postives = 188/421 (44.66%), Query Frame = 0

Query: 89  GYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHS 148
           GY +SLS GTP Q +    DTGS L W+PC +  + C  C+   + +    +  F+P +S
Sbjct: 89  GYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTS-RYLCSGCD--FSGLDPTLIPRFIPKNS 148

Query: 149 STSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTG 208
           S+S    C S  C  ++    P   C   GC   +     C   CP +   YG       
Sbjct: 149 SSSKIIGCQSPKCQFLY---GPNVQC--RGCDPNT---RNCTVGCPPYILQYGLGSTAGV 208

Query: 209 SLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSHK 268
            +T  + F              +P F  GC   + R+P GIAGFGRG +SLP Q+    K
Sbjct: 209 LITEKLDF----------PDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL--K 268

Query: 269 GFSHCFLPFKFSNNPNFSSPLILGNLA---ISSKDENLQFTPLLKSPMYPN-----YYYI 328
            FSHC +  +F ++ N ++ L L   +     SK   L +TP  K+P   N     YYY+
Sbjct: 269 RFSHCLVSRRF-DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYL 328

Query: 329 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL-ELV 388
            L  I +G         + +K     T G+GG ++DSG+T+T +  P++  +       +
Sbjct: 329 NLRRIYVGRK----HVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM 388

Query: 389 IGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAM 448
             Y R K +E  TG   C+ +  K        D  +P + F F     + LP  N F   
Sbjct: 389 SNYTREKDLEKETGLGPCFNISGKG-------DVTVPELIFEFKGGAKLELPLSNYF--- 448

Query: 449 AAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMD 501
              + +T   CL   S   V   N S   GPA I GSFQQQN  V YDLE +R GF    
Sbjct: 449 -TFVGNTDTVCLTVVSDKTV---NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKK 467

BLAST of CsGy1G033050 vs. TAIR10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 147.9 bits (372), Expect = 1.7e-35
Identity = 142/509 (27.90%), Postives = 202/509 (39.69%), Query Frame = 0

Query: 13  FLSLFLL----LVHVSTQT----LATNPKTNFPKDSLVLGLVHSRTSLLT-PKKGYNFIS 72
           FLSLFLL    +  VS       L    K+ FP  +  L L   R   L+  +K   F+ 
Sbjct: 11  FLSLFLLPPSNIAAVSNHNKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRRKPIPFVK 70

Query: 73  KKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSF 132
              +       G              Y + L IG PPQ + +  DTGSDL WV C     
Sbjct: 71  SPVVSGAASGSGQ-------------YFVDLRIGQPPQSLLLIADTGSDLVWVKCS---- 130

Query: 133 DCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAS 192
            C++C  +           F P HSST     C    C  +   D        A     +
Sbjct: 131 ACRNCSHHS------PATVFFPRHSSTFSPAHCYDPVCRLVPKPDR-------APICNHT 190

Query: 193 LVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGC----- 252
            +  TC      + Y Y    + +G   R+      +         ++    FGC     
Sbjct: 191 RIHSTC-----HYEYGYADGSLTSGLFARETT----SLKTSSGKEARLKSVAFGCGFRIS 250

Query: 253 ----VGATYREPIGIAGFGRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILG 312
                G ++    G+ G GRG +S   QLG  F +K FS+C + +  S  P  +S LI+G
Sbjct: 251 GQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK-FSYCLMDYTLSPPP--TSYLIIG 310

Query: 313 NLAISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGG 372
           N         L FTPLL +P+ P +YY+ L+S+ +    N  +  +   + EID  GNGG
Sbjct: 311 N--GGDGISKLFFTPLLTNPLSPTFYYVKLKSVFV----NGAKLRIDPSIWEIDDSGNGG 370

Query: 373 MLIDSGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDD 432
            ++DSGTT   L EP Y  +I+ +   +  P A    L  GFDLC  V     +     +
Sbjct: 371 TVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIAD--ALTPGFDLCVNV-----SGVTKPE 430

Query: 433 AQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMD-GVGDDNDSDDNGPA 492
             LP + F F      V P  N F           ++CL  QS+D  VG           
Sbjct: 431 KILPRLKFEFSGGAVFVPPPRNYFIE-----TEEQIQCLAIQSVDPKVG----------F 449

Query: 493 GIFGSFQQQNIEVVYDLEKERLGFQPMDC 501
            + G+  QQ     +D ++ RLGF    C
Sbjct: 491 SVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449

BLAST of CsGy1G033050 vs. TAIR10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 143.7 bits (361), Expect = 3.3e-34
Identity = 156/512 (30.47%), Postives = 216/512 (42.19%), Query Frame = 0

Query: 13  FLSLFLLLVHVSTQTLATNPKT---NFPKDSLVLGLVH-----SRTSLLTPKKGYN--FI 72
           FL LF  L+ VS+   +   +T   N P+    L L H     + T +   ++G N  F 
Sbjct: 14  FLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKIQKIQRGINRGFH 73

Query: 73  SKKRMKAM------DQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWV 132
              R+ A+       + D  +N+  P       +LM LSIG P       +DTGSDL W 
Sbjct: 74  RLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWT 133

Query: 133 PCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTI 192
            C      C +C +    I       F P  SS+  +  C S  C  +  S+   D    
Sbjct: 134 QCK----PCTECFDQPTPI-------FDPEKSSSYSKVGCSSGLCNALPRSNCNED---- 193

Query: 193 AGCSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCF 252
                    K  C      + YTYG      G L  +        XXXXXXXX       
Sbjct: 194 ---------KDAC-----EYLYTYGDYSSTRGLLATETF----XXXXXXXXXXXXXXXXX 253

Query: 253 GCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA- 312
                 + +  G+ G GRG LSL  QL      FS+C    + S     SS L +G+LA 
Sbjct: 254 XXXXXXFSQGSGLVGLGRGPLSLISQL--KETKFSYCLTSIEDS---EASSSLFIGSLAS 313

Query: 313 -------ISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTK 372
                   S   E  +   LL++P  P++YY+ L+ IT+G      R  V     E+   
Sbjct: 314 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAK----RLSVEKSTFELAED 373

Query: 373 GNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSS 432
           G GGM+IDSGTT T+L E  +  L       +  P       +TG DLC+K+P    N +
Sbjct: 374 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLP--VDDSGSTGLDLCFKLPDAAKNIA 433

Query: 433 FVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDN 492
                 +P + FHF     + LP G N+  M A  +ST V CL   S +G+         
Sbjct: 434 ------VPKMIFHF-KGADLELP-GENY--MVAD-SSTGVLCLAMGSSNGM--------- 458

Query: 493 GPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 501
               IFG+ QQQN  V++DLEKE + F P +C
Sbjct: 494 ---SIFGNVQQQNFNVLHDLEKETVSFVPTEC 458

BLAST of CsGy1G033050 vs. Swiss-Prot
Match: sp|Q940R4|ASP63_ARATH (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 4.2e-47
Identity = 142/448 (31.70%), Postives = 195/448 (43.53%), Query Frame = 0

Query: 90  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 149
           YL+SLS+G+    V +Y+DTGSDL W PC    F C  CE      S P   +       
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPC--RPFTCILCE------SKPLPPSXXXXXXX 142

Query: 150 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTC---PRPCPSFAYTYGASGVV 209
                                 D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 XXXXXXXXXXXXXXXXXXLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 210 TGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGF- 269
              L  D L               +  F FGC   T  EPIG+AGFGRG LSLP QL   
Sbjct: 203 VAKLYSDSL---------SLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVH 262

Query: 270 -SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLAISSKDENL------------------ 329
             H G  FS+C +   F S+     SPLILG   +  K++ +                  
Sbjct: 263 SPHLGNSFSYCLVSHSFDSDRVRRPSPLILGRF-VDKKEKRVGXXXXXXXXXXXXXXXXE 322

Query: 330 -QFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYT 389
             FT +L++P +P +Y + L+ I+IG  +          LR ID  G GG+++DSGTT+T
Sbjct: 323 FVFTEMLENPKHPYFYSVSLQGISIGKRN----IPAPAMLRRIDKNGGGGVVVDSGTTFT 382

Query: 390 HLPEPLYSQLISNLELVIG--YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 449
            LP   Y+ ++   +  +G  + RA +VE ++G   CY +             ++P++  
Sbjct: 383 MLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCYYLN---------QTVKVPALVL 442

Query: 450 HFL-NNVSVVLPQGNNFYAMA----APINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFG 504
           HF  N  SV LP+ N FY              + CL+  +    G D      G   I G
Sbjct: 443 HFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMN----GGDESELRGGTGAILG 494

BLAST of CsGy1G033050 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 5.3e-34
Identity = 131/442 (29.64%), Postives = 182/442 (41.18%), Query Frame = 0

Query: 64  KKRMKAMDQTDGDDNVIEPLREIRDG-YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLS 123
           ++RM++++      + IE      DG YLM+++IGTP       MDTGSDL W  C    
Sbjct: 69  ERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE--- 128

Query: 124 FDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA 183
             C  C      I       F P  SS+     C S +C D+     P + C    C   
Sbjct: 129 -PCTQCFSQPTPI-------FNPQDSSSFSTLPCESQYCQDL-----PSETCNNNECQ-- 188

Query: 184 SLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGC---- 243
                          YTYG      GS T+  + T             +P   FGC    
Sbjct: 189 ---------------YTYGYG---DGSTTQGYMATE----TFTFETSSVPNIAFGCGEDN 248

Query: 244 VGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS 303
            G       G+ G G G LSLP QLG     FS+C   +  S+     S L LG+ A S 
Sbjct: 249 QGFGQGNGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSYGSSS----PSTLALGS-AASG 308

Query: 304 KDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSG 363
             E    T L+ S + P YYYI L+ IT+G GDN    G+     ++   G GGM+IDSG
Sbjct: 309 VPEGSPSTTLIHSSLNPTYYYITLQGITVG-GDN---LGIPSSTFQLQDDGTGGMIIDSG 368

Query: 364 TTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSI 423
           TT T+LP+  Y+ +       I  P     E ++G   C++ P   +        Q+P I
Sbjct: 369 TTLTYLPQDAYNAVAQAFTDQINLPTVD--ESSSGLSTCFQQPSDGST------VQVPEI 428

Query: 424 TFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQ 483
           +  F   V  +  Q      + +P    +  CL   S   +G            IFG+ Q
Sbjct: 429 SMQFDGGVLNLGEQN----ILISPAEGVI--CLAMGSSSQLG----------ISIFGNIQ 435

Query: 484 QQNIEVVYDLEKERLGFQPMDC 501
           QQ  +V+YDL+   + F P  C
Sbjct: 489 QQETQVLYDLQNLAVSFVPTQC 435

BLAST of CsGy1G033050 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 1.3e-32
Identity = 122/433 (28.18%), Postives = 183/433 (42.26%), Query Frame = 0

Query: 78  NVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISG 137
           +V+  L +    Y   L +GTP + V + +DTGSD+ W+ C      C+ C    + I  
Sbjct: 130 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA----PCRRCYSQSDPIFD 189

Query: 138 PRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFA 197
           PR        S T     C S  C  + S          AGC+     + TC      + 
Sbjct: 190 PR-------KSKTYATIPCSSPHCRRLDS----------AGCNTR---RKTC-----LYQ 249

Query: 198 YTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGC--------VGATYREPIGI 257
            +YG      G  + + L    N         ++     GC        VGA      G+
Sbjct: 250 VSYGDGSFTVGDFSTETLTFRRN---------RVKGVALGCGHDNEGLFVGAA-----GL 309

Query: 258 AGFGRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTP 317
            G G+G LS P Q G  F+ K FS+C +    S+ P   S ++ GN A+S      +FTP
Sbjct: 310 LGLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP---SSVVFGNAAVS---RIARFTP 369

Query: 318 LLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP 377
           LL +P    +YY+GL  I++G        GV+  L ++D  GNGG++IDSGT+ T L  P
Sbjct: 370 LLSNPKLDTFYYVGLLGISVGGTRVP---GVTASLFKLDQIGNGGVIIDSGTSVTRLIRP 429

Query: 378 LYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVS 437
            Y  +       +G    K+    + FD C+ +       S +++ ++P++  HF     
Sbjct: 430 AYIAMRDAFR--VGAKTLKRAPDFSLFDTCFDL-------SNMNEVKVPTVVLHF-RGAD 484

Query: 438 VVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYD 497
           V LP  N       P+++    C  +           +   G   I G+ QQQ   VVYD
Sbjct: 490 VSLPATN----YLIPVDTNGKFCFAF-----------AGTMGGLSIIGNIQQQGFRVVYD 484

Query: 498 LEKERLGFQPMDC 501
           L   R+GF P  C
Sbjct: 550 LASSRVGFAPGGC 484

BLAST of CsGy1G033050 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 3.2e-31
Identity = 127/418 (30.38%), Postives = 170/418 (40.67%), Query Frame = 0

Query: 90  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 149
           YLM+LSIGTP Q     MDTGSDL W         CQ C +  N  +      F P  SS
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWT-------QCQPCTQCFNQST----PIFNPQGSS 154

Query: 150 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 209
           +     C S  C  + S                     TC      + Y YG      GS
Sbjct: 155 SFSTLPCSSQLCQALSSP--------------------TCSNNFCQYTYGYGDGSETQGS 214

Query: 210 LTRDVLFTHGNYXXXXXXXXQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPFQLGF 269
           +  + L T G+          IP   FGC     G       G+ G GRG LSLP QL  
Sbjct: 215 MGTETL-TFGS--------VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV 274

Query: 270 SHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESI 329
           +   FS+C  P   S   N    L+LG+LA +S       T L++S   P +YYI L  +
Sbjct: 275 TK--FSYCMTPIGSSTPSN----LLLGSLA-NSVTAGSPNTTLIQSSQIPTFYYITLNGL 334

Query: 330 TIGNGDNNFRFGV---SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGY 389
           ++G    + R  +   +F L      G GG++IDSGTT T+     Y  +       I  
Sbjct: 335 SVG----STRLPIDPSAFALN--SNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINL 394

Query: 390 PRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAP 449
           P       ++GFDLC++ P   +N       Q+P+   HF +   + LP  N F    +P
Sbjct: 395 PVVN--GSSSGFDLCFQTPSDPSN------LQIPTFVMHF-DGGDLELPSENYF---ISP 434

Query: 450 INSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 501
            N  +  CL   S            +    IFG+ QQQN+ VVYD     + F    C
Sbjct: 455 SNGLI--CLAMGS-----------SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CsGy1G033050 vs. Swiss-Prot
Match: sp|Q6XBF8|CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 2.6e-28
Identity = 123/435 (28.28%), Postives = 181/435 (41.61%), Query Frame = 0

Query: 77  DNVIEPLREIRDG---YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQN 136
           DN  +P  ++      YLM++SIGTPP  +    DTGSDL W  C      C DC    +
Sbjct: 74  DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCA----PCDDCYTQVD 133

Query: 137 NISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPC 196
            +       F P  SST    +C SS C  + +          A CS       TC    
Sbjct: 134 PL-------FDPKTSSTYKDVSCSSSQCTALENQ---------ASCSTND---NTC---- 193

Query: 197 PSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGC----VGATYREPIGI 256
            S++ +YG +    G++  D L T G+         Q+     GC     G   ++  GI
Sbjct: 194 -SYSLSYGDNSYTKGNIAVDTL-TLGS---SDTRPMQLKNIIIGCGHNNAGTFNKKGSGI 253

Query: 257 AGFGRGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPL 316
            G G G +SL  QLG S  G FS+C +P   ++  + +S +  G  AI S    +  TPL
Sbjct: 254 VGLGGGPVSLIKQLGDSIDGKFSYCLVP--LTSKKDQTSKINFGTNAIVS-GSGVVSTPL 313

Query: 317 LKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPL 376
           +       +YY+ L+SI++G+    +    S           G ++IDSGTT T LP   
Sbjct: 314 IAKASQETFYYLTLKSISVGSKQIQYSGSDS-------ESSEGNIIIDSGTTLTLLPTEF 373

Query: 377 YSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSV 436
           YS+L   +   I     K+ +  +G  LCY         S   D ++P IT HF +   V
Sbjct: 374 YSELEDAVASSI--DAEKKQDPQSGLSLCY---------SATGDLKVPVITMHF-DGADV 433

Query: 437 VLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDL 496
            L   N F  +     S  + C  ++                  I+G+  Q N  V YD 
Sbjct: 434 KLDSSNAFVQV-----SEDLVCFAFRGSPSF------------SIYGNVAQMNFLVGYDT 437

Query: 497 EKERLGFQPMDCVSV 504
             + + F+P DC  +
Sbjct: 494 VSKTVSFKPTDCAKM 437

BLAST of CsGy1G033050 vs. TrEMBL
Match: tr|A0A0A0LYP0|A0A0A0LYP0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G704590 PE=3 SV=1)

HSP 1 Score: 1047.0 bits (2706), Expect = 1.4e-302
Identity = 518/518 (100.00%), Postives = 518/518 (100.00%), Query Frame = 0

Query: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60
           MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN
Sbjct: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60

Query: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120
           FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN
Sbjct: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120

Query: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180
           LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS
Sbjct: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180

Query: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG 240
           LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG
Sbjct: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVG 240

Query: 241 ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD 300
           ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
Sbjct: 241 ATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD 300

Query: 301 ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT 360
           ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT
Sbjct: 301 ENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTT 360

Query: 361 YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 420
           YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF
Sbjct: 361 YTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITF 420

Query: 421 HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ 480
           HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ
Sbjct: 421 HFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQ 480

Query: 481 NIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 519
           NIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES
Sbjct: 481 NIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRNES 518

BLAST of CsGy1G033050 vs. TrEMBL
Match: tr|A0A1S3CAK9|A0A1S3CAK9_CUCME (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 SV=1)

HSP 1 Score: 998.8 bits (2581), Expect = 4.5e-288
Identity = 495/520 (95.19%), Postives = 504/520 (96.92%), Query Frame = 0

Query: 1   MPSISSISTATKFLSLFLLLVHVSTQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60
           MPSISS S ATKFLSLFLLLVH S QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN
Sbjct: 1   MPSISSTSIATKFLSLFLLLVHASKQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYN 60

Query: 61  FISKKRMKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120
           FISKKRMKAMDQ DGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN
Sbjct: 61  FISKKRMKAMDQMDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGN 120

Query: 121 LSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180
           LSFDCQDCEEYQNNISGP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS
Sbjct: 121 LSFDCQDCEEYQNNISGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCS 180

Query: 181 LASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY----XXXXXXXXQIPRFCF 240
           LA+LVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLF HGNY    XXXXXXXX +PRFCF
Sbjct: 181 LATLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFMHGNYXXXXXXXXXXXXXVPRFCF 240

Query: 241 GCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI 300
           GCVGATYREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAI
Sbjct: 241 GCVGATYREPIGIAGFGRGLLSLPFQLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLAI 300

Query: 301 SSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLID 360
           SSKDENLQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLID
Sbjct: 301 SSKDENLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLID 360

Query: 361 SGTTYTHLPEPLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLP 420
           SGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKVPCKNNNSSFVDD+QLP
Sbjct: 361 SGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQLP 420

Query: 421 SITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGS 480
           SITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGS
Sbjct: 421 SITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGS 480

Query: 481 FQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKNVRRN 517
           FQQQN++VVYDLEKERLGFQ MDCVSVAA QGLHKNVRRN
Sbjct: 481 FQQQNLQVVYDLEKERLGFQAMDCVSVAANQGLHKNVRRN 520

BLAST of CsGy1G033050 vs. TrEMBL
Match: tr|V4TN99|V4TN99_9ROSI (Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10017863mg PE=3 SV=1)

HSP 1 Score: 647.1 bits (1668), Expect = 3.3e-182
Identity = 331/508 (65.16%), Postives = 394/508 (77.56%), Query Frame = 0

Query: 8   STATKFLSLFLLLVHVS-TQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKR 67
           S     + LFLL + ++  QTLAT  + N  K SLVLGL +SR SLL P    + I KK 
Sbjct: 6   SNIATIILLFLLSMSLTFHQTLAT--QKNNGKHSLVLGLTNSRASLLIPSASKSSI-KKP 65

Query: 68  MKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQ 127
            + +D       ++EPLRE+RDGYL+SL+IGTP QV+QVYMDTGSDLTWVPCGNLSFDC 
Sbjct: 66  SETLD-------MMEPLREVRDGYLISLNIGTPTQVIQVYMDTGSDLTWVPCGNLSFDCV 125

Query: 128 DCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVK 187
           DC++Y+NN     ++ F P+ SS+S RDTC SSFC++IHSSDNPFDPCT++GCSL++L+K
Sbjct: 126 DCDDYRNN---KLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLK 185

Query: 188 GTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREP 247
            TC RPCPSFAYTYG  G+VTG LTRD L  HG+         +IP+FCFGCVG+TYREP
Sbjct: 186 STCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS---SPGIIREIPKFCFGCVGSTYREP 245

Query: 248 IGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFT 307
           IGIAGFGRG LS+P QLGF  KGFSHCFL FK++N+PN SSPL+LG++AISSKD NLQFT
Sbjct: 246 IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVLGDVAISSKD-NLQFT 305

Query: 308 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 367
           P+LKSPMYPNYYYIGLE+ITIGN        V   LRE D++GNGG+L+DSGTTYTHLPE
Sbjct: 306 PMLKSPMYPNYYYIGLEAITIGNSSLT---EVPLSLREFDSQGNGGLLVDSGTTYTHLPE 365

Query: 368 PLYSQLISNLELVIG-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNN 427
           P YSQL+S L+  I  YPRAK+VE  TGFDLCY+VPC NN  +F DD   PSITFHFLNN
Sbjct: 366 PFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN--TFTDDL-FPSITFHFLNN 425

Query: 428 VSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVV 487
           VS+VLPQGN+FYAM+AP NS+ VKCLL+QSM       D  D GP+G+FGSFQQQN+EVV
Sbjct: 426 VSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-------DDGDYGPSGVFGSFQQQNVEVV 483

Query: 488 YDLEKERLGFQPMDCVSVAAKQGLHKNV 514
           YDLEKER+GFQPMDC S A+ QGLHK +
Sbjct: 486 YDLEKERIGFQPMDCASTASAQGLHKKL 483

BLAST of CsGy1G033050 vs. TrEMBL
Match: tr|A0A2H5N1B2|A0A2H5N1B2_CITUN (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_007680 PE=3 SV=1)

HSP 1 Score: 646.4 bits (1666), Expect = 5.6e-182
Identity = 329/508 (64.76%), Postives = 394/508 (77.56%), Query Frame = 0

Query: 8   STATKFLSLFLLLVHVS-TQTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKR 67
           S     + LFLL + +   QTLAT  + N  K SLVLGL +SR SLL P    + + KK 
Sbjct: 6   SNIATIILLFLLYMSLRFHQTLAT--QKNNGKHSLVLGLTNSRASLLIPSASKSSV-KKP 65

Query: 68  MKAMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQ 127
            + +D       ++EPLRE+RDGYL+SL+IGTP QV+QVYMDTGSDLTWVPCGNLSFDC 
Sbjct: 66  SETLD-------MMEPLREVRDGYLISLNIGTPTQVIQVYMDTGSDLTWVPCGNLSFDCV 125

Query: 128 DCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVK 187
           DC++Y+NN     ++ F P+ SS+S RDTC SSFC++IHSSDNPFDPCT++GCSL++L+K
Sbjct: 126 DCDDYRNN---KLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLK 185

Query: 188 GTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREP 247
            TC RPCPSFAYTYG  G+VTG LTRD+L  HG+         +IP+FCFGCVG+TYREP
Sbjct: 186 STCCRPCPSFAYTYGEGGLVTGILTRDILKVHGS---SPGIIREIPKFCFGCVGSTYREP 245

Query: 248 IGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFT 307
           IGIAGFG+G LS+P QLGF  KGFSHCFL FK++N+PN SSPL+LG++AISSKD NLQFT
Sbjct: 246 IGIAGFGKGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVLGDVAISSKD-NLQFT 305

Query: 308 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 367
           P+LKSPMYPNYYYIGLE+ITIGN        V   LRE D++GNGG+L+DSGTTYTHLPE
Sbjct: 306 PMLKSPMYPNYYYIGLEAITIGNSSLT---EVPLSLREFDSQGNGGLLVDSGTTYTHLPE 365

Query: 368 PLYSQLISNLELVIG-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNN 427
           P YSQL+S L+  I  YPRAK+VE  TGFDLCY+VPC NN  +F DD   PSITFHFLNN
Sbjct: 366 PFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN--TFTDDL-FPSITFHFLNN 425

Query: 428 VSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVV 487
           VS+VLPQGN+FYAM+AP NS+ VKCLL+QSM       D  D GP+G+FGSFQQQN+EVV
Sbjct: 426 VSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-------DDGDYGPSGVFGSFQQQNVEVV 483

Query: 488 YDLEKERLGFQPMDCVSVAAKQGLHKNV 514
           YDLEKER+GFQPMDC S A+ QGLHK +
Sbjct: 486 YDLEKERIGFQPMDCASTASAQGLHKKL 483

BLAST of CsGy1G033050 vs. TrEMBL
Match: tr|A0A2R6R206|A0A2R6R206_ACTCH (Aspartyl protease OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc11327 PE=3 SV=1)

HSP 1 Score: 640.6 bits (1651), Expect = 3.1e-180
Identity = 324/508 (63.78%), Postives = 386/508 (75.98%), Query Frame = 0

Query: 15  SLFLLLVHVSTQTLATNPKTNFPKD------SLVLGLVHSRTSLLTPKKGYNFISKKRMK 74
           +LF +      Q L  N   NF +       SLVLGL HSR  L TP +  +F S    K
Sbjct: 3   TLFTIQASFLFQFLLFNALFNFSQSHAIRPHSLVLGLKHSRAILPTPMEA-SFDSTSNYK 62

Query: 75  AMDQTDGDDNVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDC 134
             +  D    ++EPLRE+RDGY++SL++GTPPQV+ VYMDTGSDLTWVPCGNLSFDC DC
Sbjct: 63  PAEMLD----MMEPLREVRDGYIISLTLGTPPQVIPVYMDTGSDLTWVPCGNLSFDCMDC 122

Query: 135 EEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGT 194
           ++++NN     +A+F P++SS+S RD+C S FC+D+HSSDN +DPCT+AGCSL++LVK T
Sbjct: 123 DDHRNN---KLMASFSPSYSSSSFRDSCASPFCIDVHSSDNSYDPCTVAGCSLSTLVKAT 182

Query: 195 CPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNYXXXXXXXXQIPRFCFGCVGATYREPIG 254
           C RPCPSFAYTYG  GVV GSLTRD+L  HG          +I +FCFGCVG+TYREPIG
Sbjct: 183 CSRPCPSFAYTYG-DGVVVGSLTRDILRVHG--ISSPGVTREISKFCFGCVGSTYREPIG 242

Query: 255 IAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPL 314
           IAGFG+G LSLP QLG+  KGFSHCF+ FKF+NNPN SSPL++G+LAISSKD +LQFTP+
Sbjct: 243 IAGFGKGPLSLPSQLGYLQKGFSHCFVAFKFANNPNISSPLVVGDLAISSKD-HLQFTPM 302

Query: 315 LKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPL 374
           LKSPMYPNYYYIGLE +T+GN        V   LRE D+ GNGGMLIDSGTTYTHLP+ L
Sbjct: 303 LKSPMYPNYYYIGLEGLTVGNSSAT---KVPLSLREFDSLGNGGMLIDSGTTYTHLPQSL 362

Query: 375 YSQLISNLELVIGYPRAKQVELNTGFDLCYKVPC-KNNNSSFV----DDAQLPSITFHFL 434
           YSQL+S ++ VI YPRAK VE  TGF LCYK+PC  NNNSS +    DD  LP+ITFHFL
Sbjct: 363 YSQLLSVIQSVIPYPRAKDVEARTGFGLCYKIPCTSNNNSSTIFTDHDDLLLPTITFHFL 422

Query: 435 NNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIE 494
           NNVS++LPQGN+FYAM AP NSTVVKCLL QSM     D+D ++ GPAG+FGSFQQQN+E
Sbjct: 423 NNVSLLLPQGNHFYAMGAPSNSTVVKCLLLQSM-----DDDENEYGPAGVFGSFQQQNVE 482

Query: 495 VVYDLEKERLGFQPMDCVSVAAKQGLHK 512
           VVYDLEKER+GFQPMDC S AA QGLHK
Sbjct: 483 VVYDLEKERIGFQPMDCASAAASQGLHK 490

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004145478.22.2e-302100.00PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus] >KGN66888.1 hypot... [more]
XP_008459091.16.8e-28895.19PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo][more]
XP_023520027.12.1e-23681.23probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
XP_022927421.17.3e-23479.84probable aspartyl protease At4g16563 [Cucurbita moschata][more]
XP_023000974.18.1e-23380.36probable aspartyl protease At4g16563 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT5G45120.14.7e-16664.55Eukaryotic aspartyl protease family protein[more]
AT4G16563.12.3e-4831.70Eukaryotic aspartyl protease family protein[more]
AT3G52500.17.0e-4531.59Eukaryotic aspartyl protease family protein[more]
AT3G25700.11.7e-3527.90Eukaryotic aspartyl protease family protein[more]
AT2G03200.13.3e-3430.47Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
sp|Q940R4|ASP63_ARATH4.2e-4731.70Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
sp|Q766C2|NEP2_NEPGR5.3e-3429.64Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
sp|Q9LNJ3|APF2_ARATH1.3e-3228.18Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q766C3|NEP1_NEPGR3.2e-3130.38Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
sp|Q6XBF8|CDR1_ARATH2.6e-2828.28Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0LYP0|A0A0A0LYP0_CUCSA1.4e-302100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G704590 PE=3 SV=1[more]
tr|A0A1S3CAK9|A0A1S3CAK9_CUCME4.5e-28895.19aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 S... [more]
tr|V4TN99|V4TN99_9ROSI3.3e-18265.16Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10017863mg PE=3 ... [more]
tr|A0A2H5N1B2|A0A2H5N1B2_CITUN5.6e-18264.76Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_007680 PE=3 SV=1[more]
tr|A0A2R6R206|A0A2R6R206_ACTCH3.1e-18063.78Aspartyl protease OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc1... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: INTERPRO
TermDefinition
IPR034161Pepsin-like_plant
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR001461Aspartic_peptidase_A1
IPR021109Peptidase_aspartic_dom_sf
IPR032799TAXi_C
IPR032861TAXi_N
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G033050.1CsGy1G033050.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 90..293
e-value: 1.5E-28
score: 100.2
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 318..496
e-value: 2.9E-26
score: 92.1
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 78..292
e-value: 1.1E-32
score: 115.6
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 293..505
e-value: 3.3E-46
score: 159.1
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 89..502
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 16..505
NoneNo IPR availablePANTHERPTHR13683:SF264ASPARTYL PROTEASE FAMILY PROTEINcoord: 16..505
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 353..364
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 90..496
score: 32.847
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 89..500
e-value: 4.88948E-69
score: 223.679

The following gene(s) are paralogous to this gene:

None