Cp4.1LG01g09670 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g09670
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: aspartyl protease AED3-like
Location: Cp4.1LG01: 7166280 .. 7168804 (-)
RNA-Seq Expression: Cp4.1LG01g09670
Synteny: Cp4.1LG01g09670
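
The Location field gives a start and end position on linkage group Cp4.1LG01, with (-) marking the reverse strand. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly), the span of the gene can be computed as follows. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field; the strand does not change the length.
gene_length = feature_length(7166280, 7168804)
print(gene_length)  # 2525 under the inclusive-coordinate assumption
```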
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATAGAAATTTCATTATTATTTTTTTTAACTTAATATATTAGAAGAAAAATAAAAATTTAATTAATTAAGATTAAATTGTTTTCTATAAATATCTTGTCTCACATTCTTCCCCACATACCCTTTCTTCACTTCCACACTCTCTCAGTCCCAAATTGAAGGCATGAAGACAAAATCCCCCTTCTTCCTCGCTCTCCTCCTCCTTCTCCTATCCATATCCGCCGAATCCCTCCACCACCACCACCACCACCAACACCCCAACTGCAACGCCGCCGCCGACCGCAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGTTCCCCCTTCCGCCCCTCCAAGCCACTGTCATGGGCCGACAATGTGCTCCAAATGCAGGCCAAGGACCAAGCCCGCCTCCAATTCCTGTCCTCCCTTGTCGCCCGCAAGTCCGTCGCCCCCATTTCCTCCGCCCGCCAGCTCATCCAGAGCCCCACCTTCGTGGTCCGGGCCAAGATCGGCACCCCTGCTCAGACCCTCCTCATGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCCTGTTCCGGCTGTGTCGGCTGCCCTTCCACCACCGTCTTCTCCTCCGATAAGTCCTCCTCCTTCCGCCCCCTCCCCTGCCGATCCCCTCAATGCAACCAGGTACCTTCCTCTGTTCTATTTCAAATCAATTTTCAATTTTTGCTGCTTTTATTTTTCCAACTCACCAATGTGGGGGCATTTCAATTCCCATATTCTACATCAAACCTTTATTATCGAAGTGCACGCGCTCACGCGCCGCCTGTATCGGTGGAGAGGCCAGATATCCGTTGGAATATTCAATGATTATAAACCTACCTTTATTCACCCACATAGCATTCCCCTTTTAGTTGGTAATATGTGGGGTTCCCCTCTTTATTATGCTTTTTTTTCCTTAAAAAAAATACTTTTAATTTTATTTCTAAATCCAATAAAATAAATTAATTGAATATATTTTTTTTTGAAAATTAATAAGAACTTCGGATGCTTTTGTAGAATCAAATCCATGTTCCCACGTGCCAGTGTCCTTTCCTTGAAGGTGACACGTGTGTCCAAAACTCCATTCCTCGTGATACGAGCCCCTCTTTATCACGTCAATCATGTAACAAATCAACTATTTCTTAAATTATTTAAAAAGAAAGGAATTAATTTCGGTAAAAGTTAAATAATAAATTATTTATTTATTTTTTCATTTTTGGCTTTCTCTACAATCTATGACTGTCAGTGAGTCACCGACGGGAACGTTCAGTCACGGGCCAAAAAACTTCTTCCCAACAATACTTAAATCGTATTATTAAACGTCGCACGTGACCCTTTTGTTTATCAGGTACCGAACCCCAGTTGCAGCAGTAGCTCGTGCGGCTTCAACTTGACGTACGGAAGCTCGACTGTAGCGGCGGATCTGGTTCAAGACAACATAACTCTGGCCAACGACTCGGTTCCGGCTTACACATTTGGGTGCATCACGAAGGCGACGGGTAGTTCAGTGCCGCCGCAGGGACTATTGGGCCTGGGTCGAGGCCCATTATCGCTTTTGGGCCAGAGCCAGAGTCTGTACCGGTCCACATTCTCGTACTGCCTTCCGAGCTTCAAATCGACCGGCTTTTCTGGGTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCAAGAGGATTAAGTACACGCCGCTGCTCCGAAACCCGAGGCGGTCGTCTTTGTATTATGTGAATCTAATCGCCATCAGAGTCGGCCGGCGAATCGTCGATATTCCTCCCTCTGCTCTTGCTTTTAACTCCGCCACCGGCGCCGGCACCATCATCGACTCCGGTATCGTTTTAAATAGATTTTAAAAATATATTTAAACTAAATTAATAAAAATACACTTTTAATTTTGAAAATATCTTTAATTCAGGGACGACATTCACAAGGCTGGTGGCACCGGCGTATACGGCGGTGAGGAACGAATTCCGGCGAAGAATAGGCAACGCAAC
CATCTCATCCCTCGGCGGCTTCGACACGTGCTACACAGTCCCCATCATCTCTCCCACCATAACCTTCATGTTCGCCGGAATGAACGTCACTCTTCCGCCGGACAACTTCCTCCTCCACAGCTCCGCCGGAACCACCACCTGCCTCGCCATGGCCGCCGCCCCGGATAACGTCAACTCCGTACTCAACGTCATTGCCAGCATGCAGCAGCAGAACCACCGCATTCTCTTCGACGTTCCCAATTCCCGAATCGGCGTCGCCCGCGAACCCTGCTCTTAAACCGCCGTCGTCCGCCGTATCATCTCCAGCTCCGGCGATATTTATTTTTGTTGGTTTTAGAATAATAAGAATGTGTGGTCATTATGGTTTGGTGGGGGGATGGGGGAAAGGGTAAAGTAGTAATTTTACATGGCTGTTTTTATAGTTGTGTTGTTTGGGATTCTTAAGGTTTTAGCTTATTATATCATGGATGTCTTAAAAGCAAAGCTTTTTTTTTTTCAGTCTACCACTAATAACTTAAGCCATTA

mRNA sequence

ATAGAAATTTCATTATTATTTTTTTTAACTTAATATATTAGAAGAAAAATAAAAATTTAATTAATTAAGATTAAATTGTTTTCTATAAATATCTTGTCTCACATTCTTCCCCACATACCCTTTCTTCACTTCCACACTCTCTCAGTCCCAAATTGAAGGCATGAAGACAAAATCCCCCTTCTTCCTCGCTCTCCTCCTCCTTCTCCTATCCATATCCGCCGAATCCCTCCACCACCACCACCACCACCAACACCCCAACTGCAACGCCGCCGCCGACCGCAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGTTCCCCCTTCCGCCCCTCCAAGCCACTGTCATGGGCCGACAATGTGCTCCAAATGCAGGCCAAGGACCAAGCCCGCCTCCAATTCCTGTCCTCCCTTGTCGCCCGCAAGTCCGTCGCCCCCATTTCCTCCGCCCGCCAGCTCATCCAGAGCCCCACCTTCGTGGTCCGGGCCAAGATCGGCACCCCTGCTCAGACCCTCCTCATGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCCTGTTCCGGCTGTGTCGGCTGCCCTTCCACCACCGTCTTCTCCTCCGATAAGTCCTCCTCCTTCCGCCCCCTCCCCTGCCGATCCCCTCAATGCAACCAGGTACCGAACCCCAGTTGCAGCAGTAGCTCGTGCGGCTTCAACTTGACGTACGGAAGCTCGACTGTAGCGGCGGATCTGGTTCAAGACAACATAACTCTGGCCAACGACTCGGTTCCGGCTTACACATTTGGGTGCATCACGAAGGCGACGGGTAGTTCAGTGCCGCCGCAGGGACTATTGGGCCTGGGTCGAGGCCCATTATCGCTTTTGGGCCAGAGCCAGAGTCTGTACCGGTCCACATTCTCGTACTGCCTTCCGAGCTTCAAATCGACCGGCTTTTCTGGGTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCAAGAGGATTAAGTACACGCCGCTGCTCCGAAACCCGAGGCGGTCGTCTTTGTATTATGTGAATCTAATCGCCATCAGAGTCGGCCGGCGAATCGTCGATATTCCTCCCTCTGCTCTTGCTTTTAACTCCGCCACCGGCGCCGGCACCATCATCGACTCCGGGACGACATTCACAAGGCTGGTGGCACCGGCGTATACGGCGGTGAGGAACGAATTCCGGCGAAGAATAGGCAACGCAACCATCTCATCCCTCGGCGGCTTCGACACGTGCTACACAGTCCCCATCATCTCTCCCACCATAACCTTCATGTTCGCCGGAATGAACGTCACTCTTCCGCCGGACAACTTCCTCCTCCACAGCTCCGCCGGAACCACCACCTGCCTCGCCATGGCCGCCGCCCCGGATAACGTCAACTCCGTACTCAACGTCATTGCCAGCATGCAGCAGCAGAACCACCGCATTCTCTTCGACGTTCCCAATTCCCGAATCGGCGTCGCCCGCGAACCCTGCTCTTAAACCGCCGTCGTCCGCCGTATCATCTCCAGCTCCGGCGATATTTATTTTTGTTGGTTTTAGAATAATAAGAATGTGTGGTCATTATGGTTTGGTGGGGGGATGGGGGAAAGGGTAAAGTAGTAATTTTACATGGCTGTTTTTATAGTTGTGTTGTTTGGGATTCTTAAGGTTTTAGCTTATTATATCATGGATGTCTTAAAAGCAAAGCTTTTTTTTTTTCAGTCTACCACTAATAACTTAAGCCATTA

Coding sequence (CDS)

ATGAAGACAAAATCCCCCTTCTTCCTCGCTCTCCTCCTCCTTCTCCTATCCATATCCGCCGAATCCCTCCACCACCACCACCACCACCAACACCCCAACTGCAACGCCGCCGCCGACCGCAGCTCCACCCTCCAAGTGTTCCACATTTTCAGCCCCTGTTCCCCCTTCCGCCCCTCCAAGCCACTGTCATGGGCCGACAATGTGCTCCAAATGCAGGCCAAGGACCAAGCCCGCCTCCAATTCCTGTCCTCCCTTGTCGCCCGCAAGTCCGTCGCCCCCATTTCCTCCGCCCGCCAGCTCATCCAGAGCCCCACCTTCGTGGTCCGGGCCAAGATCGGCACCCCTGCTCAGACCCTCCTCATGGCCCTCGATACCAGCAACGACGCCGCTTGGATTCCCTGTTCCGGCTGTGTCGGCTGCCCTTCCACCACCGTCTTCTCCTCCGATAAGTCCTCCTCCTTCCGCCCCCTCCCCTGCCGATCCCCTCAATGCAACCAGGTACCGAACCCCAGTTGCAGCAGTAGCTCGTGCGGCTTCAACTTGACGTACGGAAGCTCGACTGTAGCGGCGGATCTGGTTCAAGACAACATAACTCTGGCCAACGACTCGGTTCCGGCTTACACATTTGGGTGCATCACGAAGGCGACGGGTAGTTCAGTGCCGCCGCAGGGACTATTGGGCCTGGGTCGAGGCCCATTATCGCTTTTGGGCCAGAGCCAGAGTCTGTACCGGTCCACATTCTCGTACTGCCTTCCGAGCTTCAAATCGACCGGCTTTTCTGGGTCGTTGCGTCTTGGGCCTGTGGCCCAGCCCAAGAGGATTAAGTACACGCCGCTGCTCCGAAACCCGAGGCGGTCGTCTTTGTATTATGTGAATCTAATCGCCATCAGAGTCGGCCGGCGAATCGTCGATATTCCTCCCTCTGCTCTTGCTTTTAACTCCGCCACCGGCGCCGGCACCATCATCGACTCCGGGACGACATTCACAAGGCTGGTGGCACCGGCGTATACGGCGGTGAGGAACGAATTCCGGCGAAGAATAGGCAACGCAACCATCTCATCCCTCGGCGGCTTCGACACGTGCTACACAGTCCCCATCATCTCTCCCACCATAACCTTCATGTTCGCCGGAATGAACGTCACTCTTCCGCCGGACAACTTCCTCCTCCACAGCTCCGCCGGAACCACCACCTGCCTCGCCATGGCCGCCGCCCCGGATAACGTCAACTCCGTACTCAACGTCATTGCCAGCATGCAGCAGCAGAACCACCGCATTCTCTTCGACGTTCCCAATTCCCGAATCGGCGTCGCCCGCGAACCCTGCTCTTAA

Protein sequence

MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFNLTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRIGVAREPCS
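
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this with a small pure-Python translator, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical. It is applied only to the first eight codons and the last three codons of the CDS listed above, not to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGAAGACAAAATCCCCCTTCTTC"))  # MKTKSPFF
# Last three codons of the CDS above, including the TAA stop.
print(translate("TGCTCTTAA"))  # CS
```

Both fragments agree with the start (MKTKSPFF) and end (CS) of the protein sequence shown above.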
Homology
BLAST of Cp4.1LG01g09670 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 3.4e-114
Identity = 229/449 (51.00%), Positives = 299/449 (66.59%), Query Frame = 0

Query: 7   FFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK-PLSWA 66
           FFL LLL     +A         +     AA D S  L +  I + CSPF P+    S  
Sbjct: 10  FFLTLLLPFTFTTAT--------RDTCATAAPDGSDDLSIIPINAKCSPFAPTHVSASVI 69

Query: 67  DNVLQMQAKDQARLQFLSSLVARK---SVAPISSARQLIQSPTFVVRAKIGTPAQTLLMA 126
           D VL M + D  RL +LSSLVA K   +  P++S  QL     +VVRAK+GTP Q + M 
Sbjct: 70  DTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQL-HIGNYVVRAKLGTPPQLMFMV 129

Query: 127 LDTSNDAAWIPCSGCVGCP-STTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSS----- 186
           LDTSNDA W+PCSGC GC  ++T F+++ SS++  + C + QC Q    +C SSS     
Sbjct: 130 LDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSV 189

Query: 187 CGFNLTY-GSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSL 246
           C FN +Y G S+ +A LVQD +TLA D +P ++FGCI  A+G+S+PPQGL+GLGRGP+SL
Sbjct: 190 CSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSL 249

Query: 247 LGQSQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIA 306
           + Q+ SLY   FSYCLPSF+S  FSGSL+LG + QPK I+YTPLLRNPRR SLYYVNL  
Sbjct: 250 VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTG 309

Query: 307 IRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSL 366
           + VG   V + P  L F++ +GAGTIIDSGT  TR   P Y A+R+EFR+++  ++ S+L
Sbjct: 310 VSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTL 369

Query: 367 GGFDTCYTV--PIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLN 426
           G FDTC++     ++P IT     +++ LP +N L+HSSAGT TCL+MA    N N+VLN
Sbjct: 370 GAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 429

Query: 427 VIASMQQQNHRILFDVPNSRIGVAREPCS 443
           VIA++QQQN RILFDVPNSRIG+A EPC+
Sbjct: 430 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449
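
The percentages in each HSP summary line are the count of matching columns divided by the alignment length (the denominator includes gap columns). A minimal sketch that reproduces the figures reported for the O04496 and Q6F4N5 hits; the helper name `percent` is hypothetical:

```python
def percent(count: int, alignment_length: int) -> str:
    """Format count / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * count / alignment_length:.2f}%"


# O04496 (AED3, Arabidopsis thaliana): 229 identical and 299 similar columns out of 449.
print(percent(229, 449))  # 51.00%
print(percent(299, 449))  # 66.59%

# Q6F4N5 (AP25, Oryza sativa): 198 identical columns out of 430.
print(percent(198, 430))  # 46.05%
```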

BLAST of Cp4.1LG01g09670 vs. ExPASy Swiss-Prot
Match: Q6F4N5 (Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1)

HSP 1 Score: 354.4 bits (908), Expect = 1.9e-96
Identity = 198/430 (46.05%), Positives = 274/430 (63.72%), Query Frame = 0

Query: 38  ADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARKSV--APIS 97
           A  ++ L V+H   P SP     PL   ++++ +   D ARL FLSS  A   V  AP++
Sbjct: 19  AAAAAELSVYHNVHPSSP----SPL---ESIIALARDDDARLLFLSSKAATAGVSSAPVA 78

Query: 98  SARQLIQSPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFR 157
           S +     P++VVRA +G+P+Q LL+ALDTS DA W  CS C  CPS+++F+   SSS+ 
Sbjct: 79  SGQ---APPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYA 138

Query: 158 PLPCRSPQCNQVPNPSCSS--------------SSCGFNLTYGSSTVAADLVQDNITLAN 217
            LPC S  C      +C +               +C F+  +  ++  A L  D + L  
Sbjct: 139 SLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGK 198

Query: 218 DSVPAYTFGCITKATG--SSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGF 277
           D++P YTFGC++  TG  +++P QGLLGLGRGP++LL Q+ SLY   FSYCLPS++S  F
Sbjct: 199 DAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYF 258

Query: 278 SGSLRLGP-VAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGA 337
           SGSLRLG    QP+ ++YTP+LRNP RSSLYYVN+  + VG   V +P  + AF++ATGA
Sbjct: 259 SGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGA 318

Query: 338 GTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNAT-ISSLGGFDTCYTVPII----SPTIT- 397
           GT++DSGT  TR  AP Y A+R EFRR++   +  +SLG FDTC+    +    +P +T 
Sbjct: 319 GTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTV 378

Query: 398 FMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNS 443
            M  G+++ LP +N L+HSSA    CLAMA AP NVNSV+NVIA++QQQN R++FDV NS
Sbjct: 379 HMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANS 438

BLAST of Cp4.1LG01g09670 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 1.6e-47
Identity = 117/353 (33.14%), Positives = 181/353 (51.27%), Query Frame = 0

Query: 103 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGC--PSTTVFSSDKSSSFRPLPCR 162
           S  +  R  +GTPA+ + M LDT +D  W+ C+ C  C   S  +F   KS ++  +PC 
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198

Query: 163 SPQCNQVPNPSCSS--SSCGFNLTYG-SSTVAADLVQDNITLANDSVPAYTFGCITKATG 222
           SP C ++ +  C++   +C + ++YG  S    D   + +T   + V     GC     G
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEG 258

Query: 223 SSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYT 282
             V   GLLGLG+G LS  GQ+   +   FSYCL    ++    S+  G  A  +  ++T
Sbjct: 259 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 318

Query: 283 PLLRNPRRSSLYYVNLIAIRV-GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAY 342
           PLL NP+  + YYV L+ I V G R+  +  S    +     G IIDSGT+ TRL+ PAY
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378

Query: 343 TAVRNEFRRRIGNATIS---SLGGFDTCYTVPIIS----PTITFMFAGMNVTLPPDNFLL 402
            A+R+ F  R+G  T+        FDTC+ +  ++    PT+   F G +V+LP  N+L+
Sbjct: 379 IAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLI 438

Query: 403 HSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRIGVAREPCS 443
                   C A A         L++I ++QQQ  R+++D+ +SR+G A   C+
Sbjct: 439 PVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Cp4.1LG01g09670 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 1.4e-43
Identity = 116/348 (33.33%), Positives = 166/348 (47.70%), Query Frame = 0

Query: 103 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGC--PSTTVFSSDKSSSFRPLPCR 162
           S  + VR  +G+P +   M +D+ +D  W+ C  C  C   S  VF   KS S+  + C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 163 SPQCNQVPNPSCSSSSCGFNLTYG-SSTVAADLVQDNITLANDSVPAYTFGCITKATGSS 222
           S  C+++ N  C S  C + + YG  S     L  + +T A   V     GC  +  G  
Sbjct: 188 SSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMF 247

Query: 223 VPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPL 282
           +   GLLG+G G +S +GQ        F YCL S + T  +GSL  G  A P    + PL
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSLVFGREALPVGASWVPL 307

Query: 283 LRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAV 342
           +RNPR  S YYV L  + VG   + +P            G ++D+GT  TRL   AY A 
Sbjct: 308 VRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAF 367

Query: 343 RNEFRRRIGN-ATISSLGGFDTCYT----VPIISPTITFMFA-GMNVTLPPDNFLLHSSA 402
           R+ F+ +  N    S +  FDTCY     V +  PT++F F  G  +TLP  NFL+    
Sbjct: 368 RDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDD 427

Query: 403 GTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRIGVAREPC 442
             T C A AA+P      L++I ++QQ+  ++ FD  N  +G     C
Sbjct: 428 SGTYCFAFAASPTG----LSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Cp4.1LG01g09670 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 9.3e-43
Identity = 135/424 (31.84%), Positives = 206/424 (48.58%), Query Frame = 0

Query: 37  AADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISS 96
           A++  S+L+V H+   CS       +   D +++   +DQAR++ + S +++ S   +S 
Sbjct: 58  ASNTKSSLRVVHMHGACSHLSSDARVD-HDEIIR---RDQARVESIYSKLSKNSANEVSE 117

Query: 97  ARQ---------LIQSPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVG-CPS--TT 156
           A+           + S  ++V   IGTP   L +  DT +D  W  C  C+G C S    
Sbjct: 118 AKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEP 177

Query: 157 VFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFNLTYG-SSTVAADLVQDNITLAN-D 216
            F+   SS+++ + C SP C      SCS+S+C +++ YG  S     L ++  TL N D
Sbjct: 178 KFNPSSSSTYQNVSCSSPMCEDA--ESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD 237

Query: 217 SVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGFSGS 276
            +    FGC     G      GLLGLG G LSL  Q+ + Y + FSYCLPSF S   +G 
Sbjct: 238 VLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNS-TGH 297

Query: 277 LRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPSALAFNSATGAGTII 336
           L  G     + +K+TP+   P   + Y +++I I VG + + I P     NS +  G II
Sbjct: 298 LTFGSAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELAITP-----NSFSTEGAII 357

Query: 337 DSGTTFTRLVAPAYTAVRNEFRRRIGN-ATISSLGGFDTCYTV----PIISPTITFMFAG 396
           DSGT FTRL    Y  +R+ F+ ++ +  + S  G FDTCY       +  PTI F FAG
Sbjct: 358 DSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAG 417

Query: 397 MNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRIGVA 442
             V     + +      +  CLA A   D    +  +  ++QQ    +++DV   R+G A
Sbjct: 418 STVVELDGSGISLPIKISQVCLAFAGNDD----LPAIFGNVQQTTLDVVYDVAGGRVGFA 464

BLAST of Cp4.1LG01g09670 vs. NCBI nr
Match: XP_023526864.1 (aspartyl protease AED3-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 876 bits (2264), Expect = 0.0
Identity = 442/442 (100.00%), Positives = 442/442 (100.00%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180

Query: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300
           SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360
           RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFDVPNSRIGVAREPCS
Sbjct: 421 QNHRILFDVPNSRIGVAREPCS 442

BLAST of Cp4.1LG01g09670 vs. NCBI nr
Match: KAG6601125.1 (Aspartyl protease AED3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 865 bits (2235), Expect = 0.0
Identity = 437/442 (98.87%), Positives = 440/442 (99.55%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPFFLALLL LLSISAES HHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFFLALLL-LLSISAESHHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPI+SARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSS+CGFN
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSACGFN 180

Query: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300
           SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360
           RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFD+PNSRIGVAREPCS
Sbjct: 421 QNHRILFDIPNSRIGVAREPCS 441

BLAST of Cp4.1LG01g09670 vs. NCBI nr
Match: XP_022988941.1 (aspartyl protease AED3-like [Cucurbita maxima])

HSP 1 Score: 850 bits (2196), Expect = 1.49e-310
Identity = 431/442 (97.51%), Positives = 436/442 (98.64%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPF L+LLLLLLSISAESLHHHHH   PNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFLLSLLLLLLSISAESLHHHHH---PNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVA KSVAPI+SARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKS+SFRPLPCRSPQCNQVPNPSCSSS+CGFN
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFN 180

Query: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300
           SLYRSTFSYCLPSFKST FSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360
           RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFDVPNSR+GVAREPCS
Sbjct: 421 QNHRILFDVPNSRMGVAREPCS 439

BLAST of Cp4.1LG01g09670 vs. NCBI nr
Match: XP_022957241.1 (aspartyl protease AED3 [Cucurbita moschata])

HSP 1 Score: 847 bits (2189), Expect = 1.61e-309
Identity = 432/442 (97.74%), Positives = 435/442 (98.42%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPF L+LLL LLSISAES    HHH HPNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFLLSLLL-LLSISAES----HHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPI+SARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSS+CGFN
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSACGFN 180

Query: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300
           SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360
           RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFDVPNSRIGVAREPCS
Sbjct: 421 QNHRILFDVPNSRIGVAREPCS 437

BLAST of Cp4.1LG01g09670 vs. NCBI nr
Match: KAG7031923.1 (Aspartyl protease AED3, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 841 bits (2172), Expect = 5.21e-306
Identity = 437/494 (88.46%), Positives = 440/494 (89.07%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPFFLALLL LLSISAES HHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFFLALLL-LLSISAESHHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPI+SARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVP------------ 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVP            
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPTSVLFQINFEFL 180

Query: 181 ----------------------------------------NPSCSSSSCGFNLTYGSSTV 240
                                                   NPSCSSS+CGFNLTYGSSTV
Sbjct: 181 LLLFFQLTNSRAKQLLPNSTYRIIKHRTLTARDPFVYQVPNPSCSSSACGFNLTYGSSTV 240

Query: 241 AADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFS 300
           AADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFS
Sbjct: 241 AADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFS 300

Query: 301 YCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPS 360
           YCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPS
Sbjct: 301 YCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIPPS 360

Query: 361 ALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIIS 420
           ALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIIS
Sbjct: 361 ALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPIIS 420

Query: 421 PTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 442
           PTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD
Sbjct: 421 PTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 480

BLAST of Cp4.1LG01g09670 vs. ExPASy TrEMBL
Match: A0A6J1JIM3 (aspartyl protease AED3-like OS=Cucurbita maxima OX=3661 GN=LOC111486144 PE=4 SV=1)

HSP 1 Score: 850 bits (2196), Expect = 7.21e-311
Identity = 431/442 (97.51%), Positives = 436/442 (98.64%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPF L+LLLLLLSISAESLHHHHH   PNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFLLSLLLLLLSISAESLHHHHH---PNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVA KSVAPI+SARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVACKSVAPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKS+SFRPLPCRSPQCNQVPNPSCSSS+CGFN
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSTSFRPLPCRSPQCNQVPNPSCSSSACGFN 180

Query: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300
           SLYRSTFSYCLPSFKST FSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSTSFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360
           RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFDVPNSR+GVAREPCS
Sbjct: 421 QNHRILFDVPNSRMGVAREPCS 439

BLAST of Cp4.1LG01g09670 vs. ExPASy TrEMBL
Match: A0A6J1GYP2 (aspartyl protease AED3 OS=Cucurbita moschata OX=3662 GN=LOC111458686 PE=4 SV=1)

HSP 1 Score: 847 bits (2189), Expect = 7.79e-310
Identity = 432/442 (97.74%), Positives = 435/442 (98.42%), Query Frame = 0

Query: 1   MKTKSPFFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60
           MKTKSPF L+LLL LLSISAES    HHH HPNCNAAADRSSTLQVFHIFSPCSPFRPSK
Sbjct: 1   MKTKSPFLLSLLL-LLSISAES----HHHHHPNCNAAADRSSTLQVFHIFSPCSPFRPSK 60

Query: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLL 120
           PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPI+SARQLIQSPTFVVRAKIGTPAQTLL
Sbjct: 61  PLSWADNVLQMQAKDQARLQFLSSLVARKSVAPIASARQLIQSPTFVVRAKIGTPAQTLL 120

Query: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFN 180
           MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSS+CGFN
Sbjct: 121 MALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSACGFN 180

Query: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240
           LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ
Sbjct: 181 LTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQ 240

Query: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300
           SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR
Sbjct: 241 SLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGR 300

Query: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360
           RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT
Sbjct: 301 RIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDT 360

Query: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420
           CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 361 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 420

Query: 421 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFDVPNSRIGVAREPCS
Sbjct: 421 QNHRILFDVPNSRIGVAREPCS 437

BLAST of Cp4.1LG01g09670 vs. ExPASy TrEMBL
Match: A0A5A7SU62 (Aspartyl protease AED3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G003150 PE=4 SV=1)

HSP 1 Score: 781 bits (2018), Expect = 1.16e-283
Identity = 399/445 (89.66%), Positives = 415/445 (93.26%), Query Frame = 0

Query: 3   TKSPFF----LALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRP 62
           TKSPF     L LLLLL SIS    H        NCN A DRSSTL+VFHIFSPCSPFRP
Sbjct: 4   TKSPFLPLPLLLLLLLLFSISTAKSH-----IPLNCNPADDRSSTLKVFHIFSPCSPFRP 63

Query: 63  SKPLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQT 122
           SKPLSWADNVLQMQAKDQARLQFLSSLVAR+SV PI+SARQLIQSPTFVVRAKIGTPAQT
Sbjct: 64  SKPLSWADNVLQMQAKDQARLQFLSSLVARRSVVPIASARQLIQSPTFVVRAKIGTPAQT 123

Query: 123 LLMALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCG 182
           LL+ALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPC+SPQCNQVPNPSCS ++CG
Sbjct: 124 LLLALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGTACG 183

Query: 183 FNLTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQ 242
           FNLTYGSSTVAADLVQDN+TLA DSVPAYTFGCI KATGSSVPPQGLLGLGRGPLSLLGQ
Sbjct: 184 FNLTYGSSTVAADLVQDNVTLATDSVPAYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQ 243

Query: 243 SQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRV 302
           SQSLYRSTFSYCLPSFKS  FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLIAIRV
Sbjct: 244 SQSLYRSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLIAIRV 303

Query: 303 GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIG-NATISSLGG 362
           GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVR+EFRRR+G N T+SSLGG
Sbjct: 304 GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGG 363

Query: 363 FDTCYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIAS 422
           FDTCYTVPIISPTITFMFAGMNVTLPPDNFL+HS+AG+TTCLAMAAAPDNVNSVLNVIAS
Sbjct: 364 FDTCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIAS 423

Query: 423 MQQQNHRILFDVPNSRIGVAREPCS 442
           MQQQNHRILFD+PNSR+GVAREPCS
Sbjct: 424 MQQQNHRILFDIPNSRVGVAREPCS 443

BLAST of Cp4.1LG01g09670 vs. ExPASy TrEMBL
Match: A0A1S3BGB2 (aspartyl protease AED3 OS=Cucumis melo OX=3656 GN=LOC103489519 PE=4 SV=1)

HSP 1 Score: 781 bits (2018), Expect = 1.16e-283
Identity = 399/445 (89.66%), Positives = 415/445 (93.26%), Query Frame = 0

Query: 3   TKSPFF----LALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRP 62
           TKSPF     L LLLLL SIS    H        NCN A DRSSTL+VFHIFSPCSPFRP
Sbjct: 4   TKSPFLPLPLLLLLLLLFSISTAKSH-----IPLNCNPADDRSSTLKVFHIFSPCSPFRP 63

Query: 63  SKPLSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQT 122
           SKPLSWADNVLQMQAKDQARLQFLSSLVAR+SV PI+SARQLIQSPTFVVRAKIGTPAQT
Sbjct: 64  SKPLSWADNVLQMQAKDQARLQFLSSLVARRSVVPIASARQLIQSPTFVVRAKIGTPAQT 123

Query: 123 LLMALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCG 182
           LL+ALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPC+SPQCNQVPNPSCS ++CG
Sbjct: 124 LLLALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGTACG 183

Query: 183 FNLTYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQ 242
           FNLTYGSSTVAADLVQDN+TLA DSVPAYTFGCI KATGSSVPPQGLLGLGRGPLSLLGQ
Sbjct: 184 FNLTYGSSTVAADLVQDNVTLATDSVPAYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQ 243

Query: 243 SQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRV 302
           SQSLYRSTFSYCLPSFKS  FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLIAIRV
Sbjct: 244 SQSLYRSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLIAIRV 303

Query: 303 GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIG-NATISSLGG 362
           GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVR+EFRRR+G N T+SSLGG
Sbjct: 304 GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGG 363

Query: 363 FDTCYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIAS 422
           FDTCYTVPIISPTITFMFAGMNVTLPPDNFL+HS+AG+TTCLAMAAAPDNVNSVLNVIAS
Sbjct: 364 FDTCYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIAS 423

Query: 423 MQQQNHRILFDVPNSRIGVAREPCS 442
           MQQQNHRILFD+PNSR+GVAREPCS
Sbjct: 424 MQQQNHRILFDIPNSRVGVAREPCS 443

BLAST of Cp4.1LG01g09670 vs. ExPASy TrEMBL
Match: A0A0A0KTR5 (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_5G623870 PE=4 SV=1)

HSP 1 Score: 777 bits (2007), Expect = 4.89e-282
Identity = 394/442 (89.14%), Positives = 414/442 (93.67%), Query Frame = 0

Query: 3   TKSPFFLALLLL-LLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSKP 62
           TKSPF L LLLL L SIS    H        NCN AADRSSTLQVFHIFSPCSPFRPSKP
Sbjct: 4   TKSPFLLLLLLLHLFSISTAKSH-----IPSNCNPAADRSSTLQVFHIFSPCSPFRPSKP 63

Query: 63  LSWADNVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLLM 122
           LSWADNVLQMQAKDQARLQFLSSLVAR+S  PI+SARQLIQSPTFVVRAKIGTPAQTLL+
Sbjct: 64  LSWADNVLQMQAKDQARLQFLSSLVARRSFVPIASARQLIQSPTFVVRAKIGTPAQTLLL 123

Query: 123 ALDTSNDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFNL 182
           ALDTSNDAAWIPCSGC+GCPSTTVFSSDKSSSFRPLPC+SPQCNQVPNPSCS S+CGFNL
Sbjct: 124 ALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNL 183

Query: 183 TYGSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQS 242
           TYGSSTVAADLVQDN+TLA DSVP+YTFGCI KATGSSVPPQGLLGLGRGPLSLLGQSQS
Sbjct: 184 TYGSSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS 243

Query: 243 LYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRR 302
           LY+STFSYCLPSFKS  FSGSLRLGPVAQP RIKYTPLLRNPRRSSLYYVNLI+IRVGR+
Sbjct: 244 LYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK 303

Query: 303 IVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIG-NATISSLGGFDT 362
           IVDIPPSALAFNSATGAGT+IDSGTTFTRLVAPAYTAVR+EFRRR+G N T+SSLGGFDT
Sbjct: 304 IVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDT 363

Query: 363 CYTVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQ 422
           CYTVPIISPTITFMFAGMNVTLPPDNFL+HS+AG+TTCLAMAAAPDNVNSVLNVIASMQQ
Sbjct: 364 CYTVPIISPTITFMFAGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQ 423

Query: 423 QNHRILFDVPNSRIGVAREPCS 442
           QNHRILFD+PNSR+GVARE CS
Sbjct: 424 QNHRILFDIPNSRVGVARESCS 440

BLAST of Cp4.1LG01g09670 vs. TAIR 10
Match: AT5G07030.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 579.3 bits (1492), Expect = 2.6e-165
Identity = 288/440 (65.45%), Positives = 354/440 (80.45%), Query Frame = 0

Query: 9   LALLLLLLSISAESLHHHHHHQHPNCN--AAADRSSTLQVFHIFSPCSPFRPSKPLSWAD 68
           L L L L SI   +L  +    HPNC+     D+ STL++FHI SPCSPF+ S PLSW  
Sbjct: 20  LVLFLQLFSILPLALGLN----HPNCDLTKTQDQGSTLRIFHIDSPCSPFKSSSPLSWEA 79

Query: 69  NVLQMQAKDQARLQFLSSLVARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLLMALDTS 128
            VLQ  A+DQARLQ+LSSLVA +SV PI+S RQ++QS T++V+A IGTPAQ LL+A+DTS
Sbjct: 80  RVLQTLAQDQARLQYLSSLVAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTS 139

Query: 129 NDAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSSCGFNLTYGSS 188
           +D AWIPCSGCVGCPS T FS  KS+SF+ + C +PQC QVPNP+C + +C FNLTYGSS
Sbjct: 140 SDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSS 199

Query: 189 TVAADLVQDNITLANDSVPAYTFGCITKATGSSV--PPQGLLGLGRGPLSLLGQSQSLYR 248
           ++AA+L QD I LA D + A+TFGC+ K  G     PPQGLLGLGRGPLSL+ Q+QS+Y+
Sbjct: 200 SIAANLSQDTIRLAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYK 259

Query: 249 STFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVD 308
           STFSYCLPSF+S  FSGSLRLGP +QP+R+KYT LLRNPRRSSLYYVNL+AIRVGR++VD
Sbjct: 260 STFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVD 319

Query: 309 IPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRI--GNATISSLGGFDTCY 368
           +PP+A+AFN +TGAGTI DSGT +TRL  P Y AVRNEFR+R+    A ++SLGGFDTCY
Sbjct: 320 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY 379

Query: 369 TVPIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQN 428
           +  +  PTITFMF G+N+T+P DN +LHS+AG+T+CLAMAAAP+NVNSV+NVIASMQQQN
Sbjct: 380 SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQN 439

Query: 429 HRILFDVPNSRIGVAREPCS 443
           HR+L DVPN R+G+ARE CS
Sbjct: 440 HRVLIDVPNGRLGLARERCS 455

BLAST of Cp4.1LG01g09670 vs. TAIR 10
Match: AT3G54400.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 567.8 bits (1462), Expect = 7.8e-162
Identity = 293/436 (67.20%), Positives = 348/436 (79.82%), Query Frame = 0

Query: 9   LALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSKPLSWADNV 68
           L LL+ LL + +ES+         NCN  +  SS L+VFHI S CSPF+ S  +SWAD +
Sbjct: 6   LILLISLLILKSESI---------NCNEKS-HSSDLRVFHINSLCSPFKTS--VSWADTL 65

Query: 69  LQMQAKDQARLQFLSSLV-ARKSVAPISSARQLIQSPTFVVRAKIGTPAQTLLMALDTSN 128
           LQ    D+AR  +LSSL   RKS  PI+S R ++QSPT++VRA IGTPAQ +L+ALDTSN
Sbjct: 66  LQ----DKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSN 125

Query: 129 DAAWIPCSGCVGCPSTTVFSSDKSSSFRPLPCRSPQCNQVPNPSCS-SSSCGFNLTYGSS 188
           DAAWIPCSGCVGC S+ +F   KSSS R L C +PQC Q PNPSC+ S SCGFN+TYG S
Sbjct: 126 DAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGS 185

Query: 189 TVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSLLGQSQSLYRST 248
           T+ A L QD +TLA+D +P YTFGCI KA+G+S+P QGL+GLGRGPLSL+ QSQ+LY+ST
Sbjct: 186 TIEAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQST 245

Query: 249 FSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIAIRVGRRIVDIP 308
           FSYCLP+ KS+ FSGSLRLGP  QP RIK TPLL+NPRRSSLYYVNL+ IRVG +IVDIP
Sbjct: 246 FSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIP 305

Query: 309 PSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSLGGFDTCYTVPI 368
            SALAF+ ATGAGTI DSGT +TRLV PAY AVRNEFRRR+ NA  +SLGGFDTCY+  +
Sbjct: 306 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTCYSGSV 365

Query: 369 ISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRIL 428
           + P++TFMFAGMNVTLPPDN L+HSSAG  +CLAMAAAP NVNSVLNVIASMQQQNHR+L
Sbjct: 366 VFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVL 425

Query: 429 FDVPNSRIGVAREPCS 443
            DVPNSR+G++RE C+
Sbjct: 426 IDVPNSRLGISRETCT 425

BLAST of Cp4.1LG01g09670 vs. TAIR 10
Match: AT1G09750.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 413.3 bits (1061), Expect = 2.4e-115
Identity = 229/449 (51.00%), Positives = 299/449 (66.59%), Query Frame = 0

Query: 7   FFLALLLLLLSISAESLHHHHHHQHPNCNAAADRSSTLQVFHIFSPCSPFRPSK-PLSWA 66
           FFL LLL     +A         +     AA D S  L +  I + CSPF P+    S  
Sbjct: 10  FFLTLLLPFTFTTAT--------RDTCATAAPDGSDDLSIIPINAKCSPFAPTHVSASVI 69

Query: 67  DNVLQMQAKDQARLQFLSSLVARK---SVAPISSARQLIQSPTFVVRAKIGTPAQTLLMA 126
           D VL M + D  RL +LSSLVA K   +  P++S  QL     +VVRAK+GTP Q + M 
Sbjct: 70  DTVLHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQL-HIGNYVVRAKLGTPPQLMFMV 129

Query: 127 LDTSNDAAWIPCSGCVGCP-STTVFSSDKSSSFRPLPCRSPQCNQVPNPSCSSSS----- 186
           LDTSNDA W+PCSGC GC  ++T F+++ SS++  + C + QC Q    +C SSS     
Sbjct: 130 LDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSV 189

Query: 187 CGFNLTY-GSSTVAADLVQDNITLANDSVPAYTFGCITKATGSSVPPQGLLGLGRGPLSL 246
           C FN +Y G S+ +A LVQD +TLA D +P ++FGCI  A+G+S+PPQGL+GLGRGP+SL
Sbjct: 190 CSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSL 249

Query: 247 LGQSQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYTPLLRNPRRSSLYYVNLIA 306
           + Q+ SLY   FSYCLPSF+S  FSGSL+LG + QPK I+YTPLLRNPRR SLYYVNL  
Sbjct: 250 VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTG 309

Query: 307 IRVGRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAYTAVRNEFRRRIGNATISSL 366
           + VG   V + P  L F++ +GAGTIIDSGT  TR   P Y A+R+EFR+++  ++ S+L
Sbjct: 310 VSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTL 369

Query: 367 GGFDTCYTV--PIISPTITFMFAGMNVTLPPDNFLLHSSAGTTTCLAMAAAPDNVNSVLN 426
           G FDTC++     ++P IT     +++ LP +N L+HSSAGT TCL+MA    N N+VLN
Sbjct: 370 GAFDTCFSADNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 429

Query: 427 VIASMQQQNHRILFDVPNSRIGVAREPCS 443
           VIA++QQQN RILFDVPNSRIG+A EPC+
Sbjct: 430 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449

BLAST of Cp4.1LG01g09670 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 191.8 bits (486), Expect = 1.2e-48
Identity = 117/353 (33.14%), Positives = 181/353 (51.27%), Query Frame = 0

Query: 103 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGC--PSTTVFSSDKSSSFRPLPCR 162
           S  +  R  +GTPA+ + M LDT +D  W+ C+ C  C   S  +F   KS ++  +PC 
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198

Query: 163 SPQCNQVPNPSCSS--SSCGFNLTYG-SSTVAADLVQDNITLANDSVPAYTFGCITKATG 222
           SP C ++ +  C++   +C + ++YG  S    D   + +T   + V     GC     G
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEG 258

Query: 223 SSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGFSGSLRLGPVAQPKRIKYT 282
             V   GLLGLG+G LS  GQ+   +   FSYCL    ++    S+  G  A  +  ++T
Sbjct: 259 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 318

Query: 283 PLLRNPRRSSLYYVNLIAIRV-GRRIVDIPPSALAFNSATGAGTIIDSGTTFTRLVAPAY 342
           PLL NP+  + YYV L+ I V G R+  +  S    +     G IIDSGT+ TRL+ PAY
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378

Query: 343 TAVRNEFRRRIGNATIS---SLGGFDTCYTVPIIS----PTITFMFAGMNVTLPPDNFLL 402
            A+R+ F  R+G  T+        FDTC+ +  ++    PT+   F G +V+LP  N+L+
Sbjct: 379 IAMRDAF--RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLI 438

Query: 403 HSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRIGVAREPCS 443
                   C A A         L++I ++QQQ  R+++D+ +SR+G A   C+
Sbjct: 439 PVDTNGKFCFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Cp4.1LG01g09670 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 186.8 bits (473), Expect = 3.7e-47
Identity = 125/358 (34.92%), Positives = 181/358 (50.56%), Query Frame = 0

Query: 103 SPTFVVRAKIGTPAQTLLMALDTSNDAAWIPCSGCVGCPSTT--VFSSDKSSSFRPLPCR 162
           S  + +R  +GTPA  + M LDT +D  W+ CS C  C + T  +F   KS +F  +PC 
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCG 191

Query: 163 SPQCNQVPNPS-C---SSSSCGFNLTYG-SSTVAADLVQDNITLANDSVPAYTFGCITKA 222
           S  C ++ + S C    S +C + ++YG  S    D   + +T     V     GC    
Sbjct: 192 SRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDN 251

Query: 223 TGSSVPPQGLLGLGRGPLSLLGQSQSLYRSTFSYCLPSFKSTGFS----GSLRLGPVAQP 282
            G  V   GLLGLGRG LS   Q+++ Y   FSYCL    S+G S     ++  G  A P
Sbjct: 252 EGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVP 311

Query: 283 KRIKYTPLLRNPRRSSLYYVNLIAIRV-GRRIVDIPPSALAFNSATGAGTIIDSGTTFTR 342
           K   +TPLL NP+  + YY+ L+ I V G R+  +  S    ++    G IIDSGT+ TR
Sbjct: 312 KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTR 371

Query: 343 LVAPAYTAVRNEFR---RRIGNATISSLGGFDTCYTV----PIISPTITFMFAGMNVTLP 402
           L  PAY A+R+ FR    ++  A   SL  FDTC+ +     +  PT+ F F G  V+LP
Sbjct: 372 LTQPAYVALRDAFRLGATKLKRAPSYSL--FDTCFDLSGMTTVKVPTVVFHFGGGEVSLP 431

Query: 403 PDNFLLHSSAGTTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRIGVAREPC 442
             N+L+  +     C A A    +    L++I ++QQQ  R+ +D+  SR+G     C
Sbjct: 432 ASNYLIPVNTEGRFCFAFAGTMGS----LSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
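
The percentages on each HSP statistics line above are the identical (or positive) residue count divided by the alignment length, for example 125/358 = 34.92%. The following is a minimal Python sketch, not part of the database export, that parses such a line and recomputes the percentages as a consistency check. The names HSP_STATS, parse_hsp_stats and recompute_pct are hypothetical and introduced here for illustration only.

```python
import re

# Pattern for the HSP statistics line. "Posi?tives" also accepts the
# "Postives" spelling that appears in some exports of this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages, or None if no match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def recompute_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Example using the AT3G61820.1 HSP reported above.
stats = parse_hsp_stats(
    "Identity = 125/358 (34.92%), Positives = 181/358 (50.56%), Query Frame = 0"
)
print(recompute_pct(stats["identical"], stats["aligned_length"]))
print(recompute_pct(stats["positives"], stats["aligned_length"]))
```

Comparing the recomputed values with the reported ones is a quick way to catch copy or extraction errors in the alignment headers.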

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
O04496	3.4e-114	51.00	Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1
Q6F4N5	1.9e-96	46.05	Aspartyl protease 25 OS=Oryza sativa subsp. japonica OX=39947 GN=AP25 PE=2 SV=1
Q9LNJ3	1.6e-47	33.14	Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q9LHE3	1.4e-43	33.33	Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LEW3	9.3e-43	31.84	Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1

Match Name	E-value	Identity	Description
XP_023526864.1	0.0	100.00	aspartyl protease AED3-like [Cucurbita pepo subsp. pepo]
KAG6601125.1	0.0	98.87	Aspartyl protease AED3, partial [Cucurbita argyrosperma subsp. sororia]
XP_022988941.1	1.49e-310	97.51	aspartyl protease AED3-like [Cucurbita maxima]
XP_022957241.1	1.61e-309	97.74	aspartyl protease AED3 [Cucurbita moschata]
KAG7031923.1	5.21e-306	88.46	Aspartyl protease AED3, partial [Cucurbita argyrosperma subsp. argyrosperma]

Match Name	E-value	Identity	Description
A0A6J1JIM3	7.21e-311	97.51	aspartyl protease AED3-like OS=Cucurbita maxima OX=3661 GN=LOC111486144 PE=4 SV=...
A0A6J1GYP2	7.79e-310	97.74	aspartyl protease AED3 OS=Cucurbita moschata OX=3662 GN=LOC111458686 PE=4 SV=1
A0A5A7SU62	1.16e-283	89.66	Aspartyl protease AED3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold4...
A0A1S3BGB2	1.16e-283	89.66	aspartyl protease AED3 OS=Cucumis melo OX=3656 GN=LOC103489519 PE=4 SV=1
A0A0A0KTR5	4.89e-282	89.14	Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_5G623870 PE=...

Match Name	E-value	Identity	Description
AT5G07030.1	2.6e-165	65.45	Eukaryotic aspartyl protease family protein
AT3G54400.1	7.8e-162	67.20	Eukaryotic aspartyl protease family protein
AT1G09750.1	2.4e-115	51.00	Eukaryotic aspartyl protease family protein
AT1G01300.1	1.2e-48	33.14	Eukaryotic aspartyl protease family protein
AT3G61820.1	3.7e-47	34.92	Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 106..266; e-value: 3.9E-35; score: 121.7
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 267..442; e-value: 4.0E-46; score: 159.0
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 83..266; e-value: 4.0E-36; score: 126.7
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 105..441
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 289..435; e-value: 8.3E-28; score: 97.2
None	No IPR available	PANTHER	PTHR13683:SF798	ASPARTYL PROTEASE AED3-LIKE	coord: 10..442
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 10..442
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 106..437; score: 40.342762
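
The alignment coordinates in the InterPro table are inclusive residue positions on the predicted protein, so a domain's length is end - start + 1. The following is a minimal Python sketch, assuming the "coord: start..end" text format shown above, that computes span lengths and checks whether two hits are directly adjacent (as the two GENE3D hits, 83..266 and 267..442, are). The helper names parse_coord, span_length and are_adjacent are hypothetical.

```python
import re

# "coord: 106..266" -> inclusive (start, end) residue positions.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract the inclusive (start, end) pair from an alignment cell."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Number of residues covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def are_adjacent(first, second):
    """True if `second` begins on the residue right after `first` ends."""
    return first[1] + 1 == second[0]


# Example using the two GENE3D hits and the N-terminal PFAM hit.
gene3d_n = parse_coord("coord: 83..266")
gene3d_c = parse_coord("coord: 267..442")
taxi_n = parse_coord("coord: 106..266; e-value: 3.9E-35; score: 121.7")

print(span_length(taxi_n))
print(are_adjacent(gene3d_n, gene3d_c))
```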

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g09670.1	Cp4.1LG01g09670.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
molecular_function	GO:0004190	aspartic-type endopeptidase activity