Cp4.1LG16g04850 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG16g04850
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: aspartic proteinase-like protein 2
Location: Cp4.1LG16: 5997256 .. 6000417 (+)
RNA-Seq Expression: Cp4.1LG16g04850
Synteny: Cp4.1LG16g04850
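
The Location field in the overview above packs the chromosome, the start and end coordinates, and the strand into one string. The following is a minimal sketch of how that string could be split and the gene span computed. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG16: 5997256 .. 6000417 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 3162 bp on the forward strand of Cp4.1LG16.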
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AATGAAATTTTAAGACGAGGGATTGAGCAATGGCCGCGATTGTTCGTACTGGAGCGTTGGTGGCCGTTGCGGTGGTGCTGATTCATGCTGCGACGGTTTCGTATGGATTTTCGGCGAAAATGACGCTGGAGAGGGCTTTTCCGACTAATCATGGCGTCGAAATGGTTCATCTCCGCGGTCGGGACCGGGCTAGACATGGTAGAATGTTGCAGTCTTCTGGTGGTGTCATTGATTTTCTTCTGTCCGGAACCTACGAACCGTATTACCTTGGGTAAGGAAAAGGAACGAATCTTTTTTTTCAATGTGTACTTTTAGTTTTCATTTGGAATGTTTGTTTACCTTTCTTATTTTGACGAGGTGTCTTAATTTTAGGATCATAATTGCGGTTTTATTCCTGATTGAATTGAAACTTTTAGTGAATGAGTTTGAATGGAGGCTTTGTTGTTGAATTCTCAAATGAATGAACCTCTGGAAATATATCATACTTCTGTTATCTTGTGCCTTATCTTTTCCCCCAATTTTAGTTTAAAAGAACCGATATATTTGTGTATCATCATGGCCATTGCCCATGAATTAATCTTTCAATATATTAGGTTCTTCACTTTCTTATTGCATTAATTACCTTAGTTGTTCTCATTATCTTGGTTTGAACGAATATTTTGTGGTTCTCTCACAGGCTTTATTACACTAAAGTGCAACTAGGTAATCCTCCAAAGGATTTTTATGTACAGATTGATACTGGAAGTAATATTTTGTGGGTTTGTTGCAACTCTTGCATTGGCTGCTCAGAGACTGGTGCGCTCCAGGTGATTTCGTGTTATATAAGTTTTGAGCTCGTTTTAAACCGGTGGCATTGTCTGTCACCTAAACCCCAACCCTTTTGCTGCTACATATTTTCAATTCATTTGTTTTTTTCCTCACAGGTTGAGCTCAATGTCTTTGATCCTGGTAACTCATCAACAGCTTCTTTGGTCTCTTGTTCAGACAAAATATGTACGCCTGGGGTTGTATCCTCCGACTCTTCCTGTTTTGGCCAGACCAATCAGTGTGCTTTTGCCTTGCAATACGGCGATGGAAGTGAAACATCAGGCTATTTTGTTATAGACAAGATGCGTCTTAATGTGGTAGGTAATGATCATGATACTTCGAATCCTTCAGCTTCAGTTGTGTTTGGGTGAGTTTATGCTTCATTCCAAAACTAACTCTCTTAATTCTATGTTCTTGGTCACAACTCACATCATTTGTTAAACAGGTGTAGCACATCACAGACTGGAGATTTAACTAAGTCAGACAAGACAGTTGATGGAATCTTTGGATTTGGGCAACGGGATTTGTCTGTAATTTCTCAACTGTCATCACGAGGATTAGCTCCAAAAGTGTTCTCTCACTGCTTGAACGGAGATGATAGTGGTGGGGGAATATTGGTTCTTGGTGAGATTTTGGATCCAAATGTTGTTTATACTCCTCTAGTCCCATCACAGTATGTTCTTCATTTCCCACCAACATCTCATCCATTTTCGTTTGGTTAAACTTATTCGAGTCTGAAAGTGTGTGGTTTCTGATTGTTGTTGGGTTCAGGTCTCATTATAACTTGAATCTGCAAAGCATCTCCGTTAATGGTCGAGTATTACCTATCAATCCGGCTCTCTTTGCAACAGCTAGTGGCCAAGGAGCCATAATTGACTCTGGCACTACCTTGGCATACCTTGCAGAAGAAGCTTACGACATTTTTATTGTTGCCGTGAGTCTTTCAGATGAGTTCATTATGATGCTTCATTGTTCATACTGTTACATATGTTATTTTCCATTATAGGATGGTTGAAGTAGACACGATCCTTTGAATAGCTTTATTTCTCACTCATTTTTGTAAAAACTCAGATCGCGAACACAATTTCAAAATCGACTCAGTCTGTTACCTTCAAGGGAAATCAGTGTTATTTAACCTCCTCCAGGTTGATTTAATTTACAATCTACATTTCTGTTTGAAGTTAATGTTTACC
ATGAGGAGGAATTTGACTCATCATTTTTGTCTTTGGGTATAATATCAGTATCTCTGATATATTTCCTCAAGCAAGCTTTAACTTCGCCGGCGGGGCATCGTTGTTATTGAGACCCCAAGACTACCTCATACAACAATTCATTGTAAGTCTATAGCTTTCACTTCATTGTCTGGACATTTACATTTCCATAGTAGTTACGGAGCATTAAAGATAACACTTTTAGACATATACTTTGGTTGATATAACTTTTATCCTGGTTTGGAATTGGGGTTGACAAAAGTTTCATTACAGGGTGATACTGTTGTTTGGTGCGTTGGTTTCCAGAAAATTCAAGGTCAAGGGGCTACAATTTTAGGAGGTGAGTTCAGAAATCATATATATATATGGATAAGTTTATGTTTATTTGTCAACCATTTTCTATCCTGAAATACTGACACTGCAGCTCACTTTTTTGGCAGACCTTGTTCTGAAAGACAAAATCTTCGTTTATGATTTAGCTAATCAACAAATTGGATGGACAAACTTTAACTGTGAGTTTCCACAACCCTCTGTAAAGGATCGTTTTTGTAAGGGCCTAAGCTCACCATTAGCAGATATTATCATTTTTGAAGTTTACCTTTCGGACTTTCCCTCAAAGTTTTTAAAACGCGTCTGCTAAGGGAGAGATTTCCACACCCTGTAAAGAATGCTTCGTTCTCCTCTCCAATGACGTGGGATCTCGCAGTTTTGGTTAACGATACTTAGATGTTTGACAAGGTTCTTGTAGAGTAAAATATAATTCAACCATTCTTATTAAAACTGAGTTTCATTCACTTTCGATCTTGGTTCATTCAACTTGCTACTTTAGATCATGGTTCACTGGTAGTCGTCTTTGGGTCTATTTGGGTGGACTTTCTAACTACTTAAAAACATTTATTTTTTCTGTTAAAAACACCTTTTTAAGCACTAAAAAATTCAACAACATATTTGTAACATGTTTTTCTCCATATTAGGTGCAATGTCAGTAAATGTTTCTACAACTACCAGGACCTCAAAGAGTGGTTTGAAAGCTCAAGTCAGTGATGGTGGCTCTGTGGGGAATCAGCCCGACAGACTTGTTCTACACTTGAGCATTCTTGTATTCTTCGTTCACTTATCCATCTTCACCAGCTTCCTCAACTCA

mRNA sequence

AATGAAATTTTAAGACGAGGGATTGAGCAATGGCCGCGATTGTTCGTACTGGAGCGTTGGTGGCCGTTGCGGTGGTGCTGATTCATGCTGCGACGGTTTCGTATGGATTTTCGGCGAAAATGACGCTGGAGAGGGCTTTTCCGACTAATCATGGCGTCGAAATGGTTCATCTCCGCGGTCGGGACCGGGCTAGACATGGTAGAATGTTGCAGTCTTCTGGTGGTGTCATTGATTTTCTTCTGTCCGGAACCTACGAACCGTATTACCTTGGGCTTTATTACACTAAAGTGCAACTAGGTAATCCTCCAAAGGATTTTTATGTACAGATTGATACTGGAAGTAATATTTTGTGGGTTTGTTGCAACTCTTGCATTGGCTGCTCAGAGACTGGTGCGCTCCAGGTTGAGCTCAATGTCTTTGATCCTGGTAACTCATCAACAGCTTCTTTGGTCTCTTGTTCAGACAAAATATGTACGCCTGGGGTTGTATCCTCCGACTCTTCCTGTTTTGGCCAGACCAATCAGTGTGCTTTTGCCTTGCAATACGGCGATGGAAGTGAAACATCAGGCTATTTTGTTATAGACAAGATGCGTCTTAATGTGGTAGGTAATGATCATGATACTTCGAATCCTTCAGCTTCAGTTGTGTTTGGGTGTAGCACATCACAGACTGGAGATTTAACTAAGTCAGACAAGACAGTTGATGGAATCTTTGGATTTGGGCAACGGGATTTGTCTGTAATTTCTCAACTGTCATCACGAGGATTAGCTCCAAAAGTGTTCTCTCACTGCTTGAACGGAGATGATAGTGGTGGGGGAATATTGGTTCTTGGTGAGATTTTGGATCCAAATGTTGTTTATACTCCTCTAGTCCCATCACAGTCTCATTATAACTTGAATCTGCAAAGCATCTCCGTTAATGGTCGAGTATTACCTATCAATCCGGCTCTCTTTGCAACAGCTAGTGGCCAAGGAGCCATAATTGACTCTGGCACTACCTTGGCATACCTTGCAGAAGAAGCTTACGACATTTTTATTGTTGCCATCGCGAACACAATTTCAAAATCGACTCAGTCTGTTACCTTCAAGGGAAATCAGTGTTATTTAACCTCCTCCAGTATCTCTGATATATTTCCTCAAGCAAGCTTTAACTTCGCCGGCGGGGCATCGTTGTTATTGAGACCCCAAGACTACCTCATACAACAATTCATTGGTGATACTGTTGTTTGGTGCGTTGGTTTCCAGAAAATTCAAGGTCAAGGGGCTACAATTTTAGGAGTCAGTGATGGTGGCTCTGTGGGGAATCAGCCCGACAGACTTGTTCTACACTTGAGCATTCTTGTATTCTTCGTTCACTTATCCATCTTCACCAGCTTCCTCAACTCA

Coding sequence (CDS)

ATGGCCGCGATTGTTCGTACTGGAGCGTTGGTGGCCGTTGCGGTGGTGCTGATTCATGCTGCGACGGTTTCGTATGGATTTTCGGCGAAAATGACGCTGGAGAGGGCTTTTCCGACTAATCATGGCGTCGAAATGGTTCATCTCCGCGGTCGGGACCGGGCTAGACATGGTAGAATGTTGCAGTCTTCTGGTGGTGTCATTGATTTTCTTCTGTCCGGAACCTACGAACCGTATTACCTTGGGCTTTATTACACTAAAGTGCAACTAGGTAATCCTCCAAAGGATTTTTATGTACAGATTGATACTGGAAGTAATATTTTGTGGGTTTGTTGCAACTCTTGCATTGGCTGCTCAGAGACTGGTGCGCTCCAGGTTGAGCTCAATGTCTTTGATCCTGGTAACTCATCAACAGCTTCTTTGGTCTCTTGTTCAGACAAAATATGTACGCCTGGGGTTGTATCCTCCGACTCTTCCTGTTTTGGCCAGACCAATCAGTGTGCTTTTGCCTTGCAATACGGCGATGGAAGTGAAACATCAGGCTATTTTGTTATAGACAAGATGCGTCTTAATGTGGTAGGTAATGATCATGATACTTCGAATCCTTCAGCTTCAGTTGTGTTTGGGTGTAGCACATCACAGACTGGAGATTTAACTAAGTCAGACAAGACAGTTGATGGAATCTTTGGATTTGGGCAACGGGATTTGTCTGTAATTTCTCAACTGTCATCACGAGGATTAGCTCCAAAAGTGTTCTCTCACTGCTTGAACGGAGATGATAGTGGTGGGGGAATATTGGTTCTTGGTGAGATTTTGGATCCAAATGTTGTTTATACTCCTCTAGTCCCATCACAGTCTCATTATAACTTGAATCTGCAAAGCATCTCCGTTAATGGTCGAGTATTACCTATCAATCCGGCTCTCTTTGCAACAGCTAGTGGCCAAGGAGCCATAATTGACTCTGGCACTACCTTGGCATACCTTGCAGAAGAAGCTTACGACATTTTTATTGTTGCCATCGCGAACACAATTTCAAAATCGACTCAGTCTGTTACCTTCAAGGGAAATCAGTGTTATTTAACCTCCTCCAGTATCTCTGATATATTTCCTCAAGCAAGCTTTAACTTCGCCGGCGGGGCATCGTTGTTATTGAGACCCCAAGACTACCTCATACAACAATTCATTGGTGATACTGTTGTTTGGTGCGTTGGTTTCCAGAAAATTCAAGGTCAAGGGGCTACAATTTTAGGAGTCAGTGATGGTGGCTCTGTGGGGAATCAGCCCGACAGACTTGTTCTACACTTGAGCATTCTTGTATTCTTCGTTCACTTATCCATCTTCACCAGCTTCCTCAACTCA

Protein sequence

MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRMLQSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSETGALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSGYFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQLSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRVLPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLTSSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILGVSDGGSVGNQPDRLVLHLSILVFFVHLSIFTSFLNS
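
The protein sequence above corresponds to the coding sequence read codon by codon. As an illustration only, the sketch below translates a CDS string with the standard genetic code; the helper name `translate` is hypothetical, and using the standard code for this nuclear gene is an assumption rather than something the record states.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGCCGCGATTGTTCGTACTGGA"))  # MAAIVRTG
```

The first eight codons of the CDS give MAAIVRTG, which matches the start of the protein sequence shown for this feature.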
Homology
BLAST of Cp4.1LG16g04850 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 244.6 bits (623), Expect = 2.2e-63
Identity = 148/398 (37.19%), Positives = 212/398 (53.27%), Query Frame = 0

Query: 10  LVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRMLQSSGGVIDF 69
           +VAV V++I  A+ ++ F A+              + H +  D  RH RML S    ID 
Sbjct: 10  VVAVFVIVIEFASANFVFKAQHKF-----AGKKKNLEHFKSHDTRRHSRMLAS----IDL 69

Query: 70  LLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSETGALQVELNV 129
            L G      +GLY+TK++LG+PPK+++VQ+DTGS+ILW+ C  C  C     L   L++
Sbjct: 70  PLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSL 129

Query: 130 FDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSGYFVIDKMRL 189
           FD   SST+  V C D  C+   +S   SC      C++ + Y D S + G F+ D + L
Sbjct: 130 FDMNASSTSKKVGCDDDFCS--FISQSDSCQPALG-CSYHIVYADESTSDGKFIRDMLTL 189

Query: 190 NVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQLSSRGLAPK 249
             V  D  T      VVFGC + Q+G L   D  VDG+ GFGQ + SV+SQL++ G A +
Sbjct: 190 EQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKR 249

Query: 250 VFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRVLPINPALFA 309
           VFSHCL+ +  GGGI  +G +  P V  TP+VP+Q HYN+ L  + V+G  L +  ++  
Sbjct: 250 VFSHCLD-NVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVR 309

Query: 310 TASGQGAIIDSGTTLAYLAEEAYDIFIVAI--ANTISKSTQSVTFKGNQCYLTSSSISDI 369
                G I+DSGTTLAY  +  YD  I  I     +       TF   QC+  S+++ + 
Sbjct: 310 NG---GTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEETF---QCFSFSTNVDEA 369

Query: 370 FPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQ 406
           FP  SF F     L + P DYL   F  +  ++C G+Q
Sbjct: 370 FPPVSFEFEDSVKLTVYPHDYL---FTLEEELYCFGWQ 385
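
Each HSP summary line in these reports gives counts over the alignment length together with a percentage. The sketch below shows how such a line could be parsed and the percentages recomputed as a consistency check. The function name `parse_hsp_stats` is hypothetical, and the regular expression assumes the `Label = N/M (P%)` layout seen in this record.

```python
import re


def parse_hsp_stats(line):
    """Extract 'Label = N/M (P%)' fields from a BLAST HSP summary line.

    Returns a dict keyed by label, holding the raw counts, the
    percentage as reported, and the percentage recomputed from N/M.
    """
    pattern = r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
    stats = {}
    for label, count, length, reported in re.findall(pattern, line):
        count, length = int(count), int(length)
        stats[label] = {
            "count": count,
            "length": length,
            "reported": float(reported),
            "percent": round(100 * count / length, 2),
        }
    return stats


line = "Identity = 148/398 (37.19%), Positives = 212/398 (53.27%), Query Frame = 0"
for label, values in parse_hsp_stats(line).items():
    print(label, values["count"], values["length"], values["percent"])
```

For the first Swiss-Prot hit, 148 identical positions over an alignment length of 398 recompute to the reported 37.19%.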

BLAST of Cp4.1LG16g04850 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 243.8 bits (621), Expect = 3.7e-63
Identity = 150/402 (37.31%), Positives = 222/402 (55.22%), Query Frame = 0

Query: 6   RTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRMLQSSGG 65
           R   +VAV  VL+    VS  F   +T + A       ++  L+  D  RH RML +   
Sbjct: 9   RISRIVAVVFVLV-IQVVSGNFVFNVTHKFA---GKEKQLSELKSHDSFRHARMLAN--- 68

Query: 66  VIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSETGALQV 125
            ID  L G      +GLY+TK++LG+PPK++YVQ+DTGS+ILWV C  C  C     L +
Sbjct: 69  -IDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGI 128

Query: 126 ELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSGYFVID 185
            L+++D   SST+  V C D  C+   +    +C G    C++ + YGDGS + G F+ D
Sbjct: 129 PLSLYDSKTSSTSKNVGCEDDFCS--FIMQSETC-GAKKPCSYHVVYGDGSTSDGDFIKD 188

Query: 186 KMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQLSSRG 245
            + L  V  +  T+  +  VVFGC  +Q+G L ++D  VDGI GFGQ + S+ISQL++ G
Sbjct: 189 NITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGG 248

Query: 246 LAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRVLPINP 305
              ++FSHCL+ + +GGGI  +GE+  P V  TP+VP+Q HYN+ L+ + V+G  + + P
Sbjct: 249 STKRIFSHCLD-NMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPP 308

Query: 306 ALFATASGQGAIIDSGTTLAYLAEEAYDIFI--VAIANTISKSTQSVTFKGNQCYLTSSS 365
           +L +T    G IIDSGTTLAYL +  Y+  I  +     +       TF    C+  +S+
Sbjct: 309 SLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETF---ACFSFTSN 368

Query: 366 ISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQ 406
               FP  + +F     L + P DYL   F     ++C G+Q
Sbjct: 369 TDKAFPVVNLHFEDSLKLSVYPHDYL---FSLREDMYCFGWQ 392

BLAST of Cp4.1LG16g04850 vs. ExPASy Swiss-Prot
Match: Q8VYV9 (Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 5.4e-30
Identity = 112/375 (29.87%), Positives = 172/375 (45.87%), Query Frame = 0

Query: 51  RDRARHGRMLQSSGGVIDFLLSG--TYEPYYLG-LYYTKVQLGNPPKDFYVQIDTGSNIL 110
           RDR   GR L +    +     G  T     LG L+Y  V +G P   F V +DTGS++ 
Sbjct: 69  RDRLIRGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLF 128

Query: 111 WVCCNSCIGC----SETGALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQT 170
           W+ C+ C  C       G   ++LN++ P  SST++ V C+  +CT G       C    
Sbjct: 129 WLPCD-CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRG-----DRCASPE 188

Query: 171 NQCAFALQY-GDGSETSGYFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDK 230
           + C + ++Y  +G+ ++G  V D   L++V ND  +    A V FGC   QTG +     
Sbjct: 189 SDCPYQIRYLSNGTSSTGVLVEDV--LHLVSNDKSSKAIPARVTFGCGQVQTG-VFHDGA 248

Query: 231 TVDGIFGFGQRDLSVISQLSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVP 290
             +G+FG G  D+SV S L+  G+A   FS C   D  G G +  G+    +   TPL  
Sbjct: 249 APNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGND--GAGRISFGDKGSVDQRETPLNI 308

Query: 291 SQSH--YNLNLQSISVNGRVLPINPALFATASGQGAIIDSGTTLAYLAEEAYDIF----- 350
            Q H  YN+ +  ISV G    +            A+ DSGT+  YL + AY +      
Sbjct: 309 RQPHPTYNITVTKISVGGNTGDLE---------FDAVFDSGTSFTYLTDAAYTLISESFN 368

Query: 351 IVAIANTISKSTQSVTFKGNQCY-LTSSSISDIFPQASFNFAGGASL-LLRPQDYLIQQF 409
            +A+      +   + F+   CY L+ +  S  +P  +    GG+S  +  P   L+   
Sbjct: 369 SLALDKRYQTTDSELPFE--YCYALSPNKDSFQYPAVNLTMKGGSSYPVYHP---LVVIP 418

BLAST of Cp4.1LG16g04850 vs. ExPASy Swiss-Prot
Match: Q9LX20 (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 PE=2 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 6.2e-26
Identity = 100/352 (28.41%), Positives = 160/352 (45.45%), Query Frame = 0

Query: 82  LYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSE------TGALQVELNVFDPGNS 141
           L+YT + +G P   F V +DTGSN+LW+ CN C+ C+       +     +LN ++P +S
Sbjct: 99  LHYTWIDIGTPSVSFLVALDTGSNLLWIPCN-CVQCAPLTSTYYSSLATKDLNEYNPSSS 158

Query: 142 STASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDG-SETSGYFVIDKMRLNVVGN 201
           ST+ +  CS K+C      S S C     QC + + Y  G + +SG  V D + L    N
Sbjct: 159 STSKVFLCSHKLC-----DSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTN 218

Query: 202 D---HDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQLSSRGLAPKVF 261
           +   + +S+  A VV GC   Q+GD        DG+ G G  ++SV S LS  GL    F
Sbjct: 219 NRLMNGSSSVKARVVIGCGKKQSGDYL-DGVAPDGLMGLGPAEISVPSFLSKAGLMRNSF 278

Query: 262 SHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRVLPINPALFATA 321
           S C + +DSG   +  G+ + P++        QS   L L +   +G ++ +       +
Sbjct: 279 SLCFDEEDSGR--IYFGD-MGPSI-------QQSTPFLQLDNNKYSGYIVGVEACCIGNS 338

Query: 322 ----SGQGAIIDSGTTLAYLAEEAYDIFIVAI---ANTISKSTQSVTFKGNQCYLTSSSI 381
               +     IDSG +  YL EE Y    + I    N  SK+ + V+++    Y   SS 
Sbjct: 339 CLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKNFEGVSWE----YCYESSA 398

Query: 382 SDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG 417
               P     F+   + ++    ++ QQ  G  V +C+       +G   +G
Sbjct: 399 EPKVPAIKLKFSHNNTFVIHKPLFVFQQSQG-LVQFCLPISPSGQEGIGSIG 428

BLAST of Cp4.1LG16g04850 vs. ExPASy Swiss-Prot
Match: A2ZC67 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=2)

HSP 1 Score: 119.0 bits (297), Expect = 1.4e-25
Identity = 99/366 (27.05%), Positives = 162/366 (44.26%), Query Frame = 0

Query: 78  YYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCN-SCIGCSET--GALQVELNVFDPGN 137
           Y +G ++  + +G+P K +++ IDTGS + W+ C+  CI C++   G  + EL       
Sbjct: 33  YPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKY----- 92

Query: 138 SSTASLVSCSDKICTP--GVVSSDSSCFGQTNQCAFALQYGDGSETSGYFVIDKMRLNVV 197
                 V C+++ C      +     C G  NQC + +QY  GS + G  ++D   L   
Sbjct: 93  -----AVKCTEQRCADLYADLRKPMKC-GPKNQCHYGIQYVGGS-SIGVLIVDSFSLPA- 152

Query: 198 GNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQLSSRGLAPK-VF 257
               + +NP+ S+ FGC  +Q  +       V+GI G G+  ++++SQL S+G+  K V 
Sbjct: 153 ---SNGTNPT-SIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVL 212

Query: 258 SHCLNGDDSGGGILVLGEILDP--NVVYTPLVPSQSHYNLNLQSISVNGRVLPINPALFA 317
            HC++    G G L  G+   P   V ++P+     HY+    ++  N    PI      
Sbjct: 213 GHCIS--SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPI------ 272

Query: 318 TASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQ------------SVTFKGNQC 377
           +A+    I DSG T  Y A + Y   +  + +T+SK  +            +V +KG   
Sbjct: 273 SAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDK 332

Query: 378 YLTSSSISDIFPQASFNFAGG---ASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATI 421
             T   +   F   S  FA G   A+L + P+ YLI                I  +G   
Sbjct: 333 IRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLI----------------ISQEGHVC 357

BLAST of Cp4.1LG16g04850 vs. NCBI nr
Match: XP_023513122.1 (aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 872 bits (2252), Expect = 0.0
Identity = 452/497 (90.95%), Positives = 452/497 (90.95%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
           GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG
Sbjct: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILGDLVL 420

Query: 421 -----------------------------------------VSDGGSVGNQPDRLVLHLS 452
                                                    VSDGGSVGNQPDRLVLHLS
Sbjct: 421 KDKIFVYDLANQQIGWTNFNCAMSVNVSTTTRTSKSGLKAQVSDGGSVGNQPDRLVLHLS 480

BLAST of Cp4.1LG16g04850 vs. NCBI nr
Match: KAG7010675.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 864 bits (2232), Expect = 3.25e-315
Identity = 443/473 (93.66%), Positives = 445/473 (94.08%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVR GALVAVAVVLIHAATVSYGFSAK+TLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRIGALVAVAVVLIHAATVSYGFSAKLTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
           GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG
Sbjct: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFV DKMRLNVVGN HDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVTDKMRLNVVGNGHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNG+V
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAI NTISKSTQS  FKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAITNTISKSTQSFNFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGD VVWCVGFQKIQGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDNVVWCVGFQKIQGQGATILGGAMS 420

Query: 421 -----------------VSDGGSVGNQPDRLVLHLSILVFFVHLSIFTSFLNS 452
                            VSDGGSVGNQPDRLVLHLSILVFFVHLSIFTSFLNS
Sbjct: 421 VNVSTTTRTSKSGLKAQVSDGGSVGNQPDRLVLHLSILVFFVHLSIFTSFLNS 473

BLAST of Cp4.1LG16g04850 vs. NCBI nr
Match: XP_022944255.1 (aspartic proteinase-like protein 2 [Cucurbita moschata])

HSP 1 Score: 857 bits (2213), Expect = 5.13e-312
Identity = 444/497 (89.34%), Positives = 446/497 (89.74%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVR GALVAVAVVLIHAATVSYGFSAK+TLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRIGALVAVAVVLIHAATVSYGFSAKLTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
           GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG
Sbjct: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFVIDKMRLNVVGN HDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVIDKMRLNVVGNGHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNG+V
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAI NTISKSTQS  FKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAITNTISKSTQSFNFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGD VVWCVGFQKIQGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDNVVWCVGFQKIQGQGATILGDLVL 420

Query: 421 -----------------------------------------VSDGGSVGNQPDRLVLHLS 452
                                                    VSDGGSVGNQPDRLVLHLS
Sbjct: 421 KDKIFVYDLANQQIGWTNFNCAMSINVSTTTRTSKSGLKAQVSDGGSVGNQPDRLVLHLS 480

BLAST of Cp4.1LG16g04850 vs. NCBI nr
Match: XP_022986298.1 (aspartic proteinase-like protein 2 [Cucurbita maxima])

HSP 1 Score: 838 bits (2166), Expect = 7.70e-305
Identity = 436/498 (87.55%), Positives = 442/498 (88.76%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVR GALVAVAVVLIH ATVSYGFSAK+TLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRAGALVAVAVVLIHDATVSYGFSAKLTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
           GALQV LNVFDP NSSTASLVSCSDKICTPGV+SS SSCFGQTN+CAFALQYGDGSETSG
Sbjct: 121 GALQVGLNVFDPDNSSTASLVSCSDKICTPGVLSSGSSCFGQTNRCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFVIDKMRLNVVGN HDTSN SASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVIDKMRLNVVGNGHDTSNSSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNG+V
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAI NTISKSTQS TFKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAITNTISKSTQSFTFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQK+QGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKLQGQGATILGDLVL 420

Query: 421 ------------------------------------------VSDGGSVGNQPDRLVLHL 452
                                                     VSDGGSV NQPDRLVLHL
Sbjct: 421 KDKIFVYDLANQQIGWTNFNCAMSVNVSTTTRTAKSGLKAQQVSDGGSVPNQPDRLVLHL 480

BLAST of Cp4.1LG16g04850 vs. NCBI nr
Match: KAG6570834.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 701 bits (1808), Expect = 7.91e-251
Identity = 384/497 (77.26%), Positives = 392/497 (78.87%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVR GALVAVAVVLIHAATVSYGFSAK+TLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRIGALVAVAVVLIHAATVSYGFSAKLTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNP + F  ++     IL V     I C   
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPQRIFMYRL-----ILEV-----IFCGFV 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
             L +          S+ SL+                     TNQCAFALQYGDGSETSG
Sbjct: 121 ATLALAAQRLVRSRLSSMSLI--------------------LTNQCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFVIDKMRLNVVGN HDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVIDKMRLNVVGNGHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNG+V
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAI NTISKSTQS  FKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAITNTISKSTQSFNFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGD VVWCVGFQKIQGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDNVVWCVGFQKIQGQGATILGDLVL 420

Query: 421 -----------------------------------------VSDGGSVGNQPDRLVLHLS 452
                                                    VSDGGSVGNQPDRLVLHLS
Sbjct: 421 KDKIFVYDLANQQIGWTNFNCAMAVNVSTTTRTSKSGLKAQVSDGGSVGNQPDRLVLHLS 467

BLAST of Cp4.1LG16g04850 vs. ExPASy TrEMBL
Match: A0A6J1FYJ9 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111448759 PE=3 SV=1)

HSP 1 Score: 857 bits (2213), Expect = 2.48e-312
Identity = 444/497 (89.34%), Positives = 446/497 (89.74%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVR GALVAVAVVLIHAATVSYGFSAK+TLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRIGALVAVAVVLIHAATVSYGFSAKLTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
           GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG
Sbjct: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFVIDKMRLNVVGN HDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVIDKMRLNVVGNGHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNG+V
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAI NTISKSTQS  FKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAITNTISKSTQSFNFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGD VVWCVGFQKIQGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDNVVWCVGFQKIQGQGATILGDLVL 420

Query: 421 -----------------------------------------VSDGGSVGNQPDRLVLHLS 452
                                                    VSDGGSVGNQPDRLVLHLS
Sbjct: 421 KDKIFVYDLANQQIGWTNFNCAMSINVSTTTRTSKSGLKAQVSDGGSVGNQPDRLVLHLS 480

BLAST of Cp4.1LG16g04850 vs. ExPASy TrEMBL
Match: A0A6J1JG44 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111484084 PE=3 SV=1)

HSP 1 Score: 838 bits (2166), Expect = 3.73e-305
Identity = 436/498 (87.55%), Positives = 442/498 (88.76%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVR GALVAVAVVLIH ATVSYGFSAK+TLERAFPTNHGVEMVHLRGRDRARHGRML
Sbjct: 1   MAAIVRAGALVAVAVVLIHDATVSYGFSAKLTLERAFPTNHGVEMVHLRGRDRARHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET
Sbjct: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
           GALQV LNVFDP NSSTASLVSCSDKICTPGV+SS SSCFGQTN+CAFALQYGDGSETSG
Sbjct: 121 GALQVGLNVFDPDNSSTASLVSCSDKICTPGVLSSGSSCFGQTNRCAFALQYGDGSETSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           YFVIDKMRLNVVGN HDTSN SASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ
Sbjct: 181 YFVIDKMRLNVVGNGHDTSNSSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNG+V
Sbjct: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAI NTISKSTQS TFKGNQCYLT
Sbjct: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAITNTISKSTQSFTFKGNQCYLT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG---- 420
           SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQK+QGQGATILG    
Sbjct: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKLQGQGATILGDLVL 420

Query: 421 ------------------------------------------VSDGGSVGNQPDRLVLHL 452
                                                     VSDGGSV NQPDRLVLHL
Sbjct: 421 KDKIFVYDLANQQIGWTNFNCAMSVNVSTTTRTAKSGLKAQQVSDGGSVPNQPDRLVLHL 480

BLAST of Cp4.1LG16g04850 vs. ExPASy TrEMBL
Match: A0A6J1K1L8 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111490894 PE=3 SV=1)

HSP 1 Score: 666 bits (1718), Expect = 6.07e-237
Identity = 343/499 (68.74%), Positives = 393/499 (78.76%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MAAIVRTG  V VAV++I  ATV  GF AK+TLERAFPTNHGVE+  LRGRDR RHGR+L
Sbjct: 1   MAAIVRTGVSVVVAVMMIQVATVLCGFPAKLTLERAFPTNHGVELAQLRGRDRIRHGRIL 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDF ++GTY+P+ +GLYYTKVQLGNPPKDF+VQIDTGS++LWV CNSC GC ET
Sbjct: 61  QSSGGVIDFPVAGTYDPFLVGLYYTKVQLGNPPKDFFVQIDTGSDVLWVSCNSCSGCPET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
             LQ++LN FDPG+SSTASLVSCSD+IC  GV SSDS+C GQ+NQCA+  QYGDGS TSG
Sbjct: 121 SGLQIQLNFFDPGSSSTASLVSCSDQICAVGVQSSDSACLGQSNQCAYVFQYGDGSGTSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           Y+V+D + L++V +   T+N SASV+FGCSTSQTGDLTKSD+ +DGIFGFGQ+DLSVISQ
Sbjct: 181 YYVMDMIHLDIVVDSAMTTNSSASVMFGCSTSQTGDLTKSDRAIDGIFGFGQQDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCL GDDSGGGILVLGEI++PNVVYTPLVPSQ HYNLNLQSISVNGRV
Sbjct: 241 LSSRGLAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGRV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPA+FAT++ QG IIDSGTTLAYLAEEAYD F+ AI NT+S+STQS+  +GNQCY+T
Sbjct: 301 LPINPAVFATSNSQGTIIDSGTTLAYLAEEAYDTFVAAITNTVSQSTQSIVLRGNQCYMT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQF-IGDTVVWCVGFQKIQGQGATILG--- 420
           SSSISDIFP  S NFAGGASL+LRPQDYLIQQ  +  T VWCVGFQKI GQG TILG   
Sbjct: 361 SSSISDIFPLVSLNFAGGASLVLRPQDYLIQQSSVSGTTVWCVGFQKIPGQGITILGDLV 420

Query: 421 --------------------------------------------VSDGGSVGNQPDRLVL 451
                                                       +SD GSV NQP+R++L
Sbjct: 421 LKDKIFIYDLANQRIGWANYDCSTSVNVSTATKTGKSEFVNAGQLSDSGSVQNQPNRVIL 480

BLAST of Cp4.1LG16g04850 vs. ExPASy TrEMBL
Match: A0A6J1H528 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111459652 PE=3 SV=1)

HSP 1 Score: 665 bits (1717), Expect = 8.62e-237
Identity = 343/499 (68.74%), Positives = 394/499 (78.96%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           M AIVRTG  VAVAVV+I AATV  GF AK+TLERAFPTNHGVE+  LRGRDR RHGR+L
Sbjct: 1   MVAIVRTGVSVAVAVVMIQAATVLCGFPAKLTLERAFPTNHGVELAQLRGRDRIRHGRIL 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDF ++GTY+P+ +GLYYTKVQLGNPPKDF+VQIDTGS++LWV CNSC GC ET
Sbjct: 61  QSSGGVIDFPVAGTYDPFLVGLYYTKVQLGNPPKDFFVQIDTGSDVLWVSCNSCSGCPET 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
             LQ++LN FDPG+SSTASLVSCSD+IC  GV SSDS+C GQ+NQCA+  QYGDGS TSG
Sbjct: 121 SGLQIQLNFFDPGSSSTASLVSCSDQICAVGVQSSDSACLGQSNQCAYVFQYGDGSGTSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           Y+V+D + L++V +   T+N SASV+FGCSTSQTGDLTKSD+ +DGIFGFGQ+DLSVISQ
Sbjct: 181 YYVMDMIHLDIVVDSAMTTNSSASVMFGCSTSQTGDLTKSDRAIDGIFGFGQQDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRGLAPKVFSHCL GDDSGGGILVLGEI++PNVVYTPLVPSQ HYNLNLQSISVNGRV
Sbjct: 241 LSSRGLAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGRV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPINPA+FAT++ QG IIDSGTTLAYLAEEAYD F+ AI NT+S+S+QS+  +GNQCY+T
Sbjct: 301 LPINPAVFATSNSQGTIIDSGTTLAYLAEEAYDTFVAAITNTVSQSSQSIILRGNQCYMT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQF-IGDTVVWCVGFQKIQGQGATILG--- 420
           SSSISDIFP  S NFAGGASL+LRPQDYLIQQ  +  T VWCVGFQKI GQG TILG   
Sbjct: 361 SSSISDIFPLVSLNFAGGASLVLRPQDYLIQQSSVSGTTVWCVGFQKIPGQGITILGDLV 420

Query: 421 --------------------------------------------VSDGGSVGNQPDRLVL 451
                                                       +SD GSV NQP+R+++
Sbjct: 421 LKDKIFIYDLANQRIGWTNYDCSTSVNVSTATKTGKSEFVNAGQLSDSGSVQNQPNRVIV 480
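
The identity and positive percentages reported in each HSP header are the match counts divided by the alignment length. A minimal Python sketch, assuming the header line format used on this page (the function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool), recomputes them from the raw counts:

```python
import re

# Matches e.g. "Identity = 343/499 (68.74%), Postives = 394/499 (78.96%)".
# "Posi?tives" accepts both "Postives" and the standard spelling "Positives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return (identity %, positives %) recomputed from the counts, to 2 decimals."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

# Example line taken from the A0A6J1H528 hit.
line = "Identity = 343/499 (68.74%), Postives = 394/499 (78.96%), Query Frame = 0"
print(parse_hsp_stats(line))  # (68.74, 78.96)
```

The recomputed values agree with the percentages printed in the header, which is a quick consistency check when these lines are extracted by script.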

BLAST of Cp4.1LG16g04850 vs. ExPASy TrEMBL
Match: A0A0A0KER5 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G149430 PE=3 SV=1)

HSP 1 Score: 658 bits (1698), Expect = 6.93e-234
Identity = 340/500 (68.00%), Positives = 386/500 (77.20%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MA IV  G  V V VVL+ AA V  GF AK+TLERAFPTNHGVE+ HLR RDR RHGRML
Sbjct: 1   MARIVYAGVSVGVLVVLLQAAMVLCGFPAKLTLERAFPTNHGVEIAHLRSRDRVRHGRML 60

Query: 61  QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSET 120
           QSSGGVIDF +SGTY+P+ +GLYYT+VQLGNPPKDFYVQIDTGS++LWV CNSC GC  T
Sbjct: 61  QSSGGVIDFSVSGTYDPFLVGLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPAT 120

Query: 121 GALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSG 180
             LQ+ LN FDPG+S+TASLVSCSD+IC  GV SSDS+CFGQ+NQCA+  QYGDGS TSG
Sbjct: 121 SGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSG 180

Query: 181 YFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQ 240
           Y+V+D + L+VV +   TSN SASVVFGCSTSQTGDLTKSD+ VDGIFGFGQ+DLSVISQ
Sbjct: 181 YYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQ 240

Query: 241 LSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRV 300
           LSSRG+APKVFSHCL GDDSGGGILVLGEI++PNVVYTPLVPSQ HYNLNLQSISVNG+V
Sbjct: 241 LSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV 300

Query: 301 LPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLT 360
           LPI+PA+FAT+S QG IIDSGTTLAYLAEEAY+ F+VA+ N +S+STQSV  KGN+CY+T
Sbjct: 301 LPISPAVFATSSSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNRCYVT 360

Query: 361 SSSISDIFPQASFNFAGGASLLLRPQDYLIQQ-FIGDTVVWCVGFQKIQGQGATILG--- 420
           SSS+SDIFPQ S NFAGGASL+L  QDYLIQQ  +G T VWC+GFQKI GQG TILG   
Sbjct: 361 SSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLV 420

Query: 421 --------------------------------------------VSDGGSVGNQPDRLVL 452
                                                        SD GS+ NQPDR +L
Sbjct: 421 LKDKIFIYDLANQRIGWTNYDCSMSVNVSTATKTGKSEFVNAGQFSDSGSMQNQPDRFIL 480

BLAST of Cp4.1LG16g04850 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 529.3 bits (1362), Expect = 3.1e-150
Identity = 253/415 (60.96%), Positives = 325/415 (78.31%), Query Frame = 0

Query: 3   AIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRMLQS 62
           A +R  A + +   L+ AA +SYGF A + LER  P NH +E+  L+ RD ARHGR+LQS
Sbjct: 2   AAIRFAAAILIC-CLLPAAVLSYGFPAALKLERVIPANHEMELSQLKARDEARHGRLLQS 61

Query: 63  SGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSETGA 122
            GGVIDF + GT++P+ +GLYYTK++LG PP+DFYVQ+DTGS++LWV C SC GC +T  
Sbjct: 62  LGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSG 121

Query: 123 LQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETSGYF 182
           LQ++LN FDPG+S TAS +SCSD+ C+ G+ SSDS C  Q N CA+  QYGDGS TSG++
Sbjct: 122 LQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFY 181

Query: 183 VIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVISQLS 242
           V D ++ +++       N +A VVFGCSTSQTGDL KSD+ VDGIFGFGQ+ +SVISQL+
Sbjct: 182 VSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLA 241

Query: 243 SRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGRVLP 302
           S+G+AP+VFSHCL G++ GGGILVLGEI++PN+V+TPLVPSQ HYN+NL SISVNG+ LP
Sbjct: 242 SQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALP 301

Query: 303 INPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYLTSS 362
           INP++F+T++GQG IID+GTTLAYL+E AY  F+ AI N +S+S + V  KGNQCY+ ++
Sbjct: 302 INPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITT 361

Query: 363 SISDIFPQASFNFAGGASLLLRPQDYLIQQ-FIGDTVVWCVGFQKIQGQGATILG 417
           S+ DIFP  S NFAGGAS+ L PQDYLIQQ  +G T VWC+GFQ+IQ QG TILG
Sbjct: 362 SVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415

BLAST of Cp4.1LG16g04850 vs. TAIR 10
Match: AT1G08210.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 431.8 bits (1109), Expect = 6.8e-121
Identity = 218/417 (52.28%), Positives = 294/417 (70.50%), Query Frame = 0

Query: 1   MAAIVRTGALVAVAVVLIHAATVSYGFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRML 60
           MA     G ++  AV+L+ A T++ G  A + LER  P NH + +  LR  D ARHGR+L
Sbjct: 1   MAVDSPAGVIIIAAVLLLAATTLACGSDAVLKLERLIPPNHELGLTELRAFDSARHGRLL 60

Query: 61  QSS-GGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNSCIGCSE 120
           QS  GGV++F + G  +P+ +GLYYTKV+LG PP++F VQIDTGS++LWV C SC GC +
Sbjct: 61  QSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPK 120

Query: 121 TGALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYGDGSETS 180
           T  LQ++L+ FDPG SS+ASLVSCSD+ C      ++S C    N C+++ +YGDGS TS
Sbjct: 121 TSELQIQLSFFDPGVSSSASLVSCSDRRCYSN-FQTESGC-SPNNLCSYSFKYGDGSGTS 180

Query: 181 GYFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQRDLSVIS 240
           GY++ D M  + V       N SA  VFGCS  Q+GDL +  + VDGIFG GQ  LSVIS
Sbjct: 181 GYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVIS 240

Query: 241 QLSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQSISVNGR 300
           QL+ +GLAP+VFSHCL GD SGGGI+VLG+I  P+ VYTPLVPSQ HYN+NLQSI+VNG+
Sbjct: 241 QLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQ 300

Query: 301 VLPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFKGNQCYL 360
           +LPI+P++F  A+G G IID+GTTLAYL +EAY  FI A+AN +S+  + +T++  QC+ 
Sbjct: 301 ILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQCFE 360

Query: 361 TSSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQKIQGQGATILG 417
            ++   D+FPQ S +FAGGAS++L P+ YL       + +WC+GFQ++  +  TILG
Sbjct: 361 ITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILG 415

BLAST of Cp4.1LG16g04850 vs. TAIR 10
Match: AT2G36670.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 428.3 bits (1100), Expect = 7.5e-120
Identity = 221/424 (52.12%), Positives = 294/424 (69.34%), Query Frame = 0

Query: 2   AAIVRTGALVAVAVVLIHAATVSY--GFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRM 61
           AA+    A+   A   + +A   Y  G +  + L+RAFP +  VE+  LR RDR RH R+
Sbjct: 11  AALAVALAVTGFAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARI 70

Query: 62  L------QSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNILWVCCNS 121
           L       S GGV+DF + G+ +PY +GLY+TKV+LG+PP +F VQIDTGS+ILWV C+S
Sbjct: 71  LLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS 130

Query: 122 CIGCSETGALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAFALQYG 181
           C  C  +  L ++L+ FD   S TA  V+CSD IC+    ++ + C  + NQC ++ +YG
Sbjct: 131 CSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG 190

Query: 182 DGSETSGYFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIFGFGQR 241
           DGS TSGY++ D    + +  +   +N SA +VFGCST Q+GDLTKSDK VDGIFGFG+ 
Sbjct: 191 DGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKG 250

Query: 242 DLSVISQLSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYNLNLQS 301
            LSV+SQLSSRG+ P VFSHCL GD SGGG+ VLGEIL P +VY+PLVPSQ HYNLNL S
Sbjct: 251 KLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLS 310

Query: 302 ISVNGRVLPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQSVTFK 361
           I VNG++LP++ A+F  ++ +G I+D+GTTL YL +EAYD+F+ AI+N++S+    +   
Sbjct: 311 IGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISN 370

Query: 362 GNQCYLTSSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGD-TVVWCVGFQKIQGQGA 417
           G QCYL S+SISD+FP  S NFAGGAS++LRPQDYL    I D   +WC+GFQK   +  
Sbjct: 371 GEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAP-EEQ 430

BLAST of Cp4.1LG16g04850 vs. TAIR 10
Match: AT2G36670.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 422.2 bits (1084), Expect = 5.4e-118
Identity = 221/429 (51.52%), Positives = 294/429 (68.53%), Query Frame = 0

Query: 2   AAIVRTGALVAVAVVLIHAATVSY--GFSAKMTLERAFPTNHGVEMVHLRGRDRARHGRM 61
           AA+    A+   A   + +A   Y  G +  + L+RAFP +  VE+  LR RDR RH R+
Sbjct: 11  AALAVALAVTGFAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARI 70

Query: 62  L------QSSGGVIDFLLSGTYEPYYLG-----LYYTKVQLGNPPKDFYVQIDTGSNILW 121
           L       S GGV+DF + G+ +PY +G     LY+TKV+LG+PP +F VQIDTGS+ILW
Sbjct: 71  LLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILW 130

Query: 122 VCCNSCIGCSETGALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCAF 181
           V C+SC  C  +  L ++L+ FD   S TA  V+CSD IC+    ++ + C  + NQC +
Sbjct: 131 VTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGY 190

Query: 182 ALQYGDGSETSGYFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGIF 241
           + +YGDGS TSGY++ D    + +  +   +N SA +VFGCST Q+GDLTKSDK VDGIF
Sbjct: 191 SFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIF 250

Query: 242 GFGQRDLSVISQLSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHYN 301
           GFG+  LSV+SQLSSRG+ P VFSHCL GD SGGG+ VLGEIL P +VY+PLVPSQ HYN
Sbjct: 251 GFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYN 310

Query: 302 LNLQSISVNGRVLPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKSTQ 361
           LNL SI VNG++LP++ A+F  ++ +G I+D+GTTL YL +EAYD+F+ AI+N++S+   
Sbjct: 311 LNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVT 370

Query: 362 SVTFKGNQCYLTSSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGD-TVVWCVGFQKI 417
            +   G QCYL S+SISD+FP  S NFAGGAS++LRPQDYL    I D   +WC+GFQK 
Sbjct: 371 PIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKA 430

BLAST of Cp4.1LG16g04850 vs. TAIR 10
Match: AT3G02740.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 260.8 bits (665), Expect = 2.1e-69
Identity = 150/374 (40.11%), Positives = 217/374 (58.02%), Query Frame = 0

Query: 48  LRGRDRARHGRMLQSSGGVIDFLLSGTYEPYYLGLYYTKVQLGNPPKDFYVQIDTGSNIL 107
           LR  D  RH R+L +    ID  L G  +P  +GLY+ K+ LG P +DF+VQ+DTGS+IL
Sbjct: 54  LRAHDVHRHSRLLSA----IDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDIL 113

Query: 108 WVCCNSCIGCSETGALQVELNVFDPGNSSTASLVSCSDKICTPGVVSSDSSCFGQTNQCA 167
           WV C  CI C     L VEL  +D   SSTA  VSCSD  C+   V+  S C   +  C 
Sbjct: 114 WVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSCSDNFCS--YVNQRSECHSGST-CQ 173

Query: 168 FALQYGDGSETSGYFVIDKMRLNVVGNDHDTSNPSASVVFGCSTSQTGDLTKSDKTVDGI 227
           + + YGDGS T+GY V D + L++V  +  T + + +++FGC + Q+G L +S   VDGI
Sbjct: 174 YVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGI 233

Query: 228 FGFGQRDLSVISQLSSRGLAPKVFSHCLNGDDSGGGILVLGEILDPNVVYTPLVPSQSHY 287
            GFGQ + S ISQL+S+G   + F+HCL+ +++GGGI  +GE++ P V  TP++   +HY
Sbjct: 234 MGFGQSNSSFISQLASQGKVKRSFAHCLD-NNNGGGIFAIGEVVSPKVKTTPMLSKSAHY 293

Query: 288 NLNLQSISVNGRVLPINPALFATASGQGAIIDSGTTLAYLAEEAYDIFIVAIANTISKST 347
           ++NL +I V   VL ++   F +   +G IIDSGTTL YL +  Y+  +  I  +  + T
Sbjct: 294 SVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELT 353

Query: 348 QSVTFKGNQCYLTSSSISDIFPQASFNFAGGASLLLRPQDYLIQQFIGDTVVWCVGFQK- 407
                +   C+  +  + D FP  +F F    SL + P++YL  Q   DT  WC G+Q  
Sbjct: 354 LHTVQESFTCFHYTDKL-DRFPTVTFQFDKSVSLAVYPREYLF-QVREDT--WCFGWQNG 413

Query: 408 -IQGQGA---TILG 417
            +Q +G    TILG
Sbjct: 414 GLQTKGGASLTILG 414

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q9S9K4      2.2e-63  37.19         Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2
Q4V3D2      3.7e-63  37.31         Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1
Q8VYV9      5.4e-30  29.87         Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 ...
Q9LX20      6.2e-26  28.41         Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 ...
A2ZC67      1.4e-25  27.05         Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=...

Match Name      E-value    Identity (%)  Description
XP_023513122.1  0.0        90.95         aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo]
KAG7010675.1    3.25e-315  93.66         Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. argyr...
XP_022944255.1  5.13e-312  89.34         aspartic proteinase-like protein 2 [Cucurbita moschata]
XP_022986298.1  7.70e-305  87.55         aspartic proteinase-like protein 2 [Cucurbita maxima]
KAG6570834.1    7.91e-251  77.26         Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror...

Match Name  E-value    Identity (%)  Description
A0A6J1FYJ9  2.48e-312  89.34         aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111448759...
A0A6J1JG44  3.73e-305  87.55         aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111484084 P...
A0A6J1K1L8  6.07e-237  68.74         aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111490894 P...
A0A6J1H528  8.62e-237  68.74         aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111459652...
A0A0A0KER5  6.93e-234  68.00         Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G14943...

Match Name   E-value   Identity (%)  Description
AT5G22850.1  3.1e-150  60.96         Eukaryotic aspartyl protease family protein
AT1G08210.1  6.8e-121  52.28         Eukaryotic aspartyl protease family protein
AT2G36670.2  7.5e-120  52.12         Eukaryotic aspartyl protease family protein
AT2G36670.1  5.4e-118  51.52         Eukaryotic aspartyl protease family protein
AT3G02740.1  2.1e-69   40.11         Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                       Source       Source Term      Source Description                           Alignment
IPR001461  Aspartic peptidase A1 family          PRINTS       PR00792          PEPSIN                                       coord: 316..327, score: 47.98
                                                                                                                            coord: 89..109, score: 50.53
IPR001461  Aspartic peptidase A1 family          PANTHER      PTHR13683        ASPARTYL PROTEASES                           coord: 13..417
IPR021109  Aspartic peptidase domain superfamily GENE3D       2.40.70.10       Acid Proteases                               coord: 270..418, e-value: 1.6E-31, score: 111.4
IPR021109  Aspartic peptidase domain superfamily GENE3D       2.40.70.10       Acid Proteases                               coord: 68..269, e-value: 1.2E-45, score: 157.9
IPR021109  Aspartic peptidase domain superfamily SUPERFAMILY  50630            Acid proteases                               coord: 78..416
IPR032861  Xylanase inhibitor, N-terminal        PFAM         PF14543          TAXi_N                                       coord: 83..268, e-value: 7.8E-39, score: 133.7
IPR032799  Xylanase inhibitor, C-terminal        PFAM         PF14541          TAXi_C                                       coord: 286..417, e-value: 7.2E-17, score: 61.6
None       No IPR available                      PANTHER      PTHR13683:SF784  EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN  coord: 13..417
IPR033121  Peptidase family A1 domain            PROSITE      PS51767          PEPTIDASE_A1                                 coord: 83..437, score: 37.122509
IPR034161  Pepsin-like domain, plant             CDD          cd05476          pepsin_A_like_plant                          coord: 83..416, e-value: 6.22388E-56, score: 184.774
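
Each InterPro alignment is given as a `coord: start..end` range on the protein. A minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page) and using a hypothetical helper name `domain_length`, converts such a range into a domain length in residues:

```python
import re

# Matches the "coord: 83..268" notation used in the Alignment column.
COORD_RE = re.compile(r"coord: (\d+)\.\.(\d+)")

def domain_length(alignment):
    """Return the number of residues covered, assuming inclusive 1-based coords."""
    m = COORD_RE.search(alignment)
    if m is None:
        raise ValueError("no 'coord: start..end' range found")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1

# Values copied from the PFAM rows (TAXi_N and TAXi_C).
print(domain_length("coord: 83..268"))   # 186
print(domain_length("coord: 286..417"))  # 132
```

Under that assumption, the two PFAM domains cover 186 and 132 residues respectively.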

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG16g04850.1  Cp4.1LG16g04850.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0004190      aspartic-type endopeptidase activity