Cp4.1LG01g03340 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g03340
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionEukaryotic aspartyl protease family protein
LocationCp4.1LG01 : 1913966 .. 1915768 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTACGTGATTATAAATTGTTAATTAATTATCGAGGAATACTTTAAATTTATTTAATTAGCTAAATATAAAATATATTTTTTTTAATAATCTTACATTTGTTTGTAGTCCAATACTATTTTTTTTTAATTAGTTTACAAATTTTATTTAATTTATTTATGGAAAAATAATTATTAAAATAGAATTAAAAAAAAAATATTTATAATAAAATATTTTTATTTACCTTATTTAGATTAAAACTAAATTTAATAAATGGTAGGAAATATGAAAATTTAATAGGATCTTAATAAATAGAAATAACTAAATAAATACTTAAATAACATTCTATTTTTATATTTAAAAAACAGTGAACGTTATCGAAACACGGCCGTCCTTTCCTAGTTCCTACGTAATAATTTTAATTTCCACGTCATTATCTTTCTCACGAGTATAAATAAAACAAAGAAATTTAAAATTCATGCAATCCCATGACGCCTACCATATTTCTCCTTTTCGCACTATTTTCCATCGTGGAGTCCACCGCCGGCGAAGGCGGTGGTCTCAAGCTGGAACTCATCCGCCACCGTCTCTCACCCGACAACGTTTCACCGATGGTAGCCAAATCACAAATTTGGCCGGAAACCAGCCAATTTATAGTGAAAATCGCTGTTGGAACGCCGCCGACGGAGGTGCATGCAATCCTCGACACTGGCAGCGATTTATTTTGGGCTCAGTGTCGTCCATGTGCGAAATGTTACCGGCAAACAAATCCGATTTACGACCCTTCGAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGCCGCAGTGCCATTTGTGGGGGTCTGGTGCGGCGTGCTCCGGCACCGACACGTGTAAGTACGGCTATGGGTATGGAAGCGGATCTACGCAGGGAGAATTGGCGACTGAAAAAATGGCTGTAACTTCGAGGTCTGGAGCGACGACGCCGTTTCCGGGGGTGGTGTTTGGTTGCGGACATAATAATAGTGGAACGTTTAATGCGAATGAAATGGGATTGATCGGATTTGGAAGAGGAGCGATTTCCTTCGTTTCTCAGGTATATATATATTTATCATTTTCAATTATTATTATTATTTTTAAAAACATTATTCGCATTAAATATTGAAATGAACAGTTAATTATATTCTTATTCAAATAACTATAAAGTTCAAAACATTATAGTTAATGTAAATTCATATTCACAGATAGGTCCATCGGTTGGCGGCAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCGAGAATCTCAAGTAGCCTCTCTATCGGGTCAGGTTCTGAAGTTAAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAACATCCGACCAGACATCTTACTCTCTCACCCTCAAGGGAATCTCTGTCGGAAAAACTTTCGTTCCGTACAGTACGTCGGGACCTCCGGCCAAGGGGAACGCAATTCTCGATACCGGCACGCCGCCAACTCTCCTCCCCAAAGAATTGTACGGACGATTGGCTATTGAAGTTCGGCGACATATCCCGTTGAAGCCCATTGACGATACTCTTTGCTACAAAGGTAATTTGGGGAATTTGGTGATAACTCTACACTTCGACGGCCACGTGGATCTGCGATTGAGTACGGCTCAAACTTTCAATAAGATGCTGGATGGGTCCTTTTGCTTCAATGCGATGGGCGTTGACGACAAGGACGCACTCATTGGGAATAGTATGATGTCAAATTTTTTGGTTGGGTATGATATTGACAATATGACGGTGTCGTTTAAGCCCACTGATTGCACCAAAATTGGTTGA

mRNA sequence

ATGTACTGTCGTCCATGTGCGAAATGTTACCGGCAAACAAATCCGATTTACGACCCTTCGAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGCCGCAGTGCCATTTGTGGGGGTCTGGTGCGGCGTGCTCCGGCACCGACACGTGTAAGTACGGCTATGGGTATGGAAGCGGATCTACGCAGGGAGAATTGGCGACTGAAAAAATGGCTGTAACTTCGAGGTCTGGAGCGACGACGCCGTTTCCGGGGGTGGTGTTTGGTTGCGGACATAATAATAGTGGAACGTTTAATGCGAATGAAATGGGATTGATCGGATTTGGAAGAGGAGCGATTTCCTTCGTTTCTCAGATAGGTCCATCGGTTGGCGGCAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCGAGAATCTCAAGTAGCCTCTCTATCGGGTCAGGTTCTGAAGTTAAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAACATCCGACCAGACATCTTACTCTCTCACCCTCAAGGGAATCTCTGTCGGAAAAACTTTCGTTCCGTACAGTACGTCGGGACCTCCGGCCAAGGGGAACGCAATTCTCGATACCGGCACGCCGCCAACTCTCCTCCCCAAAGAATTGTACGGACGATTGGCTATTGAAGTTCGGCGACATATCCCGTTGAAGCCCATTGACGATACTCTTTGCTACAAAGGTAATTTGGGGAATTTGGTGATAACTCTACACTTCGACGGCCACGTGGATCTGCGATTGAGTACGGCTCAAACTTTCAATAAGATGCTGGATGGGTCCTTTTGCTTCAATGCGATGGGCGTTGACGACAAGGACGCACTCATTGGGAATAGTATGATGTCAAATTTTTTGGTTGGGTATGATATTGACAATATGACGGTGTCGTTTAAGCCCACTGATTGCACCAAAATTGGTTGA

Coding sequence (CDS)

ATGTACTGTCGTCCATGTGCGAAATGTTACCGGCAAACAAATCCGATTTACGACCCTTCGAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGCCGCAGTGCCATTTGTGGGGGTCTGGTGCGGCGTGCTCCGGCACCGACACGTGTAAGTACGGCTATGGGTATGGAAGCGGATCTACGCAGGGAGAATTGGCGACTGAAAAAATGGCTGTAACTTCGAGGTCTGGAGCGACGACGCCGTTTCCGGGGGTGGTGTTTGGTTGCGGACATAATAATAGTGGAACGTTTAATGCGAATGAAATGGGATTGATCGGATTTGGAAGAGGAGCGATTTCCTTCGTTTCTCAGATAGGTCCATCGGTTGGCGGCAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCGAGAATCTCAAGTAGCCTCTCTATCGGGTCAGGTTCTGAAGTTAAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAACATCCGACCAGACATCTTACTCTCTCACCCTCAAGGGAATCTCTGTCGGAAAAACTTTCGTTCCGTACAGTACGTCGGGACCTCCGGCCAAGGGGAACGCAATTCTCGATACCGGCACGCCGCCAACTCTCCTCCCCAAAGAATTGTACGGACGATTGGCTATTGAAGTTCGGCGACATATCCCGTTGAAGCCCATTGACGATACTCTTTGCTACAAAGGTAATTTGGGGAATTTGGTGATAACTCTACACTTCGACGGCCACGTGGATCTGCGATTGAGTACGGCTCAAACTTTCAATAAGATGCTGGATGGGTCCTTTTGCTTCAATGCGATGGGCGTTGACGACAAGGACGCACTCATTGGGAATAGTATGATGTCAAATTTTTTGGTTGGGTATGATATTGACAATATGACGGTGTCGTTTAAGCCCACTGATTGCACCAAAATTGGTTGA

Protein sequence

MYCRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGSTQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVGKTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDDTLCYKGNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNFLVGYDIDNMTVSFKPTDCTKIG
BLAST of Cp4.1LG01g03340 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 225.7 bits (574), Expect = 7.0e-58
Identity = 130/323 (40.25%), Postives = 184/323 (56.97%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTD-TCKYGYGYGSGS- 62
           C PC  CY Q +P++DP  SST++ +SC S QC    + A+CS  D TC Y   YG  S 
Sbjct: 118 CAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSY 177

Query: 63  TQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGP 122
           T+G +A + + + S          ++ GCGHNN+GTFN    G++G G G +S + Q+G 
Sbjct: 178 TKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD 237

Query: 123 SVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV-RTSDQTSYSLTLKGIS 182
           S+ G KFS CL+P  +    +S ++ G+ + V G GV++  L+ + S +T Y LTLK IS
Sbjct: 238 SIDG-KFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSIS 297

Query: 183 VGKTFVPYSTS-GPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----T 242
           VG   + YS S    ++GN I+D+GT  TLLP E Y  L   V   I  +   D     +
Sbjct: 298 VGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 357

Query: 243 LCYK--GNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMS 302
           LCY   G+L   VIT+HFDG  D++L ++  F ++ +   CF   G     ++ GN    
Sbjct: 358 LCYSATGDLKVPVITMHFDG-ADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQM 417

Query: 303 NFLVGYDIDNMTVSFKPTDCTKI 315
           NFLVGYD  + TVSFKPTDC K+
Sbjct: 418 NFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of Cp4.1LG01g03340 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 194.5 bits (493), Expect = 1.7e-48
Identity = 122/335 (36.42%), Postives = 183/335 (54.63%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCH-LWGSGAACSGTDT-CKYGYGYGSGS 62
           C+PC +CY++  PI+D  KSST+++  C S  C  L  +   C  ++  CKY Y YG  S
Sbjct: 113 CKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQS 172

Query: 63  -TQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIG 122
            ++G++ATE +++ S SG+   FPG VFGCG+NN GTF+    G+IG G G +S +SQ+G
Sbjct: 173 FSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG 232

Query: 123 PSVGGRKFSLCLMPYNTDPRISSSLSIGS----GSEVKGPGVITAQLVRTSDQTSYSLTL 182
            S+  +KFS CL   +     +S +++G+     S  K  GV++  LV     T Y LTL
Sbjct: 233 SSI-SKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTL 292

Query: 183 KGISVGKTFVPYSTSG---------PPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIP 242
           + ISVGK  +PY+ S              GN I+D+GT  TLL    + + +  V   + 
Sbjct: 293 EAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT 352

Query: 243 -LKPIDD-----TLCYK---GNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMG 302
             K + D     + C+K     +G   IT+HF G  D+RLS    F K+ +   C + + 
Sbjct: 353 GAKRVSDPQGLLSHCFKSGSAEIGLPEITVHFTG-ADVRLSPINAFVKLSEDMVCLSMVP 412

Query: 303 VDDKDALIGNSMMSNFLVGYDIDNMTVSFKPTDCT 313
             +  A+ GN    +FLVGYD++  TVSF+  DC+
Sbjct: 413 TTEV-AIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of Cp4.1LG01g03340 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 1.2e-36
Identity = 113/327 (34.56%), Postives = 157/327 (48.01%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C+PC +C+ Q+ PI++P  SS+F TL C S  C    S   CS  + C+Y YGYG GS T
Sbjct: 123 CQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQAL-SSPTCS-NNFCQYTYGYGDGSET 182

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG + TE +   S S      P + FGCG NN G    N  GL+G GRG +S  SQ+  +
Sbjct: 183 QGSMGTETLTFGSVS-----IPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 242

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSD-QTSYSLTLKGISV 182
               KFS C+ P  +     S+L +GS +     G     L+++S   T Y +TL G+SV
Sbjct: 243 ----KFSYCMTPIGSS--TPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSV 302

Query: 183 GKTFVP-----YSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDDT-- 242
           G T +P     ++ +     G  I+D+GT  T      Y  +  E    I L  ++ +  
Sbjct: 303 GSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSS 362

Query: 243 ---LCYK-----GNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKD-AL 302
              LC++      NL      +HFDG  DL L +   F    +G  C  AMG   +  ++
Sbjct: 363 GFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICL-AMGSSSQGMSI 422

Query: 303 IGNSMMSNFLVGYDIDNMTVSFKPTDC 312
            GN    N LV YD  N  VSF    C
Sbjct: 423 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Cp4.1LG01g03340 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 1.4e-34
Identity = 115/327 (35.17%), Postives = 163/327 (49.85%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC +C+ Q  PI++P  SS+F TL C+S  C    S   C+  + C+Y YGYG GS T
Sbjct: 124 CEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPS-ETCNNNE-CQYTYGYGDGSTT 183

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG +ATE          T+  P + FGCG +N G    N  GLIG G G +S  SQ+G  
Sbjct: 184 QGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLG-- 243

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTS-DQTSYSLTLKGISV 182
           VG  +FS C+  Y +     S+L++GS +     G  +  L+ +S + T Y +TL+GI+V
Sbjct: 244 VG--QFSYCMTSYGSSS--PSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITV 303

Query: 183 G--KTFVPYST--SGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD---- 242
           G     +P ST        G  I+D+GT  T LP++ Y  +A      I L  +D+    
Sbjct: 304 GGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSG 363

Query: 243 --TLCYKGNLGNLV----ITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDK--DAL 302
             T   + + G+ V    I++ FDG V L L          +G  C  AMG   +   ++
Sbjct: 364 LSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICL-AMGSSSQLGISI 423

Query: 303 IGNSMMSNFLVGYDIDNMTVSFKPTDC 312
            GN       V YD+ N+ VSF PT C
Sbjct: 424 FGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cp4.1LG01g03340 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 8.6e-32
Identity = 106/327 (32.42%), Postives = 146/327 (44.65%), Query Frame = 1

Query: 1   MYCRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS 60
           + C PC +CY Q++PI+DP KS T+ T+ C SP C    S    +   TC Y   YG GS
Sbjct: 168 LQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGS 227

Query: 61  -TQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIG 120
            T G+ +TE +             GV  GCGH+N G F     GL+G G+G +SF  Q G
Sbjct: 228 FTVGDFSTETLTFRRNR-----VKGVALGCGHDNEGLF-VGAAGLLGLGKGKLSFPGQTG 287

Query: 121 PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGIS 180
                +KFS CL+  +   + SS   +   + V      T  L      T Y + L GIS
Sbjct: 288 HRF-NQKFSYCLVDRSASSKPSS--VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGIS 347

Query: 181 VGKTFVPYSTSG-----PPAKGNAILDTGTPPTLLPKELY------GRLAIEVRRHIPLK 240
           VG T VP  T+          G  I+D+GT  T L +  Y       R+  +  +  P  
Sbjct: 348 VGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDF 407

Query: 241 PIDDTLCYKGNLGNL---VITLHFDGHVDLRL-STAQTFNKMLDGSFCFNAMGVDDKDAL 300
            + DT     N+  +    + LHF G  D+ L +T        +G FCF   G     ++
Sbjct: 408 SLFDTCFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSI 467

Query: 301 IGNSMMSNFLVGYDIDNMTVSFKPTDC 312
           IGN     F V YD+ +  V F P  C
Sbjct: 468 IGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Cp4.1LG01g03340 vs. TrEMBL
Match: A0A059DKL1_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02955 PE=3 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 5.6e-86
Identity = 160/320 (50.00%), Postives = 208/320 (65.00%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY Q NP YDP  SST+  ++C S QC L  + +  + ++TC Y YGY S S T
Sbjct: 53  CLPCDHCYPQKNPKYDPKSSSTYGEVACPSQQCQLLDTTSCAAPSNTCNYTYGYASTSLT 112

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           +G L+TE +   S  G+    P VVFGCGHNN+GTFN NEMG++G G+G IS +SQIG S
Sbjct: 113 KGFLSTETLTFASTEGSPVTLPNVVFGCGHNNTGTFNENEMGIVGIGKGPISLISQIGTS 172

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
            GGR+FS CL+P++T P ISS +S GSGSEV GPG +T  LV   D T Y +TL GISVG
Sbjct: 173 FGGRRFSQCLVPFHTPPTISSKMSFGSGSEVSGPGTVTTSLVALQDPTYYFVTLNGISVG 232

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----TLCY 242
            T++P+S+SG   KGN  LD+GTPPT++P++ Y RL  EV+R + L PIDD      LCY
Sbjct: 233 STYLPFSSSGAVTKGNMFLDSGTPPTIVPRDFYNRLEAEVKRAVELTPIDDPQLRPQLCY 292

Query: 243 KGNL--GNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNFL 302
             ++     V+T HFDG  ++ L    TF +  DG FCF     D    + GN   ++ L
Sbjct: 293 GRDVQAKGPVLTAHFDGKAEVELKQTSTFIEAKDGIFCFAMTSTDSPGGIFGNFAQTDHL 352

Query: 303 VGYDIDNMTVSFKPTDCTKI 315
           +G+D+D  TVSFKPTDCTK+
Sbjct: 353 IGFDLDKNTVSFKPTDCTKL 372

BLAST of Cp4.1LG01g03340 vs. TrEMBL
Match: M5WJE5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022155mg PE=3 SV=1)

HSP 1 Score: 325.5 bits (833), Expect = 7.3e-86
Identity = 160/320 (50.00%), Postives = 210/320 (65.62%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY+Q NP +DP +SST+  LSC + +C   G+G  CS   TC Y Y YG G+ T
Sbjct: 27  CAPCDGCYKQINPKFDPKQSSTYSDLSCDAQECKAIGTGT-CSPQHTCSYSYAYGGGALT 86

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG LA E + +TS SG       +VFGCGHNN+G FN NEMG++G G G++S VSQ+GP 
Sbjct: 87  QGLLAKETITITSTSGEANSLKNIVFGCGHNNTGGFNENEMGIVGLGGGSLSLVSQLGPL 146

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
           VGG+K S CL+P+ TDPR+ S +S G GSEV G GV++  LV   D+T Y +T++GISVG
Sbjct: 147 VGGKKLSFCLVPFRTDPRVESKISFGEGSEVSGDGVVSTPLVSKEDKTPYFVTVEGISVG 206

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD------TLC 242
              VP+S+SG  +KGN  +DTGTPPTLLP++ Y RL  EV+  IP+ PI++       LC
Sbjct: 207 DKLVPFSSSGKVSKGNMFMDTGTPPTLLPQDFYDRLVAEVKNQIPMAPIENDPSLATQLC 266

Query: 243 Y--KGNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNF 302
           Y  K NL   ++T+HF+G  D++L+  QTF    D  FC +A  V     + GN   SN 
Sbjct: 267 YNSKTNLEGPILTVHFEG-ADVKLTPTQTFISPRDEVFCLSAQNVTSDGGIYGNFFQSNL 326

Query: 303 LVGYDIDNMTVSFKPTDCTK 314
           L+GYD++ M  SFKPTDCTK
Sbjct: 327 LIGYDLEKMVASFKPTDCTK 344

BLAST of Cp4.1LG01g03340 vs. TrEMBL
Match: I1LG29_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_11G013100 PE=3 SV=2)

HSP 1 Score: 303.5 bits (776), Expect = 3.0e-79
Identity = 155/322 (48.14%), Postives = 205/322 (63.66%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC KCY+Q NPI+DP KS+++R +SC S  CH   +G  CS    C Y Y Y S + T
Sbjct: 109 CVPCNKCYKQRNPIFDPQKSTSYRNISCDSKLCHKLDTGV-CSPQKHCNYTYAYASAAIT 168

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG LA E + ++S  G + P  G+VFGCGHNN+G FN  EMG+IG G G +SF+SQIG S
Sbjct: 169 QGVLAQETITLSSTKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSS 228

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
            GG++FS CL+P++TD  +SS +S+G GSEV G GV++  LV   D+T Y +TL GISVG
Sbjct: 229 FGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVG 288

Query: 183 KTFVPY--STSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD------T 242
            T++ +  S+S    KGN  LD+GTPPT+LP +LY RL  +VR  + +KP+ +       
Sbjct: 289 NTYLHFNGSSSQSVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQ 348

Query: 243 LCY--KGNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMS 302
           LCY  K NL   V+T HF+G  D++L   QTF    DG FC           + GN   S
Sbjct: 349 LCYRTKNNLRGPVLTAHFEGG-DVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQS 408

Query: 303 NFLVGYDIDNMTVSFKPTDCTK 314
           N+L+G+D+D   VSFKP DCTK
Sbjct: 409 NYLIGFDLDRQVVSFKPMDCTK 428

BLAST of Cp4.1LG01g03340 vs. TrEMBL
Match: A0A0B2RKL3_GLYSO (Putative aspartic protease OS=Glycine soja GN=glysoja_004882 PE=3 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 5.0e-79
Identity = 155/322 (48.14%), Postives = 204/322 (63.35%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC KCY+Q NPI+DP KS+++R +SC S  CH   +G  CS    C Y Y Y S + T
Sbjct: 46  CVPCNKCYKQRNPIFDPQKSTSYRNISCDSKLCHKLDTGV-CSPQKHCNYTYAYASAAIT 105

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG LA E + ++S  G + P  G+VFGCGHNN+G FN  EMG+IG G G +SF+SQIG S
Sbjct: 106 QGVLAQETITLSSTKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSS 165

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
            GG++FS CL+P++TD  +SS +S G GSEV G GV++  LV   D+T Y +TL GISVG
Sbjct: 166 FGGKRFSQCLVPFHTDVSVSSKMSFGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVG 225

Query: 183 KTFVPY--STSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD------T 242
            T++ +  S+S    KGN  LD+GTPPT+LP +LY RL  +VR  + +KP+ +       
Sbjct: 226 NTYLHFNGSSSQSVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGHQ 285

Query: 243 LCY--KGNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMS 302
           LCY  K NL   V+T HF+G  D++L   QTF    DG FC           + GN   S
Sbjct: 286 LCYRTKNNLRGPVLTAHFEGG-DVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQS 345

Query: 303 NFLVGYDIDNMTVSFKPTDCTK 314
           N+L+G+D+D   VSFKP DCTK
Sbjct: 346 NYLIGFDLDRQVVSFKPMDCTK 365

BLAST of Cp4.1LG01g03340 vs. TrEMBL
Match: A0A059DKK9_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02958 PE=3 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.1e-78
Identity = 150/322 (46.58%), Postives = 202/322 (62.73%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC +C+ Q  P YDP  SST+R ++C S QC L GS +  S  +TC Y   Y   S T
Sbjct: 53  CLPCDQCFPQKKPKYDPKSSSTYRDVACPSQQCQLLGSTSCASPLNTCNYTSAYADSSLT 112

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           +G LATE +   S +G     P +VFGCGHNN+G FN NEMGL G  +G  S +SQIG S
Sbjct: 113 KGVLATETLTFASTTGPPVTLPNIVFGCGHNNTGVFNDNEMGLAGLAKGPASLISQIGTS 172

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
            G R+FS CL+P++T P ++S +S GSGSEV GP  +T  L+   + + Y +T+ GISVG
Sbjct: 173 FGARRFSQCLVPFHTPPTVTSKMSFGSGSEVSGPDTVTTSLLTMQNPSFYYVTVNGISVG 232

Query: 183 KTFVPYSTSGPP--AKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----TL 242
            T++P+++SG    +KGN  LD+GTP T++PK+ Y RLA EV+R + L PIDD      L
Sbjct: 233 STYLPFNSSGASHVSKGNVFLDSGTPITIVPKDFYDRLAAEVKRAVELTPIDDPQLRPQL 292

Query: 243 CYKGNL--GNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSN 302
           CY  ++     V+T HFDG  D+      TF +  DG FCF     D    +IGN   +N
Sbjct: 293 CYGRDVQAKGPVLTAHFDGEADVEWKQTSTFIEAKDGIFCFAMTSTDSPGGIIGNYAQTN 352

Query: 303 FLVGYDIDNMTVSFKPTDCTKI 315
           +L+G+D+D  T+SFKPTDCTK+
Sbjct: 353 YLIGFDLDANTISFKPTDCTKL 374

BLAST of Cp4.1LG01g03340 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 225.7 bits (574), Expect = 4.0e-59
Identity = 130/323 (40.25%), Postives = 184/323 (56.97%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTD-TCKYGYGYGSGS- 62
           C PC  CY Q +P++DP  SST++ +SC S QC    + A+CS  D TC Y   YG  S 
Sbjct: 118 CAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSY 177

Query: 63  TQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGP 122
           T+G +A + + + S          ++ GCGHNN+GTFN    G++G G G +S + Q+G 
Sbjct: 178 TKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD 237

Query: 123 SVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV-RTSDQTSYSLTLKGIS 182
           S+ G KFS CL+P  +    +S ++ G+ + V G GV++  L+ + S +T Y LTLK IS
Sbjct: 238 SIDG-KFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSIS 297

Query: 183 VGKTFVPYSTS-GPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----T 242
           VG   + YS S    ++GN I+D+GT  TLLP E Y  L   V   I  +   D     +
Sbjct: 298 VGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 357

Query: 243 LCYK--GNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMS 302
           LCY   G+L   VIT+HFDG  D++L ++  F ++ +   CF   G     ++ GN    
Sbjct: 358 LCYSATGDLKVPVITMHFDG-ADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQM 417

Query: 303 NFLVGYDIDNMTVSFKPTDCTKI 315
           NFLVGYD  + TVSFKPTDC K+
Sbjct: 418 NFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of Cp4.1LG01g03340 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 208.0 bits (528), Expect = 8.5e-54
Identity = 118/321 (36.76%), Postives = 180/321 (56.07%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY+QT+P++DP +SST+R +SC S QC      +  +  +TC Y   YG  S T
Sbjct: 114 CNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYT 173

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           +G++A + + + S          ++ GCGH N+GTF+    G+IG G G+ S VSQ+  S
Sbjct: 174 KGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKS 233

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
           + G KFS CL+P+ ++  ++S ++ G+   V G GV++  +V+    T Y L L+ ISVG
Sbjct: 234 ING-KFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVG 293

Query: 183 KTFVPY-STSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----TLC 242
              + + ST     +GN ++D+GT  TLLP   Y  L   V   I  + + D     +LC
Sbjct: 294 SKKIQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLC 353

Query: 243 YKGNLGNLV--ITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNF 302
           Y+ +    V  IT+HF G  D++L    TF  + +   CF A   +++  + GN    NF
Sbjct: 354 YRDSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCF-AFAANEQLTIFGNLAQMNF 413

Query: 303 LVGYDIDNMTVSFKPTDCTKI 315
           LVGYD  + TVSFK TDC+++
Sbjct: 414 LVGYDTVSGTVSFKKTDCSQM 431

BLAST of Cp4.1LG01g03340 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 206.8 bits (525), Expect = 1.9e-53
Identity = 125/333 (37.54%), Postives = 185/333 (55.56%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCH-LWGSGAACSGT-DTCKYGYGYGSGS 62
           C+PC +CY+Q +P++D  KSST++T SC S  C  L      C  + D CKY Y YG  S
Sbjct: 113 CKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNS 172

Query: 63  -TQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIG 122
            T+G++ATE +++ S SG++  FPG VFGCG+NN GTF     G+IG G G +S VSQ+G
Sbjct: 173 FTKGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLG 232

Query: 123 PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGP----GVITAQLVRTSDQTSYSLTL 182
            S+ G+KFS CL         +S +++G+ S    P      +T  L++   +T Y LTL
Sbjct: 233 SSI-GKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTL 292

Query: 183 KGISVGKTFVPYSTSG-------PPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIP-L 242
           + ++VGKT +PY+  G           GN I+D+GT  TLL    Y      V   +   
Sbjct: 293 EAVTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGA 352

Query: 243 KPIDD-----TLCYKG---NLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVD 302
           K + D     T C+K     +G   IT+HF  + D++LS    F K+ + + C + +   
Sbjct: 353 KRVSDPQGLLTHCFKSGDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTT 412

Query: 303 DKDALIGNSMMSNFLVGYDIDNMTVSFKPTDCT 313
           +  A+ GN +  +FLVGYD++  TVSF+  DC+
Sbjct: 413 EV-AIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of Cp4.1LG01g03340 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 194.5 bits (493), Expect = 9.8e-50
Identity = 122/335 (36.42%), Postives = 183/335 (54.63%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCH-LWGSGAACSGTDT-CKYGYGYGSGS 62
           C+PC +CY++  PI+D  KSST+++  C S  C  L  +   C  ++  CKY Y YG  S
Sbjct: 113 CKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQS 172

Query: 63  -TQGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIG 122
            ++G++ATE +++ S SG+   FPG VFGCG+NN GTF+    G+IG G G +S +SQ+G
Sbjct: 173 FSKGDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG 232

Query: 123 PSVGGRKFSLCLMPYNTDPRISSSLSIGS----GSEVKGPGVITAQLVRTSDQTSYSLTL 182
            S+  +KFS CL   +     +S +++G+     S  K  GV++  LV     T Y LTL
Sbjct: 233 SSI-SKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTL 292

Query: 183 KGISVGKTFVPYSTSG---------PPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIP 242
           + ISVGK  +PY+ S              GN I+D+GT  TLL    + + +  V   + 
Sbjct: 293 EAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT 352

Query: 243 -LKPIDD-----TLCYK---GNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMG 302
             K + D     + C+K     +G   IT+HF G  D+RLS    F K+ +   C + + 
Sbjct: 353 GAKRVSDPQGLLSHCFKSGSAEIGLPEITVHFTG-ADVRLSPINAFVKLSEDMVCLSMVP 412

Query: 303 VDDKDALIGNSMMSNFLVGYDIDNMTVSFKPTDCT 313
             +  A+ GN    +FLVGYD++  TVSF+  DC+
Sbjct: 413 TTEV-AIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of Cp4.1LG01g03340 vs. TAIR10
Match: AT2G28040.1 (AT2G28040.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 160.2 bits (404), Expect = 2.0e-39
Identity = 114/323 (35.29%), Postives = 159/323 (49.23%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY QT PI+DPSKSSTF+ + C +                +C Y   YG  S T
Sbjct: 93  CLPCVHCYNQTAPIFDPSKSSTFKEIRCDTHD-------------HSCPYELVYGGKSYT 152

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           +G L TE + + S SG     P  + GCG NNSG F     G++G  RG  S ++Q+G  
Sbjct: 153 KGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGE 212

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVI-TAQLVRTSDQTSYSLTLKGISV 182
             G      LM Y    + +S ++ G+ + V G GV+ T   V+T+    Y L L  +SV
Sbjct: 213 YPG------LMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSV 272

Query: 183 GKTFVPYSTSGPP---AKGNAILDTGTPPTLLPKELYG--RLAIEVRRHIPLKPIDDTLC 242
           G T +   T G P    KGN ++D+G+  T  P+      R A+E        P  D LC
Sbjct: 273 GNTRI--ETVGTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSDILC 332

Query: 243 YKGNLGNL--VITLHFDGHVDLRLSTAQTF-NKMLDGSFCFNAM-GVDDKDALIGNSMMS 302
           Y     ++  VIT+HF G  DL L     +      G FC   +     ++A+ GN   +
Sbjct: 333 YYSKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQN 392

Query: 303 NFLVGYDIDNMTVSFKPTDCTKI 315
           NFLVGYD  ++ VSFKPT+C+ +
Sbjct: 393 NFLVGYDSSSLLVSFKPTNCSAL 393

BLAST of Cp4.1LG01g03340 vs. NCBI nr
Match: gi|629126499|gb|KCW90924.1| (hypothetical protein EUGRSUZ_A02955 [Eucalyptus grandis])

HSP 1 Score: 325.9 bits (834), Expect = 8.0e-86
Identity = 160/320 (50.00%), Postives = 208/320 (65.00%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY Q NP YDP  SST+  ++C S QC L  + +  + ++TC Y YGY S S T
Sbjct: 53  CLPCDHCYPQKNPKYDPKSSSTYGEVACPSQQCQLLDTTSCAAPSNTCNYTYGYASTSLT 112

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           +G L+TE +   S  G+    P VVFGCGHNN+GTFN NEMG++G G+G IS +SQIG S
Sbjct: 113 KGFLSTETLTFASTEGSPVTLPNVVFGCGHNNTGTFNENEMGIVGIGKGPISLISQIGTS 172

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
            GGR+FS CL+P++T P ISS +S GSGSEV GPG +T  LV   D T Y +TL GISVG
Sbjct: 173 FGGRRFSQCLVPFHTPPTISSKMSFGSGSEVSGPGTVTTSLVALQDPTYYFVTLNGISVG 232

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----TLCY 242
            T++P+S+SG   KGN  LD+GTPPT++P++ Y RL  EV+R + L PIDD      LCY
Sbjct: 233 STYLPFSSSGAVTKGNMFLDSGTPPTIVPRDFYNRLEAEVKRAVELTPIDDPQLRPQLCY 292

Query: 243 KGNL--GNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNFL 302
             ++     V+T HFDG  ++ L    TF +  DG FCF     D    + GN   ++ L
Sbjct: 293 GRDVQAKGPVLTAHFDGKAEVELKQTSTFIEAKDGIFCFAMTSTDSPGGIFGNFAQTDHL 352

Query: 303 VGYDIDNMTVSFKPTDCTKI 315
           +G+D+D  TVSFKPTDCTK+
Sbjct: 353 IGFDLDKNTVSFKPTDCTKL 372

BLAST of Cp4.1LG01g03340 vs. NCBI nr
Match: gi|702255356|ref|XP_010025443.1| (PREDICTED: aspartic proteinase CDR1-like [Eucalyptus grandis])

HSP 1 Score: 325.9 bits (834), Expect = 8.0e-86
Identity = 160/320 (50.00%), Postives = 208/320 (65.00%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY Q NP YDP  SST+  ++C S QC L  + +  + ++TC Y YGY S S T
Sbjct: 126 CLPCDHCYPQKNPKYDPKSSSTYGEVACPSQQCQLLDTTSCAAPSNTCNYTYGYASTSLT 185

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           +G L+TE +   S  G+    P VVFGCGHNN+GTFN NEMG++G G+G IS +SQIG S
Sbjct: 186 KGFLSTETLTFASTEGSPVTLPNVVFGCGHNNTGTFNENEMGIVGIGKGPISLISQIGTS 245

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
            GGR+FS CL+P++T P ISS +S GSGSEV GPG +T  LV   D T Y +TL GISVG
Sbjct: 246 FGGRRFSQCLVPFHTPPTISSKMSFGSGSEVSGPGTVTTSLVALQDPTYYFVTLNGISVG 305

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD-----TLCY 242
            T++P+S+SG   KGN  LD+GTPPT++P++ Y RL  EV+R + L PIDD      LCY
Sbjct: 306 STYLPFSSSGAVTKGNMFLDSGTPPTIVPRDFYNRLEAEVKRAVELTPIDDPQLRPQLCY 365

Query: 243 KGNL--GNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNFL 302
             ++     V+T HFDG  ++ L    TF +  DG FCF     D    + GN   ++ L
Sbjct: 366 GRDVQAKGPVLTAHFDGKAEVELKQTSTFIEAKDGIFCFAMTSTDSPGGIFGNFAQTDHL 425

Query: 303 VGYDIDNMTVSFKPTDCTKI 315
           +G+D+D  TVSFKPTDCTK+
Sbjct: 426 IGFDLDKNTVSFKPTDCTKL 445

BLAST of Cp4.1LG01g03340 vs. NCBI nr
Match: gi|595841136|ref|XP_007208154.1| (hypothetical protein PRUPE_ppa022155mg [Prunus persica])

HSP 1 Score: 325.5 bits (833), Expect = 1.0e-85
Identity = 160/320 (50.00%), Postives = 210/320 (65.62%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY+Q NP +DP +SST+  LSC + +C   G+G  CS   TC Y Y YG G+ T
Sbjct: 27  CAPCDGCYKQINPKFDPKQSSTYSDLSCDAQECKAIGTGT-CSPQHTCSYSYAYGGGALT 86

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG LA E + +TS SG       +VFGCGHNN+G FN NEMG++G G G++S VSQ+GP 
Sbjct: 87  QGLLAKETITITSTSGEANSLKNIVFGCGHNNTGGFNENEMGIVGLGGGSLSLVSQLGPL 146

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
           VGG+K S CL+P+ TDPR+ S +S G GSEV G GV++  LV   D+T Y +T++GISVG
Sbjct: 147 VGGKKLSFCLVPFRTDPRVESKISFGEGSEVSGDGVVSTPLVSKEDKTPYFVTVEGISVG 206

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD------TLC 242
              VP+S+SG  +KGN  +DTGTPPTLLP++ Y RL  EV+  IP+ PI++       LC
Sbjct: 207 DKLVPFSSSGKVSKGNMFMDTGTPPTLLPQDFYDRLVAEVKNQIPMAPIENDPSLATQLC 266

Query: 243 Y--KGNLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNF 302
           Y  K NL   ++T+HF+G  D++L+  QTF    D  FC +A  V     + GN   SN 
Sbjct: 267 YNSKTNLEGPILTVHFEG-ADVKLTPTQTFISPRDEVFCLSAQNVTSDGGIYGNFFQSNL 326

Query: 303 LVGYDIDNMTVSFKPTDCTK 314
           L+GYD++ M  SFKPTDCTK
Sbjct: 327 LIGYDLEKMVASFKPTDCTK 344

BLAST of Cp4.1LG01g03340 vs. NCBI nr
Match: gi|694393472|ref|XP_009372173.1| (PREDICTED: aspartic proteinase CDR1-like [Pyrus x bretschneideri])

HSP 1 Score: 324.7 bits (831), Expect = 1.8e-85
Identity = 164/320 (51.25%), Postives = 211/320 (65.94%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY+Q NP++DP+KSST++ LSC + +C   G+   CS    C Y Y YGS + T
Sbjct: 107 CAPCPGCYKQINPLFDPTKSSTYKDLSCYAQECQTIGT-ITCSLRHACNYTYAYGSAAVT 166

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG L+ E + +TS SG  T    +VFGCGHNN+GTFN NEMG+IG G G +S VSQ+ P 
Sbjct: 167 QGILSKETITITSTSGNATSLENIVFGCGHNNTGTFNENEMGIIGLGGGDLSLVSQLSPL 226

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
           VGG+KFS CL+P++TDP I S +S G GSEV G GV++  LV   D+T Y +TLKGISVG
Sbjct: 227 VGGKKFSFCLVPFHTDPSIESKISFGEGSEVFGDGVVSTPLVTKEDKTPYFVTLKGISVG 286

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD------TLC 242
             FVP+++SG  +KGN  +DTGTPPTL+P++ Y RL  EVR  IP+ PI D       LC
Sbjct: 287 NKFVPFNSSGEVSKGNMFMDTGTPPTLIPQDFYDRLVAEVRSQIPMTPIGDDPSLGTQLC 346

Query: 243 YKG--NLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNF 302
           YK   NL   ++T+HF+G  D++L+T QTF    D  FCF    V     + G    SNF
Sbjct: 347 YKSKTNLKGPILTVHFEG-ADVKLTTIQTFVPPKDEVFCFAMQTVPWDVGIYGGFAQSNF 406

Query: 303 LVGYDIDNMTVSFKPTDCTK 314
           L+GYD++ M   FKPTDCTK
Sbjct: 407 LIGYDLETMVAFFKPTDCTK 424

BLAST of Cp4.1LG01g03340 vs. NCBI nr
Match: gi|658040568|ref|XP_008355882.1| (PREDICTED: aspartic proteinase CDR1-like [Malus domestica])

HSP 1 Score: 318.2 bits (814), Expect = 1.7e-83
Identity = 161/320 (50.31%), Postives = 208/320 (65.00%), Query Frame = 1

Query: 3   CRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLWGSGAACSGTDTCKYGYGYGSGS-T 62
           C PC  CY+Q NP++DP+KSST++ LSC + +C   G+   CS    C Y Y YGS + T
Sbjct: 107 CAPCPGCYKQINPLFDPTKSSTYKDLSCYAQECQTIGT-ITCSLRHACNYTYAYGSAAVT 166

Query: 63  QGELATEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPS 122
           QG L+ E + +TS S   T    +VFGCGHNN+GTFN NEMG+IG G G +S VSQ+ P 
Sbjct: 167 QGVLSKETITITSTSXNATSLENIVFGCGHNNTGTFNENEMGIIGLGGGDLSLVSQLSPL 226

Query: 123 VGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLKGISVG 182
           VGG+KFS CL+P++TDP I S +S G GSEV G GV++  LV   D+T Y +TLKGISVG
Sbjct: 227 VGGKKFSFCLVPFHTDPSIESKISFGEGSEVSGDGVVSTPLVTKEDKTPYFVTLKGISVG 286

Query: 183 KTFVPYSTSGPPAKGNAILDTGTPPTLLPKELYGRLAIEVRRHIPLKPIDD------TLC 242
             FVP+++SG  +KGN  +DTGTPPTL+P++   RL  EVR  IP+ PI D       LC
Sbjct: 287 NKFVPFNSSGEVSKGNMFMDTGTPPTLIPQDFXDRLVAEVRSQIPMTPIGDDPSLGTQLC 346

Query: 243 YKG--NLGNLVITLHFDGHVDLRLSTAQTFNKMLDGSFCFNAMGVDDKDALIGNSMMSNF 302
           YK   NL   ++T+HF+G  D++L+  QTF    D  FCF    V     + G    SNF
Sbjct: 347 YKSKTNLQGPILTVHFEG-ADVKLTPIQTFVPPKDEVFCFAMQTVPWDVGIYGGFAQSNF 406

Query: 303 LVGYDIDNMTVSFKPTDCTK 314
           L+GYD++ M   FKPTDCTK
Sbjct: 407 LIGYDLETMVAFFKPTDCTK 424

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
CDR1_ARATH7.0e-5840.25Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1[more]
ASPR1_ARATH1.7e-4836.42Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S... [more]
NEP1_NEPGR1.2e-3634.56Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
NEP2_NEPGR1.4e-3435.17Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
APF2_ARATH8.6e-3232.42Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A059DKL1_EUCGR5.6e-8650.00Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02955 PE=3 SV=1[more]
M5WJE5_PRUPE7.3e-8650.00Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022155mg PE=3 SV=1[more]
I1LG29_SOYBN3.0e-7948.14Uncharacterized protein OS=Glycine max GN=GLYMA_11G013100 PE=3 SV=2[more]
A0A0B2RKL3_GLYSO5.0e-7948.14Putative aspartic protease OS=Glycine soja GN=glysoja_004882 PE=3 SV=1[more]
A0A059DKK9_EUCGR1.1e-7846.58Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A02958 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G33340.14.0e-5940.25 Eukaryotic aspartyl protease family protein[more]
AT1G64830.18.5e-5436.76 Eukaryotic aspartyl protease family protein[more]
AT1G31450.11.9e-5337.54 Eukaryotic aspartyl protease family protein[more]
AT2G35615.19.8e-5036.42 Eukaryotic aspartyl protease family protein[more]
AT2G28040.12.0e-3935.29 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|629126499|gb|KCW90924.1|8.0e-8650.00hypothetical protein EUGRSUZ_A02955 [Eucalyptus grandis][more]
gi|702255356|ref|XP_010025443.1|8.0e-8650.00PREDICTED: aspartic proteinase CDR1-like [Eucalyptus grandis][more]
gi|595841136|ref|XP_007208154.1|1.0e-8550.00hypothetical protein PRUPE_ppa022155mg [Prunus persica][more]
gi|694393472|ref|XP_009372173.1|1.8e-8551.25PREDICTED: aspartic proteinase CDR1-like [Pyrus x bretschneideri][more]
gi|658040568|ref|XP_008355882.1|1.7e-8350.31PREDICTED: aspartic proteinase CDR1-like [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR021109Peptidase_aspartic_dom_sf
IPR001969Aspartic_peptidase_AS
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g03340.1Cp4.1LG01g03340.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 3..313
score: 1.4E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 198..209
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 6..147
score: 2.5E-23coord: 155..313
score: 3.6
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 2..311
score: 1.08
NoneNo IPR availablePANTHERPTHR13683:SF309SUBFAMILY NOT NAMEDcoord: 3..313
score: 1.4E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None