Cp4.1LG01g17040 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g17040
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Eukaryotic aspartyl protease family protein
Location: Cp4.1LG01 : 11306997 .. 11308268 (+)
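
The feature length follows from the two endpoints of the location. The sketch below is a minimal illustration in Python, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cp4.1LG01 : 11306997 .. 11308268 (+)")

# Assumption: both endpoints are part of the feature (inclusive coordinates).
length = end - start + 1
print(seqid, strand, length)  # length is 1272 under this assumption
```

Under that assumption the span is a multiple of three, which is what one would expect if the whole feature is a single uninterrupted coding region.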
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTGCCATTTCAATCTTCTTCTCTCTCCTCCTCATCTCCTTTCGCTTCACCGCCGTCTATGGCGGCGGCATTGGCTTCACCACCACTCTCTTCCACCGCGATTCTCCACAATCACCCATCCGCAACCAATCTCTCTCTCACTACGACCGCCTAAATAACGCCATCCGCCGTTCTATCTCCCGTGCCGACGCTCTCTTCCAACGCGCCGCCGCTCTCACCGACAACACCATCGAATCTCCAATCTCCCCTGGCGGCGGCGAGTATGTAATGTCTGTGTCAATTGGAACACCGCCGGTGGCTTACGTGGCCATAGCTGATACGGGCAGCGATCCAGCGTGGACTCAATGCATGCCATGTAAGAAATGTTACCCCCAATCAAAACCCATTTTTGACCCCAAAAAATCCTCCTCTTTCCGTCACGTGCCTTGCACGTCCAATACCTGTCAGTCAGTGGGCGGCGCCACATGTGGGGACCAGGGGTCTTGCAATTACAGTATTGTGTACGGAGATCAAACCTACTCTAAGGGAGAGTTGGGAACTGATACGATCACCATCGGGTCAACGTCTGTGAACATGGTGATTGGATGTGGCCACGAGAGCGGCGGCGGGTTCGGCACCACCTCCGGTGTCATCGGACTCGCCGGCGGCGAACTCTCTATAGTCACTCAAATGAGCAAAAAAAGCGCCGTGAGCCGGAAATTCTCCTATTGCTTACCGTCGGTATCGAGTCAAGGAAGTGGCAAAATCAACTTTGGCAAAAACGCCGTCGTTTCGGGTCCTGGTGTCGTTTCGACCCCACTGGGCCCCAGAACGATGTATCAGATGACTCTGGAAGCCATTTCCGTTGGTAACGAACGTCACGCGGTCGGAAACGCTGTCGTAAAGGACAACATGATCATTGATTCCGGGACCACATTGAGTTACATTCCGAAGGAGATGCACGACGGCGTCGTTTCGTCGATGGCGAAGATCATCGGATCGAAGCGGGTGAAGGATCCCGGTAACTTTTTTGGTCTGTGCTATTCTTCAGATGGCCACGACGTGAATATTCCGGCCATTACCGCTCATTTCGCCGGCGGCGCTGACGTGAGGTTGGGGATGGAGAATATGTTTATGAGGGTGGCGGGTGGTGTGAGTTGCTTGATGTTGACGCCGATGACGGCGAGCGACCCGTTTGGGATTTGGGGGAATATAGCGCAGGCCAATTTCTTGATCGGATATGATTTGGAGAGGAAGACCTTGTCCTTCAAACCAACCGTCTGTGCTTAG

mRNA sequence

ATGCCTGCCATTTCAATCTTCTTCTCTCTCCTCCTCATCTCCTTTCGCTTCACCGCCGTCTATGGCGGCGGCATTGGCTTCACCACCACTCTCTTCCACCGCGATTCTCCACAATCACCCATCCGCAACCAATCTCTCTCTCACTACGACCGCCTAAATAACGCCATCCGCCGTTCTATCTCCCGTGCCGACGCTCTCTTCCAACGCGCCGCCGCTCTCACCGACAACACCATCGAATCTCCAATCTCCCCTGGCGGCGGCGAGTATGTAATGTCTGTGTCAATTGGAACACCGCCGGTGGCTTACGTGGCCATAGCTGATACGGGCAGCGATCCAGCGTGGACTCAATGCATGCCATGTAAGAAATGTTACCCCCAATCAAAACCCATTTTTGACCCCAAAAAATCCTCCTCTTTCCGTCACGTGCCTTGCACGTCCAATACCTGTCAGTCAGTGGGCGGCGCCACATGTGGGGACCAGGGGTCTTGCAATTACAGTATTGTGTACGGAGATCAAACCTACTCTAAGGGAGAGTTGGGAACTGATACGATCACCATCGGGTCAACGTCTGTGAACATGGTGATTGGATGTGGCCACGAGAGCGGCGGCGGGTTCGGCACCACCTCCGGTGTCATCGGACTCGCCGGCGGCGAACTCTCTATAGTCACTCAAATGAGCAAAAAAAGCGCCGTGAGCCGGAAATTCTCCTATTGCTTACCGTCGGTATCGAGTCAAGGAAGTGGCAAAATCAACTTTGGCAAAAACGCCGTCGTTTCGGGTCCTGGTGTCGTTTCGACCCCACTGGGCCCCAGAACGATGTATCAGATGACTCTGGAAGCCATTTCCGTTGGTAACGAACGTCACGCGGTCGGAAACGCTGTCGTAAAGGACAACATGATCATTGATTCCGGGACCACATTGAGTTACATTCCGAAGGAGATGCACGACGGCGTCGTTTCGTCGATGGCGAAGATCATCGGATCGAAGCGGGTGAAGGATCCCGGTAACTTTTTTGGTCTGTGCTATTCTTCAGATGGCCACGACGTGAATATTCCGGCCATTACCGCTCATTTCGCCGGCGGCGCTGACGTGAGGTTGGGGATGGAGAATATGTTTATGAGGGTGGCGGGTGGTGTGAGTTGCTTGATGTTGACGCCGATGACGGCGAGCGACCCGTTTGGGATTTGGGGGAATATAGCGCAGGCCAATTTCTTGATCGGATATGATTTGGAGAGGAAGACCTTGTCCTTCAAACCAACCGTCTGTGCTTAG

Coding sequence (CDS)

ATGCCTGCCATTTCAATCTTCTTCTCTCTCCTCCTCATCTCCTTTCGCTTCACCGCCGTCTATGGCGGCGGCATTGGCTTCACCACCACTCTCTTCCACCGCGATTCTCCACAATCACCCATCCGCAACCAATCTCTCTCTCACTACGACCGCCTAAATAACGCCATCCGCCGTTCTATCTCCCGTGCCGACGCTCTCTTCCAACGCGCCGCCGCTCTCACCGACAACACCATCGAATCTCCAATCTCCCCTGGCGGCGGCGAGTATGTAATGTCTGTGTCAATTGGAACACCGCCGGTGGCTTACGTGGCCATAGCTGATACGGGCAGCGATCCAGCGTGGACTCAATGCATGCCATGTAAGAAATGTTACCCCCAATCAAAACCCATTTTTGACCCCAAAAAATCCTCCTCTTTCCGTCACGTGCCTTGCACGTCCAATACCTGTCAGTCAGTGGGCGGCGCCACATGTGGGGACCAGGGGTCTTGCAATTACAGTATTGTGTACGGAGATCAAACCTACTCTAAGGGAGAGTTGGGAACTGATACGATCACCATCGGGTCAACGTCTGTGAACATGGTGATTGGATGTGGCCACGAGAGCGGCGGCGGGTTCGGCACCACCTCCGGTGTCATCGGACTCGCCGGCGGCGAACTCTCTATAGTCACTCAAATGAGCAAAAAAAGCGCCGTGAGCCGGAAATTCTCCTATTGCTTACCGTCGGTATCGAGTCAAGGAAGTGGCAAAATCAACTTTGGCAAAAACGCCGTCGTTTCGGGTCCTGGTGTCGTTTCGACCCCACTGGGCCCCAGAACGATGTATCAGATGACTCTGGAAGCCATTTCCGTTGGTAACGAACGTCACGCGGTCGGAAACGCTGTCGTAAAGGACAACATGATCATTGATTCCGGGACCACATTGAGTTACATTCCGAAGGAGATGCACGACGGCGTCGTTTCGTCGATGGCGAAGATCATCGGATCGAAGCGGGTGAAGGATCCCGGTAACTTTTTTGGTCTGTGCTATTCTTCAGATGGCCACGACGTGAATATTCCGGCCATTACCGCTCATTTCGCCGGCGGCGCTGACGTGAGGTTGGGGATGGAGAATATGTTTATGAGGGTGGCGGGTGGTGTGAGTTGCTTGATGTTGACGCCGATGACGGCGAGCGACCCGTTTGGGATTTGGGGGAATATAGCGCAGGCCAATTTCTTGATCGGATATGATTTGGAGAGGAAGACCTTGTCCTTCAAACCAACCGTCTGTGCTTAG

Protein sequence

MPAISIFFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGTDTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPSVSSQGSGKINFGKNAVVSGPGVVSTPLGPRTMYQMTLEAISVGNERHAVGNAVVKDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKTLSFKPTVCA
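
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code and uses a hypothetical helper, `translate`; only the first 30 nucleotides of the CDS are shown to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS listed above.
print(translate("ATGCCTGCCATTTCAATCTTCTTCTCTCTC"))  # MPAISIFFSL
```

Passing the full CDS string to `translate` should reproduce the protein sequence if the annotation uses the standard code.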
BLAST of Cp4.1LG01g17040 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 6.7e-88
Identity = 190/441 (43.08%), Positives = 272/441 (61.68%), Query Frame = 1

Query: 5   SIFFSLLLISFRFTAVYGGG--IGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISR 64
           S+  SL L+S  F +       +GFT  L HRDSP+SP  N   +   RL NAI RS++R
Sbjct: 7   SVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR 66

Query: 65  ADALFQRAAALTDNTIESPI--SPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPC 124
                ++     DNT +  I  +   GEY+M+VSIGTPP   +AIADTGSD  WTQC PC
Sbjct: 67  VFHFTEK-----DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC 126

Query: 125 KKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSV-GGATCG-DQGSCNYSIVYGDQTYSKGE 184
             CY Q  P+FDPK SS+++ V C+S+ C ++   A+C  +  +C+YS+ YGD +Y+KG 
Sbjct: 127 DDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGN 186

Query: 185 LGTDTITIGSTSV------NMVIGCGHESGGGFGTT-SGVIGLAGGELSIVTQMSKKSAV 244
           +  DT+T+GS+        N++IGCGH + G F    SG++GL GG +S++ Q+    ++
Sbjct: 187 IAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLG--DSI 246

Query: 245 SRKFSYCLPSVSSQ--GSGKINFGKNAVVSGPGVVSTPL----GPRTMYQMTLEAISVGN 304
             KFSYCL  ++S+   + KINFG NA+VSG GVVSTPL       T Y +TL++ISVG+
Sbjct: 247 DGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGS 306

Query: 305 ER---HAVGNAVVKDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCY 364
           ++       +   + N+IIDSGTTL+ +P E +  +  ++A  I +++ +DP +   LCY
Sbjct: 307 KQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY 366

Query: 365 SSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQA 424
           S+ G D+ +P IT HF  GADV+L   N F++V+  + C        S  F I+GN+AQ 
Sbjct: 367 SATG-DLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCF---AFRGSPSFSIYGNVAQM 426
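
The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal Python sketch that parses such a line and recomputes both values is given below; the regular expression and the function name `hsp_percentages` are assumptions based on the line layout used on this page, not part of any BLAST tool.

```python
import re

# Matches the summary line; also accepts the variant spelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, len_a, pos, len_b = map(int, m.groups())
    return round(100 * ident / len_a, 2), round(100 * pos / len_b, 2)

line = "Identity = 190/441 (43.08%), Positives = 272/441 (61.68%), Query Frame = 1"
print(hsp_percentages(line))  # (43.08, 61.68)
```

Recomputing from the counts is a cheap consistency check when these lines are copied between reports.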

BLAST of Cp4.1LG01g17040 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 3.9e-80
Identity = 186/449 (41.43%), Positives = 265/449 (59.02%), Query Frame = 1

Query: 6   IFFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRADA 65
           +FFS+ L S       G    F+  L HRDSP SPI N  ++  DRLN A  RS+SR+  
Sbjct: 11  LFFSVTLSSS------GHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 70

Query: 66  LFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCYP 125
            F    + TD  ++S +    GE+ MS++IGTPP+   AIADTGSD  W QC PC++CY 
Sbjct: 71  -FNHQLSQTD--LQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYK 130

Query: 126 QSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGS---CNYSIVYGDQTYSKGELGTD 185
           ++ PIFD KKSS+++  PC S  CQ++     G   S   C Y   YGDQ++SKG++ T+
Sbjct: 131 ENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATE 190

Query: 186 TITIGSTS------VNMVIGCGHESGGGFGTT-SGVIGLAGGELSIVTQMSKKSAVSRKF 245
           T++I S S         V GCG+ +GG F  T SG+IGL GG LS+++Q+   S++S+KF
Sbjct: 191 TVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKF 250

Query: 246 SYCL--PSVSSQGSGKINFGKNAVVSG----PGVVSTPL---GPRTMYQMTLEAISVGNE 305
           SYCL   S ++ G+  IN G N++ S      GVVSTPL    P T Y +TLEAISVG +
Sbjct: 251 SYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKK 310

Query: 306 R--------HAVGNAVVKD---NMIIDSGTTLSYIPKEMHDGVVSSMAK-IIGSKRVKDP 365
           +        +   + ++ +   N+IIDSGTTL+ +     D   S++ + + G+KRV DP
Sbjct: 311 KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDP 370

Query: 366 GNFFGLCYSSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFG 424
                 C+ S   ++ +P IT HF  GADVRL   N F++++  + CL + P T      
Sbjct: 371 QGLLSHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTT---EVA 430

BLAST of Cp4.1LG01g17040 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.8e-56
Identity = 151/442 (34.16%), Positives = 222/442 (50.23%), Query Frame = 1

Query: 3   AISIFFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNA-----IR 62
           A S++  LL +S  +  V        T L HR   +       L H D   N      + 
Sbjct: 2   ASSLYSFLLALSIVYIFVAPTHSTSRTALNHRHEAKVTGFQIMLEHVDSGKNLTKFQLLE 61

Query: 63  RSISRADALFQRAAALTDNT--IESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWT 122
           R+I R     QR  A+ +    +E+ +  G GEY+M++SIGTP   + AI DTGSD  WT
Sbjct: 62  RAIERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWT 121

Query: 123 QCMPCKKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYS 182
           QC PC +C+ QS PIF+P+ SSSF  +PC+S  CQ++   TC +   C Y+  YGD + +
Sbjct: 122 QCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNN-FCQYTYGYGDGSET 181

Query: 183 KGELGTDTITIGSTSV-NMVIGCGHESGG-GFGTTSGVIGLAGGELSIVTQMSKKSAVSR 242
           +G +GT+T+T GS S+ N+  GCG  + G G G  +G++G+  G LS+ +Q+        
Sbjct: 182 QGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLD-----VT 241

Query: 243 KFSYCLPSVSSQGSGKINFGK--NAVVSGPGVVSTPLGPR--TMYQMTLEAISVGNERHA 302
           KFSYC+  + S     +  G   N+V +G    +     +  T Y +TL  +SVG+ R  
Sbjct: 242 KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLP 301

Query: 303 VGNAVVKDN-------MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCY 362
           +  +    N       +IIDSGTTL+Y     +  V       I    V    + F LC+
Sbjct: 302 IDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCF 361

Query: 363 S--SDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIA 422
              SD  ++ IP    HF GG D+ L  EN F+  + G+ CL +   ++S    I+GNI 
Sbjct: 362 QTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICLAMG--SSSQGMSIFGNIQ 421

BLAST of Cp4.1LG01g17040 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.8e-56
Identity = 154/447 (34.45%), Positives = 232/447 (51.90%), Query Frame = 1

Query: 2   PAISIFFSLLLISFRFTAVYGGGIGFTTTLFH-RDSPQSPIR--------NQSLSHYDRL 61
           P  S+   L ++S           G  T L H +  PQ  +R         ++L+ Y+ +
Sbjct: 4   PLYSVVLGLAIVSAIVAPTSSTSRG--TLLHHGQKRPQPGLRVDLEQVDSGKNLTKYELI 63

Query: 62  NNAIRRSISRADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDP 121
             AI+R   R  ++   A   + + IE+P+  G GEY+M+V+IGTP  ++ AI DTGSD 
Sbjct: 64  KRAIKRGERRMRSI--NAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDL 123

Query: 122 AWTQCMPCKKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQ 181
            WTQC PC +C+ Q  PIF+P+ SSSF  +PC S  CQ +   TC +   C Y+  YGD 
Sbjct: 124 IWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETC-NNNECQYTYGYGDG 183

Query: 182 TYSKGELGTDTITIGSTSV-NMVIGCGHESGG-GFGTTSGVIGLAGGELSIVTQMSKKSA 241
           + ++G + T+T T  ++SV N+  GCG ++ G G G  +G+IG+  G LS+ +Q+     
Sbjct: 184 STTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLG---- 243

Query: 242 VSRKFSYCLPSVSSQGSGKINFGKNAVVSGPGVVST-----PLGPRTMYQMTLEAISVGN 301
              +FSYC+ S  S     +  G  A     G  ST      L P T Y +TL+ I+VG 
Sbjct: 244 -VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNP-TYYYITLQGITVGG 303

Query: 302 ERHAVGNAVVK------DNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFG 361
           +   + ++  +        MIIDSGTTL+Y+P++ ++ V  +    I    V +  +   
Sbjct: 304 DNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLS 363

Query: 362 LCYS--SDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFG--I 421
            C+   SDG  V +P I+  F GG  + LG +N+ +  A GV CL    M +S   G  I
Sbjct: 364 TCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGVICL---AMGSSSQLGISI 423

Query: 422 WGNIAQANFLIGYDLERKTLSFKPTVC 423
           +GNI Q    + YDL+   +SF PT C
Sbjct: 424 FGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cp4.1LG01g17040 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 5.5e-50
Identity = 135/427 (31.62%), Positives = 204/427 (47.78%), Query Frame = 1

Query: 27  FTTTLFHRDS-PQSPIRNQSLSHYDRLNNAIRRSISRADALFQRAAALT----------- 86
           +T  L HRD  P    RN    H+ RL+  +RR   R  A+ +R +              
Sbjct: 59  YTLRLLHRDRFPSVTYRN----HHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVN 118

Query: 87  --DNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCYPQSKPIFD 146
              + I S +  G GEY + + +G+PP     + D+GSD  W QC PCK CY QS P+FD
Sbjct: 119 DFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFD 178

Query: 147 PKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGTDTITIGSTSV- 206
           P KS S+  V C S+ C  +  + C   G C Y ++YGD +Y+KG L  +T+T   T V 
Sbjct: 179 PAKSGSYTGVSCGSSVCDRIENSGC-HSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVR 238

Query: 207 NMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPSVSSQGSGKIN 266
           N+ +GCGH + G F   +G++G+ GG +S V Q+S ++  +  F YCL S  +  +G + 
Sbjct: 239 NVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGA--FGYCLVSRGTDSTGSLV 298

Query: 267 FGKNAVVSGPGVVSTPLGPR--TMYQMTLEAISVGNERHAVGNAVV------KDNMIIDS 326
           FG+ A+  G   V     PR  + Y + L+ + VG  R  + + V          +++D+
Sbjct: 299 FGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDT 358

Query: 327 GTTLSYIPKEMH----DGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGH-DVNIPAITAHF 386
           GT ++ +P   +    DG  S  A +  +  V    + F  CY   G   V +P ++ +F
Sbjct: 359 GTAVTRLPTAAYVAFRDGFKSQTANLPRASGV----SIFDTCYDLSGFVSVRVPTVSFYF 418

Query: 387 AGGADVRLGMENMFMRV-AGGVSCLMLTPMTASDPFG--IWGNIAQANFLIGYDLERKTL 423
             G  + L   N  M V   G  C       A+ P G  I GNI Q    + +D     +
Sbjct: 419 TEGPVLTLPARNFLMPVDDSGTYCFAF----AASPTGLSIIGNIQQEGIQVSFDGANGFV 470

BLAST of Cp4.1LG01g17040 vs. TrEMBL
Match: A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 6.0e-136
Identity = 258/428 (60.28%), Positives = 313/428 (73.13%), Query Frame = 1

Query: 4   ISIFFSLLL--ISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSIS 63
           IS+FF L+L  ISF  T +  G  GFTT+LFHRDS  SP+   SLSHYDRL NA RRS+S
Sbjct: 5   ISLFFHLILFLISFSQTTIINGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAFRRSLS 64

Query: 64  RADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCK 123
           R+ AL  RAA      ++S I PG GEY+MSVSIGTPPV Y+ IADTGSD  W QC+PC 
Sbjct: 65  RSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCL 124

Query: 124 KCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGT 183
           KCY Q +PIF+P KS+SF HVPC + TC +V    CG QG C+YS  YGD+TYSKG+LG 
Sbjct: 125 KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGF 184

Query: 184 DTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPS 243
           + ITIGS+SV  VIGCGH S GGFG  SGVIGL GG+LS+V+QMS+ S +SR+FSYCLP+
Sbjct: 185 EKITIGSSSVKSVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPT 244

Query: 244 VSSQGSGKINFGKNAVVSGPGVVSTPLGPR---TMYQMTLEAISVGNERHAVGNAVVKDN 303
           + S  +GKINFG+NAVVSGPGVVSTPL  +   T Y +TLEAIS+GNERH       + N
Sbjct: 245 LLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMA--FAKQGN 304

Query: 304 MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYS---SDGHDVNIPAIT 363
           +IIDSGTTL+ +PKE++DGVVSS+ K++ +KRVKDP     LC+    +    + IP IT
Sbjct: 305 VIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVIT 364

Query: 364 AHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKTL 423
           AHF+GGA+V L   N F +VA  V+CL L   + +  FGI GN+AQANFLIGYDLE K L
Sbjct: 365 AHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRL 424

BLAST of Cp4.1LG01g17040 vs. TrEMBL
Match: A0A0A0KX67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055390 PE=3 SV=1)

HSP 1 Score: 479.9 bits (1234), Expect = 3.1e-132
Identity = 256/428 (59.81%), Positives = 308/428 (71.96%), Query Frame = 1

Query: 4   ISIFFSL--LLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSIS 63
           ISIFF L  LLISF  T +  G  GFTT+LFHRDS  SP+   SLSHYDRL NA RRS+S
Sbjct: 5   ISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLS 64

Query: 64  RADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCK 123
           R+  L  RAA      +++P++PG GEY+MSVSIGTPPV Y+ +ADTGSD  W QC+PC 
Sbjct: 65  RSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL 124

Query: 124 KCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGT 183
           KCY QS+PIFDP KS+SF HVPC S  C+++  + CG QG C+YS  YGD+TYSKG+LG 
Sbjct: 125 KCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDRTYSKGDLGF 184

Query: 184 DTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPS 243
           + ITIGS+SV  VIGCGHESGGGFG  SGVIGL GG    V                LP+
Sbjct: 185 EKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGANPPV----------------LPT 244

Query: 244 VSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGNERHAVGNAVVKDN 303
           + S  +GKINFG+NAVVSGPGVVSTPL    P T Y +TLEAIS+GNERH    +  + N
Sbjct: 245 LLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMA--SAKQGN 304

Query: 304 MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDV----NIPAI 363
           +IIDSGTTLS++PKE++DGVVSS+ K++ +KRVKDPGNF+ LC+  DG +V     IP I
Sbjct: 305 VIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF-DDGINVATSSGIPII 364

Query: 364 TAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKT 423
           TA F+GGA+V L   N F +VA  V+CL LTP + +D FGI GN+A ANFLIGYDLE K 
Sbjct: 365 TAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 413

BLAST of Cp4.1LG01g17040 vs. TrEMBL
Match: A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 409.8 bits (1052), Expect = 3.9e-111
Identity = 230/437 (52.63%), Positives = 296/437 (67.73%), Query Frame = 1

Query: 1   MPAISIFFSLLLI-SFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRS 60
           M AISIFF  LL  S + TA  GG  GFTT+LF RDSP SP+ N SLS YD L +A RRS
Sbjct: 1   MAAISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRS 60

Query: 61  ISRADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMP 120
            SR+  L     +++   I SPI P  GE++MS+ IGTPPV  +AIADTGSD  WTQC+P
Sbjct: 61  FSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLP 120

Query: 121 CKKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCG-DQGSCNYSIVYGDQTYSKGE 180
           C++C+ QS+PIF+P++SSS+R V C S+TC+S+    CG D  SC+Y   YGD++++ G+
Sbjct: 121 CRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGD 180

Query: 181 LGTDTITIGSTSV-NMVIGCGHESGGGF-GTTSGVIGLAGGELSIVTQMSKKSAVSRKFS 240
           L +D ITIGS  +   VIGCGH++GG F G TSG+IGL GG LS+V+QM   + V  +FS
Sbjct: 181 LASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFS 240

Query: 241 YCLPSVSSQG--SGKINFGKNAVVSGPGVVSTPLGPR---TMYQMTLEAISVGNERHAVG 300
           YCLP+  S    +G I+FG+ AVVSG  VVSTPL PR   T Y +TLEAISVG +R    
Sbjct: 241 YCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAA 300

Query: 301 NAVV----KDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSS-DG 360
           N +       N+IIDSGTTL+ +P+ ++ GV S++A++I +KRV DP     LCYS+   
Sbjct: 301 NGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQV 360

Query: 361 HDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLI 420
            D+NIP ITAHFAGGADV+L   N F  VA  V+CL   P T      I+GN+AQ NF +
Sbjct: 361 DDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPAT---QVAIFGNLAQINFEV 420

Query: 421 GYDLERKTLSFKPTVCA 424
           GYDL  K LSF+P +CA
Sbjct: 421 GYDLGNKRLSFEPKLCA 434

BLAST of Cp4.1LG01g17040 vs. TrEMBL
Match: M5WRG3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 2.0e-99
Identity = 202/447 (45.19%), Positives = 277/447 (61.97%), Query Frame = 1

Query: 6   IFFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRAD- 65
           ++F L L++           GFT  L HRDSP SP+ N S+SH DRL+NA RRS++R   
Sbjct: 12  LYFPLALLACFILLAQASSHGFTADLIHRDSPLSPLYNSSMSHLDRLHNAFRRSVTRVHH 71

Query: 66  ----ALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPC 125
                +   +++L    I+S I P  GEY+M+VSIGTPPV  + IADTGSD  WTQC PC
Sbjct: 72  FIKPTMTSLSSSLAAPNIQSIIIPSAGEYLMNVSIGTPPVEVLGIADTGSDLIWTQCKPC 131

Query: 126 KKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATC-----GDQGSCNYSIVYGDQTYS 185
           K+C+ Q+ P+FDPKKSS++  +PC S++C  +  A C     GD  +C YS  YGD++++
Sbjct: 132 KQCFNQNPPLFDPKKSSTYHSIPCQSSSCTYLEEAACGTLINGDHDTCEYSYRYGDRSFT 191

Query: 186 KGELGTDTITIGSTS------VNMVIGCGHESGGGFGTT-SGVIGLAGGELSIVTQMSKK 245
           +G L  +T+T GSTS        +V GCGHE+GG F  + SG+IGL GG LS+++Q++K 
Sbjct: 192 RGTLALETLTFGSTSGRPTSLPKVVFGCGHENGGTFDESGSGLIGLGGGPLSLISQLTKL 251

Query: 246 SAVSRKFSYCLPSVSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGN 305
           +    KFSYCL   ++  + KI+FG   +VSG G VSTPL    P T Y +TLEAISVG 
Sbjct: 252 TN-GGKFSYCLLPTANTAASKISFGSAGIVSGSGAVSTPLVAKNPDTFYYLTLEAISVGE 311

Query: 306 ERHA----------VGNAVVKDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPG 365
           +R A             A  + N+IIDSGTTL+ +P   HD +VS++   I ++RV DP 
Sbjct: 312 KRLAYKTKSPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALETAINAERVSDPR 371

Query: 366 NFFGLCYSSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGI 423
               LC+ S   D+ +P IT HF+GGADV+L   N F R+   + C  + P   S    I
Sbjct: 372 GILSLCFKSKSDDIGVPVITVHFSGGADVKLQALNTFARMDDDMICFTMIP---SSDVAI 431

BLAST of Cp4.1LG01g17040 vs. TrEMBL
Match: M1DUW2_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400044361 PE=3 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 2.4e-92
Identity = 199/444 (44.82%), Positives = 266/444 (59.91%), Query Frame = 1

Query: 7   FFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRADAL 66
           FF L LIS   T  Y  G GFT  L HRDSP SP  N S +  +RL NA  RS SRA + 
Sbjct: 17  FFHLSLISCHKTISYRVGNGFTLDLIHRDSPLSPFYNPSNTQSNRLRNAFHRSFSRA-SF 76

Query: 67  FQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCYPQ 126
           F++++  T NTI+S ISP  GEY+M +SIGTPPV  VAIADTGSD  WTQCMPC+ C+ Q
Sbjct: 77  FKKSSLATTNTIQSDISPIPGEYLMKLSIGTPPVEIVAIADTGSDLTWTQCMPCENCFQQ 136

Query: 127 SKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGTDTITI 186
           S P+FD KKSS+++ V C    C S+ G++C     C Y + YGDQ+++ G+L  D  T 
Sbjct: 137 SSPLFDSKKSSTYKTVGCNVEVCTSLEGSSCVKGNVCEYQMSYGDQSHTIGDLAFDKFTF 196

Query: 187 GSTS------VNMVIGCGHESGGGFGT-TSGVIGLAGGELSIVTQMSKKSAVSRKFSYCL 246
            STS       N+  GCGH++GG F   TSG+IGL GG++S++ Q+ K+  ++ KFSYCL
Sbjct: 197 PSTSGENVVIPNVAFGCGHDNGGTFNNYTSGIIGLGGGKVSMINQLDKE--INGKFSYCL 256

Query: 247 ------PSVSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGNER--- 306
                  S++S  +  INFG +A+VSGP VVSTPL    P T Y + LE +SVGN+    
Sbjct: 257 IPIPFDSSINSNITSHINFGISAIVSGPNVVSTPLIKKEPSTYYYLNLEGVSVGNKTLKF 316

Query: 307 ---------HAVGNAVVKDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFF 366
                    +A G      N+IIDSGTTL+ +P + +  + S++   I + R  DP   F
Sbjct: 317 KSSKTSPSDNASGGDGQAGNIIIDSGTTLTLLPNDFYSNLESTLVNSIRANRKDDPSGNF 376

Query: 367 GLCYSSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGN 423
            LCY S+   ++ P I  HF   AD+ L   + F  +  G+ CL + P   +D   I+GN
Sbjct: 377 HLCYESENGTIDAPTIVTHFT-NADLELSPSSTFAEIEQGLVCLTIVP---ADEIAIFGN 436

BLAST of Cp4.1LG01g17040 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 335.9 bits (860), Expect = 3.6e-92
Identity = 191/414 (46.14%), Positives = 268/414 (64.73%), Query Frame = 1

Query: 26  GFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRADALFQRAAALTDNTIESPISPG 85
           GFT  L HRDSP+SP  N + +   R+ NAIRRS +R+   F    A + N+ +S I+  
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS-ARSTLQFSNDDA-SPNSPQSFITSN 84

Query: 86  GGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCYPQSKPIFDPKKSSSFRHVPCT 145
            GEY+M++SIGTPPV  +AIADTGSD  WTQC PC+ CY Q+ P+FDPK+SS++R V C+
Sbjct: 85  RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCS 144

Query: 146 SNTCQSVGGATCG-DQGSCNYSIVYGDQTYSKGELGTDTITIGSTS------VNMVIGCG 205
           S+ C+++  A+C  D+ +C+Y+I YGD +Y+KG++  DT+T+GS+        NM+IGCG
Sbjct: 145 SSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCG 204

Query: 206 HESGGGFGTT-SGVIGLAGGELSIVTQMSKKSAVSRKFSYCL-PSVSSQG-SGKINFGKN 265
           HE+ G F    SG+IGL GG  S+V+Q+ K  +++ KFSYCL P  S  G + KINFG N
Sbjct: 205 HENTGTFDPAGSGIIGLGGGSTSLVSQLRK--SINGKFSYCLVPFTSETGLTSKINFGTN 264

Query: 266 AVVSGPGVVSTPL---GPRTMYQMTLEAISVGNERHAVGNAVV---KDNMIIDSGTTLSY 325
            +VSG GVVST +    P T Y + LEAISVG+++    + +    + N++IDSGTTL+ 
Sbjct: 265 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTL 324

Query: 326 IPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDVNIPAITAHFAGGADVRLGME 385
           +P   +  + S +A  I ++RV+DP     LCY  D     +P IT HF GG DV+LG  
Sbjct: 325 LPSNFYYELESVVASTIKAERVQDPDGILSLCY-RDSSSFKVPDITVHFKGG-DVKLGNL 384

Query: 386 NMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKTLSFKPTVCA 424
           N F+ V+  VSC       A++   I+GN+AQ NFL+GYD    T+SFK T C+
Sbjct: 385 NTFVAVSEDVSCF---AFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429

BLAST of Cp4.1LG01g17040 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 325.9 bits (834), Expect = 3.8e-89
Identity = 190/441 (43.08%), Positives = 272/441 (61.68%), Query Frame = 1

Query: 5   SIFFSLLLISFRFTAVYGGG--IGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISR 64
           S+  SL L+S  F +       +GFT  L HRDSP+SP  N   +   RL NAI RS++R
Sbjct: 7   SVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNR 66

Query: 65  ADALFQRAAALTDNTIESPI--SPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPC 124
                ++     DNT +  I  +   GEY+M+VSIGTPP   +AIADTGSD  WTQC PC
Sbjct: 67  VFHFTEK-----DNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC 126

Query: 125 KKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSV-GGATCG-DQGSCNYSIVYGDQTYSKGE 184
             CY Q  P+FDPK SS+++ V C+S+ C ++   A+C  +  +C+YS+ YGD +Y+KG 
Sbjct: 127 DDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGN 186

Query: 185 LGTDTITIGSTSV------NMVIGCGHESGGGFGTT-SGVIGLAGGELSIVTQMSKKSAV 244
           +  DT+T+GS+        N++IGCGH + G F    SG++GL GG +S++ Q+    ++
Sbjct: 187 IAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLG--DSI 246

Query: 245 SRKFSYCLPSVSSQ--GSGKINFGKNAVVSGPGVVSTPL----GPRTMYQMTLEAISVGN 304
             KFSYCL  ++S+   + KINFG NA+VSG GVVSTPL       T Y +TL++ISVG+
Sbjct: 247 DGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGS 306

Query: 305 ER---HAVGNAVVKDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCY 364
           ++       +   + N+IIDSGTTL+ +P E +  +  ++A  I +++ +DP +   LCY
Sbjct: 307 KQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY 366

Query: 365 SSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQA 424
           S+ G D+ +P IT HF  GADV+L   N F++V+  + C        S  F I+GN+AQ 
Sbjct: 367 SATG-DLKVPVITMHF-DGADVKLDSSNAFVQVSEDLVCF---AFRGSPSFSIYGNVAQM 426

BLAST of Cp4.1LG01g17040 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 300.1 bits (767), Expect = 2.2e-81
Identity = 186/449 (41.43%), Positives = 265/449 (59.02%), Query Frame = 1

Query: 6   IFFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRADA 65
           +FFS+ L S       G    F+  L HRDSP SPI N  ++  DRLN A  RS+SR+  
Sbjct: 11  LFFSVTLSSS------GHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 70

Query: 66  LFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCYP 125
            F    + TD  ++S +    GE+ MS++IGTPP+   AIADTGSD  W QC PC++CY 
Sbjct: 71  -FNHQLSQTD--LQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYK 130

Query: 126 QSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGS---CNYSIVYGDQTYSKGELGTD 185
           ++ PIFD KKSS+++  PC S  CQ++     G   S   C Y   YGDQ++SKG++ T+
Sbjct: 131 ENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATE 190

Query: 186 TITIGSTS------VNMVIGCGHESGGGFGTT-SGVIGLAGGELSIVTQMSKKSAVSRKF 245
           T++I S S         V GCG+ +GG F  T SG+IGL GG LS+++Q+   S++S+KF
Sbjct: 191 TVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKF 250

Query: 246 SYCL--PSVSSQGSGKINFGKNAVVSG----PGVVSTPL---GPRTMYQMTLEAISVGNE 305
           SYCL   S ++ G+  IN G N++ S      GVVSTPL    P T Y +TLEAISVG +
Sbjct: 251 SYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKK 310

Query: 306 R--------HAVGNAVVKD---NMIIDSGTTLSYIPKEMHDGVVSSMAK-IIGSKRVKDP 365
           +        +   + ++ +   N+IIDSGTTL+ +     D   S++ + + G+KRV DP
Sbjct: 311 KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDP 370

Query: 366 GNFFGLCYSSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFG 424
                 C+ S   ++ +P IT HF  GADVRL   N F++++  + CL + P T      
Sbjct: 371 QGLLSHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTT---EVA 430

BLAST of Cp4.1LG01g17040 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 287.0 bits (733), Expect = 1.9e-77
Identity = 182/448 (40.62%), Positives = 259/448 (57.81%), Query Frame = 1

Query: 6   IFFSLLLISFRFTAVYGGGI-GFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRAD 65
           ++ SLL ISF F +         T  L HRDSP SP+ N   +  DRLN A  RSISR+ 
Sbjct: 7   LYCSLLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRS- 66

Query: 66  ALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKCY 125
              +R    TD  ++S +   GGEY MS+SIGTPP    AIADTGSD  W QC PC++CY
Sbjct: 67  ---RRFTTKTD--LQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCY 126

Query: 126 PQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGS---CNYSIVYGDQTYSKGELGT 185
            Q+ P+FD KKSS+++   C S TCQ++     G   S   C Y   YGD +++KG++ T
Sbjct: 127 KQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVAT 186

Query: 186 DTITIGSTS------VNMVIGCGHESGGGFGTT-SGVIGLAGGELSIVTQMSKKSAVSRK 245
           +TI+I S+S         V GCG+ +GG F  T SG+IGL GG LS+V+Q+   S++ +K
Sbjct: 187 ETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLG--SSIGKK 246

Query: 246 FSYCL--PSVSSQGSGKINFGKNAVVSGP----GVVSTPL---GPRTMYQMTLEAISVGN 305
           FSYCL   + ++ G+  IN G N++ S P      ++TPL    P T Y +TLEA++VG 
Sbjct: 247 FSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGK 306

Query: 306 ER-------HAVGNAVVK--DNMIIDSGTTLSYIPKEMHDGVVSSMAK-IIGSKRVKDPG 365
            +       + +     K   N+IIDSGTTL+ +    +D   +++ + + G+KRV DP 
Sbjct: 307 TKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQ 366

Query: 366 NFFGLCYSSDGHDVNIPAITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGI 424
                C+ S   ++ +PAIT HF   ADV+L   N F+++     CL + P T      I
Sbjct: 367 GLLTHCFKSGDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTT---EVAI 426

BLAST of Cp4.1LG01g17040 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 241.9 bits (616), Expect = 7.2e-64
Identity = 165/434 (38.02%), Positives = 225/434 (51.84%), Query Frame = 1

Query: 4   ISIFFSLLLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSISRA 63
           I + F  + + F FT       GFT  L HR S  S           R++N    S   A
Sbjct: 7   IIVLFLQISLCFLFTTTASPPHGFTMDLIHRRSNAS----------SRVSNTQSGSSPYA 66

Query: 64  DALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCKKC 123
           + +F       DN++          Y+M + +GTPP    AI DTGS+  WTQC+PC  C
Sbjct: 67  NTVF-------DNSV----------YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHC 126

Query: 124 YPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGTDT 183
           Y Q+ PIFDP KSS+F+   C              D  SC Y + Y D TY+ G L T+T
Sbjct: 127 YEQNAPIFDPSKSSTFKEKRC--------------DGHSCPYEVDYFDHTYTMGTLATET 186

Query: 184 ITIGSTS------VNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSY 243
           IT+ STS         +IGCGH +     + SG++GL  G  S++TQM  +       SY
Sbjct: 187 ITLHSTSGEPFVMPETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGE--YPGLMSY 246

Query: 244 CLPSVSSQGSGKINFGKNAVVSGPGVVSTPLGPRT----MYQMTLEAISVGNER-HAVGN 303
           C    S QG+ KINFG NA+V+G GVVST +   T     Y + L+A+SVGN R   +G 
Sbjct: 247 CF---SGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGT 306

Query: 304 A--VVKDNMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDVN 363
               ++ N++IDSGTTL+Y P    + V  ++  ++ + R  DP     LCY+SD  D+ 
Sbjct: 307 TFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTIDI- 366

Query: 364 IPAITAHFAGGADVRLGMENMFMRV-AGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYD 423
            P IT HF+GG D+ L   NM+M    GGV CL +   + +    I+GN AQ NFL+GYD
Sbjct: 367 FPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE-AIFGNRAQNNFLVGYD 392
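
The HSP summary lines in these reports follow a fixed pattern: bit score, raw score in parentheses, E-value, then identity and positive counts over the alignment length. As a minimal sketch, assuming the layout shown for the AT2G28010.1 hit, the Python below parses such a summary and recomputes the percentages from the raw counts. The function name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool.

```python
import re

def parse_hsp_stats(text):
    """Parse an HSP summary block laid out like the ones on this page.

    Accepts both the spelling "Postives" and "Positives".
    """
    score = re.search(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\)", text)
    expect = re.search(r"Expect\s*=\s*([\d.eE+-]+)", text)
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", text)
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", text)
    if not (score and expect and ident and posit):
        raise ValueError("unrecognised HSP summary layout")

    ident_n, ident_len = int(ident.group(1)), int(ident.group(2))
    posit_n, posit_len = int(posit.group(1)), int(posit.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * ident_n / ident_len, 2),
        "positives_pct": round(100 * posit_n / posit_len, 2),
    }

summary = (
    "HSP 1 Score: 241.9 bits (616), Expect = 7.2e-64\n"
    "Identity = 165/434 (38.02%), Postives = 225/434 (51.84%), Query Frame = 1"
)
stats = parse_hsp_stats(summary)
print(stats["identity_pct"], stats["positives_pct"])  # 38.02 51.84
```

The recomputed values agree with the percentages printed in the report, which is a quick consistency check when records like this are collected in bulk.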

BLAST of Cp4.1LG01g17040 vs. NCBI nr
Match: gi|659102476|ref|XP_008452153.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 517.7 bits (1332), Expect = 1.9e-143
Identity = 268/429 (62.47%), Positives = 327/429 (76.22%), Query Frame = 1

Query: 3   AISIFFSLLL--ISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSI 62
           A SIF  L+L  ISF  T +  G  GFTT+LFHRDS  SP+   SLSHYDRL+NA RRS+
Sbjct: 2   AASIFCRLILFLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLSNAFRRSL 61

Query: 63  SRADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPC 122
           SR+ AL  RAA      ++SPI+PG GEY+MSVSIGTPPV Y+ +ADTGSD  W QC+PC
Sbjct: 62  SRSAALLNRAATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTWAQCLPC 121

Query: 123 KKCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELG 182
            KC+ QS+PIF+P KS+SF HVPC S  CQ++  A CG QG C+YS  YGDQTY+KG+LG
Sbjct: 122 VKCFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTKGDLG 181

Query: 183 TDTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLP 242
            + ITIGS+SV  VIGCGHESGGGFG  SGVIGL GG+LS+V+QMS+ S +SR+FSYCLP
Sbjct: 182 LEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLP 241

Query: 243 SVSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGNERHAVGNAVVKD 302
           ++ S  +GKINFG+NAVVSGPGVVSTPL    P T Y +TLEAIS+GNERH    +  + 
Sbjct: 242 TLLSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA--SAKQG 301

Query: 303 NMIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDV----NIPA 362
           N+IIDSGTTL+ +PKE++DGVVSS+ K++ +KRVKDPG+F+ LC+  DG +V     IP 
Sbjct: 302 NVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCF-DDGINVAASSGIPI 361

Query: 363 ITAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERK 422
           ITAHF+GGA+V L   N F +VA  V+CL LT  + +D FGI GN+AQANFLIGYDLE K
Sbjct: 362 ITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAK 421
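
In each alignment row, the unlabeled middle line summarizes the column-by-column comparison: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below, a hedged illustration rather than BLAST's own code, counts identities and positives from such a midline, using the first ten columns of the final row of the XP_008452153.1 alignment. The helper name `count_midline` is an assumption made for this example.

```python
def count_midline(midline):
    """Count identities and positives in a BLAST protein midline.

    Letters are identical residues; '+' is a conservative substitution.
    Positives include identities, as in the report's summary line.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# First ten columns of the last alignment row shown for XP_008452153.1.
query   = "ITAHFAGGAD"
midline = "ITAHF+GGA+"
sbjct   = "ITAHFSGGAN"

identities, positives = count_midline(midline)
print(identities, positives, len(midline))  # 8 10 10
```

Summing these counts over every row of an HSP should give the numerators of the Identity and Positives fractions in its summary line.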

BLAST of Cp4.1LG01g17040 vs. NCBI nr
Match: gi|659102474|ref|XP_008452152.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 507.7 bits (1306), Expect = 2.0e-140
Identity = 263/428 (61.45%), Positives = 322/428 (75.23%), Query Frame = 1

Query: 4   ISIFFSL--LLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSIS 63
           ISIFF L  LLISF  T +  G  GFTT+LFHRDS  SP+   +LSHYDRL+NA RRS+S
Sbjct: 5   ISIFFLLFLLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNAFRRSLS 64

Query: 64  RADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCK 123
           R+ AL  R A      ++SPI+PG GEY+M VSIGTPPV Y+ + DTGSD  W QC+PC+
Sbjct: 65  RSAALLNRTATSGAVGLQSPIAPGSGEYLMYVSIGTPPVDYIGMIDTGSDLTWAQCLPCR 124

Query: 124 KCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGT 183
           KC+ Q +PIF+P KS+SF HVPC S  CQ++  A CG QG C+YS  YGDQTY+KG+LG 
Sbjct: 125 KCFLQLRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTKGDLGF 184

Query: 184 DTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPS 243
           + ITIGS+SV  VIGCGHESGGGFG  SGVIGL GG+LS+V+QMS+ S +SR+FSYCLP 
Sbjct: 185 EKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPP 244

Query: 244 VSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGNERHAVGNAVVKDN 303
           +    +GKINF +NAVVSGPGVVSTPL    P T Y +TLEAIS+GNERH    +  + N
Sbjct: 245 LLGHANGKINFAQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMA--SAKQGN 304

Query: 304 MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDV----NIPAI 363
           +IIDSGTTL+ +PKE++DGVVSS+ K++ +KRVKDPG+F+ LC+  DG +V     IP I
Sbjct: 305 VIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCF-DDGINVAASSGIPII 364

Query: 364 TAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKT 423
           TAHF+GGA+V L   N F +VA  V+CL LT  + +D FGI GN+AQANFLIGYDLE K 
Sbjct: 365 TAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAKR 424

BLAST of Cp4.1LG01g17040 vs. NCBI nr
Match: gi|778697533|ref|XP_004149005.2| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 492.3 bits (1266), Expect = 8.6e-136
Identity = 258/428 (60.28%), Positives = 313/428 (73.13%), Query Frame = 1

Query: 4   ISIFFSLLL--ISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSIS 63
           IS+FF L+L  ISF  T +  G  GFTT+LFHRDS  SP+   SLSHYDRL NA RRS+S
Sbjct: 5   ISLFFHLILFLISFSQTTIINGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAFRRSLS 64

Query: 64  RADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCK 123
           R+ AL  RAA      ++S I PG GEY+MSVSIGTPPV Y+ IADTGSD  W QC+PC 
Sbjct: 65  RSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCL 124

Query: 124 KCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGT 183
           KCY Q +PIF+P KS+SF HVPC + TC +V    CG QG C+YS  YGD+TYSKG+LG 
Sbjct: 125 KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGF 184

Query: 184 DTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPS 243
           + ITIGS+SV  VIGCGH S GGFG  SGVIGL GG+LS+V+QMS+ S +SR+FSYCLP+
Sbjct: 185 EKITIGSSSVKSVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPT 244

Query: 244 VSSQGSGKINFGKNAVVSGPGVVSTPLGPR---TMYQMTLEAISVGNERHAVGNAVVKDN 303
           + S  +GKINFG+NAVVSGPGVVSTPL  +   T Y +TLEAIS+GNERH       + N
Sbjct: 245 LLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMA--FAKQGN 304

Query: 304 MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYS---SDGHDVNIPAIT 363
           +IIDSGTTL+ +PKE++DGVVSS+ K++ +KRVKDP     LC+    +    + IP IT
Sbjct: 305 VIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVIT 364

Query: 364 AHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKTL 423
           AHF+GGA+V L   N F +VA  V+CL L   + +  FGI GN+AQANFLIGYDLE K L
Sbjct: 365 AHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRL 424

BLAST of Cp4.1LG01g17040 vs. NCBI nr
Match: gi|700198286|gb|KGN53444.1| (hypothetical protein Csa_4G055390 [Cucumis sativus])

HSP 1 Score: 479.9 bits (1234), Expect = 4.4e-132
Identity = 256/428 (59.81%), Positives = 308/428 (71.96%), Query Frame = 1

Query: 4   ISIFFSL--LLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSIS 63
           ISIFF L  LLISF  T +  G  GFTT+LFHRDS  SP+   SLSHYDRL NA RRS+S
Sbjct: 5   ISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLS 64

Query: 64  RADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCK 123
           R+  L  RAA      +++P++PG GEY+MSVSIGTPPV Y+ +ADTGSD  W QC+PC 
Sbjct: 65  RSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL 124

Query: 124 KCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGT 183
           KCY QS+PIFDP KS+SF HVPC S  C+++  + CG QG C+YS  YGD+TYSKG+LG 
Sbjct: 125 KCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDRTYSKGDLGF 184

Query: 184 DTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPS 243
           + ITIGS+SV  VIGCGHESGGGFG  SGVIGL GG    V                LP+
Sbjct: 185 EKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGANPPV----------------LPT 244

Query: 244 VSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGNERHAVGNAVVKDN 303
           + S  +GKINFG+NAVVSGPGVVSTPL    P T Y +TLEAIS+GNERH    +  + N
Sbjct: 245 LLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMA--SAKQGN 304

Query: 304 MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDV----NIPAI 363
           +IIDSGTTLS++PKE++DGVVSS+ K++ +KRVKDPGNF+ LC+  DG +V     IP I
Sbjct: 305 VIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF-DDGINVATSSGIPII 364

Query: 364 TAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKT 423
           TA F+GGA+V L   N F +VA  V+CL LTP + +D FGI GN+A ANFLIGYDLE K 
Sbjct: 365 TAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 413

BLAST of Cp4.1LG01g17040 vs. NCBI nr
Match: gi|778697530|ref|XP_011654342.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 465.3 bits (1196), Expect = 1.1e-127
Identity = 249/428 (58.18%), Positives = 301/428 (70.33%), Query Frame = 1

Query: 4   ISIFFSL--LLISFRFTAVYGGGIGFTTTLFHRDSPQSPIRNQSLSHYDRLNNAIRRSIS 63
           ISIFF L  LLISF  T +  G  GFTT+LFHRDS  SP+   SLSHYDRL NA RRS+S
Sbjct: 5   ISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLS 64

Query: 64  RADALFQRAAALTDNTIESPISPGGGEYVMSVSIGTPPVAYVAIADTGSDPAWTQCMPCK 123
           R+  L  RAA      +++P++PG GEY+MSVSIGTPPV Y+ +ADTGSD  W QC+PC 
Sbjct: 65  RSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL 124

Query: 124 KCYPQSKPIFDPKKSSSFRHVPCTSNTCQSVGGATCGDQGSCNYSIVYGDQTYSKGELGT 183
           KCY QS+PIFDP KS+SF HVPC S  C+++  + CG QG C+YS  YGD+TYSKG+LG 
Sbjct: 125 KCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDRTYSKGDLGF 184

Query: 184 DTITIGSTSVNMVIGCGHESGGGFGTTSGVIGLAGGELSIVTQMSKKSAVSRKFSYCLPS 243
           + ITIGS+SV  VIGCGHESGGGFG  SG                            LP+
Sbjct: 185 EKITIGSSSVKSVIGCGHESGGGFGFASGA-----------------------NPPVLPT 244

Query: 244 VSSQGSGKINFGKNAVVSGPGVVSTPL---GPRTMYQMTLEAISVGNERHAVGNAVVKDN 303
           + S  +GKINFG+NAVVSGPGVVSTPL    P T Y +TLEAIS+GNERH    +  + N
Sbjct: 245 LLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMA--SAKQGN 304

Query: 304 MIIDSGTTLSYIPKEMHDGVVSSMAKIIGSKRVKDPGNFFGLCYSSDGHDV----NIPAI 363
           +IIDSGTTLS++PKE++DGVVSS+ K++ +KRVKDPGNF+ LC+  DG +V     IP I
Sbjct: 305 VIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF-DDGINVATSSGIPII 364

Query: 364 TAHFAGGADVRLGMENMFMRVAGGVSCLMLTPMTASDPFGIWGNIAQANFLIGYDLERKT 423
           TA F+GGA+V L   N F +VA  V+CL LTP + +D FGI GN+A ANFLIGYDLE K 
Sbjct: 365 TAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 406

The following BLAST results are available for this feature:
Match Name    E-value  Identity  Description
CDR1_ARATH    6.7e-88  43.08     Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
ASPR1_ARATH   3.9e-80  41.43     Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S...
NEP1_NEPGR    1.8e-56  34.16     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR    1.8e-56  34.45     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
ASPG2_ARATH   5.5e-50  31.62     Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...

Match Name        E-value   Identity  Description
A0A0A0KV20_CUCSA  6.0e-136  60.28     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1
A0A0A0KX67_CUCSA  3.1e-132  59.81     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055390 PE=3 SV=1
A0A0A0KZZ3_CUCSA  3.9e-111  52.63     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1
M5WRG3_PRUPE      2.0e-99   45.19     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1
M1DUW2_SOLTU      2.4e-92   44.82     Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400044361 PE=3 SV=1

Match Name   E-value  Identity  Description
AT1G64830.1  3.6e-92  46.14     Eukaryotic aspartyl protease family protein
AT5G33340.1  3.8e-89  43.08     Eukaryotic aspartyl protease family protein
AT2G35615.1  2.2e-81  41.43     Eukaryotic aspartyl protease family protein
AT1G31450.1  1.9e-77  40.63     Eukaryotic aspartyl protease family protein
AT2G28010.1  7.2e-64  38.02     Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity  Description
gi|659102476|ref|XP_008452153.1|  1.9e-143  62.47     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|659102474|ref|XP_008452152.1|  2.0e-140  61.45     PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|778697533|ref|XP_004149005.2|  8.6e-136  60.28     PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|700198286|gb|KGN53444.1|       4.4e-132  59.81     hypothetical protein Csa_4G055390 [Cucumis sativus]
gi|778697530|ref|XP_011654342.1|  1.1e-127  58.18     PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: INTERPRO
Term       Definition
IPR021109  Peptidase_aspartic_dom_sf
IPR001969  Aspartic_peptidase_AS
IPR001461  Aspartic_peptidase_A1

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g17040.1  Cp4.1LG01g17040.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                  Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 1..423, score: 8.7E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE                      coord: 299..310, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10  -                                 coord: 259..423, score: 1.8E-32; coord: 86..254, score: 1.7
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases                    coord: 82..422, score: 3.03
None       No IPR available                 PANTHER  PTHR13683:SF298   ASPARTIC PROTEINASE CDR1-RELATED  coord: 1..423, score: 8.7E
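
The alignment coordinates in the InterPro table are residue positions on the predicted protein. As a small sketch that uses only the coordinates listed above, the Python below checks which reported regions fully cover the PROSITE active-site signature at 299..310. The helper names `contains` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# Regions copied from the InterPro table (assumed 1-based, inclusive).
regions = {
    "PANTHER PTHR13683": (1, 423),
    "GENE3D G3DSA:2.40.70.10 (259..423)": (259, 423),
    "GENE3D G3DSA:2.40.70.10 (86..254)": (86, 254),
    "SSF50630": (82, 422),
}
active_site = (299, 310)  # PROSITE PS00141

def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

def span_length(region):
    """Number of residues in an inclusive span."""
    return region[1] - region[0] + 1

covering = [name for name, span in regions.items() if contains(span, active_site)]
print(span_length(active_site))  # 12
for name in covering:
    print(name)
```

With these coordinates, the signature falls inside the PANTHER family match, the SSF50630 match, and the 259..423 Gene3D span, but not the 86..254 Gene3D span.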

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None