CSPI04G06720 (gene) Wild cucumber (PI 183967)

Name: CSPI04G06720
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Eukaryotic aspartyl protease family protein
Location: Chr4: 4708120..4709412 (-)
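
The span implied by the Location field can be worked out with a small sketch. It assumes the coordinates are 1-based and inclusive (the common GFF-style convention, which this page does not state), and the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field: Chr4, 4708120 to 4709412, minus strand.
length_bp = feature_length(4708120, 4709412)
codons = length_bp // 3  # whole codons in the span, counting any stop codon
print(length_bp, codons)
```

Under that assumption the span is 1293 bp, which divides evenly into 431 codons.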
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGCTATCATTTCCATCTTCTTCCACCTAATTCTCTTATTGATCTCCTTCTCTCAAACAACCATTATTAACGGTGATAATGGCTTCACCACCTCTCTCTTCCACCGTGATTCCCTTCTTTCCCCTCTTGAATTTTCATCTCTATCTCATTACGATCGCCTCACCAATGCCTTTCGCCGCTCATTGTCTCGCTCCGCTACTCTCCTCAATCGCGCTGCCACTAATGGTGCTCTTGACCTCCAAGCCCCACTTACCCCAGGAAGTGGCGAGTATCTAATGTCTGTCTCTATTGGAACCCCACCAGTTGATTACATAGGCATGGCTGACACAGGCAGTGATCTGATGTGGGCTCAATGCTTGCCATGTCTGAAATGCTACAAACAATCACGTCCCATTTTCGACCCTCTCAAATCCACATCCTTCAGTCACGTGCCATGCAATTCACAGAATTGTAAAGCTATTGATGATTCCCATTGTGGGGCTCAGGGGGTTTGCGATTACAGTTACACGTACGGAGATCAAACTTACACAAAGGGGGATTTGGGTTTTGAAAAGATCACCATTGGGTCATCTTCTGTGAAATCAGTCATCGGATGTGGCCATGAGAGTGGTGGCGGATTTGGGTTTGCTTCAGGTGTCATTGGACTTGGTGGTGGTCAACTCTCGTTGGTCTCACAAATGAGCCAAACCTCCGGTATCAGCCGTCGATTCTCTTATTGCTTACCAACGTTACTCAGTCACGCAAATGGCAAAATAAACTTTGGCCAAAACGCTGTGGTTTCTGGCCCTGGAGTCGTTTCAACGCCACTGATCTCCAAAAACCCCGTCACTTACTATTACGTCACTTTGGAAGCCATTTCCATTGGCAAAGAACGTCACATGGCCTCTGCCAAACAAGGCAATGTGATTATCGACAGTGGGACGACATTATCGTTTCTTCCAAAGGAGTTGTATGATGGTGTCGTGTCGTCGCTACTCAAGGTTGTTAAAGCAAAGCGAGTGAAGGATCCCGGTAACTTTTGGGATCTATGCTTTGATGATGGCATCAACGTCGCCACCTCCTCTGGTATTCCGATTATCACTGCACAATTTTCCGGTGGTGCTAACGTGAATTTGTTGCCAGTGAATACGTTTCAGAAGGTTGCAAATAATGTGAATTGCTTAACATTAACACCTGCATCACCGACAGACGAATTTGGGATAATTGGAAATTTGGCGCTGGCCAATTTTTTGATCGGATATGATTTGGAGGCTAAGAGATTATCATTCAAGCCAACCGTTTGTACCTAG

mRNA sequence

ATGGTGGCTATCATTTCCATCTTCTTCCACCTAATTCTCTTATTGATCTCCTTCTCTCAAACAACCATTATTAACGGTGATAATGGCTTCACCACCTCTCTCTTCCACCGTGATTCCCTTCTTTCCCCTCTTGAATTTTCATCTCTATCTCATTACGATCGCCTCACCAATGCCTTTCGCCGCTCATTGTCTCGCTCCGCTACTCTCCTCAATCGCGCTGCCACTAATGGTGCTCTTGACCTCCAAGCCCCACTTACCCCAGGAAGTGGCGAGTATCTAATGTCTGTCTCTATTGGAACCCCACCAGTTGATTACATAGGCATGGCTGACACAGGCAGTGATCTGATGTGGGCTCAATGCTTGCCATGTCTGAAATGCTACAAACAATCACGTCCCATTTTCGACCCTCTCAAATCCACATCCTTCAGTCACGTGCCATGCAATTCACAGAATTGTAAAGCTATTGATGATTCCCATTGTGGGGCTCAGGGGGTTTGCGATTACAGTTACACGTACGGAGATCAAACTTACACAAAGGGGGATTTGGGTTTTGAAAAGATCACCATTGGGTCATCTTCTGTGAAATCAGTCATCGGATGTGGCCATGAGAGTGGTGGCGGATTTGGGTTTGCTTCAGGTGTCATTGGACTTGGTGGTGGTCAACTCTCGTTGGTCTCACAAATGAGCCAAACCTCCGGTATCAGCCGTCGATTCTCTTATTGCTTACCAACGTTACTCAGTCACGCAAATGGCAAAATAAACTTTGGCCAAAACGCTGTGGTTTCTGGCCCTGGAGTCGTTTCAACGCCACTGATCTCCAAAAACCCCGTCACTTACTATTACGTCACTTTGGAAGCCATTTCCATTGGCAAAGAACGTCACATGGCCTCTGCCAAACAAGGCAATGTGATTATCGACAGTGGGACGACATTATCGTTTCTTCCAAAGGAGTTGTATGATGGTGTCGTGTCGTCGCTACTCAAGGTTGTTAAAGCAAAGCGAGTGAAGGATCCCGGTAACTTTTGGGATCTATGCTTTGATGATGGCATCAACGTCGCCACCTCCTCTGGTATTCCGATTATCACTGCACAATTTTCCGGTGGTGCTAACGTGAATTTGTTGCCAGTGAATACGTTTCAGAAGGTTGCAAATAATGTGAATTGCTTAACATTAACACCTGCATCACCGACAGACGAATTTGGGATAATTGGAAATTTGGCGCTGGCCAATTTTTTGATCGGATATGATTTGGAGGCTAAGAGATTATCATTCAAGCCAACCGTTTGTACCTAG

Coding sequence (CDS)

ATGGTGGCTATCATTTCCATCTTCTTCCACCTAATTCTCTTATTGATCTCCTTCTCTCAAACAACCATTATTAACGGTGATAATGGCTTCACCACCTCTCTCTTCCACCGTGATTCCCTTCTTTCCCCTCTTGAATTTTCATCTCTATCTCATTACGATCGCCTCACCAATGCCTTTCGCCGCTCATTGTCTCGCTCCGCTACTCTCCTCAATCGCGCTGCCACTAATGGTGCTCTTGACCTCCAAGCCCCACTTACCCCAGGAAGTGGCGAGTATCTAATGTCTGTCTCTATTGGAACCCCACCAGTTGATTACATAGGCATGGCTGACACAGGCAGTGATCTGATGTGGGCTCAATGCTTGCCATGTCTGAAATGCTACAAACAATCACGTCCCATTTTCGACCCTCTCAAATCCACATCCTTCAGTCACGTGCCATGCAATTCACAGAATTGTAAAGCTATTGATGATTCCCATTGTGGGGCTCAGGGGGTTTGCGATTACAGTTACACGTACGGAGATCAAACTTACACAAAGGGGGATTTGGGTTTTGAAAAGATCACCATTGGGTCATCTTCTGTGAAATCAGTCATCGGATGTGGCCATGAGAGTGGTGGCGGATTTGGGTTTGCTTCAGGTGTCATTGGACTTGGTGGTGGTCAACTCTCGTTGGTCTCACAAATGAGCCAAACCTCCGGTATCAGCCGTCGATTCTCTTATTGCTTACCAACGTTACTCAGTCACGCAAATGGCAAAATAAACTTTGGCCAAAACGCTGTGGTTTCTGGCCCTGGAGTCGTTTCAACGCCACTGATCTCCAAAAACCCCGTCACTTACTATTACGTCACTTTGGAAGCCATTTCCATTGGCAAAGAACGTCACATGGCCTCTGCCAAACAAGGCAATGTGATTATCGACAGTGGGACGACATTATCGTTTCTTCCAAAGGAGTTGTATGATGGTGTCGTGTCGTCGCTACTCAAGGTTGTTAAAGCAAAGCGAGTGAAGGATCCCGGTAACTTTTGGGATCTATGCTTTGATGATGGCATCAACGTCGCCACCTCCTCTGGTATTCCGATTATCACTGCACAATTTTCCGGTGGTGCTAACGTGAATTTGTTGCCAGTGAATACGTTTCAGAAGGTTGCAAATAATGTGAATTGCTTAACATTAACACCTGCATCACCGACAGACGAATTTGGGATAATTGGAAATTTGGCGCTGGCCAATTTTTTGATCGGATATGATTTGGAGGCTAAGAGATTATCATTCAAGCCAACCGTTTGTACCTAG
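
A coding sequence like the one above can be given a basic sanity check: start codon, terminal stop codon, length divisible by three, and no in-frame internal stop. The sketch below is illustrative only. The function name `looks_like_complete_cds` is hypothetical, the standard stop codons are assumed, and the toy input joins just the first three and last three codons of the CDS rather than the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)


def looks_like_complete_cds(seq: str) -> bool:
    """Return True if seq starts with ATG, ends with a stop codon,
    has a length divisible by 3, and has no in-frame internal stop."""
    seq = seq.upper()
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] != "ATG" or codons[-1] not in STOP_CODONS:
        return False
    return not any(c in STOP_CODONS for c in codons[1:-1])


# Toy input: the first three and last three codons of the CDS above, joined.
toy = "ATGGTGGCT" + "TGTACCTAG"
print(looks_like_complete_cds(toy))
```

To check the real record, pass the full CDS string in place of `toy`.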
BLAST of CSPI04G06720 vs. Swiss-Prot
Match: ASPR1_ARATH (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 1.3e-86
Identity = 191/451 (42.35%), Positives = 270/451 (59.87%), Query Frame = 1

Query: 11  LILLLISFSQTTIINGD-NGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 70
           L+   + FS T   +G    F+  L HRDS LSP+    ++  DRL  AF RS+SRS   
Sbjct: 6   LLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRF 65

Query: 71  LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQ 130
            ++ +     DLQ+ L    GE+ MS++IGTPP+    +ADTGSDL W QC PC +CYK+
Sbjct: 66  NHQLSQT---DLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKE 125

Query: 131 SRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCG---AQGVCDYSYTYGDQTYTKGDLGFEK 190
           + PIFD  KS+++   PC+S+NC+A+  +  G   +  +C Y Y+YGDQ+++KGD+  E 
Sbjct: 126 NGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATET 185

Query: 191 ITIGSSSVK------SVIGCGHESGGGFG-FASGVIGLGGGQLSLVSQMSQTSGISRRFS 250
           ++I S+S        +V GCG+ +GG F    SG+IGLGGG LSL+SQ+   S IS++FS
Sbjct: 186 VSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFS 245

Query: 251 YCLPTLLSHANGK--INFGQNAVVSG----PGVVSTPLISKNPVTYYYVTLEAISIGKER 310
           YCL    +  NG   IN G N++ S      GVVSTPL+ K P+TYYY+TLEAIS+GK++
Sbjct: 246 YCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKK 305

Query: 311 -------------HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLK-VVKAKRVKDPG 370
                         + S   GN+IIDSGTTL+ L    +D   S++ + V  AKRV DP 
Sbjct: 306 IPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQ 365

Query: 371 NFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDE 430
                CF  G   +   G+P IT  F+ GA+V L P+N F K++ ++ CL++    PT E
Sbjct: 366 GLLSHCFKSG---SAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMV---PTTE 425
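
The identity and positives percentages in an HSP header are simply the match counts divided by the alignment length. A minimal sketch, using the counts reported for the hit above (the helper name `percent` is hypothetical):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Percentage of aligned columns that match, rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts from the HSP header above: 191 identical and 270 positive columns out of 451.
identity_pct = percent(191, 451)
positives_pct = percent(270, 451)
print(identity_pct, positives_pct)
```

The same calculation applies to every hit listed on this page; only the counts change.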

BLAST of CSPI04G06720 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 4.7e-81
Identity = 183/445 (41.12%), Positives = 261/445 (58.65%), Query Frame = 1

Query: 6   SIFFHLILLLISFSQTTIINGDN----GFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRR 65
           S+F  ++L L   S   + N +     GFT  L HRDS  SP      +   RL NA  R
Sbjct: 3   SLFSSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHR 62

Query: 66  SLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCL 125
           S++R   + +    +     Q  LT  SGEYLM+VSIGTPP   + +ADTGSDL+W QC 
Sbjct: 63  SVNR---VFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCA 122

Query: 126 PCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDD-SHCGA-QGVCDYSYTYGDQTYTK 185
           PC  CY Q  P+FDP  S+++  V C+S  C A+++ + C      C YS +YGD +YTK
Sbjct: 123 PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTK 182

Query: 186 GDLGFEKITIGSSSVKS------VIGCGHESGGGFG-FASGVIGLGGGQLSLVSQMSQTS 245
           G++  + +T+GSS  +       +IGCGH + G F    SG++GLGGG +SL+ Q+  + 
Sbjct: 183 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS- 242

Query: 246 GISRRFSYCLPTLLSHAN--GKINFGQNAVVSGPGVVSTPLISK-NPVTYYYVTLEAISI 305
            I  +FSYCL  L S  +   KINFG NA+VSG GVVSTPLI+K +  T+YY+TL++IS+
Sbjct: 243 -IDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISV 302

Query: 306 GKERHMAS-----AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDL 365
           G ++   S     + +GN+IIDSGTTL+ LP E Y  +  ++   + A++ +DP +   L
Sbjct: 303 GSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL 362

Query: 366 CFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIG 425
           C+    +      +P+IT  F  GA+V L   N F +V+ ++ C      SP+  F I G
Sbjct: 363 CY----SATGDLKVPVITMHFD-GADVKLDSSNAFVQVSEDLVCFAFR-GSPS--FSIYG 422

Query: 426 NLALANFLIGYDLEAKRLSFKPTVC 430
           N+A  NFL+GYD  +K +SFKPT C
Sbjct: 423 NVAQMNFLVGYDTVSKTVSFKPTDC 434

BLAST of CSPI04G06720 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 3.4e-55
Identity = 140/394 (35.53%), Positives = 209/394 (53.05%), Query Frame = 1

Query: 48  SLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIG 107
           +L+ Y+ +  A +R   R  ++   A    +  ++ P+  G GEYLM+V+IGTP   +  
Sbjct: 54  NLTKYELIKRAIKRGERRMRSI--NAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSA 113

Query: 108 MADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCD 167
           + DTGSDL+W QC PC +C+ Q  PIF+P  S+SFS +PC SQ C+ +    C     C 
Sbjct: 114 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCN-NNECQ 173

Query: 168 YSYTYGDQTYTKGDLGFEKITIGSSSVKSV-IGCGHESGG-GFGFASGVIGLGGGQLSLV 227
           Y+Y YGD + T+G +  E  T  +SSV ++  GCG ++ G G G  +G+IG+G G LSL 
Sbjct: 174 YTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLP 233

Query: 228 SQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLI--SKNPVTYYYVT 287
           SQ+    G+  +FSYC+ +  S +   +  G  A     G  ST LI  S NP TYYY+T
Sbjct: 234 SQL----GVG-QFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNP-TYYYIT 293

Query: 288 LEAISIGKER--------HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRV 347
           L+ I++G +          +     G +IIDSGTTL++LP++ Y+ V  +    +    V
Sbjct: 294 LQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTV 353

Query: 348 KDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPAS 407
            +  +    CF    + +T   +P I+ QF GG  +NL   N     A  V CL +  +S
Sbjct: 354 DESSSGLSTCFQQPSDGSTVQ-VPEISMQFDGGV-LNLGEQNILISPAEGVICLAMGSSS 413

Query: 408 PTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 430
                 I GN+      + YDL+   +SF PT C
Sbjct: 414 QLG-ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CSPI04G06720 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 4.5e-55
Identity = 142/413 (34.38%), Positives = 208/413 (50.36%), Query Frame = 1

Query: 29  GFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPG 88
           GF   L H DS        +L+ +  L  A  R   R   L   A  NG   ++  +  G
Sbjct: 40  GFQIMLEHVDS------GKNLTKFQLLERAIERGSRRLQRL--EAMLNGPSGVETSVYAG 99

Query: 89  SGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCN 148
            GEYLM++SIGTP   +  + DTGSDL+W QC PC +C+ QS PIF+P  S+SFS +PC+
Sbjct: 100 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 159

Query: 149 SQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSVKSV-IGCGHESGG- 208
           SQ C+A+    C +   C Y+Y YGD + T+G +G E +T GS S+ ++  GCG  + G 
Sbjct: 160 SQLCQALSSPTC-SNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGF 219

Query: 209 GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGV 268
           G G  +G++G+G G LSL SQ+  T     +FSYC+  + S     +  G  A     G 
Sbjct: 220 GQGNGAGLVGMGRGPLSLPSQLDVT-----KFSYCMTPIGSSTPSNLLLGSLANSVTAGS 279

Query: 269 VSTPLISKNPV-TYYYVTLEAISIGKER---------HMASAKQGNVIIDSGTTLSFLPK 328
            +T LI  + + T+YY+TL  +S+G  R           ++   G +IIDSGTTL++   
Sbjct: 280 PNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVN 339

Query: 329 ELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPV 388
             Y  V    +  +    V    + +DLCF    +  ++  IP     F GG ++ L   
Sbjct: 340 NAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSD-PSNLQIPTFVMHFDGG-DLELPSE 399

Query: 389 NTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 430
           N F   +N + CL +   S +    I GN+   N L+ YD     +SF    C
Sbjct: 400 NYFISPSNGLICLAM--GSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CSPI04G06720 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 6.0e-52
Identity = 142/429 (33.10%), Positives = 217/429 (50.58%), Query Frame = 1

Query: 30  FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATLLNRAA-------------TN 89
           +T  L HRD   S    +  +H+ RL    RR   R + +L R +              +
Sbjct: 59  YTLRLLHRDRFPS---VTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVND 118

Query: 90  GALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDP 149
              D+ + +  GSGEY + + +G+PP D   + D+GSD++W QC PC  CYKQS P+FDP
Sbjct: 119 FGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDP 178

Query: 150 LKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSVKS 209
            KS S++ V C S  C  I++S C + G C Y   YGD +YTKG L  E +T   + V++
Sbjct: 179 AKSGSYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTFAKTVVRN 238

Query: 210 V-IGCGHESGGGFGFASGVIGLGGGQLSLVSQMS-QTSGISRRFSYCLPTLLSHANGKIN 269
           V +GCGH + G F  A+G++G+GGG +S V Q+S QT G    F YCL +  + + G + 
Sbjct: 239 VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGG---AFGYCLVSRGTDSTGSLV 298

Query: 270 FGQNAVVSGPGVVSTPLISKNP--VTYYYVTLEAISIGKER--------HMASAKQGNVI 329
           FG+ A+  G   V  PL+ +NP   ++YYV L+ + +G  R         +     G V+
Sbjct: 299 FGREALPVGASWV--PLV-RNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVV 358

Query: 330 IDSGTTLSFLPKELY----DGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 389
           +D+GT ++ LP   Y    DG  S    + +A  V    + +D C+D  ++   S  +P 
Sbjct: 359 MDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGV----SIFDTCYD--LSGFVSVRVPT 418

Query: 390 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 430
           ++  F+ G  + L   N    V ++        ASPT    IIGN+      + +D    
Sbjct: 419 VSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTG-LSIIGNIQQEGIQVSFDGANG 470

BLAST of CSPI04G06720 vs. TrEMBL
Match: A0A0A0KX67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055390 PE=3 SV=1)

HSP 1 Score: 796.6 bits (2056), Expect = 1.5e-227
Identity = 406/430 (94.42%), Positives = 408/430 (94.88%), Query Frame = 1

Query: 1   MVAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
           MVA ISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR
Sbjct: 1   MVATISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60

Query: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120
           RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC
Sbjct: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120

Query: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKG 180
           LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGD+TY+KG
Sbjct: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDRTYSKG 180

Query: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
           DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGG    V               
Sbjct: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGANPPV--------------- 240

Query: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQ 300
            LPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIG ERHMASAKQ
Sbjct: 241 -LPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300

Query: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
           GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360

Query: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
           ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 414

Query: 421 RLSFKPTVCT 431
           RLSFKPTVCT
Sbjct: 421 RLSFKPTVCT 414

BLAST of CSPI04G06720 vs. TrEMBL
Match: A0A0A0KV20_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 2.5e-214
Identity = 376/429 (87.65%), Positives = 397/429 (92.54%), Query Frame = 1

Query: 1   MVAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
           M A IS+FFHLIL LISFSQTTIING+NGFTTSLFHRDSLLSPLEFSSLSHYDRL NAFR
Sbjct: 1   MAATISLFFHLILFLISFSQTTIINGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAFR 60

Query: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120
           RSLSRSA LLNRAAT+GA+ LQ+ + PGSGEYLMSVSIGTPPVDY+G+ADTGSDL WAQC
Sbjct: 61  RSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQC 120

Query: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKG 180
           LPCLKCY+Q RPIF+PLKSTSFSHVPCN+Q C A+DD HCG QGVCDYSYTYGD+TY+KG
Sbjct: 121 LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKG 180

Query: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
           DLGFEKITIGSSSVKSVIGCGH S GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY
Sbjct: 181 DLGFEKITIGSSSVKSVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240

Query: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQ 300
           CLPTLLSHANGKINFG+NAVVSGPGVVSTPLISKN VTYYY+TLEAISIG ERHMA AKQ
Sbjct: 241 CLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 300

Query: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
           GNVIIDSGTTL+ LPKELYDGVVSSLLKVVKAKRVKDP    DLCFDDGIN A S GIP+
Sbjct: 301 GNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPV 360

Query: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
           ITA FSGGANVNLLP+NTF+KVA+NVNCLTL  ASPT EFGIIGNLA ANFLIGYDLEAK
Sbjct: 361 ITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAK 420

Query: 421 RLSFKPTVC 430
           RLSFKPTVC
Sbjct: 421 RLSFKPTVC 429

BLAST of CSPI04G06720 vs. TrEMBL
Match: A0A0A0KZZ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 1.7e-122
Identity = 242/439 (55.13%), Positives = 307/439 (69.93%), Query Frame = 1

Query: 2   VAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRR 61
           +A ISIFF+  LL  S   T    G +GFTTSLF RDS LSPL   SLS YD L +AFRR
Sbjct: 1   MAAISIFFYF-LLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRR 60

Query: 62  SLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCL 121
           S SRSATLL    +     +++P+ P SGE+LMS+ IGTPPV+ I +ADTGSDL W QCL
Sbjct: 61  SFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCL 120

Query: 122 PCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQ-GVCDYSYTYGDQTYTKG 181
           PC +C+ QS+PIF+P +S+S+  V C S  C++++  HCG     C Y Y+YGD+++T G
Sbjct: 121 PCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYG 180

Query: 182 DLGFEKITIGSSSV-KSVIGCGHESGGGF-GFASGVIGLGGGQLSLVSQMSQTSGISRRF 241
           DL  ++ITIGS  + K+VIGCGH++GG F G  SG+IGLGGG LSLVSQM   +G+  RF
Sbjct: 181 DLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRF 240

Query: 242 SYCLPTLLSHAN--GKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMA 301
           SYCLPT  S+AN  G I+FG+ AVVSG  VVSTPL+ ++P T+Y++TLEAIS+GK+R  A
Sbjct: 241 SYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKA 300

Query: 302 S------AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGI 361
           +         GN+IIDSGTTL+ LP+ LY GV S+L +V+KAKRV DP    +LC+  G 
Sbjct: 301 ANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAG- 360

Query: 362 NVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALAN 421
                  IPIITA F+GGA+V LLPVNTF  VA+NV CLT  PA+   +  I GNLA  N
Sbjct: 361 -QVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPAT---QVAIFGNLAQIN 420

Query: 422 FLIGYDLEAKRLSFKPTVC 430
           F +GYDL  KRLSF+P +C
Sbjct: 421 FEVGYDLGNKRLSFEPKLC 433

BLAST of CSPI04G06720 vs. TrEMBL
Match: M5WRG3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 3.6e-104
Identity = 210/455 (46.15%), Positives = 297/455 (65.27%), Query Frame = 1

Query: 7   IFFHLILL--LISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLS 66
           ++F L LL   I  +Q +     +GFT  L HRDS LSPL  SS+SH DRL NAFRRS++
Sbjct: 12  LYFPLALLACFILLAQAS----SHGFTADLIHRDSPLSPLYNSSMSHLDRLHNAFRRSVT 71

Query: 67  R-----SATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQ 126
           R       T+ + +++  A ++Q+ + P +GEYLM+VSIGTPPV+ +G+ADTGSDL+W Q
Sbjct: 72  RVHHFIKPTMTSLSSSLAAPNIQSIIIPSAGEYLMNVSIGTPPVEVLGIADTGSDLIWTQ 131

Query: 127 CLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGA-----QGVCDYSYTYGD 186
           C PC +C+ Q+ P+FDP KS+++  +PC S +C  ++++ CG         C+YSY YGD
Sbjct: 132 CKPCKQCFNQNPPLFDPKKSSTYHSIPCQSSSCTYLEEAACGTLINGDHDTCEYSYRYGD 191

Query: 187 QTYTKGDLGFEKITIGSSS------VKSVIGCGHESGGGFG-FASGVIGLGGGQLSLVSQ 246
           +++T+G L  E +T GS+S       K V GCGHE+GG F    SG+IGLGGG LSL+SQ
Sbjct: 192 RSFTRGTLALETLTFGSTSGRPTSLPKVVFGCGHENGGTFDESGSGLIGLGGGPLSLISQ 251

Query: 247 MSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAI 306
           +++ +    +FSYCL    + A  KI+FG   +VSG G VSTPL++KNP T+YY+TLEAI
Sbjct: 252 LTKLTN-GGKFSYCLLPTANTAASKISFGSAGIVSGSGAVSTPLVAKNPDTFYYLTLEAI 311

Query: 307 SIGK------------ERHMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRV 366
           S+G+            E+   +A +GN+IIDSGTTL+ LP   +D +VS+L   + A+RV
Sbjct: 312 SVGEKRLAYKTKSPDCEKAAVAANEGNIIIDSGTTLTLLPPGFHDDLVSALETAINAERV 371

Query: 367 KDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPAS 426
            DP     LCF    + +   G+P+IT  FSGGA+V L  +NTF ++ +++ C T+ P+S
Sbjct: 372 SDPRGILSLCFK---SKSDDIGVPVITVHFSGGADVKLQALNTFARMDDDMICFTMIPSS 431

Query: 427 PTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 431
              +  I GNLA  NFL+GYDLE + +SFKPT CT
Sbjct: 432 ---DVAIFGNLAQMNFLVGYDLEERSVSFKPTDCT 455

BLAST of CSPI04G06720 vs. TrEMBL
Match: W9SK79_9ROSA (Putative aspartic protease OS=Morus notabilis GN=L484_022741 PE=3 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 1.8e-95
Identity = 207/453 (45.70%), Positives = 293/453 (64.68%), Query Frame = 1

Query: 7   IFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLS--HYDRLTNAFRRSLS 66
           + F  +++L  FS  T      GF   L  RDS  SP  ++ L+  ++DRL +AF RS S
Sbjct: 13  VLFGCLIMLNDFSLPTEAL-TRGFIIDLIQRDSPFSPA-YNPLAADNFDRLRSAFGRSFS 72

Query: 67  R------SATLLN-RAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMW 126
           R        TLL+  ++++ ++ +Q+ + P  GEYLM+VS+GTPPV  +G+ADTGSDLMW
Sbjct: 73  RVDRLYKPTTLLSFSSSSSSSIPIQSKIIPSEGEYLMNVSLGTPPVPVLGIADTGSDLMW 132

Query: 127 AQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQG-----VCDYSYTY 186
            QC PC +C+KQ+ P+F+P KS+++ ++ C S+ C  + +S C A        C+Y Y+Y
Sbjct: 133 TQCKPCTQCFKQNPPMFNPNKSSTYRNIACESKPCSELLESSCDAAAERGGDTCEYRYSY 192

Query: 187 GDQTYTKGDLGFEKITIGSSSV-KSVIGCGHESGGGFG-FASGVIGLGGGQLSLVSQMSQ 246
           GD ++TKG+L  + +TIGS+S+ K + GCG E+GG F    SG+IGLGGG LSLVSQ+ +
Sbjct: 193 GDHSFTKGNLASDTLTIGSTSLPKIIFGCGRENGGTFDESGSGLIGLGGGPLSLVSQLGK 252

Query: 247 TSGISRRFSYCLPTLLS--HANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAIS 306
           +  I  +FSYCL  L S  +   KI+FG+  +VSGP VVSTPL++K P T+YY+TLEAIS
Sbjct: 253 S--IGGKFSYCLVPLTSEPYVTSKISFGRAGIVSGPSVVSTPLVAKEPNTFYYLTLEAIS 312

Query: 307 IGKER-----------HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKD 366
           +GK+R              +  +GN+IIDSGTTL+FLP   +D +VS+L + V A+RV D
Sbjct: 313 VGKKRLVYYHENHNQSKALAGNEGNIIIDSGTTLTFLPVGFHDDLVSALAEAVDAERVSD 372

Query: 367 PGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPT 426
           P     LCF       + +  PIITA FS GA+V L P+NTF KV +++ C T+    P+
Sbjct: 373 PKGVLSLCFRAEKESESLASAPIITAHFS-GADVVLQPMNTFAKVEDDLFCFTMI---PS 432

Query: 427 DEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 431
           ++  I GNLA  NFL+GYDLE+  +SFKPT CT
Sbjct: 433 NDVAIFGNLAQMNFLVGYDLESGIVSFKPTDCT 457

BLAST of CSPI04G06720 vs. TAIR10
Match: AT1G64830.1 (AT1G64830.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 329.7 bits (844), Expect = 2.7e-90
Identity = 189/440 (42.95%), Positives = 268/440 (60.91%), Query Frame = 1

Query: 6   SIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSR 65
           S+ F  +L L+  S       D GFT  L HRDS  SP   S+ +   R+ NA RRS   
Sbjct: 3   SLIFATLLSLLLLSNVNAYPKD-GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARS 62

Query: 66  SATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLK 125
           +    N  A+  +   Q+ +T   GEYLM++SIGTPPV  + +ADTGSDL+W QC PC  
Sbjct: 63  TLQFSNDDASPNSP--QSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCED 122

Query: 126 CYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGA-QGVCDYSYTYGDQTYTKGDLGF 185
           CY+Q+ P+FDP +S+++  V C+S  C+A++D+ C   +  C Y+ TYGD +YTKGD+  
Sbjct: 123 CYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAV 182

Query: 186 EKITIGSSSVKSV------IGCGHESGGGFGFA-SGVIGLGGGQLSLVSQMSQTSGISRR 245
           + +T+GSS  + V      IGCGHE+ G F  A SG+IGLGGG  SLVSQ+ ++  I+ +
Sbjct: 183 DTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKS--INGK 242

Query: 246 FSYCLPTLLSHA--NGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHM 305
           FSYCL    S      KINFG N +VSG GVVST ++ K+P TYY++ LEAIS+G ++  
Sbjct: 243 FSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQ 302

Query: 306 ASAK-----QGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGI 365
            ++      +GN++IDSGTTL+ LP   Y  + S +   +KA+RV+DP     LC+ D  
Sbjct: 303 FTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRD-- 362

Query: 366 NVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALAN 425
             ++S  +P IT  F GG +V L  +NTF  V+ +V+C      +  ++  I GNLA  N
Sbjct: 363 --SSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAF---AANEQLTIFGNLAQMN 422

Query: 426 FLIGYDLEAKRLSFKPTVCT 431
           FL+GYD  +  +SFK T C+
Sbjct: 423 FLVGYDTVSGTVSFKKTDCS 429

BLAST of CSPI04G06720 vs. TAIR10
Match: AT2G35615.1 (AT2G35615.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 321.6 bits (823), Expect = 7.3e-88
Identity = 191/451 (42.35%), Positives = 270/451 (59.87%), Query Frame = 1

Query: 11  LILLLISFSQTTIINGD-NGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 70
           L+   + FS T   +G    F+  L HRDS LSP+    ++  DRL  AF RS+SRS   
Sbjct: 6   LLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRF 65

Query: 71  LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQ 130
            ++ +     DLQ+ L    GE+ MS++IGTPP+    +ADTGSDL W QC PC +CYK+
Sbjct: 66  NHQLSQT---DLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKE 125

Query: 131 SRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCG---AQGVCDYSYTYGDQTYTKGDLGFEK 190
           + PIFD  KS+++   PC+S+NC+A+  +  G   +  +C Y Y+YGDQ+++KGD+  E 
Sbjct: 126 NGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATET 185

Query: 191 ITIGSSSVK------SVIGCGHESGGGFG-FASGVIGLGGGQLSLVSQMSQTSGISRRFS 250
           ++I S+S        +V GCG+ +GG F    SG+IGLGGG LSL+SQ+   S IS++FS
Sbjct: 186 VSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFS 245

Query: 251 YCLPTLLSHANGK--INFGQNAVVSG----PGVVSTPLISKNPVTYYYVTLEAISIGKER 310
           YCL    +  NG   IN G N++ S      GVVSTPL+ K P+TYYY+TLEAIS+GK++
Sbjct: 246 YCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKK 305

Query: 311 -------------HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLK-VVKAKRVKDPG 370
                         + S   GN+IIDSGTTL+ L    +D   S++ + V  AKRV DP 
Sbjct: 306 IPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQ 365

Query: 371 NFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDE 430
                CF  G   +   G+P IT  F+ GA+V L P+N F K++ ++ CL++    PT E
Sbjct: 366 GLLSHCFKSG---SAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMV---PTTE 425

BLAST of CSPI04G06720 vs. TAIR10
Match: AT1G31450.1 (AT1G31450.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 305.1 bits (780), Expect = 7.0e-83
Identity = 191/453 (42.16%), Positives = 260/453 (57.40%), Query Frame = 1

Query: 8   FFHLILLLISF--SQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSR 67
           F +  LL ISF  +  +  N +N  T  L HRDS  SPL     +  DRL  AF RS+SR
Sbjct: 6   FLYCSLLAISFFFASNSSANREN-LTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISR 65

Query: 68  SATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLK 127
           S     +       DLQ+ L    GEY MS+SIGTPP     +ADTGSDL W QC PC +
Sbjct: 66  SRRFTTKT------DLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQ 125

Query: 128 CYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCG---AQGVCDYSYTYGDQTYTKGDL 187
           CYKQ+ P+FD  KS+++    C+S+ C+A+ +   G   ++ +C Y Y+YGD ++TKGD+
Sbjct: 126 CYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDV 185

Query: 188 GFEKITIGSSSVKS------VIGCGHESGGGF-GFASGVIGLGGGQLSLVSQMSQTSGIS 247
             E I+I SSS  S      V GCG+ +GG F    SG+IGLGGG LSLVSQ+   S I 
Sbjct: 186 ATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLG--SSIG 245

Query: 248 RRFSYCLPTLLSHANGK--INFGQNAVVSGP----GVVSTPLISKNPVTYYYVTLEAISI 307
           ++FSYCL    +  NG   IN G N++ S P      ++TPLI K+P TYY++TLEA+++
Sbjct: 246 KKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTV 305

Query: 308 GKER-----------HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLK-VVKAKRVKD 367
           GK +             +S + GN+IIDSGTTL+ L    YD   +++ + V  AKRV D
Sbjct: 306 GKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSD 365

Query: 368 PGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPT 427
           P      CF  G       G+P IT  F+  A+V L P+N F K+  +  CL++    PT
Sbjct: 366 PQGLLTHCFKSG---DKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMI---PT 425

Query: 428 DEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 431
            E  I GN+   +FL+GYDLE K +SF+   C+
Sbjct: 426 TEVAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of CSPI04G06720 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 303.1 bits (775), Expect = 2.7e-82
Identity = 183/445 (41.12%), Positives = 261/445 (58.65%), Query Frame = 1

Query: 6   SIFFHLILLLISFSQTTIINGDN----GFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRR 65
           S+F  ++L L   S   + N +     GFT  L HRDS  SP      +   RL NA  R
Sbjct: 3   SLFSSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHR 62

Query: 66  SLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCL 125
           S++R   + +    +     Q  LT  SGEYLM+VSIGTPP   + +ADTGSDL+W QC 
Sbjct: 63  SVNR---VFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCA 122

Query: 126 PCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDD-SHCGA-QGVCDYSYTYGDQTYTK 185
           PC  CY Q  P+FDP  S+++  V C+S  C A+++ + C      C YS +YGD +YTK
Sbjct: 123 PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTK 182

Query: 186 GDLGFEKITIGSSSVKS------VIGCGHESGGGFG-FASGVIGLGGGQLSLVSQMSQTS 245
           G++  + +T+GSS  +       +IGCGH + G F    SG++GLGGG +SL+ Q+  + 
Sbjct: 183 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDS- 242

Query: 246 GISRRFSYCLPTLLSHAN--GKINFGQNAVVSGPGVVSTPLISK-NPVTYYYVTLEAISI 305
            I  +FSYCL  L S  +   KINFG NA+VSG GVVSTPLI+K +  T+YY+TL++IS+
Sbjct: 243 -IDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISV 302

Query: 306 GKERHMAS-----AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDL 365
           G ++   S     + +GN+IIDSGTTL+ LP E Y  +  ++   + A++ +DP +   L
Sbjct: 303 GSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL 362

Query: 366 CFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIG 425
           C+    +      +P+IT  F  GA+V L   N F +V+ ++ C      SP+  F I G
Sbjct: 363 CY----SATGDLKVPVITMHFD-GADVKLDSSNAFVQVSEDLVCFAFR-GSPS--FSIYG 422

Query: 426 NLALANFLIGYDLEAKRLSFKPTVC 430
           N+A  NFL+GYD  +K +SFKPT C
Sbjct: 423 NVAQMNFLVGYDTVSKTVSFKPTDC 434

BLAST of CSPI04G06720 vs. TAIR10
Match: AT2G28010.1 (AT2G28010.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 215.3 bits (547), Expect = 7.3e-56
Identity = 155/441 (35.15%), Positives = 217/441 (49.21%), Query Frame = 1

Query: 4   IISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSL 63
           II +F  + L    F  TT  +  +GFT  L HR S  S           R++N    S 
Sbjct: 7   IIVLFLQISLC---FLFTTTASPPHGFTMDLIHRRSNAS----------SRVSNTQSGSS 66

Query: 64  SRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC 123
             + T+ + +                  YLM + +GTPP +   + DTGS++ W QCLPC
Sbjct: 67  PYANTVFDNSV-----------------YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPC 126

Query: 124 LKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLG 183
           + CY+Q+ PIFDP KS++F    C+  +C               Y   Y D TYT G L 
Sbjct: 127 VHCYEQNAPIFDPSKSSTFKEKRCDGHSCP--------------YEVDYFDHTYTMGTLA 186

Query: 184 FEKITIGSSS------VKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMS-QTSGISR 243
            E IT+ S+S       +++IGCGH +       SG++GL  G  SL++QM  +  G+  
Sbjct: 187 TETITLHSTSGEPFVMPETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGL-- 246

Query: 244 RFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVT-YYYVTLEAISIGKERHM 303
             SYC          KINFG NA+V+G GVVST +        +YY+ L+A+S+G  R  
Sbjct: 247 -MSYCFS---GQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIE 306

Query: 304 AS-----AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGI 363
                  A +GN++IDSGTTL++ P    + V  ++  VV A R  DP     LC+    
Sbjct: 307 TMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCY---- 366

Query: 364 NVATSSGIPIITAQFSGGANVNLLPVNTFQKVAN-NVNCLTLTPASPTDEFGIIGNLALA 423
           N  T    P+IT  FSGG ++ L   N + +  N  V CL +   SPT E  I GN A  
Sbjct: 367 NSDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE-AIFGNRAQN 392

Query: 424 NFLIGYDLEAKRLSFKPTVCT 431
           NFL+GYD  +  +SF PT C+
Sbjct: 427 NFLVGYDSSSLLVSFSPTNCS 392

BLAST of CSPI04G06720 vs. NCBI nr
Match: gi|700198286|gb|KGN53444.1| (hypothetical protein Csa_4G055390 [Cucumis sativus])

HSP 1 Score: 796.6 bits (2056), Expect = 2.2e-227
Identity = 406/430 (94.42%), Positives = 408/430 (94.88%), Query Frame = 1

Query: 1   MVAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
           MVA ISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR
Sbjct: 1   MVATISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60

Query: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120
           RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC
Sbjct: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120

Query: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKG 180
           LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGD+TY+KG
Sbjct: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDRTYSKG 180

Query: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
           DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGG    V               
Sbjct: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGANPPV--------------- 240

Query: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQ 300
            LPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIG ERHMASAKQ
Sbjct: 241 -LPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300

Query: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
           GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360

Query: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
           ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 414

Query: 421 RLSFKPTVCT 431
           RLSFKPTVCT
Sbjct: 421 RLSFKPTVCT 414

BLAST of CSPI04G06720 vs. NCBI nr
Match: gi|659102476|ref|XP_008452153.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 791.2 bits (2042), Expect = 9.2e-226
Identity = 394/425 (92.71%), Positives = 409/425 (96.24%), Query Frame = 1

Query: 6   SIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSR 65
           SIF  LIL LISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRL+NAFRRSLSR
Sbjct: 4   SIFCRLILFLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLSNAFRRSLSR 63

Query: 66  SATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLK 125
           SA LLNRAAT+GA+ LQ+P+ PGSGEYLMSVSIGTPPVDYIG+ADTGSDL WAQCLPC+K
Sbjct: 64  SAALLNRAATSGAVGLQSPIAPGSGEYLMSVSIGTPPVDYIGLADTGSDLTWAQCLPCVK 123

Query: 126 CYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFE 185
           C+KQSRPIF+PLKSTSFSHVPCNSQ C+AIDD+HCG QGVCDYSYTYGDQTYTKGDLG E
Sbjct: 124 CFKQSRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTKGDLGLE 183

Query: 186 KITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTL 245
           KITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTL
Sbjct: 184 KITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTL 243

Query: 246 LSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQGNVII 305
           LSHANGKINFGQNAVVSGPGVVSTPLISK+PVTYYY+TLEAISIG ERHMASAKQGNVII
Sbjct: 244 LSHANGKINFGQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMASAKQGNVII 303

Query: 306 DSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQF 365
           DSGTTL+ LPKELYDGVVSSLLKVVKAKRVKDPG+FWDLCFDDGINVA SSGIPIITA F
Sbjct: 304 DSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSGIPIITAHF 363

Query: 366 SGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFK 425
           SGGANVNLLPVNTFQKVANNVNCLTLT ASPTDEFGIIGNLA ANFLIGYDLEAKRLSFK
Sbjct: 364 SGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAKRLSFK 423

Query: 426 PTVCT 431
           PTVCT
Sbjct: 424 PTVCT 428

BLAST of CSPI04G06720 vs. NCBI nr
Match: gi|778697530|ref|XP_011654342.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 780.4 bits (2014), Expect = 1.6e-222
Identity = 398/430 (92.56%), Positives = 400/430 (93.02%), Query Frame = 1

Query: 1   MVAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
           MVA ISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR
Sbjct: 1   MVATISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60

Query: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120
           RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC
Sbjct: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120

Query: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKG 180
           LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGD+TY+KG
Sbjct: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDRTYSKG 180

Query: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
           DLGFEKITIGSSSVKSVIGCGHESGGGFGFASG                           
Sbjct: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGA-----------------------NPP 240

Query: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQ 300
            LPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIG ERHMASAKQ
Sbjct: 241 VLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300

Query: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
           GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360

Query: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
           ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 407

Query: 421 RLSFKPTVCT 431
           RLSFKPTVCT
Sbjct: 421 RLSFKPTVCT 407

BLAST of CSPI04G06720 vs. NCBI nr
Match: gi|659102474|ref|XP_008452152.1| (PREDICTED: probable aspartic protease At2g35615 [Cucumis melo])

HSP 1 Score: 779.6 bits (2012), Expect = 2.8e-222
Identity = 391/430 (90.93%), Positives = 405/430 (94.19%), Query Frame = 1

Query: 1   MVAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
           M A ISIFF L LLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFS+LSHYDRL+NAFR
Sbjct: 1   MAATISIFFLLFLLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSTLSHYDRLSNAFR 60

Query: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120
           RSLSRSA LLNR AT+GA+ LQ+P+ PGSGEYLM VSIGTPPVDYIGM DTGSDL WAQC
Sbjct: 61  RSLSRSAALLNRTATSGAVGLQSPIAPGSGEYLMYVSIGTPPVDYIGMIDTGSDLTWAQC 120

Query: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKG 180
           LPC KC+ Q RPIF+PLKSTSFSHVPCNSQ C+AIDD+HCG QGVCDYSYTYGDQTYTKG
Sbjct: 121 LPCRKCFLQLRPIFNPLKSTSFSHVPCNSQICQAIDDAHCGVQGVCDYSYTYGDQTYTKG 180

Query: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
           DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY
Sbjct: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240

Query: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQ 300
           CLP LL HANGKINF QNAVVSGPGVVSTPLISK+PVTYYY+TLEAISIG ERHMASAKQ
Sbjct: 241 CLPPLLGHANGKINFAQNAVVSGPGVVSTPLISKDPVTYYYITLEAISIGNERHMASAKQ 300

Query: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
           GNVIIDSGTTL+ LPKELYDGVVSSLLKVVKAKRVKDPG+FWDLCFDDGINVA SSGIPI
Sbjct: 301 GNVIIDSGTTLTVLPKELYDGVVSSLLKVVKAKRVKDPGSFWDLCFDDGINVAASSGIPI 360

Query: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
           ITA FSGGANVNLLPVNTFQKVANNVNCLTLT ASPTDEFGIIGNLA ANFLIGYDLEAK
Sbjct: 361 ITAHFSGGANVNLLPVNTFQKVANNVNCLTLTAASPTDEFGIIGNLAQANFLIGYDLEAK 420

Query: 421 RLSFKPTVCT 431
           RLSFKPTVCT
Sbjct: 421 RLSFKPTVCT 430

BLAST of CSPI04G06720 vs. NCBI nr
Match: gi|778697533|ref|XP_004149005.2| (PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus])

HSP 1 Score: 752.7 bits (1942), Expect = 3.6e-214
Identity = 376/429 (87.65%), Positives = 397/429 (92.54%), Query Frame = 1

Query: 1   MVAIISIFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFR 60
           M A IS+FFHLIL LISFSQTTIING+NGFTTSLFHRDSLLSPLEFSSLSHYDRL NAFR
Sbjct: 1   MAATISLFFHLILFLISFSQTTIINGNNGFTTSLFHRDSLLSPLEFSSLSHYDRLANAFR 60

Query: 61  RSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC 120
           RSLSRSA LLNRAAT+GA+ LQ+ + PGSGEYLMSVSIGTPPVDY+G+ADTGSDL WAQC
Sbjct: 61  RSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQC 120

Query: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKG 180
           LPCLKCY+Q RPIF+PLKSTSFSHVPCN+Q C A+DD HCG QGVCDYSYTYGD+TY+KG
Sbjct: 121 LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKG 180

Query: 181 DLGFEKITIGSSSVKSVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
           DLGFEKITIGSSSVKSVIGCGH S GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY
Sbjct: 181 DLGFEKITIGSSSVKSVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240

Query: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGKERHMASAKQ 300
           CLPTLLSHANGKINFG+NAVVSGPGVVSTPLISKN VTYYY+TLEAISIG ERHMA AKQ
Sbjct: 241 CLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 300

Query: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
           GNVIIDSGTTL+ LPKELYDGVVSSLLKVVKAKRVKDP    DLCFDDGIN A S GIP+
Sbjct: 301 GNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPV 360

Query: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
           ITA FSGGANVNLLP+NTF+KVA+NVNCLTL  ASPT EFGIIGNLA ANFLIGYDLEAK
Sbjct: 361 ITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAK 420

Query: 421 RLSFKPTVC 430
           RLSFKPTVC
Sbjct: 421 RLSFKPTVC 429
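
The percentages on each HSP statistics line follow directly from the residue counts printed next to them (for example, 155/441 gives 35.15%). As a minimal sketch, assuming the line layout shown in the alignments above, the following Python snippet recomputes them; the names HSP_STATS and hsp_percentages are hypothetical helpers introduced here for illustration and are not part of the database or of BLAST.

```python
import re

# Matches the statistics line of an HSP as printed in the alignments above.
# "Posi?tives" tolerates both the "Positives" and "Postives" spellings that
# occur in exported BLAST reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


if __name__ == "__main__":
    example = "Identity = 155/441 (35.15%), Positives = 217/441 (49.21%), Query Frame = 1"
    print(hsp_percentages(example))
```

Recomputing the values this way is a quick consistency check between the alignment headers and the Identity column of the summary tables.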

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
ASPR1_ARATH	1.3e-86	42.35	Probable aspartic protease At2g35615 OS=Arabidopsis thaliana GN=At2g35615 PE=3 S...
CDR1_ARATH	4.7e-81	41.12	Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
NEP2_NEPGR	3.4e-55	35.53	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
NEP1_NEPGR	4.5e-55	34.38	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
ASPG2_ARATH	6.0e-52	33.10	Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...

Match Name	E-value	Identity	Description
A0A0A0KX67_CUCSA	1.5e-227	94.42	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055390 PE=3 SV=1
A0A0A0KV20_CUCSA	2.5e-214	87.65	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055400 PE=3 SV=1
A0A0A0KZZ3_CUCSA	1.7e-122	55.13	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055410 PE=3 SV=1
M5WRG3_PRUPE	3.6e-104	46.15	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025167mg PE=3 SV=1
W9SK79_9ROSA	1.8e-95	45.70	Putative aspartic protease OS=Morus notabilis GN=L484_022741 PE=3 SV=1

Match Name	E-value	Identity	Description
AT1G64830.1	2.7e-90	42.95	Eukaryotic aspartyl protease family protein
AT2G35615.1	7.3e-88	42.35	Eukaryotic aspartyl protease family protein
AT1G31450.1	7.0e-83	42.16	Eukaryotic aspartyl protease family protein
AT5G33340.1	2.7e-82	41.12	Eukaryotic aspartyl protease family protein
AT2G28010.1	7.3e-56	35.15	Eukaryotic aspartyl protease family protein

Match Name	E-value	Identity	Description
gi|700198286|gb|KGN53444.1|	2.2e-227	94.42	hypothetical protein Csa_4G055390 [Cucumis sativus]
gi|659102476|ref|XP_008452153.1|	9.2e-226	92.71	PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|778697530|ref|XP_011654342.1|	1.6e-222	92.56	PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
gi|659102474|ref|XP_008452152.1|	2.8e-222	90.93	PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]
gi|778697533|ref|XP_004149005.2|	3.6e-214	87.65	PREDICTED: probable aspartic protease At2g35615 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001461	Aspartic_peptidase_A1
IPR001969	Aspartic_peptidase_AS
IPR021109	Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0004190	aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term	Definition
GO:0006508	proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI04G06720.1	CSPI04G06720.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 303..314 score: 7.9E-6; coord: 401..416 score: 7.9E-6; coord: 98..118 score: 7.
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 4..430 score: 2.9E
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 107..118 score: -; coord: 303..314 scor
IPR021109	Aspartic peptidase domain	GENE3D	G3DSA:2.40.70.10		coord: 261..430 score: 4.9E-33; coord: 88..257 score: 3.1
IPR021109	Aspartic peptidase domain	unknown	SSF50630	Acid proteases	coord: 86..429 score: 2.38
None	No IPR available	PANTHER	PTHR13683:SF298	ASPARTIC PROTEINASE CDR1-RELATED	coord: 4..430 score: 2.9E