Cp4.1LG12g07170 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG12g07170
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Eukaryotic aspartyl protease family protein
Location: Cp4.1LG12 : 7100211 .. 7102823 (-)
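
The location field packs the linkage group, the start and end coordinates, and the strand into one string. A minimal sketch of splitting it apart is given here. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function name `parse_location` is hypothetical.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "Cp4.1LG12 : 7100211 .. 7102823 (-)".
_LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the feature, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG12 : 7100211 .. 7102823 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the feature spans 2613 positions on the minus strand of Cp4.1LG12.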
The following sequences are available for this feature:

Gene sequence (with intron)

TGATGGGCCCAAAACAAAGAAGCCAGCCCTTCATCTTTTGTTCCTATGCCAACCTCCAATTCCTCGTTTCCTTCAATTTTCTCTTCTCCATCTTTCCTCTCTTATCTCAGCCATGGAGATTTCTAAATCACTCTGTTTTTTCCTTCTTCTTCTTCTTCTTCTGTTCTTCGTCGACCAAGCTCGTTCGGCCGCCATTAATGGAGATTCTGAGAAGCTTCATCGTCTTCTTCATCTTCAGAAACTTCCATGGAAGCAGCAGGAAGAAGCTGTTATTAACTGCATCTTTCAGAAGCCAAGTTAGTGTTCTAAAATAACCCTCTCTTTTCTTTTTTTTTTTTACTTTTTTTTTTTCATGTTTTATTTGAATTAAAAGCTTAATAAAATGATAAAATGTTATGCTAATGTTATTAAAATGATGTCATGACTGAAATATTTAATTTTCGTTGATATACTTGCATATTATTAATATTTAGCAAATAACTTTAACGGTGAAATTCCAAAATTGCCCTTGATTAGTGACTTGGATGGGTTGCATTGAAGGTTATTTTGGTAATTAAGAAAAAAAAATGAGAGGAATGCAAGCATAGGCATAAGCCTCTCTGTTTCTTCTCCATATGACTGTCTATTTTAAGTATTATATATATATATATATTTATTATTATTATTATTATTTTTTGAATTTGAGCTATTTTAAGTATAATTATCGGAAATTATTTGTTGATATTAAACAAGGAAACCTCTTTTTATTATTATTATTATTATTATCTTTTTTTTTGACACATCATTTTCTATTTTAATTAATTAATTTATTTAGTCTTTAAACTTTACTAACTGGTATTATATTAGGTAATTTTAGTCTTAAATTTTAATGTTTTTAAAATTACGGATCTACAAAACCTAATAGTGAAAGTTGAAGATTATATTAGATAAAAGAATTAAATTTATGTAATAAATTAAGTAATATTTTTTTAAAAATTAAATTTGAAGATTAAAAAAAATATATAACAGATATAAAATTAAAGGTTTGAGAAATCTAAATTTAAAAATAGAAAATATATTTAAAATTACGTTTTTGTAATTTTTTTAATTGGAATTGTTTAAAATAATTTATGGGATTTGAATTTCAGGAGTGAGGGAGGGAATAACGACATTGGAAATGAAAGAAAGAGACTATTGTTCAGGCAAAGTCACGGACTGGCAAAAGAATCTCCAAAACCGCCTAATCTCCGACGCCATTCACGTCCAATCTCTTCAATCACGGATCAAATCCGCCATTTTTTCCGGCGACACCCACCAAATCTCCGACTCCCAAATTCCCTTGTCCTCCGGCACCAGGCTCCAAACCCTCAATTACATCGTCACCGTCGCCCTCGGCGGCCGGGATTCAACTCTCATTGTCGATACCGGCAGCGACCTCACTTGGGTCCAATGCCGCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCGATCCCTCAAATTCCTCTTCGTTCCTTTCCCTTTCTTGCAATTCCCCCACCTGTTTGGCTCTTCCTTCCGCCACCGGAAATTCCGGCCTCTGTGGGTATGGAAATTCAAGCTCCTGCGGTTACGAGATTAACTACGGCGATGGGTCTTATTCCCGCGGGGAACTTGGATTTGAGAGGCTGAATTTAGGGGAAATCGTTATCGATAATTTCATATTTGGGTGTGGCCGGAATAACAAGGGGTTGTTCGGCGGCGCTTCGGGATTAATGGGTTTAGGTCGGAGTAAATTATCTCTGGTTTCTCAAACTTCCTCTGTTTTTGACGGAATTTTTTCCTACTGTTTGCCTTCAACCGGCGCCGGAGCTTCAGGTTCTTTAACAATGGGCGGTGGCGATTTCTCAAATTTCAGAAACGTTTCTCCAATTTCCTACACAAGAATGGTCTCAAATCCTCAGATGCCAAATTTTTACTTTCTGAATCTCACAGGAATTACCATCGGTGGGGTAAATCTGGGTGTGCCAAACAACG
GGGCTTTGAGTTTAATCGATTCAGGGACTGTAATTACCAGATTGACTCCATCGATTTACAGAGCTTTCAAAGCGGAATTTGAGAAGCAATTTTCTGGGTTTCAAACAGCGCCAGGATTCTCGATTTTGAACACTTGTTTTAATCTCACTGGGTTTAAAGAAGTGAATATTCCCACGGTGAAATTTTACTTCGAAGGGAATGCAGAAATGACTGTGGATGTTGAAGGGGTTTTTTACTTTGTGAAATCAGATGCTTCTCAGATCTGTTTGGCGTTTGCGAGTTTGGGTTATGAAGATCAGAGTATGATAATCGGGAATTACCAGCAGAAGAATCAGAGGGTTGTTTATAATTCCAAGGAATCCACGTTGGGTTTTGCAGCGGAGCCTTGCGGTTTCTAGCCCAGAGAGGATTTTCCGGGAAAGTGGAACGGATTTTCCCGGAAAATTTTGTTTATTACAGACAGGGTGGGGAATTGTCTCGTTTCCTCTCAGTAACAAAATAAAAAAGGGTGAGAATTTGGATTTGTATATTTCATTTTTTGTTTGTCTTGTAATTTGTACATTTCATTTCAAATATTGCAATTAATTCATCCGCCTATTTTCAAATTCCATTT

mRNA sequence

TGATGGGCCCAAAACAAAGAAGCCAGCCCTTCATCTTTTGTTCCTATGCCAACCTCCAATTCCTCGTTTCCTTCAATTTTCTCTTCTCCATCTTTCCTCTCTTATCTCAGCCATGGAGATTTCTAAATCACTCTGTTTTTTCCTTCTTCTTCTTCTTCTTCTGTTCTTCGTCGACCAAGCTCGTTCGGCCGCCATTAATGGAGATTCTGAGAAGCTTCATCGTCTTCTTCATCTTCAGAAACTTCCATGGAAGCAGCAGGAAGAAGCTGTTATTAACTGCATCTTTCAGAAGCCAAGAGTGAGGGAGGGAATAACGACATTGGAAATGAAAGAAAGAGACTATTGTTCAGGCAAAGTCACGGACTGGCAAAAGAATCTCCAAAACCGCCTAATCTCCGACGCCATTCACGTCCAATCTCTTCAATCACGGATCAAATCCGCCATTTTTTCCGGCGACACCCACCAAATCTCCGACTCCCAAATTCCCTTGTCCTCCGGCACCAGGCTCCAAACCCTCAATTACATCGTCACCGTCGCCCTCGGCGGCCGGGATTCAACTCTCATTGTCGATACCGGCAGCGACCTCACTTGGGTCCAATGCCGCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCGATCCCTCAAATTCCTCTTCGTTCCTTTCCCTTTCTTGCAATTCCCCCACCTGTTTGGCTCTTCCTTCCGCCACCGGAAATTCCGGCCTCTGTGGGTATGGAAATTCAAGCTCCTGCGGTTACGAGATTAACTACGGCGATGGGTCTTATTCCCGCGGGGAACTTGGATTTGAGAGGCTGAATTTAGGGGAAATCGTTATCGATAATTTCATATTTGGGTGTGGCCGGAATAACAAGGGGTTGTTCGGCGGCGCTTCGGGATTAATGGGTTTAGGTCGGAGTAAATTATCTCTGGTTTCTCAAACTTCCTCTGTTTTTGACGGAATTTTTTCCTACTGTTTGCCTTCAACCGGCGCCGGAGCTTCAGGTTCTTTAACAATGGGCGGTGGCGATTTCTCAAATTTCAGAAACGTTTCTCCAATTTCCTACACAAGAATGGTCTCAAATCCTCAGATGCCAAATTTTTACTTTCTGAATCTCACAGGAATTACCATCGGTGGGGTAAATCTGGGTGTGCCAAACAACGGGGCTTTGAGTTTAATCGATTCAGGGACTGTAATTACCAGATTGACTCCATCGATTTACAGAGCTTTCAAAGCGGAATTTGAGAAGCAATTTTCTGGGTTTCAAACAGCGCCAGGATTCTCGATTTTGAACACTTGTTTTAATCTCACTGGGTTTAAAGAAGTGAATATTCCCACGGTGAAATTTTACTTCGAAGGGAATGCAGAAATGACTGTGGATGTTGAAGGGGTTTTTTACTTTGTGAAATCAGATGCTTCTCAGATCTGTTTGGCGTTTGCGAGTTTGGGTTATGAAGATCAGAGTATGATAATCGGGAATTACCAGCAGAAGAATCAGAGGGTTGTTTATAATTCCAAGGAATCCACGTTGGGTTTTGCAGCGGAGCCTTGCGGTTTCTAGCCCAGAGAGGATTTTCCGGGAAAGTGGAACGGATTTTCCCGGAAAATTTTGTTTATTACAGACAGGGTGGGGAATTGTCTCGTTTCCTCTCAGTAACAAAATAAAAAAGGGTGAGAATTTGGATTTGTATATTTCATTTTTTGTTTGTCTTGTAATTTGTACATTTCATTTCAAATATTGCAATTAATTCATCCGCCTATTTTCAAATTCCATTT

Coding sequence (CDS)

ATGGAGATTTCTAAATCACTCTGTTTTTTCCTTCTTCTTCTTCTTCTTCTGTTCTTCGTCGACCAAGCTCGTTCGGCCGCCATTAATGGAGATTCTGAGAAGCTTCATCGTCTTCTTCATCTTCAGAAACTTCCATGGAAGCAGCAGGAAGAAGCTGTTATTAACTGCATCTTTCAGAAGCCAAGAGTGAGGGAGGGAATAACGACATTGGAAATGAAAGAAAGAGACTATTGTTCAGGCAAAGTCACGGACTGGCAAAAGAATCTCCAAAACCGCCTAATCTCCGACGCCATTCACGTCCAATCTCTTCAATCACGGATCAAATCCGCCATTTTTTCCGGCGACACCCACCAAATCTCCGACTCCCAAATTCCCTTGTCCTCCGGCACCAGGCTCCAAACCCTCAATTACATCGTCACCGTCGCCCTCGGCGGCCGGGATTCAACTCTCATTGTCGATACCGGCAGCGACCTCACTTGGGTCCAATGCCGCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCGATCCCTCAAATTCCTCTTCGTTCCTTTCCCTTTCTTGCAATTCCCCCACCTGTTTGGCTCTTCCTTCCGCCACCGGAAATTCCGGCCTCTGTGGGTATGGAAATTCAAGCTCCTGCGGTTACGAGATTAACTACGGCGATGGGTCTTATTCCCGCGGGGAACTTGGATTTGAGAGGCTGAATTTAGGGGAAATCGTTATCGATAATTTCATATTTGGGTGTGGCCGGAATAACAAGGGGTTGTTCGGCGGCGCTTCGGGATTAATGGGTTTAGGTCGGAGTAAATTATCTCTGGTTTCTCAAACTTCCTCTGTTTTTGACGGAATTTTTTCCTACTGTTTGCCTTCAACCGGCGCCGGAGCTTCAGGTTCTTTAACAATGGGCGGTGGCGATTTCTCAAATTTCAGAAACGTTTCTCCAATTTCCTACACAAGAATGGTCTCAAATCCTCAGATGCCAAATTTTTACTTTCTGAATCTCACAGGAATTACCATCGGTGGGGTAAATCTGGGTGTGCCAAACAACGGGGCTTTGAGTTTAATCGATTCAGGGACTGTAATTACCAGATTGACTCCATCGATTTACAGAGCTTTCAAAGCGGAATTTGAGAAGCAATTTTCTGGGTTTCAAACAGCGCCAGGATTCTCGATTTTGAACACTTGTTTTAATCTCACTGGGTTTAAAGAAGTGAATATTCCCACGGTGAAATTTTACTTCGAAGGGAATGCAGAAATGACTGTGGATGTTGAAGGGGTTTTTTACTTTGTGAAATCAGATGCTTCTCAGATCTGTTTGGCGTTTGCGAGTTTGGGTTATGAAGATCAGAGTATGATAATCGGGAATTACCAGCAGAAGAATCAGAGGGTTGTTTATAATTCCAAGGAATCCACGTTGGGTTTTGCAGCGGAGCCTTGCGGTTTCTAG

Protein sequence

MEISKSLCFFLLLLLLLFFVDQARSAAINGDSEKLHRLLHLQKLPWKQQEEAVINCIFQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPCGF
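
The protein sequence corresponds to the coding sequence read codon by codon. The sketch below illustrates that relationship with the standard genetic code; it is an illustration only, not the pipeline the database used, and the names `CODON_TABLE` and `translate` are hypothetical. Only short fragments copied from the start and end of the CDS above are translated.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS, then its last three codons.
print(translate("ATGGAGATTTCTAAATCACTC"))
print(translate("GGTTTCTAG"))
```

The two fragments give `MEISKSL` and `GF`, which agree with the first and last residues of the protein sequence listed above.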
BLAST of Cp4.1LG12g07170 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 2.8e-82
Identity = 178/396 (44.95%), Positives = 240/396 (60.61%), Query Frame = 1
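
The percentages on this line are the match counts divided by the alignment length. A small sketch of that arithmetic follows; the helper name `percent` is hypothetical, and rounding to two decimals is assumed from the way the report prints its values.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage (two decimals)."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP line above: 178 identical and 240 similar
# columns out of 396 aligned columns.
print(percent(178, 396))
print(percent(240, 396))
```

The same calculation applies to every HSP in this report, since each lists its counts as `matches/length`.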

Query: 96  DAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGG--RDSTLIVD 155
           D   V S+ S++   + +    +   + +P   G+ L + NYIVTV LG    D +LI D
Sbjct: 91  DQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFD 150

Query: 156 TGSDLTWVQCRPC-RLCYNQQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGN 215
           TGSDLTW QC+PC R CY+Q+EP+F+PS S+S+ ++SC+S  C +L SATGN+G C   +
Sbjct: 151 TGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC---S 210

Query: 216 SSSCGYEINYGDGSYSRGELGFERLNL-GEIVIDNFIFGCGRNNKGLFGGASGLMGLGRS 275
           +S+C Y I YGD S+S G L  E+  L    V D   FGCG NN+GLF G +GL+GLGR 
Sbjct: 211 ASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRD 270

Query: 276 KLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPN 335
           KLS  SQT++ ++ IFSYCLPS+ A  +G LT G    S     +PIS     +     +
Sbjct: 271 KLSFPSQTATAYNKIFSYCLPSS-ASYTGHLTFGSAGISRSVKFTPIS-----TITDGTS 330

Query: 336 FYFLNLTGITIGGVNLGVPNN-----GALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGF 395
           FY LN+  IT+GG  L +P+      GAL  IDSGTVITRL P  Y A ++ F+ + S +
Sbjct: 331 FYGLNIVAITVGGQKLPIPSTVFSTPGAL--IDSGTVITRLPPKAYAALRSSFKAKMSKY 390

Query: 396 QTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFAS 455
            T  G SIL+TCF+L+GFK V IP V F F G A + +  +G+FY  K   SQ+CLAFA 
Sbjct: 391 PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAG 450

Query: 456 LGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
              +  + I GN QQ+   VVY+     +GFA   C
Sbjct: 451 NSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473

BLAST of Cp4.1LG12g07170 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 1.2e-67
Identity = 155/412 (37.62%), Positives = 239/412 (58.01%), Query Frame = 1

Query: 78  CSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNY 137
           CS   +D + +    +  D   V+S+ S++ S   + +  +   +++P  SG  L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 138 IVTVALGG--RDSTLIVDTGSDLTWVQCRPCR-LCYNQQEPLFDPSNSSSFLSLSCNSPT 197
           IVT+ +G    D +L+ DTGSDLTW QC PC   CY+Q+EP F+PS+SS++ ++SC+SP 
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 198 CLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNL-GEIVIDNFIFGCGR 257
           C    S +          +S+C Y I YGD S+++G L  E+  L    V+++  FGCG 
Sbjct: 194 CEDAESCS----------ASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 258 NNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDFSNFR 317
           NN+GLF G +GL+GLG  KLSL +QT++ ++ IFSYCLPS  + ++G LT G    S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 318 NVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGV-PNNGAL--SLIDSGTVITRLTPS 377
             +PIS     S P   N Y +++ GI++G   L + PN+ +   ++IDSGTV TRL   
Sbjct: 314 KFTPIS-----SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTK 373

Query: 378 IYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVF 437
           +Y   ++ F+++ S +++  G+ + +TC++ TG   V  PT+ F F G+  + +D  G+ 
Sbjct: 374 VYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGIS 433

Query: 438 YFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
             +K   SQ+CLAFA  G +D   I GN QQ    VVY+     +GFA   C
Sbjct: 434 LPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Cp4.1LG12g07170 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 1.3e-66
Identity = 151/430 (35.12%), Positives = 226/430 (52.56%), Query Frame = 1

Query: 69  TLEMKERD-YCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIF--SGDTHQISDSQIP 128
           TL +  RD + S    +    L  R+  D   V ++  RI   +   S   ++++D    
Sbjct: 60  TLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSD 119

Query: 129 LSSGTRLQTLNYIVTVALGG--RDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSS 188
           + SG    +  Y V + +G   RD  +++D+GSD+ WVQC+PC+LCY Q +P+FDP+ S 
Sbjct: 120 IVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSG 179

Query: 189 SFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIV 248
           S+  +SC S  C  +     NSG     +S  C YE+ YGDGSY++G L  E L   + V
Sbjct: 180 SYTGVSCGSSVCDRIE----NSGC----HSGGCRYEVMYGDGSYTKGTLALETLTFAKTV 239

Query: 249 IDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLT 308
           + N   GCG  N+G+F GA+GL+G+G   +S V Q S    G F YCL S G  ++GSL 
Sbjct: 240 VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLV 299

Query: 309 MGGGDFSNFRNVSPI--SYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALSL---- 368
            G       R   P+  S+  +V NP+ P+FY++ L G+ +GGV + +P +G   L    
Sbjct: 300 FG-------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLP-DGVFDLTETG 359

Query: 369 -----IDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTV 428
                +D+GT +TRL  + Y AF+  F+ Q +    A G SI +TC++L+GF  V +PTV
Sbjct: 360 DGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTV 419

Query: 429 KFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKE 483
            FYF     +T+     F     D+   C AFA+        IIGN QQ+  +V ++   
Sbjct: 420 SFYFTEGPVLTLPARN-FLMPVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGAN 470

BLAST of Cp4.1LG12g07170 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 4.5e-56
Identity = 161/494 (32.59%), Positives = 238/494 (48.18%), Query Frame = 1

Query: 6   SLCFFLLLLLLLFFVDQARSAAINGDSEKLHRLLHLQKLPWKQQEEAVINCIFQKPRVRE 65
           SLCFF L L     +   ++   N  S      +  Q        E+++   F+     E
Sbjct: 11  SLCFFFLSLPSFSSLPSFQTLFPNSHSLPCASPVSFQP---DSDSESLLESEFESGSDSE 70

Query: 66  GIT--TLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQ 125
             +  TL +   D  S   T   +   +RL  D+  V+S+ +          TH      
Sbjct: 71  SSSSITLNLDHIDALSSNKTP-DELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGG 130

Query: 126 IPLS--SGTRLQTLNYIVTVALG--GRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDP 185
              S  SG    +  Y   + +G   R   +++DTGSD+ W+QC PCR CY+Q +P+FDP
Sbjct: 131 FSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDP 190

Query: 186 SNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNL 245
             S ++ ++ C+SP C  L SA  N+         +C Y+++YGDGS++ G+   E L  
Sbjct: 191 RKSKTYATIPCSSPHCRRLDSAGCNT------RRKTCLYQVSYGDGSFTVGDFSTETLTF 250

Query: 246 GEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGAS 305
               +     GCG +N+GLF GA+GL+GLG+ KLS   QT   F+  FSYCL    A + 
Sbjct: 251 RRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSK 310

Query: 306 GSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNL-GVP-------- 365
            S  +    F N        +T ++SNP++  FY++ L GI++GG  + GV         
Sbjct: 311 PSSVV----FGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQ 370

Query: 366 -NNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNI 425
             NG + +IDSGT +TRL    Y A +  F       + AP FS+ +TCF+L+   EV +
Sbjct: 371 IGNGGV-IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKV 430

Query: 426 PTVKFYFEGNAEMTVDVEGVFYFVKSDAS-QICLAFASLGYEDQSMIIGNYQQKNQRVVY 483
           PTV  +F G     V +    Y +  D + + C AFA  G      IIGN QQ+  RVVY
Sbjct: 431 PTVVLHFRG---ADVSLPATNYLIPVDTNGKFCFAFA--GTMGGLSIIGNIQQQGFRVVY 484

BLAST of Cp4.1LG12g07170 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 6.1e-53
Identity = 133/445 (29.89%), Positives = 231/445 (51.91%), Query Frame = 1

Query: 69  TLEMKERD-YCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAI-----------FSGDT 128
           +LE+  RD + + +  D++    +RL  D+  V  + ++I+ A+           ++ DT
Sbjct: 81  SLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDT 140

Query: 129 -HQISDSQIPLSSGTRLQTLNYIVTVALG--GRDSTLIVDTGSDLTWVQCRPCRLCYNQQ 188
            +Q  D   P+ SG    +  Y   + +G   ++  L++DTGSD+ W+QC PC  CY Q 
Sbjct: 141 RYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQS 200

Query: 189 EPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELG 248
           +P+F+P++SS++ SL+C++P C  L ++      C    S+ C Y+++YGDGS++ GEL 
Sbjct: 201 DPVFNPTSSSTYKSLTCSAPQCSLLETSA-----C---RSNKCLYQVSYGDGSFTVGELA 260

Query: 249 FERLNLGEI-VIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLP 308
            + +  G    I+N   GCG +N+GLF GA+GL+GLG   LS+ +Q  +     FSYCL 
Sbjct: 261 TDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLV 320

Query: 309 STGAGASGSL-----TMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNL 368
              +G S SL      +GGGD          +   ++ N ++  FY++ L+G ++GG  +
Sbjct: 321 DRDSGKSSSLDFNSVQLGGGD----------ATAPLLRNKKIDTFYYVGLSGFSVGGEKV 380

Query: 369 GVPN---------NGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQT-APGFSILNT 428
            +P+         +G + ++D GT +TRL    Y + +  F K     +  +   S+ +T
Sbjct: 381 VLPDAIFDVDASGSGGV-ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDT 440

Query: 429 CFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIG 483
           C++ +    V +PTV F+F G   + +  +  +     D+   C AFA         IIG
Sbjct: 441 CYDFSSLSTVKVPTVAFHFTGGKSLDLPAKN-YLIPVDDSGTFCFAFAPT--SSSLSIIG 500

BLAST of Cp4.1LG12g07170 vs. TrEMBL
Match: A0A0A0K8J2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431320 PE=3 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 1.1e-213
Identity = 378/497 (76.06%), Positives = 425/497 (85.51%), Query Frame = 1

Query: 1   MEISKSLCFFLLLLLLLFF------VDQARSAAIN---GDSEKLHRLLHLQKLPWKQQEE 60
           MEISKSL F L LLLLL        VD ARS++ N   GD+ +   L   Q  PWK+  E
Sbjct: 1   MEISKSLHFPLSLLLLLLLPLLSIGVD-ARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60

Query: 61  AVINCIFQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAI 120
           AV+NCIFQKP++ +GITTLEMK+RDYCSGK+TDW+K  QNR+I DAI+V SL S  KSAI
Sbjct: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120

Query: 121 FSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYN 180
           F G THQ+SDSQIP+SSG RLQTLNYIVTV +GG++STLIVDTGSDLTWVQC PCRLCYN
Sbjct: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180

Query: 181 QQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGE 240
           QQEPLF+PSNSSSFLSL CNSPTC+AL    G+SGLC   NS+SC Y+I+YGDGSYSRGE
Sbjct: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240

Query: 241 LGFERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCL 300
           LGFE+L LG+  IDNFIFGCGRNNKGLFGGASGLMGL RS+LSLVSQTSS+F  +FSYCL
Sbjct: 241 LGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300

Query: 301 PSTGAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVP- 360
           P+TG G+SGSLT+GG DFSNF+N+SPISYTRM+ NPQM NFYFLNLTGI+IGGVNL VP 
Sbjct: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360

Query: 361 ---NNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEV 420
              N G LSL+DSGTVITRL+PSIY+AFKAEFEKQFSG++T PGFSILNTCFNLTG++EV
Sbjct: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420

Query: 421 NIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVV 480
           NIPTVKF FEGNAEM VDVEGVFYFVKSDASQICLAFASLGYEDQ+MIIGNYQQKNQRV+
Sbjct: 421 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480

Query: 481 YNSKESTLGFAAEPCGF 485
           YNSKES +GFA EPC F
Sbjct: 481 YNSKESKVGFAGEPCSF 496

BLAST of Cp4.1LG12g07170 vs. TrEMBL
Match: M5WTF1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005040mg PE=3 SV=1)

HSP 1 Score: 573.5 bits (1477), Expect = 2.3e-160
Identity = 304/481 (63.20%), Positives = 369/481 (76.72%), Query Frame = 1

Query: 10  FLLLLLLLFFVDQARSAAING-DSEKLHRLLHLQKLPWKQQEEAVIN-CIFQKPRVREGI 69
           F +LL LLF        + NG  S K  ++L LQ+  W+Q        C+ QK R  +G 
Sbjct: 8   FQVLLHLLFLC----LCSANGVQSFKEKKVLKLQEFRWRQHGGTRSTVCLSQKSRKEKGA 67

Query: 70  TTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLS 129
           T LE+K RDYCSGK+ DW K  Q RLI D +HV+SLQS+ K+ + SG    +S++QIPL+
Sbjct: 68  TILEIKHRDYCSGKIVDWDKKQQKRLIFDDLHVRSLQSQFKNRV-SGRIKDLSEAQIPLT 127

Query: 130 SGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSSSFLS 189
           SG RLQTLNYIVTV LGGR+ T+IVDTGSDLTWVQC+PC+LCYNQQEPLF+ S S S+ S
Sbjct: 128 SGIRLQTLNYIVTVELGGRNMTVIVDTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKS 187

Query: 190 LSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIVIDNF 249
           + CNS TC AL   TGNSG CG  N +SC Y +NYGDGSY+RGELG + L+LG   ++NF
Sbjct: 188 VLCNSSTCQALQFDTGNSGACG-SNPTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNF 247

Query: 250 IFGCGRNNKGLFGGASGLMGLGRSK-LSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGG 309
           +FGCGRNNKGLFGGASGLMGLGRS+ +SLVSQTS++F G+FSYCLP+T A ASGSL M G
Sbjct: 248 VFGCGRNNKGLFGGASGLMGLGRSESVSLVSQTSALFGGVFSYCLPTTEATASGSLIM-G 307

Query: 310 GDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALS---LIDSGTV 369
           GD S ++N +PISYTRMV NP++  FYFLNLTGI+IGGV L   N    S   LIDSGTV
Sbjct: 308 GDASIYKNSTPISYTRMVPNPELSTFYFLNLTGISIGGVAL--QNQSFASGGILIDSGTV 367

Query: 370 ITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMT 429
           I+RL PS+Y+A KAEF KQFSG+  APGF+IL+TCFNL+ ++EV+IPT+KF+FEGNAE+ 
Sbjct: 368 ISRLAPSVYKAVKAEFLKQFSGYPPAPGFAILDTCFNLSAYQEVSIPTLKFHFEGNAELN 427

Query: 430 VDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPCG 485
           VDV G+FY VK+DASQICLA ASL YED+  IIGNYQQKNQRV+YN+K+S LGFA E C 
Sbjct: 428 VDVTGIFYLVKTDASQICLALASLSYEDEIGIIGNYQQKNQRVIYNTKDSKLGFAEESCS 479

BLAST of Cp4.1LG12g07170 vs. TrEMBL
Match: A0A067EW14_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011482mg PE=3 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 6.8e-160
Identity = 291/449 (64.81%), Positives = 349/449 (77.73%), Query Frame = 1

Query: 39  LHLQKLPWKQQEEAVINCI-FQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDA 98
           LHL KL W+Q+  +  +C+  QK R+  G  TLE+K ++YCSGK+ DW +  QNRLI D 
Sbjct: 37  LHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDN 96

Query: 99  IHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSD 158
           +HVQ LQSRIK+ I SG+   +S+++IPL+SG RLQTLNYI T+ LGGR+ T+IVDTGSD
Sbjct: 97  LHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSD 156

Query: 159 LTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCG 218
           LTWVQC+PC+ CYNQQ+P+FDPS S S+  + CNS TC AL  ATGNSG+C   +   C 
Sbjct: 157 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 216

Query: 219 YEINYGDGSYSRGELGFERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVS 278
           Y ++YGDGSY+RGELG E L LG+  +++FIFGCGRNNKGLFGG SGLMGLGRS LSLVS
Sbjct: 217 YFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 276

Query: 279 QTSSVFDGIFSYCLPST-GAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLN 338
           QTS +F G+FSYCLPST  AGASGSL +GG   S F+N +PI+YT M+ NPQ+  FY LN
Sbjct: 277 QTSEIFGGLFSYCLPSTQDAGASGSLILGGNS-SVFKNSTPITYTNMIPNPQLATFYILN 336

Query: 339 LTGITIGGVNL---GVPNNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFS 398
           LTGI+IGG  L   G    G   LIDSGTVITRL PSIY A KAEF KQFSGF +APGFS
Sbjct: 337 LTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 396

Query: 399 ILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQS 458
           IL+TCFNL+ ++EVNIP VK  FEGNAEMTVDV G+ YFVKSDASQ+CLA ASL YED++
Sbjct: 397 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 456

Query: 459 MIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
            IIGNYQQKNQRV+Y++K S LGFA E C
Sbjct: 457 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481

BLAST of Cp4.1LG12g07170 vs. TrEMBL
Match: B9RGP5_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1442500 PE=3 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 3.4e-159
Identity = 286/416 (68.75%), Positives = 338/416 (81.25%), Query Frame = 1

Query: 72  MKERDYC--SGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSG 131
           MK RD+C  SGK TDW K LQ  LI D   V+SLQSRIKS IFSG+     DSQIPLSSG
Sbjct: 1   MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKS-IFSGNNIDALDSQIPLSSG 60

Query: 132 TRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLS 191
            RLQTLNYIVTV +GGR+ T+IVDTGSDLTWVQC+PCRLCYNQQ+PLF+PS S S+ ++ 
Sbjct: 61  VRLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTIL 120

Query: 192 CNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIVIDNFIF 251
           CNS TC +L  ATGN G+CG  N+ +C Y +NYGDGSY+RG+LG E+LNLG   + NFIF
Sbjct: 121 CNSSTCQSLQYATGNLGVCG-SNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIF 180

Query: 252 GCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDF 311
           GCGRNNKGLFGGASGLMGLG+S LSLVSQTS++F+G+FSYCLP+T A ASGSL +GG   
Sbjct: 181 GCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGGNS- 240

Query: 312 SNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALS-LIDSGTVITRLT 371
           S ++N +PISYTRM++NPQ+P FYFLNLTGI+IGGV L  PN      LIDSGTVITRL 
Sbjct: 241 SVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITRLP 300

Query: 372 PSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEG 431
           P +YR  KAEF KQFSGF +AP FSIL+TCFNL G+ EV+IPT++  FEGNAE+TVDV G
Sbjct: 301 PPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAELTVDVTG 360

Query: 432 VFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPCGF 485
           +FYFVK+DASQ+CLA ASL ++D+  IIGNYQQ+NQRV+YN+KES LGFAAE C F
Sbjct: 361 IFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACSF 413

BLAST of Cp4.1LG12g07170 vs. TrEMBL
Match: V4SPS2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025395mg PE=3 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 2.9e-158
Identity = 289/445 (64.94%), Positives = 347/445 (77.98%), Query Frame = 1

Query: 39  LHLQKLPWKQQEEAVINCI-FQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDA 98
           LHL KL W+Q+  +  +C+  QK R+  G  TLE+K ++YCSGK+ DW +  QNRLI D 
Sbjct: 37  LHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDN 96

Query: 99  IHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSD 158
           +HVQ LQSRIK+ I SG+   +S+++IPL+SG RLQTLNYI T+ LGGR+ T+IVDTGSD
Sbjct: 97  LHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSD 156

Query: 159 LTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCG 218
           LTWVQC+PC+ CYNQQ+P+FDPS S S+  + CNS TC AL  ATGNSG+C   +   C 
Sbjct: 157 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 216

Query: 219 YEINYGDGSYSRGELGFERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVS 278
           Y ++YGDGSY+RGELG E L LG+  +++FIFGCGRNNKGLFGG SGLMGLGRS LSLVS
Sbjct: 217 YFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 276

Query: 279 QTSSVFDGIFSYCLPST-GAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLN 338
           QTS +F G+FSYCLPST  AGASGSL +GG   S F+N +PI+YT M+ NPQ+  FY LN
Sbjct: 277 QTSEIFGGLFSYCLPSTQDAGASGSLILGGNS-SVFKNSTPITYTNMIPNPQLATFYILN 336

Query: 339 LTGITIGGVNL---GVPNNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFS 398
           LTGI+IGG  L   G    G   LIDSGTVITRL PSIY A KAEF KQFSGF +APGFS
Sbjct: 337 LTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 396

Query: 399 ILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQS 458
           IL+TCFNL+ ++EVNIP VK  FEGNAEMTVDV G+ YFVKSDASQ+CLA ASL YED++
Sbjct: 397 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 456

Query: 459 MIIGNYQQKNQRVVYNSKESTLGFA 479
            IIGNYQQKNQRV+Y++K S LGFA
Sbjct: 457 GIIGNYQQKNQRVIYDTKNSQLGFA 477

BLAST of Cp4.1LG12g07170 vs. TAIR10
Match: AT1G79720.1 (AT1G79720.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 509.6 bits (1311), Expect = 2.1e-144
Identity = 267/474 (56.33%), Positives = 338/474 (71.31%), Query Frame = 1

Query: 12  LLLLLLFFVDQARSAAINGDSEKLHRLLHLQKLPWKQQEEAVINCIFQKPRVREGITTLE 71
           LLL+ LF +    S  ++G  EK    +H      K+  EA  +C  +        TTLE
Sbjct: 14  LLLVFLFLL----SCVVHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGKGRESTTLE 73

Query: 72  MKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTR 131
           MK R+ CSGK  D  K ++  L+ D I VQSLQ +IK+   S     +S++QIPL+SG +
Sbjct: 74  MKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIK 133

Query: 132 LQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLSCN 191
           L++LNYIVTV LGG++ +LIVDTGSDLTWVQC+PCR CYNQQ PL+DPS SSS+ ++ CN
Sbjct: 134 LESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCN 193

Query: 192 SPTCLALPSATGNSGLCGYGNS---SSCGYEINYGDGSYSRGELGFERLNLGEIVIDNFI 251
           S TC  L +AT NSG CG  N    + C Y ++YGDGSY+RG+L  E + LG+  ++NF+
Sbjct: 194 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFV 253

Query: 252 FGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGD 311
           FGCGRNNKGLFGG+SGLMGLGRS +SLVSQT   F+G+FSYCLPS   GASGSL+ G  D
Sbjct: 254 FGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGN-D 313

Query: 312 FSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALSLIDSGTVITRLT 371
            S + N + +SYT +V NPQ+ +FY LNLTG +IGGV L   + G   LIDSGTVITRL 
Sbjct: 314 SSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTVITRLP 373

Query: 372 PSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEG 431
           PSIY+A K EF KQFSGF TAPG+SIL+TCFNLT +++++IP +K  F+GNAE+ VDV G
Sbjct: 374 PSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTG 433

Query: 432 VFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
           VFYFVK DAS +CLA ASL YE++  IIGNYQQKNQRV+Y++ +  LG   E C
Sbjct: 434 VFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482

BLAST of Cp4.1LG12g07170 vs. TAIR10
Match: AT5G10770.1 (AT5G10770.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 307.4 bits (786), Expect = 1.6e-83
Identity = 178/396 (44.95%), Positives = 240/396 (60.61%), Query Frame = 1

Query: 96  DAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGG--RDSTLIVD 155
           D   V S+ S++   + +    +   + +P   G+ L + NYIVTV LG    D +LI D
Sbjct: 91  DQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFD 150

Query: 156 TGSDLTWVQCRPC-RLCYNQQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGN 215
           TGSDLTW QC+PC R CY+Q+EP+F+PS S+S+ ++SC+S  C +L SATGN+G C   +
Sbjct: 151 TGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC---S 210

Query: 216 SSSCGYEINYGDGSYSRGELGFERLNL-GEIVIDNFIFGCGRNNKGLFGGASGLMGLGRS 275
           +S+C Y I YGD S+S G L  E+  L    V D   FGCG NN+GLF G +GL+GLGR 
Sbjct: 211 ASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRD 270

Query: 276 KLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPN 335
           KLS  SQT++ ++ IFSYCLPS+ A  +G LT G    S     +PIS     +     +
Sbjct: 271 KLSFPSQTATAYNKIFSYCLPSS-ASYTGHLTFGSAGISRSVKFTPIS-----TITDGTS 330

Query: 336 FYFLNLTGITIGGVNLGVPNN-----GALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGF 395
           FY LN+  IT+GG  L +P+      GAL  IDSGTVITRL P  Y A ++ F+ + S +
Sbjct: 331 FYGLNIVAITVGGQKLPIPSTVFSTPGAL--IDSGTVITRLPPKAYAALRSSFKAKMSKY 390

Query: 396 QTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFAS 455
            T  G SIL+TCF+L+GFK V IP V F F G A + +  +G+FY  K   SQ+CLAFA 
Sbjct: 391 PTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAG 450

Query: 456 LGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
              +  + I GN QQ+   VVY+     +GFA   C
Sbjct: 451 NSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473

BLAST of Cp4.1LG12g07170 vs. TAIR10
Match: AT5G10760.1 (AT5G10760.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 258.8 bits (660), Expect = 6.5e-69
Identity = 155/412 (37.62%), Positives = 239/412 (58.01%), Query Frame = 1

Query: 78  CSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNY 137
           CS   +D + +    +  D   V+S+ S++ S   + +  +   +++P  SG  L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 138 IVTVALGG--RDSTLIVDTGSDLTWVQCRPCR-LCYNQQEPLFDPSNSSSFLSLSCNSPT 197
           IVT+ +G    D +L+ DTGSDLTW QC PC   CY+Q+EP F+PS+SS++ ++SC+SP 
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 198 CLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNL-GEIVIDNFIFGCGR 257
           C    S +          +S+C Y I YGD S+++G L  E+  L    V+++  FGCG 
Sbjct: 194 CEDAESCS----------ASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 258 NNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDFSNFR 317
           NN+GLF G +GL+GLG  KLSL +QT++ ++ IFSYCLPS  + ++G LT G    S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 318 NVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGV-PNNGAL--SLIDSGTVITRLTPS 377
             +PIS     S P   N Y +++ GI++G   L + PN+ +   ++IDSGTV TRL   
Sbjct: 314 KFTPIS-----SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTK 373

Query: 378 IYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVF 437
           +Y   ++ F+++ S +++  G+ + +TC++ TG   V  PT+ F F G+  + +D  G+ 
Sbjct: 374 VYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGIS 433

Query: 438 YFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
             +K   SQ+CLAFA  G +D   I GN QQ    VVY+     +GFA   C
Sbjct: 434 LPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Cp4.1LG12g07170 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 255.4 bits (651), Expect = 7.2e-68
Identity = 151/430 (35.12%), Positives = 226/430 (52.56%), Query Frame = 1

Query: 69  TLEMKERD-YCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIF--SGDTHQISDSQIP 128
           TL +  RD + S    +    L  R+  D   V ++  RI   +   S   ++++D    
Sbjct: 60  TLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSD 119

Query: 129 LSSGTRLQTLNYIVTVALGG--RDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSS 188
           + SG    +  Y V + +G   RD  +++D+GSD+ WVQC+PC+LCY Q +P+FDP+ S 
Sbjct: 120 IVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSG 179

Query: 189 SFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIV 248
           S+  +SC S  C  +     NSG     +S  C YE+ YGDGSY++G L  E L   + V
Sbjct: 180 SYTGVSCGSSVCDRIE----NSGC----HSGGCRYEVMYGDGSYTKGTLALETLTFAKTV 239

Query: 249 IDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLT 308
           + N   GCG  N+G+F GA+GL+G+G   +S V Q S    G F YCL S G  ++GSL 
Sbjct: 240 VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLV 299

Query: 309 MGGGDFSNFRNVSPI--SYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALSL---- 368
            G       R   P+  S+  +V NP+ P+FY++ L G+ +GGV + +P +G   L    
Sbjct: 300 FG-------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLP-DGVFDLTETG 359

Query: 369 -----IDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTV 428
                +D+GT +TRL  + Y AF+  F+ Q +    A G SI +TC++L+GF  V +PTV
Sbjct: 360 DGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTV 419

Query: 429 KFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKE 483
            FYF     +T+     F     D+   C AFA+        IIGN QQ+  +V ++   
Sbjct: 420 SFYFTEGPVLTLPARN-FLMPVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGAN 470

BLAST of Cp4.1LG12g07170 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 237.3 bits (604), Expect = 2.0e-62
Identity = 159/475 (33.47%), Positives = 247/475 (52.00%), Query Frame = 1

Query: 32  SEKLHRLLHLQKLPWKQQEEAVINCIFQKPRVREGITTLEMKERDYCSG-KVTDWQKNLQ 91
           ++ +HR  +       QQEE   +             +L++  R    G + +D++    
Sbjct: 39  ADSIHRTKYTSSFRLNQQEEQTHSA--------SSSFSLQLHSRVSVRGTEHSDYKSLTL 98

Query: 92  NRLISDAIHVQSLQSRIKSAIFS---GDTHQIS--------DSQIPLSSGTRLQTLNYIV 151
            RL  D   V+SL +R+  AI +    D   IS        D + PL SGT   +  Y  
Sbjct: 99  ARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFT 158

Query: 152 TVALG--GRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLSCNSPTCLA 211
            V +G   R+  +++DTGSD+ W+QC PC  CY+Q EP+F+PS+SSS+  LSC++P C A
Sbjct: 159 RVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNA 218

Query: 212 LPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIVIDNFIFGCGRNNKG 271
           L  +      C    +++C YE++YGDGSY+ G+   E L +G  ++ N   GCG +N+G
Sbjct: 219 LEVSE-----C---RNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGCGHSNEG 278

Query: 272 LFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGGGDFSNFRNVSP 331
           LF GA+GL+GLG   L+L SQ ++     FSYCL    + ++ ++  G        ++SP
Sbjct: 279 LFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFG-------TSLSP 338

Query: 332 ISYTR-MVSNPQMPNFYFLNLTGITIGGVNLGVP---------NNGALSLIDSGTVITRL 391
            +    ++ N Q+  FY+L LTGI++GG  L +P          +G + +IDSGT +TRL
Sbjct: 339 DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGI-IIDSGTAVTRL 398

Query: 392 TPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVE 451
              IY + +  F K     + A G ++ +TC+NL+    V +PTV F+F G   + +  +
Sbjct: 399 QTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAK 458

Query: 452 GVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
                V S     CLAFA         IIGN QQ+  RV ++   S +GF++  C
Sbjct: 459 NYMIPVDS-VGTFCLAFAPTA--SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of Cp4.1LG12g07170 vs. NCBI nr
Match: gi|778728858|ref|XP_004135889.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 750.7 bits (1937), Expect = 1.5e-213
Identity = 378/497 (76.06%), Positives = 425/497 (85.51%), Query Frame = 1

Query: 1   MEISKSLCFFLLLLLLLFF------VDQARSAAIN---GDSEKLHRLLHLQKLPWKQQEE 60
           MEISKSL F L LLLLL        VD ARS++ N   GD+ +   L   Q  PWK+  E
Sbjct: 1   MEISKSLHFPLSLLLLLLLPLLSIGVD-ARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGE 60

Query: 61  AVINCIFQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAI 120
           AV+NCIFQKP++ +GITTLEMK+RDYCSGK+TDW+K  QNR+I DAI+V SL S  KSAI
Sbjct: 61  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120

Query: 121 FSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYN 180
           F G THQ+SDSQIP+SSG RLQTLNYIVTV +GG++STLIVDTGSDLTWVQC PCRLCYN
Sbjct: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180

Query: 181 QQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGE 240
           QQEPLF+PSNSSSFLSL CNSPTC+AL    G+SGLC   NS+SC Y+I+YGDGSYSRGE
Sbjct: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240

Query: 241 LGFERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCL 300
           LGFE+L LG+  IDNFIFGCGRNNKGLFGGASGLMGL RS+LSLVSQTSS+F  +FSYCL
Sbjct: 241 LGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300

Query: 301 PSTGAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVP- 360
           P+TG G+SGSLT+GG DFSNF+N+SPISYTRM+ NPQM NFYFLNLTGI+IGGVNL VP 
Sbjct: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360

Query: 361 ---NNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEV 420
              N G LSL+DSGTVITRL+PSIY+AFKAEFEKQFSG++T PGFSILNTCFNLTG++EV
Sbjct: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420

Query: 421 NIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVV 480
           NIPTVKF FEGNAEM VDVEGVFYFVKSDASQICLAFASLGYEDQ+MIIGNYQQKNQRV+
Sbjct: 421 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480

Query: 481 YNSKESTLGFAAEPCGF 485
           YNSKES +GFA EPC F
Sbjct: 481 YNSKESKVGFAGEPCSF 496

BLAST of Cp4.1LG12g07170 vs. NCBI nr
Match: gi|659122560|ref|XP_008461208.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 750.7 bits (1937), Expect = 1.5e-213
Identity = 378/494 (76.52%), Positives = 425/494 (86.03%), Query Frame = 1

Query: 1   MEISKSLCF-----FLLLLLLLFFVDQARSAAINGDSEKLHRLLHL-QKLPWKQQEEAVI 60
           ME+SKSL F     FLLLL LLF +  ARS+  NG +     LL L Q  PWK+  EAV+
Sbjct: 3   MEVSKSLHFPLSLLFLLLLPLLFIIVDARSSVGNGGNYHEKGLLQLFQNFPWKEHGEAVV 62

Query: 61  NCIFQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSG 120
           NCIFQKP++ +GITTLEMK+RDYCSGK+TD +K  QNR+I DAI+V SL S +KSAIF G
Sbjct: 63  NCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSAIFPG 122

Query: 121 DTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQE 180
            THQ+SDSQIP+SSG RLQTLNYIVTV +GG++STLIVDTGSDLTWVQC PCRLCYNQQE
Sbjct: 123 QTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQE 182

Query: 181 PLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGF 240
           PLF+PSNSSSFLSL C+SPTCLAL    G+SGLC   NS+SC Y+I+YGDGSYSRGELG+
Sbjct: 183 PLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGY 242

Query: 241 ERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPST 300
           E+L LG+  IDNFIFGCGRNNKGLFGGASGLMGL RS+LSLVSQTSSVF  IFSYCLP+T
Sbjct: 243 EKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYCLPTT 302

Query: 301 GAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVP---- 360
           G G+SGSLT+GG DFS+F+N+SPISYTRM+ NPQM NFYFLNLTGI+IGGVNL VP    
Sbjct: 303 GVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSS 362

Query: 361 NNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIP 420
           N G LSL+DSGTVITRL+PSIY+AFKAEFEKQFSG++T PGFSILNTCFNLTG++EVNIP
Sbjct: 363 NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIP 422

Query: 421 TVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNS 480
           TVKF FEGNAEM VDVEGVFYFVKSDASQICLAFASLGYEDQ+MIIGNYQQKNQRVVYNS
Sbjct: 423 TVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVVYNS 482

Query: 481 KESTLGFAAEPCGF 485
           KES +GFA EPC F
Sbjct: 483 KESKVGFAGEPCSF 496

BLAST of Cp4.1LG12g07170 vs. NCBI nr
Match: gi|1000979824|ref|XP_015570839.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis])

HSP 1 Score: 593.6 bits (1529), Expect = 3.1e-166
Identity = 306/483 (63.35%), Positives = 373/483 (77.23%), Query Frame = 1

Query: 9   FFLLLLLLLFFVDQARSAAINGDSEKLH--RLLHLQKLPWKQQEEAVIN--CIFQKPRVR 68
           ++ LL LL+F +       +NG ++ L   ++L LQ+  W+ +     N  C+ QK +  
Sbjct: 16  YYTLLSLLVFLL-----TVVNGGAQSLQEKKVLSLQEYQWQLKSNTDTNSSCLSQKSKRE 75

Query: 69  EGITTLEMKERDYC--SGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDS 128
           +G T LEMK RD+C  SGK TDW K LQ  LI D   V+SLQSRIKS IFSG+     DS
Sbjct: 76  KGATILEMKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKS-IFSGNNIDALDS 135

Query: 129 QIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNS 188
           QIPLSSG RLQTLNYIVTV +GGR+ T+IVDTGSDLTWVQC+PCRLCYNQQ+PLF+PS S
Sbjct: 136 QIPLSSGVRLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGS 195

Query: 189 SSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEI 248
            S+ ++ CNS TC +L  ATGN G+CG  N+ +C Y +NYGDGSY+RG+LG E+LNLG  
Sbjct: 196 PSYQTILCNSSTCQSLQYATGNLGVCG-SNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT 255

Query: 249 VIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVSQTSSVFDGIFSYCLPSTGAGASGSL 308
            + NFIFGCGRNNKGLFGGASGLMGLG+S LSLVSQTS++F+G+FSYCLP+T A ASGSL
Sbjct: 256 HVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSL 315

Query: 309 TMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALS-LIDSG 368
            +GG   S ++N +PISYTRM++NPQ+P FYFLNLTGI+IGGV L  PN      LIDSG
Sbjct: 316 ILGGNS-SVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSG 375

Query: 369 TVITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAE 428
           TVITRL P +YR  KAEF KQFSGF +AP FSIL+TCFNL G+ EV+IPT++  FEGNAE
Sbjct: 376 TVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAE 435

Query: 429 MTVDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEP 485
           +TVDV G+FYFVK+DASQ+CLA ASL ++D+  IIGNYQQ+NQRV+YN+KES LGFAAE 
Sbjct: 436 LTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEA 490

BLAST of Cp4.1LG12g07170 vs. NCBI nr
Match: gi|595931085|ref|XP_007215297.1| (hypothetical protein PRUPE_ppa005040mg [Prunus persica])

HSP 1 Score: 573.5 bits (1477), Expect = 3.4e-160
Identity = 304/481 (63.20%), Positives = 369/481 (76.72%), Query Frame = 1

Query: 10  FLLLLLLLFFVDQARSAAING-DSEKLHRLLHLQKLPWKQQEEAVIN-CIFQKPRVREGI 69
           F +LL LLF        + NG  S K  ++L LQ+  W+Q        C+ QK R  +G 
Sbjct: 8   FQVLLHLLFLC----LCSANGVQSFKEKKVLKLQEFRWRQHGGTRSTVCLSQKSRKEKGA 67

Query: 70  TTLEMKERDYCSGKVTDWQKNLQNRLISDAIHVQSLQSRIKSAIFSGDTHQISDSQIPLS 129
           T LE+K RDYCSGK+ DW K  Q RLI D +HV+SLQS+ K+ + SG    +S++QIPL+
Sbjct: 68  TILEIKHRDYCSGKIVDWDKKQQKRLIFDDLHVRSLQSQFKNRV-SGRIKDLSEAQIPLT 127

Query: 130 SGTRLQTLNYIVTVALGGRDSTLIVDTGSDLTWVQCRPCRLCYNQQEPLFDPSNSSSFLS 189
           SG RLQTLNYIVTV LGGR+ T+IVDTGSDLTWVQC+PC+LCYNQQEPLF+ S S S+ S
Sbjct: 128 SGIRLQTLNYIVTVELGGRNMTVIVDTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKS 187

Query: 190 LSCNSPTCLALPSATGNSGLCGYGNSSSCGYEINYGDGSYSRGELGFERLNLGEIVIDNF 249
           + CNS TC AL   TGNSG CG  N +SC Y +NYGDGSY+RGELG + L+LG   ++NF
Sbjct: 188 VLCNSSTCQALQFDTGNSGACG-SNPTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNF 247

Query: 250 IFGCGRNNKGLFGGASGLMGLGRSK-LSLVSQTSSVFDGIFSYCLPSTGAGASGSLTMGG 309
           +FGCGRNNKGLFGGASGLMGLGRS+ +SLVSQTS++F G+FSYCLP+T A ASGSL M G
Sbjct: 248 VFGCGRNNKGLFGGASGLMGLGRSESVSLVSQTSALFGGVFSYCLPTTEATASGSLIM-G 307

Query: 310 GDFSNFRNVSPISYTRMVSNPQMPNFYFLNLTGITIGGVNLGVPNNGALS---LIDSGTV 369
           GD S ++N +PISYTRMV NP++  FYFLNLTGI+IGGV L   N    S   LIDSGTV
Sbjct: 308 GDASIYKNSTPISYTRMVPNPELSTFYFLNLTGISIGGVAL--QNQSFASGGILIDSGTV 367

Query: 370 ITRLTPSIYRAFKAEFEKQFSGFQTAPGFSILNTCFNLTGFKEVNIPTVKFYFEGNAEMT 429
           I+RL PS+Y+A KAEF KQFSG+  APGF+IL+TCFNL+ ++EV+IPT+KF+FEGNAE+ 
Sbjct: 368 ISRLAPSVYKAVKAEFLKQFSGYPPAPGFAILDTCFNLSAYQEVSIPTLKFHFEGNAELN 427

Query: 430 VDVEGVFYFVKSDASQICLAFASLGYEDQSMIIGNYQQKNQRVVYNSKESTLGFAAEPCG 485
           VDV G+FY VK+DASQICLA ASL YED+  IIGNYQQKNQRV+YN+K+S LGFA E C 
Sbjct: 428 VDVTGIFYLVKTDASQICLALASLSYEDEIGIIGNYQQKNQRVIYNTKDSKLGFAEESCS 479

BLAST of Cp4.1LG12g07170 vs. NCBI nr
Match: gi|641836437|gb|KDO55402.1| (hypothetical protein CISIN_1g011482mg [Citrus sinensis])

HSP 1 Score: 572.0 bits (1473), Expect = 9.8e-160
Identity = 291/449 (64.81%), Positives = 349/449 (77.73%), Query Frame = 1

Query: 39  LHLQKLPWKQQEEAVINCI-FQKPRVREGITTLEMKERDYCSGKVTDWQKNLQNRLISDA 98
           LHL KL W+Q+  +  +C+  QK R+  G  TLE+K ++YCSGK+ DW +  QNRLI D 
Sbjct: 37  LHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDN 96

Query: 99  IHVQSLQSRIKSAIFSGDTHQISDSQIPLSSGTRLQTLNYIVTVALGGRDSTLIVDTGSD 158
           +HVQ LQSRIK+ I SG+   +S+++IPL+SG RLQTLNYI T+ LGGR+ T+IVDTGSD
Sbjct: 97  LHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSD 156

Query: 159 LTWVQCRPCRLCYNQQEPLFDPSNSSSFLSLSCNSPTCLALPSATGNSGLCGYGNSSSCG 218
           LTWVQC+PC+ CYNQQ+P+FDPS S S+  + CNS TC AL  ATGNSG+C   +   C 
Sbjct: 157 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 216

Query: 219 YEINYGDGSYSRGELGFERLNLGEIVIDNFIFGCGRNNKGLFGGASGLMGLGRSKLSLVS 278
           Y ++YGDGSY+RGELG E L LG+  +++FIFGCGRNNKGLFGG SGLMGLGRS LSLVS
Sbjct: 217 YFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 276

Query: 279 QTSSVFDGIFSYCLPST-GAGASGSLTMGGGDFSNFRNVSPISYTRMVSNPQMPNFYFLN 338
           QTS +F G+FSYCLPST  AGASGSL +GG   S F+N +PI+YT M+ NPQ+  FY LN
Sbjct: 277 QTSEIFGGLFSYCLPSTQDAGASGSLILGGNS-SVFKNSTPITYTNMIPNPQLATFYILN 336

Query: 339 LTGITIGGVNL---GVPNNGALSLIDSGTVITRLTPSIYRAFKAEFEKQFSGFQTAPGFS 398
           LTGI+IGG  L   G    G   LIDSGTVITRL PSIY A KAEF KQFSGF +APGFS
Sbjct: 337 LTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 396

Query: 399 ILNTCFNLTGFKEVNIPTVKFYFEGNAEMTVDVEGVFYFVKSDASQICLAFASLGYEDQS 458
           IL+TCFNL+ ++EVNIP VK  FEGNAEMTVDV G+ YFVKSDASQ+CLA ASL YED++
Sbjct: 397 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 456

Query: 459 MIIGNYQQKNQRVVYNSKESTLGFAAEPC 483
            IIGNYQQKNQRV+Y++K S LGFA E C
Sbjct: 457 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
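
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. As a minimal sketch, assuming the two header lines keep the layout shown in this record, they could be parsed and re-checked as follows. The helper name `parse_hsp` and the regular expressions are illustrative only and are not part of any BLAST tool; the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical patterns for the two HSP header lines as they appear in this record.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*Expect\s*=\s*(?P<evalue>\S+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+).*?"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(score_line, identity_line):
    """Return bit score, E-value and recomputed percentages for one HSP."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP header lines")
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        # Percentages are counts over the alignment length, rounded to 2 places.
        "identity_pct": round(100 * int(i["ident"]) / length, 2),
        "positives_pct": round(100 * int(i["pos"]) / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 255.4 bits (651), Expect = 7.2e-68",
    "Identity = 151/430 (35.12%), Positives = 226/430 (52.56%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 35.12 52.56
```

Recomputing the percentages from the counts, rather than reading the bracketed values, gives a quick consistency check against the summary tables further down.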

The following BLAST results are available for this feature:
Match Name                          E-value    Identity (%)  Description
ASPA_ARATH                          2.8e-82    44.95         Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ...
AED1_ARATH                          1.2e-67    37.62         Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1
ASPG2_ARATH                         1.3e-66    35.12         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
APF2_ARATH                          4.5e-56    32.59         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
ASPG1_ARATH                         6.1e-53    29.89         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...

Match Name                          E-value    Identity (%)  Description
A0A0A0K8J2_CUCSA                    1.1e-213   76.06         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431320 PE=3 SV=1
M5WTF1_PRUPE                        2.3e-160   63.20         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005040mg PE=3 SV=1
A0A067EW14_CITSI                    6.8e-160   64.81         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011482mg PE=3 SV=1
B9RGP5_RICCO                        3.4e-159   68.75         Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1442500 ...
V4SPS2_9ROSI                        2.9e-158   64.94         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025395mg PE=3 SV=1

Match Name                          E-value    Identity (%)  Description
AT1G79720.1                         2.1e-144   56.33         Eukaryotic aspartyl protease family protein
AT5G10770.1                         1.6e-83    44.95         Eukaryotic aspartyl protease family protein
AT5G10760.1                         6.5e-69    37.62         Eukaryotic aspartyl protease family protein
AT3G20015.1                         7.2e-68    35.12         Eukaryotic aspartyl protease family protein
AT1G25510.1                         2.0e-62    33.47         Eukaryotic aspartyl protease family protein

Match Name                          E-value    Identity (%)  Description
gi|778728858|ref|XP_004135889.2|    1.5e-213   76.06         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus]
gi|659122560|ref|XP_008461208.1|    1.5e-213   76.52         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo]
gi|1000979824|ref|XP_015570839.1|   3.1e-166   63.35         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis]
gi|595931085|ref|XP_007215297.1|    3.4e-160   63.20         hypothetical protein PRUPE_ppa005040mg [Prunus persica]
gi|641836437|gb|KDO55402.1|         9.8e-160   64.81         hypothetical protein CISIN_1g011482mg [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: INTERPRO
Term        Definition
IPR021109   Peptidase_aspartic_dom_sf
IPR001969   Aspartic_peptidase_AS
IPR001461   Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g07170.1  Cp4.1LG12g07170.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                  Source   Source Term       Source Description   Alignment
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES   coord: 4..28, score: 3.5E-230; coord: 58..482, score: 3.5E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE         coord: 150..161, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10  -                    coord: 316..482, score: 2.3E-39; coord: 125..306, score: 1.9
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases       coord: 131..482, score: 1.59
None       No IPR available                 PANTHER  PTHR13683:SF263   SUBFAMILY NOT NAMED  coord: 58..482, score: 3.5E-230; coord: 4..28, score: 3.5E

The following gene(s) are paralogous to this gene:

None