Csa1G022490 (gene) Cucumber (Chinese Long) v2

Name: Csa1G022490
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Aspartic proteinase nepenthesin-1, putative; contains IPR001461 (Peptidase A1), IPR021109 (Aspartic peptidase)
Location: Chr1 : 2302493 .. 2304085 (+)
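The location field can be read programmatically. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (the record does not state this) and using a hypothetical helper name, parse_location, that is not part of any database API.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr1 : 2302493 .. 2304085 (+)' into its parts.

    Hypothetical helper for illustration only.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1 : 2302493 .. 2304085 (+)")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 1593 positions on the plus strand of Chr1.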
The following sequences are available for this feature:

Gene sequence (with intron)

ATCCAATAATTCATTATCTTTTCATTCCCAACATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCATACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCTACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCAAACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAACGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTATCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATTAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCCTCGTTGGATTCTCAACTAATAAATGTTAGTACCATAAACTTATAATAACTCATGTAACACCACGTGGGATTAGAATTTGGCTTTAATCACATTGTTCTAAGTCAATAAAATATTGTTGTTTATCAT

mRNA sequence

ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCATACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCTACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCAAACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAACGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTATCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATTAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCCTCGTTGGATTCTCAACTAATAAATGTTAG

Coding sequence (CDS)

ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCATACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCTACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCAAACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAACGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTATCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATTAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCCTCGTTGGATTCTCAACTAATAAATGTTAG

Protein sequence

MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC*
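The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the names CODON_TABLE and translate are illustrative and do not come from the database. Only the first ten and last four codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First ten codons of the CDS listed above.
print(translate("ATGAACACTTCACTTTCCTCTGTTTTTCTC"))  # MNTSLSSVFL
# Last four codons of the CDS, including the stop codon.
print(translate("AATAAATGTTAG"))  # NKC*
```

Both fragments agree with the start (MNTSLSSVFL) and end (NKC*) of the listed protein sequence.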
BLAST of Csa1G022490 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 3.1e-121
Identity = 233/498 (46.79%), Positives = 323/498 (64.86%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLP- 66
           S+   +T+   L      SR L+     T++ DV +S  Q    LS+ P      +  P 
Sbjct: 8   SLLAVVTLSLFLTTTDASSRSLSTPP-KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPE 67

Query: 67  ----------NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNG 126
                     +SP SL L+ R       +KDY +L  +RL RD++RV  +   +  ++ G
Sbjct: 68  SLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEG 127

Query: 127 --GTHFGESINESLI--GDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDV 186
              +      NE      + +T PVVSG S+GSG EY ++IGVG P K  YLV DTGSDV
Sbjct: 128 VDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSG-EYFSRIGVGTPAKEMYLVLDTGSDV 187

Query: 187 TWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYG 246
            W+QC+PCA    CY+Q DP+F+P SSS+Y  L+C++ QC LL+ + C S+ C+YQV YG
Sbjct: 188 NWIQCEPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYG 247

Query: 247 DGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS 306
           DGSFT GELAT+T++FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG +S+++Q+KA+
Sbjct: 248 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 307

Query: 307 SFSYCLVNLDSDSSSTLEFNS-NMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           SFSYCLV+ DS  SS+L+FNS  +     T+PL++N +  ++ YV + G SVGG+ + + 
Sbjct: 308 SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 367

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSP-APGISVFDTCYNF 426
              F++D SG GG+I+D GT ++RL +  Y SLR+AF+KLT +L   +  IS+FDTCY+F
Sbjct: 368 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 427

Query: 427 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 486
           S  S V+VPT+AF  + G SL LPA+NYLI +D +GT+C AF  T SSLSIIG+ QQQG 
Sbjct: 428 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGT 487

Query: 487 RVSYDLTNSLVGFSTNKC 488
           R++YDL+ +++G S NKC
Sbjct: 488 RITYDLSKNVIGLSGNKC 500

BLAST of Csa1G022490 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 2.2e-95
Identity = 181/435 (41.61%), Positives = 270/435 (62.07%), Query Frame = 1

Query: 60  NHSHLPN---SPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGG 119
           N++H  +   S ++L L  R    + +Y++++  + AR+ RD  RV  + R +   +   
Sbjct: 47  NNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS 106

Query: 120 THFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 179
           +     +N+        + +VSG  +GSG EY  +IGVG P +  Y+V D+GSD+ W+QC
Sbjct: 107 SDSRYEVND------FGSDIVSGMDQGSG-EYFVRIGVGSPPRDQYMVIDSGSDMVWVQC 166

Query: 180 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 239
           QPC     CYKQ DP+FDP  S SY+ +SC S  C  ++ + C+S  C Y+V YGDGS+T
Sbjct: 167 QPC---KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYT 226

Query: 240 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS---F 299
            G LA ETL+F  +  + N+ +GCGH N G+F G AGL+G+GGG++S   QL   +   F
Sbjct: 227 KGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAF 286

Query: 300 SYCLVNLDSDSSSTLEFNSN-MPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 359
            YCLV+  +DS+ +L F    +P  +   PLV+N R  S+ YV + G+ VGG  +P+   
Sbjct: 287 GYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDG 346

Query: 360 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 419
            F++ E+G GG+++D+GT ++RLP+  Y + R+ F   T++L  A G+S+FDTCY+ SG 
Sbjct: 347 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGF 406

Query: 420 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 479
            +V VPT++F  +EG  L LPARN+L+ +D +GTYC AF  + + LSIIG+ QQ+GI+VS
Sbjct: 407 VSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVS 466

Query: 480 YDLTNSLVGFSTNKC 488
           +D  N  VGF  N C
Sbjct: 467 FDGANGFVGFGPNVC 470

BLAST of Csa1G022490 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 4.1e-89
Identity = 181/413 (43.83%), Positives = 253/413 (61.26%), Query Frame = 1

Query: 83  SYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSK 142
           S K  + L  +RL RD+ RV+ +   L   + G      ++  +      ++ VVSG S+
Sbjct: 84  SNKTPDELFSSRLQRDSRRVKSI-ATLAAQIPG-----RNVTHAPRPGGFSSSVVSGLSQ 143

Query: 143 GSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYS 202
           GSG EY  ++GVG P +  Y+V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+
Sbjct: 144 GSG-EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---RRCYSQSDPIFDPRKSKTYA 203

Query: 203 PLSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGC 262
            + C+S  C+ LD A CN+   TC+YQV YGDGSFT G+ +TETL+F   N +  + +GC
Sbjct: 204 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGC 263

Query: 263 GHDNEGLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDS--SSTLEFNSNMP 322
           GHDNEGLF G AGL+GLG G +S   Q        FSYCLV+  + S  SS +  N+ + 
Sbjct: 264 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS 323

Query: 323 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEIDESGLGGIIVDSGTIISR 382
             +  +PL+ N +  ++ YV ++GISVGG  +P ++ + F++D+ G GG+I+DSGT ++R
Sbjct: 324 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 383

Query: 383 LPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 442
           L    Y ++R+AF     +L  AP  S+FDTC++ S  + V+VPT+      G  + LPA
Sbjct: 384 LIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPA 443

Query: 443 RNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
            NYLI +DT G +C AF  T   LSIIG+ QQQG RV YDL +S VGF+   C
Sbjct: 444 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Csa1G022490 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 1.4e-68
Identity = 152/387 (39.28%), Positives = 223/387 (57.62%), Query Frame = 1

Query: 109 LERSLNGGTHFGESINESLIGDS-ITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDT 168
           LER++  G+   + +   L G S +   V +G       EYL  + +G P + F  + DT
Sbjct: 60  LERAIERGSRRLQRLEAMLNGPSGVETSVYAGDG-----EYLMNLSIGTPAQPFSAIMDT 119

Query: 169 GSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQ 228
           GSD+ W QCQPC     C+ Q  PIF+P+ SSS+S L C+SQ C+ L    C+++ C Y 
Sbjct: 120 GSDLIWTQCQPCTQ---CFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYT 179

Query: 229 VHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG-GAGLIGLGGGAISLSS 288
             YGDGS T G + TETL+FG S SIPN+  GCG +N+G   G GAGL+G+G G +SL S
Sbjct: 180 YGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPS 239

Query: 289 QLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSP---LVKNDRFHSYRYVKVVGISVG 348
           QL  + FSYC+  + S + S L   S   S +  SP   L+++ +  ++ Y+ + G+SVG
Sbjct: 240 QLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG 299

Query: 349 GKTLPISPTRFEID-ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISV 408
              LPI P+ F ++  +G GGII+DSGT ++   ++ Y+S+R+ F+   +        S 
Sbjct: 300 STRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSG 359

Query: 409 FDTCYNF-SGQSNVEVPTIAFVLS-EGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSI 468
           FD C+   S  SN+++PT  FV+  +G  L LP+ NY I   + G  CLA   +   +SI
Sbjct: 360 FDLCFQTPSDPSNLQIPT--FVMHFDGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSI 419

Query: 469 IGSFQQQGIRVSYDLTNSLVGFSTNKC 488
            G+ QQQ + V YD  NS+V F++ +C
Sbjct: 420 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Csa1G022490 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 3.4e-67
Identity = 156/406 (38.42%), Positives = 221/406 (54.43%), Query Frame = 1

Query: 86  DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145
           D++ ++R    RD ARV+ +   L ++         S NE     S   P  SG + GSG
Sbjct: 84  DHDEIIR----RDQARVESIYSKLSKN---------SANEVSEAKSTELPAKSGITLGSG 143

Query: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205
             Y+  IG+G P     LV DTGSD+TW QC+PC    +CY Q +P F+P SSS+Y  +S
Sbjct: 144 -NYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG--SCYSQKEPKFNPSSSSTYQNVS 203

Query: 206 CNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNE 265
           C+S  C+  D  +C++  C+Y + YGD SFT G LA E  +  NS+ + ++  GCG +N+
Sbjct: 204 CSSPMCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQ 263

Query: 266 GLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDSSSTLEFNSNMPSDSLT-S 325
           GLF G AGL+GLG G +SL +Q      + FSYCL +  S+S+  L F S   S+S+  +
Sbjct: 264 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 323

Query: 326 PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYE 385
           P+       +Y  + ++GISVG K L I+P  F  +     G I+DSGT+ +RLP+ VY 
Sbjct: 324 PISSFPSAFNYG-IDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYA 383

Query: 386 SLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIML 445
            LR  F +  SS     G  +FDTCY+F+G   V  PTIAF  +  T + L      + +
Sbjct: 384 ELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI 443

Query: 446 DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
             +   CLAF       +I G+ QQ  + V YD+    VGF+ N C
Sbjct: 444 KIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Csa1G022490 vs. TrEMBL
Match: A0A0A0LPJ3_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1)

HSP 1 Score: 969.9 bits (2506), Expect = 1.1e-279
Identity = 487/487 (100.00%), Positives = 487/487 (100.00%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG
Sbjct: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL
Sbjct: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300

Query: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480
           AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV
Sbjct: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of Csa1G022490 vs. TrEMBL
Match: A0A0A0LS14_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 9.2e-197
Identity = 353/490 (72.04%), Positives = 408/490 (83.27%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240

Query: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300

Query: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360

Query: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420

Query: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480

Query: 481 SLVGFSTNKC 488
           SLVGFST+KC
Sbjct: 481 SLVGFSTDKC 489

BLAST of Csa1G022490 vs. TrEMBL
Match: M5WPB5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 5.0e-142
Identity = 269/497 (54.12%), Positives = 350/497 (70.42%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQ----FPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKP---KPL- 66
           + FL+L I ++      FPS  SR L  S  +T++ DVSAS  QA D LS  P   KPL 
Sbjct: 4   TAFLYLAILSAFTLTSLFPSTHSRSL--SEETTTLLDVSASLTQAHDVLSFNPQTLKPLD 63

Query: 67  ------QNHSHLP-NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLER 126
                 Q H+  P NS FSL L PR ALHN  +KDY +LV++RL RD+ARV  L+  L+ 
Sbjct: 64  RQETQAQAHTLTPLNSSFSLQLLPRDALHNSQHKDYESLVQSRLGRDSARVNSLHTKLQL 123

Query: 127 SLNGGTHFG-ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSD 186
            +        E ++  +  + ++ PVVSG S+GSG EY  +IGVG P K  Y+V DTGSD
Sbjct: 124 VVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSG-EYFTRIGVGTPAKSLYMVLDTGSD 183

Query: 187 VTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY 246
           + WLQC+PC+    CY+Q DP+F+P  SS+Y P++C+S QC  L  + C +D C+YQV Y
Sbjct: 184 INWLQCEPCSD---CYQQTDPVFNPTGSSTYRPVTCDSAQCHSLHVSACRADKCLYQVSY 243

Query: 247 GDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA 306
           GDGS+T G+  TET+SFGNS +I N+ +GCGHDNEGLF G AGL+GLGGGA+SL SQ KA
Sbjct: 244 GDGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKA 303

Query: 307 SSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           +SFSYCLVN DS +SSTLEFNS  PSDS+T+PL+K+ R  ++ YV + G SVGG+ + + 
Sbjct: 304 TSFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVP 363

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFS 426
           P+ FE+DESG GGIIVDSGT I+RL ++ Y SLR+AF +LT  L  A G ++FDTCY+ S
Sbjct: 364 PSVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLS 423

Query: 427 GQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIR 486
            +S V+VPT++F+ ++G SL LPA+NYLI +D+AGT+C AF  T SS SIIG+ QQQG R
Sbjct: 424 SRSRVQVPTVSFLFADGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTR 483

Query: 487 VSYDLTNSLVGFSTNKC 488
           VSYDL N+ VGFS NKC
Sbjct: 484 VSYDLANNRVGFSPNKC 494

BLAST of Csa1G022490 vs. TrEMBL
Match: A0A0A0LPR7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.3e-139
Identity = 246/334 (73.65%), Positives = 287/334 (85.93%), Query Frame = 1

Query: 154 VGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKL 213
           VGQP +  + V DTGSDVTWLQC PCA +N CY+Q  PIFDP+ SSSY+P+SC+S+QC+L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 214 LDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAG 273
           LD+A CN ++CIY+V YGDGSFT GELATETL+F +SNSIPN+ IGCGHDNEGLF G  G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 274 LIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYR 333
           LIGLGGGAIS+SSQLKASSFSYCLV++DS S STL+FN++ PSDSL SPLVKNDRF S+R
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 334 YVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSS 393
           YVKV+G+SVGGK LPIS +RFEIDESGLGGIIVDSGT I++LPSDVYE LREAF+ LT++
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 394 LSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIK 453
           L PAP IS FDTCY+ S QSNVEVPTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 454 TKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
               LSIIG+FQQQGIRVSYDLTNSLVGFSTNKC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

BLAST of Csa1G022490 vs. TrEMBL
Match: A0A0A0LQD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 1.5e-138
Identity = 259/491 (52.75%), Positives = 341/491 (69.45%), Query Frame = 1

Query: 6   SSVFLFLTIFTSLQFPSILSRKLTP-SSYSTSIFDVSASTNQALDALSIKPKP---LQNH 65
           +S  LFL +F    FP  LSR  +  S +S++  DVSAS  QA   L   P      Q  
Sbjct: 5   TSNLLFLFLFFLSLFPFTLSRSSSHLSPHSSASLDVSASLQQANQVLKFDPTASISFQQQ 64

Query: 66  SHLPNS----PFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 125
            HL  S     FSL L+PR +LHN  +KDY +LV +RL+RD++RV+ +   LE +L+   
Sbjct: 65  VHLVPSNSSFSFSLQLHPRDSLHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELK 124

Query: 126 HFG-ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 185
               E +   ++ + ++ P++SG S+GSG EY +++GVGQP K FY+V DTGSD+ WLQC
Sbjct: 125 RSDLEPLKTEILPEDLSTPIISGTSQGSG-EYFSRVGVGQPAKPFYMVLDTGSDINWLQC 184

Query: 186 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 245
           QPC     CY+Q DPIFDP+SSSS++ L C SQQC+ L+ + C +  C+YQV YGDGSFT
Sbjct: 185 QPCTD---CYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFT 244

Query: 246 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYC 305
            GE   ETL+FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG++SL+SQ+KASSFSYC
Sbjct: 245 VGEFVIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYC 304

Query: 306 LVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEI 365
           LV+ DS SSS LEFNS  PSDS+ +PL+K+ +  ++ YV + G+SVGG+ L I P  F++
Sbjct: 305 LVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQM 364

Query: 366 DESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVE 425
           D+SG GGIIVDSGT I+RL +  Y +LR+AFV  T  L    G ++FDTCY+ S QS V 
Sbjct: 365 DDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVT 424

Query: 426 VPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLT 485
           +PT++F  + G SL+LP +NYLI +D+ GT+C AF  T SSLSIIG+ QQQG RV YDL 
Sbjct: 425 IPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLA 484

Query: 486 NSLVGFSTNKC 488
           NS+VGFS +KC
Sbjct: 485 NSVVGFSPHKC 491

BLAST of Csa1G022490 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 460.7 bits (1184), Expect = 1.1e-129
Identity = 233/485 (48.04%), Positives = 330/485 (68.04%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYST-SIFDVSASTNQALDALSIKPKPLQNHSHLP 66
           S F F+   TS    S+ SR L  +S +T SI +V+ S ++     S +    +  +H  
Sbjct: 6   SFFFFIFFLTS--HSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSA 65

Query: 67  NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINE 126
           +S FSL L+ R+++    + DY +L  ARL RD ARV+ L   L+ ++N  +        
Sbjct: 66  SSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPIS 125

Query: 127 SLIG---DSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASE 186
           ++       I AP++SG ++GSG EY  ++G+G+P +  Y+V DTGSDV WLQC PCA  
Sbjct: 126 TMYTTEEQDIEAPLISGTTQGSG-EYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCAD- 185

Query: 187 NTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELAT 246
             CY Q +PIF+P SSSSY PLSC++ QC  L+ + C + TC+Y+V YGDGS+T G+ AT
Sbjct: 186 --CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFAT 245

Query: 247 ETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDS 306
           ETL+ G S  + N+ +GCGH NEGLF G AGL+GLGGG ++L SQL  +SFSYCLV+ DS
Sbjct: 246 ETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDS 305

Query: 307 DSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLG 366
           DS+ST++F +++  D++ +PL++N +  ++ Y+ + GISVGG+ L I  + FE+DESG G
Sbjct: 306 DSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSG 365

Query: 367 GIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAF 426
           GII+DSGT ++RL +++Y SLR++FVK T  L  A G+++FDTCYN S ++ VEVPT+AF
Sbjct: 366 GIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAF 425

Query: 427 VLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGF 486
               G  L LPA+NY+I +D+ GT+CLAF  T SSL+IIG+ QQQG RV++DL NSL+GF
Sbjct: 426 HFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGF 483

Query: 487 STNKC 488
           S+NKC
Sbjct: 486 SSNKC 483

BLAST of Csa1G022490 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 436.8 bits (1122), Expect = 1.8e-122
Identity = 233/498 (46.79%), Positives = 323/498 (64.86%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLP- 66
           S+   +T+   L      SR L+     T++ DV +S  Q    LS+ P      +  P 
Sbjct: 8   SLLAVVTLSLFLTTTDASSRSLSTPP-KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPE 67

Query: 67  ----------NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNG 126
                     +SP SL L+ R       +KDY +L  +RL RD++RV  +   +  ++ G
Sbjct: 68  SLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEG 127

Query: 127 --GTHFGESINESLI--GDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDV 186
              +      NE      + +T PVVSG S+GSG EY ++IGVG P K  YLV DTGSDV
Sbjct: 128 VDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSG-EYFSRIGVGTPAKEMYLVLDTGSDV 187

Query: 187 TWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYG 246
            W+QC+PCA    CY+Q DP+F+P SSS+Y  L+C++ QC LL+ + C S+ C+YQV YG
Sbjct: 188 NWIQCEPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYG 247

Query: 247 DGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS 306
           DGSFT GELAT+T++FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG +S+++Q+KA+
Sbjct: 248 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 307

Query: 307 SFSYCLVNLDSDSSSTLEFNS-NMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           SFSYCLV+ DS  SS+L+FNS  +     T+PL++N +  ++ YV + G SVGG+ + + 
Sbjct: 308 SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 367

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSP-APGISVFDTCYNF 426
              F++D SG GG+I+D GT ++RL +  Y SLR+AF+KLT +L   +  IS+FDTCY+F
Sbjct: 368 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 427

Query: 427 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 486
           S  S V+VPT+AF  + G SL LPA+NYLI +D +GT+C AF  T SSLSIIG+ QQQG 
Sbjct: 428 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGT 487

Query: 487 RVSYDLTNSLVGFSTNKC 488
           R++YDL+ +++G S NKC
Sbjct: 488 RITYDLSKNVIGLSGNKC 500

BLAST of Csa1G022490 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 350.9 bits (899), Expect = 1.3e-96
Identity = 181/435 (41.61%), Positives = 270/435 (62.07%), Query Frame = 1

Query: 60  NHSHLPN---SPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGG 119
           N++H  +   S ++L L  R    + +Y++++  + AR+ RD  RV  + R +   +   
Sbjct: 47  NNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS 106

Query: 120 THFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 179
           +     +N+        + +VSG  +GSG EY  +IGVG P +  Y+V D+GSD+ W+QC
Sbjct: 107 SDSRYEVND------FGSDIVSGMDQGSG-EYFVRIGVGSPPRDQYMVIDSGSDMVWVQC 166

Query: 180 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 239
           QPC     CYKQ DP+FDP  S SY+ +SC S  C  ++ + C+S  C Y+V YGDGS+T
Sbjct: 167 QPC---KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYT 226

Query: 240 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS---F 299
            G LA ETL+F  +  + N+ +GCGH N G+F G AGL+G+GGG++S   QL   +   F
Sbjct: 227 KGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAF 286

Query: 300 SYCLVNLDSDSSSTLEFNSN-MPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 359
            YCLV+  +DS+ +L F    +P  +   PLV+N R  S+ YV + G+ VGG  +P+   
Sbjct: 287 GYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDG 346

Query: 360 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 419
            F++ E+G GG+++D+GT ++RLP+  Y + R+ F   T++L  A G+S+FDTCY+ SG 
Sbjct: 347 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGF 406

Query: 420 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 479
            +V VPT++F  +EG  L LPARN+L+ +D +GTYC AF  + + LSIIG+ QQ+GI+VS
Sbjct: 407 VSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVS 466

Query: 480 YDLTNSLVGFSTNKC 488
           +D  N  VGF  N C
Sbjct: 467 FDGANGFVGFGPNVC 470

BLAST of Csa1G022490 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 339.3 bits (869), Expect = 3.8e-93
Identity = 211/504 (41.87%), Positives = 293/504 (58.13%), Query Frame = 1

Query: 1   MNTSLSSVF--LFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPL 60
           +NT   SVF  LF T   S Q+ +++   L PSS + S  +  + T+++L   S     L
Sbjct: 6   LNTLAFSVFAVLFFTSSASSQYQTLVVNTL-PSSATLSWPESESLTDESL---SESTTSL 65

Query: 61  QNH-SHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             H SH+             AL + S      L   RL RD+ RV+ +      S    T
Sbjct: 66  SVHLSHVD------------ALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVS----T 125

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
               +          +  V+SG S+GSG EY  ++GVG P    Y+V DTGSDV WLQC 
Sbjct: 126 GRNATKRTPRTAGGFSGAVISGLSQGSG-EYFMRLGVGTPATNVYMVLDTGSDVVWLQCS 185

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKAN-C---NSDTCIYQVHYGDG 240
           PC +   CY Q D IFDPK S +++ + C S+ C+ LD ++ C    S TC+YQV YGDG
Sbjct: 186 PCKA---CYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDG 245

Query: 241 SFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA--- 300
           SFT G+ +TETL+F  +  + ++P+GCGHDNEGLF G AGL+GLG G +S  SQ K    
Sbjct: 246 SFTEGDFSTETLTFHGAR-VDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYN 305

Query: 301 SSFSYCLVNLDSDSSS-----TLEF-NSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG 360
             FSYCLV+  S  SS     T+ F N+ +P  S+ +PL+ N +  ++ Y++++GISVGG
Sbjct: 306 GKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGG 365

Query: 361 KTLP-ISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVF 420
             +P +S ++F++D +G GG+I+DSGT ++RL    Y +LR+AF    + L  AP  S+F
Sbjct: 366 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLF 425

Query: 421 DTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGS 480
           DTC++ SG + V+VPT+ F    G  + LPA NYLI ++T G +C AF  T  SLSIIG+
Sbjct: 426 DTCFDLSGMTTVKVPTVVFHFG-GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGN 483

Query: 481 FQQQGIRVSYDLTNSLVGFSTNKC 488
            QQQG RV+YDL  S VGF +  C
Sbjct: 486 IQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Csa1G022490 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 330.1 bits (845), Expect = 2.3e-90
Identity = 181/413 (43.83%), Positives = 253/413 (61.26%), Query Frame = 1

Query: 83  SYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSK 142
           S K  + L  +RL RD+ RV+ +   L   + G      ++  +      ++ VVSG S+
Sbjct: 84  SNKTPDELFSSRLQRDSRRVKSI-ATLAAQIPG-----RNVTHAPRPGGFSSSVVSGLSQ 143

Query: 143 GSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYS 202
           GSG EY  ++GVG P +  Y+V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+
Sbjct: 144 GSG-EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---RRCYSQSDPIFDPRKSKTYA 203

Query: 203 PLSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGC 262
            + C+S  C+ LD A CN+   TC+YQV YGDGSFT G+ +TETL+F   N +  + +GC
Sbjct: 204 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGC 263

Query: 263 GHDNEGLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDS--SSTLEFNSNMP 322
           GHDNEGLF G AGL+GLG G +S   Q        FSYCLV+  + S  SS +  N+ + 
Sbjct: 264 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS 323

Query: 323 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEIDESGLGGIIVDSGTIISR 382
             +  +PL+ N +  ++ YV ++GISVGG  +P ++ + F++D+ G GG+I+DSGT ++R
Sbjct: 324 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 383

Query: 383 LPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 442
           L    Y ++R+AF     +L  AP  S+FDTC++ S  + V+VPT+      G  + LPA
Sbjct: 384 LIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPA 443

Query: 443 RNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
            NYLI +DT G +C AF  T   LSIIG+ QQQG RV YDL +S VGF+   C
Sbjct: 444 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Csa1G022490 vs. NCBI nr
Match: gi|449440933|ref|XP_004138238.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 969.9 bits (2506), Expect = 1.6e-279
Identity = 487/487 (100.00%), Positives = 487/487 (100.00%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG
Sbjct: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL
Sbjct: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300

Query: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480
           AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV
Sbjct: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of Csa1G022490 vs. NCBI nr
Match: gi|659106559|ref|XP_008453384.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo])

HSP 1 Score: 892.5 bits (2305), Expect = 3.3e-256
Identity = 450/487 (92.40%), Positives = 461/487 (94.66%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           M TSLSSVFLFLTIFTSLQF SILSRKLT S YSTSIFDV ASTNQAL+ALSIKPK LQ 
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLTQSPYSTSIFDVLASTNQALNALSIKPKHLQT 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNS  SLPLYPRL+LHNPSYKDY++LVRARL RDAARVQFLNRNLE SLNGG  FG
Sbjct: 61  HSHLPNSSLSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGKDFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           E  N SLIGDSITAPVVSGQSKGSGAEYLAQ+GVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 EVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           +EN CYKQ DPIFDPKSSSSY+PLSCNSQQC LLD+ NCNS TCIYQVHYGDGSFTTGEL
Sbjct: 181 TENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN+
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNM 300

Query: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKV+GISVGGKTLPIS TRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYN S QSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNLSSQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480
           AFVLS GTSLRLPARNYLI +DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV
Sbjct: 421 AFVLSGGTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of Csa1G022490 vs. NCBI nr
Match: gi|659106557|ref|XP_008453383.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 723.0 bits (1865), Expect = 3.5e-205
Identity = 365/488 (74.80%), Positives = 415/488 (85.04%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS   LFLTIFT LQFPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  NHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHF 120
            HS+  NSP SL L+PRL +HNPSYKDY TLVRARL R A RVQ LNR LE SLNG   F
Sbjct: 62  THSYHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAKQF 121

Query: 121 GESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPC 180
           G+ IN S   +S+TAPV SG S G G EY A+IGVGQPV+ F+LVPDTGSDVTWLQC+PC
Sbjct: 122 GKRINGSASTNSLTAPVTSGASHGDG-EYFARIGVGQPVQSFFLVPDTGSDVTWLQCKPC 181

Query: 181 ASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGE 240
           A+EN C+KQ DPIFDPKSSSSYS LSCNS+QC+LLD+A C+S++CIY+V YGDGSFT GE
Sbjct: 182 ANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIGE 241

Query: 241 LATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN 300
           LATETLSFGNSNSIPNLPIGCGHDNEGLF   AGLIGLGGGAISLSSQL+ASSFSYCLV+
Sbjct: 242 LATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLVD 301

Query: 301 LDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDES 360
           LDSDSSSTL+FN++ PSDSLTSPLVKN+RF S+RYVKV+G+SVGGK LPIS +RFEIDES
Sbjct: 302 LDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDES 361

Query: 361 GLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPT 420
           G GGIIVDSGT I++LPSDVY+ LR+AFV LT++L  APG+S FDTCY+ S QS+VEVP 
Sbjct: 362 GSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVPI 421

Query: 421 IAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSL 480
           IAF+L  G SL+LPA+N LI +D+AGT+CLAF+     LSIIG+ QQQGIRVSYDL NS+
Sbjct: 422 IAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNSI 481

Query: 481 VGFSTNKC 488
           VGF+TNKC
Sbjct: 482 VGFATNKC 488

BLAST of Csa1G022490 vs. NCBI nr
Match: gi|778664722|ref|XP_004138237.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 694.5 bits (1791), Expect = 1.3e-196
Identity = 353/490 (72.04%), Positives = 408/490 (83.27%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240

Query: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300

Query: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360

Query: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420

Query: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480

Query: 481 SLVGFSTNKC 488
           SLVGFST+KC
Sbjct: 481 SLVGFSTDKC 489

BLAST of Csa1G022490 vs. NCBI nr
Match: gi|645277963|ref|XP_008244017.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Prunus mume])

HSP 1 Score: 513.5 bits (1321), Expect = 4.2e-142
Identity = 270/497 (54.33%), Positives = 350/497 (70.42%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQ----FPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKP---KPL- 66
           + FL+L IF++      FPS LSR L  S  ++++ DVSAS  QA   LS  P   KPL 
Sbjct: 4   TAFLYLAIFSAFTLTALFPSTLSRSL--SEETSTLLDVSASLTQAHVVLSFNPETLKPLD 63

Query: 67  ------QNHSHLP-NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLER 126
                 Q HS  P NS FSL L PR ALHN  +KDY +LV +RL RD+ARV  L+  L+ 
Sbjct: 64  RQETQAQTHSLTPLNSSFSLQLLPRDALHNSQHKDYESLVLSRLGRDSARVNSLHTKLQL 123

Query: 127 SLNGGTHFG-ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSD 186
           ++        E ++  +  + ++ PVVSG S+GSG EY  +IGVG P K  Y+V DTGSD
Sbjct: 124 AVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSG-EYFTRIGVGTPAKSLYMVLDTGSD 183

Query: 187 VTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY 246
           + WLQC+PC+    CY+Q DP+F+P  SS+Y P++C+S QC  L  + C +D C+YQV Y
Sbjct: 184 INWLQCEPCSD---CYQQTDPVFNPTGSSTYHPVTCDSAQCHSLHVSACRADKCLYQVSY 243

Query: 247 GDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA 306
           GDGS+T G+  TET+SFGNS +I N+ +GCGHDNEGLF G AGL+GLGGGA+SL SQ KA
Sbjct: 244 GDGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKA 303

Query: 307 SSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           +SFSYCLVN DS +SSTLEFNS  PSDS+T+PL+K+ R  ++ YV + G SVGG+ + + 
Sbjct: 304 TSFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVP 363

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFS 426
           P+ FE+DESG GGIIVDSGT I+RL ++ Y SLR+AF +LT  L  A G ++FDTCY+ S
Sbjct: 364 PSVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLS 423

Query: 427 GQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIR 486
            +S V+VPT++F+ + G SL LPA+NYLI +D+AGT+C AF  T SS SIIG+ QQQG R
Sbjct: 424 SRSRVQVPTVSFLFAGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTR 483

Query: 487 VSYDLTNSLVGFSTNKC 488
           VSYDL N+ VGFS NKC
Sbjct: 484 VSYDLANNRVGFSANKC 494

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
ASPG1_ARATH  3.1e-121  46.79     Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH  2.2e-95   41.61     Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
APF2_ARATH   4.1e-89   43.83     Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP1_NEPGR   1.4e-68   39.28     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
AED1_ARATH   3.4e-67   38.42     Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1
Match Name        E-value   Identity  Description
A0A0A0LPJ3_CUCSA  1.1e-279  100.00    Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1
A0A0A0LS14_CUCSA  9.2e-197  72.04     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1
M5WPB5_PRUPE      5.0e-142  54.12     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1
A0A0A0LPR7_CUCSA  2.3e-139  73.65     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1
A0A0A0LQD2_CUCSA  1.5e-138  52.75     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1
Match Name   E-value   Identity  Description
AT1G25510.1  1.1e-129  48.04     Eukaryotic aspartyl protease family protein
AT3G18490.1  1.8e-122  46.79     Eukaryotic aspartyl protease family protein
AT3G20015.1  1.3e-96   41.61     Eukaryotic aspartyl protease family protein
AT3G61820.1  3.8e-93   41.87     Eukaryotic aspartyl protease family protein
AT1G01300.1  2.3e-90   43.83     Eukaryotic aspartyl protease family protein
Match Name                        E-value   Identity  Description
gi|449440933|ref|XP_004138238.1|  1.6e-279  100.00    PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|659106559|ref|XP_008453384.1|  3.3e-256  92.40     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo]
gi|659106557|ref|XP_008453383.1|  3.5e-205  74.80     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo]
gi|778664722|ref|XP_004138237.2|  1.3e-196  72.04     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|645277963|ref|XP_008244017.1|  4.2e-142  54.33     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Prunus mume]
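
The identity and positives percentages reported in the HSP header lines above are the raw match counts divided by the alignment length. The following is a minimal Python sketch for recomputing them from such a line. The function name `hsp_percentages` and the regular expression are illustrative assumptions and are not part of this database or of BLAST itself.

```python
import re

# Matches the statistics line of an HSP header. "Posi?tives" also accepts
# the misspelling "Postives" that appears in some report renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (round(100 * identical / ident_len, 2),
            round(100 * positive / pos_len, 2))


# Statistics line of the AT3G61820.1 hit reported above.
line = "Identity = 211/504 (41.87%), Positives = 293/504 (58.13%), Query Frame = 1"
print(hsp_percentages(line))  # (41.87, 58.13)
```

The recomputed values agree with the percentages printed in the report, so the Identity column in the summary tables can be cross-checked against the per-hit alignments in the same way.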
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR021109  Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0050896 response to stimulus
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU094048      cucumber EST collection version 3.0  transcribed_cluster
CU116870      cucumber EST collection version 3.0  transcribed_cluster
CU140878      cucumber EST collection version 3.0  transcribed_cluster
CU147651      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G022490.1  Csa1G022490.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU147651      CU147651     transcribed_cluster
CU140878      CU140878     transcribed_cluster
CU094048      CU094048     transcribed_cluster
CU116870      CU116870     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 59..487, score: 5.2E-193; coord: 2..24, score: 5.2E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10  -                                 coord: 319..487, score: 3.9E-43; coord: 135..307, score: 4.0
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 144..487, score: 1.6
None       No IPR available              PANTHER  PTHR13683:SF274   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 59..487, score: 5.2E-193; coord: 2..24, score: 5.2E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene         Organism                      Block
Csa1G022490  Cucurbita moschata (Rifu)     cmocuB076
Csa1G022490  Cucurbita moschata (Rifu)     cmocuB050
Csa1G022490  Cucurbita moschata (Rifu)     cmocuB207
Csa1G022490  Cucurbita moschata (Rifu)     cmocuB389
Csa1G022490  Cucurbita moschata (Rifu)     cmocuB679
Csa1G022490  Melon (DHL92) v3.5.1          cumeB010
Csa1G022490  Melon (DHL92) v3.5.1          cumeB033
Csa1G022490  Melon (DHL92) v3.5.1          cumeB055
Csa1G022490  Watermelon (Charleston Gray)  cuwcgB058
Csa1G022490  Watermelon (Charleston Gray)  cuwcgB063
Csa1G022490  Watermelon (Charleston Gray)  cuwcgB082
Csa1G022490  Watermelon (97103) v1         cuwmB039
Csa1G022490  Watermelon (97103) v1         cuwmB063
Csa1G022490  Watermelon (97103) v1         cuwmB095
Csa1G022490  Cucurbita pepo (Zucchini)     cpecuB024
Csa1G022490  Cucurbita pepo (Zucchini)     cpecuB352
Csa1G022490  Cucurbita pepo (Zucchini)     cpecuB381
Csa1G022490  Cucurbita pepo (Zucchini)     cpecuB509
Csa1G022490  Cucurbita pepo (Zucchini)     cpecuB654
Csa1G022490  Bottle gourd (USVL1VR-Ls)     culsiB025
Csa1G022490  Bottle gourd (USVL1VR-Ls)     culsiB054
Csa1G022490  Bottle gourd (USVL1VR-Ls)     culsiB071
Csa1G022490  Cucumber (Gy14) v2            cgybcuB001
Csa1G022490  Cucumber (Gy14) v2            cgybcuB005
Csa1G022490  Cucumber (Gy14) v2            cgybcuB196
Csa1G022490  Melon (DHL92) v3.6.1          cumedB005
Csa1G022490  Melon (DHL92) v3.6.1          cumedB028
Csa1G022490  Melon (DHL92) v3.6.1          cumedB047
Csa1G022490  Silver-seed gourd             carcuB0416
Csa1G022490  Silver-seed gourd             carcuB0491
Csa1G022490  Silver-seed gourd             carcuB0543
Csa1G022490  Silver-seed gourd             carcuB0622
Csa1G022490  Silver-seed gourd             carcuB0856
Csa1G022490  Silver-seed gourd             carcuB1056
Csa1G022490  Watermelon (97103) v2         cuwmbB055
Csa1G022490  Watermelon (97103) v2         cuwmbB061
Csa1G022490  Watermelon (97103) v2         cuwmbB079
Csa1G022490  Wax gourd                     cuwgoB012
Csa1G022490  Wax gourd                     cuwgoB040
Csa1G022490  Wax gourd                     cuwgoB071
Csa1G022490  Cucumber (Chinese Long) v2    cucuB000
Csa1G022490  Cucumber (Chinese Long) v2    cucuB030
Csa1G022490  Cucumber (Gy14) v1            cgycuB299
Csa1G022490  Cucumber (Gy14) v1            cgycuB405
Csa1G022490  Cucurbita maxima (Rimu)       cmacuB061
Csa1G022490  Cucurbita maxima (Rimu)       cmacuB087
Csa1G022490  Cucurbita maxima (Rimu)       cmacuB221
Csa1G022490  Cucurbita maxima (Rimu)       cmacuB393
Csa1G022490  Cucurbita maxima (Rimu)       cmacuB404
Csa1G022490  Cucurbita maxima (Rimu)       cmacuB686