Csa1G022490.1 (mRNA) Cucumber (Chinese Long) v2

Name: Csa1G022490.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Aspartic proteinase nepenthesin-1, putative; contains IPR001461 (Peptidase A1), IPR021109 (Aspartic peptidase)
Location: Chr1 : 2302493 .. 2304085 (+)
Sequence length: 1464
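
The location and sequence-length fields can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state); `parse_location` and `span_length` are illustrative names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr1 : 2302493 .. 2304085 (+)")
genomic_span = span_length(start, end)
transcript_length = 1464  # value of the sequence-length field

# Genomic span, non-transcript remainder, and number of codons.
print(genomic_span, genomic_span - transcript_length, transcript_length // 3)
```

Under that assumption the genomic span is 1593 nt, 129 nt longer than the 1464 nt listed for the feature. 1464 is a multiple of three (488 codons), which is consistent with a 487-residue protein followed by a stop codon.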
The following sequences are available for this feature:

Gene sequence (with intron)

ATCCAATAATTCATTATCTTTTCATTCCCAACATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCATACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCTACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCAAACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAACGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTATCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATTAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCCTCGTTGGATTCTCAACTAATAAATGTTAGTACCATAAACTTATAATAACTCATGTAACACCACGTGGGATTAGAATTTGGCTTTAATCACATTGTTCTAAGTCAATAAAATATTGTTGTTTATCAT

mRNA sequence

ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCATACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCTACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCAAACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAACGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTATCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATTAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCCTCGTTGGATTCTCAACTAATAAATGTTAG

Coding sequence (CDS)

ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCATACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCTACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCAAACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAACGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTATCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATTAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCCTCGTTGGATTCTCAACTAATAAATGTTAG

Protein sequence

MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC*
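
The relationship between the coding sequence and the protein sequence can be spot-checked by translating a few codons. This is a hedged sketch using the standard genetic code built from the conventional TCAG ordering; the `translate` helper is an illustrative name, and only the first ten and last four codons of the CDS above are used, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

cds_start = "ATGAACACTTCACTTTCCTCTGTTTTTCTC"  # first 30 nt of the CDS
cds_end = "AATAAATGTTAG"                      # last 12 nt of the CDS

print(translate(cds_start))  # MNTSLSSVFL
print(translate(cds_end))    # NKC*
```

Both fragments agree with the start (`MNTSLSSVFL`) and end (`NKC*`) of the protein sequence listed above.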
BLAST of Csa1G022490.1 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 3.1e-121
Identity = 233/498 (46.79%), Positives = 323/498 (64.86%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLP- 66
           S+   +T+   L      SR L+     T++ DV +S  Q    LS+ P      +  P 
Sbjct: 8   SLLAVVTLSLFLTTTDASSRSLSTPP-KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPE 67

Query: 67  ----------NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNG 126
                     +SP SL L+ R       +KDY +L  +RL RD++RV  +   +  ++ G
Sbjct: 68  SLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEG 127

Query: 127 --GTHFGESINESLI--GDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDV 186
              +      NE      + +T PVVSG S+GSG EY ++IGVG P K  YLV DTGSDV
Sbjct: 128 VDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSG-EYFSRIGVGTPAKEMYLVLDTGSDV 187

Query: 187 TWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYG 246
            W+QC+PCA    CY+Q DP+F+P SSS+Y  L+C++ QC LL+ + C S+ C+YQV YG
Sbjct: 188 NWIQCEPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYG 247

Query: 247 DGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS 306
           DGSFT GELAT+T++FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG +S+++Q+KA+
Sbjct: 248 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 307

Query: 307 SFSYCLVNLDSDSSSTLEFNS-NMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           SFSYCLV+ DS  SS+L+FNS  +     T+PL++N +  ++ YV + G SVGG+ + + 
Sbjct: 308 SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 367

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSP-APGISVFDTCYNF 426
              F++D SG GG+I+D GT ++RL +  Y SLR+AF+KLT +L   +  IS+FDTCY+F
Sbjct: 368 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 427

Query: 427 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 486
           S  S V+VPT+AF  + G SL LPA+NYLI +D +GT+C AF  T SSLSIIG+ QQQG 
Sbjct: 428 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGT 487

Query: 487 RVSYDLTNSLVGFSTNKC 488
           R++YDL+ +++G S NKC
Sbjct: 488 RITYDLSKNVIGLSGNKC 500
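
The percentages in each HSP summary line are simple ratios over the alignment length (for the hit above, 233 identical and 323 positive positions out of 498 aligned columns). The sketch below shows one way to parse such a line and recompute the values; the `parse_hsp_summary` name and the regular expression are assumptions made for illustration, and the pattern tolerates the "Postives" spelling that some exports of this page use.

```python
import re

HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return counts plus reported and recomputed percentages for one HSP line."""
    m = HSP_SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positives": pos,
        "alignment_length": ident_len,
        "identity_reported": float(ident_pct),
        "positives_reported": float(pos_pct),
        # Recomputed from the raw counts, rounded like the report.
        "identity_recomputed": round(100 * ident / ident_len, 2),
        "positives_recomputed": round(100 * pos / pos_len, 2),
    }

hsp = parse_hsp_summary(
    "Identity = 233/498 (46.79%), Positives = 323/498 (64.86%), Query Frame = 1"
)
print(hsp["identity_recomputed"], hsp["positives_recomputed"])
```

The same function can be applied to every summary line on the page to confirm that the reported percentages match the counts.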

BLAST of Csa1G022490.1 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 2.2e-95
Identity = 181/435 (41.61%), Positives = 270/435 (62.07%), Query Frame = 1

Query: 60  NHSHLPN---SPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGG 119
           N++H  +   S ++L L  R    + +Y++++  + AR+ RD  RV  + R +   +   
Sbjct: 47  NNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS 106

Query: 120 THFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 179
           +     +N+        + +VSG  +GSG EY  +IGVG P +  Y+V D+GSD+ W+QC
Sbjct: 107 SDSRYEVND------FGSDIVSGMDQGSG-EYFVRIGVGSPPRDQYMVIDSGSDMVWVQC 166

Query: 180 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 239
           QPC     CYKQ DP+FDP  S SY+ +SC S  C  ++ + C+S  C Y+V YGDGS+T
Sbjct: 167 QPC---KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYT 226

Query: 240 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS---F 299
            G LA ETL+F  +  + N+ +GCGH N G+F G AGL+G+GGG++S   QL   +   F
Sbjct: 227 KGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAF 286

Query: 300 SYCLVNLDSDSSSTLEFNSN-MPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 359
            YCLV+  +DS+ +L F    +P  +   PLV+N R  S+ YV + G+ VGG  +P+   
Sbjct: 287 GYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDG 346

Query: 360 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 419
            F++ E+G GG+++D+GT ++RLP+  Y + R+ F   T++L  A G+S+FDTCY+ SG 
Sbjct: 347 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGF 406

Query: 420 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 479
            +V VPT++F  +EG  L LPARN+L+ +D +GTYC AF  + + LSIIG+ QQ+GI+VS
Sbjct: 407 VSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVS 466

Query: 480 YDLTNSLVGFSTNKC 488
           +D  N  VGF  N C
Sbjct: 467 FDGANGFVGFGPNVC 470

BLAST of Csa1G022490.1 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 4.1e-89
Identity = 181/413 (43.83%), Positives = 253/413 (61.26%), Query Frame = 1

Query: 83  SYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSK 142
           S K  + L  +RL RD+ RV+ +   L   + G      ++  +      ++ VVSG S+
Sbjct: 84  SNKTPDELFSSRLQRDSRRVKSI-ATLAAQIPG-----RNVTHAPRPGGFSSSVVSGLSQ 143

Query: 143 GSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYS 202
           GSG EY  ++GVG P +  Y+V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+
Sbjct: 144 GSG-EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---RRCYSQSDPIFDPRKSKTYA 203

Query: 203 PLSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGC 262
            + C+S  C+ LD A CN+   TC+YQV YGDGSFT G+ +TETL+F   N +  + +GC
Sbjct: 204 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGC 263

Query: 263 GHDNEGLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDS--SSTLEFNSNMP 322
           GHDNEGLF G AGL+GLG G +S   Q        FSYCLV+  + S  SS +  N+ + 
Sbjct: 264 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS 323

Query: 323 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEIDESGLGGIIVDSGTIISR 382
             +  +PL+ N +  ++ YV ++GISVGG  +P ++ + F++D+ G GG+I+DSGT ++R
Sbjct: 324 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 383

Query: 383 LPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 442
           L    Y ++R+AF     +L  AP  S+FDTC++ S  + V+VPT+      G  + LPA
Sbjct: 384 LIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPA 443

Query: 443 RNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
            NYLI +DT G +C AF  T   LSIIG+ QQQG RV YDL +S VGF+   C
Sbjct: 444 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Csa1G022490.1 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 1.4e-68
Identity = 152/387 (39.28%), Positives = 223/387 (57.62%), Query Frame = 1

Query: 109 LERSLNGGTHFGESINESLIGDS-ITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDT 168
           LER++  G+   + +   L G S +   V +G       EYL  + +G P + F  + DT
Sbjct: 60  LERAIERGSRRLQRLEAMLNGPSGVETSVYAGDG-----EYLMNLSIGTPAQPFSAIMDT 119

Query: 169 GSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQ 228
           GSD+ W QCQPC     C+ Q  PIF+P+ SSS+S L C+SQ C+ L    C+++ C Y 
Sbjct: 120 GSDLIWTQCQPCTQ---CFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYT 179

Query: 229 VHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG-GAGLIGLGGGAISLSS 288
             YGDGS T G + TETL+FG S SIPN+  GCG +N+G   G GAGL+G+G G +SL S
Sbjct: 180 YGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPS 239

Query: 289 QLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSP---LVKNDRFHSYRYVKVVGISVG 348
           QL  + FSYC+  + S + S L   S   S +  SP   L+++ +  ++ Y+ + G+SVG
Sbjct: 240 QLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG 299

Query: 349 GKTLPISPTRFEID-ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISV 408
              LPI P+ F ++  +G GGII+DSGT ++   ++ Y+S+R+ F+   +        S 
Sbjct: 300 STRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSG 359

Query: 409 FDTCYNF-SGQSNVEVPTIAFVLS-EGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSI 468
           FD C+   S  SN+++PT  FV+  +G  L LP+ NY I   + G  CLA   +   +SI
Sbjct: 360 FDLCFQTPSDPSNLQIPT--FVMHFDGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSI 419

Query: 469 IGSFQQQGIRVSYDLTNSLVGFSTNKC 488
            G+ QQQ + V YD  NS+V F++ +C
Sbjct: 420 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Csa1G022490.1 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 3.4e-67
Identity = 156/406 (38.42%), Positives = 221/406 (54.43%), Query Frame = 1

Query: 86  DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145
           D++ ++R    RD ARV+ +   L ++         S NE     S   P  SG + GSG
Sbjct: 84  DHDEIIR----RDQARVESIYSKLSKN---------SANEVSEAKSTELPAKSGITLGSG 143

Query: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205
             Y+  IG+G P     LV DTGSD+TW QC+PC    +CY Q +P F+P SSS+Y  +S
Sbjct: 144 -NYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG--SCYSQKEPKFNPSSSSTYQNVS 203

Query: 206 CNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNE 265
           C+S  C+  D  +C++  C+Y + YGD SFT G LA E  +  NS+ + ++  GCG +N+
Sbjct: 204 CSSPMCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQ 263

Query: 266 GLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDSSSTLEFNSNMPSDSLT-S 325
           GLF G AGL+GLG G +SL +Q      + FSYCL +  S+S+  L F S   S+S+  +
Sbjct: 264 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 323

Query: 326 PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYE 385
           P+       +Y  + ++GISVG K L I+P  F  +     G I+DSGT+ +RLP+ VY 
Sbjct: 324 PISSFPSAFNYG-IDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYA 383

Query: 386 SLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIML 445
            LR  F +  SS     G  +FDTCY+F+G   V  PTIAF  +  T + L      + +
Sbjct: 384 ELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI 443

Query: 446 DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
             +   CLAF       +I G+ QQ  + V YD+    VGF+ N C
Sbjct: 444 KIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Csa1G022490.1 vs. TrEMBL
Match: A0A0A0LPJ3_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1)

HSP 1 Score: 969.9 bits (2506), Expect = 1.1e-279
Identity = 487/487 (100.00%), Positives = 487/487 (100.00%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG
Sbjct: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL
Sbjct: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300

Query: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480
           AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV
Sbjct: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of Csa1G022490.1 vs. TrEMBL
Match: A0A0A0LS14_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 9.2e-197
Identity = 353/490 (72.04%), Positives = 408/490 (83.27%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240

Query: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300

Query: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360

Query: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420

Query: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480

Query: 481 SLVGFSTNKC 488
           SLVGFST+KC
Sbjct: 481 SLVGFSTDKC 489

BLAST of Csa1G022490.1 vs. TrEMBL
Match: M5WPB5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 5.0e-142
Identity = 269/497 (54.12%), Positives = 350/497 (70.42%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQ----FPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKP---KPL- 66
           + FL+L I ++      FPS  SR L  S  +T++ DVSAS  QA D LS  P   KPL 
Sbjct: 4   TAFLYLAILSAFTLTSLFPSTHSRSL--SEETTTLLDVSASLTQAHDVLSFNPQTLKPLD 63

Query: 67  ------QNHSHLP-NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLER 126
                 Q H+  P NS FSL L PR ALHN  +KDY +LV++RL RD+ARV  L+  L+ 
Sbjct: 64  RQETQAQAHTLTPLNSSFSLQLLPRDALHNSQHKDYESLVQSRLGRDSARVNSLHTKLQL 123

Query: 127 SLNGGTHFG-ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSD 186
            +        E ++  +  + ++ PVVSG S+GSG EY  +IGVG P K  Y+V DTGSD
Sbjct: 124 VVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSG-EYFTRIGVGTPAKSLYMVLDTGSD 183

Query: 187 VTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY 246
           + WLQC+PC+    CY+Q DP+F+P  SS+Y P++C+S QC  L  + C +D C+YQV Y
Sbjct: 184 INWLQCEPCSD---CYQQTDPVFNPTGSSTYRPVTCDSAQCHSLHVSACRADKCLYQVSY 243

Query: 247 GDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA 306
           GDGS+T G+  TET+SFGNS +I N+ +GCGHDNEGLF G AGL+GLGGGA+SL SQ KA
Sbjct: 244 GDGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKA 303

Query: 307 SSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           +SFSYCLVN DS +SSTLEFNS  PSDS+T+PL+K+ R  ++ YV + G SVGG+ + + 
Sbjct: 304 TSFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVP 363

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFS 426
           P+ FE+DESG GGIIVDSGT I+RL ++ Y SLR+AF +LT  L  A G ++FDTCY+ S
Sbjct: 364 PSVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLS 423

Query: 427 GQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIR 486
            +S V+VPT++F+ ++G SL LPA+NYLI +D+AGT+C AF  T SS SIIG+ QQQG R
Sbjct: 424 SRSRVQVPTVSFLFADGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTR 483

Query: 487 VSYDLTNSLVGFSTNKC 488
           VSYDL N+ VGFS NKC
Sbjct: 484 VSYDLANNRVGFSPNKC 494

BLAST of Csa1G022490.1 vs. TrEMBL
Match: A0A0A0LPR7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.3e-139
Identity = 246/334 (73.65%), Positives = 287/334 (85.93%), Query Frame = 1

Query: 154 VGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKL 213
           VGQP +  + V DTGSDVTWLQC PCA +N CY+Q  PIFDP+ SSSY+P+SC+S+QC+L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 214 LDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAG 273
           LD+A CN ++CIY+V YGDGSFT GELATETL+F +SNSIPN+ IGCGHDNEGLF G  G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 274 LIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYR 333
           LIGLGGGAIS+SSQLKASSFSYCLV++DS S STL+FN++ PSDSL SPLVKNDRF S+R
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 334 YVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSS 393
           YVKV+G+SVGGK LPIS +RFEIDESGLGGIIVDSGT I++LPSDVYE LREAF+ LT++
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 394 LSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIK 453
           L PAP IS FDTCY+ S QSNVEVPTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 454 TKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
               LSIIG+FQQQGIRVSYDLTNSLVGFSTNKC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

BLAST of Csa1G022490.1 vs. TrEMBL
Match: A0A0A0LQD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 1.5e-138
Identity = 259/491 (52.75%), Positives = 341/491 (69.45%), Query Frame = 1

Query: 6   SSVFLFLTIFTSLQFPSILSRKLTP-SSYSTSIFDVSASTNQALDALSIKPKP---LQNH 65
           +S  LFL +F    FP  LSR  +  S +S++  DVSAS  QA   L   P      Q  
Sbjct: 5   TSNLLFLFLFFLSLFPFTLSRSSSHLSPHSSASLDVSASLQQANQVLKFDPTASISFQQQ 64

Query: 66  SHLPNS----PFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 125
            HL  S     FSL L+PR +LHN  +KDY +LV +RL+RD++RV+ +   LE +L+   
Sbjct: 65  VHLVPSNSSFSFSLQLHPRDSLHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELK 124

Query: 126 HFG-ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 185
               E +   ++ + ++ P++SG S+GSG EY +++GVGQP K FY+V DTGSD+ WLQC
Sbjct: 125 RSDLEPLKTEILPEDLSTPIISGTSQGSG-EYFSRVGVGQPAKPFYMVLDTGSDINWLQC 184

Query: 186 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 245
           QPC     CY+Q DPIFDP+SSSS++ L C SQQC+ L+ + C +  C+YQV YGDGSFT
Sbjct: 185 QPCTD---CYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFT 244

Query: 246 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYC 305
            GE   ETL+FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG++SL+SQ+KASSFSYC
Sbjct: 245 VGEFVIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYC 304

Query: 306 LVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEI 365
           LV+ DS SSS LEFNS  PSDS+ +PL+K+ +  ++ YV + G+SVGG+ L I P  F++
Sbjct: 305 LVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQM 364

Query: 366 DESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVE 425
           D+SG GGIIVDSGT I+RL +  Y +LR+AFV  T  L    G ++FDTCY+ S QS V 
Sbjct: 365 DDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVT 424

Query: 426 VPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLT 485
           +PT++F  + G SL+LP +NYLI +D+ GT+C AF  T SSLSIIG+ QQQG RV YDL 
Sbjct: 425 IPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLA 484

Query: 486 NSLVGFSTNKC 488
           NS+VGFS +KC
Sbjct: 485 NSVVGFSPHKC 491

BLAST of Csa1G022490.1 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 460.7 bits (1184), Expect = 1.1e-129
Identity = 233/485 (48.04%), Positives = 330/485 (68.04%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYST-SIFDVSASTNQALDALSIKPKPLQNHSHLP 66
           S F F+   TS    S+ SR L  +S +T SI +V+ S ++     S +    +  +H  
Sbjct: 6   SFFFFIFFLTS--HSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSA 65

Query: 67  NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINE 126
           +S FSL L+ R+++    + DY +L  ARL RD ARV+ L   L+ ++N  +        
Sbjct: 66  SSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPIS 125

Query: 127 SLIG---DSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASE 186
           ++       I AP++SG ++GSG EY  ++G+G+P +  Y+V DTGSDV WLQC PCA  
Sbjct: 126 TMYTTEEQDIEAPLISGTTQGSG-EYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCAD- 185

Query: 187 NTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELAT 246
             CY Q +PIF+P SSSSY PLSC++ QC  L+ + C + TC+Y+V YGDGS+T G+ AT
Sbjct: 186 --CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFAT 245

Query: 247 ETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDS 306
           ETL+ G S  + N+ +GCGH NEGLF G AGL+GLGGG ++L SQL  +SFSYCLV+ DS
Sbjct: 246 ETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDS 305

Query: 307 DSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLG 366
           DS+ST++F +++  D++ +PL++N +  ++ Y+ + GISVGG+ L I  + FE+DESG G
Sbjct: 306 DSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSG 365

Query: 367 GIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAF 426
           GII+DSGT ++RL +++Y SLR++FVK T  L  A G+++FDTCYN S ++ VEVPT+AF
Sbjct: 366 GIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAF 425

Query: 427 VLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGF 486
               G  L LPA+NY+I +D+ GT+CLAF  T SSL+IIG+ QQQG RV++DL NSL+GF
Sbjct: 426 HFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGF 483

Query: 487 STNKC 488
           S+NKC
Sbjct: 486 SSNKC 483

BLAST of Csa1G022490.1 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 436.8 bits (1122), Expect = 1.8e-122
Identity = 233/498 (46.79%), Positives = 323/498 (64.86%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLP- 66
           S+   +T+   L      SR L+     T++ DV +S  Q    LS+ P      +  P 
Sbjct: 8   SLLAVVTLSLFLTTTDASSRSLSTPP-KTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPE 67

Query: 67  ----------NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNG 126
                     +SP SL L+ R       +KDY +L  +RL RD++RV  +   +  ++ G
Sbjct: 68  SLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEG 127

Query: 127 --GTHFGESINESLI--GDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDV 186
              +      NE      + +T PVVSG S+GSG EY ++IGVG P K  YLV DTGSDV
Sbjct: 128 VDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSG-EYFSRIGVGTPAKEMYLVLDTGSDV 187

Query: 187 TWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYG 246
            W+QC+PCA    CY+Q DP+F+P SSS+Y  L+C++ QC LL+ + C S+ C+YQV YG
Sbjct: 188 NWIQCEPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYG 247

Query: 247 DGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS 306
           DGSFT GELAT+T++FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG +S+++Q+KA+
Sbjct: 248 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 307

Query: 307 SFSYCLVNLDSDSSSTLEFNS-NMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           SFSYCLV+ DS  SS+L+FNS  +     T+PL++N +  ++ YV + G SVGG+ + + 
Sbjct: 308 SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 367

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSP-APGISVFDTCYNF 426
              F++D SG GG+I+D GT ++RL +  Y SLR+AF+KLT +L   +  IS+FDTCY+F
Sbjct: 368 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 427

Query: 427 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 486
           S  S V+VPT+AF  + G SL LPA+NYLI +D +GT+C AF  T SSLSIIG+ QQQG 
Sbjct: 428 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGT 487

Query: 487 RVSYDLTNSLVGFSTNKC 488
           R++YDL+ +++G S NKC
Sbjct: 488 RITYDLSKNVIGLSGNKC 500

BLAST of Csa1G022490.1 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 350.9 bits (899), Expect = 1.3e-96
Identity = 181/435 (41.61%), Positives = 270/435 (62.07%), Query Frame = 1

Query: 60  NHSHLPN---SPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGG 119
           N++H  +   S ++L L  R    + +Y++++  + AR+ RD  RV  + R +   +   
Sbjct: 47  NNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS 106

Query: 120 THFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 179
           +     +N+        + +VSG  +GSG EY  +IGVG P +  Y+V D+GSD+ W+QC
Sbjct: 107 SDSRYEVND------FGSDIVSGMDQGSG-EYFVRIGVGSPPRDQYMVIDSGSDMVWVQC 166

Query: 180 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 239
           QPC     CYKQ DP+FDP  S SY+ +SC S  C  ++ + C+S  C Y+V YGDGS+T
Sbjct: 167 QPC---KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYT 226

Query: 240 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS---F 299
            G LA ETL+F  +  + N+ +GCGH N G+F G AGL+G+GGG++S   QL   +   F
Sbjct: 227 KGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAF 286

Query: 300 SYCLVNLDSDSSSTLEFNSN-MPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 359
            YCLV+  +DS+ +L F    +P  +   PLV+N R  S+ YV + G+ VGG  +P+   
Sbjct: 287 GYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDG 346

Query: 360 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 419
            F++ E+G GG+++D+GT ++RLP+  Y + R+ F   T++L  A G+S+FDTCY+ SG 
Sbjct: 347 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGF 406

Query: 420 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 479
            +V VPT++F  +EG  L LPARN+L+ +D +GTYC AF  + + LSIIG+ QQ+GI+VS
Sbjct: 407 VSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVS 466

Query: 480 YDLTNSLVGFSTNKC 488
           +D  N  VGF  N C
Sbjct: 467 FDGANGFVGFGPNVC 470

BLAST of Csa1G022490.1 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 339.3 bits (869), Expect = 3.8e-93
Identity = 211/504 (41.87%), Positives = 293/504 (58.13%), Query Frame = 1

Query: 1   MNTSLSSVF--LFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPL 60
           +NT   SVF  LF T   S Q+ +++   L PSS + S  +  + T+++L   S     L
Sbjct: 6   LNTLAFSVFAVLFFTSSASSQYQTLVVNTL-PSSATLSWPESESLTDESL---SESTTSL 65

Query: 61  QNH-SHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             H SH+             AL + S      L   RL RD+ RV+ +      S    T
Sbjct: 66  SVHLSHVD------------ALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVS----T 125

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
               +          +  V+SG S+GSG EY  ++GVG P    Y+V DTGSDV WLQC 
Sbjct: 126 GRNATKRTPRTAGGFSGAVISGLSQGSG-EYFMRLGVGTPATNVYMVLDTGSDVVWLQCS 185

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKAN-C---NSDTCIYQVHYGDG 240
           PC +   CY Q D IFDPK S +++ + C S+ C+ LD ++ C    S TC+YQV YGDG
Sbjct: 186 PCKA---CYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDG 245

Query: 241 SFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA--- 300
           SFT G+ +TETL+F  +  + ++P+GCGHDNEGLF G AGL+GLG G +S  SQ K    
Sbjct: 246 SFTEGDFSTETLTFHGAR-VDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYN 305

Query: 301 SSFSYCLVNLDSDSSS-----TLEF-NSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG 360
             FSYCLV+  S  SS     T+ F N+ +P  S+ +PL+ N +  ++ Y++++GISVGG
Sbjct: 306 GKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGG 365

Query: 361 KTLP-ISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVF 420
             +P +S ++F++D +G GG+I+DSGT ++RL    Y +LR+AF    + L  AP  S+F
Sbjct: 366 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLF 425

Query: 421 DTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGS 480
           DTC++ SG + V+VPT+ F    G  + LPA NYLI ++T G +C AF  T  SLSIIG+
Sbjct: 426 DTCFDLSGMTTVKVPTVVFHFG-GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGN 483

Query: 481 FQQQGIRVSYDLTNSLVGFSTNKC 488
            QQQG RV+YDL  S VGF +  C
Sbjct: 486 IQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Csa1G022490.1 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 330.1 bits (845), Expect = 2.3e-90
Identity = 181/413 (43.83%), Positives = 253/413 (61.26%), Query Frame = 1

Query: 83  SYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSK 142
           S K  + L  +RL RD+ RV+ +   L   + G      ++  +      ++ VVSG S+
Sbjct: 84  SNKTPDELFSSRLQRDSRRVKSI-ATLAAQIPG-----RNVTHAPRPGGFSSSVVSGLSQ 143

Query: 143 GSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYS 202
           GSG EY  ++GVG P +  Y+V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+
Sbjct: 144 GSG-EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---RRCYSQSDPIFDPRKSKTYA 203

Query: 203 PLSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGC 262
            + C+S  C+ LD A CN+   TC+YQV YGDGSFT G+ +TETL+F   N +  + +GC
Sbjct: 204 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGC 263

Query: 263 GHDNEGLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDS--SSTLEFNSNMP 322
           GHDNEGLF G AGL+GLG G +S   Q        FSYCLV+  + S  SS +  N+ + 
Sbjct: 264 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS 323

Query: 323 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEIDESGLGGIIVDSGTIISR 382
             +  +PL+ N +  ++ YV ++GISVGG  +P ++ + F++D+ G GG+I+DSGT ++R
Sbjct: 324 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 383

Query: 383 LPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 442
           L    Y ++R+AF     +L  AP  S+FDTC++ S  + V+VPT+      G  + LPA
Sbjct: 384 LIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPA 443

Query: 443 RNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 488
            NYLI +DT G +C AF  T   LSIIG+ QQQG RV YDL +S VGF+   C
Sbjct: 444 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Csa1G022490.1 vs. NCBI nr
Match: gi|449440933|ref|XP_004138238.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 969.9 bits (2506), Expect = 1.6e-279
Identity = 487/487 (100.00%), Positives = 487/487 (100.00%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG
Sbjct: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL
Sbjct: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300

Query: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480
           AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV
Sbjct: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of Csa1G022490.1 vs. NCBI nr
Match: gi|659106559|ref|XP_008453384.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo])

HSP 1 Score: 892.5 bits (2305), Expect = 3.3e-256
Identity = 450/487 (92.40%), Positives = 461/487 (94.66%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           M TSLSSVFLFLTIFTSLQF SILSRKLT S YSTSIFDV ASTNQAL+ALSIKPK LQ 
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLTQSPYSTSIFDVLASTNQALNALSIKPKHLQT 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNS  SLPLYPRL+LHNPSYKDY++LVRARL RDAARVQFLNRNLE SLNGG  FG
Sbjct: 61  HSHLPNSSLSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGKDFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           E  N SLIGDSITAPVVSGQSKGSGAEYLAQ+GVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 EVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           +EN CYKQ DPIFDPKSSSSY+PLSCNSQQC LLD+ NCNS TCIYQVHYGDGSFTTGEL
Sbjct: 181 TENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN+
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNM 300

Query: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKV+GISVGGKTLPIS TRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYN S QSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNLSSQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480
           AFVLS GTSLRLPARNYLI +DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV
Sbjct: 421 AFVLSGGTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of Csa1G022490.1 vs. NCBI nr
Match: gi|659106557|ref|XP_008453383.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 723.0 bits (1865), Expect = 3.5e-205
Identity = 365/488 (74.80%), Positives = 415/488 (85.04%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS   LFLTIFT LQFPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  NHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHF 120
            HS+  NSP SL L+PRL +HNPSYKDY TLVRARL R A RVQ LNR LE SLNG   F
Sbjct: 62  THSYHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAKQF 121

Query: 121 GESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPC 180
           G+ IN S   +S+TAPV SG S G G EY A+IGVGQPV+ F+LVPDTGSDVTWLQC+PC
Sbjct: 122 GKRINGSASTNSLTAPVTSGASHGDG-EYFARIGVGQPVQSFFLVPDTGSDVTWLQCKPC 181

Query: 181 ASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGE 240
           A+EN C+KQ DPIFDPKSSSSYS LSCNS+QC+LLD+A C+S++CIY+V YGDGSFT GE
Sbjct: 182 ANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIGE 241

Query: 241 LATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN 300
           LATETLSFGNSNSIPNLPIGCGHDNEGLF   AGLIGLGGGAISLSSQL+ASSFSYCLV+
Sbjct: 242 LATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLVD 301

Query: 301 LDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDES 360
           LDSDSSSTL+FN++ PSDSLTSPLVKN+RF S+RYVKV+G+SVGGK LPIS +RFEIDES
Sbjct: 302 LDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDES 361

Query: 361 GLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPT 420
           G GGIIVDSGT I++LPSDVY+ LR+AFV LT++L  APG+S FDTCY+ S QS+VEVP 
Sbjct: 362 GSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVPI 421

Query: 421 IAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSL 480
           IAF+L  G SL+LPA+N LI +D+AGT+CLAF+     LSIIG+ QQQGIRVSYDL NS+
Sbjct: 422 IAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNSI 481

Query: 481 VGFSTNKC 488
           VGF+TNKC
Sbjct: 482 VGFATNKC 488

BLAST of Csa1G022490.1 vs. NCBI nr
Match: gi|778664722|ref|XP_004138237.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 694.5 bits (1791), Expect = 1.3e-196
Identity = 353/490 (72.04%), Positives = 408/490 (83.27%), Query Frame = 1

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240

Query: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300

Query: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360

Query: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420

Query: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480

Query: 481 SLVGFSTNKC 488
           SLVGFST+KC
Sbjct: 481 SLVGFSTDKC 489

BLAST of Csa1G022490.1 vs. NCBI nr
Match: gi|645277963|ref|XP_008244017.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Prunus mume])

HSP 1 Score: 513.5 bits (1321), Expect = 4.2e-142
Identity = 270/497 (54.33%), Positives = 350/497 (70.42%), Query Frame = 1

Query: 7   SVFLFLTIFTSLQ----FPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKP---KPL- 66
           + FL+L IF++      FPS LSR L  S  ++++ DVSAS  QA   LS  P   KPL 
Sbjct: 4   TAFLYLAIFSAFTLTALFPSTLSRSL--SEETSTLLDVSASLTQAHVVLSFNPETLKPLD 63

Query: 67  ------QNHSHLP-NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLER 126
                 Q HS  P NS FSL L PR ALHN  +KDY +LV +RL RD+ARV  L+  L+ 
Sbjct: 64  RQETQAQTHSLTPLNSSFSLQLLPRDALHNSQHKDYESLVLSRLGRDSARVNSLHTKLQL 123

Query: 127 SLNGGTHFG-ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSD 186
           ++        E ++  +  + ++ PVVSG S+GSG EY  +IGVG P K  Y+V DTGSD
Sbjct: 124 AVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSG-EYFTRIGVGTPAKSLYMVLDTGSD 183

Query: 187 VTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY 246
           + WLQC+PC+    CY+Q DP+F+P  SS+Y P++C+S QC  L  + C +D C+YQV Y
Sbjct: 184 INWLQCEPCSD---CYQQTDPVFNPTGSSTYHPVTCDSAQCHSLHVSACRADKCLYQVSY 243

Query: 247 GDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA 306
           GDGS+T G+  TET+SFGNS +I N+ +GCGHDNEGLF G AGL+GLGGGA+SL SQ KA
Sbjct: 244 GDGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKA 303

Query: 307 SSFSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           +SFSYCLVN DS +SSTLEFNS  PSDS+T+PL+K+ R  ++ YV + G SVGG+ + + 
Sbjct: 304 TSFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVP 363

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFS 426
           P+ FE+DESG GGIIVDSGT I+RL ++ Y SLR+AF +LT  L  A G ++FDTCY+ S
Sbjct: 364 PSVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLS 423

Query: 427 GQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIR 486
            +S V+VPT++F+ + G SL LPA+NYLI +D+AGT+C AF  T SS SIIG+ QQQG R
Sbjct: 424 SRSRVQVPTVSFLFAGGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTR 483

Query: 487 VSYDLTNSLVGFSTNKC 488
           VSYDL N+ VGFS NKC
Sbjct: 484 VSYDLANNRVGFSANKC 494
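
The percentages in each HSP summary line follow directly from the reported counts: identical (or positive-scoring) positions divided by the alignment length, which includes gap columns. The following is a minimal sketch, not part of the original analysis; the function name `recompute_percentages` and the regular expression are illustrative assumptions. It re-derives the two percentages from a summary line so they can be checked against the values printed above.

```python
import re

# Pattern for the statistics line of an HSP summary as printed on this page.
# "Posi?tives" accepts "Positives" as well as the "Postives" spelling that
# some BLAST result renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, length_a, positives, length_b = (int(g) for g in match.groups())
    # Both ratios use the alignment length (gap columns included) as denominator.
    return (
        round(100 * identical / length_a, 2),
        round(100 * positives / length_b, 2),
    )


if __name__ == "__main__":
    example = "Identity = 211/504 (41.87%), Positives = 293/504 (58.13%), Query Frame = 1"
    print(recompute_percentages(example))
```

For the AT3G61820.1 hit, 211/504 and 293/504 give 41.87% and 58.13%, matching the reported values.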

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
ASPG1_ARATH  3.1e-121  46.79         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH  2.2e-95   41.61         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
APF2_ARATH   4.1e-89   43.83         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP1_NEPGR   1.4e-68   39.28         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
AED1_ARATH   3.4e-67   38.42         Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0LPJ3_CUCSA  1.1e-279  100.00        Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1
A0A0A0LS14_CUCSA  9.2e-197  72.04         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1
M5WPB5_PRUPE      5.0e-142  54.12         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1
A0A0A0LPR7_CUCSA  2.3e-139  73.65         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1
A0A0A0LQD2_CUCSA  1.5e-138  52.75         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT1G25510.1  1.1e-129  48.04         Eukaryotic aspartyl protease family protein
AT3G18490.1  1.8e-122  46.79         Eukaryotic aspartyl protease family protein
AT3G20015.1  1.3e-96   41.61         Eukaryotic aspartyl protease family protein
AT3G61820.1  3.8e-93   41.87         Eukaryotic aspartyl protease family protein
AT1G01300.1  2.3e-90   43.83         Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity (%)  Description
gi|449440933|ref|XP_004138238.1|  1.6e-279  100.00        PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|659106559|ref|XP_008453384.1|  3.3e-256  92.40         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo]
gi|659106557|ref|XP_008453383.1|  3.5e-205  74.80         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo]
gi|778664722|ref|XP_004138237.2|  1.3e-196  72.04         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|645277963|ref|XP_008244017.1|  4.2e-142  54.33         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Prunus mume]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR021109   Peptidase_aspartic_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0050896 response to stimulus
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Csa1G022490   Csa1G022490  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name   Unique Name            Type
Csa1G022490.1  Csa1G022490.1-protein  polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa1G022490.1.utr5p1  Csa1G022490.1.utr5p1  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Csa1G022490.1.cds1  Csa1G022490.1.cds1  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa1G022490.1.utr3p1  Csa1G022490.1.utr3p1  three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 59..487, score: 5.2E-193; coord: 2..24, score: 5.2E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10  -                                 coord: 319..487, score: 3.9E-43; coord: 135..307, score: 4.0
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 144..487, score: 1.6
None       No IPR available              PANTHER  PTHR13683:SF274   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 59..487, score: 5.2E-193; coord: 2..24, score: 5.2E