CsGy1G003630.1 (mRNA) Cucumber (Gy14) v2

NameCsGy1G003630.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionprotein ASPARTIC PROTEASE IN GUARD CELL 1-like
LocationChr1 : 2300670 .. 2302133 (+)
Sequence length1464
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCTTACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCCACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCATACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAATGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTCTCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATCAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCATCGTTGGATTCTCAACTAATAAATGTTAG

mRNA sequence

ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCTTACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCCACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCATACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAATGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTCTCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATCAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCATCGTTGGATTCTCAACTAATAAATGTTAG

Coding sequence (CDS)

ATGAACACTTCACTTTCCTCTGTTTTTCTCTTCCTAACAATCTTCACTTCCCTTCAATTCCCTTCAATTCTCTCTCGCAAGTTAACACCATCTTCCTATTCCACTTCCATCTTCGATGTCTCTGCCTCCACAAACCAAGCCCTAGATGCCCTCTCCATTAAACCCAAACCTCTTCAAAATCATTCTCACCTTCCAAATTCCCCTTTCTCTCTGCCATTGTACCCTAGATTGGCCCTTCATAACCCTTCTTACAAGGACTACAATACCCTTGTTAGGGCCCGACTCACTCGTGATGCCGCTCGAGTTCAATTCCTTAACCGAAATCTTGAGCGCTCTTTAAATGGGGGTACTCATTTTGGTGAAAGTATTAATGAATCTCTAATTGGAGATTCAATTACTGCTCCGGTTGTTTCGGGGCAAAGTAAAGGGAGTGGTGCTGAGTATTTAGCTCAGATTGGGGTTGGTCAGCCTGTGAAGTTGTTTTATTTGGTGCCTGATACTGGTAGCGATGTCACGTGGCTTCAATGTCAACCTTGTGCTAGTGAGAATACTTGTTATAAACAATTTGACCCTATTTTCGATCCGAAATCATCGTCTTCTTACAGTCCTCTGTCTTGCAATTCTCAACAATGTAAGTTGCTAGATAAAGCCAATTGCAATTCCGACACATGCATATACCAAGTCCACTACGGTGATGGATCATTCACAACTGGTGAACTCGCCACCGAAACACTATCGTTTGGAAATTCAAATTCTATTCCTAATCTCCCAATTGGTTGTGGGCATGACAATGAAGGCTTGTTTGCTGGTGGAGCTGGTTTAATAGGCCTCGGTGGTGGGGCCATTTCCCTTTCTTCCCAATTAAAAGCGTCATCATTTTCATATTGCCTCGTCAACTTAGATTCAGACTCATCCTCCACTCTTGAGTTTAACTCATACATGCCCAGTGACTCGTTGACCTCTCCGCTCGTGAAAAATGATCGATTTCACTCGTATAGGTACGTCAAAGTCGTTGGAATAAGTGTTGGGGGAAAAACTCTACCAATTTCACCGACAAGATTTGAAATTGATGAATCGGGATTGGGAGGAATAATCGTTGATTCTGGTACAATTATATCTCGACTACCGAGTGATGTCTATGAATCATTAAGAGAGGCATTTGTGAAGCTGACGAGTAGCCTCTCACCGGCACCAGGGATATCAGTGTTCGATACATGTTATAACTTTTCAGGTCAATCGAATGTGGAGGTCCCAACAATAGCATTTGTGTTGTCGGAAGGAACCTCGCTACGACTACCTGCAAGAAATTACTTAATTATGTTGGACACAGCAGGAACTTATTGTTTGGCGTTTATCAAAACGAAATCTTCACTTTCTATAATTGGTAGCTTCCAACAACAAGGAATACGTGTTAGTTATGACTTGACAAACTCCATCGTTGGATTCTCAACTAATAAATGTTAG

Protein sequence

MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC
BLAST of CsGy1G003630.1 vs. NCBI nr
Match: XP_004138238.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus] >KGN63810.1 Aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 952.2 bits (2460), Expect = 6.8e-274
Identity = 485/487 (99.59%), Postives = 486/487 (99.79%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG
Sbjct: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL
Sbjct: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300

Query: 301 DSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNS MPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIV 480
           AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNS+V
Sbjct: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of CsGy1G003630.1 vs. NCBI nr
Match: XP_016901370.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo])

HSP 1 Score: 814.3 bits (2102), Expect = 2.2e-232
Identity = 424/487 (87.06%), Postives = 436/487 (89.53%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           M TSLSSVFLFLTIFTSLQF SILSRKLT S YSTSIFDV ASTNQAL+ALSIKPK LQ 
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLTQSPYSTSIFDVLASTNQALNALSIKPKHLQT 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNS  SLPLYPRL+LHNPSYKDY++LVRARL RDAARVQFLNRNLE SLNGG  FG
Sbjct: 61  HSHLPNSSLSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGKDFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           E  N SLIGDSITAPVVSGQSKGSGAEYLAQ+GVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 EVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           +EN CYKQ DPIFDPKSSSSY+PLSCNSQQC LLD+ NCNS TCIYQVHYGDGSFTTGEL
Sbjct: 181 TENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN+
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNM 300

Query: 301 DSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNS MPSDSLTSPLVKNDRFHSYRYVKV+GISVGGKTLPIS TRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPG                     
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPG--------------------- 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIV 480
                 GTSLRLPARNYLI +DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNS+V
Sbjct: 421 ------GTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 460

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 460

BLAST of CsGy1G003630.1 vs. NCBI nr
Match: XP_008453383.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 708.4 bits (1827), Expect = 1.7e-200
Identity = 366/488 (75.00%), Postives = 414/488 (84.84%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS   LFLTIFT LQFPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  NHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHF 120
            HS+  NSP SL L+PRL +HNPSYKDY TLVRARL R A RVQ LNR LE SLNG   F
Sbjct: 62  THSYHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAKQF 121

Query: 121 GESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPC 180
           G+ IN S   +S+TAPV SG S G G EY A+IGVGQPV+ F+LVPDTGSDVTWLQC+PC
Sbjct: 122 GKRINGSASTNSLTAPVTSGASHGDG-EYFARIGVGQPVQSFFLVPDTGSDVTWLQCKPC 181

Query: 181 ASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGE 240
           A+EN C+KQ DPIFDPKSSSSYS LSCNS+QC+LLD+A C+S++CIY+V YGDGSFT GE
Sbjct: 182 ANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIGE 241

Query: 241 LATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN 300
           LATETLSFGNSNSIPNLPIGCGHDNEGLF   AGLIGLGGGAISLSSQL+ASSFSYCLV+
Sbjct: 242 LATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLVD 301

Query: 301 LDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDES 360
           LDSDSSSTL+FN+  PSDSLTSPLVKN+RF S+RYVKV+G+SVGGK LPIS +RFEIDES
Sbjct: 302 LDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDES 361

Query: 361 GLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPT 420
           G GGIIVDSGT I++LPSDVY+ LR+AFV LT++L  APG+S FDTCY+ S QS+VEVP 
Sbjct: 362 GSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVPI 421

Query: 421 IAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSI 480
           IAF+L  G SL+LPA+N LI +D+AGT+CLAF+     LSIIG+ QQQGIRVSYDL NSI
Sbjct: 422 IAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNSI 481

Query: 481 VGFSTNKC 488
           VGF+TNKC
Sbjct: 482 VGFATNKC 488

BLAST of CsGy1G003630.1 vs. NCBI nr
Match: XP_004138237.2 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus] >KGN63809.1 hypothetical protein Csa_1G022480 [Cucumis sativus])

HSP 1 Score: 678.3 bits (1749), Expect = 1.9e-191
Identity = 352/490 (71.84%), Postives = 407/490 (83.06%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  -NHSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240

Query: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300

Query: 301 VNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360
           V+LDS+SSSTL+FN+  PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360

Query: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420

Query: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480

Query: 481 SIVGFSTNKC 488
           S+VGFST+KC
Sbjct: 481 SLVGFSTDKC 489

BLAST of CsGy1G003630.1 vs. NCBI nr
Match: XP_023007215.1 (protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita maxima])

HSP 1 Score: 632.5 bits (1630), Expect = 1.2e-177
Identity = 334/487 (68.58%), Postives = 391/487 (80.29%), Query Frame = 0

Query: 2   NTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNH 61
           N  + S FLF  I  SL F S L+R  T    +T++FDVSAS+N+A +ALSI P    +H
Sbjct: 10  NKPIFSAFLFTAILNSLLFSSSLARVFTE---TTTVFDVSASSNRAQNALSITPPQFHSH 69

Query: 62  SHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE 121
            HL NS  SL L+ RLA+H  +YKDY +LVRARL RDAARVQ LNRNL  +L G      
Sbjct: 70  -HLSNSSLSLSLHSRLAIHKHNYKDYESLVRARLARDAARVQSLNRNLNLALAG------ 129

Query: 122 SINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCAS 181
              +++  +S+TAPVVSGQS+GSG EY A+I VGQP + FYLVPDTGSD+TWLQC PC+ 
Sbjct: 130 ---DAVRPNSLTAPVVSGQSQGSG-EYFARIAVGQPAQSFYLVPDTGSDITWLQCLPCSI 189

Query: 182 ENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELA 241
            NTCY Q DPIF+P SSSSY PLSC+SQQC+ L++  C S TC+YQV YGDGSFTTG+ A
Sbjct: 190 GNTCYPQTDPIFNPTSSSSYRPLSCDSQQCQSLNRPGCQSGTCVYQVWYGDGSFTTGDFA 249

Query: 242 TETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLD 301
           TETL+FGNS SIPNLPIGCGHDN+GLF G AGLIGLGGGA+SLSSQLKASSFSYCLV+ D
Sbjct: 250 TETLTFGNSKSIPNLPIGCGHDNKGLFVGAAGLIGLGGGALSLSSQLKASSFSYCLVDRD 309

Query: 302 SDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL 361
           SDSSSTLEF+S  PSDS+T+PL+KN+R  SYRYV+V G+SVGGK L IS TRFEID SG+
Sbjct: 310 SDSSSTLEFDSARPSDSITTPLLKNNRIDSYRYVQVTGMSVGGKALSISSTRFEIDGSGM 369

Query: 362 GGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIA 421
           GGIIVDSGT I+RLP+DVYESLREAFV+   SL+ A  IS FDTCYN +GQSNV+VPT+A
Sbjct: 370 GGIIVDSGTFITRLPTDVYESLREAFVQGARSLTAAGAISPFDTCYNLAGQSNVQVPTVA 429

Query: 422 FVLSEGTSLRLPARNYLIMLDTAGTYCLAFIK-TKSSLSIIGSFQQQGIRVSYDLTNSIV 481
           F LS+G  L+LPARNYLI +DTAGTYCLAF+K T SSLSIIGSFQQQG+RVSYDL NS+V
Sbjct: 430 FELSKGNLLQLPARNYLIRMDTAGTYCLAFLKLTTSSLSIIGSFQQQGMRVSYDLVNSLV 482

Query: 482 GFSTNKC 488
           GFS+NKC
Sbjct: 490 GFSSNKC 482

BLAST of CsGy1G003630.1 vs. TAIR10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 445.7 bits (1145), Expect = 3.8e-125
Identity = 233/485 (48.04%), Postives = 331/485 (68.25%), Query Frame = 0

Query: 7   SVFLFLTIFTSLQFPSILSRKL-TPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLP 66
           S F F+   TS    S+ SR L   S+ +TSI +V+ S ++     S +    +  +H  
Sbjct: 6   SFFFFIFFLTS--HSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSA 65

Query: 67  NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG---ES 126
           +S FSL L+ R+++    + DY +L  ARL RD ARV+ L   L+ ++N  +       S
Sbjct: 66  SSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPIS 125

Query: 127 INESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASE 186
              +     I AP++SG ++GSG EY  ++G+G+P +  Y+V DTGSDV WLQC PCA  
Sbjct: 126 TMYTTEEQDIEAPLISGTTQGSG-EYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCAD- 185

Query: 187 NTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELAT 246
             CY Q +PIF+P SSSSY PLSC++ QC  L+ + C + TC+Y+V YGDGS+T G+ AT
Sbjct: 186 --CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFAT 245

Query: 247 ETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDS 306
           ETL+ G S  + N+ +GCGH NEGLF G AGL+GLGGG ++L SQL  +SFSYCLV+ DS
Sbjct: 246 ETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDS 305

Query: 307 DSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLG 366
           DS+ST++F + +  D++ +PL++N +  ++ Y+ + GISVGG+ L I  + FE+DESG G
Sbjct: 306 DSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSG 365

Query: 367 GIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAF 426
           GII+DSGT ++RL +++Y SLR++FVK T  L  A G+++FDTCYN S ++ VEVPT+AF
Sbjct: 366 GIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAF 425

Query: 427 VLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGF 486
               G  L LPA+NY+I +D+ GT+CLAF  T SSL+IIG+ QQQG RV++DL NS++GF
Sbjct: 426 HFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGF 483

Query: 487 STNKC 488
           S+NKC
Sbjct: 486 SSNKC 483

BLAST of CsGy1G003630.1 vs. TAIR10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 423.7 bits (1088), Expect = 1.5e-118
Identity = 234/498 (46.99%), Postives = 328/498 (65.86%), Query Frame = 0

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSI----------KPK 66
           S+   +T+   L      SR L+ +   T++ DV +S  Q    LS+          KP+
Sbjct: 8   SLLAVVTLSLFLTTTDASSRSLS-TPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPE 67

Query: 67  PLQNHSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNG 126
            L +      +SP SL L+ R       +KDY +L  +RL RD++RV  +   +  ++ G
Sbjct: 68  SLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEG 127

Query: 127 --GTHFGESINES--LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDV 186
              +      NE      + +T PVVSG S+GSG EY ++IGVG P K  YLV DTGSDV
Sbjct: 128 VDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSG-EYFSRIGVGTPAKEMYLVLDTGSDV 187

Query: 187 TWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYG 246
            W+QC+PCA    CY+Q DP+F+P SSS+Y  L+C++ QC LL+ + C S+ C+YQV YG
Sbjct: 188 NWIQCEPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYG 247

Query: 247 DGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS 306
           DGSFT GELAT+T++FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG +S+++Q+KA+
Sbjct: 248 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 307

Query: 307 SFSYCLVNLDSDSSSTLEFNS-YMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           SFSYCLV+ DS  SS+L+FNS  +     T+PL++N +  ++ YV + G SVGG+ + + 
Sbjct: 308 SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 367

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSP-APGISVFDTCYNF 426
              F++D SG GG+I+D GT ++RL +  Y SLR+AF+KLT +L   +  IS+FDTCY+F
Sbjct: 368 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 427

Query: 427 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 486
           S  S V+VPT+AF  + G SL LPA+NYLI +D +GT+C AF  T SSLSIIG+ QQQG 
Sbjct: 428 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGT 487

Query: 487 RVSYDLTNSIVGFSTNKC 488
           R++YDL+ +++G S NKC
Sbjct: 488 RITYDLSKNVIGLSGNKC 500

BLAST of CsGy1G003630.1 vs. TAIR10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 343.2 bits (879), Expect = 2.6e-94
Identity = 181/435 (41.61%), Postives = 272/435 (62.53%), Query Frame = 0

Query: 60  NHSHL---PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGG 119
           N++H     +S ++L L  R    + +Y++++  + AR+ RD  RV  + R +   +   
Sbjct: 47  NNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS 106

Query: 120 THFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 179
           +     +N+        + +VSG  +GSG EY  +IGVG P +  Y+V D+GSD+ W+QC
Sbjct: 107 SDSRYEVND------FGSDIVSGMDQGSG-EYFVRIGVGSPPRDQYMVIDSGSDMVWVQC 166

Query: 180 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 239
           QPC     CYKQ DP+FDP  S SY+ +SC S  C  ++ + C+S  C Y+V YGDGS+T
Sbjct: 167 QPC---KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYT 226

Query: 240 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLK---ASSF 299
            G LA ETL+F  +  + N+ +GCGH N G+F G AGL+G+GGG++S   QL      +F
Sbjct: 227 KGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAF 286

Query: 300 SYCLVNLDSDSSSTLEF-NSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 359
            YCLV+  +DS+ +L F    +P  +   PLV+N R  S+ YV + G+ VGG  +P+   
Sbjct: 287 GYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDG 346

Query: 360 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 419
            F++ E+G GG+++D+GT ++RLP+  Y + R+ F   T++L  A G+S+FDTCY+ SG 
Sbjct: 347 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGF 406

Query: 420 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 479
            +V VPT++F  +EG  L LPARN+L+ +D +GTYC AF  + + LSIIG+ QQ+GI+VS
Sbjct: 407 VSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVS 466

Query: 480 YDLTNSIVGFSTNKC 488
           +D  N  VGF  N C
Sbjct: 467 FDGANGFVGFGPNVC 470

BLAST of CsGy1G003630.1 vs. TAIR10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 325.9 bits (834), Expect = 4.3e-89
Identity = 211/504 (41.87%), Postives = 293/504 (58.13%), Query Frame = 0

Query: 1   MNTSLSSVF--LFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPL 60
           +NT   SVF  LF T   S Q+ +++   L PSS + S  +  + T+   ++LS     L
Sbjct: 6   LNTLAFSVFAVLFFTSSASSQYQTLVVNTL-PSSATLSWPESESLTD---ESLSESTTSL 65

Query: 61  QNH-SHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             H SH+             AL + S      L   RL RD+ RV+ +      S    T
Sbjct: 66  SVHLSHVD------------ALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVS----T 125

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
               +          +  V+SG S+GSG EY  ++GVG P    Y+V DTGSDV WLQC 
Sbjct: 126 GRNATKRTPRTAGGFSGAVISGLSQGSG-EYFMRLGVGTPATNVYMVLDTGSDVVWLQCS 185

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLL-DKANC---NSDTCIYQVHYGDG 240
           PC     CY Q D IFDPK S +++ + C S+ C+ L D + C    S TC+YQV YGDG
Sbjct: 186 PC---KACYNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDG 245

Query: 241 SFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLK---A 300
           SFT G+ +TETL+F  +  + ++P+GCGHDNEGLF G AGL+GLG G +S  SQ K    
Sbjct: 246 SFTEGDFSTETLTFHGAR-VDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYN 305

Query: 301 SSFSYCLVNLDSDSS-----STLEF-NSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG 360
             FSYCLV+  S  S     ST+ F N+ +P  S+ +PL+ N +  ++ Y++++GISVGG
Sbjct: 306 GKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGG 365

Query: 361 KTLP-ISPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVF 420
             +P +S ++F++D +G GG+I+DSGT ++RL    Y +LR+AF    + L  AP  S+F
Sbjct: 366 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLF 425

Query: 421 DTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGS 480
           DTC++ SG + V+VPT+ F    G  + LPA NYLI ++T G +C AF  T  SLSIIG+
Sbjct: 426 DTCFDLSGMTTVKVPTVVFHFG-GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGN 483

Query: 481 FQQQGIRVSYDLTNSIVGFSTNKC 488
            QQQG RV+YDL  S VGF +  C
Sbjct: 486 IQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CsGy1G003630.1 vs. TAIR10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 323.9 bits (829), Expect = 1.7e-88
Identity = 181/413 (43.83%), Postives = 254/413 (61.50%), Query Frame = 0

Query: 83  SYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSK 142
           S K  + L  +RL RD+ RV+ +   L   +      G ++  +      ++ VVSG S+
Sbjct: 84  SNKTPDELFSSRLQRDSRRVKSI-ATLAAQIP-----GRNVTHAPRPGGFSSSVVSGLSQ 143

Query: 143 GSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYS 202
           GSG EY  ++GVG P +  Y+V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+
Sbjct: 144 GSG-EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---RRCYSQSDPIFDPRKSKTYA 203

Query: 203 PLSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGC 262
            + C+S  C+ LD A CN+   TC+YQV YGDGSFT G+ +TETL+F   N +  + +GC
Sbjct: 204 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGC 263

Query: 263 GHDNEGLFAGGAGLIGLGGGAISLSSQLK---ASSFSYCLVNLDSDS--SSTLEFNSYMP 322
           GHDNEGLF G AGL+GLG G +S   Q        FSYCLV+  + S  SS +  N+ + 
Sbjct: 264 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS 323

Query: 323 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEIDESGLGGIIVDSGTIISR 382
             +  +PL+ N +  ++ YV ++GISVGG  +P ++ + F++D+ G GG+I+DSGT ++R
Sbjct: 324 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 383

Query: 383 LPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 442
           L    Y ++R+AF     +L  AP  S+FDTC++ S  + V+VPT+      G  + LPA
Sbjct: 384 LIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPA 443

Query: 443 RNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 488
            NYLI +DT G +C AF  T   LSIIG+ QQQG RV YDL +S VGF+   C
Sbjct: 444 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CsGy1G003630.1 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 423.7 bits (1088), Expect = 2.8e-117
Identity = 234/498 (46.99%), Postives = 328/498 (65.86%), Query Frame = 0

Query: 7   SVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSI----------KPK 66
           S+   +T+   L      SR L+ +   T++ DV +S  Q    LS+          KP+
Sbjct: 8   SLLAVVTLSLFLTTTDASSRSLS-TPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPE 67

Query: 67  PLQNHSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNG 126
            L +      +SP SL L+ R       +KDY +L  +RL RD++RV  +   +  ++ G
Sbjct: 68  SLSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEG 127

Query: 127 --GTHFGESINES--LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDV 186
              +      NE      + +T PVVSG S+GSG EY ++IGVG P K  YLV DTGSDV
Sbjct: 128 VDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSG-EYFSRIGVGTPAKEMYLVLDTGSDV 187

Query: 187 TWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYG 246
            W+QC+PCA    CY+Q DP+F+P SSS+Y  L+C++ QC LL+ + C S+ C+YQV YG
Sbjct: 188 NWIQCEPCAD---CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYG 247

Query: 247 DGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS 306
           DGSFT GELAT+T++FGNS  I N+ +GCGHDNEGLF G AGL+GLGGG +S+++Q+KA+
Sbjct: 248 DGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKAT 307

Query: 307 SFSYCLVNLDSDSSSTLEFNS-YMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           SFSYCLV+ DS  SS+L+FNS  +     T+PL++N +  ++ YV + G SVGG+ + + 
Sbjct: 308 SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 367

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSP-APGISVFDTCYNF 426
              F++D SG GG+I+D GT ++RL +  Y SLR+AF+KLT +L   +  IS+FDTCY+F
Sbjct: 368 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 427

Query: 427 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 486
           S  S V+VPT+AF  + G SL LPA+NYLI +D +GT+C AF  T SSLSIIG+ QQQG 
Sbjct: 428 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGT 487

Query: 487 RVSYDLTNSIVGFSTNKC 488
           R++YDL+ +++G S NKC
Sbjct: 488 RITYDLSKNVIGLSGNKC 500

BLAST of CsGy1G003630.1 vs. Swiss-Prot
Match: sp|Q9LHE3|ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 4.7e-93
Identity = 181/435 (41.61%), Postives = 272/435 (62.53%), Query Frame = 0

Query: 60  NHSHL---PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGG 119
           N++H     +S ++L L  R    + +Y++++  + AR+ RD  RV  + R +   +   
Sbjct: 47  NNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPS 106

Query: 120 THFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 179
           +     +N+        + +VSG  +GSG EY  +IGVG P +  Y+V D+GSD+ W+QC
Sbjct: 107 SDSRYEVND------FGSDIVSGMDQGSG-EYFVRIGVGSPPRDQYMVIDSGSDMVWVQC 166

Query: 180 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFT 239
           QPC     CYKQ DP+FDP  S SY+ +SC S  C  ++ + C+S  C Y+V YGDGS+T
Sbjct: 167 QPC---KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYT 226

Query: 240 TGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLK---ASSF 299
            G LA ETL+F  +  + N+ +GCGH N G+F G AGL+G+GGG++S   QL      +F
Sbjct: 227 KGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAF 286

Query: 300 SYCLVNLDSDSSSTLEF-NSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 359
            YCLV+  +DS+ +L F    +P  +   PLV+N R  S+ YV + G+ VGG  +P+   
Sbjct: 287 GYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDG 346

Query: 360 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 419
            F++ E+G GG+++D+GT ++RLP+  Y + R+ F   T++L  A G+S+FDTCY+ SG 
Sbjct: 347 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGF 406

Query: 420 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 479
            +V VPT++F  +EG  L LPARN+L+ +D +GTYC AF  + + LSIIG+ QQ+GI+VS
Sbjct: 407 VSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVS 466

Query: 480 YDLTNSIVGFSTNKC 488
           +D  N  VGF  N C
Sbjct: 467 FDGANGFVGFGPNVC 470

BLAST of CsGy1G003630.1 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 3.0e-87
Identity = 181/413 (43.83%), Postives = 254/413 (61.50%), Query Frame = 0

Query: 83  SYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSK 142
           S K  + L  +RL RD+ RV+ +   L   +      G ++  +      ++ VVSG S+
Sbjct: 84  SNKTPDELFSSRLQRDSRRVKSI-ATLAAQIP-----GRNVTHAPRPGGFSSSVVSGLSQ 143

Query: 143 GSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYS 202
           GSG EY  ++GVG P +  Y+V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+
Sbjct: 144 GSG-EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC---RRCYSQSDPIFDPRKSKTYA 203

Query: 203 PLSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGC 262
            + C+S  C+ LD A CN+   TC+YQV YGDGSFT G+ +TETL+F   N +  + +GC
Sbjct: 204 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGC 263

Query: 263 GHDNEGLFAGGAGLIGLGGGAISLSSQLK---ASSFSYCLVNLDSDS--SSTLEFNSYMP 322
           GHDNEGLF G AGL+GLG G +S   Q        FSYCLV+  + S  SS +  N+ + 
Sbjct: 264 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS 323

Query: 323 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEIDESGLGGIIVDSGTIISR 382
             +  +PL+ N +  ++ YV ++GISVGG  +P ++ + F++D+ G GG+I+DSGT ++R
Sbjct: 324 RIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTR 383

Query: 383 LPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 442
           L    Y ++R+AF     +L  AP  S+FDTC++ S  + V+VPT+      G  + LPA
Sbjct: 384 LIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPA 443

Query: 443 RNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 488
            NYLI +DT G +C AF  T   LSIIG+ QQQG RV YDL +S VGF+   C
Sbjct: 444 TNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CsGy1G003630.1 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.5e-67
Identity = 152/387 (39.28%), Postives = 225/387 (58.14%), Query Frame = 0

Query: 109 LERSLNGGTHFGESINESLIGDS-ITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDT 168
           LER++  G+   + +   L G S +   V +G       EYL  + +G P + F  + DT
Sbjct: 60  LERAIERGSRRLQRLEAMLNGPSGVETSVYAGD-----GEYLMNLSIGTPAQPFSAIMDT 119

Query: 169 GSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQ 228
           GSD+ W QCQPC     C+ Q  PIF+P+ SSS+S L C+SQ C+ L    C+++ C Y 
Sbjct: 120 GSDLIWTQCQPCTQ---CFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYT 179

Query: 229 VHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAG-GAGLIGLGGGAISLSS 288
             YGDGS T G + TETL+FG S SIPN+  GCG +N+G   G GAGL+G+G G +SL S
Sbjct: 180 YGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPS 239

Query: 289 QLKASSFSYCLVNLDSDSSSTLEFNSYMPSDSLTSP---LVKNDRFHSYRYVKVVGISVG 348
           QL  + FSYC+  + S + S L   S   S +  SP   L+++ +  ++ Y+ + G+SVG
Sbjct: 240 QLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG 299

Query: 349 GKTLPISPTRFEID-ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISV 408
              LPI P+ F ++  +G GGII+DSGT ++   ++ Y+S+R+ F+   +        S 
Sbjct: 300 STRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSG 359

Query: 409 FDTCYNF-SGQSNVEVPTIAFVLS-EGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSI 468
           FD C+   S  SN+++PT  FV+  +G  L LP+ NY I   + G  CLA   +   +SI
Sbjct: 360 FDLCFQTPSDPSNLQIPT--FVMHFDGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSI 419

Query: 469 IGSFQQQGIRVSYDLTNSIVGFSTNKC 488
            G+ QQQ + V YD  NS+V F++ +C
Sbjct: 420 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CsGy1G003630.1 vs. Swiss-Prot
Match: sp|Q9LEW3|AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 4.2e-65
Identity = 156/406 (38.42%), Postives = 223/406 (54.93%), Query Frame = 0

Query: 86  DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145
           D++ ++R    RD ARV+ +   L ++         S NE     S   P  SG + GSG
Sbjct: 84  DHDEIIR----RDQARVESIYSKLSKN---------SANEVSEAKSTELPAKSGITLGSG 143

Query: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205
             Y+  IG+G P     LV DTGSD+TW QC+PC    +CY Q +P F+P SSS+Y  +S
Sbjct: 144 -NYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCL--GSCYSQKEPKFNPSSSSTYQNVS 203

Query: 206 CNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNE 265
           C+S  C+  D  +C++  C+Y + YGD SFT G LA E  +  NS+ + ++  GCG +N+
Sbjct: 204 CSSPMCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQ 263

Query: 266 GLFAGGAGLIGLGGGAISLSSQLKA---SSFSYCLVNLDSDSSSTLEFNSYMPSDSLT-S 325
           GLF G AGL+GLG G +SL +Q      + FSYCL +  S+S+  L F S   S+S+  +
Sbjct: 264 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 323

Query: 326 PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRLPSDVYE 385
           P+       +Y  + ++GISVG K L I+P  F  +     G I+DSGT+ +RLP+ VY 
Sbjct: 324 PISSFPSAFNYG-IDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYA 383

Query: 386 SLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPARNYLIML 445
            LR  F +  SS     G  +FDTCY+F+G   V  PTIAF  +  T + L      + +
Sbjct: 384 ELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI 443

Query: 446 DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 488
             +   CLAF       +I G+ QQ  + V YD+    VGF+ N C
Sbjct: 444 KIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of CsGy1G003630.1 vs. TrEMBL
Match: tr|A0A0A0LPJ3|A0A0A0LPJ3_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_1G022490 PE=3 SV=1)

HSP 1 Score: 952.2 bits (2460), Expect = 4.5e-274
Identity = 485/487 (99.59%), Postives = 486/487 (99.79%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG
Sbjct: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL
Sbjct: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300

Query: 301 DSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNS MPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIV 480
           AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNS+V
Sbjct: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 480

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 487

BLAST of CsGy1G003630.1 vs. TrEMBL
Match: tr|A0A1S4E062|A0A1S4E062_CUCME (protein ASPARTIC PROTEASE IN GUARD CELL 1-like OS=Cucumis melo OX=3656 GN=LOC103494117 PE=3 SV=1)

HSP 1 Score: 814.3 bits (2102), Expect = 1.5e-232
Identity = 424/487 (87.06%), Postives = 436/487 (89.53%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQN 60
           M TSLSSVFLFLTIFTSLQF SILSRKLT S YSTSIFDV ASTNQAL+ALSIKPK LQ 
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLTQSPYSTSIFDVLASTNQALNALSIKPKHLQT 60

Query: 61  HSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFG 120
           HSHLPNS  SLPLYPRL+LHNPSYKDY++LVRARL RDAARVQFLNRNLE SLNGG  FG
Sbjct: 61  HSHLPNSSLSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGKDFG 120

Query: 121 ESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180
           E  N SLIGDSITAPVVSGQSKGSGAEYLAQ+GVGQPVKLFYLVPDTGSDVTWLQCQPCA
Sbjct: 121 EVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQPCA 180

Query: 181 SENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGEL 240
           +EN CYKQ DPIFDPKSSSSY+PLSCNSQQC LLD+ NCNS TCIYQVHYGDGSFTTGEL
Sbjct: 181 TENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTTGEL 240

Query: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL 300
           ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN+
Sbjct: 241 ATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNM 300

Query: 301 DSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESG 360
           DSDSSSTLEFNS MPSDSLTSPLVKNDRFHSYRYVKV+GISVGGKTLPIS TRFEIDESG
Sbjct: 301 DSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEIDESG 360

Query: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTI 420
           LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPG                     
Sbjct: 361 LGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPG--------------------- 420

Query: 421 AFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIV 480
                 GTSLRLPARNYLI +DTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNS+V
Sbjct: 421 ------GTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLV 460

Query: 481 GFSTNKC 488
           GFSTNKC
Sbjct: 481 GFSTNKC 460

BLAST of CsGy1G003630.1 vs. TrEMBL
Match: tr|A0A1S3BW42|A0A1S3BW42_CUCME (protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Cucumis melo OX=3656 GN=LOC103494116 PE=3 SV=1)

HSP 1 Score: 708.4 bits (1827), Expect = 1.1e-200
Identity = 366/488 (75.00%), Postives = 414/488 (84.84%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS   LFLTIFT LQFPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  NHSHLPNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHF 120
            HS+  NSP SL L+PRL +HNPSYKDY TLVRARL R A RVQ LNR LE SLNG   F
Sbjct: 62  THSYHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAKQF 121

Query: 121 GESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPC 180
           G+ IN S   +S+TAPV SG S G G EY A+IGVGQPV+ F+LVPDTGSDVTWLQC+PC
Sbjct: 122 GKRINGSASTNSLTAPVTSGASHGDG-EYFARIGVGQPVQSFFLVPDTGSDVTWLQCKPC 181

Query: 181 ASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGE 240
           A+EN C+KQ DPIFDPKSSSSYS LSCNS+QC+LLD+A C+S++CIY+V YGDGSFT GE
Sbjct: 182 ANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIGE 241

Query: 241 LATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVN 300
           LATETLSFGNSNSIPNLPIGCGHDNEGLF   AGLIGLGGGAISLSSQL+ASSFSYCLV+
Sbjct: 242 LATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLVD 301

Query: 301 LDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDES 360
           LDSDSSSTL+FN+  PSDSLTSPLVKN+RF S+RYVKV+G+SVGGK LPIS +RFEIDES
Sbjct: 302 LDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDES 361

Query: 361 GLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPT 420
           G GGIIVDSGT I++LPSDVY+ LR+AFV LT++L  APG+S FDTCY+ S QS+VEVP 
Sbjct: 362 GSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVPI 421

Query: 421 IAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSI 480
           IAF+L  G SL+LPA+N LI +D+AGT+CLAF+     LSIIG+ QQQGIRVSYDL NSI
Sbjct: 422 IAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNSI 481

Query: 481 VGFSTNKC 488
           VGF+TNKC
Sbjct: 482 VGFATNKC 488

BLAST of CsGy1G003630.1 vs. TrEMBL
Match: tr|A0A0A0LS14|A0A0A0LS14_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G022480 PE=3 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 1.3e-191
Identity = 352/490 (71.84%), Postives = 407/490 (83.06%), Query Frame = 0

Query: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  -NHSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180

Query: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240

Query: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300

Query: 301 VNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360
           V+LDS+SSSTL+FN+  PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360

Query: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420

Query: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480

Query: 481 SIVGFSTNKC 488
           S+VGFST+KC
Sbjct: 481 SLVGFSTDKC 489

BLAST of CsGy1G003630.1 vs. TrEMBL
Match: tr|M5WPB5|M5WPB5_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G218800 PE=3 SV=1)

HSP 1 Score: 498.8 bits (1283), Expect = 1.4e-137
Identity = 270/497 (54.33%), Postives = 351/497 (70.62%), Query Frame = 0

Query: 7   SVFLFLTIFTSLQ----FPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKP---KPL- 66
           + FL+L I ++      FPS  SR L  S  +T++ DVSAS  QA D LS  P   KPL 
Sbjct: 4   TAFLYLAILSAFTLTSLFPSTHSRSL--SEETTTLLDVSASLTQAHDVLSFNPQTLKPLD 63

Query: 67  ------QNHSHLP-NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLER 126
                 Q H+  P NS FSL L PR ALHN  +KDY +LV++RL RD+ARV  L+  L+ 
Sbjct: 64  RQETQAQAHTLTPLNSSFSLQLLPRDALHNSQHKDYESLVQSRLGRDSARVNSLHTKLQL 123

Query: 127 SL-NGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSD 186
            + N      E ++  +  + ++ PVVSG S+GSG EY  +IGVG P K  Y+V DTGSD
Sbjct: 124 VVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSG-EYFTRIGVGTPAKSLYMVLDTGSD 183

Query: 187 VTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY 246
           + WLQC+PC+    CY+Q DP+F+P  SS+Y P++C+S QC  L  + C +D C+YQV Y
Sbjct: 184 INWLQCEPCSD---CYQQTDPVFNPTGSSTYRPVTCDSAQCHSLHVSACRADKCLYQVSY 243

Query: 247 GDGSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKA 306
           GDGS+T G+  TET+SFGNS +I N+ +GCGHDNEGLF G AGL+GLGGGA+SL SQ KA
Sbjct: 244 GDGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKA 303

Query: 307 SSFSYCLVNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPIS 366
           +SFSYCLVN DS +SSTLEFNS  PSDS+T+PL+K+ R  ++ YV + G SVGG+ + + 
Sbjct: 304 TSFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVP 363

Query: 367 PTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFS 426
           P+ FE+DESG GGIIVDSGT I+RL ++ Y SLR+AF +LT  L  A G ++FDTCY+ S
Sbjct: 364 PSVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLS 423

Query: 427 GQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIR 486
            +S V+VPT++F+ ++G SL LPA+NYLI +D+AGT+C AF  T SS SIIG+ QQQG R
Sbjct: 424 SRSRVQVPTVSFLFADGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTR 483

Query: 487 VSYDLTNSIVGFSTNKC 488
           VSYDL N+ VGFS NKC
Sbjct: 484 VSYDLANNRVGFSPNKC 494

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004138238.16.8e-27499.59PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus] >KGN... [more]
XP_016901370.12.2e-23287.06PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo][more]
XP_008453383.11.7e-20075.00PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo][more]
XP_004138237.21.9e-19171.84PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus] >KGN... [more]
XP_023007215.11.2e-17768.58protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT1G25510.13.8e-12548.04Eukaryotic aspartyl protease family protein[more]
AT3G18490.11.5e-11846.99Eukaryotic aspartyl protease family protein[more]
AT3G20015.12.6e-9441.61Eukaryotic aspartyl protease family protein[more]
AT3G61820.14.3e-8941.87Eukaryotic aspartyl protease family protein[more]
AT1G01300.11.7e-8843.83Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
sp|Q9LS40|ASPG1_ARATH2.8e-11746.99Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LHE3|ASPG2_ARATH4.7e-9341.61Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LNJ3|APF2_ARATH3.0e-8743.83Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q766C3|NEP1_NEPGR1.5e-6739.28Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
sp|Q9LEW3|AED1_ARATH4.2e-6538.42Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0LPJ3|A0A0A0LPJ3_CUCSA4.5e-27499.59Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_1G022490 PE=... [more]
tr|A0A1S4E062|A0A1S4E062_CUCME1.5e-23287.06protein ASPARTIC PROTEASE IN GUARD CELL 1-like OS=Cucumis melo OX=3656 GN=LOC103... [more]
tr|A0A1S3BW42|A0A1S3BW42_CUCME1.1e-20075.00protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Cucumis melo OX=3656 GN=LOC10349411... [more]
tr|A0A0A0LS14|A0A0A0LS14_CUCSA1.3e-19171.84Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G022480 PE=3 SV=1[more]
tr|M5WPB5|M5WPB5_PRUPE1.4e-13754.33Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G218800 PE=3 SV=1[more]
The following terms have been associated with this mRNA:
Vocabulary: Biological Process
TermDefinition
GO:0009737response to abscisic acid
GO:0009414response to water deprivation
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR033121PEPTIDASE_A1
IPR001461Aspartic_peptidase_A1
IPR033148ASPG1
IPR032799TAXi_C
IPR032861TAXi_N
IPR021109Peptidase_aspartic_dom_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0009737 response to abscisic acid
biological_process GO:0009414 response to water deprivation
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsGy1G003630CsGy1G003630gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G003630.1.exon.1CsGy1G003630.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsGy1G003630.1.CDS.1CsGy1G003630.1.CDS.1CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsGy1G003630.1CsGy1G003630.1-proteinpolypeptide


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 130..311
e-value: 9.9E-45
score: 155.0
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 314..487
e-value: 6.9E-46
score: 158.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 144..487
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 148..310
e-value: 2.1E-47
score: 161.6
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 334..483
e-value: 3.8E-35
score: 121.0
IPR033148ASPG1PANTHERPTHR13683:SF531PROTEIN ASPARTIC PROTEASE IN GUARD CELL 1coord: 7..487
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 7..487
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 148..483
score: 42.919