CsaV3_5G021850 (gene) Cucumber (Chinese Long) v3

NameCsaV3_5G021850
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionprotein ASPARTIC PROTEASE IN GUARD CELL 2
Locationchr5 : 16537681 .. 16540308 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GCAAAATAGAAATCCCAGAATGGCTACTTATTTACATAAAGTATCTTTTTCTTATGCAATAAGATTATAAGATAAGAAGATAGCAACCAACATTAAGCACAATGGGATTTGTTTATGCTTTTGATGTAGATAGAGAGATAGAGATAGAGAGAGCTGTATTCCAGCATCCCCATGCACCCCGTAGTCCCTCAACTTAACATAATACATAACTTTCAAAAACAGGCTAAAACAAAAGACAGTACAAACAAAGTCCATCAATCACTTGCAGTAAAAAGAGCACACATTTCTACATTATTATTTTTTAAAAATATCCATCTCTGTATTCACCAATTAATGCACACCTCAACTAATGTTATAAAACAACCCGTTTGATCCTACAATATTTAAGTATCACCACCGTAGATTGAACTCAGCATTTCTACTCGTGGCTTCTACTTACTGCCTACATGGATAAATAGAGGTCTACCCATGATTGGTGTACACATCTATACATCCATATATATCTTTTACCAACTTCACCTTCAGTTTCTTATCTATTTCTTTCTACAGAAAACACTCACAAACCAGTAACTCAATCATAATCCAACCCCACAAATTTTCTTTTTCCCTCTCATATTTCTTCTCACACTTAAATCCCCCATTTCTCATTTCCATTCTCAAGCTCCCCTACAACACTCACTTCACAGTTCAACTACTAAACCATGCTACCTTTCCTTCTTCTTTCTCTTTTAGCCACCGCAGTCGCTTCGGTTGCCACTGGTCCTGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAGCAGAAACCGCACCCTCTAGACTACCACAAGATCTTGAACTCCATGAAAACTACCCTATTTTTGAGTTAGACAACAACAGCAGCCAAAGTCAATGGAAGCTCAAGCTTTTCCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCCCCGTCGTTTCAAGGAGCGTATCAGCAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCGCCTACTCTCCAGTGGCAGCGATGAGCAGGTGATCAGGAGTGAAGCTAAGAAGGAAGAAGGGAGCCAATTCCCCTCCTTTGAGTTCATATAAATATACTCAATATTGTTTCAAAGATTATAGATAAAAAAAATGTTGGAATATAGAATTTTTGTTTCTTTCTGATTTTGCTTCTATTTTCATTTTTTGGCTGTGGCCGTGCAGGTGACGGACTTCGGATCGGATGTGGTCTCCGGCACGGAGCAGGGTAGCGGAGAGTACTTCGTAAGGATAGGCGTTGGCAGCCCGCCGAGGAGCCAATACGTAGTGATTGATTCCGGCAGCGACATTGTATGGGTGCAATGCCAGCCCTGCAGCGAATGCTACCAACAATCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCACCTACGCCGGAATCTCCTGTGACTCATCAGTGTGTGACCGCCTCGACAACGCAGGCTGCAACGATGGCCGGTGCCGGTACGAGGTGTCATACGGCGACGGATCCTACACCCGCGGCACCCTCGCGCTCGAAACCTTAACTTTCGGCCGGGTCCTAATCCGAAACATCGCGATTGGGTGCGGCCATATGAACCGAGGAATGTTCATCGGAGCCGCAGGGTTGCTCGGTCTCGGCGGCGGCGCCATGTCATTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTAGTCAGTCGAGGCACAGAGTCCACAGGAACACTGGAATTCGGCCGCGGCGCTATGCCAGTCGGCGCCGCGTGGGTTCCCCTAATCCGAAATCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGTCTCGGAGTCGGAGGGATCCGAGTACCAATACCCGAACAAATCTTCGAACTCACCGATTTAGGGTACGGTGGTGTAGTAATGGACACCGGAACCGCCGTGACGAGGCTACCGGCGCCGGCGTACGAAGCATTCCGAGACACGTTCATCGGACAAACGGCAAACCTACCTCGATCGGACAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCTTTCTACTTCTCCGGTGGGCCAATACTGACGTTGCCGGCGAGGAATTTTCTAATCCCGGTGGACGGAGAAGGGACATTTTGCTTTGCATTTGCAGCATCGGCGTCAGGATTGTCGATCATAGGGAACATTCAACAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAACTATTTGTTAATTATTCCCTTCATCAAATATCAAACGAAGACCCAATCTCAAATTATGTCTTAAATACGTTTTTTCCCCTCAAACTTAAAATCTTTTTTTTTTCTTATTTTGTAAGTTATAGATTTCTTTTTAACTTTTATTTTTATTTTATTTTTTTTTTGTAGAATTTGAGTTTAGAAAGAATATGTTAGTGATTGAGAAGTTCATTGTTTGGTAGATTTTTCGTTTCTTTTGTGTTAATACATTCATCAAATATCTAACATTTTTTTTAAGATTAGTTAATCATAATTTGAGAATTTATTTCCAATAATAAAATTTTAAAAATATTTACG

mRNA sequence

ATGCTACCTTTCCTTCTTCTTTCTCTTTTAGCCACCGCAGTCGCTTCGGTTGCCACTGGTCCTGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAGCAGAAACCGCACCCTCTAGACTACCACAAGATCTTGAACTCCATGAAAACTACCCTATTTTTGAGTTAGACAACAACAGCAGCCAAAGTCAATGGAAGCTCAAGCTTTTCCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCCCCGTCGTTTCAAGGAGCGTATCAGCAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCGCCTACTCTCCAGTGGCAGCGATGAGCAGGTGACGGACTTCGGATCGGATGTGGTCTCCGGCACGGAGCAGGGTAGCGGAGAGTACTTCGTAAGGATAGGCGTTGGCAGCCCGCCGAGGAGCCAATACGTAGTGATTGATTCCGGCAGCGACATTGTATGGGTGCAATGCCAGCCCTGCAGCGAATGCTACCAACAATCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCACCTACGCCGGAATCTCCTGTGACTCATCAGTGTGTGACCGCCTCGACAACGCAGGCTGCAACGATGGCCGGTGCCGGTACGAGGTGTCATACGGCGACGGATCCTACACCCGCGGCACCCTCGCGCTCGAAACCTTAACTTTCGGCCGGGTCCTAATCCGAAACATCGCGATTGGGTGCGGCCATATGAACCGAGGAATGTTCATCGGAGCCGCAGGGTTGCTCGGTCTCGGCGGCGGCGCCATGTCATTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTAGTCAGTCGAGGCACAGAGTCCACAGGAACACTGGAATTCGGCCGCGGCGCTATGCCAGTCGGCGCCGCGTGGGTTCCCCTAATCCGAAATCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGTCTCGGAGTCGGAGGGATCCGAGTACCAATACCCGAACAAATCTTCGAACTCACCGATTTAGGGTACGGTGGTGTAGTAATGGACACCGGAACCGCCGTGACGAGGCTACCGGCGCCGGCGTACGAAGCATTCCGAGACACGTTCATCGGACAAACGGCAAACCTACCTCGATCGGACAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCTTTCTACTTCTCCGGTGGGCCAATACTGACGTTGCCGGCGAGGAATTTTCTAATCCCGGTGGACGGAGAAGGGACATTTTGCTTTGCATTTGCAGCATCGGCGTCAGGATTGTCGATCATAGGGAACATTCAACAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAACTATTTGTTAA

Coding sequence (CDS)

ATGCTACCTTTCCTTCTTCTTTCTCTTTTAGCCACCGCAGTCGCTTCGGTTGCCACTGGTCCTGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAGCAGAAACCGCACCCTCTAGACTACCACAAGATCTTGAACTCCATGAAAACTACCCTATTTTTGAGTTAGACAACAACAGCAGCCAAAGTCAATGGAAGCTCAAGCTTTTCCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCCCCGTCGTTTCAAGGAGCGTATCAGCAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCGCCTACTCTCCAGTGGCAGCGATGAGCAGGTGACGGACTTCGGATCGGATGTGGTCTCCGGCACGGAGCAGGGTAGCGGAGAGTACTTCGTAAGGATAGGCGTTGGCAGCCCGCCGAGGAGCCAATACGTAGTGATTGATTCCGGCAGCGACATTGTATGGGTGCAATGCCAGCCCTGCAGCGAATGCTACCAACAATCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCACCTACGCCGGAATCTCCTGTGACTCATCAGTGTGTGACCGCCTCGACAACGCAGGCTGCAACGATGGCCGGTGCCGGTACGAGGTGTCATACGGCGACGGATCCTACACCCGCGGCACCCTCGCGCTCGAAACCTTAACTTTCGGCCGGGTCCTAATCCGAAACATCGCGATTGGGTGCGGCCATATGAACCGAGGAATGTTCATCGGAGCCGCAGGGTTGCTCGGTCTCGGCGGCGGCGCCATGTCATTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTAGTCAGTCGAGGCACAGAGTCCACAGGAACACTGGAATTCGGCCGCGGCGCTATGCCAGTCGGCGCCGCGTGGGTTCCCCTAATCCGAAATCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGTCTCGGAGTCGGAGGGATCCGAGTACCAATACCCGAACAAATCTTCGAACTCACCGATTTAGGGTACGGTGGTGTAGTAATGGACACCGGAACCGCCGTGACGAGGCTACCGGCGCCGGCGTACGAAGCATTCCGAGACACGTTCATCGGACAAACGGCAAACCTACCTCGATCGGACAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCTTTCTACTTCTCCGGTGGGCCAATACTGACGTTGCCGGCGAGGAATTTTCTAATCCCGGTGGACGGAGAAGGGACATTTTGCTTTGCATTTGCAGCATCGGCGTCAGGATTGTCGATCATAGGGAACATTCAACAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAACTATTTGTTAA

Protein sequence

MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC
BLAST of CsaV3_5G021850 vs. NCBI nr
Match: XP_004150193.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 954.1 bits (2465), Expect = 1.8e-274
Identity = 476/476 (100.00%), Postives = 476/476 (100.00%), Query Frame = 0

Query: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60
           MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF
Sbjct: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60

Query: 61  ELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT 120
           ELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT
Sbjct: 61  ELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT 120

Query: 121 DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFD 180
           DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFD
Sbjct: 121 DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFD 180

Query: 181 PAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRN 240
           PAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRN
Sbjct: 181 PAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRN 240

Query: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR 300
           IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR 300

Query: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360
           GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA
Sbjct: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360

Query: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT 420
           VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT
Sbjct: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT 420

Query: 421 LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC
Sbjct: 421 LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476

BLAST of CsaV3_5G021850 vs. NCBI nr
Match: XP_008465249.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 929.5 bits (2401), Expect = 4.6e-267
Identity = 464/477 (97.27%), Postives = 469/477 (98.32%), Query Frame = 0

Query: 1   MLP-FLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPI 60
           MLP  LLL LLATAV+SVATGPAATYPATQLLNVKDTIKE ET PSRLPQDL LHENYP+
Sbjct: 1   MLPLLLLLPLLATAVSSVATGPAATYPATQLLNVKDTIKETETTPSRLPQDLNLHENYPL 60

Query: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQV 120
           FELDNNSSQSQWKLKLFHRDKLPLNFD +HPRRFKERISRDSKRVSSLLRLLS+ SDEQV
Sbjct: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDTNHPRRFKERISRDSKRVSSLLRLLSNASDEQV 120

Query: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180
           TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF
Sbjct: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180

Query: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIR 240
           DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR+LIR
Sbjct: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRILIR 240

Query: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300
           NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG
Sbjct: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300

Query: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360
           RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT
Sbjct: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360

Query: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420
           AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL
Sbjct: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420

Query: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC
Sbjct: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477

BLAST of CsaV3_5G021850 vs. NCBI nr
Match: XP_022990025.1 (protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita maxima])

HSP 1 Score: 859.8 bits (2220), Expect = 4.5e-246
Identity = 431/481 (89.60%), Postives = 449/481 (93.35%), Query Frame = 0

Query: 1   MLPFLLLSLL-----ATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHE 60
           MLPFLLL LL       A+ +VATG AA YPATQLL+VKDTIKE E  PSRLPQDLEL+E
Sbjct: 1   MLPFLLLLLLPPLLTRAAIDTVATGLAANYPATQLLHVKDTIKETEIKPSRLPQDLELYE 60

Query: 61  NYPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGS 120
           NYP  +   NS+Q+QWKL+LFHRDKLPLNFDPDH RRFKERI RD +RVSSLLR LS+GS
Sbjct: 61  NYPPID---NSTQNQWKLELFHRDKLPLNFDPDHCRRFKERIGRDVERVSSLLRRLSNGS 120

Query: 121 DEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQS 180
           DEQVT+FGSDV+SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQS
Sbjct: 121 DEQVTEFGSDVISGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQS 180

Query: 181 DPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR 240
           DPVFDPA SA+YAGISCDSSVC RLDNAGCNDGRCRYEVSYGDGSYTRG LALETLTFGR
Sbjct: 181 DPVFDPASSASYAGISCDSSVCGRLDNAGCNDGRCRYEVSYGDGSYTRGNLALETLTFGR 240

Query: 241 VLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGT 300
           VLIRNIAIGCGHMNRGMF+GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGT
Sbjct: 241 VLIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGT 300

Query: 301 LEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVM 360
           LEFGR AMPVG AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVM
Sbjct: 301 LEFGRSAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVM 360

Query: 361 DTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSG 420
           DTGTAVTRLP PAYEAFRDTFIGQTANLPRS  VSIFDTCY+LNGFVSVRVPTVSFYFSG
Sbjct: 361 DTGTAVTRLPVPAYEAFRDTFIGQTANLPRSREVSIFDTCYDLNGFVSVRVPTVSFYFSG 420

Query: 421 GPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTI 477
           GPILTLPARNFLIPVDGEGTFCFAFAAS SGLSIIGNIQQEGIQIS+DG+NGFVGFGP+I
Sbjct: 421 GPILTLPARNFLIPVDGEGTFCFAFAASPSGLSIIGNIQQEGIQISVDGANGFVGFGPSI 478

BLAST of CsaV3_5G021850 vs. NCBI nr
Match: XP_023514620.1 (protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 857.4 bits (2214), Expect = 2.2e-245
Identity = 430/480 (89.58%), Postives = 446/480 (92.92%), Query Frame = 0

Query: 1   MLPFLLL----SLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHEN 60
           MLPF LL     L   AV +VATG  A YPATQLL+VKDTIKE E  PSRLPQDLEL+EN
Sbjct: 1   MLPFFLLLLPPLLTRAAVDTVATGLVANYPATQLLHVKDTIKETEIKPSRLPQDLELYEN 60

Query: 61  YPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSD 120
           YP  +   NS+Q+QWKL+LFHRDKLPLNFDPDH RRFKERI RD +RVSSLLR LS+GSD
Sbjct: 61  YPPID---NSTQNQWKLELFHRDKLPLNFDPDHRRRFKERIGRDVQRVSSLLRRLSNGSD 120

Query: 121 EQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 180
           EQVTDFGSDV+SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD
Sbjct: 121 EQVTDFGSDVISGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 180

Query: 181 PVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV 240
           PVFDPA SA+YAGISCDSSVC RLDNAGCNDGRCRYEVSYGDGSYTRG LALETLTFGRV
Sbjct: 181 PVFDPASSASYAGISCDSSVCGRLDNAGCNDGRCRYEVSYGDGSYTRGNLALETLTFGRV 240

Query: 241 LIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 300
           LIRNIAIGCGHMNRGMF+GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL
Sbjct: 241 LIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 300

Query: 301 EFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 360
           EFGR AMPVG AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD
Sbjct: 301 EFGRSAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 360

Query: 361 TGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGG 420
           TGTAVTRLP PAYEAFRDTFIGQTANLPRS  VSIFDTCY+LNGFVSVRVPTVSFYFSGG
Sbjct: 361 TGTAVTRLPVPAYEAFRDTFIGQTANLPRSREVSIFDTCYDLNGFVSVRVPTVSFYFSGG 420

Query: 421 PILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           PILTLPARNFLIPVDGEGTFCFAFAAS SGLSIIGNIQQEGIQIS+DG+NGFVGFGP+IC
Sbjct: 421 PILTLPARNFLIPVDGEGTFCFAFAASPSGLSIIGNIQQEGIQISVDGANGFVGFGPSIC 477

BLAST of CsaV3_5G021850 vs. NCBI nr
Match: XP_022921534.1 (protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucurbita moschata])

HSP 1 Score: 853.2 bits (2203), Expect = 4.2e-244
Identity = 428/482 (88.80%), Postives = 445/482 (92.32%), Query Frame = 0

Query: 1   MLPFL------LLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELH 60
           MLPFL             AV +VA+  AA+YPATQLL+VKDTIKE E  PSRLPQDLELH
Sbjct: 1   MLPFLXXXXXXXXXXXRAAVDTVASSLAASYPATQLLHVKDTIKETEIKPSRLPQDLELH 60

Query: 61  ENYPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSG 120
           ENYP  +   NS+Q+QWKL+LFHRDKLPLNFDPDH RRFKERI RD +RVSSLLR LS+G
Sbjct: 61  ENYPPID---NSTQNQWKLELFHRDKLPLNFDPDHRRRFKERIGRDVERVSSLLRRLSNG 120

Query: 121 SDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQ 180
           SDEQVTDFGSDV+SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQ
Sbjct: 121 SDEQVTDFGSDVISGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQ 180

Query: 181 SDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFG 240
           SDPVFDPA SA+YAGISCDSSVC RLDNAGCNDGRCRYEVSYGDGSYTRG LALETLTFG
Sbjct: 181 SDPVFDPASSASYAGISCDSSVCGRLDNAGCNDGRCRYEVSYGDGSYTRGNLALETLTFG 240

Query: 241 RVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTG 300
           RVLIRNIAIGCGHMNRGMF+GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTG
Sbjct: 241 RVLIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTG 300

Query: 301 TLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVV 360
           TLEFGR AMPVG AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVV
Sbjct: 301 TLEFGRSAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVV 360

Query: 361 MDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFS 420
           MDTGTAVTRLP PAYEAFRDTFIGQTANLPRS  VSIFDTCY+LNGFVSVRVPTVSFYFS
Sbjct: 361 MDTGTAVTRLPVPAYEAFRDTFIGQTANLPRSREVSIFDTCYDLNGFVSVRVPTVSFYFS 420

Query: 421 GGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPT 477
           GGPILTLPARNFLIPVDGEGTFCFAFAAS SGLSIIGNIQQEGIQIS+DG+NGFVGFGP+
Sbjct: 421 GGPILTLPARNFLIPVDGEGTFCFAFAASPSGLSIIGNIQQEGIQISVDGANGFVGFGPS 479

BLAST of CsaV3_5G021850 vs. TAIR10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 632.5 bits (1630), Expect = 2.1e-181
Identity = 300/419 (71.60%), Postives = 355/419 (84.73%), Query Frame = 0

Query: 64  NNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLL------SSGSDE 123
           ++ S S++ L+L HRD+ P     +H  R   R+ RD+ RVS++LR +      SS S  
Sbjct: 52  SDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRY 111

Query: 124 QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP 183
           +V DFGSD+VSG +QGSGEYFVRIGVGSPPR QY+VIDSGSD+VWVQCQPC  CY+QSDP
Sbjct: 112 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDP 171

Query: 184 VFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVL 243
           VFDPA S +Y G+SC SSVCDR++N+GC+ G CRYEV YGDGSYT+GTLALETLTF + +
Sbjct: 172 VFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTV 231

Query: 244 IRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLE 303
           +RN+A+GCGH NRGMFIGAAGLLG+GGG+MSFVGQL GQTGGAF YCLVSRGT+STG+L 
Sbjct: 232 VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLV 291

Query: 304 FGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT 363
           FGR A+PVGA+WVPL+RNPRAPSFYYVGL GLGVGG+R+P+P+ +F+LT+ G GGVVMDT
Sbjct: 292 FGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDT 351

Query: 364 GTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGP 423
           GTAVTRLP  AY AFRD F  QTANLPR+  VSIFDTCY+L+GFVSVRVPTVSFYF+ GP
Sbjct: 352 GTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGP 411

Query: 424 ILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           +LTLPARNFL+PVD  GT+CFAFAAS +GLSIIGNIQQEGIQ+S DG+NGFVGFGP +C
Sbjct: 412 VLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CsaV3_5G021850 vs. TAIR10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 405.6 bits (1041), Expect = 4.2e-113
Identity = 205/472 (43.43%), Postives = 300/472 (63.56%), Query Frame = 0

Query: 21  PAATYPATQLLNVKDTIKEAE-TAPSRLPQDLELHENYPIFELDNNSSQSQWKLKLFHRD 80
           P  +   T +LNV D+I   + T+  RL Q           E   +S+ S + L+L  R 
Sbjct: 26  PETSTTTTSILNVADSIHRTKYTSSFRLNQQ----------EEQTHSASSSFSLQLHSRV 85

Query: 81  KLPLNFDPDHPRRFKERISRDSKRVSSL---------------LRLLSSGSDEQVTDFGS 140
            +      D+      R++RD+ RV SL               L+ +S+    +  D  +
Sbjct: 86  SVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEA 145

Query: 141 DVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGS 200
            ++SGT QGSGEYF R+G+G P R  Y+V+D+GSD+ W+QC PC++CY Q++P+F+P+ S
Sbjct: 146 PLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSS 205

Query: 201 ATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIG 260
           ++Y  +SCD+  C+ L+ + C +  C YEVSYGDGSYT G  A ETLT G  L++N+A+G
Sbjct: 206 SSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVG 265

Query: 261 CGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMP 320
           CGH N G+F+GAAGLLGLGGG ++   QL      +FSYCLV R ++S  T++FG    P
Sbjct: 266 CGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGTSLSP 325

Query: 321 VGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRL 380
             A   PL+RN +  +FYY+GL+G+ VGG  + IP+  FE+ + G GG+++D+GTAVTRL
Sbjct: 326 -DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRL 385

Query: 381 PAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 440
               Y + RD+F+  T +L ++  V++FDTCYNL+   +V VPTV+F+F GG +L LPA+
Sbjct: 386 QTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAK 445

Query: 441 NFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           N++IPVD  GTFC AFA +AS L+IIGN+QQ+G +++ D +N  +GF    C
Sbjct: 446 NYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of CsaV3_5G021850 vs. TAIR10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 401.4 bits (1030), Expect = 8.0e-112
Identity = 218/496 (43.95%), Postives = 303/496 (61.09%), Query Frame = 0

Query: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60
           +L  + LSL  T     ++   +T P T +L+V  ++++ +T  S  P    L    P  
Sbjct: 9   LLAVVTLSLFLT-TTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPES 68

Query: 61  ELDN--NSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSL---LRLLSSGS 120
             D    +S S   L+L  RD    +   D+      R+ RDS RV+ +   +R    G 
Sbjct: 69  LSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGV 128

Query: 121 DE-------------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVW 180
           D              Q  D  + VVSG  QGSGEYF RIGVG+P +  Y+V+D+GSD+ W
Sbjct: 129 DRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNW 188

Query: 181 VQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYT 240
           +QC+PC++CYQQSDPVF+P  S+TY  ++C +  C  L+ + C   +C Y+VSYGDGS+T
Sbjct: 189 IQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 248

Query: 241 RGTLALETLTFGRV-LIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAF 300
            G LA +T+TFG    I N+A+GCGH N G+F GAAGLLGLGGG +S   Q+      +F
Sbjct: 249 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM---KATSF 308

Query: 301 SYCLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQ 360
           SYCLV R +  + +L+F    +  G A  PL+RN +  +FYYVGLSG  VGG +V +P+ 
Sbjct: 309 SYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDA 368

Query: 361 IFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPR-SDRVSIFDTCYNLNG 420
           IF++   G GGV++D GTAVTRL   AY + RD F+  T NL + S  +S+FDTCY+ + 
Sbjct: 369 IFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSS 428

Query: 421 FVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQI 477
             +V+VPTV+F+F+GG  L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+G +I
Sbjct: 429 LSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRI 488

BLAST of CsaV3_5G021850 vs. TAIR10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 397.5 bits (1020), Expect = 1.2e-110
Identity = 212/412 (51.46%), Postives = 267/412 (64.81%), Query Frame = 0

Query: 75  LFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT------DFGSDVVS 134
           L H D L  N  PD    F  R+ RDS+RV S+  L +      VT       F S VVS
Sbjct: 76  LDHIDALSSNKTPD--ELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVS 135

Query: 135 GTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYA 194
           G  QGSGEYF R+GVG+P R  Y+V+D+GSDIVW+QC PC  CY QSDP+FDP  S TYA
Sbjct: 136 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 195

Query: 195 GISCDSSVCDRLDNAGCNDGR--CRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCG 254
            I C S  C RLD+AGCN  R  C Y+VSYGDGS+T G  + ETLTF R  ++ +A+GCG
Sbjct: 196 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG 255

Query: 255 HMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES-TGTLEFGRGAMPV 314
           H N G+F+GAAGLLGLG G +SF GQ G +    FSYCLV R   S   ++ FG  A+  
Sbjct: 256 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 315

Query: 315 GAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGVVMDTGTAVTRL 374
            A + PL+ NP+  +FYYVGL G+ VGG RVP +   +F+L  +G GGV++D+GT+VTRL
Sbjct: 316 IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRL 375

Query: 375 PAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 434
             PAY A RD F      L R+   S+FDTC++L+    V+VPTV  +F G  + +LPA 
Sbjct: 376 IRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV-SLPAT 435

Query: 435 NFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++  D ++  VGF P  C
Sbjct: 436 NYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CsaV3_5G021850 vs. TAIR10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 369.0 bits (946), Expect = 4.4e-102
Identity = 196/430 (45.58%), Postives = 265/430 (61.63%), Query Frame = 0

Query: 65  NSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT---- 124
           + S +   + L H D L    D      F  R+ RDS RV S+  L +  +    T    
Sbjct: 55  SESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTP 114

Query: 125 ----DFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 184
                F   V+SG  QGSGEYF+R+GVG+P  + Y+V+D+GSD+VW+QC PC  CY Q+D
Sbjct: 115 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD 174

Query: 185 PVFDPAGSATYAGISCDSSVCDRLDNAG-CNDGR---CRYEVSYGDGSYTRGTLALETLT 244
            +FDP  S T+A + C S +C RLD++  C   R   C Y+VSYGDGS+T G  + ETLT
Sbjct: 175 AIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 234

Query: 245 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSR---- 304
           F    + ++ +GCGH N G+F+GAAGLLGLG G +SF  Q   +  G FSYCLV R    
Sbjct: 235 FHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSG 294

Query: 305 -GTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELT 364
             ++   T+ FG  A+P  + + PL+ NP+  +FYY+ L G+ VGG RVP + E  F+L 
Sbjct: 295 SSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLD 354

Query: 365 DLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRV 424
             G GGV++D+GT+VTRL  PAY A RD F      L R+   S+FDTC++L+G  +V+V
Sbjct: 355 ATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKV 414

Query: 425 PTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSN 477
           PTV F+F GG + +LPA N+LIPV+ EG FCFAFA +   LSIIGNIQQ+G +++ D   
Sbjct: 415 PTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 474

BLAST of CsaV3_5G021850 vs. Swiss-Prot
Match: sp|Q9LHE3|ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 632.5 bits (1630), Expect = 3.8e-180
Identity = 300/419 (71.60%), Postives = 355/419 (84.73%), Query Frame = 0

Query: 64  NNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLL------SSGSDE 123
           ++ S S++ L+L HRD+ P     +H  R   R+ RD+ RVS++LR +      SS S  
Sbjct: 52  SDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRY 111

Query: 124 QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP 183
           +V DFGSD+VSG +QGSGEYFVRIGVGSPPR QY+VIDSGSD+VWVQCQPC  CY+QSDP
Sbjct: 112 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDP 171

Query: 184 VFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVL 243
           VFDPA S +Y G+SC SSVCDR++N+GC+ G CRYEV YGDGSYT+GTLALETLTF + +
Sbjct: 172 VFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTV 231

Query: 244 IRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLE 303
           +RN+A+GCGH NRGMFIGAAGLLG+GGG+MSFVGQL GQTGGAF YCLVSRGT+STG+L 
Sbjct: 232 VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLV 291

Query: 304 FGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT 363
           FGR A+PVGA+WVPL+RNPRAPSFYYVGL GLGVGG+R+P+P+ +F+LT+ G GGVVMDT
Sbjct: 292 FGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDT 351

Query: 364 GTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGP 423
           GTAVTRLP  AY AFRD F  QTANLPR+  VSIFDTCY+L+GFVSVRVPTVSFYF+ GP
Sbjct: 352 GTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGP 411

Query: 424 ILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           +LTLPARNFL+PVD  GT+CFAFAAS +GLSIIGNIQQEGIQ+S DG+NGFVGFGP +C
Sbjct: 412 VLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CsaV3_5G021850 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.4e-110
Identity = 218/496 (43.95%), Postives = 303/496 (61.09%), Query Frame = 0

Query: 1   MLPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIF 60
           +L  + LSL  T     ++   +T P T +L+V  ++++ +T  S  P    L    P  
Sbjct: 9   LLAVVTLSLFLT-TTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPES 68

Query: 61  ELDN--NSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSL---LRLLSSGS 120
             D    +S S   L+L  RD    +   D+      R+ RDS RV+ +   +R    G 
Sbjct: 69  LSDPVFFNSSSPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGV 128

Query: 121 DE-------------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVW 180
           D              Q  D  + VVSG  QGSGEYF RIGVG+P +  Y+V+D+GSD+ W
Sbjct: 129 DRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNW 188

Query: 181 VQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYT 240
           +QC+PC++CYQQSDPVF+P  S+TY  ++C +  C  L+ + C   +C Y+VSYGDGS+T
Sbjct: 189 IQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFT 248

Query: 241 RGTLALETLTFGRV-LIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAF 300
            G LA +T+TFG    I N+A+GCGH N G+F GAAGLLGLGGG +S   Q+      +F
Sbjct: 249 VGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM---KATSF 308

Query: 301 SYCLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQ 360
           SYCLV R +  + +L+F    +  G A  PL+RN +  +FYYVGLSG  VGG +V +P+ 
Sbjct: 309 SYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDA 368

Query: 361 IFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPR-SDRVSIFDTCYNLNG 420
           IF++   G GGV++D GTAVTRL   AY + RD F+  T NL + S  +S+FDTCY+ + 
Sbjct: 369 IFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSS 428

Query: 421 FVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQI 477
             +V+VPTV+F+F+GG  L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+G +I
Sbjct: 429 LSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRI 488

BLAST of CsaV3_5G021850 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.1e-109
Identity = 212/412 (51.46%), Postives = 267/412 (64.81%), Query Frame = 0

Query: 75  LFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVT------DFGSDVVS 134
           L H D L  N  PD    F  R+ RDS+RV S+  L +      VT       F S VVS
Sbjct: 76  LDHIDALSSNKTPD--ELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVS 135

Query: 135 GTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYA 194
           G  QGSGEYF R+GVG+P R  Y+V+D+GSDIVW+QC PC  CY QSDP+FDP  S TYA
Sbjct: 136 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 195

Query: 195 GISCDSSVCDRLDNAGCNDGR--CRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCG 254
            I C S  C RLD+AGCN  R  C Y+VSYGDGS+T G  + ETLTF R  ++ +A+GCG
Sbjct: 196 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG 255

Query: 255 HMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES-TGTLEFGRGAMPV 314
           H N G+F+GAAGLLGLG G +SF GQ G +    FSYCLV R   S   ++ FG  A+  
Sbjct: 256 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 315

Query: 315 GAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGVVMDTGTAVTRL 374
            A + PL+ NP+  +FYYVGL G+ VGG RVP +   +F+L  +G GGV++D+GT+VTRL
Sbjct: 316 IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRL 375

Query: 375 PAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 434
             PAY A RD F      L R+   S+FDTC++L+    V+VPTV  +F G  + +LPA 
Sbjct: 376 IRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV-SLPAT 435

Query: 435 NFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++  D ++  VGF P  C
Sbjct: 436 NYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CsaV3_5G021850 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 3.6e-69
Identity = 152/389 (39.07%), Postives = 217/389 (55.78%), Query Frame = 0

Query: 94  KERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVV 153
           K  I R  +R+ S+  +L S S  +   +          G GEY + + +G+P  S   +
Sbjct: 62  KRAIKRGERRMRSINAMLQSSSGIETPVYA---------GDGEYLMNVAIGTPDSSFSAI 121

Query: 154 IDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYE 213
           +D+GSD++W QC+PC++C+ Q  P+F+P  S++++ + C+S  C  L +  CN+  C+Y 
Sbjct: 122 MDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYT 181

Query: 214 VSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIG-AAGLLGLGGGAMSFVGQ 273
             YGDGS T+G +A ET TF    + NIA GCG  N+G   G  AGL+G+G G +S   Q
Sbjct: 182 YGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ 241

Query: 274 LGGQTGGAFSYCLVSRGTESTGTLEFGRGA--MPVGAAWVPLIRNPRAPSFYYVGLSGLG 333
           LG    G FSYC+ S G+ S  TL  G  A  +P G+    LI +   P++YY+ L G+ 
Sbjct: 242 LG---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGIT 301

Query: 334 VGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVS 393
           VGG  + IP   F+L D G GG+++D+GT +T LP  AY A    F  Q  NLP  D  S
Sbjct: 302 VGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-INLPTVDESS 361

Query: 394 I-FDTCYNL-NGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASAS-GL 453
               TC+   +   +V+VP +S  F GG +L L  +N LI    EG  C A  +S+  G+
Sbjct: 362 SGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILIS-PAEGVICLAMGSSSQLGI 421

Query: 454 SIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           SI GNIQQ+  Q+  D  N  V F PT C
Sbjct: 422 SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CsaV3_5G021850 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 252.7 bits (644), Expect = 8.3e-66
Identity = 142/386 (36.79%), Postives = 217/386 (56.22%), Query Frame = 0

Query: 97  ISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDS 156
           I R S+R+  L  +L+  S  + + +          G GEY + + +G+P +    ++D+
Sbjct: 64  IERGSRRLQRLEAMLNGPSGVETSVYA---------GDGEYLMNLSIGTPAQPFSAIMDT 123

Query: 157 GSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSY 216
           GSD++W QCQPC++C+ QS P+F+P GS++++ + C S +C  L +  C++  C+Y   Y
Sbjct: 124 GSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGY 183

Query: 217 GDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIG-AAGLLGLGGGAMSFVGQLGG 276
           GDGS T+G++  ETLTFG V I NI  GCG  N+G   G  AGL+G+G G +S   QL  
Sbjct: 184 GDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV 243

Query: 277 QTGGAFSYCLVSRGTESTGTLEFG--RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGG 336
                FSYC+   G+ +   L  G    ++  G+    LI++ + P+FYY+ L+GL VG 
Sbjct: 244 T---KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 303

Query: 337 IRVPIPEQIFEL-TDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLP-RSDRVSI 396
            R+PI    F L ++ G GG+++D+GT +T     AY++ R  FI Q  NLP  +   S 
Sbjct: 304 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNGSSSG 363

Query: 397 FDTCYNLNGFVS-VRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSII 456
           FD C+      S +++PT   +F GG  L LP+ N+ I     G  C A  +S+ G+SI 
Sbjct: 364 FDLCFQTPSDPSNLQIPTFVMHFDGGD-LELPSENYFIS-PSNGLICLAMGSSSQGMSIF 423

Query: 457 GNIQQEGIQISIDGSNGFVGFGPTIC 477
           GNIQQ+ + +  D  N  V F    C
Sbjct: 424 GNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CsaV3_5G021850 vs. TrEMBL
Match: tr|A0A1S3CNF7|A0A1S3CNF7_CUCME (protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Cucumis melo OX=3656 GN=LOC103502903 PE=3 SV=1)

HSP 1 Score: 929.5 bits (2401), Expect = 3.1e-267
Identity = 464/477 (97.27%), Postives = 469/477 (98.32%), Query Frame = 0

Query: 1   MLP-FLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPI 60
           MLP  LLL LLATAV+SVATGPAATYPATQLLNVKDTIKE ET PSRLPQDL LHENYP+
Sbjct: 1   MLPLLLLLPLLATAVSSVATGPAATYPATQLLNVKDTIKETETTPSRLPQDLNLHENYPL 60

Query: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQV 120
           FELDNNSSQSQWKLKLFHRDKLPLNFD +HPRRFKERISRDSKRVSSLLRLLS+ SDEQV
Sbjct: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDTNHPRRFKERISRDSKRVSSLLRLLSNASDEQV 120

Query: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180
           TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF
Sbjct: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180

Query: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIR 240
           DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR+LIR
Sbjct: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRILIR 240

Query: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300
           NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG
Sbjct: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300

Query: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360
           RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT
Sbjct: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360

Query: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420
           AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL
Sbjct: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420

Query: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477
           TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC
Sbjct: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477

BLAST of CsaV3_5G021850 vs. TrEMBL
Match: tr|A0A2P5R9B1|A0A2P5R9B1_GOSBA (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD19674 PE=3 SV=1)

HSP 1 Score: 692.2 bits (1785), Expect = 8.2e-196
Identity = 346/484 (71.49%), Postives = 404/484 (83.47%), Query Frame = 0

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           LP +L+++L   ++SVAT   A+YP  QLLNVK T+   E     +P+ L+  E++ +  
Sbjct: 7   LPIVLVAMLHLTLSSVAT---ASYPDFQLLNVKQTLIGTE-----IPRPLQTSEHHQV-- 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
            D + +Q +WKLKL HRDKL  N      DH RRF  R+ RD KRV+SLLR LS G    
Sbjct: 67  SDVSETQGKWKLKLVHRDKLSSNTSATFRDHSRRFHARMQRDVKRVASLLRRLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVR+GVGSPP+SQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GGAAYEVNDFGSDVVSGMDQGSGEYFVRLGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+YAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+LEFGRGAMPVGAAWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGG
Sbjct: 307 SGSLEFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCYNL+ FV++RVPTVSFY
Sbjct: 367 VVMDTGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYNLSDFVTIRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFG 480

BLAST of CsaV3_5G021850 vs. TrEMBL
Match: tr|A0A1U8N9X5|A0A1U8N9X5_GOSHI (protein ASPARTIC PROTEASE IN GUARD CELL 2-like OS=Gossypium hirsutum OX=3635 GN=LOC107946097 PE=3 SV=1)

HSP 1 Score: 689.9 bits (1779), Expect = 4.1e-195
Identity = 345/484 (71.28%), Postives = 403/484 (83.26%), Query Frame = 0

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           LP +L+++L   ++SVAT   A+YP  QLLNVK T+   E     +P+ L+  E++ +  
Sbjct: 7   LPIVLVAMLHLTLSSVAT---ASYPDFQLLNVKQTLIGTE-----IPRPLQTSEHHQV-- 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
            D + +Q +WKLKL HRDKL  N      DH RR   R+ RD KRV+SLLR LS G    
Sbjct: 67  SDVSETQGKWKLKLVHRDKLSSNTSATFRDHSRRLHARMQRDVKRVASLLRRLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVR+GVGSPP+SQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GGAAYEVNDFGSDVVSGMDQGSGEYFVRLGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+YAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+LEFGRGAMPVGAAWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGG
Sbjct: 307 SGSLEFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCYNL+ FV++RVPTVSFY
Sbjct: 367 VVMDTGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYNLSDFVTIRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFG 480

BLAST of CsaV3_5G021850 vs. TrEMBL
Match: tr|A0A0D2T690|A0A0D2T690_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_008G147500 PE=3 SV=1)

HSP 1 Score: 689.5 bits (1778), Expect = 5.3e-195
Identity = 344/484 (71.07%), Postives = 403/484 (83.26%), Query Frame = 0

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           LP +L+++L   ++S AT   A+YP  QLLNVK T+       +++P+ L+  E++ +  
Sbjct: 7   LPIVLVAMLHLTLSSAAT---ASYPDFQLLNVKQTL-----IGTKIPRPLQTSEHHQV-- 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
            D + +Q +WKLKL HRDKL  N      DH RRF  R+ RD KRV+SLLR LS G    
Sbjct: 67  SDVSETQGKWKLKLVHRDKLSSNTSATFRDHSRRFHARMQRDVKRVASLLRRLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVRIGVGSPP+SQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GGAAYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+YAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+LEFGRGAMPVGAAWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGG
Sbjct: 307 SGSLEFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCY L+ FV++RVPTVSFY
Sbjct: 367 VVMDTGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYKLSDFVTIRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFG 480

BLAST of CsaV3_5G021850 vs. TrEMBL
Match: tr|A0A061E1W9|A0A061E1W9_THECC (Aspartic proteinase nepenthesin-1, putative OS=Theobroma cacao OX=3641 GN=TCM_007665 PE=3 SV=1)

HSP 1 Score: 688.7 bits (1776), Expect = 9.1e-195
Identity = 349/484 (72.11%), Postives = 401/484 (82.85%), Query Frame = 0

Query: 2   LPFLLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFE 61
           L  +L+++L   V+ +AT   A++P  QLLNVK T+   +  P+ L +  E HE     E
Sbjct: 7   LAMILVAVLQLTVSGIAT---ASHPDFQLLNVKQTLIGTK-KPTPL-KTFEYHEQSNASE 66

Query: 62  LDNNSSQSQWKLKLFHRDKLPLNFDP---DHPRRFKERISRDSKRVSSLLRLLSSGSDE- 121
               S Q +WKLKL HRDKL  N      DH  RF  R+ RD KRV+SL+RLLS G    
Sbjct: 67  ----SDQGKWKLKLVHRDKLFSNTTTAFHDHSHRFLARMQRDVKRVASLVRLLSGGGGHD 126

Query: 122 -----QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECY 181
                +V DFGSDVVSG +QGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPC++CY
Sbjct: 127 GDAAYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCY 186

Query: 182 QQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLT 241
           +QSDPVFDPA SA+Y+G+SC SSVCDR++N+GC+ GRCRYEV YGDGSYT+GTLALETLT
Sbjct: 187 RQSDPVFDPANSASYSGVSCTSSVCDRIENSGCHAGRCRYEVMYGDGSYTKGTLALETLT 246

Query: 242 FGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES 301
           FGR +++N+AIGCGH+NRGMFIGAAGLLG+GGG+MS VGQLGGQTGGAFSYCLVSRG+++
Sbjct: 247 FGRTVVKNVAIGCGHINRGMFIGAAGLLGVGGGSMSLVGQLGGQTGGAFSYCLVSRGSDA 306

Query: 302 TGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGG 361
           +G+L FGRGAMPVGAAWVPL+RNPRAPSFYYVGLSGLGVGGIRVP+ E  F L++LGYGG
Sbjct: 307 SGSLVFGRGAMPVGAAWVPLLRNPRAPSFYYVGLSGLGVGGIRVPVSEDTFRLSELGYGG 366

Query: 362 VVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFY 421
           VVMDTGTAVTR P  AY AFRD F+ QTANLPR+  VSIFDTCYNL+GFVSVRVPTVSFY
Sbjct: 367 VVMDTGTAVTRFPTLAYNAFRDAFVAQTANLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 426

Query: 422 FSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFG 477
           FSGGPILTLPARNFLIPVD  GTFCFAFA+SASGLSIIGNIQQEGIQIS DG+NGFVGFG
Sbjct: 427 FSGGPILTLPARNFLIPVDDVGTFCFAFASSASGLSIIGNIQQEGIQISFDGANGFVGFG 481

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004150193.11.8e-274100.00PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
XP_008465249.14.6e-26797.27PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo][more]
XP_022990025.14.5e-24689.60protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita maxima][more]
XP_023514620.12.2e-24589.58protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita pepo subsp. pepo][more]
XP_022921534.14.2e-24488.80protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT3G20015.12.1e-18171.60Eukaryotic aspartyl protease family protein[more]
AT1G25510.14.2e-11343.43Eukaryotic aspartyl protease family protein[more]
AT3G18490.18.0e-11243.95Eukaryotic aspartyl protease family protein[more]
AT1G01300.11.2e-11051.46Eukaryotic aspartyl protease family protein[more]
AT3G61820.14.4e-10245.58Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
sp|Q9LHE3|ASPG2_ARATH3.8e-18071.60Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LS40|ASPG1_ARATH1.4e-11043.95Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LNJ3|APF2_ARATH2.1e-10951.46Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q766C2|NEP2_NEPGR3.6e-6939.07Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
sp|Q766C3|NEP1_NEPGR8.3e-6636.79Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3CNF7|A0A1S3CNF7_CUCME3.1e-26797.27protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Cucumis melo OX=3656 GN=LOC10350290... [more]
tr|A0A2P5R9B1|A0A2P5R9B1_GOSBA8.2e-19671.49Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD19674 PE=3 SV... [more]
tr|A0A1U8N9X5|A0A1U8N9X5_GOSHI4.1e-19571.28protein ASPARTIC PROTEASE IN GUARD CELL 2-like OS=Gossypium hirsutum OX=3635 GN=... [more]
tr|A0A0D2T690|A0A0D2T690_GOSRA5.3e-19571.07Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_008G147500 PE=3 ... [more]
tr|A0A061E1W9|A0A061E1W9_THECC9.1e-19572.11Aspartic proteinase nepenthesin-1, putative OS=Theobroma cacao OX=3641 GN=TCM_00... [more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR001461Aspartic_peptidase_A1
IPR032861TAXi_N
IPR032799TAXi_C
IPR021109Peptidase_aspartic_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_5G021850.1CsaV3_5G021850.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 114..299
e-value: 4.0E-48
score: 166.1
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 306..476
e-value: 2.9E-43
score: 149.5
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 132..476
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 322..472
e-value: 1.3E-30
score: 106.2
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 137..299
e-value: 1.5E-50
score: 171.8
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 47..476
NoneNo IPR availablePANTHERPTHR13683:SF265PROTEIN ASPARTIC PROTEASE IN GUARD CELL 2coord: 47..476
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 152..163
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 137..472
score: 44.529

The following gene(s) are paralogous to this gene:

None