CsaV3_6G041050 (gene) Cucumber (Chinese Long) v3

NameCsaV3_6G041050
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionaspartic proteinase nepenthesin-2-like
Locationchr6 : 23794830 .. 23797012 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CGATTTATTGCAATCAAAATTAAGAGCCCCTAAACTTTAACAGAGCAGACCAAACATTAAAACCTGTACTGACCAAAATCAAAATGGATTAAAATAGTAATTCAATCCAAACCTTTATTGTCCCACTCACTTAATACTCATATGCAATAAAGGATATAATGATTCTTAGCTATGGTCAAACCATACTAATTTTACCATACAATATTAATCCAATTGATTAAATTTCAGTGCTCTCTAACCCTATATCCTATATGGTTGCATGATAATTGATTTAGTCCAAAAGTGATTTTAACAATTTTAAAATGAGTTTGGGAAGAAGAAACAATTGAAACATAGATGTCATTCATGTAGGTATATAACGTGGCTTCAAGTGTCAATACGGTGGGAGTGAAACTTGTCTATTTCCCTCTTCTTCCATATAACGAAACGACTAATACTCAACCCAAAAGCTTGTGCAAAGCAAACCCTTTGTTGATCTTCATCTTCTCGATTCCATGGAGTTTTTACCCATTCCATTTCTCTTTTCCATTTTTCTCCTTCTTCCCACTTCATCTTCTTCCTCCACCACAGTACTCCCACTCACTACTTTTCCTTCAGTTTCATTTACAGATCCATTCAAAACCATCAACCTTCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAAACCCCACAATCAAAGTCCAACACTTCCATACAGAATGTCTCTCTCTTCCCTCGTAGCTACGGAGCTTACTCGGTTTCCCTCGCCTTCGGAACTCCACCGCAGAACTTATCGTTTATCTTCGATACTGGAAGTAGTCTCGTCTGGTTCCCCTGCACCGCTGGTTATCGTTGTTCCCGTTGTTCGTTTCCCTATGTGGATCCTGCAACGATTTCGAAATTTGTTCCTAAGTTATCTTCCTCTGTGAAGGTTGTTGGTTGTCGAAATCCTAAATGTGCGTGGATTTTTGGCCCTAATTTGAAGTCCAGATGTAGAAATTGTAACTCTAAATCTCGAAAATGTTCCGATTCTTGTCCTGGTTATGGACTTCAGTACGGCTCTGGCGCAACCGCTGGAATTCTCCTCTCTGAAACGCTCGATTTAGAGAATAAACGAGTGCCGGATTTTCTCGTTGGTTGTTCCGTTATGTCTGTTCATCAACCAGCCGGCATTGCCGGATTTGGCCGCGGTCCTGAATCGTTGCCGTCCCAAATGCGACTCAAACGATTCTCCCATTGCCTCGTTTCTCGCGGGTTCGACGACTCGCCAGTGAGTAGTCCTCTAGTACTTGACTCCGGTTCGGAATCCGATGAATCGAAAACTAAGAGTTTCATTTACGCACCCTTCCGAGAGAATCCATCAGTATCCAACGCCGCATTTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCCTCATCGGTGGAAAGCCGGTGAAATTCCCGTACAAGTATCTCGTGCCGGATTCCACCGGAAACGGAGGCGCGATAATCGATTCCGGTTCAACGTTTACGTTTCTAGATAAGCCGATTTTCGAAGCCATAGCGGATGAATTGGAGAAGCAGCTGGTGAAATATCCTCGAGCTAAGGACGTTGAAGCGCAGTCGGGTTTGAGGCCATGCTTTAATATTCCCAAGGAGGAGGAATCAGCGGAGTTTCCGGACGTGGTTTTGAAGTTTAAAGGTGGAGGGAAGCTGAGTTTGGCGGCAGAGAATTACTTGGCGATGGTGACGGATGAGGGCGTGGTGTGCTTGACGATGATGACGGATGAAGCCGTCGTGGGCGGAGGCGGAGGGCCGGCGATTATATTGGGGGCGTTTCAGCAGCAGAATGTTTTGGTTGAGTATGATTTAGCAAAGCAGCGAATCGGATTTCGGAAGCAGAAATGCACGTGAGAATTGAGTTTGTGTTACTGAAAAATATAATGTGGTAAAAGAATAACAAAGTAAAAGCAGTCAATTTGGTATAGTTTAACCCATCATTCAAGACTTTTAACTTTCCATGGTGTACGTTTCAGATGTTTTGGGCTTGGGCCTTCTCTTGGCAACCATCGCCAAACTTAATAAATAAACATTACCATCTATAACCTAAGTTTAATTTCAATTGTTATTTTCATATTACATTTACATTAGTCCTGACAACGTACCCATGTATTAACAAGTAGATCATATGTTATAAGACCTAATTATAATGACTCG

mRNA sequence

ATGGAGTTTTTACCCATTCCATTTCTCTTTTCCATTTTTCTCCTTCTTCCCACTTCATCTTCTTCCTCCACCACAGTACTCCCACTCACTACTTTTCCTTCAGTTTCATTTACAGATCCATTCAAAACCATCAACCTTCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAAACCCCACAATCAAAGTCCAACACTTCCATACAGAATGTCTCTCTCTTCCCTCGTAGCTACGGAGCTTACTCGGTTTCCCTCGCCTTCGGAACTCCACCGCAGAACTTATCGTTTATCTTCGATACTGGAAGTAGTCTCGTCTGGTTCCCCTGCACCGCTGGTTATCGTTGTTCCCGTTGTTCGTTTCCCTATGTGGATCCTGCAACGATTTCGAAATTTGTTCCTAAGTTATCTTCCTCTGTGAAGGTTGTTGGTTGTCGAAATCCTAAATGTGCGTGGATTTTTGGCCCTAATTTGAAGTCCAGATGTAGAAATTGTAACTCTAAATCTCGAAAATGTTCCGATTCTTGTCCTGGTTATGGACTTCAGTACGGCTCTGGCGCAACCGCTGGAATTCTCCTCTCTGAAACGCTCGATTTAGAGAATAAACGAGTGCCGGATTTTCTCGTTGGTTGTTCCGTTATGTCTGTTCATCAACCAGCCGGCATTGCCGGATTTGGCCGCGGTCCTGAATCGTTGCCGTCCCAAATGCGACTCAAACGATTCTCCCATTGCCTCGTTTCTCGCGGGTTCGACGACTCGCCAGTGAGTAGTCCTCTAGTACTTGACTCCGGTTCGGAATCCGATGAATCGAAAACTAAGAGTTTCATTTACGCACCCTTCCGAGAGAATCCATCAGTATCCAACGCCGCATTTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCCTCATCGGTGGAAAGCCGGTGAAATTCCCGTACAAGTATCTCGTGCCGGATTCCACCGGAAACGGAGGCGCGATAATCGATTCCGGTTCAACGTTTACGTTTCTAGATAAGCCGATTTTCGAAGCCATAGCGGATGAATTGGAGAAGCAGCTGGTGAAATATCCTCGAGCTAAGGACGTTGAAGCGCAGTCGGGTTTGAGGCCATGCTTTAATATTCCCAAGGAGGAGGAATCAGCGGAGTTTCCGGACGTGGTTTTGAAGTTTAAAGGTGGAGGGAAGCTGAGTTTGGCGGCAGAGAATTACTTGGCGATGGTGACGGATGAGGGCGTGGTGTGCTTGACGATGATGACGGATGAAGCCGTCGTGGGCGGAGGCGGAGGGCCGGCGATTATATTGGGGGCGTTTCAGCAGCAGAATGTTTTGGTTGAGTATGATTTAGCAAAGCAGCGAATCGGATTTCGGAAGCAGAAATGCACGTGA

Coding sequence (CDS)

ATGGAGTTTTTACCCATTCCATTTCTCTTTTCCATTTTTCTCCTTCTTCCCACTTCATCTTCTTCCTCCACCACAGTACTCCCACTCACTACTTTTCCTTCAGTTTCATTTACAGATCCATTCAAAACCATCAACCTTCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAAACCCCACAATCAAAGTCCAACACTTCCATACAGAATGTCTCTCTCTTCCCTCGTAGCTACGGAGCTTACTCGGTTTCCCTCGCCTTCGGAACTCCACCGCAGAACTTATCGTTTATCTTCGATACTGGAAGTAGTCTCGTCTGGTTCCCCTGCACCGCTGGTTATCGTTGTTCCCGTTGTTCGTTTCCCTATGTGGATCCTGCAACGATTTCGAAATTTGTTCCTAAGTTATCTTCCTCTGTGAAGGTTGTTGGTTGTCGAAATCCTAAATGTGCGTGGATTTTTGGCCCTAATTTGAAGTCCAGATGTAGAAATTGTAACTCTAAATCTCGAAAATGTTCCGATTCTTGTCCTGGTTATGGACTTCAGTACGGCTCTGGCGCAACCGCTGGAATTCTCCTCTCTGAAACGCTCGATTTAGAGAATAAACGAGTGCCGGATTTTCTCGTTGGTTGTTCCGTTATGTCTGTTCATCAACCAGCCGGCATTGCCGGATTTGGCCGCGGTCCTGAATCGTTGCCGTCCCAAATGCGACTCAAACGATTCTCCCATTGCCTCGTTTCTCGCGGGTTCGACGACTCGCCAGTGAGTAGTCCTCTAGTACTTGACTCCGGTTCGGAATCCGATGAATCGAAAACTAAGAGTTTCATTTACGCACCCTTCCGAGAGAATCCATCAGTATCCAACGCCGCATTTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCCTCATCGGTGGAAAGCCGGTGAAATTCCCGTACAAGTATCTCGTGCCGGATTCCACCGGAAACGGAGGCGCGATAATCGATTCCGGTTCAACGTTTACGTTTCTAGATAAGCCGATTTTCGAAGCCATAGCGGATGAATTGGAGAAGCAGCTGGTGAAATATCCTCGAGCTAAGGACGTTGAAGCGCAGTCGGGTTTGAGGCCATGCTTTAATATTCCCAAGGAGGAGGAATCAGCGGAGTTTCCGGACGTGGTTTTGAAGTTTAAAGGTGGAGGGAAGCTGAGTTTGGCGGCAGAGAATTACTTGGCGATGGTGACGGATGAGGGCGTGGTGTGCTTGACGATGATGACGGATGAAGCCGTCGTGGGCGGAGGCGGAGGGCCGGCGATTATATTGGGGGCGTTTCAGCAGCAGAATGTTTTGGTTGAGTATGATTTAGCAAAGCAGCGAATCGGATTTCGGAAGCAGAAATGCACGTGA

Protein sequence

MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT
BLAST of CsaV3_6G041050 vs. NCBI nr
Match: XP_011657732.1 (PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus] >KGN48299.1 hypothetical protein Csa_6G454470 [Cucumis sativus])

HSP 1 Score: 883.6 bits (2282), Expect = 2.8e-253
Identity = 461/461 (100.00%), Postives = 461/461 (100.00%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60
           MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60

Query: 61  PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS 120
           PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS
Sbjct: 61  PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS 120

Query: 121 FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG 180
           FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG
Sbjct: 121 FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG 180

Query: 181 LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240
           LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR
Sbjct: 181 LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240

Query: 241 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 300
           FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR
Sbjct: 241 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 300

Query: 301 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 360
           RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 360

Query: 361 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 420
           DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX
Sbjct: 361 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 420

Query: 421 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
           XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT
Sbjct: 421 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 461

BLAST of CsaV3_6G041050 vs. NCBI nr
Match: XP_008462617.1 (PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis melo])

HSP 1 Score: 813.9 bits (2101), Expect = 2.8e-232
Identity = 405/461 (87.85%), Postives = 421/461 (91.32%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60
           MEFLPIPFLFSIFLLLPTSSSSS T LPL TFPS+ FTDP KTIN LLSASL+RAQHLK+
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSIT-LPLATFPSIPFTDPLKTINHLLSASLSRAQHLKS 60

Query: 61  PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS 120
           PQSKSNTS +NVSLFPRSYGAY+VSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRC+ CS
Sbjct: 61  PQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAHCS 120

Query: 121 FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG 180
           FP+VDPATISKFVPKLSSSVK+VGCRNPKCAWIFGPNLKSRCRNCN KSRKCSDSCPGYG
Sbjct: 121 FPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPGYG 180

Query: 181 LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240
           +QYGSGATAGILLSETLDL+NKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR
Sbjct: 181 IQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240

Query: 241 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 300
           FSHCL+ RGFDDSPVSSPLVLDSG ESDESKTKSFIYAPF+ENPS SN AFREYYYLSLR
Sbjct: 241 FSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLSLR 300

Query: 301 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 360
           RILIGGKPVKFPYKYLVPDSTG GGAIIDSGSTFTFLDKPIFEAIA ELEKQLVKYPRAK
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPRAK 360

Query: 361 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 420
           D+EA++GLRPCFNI KEEESAEFP+V LKFKGGGKLSL  ENYL MVTD  VVCLT    
Sbjct: 361 DIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMMTN 420

Query: 421 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
                         GAFQQQNVLVEYDLAKQRIGFRKQKCT
Sbjct: 421 AEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of CsaV3_6G041050 vs. NCBI nr
Match: XP_023543736.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 701.0 bits (1808), Expect = 2.6e-198
Identity = 351/463 (75.81%), Postives = 382/463 (82.51%), Query Frame = 0

Query: 1   MEFLPIPFLFS--IFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHL 60
           MEF PIPFL S                  PLT FPS+ FT P+K I  L+SASL RAQHL
Sbjct: 1   MEFFPIPFLLSXXXXXXXXXXXXXXXXXXPLTVFPSLPFTHPWKNIKHLVSASLTRAQHL 60

Query: 61  KTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSR 120
           KTP+ KSNTSIQNV+LFPRSYGAYS+SLAFGTPPQ+LS +FDTGSSLVWFPCTAGYRCS 
Sbjct: 61  KTPRIKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPG 180
           CSFP VD ATI KF+PKLSSS K++GCRN KC+WIFGPNLKS CR+C+ +SRKCSD+CPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKSLCRSCSPRSRKCSDTCPG 180

Query: 181 YGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL 240
           YG+QYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM L
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLS 300
           KRFSHCLV R FDDSPVSSPLVLDS SES ESK  S IYAPFRENPS SNAAFREYYYL+
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPR 360
           LRRILIG KPVKFPYKYLVP+S GNGGAIIDSGSTFTFLDKPIFEA+A+ELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXX 420
           AK VEA+SGLRPCF+I K EES EFP+++LKFKGG  L+L   NYLA+VTD GVVCLT  
Sbjct: 361 AKGVEAESGLRPCFDISK-EESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420

Query: 421 XXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
                           GAFQQQNVLV+YDLAK RIGFRKQ+CT
Sbjct: 421 TDVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKDRIGFRKQRCT 462

BLAST of CsaV3_6G041050 vs. NCBI nr
Match: XP_022925946.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 694.1 bits (1790), Expect = 3.2e-196
Identity = 347/463 (74.95%), Postives = 381/463 (82.29%), Query Frame = 0

Query: 1   MEFLPIPFLFS--IFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHL 60
           MEF  IPFL S                 LPLT FPS+ F  P+K I  L+SASL RAQHL
Sbjct: 1   MEFFLIPFLLSXXXXXXXXXXXXXXXXXLPLTVFPSLPFAHPWKNIKHLVSASLTRAQHL 60

Query: 61  KTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSR 120
           KTP++KSNTSIQNV+LFPRSYGAYS+SLAFGTPPQ+LS +FDTGSSLVWFPCTAGYRCS 
Sbjct: 61  KTPRTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPG 180
           CSFP VD ATI KF+PKLSSS K++GCRN KC+WIFGPNLK+ CR+C+ +SRKCSD+CPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSAKIIGCRNRKCSWIFGPNLKTLCRSCSPRSRKCSDTCPG 180

Query: 181 YGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL 240
           YG+QYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM L
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVVSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLS 300
           KRFSHCLV R FDDSPVSSPLVLDS SES ESK  S IYAPFRENPS SNAAFREYYYL+
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSSESGESKNNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPR 360
           LRRILIG KPVKFPYKYLVP+S GNGGAIIDSGSTFTFLDKPIFEA+A+ELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXX 420
           AK VEA+SGLRPCF+I K EES EFP+++LKFKGG  L+L   NYLA+V D  VVCLT  
Sbjct: 361 AKGVEAESGLRPCFDISK-EESVEFPELILKFKGGATLALPPSNYLALVADTSVVCLTMI 420

Query: 421 XXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
                           GAFQQQNVLV+YDLAK+RIGFRKQ+CT
Sbjct: 421 TDVTFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of CsaV3_6G041050 vs. NCBI nr
Match: XP_022979057.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 654.4 bits (1687), Expect = 2.8e-184
Identity = 333/463 (71.92%), Postives = 365/463 (78.83%), Query Frame = 0

Query: 1   MEFLPIPFLFSI--FLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHL 60
           MEF PI FL SI                  LT FPS+  T P+K I  L+SASL RAQHL
Sbjct: 1   MEFFPIQFLLSIVXXXXXXXXXXXXXXXXXLTAFPSLPLTHPWKNIKHLVSASLARAQHL 60

Query: 61  KTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSR 120
           KTP++KSNTSIQNV+LFPRSYGAYS+SLAFGTPPQ+LS +FDTGSSLVWFPCTAGYRCS 
Sbjct: 61  KTPKTKSNTSIQNVALFPRSYGAYSISLAFGTPPQSLSLVFDTGSSLVWFPCTAGYRCSN 120

Query: 121 CSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPG 180
           CSFP VD ATI KF+PKLSSS +++GCRN KC+WIF                   D+CPG
Sbjct: 121 CSFPNVDAATIPKFIPKLSSSARIIGCRNRKCSWIFXXXXXXXXXXXXXXXXXXXDTCPG 180

Query: 181 YGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL 240
           YG+QYGSGATAG LLSETLD   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM L
Sbjct: 181 YGIQYGSGATAGFLLSETLDFPEKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMGL 240

Query: 241 KRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLS 300
           KRFSHCLV R FDDSPVSSPLVLDS  ES +SKT S IYAPFRENPS SNAAFREYYYL+
Sbjct: 241 KRFSHCLVPRQFDDSPVSSPLVLDSSPESGDSKTNSLIYAPFRENPSGSNAAFREYYYLT 300

Query: 301 LRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPR 360
           LRRILIG KPVKFPYKYLVP+S GNGGAIIDSGSTFTFLDKPIFEA+A+ELEKQLVKYPR
Sbjct: 301 LRRILIGRKPVKFPYKYLVPNSAGNGGAIIDSGSTFTFLDKPIFEAVAEELEKQLVKYPR 360

Query: 361 AKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXX 420
           AK VEA+SGLRPCF+I K EES EFP+++LKFKGG  L+L   NYLA+VTD GVVCLT  
Sbjct: 361 AKGVEAESGLRPCFDISK-EESVEFPELILKFKGGATLALPPANYLALVTDTGVVCLTMI 420

Query: 421 XXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
                           GAFQQQNVLV+YDLAK+RIGFRKQ+CT
Sbjct: 421 TDVNFLGGGGGPAIIFGAFQQQNVLVQYDLAKERIGFRKQRCT 462

BLAST of CsaV3_6G041050 vs. TAIR10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 450.7 bits (1158), Expect = 1.1e-126
Identity = 233/470 (49.57%), Postives = 306/470 (65.11%), Query Frame = 0

Query: 8   FLFSIFLLLPTSSSSSTTVLPLTTF--PSVSFTDPFKTINLLLSASLNRAQHLK------ 67
           F F IFL     S  S   LPL+ F     S  DP+ ++  L  +S+ RA  LK      
Sbjct: 7   FFFLIFL-----SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIK 66

Query: 68  ------TPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAG 127
                 +  + ++ ++    L  +SYG YSVSL+FGTP Q + F+FDTGSSLVW PCT+ 
Sbjct: 67  PDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSR 126

Query: 128 YRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCS 187
           Y CS C F  +DP  I +F+PK SSS K++GC++PKC +++GPN+  +CR C+  +R C+
Sbjct: 127 YLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPNTRNCT 186

Query: 188 DSCPGYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLP 247
             CP Y LQYG G+TAG+L++E LD  +  VPDF+VGCS++S  QPAGIAGFGRGP SLP
Sbjct: 187 VGCPPYILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLP 246

Query: 248 SQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGS-ESDESKTKSFIYAPFRENPSVSNAAFR 307
           SQM LKRFSHCLVSR FDD+ V++ L LD+GS  +  SKT    Y PFR+NP+VSN AF 
Sbjct: 247 SQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFL 306

Query: 308 EYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQ 367
           EYYYL+LRRI +G K VK PYKYL P + G+GG+I+DSGSTFTF+++P+FE +A+E   Q
Sbjct: 307 EYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQ 366

Query: 368 LVKYPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGV 427
           +  Y R KD+E ++GL PCFNI  + +    P+++ +FKGG KL L   NY   V +   
Sbjct: 367 MSNYTREKDLEKETGLGPCFNISGKGD-VTVPELIFEFKGGAKLELPLSNYFTFVGNTDT 426

Query: 428 VCLT-XXXXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
           VCLT                   G+FQQQN LVEYDL   R GF K+KC+
Sbjct: 427 VCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of CsaV3_6G041050 vs. TAIR10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 186.4 bits (472), Expect = 3.9e-47
Identity = 131/403 (32.51%), Postives = 181/403 (44.91%), Query Frame = 0

Query: 82  YSVSLAFGTPPQNLSFIFDTGSSLVWFPC-TAGYRCSRC-SFPYVDPATISKFVPKLSSS 141
           Y ++L  GTPPQ +    DTGS L W PC    + C  C      D  + S F P  SS+
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142

Query: 142 VKVVGCRNPKCAWI------FGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGIL 201
                C +  C  I      F P   + C         C   CP +   YG G   +GIL
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202

Query: 202 LSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL--KRFSHCLVSRGF 261
             + L    + VP F  GC   +  +P GIAGFGRG  SLPSQ+    K FSHC +   F
Sbjct: 203 TRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKF 262

Query: 262 DDSP-VSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGK-- 321
            ++P +SSPL+L + + S  + T S  + P    P   N+     YY+ L  I IG    
Sbjct: 263 VNNPNISSPLILGASALS-INLTDSLQFTPMLNTPMYPNS-----YYIGLESITIGTNIT 322

Query: 322 PVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSG 381
           P + P      DS GNGG ++DSG+T+T L +P +  +   L+   + YPRA + E+++G
Sbjct: 323 PTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQ-STITYPRATETESRTG 382

Query: 382 LRPCFNIP---------KEEESAEFPDVVLKFKGGGKLSLAAEN-YLAMVTDEGVVCLTX 441
              C+ +P         + +    FP +   F     L L   N + AM        +  
Sbjct: 383 FDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQC 442

Query: 442 XXXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKC 461
                            G+FQQQNV V YDL K+RIGF+   C
Sbjct: 443 LLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478

BLAST of CsaV3_6G041050 vs. TAIR10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 163.3 bits (412), Expect = 3.5e-40
Identity = 122/425 (28.71%), Postives = 186/425 (43.76%), Query Frame = 0

Query: 76  PRSYGA-YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVP 135
           P S G+ Y +SL+ G+    +S   DTGS LVWFPC   + C  C    + P+       
Sbjct: 76  PISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESKPLPPSXXXXXXX 135

Query: 136 KLSSSVKVVGCRNPKCAWIFGPNL--KSRCRNCNSKSRKCSDS---CPGYGLQYGSGATA 195
                             +   +L   S C     ++  C+ S   CP +   YG G+  
Sbjct: 136 XXXXXXXXXXXXXXXXXXLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLV 195

Query: 196 GILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL------KRFSH 255
             L S++L L +  V +F  GC+  ++ +P G+AGFGRG  SLP+Q+ +        FS+
Sbjct: 196 AKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSY 255

Query: 256 CLVSRGFDDSPV--SSPLVLDSGSESDESKT----------------KSFIYAPFRENPS 315
           CLVS  FD   V   SPL+L    +  E +                   F++    ENP 
Sbjct: 256 CLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGXXXXXXXXXXXXXXXXEFVFTEMLENPK 315

Query: 316 VSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAI 375
                   +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT L    + ++
Sbjct: 316 -----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSV 375

Query: 376 ADELEKQLVK-YPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGG-GKLSLAAENY 435
            +E + ++ + + RA  VE  SG+ PC+ +    ++ + P +VL F G    ++L   NY
Sbjct: 376 VEEFDSRVGRVHERADRVEPSSGMSPCYYL---NQTVKVPALVLHFAGNRSSVTLPRRNY 435

Query: 436 LAMVTDEG--------VVCLTXXXXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGF 461
                D G        + CL                   G +QQQ   V YDL  +R+GF
Sbjct: 436 FYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGF 491

BLAST of CsaV3_6G041050 vs. TAIR10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 159.1 bits (401), Expect = 6.7e-39
Identity = 122/399 (30.58%), Postives = 181/399 (45.36%), Query Frame = 0

Query: 80  GAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSS 139
           G Y + +  GTPP++ S I DTGS L W  C   Y C   +  + D        PK S+S
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD--------PKTSAS 217

Query: 140 VKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGILLSETLD 199
            K + C +P+C+ I  P+   +C + N        SCP Y   YG  + T G    ET  
Sbjct: 218 FKNITCNDPRCSLISSPDPPVQCESDN-------QSCP-YFYWYGDRSNTTGDFAVETFT 277

Query: 200 L---------ENKRVPDFLVGCSVMS---VHQPAGIAGFGRGPESLPSQMRL---KRFSH 259
           +            +V + + GC   +       +G+ G GRGP S  SQ++      FS+
Sbjct: 278 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSY 337

Query: 260 CLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRIL 319
           CLV R   ++ VSS L+   G + D     +  +  F      S   F   YY+ ++ IL
Sbjct: 338 CLVDRN-SNTNVSSKLIF--GEDKDLLNHTNLNFTSFVNGKENSVETF---YYIQIKSIL 397

Query: 320 IGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADEL-EKQLVKYPRAKDV 379
           +GGK +  P +     S G+GG IIDSG+T ++  +P +E I ++  EK    YP  +D 
Sbjct: 398 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 457

Query: 380 EAQSGLRPCFNIPK-EEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXXX 439
                L PCFN+   EE +   P++ + F  G   +  AEN    ++ E +VCL      
Sbjct: 458 PV---LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLS-EDLVCLA----- 517

Query: 440 XXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKC 461
                        G +QQQN  + YD  + R+GF   KC
Sbjct: 518 -ILGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of CsaV3_6G041050 vs. TAIR10
Match: AT3G59080.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 151.8 bits (382), Expect = 1.1e-36
Identity = 117/400 (29.25%), Postives = 179/400 (44.75%), Query Frame = 0

Query: 80  GAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSS 139
           G Y + +  G+PP++ S I DTGS L W  C   Y C + +  + D        PK S+S
Sbjct: 168 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYD--------PKASAS 227

Query: 140 VKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSETLDL 199
            K + C + +C  +  P+    C++ N        SCP Y     S  T G    ET  +
Sbjct: 228 YKNITCNDQRCNLVSSPDPPMPCKSDN-------QSCPYYYWYGDSSNTTGDFAVETFTV 287

Query: 200 ---------ENKRVPDFLVGCSVMS---VHQPAGIAGFGRGPESLPSQMRL---KRFSHC 259
                    E   V + + GC   +    H  AG+ G GRGP S  SQ++      FS+C
Sbjct: 288 NLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYC 347

Query: 260 LVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPF---RENPSVSNAAFREYYYLSLRR 319
           LV R   D+ VSS L+   G + D     +  +  F   +EN          +YY+ ++ 
Sbjct: 348 LVDRN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFVAGKEN------LVDTFYYVQIKS 407

Query: 320 ILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADEL-EKQLVKYPRAK 379
           IL+ G+ +  P +     S G GG IIDSG+T ++  +P +E I +++ EK   KYP  +
Sbjct: 408 ILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYR 467

Query: 380 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 439
           D      L PCFN+     + + P++ + F  G   +   EN    + +E +VCL     
Sbjct: 468 DFPI---LDPCFNV-SGIHNVQLPELGIAFADGAVWNFPTENSFIWL-NEDLVCLA---- 527

Query: 440 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKC 461
                         G +QQQN  + YD  + R+G+   KC
Sbjct: 528 --MLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532

BLAST of CsaV3_6G041050 vs. Swiss-Prot
Match: sp|Q940R4|ASP63_ARATH (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 6.4e-39
Identity = 122/425 (28.71%), Postives = 186/425 (43.76%), Query Frame = 0

Query: 76  PRSYGA-YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVP 135
           P S G+ Y +SL+ G+    +S   DTGS LVWFPC   + C  C    + P+       
Sbjct: 76  PISSGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESKPLPPSXXXXXXX 135

Query: 136 KLSSSVKVVGCRNPKCAWIFGPNL--KSRCRNCNSKSRKCSDS---CPGYGLQYGSGATA 195
                             +   +L   S C     ++  C+ S   CP +   YG G+  
Sbjct: 136 XXXXXXXXXXXXXXXXXXLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLV 195

Query: 196 GILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL------KRFSH 255
             L S++L L +  V +F  GC+  ++ +P G+AGFGRG  SLP+Q+ +        FS+
Sbjct: 196 AKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSY 255

Query: 256 CLVSRGFDDSPV--SSPLVLDSGSESDESKT----------------KSFIYAPFRENPS 315
           CLVS  FD   V   SPL+L    +  E +                   F++    ENP 
Sbjct: 256 CLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGXXXXXXXXXXXXXXXXEFVFTEMLENPK 315

Query: 316 VSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAI 375
                   +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT L    + ++
Sbjct: 316 -----HPYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSV 375

Query: 376 ADELEKQLVK-YPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGG-GKLSLAAENY 435
            +E + ++ + + RA  VE  SG+ PC+ +    ++ + P +VL F G    ++L   NY
Sbjct: 376 VEEFDSRVGRVHERADRVEPSSGMSPCYYL---NQTVKVPALVLHFAGNRSSVTLPRRNY 435

Query: 436 LAMVTDEG--------VVCLTXXXXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGF 461
                D G        + CL                   G +QQQ   V YDL  +R+GF
Sbjct: 436 FYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGF 491

BLAST of CsaV3_6G041050 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 7.8e-37
Identity = 117/386 (30.31%), Postives = 175/386 (45.34%), Query Frame = 0

Query: 80  GAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSS 139
           G Y +++A GTP  + S I DTGS L+W  C     C++C   +  P  I  F P+ SSS
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP---CTQC---FSQPTPI--FNPQDSSS 153

Query: 140 VKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA-GILLSETLD 199
              + C +  C               +  S  C+++   Y   YG G+T  G + +ET  
Sbjct: 154 FSTLPCESQYCQ--------------DLPSETCNNNECQYTYGYGDGSTTQGYMATETFT 213

Query: 200 LENKRVPDFLVGCSV----MSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSP 259
            E   VP+   GC            AG+ G G GP SLPSQ+ + +FS+C+ S G   SP
Sbjct: 214 FETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG-SSSP 273

Query: 260 VSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYK 319
            +  L   +    + S + + I++    NP+        YYY++L+ I +GG  +  P  
Sbjct: 274 STLALGSAASGVPEGSPSTTLIHSSL--NPT--------YYYITLQGITVGGDNLGIPSS 333

Query: 320 YLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNI 379
                  G GG IIDSG+T T+L +  + A+A     Q +  P     E+ SGL  CF  
Sbjct: 334 TFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-INLPTVD--ESSSGLSTCFQQ 393

Query: 380 PKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXXXXXXXXXXXXXXXX 439
           P +  + + P++ ++F  GG L+L  +N L +   EGV+CL                   
Sbjct: 394 PSDGSTVQVPEISMQF-DGGVLNLGEQNIL-ISPAEGVICLA------MGSSSQLGISIF 435

Query: 440 GAFQQQNVLVEYDLAKQRIGFRKQKC 461
           G  QQQ   V YDL    + F   +C
Sbjct: 454 GNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CsaV3_6G041050 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 5.6e-35
Identity = 111/387 (28.68%), Postives = 168/387 (43.41%), Query Frame = 0

Query: 80  GAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSS 139
           G Y ++L+ GTP Q  S I DTGS L+W  C    +C   S P         F P+ SSS
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPI--------FNPQGSSS 152

Query: 140 VKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGILLSETLD 199
              + C +  C  +  P               CS++   Y   YG G+ T G + +ETL 
Sbjct: 153 FSTLPCSSQLCQALSSPT--------------CSNNFCQYTYGYGDGSETQGSMGTETLT 212

Query: 200 LENKRVPDFLVGCSV----MSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSP 259
             +  +P+   GC            AG+ G GRGP SLPSQ+ + +FS+C+   G   S 
Sbjct: 213 FGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SS 272

Query: 260 VSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKF-PY 319
             S L+L S + S        + A       + ++    +YY++L  + +G   +   P 
Sbjct: 273 TPSNLLLGSLANS--------VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPS 332

Query: 320 KYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFN 379
            + +  + G GG IIDSG+T T+     ++++  E   Q +  P      + SG   CF 
Sbjct: 333 AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNG--SSSGFDLCFQ 392

Query: 380 IPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXXXXXXXXXXXXXXX 439
            P +  + + P  V+ F  GG L L +ENY  +    G++CL                  
Sbjct: 393 TPSDPSNLQIPTFVMHF-DGGDLELPSENYF-ISPSNGLICLA-------MGSSSQGMSI 434

Query: 440 XGAFQQQNVLVEYDLAKQRIGFRKQKC 461
            G  QQQN+LV YD     + F   +C
Sbjct: 453 FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CsaV3_6G041050 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 2.0e-32
Identity = 123/412 (29.85%), Postives = 169/412 (41.02%), Query Frame = 0

Query: 57  HLKTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRC 116
           H   P   S++ +  +S   +  G Y   L  GTP + +  + DTGS +VW  C    RC
Sbjct: 120 HAPRPGGFSSSVVSGLS---QGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRC 179

Query: 117 SRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSC 176
              S P  D        P+ S +   + C +P C        +     CN++ + C    
Sbjct: 180 YSQSDPIFD--------PRKSKTYATIPCSSPHCR-------RLDSAGCNTRRKTCL--- 239

Query: 177 PGYGLQYGSGA-TAGILLSETLDLENKRVPDFLVGCSVMS---VHQPAGIAGFGRGPESL 236
             Y + YG G+ T G   +ETL     RV    +GC   +       AG+ G G+G  S 
Sbjct: 240 --YQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSF 299

Query: 237 PSQMRLK---RFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNA 296
           P Q   +   +FS+CLV R     P  S +V  + + S  +          R  P +SN 
Sbjct: 300 PGQTGHRFNQKFSYCLVDRSASSKP--SSVVFGNAAVSRIA----------RFTPLLSNP 359

Query: 297 AFREYYYLSLRRILIGGKPVKFPYKYLVP-DSTGNGGAIIDSGSTFTFLDKPIFEAIADE 356
               +YY+ L  I +GG  V      L   D  GNGG IIDSG++ T L +P + A+ D 
Sbjct: 360 KLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDA 419

Query: 357 LEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVT 416
                    RA D    S    CF++    E  + P VVL F+ G  +SL A NYL  V 
Sbjct: 420 FRVGAKTLKRAPDF---SLFDTCFDLSNMNE-VKVPTVVLHFR-GADVSLPATNYLIPVD 479

Query: 417 DEGVVCLTXXXXXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKC 461
             G  C                    G  QQQ   V YDLA  R+GF    C
Sbjct: 480 TNGKFCFA-------FAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CsaV3_6G041050 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 5.4e-30
Identity = 113/386 (29.27%), Postives = 162/386 (41.97%), Query Frame = 0

Query: 80  GAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSS 139
           G Y   +  GTP + +  + DTGS + W  C     C+ C +   DP     F P  SS+
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP---CADC-YQQSDPV----FNPTSSST 219

Query: 140 VKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGILLSETLD 199
            K + C  P+C+ +      S CR     S KC      Y + YG G+ T G L ++T+ 
Sbjct: 220 YKSLTCSAPQCSLL----ETSACR-----SNKCL-----YQVSYGDGSFTVGELATDTVT 279

Query: 200 LENK-RVPDFLVGCSVMS---VHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSP 259
             N  ++ +  +GC   +       AG+ G G G  S+ +QM+   FS+CLV R   DS 
Sbjct: 280 FGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDR---DSG 339

Query: 260 VSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYK 319
            SS L  +S        T     AP   N  +       +YY+ L    +GG+ V  P  
Sbjct: 340 KSSSLDFNSVQLGGGDAT-----APLLRNKKIDT-----FYYVGLSGFSVGGEKVVLPDA 399

Query: 320 YLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNI 379
               D++G+GG I+D G+  T L    + ++ D   K  V     K   + S    C++ 
Sbjct: 400 IFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL--KKGSSSISLFDTCYDF 459

Query: 380 PKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXXXXXXXXXXXXXXXX 439
                + + P V   F GG  L L A+NYL  V D G  C                    
Sbjct: 460 -SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFA-------FAPTSSSLSII 500

Query: 440 GAFQQQNVLVEYDLAKQRIGFRKQKC 461
           G  QQQ   + YDL+K  IG    KC
Sbjct: 520 GNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CsaV3_6G041050 vs. TrEMBL
Match: tr|A0A0A0KHK2|A0A0A0KHK2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G454470 PE=3 SV=1)

HSP 1 Score: 883.6 bits (2282), Expect = 1.9e-253
Identity = 461/461 (100.00%), Postives = 461/461 (100.00%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60
           MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60

Query: 61  PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS 120
           PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS
Sbjct: 61  PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS 120

Query: 121 FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG 180
           FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG
Sbjct: 121 FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG 180

Query: 181 LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240
           LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR
Sbjct: 181 LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240

Query: 241 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 300
           FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR
Sbjct: 241 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 300

Query: 301 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 360
           RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 360

Query: 361 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 420
           DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX
Sbjct: 361 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 420

Query: 421 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
           XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT
Sbjct: 421 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 461

BLAST of CsaV3_6G041050 vs. TrEMBL
Match: tr|A0A1S3CHV2|A0A1S3CHV2_CUCME (aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 PE=3 SV=1)

HSP 1 Score: 813.9 bits (2101), Expect = 1.8e-232
Identity = 405/461 (87.85%), Postives = 421/461 (91.32%), Query Frame = 0

Query: 1   MEFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKT 60
           MEFLPIPFLFSIFLLLPTSSSSS T LPL TFPS+ FTDP KTIN LLSASL+RAQHLK+
Sbjct: 1   MEFLPIPFLFSIFLLLPTSSSSSIT-LPLATFPSIPFTDPLKTINHLLSASLSRAQHLKS 60

Query: 61  PQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCS 120
           PQSKSNTS +NVSLFPRSYGAY+VSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRC+ CS
Sbjct: 61  PQSKSNTSTENVSLFPRSYGAYAVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCAHCS 120

Query: 121 FPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYG 180
           FP+VDPATISKFVPKLSSSVK+VGCRNPKCAWIFGPNLKSRCRNCN KSRKCSDSCPGYG
Sbjct: 121 FPHVDPATISKFVPKLSSSVKIVGCRNPKCAWIFGPNLKSRCRNCNPKSRKCSDSCPGYG 180

Query: 181 LQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240
           +QYGSGATAGILLSETLDL+NKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR
Sbjct: 181 IQYGSGATAGILLSETLDLQNKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKR 240

Query: 241 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 300
           FSHCL+ RGFDDSPVSSPLVLDSG ESDESKTKSFIYAPF+ENPS SN AFREYYYLSLR
Sbjct: 241 FSHCLLPRGFDDSPVSSPLVLDSGPESDESKTKSFIYAPFQENPSRSNTAFREYYYLSLR 300

Query: 301 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 360
           RILIGGKPVKFPYKYLVPDSTG GGAIIDSGSTFTFLDKPIFEAIA ELEKQLVKYPRAK
Sbjct: 301 RILIGGKPVKFPYKYLVPDSTGKGGAIIDSGSTFTFLDKPIFEAIAGELEKQLVKYPRAK 360

Query: 361 DVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXX 420
           D+EA++GLRPCFNI KEEESAEFP+V LKFKGGGKLSL  ENYL MVTD  VVCLT    
Sbjct: 361 DIEAKTGLRPCFNISKEEESAEFPEVALKFKGGGKLSLPPENYLVMVTDANVVCLTMMTN 420

Query: 421 XXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
                         GAFQQQNVLVEYDLAKQRIGFRKQKCT
Sbjct: 421 AEVVGVGGGPAIIFGAFQQQNVLVEYDLAKQRIGFRKQKCT 460

BLAST of CsaV3_6G041050 vs. TrEMBL
Match: tr|A0A1S3B6B5|A0A1S3B6B5_CUCME (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103486666 PE=3 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 9.0e-147
Identity = 273/454 (60.13%), Postives = 339/454 (74.67%), Query Frame = 0

Query: 8   FLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKTPQSKSNT 67
           F   +F  L   S+S+   LPL + P +S +DP + +  L SAS NRA  +KTP+S    
Sbjct: 10  FYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRIKTPKS---N 69

Query: 68  SIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPA 127
           S+    L P SYGAYS  L+FGTP Q L  IFDTGSSLVWFPCT+ Y C+ CSFP +DP 
Sbjct: 70  SVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSFPKIDPT 129

Query: 128 TISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA 187
            I +FVPKLSSS K+VGC+NPKCAWIFGP++KS+CR+CN K+  C+ +CP Y +QYGSG+
Sbjct: 130 GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS 189

Query: 188 TAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVS 247
           TAG+LLSETLD  NK++P+F+VGCS +S+HQP+GIAGFGRG ESLPSQM LK+F++CL S
Sbjct: 190 TAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLAS 249

Query: 248 RGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGK 307
           R FDDS  S  L+LDS       KT    Y  FR+NPSVSN A++EYYYL++R+I++G +
Sbjct: 250 RKFDDSAHSGQLILDSSG----VKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVGNQ 309

Query: 308 PVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSG 367
            VK PYKYLVP   GNGG+IIDSGSTFTF+DKP+ + +A E EKQL    RA DVE  +G
Sbjct: 310 AVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETLTG 369

Query: 368 LRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTXXXXXXXXXXX 427
           LRPCF++ K E+S EFP+++ +FKGG K +L   NY A+V+  GV CL XXXXXXXXXXX
Sbjct: 370 LRPCFDVSK-EKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLXXXXXXXXXXXX 429

Query: 428 XXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
           XXXXXX GAFQQQN  VEYDL  +R+GFRKQ CT
Sbjct: 430 XXXXXXLGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of CsaV3_6G041050 vs. TrEMBL
Match: tr|A0A2P6RCE3|A0A2P6RCE3_ROSCH (Putative nepenthesin OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0475521 PE=3 SV=1)

HSP 1 Score: 512.3 bits (1318), Expect = 1.1e-141
Identity = 256/455 (56.26%), Postives = 330/455 (72.53%), Query Frame = 0

Query: 9   LFSIFLLLPTSSSSSTTVLPLTTFPS-VSFTDPFKTINLLLSASLNRAQHLKTPQSKSNT 68
           L S+F L   + SS  T LPL+   +  S +DP +T++LL SASL+RA HLK P+  SN+
Sbjct: 11  LISLFSLFHLAFSSKLT-LPLSLLATHQSSSDPIQTVSLLSSASLSRAHHLKRPK-HSNS 70

Query: 69  SIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPA 128
           S   V L+PRS+G YS+ L+FGTPPQ  +F+ DTGSSLVWFPCT+ Y CS C FP +DP 
Sbjct: 71  STTKVPLYPRSFGGYSIPLSFGTPPQTSTFVMDTGSSLVWFPCTSRYSCSSCVFPNIDPT 130

Query: 129 TISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA 188
            I  F+P+LSS+ ++VGC+NPKCAWIFGP + ++C N        S +CP Y +QYGSGA
Sbjct: 131 HIPTFIPRLSSTSRIVGCKNPKCAWIFGPQINTKCPN-------SSQACPTYRIQYGSGA 190

Query: 189 TAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVS 248
           TAG+LLSE+LDL NK VPDFLVGCS++S+ QPAGIAGFGRG ESLP QM L +FS+CLVS
Sbjct: 191 TAGVLLSESLDLPNKTVPDFLVGCSILSIRQPAGIAGFGRGQESLPVQMGLSKFSYCLVS 250

Query: 249 RGFDDSPVSSPLVLDSGSESDESKTKSFI-YAPFRENPSVSNAAFREYYYLSLRRILIGG 308
           R FDD+PVSS LVL SGS SD+      + Y PF++NP  SN+A+REYYYL+LR++++G 
Sbjct: 251 RRFDDTPVSSDLVLYSGSTSDDHDDSGGVSYTPFQKNPGASNSAYREYYYLALRKVVVGK 310

Query: 309 KPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQS 368
           K VK PYKYLVP +  NGG I+DSGSTFTF+++P+FEA+A+    Q+V+Y RAKD+E  +
Sbjct: 311 KHVKIPYKYLVPGADDNGGTIVDSGSTFTFMERPVFEAVAEAFAAQMVQYTRAKDIENGT 370

Query: 369 GLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLT-XXXXXXXXX 428
           GL+PCF+I K E++ +FP++V +FKGG K++L   NY A+VT  GVVCLT          
Sbjct: 371 GLKPCFDISK-EKTVDFPELVFQFKGGAKMALPLSNYFALVTSGGVVCLTIVTDGVAGPS 430

Query: 429 XXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKC 461
                    G+FQQQN  VEYDLA  R GFR+Q C
Sbjct: 431 LPSGPAIILGSFQQQNFYVEYDLAHDRFGFRQQSC 455

BLAST of CsaV3_6G041050 vs. TrEMBL
Match: tr|A0A0A0LBI9|A0A0A0LBI9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G778440 PE=3 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 3.3e-141
Identity = 271/462 (58.66%), Postives = 341/462 (73.81%), Query Frame = 0

Query: 5   PIPFLFSIFLLLPTSSS---SSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKTP 64
           P P  F   LL  + S+   S+   LPL +FP +S  DP + +  L S+S  RA  +KTP
Sbjct: 4   PSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTP 63

Query: 65  QSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSF 124
           +S    S+    L P SYGAYS  L+FGTP Q L  IFDTGSSLVWFPCT+ Y CS CSF
Sbjct: 64  KS---NSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 123

Query: 125 PYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGL 184
           P +DP  I +FVPKLSSS K+VGC+NPKC+WIFGP++KS+CR+CN K+  C+ +CP Y +
Sbjct: 124 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 183

Query: 185 QYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRLKRF 244
           QYGSG+TAG+LLSETLD  +K++P+F+VGCS +S+HQP+GIAGFGRG ESLPSQM LK+F
Sbjct: 184 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 243

Query: 245 SHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRR 304
           ++CL SR FDDSP S  L+LDS       K+    Y PFR+NPSVSN A++EYYYL++R+
Sbjct: 244 AYCLASRKFDDSPHSGQLILDSTG----VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRK 303

Query: 305 ILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKD 364
           I++G + VK PYK+LVP   GNGG+IIDSGSTFTF+DKP+ E +A E EKQL  + RA D
Sbjct: 304 IIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATD 363

Query: 365 VEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGV--VCLTXXX 424
           VE  +GLRPCF+I K E+S +FP+++ +FKGG K +L   NY A+V+  GV      XXX
Sbjct: 364 VETLTGLRPCFDISK-EKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVXXXXXXXXX 423

Query: 425 XXXXXXXXXXXXXXXGAFQQQNVLVEYDLAKQRIGFRKQKCT 462
           XXXXXXXXXXXXXXX AFQQQN  VEYDL  QR+GFR+Q C+
Sbjct: 424 XXXXXXXXXXXXXXXXAFQQQNFYVEYDLVNQRLGFRQQTCS 457

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011657732.12.8e-253100.00PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus] >KGN48299.1 ... [more]
XP_008462617.12.8e-23287.85PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis melo][more]
XP_023543736.12.6e-19875.81probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
XP_022925946.13.2e-19674.95probable aspartyl protease At4g16563 [Cucurbita moschata][more]
XP_022979057.12.8e-18471.92probable aspartyl protease At4g16563 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT3G52500.11.1e-12649.57Eukaryotic aspartyl protease family protein[more]
AT5G45120.13.9e-4732.51Eukaryotic aspartyl protease family protein[more]
AT4G16563.13.5e-4028.71Eukaryotic aspartyl protease family protein[more]
AT2G42980.16.7e-3930.58Eukaryotic aspartyl protease family protein[more]
AT3G59080.11.1e-3629.25Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
sp|Q940R4|ASP63_ARATH6.4e-3928.71Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
sp|Q766C2|NEP2_NEPGR7.8e-3730.31Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
sp|Q766C3|NEP1_NEPGR5.6e-3528.68Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
sp|Q9LNJ3|APF2_ARATH2.0e-3229.85Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q9LS40|ASPG1_ARATH5.4e-3029.27Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0KHK2|A0A0A0KHK2_CUCSA1.9e-253100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G454470 PE=3 SV=1[more]
tr|A0A1S3CHV2|A0A1S3CHV2_CUCME1.8e-23287.85aspartic proteinase nepenthesin-2-like OS=Cucumis melo OX=3656 GN=LOC103500932 P... [more]
tr|A0A1S3B6B5|A0A1S3B6B5_CUCME9.0e-14760.13aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103486666 PE=3 S... [more]
tr|A0A2P6RCE3|A0A2P6RCE3_ROSCH1.1e-14156.26Putative nepenthesin OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0475521 PE=3 SV... [more]
tr|A0A0A0LBI9|A0A0A0LBI9_CUCSA3.3e-14158.66Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G778440 PE=3 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR034161Pepsin-like_plant
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR032861TAXi_N
IPR021109Peptidase_aspartic_dom_sf
IPR032799TAXi_C
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
cellular_component GO:0005618 cell wall
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_6G041050.1CsaV3_6G041050.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 88..108
score: 51.84
coord: 326..337
score: 36.31
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 9..460
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 295..456
e-value: 6.8E-38
score: 129.9
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 264..461
e-value: 5.0E-52
score: 178.2
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 63..261
e-value: 4.7E-33
score: 116.9
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 76..460
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 82..249
e-value: 8.2E-29
score: 101.0
NoneNo IPR availablePANTHERPTHR13683:SF340SUBFAMILY NOT NAMEDcoord: 9..460
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 97..108
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 82..456
score: 35.155
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 81..460
e-value: 1.19889E-78
score: 247.176

The following gene(s) are paralogous to this gene:

None