Cp4.1LG16g04150 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG16g04150
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: ervatamin-B-like
Location: Cp4.1LG16: 5589312 .. 5590755 (-)
RNA-Seq Expression: Cp4.1LG16g04150
Synteny: Cp4.1LG16g04150
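
The Location field can be turned into a genomic span with a few lines of code. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a 'seqid: start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Cp4.1LG16: 5589312 .. 5590755 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 1444 bp on the minus strand of Cp4.1LG16.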
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATCCATGGCAACGGACAGCCCCTCCAATGGCTTACAAGACAGGTACAAGAAATGGATGAATAAACACAGCCGAGAGTACAAGAGCAGAGAGGAGCAGGAACGGAGATTCACAGTTTATCAGCTGAATGTTCAGTACATTGACAACTTCAATTCACTGAATCATTCATATACTCTTGCTGAAAATAGCTTTGCAGACCTCACAAATGATGAGTTTAAGACAACTTATTTGGGGTATCAAACTCATTGCCTGCCTGATACATGCTTCAGATATGAACATGTTAATAGCTTGCCTACTCATGTCGACTGGAGAATGGAAGATGCCGTTACTCCGATAAAGGATCAAGGCCAATGCGGTATGTGTTAGGAATAACGAACCTCCACAATGGTATGCTTTGGACTCCCTAAAAGGTCTCGTGCCAATGGAGATGTTTTCTCTACTTATAAACCCATGATCATTCTCTAAATTAGTCAACGTGGGACTCCCCCAACAATCCTCCTGAGGCCTATGGAGTCCTCGAACAGTCTCCCCTTAATTGAGACTCGACTCCTTTCTCTAGAGTTCTCGAACAAAGTGCACCCTTTTGTTCAACAATTGAGTCACTTTTGACTACACCTTCAAGGCTCACAACTACTTTGTTCGATATTTGAGGATTCTATTGACATAGCTAAGTTAAGAGTATAGCTCTATACCATGTTAGTAATAACGAACCTCCACAGTAGTATGATAATGTCTACTTTGAGCATAAACCACATGATCATTTCCTTAATTAGCCAACGTAAAACGAACTCATTCGTTAAAACAAGTAGCCCTTTCTCTATACTCACAAGTTTTGAAGTATTTGAAAACAGGGAGTTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCCACAAAATAAGAACAGGAAAGTTAGAGTCTTTATCAGAGCAAGAGCTTGTGGACTGTGATATCATCTCAGGGAACCAGGGCTGCGATGGCGGATTCATGAACAAAGCGTTTGAGTACATCAAGAGAAGTGGGCTGACAACAGAGAGAGAATATCCATACAGAGGAATTGAAGCTTTTTGCAACACACAAAAAGTGAGATACCACTCTGTGACAATAAGTGGGTATGAAAAAGTACCTATGAACAACGAGAAGAAATTGAAAGCTGCTGTTGCTAATCAGCCAGTTTCTGTAGCCATTGATGCAGGGGGATATGACTTTCAATTCTATTCTAGTGGTATCTTCTCAGGTAGCTGTGGGAAGCAGCTCAATCATGGAGTGGCAATCGTTGGGTATGGGGAAGTTGGGGATAATACTTACTGGCTTGTCAAGAACTCGTGGGGGACTGAGTGGGGTGAATCTGGGTACATAAGGATGAAGCGTGATTCGATTGATAAGCGAGGTGCCTGTGGCATAGCCATGGAGGCTAGCTACCCGATCAAAGACTGA

mRNA sequence

ATGGCATCCATGGCAACGGACAGCCCCTCCAATGGCTTACAAGACAGGTACAAGAAATGGATGAATAAACACAGCCGAGAGTACAAGAGCAGAGAGGAGCAGGAACGGAGATTCACAGTTTATCAGCTGAATGTTCAGTACATTGACAACTTCAATTCACTGAATCATTCATATACTCTTGCTGAAAATAGCTTTGCAGACCTCACAAATGATGAGTTTAAGACAACTTATTTGGGGTATCAAACTCATTGCCTGCCTGATACATGCTTCAGATATGAACATTATTTGAAAACAGGGAGTTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCCACAAAATAAGAACAGGAAAGTTAGAGTCTTTATCAGAGCAAGAGCTTGTGGACTGTGATATCATCTCAGGGAACCAGGGCTGCGATGGCGGATTCATGAACAAAGCGTTTGAGTACATCAAGAGAAGTGGGCTGACAACAGAGAGAGAATATCCATACAGAGGAATTGAAGCTTTTTGCAACACACAAAAAGTGAGATACCACTCTGTGACAATAAGTGGGTATGAAAAAGTACCTATGAACAACGAGAAGAAATTGAAAGCTGCTGTTGCTAATCAGCCAGTTTCTGTAGCCATTGATGCAGGGGGATATGACTTTCAATTCTATTCTAGTGGTATCTTCTCAGGTAGCTGTGGGAAGCAGCTCAATCATGGAGTGGCAATCGTTGGGTATGGGGAAGTTGGGGATAATACTTACTGGCTTGTCAAGAACTCGTGGGGGACTGAGTGGGGTGAATCTGGGTACATAAGGATGAAGCGTGATTCGATTGATAAGCGAGGTGCCTGTGGCATAGCCATGGAGGCTAGCTACCCGATCAAAGACTGA

Coding sequence (CDS)

ATGGCATCCATGGCAACGGACAGCCCCTCCAATGGCTTACAAGACAGGTACAAGAAATGGATGAATAAACACAGCCGAGAGTACAAGAGCAGAGAGGAGCAGGAACGGAGATTCACAGTTTATCAGCTGAATGTTCAGTACATTGACAACTTCAATTCACTGAATCATTCATATACTCTTGCTGAAAATAGCTTTGCAGACCTCACAAATGATGAGTTTAAGACAACTTATTTGGGGTATCAAACTCATTGCCTGCCTGATACATGCTTCAGATATGAACATTATTTGAAAACAGGGAGTTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCCACAAAATAAGAACAGGAAAGTTAGAGTCTTTATCAGAGCAAGAGCTTGTGGACTGTGATATCATCTCAGGGAACCAGGGCTGCGATGGCGGATTCATGAACAAAGCGTTTGAGTACATCAAGAGAAGTGGGCTGACAACAGAGAGAGAATATCCATACAGAGGAATTGAAGCTTTTTGCAACACACAAAAAGTGAGATACCACTCTGTGACAATAAGTGGGTATGAAAAAGTACCTATGAACAACGAGAAGAAATTGAAAGCTGCTGTTGCTAATCAGCCAGTTTCTGTAGCCATTGATGCAGGGGGATATGACTTTCAATTCTATTCTAGTGGTATCTTCTCAGGTAGCTGTGGGAAGCAGCTCAATCATGGAGTGGCAATCGTTGGGTATGGGGAAGTTGGGGATAATACTTACTGGCTTGTCAAGAACTCGTGGGGGACTGAGTGGGGTGAATCTGGGTACATAAGGATGAAGCGTGATTCGATTGATAAGCGAGGTGCCTGTGGCATAGCCATGGAGGCTAGCTACCCGATCAAAGACTGA

Protein sequence

MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYLKTGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD
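
The protein sequence above should follow from the CDS by ordinary codon translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGCATCCATGGCAACGGACAGCCCCTCC"))
print(translate("CCGATCAAAGACTGA"))
```

The two fragments translate to the start (MASMATDSPS) and end (PIKD) of the listed protein, with TGA acting as the stop codon.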
Homology
BLAST of Cp4.1LG16g04150 vs. ExPASy Swiss-Prot
Match: Q9STL4 (KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana OX=3702 GN=CEP2 PE=1 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 1.2e-78
Identity = 162/314 (51.59%), Positives = 195/314 (62.10%), Query Frame = 0

Query: 12  GLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTND 71
           GL   Y +W + HS   +S  E+E+RF V++ NV ++ N N  N SY L  N FADLT +
Sbjct: 33  GLSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTIN 92

Query: 72  EFKTTYLG--------YQTHCLPDTCFRYEH-----------------------YLKTGS 131
           EFK  Y G         Q        F Y+H                         K GS
Sbjct: 93  EFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGS 152

Query: 132 CWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRS-GLT 191
           CWAFS VAAVEGI+KI+T KL SLSEQELVDCD    N+GC+GG M  AFE+IK++ G+T
Sbjct: 153 CWAFSTVAAVEGINKIKTNKLVSLSEQELVDCD-TKQNEGCNGGLMEIAFEFIKKNGGIT 212

Query: 192 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 251
           TE  YPY GI+  C+  K     VTI G+E VP N+E  L  AVANQPVSVAIDAG  DF
Sbjct: 213 TEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDF 272

Query: 252 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYS G+F+GSCG +LNHGVA VGYG      YW+V+NSWG EWGE GYI+++R+  +  
Sbjct: 273 QFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPE 332
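
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. A small sketch of that arithmetic follows; the function name `percent` is hypothetical, and the numbers are taken from the Q9STL4 hit above.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)

# Values from the Q9STL4 HSP: 162 identical and 195 positive positions out of 314 aligned columns.
print(f"Identity  = {percent(162, 314):.2f}%")
print(f"Positives = {percent(195, 314):.2f}%")
```

The alignment length includes gap columns, which is why it can exceed the length of the query protein.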

BLAST of Cp4.1LG16g04150 vs. ExPASy Swiss-Prot
Match: O65039 (Vignain OS=Ricinus communis OX=3988 GN=CYSEP PE=1 SV=1)

HSP 1 Score: 291.6 bits (745), Expect = 1.0e-77
Identity = 157/308 (50.97%), Positives = 196/308 (63.64%), Query Frame = 0

Query: 17  YKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTT 76
           Y++W + H+   +S  E+++RF V++ N  ++ N N ++  Y L  N FAD+TN EF+ T
Sbjct: 38  YERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRNT 97

Query: 77  YLG--------YQTHCLPDTCFRYE---------------------HYLKTGSCWAFSAV 136
           Y G        ++     +  F YE                        + GSCWAFS +
Sbjct: 98  YSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFSTI 157

Query: 137 AAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIK-RSGLTTEREYPY 196
            AVEGI++I+T KL SLSEQELVDCD    NQGC+GG M+ AFE+IK R G+TTE  YPY
Sbjct: 158 VAVEGINQIKTNKLVSLSEQELVDCD-TDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 217

Query: 197 RGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFYSSGI 256
              +  C+  K    +V+I G+E VP N+E  L  AVANQPVSVAIDAGG DFQFYS G+
Sbjct: 218 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 277

Query: 257 FSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIA 294
           F+GSCG +L+HGVAIVGYG   D T YW VKNSWG EWGE GYIRM+R   DK G CGIA
Sbjct: 278 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 337

BLAST of Cp4.1LG16g04150 vs. ExPASy Swiss-Prot
Match: Q9FGR9 (KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana OX=3702 GN=CEP1 PE=1 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 2.2e-77
Identity = 159/315 (50.48%), Positives = 202/315 (64.13%), Query Frame = 0

Query: 11  NGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTN 70
           N L + Y++W + H+   +S EE+ +RF V++ NV++I   N  + SY L  N F D+T+
Sbjct: 32  NSLWELYERWRSHHT-VARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTS 91

Query: 71  DEFKTTYLG--------YQTHCLPDTCFRY---------------------EHYLKTGSC 130
           +EF+ TY G        +Q        F Y                     ++  + GSC
Sbjct: 92  EEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 151

Query: 131 WAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIK-RSGLTT 190
           WAFS V AVEGI++IRT KL SLSEQELVDCD  + NQGC+GG M+ AFE+IK + GLT+
Sbjct: 152 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCD-TNQNQGCNGGLMDLAFEFIKEKGGLTS 211

Query: 191 EREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQ 250
           E  YPY+  +  C+T K     V+I G+E VP N+E  L  AVANQPVSVAIDAGG DFQ
Sbjct: 212 ELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQ 271

Query: 251 FYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKR 295
           FYS G+F+G CG +LNHGVA+VGYG   D T YW+VKNSWG EWGE GYIRM+R    K 
Sbjct: 272 FYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE 331

BLAST of Cp4.1LG16g04150 vs. ExPASy Swiss-Prot
Match: P12412 (Vignain OS=Vigna mungo OX=3915 PE=1 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 8.5e-77
Identity = 157/313 (50.16%), Positives = 197/313 (62.94%), Query Frame = 0

Query: 13  LQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDE 72
           L D Y++W + H+   +S  E+ +RF V++ NV ++ N N ++  Y L  N FAD+TN E
Sbjct: 36  LWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHE 95

Query: 73  FKTTYLG--------YQTHCLPDTCFRYE---------------------HYLKTGSCWA 132
           F++TY G        ++        F YE                        + GSCWA
Sbjct: 96  FRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWA 155

Query: 133 FSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIK-RSGLTTER 192
           FS + AVEGI++I+T KL SLSEQELVDCD    NQGC+GG M  AFE+IK + G+TTE 
Sbjct: 156 FSTIVAVEGINQIKTNKLVSLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGITTES 215

Query: 193 EYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFY 252
            YPY   E  C+  KV   +V+I G+E VP+N+E  L  AVANQPVSVAIDAGG DFQFY
Sbjct: 216 NYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFY 275

Query: 253 SSGIFSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKRGA 295
           S G+F+G C   LNHGVAIVGYG   D T YW+V+NSWG EWGE GYIRM+R+   K G 
Sbjct: 276 SEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGL 335

BLAST of Cp4.1LG16g04150 vs. ExPASy Swiss-Prot
Match: A2XQE8 (Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_14861 PE=3 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 1.1e-76
Identity = 153/305 (50.16%), Positives = 198/305 (64.92%), Query Frame = 0

Query: 16  RYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFK- 75
           R+++WM ++ R Y+   E+ RRF V++ NV +I++FN+ NH++ L  N FADLTNDEF+ 
Sbjct: 36  RHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFADLTNDEFRW 95

Query: 76  -TTYLGY--QTHCLPDTCFRYEHYL-----------------------KTGSCWAFSAVA 135
             T  G+   T  +P T FRYE+                         + G CWAFSAVA
Sbjct: 96  TKTNKGFIPSTTRVP-TGFRYENVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVA 155

Query: 136 AVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEY-IKRSGLTTEREYPYR 195
           A+EGI K+ TGKL SLSEQELVDCD+   +QGC+GG M+ AF++ IK  GLTTE  YPY 
Sbjct: 156 AMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYA 215

Query: 196 GIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFYSSGIF 255
             +  C  + V     +I GYE VP NNE  L  AVANQPVSVA+D G   FQFY  G+ 
Sbjct: 216 AADDKC--KSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVM 275

Query: 256 SGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAM 292
           +GSCG  L+HG+  +GYG+  D T YWL+KNSWGT WGE+G++RM++D  DKRG CG+AM
Sbjct: 276 TGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAM 335

BLAST of Cp4.1LG16g04150 vs. NCBI nr
Match: XP_023513224.1 (ervatamin-B-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 584 bits (1506), Expect = 1.28e-209
Identity = 292/315 (92.70%), Positives = 292/315 (92.70%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 25  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 84

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHY--LKT-------------------G 120
           AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEH   L T                   G
Sbjct: 85  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHVNSLPTHVDWRMEDAVTPIKDQGQCG 144

Query: 121 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 180
           SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT
Sbjct: 145 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 204

Query: 181 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 240
           TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF
Sbjct: 205 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 264

Query: 241 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR
Sbjct: 265 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 324

BLAST of Cp4.1LG16g04150 vs. NCBI nr
Match: KAG6570907.1 (Senescence-specific cysteine protease SAG12, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 578 bits (1491), Expect = 1.32e-207
Identity = 287/322 (89.13%), Positives = 291/322 (90.37%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL------------------------ 120
           AENSFADLTNDEFKTTYLGYQTHCLPDTCFRY+H +                        
Sbjct: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDHVISLPTHVDWRMEDAVTPSFLSILTS 120

Query: 121 ----KTGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEY 180
               +TGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEY
Sbjct: 121 FEVFETGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEY 180

Query: 181 IKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAI 240
           IKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVP NNEKKLKAAVA+QPVSVAI
Sbjct: 181 IKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPTNNEKKLKAAVAHQPVSVAI 240

Query: 241 DAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMK 294
           DAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMK
Sbjct: 241 DAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMK 300

BLAST of Cp4.1LG16g04150 vs. NCBI nr
Match: XP_022944762.1 (ervatamin-B-like [Cucurbita moschata])

HSP 1 Score: 579 bits (1492), Expect = 1.74e-207
Identity = 286/315 (90.79%), Positives = 290/315 (92.06%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 25  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 84

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL---------------------KTG 120
           AENSFADLTNDEFKTTYLGYQTHCLPDTCFRY+H +                     + G
Sbjct: 85  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDHVISLPTHVDWRMEDAVTPVKDQGQCG 144

Query: 121 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 180
           SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT
Sbjct: 145 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 204

Query: 181 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 240
           TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVP NNEKKLKAAVA+QPVSVAIDAGGYDF
Sbjct: 205 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPTNNEKKLKAAVAHQPVSVAIDAGGYDF 264

Query: 241 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR
Sbjct: 265 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 324

BLAST of Cp4.1LG16g04150 vs. NCBI nr
Match: XP_022986332.1 (ervatamin-B [Cucurbita maxima])

HSP 1 Score: 570 bits (1469), Expect = 5.57e-204
Identity = 283/315 (89.84%), Positives = 287/315 (91.11%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 25  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 84

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL---------------------KTG 120
           AENSFADLTNDEFK TYLGYQT CL DTCFRY+H +                     + G
Sbjct: 85  AENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCG 144

Query: 121 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 180
           SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDI  GNQGCDGGFMNKAFEYIKRSGLT
Sbjct: 145 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKRSGLT 204

Query: 181 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 240
           TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVA+QPVSVAIDAGGYDF
Sbjct: 205 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDF 264

Query: 241 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR
Sbjct: 265 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 324

BLAST of Cp4.1LG16g04150 vs. NCBI nr
Match: KAG7010752.1 (Senescence-specific cysteine protease SAG12, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 531 bits (1369), Expect = 1.23e-189
Identity = 264/294 (89.80%), Positives = 266/294 (90.48%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 16  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 75

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYLKTGSCWAFSAVAAVEGIHKIRTGK 120
           AENSFADLTNDEFKTTYLGYQTHCLPDTCFRY+H                          
Sbjct: 76  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDH-------------------------- 135

Query: 121 LESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRY 180
           LESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRY
Sbjct: 136 LESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRY 195

Query: 181 HSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVA 240
           HSVTISGYEKVP NNEKKLKAAVA+QPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVA
Sbjct: 196 HSVTISGYEKVPTNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVA 255

Query: 241 IVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 294
           IVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYP KD
Sbjct: 256 IVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPTKD 283

BLAST of Cp4.1LG16g04150 vs. ExPASy TrEMBL
Match: A0A6J1FYZ3 (ervatamin-B-like OS=Cucurbita moschata OX=3662 GN=LOC111449119 PE=3 SV=1)

HSP 1 Score: 579 bits (1492), Expect = 8.43e-208
Identity = 286/315 (90.79%), Positives = 290/315 (92.06%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 25  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 84

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL---------------------KTG 120
           AENSFADLTNDEFKTTYLGYQTHCLPDTCFRY+H +                     + G
Sbjct: 85  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDHVISLPTHVDWRMEDAVTPVKDQGQCG 144

Query: 121 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 180
           SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT
Sbjct: 145 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 204

Query: 181 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 240
           TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVP NNEKKLKAAVA+QPVSVAIDAGGYDF
Sbjct: 205 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPTNNEKKLKAAVAHQPVSVAIDAGGYDF 264

Query: 241 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR
Sbjct: 265 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 324

BLAST of Cp4.1LG16g04150 vs. ExPASy TrEMBL
Match: A0A6J1J793 (ervatamin-B OS=Cucurbita maxima OX=3661 GN=LOC111484106 PE=3 SV=1)

HSP 1 Score: 570 bits (1469), Expect = 2.70e-204
Identity = 283/315 (89.84%), Positives = 287/315 (91.11%), Query Frame = 0

Query: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 25  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 84

Query: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL---------------------KTG 120
           AENSFADLTNDEFK TYLGYQT CL DTCFRY+H +                     + G
Sbjct: 85  AENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCG 144

Query: 121 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRSGLT 180
           SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDI  GNQGCDGGFMNKAFEYIKRSGLT
Sbjct: 145 SCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKRSGLT 204

Query: 181 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 240
           TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVA+QPVSVAIDAGGYDF
Sbjct: 205 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDF 264

Query: 241 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR
Sbjct: 265 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 324

BLAST of Cp4.1LG16g04150 vs. ExPASy TrEMBL
Match: A0A384S0D9 (Cysteine proteinase 1 (Fragment) OS=Citrullus lanatus OX=3654 GN=ClCP1 PE=2 SV=1)

HSP 1 Score: 477 bits (1228), Expect = 6.67e-168
Identity = 239/319 (74.92%), Positives = 265/319 (83.07%), Query Frame = 0

Query: 1   MASMATD----SPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNH 60
           MASM  D    S S  LQDRY+KWM+K+ REYKSREE E+RF +YQLNVQYIDNFNSLNH
Sbjct: 1   MASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNH 60

Query: 61  SYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL-------------------- 120
           SYTLAENSFADLTNDEFKTTYLG++T  LPDT FRY + +                    
Sbjct: 61  SYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQ 120

Query: 121 -KTGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKR 180
            + GSCWAFSAVAAVEGI+KI+TGKL SLSEQELVDCD+ SGNQGC+GG+M KAFE+IK+
Sbjct: 121 GQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKAFEFIKK 180

Query: 181 SGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAG 240
           +GLTTE EYPYRGIE+ CN QKVRY +VTISGYEKVP+N+EK LKAAVANQPVSVAIDAG
Sbjct: 181 TGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAG 240

Query: 241 GYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDS 294
           GYDFQFYS G+FSG+CGKQLNHGVAIVGYGE  + TYWLVKNSWGT+WGESGYIRMKRDS
Sbjct: 241 GYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWLVKNSWGTDWGESGYIRMKRDS 300

BLAST of Cp4.1LG16g04150 vs. ExPASy TrEMBL
Match: A0A6J1CH04 (ervatamin-B OS=Momordica charantia OX=3673 GN=LOC111011342 PE=3 SV=1)

HSP 1 Score: 452 bits (1164), Expect = 8.66e-158
Identity = 225/319 (70.53%), Positives = 256/319 (80.25%), Query Frame = 0

Query: 1   MASMATDSP----SNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNH 60
           MAS+A D+P    S+ ++DRY+KW++K+ REYKS EE+E+RF +YQ NVQYID FNSLN 
Sbjct: 25  MASVAEDNPPGDGSDDMRDRYQKWIDKYGREYKSGEEREKRFPIYQSNVQYIDYFNSLNR 84

Query: 61  SYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL-------------------- 120
           SYTLA+N FADLTNDEFKTTYLGY T   PDTCF+Y + +                    
Sbjct: 85  SYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKYGNIVNLPTNVDWRKEGAVTPIKDQ 144

Query: 121 -KTGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKR 180
            + GSCWAFSAVAAVEGI KI+TGKL SLSEQEL+DCD+ISGNQGC GGFM KAFE+IK+
Sbjct: 145 GQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLDCDVISGNQGCSGGFMPKAFEFIKK 204

Query: 181 SGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAG 240
            G+TTE+EYPYRG+E  CN QKVRYHS TISGYEKVP N+EK LKAAVANQPVSVAIDAG
Sbjct: 205 IGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKVPANDEKSLKAAVANQPVSVAIDAG 264

Query: 241 GYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDS 294
           GYDFQFYS GIFSG+CGKQLNHGV IVGYGE    +YWLVKNSWGT WGE GY+RMK +S
Sbjct: 265 GYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKSYWLVKNSWGTSWGEYGYVRMKSNS 324

BLAST of Cp4.1LG16g04150 vs. ExPASy TrEMBL
Match: A0A0A0LJV6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G292830 PE=3 SV=1)

HSP 1 Score: 445 bits (1144), Expect = 8.59e-155
Identity = 221/320 (69.06%), Positives = 257/320 (80.31%), Query Frame = 0

Query: 1   MASMATD-----SPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLN 60
           + SMA D     S S+ +QDRY+KWM+K+ R+YKSREE ERRFT+YQ NVQYIDNFNS+N
Sbjct: 21  LVSMAMDYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMN 80

Query: 61  HSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHYL------------------- 120
           HS+TLAEN+FADLTN+EFK TYLGY+T  +PDTCFRY + +                   
Sbjct: 81  HSHTLAENNFADLTNEEFKATYLGYKTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKN 140

Query: 121 --KTGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIK 180
             + GSCWAFSAVAAVEGI+KI+ GKL SLSEQELVDCD+ SGNQGC+GG+M KAFE+IK
Sbjct: 141 QGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIK 200

Query: 181 RSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDA 240
           R+GLTTE EYPY+G E+ CN QK +Y  V+ISGYEKVP+N+EK LKAAVANQPVSVAIDA
Sbjct: 201 RTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDA 260

Query: 241 GGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRD 294
            G +FQFYS GIFSG+CG QLNHGVAIVGYGE  +  YWLVKNSWGT+WGESGYIRMKRD
Sbjct: 261 EGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRD 320

BLAST of Cp4.1LG16g04150 vs. TAIR 10
Match: AT1G06260.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 304.7 bits (779), Expect = 8.2e-83
Identity = 160/310 (51.61%), Positives = 199/310 (64.19%), Query Frame = 0

Query: 9   PSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADL 68
           P   L+ R++KW+  HS+ Y  R+E   RF +YQ NVQ ID  NSL+  + L +N FAD+
Sbjct: 35  PHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADM 94

Query: 69  TNDEFKTTYLGYQTHCL----------------PDTC-FRYEHYL-------KTGSCWAF 128
           TN EFK  +LG  T  L                PD   +R +  +       K G CWAF
Sbjct: 95  TNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAF 154

Query: 129 SAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIK-RSGLTTERE 188
           SAVAA+EGI+KI+TG L SLSEQ+L+DCD+ + N+GC GG M  AFE+IK   GL TE +
Sbjct: 155 SAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETD 214

Query: 189 YPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFYS 248
           YPY GIE  C+ +K +   VTI GY+KV   NE  L+ A A QPVSV IDAGG+ FQ YS
Sbjct: 215 YPYTGIEGTCDQEKSKNKVVTIQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQLYS 274

Query: 249 SGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACG 294
           SG+F+  CG  LNHGV +VGYG  GD  YW+VKNSWGT WGE GYIRM+R   +  G CG
Sbjct: 275 SGVFTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCG 334

BLAST of Cp4.1LG16g04150 vs. TAIR 10
Match: AT3G48340.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 294.7 bits (753), Expect = 8.4e-80
Identity = 162/314 (51.59%), Positives = 195/314 (62.10%), Query Frame = 0

Query: 12  GLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTND 71
           GL   Y +W + HS   +S  E+E+RF V++ NV ++ N N  N SY L  N FADLT +
Sbjct: 33  GLSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTIN 92

Query: 72  EFKTTYLG--------YQTHCLPDTCFRYEH-----------------------YLKTGS 131
           EFK  Y G         Q        F Y+H                         K GS
Sbjct: 93  EFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGS 152

Query: 132 CWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRS-GLT 191
           CWAFS VAAVEGI+KI+T KL SLSEQELVDCD    N+GC+GG M  AFE+IK++ G+T
Sbjct: 153 CWAFSTVAAVEGINKIKTNKLVSLSEQELVDCD-TKQNEGCNGGLMEIAFEFIKKNGGIT 212

Query: 192 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDF 251
           TE  YPY GI+  C+  K     VTI G+E VP N+E  L  AVANQPVSVAIDAG  DF
Sbjct: 213 TEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDF 272

Query: 252 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 294
           QFYS G+F+GSCG +LNHGVA VGYG      YW+V+NSWG EWGE GYI+++R+  +  
Sbjct: 273 QFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPE 332

BLAST of Cp4.1LG16g04150 vs. TAIR 10
Match: AT5G50260.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 290.4 bits (742), Expect = 1.6e-78
Identity = 159/315 (50.48%), Positives = 202/315 (64.13%), Query Frame = 0

Query: 11  NGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTN 70
           N L + Y++W + H+   +S EE+ +RF V++ NV++I   N  + SY L  N F D+T+
Sbjct: 32  NSLWELYERWRSHHT-VARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTS 91

Query: 71  DEFKTTYLG--------YQTHCLPDTCFRY---------------------EHYLKTGSC 130
           +EF+ TY G        +Q        F Y                     ++  + GSC
Sbjct: 92  EEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 151

Query: 131 WAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIK-RSGLTT 190
           WAFS V AVEGI++IRT KL SLSEQELVDCD  + NQGC+GG M+ AFE+IK + GLT+
Sbjct: 152 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCD-TNQNQGCNGGLMDLAFEFIKEKGGLTS 211

Query: 191 EREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQ 250
           E  YPY+  +  C+T K     V+I G+E VP N+E  L  AVANQPVSVAIDAGG DFQ
Sbjct: 212 ELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQ 271

Query: 251 FYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKR 295
           FYS G+F+G CG +LNHGVA+VGYG   D T YW+VKNSWG EWGE GYIRM+R    K 
Sbjct: 272 FYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE 331

BLAST of Cp4.1LG16g04150 vs. TAIR 10
Match: AT5G45890.1 (senescence-associated gene 12 )

HSP 1 Score: 287.3 bits (734), Expect = 1.3e-77
Identity = 150/316 (47.47%), Positives = 207/316 (65.51%), Query Frame = 0

Query: 13  LQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSL--NHSYTLAENSFADLTN 72
           +Q R+ +WM KH R Y   +E+  R+ V++ NV+ I++ NS+    ++ LA N FADLTN
Sbjct: 34  MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93

Query: 73  DEFKTTYLGY----------QTHCLPDTCFRYEHYL-----------------------K 132
           DEF++ Y G+          QT   P   FRY++                          
Sbjct: 94  DEFRSMYTGFKGVSALSSQSQTKMSP---FRYQNVSSGALPVSVDWRKKGAVTPIKNQGS 153

Query: 133 TGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEYIKRS- 192
            G CWAFSAVAA+EG  +I+ GKL SLSEQ+LVDCD  + + GC+GG M+ AFE+IK + 
Sbjct: 154 CGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD--TNDFGCEGGLMDTAFEHIKATG 213

Query: 193 GLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGG 252
           GLTTE  YPY+G +A CN++K    + +I+GYE VP+N+E+ L  AVA+QPVSV I+ GG
Sbjct: 214 GLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGG 273

Query: 253 YDFQFYSSGIFSGSCGKQLNHGVAIVGYGE-VGDNTYWLVKNSWGTEWGESGYIRMKRDS 292
           +DFQFYSSG+F+G C   L+H V  +GYGE    + YW++KNSWGT+WGESGY+R+++D 
Sbjct: 274 FDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDV 333

BLAST of Cp4.1LG16g04150 vs. TAIR 10
Match: AT1G47128.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 274.6 bits (701), Expect = 9.0e-74
Identity = 144/306 (47.06%), Positives = 193/306 (63.07%), Query Frame = 0

Query: 17  YKKWMNKH--SREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFK 76
           Y+ W+ KH  ++   S  E++RRF +++ N++++D  N  N SY L    FADLTNDE++
Sbjct: 50  YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR 109

Query: 77  TTYLGYQTHCLPD--TCFRYEHYL-----------------------KTGSCWAFSAVAA 136
           + YLG +     +  T  RYE  +                         GSCWAFS + A
Sbjct: 110 SKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGA 169

Query: 137 VEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEY-IKRSGLTTEREYPYRG 196
           VEGI++I TG L +LSEQELVDCD  S N+GC+GG M+ AFE+ IK  G+ T+++YPY+G
Sbjct: 170 VEGINQIVTGDLITLSEQELVDCD-TSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 229

Query: 197 IEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVANQPVSVAIDAGGYDFQFYSSGIFS 256
           ++  C+  +     VTI  YE VP  +E+ LK AVA+QP+S+AI+AGG  FQ Y SGIF 
Sbjct: 230 VDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFD 289

Query: 257 GSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEA 295
           GSCG QL+HGV  VGYG      YW+V+NSWG  WGESGY+RM R+     G CGIA+E 
Sbjct: 290 GSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEP 349

The following BLAST results are available for this feature:

BLAST of Cp4.1LG16g04150 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity | Description
Q9STL4 | 1.2e-78 | 51.59 | KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana OX=3702 GN=CEP2 ...
O65039 | 1.0e-77 | 50.97 | Vignain OS=Ricinus communis OX=3988 GN=CYSEP PE=1 SV=1
Q9FGR9 | 2.2e-77 | 50.48 | KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana OX=3702 GN=CEP1 ...
P12412 | 8.5e-77 | 50.16 | Vignain OS=Vigna mungo OX=3915 PE=1 SV=1
A2XQE8 | 1.1e-76 | 50.16 | Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. indica OX=399...

BLAST of Cp4.1LG16g04150 vs. NCBI nr
Match Name | E-value | Identity | Description
XP_023513224.1 | 1.28e-209 | 92.70 | ervatamin-B-like [Cucurbita pepo subsp. pepo]
KAG6570907.1 | 1.32e-207 | 89.13 | Senescence-specific cysteine protease SAG12, partial [Cucurbita argyrosperma sub...
XP_022944762.1 | 1.74e-207 | 90.79 | ervatamin-B-like [Cucurbita moschata]
XP_022986332.1 | 5.57e-204 | 89.84 | ervatamin-B [Cucurbita maxima]
KAG7010752.1 | 1.23e-189 | 89.80 | Senescence-specific cysteine protease SAG12, partial [Cucurbita argyrosperma sub...

BLAST of Cp4.1LG16g04150 vs. ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1FYZ3 | 8.43e-208 | 90.79 | ervatamin-B-like OS=Cucurbita moschata OX=3662 GN=LOC111449119 PE=3 SV=1
A0A6J1J793 | 2.70e-204 | 89.84 | ervatamin-B OS=Cucurbita maxima OX=3661 GN=LOC111484106 PE=3 SV=1
A0A384S0D9 | 6.67e-168 | 74.92 | Cysteine proteinase 1 (Fragment) OS=Citrullus lanatus OX=3654 GN=ClCP1 PE=2 SV=1
A0A6J1CH04 | 8.66e-158 | 70.53 | ervatamin-B OS=Momordica charantia OX=3673 GN=LOC111011342 PE=3 SV=1
A0A0A0LJV6 | 8.59e-155 | 69.06 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G292830 PE=3 SV=1

BLAST of Cp4.1LG16g04150 vs. TAIR 10
Match Name | E-value | Identity | Description
AT1G06260.1 | 8.2e-83 | 51.61 | Cysteine proteinases superfamily protein
AT3G48340.1 | 8.4e-80 | 51.59 | Cysteine proteinases superfamily protein
AT5G50260.1 | 1.6e-78 | 50.48 | Cysteine proteinases superfamily protein
AT5G45890.1 | 1.3e-77 | 47.47 | senescence-associated gene 12
AT1G47128.1 | 9.0e-74 | 47.06 | Granulin repeat cysteine protease family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000668 | Peptidase C1A, papain C-terminal | PRINTS | PR00705 | PAPAIN | coord: 252..258, score: 75.76; coord: 237..247, score: 54.87; coord: 95..110, score: 48.87
IPR000668 | Peptidase C1A, papain C-terminal | SMART | SM00645 | pept_c1 | coord: 85..292, e-value: 7.4E-92, score: 321.2
IPR000668 | Peptidase C1A, papain C-terminal | PFAM | PF00112 | Peptidase_C1 | coord: 98..292, e-value: 8.8E-70, score: 235.2
IPR013201 | Cathepsin propeptide inhibitor domain (I29) | SMART | SM00848 | Inhibitor_I29_2 | coord: 17..73, e-value: 2.1E-20, score: 83.8
IPR013201 | Cathepsin propeptide inhibitor domain (I29) | PFAM | PF08246 | Inhibitor_I29 | coord: 17..73, e-value: 7.1E-13, score: 48.8
None | No IPR available | GENE3D | 3.90.70.10 | Cysteine proteinases | coord: 82..294, e-value: 4.1E-72, score: 244.8
None | No IPR available | GENE3D | 1.10.287.2250 |  | coord: 2..81, e-value: 2.0E-22, score: 80.9
None | No IPR available | PANTHER | PTHR12411:SF796 | ERVATAMIN-B-LIKE | coord: 96..293; coord: 9..85
None | No IPR available | PANTHER | PTHR12411 | CYSTEINE PROTEASE FAMILY C1-RELATED | coord: 96..293
None | No IPR available | PANTHER | PTHR12411 | CYSTEINE PROTEASE FAMILY C1-RELATED | coord: 9..85
IPR025660 | Cysteine peptidase, histidine active site | PROSITE | PS00639 | THIOL_PROTEASE_HIS | coord: 235..245
IPR025661 | Cysteine peptidase, asparagine active site | PROSITE | PS00640 | THIOL_PROTEASE_ASN | coord: 252..271
IPR039417 | Papain-like cysteine endopeptidase | CDD | cd02248 | Peptidase_C1A | coord: 99..291, e-value: 1.32432E-92, score: 271.42
IPR038765 | Papain-like cysteine peptidase superfamily | SUPERFAMILY | 54001 | Cysteine proteinases | coord: 13..292
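
The `coord` ranges above can be converted into domain lengths. This is a small sketch under the assumption that the coordinates are 1-based, inclusive residue positions on the protein; the helper name `span_length` is hypothetical.

```python
def span_length(coord):
    """Length of a 'start..end' range, assuming 1-based inclusive positions."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1

# Pfam hits from the InterPro annotations of this gene.
pfam_hits = {
    "Inhibitor_I29 (PF08246)": "17..73",
    "Peptidase_C1 (PF00112)": "98..292",
}
for name, coord in pfam_hits.items():
    print(f"{name}: {span_length(coord)} residues")
```

Read this way, the I29 propeptide inhibitor domain covers 57 residues and the papain-like peptidase domain covers 195 residues.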

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG16g04150.1 | Cp4.1LG16g04150.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0051603 | proteolysis involved in cellular protein catabolic process
biological_process | GO:0006508 | proteolysis
cellular_component | GO:0005615 | extracellular space
cellular_component | GO:0005764 | lysosome
cellular_component | GO:0016020 | membrane
molecular_function | GO:0004197 | cysteine-type endopeptidase activity
molecular_function | GO:0008234 | cysteine-type peptidase activity