CmaCh20G005860 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh20G005860
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: ervatamin-B
Location: Cma_Chr20: 2780924 .. 2782500 (-)
RNA-Seq Expression: CmaCh20G005860
Synteny: CmaCh20G005860
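The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the names `Location` and `parse_location` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cma_Chr20: 2780924 .. 2782500 (-)'."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cma_Chr20: 2780924 .. 2782500 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the feature spans 2782500 - 2780924 + 1 = 1577 positions on the minus strand of Cma_Chr20.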
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCATATAAAACGATTTGGAACATGGGTTTAACGTCCCTGATTCTCTGGGTTATCTGCACACCCTCAATGGCATCCATGGCAACGGACAGCCCCTCCAATGGCTTACAAGACAGGTACAAGAAATGGATGAATAAACACAGCCGAGAATACAAGAGCAGAGAAGAGCAGGAACGGAGATTCACAGTTTATCAGCTGAATGTTCAGTACATTGACAACTTCAATTCACTGAATCATTCATATACTCTTGCTGAAAATAGTTTTGCAGACCTCACAAATGATGAGTTTAAGATCACTTATTTGGGGTATCAAACTGATTGCCTGTCTGATACATGCTTCAGATATGATCATGTTATTAGCTTGCCTAATCATGTTGACTGGAGAATGGAAGATGCTGTTACTCCGGTAAAGGATCAAGGCCAATGCGGTATGTGTTAGGAATAACAAACCTCCACAATAGTATGCTTTAGGACTTTTCAAAAGGTCTCATACCAATGGAGATGTTTTCTATACTTATAAACCCATGATCATTCTATAAATTAGCCAAGGTGAGACTCTCCCAACAATTTTCCCCTCATACGAAGTACACTATAGAGTCTCCCGTGAGGCCTATGGAGCCCTGGAACAGTTTCCCCTTAATTGAGACTCAACTTCTTTCTCTAGAGTCCTGGAACAAAGTGCACCCTTTTATTCAACAATTGAGTCACTTTTGACTATACCTTCGAGGCTCACAACTACTTTGTTCGATATTTGAGAATTCTATTGACAGCTAAGTTAAGAGCATAGCTCTATACCATGTTAGGAATAACGAACCTTCACAGTAGTATGATATTGTCTACTTTGAGCATAAGCTCTCGTACCTTTACTTTGGACTTCCCCAAAAGGCCATGAACATTTCTTTAATTAGCCAACGTAAAATGAACTCATTCATTAAAACAACAGCCCTTTTATCTATACTCACAAGTTTTGAAGTATTTGAAACAGGGAGTTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCCACAAAATAAGAACAGGAAAGTTAGAGTCTTTATCAGAGCAAGAGCTTGTGGACTGTGATATCACCTTGGGGAACCAGGGCTGCGATGGCGGTTTCATGAACAAAGCGTTTGAGTACATCAAGAGAAGTGGGCTGACAACAGAGAGAGAATATCCATACAGAGGAATTGAAGCTTTTTGCAACACGCAAAAAGTGAGATACCACTCTGTGACAATAAGTGGGTATGAAAAAGTACCTATGAATAACGAGAAGAAATTGAAAGCTGCTGTTGCTCATCAGCCAGTTTCTGTAGCCATTGATGCAGGGGGATATGATTTTCAATTCTATTCTAGTGGTATCTTCTCAGGTAGCTGTGGGAAGCAGCTCAATCATGGAGTAGCAATCGTTGGGTATGGGGAAGTTGGGGATAATACTTACTGGCTTGTCAAGAATTCGTGGGGGACTGAGTGGGGTGAATCTGGGTACATAAGGATGAAGCGTGATTCGATTGACAAGCGAGGTGCCTGTGGCATAGCCATGGAGGCGAGCTACCCGATCAAAGACTGA

mRNA sequence

ATGGAAGCATATAAAACGATTTGGAACATGGGTTTAACGTCCCTGATTCTCTGGGTTATCTGCACACCCTCAATGGCATCCATGGCAACGGACAGCCCCTCCAATGGCTTACAAGACAGGTACAAGAAATGGATGAATAAACACAGCCGAGAATACAAGAGCAGAGAAGAGCAGGAACGGAGATTCACAGTTTATCAGCTGAATGTTCAGTACATTGACAACTTCAATTCACTGAATCATTCATATACTCTTGCTGAAAATAGTTTTGCAGACCTCACAAATGATGAGTTTAAGATCACTTATTTGGGGTATCAAACTGATTGCCTGTCTGATACATGCTTCAGATATGATCATGTTATTAGCTTGCCTAATCATGTTGACTGGAGAATGGAAGATGCTGTTACTCCGGTAAAGGATCAAGGCCAATGCGGGAGTTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCCACAAAATAAGAACAGGAAAGTTAGAGTCTTTATCAGAGCAAGAGCTTGTGGACTGTGATATCACCTTGGGGAACCAGGGCTGCGATGGCGGTTTCATGAACAAAGCGTTTGAGTACATCAAGAGAAGTGGGCTGACAACAGAGAGAGAATATCCATACAGAGGAATTGAAGCTTTTTGCAACACGCAAAAAGTGAGATACCACTCTGTGACAATAAGTGGGTATGAAAAAGTACCTATGAATAACGAGAAGAAATTGAAAGCTGCTGTTGCTCATCAGCCAGTTTCTGTAGCCATTGATGCAGGGGGATATGATTTTCAATTCTATTCTAGTGGTATCTTCTCAGGTAGCTGTGGGAAGCAGCTCAATCATGGAGTAGCAATCGTTGGGTATGGGGAAGTTGGGGATAATACTTACTGGCTTGTCAAGAATTCGTGGGGGACTGAGTGGGGTGAATCTGGGTACATAAGGATGAAGCGTGATTCGATTGACAAGCGAGGTGCCTGTGGCATAGCCATGGAGGCGAGCTACCCGATCAAAGACTGA
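The gene sequence and the mRNA sequence differ by the intron that is removed during splicing. A rough sketch of how the intron could be located by comparing the two strings is shown here, assuming exactly one intron and otherwise identical sequences. The function name `find_single_intron` and the short toy sequences are hypothetical and are not taken from this record. Because bases at the splice junction can repeat, several placements may be equally consistent with the data; the sketch lists all of them and then keeps those with the common GT...AG boundary.

```python
def find_single_intron(genomic: str, spliced: str) -> list[tuple[int, int]]:
    """Return every (start, end) placement of one intron, 0-based, end exclusive."""
    gap = len(genomic) - len(spliced)
    if gap <= 0:
        raise ValueError("spliced sequence must be shorter than the genomic one")
    # The longest common prefix gives the right-most possible intron start.
    start = 0
    while start < len(spliced) and genomic[start] == spliced[start]:
        start += 1
    if genomic[start + gap:] != spliced[start:]:
        raise ValueError("sequences do not differ by exactly one insertion")
    placements = [(start, start + gap)]
    # Sliding left stays valid while the base entering the intron on the left
    # equals the base leaving it on the right.
    while start > 0 and genomic[start - 1] == genomic[start + gap - 1]:
        start -= 1
        placements.append((start, start + gap))
    return placements


def canonical_only(genomic: str, placements: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Keep placements whose intron starts with GT and ends with AG."""
    return [(s, e) for s, e in placements
            if genomic[s:s + 2] == "GT" and genomic[e - 2:e] == "AG"]


# Toy data (hypothetical): exon "ATGGCG" + intron "GTAAGTTTTCAG" + exon "GCTTGA".
toy_genomic = "ATGGCG" + "GTAAGTTTTCAG" + "GCTTGA"
toy_spliced = "ATGGCG" + "GCTTGA"
candidates = find_single_intron(toy_genomic, toy_spliced)
print(candidates)
print(canonical_only(toy_genomic, candidates))
```

Applied to the gene and mRNA strings of a record like this one, the same comparison would report candidate intron boundaries; a gene with more than one intron would need a proper spliced aligner instead.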

Coding sequence (CDS)

ATGGAAGCATATAAAACGATTTGGAACATGGGTTTAACGTCCCTGATTCTCTGGGTTATCTGCACACCCTCAATGGCATCCATGGCAACGGACAGCCCCTCCAATGGCTTACAAGACAGGTACAAGAAATGGATGAATAAACACAGCCGAGAATACAAGAGCAGAGAAGAGCAGGAACGGAGATTCACAGTTTATCAGCTGAATGTTCAGTACATTGACAACTTCAATTCACTGAATCATTCATATACTCTTGCTGAAAATAGTTTTGCAGACCTCACAAATGATGAGTTTAAGATCACTTATTTGGGGTATCAAACTGATTGCCTGTCTGATACATGCTTCAGATATGATCATGTTATTAGCTTGCCTAATCATGTTGACTGGAGAATGGAAGATGCTGTTACTCCGGTAAAGGATCAAGGCCAATGCGGGAGTTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCCACAAAATAAGAACAGGAAAGTTAGAGTCTTTATCAGAGCAAGAGCTTGTGGACTGTGATATCACCTTGGGGAACCAGGGCTGCGATGGCGGTTTCATGAACAAAGCGTTTGAGTACATCAAGAGAAGTGGGCTGACAACAGAGAGAGAATATCCATACAGAGGAATTGAAGCTTTTTGCAACACGCAAAAAGTGAGATACCACTCTGTGACAATAAGTGGGTATGAAAAAGTACCTATGAATAACGAGAAGAAATTGAAAGCTGCTGTTGCTCATCAGCCAGTTTCTGTAGCCATTGATGCAGGGGGATATGATTTTCAATTCTATTCTAGTGGTATCTTCTCAGGTAGCTGTGGGAAGCAGCTCAATCATGGAGTAGCAATCGTTGGGTATGGGGAAGTTGGGGATAATACTTACTGGCTTGTCAAGAATTCGTGGGGGACTGAGTGGGGTGAATCTGGGTACATAAGGATGAAGCGTGATTCGATTGACAAGCGAGGTGCCTGTGGCATAGCCATGGAGGCGAGCTACCCGATCAAAGACTGA

Protein sequence

MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD
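The protein sequence is the translation of the coding sequence. The sketch below illustrates that relationship with the standard genetic code, built from the usual compact TCAG-ordered amino-acid string. The function name `translate` is a hypothetical helper, and only the first and last few codons of the CDS above are used as input.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGAAGCATATAAAACG"))  # first six codons of the CDS
print(translate("AAAGACTGA"))           # last three codons, ending in a stop
```

The first call yields MEAYKT and the second yields KD, which agree with the start and end of the protein sequence listed above.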
Homology
BLAST of CmaCh20G005860 vs. ExPASy Swiss-Prot
Match: Q9FGR9 (KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana OX=3702 GN=CEP1 PE=1 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 9.8e-93
Identity = 176/315 (55.87%), Positives = 221/315 (70.16%), Query Frame = 0

Query: 35  NGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTN 94
           N L + Y++W + H+   +S EE+ +RF V++ NV++I   N  + SY L  N F D+T+
Sbjct: 32  NSLWELYERWRSHHT-VARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTS 91

Query: 95  DEFKITYLG--------YQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSC 154
           +EF+ TY G        +Q +  +   F Y +V +LP  VDWR   AVTPVK+QGQCGSC
Sbjct: 92  EEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 151

Query: 155 WAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIK-RSGLTT 214
           WAFS V AVEGI++IRT KL SLSEQELVDCD T  NQGC+GG M+ AFE+IK + GLT+
Sbjct: 152 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCD-TNQNQGCNGGLMDLAFEFIKEKGGLTS 211

Query: 215 EREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQ 274
           E  YPY+  +  C+T K     V+I G+E VP N+E  L  AVA+QPVSVAIDAGG DFQ
Sbjct: 212 ELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQ 271

Query: 275 FYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKR 334
           FYS G+F+G CG +LNHGVA+VGYG   D T YW+VKNSWG EWGE GYIRM+R    K 
Sbjct: 272 FYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE 331

Query: 335 GACGIAMEASYPIKD 340
           G CGIAMEASYP+K+
Sbjct: 332 GLCGIAMEASYPLKN 344
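Each HSP summary line reports identical and positive (conservatively substituted) positions as a count over the alignment length, followed by the percentage. A small sketch for parsing such a line and recomputing the percentages is given here; `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern accepts both the spelling "Positives" and the variant "Postives" that some report pages emit.

```python
import re

# Matches e.g. "Identity = 176/315 (55.87%)"; tolerates "Postives" as well.
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict[str, tuple[int, int, float]]:
    """Return {'identity': (matched, aligned, reported_pct), 'positives': ...}."""
    stats = {}
    for label, matched, aligned, reported in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matched), int(aligned), float(reported))
    return stats


def percent(matched: int, aligned: int) -> float:
    """Percentage of aligned columns that matched, rounded to two decimals."""
    return round(100.0 * matched / aligned, 2)


line = "Identity = 176/315 (55.87%), Positives = 221/315 (70.16%), Query Frame = 0"
for name, (matched, aligned, reported) in parse_hsp_stats(line).items():
    print(name, matched, aligned, reported, percent(matched, aligned))
```

For this HSP, 176/315 and 221/315 recompute to 55.87% and 70.16%, matching the reported values; the alignment length (315) includes gap columns, which is why it can exceed the number of query residues covered.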

BLAST of CmaCh20G005860 vs. ExPASy Swiss-Prot
Match: O65039 (Vignain OS=Ricinus communis OX=3988 GN=CYSEP PE=1 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 8.3e-92
Identity = 172/308 (55.84%), Positives = 215/308 (69.81%), Query Frame = 0

Query: 41  YKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKIT 100
           Y++W + H+   +S  E+++RF V++ N  ++ N N ++  Y L  N FAD+TN EF+ T
Sbjct: 38  YERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRNT 97

Query: 101 YLG--------YQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAV 160
           Y G        ++     +  F Y+ V ++P  VDWR + AVT VKDQGQCGSCWAFS +
Sbjct: 98  YSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFSTI 157

Query: 161 AAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIK-RSGLTTEREYPY 220
            AVEGI++I+T KL SLSEQELVDCD T  NQGC+GG M+ AFE+IK R G+TTE  YPY
Sbjct: 158 VAVEGINQIKTNKLVSLSEQELVDCD-TDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 217

Query: 221 RGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGI 280
              +  C+  K    +V+I G+E VP N+E  L  AVA+QPVSVAIDAGG DFQFYS G+
Sbjct: 218 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 277

Query: 281 FSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIA 339
           F+GSCG +L+HGVAIVGYG   D T YW VKNSWG EWGE GYIRM+R   DK G CGIA
Sbjct: 278 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 337

BLAST of CmaCh20G005860 vs. ExPASy Swiss-Prot
Match: P12412 (Vignain OS=Vigna mungo OX=3915 PE=1 SV=1)

HSP 1 Score: 337.8 bits (865), Expect = 1.4e-91
Identity = 180/349 (51.58%), Positives = 227/349 (65.04%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60
           M   K +W +   SL+L V  +        +S    L D Y++W + H+   +S  E+ +
Sbjct: 1   MAMKKLLWVVLSLSLVLGVANSFDFHEKDLES-EESLWDLYERWRSHHTVS-RSLGEKHK 60

Query: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLG--------YQTDCLSDT 120
           RF V++ NV ++ N N ++  Y L  N FAD+TN EF+ TY G        ++       
Sbjct: 61  RFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSG 120

Query: 121 CFRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQ 180
            F Y+ V S+P  VDWR + AVT VKDQGQCGSCWAFS + AVEGI++I+T KL SLSEQ
Sbjct: 121 TFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQ 180

Query: 181 ELVDCDITLGNQGCDGGFMNKAFEYIK-RSGLTTEREYPYRGIEAFCNTQKVRYHSVTIS 240
           ELVDCD    NQGC+GG M  AFE+IK + G+TTE  YPY   E  C+  KV   +V+I 
Sbjct: 181 ELVDCD-KEENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKVNDLAVSID 240

Query: 241 GYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGE 300
           G+E VP+N+E  L  AVA+QPVSVAIDAGG DFQFYS G+F+G C   LNHGVAIVGYG 
Sbjct: 241 GHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGT 300

Query: 301 VGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
             D T YW+V+NSWG EWGE GYIRM+R+   K G CGIAM ASYPIK+
Sbjct: 301 TVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPIKN 346

BLAST of CmaCh20G005860 vs. ExPASy Swiss-Prot
Match: Q9STL4 (KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana OX=3702 GN=CEP2 PE=1 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 2.4e-91
Identity = 175/314 (55.73%), Positives = 214/314 (68.15%), Query Frame = 0

Query: 36  GLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTND 95
           GL   Y +W + HS   +S  E+E+RF V++ NV ++ N N  N SY L  N FADLT +
Sbjct: 33  GLSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTIN 92

Query: 96  EFKITYLG--------YQTDCLSDTCFRYDH--VISLPNHVDWRMEDAVTPVKDQGQCGS 155
           EFK  Y G         Q        F YDH  +  LP+ VDWR + AVT +K+QG+CGS
Sbjct: 93  EFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGS 152

Query: 156 CWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKRS-GLT 215
           CWAFS VAAVEGI+KI+T KL SLSEQELVDCD T  N+GC+GG M  AFE+IK++ G+T
Sbjct: 153 CWAFSTVAAVEGINKIKTNKLVSLSEQELVDCD-TKQNEGCNGGLMEIAFEFIKKNGGIT 212

Query: 216 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDF 275
           TE  YPY GI+  C+  K     VTI G+E VP N+E  L  AVA+QPVSVAIDAG  DF
Sbjct: 213 TEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDF 272

Query: 276 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 335
           QFYS G+F+GSCG +LNHGVA VGYG      YW+V+NSWG EWGE GYI+++R+  +  
Sbjct: 273 QFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPE 332

Query: 336 GACGIAMEASYPIK 339
           G CGIAMEASYPIK
Sbjct: 333 GRCGIAMEASYPIK 344

BLAST of CmaCh20G005860 vs. ExPASy Swiss-Prot
Match: P25803 (Vignain OS=Phaseolus vulgaris OX=3885 PE=2 SV=2)

HSP 1 Score: 336.7 bits (862), Expect = 3.1e-91
Identity = 176/346 (50.87%), Positives = 226/346 (65.32%), Query Frame = 0

Query: 13  TSLILWVICTPSMASMATDS---------PSNGLQDRYKKWMNKHSREYKSREEQERRFT 72
           T  +LWV+ + S+     +S             L D Y++W + H+   +S  E+ +RF 
Sbjct: 3   TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 62

Query: 73  VYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLG--------YQTDCLSDTCFR 132
           V++ N+ ++ N N ++  Y L  N FAD+TN EF+ TY G        ++     +  F 
Sbjct: 63  VFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFM 122

Query: 133 YDHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELV 192
           Y+ V+S+P  VDWR + AVT VKDQGQCGSCWAFS V AVEGI++I+T KL +LSEQELV
Sbjct: 123 YEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELV 182

Query: 193 DCDITLGNQGCDGGFMNKAFEYIK-RSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYE 252
           DCD    NQGC+GG M  AFE+IK + G+TTE  YPY+  E  C+  KV   +V+I G+E
Sbjct: 183 DCD-KEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 242

Query: 253 KVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGD 312
            VP N+E  L  AVA+QPVSVAIDAGG DFQFYS G+F+G C   LNHGVAIVGYG   D
Sbjct: 243 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 302

Query: 313 NT-YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
            T YW+V+NSWG EWGE GYIRM+R+   K G CGIAM  SYPIK+
Sbjct: 303 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKN 346

BLAST of CmaCh20G005860 vs. ExPASy TrEMBL
Match: A0A6J1J793 (ervatamin-B OS=Cucurbita maxima OX=3661 GN=LOC111484106 PE=3 SV=1)

HSP 1 Score: 709.1 bits (1829), Expect = 8.6e-201
Identity = 339/339 (100.00%), Positives = 339/339 (100.00%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60
           MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER
Sbjct: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60

Query: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120
           RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI
Sbjct: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120

Query: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180
           SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT
Sbjct: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180

Query: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240
           LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN
Sbjct: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240

Query: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300
           EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV
Sbjct: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300

Query: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD
Sbjct: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 339

BLAST of CmaCh20G005860 vs. ExPASy TrEMBL
Match: A0A6J1FYZ3 (ervatamin-B-like OS=Cucurbita moschata OX=3662 GN=LOC111449119 PE=3 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 3.2e-195
Identity = 329/339 (97.05%), Positives = 331/339 (97.64%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60
           MEAYKTIWNMGLTSLILW++CTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER
Sbjct: 1   MEAYKTIWNMGLTSLILWIVCTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60

Query: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120
           RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFK TYLGYQT CL DTCFRYDHVI
Sbjct: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDHVI 120

Query: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180
           SLP HVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDI 
Sbjct: 121 SLPTHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDII 180

Query: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240
            GNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVP NN
Sbjct: 181 SGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPTNN 240

Query: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300
           EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV
Sbjct: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300

Query: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYP KD
Sbjct: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPTKD 339

BLAST of CmaCh20G005860 vs. ExPASy TrEMBL
Match: A0A384S0D9 (Cysteine proteinase 1 (Fragment) OS=Citrullus lanatus OX=3654 GN=ClCP1 PE=2 SV=1)

HSP 1 Score: 532.7 bits (1371), Expect = 1.1e-147
Identity = 254/319 (79.62%), Positives = 284/319 (89.03%), Query Frame = 0

Query: 25  MASMATD----SPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNH 84
           MASM  D    S S  LQDRY+KWM+K+ REYKSREE E+RF +YQLNVQYIDNFNSLNH
Sbjct: 1   MASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNH 60

Query: 85  SYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQ 144
           SYTLAENSFADLTNDEFK TYLG++TD L DT FRY ++++LP +VDWR E+AVTPVKDQ
Sbjct: 61  SYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQ 120

Query: 145 GQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKR 204
           GQCGSCWAFSAVAAVEGI+KI+TGKL SLSEQELVDCD+  GNQGC+GG+M KAFE+IK+
Sbjct: 121 GQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKAFEFIKK 180

Query: 205 SGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAG 264
           +GLTTE EYPYRGIE+ CN QKVRY +VTISGYEKVP+N+EK LKAAVA+QPVSVAIDAG
Sbjct: 181 TGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAG 240

Query: 265 GYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDS 324
           GYDFQFYS G+FSG+CGKQLNHGVAIVGYGE  + TYWLVKNSWGT+WGESGYIRMKRDS
Sbjct: 241 GYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWLVKNSWGTDWGESGYIRMKRDS 300

Query: 325 IDKRGACGIAMEASYPIKD 340
            DKRG CGIAM ASYPIKD
Sbjct: 301 TDKRGTCGIAMMASYPIKD 319

BLAST of CmaCh20G005860 vs. ExPASy TrEMBL
Match: A0A6J1CH04 (ervatamin-B OS=Momordica charantia OX=3673 GN=LOC111011342 PE=3 SV=1)

HSP 1 Score: 522.7 bits (1345), Expect = 1.1e-144
Identity = 251/343 (73.18%), Positives = 287/343 (83.67%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSP----SNGLQDRYKKWMNKHSREYKSRE 60
           MEAY  I N+G   LIL V  T SMAS+A D+P    S+ ++DRY+KW++K+ REYKS E
Sbjct: 1   MEAYGMIRNVGFMWLILCVFWTLSMASVAEDNPPGDGSDDMRDRYQKWIDKYGREYKSGE 60

Query: 61  EQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRY 120
           E+E+RF +YQ NVQYID FNSLN SYTLA+N FADLTNDEFK TYLGY TD   DTCF+Y
Sbjct: 61  EREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKY 120

Query: 121 DHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVD 180
            ++++LP +VDWR E AVTP+KDQGQCGSCWAFSAVAAVEGI KI+TGKL SLSEQEL+D
Sbjct: 121 GNIVNLPTNVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLD 180

Query: 181 CDITLGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKV 240
           CD+  GNQGC GGFM KAFE+IK+ G+TTE+EYPYRG+E  CN QKVRYHS TISGYEKV
Sbjct: 181 CDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKV 240

Query: 241 PMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT 300
           P N+EK LKAAVA+QPVSVAIDAGGYDFQFYS GIFSG+CGKQLNHGV IVGYGE    +
Sbjct: 241 PANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKS 300

Query: 301 YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           YWLVKNSWGT WGE GY+RMK +S DKRG CGIAM+ASYPIKD
Sbjct: 301 YWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD 343

BLAST of CmaCh20G005860 vs. ExPASy TrEMBL
Match: A0A5A7SQK0 (Ervatamin-B-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G001250 PE=3 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 3.3e-144
Identity = 251/346 (72.54%), Positives = 289/346 (83.53%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNG-----LQDRYKKWMNKHSREYKSR 60
           MEAYK IW++ L SLILWV  TP+  SMA D PS       LQDRY+KWM+K+ R+YKSR
Sbjct: 1   MEAYKMIWSVSLISLILWVFWTPTRVSMAMDYPSGSSNSGELQDRYQKWMDKYGRQYKSR 60

Query: 61  EEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCL--SDTC 120
           EE E+RFT+YQ NVQYIDNFNSLNHSYTLAEN+F DLTN+EF  TYLGY+T  L  +DT 
Sbjct: 61  EEWEQRFTIYQANVQYIDNFNSLNHSYTLAENNFTDLTNEEFMATYLGYETVSLPDTDTF 120

Query: 121 FRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQE 180
           FRY ++++LP +VDWR E AVTP+K+QGQCGSCWAFSAVAAVEGI+KI+ GKL SLSEQE
Sbjct: 121 FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLMSLSEQE 180

Query: 181 LVDCDITLGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGY 240
           LVDCD+T GNQGC+GG+M KAFE+IK++GLTTE EYPY    + C+ QK +Y SV+ISGY
Sbjct: 181 LVDCDVTSGNQGCNGGYMYKAFEFIKKTGLTTELEYPYTATRSACDKQKEKYQSVSISGY 240

Query: 241 EKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVG 300
           EKVP+N+EK L+AAVA QPVSVAIDAGG DFQFYS GIFSG+CGKQLNHGVAIVGYGE  
Sbjct: 241 EKVPVNDEKSLQAAVAKQPVSVAIDAGGNDFQFYSGGIFSGNCGKQLNHGVAIVGYGEDS 300

Query: 301 DNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           +  YWLVKNSWGT WGESGYIRM RDS DK+G CGIAM ASYPIKD
Sbjct: 301 NQAYWLVKNSWGTSWGESGYIRMMRDSTDKQGTCGIAMMASYPIKD 346

BLAST of CmaCh20G005860 vs. NCBI nr
Match: XP_022986332.1 (ervatamin-B [Cucurbita maxima])

HSP 1 Score: 709.1 bits (1829), Expect = 1.8e-200
Identity = 339/339 (100.00%), Positives = 339/339 (100.00%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60
           MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER
Sbjct: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60

Query: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120
           RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI
Sbjct: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120

Query: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180
           SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT
Sbjct: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180

Query: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240
           LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN
Sbjct: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240

Query: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300
           EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV
Sbjct: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300

Query: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD
Sbjct: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 339

BLAST of CmaCh20G005860 vs. NCBI nr
Match: XP_022944762.1 (ervatamin-B-like [Cucurbita moschata])

HSP 1 Score: 690.6 bits (1781), Expect = 6.5e-195
Identity = 329/339 (97.05%), Positives = 331/339 (97.64%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60
           MEAYKTIWNMGLTSLILW++CTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER
Sbjct: 1   MEAYKTIWNMGLTSLILWIVCTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60

Query: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120
           RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFK TYLGYQT CL DTCFRYDHVI
Sbjct: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDHVI 120

Query: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180
           SLP HVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDI 
Sbjct: 121 SLPTHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDII 180

Query: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240
            GNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVP NN
Sbjct: 181 SGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPTNN 240

Query: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300
           EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV
Sbjct: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300

Query: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYP KD
Sbjct: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPTKD 339

BLAST of CmaCh20G005860 vs. NCBI nr
Match: XP_023513224.1 (ervatamin-B-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 688.0 bits (1774), Expect = 4.2e-194
Identity = 328/339 (96.76%), Positives = 332/339 (97.94%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60
           MEAYKTIWNMGLTSLILWV+CTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER
Sbjct: 1   MEAYKTIWNMGLTSLILWVVCTPSMASMATDSPSNGLQDRYKKWMNKHSREYKSREEQER 60

Query: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVI 120
           RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFK TYLGYQT CL DTCFRY+HV 
Sbjct: 61  RFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRYEHVN 120

Query: 121 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 180
           SLP HVDWRMEDAVTP+KDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDI 
Sbjct: 121 SLPTHVDWRMEDAVTPIKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDII 180

Query: 181 LGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240
            GNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN
Sbjct: 181 SGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNN 240

Query: 241 EKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300
           EKKLKAAVA+QPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV
Sbjct: 241 EKKLKAAVANQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLV 300

Query: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD
Sbjct: 301 KNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 339

BLAST of CmaCh20G005860 vs. NCBI nr
Match: KAG6570907.1 (Senescence-specific cysteine protease SAG12, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 615.9 bits (1587), Expect = 2.0e-172
Identity = 300/322 (93.17%), Positives = 302/322 (93.79%), Query Frame = 0

Query: 25  MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 84
           MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL
Sbjct: 1   MASMATDSPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTL 60

Query: 85  AENSFADLTNDEFKITYLGYQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTP-------V 144
           AENSFADLTNDEFK TYLGYQT CL DTCFRYDHVISLP HVDWRMEDAVTP        
Sbjct: 61  AENSFADLTNDEFKTTYLGYQTHCLPDTCFRYDHVISLPTHVDWRMEDAVTPSFLSILTS 120

Query: 145 KDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEY 204
            +  + GSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDI  GNQGCDGGFMNKAFEY
Sbjct: 121 FEVFETGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIISGNQGCDGGFMNKAFEY 180

Query: 205 IKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAI 264
           IKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVP NNEKKLKAAVAHQPVSVAI
Sbjct: 181 IKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPTNNEKKLKAAVAHQPVSVAI 240

Query: 265 DAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMK 324
           DAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMK
Sbjct: 241 DAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMK 300

Query: 325 RDSIDKRGACGIAMEASYPIKD 340
           RDSIDKRGACGIAMEASYP KD
Sbjct: 301 RDSIDKRGACGIAMEASYPTKD 322

BLAST of CmaCh20G005860 vs. NCBI nr
Match: XP_038902939.1 (ervatamin-B [Benincasa hispida])

HSP 1 Score: 560.8 bits (1444), Expect = 7.8e-156
Identity = 265/343 (77.26%), Positives = 302/343 (88.05%), Query Frame = 0

Query: 1   MEAYKTIWNMGLTSLILWVICTPSMASMATDSP----SNGLQDRYKKWMNKHSREYKSRE 60
           MEAY+ IWN+GL SLILWVI TP+M  MA D P    S  LQ RY+KWM+K+ R+YKSRE
Sbjct: 1   MEAYRMIWNVGLMSLILWVIWTPTMVFMAMDYPPGSSSGDLQGRYQKWMSKYGRQYKSRE 60

Query: 61  EQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCLSDTCFRY 120
           E ERRFT+YQLNVQYIDNFNSL+HSYTLAEN+FADLTNDEFK TYLGY+TD L DTCFRY
Sbjct: 61  EWERRFTIYQLNVQYIDNFNSLDHSYTLAENNFADLTNDEFKETYLGYKTDWLPDTCFRY 120

Query: 121 DHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVD 180
            +++ LP +V+WR E AVTP+K+QGQCGSCWAFSAVAAVEGI+KI+TGKL SLSEQELVD
Sbjct: 121 GNMVDLPTNVNWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 180

Query: 181 CDITLGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKV 240
           CD+  GNQGC+GGFM+KAF++IK++GLTTE EYPYRGIE+ CN QKVR H+V ISGYEKV
Sbjct: 181 CDVASGNQGCNGGFMDKAFQFIKKTGLTTETEYPYRGIESTCNKQKVRNHTVEISGYEKV 240

Query: 241 PMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT 300
           P N+EK LKAAVA+QPVSVAIDAGGYDFQFYS G+FSG+CGKQLNHGVAIVGYG+  + +
Sbjct: 241 PANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGQASNKS 300

Query: 301 YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 340
           YWLVKNSWGT+WGESGYIRMKRDS DKRG CGIAM ASYPIKD
Sbjct: 301 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 343

BLAST of CmaCh20G005860 vs. TAIR 10
Match: AT1G06260.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 349.0 bits (894), Expect = 4.4e-96
Identity = 176/339 (51.92%), Positives = 225/339 (66.37%), Query Frame = 0

Query: 9   NMGLTSLILWVICTPSMASMATD--SPSNGLQDRYKKWMNKHSREYKSREEQERRFTVYQ 68
           N+ L  LI +V+    + S+ +    P   L+ R++KW+  HS+ Y  R+E   RF +YQ
Sbjct: 9   NLTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQ 68

Query: 69  LNVQYIDNFNSLNHSYTLAENSFADLTNDEFKITYLGYQTDCL------SDTCFRYDHVI 128
            NVQ ID  NSL+  + L +N FAD+TN EFK  +LG  T  L         C   D   
Sbjct: 69  SNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVC---DPAG 128

Query: 129 SLPNHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDIT 188
           ++P+ VDWR + AVTP+++QG+CG CWAFSAVAA+EGI+KI+TG L SLSEQ+L+DCD+ 
Sbjct: 129 NVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVG 188

Query: 189 LGNQGCDGGFMNKAFEYIK-RSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMN 248
             N+GC GG M  AFE+IK   GL TE +YPY GIE  C+ +K +   VTI GY+KV   
Sbjct: 189 TYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKV-AQ 248

Query: 249 NEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWL 308
           NE  L+ A A QPVSV IDAGG+ FQ YSSG+F+  CG  LNHGV +VGYG  GD  YW+
Sbjct: 249 NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQKYWI 308

Query: 309 VKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIK 339
           VKNSWGT WGE GYIRM+R   +  G CGIAM ASYP++
Sbjct: 309 VKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPLQ 343

BLAST of CmaCh20G005860 vs. TAIR 10
Match: AT5G50260.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 341.7 bits (875), Expect = 6.9e-94
Identity = 176/315 (55.87%), Positives = 221/315 (70.16%), Query Frame = 0

Query: 35  NGLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTN 94
           N L + Y++W + H+   +S EE+ +RF V++ NV++I   N  + SY L  N F D+T+
Sbjct: 32  NSLWELYERWRSHHT-VARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTS 91

Query: 95  DEFKITYLG--------YQTDCLSDTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSC 154
           +EF+ TY G        +Q +  +   F Y +V +LP  VDWR   AVTPVK+QGQCGSC
Sbjct: 92  EEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 151

Query: 155 WAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIK-RSGLTT 214
           WAFS V AVEGI++IRT KL SLSEQELVDCD T  NQGC+GG M+ AFE+IK + GLT+
Sbjct: 152 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCD-TNQNQGCNGGLMDLAFEFIKEKGGLTS 211

Query: 215 EREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQ 274
           E  YPY+  +  C+T K     V+I G+E VP N+E  L  AVA+QPVSVAIDAGG DFQ
Sbjct: 212 ELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQ 271

Query: 275 FYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT-YWLVKNSWGTEWGESGYIRMKRDSIDKR 334
           FYS G+F+G CG +LNHGVA+VGYG   D T YW+VKNSWG EWGE GYIRM+R    K 
Sbjct: 272 FYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE 331

Query: 335 GACGIAMEASYPIKD 340
           G CGIAMEASYP+K+
Sbjct: 332 GLCGIAMEASYPLKN 344

BLAST of CmaCh20G005860 vs. TAIR 10
Match: AT3G48340.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 337.0 bits (863), Expect = 1.7e-92
Identity = 175/314 (55.73%), Positives = 214/314 (68.15%), Query Frame = 0

Query: 36  GLQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTND 95
           GL   Y +W + HS   +S  E+E+RF V++ NV ++ N N  N SY L  N FADLT +
Sbjct: 33  GLSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTIN 92

Query: 96  EFKITYLG--------YQTDCLSDTCFRYDH--VISLPNHVDWRMEDAVTPVKDQGQCGS 155
           EFK  Y G         Q        F YDH  +  LP+ VDWR + AVT +K+QG+CGS
Sbjct: 93  EFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGS 152

Query: 156 CWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKRS-GLT 215
           CWAFS VAAVEGI+KI+T KL SLSEQELVDCD T  N+GC+GG M  AFE+IK++ G+T
Sbjct: 153 CWAFSTVAAVEGINKIKTNKLVSLSEQELVDCD-TKQNEGCNGGLMEIAFEFIKKNGGIT 212

Query: 216 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDF 275
           TE  YPY GI+  C+  K     VTI G+E VP N+E  L  AVA+QPVSVAIDAG  DF
Sbjct: 213 TEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDF 272

Query: 276 QFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKR 335
           QFYS G+F+GSCG +LNHGVA VGYG      YW+V+NSWG EWGE GYI+++R+  +  
Sbjct: 273 QFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPE 332

Query: 336 GACGIAMEASYPIK 339
           G CGIAMEASYPIK
Sbjct: 333 GRCGIAMEASYPIK 344

BLAST of CmaCh20G005860 vs. TAIR 10
Match: AT5G45890.1 (senescence-associated gene 12 )

HSP 1 Score: 333.6 bits (854), Expect = 1.9e-91
Identity = 166/313 (53.04%), Positives = 224/313 (71.57%), Query Frame = 0

Query: 37  LQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSL--NHSYTLAENSFADLTN 96
           +Q R+ +WM KH R Y   +E+  R+ V++ NV+ I++ NS+    ++ LA N FADLTN
Sbjct: 34  MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93

Query: 97  DEFKITYLGYQ-TDCLSD------TCFRYDHVIS--LPNHVDWRMEDAVTPVKDQGQCGS 156
           DEF+  Y G++    LS       + FRY +V S  LP  VDWR + AVTP+K+QG CG 
Sbjct: 94  DEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGC 153

Query: 157 CWAFSAVAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEYIKRS-GLT 216
           CWAFSAVAA+EG  +I+ GKL SLSEQ+LVDCD    + GC+GG M+ AFE+IK + GLT
Sbjct: 154 CWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD--TNDFGCEGGLMDTAFEHIKATGGLT 213

Query: 217 TEREYPYRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDF 276
           TE  YPY+G +A CN++K    + +I+GYE VP+N+E+ L  AVAHQPVSV I+ GG+DF
Sbjct: 214 TESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDF 273

Query: 277 QFYSSGIFSGSCGKQLNHGVAIVGYGE-VGDNTYWLVKNSWGTEWGESGYIRMKRDSIDK 336
           QFYSSG+F+G C   L+H V  +GYGE    + YW++KNSWGT+WGESGY+R+++D  DK
Sbjct: 274 QFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDK 333

BLAST of CmaCh20G005860 vs. TAIR 10
Match: AT4G35350.1 (xylem cysteine peptidase 1)

HSP 1 Score: 327.0 bits (837), Expect = 1.8e-89
Identity = 168/308 (54.55%), Positives = 205/308 (66.56%), Query Frame = 0

Query: 37  LQDRYKKWMNKHSREYKSREEQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDE 96
           L + ++ WM++HS+ YKS EE+  RF V++ N+ +ID  N+  +SY L  N FADLT++E
Sbjct: 47  LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEE 106

Query: 97  FKITYLGYQTDCLS-----DTCFRYDHVISLPNHVDWRMEDAVTPVKDQGQCGSCWAFSA 156
           FK  YLG      S        FRY  +  LP  VDWR + AV PVKDQGQCGSCWAFS 
Sbjct: 107 FKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFST 166

Query: 157 VAAVEGIHKIRTGKLESLSEQELVDCDITLGNQGCDGGFMNKAFEY-IKRSGLTTEREYP 216
           VAAVEGI++I TG L SLSEQEL+DCD T  N GC+GG M+ AF+Y I   GL  E +YP
Sbjct: 167 VAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGGLHKEDDYP 226

Query: 217 YRGIEAFCNTQKVRYHSVTISGYEKVPMNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSG 276
           Y   E  C  QK     VTISGYE VP N+++ L  A+AHQPVSVAI+A G DFQFY  G
Sbjct: 227 YLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGG 286

Query: 277 IFSGSCGKQLNHGVAIVGYGEVGDNTYWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIA 336
           +F+G CG  L+HGVA VGYG    + Y +VKNSWG  WGE G+IRMKR++    G CGI 
Sbjct: 287 VFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 346

Query: 337 MEASYPIK 339
             ASYP K
Sbjct: 347 KMASYPTK 353

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q9FGR9	9.8e-93	55.87	KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana OX=3702 GN=CEP1 ...
O65039	8.3e-92	55.84	Vignain OS=Ricinus communis OX=3988 GN=CYSEP PE=1 SV=1
P12412	1.4e-91	51.58	Vignain OS=Vigna mungo OX=3915 PE=1 SV=1
Q9STL4	2.4e-91	55.73	KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana OX=3702 GN=CEP2 ...
P25803	3.1e-91	50.87	Vignain OS=Phaseolus vulgaris OX=3885 PE=2 SV=2

Match Name	E-value	Identity (%)	Description
A0A6J1J793	8.6e-201	100.00	ervatamin-B OS=Cucurbita maxima OX=3661 GN=LOC111484106 PE=3 SV=1
A0A6J1FYZ3	3.2e-195	97.05	ervatamin-B-like OS=Cucurbita moschata OX=3662 GN=LOC111449119 PE=3 SV=1
A0A384S0D9	1.1e-147	79.62	Cysteine proteinase 1 (Fragment) OS=Citrullus lanatus OX=3654 GN=ClCP1 PE=2 SV=1
A0A6J1CH04	1.1e-144	73.18	ervatamin-B OS=Momordica charantia OX=3673 GN=LOC111011342 PE=3 SV=1
A0A5A7SQK0	3.3e-144	72.54	Ervatamin-B-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold111G001...

Match Name	E-value	Identity (%)	Description
XP_022986332.1	1.8e-200	100.00	ervatamin-B [Cucurbita maxima]
XP_022944762.1	6.5e-195	97.05	ervatamin-B-like [Cucurbita moschata]
XP_023513224.1	4.2e-194	96.76	ervatamin-B-like [Cucurbita pepo subsp. pepo]
KAG6570907.1	2.0e-172	93.17	Senescence-specific cysteine protease SAG12, partial [Cucurbita argyrosperma sub...
XP_038902939.1	7.8e-156	77.26	ervatamin-B [Benincasa hispida]

Match Name	E-value	Identity (%)	Description
AT1G06260.1	4.4e-96	51.92	Cysteine proteinases superfamily protein
AT5G50260.1	6.9e-94	55.87	Cysteine proteinases superfamily protein
AT3G48340.1	1.7e-92	55.73	Cysteine proteinases superfamily protein
AT5G45890.1	1.9e-91	53.04	senescence-associated gene 12
AT4G35350.1	1.8e-89	54.55	xylem cysteine peptidase 1
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000668	Peptidase C1A, papain C-terminal	PRINTS	PR00705	PAPAIN	coord: 282..292, score: 54.87; coord: 297..303, score: 75.76; coord: 140..155, score: 67.03
IPR000668	Peptidase C1A, papain C-terminal	SMART	SM00645	pept_c1	coord: 122..337, e-value: 6.5E-116, score: 401.1
IPR000668	Peptidase C1A, papain C-terminal	PFAM	PF00112	Peptidase_C1	coord: 122..337, e-value: 1.7E-81, score: 273.5
IPR013201	Cathepsin propeptide inhibitor domain (I29)	SMART	SM00848	Inhibitor_I29_2	coord: 41..97, e-value: 2.1E-20, score: 83.8
IPR013201	Cathepsin propeptide inhibitor domain (I29)	PFAM	PF08246	Inhibitor_I29	coord: 41..97, e-value: 8.9E-13, score: 48.5
None	No IPR available	GENE3D	3.90.70.10	Cysteine proteinases	coord: 26..339, e-value: 1.4E-115, score: 388.2
None	No IPR available	PANTHER	PTHR12411	CYSTEINE PROTEASE FAMILY C1-RELATED	coord: 25..338
None	No IPR available	PANTHER	PTHR12411:SF796	ERVATAMIN-B-LIKE	coord: 25..338
IPR000169	Cysteine peptidase, cysteine active site	PROSITE	PS00139	THIOL_PROTEASE_CYS	coord: 140..151
IPR025660	Cysteine peptidase, histidine active site	PROSITE	PS00639	THIOL_PROTEASE_HIS	coord: 280..290
IPR025661	Cysteine peptidase, asparagine active site	PROSITE	PS00640	THIOL_PROTEASE_ASN	coord: 297..316
IPR039417	Papain-like cysteine endopeptidase	CDD	cd02248	Peptidase_C1A	coord: 123..336, e-value: 1.06957E-107, score: 311.48
IPR038765	Papain-like cysteine peptidase superfamily	SUPERFAMILY	54001	Cysteine proteinases	coord: 38..337
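
The coordinates in the InterPro table can be cross-checked against each other. As a minimal sketch, the snippet below tests whether the three PROSITE active-site motifs fall inside the Pfam Peptidase_C1 domain (122..337), and whether the I29 propeptide domain (41..97) does. The helper `within` and the variable names are hypothetical; the coordinates are copied from the table.

```python
def within(inner, outer):
    """Return True if the closed interval `inner` lies inside `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Pfam PF00112 (Peptidase_C1) span, as listed in the table.
PEPTIDASE_C1 = (122, 337)

# PROSITE active-site motif spans, as listed in the table.
ACTIVE_SITE_MOTIFS = {
    "PS00139 THIOL_PROTEASE_CYS": (140, 151),
    "PS00639 THIOL_PROTEASE_HIS": (280, 290),
    "PS00640 THIOL_PROTEASE_ASN": (297, 316),
}

# Pfam PF08246 (Inhibitor_I29) span, as listed in the table.
INHIBITOR_I29 = (41, 97)

motif_inside = {
    name: within(span, PEPTIDASE_C1)
    for name, span in ACTIVE_SITE_MOTIFS.items()
}
i29_inside = within(INHIBITOR_I29, PEPTIDASE_C1)

for name, inside in motif_inside.items():
    print(name, "inside Peptidase_C1:", inside)
print("Inhibitor_I29 inside Peptidase_C1:", i29_inside)
```

With the listed coordinates, all three motifs lie within the Peptidase_C1 span, while the I29 domain lies entirely upstream of it.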

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh20G005860.1	CmaCh20G005860.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0051603	proteolysis involved in cellular protein catabolic process
biological_process	GO:0006508	proteolysis
cellular_component	GO:0005615	extracellular space
cellular_component	GO:0005764	lysosome
molecular_function	GO:0004197	cysteine-type endopeptidase activity
molecular_function	GO:0008234	cysteine-type peptidase activity