CmaCh19G006820.1 (mRNA) Cucurbita maxima (Rimu)

NameCmaCh19G006820.1
TypemRNA
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionCathepsin B-like cysteine proteinase 2
LocationCma_Chr19 : 7077999 .. 7078883 (+)
Sequence length750
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCATACGTCTGTTTCCTCTTTAGTGATACTATTCTGAAACCTTTGAACTTGCATTTTATAAATTCTGCATATATCTTCTGCACACGGCAGTGTGATCCAGAGGGAGCTGGTTCCTGTGACTCTGGTTGCAATGGTGGCTCGATGAACAGTGCATTTGAATACACATTAAAAGTTGGTTGCAATCGTGAAGCCTGTAAGTTTGACAGGTCCAAGATCGCCGCTGCATCAGTTGCCAATTTCAGTGTTGTTTCACTTGATGAGGACCAAATTGCTGCAAATCTGGTGGAAAATGGCCCACTTGCAAGTAAAATACCCCATCAAATCATTTAACTTTCATTATTCTGATACAGATTCCATATGTCCTTAAGAACATCCAATATGGATCTAACTAGAATCAGCAGCTAAATTCTGGTTTTTTTTTTTTTTTACCACAATGCAGTTGCTATCAATGCGGTGTTCATGCAGTCATATATAGGTGGAGTATCTTGTCCATTCATATGTTCAAAGCGGTTGGAACATGGAGTTTTGCTGGTGGGTTATGGCTCAGCTGACTATTGGATCATCAAAAACTCATGGGGAGAAAATGGATACTACAGAATCTGCAGAGGAAGGAATATTTGTGGAGTTGATTCCTTGGTCTCAACTGTTGCAGCTGTTCATACCCCAATTGCAGCAGCAGGTCAGTAAATTGGATCTCTCTTAACACTGTAATTTATGAAAATGTGTGAATTCTAAGCCTTTAGGTCCCTCCTTTTCTTTCTTTCCTTATTCTTATTGGATTCATTATTGTTCTCCTGTTCCATAAAATATGCCAAGAAACCAAAAGAATCTGGTTCATGTACATTCCCTTCTTATACTATCACATTTTAAATGAAATGGAAG

mRNA sequence

ATGCCATACGTCTGTTTCCTCTTTAGTGATACTATTCTGAAACCTTTGAACTTGCATTTTATAAATTCTGCATATATCTTCTGCACACGGCAGTGTGATCCAGAGGGAGCTGGTTCCTGTGACTCTGGTTGCAATGGTGGCTCGATGAACAGTGCATTTGAATACACATTAAAAGTTGGTTGCAATCGTGAAGCCTGTAAGTTTGACAGGTCCAAGATCGCCGCTGCATCAGTTGCCAATTTCAGTGTTGTTTCACTTGATGAGGACCAAATTGCTGCAAATCTGGTGGAAAATGGCCCACTTGCAATTGCTATCAATGCGGTGTTCATGCAGTCATATATAGGTGGAGTATCTTGTCCATTCATATGTTCAAAGCGGTTGGAACATGGAGTTTTGCTGGTGGGTTATGGCTCAGCTGACTATTGGATCATCAAAAACTCATGGGGAGAAAATGGATACTACAGAATCTGCAGAGGAAGGAATATTTGTGGAGTTGATTCCTTGGTCTCAACTGTTGCAGCTGTTCATACCCCAATTGCAGCAGCAGGTCAGTAAATTGGATCTCTCTTAACACTGTAATTTATGAAAATGTGTGAATTCTAAGCCTTTAGGTCCCTCCTTTTCTTTCTTTCCTTATTCTTATTGGATTCATTATTGTTCTCCTGTTCCATAAAATATGCCAAGAAACCAAAAGAATCTGGTTCATGTACATTCCCTTCTTATACTATCACATTTTAAATGAAATGGAAG

Coding sequence (CDS)

ATGCCATACGTCTGTTTCCTCTTTAGTGATACTATTCTGAAACCTTTGAACTTGCATTTTATAAATTCTGCATATATCTTCTGCACACGGCAGTGTGATCCAGAGGGAGCTGGTTCCTGTGACTCTGGTTGCAATGGTGGCTCGATGAACAGTGCATTTGAATACACATTAAAAGTTGGTTGCAATCGTGAAGCCTGTAAGTTTGACAGGTCCAAGATCGCCGCTGCATCAGTTGCCAATTTCAGTGTTGTTTCACTTGATGAGGACCAAATTGCTGCAAATCTGGTGGAAAATGGCCCACTTGCAATTGCTATCAATGCGGTGTTCATGCAGTCATATATAGGTGGAGTATCTTGTCCATTCATATGTTCAAAGCGGTTGGAACATGGAGTTTTGCTGGTGGGTTATGGCTCAGCTGACTATTGGATCATCAAAAACTCATGGGGAGAAAATGGATACTACAGAATCTGCAGAGGAAGGAATATTTGTGGAGTTGATTCCTTGGTCTCAACTGTTGCAGCTGTTCATACCCCAATTGCAGCAGCAGGTCAGTAA

Protein sequence

MPYVCFLFSDTILKPLNLHFINSAYIFCTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVGCNREACKFDRSKIAAASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVGYGSADYWIIKNSWGENGYYRICRGRNICGVDSLVSTVAAVHTPIAAAGQ
BLAST of CmaCh19G006820.1 vs. Swiss-Prot
Match: RD19B_ARATH (Probable cysteine protease RD19B OS=Arabidopsis thaliana GN=RD19B PE=2 SV=2)

HSP 1 Score: 226.9 bits (577), Expect = 1.8e-58
Identity = 118/173 (68.21%), Postives = 133/173 (76.88%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG-CNRE-----------ACKFDRSKIAA 87
           C  +CDPE  GSCDSGCNGG MNSAFEYTLK G   RE           +CK DRSKI A
Sbjct: 187 CDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVA 246

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSVVS++EDQIAANL++NGPLA+AINA +MQ+YIGGVSCP+ICS+RL HGVLLVG
Sbjct: 247 -SVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVG 306

Query: 148 YGSA----------DYWIIKN----SWGENGYYRICRGRNICGVDSLVSTVAA 175
           YGSA           YWIIKN    SWGENG+Y+IC+GRNICGVDSLVSTVAA
Sbjct: 307 YGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA 358

BLAST of CmaCh19G006820.1 vs. Swiss-Prot
Match: RD19A_ARATH (Cysteine protease RD19A OS=Arabidopsis thaliana GN=RD19A PE=1 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 1.0e-56
Identity = 112/173 (64.74%), Postives = 132/173 (76.30%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVGC------------NREACKFDRSKIAA 87
           C  +CDPE A SCDSGCNGG MNSAFEYTLK G             + + CK D+SKI A
Sbjct: 190 CDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVA 249

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSV+S+DE+QIAANLV+NGPLA+AINA +MQ+YIGGVSCP+IC++RL HGVLLVG
Sbjct: 250 -SVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVG 309

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAA 175
           YG+A           YWIIKNS    WGENG+Y+IC+GRNICGVDS+VSTVAA
Sbjct: 310 YGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAA 361

BLAST of CmaCh19G006820.1 vs. Swiss-Prot
Match: RD19C_ARATH (Probable cysteine protease RD19C OS=Arabidopsis thaliana GN=RD19C PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 6.1e-54
Identity = 110/177 (62.15%), Postives = 132/177 (74.58%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVGC------------NREACKFDRSKIAA 87
           C  +CDP  A SCDSGC+GG MN+AFEY LK G             +  ACKFD+SKI A
Sbjct: 195 CDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVA 254

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSVVS DEDQIAANLV++GPLAIAINA++MQ+YIGGVSCP++CSK  +HGVLLVG
Sbjct: 255 -SVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQDHGVLLVG 314

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRG-RNICGVDSLVSTVAAVHT 178
           +GS+           YWIIKNS    WGE+GYY+ICRG  N+CG+D++VSTVAAVHT
Sbjct: 315 FGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVSTVAAVHT 370

BLAST of CmaCh19G006820.1 vs. Swiss-Prot
Match: CYSP_PEA (Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 6.1e-54
Identity = 109/176 (61.93%), Postives = 132/176 (75.00%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG----------CNRE-ACKFDRSKIAAA 87
           C   CDPE AGSCDSGCNGG MN+AFEY L+ G            R+ +CKFD+SK+ A 
Sbjct: 187 CDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRDGSCKFDKSKVVA- 246

Query: 88  SVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSK-RLEHGVLLVG 147
           SV+NFSVV+LDEDQIAANLV+NGPLA+AINA +MQ+Y+ GVSCP++C+K RL+HGVLLVG
Sbjct: 247 SVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYVCAKSRLDHGVLLVG 306

Query: 148 YGSA----------DYWIIKNSWGEN----GYYRICRGRNICGVDSLVSTVAAVHT 178
           +G             YWIIKNSWG+N    GYY+ICRGRN+CGVDS+VSTVAA  +
Sbjct: 307 FGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVSTVAAAQS 361

BLAST of CmaCh19G006820.1 vs. Swiss-Prot
Match: CYSP1_MAIZE (Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 3.5e-49
Identity = 103/180 (57.22%), Postives = 123/180 (68.33%), Query Frame = 1

Query: 25  YIFCTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVGCNREA-----------CKFDRSKI 84
           ++ C  +CD     SCDSGCNGG M +AF Y  K G                CKFD+SKI
Sbjct: 189 FVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSDGKCKFDKSKI 248

Query: 85  AAASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLL 144
            A SV NFSVVS+DE QI+ANL+++GPLAI INA +MQ+YIGGVSCP+IC + L+HGVLL
Sbjct: 249 VA-SVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLL 308

Query: 145 VGYGSA----------DYWIIKNS----WGENGYYRICRG---RNICGVDSLVSTVAAVH 177
           VGYG++           YWIIKNS    WGENGYY+ICRG   RN CGVDS+VSTV+AVH
Sbjct: 309 VGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVH 367

BLAST of CmaCh19G006820.1 vs. TrEMBL
Match: A0A059BDD0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G01706 PE=3 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 6.6e-63
Identity = 127/176 (72.16%), Postives = 140/176 (79.55%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE  GSCDSGCNGG MNSAFEYTLK G             +R +CKFD+SKIAA
Sbjct: 199 CDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRSSCKFDKSKIAA 258

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SVANFSVVSLDEDQIAANLV+NGPLAIAINAVFMQ+Y+GGVSCP+ICSKRL+HGVLLVG
Sbjct: 259 -SVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKRLDHGVLLVG 318

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENG+Y+ICRGRNICGVDS+VSTVAA+HT
Sbjct: 319 YGSAAYSPIRMKEKPYWIIKNSWGENWGENGFYKICRGRNICGVDSMVSTVAAIHT 373

BLAST of CmaCh19G006820.1 vs. TrEMBL
Match: A0A0B0PI03_GOSAR (Cysteinease RD19a-like protein OS=Gossypium arboreum GN=F383_09016 PE=3 SV=1)

HSP 1 Score: 245.7 bits (626), Expect = 4.3e-62
Identity = 127/176 (72.16%), Postives = 138/176 (78.41%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE AGSCDSGCNGG MNSAFEYTLK G             +R  CKFD+SKI A
Sbjct: 197 CDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDKSKIVA 256

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
             VANFSVVSLDEDQIAANLV+NGPLA+AINAVFMQ+YIGGVSCP+ICSKRL+HGVLLVG
Sbjct: 257 -KVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVG 316

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENGYY+ICRGRN+CGVDSLVSTVAAV+T
Sbjct: 317 YGSAGYAPIRLKDKPYWIIKNSWGETWGENGYYKICRGRNVCGVDSLVSTVAAVNT 371

BLAST of CmaCh19G006820.1 vs. TrEMBL
Match: A0A0D2RSY7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G247000 PE=3 SV=1)

HSP 1 Score: 244.2 bits (622), Expect = 1.2e-61
Identity = 126/176 (71.59%), Postives = 138/176 (78.41%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE AGSCDSGCNGG MNSAFEYTLK G             +R  CKFD+SKI A
Sbjct: 197 CDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDKSKIVA 256

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
             VANFSVVSLDEDQIAANLV+NGPLA+AINAVFMQ+YIGGVSCP+ICSKRL+HGVLLVG
Sbjct: 257 -KVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVG 316

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENG+Y+ICRGRN+CGVDSLVSTVAAV+T
Sbjct: 317 YGSAGYAPIRLKDKPYWIIKNSWGETWGENGFYKICRGRNVCGVDSLVSTVAAVNT 371

BLAST of CmaCh19G006820.1 vs. TrEMBL
Match: F6I1A2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g00280 PE=3 SV=1)

HSP 1 Score: 243.4 bits (620), Expect = 2.1e-61
Identity = 122/176 (69.32%), Postives = 140/176 (79.55%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE  GSCDSGCNGG MN+AFEYTLK G             +R +CKFD++KIAA
Sbjct: 200 CDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKEEDYPYTGTDRGSCKFDKTKIAA 259

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSV+SLDEDQIAANLV+NGPLA+AINAVFMQ+Y+GGVSCP+ICSKRL+HGVLLVG
Sbjct: 260 -SVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLVG 319

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENG+Y+ICRGRN+CGVDS+VSTVAAVHT
Sbjct: 320 YGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYKICRGRNVCGVDSMVSTVAAVHT 374

BLAST of CmaCh19G006820.1 vs. TrEMBL
Match: A0A067L3H8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26761 PE=3 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 2.8e-61
Identity = 123/176 (69.89%), Postives = 140/176 (79.55%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE AGSCDSGCNGG MNSAFEYTLK G             +R ACKFD++K+AA
Sbjct: 193 CDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGACKFDKTKVAA 252

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            +VANFSV+SLDEDQIAANLV+NGPLA+AINAV+MQ+YIGGVSCP+ICSKRL+HGVLLVG
Sbjct: 253 -TVANFSVISLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSKRLDHGVLLVG 312

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGE+GYY+ICRGRN+CGVDS+VSTVAAV T
Sbjct: 313 YGSAGYAPIRLKEKPYWIIKNSWGETWGESGYYKICRGRNVCGVDSMVSTVAAVQT 367

BLAST of CmaCh19G006820.1 vs. TAIR10
Match: AT2G21430.1 (AT2G21430.1 Papain family cysteine protease)

HSP 1 Score: 226.9 bits (577), Expect = 1.0e-59
Identity = 118/173 (68.21%), Postives = 133/173 (76.88%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG-CNRE-----------ACKFDRSKIAA 87
           C  +CDPE  GSCDSGCNGG MNSAFEYTLK G   RE           +CK DRSKI A
Sbjct: 187 CDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVA 246

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSVVS++EDQIAANL++NGPLA+AINA +MQ+YIGGVSCP+ICS+RL HGVLLVG
Sbjct: 247 -SVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVG 306

Query: 148 YGSA----------DYWIIKN----SWGENGYYRICRGRNICGVDSLVSTVAA 175
           YGSA           YWIIKN    SWGENG+Y+IC+GRNICGVDSLVSTVAA
Sbjct: 307 YGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVSTVAA 358

BLAST of CmaCh19G006820.1 vs. TAIR10
Match: AT4G39090.1 (AT4G39090.1 Papain family cysteine protease)

HSP 1 Score: 221.1 bits (562), Expect = 5.7e-58
Identity = 112/173 (64.74%), Postives = 132/173 (76.30%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVGC------------NREACKFDRSKIAA 87
           C  +CDPE A SCDSGCNGG MNSAFEYTLK G             + + CK D+SKI A
Sbjct: 190 CDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVA 249

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSV+S+DE+QIAANLV+NGPLA+AINA +MQ+YIGGVSCP+IC++RL HGVLLVG
Sbjct: 250 -SVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVG 309

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAA 175
           YG+A           YWIIKNS    WGENG+Y+IC+GRNICGVDS+VSTVAA
Sbjct: 310 YGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAA 361

BLAST of CmaCh19G006820.1 vs. TAIR10
Match: AT4G16190.1 (AT4G16190.1 Papain family cysteine protease)

HSP 1 Score: 211.8 bits (538), Expect = 3.5e-55
Identity = 110/177 (62.15%), Postives = 132/177 (74.58%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVGC------------NREACKFDRSKIAA 87
           C  +CDP  A SCDSGC+GG MN+AFEY LK G             +  ACKFD+SKI A
Sbjct: 195 CDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVA 254

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SV+NFSVVS DEDQIAANLV++GPLAIAINA++MQ+YIGGVSCP++CSK  +HGVLLVG
Sbjct: 255 -SVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGVSCPYVCSKSQDHGVLLVG 314

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRG-RNICGVDSLVSTVAAVHT 178
           +GS+           YWIIKNS    WGE+GYY+ICRG  N+CG+D++VSTVAAVHT
Sbjct: 315 FGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVSTVAAVHT 370

BLAST of CmaCh19G006820.1 vs. TAIR10
Match: AT3G54940.2 (AT3G54940.2 Papain family cysteine protease)

HSP 1 Score: 182.6 bits (462), Expect = 2.2e-46
Identity = 99/204 (48.53%), Postives = 127/204 (62.25%), Query Frame = 1

Query: 5   CFLFSDTILKPLNLHFINSAYIF---------CTRQCDPEGAGSCDSGCNGGSMNSAFEY 64
           C+ FS T       HF+++  +          C + CDP+   +CD+GC GG M +A+EY
Sbjct: 161 CWAFSTTGAAE-GAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEY 220

Query: 65  TLKVGCNREA-----------CKFDRSKIAAASVANFSVVSLDEDQIAANLVENGPLAIA 124
            ++ G   E            CKFD  K+A   V NF+ + LDE+QIAANLV +GPLA+ 
Sbjct: 221 LMEAGGLEEERSYPYTGKRGHCKFDPEKVAVR-VLNFTTIPLDENQIAANLVRHGPLAVG 280

Query: 125 INAVFMQSYIGGVSCPFICSKR-LEHGVLLVGYGS----------ADYWIIKNS----WG 174
           +NAVFMQ+YIGGVSCP ICSKR + HGVLLVGYGS            YWIIKNS    WG
Sbjct: 281 LNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWG 340

BLAST of CmaCh19G006820.1 vs. TAIR10
Match: AT1G09850.1 (AT1G09850.1 xylem bark cysteine peptidase 3)

HSP 1 Score: 75.5 bits (184), Expect = 3.9e-14
Identity = 53/163 (32.52%), Postives = 84/163 (51.53%), Query Frame = 1

Query: 39  SCDSGCNGGSMNSAFEYTLKV-GCNRE----------ACKFDRSKIAAASVANFSVVSLD 98
           S ++GCNGG M+ AFE+ +K  G + E           CK D+ K    ++ +++ V  +
Sbjct: 176 SYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSN 235

Query: 99  EDQIAANLVENGPLAIAI--NAVFMQSYIGGV-SCPFICSKRLEHGVLLVGYGS---ADY 158
           +++     V   P+++ I  +    Q Y  G+ S P  CS  L+H VL+VGYGS    DY
Sbjct: 236 DEKALMEAVAAQPVSVGICGSERAFQLYSSGIFSGP--CSTSLDHAVLIVGYGSQNGVDY 295

Query: 159 WIIKNSWGE----NGYYRICRGR----NICGVDSLVSTVAAVH 177
           WI+KNSWG+    +G+  + R       +CG++ L S     H
Sbjct: 296 WIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTH 336

BLAST of CmaCh19G006820.1 vs. NCBI nr
Match: gi|702402595|ref|XP_010066197.1| (PREDICTED: probable cysteine proteinase A494 [Eucalyptus grandis])

HSP 1 Score: 248.4 bits (633), Expect = 9.4e-63
Identity = 127/176 (72.16%), Postives = 140/176 (79.55%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE  GSCDSGCNGG MNSAFEYTLK G             +R +CKFD+SKIAA
Sbjct: 199 CDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRSSCKFDKSKIAA 258

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SVANFSVVSLDEDQIAANLV+NGPLAIAINAVFMQ+Y+GGVSCP+ICSKRL+HGVLLVG
Sbjct: 259 -SVANFSVVSLDEDQIAANLVKNGPLAIAINAVFMQTYVGGVSCPYICSKRLDHGVLLVG 318

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENG+Y+ICRGRNICGVDS+VSTVAA+HT
Sbjct: 319 YGSAAYSPIRMKEKPYWIIKNSWGENWGENGFYKICRGRNICGVDSMVSTVAAIHT 373

BLAST of CmaCh19G006820.1 vs. NCBI nr
Match: gi|1009149343|ref|XP_015892430.1| (PREDICTED: cysteine proteinase RD19a [Ziziphus jujuba])

HSP 1 Score: 246.1 bits (627), Expect = 4.7e-62
Identity = 126/176 (71.59%), Postives = 138/176 (78.41%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE  GSCDSGCNGG MNSAFEYTLK G             +R  CKFD++KIAA
Sbjct: 195 CDHECDPEEKGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDKTKIAA 254

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            SVANFSVVSLDEDQIAANLV+NGPLA+AINAVFMQ+Y+GGVSCP+ICSKRL+HGVLLVG
Sbjct: 255 -SVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKRLDHGVLLVG 314

Query: 148 YGSADY----------WIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA Y          WIIKNS    WGENGYY+ICRGRNICGVDS+VSTVAA HT
Sbjct: 315 YGSAGYAPIRMKDKPFWIIKNSWGETWGENGYYKICRGRNICGVDSMVSTVAAAHT 369

BLAST of CmaCh19G006820.1 vs. NCBI nr
Match: gi|728844621|gb|KHG24064.1| (Cysteinease RD19a -like protein [Gossypium arboreum])

HSP 1 Score: 245.7 bits (626), Expect = 6.1e-62
Identity = 127/176 (72.16%), Postives = 138/176 (78.41%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE AGSCDSGCNGG MNSAFEYTLK G             +R  CKFD+SKI A
Sbjct: 197 CDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDKSKIVA 256

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
             VANFSVVSLDEDQIAANLV+NGPLA+AINAVFMQ+YIGGVSCP+ICSKRL+HGVLLVG
Sbjct: 257 -KVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVG 316

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENGYY+ICRGRN+CGVDSLVSTVAAV+T
Sbjct: 317 YGSAGYAPIRLKDKPYWIIKNSWGETWGENGYYKICRGRNVCGVDSLVSTVAAVNT 371

BLAST of CmaCh19G006820.1 vs. NCBI nr
Match: gi|823161035|ref|XP_012480377.1| (PREDICTED: cysteine proteinase RD19a [Gossypium raimondii])

HSP 1 Score: 244.2 bits (622), Expect = 1.8e-61
Identity = 126/176 (71.59%), Postives = 138/176 (78.41%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE AGSCDSGCNGG MNSAFEYTLK G             +R  CKFD+SKI A
Sbjct: 197 CDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDKSKIVA 256

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
             VANFSVVSLDEDQIAANLV+NGPLA+AINAVFMQ+YIGGVSCP+ICSKRL+HGVLLVG
Sbjct: 257 -KVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCPYICSKRLDHGVLLVG 316

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGENG+Y+ICRGRN+CGVDSLVSTVAAV+T
Sbjct: 317 YGSAGYAPIRLKDKPYWIIKNSWGETWGENGFYKICRGRNVCGVDSLVSTVAAVNT 371

BLAST of CmaCh19G006820.1 vs. NCBI nr
Match: gi|802564032|ref|XP_012067202.1| (PREDICTED: probable cysteine proteinase A494 [Jatropha curcas])

HSP 1 Score: 243.0 bits (619), Expect = 4.0e-61
Identity = 123/176 (69.89%), Postives = 140/176 (79.55%), Query Frame = 1

Query: 28  CTRQCDPEGAGSCDSGCNGGSMNSAFEYTLKVG------------CNREACKFDRSKIAA 87
           C  +CDPE AGSCDSGCNGG MNSAFEYTLK G             +R ACKFD++K+AA
Sbjct: 193 CDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGACKFDKTKVAA 252

Query: 88  ASVANFSVVSLDEDQIAANLVENGPLAIAINAVFMQSYIGGVSCPFICSKRLEHGVLLVG 147
            +VANFSV+SLDEDQIAANLV+NGPLA+AINAV+MQ+YIGGVSCP+ICSKRL+HGVLLVG
Sbjct: 253 -TVANFSVISLDEDQIAANLVKNGPLAVAINAVYMQTYIGGVSCPYICSKRLDHGVLLVG 312

Query: 148 YGSA----------DYWIIKNS----WGENGYYRICRGRNICGVDSLVSTVAAVHT 178
           YGSA           YWIIKNS    WGE+GYY+ICRGRN+CGVDS+VSTVAAV T
Sbjct: 313 YGSAGYAPIRLKEKPYWIIKNSWGETWGESGYYKICRGRNVCGVDSMVSTVAAVQT 367

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RD19B_ARATH1.8e-5868.21Probable cysteine protease RD19B OS=Arabidopsis thaliana GN=RD19B PE=2 SV=2[more]
RD19A_ARATH1.0e-5664.74Cysteine protease RD19A OS=Arabidopsis thaliana GN=RD19A PE=1 SV=1[more]
RD19C_ARATH6.1e-5462.15Probable cysteine protease RD19C OS=Arabidopsis thaliana GN=RD19C PE=2 SV=1[more]
CYSP_PEA6.1e-5461.93Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1[more]
CYSP1_MAIZE3.5e-4957.22Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A059BDD0_EUCGR6.6e-6372.16Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_G01706 PE=3 SV=1[more]
A0A0B0PI03_GOSAR4.3e-6272.16Cysteinease RD19a-like protein OS=Gossypium arboreum GN=F383_09016 PE=3 SV=1[more]
A0A0D2RSY7_GOSRA1.2e-6171.59Uncharacterized protein OS=Gossypium raimondii GN=B456_005G247000 PE=3 SV=1[more]
F6I1A2_VITVI2.1e-6169.32Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g00280 PE=3 SV=... [more]
A0A067L3H8_JATCU2.8e-6169.89Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26761 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G21430.11.0e-5968.21 Papain family cysteine protease[more]
AT4G39090.15.7e-5864.74 Papain family cysteine protease[more]
AT4G16190.13.5e-5562.15 Papain family cysteine protease[more]
AT3G54940.22.2e-4648.53 Papain family cysteine protease[more]
AT1G09850.13.9e-1432.52 xylem bark cysteine peptidase 3[more]
Match NameE-valueIdentityDescription
gi|702402595|ref|XP_010066197.1|9.4e-6372.16PREDICTED: probable cysteine proteinase A494 [Eucalyptus grandis][more]
gi|1009149343|ref|XP_015892430.1|4.7e-6271.59PREDICTED: cysteine proteinase RD19a [Ziziphus jujuba][more]
gi|728844621|gb|KHG24064.1|6.1e-6272.16Cysteinease RD19a -like protein [Gossypium arboreum][more]
gi|823161035|ref|XP_012480377.1|1.8e-6171.59PREDICTED: cysteine proteinase RD19a [Gossypium raimondii][more]
gi|802564032|ref|XP_012067202.1|4.0e-6169.89PREDICTED: probable cysteine proteinase A494 [Jatropha curcas][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR000668Peptidase_C1A_C
IPR013128Peptidase_C1A
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0008234cysteine-type peptidase activity
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0008234 cysteine-type peptidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmaCh19G006820CmaCh19G006820gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmaCh19G006820.1CmaCh19G006820.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh19G006820.1.exon.2CmaCh19G006820.1.exon.2exon
CmaCh19G006820.1.exon.1CmaCh19G006820.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh19G006820.1.CDS.1CmaCh19G006820.1.CDS.1CDS
CmaCh19G006820.1.CDS.2CmaCh19G006820.1.CDS.2CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh19G006820.1.three_prime_UTR.1CmaCh19G006820.1.three_prime_UTR.1three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 40..171
score: 4.8
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 2..173
score: 7.3
IPR013128Peptidase C1APANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 32..175
score: 1.6
NoneNo IPR availableGENE3DG3DSA:3.90.70.10coord: 39..172
score: 3.9
NoneNo IPR availablePANTHERPTHR12411:SF338CYSTEINE PROTEINASE RD19A-RELATEDcoord: 32..175
score: 1.6
NoneNo IPR availableunknownSSF54001Cysteine proteinasescoord: 38..174
score: 4.91