CmoCh12G003890 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh12G003890
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
Descriptionzingipain-2
LocationCmo_Chr12: 2409820 .. 2417031 (+)
RNA-Seq ExpressionCmoCh12G003890
SyntenyCmoCh12G003890
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCAAAATCATTCTCGAAAAATCCAAAAAATCCAAACAAAATCATTCAAGTTCGACGATTTTCGTGAGTTAATTGCCTGTCAATTGCCGATTGGTTTGGACCATTGTTCATCGAAGATCGAATAACCAATCCTCTGGTTATTTGATAGCAGCCATCTACGGCGGCCGTGTACCGACTCGCCGGCGATTATCCATCACCGCCACGCTTTTATCTCGGCAAAGAATGGCCAATCTCAGCTTCCATTTTGTGATTACCTTTCTCTTTCTTCTTCGCCATTTTTCTGCTACTTCGGATATTTCGGAGCTCTTTGAAATCTGGTGCACAGAACATGGAAAATCGTATTCCTCCGCGGAAGAAAAGCTTTACAGACTCGGTGTTTTTGCCGATAACTATGAATTTGTTACTCATCACAATAATCAGGGAAATTCTTCTTATACTCTTTCTCTCAATGCTTATGCCGATATTACTCACCATGAGTTCAAGGCCGCTCGCTTAGGTCTTTCCTCTGCTTTGCGGAGCTCGCGGCCGGTTTCGCCGCAAGAACCCTATCTTCATCGGGATGTTCCTGAATCGTTAGATTGGAGGAAGAAAGGGGCTGTGACTGCTGTTAAGGATCAAGGAAGTTGTGGTATGTTTAATTTCTGTTTTCGATTGTTTTGTTTGGTGAAGATCTTGAATCTGATTGAACTACTCTGTTCCGTGATTTTGAATTTGATTGAAACCCCAGGACACTAGGGTTAGTTTTGACGATCGATGCATTATTAGTGGAAATTTCTATAAATGGCTTCATGAACATGGGGTTATCTCTTACAAATCCAAACAAAGAAATCATTATAAGTGGAAATTGTGCTTCTGTTTCTTGAAACCAAACATTATTGCTTGGCCATATGAGTTTAAGCATTTGTTCTTCAGTCTATTTTAAGATTCTGTATGTAGAATTGCAGAAATCTAGTTCATGTTTGAGCTTAACTTCAGATATTGTCCTTTTTGTGCTTCTTCTCCTCTCCTAAGGAGAGGTTCCACGCCCTTATAAAGAATGTTTGGTTCTCCTCCCCAATTGATGTAGAATCTCACAGTCCACCTCCCTTTGGGGCCCAGCGTCCTTGCTGACACTCGTTCCCTTCTCCAATCGATGTGGGACCTCCCAATCCATCCTCTTCGAGACCCAACGTCCTTGCTGGCACATCGCTCCTAGCCCACCACTAGCCGATATTGTTTTCTTTAGGCTTCCCCTCAAGGTTTTTAAAACGCGTATGCTAGGGAGAGGTTACCAACCCTTATAATGAATATTTCGTTCTCCTCGGGATCTCAAACAAAATATAAAGAAAAGTGTTGAAAATGTATTAAATTATGAACTATACTCTAGCTGAGTTTTGGATCTAGTGATGATTTTGTCACATAACCTGATATTGATCATCAAGGTAGTTAAGTTGTATCATCATCTCTTCATGATAAGCTTTGAATTATGTGGAAGCTTTGCTCATCTAAATGGGAAAGAACCAATATCTTTTGTTTTCACTTTTCAGGTGCTTGCTGGTCTTTCTCAGCAACAGGAGCTATTGAAGGGATCAACCAAATTAGAACAGGGTCTCTTATCAGTGTTTCTGAACAGGAATTAATTGATTGTGACAGATCGTATAATTCTGGCTGTGGAGGAGGACTGATGGATTATGCATACCAATTTGTTATAAAAAACCACGGGATCGACACCGAAGATGATTATCCTTTTCAGGGTCGTGATGGATCGTGTCATAAGGACAAGGTAATATAATTCATCTCTTCCTCACATCTTCCAAATACGTTCACAATATTAGTAAATCACTTTTTATGGACTCATTTTGAATGATTCATGAGCTCACCGTAAGCAAATATTGTTCTCTTTGAGTTTTCCCTTTCGGGCTTCATCTCAAGGTTTTTAAAACGTGTCTACTAGGGAGAGTTTTCCACTCCCTTATAAAGAATGCTTCGTTCTTCGTTCTCCTCACAATCCACCCCCTTCTGGGCGCAGTATTCTCGCTGGCATTCATTCCCTTTCTCCAATCAATGTGGGACCCCCAATCCACCCCTTCAGGGTCTAGCGTCCTTACTGGCACACTGCCTCGTGTCCACCCACTTCGAGGTGCAACCTCCTCGATGTGAGACTCCCCAATCCACCTCCCTTCGGTGCCCAGTGTCGTTGCTGACACACTGCCTCGTGTCCACCCCCTTCGGGGCTCAGCCTTTTGCTAGCACATCACTCAGTGTCTAGCTCTGTTACCATTTATAACGGCTCAAGCCCACCGCTAGCAAATATTGTTTTCATTGGGCTTTCCCTTTCGGGGTTCCTTTCAAGGTTTTTAAAACGCGTCTACTAGGGAGAGGTTTCCATGCCCTTATAAAAAATGTTCCGTTTTTCTCCCCGATGTAGGATCTCACATAGAAGATGGATTCCTTACGCATAGAAAAAGATTGCTTGCGCTGAAATGCATTTTGTGTCTATTGAGCTTGCAGCTAAATAGGAAGGTCGTTACCATTGATGGCTATTCCGATGTTCCTCCAAACAATGAGGAAAAATTACTGCAAGCAGTAGCAATTCAACCTGTGAGTGTTGGTATCTGTGGCAGTGAGAGAGCTTTTCAATTATATTCAAAGGTTGGTTCTATTCTCCACTAATGTTCGAAAATTCCATCATACGTGAAATTATGACTAAAATCAAGAATGGTACTTGTTTCTGTTAGGGAATTTTCTCTGGTCCATGTTCAACTTCCTTGGATCATGCTGTGTTGATTGTAGGATATGGATCAGAAAATGGTGTTGATTATTGGATCGTGAAGAACTCGTGGGGTAAACGTTGGGGAATGGATGGTTATATTCACATGCAGCGCAACAGCGGAAATTCTGAAGGCGTTTGCGGAATCAACATGCTTGCTTCATATCCAACTAAAACGAGTCCCAACCCACCTCCCTCCCCTCCGCCAGGTCCAACAAAATGCAGTTTTCTTACTAGCTGTGCTGCTGGGGAGACCTGTTGTTGTGCGAAGGAATTTTTTGGCCTTTGCTTGTCTTGGAAATGCTGTGGACTGAGCTCTGCTGTCTGTTGCAAGGACGGTCGTCATTGTTGCCCCTTTGATTATCCCATTTGTGATACTCAGAGGAACCTATGCCTCAAGGTTTGTTCCTTTTCCCACTTCTTATGATTTTTTGTTTGAGATGAAAATATCATTTTAGTGTTGGAAGTAGCTTGTCTGTGTGTTGTTTTCTTCCTTCAAACAAGTTTAGTCATCTGGAAATGGATCTGTTTTCATAAGCATATGTTAGAGAGAAAAAGAAGAGAAGGAACATGCTATAACTCGTCTCTTGTTGTGAGATCTTACATCGGTTGGAGAAAGGAACGAAGCATTCTTTATAAGAGTGTGGAAACATCTCCCTAGCAAATGCATTTTAAAAACCTTAAGGGGAAGGCTGGAAGGGAAAGTCGGAAAAGGATAATATCTGCTAACGGTGGGTTTGGGCTGTTACAGATGGTATCAGAGCCCAGTGAGGATGTTGGGCCCCGAAAAGGGATGAATTGTGAGATCCCACGTCGATTGGAGAAGGGAATGAGTGCCTGCGAGAACGTTAGGCCCCGAAGAGGGGTGGATTGTGAGATCCTACGTCGGTTGGAGAGGGGAACGAGGCATTCTTTATAAGAGTGTGGAAACCTCTCCCCTGTAGATGCGTTTTAAAAACCTTGAGGCAAAGCCCAAAGAGGACAATGTTTGCGAACGGTGGGCTTGGGTTGTTATAGATAGTATCAGAGCACACCAGGCAATGACCAACGAGGACATTGAGCCCCGAAAGGGGGTGGATTGTGAGATCCTACATCGATTGGAGAGAGGAATGAGTGCCAGTGAGGACGTTGGGCTCCGAAGGGGGGTGGATTGTGAGATCCCCACATCGATTGGAGAGGGGAACGAAACATTTTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGAGCAGACGCATTTTAAAACCTTGAGGGAAGCCCGGAAGTGAAAACCCAAAATGGACAATATCTGCTAGCAGTGGGTTTTGACCGTTTCAGTTACCGTCTTTTCTTGTACCTGATAGACAATTGTCACACTCTGTCTCTATCTAATAGTTAGTAACCATACCAAACACTCACCTGTTTACAGCTGCATAGCGACACAACTTCTAACAAAAGAACATTTTCCAAGAATGCATCAAAAAAATCAGTTGAAACATGTTCAAGTCATATGTTTGAAGTGTTTCTTTGTCTATGTTCTCTCTGTTTCCATATTCTGGGCTAACAGAAACATCATCTTGGCAGAGAACGATGAACGGTACAAGAACAGAAGCACTCGAGAATCGGAGTCCTTCGGGAACATCCGGTTCGTGGAGCTCTTCCTAAGGTTTGAATTCTATGAAATGGCTCCATCTGTAGAATGAATAAACGCTGCCATTCTAGTTACCACTGTATTCTACTAGCTTTTAGCTACATTTGAAGTCTAAAAAGGAGGTTCAGTCATTAGCTTTCTGCTATGGTCCTAGAGCTTCCATGGATTAGCTTTGGAGAGTTCTTACCTCGGCCGAGCTCGGAGTTTTATCGGGTTCTGGCTCGTCACTGTTCTTAGGTTTCTGATCAGACTGCTGCATTTCAAGTTCTTAAAGTTGCATCTTACTCGATGCATAGATTTGTTCTTGAATATTATGGATTGGTTTGGATTTGAATGTTTGTGATTAGTTGTAAGAGAAATTTGGTTCATATATTGCTGCCTCATATGCTCCACCATAGTTTTTTTTATGCTTTTCTGTTCCCTAAATCTTTAGGACGGAGTTAGCGATGACTTCCGGTTCTTATTTTGCTTTACAATACCTTGCCATATTCGATGAAAGTAGTATATACGGCTTGGAGGGAGATCTTTCAAATCTTTCGAGATCCACCCTACAATATAGGTTCAGAAGTCAAAATAAATTATTTTAGCTCTTGTAATTAAAAATTATTCTCGGTTTATGTTTTTTATTAAAATAACATTTCATTAAAGTTTAAATTTAGGATATTTAAAAATAAAGAAAATAGGACGGAGGCAAAGTCGACCACGATGACAAAGTTCACGGGTAATTTTTTTGTTCCTCTACTTCCTTGCCAATTCATAACGAGACGATTCATGCAAGATGGGTCATACCAGAGAAACAAACATTTCTAGTTTGAACCACCGTTTACGAGTATCTTAAAAAATAGTAATTCAATCAAATAGATAAGAACGTGTTTTATCCTATCTAAATTAAACGTGCGACTATCAAAAAAATTTAACGAATAACTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTATTTTCATATTTTAGATATACTATCGAATATATATTATATTTACTTTCTAAAAAGAGAAATATAAACTTTAAATACTAACTCTATTTTAAAAATAAATAAATAAATATCAATCAAATATGCTCTGAAATATTTCAAACCATATGAATGAATTTAGAATTAATTTAATTTTTTTCAAGATAATTAAAATAATTAACCATAGATTAAAATCTAATTCTTTTATTATTATTTTTTTAAATTCATTGTATATATATTTGTATTTATTTATTTTTTAATTCGACGAAATTTAAAATTTTGAGATCGAGAATAAATGTATTGACCGTTGAATTATATTGTACGGCAAATTTGAGGCATTTTAAAAATAACAATGAATTATAATGAATTATTTATTTATTTATTTTTATTCTTTTTATCCGTTCCCATTCAGAATAACGACATTTTTATGGATAAATGCATTCACATTTTTGGCATAGGCGCTCGCGTCATGAACCACTGAGATTATCTCGTAATAAAACTCAAATATGAGGGTAATTTGGTAAATTGATACGAACGACATCTAGGGCCATTTCTCTCTCTATATAAACCCCCTTGCTCCTTTCGTCTTCATCCATCGCTTCGTTTTCAAATACAGTGAGACCTTTGCAGGATACACACTCGTTTTTCTTTGAAGCAATGGCTGTTGAAGGTCAAGTGATTCCACTTCGCGATGCTAAAGAATTCGATGTGATAATCGATAAAGAGAAGGAATCTGGCAAACTGGTATGATTTTTTATTTCTTTTCCTCTCCATTTCTGCTTTCCAATTTCTGATTTCCTGATGTTCTTGTCGTTTCAAACGATTTTTGTTGGATTTGTGTTGTTATTGTTGGATATTGTTCATGATTTGGAGGAAATCGTGATTGATTCGTTAGGATTTGTGATATTGACTTGTATTTGTGTCTAGGTTTCTTTTGTTTAATCTAATTTATTGAGTCCTGGCTAGCTATGTTTGATTGCCTCGGATTTTATGATTTCTAATCTTCAATTCTTCTGTGTTACTCGCTTCGTGTCATTGAATTTGATGATTCCATCCTCCTTTGACAGTAGTAGTACTTCACTGGATTAAATATAGTATCACCTCTATAGTTTTGGGGGTTCTTTTTGTTTCTGATGTATATAAAATCTTCTTTGATAGATTGTGATCGATTTTACTGCTTCCTGGTGCCCGCCATGCCGTATCATTGCTCCAGTATTCGCAGAGTTGGCTAAGGCGCACGTCAATGTCACTTTCTTGAAAGTGGACGTCGATAATGTCAAGGTAAAGCAATTTGAGCCTTTTCAACTCTTCCAAACTTTTTATATTGTTGATGCAGTGGAATTGAACTGTTCTGTTTCTTTGCAGGAAATTGCTGAGAGGTTCGAGGTGAATGCGATGCCGACCTTTGTTTTCCTGAAAGGAGGAAAAGAAGTTCACAGGATTGTTGGTGCCGACAAGGTGGAGCTAGGGGTTAAAGTACTGGAGTTAAGCGCTGCACCTGCTACTTCTGCTGCTTAGATGGTCAGCTTTGATGACCATTTTCCCTCTTAATATTATCATTATGTTGTAATAACGATTCTGGTTTTGGGGTTTGTTGTTCTTGTGGATTCCATGACAGTATCTCTCTGTGATGCTAAGAATGTTTGTAAACGATATATGTAGAATAAGATTTAATGTTTCATAATGTATCTCTGCTCATCAAGCTTTTGAATTCAGTGATCGTTGCCTCAATGTTCACAGATTACTTAAGAACAGCTTACTTTTTCACCTTTTTTCTCTCATGTTTAGAAATGTCTTGATTCTGAAAAGGGTTTTAAGTTTTATGATTGGTTGAATTTGTCAACTTTGTCAATTTTGTTCTTGGC

mRNA sequence

GTCAAAATCATTCTCGAAAAATCCAAAAAATCCAAACAAAATCATTCAAGTTCGACGATTTTCGTGAGTTAATTGCCTGTCAATTGCCGATTGGTTTGGACCATTGTTCATCGAAGATCGAATAACCAATCCTCTGGTTATTTGATAGCAGCCATCTACGGCGGCCGTGTACCGACTCGCCGGCGATTATCCATCACCGCCACGCTTTTATCTCGGCAAAGAATGGCCAATCTCAGCTTCCATTTTGTGATTACCTTTCTCTTTCTTCTTCGCCATTTTTCTGCTACTTCGGATATTTCGGAGCTCTTTGAAATCTGGTGCACAGAACATGGAAAATCGTATTCCTCCGCGGAAGAAAAGCTTTACAGACTCGGTGTTTTTGCCGATAACTATGAATTTGTTACTCATCACAATAATCAGGGAAATTCTTCTTATACTCTTTCTCTCAATGCTTATGCCGATATTACTCACCATGAGTTCAAGGCCGCTCGCTTAGGTCTTTCCTCTGCTTTGCGGAGCTCGCGGCCGGTTTCGCCGCAAGAACCCTATCTTCATCGGGATGTTCCTGAATCGTTAGATTGGAGGAAGAAAGGGGCTGTGACTGCTGTTAAGGATCAAGGAAGTTGTGGTGCTTGCTGGTCTTTCTCAGCAACAGGAGCTATTGAAGGGATCAACCAAATTAGAACAGGGTCTCTTATCAGTGTTTCTGAACAGGAATTAATTGATTGTGACAGATCGTATAATTCTGGCTGTGGAGGAGGACTGATGGATTATGCATACCAATTTGTTATAAAAAACCACGGGATCGACACCGAAGATGATTATCCTTTTCAGGGTCGTGATGGATCGTGTCATAAGGACAAGCTAAATAGGAAGGTCGTTACCATTGATGGCTATTCCGATGTTCCTCCAAACAATGAGGAAAAATTACTGCAAGCAGTAGCAATTCAACCTGGAATTTTCTCTGGTCCATGTTCAACTTCCTTGGATCATGCTGTGTTGATTGTAGGATATGGATCAGAAAATGGTGTTGATTATTGGATCGTGAAGAACTCGTGGGGTAAACGTTGGGGAATGGATGGTTATATTCACATGCAGCGCAACAGCGGAAATTCTGAAGGCGTTTGCGGAATCAACATGCTTGCTTCATATCCAACTAAAACGAGTCCCAACCCACCTCCCTCCCCTCCGCCAGGTCCAACAAAATGCAGTTTTCTTACTAGCTGTGCTGCTGGGGAGACCTGTTGTTGTGCGAAGGAATTTTTTGGCCTTTGCTTGTCTTGGAAATGCTGTGGACTGAGCTCTGCTGTCTGTTGCAAGGACGGTCGTCATTGTTGCCCCTTTGATTATCCCATTTGTGATACTCAGAGGAACCTATGCCTCAAGAGAACGATGAACGGTACAAGAACAGAAGCACTCGAGAATCGGAGTCCTTCGGGAACATCCGTGAGACCTTTGCAGGATACACACTCGTTTTTCTTTGAAGCAATGGCTGTTGAAGGTCAAGTGATTCCACTTCGCGATGCTAAAGAATTCGATGTGATAATCGATAAAGAGAAGGAATCTGGCAAACTGATTGTGATCGATTTTACTGCTTCCTGGTGCCCGCCATGCCGTATCATTGCTCCAGTATTCGCAGAGTTGGCTAAGGCGCACGTCAATGTCACTTTCTTGAAAGTGGACGTCGATAATGTCAAGGAAATTGCTGAGAGGTTCGAGGTGAATGCGATGCCGACCTTTGTTTTCCTGAAAGGAGGAAAAGAAGTTCACAGGATTGTTGGTGCCGACAAGGTGGAGCTAGGGGTTAAAGTACTGGAGTTAAGCGCTGCACCTGCTACTTCTGCTGCTTAGATGGTCAGCTTTGATGACCATTTTCCCTCTTAATATTATCATTATGTTGTAATAACGATTCTGGTTTTGGGGTTTGTTGTTCTTGTGGATTCCATGACAGTATCTCTCTGTGATGCTAAGAATGTTTGTAAACGATATATGTAGAATAAGATTTAATGTTTCATAATGTATCTCTGCTCATCAAGCTTTTGAATTCAGTGATCGTTGCCTCAATGTTCACAGATTACTTAAGAACAGCTTACTTTTTCACCTTTTTTCTCTCATGTTTAGAAATGTCTTGATTCTGAAAAGGGTTTTAAGTTTTATGATTGGTTGAATTTGTCAACTTTGTCAATTTTGTTCTTGGC

Coding sequence (CDS)

ATGGCCAATCTCAGCTTCCATTTTGTGATTACCTTTCTCTTTCTTCTTCGCCATTTTTCTGCTACTTCGGATATTTCGGAGCTCTTTGAAATCTGGTGCACAGAACATGGAAAATCGTATTCCTCCGCGGAAGAAAAGCTTTACAGACTCGGTGTTTTTGCCGATAACTATGAATTTGTTACTCATCACAATAATCAGGGAAATTCTTCTTATACTCTTTCTCTCAATGCTTATGCCGATATTACTCACCATGAGTTCAAGGCCGCTCGCTTAGGTCTTTCCTCTGCTTTGCGGAGCTCGCGGCCGGTTTCGCCGCAAGAACCCTATCTTCATCGGGATGTTCCTGAATCGTTAGATTGGAGGAAGAAAGGGGCTGTGACTGCTGTTAAGGATCAAGGAAGTTGTGGTGCTTGCTGGTCTTTCTCAGCAACAGGAGCTATTGAAGGGATCAACCAAATTAGAACAGGGTCTCTTATCAGTGTTTCTGAACAGGAATTAATTGATTGTGACAGATCGTATAATTCTGGCTGTGGAGGAGGACTGATGGATTATGCATACCAATTTGTTATAAAAAACCACGGGATCGACACCGAAGATGATTATCCTTTTCAGGGTCGTGATGGATCGTGTCATAAGGACAAGCTAAATAGGAAGGTCGTTACCATTGATGGCTATTCCGATGTTCCTCCAAACAATGAGGAAAAATTACTGCAAGCAGTAGCAATTCAACCTGGAATTTTCTCTGGTCCATGTTCAACTTCCTTGGATCATGCTGTGTTGATTGTAGGATATGGATCAGAAAATGGTGTTGATTATTGGATCGTGAAGAACTCGTGGGGTAAACGTTGGGGAATGGATGGTTATATTCACATGCAGCGCAACAGCGGAAATTCTGAAGGCGTTTGCGGAATCAACATGCTTGCTTCATATCCAACTAAAACGAGTCCCAACCCACCTCCCTCCCCTCCGCCAGGTCCAACAAAATGCAGTTTTCTTACTAGCTGTGCTGCTGGGGAGACCTGTTGTTGTGCGAAGGAATTTTTTGGCCTTTGCTTGTCTTGGAAATGCTGTGGACTGAGCTCTGCTGTCTGTTGCAAGGACGGTCGTCATTGTTGCCCCTTTGATTATCCCATTTGTGATACTCAGAGGAACCTATGCCTCAAGAGAACGATGAACGGTACAAGAACAGAAGCACTCGAGAATCGGAGTCCTTCGGGAACATCCGTGAGACCTTTGCAGGATACACACTCGTTTTTCTTTGAAGCAATGGCTGTTGAAGGTCAAGTGATTCCACTTCGCGATGCTAAAGAATTCGATGTGATAATCGATAAAGAGAAGGAATCTGGCAAACTGATTGTGATCGATTTTACTGCTTCCTGGTGCCCGCCATGCCGTATCATTGCTCCAGTATTCGCAGAGTTGGCTAAGGCGCACGTCAATGTCACTTTCTTGAAAGTGGACGTCGATAATGTCAAGGAAATTGCTGAGAGGTTCGAGGTGAATGCGATGCCGACCTTTGTTTTCCTGAAAGGAGGAAAAGAAGTTCACAGGATTGTTGGTGCCGACAAGGTGGAGCTAGGGGTTAAAGTACTGGAGTTAAGCGCTGCACCTGCTACTTCTGCTGCTTAG

Protein sequence

MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDWRKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQPGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCCAKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRSPSGTSVRPLQDTHSFFFEAMAVEGQVIPLRDAKEFDVIIDKEKESGKLIVIDFTASWCPPCRIIAPVFAELAKAHVNVTFLKVDVDNVKEIAERFEVNAMPTFVFLKGGKEVHRIVGADKVELGVKVLELSAAPATSAA
Homology
BLAST of CmoCh12G003890 vs. ExPASy Swiss-Prot
Match: Q9LT78 (Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 SV=1)

HSP 1 Score: 409.5 bits (1051), Expect = 6.1e-113
Identity = 200/405 (49.38%), Postives = 262/405 (64.69%), Query Frame = 0

Query: 23  SDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAYADIT 82
           ++   ++E W  E+ K+Y+   EK  R  +F DN +FV  H++  N +Y + L  +AD+T
Sbjct: 37  AEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLT 96

Query: 83  HHEFKAARLGLSSALRSSRPVSPQEPYLHR---DVPESLDWRKKGAVTAVKDQGSCGACW 142
           + EF+A  + L S +  +R     E YL++    +P+++DWR KGAV  VKDQGSCG+CW
Sbjct: 97  NDEFRA--IYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCW 156

Query: 143 SFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTED 202
           +FSA GA+EGINQI+TG LIS+SEQEL+DCD SYN GCGGGLMDYA++F+I+N GIDTE+
Sbjct: 157 AFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEE 216

Query: 203 DYPFQGRD-GSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP-------------- 262
           DYP+   D   C+ DK N +VVTIDGY DVP N+E+ L +A+A QP              
Sbjct: 217 DYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQL 276

Query: 263 ---GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGV 322
              G+F+G C TSLDH V+ VGYGSE G DYWIV+NSWG  WG  GY  ++RN   S G 
Sbjct: 277 YTSGVFTGTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGK 336

Query: 323 CGINMLASYPTKTSPNPPPSPP-PGPTKCSFLTSCAAGETCCCAKEFFGLCLSWKCCGLS 382
           CG+ M+ASYPTK+S + PP PP P P  C    +C A  TCCC  E+ G C SW CC   
Sbjct: 337 CGVAMMASYPTKSSGSNPPKPPAPSPVVCDKSNTCPAKSTCCCLYEYNGKCYSWGCCPYE 396

Query: 383 SAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRSPS 406
           SA CC DG  CCP  YP+CD + N C  +  +    +AL  R P+
Sbjct: 397 SATCCDDGSSCCPQSYPVCDLKANTCRMKGNSPLSIKAL-TRGPA 438

BLAST of CmoCh12G003890 vs. ExPASy Swiss-Prot
Match: P25776 (Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=1 SV=2)

HSP 1 Score: 401.4 bits (1030), Expect = 1.7e-110
Identity = 201/402 (50.00%), Postives = 249/402 (61.94%), Query Frame = 0

Query: 28  LFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHN---NQGNSSYTLSLNAYADITHH 87
           L+  W  EHGKSY++  E+  R   F DN  ++  HN   + G  S+ L LN +AD+T+ 
Sbjct: 39  LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNE 98

Query: 88  EFKAARLGLSSALRSSRPVSPQEPYLHRD---VPESLDWRKKGAVTAVKDQGSCGACWSF 147
           E++   LGL +  R  R VS  + YL  D   +PES+DWR KGAV  +KDQG CG+CW+F
Sbjct: 99  EYRDTYLGLRNKPRRERKVS--DRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAF 158

Query: 148 SATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEDDY 207
           SA  A+EGINQI TG LIS+SEQEL+DCD SYN GC GGLMDYA+ F+I N GIDTEDDY
Sbjct: 159 SAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDDY 218

Query: 208 PFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP----------------- 267
           P++G+D  C  ++ N KVVTID Y DV PN+E  L +AVA QP                 
Sbjct: 219 PYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 278

Query: 268 GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGVCGI 327
           GIF+G C T+LDH V  VGYG+ENG DYWIV+NSWGK WG  GY+ M+RN   S G CGI
Sbjct: 279 GIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGI 338

Query: 328 NMLASYPTKTSPNP------PPSPPPGPTKCSFLTSCAAGETCCCAKEFFGLCLSWKCCG 387
            +  SYP K   NP      PPSP P PT C    +C    TCCC  E+   C +W CC 
Sbjct: 339 AVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAWGCCP 398

Query: 388 LSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALE 401
           L  A CC D   CCP +YPIC+ Q+  CL    +    +AL+
Sbjct: 399 LEGATCCDDHYSCCPHEYPICNVQQGTCLMAKDSPLAVKALK 438

BLAST of CmoCh12G003890 vs. ExPASy Swiss-Prot
Match: P43297 (Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1)

HSP 1 Score: 399.8 bits (1026), Expect = 4.8e-110
Identity = 197/406 (48.52%), Postives = 253/406 (62.32%), Query Frame = 0

Query: 23  SDISELFEIWCTEHGK--SYSSAEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAYAD 82
           +++  ++E W  +HGK  S +S  EK  R  +F DN  FV  HN + N SY L L  +AD
Sbjct: 44  AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFAD 103

Query: 83  ITHHEFKAARLGLSSALRSSRPVSPQ-EPYLHRDVPESLDWRKKGAVTAVKDQGSCGACW 142
           +T+ E+++  LG     +  R  S + E  +  ++PES+DWRKKGAV  VKDQG CG+CW
Sbjct: 104 LTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCW 163

Query: 143 SFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTED 202
           +FS  GA+EGINQI TG LI++SEQEL+DCD SYN GC GGLMDYA++F+IKN GIDT+ 
Sbjct: 164 AFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDK 223

Query: 203 DYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP--------------- 262
           DYP++G DG+C + + N KVVTID Y DVP  +EE L +AVA QP               
Sbjct: 224 DYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLY 283

Query: 263 --GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGVC 322
             GIF G C T LDH V+ VGYG+ENG DYWIV+NSWGK WG  GY+ M RN  +S G C
Sbjct: 284 DSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKC 343

Query: 323 GINMLASYPTKTSPNP------PPSPPPGPTKCSFLTSCAAGETCCCAKEFFGLCLSWKC 382
           GI +  SYP K   NP      PPSP   PT+C    +C    TCCC  E+   C +W C
Sbjct: 344 GIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFAWGC 403

Query: 383 CGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENR 403
           C L +A CC D   CCP +YP+CD  +  CL    +    +AL+ +
Sbjct: 404 CPLEAATCCDDNYSCCPHEYPVCDLDQGTCLLSKNSPFSVKALKRK 448

BLAST of CmoCh12G003890 vs. ExPASy Swiss-Prot
Match: Q9FMH8 (Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 SV=1)

HSP 1 Score: 394.8 bits (1013), Expect = 1.6e-108
Identity = 196/410 (47.80%), Postives = 248/410 (60.49%), Query Frame = 0

Query: 23  SDISELFEIWCTEHGKSYSS----AEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAY 82
           S++  ++E W  EHGK   +      EK  R  +F DN  F+  HN + N SY L L  +
Sbjct: 44  SEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSYKLGLTRF 103

Query: 83  ADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDWRKKGAVTAVKDQGSCGAC 142
           AD+T+ E+++  LG     R  +     +  +   +P+S+DWRK+GAV  VKDQGSCG+C
Sbjct: 104 ADLTNEEYRSMYLGAKPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSC 163

Query: 143 WSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 202
           W+FS  GA+EGIN+I TG LIS+SEQEL+DCD SYN GC GGLMDYA++F+IKN GIDTE
Sbjct: 164 WAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTE 223

Query: 203 DDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP-------------- 262
            DYP++  DG C +++ N KVVTID Y DVP N+E  L +A+A QP              
Sbjct: 224 ADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQL 283

Query: 263 ---GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGV 322
              G+F G C T LDH V+ VGYG+ENG DYWIV+NSWG RWG  GYI M RN     G 
Sbjct: 284 YSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGK 343

Query: 323 CGINMLASYPTKTSPNP------PPSPPPGPTKCSFLTSCAAGETCCCAKEFFGLCLSWK 382
           CGI M ASYP K   NP      PPSP   PT C    SC    TCCC  ++   C  W 
Sbjct: 344 CGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYCFGWG 403

Query: 383 CCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRSPS 406
           CC L +A CC D   CCP +YP+CD  R  CL    +    +AL+ R+P+
Sbjct: 404 CCPLEAATCCDDNSSCCPHEYPVCDVNRGTCLMSKNSPFSVKALK-RTPA 451

BLAST of CmoCh12G003890 vs. ExPASy Swiss-Prot
Match: P25777 (Oryzain beta chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0670200 PE=1 SV=2)

HSP 1 Score: 372.1 bits (954), Expect = 1.1e-101
Identity = 191/395 (48.35%), Postives = 247/395 (62.53%), Query Frame = 0

Query: 29  FEIWCTEHGKSYSSA--EEKLYRLGVFADNYEFVTHHNNQGN--SSYTLSLNAYADITHH 88
           +++W  E+G    +A   E   R  VF DN +FV  HN + +    + L +N +AD+T+ 
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 89  EFKAARLGLSSALRSSRPVSPQEPYLH---RDVPESLDWRKKGAVTAVKDQGSCGACWSF 148
           EF+A  LG   A RS    +  E Y H    ++PES+DWR+KGAV  VK+QG CG+CW+F
Sbjct: 112 EFRATFLGAKVAERSR---AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAF 171

Query: 149 SATGAIEGINQIRTGSLISVSEQELIDCD-RSYNSGCGGGLMDYAYQFVIKNHGIDTEDD 208
           SA   +E INQ+ TG +I++SEQEL++C     NSGC GGLMD A+ F+IKN GIDTEDD
Sbjct: 172 SAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDD 231

Query: 209 YPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP---------------- 268
           YP++  DG C  ++ N KVV+IDG+ DVP N+E+ L +AVA QP                
Sbjct: 232 YPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYH 291

Query: 269 -GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGVCG 328
            G+FSG C TSLDH V+ VGYG++NG DYWIV+NSWG +WG  GY+ M+RN   + G CG
Sbjct: 292 SGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCG 351

Query: 329 INMLASYPTKTSPNPP---PSPPPGPTK---------CSFLTSCAAGETCCCAKEFFGLC 387
           I M+ASYPTK+  NPP   P+PP  PT          C    SC AG TCCCA  F  LC
Sbjct: 352 IAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPAGSTCCCAFGFRNLC 411

BLAST of CmoCh12G003890 vs. ExPASy TrEMBL
Match: A0A6J1GH92 (zingipain-2 OS=Cucurbita moschata OX=3662 GN=LOC111454188 PE=3 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 5.5e-242
Identity = 408/425 (96.00%), Postives = 408/425 (96.00%), Query Frame = 0

Query: 1   MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60
           MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV
Sbjct: 1   MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60

Query: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW 120
           THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW
Sbjct: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW 120

Query: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180
           RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG
Sbjct: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180

Query: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240
           LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV
Sbjct: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240

Query: 241 AIQP-----------------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300
           AIQP                 GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW
Sbjct: 241 AIQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300

Query: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360
           GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC
Sbjct: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360

Query: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS 409
           AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS
Sbjct: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS 420

BLAST of CmoCh12G003890 vs. ExPASy TrEMBL
Match: A0A6J1KL32 (zingipain-2 OS=Cucurbita maxima OX=3661 GN=LOC111496216 PE=3 SV=1)

HSP 1 Score: 832.8 bits (2150), Expect = 8.3e-238
Identity = 401/425 (94.35%), Postives = 404/425 (95.06%), Query Frame = 0

Query: 1   MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60
           MANL+FHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV
Sbjct: 1   MANLNFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60

Query: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW 120
           THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALR+SRPVSPQEPYLH+DVPE LDW
Sbjct: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRNSRPVSPQEPYLHQDVPELLDW 120

Query: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180
           RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG
Sbjct: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180

Query: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240
           LMDYAYQFVIKNHGIDTEDDYPFQGRDGSC KDKLNRKVVTIDGYSDVPPNNEEKLLQAV
Sbjct: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCRKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240

Query: 241 AIQP-----------------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300
           AIQP                 GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW
Sbjct: 241 AIQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300

Query: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360
           GMDGYIHMQRNSGNSEGVCGINMLASYP KTSPNPPPSPPPGPTKCSFLTSCAAGETCCC
Sbjct: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPIKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360

Query: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS 409
           AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICD QRNLCLKRTMNGTRTEALENRS
Sbjct: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDPQRNLCLKRTMNGTRTEALENRS 420

BLAST of CmoCh12G003890 vs. ExPASy TrEMBL
Match: A0A1S3BBY8 (zingipain-2 OS=Cucumis melo OX=3656 GN=LOC103488009 PE=3 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 5.4e-213
Identity = 358/425 (84.24%), Postives = 374/425 (88.00%), Query Frame = 0

Query: 1   MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60
           M N +FHF+  FL   R   ATS++SELFEIWCTEHGKSYSSAEEKLYRL VFADNYEFV
Sbjct: 1   MGNFAFHFLTLFLLFFRPLFATSNVSELFEIWCTEHGKSYSSAEEKLYRLSVFADNYEFV 60

Query: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW 120
           THHNN GNSSYTLSLN+YAD+THHEFK +RLG S ALR+ RPV PQEP L RDVP+SLDW
Sbjct: 61  THHNNLGNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSLPRDVPDSLDW 120

Query: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180
           RKKGAVTAVKDQGSCGACWSFSATGAIEGINQI TGSLISVSEQELIDCDRSYNSGCGGG
Sbjct: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIMTGSLISVSEQELIDCDRSYNSGCGGG 180

Query: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240
           LMDYAYQFVI NHGIDTEDDYP+QGRDGSC KDKL R VVTIDGY+D+PPN+E KLLQAV
Sbjct: 181 LMDYAYQFVISNHGIDTEDDYPYQGRDGSCRKDKLQRNVVTIDGYTDIPPNDEGKLLQAV 240

Query: 241 AIQP-----------------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300
           A QP                 GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK W
Sbjct: 241 AAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSW 300

Query: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360
           GMDGY+HMQRNSGNSEGVCGIN LASYPTKTSPNPPPSPPPGPTKCS LTSCAAGETCCC
Sbjct: 301 GMDGYMHMQRNSGNSEGVCGINKLASYPTKTSPNPPPSPPPGPTKCSILTSCAAGETCCC 360

Query: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS 409
           AK+F GLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDT RNLCLKRTMNGTR E LENRS
Sbjct: 361 AKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKRTMNGTRMEVLENRS 420

BLAST of CmoCh12G003890 vs. ExPASy TrEMBL
Match: A0A5A7VC45 (Zingipain-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005350 PE=3 SV=1)

HSP 1 Score: 743.8 bits (1919), Expect = 5.0e-211
Identity = 357/425 (84.00%), Postives = 373/425 (87.76%), Query Frame = 0

Query: 1   MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60
           M N +FHF+  FL   R   ATS++SELFEIWCTEHGKSYSSAEEKLYRL VFADNYEFV
Sbjct: 1   MGNFAFHFLTLFLLFFRPLFATSNVSELFEIWCTEHGKSYSSAEEKLYRLSVFADNYEFV 60

Query: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW 120
           THHNN GNSSYTLSLN+YAD+THHEFK +RLG S ALR+ RPV PQEP L RDVP+SLDW
Sbjct: 61  THHNNLGNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSLPRDVPDSLDW 120

Query: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180
           RKKGAVTAVKDQGSC ACWSFSATGAIEGINQI TGSLISVSEQELIDCDRSYNSGCGGG
Sbjct: 121 RKKGAVTAVKDQGSC-ACWSFSATGAIEGINQIMTGSLISVSEQELIDCDRSYNSGCGGG 180

Query: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240
           LMDYAYQFVI NHGIDTEDDYP+QGRDGSC KDKL R VVTIDGY+D+PPN+E KLLQAV
Sbjct: 181 LMDYAYQFVISNHGIDTEDDYPYQGRDGSCRKDKLQRNVVTIDGYTDIPPNDEGKLLQAV 240

Query: 241 AIQP-----------------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300
           A QP                 GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK W
Sbjct: 241 AAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSW 300

Query: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360
           GMDGY+HMQRNSGNSEGVCGIN LASYPTKTSPNPPPSPPPGPTKCS LTSCAAGETCCC
Sbjct: 301 GMDGYMHMQRNSGNSEGVCGINKLASYPTKTSPNPPPSPPPGPTKCSILTSCAAGETCCC 360

Query: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS 409
           AK+F GLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDT RNLCLKRTMNGTR E LENRS
Sbjct: 361 AKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKRTMNGTRMEVLENRS 420

BLAST of CmoCh12G003890 vs. ExPASy TrEMBL
Match: A0A0A0LNP7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G362510 PE=3 SV=1)

HSP 1 Score: 743.0 bits (1917), Expect = 8.6e-211
Identity = 354/425 (83.29%), Postives = 375/425 (88.24%), Query Frame = 0

Query: 1   MANLSFHFVITFLFLLRHFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60
           M N +FHF+  FL L R  SATS++SELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV
Sbjct: 1   MGNYAFHFLTLFLLLFRPLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFV 60

Query: 61  THHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDW 120
           THHNN  NSSYTLSLN+YAD+THHEFK +RLG S ALR+ RPV PQEP L RDVP+SLDW
Sbjct: 61  THHNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSLPRDVPDSLDW 120

Query: 121 RKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGG 180
           RKKGAVTAVKDQGSCGACWSFSATGA+EGINQI TGSLIS+SEQELIDCDRSYNSGCGGG
Sbjct: 121 RKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGG 180

Query: 181 LMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAV 240
           LMDYAYQFVI NHGIDTE+DYP+Q RDGSC KDKL R VVTIDGY+D+P N+E KLLQAV
Sbjct: 181 LMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAV 240

Query: 241 AIQP-----------------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRW 300
           A QP                 GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK W
Sbjct: 241 AAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSW 300

Query: 301 GMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGETCCC 360
           GMDGY+HMQRNSGNSEGVCGIN LASYPTKT+PNPPPSPPPGPTKCS LTSCAAGETCCC
Sbjct: 301 GMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCC 360

Query: 361 AKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRS 409
           AK+F GLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDT RNLCLK+TMNGTRTE LENRS
Sbjct: 361 AKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRS 420

BLAST of CmoCh12G003890 vs. TAIR 10
Match: AT1G09850.1 (xylem bark cysteine peptidase 3 )

HSP 1 Score: 574.3 bits (1479), Expect = 1.0e-163
Identity = 280/425 (65.88%), Postives = 332/425 (78.12%), Query Frame = 0

Query: 1   MANLSFHFVITFLFLLRHFSATS--DISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYE 60
           M++ SF   +TF FLL   S++S  DISELF+ WC +HGK+Y S EE+  R+ +F DN++
Sbjct: 3   MSSSSF-ISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHD 62

Query: 61  FVTHHNNQGNSSYTLSLNAYADITHHEFKAARLGLSSALRSSRPVSP-QEPYLHRDVPES 120
           FVT HN   N++Y+LSLNA+AD+THHEFKA+RLGLS +  S    S  Q       VP+S
Sbjct: 63  FVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGSVKVPDS 122

Query: 121 LDWRKKGAVTAVKDQGSCGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGC 180
           +DWRKKGAVT VKDQGSCGACWSFSATGA+EGINQI TG LIS+SEQELIDCD+SYN+GC
Sbjct: 123 VDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGC 182

Query: 181 GGGLMDYAYQFVIKNHGIDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLL 240
            GGLMDYA++FVIKNHGIDTE DYP+Q RDG+C KDKL +KVVTID Y+ V  N+E+ L+
Sbjct: 183 NGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALM 242

Query: 241 QAVAIQP-----------------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 300
           +AVA QP                 GIFSGPCSTSLDHAVLIVGYGS+NGVDYWIVKNSWG
Sbjct: 243 EAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWG 302

Query: 301 KRWGMDGYIHMQRNSGNSEGVCGINMLASYPTKTSPNPPPSPPPGPTKCSFLTSCAAGET 360
           K WGMDG++HMQRN+ NS+GVCGINMLASYP KT PNPPP  PPGPTKC+  T C++GET
Sbjct: 303 KSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGET 362

Query: 361 CCCAKEFFGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALE 406
           CCCA+E FGLC SWKCC + SAVCCKDGRHCCP DYP+CDT R+LCLK+T N T  +   
Sbjct: 363 CCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFW 422

BLAST of CmoCh12G003890 vs. TAIR 10
Match: AT3G19390.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 409.5 bits (1051), Expect = 4.3e-114
Identity = 200/405 (49.38%), Postives = 262/405 (64.69%), Query Frame = 0

Query: 23  SDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAYADIT 82
           ++   ++E W  E+ K+Y+   EK  R  +F DN +FV  H++  N +Y + L  +AD+T
Sbjct: 37  AEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLT 96

Query: 83  HHEFKAARLGLSSALRSSRPVSPQEPYLHR---DVPESLDWRKKGAVTAVKDQGSCGACW 142
           + EF+A  + L S +  +R     E YL++    +P+++DWR KGAV  VKDQGSCG+CW
Sbjct: 97  NDEFRA--IYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCW 156

Query: 143 SFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTED 202
           +FSA GA+EGINQI+TG LIS+SEQEL+DCD SYN GCGGGLMDYA++F+I+N GIDTE+
Sbjct: 157 AFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEE 216

Query: 203 DYPFQGRD-GSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP-------------- 262
           DYP+   D   C+ DK N +VVTIDGY DVP N+E+ L +A+A QP              
Sbjct: 217 DYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQL 276

Query: 263 ---GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGV 322
              G+F+G C TSLDH V+ VGYGSE G DYWIV+NSWG  WG  GY  ++RN   S G 
Sbjct: 277 YTSGVFTGTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGK 336

Query: 323 CGINMLASYPTKTSPNPPPSPP-PGPTKCSFLTSCAAGETCCCAKEFFGLCLSWKCCGLS 382
           CG+ M+ASYPTK+S + PP PP P P  C    +C A  TCCC  E+ G C SW CC   
Sbjct: 337 CGVAMMASYPTKSSGSNPPKPPAPSPVVCDKSNTCPAKSTCCCLYEYNGKCYSWGCCPYE 396

Query: 383 SAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRSPS 406
           SA CC DG  CCP  YP+CD + N C  +  +    +AL  R P+
Sbjct: 397 SATCCDDGSSCCPQSYPVCDLKANTCRMKGNSPLSIKAL-TRGPA 438

BLAST of CmoCh12G003890 vs. TAIR 10
Match: AT1G47128.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 399.8 bits (1026), Expect = 3.4e-111
Identity = 197/406 (48.52%), Postives = 253/406 (62.32%), Query Frame = 0

Query: 23  SDISELFEIWCTEHGK--SYSSAEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAYAD 82
           +++  ++E W  +HGK  S +S  EK  R  +F DN  FV  HN + N SY L L  +AD
Sbjct: 44  AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFAD 103

Query: 83  ITHHEFKAARLGLSSALRSSRPVSPQ-EPYLHRDVPESLDWRKKGAVTAVKDQGSCGACW 142
           +T+ E+++  LG     +  R  S + E  +  ++PES+DWRKKGAV  VKDQG CG+CW
Sbjct: 104 LTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCW 163

Query: 143 SFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTED 202
           +FS  GA+EGINQI TG LI++SEQEL+DCD SYN GC GGLMDYA++F+IKN GIDT+ 
Sbjct: 164 AFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDK 223

Query: 203 DYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP--------------- 262
           DYP++G DG+C + + N KVVTID Y DVP  +EE L +AVA QP               
Sbjct: 224 DYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLY 283

Query: 263 --GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGVC 322
             GIF G C T LDH V+ VGYG+ENG DYWIV+NSWGK WG  GY+ M RN  +S G C
Sbjct: 284 DSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKC 343

Query: 323 GINMLASYPTKTSPNP------PPSPPPGPTKCSFLTSCAAGETCCCAKEFFGLCLSWKC 382
           GI +  SYP K   NP      PPSP   PT+C    +C    TCCC  E+   C +W C
Sbjct: 344 GIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFAWGC 403

Query: 383 CGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENR 403
           C L +A CC D   CCP +YP+CD  +  CL    +    +AL+ +
Sbjct: 404 CPLEAATCCDDNYSCCPHEYPVCDLDQGTCLLSKNSPFSVKALKRK 448

BLAST of CmoCh12G003890 vs. TAIR 10
Match: AT5G43060.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 394.8 bits (1013), Expect = 1.1e-109
Identity = 196/410 (47.80%), Postives = 248/410 (60.49%), Query Frame = 0

Query: 23  SDISELFEIWCTEHGKSYSS----AEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNAY 82
           S++  ++E W  EHGK   +      EK  R  +F DN  F+  HN + N SY L L  +
Sbjct: 44  SEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSYKLGLTRF 103

Query: 83  ADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDVPESLDWRKKGAVTAVKDQGSCGAC 142
           AD+T+ E+++  LG     R  +     +  +   +P+S+DWRK+GAV  VKDQGSCG+C
Sbjct: 104 ADLTNEEYRSMYLGAKPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSC 163

Query: 143 WSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 202
           W+FS  GA+EGIN+I TG LIS+SEQEL+DCD SYN GC GGLMDYA++F+IKN GIDTE
Sbjct: 164 WAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTE 223

Query: 203 DDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP-------------- 262
            DYP++  DG C +++ N KVVTID Y DVP N+E  L +A+A QP              
Sbjct: 224 ADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQL 283

Query: 263 ---GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGNSEGV 322
              G+F G C T LDH V+ VGYG+ENG DYWIV+NSWG RWG  GYI M RN     G 
Sbjct: 284 YSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGK 343

Query: 323 CGINMLASYPTKTSPNP------PPSPPPGPTKCSFLTSCAAGETCCCAKEFFGLCLSWK 382
           CGI M ASYP K   NP      PPSP   PT C    SC    TCCC  ++   C  W 
Sbjct: 344 CGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYCFGWG 403

Query: 383 CCGLSSAVCCKDGRHCCPFDYPICDTQRNLCLKRTMNGTRTEALENRSPS 406
           CC L +A CC D   CCP +YP+CD  R  CL    +    +AL+ R+P+
Sbjct: 404 CCPLEAATCCDDNSSCCPHEYPVCDVNRGTCLMSKNSPFSVKALK-RTPA 451

BLAST of CmoCh12G003890 vs. TAIR 10
Match: AT4G35350.1 (xylem cysteine peptidase 1 )

HSP 1 Score: 339.0 bits (868), Expect = 7.2e-93
Identity = 166/317 (52.37%), Postives = 212/317 (66.88%), Query Frame = 0

Query: 18  HFSATSDISELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNQGNSSYTLSLNA 77
           H + T  + ELFE W +EH K+Y S EEK++R  VF +N   +   NN+ N SY L LN 
Sbjct: 40  HLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNE 99

Query: 78  YADITHHEFKAARLGLSSALRSSRPVSPQEPYLHRDV---PESLDWRKKGAVTAVKDQGS 137
           +AD+TH EFK   LGL+   + SR   P   + +RD+   P+S+DWRKKGAV  VKDQG 
Sbjct: 100 FADLTHEEFKGRYLGLAKP-QFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQ 159

Query: 138 CGACWSFSATGAIEGINQIRTGSLISVSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHG 197
           CG+CW+FS   A+EGINQI TG+L S+SEQELIDCD ++NSGC GGLMDYA+Q++I   G
Sbjct: 160 CGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGG 219

Query: 198 IDTEDDYPFQGRDGSCHKDKLNRKVVTIDGYSDVPPNNEEKLLQAVAIQP---------- 257
           +  EDDYP+   +G C + K + + VTI GY DVP N++E L++A+A QP          
Sbjct: 220 LHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGR 279

Query: 258 -------GIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKRWGMDGYIHMQRNSGN 315
                  G+F+G C T LDH V  VGYGS  G DY IVKNSWG RWG  G+I M+RN+G 
Sbjct: 280 DFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGK 339

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LT786.1e-11349.38Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 S... [more]
P257761.7e-11050.00Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=... [more]
P432974.8e-11048.52Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1[more]
Q9FMH81.6e-10847.80Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 S... [more]
P257771.1e-10148.35Oryzain beta chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0670200 PE=1... [more]
Match NameE-valueIdentityDescription
A0A6J1GH925.5e-24296.00zingipain-2 OS=Cucurbita moschata OX=3662 GN=LOC111454188 PE=3 SV=1[more]
A0A6J1KL328.3e-23894.35zingipain-2 OS=Cucurbita maxima OX=3661 GN=LOC111496216 PE=3 SV=1[more]
A0A1S3BBY85.4e-21384.24zingipain-2 OS=Cucumis melo OX=3656 GN=LOC103488009 PE=3 SV=1[more]
A0A5A7VC455.0e-21184.00Zingipain-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005350 PE... [more]
A0A0A0LNP78.6e-21183.29Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G362510 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G09850.11.0e-16365.88xylem bark cysteine peptidase 3 [more]
AT3G19390.14.3e-11449.38Granulin repeat cysteine protease family protein [more]
AT1G47128.13.4e-11148.52Granulin repeat cysteine protease family protein [more]
AT5G43060.11.1e-10947.80Granulin repeat cysteine protease family protein [more]
AT4G35350.17.2e-9352.37xylem cysteine peptidase 1 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 257..267
score: 62.56
coord: 272..278
score: 79.08
coord: 132..147
score: 58.83
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 114..312
e-value: 1.5E-104
score: 363.3
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 114..311
e-value: 3.9E-66
score: 223.2
IPR000118GranulinSMARTSM00277GRAN_2coord: 329..386
e-value: 1.8E-21
score: 87.3
IPR000118GranulinPFAMPF00396Granulincoord: 340..386
e-value: 5.8E-10
score: 39.4
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 29..86
e-value: 4.3E-23
score: 92.7
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 29..86
e-value: 8.8E-16
score: 58.1
NoneNo IPR availableGENE3D3.40.30.10Glutaredoxincoord: 415..538
e-value: 1.6E-35
score: 123.7
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 14..314
e-value: 2.2E-101
score: 341.5
NoneNo IPR availablePANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 245..347
coord: 15..244
NoneNo IPR availablePANTHERPTHR12411:SF414OS05G0508300 PROTEINcoord: 245..347
NoneNo IPR availablePANTHERPTHR12411:SF414OS05G0508300 PROTEINcoord: 15..244
NoneNo IPR availableCDDcd02947TRX_familycoord: 443..525
e-value: 1.23616E-36
score: 129.215
NoneNo IPR availableSUPERFAMILY57277Granulin repeatcoord: 326..359
IPR013766Thioredoxin domainPFAMPF00085Thioredoxincoord: 435..523
e-value: 1.7E-24
score: 85.8
IPR013766Thioredoxin domainPROSITEPS51352THIOREDOXIN_2coord: 411..536
score: 14.441047
IPR037277Granulin superfamilyGENE3D2.10.25.160Granulincoord: 327..397
e-value: 8.3E-10
score: 40.8
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 272..291
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 132..143
IPR017937Thioredoxin, conserved sitePROSITEPS00194THIOREDOXIN_1coord: 453..471
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 255..265
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 115..311
e-value: 3.76026E-103
score: 307.243
IPR036249Thioredoxin-like superfamilySUPERFAMILY52833Thioredoxin-likecoord: 425..524
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 25..312

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh12G003890.1CmoCh12G003890.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005615 extracellular space
cellular_component GO:0005764 lysosome
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity