CmoCh04G020890 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G020890
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionCysteine protease 1
LocationCmo_Chr04 : 12661423 .. 12668962 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGCCCAAACTCAGCTCAGCGACAACTCATTCCAACTCGAAGCTCACCTGAAAGGAAAGAAAGAGCTCCAAGAACCAATCCTTGTTCCCTCTTTGACGTCGATCGAACCCAGTTTTTATCTTCTTGGAACCTCATCGCCTACGCAACTACAATCAATTCTTGATCGCTCGCGGTTGAAGCCTCACAAGTTTGTGAAATTGGGAGCCCATTATTTGAAATAGAATTCCAACTTTGATTGCTTCTTCCTCGGAGTCTCTCTCGCCGAATAGAGGATGCTGGATTTTAACGCCTTTGAGTAGGTTTGTTTTTTTTATGTATCTGATTGTTCTTGTTTCTGTGGCGAATTGGTCTTCCGTTCTTTGTTCTCTCGATTTCTCTCATATTTTGGCTACTTGGGGTTTTCTTCATCTCTCTGATCACCGCCTCTTATTTTCTGCAAGTTCGAATGGGTTTGTTCGCTTTCAGTGGTCCGTTCATGACTTCGCAGTTTTAATCTTGGAGTCAGCCCATCTTACCCTCGTAGCTCTACCAAATTCTCGCCATCTACTGTCTTGGCCAAGCCTTTGACAGGGATATCATGTTATCACTTGTCATAATTAACTTTGTCATGAGCACGCGACTTTGAAAACCAGAAATCATTGTGGTAAACTTGGAAAACGCTTTAACCAATCATGGTCTTGCTTAACGCATTCCATTCTGATTCTTCTTTTTGGTTTTGTCTATTTATATTTATTGATTGCAGCTATTTTTATAAGATAACTCTTAATCGGCCTTCATCAAAATGGAAGTTGAGAGTTCAATAATACTTTGTTTATGTTGATCGTCAAATATCTGTCAATGAGGGTTTGTTTTTGGTGAATCATTATGCTGTTTTTTTTTTCTTAGTTATTCGCCAGGTTTGATGCCTCAATTTGGACGAATATGTTTAAGATAATTTGTGATTATCCTGTCGGTGTTATTCTTTTGTTGGAGCTATTTTTGTCTTGTTAGGTCTTTAAAAAGTTCCAAAGCTACAAAGCCAAAATGGCTTGACACTCATGATTGATCGTTCATTTGAAGGGGAATGAAATGAAATACAAAGTACTTTCATTTTTGTTTTCTTAGTCACTTGTTAGTACACACTTTATGAATGAATTTTGTAGTTTTTTCTCTTGCAACTTTTACCGGGTATGTAGATATAGGCATTAAATATATCATCCCTCGTCCTGCGTTTTCTTTTACGTGTTCTTTTGTAGTGTATTTTCATTCCATGTGCTTTTTCGAGGATCCTTATTTTCTTCTTTATTAGAAACTCGCCTTTTTTTGCCTATAAAAGGCTCGCCCTTTTATTTCTTCTTATCTCTATATTAACTTTGACTGGTGAGTTTTTTTTTAAACTAACTTCATTCCTATTGGATGTATTCCCTTTGTCTCTCTTTGTTTACTAGGTAATGAAGGATTTTCATGGGAAGGGGGAAAGATCTAAATTCAACATGCTCTTCTGAAAGTGTAACTGATATTGTAGATAGAAGTCAGCAATCTATTTGTCCCACGTTAGGGTCTAGAAATCATATATCCTCCAAAGCCTCCTTATGGTCAGGCTTCTTTACATCTACTTTTTCAGTCTTTGAACATAACAAGGAGTCCTCTGTCAGTGAAAAGAAGGCAGTTCATTCTCGACACAATGTCTGGACAACTGTAAGAAGAGTTATGACCAGTGGCTCAATGAGGAGAATACAAGAGCGCATACTGGGTTCTCGCAGGAGTGGTGTTTATAGCTCTGGTGGTGATATATGGCTCCTCGGTGTGTGCCATAAAATTTCCCAAGATCAGGCTTCTGATGATGCAGTTACTAGTGACAGTGTAGCAGGATTTGAGCTAGACTTTTCATCTAGAATTCTGATGACCTATCGTAAAGGTTTCAAGAAATATGCCTTCTTTTAGTTATACTTATGTCATAAGCTTGCATAATTTTTCCTGGACAACTTGCGGTTAACTTATATTTTGCCTCTCCATTTATAGGTTTTAATGGTATTCAAGACTCAAAATACACCAGTGATGTAAATTGGGGTTGCATGCTTAGAAGCAGCCAAATGCTTGTTGCTCAGGTGTTCATACTTCATTTATTTTTGTATCTCAGTTCTGGTGTCTTATTTTTATTTATTTACTTGTAGTGACCTTGTTTATTTTGGTTTTGAGGTTTTCCATAAAGGTATTCTGTTGTTGTCTTCTTCTATTAATCTCCATATTATCTACAGGCATTACTTTTTCATAGATTAGGAAGATCTTGGAGAAAGACTTCACAGAAGGTATTGGCTGTATGGTGTGATGAGACTGGAAGATTTATTTGACATACTGGTTTATTTCAGAAACAAATCTAGCAATAAACCTAATTGGCTTTAATTTGCAGCCACTGGACAAAGAATATGTTGAAATTTTGCATCTTTTTGGTGACTCGGAAACGTCAGCATTTTCGATCCATAATCTTCTTCAGGCAGGAAGGCCCTATGACCTTGCTGCTGGGTCATGGGTGGGACCATATGCCATATGTAGGTCATGGGAGACTCTAGTCCGCTTAAAGAGGGAGACTCCTACTCCCCAAGACCAGCAACTTCCAATGGCCATTTACATTGTTTCTGGAGACGAAGATGGAGAGAGAGGTGGTGCTCCGGTTTTATGCATTGACGTTGCGTCAAGACATTGTTTTCAGTTTTCTAAAGGCCAACTTGATTGGACTCCCATTCTGTTATTAGTTCCTTTGGTTCTTGGACTTGAAAAAATCAATCCAAGGTATGTTATAGTTGTTTACAGGCCTTGTTTTTCTTTTGTTTTTTTTATTAATACTTTGGGCTTCATTACTGATTTTAATAACAATGAAATAGAATTCTTGTTCTAGGTGTTGTGCTTTTAGAGGCTATAGTTTTTCCCTAGTTATTAGTTTTGCATCTCCATGCTGGCAGTTTTGTTGAAAATTGAATGTTTTGGTAATAAGTTGTAAAATTTGAACCTATAGGTTTGGAAATTGTGGCTGTACTGAGTTCTTTTAAGTCTAGCATTGCTTCATGATGGGAGGATTTGCAAGGAAGGGTGGGGAAGATTTTATGTCTATTATTAAATTGGATTTTGTGATAAGAAAGTTGGAAAAATGAAATGTGAAGATGTTTGTTGCTTATGGACGAGGCTAAAACCCAAGATTTAGTGGGAAATGTAATTGTGATACCACGTTTATAGGAAATGTGGCTTTAACCAGACGAAAATCAACCGGTAAGCTTGTTAAATTGCTGAAGATGAGACATTCATTGCAACATATCAATTTTGGTCAAGTTTTTTGGCTTGCCTTCGTTTTAAGATTGGATCTGAACAAAATTATGGCTACTGCCATTAATCTTTTGGGTAGGAAATTAAAGAAGGGACAATTTCATACCCTGCTCTAAGCACATTGCCCCCAACTATCTAGGTTCGTTCCCCCCCTTGGTAAAAACTGGAAATGGTGAAATACTTGACCTTTTGGGAGACCCTCGTGTTCAAAATGCAGCTGGAAATCCAAAAAATGGGGGAACATTCTAATCGCTAAGAGAGGTCAATTCACCTTGGTGTTGTCAGTTTTTAATTGCCTTCCACTATTACTTGTATGTGTTTAAAGCACCTATCACTTGTAACAAAGGATCGGATGAAATTCAGGCTGTTTGGTATGATGGAATCATATAGGAAAAGGGATGGCAATGGGTAAAGAATTTGCATTCCCTTAATTGGAAAAAGGTTTATAAAGGGGGACATGAGGCCCATACCCAAGGATTTTTTATGCATATAGAGAAGGGTGTTTTGCATCCTTCCAATCACATGTGAGTTCCCCATTTCCTCTTATGTCTTTTTTATTCTTTTCTCTTCCTTTTTATTTGTTGCTTTCTTTTAAACATATGCTGATTCTGATGGGAAATTTGGTCAAGCAAATTTATTGTGCCATTGAAAGGGTCAAATTAGGGTTGAGAACTTCTGCAGGATCGATTGTTTCTCCTGATGTTGAGGTGAATAGGGATTCATTATGAGCTTGATACAATTGCTTTGATCTGTGATGGGCTTAGTTGTCAATCTCTTCTATCTGTTGGTTGAGATTGTCACTCTTGCCTACCAATATTATTGGAGCCTAATCTCTCTTATTCTTGTGAAAAGACGTGTAGTTTGAGGACGACTGTCTGAAGCAATTTGAAAACCATTTTAGCACTGGATTAAGAAGCAGAATGAGAGATTTCCATTGGACATAACCCTTAGCGGTTTGACTCTCCTATTTCCATTATTCTCAAACAATTTTGTTTCAAAATATAAACGGGAATGCAATTTAGTGGATGATGTTATTTTCTAACTCTGCATCATAGCATCTCTCCCTGCATTAGGCAACTGCTCATGCCTTTAAATCCATGTTCATATCAACAAAAATTCTATTTATGTCCAATTTATAAGCCATGATAACAAAAACGTTGATGGTTATTCACTTCATGGTCCCTTGCCCTAGCAGCCATGACCTGTTTGGTCAGATTGCCAATTTCCGAACACCGTTTACCAAGTCTTAATAAAGGTCTTGAAATCTTTATTCATGTTATAAACTTTTATTGATTGTCATAATGATGTATTTTGTGTGTTTGCTCGCTTGTTTGCTATAATATAGCAGGCTCGCTTGACTAATATGGAGACCTACATGTTATATTTATAGTTTGACTATTAGAAATTGGAGCTTAGAAATTTAAGCTCAACATATTTATGCAACTTCTGAAGGTACATCCCATCATTACGAGCAACATTTACCTTTCCCCAAAGTCTCGGGATCTTGGGGGGGAAAGCTGGTGCTTCAACATACATCGTGGGTGTTCAAGATGAAAATGCCTTTTACCTTGATCCACATGAGGTTCAGCAGGTACATTAATTTCTTTTGGTCTTTGAAGAATGTCTAAATTATCCTTGCTTTCTTATGTGTCTGATATGTGATATCAGTGTGGTGACACTAATAGACAAATAATAAACTTGCAATATTCTGGTTTTTAGTGCTCAAGCTGGATGCTTTGTGAATCCGGAACAAGTCTACATACGTGTTAGAAACTTGAATTAAGGTCTGAAATGTTTTATAGTACCACTTAGAACTTGCAGGCGTTATGATCAATTGTAGTCATGTGGATTCAGCAAAGTAATGTTACAATGTAGTACTTTTCTTAACTCTATTATCTGAACAAATGTCTCCCCCTCTCTAGGATAGATGTACTTTTTTGTAAATAAATCTTAGTTGGATAAAATGTTCGATCTCTCAATTATATATTTATTTGGTTTAGGATATACTCTATAGCCTTGGTCACGAACTATGGAATCATGGAAAAATATGTTTTCGTTATGATTATGACTTTGAATTTTGTTTCTTTATTTGAGAATTGCAGTATGCATAAGAAAATTTTCACATATTAACTTTTGTTTAATCCCCACCCATTGTGTTGTGGTCTTCATTCTTCAAACAAGAGCACGTCTTTGTTCCTTATGCTTAATCTTTGTTGCACAATTTGCTTGTTCATATCAATTTGCTGTCGTAGGTAATGACTGTCTAGCAATGCTTACATTGTAGGTAGTTAATATTGACAAGGATGACCTAGAGGCTGATACTTCCTCTTATCACTGCAAGTGAGTGTTTCCATCTCATAAATTTCTCATGGCTTACTGTTTTTTTTTCTCTCTTGCCTTTCATTCCTTCTTTTTTGCTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGTCTCTCATTTATATATATCTATGTTGGTTTCTGTAGTGTCATCCGTCACATCCCGTTAGAATCAATAGATCCTTCTCTAGCCATCGGGTTTTATTGTCGAGACAAAGGTTTGCACACCACTCTAGCTGCATCATTGAGGATGGGAACTTGATTGAAGCATATCATTTTGTAGAACTTAATTTTGGTTTCCTTTCTTTTCTGGTGCCAGATGATTTTGATGACTTCTGTTATCGGGCATCAAAGTTAGCAGGCGACTCATACGGAGCTCCATTATTTACAGTTGCTGAAACACATTCCTCAAATTCAGTGAGACACGGCAATGCATTGAATGATGGTAGTAGATTAGTAGTGGACAATGCCGATGTGCACGTGCCAGACGAAGAAGGGGCGCAGGAGGATGACTGGCAATTACTCTGATGAAAAAACAAAGAACACTTTGGGGAATCTGGGAGGAGTTGGTAATCGATTCGTCCTTAATTTCAGCATTAGACTTCGAAGTTCCGTTGAGCTTTCTGGCTAATATGACTGAATAGATGGGAAGGGTATGCCTAATATGTCAATTGTTTTAGTTTATTTAGTGTTACTTGTGCAAAATGCATTAAGCTAATTTAGGTTTTTATTTGTTGAATTTGTTTATTCTTGGTTCATTTTGGGATGAGGTCAGCCTACATTCGTGTGTATGTGTATAATAAGCAGGTTCAATTCCAATTCATAATGAATTATTAATGCTCTGATTACTTAGAGAACTAGGAAAATTATGTATAATGTCTTCTTTTTTTTAGTTCATGAAACAGAAAGGCCTGGA

mRNA sequence

AGAGCCCAAACTCAGCTCAGCGACAACTCATTCCAACTCGAAGCTCACCTGAAAGGAAAGAAAGAGCTCCAAGAACCAATCCTTGTTCCCTCTTTGACGTCGATCGAACCCAGTTTTTATCTTCTTGGAACCTCATCGCCTACGCAACTACAATCAATTCTTGATCGCTCGCGGTTGAAGCCTCACAAGTTTGTGAAATTGGGAGCCCATTATTTGAAATAGAATTCCAACTTTGATTGCTTCTTCCTCGGAGTCTCTCTCGCCGAATAGAGGATGCTGGATTTTAACGCCTTTGAGTAGGTTTGTTTTTTTTATGTATCTGATTGTTCTTGTTTCTGTGGCGAATTGGTCTTCCGTTCTTTGTTCTCTCGATTTCTCTCATATTTTGGCTACTTGGGGTTTTCTTCATCTCTCTGATCACCGCCTCTTATTTTCTGCAAGTTCGAATGGGTTTGTTCGCTTTCAGTGGTCCGTTCATGACTTCGCAGTTTTAATCTTGGAGTCAGCCCATCTTACCCTCGTAGCTCTACCAAATTCTCGCCATCTACTGTCTTGGCCAAGCCTTTGACAGGGATATCATGTTATCACTTGTCATAATTAACTTTGTCATGAGCACGCGACTTTGAAAACCAGAAATCATTGTGGTAATGAAGGATTTTCATGGGAAGGGGGAAAGATCTAAATTCAACATGCTCTTCTGAAAGTGTAACTGATATTGTAGATAGAAGTCAGCAATCTATTTGTCCCACGTTAGGGTCTAGAAATCATATATCCTCCAAAGCCTCCTTATGGTCAGGCTTCTTTACATCTACTTTTTCAGTCTTTGAACATAACAAGGAGTCCTCTGTCAGTGAAAAGAAGGCAGTTCATTCTCGACACAATGTCTGGACAACTGTAAGAAGAGTTATGACCAGTGGCTCAATGAGGAGAATACAAGAGCGCATACTGGGTTCTCGCAGGAGTGGTGTTTATAGCTCTGGTGGTGATATATGGCTCCTCGGTGTGTGCCATAAAATTTCCCAAGATCAGGCTTCTGATGATGCAGTTACTAGTGACAGTGTAGCAGGATTTGAGCTAGACTTTTCATCTAGAATTCTGATGACCTATCGTAAAGGTTTTAATGGTATTCAAGACTCAAAATACACCAGTGATGTAAATTGGGGTTGCATGCTTAGAAGCAGCCAAATGCTTGTTGCTCAGGCATTACTTTTTCATAGATTAGGAAGATCTTGGAGAAAGACTTCACAGAAGCCACTGGACAAAGAATATGTTGAAATTTTGCATCTTTTTGGTGACTCGGAAACGTCAGCATTTTCGATCCATAATCTTCTTCAGGCAGGAAGGCCCTATGACCTTGCTGCTGGGTCATGGGTGGGACCATATGCCATATGTAGGTCATGGGAGACTCTAGTCCGCTTAAAGAGGGAGACTCCTACTCCCCAAGACCAGCAACTTCCAATGGCCATTTACATTGTTTCTGGAGACGAAGATGGAGAGAGAGGTGGTGCTCCGGTTTTATGCATTGACGTTGCGTCAAGACATTGTTTTCAGTTTTCTAAAGGCCAACTTGATTGGACTCCCATTCTGTTATTAGTTCCTTTGGTTCTTGGACTTGAAAAAATCAATCCAAGGTACATCCCATCATTACGAGCAACATTTACCTTTCCCCAAAGTCTCGGGATCTTGGGGGGGAAAGCTGGTGCTTCAACATACATCGTGGGTGTTCAAGATGAAAATGCCTTTTACCTTGATCCACATGAGGTTCAGCAGGTAGTTAATATTGACAAGGATGACCTAGAGGCTGATACTTCCTCTTATCACTGCAATGTCATCCGTCACATCCCGTTAGAATCAATAGATCCTTCTCTAGCCATCGGGTTTTATTGTCGAGACAAAGATGATTTTGATGACTTCTGTTATCGGGCATCAAAGTTAGCAGGCGACTCATACGGAGCTCCATTATTTACAGTTGCTGAAACACATTCCTCAAATTCAGTGAGACACGGCAATGCATTGAATGATGGTAGTAGATTAGTAGTGGACAATGCCGATGTGCACGTGCCAGACGAAGAAGGGGCGCAGGAGGATGACTGGCAATTACTCTGATGAAAAAACAAAGAACACTTTGGGGAATCTGGGAGGAGTTGGTAATCGATTCGTCCTTAATTTCAGCATTAGACTTCGAAGTTCCGTTGAGCTTTCTGGCTAATATGACTGAATAGATGGGAAGGGTATGCCTAATATGTCAATTGTTTTAGTTTATTTAGTGTTACTTGTGCAAAATGCATTAAGCTAATTTAGGTTTTTATTTGTTGAATTTGTTTATTCTTGGTTCATTTTGGGATGAGGTCAGCCTACATTCGTGTGTATGTGTATAATAAGCAGGTTCAATTCCAATTCATAATGAATTATTAATGCTCTGATTACTTAGAGAACTAGGAAAATTATGTATAATGTCTTCTTTTTTTTAGTTCATGAAACAGAAAGGCCTGGA

Coding sequence (CDS)

ATGGGAAGGGGGAAAGATCTAAATTCAACATGCTCTTCTGAAAGTGTAACTGATATTGTAGATAGAAGTCAGCAATCTATTTGTCCCACGTTAGGGTCTAGAAATCATATATCCTCCAAAGCCTCCTTATGGTCAGGCTTCTTTACATCTACTTTTTCAGTCTTTGAACATAACAAGGAGTCCTCTGTCAGTGAAAAGAAGGCAGTTCATTCTCGACACAATGTCTGGACAACTGTAAGAAGAGTTATGACCAGTGGCTCAATGAGGAGAATACAAGAGCGCATACTGGGTTCTCGCAGGAGTGGTGTTTATAGCTCTGGTGGTGATATATGGCTCCTCGGTGTGTGCCATAAAATTTCCCAAGATCAGGCTTCTGATGATGCAGTTACTAGTGACAGTGTAGCAGGATTTGAGCTAGACTTTTCATCTAGAATTCTGATGACCTATCGTAAAGGTTTTAATGGTATTCAAGACTCAAAATACACCAGTGATGTAAATTGGGGTTGCATGCTTAGAAGCAGCCAAATGCTTGTTGCTCAGGCATTACTTTTTCATAGATTAGGAAGATCTTGGAGAAAGACTTCACAGAAGCCACTGGACAAAGAATATGTTGAAATTTTGCATCTTTTTGGTGACTCGGAAACGTCAGCATTTTCGATCCATAATCTTCTTCAGGCAGGAAGGCCCTATGACCTTGCTGCTGGGTCATGGGTGGGACCATATGCCATATGTAGGTCATGGGAGACTCTAGTCCGCTTAAAGAGGGAGACTCCTACTCCCCAAGACCAGCAACTTCCAATGGCCATTTACATTGTTTCTGGAGACGAAGATGGAGAGAGAGGTGGTGCTCCGGTTTTATGCATTGACGTTGCGTCAAGACATTGTTTTCAGTTTTCTAAAGGCCAACTTGATTGGACTCCCATTCTGTTATTAGTTCCTTTGGTTCTTGGACTTGAAAAAATCAATCCAAGGTACATCCCATCATTACGAGCAACATTTACCTTTCCCCAAAGTCTCGGGATCTTGGGGGGGAAAGCTGGTGCTTCAACATACATCGTGGGTGTTCAAGATGAAAATGCCTTTTACCTTGATCCACATGAGGTTCAGCAGGTAGTTAATATTGACAAGGATGACCTAGAGGCTGATACTTCCTCTTATCACTGCAATGTCATCCGTCACATCCCGTTAGAATCAATAGATCCTTCTCTAGCCATCGGGTTTTATTGTCGAGACAAAGATGATTTTGATGACTTCTGTTATCGGGCATCAAAGTTAGCAGGCGACTCATACGGAGCTCCATTATTTACAGTTGCTGAAACACATTCCTCAAATTCAGTGAGACACGGCAATGCATTGAATGATGGTAGTAGATTAGTAGTGGACAATGCCGATGTGCACGTGCCAGACGAAGAAGGGGCGCAGGAGGATGACTGGCAATTACTCTGA
BLAST of CmoCh04G020890 vs. Swiss-Prot
Match: ATG4_MEDTR (Cysteine protease ATG4 OS=Medicago truncatula GN=ATG4 PE=3 SV=1)

HSP 1 Score: 616.7 bits (1589), Expect = 2.2e-175
Identity = 309/473 (65.33%), Postives = 366/473 (77.38%), Query Frame = 1

Query: 11  CSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKAVH 70
           CSS+S T+IVD +Q       GS +    KASLWS FFTS FSV E   ESS SEKK VH
Sbjct: 15  CSSKSSTEIVDNTQVPASSKAGSSDSKFPKASLWSTFFTSGFSVDETYSESSSSEKKTVH 74

Query: 71  SRHNVWTT-VRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDDAV 130
           SR++ W   VR+V++ GSMRR QER+LGS R+ V SS GDIWLLGVCHKISQ +++ D  
Sbjct: 75  SRNSGWAAAVRKVVSGGSMRRFQERVLGSCRTDVSSSDGDIWLLGVCHKISQHESTGDVD 134

Query: 131 TSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLGR 190
             +  A FE DF SRIL+TYRKGF+ I+DSKYTSDVNWGCMLRSSQMLVAQALLFH+LGR
Sbjct: 135 IRNVFAAFEQDFFSRILITYRKGFDAIEDSKYTSDVNWGCMLRSSQMLVAQALLFHKLGR 194

Query: 191 SWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSWET 250
           SWRKT  KP+DKEY++IL LFGDSE +AFSIHNLLQAG+ Y LA GSWVGPYA+CR+WE 
Sbjct: 195 SWRKTVDKPVDKEYIDILQLFGDSEAAAFSIHNLLQAGKGYGLAVGSWVGPYAMCRTWEV 254

Query: 251 LVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTPIL 310
           L R +RE     +Q LPMAIY+VSGDEDGERGGAPV+CI+ A + C +FS+G + WTP+L
Sbjct: 255 LARNQREKNEQGEQLLPMAIYVVSGDEDGERGGAPVVCIEDACKRCLEFSRGLVPWTPLL 314

Query: 311 LLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHEVQ 370
           LLVPLVLGL+K+N RYIP L++TF FPQSLGILGGK GASTYI+GVQ++ AFYLDPHEV+
Sbjct: 315 LLVPLVLGLDKVNLRYIPLLQSTFKFPQSLGILGGKPGASTYIIGVQNDKAFYLDPHEVK 374

Query: 371 QVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAGDS 430
            VVNI  D  E +TSSYHCN+ RH+PL+SIDPSLAIGFYCRDKDDFDDFC RA+KLA +S
Sbjct: 375 PVVNITGDTQEPNTSSYHCNISRHMPLDSIDPSLAIGFYCRDKDDFDDFCSRATKLAEES 434

Query: 431 YGAPLFTVAETHS-SNSVRHGNALNDGSRLVVDNADVHVPDEEGAQEDDWQLL 482
            GAPLFTVA++ S    V   +   D +R   D++       +   EDDWQ L
Sbjct: 435 NGAPLFTVAQSRSLPMQVTSNSVSGDDTRFEEDDSLSMNLVNDAGNEDDWQFL 487

BLAST of CmoCh04G020890 vs. Swiss-Prot
Match: ATG4A_ARATH (Cysteine protease ATG4a OS=Arabidopsis thaliana GN=ATG4A PE=2 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 3.3e-160
Identity = 287/473 (60.68%), Postives = 356/473 (75.26%), Query Frame = 1

Query: 11  CSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKAVH 70
           CSS S +D  D+S   +    G  ++  SK +LWS  FTS+ SV +  +ESS S  K V 
Sbjct: 13  CSSSSKSDTHDKSP--LVSDSGPSDN-KSKFTLWSNVFTSSSSVSQPYRESSTSGHKQVC 72

Query: 71  SRHNVWTT-VRRV-MTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDDA 130
           +  N WT  V+RV M SG++RR QER+LG  R+G+ S+  D+WLLGVC+KIS D+ S + 
Sbjct: 73  TTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGET 132

Query: 131 VTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLG 190
            T   +A  +LDFSS+ILMTYRKGF   +D+ YTSDVNWGCM+RSSQML AQALLFHRLG
Sbjct: 133 DTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLG 192

Query: 191 RSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSWE 250
           R+W K S+ P ++EY+E L  FGDSE SAFSIHNL+ AG  Y LAAGSWVGPYAICR+WE
Sbjct: 193 RAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWE 252

Query: 251 TLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTPI 310
           +L   KR+    ++Q LPMA++IVSG EDGERGGAP+LCI+ A++ C +FSKGQ +WTPI
Sbjct: 253 SLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPI 312

Query: 311 LLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHEV 370
           +LLVPLVLGL+ +NPRYIPSL ATFTFPQS+GILGGK GASTYIVGVQ++  FYLDPHEV
Sbjct: 313 ILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEV 372

Query: 371 QQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAGD 430
           QQVV ++K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFDDFC RA KLA +
Sbjct: 373 QQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEE 432

Query: 431 SYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNADVHVPDEEGAQEDDWQLL 482
           S GAPLFTV +TH+        A+N  +    D+      D E  +EDDWQ+L
Sbjct: 433 SNGAPLFTVTQTHT--------AINQSNYGFADD------DSEDEREDDWQML 467

BLAST of CmoCh04G020890 vs. Swiss-Prot
Match: ATG4B_ARATH (Cysteine protease ATG4b OS=Arabidopsis thaliana GN=ATG4B PE=1 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 8.8e-153
Identity = 270/475 (56.84%), Postives = 342/475 (72.00%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKA 68
           S CSS S ++  D S  +   +  + +   S  +L S    S+  V +  +E+S S    
Sbjct: 11  SKCSSSSTSEKRDISSPTSLVSDSASSDNKSNLTLCSDVVASSSPVSQLCREASTSGHNP 70

Query: 69  VHSRHNVWTTVRRV--MTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASD 128
           V + H+ WT + +   M SG++RR Q+R+LG  R+G+ SS  +IWLLGVC+KIS+ ++S+
Sbjct: 71  VCTTHSSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSE 130

Query: 129 DAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 188
           +A     +A F  DFSS ILMTYR+GF  I D+ YTSDVNWGCMLRS QML AQALLF R
Sbjct: 131 EADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQR 190

Query: 189 LGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRS 248
           LGRSWRK   +P D++Y+EIL LFGD+E SAFSIHNL+ AG  Y LAAGSWVGPYA+CRS
Sbjct: 191 LGRSWRKKDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRS 250

Query: 249 WETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWT 308
           WE+L R  +E    + +   MA++IVSG EDGERGGAP+LCI+  ++ C +FS+G+ +W 
Sbjct: 251 WESLARKNKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWP 310

Query: 309 PILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPH 368
           PILLLVPLVLGL+++NPRYIPSL ATFTFPQSLGILGGK GASTYIVGVQ++  FYLDPH
Sbjct: 311 PILLLVPLVLGLDRVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPH 370

Query: 369 EVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLA 428
           +VQQVV + K++ + DTSSYHCN +R++PLES+DPSLA+GFYC+ KDDFDDFC RA+KLA
Sbjct: 371 DVQQVVTVKKENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLA 430

Query: 429 GDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNADVHVPDEEGAQEDDWQLL 482
           GDS GAPLFTV ++H  N    G A    S          +  EE   EDDWQLL
Sbjct: 431 GDSNGAPLFTVTQSHRRNDC--GIAETSSS----TETSTEISGEE--HEDDWQLL 477

BLAST of CmoCh04G020890 vs. Swiss-Prot
Match: ATG4A_ORYSI (Cysteine protease ATG4A OS=Oryza sativa subsp. indica GN=ATG4A PE=3 SV=1)

HSP 1 Score: 515.4 bits (1326), Expect = 6.8e-145
Identity = 260/447 (58.17%), Postives = 323/447 (72.26%), Query Frame = 1

Query: 39  SKASLWSGFFTSTFSVFEHNKESSVSEKKAVHSRHNVWTT-VRRVMTSGSMRRIQERILG 98
           SK S+ S  F+S FS+FE +++SS       HS    W+  +RR+  +GSM     R LG
Sbjct: 36  SKNSILSCVFSSPFSIFEAHQDSSAHRPLKPHSGSYAWSRFLRRIACTGSM----WRFLG 95

Query: 99  SRRSGVYSSGGDIWLLGVCHKISQDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQ 158
           + ++    +  D+W LG C+K+S ++ S+ +      A F  DFSSRI +TYRKGF+ I 
Sbjct: 96  ASKA---LTSSDVWFLGKCYKLSSEELSNSSDCESGNAAFLEDFSSRIWITYRKGFDAIS 155

Query: 159 DSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSA 218
           DSKYTSDVNWGCM+RSSQMLVAQAL+FH LGRSWRK SQKP   EY+ ILH+FGDSE  A
Sbjct: 156 DSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRKPSQKPYSPEYIGILHMFGDSEACA 215

Query: 219 FSIHNLLQAGRPYDLAAGSWVGPYAICRSWETLVRLKRETPTPQD--QQLPMAIYIVSGD 278
           FSIHNLLQAG+ Y LAAGSWVGPYA+CR+W+TLVR  RE     D     PMA+Y+VSGD
Sbjct: 216 FSIHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLVRTNREHHEAVDGNGNFPMALYVVSGD 275

Query: 279 EDGERGGAPVLCIDVASRHCFQFSKGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTF 338
           EDGERGGAPV+CIDVA++ C  F+KGQ  W+PILLLVPLVLGL+K+NPRYIP L+ TFTF
Sbjct: 276 EDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKLNPRYIPLLKETFTF 335

Query: 339 PQSLGILGGKAGASTYIVGVQDENAFYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIP 398
           PQSLGILGGK G STY+ GVQD+   YLDPHEVQ  V+I  D+LEADTSSYHC+ +R + 
Sbjct: 336 PQSLGILGGKPGTSTYVAGVQDDRVLYLDPHEVQLAVDIAADNLEADTSSYHCSTVRDLA 395

Query: 399 LESIDPSLAIGFYCRDKDDFDDFCYRASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDG 458
           L+ IDPSLAIGFYCRDKDDFDDFC RAS+L   + GAPLFTV ++   +   +    + G
Sbjct: 396 LDLIDPSLAIGFYCRDKDDFDDFCSRASELVDKANGAPLFTVMQSVQPSKQMYNEESSSG 455

Query: 459 SRLVVDNAD-VHVPDEEGAQEDDWQLL 482
             + + N + +    E G  E++WQ+L
Sbjct: 456 DGMDIINVEGLDGSGETG--EEEWQIL 473

BLAST of CmoCh04G020890 vs. Swiss-Prot
Match: ATG4B_ORYSI (Cysteine protease ATG4B OS=Oryza sativa subsp. indica GN=ATG4B PE=1 SV=2)

HSP 1 Score: 513.5 bits (1321), Expect = 2.6e-144
Identity = 260/448 (58.04%), Postives = 324/448 (72.32%), Query Frame = 1

Query: 39  SKASLWSGFFTSTFSVFEHNKESSVSEKKAVHSRHNVWTTV-RRVMTSGSMRRIQERILG 98
           SK S+ S  F S F++FE +++SS ++     S    W  V RR++ SGSM R     LG
Sbjct: 40  SKTSILSCVFNSPFNIFEAHQDSSANKSPKSSSGSYDWLRVLRRIVCSGSMWRF----LG 99

Query: 99  SRRSGVYSSGGDIWLLGVCHKISQDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQ 158
           + +     +  D+W LG C+K+S +++S D+ +    A F  DFSSRI +TYR+GF+ I 
Sbjct: 100 TSK---VLTSSDVWFLGKCYKLSSEESSSDSDSESGHATFLEDFSSRIWITYRRGFDAIS 159

Query: 159 DSKYTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSA 218
           DSKYTSDVNWGCM+RSSQMLVAQAL+FH LGRSWR+ S+KP + EY+ ILH+FGDSE  A
Sbjct: 160 DSKYTSDVNWGCMVRSSQMLVAQALIFHHLGRSWRRPSEKPYNPEYIGILHMFGDSEACA 219

Query: 219 FSIHNLLQAGRPYDLAAGSWVGPYAICRSWETLVRLKRETPTPQD--QQLPMAIYIVSGD 278
           FSIHNLLQAG  Y LAAGSWVGPYA+CR+W+TLVR  RE     D  +  PMA+Y+VSGD
Sbjct: 220 FSIHNLLQAGNSYGLAAGSWVGPYAMCRAWQTLVRTNREQHEVVDGNESFPMALYVVSGD 279

Query: 279 EDGERGGAPVLCIDVASRHCFQFSKGQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTF 338
           EDGERGGAPV+CIDVA++ C  F+KGQ  W+PILLLVPLVLGL+KINPRYIP L+ TFTF
Sbjct: 280 EDGERGGAPVVCIDVAAQLCCDFNKGQSTWSPILLLVPLVLGLDKINPRYIPLLKETFTF 339

Query: 339 PQSLGILGGKAGASTYIVGVQDENAFYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIP 398
           PQSLGILGGK G STYI GVQD+ A YLDPHEVQ  V+I  D++EADTSSYHC+ +R + 
Sbjct: 340 PQSLGILGGKPGTSTYIAGVQDDRALYLDPHEVQMAVDIAADNIEADTSSYHCSTVRDLA 399

Query: 399 LESIDPSLAIGFYCRDKDDFDDFCYRASKLAGDSYGAPLFTVAET--HSSNSVRHGNALN 458
           L+ IDPSLAIGFYCRDKDDFDDFC RA++L   + GAPLFTV ++   S       + L 
Sbjct: 400 LDLIDPSLAIGFYCRDKDDFDDFCSRATELVDKANGAPLFTVVQSVQPSKQMYNQDDVLG 459

Query: 459 DGSRLVVDNADVHVPDEEGAQEDDWQLL 482
                 ++  D+    E G  E++WQ+L
Sbjct: 460 ISGDGNINVEDLDASGETG--EEEWQIL 478

BLAST of CmoCh04G020890 vs. TrEMBL
Match: A0A0A0LK81_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G128640 PE=3 SV=1)

HSP 1 Score: 833.9 bits (2153), Expect = 9.6e-239
Identity = 409/483 (84.68%), Postives = 436/483 (90.27%), Query Frame = 1

Query: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60
           MGRGKDL STCS E   D +DR+ +S+ P LGS+NHISSKAS WSGFF+S FS+FEH+K+
Sbjct: 1   MGRGKDLKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKD 60

Query: 61  SSVSEKKAVHSRHNVWTTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKIS 120
           SSV+EKK  H RHNVW TVR+VMTSGSMRRIQER+LGSRRSGVYSSGGDIWLLGVCHKIS
Sbjct: 61  SSVTEKKVFHPRHNVWATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKIS 120

Query: 121 QDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQ 180
           QD   DDA +S  VAG+E DFSSRILMTYRKGF+ IQDSKYTSDVNWGCMLRSSQMLVAQ
Sbjct: 121 QDHPPDDAASSPGVAGYEQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQ 180

Query: 181 ALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGP 240
           ALLFHRLGRSWRK SQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGR YDLAAGSWVGP
Sbjct: 181 ALLFHRLGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGP 240

Query: 241 YAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSK 300
           YA+CRSWETLVR KRETP  QDQQLPMAIYIVSGDEDGERGGAPVL ID ASRHCF+FSK
Sbjct: 241 YAMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSK 300

Query: 301 GQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENA 360
           GQ DW+PILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYIVGVQDENA
Sbjct: 301 GQHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENA 360

Query: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCY 420
           FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FC+
Sbjct: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCH 420

Query: 421 RASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNAD--VHVPDEEGAQEDDW 480
           RASKLA +S GAPLFTVAETHS+N  R  +ALND SRLV D+ D  VH+P+EE + EDDW
Sbjct: 421 RASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDDW 480

Query: 481 QLL 482
           Q L
Sbjct: 481 QFL 483

BLAST of CmoCh04G020890 vs. TrEMBL
Match: M5XJB4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004885mg PE=3 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 1.2e-193
Identity = 339/478 (70.92%), Postives = 392/478 (82.01%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKA 68
           S  SS+S T+  DR   S+C   GSR+    KASLWS FF S FS+FE + ESS++EKK 
Sbjct: 11  SKYSSKSSTESTDRGPSSVCSDSGSRDSKHDKASLWSNFFASAFSIFETHSESSITEKKE 70

Query: 69  VHSRHNVWT-TVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDD 128
           +HSR+N WT  VR+V+T GSMRRI ER+LGS R+G+ SS  DIWLLGV +K+SQD++S D
Sbjct: 71  IHSRNNGWTEAVRKVVTGGSMRRIHERVLGSSRTGI-SSASDIWLLGVLYKVSQDESSGD 130

Query: 129 AVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 188
           A T++ +  FE DFSSRILMTYRKGF+ I DSKYTSDVNWGCMLRSSQMLVAQALLFHRL
Sbjct: 131 AATNNGLRAFEQDFSSRILMTYRKGFDAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 190

Query: 189 GRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSW 248
           GRSWR+T  KPLD++Y+EILH FGDSE SAFSIHNLLQAG+ YDLAAGSWVGPYA+CRSW
Sbjct: 191 GRSWRRTLHKPLDEQYIEILHHFGDSEGSAFSIHNLLQAGKAYDLAAGSWVGPYAMCRSW 250

Query: 249 ETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTP 308
           ETLVR KRE     +Q LPMA+YIVSGDEDGERGGAPV+CI  ASRHC +FS+G++DWTP
Sbjct: 251 ETLVRCKREGTAFDNQPLPMAVYIVSGDEDGERGGAPVVCIQDASRHCLEFSRGRVDWTP 310

Query: 309 ILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHE 368
           ILLLVPLVLGLEK+NPRYIPSL ATFTFPQSLGI+GGK GASTYI+GVQDE A YLDPHE
Sbjct: 311 ILLLVPLVLGLEKVNPRYIPSLWATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHE 370

Query: 369 VQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAG 428
           VQ  +NI +DDLEADT SYHCNVIRHIPL+SIDPSLAIGFYCRD+DDFDDFC+RASKLA 
Sbjct: 371 VQPAINIRRDDLEADTLSYHCNVIRHIPLDSIDPSLAIGFYCRDRDDFDDFCFRASKLAD 430

Query: 429 DSYGAPLFTVAETHS-SNSVRHGNALNDGSRLVVDNADVHVP--DEEG-AQEDDWQLL 482
            S GAPLFTV ++H+    V H + L+D   +  D++ V  P  D +G A EDDWQLL
Sbjct: 431 GSNGAPLFTVTQSHNFPKPVNHSDVLDDSGGVQNDDSFVAPPISDADGSAHEDDWQLL 487

BLAST of CmoCh04G020890 vs. TrEMBL
Match: A0A067DMT5_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011418mg PE=3 SV=1)

HSP 1 Score: 644.8 bits (1662), Expect = 8.2e-182
Identity = 330/477 (69.18%), Postives = 375/477 (78.62%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKA 68
           S C S+S  D  +RS  S+   LGS    SSK SL S  F S FSVFE   ESS SEKKA
Sbjct: 11  SKCFSKSTPDTPNRSLASVGSELGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKA 70

Query: 69  VHSRHNVWTT-VRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDD 128
           VH++ N WT  V+R++T+GSMRRI ER+LG  R+G+ SS  DIWLLGVCHKI+QD+A  D
Sbjct: 71  VHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGD 130

Query: 129 AVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 188
           A  ++ +A F  DFSSRIL++YRKGF+ I DSK TSDV WGCMLRSSQMLVAQALLFHRL
Sbjct: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190

Query: 189 GRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSW 248
           GR WRK  QKP D+EYVEILHLFGDSETS FSIHNLLQAG+ Y LAAGSWVGPYA+CRSW
Sbjct: 191 GRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250

Query: 249 ETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTP 308
           E L R +R       Q LPMAIY+VSGDEDGERGGAPV+CID ASRHC  FSKGQ DWTP
Sbjct: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 310

Query: 309 ILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHE 368
           ILLLVPLVLGLEK+NPRYIP+LR TFTFPQSLGI+GGK GASTYIVGVQ+E+A YLDPH+
Sbjct: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370

Query: 369 VQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAG 428
           VQ V+NI KDDLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDKDDFDDFC RASKLA 
Sbjct: 371 VQPVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430

Query: 429 DSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNA--DVHVPDEEG-AQEDDWQLL 482
           +S GAPLFTV +TH    V H + L +   +  D++   + + D  G A EDDWQLL
Sbjct: 431 ESNGAPLFTVTQTH-KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486

BLAST of CmoCh04G020890 vs. TrEMBL
Match: V4T4S5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019906mg PE=3 SV=1)

HSP 1 Score: 641.7 bits (1654), Expect = 7.0e-181
Identity = 329/477 (68.97%), Postives = 374/477 (78.41%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKA 68
           S C S+S  D  +RS  S+    GS    SSK SL S  F S FSVFE   ESS SEKKA
Sbjct: 11  SKCFSKSTPDTPNRSLASVGSEPGSSESKSSKGSLLSSLFNSAFSVFETYSESSASEKKA 70

Query: 69  VHSRHNVWTT-VRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDD 128
           VH++ N WT  V+R++T+GSMRRI ER+LG  R+G+ SS  DIWLLGVCHKI+QD+A  D
Sbjct: 71  VHNKSNGWTAAVKRLVTAGSMRRIHERVLGPSRTGISSSTSDIWLLGVCHKIAQDEALGD 130

Query: 129 AVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 188
           A  ++ +A F  DFSSRIL++YRKGF+ I DSK TSDV WGCMLRSSQMLVAQALLFHRL
Sbjct: 131 AAGNNGLAEFNQDFSSRILISYRKGFDPIGDSKITSDVGWGCMLRSSQMLVAQALLFHRL 190

Query: 189 GRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSW 248
           GR WRK  QKP D+EYVEILHLFGDSETS FSIHNLLQAG+ Y LAAGSWVGPYA+CRSW
Sbjct: 191 GRPWRKPLQKPFDREYVEILHLFGDSETSPFSIHNLLQAGKAYGLAAGSWVGPYAMCRSW 250

Query: 249 ETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTP 308
           E L R +R       Q LPMAIY+VSGDEDGERGGAPV+CID ASRHC  FSKGQ DWTP
Sbjct: 251 EALARCQRAETGLGCQSLPMAIYVVSGDEDGERGGAPVVCIDDASRHCSVFSKGQADWTP 310

Query: 309 ILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHE 368
           ILLLVPLVLGLEK+NPRYIP+LR TFTFPQSLGI+GGK GASTYIVGVQ+E+A YLDPH+
Sbjct: 311 ILLLVPLVLGLEKVNPRYIPTLRLTFTFPQSLGIVGGKPGASTYIVGVQEESAIYLDPHD 370

Query: 369 VQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAG 428
           VQ V+NI KDDLEADTS+YH +VIRHI L+SIDPSLAIGFYCRDKDDFDDFC RASKLA 
Sbjct: 371 VQLVINIGKDDLEADTSTYHSDVIRHIHLDSIDPSLAIGFYCRDKDDFDDFCARASKLAE 430

Query: 429 DSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNA--DVHVPDEEG-AQEDDWQLL 482
           +S GAPLFTV +TH    V H + L +   +  D++   + + D  G A EDDWQLL
Sbjct: 431 ESNGAPLFTVTQTH-KKPVNHSDVLGETGGVPEDDSLGVMSMNDAVGNAHEDDWQLL 486

BLAST of CmoCh04G020890 vs. TrEMBL
Match: B9STA6_RICCO (Cysteine protease ATG4B, putative OS=Ricinus communis GN=RCOM_0365260 PE=3 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 5.9e-180
Identity = 324/483 (67.08%), Postives = 384/483 (79.50%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESS-VSEKK 68
           S CSS+   D  +RS  S C  L S ++ S+K SLWS FF S FSVFE  +ES   SEKK
Sbjct: 10  SRCSSKCPVDTPNRSLTSDC--LESGSNFSTKGSLWSSFFASAFSVFETYRESPPASEKK 69

Query: 69  AVHSRHNVWTT-VRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASD 128
             HSRHN WT+ V+++++ GSMRRI ER+LG  R+G+ S+  DIWLLGVC+KIS+D+ S 
Sbjct: 70  GSHSRHNGWTSAVKKIVSGGSMRRIHERVLGPSRTGISSTTSDIWLLGVCYKISEDE-SG 129

Query: 129 DAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 188
           +A T +++A F  D+SSRILMTYR+GF+ I DSKY SDV WGCMLRSSQMLVAQALLFH+
Sbjct: 130 NADTGNALAEFTHDYSSRILMTYRRGFDAIGDSKYISDVGWGCMLRSSQMLVAQALLFHK 189

Query: 189 LGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRS 248
           LGR+W K  QKP+D+ YVEILHLFGDSE + FSIHNL+QAG+ Y LAAGSWVGPYA+CRS
Sbjct: 190 LGRAWTKPFQKPMDQAYVEILHLFGDSEAAPFSIHNLIQAGKAYSLAAGSWVGPYAMCRS 249

Query: 249 WETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWT 308
           WE+L R KRE  + + Q LPMA+Y+VSGDEDGERGGAPV+ I+ ASRHC +FS+GQ DWT
Sbjct: 250 WESLARSKREENSLEYQSLPMAVYVVSGDEDGERGGAPVVYIEDASRHCLEFSRGQADWT 309

Query: 309 PILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPH 368
           PILLLVPLVLGL+K+NPRYIPSL+ATFTF QSLGI+GGK GASTYIVGVQD+NAFYLDPH
Sbjct: 310 PILLLVPLVLGLDKVNPRYIPSLQATFTFSQSLGIMGGKPGASTYIVGVQDDNAFYLDPH 369

Query: 369 EVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLA 428
           EVQ VVNI +DD+EADTSSYH +++RHIPL SIDPSLAIGFYCRDKDDFD+FC  ASKLA
Sbjct: 370 EVQSVVNIGRDDIEADTSSYHSDIVRHIPLHSIDPSLAIGFYCRDKDDFDEFCLLASKLA 429

Query: 429 GDSYGAPLFTVAETHS-SNSVRHGNALNDGSRLVVDNADVHV-----PDEE--GAQEDDW 482
            DS GAPLFTVA  H     V HG+ LN+    V ++  V+V      D E  GAQED+W
Sbjct: 430 DDSQGAPLFTVAHCHKLPKPVSHGDMLNNEDDEVQEDDSVNVMMPVNDDAEGGGAQEDEW 489

BLAST of CmoCh04G020890 vs. TAIR10
Match: AT2G44140.1 (AT2G44140.1 Peptidase family C54 protein)

HSP 1 Score: 566.2 bits (1458), Expect = 1.9e-161
Identity = 287/473 (60.68%), Postives = 356/473 (75.26%), Query Frame = 1

Query: 11  CSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKAVH 70
           CSS S +D  D+S   +    G  ++  SK +LWS  FTS+ SV +  +ESS S  K V 
Sbjct: 13  CSSSSKSDTHDKSP--LVSDSGPSDN-KSKFTLWSNVFTSSSSVSQPYRESSTSGHKQVC 72

Query: 71  SRHNVWTT-VRRV-MTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDDA 130
           +  N WT  V+RV M SG++RR QER+LG  R+G+ S+  D+WLLGVC+KIS D+ S + 
Sbjct: 73  TTRNGWTAFVKRVSMASGAIRRFQERVLGPNRTGLPSTTSDVWLLGVCYKISADENSGET 132

Query: 131 VTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRLG 190
            T   +A  +LDFSS+ILMTYRKGF   +D+ YTSDVNWGCM+RSSQML AQALLFHRLG
Sbjct: 133 DTGTVLAALQLDFSSKILMTYRKGFEPFRDTTYTSDVNWGCMIRSSQMLFAQALLFHRLG 192

Query: 191 RSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSWE 250
           R+W K S+ P ++EY+E L  FGDSE SAFSIHNL+ AG  Y LAAGSWVGPYAICR+WE
Sbjct: 193 RAWTKKSELP-EQEYLETLEPFGDSEPSAFSIHNLIIAGASYGLAAGSWVGPYAICRAWE 252

Query: 251 TLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTPI 310
           +L   KR+    ++Q LPMA++IVSG EDGERGGAP+LCI+ A++ C +FSKGQ +WTPI
Sbjct: 253 SLACKKRKQTDSKNQTLPMAVHIVSGSEDGERGGAPILCIEDATKSCLEFSKGQSEWTPI 312

Query: 311 LLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHEV 370
           +LLVPLVLGL+ +NPRYIPSL ATFTFPQS+GILGGK GASTYIVGVQ++  FYLDPHEV
Sbjct: 313 ILLVPLVLGLDSVNPRYIPSLVATFTFPQSVGILGGKPGASTYIVGVQEDKGFYLDPHEV 372

Query: 371 QQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAGD 430
           QQVV ++K+  + DTSSYHCNV+R++PLES+DPSLA+GFYCRDKDDFDDFC RA KLA +
Sbjct: 373 QQVVTVNKETPDVDTSSYHCNVLRYVPLESLDPSLALGFYCRDKDDFDDFCLRALKLAEE 432

Query: 431 SYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNADVHVPDEEGAQEDDWQLL 482
           S GAPLFTV +TH+        A+N  +    D+      D E  +EDDWQ+L
Sbjct: 433 SNGAPLFTVTQTHT--------AINQSNYGFADD------DSEDEREDDWQML 467

BLAST of CmoCh04G020890 vs. TAIR10
Match: AT3G59950.1 (AT3G59950.1 Peptidase family C54 protein)

HSP 1 Score: 541.6 bits (1394), Expect = 5.0e-154
Identity = 270/475 (56.84%), Postives = 342/475 (72.00%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKA 68
           S CSS S ++  D S  +   +  + +   S  +L S    S+  V +  +E+S S    
Sbjct: 11  SKCSSSSTSEKRDISSPTSLVSDSASSDNKSNLTLCSDVVASSSPVSQLCREASTSGHNP 70

Query: 69  VHSRHNVWTTVRRV--MTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASD 128
           V + H+ WT + +   M SG++RR Q+R+LG  R+G+ SS  +IWLLGVC+KIS+ ++S+
Sbjct: 71  VCTTHSSWTVILKTASMASGAIRRFQDRVLGPSRTGISSSTSEIWLLGVCYKISEGESSE 130

Query: 129 DAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHR 188
           +A     +A F  DFSS ILMTYR+GF  I D+ YTSDVNWGCMLRS QML AQALLF R
Sbjct: 131 EADAGRVLAAFRQDFSSLILMTYRRGFEPIGDTTYTSDVNWGCMLRSGQMLFAQALLFQR 190

Query: 189 LGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRS 248
           LGRSWRK   +P D++Y+EIL LFGD+E SAFSIHNL+ AG  Y LAAGSWVGPYA+CRS
Sbjct: 191 LGRSWRKKDSEPADEKYLEILELFGDTEASAFSIHNLILAGESYGLAAGSWVGPYAVCRS 250

Query: 249 WETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWT 308
           WE+L R  +E    + +   MA++IVSG EDGERGGAP+LCI+  ++ C +FS+G+ +W 
Sbjct: 251 WESLARKNKEETDDKHKSFSMAVHIVSGSEDGERGGAPILCIEDVTKTCLEFSEGETEWP 310

Query: 309 PILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPH 368
           PILLLVPLVLGL+++NPRYIPSL ATFTFPQSLGILGGK GASTYIVGVQ++  FYLDPH
Sbjct: 311 PILLLVPLVLGLDRVNPRYIPSLIATFTFPQSLGILGGKPGASTYIVGVQEDKGFYLDPH 370

Query: 369 EVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLA 428
           +VQQVV + K++ + DTSSYHCN +R++PLES+DPSLA+GFYC+ KDDFDDFC RA+KLA
Sbjct: 371 DVQQVVTVKKENQDVDTSSYHCNTLRYVPLESLDPSLALGFYCQHKDDFDDFCIRATKLA 430

Query: 429 GDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNADVHVPDEEGAQEDDWQLL 482
           GDS GAPLFTV ++H  N    G A    S          +  EE   EDDWQLL
Sbjct: 431 GDSNGAPLFTVTQSHRRNDC--GIAETSSS----TETSTEISGEE--HEDDWQLL 477

BLAST of CmoCh04G020890 vs. NCBI nr
Match: gi|659082126|ref|XP_008441684.1| (PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis melo])

HSP 1 Score: 847.0 bits (2187), Expect = 1.6e-242
Identity = 415/483 (85.92%), Postives = 441/483 (91.30%), Query Frame = 1

Query: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60
           MGRGKDL STCSSE+  D++DR+ +S+C  LGS+NHISSKASLWSGFF+S FS+ +H+K+
Sbjct: 1   MGRGKDLKSTCSSETTADVIDRTHRSVCSELGSKNHISSKASLWSGFFSSNFSICDHHKD 60

Query: 61  SSVSEKKAVHSRHNVWTTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKIS 120
           SSVSEKK  HSRHNVW TVR+VMTSGSMRRIQERILGSRRSGVY+SGGDIWLLGVCHKIS
Sbjct: 61  SSVSEKKVFHSRHNVWATVRKVMTSGSMRRIQERILGSRRSGVYTSGGDIWLLGVCHKIS 120

Query: 121 QDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQ 180
           QD   DDA +S  VAGFE DFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLV+Q
Sbjct: 121 QDHLPDDAASSTGVAGFEQDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVSQ 180

Query: 181 ALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGP 240
           ALLFHRLGRSWRK SQKP DKEYVEILHLFGDSETSAFSIHNLLQAGR YDLAAGSWVGP
Sbjct: 181 ALLFHRLGRSWRKPSQKPFDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGP 240

Query: 241 YAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSK 300
           YA+CRSWETLVR KRETP  QDQQLPMAIYIVSGDEDGERGGAPVL ID ASRHCF+FSK
Sbjct: 241 YAMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSK 300

Query: 301 GQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENA 360
           GQ DW+PILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYIVGVQDENA
Sbjct: 301 GQHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENA 360

Query: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCY 420
           FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FCY
Sbjct: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCY 420

Query: 421 RASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNAD--VHVPDEEGAQEDDW 480
           RASKLA +S GAPLFTVAETHS+NS R  +ALND SRLV D+AD  VH+P+EE A EDDW
Sbjct: 421 RASKLAEESDGAPLFTVAETHSTNSGRQSSALNDHSRLVEDDADGAVHMPNEEEAHEDDW 480

Query: 481 QLL 482
           Q L
Sbjct: 481 QFL 483

BLAST of CmoCh04G020890 vs. NCBI nr
Match: gi|778668355|ref|XP_011649086.1| (PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis sativus])

HSP 1 Score: 835.9 bits (2158), Expect = 3.6e-239
Identity = 410/483 (84.89%), Postives = 436/483 (90.27%), Query Frame = 1

Query: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60
           MGRGKDL STCS E   D +DR+ +S+ P LGS+NHISSKAS WSGFF+S FS+FEH+K+
Sbjct: 1   MGRGKDLKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKD 60

Query: 61  SSVSEKKAVHSRHNVWTTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKIS 120
           SSV+EKK  H RHNVW TVR+VMTSGSMRRIQER+LGSRRSGVYSSGGDIWLLGVCHKIS
Sbjct: 61  SSVTEKKVFHPRHNVWATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKIS 120

Query: 121 QDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQ 180
           QD   DDA +S  VAG+E DFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLVAQ
Sbjct: 121 QDHPPDDAASSPGVAGYEQDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVAQ 180

Query: 181 ALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGP 240
           ALLFHRLGRSWRK SQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGR YDLAAGSWVGP
Sbjct: 181 ALLFHRLGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGP 240

Query: 241 YAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSK 300
           YA+CRSWETLVR KRETP  QDQQLPMAIYIVSGDEDGERGGAPVL ID ASRHCF+FSK
Sbjct: 241 YAMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSK 300

Query: 301 GQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENA 360
           GQ DW+PILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYIVGVQDENA
Sbjct: 301 GQHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENA 360

Query: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCY 420
           FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FC+
Sbjct: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCH 420

Query: 421 RASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNAD--VHVPDEEGAQEDDW 480
           RASKLA +S GAPLFTVAETHS+N  R  +ALND SRLV D+ D  VH+P+EE + EDDW
Sbjct: 421 RASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDDW 480

Query: 481 QLL 482
           Q L
Sbjct: 481 QFL 483

BLAST of CmoCh04G020890 vs. NCBI nr
Match: gi|449442361|ref|XP_004138950.1| (PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis sativus])

HSP 1 Score: 833.9 bits (2153), Expect = 1.4e-238
Identity = 409/483 (84.68%), Postives = 436/483 (90.27%), Query Frame = 1

Query: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60
           MGRGKDL STCS E   D +DR+ +S+ P LGS+NHISSKAS WSGFF+S FS+FEH+K+
Sbjct: 1   MGRGKDLKSTCSPEPAADAIDRTHRSVYPELGSKNHISSKASSWSGFFSSNFSIFEHHKD 60

Query: 61  SSVSEKKAVHSRHNVWTTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKIS 120
           SSV+EKK  H RHNVW TVR+VMTSGSMRRIQER+LGSRRSGVYSSGGDIWLLGVCHKIS
Sbjct: 61  SSVTEKKVFHPRHNVWATVRKVMTSGSMRRIQERLLGSRRSGVYSSGGDIWLLGVCHKIS 120

Query: 121 QDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQ 180
           QD   DDA +S  VAG+E DFSSRILMTYRKGF+ IQDSKYTSDVNWGCMLRSSQMLVAQ
Sbjct: 121 QDHPPDDAASSPGVAGYEQDFSSRILMTYRKGFHVIQDSKYTSDVNWGCMLRSSQMLVAQ 180

Query: 181 ALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGP 240
           ALLFHRLGRSWRK SQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGR YDLAAGSWVGP
Sbjct: 181 ALLFHRLGRSWRKPSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRAYDLAAGSWVGP 240

Query: 241 YAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSK 300
           YA+CRSWETLVR KRETP  QDQQLPMAIYIVSGDEDGERGGAPVL ID ASRHCF+FSK
Sbjct: 241 YAMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSK 300

Query: 301 GQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENA 360
           GQ DW+PILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYIVGVQDENA
Sbjct: 301 GQHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENA 360

Query: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCY 420
           FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FC+
Sbjct: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCH 420

Query: 421 RASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNAD--VHVPDEEGAQEDDW 480
           RASKLA +S GAPLFTVAETHS+N  R  +ALND SRLV D+ D  VH+P+EE + EDDW
Sbjct: 421 RASKLAEESDGAPLFTVAETHSTNPGRQSSALNDHSRLVEDDGDGVVHMPNEEESHEDDW 480

Query: 481 QLL 482
           Q L
Sbjct: 481 QFL 483

BLAST of CmoCh04G020890 vs. NCBI nr
Match: gi|659082128|ref|XP_008441686.1| (PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis melo])

HSP 1 Score: 776.5 bits (2004), Expect = 2.6e-221
Identity = 388/483 (80.33%), Postives = 414/483 (85.71%), Query Frame = 1

Query: 1   MGRGKDLNSTCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKE 60
           MGRGKDL STCSSE+  D++DR+ +S+C  LGS+NHISSKASLWSGFF+S FS+ +H+K+
Sbjct: 1   MGRGKDLKSTCSSETTADVIDRTHRSVCSELGSKNHISSKASLWSGFFSSNFSICDHHKD 60

Query: 61  SSVSEKKAVHSRHNVWTTVRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKIS 120
           SSVSEKK  HSRHNVW TVR+VMTSGSMRRIQERILGSRRSGVY+SGGDIWLLGVCHKIS
Sbjct: 61  SSVSEKKVFHSRHNVWATVRKVMTSGSMRRIQERILGSRRSGVYTSGGDIWLLGVCHKIS 120

Query: 121 QDQASDDAVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQ 180
           QD   DDA +S  VAGFE DFSSRILMTYRKGFN IQDSKYTSDVNWGCMLRSSQMLV+Q
Sbjct: 121 QDHLPDDAASSTGVAGFEQDFSSRILMTYRKGFNVIQDSKYTSDVNWGCMLRSSQMLVSQ 180

Query: 181 ALLFHRLGRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGP 240
           ALLFHRLGRSWRK SQK                            AGR YDLAAGSWVGP
Sbjct: 181 ALLFHRLGRSWRKPSQK----------------------------AGRAYDLAAGSWVGP 240

Query: 241 YAICRSWETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSK 300
           YA+CRSWETLVR KRETP  QDQQLPMAIYIVSGDEDGERGGAPVL ID ASRHCF+FSK
Sbjct: 241 YAMCRSWETLVRSKRETPILQDQQLPMAIYIVSGDEDGERGGAPVLYIDDASRHCFEFSK 300

Query: 301 GQLDWTPILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENA 360
           GQ DW+PILLLVPLVLGLEKINPRYIPSLR TFTFPQSLGILGGK GASTYIVGVQDENA
Sbjct: 301 GQHDWSPILLLVPLVLGLEKINPRYIPSLRTTFTFPQSLGILGGKPGASTYIVGVQDENA 360

Query: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCY 420
           FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFD+FCY
Sbjct: 361 FYLDPHEVQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDNFCY 420

Query: 421 RASKLAGDSYGAPLFTVAETHSSNSVRHGNALNDGSRLVVDNAD--VHVPDEEGAQEDDW 480
           RASKLA +S GAPLFTVAETHS+NS R  +ALND SRLV D+AD  VH+P+EE A EDDW
Sbjct: 421 RASKLAEESDGAPLFTVAETHSTNSGRQSSALNDHSRLVEDDADGAVHMPNEEEAHEDDW 455

Query: 481 QLL 482
           Q L
Sbjct: 481 QFL 455

BLAST of CmoCh04G020890 vs. NCBI nr
Match: gi|694327603|ref|XP_009354662.1| (PREDICTED: cysteine protease ATG4-like [Pyrus x bretschneideri])

HSP 1 Score: 686.4 bits (1770), Expect = 3.5e-194
Identity = 340/478 (71.13%), Postives = 392/478 (82.01%), Query Frame = 1

Query: 9   STCSSESVTDIVDRSQQSICPTLGSRNHISSKASLWSGFFTSTFSVFEHNKESSVSEKKA 68
           S  SS+S TD  DR   S C   GSR+   +KASLW+ FF S FS+FE + ESS++EKK 
Sbjct: 11  SKYSSKSSTDSTDRGSSSACSDSGSRDSKHNKASLWTNFFASAFSIFETHSESSITEKKE 70

Query: 69  VHSRHNVWTT-VRRVMTSGSMRRIQERILGSRRSGVYSSGGDIWLLGVCHKISQDQASDD 128
            HSR+N WT  VR+V+TSGSMRRI ER+LGS R+G+ SS  DIWLLGVC+K+SQD +S D
Sbjct: 71  SHSRNNGWTAAVRKVVTSGSMRRIHERVLGSSRTGI-SSASDIWLLGVCYKVSQDDSSGD 130

Query: 129 AVTSDSVAGFELDFSSRILMTYRKGFNGIQDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 188
           A  ++ +  FE DFSS+ILMTYRKGF  I DSKYTSDVNWGCMLRSSQMLVAQALLFHRL
Sbjct: 131 APINNGLGAFEQDFSSKILMTYRKGFEAIGDSKYTSDVNWGCMLRSSQMLVAQALLFHRL 190

Query: 189 GRSWRKTSQKPLDKEYVEILHLFGDSETSAFSIHNLLQAGRPYDLAAGSWVGPYAICRSW 248
           GRSWR+   KPLD+ Y+EIL+ FGDSETS FSIHNLLQAG+ YDLAAGSWVGPYA+CR+W
Sbjct: 191 GRSWRRPLHKPLDEAYIEILYHFGDSETSTFSIHNLLQAGKAYDLAAGSWVGPYAMCRTW 250

Query: 249 ETLVRLKRETPTPQDQQLPMAIYIVSGDEDGERGGAPVLCIDVASRHCFQFSKGQLDWTP 308
           ETLVR +RE     DQ LPMA+YIVSGDEDGERGGAPV+CI+ ASRHC +FS+GQ+DWTP
Sbjct: 251 ETLVRCRREVTDLDDQPLPMAVYIVSGDEDGERGGAPVVCIEDASRHCLEFSRGQVDWTP 310

Query: 309 ILLLVPLVLGLEKINPRYIPSLRATFTFPQSLGILGGKAGASTYIVGVQDENAFYLDPHE 368
           ILLLVPLVLGLEK+NPRYIPSLRATFTFPQSLGI+GGK GASTYI+GVQDE A YLDPHE
Sbjct: 311 ILLLVPLVLGLEKVNPRYIPSLRATFTFPQSLGIMGGKPGASTYIIGVQDEKALYLDPHE 370

Query: 369 VQQVVNIDKDDLEADTSSYHCNVIRHIPLESIDPSLAIGFYCRDKDDFDDFCYRASKLAG 428
           VQ V+NI +DDLEADT SYHCNVIRHIPL+ IDPSLAIGFYCRD+DDF+DFC+RASKLA 
Sbjct: 371 VQPVINIRRDDLEADTLSYHCNVIRHIPLDLIDPSLAIGFYCRDRDDFNDFCFRASKLAD 430

Query: 429 DSYGAPLFTVAETHS-SNSVRHGNALNDGSRLVVDNADVHVP--DEEG-AQEDDWQLL 482
           +S GAPLFTV +THS    V H +AL D   +  D++   +P  D +G AQEDDWQLL
Sbjct: 431 ESNGAPLFTVTQTHSFPRPVNHSDALGDSGAVENDDSFSVLPMSDADGSAQEDDWQLL 487

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ATG4_MEDTR2.2e-17565.33Cysteine protease ATG4 OS=Medicago truncatula GN=ATG4 PE=3 SV=1[more]
ATG4A_ARATH3.3e-16060.68Cysteine protease ATG4a OS=Arabidopsis thaliana GN=ATG4A PE=2 SV=1[more]
ATG4B_ARATH8.8e-15356.84Cysteine protease ATG4b OS=Arabidopsis thaliana GN=ATG4B PE=1 SV=1[more]
ATG4A_ORYSI6.8e-14558.17Cysteine protease ATG4A OS=Oryza sativa subsp. indica GN=ATG4A PE=3 SV=1[more]
ATG4B_ORYSI2.6e-14458.04Cysteine protease ATG4B OS=Oryza sativa subsp. indica GN=ATG4B PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A0A0LK81_CUCSA9.6e-23984.68Uncharacterized protein OS=Cucumis sativus GN=Csa_2G128640 PE=3 SV=1[more]
M5XJB4_PRUPE1.2e-19370.92Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004885mg PE=3 SV=1[more]
A0A067DMT5_CITSI8.2e-18269.18Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011418mg PE=3 SV=1[more]
V4T4S5_9ROSI7.0e-18168.97Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019906mg PE=3 SV=1[more]
B9STA6_RICCO5.9e-18067.08Cysteine protease ATG4B, putative OS=Ricinus communis GN=RCOM_0365260 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G44140.11.9e-16160.68 Peptidase family C54 protein[more]
AT3G59950.15.0e-15456.84 Peptidase family C54 protein[more]
Match NameE-valueIdentityDescription
gi|659082126|ref|XP_008441684.1|1.6e-24285.92PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis melo][more]
gi|778668355|ref|XP_011649086.1|3.6e-23984.89PREDICTED: cysteine protease ATG4-like isoform X1 [Cucumis sativus][more]
gi|449442361|ref|XP_004138950.1|1.4e-23884.68PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis sativus][more]
gi|659082128|ref|XP_008441686.1|2.6e-22180.33PREDICTED: cysteine protease ATG4-like isoform X2 [Cucumis melo][more]
gi|694327603|ref|XP_009354662.1|3.5e-19471.13PREDICTED: cysteine protease ATG4-like [Pyrus x bretschneideri][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005078Peptidase_C54
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006914 autophagy
biological_process GO:0015031 protein transport
biological_process GO:0006508 proteolysis
biological_process GO:0008150 biological_process
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005575 cellular_component
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G020890.1CmoCh04G020890.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005078Peptidase C54PANTHERPTHR22624APG4 AUTOPHAGY 4-RELATEDcoord: 9..477
score: 8.7E
IPR005078Peptidase C54PFAMPF03416Peptidase_C54coord: 137..420
score: 1.6
NoneNo IPR availablePANTHERPTHR22624:SF34AUTOPHAGY-SPECIFIC GENE 4, ISOFORM Acoord: 9..477
score: 8.7E
NoneNo IPR availableunknownSSF54001Cysteine proteinasescoord: 95..439
score: 1.72

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CmoCh04G020890CmaCh00G001390Cucurbita maxima (Rimu)cmacmoB014
CmoCh04G020890Cp4.1LG01g17500Cucurbita pepo (Zucchini)cmocpeB682
CmoCh04G020890Carg27580Silver-seed gourdcarcmoB1127
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CmoCh04G020890Cucurbita maxima (Rimu)cmacmoB741
CmoCh04G020890Cucurbita pepo (Zucchini)cmocpeB673
CmoCh04G020890Cucurbita pepo (Zucchini)cmocpeB680