MC09g1382 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC09g1382
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptioncysteine proteinase 15A
LocationMC09: 19892850 .. 19900553 (-)
RNA-Seq ExpressionMC09g1382
SyntenyMC09g1382
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
CTGATTCTAACGTGTGCATTTGTCCTCGCACTACCCTCCGCCGCCGCGCTCCGCCGAGCCCCGGAATTGCTCCGCCAAGTCACCGACAATAATATTACCGACGACGACGGGGAGCAGTACGAAATTAATAATGCCCTCGTGGGGACGGGGAGCGAGAGGAAGTTCACAATGTTCATGGAGAAGTACGGTAAGAGTTATCCGACGAGGAGGGAGTATCTGCGGCGGCTGGGGATTTTCGCGAAGAATCTGGTGAGGGCGGCGGAGCACCAGGTTCTGGACCCGACGGCGGTGCACGGGGTGACGCCGTTTTCCGACCTGTCGGAGGAGGAGTTCGAGGAGATGTTTTTGGGGGTCAGATCGGCCGGCGGATCCGAGTTTGAGAGAAGTAATGAGGCGGCGGAGGAGGTGGTGGAGGGTTTGCCCGAGAGCTTTGATTGGCGTGAAAAGGGAGCTGTTACTGATGTTAAGATGCAGGTAAGAAAATGGTAAATAGTAATTTTTTTTTTCTGGTTTTATGACTATTGGTAGGTAGTTATTTTATTTTACGTTTTATGAATTAATATTTTTTAGCTCAAGGCGTCTCTGGCACTGGGGTTCATTTTGACAATTTTATTCATAAAAATCTGCTGAGTACGTATAATTTTATCTATAATATGCATGTCCCGTTTATAATTTTATCATAAGAAGTTGAGATATATTTAAAATTTGTAGAGTTTCTTTTATTACAACATATATGGAATAGAGGATTGAATTCAAGAGGTTGGAATTGAGCTATATCTAGCTTTGGCTTAAAAATTTGTAGAATTAAATTTGCTATTATATTATTTTGGGTATATGGCCTTTAAAATTAAGTTTAGATGAGTACATTATCAATTTATCCACTAGTATTTAGTTTTACTTGAAAATTTAAGTTCAACTTATTTAAAGAAAAAGAAAACTTTGAATTCTGAAATTTGAAAAAAGCACACTTTTGAAAAGAATTAGGAAATTTTCACTGGAAATAAGAGAGAAAAAAGAGAGAAAAAAAGATATTTATTGGGTATAGCGGCGGTTGCAGGGACAATGCGGGTCATGCTGGGCGTTCAGTACGTGTGGAGTAGTGGAAGGAGCCAACTTCATCTCCACCGGAAAACTCCTCAGCCTCAGCGAACAACAGCTCATCGATTGCGACCACACGGTCGGTAAATTCCAAATCCCTAATTTCAAATCCCGATTTCCAAAACCCTAAAACCCTAAATCTTGCAGTGCGATGGGATGGAGGAGGGCGTGTGCGACAGCGGCTGCAATGGCGGCCTAATGACCAACGCCTACAAGTACATAATGGCGGCCGGAGGCCTGGAGGAGGAGAGCTCCTACCCCTACACCGGGAAGCCCGCCGAGTGCCGGTTTGAATCCGACAAAATCGCCGTTAGGGTTTCCAATTTCACGACGATCCCCTTGGACGAGAACCAGATCGCGGCGCACCTGGTCCGGCGCGGCCCGCTCGCGGTGGGCCTCAACGCCGTCTTCATGCAGACCTACATCGGCGGCGTCTCCTGCCCGCTCATCTGCTGGAAGAGATTCGTCAACCACGGGGTGCTGCTCGTCGGCTACGGCGCCGACGGCTACTCGATTCTGCGGCTTCGGAAGGTGCCGTACTGGATAATCAAGAACTCGTGGGGGAAGCGGTGGGGGGAGCAGGGGTATTATCGTCTCTGCCGCGGCCACGGGATGTGTGGGATGAATACTATGGTTTCCGCCGTCGTCACTCAGCCCCAGCCCTAGCGTTTCCACAAATCCGCCATTGAAATTGGCTTAGGGTTTCGGTTTAATTTACAGATTAGGAACTAAGTGTTGGCGTAGCGTTGCATTTTGTCCTTTGCTTTAAAAAACATAAACACTACCAAAAAAAAAAAATATTTTGTCACAATAATATTTTAGGGTTTAGGATGGTAATTCAGTATATCCACTATAAAACTATTAACTAAAATATATCTGTGATTAACTAAACTAATTGGTGGGCGGTCATGCTCTAAGCGGACTCACATAAACACATAACATAAATGTTATAATAAAAACAATATAGTCAAATATGAATCACTCTCTCATCTCATTAGAGTTCCCACTAAACGACTAATATGGAAAATAAATAATTAGTTACAAATCAAACAACTAACTAGTCAAAGTAGTTGGATCAAAATACATCTAATACGCCCCCTCAAGATGGAGGATATAACCAAAACGCCCATCTTGAATAATAAGGGAAACAAAGCAGAAGTAGGAAGGGTTTTAGTGAACATATCAGCCAGCTGCATAGAAAATCGGATGGGAGGATCGGATCATTTGAGGAGATTCTGACAACCATCGAATTAACAGATGAACTCACATCGGATCAATGGCGGTCATTGGAGGTGTGGAATTTTGCGATGCGACTCCTTGTCCATAAGGAATTGATGAGTTAGAAGTTAAATTTCGAGAAATAGCTTTCTGTCTGTATCCAGCCAGATAGCCATGGAGTTTGTAACATTGATCGAAGGTGTGGCTTTGAAGACCACAATGAGTGCAAATAGGTCGCTCTTTACGTCCGCGAGAGGAATTAGAAGTAGATGCATTCAATTGACGAATACTCTATTGAACAGCAATGCTGAGAGCAGGAGATGGGAAAATAATGGGAAAATAATGGGAAAATAGGGATGAAGCGTTGTTGCTCTTCTTGAGAAATTAATGAAAATGCCTTGTTGATGGAAGGTGGTGGATCCATGAGCAAAATTTGCACACGAGTTGATGCAAAAGACTCGTTCAAACCCATTAGAAAACTCATGAGGTATTCGAATTGAAAAAAATCTTCAATTAGCTTATTACTTGACCAGTACATTTACCACAACTACAAGCTACATAGTGACAAGCACTGATATTGAGTTAAATTGGCCAAATTACACTTGAGTGGGAAGATTCTTAGACCATTCTTCTTCTGAAATCGCTCTTTGAGATCAAGCCAAATTGTCTTTGTCGAATAAGAGAAAATCAAACTTGCAGAGATTGGCTTGAAAACGAAATTAAAAATCCAAATAGAGAGAAGATCGCATGTAGACTTCTTAATTGTACCGTAAATTTGTTTTTGATGGAAAGAGCAATGATCATTGACCTGTTCCATGACATGGTGAGTTGCTGTGTGATAAGAACAAGACTAGATGTATAGCTATGATGGAGATGGTGATGACCAGAGGATTTTCACGAAGTGTGGGAGGAGGATCCATAGGCGCCATGTGAGAAAGGAAAAGAAAATTAAAAATACCCTGATATCATAATAAAAATAATATGGTATTTCTCATCTCATTAGAGTTCTCCAATGCATAGACTAATATGAGAAATAACATAATCGAATGATTAGTTACAAATTAAACAACTAACTAGTCAAAGTAGTTAGACCAAAATGCATCTAATATGTTACATGACTTTAGTTGTTCCTAATTTATCTAAATACAAAAAATTATATTGAATATTGGAATTGCTCTATCATATATAATTAGTATAAATATAAAATTTTGATCTATGTATAAATAATTAATGTATTGATTTCAATCGGCCAACGTGTGTTGTCGTAAAAATGGTCAAGAGCAAAGCCTTTATTTCTTTTTATTTTATTTTATATTTTTTTTAAGAGCAAATTGCCTTTATTCGTCTAACGATACTTTACGCAGCAGCGGTTATGGTAAGAGACGTACAGCTTAGCTGACGACGTTGGGGATAACACAGAAAGCAGAGAGCTCCCATGGCTTCGCCTCTTTCCTTCTCTCCGCGTCGCCTTCTCAACTCTCGCGCGCTTGCAATTGCAAGCACTCACTTCCACGCCATTAATGTCGCGCCTTCCCAAATTTCAGCACCATCCGTTTCTTCTGTTAGTAATCCCCTCAAGTTTAAGCTCAAACAACCTCTCAGAGCGTCTTCTGAAGGTATGTTTCTCTCCCTTCTCACACTCTTCCACTTCGCAATCATGGGAATTTTACTTTAGATATACGTTTCGAGTTGTTTGAGACGCTACCCATTAGAAAGCGTTTTACGATTTTGAATCGGGATTTCTGAGAAATCTTCAGCTATTTCATTCGCGTTTTTCCTGATTACAAAAGGCTTAGAATCTCTAAAAATCTGGAAATAATTTGCAAGGGCGGTGGCTTGTATATAGCACAGTATCACTGTTTCTTTTCATACGGCATAACTTGTTTCACTCCTACAGGAGCCCCTAATGAGTTAGTCGAAGATTCTAAGTTTGTTCCCTTGAATGCTGATGATCCCAGATATGGTCCACCTGTAAGTTCTGGTTAGTTTCAGCGCAATTGTTGACGTGGTTTATCTCTCGTTGCTTCTTTTCGGTGGATTTTCATTTTGATTGTTGTATATGTTTTCACTTTACTTATTGTCTGAGAATTGGGTTTTAATTTAAATTGAGCTCCTGTATTTTATGTTTATTGTACTATTAAGTTGAATGCTCTTGGATGTGTACGTACCATATGTATTAGTTGTCGTCATCACCTTCATAATTTTCATTTTGGATTCTGTTTGTCTCATAGGATAAGTGTTATTTTATGATTTCCCTTGGGTTTTTCATCCTGTAATGTATGAAATTATGTCTTTAAATGCTTCCTTTGCGTTTCTTTATGGAATTAGAATAGGGGAAAGCAGGTTGATATTTAGACTGTTCATTATTGCAGGCATTGCTGTTACTGGGCTTTGAATTGGAGGAGGCCATGAAGGTACGGTGTTGATTTTTTTCCTTCTTTCTGCGCTTCAACATCTGAAAACTAAAATAAAGTAAAAATCATACTGCTAAGTGAAAGTAGAAACCAAACTCCTGGTGGAGTATAGTTCTTTTACTGTTTAGTGGATAATCTGCCGGTTAGTTAAGGAGGTGATTCTGCAAACTGTTTGGTTTTGCTTTCTAGTTCTTCAAGTATTTTCGGAAGGAAGAGAGCAAAGGAATGCACTTCCTTTCTGATATTTATTAGTGGACCACGCTTCGAATGTCTACTGTGCTACAATTTACAAATGCTTGCAAATGCTATTGTCAAAATTTATTGAGGAATGCTTTCATTATTAGCTAACTACGCTTGCAGTATGGTCTTTCAGTCCTCTAATGCACTTATTTCTCACGAATTTTTTTGGTACTTGGAGATTTTTTCTAGATTTTGTGTTGTTTATCCTACAGATCGTGACAACACCTTCTGGTTAGTTCTTTTAGAGGTCAAATTTAATTGTATGGAGGGTTGAGGTTTATTATTTATAGTCACTGCTTTTTTCTTTGCCTGAACCTTCATAATATATTCTTATAAAGAAACGCTAAGCAACTTCATTCAAAGTTCTGCTATTCAGTCACTACGACACTATATAACCAAAACAGCCAAGCGGGCACCTACAAACTCCTTGTCTCCTCACTTATCCTTTAGCTTTGAAATGCTAGTTTCGAAAACTACTACCCCGTTTGATTTTTTGCTTTACATTTGATTACAGATTCAAGAGCTTTTGAAAGATTTGGGCGGTGAATTCATGCAGGTAGTTCATTTCCCCTCGTGTCCAATTACTTTTAGGTAAGATGAAAGGCGGTTGTTATTAAGAATAGTAGAATAAAATATGAAAATGTTGTTCAATGCATTGAAGCACTAATTGTGGAAAATTAACCCTCTAATCATCAATCCAAATCTCGTGTGAAAAGTTAAAACTCGCTTGATATTATGCTAAATGTAACCAAGTGCATTACTGATAAGTGATTATTCCAAGAGATTTTTCTAGTGCAAACTCATGGTTTAATCTATTTTTCACAATCTTTCGGTTCAAATTACTATTTATGTATATTTAACATTTTTCCCCATATCATGCATTATCTTCTGATCCTATAGGTTGGGAAACTTTTCTTTGGCAACCTCCTTTCTGAACTTCAAACCAAAAAAATTGGTCGATCTCATCTAGTTCCCTGATATTTTTTTGTCTAATCGAATACATGTAAATAAAAGAATTCCCATCACGGTAGCAAGTCTCTGTGAAGTAGTTCTGGCATGCTACGTGTGTTGTCTTACACATTGCATAAATTTCTTAATCATCTCTAACAACAGATTGCACAAAACTACCAACATAAATAATCTGTCGAACAGTTTTTTTAGTGTAGGTTATTCTTTGTACTGAAGACATGATTGCCAGATCCCTGTGGGACGCAGTGCATACAAGCCAGCCAGTTTTGGCAAACGTGAAGGTCAAGTTCTGGACTTCCTGAATTTTTACATTTTTAGTTTAGTATGTGCTTGCTATATTGTTCTCATAAAGAATCTAACTTTTGAAGCATGAAAACAGATCAAATCAATGAGTTGCAGATCTTGATTATTCTCACATAACATTCTTTGTTAATGTGATACTGTAATGTCACTAACACCATGTATATTGTTTCTACAGATAGCAAGATCATTGCCACGAATCTGCTTCTTATCCGGTCTTAGCGGAGAGGAAATGATGATGTTCATCGACACTTTTCCAGAAACTGGTATTGAAAAGATTACCCTCTTCTTTTTCATTTTAAGGAGAAGTTGGGACTTGGGATCTTTTAAAATCCAAAAGTACATTTGATCATTTGCATATTCCTTTACTCTCTTTCTCCATATTGTCATTCTGAAAACAGCACGGGTCTGTTATAGTATGATCACCACTAGCACCATGACCAGAGAAGCAGAGCAAAACTCTCTCTCTCTCTCTCTCAGTCTTGATTTTCTTATGGTGCAGGACTTGAACCAGCTGTATATGCTGCTCTTGTTCCCAACGGTGCCAATAAACCAGTAGGGGAGTTAATAGCAGAGATTATGGGGGACCATGAAATGATGGTAAGGGAGAATTCCTTGGTTTATTTATCTGAATTTCTGAAGTCATCGACCGCCTAGGACCCATGGTTGAATTTTTATGAACACTGATACCAATGAAACCAATCTTCTATGTTTGAAACTCTGAACTAATCCAGAGCAACCAATTTGATTGCTTTGGCCTCATGAGAATCTTTCTTCCTGCATAGCTATCTCACTCGGTCCTACATGTCAAAACTTTAACGATGGTCAGCAAAAGATGGGTGCTAACCCGAGAGCATCCAAATATTTTATTTCGAAACATATAAAATAAGATTGGATATGTTTAAACAGCTCATATGTTTTTCCACCATTTTATGAAACTGATTTAATGCTAATTCTAAATCCTGCCAAATGTGTTTTGAGTTTGATTGAAATAATTGATTATTTCATTTTGATCGAATATAATTATATTTTGAATGTTGCCCCCATCATTAAGACAGACACCTTACATTTTCATGCAGACTGGTGCAACATCAGACAGTCCATAGGGGAGTAGAAAGCTGGAGATTTGGTGAAAAATGAAAATCCACATTCATAATATGAAATCTCAAACACCTGTGGCTCCCTCGAATCACCTAGATTCACACGCGTAATTTTTATAGCTGCTTTTATCTGTCTAAAGTCGTTCATGGTAATGAAATACCATATTCTATATTCACGACTTGCTCAAACTTTTGTTCTTTTGTTCTTTTTTTCCACTTTTTGTTTTGGATATTTGTTCATCATGACTCCTTAATCCAACCTGGATTTGTCTCACATTTTATTCAACTCTGTATGATGTTATGTCACGTCATATGTGGTGAAATTAAATCTAAAGATATTTTTTCCCTTTTAAATTCATCTGTCT

mRNA sequence

CTGATTCTAACGTGTGCATTTGTCCTCGCACTACCCTCCGCCGCCGCGCTCCGCCGAGCCCCGGAATTGCTCCGCCAAGTCACCGACAATAATATTACCGACGACGACGGGGAGCAGTACGAAATTAATAATGCCCTCGTGGGGACGGGGAGCGAGAGGAAGTTCACAATGTTCATGGAGAAGTACGGTAAGAGTTATCCGACGAGGAGGGAGTATCTGCGGCGGCTGGGGATTTTCGCGAAGAATCTGGTGAGGGCGGCGGAGCACCAGGTTCTGGACCCGACGGCGGTGCACGGGGTGACGCCGTTTTCCGACCTGTCGGAGGAGGAGTTCGAGGAGATGTTTTTGGGGGTCAGATCGGCCGGCGGATCCGAGTTTGAGAGAAGTAATGAGGCGGCGGAGGAGGTGGTGGAGGGTTTGCCCGAGAGCTTTGATTGGCGTGAAAAGGGAGCTGTTACTGATGTTAAGATGCAGGGACAATGCGGGTCATGCTGGGCGTTCAGTACGTGTGGAGTAGTGGAAGGAGCCAACTTCATCTCCACCGGAAAACTCCTCAGCCTCAGCGAACAACAGCTCATCGATTGCGACCACACGTGCGATGGGATGGAGGAGGGCGTGTGCGACAGCGGCTGCAATGGCGGCCTAATGACCAACGCCTACAAGTACATAATGGCGGCCGGAGGCCTGGAGGAGGAGAGCTCCTACCCCTACACCGGGAAGCCCGCCGAGTGCCGGTTTGAATCCGACAAAATCGCCGTTAGGGTTTCCAATTTCACGACGATCCCCTTGGACGAGAACCAGATCGCGGCGCACCTGGTCCGGCGCGGCCCGCTCGCGGTGGGCCTCAACGCCGTCTTCATGCAGACCTACATCGGCGGCGTCTCCTGCCCGCTCATCTGCTGGAAGAGATTCGTCAACCACGGGGTGCTGCTCGTCGGCTACGGCGCCGACGGCTACTCGATTCTGCGGCTTCGGAAGGTGCCGTACTGGATAATCAAGAACTCGTGGGGGAAGCGGTGGGGGGAGCAGGGGTATTATCGTCTCTGCCGCGGCCACGGGATCTTAGCTGACGACGTTGGGGATAACACAGAAAGCGCTCCCATGGCTTCGCCTCTTTCCTTCTCTCCGCGTCGCCTTCTCAACTCTCGCGCGCTTGCAATTGCAAGCACTCACTTCCACGCCATTAATGTCGCGCCTTCCCAAATTTCAGCACCATCCGTTTCTTCTGTTAGTAATCCCCTCAAGTTTAAGCTCAAACAACCTCTCAGAGCGTCTTCTGAAGGAGCCCCTAATGAGTTAGTCGAAGATTCTAAGTTTGTTCCCTTGAATGCTGATGATCCCAGATATGGTCCACCTGCATTGCTGTTACTGGGCTTTGAATTGGAGGAGGCCATGAAGATTCAAGAGCTTTTGAAAGATTTGGGCGGTGAATTCATGCAGGTTATTCTTTGTACTGAAGACATGATTGCCAGATCCCTGTGGGACGCAGTGCATACAAGCCAGCCAGTTTTGGCAAACGTGAAGATAGCAAGATCATTGCCACGAATCTGCTTCTTATCCGGTCTTAGCGGAGAGGAAATGATGATGTTCATCGACACTTTTCCAGAAACTGGACTTGAACCAGCTGTATATGCTGCTCTTGTTCCCAACGGTGCCAATAAACCAGTAGGGGAGTTAATAGCAGAGATTATGGGGGACCATGAAATGATGACTGGTGCAACATCAGACAGTCCATAGGGGAGTAGAAAGCTGGAGATTTGGTGAAAAATGAAAATCCACATTCATAATATGAAATCTCAAACACCTGTGGCTCCCTCGAATCACCTAGATTCACACGCGTAATTTTTATAGCTGCTTTTATCTGTCTAAAGTCGTTCATGGTAATGAAATACCATATTCTATATTCACGACTTGCTCAAACTTTTGTTCTTTTGTTCTTTTTTTCCACTTTTTGTTTTGGATATTTGTTCATCATGACTCCTTAATCCAACCTGGATTTGTCTCACATTTTATTCAACTCTGTATGATGTTATGTCACGTCATATGTGGTGAAATTAAATCTAAAGATATTTTTTCCCTTTTAAATTCATCTGTCT

Coding sequence (CDS)

CTGATTCTAACGTGTGCATTTGTCCTCGCACTACCCTCCGCCGCCGCGCTCCGCCGAGCCCCGGAATTGCTCCGCCAAGTCACCGACAATAATATTACCGACGACGACGGGGAGCAGTACGAAATTAATAATGCCCTCGTGGGGACGGGGAGCGAGAGGAAGTTCACAATGTTCATGGAGAAGTACGGTAAGAGTTATCCGACGAGGAGGGAGTATCTGCGGCGGCTGGGGATTTTCGCGAAGAATCTGGTGAGGGCGGCGGAGCACCAGGTTCTGGACCCGACGGCGGTGCACGGGGTGACGCCGTTTTCCGACCTGTCGGAGGAGGAGTTCGAGGAGATGTTTTTGGGGGTCAGATCGGCCGGCGGATCCGAGTTTGAGAGAAGTAATGAGGCGGCGGAGGAGGTGGTGGAGGGTTTGCCCGAGAGCTTTGATTGGCGTGAAAAGGGAGCTGTTACTGATGTTAAGATGCAGGGACAATGCGGGTCATGCTGGGCGTTCAGTACGTGTGGAGTAGTGGAAGGAGCCAACTTCATCTCCACCGGAAAACTCCTCAGCCTCAGCGAACAACAGCTCATCGATTGCGACCACACGTGCGATGGGATGGAGGAGGGCGTGTGCGACAGCGGCTGCAATGGCGGCCTAATGACCAACGCCTACAAGTACATAATGGCGGCCGGAGGCCTGGAGGAGGAGAGCTCCTACCCCTACACCGGGAAGCCCGCCGAGTGCCGGTTTGAATCCGACAAAATCGCCGTTAGGGTTTCCAATTTCACGACGATCCCCTTGGACGAGAACCAGATCGCGGCGCACCTGGTCCGGCGCGGCCCGCTCGCGGTGGGCCTCAACGCCGTCTTCATGCAGACCTACATCGGCGGCGTCTCCTGCCCGCTCATCTGCTGGAAGAGATTCGTCAACCACGGGGTGCTGCTCGTCGGCTACGGCGCCGACGGCTACTCGATTCTGCGGCTTCGGAAGGTGCCGTACTGGATAATCAAGAACTCGTGGGGGAAGCGGTGGGGGGAGCAGGGGTATTATCGTCTCTGCCGCGGCCACGGGATCTTAGCTGACGACGTTGGGGATAACACAGAAAGCGCTCCCATGGCTTCGCCTCTTTCCTTCTCTCCGCGTCGCCTTCTCAACTCTCGCGCGCTTGCAATTGCAAGCACTCACTTCCACGCCATTAATGTCGCGCCTTCCCAAATTTCAGCACCATCCGTTTCTTCTGTTAGTAATCCCCTCAAGTTTAAGCTCAAACAACCTCTCAGAGCGTCTTCTGAAGGAGCCCCTAATGAGTTAGTCGAAGATTCTAAGTTTGTTCCCTTGAATGCTGATGATCCCAGATATGGTCCACCTGCATTGCTGTTACTGGGCTTTGAATTGGAGGAGGCCATGAAGATTCAAGAGCTTTTGAAAGATTTGGGCGGTGAATTCATGCAGGTTATTCTTTGTACTGAAGACATGATTGCCAGATCCCTGTGGGACGCAGTGCATACAAGCCAGCCAGTTTTGGCAAACGTGAAGATAGCAAGATCATTGCCACGAATCTGCTTCTTATCCGGTCTTAGCGGAGAGGAAATGATGATGTTCATCGACACTTTTCCAGAAACTGGACTTGAACCAGCTGTATATGCTGCTCTTGTTCCCAACGGTGCCAATAAACCAGTAGGGGAGTTAATAGCAGAGATTATGGGGGACCATGAAATGATGACTGGTGCAACATCAGACAGTCCATAG

Protein sequence

LILTCAFVLALPSAAALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVRSAGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYTGKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGILADDVGDNTESAPMASPLSFSPRRLLNSRALAIASTHFHAINVAPSQISAPSVSSVSNPLKFKLKQPLRASSEGAPNELVEDSKFVPLNADDPRYGPPALLLLGFELEEAMKIQELLKDLGGEFMQVILCTEDMIARSLWDAVHTSQPVLANVKIARSLPRICFLSGLSGEEMMMFIDTFPETGLEPAVYAALVPNGANKPVGELIAEIMGDHEMMTGATSDSP
Homology
BLAST of MC09g1382 vs. ExPASy Swiss-Prot
Match: Q8VYS0 (Probable cysteine protease RD19D OS=Arabidopsis thaliana OX=3702 GN=RD19D PE=2 SV=1)

HSP 1 Score: 475.7 bits (1223), Expect = 7.4e-133
Identity = 226/333 (67.87%), Postives = 262/333 (78.68%), Query Frame = 0

Query: 23  LLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFMEKYGKSYPTRREYLRRLGIFAKN 82
           ++  V D  I     +   I   L+GT +E KF +FM  YGK+Y TR EY+ RLGIFAKN
Sbjct: 19  VVASVEDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKN 78

Query: 83  LVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVRSAGGSEFERSNEAAEEV-VEGLP 142
           +++AAEHQ++DP+AVHGVT FSDL+EEEF+ M+ GV   GGS        A  V V+GLP
Sbjct: 79  VLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGAEAPMVEVDGLP 138

Query: 143 ESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANFISTGKLLSLSEQQLIDCDHTCDG 202
           E FDWREKG VT+VK QG CGSCWAFST G  EGA+F+STGKLLSLSEQQL+DCD  CD 
Sbjct: 139 EDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDP 198

Query: 203 MEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYTGKPAECRFESDKIAVRVSNFTTI 262
            ++  CD+GC GGLMTNAY+Y+M AGGLEEE SYPYTGK   C+F+ +K+AVRV NFTTI
Sbjct: 199 KDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTI 258

Query: 263 PLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPLICWKRFVNHGVLLVGYGADGYSI 322
           PLDENQIAA+LVR GPLAVGLNAVFMQTYIGGVSCPLIC KR VNHGVLLVGYG+ G+SI
Sbjct: 259 PLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSI 318

Query: 323 LRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 355
           LRL   PYWIIKNSWGK+WGE GYY+LCRGH I
Sbjct: 319 LRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDI 351

BLAST of MC09g1382 vs. ExPASy Swiss-Prot
Match: P25804 (Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1)

HSP 1 Score: 378.6 bits (971), Expect = 1.2e-103
Identity = 186/371 (50.13%), Postives = 246/371 (66.31%), Query Frame = 0

Query: 7   FVLALPSAAALRRAPELLRQVTDNNITDD-------DGEQYEINNALVGTGSERKFTMFM 66
           F+ AL   AA+  A      VTD+   DD       D E+  + NA      E  FT F 
Sbjct: 5   FLFALFLFAAVATA------VTDDTNNDDFIIRQVVDNEEDHLLNA------EHHFTSFK 64

Query: 67  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 126
            K+ KSY T+ E+  R G+F  NL++A  HQ  DPTA HG+T FSDL+  EF   FLG++
Sbjct: 65  SKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGLK 124

Query: 127 SAGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANFI 186
                    + +A       LPE FDWREKGAVT VK QG CGSCWAFST G +EGA+++
Sbjct: 125 KRLRLP-AHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYL 184

Query: 187 STGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYTG 246
           +TGKL+SLSEQQL+DCDH CD  + G CDSGCNGGLM NA++Y++ +GG+ +E  Y YTG
Sbjct: 185 ATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTG 244

Query: 247 KPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPLI 306
           +   C+F+  K+   VSNF+ + LDE+QIAA+LV+ GPLAV +NA +MQTY+ GVSCP +
Sbjct: 245 RDGSCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQTYMSGVSCPYV 304

Query: 307 CWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGILADDV 366
           C K  ++HGVLLVG+G   Y+ +RL++ PYWIIKNSWG+ WGEQGYY++CRG  +   D 
Sbjct: 305 CAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDS 362

Query: 367 GDNTESAPMAS 371
             +T +A  ++
Sbjct: 365 MVSTVAAAQSN 362

BLAST of MC09g1382 vs. ExPASy Swiss-Prot
Match: Q10716 (Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1)

HSP 1 Score: 377.9 bits (969), Expect = 2.1e-103
Identity = 182/355 (51.27%), Postives = 239/355 (67.32%), Query Frame = 0

Query: 8   VLALPSAAALRRAPE----LLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFMEKYG 67
           +L+L SAAA+  A +    L+RQV    +   D    E+N       +E  F  F++++G
Sbjct: 8   LLSLASAAAVAAAVDAEDPLIRQV----VPGGDDNDLELN-------AESHFLSFVQRFG 67

Query: 68  KSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVRSAGG 127
           KSY    E+  RL +F  NL RA  HQ+LDP+A HGVT FSDL+  EF   +LG+R +  
Sbjct: 68  KSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYLGLRKSRR 127

Query: 128 SEFERSNEAAEEV----VEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANFI 187
           +      E+A E      +GLP+ FDWR+ GAV  VK QG CGSCW+FS  G +EGA+++
Sbjct: 128 ALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYL 187

Query: 188 STGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYTG 247
           +TGKL  LSEQQ +DCDH CD  E   CDSGCNGGLMT A+ Y+  AGGLE E  YPYTG
Sbjct: 188 ATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTG 247

Query: 248 KPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPLI 307
              +C+F+  KI   V NF+ + +DE QI+A+L++ GPLA+G+NA +MQTYIGGVSCP I
Sbjct: 248 SDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYI 307

Query: 308 CWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 355
           C  R ++HGVLLVGYGA G++ +RL+  PYWIIKNSWG+ WGE GYY++CRG  +
Sbjct: 308 C-GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNV 350

BLAST of MC09g1382 vs. ExPASy Swiss-Prot
Match: P43296 (Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1)

HSP 1 Score: 375.2 bits (962), Expect = 1.4e-102
Identity = 184/342 (53.80%), Postives = 235/342 (68.71%), Query Frame = 0

Query: 35  DDGEQYEINNALVGT-----GSERKFTMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEH 94
           +DG+   I   + G       SE  F++F  K+GK Y +  E+  R  +F  NL RA  H
Sbjct: 26  NDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRH 85

Query: 95  QVLDPTAVHGVTPFSDLSEEEFEEMFLGVRSAGGSEFERSNEAAEEVVEGLPESFDWREK 154
           Q LDP+A HGVT FSDL+  EF +  LGVRS G    + +N+A     E LPE FDWR+ 
Sbjct: 86  QKLDPSATHGVTQFSDLTRSEFRKKHLGVRS-GFKLPKDANKAPILPTENLPEDFDWRDH 145

Query: 155 GAVTDVKMQGQCGSCWAFSTCGVVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDS 214
           GAVT VK QG CGSCW+FS  G +EGANF++TGKL+SLSEQQL+DCDH CD  E   CDS
Sbjct: 146 GAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDS 205

Query: 215 GCNGGLMTNAYKYIMAAGGLEEESSYPYTGKPAE-CRFESDKIAVRVSNFTTIPLDENQI 274
           GCNGGLM +A++Y +  GGL +E  YPYTGK  + C+ +  KI   VSNF+ I +DE QI
Sbjct: 206 GCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQI 265

Query: 275 AAHLVRRGPLAVGLNAVFMQTYIGGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVP 334
           AA+LV+ GPLAV +NA +MQTYIGGVSCP IC +R +NHGVLLVGYGA GY+  R ++ P
Sbjct: 266 AANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRR-LNHGVLLVGYGAAGYAPARFKEKP 325

Query: 335 YWIIKNSWGKRWGEQGYYRLCRGHGILADDVGDNTESAPMAS 371
           YWIIKNSWG+ WGE G+Y++C+G  I   D   +T +A +++
Sbjct: 326 YWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATVST 365

BLAST of MC09g1382 vs. ExPASy Swiss-Prot
Match: P43295 (Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 SV=2)

HSP 1 Score: 373.2 bits (957), Expect = 5.2e-102
Identity = 174/310 (56.13%), Postives = 226/310 (72.90%), Query Frame = 0

Query: 51  SERKFTMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEE 110
           SE  FT+F +K+GK Y +  E+  R  +F  NL+RA  HQ +DP+A HGVT FSDL+  E
Sbjct: 44  SEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSE 103

Query: 111 FEEMFLGVRSAGGSEFER-SNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFST 170
           F    LGV+  GG +  + +N+A     + LPE FDWR++GAVT VK QG CGSCW+FST
Sbjct: 104 FRRKHLGVK--GGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFST 163

Query: 171 CGVVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGL 230
            G +EGA+F++TGKL+SLSEQQL+DCDH CD  EEG CDSGCNGGLM +A++Y +  GGL
Sbjct: 164 TGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGL 223

Query: 231 EEESSYPYTGKP-AECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQ 290
             E  YPYTG     C+ +  KI   VSNF+ + ++E+QIAA+L++ GPLAV +NA +MQ
Sbjct: 224 MREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQ 283

Query: 291 TYIGGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRL 350
           TYIGGVSCP IC +R +NHGVLLVGYG+ G+S  RL++ PYWIIKNSWG+ WGE G+Y++
Sbjct: 284 TYIGGVSCPYICSRR-LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKI 343

Query: 351 CRGHGILADD 359
           C+G  I   D
Sbjct: 344 CKGRNICGVD 350

BLAST of MC09g1382 vs. NCBI nr
Match: KAG7031982.1 (putative cysteine protease RD19D, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 560 bits (1442), Expect = 9.26e-195
Identity = 277/356 (77.81%), Postives = 304/356 (85.39%), Query Frame = 0

Query: 1   LILTCAFVLAL-PSAAALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFM 60
           L+LTCAF LAL   A ALR++PE LRQVTD           EI N+L   GSERKF MFM
Sbjct: 7   LLLTCAFSLALLDCATALRQSPEFLRQVTDG----------EIINSLPA-GSERKFVMFM 66

Query: 61  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 120
           EKYGKSYPTR+EYL RLGIFAKNLVRAAEHQ LDPTAVHGVT FSDLSEEEFE+MF+GVR
Sbjct: 67  EKYGKSYPTRKEYLHRLGIFAKNLVRAAEHQALDPTAVHGVTQFSDLSEEEFEQMFMGVR 126

Query: 121 S-AGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANF 180
             AGG+E    N+A E   EGLPE FDWREKGAVT VKMQG CGSCWAFSTCG VEGANF
Sbjct: 127 GGAGGAELLEMNQAEEMTAEGLPERFDWREKGAVTAVKMQGTCGSCWAFSTCGAVEGANF 186

Query: 181 ISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYT 240
           I+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEESSYPYT
Sbjct: 187 IATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYT 246

Query: 241 GKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPL 300
           G   EC F+SDKIAVRVSNFTTIP+DE+QIAAHLV RGPLAVGLNAVFMQTYIGGVSCPL
Sbjct: 247 GHRGECNFQSDKIAVRVSNFTTIPIDEDQIAAHLVHRGPLAVGLNAVFMQTYIGGVSCPL 306

Query: 301 ICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 354
           IC KRFVNHGVLLVGYG +G+SILR RK+PYWIIKNSWG+RWGE+GYYRLCRGHG+
Sbjct: 307 ICGKRFVNHGVLLVGYGDEGFSILRFRKLPYWIIKNSWGERWGERGYYRLCRGHGM 351

BLAST of MC09g1382 vs. NCBI nr
Match: XP_023535677.1 (probable cysteine protease RD19D [Cucurbita pepo subsp. pepo])

HSP 1 Score: 559 bits (1441), Expect = 1.31e-194
Identity = 276/356 (77.53%), Postives = 305/356 (85.67%), Query Frame = 0

Query: 1   LILTCAFVLALPSAA-ALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFM 60
           L+LTCAF LAL + A ALR++PE LRQVTD           EI N L   GSERKF MFM
Sbjct: 7   LLLTCAFSLALLACATALRQSPEFLRQVTDG----------EIINGLPA-GSERKFVMFM 66

Query: 61  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 120
           EKYGKSYPTR+EYL RLGIFAKNLVRAAEHQ LDPTAVHGVT F+DLSEEEFE+MF+GVR
Sbjct: 67  EKYGKSYPTRKEYLHRLGIFAKNLVRAAEHQALDPTAVHGVTQFADLSEEEFEQMFMGVR 126

Query: 121 S-AGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANF 180
             AGG+E    N+A E   EGLPE FDWREKGAVT VKMQG CGSCWAFSTCG VEGANF
Sbjct: 127 GGAGGAELLEMNQAEEMTAEGLPERFDWREKGAVTAVKMQGTCGSCWAFSTCGAVEGANF 186

Query: 181 ISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYT 240
           I+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEESSYPYT
Sbjct: 187 IATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYT 246

Query: 241 GKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPL 300
           G+  EC F+SDKIAVRVSNFTTIP+DE+QIAAHLV RGPLAVGLNAVFMQTYIGGVSCPL
Sbjct: 247 GRRGECNFQSDKIAVRVSNFTTIPIDEDQIAAHLVHRGPLAVGLNAVFMQTYIGGVSCPL 306

Query: 301 ICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 354
           IC KRFVNHGVLLVGYG +G+SILR RK+PYWIIKNSWG+RWGE+GYYRLCRGHG+
Sbjct: 307 ICGKRFVNHGVLLVGYGDEGFSILRFRKLPYWIIKNSWGERWGERGYYRLCRGHGM 351

BLAST of MC09g1382 vs. NCBI nr
Match: XP_022956816.1 (probable cysteine protease RD19D [Cucurbita moschata])

HSP 1 Score: 558 bits (1439), Expect = 2.64e-194
Identity = 274/356 (76.97%), Postives = 302/356 (84.83%), Query Frame = 0

Query: 1   LILTCAFVLALPSAA-ALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFM 60
           L+LTCAF LAL + A ALR+ PE LRQVTD  I +            +  GSERKF MFM
Sbjct: 7   LLLTCAFSLALLACATALRQGPEFLRQVTDGGIINS-----------LPAGSERKFVMFM 66

Query: 61  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 120
           EKYGKSYPTR+EYL RLGIFAKNLVRAAEHQ LDPTAVHGVT FSDLSEEEFE+MF+GVR
Sbjct: 67  EKYGKSYPTRKEYLHRLGIFAKNLVRAAEHQALDPTAVHGVTQFSDLSEEEFEQMFMGVR 126

Query: 121 S-AGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANF 180
             AGG+E    N+A E   EGLPE FDWREKGAVT VKMQG CGSCWAFSTCG VEGANF
Sbjct: 127 GGAGGAELLEMNQAEEMTAEGLPERFDWREKGAVTAVKMQGTCGSCWAFSTCGAVEGANF 186

Query: 181 ISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYT 240
           I+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEESSYPYT
Sbjct: 187 IATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYT 246

Query: 241 GKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPL 300
           G   EC F+SDKIAVRVSNFTTIP+DE+QIAAHLV RGPLAVGLNAVFMQTYIGGVSCPL
Sbjct: 247 GHRGECNFQSDKIAVRVSNFTTIPIDEDQIAAHLVHRGPLAVGLNAVFMQTYIGGVSCPL 306

Query: 301 ICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 354
           IC KRFVNHGVLLVGYG +G+SILR RK+PYWIIKNSWG+RWGE+GYYRLCRGHG+
Sbjct: 307 ICGKRFVNHGVLLVGYGDEGFSILRFRKLPYWIIKNSWGERWGERGYYRLCRGHGM 351

BLAST of MC09g1382 vs. NCBI nr
Match: XP_038892721.1 (probable cysteine protease RD19D [Benincasa hispida])

HSP 1 Score: 558 bits (1439), Expect = 1.62e-193
Identity = 278/363 (76.58%), Postives = 308/363 (84.85%), Query Frame = 0

Query: 1   LILTCAFVL-----ALPSAAALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKF 60
           L+LTCAF L     A+ SA ALRR P LLRQVTD  I         INN  +  GSERKF
Sbjct: 50  LLLTCAFSLTLLTCAIHSATALRRDPGLLRQVTDGEI---------INN--LPAGSERKF 109

Query: 61  TMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMF 120
            MFMEKYGKSYPTR+EYL RLGIFAKNL+RAAEHQ LDPTAVHGVT FSDLSEEEFE MF
Sbjct: 110 VMFMEKYGKSYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMF 169

Query: 121 LGVRSAG-GSEFERSNEAAE---EVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCG 180
           +GVRS G G+E    N+AAE   E VEGLPE FDWREKGAVT VKMQG CGSCWAFSTCG
Sbjct: 170 MGVRSGGAGAELREMNQAAEMMAEEVEGLPERFDWREKGAVTGVKMQGTCGSCWAFSTCG 229

Query: 181 VVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEE 240
            VEGANFI+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEE
Sbjct: 230 AVEGANFIATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEE 289

Query: 241 ESSYPYTGKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYI 300
           +SSYPYTG+  +C F+SDKIAVRVSNFTTIP+DE+QIAAHLVRRGPLAVGLNAVFMQTYI
Sbjct: 290 DSSYPYTGRRGDCDFQSDKIAVRVSNFTTIPIDEDQIAAHLVRRGPLAVGLNAVFMQTYI 349

Query: 301 GGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRG 354
           GGVSCPLIC KRFVNHGVL+VGYG +G+SILR RK+PYWIIKNSWG+RWGE+GYYRLCRG
Sbjct: 350 GGVSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGERGYYRLCRG 401

BLAST of MC09g1382 vs. NCBI nr
Match: XP_022984713.1 (probable cysteine protease RD19D [Cucurbita maxima])

HSP 1 Score: 551 bits (1421), Expect = 1.42e-191
Identity = 273/356 (76.69%), Postives = 300/356 (84.27%), Query Frame = 0

Query: 1   LILTCAFVLALPSAA-ALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFM 60
           L+LTCAF L L + A ALR+ PE LRQ+TD           EI N+L   GSERKF MFM
Sbjct: 7   LLLTCAFSLTLLACATALRQGPEFLRQITDG----------EIINSLPA-GSERKFVMFM 66

Query: 61  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 120
           EKYGKSYPTR+EYL RLGIFAKNLVRAAEHQ LDPTA+HGVT FSDLSEEEFE MF+GVR
Sbjct: 67  EKYGKSYPTRKEYLHRLGIFAKNLVRAAEHQALDPTAMHGVTQFSDLSEEEFELMFMGVR 126

Query: 121 SA-GGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANF 180
              GG+E    N+A E   EGLPE FDWREKGAVT VKMQG CGSCWAFSTCG VEGANF
Sbjct: 127 GGVGGAELLEMNQAEEMTAEGLPERFDWREKGAVTAVKMQGTCGSCWAFSTCGAVEGANF 186

Query: 181 ISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYT 240
           I+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEESSYPYT
Sbjct: 187 IATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYT 246

Query: 241 GKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPL 300
           G   EC F+S KIAVRVSNFTTIP+DE+QIAAHLV RGPLAVGLNAVFMQTYIGGVSCPL
Sbjct: 247 GHRGECNFQSHKIAVRVSNFTTIPIDEDQIAAHLVHRGPLAVGLNAVFMQTYIGGVSCPL 306

Query: 301 ICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 354
           IC KRFVNHGVLLVGYG +G+SILR RK+PYWIIKNSWG+RWGEQGYYRLCRGHG+
Sbjct: 307 ICGKRFVNHGVLLVGYGDEGFSILRFRKLPYWIIKNSWGERWGEQGYYRLCRGHGM 351

BLAST of MC09g1382 vs. ExPASy TrEMBL
Match: A0A6J1GXJ8 (probable cysteine protease RD19D OS=Cucurbita moschata OX=3662 GN=LOC111458402 PE=3 SV=1)

HSP 1 Score: 558 bits (1439), Expect = 1.28e-194
Identity = 274/356 (76.97%), Postives = 302/356 (84.83%), Query Frame = 0

Query: 1   LILTCAFVLALPSAA-ALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFM 60
           L+LTCAF LAL + A ALR+ PE LRQVTD  I +            +  GSERKF MFM
Sbjct: 7   LLLTCAFSLALLACATALRQGPEFLRQVTDGGIINS-----------LPAGSERKFVMFM 66

Query: 61  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 120
           EKYGKSYPTR+EYL RLGIFAKNLVRAAEHQ LDPTAVHGVT FSDLSEEEFE+MF+GVR
Sbjct: 67  EKYGKSYPTRKEYLHRLGIFAKNLVRAAEHQALDPTAVHGVTQFSDLSEEEFEQMFMGVR 126

Query: 121 S-AGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANF 180
             AGG+E    N+A E   EGLPE FDWREKGAVT VKMQG CGSCWAFSTCG VEGANF
Sbjct: 127 GGAGGAELLEMNQAEEMTAEGLPERFDWREKGAVTAVKMQGTCGSCWAFSTCGAVEGANF 186

Query: 181 ISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYT 240
           I+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEESSYPYT
Sbjct: 187 IATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYT 246

Query: 241 GKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPL 300
           G   EC F+SDKIAVRVSNFTTIP+DE+QIAAHLV RGPLAVGLNAVFMQTYIGGVSCPL
Sbjct: 247 GHRGECNFQSDKIAVRVSNFTTIPIDEDQIAAHLVHRGPLAVGLNAVFMQTYIGGVSCPL 306

Query: 301 ICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 354
           IC KRFVNHGVLLVGYG +G+SILR RK+PYWIIKNSWG+RWGE+GYYRLCRGHG+
Sbjct: 307 ICGKRFVNHGVLLVGYGDEGFSILRFRKLPYWIIKNSWGERWGERGYYRLCRGHGM 351

BLAST of MC09g1382 vs. ExPASy TrEMBL
Match: A0A6J1JBC1 (probable cysteine protease RD19D OS=Cucurbita maxima OX=3661 GN=LOC111482910 PE=3 SV=1)

HSP 1 Score: 551 bits (1421), Expect = 6.89e-192
Identity = 273/356 (76.69%), Postives = 300/356 (84.27%), Query Frame = 0

Query: 1   LILTCAFVLALPSAA-ALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFM 60
           L+LTCAF L L + A ALR+ PE LRQ+TD           EI N+L   GSERKF MFM
Sbjct: 7   LLLTCAFSLTLLACATALRQGPEFLRQITDG----------EIINSLPA-GSERKFVMFM 66

Query: 61  EKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVR 120
           EKYGKSYPTR+EYL RLGIFAKNLVRAAEHQ LDPTA+HGVT FSDLSEEEFE MF+GVR
Sbjct: 67  EKYGKSYPTRKEYLHRLGIFAKNLVRAAEHQALDPTAMHGVTQFSDLSEEEFELMFMGVR 126

Query: 121 SA-GGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANF 180
              GG+E    N+A E   EGLPE FDWREKGAVT VKMQG CGSCWAFSTCG VEGANF
Sbjct: 127 GGVGGAELLEMNQAEEMTAEGLPERFDWREKGAVTAVKMQGTCGSCWAFSTCGAVEGANF 186

Query: 181 ISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYT 240
           I+TGKLLSLSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEESSYPYT
Sbjct: 187 IATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEESSYPYT 246

Query: 241 GKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPL 300
           G   EC F+S KIAVRVSNFTTIP+DE+QIAAHLV RGPLAVGLNAVFMQTYIGGVSCPL
Sbjct: 247 GHRGECNFQSHKIAVRVSNFTTIPIDEDQIAAHLVHRGPLAVGLNAVFMQTYIGGVSCPL 306

Query: 301 ICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 354
           IC KRFVNHGVLLVGYG +G+SILR RK+PYWIIKNSWG+RWGEQGYYRLCRGHG+
Sbjct: 307 ICGKRFVNHGVLLVGYGDEGFSILRFRKLPYWIIKNSWGERWGEQGYYRLCRGHGM 351

BLAST of MC09g1382 vs. ExPASy TrEMBL
Match: A0A5D3CBQ6 (Cysteine proteinase 15A OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G002450 PE=3 SV=1)

HSP 1 Score: 548 bits (1412), Expect = 1.99e-190
Identity = 270/362 (74.59%), Postives = 300/362 (82.87%), Query Frame = 0

Query: 2   ILTCAFVL-----ALPSAAALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFT 61
           +L CAF L     A+PSA ALR  PE LRQVTD  I          NN  +  GSERKF 
Sbjct: 7   LLACAFSLTLLISAIPSATALRHDPEFLRQVTDGEI---------FNN--LPAGSERKFA 66

Query: 62  MFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFL 121
           MFMEKYGKSYPTR+EYL RLGIF KNL+RAAEHQ LDPTAVHGVT FSDLSEEEFE MF+
Sbjct: 67  MFMEKYGKSYPTRKEYLHRLGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFM 126

Query: 122 GVRS-AGGSEFERSNEAAE---EVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGV 181
           GVR  AGG+     N+A E   E V+GLPE FDWREKGAVT VKMQG CGSCWAFSTCG 
Sbjct: 127 GVRGGAGGAGLPEMNQAVEVSAEEVKGLPERFDWREKGAVTGVKMQGTCGSCWAFSTCGA 186

Query: 182 VEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEE 241
           VEGANFI+TGKLL+LSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEE
Sbjct: 187 VEGANFIATGKLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEE 246

Query: 242 SSYPYTGKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIG 301
           SSYPYTG+  +C F+SDKIAV+VSNFTTIP+DENQIAAHLVR GPLAVGLNAVFMQTYIG
Sbjct: 247 SSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIG 306

Query: 302 GVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGH 354
           GVSCPLIC KRFVNHGVL+VGYG +G+SILR RK+PYWIIKNSWG+RWGE GYYRLCRGH
Sbjct: 307 GVSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEHGYYRLCRGH 357

BLAST of MC09g1382 vs. ExPASy TrEMBL
Match: A0A1S3BG12 (cysteine proteinase 15A OS=Cucumis melo OX=3656 GN=LOC103489441 PE=3 SV=1)

HSP 1 Score: 548 bits (1412), Expect = 1.99e-190
Identity = 270/362 (74.59%), Postives = 300/362 (82.87%), Query Frame = 0

Query: 2   ILTCAFVL-----ALPSAAALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKFT 61
           +L CAF L     A+PSA ALR  PE LRQVTD  I          NN  +  GSERKF 
Sbjct: 7   LLACAFSLTLLISAIPSATALRHDPEFLRQVTDGEI---------FNN--LPAGSERKFA 66

Query: 62  MFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFL 121
           MFMEKYGKSYPTR+EYL RLGIF KNL+RAAEHQ LDPTAVHGVT FSDLSEEEFE MF+
Sbjct: 67  MFMEKYGKSYPTRKEYLHRLGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMFM 126

Query: 122 GVRS-AGGSEFERSNEAAE---EVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCGV 181
           GVR  AGG+     N+A E   E V+GLPE FDWREKGAVT VKMQG CGSCWAFSTCG 
Sbjct: 127 GVRGGAGGAGLPEMNQAVEVSAEEVKGLPERFDWREKGAVTGVKMQGTCGSCWAFSTCGA 186

Query: 182 VEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEEE 241
           VEGANFI+TGKLL+LSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEEE
Sbjct: 187 VEGANFIATGKLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEEE 246

Query: 242 SSYPYTGKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYIG 301
           SSYPYTG+  +C F+SDKIAV+VSNFTTIP+DENQIAAHLVR GPLAVGLNAVFMQTYIG
Sbjct: 247 SSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYIG 306

Query: 302 GVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRGH 354
           GVSCPLIC KRFVNHGVL+VGYG +G+SILR RK+PYWIIKNSWG+RWGE GYYRLCRGH
Sbjct: 307 GVSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEHGYYRLCRGH 357

BLAST of MC09g1382 vs. ExPASy TrEMBL
Match: A0A0A0KTI9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G622590 PE=3 SV=1)

HSP 1 Score: 545 bits (1405), Expect = 8.11e-189
Identity = 268/363 (73.83%), Postives = 299/363 (82.37%), Query Frame = 0

Query: 1   LILTCAFVLAL-----PSAAALRRAPELLRQVTDNNITDDDGEQYEINNALVGTGSERKF 60
           L+L CA  LAL     PSA ALRR PE LRQVTD  I          NN  +  GSERKF
Sbjct: 41  LLLACAISLALLISAIPSATALRRDPEFLRQVTDGEI---------FNN--LPAGSERKF 100

Query: 61  TMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMF 120
            MFMEKYGKSYPTR+EYL R GIF KNL+RAAEHQ LDPTAVHGVT FSDLSEEEFE MF
Sbjct: 101 VMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFERMF 160

Query: 121 LGVRS-AGGSEFERSNEAAE---EVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTCG 180
           +GVR  AGG      N+A E   E V+GLPE FDWR+KGAVT+VKMQG CGSCWAFSTCG
Sbjct: 161 MGVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAFSTCG 220

Query: 181 VVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLEE 240
            VEGANFI+TG LL+LSEQQL+DCDHTCD  ++  C++GCNGGLMTNAYKY++ +GGLEE
Sbjct: 221 AVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQSGGLEE 280

Query: 241 ESSYPYTGKPAECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQTYI 300
           ESSYPYTG+  +C F+SDKIAV+VSNFTTIP+DENQIAAHLVR GPLAVGLNAVFMQTYI
Sbjct: 281 ESSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAVFMQTYI 340

Query: 301 GGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLCRG 354
           GGVSCPLIC KRFVNHGVL+VGYG +G+SILR RK+PYW+IKNSWG+RWGE GYYRLCRG
Sbjct: 341 GGVSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWGEHGYYRLCRG 392

BLAST of MC09g1382 vs. TAIR 10
Match: AT3G54940.2 (Papain family cysteine protease )

HSP 1 Score: 475.7 bits (1223), Expect = 5.3e-134
Identity = 226/333 (67.87%), Postives = 262/333 (78.68%), Query Frame = 0

Query: 23  LLRQVTDNNITDDDGEQYEINNALVGTGSERKFTMFMEKYGKSYPTRREYLRRLGIFAKN 82
           ++  V D  I     +   I   L+GT +E KF +FM  YGK+Y TR EY+ RLGIFAKN
Sbjct: 19  VVASVEDLTIRQVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKN 78

Query: 83  LVRAAEHQVLDPTAVHGVTPFSDLSEEEFEEMFLGVRSAGGSEFERSNEAAEEV-VEGLP 142
           +++AAEHQ++DP+AVHGVT FSDL+EEEF+ M+ GV   GGS        A  V V+GLP
Sbjct: 79  VLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGAEAPMVEVDGLP 138

Query: 143 ESFDWREKGAVTDVKMQGQCGSCWAFSTCGVVEGANFISTGKLLSLSEQQLIDCDHTCDG 202
           E FDWREKG VT+VK QG CGSCWAFST G  EGA+F+STGKLLSLSEQQL+DCD  CD 
Sbjct: 139 EDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDP 198

Query: 203 MEEGVCDSGCNGGLMTNAYKYIMAAGGLEEESSYPYTGKPAECRFESDKIAVRVSNFTTI 262
            ++  CD+GC GGLMTNAY+Y+M AGGLEEE SYPYTGK   C+F+ +K+AVRV NFTTI
Sbjct: 199 KDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTI 258

Query: 263 PLDENQIAAHLVRRGPLAVGLNAVFMQTYIGGVSCPLICWKRFVNHGVLLVGYGADGYSI 322
           PLDENQIAA+LVR GPLAVGLNAVFMQTYIGGVSCPLIC KR VNHGVLLVGYG+ G+SI
Sbjct: 259 PLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSI 318

Query: 323 LRLRKVPYWIIKNSWGKRWGEQGYYRLCRGHGI 355
           LRL   PYWIIKNSWGK+WGE GYY+LCRGH I
Sbjct: 319 LRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDI 351

BLAST of MC09g1382 vs. TAIR 10
Match: AT4G39090.1 (Papain family cysteine protease )

HSP 1 Score: 375.2 bits (962), Expect = 9.7e-104
Identity = 184/342 (53.80%), Postives = 235/342 (68.71%), Query Frame = 0

Query: 35  DDGEQYEINNALVGT-----GSERKFTMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEH 94
           +DG+   I   + G       SE  F++F  K+GK Y +  E+  R  +F  NL RA  H
Sbjct: 26  NDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRH 85

Query: 95  QVLDPTAVHGVTPFSDLSEEEFEEMFLGVRSAGGSEFERSNEAAEEVVEGLPESFDWREK 154
           Q LDP+A HGVT FSDL+  EF +  LGVRS G    + +N+A     E LPE FDWR+ 
Sbjct: 86  QKLDPSATHGVTQFSDLTRSEFRKKHLGVRS-GFKLPKDANKAPILPTENLPEDFDWRDH 145

Query: 155 GAVTDVKMQGQCGSCWAFSTCGVVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDS 214
           GAVT VK QG CGSCW+FS  G +EGANF++TGKL+SLSEQQL+DCDH CD  E   CDS
Sbjct: 146 GAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDS 205

Query: 215 GCNGGLMTNAYKYIMAAGGLEEESSYPYTGKPAE-CRFESDKIAVRVSNFTTIPLDENQI 274
           GCNGGLM +A++Y +  GGL +E  YPYTGK  + C+ +  KI   VSNF+ I +DE QI
Sbjct: 206 GCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQI 265

Query: 275 AAHLVRRGPLAVGLNAVFMQTYIGGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVP 334
           AA+LV+ GPLAV +NA +MQTYIGGVSCP IC +R +NHGVLLVGYGA GY+  R ++ P
Sbjct: 266 AANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRR-LNHGVLLVGYGAAGYAPARFKEKP 325

Query: 335 YWIIKNSWGKRWGEQGYYRLCRGHGILADDVGDNTESAPMAS 371
           YWIIKNSWG+ WGE G+Y++C+G  I   D   +T +A +++
Sbjct: 326 YWIIKNSWGETWGENGFYKICKGRNICGVDSMVSTVAATVST 365

BLAST of MC09g1382 vs. TAIR 10
Match: AT2G21430.1 (Papain family cysteine protease )

HSP 1 Score: 373.2 bits (957), Expect = 3.7e-103
Identity = 174/310 (56.13%), Postives = 226/310 (72.90%), Query Frame = 0

Query: 51  SERKFTMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEE 110
           SE  FT+F +K+GK Y +  E+  R  +F  NL+RA  HQ +DP+A HGVT FSDL+  E
Sbjct: 44  SEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSE 103

Query: 111 FEEMFLGVRSAGGSEFER-SNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFST 170
           F    LGV+  GG +  + +N+A     + LPE FDWR++GAVT VK QG CGSCW+FST
Sbjct: 104 FRRKHLGVK--GGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFST 163

Query: 171 CGVVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGL 230
            G +EGA+F++TGKL+SLSEQQL+DCDH CD  EEG CDSGCNGGLM +A++Y +  GGL
Sbjct: 164 TGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGL 223

Query: 231 EEESSYPYTGKP-AECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQ 290
             E  YPYTG     C+ +  KI   VSNF+ + ++E+QIAA+L++ GPLAV +NA +MQ
Sbjct: 224 MREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQ 283

Query: 291 TYIGGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRL 350
           TYIGGVSCP IC +R +NHGVLLVGYG+ G+S  RL++ PYWIIKNSWG+ WGE G+Y++
Sbjct: 284 TYIGGVSCPYICSRR-LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKI 343

Query: 351 CRGHGILADD 359
           C+G  I   D
Sbjct: 344 CKGRNICGVD 350

BLAST of MC09g1382 vs. TAIR 10
Match: AT4G16190.1 (Papain family cysteine protease )

HSP 1 Score: 362.5 bits (929), Expect = 6.5e-100
Identity = 173/323 (53.56%), Postives = 227/323 (70.28%), Query Frame = 0

Query: 51  SERKFTMFMEKYGKSYPTRREYLRRLGIFAKNLVRAAEHQVLDPTAVHGVTPFSDLSEEE 110
           +E  FT+F  KY K+Y T+ E+  R  +F  NL RA  +Q+LDP+AVHGVT FSDL+ +E
Sbjct: 51  AEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKE 110

Query: 111 FEEMFLGVRSAGGSEFERSNEAAEEVVEGLPESFDWREKGAVTDVKMQGQCGSCWAFSTC 170
           F   FLG++  G      +  A       LP  FDWRE+GAVT VK QG CGSCW+FS  
Sbjct: 111 FRRKFLGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAI 170

Query: 171 GVVEGANFISTGKLLSLSEQQLIDCDHTCDGMEEGVCDSGCNGGLMTNAYKYIMAAGGLE 230
           G +EGA+F++T +L+SLSEQQL+DCDH CD  +   CDSGC+GGLM NA++Y + AGGL 
Sbjct: 171 GALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLM 230

Query: 231 EESSYPYTGKP-AECRFESDKIAVRVSNFTTIPLDENQIAAHLVRRGPLAVGLNAVFMQT 290
           +E  YPYTG+    C+F+  KI   VSNF+ +  DE+QIAA+LV+ GPLA+ +NA++MQT
Sbjct: 231 KEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQT 290

Query: 291 YIGGVSCPLICWKRFVNHGVLLVGYGADGYSILRLRKVPYWIIKNSWGKRWGEQGYYRLC 350
           YIGGVSCP +C K   +HGVLLVG+G+ GY+ +RL++ PYWIIKNSWG  WGE GYY++C
Sbjct: 291 YIGGVSCPYVCSKS-QDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKIC 350

Query: 351 RG-HGILADDVGDNTESAPMASP 372
           RG H +   D   +T +A   SP
Sbjct: 351 RGPHNMCGMDTMVSTVAAVHTSP 372

BLAST of MC09g1382 vs. TAIR 10
Match: AT3G10405.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: pollen development; LOCATED IN: chloroplast; Has 44 Blast hits to 44 proteins in 20 species: Archae - 0; Bacteria - 4; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 225.3 bits (573), Expect = 1.2e-58
Identity = 113/179 (63.13%), Postives = 136/179 (75.98%), Query Frame = 0

Query: 397 VAPSQISAPSVSSVSNPLKFKLKQPLRASSEGAPNELVEDSKFVPLNADDPRYGPPALLL 456
           V  S +   S S  +N    K K  +R S++  P  L EDSKFVPL+  DPR+GPP LLL
Sbjct: 33  VTSSLLWTKSKSHHTNTKLKKQKLCVRNSAQEIPKTLEEDSKFVPLDPQDPRFGPPVLLL 92

Query: 457 LGFELEEAMKIQELLKDLGGEFMQVILCTEDMIARSLWDAVHTSQPVLANVKIARSLPRI 516
           LG +L EA KIQELLK+L GEFM+++ CT+DMI RSLW+AV T QP L  VKIA SLPRI
Sbjct: 93  LGLQLHEAQKIQELLKELDGEFMEIVFCTDDMIKRSLWEAVTTKQPDLKRVKIAESLPRI 152

Query: 517 CFLSGLSGEEMMMFIDTFPETGLEPAVYAALVPNGANKPVGELIAEIMGDHEMMTGATS 576
           CFLSGL+GEEMMMFID FPETGLEP V+AA+VPN A+KP+ EL  EIMGDHE++TG++S
Sbjct: 153 CFLSGLTGEEMMMFIDAFPETGLEPVVFAAMVPNSADKPIFELTEEIMGDHELLTGSSS 211

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8VYS07.4e-13367.87Probable cysteine protease RD19D OS=Arabidopsis thaliana OX=3702 GN=RD19D PE=2 S... [more]
P258041.2e-10350.13Cysteine proteinase 15A OS=Pisum sativum OX=3888 PE=2 SV=1[more]
Q107162.1e-10351.27Cysteine proteinase 1 OS=Zea mays OX=4577 GN=CCP1 PE=2 SV=1[more]
P432961.4e-10253.80Cysteine protease RD19A OS=Arabidopsis thaliana OX=3702 GN=RD19A PE=1 SV=1[more]
P432955.2e-10256.13Probable cysteine protease RD19B OS=Arabidopsis thaliana OX=3702 GN=RD19B PE=2 S... [more]
Match NameE-valueIdentityDescription
KAG7031982.19.26e-19577.81putative cysteine protease RD19D, partial [Cucurbita argyrosperma subsp. argyros... [more]
XP_023535677.11.31e-19477.53probable cysteine protease RD19D [Cucurbita pepo subsp. pepo][more]
XP_022956816.12.64e-19476.97probable cysteine protease RD19D [Cucurbita moschata][more]
XP_038892721.11.62e-19376.58probable cysteine protease RD19D [Benincasa hispida][more]
XP_022984713.11.42e-19176.69probable cysteine protease RD19D [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1GXJ81.28e-19476.97probable cysteine protease RD19D OS=Cucurbita moschata OX=3662 GN=LOC111458402 P... [more]
A0A6J1JBC16.89e-19276.69probable cysteine protease RD19D OS=Cucurbita maxima OX=3661 GN=LOC111482910 PE=... [more]
A0A5D3CBQ61.99e-19074.59Cysteine proteinase 15A OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3BG121.99e-19074.59cysteine proteinase 15A OS=Cucumis melo OX=3656 GN=LOC103489441 PE=3 SV=1[more]
A0A0A0KTI98.11e-18973.83Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G622590 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT3G54940.25.3e-13467.87Papain family cysteine protease [more]
AT4G39090.19.7e-10453.80Papain family cysteine protease [more]
AT2G21430.13.7e-10356.13Papain family cysteine protease [more]
AT4G16190.16.5e-10053.56Papain family cysteine protease [more]
AT3G10405.11.2e-5863.13unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: pollen d... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 158..173
score: 62.97
coord: 307..317
score: 59.59
coord: 329..335
score: 75.14
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 140..368
e-value: 1.3E-93
score: 327.0
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 140..352
e-value: 3.7E-67
score: 226.6
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 55..111
e-value: 4.9E-17
score: 72.6
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 55..111
e-value: 8.1E-10
score: 39.0
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 27..357
e-value: 1.1E-92
score: 313.2
NoneNo IPR availablePANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 43..353
NoneNo IPR availablePANTHERPTHR12411:SF783CYSTEINE PROTEASE RD19C-RELATEDcoord: 43..353
IPR016621Uncharacterised conserved protein UCP014543PFAMPF12646DUF3783coord: 515..570
e-value: 4.9E-13
score: 48.8
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 305..315
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 158..169
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 329..348
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 141..351
e-value: 6.72817E-97
score: 292.22
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 52..352

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC09g1382.1MC09g1382.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005615 extracellular space
cellular_component GO:0005764 lysosome
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity