Cla97C02G033150 (gene) Watermelon (97103) v2.5

Overview
NameCla97C02G033150
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionCysteine protease
LocationCla97Chr02: 6507769 .. 6510176 (+)
RNA-Seq ExpressionCla97C02G033150
SyntenyCla97C02G033150
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGATTTCCCCCACTTTCTCCTTCCCCATCCTTGCTTTTCTTGCCCTCTTCCTCTGTTTTTCCCCATTTTCATCCGCTTCCCTTTCATCCACCTTCTCCATCATCGATGAAAATGCAAAACACCACCTGGGTATCCCCGAAATTGCCGATTCTGATGCCCAAGGATCCCCTCAACGGACCGACGCCGAGGTTGCGGCTCTGTACGAGTCCTGGTTGGTCCATCATGGAAAAGCCTACAACGCTCTCGGCGAGAAGGAGAGGCGGTTTGAGATTTTCAAGGATAATCTCAGGTTCATCGATGAACATAACCGGGAATCGCGGACGTATAAAGTTGGTTTGACCCGTTTCGCCGATCTCACCAATGAGGAGTATCGGGCAAAGTTTTTGGGTGGCCGAATCTCCCGGAAGCCTCGCCTATCTGCCGTCAAGAGCGGTAGATACGCGGCTACGCTCGGCGATGATCTTCCGGATCATGTCGATTGGAGAAAGAAGGGCGCTGTTGCGGCTGTTAAAGATCAAGGACAGTGTGGTGAGTTTTTTGTCCCTCTTTTTTTGGACCATTTTTTGTTGAATTGATCCGATGGGAATGATGCTTGGATTCTGTGTTTTCTTTTCGTTTTCTCATTTGGTCGTTTATCTTTTGACTCTTGTAAATTACAAGTCGTTGTCGATTTGCGTCTGTTTGGGTTGGTGCTGATTTTTCCAGGAAATTTGTTACGAGATCCATATTTATAGGAAACGTTTCTCACAAAATTAGATTCATTAAGATATTGGCCCATGCTCAACCCTGATTATAATTTGTATCTGTTATAATTAGCTTCGTTTAATCTCTGACGATTTGTTGATAATCACGATCACATCAGAAAACGCATTAGAAACTTTCCGATCAACACTTGTTTTAATCATGGATGATGATGATTAACGTAATGGGTATTGTTTTGAATGAAGGGAGCTGTTGGGCTTTCTCAACAGTTGCTGCAGTGGAAGGAATAAATCAAATCACCACCGGTGAATTGATCTCTCTGTCGGAGCAGGAACTTGTGGACTGTGACAAATCATATAACATGGGGTGCAATGGTGGTCTTATGGACTATGCTTTCCAGTTCATCATTGATAATGGTGGAATTGACACTGACGAAGATTACCCTTACAAGGGCCATGATGGTGCTTGTGATCCCAACCGGGTACGTCCCGATCAGCTTCATTTGATTCATTTATCTTACCATTTATCCTAATTTGACTTGGTTGCTAATGTTTTGTCAACAGAAAAATGCCAAGGTTGTAACCATAGATGGATATGAAGATGTTCCTGAGAATGATGAGAGCTCCTTGAAAAAGGCTGTGGCAAGTCAACCAGTTAGTGTTGCCATTGAAGCTGGTGGCAGAGCCTTCCAACTTTACCAATCGGTAATCTCTTCAACTTTGAACAATTTCCTGAATGTTGTTTCACCAAGCTAAGATATTGACATCTCTGACTTAGGAAATGATATCTGTTTTGAATGCTTATTCGAACATATTCTGTTGTGTGTTTATGCATCATTCAGGCGTGGGAATCGATATTTGAGTTTAATCTGTTGTGTGTGTGTGTTAAAGACAGACAATCTGTATCTATGCATCATTTAATCTGTTTCAAACGCTTATTCAAGATTAATTCGTTGTTTGTTTTAAAGTCAATTGGTGCTTATGCATCATTCAAGGTATTGTGAGTGTGTTTTTAAAGATAATCTGTGTTTATGAATCAATTCAGGGTGTCTTCACTGGCCGTTGCGGAACCAATCTGGATCACGGTGTTGTCGCTGTTGGATATGGTACAGACAATGGTACAGATTATTGGATTGTGAGGAATTCATGGGGTAAAAACTGGGGAGAGAGTGGTTATATCAGGTTGGAGAGAAATGTGGCCAATATTACCACAGGCAAGTGTGGTATAGCAGTAGAGCCTTCATACCCCATCAAAACCGGAATGAACCCTCCAAAACCTGCTGCTTCTCCACCATCTCCTGTGAAGCCCCCGACCGAATGTGATGACTACTTCTCATGCGAGGAAGGAACTACCTGCTGCTGTATCTATCAATATGGAAGCACCTGCTTTGGTTGGGGATGCTGCCCGCTTGAATCCGCAACCTGCTGCGATGACCATTACAGTTGCTGCCCCCATGAATATCCAATCTGTGACCTCGAGGCAGGAACTTGCCTAATGGTAAGTCACATCACATAGTTTTCTAGTTCCTTGTCCTGTTAACTCATGATACTCAATCAAGTTCATGTTTGATATACTCTAATCATTGTAAATGGATGAAACACAGAGCAAAGGCAGCACGATGGGAGTCAAACTGCTGAAGCGCCTTCCTGCGAAACGTACAAGAGGCATTGAGAAGCTTGGGAAGTTGTTTGTTGGTGCTTGA

mRNA sequence

ATGGCGATTTCCCCCACTTTCTCCTTCCCCATCCTTGCTTTTCTTGCCCTCTTCCTCTGTTTTTCCCCATTTTCATCCGCTTCCCTTTCATCCACCTTCTCCATCATCGATGAAAATGCAAAACACCACCTGGGTATCCCCGAAATTGCCGATTCTGATGCCCAAGGATCCCCTCAACGGACCGACGCCGAGGTTGCGGCTCTGTACGAGTCCTGGTTGGTCCATCATGGAAAAGCCTACAACGCTCTCGGCGAGAAGGAGAGGCGGTTTGAGATTTTCAAGGATAATCTCAGGTTCATCGATGAACATAACCGGGAATCGCGGACGTATAAAGTTGGTTTGACCCGTTTCGCCGATCTCACCAATGAGGAGTATCGGGCAAAGTTTTTGGGTGGCCGAATCTCCCGGAAGCCTCGCCTATCTGCCGTCAAGAGCGGTAGATACGCGGCTACGCTCGGCGATGATCTTCCGGATCATGTCGATTGGAGAAAGAAGGGCGCTGTTGCGGCTGTTAAAGATCAAGGACAGTGTGGGAGCTGTTGGGCTTTCTCAACAGTTGCTGCAGTGGAAGGAATAAATCAAATCACCACCGGTGAATTGATCTCTCTGTCGGAGCAGGAACTTGTGGACTGTGACAAATCATATAACATGGGGTGCAATGGTGGTCTTATGGACTATGCTTTCCAGTTCATCATTGATAATGGTGGAATTGACACTGACGAAGATTACCCTTACAAGGGCCATGATGGTGCTTGTGATCCCAACCGGAAAAATGCCAAGGTTGTAACCATAGATGGATATGAAGATGTTCCTGAGAATGATGAGAGCTCCTTGAAAAAGGCTGTGGCAAGTCAACCAGTTAGTGTTGCCATTGAAGCTGGTGGCAGAGCCTTCCAACTTTACCAATCGGGTGTCTTCACTGGCCGTTGCGGAACCAATCTGGATCACGGTGTTGTCGCTGTTGGATATGGTACAGACAATGGTACAGATTATTGGATTGTGAGGAATTCATGGGGTAAAAACTGGGGAGAGAGTGGTTATATCAGGTTGGAGAGAAATGTGGCCAATATTACCACAGGCAAGTGTGGTATAGCAGTAGAGCCTTCATACCCCATCAAAACCGGAATGAACCCTCCAAAACCTGCTGCTTCTCCACCATCTCCTGTGAAGCCCCCGACCGAATGTGATGACTACTTCTCATGCGAGGAAGGAACTACCTGCTGCTGTATCTATCAATATGGAAGCACCTGCTTTGGTTGGGGATGCTGCCCGCTTGAATCCGCAACCTGCTGCGATGACCATTACAGTTGCTGCCCCCATGAATATCCAATCTGTGACCTCGAGGCAGGAACTTGCCTAATGAGCAAAGGCAGCACGATGGGAGTCAAACTGCTGAAGCGCCTTCCTGCGAAACGTACAAGAGGCATTGAGAAGCTTGGGAAGTTGTTTGTTGGTGCTTGA

Coding sequence (CDS)

ATGGCGATTTCCCCCACTTTCTCCTTCCCCATCCTTGCTTTTCTTGCCCTCTTCCTCTGTTTTTCCCCATTTTCATCCGCTTCCCTTTCATCCACCTTCTCCATCATCGATGAAAATGCAAAACACCACCTGGGTATCCCCGAAATTGCCGATTCTGATGCCCAAGGATCCCCTCAACGGACCGACGCCGAGGTTGCGGCTCTGTACGAGTCCTGGTTGGTCCATCATGGAAAAGCCTACAACGCTCTCGGCGAGAAGGAGAGGCGGTTTGAGATTTTCAAGGATAATCTCAGGTTCATCGATGAACATAACCGGGAATCGCGGACGTATAAAGTTGGTTTGACCCGTTTCGCCGATCTCACCAATGAGGAGTATCGGGCAAAGTTTTTGGGTGGCCGAATCTCCCGGAAGCCTCGCCTATCTGCCGTCAAGAGCGGTAGATACGCGGCTACGCTCGGCGATGATCTTCCGGATCATGTCGATTGGAGAAAGAAGGGCGCTGTTGCGGCTGTTAAAGATCAAGGACAGTGTGGGAGCTGTTGGGCTTTCTCAACAGTTGCTGCAGTGGAAGGAATAAATCAAATCACCACCGGTGAATTGATCTCTCTGTCGGAGCAGGAACTTGTGGACTGTGACAAATCATATAACATGGGGTGCAATGGTGGTCTTATGGACTATGCTTTCCAGTTCATCATTGATAATGGTGGAATTGACACTGACGAAGATTACCCTTACAAGGGCCATGATGGTGCTTGTGATCCCAACCGGAAAAATGCCAAGGTTGTAACCATAGATGGATATGAAGATGTTCCTGAGAATGATGAGAGCTCCTTGAAAAAGGCTGTGGCAAGTCAACCAGTTAGTGTTGCCATTGAAGCTGGTGGCAGAGCCTTCCAACTTTACCAATCGGGTGTCTTCACTGGCCGTTGCGGAACCAATCTGGATCACGGTGTTGTCGCTGTTGGATATGGTACAGACAATGGTACAGATTATTGGATTGTGAGGAATTCATGGGGTAAAAACTGGGGAGAGAGTGGTTATATCAGGTTGGAGAGAAATGTGGCCAATATTACCACAGGCAAGTGTGGTATAGCAGTAGAGCCTTCATACCCCATCAAAACCGGAATGAACCCTCCAAAACCTGCTGCTTCTCCACCATCTCCTGTGAAGCCCCCGACCGAATGTGATGACTACTTCTCATGCGAGGAAGGAACTACCTGCTGCTGTATCTATCAATATGGAAGCACCTGCTTTGGTTGGGGATGCTGCCCGCTTGAATCCGCAACCTGCTGCGATGACCATTACAGTTGCTGCCCCCATGAATATCCAATCTGTGACCTCGAGGCAGGAACTTGCCTAATGAGCAAAGGCAGCACGATGGGAGTCAAACTGCTGAAGCGCCTTCCTGCGAAACGTACAAGAGGCATTGAGAAGCTTGGGAAGTTGTTTGTTGGTGCTTGA

Protein sequence

MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQRTDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLTNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTDEDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTGKCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLGKLFVGA
Homology
BLAST of Cla97C02G033150 vs. NCBI nr
Match: XP_038889032.1 (low-temperature-induced cysteine proteinase-like [Benincasa hispida])

HSP 1 Score: 939.5 bits (2427), Expect = 1.2e-269
Identity = 453/486 (93.21%), Postives = 466/486 (95.88%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAIS TFSF  LAFLALFLCFSPFSSAS SSTFSIIDENAKHH+GIP+IADSD   SPQR
Sbjct: 1   MAISSTFSFLTLAFLALFLCFSPFSSASHSSTFSIIDENAKHHMGIPDIADSDHLRSPQR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           TD EVAALYESWLVHH KAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL
Sbjct: 61  TDDEVAALYESWLVHHRKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR   KPRLSA KSGRYAA LGDDLPDHVDWRKKGAVAAVKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFPGKPRLSAAKSGRYAAALGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFSTVAAVEGINQI TGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDT+
Sbjct: 181 WAFSTVAAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTE 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKGHD ACDPNRKNAKVVTIDG+EDVPENDESSLKKAVA+QPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGHDAACDPNRKNAKVVTIDGFEDVPENDESSLKKAVANQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGTNLDHGVVAVGYGT+NGTDYWIVRNSWGKNWGESGYIRLERNVANITTG
Sbjct: 301 YQSGVFTGRCGTNLDHGVVAVGYGTENGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIAV+PSYPIKTG NPPKPAASPPSPVKPPTECDDY+SCEEGTTCCCIYQYGSTCFGW
Sbjct: 361 KCGIAVQPSYPIKTGSNPPKPAASPPSPVKPPTECDDYYSCEEGTTCCCIYQYGSTCFGW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDDHYSCCPHEYPICDLEAGTCL+ K STMGVKLLKRLPAKRT+GI+K G
Sbjct: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLIDKDSTMGVKLLKRLPAKRTKGIQKFG 480

Query: 481 KLFVGA 487
           +LFVGA
Sbjct: 481 ELFVGA 486

BLAST of Cla97C02G033150 vs. NCBI nr
Match: XP_008454976.1 (PREDICTED: low-temperature-induced cysteine proteinase-like [Cucumis melo])

HSP 1 Score: 892.9 bits (2306), Expect = 1.2e-255
Identity = 436/486 (89.71%), Postives = 450/486 (92.59%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAIS    F   AFLALFLC SPFSSAS SSTFSIIDENAKHHLGIPEI  SDA    QR
Sbjct: 1   MAISSPIFF---AFLALFLCLSPFSSASHSSTFSIIDENAKHHLGIPEIPHSDAH---QR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           TD EVAALYESWLVHHGKAYNALGEKERRFEIFKDNL FIDEHNRESRTYKVGLTRFADL
Sbjct: 61  TDEEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLMFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR SRKP LSA KSGRYAA LGDDLPD VDWRKKGAVA VKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFSRKPSLSAAKSGRYAAALGDDLPDDVDWRKKGAVANVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFSTVAAVEGINQI TGELISLSEQELVDCDKS+NMGCNGGLMDYAFQFIIDNGGIDTD
Sbjct: 181 WAFSTVAAVEGINQIVTGELISLSEQELVDCDKSFNMGCNGGLMDYAFQFIIDNGGIDTD 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG DGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVA+QPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGK+WGE+GYIRLERNVAN TTG
Sbjct: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGENGYIRLERNVANSTTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIAVEPSYPIK+G NPPKP+ASPPSPV PPTECD+YFSC+EG+TCCCIYQYGSTCF W
Sbjct: 361 KCGIAVEPSYPIKSGSNPPKPSASPPSPVNPPTECDEYFSCDEGSTCCCIYQYGSTCFAW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDDHYSCCPHEYP+CDLEAGTC  SK S MGV LLKRLPA +T+ I+KLG
Sbjct: 421 GCCPLESATCCDDHYSCCPHEYPVCDLEAGTCRASKDSLMGVNLLKRLPANQTKRIQKLG 480

Query: 481 KLFVGA 487
           KLFVGA
Sbjct: 481 KLFVGA 480

BLAST of Cla97C02G033150 vs. NCBI nr
Match: XP_004136967.1 (low-temperature-induced cysteine proteinase [Cucumis sativus] >KGN43904.1 hypothetical protein Csa_017340 [Cucumis sativus])

HSP 1 Score: 887.5 bits (2292), Expect = 5.2e-254
Identity = 430/486 (88.48%), Postives = 451/486 (92.80%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAISP F     AFLALF C SPFSSAS SSTFSIIDENAKHHLGIPEI  SDA    QR
Sbjct: 1   MAISPIF----FAFLALFFCLSPFSSASHSSTFSIIDENAKHHLGIPEIPHSDAH---QR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
            D EVAALYESWLVHHGKAYNA+GEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL
Sbjct: 61  PDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR SRKPRLSA KSGRYAA LGDDLPD VDWRKKGAVA VKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFS+VAAVEGINQI TGELI LSEQELVDCDKS+NMGCNGGLMDYAFQFII NGGIDT+
Sbjct: 181 WAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTE 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG D ACDPNRKNAKVVTIDGYEDVPENDESSLKKAVA+QPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGT+LDHGVVAVGYGTDNGTDYWIVRNSWGK+WGESGYIRLERNVANITTG
Sbjct: 301 YQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANITTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIAV+PSYP K+G NPPKP+ASPPSPVKPPTECD+YFSCEEG+TCCCIYQ+GSTCF W
Sbjct: 361 KCGIAVQPSYPTKSGANPPKPSASPPSPVKPPTECDEYFSCEEGSTCCCIYQFGSTCFAW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDDHYSCCPHEYP+CDLEAGTC +SK S+MGV LLKRLPA +T+ ++KLG
Sbjct: 421 GCCPLESATCCDDHYSCCPHEYPVCDLEAGTCRVSKDSSMGVNLLKRLPAIQTKKVQKLG 479

Query: 481 KLFVGA 487
           KLFVGA
Sbjct: 481 KLFVGA 479

BLAST of Cla97C02G033150 vs. NCBI nr
Match: KAA0031322.1 (low-temperature-induced cysteine proteinase-like [Cucumis melo var. makuwa] >TYK06773.1 low-temperature-induced cysteine proteinase-like [Cucumis melo var. makuwa])

HSP 1 Score: 881.7 bits (2277), Expect = 2.9e-252
Identity = 435/494 (88.06%), Postives = 449/494 (90.89%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAIS    F   AFLALFLC SPFSSAS SSTFSIIDENAKHHLGIPEI  SDA    QR
Sbjct: 1   MAISSPIFF---AFLALFLCLSPFSSASHSSTFSIIDENAKHHLGIPEIPHSDAH---QR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           TD EVAALYESWLVHHGKAYNALGEKERRFEIFKDNL FIDEHNRESRTYKVGLTRFADL
Sbjct: 61  TDEEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLMFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQ---- 180
           TNEEYRA+FLGGR SRKP LSA KSGRYAA LGDDLPD VDWRKKGAVA VKDQGQ    
Sbjct: 121 TNEEYRARFLGGRFSRKPSLSAAKSGRYAAALGDDLPDDVDWRKKGAVANVKDQGQKFVM 180

Query: 181 ----CGSCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFII 240
                GSCWAFSTVAAVEGINQI TGELISLSEQELVDCDKS+NMGCNGGLMDYAFQFII
Sbjct: 181 RSIFIGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDKSFNMGCNGGLMDYAFQFII 240

Query: 241 DNGGIDTDEDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIE 300
           DNGGIDTDEDYPYKG DGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVA+QPVSVAIE
Sbjct: 241 DNGGIDTDEDYPYKGRDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIE 300

Query: 301 AGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLER 360
           AGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGK+WGE+GYIRLER
Sbjct: 301 AGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGENGYIRLER 360

Query: 361 NVANITTGKCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQ 420
           NVAN TTGKCGIAVEPSYPIK+G NPPKP+ASPPSPV PPTECD+YFSC+EG+TCCCIYQ
Sbjct: 361 NVANSTTGKCGIAVEPSYPIKSGSNPPKPSASPPSPVNPPTECDEYFSCDEGSTCCCIYQ 420

Query: 421 YGSTCFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKR 480
           YGSTCF WGCCPLESATCCDDHYSCCPHEYP+CDLEAGTC  SK S MGV LLKRLPA +
Sbjct: 421 YGSTCFAWGCCPLESATCCDDHYSCCPHEYPVCDLEAGTCRASKDSLMGVNLLKRLPANQ 480

Query: 481 TRGIEKLGKLFVGA 487
           T+ I+KLGKLFVGA
Sbjct: 481 TKRIQKLGKLFVGA 488

BLAST of Cla97C02G033150 vs. NCBI nr
Match: XP_022147795.1 (low-temperature-induced cysteine proteinase-like [Momordica charantia])

HSP 1 Score: 862.4 bits (2227), Expect = 1.8e-246
Identity = 410/486 (84.36%), Postives = 442/486 (90.95%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAI  +FSFPILA LALFLCF  FSSAS SS+FSIIDENAKHHLG P+IA SDA   P R
Sbjct: 1   MAIFSSFSFPILASLALFLCFLSFSSASDSSSFSIIDENAKHHLGFPDIAGSDAGTPPLR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           T  +VAALY+SWLV HGKAYNALGE+ERRFEIFKDNLRFIDEHNRE R+Y +GLTRFADL
Sbjct: 61  TQEQVAALYKSWLVKHGKAYNALGERERRFEIFKDNLRFIDEHNREPRSYTLGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR S KPR SA K+GRYA++LG DLP+HVDWR+KGAV AVKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFSPKPRPSAAKNGRYASSLGGDLPEHVDWREKGAVTAVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFST+AAVEGINQI TGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFI+DNGGIDT+
Sbjct: 181 WAFSTIAAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIVDNGGIDTE 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG DG CDPNR+NA VVTIDGYEDVPENDE +LKKAVASQPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDGVCDPNRRNANVVTIDGYEDVPENDEGALKKAVASQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGT+LDHGVVAVGYGT+NG DYWIVRNSWGKNWGE+GYIRLERNVANITTG
Sbjct: 301 YQSGVFTGRCGTDLDHGVVAVGYGTENGLDYWIVRNSWGKNWGENGYIRLERNVANITTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIA+EPSYP+KTG NPPKPA SPPSPVKPPTECDDY+SC EGTTCCCIYQYGSTCFGW
Sbjct: 361 KCGIAIEPSYPVKTGKNPPKPAPSPPSPVKPPTECDDYYSCPEGTTCCCIYQYGSTCFGW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDD YSCCP EYP+CDL  GTC MSKGS +GV LLKRLPAK    ++KLG
Sbjct: 421 GCCPLESATCCDDQYSCCPREYPVCDLAEGTCRMSKGSLIGVSLLKRLPAKHKGVVQKLG 480

Query: 481 KLFVGA 487
           K+ +G+
Sbjct: 481 KMIIGS 486

BLAST of Cla97C02G033150 vs. ExPASy Swiss-Prot
Match: P43297 (Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1)

HSP 1 Score: 644.8 bits (1662), Expect = 7.7e-184
Identity = 308/470 (65.53%), Postives = 368/470 (78.30%), Query Frame = 0

Query: 3   ISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQRTD 62
           + PT +   LA +A+          S +   SII  + KH +            +  R++
Sbjct: 4   LKPTMAILFLAMVAV----------SSAVDMSIISYDEKHGVST----------TGGRSE 63

Query: 63  AEVAALYESWLVHHGKA--YNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 122
           AEV ++YE+WLV HGKA   N+L EK+RRFEIFKDNLRF+DEHN ++ +Y++GLTRFADL
Sbjct: 64  AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADL 123

Query: 123 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 182
           TN+EYR+K+LG ++ +K       S RY A +GD+LP+ +DWRKKGAVA VKDQG CGSC
Sbjct: 124 TNDEYRSKYLGAKMEKKGERRT--SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 183

Query: 183 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 242
           WAFST+ AVEGINQI TG+LI+LSEQELVDCD SYN GCNGGLMDYAF+FII NGGIDTD
Sbjct: 184 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 243

Query: 243 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 302
           +DYPYKG DG CD  RKNAKVVTID YEDVP   E SLKKAVA QP+S+AIEAGGRAFQL
Sbjct: 244 KDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQL 303

Query: 303 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 362
           Y SG+F G CGT LDHGVVAVGYGT+NG DYWIVRNSWGK+WGESGY+R+ RN+A+ ++G
Sbjct: 304 YDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIAS-SSG 363

Query: 363 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 422
           KCGIA+EPSYPIK G NPP P  SPPSP+KPPT+CD Y++C E  TCCC+++YG  CF W
Sbjct: 364 KCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFAW 423

Query: 423 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPA 471
           GCCPLE+ATCCDD+YSCCPHEYP+CDL+ GTCL+SK S   VK LKR PA
Sbjct: 424 GCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCLLSKNSPFSVKALKRKPA 450

BLAST of Cla97C02G033150 vs. ExPASy Swiss-Prot
Match: Q9FMH8 (Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 SV=1)

HSP 1 Score: 636.3 bits (1640), Expect = 2.8e-181
Identity = 312/466 (66.95%), Postives = 354/466 (75.97%), Query Frame = 0

Query: 10  PILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQRTDAEVAALY 69
           P++  LA+          S +   SII  +  HH+               R+D+EV  +Y
Sbjct: 8   PMILLLAMI-------GVSYAMDMSIISYDENHHI----------TTETSRSDSEVERIY 67

Query: 70  ESWLVHHGKA---YNALG-EKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLTNEEY 129
           E+W+V HGK     N LG EK++RFEIFKDNLRFIDEHN ++ +YK+GLTRFADLTNEEY
Sbjct: 68  EAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEY 127

Query: 130 RAKFLGGRISRKPRLSAVK-SGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSCWAFS 189
           R+ +LG     KP    +K S RY A +GD LPD VDWRK+GAVA VKDQG CGSCWAFS
Sbjct: 128 RSMYLGA----KPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFS 187

Query: 190 TVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTDEDYP 249
           T+ AVEGIN+I TG+LISLSEQELVDCD SYN GCNGGLMDYAF+FII NGGIDT+ DYP
Sbjct: 188 TIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYP 247

Query: 250 YKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQLYQSG 309
           YK  DG CD NRKNAKVVTID YEDVPEN E+SLKKA+A QP+SVAIEAGGRAFQLY SG
Sbjct: 248 YKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSG 307

Query: 310 VFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTGKCGI 369
           VF G CGT LDHGVVAVGYGT+NG DYWIVRNSWG  WGESGYI++ RN+    TGKCGI
Sbjct: 308 VFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNI-EAPTGKCGI 367

Query: 370 AVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGWGCCP 429
           A+E SYPIK G NPP P  SPPSP+KPPT CD YFSC E  TCCC+Y+YG  CFGWGCCP
Sbjct: 368 AMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYCFGWGCCP 427

Query: 430 LESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPA 471
           LE+ATCCDD+ SCCPHEYP+CD+  GTCLMSK S   VK LKR PA
Sbjct: 428 LEAATCCDDNSSCCPHEYPVCDVNRGTCLMSKNSPFSVKALKRTPA 451

BLAST of Cla97C02G033150 vs. ExPASy Swiss-Prot
Match: P25776 (Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=1 SV=2)

HSP 1 Score: 619.4 bits (1596), Expect = 3.5e-176
Identity = 289/426 (67.84%), Postives = 338/426 (79.34%), Query Frame = 0

Query: 50  ADSDAQGSPQRTDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRES-- 109
           AD       +R++ E   LY  W   HGK+YNA+GE+ERR+  F+DNLR+IDEHN  +  
Sbjct: 21  ADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADA 80

Query: 110 --RTYKVGLTRFADLTNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKK 169
              ++++GL RFADLTNEEYR  +LG  +  KPR     S RY A   + LP+ VDWR K
Sbjct: 81  GVHSFRLGLNRFADLTNEEYRDTYLG--LRNKPRRERKVSDRYLAADNEALPESVDWRTK 140

Query: 170 GAVAAVKDQGQCGSCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMD 229
           GAVA +KDQG CGSCWAFS +AAVEGINQI TG+LISLSEQELVDCD SYN GCNGGLMD
Sbjct: 141 GAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 200

Query: 230 YAFQFIIDNGGIDTDEDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQ 289
           YAF FII+NGGIDT++DYPYKG D  CD NRKNAKVVTID YEDV  N E+SL+KAVA+Q
Sbjct: 201 YAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQ 260

Query: 290 PVSVAIEAGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGES 349
           PVSVAIEAGGRAFQLY SG+FTG+CGT LDHGV AVGYGT+NG DYWIVRNSWGK+WGES
Sbjct: 261 PVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGES 320

Query: 350 GYIRLERNVANITTGKCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGT 409
           GY+R+ERN+   ++GKCGIAVEPSYP+K G NPP P  +PPSP  PPT CD+Y++C + T
Sbjct: 321 GYVRMERNI-KASSGKCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDST 380

Query: 410 TCCCIYQYGSTCFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLL 469
           TCCCIY+YG  C+ WGCCPLE ATCCDDHYSCCPHEYPIC+++ GTCLM+K S + VK L
Sbjct: 381 TCCCIYEYGKYCYAWGCCPLEGATCCDDHYSCCPHEYPICNVQQGTCLMAKDSPLAVKAL 440

Query: 470 KRLPAK 472
           KR  AK
Sbjct: 441 KRTLAK 443

BLAST of Cla97C02G033150 vs. ExPASy Swiss-Prot
Match: Q9LT78 (Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 SV=1)

HSP 1 Score: 557.0 bits (1434), Expect = 2.1e-157
Identity = 272/417 (65.23%), Postives = 320/417 (76.74%), Query Frame = 0

Query: 60  RTDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNR-ESRTYKVGLTRFA 119
           R +AE   +YE WLV + K YN LGEKERRFEIFKDNL+F++EH+   +RTY+VGLTRFA
Sbjct: 34  RNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFA 93

Query: 120 DLTNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCG 179
           DLTN+E+RA +L  ++ R      VK  +Y   +GD LPD +DWR KGAV  VKDQG CG
Sbjct: 94  DLTNDEFRAIYLRSKMER--TRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCG 153

Query: 180 SCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGID 239
           SCWAFS + AVEGINQI TGELISLSEQELVDCD SYN GC GGLMDYAF+FII+NGGID
Sbjct: 154 SCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGID 213

Query: 240 TDEDYPYKGHD-GACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRA 299
           T+EDYPY   D   C+ ++KN +VVTIDGYEDVP+NDE SLKKA+A+QP+SVAIEAGGRA
Sbjct: 214 TEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRA 273

Query: 300 FQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANI 359
           FQLY SGVFTG CGT+LDHGVVAVGYG++ G DYWIVRNSWG NWGESGY +LERN+   
Sbjct: 274 FQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKE- 333

Query: 360 TTGKCGIAVEPSYPIK-TGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGST 419
           ++GKCG+A+  SYP K +G NPPKP A  PSPV     CD   +C   +TCCC+Y+Y   
Sbjct: 334 SSGKCGVAMMASYPTKSSGSNPPKPPA--PSPV----VCDKSNTCPAKSTCCCLYEYNGK 393

Query: 420 CFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRT 474
           C+ WGCCP ESATCCDD  SCCP  YP+CDL+A TC M   S + +K L R PA  T
Sbjct: 394 CYSWGCCPYESATCCDDGSSCCPQSYPVCDLKANTCRMKGNSPLSIKALTRGPAIAT 441

BLAST of Cla97C02G033150 vs. ExPASy Swiss-Prot
Match: P25777 (Oryzain beta chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0670200 PE=1 SV=2)

HSP 1 Score: 546.6 bits (1407), Expect = 2.9e-154
Identity = 280/472 (59.32%), Postives = 340/472 (72.03%), Query Frame = 0

Query: 13  AFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQ-RTDAEVAALYES 72
           A  A FL      +A+ +   SII  NA+H           A+G  +  T+AE  A Y+ 
Sbjct: 5   AAAAAFLLLLIVGAATAAPDMSIISYNAEH----------GARGLEEGPTEAEARAAYDL 64

Query: 73  WLVHH-GKAYNAL-GEKERRFEIFKDNLRFIDEHNR---ESRTYKVGLTRFADLTNEEYR 132
           WL  + G + NAL GE ERRF +F DNL+F+D HN    E   +++G+ RFADLTNEE+R
Sbjct: 65  WLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFR 124

Query: 133 AKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSCWAFSTV 192
           A FLG +++ + R +     RY     ++LP+ VDWR+KGAVA VK+QGQCGSCWAFS V
Sbjct: 125 ATFLGAKVAERSRAA---GERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAV 184

Query: 193 AAVEGINQITTGELISLSEQELVDCD-KSYNMGCNGGLMDYAFQFIIDNGGIDTDEDYPY 252
           + VE INQ+ TGE+I+LSEQELV+C     N GCNGGLMD AF FII NGGIDT++DYPY
Sbjct: 185 STVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPY 244

Query: 253 KGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQLYQSGV 312
           K  DG CD NR+NAKVV+IDG+EDVP+NDE SL+KAVA QPVSVAIEAGGR FQLY SGV
Sbjct: 245 KAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGV 304

Query: 313 FTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTGKCGIA 372
           F+GRCGT+LDHGVVAVGYGTDNG DYWIVRNSWG  WGESGY+R+ERN+ N+TTGKCGIA
Sbjct: 305 FSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNI-NVTTGKCGIA 364

Query: 373 VEPSYPIKTGMNPPKPAASPPSPVKPPTE------CDDYFSCEEGTTCCCIYQYGSTCFG 432
           +  SYP K+G NPPKP+ +PP+P  PP        CDD FSC  G+TCCC + + + C  
Sbjct: 365 MMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPAGSTCCCAFGFRNLCLV 424

Query: 433 WGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAK 472
           WGCCP+E ATCC DH SCCP +YP+C+  AGTC  SK S + VK LKR  AK
Sbjct: 425 WGCCPVEGATCCKDHASCCPPDYPVCNTRAGTCSASKNSPLSVKALKRTLAK 462

BLAST of Cla97C02G033150 vs. ExPASy TrEMBL
Match: A0A1S3BZW8 (low-temperature-induced cysteine proteinase-like OS=Cucumis melo OX=3656 GN=LOC103495259 PE=3 SV=1)

HSP 1 Score: 892.9 bits (2306), Expect = 6.0e-256
Identity = 436/486 (89.71%), Postives = 450/486 (92.59%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAIS    F   AFLALFLC SPFSSAS SSTFSIIDENAKHHLGIPEI  SDA    QR
Sbjct: 1   MAISSPIFF---AFLALFLCLSPFSSASHSSTFSIIDENAKHHLGIPEIPHSDAH---QR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           TD EVAALYESWLVHHGKAYNALGEKERRFEIFKDNL FIDEHNRESRTYKVGLTRFADL
Sbjct: 61  TDEEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLMFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR SRKP LSA KSGRYAA LGDDLPD VDWRKKGAVA VKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFSRKPSLSAAKSGRYAAALGDDLPDDVDWRKKGAVANVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFSTVAAVEGINQI TGELISLSEQELVDCDKS+NMGCNGGLMDYAFQFIIDNGGIDTD
Sbjct: 181 WAFSTVAAVEGINQIVTGELISLSEQELVDCDKSFNMGCNGGLMDYAFQFIIDNGGIDTD 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG DGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVA+QPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGK+WGE+GYIRLERNVAN TTG
Sbjct: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGENGYIRLERNVANSTTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIAVEPSYPIK+G NPPKP+ASPPSPV PPTECD+YFSC+EG+TCCCIYQYGSTCF W
Sbjct: 361 KCGIAVEPSYPIKSGSNPPKPSASPPSPVNPPTECDEYFSCDEGSTCCCIYQYGSTCFAW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDDHYSCCPHEYP+CDLEAGTC  SK S MGV LLKRLPA +T+ I+KLG
Sbjct: 421 GCCPLESATCCDDHYSCCPHEYPVCDLEAGTCRASKDSLMGVNLLKRLPANQTKRIQKLG 480

Query: 481 KLFVGA 487
           KLFVGA
Sbjct: 481 KLFVGA 480

BLAST of Cla97C02G033150 vs. ExPASy TrEMBL
Match: A0A0A0K4N3 (Cysteine protease OS=Cucumis sativus OX=3659 GN=Csa_7G073400 PE=3 SV=1)

HSP 1 Score: 887.5 bits (2292), Expect = 2.5e-254
Identity = 430/486 (88.48%), Postives = 451/486 (92.80%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAISP F     AFLALF C SPFSSAS SSTFSIIDENAKHHLGIPEI  SDA    QR
Sbjct: 1   MAISPIF----FAFLALFFCLSPFSSASHSSTFSIIDENAKHHLGIPEIPHSDAH---QR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
            D EVAALYESWLVHHGKAYNA+GEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL
Sbjct: 61  PDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR SRKPRLSA KSGRYAA LGDDLPD VDWRKKGAVA VKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFS+VAAVEGINQI TGELI LSEQELVDCDKS+NMGCNGGLMDYAFQFII NGGIDT+
Sbjct: 181 WAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTE 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG D ACDPNRKNAKVVTIDGYEDVPENDESSLKKAVA+QPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGT+LDHGVVAVGYGTDNGTDYWIVRNSWGK+WGESGYIRLERNVANITTG
Sbjct: 301 YQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGESGYIRLERNVANITTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIAV+PSYP K+G NPPKP+ASPPSPVKPPTECD+YFSCEEG+TCCCIYQ+GSTCF W
Sbjct: 361 KCGIAVQPSYPTKSGANPPKPSASPPSPVKPPTECDEYFSCEEGSTCCCIYQFGSTCFAW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDDHYSCCPHEYP+CDLEAGTC +SK S+MGV LLKRLPA +T+ ++KLG
Sbjct: 421 GCCPLESATCCDDHYSCCPHEYPVCDLEAGTCRVSKDSSMGVNLLKRLPAIQTKKVQKLG 479

Query: 481 KLFVGA 487
           KLFVGA
Sbjct: 481 KLFVGA 479

BLAST of Cla97C02G033150 vs. ExPASy TrEMBL
Match: A0A5A7SJU5 (Low-temperature-induced cysteine proteinase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G00750 PE=3 SV=1)

HSP 1 Score: 881.7 bits (2277), Expect = 1.4e-252
Identity = 435/494 (88.06%), Postives = 449/494 (90.89%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAIS    F   AFLALFLC SPFSSAS SSTFSIIDENAKHHLGIPEI  SDA    QR
Sbjct: 1   MAISSPIFF---AFLALFLCLSPFSSASHSSTFSIIDENAKHHLGIPEIPHSDAH---QR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           TD EVAALYESWLVHHGKAYNALGEKERRFEIFKDNL FIDEHNRESRTYKVGLTRFADL
Sbjct: 61  TDEEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLMFIDEHNRESRTYKVGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQ---- 180
           TNEEYRA+FLGGR SRKP LSA KSGRYAA LGDDLPD VDWRKKGAVA VKDQGQ    
Sbjct: 121 TNEEYRARFLGGRFSRKPSLSAAKSGRYAAALGDDLPDDVDWRKKGAVANVKDQGQKFVM 180

Query: 181 ----CGSCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFII 240
                GSCWAFSTVAAVEGINQI TGELISLSEQELVDCDKS+NMGCNGGLMDYAFQFII
Sbjct: 181 RSIFIGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDKSFNMGCNGGLMDYAFQFII 240

Query: 241 DNGGIDTDEDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIE 300
           DNGGIDTDEDYPYKG DGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVA+QPVSVAIE
Sbjct: 241 DNGGIDTDEDYPYKGRDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIE 300

Query: 301 AGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLER 360
           AGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGK+WGE+GYIRLER
Sbjct: 301 AGGRAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKDWGENGYIRLER 360

Query: 361 NVANITTGKCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQ 420
           NVAN TTGKCGIAVEPSYPIK+G NPPKP+ASPPSPV PPTECD+YFSC+EG+TCCCIYQ
Sbjct: 361 NVANSTTGKCGIAVEPSYPIKSGSNPPKPSASPPSPVNPPTECDEYFSCDEGSTCCCIYQ 420

Query: 421 YGSTCFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKR 480
           YGSTCF WGCCPLESATCCDDHYSCCPHEYP+CDLEAGTC  SK S MGV LLKRLPA +
Sbjct: 421 YGSTCFAWGCCPLESATCCDDHYSCCPHEYPVCDLEAGTCRASKDSLMGVNLLKRLPANQ 480

Query: 481 TRGIEKLGKLFVGA 487
           T+ I+KLGKLFVGA
Sbjct: 481 TKRIQKLGKLFVGA 488

BLAST of Cla97C02G033150 vs. ExPASy TrEMBL
Match: A0A6J1D149 (low-temperature-induced cysteine proteinase-like OS=Momordica charantia OX=3673 GN=LOC111016644 PE=3 SV=1)

HSP 1 Score: 862.4 bits (2227), Expect = 8.7e-247
Identity = 410/486 (84.36%), Postives = 442/486 (90.95%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAI  +FSFPILA LALFLCF  FSSAS SS+FSIIDENAKHHLG P+IA SDA   P R
Sbjct: 1   MAIFSSFSFPILASLALFLCFLSFSSASDSSSFSIIDENAKHHLGFPDIAGSDAGTPPLR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           T  +VAALY+SWLV HGKAYNALGE+ERRFEIFKDNLRFIDEHNRE R+Y +GLTRFADL
Sbjct: 61  TQEQVAALYKSWLVKHGKAYNALGERERRFEIFKDNLRFIDEHNREPRSYTLGLTRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           TNEEYRA+FLGGR S KPR SA K+GRYA++LG DLP+HVDWR+KGAV AVKDQGQCGSC
Sbjct: 121 TNEEYRARFLGGRFSPKPRPSAAKNGRYASSLGGDLPEHVDWREKGAVTAVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFST+AAVEGINQI TGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFI+DNGGIDT+
Sbjct: 181 WAFSTIAAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIVDNGGIDTE 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG DG CDPNR+NA VVTIDGYEDVPENDE +LKKAVASQPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDGVCDPNRRNANVVTIDGYEDVPENDEGALKKAVASQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVFTGRCGT+LDHGVVAVGYGT+NG DYWIVRNSWGKNWGE+GYIRLERNVANITTG
Sbjct: 301 YQSGVFTGRCGTDLDHGVVAVGYGTENGLDYWIVRNSWGKNWGENGYIRLERNVANITTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIA+EPSYP+KTG NPPKPA SPPSPVKPPTECDDY+SC EGTTCCCIYQYGSTCFGW
Sbjct: 361 KCGIAIEPSYPVKTGKNPPKPAPSPPSPVKPPTECDDYYSCPEGTTCCCIYQYGSTCFGW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRGIEKLG 480
           GCCPLESATCCDD YSCCP EYP+CDL  GTC MSKGS +GV LLKRLPAK    ++KLG
Sbjct: 421 GCCPLESATCCDDQYSCCPREYPVCDLAEGTCRMSKGSLIGVSLLKRLPAKHKGVVQKLG 480

Query: 481 KLFVGA 487
           K+ +G+
Sbjct: 481 KMIIGS 486

BLAST of Cla97C02G033150 vs. ExPASy TrEMBL
Match: A0A6J1JKG6 (low-temperature-induced cysteine proteinase-like OS=Cucurbita maxima OX=3661 GN=LOC111486001 PE=3 SV=1)

HSP 1 Score: 852.0 bits (2200), Expect = 1.2e-243
Identity = 412/487 (84.60%), Postives = 440/487 (90.35%), Query Frame = 0

Query: 1   MAISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQR 60
           MAIS TFSF I+AFLALFLCFS  S AS SS+FSIIDENAKHHL +         GSP R
Sbjct: 1   MAISTTFSFSIVAFLALFLCFSSSSLASDSSSFSIIDENAKHHLDV-------QIGSPHR 60

Query: 61  TDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 120
           +D EVAALYESWLVH+GKAYNALGEKERRFEIFKDNLRFIDEHNRESR+YKVGL RFADL
Sbjct: 61  SDEEVAALYESWLVHNGKAYNALGEKERRFEIFKDNLRFIDEHNRESRSYKVGLNRFADL 120

Query: 121 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 180
           +NEEYRA FL GR+SRK RLSA KS RYA TLGDDLPDHVDWRKKGAV  VKDQGQCGSC
Sbjct: 121 SNEEYRAMFLSGRLSRKNRLSATKSRRYAITLGDDLPDHVDWRKKGAVTDVKDQGQCGSC 180

Query: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 240
           WAFSTVAAVEGINQITTGELISLSEQELVDCD+SYNMGCNGGLMDYAFQFIIDNGGIDT+
Sbjct: 181 WAFSTVAAVEGINQITTGELISLSEQELVDCDRSYNMGCNGGLMDYAFQFIIDNGGIDTE 240

Query: 241 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 300
           EDYPYKG DG CDPNR+N+K VTIDGYEDVPENDESSLKKAVA+QPVSVAIEAGGRAFQL
Sbjct: 241 EDYPYKGRDGTCDPNRRNSKAVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQL 300

Query: 301 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 360
           YQSGVF G CGT LDHGVVAVGYGT++G DYWIVRNSWGKNWGE+GYI+LERNV NITTG
Sbjct: 301 YQSGVFNGLCGTELDHGVVAVGYGTEDGKDYWIVRNSWGKNWGENGYIKLERNVGNITTG 360

Query: 361 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 420
           KCGIA+E SYP KTG NPPKPA SPPSPV PP +CD+YFSC+EGTTCCCIY+YGSTCFGW
Sbjct: 361 KCGIAIEASYPTKTGKNPPKPAPSPPSPVSPPNQCDEYFSCQEGTTCCCIYKYGSTCFGW 420

Query: 421 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRTRG-IEKL 480
           GCCPLESATCCDDHYSCCPHEYPICDL+AGTCL SKGST GVKLLKR+PAK  RG ++KL
Sbjct: 421 GCCPLESATCCDDHYSCCPHEYPICDLDAGTCLKSKGSTTGVKLLKRVPAKYNRGCVQKL 480

Query: 481 GKLFVGA 487
           GK+ +GA
Sbjct: 481 GKMSIGA 480

BLAST of Cla97C02G033150 vs. TAIR 10
Match: AT1G47128.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 644.8 bits (1662), Expect = 5.5e-185
Identity = 308/470 (65.53%), Postives = 368/470 (78.30%), Query Frame = 0

Query: 3   ISPTFSFPILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQRTD 62
           + PT +   LA +A+          S +   SII  + KH +            +  R++
Sbjct: 4   LKPTMAILFLAMVAV----------SSAVDMSIISYDEKHGVST----------TGGRSE 63

Query: 63  AEVAALYESWLVHHGKA--YNALGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADL 122
           AEV ++YE+WLV HGKA   N+L EK+RRFEIFKDNLRF+DEHN ++ +Y++GLTRFADL
Sbjct: 64  AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADL 123

Query: 123 TNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSC 182
           TN+EYR+K+LG ++ +K       S RY A +GD+LP+ +DWRKKGAVA VKDQG CGSC
Sbjct: 124 TNDEYRSKYLGAKMEKKGERRT--SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 183

Query: 183 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 242
           WAFST+ AVEGINQI TG+LI+LSEQELVDCD SYN GCNGGLMDYAF+FII NGGIDTD
Sbjct: 184 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 243

Query: 243 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 302
           +DYPYKG DG CD  RKNAKVVTID YEDVP   E SLKKAVA QP+S+AIEAGGRAFQL
Sbjct: 244 KDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQL 303

Query: 303 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 362
           Y SG+F G CGT LDHGVVAVGYGT+NG DYWIVRNSWGK+WGESGY+R+ RN+A+ ++G
Sbjct: 304 YDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIAS-SSG 363

Query: 363 KCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGW 422
           KCGIA+EPSYPIK G NPP P  SPPSP+KPPT+CD Y++C E  TCCC+++YG  CF W
Sbjct: 364 KCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFAW 423

Query: 423 GCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPA 471
           GCCPLE+ATCCDD+YSCCPHEYP+CDL+ GTCL+SK S   VK LKR PA
Sbjct: 424 GCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCLLSKNSPFSVKALKRKPA 450

BLAST of Cla97C02G033150 vs. TAIR 10
Match: AT5G43060.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 636.3 bits (1640), Expect = 2.0e-182
Identity = 312/466 (66.95%), Postives = 354/466 (75.97%), Query Frame = 0

Query: 10  PILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQRTDAEVAALY 69
           P++  LA+          S +   SII  +  HH+               R+D+EV  +Y
Sbjct: 8   PMILLLAMI-------GVSYAMDMSIISYDENHHI----------TTETSRSDSEVERIY 67

Query: 70  ESWLVHHGKA---YNALG-EKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLTNEEY 129
           E+W+V HGK     N LG EK++RFEIFKDNLRFIDEHN ++ +YK+GLTRFADLTNEEY
Sbjct: 68  EAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEY 127

Query: 130 RAKFLGGRISRKPRLSAVK-SGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCGSCWAFS 189
           R+ +LG     KP    +K S RY A +GD LPD VDWRK+GAVA VKDQG CGSCWAFS
Sbjct: 128 RSMYLGA----KPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFS 187

Query: 190 TVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTDEDYP 249
           T+ AVEGIN+I TG+LISLSEQELVDCD SYN GCNGGLMDYAF+FII NGGIDT+ DYP
Sbjct: 188 TIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYP 247

Query: 250 YKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQLYQSG 309
           YK  DG CD NRKNAKVVTID YEDVPEN E+SLKKA+A QP+SVAIEAGGRAFQLY SG
Sbjct: 248 YKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSG 307

Query: 310 VFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTGKCGI 369
           VF G CGT LDHGVVAVGYGT+NG DYWIVRNSWG  WGESGYI++ RN+    TGKCGI
Sbjct: 308 VFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNI-EAPTGKCGI 367

Query: 370 AVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGSTCFGWGCCP 429
           A+E SYPIK G NPP P  SPPSP+KPPT CD YFSC E  TCCC+Y+YG  CFGWGCCP
Sbjct: 368 AMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLYKYGKYCFGWGCCP 427

Query: 430 LESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPA 471
           LE+ATCCDD+ SCCPHEYP+CD+  GTCLMSK S   VK LKR PA
Sbjct: 428 LEAATCCDDNSSCCPHEYPVCDVNRGTCLMSKNSPFSVKALKRTPA 451

BLAST of Cla97C02G033150 vs. TAIR 10
Match: AT3G19390.1 (Granulin repeat cysteine protease family protein )

HSP 1 Score: 557.0 bits (1434), Expect = 1.5e-158
Identity = 272/417 (65.23%), Postives = 320/417 (76.74%), Query Frame = 0

Query: 60  RTDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNR-ESRTYKVGLTRFA 119
           R +AE   +YE WLV + K YN LGEKERRFEIFKDNL+F++EH+   +RTY+VGLTRFA
Sbjct: 34  RNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFA 93

Query: 120 DLTNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQGQCG 179
           DLTN+E+RA +L  ++ R      VK  +Y   +GD LPD +DWR KGAV  VKDQG CG
Sbjct: 94  DLTNDEFRAIYLRSKMER--TRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCG 153

Query: 180 SCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGID 239
           SCWAFS + AVEGINQI TGELISLSEQELVDCD SYN GC GGLMDYAF+FII+NGGID
Sbjct: 154 SCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGID 213

Query: 240 TDEDYPYKGHD-GACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRA 299
           T+EDYPY   D   C+ ++KN +VVTIDGYEDVP+NDE SLKKA+A+QP+SVAIEAGGRA
Sbjct: 214 TEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRA 273

Query: 300 FQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANI 359
           FQLY SGVFTG CGT+LDHGVVAVGYG++ G DYWIVRNSWG NWGESGY +LERN+   
Sbjct: 274 FQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKE- 333

Query: 360 TTGKCGIAVEPSYPIK-TGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGST 419
           ++GKCG+A+  SYP K +G NPPKP A  PSPV     CD   +C   +TCCC+Y+Y   
Sbjct: 334 SSGKCGVAMMASYPTKSSGSNPPKPPA--PSPV----VCDKSNTCPAKSTCCCLYEYNGK 393

Query: 420 CFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVKLLKRLPAKRT 474
           C+ WGCCP ESATCCDD  SCCP  YP+CDL+A TC M   S + +K L R PA  T
Sbjct: 394 CYSWGCCPYESATCCDDGSSCCPQSYPVCDLKANTCRMKGNSPLSIKALTRGPAIAT 441

BLAST of Cla97C02G033150 vs. TAIR 10
Match: AT1G09850.1 (xylem bark cysteine peptidase 3 )

HSP 1 Score: 441.4 bits (1134), Expect = 9.2e-124
Identity = 212/408 (51.96%), Postives = 272/408 (66.67%), Query Frame = 0

Query: 57  SPQRTDAEVAALYESWLVHHGKAYNALGEKERRFEIFKDNLRFIDEHNR-ESRTYKVGLT 116
           S   +  +++ L++ W   HGK Y +  E+++R +IFKDN  F+ +HN   + TY + L 
Sbjct: 20  SSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLN 79

Query: 117 RFADLTNEEYRAKFLGGRISRKPRLSAVKSGRYAATLGDDLPDHVDWRKKGAVAAVKDQG 176
            FADLT+ E++A  LG  +S    + A K      ++   +PD VDWRKKGAV  VKDQG
Sbjct: 80  AFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGSV--KVPDSVDWRKKGAVTNVKDQG 139

Query: 177 QCGSCWAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNG 236
            CG+CW+FS   A+EGINQI TG+LISLSEQEL+DCDKSYN GCNGGLMDYAF+F+I N 
Sbjct: 140 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 199

Query: 237 GIDTDEDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGG 296
           GIDT++DYPY+  DG C  ++   KVVTID Y  V  NDE +L +AVA+QPVSV I    
Sbjct: 200 GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 259

Query: 297 RAFQLYQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVA 356
           RAFQLY SG+F+G C T+LDH V+ VGYG+ NG DYWIV+NSWGK+WG  G++ ++RN  
Sbjct: 260 RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 319

Query: 357 NITTGKCGIAVEPSYPIKTGMNPPKPAASPPSPVKPPTECDDYFSCEEGTTCCCIYQYGS 416
           N + G CGI +  SYPIKT  NPP P  SPP     PT+C+ +  C  G TCCC  +   
Sbjct: 320 N-SDGVCGINMLASYPIKTHPNPPPP--SPPG----PTKCNLFTYCSSGETCCCARELFG 379

Query: 417 TCFGWGCCPLESATCCDDHYSCCPHEYPICDLEAGTCLMSKGSTMGVK 464
            CF W CC +ESA CC D   CCPH+YP+CD     CL   G+   +K
Sbjct: 380 LCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIK 418

BLAST of Cla97C02G033150 vs. TAIR 10
Match: AT4G36880.1 (cysteine proteinase1 )

HSP 1 Score: 438.7 bits (1127), Expect = 6.0e-123
Identity = 227/378 (60.05%), Postives = 279/378 (73.81%), Query Frame = 0

Query: 10  PILAFLALFLCFSPFSSASLSSTFSIIDENAKHHLGIPEIADSDAQGSPQRTDAEVAALY 69
           P    L+L L +   S A  S   SII++    HL +P    SD +    RTD EV ++Y
Sbjct: 3   PSTKVLSLLLLYVVVSLA--SGDESIIND----HLQLP----SDGK---WRTDEEVRSIY 62

Query: 70  ESWLVHHGKAYN----ALGEKERRFEIFKDNLRFIDEHNRESR--TYKVGLTRFADLTNE 129
             W   HGK  N     + ++++RF IFKDNLRFID HN  ++  TYK+GLT+F DLTN+
Sbjct: 63  LQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTND 122

Query: 130 EYRAKFLGGRISRKPRLSAVK--SGRYAATL-GDDLPDHVDWRKKGAVAAVKDQGQCGSC 189
           EYR  +LG R     R++  K  + +Y+A + G ++P+ VDWR+KGAV  +KDQG CGSC
Sbjct: 123 EYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 182

Query: 190 WAFSTVAAVEGINQITTGELISLSEQELVDCDKSYNMGCNGGLMDYAFQFIIDNGGIDTD 249
           WAFST AAVEGIN+I TGELISLSEQELVDCDKSYN GCNGGLMDYAFQFI+ NGG++T+
Sbjct: 183 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 242

Query: 250 EDYPYKGHDGACDPNRKNAKVVTIDGYEDVPENDESSLKKAVASQPVSVAIEAGGRAFQL 309
           +DYPY+G  G C+   KN++VV+IDGYEDVP  DE++LKKA++ QPVSVAIEAGGR FQ 
Sbjct: 243 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 302

Query: 310 YQSGVFTGRCGTNLDHGVVAVGYGTDNGTDYWIVRNSWGKNWGESGYIRLERNVANITTG 369
           YQSG+FTG CGTNLDH VVAVGYG++NG DYWIVRNSWG  WGE GYIR+ERN+A   +G
Sbjct: 303 YQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSG 362

Query: 370 KCGIAVEPSYPIKTGMNP 379
           KCGIAVE SYP+K   NP
Sbjct: 363 KCGIAVEASYPVKYSPNP 367

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038889032.11.2e-26993.21low-temperature-induced cysteine proteinase-like [Benincasa hispida][more]
XP_008454976.11.2e-25589.71PREDICTED: low-temperature-induced cysteine proteinase-like [Cucumis melo][more]
XP_004136967.15.2e-25488.48low-temperature-induced cysteine proteinase [Cucumis sativus] >KGN43904.1 hypoth... [more]
KAA0031322.12.9e-25288.06low-temperature-induced cysteine proteinase-like [Cucumis melo var. makuwa] >TYK... [more]
XP_022147795.11.8e-24684.36low-temperature-induced cysteine proteinase-like [Momordica charantia][more]
Match NameE-valueIdentityDescription
P432977.7e-18465.53Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1[more]
Q9FMH82.8e-18166.95Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 S... [more]
P257763.5e-17667.84Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=... [more]
Q9LT782.1e-15765.23Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 S... [more]
P257772.9e-15459.32Oryzain beta chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0670200 PE=1... [more]
Match NameE-valueIdentityDescription
A0A1S3BZW86.0e-25689.71low-temperature-induced cysteine proteinase-like OS=Cucumis melo OX=3656 GN=LOC1... [more]
A0A0A0K4N32.5e-25488.48Cysteine protease OS=Cucumis sativus OX=3659 GN=Csa_7G073400 PE=3 SV=1[more]
A0A5A7SJU51.4e-25288.06Low-temperature-induced cysteine proteinase-like OS=Cucumis melo var. makuwa OX=... [more]
A0A6J1D1498.7e-24784.36low-temperature-induced cysteine proteinase-like OS=Momordica charantia OX=3673 ... [more]
A0A6J1JKG61.2e-24384.60low-temperature-induced cysteine proteinase-like OS=Cucurbita maxima OX=3661 GN=... [more]
Match NameE-valueIdentityDescription
AT1G47128.15.5e-18565.53Granulin repeat cysteine protease family protein [more]
AT5G43060.12.0e-18266.95Granulin repeat cysteine protease family protein [more]
AT3G19390.11.5e-15865.23Granulin repeat cysteine protease family protein [more]
AT1G09850.19.2e-12451.96xylem bark cysteine peptidase 3 [more]
AT4G36880.16.0e-12360.05cysteine proteinase1 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 174..189
score: 66.81
coord: 316..326
score: 57.41
coord: 331..337
score: 73.06
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 156..372
e-value: 1.1E-126
score: 436.8
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 156..372
e-value: 2.0E-85
score: 286.3
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 69..125
e-value: 7.4E-26
score: 101.9
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 69..125
e-value: 5.9E-18
score: 65.1
IPR000118GranulinSMARTSM00277GRAN_2coord: 395..452
e-value: 1.5E-24
score: 97.6
IPR000118GranulinPFAMPF00396Granulincoord: 406..453
e-value: 2.7E-10
score: 40.5
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 52..374
e-value: 2.1E-121
score: 407.4
NoneNo IPR availablePANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 49..401
NoneNo IPR availablePANTHERPTHR12411:SF749CYSTEINE PROTEASEcoord: 49..401
NoneNo IPR availableSUPERFAMILY57277Granulin repeatcoord: 392..426
IPR037277Granulin superfamilyGENE3D2.10.25.160Granulincoord: 392..455
e-value: 1.9E-13
score: 52.5
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 314..324
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 331..350
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 174..185
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 157..371
e-value: 1.11093E-112
score: 329.585
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 61..372

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G033150.1Cla97C02G033150.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0051603 proteolysis involved in cellular protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005615 extracellular space
cellular_component GO:0005764 lysosome
molecular_function GO:0004197 cysteine-type endopeptidase activity
molecular_function GO:0008234 cysteine-type peptidase activity