Cla97C02G035510.1 (mRNA) Watermelon (97103) v2

NameCla97C02G035510.1
TypemRNA
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionCysteine protease, putative
LocationCla97Chr02 : 11872094 .. 11873275 (-)
Sequence length1032
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGCATATAGAATGATTTGGAATGTGGGTTTGTTGTCTCTGACTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATGGAAATGGACTACCGTCCAGGATCTAGTTCCGGTGACTTACAAGATAGGTACCAGAAATGGATGAGTAAATACGGTCGAGAATACAAGAGCAGAGAAGAGTGGGAGCAGAGATTCAATATTTATCAGTTGAATGTTCAGTACATTGACAACTTCAATTCTCTGAATCATTCATATACTCTGGCTGAAAATAGCTTTGCAGATCTCACAAATGATGAGTTCAAGACAACTTACTTGGGGTTTAAAACTGATTGGCTTCCTGATACATGGTTCAGATATGGAAATATGGTTAATTTGCCTACTAATGTTGACTGGAGAAAGGAAAATGCAGTTACTCCAGTAAAGGATCAAGGTCAATGCGGTATGAACTCCTTCATCTAACTAATCAGAAATCAGTATCTTTTCGTCTGATTTTTGAGTTCCTTTTCCTAGCAACTACCCGACCCTCTGTGCGTTGTAAGTCATCCCTTATTATTTGAACGAACTCACCAGTTTTGATTTTCTGACACAGGGAGCTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCAACAAAATTAAAACAGGCAAATTGATGTCTCTATCAGAACAGGAGCTTGTGGACTGCGATGTGGCGTCGGGGAACCAGGGATGCAATGGTGGTTACATGTACAAAGCATTTGAGTTCATCAAGAAAACTGGACTCACTACAGAAATAGAATATCCATACAGGGGAATTGAATCTGTATGCAACAAACAAAAAGTGAGATACCGCACTGTGACAATAAGTGGATATGAAAAAGTACCCGTCAATGATGAGAAAAGCTTAAAGGCAGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAGTTCTATTCAGGTGGAGTCTTTTCAGGCAATTGTGGGAAGCAACTCAATCATGGAGTGGCAATAGTTGGGTATGGGGAAGCTAGCAATAAAACTTATTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACATAAGAATGAAACGTGATTCAACCGACAAGCGAGGTACTTGTGGCATAGCTATGATGGCTAGCTACCCCATCAAAGACTGA

mRNA sequence

ATGGAAGCATATAGAATGATTTGGAATGTGGGTTTGTTGTCTCTGACTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATGGAAATGGACTACCGTCCAGGATCTAGTTCCGGTGACTTACAAGATAGGTACCAGAAATGGATGAGTAAATACGGTCGAGAATACAAGAGCAGAGAAGAGTGGGAGCAGAGATTCAATATTTATCAGTTGAATGTTCAGTACATTGACAACTTCAATTCTCTGAATCATTCATATACTCTGGCTGAAAATAGCTTTGCAGATCTCACAAATGATGAGTTCAAGACAACTTACTTGGGGTTTAAAACTGATTGGCTTCCTGATACATGGTTCAGATATGGAAATATGGTTAATTTGCCTACTAATGTTGACTGGAGAAAGGAAAATGCAGTTACTCCAGTAAAGGATCAAGGTCAATGCGGGAGCTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCAACAAAATTAAAACAGGCAAATTGATGTCTCTATCAGAACAGGAGCTTGTGGACTGCGATGTGGCGTCGGGGAACCAGGGATGCAATGGTGGTTACATGTACAAAGCATTTGAGTTCATCAAGAAAACTGGACTCACTACAGAAATAGAATATCCATACAGGGGAATTGAATCTGTATGCAACAAACAAAAAGTGAGATACCGCACTGTGACAATAAGTGGATATGAAAAAGTACCCGTCAATGATGAGAAAAGCTTAAAGGCAGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAGTTCTATTCAGGTGGAGTCTTTTCAGGCAATTGTGGGAAGCAACTCAATCATGGAGTGGCAATAGTTGGGTATGGGGAAGCTAGCAATAAAACTTATTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACATAAGAATGAAACGTGATTCAACCGACAAGCGAGGTACTTGTGGCATAGCTATGATGGCTAGCTACCCCATCAAAGACTGA

Coding sequence (CDS)

ATGGAAGCATATAGAATGATTTGGAATGTGGGTTTGTTGTCTCTGACTCTCTGGGTTTTCTGGACACCCTCAATGGCATCCATGGAAATGGACTACCGTCCAGGATCTAGTTCCGGTGACTTACAAGATAGGTACCAGAAATGGATGAGTAAATACGGTCGAGAATACAAGAGCAGAGAAGAGTGGGAGCAGAGATTCAATATTTATCAGTTGAATGTTCAGTACATTGACAACTTCAATTCTCTGAATCATTCATATACTCTGGCTGAAAATAGCTTTGCAGATCTCACAAATGATGAGTTCAAGACAACTTACTTGGGGTTTAAAACTGATTGGCTTCCTGATACATGGTTCAGATATGGAAATATGGTTAATTTGCCTACTAATGTTGACTGGAGAAAGGAAAATGCAGTTACTCCAGTAAAGGATCAAGGTCAATGCGGGAGCTGCTGGGCATTCTCTGCAGTAGCAGCAGTGGAAGGCATCAACAAAATTAAAACAGGCAAATTGATGTCTCTATCAGAACAGGAGCTTGTGGACTGCGATGTGGCGTCGGGGAACCAGGGATGCAATGGTGGTTACATGTACAAAGCATTTGAGTTCATCAAGAAAACTGGACTCACTACAGAAATAGAATATCCATACAGGGGAATTGAATCTGTATGCAACAAACAAAAAGTGAGATACCGCACTGTGACAATAAGTGGATATGAAAAAGTACCCGTCAATGATGAGAAAAGCTTAAAGGCAGCAGTTGCTAACCAGCCAGTCTCTGTAGCAATTGATGCAGGGGGATATGATTTTCAGTTCTATTCAGGTGGAGTCTTTTCAGGCAATTGTGGGAAGCAACTCAATCATGGAGTGGCAATAGTTGGGTATGGGGAAGCTAGCAATAAAACTTATTGGCTTGTCAAGAATTCATGGGGCACTGACTGGGGTGAATCTGGTTACATAAGAATGAAACGTGATTCAACCGACAAGCGAGGTACTTGTGGCATAGCTATGATGGCTAGCTACCCCATCAAAGACTGA

Protein sequence

MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD
BLAST of Cla97C02G035510.1 vs. NCBI nr
Match: XP_008458487.1 (PREDICTED: ervatamin-B-like [Cucumis melo])

HSP 1 Score: 605.9 bits (1561), Expect = 8.4e-170
Identity = 290/346 (83.82%), Postives = 313/346 (90.46%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSS-SGDLQDRYQKWMSKYGREYKSR 60
           MEAY+MIW+V L+SL LWVFWTP+  SM MDY  GSS SG+LQDRYQKWM KYGR+YKSR
Sbjct: 1   MEAYKMIWSVSLISLILWVFWTPTRVSMAMDYPSGSSNSGELQDRYQKWMDKYGRQYKSR 60

Query: 61  EEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLP--DTW 120
           EEWEQRF IYQ NVQYIDNFNSLNHSYTLAEN+F DLTN+EF  TYLG++T  LP  DT+
Sbjct: 61  EEWEQRFTIYQANVQYIDNFNSLNHSYTLAENNFTDLTNEEFMATYLGYETVSLPDTDTF 120

Query: 121 FRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQE 180
           FRYGNMVNLPTNVDWRKE AVTP+K+QGQCGSCWAFSAVAAVEGINKIK GKLMSLSEQE
Sbjct: 121 FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLMSLSEQE 180

Query: 181 LVDCDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGY 240
           LVDCDV SGNQGCNGGYMYKAFEFIKKTGLTTE+EYPY    S C+KQK +Y++V+ISGY
Sbjct: 181 LVDCDVTSGNQGCNGGYMYKAFEFIKKTGLTTELEYPYTATRSACDKQKEKYQSVSISGY 240

Query: 241 EKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEAS 300
           EKVPVNDEKSL+AAVA QPVSVAIDAGG DFQFYSGG+FSGNCGKQLNHGVAIVGYGE S
Sbjct: 241 EKVPVNDEKSLQAAVAKQPVSVAIDAGGNDFQFYSGGIFSGNCGKQLNHGVAIVGYGEDS 300

Query: 301 NKTYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           N+ YWLVKNSWGT WGESGYIRM RDSTDK+GTCGIAMMASYPIKD
Sbjct: 301 NQAYWLVKNSWGTSWGESGYIRMMRDSTDKQGTCGIAMMASYPIKD 346

BLAST of Cla97C02G035510.1 vs. NCBI nr
Match: KGN62053.1 (hypothetical protein Csa_2G292830 [Cucumis sativus])

HSP 1 Score: 592.4 bits (1526), Expect = 9.7e-166
Identity = 282/340 (82.94%), Postives = 308/340 (90.59%), Query Frame = 0

Query: 6   MIW-NVGLLSLTLWVFWTPSMASMEMDYRPGSS-SGDLQDRYQKWMSKYGREYKSREEWE 65
           M W NV L+ L LWVFWTP + SM MDY  GSS S D+QDRYQKWM KYGR+YKSREEWE
Sbjct: 1   MTWINVSLIFLILWVFWTPRLVSMAMDYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWE 60

Query: 66  QRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRYGNM 125
           +RF IYQ NVQYIDNFNS+NHS+TLAEN+FADLTN+EFK TYLG+KT  +PDT FRYGNM
Sbjct: 61  RRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEEFKATYLGYKTVSIPDTCFRYGNM 120

Query: 126 VNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDV 185
           VNLPTNVDWR+E AVTP+K+QGQCGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV
Sbjct: 121 VNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDV 180

Query: 186 ASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVN 245
            SGNQGCNGGYMYKAFEFIK+TGLTTEIEYPY+G ES CN+QK +Y+ V+ISGYEKVPVN
Sbjct: 181 TSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVN 240

Query: 246 DEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWL 305
           DEKSLKAAVANQPVSVAIDA G +FQFYSGG+FSGNCG QLNHGVAIVGYGE SN+ YWL
Sbjct: 241 DEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWL 300

Query: 306 VKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           VKNSWGTDWGESGYIRMKRDSTD++GTCGIAMMASYP KD
Sbjct: 301 VKNSWGTDWGESGYIRMKRDSTDRQGTCGIAMMASYPTKD 340

BLAST of Cla97C02G035510.1 vs. NCBI nr
Match: XP_022140756.1 (ervatamin-B [Momordica charantia])

HSP 1 Score: 576.6 bits (1485), Expect = 5.5e-161
Identity = 277/343 (80.76%), Postives = 301/343 (87.76%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSRE 60
           MEAY MI NVG + L L VFWT SMAS+  D  PG  S D++DRYQKW+ KYGREYKS E
Sbjct: 1   MEAYGMIRNVGFMWLILCVFWTLSMASVAEDNPPGDGSDDMRDRYQKWIDKYGREYKSGE 60

Query: 61  EWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRY 120
           E E+RF IYQ NVQYID FNSLN SYTLA+N FADLTNDEFKTTYLG+ TDW PDT F+Y
Sbjct: 61  EREKRFPIYQSNVQYIDYFNSLNRSYTLADNMFADLTNDEFKTTYLGYLTDWSPDTCFKY 120

Query: 121 GNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 180
           GN+VNLPTNVDWRKE AVTP+KDQGQCGSCWAFSAVAAVEGI KIKTGKL+SLSEQEL+D
Sbjct: 121 GNIVNLPTNVDWRKEGAVTPIKDQGQCGSCWAFSAVAAVEGITKIKTGKLVSLSEQELLD 180

Query: 181 CDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKV 240
           CDV SGNQGC+GG+M KAFEFIKK G+TTE EYPYRG+E+VCNKQKVRY + TISGYEKV
Sbjct: 181 CDVISGNQGCSGGFMPKAFEFIKKIGITTEKEYPYRGVENVCNKQKVRYHSATISGYEKV 240

Query: 241 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKT 300
           P NDEKSLKAAVANQPVSVAIDAGGYDFQFYSGG+FSGNCGKQLNHGV IVGYGE   K+
Sbjct: 241 PANDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGIFSGNCGKQLNHGVTIVGYGEDVGKS 300

Query: 301 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           YWLVKNSWGT WGE GY+RMK +S+DKRGTCGIAM ASYPIKD
Sbjct: 301 YWLVKNSWGTSWGEYGYVRMKSNSSDKRGTCGIAMDASYPIKD 343

BLAST of Cla97C02G035510.1 vs. NCBI nr
Match: XP_023513224.1 (ervatamin-B-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 570.1 bits (1468), Expect = 5.1e-159
Identity = 274/343 (79.88%), Postives = 305/343 (88.92%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSRE 60
           MEAY+ IWN+GL SL LWV  TPSMASM  D    S S  LQDRY+KWM+K+ REYKSRE
Sbjct: 1   MEAYKTIWNMGLTSLILWVVCTPSMASMATD----SPSNGLQDRYKKWMNKHSREYKSRE 60

Query: 61  EWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRY 120
           E E+RF +YQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLG++T  LPDT FRY
Sbjct: 61  EQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRY 120

Query: 121 GNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 180
            ++ +LPT+VDWR E+AVTP+KDQGQCGSCWAFSAVAAVEGI+KI+TGKL SLSEQELVD
Sbjct: 121 EHVNSLPTHVDWRMEDAVTPIKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVD 180

Query: 181 CDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKV 240
           CD+ SGNQGC+GG+M KAFE+IK++GLTTE EYPYRGIE+ CN QKVRY +VTISGYEKV
Sbjct: 181 CDIISGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKV 240

Query: 241 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKT 300
           P+N+EK LKAAVANQPVSVAIDAGGYDFQFYS G+FSG+CGKQLNHGVAIVGYGE  + T
Sbjct: 241 PMNNEKKLKAAVANQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT 300

Query: 301 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           YWLVKNSWGT+WGESGYIRMKRDS DKRG CGIAM ASYPIKD
Sbjct: 301 YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPIKD 339

BLAST of Cla97C02G035510.1 vs. NCBI nr
Match: XP_022944762.1 (ervatamin-B-like [Cucurbita moschata])

HSP 1 Score: 568.5 bits (1464), Expect = 1.5e-158
Identity = 272/343 (79.30%), Postives = 304/343 (88.63%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSRE 60
           MEAY+ IWN+GL SL LW+  TPSMASM  D    S S  LQDRY+KWM+K+ REYKSRE
Sbjct: 1   MEAYKTIWNMGLTSLILWIVCTPSMASMATD----SPSNGLQDRYKKWMNKHSREYKSRE 60

Query: 61  EWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRY 120
           E E+RF +YQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLG++T  LPDT FRY
Sbjct: 61  EQERRFTVYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGYQTHCLPDTCFRY 120

Query: 121 GNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVD 180
            ++++LPT+VDWR E+AVTPVKDQGQCGSCWAFSAVAAVEGI+KI+TGKL SLSEQELVD
Sbjct: 121 DHVISLPTHVDWRMEDAVTPVKDQGQCGSCWAFSAVAAVEGIHKIRTGKLESLSEQELVD 180

Query: 181 CDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKV 240
           CD+ SGNQGC+GG+M KAFE+IK++GLTTE EYPYRGIE+ CN QKVRY +VTISGYEKV
Sbjct: 181 CDIISGNQGCDGGFMNKAFEYIKRSGLTTEREYPYRGIEAFCNTQKVRYHSVTISGYEKV 240

Query: 241 PVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKT 300
           P N+EK LKAAVA+QPVSVAIDAGGYDFQFYS G+FSG+CGKQLNHGVAIVGYGE  + T
Sbjct: 241 PTNNEKKLKAAVAHQPVSVAIDAGGYDFQFYSSGIFSGSCGKQLNHGVAIVGYGEVGDNT 300

Query: 301 YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           YWLVKNSWGT+WGESGYIRMKRDS DKRG CGIAM ASYP KD
Sbjct: 301 YWLVKNSWGTEWGESGYIRMKRDSIDKRGACGIAMEASYPTKD 339

BLAST of Cla97C02G035510.1 vs. TrEMBL
Match: tr|A0A1S3C828|A0A1S3C828_CUCME (ervatamin-B-like OS=Cucumis melo OX=3656 GN=LOC103497881 PE=3 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 5.6e-170
Identity = 290/346 (83.82%), Postives = 313/346 (90.46%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSS-SGDLQDRYQKWMSKYGREYKSR 60
           MEAY+MIW+V L+SL LWVFWTP+  SM MDY  GSS SG+LQDRYQKWM KYGR+YKSR
Sbjct: 1   MEAYKMIWSVSLISLILWVFWTPTRVSMAMDYPSGSSNSGELQDRYQKWMDKYGRQYKSR 60

Query: 61  EEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLP--DTW 120
           EEWEQRF IYQ NVQYIDNFNSLNHSYTLAEN+F DLTN+EF  TYLG++T  LP  DT+
Sbjct: 61  EEWEQRFTIYQANVQYIDNFNSLNHSYTLAENNFTDLTNEEFMATYLGYETVSLPDTDTF 120

Query: 121 FRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQE 180
           FRYGNMVNLPTNVDWRKE AVTP+K+QGQCGSCWAFSAVAAVEGINKIK GKLMSLSEQE
Sbjct: 121 FRYGNMVNLPTNVDWRKEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLMSLSEQE 180

Query: 181 LVDCDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGY 240
           LVDCDV SGNQGCNGGYMYKAFEFIKKTGLTTE+EYPY    S C+KQK +Y++V+ISGY
Sbjct: 181 LVDCDVTSGNQGCNGGYMYKAFEFIKKTGLTTELEYPYTATRSACDKQKEKYQSVSISGY 240

Query: 241 EKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEAS 300
           EKVPVNDEKSL+AAVA QPVSVAIDAGG DFQFYSGG+FSGNCGKQLNHGVAIVGYGE S
Sbjct: 241 EKVPVNDEKSLQAAVAKQPVSVAIDAGGNDFQFYSGGIFSGNCGKQLNHGVAIVGYGEDS 300

Query: 301 NKTYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           N+ YWLVKNSWGT WGESGYIRM RDSTDK+GTCGIAMMASYPIKD
Sbjct: 301 NQAYWLVKNSWGTSWGESGYIRMMRDSTDKQGTCGIAMMASYPIKD 346

BLAST of Cla97C02G035510.1 vs. TrEMBL
Match: tr|A0A0A0LJV6|A0A0A0LJV6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G292830 PE=3 SV=1)

HSP 1 Score: 592.4 bits (1526), Expect = 6.4e-166
Identity = 282/340 (82.94%), Postives = 308/340 (90.59%), Query Frame = 0

Query: 6   MIW-NVGLLSLTLWVFWTPSMASMEMDYRPGSS-SGDLQDRYQKWMSKYGREYKSREEWE 65
           M W NV L+ L LWVFWTP + SM MDY  GSS S D+QDRYQKWM KYGR+YKSREEWE
Sbjct: 1   MTWINVSLIFLILWVFWTPRLVSMAMDYSLGSSCSSDIQDRYQKWMDKYGRQYKSREEWE 60

Query: 66  QRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRYGNM 125
           +RF IYQ NVQYIDNFNS+NHS+TLAEN+FADLTN+EFK TYLG+KT  +PDT FRYGNM
Sbjct: 61  RRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEEFKATYLGYKTVSIPDTCFRYGNM 120

Query: 126 VNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDV 185
           VNLPTNVDWR+E AVTP+K+QGQCGSCWAFSAVAAVEGINKIK GKL+SLSEQELVDCDV
Sbjct: 121 VNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDV 180

Query: 186 ASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVN 245
            SGNQGCNGGYMYKAFEFIK+TGLTTEIEYPY+G ES CN+QK +Y+ V+ISGYEKVPVN
Sbjct: 181 TSGNQGCNGGYMYKAFEFIKRTGLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVN 240

Query: 246 DEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWL 305
           DEKSLKAAVANQPVSVAIDA G +FQFYSGG+FSGNCG QLNHGVAIVGYGE SN+ YWL
Sbjct: 241 DEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQAYWL 300

Query: 306 VKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           VKNSWGTDWGESGYIRMKRDSTD++GTCGIAMMASYP KD
Sbjct: 301 VKNSWGTDWGESGYIRMKRDSTDRQGTCGIAMMASYPTKD 340

BLAST of Cla97C02G035510.1 vs. TrEMBL
Match: tr|M5X1M2|M5X1M2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G125700 PE=3 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 2.0e-119
Identity = 209/343 (60.93%), Postives = 256/343 (74.64%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRP-GSSSGDLQDRYQKWMSKYGREYKSR 60
           ME   ++    L  L +W+F   S A  E  Y+P  +    +++RY++W+ KYGR YK+R
Sbjct: 1   METSMVLTRASLTFLMVWIFCISSTACSE-TYKPLRTDPKAMKERYERWLQKYGRIYKNR 60

Query: 61  EEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFR 120
           EE   RF +Y+ N++++D  NS N SY L +N FAD+TN EF  T++GF+T   P T F 
Sbjct: 61  EEAAYRFGVYKSNIEFVDFVNSQNLSYKLTDNKFADITNLEFTKTFMGFQTRSHPKTKFS 120

Query: 121 YGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELV 180
           Y     LPT VDWRK  AVTP+K+QGQCGSCWAFSAVAAVEGIN+IKTGKL+SLSEQELV
Sbjct: 121 YDKDEELPTAVDWRKHGAVTPIKNQGQCGSCWAFSAVAAVEGINQIKTGKLVSLSEQELV 180

Query: 181 DCDVASGNQGCNGGYMYKAFEFIKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEK 240
           DCDV +GN+GCNGGYM KAF FIK  GL+TE +YPY+G + +C++  ++   V ISGYE 
Sbjct: 181 DCDVKTGNEGCNGGYMEKAFSFIKDNGLSTEKDYPYKGSDGICDEDSLKNSAVNISGYES 240

Query: 241 VPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNK 300
           +P N EKSL+AAVA+QPVSVA+DA GY FQFYS G F+G CGK LNHGV  VGYGE S K
Sbjct: 241 IPANSEKSLQAAVAHQPVSVAVDAAGYAFQFYSSGTFTGQCGKNLNHGVTAVGYGEDSGK 300

Query: 301 TYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIK 343
            YW+VKNSWG DWGESGYIRM RDS DK+GTCGIAM ASYP+K
Sbjct: 301 KYWIVKNSWGPDWGESGYIRMTRDSVDKKGTCGIAMQASYPVK 342

BLAST of Cla97C02G035510.1 vs. TrEMBL
Match: tr|A0A2P4KBT5|A0A2P4KBT5_QUESU (Senescence-specific cysteine protease sag12 OS=Quercus suber OX=58331 GN=CFP56_48593 PE=3 SV=1)

HSP 1 Score: 434.9 bits (1117), Expect = 1.7e-118
Identity = 215/347 (61.96%), Postives = 259/347 (74.64%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMA---SMEMDYRPGSSSGDLQDRYQKWMSKYGREYK 60
           ME    + N  L  L +W+ W  +     ++ + Y P +    +++RY+ W+ +YGR YK
Sbjct: 1   METPTKLRNASLTLLIMWILWARACCEKYTLPLTYDPKA----MRERYESWVVRYGRRYK 60

Query: 61  SREEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTW 120
           ++EE E RF IYQ+NV+ ID FNS NHS+ L +N FADL+N E++  YLGF T   P T 
Sbjct: 61  NKEEEELRFGIYQMNVELIDYFNSQNHSFKLTDNRFADLSNREYQAAYLGFGTMSHPTTN 120

Query: 121 FRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQE 180
           F +    +LPT++DWRKE AVTP+KDQG+CGSCWAFSAVAAVEGINKIK GKL+SLSEQE
Sbjct: 121 FCHFKNKSLPTSMDWRKEGAVTPMKDQGECGSCWAFSAVAAVEGINKIKKGKLVSLSEQE 180

Query: 181 LVDCDVASGNQGCNGGYMYKAFEFIKKT-GLTTEIEYPYRGIESVCNKQKVRYRTVTISG 240
           L+DCD  +GN+GCNGGYM KAFEFIKK  GLTTE +YPY+G E  C+K K +  T TISG
Sbjct: 181 LMDCDTDTGNEGCNGGYMDKAFEFIKKNGGLTTEEDYPYKGKEGSCDKAKEKTHTATISG 240

Query: 241 YEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEA 300
           YEKVP NDEKSL+AAVANQPVSVA+DAG + FQFYS G+FSG CG  LNHGV  VGYGE 
Sbjct: 241 YEKVPANDEKSLQAAVANQPVSVAVDAGSFKFQFYSEGIFSGQCGTHLNHGVTAVGYGED 300

Query: 301 SNKTYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
             K YW+VKNSWG DWGESGYIRM RDS +K G CGIAM ASYP+ D
Sbjct: 301 GGK-YWIVKNSWGADWGESGYIRMTRDSQNKHGICGIAMDASYPLMD 342

BLAST of Cla97C02G035510.1 vs. TrEMBL
Match: tr|A0A2I4EXQ7|A0A2I4EXQ7_9ROSI (ervatamin-B-like OS=Juglans regia OX=51240 GN=LOC108993649 PE=3 SV=1)

HSP 1 Score: 434.5 bits (1116), Expect = 2.2e-118
Identity = 215/346 (62.14%), Postives = 253/346 (73.12%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYR--PGSSSGDLQDRYQKWMSKYGREYKS 60
           MEA     N  L  L L + W P  A  +  Y   P      +++RY+ W+ ++GR YKS
Sbjct: 1   MEALAASNNATLTLLILLILWIPLRALSDEKYTRPPVYDPKAMRERYESWLERHGRTYKS 60

Query: 61  REEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWF 120
            +EWE RF IYQ NVQ+ID  NS N S+ L +N FADLTN EFK  YLGF+  W   T F
Sbjct: 61  EQEWEMRFGIYQFNVQFIDYTNSQNLSFKLTDNKFADLTNGEFKAIYLGFRPMWHQKTNF 120

Query: 121 RYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQEL 180
            YG  V+LPT VDWRK+ AVTP+K+QGQCGSCWAFS VAAVEGINKIKTG+L+SLSEQEL
Sbjct: 121 SYGKDVDLPTRVDWRKKGAVTPIKNQGQCGSCWAFSTVAAVEGINKIKTGQLISLSEQEL 180

Query: 181 VDCDVASGNQGCNGGYMYKAFEFIKKT-GLTTEIEYPYRGIESVCNKQKVRYRTVTISGY 240
           VDC++ +  QGCNGGYM KAFE+IKKT GLTTE EYPYR     C+K K +   VTISGY
Sbjct: 181 VDCNLDTWCQGCNGGYMDKAFEYIKKTGGLTTEEEYPYRASTGTCDKAKEKDHVVTISGY 240

Query: 241 EKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEAS 300
           E+VP N+EKSL++AVANQPVSVAIDA GY+FQ YS G+F+  CG QLNHGV  VGYGE +
Sbjct: 241 ERVPANNEKSLQSAVANQPVSVAIDASGYEFQLYSQGIFTDRCGTQLNHGVTAVGYGEKN 300

Query: 301 NKTYWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
              YWLVKNSWGT WGESGYIR+ RD  D +G CGIAM ASYP+K+
Sbjct: 301 GMKYWLVKNSWGTGWGESGYIRLNRDIADNQGICGIAMEASYPLKN 346

BLAST of Cla97C02G035510.1 vs. Swiss-Prot
Match: sp|P12412|CYSEP_VIGMU (Vignain OS=Vigna mungo OX=3915 PE=1 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 2.5e-96
Identity = 190/353 (53.82%), Postives = 235/353 (66.57%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSRE 60
           M   +++W V  LSL L V       S +   +   S   L D Y++W S +    +S  
Sbjct: 1   MAMKKLLWVVLSLSLVLGV-----ANSFDFHEKDLESEESLWDLYERWRSHH-TVSRSLG 60

Query: 61  EWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLG--------FKTDW 120
           E  +RFN+++ NV ++ N N ++  Y L  N FAD+TN EF++TY G        F+   
Sbjct: 61  EKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQ 120

Query: 121 LPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMS 180
                F Y  + ++P +VDWRK+ AVT VKDQGQCGSCWAFS + AVEGIN+IKT KL+S
Sbjct: 121 HGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVS 180

Query: 181 LSEQELVDCDVASGNQGCNGGYMYKAFEFIK-KTGLTTEIEYPYRGIESVCNKQKVRYRT 240
           LSEQELVDCD    NQGCNGG M  AFEFIK K G+TTE  YPY   E  C++ KV    
Sbjct: 181 LSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKVNDLA 240

Query: 241 VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIV 300
           V+I G+E VPVNDE +L  AVANQPVSVAIDAGG DFQFYS GVF+G+C   LNHGVAIV
Sbjct: 241 VSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCNTDLNHGVAIV 300

Query: 301 GYGEASNKT-YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           GYG   + T YW+V+NSWG +WGE GYIRM+R+ + K G CGIAMMASYPIK+
Sbjct: 301 GYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPIKN 346

BLAST of Cla97C02G035510.1 vs. Swiss-Prot
Match: sp|P25803|CYSEP_PHAVU (Vignain OS=Phaseolus vulgaris OX=3885 PE=2 SV=2)

HSP 1 Score: 347.4 bits (890), Expect = 1.8e-94
Identity = 186/353 (52.69%), Postives = 235/353 (66.57%), Query Frame = 0

Query: 1   MEAYRMIWNVGLLSLTLWVFWTPSMASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSRE 60
           M   +++W V   SL L V       S +   +  +S   L D Y++W S +    +S  
Sbjct: 1   MATKKLLWVVLSFSLVLGV-----ANSFDFHDKDLASEESLWDLYERWRSHH-TVSRSLG 60

Query: 61  EWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDW-------- 120
           E  +RFN+++ N+ ++ N N ++  Y L  N FAD+TN EF++TY G K +         
Sbjct: 61  EKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTP 120

Query: 121 LPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMS 180
             +  F Y  +V++P +VDWRK+ AVT VKDQGQCGSCWAFS V AVEGIN+IKT KL++
Sbjct: 121 HENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVA 180

Query: 181 LSEQELVDCDVASGNQGCNGGYMYKAFEFIK-KTGLTTEIEYPYRGIESVCNKQKVRYRT 240
           LSEQELVDCD    NQGCNGG M  AFEFIK K G+TTE  YPY+  E  C+  KV    
Sbjct: 181 LSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLA 240

Query: 241 VTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIV 300
           V+I G+E VP NDE +L  AVANQPVSVAIDAGG DFQFYS GVF+G+C   LNHGVAIV
Sbjct: 241 VSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIV 300

Query: 301 GYGEASNKT-YWLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIKD 344
           GYG   + T YW+V+NSWG +WGE GYIRM+R+ + K G CGIAM+ SYPIK+
Sbjct: 301 GYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKN 346

BLAST of Cla97C02G035510.1 vs. Swiss-Prot
Match: sp|Q9FGR9|CEP1_ARATH (KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana OX=3702 GN=CEP1 PE=1 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 1.7e-92
Identity = 178/317 (56.15%), Postives = 218/317 (68.77%), Query Frame = 0

Query: 37  SSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADL 96
           S   L + Y++W S +    +S EE  +RFN+++ NV++I   N  + SY L  N F D+
Sbjct: 30  SENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDM 89

Query: 97  TNDEFKTTYLG--------FKTDWLPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQGQCG 156
           T++EF+ TY G        F+ +      F Y N+  LPT+VDWRK  AVTPVK+QGQCG
Sbjct: 90  TSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCG 149

Query: 157 SCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKAFEFIK-KTGL 216
           SCWAFS V AVEGIN+I+T KL SLSEQELVDCD  + NQGCNGG M  AFEFIK K GL
Sbjct: 150 SCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCD-TNQNQGCNGGLMDLAFEFIKEKGGL 209

Query: 217 TTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYD 276
           T+E+ YPY+  +  C+  K     V+I G+E VP N E  L  AVANQPVSVAIDAGG D
Sbjct: 210 TSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSD 269

Query: 277 FQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKT-YWLVKNSWGTDWGESGYIRMKRDSTD 336
           FQFYS GVF+G CG +LNHGVA+VGYG   + T YW+VKNSWG +WGE GYIRM+R    
Sbjct: 270 FQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRH 329

Query: 337 KRGTCGIAMMASYPIKD 344
           K G CGIAM ASYP+K+
Sbjct: 330 KEGLCGIAMEASYPLKN 344

BLAST of Cla97C02G035510.1 vs. Swiss-Prot
Match: sp|Q9STL4|CEP2_ARATH (KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana OX=3702 GN=CEP2 PE=1 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 2.8e-92
Identity = 177/325 (54.46%), Postives = 216/325 (66.46%), Query Frame = 0

Query: 29  EMDYRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNHSYTL 88
           + D +   S   L   Y +W S +    +S  E E+RFN+++ NV ++ N N  N SY L
Sbjct: 22  DYDDKEIESEEGLSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKL 81

Query: 89  AENSFADLTNDEFKTTYLGFKTD----------WLPDTWFRYGNMVNLPTNVDWRKENAV 148
             N FADLT +EFK  Y G                    + + N+  LP++VDWRK+ AV
Sbjct: 82  KLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAV 141

Query: 149 TPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKA 208
           T +K+QG+CGSCWAFS VAAVEGINKIKT KL+SLSEQELVDCD    N+GCNGG M  A
Sbjct: 142 TEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQ-NEGCNGGLMEIA 201

Query: 209 FEFIKKT-GLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPV 268
           FEFIKK  G+TTE  YPY GI+  C+  K     VTI G+E VP NDE +L  AVANQPV
Sbjct: 202 FEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPV 261

Query: 269 SVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWLVKNSWGTDWGESGY 328
           SVAIDAG  DFQFYS GVF+G+CG +LNHGVA VGYG    K YW+V+NSWG +WGE GY
Sbjct: 262 SVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGY 321

Query: 329 IRMKRDSTDKRGTCGIAMMASYPIK 343
           I+++R+  +  G CGIAM ASYPIK
Sbjct: 322 IKIEREIDEPEGRCGIAMEASYPIK 344

BLAST of Cla97C02G035510.1 vs. Swiss-Prot
Match: sp|A2XQE8|SAG39_ORYSI (Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_14861 PE=3 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 2.8e-92
Identity = 177/325 (54.46%), Postives = 226/325 (69.54%), Query Frame = 0

Query: 25  MASMEMDYRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNH 84
           + S  +  R  S    +  R+++WM++YGR Y+   E  +RF +++ NV +I++FN+ NH
Sbjct: 17  LCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH 76

Query: 85  SYTLAENSFADLTNDEFK--TTYLGF--KTDWLPDTWFRYGNMVN---LPTNVDWRKENA 144
           ++ L  N FADLTNDEF+   T  GF   T  +P T FRY N VN   LP  VDWR + A
Sbjct: 77  NFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVP-TGFRYEN-VNIDALPATVDWRTKGA 136

Query: 145 VTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYK 204
           VTP+KDQGQCG CWAFSAVAA+EGI K+ TGKL+SLSEQELVDCDV   +QGC GG M  
Sbjct: 137 VTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDD 196

Query: 205 AFEF-IKKTGLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQP 264
           AF+F IK  GLTTE  YPY   +  C  + V     +I GYE VP N+E +L  AVANQP
Sbjct: 197 AFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANNEAALMKAVANQP 256

Query: 265 VSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKT-YWLVKNSWGTDWGES 324
           VSVA+D G   FQFY GGV +G+CG  L+HG+  +GYG+AS+ T YWL+KNSWGT WGE+
Sbjct: 257 VSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGEN 316

Query: 325 GYIRMKRDSTDKRGTCGIAMMASYP 341
           G++RM++D +DKRG CG+AM  SYP
Sbjct: 317 GFLRMEKDISDKRGMCGLAMEPSYP 337

BLAST of Cla97C02G035510.1 vs. TAIR10
Match: AT1G06260.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 351.3 bits (900), Expect = 6.8e-97
Identity = 180/341 (52.79%), Postives = 231/341 (67.74%), Query Frame = 0

Query: 9   NVGLLSLTLWVFWTPSMASMEMD-YRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFN 68
           N+ L  L  +V     + S++   Y P  +   L+ R++KW+  + + Y  R+EW  RF 
Sbjct: 9   NLTLAVLICFVLIASKLCSVDSSVYDPHKT---LKQRFEKWLKTHSKLYGGRDEWMLRFG 68

Query: 69  IYQLNVQYIDNFNSLNHSYTLAENSFADLTNDEFKTTYLGFKTDWLPDTWFRYGNMV--- 128
           IYQ NVQ ID  NSL+  + L +N FAD+TN EFK  +LG  T  L     +    V   
Sbjct: 69  IYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSL--RLHKKQRPVCDP 128

Query: 129 --NLPTNVDWRKENAVTPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCD 188
             N+P  VDWR + AVTP+++QG+CG CWAFSAVAA+EGINKIKTG L+SLSEQ+L+DCD
Sbjct: 129 AGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCD 188

Query: 189 VASGNQGCNGGYMYKAFEFIKKT-GLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVP 248
           V + N+GC+GG M  AFEFIK   GL TE +YPY GIE  C+++K + + VTI GY+KV 
Sbjct: 189 VGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVA 248

Query: 249 VNDEKSLKAAVANQPVSVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTY 308
            N E SL+ A A QPVSV IDAGG+ FQ YS GVF+  CG  LNHGV +VGYG   ++ Y
Sbjct: 249 QN-EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQKY 308

Query: 309 WLVKNSWGTDWGESGYIRMKRDSTDKRGTCGIAMMASYPIK 343
           W+VKNSWGT WGE GYIRM+R  ++  G CGIAMMASYP++
Sbjct: 309 WIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPLQ 343

BLAST of Cla97C02G035510.1 vs. TAIR10
Match: AT5G50260.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 340.9 bits (873), Expect = 9.2e-94
Identity = 178/317 (56.15%), Postives = 218/317 (68.77%), Query Frame = 0

Query: 37  SSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFADL 96
           S   L + Y++W S +    +S EE  +RFN+++ NV++I   N  + SY L  N F D+
Sbjct: 30  SENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDM 89

Query: 97  TNDEFKTTYLG--------FKTDWLPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQGQCG 156
           T++EF+ TY G        F+ +      F Y N+  LPT+VDWRK  AVTPVK+QGQCG
Sbjct: 90  TSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCG 149

Query: 157 SCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKAFEFIK-KTGL 216
           SCWAFS V AVEGIN+I+T KL SLSEQELVDCD  + NQGCNGG M  AFEFIK K GL
Sbjct: 150 SCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCD-TNQNQGCNGGLMDLAFEFIKEKGGL 209

Query: 217 TTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYD 276
           T+E+ YPY+  +  C+  K     V+I G+E VP N E  L  AVANQPVSVAIDAGG D
Sbjct: 210 TSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSD 269

Query: 277 FQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKT-YWLVKNSWGTDWGESGYIRMKRDSTD 336
           FQFYS GVF+G CG +LNHGVA+VGYG   + T YW+VKNSWG +WGE GYIRM+R    
Sbjct: 270 FQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRH 329

Query: 337 KRGTCGIAMMASYPIKD 344
           K G CGIAM ASYP+K+
Sbjct: 330 KEGLCGIAMEASYPLKN 344

BLAST of Cla97C02G035510.1 vs. TAIR10
Match: AT3G48340.1 (Cysteine proteinases superfamily protein)

HSP 1 Score: 340.1 bits (871), Expect = 1.6e-93
Identity = 177/325 (54.46%), Postives = 216/325 (66.46%), Query Frame = 0

Query: 29  EMDYRPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNHSYTL 88
           + D +   S   L   Y +W S +    +S  E E+RFN+++ NV ++ N N  N SY L
Sbjct: 22  DYDDKEIESEEGLSTLYDRWRSHHSVP-RSLNEREKRFNVFRHNVMHVHNTNKKNRSYKL 81

Query: 89  AENSFADLTNDEFKTTYLGFKTD----------WLPDTWFRYGNMVNLPTNVDWRKENAV 148
             N FADLT +EFK  Y G                    + + N+  LP++VDWRK+ AV
Sbjct: 82  KLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAV 141

Query: 149 TPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKA 208
           T +K+QG+CGSCWAFS VAAVEGINKIKT KL+SLSEQELVDCD    N+GCNGG M  A
Sbjct: 142 TEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQ-NEGCNGGLMEIA 201

Query: 209 FEFIKKT-GLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPV 268
           FEFIKK  G+TTE  YPY GI+  C+  K     VTI G+E VP NDE +L  AVANQPV
Sbjct: 202 FEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPV 261

Query: 269 SVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWLVKNSWGTDWGESGY 328
           SVAIDAG  DFQFYS GVF+G+CG +LNHGVA VGYG    K YW+V+NSWG +WGE GY
Sbjct: 262 SVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSERGKKYWIVRNSWGAEWGEGGY 321

Query: 329 IRMKRDSTDKRGTCGIAMMASYPIK 343
           I+++R+  +  G CGIAM ASYPIK
Sbjct: 322 IKIEREIDEPEGRCGIAMEASYPIK 344

BLAST of Cla97C02G035510.1 vs. TAIR10
Match: AT5G45890.1 (senescence-associated gene 12)

HSP 1 Score: 338.6 bits (867), Expect = 4.6e-93
Identity = 172/324 (53.09%), Postives = 228/324 (70.37%), Query Frame = 0

Query: 33  RPGSSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSL--NHSYTLAE 92
           RP  +   +Q R+ +WM+K+GR Y   +E   R+ +++ NV+ I++ NS+    ++ LA 
Sbjct: 26  RPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAV 85

Query: 93  NSFADLTNDEFKTTYLGFK----------TDWLPDTWFRYGNMVN--LPTNVDWRKENAV 152
           N FADLTNDEF++ Y GFK          T   P   FRY N+ +  LP +VDWRK+ AV
Sbjct: 86  NQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSP---FRYQNVSSGALPVSVDWRKKGAV 145

Query: 153 TPVKDQGQCGSCWAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKA 212
           TP+K+QG CG CWAFSAVAA+EG  +IK GKL+SLSEQ+LVDCD  + + GC GG M  A
Sbjct: 146 TPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD--TNDFGCEGGLMDTA 205

Query: 213 FEFIKKT-GLTTEIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPV 272
           FE IK T GLTTE  YPY+G ++ CN +K   +  +I+GYE VPVNDE++L  AVA+QPV
Sbjct: 206 FEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPV 265

Query: 273 SVAIDAGGYDFQFYSGGVFSGNCGKQLNHGVAIVGYGEASN-KTYWLVKNSWGTDWGESG 332
           SV I+ GG+DFQFYS GVF+G C   L+H V  +GYGE++N   YW++KNSWGT WGESG
Sbjct: 266 SVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESG 325

Query: 333 YIRMKRDSTDKRGTCGIAMMASYP 341
           Y+R+++D  DK+G CG+AM ASYP
Sbjct: 326 YMRIQKDVKDKQGLCGLAMKASYP 344

BLAST of Cla97C02G035510.1 vs. TAIR10
Match: AT4G35350.1 (xylem cysteine peptidase 1)

HSP 1 Score: 336.7 bits (862), Expect = 1.7e-92
Identity = 173/313 (55.27%), Postives = 216/313 (69.01%), Query Frame = 0

Query: 36  SSSGDLQDRYQKWMSKYGREYKSREEWEQRFNIYQLNVQYIDNFNSLNHSYTLAENSFAD 95
           +++  L + ++ WMS++ + YKS EE   RF +++ N+ +ID  N+  +SY L  N FAD
Sbjct: 42  TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFAD 101

Query: 96  LTNDEFKTTYLG-----FKTDWLPDTWFRYGNMVNLPTNVDWRKENAVTPVKDQGQCGSC 155
           LT++EFK  YLG     F     P   FRY ++ +LP +VDWRK+ AV PVKDQGQCGSC
Sbjct: 102 LTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSC 161

Query: 156 WAFSAVAAVEGINKIKTGKLMSLSEQELVDCDVASGNQGCNGGYMYKAFEFIKKT-GLTT 215
           WAFS VAAVEGIN+I TG L SLSEQEL+DCD  + N GCNGG M  AF++I  T GL  
Sbjct: 162 WAFSTVAAVEGINQITTGNLSSLSEQELIDCD-TTFNSGCNGGLMDYAFQYIISTGGLHK 221

Query: 216 EIEYPYRGIESVCNKQKVRYRTVTISGYEKVPVNDEKSLKAAVANQPVSVAIDAGGYDFQ 275
           E +YPY   E +C +QK     VTISGYE VP ND++SL  A+A+QPVSVAI+A G DFQ
Sbjct: 222 EDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 281

Query: 276 FYSGGVFSGNCGKQLNHGVAIVGYGEASNKTYWLVKNSWGTDWGESGYIRMKRDSTDKRG 335
           FY GGVF+G CG  L+HGVA VGYG +    Y +VKNSWG  WGE G+IRMKR++    G
Sbjct: 282 FYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 341

Query: 336 TCGIAMMASYPIK 343
            CGI  MASYP K
Sbjct: 342 LCGINKMASYPTK 353

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008458487.18.4e-17083.82PREDICTED: ervatamin-B-like [Cucumis melo][more]
KGN62053.19.7e-16682.94hypothetical protein Csa_2G292830 [Cucumis sativus][more]
XP_022140756.15.5e-16180.76ervatamin-B [Momordica charantia][more]
XP_023513224.15.1e-15979.88ervatamin-B-like [Cucurbita pepo subsp. pepo][more]
XP_022944762.11.5e-15879.30ervatamin-B-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A1S3C828|A0A1S3C828_CUCME5.6e-17083.82ervatamin-B-like OS=Cucumis melo OX=3656 GN=LOC103497881 PE=3 SV=1[more]
tr|A0A0A0LJV6|A0A0A0LJV6_CUCSA6.4e-16682.94Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G292830 PE=3 SV=1[more]
tr|M5X1M2|M5X1M2_PRUPE2.0e-11960.93Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G125700 PE=3 SV=1[more]
tr|A0A2P4KBT5|A0A2P4KBT5_QUESU1.7e-11861.96Senescence-specific cysteine protease sag12 OS=Quercus suber OX=58331 GN=CFP56_4... [more]
tr|A0A2I4EXQ7|A0A2I4EXQ7_9ROSI2.2e-11862.14ervatamin-B-like OS=Juglans regia OX=51240 GN=LOC108993649 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
sp|P12412|CYSEP_VIGMU2.5e-9653.82Vignain OS=Vigna mungo OX=3915 PE=1 SV=1[more]
sp|P25803|CYSEP_PHAVU1.8e-9452.69Vignain OS=Phaseolus vulgaris OX=3885 PE=2 SV=2[more]
sp|Q9FGR9|CEP1_ARATH1.7e-9256.15KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana OX=3702 GN=CEP1 ... [more]
sp|Q9STL4|CEP2_ARATH2.8e-9254.46KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana OX=3702 GN=CEP2 ... [more]
sp|A2XQE8|SAG39_ORYSI2.8e-9254.46Senescence-specific cysteine protease SAG39 OS=Oryza sativa subsp. indica OX=399... [more]
Match NameE-valueIdentityDescription
AT1G06260.16.8e-9752.79Cysteine proteinases superfamily protein[more]
AT5G50260.19.2e-9456.15Cysteine proteinases superfamily protein[more]
AT3G48340.11.6e-9354.46Cysteine proteinases superfamily protein[more]
AT5G45890.14.6e-9353.09senescence-associated gene 12[more]
AT4G35350.11.7e-9255.27xylem cysteine peptidase 1[more]
The following terms have been associated with this mRNA:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0008234cysteine-type peptidase activity
Vocabulary: INTERPRO
TermDefinition
IPR038765Papain_like_cys_pep_sf
IPR039417Peptidase_C1A_papain-like
IPR025661Pept_asp_AS
IPR025660Pept_his_AS
IPR000169Pept_cys_AS
IPR013128Peptidase_C1A
IPR013201Prot_inhib_I29
IPR000668Peptidase_C1A_C
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0055114 oxidation-reduction process
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0032440 2-alkenal reductase [NAD(P)] activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0008233 peptidase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla97C02G035510Cla97C02G035510gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C02G035510.1.CDS.2Cla97C02G035510.1.CDS.2CDS
Cla97C02G035510.1.CDS.1Cla97C02G035510.1.CDS.1CDS


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla97C02G035510.1.exon.2Cla97C02G035510.1.exon.2exon
Cla97C02G035510.1.exon.1Cla97C02G035510.1.exon.1exon


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla97C02G035510.1Cla97C02G035510.1-proteinpolypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 301..307
score: 75.76
coord: 144..159
score: 67.03
coord: 286..296
score: 54.87
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 126..341
e-value: 2.2E-119
score: 412.6
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 126..341
e-value: 8.0E-83
score: 277.8
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 45..101
e-value: 4.7E-21
score: 86.0
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 45..101
e-value: 6.6E-14
score: 52.0
NoneNo IPR availableGENE3DG3DSA:3.90.70.10coord: 26..343
e-value: 1.2E-116
score: 391.6
NoneNo IPR availablePANTHERPTHR12411:SF545SUBFAMILY NOT NAMEDcoord: 24..343
IPR013128Peptidase C1APANTHERPTHR12411CYSTEINE PROTEASE FAMILY C1-RELATEDcoord: 24..343
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 144..155
IPR025660Cysteine peptidase, histidine active sitePROSITEPS00639THIOL_PROTEASE_HIScoord: 284..294
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 301..320
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 127..340
e-value: 2.65058E-109
score: 319.57
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILYSSF54001Cysteine proteinasescoord: 41..342