Cla97C01G016150 (gene) Watermelon (97103) v2.5

Overview
NameCla97C01G016150
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionUPF0481 protein At3g47200
LocationCla97Chr01: 29898064 .. 29903082 (+)
RNA-Seq ExpressionCla97C01G016150
SyntenyCla97C01G016150
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCACTTTTGAGTTCCTTCAGGCTATAAATAACCATAGCCTCTTCCCCACTTTCTCCATAACTCAGAATCGAACAATCTTCCCTGTTCTCCAAGCAACCTTCGCCATGGCCACCGCCACCACTTTGCTCGCCCTGCTCTCCTTCTTCTTCCTATCTAATTCTGCCTCCGCTCTCACCCGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCTGTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCGGAGAATCGGACTTATAAGGTTGGATTGAACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAGACCGCCAGCCGCCGATACGCCGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGACCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGTGAGTTTTTTCAAATATCAACCGATGTTGAGAATATTTTGAGTTTTCTAAATATTTTTCCGAAATAAATACGATTAAACTGGGTAAAACTGGGTTGAATATTATTGATCAGCCCAAATAATATATATAGGCCCACAAAGAAAGACACGGCCCAAATATAAAAGAAGGTATAATAAAATTATACCATTATACACCCTTAATAATTATATCATTCAAAATATACATGCATATGTGTGTATGTTTCTACCATACATTCGAATCTCTGATCTTGAGTTACGGTATGTAGGAACAATTCTTTGAATTAAAGTAAGCCGTCAGACGATTAACTATGAAATTATAGCTTTGAATCAATTTTGTAGTAATCTCACAATTATGGTTGTTTTCAAATATAAAAAAATGAATCAAATTATTTATAGATATAACAAATTTCATTGTCTATTTGCGACAGCTGCAATAGATTGTGATATTTTATAAATATTTTTTAGTAATTTTGTCATTTAAAATAATTTTTTGAAAAAAATTTAATTGAGATAGTATGATTGAAAGTAATTATAAAGGAGGAATAGTTAGATCGAGTTTCCATGCAAAGTAAAAAGAAGAAAAAACAATTCATTTTCTAAAAAAGAATGTTTGCCAAATATTTTATTAAGTAAAAAAATACATTATTTTTTAAAATAAATATTTGCTACGTGATAAATTTTTATTTAACTAAGTGGCATAAAATTGATACAGTTCAACCAACATCTACCTATTTTTCAATTAAAAATTATGAAAAAGAAAATGCAAGAGGGATGGCCTTGGATTTCTGATGCAAAGTTAGTGAATGACGAGCTTTCTTGTTTGATAAAAACTCAATTGGAAAGAAGGTTAATTATTTGAAGGTGGAAAGGATAAATCTCCTCTCTTAAATAACTCAAATTAATTATTTGGATTAGAAAAATCATAACAAGATGTCTTTAAAAGGAAATATTTCCTTCTATTGCTAATTAGATAAATATGGTGTAATTTAATCCAAAAATGGTGAATCCAAACAAATGTACCTTCATGCATGTATATAAAGGACATGGGCTACTTTCTTTGTTTTTCAATAAGTCAAACGGACCTATACAAATGATAATGGATGTACGCACGAATTTGTTGCACTGAAAATGTTAGAATACAAATTAATTTGGAAGATTATAAAGTTAGTGCAGAGATCTTGGCATGCGTTAATTATGTGCGAAAACCCATTACAATTGGAAACAAGTGAGCTCAGTAATGTCAAATGTTTGTGTTGTTGGGGGAAACAGGGAGCTGCTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAAATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGCCTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGCGATCCCACCAGGGTGAGATTATTTCTGCTTTCATTTTCCGTCATCTCTTTGACCTTTTTACCTGGCTGGATTTTAATTCCCAACATTTCTCTCTCTCTCTCTCAGAAAAATGCCAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGTCGCCATTGAAGCTAGTGGCTTAGCTTTGCAACTCTACCAGTCGGTGAGAAAACCTTTCTCTCACACACTTCTCAAACTGTCCACTTCAATCCATAAATTGTGATTGAATCACATTTGATTGAACTTTACAGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCATCGCTGTTGGTTATGGCACAGAGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAAGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAAGTGTGGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGTAACAACCCAACAAAATCATACTTAAGTTTGGAAACAACTAATGGGGACAAGAACAAGATCAACACTGCTTGAATGGAGAGAGGTTGTGAAACATCTGCTACTTCTAGTTTGATGGAAGTTGATTATGTCTTTCGATGTTTCAATTCATGTTGGTCTGAAGTTTGGCCTATTGCTATACCTAAAATTTGTGCTTGTAAATGATTCAGTTCAATATCAATGTTCCGTAACAGTTAAATGATATTTGTGGCATTTAAGATGGGGAGTTGTCTCTTTGTACTATCACAAAAATAGTTATGATAATGACTTAATGAGGCAAGTTTCCTCTCATGATTTAGCCCATTGCTTCTACCCACTGCTATCATTGAGGTCAACTGTGGGAGAGGCAAGATGGTGGCAATATTTAATACTTGAAAATTCCAGGGAAATTTCATCTCTCCATATGAATGATATAATAATACAAAAGTAAAAGGAGAGAGAGGGGAAAGGAGAACCGTGCACTTCTGTTATAAAGAAAACACAAGAAGGGAAAACAGAATTGGAATTAAAGAATGGGAGTTTGATTTTTCAATGAAAATCACATTCACGCCAAACATCTTAGATAAACCTTCCAAGCAAATGGGTCCATTTCGTTTTTAATTATCTACACCTTTGCTTCACGTCTATTGGAAACTTAAAGCATCAATGATTTGGATATTTTCCCTTCTCATTAATTATTACGTCCTAGTTGTCTTTGTTTGAAGTACAGTGGTATCATTTCTGGGGAATGAGAGTTCTTCTAAAGCCAGTCAAAGTCTTAACAATTTCAGTTCTGTGCACAAAATGGTGGCTGTGTTCAATAAAGAGTTATTGAGCTGGTACCTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATGGAAAACCAGAAATCCAGCTCCAGGAACAGAAACAGATTCAATCAGAATCCCATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAGACAGAATGGGTTGTCACCATCAAGGAAAAGCTTCACCAAGCTCATCAAGATGAAGTAGAAAGTACATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGATGGTGAAGACAAAGCTGTTGTTCCTCAGATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACATCCTAGAGAGAGCAAAGCAGGACATAAAGATTTATCTGGACGGCATGAAAGAACTTGAAGAAAAAGCCCGTAGTTGTTATGAAGGACCGCTTAGTTTAAGCAGCAATGAATTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAATGCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTTGAGCTTCAGCTTGGTGACCACTACCAGAAAGGACTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCAACCGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTGGCCCGAAATTAGCACCGAAAGTGTGGATCAAACGGCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGATTTTGGGACATAAATTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTTGATTGCAGCAATGACATAACCTCTTATGTGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGGAAGTGATGAAGAAGTTGCAGAGCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACAACCATAGATGGAATGCTTGGAGAGCAACTTTGAAACACAACTACTTCGGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTTGCTTACTTTTGCACAAGCCTTCTATGGAGTTTATGCTTATTACAAACCCCCAAATTGA

mRNA sequence

CTCACTTTTGAGTTCCTTCAGGCTATAAATAACCATAGCCTCTTCCCCACTTTCTCCATAACTCAGAATCGAACAATCTTCCCTGTTCTCCAAGCAACCTTCGCCATGGCCACCGCCACCACTTTGCTCGCCCTGCTCTCCTTCTTCTTCCTATCTAATTCTGCCTCCGCTCTCACCCGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCTGTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCGGAGAATCGGACTTATAAGGTTGGATTGAACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAGACCGCCAGCCGCCGATACGCCGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGACCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGCTGCTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAAATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGCCTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGCGATCCCACCAGGAAAAATGCCAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGTCGCCATTGAAGCTAGTGGCTTAGCTTTGCAACTCTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCATCGCTGTTGGTTATGGCACAGAGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAAGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAAGTGTGGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGTTGTGAAACATCTGCTACTTCTAGTTTGATGGAAGTTGATTATGTCTTTCGATGTTTCAATTCATGTTGTACAGTGGTATCATTTCTGGGGAATGAGAGTTCTTCTAAAGCCAGTCAAAGTCTTAACAATTTCAGTTCTGTGCACAAAATGGTGGCTGTGTTCAATAAAGAGTTATTGAGCTGGTACCTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATGGAAAACCAGAAATCCAGCTCCAGGAACAGAAACAGATTCAATCAGAATCCCATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAGACAGAATGGGTTGTCACCATCAAGGAAAAGCTTCACCAAGCTCATCAAGATGAAGTAGAAAGTACATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGATGGTGAAGACAAAGCTGTTGTTCCTCAGATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACATCCTAGAGAGAGCAAAGCAGGACATAAAGATTTATCTGGACGGCATGAAAGAACTTGAAGAAAAAGCCCGTAGTTGTTATGAAGGACCGCTTAGTTTAAGCAGCAATGAATTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAATGCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTTGAGCTTCAGCTTGGTGACCACTACCAGAAAGGACTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCAACCGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTGGCCCGAAATTAGCACCGAAAGTGTGGATCAAACGGCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGATTTTGGGACATAAATTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTTGATTGCAGCAATGACATAACCTCTTATGTGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGGAAGTGATGAAGAAGTTGCAGAGCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACAACCATAGATGGAATGCTTGGAGAGCAACTTTGAAACACAACTACTTCGGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTTGCTTACTTTTGCACAAGCCTTCTATGGAGTTTATGCTTATTACAAACCCCCAAATTGA

Coding sequence (CDS)

ATGGCCACCGCCACCACTTTGCTCGCCCTGCTCTCCTTCTTCTTCCTATCTAATTCTGCCTCCGCTCTCACCCGCCGGAGCGATGGCGAGGTTAGAGAAATCTACGACCTGTGGCTGGCGAAGCACGGCAAGGCCTATAACGGAATCGAAGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTGAACTTCATCGATGAACATAATTCGGAGAATCGGACTTATAAGGTTGGATTGAACATGTTCGCGGATTTGACCAACGACGAGTATCGGGCTGTGTATTTGGGGACTAGGTCTCCTCCTGCTCGACGAGTCATGAAGGCCAAGACCGCCAGCCGCCGATACGCCGTCAACATCCGCGATCGGTTGCCGGAATCTGTCGATTGGAGGACCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAGCTGCTGGGCATTCTCGACCATAGCAGCTGTTGAAGGCATAAATCAAATCGTCACCGGAGAACTCATCTCTCTCTCTGAACAAGAGCTTGTTAGCTGTGACAAAAAGTACAATTCAGGTTGCAATGGAGGCCTTATGGACTATGCCTTCCAGTTCATCATTGACAATGGCGGCTTGGACACCGAGGAAGATTATCCTTATGAAGGCTTTGATGGTCAATGCGATCCCACCAGGAAAAATGCCAAGGTCGTTAGCATTGACGGGTACGAGGATGTCCCTGCTGATGACGAGAAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGCGTCGCCATTGAAGCTAGTGGCTTAGCTTTGCAACTCTACCAGTCGGGTGTATTCACTGGTAAATGTGGCTCAGCTCTCGACCATGGTGTCATCGCTGTTGGTTATGGCACAGAGAACGGAGTTGATTATTGGCTTGTAAGGAACTCATGGGGCACAGGATGGGGTGAAGATGGCTACTTCAAGCTAGAGCGCAATGTAAAGCACACTACCAATGGCAAGTGTGGGATCGCAATGATGGCTTCTTACCCTGTTAAGAATGGTTGTGAAACATCTGCTACTTCTAGTTTGATGGAAGTTGATTATGTCTTTCGATGTTTCAATTCATGTTGTACAGTGGTATCATTTCTGGGGAATGAGAGTTCTTCTAAAGCCAGTCAAAGTCTTAACAATTTCAGTTCTGTGCACAAAATGGTGGCTGTGTTCAATAAAGAGTTATTGAGCTGGTACCTGATCACCCTCAAGCTCAGAGAAACGGTAGAATCTGGACTTCCCAGAAACTCACTTTCAGCCAATTCTGTTGATTCTCATGGAAAACCAGAAATCCAGCTCCAGGAACAGAAACAGATTCAATCAGAATCCCATCATGTTATAATAGAAGATGAAGATCAGAAGCTTGAAGAAGACCCCGAATCACCTGAGACAGAATGGGTTGTCACCATCAAGGAAAAGCTTCACCAAGCTCATCAAGATGAAGTAGAAAGTACATGGGCAAAGCTCTGCATTTACAAGGTCCCTCACTACCTGAAAGATGGTGAAGACAAAGCTGTTGTTCCTCAGATTGTCTCTTTAGGACCTTACCACCATGGAAAGCGCCGGCTCCGGCAAATGGAACGCCATAAATGGCGGTCGCTTTATCACATCCTAGAGAGAGCAAAGCAGGACATAAAGATTTATCTGGACGGCATGAAAGAACTTGAAGAAAAAGCCCGTAGTTGTTATGAAGGACCGCTTAGTTTAAGCAGCAATGAATTTGTGGAAATGATGGTGCTCGATGGTTGCTTTGTGCTTGAACTCTTCAGAGGAGCTGCAGAAGGATTCAAACAACTTGGGTATCCTCGAAATGATCCAATCTTCGCAATGCGTGGCTCAATGCATTCGATCCAGAGGGATATGATAATGCTGGAAAATCAGTTGCCCTTGTTTGTATTGGATCGACTGCTTGAGCTTCAGCTTGGTGACCACTACCAGAAAGGACTCGTAGCCGAATTAGCACTCAGATTCTTCGATCCATTAACCCCAAACGATGAACCCTTAACCAAAAGTAGCTTGAACAAATTAGAATCATCTCTCGGAAACGCAACCGCCTTTGACCCGCTTGGTTATCAAGACGGACTTCATTGCCTCGATGTTTTTCGACGAAGTCTCCTCCGATCTGGCCCGAAATTAGCACCGAAAGTGTGGATCAAACGGCGGTCTCATGCGAATCGGGTGGCCGATAAACGGAGGCAGCAATTGATTCACTGTGTGAAAGAGTTGAAAGAGGCAGGGATCAGATTTCAGAAGAAGAAAACCGATCGATTTTGGGACATAAATTTCAACAATGGGGTTATGCAAATTCCACGACTATTGATTCACGATGGAACTAGGTCATTGTTTCTCAATCTAATAGCATTCGAACAATGTCATCTTGATTGCAGCAATGACATAACCTCTTATGTGGTTTTCATGGATAATCTAATAGATTCTCATGAAGATGTTGCTTACCTCCATTACTGTGGAATAATAGAGCATTGGCTTGGAAGTGATGAAGAAGTTGCAGAGCTTTTCAATCGTCTCTGTCAAGAGGTAGTTTATGATATCAATGATAGCTATCTTTCCCAATTGTCTGAGGATGTGAATCGCTACTACAACCATAGATGGAATGCTTGGAGAGCAACTTTGAAACACAACTACTTCGGTAATCCATGGGCCATTATCTCTTTGGTTGCAGCAGTAGTTCTTTTGTTGCTTACTTTTGCACAAGCCTTCTATGGAGTTTATGCTTATTACAAACCCCCAAATTGA

Protein sequence

MATATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNGCETSATSSLMEVDYVFRCFNSCCTVVSFLGNESSSKASQSLNNFSSVHKMVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
Homology
BLAST of Cla97C01G016150 vs. NCBI nr
Match: XP_038880921.1 (UPF0481 protein At3g47200-like [Benincasa hispida])

HSP 1 Score: 1038.9 bits (2685), Expect = 2.7e-299
Identity = 509/531 (95.86%), Postives = 521/531 (98.12%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDS GK E QLQE KQIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSLGKSEPQLQELKQIQSESHHV 60

Query: 453 IIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAV 512
           IIEDEDQKLEEDPESPE+EWV+TIKEKL+QAHQDEVES+WAKLCIYKVPHYLKDGEDKAV
Sbjct: 61  IIEDEDQKLEEDPESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDKAV 120

Query: 513 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGP 572
           VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER KQDIK+YLD MKELEE+AR+CYEGP
Sbjct: 121 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERTKQDIKLYLDAMKELEERARNCYEGP 180

Query: 573 LSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 632
            S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN
Sbjct: 181 FSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 240

Query: 633 QLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 692
           QLPLFVLDRLL LQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF
Sbjct: 241 QLPLFVLDRLLGLQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 300

Query: 693 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG 752
           DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELKEAG
Sbjct: 301 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWMKRRSHANRVADKRRQQLIHCVKELKEAG 360

Query: 753 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 812
           +RF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM
Sbjct: 361 VRFRKKKTDRFWDINFNNGVMEIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 420

Query: 813 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 872
           DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDIN+SYLSQLSEDVNRYY
Sbjct: 421 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINNSYLSQLSEDVNRYY 480

Query: 873 NHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           NHRWNAWRATLKHNYF NPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN
Sbjct: 481 NHRWNAWRATLKHNYFSNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 531

BLAST of Cla97C01G016150 vs. NCBI nr
Match: XP_011657877.1 (UPF0481 protein At3g47200 [Cucumis sativus] >KGN48549.1 hypothetical protein Csa_003183 [Cucumis sativus])

HSP 1 Score: 1026.2 bits (2652), Expect = 1.8e-295
Identity = 502/534 (94.01%), Postives = 523/534 (97.94%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKLRETVESGLPRNS+SANSVDSHGK E+QLQE KQIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLRETVESGLPRNSISANSVDSHGKSELQLQESKQIQSESHHV 60

Query: 453 IIEDEDQKL-EEDP--ESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGED 512
           I+E+EDQKL EEDP  ESP +EWV+TIKEKL+QAHQDEVES+WAKLCIYKVPHYLKDGED
Sbjct: 61  IVENEDQKLEEEDPELESPVSEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGED 120

Query: 513 KAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCY 572
           KAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHIL+R+KQDIK+YLD MKELEE+AR+CY
Sbjct: 121 KAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILDRSKQDIKLYLDAMKELEERARNCY 180

Query: 573 EGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIM 632
           EGP S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIM
Sbjct: 181 EGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIM 240

Query: 633 LENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA 692
           LENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 
Sbjct: 241 LENQLPLFVLDRLLELQLGDNYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNT 300

Query: 693 TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELK 752
           TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK
Sbjct: 301 TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWMKRRSHANRVADKRRQQLIHCVKELK 360

Query: 753 EAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV 812
           +AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV
Sbjct: 361 DAGIRFKKKKTDRFWDINFNNGVMEIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV 420

Query: 813 VFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVN 872
           VFMDNLIDSHEDV+YLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVN
Sbjct: 421 VFMDNLIDSHEDVSYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVN 480

Query: 873 RYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           RYYNHRWNAWRATLKHNYF NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Sbjct: 481 RYYNHRWNAWRATLKHNYFSNPWAIISLIAAVVLLLLTFAQAFYGVFAYYKPPN 534

BLAST of Cla97C01G016150 vs. NCBI nr
Match: XP_008440314.1 (PREDICTED: UPF0481 protein At3g47200 [Cucumis melo] >TYK12866.1 UPF0481 protein [Cucumis melo var. makuwa])

HSP 1 Score: 1023.5 bits (2645), Expect = 1.2e-294
Identity = 499/533 (93.62%), Postives = 521/533 (97.75%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+V
Sbjct: 1   MVAVFNKELLSWYLITLKLRETVESGLPRSSISANSVDSHGKSELQLGEPKQIQSESHNV 60

Query: 453 IIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDK 512
           IIE+ED KLEEDP  ESPE+EWV+TIKEKL+QAHQDEVES+WAKLCIYKVPHYLKDGEDK
Sbjct: 61  IIENEDHKLEEDPEFESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDK 120

Query: 513 AVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYE 572
           AVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLD MKELEE+AR+CYE
Sbjct: 121 AVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERSKHDIKLYLDAMKELEERARNCYE 180

Query: 573 GPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML 632
           GP S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML
Sbjct: 181 GPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML 240

Query: 633 ENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT 692
           ENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Sbjct: 241 ENQLPLFVLDRLLELQLGDYYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNTT 300

Query: 693 AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKE 752
           AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+
Sbjct: 301 AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWMKRRSHANRVADKRRQQLIHCVKELKD 360

Query: 753 AGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV 812
           AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV
Sbjct: 361 AGIRFKKKKTDRFWDINFNNGVMEIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV 420

Query: 813 FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNR 872
           FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNR
Sbjct: 421 FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNR 480

Query: 873 YYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           YYNHRWNAWRATLKHNYF NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Sbjct: 481 YYNHRWNAWRATLKHNYFSNPWAIISLIAAVVLLLLTFAQAFYGVFAYYKPPN 533

BLAST of Cla97C01G016150 vs. NCBI nr
Match: XP_023003973.1 (UPF0481 protein At3g47200-like [Cucurbita maxima])

HSP 1 Score: 1011.9 bits (2615), Expect = 3.5e-291
Identity = 493/531 (92.84%), Postives = 512/531 (96.42%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKL+ETVESGLPRNS S NSVDSHGKP++QLQE +QIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLKETVESGLPRNSNSPNSVDSHGKPDLQLQEYRQIQSESHHV 60

Query: 453 IIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAV 512
           I+EDEDQKLEED ESPE+EWV++IKEKL QAHQDEVES+WAKLCIYKVPHYLKDG+DKAV
Sbjct: 61  IVEDEDQKLEEDSESPESEWVISIKEKLDQAHQDEVESSWAKLCIYKVPHYLKDGDDKAV 120

Query: 513 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGP 572
           VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE ARSCYEGP
Sbjct: 121 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERTKHDINIYLDAMKELEENARSCYEGP 180

Query: 573 LSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 632
            S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN
Sbjct: 181 FSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 240

Query: 633 QLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 692
           QLPLFVLDRLL +QLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Sbjct: 241 QLPLFVLDRLLGIQLGENYQKGLLAELALRFFDPLTPNDEPLTKSNLNKLESSLRNATAF 300

Query: 693 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG 752
           DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAG
Sbjct: 301 DPLGNQDGLHCLDVFRRSLLRSGQKLAPKVWIKRRSHAHRVADKRRQQLIHCVKELKEAG 360

Query: 753 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 812
           IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM
Sbjct: 361 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 420

Query: 813 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 872
           DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY
Sbjct: 421 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 480

Query: 873 NHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           NHRWNAWRA+LKHNYF NPWAIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Sbjct: 481 NHRWNAWRASLKHNYFSNPWAIISLIAAVVLLLLTFAQTFYGVYGYYRPPN 531

BLAST of Cla97C01G016150 vs. NCBI nr
Match: XP_023518140.1 (UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1011.1 bits (2613), Expect = 6.0e-291
Identity = 493/531 (92.84%), Postives = 511/531 (96.23%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKL+ETVESGLPRNS SANSVDSHGKPE+QLQE +QIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLKETVESGLPRNSNSANSVDSHGKPELQLQEYRQIQSESHHV 60

Query: 453 IIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAV 512
           I+EDEDQKLEED ESPE+EWV++IKEKL QAHQDEVES+WAKLCIYKVPHYLKDG+DKAV
Sbjct: 61  IVEDEDQKLEEDSESPESEWVISIKEKLDQAHQDEVESSWAKLCIYKVPHYLKDGDDKAV 120

Query: 513 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGP 572
           VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE ARSCYEGP
Sbjct: 121 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERTKHDINIYLDAMKELEENARSCYEGP 180

Query: 573 LSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 632
            S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN
Sbjct: 181 FSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 240

Query: 633 QLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 692
           QLPLFVLDRLL LQLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Sbjct: 241 QLPLFVLDRLLGLQLGENYQKGLLAELALRFFDPLTPNDEPLTKSNLNKLESSLRNATAF 300

Query: 693 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG 752
           DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSH +RVADKRRQQLIHCVKELKEAG
Sbjct: 301 DPLGNQDGLHCLDVFRRSLLRSGQKLAPKVWIKRRSHTHRVADKRRQQLIHCVKELKEAG 360

Query: 753 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 812
           IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM
Sbjct: 361 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 420

Query: 813 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 872
           DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQ+SEDVN YY
Sbjct: 421 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQVSEDVNHYY 480

Query: 873 NHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           NHRWNAWRA+LKHNYF NPWAIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Sbjct: 481 NHRWNAWRASLKHNYFSNPWAIISLIAAVVLLLLTFAQTFYGVYGYYRPPN 531

BLAST of Cla97C01G016150 vs. ExPASy Swiss-Prot
Match: Q9FMH8 (Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 3.4e-124
Identity = 224/362 (61.88%), Postives = 267/362 (73.76%), Query Frame = 0

Query: 24  TRRSDGEVREIYDLWLAKHGKA---YNGI-EEREKRFQIFKENLNFIDEHNSENRTYKVG 83
           T RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFK+NL FIDEHN++N +YK+G
Sbjct: 39  TSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLG 98

Query: 84  LNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPV 143
           L  FADLTN+EYR++YLG +  P +RV+K    S RY   + D LP+SVDWR  GAVA V
Sbjct: 99  LTRFADLTNEEYRSMYLGAK--PTKRVLK---TSDRYQARVGDALPDSVDWRKEGAVADV 158

Query: 144 KNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFI 203
           K+QGSCGSCWAFSTI AVEGIN+IVTG+LISLSEQELV CD  YN GCNGGLMDYAF+FI
Sbjct: 159 KDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFI 218

Query: 204 IDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAI 263
           I NGG+DTE DYPY+  DG+CD  RKNAKVV+ID YEDVP + E +LKKA+AHQP+SVAI
Sbjct: 219 IKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAI 278

Query: 264 EASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLE 323
           EA G A QLY SGVF G CG+ LDHGV+AVGYGTENG DYW+VRNSWG  WGE GY K+ 
Sbjct: 279 EAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMA 338

Query: 324 RNVKHTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRC--FNSCCTVV 372
           RN++  T GKCGIAM ASYP+K G           S        D  F C   N+CC + 
Sbjct: 339 RNIEAPT-GKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCDKYFSCPESNTCCCLY 394

BLAST of Cla97C01G016150 vs. ExPASy Swiss-Prot
Match: Q94B08 (Germination-specific cysteine protease 1 OS=Arabidopsis thaliana OX=3702 GN=GCP1 PE=2 SV=2)

HSP 1 Score: 445.7 bits (1145), Expect = 1.3e-123
Identity = 221/362 (61.05%), Postives = 274/362 (75.69%), Query Frame = 0

Query: 1   MATATTLLALLSFFFLSNSASALTR--------------RSDGEVREIYDLWLAKHGKAY 60
           MA +T +L+LL  + + + AS                  R+D EVR IY  W A+HGK  
Sbjct: 1   MAPSTKVLSLLLLYVVVSLASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTN 60

Query: 61  NG----IEEREKRFQIFKENLNFIDEHNSENR--TYKVGLNMFADLTNDEYRAVYLGTRS 120
           N     I +++KRF IFK+NL FID HN +N+  TYK+GL  F DLTNDEYR +YLG R+
Sbjct: 61  NNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGART 120

Query: 121 PPARRVMKAKTASRRYAVNIRDR-LPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEG 180
            PARR+ KAK  +++Y+  +  + +PE+VDWR +GAV P+K+QG+CGSCWAFST AAVEG
Sbjct: 121 EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEG 180

Query: 181 INQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQ 240
           IN+IVTGELISLSEQELV CDK YN GCNGGLMDYAFQFI+ NGGL+TE+DYPY GF G+
Sbjct: 181 INKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGK 240

Query: 241 CDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCG 300
           C+   KN++VVSIDGYEDVP  DE ALKKA+++QPVSVAIEA G   Q YQSG+FTG CG
Sbjct: 241 CNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCG 300

Query: 301 SALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYP 342
           + LDH V+AVGYG+ENGVDYW+VRNSWG  WGE+GY ++ERN+  + +GKCGIA+ ASYP
Sbjct: 301 TNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYP 360

BLAST of Cla97C01G016150 vs. ExPASy Swiss-Prot
Match: P43297 (Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1)

HSP 1 Score: 443.0 bits (1138), Expect = 8.5e-123
Identity = 213/358 (59.50%), Postives = 262/358 (73.18%), Query Frame = 0

Query: 26  RSDGEVREIYDLWLAKHGKA--YNGIEEREKRFQIFKENLNFIDEHNSENRTYKVGLNMF 85
           RS+ EV  IY+ WL KHGKA   N + E+++RF+IFK+NL F+DEHN +N +Y++GL  F
Sbjct: 41  RSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRF 100

Query: 86  ADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVNIRDRLPESVDWRTRGAVAPVKNQG 145
           ADLTNDEYR+ YLG +          +  S RY   + D LPES+DWR +GAVA VK+QG
Sbjct: 101 ADLTNDEYRSKYLGAKMEKKGE----RRTSLRYEARVGDELPESIDWRKKGAVAEVKDQG 160

Query: 146 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFIIDNG 205
            CGSCWAFSTI AVEGINQIVTG+LI+LSEQELV CD  YN GCNGGLMDYAF+FII NG
Sbjct: 161 GCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNG 220

Query: 206 GLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVPADDEKALKKAVAHQPVSVAIEASG 265
           G+DT++DYPY+G DG CD  RKNAKVV+ID YEDVP   E++LKKAVAHQP+S+AIEA G
Sbjct: 221 GIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGG 280

Query: 266 LALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDYWLVRNSWGTGWGEDGYFKLERNVK 325
            A QLY SG+F G CG+ LDHGV+AVGYGTENG DYW+VRNSWG  WGE GY ++ RN+ 
Sbjct: 281 RAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIA 340

Query: 326 HTTNGKCGIAMMASYPVKNG--------CETSATSSLMEVDYVFRC--FNSCCTVVSF 372
            +++GKCGIA+  SYP+KNG           S      + D  + C   N+CC +  +
Sbjct: 341 -SSSGKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEY 393

BLAST of Cla97C01G016150 vs. ExPASy Swiss-Prot
Match: P25776 (Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=1 SV=2)

HSP 1 Score: 435.6 bits (1119), Expect = 1.4e-120
Identity = 222/382 (58.12%), Postives = 270/382 (70.68%), Query Frame = 0

Query: 4   ATTLLALLSFFFLSNSASALTRRSDGEVREIYDLWLAKHGKAYNGIEEREKRFQIFKENL 63
           A  LL LLS      S  +   RS+ E R +Y  W A+HGK+YN + E E+R+  F++NL
Sbjct: 9   AAALLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNL 68

Query: 64  NFIDEHNSEN----RTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKTASRRYAVN 123
            +IDEHN+       ++++GLN FADLTN+EYR  YLG R+ P R     +  S RY   
Sbjct: 69  RYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAA 128

Query: 124 IRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC 183
             + LPESVDWRT+GAVA +K+QG CGSCWAFS IAAVEGINQIVTG+LISLSEQELV C
Sbjct: 129 DNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDC 188

Query: 184 DKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFDGQCDPTRKNAKVVSIDGYEDVP 243
           D  YN GCNGGLMDYAF FII+NGG+DTE+DYPY+G D +CD  RKNAKVV+ID YEDV 
Sbjct: 189 DTSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVT 248

Query: 244 ADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVGYGTENGVDY 303
            + E +L+KAVA+QPVSVAIEA G A QLY SG+FTGKCG+ALDHGV AVGYGTENG DY
Sbjct: 249 PNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDY 308

Query: 304 WLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKNG--------CETSATSS 363
           W+VRNSWG  WGE GY ++ERN+K  ++GKCGIA+  SYP+K G           S T  
Sbjct: 309 WIVRNSWGKSWGESGYVRMERNIK-ASSGKCGIAVEPSYPLKKGENPPNPGPTPPSPTPP 368

Query: 364 LMEVDYVFRCFNS--CCTVVSF 372
               D  + C +S  CC +  +
Sbjct: 369 PTVCDNYYTCPDSTTCCCIYEY 385

BLAST of Cla97C01G016150 vs. ExPASy Swiss-Prot
Match: Q9LT78 (Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 SV=1)

HSP 1 Score: 434.5 bits (1116), Expect = 3.0e-120
Identity = 218/352 (61.93%), Postives = 271/352 (76.99%), Query Frame = 0

Query: 1   MATA--TTLLALLSFFFLSNSAS------ALTRRSDGEVREIYDLWLAKHGKAYNGIEER 60
           MAT+  +  LALL F  L  S S        T R++ E R +Y+ WL ++ K YNG+ E+
Sbjct: 1   MATSIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEK 60

Query: 61  EKRFQIFKENLNFIDEHNS-ENRTYKVGLNMFADLTNDEYRAVYLGTRSPPARRVMKAKT 120
           E+RF+IFK+NL F++EH+S  NRTY+VGL  FADLTNDE+RA+YL ++    R  +K + 
Sbjct: 61  ERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGE- 120

Query: 121 ASRRYAVNIRDRLPESVDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISL 180
              +Y   + D LP+++DWR +GAV PVK+QGSCGSCWAFS I AVEGINQI TGELISL
Sbjct: 121 ---KYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISL 180

Query: 181 SEQELVSCDKKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEGFD-GQCDPTRKNAKVV 240
           SEQELV CD  YN GC GGLMDYAF+FII+NGG+DTEEDYPY   D   C+  +KN +VV
Sbjct: 181 SEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVV 240

Query: 241 SIDGYEDVPADDEKALKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVIAVG 300
           +IDGYEDVP +DEK+LKKA+A+QP+SVAIEA G A QLY SGVFTG CG++LDHGV+AVG
Sbjct: 241 TIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVG 300

Query: 301 YGTENGVDYWLVRNSWGTGWGEDGYFKLERNVKHTTNGKCGIAMMASYPVKN 343
           YG+E G DYW+VRNSWG+ WGE GYFKLERN+K  ++GKCG+AMMASYP K+
Sbjct: 301 YGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKE-SSGKCGVAMMASYPTKS 347

BLAST of Cla97C01G016150 vs. ExPASy TrEMBL
Match: A0A0A0KID5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G491610 PE=4 SV=1)

HSP 1 Score: 1026.2 bits (2652), Expect = 8.7e-296
Identity = 502/534 (94.01%), Postives = 523/534 (97.94%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKLRETVESGLPRNS+SANSVDSHGK E+QLQE KQIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLRETVESGLPRNSISANSVDSHGKSELQLQESKQIQSESHHV 60

Query: 453 IIEDEDQKL-EEDP--ESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGED 512
           I+E+EDQKL EEDP  ESP +EWV+TIKEKL+QAHQDEVES+WAKLCIYKVPHYLKDGED
Sbjct: 61  IVENEDQKLEEEDPELESPVSEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGED 120

Query: 513 KAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCY 572
           KAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHIL+R+KQDIK+YLD MKELEE+AR+CY
Sbjct: 121 KAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILDRSKQDIKLYLDAMKELEERARNCY 180

Query: 573 EGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIM 632
           EGP S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIM
Sbjct: 181 EGPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIM 240

Query: 633 LENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNA 692
           LENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 
Sbjct: 241 LENQLPLFVLDRLLELQLGDNYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNT 300

Query: 693 TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELK 752
           TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK
Sbjct: 301 TAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWMKRRSHANRVADKRRQQLIHCVKELK 360

Query: 753 EAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV 812
           +AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV
Sbjct: 361 DAGIRFKKKKTDRFWDINFNNGVMEIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYV 420

Query: 813 VFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVN 872
           VFMDNLIDSHEDV+YLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVN
Sbjct: 421 VFMDNLIDSHEDVSYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVN 480

Query: 873 RYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           RYYNHRWNAWRATLKHNYF NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Sbjct: 481 RYYNHRWNAWRATLKHNYFSNPWAIISLIAAVVLLLLTFAQAFYGVFAYYKPPN 534

BLAST of Cla97C01G016150 vs. ExPASy TrEMBL
Match: A0A5D3CR40 (UPF0481 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004580 PE=4 SV=1)

HSP 1 Score: 1023.5 bits (2645), Expect = 5.6e-295
Identity = 499/533 (93.62%), Postives = 521/533 (97.75%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+V
Sbjct: 1   MVAVFNKELLSWYLITLKLRETVESGLPRSSISANSVDSHGKSELQLGEPKQIQSESHNV 60

Query: 453 IIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDK 512
           IIE+ED KLEEDP  ESPE+EWV+TIKEKL+QAHQDEVES+WAKLCIYKVPHYLKDGEDK
Sbjct: 61  IIENEDHKLEEDPEFESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDK 120

Query: 513 AVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYE 572
           AVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLD MKELEE+AR+CYE
Sbjct: 121 AVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERSKHDIKLYLDAMKELEERARNCYE 180

Query: 573 GPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML 632
           GP S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML
Sbjct: 181 GPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML 240

Query: 633 ENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT 692
           ENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Sbjct: 241 ENQLPLFVLDRLLELQLGDYYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNTT 300

Query: 693 AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKE 752
           AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+
Sbjct: 301 AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWMKRRSHANRVADKRRQQLIHCVKELKD 360

Query: 753 AGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV 812
           AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV
Sbjct: 361 AGIRFKKKKTDRFWDINFNNGVMEIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV 420

Query: 813 FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNR 872
           FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNR
Sbjct: 421 FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNR 480

Query: 873 YYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           YYNHRWNAWRATLKHNYF NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Sbjct: 481 YYNHRWNAWRATLKHNYFSNPWAIISLIAAVVLLLLTFAQAFYGVFAYYKPPN 533

BLAST of Cla97C01G016150 vs. ExPASy TrEMBL
Match: A0A1S3B0V1 (UPF0481 protein At3g47200 OS=Cucumis melo OX=3656 GN=LOC103484799 PE=4 SV=1)

HSP 1 Score: 1023.5 bits (2645), Expect = 5.6e-295
Identity = 499/533 (93.62%), Postives = 521/533 (97.75%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKLRETVESGLPR+S+SANSVDSHGK E+QL E KQIQSESH+V
Sbjct: 1   MVAVFNKELLSWYLITLKLRETVESGLPRSSISANSVDSHGKSELQLGEPKQIQSESHNV 60

Query: 453 IIEDEDQKLEEDP--ESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDK 512
           IIE+ED KLEEDP  ESPE+EWV+TIKEKL+QAHQDEVES+WAKLCIYKVPHYLKDGEDK
Sbjct: 61  IIENEDHKLEEDPEFESPESEWVITIKEKLNQAHQDEVESSWAKLCIYKVPHYLKDGEDK 120

Query: 513 AVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYE 572
           AVVPQI+SLGPYHHGKRRLRQMERHKWRSLYHILER+K DIK+YLD MKELEE+AR+CYE
Sbjct: 121 AVVPQIISLGPYHHGKRRLRQMERHKWRSLYHILERSKHDIKLYLDAMKELEERARNCYE 180

Query: 573 GPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML 632
           GP S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML
Sbjct: 181 GPFSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIML 240

Query: 633 ENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNAT 692
           ENQLPLFVLDRLLELQLGD+YQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN T
Sbjct: 241 ENQLPLFVLDRLLELQLGDYYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNTT 300

Query: 693 AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKE 752
           AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVW+KRRSHANRVADKRRQQLIHCVKELK+
Sbjct: 301 AFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWMKRRSHANRVADKRRQQLIHCVKELKD 360

Query: 753 AGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV 812
           AGIRF+KKKTDRFWDINFNNGVM+IPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV
Sbjct: 361 AGIRFKKKKTDRFWDINFNNGVMEIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVV 420

Query: 813 FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNR 872
           FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVA+LFNRLCQEVVYDINDSYLSQLSEDVNR
Sbjct: 421 FMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVADLFNRLCQEVVYDINDSYLSQLSEDVNR 480

Query: 873 YYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           YYNHRWNAWRATLKHNYF NPWAIISL+AAVVLLLLTFAQAFYGV+AYYKPPN
Sbjct: 481 YYNHRWNAWRATLKHNYFSNPWAIISLIAAVVLLLLTFAQAFYGVFAYYKPPN 533

BLAST of Cla97C01G016150 vs. ExPASy TrEMBL
Match: A0A6J1KY55 (UPF0481 protein At3g47200-like OS=Cucurbita maxima OX=3661 GN=LOC111497424 PE=4 SV=1)

HSP 1 Score: 1011.9 bits (2615), Expect = 1.7e-291
Identity = 493/531 (92.84%), Postives = 512/531 (96.42%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKL+ETVESGLPRNS S NSVDSHGKP++QLQE +QIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLKETVESGLPRNSNSPNSVDSHGKPDLQLQEYRQIQSESHHV 60

Query: 453 IIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAV 512
           I+EDEDQKLEED ESPE+EWV++IKEKL QAHQDEVES+WAKLCIYKVPHYLKDG+DKAV
Sbjct: 61  IVEDEDQKLEEDSESPESEWVISIKEKLDQAHQDEVESSWAKLCIYKVPHYLKDGDDKAV 120

Query: 513 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGP 572
           VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE ARSCYEGP
Sbjct: 121 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERTKHDINIYLDAMKELEENARSCYEGP 180

Query: 573 LSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 632
            S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN
Sbjct: 181 FSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 240

Query: 633 QLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 692
           QLPLFVLDRLL +QLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Sbjct: 241 QLPLFVLDRLLGIQLGENYQKGLLAELALRFFDPLTPNDEPLTKSNLNKLESSLRNATAF 300

Query: 693 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG 752
           DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAG
Sbjct: 301 DPLGNQDGLHCLDVFRRSLLRSGQKLAPKVWIKRRSHAHRVADKRRQQLIHCVKELKEAG 360

Query: 753 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 812
           IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM
Sbjct: 361 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 420

Query: 813 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 872
           DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY
Sbjct: 421 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 480

Query: 873 NHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           NHRWNAWRA+LKHNYF NPWAIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Sbjct: 481 NHRWNAWRASLKHNYFSNPWAIISLIAAVVLLLLTFAQTFYGVYGYYRPPN 531

BLAST of Cla97C01G016150 vs. ExPASy TrEMBL
Match: A0A6J1HGP0 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111463383 PE=4 SV=1)

HSP 1 Score: 1010.4 bits (2611), Expect = 4.9e-291
Identity = 493/531 (92.84%), Postives = 511/531 (96.23%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVFNKELLSWYLITLKL+ETVESGLPRNS SANSVDSHGKPE+QLQE +QIQSESHHV
Sbjct: 1   MVAVFNKELLSWYLITLKLKETVESGLPRNSNSANSVDSHGKPELQLQEYRQIQSESHHV 60

Query: 453 IIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAV 512
           I+EDEDQKLEED ESPE+EWV++IKE L QAHQDEVES+WAKLCIYKVPHYLKDG+DKAV
Sbjct: 61  IVEDEDQKLEEDSESPESEWVISIKEMLDQAHQDEVESSWAKLCIYKVPHYLKDGDDKAV 120

Query: 513 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGP 572
           VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILER K DI IYLD MKELEE ARSCYEGP
Sbjct: 121 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERTKHDINIYLDAMKELEENARSCYEGP 180

Query: 573 LSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 632
            S SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN
Sbjct: 181 FSFSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 240

Query: 633 QLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 692
           QLPLFVLDRLL LQLG++YQKGL+AELALRFFDPLTPNDEPLTKS+LNKLESSL NATAF
Sbjct: 241 QLPLFVLDRLLGLQLGENYQKGLLAELALRFFDPLTPNDEPLTKSNLNKLESSLRNATAF 300

Query: 693 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG 752
           DPLG QDGLHCLDVFRRSLLRSG KLAPKVWIKRRSHA+RVADKRRQQLIHCVKELKEAG
Sbjct: 301 DPLGNQDGLHCLDVFRRSLLRSGQKLAPKVWIKRRSHAHRVADKRRQQLIHCVKELKEAG 360

Query: 753 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 812
           IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM
Sbjct: 361 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 420

Query: 813 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 872
           DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQ+SEDVN YY
Sbjct: 421 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQVSEDVNHYY 480

Query: 873 NHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           NHRWNAWRA+LKHNYF NPWAIISL+AAVVLLLLTFAQ FYGVY YY+PPN
Sbjct: 481 NHRWNAWRASLKHNYFSNPWAIISLIAAVVLLLLTFAQTFYGVYGYYRPPN 531

BLAST of Cla97C01G016150 vs. TAIR 10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 748.4 bits (1931), Expect = 6.7e-216
Identity = 357/531 (67.23%), Postives = 432/531 (81.36%), Query Frame = 0

Query: 393 MVAVFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEIQLQEQKQIQSESHHV 452
           MVAVF K++LSWYL+TLK+RE +E+    + L     +  G PEI   +Q Q    +   
Sbjct: 1   MVAVFYKDMLSWYLLTLKIREKLETQNQESVLVNQDQNLPGLPEITRSDQDQNIHNNQQT 60

Query: 453 IIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAV 512
             +     ++E P+    +WV++I +KL QAH+D+  + W KLCIY+VP+YL++ ++K+ 
Sbjct: 61  QSDPAIYVIKESPKDSRDDWVISITDKLEQAHRDDDTTLWGKLCIYRVPYYLQENDNKSY 120

Query: 513 VPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGP 572
            PQ VSLGPYHHGK+RLR M+RHKWR++  +L+R  Q IK+Y+D M+ELEEKAR+CYEGP
Sbjct: 121 FPQTVSLGPYHHGKKRLRSMDRHKWRAVNRVLKRTNQGIKMYIDAMRELEEKARACYEGP 180

Query: 573 LSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLEN 632
           LSLSSNEF+EM+VLDGCFVLELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLEN
Sbjct: 181 LSLSSNEFIEMLVLDGCFVLELFRGAVEGFTELGYARNDPVFAMRGSMHSIQRDMVMLEN 240

Query: 633 QLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAF 692
           QLPLFVL+RLLELQLG   Q GLVA+LA+RFFDPL P DEPLTKS  +KLE+SL    +F
Sbjct: 241 QLPLFVLNRLLELQLGTRNQTGLVAQLAIRFFDPLMPTDEPLTKSGQSKLENSLARDKSF 300

Query: 693 DPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAG 752
           DP      LHCLDVFRRSLLRS PK  P++  KR S   RVADKRRQQLIHCV ELKEAG
Sbjct: 301 DPFADMGELHCLDVFRRSLLRSSPKPEPRLTRKRWSRNTRVADKRRQQLIHCVTELKEAG 360

Query: 753 IRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFM 812
           I+F+++KTDRFWD+ F NG ++IPRLLIHDGT+SLFLNLIAFEQCH+D SNDITSY++FM
Sbjct: 361 IKFRRRKTDRFWDMQFKNGYLEIPRLLIHDGTKSLFLNLIAFEQCHIDSSNDITSYIIFM 420

Query: 813 DNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYY 872
           DNLIDSHEDV+YLHYCGIIEHWLGSD EVA+LFNRLCQEVV+D  DSYLS+LS +VNRYY
Sbjct: 421 DNLIDSHEDVSYLHYCGIIEHWLGSDSEVADLFNRLCQEVVFDTEDSYLSRLSIEVNRYY 480

Query: 873 NHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           +H+WNAWRATLKH YF NPWAI+S  AAV+LL+LTF+Q+FY VYAYYKPP+
Sbjct: 481 DHKWNAWRATLKHKYFNNPWAIVSFCAAVILLVLTFSQSFYAVYAYYKPPS 531

BLAST of Cla97C01G016150 vs. TAIR 10
Match: AT3G50170.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 663.7 bits (1711), Expect = 2.2e-190
Identity = 325/533 (60.98%), Postives = 407/533 (76.36%), Query Frame = 0

Query: 396 VFNKELLSWYLITLKLRETVESGLPRNSLSANSVDSHGKPEI-------QLQEQKQIQSE 455
           + NK++L+WYL++LKLR+  ++   ++S      + HG PE+        +Q  KQ  SE
Sbjct: 14  IINKDMLTWYLLSLKLRQKFQT-KNQHSEPVQDPNLHGLPEVTRPNQDQNVQNHKQSHSE 73

Query: 456 SHHVIIEDEDQKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGE 515
           S   ++E+  ++   D       WV++I++KL QA +D+  + W KLCIY+VPHYL++ +
Sbjct: 74  SGKEVVEERPEETTGD------SWVISIRDKLEQADRDDDTTIWGKLCIYRVPHYLQEND 133

Query: 516 DKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSC 575
            K+  PQ VSLGPYHHGK+RLR MERHKWR+L  +L+R KQ I++Y + M+ELEEKAR+C
Sbjct: 134 KKSYFPQTVSLGPYHHGKKRLRPMERHKWRALNKVLKRLKQRIEMYTNAMRELEEKARAC 193

Query: 576 YEGPLSLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMI 635
           YEGP+SLS NEF EM+VLDGCFVLELFRG  EGF ++GY RNDP+FAMRG MHSIQRDMI
Sbjct: 194 YEGPISLSRNEFTEMLVLDGCFVLELFRGTVEGFTEIGYARNDPVFAMRGLMHSIQRDMI 253

Query: 636 MLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGN 695
           MLENQLPLFVLDRLLELQLG   Q G+VA +A++FFDPL P  E LTK   +KL + L  
Sbjct: 254 MLENQLPLFVLDRLLELQLGTQNQTGIVAHVAVKFFDPLMPTGEALTKPDQSKLMNWL-- 313

Query: 696 ATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKEL 755
             + D LG +  LHCLDVFRRSLL+S P    +  +KR +   RV DKR+QQL+HCV EL
Sbjct: 314 EKSLDTLGDKGELHCLDVFRRSLLQSSPTPNTRSLLKRLTRNTRVVDKRQQQLVHCVTEL 373

Query: 756 KEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSY 815
           +EAG++F+K+KTDRFWDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH++ SN ITSY
Sbjct: 374 REAGVKFRKRKTDRFWDIEFKNGYLEIPKLLIHDGTKSLFSNLIAFEQCHIESSNHITSY 433

Query: 816 VVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDV 875
           ++FMDNLI+S EDV+YLHYCGIIEHWLGSD EVA+LFNRLCQEVV+D  DS+LS+LS DV
Sbjct: 434 IIFMDNLINSSEDVSYLHYCGIIEHWLGSDSEVADLFNRLCQEVVFDPKDSHLSRLSGDV 493

Query: 876 NRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP 922
           NRYYN +WN  +ATL H YF NPWA  S  AAV+LLLLT  Q+FY VYAYYKP
Sbjct: 494 NRYYNRKWNVLKATLTHKYFNNPWAYFSFSAAVILLLLTLCQSFYAVYAYYKP 537

BLAST of Cla97C01G016150 vs. TAIR 10
Match: AT3G50130.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 616.7 bits (1589), Expect = 3.0e-176
Identity = 293/467 (62.74%), Postives = 366/467 (78.37%), Query Frame = 0

Query: 459 QKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVS 518
           QK  + PE    EWV++I++K+ QA +++  ++W KLCIY+VP YL++   K+  PQ VS
Sbjct: 103 QKQNQKPEETREEWVISIRDKMEQALREDATTSWDKLCIYRVPQYLQENNKKSYFPQTVS 162

Query: 519 LGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGPLSLSSN 578
           LGP+HHG + L  M+RHKWR++  ++ R K DI++Y+D MKELE++AR+CYEGP+ LSSN
Sbjct: 163 LGPFHHGNKHLLPMDRHKWRAVNMVMARTKHDIEMYIDAMKELEDRARACYEGPIDLSSN 222

Query: 579 EFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFV 638
           +F EM+VLDGCFVLELFRGA EGF +LGY RNDP+FAMRGSMHSIQRDM+MLENQLPLFV
Sbjct: 223 KFSEMLVLDGCFVLELFRGADEGFSELGYDRNDPVFAMRGSMHSIQRDMVMLENQLPLFV 282

Query: 639 LDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSLGNATAFDPLGYQ 698
           L+RLLE+QLG  +Q GLV+ LA+RFFDPL P DEPLTK+     + SL     F+P+  +
Sbjct: 283 LNRLLEIQLGKRHQTGLVSRLAVRFFDPLMPTDEPLTKT-----DDSLEQDKFFNPIADK 342

Query: 699 D--GLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRFQ 758
           D   LHCLDVFRR+LLR      P++   R S   RVADKR+QQLIHCV EL+EAGI+F+
Sbjct: 343 DKGELHCLDVFRRNLLRPCSNPEPRLSRMRWSWRTRVADKRQQQLIHCVTELREAGIKFR 402

Query: 759 KKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNLI 818
            +KTDRFWDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH+D SNDITSY++FMDNLI
Sbjct: 403 TRKTDRFWDIRFKNGYLEIPKLLIHDGTKSLFSNLIAFEQCHIDSSNDITSYIIFMDNLI 462

Query: 819 DSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHRW 878
           DS EDV YLHYCGIIEHWLG+D EVA+LFNRLCQEV +D  +SYLSQLS  V+R Y+ +W
Sbjct: 463 DSSEDVRYLHYCGIIEHWLGNDYEVADLFNRLCQEVAFDPQNSYLSQLSNKVDRNYSRKW 522

Query: 879 NAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           N  +A LKH YF NPWA  S  AA+VLL+LT  Q+F+  Y Y+ PP+
Sbjct: 523 NVLKAILKHKYFNNPWAYFSFFAALVLLVLTLFQSFFTAYPYFNPPS 564

BLAST of Cla97C01G016150 vs. TAIR 10
Match: AT3G50140.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 594.3 bits (1531), Expect = 1.6e-169
Identity = 284/468 (60.68%), Postives = 362/468 (77.35%), Query Frame = 0

Query: 459 QKLEEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPHYLKDGEDKAVVPQIVS 518
           Q   + PE    EWV+ IK+K+ Q  +D   ++W K+CIY+VP  LK  +  +  PQ VS
Sbjct: 77  QTRNQQPEETREEWVIWIKDKMEQVMRDAATTSWDKICIYRVPLSLKKSDKNSYFPQAVS 136

Query: 519 LGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELEEKARSCYEGPLSLSSN 578
           LGPYHHG   LR M+ HKWR++  +++R KQ I++Y+D MKELEE+AR+CYEGP+ LSSN
Sbjct: 137 LGPYHHGDEHLRPMDYHKWRAVNMVMKRTKQGIEMYIDAMKELEERARACYEGPIGLSSN 196

Query: 579 EFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMHSIQRDMIMLENQLPLFV 638
           +F +M+VLDGCFVL+LFRGA EGF +LGY RNDP+FAMRGSMHSI+RDM+MLENQLPLFV
Sbjct: 197 KFTQMLVLDGCFVLDLFRGAYEGFSKLGYDRNDPVFAMRGSMHSIRRDMLMLENQLPLFV 256

Query: 639 LDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNKLESSL-GNATAFDPLG- 698
           L+RLLELQLG  YQ GLVA+LA+RFF+PL P     T  S  K+E+S   N   F+P+  
Sbjct: 257 LNRLLELQLGTQYQTGLVAQLAVRFFNPLMP-----TYMSSTKIENSQENNNKFFNPIAD 316

Query: 699 -YQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQLIHCVKELKEAGIRF 758
             ++ LHCLDVFRRSLL+   K  P++   R S    VADKR+QQL+HCV EL+EAGI+F
Sbjct: 317 KEKEELHCLDVFRRSLLQPSLKPDPRLSRSRWSRKPLVADKRQQQLLHCVTELREAGIKF 376

Query: 759 QKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDCSNDITSYVVFMDNL 818
           +++K+DRFWDI F NG ++IP+LLIHDGT+SLF NLIA+EQCH+D +NDITSY++FMDNL
Sbjct: 377 KRRKSDRFWDIQFKNGCLEIPKLLIHDGTKSLFSNLIAYEQCHIDSTNDITSYIIFMDNL 436

Query: 819 IDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYLSQLSEDVNRYYNHR 878
           IDS ED+ YLHY  IIEHWLG+D EVA++FNRLCQEV +D+ ++YLS+LS  V+RYYN +
Sbjct: 437 IDSAEDIRYLHYYDIIEHWLGNDSEVADVFNRLCQEVAFDLENTYLSELSNKVDRYYNRK 496

Query: 879 WNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKPPN 924
           WN  +ATLKH YF NPWA  S  AAV+LLLLT  Q+F+  Y Y+KPP+
Sbjct: 497 WNVLKATLKHKYFSNPWAYFSFFAAVILLLLTLFQSFFTSYPYFKPPS 539

BLAST of Cla97C01G016150 vs. TAIR 10
Match: AT3G50150.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 561.6 bits (1446), Expect = 1.2e-159
Identity = 277/480 (57.71%), Postives = 357/480 (74.38%), Query Frame = 0

Query: 446 QSESHHVIIEDEDQKL---EEDPESPETEWVVTIKEKLHQAHQDEVESTWAKLCIYKVPH 505
           Q+  +HV    E  K+   EE P     EWV++IK+K+ +A   +  ++W KLCIY+VP 
Sbjct: 38  QNLHNHVETYVEPSKIEVKEEKPRETREEWVISIKDKMEKALSYDATNSWDKLCIYRVPF 97

Query: 506 YLKDGEDKAVVPQIVSLGPYHHGKRRLRQMERHKWRSLYHILERAKQDIKIYLDGMKELE 565
           YL++ + K+ +PQ VS+GPYHHGK  LR MERHKWR++  I+ R K +I++Y+D MKELE
Sbjct: 98  YLQENDKKSYLPQTVSIGPYHHGKVHLRPMERHKWRAVNMIMARTKHNIEMYIDAMKELE 157

Query: 566 EKARSCYEGPLSL-SSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYPRNDPIFAMRGSMH 625
           E+AR+CY+GP+ + +SNEF EM+VLDGCFVLELF+G  +GF+++GY RNDP+FA RG MH
Sbjct: 158 EEARACYQGPIDMKNSNEFTEMLVLDGCFVLELFKGTIQGFQKIGYARNDPVFAKRGLMH 217

Query: 626 SIQRDMIMLENQLPLFVLDRLLELQLGDHYQKGLVAELALRFFDPLTPNDEPLTKSSLNK 685
           SIQRDMIMLENQLPLFVLDRLL LQ G   Q G+VAE+A+RFF  L P  E LTKS    
Sbjct: 218 SIQRDMIMLENQLPLFVLDRLLGLQTGTPNQTGIVAEVAVRFFKTLMPTSEVLTKS---- 277

Query: 686 LESSLGNATAFDPLGYQDGLHCLDVFRRSLLRSGPKLAPKVWIKRRSHANRVADKRRQQL 745
            E SL +    D LG   GLHCLDVF RSL++S      +   +   + +    +++QQL
Sbjct: 278 -ERSLDSQEKSDELGDNGGLHCLDVFHRSLIQSS-----ETTNQGTPYEDMSMVEKQQQL 337

Query: 746 IHCVKELKEAGIRFQKKKTDRFWDINFNNGVMQIPRLLIHDGTRSLFLNLIAFEQCHLDC 805
           IHCV EL+ AG+ F +K+T + WDI F NG ++IP+LLIHDGT+SLF NLIAFEQCH   
Sbjct: 338 IHCVTELRGAGVNFMRKETGQLWDIEFKNGYLKIPKLLIHDGTKSLFSNLIAFEQCHTQS 397

Query: 806 SNDITSYVVFMDNLIDSHEDVAYLHYCGIIEHWLGSDEEVAELFNRLCQEVVYDINDSYL 865
           SN+ITSY++FMDNLI+S +DV+YLH+ GIIEHWLGSD EVA+LFNRLC+EV++D  D YL
Sbjct: 398 SNNITSYIIFMDNLINSSQDVSYLHHDGIIEHWLGSDSEVADLFNRLCKEVIFDPKDGYL 457

Query: 866 SQLSEDVNRYYNHRWNAWRATLKHNYFGNPWAIISLVAAVVLLLLTFAQAFYGVYAYYKP 922
           SQLS +VNRYY+ +WN+ +ATL+  YF NPWA  S  AAV+LL LTF Q+F+ VYAYYKP
Sbjct: 458 SQLSREVNRYYSRKWNSLKATLRQKYFNNPWAYFSFSAAVILLFLTFFQSFFAVYAYYKP 507

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038880921.12.7e-29995.86UPF0481 protein At3g47200-like [Benincasa hispida][more]
XP_011657877.11.8e-29594.01UPF0481 protein At3g47200 [Cucumis sativus] >KGN48549.1 hypothetical protein Csa... [more]
XP_008440314.11.2e-29493.62PREDICTED: UPF0481 protein At3g47200 [Cucumis melo] >TYK12866.1 UPF0481 protein ... [more]
XP_023003973.13.5e-29192.84UPF0481 protein At3g47200-like [Cucurbita maxima][more]
XP_023518140.16.0e-29192.84UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q9FMH83.4e-12461.88Probable cysteine protease RD21B OS=Arabidopsis thaliana OX=3702 GN=RD21B PE=1 S... [more]
Q94B081.3e-12361.05Germination-specific cysteine protease 1 OS=Arabidopsis thaliana OX=3702 GN=GCP1... [more]
P432978.5e-12359.50Cysteine proteinase RD21A OS=Arabidopsis thaliana OX=3702 GN=RD21A PE=1 SV=1[more]
P257761.4e-12058.12Oryzain alpha chain OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0650000 PE=... [more]
Q9LT783.0e-12061.93Probable cysteine protease RD21C OS=Arabidopsis thaliana OX=3702 GN=RD21C PE=1 S... [more]
Match NameE-valueIdentityDescription
A0A0A0KID58.7e-29694.01Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G491610 PE=4 SV=1[more]
A0A5D3CR405.6e-29593.62UPF0481 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G0045... [more]
A0A1S3B0V15.6e-29593.62UPF0481 protein At3g47200 OS=Cucumis melo OX=3656 GN=LOC103484799 PE=4 SV=1[more]
A0A6J1KY551.7e-29192.84UPF0481 protein At3g47200-like OS=Cucurbita maxima OX=3661 GN=LOC111497424 PE=4 ... [more]
A0A6J1HGP04.9e-29192.84UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111463383 PE=... [more]
Match NameE-valueIdentityDescription
AT3G50120.16.7e-21667.23Plant protein of unknown function (DUF247) [more]
AT3G50170.12.2e-19060.98Plant protein of unknown function (DUF247) [more]
AT3G50130.13.0e-17662.74Plant protein of unknown function (DUF247) [more]
AT3G50140.11.6e-16960.68Plant protein of unknown function (DUF247) [more]
AT3G50150.11.2e-15957.71Plant protein of unknown function (DUF247) [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 544..564
NoneNo IPR availableGENE3D3.90.70.10Cysteine proteinasescoord: 18..342
e-value: 6.4E-121
score: 405.7
NoneNo IPR availablePANTHERPTHR31549:SF147PROTEIN, PUTATIVE (DUF247)-RELATEDcoord: 440..922
IPR000668Peptidase C1A, papain C-terminalPRINTSPR00705PAPAINcoord: 142..157
score: 65.92
coord: 284..294
score: 57.91
coord: 299..305
score: 69.74
IPR000668Peptidase C1A, papain C-terminalSMARTSM00645pept_c1coord: 124..340
e-value: 4.8E-126
score: 434.7
IPR000668Peptidase C1A, papain C-terminalPFAMPF00112Peptidase_C1coord: 124..340
e-value: 9.1E-83
score: 277.6
IPR013201Cathepsin propeptide inhibitor domain (I29)SMARTSM00848Inhibitor_I29_2coord: 35..91
e-value: 2.0E-25
score: 100.5
IPR013201Cathepsin propeptide inhibitor domain (I29)PFAMPF08246Inhibitor_I29coord: 36..91
e-value: 2.6E-17
score: 63.0
IPR004158Protein of unknown function DUF247, plantPFAMPF03140DUF247coord: 497..906
e-value: 4.0E-125
score: 418.2
IPR004158Protein of unknown function DUF247, plantPANTHERPTHR31549PROTEIN, PUTATIVE (DUF247)-RELATED-RELATEDcoord: 440..922
IPR000169Cysteine peptidase, cysteine active sitePROSITEPS00139THIOL_PROTEASE_CYScoord: 142..153
IPR025661Cysteine peptidase, asparagine active sitePROSITEPS00640THIOL_PROTEASE_ASNcoord: 299..318
IPR039417Papain-like cysteine endopeptidaseCDDcd02248Peptidase_C1Acoord: 125..339
e-value: 4.13499E-111
score: 338.444
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 28..340

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G016150.2Cla97C01G016150.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008234 cysteine-type peptidase activity