Cla97C07G144470 (gene) Watermelon (97103) v2.5

Overview
NameCla97C07G144470
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionWEB family protein At5g16730, chloroplastic
LocationCla97Chr07: 31828998 .. 31837763 (-)
RNA-Seq ExpressionCla97C07G144470
SyntenyCla97C07G144470
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGAAGTCAAAGAGTTCAACTTCTTCGTCTTTCCAACCAACCATGTTCAACACGTTCTTCATCTCGTTTTCTTCATTTATTCTTATTTCTTCTTCTTCCCCTGCATCTTCTACATTTTATTCTTCAAATAATACAATTCAAACTCAAATCCATAGTACCTATTACAATCTATCAGAGATAGACTATAGTGATAGAACTCTATCACAATCTATCAATGATAAAATTTGAGCTAAACAGTTGAAAGAACAAAATACAATCTGATTTTTGTCTATCGCTGAAAGACAATGATAGAACTCTATTACAGTCTATCAAAGAGATAGACAGTGATATAGCTCTATGCATTTCTTTATGTAATAGAATTTGAGCTGAAACAGTTTAAATAACAAAATACAATATGATTTTAATACCATACTCAATTGAGAAGTAATCTTGGACAAAAGTAGAGAATAACCACACAGTAAATTGATCATCCTATTAACATTTCTAAACTTTTTTTCTTATTAAAATTTTAGAGTTATTTTTTCAAGGAGGTATTAACCATTTTTATTTATATACTAACAATGAAAGTATTTTTTATCTCTTTTAAAAAAATTTAAACGTATTTTCGAAACTTTTGAACGTTTAGGGAAATTTTGAACAAAACTTTAACGGTTTCCATCCAAAACTAACAAAAAGGTATTTTAGAATTTTTTTGGAAGCTTGAGATTAATGTTGATACTTTAAAAGTTTAAAGATATTTTTAACCCAAAATGTAATGTTCATAGACATTTTTTATAATTTAACCAAATAATTATAATTTAGAAATTGCAATTATCTTTTAAGAAAAATATTGAAATTCCACCCCCAAAACTAGATAAATCCTACTACCAATTATCTTTTAAGAAAAATTATCGTATAAGAGTCTATTTTTGGTCATCCAGTTGGAAAAAGAATGAAAAAAAAAAAAATAGAAAAAACAAATTGCGTAACTTTAGGTTGACAAAGTCAAAATCGAAAAGCCCGGAAAAGGGTAATCAACTCACGTTTCACGGTTTCATTCTCTAATCCCCCCAGTAGCGCCCATTACAATTAAAAAATAAAATGTCACACATTTATCGAAATTCCAACCAATAGGAAACGACTATTGTATCCTCACGATCCTCGTCAACCCTTTCAATGTTACATCGCGGCCGTTCACTCTACTCGAGAAAACACAACTCAAGCCCTCATTGACCGTTAGATTAAAACACCTAAAAAACACACGGATATGCTCTCAGAACCGTCCGATCATCAGATTCTTTCTTCATTTACGACGCTATAAATATACCCTAACCGCACTTATCGTTGAACTTTTGGCTTCTTCGACGGAGAGAGGATCTGCTCGAGAGAGAAGACGAAGCGCCGAAGCAACACCAGGTTTGTCTTATTTCTTTTTCTCTTGCTTTTCTTATTTTCTTATTGCCAATTACGTCTAGGGATTCCGATTGTTTTTTAATTCTCGTTTGAGTAGTGGATTTGTTCCATCATCTTTCTCTGTTTTCGTCAATGATTCAACTCCTTGTAGTCGTTATGCGCTTTTAATTTTGTTTCCAGACTTAGATTTACGGGATATTAGAACTTTCCGACTCTATCTAGGTTACAGGGTTCGCATGCTTTCTAATTTTTCCCCTGATATGTGAACGCTTTAGGGTTCTTGTATTGTTTGGTATTCCAGTTTACGAGAACATTGAATTAGGTAATCGAATCCTCTGTCAATTATATAGGAAAATTTTTTGTGGTAGTTCCATCCCATTTTCTATAATCTCATGTAGGCGGTTTGTAAGGACGTCCAGCATGATAATTCGTGTTGAATAACGTCTAATTGTGAACAATTTAAAATTGGTGTTTATTTTTCTGTAGATGGCTCGTACCAAGCAAACTGCCCGTAAGTCCACTGGAGGAAAGGCTCCAAGGAAGCAGCTGGCCACAAAGGTTTGTTCAAGTTTCTTGCGGGAATTTTGTGGATGAATTTGATTTAACTTTTATGAGCTTTCTATTCATTTATGTTTCTTTTTGGGGGTGGTATTATGTAGGCTGCTCGTAAGTCCGCCCCAACCACAGGAGGAGTGAAGAAGCCCCACAGATATCGTCCTGGTACTGTTGCTCTTCGGTGAGTTCTCATGGATTAGCTCCATGGTTTTCCCCTCTAGTAGATGGACTACTGTATACCGGTTTGAGTAAAAGTTATTAACTGCTTCATTCCATTTTTATTCATGCAGTGAAATCCGTAAGTATCAAAAGAGTACTGAGCTTTTGATTAGGAAGTTGCCCTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGGTATGTTGCAATCTCTTTCATTCCCTGCTTTGCAATACAGCTTGGTTTGACATTTATTTGACGGATGTTAATTGGTTTTGGTGTAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTGGCATTGCAAGAGGCAGCCGAGGCCTACCTTGTTGGGTTGTTCGAGGACACCAATCTCTGTGCCATTCATGCAAAGCGCGTTACTATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGAGGTGAACGTGCTTAGAAGATGTGTATTGTTGTTTGGTGGATGGTATTGGCTGCAAGGAGGTGATCAAAGACCTAATTGATGATGATGGTGGATGATGGGTGGTTCTTATTTTTGTTATTAGCGACGTTGGTAGATAGTTATACTAGAAAAACTTGTAGTGAATATGGTAAAGGCATAATGCTTTGCTTGTGCATGTTAATGTTTAGTTGTGGATGATTTTGTTTTTGGTTCTTTTCAAAGGATGGAGTAGCTTAATTCTGTATTCCATTTAATTATGCTTTTGGGGTATGCAGCCAAACTAATATTTGTTGGATAACCTAAACCTATCACATCTAATTTGTAGTTCAGATTTGCAAGTCCATGAATTATTCTCACCTATTATTTATGTTGATGTGTGTTTCATGTAGAAGATAATTGTTTGAGCACGGCGATGGGGGAAATGGAATTGTTTTTAGGATGTTGAATCTTTGCTTCCATACATCAATGATTGTACGGATCTTTCAGTTCGACTGATATTTTATGTGCAGCGTTTTGCTGACTGCTTGTTTATTTTGTAGTCTATTTTGTGGAGTTAATAAGATTGTGTTGGGGGGGTCACAGTACAGTTCGTTAGGAAAGATTGTTTGCTTGGGGGACAGTAGTCTGGAGTGTAGTGAAGTTTTTGCAGATTTTGGAAAGTGTTCTTATTTTCAGTCGTTCCATTTGCAGTTTTTAAAGTTCCAATCTCAATTTTTTGGTTAACTGAAACATTTTGTGCGTCTTTTACGGTTTTTTTGTATTCAATCAATGGAGATGTAGTCGAAATATTTCTATATTTTCTTTCTCAGGTAAACTTGTTCTCATATTTGTGGAACATTCCTTGGGTTTTAAAAGTACTCTTAAAAATTTCTAGAAATAGTTTAATGGTCAATTTTATGTTAAATGTCTTTGAACTTCTAAAAGTTGTAATATTGTCCAATTCTTTCACATCATATTGTATTATGGGGTTAAAAAGAGCATTCATTTTAAATACATATCATCTGTTCTTTAGACATAAAACAACAATGCATTGTTATCAAGAAATGAGGTTCAATCCCACCTTTCATAGTGTAAAAACAGAATCTGGTATGTTTAGATTGCCTAAAGAAGACAAGTTTTTCTTCAACGTTTTCCCATGTTTATTTAATATTCTGACCCATTCCAAAGTATATACATTAAATTATAATTTTAGTTTATAGAAGTCATTTAACTTTTTTATGTGCTTGATAGATTTTCAAATTTTAATTTTATTTTTGTTAAGTTATTAAATCTTAAAAAGTGTCTAAGAGATATTTTTCGATTTTATTCAATAGAGAATATATATCTTTTAATTGTATCTAATGGGTCATGTGTGTATGTGCGTGTTTGTGAAAATTTCATAGACATAAAATCAACTTATCTATTAGTTTATTTATCAATATATATTTTTTTAAAATCAATTTATTAGATTCAAAATTGAAAGTTGAAGGACTTATAGGATAACATTTAAAGTTTTATTAAATACAAAATTGAAAAACTGAAAGTTTAAAATCTCTTAGACACTTTTGAATGTTTAAAGACCTCTATCAAATTGACAAAGCTTTGAGACTTCGTGGACTAAACTTAATCATATGTATAGAAATTTAATCATTTTTTTGGGGTAAAGTTTTGTATTATTTGAATATCATACATATTACAATATATAGTACAAACTTTAACTACTAACTTTTTTCCTTATGTTGTCCTAATGTATAGAAAAGAATAGTCAATTCAAGTATAATTTAGTGGATAAAGCGGACAATAGTTATTTATTACAAAAGCTCTTGAGAGTTCTCTCTAGCAAGGCCTAAAGCCCATGAAAAAATGGGAACCTACATGGGGAAATGCATGTCCCGAAAGGGAAAAGTCAAAATCTTTAGTTATGTGTTGACTAGCCGAACCAGTGAATATATGCTAATCAGGAAATAACTGAGTTATGCCCGGTTTGATGTCGTGCAACCTTGAGCAAGAAAAACTACCATGACATTGCTAGAATTACAAATAACCACAAATGACTTTAGAGAAAGGATATTATAAATAACCCCATTTGTTAACTTTAAATGAAAAGACTAAGTAATCTTCAAAAAAAAATCGTGACCTTACTTTTTTTTTTTTTTGCTTAGGAACTATATCTCTATACTAAACTTAGGTATCAGAGTGTGGTAGGCTCGCCACTACATCAGTGCAAGGATCTTTCACGCAAGTCAGTTCAAAAAGCGAGACCAAACTAGGAGAAGAGGTCGGAGATTAAGCCCATGAGGAGCAAGTGACAACGACAATTGCGTGCGATATCTACGCGCCAACCTAAGCAAATAAAAGAAGTAAAACACACCAAATTAGGGAATATCAATTAAAGTAAATTCAGAGATGGATTCATTTTGAAATTTAAAATGGTTTTCAGTAGTCATTTCCTAATTTTTCTATGCAGTTTGGGCATCTTTATGGAGGAAATGTGTATGTTTTGTACATTGAGTTGAAGCAAAAAAAATATTTGAAAATAGAAAAGATTTAAACTTCTATCATTTTGATTAAGGATATATGGTTAAATAATGTTTAGATTGATGAAAAATATAGTTGTATTATACCAAATATTCGAGAAGAATGAGTCCATATTGACAAACTTTGAATGTCACTTTAGAATAAGATCATAGAATTAAACCGATGTAACCTTCTAGTTAATACTAAAAAAAAATCAACAAATTAAGAAATTGGAAACTGATAAAACGTTTCGTCATATTTTCAAAAATAAATAAATAAACGTTTGGTCATAAGATGGTTCTTTTGAAAGAAGAAAATTCCTTATTGCAGCCTTGAAAAGTTCTTCCTCCCATTTGGACAGATAAATTAAAAAAAAAAATCATTTTCTCCCAACTTAAAAATACAATCGTTCGCAGCCCAACTTGAAACTTTTTTCTTTTATTTATTTATTTTAAAAAAACTTGAATTTGTAATTAAAATTTTGATCATCTCTCTTTATGTATTTTCTTCTCAGAGTTTCAAATAATAATAATAACAATAATAATAAACTTAATTAAGTGATATTAATATTTACTCCATCTCTAGGAGTGGGAGATTCAATAATTCACTCCTAACTCTAACTCTATTTATAAGTTTTAAAAGAATTGTAGTAAACAACAAAGAAAAGAAAAAGTTAAAAATTGAAAATTAGTTATAAGGCAGTTAGTTGTTGCGTTCCAAGTTTCAACCACCCCTCTCTGCCGGCTGCTTCCAGCCAAATGGGGCACAGTTCACTACTCAACCAAATATATATATTTATTAAAATATCATTTGATGTATGTATTTTTAATTTTTTTTTATTTTTCAAAACTTCGCTCGATTTTTTAAAATATTGGTAAAAGTTAGATAACAAAACAAAAAAATTAGAGATGAGAAAGATGTTTATAGACTTCATTCCAAAAACTTAAAACCAAAAGGTTATGGAACTTTAGTGATTCACAAATCCCACATAATTCTTAAACGTTTCTTTTATGAAAAACAATGAGAAAAGAAATAGTATAAAAAATTCTCATATAAGCATAGTCATATTATTAAATTTTGATATATCTTTTGATATTGTCACGAGGAGAAAATCAATAATTTCTCTATATCGTAGAATAATCGAAAAAACATAAAAATAAGCAACAAAACATACAATATACGTATATCTTAATTTGGTTAAAAATCATAAAACATTTATTTAGTAATTAAAATTTATAGTTGGATGGACATAGGATATTTTAAATAAGTGACTAGCTAAGGCTAAAGTAGTTATTGTTGAAAAGTAGGAAAAGCAATAGAAGGATTAGAAAAAGCCAAATTGACATCGTCAAATCCAATGGCTAATTAACATATTTATTTCTCTGTTTTTAATAATATTTCAGCCCATTGGTTCCTTTACTGGAACATCCGCCATCTTCGGATTTTCCTTCATTTCCAAACCTCTCTTCTTTCTCTTCTCCCTCTCTCTCTCTCTCTTAATTCATCTTCTCCTCCCAATTTCACTGTATTCTCCAGAAAAAACTATAATTATAAACAAACAAACACACACAATTTTCTCTTGTTTTTTGCAGCTTTTTTTTTTTTTTTTCTTTCTTTTCCCCTTTGAATGGAATTGCGTGTTCTTTCAACTAATCCGATTCTGTAAGTTTCCCTTTCTCTTTCCCCCATTTTTTTTTTCTAAAACAATTATTTTATGATTGAAAATTTCCAGCCATTGACCTATTCTCATCCCCTGTTCTTCAATTTTGATTCTTTTCATTATGGTAAAGAGATTTTGATTCGTTGAGTAACTGGGTTTTGTTTTGTTTGGGTGTGGTTGTGTTTTGTGTTTTTGCCAATAAGCAGAGGGTTTGATCAGAGTTTATGAAATGCATCAACAAATGGAGAAGACAGATGAGAACTTCATCAATGCGGCAGAGGAAGAGGAAAAAGATTCAACAGAGCTTCAAAAGCTTAAAGAGGTGAAGAAAATGGGGGATGAGGCCAATACAAAACTTAGTTCCGAGTTGGAATCTGTGGTTAGAAAAAGAGAATCCGCCATGGAAAAGGCTGAGGAATTGGCAGAGAAAATGGAGGAGCTCTCTGTTTTGAAACACTCTGAATCACAAACAGAAACAAGAATTGAACAACTCCAAAAAAAATACCAAATTTCCAAAGAATCAGAAGAGAAAACAAAGGAATTGCTATCAGCACAAACAAAACAGCTGGAACAGACAAAGATCTCCCTTGAAGAATCAAAGCTACAAATCCAATCCCTTCACGAAAAACTCCAGAAATATTCTTCCAACACAAATTACAATCACAACAACATTCCAAGAAACCACGTTTCCGACAGCCTAAAATCGGAACTTGAATCGACAAAACACCAACTGGGTCTGCTAAAAAATGAGCTAAAATTGGCCACAGAAGCAGAGGAGAACAACAAGAAAGCAATGGACGATCTAGCAATGGCATTGAAAGAGGTAGCAACAGAAGCCAATCAATTGAAGGGAAAATACAGCACAATCCAAGAAGAATTAAAGCAAACAAAAGAGGAAACAGAGAATTTGAAAACAACATTGAAAAACACAGAGGAAAAGTACAAAACTTTGTTACAAGAGGCAAGAAGAGATGCAGATTTATACAAAAGCACAGTGGATAGGCTGAGATTAGAAGCAGAGGAGTCCTTATTGGCATGGAGTGGGAGAGAAACAAGCTTAGTGGATTGCATAAGAAGAGCTGAAGATGACAGATACAATGCTCAGCAAGAGAATCGCCGTTTAATGGATGCTCTGAGGCTAGCAGAGCTCAAGAACATGACATCAAAAGAGGAGATTAAGAAACTGAGGGACATTCTAAAACAAGCCTTAAATGAAGCCACTGTGGCCAAAGAAGCTGCAGGGATTGCAATTGAAGAGAATTCACAGCTCAAAGACTGTTTGGCTGAGAAGGAAAATGCCTTGGATTTTGTGAGCAGTGAGAATGAGACTCTCAAAGTCAGCCAAGCCTCAGCTCTAGAGGAGATTAAAGAGCTGAAGCAATTGCTAGAAGAAGCAAACAAAAGAGAAGAAAACAGTAAAGAGGAAAGCAAGACCAAAGAAGAAGCCAAGGAGCAAGTGGAGATGGGCAAATCGAAGCCGCCGTTGAGTCCAAGTCCAAATCAGAATCCAACTCCGGCTCCAGCCGAGAAGGAAGATACATTTGGGAAAAGGCTGGGGAAGGCTTTCAGTTTCAGTTTCTTGGAGTTGAGAATCACACCACAGAAGAAGGAGGTGGAAGAAGAAGCAGCAGAGGAGGAGCCTGAAATGGAGGAGACGCTGAAAGGGTCGATTTTCGATGAAAATGTTGACTCGCCTGGTTCGGCCAGGCTGCACGAGAGGAAGCCATCATTGTCTCAGTATAGTGAAGATGGGGAGATGATGAATTATGAAGGTGAGGATCTTGATCAGTTGGAGGAAGGGCATTTGGATGAGTTAGAAGGGGATAGGAATTCAAGGAAGAAGAAGGCATTGATAAGGAGATTTGGGGATCTTTTGATGAGGAGGAGGAGCTTTCAACAGAAGAAGGAACAATCACCTGAGTGAAGTTTTGTGATGTTCATGCAGTTTTGAATTTTTCCTTTTTCATCTTTTTGTAAAACTGTAAATTCATGCAAGTAGCTTACATTGTGTATTCACG

mRNA sequence

ATGGCGGAAGTCAAAGAGTTCAACTTCTTCGTCTTTCCAACCAACCATGTTCAACACGGATTCCGATTGTTTTTTAATTCTCGTTTGAGTAGTGGATTTGTTCCATCATCTTTCTCTGTTTTCGTCAATGATTCAACTCCTTGTAGTCGTTATGCGCTTTTAATTTTGTTTCCAGACTTAGATTTACGGGATATTAGAACTTTCCGACTCTATCTAGGTTACAGGATGGCTCGTACCAAGCAAACTGCCCGTAAGTCCACTGGAGGAAAGGCTCCAAGGAAGCAGCTGGCCACAAAGGCTGCTCGTAAGTCCGCCCCAACCACAGGAGGAGTGAAGAAGCCCCACAGATATCGTCCTGGTACTGTTGCTCTTCGTGAAATCCGTAAGTATCAAAAGAGTACTGAGCTTTTGATTAGGAAGTTGCCCTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTGGCATTGCAAGAGGCAGCCGAGGCCTACCTTGTTGGGTTGTTCGAGGACACCAATCTCTGTGCCATTCATGCAAAGCGCGTTACTATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGAGCGACGTTGGTAGATAGTTATACTAGAAAAACTTGTAGTGAATATGAGGGTTTGATCAGAGTTTATGAAATGCATCAACAAATGGAGAAGACAGATGAGAACTTCATCAATGCGGCAGAGGAAGAGGAAAAAGATTCAACAGAGCTTCAAAAGCTTAAAGAGGTGAAGAAAATGGGGGATGAGGCCAATACAAAACTTAGTTCCGAGTTGGAATCTGTGGTTAGAAAAAGAGAATCCGCCATGGAAAAGGCTGAGGAATTGGCAGAGAAAATGGAGGAGCTCTCTGTTTTGAAACACTCTGAATCACAAACAGAAACAAGAATTGAACAACTCCAAAAAAAATACCAAATTTCCAAAGAATCAGAAGAGAAAACAAAGGAATTGCTATCAGCACAAACAAAACAGCTGGAACAGACAAAGATCTCCCTTGAAGAATCAAAGCTACAAATCCAATCCCTTCACGAAAAACTCCAGAAATATTCTTCCAACACAAATTACAATCACAACAACATTCCAAGAAACCACGTTTCCGACAGCCTAAAATCGGAACTTGAATCGACAAAACACCAACTGGGTCTGCTAAAAAATGAGCTAAAATTGGCCACAGAAGCAGAGGAGAACAACAAGAAAGCAATGGACGATCTAGCAATGGCATTGAAAGAGGTAGCAACAGAAGCCAATCAATTGAAGGGAAAATACAGCACAATCCAAGAAGAATTAAAGCAAACAAAAGAGGAAACAGAGAATTTGAAAACAACATTGAAAAACACAGAGGAAAAGTACAAAACTTTGTTACAAGAGGCAAGAAGAGATGCAGATTTATACAAAAGCACAGTGGATAGGCTGAGATTAGAAGCAGAGGAGTCCTTATTGGCATGGAGTGGGAGAGAAACAAGCTTAGTGGATTGCATAAGAAGAGCTGAAGATGACAGATACAATGCTCAGCAAGAGAATCGCCGTTTAATGGATGCTCTGAGGCTAGCAGAGCTCAAGAACATGACATCAAAAGAGGAGATTAAGAAACTGAGGGACATTCTAAAACAAGCCTTAAATGAAGCCACTGTGGCCAAAGAAGCTGCAGGGATTGCAATTGAAGAGAATTCACAGCTCAAAGACTGTTTGGCTGAGAAGGAAAATGCCTTGGATTTTGTGAGCAGTGAGAATGAGACTCTCAAAGTCAGCCAAGCCTCAGCTCTAGAGGAGATTAAAGAGCTGAAGCAATTGCTAGAAGAAGCAAACAAAAGAGAAGAAAACAGTAAAGAGGAAAGCAAGACCAAAGAAGAAGCCAAGGAGCAAGTGGAGATGGGCAAATCGAAGCCGCCGTTGAGTCCAAGTCCAAATCAGAATCCAACTCCGGCTCCAGCCGAGAAGGAAGATACATTTGGGAAAAGGCTGGGGAAGGCTTTCAGTTTCAGTTTCTTGGAGTTGAGAATCACACCACAGAAGAAGGAGGTGGAAGAAGAAGCAGCAGAGGAGGAGCCTGAAATGGAGGAGACGCTGAAAGGGTCGATTTTCGATGAAAATGTTGACTCGCCTGGTTCGGCCAGGCTGCACGAGAGGAAGCCATCATTGTCTCAGTATAGTGAAGATGGGGAGATGATGAATTATGAAGGTGAGGATCTTGATCAGTTGGAGGAAGGGCATTTGGATGAGTTAGAAGGGGATAGGAATTCAAGGAAGAAGAAGGCATTGATAAGGAGATTTGGGGATCTTTTGATGAGGAGGAGGAGCTTTCAACAGAAGAAGGAACAATCACCTGAGTGAAGTTTTGTGATGTTCATGCAGTTTTGAATTTTTCCTTTTTCATCTTTTTGTAAAACTGTAAATTCATGCAAGTAGCTTACATTGTGTATTCACG

Coding sequence (CDS)

ATGGCGGAAGTCAAAGAGTTCAACTTCTTCGTCTTTCCAACCAACCATGTTCAACACGGATTCCGATTGTTTTTTAATTCTCGTTTGAGTAGTGGATTTGTTCCATCATCTTTCTCTGTTTTCGTCAATGATTCAACTCCTTGTAGTCGTTATGCGCTTTTAATTTTGTTTCCAGACTTAGATTTACGGGATATTAGAACTTTCCGACTCTATCTAGGTTACAGGATGGCTCGTACCAAGCAAACTGCCCGTAAGTCCACTGGAGGAAAGGCTCCAAGGAAGCAGCTGGCCACAAAGGCTGCTCGTAAGTCCGCCCCAACCACAGGAGGAGTGAAGAAGCCCCACAGATATCGTCCTGGTACTGTTGCTCTTCGTGAAATCCGTAAGTATCAAAAGAGTACTGAGCTTTTGATTAGGAAGTTGCCCTTCCAGAGGTTGGTTCGTGAAATTGCCCAGGACTTTAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTGGCATTGCAAGAGGCAGCCGAGGCCTACCTTGTTGGGTTGTTCGAGGACACCAATCTCTGTGCCATTCATGCAAAGCGCGTTACTATAATGCCTAAAGATATCCAGTTGGCTCGGAGAATCCGAGCGACGTTGGTAGATAGTTATACTAGAAAAACTTGTAGTGAATATGAGGGTTTGATCAGAGTTTATGAAATGCATCAACAAATGGAGAAGACAGATGAGAACTTCATCAATGCGGCAGAGGAAGAGGAAAAAGATTCAACAGAGCTTCAAAAGCTTAAAGAGGTGAAGAAAATGGGGGATGAGGCCAATACAAAACTTAGTTCCGAGTTGGAATCTGTGGTTAGAAAAAGAGAATCCGCCATGGAAAAGGCTGAGGAATTGGCAGAGAAAATGGAGGAGCTCTCTGTTTTGAAACACTCTGAATCACAAACAGAAACAAGAATTGAACAACTCCAAAAAAAATACCAAATTTCCAAAGAATCAGAAGAGAAAACAAAGGAATTGCTATCAGCACAAACAAAACAGCTGGAACAGACAAAGATCTCCCTTGAAGAATCAAAGCTACAAATCCAATCCCTTCACGAAAAACTCCAGAAATATTCTTCCAACACAAATTACAATCACAACAACATTCCAAGAAACCACGTTTCCGACAGCCTAAAATCGGAACTTGAATCGACAAAACACCAACTGGGTCTGCTAAAAAATGAGCTAAAATTGGCCACAGAAGCAGAGGAGAACAACAAGAAAGCAATGGACGATCTAGCAATGGCATTGAAAGAGGTAGCAACAGAAGCCAATCAATTGAAGGGAAAATACAGCACAATCCAAGAAGAATTAAAGCAAACAAAAGAGGAAACAGAGAATTTGAAAACAACATTGAAAAACACAGAGGAAAAGTACAAAACTTTGTTACAAGAGGCAAGAAGAGATGCAGATTTATACAAAAGCACAGTGGATAGGCTGAGATTAGAAGCAGAGGAGTCCTTATTGGCATGGAGTGGGAGAGAAACAAGCTTAGTGGATTGCATAAGAAGAGCTGAAGATGACAGATACAATGCTCAGCAAGAGAATCGCCGTTTAATGGATGCTCTGAGGCTAGCAGAGCTCAAGAACATGACATCAAAAGAGGAGATTAAGAAACTGAGGGACATTCTAAAACAAGCCTTAAATGAAGCCACTGTGGCCAAAGAAGCTGCAGGGATTGCAATTGAAGAGAATTCACAGCTCAAAGACTGTTTGGCTGAGAAGGAAAATGCCTTGGATTTTGTGAGCAGTGAGAATGAGACTCTCAAAGTCAGCCAAGCCTCAGCTCTAGAGGAGATTAAAGAGCTGAAGCAATTGCTAGAAGAAGCAAACAAAAGAGAAGAAAACAGTAAAGAGGAAAGCAAGACCAAAGAAGAAGCCAAGGAGCAAGTGGAGATGGGCAAATCGAAGCCGCCGTTGAGTCCAAGTCCAAATCAGAATCCAACTCCGGCTCCAGCCGAGAAGGAAGATACATTTGGGAAAAGGCTGGGGAAGGCTTTCAGTTTCAGTTTCTTGGAGTTGAGAATCACACCACAGAAGAAGGAGGTGGAAGAAGAAGCAGCAGAGGAGGAGCCTGAAATGGAGGAGACGCTGAAAGGGTCGATTTTCGATGAAAATGTTGACTCGCCTGGTTCGGCCAGGCTGCACGAGAGGAAGCCATCATTGTCTCAGTATAGTGAAGATGGGGAGATGATGAATTATGAAGGTGAGGATCTTGATCAGTTGGAGGAAGGGCATTTGGATGAGTTAGAAGGGGATAGGAATTCAAGGAAGAAGAAGGCATTGATAAGGAGATTTGGGGATCTTTTGATGAGGAGGAGGAGCTTTCAACAGAAGAAGGAACAATCACCTGAGTGA

Protein sequence

MAEVKEFNFFVFPTNHVQHGFRLFFNSRLSSGFVPSSFSVFVNDSTPCSRYALLILFPDLDLRDIRTFRLYLGYRMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRATLVDSYTRKTCSEYEGLIRVYEMHQQMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRESAMEKAEELAEKMEELSVLKHSESQTETRIEQLQKKYQISKESEEKTKELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTNYNHNNIPRNHVSDSLKSELESTKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKGKYSTIQEELKQTKEETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKELKQLLEEANKREENSKEESKTKEEAKEQVEMGKSKPPLSPSPNQNPTPAPAEKEDTFGKRLGKAFSFSFLELRITPQKKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSEDGEMMNYEGEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE
Homology
BLAST of Cla97C07G144470 vs. NCBI nr
Match: KAG6600643.1 (hypothetical protein SDJN03_05876, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 956.4 bits (2471), Expect = 1.5e-274
Identity = 587/778 (75.45%), Postives = 650/778 (83.55%), Query Frame = 0

Query: 39   SVFVNDSTPCSRYALLILFPDLDLRDIRTFRLYLG----YRMARTKQTARKSTGGKAPRK 98
            SV  N    CS  A+  L   L++ D+  F         + MARTKQTARKSTGGKAPRK
Sbjct: 525  SVASNSGVVCS--AIRRLAKHLNVADLFNFHERRSSAKQHEMARTKQTARKSTGGKAPRK 584

Query: 99   QLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDF 158
            QLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDF
Sbjct: 585  QLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDF 644

Query: 159  KTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRATLVDSY 218
            KTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR       
Sbjct: 645  KTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR------- 704

Query: 219  TRKTCSEYEGLIRVYEMHQQMEKTDENFINAAEEEEKDSTEL-QKLKEVKKMGDEANTKL 278
                    E LIRV EMHQQMEKTDE  I  AEE++ DS+EL  +L+E+KKM DE NTKL
Sbjct: 705  --------ERLIRVSEMHQQMEKTDEK-IRVAEEKQNDSSELNDQLEEMKKMADETNTKL 764

Query: 279  SSELESVVRKRESAMEKAEEL-----------AEKMEELSVLKHSESQTETRIEQLQKKY 338
             SELESV  KR+SAMEKA+EL           A++ EELSVLK  ESQT+TRI++L+KKY
Sbjct: 765  RSELESVKSKRDSAMEKAKELELQLAEKSSNMAKQKEELSVLKRFESQTQTRIQELEKKY 824

Query: 339  QISKESEEKTKELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTNYNH---NNIP 398
            Q SKESEEKTKELL+ QTK LEQTKISLEESK++I SLHEKL K+S+ T++N     NIP
Sbjct: 825  QNSKESEEKTKELLAEQTKHLEQTKISLEESKIEILSLHEKLVKFSTETHFNELPTYNIP 884

Query: 399  RNHVSDSLKSELESTKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKGK 458
              +  + LK EL+ST+HQLG+LKNELK+ TEAEENNK AMDDLAMALKEVATEA+ LK K
Sbjct: 885  TKNEFERLKFELQSTRHQLGVLKNELKVTTEAEENNKTAMDDLAMALKEVATEAHHLKRK 944

Query: 459  YSTIQEELKQTKEETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLLA 518
             ST ++EL++TKEE + LKTTLKNTEEKYK+LLQEARR+ADLYKSTVDRLRLEAEESLLA
Sbjct: 945  CSTTEKELQKTKEEADYLKTTLKNTEEKYKSLLQEARREADLYKSTVDRLRLEAEESLLA 1004

Query: 519  WSGRETSLVDCIRRAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNE 578
            WSGRETSLVDCIRRAEDDR+NAQQENRRLMD LRLAELKNMTSKEEIKKLRDILKQALNE
Sbjct: 1005 WSGRETSLVDCIRRAEDDRFNAQQENRRLMDTLRLAELKNMTSKEEIKKLRDILKQALNE 1064

Query: 579  ATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKELKQLLEEA 638
            ATVAKEAAGIAIEENSQLKD LAEKENALDFVSSENETLKV++A+ALEEIKELKQLLE +
Sbjct: 1065 ATVAKEAAGIAIEENSQLKDSLAEKENALDFVSSENETLKVNKAAALEEIKELKQLLEAS 1124

Query: 639  NKREENSKEESKTKEEAKEQV--EMGKSKPPLSPSPNQNPTPAPAEKEDTFGKRLGKAFS 698
             + E N KEE+K KEE KEQV  E+ +S+PPLSPSP+   TP P EKEDTFG+RLGKAFS
Sbjct: 1125 KRGESNGKEENKGKEEGKEQVEKEITRSRPPLSPSPSL--TPPPVEKEDTFGRRLGKAFS 1184

Query: 699  FSFLELRITPQ-KKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSE 758
            FSFLELR+T + KKEVEE+  E EP+MEETLKGSIFDE VDSPGS R+HERK SLSQ+  
Sbjct: 1185 FSFLELRLTSEKKKEVEED--EGEPQMEETLKGSIFDE-VDSPGSGRVHERKRSLSQFDG 1244

Query: 759  DGEMMNYEGEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE 795
            D +++N E EDL+ LEEG+LD  EGDRNSRKKKALIRRFGDLLMRRRSF QKKEQSPE
Sbjct: 1245 DRDILNDEIEDLEHLEEGNLDGEEGDRNSRKKKALIRRFGDLLMRRRSF-QKKEQSPE 1278

BLAST of Cla97C07G144470 vs. NCBI nr
Match: XP_038878303.1 (WEB family protein At5g16730, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 862.8 bits (2228), Expect = 2.3e-246
Identity = 514/590 (87.12%), Postives = 535/590 (90.68%), Query Frame = 0

Query: 226 IRVYEMHQQMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRE 285
           I VYEMHQQMEKTDE F  A EEE++DSTELQ LKEVKKM DE NTKL+SELES   K E
Sbjct: 11  ILVYEMHQQMEKTDEKFNVAEEEEKEDSTELQNLKEVKKMADETNTKLTSELESAKIKIE 70

Query: 286 SAMEKAEE-----LAEKMEELSVLKHSESQTETRIEQLQKKYQISKESEEKTKELLSAQT 345
           SAME AEE     L EK EELSVLK SESQ  TRIE+L+KKYQISKESEEKTK+LL AQT
Sbjct: 71  SAMEMAEELELLQLPEKTEELSVLKRSESQ--TRIEELEKKYQISKESEEKTKDLLLAQT 130

Query: 346 KQLEQTKISLEESKLQIQSLHEKLQKYSSNTNYNHNNIPRNHVSDSLKSELESTKHQLGL 405
           K LEQTKISLEESKL+I+SL EKLQKYSSNT+YN+ NIPRNH  DSLKSELESTK QLGL
Sbjct: 131 KHLEQTKISLEESKLEIKSLQEKLQKYSSNTDYNY-NIPRNHEFDSLKSELESTKRQLGL 190

Query: 406 LKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKGKYSTIQEELKQTKEETENLKTT 465
           LKNELKL TEAEENNKKAMDDLAMALKEVATEAN LKGKYS  +EELKQTKEETENLKTT
Sbjct: 191 LKNELKLTTEAEENNKKAMDDLAMALKEVATEANHLKGKYSMSEEELKQTKEETENLKTT 250

Query: 466 LKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAEDDRYN 525
           LKNTEEKYKTLLQEARR+ADLYKSTVDRLRLEAEESL+AWSGRETSLVDCIRRAEDDRYN
Sbjct: 251 LKNTEEKYKTLLQEARREADLYKSTVDRLRLEAEESLVAWSGRETSLVDCIRRAEDDRYN 310

Query: 526 AQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDC 585
           AQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDC
Sbjct: 311 AQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDC 370

Query: 586 LAEKENALDFVSSENETLKVSQASALEEIKELKQLLEEANKREEN-------SKEESKTK 645
           LAEKENALDFVSSENE+LKVSQASALEEIKELKQLLEEA KREEN       SKEESK+K
Sbjct: 371 LAEKENALDFVSSENESLKVSQASALEEIKELKQLLEEAKKREENNSKEESKSKEESKSK 430

Query: 646 EEAKE--QVEMGKSKPPLSPSPNQ------NPTPAPAEKEDTFGKRLGKAFSFSFLELRI 705
           EE KE  QVE+ KSKPPLSPSPNQ      +P+PAPAEKEDTFGKRLGKAFSFSFLELRI
Sbjct: 431 EEGKEQQQVEITKSKPPLSPSPNQHPSPSPSPSPAPAEKEDTFGKRLGKAFSFSFLELRI 490

Query: 706 TPQKKEVEEEA-AEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSEDGEMMNYE 765
           TPQKKEVEEEA  EEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQ+SEDGEMMNYE
Sbjct: 491 TPQKKEVEEEAEEEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQFSEDGEMMNYE 550

Query: 766 GEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE 795
           GEDLDQLEEG+LD+ EGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE
Sbjct: 551 GEDLDQLEEGNLDDFEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE 597

BLAST of Cla97C07G144470 vs. NCBI nr
Match: XP_038878314.1 (WEB family protein At5g16730, chloroplastic isoform X2 [Benincasa hispida])

HSP 1 Score: 856.3 bits (2211), Expect = 2.1e-244
Identity = 510/585 (87.18%), Postives = 531/585 (90.77%), Query Frame = 0

Query: 231 MHQQMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRESAMEK 290
           MHQQMEKTDE F  A EEE++DSTELQ LKEVKKM DE NTKL+SELES   K ESAME 
Sbjct: 1   MHQQMEKTDEKFNVAEEEEKEDSTELQNLKEVKKMADETNTKLTSELESAKIKIESAMEM 60

Query: 291 AEE-----LAEKMEELSVLKHSESQTETRIEQLQKKYQISKESEEKTKELLSAQTKQLEQ 350
           AEE     L EK EELSVLK SESQ  TRIE+L+KKYQISKESEEKTK+LL AQTK LEQ
Sbjct: 61  AEELELLQLPEKTEELSVLKRSESQ--TRIEELEKKYQISKESEEKTKDLLLAQTKHLEQ 120

Query: 351 TKISLEESKLQIQSLHEKLQKYSSNTNYNHNNIPRNHVSDSLKSELESTKHQLGLLKNEL 410
           TKISLEESKL+I+SL EKLQKYSSNT+YN+ NIPRNH  DSLKSELESTK QLGLLKNEL
Sbjct: 121 TKISLEESKLEIKSLQEKLQKYSSNTDYNY-NIPRNHEFDSLKSELESTKRQLGLLKNEL 180

Query: 411 KLATEAEENNKKAMDDLAMALKEVATEANQLKGKYSTIQEELKQTKEETENLKTTLKNTE 470
           KL TEAEENNKKAMDDLAMALKEVATEAN LKGKYS  +EELKQTKEETENLKTTLKNTE
Sbjct: 181 KLTTEAEENNKKAMDDLAMALKEVATEANHLKGKYSMSEEELKQTKEETENLKTTLKNTE 240

Query: 471 EKYKTLLQEARRDADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAEDDRYNAQQEN 530
           EKYKTLLQEARR+ADLYKSTVDRLRLEAEESL+AWSGRETSLVDCIRRAEDDRYNAQQEN
Sbjct: 241 EKYKTLLQEARREADLYKSTVDRLRLEAEESLVAWSGRETSLVDCIRRAEDDRYNAQQEN 300

Query: 531 RRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDCLAEKE 590
           RRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDCLAEKE
Sbjct: 301 RRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDCLAEKE 360

Query: 591 NALDFVSSENETLKVSQASALEEIKELKQLLEEANKREEN-------SKEESKTKEEAKE 650
           NALDFVSSENE+LKVSQASALEEIKELKQLLEEA KREEN       SKEESK+KEE KE
Sbjct: 361 NALDFVSSENESLKVSQASALEEIKELKQLLEEAKKREENNSKEESKSKEESKSKEEGKE 420

Query: 651 --QVEMGKSKPPLSPSPNQ------NPTPAPAEKEDTFGKRLGKAFSFSFLELRITPQKK 710
             QVE+ KSKPPLSPSPNQ      +P+PAPAEKEDTFGKRLGKAFSFSFLELRITPQKK
Sbjct: 421 QQQVEITKSKPPLSPSPNQHPSPSPSPSPAPAEKEDTFGKRLGKAFSFSFLELRITPQKK 480

Query: 711 EVEEEA-AEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSEDGEMMNYEGEDLD 770
           EVEEEA  EEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQ+SEDGEMMNYEGEDLD
Sbjct: 481 EVEEEAEEEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQFSEDGEMMNYEGEDLD 540

Query: 771 QLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE 795
           QLEEG+LD+ EGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE
Sbjct: 541 QLEEGNLDDFEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE 582

BLAST of Cla97C07G144470 vs. NCBI nr
Match: XP_008453807.1 (PREDICTED: WEB family protein At5g16730, chloroplastic [Cucumis melo] >KAA0044714.1 WEB family protein [Cucumis melo var. makuwa] >TYK16873.1 WEB family protein [Cucumis melo var. makuwa])

HSP 1 Score: 815.8 bits (2106), Expect = 3.2e-232
Identity = 484/608 (79.61%), Postives = 519/608 (85.36%), Query Frame = 0

Query: 231 MHQQMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRESAMEK 290
           MHQQMEK D  F      EE DSTEL K K+VKKM +E NTKL+SELESV R +E A+EK
Sbjct: 1   MHQQMEKIDMKF----NVEENDSTELPKFKDVKKMANEINTKLNSELESVKRNKEFAVEK 60

Query: 291 AE--------------------------------------ELAEKMEELSVLKHSESQTE 350
           AE                                      +LAEK E+LSV K+SESQTE
Sbjct: 61  AEGLESVKRNKESTVEKAESLESVKRNKESAMEKAEGLELQLAEKTEKLSVSKNSESQTE 120

Query: 351 TRIEQLQKKYQISKESEEKTKELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTN 410
           +RIE+L+KKY+ISKESEEKTK+LL AQTKQLEQ KISLEESKL+IQSLHEKLQKYSSNTN
Sbjct: 121 SRIEELEKKYRISKESEEKTKDLLLAQTKQLEQAKISLEESKLEIQSLHEKLQKYSSNTN 180

Query: 411 YNHNNIPRNHVSDSLKSELESTKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATE 470
             ++NIP NH   SLK ELESTK  L LLKNELKLATEAEENNKKAMDDLAMALKEVATE
Sbjct: 181 NYNHNIPGNHEFGSLKIELESTKKNLALLKNELKLATEAEENNKKAMDDLAMALKEVATE 240

Query: 471 ANQLKGKYSTIQEELKQTKEETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLE 530
           AN LKGKYS  +EELK+TKEE+ENL+TTLKN EEK KTLLQEAR++ADLYKSTVDRLRLE
Sbjct: 241 ANHLKGKYSISEEELKKTKEESENLRTTLKNIEEKNKTLLQEARKEADLYKSTVDRLRLE 300

Query: 531 AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDI 590
           AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRL D+LRLAELKNMTSKEEIKKLRDI
Sbjct: 301 AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLTDSLRLAELKNMTSKEEIKKLRDI 360

Query: 591 LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL 650
           LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL
Sbjct: 361 LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL 420

Query: 651 KQLLEEANKREEN------SKEESKTKEEAKEQVEMGKSKPPLSPSPNQNPTPAPAEKED 710
           KQLLEEA K+EEN      SKEESK+KEE KEQVE+ KSKPPLSP PNQNP+P+PAEKED
Sbjct: 421 KQLLEEATKKEENIKEESKSKEESKSKEEGKEQVEITKSKPPLSPCPNQNPSPSPAEKED 480

Query: 711 TFGKRLGKAFSFSFLELRITPQKKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHE 770
           TFGKR GKAFSFSFLELR+ PQKKEVEEEA +EE EMEETLKGSIFDENVDSPGSARLHE
Sbjct: 481 TFGKRFGKAFSFSFLELRVAPQKKEVEEEAEDEELEMEETLKGSIFDENVDSPGSARLHE 540

Query: 771 RKPSLSQYSEDGEMMNYEGEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQ 795
           R+PSLSQYSEDGE+M+YEGEDLDQLEEG+LDELEGDRNSRKKKAL+RRFGDLLMRRRSFQ
Sbjct: 541 RRPSLSQYSEDGELMHYEGEDLDQLEEGNLDELEGDRNSRKKKALMRRFGDLLMRRRSFQ 600

BLAST of Cla97C07G144470 vs. NCBI nr
Match: XP_031740609.1 (WEB family protein At5g16730, chloroplastic isoform X1 [Cucumis sativus] >XP_031745662.1 WEB family protein At5g16730, chloroplastic-like isoform X1 [Cucumis sativus] >XP_031745665.1 WEB family protein At5g16730, chloroplastic-like isoform X1 [Cucumis sativus])

HSP 1 Score: 768.8 bits (1984), Expect = 4.4e-218
Identity = 454/576 (78.82%), Postives = 493/576 (85.59%), Query Frame = 0

Query: 249 EEKDSTELQKLKEVKKMGDEANTKLSSE---------------LESVVRKRESAMEKAE- 308
           EE DSTEL K K+VKKM +E NTKL+SE               LESV R + SA+EKAE 
Sbjct: 20  EEDDSTELPKFKDVKKMANEINTKLNSELESRNKESAVGKAEGLESVKRNKNSAVEKAEG 79

Query: 309 --------------------ELAEKMEELSVLKHSESQTETRIEQLQKKYQISKESEEKT 368
                               +LAEK E+LSVLK+SE QTE+RIE+L+KKY+ISKESEEKT
Sbjct: 80  LKSVKRNKDSAVENAEGLKLQLAEKTEQLSVLKNSELQTESRIEELEKKYRISKESEEKT 139

Query: 369 KELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTNYNHNNIPRNHVSDSLKSELE 428
           K+L+ AQT+QLEQ K+SLEESKL+IQSLHEKLQK SSNTN +++NIP NH  +SLK ELE
Sbjct: 140 KDLILAQTEQLEQAKVSLEESKLEIQSLHEKLQKCSSNTNNDNHNIPGNHEFESLKFELE 199

Query: 429 STKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKGKYSTIQEELKQTKE 488
           STK  L LLKNELKLATEAEENNKKAMDDLAMALKEVATEAN  KGKYST +EELKQ KE
Sbjct: 200 STKQNLALLKNELKLATEAEENNKKAMDDLAMALKEVATEANHFKGKYSTSEEELKQRKE 259

Query: 489 ETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIR 548
           ETENL+TTLK  EEK KTLLQEAR++ADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIR
Sbjct: 260 ETENLRTTLKTIEEKNKTLLQEARKEADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIR 319

Query: 549 RAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIE 608
           RAEDDRYNAQQENRRLMD+LRLA+LKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIE
Sbjct: 320 RAEDDRYNAQQENRRLMDSLRLADLKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIE 379

Query: 609 ENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKELKQLLEEANKREEN------- 668
           ENSQLKDCL EKENALDFVS+ENETLKVSQASALEEIKELKQLLEEA K+E N       
Sbjct: 380 ENSQLKDCLVEKENALDFVSTENETLKVSQASALEEIKELKQLLEEATKKEGNGKEESKS 439

Query: 669 -----SKEESKTKEEAKEQVEMGKSKPPLSPSPNQNPTPAPAEKEDTFGKRLGKAFSFSF 728
                SKEESK KEE KEQVEM KSKPPLSPSPNQNP+P+PAEKEDTFGKRLGKAFSFSF
Sbjct: 440 KEENKSKEESKNKEEGKEQVEMTKSKPPLSPSPNQNPSPSPAEKEDTFGKRLGKAFSFSF 499

Query: 729 LELRITPQKKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSEDGEM 777
           LELRI+PQKKEVEEEA +EEPEMEETLKGSIFDENVDSPGSARLHER+PSLSQYSEDGE+
Sbjct: 500 LELRISPQKKEVEEEAEDEEPEMEETLKGSIFDENVDSPGSARLHERRPSLSQYSEDGEL 559

BLAST of Cla97C07G144470 vs. ExPASy Swiss-Prot
Match: P59169 (Histone H3.3 OS=Arabidopsis thaliana OX=3702 GN=HTR4 PE=1 SV=2)

HSP 1 Score: 250.0 bits (637), Expect = 9.1e-65
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. ExPASy Swiss-Prot
Match: Q6RUR1 (Histone H3.3 OS=Capsicum annuum OX=4072 PE=2 SV=3)

HSP 1 Score: 250.0 bits (637), Expect = 9.1e-65
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. ExPASy Swiss-Prot
Match: Q71V89 (Histone H3.3 OS=Gossypium hirsutum OX=3635 GN=HIS3 PE=2 SV=3)

HSP 1 Score: 250.0 bits (637), Expect = 9.1e-65
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. ExPASy Swiss-Prot
Match: Q3C2E5 (Histone H3.3 OS=Lolium multiflorum OX=4521 GN=RH3 PE=2 SV=3)

HSP 1 Score: 250.0 bits (637), Expect = 9.1e-65
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. ExPASy Swiss-Prot
Match: P69245 (Histone H3.3 OS=Lolium temulentum OX=34176 PE=2 SV=2)

HSP 1 Score: 250.0 bits (637), Expect = 9.1e-65
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. ExPASy TrEMBL
Match: A0A5A7TNL1 (WEB family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold130G00040 PE=4 SV=1)

HSP 1 Score: 815.8 bits (2106), Expect = 1.5e-232
Identity = 484/608 (79.61%), Postives = 519/608 (85.36%), Query Frame = 0

Query: 231 MHQQMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRESAMEK 290
           MHQQMEK D  F      EE DSTEL K K+VKKM +E NTKL+SELESV R +E A+EK
Sbjct: 1   MHQQMEKIDMKF----NVEENDSTELPKFKDVKKMANEINTKLNSELESVKRNKEFAVEK 60

Query: 291 AE--------------------------------------ELAEKMEELSVLKHSESQTE 350
           AE                                      +LAEK E+LSV K+SESQTE
Sbjct: 61  AEGLESVKRNKESTVEKAESLESVKRNKESAMEKAEGLELQLAEKTEKLSVSKNSESQTE 120

Query: 351 TRIEQLQKKYQISKESEEKTKELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTN 410
           +RIE+L+KKY+ISKESEEKTK+LL AQTKQLEQ KISLEESKL+IQSLHEKLQKYSSNTN
Sbjct: 121 SRIEELEKKYRISKESEEKTKDLLLAQTKQLEQAKISLEESKLEIQSLHEKLQKYSSNTN 180

Query: 411 YNHNNIPRNHVSDSLKSELESTKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATE 470
             ++NIP NH   SLK ELESTK  L LLKNELKLATEAEENNKKAMDDLAMALKEVATE
Sbjct: 181 NYNHNIPGNHEFGSLKIELESTKKNLALLKNELKLATEAEENNKKAMDDLAMALKEVATE 240

Query: 471 ANQLKGKYSTIQEELKQTKEETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLE 530
           AN LKGKYS  +EELK+TKEE+ENL+TTLKN EEK KTLLQEAR++ADLYKSTVDRLRLE
Sbjct: 241 ANHLKGKYSISEEELKKTKEESENLRTTLKNIEEKNKTLLQEARKEADLYKSTVDRLRLE 300

Query: 531 AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDI 590
           AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRL D+LRLAELKNMTSKEEIKKLRDI
Sbjct: 301 AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLTDSLRLAELKNMTSKEEIKKLRDI 360

Query: 591 LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL 650
           LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL
Sbjct: 361 LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL 420

Query: 651 KQLLEEANKREEN------SKEESKTKEEAKEQVEMGKSKPPLSPSPNQNPTPAPAEKED 710
           KQLLEEA K+EEN      SKEESK+KEE KEQVE+ KSKPPLSP PNQNP+P+PAEKED
Sbjct: 421 KQLLEEATKKEENIKEESKSKEESKSKEEGKEQVEITKSKPPLSPCPNQNPSPSPAEKED 480

Query: 711 TFGKRLGKAFSFSFLELRITPQKKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHE 770
           TFGKR GKAFSFSFLELR+ PQKKEVEEEA +EE EMEETLKGSIFDENVDSPGSARLHE
Sbjct: 481 TFGKRFGKAFSFSFLELRVAPQKKEVEEEAEDEELEMEETLKGSIFDENVDSPGSARLHE 540

Query: 771 RKPSLSQYSEDGEMMNYEGEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQ 795
           R+PSLSQYSEDGE+M+YEGEDLDQLEEG+LDELEGDRNSRKKKAL+RRFGDLLMRRRSFQ
Sbjct: 541 RRPSLSQYSEDGELMHYEGEDLDQLEEGNLDELEGDRNSRKKKALMRRFGDLLMRRRSFQ 600

BLAST of Cla97C07G144470 vs. ExPASy TrEMBL
Match: A0A1S3BYC2 (WEB family protein At5g16730, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103494423 PE=4 SV=1)

HSP 1 Score: 815.8 bits (2106), Expect = 1.5e-232
Identity = 484/608 (79.61%), Postives = 519/608 (85.36%), Query Frame = 0

Query: 231 MHQQMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRESAMEK 290
           MHQQMEK D  F      EE DSTEL K K+VKKM +E NTKL+SELESV R +E A+EK
Sbjct: 1   MHQQMEKIDMKF----NVEENDSTELPKFKDVKKMANEINTKLNSELESVKRNKEFAVEK 60

Query: 291 AE--------------------------------------ELAEKMEELSVLKHSESQTE 350
           AE                                      +LAEK E+LSV K+SESQTE
Sbjct: 61  AEGLESVKRNKESTVEKAESLESVKRNKESAMEKAEGLELQLAEKTEKLSVSKNSESQTE 120

Query: 351 TRIEQLQKKYQISKESEEKTKELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTN 410
           +RIE+L+KKY+ISKESEEKTK+LL AQTKQLEQ KISLEESKL+IQSLHEKLQKYSSNTN
Sbjct: 121 SRIEELEKKYRISKESEEKTKDLLLAQTKQLEQAKISLEESKLEIQSLHEKLQKYSSNTN 180

Query: 411 YNHNNIPRNHVSDSLKSELESTKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATE 470
             ++NIP NH   SLK ELESTK  L LLKNELKLATEAEENNKKAMDDLAMALKEVATE
Sbjct: 181 NYNHNIPGNHEFGSLKIELESTKKNLALLKNELKLATEAEENNKKAMDDLAMALKEVATE 240

Query: 471 ANQLKGKYSTIQEELKQTKEETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLE 530
           AN LKGKYS  +EELK+TKEE+ENL+TTLKN EEK KTLLQEAR++ADLYKSTVDRLRLE
Sbjct: 241 ANHLKGKYSISEEELKKTKEESENLRTTLKNIEEKNKTLLQEARKEADLYKSTVDRLRLE 300

Query: 531 AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDI 590
           AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRL D+LRLAELKNMTSKEEIKKLRDI
Sbjct: 301 AEESLLAWSGRETSLVDCIRRAEDDRYNAQQENRRLTDSLRLAELKNMTSKEEIKKLRDI 360

Query: 591 LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL 650
           LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL
Sbjct: 361 LKQALNEATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKEL 420

Query: 651 KQLLEEANKREEN------SKEESKTKEEAKEQVEMGKSKPPLSPSPNQNPTPAPAEKED 710
           KQLLEEA K+EEN      SKEESK+KEE KEQVE+ KSKPPLSP PNQNP+P+PAEKED
Sbjct: 421 KQLLEEATKKEENIKEESKSKEESKSKEEGKEQVEITKSKPPLSPCPNQNPSPSPAEKED 480

Query: 711 TFGKRLGKAFSFSFLELRITPQKKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHE 770
           TFGKR GKAFSFSFLELR+ PQKKEVEEEA +EE EMEETLKGSIFDENVDSPGSARLHE
Sbjct: 481 TFGKRFGKAFSFSFLELRVAPQKKEVEEEAEDEELEMEETLKGSIFDENVDSPGSARLHE 540

Query: 771 RKPSLSQYSEDGEMMNYEGEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQ 795
           R+PSLSQYSEDGE+M+YEGEDLDQLEEG+LDELEGDRNSRKKKAL+RRFGDLLMRRRSFQ
Sbjct: 541 RRPSLSQYSEDGELMHYEGEDLDQLEEGNLDELEGDRNSRKKKALMRRFGDLLMRRRSFQ 600

BLAST of Cla97C07G144470 vs. ExPASy TrEMBL
Match: A0A0A0KXS4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G026890 PE=4 SV=1)

HSP 1 Score: 787.7 bits (2033), Expect = 4.5e-224
Identity = 470/603 (77.94%), Postives = 510/603 (84.58%), Query Frame = 0

Query: 235 MEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSE---------------LES 294
           MEK    F      EE DSTEL K K+VKKM +E NTKL+SE               LES
Sbjct: 1   MEKVGMKF----NVEEDDSTELPKFKDVKKMANEINTKLNSELESRNKESAVGKAEGLES 60

Query: 295 VVRKRESAMEKAE---------------------ELAEKMEELSVLKHSESQTETRIEQL 354
           V R + SA+EKAE                     +LAEK E+LSVLK+SE QTE+RIE+L
Sbjct: 61  VKRNKNSAVEKAEGLKSVKRNKDSAVENAEGLKLQLAEKTEQLSVLKNSELQTESRIEEL 120

Query: 355 QKKYQISKESEEKTKELLSAQTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTNYNHNNI 414
           +KKY+ISKESEEKTK+L+ AQT+QLEQ K+SLEESKL+IQSLHEKLQK SSNTN +++NI
Sbjct: 121 EKKYRISKESEEKTKDLILAQTEQLEQAKVSLEESKLEIQSLHEKLQKCSSNTNNDNHNI 180

Query: 415 PRNHVSDSLKSELESTKHQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKG 474
           P NH  +SLK ELESTK  L LLKNELKLATEAEENNKKAMDDLAMALKEVATEAN  KG
Sbjct: 181 PGNHEFESLKFELESTKQNLALLKNELKLATEAEENNKKAMDDLAMALKEVATEANHFKG 240

Query: 475 KYSTIQEELKQTKEETENLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLL 534
           KYST +EELKQ KEETENL+TTLK  EEK KTLLQEAR++ADLYKSTVDRLRLEAEESLL
Sbjct: 241 KYSTSEEELKQRKEETENLRTTLKTIEEKNKTLLQEARKEADLYKSTVDRLRLEAEESLL 300

Query: 535 AWSGRETSLVDCIRRAEDDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALN 594
           AWSGRETSLVDCIRRAEDDRYNAQQENRRLMD+LRLA+LKNMTSKEEIKKLRDILKQALN
Sbjct: 301 AWSGRETSLVDCIRRAEDDRYNAQQENRRLMDSLRLADLKNMTSKEEIKKLRDILKQALN 360

Query: 595 EATVAKEAAGIAIEENSQLKDCLAEKENALDFVSSENETLKVSQASALEEIKELKQLLEE 654
           EATVAKEAAGIAIEENSQLKDCL EKENALDFVS+ENETLKVSQASALEEIKELKQLLEE
Sbjct: 361 EATVAKEAAGIAIEENSQLKDCLVEKENALDFVSTENETLKVSQASALEEIKELKQLLEE 420

Query: 655 ANKREEN------------SKEESKTKEEAKEQVEMGKSKPPLSPSPNQNPTPAPAEKED 714
           A K+E N            SKEESK KEE KEQVEM KSKPPLSPSPNQNP+P+PAEKED
Sbjct: 421 ATKKEGNGKEESKSKEENKSKEESKNKEEGKEQVEMTKSKPPLSPSPNQNPSPSPAEKED 480

Query: 715 TFGKRLGKAFSFSFLELRITPQKKEVEEEAAEEEPEMEETLKGSIFDENVDSPGSARLHE 774
           TFGKRLGKAFSFSFLELRI+PQKKEVEEEA +EEPEMEETLKGSIFDENVDSPGSARLHE
Sbjct: 481 TFGKRLGKAFSFSFLELRISPQKKEVEEEAEDEEPEMEETLKGSIFDENVDSPGSARLHE 540

Query: 775 RKPSLSQYSEDGEMMNYEGEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQ 790
           R+PSLSQYSEDGE+M+++GEDLDQLEEG+LDELEGDRNSRKKKALIRRFGDLLMRRRSFQ
Sbjct: 541 RRPSLSQYSEDGELMHFDGEDLDQLEEGNLDELEGDRNSRKKKALIRRFGDLLMRRRSFQ 599

BLAST of Cla97C07G144470 vs. ExPASy TrEMBL
Match: A0A6J1JDI4 (caldesmon-like OS=Cucurbita maxima OX=3661 GN=LOC111483512 PE=4 SV=1)

HSP 1 Score: 714.5 bits (1843), Expect = 4.8e-202
Identity = 442/591 (74.79%), Postives = 497/591 (84.09%), Query Frame = 0

Query: 234 QMEKTDENFINAAEEEEKDSTELQKLKEVKKMGDEANTKLSSELESVVRKRESAMEKAEE 293
           Q E+TD+  +  AEE+E D TEL    E++KM DE + +L +EL+SV R R+ AM KA+E
Sbjct: 2   QTEETDDKSV-VAEEDEIDLTELLMFNEIQKMADEVDIELITELKSVKRDRDLAMGKAKE 61

Query: 294 LAEKM-----------EELSVLKHSESQTETRIEQLQKKYQISKESEEKTKELLSAQTKQ 353
           L  K+           +ELSVLK SES+++TRIE+L+KKY+ISKESEEKTK+LLSAQTK 
Sbjct: 62  LELKLVEKDSNLAKLAKELSVLKRSESRSQTRIEELEKKYKISKESEEKTKDLLSAQTKH 121

Query: 354 LEQTKISLEESKLQIQSLHEKLQKYSSNTNYNHN---NIPRNHVSDSLKSELESTKHQLG 413
           LEQTKISLEESKL+IQSL EKL+K SS TNY +N   +I   +  D +KSELESTKH+L 
Sbjct: 122 LEQTKISLEESKLEIQSLREKLEKCSSITNYIYNPKQSITTKNEFDRMKSELESTKHRLF 181

Query: 414 LLKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKGKYSTIQEELKQTKEETENLKT 473
           LL NELKLATEAEENNKKAMDDLAMALKEVATEA+ LKGKYST++EEL QTK+E ENLKT
Sbjct: 182 LLNNELKLATEAEENNKKAMDDLAMALKEVATEASHLKGKYSTVEEELMQTKDEAENLKT 241

Query: 474 TLKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAEDDRY 533
           TLK TEEKYK+LLQ+AR+DADLYKSTVDRLRLEAEESL+AWSGRETSLVDCIR AE+DRY
Sbjct: 242 TLKYTEEKYKSLLQDARKDADLYKSTVDRLRLEAEESLVAWSGRETSLVDCIRSAEEDRY 301

Query: 534 NAQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKD 593
           NAQQENRRLMDALRLAELKNMTSKEEIKK+RDILKQALNEA+VAKEAAGIAIEENSQLKD
Sbjct: 302 NAQQENRRLMDALRLAELKNMTSKEEIKKVRDILKQALNEASVAKEAAGIAIEENSQLKD 361

Query: 594 CLAEKENALDFVSSENETLKVSQASALEEIKELKQLLEEANKREEN---------SKEES 653
           CLAEKENALDFVSSENETLKVSQA+ALEEIKELKQLLE+ANK+EEN         SKEE 
Sbjct: 362 CLAEKENALDFVSSENETLKVSQAAALEEIKELKQLLEDANKKEENKSKSKEEGKSKEEG 421

Query: 654 KTKEEAKEQVEMGKSKPPLS--PSPNQNPTPAPAEKEDTFGKRLGKAFSFSFLELRITPQ 713
           K+KEE KEQVE+ KSKPPLS  P+PN NP+P P EKEDTFG+RLGKAFSFSFL++RI P+
Sbjct: 422 KSKEEGKEQVEITKSKPPLSPCPTPNPNPSPTPVEKEDTFGRRLGKAFSFSFLDMRIMPE 481

Query: 714 -KKEVEEE---AAEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSEDGEMMNYE 773
            KKEVEEE     EEEPEMEETLKGSIFD+ VDSPGS R HERK S S   EDGEMMN  
Sbjct: 482 KKKEVEEEKEKEKEEEPEMEETLKGSIFDD-VDSPGSPRKHERKHSFSLCGEDGEMMN-- 541

Query: 774 GEDLDQLEEGHLDELEGDRNSRKKKALIRRFGDLLM-RRRSFQQKKEQSPE 795
            EDLD LEEG+LDEL+GDRNSRKKKAL+RRFGDLLM RRR+   KKEQSPE
Sbjct: 542 DEDLDPLEEGNLDELDGDRNSRKKKALMRRFGDLLMLRRRTVPPKKEQSPE 588

BLAST of Cla97C07G144470 vs. ExPASy TrEMBL
Match: A0A6J1FRQ0 (putative WEB family protein At1g65010, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111447844 PE=4 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 1.4e-201
Identity = 439/582 (75.43%), Postives = 498/582 (85.57%), Query Frame = 0

Query: 231 MHQQMEKTDENFINAAEEEEKDSTEL-QKLKEVKKMGDEANTKLSSELESVVRKRESAME 290
           MHQQMEKTDE     AEEE+ DS+EL  +L+E+KKM DE NTKL SELESV  KR+SAME
Sbjct: 1   MHQQMEKTDEK-TRVAEEEQNDSSELNDQLEEMKKMADETNTKLRSELESVKSKRDSAME 60

Query: 291 KAEEL-----------AEKMEELSVLKHSESQTETRIEQLQKKYQISKESEEKTKELLSA 350
           KA+EL           A++ EELSVLK  ESQT+TRI++L+KKYQ SKESEEKTKELL+ 
Sbjct: 61  KAKELELQLAEKSSNMAKQKEELSVLKRFESQTQTRIQELEKKYQNSKESEEKTKELLAE 120

Query: 351 QTKQLEQTKISLEESKLQIQSLHEKLQKYSSNTNYNH---NNIPRNHVSDSLKSELESTK 410
           QTK+LEQTKISLEESK++I SLHEKL K+S+ T++N    +NIP  +  + LK EL+ST+
Sbjct: 121 QTKRLEQTKISLEESKIEILSLHEKLVKFSTETHFNELPTHNIPTKYEYERLKFELQSTR 180

Query: 411 HQLGLLKNELKLATEAEENNKKAMDDLAMALKEVATEANQLKGKYSTIQEELKQTKEETE 470
           HQLG+LKNELK+ TEAEENNK AMDDLAMALKEVATEA+ LK K ST ++EL++TKEE +
Sbjct: 181 HQLGVLKNELKVTTEAEENNKTAMDDLAMALKEVATEAHHLKRKCSTTEKELQKTKEEAD 240

Query: 471 NLKTTLKNTEEKYKTLLQEARRDADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAE 530
            LKTTLKNTEEKYK+LLQEARR+ADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAE
Sbjct: 241 YLKTTLKNTEEKYKSLLQEARREADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAE 300

Query: 531 DDRYNAQQENRRLMDALRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENS 590
           DDR+NAQQENRRLMD LRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENS
Sbjct: 301 DDRFNAQQENRRLMDTLRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENS 360

Query: 591 QLKDCLAEKENALDFVSSENETLKVSQASALEEIKELKQLLEEANKREENSKEESKTKEE 650
           QLKD LAEKENALDFVSSENETLKV++A+ALEEIKELKQLLE + + E N KEE+K KEE
Sbjct: 361 QLKDSLAEKENALDFVSSENETLKVNKAAALEEIKELKQLLEASKRGESNGKEENKGKEE 420

Query: 651 AKEQV--EMGKSKPPLSPSPNQNPTPAPAEKEDTFGKRLGKAFSFSFLELRITPQ-KKEV 710
            KEQV  E+ +S+PPLSPSP+   TP P EKEDTFG+RLGKAFSFSFLELR+T + KKEV
Sbjct: 421 GKEQVEKEITRSRPPLSPSPSL--TPPPVEKEDTFGRRLGKAFSFSFLELRLTSEKKKEV 480

Query: 711 EEEAAEEEPEMEETLKGSIFDENVDSPGSARLHERKPSLSQYSEDGEMMNYEGEDLDQLE 770
           EE+  E EP+MEETLKGSIFDE VDSPGS R+HERK SLSQ+  D +++N E EDL+ LE
Sbjct: 481 EED--EGEPQMEETLKGSIFDE-VDSPGSGRVHERKRSLSQFDGDRDILNDEIEDLEHLE 540

Query: 771 EGHLDELEGDRNSRKKKALIRRFGDLLMRRRSFQQKKEQSPE 795
           EG+LD  EGDRNSRKKKALIRRFGDLLMRRRSF QKKEQSPE
Sbjct: 541 EGNLDGEEGDRNSRKKKALIRRFGDLLMRRRSF-QKKEQSPE 575

BLAST of Cla97C07G144470 vs. TAIR 10
Match: AT4G40030.2 (Histone superfamily protein )

HSP 1 Score: 251.5 bits (641), Expect = 2.2e-66
Identity = 140/158 (88.61%), Postives = 143/158 (90.51%), Query Frame = 0

Query: 54  LILFPDLDLRDIRTFRLY----LGYRMARTKQTARKSTGGKAPRKQLATKAARKSAPTTG 113
           L+L P  D   I  FR+     L  +MARTKQTARKSTGGKAPRKQLATKAARKSAPTTG
Sbjct: 4   LLLSPRSDFTTIE-FRVLSHSSLKIKMARTKQTARKSTGGKAPRKQLATKAARKSAPTTG 63

Query: 114 GVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQ 173
           GVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQ
Sbjct: 64  GVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQ 123

Query: 174 EAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR 208
           EAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR
Sbjct: 124 EAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIR 160

BLAST of Cla97C07G144470 vs. TAIR 10
Match: AT4G40030.1 (Histone superfamily protein )

HSP 1 Score: 250.0 bits (637), Expect = 6.4e-66
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. TAIR 10
Match: AT4G40040.1 (Histone superfamily protein )

HSP 1 Score: 250.0 bits (637), Expect = 6.4e-66
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. TAIR 10
Match: AT4G40040.2 (Histone superfamily protein )

HSP 1 Score: 250.0 bits (637), Expect = 6.4e-66
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of Cla97C07G144470 vs. TAIR 10
Match: AT4G40030.3 (Histone superfamily protein )

HSP 1 Score: 250.0 bits (637), Expect = 6.4e-66
Identity = 132/132 (100.00%), Postives = 132/132 (100.00%), Query Frame = 0

Query: 76  MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 135
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 136 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 195
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 196 MPKDIQLARRIR 208
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6600643.11.5e-27475.45hypothetical protein SDJN03_05876, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_038878303.12.3e-24687.12WEB family protein At5g16730, chloroplastic isoform X1 [Benincasa hispida][more]
XP_038878314.12.1e-24487.18WEB family protein At5g16730, chloroplastic isoform X2 [Benincasa hispida][more]
XP_008453807.13.2e-23279.61PREDICTED: WEB family protein At5g16730, chloroplastic [Cucumis melo] >KAA004471... [more]
XP_031740609.14.4e-21878.82WEB family protein At5g16730, chloroplastic isoform X1 [Cucumis sativus] >XP_031... [more]
Match NameE-valueIdentityDescription
P591699.1e-65100.00Histone H3.3 OS=Arabidopsis thaliana OX=3702 GN=HTR4 PE=1 SV=2[more]
Q6RUR19.1e-65100.00Histone H3.3 OS=Capsicum annuum OX=4072 PE=2 SV=3[more]
Q71V899.1e-65100.00Histone H3.3 OS=Gossypium hirsutum OX=3635 GN=HIS3 PE=2 SV=3[more]
Q3C2E59.1e-65100.00Histone H3.3 OS=Lolium multiflorum OX=4521 GN=RH3 PE=2 SV=3[more]
P692459.1e-65100.00Histone H3.3 OS=Lolium temulentum OX=34176 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
A0A5A7TNL11.5e-23279.61WEB family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold130G0... [more]
A0A1S3BYC21.5e-23279.61WEB family protein At5g16730, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103494... [more]
A0A0A0KXS44.5e-22477.94Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G026890 PE=4 SV=1[more]
A0A6J1JDI44.8e-20274.79caldesmon-like OS=Cucurbita maxima OX=3661 GN=LOC111483512 PE=4 SV=1[more]
A0A6J1FRQ01.4e-20175.43putative WEB family protein At1g65010, chloroplastic OS=Cucurbita moschata OX=36... [more]
Match NameE-valueIdentityDescription
AT4G40030.22.2e-6688.61Histone superfamily protein [more]
AT4G40030.16.4e-66100.00Histone superfamily protein [more]
AT4G40040.16.4e-66100.00Histone superfamily protein [more]
AT4G40040.26.4e-66100.00Histone superfamily protein [more]
AT4G40030.36.4e-66100.00Histone superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 315..370
NoneNo IPR availableCOILSCoilCoilcoord: 539..566
NoneNo IPR availableCOILSCoilCoilcoord: 511..531
NoneNo IPR availableCOILSCoilCoilcoord: 384..492
NoneNo IPR availableCOILSCoilCoilcoord: 277..304
NoneNo IPR availableCOILSCoilCoilcoord: 595..639
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 740..755
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 687..763
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 619..669
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 619..642
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 82..116
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 719..733
NoneNo IPR availablePANTHERPTHR23160:SF7MYOSIN HEAVY CHAIN-RELATED PROTEINcoord: 227..787
NoneNo IPR availablePANTHERPTHR23160SYNAPTONEMAL COMPLEX PROTEIN-RELATEDcoord: 227..787
IPR000164Histone H3/CENP-APRINTSPR00622HISTONEH3coord: 92..106
score: 92.32
coord: 189..210
score: 81.23
coord: 173..189
score: 96.93
coord: 109..130
score: 91.8
coord: 78..92
score: 97.1
coord: 133..150
score: 97.46
coord: 155..173
score: 82.38
IPR000164Histone H3/CENP-ASMARTSM00428h35coord: 109..211
e-value: 2.6E-71
score: 252.9
IPR000164Histone H3/CENP-APROSITEPS00322HISTONE_H3_1coord: 90..96
IPR000164Histone H3/CENP-APROSITEPS00959HISTONE_H3_2coord: 142..150
IPR009072Histone-foldGENE3D1.10.20.10Histone, subunit Acoord: 77..209
e-value: 1.4E-72
score: 243.9
IPR009072Histone-foldSUPERFAMILY47113Histone-foldcoord: 77..207
IPR007125Histone H2A/H2B/H3PFAMPF00125Histonecoord: 76..207
e-value: 6.4E-52
score: 175.4

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C07G144470.2Cla97C07G144470.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0000786 nucleosome
molecular_function GO:0003677 DNA binding
molecular_function GO:0046982 protein heterodimerization activity