Cla97C07G131680 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C07G131680
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Protein INVOLVED IN DE NOVO 2
Location: Cla97Chr07: 3287775 .. 3295694 (-)
RNA-Seq Expression: Cla97C07G131680
Synteny: Cla97C07G131680
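The location field above can be read as chromosome, start, end, and strand. The following is a minimal sketch of parsing it and deriving the gene span, assuming the coordinates are 1-based and inclusive (a common convention for GFF-derived displays, but not stated on this page). The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cla97Chr07: 3287775 .. 3295694 (-)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 7920 bp on the minus strand of Cla97Chr07.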
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGCCAAAATCCTAATGGTGGCTCGCCACTTCGGAAGGCTCAGAAGTCAATAAGCTAAACTTTGATGGTGAGAGCTCACAGGGTTCAGCCATGGAGATTGACGCTTTTGGAGTGTGTGTGAGCCTCTCTTCATTTCCCTAAACGAACGCGCTCACATGCGCGTACGATTTTCATCATCTTTTCCTCTTAAGCCTTTGCACTATATTGCGGACCCCGCCGCAGGCCCCAATTCAACTTTGTCGCATTTCTCCTGTGACTCAACTTTTCCAGTGTCTAGACTCAAGTCTTTACTCCAACCCCAAGTCTTCGATTCGCTCTCAATTCGCGAATTCAATTCCCCTCCTTTTCTTTCTCAGCTCACAGAAGCCTCCACCTGCTTGTTTTTCTGCATCAGACGCCTCTGGGTTAGGCGATTGTTGTCAGGTGATCTCTATGTTGTGTTTTTGGCTTTGCGAAGGTTTTCGTTCTTTTTTCCCTTTTCTGGATTTTGGGGTTGCTTCCTTTTGAATTGGGGTGGATTAGTGAGTTTTGATTTATCTGGGGCATTTCATGCATGGTTTTGCCTCTCTTGCATGATTATAATTGCAATGCATTTCTCTTTTGAGCTGGGGATTTGAGATTCTTGCATATTCCGATGTTTAACATTTGGTTTGGAATTCATCCTTGTTGTTCTTTTTGACTGATCTGGTTCTACTGTTCAGTTTTGGTTGTCATTACTGCGTTTTCTTTTCCTCCTTTCATGTGGGAAGGATGAATTTAGTCGTTGAGCTCTGGATATTTTAACTGTGGCATTCTCTAGATCTCTTTCTTTATTTTTTTTGAACGTGATCATGGGGTTGAGGATGGCTTTATCAGGGGTTTGTGGTTGTGCAAAAAACTCGGTAAAGAAATTTTCTTTACCTTATGATAAGTATAATTTGAAGGTGTCTTTGGTTTATTAAGCATGGTTAAACCTCTAGAAATTGCTTTATCTCTGTGTCTCCCTCTCTGCTGGACAATTTCATTCTATAATCTGTGAAGCTTCAATTGTGTCATCTATTGTTTTTCATTCTATAGATTCAAATGTCACCACCATAAATATGAATGTAACTTAAGAATTTTTTTTCCTTAGGTGCTTTTCTTCAAATTTATGGAAAGTTCTACTGATGATTCCGACGTAGACACCGATATGAGTGAATCTGAGTTGGGTGAGCGGGAAAGCAAGTCATATGATGAACTGAAAAATGGAAAACGCATTGTGAAACTCTCGCACGAGACATTTACTTGCCCCTACTGTACAAAAAAAAGAAAGAGGGATTTCTTATACAAGGATCTCCTGCAGCATGCTTCTGGGGTAGGAAACAGTCCTTCAAATAAACGGAGTACCAAAGAGAAAGCTAATCATTTAGCTCTATTGAAATATTTGGAAAAAGATCTAGCTGATGCTGTTGGTCCATCAAAACCCGCTAGCAATAATGATCCTGTTATGGATTGCAATCATGATGAAAAGTTTGTGTGGCCTTGGAGAGGAATTGTGGTAAACATTCCAACTAGGCGCACAGATGATGGGCGATACGTGGGAGGAAGCGGATCAAAGTTTAGGGATGAGTTGAAAGAAAGAGGATTTAATCCCACAAGGGTTACTCCTTTGTGGAATTACCGGGGCCACTCAGGTTGTGCTATTGTGGAATTTAATAAAGATTGGCCTGGTTTGCATAATGCTATTTCTTTTGAGAGGGCTTATGAGGCAGATCGTCATGGGAAAAAGGATTGGCTGGCTAATGGCACTACTACTGAGAAACTAGGAATTTATGCTTGGGTTGCTCGAGCTGATGATTACAACTCAAATAATATAGTTGGTGAACATTTACGCAAGATTGGAGACCTTAAGACCATATCTGAAATTATTCAGGAGGAAGCACGGAAGCAAGATAGACTTGTGTCCAATCTTACAAGTATCATTGAGCTCAAGAACAAGCACTTGATAGAGATGGAGAAAAGATGTAGTGAAACTGCCACCAC
CCTTAACAATTTGATGGGGGAGAGAGAGAAATTACTTCAAGCTTATAACGAAGGTTTCTTATATAGCTTTTAAAATGATTTTTCCTGCATTTTAAAACCAGGAAATTCGTTCCATGTTATTACATATGAGTTTTTCTTTCATGGATATTCCAAGTCAAATCATTGAATTTATGTAATATTTCTCACATGTACAGAGATAAAAAAGATCCAACTGGGTGCCAGGGATCACCTCAAGAAGATCTTCAGCGATCATGAAAAGCTGAAGTTGCAATTGGAATCTCAGAAAAAAGAGTTTGAGTTAAGAGGAAGAGAACTGGAGAAGCGTGAAGCACAAAATGAAAACGAGAGCAAGTATCTGGCTGAAGAAATTGAAAAGGTACACTTTTTGTCTTTCAATTTAAAAAAATATTGCAGGAAATAATATATGCATTAAATAGTGAAGCAATTAAGAGAAGGTACATTATTTAATGGTTGATTCTTCCGCAACTATTCTTGTGTTCATGATTTGCAGTGACGAAGGGAAAATTGAGTGTCTTCTTTCACCAGCTTTCACTGGTAGTTATGACAATTGCGAATGTGAAAACTTGAGATAGTCATTAATCTGATTGTCAGACATGTTTTAGGGGTTACTGTATATGAGCAATCATTCCAGTTCGCCTGAACTTTGGTTTTCAAATAAATTAATACCTGACACCTCTGATTTTTTTCCCCCTGTTGTTTACTCATCTATTTTTTTAAGCATAGTTTTCAATCGCACTACAGCTGAACAATGTTGTAGGCAAGATCTTTGTTGCATCCCATGGTCTTCACATTTTGGGATGTGCTTGCATGGTTGCATCTATAAGTTAGCAAATCAATGAAATGTTTTTTCAGGGTTTTTCAATTGCTTCTCACTTCTACCTTCACTGTCTAGTGCACAAGTTTGCTAATTTTTTTGACCACATAAATTAAAGCATCTTATATACATCCCTTATGCTGCACGCTTACACTTACAAGTTACAACTCCCAAACTCACAAATCAAACACGGGCAGCATAAACTTAGGTTCATGTAACTTTGTCTTTACACAGTCCAAAATCAGATTACACACCTCAAAGTTCTTCGTGCGTGGGCACTTGAAATTTCAGAAATTTTCATGAATTTTAAAATATGAAATTATGATTTTTTTTAAACAAAAAATGAATAAAATAAAGGAAAATTATCTTAAATGGCAAAATTGCTGAAACTATTTACAATTAATAGCAAAATACACAGTTTATTTGCTATAGATCGCGATAGACAGTGAAATTTTGCTATATTTGCAAATATTTTGGTTCCTTTTGCTACATTTGAAAAGAGCCCTACAATAAATATTTGATATATTTATCCTCTTTTCTTTTACTTCATTATTACATGAACTCATTTAACTTATTGCTCTTTTAACCCCCACCCCCCTTTTATTTATTTTTATTTATTTTTATTTATTTTTATTTTTTGCCTGATAAGAAACAAAGGACTATATTCATCAAGAGTGCTCTTTTAATTATTCAATTCTTTATGTTCCTCAGTTTATTAATTTCTTAATTTATTCTTTTATGGTAACCAATTAAATTTGTTTGGCTTCTTCAACTCTAACTCTAGATCCACCCCGCACATCTACACACAGAAAGAAGGCACATAATTGTGCAGTTTGCCTTGGACATATCAACTTGTTAGTGTTACCAACAAGGTGGTTTTATGTTGTGCTGATGCCTGAAGAAATGTTCTAGGATGCTCCATGTTGTTCAGAAACTTATTGATTAATATTTTCTGATTATTGCTTTCTGTAATGGCTGAACCAACTTCCATCCGGGTACATATAAATGCTGCAGTGTATGACTTAACGAGTGTGTGATGTGGTATCATGTTTTTGTTTTGTGTGTGTGTGTGTGTTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGGGGGGGGTATTCTTCTCTACAGAAGTCAATCAACGAGGTTTATTAATTTATTTTGTTTT
ATTATTTTTTTGAATCATTTGTCTCAACACTTGGGGCATGGAGAGGTTTTTCAGCATTCGCCTTGTATTCTTTTTTCTTTTCAGTATGAGGTGAGAAATAGTTCTCTTCAATTGGCTGAACTAGAGCAGCAGAAGGCTGATGAAGATTTTATGAAGCTGGCAGATGATCAGAAGGTTTGTGGCGTGTGTGCACTGTTAATCATCAGAATATTTCATTATCAAAACTAGAAACGAGGAATACTTATTTGTGATTTCTAATGATTCAGAAACAAAAGGAGGACCTTCATAATAGAATAATCCAACTGGAAAAACAGCTGGATGCCAAGCAAGCACTAGAGTTGGAAATTGAGCGTCTACGTGGGACGTTGAATGTCATGAAGCACATGGAAGATGATGAGGATGTGGAAGTCCTCCAGAAGGCAGAGTCAACACTAAAAGAGTTGAGTGAAAAGGAAGGAGAACTTGAAGCGCTCGATGAGCTTAACCAAGCATTGATAGTAAAGCAGCGCATGAGTAATGACGAGCTCCAAGAAGCTCGTAAAGAGATCACTAATGTAAGAGTATTTTCTTACTGCAAAGTACTTTCTAAGTTGTGAACAGGATAAAAGTATGCCACAGTTGTTCTTCGTGAACTATTTGTAAAAAGTGTTGATTTTAACCGGCGGGAAAGCTTGAAATTACTGTATAGGCAGCATGCCTGGAGGCTGTGTTTGTTATCCCATAAAGGTCTGATATATGTCAAATATGAATTTAAAGTCTTGGTCATGAGGGAGGTTGCATGCAACTTGATTCATTTTGTGTGTCAATGCATCTATTGAAGTTAGTGAATTCTATATTCCTGTAGCTGAATGGATATCTTTTCTCCTGTTTGTAAAATAAAATTTTCAAGTTAAAAATTCCCGACTCCTAATACAGTTTAATTCTCTCATTTCTCAGGCTTTTAAAGATTTGCCTGGTCGTTCTTACTTGCGTGTTAAGAGAATGGGCGAATTAGATACAAAACCATTCCATGAAGCAATGAAGAAAATATATAATGAGGATGAAGCAGATGAGAGAGCTTCAGAGCTGTGCTCATTGTGGGCAGAGTATCTCAAGGATCCAGACTGGCATCCCTTCAAAGTAATTAAAGTAGAAGGAAAAGATACTGCGGACGGAAAAGATAAGGTGACTCTTTTGCTAGCTCCTTTCATCAGTTTAGTTCATTGCTGCTAAGAAATGTAGGCATGTTTTTGTGGAAACTGATGTTGGATATCTATTTAGAATCAAATGAGTTAAAATACTGTTAGATATGCCATACCTAGCCATCTTGGATTTTCCTTTCTTACATACTAGGGATATTTCTTGCCAATATTGCATTTTCATTGATTAGAGAATCCACAATCTCCCCGTGGCCAAAAAAAAGAAGAAGAAGAAGAACTTGATAAGTATGTCTCTTGTTGATGTAGAAATAGAAATGTAATTGAACGCAGAAACTATTTTCTGTTCCTTGTTATGATTTCTTAACTAAGAAAATGCGCATTGAAAGTGTAATAATTTCTTGTCCTCTTCCTAAATGATTTCTTATTTCTGGTTCTCTGAATGTTGATTTGTGAGAAATGAAATTTCCACTGATTTGGTAACCATTTCGTTTCTTATTTTTTGTTTTTGAAAATTAAGCTTATTTCCTCCACTTACAATAATTTGCATCTTTCTTAAGTACAGTGGTTGGATTCTCAACCAAATTCCAAAAACAAAAACAAGTTTTTAAAAGTTACTTTTTTTAGTTTTCAAAATTTGGCTTGGTTTTTTAAACCATTGGTAAAAACTAAACAACAAAGGAAGAAATTTGGAGGTGGAAGTAGTGTCTATTAGGCCTAATTTTCAAAAACAAAAACTAAAAATGAAATAGTTACCAAATGGGACCTAAGTTATTTCATGTGTTTTTCTAAAAATCCATCTTCAATTTAAAGCCTAAATGATAAAAAATTGCTTTAACAATTTCGTTAGTATCAGCTTGGTCT
AATTCTTGGATTCAGAATTTGGGTCCGTTCTTAGCTAGTGTTGGGGTTCTTTTTGGTGAGCTTTTTTTTTGTATTCCCTTGTAGATTCTTTCATTTTTTTCAATGATAGCTCGGTCTTTTTTTAATTAAAAATAAATAAATAAATAGTACTCGACAACCATAATCTTGGCGTATGATTACCTCTGCATCTTACGGGGAAAAGTTTCTAGTAGAACTGCATTGCCTCTTCCCACCATTAAAAATATTAAGCCACATGACTAGTTGGTATAATATTTCATTATGTAGGCTCCAAATTGTACTTAAATATTGCTAATATAGTTGATATCAAACTAGTATGGTATAATGTCTGTAGGCTCCAAAATTTTCTTAAATATTGCTGATAAGTTGAAACCAGAATAGTATGGAGTACCATCCATAGTTAATGTTGTTATTCCAGTCATCTATCTTAGTGGCGCACAGGAGCCGCAGATTGAACATATATCATTGTAATAGATGTTATACACAACAGCGATTATATCTCACTTTAGCTTCATATGTAGTGCATTTATTTTTGCATAGACATTCTTGAATTTGTATTTTGATTTTTGGCTCACTAGGACGATTTATTATGCATGACTATTCAGTGTAGAAGAAAATCTGCAATTATATTAGCTTTCTTCATGCTTCTCTTTGTCGCTATTCTCGAGCTATGTACATTTTCCTTACCTGAAGAGGCTTTTTGATGTTTATGTTTAATTATTAGTCATTATGCTGTTATAGGAAATTGAAGTTTTGGATGATGAAGATGAGAAACTGAAAGGTCTGAAAAAGGATTATGGCGAGGAAGTATGCAAGGCTGTGACATCAGCTTTAATGGAGATCAATGAATACAATCCTAGTGGACGGTATATCACATCGGAGCTATGGAACTACCAAGAGGAAAGGAAAGCAACGTTGCGAGAGGGAGTAAGATATTTACTGGACAAGCTGGGTAGAAGCAACTAGAAAAGGGGAACACACCCTGAAGGTTGGTATTCAATATGGTTGATATACTTTACTGGCTTTTTGACTGGAAATAAGAATAAAATTGCAAATTAGAAGCTTTGTGTACATAATGCTAAATACCGCACAACTGATTTTGTGTGTGAACGCTGAATAAATTATATTCAAACTGGTTTCAATATCATCAAAATGTTCAAATTAGAACTTGTCATGTCAGTCTGAATCCTTTTCAAGGCATTGCTACTCCTACATTAGTCTTCTTGAAGTAATTTGCTTGGCTGCTGGTTTGCGTTATTTTGTCAATAACATTAGATTTATTGAACTGCTGCTGCAGCCATGATGATCAAACCAACATCATTGCCATGAAATAAAACGCACCGCGTTCTATCGGGTCAGAATTCTCGTATGGAGTTATCTGATCTTTGTTTAAAGAAAATCCAAACACAGGCATGGAGATGAAGTTCTCTTCTTTGCAATTATATGACGTCTAACCAGGAGATGGAATATGAGCCAAATAAGTTAAGTCAGTGTATGTTTTCAAGTCATTTCTCACCTGTGTATCAATTATTTCCAGATCATAGAGATTTCCATAATGCAGACCTGCTTATCTATTATCATTTACTGATCTTACTATTTAATTTGTTTCCAAAAAACCTATAAAACTACGTCATCTTCGATCCGAGGGCGACAGTTGTAACCTACATGTAGTGTGAGTCAAGAGACTGAGTGATTATTTCTATCTTATTCATGTTTCAATAATGTAAGTGCTTCTGAGCAATTTTATGAATAGCTTGGATAGCCATTGTTGTTGAACTCAGCTTGTTTATGAGGATGACTTGGAAAACAAGGTTTCATACCTTGTCATATGTGGTTTGAGACCCATTTGGTTTCTGCTTTGGAGATCATTTTTCTTTTAGAGATTATTAAGAAAATGGAGGT

mRNA sequence

GGCCAAAATCCTAATGGTGGCTCGCCACTTCGGAAGGCTCAGAAGTCAATAAGCTAAACTTTGATGGTGAGAGCTCACAGGGTTCAGCCATGGAGATTGACGCTTTTGGAGTGTGTGTGAGCCTCTCTTCATTTCCCTAAACGAACGCGCTCACATGCGCGTACGATTTTCATCATCTTTTCCTCTTAAGCCTTTGCACTATATTGCGGACCCCGCCGCAGGCCCCAATTCAACTTTGTCGCATTTCTCCTGTGACTCAACTTTTCCAGTGTCTAGACTCAAGTCTTTACTCCAACCCCAAGTCTTCGATTCGCTCTCAATTCGCGAATTCAATTCCCCTCCTTTTCTTTCTCAGCTCACAGAAGCCTCCACCTGCTTGTTTTTCTGCATCAGACGCCTCTGGGTTAGGCGATTGTTGTCAGGGGTTTGTGGTTGTGCAAAAAACTCGGTAAAGAAATTTTCTTTACCTTATGATAAGTATAATTTGAAGGTGCTTTTCTTCAAATTTATGGAAAGTTCTACTGATGATTCCGACGTAGACACCGATATGAGTGAATCTGAGTTGGGTGAGCGGGAAAGCAAGTCATATGATGAACTGAAAAATGGAAAACGCATTGTGAAACTCTCGCACGAGACATTTACTTGCCCCTACTGTACAAAAAAAAGAAAGAGGGATTTCTTATACAAGGATCTCCTGCAGCATGCTTCTGGGGTAGGAAACAGTCCTTCAAATAAACGGAGTACCAAAGAGAAAGCTAATCATTTAGCTCTATTGAAATATTTGGAAAAAGATCTAGCTGATGCTGTTGGTCCATCAAAACCCGCTAGCAATAATGATCCTGTTATGGATTGCAATCATGATGAAAAGTTTGTGTGGCCTTGGAGAGGAATTGTGGTAAACATTCCAACTAGGCGCACAGATGATGGGCGATACGTGGGAGGAAGCGGATCAAAGTTTAGGGATGAGTTGAAAGAAAGAGGATTTAATCCCACAAGGGTTACTCCTTTGTGGAATTACCGGGGCCACTCAGGTTGTGCTATTGTGGAATTTAATAAAGATTGGCCTGGTTTGCATAATGCTATTTCTTTTGAGAGGGCTTATGAGGCAGATCGTCATGGGAAAAAGGATTGGCTGGCTAATGGCACTACTACTGAGAAACTAGGAATTTATGCTTGGGTTGCTCGAGCTGATGATTACAACTCAAATAATATAGTTGGTGAACATTTACGCAAGATTGGAGACCTTAAGACCATATCTGAAATTATTCAGGAGGAAGCACGGAAGCAAGATAGACTTGTGTCCAATCTTACAAGTATCATTGAGCTCAAGAACAAGCACTTGATAGAGATGGAGAAAAGATGTAGTGAAACTGCCACCACCCTTAACAATTTGATGGGGGAGAGAGAGAAATTACTTCAAGCTTATAACGAAGAGATAAAAAAGATCCAACTGGGTGCCAGGGATCACCTCAAGAAGATCTTCAGCGATCATGAAAAGCTGAAGTTGCAATTGGAATCTCAGAAAAAAGAGTTTGAGTTAAGAGGAAGAGAACTGGAGAAGCGTGAAGCACAAAATGAAAACGAGAGCAAGTATCTGGCTGAAGAAATTGAAAAGTATGAGGTGAGAAATAGTTCTCTTCAATTGGCTGAACTAGAGCAGCAGAAGGCTGATGAAGATTTTATGAAGCTGGCAGATGATCAGAAGAAACAAAAGGAGGACCTTCATAATAGAATAATCCAACTGGAAAAACAGCTGGATGCCAAGCAAGCACTAGAGTTGGAAATTGAGCGTCTACGTGGGACGTTGAATGTCATGAAGCACATGGAAGATGATGAGGATGTGGAAGTCCTCCAGAAGGCAGAGTCAACACTAAAAGAGTTGAGTGAAAAGGAAGGAGAACTTGAAGCGCTCGATGAGCTTAACCAAGCATTGATAGTAAAGCAGCGCATGAGTAATGACGAGCTCCAAGAAGCTCGTAAAGAGATCACTAATGCTTTTA
AAGATTTGCCTGGTCGTTCTTACTTGCGTGTTAAGAGAATGGGCGAATTAGATACAAAACCATTCCATGAAGCAATGAAGAAAATATATAATGAGGATGAAGCAGATGAGAGAGCTTCAGAGCTGTGCTCATTGTGGGCAGAGTATCTCAAGGATCCAGACTGGCATCCCTTCAAAGTAATTAAAGTAGAAGGAAAAGATACTGCGGACGGAAAAGATAAGGAAATTGAAGTTTTGGATGATGAAGATGAGAAACTGAAAGGTCTGAAAAAGGATTATGGCGAGGAAGTATGCAAGGCTGTGACATCAGCTTTAATGGAGATCAATGAATACAATCCTAGTGGACGGTATATCACATCGGAGCTATGGAACTACCAAGAGGAAAGGAAAGCAACGTTGCGAGAGGGAGTAAGATATTTACTGGACAAGCTGGGTAGAAGCAACTAGAAAAGGGGAACACACCCTGAAGCCATGATGATCAAACCAACATCATTGCCATGAAATAAAACGCACCGCGTTCTATCGGGTCAGAATTCTCGTATGGAGTTATCTGATCTTTGTTTAAAGAAAATCCAAACACAGGCATGGAGATGAAGTTCTCTTCTTTGCAATTATATGACGTCTAACCAGGAGATGGAATATGAGCCAAATAAGTTAAGTCAGTGTATGTTTTCAAGTCATTTCTCACCTGTGTATCAATTATTTCCAGATCATAGAGATTTCCATAATGCAGACCTGCTTATCTATTATCATTTACTGATCTTACTATTTAATTTGTTTCCAAAAAACCTATAAAACTACGTCATCTTCGATCCGAGGGCGACAGTTGTAACCTACATGTAGTGTGAGTCAAGAGACTGAGTGATTATTTCTATCTTATTCATGTTTCAATAATGTAAGTGCTTCTGAGCAATTTTATGAATAGCTTGGATAGCCATTGTTGTTGAACTCAGCTTGTTTATGAGGATGACTTGGAAAACAAGGTTTCATACCTTGTCATATGTGGTTTGAGACCCATTTGGTTTCTGCTTTGGAGATCATTTTTCTTTTAGAGATTATTAAGAAAATGGAGGT

Coding sequence (CDS)

ATGCGCGTACGATTTTCATCATCTTTTCCTCTTAAGCCTTTGCACTATATTGCGGACCCCGCCGCAGGCCCCAATTCAACTTTGTCGCATTTCTCCTGTGACTCAACTTTTCCAGTGTCTAGACTCAAGTCTTTACTCCAACCCCAAGTCTTCGATTCGCTCTCAATTCGCGAATTCAATTCCCCTCCTTTTCTTTCTCAGCTCACAGAAGCCTCCACCTGCTTGTTTTTCTGCATCAGACGCCTCTGGGTTAGGCGATTGTTGTCAGGGGTTTGTGGTTGTGCAAAAAACTCGGTAAAGAAATTTTCTTTACCTTATGATAAGTATAATTTGAAGGTGCTTTTCTTCAAATTTATGGAAAGTTCTACTGATGATTCCGACGTAGACACCGATATGAGTGAATCTGAGTTGGGTGAGCGGGAAAGCAAGTCATATGATGAACTGAAAAATGGAAAACGCATTGTGAAACTCTCGCACGAGACATTTACTTGCCCCTACTGTACAAAAAAAAGAAAGAGGGATTTCTTATACAAGGATCTCCTGCAGCATGCTTCTGGGGTAGGAAACAGTCCTTCAAATAAACGGAGTACCAAAGAGAAAGCTAATCATTTAGCTCTATTGAAATATTTGGAAAAAGATCTAGCTGATGCTGTTGGTCCATCAAAACCCGCTAGCAATAATGATCCTGTTATGGATTGCAATCATGATGAAAAGTTTGTGTGGCCTTGGAGAGGAATTGTGGTAAACATTCCAACTAGGCGCACAGATGATGGGCGATACGTGGGAGGAAGCGGATCAAAGTTTAGGGATGAGTTGAAAGAAAGAGGATTTAATCCCACAAGGGTTACTCCTTTGTGGAATTACCGGGGCCACTCAGGTTGTGCTATTGTGGAATTTAATAAAGATTGGCCTGGTTTGCATAATGCTATTTCTTTTGAGAGGGCTTATGAGGCAGATCGTCATGGGAAAAAGGATTGGCTGGCTAATGGCACTACTACTGAGAAACTAGGAATTTATGCTTGGGTTGCTCGAGCTGATGATTACAACTCAAATAATATAGTTGGTGAACATTTACGCAAGATTGGAGACCTTAAGACCATATCTGAAATTATTCAGGAGGAAGCACGGAAGCAAGATAGACTTGTGTCCAATCTTACAAGTATCATTGAGCTCAAGAACAAGCACTTGATAGAGATGGAGAAAAGATGTAGTGAAACTGCCACCACCCTTAACAATTTGATGGGGGAGAGAGAGAAATTACTTCAAGCTTATAACGAAGAGATAAAAAAGATCCAACTGGGTGCCAGGGATCACCTCAAGAAGATCTTCAGCGATCATGAAAAGCTGAAGTTGCAATTGGAATCTCAGAAAAAAGAGTTTGAGTTAAGAGGAAGAGAACTGGAGAAGCGTGAAGCACAAAATGAAAACGAGAGCAAGTATCTGGCTGAAGAAATTGAAAAGTATGAGGTGAGAAATAGTTCTCTTCAATTGGCTGAACTAGAGCAGCAGAAGGCTGATGAAGATTTTATGAAGCTGGCAGATGATCAGAAGAAACAAAAGGAGGACCTTCATAATAGAATAATCCAACTGGAAAAACAGCTGGATGCCAAGCAAGCACTAGAGTTGGAAATTGAGCGTCTACGTGGGACGTTGAATGTCATGAAGCACATGGAAGATGATGAGGATGTGGAAGTCCTCCAGAAGGCAGAGTCAACACTAAAAGAGTTGAGTGAAAAGGAAGGAGAACTTGAAGCGCTCGATGAGCTTAACCAAGCATTGATAGTAAAGCAGCGCATGAGTAATGACGAGCTCCAAGAAGCTCGTAAAGAGATCACTAATGCTTTTAAAGATTTGCCTGGTCGTTCTTACTTGCGTGTTAAGAGAATGGGCGAATTAGATACAAAACCATTCCATGAAGCAATGAAGAAAATATATAATGAGGATGAAGCAGATGAGAGAGCTTCAGAGCTGTGCTCATTGTGGGCAGAGTATCTCAAGGA
TCCAGACTGGCATCCCTTCAAAGTAATTAAAGTAGAAGGAAAAGATACTGCGGACGGAAAAGATAAGGAAATTGAAGTTTTGGATGATGAAGATGAGAAACTGAAAGGTCTGAAAAAGGATTATGGCGAGGAAGTATGCAAGGCTGTGACATCAGCTTTAATGGAGATCAATGAATACAATCCTAGTGGACGGTATATCACATCGGAGCTATGGAACTACCAAGAGGAAAGGAAAGCAACGTTGCGAGAGGGAGTAAGATATTTACTGGACAAGCTGGGTAGAAGCAACTAG

Protein sequence

MRVRFSSSFPLKPLHYIADPAAGPNSTLSHFSCDSTFPVSRLKSLLQPQVFDSLSIREFNSPPFLSQLTEASTCLFFCIRRLWVRRLLSGVCGCAKNSVKKFSLPYDKYNLKVLFFKFMESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEKFVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN
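The protein sequence above is the conceptual translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code; the `translate` helper is hypothetical and is not the tool the database itself uses. It is applied here only to the first eight and last four codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCGCGTACGATTTTCATCATCT"))  # first eight codons of the CDS
print(translate("AGAAGCAACTAG"))              # last four codons, ending in the stop codon
```

The two fragments translate to MRVRFSSS and RSN, which match the start and end of the protein sequence listed above.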
Homology
BLAST of Cla97C07G131680 vs. NCBI nr
Match: XP_038890085.1 (protein INVOLVED IN DE NOVO 2-like [Benincasa hispida] >XP_038890086.1 protein INVOLVED IN DE NOVO 2-like [Benincasa hispida])

HSP 1 Score: 1240.3 bits (3208), Expect = 0.0e+00
Identity = 629/653 (96.32%), Positives = 642/653 (98.32%), Query Frame = 0

Query: 111 LKVLFFKFMESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKK 170
           +KVLF KFMESSTDDSD+D+DMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKK
Sbjct: 7   VKVLFIKFMESSTDDSDIDSDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKK 66

Query: 171 RKRDFLYKDLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPV 230
           RKRDFLYKDLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSK  SNNDPV
Sbjct: 67  RKRDFLYKDLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKLTSNNDPV 126

Query: 231 MDCNHDEKFVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRG 290
           MDCNHDEKFVWPWRGIVVNIPT+RTDDGR+VGGSGSKFRDELKERGFNPTRVTPLWNYRG
Sbjct: 127 MDCNHDEKFVWPWRGIVVNIPTKRTDDGRFVGGSGSKFRDELKERGFNPTRVTPLWNYRG 186

Query: 291 HSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNS 350
           HSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTT EKLGIYAWVARADDYNS
Sbjct: 187 HSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTAEKLGIYAWVARADDYNS 246

Query: 351 NNIVGEHLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTL 410
           NNI+GEHLRKIGDLKTISE+IQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTL
Sbjct: 247 NNIIGEHLRKIGDLKTISEVIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTL 306

Query: 411 NNLMGEREKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKR 470
           NNLMGEREKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELE R
Sbjct: 307 NNLMGEREKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEMR 366

Query: 471 EAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQL 530
           EAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLH RII+L
Sbjct: 367 EAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHERIIRL 426

Query: 531 EKQLDAKQALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELN 590
           EKQLDAKQALELEIERLRGTLNVMKHMEDDEDVEVLQKAE+ LK+LSEKEGELE LD+LN
Sbjct: 427 EKQLDAKQALELEIERLRGTLNVMKHMEDDEDVEVLQKAEAILKDLSEKEGELEELDDLN 486

Query: 591 QALIVKQRMSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEA 650
           QALIVKQR SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEA
Sbjct: 487 QALIVKQRKSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEA 546

Query: 651 DERASELCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGE 710
           DERASELCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGE
Sbjct: 547 DERASELCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGE 606

Query: 711 EVCKAVTSALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
           EVCKAVTSALMEINEYNPSGRYITSELWNYQEERKATLREGVR+LLDKL RSN
Sbjct: 607 EVCKAVTSALMEINEYNPSGRYITSELWNYQEERKATLREGVRFLLDKLNRSN 659
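The Identity and Positives percentages in each HSP header are the reported match counts divided by the alignment length. The sketch below reproduces the figures for the Benincasa hispida hit above; the `percent` helper is a hypothetical name used only for illustration.

```python
def percent(matches, aligned):
    """Return matches / aligned as a percentage string with two decimals."""
    return f"{100.0 * matches / aligned:.2f}"

# Counts taken from the HSP header of the first hit (XP_038890085.1).
identity = percent(629, 653)
positives = percent(642, 653)
print(f"Identity = {identity}%, Positives = {positives}%")
```

Identity counts exact residue matches, while Positives also counts substitutions that score above zero in the scoring matrix (shown as `+` in the middle line of the alignment), so it is always at least as large as Identity.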

BLAST of Cla97C07G131680 vs. NCBI nr
Match: XP_008461675.1 (PREDICTED: protein INVOLVED IN DE NOVO 2 [Cucumis melo] >XP_008461676.1 PREDICTED: protein INVOLVED IN DE NOVO 2 [Cucumis melo] >KAA0050320.1 protein INVOLVED IN DE NOVO 2 [Cucumis melo var. makuwa] >TYK03538.1 protein INVOLVED IN DE NOVO 2 [Cucumis melo var. makuwa])

HSP 1 Score: 1189.9 bits (3077), Expect = 0.0e+00
Identity = 612/647 (94.59%), Positives = 625/647 (96.60%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           MESSTDDSDVDTD+SESE+ ERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK
Sbjct: 1   MESSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKP--ASNNDPVMDCNHD 238
           DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLAD VGPSKP  ASN DPVMDCNHD
Sbjct: 61  DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADTVGPSKPATASNKDPVMDCNHD 120

Query: 239 EKFVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI 298
           EKFVWPWRGIVVNIPTRRTDDGR+VGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI
Sbjct: 121 EKFVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI 180

Query: 299 VEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGE 358
           VEFNKDWPGLHNAISFERAYEADRHGKKDWLANG TTEKLGIYAWVARADDYN+NNIVGE
Sbjct: 181 VEFNKDWPGLHNAISFERAYEADRHGKKDWLANG-TTEKLGIYAWVARADDYNTNNIVGE 240

Query: 359 HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGE 418
           HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRC+ETATTLNNLMGE
Sbjct: 241 HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCTETATTLNNLMGE 300

Query: 419 REKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN 478
           REKLL AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN
Sbjct: 301 REKLLHAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN 360

Query: 479 ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDA 538
           ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDA
Sbjct: 361 ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDA 420

Query: 539 KQALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVK 598
           KQALELEIERLRGTLNVMKHMED EDV   QKAES LKELSEKE +LE LD+LNQALIVK
Sbjct: 421 KQALELEIERLRGTLNVMKHMEDVEDV---QKAESILKELSEKERDLEELDDLNQALIVK 480

Query: 599 QRMSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASE 658
           QR SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEADERASE
Sbjct: 481 QRKSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASE 540

Query: 659 LCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAV 718
           LCSLWAEYLKDPDWHPF+VIKVE KD  DGK+KEIE+LDDEDEKLKGLKKDYGEEVCKAV
Sbjct: 541 LCSLWAEYLKDPDWHPFRVIKVEAKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAV 600

Query: 719 TSALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
            SALMEINEYNPSGRYITSELWNYQE RKATLREGVR+LLDKL RSN
Sbjct: 601 ISALMEINEYNPSGRYITSELWNYQEGRKATLREGVRFLLDKLNRSN 643

BLAST of Cla97C07G131680 vs. NCBI nr
Match: XP_023536648.1 (protein INVOLVED IN DE NOVO 2-like [Cucurbita pepo subsp. pepo] >XP_023536649.1 protein INVOLVED IN DE NOVO 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1180.6 bits (3053), Expect = 0.0e+00
Identity = 601/645 (93.18%), Positives = 622/645 (96.43%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           M SSTDDSDVDTD+SESEL ERES+SY+ELKNG  IVKLSHETFTCPYCT+KRKRDFLYK
Sbjct: 1   MGSSTDDSDVDTDISESELEERESRSYEELKNGNHIVKLSHETFTCPYCTRKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 238
           DLLQHASGVG S SNKR+ KEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK
Sbjct: 61  DLLQHASGVGKSSSNKRNAKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 120

Query: 239 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 298
           FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRV PLWNYRGHSGCAIVE
Sbjct: 121 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVIPLWNYRGHSGCAIVE 180

Query: 299 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHL 358
           FNKDWPGLHNAISFERAYEAD HGKKDWLA G  TEKLG+YAWVARADDYN+NNI+GEHL
Sbjct: 181 FNKDWPGLHNAISFERAYEADHHGKKDWLAKG--TEKLGLYAWVARADDYNANNIIGEHL 240

Query: 359 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGERE 418
           RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHL EMEKRCSETATTLNNLMGERE
Sbjct: 241 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLKEMEKRCSETATTLNNLMGERE 300

Query: 419 KLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENES 478
            LLQAYNEEIKKIQLGARDHLKKIF+DHEKLKLQL+SQKKEFELRGRELEKREAQNENES
Sbjct: 301 TLLQAYNEEIKKIQLGARDHLKKIFNDHEKLKLQLDSQKKEFELRGRELEKREAQNENES 360

Query: 479 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQ 538
           KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDAKQ
Sbjct: 361 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDAKQ 420

Query: 539 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQR 598
           ALELEIERLRGTLNVMKHMEDDEDVEVLQKAES LK+LSEKEGELE LDELNQ LIVKQR
Sbjct: 421 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESILKDLSEKEGELEELDELNQTLIVKQR 480

Query: 599 MSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 658
            SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEADERASELC
Sbjct: 481 KSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 540

Query: 659 SLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTS 718
           SLWAEYLKDPDWHPFKVIKVEGKDTA+GKDKEIE+L+DEDEKL+GLKKDYGEEV KAV S
Sbjct: 541 SLWAEYLKDPDWHPFKVIKVEGKDTAEGKDKEIEILNDEDEKLEGLKKDYGEEVYKAVAS 600

Query: 719 ALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
           ALMEINEYNPSGRYI SELWNYQEERKATLREGV++LLDKL ++N
Sbjct: 601 ALMEINEYNPSGRYIISELWNYQEERKATLREGVKFLLDKLNKNN 643

BLAST of Cla97C07G131680 vs. NCBI nr
Match: XP_022977373.1 (protein INVOLVED IN DE NOVO 2-like [Cucurbita maxima] >XP_022977376.1 protein INVOLVED IN DE NOVO 2-like [Cucurbita maxima] >XP_022977377.1 protein INVOLVED IN DE NOVO 2-like [Cucurbita maxima])

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 601/645 (93.18%), Positives = 621/645 (96.28%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           M SSTDDSDVDTD+SESEL ERESKSY+ELKNG  IVKLSHETFTCPYCT+KRKRDFLYK
Sbjct: 1   MGSSTDDSDVDTDISESELEERESKSYEELKNGNHIVKLSHETFTCPYCTRKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 238
           DLLQHASGVG S SNKR+ KEKANHLALLKYLEKDLADAVGPSKPA NNDPVMDCNHDEK
Sbjct: 61  DLLQHASGVGKSSSNKRNAKEKANHLALLKYLEKDLADAVGPSKPAGNNDPVMDCNHDEK 120

Query: 239 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 298
           FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRV PLWNYRGHSGCAIVE
Sbjct: 121 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVIPLWNYRGHSGCAIVE 180

Query: 299 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHL 358
           FNKDWPGLHNAISFERAYEAD HGKKDWLA G  TEKLG+YAWVARADDYN+NNI+GEHL
Sbjct: 181 FNKDWPGLHNAISFERAYEADHHGKKDWLAKG--TEKLGLYAWVARADDYNANNIIGEHL 240

Query: 359 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGERE 418
           RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHL EMEKRCSETATTLNNLMGERE
Sbjct: 241 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLKEMEKRCSETATTLNNLMGERE 300

Query: 419 KLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENES 478
            LLQAYNEEIKKIQLGARDHLKKIF+DHEKLKLQL+SQKKEFELRGRELEKREAQNENES
Sbjct: 301 TLLQAYNEEIKKIQLGARDHLKKIFNDHEKLKLQLDSQKKEFELRGRELEKREAQNENES 360

Query: 479 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQ 538
           KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDAKQ
Sbjct: 361 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDAKQ 420

Query: 539 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQR 598
           ALELEIERLRGTLNVMKHMEDDEDVEVLQKAES LK+LSEKEGELE LDELNQ LIVKQR
Sbjct: 421 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESILKDLSEKEGELEELDELNQTLIVKQR 480

Query: 599 MSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 658
            SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEADERASELC
Sbjct: 481 KSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 540

Query: 659 SLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTS 718
           SLWAEYLKDPDWHPFKVIKVEGKDTA+GKDKEIE+L+DEDEKL+GLKKDYGEEV KAV S
Sbjct: 541 SLWAEYLKDPDWHPFKVIKVEGKDTAEGKDKEIEILNDEDEKLEGLKKDYGEEVYKAVAS 600

Query: 719 ALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
           ALMEINEYNPSGRYI SELWNYQEERKATLREGV++LLDKL ++N
Sbjct: 601 ALMEINEYNPSGRYIISELWNYQEERKATLREGVKFLLDKLNKNN 643

BLAST of Cla97C07G131680 vs. NCBI nr
Match: XP_022956639.1 (protein INVOLVED IN DE NOVO 2-like [Cucurbita moschata] >XP_022956640.1 protein INVOLVED IN DE NOVO 2-like [Cucurbita moschata] >XP_022956641.1 protein INVOLVED IN DE NOVO 2-like [Cucurbita moschata] >KAG7031607.1 Protein INVOLVED IN DE NOVO 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1172.5 bits (3032), Expect = 0.0e+00
Identity = 598/645 (92.71%), Positives = 620/645 (96.12%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           M SSTDDSDVDTD+SESEL ERES+SY+ELKNG  IVKLSHETFTCPYCT+KRKRDFLYK
Sbjct: 1   MGSSTDDSDVDTDISESELEERESRSYEELKNGNHIVKLSHETFTCPYCTRKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 238
           DLLQHASGVG S SNKR+ KEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK
Sbjct: 61  DLLQHASGVGKSSSNKRNAKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 120

Query: 239 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 298
           FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRV PLWNYRGHSG AIVE
Sbjct: 121 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVIPLWNYRGHSGYAIVE 180

Query: 299 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHL 358
           FNKDWPGLHNAISFERAYEAD HGKKDWLA G  TEKLG+YAWVARADDYN+NNI+GEHL
Sbjct: 181 FNKDWPGLHNAISFERAYEADHHGKKDWLAKG--TEKLGLYAWVARADDYNANNIIGEHL 240

Query: 359 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGERE 418
           RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHL EMEKRCSETATTLNNLMGERE
Sbjct: 241 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLKEMEKRCSETATTLNNLMGERE 300

Query: 419 KLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENES 478
            LLQAYNEEIKKIQLGARDHLKKIF+DHEKLKLQL+SQKKEFE RGRELEKREAQNENES
Sbjct: 301 TLLQAYNEEIKKIQLGARDHLKKIFNDHEKLKLQLDSQKKEFESRGRELEKREAQNENES 360

Query: 479 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQ 538
           KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDAKQ
Sbjct: 361 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDAKQ 420

Query: 539 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQR 598
           ALELEIERLRGTLNVMKHMEDDEDVEVLQKAES LK+LSEKEGELE LDELNQ LIVKQR
Sbjct: 421 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESILKDLSEKEGELEELDELNQTLIVKQR 480

Query: 599 MSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 658
            SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNE+EADERASELC
Sbjct: 481 KSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEEEADERASELC 540

Query: 659 SLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTS 718
           SLWAEYLKDPDWHPFKVIKVEGKDTA+GKDKEIE+L+DEDEKL+GLKKDYGEEV KAV S
Sbjct: 541 SLWAEYLKDPDWHPFKVIKVEGKDTAEGKDKEIEILNDEDEKLEGLKKDYGEEVYKAVAS 600

Query: 719 ALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
           ALMEINEYNPSGRYI SELWNYQEERKATLREGV++LLDKL ++N
Sbjct: 601 ALMEINEYNPSGRYIISELWNYQEERKATLREGVKFLLDKLNKNN 643

BLAST of Cla97C07G131680 vs. ExPASy Swiss-Prot
Match: Q8VZ79 (Protein INVOLVED IN DE NOVO 2 OS=Arabidopsis thaliana OX=3702 GN=IDN2 PE=1 SV=1)

HSP 1 Score: 694.1 bits (1790), Expect = 1.7e-198
Identity = 360/636 (56.60%), Positives = 465/636 (73.11%), Query Frame = 0

Query: 127 DVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASG 186
           D D+D+SESE+ E   K Y  LK GK  V+LS + F CPYC  K+K  F YKDLLQHASG
Sbjct: 11  DEDSDISESEMDEYGDKMYLNLKGGKLKVRLSPQAFICPYCPNKKKTSFQYKDLLQHASG 70

Query: 187 VGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPAS----NNDPVMDCNHDEKFVWP 246
           VGNS S+KRS KEKA+HLAL+KYL++DLAD+   ++P+S    N +P+ DC+HDEK V+P
Sbjct: 71  VGNSNSDKRSAKEKASHLALVKYLQQDLADSASEAEPSSKRQKNGNPIQDCDHDEKLVYP 130

Query: 247 WRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKD 306
           W+GIVVNIPT +  DGR  G SGSK RDE   RGFNPTRV PLWNY GHSG AIVEFNKD
Sbjct: 131 WKGIVVNIPTTKAQDGRSAGESGSKLRDEYILRGFNPTRVRPLWNYLGHSGTAIVEFNKD 190

Query: 307 WPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRKIG 366
           W GLHN + F++AY  D HGKKDWL       KLG+Y W+ARADDYN NNI+GE+LRK G
Sbjct: 191 WNGLHNGLLFDKAYTVDGHGKKDWLKK--DGPKLGLYGWIARADDYNGNNIIGENLRKTG 250

Query: 367 DLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKLLQ 426
           DLKTI+E+ +EEARKQ+ LV NL  ++E K K + E+E+ CS  +  LN LM E+EK  Q
Sbjct: 251 DLKTIAELTEEEARKQELLVQNLRQLVEEKKKDMKEIEELCSVKSEELNQLMEEKEKNQQ 310

Query: 427 AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKYLA 486
            +  E+  IQ     H++KI  DHEKLK  LES++K+ E++  EL KRE  N  E   L+
Sbjct: 311 KHYRELNAIQERTMSHIQKIVDDHEKLKRLLESERKKLEIKCNELAKREVHNGTERMKLS 370

Query: 487 EEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQALEL 546
           E++E+   +NSSL+LA +EQQKADE+  KLA+DQ++QKE+LH +II+LE+Q D KQA+EL
Sbjct: 371 EDLEQNASKNSSLELAAMEQQKADEEVKKLAEDQRRQKEELHEKIIRLERQRDQKQAIEL 430

Query: 547 EIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMSND 606
           E+E+L+G LNVMKHM  D D EV+++ +   K+L EKE +L  LD+ NQ LI+++R +ND
Sbjct: 431 EVEQLKGQLNVMKHMASDGDAEVVKEVDIIFKDLGEKEAQLADLDKFNQTLILRERRTND 490

Query: 607 ELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSLWA 666
           ELQEA KE+ N  K+    + + VKRMGEL TKPF +AM++ Y + + ++RA E+  LW 
Sbjct: 491 ELQEAHKELVNIMKE--WNTNIGVKRMGELVTKPFVDAMQQKYCQQDVEDRAVEVLQLWE 550

Query: 667 EYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSALME 726
            YLKD DWHPFK +K+E       +D+E+EV+DD DEKL+ LK D G+    AVT AL+E
Sbjct: 551 HYLKDSDWHPFKRVKLE------NEDREVEVIDDRDEKLRELKADLGDGPYNAVTKALLE 610

Query: 727 INEYNPSGRYITSELWNYQEERKATLREGVRYLLDK 759
           INEYNPSGRYIT+ELWN++ ++KATL EGV  LLD+
Sbjct: 611 INEYNPSGRYITTELWNFKADKKATLEEGVTCLLDQ 636

BLAST of Cla97C07G131680 vs. ExPASy Swiss-Prot
Match: Q9LHB1 (Factor of DNA methylation 3 OS=Arabidopsis thaliana OX=3702 GN=FDM3 PE=4 SV=1)

HSP 1 Score: 583.6 bits (1503), Expect = 3.3e-165
Identity = 322/638 (50.47%), Positives = 436/638 (68.34%), Query Frame = 0

Query: 135 SELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASGVGNSPSNK 194
           ++L + E   Y +LK+GK  VK+S+ TF CPYC   +K+  LY D+LQHASGVGNS S K
Sbjct: 3   NKLSDFEKNLYKKLKSGKLEVKVSYRTFLCPYCPDNKKKVGLYVDILQHASGVGNSQSKK 62

Query: 195 RSTKEKANHLALLKYLEKDLA-----------DAVGPSKPASNNDP--VMDCNHDEKFVW 254
           RS  EKA+H AL KYL KDLA            A     PA   D   + D    EK VW
Sbjct: 63  RSLTEKASHRALAKYLIKDLAHYATSTISKRLKARTSFIPAETGDAPIIYDDAQFEKLVW 122

Query: 255 PWRGIVVNIPTRRTDDGR-YVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 314
           PW+G++VNIPT  T+DGR   G SG K +DEL  RGFNP RV  +W+  GHSG  IVEFN
Sbjct: 123 PWKGVLVNIPTTSTEDGRSCTGESGPKLKDELIRRGFNPIRVRTVWDRFGHSGTGIVEFN 182

Query: 315 KDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRK 374
           +DW GL +A+ F++AYE D HGKKDWL   T +    +YAW+A ADDY   NI+GE+LRK
Sbjct: 183 RDWNGLQDALVFKKAYEGDGHGKKDWLCGATDS---SLYAWLANADDYYRANILGENLRK 242

Query: 375 IGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKL 434
           +GDLK+I    +EEARK  +L+  L  ++E K   L +++ + S+ +  L     E+EK+
Sbjct: 243 MGDLKSIYRFAEEEARKDQKLLQRLNFMVENKQYRLKKLQIKYSQDSVKLKYETEEKEKI 302

Query: 435 LQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKY 494
           L+AY+E++   Q  + DH  +IF+DHEK K+QLESQ KE E+R  EL KREA+NE + K 
Sbjct: 303 LRAYSEDLTGRQQKSTDHFNRIFADHEKQKVQLESQIKELEIRKLELAKREAENETQRKI 362

Query: 495 LAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQAL 554
           +A+E+E+    NS +QL+ LEQQK  E   +LA D K QKE LH RI  LE+QLD KQ L
Sbjct: 363 VAKELEQNAAINSYVQLSALEQQKTREKAQRLAVDHKMQKEKLHKRIAALERQLDQKQEL 422

Query: 555 ELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMS 614
           ELE+++L+  L+VM+ +E D   E++ K E+ L++LSE EGEL  L++ NQ L+V++R S
Sbjct: 423 ELEVQQLKSQLSVMRLVELDSGSEIVNKVETFLRDLSETEGELAHLNQFNQDLVVQERKS 482

Query: 615 NDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSL 674
           NDELQEAR+ + +  +D+    ++ VKRMGELDTKPF +AM+  Y +++ ++ A E+  L
Sbjct: 483 NDELQEARRALISNLRDM--GLHIGVKRMGELDTKPFMKAMRIKYCQEDLEDWAVEVIQL 542

Query: 675 WAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSAL 734
           W EYLKDPDWHPFK IK+E  +T       +EV+D++DEKL+ LK + G++  +AV +AL
Sbjct: 543 WEEYLKDPDWHPFKRIKLETAETI------VEVIDEDDEKLRTLKNELGDDAYQAVANAL 602

Query: 735 MEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDK 759
           +EINEYNPSGRYI+SELWN++E+RKATL EGV  LL++
Sbjct: 603 LEINEYNPSGRYISSELWNFREDRKATLEEGVNSLLEQ 629

BLAST of Cla97C07G131680 vs. ExPASy Swiss-Prot
Match: Q9LMH6 (Factor of DNA methylation 4 OS=Arabidopsis thaliana OX=3702 GN=FDM4 PE=4 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 7.8e-122
Identity = 281/740 (37.97%), Positives = 407/740 (55.00%), Query Frame = 0

Query: 133 SESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASGVGNSPS 192
           S  EL + E + Y E+K+G R VK+S   F CP+C   RKRD+ + DLL+HASG+G S S
Sbjct: 3   SRRELEDLEYRYYSEMKDGTRKVKISESLFRCPFCYIDRKRDYQFDDLLRHASGIGGS-S 62

Query: 193 NKRSTKEKANHLALLKYLEKDLADAVGP-------------------------------- 252
             +  ++KA HLAL +Y+ K L     P                                
Sbjct: 63  RTKDGRDKARHLALERYMRKYLRPRERPRPSPTSDVSSLPKEEFTGKWKSTLSTTEEGEF 122

Query: 253 -----------------------------------------------SKPA--------- 312
                                                          S PA         
Sbjct: 123 ITTENSSSPHIVKAEPKFVSGDDSGRSGEERLKFSDKPDPFFSNEDKSYPAKRPCLVSGA 182

Query: 313 -SNNDPVMDC----------------------NHDEKFVWPWRGIVVNIP-TRRTDDGRY 372
              ++PV                         N D+ +V PW+GI+ N+  T      +Y
Sbjct: 183 KEGDEPVQRIGLSHGASFAPTYPQKLVSLGAGNGDQMYVHPWKGILANMKRTFNEKTRKY 242

Query: 373 VGGSGSKFRDELKERGFNPTRVTPLWNYR-GHSGCAIVEFNKDWPGLHNAISFERAYEAD 432
            G SGSK R++L ++GFNP +VTPLWN R G +G AIV+F K+W G  NA  F++ +E  
Sbjct: 243 AGESGSKIREDLIKKGFNPHKVTPLWNGRLGFTGFAIVDFGKEWEGFRNATMFDKHFEVS 302

Query: 433 RHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRKIGDLKTISEIIQEEARKQD 492
           + GK+D        +KL  Y WVA+ DDY S   +G+HLRK GDLK++S    E+ RK  
Sbjct: 303 QCGKRDHDLTRDPGDKL--YGWVAKQDDYYSRTAIGDHLRKQGDLKSVSGKEAEDQRKTF 362

Query: 493 RLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKLLQAYNEEIKKIQLGARDHL 552
            LVSNL + +  K+ +L +ME    +T++ L   M E+++++  +NE++  +Q  ARD+L
Sbjct: 363 TLVSNLENTLVTKSDNLQQMESIYKQTSSVLEKRMKEKDEMINTHNEKMSIMQQTARDYL 422

Query: 553 KKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKYLAEEIEKYEVRNSSLQLAE 612
             I+ +HEK    LE+Q+KE+E R   L+K +A+N+ E +       K + +     +A 
Sbjct: 423 ASIYEEHEKASQHLEAQRKEYEDRENYLDKCQAKNKTERR-------KLQWQKHKNLMAT 482

Query: 613 LEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQALELEIERLRGTLNVMKHME- 672
            EQ KADED M+LA+ Q+++K++L  ++ +LE+++DA+QALELEIER+RG L VM HM+ 
Sbjct: 483 QEQNKADEDMMRLAEQQQREKDELRKQVRELEEKIDAEQALELEIERMRGDLQVMGHMQE 542

Query: 673 -DDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMSNDELQEARKEITNAFKD 732
            + ED ++ +  E T +EL EKE + E  + L Q L+VK   +NDELQ+ARK +  + ++
Sbjct: 543 GEGEDSKIKEMIEKTKEELKEKEEDWEYQESLYQTLVVKHGYTNDELQDARKALIRSMRE 602

Query: 733 LPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSLWAEYLKDPDWHPFKVIK 758
           L  R+Y+ VKRMG LD  PF +  K+ Y   EAD++A ELCSLW E+L D  WHP KV++
Sbjct: 603 LTTRAYIGVKRMGALDETPFKKVAKEKYPAVEADKKAEELCSLWEEHLGDSAWHPIKVVE 662

BLAST of Cla97C07G131680 vs. ExPASy Swiss-Prot
Match: Q9SAI1 (Factor of DNA methylation 5 OS=Arabidopsis thaliana OX=3702 GN=FDM5 PE=2 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 1.2e-119
Identity = 260/642 (40.50%), Positives = 401/642 (62.46%), Query Frame = 0

Query: 124 DDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQH 183
           + SD ++++SESE+     K Y++L NG   VK+  +TF CP+C  K+K+ + YK+LL H
Sbjct: 3   NSSDEESEISESEIDVYYEKPYEKLMNGDYKVKVK-DTFRCPFCAGKKKQHYKYKELLAH 62

Query: 184 ASGVGNSPSNKRSTKEKANHLALLKYLEKDL---ADAVGPSKPASNNDPVMDCNHDEKFV 243
           ASGV    S  RS K+KANH AL KY+E +L   AD   P  P+S+ +       D+ +V
Sbjct: 63  ASGVAKG-SASRSAKQKANHFALAKYMENELAGDADVPRPQIPSSSTEQ-SQAVVDDIYV 122

Query: 244 WPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 303
           WPW GIV+N P RRTD+   +  S    +   K   FNP  V  LW  +      I +FN
Sbjct: 123 WPWMGIVIN-PVRRTDNKNVLLDSAYWLK---KLARFNPLEVKTLWLDQESVVAVIPQFN 182

Query: 304 KDWPGLHNAISFERAYEADRHGKKDWL-ANGTTTEKLGIYAWVARADDYNSNNIVGEHLR 363
             W G  +    E+ YE    G+KDW+   G    K   Y W ARADDYNS   + E+L 
Sbjct: 183 SGWSGFKSVTELEKEYEIRGCGRKDWIDKRGDWRSK--AYGWCARADDYNSQGSIAEYLS 242

Query: 364 KIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREK 423
           K+G L++ S+I +EE + +  +V +L + I + N+ L +++   +E   +L  ++ E+++
Sbjct: 243 KVGKLRSFSDITKEEIQNKSIVVDDLANKIAMTNEDLNKLQYMNNEKTLSLRRVLIEKDE 302

Query: 424 LLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESK 483
           L + Y +E KK+Q  +R+ + +IF + E+L  +LE++    ++  ++L+K++A  E E +
Sbjct: 303 LDRVYKQETKKMQELSREKINRIFREKERLTNELEAKMNNLKIWSKQLDKKQALTELERQ 362

Query: 484 YLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQA 543
            L E+ +K +V NSSLQLA LEQ+K D+  ++L D+ K++KE+  N+I+QLEK+LD+KQ 
Sbjct: 363 KLDEDKKKSDVMNSSLQLASLEQKKTDDRVLRLVDEHKRKKEETLNKILQLEKELDSKQK 422

Query: 544 LELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRM 603
           L++EI+ L+G L VMKH ED++D  + +K +   +EL EK  EL+ L++ N AL+VK+R 
Sbjct: 423 LQMEIQELKGKLKVMKH-EDEDDEGIKKKMKKMKEELEEKCSELQDLEDTNSALMVKERK 482

Query: 604 SNDELQEARKEITNAFKDL-PGRSYLRVKRMGELDTKPFHEAMK-KIYNEDEADERASEL 663
           SNDE+ EARK +    ++L   R+ +RVKRMGEL+ KPF  A + +   E+EA  + + L
Sbjct: 483 SNDEIVEARKFLITELRELVSDRNIIRVKRMGELEEKPFMTACRQRCTVEEEAQVQYAML 542

Query: 664 CSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVT 723
           CS W E +KD  W PFK +           D++ EV+D+EDE++K L++++GEEV  AV 
Sbjct: 543 CSKWQEKVKDSAWQPFKHVGT--------GDRKKEVVDEEDEEIKKLREEWGEEVKNAVK 602

Query: 724 SALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKL 760
           +AL E+NE+NPSGRY   ELWN ++ RKATL+E + Y+  ++
Sbjct: 603 TALEELNEFNPSGRYSVPELWNSKQGRKATLKEVIDYITQQV 626

BLAST of Cla97C07G131680 vs. ExPASy Swiss-Prot
Match: F4JH53 (Factor of DNA methylation 2 OS=Arabidopsis thaliana OX=3702 GN=FDM2 PE=1 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 4.7e-119
Identity = 260/636 (40.88%), Positives = 382/636 (60.06%), Query Frame = 0

Query: 124 DDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQH 183
           D SD ++++SESE+ E     Y  L++        +    CP+C  K+K+D+ YK+L  H
Sbjct: 2   DISDEESEISESEIEEYSKTPYHLLRSETYYKVKVNGRLRCPFCVGKKKQDYKYKELHAH 61

Query: 184 ASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMD---CNHDEKFV 243
           A+GV    S  RS  +K+NHLAL K+LE DLA    P        P++D    N    +V
Sbjct: 62  ATGVSKG-SATRSALQKSNHLALAKFLENDLAGYAEPLPRPPVVPPLLDETEPNPHNVYV 121

Query: 244 WPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 303
           WPW GIVVN P + TDD   +  S    +   K   F P  V   W  +      I +F+
Sbjct: 122 WPWMGIVVN-PLKETDDKELLLDSVYWLQTLSK---FKPVEVNAFWVEQDSIVGVIAKFD 181

Query: 304 KDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRK 363
            DW G   A   E+ +E     KK+W      +E    Y W ARADD+ S   +GE+L K
Sbjct: 182 SDWSGFAAATELEKEFETQGSCKKEWTERSGDSESKA-YGWCARADDFQSQGPIGEYLSK 241

Query: 364 IGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKL 423
            G L+T+S+I+Q   + ++ L+  L+++I++ N+ L + +   + TA +L  ++ E++ L
Sbjct: 242 EGTLRTVSDILQNNVQDRNTLLDVLSNMIDMTNEDLNKAQHSYNRTAMSLQRVLDEKKNL 301

Query: 424 LQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKY 483
            QA+ EE KK+Q  +  H+++I  D EKL+ +L+ + ++ E R ++LEK EA  E E + 
Sbjct: 302 HQAFAEETKKMQQMSLRHIQRILYDKEKLRNELDRKMRDLESRAKQLEKHEALTELERQK 361

Query: 484 LAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQAL 543
           L E+  K +  N SLQLA  EQ+KADE  ++L ++ ++QKED  N+I+ LEKQLD KQ L
Sbjct: 362 LDEDKRKSDAMNKSLQLASREQKKADESVLRLVEEHQRQKEDALNKILLLEKQLDTKQTL 421

Query: 544 ELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMS 603
           E+EI+ L+G L VMKH+ DD+D  V  K +    EL +K+ ELE L+ +N  L+ K+R S
Sbjct: 422 EMEIQELKGKLQVMKHLGDDDDEAVQTKMKEMNDELDDKKAELEDLESMNSVLMTKERQS 481

Query: 604 NDELQEARKEITNAFKDLPG-RSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCS 663
           NDE+Q AR+++      L G  S + VKRMGELD KPF +  K  Y+ +EA   A+ LCS
Sbjct: 482 NDEIQAARQKMIAGLTGLLGAESDIGVKRMGELDEKPFLDVCKLRYSANEARVEAATLCS 541

Query: 664 LWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSA 723
            W E LK+P W PFK      + T DG +   EV+D++DE+LK LK+++G+EV  AV +A
Sbjct: 542 TWKENLKNPSWQPFK-----REGTGDGAE---EVVDEDDEQLKKLKREWGKEVHNAVKAA 601

Query: 724 LMEINEYNPSGRYITSELWNYQEERKATLREGVRYL 756
           L+E+NEYN SGRY TSELWN++E RKATL+E + ++
Sbjct: 602 LVEMNEYNASGRYPTSELWNFKEGRKATLKEVITFI 623

BLAST of Cla97C07G131680 vs. ExPASy TrEMBL
Match: A0A1S3CF47 (protein INVOLVED IN DE NOVO 2 OS=Cucumis melo OX=3656 GN=LOC103500220 PE=4 SV=1)

HSP 1 Score: 1189.9 bits (3077), Expect = 0.0e+00
Identity = 612/647 (94.59%), Positives = 625/647 (96.60%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           MESSTDDSDVDTD+SESE+ ERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK
Sbjct: 1   MESSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKP--ASNNDPVMDCNHD 238
           DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLAD VGPSKP  ASN DPVMDCNHD
Sbjct: 61  DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADTVGPSKPATASNKDPVMDCNHD 120

Query: 239 EKFVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI 298
           EKFVWPWRGIVVNIPTRRTDDGR+VGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI
Sbjct: 121 EKFVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI 180

Query: 299 VEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGE 358
           VEFNKDWPGLHNAISFERAYEADRHGKKDWLANG TTEKLGIYAWVARADDYN+NNIVGE
Sbjct: 181 VEFNKDWPGLHNAISFERAYEADRHGKKDWLANG-TTEKLGIYAWVARADDYNTNNIVGE 240

Query: 359 HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGE 418
           HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRC+ETATTLNNLMGE
Sbjct: 241 HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCTETATTLNNLMGE 300

Query: 419 REKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN 478
           REKLL AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN
Sbjct: 301 REKLLHAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN 360

Query: 479 ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDA 538
           ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDA
Sbjct: 361 ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDA 420

Query: 539 KQALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVK 598
           KQALELEIERLRGTLNVMKHMED EDV   QKAES LKELSEKE +LE LD+LNQALIVK
Sbjct: 421 KQALELEIERLRGTLNVMKHMEDVEDV---QKAESILKELSEKERDLEELDDLNQALIVK 480

Query: 599 QRMSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASE 658
           QR SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEADERASE
Sbjct: 481 QRKSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASE 540

Query: 659 LCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAV 718
           LCSLWAEYLKDPDWHPF+VIKVE KD  DGK+KEIE+LDDEDEKLKGLKKDYGEEVCKAV
Sbjct: 541 LCSLWAEYLKDPDWHPFRVIKVEAKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAV 600

Query: 719 TSALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
            SALMEINEYNPSGRYITSELWNYQE RKATLREGVR+LLDKL RSN
Sbjct: 601 ISALMEINEYNPSGRYITSELWNYQEGRKATLREGVRFLLDKLNRSN 643

BLAST of Cla97C07G131680 vs. ExPASy TrEMBL
Match: A0A5A7U517 (Protein INVOLVED IN DE NOVO 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold293G00300 PE=4 SV=1)

HSP 1 Score: 1189.9 bits (3077), Expect = 0.0e+00
Identity = 612/647 (94.59%), Positives = 625/647 (96.60%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           MESSTDDSDVDTD+SESE+ ERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK
Sbjct: 1   MESSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKP--ASNNDPVMDCNHD 238
           DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLAD VGPSKP  ASN DPVMDCNHD
Sbjct: 61  DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADTVGPSKPATASNKDPVMDCNHD 120

Query: 239 EKFVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI 298
           EKFVWPWRGIVVNIPTRRTDDGR+VGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI
Sbjct: 121 EKFVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAI 180

Query: 299 VEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGE 358
           VEFNKDWPGLHNAISFERAYEADRHGKKDWLANG TTEKLGIYAWVARADDYN+NNIVGE
Sbjct: 181 VEFNKDWPGLHNAISFERAYEADRHGKKDWLANG-TTEKLGIYAWVARADDYNTNNIVGE 240

Query: 359 HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGE 418
           HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRC+ETATTLNNLMGE
Sbjct: 241 HLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCTETATTLNNLMGE 300

Query: 419 REKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN 478
           REKLL AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN
Sbjct: 301 REKLLHAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNEN 360

Query: 479 ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDA 538
           ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDA
Sbjct: 361 ESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDA 420

Query: 539 KQALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVK 598
           KQALELEIERLRGTLNVMKHMED EDV   QKAES LKELSEKE +LE LD+LNQALIVK
Sbjct: 421 KQALELEIERLRGTLNVMKHMEDVEDV---QKAESILKELSEKERDLEELDDLNQALIVK 480

Query: 599 QRMSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASE 658
           QR SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEADERASE
Sbjct: 481 QRKSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASE 540

Query: 659 LCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAV 718
           LCSLWAEYLKDPDWHPF+VIKVE KD  DGK+KEIE+LDDEDEKLKGLKKDYGEEVCKAV
Sbjct: 541 LCSLWAEYLKDPDWHPFRVIKVEAKDAPDGKEKEIEILDDEDEKLKGLKKDYGEEVCKAV 600

Query: 719 TSALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
            SALMEINEYNPSGRYITSELWNYQE RKATLREGVR+LLDKL RSN
Sbjct: 601 ISALMEINEYNPSGRYITSELWNYQEGRKATLREGVRFLLDKLNRSN 643

BLAST of Cla97C07G131680 vs. ExPASy TrEMBL
Match: A0A0A0KNW6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G182100 PE=4 SV=1)

HSP 1 Score: 1181.8 bits (3056), Expect = 0.0e+00
Identity = 624/721 (86.55%), Positives = 654/721 (90.71%), Query Frame = 0

Query: 51  FDSLSIREFNSPPFLSQLTEASTCLFFCIRR-----LWVRRLLSGVCG-CAKNSVKKFSL 110
           F SL+ R      F S L   S CL +C  R      +V  LLS     C  +S   F L
Sbjct: 71  FLSLNQRAHMRVSFSSSL---SLCLLYCGPRPRSHFAFVAFLLSDNASLCCFSSSHAFGL 130

Query: 111 PYDKYNLKVLFFKFMESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTC 170
                  +VLF KFMESSTDDSDVDTD+SESE+ ERESKSYDELKNGKRIVKLSHETFTC
Sbjct: 131 ---GDCCQVLFIKFMESSTDDSDVDTDVSESEMDERESKSYDELKNGKRIVKLSHETFTC 190

Query: 171 PYCTKKRKRDFLYKDLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKP- 230
           PYCTKKRKRDFLYKDLLQHASGVG SPSNKRSTKEKANHLALLKYLEKDLADAVGPSKP 
Sbjct: 191 PYCTKKRKRDFLYKDLLQHASGVGKSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPA 250

Query: 231 -ASNNDPVMDCNHDEKFVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRV 290
            ASNNDPVMDCNHDEKFVWPWRGIVVNIPTRRTDDGR+VGGSGSKFRDELKERGFNP+RV
Sbjct: 251 TASNNDPVMDCNHDEKFVWPWRGIVVNIPTRRTDDGRFVGGSGSKFRDELKERGFNPSRV 310

Query: 291 TPLWNYRGHSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWV 350
           TPLWNYRGHSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANG TTEKLG+YAWV
Sbjct: 311 TPLWNYRGHSGCAIVEFNKDWPGLHNAISFERAYEADRHGKKDWLANG-TTEKLGVYAWV 370

Query: 351 ARADDYNSNNIVGEHLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKR 410
           ARADDYNSNNI+GEHLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHL EMEKR
Sbjct: 371 ARADDYNSNNIIGEHLRKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLTEMEKR 430

Query: 411 CSETATTLNNLMGEREKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFEL 470
           C+ET+ T+++LM E EKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKL+LESQKKEFEL
Sbjct: 431 CNETSATVDSLMREIEKLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLKLESQKKEFEL 490

Query: 471 RGRELEKREAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKED 530
           RGRELEKREAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKED
Sbjct: 491 RGRELEKREAQNENESKYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKED 550

Query: 531 LHNRIIQLEKQLDAKQALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGE 590
           LH+RII+LEKQLDAKQALELEIERLRGTLNVMKHMED EDV   QKAES LK+LSEKE +
Sbjct: 551 LHDRIIRLEKQLDAKQALELEIERLRGTLNVMKHMEDAEDV---QKAESILKDLSEKERD 610

Query: 591 LEALDELNQALIVKQRMSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMK 650
           LE LD+LNQALIVKQR SNDELQEARKEI NAFKDLPGRS+LR+KRMGELDTKPFHEAMK
Sbjct: 611 LEELDDLNQALIVKQRKSNDELQEARKEIINAFKDLPGRSHLRIKRMGELDTKPFHEAMK 670

Query: 651 KIYNEDEADERASELCSLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLK 710
           KIYNEDEADERASELCSLWAEYLKDPDWHPFKVIKVEGKD  DGK+KEIE+LDDEDEKLK
Sbjct: 671 KIYNEDEADERASELCSLWAEYLKDPDWHPFKVIKVEGKDAPDGKEKEIEILDDEDEKLK 730

Query: 711 GLKKDYGEEVCKAVTSALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRS 764
           GLKKDYGEEVCKAV SAL+EINEYNPSGRYITSELWNYQE ++ATLREGVR+LLDKL RS
Sbjct: 731 GLKKDYGEEVCKAVISALVEINEYNPSGRYITSELWNYQEGKRATLREGVRFLLDKLNRS 781

BLAST of Cla97C07G131680 vs. ExPASy TrEMBL
Match: A0A6J1II99 (protein INVOLVED IN DE NOVO 2-like OS=Cucurbita maxima OX=3661 GN=LOC111477722 PE=4 SV=1)

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 601/645 (93.18%), Positives = 621/645 (96.28%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           M SSTDDSDVDTD+SESEL ERESKSY+ELKNG  IVKLSHETFTCPYCT+KRKRDFLYK
Sbjct: 1   MGSSTDDSDVDTDISESELEERESKSYEELKNGNHIVKLSHETFTCPYCTRKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 238
           DLLQHASGVG S SNKR+ KEKANHLALLKYLEKDLADAVGPSKPA NNDPVMDCNHDEK
Sbjct: 61  DLLQHASGVGKSSSNKRNAKEKANHLALLKYLEKDLADAVGPSKPAGNNDPVMDCNHDEK 120

Query: 239 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 298
           FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRV PLWNYRGHSGCAIVE
Sbjct: 121 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVIPLWNYRGHSGCAIVE 180

Query: 299 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHL 358
           FNKDWPGLHNAISFERAYEAD HGKKDWLA G  TEKLG+YAWVARADDYN+NNI+GEHL
Sbjct: 181 FNKDWPGLHNAISFERAYEADHHGKKDWLAKG--TEKLGLYAWVARADDYNANNIIGEHL 240

Query: 359 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGERE 418
           RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHL EMEKRCSETATTLNNLMGERE
Sbjct: 241 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLKEMEKRCSETATTLNNLMGERE 300

Query: 419 KLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENES 478
            LLQAYNEEIKKIQLGARDHLKKIF+DHEKLKLQL+SQKKEFELRGRELEKREAQNENES
Sbjct: 301 TLLQAYNEEIKKIQLGARDHLKKIFNDHEKLKLQLDSQKKEFELRGRELEKREAQNENES 360

Query: 479 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQ 538
           KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDAKQ
Sbjct: 361 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDAKQ 420

Query: 539 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQR 598
           ALELEIERLRGTLNVMKHMEDDEDVEVLQKAES LK+LSEKEGELE LDELNQ LIVKQR
Sbjct: 421 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESILKDLSEKEGELEELDELNQTLIVKQR 480

Query: 599 MSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 658
            SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNEDEADERASELC
Sbjct: 481 KSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 540

Query: 659 SLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTS 718
           SLWAEYLKDPDWHPFKVIKVEGKDTA+GKDKEIE+L+DEDEKL+GLKKDYGEEV KAV S
Sbjct: 541 SLWAEYLKDPDWHPFKVIKVEGKDTAEGKDKEIEILNDEDEKLEGLKKDYGEEVYKAVAS 600

Query: 719 ALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
           ALMEINEYNPSGRYI SELWNYQEERKATLREGV++LLDKL ++N
Sbjct: 601 ALMEINEYNPSGRYIISELWNYQEERKATLREGVKFLLDKLNKNN 643

BLAST of Cla97C07G131680 vs. ExPASy TrEMBL
Match: A0A6J1GZM5 (protein INVOLVED IN DE NOVO 2-like OS=Cucurbita moschata OX=3662 GN=LOC111458309 PE=4 SV=1)

HSP 1 Score: 1172.5 bits (3032), Expect = 0.0e+00
Identity = 598/645 (92.71%), Positives = 620/645 (96.12%), Query Frame = 0

Query: 119 MESSTDDSDVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYK 178
           M SSTDDSDVDTD+SESEL ERES+SY+ELKNG  IVKLSHETFTCPYCT+KRKRDFLYK
Sbjct: 1   MGSSTDDSDVDTDISESELEERESRSYEELKNGNHIVKLSHETFTCPYCTRKRKRDFLYK 60

Query: 179 DLLQHASGVGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 238
           DLLQHASGVG S SNKR+ KEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK
Sbjct: 61  DLLQHASGVGKSSSNKRNAKEKANHLALLKYLEKDLADAVGPSKPASNNDPVMDCNHDEK 120

Query: 239 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVE 298
           FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRV PLWNYRGHSG AIVE
Sbjct: 121 FVWPWRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVIPLWNYRGHSGYAIVE 180

Query: 299 FNKDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHL 358
           FNKDWPGLHNAISFERAYEAD HGKKDWLA G  TEKLG+YAWVARADDYN+NNI+GEHL
Sbjct: 181 FNKDWPGLHNAISFERAYEADHHGKKDWLAKG--TEKLGLYAWVARADDYNANNIIGEHL 240

Query: 359 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGERE 418
           RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHL EMEKRCSETATTLNNLMGERE
Sbjct: 241 RKIGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLKEMEKRCSETATTLNNLMGERE 300

Query: 419 KLLQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENES 478
            LLQAYNEEIKKIQLGARDHLKKIF+DHEKLKLQL+SQKKEFE RGRELEKREAQNENES
Sbjct: 301 TLLQAYNEEIKKIQLGARDHLKKIFNDHEKLKLQLDSQKKEFESRGRELEKREAQNENES 360

Query: 479 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQ 538
           KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRII+LEKQLDAKQ
Sbjct: 361 KYLAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIRLEKQLDAKQ 420

Query: 539 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQR 598
           ALELEIERLRGTLNVMKHMEDDEDVEVLQKAES LK+LSEKEGELE LDELNQ LIVKQR
Sbjct: 421 ALELEIERLRGTLNVMKHMEDDEDVEVLQKAESILKDLSEKEGELEELDELNQTLIVKQR 480

Query: 599 MSNDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELC 658
            SNDELQEARKEI NAFKDLPGRS+LRVKRMGELDTKPFHEAMKKIYNE+EADERASELC
Sbjct: 481 KSNDELQEARKEIINAFKDLPGRSHLRVKRMGELDTKPFHEAMKKIYNEEEADERASELC 540

Query: 659 SLWAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTS 718
           SLWAEYLKDPDWHPFKVIKVEGKDTA+GKDKEIE+L+DEDEKL+GLKKDYGEEV KAV S
Sbjct: 541 SLWAEYLKDPDWHPFKVIKVEGKDTAEGKDKEIEILNDEDEKLEGLKKDYGEEVYKAVAS 600

Query: 719 ALMEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDKLGRSN 764
           ALMEINEYNPSGRYI SELWNYQEERKATLREGV++LLDKL ++N
Sbjct: 601 ALMEINEYNPSGRYIISELWNYQEERKATLREGVKFLLDKLNKNN 643

BLAST of Cla97C07G131680 vs. TAIR 10
Match: AT3G48670.1 (XH/XS domain-containing protein)

HSP 1 Score: 694.1 bits (1790), Expect = 1.2e-199
Identity = 360/636 (56.60%), Positives = 465/636 (73.11%), Query Frame = 0

Query: 127 DVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASG 186
           D D+D+SESE+ E   K Y  LK GK  V+LS + F CPYC  K+K  F YKDLLQHASG
Sbjct: 11  DEDSDISESEMDEYGDKMYLNLKGGKLKVRLSPQAFICPYCPNKKKTSFQYKDLLQHASG 70

Query: 187 VGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPAS----NNDPVMDCNHDEKFVWP 246
           VGNS S+KRS KEKA+HLAL+KYL++DLAD+   ++P+S    N +P+ DC+HDEK V+P
Sbjct: 71  VGNSNSDKRSAKEKASHLALVKYLQQDLADSASEAEPSSKRQKNGNPIQDCDHDEKLVYP 130

Query: 247 WRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKD 306
           W+GIVVNIPT +  DGR  G SGSK RDE   RGFNPTRV PLWNY GHSG AIVEFNKD
Sbjct: 131 WKGIVVNIPTTKAQDGRSAGESGSKLRDEYILRGFNPTRVRPLWNYLGHSGTAIVEFNKD 190

Query: 307 WPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRKIG 366
           W GLHN + F++AY  D HGKKDWL       KLG+Y W+ARADDYN NNI+GE+LRK G
Sbjct: 191 WNGLHNGLLFDKAYTVDGHGKKDWLKK--DGPKLGLYGWIARADDYNGNNIIGENLRKTG 250

Query: 367 DLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKLLQ 426
           DLKTI+E+ +EEARKQ+ LV NL  ++E K K + E+E+ CS  +  LN LM E+EK  Q
Sbjct: 251 DLKTIAELTEEEARKQELLVQNLRQLVEEKKKDMKEIEELCSVKSEELNQLMEEKEKNQQ 310

Query: 427 AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKYLA 486
            +  E+  IQ     H++KI  DHEKLK  LES++K+ E++  EL KRE  N  E   L+
Sbjct: 311 KHYRELNAIQERTMSHIQKIVDDHEKLKRLLESERKKLEIKCNELAKREVHNGTERMKLS 370

Query: 487 EEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQALEL 546
           E++E+   +NSSL+LA +EQQKADE+  KLA+DQ++QKE+LH +II+LE+Q D KQA+EL
Sbjct: 371 EDLEQNASKNSSLELAAMEQQKADEEVKKLAEDQRRQKEELHEKIIRLERQRDQKQAIEL 430

Query: 547 EIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMSND 606
           E+E+L+G LNVMKHM  D D EV+++ +   K+L EKE +L  LD+ NQ LI+++R +ND
Sbjct: 431 EVEQLKGQLNVMKHMASDGDAEVVKEVDIIFKDLGEKEAQLADLDKFNQTLILRERRTND 490

Query: 607 ELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSLWA 666
           ELQEA KE+ N  K+    + + VKRMGEL TKPF +AM++ Y + + ++RA E+  LW 
Sbjct: 491 ELQEAHKELVNIMKE--WNTNIGVKRMGELVTKPFVDAMQQKYCQQDVEDRAVEVLQLWE 550

Query: 667 EYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSALME 726
            YLKD DWHPFK +K+E       +D+E+EV+DD DEKL+ LK D G+    AVT AL+E
Sbjct: 551 HYLKDSDWHPFKRVKLE------NEDREVEVIDDRDEKLRELKADLGDGPYNAVTKALLE 610

Query: 727 INEYNPSGRYITSELWNYQEERKATLREGVRYLLDK 759
           INEYNPSGRYIT+ELWN++ ++KATL EGV  LLD+
Sbjct: 611 INEYNPSGRYITTELWNFKADKKATLEEGVTCLLDQ 636

BLAST of Cla97C07G131680 vs. TAIR 10
Match: AT3G48670.2 (XH/XS domain-containing protein)

HSP 1 Score: 694.1 bits (1790), Expect = 1.2e-199
Identity = 360/636 (56.60%), Positives = 465/636 (73.11%), Query Frame = 0

Query: 127 DVDTDMSESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASG 186
           D D+D+SESE+ E   K Y  LK GK  V+LS + F CPYC  K+K  F YKDLLQHASG
Sbjct: 11  DEDSDISESEMDEYGDKMYLNLKGGKLKVRLSPQAFICPYCPNKKKTSFQYKDLLQHASG 70

Query: 187 VGNSPSNKRSTKEKANHLALLKYLEKDLADAVGPSKPAS----NNDPVMDCNHDEKFVWP 246
           VGNS S+KRS KEKA+HLAL+KYL++DLAD+   ++P+S    N +P+ DC+HDEK V+P
Sbjct: 71  VGNSNSDKRSAKEKASHLALVKYLQQDLADSASEAEPSSKRQKNGNPIQDCDHDEKLVYP 130

Query: 247 WRGIVVNIPTRRTDDGRYVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFNKD 306
           W+GIVVNIPT +  DGR  G SGSK RDE   RGFNPTRV PLWNY GHSG AIVEFNKD
Sbjct: 131 WKGIVVNIPTTKAQDGRSAGESGSKLRDEYILRGFNPTRVRPLWNYLGHSGTAIVEFNKD 190

Query: 307 WPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRKIG 366
           W GLHN + F++AY  D HGKKDWL       KLG+Y W+ARADDYN NNI+GE+LRK G
Sbjct: 191 WNGLHNGLLFDKAYTVDGHGKKDWLKK--DGPKLGLYGWIARADDYNGNNIIGENLRKTG 250

Query: 367 DLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKLLQ 426
           DLKTI+E+ +EEARKQ+ LV NL  ++E K K + E+E+ CS  +  LN LM E+EK  Q
Sbjct: 251 DLKTIAELTEEEARKQELLVQNLRQLVEEKKKDMKEIEELCSVKSEELNQLMEEKEKNQQ 310

Query: 427 AYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKYLA 486
            +  E+  IQ     H++KI  DHEKLK  LES++K+ E++  EL KRE  N  E   L+
Sbjct: 311 KHYRELNAIQERTMSHIQKIVDDHEKLKRLLESERKKLEIKCNELAKREVHNGTERMKLS 370

Query: 487 EEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQALEL 546
           E++E+   +NSSL+LA +EQQKADE+  KLA+DQ++QKE+LH +II+LE+Q D KQA+EL
Sbjct: 371 EDLEQNASKNSSLELAAMEQQKADEEVKKLAEDQRRQKEELHEKIIRLERQRDQKQAIEL 430

Query: 547 EIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMSND 606
           E+E+L+G LNVMKHM  D D EV+++ +   K+L EKE +L  LD+ NQ LI+++R +ND
Sbjct: 431 EVEQLKGQLNVMKHMASDGDAEVVKEVDIIFKDLGEKEAQLADLDKFNQTLILRERRTND 490

Query: 607 ELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSLWA 666
           ELQEA KE+ N  K+    + + VKRMGEL TKPF +AM++ Y + + ++RA E+  LW 
Sbjct: 491 ELQEAHKELVNIMKE--WNTNIGVKRMGELVTKPFVDAMQQKYCQQDVEDRAVEVLQLWE 550

Query: 667 EYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSALME 726
            YLKD DWHPFK +K+E       +D+E+EV+DD DEKL+ LK D G+    AVT AL+E
Sbjct: 551 HYLKDSDWHPFKRVKLE------NEDREVEVIDDRDEKLRELKADLGDGPYNAVTKALLE 610

Query: 727 INEYNPSGRYITSELWNYQEERKATLREGVRYLLDK 759
           INEYNPSGRYIT+ELWN++ ++KATL EGV  LLD+
Sbjct: 611 INEYNPSGRYITTELWNFKADKKATLEEGVTCLLDQ 636

BLAST of Cla97C07G131680 vs. TAIR 10
Match: AT3G12550.1 (XH/XS domain-containing protein)

HSP 1 Score: 583.6 bits (1503), Expect = 2.4e-166
Identity = 322/638 (50.47%), Positives = 436/638 (68.34%), Query Frame = 0

Query: 135 SELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASGVGNSPSNK 194
           ++L + E   Y +LK+GK  VK+S+ TF CPYC   +K+  LY D+LQHASGVGNS S K
Sbjct: 3   NKLSDFEKNLYKKLKSGKLEVKVSYRTFLCPYCPDNKKKVGLYVDILQHASGVGNSQSKK 62

Query: 195 RSTKEKANHLALLKYLEKDLA-----------DAVGPSKPASNNDP--VMDCNHDEKFVW 254
           RS  EKA+H AL KYL KDLA            A     PA   D   + D    EK VW
Sbjct: 63  RSLTEKASHRALAKYLIKDLAHYATSTISKRLKARTSFIPAETGDAPIIYDDAQFEKLVW 122

Query: 255 PWRGIVVNIPTRRTDDGR-YVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 314
           PW+G++VNIPT  T+DGR   G SG K +DEL  RGFNP RV  +W+  GHSG  IVEFN
Sbjct: 123 PWKGVLVNIPTTSTEDGRSCTGESGPKLKDELIRRGFNPIRVRTVWDRFGHSGTGIVEFN 182

Query: 315 KDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRK 374
           +DW GL +A+ F++AYE D HGKKDWL   T +    +YAW+A ADDY   NI+GE+LRK
Sbjct: 183 RDWNGLQDALVFKKAYEGDGHGKKDWLCGATDS---SLYAWLANADDYYRANILGENLRK 242

Query: 375 IGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKL 434
           +GDLK+I    +EEARK  +L+  L  ++E K   L +++ + S+ +  L     E+EK+
Sbjct: 243 MGDLKSIYRFAEEEARKDQKLLQRLNFMVENKQYRLKKLQIKYSQDSVKLKYETEEKEKI 302

Query: 435 LQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKY 494
           L+AY+E++   Q  + DH  +IF+DHEK K+QLESQ KE E+R  EL KREA+NE + K 
Sbjct: 303 LRAYSEDLTGRQQKSTDHFNRIFADHEKQKVQLESQIKELEIRKLELAKREAENETQRKI 362

Query: 495 LAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQAL 554
           +A+E+E+    NS +QL+ LEQQK  E   +LA D K QKE LH RI  LE+QLD KQ L
Sbjct: 363 VAKELEQNAAINSYVQLSALEQQKTREKAQRLAVDHKMQKEKLHKRIAALERQLDQKQEL 422

Query: 555 ELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMS 614
           ELE+++L+  L+VM+ +E D   E++ K E+ L++LSE EGEL  L++ NQ L+V++R S
Sbjct: 423 ELEVQQLKSQLSVMRLVELDSGSEIVNKVETFLRDLSETEGELAHLNQFNQDLVVQERKS 482

Query: 615 NDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSL 674
           NDELQEAR+ + +  +D+    ++ VKRMGELDTKPF +AM+  Y +++ ++ A E+  L
Sbjct: 483 NDELQEARRALISNLRDM--GLHIGVKRMGELDTKPFMKAMRIKYCQEDLEDWAVEVIQL 542

Query: 675 WAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSAL 734
           W EYLKDPDWHPFK IK+E  +T       +EV+D++DEKL+ LK + G++  +AV +AL
Sbjct: 543 WEEYLKDPDWHPFKRIKLETAETI------VEVIDEDDEKLRTLKNELGDDAYQAVANAL 602

Query: 735 MEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDK 759
           +EINEYNPSGRYI+SELWN++E+RKATL EGV  LL++
Sbjct: 603 LEINEYNPSGRYISSELWNFREDRKATLEEGVNSLLEQ 629

BLAST of Cla97C07G131680 vs. TAIR 10
Match: AT3G12550.2 (XH/XS domain-containing protein)

HSP 1 Score: 583.6 bits (1503), Expect = 2.4e-166
Identity = 322/638 (50.47%), Positives = 436/638 (68.34%), Query Frame = 0

Query: 135 SELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASGVGNSPSNK 194
           ++L + E   Y +LK+GK  VK+S+ TF CPYC   +K+  LY D+LQHASGVGNS S K
Sbjct: 3   NKLSDFEKNLYKKLKSGKLEVKVSYRTFLCPYCPDNKKKVGLYVDILQHASGVGNSQSKK 62

Query: 195 RSTKEKANHLALLKYLEKDLA-----------DAVGPSKPASNNDP--VMDCNHDEKFVW 254
           RS  EKA+H AL KYL KDLA            A     PA   D   + D    EK VW
Sbjct: 63  RSLTEKASHRALAKYLIKDLAHYATSTISKRLKARTSFIPAETGDAPIIYDDAQFEKLVW 122

Query: 255 PWRGIVVNIPTRRTDDGR-YVGGSGSKFRDELKERGFNPTRVTPLWNYRGHSGCAIVEFN 314
           PW+G++VNIPT  T+DGR   G SG K +DEL  RGFNP RV  +W+  GHSG  IVEFN
Sbjct: 123 PWKGVLVNIPTTSTEDGRSCTGESGPKLKDELIRRGFNPIRVRTVWDRFGHSGTGIVEFN 182

Query: 315 KDWPGLHNAISFERAYEADRHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRK 374
           +DW GL +A+ F++AYE D HGKKDWL   T +    +YAW+A ADDY   NI+GE+LRK
Sbjct: 183 RDWNGLQDALVFKKAYEGDGHGKKDWLCGATDS---SLYAWLANADDYYRANILGENLRK 242

Query: 375 IGDLKTISEIIQEEARKQDRLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKL 434
           +GDLK+I    +EEARK  +L+  L  ++E K   L +++ + S+ +  L     E+EK+
Sbjct: 243 MGDLKSIYRFAEEEARKDQKLLQRLNFMVENKQYRLKKLQIKYSQDSVKLKYETEEKEKI 302

Query: 435 LQAYNEEIKKIQLGARDHLKKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKY 494
           L+AY+E++   Q  + DH  +IF+DHEK K+QLESQ KE E+R  EL KREA+NE + K 
Sbjct: 303 LRAYSEDLTGRQQKSTDHFNRIFADHEKQKVQLESQIKELEIRKLELAKREAENETQRKI 362

Query: 495 LAEEIEKYEVRNSSLQLAELEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQAL 554
           +A+E+E+    NS +QL+ LEQQK  E   +LA D K QKE LH RI  LE+QLD KQ L
Sbjct: 363 VAKELEQNAAINSYVQLSALEQQKTREKAQRLAVDHKMQKEKLHKRIAALERQLDQKQEL 422

Query: 555 ELEIERLRGTLNVMKHMEDDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMS 614
           ELE+++L+  L+VM+ +E D   E++ K E+ L++LSE EGEL  L++ NQ L+V++R S
Sbjct: 423 ELEVQQLKSQLSVMRLVELDSGSEIVNKVETFLRDLSETEGELAHLNQFNQDLVVQERKS 482

Query: 615 NDELQEARKEITNAFKDLPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSL 674
           NDELQEAR+ + +  +D+    ++ VKRMGELDTKPF +AM+  Y +++ ++ A E+  L
Sbjct: 483 NDELQEARRALISNLRDM--GLHIGVKRMGELDTKPFMKAMRIKYCQEDLEDWAVEVIQL 542

Query: 675 WAEYLKDPDWHPFKVIKVEGKDTADGKDKEIEVLDDEDEKLKGLKKDYGEEVCKAVTSAL 734
           W EYLKDPDWHPFK IK+E  +T       +EV+D++DEKL+ LK + G++  +AV +AL
Sbjct: 543 WEEYLKDPDWHPFKRIKLETAETI------VEVIDEDDEKLRTLKNELGDDAYQAVANAL 602

Query: 735 MEINEYNPSGRYITSELWNYQEERKATLREGVRYLLDK 759
           +EINEYNPSGRYI+SELWN++E+RKATL EGV  LL++
Sbjct: 603 LEINEYNPSGRYISSELWNFREDRKATLEEGVNSLLEQ 629
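
The percentages in an HSP header are the matched-residue counts divided by the alignment length. As a minimal sketch (the helper name `check_hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST toolkit), the reported values for the AT3G12550.2 hit can be recomputed from the counts in its header line:

```python
import re

# Hypothetical helper for illustration only; not a BLAST library function.
# Matches fields of the form "Name = matched/length (xx.xx%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_percentages(header: str) -> dict:
    """Map each field name to (recomputed percent, reported percent)."""
    result = {}
    for name, matched, length, reported in FIELD.findall(header):
        recomputed = round(100 * int(matched) / int(length), 2)
        result[name] = (recomputed, float(reported))
    return result


# Header values copied from the AT3G12550.2 HSP above.
header = "Identity = 322/638 (50.47%), Positives = 436/638 (68.34%), Query Frame = 0"
stats = check_hsp_percentages(header)
for name, (recomputed, reported) in stats.items():
    print(f"{name}: recomputed {recomputed:.2f}%, reported {reported:.2f}%")
```

The "Query Frame = 0" field has no count/length pair, so the pattern skips it. The same check can be applied to any other HSP header on this page.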

BLAST of Cla97C07G131680 vs. TAIR 10
Match: AT1G13790.1 (XH/XS domain-containing protein )

HSP 1 Score: 439.5 bits (1129), Expect = 5.5e-123
Identity = 281/740 (37.97%), Positives = 407/740 (55.00%), Query Frame = 0

Query: 133 SESELGERESKSYDELKNGKRIVKLSHETFTCPYCTKKRKRDFLYKDLLQHASGVGNSPS 192
           S  EL + E + Y E+K+G R VK+S   F CP+C   RKRD+ + DLL+HASG+G S S
Sbjct: 3   SRRELEDLEYRYYSEMKDGTRKVKISESLFRCPFCYIDRKRDYQFDDLLRHASGIGGS-S 62

Query: 193 NKRSTKEKANHLALLKYLEKDLADAVGP-------------------------------- 252
             +  ++KA HLAL +Y+ K L     P                                
Sbjct: 63  RTKDGRDKARHLALERYMRKYLRPRERPRPSPTSDVSSLPKEEFTGKWKSTLSTTEEGEF 122

Query: 253 -----------------------------------------------SKPA--------- 312
                                                          S PA         
Sbjct: 123 ITTENSSSPHIVKAEPKFVSGDDSGRSGEERLKFSDKPDPFFSNEDKSYPAKRPCLVSGA 182

Query: 313 -SNNDPVMDC----------------------NHDEKFVWPWRGIVVNIP-TRRTDDGRY 372
              ++PV                         N D+ +V PW+GI+ N+  T      +Y
Sbjct: 183 KEGDEPVQRIGLSHGASFAPTYPQKLVSLGAGNGDQMYVHPWKGILANMKRTFNEKTRKY 242

Query: 373 VGGSGSKFRDELKERGFNPTRVTPLWNYR-GHSGCAIVEFNKDWPGLHNAISFERAYEAD 432
            G SGSK R++L ++GFNP +VTPLWN R G +G AIV+F K+W G  NA  F++ +E  
Sbjct: 243 AGESGSKIREDLIKKGFNPHKVTPLWNGRLGFTGFAIVDFGKEWEGFRNATMFDKHFEVS 302

Query: 433 RHGKKDWLANGTTTEKLGIYAWVARADDYNSNNIVGEHLRKIGDLKTISEIIQEEARKQD 492
           + GK+D        +KL  Y WVA+ DDY S   +G+HLRK GDLK++S    E+ RK  
Sbjct: 303 QCGKRDHDLTRDPGDKL--YGWVAKQDDYYSRTAIGDHLRKQGDLKSVSGKEAEDQRKTF 362

Query: 493 RLVSNLTSIIELKNKHLIEMEKRCSETATTLNNLMGEREKLLQAYNEEIKKIQLGARDHL 552
            LVSNL + +  K+ +L +ME    +T++ L   M E+++++  +NE++  +Q  ARD+L
Sbjct: 363 TLVSNLENTLVTKSDNLQQMESIYKQTSSVLEKRMKEKDEMINTHNEKMSIMQQTARDYL 422

Query: 553 KKIFSDHEKLKLQLESQKKEFELRGRELEKREAQNENESKYLAEEIEKYEVRNSSLQLAE 612
             I+ +HEK    LE+Q+KE+E R   L+K +A+N+ E +       K + +     +A 
Sbjct: 423 ASIYEEHEKASQHLEAQRKEYEDRENYLDKCQAKNKTERR-------KLQWQKHKNLMAT 482

Query: 613 LEQQKADEDFMKLADDQKKQKEDLHNRIIQLEKQLDAKQALELEIERLRGTLNVMKHME- 672
            EQ KADED M+LA+ Q+++K++L  ++ +LE+++DA+QALELEIER+RG L VM HM+ 
Sbjct: 483 QEQNKADEDMMRLAEQQQREKDELRKQVRELEEKIDAEQALELEIERMRGDLQVMGHMQE 542

Query: 673 -DDEDVEVLQKAESTLKELSEKEGELEALDELNQALIVKQRMSNDELQEARKEITNAFKD 732
            + ED ++ +  E T +EL EKE + E  + L Q L+VK   +NDELQ+ARK +  + ++
Sbjct: 543 GEGEDSKIKEMIEKTKEELKEKEEDWEYQESLYQTLVVKHGYTNDELQDARKALIRSMRE 602

Query: 733 LPGRSYLRVKRMGELDTKPFHEAMKKIYNEDEADERASELCSLWAEYLKDPDWHPFKVIK 758
           L  R+Y+ VKRMG LD  PF +  K+ Y   EAD++A ELCSLW E+L D  WHP KV++
Sbjct: 603 LTTRAYIGVKRMGALDETPFKKVAKEKYPAVEADKKAEELCSLWEEHLGDSAWHPIKVVE 662

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038890085.1  0.0e+00   96.32     protein INVOLVED IN DE NOVO 2-like [Benincasa hispida] >XP_038890086.1 protein I...
XP_008461675.1  0.0e+00   94.59     PREDICTED: protein INVOLVED IN DE NOVO 2 [Cucumis melo] >XP_008461676.1 PREDICTE...
XP_023536648.1  0.0e+00   93.18     protein INVOLVED IN DE NOVO 2-like [Cucurbita pepo subsp. pepo] >XP_023536649.1 ...
XP_022977373.1  0.0e+00   93.18     protein INVOLVED IN DE NOVO 2-like [Cucurbita maxima] >XP_022977376.1 protein IN...
XP_022956639.1  0.0e+00   92.71     protein INVOLVED IN DE NOVO 2-like [Cucurbita moschata] >XP_022956640.1 protein ...
Match Name      E-value   Identity  Description
Q8VZ79          1.7e-198  56.60     Protein INVOLVED IN DE NOVO 2 OS=Arabidopsis thaliana OX=3702 GN=IDN2 PE=1 SV=1
Q9LHB1          3.3e-165  50.47     Factor of DNA methylation 3 OS=Arabidopsis thaliana OX=3702 GN=FDM3 PE=4 SV=1
Q9LMH6          7.8e-122  37.97     Factor of DNA methylation 4 OS=Arabidopsis thaliana OX=3702 GN=FDM4 PE=4 SV=1
Q9SAI1          1.2e-119  40.50     Factor of DNA methylation 5 OS=Arabidopsis thaliana OX=3702 GN=FDM5 PE=2 SV=1
F4JH53          4.7e-119  40.88     Factor of DNA methylation 2 OS=Arabidopsis thaliana OX=3702 GN=FDM2 PE=1 SV=1
Match Name      E-value   Identity  Description
A0A1S3CF47      0.0e+00   94.59     protein INVOLVED IN DE NOVO 2 OS=Cucumis melo OX=3656 GN=LOC103500220 PE=4 SV=1
A0A5A7U517      0.0e+00   94.59     Protein INVOLVED IN DE NOVO 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc...
A0A0A0KNW6      0.0e+00   86.55     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G182100 PE=4 SV=1
A0A6J1II99      0.0e+00   93.18     protein INVOLVED IN DE NOVO 2-like OS=Cucurbita maxima OX=3661 GN=LOC111477722 P...
A0A6J1GZM5      0.0e+00   92.71     protein INVOLVED IN DE NOVO 2-like OS=Cucurbita moschata OX=3662 GN=LOC111458309...
Match Name      E-value   Identity  Description
AT3G48670.1     1.2e-199  56.60     XH/XS domain-containing protein
AT3G48670.2     1.2e-199  56.60     XH/XS domain-containing protein
AT3G12550.1     2.4e-166  50.47     XH/XS domain-containing protein
AT3G12550.2     2.4e-166  50.47     XH/XS domain-containing protein
AT1G13790.1     5.5e-123  37.97     XH/XS domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description            Source       Source Term     Source Description                     Alignment
None       No IPR available           COILS        Coil            Coil                                   coord: 410..430
None       No IPR available           COILS        Coil            Coil                                   coord: 439..477
None       No IPR available           COILS        Coil            Coil                                   coord: 513..557
None       No IPR available           COILS        Coil            Coil                                   coord: 566..596
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                    coord: 123..142
None       No IPR available           PANTHER      PTHR21596:SF65  PROTEIN INVOLVED IN DE NOVO 2-RELATED  coord: 126..758
None       No IPR available           PANTHER      PTHR21596       RIBONUCLEASE P SUBUNIT P38             coord: 126..758
IPR005379  Uncharacterised domain XH  PFAM         PF03469         XH                                     coord: 627..759; e-value: 2.6E-52; score: 176.2
IPR005381  Zinc finger-XS domain      PFAM         PF03470         zf-XS                                  coord: 164..206; e-value: 5.0E-19; score: 68.3
IPR005380  XS domain                  PFAM         PF03468         XS                                     coord: 236..350; e-value: 6.9E-39; score: 132.6
IPR005380  XS domain                  CDD          cd12266         RRM_like_XS                            coord: 239..348; e-value: 2.23483E-43; score: 150.192
IPR038588  XS domain superfamily      GENE3D       3.30.70.2890    XS domain                              coord: 231..404; e-value: 6.9E-62; score: 210.2

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C07G131680.2  Cla97C07G131680.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0031047      gene silencing by RNA