ClCG09G020050 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG09G020050
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionzf-C3H1 domain-containing protein
LocationCG_Chr09: 37096873 .. 37109435 (+)
RNA-Seq ExpressionClCG09G020050
SyntenyClCG09G020050
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCGCTCTCACCTTCGTCCTCAAACTTCCTTTCCCTTGGACCGCTTCTTCAATCTCCCTTTCACTGAGCGGGGGGATGGAGAAGAAGAACTCCGAGGAGCTCACCGTCAAAGCCATGGCATCCAATTCAAAACCTAGCAAAAGCAAAGCCTCCGAAAGCAGAGAAGAAGGAGAAGTATCTTCATCCGACAACGATACACAAACACACGACGTACTTTCTTCTTCCTACTATCTTCTCATTCTGTTTCTAATTTCTTTTTGTTTTTCTCAATCATCTGCATCTCATGCCCTTCGTTTTCGTCCTGTCCGATGCTGAGAATGATTGTATATCCGCTACCATTTTCTTGCTGCCTCTGTTTTCCTTGGAAATGAAGGCTTCCTAGAGCTTCATACACTATTCTACTAATTCGAGTTCTAGGTCTGTGGTTATCTCTTAATTCGTAGTTGTATATGTCATCGAGGTGAGAACAGAGAGAAACTGCTATACTTTGAAGCTCAAACGGTTTATCTCTACGCGGTAGGGTTTAAATGTTTTAGTTGTGACTTGTGAACACTCGTTCTCTCCTGATTTAAACTGCATCTGTTTTTAAGTATGATTATGTTAAGAACACTTTGGATTTGTGCATTGTTTTCATTCTTTATAGTCGGATTAACATGTTTTGTGGTATCCAAAATGGATTTTCTGATTCGCCCCGATTCAATTTGATCTTTTCTAGTCGCAAAATGATTTTCATATTTCTATTTTGTGCGGCTTATGAGGAATTTTTACTTCTCAGGTACATCCTGTTTGTTCTACAGTGCCTGCCTCGGTCACATCACCTATCTCATCCATTCTTCCTCCTAAGAATAAATGTAACGAAGGGATCCAGGCTGGTAAGTATTGTCTCGCTTATAGAAGACGTTGGTGTTACACCACCCTTTCACTTTGAAATCTGCAACAGTATCTAAAATTGCTTTTAAGACTATTTTTATTAAGAAGCTTTTGGTATACCTGCGATCACGTTGAATGTAGTGTTTAGGCATATTTTTAAACCAAGGATATTGGACATCTGGTGCATTTTTTTCTGTGGCAAGTTACATGGACTTAGGTAATATGGATTTTAGTGGGCTTGGTTTTTTGTATGCCCTTCTATTCTTTCCTTGTTCTTTCTCTAAGCAGTTGTTTATTAAAAAAAAGAAAGTTACATGGATTTAGGTTTAACCTTGCAGGAATACTCTGCAGCTTTACTAAAGTTGAAGTGTGTACACATCACTTTTCCCTCAGGATTGAAAAGATGTGGATTCTAGCTGTGGTTCTTTCAAACTTCCATGCTTTATGTTTATTTACTGCATTTAAGGTTTCAATGCTTGAATATTTCCTGTTACTCTGGACTTTTGACGGGGTTCTCAAATTGTTGGTCTTATAAATGAATCTTGCGGTGCACTAAACTTTTTGGTGGTCTTGTATTAACAGTATTTTGTGATGACCAATATCTGCCACTAATACAGTAGGCTGTTTCTGGAAATAAATTTTCAGCTTCTGCTGATGTTTGCACAAGAACATCCATACAAACTACTTCTCAGAAGACGTGTGATACTGCTCAAGTTGTGAATAAAGCTAGTACTCCTTGGGGTGCTTCCAGGGAGGCCAATTCGAATCTTGTGATTAGCTTCTCAGATGACAGTGGCAGTGAATTGGAGGAATGCAGTAAAGTCAGAACTTCAAAATCTCATAGTGATGCTGTCAGGCACTTTAAACCTCCAACTTCAACACTTGATAGATCAAACAGGTTACGGAGTATGACAAGGAACAAAGTAGTGGCAAACAAACTGTCTTTGAGTCAGCCATTTATCCCTTCAATGACCAAGAATCACAGAGCATATTCCACGGGTGCTGGGCCCTCATTGGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGAAACCTACAAAGCCAAGGGCGTGGGAATGATCAAGGAATGAACTTAAACACTAGTAAGCTGCAGGACTTGCGGGAGCAGATAGCCATTTGCGAAAGCAAACTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGCATTTCAATTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAAGGGAATGCTACTATCTCTCAAGTTTCTTCACTGGGGCCTAAGGAACCAGATGCAAAGCGCCTGAAAACCAGTGGGTCTTATTCTACCAAGCTGAGTTTGAGTGGGCAACAACATCTTCGTGCAACGTATGCTGGAAAATCTGTTTTTCGGCCACAGGAGCCTGGAGAAGAGACACAGAACATTAAGGTCACTTACAACCAGAAAGGGAATTCTTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAACATGTGGCTGCTTCACCTTCACCTGGAATTGACCTTGGCAAAGTACAGGATGGTGAATAATTTGTTCATACTGAATAGATAATTAGATCTTTCAGTTTAGCACAATCTTACTGTTTTATGAGGTGTCTGCCTAACTTATTACGTAAAATGCTGTGCAGACACCGACATTGTTGCTAATGGAAATCAATCAGACTGGATCAGTAAGCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACGAGTCTTCCAAATGTCACATCCAATGTCCAAACTCAGTTCGTAAGTCAAGAACTGTATATGTGGCCTGATGTCTTTTCTCATTGATGTATTTTTTTTTTTTTTGATCAAGTATCTAAAATTCATTATACTTCTCAGAAATTCTAGAAGTTGTTTCATTAAACCTTCGATTATCTTATGTCTCCATTATCTTATGTTTGATGATGCTGTTTTTGGAATCTCCTAAAATTGTGAAATTCATGGTGAAGTTAATTGAACCAAATCTGTATCAGGATTCCTTATCATATTTAGGATATGATAAAATATTTGCTTACATGAATGGACTTCTCTTCAAAATCATGCTCCCGTTTATTATACATAGTTCTTCCTCCTCTCCCCTTCAAGCACAATTTGAATATTTGTCTCAACAGGATAATGTTGAGTTTCACCGTCAAAGTGATGGCCTTCAACCATCTGCATCAACTGCAAAATTATTTGAAGGAACACTTCCTCAATCAGCATCCAATGTCAAGATACCAGAGCCATGCAGTAATTTTTTTAAGGTTTATCTTCAAGCTCAACTGTCACATTTTAGTTTTTTCTTCTCTCTCTTTCATCGTAGTTACATGGATTTCCTCTGGATTTATTATTATCGCAGTCATTGATAAACAGTAAAAGCTCCGGGTCTGCTTTTGGTAATTCATCAAGTTGCTTGGGCTTCAGCAATCTTGATCTCCAATCATTATTTGAAATGGAGGAGTCTCTAGACAAGGATCTGGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTCTTGATCTTTATCACAAGAGAGAATTATTTTCAGCTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTTAATTAGTTCCTCGAGACAGCAAGAAGACATGAAAATTGGTGTGGATCACTTAAATAGTATGTCTGGAAATGCAAATAGAGCTTCTTCTTTGTATCAGAAGCATTTTGAATATAATAGTTCTACTAAGCTACATAATGATTTAAATATGCAACATGAAAATGCTGGTCCCATCAATACTTCAAACCTGCATGAGAATGGACAAAATTTGGGGTCTGAACCTGGATCCTGCTCTGCCTTATGTGGTAATACATTGGATCCATTGCCTTCCAAAGGCAATAATATTGCAGATAGAATTTGCTCTCCATCCTTTGATCCAAACGTTTCAGTGGATGGAGATGAAGAGTCATTGCCTTCTGACCATGAAATGATCGATTCCTATGATGAATGCTACATAGGAAGAAAACAGTTTGAAGATGATCAATTGGAAGCATATAATATGTCAAAGAAAAACCACAGTGACAATAATATTGAGGATTCTTTGCGTCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGACTTGTAATCCATGCCATAACATTCAAACGTCAGTCGAACAGGGGACTGAGAATGATGCCAGAGACGATAGCACTCAGCAAAATAATACAGAACCTACAGTAGGTCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGACTGAGATTGCTTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGTAACTTTATCTTCAGGAATGAGACTTATGCTTCTTCATCATATGTGAACTTACAGGCATGTTTGAAGTGGCTTTTAAAGATGCATTTTCTATTGTTCTCATGCTATTCATGTTATTTTGGGTGAACATCTTTTACCTAAAGGCTAAAATTGATCATCACCCCAGTCCTTTTCTTGAAGGCTGGTTTTGGAGGTCAGTCTTTGAACTGAAAAATATGGCCATATGGGTATATATGCTTAGATTTAATTGGGGCCGGCTATGCTTTTGTTCGGGTAGGCAGGAGACTTTTGGGAACTTGAATCCTGACGTGTTTTGGGTCTATTTCTGGGTTGAAAATTAATAGAGATAAGAGCTTGTGAACTTTGAGTCATTAAGTCTATTTTTGAGTTTATTCCAATCGTTGTTTGAGTATATTCCTAGCTCCTTGAAGCTTTCTAAGGGTGGAAGATGCCTAAAAGGGCTAGTATTTTGTGGAATTGTGTGGTTAGGGTATTGTTGTGGCACATTTTGCTGCAAAGGAATTAGGGGACCGCTTCTTGGAAAACTTGTTGTGGTTAGGGTATTGCTCTTTTCTCTTTTGTTTTTTTTTTGGGGGAAAAATTTATTGTTTCCTGGTGGGGATATAGTCACAGGAATTCTTTGTAGTTACAACCTTTCTACTATCTTTAACCATTGGTAAGCTTTCTTTTAGGTCATGTTGGACGGGGCTCCCTCGTCCTCCAACCCATAGGTTGTATTTATCTCTCTCTCTTTGTTTCCCTTCCTACATAATTGTTTTTTATTAAAAAGAAAAAAAAAAAGAGCTTGATCTTGGACATGAATTGTGATTTAGTGAAGTTGATAGGTTCCTTCCTTTCCGTCCTCCTACTTAGGTTTTTCCCTTGATCACGACCCTCATAATTTTCCTATATGAGTTCCGGTTTTGGGTAAAATTAATGATGTCTAGCTTCTGGAAGAGGTAGTTCTTCTCTAACGTGGGCAGACTTACTCTTATCTAGTTGATCTTGTGTGGTATCTCGACCTGCTTCTCTCATTGCTTAGAGTCCCTGTATCGCACCTTTGGAGAAGTTGATGAAAGATTTTCTTTGGCGTTCCACATCTGGCTTTTATCACCTTTCTCCATGGAGCATCCTTCGGATACTGAAATTTCCAAGCCTACTTGGCTAAGAAGGCCTTGTTCGGAAGCTTCTAGTCCACCATATGGATCCCCTCCTTCCACCTAAGGGGTTGAGAAAGCTCCTTACTCCTATTGCCAGACCATAAAAATTTACTTTGGAAGGCAAATGGAGTAAGGACAAGAAGTATATGCACATGGGCAGGTTTGTAAAGCGTAAGTCTACCTCCTTGTTGCTAAGGATGTAAACCTTTTGTAGACTTTATCCAAGATAGGCTGCCAAAAGGCAAGGGTAGTGGTATTATTGTTGGGAGGAAGTCCCAAATATGTTGACGGTCAAGATCCCACCTTACAACCAAACTGCTTTGCCAAGTGAGAAATAGAATTAATATCACAATTAATTCCTAGAAATTCAGATCTGTGTCAGTTGATGATTAACCTAGACTTATGTTGAATCATTATTATTGGTTCATTCAGTAGCAGCAATCTCTTTGTGTTTATGCACTTGTTTATTCAGTGATGGAACATAATATAACTAAAAATTGAAAAGAAGGGTGTGGTTGGTATAGTTGTGCAATTGTTGCATCATCCATTCTATGTATCAAGAAGAATATAATTCAATGAACGTAAAGTGACAGAAAAGGTATTTGCTGGTTAGAAAATTATTCATGGCTCTTTTTGTCTTGGCATCTTGCCTTGCAAAGTATGAGATACTTTTCCCACATTTCTAAGAGGTTCAATTAGCGAATAATTTTGGCCTTTGGTATCTACTTATGAAACCTCAGTGTTTAATGCTTCAGTTTTCTTTTCTTAGCATTGAGCGTTTCAGATTGCTGATTGCGGATGCTCTGTTAACTTGAACCTCCATTGCCCGTTCATACTGCCATAAAATGAACCAAACCTTAAAGTTATGTATAGGGAATCATGGATCAATGATTCAAGAGATAAGGCATAACTTGGTTGTTTTTGAGTATATCAGTTCAAGATTTAGTCTCTTGCTGATTATGATATCGATAACTTTTTCTTATTTCAAAAGAATATTTTTAGCCTTTTTTACTCTAAAAGCTGTGTTTCAGATGGTAGTTTTGTCAGATTAGTTGAGTGTAAATCATGATTAAAAACGATTTAATGAAAAACTGAGGATTATGCTAAGAATATTTTCTTGGTTCTTCCCGCTTCCTTTTTTTTTTTTTTTCTCTCTCTCTCTCTCTCTCTCTCTCTAAATTAACTTCTTTATGCAATATGCAGGCACAAACATATGCAAAACCCCAGATGACATCCATGGTCGTTGTCATTTTGAGAACTTGCCATCAGAAGCTCAGGATTCTGCGGACTCTGATGAAAATGAACGATTCAATAGAGAAGGATCTTGCTCCAAAACTACTTTTAGTTTTACACCTTTGACTATGAACAGTGTTCTGCAACATATAAAGGCCATATCCTCAGTTAGTATAGAAGTCTTGCTCACTAGAACTCGAGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGCAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGACTGTCAGACCTATGTTTCAGAGTGATGGCTCTTATATTGATGATCTTGCGATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGCCCTTGGCAACATGTGAAGGACTACTCTTTTGCCAATAGAAGGCAGTGTCAGCATGGCCACATCAACTATTCTGGTATGATGATTCTGATGTTCTTTTAATTGAATGGTTGAAATTCTGTAGACTTTCACCTTCATCTTTTGCAGATTCTTGCAATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGTATGAAGATGGCATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCTGATTCACATTCATATGACCCTGTTTTAACTCAGAAAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGCTATTTTTACATGATGCTCGTATAGTGGCCAATGGAAATTGGAATAGACCATCATCATACTTTCAGAGGGGAAGCTCTATATTGGTTTGTTTATTACTCCTTATATATATACCCTTGTTTCAGTCTGCGGCTGAACAATTTTTTTTTCCCTTTTTCTTTTCCTTTTGACATGGGACACAATGGCTGATTGAACTAATTTGTATTTGCCTTCCTCTGTCTCTCGTCTTCCTCCGTCATGCATGATTGCATACCTGTACAGTGTTTTTCGGTTAGAAATATCTTTATTCAACTTTCTTGGTATGCTGTGATCACAGTACTTGATCATTGTTACTTTCTTCAATGAAACTTCATTTTGAATGAAGTTTTCCCCAAATGAAAAGTAGACCCAAGTTTCAAAACAAATGAATTTTGGACTTTTGCTGATGTCATCAGCCTGTGAAATTACCACTCTAACCCTTCTACCTTTTTCATTCTTTCTTTTTTTTTTCTTTTTCATTTTTTGCATTTTTCTTCTTTCTTTTTTCTTTTTTTTCTTTGTTTCTTCTTTCATTGTCTTCTTCTGCCATTATTTTTTCTGCGTTCTTCTTCTTCTTTCTCTTTTCCTGCACTCTTCTTCTTCGTTTTTCAAATTTTTTTTTTCTCTTCCTTCTTCGTTCTACAGTTTTTTTCTTGAAGAGAAAAATTGCAGTTTTGTCTGATGTGGAACAAGAGGAAGGAGTGAGAAGCTAGAGGAAGAATCTGAACAAAGAGAAAAACCGAATAAAAAAATTACAGTTGTGCTTGATGAGGAACAAGAAAAAGGAGTGAGAAACTAGAGCGAGAAGCCAGAGGTAGGAGCAAGAAGCGAGAGGTAGGAGTGAGAAGTGAGAGCAAGAATCTGAACAAAAGAAGGAGCGAGGTGCGAGAGGAAGGAGCGAGAATTCGTTTGAGCGTTACTCCCTTGCTCACAATTTGGGGACAATATGGTATTTTCACACGCGATTGACATAAGCAGAAGGCCATTTTTTGAGCAAACTTGAATCTTGGGCCAAATTTCATTTGGGCCCCTTTTTACAAAACCCCCAATTTTCAAGTCTGCCATTATATCTGAACTATCTGATTCTGGGAAACACGGTTACTCCTGCCTTGCTCAAATTGTGGCATTTAATATGACCATTTTTAAACTGAGAAATATCATCTTTTTTCCCTCCTTTTTGCATATTTCATTTTGCACACACACACATGTATATAGTCTGCATAGAAGTATTTCATTTCTCCTTTTTGCATATTGTTATTGTATGAGAATTTTGGTTTTTATTTTGTTATGCTTGTATTTCTTTTGCCATTTTTTTTTCTGTTTGTTTGCTTCTATTTGAAGTTATCTCTTTTTTCCCTCTTTTCCTTTTTACATAAAGATAAAATGATCCTTTTTTAGTTTTTGTGATGTTACTGTTGTATTACTGATTACTGATCCTTATCATCTTCCTGTTGTTTGGTCCATGATCTCACTTGTACTGACTTTTCTTGCAACTGCTATAAAGTTTTCTGTCACTGATTACTGAATACAGAACATTAACCATCTTTCTGTTGTTTGACCCATGATCACTTGTGCCAACTTCTTACAAAGTAGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAACCAGGAAACAAACAGTCGAGAGGGCATGAAAAAGGTATGTCTTTTTTTTATTCATTCTTTTGTTGGTTCTTGCTTGTGTGAACCATGTTGCAATACAGTTGGAAATATTTGTATTTGCAGGCTCTTCCTGTACTATCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGACCATTTACCTTCTAATATTCTATAGCTATACTACAACCGGGGGGAAGGATGACATGTTCTCTTTTGCGGTATGGTTTATTATATTTGTTCCTGCTCTGCTGTACAATATAATCCGGATGTAGGAAAGTACTGCAAAAACGTCTTTAAAGATTCAATGTATTAAAGTATATATAGTAGAGAGATGAAACAAGCTTTGCGATCTCGAAGGTTTTTGCTTTGATCAAAATTTAGTAAACACTGCTTAGCGTTTTGGTGATGCTGAAACTTTCAATCCAAGTGAGCTGTAGTTATTTTTCGGTTGTTATCAACTTTATAAAGTGAGGATTAAAACACAATATGTTGTGATATTCCACTACCTATACATGTTCTATAATGTAAAAACTGATAATTCAAACTTCGAAGTATGAGGAATGGATGATTTACTACAGTGGTCCAAAACTCTGAAATTTATTGTCACGCATCTTAAAGTTTCTTGTATTTTGAAAACAGTCCATCTTTGAAGTAACATTGAAATAACAAAAAAGCTGTCATTGCTAAACAAGGGAGAAAGAAAAAGCTGACGTCTACTCGAACTTTATTGGGCATGCTGATTTTAGATTCTTTTAATAAAAATAAAAATAATATTATTATTATTTTTACTGTGTTTGAAAACCTTTTATTTCAGCTGAAGTGCGGTTTTTGGCTTTTGTTTATTTCGTTTCCTCTTAGAAAAATTCTCCTTGTACTCTTATCACCTCTATGATTCAATGCCGAGTCATCTCTTCCTCCATACTTGAATTGGTGGGCACTTCTGTCCATTACTTATTCTCACTCGCATAATCAAGCATTGTGTGGAAATTCTTTGTTCTTGTTATTCAGGTCAAGCACAATGGGCAATCTTATGAACTCTGGCTCATGTACATTAACAGCCGCATGAATCTCGATGCTCGATTGGCTGCATATGATGCTGCACTTTCTGCACTCTGCGACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAACGTGGAGAAGGGTATTCAGAGGATTTTTGGACTTCTTCGAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCGCATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTGTGTTGTGTATTTAGTTATTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCCATTCAATTGACAGATGGTGAGAGGCTGAGGGCTTCTAGGGTGGTCAAGAAAGCAGTCGATTTTGTTGATTCATGCCTGAACAATGAATCACTTGAAAGTAAATGCTACCAAAAATCTATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGACATAGGATTCAGTAGGAACTTGTTGGATAAGTATGTTAAACTTTATCCATCTTGCCTAGAACTTCTTTTACTTAAAGTACGGGCAAAGAAACATGGTTTTGGGGATGAAACTGTCGTGGCATTTGAACAAGCGATCAGGAACTGGCCGAAAGAAGTACCTGGTGTCCAATGCATCTGGAATCAATATGCTGAATATTTACTTCAGAATGGGAGAATCAAATGTACTGAAGAACTAATGGTGCGCTGGTTTGAGTCTACTTCAAAAATGGATTGTTCTAAAACTAGAACAGTGGATAATAGTGACTGTGACTCCTTGCACTTGCGAGAGTATGCTTCAGGATCAATTCTACATGCATTAGATTGCAGTCCCAATGAGGTGGACGTGGTGTTTTGGTATCTTAATCTTTCTGTTCACAAGTTACTGCTTAATGACCAATTAGAAGCACGTTTGGCCTTTGACAATGCTCTGAGGGCTGCAGGTTCTGGGACTTTTAGATATTGCATGAGAGAGTATGCTATGTTTTTGCTTACAGACGAATCCTTACTGAATGAGGCTGCTTCTGTTGGTGGAATAAGGAGCATTTTAGAGGGTTATCTCAACGATGCCCGAGCTTTCCCTGTCCCTAAACCATTATCCAGAAAATTCATTAACGATATCAAGAAGCCAAGAGTTCAACTTCTTGTCAGTAACATGCTGTCTCCACTTTCTCTGGATGTTTCTCTAGTGAACTGTATTCTTGAAGTCTGGTATGGGCCATCTCTTTTACCCCAAAAATTTAACAAACCAAGGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATGTTGCCTTCTAATTATCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAATGGTGACAACTCTTCCCAAGTTGCCTCCCCCAGTCTTATTTTCTGGGCCTGCTCAAATTTGATCAGTGCAATCTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGTCAATGTCAAAGGTTTTGAAGCCATATATGAGAGGTTTCACAAGAGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATACTACAACATATGTAAAACTAGAGGAGATACGAGTGCTGTTCTGCGAGAAGTAAATGAAAGGGGAATCGAACTCAACGAGCCTTCTTTGTGATAGAGTTTTACCTTCTTTTGGTAGGATCAGTAAAATTACAGGAAAACTAGGATCAGTTTTTAGTTATGGGAGAAGCCTCATTGTACAGATAGTCTACTTCATACCGATTTATCATGCTAGGCACGGTGGTTTCGCCAGTTTTGTTAAGTGGTAGAGAGCTAATTGCGGCTAGGAATTAGATATAAATACTTGCACAGAGAAAAATTTGGTGGAAAAAGGTTTTCTTGTCTGTAAAGTCATTGAGACCTTATTGGAACTGGATCTGTTGAGGATATCATCAGGTTTCTCAGCCAGTTGCCCACTGACCATTGTTCGGCCGTTCATAAAGGTGTATCGTCGAACGTCAGGTTTTCCCCATCTTTTTTTTGGTAGCCCACACAGTTTACTTTTGTTCTTCATCCTTCCCCGCTTGCTAGAATTAATTTCTTTAATTGACCTTACCCTTTTAAATGTAAGTACCAACCATTTTTAGGTTTAAATTTGATATGGTTTAAATGAAATACAATACTCGATGCTCATCGATGTAGGTTTCATTTCAATTTTCCCACTATCTACGAGCATGTTTTGTTTGTTAAAGCTCTCAGATCAAACACAGGTTTGTC

mRNA sequence

TTCCGCTCTCACCTTCGTCCTCAAACTTCCTTTCCCTTGGACCGCTTCTTCAATCTCCCTTTCACTGAGCGGGGGGATGGAGAAGAAGAACTCCGAGGAGCTCACCGTCAAAGCCATGGCATCCAATTCAAAACCTAGCAAAAGCAAAGCCTCCGAAAGCAGAGAAGAAGGAGAAGTATCTTCATCCGACAACGATACACAAACACACGACGTACATCCTGTTTGTTCTACAGTGCCTGCCTCGGTCACATCACCTATCTCATCCATTCTTCCTCCTAAGAATAAATGTAACGAAGGGATCCAGGCTGCTTCTGCTGATGTTTGCACAAGAACATCCATACAAACTACTTCTCAGAAGACGTGTGATACTGCTCAAGTTGTGAATAAAGCTAGTACTCCTTGGGGTGCTTCCAGGGAGGCCAATTCGAATCTTGTGATTAGCTTCTCAGATGACAGTGGCAGTGAATTGGAGGAATGCAGTAAAGTCAGAACTTCAAAATCTCATAGTGATGCTGTCAGGCACTTTAAACCTCCAACTTCAACACTTGATAGATCAAACAGGTTACGGAGTATGACAAGGAACAAAGTAGTGGCAAACAAACTGTCTTTGAGTCAGCCATTTATCCCTTCAATGACCAAGAATCACAGAGCATATTCCACGGGTGCTGGGCCCTCATTGGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGAAACCTACAAAGCCAAGGGCGTGGGAATGATCAAGGAATGAACTTAAACACTAGTAAGCTGCAGGACTTGCGGGAGCAGATAGCCATTTGCGAAAGCAAACTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGCATTTCAATTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAAGGGAATGCTACTATCTCTCAAGTTTCTTCACTGGGGCCTAAGGAACCAGATGCAAAGCGCCTGAAAACCAGTGGGTCTTATTCTACCAAGCTGAGTTTGAGTGGGCAACAACATCTTCGTGCAACGTATGCTGGAAAATCTGTTTTTCGGCCACAGGAGCCTGGAGAAGAGACACAGAACATTAAGGTCACTTACAACCAGAAAGGGAATTCTTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAACATGTGGCTGCTTCACCTTCACCTGGAATTGACCTTGGCAAAGTACAGGATGACACCGACATTGTTGCTAATGGAAATCAATCAGACTGGATCAGTAAGCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACGAGTCTTCCAAATGTCACATCCAATGTCCAAACTCAGTTCGATAATGTTGAGTTTCACCGTCAAAGTGATGGCCTTCAACCATCTGCATCAACTGCAAAATTATTTGAAGGAACACTTCCTCAATCAGCATCCAATGTCAAGATACCAGAGCCATGCAGTAATTTTTTTAAGTCATTGATAAACAGTAAAAGCTCCGGGTCTGCTTTTGGTAATTCATCAAGTTGCTTGGGCTTCAGCAATCTTGATCTCCAATCATTATTTGAAATGGAGGAGTCTCTAGACAAGGATCTGGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTCTTGATCTTTATCACAAGAGAGAATTATTTTCAGCTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTTAATTAGTTCCTCGAGACAGCAAGAAGACATGAAAATTGGTGTGGATCACTTAAATAGTATGTCTGGAAATGCAAATAGAGCTTCTTCTTTGTATCAGAAGCATTTTGAATATAATAGTTCTACTAAGCTACATAATGATTTAAATATGCAACATGAAAATGCTGGTCCCATCAATACTTCAAACCTGCATGAGAATGGACAAAATTTGGGGTCTGAACCTGGATCCTGCTCTGCCTTATGTGGTAATACATTGGATCCATTGCCTTCCAAAGGCAATAATATTGCAGATAGAATTTGCTCTCCATCCTTTGATCCAAACGTTTCAGTGGATGGAGATGAAGAGTCATTGCCTTCTGACCATGAAATGATCGATTCCTATGATGAATGCTACATAGGAAGAAAACAGTTTGAAGATGATCAATTGGAAGCATATAATATGTCAAAGAAAAACCACAGTGACAATAATATTGAGGATTCTTTGCGTCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGACTTGTAATCCATGCCATAACATTCAAACGTCAGTCGAACAGGGGACTGAGAATGATGCCAGAGACGATAGCACTCAGCAAAATAATACAGAACCTACAGTAGGTCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGACTGAGATTGCTTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGCACAAACATATGCAAAACCCCAGATGACATCCATGGTCGTTGTCATTTTGAGAACTTGCCATCAGAAGCTCAGGATTCTGCGGACTCTGATGAAAATGAACGATTCAATAGAGAAGGATCTTGCTCCAAAACTACTTTTAGTTTTACACCTTTGACTATGAACAGTGTTCTGCAACATATAAAGGCCATATCCTCAGTTAGTATAGAAGTCTTGCTCACTAGAACTCGAGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGCAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGACTGTCAGACCTATGTTTCAGAGTGATGGCTCTTATATTGATGATCTTGCGATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGCCCTTGGCAACATGTGAAGGACTACTCTTTTGCCAATAGAAGGCAGTGTCAGCATGGCCACATCAACTATTCTGATTCTTGCAATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGTATGAAGATGGCATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCTGATTCACATTCATATGACCCTGTTTTAACTCAGAAAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGCTATTTTTACATGATGCTCGTATAGTGGCCAATGGAAATTGGAATAGACCATCATCATACTTTCAGAGGGGAAGCTCTATATTGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAACCAGGAAACAAACAGTCGAGAGGGCATGAAAAAGGCTCTTCCTGTACTATCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGACCATTTACCTTCTAATATTCTATAGCTATACTACAACCGGGGGGAAGGATGACATGTTCTCTTTTGCGGTCAAGCACAATGGGCAATCTTATGAACTCTGGCTCATGTACATTAACAGCCGCATGAATCTCGATGCTCGATTGGCTGCATATGATGCTGCACTTTCTGCACTCTGCGACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAACGTGGAGAAGGGTATTCAGAGGATTTTTGGACTTCTTCGAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCGCATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTGTGTTGTGTATTTAGTTATTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCCATTCAATTGACAGATGGTGAGAGGCTGAGGGCTTCTAGGGTGGTCAAGAAAGCAGTCGATTTTGTTGATTCATGCCTGAACAATGAATCACTTGAAAGTAAATGCTACCAAAAATCTATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGACATAGGATTCAGTAGGAACTTGTTGGATAAGTATGTTAAACTTTATCCATCTTGCCTAGAACTTCTTTTACTTAAAGTACGGGCAAAGAAACATGGTTTTGGGGATGAAACTGTCGTGGCATTTGAACAAGCGATCAGGAACTGGCCGAAAGAAGTACCTGGTGTCCAATGCATCTGGAATCAATATGCTGAATATTTACTTCAGAATGGGAGAATCAAATGTACTGAAGAACTAATGGTGCGCTGGTTTGAGTCTACTTCAAAAATGGATTGTTCTAAAACTAGAACAGTGGATAATAGTGACTGTGACTCCTTGCACTTGCGAGAGTATGCTTCAGGATCAATTCTACATGCATTAGATTGCAGTCCCAATGAGGTGGACGTGGTGTTTTGGTATCTTAATCTTTCTGTTCACAAGTTACTGCTTAATGACCAATTAGAAGCACGTTTGGCCTTTGACAATGCTCTGAGGGCTGCAGGTTCTGGGACTTTTAGATATTGCATGAGAGAGTATGCTATGTTTTTGCTTACAGACGAATCCTTACTGAATGAGGCTGCTTCTGTTGGTGGAATAAGGAGCATTTTAGAGGGTTATCTCAACGATGCCCGAGCTTTCCCTGTCCCTAAACCATTATCCAGAAAATTCATTAACGATATCAAGAAGCCAAGAGTTCAACTTCTTGTCAGTAACATGCTGTCTCCACTTTCTCTGGATGTTTCTCTAGTGAACTGTATTCTTGAAGTCTGGTATGGGCCATCTCTTTTACCCCAAAAATTTAACAAACCAAGGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATGTTGCCTTCTAATTATCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAATGGTGACAACTCTTCCCAAGTTGCCTCCCCCAGTCTTATTTTCTGGGCCTGCTCAAATTTGATCAGTGCAATCTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGTCAATGTCAAAGGTTTTGAAGCCATATATGAGAGGTTTCACAAGAGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATACTACAACATATGTAAAACTAGAGGAGATACGAGTGCTGTTCTGCGAGAAGTAAATGAAAGGGGAATCGAACTCAACGAGCCTTCTTTGTGATAGAGTTTTACCTTCTTTTGGTAGGATCAGTAAAATTACAGGAAAACTAGGATCAGTTTTTAGTTATGGGAGAAGCCTCATTGTACAGATAGTCTACTTCATACCGATTTATCATGCTAGGCACGGTGGTTTCGCCAGTTTTGTTAAGTGGTAGAGAGCTAATTGCGGCTAGGAATTAGATATAAATACTTGCACAGAGAAAAATTTGGTGGAAAAAGGTTTTCTTGTCTGTAAAGTCATTGAGACCTTATTGGAACTGGATCTGTTGAGGATATCATCAGGTTTCTCAGCCAGTTGCCCACTGACCATTGTTCGGCCGTTCATAAAGGTGTATCGTCGAACGTCAGGTTTTCCCCATCTTTTTTTTGGTAGCCCACACAGTTTACTTTTGTTCTTCATCCTTCCCCGCTTGCTAGAATTAATTTCTTTAATTGACCTTACCCTTTTAAATGTAAGTACCAACCATTTTTAGGTTTAAATTTGATATGGTTTAAATGAAATACAATACTCGATGCTCATCGATGTAGGTTTCATTTCAATTTTCCCACTATCTACGAGCATGTTTTGTTTGTTAAAGCTCTCAGATCAAACACAGGTTTGTC

Coding sequence (CDS)

ATGGAGAAGAAGAACTCCGAGGAGCTCACCGTCAAAGCCATGGCATCCAATTCAAAACCTAGCAAAAGCAAAGCCTCCGAAAGCAGAGAAGAAGGAGAAGTATCTTCATCCGACAACGATACACAAACACACGACGTACATCCTGTTTGTTCTACAGTGCCTGCCTCGGTCACATCACCTATCTCATCCATTCTTCCTCCTAAGAATAAATGTAACGAAGGGATCCAGGCTGCTTCTGCTGATGTTTGCACAAGAACATCCATACAAACTACTTCTCAGAAGACGTGTGATACTGCTCAAGTTGTGAATAAAGCTAGTACTCCTTGGGGTGCTTCCAGGGAGGCCAATTCGAATCTTGTGATTAGCTTCTCAGATGACAGTGGCAGTGAATTGGAGGAATGCAGTAAAGTCAGAACTTCAAAATCTCATAGTGATGCTGTCAGGCACTTTAAACCTCCAACTTCAACACTTGATAGATCAAACAGGTTACGGAGTATGACAAGGAACAAAGTAGTGGCAAACAAACTGTCTTTGAGTCAGCCATTTATCCCTTCAATGACCAAGAATCACAGAGCATATTCCACGGGTGCTGGGCCCTCATTGGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGAAACCTACAAAGCCAAGGGCGTGGGAATGATCAAGGAATGAACTTAAACACTAGTAAGCTGCAGGACTTGCGGGAGCAGATAGCCATTTGCGAAAGCAAACTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGCATTTCAATTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAAGGGAATGCTACTATCTCTCAAGTTTCTTCACTGGGGCCTAAGGAACCAGATGCAAAGCGCCTGAAAACCAGTGGGTCTTATTCTACCAAGCTGAGTTTGAGTGGGCAACAACATCTTCGTGCAACGTATGCTGGAAAATCTGTTTTTCGGCCACAGGAGCCTGGAGAAGAGACACAGAACATTAAGGTCACTTACAACCAGAAAGGGAATTCTTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAACATGTGGCTGCTTCACCTTCACCTGGAATTGACCTTGGCAAAGTACAGGATGACACCGACATTGTTGCTAATGGAAATCAATCAGACTGGATCAGTAAGCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACGAGTCTTCCAAATGTCACATCCAATGTCCAAACTCAGTTCGATAATGTTGAGTTTCACCGTCAAAGTGATGGCCTTCAACCATCTGCATCAACTGCAAAATTATTTGAAGGAACACTTCCTCAATCAGCATCCAATGTCAAGATACCAGAGCCATGCAGTAATTTTTTTAAGTCATTGATAAACAGTAAAAGCTCCGGGTCTGCTTTTGGTAATTCATCAAGTTGCTTGGGCTTCAGCAATCTTGATCTCCAATCATTATTTGAAATGGAGGAGTCTCTAGACAAGGATCTGGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTCTTGATCTTTATCACAAGAGAGAATTATTTTCAGCTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTTAATTAGTTCCTCGAGACAGCAAGAAGACATGAAAATTGGTGTGGATCACTTAAATAGTATGTCTGGAAATGCAAATAGAGCTTCTTCTTTGTATCAGAAGCATTTTGAATATAATAGTTCTACTAAGCTACATAATGATTTAAATATGCAACATGAAAATGCTGGTCCCATCAATACTTCAAACCTGCATGAGAATGGACAAAATTTGGGGTCTGAACCTGGATCCTGCTCTGCCTTATGTGGTAATACATTGGATCCATTGCCTTCCAAAGGCAATAATATTGCAGATAGAATTTGCTCTCCATCCTTTGATCCAAACGTTTCAGTGGATGGAGATGAAGAGTCATTGCCTTCTGACCATGAAATGATCGATTCCTATGATGAATGCTACATAGGAAGAAAACAGTTTGAAGATGATCAATTGGAAGCATATAATATGTCAAAGAAAAACCACAGTGACAATAATATTGAGGATTCTTTGCGTCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGACTTGTAATCCATGCCATAACATTCAAACGTCAGTCGAACAGGGGACTGAGAATGATGCCAGAGACGATAGCACTCAGCAAAATAATACAGAACCTACAGTAGGTCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGACTGAGATTGCTTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGCACAAACATATGCAAAACCCCAGATGACATCCATGGTCGTTGTCATTTTGAGAACTTGCCATCAGAAGCTCAGGATTCTGCGGACTCTGATGAAAATGAACGATTCAATAGAGAAGGATCTTGCTCCAAAACTACTTTTAGTTTTACACCTTTGACTATGAACAGTGTTCTGCAACATATAAAGGCCATATCCTCAGTTAGTATAGAAGTCTTGCTCACTAGAACTCGAGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGCAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGACTGTCAGACCTATGTTTCAGAGTGATGGCTCTTATATTGATGATCTTGCGATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGCCCTTGGCAACATGTGAAGGACTACTCTTTTGCCAATAGAAGGCAGTGTCAGCATGGCCACATCAACTATTCTGATTCTTGCAATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGTATGAAGATGGCATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCTGATTCACATTCATATGACCCTGTTTTAACTCAGAAAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGCTATTTTTACATGATGCTCGTATAGTGGCCAATGGAAATTGGAATAGACCATCATCATACTTTCAGAGGGGAAGCTCTATATTGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAACCAGGAAACAAACAGTCGAGAGGGCATGAAAAAGGCTCTTCCTGTACTATCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGACCATTTACCTTCTAATATTCTATAGCTATACTACAACCGGGGGGAAGGATGACATGTTCTCTTTTGCGGTCAAGCACAATGGGCAATCTTATGAACTCTGGCTCATGTACATTAACAGCCGCATGAATCTCGATGCTCGATTGGCTGCATATGATGCTGCACTTTCTGCACTCTGCGACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAACGTGGAGAAGGGTATTCAGAGGATTTTTGGACTTCTTCGAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCGCATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTGTGTTGTGTATTTAGTTATTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCCATTCAATTGACAGATGGTGAGAGGCTGAGGGCTTCTAGGGTGGTCAAGAAAGCAGTCGATTTTGTTGATTCATGCCTGAACAATGAATCACTTGAAAGTAAATGCTACCAAAAATCTATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGACATAGGATTCAGTAGGAACTTGTTGGATAAGTATGTTAAACTTTATCCATCTTGCCTAGAACTTCTTTTACTTAAAGTACGGGCAAAGAAACATGGTTTTGGGGATGAAACTGTCGTGGCATTTGAACAAGCGATCAGGAACTGGCCGAAAGAAGTACCTGGTGTCCAATGCATCTGGAATCAATATGCTGAATATTTACTTCAGAATGGGAGAATCAAATGTACTGAAGAACTAATGGTGCGCTGGTTTGAGTCTACTTCAAAAATGGATTGTTCTAAAACTAGAACAGTGGATAATAGTGACTGTGACTCCTTGCACTTGCGAGAGTATGCTTCAGGATCAATTCTACATGCATTAGATTGCAGTCCCAATGAGGTGGACGTGGTGTTTTGGTATCTTAATCTTTCTGTTCACAAGTTACTGCTTAATGACCAATTAGAAGCACGTTTGGCCTTTGACAATGCTCTGAGGGCTGCAGGTTCTGGGACTTTTAGATATTGCATGAGAGAGTATGCTATGTTTTTGCTTACAGACGAATCCTTACTGAATGAGGCTGCTTCTGTTGGTGGAATAAGGAGCATTTTAGAGGGTTATCTCAACGATGCCCGAGCTTTCCCTGTCCCTAAACCATTATCCAGAAAATTCATTAACGATATCAAGAAGCCAAGAGTTCAACTTCTTGTCAGTAACATGCTGTCTCCACTTTCTCTGGATGTTTCTCTAGTGAACTGTATTCTTGAAGTCTGGTATGGGCCATCTCTTTTACCCCAAAAATTTAACAAACCAAGGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATGTTGCCTTCTAATTATCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAATGGTGACAACTCTTCCCAAGTTGCCTCCCCCAGTCTTATTTTCTGGGCCTGCTCAAATTTGATCAGTGCAATCTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGTCAATGTCAAAGGTTTTGAAGCCATATATGAGAGGTTTCACAAGAGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATACTACAACATATGTAAAACTAGAGGAGATACGAGTGCTGTTCTGCGAGAAGTAAATGAAAGGGGAATCGAACTCAACGAGCCTTCTTTGTGA

Protein sequence

MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQPFIPSMTKNHRAYSTGAGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQIAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLKTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTSNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGSAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLYQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKGNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNHSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNTEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLIFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTSAVLREVNERGIELNEPSL
Homology
BLAST of ClCG09G020050 vs. NCBI nr
Match: XP_038890115.1 (uncharacterized protein LOC120079791 isoform X3 [Benincasa hispida])

HSP 1 Score: 3206.4 bits (8312), Expect = 0.0e+00
Identity = 1620/1757 (92.20%), Postives = 1667/1757 (94.88%), Query Frame = 0

Query: 1    MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
            MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1    MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60

Query: 61   ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
            ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61   ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120

Query: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
            ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180

Query: 181  PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
            PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181  PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240

Query: 241  IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
            IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q   L PKEPD KRL
Sbjct: 241  IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300

Query: 301  KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
            KTSGSYSTKLSLSGQQHLR  YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301  KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360

Query: 361  QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
            QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQVDPHPLVVLD AT LPN+T
Sbjct: 361  QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMT 420

Query: 421  SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
            SNVQTQFDNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421  SNVQTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480

Query: 481  SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
            +AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481  TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540

Query: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
            EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600

Query: 601  YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
            YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP  CS L GN LDPLPSK
Sbjct: 601  YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660

Query: 661  GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
            GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661  GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720

Query: 721  HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
              DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721  QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780

Query: 781  TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEA 840
            TEPTVGLAVGSD DL SKKTE  LLSGKGDQQFGFGG N C TPDDIHGR HFENLPSE 
Sbjct: 781  TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGPNRCNTPDDIHGRYHFENLPSET 840

Query: 841  QDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLG 900
            QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEVLL RTRGSLSNLG
Sbjct: 841  QDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEVLLARTRGSLSNLG 900

Query: 901  FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 960
            FPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP WPLCMYELRGKCNN
Sbjct: 901  FPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLWPLCMYELRGKCNN 960

Query: 961  DECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDI 1020
            DECPWQHVKDYS ANRRQCQH HINYSDSCNGLSFSSDETK+FKYED MTPPTYLVGIDI
Sbjct: 961  DECPWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYEDCMTPPTYLVGIDI 1020

Query: 1021 LKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPS 1080
            LKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHDARI A G+WNRPS
Sbjct: 1021 LKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPS 1080

Query: 1081 SYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1140
            SYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLSRAVENNPKS+ALW
Sbjct: 1081 SYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLSRAVENNPKSVALW 1140

Query: 1141 TIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1200
            TIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD
Sbjct: 1141 TIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1200

Query: 1201 NIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSD 1260
            NI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVAMDSDEPYSFTHSD
Sbjct: 1201 NIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVAMDSDEPYSFTHSD 1260

Query: 1261 MLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASR 1320
            ML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPAIQLTDGE+LRASR
Sbjct: 1261 MLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPAIQLTDGEKLRASR 1320

Query: 1321 VVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1380
            VVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS
Sbjct: 1321 VVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1380

Query: 1381 CLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEE 1440
            CLEL+LLKVRAKK  FGDETVVAFEQAI NWPKEVPG+QCIWNQYAEYLLQNGRIKCTEE
Sbjct: 1381 CLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAEYLLQNGRIKCTEE 1440

Query: 1441 LMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSV 1500
            LMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPNEVDVVFWYLNLSV
Sbjct: 1441 LMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPNEVDVVFWYLNLSV 1500

Query: 1501 HKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEG 1560
            HKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNEAASVGGIR+ILEG
Sbjct: 1501 HKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNEAASVGGIRNILEG 1560

Query: 1561 YLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQ 1620
            YLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNCILEVWYGPSLLPQ
Sbjct: 1561 YLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNCILEVWYGPSLLPQ 1620

Query: 1621 KFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLIFWACSNLISAIF 1680
            KFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SLIFWACSNLISAIF
Sbjct: 1621 KFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASLIFWACSNLISAIF 1680

Query: 1681 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1740
            SSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW SYYN+CKTRGDTS
Sbjct: 1681 SSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWTSYYNMCKTRGDTS 1740

Query: 1741 AVLREVNERGIELNEPS 1757
            AVLREVNERGIELNEPS
Sbjct: 1741 AVLREVNERGIELNEPS 1757

BLAST of ClCG09G020050 vs. NCBI nr
Match: XP_038890113.1 (uncharacterized protein LOC120079791 isoform X1 [Benincasa hispida])

HSP 1 Score: 3197.1 bits (8288), Expect = 0.0e+00
Identity = 1620/1770 (91.53%), Postives = 1667/1770 (94.18%), Query Frame = 0

Query: 1    MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
            MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1    MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60

Query: 61   ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
            ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61   ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120

Query: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
            ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180

Query: 181  PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
            PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181  PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240

Query: 241  IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
            IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q   L PKEPD KRL
Sbjct: 241  IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300

Query: 301  KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
            KTSGSYSTKLSLSGQQHLR  YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301  KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360

Query: 361  QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
            QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQVDPHPLVVLD AT LPN+T
Sbjct: 361  QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMT 420

Query: 421  SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
            SNVQTQFDNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421  SNVQTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480

Query: 481  SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
            +AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481  TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540

Query: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
            EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600

Query: 601  YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
            YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP  CS L GN LDPLPSK
Sbjct: 601  YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660

Query: 661  GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
            GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661  GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720

Query: 721  HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
              DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721  QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780

Query: 781  TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFG-------------GTNICKTPDDI 840
            TEPTVGLAVGSD DL SKKTE  LLSGKGDQQFGFG             G N C TPDDI
Sbjct: 781  TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGDVGWDSLCLQPVGPNRCNTPDDI 840

Query: 841  HGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEV 900
            HGR HFENLPSE QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEV
Sbjct: 841  HGRYHFENLPSETQDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEV 900

Query: 901  LLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSW 960
            LL RTRGSLSNLGFPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP W
Sbjct: 901  LLARTRGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLW 960

Query: 961  PLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYED 1020
            PLCMYELRGKCNNDECPWQHVKDYS ANRRQCQH HINYSDSCNGLSFSSDETK+FKYED
Sbjct: 961  PLCMYELRGKCNNDECPWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYED 1020

Query: 1021 GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHD 1080
             MTPPTYLVGIDILKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHD
Sbjct: 1021 CMTPPTYLVGIDILKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHD 1080

Query: 1081 ARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLS 1140
            ARI A G+WNRPSSYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLS
Sbjct: 1081 ARIEAKGSWNRPSSYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLS 1140

Query: 1141 RAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDAR 1200
            RAVENNPKS+ALWTIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDAR
Sbjct: 1141 RAVENNPKSVALWTIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDAR 1200

Query: 1201 LAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVA 1260
            LAAYDAALSALCDNI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVA
Sbjct: 1201 LAAYDAALSALCDNIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVA 1260

Query: 1261 MDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPA 1320
            MDSDEPYSFTHSDML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPA
Sbjct: 1261 MDSDEPYSFTHSDMLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPA 1320

Query: 1321 IQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
            IQLTDGE+LRASRVVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFS
Sbjct: 1321 IQLTDGEKLRASRVVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380

Query: 1381 RNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAE 1440
            RNLLDKYVKLYPSCLEL+LLKVRAKK  FGDETVVAFEQAI NWPKEVPG+QCIWNQYAE
Sbjct: 1381 RNLLDKYVKLYPSCLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAE 1440

Query: 1441 YLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPN 1500
            YLLQNGRIKCTEELMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPN
Sbjct: 1441 YLLQNGRIKCTEELMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPN 1500

Query: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNE 1560
            EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNE
Sbjct: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNE 1560

Query: 1561 AASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNC 1620
            AASVGGIR+ILEGYLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNC
Sbjct: 1561 AASVGGIRNILEGYLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNC 1620

Query: 1621 ILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSL 1680
            ILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SL
Sbjct: 1621 ILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASL 1680

Query: 1681 IFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWK 1740
            IFWACSNLISAIFSSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW 
Sbjct: 1681 IFWACSNLISAIFSSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWT 1740

Query: 1741 SYYNICKTRGDTSAVLREVNERGIELNEPS 1757
            SYYN+CKTRGDTSAVLREVNERGIELNEPS
Sbjct: 1741 SYYNMCKTRGDTSAVLREVNERGIELNEPS 1770

BLAST of ClCG09G020050 vs. NCBI nr
Match: XP_038890114.1 (uncharacterized protein LOC120079791 isoform X2 [Benincasa hispida])

HSP 1 Score: 3169.0 bits (8215), Expect = 0.0e+00
Identity = 1610/1770 (90.96%), Postives = 1657/1770 (93.62%), Query Frame = 0

Query: 1    MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
            MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1    MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60

Query: 61   ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
            ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61   ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120

Query: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
            ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180

Query: 181  PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
            PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181  PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240

Query: 241  IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
            IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q   L PKEPD KRL
Sbjct: 241  IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300

Query: 301  KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
            KTSGSYSTKLSLSGQQHLR  YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301  KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360

Query: 361  QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
            QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQVDPHPLVVLD AT LPN+T
Sbjct: 361  QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMT 420

Query: 421  SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
            SNVQTQFDNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421  SNVQTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480

Query: 481  SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
            +AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481  TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540

Query: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
            EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600

Query: 601  YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
            YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP  CS L GN LDPLPSK
Sbjct: 601  YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660

Query: 661  GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
            GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661  GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720

Query: 721  HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
              DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721  QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780

Query: 781  TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFG-------------GTNICKTPDDI 840
            TEPTVGLAVGSD DL SKKTE  LLSGKGDQQFGFG             G N C TPDDI
Sbjct: 781  TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGDVGWDSLCLQPVGPNRCNTPDDI 840

Query: 841  HGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEV 900
            HGR HFENLPSE QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEV
Sbjct: 841  HGRYHFENLPSETQDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEV 900

Query: 901  LLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSW 960
            LL RTRGSLSNLGFPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP W
Sbjct: 901  LLARTRGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLW 960

Query: 961  PLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYED 1020
            PLCMYELRGKCNNDECPWQHVKDYS ANRRQCQH HINY          SDETK+FKYED
Sbjct: 961  PLCMYELRGKCNNDECPWQHVKDYSLANRRQCQHDHINY----------SDETKIFKYED 1020

Query: 1021 GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHD 1080
             MTPPTYLVGIDILKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHD
Sbjct: 1021 CMTPPTYLVGIDILKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHD 1080

Query: 1081 ARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLS 1140
            ARI A G+WNRPSSYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLS
Sbjct: 1081 ARIEAKGSWNRPSSYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLS 1140

Query: 1141 RAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDAR 1200
            RAVENNPKS+ALWTIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDAR
Sbjct: 1141 RAVENNPKSVALWTIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDAR 1200

Query: 1201 LAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVA 1260
            LAAYDAALSALCDNI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVA
Sbjct: 1201 LAAYDAALSALCDNIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVA 1260

Query: 1261 MDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPA 1320
            MDSDEPYSFTHSDML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPA
Sbjct: 1261 MDSDEPYSFTHSDMLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPA 1320

Query: 1321 IQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
            IQLTDGE+LRASRVVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFS
Sbjct: 1321 IQLTDGEKLRASRVVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380

Query: 1381 RNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAE 1440
            RNLLDKYVKLYPSCLEL+LLKVRAKK  FGDETVVAFEQAI NWPKEVPG+QCIWNQYAE
Sbjct: 1381 RNLLDKYVKLYPSCLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAE 1440

Query: 1441 YLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPN 1500
            YLLQNGRIKCTEELMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPN
Sbjct: 1441 YLLQNGRIKCTEELMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPN 1500

Query: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNE 1560
            EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNE
Sbjct: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNE 1560

Query: 1561 AASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNC 1620
            AASVGGIR+ILEGYLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNC
Sbjct: 1561 AASVGGIRNILEGYLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNC 1620

Query: 1621 ILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSL 1680
            ILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SL
Sbjct: 1621 ILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASL 1680

Query: 1681 IFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWK 1740
            IFWACSNLISAIFSSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW 
Sbjct: 1681 IFWACSNLISAIFSSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWT 1740

Query: 1741 SYYNICKTRGDTSAVLREVNERGIELNEPS 1757
            SYYN+CKTRGDTSAVLREVNERGIELNEPS
Sbjct: 1741 SYYNMCKTRGDTSAVLREVNERGIELNEPS 1760

BLAST of ClCG09G020050 vs. NCBI nr
Match: XP_038890116.1 (uncharacterized protein LOC120079791 isoform X4 [Benincasa hispida])

HSP 1 Score: 3138.2 bits (8135), Expect = 0.0e+00
Identity = 1597/1770 (90.23%), Postives = 1643/1770 (92.82%), Query Frame = 0

Query: 1    MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
            MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1    MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60

Query: 61   ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
            ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61   ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120

Query: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
            ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180

Query: 181  PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
            PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181  PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240

Query: 241  IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
            IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q   L PKEPD KRL
Sbjct: 241  IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300

Query: 301  KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
            KTSGSYSTKLSLSGQQHLR  YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301  KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360

Query: 361  QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
            QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQ                   
Sbjct: 361  QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQ------------------- 420

Query: 421  SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
                   DNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421  -------DNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480

Query: 481  SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
            +AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481  TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540

Query: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
            EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600

Query: 601  YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
            YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP  CS L GN LDPLPSK
Sbjct: 601  YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660

Query: 661  GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
            GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661  GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720

Query: 721  HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
              DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721  QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780

Query: 781  TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFG-------------GTNICKTPDDI 840
            TEPTVGLAVGSD DL SKKTE  LLSGKGDQQFGFG             G N C TPDDI
Sbjct: 781  TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGDVGWDSLCLQPVGPNRCNTPDDI 840

Query: 841  HGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEV 900
            HGR HFENLPSE QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEV
Sbjct: 841  HGRYHFENLPSETQDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEV 900

Query: 901  LLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSW 960
            LL RTRGSLSNLGFPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP W
Sbjct: 901  LLARTRGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLW 960

Query: 961  PLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYED 1020
            PLCMYELRGKCNNDECPWQHVKDYS ANRRQCQH HINYSDSCNGLSFSSDETK+FKYED
Sbjct: 961  PLCMYELRGKCNNDECPWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYED 1020

Query: 1021 GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHD 1080
             MTPPTYLVGIDILKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHD
Sbjct: 1021 CMTPPTYLVGIDILKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHD 1080

Query: 1081 ARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLS 1140
            ARI A G+WNRPSSYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLS
Sbjct: 1081 ARIEAKGSWNRPSSYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLS 1140

Query: 1141 RAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDAR 1200
            RAVENNPKS+ALWTIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDAR
Sbjct: 1141 RAVENNPKSVALWTIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDAR 1200

Query: 1201 LAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVA 1260
            LAAYDAALSALCDNI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVA
Sbjct: 1201 LAAYDAALSALCDNIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVA 1260

Query: 1261 MDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPA 1320
            MDSDEPYSFTHSDML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPA
Sbjct: 1261 MDSDEPYSFTHSDMLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPA 1320

Query: 1321 IQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
            IQLTDGE+LRASRVVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFS
Sbjct: 1321 IQLTDGEKLRASRVVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380

Query: 1381 RNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAE 1440
            RNLLDKYVKLYPSCLEL+LLKVRAKK  FGDETVVAFEQAI NWPKEVPG+QCIWNQYAE
Sbjct: 1381 RNLLDKYVKLYPSCLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAE 1440

Query: 1441 YLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPN 1500
            YLLQNGRIKCTEELMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPN
Sbjct: 1441 YLLQNGRIKCTEELMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPN 1500

Query: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNE 1560
            EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNE
Sbjct: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNE 1560

Query: 1561 AASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNC 1620
            AASVGGIR+ILEGYLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNC
Sbjct: 1561 AASVGGIRNILEGYLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNC 1620

Query: 1621 ILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSL 1680
            ILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SL
Sbjct: 1621 ILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASL 1680

Query: 1681 IFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWK 1740
            IFWACSNLISAIFSSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW 
Sbjct: 1681 IFWACSNLISAIFSSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWT 1740

Query: 1741 SYYNICKTRGDTSAVLREVNERGIELNEPS 1757
            SYYN+CKTRGDTSAVLREVNERGIELNEPS
Sbjct: 1741 SYYNMCKTRGDTSAVLREVNERGIELNEPS 1744

BLAST of ClCG09G020050 vs. NCBI nr
Match: XP_011655356.2 (uncharacterized protein LOC101211906 [Cucumis sativus] >KGN51732.2 hypothetical protein Csa_009223 [Cucumis sativus])

HSP 1 Score: 2996.8 bits (7768), Expect = 0.0e+00
Identity = 1530/1757 (87.08%), Postives = 1619/1757 (92.15%), Query Frame = 0

Query: 2    EKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPI 61
            ++K+S+ELT+K+M SNSKP+K KAS+ +EEGEVSSSDNDTQTHDVHPVCSTVPAS+ S I
Sbjct: 15   KEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRI 74

Query: 62   SSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVI 121
            SSILPPKNKCN GI+ ASADVCTRTSI T SQK  D AQ+VNKASTPW ASR+ANSNLVI
Sbjct: 75   SSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVI 134

Query: 122  SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQP 181
            SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSN+LRSMTRNKVV NKL LSQ 
Sbjct: 135  SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQA 194

Query: 182  FIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQI 241
            FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQGMN+NTSKLQDLR+QI
Sbjct: 195  FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQI 254

Query: 242  AICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLK 301
            AI ESKLKLKSAQQNKE + +TNQDYIVTNSKSDLGRKGNATISQ   LGPK+ +AKR+K
Sbjct: 255  AIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMK 314

Query: 302  TSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQ 361
            TSGSYS+KLSL+GQQ LR+  A K ++ PQEPGEETQNIK +YNQKG SLSREESSVLKQ
Sbjct: 315  TSGSYSSKLSLNGQQ-LRSLIAAKFIW-PQEPGEETQNIKGSYNQKGKSLSREESSVLKQ 374

Query: 362  SKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTS 421
            SKEDIKHVAASPS GIDLGKVQDDTDIVANGNQSD+I  QVDPHPLVVLDQAT+LPNV S
Sbjct: 375  SKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVAS 434

Query: 422  NVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGS 481
            NVQ+QFDNVEFHRQSDGLQPSASTAK FE T PQSASNVK PEPCSNFFKSLINSK+SG+
Sbjct: 435  NVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGT 494

Query: 482  AFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 541
            AFGN SSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE
Sbjct: 495  AFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 554

Query: 542  ANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLY 601
            ANSRC++LYHKRELFS HFHSFCMNNPG +SSSRQQEDM I VDHLNSMSG+AN AS LY
Sbjct: 555  ANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLY 614

Query: 602  QKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKG 661
            QKH EYNSST+LHNDLNMQ ENAG INTSNLHENGQ+LGSEPGSCS L GNTLDPLP KG
Sbjct: 615  QKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKG 674

Query: 662  NNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNH 721
            NNIADRI SPS DPNVS+DGDEES PSDHEMIDSY+ECY+ +K FE+DQ+EAYN SK NH
Sbjct: 675  NNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNH 734

Query: 722  SDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNT 781
             DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+N+QTSVEQGTENDARDD TQQNNT
Sbjct: 735  CDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNT 794

Query: 782  EPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEAQ 841
            E TV LAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTPD+IHGR HFENLPSEA 
Sbjct: 795  ELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAP 854

Query: 842  DSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLGF 901
            D  DSD+NE F+REGSCSKTT SFTPLTMNSVLQH+K ISSVSIEVLLTRT GSLSNLGF
Sbjct: 855  DLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGF 914

Query: 902  PEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNND 961
            PEDGDSL+VDQIHWRKLKENSVHE  RPM QSDGSY DDLAIDPSWPLCMYELRGKCNND
Sbjct: 915  PEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNND 974

Query: 962  ECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDIL 1021
            ECPWQH+KD+SFANR QCQHGHIN          SSDETKVFK ED MTPPTYLVGIDIL
Sbjct: 975  ECPWQHMKDFSFANRSQCQHGHIN----------SSDETKVFKNEDQMTPPTYLVGIDIL 1034

Query: 1022 KADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPSS 1081
            KADS SY  VL Q+SSQCWQSFFSISLTLPNLLQKDASADGLFLHDARI A G+WNRPSS
Sbjct: 1035 KADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSS 1094

Query: 1082 YFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWT 1141
            YFQRG S+LSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKSIALW 
Sbjct: 1095 YFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWA 1154

Query: 1142 IYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDN 1201
            +YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALC N
Sbjct: 1155 VYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHN 1214

Query: 1202 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDM 1261
            IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEK IQRIFGLL+VAMDSDEPYSFTHSDM
Sbjct: 1215 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDM 1274

Query: 1262 LACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASRV 1321
            L CLNISDKCIFWV VVYLV+YRKLPHAIVQQLECEKELIEIEWPA+ LT+GE+LRASRV
Sbjct: 1275 LTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRV 1334

Query: 1322 VKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSC 1381
            VKKAVDFVDSCLNNESL+SKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPSC
Sbjct: 1335 VKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSC 1394

Query: 1382 LELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEEL 1441
             ELLLL +RA+KH FGD TV+AFE+ IR WPKEVPGVQCIWNQYAEYLL+NGRIKCTEEL
Sbjct: 1395 PELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEEL 1454

Query: 1442 MVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSVH 1501
            M R F+STSKMDCSKTRT  NSDCDSLHL ++ASGSI+ ALDCSPNEVDVVFWYLN SVH
Sbjct: 1455 MARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVH 1514

Query: 1502 KLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1561
            KLLLNDQLEARLAF+NALRAA S TFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY
Sbjct: 1515 KLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1574

Query: 1562 LNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQK 1621
            LNDARAFPVP+PLSR+FI DI+KPRV+LLVSNMLSP+S DVSLVNCILEVWYGPSLLPQK
Sbjct: 1575 LNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQK 1634

Query: 1622 FNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVASPSLIFWACSNLISAIF 1681
            FNKP+ELVDFVETILE+LPSNYQLVLSVCKQLCN DN SSQ ASPSLIFWACSNLI AIF
Sbjct: 1635 FNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIF 1694

Query: 1682 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1741
            SSVPIPPEFIWVEAANIL NVKG EAI ERFHKRALSVYPFSVQLWKSYYNIC+TRGDTS
Sbjct: 1695 SSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTS 1754

Query: 1742 AVLREVNERGIELNEPS 1757
            AVL+EVNERGI+LNEPS
Sbjct: 1755 AVLQEVNERGIQLNEPS 1759

BLAST of ClCG09G020050 vs. ExPASy Swiss-Prot
Match: O60293 (Zinc finger C3H1 domain-containing protein OS=Homo sapiens OX=9606 GN=ZFC3H1 PE=1 SV=3)

HSP 1 Score: 73.6 bits (179), Expect = 2.6e-11
Identity = 99/482 (20.54%), Postives = 179/482 (37.14%), Query Frame = 0

Query: 942  IDPSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKV 1001
            I+P    C ++L G CN+D+C WQH++DY+  +R+Q     ++Y+ S  G + +S   ++
Sbjct: 1179 IEPDQCFCRFDLTGTCNDDDCQWQHIQDYTL-SRKQLFQDILSYNLSLIGCAETSTNEEI 1238

Query: 1002 F----KYED----------GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISL 1061
                 KY +           M     L+  +I ++  H+  P  T K  + W+  F    
Sbjct: 1239 TASAEKYVEKLFGVNKDRMSMDQMAVLLVSNINESKGHT-PPFTTYKDKRKWKPKFWRKP 1298

Query: 1062 TLPNLLQKDASADGLFLHDARIVANGNWNRPS----------SYFQRGSSILSQLK---- 1121
               N    D       +  A        N P+           YF   +  ++ L+    
Sbjct: 1299 ISDNSFSSDEEQSTGPIKYA-FQPENQINVPALDTVVTPDDVRYFTNETDDIANLEASVL 1358

Query: 1122 --QGDENLALETALIIINQ-ETNSREGMKKALPVLSRAVENNPKSIALWTIYLLIFYSYT 1181
                   L L+ A   +NQ E    E +  AL VL+RA+ENN  +  +W  YL +F    
Sbjct: 1359 ENPSHVQLWLKLAYKYLNQNEGECSESLDSALNVLARALENNKDNPEIWCHYLRLFSKRG 1418

Query: 1182 TTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDNIF--THNLDG 1241
            T     +M   AV++       W       ++L++     D     + + +         
Sbjct: 1419 TKDEVQEMCETAVEYAPDYQSFWTF-----LHLESTFEEKDYVCERMLEFLMGAAKQETS 1478

Query: 1242 KYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDMLACLNISD 1301
               S  +L+ +L        +G  +  +  +   L+ A D           +   L  SD
Sbjct: 1479 NILSFQLLEALLFRVQLHIFTGRCQSALAILQNALKSAND---------GIVAEYLKTSD 1538

Query: 1302 KCIFWVCVVYLVIYRKLPHAIVQQLE------CEKELIEIEWPAIQLTDGERLRASRVVK 1361
            +C+ W+  ++L+ +  LP                 E   + W A+Q     +     ++ 
Sbjct: 1539 RCLAWLAYIHLIEFNILPSKFYDPSNDNPSRIVNTESFVMPWQAVQ---DVKTNPDMLLA 1598

Query: 1362 KAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLE 1385
               D V +C +    ES   ++ I+        CL  + ++     LL++Y      C  
Sbjct: 1599 VFEDAVKACTD----ESLAVEERIE-------ACLPLYTNMIALHQLLERYEAAMELCKS 1629

BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match: A0A0A0KS73 (zf-C3H1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G512890 PE=4 SV=1)

HSP 1 Score: 2996.8 bits (7768), Expect = 0.0e+00
Identity = 1530/1757 (87.08%), Postives = 1619/1757 (92.15%), Query Frame = 0

Query: 2    EKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPI 61
            ++K+S+ELT+K+M SNSKP+K KAS+ +EEGEVSSSDNDTQTHDVHPVCSTVPAS+ S I
Sbjct: 7    KEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRI 66

Query: 62   SSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVI 121
            SSILPPKNKCN GI+ ASADVCTRTSI T SQK  D AQ+VNKASTPW ASR+ANSNLVI
Sbjct: 67   SSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVI 126

Query: 122  SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQP 181
            SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSN+LRSMTRNKVV NKL LSQ 
Sbjct: 127  SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQA 186

Query: 182  FIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQI 241
            FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQGMN+NTSKLQDLR+QI
Sbjct: 187  FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQI 246

Query: 242  AICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLK 301
            AI ESKLKLKSAQQNKE + +TNQDYIVTNSKSDLGRKGNATISQ   LGPK+ +AKR+K
Sbjct: 247  AIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMK 306

Query: 302  TSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQ 361
            TSGSYS+KLSL+GQQ LR+  A K ++ PQEPGEETQNIK +YNQKG SLSREESSVLKQ
Sbjct: 307  TSGSYSSKLSLNGQQ-LRSLIAAKFIW-PQEPGEETQNIKGSYNQKGKSLSREESSVLKQ 366

Query: 362  SKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTS 421
            SKEDIKHVAASPS GIDLGKVQDDTDIVANGNQSD+I  QVDPHPLVVLDQAT+LPNV S
Sbjct: 367  SKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVAS 426

Query: 422  NVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGS 481
            NVQ+QFDNVEFHRQSDGLQPSASTAK FE T PQSASNVK PEPCSNFFKSLINSK+SG+
Sbjct: 427  NVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGT 486

Query: 482  AFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 541
            AFGN SSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE
Sbjct: 487  AFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 546

Query: 542  ANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLY 601
            ANSRC++LYHKRELFS HFHSFCMNNPG +SSSRQQEDM I VDHLNSMSG+AN AS LY
Sbjct: 547  ANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLY 606

Query: 602  QKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKG 661
            QKH EYNSST+LHNDLNMQ ENAG INTSNLHENGQ+LGSEPGSCS L GNTLDPLP KG
Sbjct: 607  QKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKG 666

Query: 662  NNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNH 721
            NNIADRI SPS DPNVS+DGDEES PSDHEMIDSY+ECY+ +K FE+DQ+EAYN SK NH
Sbjct: 667  NNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNH 726

Query: 722  SDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNT 781
             DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+N+QTSVEQGTENDARDD TQQNNT
Sbjct: 727  CDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNT 786

Query: 782  EPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEAQ 841
            E TV LAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTPD+IHGR HFENLPSEA 
Sbjct: 787  ELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAP 846

Query: 842  DSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLGF 901
            D  DSD+NE F+REGSCSKTT SFTPLTMNSVLQH+K ISSVSIEVLLTRT GSLSNLGF
Sbjct: 847  DLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGF 906

Query: 902  PEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNND 961
            PEDGDSL+VDQIHWRKLKENSVHE  RPM QSDGSY DDLAIDPSWPLCMYELRGKCNND
Sbjct: 907  PEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNND 966

Query: 962  ECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDIL 1021
            ECPWQH+KD+SFANR QCQHGHIN          SSDETKVFK ED MTPPTYLVGIDIL
Sbjct: 967  ECPWQHMKDFSFANRSQCQHGHIN----------SSDETKVFKNEDQMTPPTYLVGIDIL 1026

Query: 1022 KADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPSS 1081
            KADS SY  VL Q+SSQCWQSFFSISLTLPNLLQKDASADGLFLHDARI A G+WNRPSS
Sbjct: 1027 KADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSS 1086

Query: 1082 YFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWT 1141
            YFQRG S+LSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKSIALW 
Sbjct: 1087 YFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWA 1146

Query: 1142 IYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDN 1201
            +YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALC N
Sbjct: 1147 VYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHN 1206

Query: 1202 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDM 1261
            IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEK IQRIFGLL+VAMDSDEPYSFTHSDM
Sbjct: 1207 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDM 1266

Query: 1262 LACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASRV 1321
            L CLNISDKCIFWV VVYLV+YRKLPHAIVQQLECEKELIEIEWPA+ LT+GE+LRASRV
Sbjct: 1267 LTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRV 1326

Query: 1322 VKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSC 1381
            VKKAVDFVDSCLNNESL+SKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPSC
Sbjct: 1327 VKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSC 1386

Query: 1382 LELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEEL 1441
             ELLLL +RA+KH FGD TV+AFE+ IR WPKEVPGVQCIWNQYAEYLL+NGRIKCTEEL
Sbjct: 1387 PELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEEL 1446

Query: 1442 MVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSVH 1501
            M R F+STSKMDCSKTRT  NSDCDSLHL ++ASGSI+ ALDCSPNEVDVVFWYLN SVH
Sbjct: 1447 MARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVH 1506

Query: 1502 KLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1561
            KLLLNDQLEARLAF+NALRAA S TFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY
Sbjct: 1507 KLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1566

Query: 1562 LNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQK 1621
            LNDARAFPVP+PLSR+FI DI+KPRV+LLVSNMLSP+S DVSLVNCILEVWYGPSLLPQK
Sbjct: 1567 LNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQK 1626

Query: 1622 FNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVASPSLIFWACSNLISAIF 1681
            FNKP+ELVDFVETILE+LPSNYQLVLSVCKQLCN DN SSQ ASPSLIFWACSNLI AIF
Sbjct: 1627 FNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIF 1686

Query: 1682 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1741
            SSVPIPPEFIWVEAANIL NVKG EAI ERFHKRALSVYPFSVQLWKSYYNIC+TRGDTS
Sbjct: 1687 SSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTS 1746

Query: 1742 AVLREVNERGIELNEPS 1757
            AVL+EVNERGI+LNEPS
Sbjct: 1747 AVLQEVNERGIQLNEPS 1751

BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match: A0A1S3CJD3 (uncharacterized protein LOC103501638 OS=Cucumis melo OX=3656 GN=LOC103501638 PE=4 SV=1)

HSP 1 Score: 2987.6 bits (7744), Expect = 0.0e+00
Identity = 1527/1758 (86.86%), Postives = 1615/1758 (91.87%), Query Frame = 0

Query: 2    EKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPI 61
            ++KNS+ELTVK+  SNSKPSK KAS+++EEGE+SSSDNDTQTHDV PVCSTVPAS+ SPI
Sbjct: 7    KEKNSDELTVKSTPSNSKPSKIKASDTKEEGELSSSDNDTQTHDVRPVCSTVPASIASPI 66

Query: 62   SSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVI 121
            SS LPPK+KCN GIQ ASAD+C RTSI T SQK  D AQ+VNKASTPWGASR+ANSNLVI
Sbjct: 67   SSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKASTPWGASRKANSNLVI 126

Query: 122  SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQP 181
            SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSN+LRSMTRNKV+ANKL LSQ 
Sbjct: 127  SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRSMTRNKVMANKLPLSQV 186

Query: 182  FIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQI 241
            FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLR+QI
Sbjct: 187  FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLRQQI 246

Query: 242  AICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLK 301
            AI ESKLKLKSAQQNKES+ +TNQDYIVTNSK DLGRKGN TISQ   LGPKEP+ KR+K
Sbjct: 247  AIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTISQFPPLGPKEPNVKRMK 306

Query: 302  TSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQ 361
            TSGSYS+KLSL+ QQ L +  A K V+ PQEPGEE QNIK +YNQKG SLSREE+SVLKQ
Sbjct: 307  TSGSYSSKLSLNEQQ-LHSLIAAKFVW-PQEPGEEIQNIKGSYNQKGKSLSREEASVLKQ 366

Query: 362  SKEDIKHVAASPSPGIDLGKVQDD-TDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 421
            SKEDIKHVAASPS GIDLGKVQDD TDIVANGN SD I KQVDPHPLVVLDQAT+LPNV 
Sbjct: 367  SKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDPHPLVVLDQATALPNVA 426

Query: 422  SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 481
            SNVQ+QFDNVEF RQSDGLQPSASTAK FEGT PQSA NVKIPEPCSNFFKSLIN KSSG
Sbjct: 427  SNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPEPCSNFFKSLINCKSSG 486

Query: 482  SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 541
            +AFGNSSSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI
Sbjct: 487  TAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 546

Query: 542  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 601
            EANSRCLDLY+KRELFSAHFHSFCMNNPG +SSSRQQEDM I VDHLNSMSGNAN  S L
Sbjct: 547  EANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGNANITSPL 606

Query: 602  YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 661
            YQKH EYNSST+L NDLNMQHENAGPINTSNLHENGQNLGSEPGSCS L GNT+DPLP K
Sbjct: 607  YQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSDLGGNTVDPLPFK 666

Query: 662  GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 721
            GNNIADRICSPS DPN+S+DGDEESLPSDHEMIDSY+ECY+ +K FEDDQ+EAYNM KKN
Sbjct: 667  GNNIADRICSPSVDPNISLDGDEESLPSDHEMIDSYNECYVRKKHFEDDQMEAYNMLKKN 726

Query: 722  HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 781
            H DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+NIQTSVEQGTENDAR+D TQQNN
Sbjct: 727  HCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVEQGTENDARNDRTQQNN 786

Query: 782  TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEA 841
            TE TVGLAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTPD+IHG  HFENLPSE 
Sbjct: 787  TELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGPYHFENLPSET 846

Query: 842  QDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLG 901
             D  DSD+NE F+REGSCSKTTFSFTPLTMNSVLQH+K ISSVSIEVLLTRT     NLG
Sbjct: 847  PDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVSIEVLLTRT----LNLG 906

Query: 902  FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 961
            FPEDGDSL+VD+IHWRK  ENSV E VRPM QSDGSY DDLAIDPSWPLCMYELRGKCNN
Sbjct: 907  FPEDGDSLEVDRIHWRKFIENSVLEIVRPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNN 966

Query: 962  DECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDI 1021
            DECPWQH+KD+SFANRRQCQHGHIN          SSDETKVFKYED MTPPTYLVGIDI
Sbjct: 967  DECPWQHMKDFSFANRRQCQHGHIN----------SSDETKVFKYEDRMTPPTYLVGIDI 1026

Query: 1022 LKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPS 1081
            LKADS SY PVL Q+SSQCWQ+FFSISLTLPNLL+KDASADGLFLHDARI A G+WNRPS
Sbjct: 1027 LKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLFLHDARIEAKGSWNRPS 1086

Query: 1082 SYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1141
            SYFQRG S+LSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW
Sbjct: 1087 SYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1146

Query: 1142 TIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1201
             +YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALCD
Sbjct: 1147 AVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCD 1206

Query: 1202 NIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSD 1261
            NIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEK IQRIFGLL+VAMDSDEPYSF HSD
Sbjct: 1207 NIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFMHSD 1266

Query: 1262 MLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASR 1321
            ML CLNISDKCIFWVCVVYLV+YRKLPHAIVQQLECEKELIEIEWPA+QLT+GE+LRASR
Sbjct: 1267 MLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVQLTNGEKLRASR 1326

Query: 1322 VVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1381
            VVKK VDF DSCLNNES ESKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPS
Sbjct: 1327 VVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPS 1386

Query: 1382 CLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEE 1441
            C EL+LL +RA+KH FGD TVVAFEQAIR WPKEVPG+QCIWNQYAEYLL+NGRIKCTEE
Sbjct: 1387 CPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQYAEYLLRNGRIKCTEE 1446

Query: 1442 LMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSV 1501
            LM RWF STSKMDCSKTRT  NSDCDSLHL ++ASGSI+ ALDCSP+EVDVVFWYLN SV
Sbjct: 1447 LMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPSEVDVVFWYLNHSV 1506

Query: 1502 HKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEG 1561
            HKLL+NDQLEARLAFDNALRAA +GTFRYCMREYAMFLLTD SLLNEAASVGGIRSILEG
Sbjct: 1507 HKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTDGSLLNEAASVGGIRSILEG 1566

Query: 1562 YLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQ 1621
            YLNDARAFPV +PLSR+FINDIKKPRV+LLVSN LSP+S DVSLVNCILEVWYGPSLLPQ
Sbjct: 1567 YLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSLVNCILEVWYGPSLLPQ 1626

Query: 1622 KFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVASPSLIFWACSNLISAI 1681
            KFNKP+ELVDFVETILEMLPSNYQLVLSVCKQL NGDN SSQ ASPSLIFWACSNLI+AI
Sbjct: 1627 KFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAASPSLIFWACSNLITAI 1686

Query: 1682 FSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDT 1741
            F+ VPIPPEFIWVEAANILVNVKG EAI ERFHKRALSVYPFSVQLWKSYY++CKTRGDT
Sbjct: 1687 FNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSVQLWKSYYSMCKTRGDT 1746

Query: 1742 SAVLREVNERGIELNEPS 1757
            S VL+EVNERGIELNEPS
Sbjct: 1747 STVLQEVNERGIELNEPS 1748

BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match: A0A5D3C3A9 (Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G003300 PE=4 SV=1)

HSP 1 Score: 2934.4 bits (7606), Expect = 0.0e+00
Identity = 1494/1714 (87.16%), Postives = 1576/1714 (91.95%), Query Frame = 0

Query: 46   VHPVCSTVPASVTSPISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKA 105
            V PVCSTVPAS+ SPISS LPPK+KCN GIQ ASAD+C RTSI T SQK  D AQ+VNKA
Sbjct: 167  VRPVCSTVPASIASPISSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKA 226

Query: 106  STPWGASREANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRS 165
            STPWGASR+ANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSN+LRS
Sbjct: 227  STPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRS 286

Query: 166  MTRNKVVANKLSLSQPFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQ 225
            MTRNKV+ANKL LSQ FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQ
Sbjct: 287  MTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ 346

Query: 226  GMNLNTSKLQDLREQIAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATIS 285
            GMNLNTSKLQDLR+QIAI ESKLKLKSAQQNKES+ +TNQDYIVTNSK DLGRKGN TIS
Sbjct: 347  GMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTIS 406

Query: 286  QVSSLGPKEPDAKRLKTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYN 345
            Q   LGPKEP+ KR+KTSGSYS+KLSL+ QQ L +  A K V+ PQEPGEE QNIK +YN
Sbjct: 407  QFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQ-LHSLIAAKFVW-PQEPGEEIQNIKGSYN 466

Query: 346  QKGNSLSREESSVLKQSKEDIKHVAASPSPGIDLGKVQDD-TDIVANGNQSDWISKQVDP 405
            QKG SLSREE+SVLKQSKEDIKHVAASPS GIDLGKVQDD TDIVANGN SD I KQVDP
Sbjct: 467  QKGKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDP 526

Query: 406  HPLVVLDQATSLPNVTSNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPE 465
            HPLVVLDQAT+LPNV SNVQ+QFDNVEF RQSDGLQPSASTAK FEGT PQSA NVKIPE
Sbjct: 527  HPLVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPE 586

Query: 466  PCSNFFKSLINSKSSGSAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIE 525
            PCSNFFKSLIN KSSG+AFGNSSSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIE
Sbjct: 587  PCSNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIE 646

Query: 526  ERNAFKIYSRAQRALIEANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGV 585
            ERNAFKIYSRAQRALIEANSRCLDLY+KRELFSAHFHSFCMNNPG +SSSRQQEDM I V
Sbjct: 647  ERNAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDV 706

Query: 586  DHLNSMSGNANRASSLYQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPG 645
            DHLNSMSGNAN  S LYQKH EYNSST+L NDLNMQHENAGPINTSNLHENGQNLGSEPG
Sbjct: 707  DHLNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPG 766

Query: 646  SCSALCGNTLDPLPSKGNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRK 705
            SCS L GNT+DPLP KGNNIADRICSPS +PN+S+DGDEESLPSDHEMIDSY+ECY+ +K
Sbjct: 767  SCSDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDHEMIDSYNECYMRKK 826

Query: 706  QFEDDQLEAYNMSKKNHSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVE 765
             FEDDQ+EAYNM KKNH DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+NIQTSVE
Sbjct: 827  HFEDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVE 886

Query: 766  QGTENDARDDSTQQNNTEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTP 825
            QGTENDAR+D TQQNNTE TVGLAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTP
Sbjct: 887  QGTENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTP 946

Query: 826  DDIHGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVS 885
            D+IHG  HFENLPSE  D  DSD+NE F+REGSCSKTTFSFTPLTMNSVLQH+K ISSVS
Sbjct: 947  DEIHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVS 1006

Query: 886  IEVLLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAID 945
            IEVLL+RT     NLGFPEDGDSL+VD+IHWRK  ENSVHE VRPM QSDGSY DDLAID
Sbjct: 1007 IEVLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPMLQSDGSYTDDLAID 1066

Query: 946  PSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFK 1005
            PSWPLCMYELRGKCNNDECPWQH+KD+SFANRRQCQHGHIN          SSDETKVFK
Sbjct: 1067 PSWPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHIN----------SSDETKVFK 1126

Query: 1006 YEDGMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLF 1065
            YED MTPPTYLVGIDILKADS SY PVL Q+SSQCWQ+FFSISLTLPNLL+KDASADGLF
Sbjct: 1127 YEDRMTPPTYLVGIDILKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLF 1186

Query: 1066 LHDARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALP 1125
            LHDARI A G+WNRPSSYFQRG S+LSQLKQGDENLALETALIIINQETNSREGMKKALP
Sbjct: 1187 LHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKKALP 1246

Query: 1126 VLSRAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNL 1185
            VLSRAVENNPKSIALW +YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNL
Sbjct: 1247 VLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNL 1306

Query: 1186 DARLAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLL 1245
            DARLAAYD+A+SALCDNIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEK IQRIFGLL
Sbjct: 1307 DARLAAYDSAISALCDNIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLL 1366

Query: 1246 RVAMDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIE 1305
            +VAMDSDEPYSF HSDML CLNISDKCIFWVCVVYLV+YRKLPHAIVQQLECEKELIEIE
Sbjct: 1367 QVAMDSDEPYSFMHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIE 1426

Query: 1306 WPAIQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDI 1365
            WPA+QLT+GE+LRASRVVKK VDF DSCLNNES ESKCYQKSIQMFAVNHIRCLMAFEDI
Sbjct: 1427 WPAVQLTNGEKLRASRVVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDI 1486

Query: 1366 GFSRNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQ 1425
             FSRNLLDKYVKLYPSC EL+LL +RA+KH FGD TVVAFEQAIR WPKEVPG+QCIWNQ
Sbjct: 1487 EFSRNLLDKYVKLYPSCPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQ 1546

Query: 1426 YAEYLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDC 1485
            YAEYLL+NGRIKCTEELM RWF STSKMDCSKTRT  NSDCDSLHL ++ASGSI+ ALDC
Sbjct: 1547 YAEYLLRNGRIKCTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDC 1606

Query: 1486 SPNEVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESL 1545
            SP+EVDVVFWYLN SVHKLL+NDQLEARLAFDNALRAA +GTFRYCMREYAMFLLTDESL
Sbjct: 1607 SPSEVDVVFWYLNHSVHKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTDESL 1666

Query: 1546 LNEAASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSL 1605
            LNEAASVGGIRSILEGYLNDARAFPV +PLSR+FINDIKKPRV+LLVSN LSP+S DVSL
Sbjct: 1667 LNEAASVGGIRSILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSL 1726

Query: 1606 VNCILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVA 1665
            VNCILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQL NGDN SSQ A
Sbjct: 1727 VNCILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAA 1786

Query: 1666 SPSLIFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSV 1725
            SPSLIFWACSNLI+AIF+ VPIPPEFIWVEAANILVNVKG EAI ERFHKRALSVYPFSV
Sbjct: 1787 SPSLIFWACSNLITAIFNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSV 1846

Query: 1726 QLWKSYYNICKTRGDTSAVLREVNERGIELNEPS 1757
            QLWKSYY++CKTRGDTS VL+EVNERGIELNEPS
Sbjct: 1847 QLWKSYYSMCKTRGDTSTVLQEVNERGIELNEPS 1864

BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match: A0A5A7VFE0 (Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold32G00140 PE=4 SV=1)

HSP 1 Score: 2867.0 bits (7431), Expect = 0.0e+00
Identity = 1459/1668 (87.47%), Postives = 1538/1668 (92.21%), Query Frame = 0

Query: 92   SQKTCDTAQVVNKASTPWGASREANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFK 151
            SQK  D AQ+VNKASTPWGASR+ANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFK
Sbjct: 2    SQKIRDNAQIVNKASTPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFK 61

Query: 152  PPTSTLDRSNRLRSMTRNKVVANKLSLSQPFIPSMTKNHRAYSTG-AGPSLAEQGSKIRA 211
            PPTSTLDRSN+LRSMTRNKV+ANKL LSQ FIPSMTKNH+AYS G AGPS AEQGSKIRA
Sbjct: 62   PPTSTLDRSNKLRSMTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRA 121

Query: 212  FSGNLQSQGRGNDQGMNLNTSKLQDLREQIAICESKLKLKSAQQNKESISITNQDYIVTN 271
            FSGNLQSQGRGNDQGMNLNTSKLQDLR+QIAI ESKLKLKSAQQNKES+ +TNQDYIVTN
Sbjct: 122  FSGNLQSQGRGNDQGMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTN 181

Query: 272  SKSDLGRKGNATISQVSSLGPKEPDAKRLKTSGSYSTKLSLSGQQHLRATYAGKSVFRPQ 331
            SK DLGRKGN TISQ   LGPKEP+ KR+KTSGSYS+KLSL+ QQ L +  A K V+ PQ
Sbjct: 182  SKPDLGRKGNNTISQFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQ-LHSLIAAKFVW-PQ 241

Query: 332  EPGEETQNIKVTYNQKGNSLSREESSVLKQSKEDIKHVAASPSPGIDLGKVQDD-TDIVA 391
            EPGEE QNIK +YNQKG SLSREE+SVLKQSKEDIKHVAASPS GIDLGKVQDD TDIVA
Sbjct: 242  EPGEEIQNIKGSYNQKGKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVA 301

Query: 392  NGNQSDWISKQVDPHPLVVLDQATSLPNVTSNVQTQFDNVEFHRQSDGLQPSASTAKLFE 451
            NGN SD I KQVDPHPLVVLDQAT+LPNV SNVQ+QFDNVEF RQSDGLQPSASTAK FE
Sbjct: 302  NGNHSDLIGKQVDPHPLVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFE 361

Query: 452  GTLPQSASNVKIPEPCSNFFKSLINSKSSGSAFGNSSSCLGFSNLDLQSLFEMEESLDKD 511
            GT PQSA NVKIPEPCSNFFKSLIN KSSG+AFGNSSSCL F N DLQSLFE+EESLDKD
Sbjct: 362  GTPPQSAYNVKIPEPCSNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKD 421

Query: 512  LEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYHKRELFSAHFHSFCMNNPGL 571
            LEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLY+KRELFSAHFHSFCMNNPG 
Sbjct: 422  LEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGS 481

Query: 572  ISSSRQQEDMKIGVDHLNSMSGNANRASSLYQKHFEYNSSTKLHNDLNMQHENAGPINTS 631
            +SSSRQQEDM I VDHLNSMSGNAN  S LYQKH EYNSST+L NDLNMQHENAGPINTS
Sbjct: 482  VSSSRQQEDMIIDVDHLNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTS 541

Query: 632  NLHENGQNLGSEPGSCSALCGNTLDPLPSKGNNIADRICSPSFDPNVSVDGDEESLPSDH 691
            NLHENGQNLGSEPGSCS L GNT+DPLP KGNNIADRICSPS +PN+S+DGDEESLPSDH
Sbjct: 542  NLHENGQNLGSEPGSCSDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDH 601

Query: 692  EMIDSYDECYIGRKQFEDDQLEAYNMSKKNHSDNNIEDSLRLEAKLRSELFARLGTRNLS 751
            EMIDSY+ECY+ +K FEDDQ+EAYNM KKNH DNNIEDSLRLEAKLRSELFARLGTRNLS
Sbjct: 602  EMIDSYNECYMRKKHFEDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLS 661

Query: 752  KTCNPCHNIQTSVEQGTENDARDDSTQQNNTEPTVGLAVGSDVDLISKKTEIALLSGKGD 811
            K CNPC+NIQTSVEQGTENDAR+D TQQNNTE TVGLAVGSDVDLISKK E ALLSGKGD
Sbjct: 662  KACNPCNNIQTSVEQGTENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGD 721

Query: 812  QQFGFGGTNICKTPDDIHGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTM 871
            QQFGFGGT+ CKTPD+IHG  HFENLPSE  D  DSD+NE F+REGSCSKTTFSFTPLTM
Sbjct: 722  QQFGFGGTDRCKTPDEIHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTM 781

Query: 872  NSVLQHIKAISSVSIEVLLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPM 931
            NSVLQH+K ISSVSIEVLL+RT     NLGFPEDGDSL+VD+IHWRK  ENSVHE VRPM
Sbjct: 782  NSVLQHMKVISSVSIEVLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPM 841

Query: 932  FQSDGSYIDDLAIDPSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSC 991
             QSDGSY DDLAIDPSWPLCMYELRGKCNNDECPWQH+KD+SFANRRQCQHGHIN     
Sbjct: 842  LQSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHIN----- 901

Query: 992  NGLSFSSDETKVFKYEDGMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTL 1051
                 SSDETKVFKYED MTPPTYLVGIDILKADS SY PVL Q+SSQCWQ+FFSISLTL
Sbjct: 902  -----SSDETKVFKYEDRMTPPTYLVGIDILKADSRSYGPVLAQRSSQCWQNFFSISLTL 961

Query: 1052 PNLLQKDASADGLFLHDARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIIN 1111
            PNLL+KDASADGLFLHDARI A G+WNRPSSYFQRG S+LSQLKQGDENLALETALIIIN
Sbjct: 962  PNLLRKDASADGLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIIN 1021

Query: 1112 QETNSREGMKKALPVLSRAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQS 1171
            QETNSREGMKKALPVLSRAVENNPKSIALW +YLLIFYSYTTTGGKDDMFS+AVKHNGQS
Sbjct: 1022 QETNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQS 1081

Query: 1172 YELWLMYINSRMNLDARLAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMS 1231
            YELWLMYINSRMNLDARLAAYD+A+SALCDNIF+HNLDGK ASAHILDLILQMTNCLCMS
Sbjct: 1082 YELWLMYINSRMNLDARLAAYDSAISALCDNIFSHNLDGKDASAHILDLILQMTNCLCMS 1141

Query: 1232 GNVEKGIQRIFGLLRVAMDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAI 1291
            GNVEK IQRIFGLL+VAMDSDEPYSF HSDML CLNISDKCIFWVCVVYLV+YRKLPHAI
Sbjct: 1142 GNVEKAIQRIFGLLQVAMDSDEPYSFMHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAI 1201

Query: 1292 VQQLECEKELIEIEWPAIQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMF 1351
            VQQLECEKELIEIEWPA+QLT+GE+LRASRVVKK VDF DSCLNNES ESKCYQKSIQMF
Sbjct: 1202 VQQLECEKELIEIEWPAVQLTNGEKLRASRVVKKVVDFADSCLNNESPESKCYQKSIQMF 1261

Query: 1352 AVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRN 1411
            AVNHIRCLMAFEDI FSRNLLDKYVKLYPSC EL+LL +RA+KH FGD TVVAFEQAIR 
Sbjct: 1262 AVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELILLDIRARKHDFGDATVVAFEQAIRY 1321

Query: 1412 WPKEVPGVQCIWNQYAEYLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHL 1471
            WPKEVPG+QCIWNQYAEYLL+NGRIKCTEELM RWF STSKMDCSKTRT  NSDCDSLHL
Sbjct: 1322 WPKEVPGIQCIWNQYAEYLLRNGRIKCTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHL 1381

Query: 1472 REYASGSILHALDCSPNEVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYC 1531
             ++ASGSI+ ALDCSP+EVDVVFWYLN SVHKLL+NDQLEARLAFDNALRAA +GTFRYC
Sbjct: 1382 LDHASGSIVRALDCSPSEVDVVFWYLNHSVHKLLVNDQLEARLAFDNALRAASAGTFRYC 1441

Query: 1532 MREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLL 1591
            MREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPV +PLSR+FINDIKKPRV+LL
Sbjct: 1442 MREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLL 1501

Query: 1592 VSNMLSPLSLDVSLVNCILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVC 1651
            VSN LSP+S DVSLVNCILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVC
Sbjct: 1502 VSNTLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVC 1561

Query: 1652 KQLCNGDN-SSQVASPSLIFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYE 1711
            KQL NGDN SSQ ASPSLIFWACSNLI+AIF+ VPIPPEFIWVEAANILVNVKG EAI E
Sbjct: 1562 KQLSNGDNYSSQAASPSLIFWACSNLITAIFNCVPIPPEFIWVEAANILVNVKGLEAITE 1621

Query: 1712 RFHKRALSVYPFSVQLWKSYYNICKTRGDTSAVLREVNERGIELNEPS 1757
            RFHKRALSVYPFSVQLWKSYY++CKTRGDTS VL+EVNERGIELNEPS
Sbjct: 1622 RFHKRALSVYPFSVQLWKSYYSMCKTRGDTSTVLQEVNERGIELNEPS 1653

BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match: A0A6J1GRB4 (uncharacterized protein LOC111456410 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456410 PE=4 SV=1)

HSP 1 Score: 2793.8 bits (7241), Expect = 0.0e+00
Identity = 1441/1758 (81.97%), Postives = 1553/1758 (88.34%), Query Frame = 0

Query: 1    MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
            MEK ++EELTVKAM SN KP++SK S SREEGEVSSSDNDTQTH VH V S +PASVTSP
Sbjct: 1    MEKNDAEELTVKAMESNLKPTRSKTSNSREEGEVSSSDNDTQTHGVHHVRSAMPASVTSP 60

Query: 61   ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
            ISSILPPKNK N GIQAASADVC +TSIQTT+QK CD  Q+V+KA TPW ASR+AN+NLV
Sbjct: 61   ISSILPPKNKSNAGIQAASADVCPKTSIQTTAQKICDNDQIVHKAITPWVASRDANANLV 120

Query: 121  ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
            ISFSDDSGS+++E SK +TSKS S+AV HFKPPTS LD+SN+LRSMTRN VVANK S SQ
Sbjct: 121  ISFSDDSGSDMDERSKEKTSKSRSNAVGHFKPPTSLLDKSNKLRSMTRNNVVANKFSSSQ 180

Query: 181  PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
             FI S T   RA S G AGPSL EQGS+IRAFSGNL  QG  NDQG+NL +SKLQDLREQ
Sbjct: 181  SFITSKTMTKRACSKGAAGPSLVEQGSRIRAFSGNLPIQGHRNDQGVNLKSSKLQDLREQ 240

Query: 241  IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
            IAI ESKLKLKSAQQNKE IS TNQDYIVTNSKSDLGRKG+ATISQ    GP +PDAKR+
Sbjct: 241  IAIWESKLKLKSAQQNKEIISATNQDYIVTNSKSDLGRKGDATISQFPPSGPTQPDAKRM 300

Query: 301  KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
            KT GSYSTKLSLSG QHLRAT A KSVFRPQEPGEETQNIKVTYNQKGNS++R+ES+ LK
Sbjct: 301  KTIGSYSTKLSLSG-QHLRATNAVKSVFRPQEPGEETQNIKVTYNQKGNSMNRDESNALK 360

Query: 361  QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
            Q KEDIKHVAAS SPG DLGKV D TDIVANGNQSDWISKQVDPHPLVVL QA+ LPN  
Sbjct: 361  QKKEDIKHVAASSSPGSDLGKVHDGTDIVANGNQSDWISKQVDPHPLVVLGQASVLPNTA 420

Query: 421  SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
            SNVQT FDN EFH  +DGLQ SASTA   EGT PQSASNVKIPE  SNFFKSLINSKS+G
Sbjct: 421  SNVQTLFDNSEFHSPNDGLQQSASTANFSEGTCPQSASNVKIPESFSNFFKSLINSKSTG 480

Query: 481  SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
            +AFGN SSCLGFSN+DL+SLFEMEESLDKDLEEAQD RR+CE+EERNAFKIYSRAQRALI
Sbjct: 481  TAFGNPSSCLGFSNVDLESLFEMEESLDKDLEEAQDFRRRCEVEERNAFKIYSRAQRALI 540

Query: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
            EANSRCLDLYHKRELFSAHFHSFCMNNPGL+SSSRQQE+MKIGVDH NSMSGN NRAS L
Sbjct: 541  EANSRCLDLYHKRELFSAHFHSFCMNNPGLVSSSRQQENMKIGVDHSNSMSGNENRASPL 600

Query: 601  YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
            YQKH EYNS T+L NDLNMQHENA PINTS LHEN QNLGSEP SCS LCG TL+P+PSK
Sbjct: 601  YQKHSEYNSFTQLRNDLNMQHENASPINTSILHENRQNLGSEPESCSDLCGITLNPVPSK 660

Query: 661  GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
            G NIADRICSPS +PNVSVDGDEES  SDHE+IDSYDECYIG+K+FEDDQ+EA NMSKKN
Sbjct: 661  GKNIADRICSPSIEPNVSVDGDEESFHSDHEIIDSYDECYIGKKRFEDDQMEACNMSKKN 720

Query: 721  HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
            H D+   DSLRLEAKLRSELFARLGTRN S+TCNPCHNIQTSVE+G E DARDD TQQN 
Sbjct: 721  HYDDKTGDSLRLEAKLRSELFARLGTRNSSQTCNPCHNIQTSVEKGAEKDARDDKTQQNY 780

Query: 781  TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEA 840
            TEPTV  AVG+D+D    KT+ ALLSGK DQ+FGFGGT+ CKTPDDI   C+FEN P E 
Sbjct: 781  TEPTVRQAVGNDID----KTKSALLSGKRDQKFGFGGTDRCKTPDDIRSHCNFENFPLET 840

Query: 841  QDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLG 900
             D ADSD NE  NREG CS   FS+ PLT+NSVLQH+KA++SVS EVLL+RTR S SNLG
Sbjct: 841  HDVADSDVNEPSNREGPCS--YFSYAPLTLNSVLQHMKAVTSVSTEVLLSRTRESFSNLG 900

Query: 901  FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 960
             PE+GD L+VD+IHWRKL+EN V +TV  MFQSDGSY DDL+IDPSWPLCMYELRGKCNN
Sbjct: 901  LPEEGDLLEVDRIHWRKLEENHVPDTVSCMFQSDGSYTDDLSIDPSWPLCMYELRGKCNN 960

Query: 961  DECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDI 1020
            DECPWQHVKD S +NRR CQ    NYSDSCNGL FSSDETKVFKYED MTPPTYLVG+DI
Sbjct: 961  DECPWQHVKDSSLSNRRPCQDSQSNYSDSCNGLLFSSDETKVFKYEDLMTPPTYLVGVDI 1020

Query: 1021 LKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPS 1080
            LKADSHSY+PVL QKSS+CWQ+FFSISLTLPNLLQKDASADGLFLHDARI A G+WNR S
Sbjct: 1021 LKADSHSYNPVLVQKSSKCWQNFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRQS 1080

Query: 1081 SYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1140
            SYFQ GS+ LSQLKQ DEN ALETALIIINQE NSREGMK+ALP+LSRA+E+NPKSIALW
Sbjct: 1081 SYFQSGSTTLSQLKQADENQALETALIIINQEMNSREGMKRALPILSRAIESNPKSIALW 1140

Query: 1141 TIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1200
            T+YLLIFYSYTT GGKDDMFS+AVKHN QSYELWL+YINS MNLDAR+AAYDAALSAL +
Sbjct: 1141 TMYLLIFYSYTTNGGKDDMFSYAVKHNEQSYELWLLYINSHMNLDARIAAYDAALSALFN 1200

Query: 1201 NIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSD 1260
            NI T  +D K ASAHILDLILQMTNCLCMSGNVEK  Q+IFGLLRVAMDSDEP SF HSD
Sbjct: 1201 NILT-QMDEKCASAHILDLILQMTNCLCMSGNVEKATQKIFGLLRVAMDSDEPGSFMHSD 1260

Query: 1261 MLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASR 1320
            ML CLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKEL+EIEWP I LTDGE+ RAS 
Sbjct: 1261 MLTCLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELVEIEWPTIHLTDGEKQRAST 1320

Query: 1321 VVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1380
            VVKKAVDFVDSCLNNESLES+ YQKSIQMFAVNHIRCLMAFEDIGF+RNLLDKYVK YPS
Sbjct: 1321 VVKKAVDFVDSCLNNESLESQSYQKSIQMFAVNHIRCLMAFEDIGFTRNLLDKYVKRYPS 1380

Query: 1381 CLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEE 1440
            CLELLLL    KKH FG E V AFE+ IRNWPKEVPGVQCIWNQYAEYLLQNGRIK TEE
Sbjct: 1381 CLELLLLNAWTKKHDFG-EMVAAFEEVIRNWPKEVPGVQCIWNQYAEYLLQNGRIKYTEE 1440

Query: 1441 LMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSV 1500
            LM RWF+S+SK+  S+TRT+DNSDC+SLHL +YASGSI+HALDCSP+EVD+VFWYLNLSV
Sbjct: 1441 LMARWFDSSSKIG-SRTRTLDNSDCNSLHLLDYASGSIVHALDCSPSEVDLVFWYLNLSV 1500

Query: 1501 HKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEG 1560
            HKLLLND LEARLAFDNALRAA SGTF+YCMREYAMFLLTDESLLNEA SVGGIRSILEG
Sbjct: 1501 HKLLLNDLLEARLAFDNALRAASSGTFKYCMREYAMFLLTDESLLNEAGSVGGIRSILEG 1560

Query: 1561 YLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQ 1620
            YL+D RAFPVP+ LSRKFINDIKKPRVQLLVSNMLSPLS DVSLVNC+LE WYGPSLLP 
Sbjct: 1561 YLSDVRAFPVPETLSRKFINDIKKPRVQLLVSNMLSPLSPDVSLVNCVLEAWYGPSLLPP 1620

Query: 1621 KFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLIFWACSNLISAIF 1680
            KF+KP+ELVDFVETILEMLPSNYQLVLSVCKQLCNG+NSSQV S SLIFWACSNLISAIF
Sbjct: 1621 KFSKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGNNSSQVTSASLIFWACSNLISAIF 1680

Query: 1681 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1740
             +VPIPPEFIWVEA++ILVNVKGF AI ERFHKRALSVYPFSVQLWKSYYN CK RGDTS
Sbjct: 1681 CAVPIPPEFIWVEASDILVNVKGFGAITERFHKRALSVYPFSVQLWKSYYNKCKARGDTS 1740

Query: 1741 AVLREVNERGIELNEPSL 1758
            AVL+ VNERGIEL+ PSL
Sbjct: 1741 AVLQAVNERGIELSLPSL 1748

BLAST of ClCG09G020050 vs. TAIR 10
Match: AT2G39580.1 (CONTAINS InterPro DOMAIN/s: Putative zinc-finger domain (InterPro:IPR019607); Has 249 Blast hits to 219 proteins in 85 species: Archae - 0; Bacteria - 144; Metazoa - 29; Fungi - 8; Plants - 50; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 780.4 bits (2014), Expect = 3.0e-225
Identity = 604/1760 (34.32%), Postives = 897/1760 (50.97%), Query Frame = 0

Query: 22   KSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPISSILP-PKNKCNEGIQAASA 81
            K+     +EEGE+S+SD++ Q     P+ ++  + +T  IS+     + +   G      
Sbjct: 8    KNSPVTGKEEGELSTSDDEVQ-----PMQTSTRSPLTEHISANTNIQRRQAGNGGSFIKP 67

Query: 82   DVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVISFS-DDSGSELEECSKVRT 141
               T T +     +  +T Q +          R  NSNLVI+FS DDSGSE +   + +T
Sbjct: 68   SDATPTKLTNPGGRIFETKQAIAAIHGKKFPVRGNNSNLVINFSDDDSGSESDCKGRTQT 127

Query: 142  SKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANK----LSLSQPFIPSMTKNHRAYST 201
            SK           P  T+  +    + ++ K+   +     ++++  + + T +H A S 
Sbjct: 128  SKIQ---------PKGTISGNRNPSTFSQTKLKGPRQIDIRAITKKALSTSTFSHAATSK 187

Query: 202  GAGPSLAEQ---GSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQIAICESKLKLKSA 261
             +  S A++      I +    +    +  +Q +  N++KLQDL++QIA+ ES+LKLK+A
Sbjct: 188  VSNLSFAKEMKSNKYIHSSERTVSKDAQRPEQIVESNSNKLQDLKQQIALRESELKLKAA 247

Query: 262  QQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLKTSGSYSTKLSLS 321
            Q  K+++          N K    R+ +        L P EP  KRLK SG         
Sbjct: 248  QPKKDAV----------NPKITPARRVSIISDDTRHLEPNEPPKKRLKVSGI-------- 307

Query: 322  GQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQSKEDIKHVAASP 381
                                  +T    + Y           S+    +  DI+    S 
Sbjct: 308  ----------------------DTSQPVIDYRVAA-------SAAAPMNAPDIR---KSL 367

Query: 382  SPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTSNVQTQFD---NV 441
             PG++      ++     G++SD I   V P   V  + ++S+   ++     ++    +
Sbjct: 368  LPGVNA-----NSSCKHLGSKSDEIVPPVIPQHTVEGNTSSSVLQKSTGKVNHYEGGREL 427

Query: 442  EFHRQSDGLQPSASTAKLFEG---TLPQSASNVKIPEPCSNFFKSLINSKSSGSAFGNSS 501
            E  +  D    S    K+  G      +S++N     PCSN         S       S+
Sbjct: 428  ETMKNVDRSVSSEQLLKIVNGNHQVFSRSSNNNWKRLPCSN--------NSGLYNIPGST 487

Query: 502  SCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCL 561
            +  G S LD+ SL  +EESLDK+LEEAQ+ +R  EIEERNA K+Y +AQR+LIEAN+RC 
Sbjct: 488  TVPGHSQLDMLSLTNLEESLDKELEEAQERKRLFEIEERNALKVYRKAQRSLIEANARCA 547

Query: 562  DLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLYQKHFEY 621
            +LY KRE+ SAH+ S  + +  L+  S   E+ + G   LN+ +G+ + A+   +     
Sbjct: 548  ELYSKREILSAHYGSLIVRDSRLLWPSIHGENPETGFHFLNNSTGSIDLAT---KTDIAQ 607

Query: 622  NSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKGNNIADR 681
            +S  + ++  N ++  + P   S    +GQNLG      S L  +T D LP      A R
Sbjct: 608  HSQLESNHKYNSEYVGSHPPPHS---RSGQNLG-----YSDLGASTSDGLPCGNKQTASR 667

Query: 682  ICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNHSDNNI- 741
            +CSPS D N+    D+ES P DHE                    E     +K + D  + 
Sbjct: 668  LCSPSSDANIL--PDDESFPVDHE------------------STEGNPGHQKENIDQTLG 727

Query: 742  -EDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNTEPTV 801
             +++L LEA LRS+LF RLG R  S+    C N +T +++G E D   + TQ++N  P  
Sbjct: 728  NQNALLLEASLRSKLFDRLGMRAESRG-GTCFNEETVIDRGDERDFGSEGTQRDNGSPF- 787

Query: 802  GLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIH-GRCHFENLPSEAQDSA 861
                 S++ L +   E              G   +  +P +    R   E      Q S 
Sbjct: 788  -----SEIYLHNDSLEP-------------GANKLQGSPSEAPVERRSIEENSLNYQLSI 847

Query: 862  DSDENERFNREGSCSKTTFSFTPLTMNSVLQHIK----AISSVSIEVLLTRTRGSLSNLG 921
            D  E+ R + E +   +     PL   S + H+K    +I+S+  E +L     SL +  
Sbjct: 848  DM-ESHRSSPENALLSSVALSGPL-FRSTIYHLKVPGSSITSLGPEYILQNKTYSLYS-- 907

Query: 922  FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 981
                      D+   R L E  V+E      +  G Y  +L +DPSWPLCMYELRG+CNN
Sbjct: 908  ----------DKRQCRSLTETIVYE------KKIGFYTCNLKVDPSWPLCMYELRGRCNN 967

Query: 982  DECPWQHVKDYSFANRRQCQH---GHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVG 1041
            DEC WQH KD+S  +  Q  H   G +  S        + + +K  +  D +  PTYLV 
Sbjct: 968  DECSWQHFKDFSDDSLHQSLHDPDGRVGSSSH----QKTHNSSKGSQILDSVFSPTYLVS 1027

Query: 1042 IDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWN 1101
            +D +K DS SY+ VL Q+  Q W   FS  L   N L ++  A     ++ RIV  GN  
Sbjct: 1028 LDTMKVDSWSYESVLAQRHGQIWCKHFSACLASSNSLYRNVPAKE---NEGRIVVLGNSK 1087

Query: 1102 RPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSI 1161
              SSYF+   S++  + Q                          AL +LS+ +E +P S 
Sbjct: 1088 TYSSYFRIKHSLMWHIFQ--------------------------ALSLLSQGLEGDPTSE 1147

Query: 1162 ALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSA 1221
             LW +YLLI+++Y  + GK DMFS+ VKH+ +SY +WLMYINSR  L+ +L AYD ALSA
Sbjct: 1148 ILWAVYLLIYHAYEGSDGK-DMFSYGVKHSSRSYVIWLMYINSRGQLNDQLIAYDTALSA 1207

Query: 1222 LCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFT 1281
            LC N  + ++D  +ASA ILD++LQM N LC+SGNV K IQRI  L   A  SD+P    
Sbjct: 1208 LC-NHASGSIDRNHASACILDVLLQMFNLLCISGNVSKAIQRISKLQAPAAVSDDPDFSL 1267

Query: 1282 HSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLR 1341
             S +L CL  SDKC+FWVC VYLVIYRKLP +I+++LE EKEL+EIEWP + L    +  
Sbjct: 1268 MSHILTCLTYSDKCVFWVCCVYLVIYRKLPDSIIRRLEMEKELLEIEWPTVNLDGDLKQM 1327

Query: 1342 ASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKL 1401
            A R+  K +  V+   NN        ++   +FA+N+   ++A +++   R++L   V+L
Sbjct: 1328 ALRLFDKGMRSVEHGTNN-----GIQKRPAGLFALNYALFMIAVDELESRRDILKASVQL 1387

Query: 1402 YPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKC 1461
            YP+CLEL LL VR + +   D     FE+ ++   KE   +QCIWNQYAEY L+ G    
Sbjct: 1388 YPTCLELKLLAVRMQSNELKDMFSSGFEELLKQEAKEASCIQCIWNQYAEYALEGGSYDL 1447

Query: 1462 TEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLN 1521
              ELM RW+ S   +   K +TV  ++ +     +    S L  L+ + ++VDV+F YLN
Sbjct: 1448 ARELMSRWYGSVWDVLSHKYKTVRGNEEEG---DDNMLESALSDLNVASDQVDVMFGYLN 1507

Query: 1522 LSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSI 1581
            LS+H LL ++  EARLA D AL+A     F +C+RE+A+F L +E       S+     +
Sbjct: 1508 LSLHNLLQSNWTEARLAIDQALKATAPEHFMHCLREHAVFQLINELQATGEFSINLQMRL 1567

Query: 1582 LEGYLNDARAFPVPKPLSRKFI-NDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPS 1641
            L  YL+ A + PV +PLS KFI N  +KPRV+ LV+N+L+P+S ++ +VN +LE W+GPS
Sbjct: 1568 LNSYLDRASSLPVKEPLSWKFISNSAEKPRVRKLVTNLLAPVSSELFVVNVVLEAWHGPS 1567

Query: 1642 LLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLI-FWACSNL 1701
            L+P+K +K +ELVDFVETIL ++PSNY L LSV K L   +  S   S S I FWA  NL
Sbjct: 1628 LVPEKLSKQKELVDFVETILGLVPSNYPLALSVSKLLRKEEKQSDSGSSSGIHFWAGLNL 1567

Query: 1702 ISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKT 1755
             S I  ++P+ PE+IWVEA  I+ ++ GF+   ERF K+ALSVYP SV+LW+ Y+++CK+
Sbjct: 1688 ASTISCAIPVAPEYIWVEAGEIVSDINGFKTRAERFLKKALSVYPMSVKLWRCYWSLCKS 1567

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890115.10.0e+0092.20uncharacterized protein LOC120079791 isoform X3 [Benincasa hispida][more]
XP_038890113.10.0e+0091.53uncharacterized protein LOC120079791 isoform X1 [Benincasa hispida][more]
XP_038890114.10.0e+0090.96uncharacterized protein LOC120079791 isoform X2 [Benincasa hispida][more]
XP_038890116.10.0e+0090.23uncharacterized protein LOC120079791 isoform X4 [Benincasa hispida][more]
XP_011655356.20.0e+0087.08uncharacterized protein LOC101211906 [Cucumis sativus] >KGN51732.2 hypothetical ... [more]
Match NameE-valueIdentityDescription
O602932.6e-1120.54Zinc finger C3H1 domain-containing protein OS=Homo sapiens OX=9606 GN=ZFC3H1 PE=... [more]
Match NameE-valueIdentityDescription
A0A0A0KS730.0e+0087.08zf-C3H1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G512890 PE=... [more]
A0A1S3CJD30.0e+0086.86uncharacterized protein LOC103501638 OS=Cucumis melo OX=3656 GN=LOC103501638 PE=... [more]
A0A5D3C3A90.0e+0087.16Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuw... [more]
A0A5A7VFE00.0e+0087.47Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuw... [more]
A0A6J1GRB40.0e+0081.97uncharacterized protein LOC111456410 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT2G39580.13.0e-22534.32CONTAINS InterPro DOMAIN/s: Putative zinc-finger domain (InterPro:IPR019607); Ha... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 233..253
NoneNo IPR availableCOILSCoilCoilcoord: 506..526
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..56
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 621..642
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 761..781
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 41..56
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 284..311
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 23..40
IPR019607Putative zinc-finger domainPFAMPF10650zf-C3H1coord: 948..968
e-value: 8.9E-10
score: 38.1
IPR039278NURS complex subunit red1PANTHERPTHR21563UNCHARACTERIZEDcoord: 1..1748
IPR000571Zinc finger, CCCH-typePROSITEPS50103ZF_C3H1coord: 943..969
score: 9.018795
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 1117..1452

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG09G020050.1ClCG09G020050.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006396 RNA processing
cellular_component GO:0000178 exosome (RNase complex)
cellular_component GO:0005634 nucleus
molecular_function GO:0046872 metal ion binding
molecular_function GO:0005515 protein binding