Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
TTCCGCTCTCACCTTCGTCCTCAAACTTCCTTTCCCTTGGACCGCTTCTTCAATCTCCCTTTCACTGAGCGGGGGGATGGAGAAGAAGAACTCCGAGGAGCTCACCGTCAAAGCCATGGCATCCAATTCAAAACCTAGCAAAAGCAAAGCCTCCGAAAGCAGAGAAGAAGGAGAAGTATCTTCATCCGACAACGATACACAAACACACGACGTACTTTCTTCTTCCTACTATCTTCTCATTCTGTTTCTAATTTCTTTTTGTTTTTCTCAATCATCTGCATCTCATGCCCTTCGTTTTCGTCCTGTCCGATGCTGAGAATGATTGTATATCCGCTACCATTTTCTTGCTGCCTCTGTTTTCCTTGGAAATGAAGGCTTCCTAGAGCTTCATACACTATTCTACTAATTCGAGTTCTAGGTCTGTGGTTATCTCTTAATTCGTAGTTGTATATGTCATCGAGGTGAGAACAGAGAGAAACTGCTATACTTTGAAGCTCAAACGGTTTATCTCTACGCGGTAGGGTTTAAATGTTTTAGTTGTGACTTGTGAACACTCGTTCTCTCCTGATTTAAACTGCATCTGTTTTTAAGTATGATTATGTTAAGAACACTTTGGATTTGTGCATTGTTTTCATTCTTTATAGTCGGATTAACATGTTTTGTGGTATCCAAAATGGATTTTCTGATTCGCCCCGATTCAATTTGATCTTTTCTAGTCGCAAAATGATTTTCATATTTCTATTTTGTGCGGCTTATGAGGAATTTTTACTTCTCAGGTACATCCTGTTTGTTCTACAGTGCCTGCCTCGGTCACATCACCTATCTCATCCATTCTTCCTCCTAAGAATAAATGTAACGAAGGGATCCAGGCTGGTAAGTATTGTCTCGCTTATAGAAGACGTTGGTGTTACACCACCCTTTCACTTTGAAATCTGCAACAGTATCTAAAATTGCTTTTAAGACTATTTTTATTAAGAAGCTTTTGGTATACCTGCGATCACGTTGAATGTAGTGTTTAGGCATATTTTTAAACCAAGGATATTGGACATCTGGTGCATTTTTTTCTGTGGCAAGTTACATGGACTTAGGTAATATGGATTTTAGTGGGCTTGGTTTTTTGTATGCCCTTCTATTCTTTCCTTGTTCTTTCTCTAAGCAGTTGTTTATTAAAAAAAAGAAAGTTACATGGATTTAGGTTTAACCTTGCAGGAATACTCTGCAGCTTTACTAAAGTTGAAGTGTGTACACATCACTTTTCCCTCAGGATTGAAAAGATGTGGATTCTAGCTGTGGTTCTTTCAAACTTCCATGCTTTATGTTTATTTACTGCATTTAAGGTTTCAATGCTTGAATATTTCCTGTTACTCTGGACTTTTGACGGGGTTCTCAAATTGTTGGTCTTATAAATGAATCTTGCGGTGCACTAAACTTTTTGGTGGTCTTGTATTAACAGTATTTTGTGATGACCAATATCTGCCACTAATACAGTAGGCTGTTTCTGGAAATAAATTTTCAGCTTCTGCTGATGTTTGCACAAGAACATCCATACAAACTACTTCTCAGAAGACGTGTGATACTGCTCAAGTTGTGAATAAAGCTAGTACTCCTTGGGGTGCTTCCAGGGAGGCCAATTCGAATCTTGTGATTAGCTTCTCAGATGACAGTGGCAGTGAATTGGAGGAATGCAGTAAAGTCAGAACTTCAAAATCTCATAGTGATGCTGTCAGGCACTTTAAACCTCCAACTTCAACACTTGATAGATCAAACAGGTTACGGAGTATGACAAGGAACAAAGTAGTGGCAAACAAACTGTCTTTGAGTCAGCCATTTATCCCTTCAATGACCAAGAATCACAGAGCATATTCCACGGGTGCTGGGCCCTCATTGGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGAAACCTACAAAGCCAAGGGCGTGGGAATGATCAAGGAATGAACTTAAACACTAGTAAGCTGCAGGACTTGCGGGAGCA
GATAGCCATTTGCGAAAGCAAACTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGCATTTCAATTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAAGGGAATGCTACTATCTCTCAAGTTTCTTCACTGGGGCCTAAGGAACCAGATGCAAAGCGCCTGAAAACCAGTGGGTCTTATTCTACCAAGCTGAGTTTGAGTGGGCAACAACATCTTCGTGCAACGTATGCTGGAAAATCTGTTTTTCGGCCACAGGAGCCTGGAGAAGAGACACAGAACATTAAGGTCACTTACAACCAGAAAGGGAATTCTTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAACATGTGGCTGCTTCACCTTCACCTGGAATTGACCTTGGCAAAGTACAGGATGGTGAATAATTTGTTCATACTGAATAGATAATTAGATCTTTCAGTTTAGCACAATCTTACTGTTTTATGAGGTGTCTGCCTAACTTATTACGTAAAATGCTGTGCAGACACCGACATTGTTGCTAATGGAAATCAATCAGACTGGATCAGTAAGCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACGAGTCTTCCAAATGTCACATCCAATGTCCAAACTCAGTTCGTAAGTCAAGAACTGTATATGTGGCCTGATGTCTTTTCTCATTGATGTATTTTTTTTTTTTTTGATCAAGTATCTAAAATTCATTATACTTCTCAGAAATTCTAGAAGTTGTTTCATTAAACCTTCGATTATCTTATGTCTCCATTATCTTATGTTTGATGATGCTGTTTTTGGAATCTCCTAAAATTGTGAAATTCATGGTGAAGTTAATTGAACCAAATCTGTATCAGGATTCCTTATCATATTTAGGATATGATAAAATATTTGCTTACATGAATGGACTTCTCTTCAAAATCATGCTCCCGTTTATTATACATAGTTCTTCCTCCTCTCCCCTTCAAGCACAATTTGAATATTTGTCTCAACAGGATAATGTTGAGTTTCACCGTCAAAGTGATGGCCTTCAACCATCTGCATCAACTGCAAAATTATTTGAAGGAACACTTCCTCAATCAGCATCCAATGTCAAGATACCAGAGCCATGCAGTAATTTTTTTAAGGTTTATCTTCAAGCTCAACTGTCACATTTTAGTTTTTTCTTCTCTCTCTTTCATCGTAGTTACATGGATTTCCTCTGGATTTATTATTATCGCAGTCATTGATAAACAGTAAAAGCTCCGGGTCTGCTTTTGGTAATTCATCAAGTTGCTTGGGCTTCAGCAATCTTGATCTCCAATCATTATTTGAAATGGAGGAGTCTCTAGACAAGGATCTGGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTCTTGATCTTTATCACAAGAGAGAATTATTTTCAGCTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTTAATTAGTTCCTCGAGACAGCAAGAAGACATGAAAATTGGTGTGGATCACTTAAATAGTATGTCTGGAAATGCAAATAGAGCTTCTTCTTTGTATCAGAAGCATTTTGAATATAATAGTTCTACTAAGCTACATAATGATTTAAATATGCAACATGAAAATGCTGGTCCCATCAATACTTCAAACCTGCATGAGAATGGACAAAATTTGGGGTCTGAACCTGGATCCTGCTCTGCCTTATGTGGTAATACATTGGATCCATTGCCTTCCAAAGGCAATAATATTGCAGATAGAATTTGCTCTCCATCCTTTGATCCAAACGTTTCAGTGGATGGAGATGAAGAGTCATTGCCTTCTGACCATGAAATGATCGATTCCTATGATGAATGCTACATAGGAAGAAAACAGTTTGAAGATGATCAATTGGAAGCATATAATATGTC
AAAGAAAAACCACAGTGACAATAATATTGAGGATTCTTTGCGTCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGACTTGTAATCCATGCCATAACATTCAAACGTCAGTCGAACAGGGGACTGAGAATGATGCCAGAGACGATAGCACTCAGCAAAATAATACAGAACCTACAGTAGGTCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGACTGAGATTGCTTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGTAACTTTATCTTCAGGAATGAGACTTATGCTTCTTCATCATATGTGAACTTACAGGCATGTTTGAAGTGGCTTTTAAAGATGCATTTTCTATTGTTCTCATGCTATTCATGTTATTTTGGGTGAACATCTTTTACCTAAAGGCTAAAATTGATCATCACCCCAGTCCTTTTCTTGAAGGCTGGTTTTGGAGGTCAGTCTTTGAACTGAAAAATATGGCCATATGGGTATATATGCTTAGATTTAATTGGGGCCGGCTATGCTTTTGTTCGGGTAGGCAGGAGACTTTTGGGAACTTGAATCCTGACGTGTTTTGGGTCTATTTCTGGGTTGAAAATTAATAGAGATAAGAGCTTGTGAACTTTGAGTCATTAAGTCTATTTTTGAGTTTATTCCAATCGTTGTTTGAGTATATTCCTAGCTCCTTGAAGCTTTCTAAGGGTGGAAGATGCCTAAAAGGGCTAGTATTTTGTGGAATTGTGTGGTTAGGGTATTGTTGTGGCACATTTTGCTGCAAAGGAATTAGGGGACCGCTTCTTGGAAAACTTGTTGTGGTTAGGGTATTGCTCTTTTCTCTTTTGTTTTTTTTTTGGGGGAAAAATTTATTGTTTCCTGGTGGGGATATAGTCACAGGAATTCTTTGTAGTTACAACCTTTCTACTATCTTTAACCATTGGTAAGCTTTCTTTTAGGTCATGTTGGACGGGGCTCCCTCGTCCTCCAACCCATAGGTTGTATTTATCTCTCTCTCTTTGTTTCCCTTCCTACATAATTGTTTTTTATTAAAAAGAAAAAAAAAAAGAGCTTGATCTTGGACATGAATTGTGATTTAGTGAAGTTGATAGGTTCCTTCCTTTCCGTCCTCCTACTTAGGTTTTTCCCTTGATCACGACCCTCATAATTTTCCTATATGAGTTCCGGTTTTGGGTAAAATTAATGATGTCTAGCTTCTGGAAGAGGTAGTTCTTCTCTAACGTGGGCAGACTTACTCTTATCTAGTTGATCTTGTGTGGTATCTCGACCTGCTTCTCTCATTGCTTAGAGTCCCTGTATCGCACCTTTGGAGAAGTTGATGAAAGATTTTCTTTGGCGTTCCACATCTGGCTTTTATCACCTTTCTCCATGGAGCATCCTTCGGATACTGAAATTTCCAAGCCTACTTGGCTAAGAAGGCCTTGTTCGGAAGCTTCTAGTCCACCATATGGATCCCCTCCTTCCACCTAAGGGGTTGAGAAAGCTCCTTACTCCTATTGCCAGACCATAAAAATTTACTTTGGAAGGCAAATGGAGTAAGGACAAGAAGTATATGCACATGGGCAGGTTTGTAAAGCGTAAGTCTACCTCCTTGTTGCTAAGGATGTAAACCTTTTGTAGACTTTATCCAAGATAGGCTGCCAAAAGGCAAGGGTAGTGGTATTATTGTTGGGAGGAAGTCCCAAATATGTTGACGGTCAAGATCCCACCTTACAACCAAACTGCTTTGCCAAGTGAGAAATAGAATTAATATCACAATTAATTCCTAGAAATTCAGATCTGTGTCAGTTGATGATTAACCTAGACTTATGTTGAATCATTATTATTGGTTCATTCAGTAGCAGCAATCTCTTTGTGTTTATGCACTTGTTTATTCAGTGATGGAACATAATATAACTAAAAATTGAAAAGAAGGGTGTGGTTGGTATAGTTGTGCAATTGTTGCATC
ATCCATTCTATGTATCAAGAAGAATATAATTCAATGAACGTAAAGTGACAGAAAAGGTATTTGCTGGTTAGAAAATTATTCATGGCTCTTTTTGTCTTGGCATCTTGCCTTGCAAAGTATGAGATACTTTTCCCACATTTCTAAGAGGTTCAATTAGCGAATAATTTTGGCCTTTGGTATCTACTTATGAAACCTCAGTGTTTAATGCTTCAGTTTTCTTTTCTTAGCATTGAGCGTTTCAGATTGCTGATTGCGGATGCTCTGTTAACTTGAACCTCCATTGCCCGTTCATACTGCCATAAAATGAACCAAACCTTAAAGTTATGTATAGGGAATCATGGATCAATGATTCAAGAGATAAGGCATAACTTGGTTGTTTTTGAGTATATCAGTTCAAGATTTAGTCTCTTGCTGATTATGATATCGATAACTTTTTCTTATTTCAAAAGAATATTTTTAGCCTTTTTTACTCTAAAAGCTGTGTTTCAGATGGTAGTTTTGTCAGATTAGTTGAGTGTAAATCATGATTAAAAACGATTTAATGAAAAACTGAGGATTATGCTAAGAATATTTTCTTGGTTCTTCCCGCTTCCTTTTTTTTTTTTTTTCTCTCTCTCTCTCTCTCTCTCTCTCTAAATTAACTTCTTTATGCAATATGCAGGCACAAACATATGCAAAACCCCAGATGACATCCATGGTCGTTGTCATTTTGAGAACTTGCCATCAGAAGCTCAGGATTCTGCGGACTCTGATGAAAATGAACGATTCAATAGAGAAGGATCTTGCTCCAAAACTACTTTTAGTTTTACACCTTTGACTATGAACAGTGTTCTGCAACATATAAAGGCCATATCCTCAGTTAGTATAGAAGTCTTGCTCACTAGAACTCGAGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGCAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGACTGTCAGACCTATGTTTCAGAGTGATGGCTCTTATATTGATGATCTTGCGATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGCCCTTGGCAACATGTGAAGGACTACTCTTTTGCCAATAGAAGGCAGTGTCAGCATGGCCACATCAACTATTCTGGTATGATGATTCTGATGTTCTTTTAATTGAATGGTTGAAATTCTGTAGACTTTCACCTTCATCTTTTGCAGATTCTTGCAATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGTATGAAGATGGCATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCTGATTCACATTCATATGACCCTGTTTTAACTCAGAAAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGCTATTTTTACATGATGCTCGTATAGTGGCCAATGGAAATTGGAATAGACCATCATCATACTTTCAGAGGGGAAGCTCTATATTGGTTTGTTTATTACTCCTTATATATATACCCTTGTTTCAGTCTGCGGCTGAACAATTTTTTTTTCCCTTTTTCTTTTCCTTTTGACATGGGACACAATGGCTGATTGAACTAATTTGTATTTGCCTTCCTCTGTCTCTCGTCTTCCTCCGTCATGCATGATTGCATACCTGTACAGTGTTTTTCGGTTAGAAATATCTTTATTCAACTTTCTTGGTATGCTGTGATCACAGTACTTGATCATTGTTACTTTCTTCAATGAAACTTCATTTTGAATGAAGTTTTCCCCAAATGAAAAGTAGACCCAAGTTTCAAAACAAATGAATTTTGGACTTTTGCTGATGTCATCAGCCTGTGAAATTACCACTCTAACCCTTCTACCTTTTTCATTCTTTCTTTTTTTTTTCTTTTTCATTTTTTGCATTTTTCTTCTTTCTTTTTTCTTTTTTT
TCTTTGTTTCTTCTTTCATTGTCTTCTTCTGCCATTATTTTTTCTGCGTTCTTCTTCTTCTTTCTCTTTTCCTGCACTCTTCTTCTTCGTTTTTCAAATTTTTTTTTTCTCTTCCTTCTTCGTTCTACAGTTTTTTTCTTGAAGAGAAAAATTGCAGTTTTGTCTGATGTGGAACAAGAGGAAGGAGTGAGAAGCTAGAGGAAGAATCTGAACAAAGAGAAAAACCGAATAAAAAAATTACAGTTGTGCTTGATGAGGAACAAGAAAAAGGAGTGAGAAACTAGAGCGAGAAGCCAGAGGTAGGAGCAAGAAGCGAGAGGTAGGAGTGAGAAGTGAGAGCAAGAATCTGAACAAAAGAAGGAGCGAGGTGCGAGAGGAAGGAGCGAGAATTCGTTTGAGCGTTACTCCCTTGCTCACAATTTGGGGACAATATGGTATTTTCACACGCGATTGACATAAGCAGAAGGCCATTTTTTGAGCAAACTTGAATCTTGGGCCAAATTTCATTTGGGCCCCTTTTTACAAAACCCCCAATTTTCAAGTCTGCCATTATATCTGAACTATCTGATTCTGGGAAACACGGTTACTCCTGCCTTGCTCAAATTGTGGCATTTAATATGACCATTTTTAAACTGAGAAATATCATCTTTTTTCCCTCCTTTTTGCATATTTCATTTTGCACACACACACATGTATATAGTCTGCATAGAAGTATTTCATTTCTCCTTTTTGCATATTGTTATTGTATGAGAATTTTGGTTTTTATTTTGTTATGCTTGTATTTCTTTTGCCATTTTTTTTTCTGTTTGTTTGCTTCTATTTGAAGTTATCTCTTTTTTCCCTCTTTTCCTTTTTACATAAAGATAAAATGATCCTTTTTTAGTTTTTGTGATGTTACTGTTGTATTACTGATTACTGATCCTTATCATCTTCCTGTTGTTTGGTCCATGATCTCACTTGTACTGACTTTTCTTGCAACTGCTATAAAGTTTTCTGTCACTGATTACTGAATACAGAACATTAACCATCTTTCTGTTGTTTGACCCATGATCACTTGTGCCAACTTCTTACAAAGTAGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAACCAGGAAACAAACAGTCGAGAGGGCATGAAAAAGGTATGTCTTTTTTTTATTCATTCTTTTGTTGGTTCTTGCTTGTGTGAACCATGTTGCAATACAGTTGGAAATATTTGTATTTGCAGGCTCTTCCTGTACTATCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGACCATTTACCTTCTAATATTCTATAGCTATACTACAACCGGGGGGAAGGATGACATGTTCTCTTTTGCGGTATGGTTTATTATATTTGTTCCTGCTCTGCTGTACAATATAATCCGGATGTAGGAAAGTACTGCAAAAACGTCTTTAAAGATTCAATGTATTAAAGTATATATAGTAGAGAGATGAAACAAGCTTTGCGATCTCGAAGGTTTTTGCTTTGATCAAAATTTAGTAAACACTGCTTAGCGTTTTGGTGATGCTGAAACTTTCAATCCAAGTGAGCTGTAGTTATTTTTCGGTTGTTATCAACTTTATAAAGTGAGGATTAAAACACAATATGTTGTGATATTCCACTACCTATACATGTTCTATAATGTAAAAACTGATAATTCAAACTTCGAAGTATGAGGAATGGATGATTTACTACAGTGGTCCAAAACTCTGAAATTTATTGTCACGCATCTTAAAGTTTCTTGTATTTTGAAAACAGTCCATCTTTGAAGTAACATTGAAATAACAAAAAAGCTGTCATTGCTAAACAAGGGAGAAAGAAAAAGCTGACGTCTACTCGAACTTTATTGGGCATGCTGATTTTAGATTCTTTTAATAAAAATAAAAATAATATTATTATTATTTTTACTGTGTTTGAAAACCTTTTATTTCAGCTGAAGTGCGGTTTTTGGCTT
TTGTTTATTTCGTTTCCTCTTAGAAAAATTCTCCTTGTACTCTTATCACCTCTATGATTCAATGCCGAGTCATCTCTTCCTCCATACTTGAATTGGTGGGCACTTCTGTCCATTACTTATTCTCACTCGCATAATCAAGCATTGTGTGGAAATTCTTTGTTCTTGTTATTCAGGTCAAGCACAATGGGCAATCTTATGAACTCTGGCTCATGTACATTAACAGCCGCATGAATCTCGATGCTCGATTGGCTGCATATGATGCTGCACTTTCTGCACTCTGCGACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAACGTGGAGAAGGGTATTCAGAGGATTTTTGGACTTCTTCGAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCGCATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTGTGTTGTGTATTTAGTTATTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCCATTCAATTGACAGATGGTGAGAGGCTGAGGGCTTCTAGGGTGGTCAAGAAAGCAGTCGATTTTGTTGATTCATGCCTGAACAATGAATCACTTGAAAGTAAATGCTACCAAAAATCTATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGACATAGGATTCAGTAGGAACTTGTTGGATAAGTATGTTAAACTTTATCCATCTTGCCTAGAACTTCTTTTACTTAAAGTACGGGCAAAGAAACATGGTTTTGGGGATGAAACTGTCGTGGCATTTGAACAAGCGATCAGGAACTGGCCGAAAGAAGTACCTGGTGTCCAATGCATCTGGAATCAATATGCTGAATATTTACTTCAGAATGGGAGAATCAAATGTACTGAAGAACTAATGGTGCGCTGGTTTGAGTCTACTTCAAAAATGGATTGTTCTAAAACTAGAACAGTGGATAATAGTGACTGTGACTCCTTGCACTTGCGAGAGTATGCTTCAGGATCAATTCTACATGCATTAGATTGCAGTCCCAATGAGGTGGACGTGGTGTTTTGGTATCTTAATCTTTCTGTTCACAAGTTACTGCTTAATGACCAATTAGAAGCACGTTTGGCCTTTGACAATGCTCTGAGGGCTGCAGGTTCTGGGACTTTTAGATATTGCATGAGAGAGTATGCTATGTTTTTGCTTACAGACGAATCCTTACTGAATGAGGCTGCTTCTGTTGGTGGAATAAGGAGCATTTTAGAGGGTTATCTCAACGATGCCCGAGCTTTCCCTGTCCCTAAACCATTATCCAGAAAATTCATTAACGATATCAAGAAGCCAAGAGTTCAACTTCTTGTCAGTAACATGCTGTCTCCACTTTCTCTGGATGTTTCTCTAGTGAACTGTATTCTTGAAGTCTGGTATGGGCCATCTCTTTTACCCCAAAAATTTAACAAACCAAGGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATGTTGCCTTCTAATTATCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAATGGTGACAACTCTTCCCAAGTTGCCTCCCCCAGTCTTATTTTCTGGGCCTGCTCAAATTTGATCAGTGCAATCTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGTCAATGTCAAAGGTTTTGAAGCCATATATGAGAGGTTTCACAAGAGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATACTACAACATATGTAAAACTAGAGGAGATACGAGTGCTGTTCTGCGAGAAGTAAATGAAAGGGGAATCGAACTCAACGAGCCTTCTTTGTGATAGAGTTTTACCTTCTTTTGGTAGGATCAGTAAAATTAC
AGGAAAACTAGGATCAGTTTTTAGTTATGGGAGAAGCCTCATTGTACAGATAGTCTACTTCATACCGATTTATCATGCTAGGCACGGTGGTTTCGCCAGTTTTGTTAAGTGGTAGAGAGCTAATTGCGGCTAGGAATTAGATATAAATACTTGCACAGAGAAAAATTTGGTGGAAAAAGGTTTTCTTGTCTGTAAAGTCATTGAGACCTTATTGGAACTGGATCTGTTGAGGATATCATCAGGTTTCTCAGCCAGTTGCCCACTGACCATTGTTCGGCCGTTCATAAAGGTGTATCGTCGAACGTCAGGTTTTCCCCATCTTTTTTTTGGTAGCCCACACAGTTTACTTTTGTTCTTCATCCTTCCCCGCTTGCTAGAATTAATTTCTTTAATTGACCTTACCCTTTTAAATGTAAGTACCAACCATTTTTAGGTTTAAATTTGATATGGTTTAAATGAAATACAATACTCGATGCTCATCGATGTAGGTTTCATTTCAATTTTCCCACTATCTACGAGCATGTTTTGTTTGTTAAAGCTCTCAGATCAAACACAGGTTTGTC
mRNA sequence
TTCCGCTCTCACCTTCGTCCTCAAACTTCCTTTCCCTTGGACCGCTTCTTCAATCTCCCTTTCACTGAGCGGGGGGATGGAGAAGAAGAACTCCGAGGAGCTCACCGTCAAAGCCATGGCATCCAATTCAAAACCTAGCAAAAGCAAAGCCTCCGAAAGCAGAGAAGAAGGAGAAGTATCTTCATCCGACAACGATACACAAACACACGACGTACATCCTGTTTGTTCTACAGTGCCTGCCTCGGTCACATCACCTATCTCATCCATTCTTCCTCCTAAGAATAAATGTAACGAAGGGATCCAGGCTGCTTCTGCTGATGTTTGCACAAGAACATCCATACAAACTACTTCTCAGAAGACGTGTGATACTGCTCAAGTTGTGAATAAAGCTAGTACTCCTTGGGGTGCTTCCAGGGAGGCCAATTCGAATCTTGTGATTAGCTTCTCAGATGACAGTGGCAGTGAATTGGAGGAATGCAGTAAAGTCAGAACTTCAAAATCTCATAGTGATGCTGTCAGGCACTTTAAACCTCCAACTTCAACACTTGATAGATCAAACAGGTTACGGAGTATGACAAGGAACAAAGTAGTGGCAAACAAACTGTCTTTGAGTCAGCCATTTATCCCTTCAATGACCAAGAATCACAGAGCATATTCCACGGGTGCTGGGCCCTCATTGGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGAAACCTACAAAGCCAAGGGCGTGGGAATGATCAAGGAATGAACTTAAACACTAGTAAGCTGCAGGACTTGCGGGAGCAGATAGCCATTTGCGAAAGCAAACTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGCATTTCAATTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAAGGGAATGCTACTATCTCTCAAGTTTCTTCACTGGGGCCTAAGGAACCAGATGCAAAGCGCCTGAAAACCAGTGGGTCTTATTCTACCAAGCTGAGTTTGAGTGGGCAACAACATCTTCGTGCAACGTATGCTGGAAAATCTGTTTTTCGGCCACAGGAGCCTGGAGAAGAGACACAGAACATTAAGGTCACTTACAACCAGAAAGGGAATTCTTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAACATGTGGCTGCTTCACCTTCACCTGGAATTGACCTTGGCAAAGTACAGGATGACACCGACATTGTTGCTAATGGAAATCAATCAGACTGGATCAGTAAGCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACGAGTCTTCCAAATGTCACATCCAATGTCCAAACTCAGTTCGATAATGTTGAGTTTCACCGTCAAAGTGATGGCCTTCAACCATCTGCATCAACTGCAAAATTATTTGAAGGAACACTTCCTCAATCAGCATCCAATGTCAAGATACCAGAGCCATGCAGTAATTTTTTTAAGTCATTGATAAACAGTAAAAGCTCCGGGTCTGCTTTTGGTAATTCATCAAGTTGCTTGGGCTTCAGCAATCTTGATCTCCAATCATTATTTGAAATGGAGGAGTCTCTAGACAAGGATCTGGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTCTTGATCTTTATCACAAGAGAGAATTATTTTCAGCTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTTAATTAGTTCCTCGAGACAGCAAGAAGACATGAAAATTGGTGTGGATCACTTAAATAGTATGTCTGGAAATGCAAATAGAGCTTCTTCTTTGTATCAGAAGCATTTTGAATATAATAGTTCTACTAAGCTACATAATGATTTAAATATGCAACATGAAAATGCTGGTCCCATCAATACTTCAAACCTGCATGAGAATGGACAAAATTTGGGGTCTGAAC
CTGGATCCTGCTCTGCCTTATGTGGTAATACATTGGATCCATTGCCTTCCAAAGGCAATAATATTGCAGATAGAATTTGCTCTCCATCCTTTGATCCAAACGTTTCAGTGGATGGAGATGAAGAGTCATTGCCTTCTGACCATGAAATGATCGATTCCTATGATGAATGCTACATAGGAAGAAAACAGTTTGAAGATGATCAATTGGAAGCATATAATATGTCAAAGAAAAACCACAGTGACAATAATATTGAGGATTCTTTGCGTCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGACTTGTAATCCATGCCATAACATTCAAACGTCAGTCGAACAGGGGACTGAGAATGATGCCAGAGACGATAGCACTCAGCAAAATAATACAGAACCTACAGTAGGTCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGACTGAGATTGCTTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGCACAAACATATGCAAAACCCCAGATGACATCCATGGTCGTTGTCATTTTGAGAACTTGCCATCAGAAGCTCAGGATTCTGCGGACTCTGATGAAAATGAACGATTCAATAGAGAAGGATCTTGCTCCAAAACTACTTTTAGTTTTACACCTTTGACTATGAACAGTGTTCTGCAACATATAAAGGCCATATCCTCAGTTAGTATAGAAGTCTTGCTCACTAGAACTCGAGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGCAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGACTGTCAGACCTATGTTTCAGAGTGATGGCTCTTATATTGATGATCTTGCGATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGCCCTTGGCAACATGTGAAGGACTACTCTTTTGCCAATAGAAGGCAGTGTCAGCATGGCCACATCAACTATTCTGATTCTTGCAATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGTATGAAGATGGCATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCTGATTCACATTCATATGACCCTGTTTTAACTCAGAAAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGCTATTTTTACATGATGCTCGTATAGTGGCCAATGGAAATTGGAATAGACCATCATCATACTTTCAGAGGGGAAGCTCTATATTGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAACCAGGAAACAAACAGTCGAGAGGGCATGAAAAAGGCTCTTCCTGTACTATCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGACCATTTACCTTCTAATATTCTATAGCTATACTACAACCGGGGGGAAGGATGACATGTTCTCTTTTGCGGTCAAGCACAATGGGCAATCTTATGAACTCTGGCTCATGTACATTAACAGCCGCATGAATCTCGATGCTCGATTGGCTGCATATGATGCTGCACTTTCTGCACTCTGCGACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAACGTGGAGAAGGGTATTCAGAGGATTTTTGGACTTCTTCGAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCGCATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTGTGTTGTGTATTTAGTTATTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCCATTCAA
TTGACAGATGGTGAGAGGCTGAGGGCTTCTAGGGTGGTCAAGAAAGCAGTCGATTTTGTTGATTCATGCCTGAACAATGAATCACTTGAAAGTAAATGCTACCAAAAATCTATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGACATAGGATTCAGTAGGAACTTGTTGGATAAGTATGTTAAACTTTATCCATCTTGCCTAGAACTTCTTTTACTTAAAGTACGGGCAAAGAAACATGGTTTTGGGGATGAAACTGTCGTGGCATTTGAACAAGCGATCAGGAACTGGCCGAAAGAAGTACCTGGTGTCCAATGCATCTGGAATCAATATGCTGAATATTTACTTCAGAATGGGAGAATCAAATGTACTGAAGAACTAATGGTGCGCTGGTTTGAGTCTACTTCAAAAATGGATTGTTCTAAAACTAGAACAGTGGATAATAGTGACTGTGACTCCTTGCACTTGCGAGAGTATGCTTCAGGATCAATTCTACATGCATTAGATTGCAGTCCCAATGAGGTGGACGTGGTGTTTTGGTATCTTAATCTTTCTGTTCACAAGTTACTGCTTAATGACCAATTAGAAGCACGTTTGGCCTTTGACAATGCTCTGAGGGCTGCAGGTTCTGGGACTTTTAGATATTGCATGAGAGAGTATGCTATGTTTTTGCTTACAGACGAATCCTTACTGAATGAGGCTGCTTCTGTTGGTGGAATAAGGAGCATTTTAGAGGGTTATCTCAACGATGCCCGAGCTTTCCCTGTCCCTAAACCATTATCCAGAAAATTCATTAACGATATCAAGAAGCCAAGAGTTCAACTTCTTGTCAGTAACATGCTGTCTCCACTTTCTCTGGATGTTTCTCTAGTGAACTGTATTCTTGAAGTCTGGTATGGGCCATCTCTTTTACCCCAAAAATTTAACAAACCAAGGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATGTTGCCTTCTAATTATCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAATGGTGACAACTCTTCCCAAGTTGCCTCCCCCAGTCTTATTTTCTGGGCCTGCTCAAATTTGATCAGTGCAATCTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGTCAATGTCAAAGGTTTTGAAGCCATATATGAGAGGTTTCACAAGAGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATACTACAACATATGTAAAACTAGAGGAGATACGAGTGCTGTTCTGCGAGAAGTAAATGAAAGGGGAATCGAACTCAACGAGCCTTCTTTGTGATAGAGTTTTACCTTCTTTTGGTAGGATCAGTAAAATTACAGGAAAACTAGGATCAGTTTTTAGTTATGGGAGAAGCCTCATTGTACAGATAGTCTACTTCATACCGATTTATCATGCTAGGCACGGTGGTTTCGCCAGTTTTGTTAAGTGGTAGAGAGCTAATTGCGGCTAGGAATTAGATATAAATACTTGCACAGAGAAAAATTTGGTGGAAAAAGGTTTTCTTGTCTGTAAAGTCATTGAGACCTTATTGGAACTGGATCTGTTGAGGATATCATCAGGTTTCTCAGCCAGTTGCCCACTGACCATTGTTCGGCCGTTCATAAAGGTGTATCGTCGAACGTCAGGTTTTCCCCATCTTTTTTTTGGTAGCCCACACAGTTTACTTTTGTTCTTCATCCTTCCCCGCTTGCTAGAATTAATTTCTTTAATTGACCTTACCCTTTTAAATGTAAGTACCAACCATTTTTAGGTTTAAATTTGATATGGTTTAAATGAAATACAATACTCGATGCTCATCGATGTAGGTTTCATTTCAATTTTCCCACTATCTACGAGCATGTTTTGTTTGTTAAAGCTCTCAGATCAAACACAGGTTTGTC
Coding sequence (CDS)
ATGGAGAAGAAGAACTCCGAGGAGCTCACCGTCAAAGCCATGGCATCCAATTCAAAACCTAGCAAAAGCAAAGCCTCCGAAAGCAGAGAAGAAGGAGAAGTATCTTCATCCGACAACGATACACAAACACACGACGTACATCCTGTTTGTTCTACAGTGCCTGCCTCGGTCACATCACCTATCTCATCCATTCTTCCTCCTAAGAATAAATGTAACGAAGGGATCCAGGCTGCTTCTGCTGATGTTTGCACAAGAACATCCATACAAACTACTTCTCAGAAGACGTGTGATACTGCTCAAGTTGTGAATAAAGCTAGTACTCCTTGGGGTGCTTCCAGGGAGGCCAATTCGAATCTTGTGATTAGCTTCTCAGATGACAGTGGCAGTGAATTGGAGGAATGCAGTAAAGTCAGAACTTCAAAATCTCATAGTGATGCTGTCAGGCACTTTAAACCTCCAACTTCAACACTTGATAGATCAAACAGGTTACGGAGTATGACAAGGAACAAAGTAGTGGCAAACAAACTGTCTTTGAGTCAGCCATTTATCCCTTCAATGACCAAGAATCACAGAGCATATTCCACGGGTGCTGGGCCCTCATTGGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGAAACCTACAAAGCCAAGGGCGTGGGAATGATCAAGGAATGAACTTAAACACTAGTAAGCTGCAGGACTTGCGGGAGCAGATAGCCATTTGCGAAAGCAAACTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGCATTTCAATTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAAGGGAATGCTACTATCTCTCAAGTTTCTTCACTGGGGCCTAAGGAACCAGATGCAAAGCGCCTGAAAACCAGTGGGTCTTATTCTACCAAGCTGAGTTTGAGTGGGCAACAACATCTTCGTGCAACGTATGCTGGAAAATCTGTTTTTCGGCCACAGGAGCCTGGAGAAGAGACACAGAACATTAAGGTCACTTACAACCAGAAAGGGAATTCTTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAACATGTGGCTGCTTCACCTTCACCTGGAATTGACCTTGGCAAAGTACAGGATGACACCGACATTGTTGCTAATGGAAATCAATCAGACTGGATCAGTAAGCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACGAGTCTTCCAAATGTCACATCCAATGTCCAAACTCAGTTCGATAATGTTGAGTTTCACCGTCAAAGTGATGGCCTTCAACCATCTGCATCAACTGCAAAATTATTTGAAGGAACACTTCCTCAATCAGCATCCAATGTCAAGATACCAGAGCCATGCAGTAATTTTTTTAAGTCATTGATAAACAGTAAAAGCTCCGGGTCTGCTTTTGGTAATTCATCAAGTTGCTTGGGCTTCAGCAATCTTGATCTCCAATCATTATTTGAAATGGAGGAGTCTCTAGACAAGGATCTGGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTCTTGATCTTTATCACAAGAGAGAATTATTTTCAGCTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTTAATTAGTTCCTCGAGACAGCAAGAAGACATGAAAATTGGTGTGGATCACTTAAATAGTATGTCTGGAAATGCAAATAGAGCTTCTTCTTTGTATCAGAAGCATTTTGAATATAATAGTTCTACTAAGCTACATAATGATTTAAATATGCAACATGAAAATGCTGGTCCCATCAATACTTCAAACCTGCATGAGAATGGACAAAATTTGGGGTCTGAACCTGGATCCTGCTCTGCCTTATGTGGTAATACATTGGATCCATTGCCTTCCAAAGGCAATAATATTGCAGATAGAAT
TTGCTCTCCATCCTTTGATCCAAACGTTTCAGTGGATGGAGATGAAGAGTCATTGCCTTCTGACCATGAAATGATCGATTCCTATGATGAATGCTACATAGGAAGAAAACAGTTTGAAGATGATCAATTGGAAGCATATAATATGTCAAAGAAAAACCACAGTGACAATAATATTGAGGATTCTTTGCGTCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGACTTGTAATCCATGCCATAACATTCAAACGTCAGTCGAACAGGGGACTGAGAATGATGCCAGAGACGATAGCACTCAGCAAAATAATACAGAACCTACAGTAGGTCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGACTGAGATTGCTTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGCACAAACATATGCAAAACCCCAGATGACATCCATGGTCGTTGTCATTTTGAGAACTTGCCATCAGAAGCTCAGGATTCTGCGGACTCTGATGAAAATGAACGATTCAATAGAGAAGGATCTTGCTCCAAAACTACTTTTAGTTTTACACCTTTGACTATGAACAGTGTTCTGCAACATATAAAGGCCATATCCTCAGTTAGTATAGAAGTCTTGCTCACTAGAACTCGAGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGCAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGACTGTCAGACCTATGTTTCAGAGTGATGGCTCTTATATTGATGATCTTGCGATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGCCCTTGGCAACATGTGAAGGACTACTCTTTTGCCAATAGAAGGCAGTGTCAGCATGGCCACATCAACTATTCTGATTCTTGCAATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGTATGAAGATGGCATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCTGATTCACATTCATATGACCCTGTTTTAACTCAGAAAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGCTATTTTTACATGATGCTCGTATAGTGGCCAATGGAAATTGGAATAGACCATCATCATACTTTCAGAGGGGAAGCTCTATATTGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAACCAGGAAACAAACAGTCGAGAGGGCATGAAAAAGGCTCTTCCTGTACTATCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGACCATTTACCTTCTAATATTCTATAGCTATACTACAACCGGGGGGAAGGATGACATGTTCTCTTTTGCGGTCAAGCACAATGGGCAATCTTATGAACTCTGGCTCATGTACATTAACAGCCGCATGAATCTCGATGCTCGATTGGCTGCATATGATGCTGCACTTTCTGCACTCTGCGACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAACGTGGAGAAGGGTATTCAGAGGATTTTTGGACTTCTTCGAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCGCATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTGTGTTGTGTATTTAGTTATTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCCATTCAATTGACAGATGGTGAGAGGCTGAGGGCTTCTAGGGTGGTCAAGAAAGCAGTCGATTTTGTTGATTCATGCCTGAACA
ATGAATCACTTGAAAGTAAATGCTACCAAAAATCTATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGACATAGGATTCAGTAGGAACTTGTTGGATAAGTATGTTAAACTTTATCCATCTTGCCTAGAACTTCTTTTACTTAAAGTACGGGCAAAGAAACATGGTTTTGGGGATGAAACTGTCGTGGCATTTGAACAAGCGATCAGGAACTGGCCGAAAGAAGTACCTGGTGTCCAATGCATCTGGAATCAATATGCTGAATATTTACTTCAGAATGGGAGAATCAAATGTACTGAAGAACTAATGGTGCGCTGGTTTGAGTCTACTTCAAAAATGGATTGTTCTAAAACTAGAACAGTGGATAATAGTGACTGTGACTCCTTGCACTTGCGAGAGTATGCTTCAGGATCAATTCTACATGCATTAGATTGCAGTCCCAATGAGGTGGACGTGGTGTTTTGGTATCTTAATCTTTCTGTTCACAAGTTACTGCTTAATGACCAATTAGAAGCACGTTTGGCCTTTGACAATGCTCTGAGGGCTGCAGGTTCTGGGACTTTTAGATATTGCATGAGAGAGTATGCTATGTTTTTGCTTACAGACGAATCCTTACTGAATGAGGCTGCTTCTGTTGGTGGAATAAGGAGCATTTTAGAGGGTTATCTCAACGATGCCCGAGCTTTCCCTGTCCCTAAACCATTATCCAGAAAATTCATTAACGATATCAAGAAGCCAAGAGTTCAACTTCTTGTCAGTAACATGCTGTCTCCACTTTCTCTGGATGTTTCTCTAGTGAACTGTATTCTTGAAGTCTGGTATGGGCCATCTCTTTTACCCCAAAAATTTAACAAACCAAGGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATGTTGCCTTCTAATTATCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAATGGTGACAACTCTTCCCAAGTTGCCTCCCCCAGTCTTATTTTCTGGGCCTGCTCAAATTTGATCAGTGCAATCTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGTCAATGTCAAAGGTTTTGAAGCCATATATGAGAGGTTTCACAAGAGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATACTACAACATATGTAAAACTAGAGGAGATACGAGTGCTGTTCTGCGAGAAGTAAATGAAAGGGGAATCGAACTCAACGAGCCTTCTTTGTGA
Protein sequence
MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQPFIPSMTKNHRAYSTGAGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQIAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLKTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTSNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGSAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLYQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKGNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNHSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNTEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLIFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTSAVLREVNERGIELNEPSL
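The protein sequence above is the conceptual translation of the coding sequence. A minimal sketch of that relationship in Python follows, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the short test fragments are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above
print(translate("ATGGAGAAGAAGAACTCCGAGGAGCTCACC"))
# Last five codons of the CDS, ending in the TGA stop codon
print(translate("GAGCCTTCTTTGTGA"))
```

Passing the full CDS, with its line breaks removed, to the same function should reproduce the complete protein sequence listed above.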
Homology
BLAST of ClCG09G020050 vs. NCBI nr
Match:
XP_038890115.1 (uncharacterized protein LOC120079791 isoform X3 [Benincasa hispida])
HSP 1 Score: 3206.4 bits (8312), Expect = 0.0e+00
Identity = 1620/1757 (92.20%), Positives = 1667/1757 (94.88%), Query Frame = 0
Query: 1 MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1 MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
Query: 61 ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61 ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120
Query: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180
Query: 181 PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181 PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240
Query: 241 IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q L PKEPD KRL
Sbjct: 241 IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300
Query: 301 KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
KTSGSYSTKLSLSGQQHLR YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301 KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360
Query: 361 QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQVDPHPLVVLD AT LPN+T
Sbjct: 361 QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMT 420
Query: 421 SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
SNVQTQFDNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421 SNVQTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
Query: 481 SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
+AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481 TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540
Query: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600
Query: 601 YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP CS L GN LDPLPSK
Sbjct: 601 YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660
Query: 661 GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661 GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720
Query: 721 HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721 QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780
Query: 781 TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEA 840
TEPTVGLAVGSD DL SKKTE LLSGKGDQQFGFGG N C TPDDIHGR HFENLPSE
Sbjct: 781 TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGPNRCNTPDDIHGRYHFENLPSET 840
Query: 841 QDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLG 900
QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEVLL RTRGSLSNLG
Sbjct: 841 QDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEVLLARTRGSLSNLG 900
Query: 901 FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 960
FPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP WPLCMYELRGKCNN
Sbjct: 901 FPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLWPLCMYELRGKCNN 960
Query: 961 DECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDI 1020
DECPWQHVKDYS ANRRQCQH HINYSDSCNGLSFSSDETK+FKYED MTPPTYLVGIDI
Sbjct: 961 DECPWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYEDCMTPPTYLVGIDI 1020
Query: 1021 LKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPS 1080
LKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHDARI A G+WNRPS
Sbjct: 1021 LKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPS 1080
Query: 1081 SYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1140
SYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLSRAVENNPKS+ALW
Sbjct: 1081 SYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLSRAVENNPKSVALW 1140
Query: 1141 TIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1200
TIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD
Sbjct: 1141 TIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1200
Query: 1201 NIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSD 1260
NI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVAMDSDEPYSFTHSD
Sbjct: 1201 NIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVAMDSDEPYSFTHSD 1260
Query: 1261 MLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASR 1320
ML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPAIQLTDGE+LRASR
Sbjct: 1261 MLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPAIQLTDGEKLRASR 1320
Query: 1321 VVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1380
VVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS
Sbjct: 1321 VVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1380
Query: 1381 CLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEE 1440
CLEL+LLKVRAKK FGDETVVAFEQAI NWPKEVPG+QCIWNQYAEYLLQNGRIKCTEE
Sbjct: 1381 CLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAEYLLQNGRIKCTEE 1440
Query: 1441 LMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSV 1500
LMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPNEVDVVFWYLNLSV
Sbjct: 1441 LMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPNEVDVVFWYLNLSV 1500
Query: 1501 HKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEG 1560
HKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNEAASVGGIR+ILEG
Sbjct: 1501 HKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNEAASVGGIRNILEG 1560
Query: 1561 YLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQ 1620
YLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNCILEVWYGPSLLPQ
Sbjct: 1561 YLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNCILEVWYGPSLLPQ 1620
Query: 1621 KFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLIFWACSNLISAIF 1680
KFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SLIFWACSNLISAIF
Sbjct: 1621 KFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASLIFWACSNLISAIF 1680
Query: 1681 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1740
SSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW SYYN+CKTRGDTS
Sbjct: 1681 SSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWTSYYNMCKTRGDTS 1740
Query: 1741 AVLREVNERGIELNEPS 1757
AVLREVNERGIELNEPS
Sbjct: 1741 AVLREVNERGIELNEPS 1757
BLAST of ClCG09G020050 vs. NCBI nr
Match:
XP_038890113.1 (uncharacterized protein LOC120079791 isoform X1 [Benincasa hispida])
HSP 1 Score: 3197.1 bits (8288), Expect = 0.0e+00
Identity = 1620/1770 (91.53%), Positives = 1667/1770 (94.18%), Query Frame = 0
Query: 1 MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1 MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
Query: 61 ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61 ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120
Query: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180
Query: 181 PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181 PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240
Query: 241 IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q L PKEPD KRL
Sbjct: 241 IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300
Query: 301 KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
KTSGSYSTKLSLSGQQHLR YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301 KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360
Query: 361 QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQVDPHPLVVLD AT LPN+T
Sbjct: 361 QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMT 420
Query: 421 SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
SNVQTQFDNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421 SNVQTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
Query: 481 SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
+AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481 TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540
Query: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600
Query: 601 YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP CS L GN LDPLPSK
Sbjct: 601 YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660
Query: 661 GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661 GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720
Query: 721 HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721 QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780
Query: 781 TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFG-------------GTNICKTPDDI 840
TEPTVGLAVGSD DL SKKTE LLSGKGDQQFGFG G N C TPDDI
Sbjct: 781 TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGDVGWDSLCLQPVGPNRCNTPDDI 840
Query: 841 HGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEV 900
HGR HFENLPSE QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEV
Sbjct: 841 HGRYHFENLPSETQDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEV 900
Query: 901 LLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSW 960
LL RTRGSLSNLGFPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP W
Sbjct: 901 LLARTRGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLW 960
Query: 961 PLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYED 1020
PLCMYELRGKCNNDECPWQHVKDYS ANRRQCQH HINYSDSCNGLSFSSDETK+FKYED
Sbjct: 961 PLCMYELRGKCNNDECPWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYED 1020
Query: 1021 GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHD 1080
MTPPTYLVGIDILKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHD
Sbjct: 1021 CMTPPTYLVGIDILKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHD 1080
Query: 1081 ARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLS 1140
ARI A G+WNRPSSYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLS
Sbjct: 1081 ARIEAKGSWNRPSSYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLS 1140
Query: 1141 RAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDAR 1200
RAVENNPKS+ALWTIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDAR
Sbjct: 1141 RAVENNPKSVALWTIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDAR 1200
Query: 1201 LAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVA 1260
LAAYDAALSALCDNI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVA
Sbjct: 1201 LAAYDAALSALCDNIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVA 1260
Query: 1261 MDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPA 1320
MDSDEPYSFTHSDML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPA
Sbjct: 1261 MDSDEPYSFTHSDMLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPA 1320
Query: 1321 IQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
IQLTDGE+LRASRVVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFS
Sbjct: 1321 IQLTDGEKLRASRVVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
Query: 1381 RNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAE 1440
RNLLDKYVKLYPSCLEL+LLKVRAKK FGDETVVAFEQAI NWPKEVPG+QCIWNQYAE
Sbjct: 1381 RNLLDKYVKLYPSCLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAE 1440
Query: 1441 YLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPN 1500
YLLQNGRIKCTEELMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPN
Sbjct: 1441 YLLQNGRIKCTEELMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPN 1500
Query: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNE 1560
EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNE
Sbjct: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNE 1560
Query: 1561 AASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNC 1620
AASVGGIR+ILEGYLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNC
Sbjct: 1561 AASVGGIRNILEGYLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNC 1620
Query: 1621 ILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSL 1680
ILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SL
Sbjct: 1621 ILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASL 1680
Query: 1681 IFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWK 1740
IFWACSNLISAIFSSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW
Sbjct: 1681 IFWACSNLISAIFSSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWT 1740
Query: 1741 SYYNICKTRGDTSAVLREVNERGIELNEPS 1757
SYYN+CKTRGDTSAVLREVNERGIELNEPS
Sbjct: 1741 SYYNMCKTRGDTSAVLREVNERGIELNEPS 1770
BLAST of ClCG09G020050 vs. NCBI nr
Match:
XP_038890114.1 (uncharacterized protein LOC120079791 isoform X2 [Benincasa hispida])
HSP 1 Score: 3169.0 bits (8215), Expect = 0.0e+00
Identity = 1610/1770 (90.96%), Positives = 1657/1770 (93.62%), Query Frame = 0
Query: 1 MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1 MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
Query: 61 ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61 ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120
Query: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180
Query: 181 PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181 PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240
Query: 241 IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q L PKEPD KRL
Sbjct: 241 IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300
Query: 301 KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
KTSGSYSTKLSLSGQQHLR YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301 KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360
Query: 361 QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQVDPHPLVVLD AT LPN+T
Sbjct: 361 QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMT 420
Query: 421 SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
SNVQTQFDNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421 SNVQTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
Query: 481 SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
+AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481 TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540
Query: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600
Query: 601 YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP CS L GN LDPLPSK
Sbjct: 601 YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660
Query: 661 GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661 GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720
Query: 721 HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721 QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780
Query: 781 TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFG-------------GTNICKTPDDI 840
TEPTVGLAVGSD DL SKKTE LLSGKGDQQFGFG G N C TPDDI
Sbjct: 781 TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGDVGWDSLCLQPVGPNRCNTPDDI 840
Query: 841 HGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEV 900
HGR HFENLPSE QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEV
Sbjct: 841 HGRYHFENLPSETQDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEV 900
Query: 901 LLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSW 960
LL RTRGSLSNLGFPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP W
Sbjct: 901 LLARTRGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLW 960
Query: 961 PLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYED 1020
PLCMYELRGKCNNDECPWQHVKDYS ANRRQCQH HINY SDETK+FKYED
Sbjct: 961 PLCMYELRGKCNNDECPWQHVKDYSLANRRQCQHDHINY----------SDETKIFKYED 1020
Query: 1021 GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHD 1080
MTPPTYLVGIDILKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHD
Sbjct: 1021 CMTPPTYLVGIDILKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHD 1080
Query: 1081 ARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLS 1140
ARI A G+WNRPSSYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLS
Sbjct: 1081 ARIEAKGSWNRPSSYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLS 1140
Query: 1141 RAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDAR 1200
RAVENNPKS+ALWTIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDAR
Sbjct: 1141 RAVENNPKSVALWTIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDAR 1200
Query: 1201 LAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVA 1260
LAAYDAALSALCDNI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVA
Sbjct: 1201 LAAYDAALSALCDNIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVA 1260
Query: 1261 MDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPA 1320
MDSDEPYSFTHSDML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPA
Sbjct: 1261 MDSDEPYSFTHSDMLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPA 1320
Query: 1321 IQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
IQLTDGE+LRASRVVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFS
Sbjct: 1321 IQLTDGEKLRASRVVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
Query: 1381 RNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAE 1440
RNLLDKYVKLYPSCLEL+LLKVRAKK FGDETVVAFEQAI NWPKEVPG+QCIWNQYAE
Sbjct: 1381 RNLLDKYVKLYPSCLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAE 1440
Query: 1441 YLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPN 1500
YLLQNGRIKCTEELMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPN
Sbjct: 1441 YLLQNGRIKCTEELMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPN 1500
Query: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNE 1560
EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNE
Sbjct: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNE 1560
Query: 1561 AASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNC 1620
AASVGGIR+ILEGYLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNC
Sbjct: 1561 AASVGGIRNILEGYLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNC 1620
Query: 1621 ILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSL 1680
ILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SL
Sbjct: 1621 ILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASL 1680
Query: 1681 IFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWK 1740
IFWACSNLISAIFSSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW
Sbjct: 1681 IFWACSNLISAIFSSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWT 1740
Query: 1741 SYYNICKTRGDTSAVLREVNERGIELNEPS 1757
SYYN+CKTRGDTSAVLREVNERGIELNEPS
Sbjct: 1741 SYYNMCKTRGDTSAVLREVNERGIELNEPS 1760
BLAST of ClCG09G020050 vs. NCBI nr
Match:
XP_038890116.1 (uncharacterized protein LOC120079791 isoform X4 [Benincasa hispida])
HSP 1 Score: 3138.2 bits (8135), Expect = 0.0e+00
Identity = 1597/1770 (90.23%), Positives = 1643/1770 (92.82%), Query Frame = 0
Query: 1 MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
MEKKN+EELTVK+MASNS+PSKSKAS+SREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP
Sbjct: 1 MEKKNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
Query: 61 ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
ISSILPPKNK N GIQA SADVCTRTSIQT SQK CD AQVVNK STPWGASREANSNLV
Sbjct: 61 ISSILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLV 120
Query: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
ISFSDDSGSELEECSKVRTSKSHSDAVRH+KPPTS +DRSN+LRSMTRNKVVANKLSLSQ
Sbjct: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQ 180
Query: 181 PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
PFIPSMTKNHRAYS G AGPSLAEQGSKIRAFSGNLQSQGRGNDQG NLNTSKLQDLREQ
Sbjct: 181 PFIPSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQ 240
Query: 241 IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
IAICESKLK KSAQQNKESIS+TNQDYIVTNSKSDL RKG+ATI Q L PKEPD KRL
Sbjct: 241 IAICESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRL 300
Query: 301 KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
KTSGSYSTKLSLSGQQHLR YAGKSVFRPQEPGEETQNIKVTYNQKG SL REESSVLK
Sbjct: 301 KTSGSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLK 360
Query: 361 QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
QSKEDIKHVAASPSPGIDLGKVQDD DIVANGNQ DWISKQ
Sbjct: 361 QSKEDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQ------------------- 420
Query: 421 SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
DNVEFHRQSDGLQPSAS AK FEGTLPQSASNVKIPEPCSNFFKSLINSKSSG
Sbjct: 421 -------DNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
Query: 481 SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
+AFGNS SCLGFSN DLQSLFEMEESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALI
Sbjct: 481 TAFGNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALI 540
Query: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIG DHLNSMSGNAN AS L
Sbjct: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPL 600
Query: 601 YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
YQKH EYNSST+LH DLNMQHENAGPIN+SNLHENGQNLGSEP CS L GN LDPLPSK
Sbjct: 601 YQKHSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSK 660
Query: 661 GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
GNNIADRICSPS DPNVSVDGDEESLPSDHEMIDSYDECY+G+KQFEDDQ+E YN+SKKN
Sbjct: 661 GNNIADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKN 720
Query: 721 HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
DNNIEDSLRLEAKLRSELFARLG RNLSKTCNPCHNIQT VEQGT++DARDD TQQNN
Sbjct: 721 QCDNNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNN 780
Query: 781 TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFG-------------GTNICKTPDDI 840
TEPTVGLAVGSD DL SKKTE LLSGKGDQQFGFG G N C TPDDI
Sbjct: 781 TEPTVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGDVGWDSLCLQPVGPNRCNTPDDI 840
Query: 841 HGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEV 900
HGR HFENLPSE QDSADSD+NE FNREGSCSKTTFSFTPLTMNSVLQHIKAI SVSIEV
Sbjct: 841 HGRYHFENLPSETQDSADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEV 900
Query: 901 LLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSW 960
LL RTRGSLSNLGFPEDGDSL+VDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDP W
Sbjct: 901 LLARTRGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLW 960
Query: 961 PLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYED 1020
PLCMYELRGKCNNDECPWQHVKDYS ANRRQCQH HINYSDSCNGLSFSSDETK+FKYED
Sbjct: 961 PLCMYELRGKCNNDECPWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYED 1020
Query: 1021 GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHD 1080
MTPPTYLVGIDILKADSHSYDPVL QKSSQCWQ+FFSISLTLPNLLQKDASADGLFLHD
Sbjct: 1021 CMTPPTYLVGIDILKADSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHD 1080
Query: 1081 ARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLS 1140
ARI A G+WNRPSSYFQRGSSILSQLKQGDE+LALETALIIINQETNSREGMKKALPVLS
Sbjct: 1081 ARIEAKGSWNRPSSYFQRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLS 1140
Query: 1141 RAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDAR 1200
RAVENNPKS+ALWTIYLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDAR
Sbjct: 1141 RAVENNPKSVALWTIYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDAR 1200
Query: 1201 LAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVA 1260
LAAYDAALSALCDNI T NLDGKYAS HILDLILQMTNCLCMSGNVEK IQRI GLLRVA
Sbjct: 1201 LAAYDAALSALCDNIVTPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVA 1260
Query: 1261 MDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPA 1320
MDSDEPYSFTHSDML CLNISDKCIFWVCVVYLVIYRKLPHA+VQQLECEKELIEIEWPA
Sbjct: 1261 MDSDEPYSFTHSDMLTCLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPA 1320
Query: 1321 IQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
IQLTDGE+LRASRVVKKAVDFVDSC NNES +SKCYQKSIQMFAVNHIRCLMAFEDIGFS
Sbjct: 1321 IQLTDGEKLRASRVVKKAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFS 1380
Query: 1381 RNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAE 1440
RNLLDKYVKLYPSCLEL+LLKVRAKK FGDETVVAFEQAI NWPKEVPG+QCIWNQYAE
Sbjct: 1381 RNLLDKYVKLYPSCLELILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAE 1440
Query: 1441 YLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPN 1500
YLLQNGRIKCTEELMVRWFEST KMDCSKTRT+DN DCD L+L +YASGSI+HA+DCSPN
Sbjct: 1441 YLLQNGRIKCTEELMVRWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPN 1500
Query: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNE 1560
EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAA SGTFRYCMREYAMFLLTDESLLNE
Sbjct: 1501 EVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNE 1560
Query: 1561 AASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNC 1620
AASVGGIR+ILEGYLNDARAFP+P+PLSRKFINDIKKPRV+LL+SNMLSPLS DVSLVNC
Sbjct: 1561 AASVGGIRNILEGYLNDARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNC 1620
Query: 1621 ILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSL 1680
ILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQLCNGD+SSQ AS SL
Sbjct: 1621 ILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDDSSQAASASL 1680
Query: 1681 IFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWK 1740
IFWACSNLISAIFSSVPIPPE +WVEAANIL+NVKGFEAI ERFHKRALSVYPFSVQLW
Sbjct: 1681 IFWACSNLISAIFSSVPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWT 1740
Query: 1741 SYYNICKTRGDTSAVLREVNERGIELNEPS 1757
SYYN+CKTRGDTSAVLREVNERGIELNEPS
Sbjct: 1741 SYYNMCKTRGDTSAVLREVNERGIELNEPS 1744
BLAST of ClCG09G020050 vs. NCBI nr
Match:
XP_011655356.2 (uncharacterized protein LOC101211906 [Cucumis sativus] >KGN51732.2 hypothetical protein Csa_009223 [Cucumis sativus])
HSP 1 Score: 2996.8 bits (7768), Expect = 0.0e+00
Identity = 1530/1757 (87.08%), Positives = 1619/1757 (92.15%), Query Frame = 0
Query: 2 EKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPI 61
++K+S+ELT+K+M SNSKP+K KAS+ +EEGEVSSSDNDTQTHDVHPVCSTVPAS+ S I
Sbjct: 15 KEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRI 74
Query: 62 SSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVI 121
SSILPPKNKCN GI+ ASADVCTRTSI T SQK D AQ+VNKASTPW ASR+ANSNLVI
Sbjct: 75 SSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVI 134
Query: 122 SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQP 181
SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSN+LRSMTRNKVV NKL LSQ
Sbjct: 135 SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQA 194
Query: 182 FIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQI 241
FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQGMN+NTSKLQDLR+QI
Sbjct: 195 FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQI 254
Query: 242 AICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLK 301
AI ESKLKLKSAQQNKE + +TNQDYIVTNSKSDLGRKGNATISQ LGPK+ +AKR+K
Sbjct: 255 AIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMK 314
Query: 302 TSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQ 361
TSGSYS+KLSL+GQQ LR+ A K ++ PQEPGEETQNIK +YNQKG SLSREESSVLKQ
Sbjct: 315 TSGSYSSKLSLNGQQ-LRSLIAAKFIW-PQEPGEETQNIKGSYNQKGKSLSREESSVLKQ 374
Query: 362 SKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTS 421
SKEDIKHVAASPS GIDLGKVQDDTDIVANGNQSD+I QVDPHPLVVLDQAT+LPNV S
Sbjct: 375 SKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVAS 434
Query: 422 NVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGS 481
NVQ+QFDNVEFHRQSDGLQPSASTAK FE T PQSASNVK PEPCSNFFKSLINSK+SG+
Sbjct: 435 NVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGT 494
Query: 482 AFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 541
AFGN SSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE
Sbjct: 495 AFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 554
Query: 542 ANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLY 601
ANSRC++LYHKRELFS HFHSFCMNNPG +SSSRQQEDM I VDHLNSMSG+AN AS LY
Sbjct: 555 ANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLY 614
Query: 602 QKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKG 661
QKH EYNSST+LHNDLNMQ ENAG INTSNLHENGQ+LGSEPGSCS L GNTLDPLP KG
Sbjct: 615 QKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKG 674
Query: 662 NNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNH 721
NNIADRI SPS DPNVS+DGDEES PSDHEMIDSY+ECY+ +K FE+DQ+EAYN SK NH
Sbjct: 675 NNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNH 734
Query: 722 SDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNT 781
DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+N+QTSVEQGTENDARDD TQQNNT
Sbjct: 735 CDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNT 794
Query: 782 EPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEAQ 841
E TV LAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTPD+IHGR HFENLPSEA
Sbjct: 795 ELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAP 854
Query: 842 DSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLGF 901
D DSD+NE F+REGSCSKTT SFTPLTMNSVLQH+K ISSVSIEVLLTRT GSLSNLGF
Sbjct: 855 DLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGF 914
Query: 902 PEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNND 961
PEDGDSL+VDQIHWRKLKENSVHE RPM QSDGSY DDLAIDPSWPLCMYELRGKCNND
Sbjct: 915 PEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNND 974
Query: 962 ECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDIL 1021
ECPWQH+KD+SFANR QCQHGHIN SSDETKVFK ED MTPPTYLVGIDIL
Sbjct: 975 ECPWQHMKDFSFANRSQCQHGHIN----------SSDETKVFKNEDQMTPPTYLVGIDIL 1034
Query: 1022 KADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPSS 1081
KADS SY VL Q+SSQCWQSFFSISLTLPNLLQKDASADGLFLHDARI A G+WNRPSS
Sbjct: 1035 KADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSS 1094
Query: 1082 YFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWT 1141
YFQRG S+LSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKSIALW
Sbjct: 1095 YFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWA 1154
Query: 1142 IYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDN 1201
+YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALC N
Sbjct: 1155 VYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHN 1214
Query: 1202 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDM 1261
IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEK IQRIFGLL+VAMDSDEPYSFTHSDM
Sbjct: 1215 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDM 1274
Query: 1262 LACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASRV 1321
L CLNISDKCIFWV VVYLV+YRKLPHAIVQQLECEKELIEIEWPA+ LT+GE+LRASRV
Sbjct: 1275 LTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRV 1334
Query: 1322 VKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSC 1381
VKKAVDFVDSCLNNESL+SKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPSC
Sbjct: 1335 VKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSC 1394
Query: 1382 LELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEEL 1441
ELLLL +RA+KH FGD TV+AFE+ IR WPKEVPGVQCIWNQYAEYLL+NGRIKCTEEL
Sbjct: 1395 PELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEEL 1454
Query: 1442 MVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSVH 1501
M R F+STSKMDCSKTRT NSDCDSLHL ++ASGSI+ ALDCSPNEVDVVFWYLN SVH
Sbjct: 1455 MARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVH 1514
Query: 1502 KLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1561
KLLLNDQLEARLAF+NALRAA S TFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY
Sbjct: 1515 KLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1574
Query: 1562 LNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQK 1621
LNDARAFPVP+PLSR+FI DI+KPRV+LLVSNMLSP+S DVSLVNCILEVWYGPSLLPQK
Sbjct: 1575 LNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQK 1634
Query: 1622 FNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVASPSLIFWACSNLISAIF 1681
FNKP+ELVDFVETILE+LPSNYQLVLSVCKQLCN DN SSQ ASPSLIFWACSNLI AIF
Sbjct: 1635 FNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIF 1694
Query: 1682 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1741
SSVPIPPEFIWVEAANIL NVKG EAI ERFHKRALSVYPFSVQLWKSYYNIC+TRGDTS
Sbjct: 1695 SSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTS 1754
Query: 1742 AVLREVNERGIELNEPS 1757
AVL+EVNERGI+LNEPS
Sbjct: 1755 AVLQEVNERGIQLNEPS 1759
BLAST of ClCG09G020050 vs. ExPASy Swiss-Prot
Match:
O60293 (Zinc finger C3H1 domain-containing protein OS=Homo sapiens OX=9606 GN=ZFC3H1 PE=1 SV=3)
HSP 1 Score: 73.6 bits (179), Expect = 2.6e-11
Identity = 99/482 (20.54%), Positives = 179/482 (37.14%), Query Frame = 0
Query: 942 IDPSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKV 1001
I+P C ++L G CN+D+C WQH++DY+ +R+Q ++Y+ S G + +S ++
Sbjct: 1179 IEPDQCFCRFDLTGTCNDDDCQWQHIQDYTL-SRKQLFQDILSYNLSLIGCAETSTNEEI 1238
Query: 1002 F----KYED----------GMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISL 1061
KY + M L+ +I ++ H+ P T K + W+ F
Sbjct: 1239 TASAEKYVEKLFGVNKDRMSMDQMAVLLVSNINESKGHT-PPFTTYKDKRKWKPKFWRKP 1298
Query: 1062 TLPNLLQKDASADGLFLHDARIVANGNWNRPS----------SYFQRGSSILSQLK---- 1121
N D + A N P+ YF + ++ L+
Sbjct: 1299 ISDNSFSSDEEQSTGPIKYA-FQPENQINVPALDTVVTPDDVRYFTNETDDIANLEASVL 1358
Query: 1122 --QGDENLALETALIIINQ-ETNSREGMKKALPVLSRAVENNPKSIALWTIYLLIFYSYT 1181
L L+ A +NQ E E + AL VL+RA+ENN + +W YL +F
Sbjct: 1359 ENPSHVQLWLKLAYKYLNQNEGECSESLDSALNVLARALENNKDNPEIWCHYLRLFSKRG 1418
Query: 1182 TTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDNIF--THNLDG 1241
T +M AV++ W ++L++ D + + +
Sbjct: 1419 TKDEVQEMCETAVEYAPDYQSFWTF-----LHLESTFEEKDYVCERMLEFLMGAAKQETS 1478
Query: 1242 KYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDMLACLNISD 1301
S +L+ +L +G + + + L+ A D + L SD
Sbjct: 1479 NILSFQLLEALLFRVQLHIFTGRCQSALAILQNALKSAND---------GIVAEYLKTSD 1538
Query: 1302 KCIFWVCVVYLVIYRKLPHAIVQQLE------CEKELIEIEWPAIQLTDGERLRASRVVK 1361
+C+ W+ ++L+ + LP E + W A+Q + ++
Sbjct: 1539 RCLAWLAYIHLIEFNILPSKFYDPSNDNPSRIVNTESFVMPWQAVQ---DVKTNPDMLLA 1598
Query: 1362 KAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLE 1385
D V +C + ES ++ I+ CL + ++ LL++Y C
Sbjct: 1599 VFEDAVKACTD----ESLAVEERIE-------ACLPLYTNMIALHQLLERYEAAMELCKS 1629
BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match:
A0A0A0KS73 (zf-C3H1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G512890 PE=4 SV=1)
HSP 1 Score: 2996.8 bits (7768), Expect = 0.0e+00
Identity = 1530/1757 (87.08%), Positives = 1619/1757 (92.15%), Query Frame = 0
Query: 2 EKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPI 61
++K+S+ELT+K+M SNSKP+K KAS+ +EEGEVSSSDNDTQTHDVHPVCSTVPAS+ S I
Sbjct: 7 KEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRI 66
Query: 62 SSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVI 121
SSILPPKNKCN GI+ ASADVCTRTSI T SQK D AQ+VNKASTPW ASR+ANSNLVI
Sbjct: 67 SSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVI 126
Query: 122 SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQP 181
SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSN+LRSMTRNKVV NKL LSQ
Sbjct: 127 SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQA 186
Query: 182 FIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQI 241
FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQGMN+NTSKLQDLR+QI
Sbjct: 187 FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQI 246
Query: 242 AICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLK 301
AI ESKLKLKSAQQNKE + +TNQDYIVTNSKSDLGRKGNATISQ LGPK+ +AKR+K
Sbjct: 247 AIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMK 306
Query: 302 TSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQ 361
TSGSYS+KLSL+GQQ LR+ A K ++ PQEPGEETQNIK +YNQKG SLSREESSVLKQ
Sbjct: 307 TSGSYSSKLSLNGQQ-LRSLIAAKFIW-PQEPGEETQNIKGSYNQKGKSLSREESSVLKQ 366
Query: 362 SKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTS 421
SKEDIKHVAASPS GIDLGKVQDDTDIVANGNQSD+I QVDPHPLVVLDQAT+LPNV S
Sbjct: 367 SKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVAS 426
Query: 422 NVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGS 481
NVQ+QFDNVEFHRQSDGLQPSASTAK FE T PQSASNVK PEPCSNFFKSLINSK+SG+
Sbjct: 427 NVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGT 486
Query: 482 AFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 541
AFGN SSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE
Sbjct: 487 AFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIE 546
Query: 542 ANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLY 601
ANSRC++LYHKRELFS HFHSFCMNNPG +SSSRQQEDM I VDHLNSMSG+AN AS LY
Sbjct: 547 ANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLY 606
Query: 602 QKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKG 661
QKH EYNSST+LHNDLNMQ ENAG INTSNLHENGQ+LGSEPGSCS L GNTLDPLP KG
Sbjct: 607 QKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKG 666
Query: 662 NNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNH 721
NNIADRI SPS DPNVS+DGDEES PSDHEMIDSY+ECY+ +K FE+DQ+EAYN SK NH
Sbjct: 667 NNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNH 726
Query: 722 SDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNT 781
DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+N+QTSVEQGTENDARDD TQQNNT
Sbjct: 727 CDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNT 786
Query: 782 EPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEAQ 841
E TV LAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTPD+IHGR HFENLPSEA
Sbjct: 787 ELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAP 846
Query: 842 DSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLGF 901
D DSD+NE F+REGSCSKTT SFTPLTMNSVLQH+K ISSVSIEVLLTRT GSLSNLGF
Sbjct: 847 DLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGF 906
Query: 902 PEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNND 961
PEDGDSL+VDQIHWRKLKENSVHE RPM QSDGSY DDLAIDPSWPLCMYELRGKCNND
Sbjct: 907 PEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNND 966
Query: 962 ECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDIL 1021
ECPWQH+KD+SFANR QCQHGHIN SSDETKVFK ED MTPPTYLVGIDIL
Sbjct: 967 ECPWQHMKDFSFANRSQCQHGHIN----------SSDETKVFKNEDQMTPPTYLVGIDIL 1026
Query: 1022 KADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPSS 1081
KADS SY VL Q+SSQCWQSFFSISLTLPNLLQKDASADGLFLHDARI A G+WNRPSS
Sbjct: 1027 KADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSS 1086
Query: 1082 YFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWT 1141
YFQRG S+LSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKSIALW
Sbjct: 1087 YFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWA 1146
Query: 1142 IYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDN 1201
+YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALC N
Sbjct: 1147 VYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHN 1206
Query: 1202 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSDM 1261
IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEK IQRIFGLL+VAMDSDEPYSFTHSDM
Sbjct: 1207 IFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDM 1266
Query: 1262 LACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASRV 1321
L CLNISDKCIFWV VVYLV+YRKLPHAIVQQLECEKELIEIEWPA+ LT+GE+LRASRV
Sbjct: 1267 LTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRV 1326
Query: 1322 VKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSC 1381
VKKAVDFVDSCLNNESL+SKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPSC
Sbjct: 1327 VKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSC 1386
Query: 1382 LELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEEL 1441
ELLLL +RA+KH FGD TV+AFE+ IR WPKEVPGVQCIWNQYAEYLL+NGRIKCTEEL
Sbjct: 1387 PELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEEL 1446
Query: 1442 MVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSVH 1501
M R F+STSKMDCSKTRT NSDCDSLHL ++ASGSI+ ALDCSPNEVDVVFWYLN SVH
Sbjct: 1447 MARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVH 1506
Query: 1502 KLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1561
KLLLNDQLEARLAF+NALRAA S TFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY
Sbjct: 1507 KLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGY 1566
Query: 1562 LNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQK 1621
LNDARAFPVP+PLSR+FI DI+KPRV+LLVSNMLSP+S DVSLVNCILEVWYGPSLLPQK
Sbjct: 1567 LNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQK 1626
Query: 1622 FNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVASPSLIFWACSNLISAIF 1681
FNKP+ELVDFVETILE+LPSNYQLVLSVCKQLCN DN SSQ ASPSLIFWACSNLI AIF
Sbjct: 1627 FNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIF 1686
Query: 1682 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1741
SSVPIPPEFIWVEAANIL NVKG EAI ERFHKRALSVYPFSVQLWKSYYNIC+TRGDTS
Sbjct: 1687 SSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTS 1746
Query: 1742 AVLREVNERGIELNEPS 1757
AVL+EVNERGI+LNEPS
Sbjct: 1747 AVLQEVNERGIQLNEPS 1751
BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match:
A0A1S3CJD3 (uncharacterized protein LOC103501638 OS=Cucumis melo OX=3656 GN=LOC103501638 PE=4 SV=1)
HSP 1 Score: 2987.6 bits (7744), Expect = 0.0e+00
Identity = 1527/1758 (86.86%), Positives = 1615/1758 (91.87%), Query Frame = 0
Query: 2 EKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPI 61
++KNS+ELTVK+ SNSKPSK KAS+++EEGE+SSSDNDTQTHDV PVCSTVPAS+ SPI
Sbjct: 7 KEKNSDELTVKSTPSNSKPSKIKASDTKEEGELSSSDNDTQTHDVRPVCSTVPASIASPI 66
Query: 62 SSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVI 121
SS LPPK+KCN GIQ ASAD+C RTSI T SQK D AQ+VNKASTPWGASR+ANSNLVI
Sbjct: 67 SSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKASTPWGASRKANSNLVI 126
Query: 122 SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQP 181
SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSN+LRSMTRNKV+ANKL LSQ
Sbjct: 127 SFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRSMTRNKVMANKLPLSQV 186
Query: 182 FIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQI 241
FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLR+QI
Sbjct: 187 FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLRQQI 246
Query: 242 AICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLK 301
AI ESKLKLKSAQQNKES+ +TNQDYIVTNSK DLGRKGN TISQ LGPKEP+ KR+K
Sbjct: 247 AIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTISQFPPLGPKEPNVKRMK 306
Query: 302 TSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQ 361
TSGSYS+KLSL+ QQ L + A K V+ PQEPGEE QNIK +YNQKG SLSREE+SVLKQ
Sbjct: 307 TSGSYSSKLSLNEQQ-LHSLIAAKFVW-PQEPGEEIQNIKGSYNQKGKSLSREEASVLKQ 366
Query: 362 SKEDIKHVAASPSPGIDLGKVQDD-TDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 421
SKEDIKHVAASPS GIDLGKVQDD TDIVANGN SD I KQVDPHPLVVLDQAT+LPNV
Sbjct: 367 SKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDPHPLVVLDQATALPNVA 426
Query: 422 SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 481
SNVQ+QFDNVEF RQSDGLQPSASTAK FEGT PQSA NVKIPEPCSNFFKSLIN KSSG
Sbjct: 427 SNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPEPCSNFFKSLINCKSSG 486
Query: 482 SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 541
+AFGNSSSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI
Sbjct: 487 TAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 546
Query: 542 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 601
EANSRCLDLY+KRELFSAHFHSFCMNNPG +SSSRQQEDM I VDHLNSMSGNAN S L
Sbjct: 547 EANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGNANITSPL 606
Query: 602 YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 661
YQKH EYNSST+L NDLNMQHENAGPINTSNLHENGQNLGSEPGSCS L GNT+DPLP K
Sbjct: 607 YQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSDLGGNTVDPLPFK 666
Query: 662 GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 721
GNNIADRICSPS DPN+S+DGDEESLPSDHEMIDSY+ECY+ +K FEDDQ+EAYNM KKN
Sbjct: 667 GNNIADRICSPSVDPNISLDGDEESLPSDHEMIDSYNECYVRKKHFEDDQMEAYNMLKKN 726
Query: 722 HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 781
H DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+NIQTSVEQGTENDAR+D TQQNN
Sbjct: 727 HCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVEQGTENDARNDRTQQNN 786
Query: 782 TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEA 841
TE TVGLAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTPD+IHG HFENLPSE
Sbjct: 787 TELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGPYHFENLPSET 846
Query: 842 QDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLG 901
D DSD+NE F+REGSCSKTTFSFTPLTMNSVLQH+K ISSVSIEVLLTRT NLG
Sbjct: 847 PDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVSIEVLLTRT----LNLG 906
Query: 902 FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 961
FPEDGDSL+VD+IHWRK ENSV E VRPM QSDGSY DDLAIDPSWPLCMYELRGKCNN
Sbjct: 907 FPEDGDSLEVDRIHWRKFIENSVLEIVRPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNN 966
Query: 962 DECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDI 1021
DECPWQH+KD+SFANRRQCQHGHIN SSDETKVFKYED MTPPTYLVGIDI
Sbjct: 967 DECPWQHMKDFSFANRRQCQHGHIN----------SSDETKVFKYEDRMTPPTYLVGIDI 1026
Query: 1022 LKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPS 1081
LKADS SY PVL Q+SSQCWQ+FFSISLTLPNLL+KDASADGLFLHDARI A G+WNRPS
Sbjct: 1027 LKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLFLHDARIEAKGSWNRPS 1086
Query: 1082 SYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1141
SYFQRG S+LSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW
Sbjct: 1087 SYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1146
Query: 1142 TIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1201
+YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALCD
Sbjct: 1147 AVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCD 1206
Query: 1202 NIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSD 1261
NIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEK IQRIFGLL+VAMDSDEPYSF HSD
Sbjct: 1207 NIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFMHSD 1266
Query: 1262 MLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASR 1321
ML CLNISDKCIFWVCVVYLV+YRKLPHAIVQQLECEKELIEIEWPA+QLT+GE+LRASR
Sbjct: 1267 MLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVQLTNGEKLRASR 1326
Query: 1322 VVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1381
VVKK VDF DSCLNNES ESKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPS
Sbjct: 1327 VVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPS 1386
Query: 1382 CLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEE 1441
C EL+LL +RA+KH FGD TVVAFEQAIR WPKEVPG+QCIWNQYAEYLL+NGRIKCTEE
Sbjct: 1387 CPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQYAEYLLRNGRIKCTEE 1446
Query: 1442 LMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSV 1501
LM RWF STSKMDCSKTRT NSDCDSLHL ++ASGSI+ ALDCSP+EVDVVFWYLN SV
Sbjct: 1447 LMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPSEVDVVFWYLNHSV 1506
Query: 1502 HKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEG 1561
HKLL+NDQLEARLAFDNALRAA +GTFRYCMREYAMFLLTD SLLNEAASVGGIRSILEG
Sbjct: 1507 HKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTDGSLLNEAASVGGIRSILEG 1566
Query: 1562 YLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQ 1621
YLNDARAFPV +PLSR+FINDIKKPRV+LLVSN LSP+S DVSLVNCILEVWYGPSLLPQ
Sbjct: 1567 YLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSLVNCILEVWYGPSLLPQ 1626
Query: 1622 KFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVASPSLIFWACSNLISAI 1681
KFNKP+ELVDFVETILEMLPSNYQLVLSVCKQL NGDN SSQ ASPSLIFWACSNLI+AI
Sbjct: 1627 KFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAASPSLIFWACSNLITAI 1686
Query: 1682 FSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDT 1741
F+ VPIPPEFIWVEAANILVNVKG EAI ERFHKRALSVYPFSVQLWKSYY++CKTRGDT
Sbjct: 1687 FNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSVQLWKSYYSMCKTRGDT 1746
Query: 1742 SAVLREVNERGIELNEPS 1757
S VL+EVNERGIELNEPS
Sbjct: 1747 STVLQEVNERGIELNEPS 1748
BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match:
A0A5D3C3A9 (Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G003300 PE=4 SV=1)
HSP 1 Score: 2934.4 bits (7606), Expect = 0.0e+00
Identity = 1494/1714 (87.16%), Positives = 1576/1714 (91.95%), Query Frame = 0
Query: 46 VHPVCSTVPASVTSPISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKA 105
V PVCSTVPAS+ SPISS LPPK+KCN GIQ ASAD+C RTSI T SQK D AQ+VNKA
Sbjct: 167 VRPVCSTVPASIASPISSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKA 226
Query: 106 STPWGASREANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRS 165
STPWGASR+ANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSN+LRS
Sbjct: 227 STPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRS 286
Query: 166 MTRNKVVANKLSLSQPFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQ 225
MTRNKV+ANKL LSQ FIPSMTKNH+AYS G AGPS AEQGSKIRAFSGNLQSQGRGNDQ
Sbjct: 287 MTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ 346
Query: 226 GMNLNTSKLQDLREQIAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATIS 285
GMNLNTSKLQDLR+QIAI ESKLKLKSAQQNKES+ +TNQDYIVTNSK DLGRKGN TIS
Sbjct: 347 GMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTIS 406
Query: 286 QVSSLGPKEPDAKRLKTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYN 345
Q LGPKEP+ KR+KTSGSYS+KLSL+ QQ L + A K V+ PQEPGEE QNIK +YN
Sbjct: 407 QFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQ-LHSLIAAKFVW-PQEPGEEIQNIKGSYN 466
Query: 346 QKGNSLSREESSVLKQSKEDIKHVAASPSPGIDLGKVQDD-TDIVANGNQSDWISKQVDP 405
QKG SLSREE+SVLKQSKEDIKHVAASPS GIDLGKVQDD TDIVANGN SD I KQVDP
Sbjct: 467 QKGKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDP 526
Query: 406 HPLVVLDQATSLPNVTSNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPE 465
HPLVVLDQAT+LPNV SNVQ+QFDNVEF RQSDGLQPSASTAK FEGT PQSA NVKIPE
Sbjct: 527 HPLVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPE 586
Query: 466 PCSNFFKSLINSKSSGSAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIE 525
PCSNFFKSLIN KSSG+AFGNSSSCL F N DLQSLFE+EESLDKDLEEAQDCRRQCEIE
Sbjct: 587 PCSNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIE 646
Query: 526 ERNAFKIYSRAQRALIEANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGV 585
ERNAFKIYSRAQRALIEANSRCLDLY+KRELFSAHFHSFCMNNPG +SSSRQQEDM I V
Sbjct: 647 ERNAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDV 706
Query: 586 DHLNSMSGNANRASSLYQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPG 645
DHLNSMSGNAN S LYQKH EYNSST+L NDLNMQHENAGPINTSNLHENGQNLGSEPG
Sbjct: 707 DHLNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPG 766
Query: 646 SCSALCGNTLDPLPSKGNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRK 705
SCS L GNT+DPLP KGNNIADRICSPS +PN+S+DGDEESLPSDHEMIDSY+ECY+ +K
Sbjct: 767 SCSDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDHEMIDSYNECYMRKK 826
Query: 706 QFEDDQLEAYNMSKKNHSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVE 765
FEDDQ+EAYNM KKNH DNNIEDSLRLEAKLRSELFARLGTRNLSK CNPC+NIQTSVE
Sbjct: 827 HFEDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVE 886
Query: 766 QGTENDARDDSTQQNNTEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTP 825
QGTENDAR+D TQQNNTE TVGLAVGSDVDLISKK E ALLSGKGDQQFGFGGT+ CKTP
Sbjct: 887 QGTENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTP 946
Query: 826 DDIHGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVS 885
D+IHG HFENLPSE D DSD+NE F+REGSCSKTTFSFTPLTMNSVLQH+K ISSVS
Sbjct: 947 DEIHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVS 1006
Query: 886 IEVLLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAID 945
IEVLL+RT NLGFPEDGDSL+VD+IHWRK ENSVHE VRPM QSDGSY DDLAID
Sbjct: 1007 IEVLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPMLQSDGSYTDDLAID 1066
Query: 946 PSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFK 1005
PSWPLCMYELRGKCNNDECPWQH+KD+SFANRRQCQHGHIN SSDETKVFK
Sbjct: 1067 PSWPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHIN----------SSDETKVFK 1126
Query: 1006 YEDGMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLF 1065
YED MTPPTYLVGIDILKADS SY PVL Q+SSQCWQ+FFSISLTLPNLL+KDASADGLF
Sbjct: 1127 YEDRMTPPTYLVGIDILKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLF 1186
Query: 1066 LHDARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALP 1125
LHDARI A G+WNRPSSYFQRG S+LSQLKQGDENLALETALIIINQETNSREGMKKALP
Sbjct: 1187 LHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKKALP 1246
Query: 1126 VLSRAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNL 1185
VLSRAVENNPKSIALW +YLLIFYSYTTTGGKDDMFS+AVKHNGQSYELWLMYINSRMNL
Sbjct: 1247 VLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNL 1306
Query: 1186 DARLAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLL 1245
DARLAAYD+A+SALCDNIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEK IQRIFGLL
Sbjct: 1307 DARLAAYDSAISALCDNIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLL 1366
Query: 1246 RVAMDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIE 1305
+VAMDSDEPYSF HSDML CLNISDKCIFWVCVVYLV+YRKLPHAIVQQLECEKELIEIE
Sbjct: 1367 QVAMDSDEPYSFMHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIE 1426
Query: 1306 WPAIQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDI 1365
WPA+QLT+GE+LRASRVVKK VDF DSCLNNES ESKCYQKSIQMFAVNHIRCLMAFEDI
Sbjct: 1427 WPAVQLTNGEKLRASRVVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDI 1486
Query: 1366 GFSRNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQ 1425
FSRNLLDKYVKLYPSC EL+LL +RA+KH FGD TVVAFEQAIR WPKEVPG+QCIWNQ
Sbjct: 1487 EFSRNLLDKYVKLYPSCPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQ 1546
Query: 1426 YAEYLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDC 1485
YAEYLL+NGRIKCTEELM RWF STSKMDCSKTRT NSDCDSLHL ++ASGSI+ ALDC
Sbjct: 1547 YAEYLLRNGRIKCTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDC 1606
Query: 1486 SPNEVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESL 1545
SP+EVDVVFWYLN SVHKLL+NDQLEARLAFDNALRAA +GTFRYCMREYAMFLLTDESL
Sbjct: 1607 SPSEVDVVFWYLNHSVHKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTDESL 1666
Query: 1546 LNEAASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSL 1605
LNEAASVGGIRSILEGYLNDARAFPV +PLSR+FINDIKKPRV+LLVSN LSP+S DVSL
Sbjct: 1667 LNEAASVGGIRSILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSL 1726
Query: 1606 VNCILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDN-SSQVA 1665
VNCILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVCKQL NGDN SSQ A
Sbjct: 1727 VNCILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAA 1786
Query: 1666 SPSLIFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSV 1725
SPSLIFWACSNLI+AIF+ VPIPPEFIWVEAANILVNVKG EAI ERFHKRALSVYPFSV
Sbjct: 1787 SPSLIFWACSNLITAIFNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSV 1846
Query: 1726 QLWKSYYNICKTRGDTSAVLREVNERGIELNEPS 1757
QLWKSYY++CKTRGDTS VL+EVNERGIELNEPS
Sbjct: 1847 QLWKSYYSMCKTRGDTSTVLQEVNERGIELNEPS 1864
BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match:
A0A5A7VFE0 (Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold32G00140 PE=4 SV=1)
HSP 1 Score: 2867.0 bits (7431), Expect = 0.0e+00
Identity = 1459/1668 (87.47%), Postives = 1538/1668 (92.21%), Query Frame = 0
Query: 92 SQKTCDTAQVVNKASTPWGASREANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFK 151
SQK D AQ+VNKASTPWGASR+ANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFK
Sbjct: 2 SQKIRDNAQIVNKASTPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFK 61
Query: 152 PPTSTLDRSNRLRSMTRNKVVANKLSLSQPFIPSMTKNHRAYSTG-AGPSLAEQGSKIRA 211
PPTSTLDRSN+LRSMTRNKV+ANKL LSQ FIPSMTKNH+AYS G AGPS AEQGSKIRA
Sbjct: 62 PPTSTLDRSNKLRSMTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRA 121
Query: 212 FSGNLQSQGRGNDQGMNLNTSKLQDLREQIAICESKLKLKSAQQNKESISITNQDYIVTN 271
FSGNLQSQGRGNDQGMNLNTSKLQDLR+QIAI ESKLKLKSAQQNKES+ +TNQDYIVTN
Sbjct: 122 FSGNLQSQGRGNDQGMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTN 181
Query: 272 SKSDLGRKGNATISQVSSLGPKEPDAKRLKTSGSYSTKLSLSGQQHLRATYAGKSVFRPQ 331
SK DLGRKGN TISQ LGPKEP+ KR+KTSGSYS+KLSL+ QQ L + A K V+ PQ
Sbjct: 182 SKPDLGRKGNNTISQFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQ-LHSLIAAKFVW-PQ 241
Query: 332 EPGEETQNIKVTYNQKGNSLSREESSVLKQSKEDIKHVAASPSPGIDLGKVQDD-TDIVA 391
EPGEE QNIK +YNQKG SLSREE+SVLKQSKEDIKHVAASPS GIDLGKVQDD TDIVA
Sbjct: 242 EPGEEIQNIKGSYNQKGKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVA 301
Query: 392 NGNQSDWISKQVDPHPLVVLDQATSLPNVTSNVQTQFDNVEFHRQSDGLQPSASTAKLFE 451
NGN SD I KQVDPHPLVVLDQAT+LPNV SNVQ+QFDNVEF RQSDGLQPSASTAK FE
Sbjct: 302 NGNHSDLIGKQVDPHPLVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFE 361
Query: 452 GTLPQSASNVKIPEPCSNFFKSLINSKSSGSAFGNSSSCLGFSNLDLQSLFEMEESLDKD 511
GT PQSA NVKIPEPCSNFFKSLIN KSSG+AFGNSSSCL F N DLQSLFE+EESLDKD
Sbjct: 362 GTPPQSAYNVKIPEPCSNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKD 421
Query: 512 LEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYHKRELFSAHFHSFCMNNPGL 571
LEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLY+KRELFSAHFHSFCMNNPG
Sbjct: 422 LEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGS 481
Query: 572 ISSSRQQEDMKIGVDHLNSMSGNANRASSLYQKHFEYNSSTKLHNDLNMQHENAGPINTS 631
+SSSRQQEDM I VDHLNSMSGNAN S LYQKH EYNSST+L NDLNMQHENAGPINTS
Sbjct: 482 VSSSRQQEDMIIDVDHLNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTS 541
Query: 632 NLHENGQNLGSEPGSCSALCGNTLDPLPSKGNNIADRICSPSFDPNVSVDGDEESLPSDH 691
NLHENGQNLGSEPGSCS L GNT+DPLP KGNNIADRICSPS +PN+S+DGDEESLPSDH
Sbjct: 542 NLHENGQNLGSEPGSCSDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDH 601
Query: 692 EMIDSYDECYIGRKQFEDDQLEAYNMSKKNHSDNNIEDSLRLEAKLRSELFARLGTRNLS 751
EMIDSY+ECY+ +K FEDDQ+EAYNM KKNH DNNIEDSLRLEAKLRSELFARLGTRNLS
Sbjct: 602 EMIDSYNECYMRKKHFEDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLS 661
Query: 752 KTCNPCHNIQTSVEQGTENDARDDSTQQNNTEPTVGLAVGSDVDLISKKTEIALLSGKGD 811
K CNPC+NIQTSVEQGTENDAR+D TQQNNTE TVGLAVGSDVDLISKK E ALLSGKGD
Sbjct: 662 KACNPCNNIQTSVEQGTENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGD 721
Query: 812 QQFGFGGTNICKTPDDIHGRCHFENLPSEAQDSADSDENERFNREGSCSKTTFSFTPLTM 871
QQFGFGGT+ CKTPD+IHG HFENLPSE D DSD+NE F+REGSCSKTTFSFTPLTM
Sbjct: 722 QQFGFGGTDRCKTPDEIHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTM 781
Query: 872 NSVLQHIKAISSVSIEVLLTRTRGSLSNLGFPEDGDSLQVDQIHWRKLKENSVHETVRPM 931
NSVLQH+K ISSVSIEVLL+RT NLGFPEDGDSL+VD+IHWRK ENSVHE VRPM
Sbjct: 782 NSVLQHMKVISSVSIEVLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPM 841
Query: 932 FQSDGSYIDDLAIDPSWPLCMYELRGKCNNDECPWQHVKDYSFANRRQCQHGHINYSDSC 991
QSDGSY DDLAIDPSWPLCMYELRGKCNNDECPWQH+KD+SFANRRQCQHGHIN
Sbjct: 842 LQSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHIN----- 901
Query: 992 NGLSFSSDETKVFKYEDGMTPPTYLVGIDILKADSHSYDPVLTQKSSQCWQSFFSISLTL 1051
SSDETKVFKYED MTPPTYLVGIDILKADS SY PVL Q+SSQCWQ+FFSISLTL
Sbjct: 902 -----SSDETKVFKYEDRMTPPTYLVGIDILKADSRSYGPVLAQRSSQCWQNFFSISLTL 961
Query: 1052 PNLLQKDASADGLFLHDARIVANGNWNRPSSYFQRGSSILSQLKQGDENLALETALIIIN 1111
PNLL+KDASADGLFLHDARI A G+WNRPSSYFQRG S+LSQLKQGDENLALETALIIIN
Sbjct: 962 PNLLRKDASADGLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIIN 1021
Query: 1112 QETNSREGMKKALPVLSRAVENNPKSIALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQS 1171
QETNSREGMKKALPVLSRAVENNPKSIALW +YLLIFYSYTTTGGKDDMFS+AVKHNGQS
Sbjct: 1022 QETNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQS 1081
Query: 1172 YELWLMYINSRMNLDARLAAYDAALSALCDNIFTHNLDGKYASAHILDLILQMTNCLCMS 1231
YELWLMYINSRMNLDARLAAYD+A+SALCDNIF+HNLDGK ASAHILDLILQMTNCLCMS
Sbjct: 1082 YELWLMYINSRMNLDARLAAYDSAISALCDNIFSHNLDGKDASAHILDLILQMTNCLCMS 1141
Query: 1232 GNVEKGIQRIFGLLRVAMDSDEPYSFTHSDMLACLNISDKCIFWVCVVYLVIYRKLPHAI 1291
GNVEK IQRIFGLL+VAMDSDEPYSF HSDML CLNISDKCIFWVCVVYLV+YRKLPHAI
Sbjct: 1142 GNVEKAIQRIFGLLQVAMDSDEPYSFMHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAI 1201
Query: 1292 VQQLECEKELIEIEWPAIQLTDGERLRASRVVKKAVDFVDSCLNNESLESKCYQKSIQMF 1351
VQQLECEKELIEIEWPA+QLT+GE+LRASRVVKK VDF DSCLNNES ESKCYQKSIQMF
Sbjct: 1202 VQQLECEKELIEIEWPAVQLTNGEKLRASRVVKKVVDFADSCLNNESPESKCYQKSIQMF 1261
Query: 1352 AVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLELLLLKVRAKKHGFGDETVVAFEQAIRN 1411
AVNHIRCLMAFEDI FSRNLLDKYVKLYPSC EL+LL +RA+KH FGD TVVAFEQAIR
Sbjct: 1262 AVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELILLDIRARKHDFGDATVVAFEQAIRY 1321
Query: 1412 WPKEVPGVQCIWNQYAEYLLQNGRIKCTEELMVRWFESTSKMDCSKTRTVDNSDCDSLHL 1471
WPKEVPG+QCIWNQYAEYLL+NGRIKCTEELM RWF STSKMDCSKTRT NSDCDSLHL
Sbjct: 1322 WPKEVPGIQCIWNQYAEYLLRNGRIKCTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHL 1381
Query: 1472 REYASGSILHALDCSPNEVDVVFWYLNLSVHKLLLNDQLEARLAFDNALRAAGSGTFRYC 1531
++ASGSI+ ALDCSP+EVDVVFWYLN SVHKLL+NDQLEARLAFDNALRAA +GTFRYC
Sbjct: 1382 LDHASGSIVRALDCSPSEVDVVFWYLNHSVHKLLVNDQLEARLAFDNALRAASAGTFRYC 1441
Query: 1532 MREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVPKPLSRKFINDIKKPRVQLL 1591
MREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPV +PLSR+FINDIKKPRV+LL
Sbjct: 1442 MREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLL 1501
Query: 1592 VSNMLSPLSLDVSLVNCILEVWYGPSLLPQKFNKPRELVDFVETILEMLPSNYQLVLSVC 1651
VSN LSP+S DVSLVNCILEVWYGPSLLPQKFNKP+ELVDFVETILEMLPSNYQLVLSVC
Sbjct: 1502 VSNTLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVC 1561
Query: 1652 KQLCNGDN-SSQVASPSLIFWACSNLISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYE 1711
KQL NGDN SSQ ASPSLIFWACSNLI+AIF+ VPIPPEFIWVEAANILVNVKG EAI E
Sbjct: 1562 KQLSNGDNYSSQAASPSLIFWACSNLITAIFNCVPIPPEFIWVEAANILVNVKGLEAITE 1621
Query: 1712 RFHKRALSVYPFSVQLWKSYYNICKTRGDTSAVLREVNERGIELNEPS 1757
RFHKRALSVYPFSVQLWKSYY++CKTRGDTS VL+EVNERGIELNEPS
Sbjct: 1622 RFHKRALSVYPFSVQLWKSYYSMCKTRGDTSTVLQEVNERGIELNEPS 1653
BLAST of ClCG09G020050 vs. ExPASy TrEMBL
Match:
A0A6J1GRB4 (uncharacterized protein LOC111456410 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456410 PE=4 SV=1)
HSP 1 Score: 2793.8 bits (7241), Expect = 0.0e+00
Identity = 1441/1758 (81.97%), Positives = 1553/1758 (88.34%), Query Frame = 0
Query: 1 MEKKNSEELTVKAMASNSKPSKSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSP 60
MEK ++EELTVKAM SN KP++SK S SREEGEVSSSDNDTQTH VH V S +PASVTSP
Sbjct: 1 MEKNDAEELTVKAMESNLKPTRSKTSNSREEGEVSSSDNDTQTHGVHHVRSAMPASVTSP 60
Query: 61 ISSILPPKNKCNEGIQAASADVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLV 120
ISSILPPKNK N GIQAASADVC +TSIQTT+QK CD Q+V+KA TPW ASR+AN+NLV
Sbjct: 61 ISSILPPKNKSNAGIQAASADVCPKTSIQTTAQKICDNDQIVHKAITPWVASRDANANLV 120
Query: 121 ISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANKLSLSQ 180
ISFSDDSGS+++E SK +TSKS S+AV HFKPPTS LD+SN+LRSMTRN VVANK S SQ
Sbjct: 121 ISFSDDSGSDMDERSKEKTSKSRSNAVGHFKPPTSLLDKSNKLRSMTRNNVVANKFSSSQ 180
Query: 181 PFIPSMTKNHRAYSTG-AGPSLAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQ 240
FI S T RA S G AGPSL EQGS+IRAFSGNL QG NDQG+NL +SKLQDLREQ
Sbjct: 181 SFITSKTMTKRACSKGAAGPSLVEQGSRIRAFSGNLPIQGHRNDQGVNLKSSKLQDLREQ 240
Query: 241 IAICESKLKLKSAQQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRL 300
IAI ESKLKLKSAQQNKE IS TNQDYIVTNSKSDLGRKG+ATISQ GP +PDAKR+
Sbjct: 241 IAIWESKLKLKSAQQNKEIISATNQDYIVTNSKSDLGRKGDATISQFPPSGPTQPDAKRM 300
Query: 301 KTSGSYSTKLSLSGQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLK 360
KT GSYSTKLSLSG QHLRAT A KSVFRPQEPGEETQNIKVTYNQKGNS++R+ES+ LK
Sbjct: 301 KTIGSYSTKLSLSG-QHLRATNAVKSVFRPQEPGEETQNIKVTYNQKGNSMNRDESNALK 360
Query: 361 QSKEDIKHVAASPSPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVT 420
Q KEDIKHVAAS SPG DLGKV D TDIVANGNQSDWISKQVDPHPLVVL QA+ LPN
Sbjct: 361 QKKEDIKHVAASSSPGSDLGKVHDGTDIVANGNQSDWISKQVDPHPLVVLGQASVLPNTA 420
Query: 421 SNVQTQFDNVEFHRQSDGLQPSASTAKLFEGTLPQSASNVKIPEPCSNFFKSLINSKSSG 480
SNVQT FDN EFH +DGLQ SASTA EGT PQSASNVKIPE SNFFKSLINSKS+G
Sbjct: 421 SNVQTLFDNSEFHSPNDGLQQSASTANFSEGTCPQSASNVKIPESFSNFFKSLINSKSTG 480
Query: 481 SAFGNSSSCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALI 540
+AFGN SSCLGFSN+DL+SLFEMEESLDKDLEEAQD RR+CE+EERNAFKIYSRAQRALI
Sbjct: 481 TAFGNPSSCLGFSNVDLESLFEMEESLDKDLEEAQDFRRRCEVEERNAFKIYSRAQRALI 540
Query: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSL 600
EANSRCLDLYHKRELFSAHFHSFCMNNPGL+SSSRQQE+MKIGVDH NSMSGN NRAS L
Sbjct: 541 EANSRCLDLYHKRELFSAHFHSFCMNNPGLVSSSRQQENMKIGVDHSNSMSGNENRASPL 600
Query: 601 YQKHFEYNSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSK 660
YQKH EYNS T+L NDLNMQHENA PINTS LHEN QNLGSEP SCS LCG TL+P+PSK
Sbjct: 601 YQKHSEYNSFTQLRNDLNMQHENASPINTSILHENRQNLGSEPESCSDLCGITLNPVPSK 660
Query: 661 GNNIADRICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKN 720
G NIADRICSPS +PNVSVDGDEES SDHE+IDSYDECYIG+K+FEDDQ+EA NMSKKN
Sbjct: 661 GKNIADRICSPSIEPNVSVDGDEESFHSDHEIIDSYDECYIGKKRFEDDQMEACNMSKKN 720
Query: 721 HSDNNIEDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNN 780
H D+ DSLRLEAKLRSELFARLGTRN S+TCNPCHNIQTSVE+G E DARDD TQQN
Sbjct: 721 HYDDKTGDSLRLEAKLRSELFARLGTRNSSQTCNPCHNIQTSVEKGAEKDARDDKTQQNY 780
Query: 781 TEPTVGLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIHGRCHFENLPSEA 840
TEPTV AVG+D+D KT+ ALLSGK DQ+FGFGGT+ CKTPDDI C+FEN P E
Sbjct: 781 TEPTVRQAVGNDID----KTKSALLSGKRDQKFGFGGTDRCKTPDDIRSHCNFENFPLET 840
Query: 841 QDSADSDENERFNREGSCSKTTFSFTPLTMNSVLQHIKAISSVSIEVLLTRTRGSLSNLG 900
D ADSD NE NREG CS FS+ PLT+NSVLQH+KA++SVS EVLL+RTR S SNLG
Sbjct: 841 HDVADSDVNEPSNREGPCS--YFSYAPLTLNSVLQHMKAVTSVSTEVLLSRTRESFSNLG 900
Query: 901 FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 960
PE+GD L+VD+IHWRKL+EN V +TV MFQSDGSY DDL+IDPSWPLCMYELRGKCNN
Sbjct: 901 LPEEGDLLEVDRIHWRKLEENHVPDTVSCMFQSDGSYTDDLSIDPSWPLCMYELRGKCNN 960
Query: 961 DECPWQHVKDYSFANRRQCQHGHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVGIDI 1020
DECPWQHVKD S +NRR CQ NYSDSCNGL FSSDETKVFKYED MTPPTYLVG+DI
Sbjct: 961 DECPWQHVKDSSLSNRRPCQDSQSNYSDSCNGLLFSSDETKVFKYEDLMTPPTYLVGVDI 1020
Query: 1021 LKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWNRPS 1080
LKADSHSY+PVL QKSS+CWQ+FFSISLTLPNLLQKDASADGLFLHDARI A G+WNR S
Sbjct: 1021 LKADSHSYNPVLVQKSSKCWQNFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRQS 1080
Query: 1081 SYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALW 1140
SYFQ GS+ LSQLKQ DEN ALETALIIINQE NSREGMK+ALP+LSRA+E+NPKSIALW
Sbjct: 1081 SYFQSGSTTLSQLKQADENQALETALIIINQEMNSREGMKRALPILSRAIESNPKSIALW 1140
Query: 1141 TIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCD 1200
T+YLLIFYSYTT GGKDDMFS+AVKHN QSYELWL+YINS MNLDAR+AAYDAALSAL +
Sbjct: 1141 TMYLLIFYSYTTNGGKDDMFSYAVKHNEQSYELWLLYINSHMNLDARIAAYDAALSALFN 1200
Query: 1201 NIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFTHSD 1260
NI T +D K ASAHILDLILQMTNCLCMSGNVEK Q+IFGLLRVAMDSDEP SF HSD
Sbjct: 1201 NILT-QMDEKCASAHILDLILQMTNCLCMSGNVEKATQKIFGLLRVAMDSDEPGSFMHSD 1260
Query: 1261 MLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLRASR 1320
ML CLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKEL+EIEWP I LTDGE+ RAS
Sbjct: 1261 MLTCLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELVEIEWPTIHLTDGEKQRAST 1320
Query: 1321 VVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPS 1380
VVKKAVDFVDSCLNNESLES+ YQKSIQMFAVNHIRCLMAFEDIGF+RNLLDKYVK YPS
Sbjct: 1321 VVKKAVDFVDSCLNNESLESQSYQKSIQMFAVNHIRCLMAFEDIGFTRNLLDKYVKRYPS 1380
Query: 1381 CLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKCTEE 1440
CLELLLL KKH FG E V AFE+ IRNWPKEVPGVQCIWNQYAEYLLQNGRIK TEE
Sbjct: 1381 CLELLLLNAWTKKHDFG-EMVAAFEEVIRNWPKEVPGVQCIWNQYAEYLLQNGRIKYTEE 1440
Query: 1441 LMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLNLSV 1500
LM RWF+S+SK+ S+TRT+DNSDC+SLHL +YASGSI+HALDCSP+EVD+VFWYLNLSV
Sbjct: 1441 LMARWFDSSSKIG-SRTRTLDNSDCNSLHLLDYASGSIVHALDCSPSEVDLVFWYLNLSV 1500
Query: 1501 HKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSILEG 1560
HKLLLND LEARLAFDNALRAA SGTF+YCMREYAMFLLTDESLLNEA SVGGIRSILEG
Sbjct: 1501 HKLLLNDLLEARLAFDNALRAASSGTFKYCMREYAMFLLTDESLLNEAGSVGGIRSILEG 1560
Query: 1561 YLNDARAFPVPKPLSRKFINDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPSLLPQ 1620
YL+D RAFPVP+ LSRKFINDIKKPRVQLLVSNMLSPLS DVSLVNC+LE WYGPSLLP
Sbjct: 1561 YLSDVRAFPVPETLSRKFINDIKKPRVQLLVSNMLSPLSPDVSLVNCVLEAWYGPSLLPP 1620
Query: 1621 KFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLIFWACSNLISAIF 1680
KF+KP+ELVDFVETILEMLPSNYQLVLSVCKQLCNG+NSSQV S SLIFWACSNLISAIF
Sbjct: 1621 KFSKPKELVDFVETILEMLPSNYQLVLSVCKQLCNGNNSSQVTSASLIFWACSNLISAIF 1680
Query: 1681 SSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKTRGDTS 1740
+VPIPPEFIWVEA++ILVNVKGF AI ERFHKRALSVYPFSVQLWKSYYN CK RGDTS
Sbjct: 1681 CAVPIPPEFIWVEASDILVNVKGFGAITERFHKRALSVYPFSVQLWKSYYNKCKARGDTS 1740
Query: 1741 AVLREVNERGIELNEPSL 1758
AVL+ VNERGIEL+ PSL
Sbjct: 1741 AVLQAVNERGIELSLPSL 1748
BLAST of ClCG09G020050 vs. TAIR 10
Match:
AT2G39580.1 (CONTAINS InterPro DOMAIN/s: Putative zinc-finger domain (InterPro:IPR019607); Has 249 Blast hits to 219 proteins in 85 species: Archaea - 0; Bacteria - 144; Metazoa - 29; Fungi - 8; Plants - 50; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink).)
HSP 1 Score: 780.4 bits (2014), Expect = 3.0e-225
Identity = 604/1760 (34.32%), Positives = 897/1760 (50.97%), Query Frame = 0
Query: 22 KSKASESREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPISSILP-PKNKCNEGIQAASA 81
K+ +EEGE+S+SD++ Q P+ ++ + +T IS+ + + G
Sbjct: 8 KNSPVTGKEEGELSTSDDEVQ-----PMQTSTRSPLTEHISANTNIQRRQAGNGGSFIKP 67
Query: 82 DVCTRTSIQTTSQKTCDTAQVVNKASTPWGASREANSNLVISFS-DDSGSELEECSKVRT 141
T T + + +T Q + R NSNLVI+FS DDSGSE + + +T
Sbjct: 68 SDATPTKLTNPGGRIFETKQAIAAIHGKKFPVRGNNSNLVINFSDDDSGSESDCKGRTQT 127
Query: 142 SKSHSDAVRHFKPPTSTLDRSNRLRSMTRNKVVANK----LSLSQPFIPSMTKNHRAYST 201
SK P T+ + + ++ K+ + ++++ + + T +H A S
Sbjct: 128 SKIQ---------PKGTISGNRNPSTFSQTKLKGPRQIDIRAITKKALSTSTFSHAATSK 187
Query: 202 GAGPSLAEQ---GSKIRAFSGNLQSQGRGNDQGMNLNTSKLQDLREQIAICESKLKLKSA 261
+ S A++ I + + + +Q + N++KLQDL++QIA+ ES+LKLK+A
Sbjct: 188 VSNLSFAKEMKSNKYIHSSERTVSKDAQRPEQIVESNSNKLQDLKQQIALRESELKLKAA 247
Query: 262 QQNKESISITNQDYIVTNSKSDLGRKGNATISQVSSLGPKEPDAKRLKTSGSYSTKLSLS 321
Q K+++ N K R+ + L P EP KRLK SG
Sbjct: 248 QPKKDAV----------NPKITPARRVSIISDDTRHLEPNEPPKKRLKVSGI-------- 307
Query: 322 GQQHLRATYAGKSVFRPQEPGEETQNIKVTYNQKGNSLSREESSVLKQSKEDIKHVAASP 381
+T + Y S+ + DI+ S
Sbjct: 308 ----------------------DTSQPVIDYRVAA-------SAAAPMNAPDIR---KSL 367
Query: 382 SPGIDLGKVQDDTDIVANGNQSDWISKQVDPHPLVVLDQATSLPNVTSNVQTQFD---NV 441
PG++ ++ G++SD I V P V + ++S+ ++ ++ +
Sbjct: 368 LPGVNA-----NSSCKHLGSKSDEIVPPVIPQHTVEGNTSSSVLQKSTGKVNHYEGGREL 427
Query: 442 EFHRQSDGLQPSASTAKLFEG---TLPQSASNVKIPEPCSNFFKSLINSKSSGSAFGNSS 501
E + D S K+ G +S++N PCSN S S+
Sbjct: 428 ETMKNVDRSVSSEQLLKIVNGNHQVFSRSSNNNWKRLPCSN--------NSGLYNIPGST 487
Query: 502 SCLGFSNLDLQSLFEMEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCL 561
+ G S LD+ SL +EESLDK+LEEAQ+ +R EIEERNA K+Y +AQR+LIEAN+RC
Sbjct: 488 TVPGHSQLDMLSLTNLEESLDKELEEAQERKRLFEIEERNALKVYRKAQRSLIEANARCA 547
Query: 562 DLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGVDHLNSMSGNANRASSLYQKHFEY 621
+LY KRE+ SAH+ S + + L+ S E+ + G LN+ +G+ + A+ +
Sbjct: 548 ELYSKREILSAHYGSLIVRDSRLLWPSIHGENPETGFHFLNNSTGSIDLAT---KTDIAQ 607
Query: 622 NSSTKLHNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSALCGNTLDPLPSKGNNIADR 681
+S + ++ N ++ + P S +GQNLG S L +T D LP A R
Sbjct: 608 HSQLESNHKYNSEYVGSHPPPHS---RSGQNLG-----YSDLGASTSDGLPCGNKQTASR 667
Query: 682 ICSPSFDPNVSVDGDEESLPSDHEMIDSYDECYIGRKQFEDDQLEAYNMSKKNHSDNNI- 741
+CSPS D N+ D+ES P DHE E +K + D +
Sbjct: 668 LCSPSSDANIL--PDDESFPVDHE------------------STEGNPGHQKENIDQTLG 727
Query: 742 -EDSLRLEAKLRSELFARLGTRNLSKTCNPCHNIQTSVEQGTENDARDDSTQQNNTEPTV 801
+++L LEA LRS+LF RLG R S+ C N +T +++G E D + TQ++N P
Sbjct: 728 NQNALLLEASLRSKLFDRLGMRAESRG-GTCFNEETVIDRGDERDFGSEGTQRDNGSPF- 787
Query: 802 GLAVGSDVDLISKKTEIALLSGKGDQQFGFGGTNICKTPDDIH-GRCHFENLPSEAQDSA 861
S++ L + E G + +P + R E Q S
Sbjct: 788 -----SEIYLHNDSLEP-------------GANKLQGSPSEAPVERRSIEENSLNYQLSI 847
Query: 862 DSDENERFNREGSCSKTTFSFTPLTMNSVLQHIK----AISSVSIEVLLTRTRGSLSNLG 921
D E+ R + E + + PL S + H+K +I+S+ E +L SL +
Sbjct: 848 DM-ESHRSSPENALLSSVALSGPL-FRSTIYHLKVPGSSITSLGPEYILQNKTYSLYS-- 907
Query: 922 FPEDGDSLQVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPSWPLCMYELRGKCNN 981
D+ R L E V+E + G Y +L +DPSWPLCMYELRG+CNN
Sbjct: 908 ----------DKRQCRSLTETIVYE------KKIGFYTCNLKVDPSWPLCMYELRGRCNN 967
Query: 982 DECPWQHVKDYSFANRRQCQH---GHINYSDSCNGLSFSSDETKVFKYEDGMTPPTYLVG 1041
DEC WQH KD+S + Q H G + S + + +K + D + PTYLV
Sbjct: 968 DECSWQHFKDFSDDSLHQSLHDPDGRVGSSSH----QKTHNSSKGSQILDSVFSPTYLVS 1027
Query: 1042 IDILKADSHSYDPVLTQKSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIVANGNWN 1101
+D +K DS SY+ VL Q+ Q W FS L N L ++ A ++ RIV GN
Sbjct: 1028 LDTMKVDSWSYESVLAQRHGQIWCKHFSACLASSNSLYRNVPAKE---NEGRIVVLGNSK 1087
Query: 1102 RPSSYFQRGSSILSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSI 1161
SSYF+ S++ + Q AL +LS+ +E +P S
Sbjct: 1088 TYSSYFRIKHSLMWHIFQ--------------------------ALSLLSQGLEGDPTSE 1147
Query: 1162 ALWTIYLLIFYSYTTTGGKDDMFSFAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSA 1221
LW +YLLI+++Y + GK DMFS+ VKH+ +SY +WLMYINSR L+ +L AYD ALSA
Sbjct: 1148 ILWAVYLLIYHAYEGSDGK-DMFSYGVKHSSRSYVIWLMYINSRGQLNDQLIAYDTALSA 1207
Query: 1222 LCDNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKGIQRIFGLLRVAMDSDEPYSFT 1281
LC N + ++D +ASA ILD++LQM N LC+SGNV K IQRI L A SD+P
Sbjct: 1208 LC-NHASGSIDRNHASACILDVLLQMFNLLCISGNVSKAIQRISKLQAPAAVSDDPDFSL 1267
Query: 1282 HSDMLACLNISDKCIFWVCVVYLVIYRKLPHAIVQQLECEKELIEIEWPAIQLTDGERLR 1341
S +L CL SDKC+FWVC VYLVIYRKLP +I+++LE EKEL+EIEWP + L +
Sbjct: 1268 MSHILTCLTYSDKCVFWVCCVYLVIYRKLPDSIIRRLEMEKELLEIEWPTVNLDGDLKQM 1327
Query: 1342 ASRVVKKAVDFVDSCLNNESLESKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKL 1401
A R+ K + V+ NN ++ +FA+N+ ++A +++ R++L V+L
Sbjct: 1328 ALRLFDKGMRSVEHGTNN-----GIQKRPAGLFALNYALFMIAVDELESRRDILKASVQL 1387
Query: 1402 YPSCLELLLLKVRAKKHGFGDETVVAFEQAIRNWPKEVPGVQCIWNQYAEYLLQNGRIKC 1461
YP+CLEL LL VR + + D FE+ ++ KE +QCIWNQYAEY L+ G
Sbjct: 1388 YPTCLELKLLAVRMQSNELKDMFSSGFEELLKQEAKEASCIQCIWNQYAEYALEGGSYDL 1447
Query: 1462 TEELMVRWFESTSKMDCSKTRTVDNSDCDSLHLREYASGSILHALDCSPNEVDVVFWYLN 1521
ELM RW+ S + K +TV ++ + + S L L+ + ++VDV+F YLN
Sbjct: 1448 ARELMSRWYGSVWDVLSHKYKTVRGNEEEG---DDNMLESALSDLNVASDQVDVMFGYLN 1507
Query: 1522 LSVHKLLLNDQLEARLAFDNALRAAGSGTFRYCMREYAMFLLTDESLLNEAASVGGIRSI 1581
LS+H LL ++ EARLA D AL+A F +C+RE+A+F L +E S+ +
Sbjct: 1508 LSLHNLLQSNWTEARLAIDQALKATAPEHFMHCLREHAVFQLINELQATGEFSINLQMRL 1567
Query: 1582 LEGYLNDARAFPVPKPLSRKFI-NDIKKPRVQLLVSNMLSPLSLDVSLVNCILEVWYGPS 1641
L YL+ A + PV +PLS KFI N +KPRV+ LV+N+L+P+S ++ +VN +LE W+GPS
Sbjct: 1568 LNSYLDRASSLPVKEPLSWKFISNSAEKPRVRKLVTNLLAPVSSELFVVNVVLEAWHGPS 1567
Query: 1642 LLPQKFNKPRELVDFVETILEMLPSNYQLVLSVCKQLCNGDNSSQVASPSLI-FWACSNL 1701
L+P+K +K +ELVDFVETIL ++PSNY L LSV K L + S S S I FWA NL
Sbjct: 1628 LVPEKLSKQKELVDFVETILGLVPSNYPLALSVSKLLRKEEKQSDSGSSSGIHFWAGLNL 1567
Query: 1702 ISAIFSSVPIPPEFIWVEAANILVNVKGFEAIYERFHKRALSVYPFSVQLWKSYYNICKT 1755
S I ++P+ PE+IWVEA I+ ++ GF+ ERF K+ALSVYP SV+LW+ Y+++CK+
Sbjct: 1688 ASTISCAIPVAPEYIWVEAGEIVSDINGFKTRAERFLKKALSVYPMSVKLWRCYWSLCKS 1567
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038890115.1 | 0.0e+00 | 92.20 | uncharacterized protein LOC120079791 isoform X3 [Benincasa hispida]
XP_038890113.1 | 0.0e+00 | 91.53 | uncharacterized protein LOC120079791 isoform X1 [Benincasa hispida]
XP_038890114.1 | 0.0e+00 | 90.96 | uncharacterized protein LOC120079791 isoform X2 [Benincasa hispida]
XP_038890116.1 | 0.0e+00 | 90.23 | uncharacterized protein LOC120079791 isoform X4 [Benincasa hispida]
XP_011655356.2 | 0.0e+00 | 87.08 | uncharacterized protein LOC101211906 [Cucumis sativus] >KGN51732.2 hypothetical ...
Match Name | E-value | Identity (%) | Description
O60293 | 2.6e-11 | 20.54 | Zinc finger C3H1 domain-containing protein OS=Homo sapiens OX=9606 GN=ZFC3H1 PE=...
Match Name | E-value | Identity (%) | Description
A0A0A0KS73 | 0.0e+00 | 87.08 | zf-C3H1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G512890 PE=...
A0A1S3CJD3 | 0.0e+00 | 86.86 | uncharacterized protein LOC103501638 OS=Cucumis melo OX=3656 GN=LOC103501638 PE=...
A0A5D3C3A9 | 0.0e+00 | 87.16 | Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuw...
A0A5A7VFE0 | 0.0e+00 | 87.47 | Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuw...
A0A6J1GRB4 | 0.0e+00 | 81.97 | uncharacterized protein LOC111456410 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity (%) | Description
AT2G39580.1 | 3.0e-225 | 34.32 | CONTAINS InterPro DOMAIN/s: Putative zinc-finger domain (InterPro:IPR019607); Ha...