Cla97C11G220620 (gene) Watermelon (97103) v2.5

Overview
NameCla97C11G220620
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionHXXXD-type acyl-transferase family protein
LocationCla97Chr11: 26601015 .. 26621687 (-)
RNA-Seq ExpressionCla97C11G220620
SyntenyCla97C11G220620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACGGCGGTAAAAACAGTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGCGACGACAAGGTCCGGGAATTAACGGCGATCGATCTGGCGATGAAGCTTCATTATATTAGAGGCGTTTATTTGTTCAGAGGGAGCGAAGAAGTGAGAAATTTGACGATTTATGACCTGAAAAAACCTTTGTTTCCGTTGTTGGAGCAATACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGCCGTTCATTAAGTGTAATGATAGTGGAGTGAGAATTGTGGAAGCAAATTGTGAGAAAAGTATTGATGAATGGCTTTCGATTATTGAGGGAGACGATAAATTTCTGCATCGCGATGGCTGTTTGGTTCATACTCAAGCCATTGGTCCCGATCTTGGATTTTCCCCTCTTGCTTTCATCCAGGTATGTTCATGTACTTCATTCCTTATCATCACGTCATCATTCATAACTGCTCTAACCCTAAATGAAAAATACATTTCTACAACAATTACGGTATGGATTTGAACTTCTAATCACTAAGCACAGTACATGTCTCACCCAAATTCACTTTCGCCGCTTTTTTTTCCTTGTTGTCGTAGTTTCATTTTAGAAAATTATTTTAAATTGGAAAAACGAATAAAAATATTATGTGATAGACCCCAGTAGACTACGTTGAAATTTGGCTATATTTCGAAAATATAGATACCAAAAAAATGAACCATAATTGAAATCATAAATACTAAAATTAAGCCTTTTGGCCCACCAATTTCTTAAGTTGGTATAAACAACTCAACTTTAATAGTAAATACTATTGAAGTTTGTACATTTATTGATAACTTCATTTTTTAATATTTCTCTACCTTTCACATTTATTGTGGAATTTCTGCTTTACTTCATCTCTACTTTTCATATTTATCGGGTAACTCTATCTTCTTCTCTTAATATACTTTTCACAATTATTGAGTAACTCAATCTTATAACTTTACATCACAAATATATATATTAACCCCTATTGGTGGGCCAAAGGCTTCCAGTCTTTTTCTTTTTTTAACATTTCTACCACGGTTTTCATCACTTTTTCTAAGTGAAATAGTTGAATTATTAGTCAATAAAAAAAATAATAAATAAATAAATAAATAAAATATTTTCGAAAAGATTTTTTTTATTAGTTTTCAAAAATTGCGTAATATTTTGAAAACATTGGTTATGTATAACAATGCAATAACTTTTTGGTAGGAGCTTTCCATGATGGGAAAGAATGAAGGTGAGAGGACACACGATGATAATTTTTTTTAGTCTATGAAAATATAAAGTACTAAACAATTCGTGTGAATCTAAGATCAGTCCCAACGCTCTGTCTTAAAATTGGATGTCCGACGAACGTTAGATATCTCAATTTTATGGAAATGTCGGTATTCAATGAAATGTTAAAATCTATGGATATTTTTAAAAAATTAATAAAAGCAAAATAAATTCAAATAAAACATTTTTTAAAAAAATTAAATTTATAGATAAACATATTATACTTTAAAATAGGTTAAAATATTTATTGTTTATATTAAAATTCAAATAAATGACCATCTTGTTACTTTATTTCTTATATTTTATTGATTTTTTTACAATATAATGGAAATATCGGTTTGTCTATTATGTTGATGTTGAACTCATAAAAATGTAAAAAAATCAATGGAAATATTAACATGTCAATAAAAATTTAATATCATATTTTTAGCCTGATTAACATTATGATTAGATGAGTTATTGTTAGAAAGTGAATAAAATATTACTAGCTAAAAGGAGAATTTAAATACCTAAACAAAGGATGGGGTATAATATTTAAGAGACAACAACTACAGCAAGACTTTAATAGTAGATTAATTAATGATAACCACACCTAGAAGCCTCGAGTGAAGCGATTAATGGAGCTGCCATTGGAGTTAATTTCATTAACGATGGAATAAATGCTTTTTTCTTTTTTTCTTTTTCTAGGATGGTTAATTATGGATGTGACAGCGTGAACTATATGAGCCATCAACCATGCAAAAGTTGTTGAAATGTCACGTGATTAATTGGATTTTATTTTTATTTTTGTGTGTGTTTTAAGTTACAATCTCGTAATAAATGAAATTCATACCTAAAATTGCCCTATAACTAGAAAACGAGATTTTTCGTATTTGATTTGACTTTTTCAAGTGTTTAATTTTGAAAATAAGTCATTTTAGAAAGAAAAAAAAATTGAAATGTTTAGCAATGAAATACTTTTAAAGTTTAGTTTAAAGTATTTGTATCAGAAAAAGTTTAAATAAAAATGATTTTTTTGAAAACCCTTTTTTTTCCTAATCGTTCCAAATAGCCCCTTCATCCAAGTGCAATTGTTTCGTGGATTCATTTCAATACGTTTGTTTATAGGGTTCAACATCAATATCACTTACTTAAGTTTAGGGGTAAAAAAACTTTACTCATATTATTAATTTGAATAACTTTTTTATTTATTTAAATATTAGTTGTTATCAATATGAACATTGATATTTAATTATCATCGTGTGTAAAATACTTTTGGAGTTATGTCAATTTAGTCCCTAAATTTTAAAATGAAAGTCTAATGATCAAGTTTTAGACCTCTTGTTTAGTTGTTATTTTGTTTTTAGTTTTCAATTTTTAAATTTAAGTGGTTGAGTTCTCCTAATTTTCTTGTTATATAAAAGTTATTATGTTTCTCCTTAAATAAATATTTGATTTTTCAGTCATTTTTCATTTTCATTTTTAGTTTGGTTTGATTTATTATTGTTATTATTATTATTATTGATTTTTTTTTTCAAAATTTAACTTTATTTTTTAAAACATTAAAAAAAAAATAGACACCACAACTAACAAACACATAGAATGTATAATATTTAAGAGGCTAAAACTACAGCAGAATTTTAATAATAAATTAATTAAGTATAATCTAGCAACTAAAAACTTTTATTAGATAATATGAAATTCTATCTCAAAACCATTTTATAATAAGAGGAGTAATTCATATATCTTATTAAAAATGTGAAGTCTTTTAATTTTTTTTAATGTGAAATCTTCGACGACCTCGAATGGAACAATCAATGGAGTTACGGTTGGAATTATTTTAATTGATACACTATTTTTCTTCTAGAACTGGTAATTATGAACATGACAACATGAGCTATATGAACCATGCAATACGTGATATTGGTTCATCTTTAATGTCAGCGTTGGAGCAATGTTCTCCACCTATCATTAAAAGTTGTTGAAATCTCACGTGACATAATTGGACTTTATTATTAATTTTTTAATATCAGTGTAATTATATTTTACAAATTAAAGAGTCCATTTTAAGTTCTACATTTTTCTAAAAGACCAAGTCAGAATAAAAATAAAATTGTAGTCATACTTTAGGGAATTATTTAAGTTTAGTACATATATTTTTAAAAGCTCAATTTTAATTTATATATTTTTTACTTCTAACTTATTATTGATATTTTTAAATACTTTTTTTATTACTATTTATTAGTATTTCTCTATAAATTTTAGTTTCTTAAGTTCTAATTCTAAAAATAGTCATAGTCTAAATTTCATTGGATTTTATGTGAAAATACTATATTGGGTAATGTACTTTGAGTTCTGTTTATTTTAATTGTCATATTTTTAATTATCTAATTTTAATTTCTATATTTTTAATAGATATTAAATTTTATGAAGTATATAACTTATTAAAGATTTATCAAGCCAGCGAAACATCTCTTTGCGATATGTATACTCCAAATATATCTCTACAGAAAAATAGTTAAATGACGAAAACTTTTTGTAAACAATATTACACTGAATTTTAATGAAATCAATACTTAAAAAATTGTAAACACCTTAAAATCATTCAACACCATCTCTTACATTAATCTCCCGAAAGCATTAAAAAAAGAAAGAAAGAAAGAAACTAACTAATATTTATAATGATAAAATATTATTTTTGTCTCCATATTTTGTGTTGGTTCGTTCAAATTTCTAATTTTAGTTTCTTAATTTTAATTAATTTTAAATTTAGTCTCTTAAAATTAAGTTTTATTGATATATATTAAATAATAATAATAATTTTCATATAATAAAATATAATAGTGAAAATGTTAATACATAGCCAAACTTATTTTTAAAAAATCAACAATAAACTAACCGCACGAGATTAAATTTAGGACCATTGAAATTTTTAAAGGATAAAAATTTGAATAAACTTTAAAATATAGGGACCAAAATAATATTTTAACATATTTTTATAAAAACAATAAAACCAATATTGTTTTAAAAAAACAATTTTATAGGTTAAAAATGAAATAAAACAAACATTCATAAAATTAAAAAAAAAAAAAATTGACATTTAATTTTGATAACTTTTTTTATGTTATGACTAAATTGTTACAAATTTAAAAGTACAAGAGCTAAATTGTTAATCTTAAAATTTAAAAATTAAATTGTTACAAAATTAAATACATAAATTAAATCATTAAAAACTAAAGTTTTGGGACCAAAATTAGTTGTTAGTTTCGTCAAAGCATAGGATGAGAATGTTTTTCTAACAAAGAGAAAAAAAGTAAAAAGTGACAAGCTCATTGAATGTATATATTAAGAAAGAAAATGCTTTTGCCTGCACGCAGCTGACTCGGTTCAAGTGTGGCGGTCTCTCCGTGGGCCTCAGTTGGACTCACATTCTCGGCGATATCTTCTCCGCCTCCACCTTCATCAACGCATGGGGTTCCATCATGAACAACCGCCCGGCCCACCAACTCCGTCCGGCGCCGGCTGGTCTTATCTGGCCGTTCAGATCAACCAGACTGTCCACACCGCCGGTCAAGAGGCTCGACCCGACCGGAGACCTTTGGATCGGGTCAAGCGACTGCAAAATGGCGACGCTGTCATTTCGAATCACGGGGGAGCAATTGGATCGAATATTGAGCGTCGTCGGCCGGAATCGAGCGGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTTTGTCGAAAATACGGCTTGAGGACTCGGATTCGAGGACGATCTCGATCTATTCGACGAAATGCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCAGTCGCAGGAGCGGCGGAAGGCGAATTGGCGGAGCTGATTGTGAAGAAGAAAATTGATGAGGGCGGAGAAATCGAAGAAGTGGTGGAGAAAGGAAAGGAGGAATCGGATTTCATAGCGTACGGAGCGAGATTGACGTTCGTTGATTTGGAAGAAGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAAGCCAGTTCATGTGAATTATGAAATTGGGGGAGTTGGTGAAAACGGCGTCGTTTTGGTACTTCCAGGACCACCGCGTGACGACGGAAGAGACGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATGAACTGCAGAAGCAGTGGGAGATCGTTTGAAATCACAATGGACTTCTACAAATTATGTTAAGAAAGAGAAAAATTAAGAGATGAATTTCGCGTGAAAAAATAAATTAAGTTTTTATGTGATTAAATTTGTAATTTAAATGAGCATGTGTAGTTTGCATAAAGTTTGTCCATTCTATATATTGTAAAGAAATGTTGATCTAAAGCTTATTTCTTTGCGGAAAAAAAAAAAAAAAAGTTTTTATAATTGTTTTTAATGTCATAGAGTAGTAATTTTTAGGAACAGTTACAAATATAGTAATTAGGTTCGGTTGCAAATATAGTAATCGGACTCAAAATATTTATAGATATAACACAATTTGTAAAAAACTCTCGAACTGGGACTATATTGCTAATCAGTGATAGACTATACCATTAATAAATTTTGCTATATTTGTAAATTTTTTAAAATGCTGCTATATACTTACTTATTATCTCTAAGAATGCTACCCACTGCGATTACCATACTTTTTTTTTTTTAAAAAAAATATAGAATTAATTATATCTTTGATTGGTTATTTTGTATGGTATCCTACATTTCATTCATAAATTTGGCTAATTTAATAAGTGGATAAAATAAGAATTAATATCTCATGATCTAATTTTCGATTTGTTTAATTATTTAAATATAATTACTTTTTGATGGTTGTAGATGGGAGAGAATCATACCTTAAATTTCATTTATCAAGGGCAGAGATCTAGGGTTAGTGTCAATCAATAAGTTAAAACTACTTCATAAGTATAGATTCGGTATAGAAATCTAAATGATCCAATTCGAACAAGTTTCAAATTCAAACATCAAATTTGATCAAAAGTGTATCACACGAGAATCCATTATTTATATGATTTTTTGATAAAAAGAAATCACAAAAAATGGTATGTTGCTGCCATTTTGAAATGATTAAAGATCATCGAAGTAATGTCTAAACCCAATGATTCAAGGCAAGGATAAAGGATCCCGGAACAAGTAAAAATCTTTTTCAATTGTCTCAATAACTAAACTAGATTCAAATGAAGAATCGAAATAGATTCTAAAGGAGACCGGCTCTAATTTTCGATTTGTTTAATTATTTAAATAAAATTACTTTTTGATGGTTAGTAGACGGGAGAGAATCAGACCTTAAATGTCAAAATGGATAGTGTAAGTCAGCTATGATCATATTGGTAACTAGAGATATTTTTTAATTAGGCTAAAAAAATATTTTTAAAAAAGAAAAAGAAACTGAAAAAAAAAAAAGGATTTTTGTCGACTCTTTAAATATCCATTTAAGCCTTTGAATATTTTAAAATAAAAATTTTTAAATACAGAATACTTCTGAATTTTTATCTGTTAACGAAAACTACTATTTTGATTATTTATAACCAAAAGTAAAATAATGTTAAATTTAAATCATATTTTCATCCTTAAGCTTTCTGAATATATATTTAGTGCTAATGTGTCTAGGTATTAGATTTACAATCTAACATGTTATATTTCCAGGGTTATTTTTACATATTTATAAGTTTGACATCAATGTAAGAGAATAATTGAAATTTCCATTAGATCATAGAAAACCAACAATAACGTCAACATAATCACTTACGTGGACTTAGGACACAAAATTTACTTCCCAAAATACATTAATCAATCAATCCTAACATGAAAAGCGAGGTCATTTAAATATGTTTACAAAATAACCCGTTAACATTATTGTGTGAATCATGAATAATAAAGAAAATTTATTACGATTGCAACAATTTTGATGTGTTCGACATTTCAATATTTATGCATTCTACATCAACATAAAAAATGAATCGACATTTCCATTATATCAAAGAAAAATTCATGAAACATAAAAGGTAAGTAATAAAATATTGCTAATGTAAATACAATAATATATAATTAACATATTAACTTGTTTTAAAAACTATAAAATATTAATTTATTAATTTAAATACTTTATTTAAAATAAAAATTTCAGAAGTGTCTACCAACATAATTATTTATTGATATTTTCGTAAAATTAAATCTCAACATATTAAACTTTAACTAATACACTTTTTAAAATTAGAATTAAAATAAATCTTGTGAAGATTCAACGATAAAAAGTCATAATAGATCACAAATTATAAATACAAGAAAATGAGAACTAAAATTGACATTGAAATTAAATCTAATTTTTAAAATGTCAACTACTACCCAATAACCAATCTAATTTAATTTCAACTGATTTTTAAAATCTCAAGCACAATTATATGATGTGAACCTTTAACTTGAAAAAGAGGCATTTCCATTATATTATTGGTGGACAAAACTATACACATAAAAGATGAGGTGAAGAGAGAGTATATTTTTTAGACAAAATATAAACTTTTTTAATTCCAAAATATTTTTTTAACAATTATTTTCTATTATATTATTTTTGTACCAAACTATTGCAAGAGACATCCATCTGACAAGATCTTTAAACAATTTTTTTTTTTCTAATTCCAATATGTATATATCAAGTAGAAATGGGCTTAGATGATAAAAAGTATTTTTTTTAAAGAAAAATTATAAAGCACATAAAAATAAGGGAACTTTTCAAATATAAAAAATGGGCTATACTATTTACAAAATTTAACAAATTCAGTTATACCCAGTTATACCCAATTATAGCCTAGTGAAATAGCAAAAAAGCCAGTCCATAACTTGATTAAATTGCCTGACCCGTGCAATTAATGGAAACACAAGCCTACAACCCTATTAAATCGCTCAATCCCCAACTGTTTAAAACAACCAAGTGAAAAGTTGAAACTGCGCACTTGCTTTTTCATCTTCTTCACAACGTTCTTCTTCTTCTTCCTCTTTACCGAACTTCTTGTGTTAAAGGGGTGTATATCACAGTCTACATACTCAGTAATAAACACTGATAGTTGTCTATCAATGTCTATCTATAGATAAACAGTGATAGTATCATTGTCTATCAGTAGATAGACACAAATAGACACTATTAGTGTCTACCATAGTGATAGAAGTTCACCTACCCCTTGCTTGTTGAAGTTCAACTACCCCTTGCTTGTAATAGACACTGATAATTCACAGATAGACATTATCACAGTTATATTTGGTGATAATTCACAATCTTTATACACAGTGATATGTTCAAGGGTGTATGCTATGGTCTATGTACTTAGTGATACACAATGATAATGTTATGTCAATGTTATCACTGATTGATAGTGATAAACAACTATCACTATCTGTTTATCACTATCCATCGCTGAAACTCTAGCATTTATGGAACAACTTTAACATAAATGCAATGTTCATGGGTGTATGCCACAATTTATCAATAAATTGACATAGATATACATCTATTAGTGCCGATCACTGTGATAAAAGTTCACCTACACCTATCAGTGTCTATCACAGTGATAAAAGTTTACCTACACCCTTTGATTGAATGTTTGCCAAAGGGTTGAGAGGTGGAAACAAGTATTGAGTGGATTTAGTTTATTTTATTGAAGTTTAAGTGCTTTCCTTCAAATTAAGATGAGATCAAAAGAGTTTGTTTTATTTAAGTTTGAGTGTTTTTCTTCAAATTTTCATGAGATCAAAAGGTATATCAGTGATAGACATGACAGTTATAGTTGTCTATACAGTGGTTGTGTATTGTCCAGGGGTGTTTATCATAGTTTATGGGTGATAGACAGTGATACTTACTAATATGTGTCTATCACTGATAGTTAGTTTGTTACATTAAAAAGTGAAGATACTATTTCATTTGTTTACAACTCAAGAACAAAATAAATCAAAAAATAACTTTCATAAACTCTAATATTAATCCATTTGTCAATACATGTTAAAAAAATATCAATCCAGTTGTCTATATTTATAATCATCTATGTATAATCATCTAAAAAATATCAGTTAGAAATATTCAATTAGTTAAAAAAAAATATTATTTTGAAAAAATAAATCTAAAATGAAATTAAAATGTAAACTTGATTAACTAGACTTTCTATTAACAGTTACACTAATTGGTAAATATAATTAAAAGTAGTTTGATTATGTTTGAGTCCATTATTTCTATACATTTATAAACATAGTAGAAGTTATTCCATTTACTATTACATTTTAATTACTTTACAATTCAATCTCAGAGAAAGAGTAAACAAGGCCTTAAATGTTTTTTACATTGAATAATTATAAAAATAGACTTTAGTCAAATTTGCTATTTCATTATGCTAGTAACAAGCCTCGTTTTTATTTTTTATTTTTGTAATTCAACATTTTTTTTTCTTTTAAAAATCAATCAATTCAAAGGTTCTTTTTTCTATTTTTGTAATTCAACACTTTCATTTTTTTTTTGTTTTTTTTTAAAAACCATTCAATTCAAAGGAAATAAAACTTACGGAAATAGTAAGCTATCTTATTATGTTTTTTTTTTATTTAAATTCCATTTGTTACTTTAAATTTAATGTCAAATATTTACAATTCAAACTTAAGCATATTTATTCACCACAAAAATTTATATACAAAAGATTTTTAATCATATTCAACGCATTTTATATTTATTATAAATTAACTTAAAATAAAAAAAGCCTAGTTTCAACGACTTTTTATTTTATTTATAAGTGTTTTATAGAGATATTTTATTATTAAATTATGTCAATTACTTAATATTCAAATAAAACAAATAATATTCAGCCAATCTTTAAATTTCACTACAAAAATGCATGTTTTAACAAATTTTCTTTTTTAAAAATAAATGTTTTAACAAAATTTATACATGTGCATCATAATTGTTTTTTTAAAGTAAAAAATACAAAAACCACCCAGAGTGATAATTGTAATTACACTCTCAAACTTTTAATTATAAAAATTTGGCTCTTAAACTTCAACAATTGTTAAAATTTGACCATTAAACAAGTAAAGGACAACAGACAATTTTCGTGGATTGGGAAGACCAACTCTCATTCAAGGAAGTGGACCGTTTACGACAGCAGGCTGGGAAGGACAGATACTACTAAAAAAGCCGAAACTCGCATGTTTTTAAGCTTACAATAATGGTAGAGATTGAACCTAAAAATGATAAAAATTGAACCTCCAAACTTATATAAATGTTACAACTTATATAATTAAGTAAATTTGAAGGTTCAATTTGATTCAATTTTAATACTCGTATAAATTTTAAGTTTCAATTTTCAAAACTAAACTAAAAGTTTAAAGACATAATTGTAACTCCAATCATACTTCAGAGTTTAGTTTTGCAATTTTCCATTTTTTTACCCTTGAAATTAATAATTAATATGAACTATTCTTTCTTTCTTTCTTTTTTTTTTCTCCAGATCGAGAATCTTTGGAATTTACCACAAGACAGGTAAAGATTGAGTTGGTTTGCACTTGCTGTGAGCAGCACTTCTGTGCTTGCGTGTCTATCATTGAGCACTGCACCCTCACCACCGTCGCCGATAAGTCCGAATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGATAAGCGATTTCAGCCTCATCGGCGACAGTCACCGGCGCTGCACCTGTAAGTTTTTCTTCTCTCCATCTTCCTTTCTTTTCAACAATTCACTCTTTGGTTAAGATTCCATAAGTTTCTCTGTCGTTCAAGTATTTGCAGGGTTTAGGGTTTTGGCTGATCTCCGATTAGAGGACTTTGGTTTCTTTTGTTTCTAAAAGCTTCTTATGCGATTTCATTACGAGCTCTACAATGATTGAGAGGAAAGAGAAGCAAAGGACAGGGACAATTAGTATGGAAGATTGTTCCACTCTATTGGAAAGGTATTGGGCCTTCCCATTCCTGCCTTATGCTCCATGTTATGGAATTTGAGTTTTCCGGTTACCTGGTACTGCTGATTTTTCGTTCTTCACGTATTTGGATGTTATTTAATGAGGGGGTGTTTGCTTGTGGGACAGATATTCAGTAAGGACGATATTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTCCACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACATTACTGGAGAACATGGATTCTGTTACTGATCCTCTGGTCAGTTTACCAGTTTTCTTAAAGCTTCTGTTGATGACATGAGAATGCTTTTACCTTATCTTAAGTACTAAAATTGCGAGCTTAAGGGGGTATGATAAATAAGGGGTTTGAGAGGAGTAGTAGGATCTCTAGGGGTTGGAAGTTCTCTAAATAAGGGGTGTGAGAGAAATCAGTTAGGTCTTATTCTTTTTGACTGGAATGCTTTCTTGTAGGCTGTGGTGGATTCATTTTGTGGGCTGTTCTTTTTTTTGGTAACTCCTTTGACATTCTTTCATTTTTCTCCATGAAAGCTCGGTTTCTTATTAAAATTGGATATGAAAATATATAATCGAATGGGAAATTCACTATTCTAGTCTGGATTTTTAAAATTGAAAAATGCCTTAAAATCTAAAGGAAATAAACAGACCATTTATATGAGAAGATTTTGTGCTGTATTGTTTTCAGCCATAGGTCATGGACTCTTTTTCTTTATGATTTGTATGATCTTTGACTTTGACTTAGGTTTTTATATTTCCTTTATTACGGTTGTTTTTCTTTGTTTTGTCTGATCCACTTTTTTTGTTCACTGATATTGTAATTTGTCTCATTGTGTGACTGTTGTGAGTTCTAGACTTTTGACATTTGAAAGTCTTTCTGTTTGTTGAAACCACTGGAGTTCACAAGTAACTGAATTAACTTCTATTTTCTGGAAGAGAACTAGTTGAGTTTCATGGCATGTGCTTACAATTGATTAAATGAGTTTAAATTTTCTTGGAGGAGATGCTCATAATTTAAACTTGTTGCATTTCGTTGTTGATTTTCTTATTTAAAAGTTTGGTTTACAGGATTATGATAGTGACTTAGATTTTGAAATAGAACCTTTTCCATCTGTTAGCAGTGAGTCCTCGTATGAAGCTGCAGCATGTGTAAAGGTCAGTCCATTTGTTCATGGAACAGTTTTTTCCTCCTAGCTATGTGGCTTCGAACTTTGATGTCCTCAACATTCAAGATAAAAGTTTTTCCGGACAATGAAAATATGTTTGTAGTTCTAGGTTTAGTTAGACTTCACATCACATTTGGTTTCAAATCAATGTCTTGGCAGGTACTGATTGCTAATGGTATACCAAGTGAGTCAGATGTTCCAACTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTAAAAATCCTCAATATGCTTGTTTGATGCAAGGGATGTCTGTTACAATTCCACTTTCCATTCAGAGGCAGCCGATTCCAATGGCAGCAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCAGAAAAGAAGGAAGCCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGCGAATATCTTGAAAGGAGACTTCAAAGGGGATAGAACTGCTTCACAGCTATCTCAGGTGTTCTTTCATACTTTCTTTCGATGCTATTCTCCACCATTCTTTTAATTTAATAAATTATCCTACACGTATCAATTTCACTAATACATTCCTTATCTTGATCTTGCGTGGATGTTTTTTAAAGAGAAAAACTTGCATGGATGTTAGAGAGATATTCGTTGCAATATCGAATATTTAGTTTGCAGATATCATAAACAGTACTTTATACTTCTATGGGAACCTTTAGGGCTTGTTTGGTAGATAATCCAAATTTTGTTTTATGTTTTCAAATTCACTGGGTTCCATAGATATGTGTTTGACAGACAACCCGAATTTTGTTTCTGAAAACTGTTTTTAGATTCTATGATCTAAACCGTGCAAATTTTGAAAACAACATTTTTATATTTTCAATGTATCCAGATTTTGAATTTTAAAATTTAAAATGTAGAATTGTATTAAATAAAGATAAAAACATTTAGAAATACAATATATCATGTTATAAACACAAATCATTTTTTAAAAAATGTAATATGTATTATGTAATGTATTATACATTATATAATATTTAAAACAATAATTATATAATTTTTTTTAACATAAATCATATTCATAAATAATTAATAATAATTTATTGTCAACTAAATATTAGTGAGTACAATTATTTTTATGATTTATAAATATATAGTATCAAACACATTTTAAACAACTGTTTAAATTAGAATCCAATTTCTGAATCTGCTACCAAGCACATATTTGAAAACATGAAATACAACTCCTTTCATTGAATCTCTTGTTTTGAGATTTATGTCTTTAAATTCTCTACCAAACAGACCCTTAAGTTTTCAGTTTCAGAAGATGGATAACTCTTTACTTTCTACTTGGATAGACAATCACTTCATTTGTAGTTATGATTAAAACTTGCATGCTGTGCTAGGATTGGGAAGAATTTTATTCTTGTGCGACCTAGCTGTACACCGTTCATTCTCCGTAACAATTTTACTTTTCCAAAGGTTGTGGTCTTTTAAAAAATGCCTTTTGTATTCTATTTCCTTTCCAGGGAATTATTATCGACTATGTGCATCATGTGTTTAAATTACCTCTAGGATTGCGTACTTTGGATTTTAATGTCAACTGTATTTTCTGTGTGCATAAAATATAATCATCCATTGAGGATAAAGTTTACCCATGCCTTTTCTTCTAAATTTTTCTAAGCTCAATTGGCATATTATTCAGATATCCTGTTGGAGCAAAATTGTAGTTACGTTATGGCTTTATCCCTATCCATTGTCAACTACATACAAACATCAAGGTATTAACCATGGATAAATTTCAAAGAGATGAGCGTCAGAAAGTTACTACTATACCCCAATCTGATGTAGGGAGATTCAGGAAAAATCAGGTTAATCTTGGATAAATCCTAGATTTATACTATTGACTATCCCTTTTATTTATTACTTCCCTTATAAGTAAGGAGTCCCCTCTCATCTATTGGACACAATTGTCTATTGGGTTTAGATTCTTGGAGTATTCTCTCCTTTGTCCCTTTGGCCACACCAATTTGGTATGAAGTGTTTGCCTAGGCCGAGGTTGGTTGCCGTGGAAAAGCCAAAAATTCTTGATGATGTCGATCGCAAGAGGGAGCTTTGGGGGTTGTACTCATTTTCGAACGATAGACACCTAATTAGGGGAACCATTTTAAGTCGAGAAAACGAGAAGAAACTTGAAGAATTCGCCAAAAATTTGCATGAATGAAAATTATAGTTTGAAGAAGATGGCCAAGAAAAAGGTACATCAATTTGTGTAGGTAGGAAAGTAAAAAATACTCCCAAAAACAATGTTGGAATTCGTATTCAAGTGACTCAGCAGAAGAGAAAAATGTCGCTTGTGCTCGATTAGTAAAGAACCTATAAAGATCTACATTGAAATTTTGATTATAGTGATTTTGATTTTAGCAATTCTTACGAAGATTGTAAAGCAGATAAGGATAAAATGATGAATCGACATTACCTAGAAAGAGCTCGACGATCAAATTAGTCTTGGTTTTCAAGAAATCAATGTGAGAATTGAGAACGCAGGTGTGAAAATTGTGCCCATTCGAAGTCGTGATCGTTGTGATATTGAGGATGACCAATCTCTATTGACAACACACGAATTCCCCATGAAGTTTTAATCGAAGTATGAGTCCTTTGAAACCCTAAGCATGAGCAATGTTAAAAAAGAAGACGCGAAGTTTTCGTGTGAAGTAGTAGATGGAAAATTGTGTGTTCTTGGTATGGGTTTTTCTTTTGATTGTGTGTTTATTTAGATCCCAGAGATAAAATACAAAGAATTAGAAGGTAGAATTTGTGCAGAGAAGAAGATAAGAGCGGGGAGTTCTCCTCAGACCCAAACTATGTGGTTCGACCCAACCCATGTGGAGAATGTCAATTAGATAAGTTCAACGGATATGTTCGTTGAATTTCTTTTGAATTTTGTTGGATATTTTGATGAAGAGAATGTCATTATGTTTGAATAAATATTTTGAAACCCATGAAGAATTGATGAGGAGAATAAAGGAAAATTAGGTTAATCTTGGGTAAATTTTAAGTTAATCCCTTTTACACTCCCTTTTTGGTATTACTTTCCTTCTCAAAGTATTGGATACAATTCTCATTCTAATGAAGATTTAGTTTACATCCGGGGAGAATTTCTTCCTTATCTCTTTGGCTACACCACTCTTCATTGTCATGAGGCCATGATGTTTAGGAATCAAGAATTAAAATATCGATGTTGATACTGATATCGAGATTTCAATTTTACAGATATATTGATATAATATCGATATCAACAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTAAATAAAGAAATTTATACACATTATTTAAATTAATAATTGAGTTTTATTTTATTTTATTTTATTTATAATACTTAACTTTCATTATCAAAGGGAAGATACAGAAGAGGGGGAGATTGATATACGCACAAACCGACAAGTTGGCCTTGTTAGCTAAATGTGCCTGGCGGTATGCCAAGGAAGAAGGTTCACTTTGGTGTCAAGTAGTAGAAGCATCCATGGAAAAAACTTGTTTAATTGGCATATGGCTGGAAAATCATGTCTCAGTTTACGTAGTCCATGGATTAGCATTTCTAGATCCTGGATGAAAGTTGAAGCATTGGCTACATTCAAGATTGGTAGAGGCAGTAGAGTTGGGTTTTGGAACGTCCATGGATTGACAAGCTGCCTTTGAAAATGAGGTTTCCTCACATTTTTCGCGTTGCTCTTAATCCTAAAGGCTCAGTAGCAGAACATTGGATCATTGTCCTCATGGGCAGTTTTCTTCCGTAGACTTTTAAAGGAGGAGGAGATTCTTGAGTTTCAAAGCCTACTTGAATTATTATTAGAATTTAGAATGGGGGACAGGCCAGGTAGCCGAATATGGTCACTAGAAGCTAGTGAGGTTTTTGGGTAAAATCTCTTGTGAATCATTTATCATATTCTTCCCCCCTTGAAAATCTAGTTGAGAAAGCCCTTTGAAAGACTAAGAGCCCTCGCCGGGTGAACATTACAGTTTGGATAATGATGTTTGGCCTTCTAAATTGTGCATCTATTATGCAAAGCCTTCAATGACAAAGCAACTCCTTGGTTGGATCGATTTGAAATAGCAAGACTTAATGTTTCCTCATGGAGTGCTCTATCCAAACCCTTTGAAGATTTCTCAATCCAGGATATAAGTCTTAATTGGTGGGCTTTCATACACTTTGAGTCTTAGTTTTTGTTCTTCAATACAGCTGTGTTCAGCCTTGTCTATCATTAGAAGCCATAGTCTCTGAGCTTGCATTGTGTTCTTCATGGGTTTGTACTTCATTAGTATTGGTAGTGGATTTTTCTTGTAATATCCTCTCGATTTGCCCCTTTGTTTGTGGGTAGTTGTTCTTATATTATTGTTCTTTTTGCTTCAAGTTGTTTTTGCCCCACTGTATTCAGGGTGTATTATTATTGTTAAGGTTGGGTTGTTTTTGGAATTTGGATATTTTGCTTTCGTTCGGATATGATGAGAGTGCTAAGGGGGTGTTAGCCTAGCTGAGATGTCCGAGTGCACTCACTGATCCACAGCGCTATTGTAACTTAAGCATTAGTCTCATTTCATTAATTCAATGAAGAGACTCATTTCTGAAACTCAACTTGTGTAAGTATGAGTTGGGCAATAATTACAAAAGAATTTGGAGTTAAAAGACCGAGTAGAAGCAAAAAGAGTATCAATATCTCAAAAGTCATCCCTGCTTAGTTTTTTACTACAAAAGATTCTCGAGTTTCTTTCAAACCATAGCTTCAAGAGAAGGGCAAAAAACCCCATTAGACCAAAGGACTTTCGCCTGAGTATGAAACCTGGGGCCCCCAATAAGCTAAAGAAGAGCATCAGAGGCTTTGTTAAGCGAAACACTACTGCAAGTTGAACAATTTTTTTCTTTTTTACTTTTCATTGAAATAATGAATTAAGACTAATGCTCAAATTACAACGAACATATACAAAGAACAATAAGGCCAAATAAGTTGAACAAGTTAAACAACCTTCCCCAACAATCCACGACAAAAGGCAAAAAAAAAAGGAGATGCATGAAGGACTCCTCACTTCTTTAACAAAGAATGCACCAATTAGGGGAGTGAGCATAATTCCGCAAGGTTCTTTGGATTAAAGAGAAGATCCATAAGTAGTCTTTGACCTCCTTAGGATAGTTCGTTTTCCATATAAGATTAGAACTCTATTTGTTCAGGGCACCTTCTTTCACAATAAGCTTAGAGAGAAGGATTTAACTGAAAAGGAGCCCCATCATTTTCCAACATCCACGAGATAACATCATCTTTATTATTGGACCTAAAGATTTGGAGTTTGGTCATAAGATTGGCCCAATCATCAAATTCAATATCAGTGAGCTTTATTCTGAACTTTAAGTCCCAAGCATAGAGTGTTATACCAGCGATAGCTGACCGCTGAGTCCTTAGCAGTCACAATTATGAAAAGTCTAGGAATTCCACATAGAGTTTACTTCCAAACTAAGTAATAGATCTTATTACTAGTATTCAAATTCATATTGAGATTAATAGACAACTATTTGGTGTGTTTTATGTGTATTTATAAAAGATATCAGAGATATTGATTAATCGTTAATATCGAAGTTGAACCCTTTGATTTACGGATATATCGATGTATTAACGAATATTTCAACCTTGTTGGGATTTCATAGTTCTCCTGGTTCAAATCCTGATGACAAAGTGTTCCATTACGCGAATCTAAATTTTTATGATGGTCTAAGCATGTATATTCTTTCGCCCTTACTTGAACATAATCTATTCTATAAGTTGTATAGCATGTAGAATTTTTCAAACTTTGAAGTCGTATTTATGATTGAATATTCTTCCCCTATTATTTCTATTAATTCTAATTTCATTCTGTAGAGGTGGTCCATTATTAGGAAGCGACATGGTAATTTGAATGTGGGAGCTAGCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGGTTGCTCTATTGACCTTGATTATTATCTACGTTTGAATAACTTTAGATATTTATTATCAGCCTTTTAAATTTTATTTTCCTGTGGAATGATCATTCTGTTGAAAAGTATCTTACCCTTCCATTCGAAGTACTAGATAAAAAAAAAATTGATGGCAGGACTGAATAAAATAGACTGTTTGTAAATAAAAAATTCACTAATAGATTAGTTGACGTCATTATTGTTCAAGTGTATATCTCGAGTTTTAGCTGGGCTGTAATATATGTGGTTGCATTTATCTCAATAGTTTTTGTTGCTTCCATTTCTGCTAAATTTGAATATTTTCTTTGTTTACTTGTAGCAGCAAATTCAAATATAAACAGTAGCATTGTTTCTCCTGCAAGTGGTGCCGAAGCTTTGGTTCAAATGCAGAACCAGTCTCCACAGATTTCCATGCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAATCTGGAATCAACACTTCCAAGAATTCATTGATTATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATTGTTTCTCTGTCCGATGCTGCATCTTTACTGAAAGTTGCACAAACAAAAAAGGCCATCCACATAAAGTCCAAATGTGTTTCATCAACCCAATCACCTGTGGCTGGAAATGCACCAATCCACTTGGATGGACGCCCCAGTGTACATTATATTTCCCCAGGAAAAACACCGACTCCAGGGTCAAGCCATGTCGGCGGTAAATCTACTATGGGGTGCAATAACTCAGTGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTCTACTGCTATTTTGACAAACCCGCCATCAGACCAAGTAAGCCCAACAACTGAGTCTCCACTGAAGCAAGAGGTTAACAGTTCAGAAGAACGGAAAATTCCCAAGCCAATCATTACTGCAAAAGAGGAGTTCCGAGAAAACAGCTTGGCAAATGATGTCAAGATTAGGGGCTGACCAAACATAAAAGGAAGCAAGACCAAATCTATACCATATAATCATGGGGACCATATCACAAAGCTCTACAGGCATGTTAATAGCAGGTGAATGCACAGAAATATCAGTAGGTGTAAGTAGTCAATTGTACAGCAACAAAGAATTCCAGGTTTTCTTGCTAAAGAATGAACATTAATTGTGCACCTTTTTTTCTATTCTCTATGACATAGTGTAGTGTGCATATATATTGCCTTTTGTTTCCCTTGCATTCGTTTATTTATGAATTCTGTTGTTATTTAGGGTTGTGGGGTCAATGCAAAAGGCCAATGACAAGACTAAAATTTGGAAGCTCATATTCTTCAGCAAGACAAGGAAAATGTATACTGTTTATCTGGGATTGAAGGTATAATTCATTCTTGAAATTAATGGATGTTCTTCCTTTCATTACATTGGCGAAATTACACCCTCAAAAAAAGCTCATCACTGAGGTAGGGGCCCTTTAGCTAGATCACTATATTGTTTCTGTAGTTGTGGACTTGCACATGAGAACAGTGATGGCTTTTGCCTCTTTAACTTCACCCCAAACAAGAAAGACCGTAAAGGAATTCGTGTTCGGCTTAACGATTTCGATTGTTGTATGCTCTTCCCTGCTGAAATGCCAAATTCTTGATCAAGTCTCTCCAAAGCCTTCTCAACTTGGACTTGTAGTACTCTTACCCGACCTTGACCAACTTGTAACTCATCAGCTATCTTCCTATTTTCTTGCTTCATGTTGAGTACTTCACCTTGAAACTTTGCAGCTTGATAATCACTCAGTTCTGTCTTTTCTTCAGCTGAACCTTCATCTGTGATTCTTGATATATCACTTTGTATGTCACATAAGGATGAGAATCTATTGCATAGTTCATCTTTCAATACTGCACTATGTTCCAACCAAAGTGATAA

mRNA sequence

ATGGACGGCGGTAAAAACAGTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGCGACGACAAGGTCCGGGAATTAACGGCGATCGATCTGGCGATGAAGCTTCATTATATTAGAGGCGTTTATTTGTTCAGAGGGAGCGAAGAAGTGAGAAATTTGACGATTTATGACCTGAAAAAACCTTTGTTTCCGTTGTTGGAGCAATACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGCCGTTCATTAAGTGTAATGATAGTGGAGTGAGAATTGTGGAAGCAAATTGTGAGAAAAGTATTGATGAATGGCTTTCGATTATTGAGGGAGACGATAAATTTCTGCATCGCGATGGCTGTTTGGTTCATACTCAAGCCATTGGTCCCGATCTTGGATTTTCCCCTCTTGCTTTCATCCAGCTGACTCGGTTCAAGTGTGGCGGTCTCTCCGTGGGCCTCAGTTGGACTCACATTCTCGGCGATATCTTCTCCGCCTCCACCTTCATCAACGCATGGGGTTCCATCATGAACAACCGCCCGGCCCACCAACTCCGTCCGGCGCCGGCTGGTCTTATCTGGCCGTTCAGATCAACCAGACTGTCCACACCGCCGGTCAAGAGGCTCGACCCGACCGGAGACCTTTGGATCGGGTCAAGCGACTGCAAAATGGCGACGCTGTCATTTCGAATCACGGGGGAGCAATTGGATCGAATATTGAGCGTCGTCGGCCGGAATCGAGCGGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTTTGTCGAAAATACGGCTTGAGGACTCGGATTCGAGGACGATCTCGATCTATTCGACGAAATGCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCAGTCGCAGGAGCGGCGGAAGGCGAATTGGCGGAGCTGATTGTGAAGAAGAAAATTGATGAGGGCGGAGAAATCGAAGAAGTGGTGGAGAAAGGAAAGGAGGAATCGGATTTCATAGCGTACGGAGCGAGATTGACGTTCGTTGATTTGGAAGAAGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAAGCCAGTTCATGTGAATTATGAAATTGGGGGAGTTGGTGAAAACGGCGTCGTTTTGGTACTTCCAGGACCACCGCGTGACGACGGAAGAGACGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATGAACTGCAGAAGCAGTGGGAGATCCACTTCTGTGCTTGCGTGTCTATCATTGAGCACTGCACCCTCACCACCGTCGCCGATAAGTCCGAATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGATAAGCGATTTCAGCCTCATCGGCGACAGTCACCGGCGCTGCACCTCTTCTTATGCGATTTCATTACGAGCTCTACAATGATTGAGAGGAAAGAGAAGCAAAGGACAGGGACAATTAGTATGGAAGATTGTTCCACTCTATTGGAAAGATATTCAGTAAGGACGATATTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTCCACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACATTACTGGAGAACATGGATTCTGTTACTGATCCTCTGGATTATGATAGTGACTTAGATTTTGAAATAGAACCTTTTCCATCTGTTAGCAGTGAGTCCTCGTATGAAGCTGCAGCATGTGTAAAGGTACTGATTGCTAATGGTATACCAAGTGAGTCAGATGTTCCAACTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTAAAAATCCTCAATATGCTTGTTTGATGCAAGGGATGTCTGTTACAATTCCACTTTCCATTCAGAGGCAGCCGATTCCAATGGCAGCAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCAGAAAAGAAGGAAGCCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGCGAATATCTTGAAAGGAGACTTCAAAGGGGATAGAACTGCTTCACAGCTATCTCAGATTCTTGGAGTATTCTCTCCTTTGTCCCTTTGGCCACACCAATTTGGTATGAAGTGTTTGCCTAGGCCGAGGTTGGTTGCCGTGGAAAAGCCAAAAATTCTTGATGATGTCGATCGCAAGAGGGAGCTTTGGGGGTTGAAGCGACATGGTAATTTGAATGTGGGAGCTAGCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAATATAAACAGTAGCATTGTTTCTCCTGCAAGTGGTGCCGAAGCTTTGGTTCAAATGCAGAACCAGTCTCCACAGATTTCCATGCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAATCTGGAATCAACACTTCCAAGAATTCATTGATTATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATTGTTTCTCTGTCCGATGCTGCATCTTTACTGAAAGTTGCACAAACAAAAAAGGCCATCCACATAAAGTCCAAATGTGTTTCATCAACCCAATCACCTGTGGCTGGAAATGCACCAATCCACTTGGATGGACGCCCCAGTGTACATTATATTTCCCCAGGAAAAACACCGACTCCAGGGTCAAGCCATGTCGGCGGTAAATCTACTATGGGGTGCAATAACTCAGTGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTCTACTGCTATTTTGACAAACCCGCCATCAGACCAAGTAAGCCCAACAACTGAGTCTCCACTGAAGCAAGAGGTTAACAGTTCAGAAGAACGGAAAATTCCCAAGCCAATCATTACTGCAAAAGAGGAGTTCCGAGAAAACAGCTTGGCAAATGATGTCAAGATTAGGGGCTGACCAAACATAAAAGGAAGCAAGACCAAATCTATACCATATAATCATGGGGACCATATCACAAAGCTCTACAGGCATGTTAATAGCAGGGTTGTGGGGTCAATGCAAAAGGCCAATGACAAGACTAAAATTTGGAAGCTCATATTCTTCAGCAAGACAAGGAAAATGTATACTGTTTATCTGGGATTGAAGGTATAATTCATTCTTGAAATTAATGGATGTTCTTCCTTTCATTACATTGGCGAAATTACACCCTCAAAAAAAGCTCATCACTGAGGTAGGGGCCCTTTAGCTAGATCACTATATTGTTTCTGTAGTTGTGGACTTGCACATGAGAACAGTGATGGCTTTTGCCTCTTTAACTTCACCCCAAACAAGAAAGACCGTAAAGGAATTCGTGTTCGGCTTAACGATTTCGATTGTTGTATGCTCTTCCCTGCTGAAATGCCAAATTCTTGATCAAGTCTCTCCAAAGCCTTCTCAACTTGGACTTGTAGTACTCTTACCCGACCTTGACCAACTTGTAACTCATCAGCTATCTTCCTATTTTCTTGCTTCATGTTGAGTACTTCACCTTGAAACTTTGCAGCTTGATAATCACTCAGTTCTGTCTTTTCTTCAGCTGAACCTTCATCTGTGATTCTTGATATATCACTTTGTATGTCACATAAGGATGAGAATCTATTGCATAGTTCATCTTTCAATACTGCACTATGTTCCAACCAAAGTGATAA

Coding sequence (CDS)

ATGGACGGCGGTAAAAACAGTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGCGACGACAAGGTCCGGGAATTAACGGCGATCGATCTGGCGATGAAGCTTCATTATATTAGAGGCGTTTATTTGTTCAGAGGGAGCGAAGAAGTGAGAAATTTGACGATTTATGACCTGAAAAAACCTTTGTTTCCGTTGTTGGAGCAATACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGCCGTTCATTAAGTGTAATGATAGTGGAGTGAGAATTGTGGAAGCAAATTGTGAGAAAAGTATTGATGAATGGCTTTCGATTATTGAGGGAGACGATAAATTTCTGCATCGCGATGGCTGTTTGGTTCATACTCAAGCCATTGGTCCCGATCTTGGATTTTCCCCTCTTGCTTTCATCCAGCTGACTCGGTTCAAGTGTGGCGGTCTCTCCGTGGGCCTCAGTTGGACTCACATTCTCGGCGATATCTTCTCCGCCTCCACCTTCATCAACGCATGGGGTTCCATCATGAACAACCGCCCGGCCCACCAACTCCGTCCGGCGCCGGCTGGTCTTATCTGGCCGTTCAGATCAACCAGACTGTCCACACCGCCGGTCAAGAGGCTCGACCCGACCGGAGACCTTTGGATCGGGTCAAGCGACTGCAAAATGGCGACGCTGTCATTTCGAATCACGGGGGAGCAATTGGATCGAATATTGAGCGTCGTCGGCCGGAATCGAGCGGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTTTGTCGAAAATACGGCTTGAGGACTCGGATTCGAGGACGATCTCGATCTATTCGACGAAATGCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCAGTCGCAGGAGCGGCGGAAGGCGAATTGGCGGAGCTGATTGTGAAGAAGAAAATTGATGAGGGCGGAGAAATCGAAGAAGTGGTGGAGAAAGGAAAGGAGGAATCGGATTTCATAGCGTACGGAGCGAGATTGACGTTCGTTGATTTGGAAGAAGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAAGCCAGTTCATGTGAATTATGAAATTGGGGGAGTTGGTGAAAACGGCGTCGTTTTGGTACTTCCAGGACCACCGCGTGACGACGGAAGAGACGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATGAACTGCAGAAGCAGTGGGAGATCCACTTCTGTGCTTGCGTGTCTATCATTGAGCACTGCACCCTCACCACCGTCGCCGATAAGTCCGAATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGATAAGCGATTTCAGCCTCATCGGCGACAGTCACCGGCGCTGCACCTCTTCTTATGCGATTTCATTACGAGCTCTACAATGATTGAGAGGAAAGAGAAGCAAAGGACAGGGACAATTAGTATGGAAGATTGTTCCACTCTATTGGAAAGATATTCAGTAAGGACGATATTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTCCACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACATTACTGGAGAACATGGATTCTGTTACTGATCCTCTGGATTATGATAGTGACTTAGATTTTGAAATAGAACCTTTTCCATCTGTTAGCAGTGAGTCCTCGTATGAAGCTGCAGCATGTGTAAAGGTACTGATTGCTAATGGTATACCAAGTGAGTCAGATGTTCCAACTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTAAAAATCCTCAATATGCTTGTTTGATGCAAGGGATGTCTGTTACAATTCCACTTTCCATTCAGAGGCAGCCGATTCCAATGGCAGCAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCAGAAAAGAAGGAAGCCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGCGAATATCTTGAAAGGAGACTTCAAAGGGGATAGAACTGCTTCACAGCTATCTCAGATTCTTGGAGTATTCTCTCCTTTGTCCCTTTGGCCACACCAATTTGGTATGAAGTGTTTGCCTAGGCCGAGGTTGGTTGCCGTGGAAAAGCCAAAAATTCTTGATGATGTCGATCGCAAGAGGGAGCTTTGGGGGTTGAAGCGACATGGTAATTTGAATGTGGGAGCTAGCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAATATAAACAGTAGCATTGTTTCTCCTGCAAGTGGTGCCGAAGCTTTGGTTCAAATGCAGAACCAGTCTCCACAGATTTCCATGCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAATCTGGAATCAACACTTCCAAGAATTCATTGATTATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATTGTTTCTCTGTCCGATGCTGCATCTTTACTGAAAGTTGCACAAACAAAAAAGGCCATCCACATAAAGTCCAAATGTGTTTCATCAACCCAATCACCTGTGGCTGGAAATGCACCAATCCACTTGGATGGACGCCCCAGTGTACATTATATTTCCCCAGGAAAAACACCGACTCCAGGGTCAAGCCATGTCGGCGGTAAATCTACTATGGGGTGCAATAACTCAGTGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTCTACTGCTATTTTGACAAACCCGCCATCAGACCAAGTAAGCCCAACAACTGAGTCTCCACTGAAGCAAGAGGTTAACAGTTCAGAAGAACGGAAAATTCCCAAGCCAATCATTACTGCAAAAGAGGAGTTCCGAGAAAACAGCTTGGCAAATGATGTCAAGATTAGGGGCTGA

Protein sequence

MDGGKNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTISDKRFQPHRRQSPALHLFLCDFITSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG
Homology
BLAST of Cla97C11G220620 vs. NCBI nr
Match: XP_038898739.1 (uncharacterized protein LOC120086263 isoform X2 [Benincasa hispida])

HSP 1 Score: 790.0 bits (2039), Expect = 2.5e-224
Identity = 442/546 (80.95%), Postives = 459/546 (84.07%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MIERKE  RTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDW  LVKNTSTGISN
Sbjct: 1    MIERKEYIRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWHTLVKNTSTGISN 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLI 624
            VREYQLLWRHLAYRHT LENMDSVTDPLDYDSDLDFEIEPFPSVSSESS EAAACVKVLI
Sbjct: 61   VREYQLLWRHLAYRHTFLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSNEAAACVKVLI 120

Query: 625  ANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIP 684
            ANGIPSESDVP+SSAVEAPLTIGISNSQ+ST NL+N Q ACLMQGMSVTIPLS+QRQPIP
Sbjct: 121  ANGIPSESDVPSSSAVEAPLTIGISNSQSSTYNLENYQSACLMQGMSVTIPLSLQRQPIP 180

Query: 685  MAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 744
            M +ATEV DVNGAA ANAAS+KRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT
Sbjct: 181  MPSATEVLDVNGAAAANAASRKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 240

Query: 745  ASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV 804
            ASQLSQ   V                                          KR GNLNV
Sbjct: 241  ASQLSQRWSVIR----------------------------------------KRRGNLNV 300

Query: 805  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSP 864
            GAST+STTQKAQIDAAHRALSFALDLPVNNSKTANSNINSS+VS ASGAEA VQMQNQSP
Sbjct: 301  GASTSSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSVVSSASGAEASVQMQNQSP 360

Query: 865  QISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAAS 924
            QISMPSRPLLVEPLPS+VKSGI TSKNSL+MKSTHNSDSIVRATAVAAGARIVS SDAAS
Sbjct: 361  QISMPSRPLLVEPLPSSVKSGIITSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDAAS 420

Query: 925  LLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKST 984
            LLK AQ K AIHIKSKC           +P+HLD RPSVHYIS GKTPTPGS+ V GKST
Sbjct: 421  LLKAAQMKNAIHIKSKC-----------SPMHLDARPSVHYISTGKTPTPGSNFVVGKST 480

Query: 985  MGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKE 1044
            M  NNSVKAVSPKV HNRSTAI TNPPSD+VSPTTESPLKQ+VNSSEERKI + IITAKE
Sbjct: 481  MLGNNSVKAVSPKVQHNRSTAISTNPPSDRVSPTTESPLKQKVNSSEERKISELIITAKE 495

Query: 1045 EFRENS 1051
            EFRE +
Sbjct: 541  EFRETA 495

BLAST of Cla97C11G220620 vs. NCBI nr
Match: XP_038898738.1 (uncharacterized protein LOC120086263 isoform X1 [Benincasa hispida])

HSP 1 Score: 785.4 bits (2027), Expect = 6.1e-223
Identity = 442/547 (80.80%), Postives = 459/547 (83.91%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MIERKE  RTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDW  LVKNTSTGISN
Sbjct: 1    MIERKEYIRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWHTLVKNTSTGISN 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLI 624
            VREYQLLWRHLAYRHT LENMDSVTDPLDYDSDLDFEIEPFPSVSSESS EAAACVKVLI
Sbjct: 61   VREYQLLWRHLAYRHTFLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSNEAAACVKVLI 120

Query: 625  ANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIP 684
            ANGIPSESDVP+SSAVEAPLTIGISNSQ+ST NL+N Q ACLMQGMSVTIPLS+QRQPIP
Sbjct: 121  ANGIPSESDVPSSSAVEAPLTIGISNSQSSTYNLENYQSACLMQGMSVTIPLSLQRQPIP 180

Query: 685  MAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 744
            M +ATEV DVNGAA ANAAS+KRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT
Sbjct: 181  MPSATEVLDVNGAAAANAASRKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 240

Query: 745  ASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV 804
            ASQLSQ   V                                          KR GNLNV
Sbjct: 241  ASQLSQRWSVIR----------------------------------------KRRGNLNV 300

Query: 805  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIVSPASGAEALVQMQNQS 864
            GAST+STTQKAQIDAAHRALSFALDLPVNNSKT ANSNINSS+VS ASGAEA VQMQNQS
Sbjct: 301  GASTSSTTQKAQIDAAHRALSFALDLPVNNSKTAANSNINSSVVSSASGAEASVQMQNQS 360

Query: 865  PQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAA 924
            PQISMPSRPLLVEPLPS+VKSGI TSKNSL+MKSTHNSDSIVRATAVAAGARIVS SDAA
Sbjct: 361  PQISMPSRPLLVEPLPSSVKSGIITSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDAA 420

Query: 925  SLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKS 984
            SLLK AQ K AIHIKSKC           +P+HLD RPSVHYIS GKTPTPGS+ V GKS
Sbjct: 421  SLLKAAQMKNAIHIKSKC-----------SPMHLDARPSVHYISTGKTPTPGSNFVVGKS 480

Query: 985  TMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAK 1044
            TM  NNSVKAVSPKV HNRSTAI TNPPSD+VSPTTESPLKQ+VNSSEERKI + IITAK
Sbjct: 481  TMLGNNSVKAVSPKVQHNRSTAISTNPPSDRVSPTTESPLKQKVNSSEERKISELIITAK 496

Query: 1045 EEFRENS 1051
            EEFRE +
Sbjct: 541  EEFRETA 496

BLAST of Cla97C11G220620 vs. NCBI nr
Match: KAA8528364.1 (hypothetical protein F0562_035719 [Nyssa sinensis])

HSP 1 Score: 772.7 bits (1994), Expect = 4.1e-219
Identity = 486/1080 (45.00%), Postives = 645/1080 (59.72%), Query Frame = 0

Query: 9    ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF 68
            + ++K SSVVPA+ TG+DKV ELT +DL MKLHYIRG+Y F+ +E V+ L IYDLKKP+F
Sbjct: 10   VYDIKLSSVVPARVTGEDKVHELTNMDLIMKLHYIRGLYFFK-NEAVQGLDIYDLKKPMF 69

Query: 69   PLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRD 128
              L+ YY  SGR+R+   +  RPFIKCNDSGVRIVEA C K++DEWL++     K    D
Sbjct: 70   QWLDLYYSTSGRVRQ--SEKGRPFIKCNDSGVRIVEARCCKTLDEWLAM-----KDHSLD 129

Query: 129  GCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNN 188
              L++ Q +GPDLGFSPL FIQ T FKCGG SVGLSW HI+GD+FSASTFIN WG I+  
Sbjct: 130  NQLIYNQVLGPDLGFSPLVFIQFTWFKCGGFSVGLSWAHIIGDVFSASTFINMWGQILAG 189

Query: 189  RPAHQLRPAPAGLIWPFRSTRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDR 248
            +   Q    P    + F  +    P  +K +DP GD W+ +++CKM T S  +T +QLD 
Sbjct: 190  QVPPQSLRMPNTRKYEFPHSIAEKPFSLKMVDPVGDYWLTANNCKMETHSLHVTAKQLDH 249

Query: 249  ILS-VVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGM 308
            ILS   G  +A     F+  +A+ WKSL+K+R    + R ++I +    + E EI  NG+
Sbjct: 250  ILSKTYGPRKAAKVPPFQVFSALMWKSLAKVR-GKLEPRIVTICNNNSHSTENEISSNGL 309

Query: 309  EMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLE 368
             +S VEADFPV  A   EL  LI +K++DE   IEE+VE+G  +SDFI YGA LTFV+LE
Sbjct: 310  VISIVEADFPVMNANLSELVTLIGEKRVDETKMIEEMVERGNGKSDFIMYGANLTFVNLE 369

Query: 369  EANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPR-DDGRDGGRTVTVILPEKQLPD 428
            EANIYG EL+GQKPV  NY I G+G+ GVVLVLP P     G   GRT+TV LP  Q+ +
Sbjct: 370  EANIYGLELKGQKPVFANYTIAGIGDEGVVLVLPEPENTKQGGSNGRTLTVTLPGNQISE 429

Query: 429  LIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTI-----SDK 488
            L DEL+K+W+        I ++    TV   S  L  P +         Q++     S+K
Sbjct: 430  LKDELRKEWD-----RAGIKQNHEGVTVISVSRVL--PSQAPAPLSLNSQSVSSGTGSEK 489

Query: 489  RFQPHRRQSPALHLFLCDFITSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLRE 548
                  ++S  L       +  +TM+E+ +KQ+ G+I+ E+ STLL+RYS  T+ T+L+E
Sbjct: 490  SVSSTLQRSVTL-------LQRNTMVEKTKKQKKGSINEEEVSTLLQRYSPMTVLTVLQE 549

Query: 549  VAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDF 608
            VAQV  V IDW+ LV+ TS+GISN REYQ+LWRHLAYR+TL E +D   +PLD DSDL++
Sbjct: 550  VAQVPDVNIDWNALVEKTSSGISNAREYQMLWRHLAYRNTLHEVLDDEAEPLDDDSDLEY 609

Query: 609  EIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKN 668
            E+E FP+VSSE+S EAAA VKVLIA+G P++S +P  + VEAPLTI I N Q+S    +N
Sbjct: 610  ELEAFPAVSSEASTEAAAYVKVLIASGAPTDSSLPNGTTVEAPLTINIPNGQSSGVPSEN 669

Query: 669  PQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELM 728
             Q A  +QG ++TIP+ +Q+ P+P   A E  D NG+       +K+RK WS AED+EL+
Sbjct: 670  SQLA--IQGKNITIPVFVQKLPLP---AAEGMDANGSTVCGLPPRKKRKRWSTAEDMELI 729

Query: 729  AAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEK 788
            AAV+KCGEGNWA ILKGDFKGDRTASQLSQ   +                          
Sbjct: 730  AAVQKCGEGNWATILKGDFKGDRTASQLSQRWSIIR------------------------ 789

Query: 789  PKILDDVDRKRELWGLKRHGNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANS 848
                            KR  NLNVG+   S   +AQ+ AA RA+S AL++P+ ++ TA+ 
Sbjct: 790  ----------------KRQANLNVGSG--SQPSEAQL-AARRAVSLALNMPMVDNLTASC 849

Query: 849  NIN----------SSIVSPASG-----AEALVQMQNQSPQISMPSRPLLVEPLPSAVKSG 908
            +I+          S+   P +G       ++ Q+Q+QSPQ S+P+    +    S+ KS 
Sbjct: 850  SISTAGTISNTMPSNSARPTAGEALSVGGSISQIQHQSPQGSVPTIAPRMGTAGSSSKSQ 909

Query: 909  INTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSST 968
            + + K S   KS  +  S+V+A AVAAGA I +  DAASLLK  Q K AI I     S  
Sbjct: 910  VTSKKTS--TKSAISQGSLVKAAAVAAGAHIATPLDAASLLKATQAKNAIPIVPGGGSLL 969

Query: 969  QSPVAGNA----PIHLDGRPSVHYISPGKTPTPGSSHVGGKST--------MGCNNSVKA 1028
            +  VAGNA      H D  P+VHYI  G   TP SS+    S          G  NSV+ 
Sbjct: 970  KPSVAGNANPFPSTHSDAPPNVHYIRTGLAATPFSSYPAVPSNASGPSGTQQGQGNSVRP 1016

Query: 1029 VSPKVLHNRSTAILT-NPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLA 1053
              P V  N S A  T N  S+  +  T  P K EV  +EER +       KE+ +E+ +A
Sbjct: 1030 AEPPVQLNPSAAASTSNMSSEVTNAATSCPAKDEVKIAEERLVSGSHSAPKEQVQEDQVA 1016

BLAST of Cla97C11G220620 vs. NCBI nr
Match: XP_038898516.1 (protein ECERIFERUM 2 [Benincasa hispida])

HSP 1 Score: 768.5 bits (1983), Expect = 7.7e-218
Identity = 387/436 (88.76%), Postives = 402/436 (92.20%), Query Frame = 0

Query: 2   DGGKNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIY 61
           DG KN+LISELKFSSVVPAKATGD+KV+ELTAIDLAMKLHYIRGVY FR SEEVRNLTIY
Sbjct: 3   DGSKNTLISELKFSSVVPAKATGDNKVKELTAIDLAMKLHYIRGVYFFRASEEVRNLTIY 62

Query: 62  DLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGD 121
           DLKKPLF LLE YYVVSGRIRRRIED DRPFIKCNDSGVRIVEA+CEK+I+EWLSI  GD
Sbjct: 63  DLKKPLFQLLELYYVVSGRIRRRIEDEDRPFIKCNDSGVRIVEADCEKTIEEWLSI--GD 122

Query: 122 DKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINA 181
           DK  HRD CLVH+QAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTH+LGDIFSASTFIN 
Sbjct: 123 DKISHRDDCLVHSQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHVLGDIFSASTFINV 182

Query: 182 WGSIMNNRPAHQLRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRIT 241
           WG IMNNRPA  L PAPA L  P RSTRLSTPPVKRLDPTGDLWIGSSDCKMAT SFRIT
Sbjct: 183 WGYIMNNRPADHLLPAPAALTRPSRSTRLSTPPVKRLDPTGDLWIGSSDCKMATRSFRIT 242

Query: 242 GEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEI 301
            EQLDRIL V GRNRAVNFSTFEAIAAIFWK LSKIRLEDS+SRTISIYSTK PNREGEI
Sbjct: 243 AEQLDRILGVFGRNRAVNFSTFEAIAAIFWKFLSKIRLEDSNSRTISIYSTKFPNREGEI 302

Query: 302 PRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLT 361
           PRNGMEMSGVEA+FPVAGAAEGELAE+IVKKKIDEGGEI  +VEK ++ESDFIAYGARLT
Sbjct: 303 PRNGMEMSGVEAEFPVAGAAEGELAEMIVKKKIDEGGEIGGLVEKERDESDFIAYGARLT 362

Query: 362 FVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRD-GGRTVTVILPE 421
           FVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPP  DGRD GG TVTVILPE
Sbjct: 363 FVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPCGDGRDGGGLTVTVILPE 422

Query: 422 KQLPDLIDELQKQWEI 437
           K+LPDLIDELQKQW I
Sbjct: 423 KELPDLIDELQKQWGI 436

BLAST of Cla97C11G220620 vs. NCBI nr
Match: XP_008466161.1 (PREDICTED: uncharacterized protein LOC103503656 isoform X2 [Cucumis melo])

HSP 1 Score: 756.9 bits (1953), Expect = 2.3e-214
Identity = 424/556 (76.26%), Postives = 454/556 (81.65%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+
Sbjct: 1    MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLI 624
             REYQLLWRHLAYRHTLLE+M SVTD LDYDSDLDFE+EPFPSV SESS EAAACVKVLI
Sbjct: 61   AREYQLLWRHLAYRHTLLEDMHSVTDSLDYDSDLDFEVEPFPSVGSESSNEAAACVKVLI 120

Query: 625  ANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIP 684
            ANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP
Sbjct: 121  ANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASL-QGISVTIPLSIQRQPIP 180

Query: 685  MAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 744
            +  A EVFDVNGAAGA+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRT
Sbjct: 181  VPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRT 240

Query: 745  ASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV 804
            ASQLSQ   V                                          KR  NLN+
Sbjct: 241  ASQLSQRWSVIR----------------------------------------KRRCNLNL 300

Query: 805  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQS 864
            GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV S AS +E+ VQMQNQS
Sbjct: 301  GASTSSTTQKAQIDAAHRALNFALDLPVNNTKTANSNINSSIVSSSASASESSVQMQNQS 360

Query: 865  PQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAA 924
            PQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDSIVRATAVAAGARIVS SDAA
Sbjct: 361  PQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTHNSDSIVRATAVAAGARIVSPSDAA 420

Query: 925  SLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKS 984
            SL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKS
Sbjct: 421  SLMKATQTKNAIHIKSKC-----------TPMHLDARPNVHYISTGKTPTPSSNYVSGKS 480

Query: 985  TMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAK 1044
            TM  NNS+KAVSPK+LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAK
Sbjct: 481  TMVGNNSMKAVSPKILHHRSAAISTNTPSNQVSPTTESPLKQEVNSSEERKTPEAIITAK 504

Query: 1045 EEFRENSLANDVKIRG 1060
            EEFRENS  NDVKIRG
Sbjct: 541  EEFRENSTGNDVKIRG 504

BLAST of Cla97C11G220620 vs. ExPASy Swiss-Prot
Match: Q39048 (Protein ECERIFERUM 2 OS=Arabidopsis thaliana OX=3702 GN=CER2 PE=1 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 4.3e-78
Identity = 174/437 (39.82%), Postives = 264/437 (60.41%), Query Frame = 0

Query: 5   KNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLK 64
           + S ++ ++ SSVVPA   G++K R+LT +DLAMKLHY+R VY F+G+   R+ T+ D+K
Sbjct: 2   EGSPVTSVRLSSVVPASVVGENKPRQLTPMDLAMKLHYVRAVYFFKGA---RDFTVADVK 61

Query: 65  KPLF---PLLEQYYVVSGRIRRRIEDGDR-----PFIKCNDSGVRIVEANCEK-SIDEWL 124
             +F    LL+ Y+ VSGRIR    D D      P+I+CNDSG+R+VEAN E+ ++++WL
Sbjct: 62  NTMFTLQSLLQSYHHVSGRIRMSDNDNDTSAAAIPYIRCNDSGIRVVEANVEEFTVEKWL 121

Query: 125 SIIEGDDKFL-HRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFS 184
            +   DD+ + HR   LV+   +GPDL FSPL F+Q+T+FKCGGL +GLSW HILGD+FS
Sbjct: 122 EL---DDRSIDHR--FLVYDHVLGPDLTFSPLVFLQITQFKCGGLCIGLSWAHILGDVFS 181

Query: 185 ASTFINAWGSIMN-NRPAHQLRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKM 244
           ASTF+   G +++ + P   + P    L    R+       ++++D  G+ W+ ++ CKM
Sbjct: 182 ASTFMKTLGQLVSGHAPTKPVYPKTPELTSHARNDG-EAISIEKIDSVGEYWLLTNKCKM 241

Query: 245 ATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTK 304
               F  +   +D +++     R   FS  + + A+ WKSL  IR E +++  I+I   K
Sbjct: 242 GRHIFNFSLNHIDSLMAKY-TTRDQPFSEVDILYALIWKSLLNIRGE-TNTNVITICDRK 301

Query: 305 CPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDF 364
              +        + +S VE +  + G +  ELA LI  +K +E G I+ ++E+ K  SDF
Sbjct: 302 ---KSSTCWNEDLVISVVEKNDEMVGIS--ELAALIAGEKREENGAIKRMIEQDKGSSDF 361

Query: 365 IAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRT 424
             YGA LTFV+L+E ++Y  E+ G KP  VNY I GVG+ GVVLV P       ++  R 
Sbjct: 362 FTYGANLTFVNLDEIDMYELEINGGKPDFVNYTIHGVGDKGVVLVFP------KQNFARI 416

Query: 425 VTVILPEKQLPDLIDEL 431
           V+V++PE+ L  L +E+
Sbjct: 422 VSVVMPEEDLAKLKEEV 416

BLAST of Cla97C11G220620 vs. ExPASy Swiss-Prot
Match: Q9LIS1 (Protein ECERIFERUM 26-like OS=Arabidopsis thaliana OX=3702 GN=CER26L PE=2 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 9.6e-70
Identity = 158/433 (36.49%), Postives = 241/433 (55.66%), Query Frame = 0

Query: 13  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLE 72
           + S+V  +  +      E T +DLAMKLHY++ VY++  +   R+LT+ D+K PLF +  
Sbjct: 16  RLSTVSASLPSETGTTHEPTGLDLAMKLHYLKAVYIY-SAGTARDLTVMDVKAPLFSVFY 75

Query: 73  QYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRDGCLV 132
           Q   + GR RR   +  RP++KCND G R VE++C+ +++EWL +    D+ +  D  LV
Sbjct: 76  QIPCIIGRFRR--HESGRPYLKCNDCGTRFVESHCDLTVEEWLRV---PDRSV--DESLV 135

Query: 133 HTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAH 192
           + Q +GPDL FSPL +IQ+TRF CGGL++GLSW HI+GD FS S F N W         +
Sbjct: 136 YHQPVGPDLAFSPLLYIQMTRFSCGGLALGLSWAHIMGDPFSLSHFFNLWAQAFAGGKIY 195

Query: 193 QLRPAPAGLIWPFRSTRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSV 252
             + +     +   ++    P  VK++D  GDLW+  ++ KM T SF +T   L     V
Sbjct: 196 CPKTSVTERDFQNPTSTFKKPDSVKQVDLVGDLWVAPNNSKMTTFSFNLTVNDLKTHFPV 255

Query: 253 VGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGV 312
            G         FE +  I WK ++ +R E S   TI++  +     +    RNG  +S +
Sbjct: 256 NGDGE------FEILTGIIWKCVATVRGE-SAPVTITVIRSDPKKLKPRAVRNGQMISSI 315

Query: 313 EADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIY 372
             DF VA A+  E+ + I + K DE   I+E+V+   + SDFI YGA LTFVD+ E + Y
Sbjct: 316 HVDFSVAEASLEEIVKSIGEAK-DERVVIDEIVD---DVSDFIVYGANLTFVDMSEVDFY 375

Query: 373 GFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDELQ 432
             ++ G+ P  V   + G+G++G V+VLPG   ++     R VTV LP       +DE++
Sbjct: 376 EAKVMGKSPESVYCNVQGIGDDGAVVVLPGVVEEE-----RVVTVTLP-------VDEIE 417

Query: 433 K-QWEIHFCACVS 444
           K +WE+  C  ++
Sbjct: 436 KVKWEMKKCGLIT 417

BLAST of Cla97C11G220620 vs. ExPASy Swiss-Prot
Match: Q9SVM9 (Protein ECERIFERUM 26 OS=Arabidopsis thaliana OX=3702 GN=CER26 PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 7.1e-65
Identity = 150/427 (35.13%), Postives = 232/427 (54.33%), Query Frame = 0

Query: 9   ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF 68
           +  ++ S+V   + T      E T +DLAMKLHY++  Y++  +E  R+LT+  LK+ +F
Sbjct: 14  VHSIRLSTVGATRPTETGTTHEPTGLDLAMKLHYLKAAYIY-SAETARDLTVRHLKEAMF 73

Query: 69  PLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRD 128
            L +Q    +GR  RR  D  RP+IKCND G R VE  C  +++EWLS     D+ +  D
Sbjct: 74  MLFDQIAWTTGRFSRR--DSGRPYIKCNDCGTRFVEGQCNLTVEEWLS---KPDRSV--D 133

Query: 129 GCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNN 188
             LV+   IGP+L FSPL ++Q+TRFKCGGL +GLSW +I+GD FS     N W   +  
Sbjct: 134 EFLVYHHPIGPELTFSPLIYVQMTRFKCGGLGLGLSWANIIGDAFSLFYAFNLWAKAITG 193

Query: 189 RPAH-QLRPAPAGLIWPFRSTRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRIT-GEQL 248
              +    P+     +   +  +  P  +KR++P GDLW+  +D K+A   F ++  +Q+
Sbjct: 194 EKIYAPTTPSIGERRFQSPNPTVKDPVSIKRVEPVGDLWVTPNDKKLANYCFNLSVADQI 253

Query: 249 DRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNG 308
                  G +   +   FE +A I WK ++K+R+E     T++I      + +    RN 
Sbjct: 254 SPHFPAKGDD---SIPVFEILAGIIWKCIAKVRVEPKPV-TVTIIKKDPNDLKLNAIRNS 313

Query: 309 MEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDL 368
             +S V  DFPVA A   EL + + + K DE   IEE+ E      DF+ YGA+LTF+DL
Sbjct: 314 QVISSVSVDFPVAEATVEELVKAMGEAK-DERCGIEEIGESCDGNLDFVVYGAKLTFLDL 373

Query: 369 EEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRTVTVILPEKQLPD 428
              ++Y  ++ G+ P  V   + G+GE G+V+V      ++     R VTV LPE+++  
Sbjct: 374 TGEDLYEAKVMGKSPESVYCNVEGIGEEGLVVVYAAAKSEE-----RVVTVTLPEEEMER 422

Query: 429 LIDELQK 433
           +  E +K
Sbjct: 434 VKLEFKK 422

BLAST of Cla97C11G220620 vs. ExPASy Swiss-Prot
Match: A0A2H5AIZ1 (Hydroxycinnamoyltransferase OS=Narcissus pseudonarcissus OX=39639 GN=HCT PE=2 SV=1)

HSP 1 Score: 92.4 bits (228), Expect = 3.2e-17
Identity = 84/296 (28.38%), Postives = 136/296 (45.95%), Query Frame = 0

Query: 8   LISELKFSSVV-PAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYD---L 67
           +I  +K +++V P++ T   ++   + +DL +   +   VY +R      +   +D   L
Sbjct: 1   MIINIKETTMVRPSQPTPSQRLWN-SNLDLVVPRFHTPSVYFYRRPGNASD--FFDARVL 60

Query: 68  KKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDK 127
           K+ L   L  +Y ++GR+ R  EDG R  I CN  GVR V A  + +IDE+     GD  
Sbjct: 61  KEALGRALVPFYPMAGRLARD-EDG-RVEIDCNGEGVRFVVAETDSAIDEF-----GDFA 120

Query: 128 FLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWG 187
                  L+     G D+   PL  +Q+T FKCGG S+G+   H + D  S   FIN+W 
Sbjct: 121 PTMELKKLIPKVEYGDDISAFPLLVLQITHFKCGGTSLGVGMQHHVADGASGLHFINSWS 180

Query: 188 SIMN----------NRPAHQLR--PAPAGLIWPFR-STRLSTPPVKRLDPTGDLWIGSSD 247
            I            +R   + R  P+P+     ++ +  ++T P    DPT    + S  
Sbjct: 181 DIARGLDIAVPPFIDRSLLRARDPPSPSFPHIEYQPAPSMNTSPAPIQDPT----VKSDP 240

Query: 248 CKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRT 287
              A   F++T +QLD + S V    +  +S++  +A   W+  S  R    D RT
Sbjct: 241 TATAVSIFKLTKQQLDLLKSRV----SAKYSSYALVAGHVWRCTSIARGLPDDQRT 278

BLAST of Cla97C11G220620 vs. ExPASy Swiss-Prot
Match: A0PDV5 (Rosmarinate synthase OS=Plectranthus scutellarioides OX=4142 GN=RAS PE=1 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 1.4e-15
Identity = 81/281 (28.83%), Postives = 128/281 (45.55%), Query Frame = 0

Query: 11  ELKFSSVVPAKATGDDKVRELTAIDLAMKLHY-IRGVYLFRGSEEVRNLTIYDLKKPLFP 70
           E+K S+++   A        L+ +DL    +Y    V+ +             LK+ L  
Sbjct: 4   EVKDSTMIKPSAETPGGSLWLSNLDLLSPANYHTLSVHFYSHDGSDNFFDAAGLKESLSR 63

Query: 71  LLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRDG 130
            L ++Y  +GR++    +G+R  I CN+ G+ +VEA C+ ++DE      GD  F  R  
Sbjct: 64  ALVEFYPYAGRLKL---NGNRLEIDCNNEGLLLVEAECDGALDEL-----GD--FAPRPE 123

Query: 131 C-LVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNN 190
             L+        +   PL   QLTRFKCGG+++G++  H L D  +A  FIN W      
Sbjct: 124 LNLIPKVDYSRGISTYPLMVFQLTRFKCGGVALGVANEHHLSDGVAALHFINTW------ 183

Query: 191 RPAHQLRPAPAGLIWP-FRSTRLS--TPPVKRLD-------PTGDLWIGSSDCKMATLSF 250
             AH  R APA    P F  + LS   PP  +         PT +  +  +D  +A   F
Sbjct: 184 --AHLSRGAPAPTPLPHFDRSSLSARNPPQPQFSHAEYQPPPTLENPLPHTD--IAHSRF 243

Query: 251 RITGEQLDRILS-----VVGRNRAVNFSTFEAIAAIFWKSL 275
           ++T +QL+ + S             ++STFE +A   W+S+
Sbjct: 244 KLTRDQLNSLKSKFKTAPADGGAGKSYSTFEVLAGHIWRSV 264

BLAST of Cla97C11G220620 vs. ExPASy TrEMBL
Match: A0A5J5ADN2 (HTH myb-type domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_035719 PE=3 SV=1)

HSP 1 Score: 772.7 bits (1994), Expect = 2.0e-219
Identity = 486/1080 (45.00%), Postives = 645/1080 (59.72%), Query Frame = 0

Query: 9    ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF 68
            + ++K SSVVPA+ TG+DKV ELT +DL MKLHYIRG+Y F+ +E V+ L IYDLKKP+F
Sbjct: 10   VYDIKLSSVVPARVTGEDKVHELTNMDLIMKLHYIRGLYFFK-NEAVQGLDIYDLKKPMF 69

Query: 69   PLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRD 128
              L+ YY  SGR+R+   +  RPFIKCNDSGVRIVEA C K++DEWL++     K    D
Sbjct: 70   QWLDLYYSTSGRVRQ--SEKGRPFIKCNDSGVRIVEARCCKTLDEWLAM-----KDHSLD 129

Query: 129  GCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNN 188
              L++ Q +GPDLGFSPL FIQ T FKCGG SVGLSW HI+GD+FSASTFIN WG I+  
Sbjct: 130  NQLIYNQVLGPDLGFSPLVFIQFTWFKCGGFSVGLSWAHIIGDVFSASTFINMWGQILAG 189

Query: 189  RPAHQLRPAPAGLIWPFRSTRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDR 248
            +   Q    P    + F  +    P  +K +DP GD W+ +++CKM T S  +T +QLD 
Sbjct: 190  QVPPQSLRMPNTRKYEFPHSIAEKPFSLKMVDPVGDYWLTANNCKMETHSLHVTAKQLDH 249

Query: 249  ILS-VVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGM 308
            ILS   G  +A     F+  +A+ WKSL+K+R    + R ++I +    + E EI  NG+
Sbjct: 250  ILSKTYGPRKAAKVPPFQVFSALMWKSLAKVR-GKLEPRIVTICNNNSHSTENEISSNGL 309

Query: 309  EMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLE 368
             +S VEADFPV  A   EL  LI +K++DE   IEE+VE+G  +SDFI YGA LTFV+LE
Sbjct: 310  VISIVEADFPVMNANLSELVTLIGEKRVDETKMIEEMVERGNGKSDFIMYGANLTFVNLE 369

Query: 369  EANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPR-DDGRDGGRTVTVILPEKQLPD 428
            EANIYG EL+GQKPV  NY I G+G+ GVVLVLP P     G   GRT+TV LP  Q+ +
Sbjct: 370  EANIYGLELKGQKPVFANYTIAGIGDEGVVLVLPEPENTKQGGSNGRTLTVTLPGNQISE 429

Query: 429  LIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTI-----SDK 488
            L DEL+K+W+        I ++    TV   S  L  P +         Q++     S+K
Sbjct: 430  LKDELRKEWD-----RAGIKQNHEGVTVISVSRVL--PSQAPAPLSLNSQSVSSGTGSEK 489

Query: 489  RFQPHRRQSPALHLFLCDFITSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLRE 548
                  ++S  L       +  +TM+E+ +KQ+ G+I+ E+ STLL+RYS  T+ T+L+E
Sbjct: 490  SVSSTLQRSVTL-------LQRNTMVEKTKKQKKGSINEEEVSTLLQRYSPMTVLTVLQE 549

Query: 549  VAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDF 608
            VAQV  V IDW+ LV+ TS+GISN REYQ+LWRHLAYR+TL E +D   +PLD DSDL++
Sbjct: 550  VAQVPDVNIDWNALVEKTSSGISNAREYQMLWRHLAYRNTLHEVLDDEAEPLDDDSDLEY 609

Query: 609  EIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKN 668
            E+E FP+VSSE+S EAAA VKVLIA+G P++S +P  + VEAPLTI I N Q+S    +N
Sbjct: 610  ELEAFPAVSSEASTEAAAYVKVLIASGAPTDSSLPNGTTVEAPLTINIPNGQSSGVPSEN 669

Query: 669  PQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELM 728
             Q A  +QG ++TIP+ +Q+ P+P   A E  D NG+       +K+RK WS AED+EL+
Sbjct: 670  SQLA--IQGKNITIPVFVQKLPLP---AAEGMDANGSTVCGLPPRKKRKRWSTAEDMELI 729

Query: 729  AAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEK 788
            AAV+KCGEGNWA ILKGDFKGDRTASQLSQ   +                          
Sbjct: 730  AAVQKCGEGNWATILKGDFKGDRTASQLSQRWSIIR------------------------ 789

Query: 789  PKILDDVDRKRELWGLKRHGNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANS 848
                            KR  NLNVG+   S   +AQ+ AA RA+S AL++P+ ++ TA+ 
Sbjct: 790  ----------------KRQANLNVGSG--SQPSEAQL-AARRAVSLALNMPMVDNLTASC 849

Query: 849  NIN----------SSIVSPASG-----AEALVQMQNQSPQISMPSRPLLVEPLPSAVKSG 908
            +I+          S+   P +G       ++ Q+Q+QSPQ S+P+    +    S+ KS 
Sbjct: 850  SISTAGTISNTMPSNSARPTAGEALSVGGSISQIQHQSPQGSVPTIAPRMGTAGSSSKSQ 909

Query: 909  INTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSST 968
            + + K S   KS  +  S+V+A AVAAGA I +  DAASLLK  Q K AI I     S  
Sbjct: 910  VTSKKTS--TKSAISQGSLVKAAAVAAGAHIATPLDAASLLKATQAKNAIPIVPGGGSLL 969

Query: 969  QSPVAGNA----PIHLDGRPSVHYISPGKTPTPGSSHVGGKST--------MGCNNSVKA 1028
            +  VAGNA      H D  P+VHYI  G   TP SS+    S          G  NSV+ 
Sbjct: 970  KPSVAGNANPFPSTHSDAPPNVHYIRTGLAATPFSSYPAVPSNASGPSGTQQGQGNSVRP 1016

Query: 1029 VSPKVLHNRSTAILT-NPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLA 1053
              P V  N S A  T N  S+  +  T  P K EV  +EER +       KE+ +E+ +A
Sbjct: 1030 AEPPVQLNPSAAASTSNMSSEVTNAATSCPAKDEVKIAEERLVSGSHSAPKEQVQEDQVA 1016

BLAST of Cla97C11G220620 vs. ExPASy TrEMBL
Match: A0A1S3CRZ4 (uncharacterized protein LOC103503656 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503656 PE=4 SV=1)

HSP 1 Score: 756.9 bits (1953), Expect = 1.1e-214
Identity = 424/556 (76.26%), Postives = 454/556 (81.65%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+
Sbjct: 1    MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLI 624
             REYQLLWRHLAYRHTLLE+M SVTD LDYDSDLDFE+EPFPSV SESS EAAACVKVLI
Sbjct: 61   AREYQLLWRHLAYRHTLLEDMHSVTDSLDYDSDLDFEVEPFPSVGSESSNEAAACVKVLI 120

Query: 625  ANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIP 684
            ANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP
Sbjct: 121  ANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASL-QGISVTIPLSIQRQPIP 180

Query: 685  MAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 744
            +  A EVFDVNGAAGA+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRT
Sbjct: 181  VPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRT 240

Query: 745  ASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV 804
            ASQLSQ   V                                          KR  NLN+
Sbjct: 241  ASQLSQRWSVIR----------------------------------------KRRCNLNL 300

Query: 805  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQS 864
            GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV S AS +E+ VQMQNQS
Sbjct: 301  GASTSSTTQKAQIDAAHRALNFALDLPVNNTKTANSNINSSIVSSSASASESSVQMQNQS 360

Query: 865  PQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAA 924
            PQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDSIVRATAVAAGARIVS SDAA
Sbjct: 361  PQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTHNSDSIVRATAVAAGARIVSPSDAA 420

Query: 925  SLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKS 984
            SL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKS
Sbjct: 421  SLMKATQTKNAIHIKSKC-----------TPMHLDARPNVHYISTGKTPTPSSNYVSGKS 480

Query: 985  TMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAK 1044
            TM  NNS+KAVSPK+LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAK
Sbjct: 481  TMVGNNSMKAVSPKILHHRSAAISTNTPSNQVSPTTESPLKQEVNSSEERKTPEAIITAK 504

Query: 1045 EEFRENSLANDVKIRG 1060
            EEFRENS  NDVKIRG
Sbjct: 541  EEFRENSTGNDVKIRG 504

BLAST of Cla97C11G220620 vs. ExPASy TrEMBL
Match: A0A5D3E5P5 (HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G005670 PE=4 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 2.8e-213
Identity = 424/557 (76.12%), Postives = 454/557 (81.51%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+
Sbjct: 1    MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLI 624
             REYQLLWRHLAYRHTLLE+M SVTD LDYDSDLDFE+EPFPSV SESS EAAACVKVLI
Sbjct: 61   AREYQLLWRHLAYRHTLLEDMHSVTDSLDYDSDLDFEVEPFPSVGSESSNEAAACVKVLI 120

Query: 625  ANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIP 684
            ANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP
Sbjct: 121  ANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASL-QGISVTIPLSIQRQPIP 180

Query: 685  MAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRT 744
            +  A EVFDVNGAAGA+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRT
Sbjct: 181  VPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRT 240

Query: 745  ASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV 804
            ASQLSQ   V                                          KR  NLN+
Sbjct: 241  ASQLSQRWSVIR----------------------------------------KRRCNLNL 300

Query: 805  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SPASGAEALVQMQNQ 864
            GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV S AS +E+ VQMQNQ
Sbjct: 301  GASTSSTTQKAQIDAAHRALNFALDLPVNNTKTAANSNINSSIVSSSASASESSVQMQNQ 360

Query: 865  SPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDA 924
            SPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDSIVRATAVAAGARIVS SDA
Sbjct: 361  SPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTHNSDSIVRATAVAAGARIVSPSDA 420

Query: 925  ASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGK 984
            ASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GK
Sbjct: 421  ASLMKATQTKNAIHIKSKC-----------TPMHLDARPNVHYISTGKTPTPSSNYVSGK 480

Query: 985  STMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITA 1044
            STM  NNS+KAVSPK+LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITA
Sbjct: 481  STMVGNNSMKAVSPKILHHRSAAISTNTPSNQVSPTTESPLKQEVNSSEERKTPEAIITA 505

Query: 1045 KEEFRENSLANDVKIRG 1060
            KEEFRENS  NDVKIRG
Sbjct: 541  KEEFRENSTGNDVKIRG 505

BLAST of Cla97C11G220620 vs. ExPASy TrEMBL
Match: A0A1S3CQJ6 (uncharacterized protein LOC103503656 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503656 PE=4 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 8.1e-213
Identity = 424/561 (75.58%), Postives = 454/561 (80.93%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+
Sbjct: 1    MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPL-----DYDSDLDFEIEPFPSVSSESSYEAAAC 624
             REYQLLWRHLAYRHTLLE+M SVTD L     DYDSDLDFE+EPFPSV SESS EAAAC
Sbjct: 61   AREYQLLWRHLAYRHTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVGSESSNEAAAC 120

Query: 625  VKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQ 684
            VKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQ
Sbjct: 121  VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASL-QGISVTIPLSIQ 180

Query: 685  RQPIPMAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDF 744
            RQPIP+  A EVFDVNGAAGA+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDF
Sbjct: 181  RQPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDF 240

Query: 745  KGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRH 804
            KGDRTASQLSQ   V                                          KR 
Sbjct: 241  KGDRTASQLSQRWSVIR----------------------------------------KRR 300

Query: 805  GNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQ 864
             NLN+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV S AS +E+ VQ
Sbjct: 301  CNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTANSNINSSIVSSSASASESSVQ 360

Query: 865  MQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVS 924
            MQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDSIVRATAVAAGARIVS
Sbjct: 361  MQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTHNSDSIVRATAVAAGARIVS 420

Query: 925  LSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSH 984
             SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++
Sbjct: 421  PSDAASLMKATQTKNAIHIKSKC-----------TPMHLDARPNVHYISTGKTPTPSSNY 480

Query: 985  VGGKSTMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKP 1044
            V GKSTM  NNS+KAVSPK+LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+ 
Sbjct: 481  VSGKSTMVGNNSMKAVSPKILHHRSAAISTNTPSNQVSPTTESPLKQEVNSSEERKTPEA 509

Query: 1045 IITAKEEFRENSLANDVKIRG 1060
            IITAKEEFRENS  NDVKIRG
Sbjct: 541  IITAKEEFRENSTGNDVKIRG 509

BLAST of Cla97C11G220620 vs. ExPASy TrEMBL
Match: A0A5A7T5C8 (HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold92G002020 PE=4 SV=1)

HSP 1 Score: 746.1 bits (1925), Expect = 2.0e-211
Identity = 424/562 (75.44%), Postives = 454/562 (80.78%), Query Frame = 0

Query: 505  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISN 564
            MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+
Sbjct: 1    MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60

Query: 565  VREYQLLWRHLAYRHTLLENMDSVTDPL-----DYDSDLDFEIEPFPSVSSESSYEAAAC 624
             REYQLLWRHLAYRHTLLE+M SVTD L     DYDSDLDFE+EPFPSV SESS EAAAC
Sbjct: 61   AREYQLLWRHLAYRHTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVGSESSNEAAAC 120

Query: 625  VKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQ 684
            VKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQ
Sbjct: 121  VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASL-QGISVTIPLSIQ 180

Query: 685  RQPIPMAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDF 744
            RQPIP+  A EVFDVNGAAGA+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDF
Sbjct: 181  RQPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDF 240

Query: 745  KGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRH 804
            KGDRTASQLSQ   V                                          KR 
Sbjct: 241  KGDRTASQLSQRWSVIR----------------------------------------KRR 300

Query: 805  GNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SPASGAEALV 864
             NLN+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV S AS +E+ V
Sbjct: 301  CNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTAANSNINSSIVSSSASASESSV 360

Query: 865  QMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIV 924
            QMQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDSIVRATAVAAGARIV
Sbjct: 361  QMQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTHNSDSIVRATAVAAGARIV 420

Query: 925  SLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSS 984
            S SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S+
Sbjct: 421  SPSDAASLMKATQTKNAIHIKSKC-----------TPMHLDARPNVHYISTGKTPTPSSN 480

Query: 985  HVGGKSTMGCNNSVKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPK 1044
            +V GKSTM  NNS+KAVSPK+LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+
Sbjct: 481  YVSGKSTMVGNNSMKAVSPKILHHRSAAISTNTPSNQVSPTTESPLKQEVNSSEERKTPE 510

Query: 1045 PIITAKEEFRENSLANDVKIRG 1060
             IITAKEEFRENS  NDVKIRG
Sbjct: 541  AIITAKEEFRENSTGNDVKIRG 510

BLAST of Cla97C11G220620 vs. TAIR 10
Match: AT4G24510.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 294.7 bits (753), Expect = 3.0e-79
Identity = 174/437 (39.82%), Postives = 264/437 (60.41%), Query Frame = 0

Query: 5   KNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLK 64
           + S ++ ++ SSVVPA   G++K R+LT +DLAMKLHY+R VY F+G+   R+ T+ D+K
Sbjct: 2   EGSPVTSVRLSSVVPASVVGENKPRQLTPMDLAMKLHYVRAVYFFKGA---RDFTVADVK 61

Query: 65  KPLF---PLLEQYYVVSGRIRRRIEDGDR-----PFIKCNDSGVRIVEANCEK-SIDEWL 124
             +F    LL+ Y+ VSGRIR    D D      P+I+CNDSG+R+VEAN E+ ++++WL
Sbjct: 62  NTMFTLQSLLQSYHHVSGRIRMSDNDNDTSAAAIPYIRCNDSGIRVVEANVEEFTVEKWL 121

Query: 125 SIIEGDDKFL-HRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFS 184
            +   DD+ + HR   LV+   +GPDL FSPL F+Q+T+FKCGGL +GLSW HILGD+FS
Sbjct: 122 EL---DDRSIDHR--FLVYDHVLGPDLTFSPLVFLQITQFKCGGLCIGLSWAHILGDVFS 181

Query: 185 ASTFINAWGSIMN-NRPAHQLRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKM 244
           ASTF+   G +++ + P   + P    L    R+       ++++D  G+ W+ ++ CKM
Sbjct: 182 ASTFMKTLGQLVSGHAPTKPVYPKTPELTSHARNDG-EAISIEKIDSVGEYWLLTNKCKM 241

Query: 245 ATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTK 304
               F  +   +D +++     R   FS  + + A+ WKSL  IR E +++  I+I   K
Sbjct: 242 GRHIFNFSLNHIDSLMAKY-TTRDQPFSEVDILYALIWKSLLNIRGE-TNTNVITICDRK 301

Query: 305 CPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDF 364
              +        + +S VE +  + G +  ELA LI  +K +E G I+ ++E+ K  SDF
Sbjct: 302 ---KSSTCWNEDLVISVVEKNDEMVGIS--ELAALIAGEKREENGAIKRMIEQDKGSSDF 361

Query: 365 IAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRT 424
             YGA LTFV+L+E ++Y  E+ G KP  VNY I GVG+ GVVLV P       ++  R 
Sbjct: 362 FTYGANLTFVNLDEIDMYELEINGGKPDFVNYTIHGVGDKGVVLVFP------KQNFARI 416

Query: 425 VTVILPEKQLPDLIDEL 431
           V+V++PE+ L  L +E+
Sbjct: 422 VSVVMPEEDLAKLKEEV 416

BLAST of Cla97C11G220620 vs. TAIR 10
Match: AT3G23840.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 266.9 bits (681), Expect = 6.8e-71
Identity = 158/433 (36.49%), Postives = 241/433 (55.66%), Query Frame = 0

Query: 13  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLE 72
           + S+V  +  +      E T +DLAMKLHY++ VY++  +   R+LT+ D+K PLF +  
Sbjct: 16  RLSTVSASLPSETGTTHEPTGLDLAMKLHYLKAVYIY-SAGTARDLTVMDVKAPLFSVFY 75

Query: 73  QYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRDGCLV 132
           Q   + GR RR   +  RP++KCND G R VE++C+ +++EWL +    D+ +  D  LV
Sbjct: 76  QIPCIIGRFRR--HESGRPYLKCNDCGTRFVESHCDLTVEEWLRV---PDRSV--DESLV 135

Query: 133 HTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAH 192
           + Q +GPDL FSPL +IQ+TRF CGGL++GLSW HI+GD FS S F N W         +
Sbjct: 136 YHQPVGPDLAFSPLLYIQMTRFSCGGLALGLSWAHIMGDPFSLSHFFNLWAQAFAGGKIY 195

Query: 193 QLRPAPAGLIWPFRSTRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSV 252
             + +     +   ++    P  VK++D  GDLW+  ++ KM T SF +T   L     V
Sbjct: 196 CPKTSVTERDFQNPTSTFKKPDSVKQVDLVGDLWVAPNNSKMTTFSFNLTVNDLKTHFPV 255

Query: 253 VGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGV 312
            G         FE +  I WK ++ +R E S   TI++  +     +    RNG  +S +
Sbjct: 256 NGDGE------FEILTGIIWKCVATVRGE-SAPVTITVIRSDPKKLKPRAVRNGQMISSI 315

Query: 313 EADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIY 372
             DF VA A+  E+ + I + K DE   I+E+V+   + SDFI YGA LTFVD+ E + Y
Sbjct: 316 HVDFSVAEASLEEIVKSIGEAK-DERVVIDEIVD---DVSDFIVYGANLTFVDMSEVDFY 375

Query: 373 GFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDELQ 432
             ++ G+ P  V   + G+G++G V+VLPG   ++     R VTV LP       +DE++
Sbjct: 376 EAKVMGKSPESVYCNVQGIGDDGAVVVLPGVVEEE-----RVVTVTLP-------VDEIE 417

Query: 433 K-QWEIHFCACVS 444
           K +WE+  C  ++
Sbjct: 436 KVKWEMKKCGLIT 417

BLAST of Cla97C11G220620 vs. TAIR 10
Match: AT4G13840.1 (HXXXD-type acyl-transferase family protein )

HSP 1 Score: 250.8 bits (639), Expect = 5.0e-66
Identity = 150/427 (35.13%), Postives = 232/427 (54.33%), Query Frame = 0

Query: 9   ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF 68
           +  ++ S+V   + T      E T +DLAMKLHY++  Y++  +E  R+LT+  LK+ +F
Sbjct: 14  VHSIRLSTVGATRPTETGTTHEPTGLDLAMKLHYLKAAYIY-SAETARDLTVRHLKEAMF 73

Query: 69  PLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSIDEWLSIIEGDDKFLHRD 128
            L +Q    +GR  RR  D  RP+IKCND G R VE  C  +++EWLS     D+ +  D
Sbjct: 74  MLFDQIAWTTGRFSRR--DSGRPYIKCNDCGTRFVEGQCNLTVEEWLS---KPDRSV--D 133

Query: 129 GCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNN 188
             LV+   IGP+L FSPL ++Q+TRFKCGGL +GLSW +I+GD FS     N W   +  
Sbjct: 134 EFLVYHHPIGPELTFSPLIYVQMTRFKCGGLGLGLSWANIIGDAFSLFYAFNLWAKAITG 193

Query: 189 RPAH-QLRPAPAGLIWPFRSTRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRIT-GEQL 248
              +    P+     +   +  +  P  +KR++P GDLW+  +D K+A   F ++  +Q+
Sbjct: 194 EKIYAPTTPSIGERRFQSPNPTVKDPVSIKRVEPVGDLWVTPNDKKLANYCFNLSVADQI 253

Query: 249 DRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNG 308
                  G +   +   FE +A I WK ++K+R+E     T++I      + +    RN 
Sbjct: 254 SPHFPAKGDD---SIPVFEILAGIIWKCIAKVRVEPKPV-TVTIIKKDPNDLKLNAIRNS 313

Query: 309 MEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDL 368
             +S V  DFPVA A   EL + + + K DE   IEE+ E      DF+ YGA+LTF+DL
Sbjct: 314 QVISSVSVDFPVAEATVEELVKAMGEAK-DERCGIEEIGESCDGNLDFVVYGAKLTFLDL 373

Query: 369 EEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRTVTVILPEKQLPD 428
              ++Y  ++ G+ P  V   + G+GE G+V+V      ++     R VTV LPE+++  
Sbjct: 374 TGEDLYEAKVMGKSPESVYCNVEGIGEEGLVVVYAAAKSEE-----RVVTVTLPEEEMER 422

Query: 429 LIDELQK 433
           +  E +K
Sbjct: 434 VKLEFKK 422

BLAST of Cla97C11G220620 vs. TAIR 10
Match: AT1G09710.1 (Homeodomain-like superfamily protein )

HSP 1 Score: 203.0 bits (515), Expect = 1.2e-51
Identity = 160/427 (37.47%), Postives = 234/427 (54.80%), Query Frame = 0

Query: 512 QRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLL 571
           +R   I+  D +TLL RY + TI  +L+E++  S  ++DW+ LVK T+TGI+N REYQLL
Sbjct: 11  RRKRIITEGDIATLLLRYDMETILRMLQEISYCSETKMDWNALVKKTTTGITNAREYQLL 70

Query: 572 WRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLIANGIPSE 631
           WRHL+YRH LL   D    PLD DSD++ E+E  P+VS E+S EA A VKV+ A+ + SE
Sbjct: 71  WRHLSYRHPLLPVEDDAL-PLDDDSDMECELEASPAVSHEASVEAIAHVKVMAASYVLSE 130

Query: 632 SDVPTSSAVEAPLTIGISNS--QASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAAT 691
           SD+   S VEAPLTI I  +  + S +  ++P  +   +GM++  P+ +Q+       +T
Sbjct: 131 SDILDDSTVEAPLTINIPYALPEGSQEPSESPWSS---RGMNINFPVCLQK-----VTST 190

Query: 692 EVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLS 751
           E  + NG+AG + A +++RK WS  ED EL AAV++CGEGNWA+I+KGDF+G+RTASQLS
Sbjct: 191 EGMNGNGSAGISMAFRRKRKRWSAEEDEELFAAVKRCGEGNWAHIVKGDFRGERTASQLS 250

Query: 752 QILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNVGASTT 811
           Q                       R   + K                + H + +V     
Sbjct: 251 Q-----------------------RWALIRK----------------RCHTSTSVSQCGL 310

Query: 812 STTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMP 871
             T+ A++ A + ALS AL     ++K A   + ++     +  EA     +Q  Q S P
Sbjct: 311 QGTE-AKL-AVNHALSLALGNRPPSNKLAIGLMPTTSSCTITETEANGGSSSQGQQQSKP 370

Query: 872 SRPLLVEPLPSAVKSGINTSKNSLIMK----STHNSDSIVRATAVAAGARIVSLSDAASL 931
               +V+ LP A  S +  +K+ ++ K    ST  SD +V A +VAA A +  +  AAS 
Sbjct: 371 ----IVQALPRAGTS-LPAAKSRVVKKTTASSTSRSDLMVTANSVAAAACMGDVLTAASG 382

Query: 932 LKVAQTK 933
            KV   K
Sbjct: 431 RKVEPGK 382

BLAST of Cla97C11G220620 vs. TAIR 10
Match: AT1G09710.2 (Homeodomain-like superfamily protein )

HSP 1 Score: 201.4 bits (511), Expect = 3.5e-51
Identity = 159/431 (36.89%), Postives = 236/431 (54.76%), Query Frame = 0

Query: 512 QRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLL 571
           +R   I+  D +TLL RY + TI  +L+E++  S  ++DW+ LVK T+TGI+N REYQLL
Sbjct: 11  RRKRIITEGDIATLLLRYDMETILRMLQEISYCSETKMDWNALVKKTTTGITNAREYQLL 70

Query: 572 WRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLIANGIPSE 631
           WRHL+YRH LL   D    PLD DSD++ E+E  P+VS E+S EA A VKV+ A+ + SE
Sbjct: 71  WRHLSYRHPLLPVEDDAL-PLDDDSDMECELEASPAVSHEASVEAIAHVKVMAASYVLSE 130

Query: 632 SDVPTSSAVEAPLTIGISNS--QASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAAT 691
           SD+   S VEAPLTI I  +  + S +  ++P  +   +GM++  P+ +Q+       +T
Sbjct: 131 SDILDDSTVEAPLTINIPYALPEGSQEPSESPWSS---RGMNINFPVCLQK-----VTST 190

Query: 692 EVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLS 751
           E  + NG+AG + A +++RK WS  ED EL AAV++CGEGNWA+I+KGDF+G+RTASQLS
Sbjct: 191 EGMNGNGSAGISMAFRRKRKRWSAEEDEELFAAVKRCGEGNWAHIVKGDFRGERTASQLS 250

Query: 752 Q---ILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKREL-WGLKRHGNLNVG 811
           Q   ++      S    Q G++       V       L +     +L  G     +    
Sbjct: 251 QRWALIRKRCHTSTSVSQCGLQGTEAKLAVNHALSLALGNRPPSNKLAIGTSSRRSFPAN 310

Query: 812 ASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQ 871
           +S    T+ A +      L+  L    N      ++  +   + A+G  +     +Q  Q
Sbjct: 311 SSIYVITEDALVWLPLACLNQKLAYLFNCGLMPTTSSCTITETEANGGSS-----SQGQQ 370

Query: 872 ISMPSRPLLVEPLPSAVKSGINTSKNSLIMK----STHNSDSIVRATAVAAGARIVSLSD 931
            S P    +V+ LP A  S +  +K+ ++ K    ST  SD +V A +VAA A +  +  
Sbjct: 371 QSKP----IVQALPRAGTS-LPAAKSRVVKKTTASSTSRSDLMVTANSVAAAACMGDVLT 422

Query: 932 AASLLKVAQTK 933
           AAS  KV   K
Sbjct: 431 AASGRKVEPGK 422

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038898739.12.5e-22480.95uncharacterized protein LOC120086263 isoform X2 [Benincasa hispida][more]
XP_038898738.16.1e-22380.80uncharacterized protein LOC120086263 isoform X1 [Benincasa hispida][more]
KAA8528364.14.1e-21945.00hypothetical protein F0562_035719 [Nyssa sinensis][more]
XP_038898516.17.7e-21888.76protein ECERIFERUM 2 [Benincasa hispida][more]
XP_008466161.12.3e-21476.26PREDICTED: uncharacterized protein LOC103503656 isoform X2 [Cucumis melo][more]
Match NameE-valueIdentityDescription
Q390484.3e-7839.82Protein ECERIFERUM 2 OS=Arabidopsis thaliana OX=3702 GN=CER2 PE=1 SV=1[more]
Q9LIS19.6e-7036.49Protein ECERIFERUM 26-like OS=Arabidopsis thaliana OX=3702 GN=CER26L PE=2 SV=1[more]
Q9SVM97.1e-6535.13Protein ECERIFERUM 26 OS=Arabidopsis thaliana OX=3702 GN=CER26 PE=2 SV=1[more]
A0A2H5AIZ13.2e-1728.38Hydroxycinnamoyltransferase OS=Narcissus pseudonarcissus OX=39639 GN=HCT PE=2 SV... [more]
A0PDV51.4e-1528.83Rosmarinate synthase OS=Plectranthus scutellarioides OX=4142 GN=RAS PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5J5ADN22.0e-21945.00HTH myb-type domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_0357... [more]
A0A1S3CRZ41.1e-21476.26uncharacterized protein LOC103503656 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5D3E5P52.8e-21376.12HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A1S3CQJ68.1e-21375.58uncharacterized protein LOC103503656 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A5A7T5C82.0e-21175.44HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
Match NameE-valueIdentityDescription
AT4G24510.13.0e-7939.82HXXXD-type acyl-transferase family protein [more]
AT3G23840.16.8e-7136.49HXXXD-type acyl-transferase family protein [more]
AT4G13840.15.0e-6635.13HXXXD-type acyl-transferase family protein [more]
AT1G09710.11.2e-5137.47Homeodomain-like superfamily protein [more]
AT1G09710.23.5e-5136.89Homeodomain-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D1.10.10.60coord: 700..750
e-value: 7.9E-11
score: 43.3
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1006..1036
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 969..989
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1006..1025
NoneNo IPR availablePANTHERPTHR47206HOMEODOMAIN-LIKE SUPERFAMILY PROTEINcoord: 505..1052
NoneNo IPR availablePANTHERPTHR47206:SF1HOMEODOMAIN-LIKE SUPERFAMILY PROTEINcoord: 505..1052
NoneNo IPR availableCDDcd11660SANT_TRFcoord: 709..750
e-value: 6.73686E-11
score: 56.4215
IPR023213Chloramphenicol acetyltransferase-like domain superfamilyGENE3D3.30.559.10coord: 229..428
e-value: 1.9E-10
score: 42.2
IPR023213Chloramphenicol acetyltransferase-like domain superfamilyGENE3D3.30.559.10coord: 8..214
e-value: 1.1E-35
score: 125.1
IPR003480TransferasePFAMPF02458Transferasecoord: 16..286
e-value: 4.9E-28
score: 97.9
IPR017930Myb domainPROSITEPS51294HTH_MYBcoord: 703..736
score: 10.579382
IPR001005SANT/Myb domainPROSITEPS50090MYB_LIKEcoord: 703..747
score: 6.574026
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 706..749

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C11G220620.2Cla97C11G220620.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016747 acyltransferase activity, transferring groups other than amino-acyl groups