CmUC01G009150 (gene) Watermelon (USVL531) v1

Overview
NameCmUC01G009150
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionformin-like protein 5
LocationCmU531Chr01: 11007065 .. 11051357 (+)
RNA-Seq ExpressionCmUC01G009150
SyntenyCmUC01G009150
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAAGGGAATATATTGTGGGAGGTAAATGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATCTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATCTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATCTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCTTTTATATGGGTTTGGCAAAGTTGAGTATTAACGAATATGTTAAAAATCAGTTGTTAAACACACGTCGATTCTCACGCCTATTAAATTCAGTGACACGACCATTCAACAAGTGGTTAACACATTTGATTTATTTTAATAAAAAGAAAAGAAAGCCAGCACGTCGCTTTTGGAGACTTTGAAAACATCACCTGCAAACATCAACAACAAAATGTCAATTTTGGGTTGTTAGTTTTACTGCGAAAGCTTCTTGCCAAAAGGATTTCTAACAAATTTCGTACCTTTCTAGTTTCTCGAAAAACGAACATGTTATTTTTAGCTTTTATGCCAAGATGGTACCTTGCTTTCAATTGTGGCGGCAATGCTATATAAAAAGGAAGCAATAGAAGAATGAGATGGGCAAACGCACAACAATAATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGGAGCTTGCGCTCTCATACTCCTTGGGCAGCACATTGCAACCTACCAAGCAGATGCTAAAAATTTTTTTGTAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGGTTAAATTTGGGATGTATCCTAGAAATAACCCTATTCCACCATCTGCTCCAAGCGGACGAGTACCATCATGAGTGCCCCGACCAGCCTCGATTTTCGTCGCAACCCCCATCAATCAAGGGTTATTTTTACTGCTTTTAATGCAAGTGTTACGGCTGTAATTTGGGTCAGCCCCCTTCAATTTCACCGATACGAAAAATGGGGGTCTAAAAGTTTATGAAAAGGGAATATATTGTGGGAGGTAACTGTAACAGTTGCCGCTCACATCTATTTTAATCGTCTCTCCTTCTATCTAATACTTTCTTGTTCTTAGGCTATGTCAATATAACCGTCGTTTCCTCGCCTGTTTAGAATATAGGGACAAAGGTGATAAATCACACAAATGCTTGTATTTTATCGTACTGTAGTCTCGCCGAACATTTCTTGTTCTTTTTGACGACTTGTTGCTGAATTATTGTATATGAATAAGATAAATCGCCACAATCGGTTTTCCTCAACGACGCTTATGTCGTGACAACTTTTGCTAATTTTCCGATAACAGCTACAGTTGATAGAAGAAATAAGAAATTAATGTGAGTGTTGAGGAAAAGAGATTCCTTTATGTGCCAAGATTAGAATGGTAACAGTAAAAAATCACACGAGTTTATAGCGTCAAGTGTAAATCTAAATATAGTGAAATGAGTACGTGGGTGTCGTCCTCATGATTGGCATGCTACTTGGTCGAGTATTGTAGTTACCAGCCGCTTGTCTTATCTTGAGAACCTAACAGAAGAAGTGGAATTTATCTACAATATACCTAAGATGTACTTAGTCGGATACAACAATGCGAGAAGACACTTCAGAGCATACCTCATGTATTGGAACAACATTCGGCCAACTGAATAACACAAACAAAATTTTTGTGAACTTTGAATGCTTACAGACGCCATAGGTAGCTGGTCAATGTTGTCCTCTCTCGATCTCGACGCGTTCAACCCCGCTGCTTTTTTGCTTGGAAGCTCCAACTTATGCGTCCAGCTGGTCCAATTAGATCATATTATCGTCCAAATCGTCGATGTCGTTCTAATTGGTCAAAATCAGACAAAGCCATCAAATATAGACGCGAGAGCACCAAAATACATTGCAAAATTGTGAAAATTGTGATTCATTCGTGGAAACGTTCCTGTGTCGACAGAAAATTTCAAAAGTTTTGTAGATAAGTGATAAACGATGACGGTTTTAAAATGTCATCAGAAGGAGCATTTGATGGCAGTTCTGGACCTTTTGGGTTGCATGCAAATATAATGGGATTGCGCTTTGGATGGCCTTTAACAGCGCCGGAACTGTTTACGAAAATAAAATTAGACGCAAGGTAAATGTCTCTTGCAGGCTAAGAAGAAATAGGGAAAGCGTATAACCCACGTAAAGCCTAGAAAGAGTTTTGAGTGACTATTTTATTCTTCAATTCTATTTTATTAAAAATGTTCTTATGAAGTATGATTTACAATGAACTGTGTAATATGCATAATGTTGATTGCATTACTCATAAGACATGTTATATATTTCTAAAGTTGATAGAAAGCATTCTTATGAAATGTTATTGAATAAAGTTTTGTATGATTTCTATGAAAGGATTTTCATATGAACTATTATGTGAACGAAATTCTAAAATGTTTTAGAGGAGGCTCAGTGCTCAAGGACACGAGAAGGCACGTGAGTTTCCCCGGGACCCAGTACTAAAGGACATGAAAAGGTACTTGGGCAACCCGTTATATTCCTATGTGCACATAGAGGTTGATTTGGCTGGTGCGTCGAATTTGCTACGCATTGAACCAGGGACGATTTTGACGTTGGGAGTGTTGAGCCTTGCTCCACCCCAACTAAGGTGGTTTTCGACGTTGAGTGCGTAGAAGCGTTGCTCCGCCTCAACTAAGGGAAAAGTTTGAAAAACTAAATTGATATTGATAAACTCCTTGGAAAACACTCATGTATGATTTAAGATTTGATTTATGAGATTAATGGATTTTATGGAAAGCATGTTTAATTTGAACAAAATGTTAAACTCTTACTTATTTGTTATACGTAACATGCCTGTTTTCAATGAATGCTCGTTTTACAAAGATTTGTCACTCACTGGGCTGTTAGCTTATGTTTTCAAATGTTTTCCTTTCCTCCCCCCAGGTAGCAAGGACGTTCCTGGTGCTTAGCCTACTTCAGCTTACCCTGCCTAAGTCTCGAGTTCTCAAAATTCAGTCAAGTCTGAGAGTTGTACGTTGTATATGATTTATGTAGATGGCATGTAGACCATGTAGTTTGAGTGGGGATTGCAGTGGAATTTTGTACATTGTGTGATGGATATGGAACTGAAATTATGCTATGTAATAAAAGTGTACTCTATACAGGGAATCATATAATGTTATATGCATGTGTAGTATGACTGAGTTCTATGTATGTATATGAGTAAATGTATGCACAGGAAGGAATGTAAGATGGGACAATATGCGTCACAAGTGTTGGTGATGTTTATTGTCTTCACATCCCCTTTCAGGTTAGGTGGGTAATCCGGGAGGGGGTGTGACATAAACGCATGATTTTGGTAGTTGAATGATGAAAAATGTATCACCATGCGTTGGCACTACTTACAAAAGAAAATACCTGAAAAATGATGAGCTTTTATGTTCCTAACGCAAGGTTACGAAGAAAAGAAGCGCAAACAATCGAAGTACGAAGAGAGGAAGCTGCTGCGAGGACAAGTGGAAACCCATGCGTCGAGCCTATACGCTGAGCCCATGCTTTGAGACCATGCGTCGAGCCGAGTAGGTTGAGACTATGCGTCGAGCCTATGCGTTGACAATCTGAAAAGTAAAAACTGACGCATGCTCAAACAGTTGTCATGTCATTAAGAAAAAGGACAACACCCAATCAGATCATGGCAGAACAAATAGGCTGACAATCTGAAAAGTCTATGCAGCGTGGCTGAAATTTAATGCAAACGTGTCAGGTCATCATGATATCAACTTTTTGACGCCTATAAATAGTCCACAAGCTTCTCCAAGAAGAGGAGGAAGAGAGACCAGTTCAAGAGGCGAAAGCTCTGCACTATTTCTCCCTCACCATAACCAGCCTCCATTTCGAGCTCTTCTGAGAGAAGGATTCTGAGCATCTTCAGAGTCGTCTCCCAGTGAGTGAGAACCTTTCACCTTATGGAATCTTTGTATCAGATAGCAACCATCTCAAGCCTCCATGAAGCAAGACGTTGCCAGTGGCGGCTTGACGCCTTTCATCTCTATTTTATTTCCATTTCTCTCCTTCTTGCTTATATTTTGTTAATGACATTTGAGAGAATGATTTATTGTCTTAATATGAATTTGATTGACTATCATCTTCCGTCTATTTCCTATTAAATACTTTACACCATGCTGTCTAAATGCTAAATGAGTTAAATGTTAAAAGTTAGTTTTTCTTGAGCAATTCTGAGTATGTTACATCTTGATTAATGCATATCTTTAGCTTGCATAATTTTGTCTGACAAACAAACGTAGTAAGCATGACTGTCGAGCTTGAGAGAGTGGATAGTTAATCTTTAGCATTAATTAAGACTAGATGAGTGCAGAGATGTAGTTCATCTTCTGCCAACGCAAACTTACCATGCGTCTTAGAGATAAGATAAATGTGGGTCGAAACATCGAGAGATGGATTCACACACGCAGTTTGTAGGTATGTAATTGTACTTAGAAAGTTTAGGTAACAACTAGCTCTTGCGTTCAACTCTACCTCTATCGCATGGTTCCATTCATTCTCAACGCATGCTATCTGACGCATTGTTTTACATATGACTACCCAACGCCGCGTCCACTTCATTATTAGTAGTAGGAAATTCTTTTCATTCTTTATGTATAGTTTTACACACATAATCAACAAACGCATATATATTTACATGTACAGTCACCGCATAGTCATACATTTAGTCTCAACGCCTAGTTATTACAAATCCCTGCGTTCGACCCTGGACTAACCAGGAAACCTATAAATGCTTATACTTGGGTCTTTATAGGAAAACTTGCTTAGTCATGATCACATCTAGAGTAATTCTCAACGCATCCTTGTACGGAAATGATAAATCTATTTTTTTGTGCAACGAGCTAAGAAACTCAATTTAATTGGATTAGAAATTTAAACGAACGAACTTAATGAACACTATACTTAAGTACTTCCTGATCAGCACAATGATCTTTTATAACAACCTTCGGCGCATAGCTGCTGCCTATCGCATAGCTACTGCCCATCGCATAGTTTATCAATACATCGCTAACCTTCAGCACATACCTATAGCTTATCGTATACAACTACTCTCCAATTCAAGGTATTCTTAACGTATGGTAAGTTAAATACAACCTTTGGTCAGGTTCCATCTTCACTATGCGTTCACTTCCGACGCATAGTGATACCTTTCAACGCTTAACATGACAACTCTTCCATCTTGCCATGGTAACAGTCTACTTCCATCTCTTTACAATAGCTTCCCATACTAGGTTTACCAACGCATAGCGGTGGTCAAAGCTCATAGCTCTTGTTCCTTCACTTCCACGCACTATTTCTCCACAATGCATAACACATTACTTTCCTTAAATTTCCATTTCTCCGTTGCATGCTAACTTTTCCCACGCATAGTACGTTTCCGCTCTCTCATCATCAACGCATACTACATCACACTATTTACCTTGTCACCATTTAAATCTATTTTCATCACTTGGCAATTTCTCAAACTACGCGTGAAGCATGCGATCAACTCTTCCCTTAAATCATCATTTTCTTTTCTAGCATTTCAATACTAAATCAATTTTCCTTCAACCAGAATATCATGCGTTCATACTATACCATCCGTGGTGATATTCATTCCAAAAATGTCTAACTCATAATCAGTTTTCCAACTATCCTTTCAAATTTTCATTCCTTTATGAAATGTCATGCGTCCATCTAGCCTTCTTTATCGCATACTTTTTGCTCTCAAGCATAACTGTAATACTTGAATTCTCTTATATTTTTAGATCATTAATTTTTCATCTAAGATTCCAATACGCCTATCTTGATTTCTATCAAATTAACTTATGGCTTATGTTTAATCCATAACTTTCACTAACATAAACATTTTCTCCCTTTCCTTTCATCTCAACGCATAGAACTCCTCAATCTGACCAAAATTTACGCAGGCTTATATTCCTTCTATTCTCTGTTCTCTTTCTGGGATTTTTCTTATACCCTCTCAACATCATTTAATTAACCACTGAATTATTCAATAGACTTTTATTTTAGATTCATACCTTCTTGACCATTGGGTCCCATTTCTATAATCTGGTTACTCATGTGCCTTCATCTAGTCCTTGAGCATTTGGGTACCTTTTACAACTCGGGCTGCCCACAATGCCTTCTACAAGTCCTTGAGCATTGGGGTCCCTTTTCAGAAAAACCATTAATACAAGTTTCAATCAATCACCATGCGTTACCCAACATTCTCAAATTTTCAAGTAAAACGCATGTTTCTCAGCTGGCTCATATTCTTTTATATATATATAAACGCATGCTCTTAGTTTTATAAACAACATGTTCATATGCTTCATATATGAAAACACATGCATTGTTTCATTAGAAACCCATAAAATAATATTCATAAAGAACTCATGTATATTTTGTAAAACGCAGGTGTTACTCACCTGTACTTGCCAAAGAATAATTCCTCTTCCTTTCCTCTCTTTCCTCTTCCCGAGTTTCTTGGACGCAGCGTTCTACCTCTTGAATGCAAGGCTCCATTTTCCTCTATCAGCTCGGTTGAACGCAACCCTGCCCTTCAACGCAAGGTTGATATTTCCGCAAAGTGCATCCCGAATCTTCTTACTTCTCTTCCGGATGCGTCGCCCTCTCTTGGACGCAAGGTCCAACTTGAACGCATCCTTTAACTTTGGATGCTGCGTTCACCTTGGACACCATGTTTGTCCTCAACGCAAGGCCGACCCCTTTGAACACAAGACAACTTCTTGACATAAAATAGCTAAGTTGTCAAATACCTTCTTTCTTGGACGCATACTCCTACAACGGATGTCCTTGCTGCCTTCTTCTTCTCTTTTTCTCCAGATTTCTTATCTGAGAGTCATCAGTCGCTCCTTTTCCCTCAAAGTTCTATTTATAAGGAGCTTGCCATCCAACTTTTCCTCTCCTTTTGCCATTTGTTTGCTACATGTCATGCTGCATGCTTAATTAACCATGCATTAATCCAAATGTCACCTTGCATGAAACACTTGACATGCCTTCAACACCTCCTACTGATAAGGTGTCAAATACATACGACGCCACCTGGCCTTTATAAGCTTATCTTATGCGTCCAACATTTCCAATGAACAACTTTTATTCAACACAATGCCCTCATGCGTTCGTTGAGTTTCCTTTAAGATTTGAGCTTCACTACTCACTTTTCTAACTCTCAGCCAACTATACTACTGTCTTTGACGCATGAGATAACTCTGTCTTCAATGCATAGGGACCTCAACGCATGACAACCTTACCAACGCATGGTAAGCCAAGTTCGTTCCTTCCCCCCGCCAACGCGGCGACTGCTTCAGCTCACACCACCATTTGTGATTTCACCATTAATGCATAACCACACTATTATAAACCAGTGCTTACATATGCCACTAGGCGTCAATGACAAACGTGCTTACATCGCATCATAACGAATAAATCCTTCTCCAGTTCTTCCTCTATGTGTTGTCCTATTGCTTTTACCTTCATCCTGCGTCCATGACCATCTTCCTATCCTTGAACATTCCTTGATTATTCCCAAGGTTACTCTTTTAACCTTATCTTTGCACTTAAACTATTTCTCTTAAGATAGATTAGCATGTGTCCACCACATGCTCCAGTTTTATGTTTACTTCACAACCTTCTCCAACACTTGTGAACATTTCTTGTTATAAGTCTATGCGTTACTCAATTTGTTGTAGTATTCCATCTTTCCGAGACTTAAATCTCTTTCAACACTTCACAACATCACTTATTCCTTTCTTGCTTATTCGTTTCACATCTTTCCCTTCTCGGTTTACTTCCATGCGTCCACCCTTACCATGCGTTACTTAGTCCTGACCATGCGTCCAACATGGTCTTTTACCTCATTTCTTCATAAACTCTTCAATGCATCGTTTTCTTAAAAAGTAGGGTGTGACAATACCTCCGGATAGCATCACCATGTGGGAGATCTTAACTCAAGCTTTCTTGAACAAATATTTTCCACCGGCTAAATCTCAAAGGCTAAGTATGGAGATTGGAACATTCCACCAACTTGAGGATGAACAATTATATGAGGCTTGGGAAAGGTACAAGGACCTCTTAAAGAGGTGCCCTCAACATAGCTACCCGAATTGGTTGCAAATTCAACTCTTCTACAATGGATTAGCAAGCTCAACCAAATCCATTCTAAATGCAACCATTGAGGCTCAATTTTTTCAAAGAATGCTCAAGAGGCGTATACCATACTTGAAGACTTGCCCACTACATCATATAATTGGCCATGTGAACGGTCTTCTCCAATTATCCCAAAAGCCACCGAAGATATGAGATGGACGAGGTAAGTTTTCTAAAAGCTCAATTGGCTTCTCTCACTAATGCTTTATCTAAATTGTCTCAAGGAAGCCAAGCCCAAGCAAGTACACCATCCATAGCTTCCCTTGCAGCCATGGCAAATCAACAAGAGTCTAGTGAGTTAGAAGTGACCAATTATGTAGATAGAGGACAATATCGAGGTGAACAACAACTCTCGATTCACTATCATCCCAACTTGAGGAATCACGAGAACTTCTCATATACCAACAACAAGAATGTATTGCAAGCACCTTAAGGATTCAATGGAGCAGGGAATGCAAAGACATCATCACTAGAGGACATAATGCTTGACTTTGTCAAAGAGTCAAGATCAAGGACAACCACATTGGAGAATTCATCCAAGCTATTGCAAGGACCGTTCAAAGCAAAGGTAAGACAATTCAAAACTTGGAGGTTCAACTTAGTCAAATGGCCACTTCCCTCCAAACAATGCAAAAGGGCAAGTTCCCTAGTTGCCCCGAGAGAAACCCAAAGGAGGAATGCAAGGCCCTAATTTTGAGGAGTGGAAAAAATCTATCCACTCCCTTGATCAATGATGAAGACGATGAGCTTCCACAAGGAGAGGATGAAGCCATCATAGAGGATGTTCCACTAAAAGAGAGTGCATCTAAGGAAAAGGAGAAATCAAAGTCAAACATAGGGAGTACTTCTAACTCTACTTCTTTTATCCCTAATACTTTACCTTTTCCTCAAAGATTTCAAAAGAAAAAGATTTATGCTCAATTTTCTAAGTTCTTAGACATATTCAAAAAGTTGGAAATTAACCTTCCTTTTGCAGAGGCATTAGAACAAATGTTTAAGTATGCCAAGTTCATGAAGGATGTGTTGTCTAAGAAGAAAAGGTATGGAGACTATGAAATGGTTGCACTAACCGAAGAATGTAGTGCGACACTAAAAAGGAAATTGCCTCCAAAACTAAAGGTTCCAGGGAGTTTCTCTATCCCTTGTCATTTAGGCAATAATAAAGTTACTAAAGCACTTTGTGATCTTGGAGCTTCTATTAACTTAATGCTTTTATCCATGTTTAGGAAGCGAGGTGCAACAAGTTACCATATCCTTGCTTTTGGCCGATCGATCATTTGTTCATCCCTACAGCATTGTGGAGGACGTCTTGGTCAAGGTAGGCAAGTTTATCTTCCTGGCGGACTTTGTGATATTAGACATGGAGGAAGATGCCAAAGTCCCCATAATCCTTGGGAGACCATTCTTGGCCACCGGAAGGGCCCTCATTGATGTGCAAGATGGAAAGCTCACATTCCCCTTAACGATGAGGAGGTAACATTTGACATCTATAAAGCCCTTAAGTATCCAGATGTGGAAAGGTATGAACATCATGAGCCAAGAACAAAGGTAACATCGCCTCTCTCTTCTAATAAAGACTCTATGCATGATGAGCCTCCTTTAAATGGTAAGACTAATGCCATGCTTGATGTTAAAGTAGAAAATCCTATAAATCATGTTTCTAGTTGTGATAATTCTCCTAGTTCTTCTTGCATGCCTCCCAAACCTATAATGAATTATGAACAAATCATGGCGAAGCATGTTCAAAAGGAAGTGGTGAGTCCGGTAAAATCTAAGAATAATGACTCCTTAATGATTCTTGACTACATTAGAAAAGATGATGTCCAACTTGATAATGATTTGTTGACTAACTCACAAACGGTAATGAATATGTTTGTGAGTGTCAATGTGGGCAAATACTAAAAGTTGTAGGTGATCCAAAGCATGAACGGAAGAAACCACCTTGGGATCCGCCATGACTCGCGAGCCTCGTCTAGCTAGAAGACGTTAAACCAAGCGCTTATTGGGAGGCAACCCAATTTTATTGCTTTCTTTACTTTTCTTTTTATTTTATTTTCAATAAATCTTTGAGTCACTCATATGCTTTATCTTTCAGGATAGTGGTTTATCACACAAAGTACTCCTTACACCAAGCCTATTTGGTGGCTTTCTGCTTGCTCCTGTTGGCTTTTTGGCCCTTAATCTCAAATTAATTAATTTTAATGAATTGATTAATTAATGTTTTGATGATAACAAACCTGATTTTATTTATTAACCATTTGAGTGTTTTGAAGTGTCTATTGTGTGGAGCTTGGAACAGCGGTTCCAATGCAATTTGAATCACTCAAATCGGAGTTTAAATGAAGAAGATATGACCGAAACAAGCTTGATGGAAAAATCCCGAGCTGATGACGTGGCAAATTTTTTTTTAATCAAAGTGGACAAATTGAAAGGTGCCACTTGAAGCAATGGAGAAGTGACAATTGGTGGGTCTTTTATTTTTGGATTGATGTTAAAAAGAAAATGATTTAGAAAAAAAAGGAAAATGAATTTAAAAAGAAAATGAATTTAATATTATTATATATGTTTGTGTCTAACGGCTAGTTGCTTTTTTAAAAGGATTAATTTAATATTATATATGTTTGTGTCTAACGGCTAGTTGCCTTTTTAAAAGGATTAATTTAATATTATTATATGTTTGTGTCTAACGGCTAGTTGAATTTAAATCTCCACCATTCAATTTTCTTTTACAAAGATCTAATGGTCCAAATTAATTTGGTTTTCACTCACTTTCATTTCTTTTTAAATATTTCATCTTCTCCAAAAAGTTTTCCCCAACTTTTACAATTTTACAATCCTCAAAAATTGAAAAAGGTCACCATTGTTGCTCTCTCTCCAAGCTTTCAAATTTCTTTCTTTTCATCTTATTCTAGAGAGATACTTTCATTGTATTGTGAGTTCAAATTTGTATATCTACTCTTTTTTAGAGAGTTTTAGTGTGTGATTTAAAATTTCAAGAATATTGTAACGGTTGAATCCTAACCCGTGGAAAGGATCGGGTTTGCTCTTGAACCCGAAAAAGGAGCAAGGTAGCGGTTCGACTCTCATCCGTGGAAAGAGTCAAAGGAGATTTTTTAATCGAAATCCCAAGAGGCGCTTGGGAAAGTGGATGTAGGTCGAGTTGGACCGAACCACTATAAAAATCACGGTGTCTTCTTCTCCAACTCTTTCTCAATTTATTTTGCATTCAATTTTTAGTAGCATGTTATAGATTACTATCCTTCTTTGTTGAGATTGATTGAATCTTGTGTTTCATATATTTTTTATCAATCTTGATATTTAGCATCAATTAAGGTTTTTATTAATTTAATTCAAAAAATAATTCAATTAGTTTAATTTGAGCATATTTAGGAATAAGTTAAATTAAACCCTATTCACACCCCCTCTAGGGTTGCCATACCGGTCCAACAGCTCCCAGGCTTGTTCTTTCTTTGTTAGATGCTCTTATGCATTGGGGACAATGCATGATCCTAAGTTTGGGGTGGGAGGTGTCACTCGTACTTTTGTGTGTTTTCTTATCCTACTATGCATGTTTGTGAAGTTTTAATGATCATATGCATATTAATTTTGAAAAATTTTGAAAATTTTTGAGCTTTTAAAGTTAATGGGAATGTGTTTTTGGTTGTGTGTCTTTATGTTGATTGCCCATGATGAGATTTGAATTTACATGATAGATGACTTATTGTGGAAATAGAATAGCATTATCTTGAGTTAAATTCATTTAAGTCCCTCTTAAAATCTCCCATATTCTTGAGCATATAGCCACTTTAATTCAGAATCCTTAACTGAAATAAGGATTAAGTTGGGTGTATTTAGTTGTCACTCTGAAAGGCCCCATCGGTTAGAGAGTGGAAATGCATGATAGTTTGTAATCTTAGACTAAGGAAAAAGAAAAGAGTATACATGCCTTTGAGAAAAAGAAAAAGAAGTGTACACAAAAATGTACGAGCTTTATTTGCCTATCTTTTTAAATGTTAAACTTGTAATGAGTATGCATGGATGAGAGTAAAAAGAGCTGCCTTTGTTGTGATCAGTAAGGACGCTTGGGAATTGACTAGGTATGAGTCCCTAAAGTGAATGTGTGAAAGCTAGTGCCCTTAGGTGAAAGAATGCAAGTTTAGGGCCCATGAAAAAAATATATGATTGAAAGGCACCTTAGTAAACATATTACTCACTTGAAATGTATGAATGCTTGAAGATTGGTGCATGAGCTAGGGTAGGGGGAAAACTAAAGAAGCTCCCTTAAGTTCTAATTTGAGTTCTAACATTTCAAATTGAAAGAGGCTTGCACACATAACATGGGAGATAACAAGATTTGGCCTTGAATGATTATAACAATATAGATATACTTGCATATTTCTACTTAAGCTCTAAATGCTATTCTCTGAGACATGAGTCAATTGCACACTTCTTCATCTAAACTTCCTACTTAAGCCCTAAATGCTATTCTTGGGTTTTTCCGCGACGCTAAAACCCGAGAGGGGGTCTGCACAACATAGCTGCAGCGCCGCAACGCTCTCTTTCCATAATTGGCAGCGCCACGACGCCAGTTTCGTTTTTTGTAAAATTTGTATTTTTGGAAAGTTTTCTTGGGCAGCCAACTCTATAAATTTGGGGATTTTGAGCAGCTCTAATTATCAATTAAAGCATGAAATTACAGACCAAAAGAGTCACGAGAAGGAGAGAATTGAGTGAGAGTTCATAAAAGAGTTCAGAGAGTGAGTTAGTTGCAAAGAAGAGGAAAAGAGTTCAAGAGAGTTTAAGAGGAAAAGAGTTCTCCTCTTATCAAAGATTTCATATGAATGATCTACTCAACTGGTTGATAAAATGTTTAAGAGGAGGCTCAGTGCTCTTCTCCTTGAGAAGGCTCGTGAGTTTCCCCGGGACCCAGTACTCAAGAACATGAGAAGGTACATGGGCAACCCGTCATATTCCTATGTGCACATAGAGGTTAATCTGGCAAGTGCATTGAGCTTGCTCTGCTTGAAGCCACGGATGATTTCGACGTTGGGTGTGTTGAGCCTTGCTCCACCCCAACTAAGGATGTTTTCGACGTTGAGAGTGTTGAGTCTTGCTCCGCCTCAACTAAGGGAAATGATTGAGAAACCAAACGAGTTTTTCGTATAAATGTTTCTAAACGATTCTATTGATTTGGAACGATTTGGTATAGTTCAGAAGTGATGTTGAAAATACGTTCCTGTTGTATTGATGTACTTATCTTATTTTATATGTATTAATGTAACATGCCTGTTTTGACTAAAAGATAGTTTTACACAACCAGTCACTCACGACGCTTTAGCTCAAATTTTCCTATGTTCTCAAATTTTCCAGGTAGCAAGCGAGTTCAGGATGCTTTGCCTATGTAAGAACCTCGTCTGGGCCATCTTTGGTAAAGAAATGGTTGGTCAGTGAACCTTACAGAGGAGTCAAGGTTGCTGACTTCACATCTTCTTTCGGGTTATGAAGGATAATTCGGGAATGGGTGTGACAAATCCAAAGCGCAATCTCATTATATTTGCATGCTACCTATGGCGTCTGTAAGCATTCAAAGTTCACAAAAATTTTGTTTGTGTTATTCAGTTGGCCGAATGTTGTTCCAATACATGAGGTATGCTCTGAAGTGTCTTCTCGCATTGTTGTATCCGACTAAGTACATCTTAGGTATATTATAGATAAATTCGAGTTTCCTCGGGACCCAGTACTGAAGGACATGAGAAGGTACTTGGGCAACCCGTTATATTCCTATGTGCACATAGAGATTGATTTGGCTGGTGCGTCGAGTTTGCTCCGCATTGAACCAGGGACGATTTTGACGTTGGGAGTGTTGAGCCTTGCTCCACCCCAACTAAGGTGGTTTTCGACGTTGAGTGCGTAGAAGCGTTGCTCCGCCTCAATTAAGGGAAAAGTTTGAAAAACTAAATTGATATTGATAAACTCCTTGGAACACACTCATGTATGATTTAAAATTTGATTTATGAGATTAATGGATTTTATGGAAAGCATGTTTAATTTGAACAAAATGTTAAACTCTTACTTATTTGTTATACGTAACATGCCTGTTTTCAATGAATGCTCGTTTTACAAAGATTTTTCACTCACTGGGCTTTTAGCTTATGTTTTCAAATGTTTTCCTTTCCTCCCCCCAGGTAGCAAGGACGTTCCTGGTGCTTAGCCTACTTCAGCTCACCCTGCCTAAGTCTCGAGTTCTCAAAATTCAGTCAAGTCTGAGAGTTGTACGTTGTATATGATTTATGTAGATGGCATGTAGACCACGTAGTTTGAGTGGGGATTGCAGTGGAATTTTGTACATTGTGTGATGGATATGGAACTGAAATTATGCTATGTAATAAAAGTGTACTCTATACAGGGAATCATATAATGTTATATGCATGTGTAGTATGACTGAGTTCTATGTATGTATATGAGTAAATGTATGCACAGGAAGGAATGTAAGATGGGACAATATGCGTCACAAGTGTTGGTGATGTTTATTGTCTTCACATCCCCTTCAGGTTAGGTGGGTAATCCGGGAGGGGGTGTGACAGTTTAGCTCCAAAGTTACTCTAATTAAATCAAATAGTGACTATAATAACTTGTATTTCTACAAGTTATCATATAGTATAACATGTGTAAAAGACGTCGTTAAGCATTGAAAGTTGCTGTAATTGGAAAACATGTTATGCGTTGAAGAAATTGTATGCATTGGCGAGGCGACACGTTGGAAAACCGAAGTTCAGTTAAGGATTGAGGGACAATTCTACTCATCCGGGAGAGGGTGTGACAGGTATGATTTCCAACCAATCCTTAGCATGGTCCTGTAAAGAGAAAGGAAAAAGTCTTAATTTAATAGCATCATTAGAAACACCATTCATCTTTACCATTCCGCATATTTCTAAGAAAGACCGTAAGTGCTTATGAGGATCTTCATTAGTTCTTTCTCTAAAAGCTAGCTCTCTAGTCATTTGGATCAATCCCAGTTTCAACTCAAAGTTATTGACATTGATGGGCACATTCATTATGCCAAGGTAATTTGCCGGTAATGTGGGTTGGAAATAGTCTCGAATTGCCTTTGGTATTTCCTCCGCCATCTCCTCGATGTGATTGTTTTGAACCCTTAGATTTCTTCTACACGTCCTTTCAATCTCGGGATCAAGAGGGAGAATATTAGTATTGTCTCGAGGCATAAACCGACAAAAAAAAAAAAGAAAATATAAAATAGAATAAAATAAAATATAAAATATAAAATAAAATAAAATAAAATAAAAATAAATAAATAAATAATAAACCAAATAATATACACCTCAAATTAATATCTATAAATTCTAATAAATATTAAATAACAAAAACCGTTCTTTGGATTCCCGAAACGGCACCAAAAACTTGATTGCACCACTTTGAGTGTTAACCACAAGTGTATGGGTCAAGTTATAATATAAGACGATAAAGTCCAAATATCGTTCCACTGAGGATCCAATTGTGAAGGTTTAAATTACTTAGCTATTATAGAATCGAATCAATTTTATTTGAGGATTGGAATTGAAGTTGGGAATTATGGAAAATATAATCTAAATAAGGAAAGTAATAAAGTGAGAAACACAAGAGTCAAAGAGAGTTTTAGGGTAATGAATCCTTGTTTATTTCATCATGCAATATTATAGAATTCATGGATTTGATTGTAATTCATATGCCTAAGTCTAGAAACTTAATTTCAAGTTATTTTCTCTAAAAAATAACCATTTCTCATGCAATTCAATTGATTGCTAATCTTTTTAAACCAATACAAATTACATGGCATTAAGACTAGTCTTTAACAACTTTTATTGTCAACCTAACTACTCTCGTGATAAATTAACGACACTTCATTGTTATAGATCAAGTTAAACATATAATCTCTCAATCATACATCTAACCTAACATGTGAAGGTGGTCAATCCTCAAGGTTTAAACACAATAACAATGAAAAGCATAATCATTCCAACAAAACCATGAAAGTCTACAATATCCACATGAATTCATAAACAATTCATCAAAACCCTAAAACTAAAGAGTTTAGCCATTCATAGCTTGAAGCAACATGTTCAAACAATATAAACAAAGAAAGAGTGTAAAGAAAGGGAAACTCTACTGAAAAGGGGCTCTTGATTGAGGTAGCACCGTCACAATCTTCTTGGCTCTCACGTCGGAACTTGAACGGCACCACGACTCTCTTGAACTCTTTTCCTCTTCCTTGCAACTAACTCACTCTTTGAACTTTTCTATGAACTCTCACTCAATTCTCTCCTTCTTTGTGACTCTTTTGGTTTGTAATTTCGTGCTTTGATTGATGATTATGGTTGCTAAAAACCCCCAAATTTATAGAGTTGGCCGCCCAAGAAAACTTTCCAAAAATACAAATTTTACAAAAAAGGAAACTGGCGTCGCGGCCATGCGGCTGTGTTGCACAGACCCCCTCTCGGGTTTTAGCGCCGCGGCACTACCCCAATTTTGGCAACTTTGTTGTTTTCTTCTCTTTTTCACCGTATTCTTGCTCGATTTTGATCTTTTGGTATTTTCTTTTAATTTTCCTCATTTTAAATTGAAAACCACTAGCAAACAAGTGAAATATAATATAAACTTCCTTAATTTGAGAGAATAGAAAGCACTTTTCAAGTGCTTATCAGGTAGCGAAAGAGTTCCCGGTGCTTAGCCTACCCAAGCCTTGTCCTGCTCAAGTTTCAAGGTGTCAGCGAATTTCTTCATATTTGTTGTATGTGTAAATATTTTGAGTAATGTGGCATGTAGACATAGCTAGTTAAGTTGTTGGGAAATGAGAGTGCATAATGTGTATGAATTCTCAGTTTTGTTGTAGACAATATTAAGATTTTTGTTGATGTGAATATTTATCGACATGGCATTGTAATTATTAGAATGGTGGATTTGAATGTTTTTTATTTAAGGATTAGGTATTGTGGGCTGTTATGTTAAATGTTGTTCCTAGCAGGCTAGTCAGGTATGGATGGACAGTGAACCTCACAAAAGGGGTGAAGTTCACTGCCCTCCCATCTCTTTCCAAGTTTAAAAGGGTGATCTGGGAGGGGGGCTGACAGACATTACCCAAACCGTGTGATGCGCCAATTTTGGAGGTTGCAACATGAAAAACGCTTGAAAAGTTCTTCTTAATCTCTTAAATAAGAAGTTTCTATTAGATTTCACTTGTTTGTTAGTGGTTTTCAAGTCAATATAAAGAAAACGAGAATCGAAGGCCAAAAGAACCAAACCGAGCTAAAATCAAGTGAAAAAGAGGAAAGAATCATAAAATTCCCAAAAATAGGGCAGCGCCAGGTTGCCAGTTCGGCGCTGTCACAGCGCCTTTTCCCAATAGCATCAGTCAAAGAAAGTGTAGCAGCACCAATGGGGCATTGCAGCACAGTGCTGAGGGCAGGTTTTAATGACCCATAGAGCGGCGACGCCACCCTTTTATGTTGGTCAGCGCAACGACGCCATTTTCCAACGTCACGGCACCAACTTTCCTTATTTGAAAAATCAACAATTTTGGAAAGAAATTTGTGCGGCCAAGTCTACAAATTGGAGGGTTTCCAGCAACCTTAAACACAAAAGAAAGAAGAGCAATCAGAGAAAATACAATAGAGAGTCCCAAAGAGTAAGTTGGCTGCAAGGAAGAGTGAAGAGAGTTTGAGAGAGAGTTTGGTGGCGTTCGAGTTCCCGACAAGAAGACTGAGGACATCTTAGTGATTTCTCCTCAATCAAGGGCTTCTTCACAATAGAGTTTTTCTTTCTTTACACTCTTTCTATGCTAAAATTGGTTTTGAACATGTTGCCTAAAGCTATGAGTGGCTAAACTCTTTAGTTTTAGGGTATTGATGAATTGTTGATGAAGTTATGTTGATATTATAGACTCTTCATGGTTTTGTTGTAATGAATACGCTTTTCATTGTTATTGTGCTTAATTCTTGTTGATTCGCCATCTTTGAATTCTAGATTAGATGTATGGTTGAAATAATTGTATGTCTAATTGGATCTATAACAATGAATTGTTGTTAATCTATCATGCGAATATTTAGGTTAGCAATGAAAGTTGCTAAAGACTACTCCTAATGCCATGTAACTTGTATTGGTTTCAAGATATTAGTAATCAATTAGGTTGCATGAAAAAAATGTTGTTTTTGATAGAAAACAACTTGGACTATGTTCCTAGACTTAAGCATATGATTGTAATAAAATTCATGAACTCTAGAATGTTGCACAATGAATTAAACAAGGATTTAATATCCTAGAACTCTCTTTGACTCTTGTGTTTCTCATTTACTTGCTTTCCTTTTTAAGTTTTTTTCTAAATCACAACTTGAATTCGAATCCTCAAATAAAATTGATTCAACTCTAAAATAGCTAAGTAACTTAAACCTTCACAATTGGATCCTTAGTGGAATGATATTCGGACTTAACCGTCTTATATTGTAACTTGACCCGTATGCTTACGGTAACACGGAAAGTGCACATCACAACACATTCATGAACTTTGGGATACAGACATTCGACTATATTTCACACAATTTACAAGGTAGACATGAGCGAGATTGACTACATATTTGGCTCCTCAAATATCAATTTGAGACAATCGACAAAATTTGATTGTTAATGGACCTCTCATTAACAATGGTATAGATATTAAGCTTGGGTACATTGCATTATATTCTAATATTAGCAGACAATAAATAATGAAAGCAAAGCATCATATCACTTTTTTATATGAGTTTATTATATAATAGGATGATTTGCTTTTATTGTTTAATTACATGATTAATTGTTCTAATATATTTCAATCCTTTTTCGATATAAATTTGCAAATATGCCAATATCTAAAAGAATGGCTACAATGGCTAGAATGACTACAACAACCCTGCTTACACTCACATCTTTATATAAAGTGTGATATTTCGATGATGATGTTCCCAATGAAGAACAAGGTGATCCCGCTAGACATAGGGATGTTTCAATTCAACCTGATGTTTAGAATGAATATATGACAACCCTAGTCAATCATTTTACCTCTTGTTCCATGATGCCAACACCGTCAACATTTGAACTTTTTCCAACCACCTGCAGTATGCTTAA

mRNA sequence

ATGAAAAGGGAATATATTGTGGGAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGTATGCTTAA

Coding sequence (CDS)

ATGAAAAGGGAATATATTGTGGGAGACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAACGACGGAGGCTTCCCTATTATCAACAACGTATTGAGTGACCCGAGTACAAAACATCCAAAAAGGCATATCCGTTTCTCAACAGGGAAACCCAAGAGGGTACCTGACTTCGATCCACCTTCCAGCGTGGGGATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGCCCAAGAACTTCGGACAATCCACCACCTCCTCCACCGCACGTCTCGTCCACCATTTTGCATAAGCAATCTAGCATCAACTTCGGAATGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGTCAAAGAACTTCAGACAGCCCACCTCCCCCACCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTCGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCGCCGCCTCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGTATGCTTAA

Protein sequence

MKREYIVGDDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILKKESKYA
Homology
BLAST of CmUC01G009150 vs. NCBI nr
Match: KAG6603169.1 (hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 331.6 bits (849), Expect = 1.0e-86
Identity = 216/433 (49.88%), Postives = 240/433 (55.43%), Query Frame = 0

Query: 20  LSDPSTKHPKRHIRFSTGKPKRVPDF----DPPSSVGILSKEERTTPSGPSPRTSDNPPP 79
           L+D   KHPK  I  S+ +  +   F     PP  + +L K    +PSGPS RTSD  PP
Sbjct: 47  LTDQMVKHPK-GISISSSRSSQRTLFHLTPPPPVVLRMLPKGVPISPSGPSQRTSDY-PP 106

Query: 80  PPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFG 139
           PPP  SS IL KQS INFGMLPKG  IPPSGPSQRTSD PPPPP A SVIL+K+S INFG
Sbjct: 107 PPPRASSVILSKQSKINFGMLPKGVPIPPSGPSQRTSDYPPPPPRASSVILNKQSKINFG 166

Query: 140 ILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILS 199
           +LPKGV IP                                                   
Sbjct: 167 MLPKGVPIP--------------------------------------------------- 226

Query: 200 KEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSP 259
                 PSGPS RTS+ PPPPP H SS IL+ QS INFGMLPKG  IPPSGPSQRTSD P
Sbjct: 227 ------PSGPSQRTSNYPPPPPLHASSVILNTQSKINFGMLPKGVPIPPSGPSQRTSDYP 286

Query: 260 PPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFS 319
           PPPPHA S IL+ +S INFG+LPKGV IP                               
Sbjct: 287 PPPPHASSFILNTQSKINFGMLPKGVPIP------------------------------- 346

Query: 320 TGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGM 379
                                     PSGPS RTSD  PPPPPH SS IL+ QS INFGM
Sbjct: 347 --------------------------PSGPSQRTSDY-PPPPPHASSVILNTQSKINFGM 356

Query: 380 LPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDY 439
           LPKG  IPPSGPSQRTS+ PPPPPH    +L  +  INFG+LPKGV IPPSGPS R+SD+
Sbjct: 407 LPKGVPIPPSGPSQRTSNYPPPPPH----VLRPK--INFGMLPKGVPIPPSGPSRRTSDH 356

Query: 440 PPPPPHAPFVILK 449
           PPP PH PF+ L+
Sbjct: 467 PPPAPHTPFITLR 356

BLAST of CmUC01G009150 vs. NCBI nr
Match: XP_031744042.1 (proline-rich receptor-like protein kinase PERK9 [Cucumis sativus])

HSP 1 Score: 291.6 bits (745), Expect = 1.2e-74
Identity = 196/414 (47.34%), Postives = 226/414 (54.59%), Query Frame = 0

Query: 27  HPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILH 86
           HPK+HIRFS  K KR+PDFD  S  GILSK  R  PSGPS R+SD+ PPP     S +LH
Sbjct: 43  HPKQHIRFSRTKSKRIPDFDTHSGFGILSKAIRIPPSGPSQRSSDSTPPP-----SIVLH 102

Query: 87  KQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPS 146
           K+S +NFG+LPKG     SGPSQR SDSPP PP  PS++LHKES I FGILPKGV     
Sbjct: 103 KESMMNFGILPKGVPTHSSGPSQRFSDSPPSPP--PSIVLHKESRIKFGILPKGV----- 162

Query: 147 GPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPS 206
                                                                T  SGPS
Sbjct: 163 ----------------------------------------------------PTHSSGPS 222

Query: 207 PRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVIL 266
            R SD+PPPPPP   S +LHK+S INFG+L KG R   SGPS+R SDSPPPPP  PS++L
Sbjct: 223 RRFSDSPPPPPP---SIVLHKESRINFGILSKGVRTHSSGPSRRFSDSPPPPP--PSIVL 282

Query: 267 HKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFD 326
           HKES +NFGILPKGV                                             
Sbjct: 283 HKESRMNFGILPKGV--------------------------------------------- 325

Query: 327 PPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSG 386
                        T  SGPS R SD+PP PPP   S +LHK+S I+FG+L +G  I  SG
Sbjct: 343 ------------PTHSSGPSRRFSDSPPLPPP---SIVLHKKSGISFGILSEGVHILSSG 325

Query: 387 PSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPP 441
           PS+R SDSPPPPP  PS++LHK+SGI+FGILPKGVRIPPSGPS R +D PP  P
Sbjct: 403 PSERFSDSPPPPP--PSIVLHKKSGISFGILPKGVRIPPSGPSPRFADSPPSNP 325

BLAST of CmUC01G009150 vs. NCBI nr
Match: XP_022967687.1 (actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima])

HSP 1 Score: 280.8 bits (717), Expect = 2.1e-71
Identity = 194/402 (48.26%), Postives = 211/402 (52.49%), Query Frame = 0

Query: 47  PPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSG 106
           P    G+L K     PS PS RTSD  PPPPP  SS IL+K S IN GMLP+G  IPPSG
Sbjct: 237 PSLVFGMLPKGVPIPPSRPSQRTSDY-PPPPPRASSIILNKHSKINLGMLPRGVPIPPSG 296

Query: 107 PSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTK 166
           PSQRTSD PPPPPHA SVIL+K+S INFG+LPKGV IP                      
Sbjct: 297 PSQRTSDYPPPPPHASSVILNKQSKINFGMLPKGVPIP---------------------- 356

Query: 167 HPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILH 226
                                              PSGPS RTS   PPPPP  SS IL+
Sbjct: 357 -----------------------------------PSGPSQRTSXY-PPPPPRASSVILN 416

Query: 227 KQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPS 286
           KQS I  GMLP+G  IPP G SQRTSD P PPPHA SVIL+K+S INFG+LPKGV IP  
Sbjct: 417 KQSKIYLGMLPRGVPIPPPGLSQRTSDYPHPPPHASSVILNKQSKINFGMLPKGVPIP-- 476

Query: 287 GPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPS 346
                                                                  PSGPS
Sbjct: 477 -------------------------------------------------------PSGPS 509

Query: 347 PRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVIL 406
            RTSD  PPPPPHV          INFGMLPK   IPPSGPSQRTSD PPPPPH   V+ 
Sbjct: 537 HRTSDY-PPPPPHV------LWPKINFGMLPKDVPIPPSGPSQRTSDYPPPPPH---VLW 509

Query: 407 HKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILK 449
            K   INFG+LPKGV IPP GPS R+SDYPPP P+ P + L+
Sbjct: 597 PK---INFGMLPKGVPIPPHGPSRRTSDYPPPAPNTPSITLR 509

BLAST of CmUC01G009150 vs. NCBI nr
Match: XP_031744003.1 (abl interactor homolog [Cucumis sativus])

HSP 1 Score: 238.8 bits (608), Expect = 9.1e-59
Identity = 167/381 (43.83%), Postives = 196/381 (51.44%), Query Frame = 0

Query: 27  HPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILH 86
           HPK+HIRFS  K KR+PDFD  S  GILSK+ R  P GPS R+SD+ PPP     S +LH
Sbjct: 43  HPKQHIRFSRTKSKRIPDFDTHSGFGILSKDIRIPPFGPSQRSSDSTPPP-----SIVLH 102

Query: 87  KQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPS 146
           K+S +NFG+L KG R   SG SQR SDSPP PP  PS++LHKE  I FGILPKGV     
Sbjct: 103 KESRMNFGILLKGVRTHSSGSSQRFSDSPPSPP--PSIVLHKEPRIKFGILPKGV----- 162

Query: 147 GPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPS 206
                                                                T  SGPS
Sbjct: 163 ----------------------------------------------------PTHSSGPS 222

Query: 207 PRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVIL 266
            R SD+PPPPPP   S +LHK+S +NFG+LPKG     SGPS+R SDSPP PP  PS++L
Sbjct: 223 RRFSDSPPPPPP---SIVLHKESRMNFGILPKGVSTHSSGPSRRFSDSPPLPP--PSIVL 282

Query: 267 HKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFD 326
           HK+SGI+FGIL KGV I                                           
Sbjct: 283 HKKSGISFGILSKGVHI------------------------------------------- 291

Query: 327 PPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSG 386
                           SGPS R SD+PPPPPP   S +LHK+S I+FG+LPKG RIPPSG
Sbjct: 343 --------------LSSGPSERFSDSPPPPPP---SIVLHKKSGISFGILPKGVRIPPSG 291

Query: 387 PSQRTSDSPPPPPHAPSVILH 408
           PS R +DS   PP  PS++LH
Sbjct: 403 PSPRFADS---PPSNPSIVLH 291

BLAST of CmUC01G009150 vs. NCBI nr
Match: XP_038882352.1 (uncharacterized protein LOC120073615 [Benincasa hispida])

HSP 1 Score: 211.8 bits (538), Expect = 1.2e-50
Identity = 111/155 (71.61%), Postives = 121/155 (78.06%), Query Frame = 0

Query: 292 GFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSD 351
           G   I N LSD + KHPKRHI FS GK KR+PD  PP ++GILSKEER  PSG S  TSD
Sbjct: 39  GDSTIENELSDENIKHPKRHIHFSLGKYKRIPDLAPPFNLGILSKEERVPPSGLSQSTSD 98

Query: 352 NPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESG 411
           N PPPPPHV S ILHK+S INF +L KG RIPPSGPSQRTS+SPPPPPHA SVILHK+ G
Sbjct: 99  N-PPPPPHVISIILHKESRINFRVLSKGNRIPPSGPSQRTSESPPPPPHALSVILHKKPG 158

Query: 412 INFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVI 447
           INFGILPK + IPPSGPS R S+YP PP HAP VI
Sbjct: 159 INFGILPKSMHIPPSGPSKRFSNYPSPPTHAPSVI 192

BLAST of CmUC01G009150 vs. ExPASy Swiss-Prot
Match: Q9FLQ7 (Formin-like protein 20 OS=Arabidopsis thaliana OX=3702 GN=FH20 PE=2 SV=3)

HSP 1 Score: 62.8 bits (151), Expect = 1.2e-08
Identity = 117/443 (26.41%), Postives = 155/443 (34.99%), Query Frame = 0

Query: 30   RHIRFSTGKP---KRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSS---T 89
            +++R S   P    R P    P S        + TPS   P +    PPP P ++S   T
Sbjct: 617  KYLRASVSSPDMRSRAPICSSPDS------SPKETPSSLPPASPHQAPPPLPSLTSEAKT 676

Query: 90   ILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRI 149
            +LH   ++     P      P+    +TS  PPPPP  P      E   +  +LP     
Sbjct: 677  VLHSSQAVASPPPPPPPPPLPTYSHYQTSQLPPPPPPPPP--FSSERPNSGTVLPPPP-- 736

Query: 150  PPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERT--- 209
            PP  P     P    VL  P    P   + FS+ +P       PP S    S        
Sbjct: 737  PPPPPFSSERPNSGTVLPPP----PPPPLPFSSERPNSGTVLPPPPSPPWKSVYASALAI 796

Query: 210  ---TPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGP-------SQR 269
                 +  +P +S  PPPPPP   S +  K S +    LP     PP  P       S+ 
Sbjct: 797  PAICSTSQAPTSSPTPPPPPPAYYS-VGQKSSDLQTSQLPSPPPPPPPPPFASVRRNSET 856

Query: 270  TSDSPPPPPHAPSVILHKESGINFGILPKGVRIPP----SGPNDGGFPIINNVLSDPSTK 329
                PPPPP  P   + + S     +LP     PP               +   S P   
Sbjct: 857  LLPPPPPPPPPPFASVRRNSET---LLPPPPPPPPWKSLYASTFETHEACSTSSSPPPPP 916

Query: 330  HPKRHIRFSTGK---------PKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPP 389
             P      +T K         P       P  SV IL     ++   P  +T+  PPPPP
Sbjct: 917  PPPPFSPLNTTKANDYILPPPPLPYTSIAPSPSVKILPLHGISSAPSPPVKTAPPPPPPP 976

Query: 390  PHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGIL 441
            P  ++  +      ++G  P     PP  P       PPPPP  PS          +G  
Sbjct: 977  PFSNAHSVLSPPPPSYGSPP-----PPPPPPPSYGSPPPPPPPPPS----------YGSP 1022

BLAST of CmUC01G009150 vs. ExPASy Swiss-Prot
Match: Q84ZL0 (Formin-like protein 5 OS=Oryza sativa subsp. japonica OX=39947 GN=FH5 PE=2 SV=2)

HSP 1 Score: 50.1 bits (118), Expect = 7.9e-05
Identity = 118/443 (26.64%), Postives = 144/443 (32.51%), Query Frame = 0

Query: 21   SDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTT----PSGPSPRTSDNPPPP 80
            SD S + PK        KPK V  +  P      +KE  TT    PS P  R   +P   
Sbjct: 744  SDQSQEQPK------AVKPKTVRRWISP------NKESETTSVHRPSHPPSRYDSSPAAL 803

Query: 81   PPHVSSTILHKQSSINFG----MLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGI 140
              H     +H  +  N G    ++  G +  P   +      PPPPP+A S  L    G 
Sbjct: 804  AIH----SMHTNNKFNVGKDAPLVSSGAQAVPKIQAAPPPPPPPPPPYASSSSLSMHMGS 863

Query: 141  NFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVG 200
                 P     PP  P     P  + + S P    P   + F       VP   PP    
Sbjct: 864  ATKQQPPPPPPPPPLPPPPPPPASSGLSSIPPPPPPPPLMSFGAQTRTFVPPPPPPPPPP 923

Query: 201  ILSKEERTTPSGPS-------PRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPS 260
                   T P+ P        P  S  PPPPPP +  +     +       P     PPS
Sbjct: 924  RSGVGGNTPPAPPPPPLRSTVPAISPPPPPPPPPLKPS---SGAPCPPPPPPPPPPPPPS 983

Query: 261  GPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPP----SGPNDGGFPIINNVLS 320
             PS R   S PPPP  P ++           +P     PP    + P     P       
Sbjct: 984  APSSRAFSSAPPPPPPPPLLRS---------VPPPPPPPPISHSNAPPPPPLPAARFNAP 1043

Query: 321  DPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVS 380
             P    P  H  F+   P   P      +           P  P P     PPPPPP   
Sbjct: 1044 PPPPPPPTTH--FNAPPPPPPPPITRSGAPPSPPPPPSPPPPPPPPGARPGPPPPPPPPG 1103

Query: 381  STILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKG- 440
            +           G  P    +PP  P  R S  PPPPP  PS  L    G      P G 
Sbjct: 1104 ARPGPPPPPPPPGGRPSAPPLPP--PGGRASAPPPPPP--PSTRL----GAPPPPPPPGA 1148

BLAST of CmUC01G009150 vs. ExPASy TrEMBL
Match: A0A6J1HRH7 (actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3661 GN=LOC111467141 PE=4 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 1.0e-71
Identity = 194/402 (48.26%), Postives = 211/402 (52.49%), Query Frame = 0

Query: 47  PPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSG 106
           P    G+L K     PS PS RTSD  PPPPP  SS IL+K S IN GMLP+G  IPPSG
Sbjct: 237 PSLVFGMLPKGVPIPPSRPSQRTSDY-PPPPPRASSIILNKHSKINLGMLPRGVPIPPSG 296

Query: 107 PSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTK 166
           PSQRTSD PPPPPHA SVIL+K+S INFG+LPKGV IP                      
Sbjct: 297 PSQRTSDYPPPPPHASSVILNKQSKINFGMLPKGVPIP---------------------- 356

Query: 167 HPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILH 226
                                              PSGPS RTS   PPPPP  SS IL+
Sbjct: 357 -----------------------------------PSGPSQRTSXY-PPPPPRASSVILN 416

Query: 227 KQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPS 286
           KQS I  GMLP+G  IPP G SQRTSD P PPPHA SVIL+K+S INFG+LPKGV IP  
Sbjct: 417 KQSKIYLGMLPRGVPIPPPGLSQRTSDYPHPPPHASSVILNKQSKINFGMLPKGVPIP-- 476

Query: 287 GPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPS 346
                                                                  PSGPS
Sbjct: 477 -------------------------------------------------------PSGPS 509

Query: 347 PRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVIL 406
            RTSD  PPPPPHV          INFGMLPK   IPPSGPSQRTSD PPPPPH   V+ 
Sbjct: 537 HRTSDY-PPPPPHV------LWPKINFGMLPKDVPIPPSGPSQRTSDYPPPPPH---VLW 509

Query: 407 HKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILK 449
            K   INFG+LPKGV IPP GPS R+SDYPPP P+ P + L+
Sbjct: 597 PK---INFGMLPKGVPIPPHGPSRRTSDYPPPAPNTPSITLR 509

BLAST of CmUC01G009150 vs. ExPASy TrEMBL
Match: A0A6P5RDM4 (formin-like protein 5 OS=Prunus avium OX=42229 GN=LOC110745147 PE=4 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 1.1e-49
Identity = 176/423 (41.61%), Postives = 184/423 (43.50%), Query Frame = 0

Query: 30  RHIRFSTGKPKRVPDFDPPS-------SVGILSKEERTTPSGPSPRTSDNPPPPPPHVSS 89
           R + F      RVP   PP+       + G L K     PS PS RTS  PPPPPPH S 
Sbjct: 35  RSLNFWKAARARVPPTGPPNTGTSRKFNFGTLPKGTLIPPSWPSKRTS-RPPPPPPHPS- 94

Query: 90  TILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVR 149
                    NFG LPK   IPPSGPS RTSD PPPP              NFG LPK   
Sbjct: 95  ---------NFGTLPKYTPIPPSGPSGRTSDPPPPP-------------FNFGTLPKYTP 154

Query: 150 IPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTP 209
           IPPSGP        +   SDP                       PP + G L K     P
Sbjct: 155 IPPSGP--------SGRTSDPP---------------------PPPFNFGTLPKYTPIPP 214

Query: 210 SGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAP 269
           SGPS RTSD PPPP               NFG LPK   IPPSGPS RTSD PPPP    
Sbjct: 215 SGPSGRTSDPPPPP--------------FNFGTLPKYTPIPPSGPSGRTSDPPPPP---- 274

Query: 270 SVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRV 329
                     NFG LPK   IPPSGP        +   SDP                   
Sbjct: 275 ---------FNFGTLPKYTPIPPSGP--------SGRTSDPP------------------ 321

Query: 330 PDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERI 389
               PP + G L K     PSGPS RTSD PPPP               NFG LPK   I
Sbjct: 335 ---PPPFNFGTLPKYTPIPPSGPSGRTSDPPPPP--------------FNFGTLPKYTPI 321

Query: 390 PPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPP-PPH 445
           PPSGPS RTSD PPPP              NFG LPK   IPPSGPS R+S  PPP PP 
Sbjct: 395 PPSGPSGRTSDPPPPP-------------FNFGTLPKYTPIPPSGPSRRTSSPPPPSPPT 321

BLAST of CmUC01G009150 vs. ExPASy TrEMBL
Match: A0A6P5RC98 (formin-like protein 5 OS=Prunus avium OX=42229 GN=LOC110744709 PE=4 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 2.3e-47
Identity = 170/403 (42.18%), Postives = 176/403 (43.67%), Query Frame = 0

Query: 43  PDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERI 102
           PD     + G L K     PS PS RTS  PPPPPPH S          NFG LPK   I
Sbjct: 14  PDSTRKFNFGTLPKGTLIPPSWPSKRTS-RPPPPPPHPS----------NFGTLPKYTPI 73

Query: 103 PPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSD 162
           PPSGPS RTSD PPPP              NFG LPK   IPPSGP        +   SD
Sbjct: 74  PPSGPSGRTSDPPPPP-------------FNFGTLPKYTPIPPSGP--------SGRTSD 133

Query: 163 PSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSS 222
           P                       PP + G L K     PSGPS RTSD PPPP      
Sbjct: 134 PP---------------------PPPFNFGTLPKYTPIPPSGPSGRTSDPPPPP------ 193

Query: 223 TILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVR 282
                    NFG LPK   IPPSGPS RTSD PPPP              NFG LPK   
Sbjct: 194 --------FNFGTLPKYTPIPPSGPSGRTSDPPPPP-------------FNFGTLPKYTP 253

Query: 283 IPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTP 342
           IPPSGP        +   SDP                       PP + G L K     P
Sbjct: 254 IPPSGP--------SGRTSDPP---------------------PPPFNFGTLPKYTPIPP 280

Query: 343 SGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAP 402
           SGPS RTSD PPPP               NFG LPK   IPPSGPS RTSD PPPP    
Sbjct: 314 SGPSGRTSDPPPPP--------------FNFGTLPKYTPIPPSGPSGRTSDPPPPP---- 280

Query: 403 SVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPP-PPHAPF 445
                     NFG LPK   IP SGPS R+S  PPP PP  PF
Sbjct: 374 ---------FNFGTLPKYTPIPLSGPSRRTSRPPPPSPPTHPF 280

BLAST of CmUC01G009150 vs. ExPASy TrEMBL
Match: A0A2P5WMU6 (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA28257 PE=4 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 2.1e-45
Identity = 177/429 (41.26%), Postives = 202/429 (47.09%), Query Frame = 0

Query: 62  PSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHA 121
           PSGPS  TSD PPP     SS++L    S NF +LP G  IPPSGPS  TSD PPP   +
Sbjct: 77  PSGPSRSTSDPPPPSSILTSSSLL---KSSNFKILPIGVPIPPSGPSGSTSDPPPPSSIS 136

Query: 122 PSVILHKESGINFGILPKGVRIPPSGPND--GGFPIINNVLSDPSTKHPKRHIRFSTG-- 181
               L K S  NF ILP GV IPPSGPN+     P  +++ +  S           TG  
Sbjct: 137 KFSSLLKSS--NFKILPTGVHIPPSGPNESTSDLPPPSSITTSSSLVKSLNFKILPTGVA 196

Query: 182 ----KPK-RVPDFDPPSSVGILSKEERTT------------PSGPSPRTSDNPPPPPPHV 241
               +P     D  PP S+   S   +++            PSGPS  TSD PPPP    
Sbjct: 197 IPLSRPSGSTSDPPPPPSIATSSSLLKSSNFRIIPTGVPIPPSGPSGSTSDPPPPPSVST 256

Query: 242 SSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKG 301
           SS++L    S NF +LP G  IPP GPS  T D PPPP  + S  L K S  NF ILP G
Sbjct: 257 SSSLL---KSSNFRILPTGVPIPPPGPSGSTLDPPPPPSVSTSSSLLKSS--NFRILPTG 316

Query: 302 VRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERT 361
           V IPPSGP+           SDP               P  +      S+  IL      
Sbjct: 317 VPIPPSGPSGS--------TSDPPP------------PPLILTSLLKSSNFKILPTGVPI 376

Query: 362 TPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPP--- 421
            PS PS  TS    PPPP + ST L    S+NFGMLPK    PPSGPS  T D PPP   
Sbjct: 377 PPSRPSESTS---YPPPPLLISTSLPISKSLNFGMLPKS---PPSGPSGHTLDPPPPPTK 436

Query: 422 ----------------PPHAPSVI----------LHKESGINFGILPKGVRIPPSGPSTR 441
                           PP APS            L     I F +LPKGV IPPSGPS R
Sbjct: 437 MFRAFDQCNPSKGVPIPPSAPSTYPPFPPMILTSLSLLKSIKFEMLPKGVPIPPSGPSRR 469

BLAST of CmUC01G009150 vs. ExPASy TrEMBL
Match: A0A6P4LZB0 (formin-like protein 20 OS=Gossypium arboreum OX=29729 GN=LOC108455019 PE=4 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 5.8e-43
Identity = 165/378 (43.65%), Postives = 189/378 (50.00%), Query Frame = 0

Query: 62  PSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHA 121
           PSGPS  T D PPP     SS++L    S NF +LP G  IPPSGPS  TSD PPP   +
Sbjct: 76  PSGPSGSTLDPPPPSSILTSSSLL---KSSNFKILPIGVPIPPSGPSGSTSDPPPPSSIS 135

Query: 122 PSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKR 181
               L K S  NF ILP GV IPPSGPN+             S   P   I  S+   K 
Sbjct: 136 KFSSLLKSS--NFKILPTGVHIPPSGPNE-----------STSDLPPPSSITTSSSLVKS 195

Query: 182 VPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGER 241
           +     P+ V I         SGPS  TSD PPPP    SS++L    S NF ++P G  
Sbjct: 196 LNFKILPTGVAI-------PLSGPSGSTSDPPPPPSISTSSSLL---KSSNFRIIPTGVV 255

Query: 242 IPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLS 301
           IPPSGPS  TSD PPPP  + S  L K S  NF ILP GV IPP GP+            
Sbjct: 256 IPPSGPSGSTSDPPPPPSVSTSSSLLKSS--NFRILPTGVPIPPPGPSG----------- 315

Query: 302 DPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVS 361
             ST  P      ST            S+  ILS      PSGPS  TSD  PPPPP + 
Sbjct: 316 --STSDPPHPPSVSTSS-----SLLKSSNFRILSTGVPIPPSGPSGSTSD--PPPPPLIL 375

Query: 362 STILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGV 421
           +++L    S NF +L  G  IPPS PS+ T  S PPPP   S  L     +NFG+LPK  
Sbjct: 376 TSLL---KSSNFKILSTGVPIPPSRPSEST--SYPPPPLLISTSLSISKSLNFGMLPKS- 397

Query: 422 RIPPSGPSTRSSDYPPPP 440
             PPSGPS  + D PPPP
Sbjct: 436 --PPSGPSGHTLDPPPPP 397

BLAST of CmUC01G009150 vs. TAIR 10
Match: AT5G07740.1 (actin binding )

HSP 1 Score: 62.8 bits (151), Expect = 8.3e-10
Identity = 117/443 (26.41%), Postives = 155/443 (34.99%), Query Frame = 0

Query: 30   RHIRFSTGKP---KRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSS---T 89
            +++R S   P    R P    P S        + TPS   P +    PPP P ++S   T
Sbjct: 617  KYLRASVSSPDMRSRAPICSSPDS------SPKETPSSLPPASPHQAPPPLPSLTSEAKT 676

Query: 90   ILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKGVRI 149
            +LH   ++     P      P+    +TS  PPPPP  P      E   +  +LP     
Sbjct: 677  VLHSSQAVASPPPPPPPPPLPTYSHYQTSQLPPPPPPPPP--FSSERPNSGTVLPPPP-- 736

Query: 150  PPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERT--- 209
            PP  P     P    VL  P    P   + FS+ +P       PP S    S        
Sbjct: 737  PPPPPFSSERPNSGTVLPPP----PPPPLPFSSERPNSGTVLPPPPSPPWKSVYASALAI 796

Query: 210  ---TPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGP-------SQR 269
                 +  +P +S  PPPPPP   S +  K S +    LP     PP  P       S+ 
Sbjct: 797  PAICSTSQAPTSSPTPPPPPPAYYS-VGQKSSDLQTSQLPSPPPPPPPPPFASVRRNSET 856

Query: 270  TSDSPPPPPHAPSVILHKESGINFGILPKGVRIPP----SGPNDGGFPIINNVLSDPSTK 329
                PPPPP  P   + + S     +LP     PP               +   S P   
Sbjct: 857  LLPPPPPPPPPPFASVRRNSET---LLPPPPPPPPWKSLYASTFETHEACSTSSSPPPPP 916

Query: 330  HPKRHIRFSTGK---------PKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPP 389
             P      +T K         P       P  SV IL     ++   P  +T+  PPPPP
Sbjct: 917  PPPPFSPLNTTKANDYILPPPPLPYTSIAPSPSVKILPLHGISSAPSPPVKTAPPPPPPP 976

Query: 390  PHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGIL 441
            P  ++  +      ++G  P     PP  P       PPPPP  PS          +G  
Sbjct: 977  PFSNAHSVLSPPPPSYGSPP-----PPPPPPPSYGSPPPPPPPPPS----------YGSP 1022

BLAST of CmUC01G009150 vs. TAIR 10
Match: AT4G22505.1 (Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein )

HSP 1 Score: 62.4 bits (150), Expect = 1.1e-09
Identity = 125/427 (29.27%), Postives = 153/427 (35.83%), Query Frame = 0

Query: 23  PSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSS 82
           P T  P R  R  +  P R P   PP            TP  P PRT   PPPPPP    
Sbjct: 32  PRTPPPPRTPRTPSPPPPRTPKTPPPP-----PPRTPRTPPPPPPRTPRTPPPPPPRTPR 91

Query: 83  TILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILP--KG 142
           T      +      P   RIPP  P +     P  PP  P V   K    +    P    
Sbjct: 92  T----PPTAPPRTPPVSPRIPPILPPK---TPPTAPPQTPPVSPPKSPPNSPPRAPPLSP 151

Query: 143 VRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPKRVPDFDPPSSVGILSKEERT 202
            R PP+ P     P +   LS P T  P    R     P R P   PP +  +     RT
Sbjct: 152 PRTPPTSP-----PRV-PPLSPPRTP-PTSPPRAPPIPPPRTPSTSPPRAPPL--SPPRT 211

Query: 203 TPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPH 262
            P+ P PR    PP PPP+   T   +   ++        R PP+ P +    SPP  P 
Sbjct: 212 PPTSP-PRA---PPVPPPNTPPTSPPRAPPLS------PPRTPPNSPPRTPPTSPPRAPP 271

Query: 263 APSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFSTGKPK 322
            P   +   +      L    R PP+ P           LS P T  P    R     P 
Sbjct: 272 VPPPRISPTAPPRAPPL-SPPRTPPTSPPR------TPPLSPPITP-PTSPPRAPPLSPP 331

Query: 323 RVPDFDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPPHVSSTILHKQSSINFGMLPKGE 382
           R P   PP +  I     RT PS P PR    PPP  P  S  +           +P   
Sbjct: 332 RTPPTSPPRAPPI--SPPRTPPSSP-PRAPPMPPPRTPPTSPPLSPLSPPPRSPPMPP-T 391

Query: 383 RIPPSGPSQRTSDSPP-PPPHAPSVILHKESGINFGILPKG---VRIPPSGPSTRSSDYP 442
           R PP  P    S +PP  PP AP     +   ++  I+P      R P S P T  +  P
Sbjct: 392 RTPPVSPPTSPSRTPPVTPPRAPPTAPPQTPPVSPPIVPPNSPPKRPPMSPPITPPTSPP 415

Query: 443 PPPPHAP 444
             PP +P
Sbjct: 452 RAPPQSP 415

BLAST of CmUC01G009150 vs. TAIR 10
Match: AT1G68690.1 (Protein kinase superfamily protein )

HSP 1 Score: 44.7 bits (104), Expect = 2.4e-04
Identity = 73/260 (28.08%), Postives = 94/260 (36.15%), Query Frame = 0

Query: 200 TTPSGPSPRTSDNPP---PPPPHVSSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPP 259
           TTP    P  S++PP   PPPP  ++T             P    +PPS P    +  PP
Sbjct: 3   TTP--VQPPVSNSPPVTSPPPPLNNATSPATPP-------PVTSPLPPSAPPPNRAPPPP 62

Query: 260 PPPHAPSVILHKESGINFGILPKGVRIPPSGPNDGGFPIINNVLSDPSTKHPKRHIRFST 319
           PP      +      +  G  P  +  PP   +    P+I +    PST  P + +   +
Sbjct: 63  PP------VTTSPPPVANGAPPPPLPKPPESSSPPPQPVIPS--PPPSTSPPPQPV-IPS 122

Query: 320 GKPKRVPDFDPPSSVGILSKEERTTPSGPSPRTSDNP----------------PPPPPHV 379
             P   P   PP+ V  L        S P PR S +P                PPPPP  
Sbjct: 123 PPPSASP---PPALVPPLPSSPPPPASVPPPRPSPSPPILVRSPPPSVRPIQSPPPPPSD 182

Query: 380 SSTILHKQSSINFGMLPKGERIPPSGPSQRTSDSPPPPPHAPSVILHKESGINFGILPKG 439
             T      S       +  + PPS PS+R + SPPPP                      
Sbjct: 183 RPTQSPPPPSPPSPPSERPTQSPPSPPSERPTQSPPPPS--------------------- 217

Query: 440 VRIPPSGPSTRSSDYPPPPP 441
              PPS PS R S  PPPPP
Sbjct: 243 ---PPSPPSDRPSQSPPPPP 217

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6603169.11.0e-8649.88hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_031744042.11.2e-7447.34proline-rich receptor-like protein kinase PERK9 [Cucumis sativus][more]
XP_022967687.12.1e-7148.26actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima][more]
XP_031744003.19.1e-5943.83abl interactor homolog [Cucumis sativus][more]
XP_038882352.11.2e-5071.61uncharacterized protein LOC120073615 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
Q9FLQ71.2e-0826.41Formin-like protein 20 OS=Arabidopsis thaliana OX=3702 GN=FH20 PE=2 SV=3[more]
Q84ZL07.9e-0526.64Formin-like protein 5 OS=Oryza sativa subsp. japonica OX=39947 GN=FH5 PE=2 SV=2[more]
Match NameE-valueIdentityDescription
A0A6J1HRH71.0e-7148.26actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3... [more]
A0A6P5RDM41.1e-4941.61formin-like protein 5 OS=Prunus avium OX=42229 GN=LOC110745147 PE=4 SV=1[more]
A0A6P5RC982.3e-4742.18formin-like protein 5 OS=Prunus avium OX=42229 GN=LOC110744709 PE=4 SV=1[more]
A0A2P5WMU62.1e-4541.26Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA28257 PE=4 SV... [more]
A0A6P4LZB05.8e-4343.65formin-like protein 20 OS=Gossypium arboreum OX=29729 GN=LOC108455019 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G07740.18.3e-1026.41actin binding [more]
AT4G22505.11.1e-0929.27Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamil... [more]
AT1G68690.12.4e-0428.08Protein kinase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 389..403
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 67..81
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 347..361
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 207..221
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 430..444
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..454
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 109..123
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 249..263
NoneNo IPR availablePANTHERPTHR33599:SF13FORMIN-LIKE PROTEIN 20coord: 323..397
NoneNo IPR availablePANTHERPTHR33599:SF13FORMIN-LIKE PROTEIN 20coord: 189..289
coord: 49..149
NoneNo IPR availablePANTHERPTHR33599:SF13FORMIN-LIKE PROTEIN 20coord: 367..450
IPR039639Protein IDA-likePANTHERPTHR33599PROTEIN IDA-LIKE 5coord: 189..289
IPR039639Protein IDA-likePANTHERPTHR33599PROTEIN IDA-LIKE 5coord: 323..397
IPR039639Protein IDA-likePANTHERPTHR33599PROTEIN IDA-LIKE 5coord: 367..450
coord: 49..149

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC01G009150.1CmUC01G009150.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010227 floral organ abscission