CmUC05G085020 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC05G085020
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Location: CmU531Chr05: 3944764 .. 3986164 (-)
RNA-Seq Expression: CmUC05G085020
Synteny: CmUC05G085020
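
The Location value in the overview can be read programmatically. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style records, not stated on this page), and the names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location value."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a value such as 'CmU531Chr05: 3944764 .. 3986164 (-)'."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CmU531Chr05: 3944764 .. 3986164 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that assumption the feature spans 41,401 bp on the minus strand of CmU531Chr05. This is the genomic span including introns, not the length of the coding sequence.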
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGTTTCAGCTTCCGTAGTCGATGGATTCATCCTCACTCCGCTATCCAAAGCCGACGAAAACTACCACAGACCCACCGAAATCAAAGCTTTCGACGATACTAAAGCCGGCGTCAAGGGCTTAGTCGACGCTGGAATTAACGAAATTCCACGCATCTTTTACCAGCCACCGGAAGATTACTACTCCGACAATATTTCCGGCGAAACCCAGTATCAAATTCCAGTGATCGACCTCGATGACGTCCACAGAAATTCACTCAAACGGAAAGACACCATAAACAGAGTCCGAGAAGCTTCAGAGAAATTGGGTTTTTTCCAACTGATTAACCATGGGATTCCGGTGGGCGTTCTGGAAGAGCTGAAAGGTGCGGTCAAGAGATTCAACGAACAAGACACGGAAGTGAAGAAACAATACTACACCCGGGACAACACCAAGCCTTTGATTTACAACAGCAATTTCGATCTGTACAGCGCATCGACCACGAATTGGAGAGACACTCTTGGGTATATAAGTGCCCCTAATCCTCCCAATCCGCAAGACTTACCCGAAATCATCAGGTAATTAAATTGGCTGAAATTTCATATTCTCTGCCTGTGATTTGAAATTAACAGTGTGTGTTTACAGAGATAATCTGGTGGATTACTCGAAAAGAGTAATGGAAATTGGGAAATTGTTGTTTGAATTGTTGTCGGAAGCTTTGGGGCTGAACCCAAATTACTTGAACGAAATAGGCTGCAGCGAGGGGCTGGCAATTGGGTGCCATTATTACCCACCATGCCCACAGCCGAATTTGACACTGGGCACATCCGAGCACAGTGACAATGTTTTCATCACTGTTCTGTTTCAAGACAACATCGGCGGGCTTCAGATTCGACACCAGAAAAAGTGGGTGGATGTGCCACCTGTCGCCGGAGCGTTGGTGGTCAACATCGGAGAGCTTATGCAGGTAAGTGTATAATATAAGAACAATCTCTCTGTTTTTATGCTTGTAATTGAATTGGATTACTGGGATTTGCAGTTAATAACAAACGACAGATTCATAAGCGTGGCGCATAGAGTGTTGGCAAAGAAGGAAGGACCGAGAATTTCAGTAGCGAGCTTTTTTTCGACATTGGCTTATCGGAGCTCAAAAGTGTATGGACCCATAAAGGAATTGTTGTCGGAAGAGAATCCTCCAAAGTACAGAGAAACCACAATCAGAGATTTCCATATGCTGTATCGTGCAGATGGGCTTGGGACTTCGAGTCTGCAGCGTCTCAAGCTTTGATGTGTAATTAATAATAAGAAAGTTATGGAATTGTAGTTTAGTAACGAATGTACAATTGTTCTGTATGTTCCCCGTTTTGTTTGAACCTGTGTGTGTTGAGAAAGCGATGGAGTCTTTCTCTTCCTCCCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCAGTCGTGGGGTATTGTAATGTTGTTTATGAAAAAACTTCAGATCTTGCCTTCTAAACTGTGCAATCAATAGTTGTTTGTAGTTAATTGAAACACCAATATTTGGGTGTATTTAAGATGTAGGATATTTTAAAAAACAAGTAAGGATTTGACTTACTTGGATAGTTGTAAATATAGTAATCATGCTCAAAATATTTGTAGATGACACAATGCAAAAATGTATATATAGCAAAATTCAAATTTTAACTTCAGAGTCTATCATTGATGGACCATATCACTAGTAGGGTTCTATCAATCAGTCTATCGTGAATAGATTTTGCTATATTTGCAATGGTTTAGAAATGTTGCTATAAACTTAATTATTATCCTAACAAATGTTACCCAATACAATTACTCTTAAGAACTTTTCTAATTGAAATGAATGCACTAATTTATAAGAAAATACTTAGGTGAGGCTACACTTATCTCCTAGCGATCCAATAAAAAATTTCTATGGGACAAATTTTTTTAAATTAATTTTTCTATATTATTATTGTATATATGTAAAAGAAAGGGGACACGTTGTAAC
CTTATTGTTATTATAAGAAAAGCTCAATTGGTTGACTTTTAAAAAAAATATATTTAGGTGTCGTATTTTGTGCATACCACCTATTAGCTGATTGAAATTTGTGAGAAAATTTATAGTACTGCGATGCTACCTCCATTCATTGCTTGCCACAAAAATAACTTTTAAGAATTAAGTGGGATTCATGAAATAATGAAATAATAAATTAATAATTGATTAAATGACGTGTCAAATTATGAATGGAGTAGTATATACTATAATAGGACTAGATTTTGTTTTCTTTGAAATTTGCGATATATTGTAGTAAATATATATATTTTTTTTTTTTGAGAAATTGAATTAATTTTTTTTACTATATAATAAAAGAAGGTTCGTGAAATATATGCACTTACCCATTATTTTTTCTTTTAATTGTAGAGGTGGCCTAAAATATTATTTAGTTTATATATAAAACTTTTCTCTTTCACTGGCTATTTCTTTTTCTTCATGCTTTTGCTCTCGGGAGATCTTATTGCCTTCGTAAGTGATATAGAAAAATGGTACATGATTGTTAATATGAATATAACAATCCTCCTACCTAACATATAATACATATCTTCTTTCGGAGATATTATTCATGTAACACATTAACCCAAATAGTTAAAAGTGATATATTTTTGACAAAAAAAAAAAAAAAAAAGTCAAGCTTCAAATCCTCTTACCCTATTTTCGTATAGTATCAACATATGTAGCAACCTTGAAATAAAAACTCCCATAATAGAATCACACACACTGAATAATACTTCACGATAAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAAACAACTCAAAATACACAATTAATAACCACCATGTGCTCTAACAAACGTGCACATAATGATGGGCTTATGCCCAATATAAAATGGTTGAAAATAGCAATATTAATAGACTATTTTAGCTATTTTATTAGCACTCCTTCACTCGTTCTTGAACTAGAAATGGTATATAAAATGGTCTACCATGTTCATAAATAAAGTCTATATAGTACTTTATAAAGTAACCCATCCCACACTTTATAACATAACTTGTACACATCTCATGCACATGAAAAACTTTAAATAAAGTGGGTAGGGAGAAAAGAAATACGTCGAAGATACTATTGAACTAAGTTTTTTTTTTTTTTTTTTTTTTTTAATTAATACTTTTTGTATTAAAGACCCATATATTTGTGGTTTGAATTCTCTCTATTTAAAAAATTTTCAAGCTACTATTTAAACACTAATTTTGTACCTTTTATTTCATATTTTTTTGGTTGATTACTATATTCTATTTATGTGACTCAGCATGAAACGTGTTTCGTATGTGATTATAGGAAAAAAGTTTCTACTCTTTGATTCTTACAATACAACGTCCATCATTATAAAATTTTAATGACGCTCAATCAAATTTTATTATGTTGTATTGTATTGTACTTCCAAATTGGAAATATAATAAAATAATAAATGGATAGACATTAGTGGAGTGGATAGTGAAAATTAAATTGACCGCCCGTTCTTTTCCATTAAGAGATGGACACATTGAATTAGGGGACAAATTAGTGTCATTTCATTTGGAGGAAATCATTATATAATAATAATCCACGTGCTTAATTATTTGCACTATTCCAATCCTAAATGCATTATTTATTGATGAATTGGTCAAGCACGGCAACTTTGGTCATGTCTACGAAAAATGTCATCATTTGAATATAGCGAATACAATTCAAATGACATGGGATATGTGTTAGCTATAGTTTATGATTCAAATTCTTTTCTCTCAATACTAAAATAGAAAAAAAAATCATTTGAATATAGAATTTTTTTTTAGAGAATATTCTCCCAACTTTAAAAATGAATTTATCTTGTATTTAGCCATGAATTTGGCCTAATAATTTATTTATCAAACAAATTTAGTTAGTTTGAATCTCCACACATTTTAGGATAATTTTGAGAT
TGTTTAGATTCGTTAAAAAAATTCAATTAGTCCAATTGAAATGGTTAAACTTATATAAGTTTGGATTGTATTTATATAAATGAAAAGTTTACTGGGTGAGATATTGATTAATTCATCTCATTAAAATTAGTTAGTTGAATGGAATCAACTCTACTAACTCATCAAAACAAGATTGATTAACTTTTTTTTTTTTAATTATACAAAATACTCTTAAAATTTTTAGGATATGTTTCAATTATATATATGACTCTAAATTTTATAATTTTTTTTTATTTTTACCGCTTAAATTTGAAAATTGTTTCATAAAATACTAGTGGAGAGTGTTTTCATTAAATTAAAGAAAAAATAAAAATAAAAAATATATATTTAACACATTTATGATTGATGTATAAAATTGTTAAGATGATTATAATTCATTTGGTGTAAGTTATCCTATCAATTTCAAGATTTGAATTTCAATTTATCTTTAAAAGTAATTTGAAAATGACTGGATTATAAATCTTAATCCACAAGTAAAAATTTTTCAGCTTTATCCCAAAAAAGAAAAAAAAAAGGCAAAAAAGGGAAAAGTTTTTCAGTAAGAACAAAATGATAAACTGCCAATTACCGTCTTCCATTTCTGCGTTCTTTATGTTACAATTTTGAAAGACGTTCTTTCAATACACAAAACTGGTTCACAGAAGCTAAGAGATCGACGATGGCGGTCGCAGCCTCAAGAGTCAACGGAGCCAATCTCACTCCGCTATCCGAAGCCGACGAAAACTACCACAGACCCACAGAACTCAGAGCTTTCGATGACACCAAAGCCGGAGTTAAAGGTTTAGTCGACGCCGGTATTACCGAAATTCCTCGAATTTTTTACTTCCCACCGGAGGATTATAACTCCGACAACGCAACCGTGGAAATCCAAATCCAGATTCCGGTGATAGACTTCGACCACGTTGGCAGAAATTCACTCAAACGCAAATACACCATCGACAGAATCCGAGAAGCTTCTGAGAAATTGGGGTTTTTCCAACTGATTAACCATGGGATTCCAGTGAGCGTTCTGGAAGAGATGAAAGATGCGGTTCGGAGATTTCACGAACAAGAAACGGAATTGAAGAAACAATATTATACCCGCGACCTGACTAAGCCTTTGATTTACACCAGTAATTTCGATCTGTATTCTGCCGCGACCACCAATTGGAGAGATGCGTTTAGATATGTTAGTTCCCCAAACGCCCATGATCCGCAAGTCCTGCCAGAAATTTGCAGGTATATCTCCTAATTTTTTACCAAAAAACTAATACGCTGTTTAAACTAACCTCTGTTTGTGTAGAGATATTTTAGTGGAATACTCGAAACAAGTGATGGAGATTGGGAAATTAGTGTTTGAATTGCTGTCGGAAGCTCTGGGTTTGAATCCAAATTACTTGAATGATATAGACTGCAGCGAGGGGCTTGCATTTGTGTGCCACTATTACCCACCATGCCCACAGCCAAATTTGGCCATCGGCACATCGGAGCACACTGACAATGGCTTCATCACTGTGTTGTTGCAAGACCACATCGGTGGCTTACAAATTCGCCATGGGAACAATTGGGTGGACATTCCCCCTGTCGCCAGAGCTTTAGTGGTCAACATCGGAGATCTTATGCAGCTAATAACAAATGACAAATTCAAAAGCGTTAAGCATAGAGTTCTTGCAAACAAGGAGGGTCCGAGAGTATCAGTGGCGGGCTTTTTTTCCACGCCTGCTTTATCATTTTCCAAATTGTATGGACCCATCAAGGAATTGGTATCAGAAGAAAATCCGGCAATATACAGAGAAATCACAATTAGAGACTTCAATGTCCAGTTCCATTCAAAAGGCATTGGAACTTCTACTTTGCAGCATTTCAAGTTGAATCAAGAAGATACTTAGTTGTTGTTGGAGGGAAATAAAGAAATTAGAGTGATTGATTCTGGGTGTTCTTTGTTTAAGGAACACGTGTCTGTTTACAAAATGAGTTTAAAC
ATTCTGTTGCTGAGAAACTAATCTGAAACTCGTTCCATGTTGTTGTTGTGTAAGGTAAGATTGCAAGATGAACATGAACAATAAGATTGTTGTCGTAGGCTGTTTGGTAGGCAATCATGTTTTGTTGTAACGTTGGTAGAAAAAACACTAGTGAATTGATGTGAAAACTTGTATGTCTAAAAAAGAATATTAGATAAATTGTTGGGTAATGAATATGAACACTGCTAAGACGCCATCAACTCCTAAATTTATGTCTATATATGGTTTGGATTGAACTTGTAGCTAGGGGCCGGGAGGCACTGCCTAGTTACTCTTGCCCCACTCGCCGCTAAGTCAAAACATGTGAAGTGTGAACATTAAATATTTCGAACCCTACCTTGTAATGAAAGCAATTTGAATCAATTGAGGGGCACGTTCCCTCCTTCGTTCCCATGTGGATCCACCCTTGGTAACAAGTAACAACTACAAGATACTTCTTGTATTTTTTTTTTCTCAAATATCTAATAATACAAAGGAGTCCACATTGGTTGTGGTCTCCTCAAATATTTTGGCTAACTTAAAAGGCGACTCATTGATTAATAATCAAAACATATAATATCAATTTACATGCCAAAATAGATAAAGTTGTATCTTTATGTGGTAGGATATATATTTGTTCATACCAAATTTCAGAAATTTGATAAGAAATGCCACAATATTGCTCTCTCTAAAGTTCTATCCTAAAGGGTGAAGGACACAAATCAAATCCAAAACTCATAGTTGATGATTGATGAACAAGTAGTGAATATTGATATATACTATACTATTGAGAGACGTGAAAAAGGCATGGATAGCCAAAAAGATGATTATTTGTTAAAATATATTTATTGCCTTTAAAAGGAATGAGTAGTTCACACTATTATGTTTACTACTGGTAAGTTTTTTTTTTTAAAAAAACTCTACGTGTTAGGTCTTTAGATTCACTAAACTTAGAAAATATAGGTTCAATATTTTTATATTTTTAGTGTTTAATTTTGAAATTTGAAATTTTAAAATGGAGATTATATTAAAAAGATAGAAACAATCATACATTAACAAATATATTATTCTAGGTTAAAAATTCAATTATTTTTGTTAAATTAAAAAATGTAAATTATATGATATTACAAAATAATAATTATACGAATTTCTTATGAGAAAATATGTTCACCAATTAATTGATAATAATTTATATTTAATTTAATATTTAATGAATATTTATAATTAGTTTTATATTGCATATCAAACACAACTTCAACCATTGTTTAAATTGAAATTTAATTTATGAACTTCCTACTAAATGCATATATAAAAATATGGAATATAAACCTTATTTTTATTGAATGTTTTGTTTCAAATGGTTTATGAAACATACTTTTCCTTTAGTTATGAATTTTGGTATTGATTTGAACCATAAACACAAGAACACGATAATTGATTAAGCAATGATATTACATGCATTTGAATTCAAAATAATATACTGTCAATATCATGTTAAATTAATATTATACTCAATTCAATAGAGTTAAAAGAAGCCAACTCAAATTCGATAGAGTTAAAGAAGCCAATTAAAATTAAGCGTCCAACGGAGCCATGATAATCGATTTCTTACTTGAAAAGAAAATAATAGAGTTAAGGTATATTTAATCTTTCCAAAAGCAGAAGAGGTTAATGAGAAGAATAAGGTAATTTTTAAAAAATATCCTGCTCTCAAACTATTTAGGAAAATAAGATATTTTTTATTTTTTAAAAATATTTATAATAGATAGCAAAATAAGAGAAAAAATCAGTACATATTTTAAAATATTTGAAAATATAACAAATGTTGACTAGTCAACTTTTTTATTTTTGTGATTTTCAATTTTGTCTTCTTGTCTTGCATAACTTCTTTTCTTCTTCTTTTTTTTCTCATTTTCTTTTTCCAACTTTTGCTTCCATCTCCCCAATCTCCTCATATATATATATATATATTTTTTTTTTTT
TCAATTTTTCTTTCATTTCTCCAATCCTCTCAAATTTTTTTCTTTTTTTTTTCTTTTATTTTATAATTTTTTTCCATCTTCCCAATCTTCTCATATTTTCTTTTTCTTTTTTTTTTTTTCACAGACGAAGTTAATTAAATTAAATAAATATTTGTTAAATTTAAACGTTTAGTTAACTTTCCGTAACTTTTTTTAAATACTTTTTGAATGTGGCACAATTCCTAAAATAAGGACGATTTTTGAACCAACGTATTCTACCAAAAGACAAAAAAAAAAGTGCCTTATTTGTCTAAATAGTTTGAATTGTACCCCTTTTATATAATATTTTATAAAAATTATCTTATTCTTCTAATTAAACCGAAGTAGGAGCCATCTTAGCGAACATAAAAGTTGTCTGTCAATTTGTGTCTAATGACAAACAATTTTGATTGAACATGTATTGTCGAATTGTGTACTTAACCACATACTAAGATTGTTGATAGTATTATTATTCAACATGTGCAACATCAACTTGCATATTTAGCCATCGTACCTACATAGTTCACATAATTTAGATTTATGATTATAACTAAAGTTAAAGTTTAATAAATACATTCCCAATTTTATTTAAAAGATGACATAGATTTAAATTTATATTCGCATATCCAAAAAAAAATTGTATTTGAATTGTAGCTTGTAAGTGCTACTAAAAGAAATGATATACAATTGACAACATAGTTTAACTTATTAAGTTATATTGAATCTTTTGATGGGATATCGATTAAAAAAAAGAGATACATAATTTTTATTACATTGTTGACATGTATCACATATCCAAATTGTCCACCATGTTTGCACACATGGTTTATTAATGAATGAATTGGCCCACAAATTATAGGAAAATTTGTCAACACCTCATGCATATGGAAAAAAAATTAGAAAAAGAGAAGTGTCCAAAAGACTACCATTCATTCAAGACTATTCTTTCTTCCTTTCATTACATATTTTAATTTTGGTTGATTATTCTTCCCTTCATGTTATTCATGACTTATTAAGAAACGTGCTTCTTTAAGGATTTCCTCCCTTTTTTCTTCTAAAGGTTGTTTCCAAGTAAAATGGTGTAGGATTGACATTATTTATAAAATAATGGATGAAGTGAGGTGATAAGATTGAATTTACTATTAATGACGTTTTTCCATAGAAATAGACAAATTAGAGAAAACATTATTGAGAATACGATTAAAGTCCCACATTCATTAGATCAAGACATAGTCATGAGTTTATAAAGGGAGACAACTATATGTCTCCAATAATATGAAGTTTTTAATTAAACAATATCGTAGAAGTTGTTTTAACATTTACTTTTTTTTTTAATTATTAAAAACTTATTATTTATTAAAGGAAATAATTATTTATTAAAAAGAATTCACGTGCTTATTTACACTATTCCTATTCCAATCTAAAATCATTATCTATTGATGAGTTGGGTTAGCATGCCAACTATGGTCATGTGACTATCTACAGAAGAAAAAAGACCATTTCTTATCCTTACAGTCAAGTGGATTAGTTAAAAGATGGCAAATCAACTATTTTATAAAGGTTAAATACTTCAAATTACAAAATCAATTCCACACTTTCTAGTAGGAAACTATTATAATTTTTTTTAAAAAAAATCAAACTATTTACAAATATAAAAATATTTTAGTGCCTATTTTTCTTTCTCCCATTTTTAATACTATATCTATTTTATTTAACTAAACTAAAAAATTTTACATATTATATTTCTAAAATTTCAAATTTAATTAATAGGTCGCTATAATCCATTCTTCAAGTCTAATGTTTACATAGTTGTACGATCTATATTTATGAATAAGTGTTTTTCATCCATATCTTGTTAATCTCTTTTATTATGTTGTCCATCTAAAATTAGTCTATTTAACATGTTTCAATTGTCATATTGTATCACATAATAATTAATTGAAGTATTCTATATGACATGTACGTATTACTTTTTAAATGGA
CAAATTCAAAAGAGTTGAAAGGCTAAATAGATAGAATCAAATTTGTAATTAACTTTTTTATTGAGTTATTTGATCTCTATAAACAAAGGATCATCCTCAACAATATGTGATTAAATTGCTATCTGCAAAGTGCCAATTTGAGGATACATTTGTTGGTGAATAAGACATTAATAACTATCCAAAGCTCGGTCAATCTCCTACGTTCCAAATTCTTGAATAAAAGAAATTATTCTCCAAGTAAATTTTTGCCGTCTATCACATAGTAAATTTTTGTTGTATTTATAATTATTTTCAACCGTTTTACCATTTAAAACAATTTACTTAACTAACTAAATTATAATTAGAAATACGAATATGTAAGTATAATAGTCTATTTTTATTATTATTATTTTTTTTAGTCAGTAGGGACAAACCGGGCCCTAATACCCTATCAGTTTCTCTATTAAATATGTTTACAGTCGATAACCTCTTTGCTAAAACACTCTACAATTTTTGAAAAGACATTCATCGGATATACAGAGGAAGTTGTTCCATTGAAGCTAAGAGATCGACAATGGCGGTCGTAGCCTCAAGAGTCAACGCACCCAATCTCACTCCGCTATCCAAAGCCGACGAAAACTATCACAGACCCACTGATCTCAAAGCTTTCGATGACACTAAAGCCGGCGTCAAAGGCTTAGTCGACGCAGGAATCACCGAAATTCCTCGTATCTTTTACCGCCCACCGGAGACTTTCGACTCCGACAATATTTCCGGCGAAACCCAAATTCACATACCAGTGGTAGACCTCGACCACATCAACAAAAATTCACTCAAACGCAAGTACACAATCGACATAGTCAGAGAAGCGTCAGAGAAATTGGGTTTTTTCCAACTGGTTAACCATGGGATTCCGGTGGACGTTCTCGAGGAGATGAAAGATGCAGTTCGGAGATTTAACGAACAAGAAACGGAATCCAAGAAACAATATTACACTCGTGACCTCACCAAGCCTTTAATTTACAACAGTAATTTTGATCTGTATACTGCAGCCACCACCAATTGGAGGGATACCTTTGGCTATATAAGTGCCCCAAATTCCCACAATCCTCAAGACCTGCCGGAAATTTGCAGGTAAAGATGCTGATTTTGATTAGTCAAATAATCCCTTTTACTAATTACTAACCTCTGTTTTTGCAGGGATATTCTAGTGGATTACTCGAAACGAGTGATGGAGATTGGGAATTTACTGTTTGAATTGCTGTCGGAAGCTCTGGGTTTGAATCCAAATTATTTGAAGAACATAGACTGCAATGAGGGGCTTGCACTTGTATGCCACTATTACCCACCATGCCCACAGCCGAATTTGGCCATCGGCACATCGGAGCACACCGACAATGACTTTATCACTGTGCTATTGCAAGACCAAATCGGCGGCCTTCAAATTCGGTATGAGAACAAGTGGGTCGACGTGCCCCCAGTCGCCGGAGCTTTAGTGGTCAACATCGGAGATCTTATGCAGCTAATAACAAATGACAAATTCAAAAGCGTTAAGCACAGAGTTCTTGCAAACAAGAAGGGTCCGAGAGTATCAGTGGCAGGCATTTTTTCGACGCTTACTTTTCCAACCTCAAAATTGTATGGACCTATCAAGGAGTTGCTATCAGAACAAAATCCTGCAATATACAGAGAAACCACCATCAGAGACTTCAATATCCGGTTCCGTTCAGACGGGCTTGGAACTTCTACTTTGCAGCATTTCAAGCTGAGTCAAGCAGATGCTTAATTGGTTGTTAACACAGGAAAGTAAAGGAAACAGAGTGTTTGATTCTGTGTGTTATCTGTTTTGATTGAACTTGTGATCGTCAACCTTCACAATCCTATAGTTTGAAAAACTAAACTGAAACTTGTTCCAGGGTGTACGTTGTTGTCAATGAAAGGTTGGTACATGAATATGAATAAATAATAAAATTGTTGTGTTGTGTCAAATTTAGATATTTTGTAAAATTAAGGTGGTTTGA
GACACAAAACAAGTGTTCAATAGAACGCTAGGACAATCACCAAGATAACAAATAAAATGCATAAATAAAGAACTAAATAAATGTTCTAGTTTGATAACCCAGTTTGGTGACAAATATATTGATATTAATTTACTTTTGGCCGACAATATATTCGATAAAGATAAACTCAAAAGTCAAACGCTTGTAACATTATACTTATTTGTAAAGTAATGACAAATATATTAACATTAACTTTCGTAAAAAGTGATTAAAAAAAATCTTAACGATAAAATAAAACAACTTATTTATTAATTTGTAAAAATCACTCTTAAATTTAAAATAACCTATTCAAGCTTACAAGAAGCTTTTAGATTTAAGCAAATTTGTTAACTTGTTAACCACTCAAATAAATATGTTAATATTATTGTATATTAATATATTATGTCAAATCCATTTTCAAATTTTAATTTATTCAATTTTTTATAACAAATGTTTTTCTAATGATCTAGATTAAAGAAAATTAATTGAAATCTAGATCATACATAATTTTAAACAAAAAAAAATTTAAAAAAATTTAAAATTTAATTATCAATATGATAATAATAGAAATATGCTACTTTGGCTCGTATAACTAAATTACACTAACTTATTATTAATTTACCAAATACCATAATTAACTTTTTCTAATTTAAAATTAATATTTACCAAATATTATTTTACTTCCGTTCACAGTCGATTTTTCTAGAAGCTCATCGGCCAGAAATTATTTTAAAATCTACAACAATCCCAAATGGAGCCTTAGACTCTCTTTAAATTATGAATATTATTGAATGAACATGTACGCTCCCCTTAAATGTGAGACTCCCTCTCAAGCAATGATGCTTTAAGATCCTCCAAGTGTGAGACTGCTTCTCACTTAACTGTTATCTTTCCACTTCAAATGAAGATCACTTCACTTGATGAACTTAAACTTACTCTAAGTTTGAGAACTCCTTCTCACTCAACAATGCCTTAGGATCCTCCTAAGGATGAGAATTTCATTCTCAATCAATGATGCCTTAGGTTCCCCTTAAAGCCAAGACTCTTTTCTCAAGATTGATTCATCAAGACAAACGAAACAACATGCCTCAAAACATTGTTTAACCTCTTGGTTCCCAAACATTTGGATTGACAAAGAACTTTGTTATTCGGCCACCACCCAAAGCATAAATACCTTAACTTAAAACAAAAACGGCTACCCTAGCAAGACAAAAACTTTCTCTTAAATAGGCTCGGCATAAATGCGAAAAAATACCCCATTAAAACAATGTAGAACATAGACAAAGAAAAAATCCAAAGGAAAATAAAGATGAAGGATGTTCCAAAAAATGAAAGATATAATCTGCAAGATCAACATTGTGAGAGGCATTCAATGGAACATTTCTAAGACATTTAGCAACCAGGGCAACAATAAATCCCAAACATTCTTAAACAAAATATAGCCAATTCGAATCTTTAACAAGTTGTATATTTATTTTGAAGGATAATTGTATGTAGCAATAGCAAATCTCATAAATTTGATAAGAGATGTTTCAATATCAATCATTTTGGCACTGAAGTGTTACAATATTGCTGACCCCAAGTTTGAACATAAGAGAAGTGTTTAAAATTAGTGCAAGACATATGTATCTTTTTTTAGTTCAATAATTGTGGTAAGAGATTTAACCATCTAACAGTAAAATGTTAGTTTTGAACTTTATCTTGCGTTACCCAAGGAGGGTCCATATGAAGAAAATCCTGCAATATACAGAGAAACCACAATCAGAGACTTCACTATTCAATTCCCTTCAAAAGGTATTGGAACTAATAATTTGCAGCATTTCAATCATAAAACAAGCTGATGCTTAATTAGTAAATGAAAAATAAACCAAACAAAGTGATTCATTTTGGGTGCTCCCTCTTTGATTCAACTAGTAAATGGTAACCTCTGCAATTCTATACTGGATTAAACTGAAAAACTTAACTCAATTAAACCTTCCTCC
ATGGTGTTTCTTTTCATAAATTTGACAAAGGTATAGTTAGTAGTAGATCGGCTATTCGTTCAAGTGGGGAAAAAATTCACACACGTGATTATCTCTAAAGCGATACATAATAAAATGCGGCCTATAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTGTGTTTGTATGAAAAAAAGTAGAGATTGGAGAGTGATCAGATTAGGGCTACAATAGAAATAGTCATTCCAAATATTATATTTTATAAAAAAAAAAAAAAGCAATATTCCTCGCATCAATCAAAAAGACAAATGTGAAAAGGTCATCCATACGAAAAAAAAAAATATATCCCTTAAACTGATCTTGATAGATTTGGCAGTCGATTTGAGGATTTAAATCTATATGTTAATTTGAATATATGTAACTTTTTTTAAAGTGAGTATACAGTAACATTTATTTTTAAAAACTTTAGAATTTAAAAATAGTAAAAATTAACTTCTTGATCTATTAAATCTTAGCATAAACAATTAAAATTTTAAGCAATTTAAAATTAAATTAATGAAAATTTTAATACATTCATAATTATTCATAGATGAAGAAGTGTTACGATAAAATATGATACTAGTATCAATTTTTTTTTATTTTTTATTTATTTATTTATTTTTGTCATACTTCATATCAAATCATATTTATTTATTTTAGCATATCATTTTTGGACTATTATGTCAATATACTAACTAATTTACATATTCCAATATGTTATGTCCTTAATTATTTCGTGATGATATAATACTATTTGAAGTTTTAAGTTTTTGAATATTATTATGCATCGGTTATATATTTTTTATTTAATTATTTTGAACTAAAAAGAAAACAAATAATAGTTAATATGATATAAATTTATTATTTTTATATAATATAACATTTTAATTATTTGGTAACTTTAGCTTGGAGTATCTTAATCTATTAATTATATTTTATAAAACAAAAAAAATTCTAGGTCAACTAAAAATTAAAAGAACCCAATATCTAGATACATTTAGTAATTTTTCTTTAAAATTGAATATACAATAATATTTATTTTAAAAACTTTAGAACTTACAAATAGTAAAAATTAACTTAATGATCAATTAATCCTTAGCATAAAGAATTATAACTTTAAACAATCAAGGGAAGTTATTTTAAATGATAGGAAATATTTTCGCAAAAAAAAGAGCTATTCTAATGTGTTGAGAATTTTTTTTTTTTTTTTTGCTGTTTTGTTATATTTGTAAGTAAGTTGATTATTTTGTTCTATATGAAAACAACCCTTAAATTAATGAAGATTTAAATACATTCATACTGGTTAATAAATGAGAAAATGTAAAGATATGAAGATTTAAATACACTCATAATCGTTAATAATAGATGTGGGAGTGTAAAGATAGAGTATAGAAAGTTTTTGTTGGAATGTGATAAGTTTTAGACTCATTGTTTTATCATAGATTGAAAAGTAACCTTATGAAGTTTAATTTGGAATTTCAAAAAGTGTGAAAACTATTCAATTTTCTACAACAAAATAAGTGGTTCCTTTTCCATATCTAATTTTTTATAGGTTGAAAATAATTAAAATTAACTTTATAATTTATTGAATCTTATAATTATACTAAAACAAAATTTAAGGGTTCTTTTTAAATATAAAAAAAAAACTAAAATATTTATAAATATAACAAAATTTCTTTGTCTATCTATGATAGATTATTATCTGTGTCATGATAGACACGAATAGTAGTCTGTTGTGATCTATCGCAACCTATGGCAGATAGACGATGTATTTTGCTATTAGTTGTAAATATTTTCAACAGTTTTACTATTTAAGATAATTTCCCAAAATTTTAACCAATTGAAAATTTAAATATAAACAAAAATCAATGAAATTTCAATTTACTAATAACAATTGATAAATGAAGTGAATATATTTTTTTAAAGCATTCATTTATAAAATTTTAAAGATATTTACGAGAATTTAT
AAAGAAGTTGTAACATTAGAAAAGTCTTACTGCTCCATTTCGTACAACTACATAAATAACTCTTCCCAAAATGTATTTGAATAATAGTGATAATATGAGTATATAAAAAATCCAACCTCATTTGACAATAGTACATGCAAAATCAATTTTTAAAATGGTTATAAACTCATATTTGAATCATATCTAATTAAAGTCACATTTCAACTATTTAATAAAATTACATCTTGACATTTATTGAACACTGTGACGTGGTTTCGAACTGGCTAAATAGAAAAGGAAAGAAAAAGAAAATGTATAGATTTGTCAAAATTCATAAATACTTTGAAATGTTTAATGAATTATCAAGTCTGGTGTAAAATTAAGTTTCATTAATGCATAAAATAGGCATAAAATAACATCTCTCAAATTCATAAATTTCAACCCTAATGCGATGCATTATAAAATGTGTGTATAGTTCGTGCCATTGAGAAAGATATCGGGATAGGACTACAATAGATAGTCATCCCAAATGCTATATTTTATTTAAAAGAAAAAAAAAAAAGGATTGCTCGCACCAATCCATCAATCATGGAGACTTGTGAAAAGATCATTTGTATAAATTTTTTATATTTTTTATATTTTTTTTAAAAAAAATTCCTTAAACCAGTCTGGATAGATTTGCCAATTGATTTTGGGAATTTAAATTTAGATGTTAACTTTGATTTCTCTCCACCAAAAAAAGAAAAAAAAAAACTACAAAACCGACCTTGTTTAATTGTATCATAATTTCAACCGTAGCTTATTTATTGAAATTATAAATTTTTTAAAGTTGTGTATTACTAAAGGTAAGAGAAAGAAGTTAAGTGAAGTAATTAGGTTCATAAAAGAGTATTTCAACCTACCAAACTCAAAACTTAAATATTAAATAGGTAAAATACTTATTAACTAAATGAAAATTAGATTTAAAATTTAGGGGAAAAATATACTTTTCCTTATAATTTACTACCTTGTTGATAGTATGAATTTTCTTTCTTCATAGAAGTTCTTGAAAAAGTGTCATAAATAACCATTAAAATTAAATAAATAAATAAAAACTGTTTTTCTGCTTGGCATATGCATCCCACCAATTTTGAAAAAAATATATTCTTTTTCTTTCTAACTTCTGTCTAAAATTTTCGCATTTTCTCGATAAAATGAGTGCATTTATACCATTTTGTAATTTTAATTTCTTTTTTTGAAAAATTATTATAAATAAATAAAATCAAACTATTTAAAAATATAGCAAAATTTTACTTTCTATATGTGATATTTTTCGGACTTTGAATTTTTAAGAATGTGTACATTTGATTCCTAAAGTTCCAATTTGCCATTTTAGTTCCTTATTTTTACAAAATAGGTTTAAAAGGTCATTGTATCTATTTTAGACATATTTTTATTATTTCAAATTTTAATAGGATGTTTTGAAATTAAAAAGAAATTGTATGTATTTTATATTCTCTCTCTTCCCATTCACTCTCACCTCTCTCCCTCCTCAGTCCTCGCCTGCCTTTGGGCTCCATTTCAATTACTCCTCCCTGAAGCCATCCACCAAAATCTCCCAAAGTACACCACCTTTTAAACCTATTTTATATATTTTTAATAATTATTGAATGCGCTACACAAAAATAATTCCCAACTCCTCCGGTATTTTTTTTTTTTTTTTTTTTTTAAAAAAACTGAATTCTAAAATGATTTTTCCAGCATGGATTGGCTCCCAAATCTAAAATTCAAATTAGGAAAAGCTATGAACATCCAGTCAAACTTTGGAAACTCCTTATTAGAGGCACCAACTAACAAACCGTCGTACCATTGGCAGTGTCACTTCAGCATTTATAAATGCTACGGTTGCCTTGCCACCTTTTCCCCAAACCACTTTACAATCATTTTCAAACCCAAGAGATCCAGATCGATAATGGCGGTTGCAGCCTCAGAAATCAACGGACTCAACCTCACTCCACTATCCAAAGCCGACGACAACTTCC
ACAGACTCACTGAACTCAAAGCTTTCGATGACACTAAAGCCGGCGTTAAAGGCTTAGTCGACGCCAAAGTTACCGAAATCCCCCGTATCTTTTACCACCCACCGGAGGAATATGACTCCGTTGAAACCCAAATCCGTATTCCACTGATAGACCTCGATGGCGTCGGCAAAGATTCACTCAAACGCAAACACATCGTCGATCAAATCCGTGACGCTTCAGAGGAATTGGGTTTCTTCCAAGTGATTAACCATGGAATTTCGGTGAGTGTTCTTGATGAAATAAAAGACTCTGTTCGGAGATTTCACGAACAAGACACGGAAGTGAAGAAACAATATTATACTCGAGACCTCATGAAGCCTTTTATTTACAACAGTAATTTCGATTTGTATTCTGCACCCACCACCAATTGGAGGGATACTTTTAGCTATGTTAGTGCTCCAAATCCTCCCAATCCACAGGAATTGCCGGAAATTTGCAGGTAAATGTGCTGATTTTGGACCCAAATAAAGTACCTTTTTAGTTTTACTAACCTCTGTTTTTAGCAGAGATATTCTGGTGGACTACTCGAAATGGGTGATGAAGATTGGAAAATTAGTGTTGGAATTGCTGTCTGAAGCTCTAGGTTTGAATCCCAATTACTTGAACAACATTGGGTGCAGTGATGGGCTTGAATTTGTATGCCACTATTACCCTGCATGTCCACATCCAAAATTGACCACTGGGATATCCGAGCACACGGACGCTGACTTCATCACGGTGCTGTTGCAAGACCACATCGGCGGCCTACAAATCCGTCATCATAACAATTGGATTGACGTGCATCCTGTCGCCGGAGCCTTAGTGGTCAACATCGGAGACCTTATGCAGGTTAGTCTGCACCTATCATATTATATTACATCTTAAAAATGGAATGAATATCCTTTAATTAGATTGGAAATTTAGATGCAGTTAATAACAAATGACAGGTTTAAAAGCGTTAATCATAGGGTTGTATCGAAACATGAAGGTTCGAGAATATCAGTGGCAGGTGTTTTCTCGACGATGCTTTTGCCAAATGACAAACTTTATGGACCCATCAAGGAATTGATATCGGAAGAAAAGCCTGCAGTTTATAGAGAAACTACAATCAGAGACTTTAGTATTCAGTTCCAGTCGGACGGACTTGGAACTTCTACTTTGAAGCATTACAAGCTGAATCAAGCAGATGTTTAGATTGCAAGGAAATTGACGATCCAAGATAACAGTTCAAGTGATTATGTGGACAAATTGCTACTTTATGTCTTTATCACATTTTATGCTTATTGTCTTCCGCTTCGTTTCCTAGTGTTACTATCGTCAAATTTGAATAAAATAATAGTTATCTTTTCGACGAGGAAAATTATTATACCCCAAAAAATTATCCGACTAAAAATTTCACTATTTATCTAGTATTTAAACTTTCAATTATATATTTAACTAGTCTTTAAATTTTAAATTTTATATTTAATAAATTTCTAAAACTTCAATTTTGTATCTTAAAAACATTAACTAATAATAATTAGTTTATTAAATATTATTTTATGTTTAATAGATAATTAATATTTACTTACCGTTATTAGCATTCATATTTATTAATTAATTAAGAATAATCATGAAATGAAAATTTAACTTTAATTTTATCAATAAAAAATCGAGCTTAAAAGTTTAAATCTTAAAACTTAAAGATTAAATTGAAATAAAAGTAAAATCTCACTAATAAAATTGTAATATTCTAATACTTAATAACTAATTTAATCAAAACTCAAAATTTAAATAGGTAACCTTTTGATAATAAAGACAACTAAACCCAAAATCTAGAGAAATTTTTTTTTTTTTTTCAATTCTTTTTATTTTTTTAATTACAATATTTAGGTTGGCGAGCTTGATTAGCATTATTATTATTTTGGAAACATGGCGAGATTGAATCTTGATAGTACAAAAGGTAAACTATTTTATTTTATATAAATGCGAGTCTAGAAACT
TGAAACTGAAATTGAGAGTCTCAAAGCAAAATCGACTCAGAGAAAATTTGTTATAATGTTGCACCATGCCGTTGACTTAACTCCGGCAGCCAATGCCGACGAACATTTCGATAGAGCTGCCGAACTGAAACTCTTCGACGACACAAAAGCCGGAGTGAAAGGCCTTGTCGACAGCGGCATAACCCAAATACCGCGAATATTTTACCGACTGCCTGATTCCGGCGTATCCCCTGTTCCCGGAGACACCGAACTGAGCATTCCAGTGATAGATCTCGAAGCCATCGACAGAGATTCATCCAAACGAAGAGATGTCGTCAACAAAGTCCGAGAAGCCTCTGAGAAATGGGGTTTTTTCCAATTGGTCAACCATGGAGTTCCGGTGAGTGTTCTGGATGAGATGAAAAAAGGGACACTAAGATTTTACGAACAAGATACCCAATTGAAGAAGCAATTCTACACCCGCCACAACACAAAATCCATTGTTTACAACAGTAATTTCGATCTGTTTACTGCACCAGCCGCTAATTGGAGAGACACATTCCTCTGTTTCATGGCTCCCAATCTTCCAAATCCACAAGACTTGCCTGAAATTTGCCGGTAACATTGTGGTCTCCGATGAGATAAAAACATGGGTTTTCAATGTTTTTTGCTATCCTCTGTTTCTCTTTTTCTTAATTGGTTGGTGTTTGTTGTAGAGATATCCTGTTTGATTACTCAAAAGAAATGAAGAAATTGGGGAGAATATTATTTGGGTTGCTTTCGGAGGCTCTTGGGTTGAACACAAATTATCTGAGCGACATCGAATGCGACAGGGGACTAGCTGTTCTGTGTCATTATTATCCAGCATGTCCACAGCCAGAGCTCACTTTAGGCACAACCGAGCACGCCGACAATGATTTCTTGACGGTACTTCTACAAGACGACCAAATCGGAGGTCTTCAAGTTCTTCACCAGAAGAAGTGGATTGATATTCCCCCAATTCCCGGGGCCTTGGTCGTCAACATTGGCGATCTTCTGCAGGCAAGCTTCTAGGATTTACGCATAAAGATTCAGAATGAAAGCTTTCCCATTTTGTAATACTTAGCATTGAATTATTTTTGCAGCTGATTTCGAACGATGGGTTCAAGAGCGTGGAGCATCGAGTGCTGGCGAATCGTGATGGTCCGAGAGTTTCAATTGCGAGCTTTTTTGGTATCGGCGTTTATACAACATCTCAAGTTTATGGACCCATAAAGGAATTGTTATCGGAACAAAATCCTGCAAAGTATGGAGAAACGACGCTCAAAGACTTCTATTTCTATCACAACAGCAGAGGCTTGAATGGAACTTCTGCTCTGCAACATTTCAGGCTCAGTCTGGACGATGAAGGAGATGCTACGCCCATTAAGGATTGTTTCGTTTAGTGGGTGGAAAATCCCGACGGTATTGGGTTTGATTTGTTTGATTTCATTTCTTTTAGTTGGTTGGATCTGTATCTGTTATTTGAAGTTTATTTAGTTAAGATCATCCATTTCTTTATTAGACATGTCTTAACATTTTCATTCATATCTTCAATCATTTCTGGAGTGGATCTACATTTCTATTATAATCTTAAAAAAGTCAACCAAACATTAATATACACATATATATAAATAATGAACATTTTAACTTAATATTGTAGTGTAAAATGTTACATATACATTTGAAGCCACCACATACATAGACTTAGATAATAAAAGAAAAGTTAATAATTTCAATAAAAAGAGAATTAAAACTTTTTTTTCAAATATTTGTTTGGTAACAAATTTAGAAATTTGATTCTAATTTAAACATTTTTTTAAAAAAATGTGTTTGATACAATATTTATAAACCATAAAATTAATTGTAGTTACACACTCAAAATTATATTGACTATGAACTATTATTAATTAATATATGAATATTTATGTTAAAAAAAATTTGTATAATTATTATTTTATAATATTATTTACTTTACATAGTACATGTTTTTTTTTTTTTAA
AATGAATTGAGTTTATAACATGCCATATAAAAAAACATAAAATAAATTTATGTCATAAACAAATTTTCAGTTATGAATTACTATATACAATCACAATAAAAAATGAAAATATTGATAAAATATCAATATTGATACATATTATATGTTGGGTCCGTATGAGAATTTATACCCAGTTTGATTTAACAATAAAAATATTGAGAAATATTTTGCTGCTATGATTTGGTTTGATTGGTGACCATTAATGTTTTGATACTTTACAACCCTGTTGTCACCAAATGCCCTAAAATCTCCACTTATCCCAATCACCCTACCAATCTCCTTCATTCAGTCCAAAAACCCATCCAGAACTACATCTTTAATTGAACAATTATGATTGTAAAGCGATCATACGAATTGACGCATATTTCGCCCCGAAGTATTTTCAATCTCCATATTTTGTTCCCAACTGAAGCTCCCTTAGTTCCAACAACAGAGGAATACCACACTGATTGAAATGGTTGTTGGGTTGACCATATTAGAATCTCCAATATCTCACTTACTTCGCCAAAGTCCATTGTTACCCATCTAACAATAGGGAGTTAGTATGGAATAAGGCTCCACCTAATCATTTAGGTTCAACAAACCTCTATCTTATCACCAAGTAAGTTAATTAGTTCTGTTTAGGCTTTTCACTATCAATATCATACAAAACATTGGAATTCTATCATATCTTACACAAAAAAAAAAAAAAAAAAAAAAAAAAAATTAAAAGGTGAAATGATGGGTAGAAAGTTGGAAACTAGAAGCAAGAAAGGCGGCGTGAGGGGAAAGAGTTGTGAACTATGTTATCATGTTTTCGTCGCATTATCAGATTTCTTGCGGTGGAACCCACAATGAACCCAATGGATTAGCAGCCATTCATGTCCTCCCCATCCCGATAAACATCCAAACTCCTCAAGATTCAAAGCTCCAAACAAAACAAAATGGCCAACCTTACACCTTTCTCCAAACTTGACCAGACCTTCGACAGAGCTTCGGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCGTCAAAGGCCTTGTGGACTCTGGCGTCGCCGAGATCCCAGGAATATTCTACTGCCCACCCAAAGAACATTCCAATTCCGTTCCCGAGGAAACCCATTTGGGTATTCCGGTGGTGGATCTGGAGGACATCGATAAAGATCCCTTCAAACGCAGAGAAGTGGTGGGCAAAATCCGAGAAGCTTCAGAGACCTGGGGTTTCTTCCAAGTGCTTAACCATGGAGTTCCGGCGAGTGTTCAGGAGGAGATCATAAATGGGGTGCATCGATTTTTCGAACAGGACATTGAAGTGAAGAAACAATATTATACAAGAGACAATACAAAGCCATTCGTTCACAATTGCAATTTCGATTTGTTTAGCGCACCTGTAGCCAATTGGAGGGATACTTTCTTCACTCTCATGGCCCCAATTTCTCCTAGCCCCCAAGACTTGCCTCAAGTTTGCAGGTAAACCATTTTTTCTTTTTCCGCTCTCTCTCTAATTCAGTTTCCAAATCCTAACCATTTAATTTTAAAAAGATTTTAAGTTCAGTTTCAAATTATGTTACAATTTTACCGTTAGAATTTTGTTTTAATTAGGTTCTCAAATTTTAAGATTTAAACTTCTAAACTTGATCATTAAATATTTATTTTATATTTTTACAGTTAAATTTATTGATTAATTGAAAACAGTTAATTATTTTAAATTAATTAATAAAATAAATATTTAGTGGAAAATTAAAGGATGAAAATATAAATCTTAATATCGATGGACCATATTAAAATAAATCTATTTTGACACCATTCCAATATAAGTCATAAAAGTTAATTTTCCCAACATTTTTTCACCTTCTATCTCTTTTTTTTTTTTCCGCATGTTATTTAATTTTTTGGTTAAATTACTAGTTTAATTGGTGAAATTTTTGTGTCCAGTTTGGTCTTTAAATTTTTTAACATATATAATAATTATCTCAAA
CTTGTTCAATAGCAAACTAACATATATATCCAAGATTTTCAAAAATTAATTAATCAATATGTATAAAATTAAATTTTATATCTAATAAAATTTAAAACATTCAATTAAGAATTGTGTTGCACGGAAAATTAATACTTTAATGATCTACGTTTTTTTTTCTTTAGATAGAGGTAGTTTTGAAAGTAGACTTATAAGTTGCCAAAAATATATATATTATAGAAAAGGAGATTCATTAGTGTTAAAATTGAGAGTTCAAACATCTATTTTAAAATTTCAAAGCGAAAGAATTAAATAGCACAAATCAAATATTTTTAAGGACTAAAGTATCAAATAGCACAAAATTAGATATTTTAGGGACTAAAGTATCAAAGGGAATAAACTTAATTTAACTTTTTATTTTTCATTTTCTTATACTTCTTAACTTAGGCTCGGAGGGACTGTGCTTAGTTATAAATGTGCATCACTGATAGCGTAGAAATGATCTTCTTCTTGAAGTGATACATAAAATTAATCTAGCAGTTTACAAATCTATCACCCGTGGACACGTGTGTATATTATACTTAGATTATCTTCAAATCTAGAAAAATGAACCAAAATATTTATAAATATAACAAAATTTCACTGTTTATTTGCGATATACTACTATCTGTATCTATCTGTCACAAAATAATTTTCCTATGAGTCATTTGCCACTGTTTATTGTAGATAAACTGTGATATTTTGCTATTGTTTTTAAATATTTTCAACAACTTTGTCATTTAAAATAATTTTCCTATAACATAAAAATTGATTTTGCTAATGTCATATGACCCAAAATTTAAAAATAATGAAAATTAAGTTTTTTCTTATGTAACAAAGTAATGCCAATTAAAGCAAAGATAAATTTGATGGTTTACTTATGGACATTTTTGTTACAGAGACATCCTAGTTGAATATTCAAAACAAATAATGAAACTTGGAGAATTGATATTTGGACTACTTTCAGAGGCCCTTGGTCTGAAATCAACTCATCTCGTGGACTTGGACTGCAACGAGGGACTTTCCATTCTCGGCCACTATTATCCGCCGTGTCCTCAGCCTGAACTCTCCATCGGCACAACCGAGCACTCTGACAATACCTTCATCACCGTGCTACTGCAGGATGGTATGGGAGGCCTTCAGGTGCGTCAACACAACAAATGGGTGGATGTTCCGCCAGTTCCTGGGGCCTTCGTTATTAATGTTGGCAGCTTATTACAGGTAAGTGATAGGACAATTATTTCTATTGATACGAAGGTGAAGTGAACACTATCATTTTGAACACTAGTTGATTATGTTACATGAGAATAAAGGAAAAATTAATGAAGGTTGTTTTGCAGTTGATAACGAATGACAGATTTGTGAGCTCGGAGCATAGAGTGGTGGCGAACCGTAAGGGTCCAAGAGTATCGGTGGCAGGTTTCTTCAGCACCGGCTCTCTACCAACCTCCAAACTTTATGGACCCATCGAAGAATTGTTGTCGGAACAAAATCCTCCCAGATACAAAAAAATAAGTGTTAAAGAGTACAATCTCTACTTTGCTGAAAAAGGGTTGGATGGCACTTCTGCTCTTCCACATTTCAAGCTTTGATTATATAGATAATTGAAGTTACCAATAAACATGAGATTTCATCTTAAAATTATAAAATGTCTTCTCTTTTTAATGTAGGATCCTCAACCTCTAAAATCCAGATGGTTTAAGATGCTTACATCACAAATATTGGCAAAGGATAAAAAAAAAAAAAAAAAAATCAAATTATAGACCGGCCAATTTTTCAATATCCAAAACTTTTTACTATTTTTTCATAATTATTGGCCTCCATCACTTAATAAGCTCATACTAATCTTTAAATCCACTAGTTTTATTACCACTCATCATCCACAATTTTATAGGGATAATTATTTTAAGTGATGAAACTATTGGAAATATTTTCAAATATAATAAAATATCATTGTTCATTAGTAAATAGACAGTGATAGA
CATTGATAAACTCGTTAGTGTTTATTATTAATTAGTCTATTAGTGTGTATCATATATATATTTCGTTATATTTATAAATATTTTGGCTCATTTTACTGTATTTAAAAATAATTTTTTATTTTGTGATTTTATAACTAAATAAATACTAATATGGCTAATCCGAGTATCAATCATTCCTCATATAGTGGTAAATGACATCATATTTGCCTTCTAATTTTCTTACCTAAGTCCCATTTCAATTCTTATTCTTTATTCTAGATTAAAAAAAAAAAAAAAACTTTTTTGGAACTTCAAAGTTTCAAAATTATTTTTAGAATAAATTTCAAAGTTTAGGTTGAAAATGAAGCTTTAAAAAGTGAGGGGCATAATCTCTACTATGTGTATTCCCATTTTGTCCTTCATTCTTCAATATATATCCATTATATCATTGAATTGGTTTTTTTATAAATGAAATTAGATTTAATAATTAGTTATGATTCAACCCACTCATTGTAAAATTTCCACACTTTCATAATTTTAGATTTAAGGGTTAGTTAATGATTTTATTTAATTCTCATCCTTGTGATTGTTATGATGTACACTCACAAACTTTCCATTTAATTTTCACCGTCATTTTGCACTATATTTTCCTATTTTGCAGTTAGCTTTTTTAAACCAAACAAACTCTAAAACTTGAAGTTAAAAGCATAAAATCTCCAAAAATCTCACAAGGTTCCAAAGATCAAAATGTCAACTTTAAAATTAAGCAAAACAAGGAATCATAAAATACAAAAAGGGTTGCCAACAAAGAAATATTTTATTGGATCATAGACAACCTCCTTAAATGATCATAATAAATAAGGGCATGTTTGGTAGACAATTCAGATTCTATTTTATGTTATCAGATTCACAAATTAAACTTTAGTGAATTTGTATTTCATAGATAATCTAAATTATGTTTTCGAAAATTGCTTCCGAATTCTGCGATCTAAACAATACAATTCTGAAAACAACAATTTTATATTTTCAGTGTATCAAGATTTTGAATTTAAAATTTTGAAATGTTGATATATTAGGAAAGATAAAAACAACAATTATACTAGTTCATGCTATAAACTAAATTATTTTTGTTAAATTAAAATATGTATCATATAATATATTGTAATTATACAAAAAAAAAAATTAACACAAAACAAGTTCATAAAATAATTAGTAATAGTATATATTCAACCTAATATTTAGTAACTATAATTAATTTTATTGTTTATAAATATATCATCAAACATATTTTGAAAAAATTATTTAAATTGGAATTTAATTACTAAATCTATACAATTCTGTTTTCATTGAATCCATTGTTTTCAATTACTTTGAATTGCAAACATGGTCAACAAAGTCTTTATCACTACAAGTAATTAAGAGAGTATGGAAGAAGAGGGTGAAAATAGAAAAATGTTAGCTCATATATTGAAATAGAGAAAATTACAAAAAAAAATGGAAGATGGAGATGGAATGGGGTGTAAGTAAACAATTTTCACGTATATACTAATGCTTAAGGGTTTTTTAACTTCTTATATGCATGGTTAAGGCTACTATTTTTTAAAGTTCAACTGTCTTTTTGAATTAAGGTGGTAACATTTCTTAAGAAATAGGGTACTAATTGTTTGTATGTATATACAACGGTGAGAATAATTTTTAATATTTTTGGTTAAAATATAAAGATAAAAGTATTTTTAATCTTTTTTTAGTTCAAATATATAAAGATATTAGCTTTTGGAAAGTTCAGAAATACTTTTTAATCGAATACAAAGTTAACGGGTATTTCTATTTTCTATAATTTAGGCTGTAGAAAACATTATTTTTGCTTTATTCTTTTTATTTATTTATTTATTTATTTATTATTATTATTATGATTATTATTATTATAATAGTGCTGATTTGAATTAAGTATGTGACCATCAATTCAATTATTTATTAATTTTAGAGAAATTATTTTAGATAGTACAATTGCTTAAAATATT
AACAATTAATAGTAAAATATATTGTCTATCTGCGATAGACCGCGATAAACTGAGATAGGCAATGAATTTTTGCTATATTTATAAATATTTTTTTTCATTTTGCTATATTTGAGAAATGCCCTTAATTTTAAGTAGTATATTTAATTAAATGACCCAAATAATTATCCGCTCCCCAAGACTTAGCCTGAAATTTGCATTAAATATGAAATTTTAATTAAGTAGAAATTTATATTTGAATTAAATATGTAATCTAACAATTATTGATAATTATCTTTTAGTATATCAAACGTAAACTTTGACCTCGAAAAAAATATATGTCAATTATTGTTGTAAGTGGCATTTTTGATAAATATAAAATTTAATGAATGATATATACTCCTGAAAAAACTTTTTAAAGATGTAGTTCTTAGTAGATAATAGATTATTGATCATCTCAAATTCAACCGTTTAATTCAAATTTAAATTAAATAGGACAATTTAATTAAATGAAAATATATGATTTGAACCAAATATGTAATCTATCAATTCTTGATTTTTTTGGAAGGAACCTATCAATGATTGATGATTAATTGGTTTGAGTACAGCAAAGATAGGGCATGAGATTTGAAGTTTTGGAATATTGTTCAATTATTGTTGATGCTCATGTTAACTTAATAATTTTGTTTATATATATATATAAAACAATATTATTCAAGAGACAACAGTAAGCCTTTTTCTTACAACTGCAATTTCGATTTGTTTAGTGCACCCAGTGCCAATTGGAGGGACACCATCTTCACTCAAATGACCCCAAATTCTCCTAACCCACAAGACTTGCCTCAAGTTTGCAGGTAACCTCTACTGATCTCTTCCTTCTTTCTCTCTGTTATTTGCCTTTTGTTAAAAATCAGCTAGAGAATAATTCAGATTACAGATCAATAAGTGTTAATAAGTGTTAGAGAGTTCATATAACACACCTTCTTAAATTTTAGGGTTAACTAAAATAGTACATAACATAAATAAAATAAATGAGGTATGTGATAATCTATCATAAAATAATAGCAAACTTAATGCAAACTAAATATATTAAAATAGTGCGAAAAGTCAAAATATTTATGGTGTATTTTAATGTTGTAAGTTTTCCACTAATAGATGTCTATTCATTAATGAAGGAAAAATAACTTAGGTGCATTACATGATACACCTATCTTTAATTAGTGCATCCCATCAAATTATTTAATCATAACTTGTCACATTACAAATTATTTTTACATCTTTCTTCAAATATTTCAACTTTTCTATTGTGTGTGTGATGAGTAGGAGTGAGCAGGGCTGGCCAAAAAATCGAGCCAAACGACCGAAACTGGCTGAATCAAGGGTTGGAGGCGGGTTTTTTTTAAAAAAAAAAAAATCGATCAGGCCAATTTTCTAAAAAAAATAGTTATAAAACGAACCGACCCGACCAAAAATTAGACCGAACCGTCCGACCCCGACCAAATTATAAAATAGTATAACATTTATTTATAATTGAATATACTTTATTATAAATAATATTATAAAATTAACCCACTTTAAAATATTTTTTATTATACTAATATTATATAATATATTTATATCAATAAACCCTTTTTCAAATCTTACTTTTCTTTCCAAATTCCATCAAACTAACCTAATTTTGTATGGTATAATGGATAGGATAGCTTTACTTACTAATCTTGTTTTAAAATTTTCTATTAATATTAATTAAATTACGCTGTAATTTTGTAATTTGTATGACTATTTTATTGTATTATTTTTAATTGATAGTCAAAAGACTAATTTTACCAACTTTCTTAAAAGAAAAAAAAAGGGTTAAAATCGAAACCGACCTACCAACTTCGAAACCGACCAGTTGGTTTAAGCAATATATTTTGAGGGTGCGGTTCTCAAATTTTTAAACCGAATAGGGCGGTTTCGGCGCCAGTTTTCATGAAAAAATGACCCCACTAGACCGATGCTCACCCCTAGTGATAAGAGTATGA
TGCATAAAAATAACTATATCTTGAAATGCACTAAAGTATTACTTTTAATAAATTGTTTTACTCTGTTTATAATTTATTTTTTTTAGACAATATTAAAAAAAAAAAAAAGACTGATTTAGGGTAGTTGTGAGCAAAATATAATTTTAATAATATGAATTTAAAAGAAAAAAGAAATCAGATTTAGAGTAAAGATTTGCACTATTGACAAATTTAAGGTGGTTAGAAAAAGGAGAAAGTGAAAAAAATATGGGTAAGTTGTATATGAGTTTTCTCTTTCATCAATCTCCACGTGGTGAAGACAATAAACATGTGCATGGTACCATAATATGAAATTTCTAAGAGTTTGTCATTTTCGTCTATTCAACATAGGAGTGAATAGATATTTTTAATAGATCTAAAAAATCTATTTAATATTAAAAATAAGCAACAAAATATCATTAACATTAATATAATATTAATAATAGACACGTTGACTTATTTAAATACCACAATGTTTATTTATATATTAAATTTTCTTTTGAAATTTTTTATAATTTTTCTAAAATTAAGATCTCAATATTTCCATCAACATTGACATTTTAAACTTTGTATAGAACACGAAAATATTTTAATATAAACTTATATGTACTAAAAATTTATATTTTTAATACATTTTTATTTTACTTTAAACATACTATAAACTTTAATTAATTTTCTCCATGTCAAATTTTGAATACAATATTAATGAGGTATCAAATTATTATTTTGTTGAAATATGTGTAACCTCCCAGGGGTCATATAACTTTGTTAGGAATCGATTGAATTAAGAATATTTTTGAACAAATTAGAGTAAAGAACTTTGGTGTTGTTTTTATTTAGATTGGTGAAAAAGGTTTTTGCTGCAGAGACATCCTAATTGATTATTCAAAACAAATGGAGAAGGTTGGAGAATTGATATTTGGATTGCTTTCGGAAGCCCTCGGTCTAAAATCAACCCATCTGGTGGACTTAGATTGCAACCAGGGACATGCCATTCTTTGCCACTATTATCCGCCGTGTCCTCAGCCGGAACTCACCATCGGTACTACCGAGCACTCCGACGATACCTTTATCACCGTGCTGCTGCAGGATCATATCGGAGGTCTTCAAGTGCTTCATCACAACAAATGGGTGGATATTCCGCCAGTTCCCGGAGCTTTTGTTGTTAATGTCGGCGGCTTATTGCAGGCAAGGCTTCAATATGCATTGTTGCATTCATTAATATACTCTAAACTTTTTAAAGCTTCTAAAATACCATTTTGAAAAATTTCAAAATACCCTTGCGAATCATGAATTTTTCAAATTATAAAAAATATCCTAATTTCTTTTTTTTTAAAAAAAAATGTTAAAAACTACTCTTAATGTGTTAGTATATGAACAAACAAACAGTTAATACCTCGTTATAAACATAACTCTAGAATTTCATTAAAAAAGTTAAAAAGAATTAATTTTAGACCTCCTCATTTTTGGGTTCTAAAAGAATCCATTTTAAAACAAATCATATAGCTAGACATTCTCATTTTTTTCATATAGATACTCTCATTTTCTCATATTTATCCAATTCTTACATCAATTTTTTCAGTAATTAAAATTTTAAGTTAAAACATAAAATGTGACAAAATTCCAAAATCAAAATCTCTATTTAAAATATATCAAAACAATAAATAACCAAATACAAAAAAATATATTTGGCTATTAACACACACTAACGTATAATCTTGTTAAAATATTCTTTTTCTTCTTCTTATAGTCTAATTGAAGTGGGCAATTTTTATTTATTTATTATTATTATTTTGAGATCAACAAATGTCTAACCGTTAACAAAAAGGGAGGTAAAAAGTATTGAAGTAGAGTGGGAGAAAAAAGAGGAAAAATAGAGAAGTCAAAGGGTATTAATGATTTTTGTTCATATATTAATAGTTAAGGGTGTTTTTTTGAACTTTTTTTTAGTTGAAGGGTATTTTTGAAACTTTTGAAAGTTC
GAGAATATTTTTGGCACAAAACACTAACGGTTTCCATCCAAAATTAATGGTAAGGGCATTTAAAAAAAAAAATTTTGAAAGTTTAAGGTGTTTTTTACCCAAAATATAAAGTATATGGGTATTTTAGATAATCTAGCCTCAATTGTATTGATACAAATAAAAAATATAATGATGGGTTTGGGTTCCTGTTATGGAAGCAGTTGATTTCGAATGACAAATTGGTGAGCTCGGTTCATAGAGTGTTGGCCAATCGTGAGGGTCCAAGAGTATCGGTGGCATGTTTCTTCACTACTGGCGCTATACCAACCTCCAAACTTTATGGACCCATCAAACAATTGTTGTCCCAACAAAATCCTCCAAAGTATAGACAAATCACTGTTAGAGAGTACGATCACCTTCATGCTCAAAAAGGCTTGGATGGAACGCATGTTCTCACACATTTTAGGCTTTGATTTAAACCAACTCTTCTTTGTTTCCCAATTCCAAGGTTTGTTAAATGTTTGAATATTTGTGTTAAGTTTAGGTTTGTAAGGTGTGTTATTTTTAATTTGTAAAACATGGGGTTTTTAAAATGTTGTTTTCTGCGCTGTTGTAAAATTATTTAAGTCTAGATCGTTTGTTAAACATTTGATTGTATTAAGTTTGAATTTATGTTGTGTGATTTTTACTTTGTTCAACTATTTAACTATTGAACTTAACTTTATTTTACAACTAATAAAGTTTTCTTAAGTTTATTGAACTTAACGTTTGATTGTATTAAATTTGAATTTATGTTTATTTATTAATTTCGTCAAAATGACTGTTGAACTTAACTTTATTTTACAATTAACACATAATGATTATAGAGTTAAAAACATTTTAAATAAACTTTATTTTAAAAGCCATTGACCGACAAACAAACCAACCAAAAAAAAGATAGTTTACGTTAAATTACAAATTTGGTGATTAAGAAAGTTTGAATTTAGTATTTAATACAATTAAAAAAAAATCTATAGATTGTTAAGGATTTAATTAAAACAAAACTCCACGGCTAAAATAGTATAGATACTTTAATCATTGAATTTAATTGAGAATGATAGGATAATTAGTATATTTGAGTTTTTTTTACTTATCAAATGAGCCCTTAGTTTTCAAAATTTATCTTGGTTTTTGAAGACACGAGGAGGAAATTGATAACAAAACATTAAAATTTATGGATGAAATTTGTGTTCATAAACTTAATTTTTAAAAATTAAAAATTGAAAAACTTAGGTCTCGTTTGGTGACCATTTCGTTTTTTGTTTTTGGCTTTTGAAAATTAAGACTATCTCCTTCTCATTTCTTATAATGATTTGCATATTTCTTGAGTAAAATGGATGAGTTCTTAGCTAAATTCCAAAAATAATACAAATTTTTAAAAGCTACTTTTTTTAGTTTTCAAATTTTAGTTCGGTTTTTTAAACCCATTGGTAAAAAGTAGATAACAAATTAAGAAATTTGGAGGTAGAAGTAGCTTCTTTAGACTTAATGTTTAAAACAAAAAACCAATAACCAAACTGTTACCAAACGGGGCCTTAGGTTTCATTTTGATAATTAACTATTTGGTTTTTAAAATTTGTATTTATGTTTCCACACTAATTCATTAGAATAATTTTCATTATTTGTAGACAAACATGTGAATTCTTAGTCAAATTCTAAAAAGAAAAACAAGTTTTTGGCAACTTCTTTTTCTAAACTTGATTTGGTTTTTGAAAACACATAAATAAATAAATAAATAAGAAGAAGAATAGTAAATTACATAACAAAGAAATTCATGGGCATAAGTAATAATTATAAGCTTAATTTTCAAAAATTAAACCAAGCTTAAATGGTTGTCAAAATATAAACTTAATAAGCAATATAGATTGCATGTTTGAAAATGATTCTGAAATTGCTAAAATCACTTTTGTTATATTCAAAATCACTTCAGAACATGCTTTTAATCACTCAAAATTAATTTAATATTTAATTTTAAATTTTGA
ATGCAATTTTCATATCATTAAAATTAGTTTTGAATAATTAATGTTAAAAAACACATTAGGTCCCTAAACTTTCATGAAAGAAATAATTTAGTCCCCGAACTTTGATTTATAACTATTTAGTCCATGTACTTTCAATTATGTAACAATTTAACGCTCGACTTTAGTATGTAATAATTTTGCTCCTTTACTTTCAAATTTATAACAATTTAGTGTCTAATGTAAAAATATATTTTTTTAATCAAAATTAGATATCAATTTTTATTATTTTATGATATGTACTATATATTTTATAAAAATATTGACCTTCTAATTAGTTTATCAATTTATTTATGCAAAAAATTTCATTAAATCTTTAACATTAATTTTTATAATAGAACTAAATTGTTATAACTTTTAAAGTATAATAATTAAATTGTTACATACTAAAATTTATAGACTAAATTATTATTAAATTGAAAATTAAGAGACCAAATCATTACAAATCAAAGTTTAAATACTATATTATCAATTTTTATGAAAGTTGAACAACTTTAAAAATATGTTTGGAATTGGTTTTAAAAATATCAAAAATGATTTTAACTCAAACACTCAGATTATTCTCAATTTGAGCTATCACATGATCAAAGTTTTTTAAATGAAGCGAATAATGTAAGAACAGCATTCACATTGTGTCAAATGGGGCATGTAAGATCAAGCTTTTGGATCCAACTCCTCAGAATTCGAAGTCAGAAACAAAAAATGGCCAACCTCACACCCTTCTCCAAACTTGACCAAACCTTCGACAGAGCTTCAGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCTGGTCAAAGTCCTTGTGGACTCCGGCGTCGCAGAGATCCCAAGAATATTCTACCGCCCACTCAAACACGTCTCTAATTCCGGCAAGACCTCCCTTCCCGACGAACCCCATTTGGGTGTTCCGGTGGTGGATCTGGAAGACATCGATAAAGACCCCTTCAAACGAAGAGAAGTGGTGGACAAAATCCGAGAAGCTTCAGAAACGTGGGGGTTCTTCCAAGTGCTTAACCATGGGGTTCCGGTGAGTGTTCAGGAGGAGATAATAAATGGGGCACATCGATTTTTCGAACAGGATATTGAGGTGAAGAAACAATATTATTCAAGAGACTACACTAAGCCTTTCGTTTACAATTGCAATTTCGATTTGTTTAGTGCACCCAATGCCAATTGGAGGGACACCATTTTCACTCAAATGACCCCAAATTCTCCTAACCCACAAGACTTGCCTCAAGTTTGCAGGTAAACTCTACTCTGTTCATTCTCTGTCTCTGTTCCTTTTAATGTTAACTAAATAACAAGTAATAGTATATTTGCTAACTTAATATACCATTTTTATGTCCATTAACGATAGATGTATATATTATTGGACGGTGTAGGAATTGTTTATAAATAACAAATGTAGATTACTGATGTATGAAAATTATAGAAAAATGTCTTAATATGTGCATAAATTTTCTGATGAACTCACGGTCAAGTTCATGGCTAATTTAATTTAAAATAAATGACTGATTTAGGGTAGTTTTGAAAAAAAAAAAGAAAAAAAGAAAAAAAAAAAAAAAAGCAAACAAATTTCAGATTTGGGTAGAGATTTGCGTTATTGACAGATTCGGGTGGATTTTATTTTATTTTATTTAAAAAAAACGTGTACATGGAACATTTGAATATATTTTTAATTCATTTTTATTTTACTTTTAACATACTAACAAAAATAAACTATTTCTTCCAGAAAAAAAAATGTGAACATTTTATAATACAATATCAACGTGTTATTAAGATATTATTTTGTTAAAATTGTTGTGACCTCCTAACGTGCCATTTGAACATGTCATTTAAACAATTGTCAAAAAATTTAAACTTTCATCTGAATTAATTGAATCAAGAATTTTAAACACATTAAAGTAAAGAATTTTGGTGTTATTTGTGGTGATGGGTGATAAAGATTATTTTTCGTTGCAGT
GACATCCTAATTGATTATTCAAAACAAATGGAGAAGCTTGGAGAAATAATATTTGGACTATTTTCGGAGGCCCTTGGTTTGAAACCAACCCATCTCATAGACTTGGATTGCAATGAGGGACATGCCATTCTCTGCCACTATTATCCGCCGTGTCCTCAGCCAGAACTCACCATTGGCGCCACCGAGCACTCCGACAGTAGCTTCATCACAGTGCTGCTGCAGGACCATATCGGAGGTCTTCAGGTGCTTCATAACAACGAATGGGCAGATATTCCACCAGTTTCCGGAGCTTTGGTCGTCAACGTTGGCAGCTTATTGCAGGCAAGAATTTATTATGTATTCATGAAGTAATTAAATTGTAATTGATATACGTAGAAAAAATAATGACGGATTGGGGTTCGTTTTATGGACGCAGCTGATTTCGAATGACAAATTTGTGAGCTCAGTGCATAGAGTGGTGGCCAATCGTGAAGGTTGTCCAAGAGTATCGGTCGCAAGTTTCTTCACCACTGGCATTATTTCAACCTCCAAACTTTATGGACCCATCAAACAATTATTGTCCGAACAAAATCCTCCCAAGTATACACAAATCACAGTTAAAGAGTATCGTCTCTACTATACTCAAAAAGGGTTGGATGGAACGCATGCTCTCACACATTTCAGGCTTTGATTTACTCAACTGTTTTTTGGTTCCCAATTCCAAGGTTTGTTAAATTTTAAATTGTGTTAAGTTTGAGTTTATTATGGTGTGTGATTTTTAATTTGTAAAACATGGGTTTATGGAATGTTATTTCCTAGCTCTCGGTAAGCTTTGTTATGATCAACCTCAATTGTTAATGTTTTGTTATATTTTTAAAATATATATTTTCATTTTGTAAGGGTGAAATTTGAACCTCCAATCTTTAGTGACGAAATGCATTCGCTACCACTAAGACAAGCTCATTTAAGCTAACTTTAATATACAATTAGTCCACATGGTTATAGAGTTTTAAAATTTTATTTTAAAAGCTATTGACCAACAAACAAAACCAACAAAAATAGTTTATGATAAATTACAAATTTGGTATTTATGATGAGGAGAAGTTCGAATTTAGTCTCTATAATTTTAAAAAGATAGAATTTAATCTAAATGGTTTGATAAAAAAAACCTCATATGTTTAATTAATTAGGCTAATATGTTTTTTTTTTTTTTTTCACACTATTACAAGGAAAGGAGTTTGACAAAATTGTGGATTTGGGGAAATTGATGTATTAAATCGGATAATTTGTATAATTGGTTTGTACATTTCAAATTCACAAGAATATGAGGAATCACTCTCAAAAAGTCTCCATATATTTAAAATTTGTTTTCTCATACCTGGAAAATCTGTTTGACAACACTTCTTTTAGGACCTCTAAATTTACGTTGAACGCATAATTTACCATATGTTATATTCAAAATATAAATTTCACATTTCAAATGTTAATTTATTATTCTATTGTATATTTAATATATCAATTATATTAAACATTTTATTTAAATGTTTCAAATTTGTATATAAACTAAATTGCTCCTTCGTGTTACAATAAGTTGTAGAGATGGAGAGTTTACTAACAAACAAACAAGCCGAGAGGGGAGTGAAAGGAAATTGGAAGAGGGGAGAAGAGAGAGAGAGATAAAGATAATAGATAATTTATTTACACTATTTTATAATTTAAAATGTCTAAATGATTTTGACATGGTATTTCTCTCATTTATCTCGACTATTTACATGTATGTACTTTTATAAACTTTTAACAACAAGGAAGTTTGAAATGAATAGTAATAATTTTTTTTAGTACAACAATAGTGAGATGAATGATAGGTTCAATTCCCTATCTTTACTATTGTTGTAATAAAAAAGAACCCACCACACGTTAAAATATATACTTGAATAATAAAGAAAAGGGTGATGGAGAAGTAGACCTTTTTTTTTCTAAATATTCTATTTTTTAAAACTAATTAAAAAAATTATAGTGAC
TTCAAAATTATTTTTTAAAATGTGTTTAATATTATATATTTATAAATTATAAAAATAGTAAATTATTATTAATTTATTTATGAATATAATTTATGTTAAAAGAATTTTATGAAATTATTATTTTTTAATATTAATATAATTTATAATACATTATGTAATATATATTATATTTTTTAAAAAAATGAATTGAGTTTATTTCTAAATGTTTTTATCTTTATATTTAATATAATTGTGATTTTTAAAATTTAAAATTTGGATGAAATGAAAATATTAAAATATTATTTACAAAATTTGCATTGTTTGGATCATAGAATATGAAAACAATTTTTAAATAGAATCTGCATTGTCTGTAAAACACATTTGATGAACTGAAACATAAACAAAATCTAAAATTTTTTACCAAACCAACCCTAAATAATCTTCCTTTGGATTCACATTGCCGTTGGGCACAGAGCCTTCTAAACTTCCACTCGTGAATGAGTATACACAGTCACAATAGGAAATTAGAATATCAATAAAATATATACCATATTTTCAGGCCTCAGTTTCTCACCTATGAGAATTTTTGTCGAATTTGATTAAACAAGTTGATCCTAATCTCCGTATCAAGAAGGAAGTTAATTAATATCGGTAAAAAATTTCAAGCATAGTTCTGTTTTAGGCCATTGACGATCAATATCATCCAAAATGGAGGAATTTAATAATCTTACAAATTGGAAAAGGGGATATCCGTTAAAATAAATAAATAAATAAATAAATAACTCACAAAAAATTAAAAGGTCAAATCAATGTAAAAACTAGAAGGCACTGTGTGCATTGACGGATCCATGTAGGGTGGGAGACGCCTCCCTCTAATTATCAGCTTTTGTATTTTTATGAATAATAGAACATTATTATTATTATTATTCTTTTTATATTTTTTTTTTATTTCGAGTGTGTCTCTTTGCATATAAAATTGGAAATTTTGTAGGTGTATTACACACATTTAAAATTTCTAAAAAAAAAAAAAATAAGTTTCTGAATAAATATTGGTGATCCGCCACCGACCGTGAGGGAAAAGAGTTGTAAACTATGTTCTCATGTTATCGTATTTGTCCTCTCACATGATCGGATTTCTTTGAGTGGAGCCGACAATGGATAACCAACCATTCACTTCCTCACCCCCAAAAATTTCTCTCCCATTAGGATAGATTGGGCATTCAAATTCGAGGTCGAAACAAACATAATGGCCAACCTCACACCTTTCTCTAAACTTGATGAAACCTTCGACAGAGCTTCGGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCGTCAAAGGCCTTGTGGACTCCGGCGTCGCCGAGATCCCAAGAATATTCTACCGCCCACTCAAACACGTCTCTAATTCCGGCGAGACCTCTGTTCCCCACGAGCCCCATTTGGGTGTTCCGGTGGTGGATCTGGAAGACATCGATAAACACCCCTTCAAACGCAGAGAAGTGGTGGACAAAATCCGGGAAGCTTCAGAAACGTGGGGGTTCTTCCAAGTGCTTAACCATGGAGTTCCGGTGAGTGTTCAGGAGGAGATCATAAATGGGGTGCATCGATTTTTCGAACAGGATATTGAAGTGAAGAAACGATATTATACCAGAGATAACACTAAGCCTTTCGTTTACAATTGCAATTTCGATTTGTTTAGCGCACCCACTACGAATTGGAGGGACACCACCTTCACTCAAATGACCCAAAATTCTCCCAACCCACAAGACTTGCCTCAAGTTTGCAGGTAAACTCTACTCTGTCCATTCTCTGTCTCTGTTCTTTTGCCTAATTGCGATAGCTAGCAATGCCTATATGCACCATATCAATAGATTAGAAATGGTATAGTTACTAATGACCCCTTTAACACTTAATATATCATTTTTATGTCTATCAATGATAAATATGTATGGGTGGTAGATGTATATTATGGAAAGGTGTGTGAATTGTTTATGAGTAACAAAAGTAGATTAGTGATGT
ATGAGAATGAAAGAGAATTATATAAAAATGTCTAAACATGTTCATAATTTTTTAAAAATGCTATAGATTTACAATTAATTTTATTTGTTAAAACAATTTGAAATTTGAAGATTCCAGCCGTAGCTACTGAACTGTAGCAGGAACAACACAGATAAGCATCATCCAACCTTCTTTCTACCAACAAACTTGAAAATCCAAGAAAACAAATAACCCTTCTCTTTCAACATTTTGGAGGGCTTTGTGGTGTTTGGATTGGCGTTATTCAACTGTTTATCCAATGTCCTATAACTCCCTTAGCAAAGTTACAATGACCCTCCAGTCATTGTTCATCCAAACTAACATCTGCTCTTCTCACAACTCCCTCGATGTCTCTCTCTCAAAGAACAAGTTTCGGATTTCTGCTTCAAAGGTGGAAAATTTCAAAACCCACTTTCAAAAATTAAGCCCAAACTCGCTGACTCAAAGCTGGCATCGCCTTCTCGCTAGAGTACTAGTGACGATGTTATGGTTCTTGTCAGGTATAATCTACTCTGCTCCTTGTGTCTTCCTACCTCTTTTTCAACACTAATCTGCTCGTCAAAATTGCCTTTTTTGGAGTCCAGAAGTTCCATCGTCGTTCTCGAGCTAGGCCTTCCAATAGATTTGAAATTGAAAGTAAATCTAAGGATATAAATATAGGGAATAGGTATTTAAAAACACTCATATTTATTGTTTAAAAAAAATGTCATCTGGTGAATGAAAAATGACCCATCTGCACACAATTTGCCAAAAATCTAATTAATTCCAAATTCTACCAAATATCCAAACAAAACTAAAGACTTTGCTAATACAAGTTTTTTCTTCTACATCAACTTTCAAATAGATGTTGGTGCCGTTAATGAAGATAATAAACGTGTACATGGAATACGAATATATTGTTAATGTACATTTATATTCGTATTTTTAGTACATTTTTATTTCAGTTTTAACATACTAACAAAAATCAACTATCTTCCTCAAATAAAAATTTCAAAATTTTATAACACAATATCAACGTGGTGATTGGTGATAAAATATTTTTGTTGCAGAGACATCCTAATTGAATATTCAAAACAAATGGAGAAGGTTGGAGAATTGATATTTGGGCTACTTTCCGAGGCGCTTGGTTTGAAATCAACCCATCTCCTAGAATTGGATTGCAATGAGGGACATGCCATTATGTGCCACTATTATCCGCCGTGTCCTCAGCCAGAACTCGCCATCGGCACCACCGAGCACTCCGACAGTGGCTTCATCACCGTGCTGCTGCAAGACCATATCGGAGGTCTTCAGGTGCTTCATCACAACAAATGGGTGGATATTCCACCAGTTCCCGGAGCCTTAGCCGTCAATGTCGGCAGCTTACTGCAGGCAAGGCTTTAA

mRNA sequence

ATGGCGGTTTCAGCTTCCGTAGTCGATGGATTCATCCTCACTCCGCTATCCAAAGCCGACGAAAACTACCACAGACCCACCGAAATCAAAGCTTTCGACGATACTAAAGCCGGCGTCAAGGGCTTAGTCGACGCTGGAATTAACGAAATTCCACGCATCTTTTACCAGCCACCGGAAGATTACTACTCCGACAATATTTCCGGCGAAACCCAGTATCAAATTCCAGTGATCGACCTCGATGACGTCCACAGAAATTCACTCAAACGGAAAGACACCATAAACAGAGTCCGAGAAGCTTCAGAGAAATTGGGTTTTTTCCAACTGATTAACCATGGGATTCCGGTGGGCGTTCTGGAAGAGCTGAAAGGTGCGGTCAAGAGATTCAACGAACAAGACACGGAAGTGAAGAAACAATACTACACCCGGGACAACACCAAGCCTTTGATTTACAACAGCAATTTCGATCTGTACAGCGCATCGACCACGAATTGGAGAGACACTCTTGGGTATATAAGTGCCCCTAATCCTCCCAATCCGCAAGACTTACCCGAAATCATCAGAGATAATCTGGTGGATTACTCGAAAAGAGTAATGGAAATTGGGAAATTGTTGTTTGAATTGTTGTCGGAAGCTTTGGGGCTGAACCCAAATTACTTGAACGAAATAGGCTGCAGCGAGGGGCTGGCAATTGGGTGCCATTATTACCCACCATGCCCACAGCCGAATTTGACACTGGGCACATCCGAGCACAGTGACAATGTTTTCATCACTGTTCTGTTTCAAGACAACATCGGCGGGCTTCAGATTCGACACCAGAAAAAGTGGGTGGATGTGCCACCTGTCGCCGGAGCGTTGGTGGTCAACATCGGAGAGCTTATGCAGTTAATAACAAACGACAGATTCATAAGCGTGGCGCATAGAGTGTTGGCAAAGAAGGAAGGACCGAGAATTTCAGTAGCGAGCTTTTTTTCGACATTGGCTTATCGGAGCTCAAAAGTGTATGGACCCATAAAGGAATTGTTGTCGGAAGAGAATCCTCCAAAGTACAGAGAAACCACAATCAGAGATTTCCATATGCTGTATCGTGCAGATGGGCTTGGGACTTCGAAAGCTAAGAGATCGACGATGGCGGTCGCAGCCTCAAGAGTCAACGGAGCCAATCTCACTCCGCTATCCGAAGCCGACGAAAACTACCACAGACCCACAGAACTCAGAGCTTTCGATGACACCAAAGCCGGAGTTAAAGGTTTAGTCGACGCCGGTATTACCGAAATTCCTCGAATTTTTTACTTCCCACCGGAGGATTATAACTCCGACAACGCAACCGTGGAAATCCAAATCCAGATTCCGGTGATAGACTTCGACCACGTTGGCAGAAATTCACTCAAACGCAAATACACCATCGACAGAATCCGAGAAGCTTCTGAGAAATTGGGGTTTTTCCAACTGATTAACCATGGGATTCCAGTGAGCGTTCTGGAAGAGATGAAAGATGCGGTTCGGAGATTTCACGAACAAGAAACGGAATTGAAGAAACAATATTATACCCGCGACCTGACTAAGCCTTTGATTTACACCAGTAATTTCGATCTGTATTCTGCCGCGACCACCAATTGGAGAGATGCGTTTAGATATGTTAGTTCCCCAAACGCCCATGATCCGCAAGTCCTGCCAGAAATTTGCAGAGATATTTTAGTGGAATACTCGAAACAAGTGATGGAGATTGGGAAATTAGTGTTTGAATTGCTGTCGGAAGCTCTGGGTTTGAATCCAAATTACTTGAATGATATAGACTGCAGCGAGGGGCTTGCATTTGTGTGCCACTATTACCCACCATGCCCACAGCCAAATTTGGCCATCGGCACATCGGAGCACACTGACAATGGCTTCATCACTGTGTTGTTGCAAGACCACATCGGTGGCTTACAAATTCGCCATGGGAACAATTGGGTGGACATTCCCCCTGTCGCCAGAGCTTTAGTGAGATCGACAATGGCGGT
CGTAGCCTCAAGAGTCAACGCACCCAATCTCACTCCGCTATCCAAAGCCGACGAAAACTATCACAGACCCACTGATCTCAAAGCTTTCGATGACACTAAAGCCGGCGTCAAAGGCTTAGTCGACGCAGGAATCACCGAAATTCCTCGTATCTTTTACCGCCCACCGGAGACTTTCGACTCCGACAATATTTCCGGCGAAACCCAAATTCACATACCAGTGGTAGACCTCGACCACATCAACAAAAATTCACTCAAACGCAAGTACACAATCGACATAGTCAGAGAAGCGTCAGAGAAATTGGGTTTTTTCCAACTGGTTAACCATGGGATTCCGGTGGACGTTCTCGAGGAGATGAAAGATGCAGTTCGGAGATTTAACGAACAAGAAACGGAATCCAAGAAACAATATTACACTCGTGACCTCACCAAGCCTTTAATTTACAACAGTAATTTTGATCTGTATACTGCAGCCACCACCAATTGGAGGGATACCTTTGGCTATATAAGTGCCCCAAATTCCCACAATCCTCAAGACCTGCCGGAAATTTGCAGGGATATTCTAGTGGATTACTCGAAACGAGTGATGGAGATTGGGAATTTACTGTTTGAATTGCTGTCGGAAGCTCTGGGTTTGAATCCAAATTATTTGAAGAACATAGACTGCAATGAGGGGCTTGCACTTGTATGCCACTATTACCCACCATGCCCACAGCCGAATTTGGCCATCGGCACATCGGAGCACACCGACAATGACTTTATCACTGTGCTATTGCAAGACCAAATCGGCGGCCTTCAAATTCGGTATGAGAACAAGTGGGTCGACGTGCCCCCAGTCGCCGGAGCTTTAGTGGTCAACATCGGAGATCTTATGCAGCTAATAACAAATGACAAATTCAAAAGCGTTAAGCACAGAGTTCTTGCAAACAAGAAGGGTCCGAGAAGAAACCACAATCAGAGACTTCACTATTCAATTCCCTTCAAAAGATCGATAATGGCGGTTGCAGCCTCAGAAATCAACGGACTCAACCTCACTCCACTATCCAAAGCCGACGACAACTTCCACAGACTCACTGAACTCAAAGCTTTCGATGACACTAAAGCCGGCGTTAAAGGCTTAGTCGACGCCAAAGTTACCGAAATCCCCCGTATCTTTTACCACCCACCGGAGGAATATGACTCCGTTGAAACCCAAATCCGTATTCCACTGATAGACCTCGATGGCGTCGGCAAAGATTCACTCAAACGCAAACACATCGTCGATCAAATCCGTGACGCTTCAGAGGAATTGGGTTTCTTCCAAGTGATTAACCATGGAATTTCGGTGAGTGTTCTTGATGAAATAAAAGACTCTGTTCGGAGATTTCACGAACAAGACACGGAAGTGAAGAAACAATATTATACTCGAGACCTCATGAAGCCTTTTATTTACAACAGTAATTTCGATTTGTATTCTGCACCCACCACCAATTGGAGGGATACTTTTAGCTATGTTAGTGCTCCAAATCCTCCCAATCCACAGGAATTGCCGGAAATTTGCAGCAGAGATATTCTGGTGGACTACTCGAAATGGGTGATGAAGATTGGAAAATTAGTGTTGGAATTGCTGTCTGAAGCTCTAGGTTTGAATCCCAATTACTTGAACAACATTGGGTGCAGTGATGGGCTTGAATTTGTATGCCACTATTACCCTGCATGTCCACATCCAAAATTGACCACTGGGATATCCGAGCACACGGACGCTGACTTCATCACGGTGCTGTTGCAAGACCACATCGGCGGCCTACAAATCCGTCATCATAACAATTGGATTGACGTGCATCCTGTCGCCGGAGCCTTAGTGGTCAACATCGGAGACCTTATGCAGGTTACCAATGCCGACGAACATTTCGATAGAGCTGCCGAACTGAAACTCTTCGACGACACAAAAGCCGGAGTGAAAGGCCTTGTCGACAGCGGCATAACCCAAATACCGCGAATATTTTACCGACTGCCTGATTCCG
GCGTATCCCCTGTTCCCGGAGACACCGAACTGAGCATTCCAGTGATAGATCTCGAAGCCATCGACAGAGATTCATCCAAACGAAGAGATGTCGTCAACAAAGTCCGAGAAGCCTCTGAGAAATGGGGTTTTTTCCAATTGGTCAACCATGGAGTTCCGGTGAGTGTTCTGGATGAGATGAAAAAAGGGACACTAAGATTTTACGAACAAGATACCCAATTGAAGAAGCAATTCTACACCCGCCACAACACAAAATCCATTGTTTACAACAGTAATTTCGATCTGTTTACTGCACCAGCCGCTAATTGGAGAGACACATTCCTCTGTTTCATGGCTCCCAATCTTCCAAATCCACAAGACTTGCCTGAAATTTGCCGAGATATCCTGTTTGATTACTCAAAAGAAATGAAGAAATTGGGGAGAATATTATTTGGGTTGCTTTCGGAGGCTCTTGGGTTGAACACAAATTATCTGAGCGACATCGAATGCGACAGGGGACTAGCTGTTCTGTGTCATTATTATCCAGCATGTCCACAGCCAGAGCTCACTTTAGGCACAACCGAGCACGCCGACAATGATTTCTTGACGGTACTTCTACAAGACGACCAAATCGGAGGTCTTCAAGTTCTTCACCAGAAGAAGTGGATTGATATTCCCCCAATTCCCGGGGCCTTGGTCCTTTCCCATTTTGTAATACTTAGCATTGAATTATTTTTGCAGCTGATTTCGAACGATGGGTTCAAGAGCGTGGAGCATCGAGTGCTGGCGAATCGTGATGGTCCGAGAGTTTCAATTGCGAGCTTTTTTGGTATCGGCGTTTATACAACATCTCAAGTTTATGGACCCATAAAGGAATTGTTATCGGAACAAAATCCTGCAAAGTATGGAGAAACGACGCTCAAAGACTTCTATTTCTATCACAACAGCAGAGGCTTGAATGGAACTTCTGCTCTGCAACATTTCAGGCTCAGTCTGGACGATGAAGGAGATGCTACGCCCATTAAGGATTGTTTCATTCAAAGCTCCAAACAAAACAAAATGGCCAACCTTACACCTTTCTCCAAACTTGACCAGACCTTCGACAGAGCTTCGGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCGTCAAAGGCCTTGTGGACTCTGGCGTCGCCGAGATCCCAGGAATATTCTACTGCCCACCCAAAGAACATTCCAATTCCGTTCCCGAGGAAACCCATTTGGGTATTCCGGTGGTGGATCTGGAGGACATCGATAAAGATCCCTTCAAACGCAGAGAAGTGGTGGGCAAAATCCGAGAAGCTTCAGAGACCTGGGGTTTCTTCCAAGTGCTTAACCATGGAGTTCCGGCGAGTGTTCAGGAGGAGATCATAAATGGGGTGCATCGATTTTTCGAACAGGACATTGAAGTGAAGAAACAATATTATACAAGAGACAATACAAAGCCATTCGTTCACAATTGCAATTTCGATTTGTTTAGCGCACCTGTAGCCAATTGGAGGGATACTTTCTTCACTCTCATGGCCCCAATTTCTCCTAGCCCCCAAGACTTGCCTCAAGTTTGCAGAGACATCCTAGTTGAATATTCAAAACAAATAATGAAACTTGGAGAATTGATATTTGGACTACTTTCAGAGGCCCTTGGTCTGAAATCAACTCATCTCGTGGACTTGGACTGCAACGAGGGACTTTCCATTCTCGGCCACTATTATCCGCCGTGTCCTCAGCCTGAACTCTCCATCGGCACAACCGAGCACTCTGACAATACCTTCATCACCGTGCTACTGCAGGATGGTATGGGAGGCCTTCAGGTGCGTCAACACAACAAATGGGTGGATGTTCCGCCAGTTCCTGGGGCCTTCGTTATTAATGTTGGCAGCTTATTACAGTTGATAACGAATGACAGATTTGTGAGCTCGGAGCATAGAGTGGTGGCGAACCGTAAGGGTCCAAGAGTATCGGTGGCAGGTTTCTTCAGCACCGGCTCTCTACCAACCTCC
AAACTTTATGGACCCATCGAAGAATTGTTGTCGGAACAAAATCCTCCCAGATACAAAAAAATAAGTGTTAAAGAGTACAATCTCTACTTTGCTGAAAAAGGAGACAACAGTAAGCCTTTTTCTTACAACTGCAATTTCGATTTGTTTAGTGCACCCAGTGCCAATTGGAGGGACACCATCTTCACTCAAATGACCCCAAATTCTCCTAACCCACAAGACTTGCCTCAAGTTTGCAGAGACATCCTAATTGATTATTCAAAACAAATGGAGAAGGTTGGAGAATTGATATTTGGATTGCTTTCGGAAGCCCTCGGTCTAAAATCAACCCATCTGGTGGACTTAGATTGCAACCAGGGACATGCCATTCTTTGCCACTATTATCCGCCGTGTCCTCAGCCGGAACTCACCATCGGTACTACCGAGCACTCCGACGATACCTTTATCACCGTGCTGCTGCAGGATCATATCGGAGGTCTTCAAGTGCTTCATCACAACAAATGGGTGGATATTCCGCCAGTTCCCGGAGCTTTTGTTGTTAATCAGTTGATTTCGAATGACAAATTGGTGAGCTCGGTTCATAGAGTGTTGGCCAATCGTGAGGGTCCAAGAGTATCGGTGGCATGTTTCTTCACTACTGGCGCTATACCAACCTCCAAACTTTATGGACCCATCAAACAATTGTTGTCCCAACAAAATCCTCCAAAGTATAGACAAATCACTGTTAGAGAGTACGATCACCTTCATGCTCAAAAAGGCTTGGATGGAACGCATAAACAAAAAATGGCCAACCTCACACCCTTCTCCAAACTTGACCAAACCTTCGACAGAGCTTCAGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCTGGTCAAAGTCCTTGTGGACTCCGGCGTCGCAGAGATCCCAAGAATATTCTACCGCCCACTCAAACACGTCTCTAATTCCGGCAAGACCTCCCTTCCCGACGAACCCCATTTGGGTGTTCCGGTGGTGGATCTGGAAGACATCGATAAAGACCCCTTCAAACGAAGAGAAGTGGTGGACAAAATCCGAGAAGCTTCAGAAACGTGGGGGTTCTTCCAAGTGCTTAACCATGGGGTTCCGTGTTCAGGAGGAGATAATAAATGGGGCACATCGATTTTTCGAACAGGATATTGAGGTGAAGAAACAATATTATTCAAGAGACTACACTAAGCCTTTCGTTTACAATTGCAATTTCGATTTGTTTAGTGCACCCAATGCCAATTGGAGGGACACCATTTTCACTCAAATGACCCCAAATTCTCCTAACCCACAAGACTTGCCTCAAGTTTGCAGTGACATCCTAATTGATTATTCAAAACAAATGGAGAAGCTTGGAGAAATAATATTTGGACTATTTTCGGAGGCCCTTGGTTTGAAACCAACCCATCTCATAGACTTGGATTGCAATGAGGGACATGCCATTCTCTGCCACTATTATCCGCCGTGTCCTCAGCCAGAACTCACCATTGGCGCCACCGAGCACTCCGACAGTAGCTTCATCACAGTGCTGCTGCAGGACCATATCGGAGGTCTTCAGGTGCTTCATAACAACGAATGGGCAGATATTCCACCAGTTTCCGGAGCTTTGGTCGTCAACCTGATTTCGAATGACAAATTTGTGAGCTCAGTGCATAGAGTGGTGGCCAATCGTGAAGGTTGTCCAAGAGTATCGGTCGCAAGTTTCTTCACCACTGGCATTATTTCAACCTCCAAACTTTATGGACCCATCAAACAATTATTGTCCGAACAAAATCCTCCCAAGTATACACAAATCACAGTTAAAGAGTATCATTGGGCATTCAAATTCGAGGTCGAAACAAACATAATGGCCAACCTCACACCTTTCTCTAAACTTGATGAAACCTTCGACAGAGCTTCGGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCGTCAAAGGCCTTGTGGACTCCGGCGTCGCCGAGATCCCAAGAATATTCTACCGCCCACTCAA
ACACGTCTCTAATTCCGGCGAGACCTCTGTTCCCCACGAGCCCCATTTGGGTGTTCCGGTGGTGGATCTGGAAGACATCGATAAACACCCCTTCAAACGCAGAGAAGTGGTGGACAAAATCCGGGAAGCTTCAGAAACGTGGGGGTTCTTCCAAGTGCTTAACCATGGAGTTCCGGTGAGTGTTCAGGAGGAGATCATAAATGGGGTGCATCGATTTTTCGAACAGGATATTGAAGTGAAGAAACGATATTATACCAGAGATAACACTAAGCCTTTCGTTTACAATTGCAATTTCGATTTGTTTAGCGCACCCACTACGAATTGGAGGGACACCACCTTCACTCAAATGACCCAAAATTCTCCCAACCCACAAGACTTGCCTCAAGTTTGCAGTTTTAACATACTAACAAAAATCAACTATCTTCCTCAAATAAAAATTTCAAAATTTTATAACACAATATCAACAGACATCCTAATTGAATATTCAAAACAAATGGAGAAGGTTGGAGAATTGATATTTGGGCTACTTTCCGAGGCGCTTGGTTTGAAATCAACCCATCTCCTAGAATTGGATTGCAATGAGGGACATGCCATTATGTGCCACTATTATCCGCCGTGTCCTCAGCCAGAACTCGCCATCGGCACCACCGAGCACTCCGACAGTGGCTTCATCACCGTGCTGCTGCAAGACCATATCGGAGGTCTTCAGGTGCTTCATCACAACAAATGGGTGGATATTCCACCAGTTCCCGGAGCCTTAGCCGTCAATGTCGGCAGCTTACTGCAGGCAAGGCTTTAA

Coding sequence (CDS)

ATGGCGGTTTCAGCTTCCGTAGTCGATGGATTCATCCTCACTCCGCTATCCAAAGCCGACGAAAACTACCACAGACCCACCGAAATCAAAGCTTTCGACGATACTAAAGCCGGCGTCAAGGGCTTAGTCGACGCTGGAATTAACGAAATTCCACGCATCTTTTACCAGCCACCGGAAGATTACTACTCCGACAATATTTCCGGCGAAACCCAGTATCAAATTCCAGTGATCGACCTCGATGACGTCCACAGAAATTCACTCAAACGGAAAGACACCATAAACAGAGTCCGAGAAGCTTCAGAGAAATTGGGTTTTTTCCAACTGATTAACCATGGGATTCCGGTGGGCGTTCTGGAAGAGCTGAAAGGTGCGGTCAAGAGATTCAACGAACAAGACACGGAAGTGAAGAAACAATACTACACCCGGGACAACACCAAGCCTTTGATTTACAACAGCAATTTCGATCTGTACAGCGCATCGACCACGAATTGGAGAGACACTCTTGGGTATATAAGTGCCCCTAATCCTCCCAATCCGCAAGACTTACCCGAAATCATCAGAGATAATCTGGTGGATTACTCGAAAAGAGTAATGGAAATTGGGAAATTGTTGTTTGAATTGTTGTCGGAAGCTTTGGGGCTGAACCCAAATTACTTGAACGAAATAGGCTGCAGCGAGGGGCTGGCAATTGGGTGCCATTATTACCCACCATGCCCACAGCCGAATTTGACACTGGGCACATCCGAGCACAGTGACAATGTTTTCATCACTGTTCTGTTTCAAGACAACATCGGCGGGCTTCAGATTCGACACCAGAAAAAGTGGGTGGATGTGCCACCTGTCGCCGGAGCGTTGGTGGTCAACATCGGAGAGCTTATGCAGTTAATAACAAACGACAGATTCATAAGCGTGGCGCATAGAGTGTTGGCAAAGAAGGAAGGACCGAGAATTTCAGTAGCGAGCTTTTTTTCGACATTGGCTTATCGGAGCTCAAAAGTGTATGGACCCATAAAGGAATTGTTGTCGGAAGAGAATCCTCCAAAGTACAGAGAAACCACAATCAGAGATTTCCATATGCTGTATCGTGCAGATGGGCTTGGGACTTCGAAAGCTAAGAGATCGACGATGGCGGTCGCAGCCTCAAGAGTCAACGGAGCCAATCTCACTCCGCTATCCGAAGCCGACGAAAACTACCACAGACCCACAGAACTCAGAGCTTTCGATGACACCAAAGCCGGAGTTAAAGGTTTAGTCGACGCCGGTATTACCGAAATTCCTCGAATTTTTTACTTCCCACCGGAGGATTATAACTCCGACAACGCAACCGTGGAAATCCAAATCCAGATTCCGGTGATAGACTTCGACCACGTTGGCAGAAATTCACTCAAACGCAAATACACCATCGACAGAATCCGAGAAGCTTCTGAGAAATTGGGGTTTTTCCAACTGATTAACCATGGGATTCCAGTGAGCGTTCTGGAAGAGATGAAAGATGCGGTTCGGAGATTTCACGAACAAGAAACGGAATTGAAGAAACAATATTATACCCGCGACCTGACTAAGCCTTTGATTTACACCAGTAATTTCGATCTGTATTCTGCCGCGACCACCAATTGGAGAGATGCGTTTAGATATGTTAGTTCCCCAAACGCCCATGATCCGCAAGTCCTGCCAGAAATTTGCAGAGATATTTTAGTGGAATACTCGAAACAAGTGATGGAGATTGGGAAATTAGTGTTTGAATTGCTGTCGGAAGCTCTGGGTTTGAATCCAAATTACTTGAATGATATAGACTGCAGCGAGGGGCTTGCATTTGTGTGCCACTATTACCCACCATGCCCACAGCCAAATTTGGCCATCGGCACATCGGAGCACACTGACAATGGCTTCATCACTGTGTTGTTGCAAGACCACATCGGTGGCTTACAAATTCGCCATGGGAACAATTGGGTGGACATTCCCCCTGTCGCCAGAGCTTTAGTGAGATCGACAATGGCGGT
CGTAGCCTCAAGAGTCAACGCACCCAATCTCACTCCGCTATCCAAAGCCGACGAAAACTATCACAGACCCACTGATCTCAAAGCTTTCGATGACACTAAAGCCGGCGTCAAAGGCTTAGTCGACGCAGGAATCACCGAAATTCCTCGTATCTTTTACCGCCCACCGGAGACTTTCGACTCCGACAATATTTCCGGCGAAACCCAAATTCACATACCAGTGGTAGACCTCGACCACATCAACAAAAATTCACTCAAACGCAAGTACACAATCGACATAGTCAGAGAAGCGTCAGAGAAATTGGGTTTTTTCCAACTGGTTAACCATGGGATTCCGGTGGACGTTCTCGAGGAGATGAAAGATGCAGTTCGGAGATTTAACGAACAAGAAACGGAATCCAAGAAACAATATTACACTCGTGACCTCACCAAGCCTTTAATTTACAACAGTAATTTTGATCTGTATACTGCAGCCACCACCAATTGGAGGGATACCTTTGGCTATATAAGTGCCCCAAATTCCCACAATCCTCAAGACCTGCCGGAAATTTGCAGGGATATTCTAGTGGATTACTCGAAACGAGTGATGGAGATTGGGAATTTACTGTTTGAATTGCTGTCGGAAGCTCTGGGTTTGAATCCAAATTATTTGAAGAACATAGACTGCAATGAGGGGCTTGCACTTGTATGCCACTATTACCCACCATGCCCACAGCCGAATTTGGCCATCGGCACATCGGAGCACACCGACAATGACTTTATCACTGTGCTATTGCAAGACCAAATCGGCGGCCTTCAAATTCGGTATGAGAACAAGTGGGTCGACGTGCCCCCAGTCGCCGGAGCTTTAGTGGTCAACATCGGAGATCTTATGCAGCTAATAACAAATGACAAATTCAAAAGCGTTAAGCACAGAGTTCTTGCAAACAAGAAGGGTCCGAGAAGAAACCACAATCAGAGACTTCACTATTCAATTCCCTTCAAAAGATCGATAATGGCGGTTGCAGCCTCAGAAATCAACGGACTCAACCTCACTCCACTATCCAAAGCCGACGACAACTTCCACAGACTCACTGAACTCAAAGCTTTCGATGACACTAAAGCCGGCGTTAAAGGCTTAGTCGACGCCAAAGTTACCGAAATCCCCCGTATCTTTTACCACCCACCGGAGGAATATGACTCCGTTGAAACCCAAATCCGTATTCCACTGATAGACCTCGATGGCGTCGGCAAAGATTCACTCAAACGCAAACACATCGTCGATCAAATCCGTGACGCTTCAGAGGAATTGGGTTTCTTCCAAGTGATTAACCATGGAATTTCGGTGAGTGTTCTTGATGAAATAAAAGACTCTGTTCGGAGATTTCACGAACAAGACACGGAAGTGAAGAAACAATATTATACTCGAGACCTCATGAAGCCTTTTATTTACAACAGTAATTTCGATTTGTATTCTGCACCCACCACCAATTGGAGGGATACTTTTAGCTATGTTAGTGCTCCAAATCCTCCCAATCCACAGGAATTGCCGGAAATTTGCAGCAGAGATATTCTGGTGGACTACTCGAAATGGGTGATGAAGATTGGAAAATTAGTGTTGGAATTGCTGTCTGAAGCTCTAGGTTTGAATCCCAATTACTTGAACAACATTGGGTGCAGTGATGGGCTTGAATTTGTATGCCACTATTACCCTGCATGTCCACATCCAAAATTGACCACTGGGATATCCGAGCACACGGACGCTGACTTCATCACGGTGCTGTTGCAAGACCACATCGGCGGCCTACAAATCCGTCATCATAACAATTGGATTGACGTGCATCCTGTCGCCGGAGCCTTAGTGGTCAACATCGGAGACCTTATGCAGGTTACCAATGCCGACGAACATTTCGATAGAGCTGCCGAACTGAAACTCTTCGACGACACAAAAGCCGGAGTGAAAGGCCTTGTCGACAGCGGCATAACCCAAATACCGCGAATATTTTACCGACTGCCTGATTCCG
GCGTATCCCCTGTTCCCGGAGACACCGAACTGAGCATTCCAGTGATAGATCTCGAAGCCATCGACAGAGATTCATCCAAACGAAGAGATGTCGTCAACAAAGTCCGAGAAGCCTCTGAGAAATGGGGTTTTTTCCAATTGGTCAACCATGGAGTTCCGGTGAGTGTTCTGGATGAGATGAAAAAAGGGACACTAAGATTTTACGAACAAGATACCCAATTGAAGAAGCAATTCTACACCCGCCACAACACAAAATCCATTGTTTACAACAGTAATTTCGATCTGTTTACTGCACCAGCCGCTAATTGGAGAGACACATTCCTCTGTTTCATGGCTCCCAATCTTCCAAATCCACAAGACTTGCCTGAAATTTGCCGAGATATCCTGTTTGATTACTCAAAAGAAATGAAGAAATTGGGGAGAATATTATTTGGGTTGCTTTCGGAGGCTCTTGGGTTGAACACAAATTATCTGAGCGACATCGAATGCGACAGGGGACTAGCTGTTCTGTGTCATTATTATCCAGCATGTCCACAGCCAGAGCTCACTTTAGGCACAACCGAGCACGCCGACAATGATTTCTTGACGGTACTTCTACAAGACGACCAAATCGGAGGTCTTCAAGTTCTTCACCAGAAGAAGTGGATTGATATTCCCCCAATTCCCGGGGCCTTGGTCCTTTCCCATTTTGTAATACTTAGCATTGAATTATTTTTGCAGCTGATTTCGAACGATGGGTTCAAGAGCGTGGAGCATCGAGTGCTGGCGAATCGTGATGGTCCGAGAGTTTCAATTGCGAGCTTTTTTGGTATCGGCGTTTATACAACATCTCAAGTTTATGGACCCATAAAGGAATTGTTATCGGAACAAAATCCTGCAAAGTATGGAGAAACGACGCTCAAAGACTTCTATTTCTATCACAACAGCAGAGGCTTGAATGGAACTTCTGCTCTGCAACATTTCAGGCTCAGTCTGGACGATGAAGGAGATGCTACGCCCATTAAGGATTGTTTCATTCAAAGCTCCAAACAAAACAAAATGGCCAACCTTACACCTTTCTCCAAACTTGACCAGACCTTCGACAGAGCTTCGGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCGTCAAAGGCCTTGTGGACTCTGGCGTCGCCGAGATCCCAGGAATATTCTACTGCCCACCCAAAGAACATTCCAATTCCGTTCCCGAGGAAACCCATTTGGGTATTCCGGTGGTGGATCTGGAGGACATCGATAAAGATCCCTTCAAACGCAGAGAAGTGGTGGGCAAAATCCGAGAAGCTTCAGAGACCTGGGGTTTCTTCCAAGTGCTTAACCATGGAGTTCCGGCGAGTGTTCAGGAGGAGATCATAAATGGGGTGCATCGATTTTTCGAACAGGACATTGAAGTGAAGAAACAATATTATACAAGAGACAATACAAAGCCATTCGTTCACAATTGCAATTTCGATTTGTTTAGCGCACCTGTAGCCAATTGGAGGGATACTTTCTTCACTCTCATGGCCCCAATTTCTCCTAGCCCCCAAGACTTGCCTCAAGTTTGCAGAGACATCCTAGTTGAATATTCAAAACAAATAATGAAACTTGGAGAATTGATATTTGGACTACTTTCAGAGGCCCTTGGTCTGAAATCAACTCATCTCGTGGACTTGGACTGCAACGAGGGACTTTCCATTCTCGGCCACTATTATCCGCCGTGTCCTCAGCCTGAACTCTCCATCGGCACAACCGAGCACTCTGACAATACCTTCATCACCGTGCTACTGCAGGATGGTATGGGAGGCCTTCAGGTGCGTCAACACAACAAATGGGTGGATGTTCCGCCAGTTCCTGGGGCCTTCGTTATTAATGTTGGCAGCTTATTACAGTTGATAACGAATGACAGATTTGTGAGCTCGGAGCATAGAGTGGTGGCGAACCGTAAGGGTCCAAGAGTATCGGTGGCAGGTTTCTTCAGCACCGGCTCTCTACCAACCTCC
AAACTTTATGGACCCATCGAAGAATTGTTGTCGGAACAAAATCCTCCCAGATACAAAAAAATAAGTGTTAAAGAGTACAATCTCTACTTTGCTGAAAAAGGAGACAACAGTAAGCCTTTTTCTTACAACTGCAATTTCGATTTGTTTAGTGCACCCAGTGCCAATTGGAGGGACACCATCTTCACTCAAATGACCCCAAATTCTCCTAACCCACAAGACTTGCCTCAAGTTTGCAGAGACATCCTAATTGATTATTCAAAACAAATGGAGAAGGTTGGAGAATTGATATTTGGATTGCTTTCGGAAGCCCTCGGTCTAAAATCAACCCATCTGGTGGACTTAGATTGCAACCAGGGACATGCCATTCTTTGCCACTATTATCCGCCGTGTCCTCAGCCGGAACTCACCATCGGTACTACCGAGCACTCCGACGATACCTTTATCACCGTGCTGCTGCAGGATCATATCGGAGGTCTTCAAGTGCTTCATCACAACAAATGGGTGGATATTCCGCCAGTTCCCGGAGCTTTTGTTGTTAATCAGTTGATTTCGAATGACAAATTGGTGAGCTCGGTTCATAGAGTGTTGGCCAATCGTGAGGGTCCAAGAGTATCGGTGGCATGTTTCTTCACTACTGGCGCTATACCAACCTCCAAACTTTATGGACCCATCAAACAATTGTTGTCCCAACAAAATCCTCCAAAGTATAGACAAATCACTGTTAGAGAGTACGATCACCTTCATGCTCAAAAAGGCTTGGATGGAACGCATAAACAAAAAATGGCCAACCTCACACCCTTCTCCAAACTTGACCAAACCTTCGACAGAGCTTCAGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCTGGTCAAAGTCCTTGTGGACTCCGGCGTCGCAGAGATCCCAAGAATATTCTACCGCCCACTCAAACACGTCTCTAATTCCGGCAAGACCTCCCTTCCCGACGAACCCCATTTGGGTGTTCCGGTGGTGGATCTGGAAGACATCGATAAAGACCCCTTCAAACGAAGAGAAGTGGTGGACAAAATCCGAGAAGCTTCAGAAACGTGGGGGTTCTTCCAAGTGCTTAACCATGGGGTTCCGTGTTCAGGAGGAGATAATAAATGGGGCACATCGATTTTTCGAACAGGATATTGAGGTGAAGAAACAATATTATTCAAGAGACTACACTAAGCCTTTCGTTTACAATTGCAATTTCGATTTGTTTAGTGCACCCAATGCCAATTGGAGGGACACCATTTTCACTCAAATGACCCCAAATTCTCCTAACCCACAAGACTTGCCTCAAGTTTGCAGTGACATCCTAATTGATTATTCAAAACAAATGGAGAAGCTTGGAGAAATAATATTTGGACTATTTTCGGAGGCCCTTGGTTTGAAACCAACCCATCTCATAGACTTGGATTGCAATGAGGGACATGCCATTCTCTGCCACTATTATCCGCCGTGTCCTCAGCCAGAACTCACCATTGGCGCCACCGAGCACTCCGACAGTAGCTTCATCACAGTGCTGCTGCAGGACCATATCGGAGGTCTTCAGGTGCTTCATAACAACGAATGGGCAGATATTCCACCAGTTTCCGGAGCTTTGGTCGTCAACCTGATTTCGAATGACAAATTTGTGAGCTCAGTGCATAGAGTGGTGGCCAATCGTGAAGGTTGTCCAAGAGTATCGGTCGCAAGTTTCTTCACCACTGGCATTATTTCAACCTCCAAACTTTATGGACCCATCAAACAATTATTGTCCGAACAAAATCCTCCCAAGTATACACAAATCACAGTTAAAGAGTATCATTGGGCATTCAAATTCGAGGTCGAAACAAACATAATGGCCAACCTCACACCTTTCTCTAAACTTGATGAAACCTTCGACAGAGCTTCGGAACTGAAAGCCTTCGACCAAACTAAGGCCGGCGTCAAAGGCCTTGTGGACTCCGGCGTCGCCGAGATCCCAAGAATATTCTACCGCCCACTCAA
ACACGTCTCTAATTCCGGCGAGACCTCTGTTCCCCACGAGCCCCATTTGGGTGTTCCGGTGGTGGATCTGGAAGACATCGATAAACACCCCTTCAAACGCAGAGAAGTGGTGGACAAAATCCGGGAAGCTTCAGAAACGTGGGGGTTCTTCCAAGTGCTTAACCATGGAGTTCCGGTGAGTGTTCAGGAGGAGATCATAAATGGGGTGCATCGATTTTTCGAACAGGATATTGAAGTGAAGAAACGATATTATACCAGAGATAACACTAAGCCTTTCGTTTACAATTGCAATTTCGATTTGTTTAGCGCACCCACTACGAATTGGAGGGACACCACCTTCACTCAAATGACCCAAAATTCTCCCAACCCACAAGACTTGCCTCAAGTTTGCAGTTTTAACATACTAACAAAAATCAACTATCTTCCTCAAATAAAAATTTCAAAATTTTATAACACAATATCAACAGACATCCTAATTGAATATTCAAAACAAATGGAGAAGGTTGGAGAATTGATATTTGGGCTACTTTCCGAGGCGCTTGGTTTGAAATCAACCCATCTCCTAGAATTGGATTGCAATGAGGGACATGCCATTATGTGCCACTATTATCCGCCGTGTCCTCAGCCAGAACTCGCCATCGGCACCACCGAGCACTCCGACAGTGGCTTCATCACCGTGCTGCTGCAAGACCATATCGGAGGTCTTCAGGTGCTTCATCACAACAAATGGGTGGATATTCCACCAGTTCCCGGAGCCTTAGCCGTCAATGTCGGCAGCTTACTGCAGGCAAGGCTTTAA

Protein sequence

MAVSASVVDGFILTPLSKADENYHRPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDLDDVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYYTRDNTKPLIYNSNFDLYSASTTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVMEIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITVLFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRISVASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGLGTSKAKRSTMAVAASRVNGANLTPLSEADENYHRPTELRAFDDTKAGVKGLVDAGITEIPRIFYFPPEDYNSDNATVEIQIQIPVIDFDHVGRNSLKRKYTIDRIREASEKLGFFQLINHGIPVSVLEEMKDAVRRFHEQETELKKQYYTRDLTKPLIYTSNFDLYSAATTNWRDAFRYVSSPNAHDPQVLPEICRDILVEYSKQVMEIGKLVFELLSEALGLNPNYLNDIDCSEGLAFVCHYYPPCPQPNLAIGTSEHTDNGFITVLLQDHIGGLQIRHGNNWVDIPPVARALVRSTMAVVASRVNAPNLTPLSKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYRPPETFDSDNISGETQIHIPVVDLDHINKNSLKRKYTIDIVREASEKLGFFQLVNHGIPVDVLEEMKDAVRRFNEQETESKKQYYTRDLTKPLIYNSNFDLYTAATTNWRDTFGYISAPNSHNPQDLPEICRDILVDYSKRVMEIGNLLFELLSEALGLNPNYLKNIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDQIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKKGPRRNHNQRLHYSIPFKRSIMAVAASEINGLNLTPLSKADDNFHRLTELKAFDDTKAGVKGLVDAKVTEIPRIFYHPPEEYDSVETQIRIPLIDLDGVGKDSLKRKHIVDQIRDASEELGFFQVINHGISVSVLDEIKDSVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLYSAPTTNWRDTFSYVSAPNPPNPQELPEICSRDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGCSDGLEFVCHYYPACPHPKLTTGISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGALVVNIGDLMQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPGDTELSIPVIDLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYEQDTQLKKQFYTRHNTKSIVYNSNFDLFTAPAANWRDTFLCFMAPNLPNPQDLPEICRDILFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLGTTEHADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISNDGFKSVEHRVLANRDGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKDFYFYHNSRGLNGTSALQHFRLSLDDEGDATPIKDCFIQSSKQNKMANLTPFSKLDQTFDRASELKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPEETHLGIPVVDLEDIDKDPFKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIEVKKQYYTRDNTKPFVHNCNFDLFSAPVANWRDTFFTLMAPISPSPQDLPQVCRDILVEYSKQIMKLGELIFGLLSEALGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDNTFITVLLQDGMGGLQVRQHNKWVDVPPVPGAFVINVGSLLQLITNDRFVSSEHRVVANRKGPRVSVAGFFSTGSLPTS
KLYGPIEELLSEQNPPRYKKISVKEYNLYFAEKGDNSKPFSYNCNFDLFSAPSANWRDTIFTQMTPNSPNPQDLPQVCRDILIDYSKQMEKVGELIFGLLSEALGLKSTHLVDLDCNQGHAILCHYYPPCPQPELTIGTTEHSDDTFITVLLQDHIGGLQVLHHNKWVDIPPVPGAFVVNQLISNDKLVSSVHRVLANREGPRVSVACFFTTGAIPTSKLYGPIKQLLSQQNPPKYRQITVREYDHLHAQKGLDGTHKQKMANLTPFSKLDQTFDRASELKAFDQTKAGWSKSLWTPASQRSQEYSTAHSNTSLIPARPPFPTNPIWVFRWWIWKTSIKTPSNEEKWWTKSEKLQKRGGSSKCLTMGFRVQEEIINGAHRFFEQDIEVKKQYYSRDYTKPFVYNCNFDLFSAPNANWRDTIFTQMTPNSPNPQDLPQVCSDILIDYSKQMEKLGEIIFGLFSEALGLKPTHLIDLDCNEGHAILCHYYPPCPQPELTIGATEHSDSSFITVLLQDHIGGLQVLHNNEWADIPPVSGALVVNLISNDKFVSSVHRVVANREGCPRVSVASFFTTGIISTSKLYGPIKQLLSEQNPPKYTQITVKEYHWAFKFEVETNIMANLTPFSKLDETFDRASELKAFDQTKAGVKGLVDSGVAEIPRIFYRPLKHVSNSGETSVPHEPHLGVPVVDLEDIDKHPFKRREVVDKIREASETWGFFQVLNHGVPVSVQEEIINGVHRFFEQDIEVKKRYYTRDNTKPFVYNCNFDLFSAPTTNWRDTTFTQMTQNSPNPQDLPQVCSFNILTKINYLPQIKISKFYNTISTDILIEYSKQMEKVGELIFGLLSEALGLKSTHLLELDCNEGHAIMCHYYPPCPQPELAIGTTEHSDSGFITVLLQDHIGGLQVLHHNKWVDIPPVPGALAVNVGSLLQARL

Homology

BLAST of CmUC05G085020 vs. NCBI nr
Match: KAG5588104.1 (hypothetical protein H5410_048538 [Solanum commersonii])

HSP 1 Score: 1843.6 bits (4774), Expect = 0.0e+00
Identity = 976/2132 (45.78%), Positives = 1319/2132 (61.87%), Query Frame = 0

Query: 22   NYHRPTEIKAFDDTKAGVKGLVDAG-INEIPRIFYQPPEDYYSDNISGETQYQIPVIDLD 81
            +Y + +E+KAFDDTKAGVKGLVDAG   E+PRIF  P E  ++ +   E ++  PVIDL+
Sbjct: 23   SYDKHSELKAFDDTKAGVKGLVDAGNSTEVPRIFVHPRESIHNSSGFTEKEFVFPVIDLE 82

Query: 82   DVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYY 141
             +  N ++ K+ +++VR+ASE  GFFQ++NHGIP  VLEE+    +RF EQD E+KKQYY
Sbjct: 83   GID-NPMRHKEIVDKVRDASETWGFFQVVNHGIPFPVLEEMLQGARRFFEQDVEIKKQYY 142

Query: 142  TRDNTKPLIYNSNFDLYSAS--TTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVM 201
            TRD  K + + SNF L+S S    +WRD+L   + PNPP+ ++ P   R+ L+++SK++M
Sbjct: 143  TRDTMKKVAHVSNFYLFSPSVPAESWRDSLYCFTCPNPPSLEEFPRACREILIEFSKKMM 202

Query: 202  EIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITV 261
            ++G  LFELLSE LGLNP +L ++ C+EGL+I  HYYP CPQP LT+GT +HSD VFITV
Sbjct: 203  KLGNSLFELLSEGLGLNPCHLKDMNCAEGLSIAQHYYPACPQPELTIGTRQHSDCVFITV 262

Query: 262  LFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRIS 321
            L QD+I GLQ+RHQ +W+DVPP  GALVVNIG+L+QLI+ND++ISV HRVL+ K GPRIS
Sbjct: 263  LLQDDIEGLQVRHQNQWIDVPPTPGALVVNIGDLLQLISNDKYISVEHRVLSNKVGPRIS 322

Query: 322  VASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGL-GTS-------- 381
            V  FFST  + S ++YGPIKEL+SE NPPKYR TT++++   +R  G  GTS        
Sbjct: 323  VPCFFSTGTFPSPRIYGPIKELVSECNPPKYRATTVKEYTDYFRKKGFDGTSMLLDYKIL 382

Query: 382  ------------KAKRSTMAVA---------ASRV---------------NGANLTPLSE 441
                        +    T A+A          SR                  A    +S 
Sbjct: 383  HGDKSAKDVLRKQTHLVTCAIACILEYFVLLVSRAAYEFLGTSGILLVYNTLAKKMAISS 442

Query: 442  ADE-------NYHRPTELRAFDDTKAGVKGLVDAGITEIPRIFYFPPEDYNSDNATVEIQ 501
             D+       +Y + +EL+AFDDTKAG+KGLVDAGIT++PRIF  PP+D    + T E Q
Sbjct: 443  TDDFQTTIQKSYDKMSELKAFDDTKAGIKGLVDAGITKVPRIFMLPPKDRPESSDTCETQ 502

Query: 502  IQIPVIDFDHVGRNSLKRKYTIDRIREASEKLGFFQLINHGIPVSVLEEMKDAVRRFHEQ 561
               PV+D + + ++ +K K  +D++R+ASE  GFFQ++NHGIPVSVLEEM    R+F EQ
Sbjct: 503  FIFPVMDLEGISKDPIKHKEIVDKVRDASETWGFFQVVNHGIPVSVLEEMLQGTRKFFEQ 562

Query: 562  ETELKKQYYTRDLTKPLIYTSNFDLYSAA--TTNWRDAFRYVSSPNAHDPQVLPEICRDI 621
            + E+K QYYTRD+T  ++++ NFDLYS +    NWRD+   + +PN   P+  P  CR+I
Sbjct: 563  DIEVKNQYYTRDITNKVVHSCNFDLYSPSVPAANWRDSLFCLMAPNPPSPEEFPTACREI 622

Query: 622  LVEYSKQVMEIGKLVFELLSEALGLNPNYLNDIDCSEGLAFVCHYYPPCPQPNLAIGTSE 681
            L+E+S  +M++GK VFELLSE LGLNP++LNDI C+EGLA + HYYP CPQP L +GTS+
Sbjct: 623  LMEFSDHIMKLGKSVFELLSEGLGLNPSHLNDIGCAEGLAVLGHYYPACPQPELTMGTSK 682

Query: 682  HTDNGFITVLLQDHIGGLQIRHGNNWVDIPPVARALVRS--------------------- 741
            H+D+GFITVLLQDHIGGLQ+ H N WVD+PP   A+V +                     
Sbjct: 683  HSDHGFITVLLQDHIGGLQVLHQNQWVDVPPTPGAIVVNIGDLLQASILVSNDKYISVEH 742

Query: 742  ------------------TMAVVASRVNAPNLTPLSKADENYHRPTDL---------KAF 801
                              T  + +S +  P    LS+ +   +R T +         KAF
Sbjct: 743  RVLTNKLSSRISVACFFGTGPLPSSNLYGPITELLSEDNPPKYRSTTVNDYTGYYRKKAF 802

Query: 802  DDTKAGVKGLVDAGITEIPRIFYRPPETFDSDNISGETQIHIPVVDLDHINKNSLKRKYT 861
            DDTKAGVKGLVDAGIT++P+IF  PP+         E Q   PV+DL+ I+++ +K K  
Sbjct: 803  DDTKAGVKGLVDAGITKVPQIFILPPKNRPESLDISEKQFIFPVIDLEGIDEDPIKHKEI 862

Query: 862  IDIVREASEKLGFFQLVNHGIPVDVLEEMKDAVRRFNEQETESKKQYYTRDLTKPLIYNS 921
            +D VR+ASE  GFFQ+VNHGIP  VLE M    R F EQ+ E    + +    + L++  
Sbjct: 863  VDNVRDASETWGFFQVVNHGIPTSVLEVMLQGTREFFEQDIE---PFCSSCKLERLLF-- 922

Query: 922  NFDLYTAATTNWRDTFGYISAPNSHNPQDLPEICRDILVDYSKRVMEIGNLLFELLSEAL 981
                             +  APN  +P++ P  CR IL+DYSK VME+G  L  LLSE L
Sbjct: 923  -----------------FSMAPNPPSPEEFPRPCRGILMDYSKHVMELGCSLLGLLSEGL 982

Query: 982  GLNPNYLKNIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDQIGGLQIRYE 1041
            GL+  +L+++DC EGL +V HYYPPCPQP L IGT++H+DNDFITVLLQD IGGLQ+ ++
Sbjct: 983  GLDCCHLEDMDCAEGLGVVGHYYPPCPQPELTIGTNKHSDNDFITVLLQDDIGGLQVLHQ 1042

Query: 1042 NKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKKGPRRNHNQRLHYSIPFKRS 1101
            N+WVDVPP  GA+V        LI+NDK+ SV+HRVL+NK GPR         S+    S
Sbjct: 1043 NQWVDVPPTPGAIV--------LISNDKYLSVEHRVLSNKVGPR--------ISVACFFS 1102

Query: 1102 IMAVAASEINGLNLTPLSKADDNFHRLTELKAFDDTKAGVKGLVDAKVTEIPRIFYHPPE 1161
               + +S++ G     LS+ +   +  T +KAF D                    Y   +
Sbjct: 1103 TGPLPSSKLYGPIAELLSEDNPPKYCATTVKAFSD--------------------YFRKK 1162

Query: 1162 EYDSVETQIRIPLIDLDGVGKDSLKRKHIVDQIRDASEELGFFQVINHGISVSVLDEIKD 1221
              D        P+IDL+G+ +D +K K IVD++RDASE  GFFQV+NHGI  SVL+E+  
Sbjct: 1163 GLDGTSALFIFPVIDLEGIDEDPIKHKEIVDKVRDASETWGFFQVVNHGIPTSVLEEMLQ 1222

Query: 1222 SVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLYS--APTTNWRDTFSYVSAPNPPNPQE 1281
              R+F EQD  +KKQYY+RD  K  I+ SNFDLYS   P  NWRDT   + AP+P +PQE
Sbjct: 1223 GTRQFFEQDVVIKKQYYSRDTTKRVIHTSNFDLYSPYVPAANWRDTLFCLMAPDPLSPQE 1282

Query: 1282 LPEICSRDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGCSDGLEFVCHYYPACPH 1341
            LP  C R+IL+DYSK VMK+G  +LELLSE                          ACP 
Sbjct: 1283 LPTAC-REILMDYSKDVMKLGFSLLELLSE--------------------------ACPQ 1342

Query: 1342 PKLTTGISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGALVV------------- 1401
            P+L  G ++H+D DFITVLLQDHIGGLQ+ H N W++V P  GALV+             
Sbjct: 1343 PELAIGTNKHSDNDFITVLLQDHIGGLQVLHQNQWVNVPPTPGALVLISNDKYISVEHRV 1402

Query: 1402 ---------------------------NIGDLMQVTNADEH------------------- 1461
                                        I +L+   N  ++                   
Sbjct: 1403 LANKVGPRISVACFFYTGSMPSSKLYGPITELLSEDNPPKYRATTVKDYRDYFRKKGLDG 1462

Query: 1462 ---------------FDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPG 1521
                           +DR +EL+ FD+TKAGVKG+VD+GIT++PRIF +           
Sbjct: 1463 TSTFTDDFEARVPGSYDRMSELRAFDNTKAGVKGIVDAGITEVPRIFVQPTKIEECVSSC 1522

Query: 1522 DTELSIPVIDLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRF 1581
            +T+   PVIDLE ID+D  K +++V+KVR+ASE WGFFQ+VNH +P+SV++EM +GT RF
Sbjct: 1523 ETKFIFPVIDLEGIDKDPIKHKEIVDKVRDASETWGFFQVVNHDIPLSVMEEMLQGTRRF 1582

Query: 1582 YEQDTQLKKQFYTRHNTKSIVYNSNFDLFT--APAANWRDTFLCFMAPNLPNPQDLPEIC 1641
            +EQD  +KKQ+YTR NTK +V+ SNFDL++   PA NWRD+  C MAPN P+P++LP   
Sbjct: 1583 FEQDVDIKKQYYTRDNTKKVVHVSNFDLYSPFVPATNWRDSIFCLMAPNHPSPEELPIAY 1642

Query: 1642 RDILFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLG 1701
            R+IL ++S  +  LG+ LF LLSE LGL+ ++L++I+C  GL VL HYYPACPQPELT+G
Sbjct: 1643 REILMEFSNHVMTLGKSLFELLSEGLGLDPSHLNNIDCSEGLRVLGHYYPACPQPELTIG 1702

Query: 1702 TTEHADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISND 1761
            T +H+DNDF+TVLLQ DQIGGLQVLH+ +WID+PP PGALV      ++I   LQLISND
Sbjct: 1703 TNKHSDNDFITVLLQ-DQIGGLQVLHKTQWIDVPPTPGALV------VNIGDLLQLISND 1762

Query: 1762 GFKSVEHRVLANRDGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKDFYF 1821
             + SVEHRVL+N+ GPR+S+A FF  G   T+++YGPIKELLS+ NP KY  TT+KD+  
Sbjct: 1763 KYLSVEHRVLSNKVGPRISVACFFYTGSLPTTKLYGPIKELLSDDNPPKYRTTTVKDYAD 1822

Query: 1822 YHNSRGLNGTSALQHFRLSLDDEGDATPIKDCFIQSSKQNKMANLTPFSKLDQTFDRASE 1881
            Y   + + G   L + R    D  +                             +DR  E
Sbjct: 1823 YFREKDMIG---LNNGRTEQYDSSE-----------------------------YDRERE 1882

Query: 1882 LKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPE-------ETHLGIPVVDLEDI 1941
            ++AFD +KAGVKGLVD GV  +P IF      H+  V E        T   IPV+D E +
Sbjct: 1883 VQAFDDSKAGVKGLVDGGVTRLPRIFL-----HNQYVAEMKSDSEIVTKFSIPVIDFEGL 1942

Query: 1942 DKDPFKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIEVKKQYYTR 1964
             K   +R ++V  I++A E WGFFQV++H +P+ + E++I GV  F EQD EVKK++Y+R
Sbjct: 1943 GKSAAQRADIVRGIKDACEKWGFFQVVHHEIPSIIMEKVIEGVRHFHEQDSEVKKEFYSR 2002

BLAST of CmUC05G085020 vs. NCBI nr
Match: KAG8484564.1 (hypothetical protein CXB51_023069 [Gossypium anomalum])

HSP 1 Score: 1713.7 bits (4437), Expect = 0.0e+00
Identity = 947/2455 (38.57%), Positives = 1337/2455 (54.46%), Query Frame = 0

Query: 23   YHRPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDLDDV 82
            Y R +++KAFD+T+AGVKGLVD+GI  +PR+F+   E+             IP+IDL+ V
Sbjct: 24   YDRLSKLKAFDETRAGVKGLVDSGIKHVPRMFHHQFEN---------NSVSIPIIDLEKV 83

Query: 83   HRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYYTR 142
             +N   R++ + +VR AS+  GFFQ++NHGIP+ V+EE+K   +RF EQD E K Q+++R
Sbjct: 84   KQNRTTREEIVGKVRNASKTWGFFQVLNHGIPMNVMEEMKDGARRFFEQDVESKSQFFSR 143

Query: 143  DNTKPLIYNSNFDLYSASTTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVMEIGK 202
            D TK ++YNSNFDLYSA    WRDT+    AP+PP P++LP + RD +++YSK+VM +G 
Sbjct: 144  DYTKRVVYNSNFDLYSAPAAKWRDTVVCSMAPDPPKPEELPAVFRDIMLEYSKQVMNLGY 203

Query: 203  LLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITVLFQD 262
            LLFELLSEALGLNP+YL +I C++GL +  HYYP CPQP LTLG+S+H+DN F+T+L QD
Sbjct: 204  LLFELLSEALGLNPDYLKDIDCAKGLVMLSHYYPICPQPELTLGSSKHTDNGFLTILLQD 263

Query: 263  NIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRISVASF 322
            N+GGLQ+ HQ +W++VPP  GAL         LI+NDRF SV HRV+     PR+SVASF
Sbjct: 264  NVGGLQVLHQNQWINVPPTPGAL---------LISNDRFTSVDHRVVTNSVSPRVSVASF 323

Query: 323  FSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGLGTSKAKRSTMAVAASR 382
            F+T     +++YGPIKELLS+ NPPKY+ETT++DF   + + GL       S + +A  +
Sbjct: 324  FTTALVPDTRLYGPIKELLSQNNPPKYKETTVKDFITYFNSKGL-------SELLMAKPK 383

Query: 383  VNGANLTPLSEADENYHRPTELRAFDDTKAGVKGLVDAGITEIPRIFYFPPEDYNSDNAT 442
                           Y R +EL+AFD+TKAGVKGLVD+GI  +PR+F++ P+  + +++ 
Sbjct: 384  -------------PEYDRISELKAFDETKAGVKGLVDSGIKNVPRMFHYRPDKLDKNSSV 443

Query: 443  VE-IQIQIPVIDFDHVGRNSLKRKYTIDRIREASEKLGFFQLINHGIPVSVLEEMKDAVR 502
               + + IPVID + V  +    +  +D++R AS+  G FQ++NHGIP+SVL++MKD V 
Sbjct: 444  SNGVHVSIPVIDLEGVKEDPGTLRNIVDKVRNASKSWGVFQVVNHGIPLSVLQDMKDGVV 503

Query: 503  RFHEQETELKKQYYTRDL-TKPLIYTSNFDLYSAATTNWRDAFRYVSSPNAHDPQVLPEI 562
            +F EQ+   KK+++TRD  TK + Y SNFDLYS+   NWRD    + +PN   P  LPE+
Sbjct: 504  QFFEQDLAAKKKFFTRDYSTKNVAYNSNFDLYSSPAANWRDTLFCLMAPNPPMPHELPEV 563

Query: 563  CRDILVEYSKQVMEIGKL-------------------------VFELLS-EALGLNPNYL 622
             R ++      V+ + KL                           +  S EALGL P++L
Sbjct: 564  SRYLITYVKCFVLFMTKLQNSGYHPSESNNRNMIQKSSHQQWSYLDFYSIEALGLQPDHL 623

Query: 623  NDIDCSEGLAFVCHYYPPCPQPNLAIGTSEHTDNGFITVLLQDHIGGLQIRHGNNWVDIP 682
             D+ C++GL  + HYYP CPQP L +GT++H+DN F+TVLLQDHIGGLQ+ H N WVD+P
Sbjct: 624  KDMGCTQGLGMLSHYYPACPQPELTLGTTKHSDNDFLTVLLQDHIGGLQVLHQNQWVDVP 683

Query: 683  PVARALV----------------------------------------------------- 742
            P   ALV                                                     
Sbjct: 684  PTPGALVLISNDGFRSLEHRVVANCVGPRVSVACFFSTFFVPDLRTYGPIKELISEENPP 743

Query: 743  ---------------------RSTMAVVASRVNAPNLTP--------------------- 802
                                   +  + A  VN+ N  P                     
Sbjct: 744  KYREVTMREYAGNYNAKGLDASRSFLICAELVNSLNHGPNQFQFSSTTKTRDDGKCDIHF 803

Query: 803  -----------LSKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYRPPETFDSD 862
                       ++  + +Y R T+L A D +K+G+KGLVDAG+ ++PR+F       +  
Sbjct: 804  LISLNETKKMMIATPEPDYDRKTELMAIDQSKSGIKGLVDAGLAKLPRVFVHDHLKLNIK 863

Query: 863  NISGETQIHIPVVDLDHINKNSLKRKYTIDIVREASEKLGFFQLVNHGIPVDVLEEMKDA 922
            +      ++IPV+D   +N +S +R   ID VR+A  K GFFQ+VNHGIPV  LEEM + 
Sbjct: 864  SGPAPDNVNIPVIDFAGVNTDSNRRAMIIDEVRKACMKWGFFQIVNHGIPVTTLEEMING 923

Query: 923  VRRFNEQETESKKQYYTRDLTKPLIYNSNFDLYTAATTNWRDTFGYISAPNSHNPQDLPE 982
            +RRF+EQE E+K++ Y+RD +K + +N+  D+       WRDT   + APN    ++LP 
Sbjct: 924  IRRFHEQEPEAKEKLYSRDESKKVTFNTKIDMSQTMAAYWRDTLTCVMAPNPPATEELPA 983

Query: 983  ICRDILVDYSKRVMEIGNLLFELLSEALGLNPNYLKNIDCNEGLALVCHYYPPCPQPNLA 1042
             CRDI++DY+  VM +G  LFEL+SEAL LNPN+LK+I C EGL ++ HY P CP+P L 
Sbjct: 984  TCRDIMLDYTNNVMNLGTTLFELISEALWLNPNHLKDIGCTEGLYVMGHYSPACPEPELT 1043

Query: 1043 IGTSEHTDNDFITVLLQDQIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSV 1102
            +GT  HTD+ F+TVLLQDQIGGLQ+ ++N+W+DV P+ GAL++N+GDL+QLI+NDKF SV
Sbjct: 1044 MGTGIHTDSGFLTVLLQDQIGGLQVLHDNRWIDVTPIPGALIINLGDLLQLISNDKFISV 1103

Query: 1103 KHRVLANKKGPR------------------------------------------------ 1162
             HRV+A   GPR                                                
Sbjct: 1104 YHRVVARNIGPRISIASFFRTYIQPQNALRMYGPIKELLSEDNPPIYKETNVVDYFKFKH 1163

Query: 1163 --------------------------------------RNHNQ----RLHYSIPF----- 1222
                                                   +H+     RLH S PF     
Sbjct: 1164 LKGVEGTSALAHFKLCTAIYSFSRKVYLKYALCQVPSNSSHSSTKAPRLHRS-PFHGLSP 1223

Query: 1223 ----------------------------KRSIMAVAASEING-----------LNLTPLS 1282
                                        K+ I    A   N            LN    +
Sbjct: 1224 DLAAFSDVEPKFTTARPCCKILQIGDDLKQYIDPCLAKHHNNYTFSKLPHQLPLNFKMAT 1283

Query: 1283 KADDNFHRLTELKAFDDTKAGVKGLVDAKVTEIPRIFYHPPE-----EYDSVETQIRIPL 1342
            +   N+ R  ELK FDDTK GVKGL DA +  IP+IF  P E     E +S +  I +P+
Sbjct: 1284 ETSVNYDRTKELKQFDDTKTGVKGLFDAGILNIPKIFVRPAEDLAADELNSSQKTIEVPI 1343

Query: 1343 IDLDGVGKDSLKRKHIVDQIRDASEELGFFQVINHGISVSVLDEIKDSVRRFHEQDTEVK 1402
            IDL  +G DS++RK I+++++ AS E GFFQVINHGI +SVLDE+ + +R F+EQD E+K
Sbjct: 1344 IDLSSIG-DSIRRKEIINEVKIASGEWGFFQVINHGIPLSVLDEMIEGIRLFNEQDLELK 1403

Query: 1403 KQYYTRDLMKPFIYNSNFDLYSAPTTNWRDTFSYVSAPNPPNPQELP------------- 1462
            K+ Y+RD  K   +NSNFDLY++ T +WRDT       + P+P ++P             
Sbjct: 1404 KEMYSRDSAKKVKFNSNFDLYTSKTADWRDTLQLTFLDSDPDPSQMPPKFDFWEGIPTSS 1463

Query: 1463 --------------------------------------------------------EICS 1522
                                                                     IC 
Sbjct: 1464 QGIFPSVIYLKLFNPSKHPVLSSVKNLHLNSVKSCMKFTEPKHFVGSNRVLQKKMVAICH 1523

Query: 1523 RDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGCSDGLEFVCHYYPACPHPKLTTG 1582
            R   ++Y K + ++G+ + ELL EALGL P+ L     SD ++ V HYYP CP P+LT G
Sbjct: 1524 RKSTMEYFKHMKRLGEALFELLLEALGLQPDTLTQ---SDVVK-VTHYYPPCPQPELTLG 1583

Query: 1583 ISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGALVVNI----------------- 1642
            + +H D   +TVLLQ+H+GGLQ+ H++ W +VHP  G L+ +I                 
Sbjct: 1584 VRKHADPGILTVLLQNHMGGLQVLHNSQWFNVHPTQGGLLASIHINFIFNHPNRLATNYF 1643

Query: 1643 -----------------------------------------GDLMQV------------- 1702
                                                     G + ++             
Sbjct: 1644 NVLSNDKFKSVKHRATANHDGPRISVPCFFSGHASLHDKSFGPIKELISEANPPRYKEFL 1703

Query: 1703 -----------------------------------------TNADEHFDRAAELKLFDDT 1762
                                                     T     +DR  ELK FDD 
Sbjct: 1704 LKEYIAKFLSSSLDNKPPKDYYKIQILYERKKATQLAEKMATETSVDYDRTKELKQFDDA 1763

Query: 1763 KAGVKGLVDSGITQIPRIFYRLPDSGVSPV--PGDTELSIPVIDLEAIDRDSSKRRDVVN 1822
            KAGVKGLVD+GI  IP+IF R  +   +     G   + +P+ID+  I  DS +R+++V 
Sbjct: 1764 KAGVKGLVDAGILNIPKIFVRPAEDLAAEELNSGHKNVEVPIIDVSNIG-DSIRRQEIVK 1823

Query: 1823 KVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYEQDTQLKKQFYTRHNTKSIVYNSNF 1882
            +V+ AS +WGFFQ++NHG+P+SVLDEM +G   F EQD +LKK+ Y+R + K + ++SNF
Sbjct: 1824 EVKIASGEWGFFQVINHGIPLSVLDEMIEGIRLFNEQDLELKKELYSRDSAKKVKFHSNF 1883

Query: 1883 DLFTAPAANWRDTFLCFMAPNLPNPQDLPEICRDILFDYSKEMKKLGRILFGLLSEALGL 1942
            DL+T+   +WRDT       + P+P  +P +CR    +Y K MKKLG  LF LLSEALGL
Sbjct: 1884 DLYTSETVDWRDTLQLTFLDSDPDPSQMPPVCRKSTMEYFKHMKKLGETLFELLSEALGL 1943

Query: 1943 NTNYLSDIECDRGLAVLCHYYPACPQPELTLGTTEHADNDFLTVLLQDDQIGGLQVLHQK 2002
              ++L+ +   +G +++ HYYP CPQPELTLG  +HAD   LT+LLQ + IGGLQVLH  
Sbjct: 1944 QADHLNSMGYSKGCSIVTHYYPPCPQPELTLGVRKHADAGILTMLLQ-NHIGGLQVLHNG 2003

Query: 2003 KWIDIPPIPGALVLSHFVILSIELFLQLISNDGFKSVEHRVLANRDGPRVSIASFFGIGV 2020
            +W DI P  G LV      ++I   LQ++SND FKSV+HRV++N  GPR+S+A FF    
Sbjct: 2004 QWFDIHPTRGGLV------INIGDLLQVLSNDEFKSVKHRVISNHVGPRISVACFFSGHA 2063

BLAST of CmUC05G085020 vs. NCBI nr
Match: KAG5588113.1 (hypothetical protein H5410_048547 [Solanum commersonii])

HSP 1 Score: 1620.5 bits (4195), Expect = 0.0e+00
Identity = 898/2079 (43.19%), Positives = 1198/2079 (57.62%), Query Frame = 0

Query: 17   SKADENYHRPTEIKAFDDTKAGVKGLVDAG-INEIPRIFYQPPEDYYSDNISGETQYQIP 76
            S    NY + +E+KAFDDTKAGVKGLVDAG   E+PRIF  P E  ++ +   E ++  P
Sbjct: 18   STLQPNYDKHSELKAFDDTKAGVKGLVDAGNSTEVPRIFVHPRESIHNSSGFTEKEFVFP 77

Query: 77   VIDLDDVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEV 136
            VIDL+ +  N ++ K+ +++VR+ASE  GFFQ++NHGIP  VLEE+    +RF EQD E+
Sbjct: 78   VIDLEGID-NPMRHKEIVDKVRDASETWGFFQVVNHGIPFPVLEEMLQGARRFFEQDVEI 137

Query: 137  KKQYYTRDNTKPLIYNSNFDLYSAS--TTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDY 196
            KKQYYTRD  K + + SNFDL+S S    +WRD+L   + PNPP+P++ P   R+ L+++
Sbjct: 138  KKQYYTRDTMKKVAHVSNFDLFSPSVPAASWRDSLYCFTCPNPPSPEEFPTACREILIEF 197

Query: 197  SKRVMEIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDN 256
            SK++M++G  LFELLSE LGLNP +L ++ C+EGL+I  HYYP CPQP LT+GT +HSD 
Sbjct: 198  SKKMMKLGYSLFELLSEGLGLNPCHLKDMNCAEGLSIAQHYYPACPQPELTIGTRQHSDC 257

Query: 257  VFITVLFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKE 316
             FITVL Q++I GLQ+RHQ +W+DVPP  GALVVNIG+L+QLI+ND++ISV HRVL+ K 
Sbjct: 258  AFITVLLQNDIEGLQVRHQNQWIDVPPTPGALVVNIGDLLQLISNDKYISVEHRVLSNKV 317

Query: 317  GPRISVASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGLGTSKAKR 376
            GPRISV  FFST A+ S ++YGPIKEL+SE NPPKYR TT++++   +R  G        
Sbjct: 318  GPRISVPCFFSTGAFPSPRIYGPIKELVSECNPPKYRATTVKEYTDYFRKKG-------- 377

Query: 377  STMAVAASRVNGANLTPLSEADENYHRPTELRAFDDTKAGVKGLVDAGITEIPRIFYFPP 436
                                             FD T                       
Sbjct: 378  ---------------------------------FDGTSM--------------------- 437

Query: 437  EDYNSDNATVEIQIQIPVIDFDHVGRNSLKRKYTIDRIREASEKLGFFQLINHGIPVSVL 496
                             ++D+          K     IR A E LG   ++         
Sbjct: 438  -----------------LLDY----------KILTFSIRAAYEFLGTSGIL--------- 497

Query: 497  EEMKDAVRRFHEQETELKKQYYTRDLTKPLIYTSNFDLYSAATTNWRDAFRYVSSPNAHD 556
                                                                        
Sbjct: 498  ------------------------------------------------------------ 557

Query: 557  PQVLPEICRDILVEYSKQVMEIGKLVFELLSEALGLNPNYLNDIDCSEGLAFVCHYYPPC 616
                                    LV+  L++ + ++                       
Sbjct: 558  ------------------------LVYNTLAKKMAIS----------------------- 617

Query: 617  PQPNLAIGTSEHTDNGFITVLLQDHIGGLQIRHGNNWVDIPPVARALVRSTMAVVASRVN 676
                        TD+   T+                                        
Sbjct: 618  -----------STDDFQTTI---------------------------------------- 677

Query: 677  APNLTPLSKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYRPPETFDSDNISGE 736
                       ++Y + ++LKAFDDTKAG+KGLVDAGIT++PRIF  PP+     + + E
Sbjct: 678  ----------QKSYDKMSELKAFDDTKAGIKGLVDAGITKVPRIFMLPPKDRPESSDTCE 737

Query: 737  TQIHIPVVDLDHINKNSLKRKYTIDIVREASEKLGFFQLVNHGIPVDVLEEMKDAVRRFN 796
            TQ   PV+DL+ I+K+ +K K  +D VR+A E  GFFQ+VNHGIPV VLEEM    R+F 
Sbjct: 738  TQFIFPVMDLEGISKDPIKHKEIVDKVRDAPETWGFFQVVNHGIPVSVLEEMLQGTRKFF 797

Query: 797  EQETESKKQYYTRDLTKPLIYNSNFDLYTAA--TTNWRDTFGYISAPNSHNPQDLPEICR 856
            EQ+ E K QYYTRD+T  ++++ NFDLY+ +    NWRD+   + APN  +P++ P  CR
Sbjct: 798  EQDIEVKNQYYTRDITNKVVHSCNFDLYSPSVPAANWRDSLFCLMAPNPPSPEEFPTACR 857

Query: 857  DILVDYSKRVMEIGNLLFELLSEALGLNPNYLKNIDCNEGLALVCHYYPPCPQPNLAIGT 916
            +IL+++S  +M++G  +FELLSE LGLNP++L +I C EGLA++ HYYP CPQP L +GT
Sbjct: 858  EILMEFSDHIMKLGKSVFELLSEGLGLNPSHLNDIGCAEGLAVLGHYYPACPQPELTMGT 917

Query: 917  SEHTDNDFITVLLQDQIGGLQIRYENKWVDVPPVAGALVVNIGDLMQ---LITNDKFKSV 976
            S+H+D+ FITVLLQD IGGLQ+ ++N+WVDVPP  GA+VVNIGDL+Q   L++NDK+ SV
Sbjct: 918  SKHSDHGFITVLLQDHIGGLQVLHQNQWVDVPPTPGAIVVNIGDLLQASILVSNDKYISV 977

Query: 977  KHRVLANK------------KGPRRNHN------QRLHYSIPFK---------------- 1036
            +HRVL NK             GP  + N      + L    P K                
Sbjct: 978  EHRVLTNKLSSRISVACFFGTGPLPSSNLYGPITELLSEDNPPKYRSTTVNDYTGYYRKK 1037

Query: 1037 -----RSIMAVAASEINGLNLTPLSKADDNFHRLTELKAFDDTKAGVKGLVDAKVTEIPR 1096
                  ++  V   E +  N+     +  N              A VKGLVDA +T++P+
Sbjct: 1038 GLDGTSALKIVLCIESSIYNIVRGPLSSKNLRNSIVYNIVAKKMACVKGLVDAGITKVPQ 1097

Query: 1097 IFYHPPEEY-DSVETQIRIPLIDLDGVGKDSLKRKHIVDQIRDASEELGFFQVINHGISV 1156
            IF  PP+   +S++T I           +D +K K IVD++RDASE  GFFQV+NHGI  
Sbjct: 1098 IFILPPKNRPESLDTSI----------DEDPIKHKEIVDKVRDASETWGFFQVVNHGIPT 1157

Query: 1157 SVLDEIKDSVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLY--SAPTTNWRDTFSYVSA 1216
            SVL+ +    R F EQD EV             ++ SNFDLY  S P  NWRD+  +  A
Sbjct: 1158 SVLEVMLQGTREFFEQDIEV-------------VHTSNFDLYSPSVPAANWRDSLFFSMA 1217

Query: 1217 PNPPNPQELPEICSRDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGCSDGLEFVC 1276
            PNPP+P+E P  C R IL+DYSK VM++G  +L LLSE LGL+  +L ++ C++GL  V 
Sbjct: 1218 PNPPSPEEFPRPC-RGILMDYSKHVMELGCSLLGLLSEGLGLDCCHLEDMDCAEGLGVVG 1277

Query: 1277 HYYPACPHPKLTTGISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGALVVNIGDL 1336
            HYYP CP P+LT G ++++D DFITVLLQD IGGLQ+ H N W+DV P  GA+VVNIGDL
Sbjct: 1278 HYYPPCPQPELTIGTNKYSDNDFITVLLQDDIGGLQVLHQNQWVDVPPTPGAIVVNIGDL 1337

Query: 1337 MQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPGDTELSI 1396
            +Q +          ++KL  D +  + G + S     P I   L +         T  + 
Sbjct: 1338 LQASILFHFLIEYCQIKL--DQEYQLHGPLPSSKLYGP-IAELLSEDNPPKYCATTVKAF 1397

Query: 1397 PVI----DLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYE 1456
                   DLE ID D  K +++V+KVR+ASE WGFFQ+VNHG+P SVL+EM +GT +F+E
Sbjct: 1398 SDYFRKKDLEGIDEDPIKHKEIVDKVRDASETWGFFQVVNHGIPTSVLEEMLQGTQQFFE 1457

Query: 1457 QDTQLKKQFYTRHNTKSIVYNSNFDLF--TAPAANWRDTFLCFMAPNLPNPQDLPEICRD 1516
            QD ++KKQ+Y+R  TK +++ SNFDL+  + PAANWRDT  C  AP+  +PQ+LP  CR+
Sbjct: 1458 QDVEIKKQYYSRDTTKRVIHTSNFDLYSPSVPAANWRDTLFCLKAPDPLSPQELPTACRE 1517

Query: 1517 ILFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLGTT 1576
            IL DYSK++ KLG  L  LLSE LGL+  +L D++C  GL +L HYYPACPQPEL +GT 
Sbjct: 1518 ILMDYSKDVMKLGFSLLELLSEGLGLDHCHLKDMDCAEGLGILGHYYPACPQPELAIGTN 1577

Query: 1577 EHADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALV-----LSHFVILSIELFLQLI 1636
            +H+DNDF+TVLLQ D IGGLQVLHQ +W+++PP PGALV     L    IL   +   LI
Sbjct: 1578 KHSDNDFITVLLQ-DHIGGLQVLHQNQWVNVPPTPGALVVNIGDLLQASILCFSIPYTLI 1637

Query: 1637 SNDGFKSVEHRVLANRDGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKD 1696
            SND +KSVEHRVLAN+ GPR+S+A FF  G   +S++YGPI ELLSE NP KY  TT+KD
Sbjct: 1638 SNDKYKSVEHRVLANKVGPRISVACFFYTGSMPSSKLYGPITELLSEDNPPKYRATTVKD 1697

Query: 1697 FYFYHNSRGLNGTSALQHFRLSLDDEGDATPIKDCFIQSSKQNKMANLTPFSKLDQTFDR 1756
            +  Y +                 D E                         +++  ++DR
Sbjct: 1698 YRDYFH-----------------DFE-------------------------ARVPGSYDR 1729

Query: 1757 ASELKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPEETHLGIPVVDLEDIDKDP 1816
             SEL+AFD TKAGVKG+VD+G+ E+P IF  P K        ET    PV+DLE IDKDP
Sbjct: 1758 MSELRAFDNTKAGVKGIVDAGITEVPRIFVQPTKIEECVSSCETKFIFPVIDLEGIDKDP 1729

Query: 1817 FKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIEVKKQYYTRDNTK 1876
             K +E+V K+R+ASETWGFFQV+NH +P SV EE++ G                    T+
Sbjct: 1818 IKHKEIVDKVRDASETWGFFQVVNHDIPLSVMEEMLQG--------------------TR 1729

Query: 1877 PFVHNCNFDLFSAPVANWRDTFFTLMAPISPSPQDLPQVCRDILVEYSKQIMKLGELIFG 1936
            P          S P  NWRD+ F LMAP  PSP++LP  CR+IL+E+S  +M LG+ +F 
Sbjct: 1878 P----------SVPATNWRDSIFCLMAPNHPSPEELPIACREILMEFSNHVMTLGKSLFE 1729

Query: 1937 LLSEALGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDNTFITVLLQDGMGG 1996
            LLSE LGL  +HL ++DC+EGL +LGHYYP CPQPEL+IGT +HSDN FITVLLQD +GG
Sbjct: 1938 LLSEGLGLDPSHLNNIDCSEGLRVLGHYYPACPQPELTIGTNKHSDNDFITVLLQDQIGG 1729

Query: 1997 LQVRQHNKWVDVPPVPGAFVINVGSLLQLITNDRFVSSEHRVVANRKGPRVSVAGFFSTG 2035
            LQV    +W+DVPP PGA V+N+G LLQLI+ND++ S EHRV++N+ GPR+SVA FF TG
Sbjct: 1998 LQVLHKTQWIDVPPTPGALVVNIGDLLQLISNDKYPSVEHRVLSNKVGPRISVACFFYTG 1729

BLAST of CmUC05G085020 vs. NCBI nr
Match: AAF24827.1 (F12K11.6 [Arabidopsis thaliana])

HSP 1 Score: 1250.7 bits (3235), Expect = 0.0e+00
Identity = 779/2072 (37.60%), Positives = 1089/2072 (52.56%), Query Frame = 0

Query: 25   RPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDLDDVHR 84
            R T +KAFD+TK GVKGL+DAGI EIP IF  PP    S      + + IP IDL     
Sbjct: 13   RSTLLKAFDETKTGVKGLIDAGITEIPSIFRAPPATLTSPKPPSSSDFSIPTIDLKGGGT 72

Query: 85   NSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYYTRDN 144
            +S+ R+  + ++ +A+EK GFFQ+INHGIP+ VLE++   ++ F+EQDTEVKK +Y+RD 
Sbjct: 73   DSITRRSLVEKIGDAAEKWGFFQVINHGIPMDVLEKMIDGIREFHEQDTEVKKGFYSRDP 132

Query: 145  TKPLIYNSNFDLYSASTTNWRDTLGYISAPNPPNPQDLP--------------------E 204
               ++Y+SNFDL+S+   NWRDTLG  +AP+PP P+DLP                     
Sbjct: 133  ASKMVYSSNFDLFSSPAANWRDTLGCYTAPDPPRPEDLPATCGFLRPHYSLVCLKDFGGH 192

Query: 205  IIRDNLVDYSKRVMEIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLT 264
              R+ +++YSK VM++GKLLFELLSEALGLN N+L ++ C+  L +  HYYPPCPQP+LT
Sbjct: 193  FFREMMIEYSKEVMKLGKLLFELLSEALGLNTNHLKDMDCTNSLLLLGHYYPPCPQPDLT 252

Query: 265  LGTSEHSDNVFITVLFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISV 324
            LG ++HSDN F+T+L QD+IGGLQ+ H + W                   LITND+FISV
Sbjct: 253  LGLTKHSDNSFLTILLQDHIGGLQVLHDQYW-------------------LITNDKFISV 312

Query: 325  AHRVLAKKEGPRISVASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRAD 384
             HRVLA   GPRISVA FFS+    + +VYGPIKE+LSEENPP YR+TTI ++   YR+ 
Sbjct: 313  EHRVLANVAGPRISVACFFSSYLMANPRVYGPIKEILSEENPPNYRDTTITEYAKFYRSK 372

Query: 385  G--------LG-----------TSKAKRSTMA------VAASRVNGANLTPL-------- 444
            G        LG           T KA   T+       +    VN   L  L        
Sbjct: 373  GFDACLIRVLGTLIICVMPSFMTVKAMHETVTRSIFGYIIGLCVNSRQLNHLMFQVLLFI 432

Query: 445  ---------SEADENYHRPTE----LRAFDDTKAGVK------------GLVD------- 504
                        D     P E    + +   TK  V              LVD       
Sbjct: 433  SMDAKKLDTGPRDAINWLPDEILGKILSLLATKQAVSTSVLSKKWRTLFKLVDTLEFDDS 492

Query: 505  -AGITEIPRIFYFPPEDYNSDNATVEIQIQIPVIDFD---HVGRNSLKRKYTIDR----- 564
             +G+ E    + FP    +  + TV +Q   P+       HVGR+  +RK  + R     
Sbjct: 493  VSGMGEQEASYVFPESFKDLVDRTVALQCDYPIRKLSLKCHVGRDDEQRKACVGRWISNV 552

Query: 565  ------------------------------------------------------------ 624
                                                                        
Sbjct: 553  VGRGVSEVVLRINDRGLHFLSPQLLTCKTLVKLTLGTRLFLGKLPSYVSLPSLKFLFIHS 612

Query: 625  ------------------------IREASEKLGF-----------------------FQL 684
                                    + +  E + +                       F L
Sbjct: 613  VFFDDFGELSNVLLAGCPVVEALYLNQNGESMPYTISSPTLKRLSVHYEYHFESVISFDL 672

Query: 685  IN-----------HGIPVSVLEEMKDAVRRFHEQETELKKQYYTRDLTKPLIYTSNFDLY 744
             N           +G P   LE + +A     + E        + D+TK ++   N ++ 
Sbjct: 673  PNLEYLDYSDYALYGYPQVNLESLVEAYLNLDKAE-----HVESPDVTKLIMGIRNVEIL 732

Query: 745  SAAT----------------------------TNWRDAFRYVSSPNAHDPQVLPEICRDI 804
            S +                             T    A++ ++      P++   I  D+
Sbjct: 733  SLSPDSVGVIYSCCKYGLLLPVFNNLVSLSFGTKKTRAWKLLADILKQSPKLETLIIEDL 792

Query: 805  ----------LVEYSK-QVMEIGKLVFELLSEALGLNPNYLNDIDCSEGLAFVCHYYPP- 864
                      L +  + Q++E G+   E L   L  + N L   D  +   F+    PP 
Sbjct: 793  NGYPLDVSMPLNQVKELQILEYGESDDERLQSRLE-SKNKLGSFDFEK---FLFPIPPPI 852

Query: 865  ----CPQPNLAIG----TSEHTDNGFITVLLQDHIGGLQIRHGNNWVDIP-PVARALVRS 924
                 P P+ + G    +   +D+ F++  L     GL    G  +  I   +A +++R 
Sbjct: 853  KITVSPHPSDSGGDGSNSKSSSDDEFVS--LDSSTSGLCSSSGKFFDLIKFLIALSVLRE 912

Query: 925  TMAVVASRVNAPNLTPLSKADENYHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYRP-- 984
            T  + ++++ AP          ++ R ++LKAFD+TK GVKGLVD+GI++IPRIF+    
Sbjct: 913  TKTMESTKI-AP----------SFDRASELKAFDETKTGVKGLVDSGISKIPRIFHHSSV 972

Query: 985  ----PETFDSDNISGETQIHIPVVDLDHIN-KNSLKRKYTIDIVREASEKLGFFQLVNHG 1044
                P+   SD +  +T   IP +DL   + ++++K K  I+ ++EA+ K GFFQ++NHG
Sbjct: 973  ELANPKPLPSDLLHLKT---IPTIDLGGRDFQDAIKHKNAIEGIKEAAAKWGFFQVINHG 1032

Query: 1045 IPVDVLEEMKDAVRRFNEQETESKKQYYTRDLTKPLIYNSNFDLYTAATTNWRDTFGYIS 1104
            + +++LE+MKD VR F+EQ  E +K  Y+RD  +  IY SNFDLYTAA  NWRDTF    
Sbjct: 1033 VSLELLEKMKDGVRDFHEQPPEVRKDLYSRDFGRKFIYLSNFDLYTAAAANWRDTFYCYM 1092

Query: 1105 APNSHNPQDLPEICRDILVDYSKRVMEIGNLLFELLSEALGLNPNYLKNIDCNEGLALVC 1164
            AP+   PQDLPEICRD++++YSK+VM +G  LFELLSEALGLNPN+LK+++C +GL ++C
Sbjct: 1093 APDPPEPQDLPEICRDVMMEYSKQVMILGEFLFELLSEALGLNPNHLKDMECLKGLRMLC 1152

Query: 1165 HYYPPCPQPNLAIGTSEHTDNDFITVLLQDQIGGLQIRYENKWVDVPPVAGALVVNIGDL 1224
            HY+PPCP+P+L  GTS+H+D  F+TVLL D I GLQ+  E  W DVP V GAL++NIGDL
Sbjct: 1153 HYFPPCPEPDLTFGTSKHSDGSFLTVLLPDNIEGLQVCREGYWFDVPHVPGALIINIGDL 1212

Query: 1225 MQ--------------------------------LITNDKFKSVKHRVLANKKGPRR--- 1284
            +Q                                LITNDKF S+KHRVLAN+    R   
Sbjct: 1213 LQASLRNQYMFASSDFTSLFTMLDLFKKSSSWFKLITNDKFISLKHRVLANRATRARVSV 1272

Query: 1285 --------NHNQRLHYSI----------PFKRSIMAVAASEINGLNLTPLSKADD----- 1344
                      N R++  I           ++ + +   A+  NG  L   S   D     
Sbjct: 1273 ACFFHTHVKPNPRVYGPIKELVSEENPPKYRETTIRDYATYFNGKGLGGTSALLDFKQLV 1332

Query: 1345 ----------------------------NFHRLTELKAFDDTKAGVKGLVDAKVTEIPRI 1404
                                        +F R +ELKAFD+TK GVKGLVD+ +++IPRI
Sbjct: 1333 PSLNLRKMFHSCSVIREAKTMETKNIAPSFDRASELKAFDETKTGVKGLVDSGISQIPRI 1392

Query: 1405 FYHP------PEEYDSVETQIR-IPLIDLDG-VGKDSLKRKHIVDQIRDASEELGFFQVI 1464
            F+H       PE   S    ++ IP IDL G V +D LK K+ +++I++A+E+ GFFQVI
Sbjct: 1393 FHHSSVKLANPEPVSSDLLHLKTIPTIDLGGRVFEDELKHKNAIEKIKEAAEKWGFFQVI 1452

Query: 1465 NHGISVSVLDEIKDSVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLYSAPTTNWRDTFS 1524
            NHG+S+ +L+++KD VR FHEQ  EV+K +Y+RDL + F Y+S             + +S
Sbjct: 1453 NHGVSLELLEKMKDGVRGFHEQSPEVRKDFYSRDLTRKFQYSSMSPFLILDL--MCNLYS 1512

Query: 1525 YVSAPNPPNPQELPEICSRDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGCSDGL 1584
            Y+              C RD+ ++YS+ VM +G+ +  LLSEALGLNPN+LN++ CS GL
Sbjct: 1513 YMVN------------CFRDVTIEYSEQVMNLGEFLFTLLSEALGLNPNHLNDMDCSKGL 1572

Query: 1585 EFVCHYYPACPHPKLTTGISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGALVVN 1644
              +CHYYP CP P LT G S+H D  F+TVLL D I GLQ+     W +V  V GAL++N
Sbjct: 1573 IMLCHYYPPCPEPDLTLGTSQHADNTFLTVLLPDQIEGLQVLREGYWFNVPHVPGALIIN 1632

Query: 1645 IGDLMQ----------------VTN--------------------------------ADE 1657
            IGDL+Q                +TN                                 D 
Sbjct: 1633 IGDLLQASLHNQYIFLLCFAELITNDKFVSLEHRVLANRATRARVSVAETKTMEMMKIDP 1692

BLAST of CmUC05G085020 vs. NCBI nr
Match: KAF9673550.1 (hypothetical protein SADUNF_Sadunf10G0035800 [Salix dunnii])

HSP 1 Score: 1182.5 bits (3058), Expect = 0.0e+00
Identity = 638/1395 (45.73%), Positives = 856/1395 (61.36%), Query Frame = 0

Query: 1280 VVNIGDLMQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVP 1339
            +V  G   +++  +  +DR +ELK FD+TKAGVKGLVD+G++++P+IF    +S      
Sbjct: 1    MVGSGRTAEISVQETSYDRRSELKAFDETKAGVKGLVDAGVSKVPQIFIHPSESSGHRTL 60

Query: 1340 GDTE--LSIPVIDLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGT 1399
              ++  + IPVIDLEAID+D  KR+ +V+KVR+ASE WGFFQ++NHG+PV VL+EM  G 
Sbjct: 61   STSKNPVDIPVIDLEAIDKDPIKRKGIVDKVRDASETWGFFQVINHGIPVGVLEEMVAGV 120

Query: 1400 LRFYEQDTQLKKQFYTRHNTKSIVYNSNFDLFTAPAANWRDTFLCFMAPNLPNPQDLPEI 1459
             RF+EQD ++KK FYTR  TK  VYNSNFDL TAP ANWRDTF  +MAP  P P++LPE 
Sbjct: 121  RRFFEQDIEMKKIFYTRDVTKRFVYNSNFDLHTAPFANWRDTFFSYMAPYPPKPKELPEA 180

Query: 1460 CRDILFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTL 1519
            CRDI+ ++SK++  LG  LFGLLSEALGL T++L  ++C  GLA++ HYYPACP+PELTL
Sbjct: 181  CRDIMMEFSKQVTSLGVSLFGLLSEALGLKTDHLEKMDCAEGLALISHYYPACPEPELTL 240

Query: 1520 GTTEHADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLS------------HFVI 1579
            GT++H+DNDFLTVLLQ DQIGGLQ+L+Q +WIDIPP+PGALV++               I
Sbjct: 241  GTSKHSDNDFLTVLLQ-DQIGGLQMLYQDQWIDIPPVPGALVINIGDLMQASFSFISICI 300

Query: 1580 LSIELFLQLISNDGFKSVEHRVLANRDGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNP 1639
            LS  L  QLISND FKSVEHRVLANR GPR+S+A FF      +S++YGPIKELLSE NP
Sbjct: 301  LSFLLNFQLISNDKFKSVEHRVLANRIGPRISVACFFSTSFQPSSKLYGPIKELLSEDNP 360

Query: 1640 AKYGETTLKDFYFYHNSRGLNGTSALQHFRLSLDDEGDATPIKDCFIQSSKQNKMANLTP 1699
              Y ETT+ ++  Y    GL+GTS L HF+LS+ +        D FI   +   +A+   
Sbjct: 361  PIYRETTVNEYLSYFYDHGLDGTSPLTHFKLSISN--------DKFI-GVEHGVLASFKE 420

Query: 1700 FSKLDQTFDRASELKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVP--EETHLGI 1759
              +    +DR SELKAFD TKAGVKGLVD+G+  +P IF+    +   ++P   E    I
Sbjct: 421  PRRSKMVYDRISELKAFDDTKAGVKGLVDAGITRVPRIFHDLRDDSDKTLPVAAEGKFRI 480

Query: 1760 PVVDLEDIDKDPFKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIE 1819
            PV+DLED+ K P +R+E+V ++R A+ETWGFF V+NHG+P  V  E+ +GV RFFEQD+E
Sbjct: 481  PVIDLEDVHKGPPQRKEIVDRVRNAAETWGFFAVVNHGIPVDVLGEMKDGVRRFFEQDVE 540

Query: 1820 VKKQYYTRDNTKPFVHNCNFDLFSAPVANWRDTFFTLMAPISPSPQDLPQVCRDILVEYS 1879
            +KKQ+++RD T+ F +N NFDLFS+  ANWRDTF  +MAP SP P++LP   RDI+++Y+
Sbjct: 541  LKKQHFSRDYTRKFGYNSNFDLFSSASANWRDTFSCVMAPGSPRPEELPAAFRDIIIQYT 600

Query: 1880 KQIMKLGELIFGLLSEALGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDNT 1939
            K +M+LG ++  LLSEALGL   +L D+DCN+GL+ILGHYYP CPQPEL++G T+H+DN 
Sbjct: 601  KGVMELGNILLELLSEALGLNPNYLKDIDCNKGLTILGHYYPACPQPELTVGATKHTDND 660

Query: 1940 FITVLLQDGMGGLQVRQHNKWVDVPPVPGAFVINVGSLLQ--------LITNDRFVSSEH 1999
            F+TVLLQD +GGLQV   N+WVDV P PGA +IN+G LLQ        LI+ND+F+S EH
Sbjct: 661  FLTVLLQDHIGGLQVMHQNQWVDVHPTPGALLINIGDLLQASHETLVMLISNDKFISVEH 720

Query: 2000 RVVANRKGPRVSVAGFFSTGSLPTSKLYGPIEELLSEQNPPRYKKISVKEYNLYFAEKGD 2059
            +V+ANR GPRVSVA FFST   P S+LYGPI++LLSE NPPRY++ +V++Y  YF  KG 
Sbjct: 721  KVLANRTGPRVSVASFFSTNLAPNSRLYGPIKDLLSEDNPPRYRETTVRDYVAYFNHKG- 780

Query: 2060 NSKPFSYNCNFDLFSAPSANWRDTIFTQMTPNSPNPQDLPQVCRDILIDYSKQMEKVGEL 2119
                                                           +D +  +     L
Sbjct: 781  -----------------------------------------------LDGTSAL-----L 840

Query: 2120 IFGLLSEALGLKSTHLVDLDCNQGHAILCHYYPPCPQPELTIGTTEHSDDTFITVLLQDH 2179
             F L  +ALGLK  HL ++   +G   +  YYP CPQP LT+  ++ +D  F+TVL + +
Sbjct: 841  HFKLNKKALGLKPNHLKNMSRAEGLYFIGQYYPACPQPNLTLSLSKQTDSAFLTVLSKTN 900

Query: 2180 IGGLQVLHHNKWVDIPPVPGAFVVNQLISNDKLVSSVHRVLANREGPRVSVACFFTTGAI 2239
                +    N  +D   +PGA     LI++ K     H V+A    PR+S   F      
Sbjct: 901  WVAFESTMKN--IDGTHIPGAIT---LITSGKFERVYHGVVAT--SPRISAREFL----- 960

Query: 2240 PTSKLYGPIKQLLSQQNPPKYRQITVREYDHLHAQKGLDGTHKQKMANLTPFSKLDQTFD 2299
                                                      K KMA      + D  +D
Sbjct: 961  ------------------------------------------KIKMAARRIQEENDSDYD 1020

Query: 2300 RASELKAFDQTKAGWSKSLWTPASQRSQEYSTAHSNTSLIPARPPFPTNPIWVFRWWIWK 2359
            R S+LKAFD TKAG    +    ++  + +       + +  RP   +N          +
Sbjct: 1021 RQSKLKAFDDTKAGVKGLVDAGVTKIPRIF--IQEQCTKVDDRP--ASN----------E 1080

Query: 2360 TSIKTPSNEEKWWTKSEK-----LQKRGGSSKCLTMGF----------RVQEEIINGAHR 2419
             +   P  + + W   E      +++ GG+  C   GF           V E++I+G HR
Sbjct: 1081 PNFNIPIIDVEGWDGDENRRHNIIEEIGGA--CKKWGFFQVVNHGIPRNVLEDMIDGIHR 1140

Query: 2420 FFEQDIEVKKQYYSRDYTKPFVYNCNFDLFSAPNANWRDTIFTQMTPNSPNPQDLPQVCS 2479
            F  QD+EVKK +Y+RDYT+  +YN NFDL+ AP A+WRDT+   M PN PNP++LP +C 
Sbjct: 1141 FHHQDVEVKKGFYTRDYTRKVLYNSNFDLYRAPAASWRDTLTIVMAPNPPNPEELPPICR 1200

Query: 2480 DILIDYSKQMEKLGEIIFGLFSEALGLKPTHLIDLDCNEGHAILCHYYPPCPQPELTIGA 2539
            DIL+DY+K++  LG  +F L SEALGLKP HL D+ C EG  +L H YP CP+PELT+G 
Sbjct: 1201 DILVDYTKRIMALGITLFELLSEALGLKPNHLKDIGCAEGLYVLGHCYPACPEPELTLGT 1260

Query: 2540 TEHSDSSFITVLLQDHIGGLQVLHNNEWADIPPVSGALVVN------------------- 2599
             +H+DS F+T+LLQD IGGLQVLH N+W ++ P  GALVVN                   
Sbjct: 1261 RKHADSGFLTLLLQDQIGGLQVLHENQWVNVTPAPGALVVNVGDLFQASSGYNFDPILHY 1261

Query: 2600 ----------LISNDKFVSSVHRVVANREGCPRVSVASFFTTGII-STSKLYGPIKQLLS 2606
                      +ISND F S  HRV+A   G PR+SVA  F   ++  TS++YGPIK+LLS
Sbjct: 1321 GTQAYCEHKTVISNDIFTSVHHRVLAKNIG-PRISVACIFRQPLLPETSRMYGPIKELLS 1261

BLAST of CmUC05G085020 vs. ExPASy Swiss-Prot
Match: Q84MB3 (1-aminocyclopropane-1-carboxylate oxidase homolog 1 OS=Arabidopsis thaliana OX=3702 GN=At1g06620 PE=2 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 1.2e-117
Identity = 199/346 (57.51%), Positives = 260/346 (75.14%), Query Frame = 0

Query: 25  RPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDLDDVHR 84
           R T +KAFD+TK GVKGL+DAGI EIP IF  PP    S      + + IP IDL     
Sbjct: 13  RSTLLKAFDETKTGVKGLIDAGITEIPSIFRAPPATLTSPKPPSSSDFSIPTIDLKGGGT 72

Query: 85  NSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYYTRDN 144
           +S+ R+  + ++ +A+EK GFFQ+INHGIP+ VLE++   ++ F+EQDTEVKK +Y+RD 
Sbjct: 73  DSITRRSLVEKIGDAAEKWGFFQVINHGIPMDVLEKMIDGIREFHEQDTEVKKGFYSRDP 132

Query: 145 TKPLIYNSNFDLYSASTTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVMEIGKLL 204
              ++Y+SNFDL+S+   NWRDTLG  +AP+PP P+DLP    + +++YSK VM++GKLL
Sbjct: 133 ASKMVYSSNFDLFSSPAANWRDTLGCYTAPDPPRPEDLPATCGEMMIEYSKEVMKLGKLL 192

Query: 205 FELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITVLFQDNI 264
           FELLSEALGLN N+L ++ C+  L +  HYYPPCPQP+LTLG ++HSDN F+T+L QD+I
Sbjct: 193 FELLSEALGLNTNHLKDMDCTNSLLLLGHYYPPCPQPDLTLGLTKHSDNSFLTILLQDHI 252

Query: 265 GGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRISVASFFS 324
           GGLQ+ H + WVDVPPV GALVVN+G+L+QLITND+FISV HRVLA   GPRISVA FFS
Sbjct: 253 GGLQVLHDQYWVDVPPVPGALVVNVGDLLQLITNDKFISVEHRVLANVAGPRISVACFFS 312

Query: 325 TLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGL-GTS 370
           +    + +VYGPIKE+LSEENPP YR+TTI ++   YR+ G  GTS
Sbjct: 313 SYLMANPRVYGPIKEILSEENPPNYRDTTITEYAKFYRSKGFDGTS 358
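
The "Identity" and "Positives" figures in each HSP header are fractions over the alignment length, followed by the same value as a percentage. The sketch below is not part of the database output; it is a minimal, hypothetical helper (the names `parse_hsp_summary` and `pct` are assumptions) that parses one such header line and recomputes the percentages. The pattern accepts both the "Postives" spelling that appears in the raw report and the corrected "Positives".

```python
import re

# Matches the HSP summary line as it appears in this report.
# "Posi?tives" tolerates the misspelling "Postives" found in the raw output.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def pct(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to 2 decimals."""
    return round(100.0 * numerator / denominator, 2)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Recomputed values, for cross-checking the reported ones.
        "identity_pct": pct(int(ident), int(ident_len)),
        "positives_pct": pct(int(pos), int(pos_len)),
    }


if __name__ == "__main__":
    header = ("Identity = 199/346 (57.51%), Postives = 260/346 (75.14%), "
              "Query Frame = 0")
    print(parse_hsp_summary(header))
```

For the Q84MB3 hit above, 199 identical positions over an alignment length of 346 gives 57.51%, and 260 positive-scoring positions give 75.14%, matching the reported values.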

BLAST of CmUC05G085020 vs. ExPASy Swiss-Prot
Match: Q8H1S4 (1-aminocyclopropane-1-carboxylate oxidase homolog 3 OS=Arabidopsis thaliana OX=3702 GN=At1g06650 PE=2 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 1.9e-115
Identity = 199/353 (56.37%), Positives = 265/353 (75.07%), Query Frame = 0

Query: 1688 KLDQTFDRASELKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPEETHL----GI 1747
            K+D  FDRASELKAFD+TK GVKGLVDSGV+++P IF+ P  + S   P  + L     I
Sbjct: 5    KIDPLFDRASELKAFDETKTGVKGLVDSGVSQVPRIFHHPTVKLSTPKPLPSDLLHLKTI 64

Query: 1748 PVVDLEDID-KDPFKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDI 1807
            P +DL   D +D  KR   + +I+EA+  WGFFQV+NHGV   + E++  GV  F EQ  
Sbjct: 65   PTIDLGGRDFQDAIKRNNAIEEIKEAAAKWGFFQVINHGVSLELLEKMKKGVRDFHEQSQ 124

Query: 1808 EVKKQYYTRDNTKPFVHNCNFDLFSAPVANWRDTFFTLMAPISPSPQDLPQVCRDILVEY 1867
            EV+K++Y+RD ++ F++  NFDLFS+P ANWRDTF   MAP +P PQDLP++CRDI++EY
Sbjct: 125  EVRKEFYSRDFSRRFLYLSNFDLFSSPAANWRDTFSCTMAPDTPKPQDLPEICRDIMMEY 184

Query: 1868 SKQIMKLGELIFGLLSEALGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDN 1927
            SKQ+M LG+ +F LLSEALGL+  HL D+DC++GL +L HYYPPCP+P+L++GT++HSDN
Sbjct: 185  SKQVMNLGKFLFELLSEALGLEPNHLNDMDCSKGLLMLSHYYPPCPEPDLTLGTSQHSDN 244

Query: 1928 TFITVLLQDGMGGLQVRQHNKWVDVPPVPGAFVINVGSLLQLITNDRFVSSEHRVVANR- 1987
            +F+TVLL D + GLQVR+   W DVP V GA +IN+G LLQLITND+F+S EHRV+ANR 
Sbjct: 245  SFLTVLLPDQIEGLQVRREGHWFDVPHVSGALIINIGDLLQLITNDKFISLEHRVLANRA 304

Query: 1988 KGPRVSVAGFFSTGSLPTSKLYGPIEELLSEQNPPRYKKISVKEYNLYFAEKG 2035
               RVSVA FF+TG  P  ++YGPI EL+SE+NPP+Y++ ++K+Y  YF  KG
Sbjct: 305  TRARVSVACFFTTGVRPNPRMYGPIRELVSEENPPKYRETTIKDYATYFNAKG 357

BLAST of CmUC05G085020 vs. ExPASy Swiss-Prot
Match: P10967 (1-aminocyclopropane-1-carboxylate oxidase homolog OS=Solanum lycopersicum OX=4081 GN=ACO3 PE=2 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 1.8e-113
Identity = 201/372 (54.03%), Positives = 263/372 (70.70%), Query Frame = 0

Query: 1287 MQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPGDTELSI 1346
            M+    +E +D+ +ELK FDDTKAGVKGLVDSGIT++P+IF   P         +T    
Sbjct: 1    MESPRVEESYDKMSELKAFDDTKAGVKGLVDSGITKVPQIFVLPPKDRAKKC--ETHFVF 60

Query: 1347 PVIDLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYEQDTQ 1406
            PVIDL+ ID D  K +++V+KVR+ASEKWGFFQ+VNHG+P SVLD   +GT +F+EQD +
Sbjct: 61   PVIDLQGIDEDPIKHKEIVDKVRDASEKWGFFQVVNHGIPTSVLDRTLQGTRQFFEQDNE 120

Query: 1407 LKKQFYTRHNTKSIVYNSNFDLF--TAPAANWRDTFLCFMAPNLPNPQDLPEICRDILFD 1466
            +KKQ+YTR   K +VY SN DL+  + PAA+WRDT  C+MAPN P+ Q+ P  C + L D
Sbjct: 121  VKKQYYTRDTAKKVVYTSNLDLYKSSVPAASWRDTIFCYMAPNPPSLQEFPTPCGESLID 180

Query: 1467 YSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLGTTEHAD 1526
            +SK++KKLG  L  LLSE LGL+ +YL D      L   C+YYP CPQPELT+GT +H D
Sbjct: 181  FSKDVKKLGFTLLELLSEGLGLDRSYLKDYMDCFHLFCSCNYYPPCPQPELTMGTIQHTD 240

Query: 1527 NDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISNDGFKSVE 1586
              F+T+LLQDD +GGLQVLHQ  W+D+PP PG+LV      ++I  FLQL+SND + SVE
Sbjct: 241  IGFVTILLQDD-MGGLQVLHQNHWVDVPPTPGSLV------VNIGDFLQLLSNDKYLSVE 300

Query: 1587 HRVLANRDGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKDFYFYHNSRG 1646
            HR ++N  G R+SI  FFG   Y +S++YGPI ELLSE NP KY  TT+KD   Y ++RG
Sbjct: 301  HRAISNNVGSRMSITCFFGESPYQSSKLYGPITELLSEDNPPKYRATTVKDHTSYLHNRG 360

Query: 1647 LNGTSALQHFRL 1657
            L+GTSAL  +++
Sbjct: 361  LDGTSALSRYKI 363

BLAST of CmUC05G085020 vs. ExPASy Swiss-Prot
Match: Q9C5K7 (1-aminocyclopropane-1-carboxylate oxidase homolog 2 OS=Arabidopsis thaliana OX=3702 GN=At1g06640 PE=2 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 4.8e-111
Identity = 200/376 (53.19%), Positives = 268/376 (71.28%), Query Frame = 0

Query: 1287 MQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPGDTEL-- 1346
            M+ T     FDRA+ELK FD+TK GVKGLVDSGI++IPRIF+       +P P  ++L  
Sbjct: 1    MESTKIAPSFDRASELKAFDETKTGVKGLVDSGISKIPRIFHHSSVELANPKPLPSDLLH 60

Query: 1347 --SIPVIDLEAID-RDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFY 1406
              +IP IDL   D +D+ K ++ +  ++EA+ KWGFFQ++NHGV + +L++MK G   F+
Sbjct: 61   LKTIPTIDLGGRDFQDAIKHKNAIEGIKEAAAKWGFFQVINHGVSLELLEKMKDGVRDFH 120

Query: 1407 EQDTQLKKQFYTRHNTKSIVYNSNFDLFTAPAANWRDTFLCFMAPNLPNPQDLPEICRDI 1466
            EQ  +++K  Y+R   +  +Y SNFDL+TA AANWRDTF C+MAP+ P PQDLPEICRD+
Sbjct: 121  EQPPEVRKDLYSRDFGRKFIYLSNFDLYTAAAANWRDTFYCYMAPDPPEPQDLPEICRDV 180

Query: 1467 LFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLGTTE 1526
            + +YSK++  LG  LF LLSEALGLN N+L D+EC +GL +LCHY+P CP+P+LT GT++
Sbjct: 181  MMEYSKQVMILGEFLFELLSEALGLNPNHLKDMECLKGLRMLCHYFPPCPEPDLTFGTSK 240

Query: 1527 HADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISNDGFK 1586
            H+D  FLTVLL D+ I GLQV  +  W D+P +PGAL      I++I   LQLI+ND F 
Sbjct: 241  HSDGSFLTVLLPDN-IEGLQVCREGYWFDVPHVPGAL------IINIGDLLQLITNDKFI 300

Query: 1587 SVEHRVLANR-DGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKDFYFYH 1646
            S++HRVLANR    RVS+A FF   V    +VYGPIKEL+SE+NP KY ETT++D+  Y 
Sbjct: 301  SLKHRVLANRATRARVSVACFFHTHVKPNPRVYGPIKELVSEENPPKYRETTIRDYATYF 360

Query: 1647 NSRGLNGTSALQHFRL 1657
            N +GL GTSAL  F++
Sbjct: 361  NGKGLGGTSALLDFKV 369

BLAST of CmUC05G085020 vs. ExPASy Swiss-Prot
Match: Q9LTH8 (1-aminocyclopropane-1-carboxylate oxidase homolog 11 OS=Arabidopsis thaliana OX=3702 GN=At5g59530 PE=2 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 1.4e-110
Identity = 203/358 (56.70%), Positives = 259/358 (72.35%), Query Frame = 0

Query: 16  LSKADENYHRPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIP 75
           ++K    + R  E KAFD+TK GVKGL+DA I EIPRIF+  P+D   D     +  +IP
Sbjct: 1   MAKNSVEFDRYIERKAFDNTKEGVKGLIDAKITEIPRIFH-VPQDTLPDKKRSVSDLEIP 60

Query: 76  VIDLDDVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFN-EQDTE 135
            ID   V+ ++  R+  + +V+ A E  GFFQ+INHG+P+ VLEE+K  V+RF+ E+D E
Sbjct: 61  TIDFASVNVDTPSREAIVEKVKYAVENWGFFQVINHGVPLNVLEEIKDGVRRFHEEEDPE 120

Query: 136 VKKQYYTRDNTK-PLIYNSNFDLYSAS-TTNWRDTLGYISAPNPPNPQDLPEIIRDNLVD 195
           VKK YY+ D TK    Y+SNFDLYS+S +  WRD++    AP+PP P++LPE  RD +++
Sbjct: 121 VKKSYYSLDFTKNKFAYSSNFDLYSSSPSLTWRDSISCYMAPDPPTPEELPETCRDAMIE 180

Query: 196 YSKRVMEIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSD 255
           YSK V+ +G LLFELLSEALGL    L  + C + L + CHYYPPCPQP+LTLG S+HSD
Sbjct: 181 YSKHVLSLGDLLFELLSEALGLKSEILKSMDCLKSLLMICHYYPPCPQPDLTLGISKHSD 240

Query: 256 NVFITVLFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKK 315
           N F+TVL QDNIGGLQI HQ  WVDV P+ GALVVN+G+ +QLITND+FISV HRVLA  
Sbjct: 241 NSFLTVLLQDNIGGLQILHQDSWVDVSPLPGALVVNVGDFLQLITNDKFISVEHRVLANT 300

Query: 316 EGPRISVASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGL-GTS 370
            GPRISVASFFS+    +S VYGP+KEL+SEENPPKYR+TT+R++   Y   GL GTS
Sbjct: 301 RGPRISVASFFSSSIRENSTVYGPMKELVSEENPPKYRDTTLREYSEGYFKKGLDGTS 357

BLAST of CmUC05G085020 vs. ExPASy TrEMBL
Match: A0A3Q7I749 (Uncharacterized protein OS=Solanum lycopersicum OX=4081 PE=3 SV=1)

HSP 1 Score: 2556.6 bits (6625), Expect = 0.0e+00
Identity = 1382/3275 (42.20%), Positives = 1937/3275 (59.15%), Query Frame = 0

Query: 21   ENYHRPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDLD 80
            ++Y + +E+KAFDDTKAGVKGLVDAGI ++P+IF  PP +      + E Q+  PVID +
Sbjct: 230  KSYDKMSELKAFDDTKAGVKGLVDAGITKVPQIFILPPNNRTESLDTSEKQFIFPVIDFE 289

Query: 81   DVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYY 140
             +  + +KRK+ + +VR+ASE  GFFQ++NHGIP  VLE +    + F EQD EVKKQYY
Sbjct: 290  GIDEDPIKRKEIVGKVRDASETWGFFQVVNHGIPTSVLEGMLQGTREFFEQDIEVKKQYY 349

Query: 141  TRDNTKPLIYNSNFDLYSAS--TTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVM 200
            TRD  K +++NSNFDLYS S    NWRD+  +  APNPP+P++ P   R+ L+DYSK VM
Sbjct: 350  TRDIMKKVVHNSNFDLYSPSVPAANWRDSFCFSMAPNPPSPEEFPRPCREILMDYSKNVM 409

Query: 201  EIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITV 260
            E+G  L  LLSE LGL+P +L ++ C +GL +  HYYPPCPQP LT+GT+ HSDN FITV
Sbjct: 410  ELGCSLLGLLSEGLGLDPCHLEDMDCVKGLGVVGHYYPPCPQPELTIGTNTHSDNDFITV 469

Query: 261  LFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRIS 320
            L QD+IGGLQ+ HQ +WVD+PP + ALV        LI+ND++ SV HRVL+ K GPRIS
Sbjct: 470  LLQDHIGGLQVLHQNQWVDIPPTSAALV--------LISNDKYTSVEHRVLSNKVGPRIS 529

Query: 321  VASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGLGTSKAKRSTMAV 380
            VASFF+T  + S K+YGPI ELLSE+NPPKYR TT+ D+   +R                
Sbjct: 530  VASFFTTGPFPSPKLYGPIAELLSEDNPPKYRATTVTDYSDYFR---------------- 589

Query: 381  AASRVNGANLTPLSEADENYHRPTELRAFDDTKAGVKGLVDAGITEIPRIFYFPPEDYNS 440
                          + ++NY + +EL+AFDDTKAGVKG+VD+GIT++P+IF  PP+    
Sbjct: 590  -------------KKVEKNYDKMSELKAFDDTKAGVKGIVDSGITKVPQIFILPPKKRPE 649

Query: 441  DNATVEIQIQIPVIDFDHVGRNSLKRKYTIDRIREASEKLGFFQLINHGIPVSVLEEMKD 500
             + T E Q   PVID + +  + +K K  +D +R+ASE  GFFQ++NHGIP SVLEEM  
Sbjct: 650  LSDTNETQFIFPVIDIEGIDEDPIKHKEIVDNVRDASETWGFFQVVNHGIPTSVLEEMMQ 709

Query: 501  AVRRFHEQETELKKQYYTRDLTKPLIYTSNFDLYSAA--TTNWRDAFRYVSSPNAHDPQV 560
              R+F EQ+ E+KKQYY+RD TK +I+TSNFDLYS++    NWRD    + +P+   P+ 
Sbjct: 710  GTRQFFEQDVEVKKQYYSRDTTKRVIHTSNFDLYSSSVPAANWRDTLFCLMAPDPPSPEE 769

Query: 561  LPEICRDILVEYSKQVMEIGKLVFELLSEALGLNPNYLNDIDCSEGLAFVCHYYPPCPQP 620
            LP  C +IL++YSK VM++G  + ELLSE LGL+  +L D+DC+EGL  + HYYP CPQP
Sbjct: 770  LPTACGEILMQYSKDVMKLGFSLLELLSEGLGLDRCHLKDMDCAEGLGILGHYYPACPQP 829

Query: 621  NLAIGTSEHTDNGFITVLLQDHIGGLQIRHGNNWVDIPPVARALV------RSTMAVVAS 680
             LAIGT++H+DN FITVLLQDHIGGLQ+ H N WV++PP   ALV          ++ +S
Sbjct: 830  ELAIGTNKHSDNDFITVLLQDHIGGLQVLHQNQWVNVPPTPGALVVNIGDLLQASSMPSS 889

Query: 681  RVNAPNLTPLSKADE--------------------------------NYHRPTDLKAFDD 740
            ++  P +T L   D                                 +Y R ++LKAFDD
Sbjct: 890  KLYGP-ITELLSEDNPPKYRATTVKDYRDYFRKKVSSTDDFEARVPGSYDRMSELKAFDD 949

Query: 741  TKAGVKGLVDAGITEIPRIFYRPPETFDSDNISGETQIHIPVVDLDHINKNSLKRKYTID 800
            TKAGVKG+VDAGITE+PRIF +P +  +    + ET+   PV+DL+ I+K+ +K K  +D
Sbjct: 950  TKAGVKGIVDAGITEVPRIFVQPTKIEECVR-NCETKFIFPVIDLEGIDKDPIKHKEIVD 1009

Query: 801  IVREASEKLGFFQLVNHGIPVDVLEEMKDAVRRFNEQETESKKQYYTRDLTKPLIYNSNF 860
             VR+ASE  GFFQ+VNHGIP+ V+EEM    RRF EQ  + KKQYYTRD TK +++ SNF
Sbjct: 1010 RVRDASETWGFFQVVNHGIPLSVMEEMLQGTRRFFEQHVDIKKQYYTRDNTKKIVHVSNF 1069

Query: 861  DLYTAA--TTNWRDTFGYISAPNSHNPQDLPEICRDILVDYSKRVMEIGNLLFELLSEAL 920
            DLY+ +   TNWRD+   + APN  +P++LP  CR+IL+++S  VM +G  LFELLSE L
Sbjct: 1070 DLYSPSVPATNWRDSIFCLMAPNHPSPEELPTACREILMEFSNHVMTLGKSLFELLSEGL 1129

Query: 921  GLNPNYLKNIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQDQIGGLQIRYE 980
            GLNP++L +IDC EGL ++ HYYP CPQP L IGT++H+DNDFITVLLQDQIGGLQ+ +E
Sbjct: 1130 GLNPSHLNDIDCAEGLRVLGHYYPACPQPELTIGTNKHSDNDFITVLLQDQIGGLQVLHE 1189

Query: 981  NKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKKGPRRNHNQRLHYSIPFKRS 1040
             +W+DVPP  GALVVNIGDL+QLI+NDK+ SV+HRVL+NK GPR +      Y+     +
Sbjct: 1190 TQWIDVPPTPGALVVNIGDLLQLISNDKYLSVEHRVLSNKVGPRIS-VACFFYTGSLPTT 1249

Query: 1041 IMAVAASEINGLNLTPLSKA-----------DDNFHRLTELKAFDDTKAGVKGLVDAKVT 1100
             +     E+   +  P  +A           +  + R +E+  FDD+K GVKGL+DA VT
Sbjct: 1250 KLYGPIKELLSDDSPPKYRATTVKDYADYFREKEYDRKSEVHLFDDSKMGVKGLLDAGVT 1309

Query: 1101 EIPRIFYHPPEEYDS-------VETQIRIPLIDLDGVGKDSLKRKHIVDQIRDASEELGF 1160
            ++PRIF H   EY S       V ++  IP++D  G+   + +R  IV +I++A E  GF
Sbjct: 1310 KLPRIFLH--NEYVSEKKSDPDVTSKFSIPVVDFQGLENSAAERADIVREIKNACENWGF 1369

Query: 1161 FQVINHGISVSVLDEIKDSVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLYSAPTTNWR 1220
            FQ+++H I  S+ +++ + VR FHEQD+EVKK++Y+RD+ +   YNSNFDL  +PT NWR
Sbjct: 1370 FQIVHHEIPSSIKEKVLEGVRHFHEQDSEVKKEFYSRDVTRKVTYNSNFDLLKSPTANWR 1429

Query: 1221 DTFSYVSAPNPPNPQELPEICSRDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGC 1280
            DT   V  PNPP+P+E+P +C R++L++Y+K++MK+G  + ELLSEAL L  ++L ++ C
Sbjct: 1430 DTLYCVMDPNPPDPEEIPNVC-REVLIEYTKYIMKLGLTLFELLSEALQLKSDHLKDMEC 1489

Query: 1281 SDGLEFVCHYYPACPHPKLTTGISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGA 1340
            ++GL    HYYPACP P+LT G+S HTD+ F+T++LQD IGGLQ+ H + W+DV  + GA
Sbjct: 1490 AEGLFITGHYYPACPEPELTLGLSGHTDSGFLTIVLQDQIGGLQVFHKDQWVDVPFLPGA 1549

Query: 1341 LVV----------------NIG-------------------------------------- 1400
            L++                N+G                                      
Sbjct: 1550 LILITNDKFKSVLHRVLAKNVGPRISVAKCIFKKEVNRDYMVRSRSCYPKKTLQSTGKQA 1609

Query: 1401 -------DLMQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSP 1460
                   +L         +DR  ELK FDDTK GVKGLVDSGI +IPRIF R  D  V  
Sbjct: 1610 EKKHICPELQSQAELTMEYDRLMELKAFDDTKTGVKGLVDSGIVEIPRIFIRPSDELVQE 1669

Query: 1461 VP-GDTELSIPVIDLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKG 1520
            +  G   L  PVID   I+     R  VV+++REASEKWGFFQL+NHG+P SVL+ M  G
Sbjct: 1670 LTHGKPTLHCPVIDFSGIE-VQDHRNKVVDEIREASEKWGFFQLINHGIPSSVLERMIDG 1729

Query: 1521 TLRFYEQDTQLKKQFYTRHNT-KSIVYNSNFDLFTAPAANWRDTF-LCFMAPNLPNPQDL 1580
              +F+EQD ++KK++Y+R  T + + Y SNFDL+ + +ANWRDT  +  +  +   P++L
Sbjct: 1730 IRKFHEQDAEVKKEYYSRDFTSRRVRYESNFDLYQSKSANWRDTLNISLLHSSHIEPEEL 1789

Query: 1581 PEICRDILFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPE 1640
            P +CR+++ +Y   + KLG  +F +LSEALGL  ++L ++EC++G +V+CHYYPACPQPE
Sbjct: 1790 PAVCRNVIVEYINHVTKLGETIFCILSEALGLKPDHLKEMECNKGKSVVCHYYPACPQPE 1849

Query: 1641 LTLGTTEHADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQL 1700
            LTLG   H D  FLTVLLQ DQIGGLQVLH  +WID+ P+      S  ++++I   LQ+
Sbjct: 1850 LTLGAANHTDPSFLTVLLQ-DQIGGLQVLHNNQWIDVKPV------SQGLVVNIGEALQI 1909

Query: 1701 ISNDGFKSVEHRVLANRDGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLK 1760
            +SND F S  HRVLAN  GPR+S+A FF  G +   ++YGPIK+L+S++NP  Y E T+ 
Sbjct: 1910 LSNDKFVSANHRVLANGVGPRMSVACFFN-GSFAQPKIYGPIKDLISDENPLLYKEFTVT 1969

Query: 1761 DFYFYHNSRGLNGTSALQHFRLSLDDEGDATPIKDCFIQSSKQNKMANLTPFSKLDQTFD 1820
            D+     SR                                         P  +L+  +D
Sbjct: 1970 DYIAKFMSR-----------------------------------------PLGELEMDYD 2029

Query: 1821 RASELKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPE-ETHLGIPVVDLEDIDK 1880
             + E+KA D TKAG+KGLVDSG+ EIP IF  PP E +  +   ++ L +PVVDL  I+ 
Sbjct: 2030 PSDEVKAIDGTKAGIKGLVDSGIVEIPRIFIRPPHELAEELNMCKSTLQVPVVDLSGIEV 2089

Query: 1881 DPFKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIEVKKQYYTRDN 1940
            +  +R+++V +IR+ SE WGFFQV+NHGVP+SV E +I+G  +F EQD+EVKK+YY+ D 
Sbjct: 2090 ED-RRKKIVDEIRDVSEKWGFFQVINHGVPSSVLEGMIDGTRKFHEQDVEVKKEYYSSDP 2149

Query: 1941 TKPFVHNCNFDLFS--APVANWRDTF-FTLMAPISPSPQDLPQVCRDILVEYSKQIMKLG 2000
            T+   +  N  + +     A W+D+   + +      P+++P VCR   +EY   + KLG
Sbjct: 2150 TRGVRYESNLQVLTNKGRTATWKDSLHISALVSGYVEPEEIPPVCRRTFLEYKNHVTKLG 2209

Query: 2001 ELIFGLLSEALGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDNTFITVLLQ 2060
            +++ GLLSEALGLKS HL   +C++GL++  HYYP CPQPEL++G+ +H+D  F T+LLQ
Sbjct: 2210 DVLLGLLSEALGLKSDHLKAAECDKGLALACHYYPACPQPELTLGSGKHTDPVFFTILLQ 2269

Query: 2061 DGMGGLQVRQHNKWVDVPPVPGAFVINVGSLLQLITNDRFVSSEHRVVANRKGPRVSVAG 2120
            D +GGLQV   N+W DV P+    V+N+G LLQ+++ND+FVS  HRVVA  +GPR+SVA 
Sbjct: 2270 DQIGGLQVMNDNQWADVEPIEHGLVVNIGDLLQILSNDKFVSVIHRVVAKNRGPRISVAC 2329

Query: 2121 FFSTGSLPTSKLYGPIEELLSEQNPPRYK--------------------KISVK------ 2180
            FF TG     K+YGPI+EL+SE+NPP YK                    K  VK      
Sbjct: 2330 FF-TGVSSPPKMYGPIKELISEENPPLYKDFLSLMEYEPSDEVKAIDDTKAGVKGLVDSG 2389

Query: 2181 ------------------------------------------------------------ 2240
                                                                        
Sbjct: 2390 IVEIPRIFIKPSHELAEELNMCKSTLQVPVVDLSGLEVEDGRKKIVDEIREASEKWGFFQ 2449

Query: 2241 ----------------------EYNLYFAEKGDNSKPFSYNCNFD------LFSAPSANW 2300
                                  E ++   +K  +S P +    +D           +ANW
Sbjct: 2450 LINHGVPSSVLEGMIDGTRKFHEQDVEVKKKYYSSDPTARRVRYDSNLLQYKTKGKTANW 2509

Query: 2301 RDTIF-TQMTPNSPNPQDLPQVCRDILIDYSKQMEKVGELIFGLLSEALGLKSTHLVDLD 2360
            +D+++ + +      P+++P+VCR   ++Y   + K+ +++ GLLSEALGL+S HL   +
Sbjct: 2510 KDSLYISGLISGHIEPEEIPEVCRKTSLEYINHVIKLEDILLGLLSEALGLESNHLKATE 2569

Query: 2361 CNQGHAILCHYYPPCPQPELTIGTTEHSDDTFITVLLQDHIGGLQVLHHNKWVDIPPVPG 2420
            C++G  + CHYYP CPQPELT+GT +H+D  F+T+LLQD  GGLQV+  N+W D+ P+  
Sbjct: 2570 CDKGQMLACHYYPACPQPELTLGTGKHTDPVFLTILLQDQSGGLQVMCDNQWADVTPIKH 2629

Query: 2421 AFVV----------NQLISNDKLVSSVHRVLANREGPRVSVACFFTTGAIPTSKLYGPIK 2480
              VV            ++SNDK VS+ HRV+AN+ GPR+SVACFF++    + K++ PIK
Sbjct: 2630 GLVVCNKTQLLLFSQIIVSNDKFVSATHRVVANKVGPRISVACFFSS---ESPKMFSPIK 2689

Query: 2481 QLLSQQNPPKYRQITVREYDHLHAQKGLDGT---HKQKMANLTPFSK----LDQTFDRA- 2540
            +L+S++NPP Y+   V +Y      K LD T   H        P  K    +DQ F+ A 
Sbjct: 2690 ELISEENPPLYKDFIVADYLAKFFSKPLDKTETEHNVTKQQQNPSEKTIIHIDQIFELAM 2749

Query: 2541 ------SELKAFDQTKAGWSKSLWT-----------PASQRSQEYSTAHSNTSLIPARPP 2600
                  +E+KA D TKAG    + T           P  + ++E +   S+T  +P    
Sbjct: 2750 DYEEGWAEIKAIDDTKAGVKGLVDTGVVEIPRIFVRPPHELAEELNMCKSSTLQVPV--- 2809

Query: 2601 FPTNPIWVFRWWIWKTSIKTPSNEEKWWTKSEKLQKRGGSSKCLTMGF--RVQEEIINGA 2660
                        +  + ++     +K   +  +  ++ G  + +  G    V E +I+G 
Sbjct: 2810 ------------VDLSGVEFEDRRKKIVDEIREACEKWGFLQVINHGIPSSVLEGMIDGI 2869

Query: 2661 HRFFEQDIEVKKQYYSRDYTKPFVYNCNFDLF--SAPNANWRDTIFTQMTPNSPNPQDLP 2720
             +F EQD+EVKK+YYS D T+   Y+ N  ++     + +W+DT+F     +   P+ +P
Sbjct: 2870 RKFHEQDVEVKKEYYSSDLTREVRYDSNLHVYKTKGTSVSWKDTLFISAVVS--KPEQIP 2929

Query: 2721 QVCSDILIDYSKQMEKLGEIIFGLFSEALGLKPTHLIDLDCNEGHAILCHYYPPCPQPEL 2780
            +VC    ++Y   ++KL +I+ GL SEALGLKP HL   +C++G  + CHYYP CPQPEL
Sbjct: 2930 RVCRKTSLEYINHVKKLADILLGLLSEALGLKPDHLKTAECDKGQVLACHYYPACPQPEL 2989

Query: 2781 TIGATEHSDSSFITVLLQDHIGGLQVLHNNEWADIPPVSGALVVN------LISNDKFVS 2840
            T+G  +HSD SFIT++LQD  GGLQV+H+N+ AD+ P+   LVVN      ++SNDKFVS
Sbjct: 2990 TLGTAKHSDPSFITIVLQDQSGGLQVMHDNQLADVTPIKHGLVVNIGDLLQILSNDKFVS 3049

Query: 2841 SVHRVVANREGCPRVSVASFFTTGIISTSKLYGPIKQLLSEQNPPKYTQITVKEYHWAFK 2900
            + HRVVAN+   PR+SVASFF  G+++ SK+YGPI++L+SE+NPP Y    V +Y    K
Sbjct: 3050 ANHRVVANKIR-PRISVASFF-NGLLAPSKMYGPIEELISEENPPLYKNFQVVDY--VTK 3109

Query: 2901 FEVETN----------------IMAN---------------------------------- 2923
            F  + N                I++N                                  
Sbjct: 3110 FSTKYNFFVLIFSQTITRLNLIILSNDKFVSAIHRVVAKKVGPRISVACFFNGLLAPSKM 3169

BLAST of CmUC05G085020 vs. ExPASy TrEMBL
Match: A0A4D6M4H9 (2-oxoglutarate-dependent dioxygenase OS=Vigna unguiculata OX=3917 GN=DEO72_LG5g3044 PE=4 SV=1)

HSP 1 Score: 1151.0 bits (2976), Expect = 0.0e+00
Identity = 584/1412 (41.36%), Positives = 853/1412 (60.41%), Query Frame = 0

Query: 20   DENYHRPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDL 79
            D +Y R  E+KAFD+TK GVKGL+D+GI +IPR+FY    +  ++    + ++ +P+IDL
Sbjct: 11   DSSYDRIAEVKAFDETKLGVKGLLDSGITKIPRMFYHAKVNDNTETTPNDLKFNVPIIDL 70

Query: 80   DDVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQY 139
             D+  NS  R   +++VR A ++ GFFQ++NHGI V VL+E+   ++RF+EQD EV+K +
Sbjct: 71   KDIDTNSSLRVKALDKVRRACKEWGFFQVVNHGIGVEVLDEMLCGIQRFHEQDAEVRKTF 130

Query: 140  YTRDNTKPLIYNSNFDLYSASTTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVME 199
            Y+RD +K + Y SN  L+     NWRD++ + S+P+PPNP+++P + RD +V+Y+ ++  
Sbjct: 131  YSRDRSKKVRYFSNGSLFRDPAANWRDSIAFFSSPDPPNPEEIPAVCRDIVVEYTDKIRA 190

Query: 200  IGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITVL 259
             G  +FEL SEALGL  +YLNE+   +G    CHYYPPCP+P LT+GTS+H+D  F+T+L
Sbjct: 191  FGLTMFELFSEALGLPTSYLNELDSIKGEFHLCHYYPPCPEPELTMGTSKHTDISFMTIL 250

Query: 260  FQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRISV 319
             QD+IGGL++ H+ +WVDV PV G+L         L+TND FISV HRVL++  GPRISV
Sbjct: 251  LQDHIGGLEVLHENQWVDVHPVHGSL---------LLTNDMFISVYHRVLSRDVGPRISV 310

Query: 320  ASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGLGTSKAKRSTMAVA 379
            ASFF++                   + P+Y    +    + Y    + T +++ S     
Sbjct: 311  ASFFTS-------------------SFPEYVSKVVGKTCIFYSFLDMATIESENS----- 370

Query: 380  ASRVNGANLTPLSEADENYHRPTELRAFDDTKAGVKGLVDAGITEIPRIFYFPPEDYNSD 439
                           D +Y R  E++AFD+TK GVKGL+D+GIT+IPR+FY    + N++
Sbjct: 371  --------------KDSSYDRIAEVKAFDETKLGVKGLLDSGITKIPRMFYHAKVNDNTE 430

Query: 440  NATVEIQIQIPVIDFDHVGRNSLKRKYTIDRIREASEKLGFFQLINHGIPVSVLEEMKDA 499
                +++  +P+ID   +  NS  R   +D++R A ++ GFFQ++NHGI V VL+EM   
Sbjct: 431  TTPNDLKFNVPIIDLKDIDTNSSLRVKALDKVRRACKEWGFFQVVNHGIGVEVLDEMLCG 490

Query: 500  VRRFHEQETELKKQYYTRDLTKPLIYTSNFDLYSAATTNWRDAFRYVSSPNAHDPQVLPE 559
            ++RFHEQ+ E++K +Y+RD +K + Y SN  L+     NWRD+  + SSP+  +P+ +P 
Sbjct: 491  IQRFHEQDAEVRKTFYSRDRSKKVRYFSNGSLFRDPAANWRDSIAFFSSPDPPNPEEIPA 550

Query: 560  ICRDILVEYSKQVMEIGKLVFELLSEALGLNPNYLNDIDCSEGLAFVCHYYPPCPQPNLA 619
            +CRDI+VEY+ ++   G  +FEL SEALGL  +YLN++D  +G   +CHYYPPCP+P L 
Sbjct: 551  VCRDIVVEYTDKIRAFGLTMFELFSEALGLPTSYLNELDSIKGEFHLCHYYPPCPEPELT 610

Query: 620  IGTSEHTDNGFITVLLQDHIGGLQIRHGNNWVDIPPVARALV------------------ 679
            +GTS+HTD  F+T+LLQDHIGGL++ H N WVD+ PV  +L+                  
Sbjct: 611  MGTSKHTDISFMTILLQDHIGGLEVLHENQWVDVHPVHGSLLLTNDMFISVYHRVLSRDV 670

Query: 680  ----------RSTMAVVASRVNAP----------------------------------NL 739
                       S+     S+V  P                                  +L
Sbjct: 671  GPRISVASFFTSSFPEYVSKVVGPIKELLSEENPPIYRDTTIKDVAAHYHKKGLDGNRSL 730

Query: 740  TPLSKA-------------------------------------------DEN-------- 799
             P  +A                                           D+N        
Sbjct: 731  DPFRRASKEWGFFQVVNHGIGVEVLDEMLCGIRRFHEQDAEVRKTFYSRDQNSESSHNFT 790

Query: 800  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYRPPETFDSDNISGE-TQIHIPVVDLDH 859
            Y R  ++KAFDDTK GVKGL+D+G+T IPR+F+   E  +   IS E +++ +P++DL  
Sbjct: 791  YDRTAEVKAFDDTKLGVKGLLDSGVTNIPRMFHH--EKLNMHEISKEDSKLCVPIIDLQD 850

Query: 860  INKNSLKRKYTIDIVREASEKLGFFQLVNHGIPVDVLEEMKDAVRRFNEQETESKKQYYT 919
            I  NS  R   +D +R A +K GFFQ++NHG+ V+VL EM   +RRF+EQ+ E +K +Y+
Sbjct: 851  IETNSSLRAQVVDKIRSACQKWGFFQVINHGVGVEVLNEMICGIRRFHEQDAEVRKTFYS 910

Query: 920  RDLTKPLIYNSNFDLYTAATTNWRDTFGYISAPNSHNPQDLPEICRDILVDYSKRVMEIG 979
            RD  K + Y SN + +     NWRDT  +   P+  NP+++P +CRDI+++YSK+V  +G
Sbjct: 911  RDNNKKVRYFSNVNPFRGKGANWRDTISFFLTPDPPNPEEIPIVCRDIVIEYSKKVRTLG 970

Query: 980  NLLFELLSEALGLNPNYLKNIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVLLQ 1039
            + +FEL SEALGLNP+YLK ++ + G  L+ HYYP CP+P L +GTS+HTD+DF+TVLL+
Sbjct: 971  DTIFELFSEALGLNPSYLKELESSNGQFLLGHYYPACPEPELTLGTSKHTDSDFMTVLLE 1030

Query: 1040 DQIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKKGPRRNHNQ 1099
            D +GGLQ+ +EN+WVDV PV G+L+VNIGD +QLITN +F SV HRVLA   GPR     
Sbjct: 1031 DHMGGLQVLHENQWVDVHPVHGSLIVNIGDFLQLITNGRFVSVYHRVLARNTGPR----- 1090

Query: 1100 RLHYSIPFKRSIMAVAASEIN----GLNLTPLSKADDNFHRLTELKAFDDTKAGVKGLVD 1159
                        +++A+  IN    G +    S  D ++ R+TE+K FD+TK GVKGL D
Sbjct: 1091 ------------ISIASFFINTSTRGTSKVVESSKDSSYDRITEVKVFDETKLGVKGLFD 1150

Query: 1160 AKVTEIPRIFYHPPEEYDSVET-----QIRIPLIDLDGVGKDSLKRKHIVDQIRDASEEL 1219
            + VT+IPR+F+H   + D+ ET     +  +P+IDL  V K+S  R   +D+I+ A +E 
Sbjct: 1151 SGVTKIPRMFHHAKVK-DNTETTPNDLKFNVPIIDLKDVEKNSSMRVEALDKIKRACKEW 1210

Query: 1220 GFFQVINHGISVSVLDEIKDSVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLYSAPTTN 1279
            GFFQV+NHGI V VLDE+   +RRFHEQD +V+K +Y+RD+ K   Y SN  L++  T +
Sbjct: 1211 GFFQVVNHGIGVEVLDEMLHGIRRFHEQDAKVRKTFYSRDMSKKVRYFSNGRLFTDSTAD 1270

Query: 1280 WRDTFSYVSAPNPPNPQELPEICSR-----------------DILVDYSKWVMKIGKLVL 1292
            WRD+ ++ S+P+PPNP+E+P +C                   DI+V+Y++ +   G  + 
Sbjct: 1271 WRDSIAFFSSPDPPNPEEIPVVCRYLSNTNFDCDNFDTIKFIDIVVEYTEKIRAFGLTMF 1330

BLAST of CmUC05G085020 vs. ExPASy TrEMBL
Match: A0A1J6I2X8 (1-aminocyclopropane-1-carboxylate oxidase-like protein OS=Nicotiana attenuata OX=49451 GN=ACO3_3 PE=3 SV=1)

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 567/1030 (55.05%), Positives = 728/1030 (70.68%), Query Frame = 0

Query: 1019 NFHRLTELKAFDDTKAGVKGLVDAKVTEIPRIFYHPPEEYDS---VETQIRIPLIDLDGV 1078
            N+ + +ELKAFDDTKAGVKGLVDA +TEIPRIF HP     S    E     PLIDL+ +
Sbjct: 16   NYDKNSELKAFDDTKAGVKGLVDAGITEIPRIFIHPEGSGKSPSLEEKHFIFPLIDLENM 75

Query: 1079 GKDSLKRKHIVDQIRDASEELGFFQVINHGISVSVLDEIKDSVRRFHEQDTEVKKQYYTR 1138
              D +K K +V QI DASE  GFFQVINHGI VSVLDE+    RRF EQD E+KKQYY R
Sbjct: 76   NNDPVKHKEMVKQIGDASETWGFFQVINHGIPVSVLDEMLRGARRFFEQDIEIKKQYYHR 135

Query: 1139 DLMKPFIYNSNFDLYS--APTTNWRDTFSYVSAPNPPNPQELPEICSRDILVDYSKWVMK 1198
            D  K  ++NSNFDLYS  A + NW D+      P+P NP+ELPE C R+IL++YS  V K
Sbjct: 136  DYTKRVVFNSNFDLYSPKAISVNWSDSLYSTMGPDPLNPEELPEAC-REILMEYSHHVKK 195

Query: 1199 IGKLVLELLSEALGLNPNYLNNIGCSDGLEFVCHYYPACPHPKLTTGISEHTDADFITVL 1258
            +G  +LEL+SE+LGL P++L  + CS+GL  +C+YYPACP P+L  G+S+HTD DF T+L
Sbjct: 196  MGCTLLELMSESLGLKPSHLKEMECSEGLSILCNYYPACPQPELAIGVSKHTDNDFFTIL 255

Query: 1259 LQDHIGGLQIRHHNNWI-DVHPVAGALVVNIGDLMQVTNADEHFDRAAELKLFDDTKAGV 1318
            LQD IGGLQ+ H N++I +       +V + G+ +Q T   +++DR +ELK FDDTKAGV
Sbjct: 256  LQDDIGGLQVLHQNHFIKNTKKQETKMVFSSGEEVQAT-FQQNYDRQSELKAFDDTKAGV 315

Query: 1319 KGLVDSGITQIPRIFYRLPDSGVSPVPGDTE--LSIPVIDLEAIDRDSSKRRDVVNKVRE 1378
            KG+VD+GIT++P +F   P   +      TE     PVIDLE  D+D  K +++V+KVR+
Sbjct: 316  KGVVDAGITKVPWMFIH-PQESIHNSSNSTEKKFIFPVIDLEGFDKDPIKHKEIVDKVRD 375

Query: 1379 ASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYEQDTQLKKQFYTRHNTKSIVYNSNFDLF- 1438
            ASE WGFFQ+VNHG+P+ VL+EM +G  RF+EQD ++KK++YTR NTK + + SNFDL+ 
Sbjct: 376  ASETWGFFQVVNHGIPLPVLEEMLQGARRFFEQDIEIKKRYYTRDNTKKVAHVSNFDLYS 435

Query: 1439 -TAPAANWRDTFLCFMAPNLPNPQDLPEICRDILFDYSKEMKKLGRILFGLLSEALGLNT 1498
             + PAA+WRD+  C MAPN P+P++LP  CR+IL D+SK M KLG  L  LLSE LGLN+
Sbjct: 436  KSVPAASWRDSLYCVMAPNPPSPEELPTACREILMDFSKHMMKLGYSLLELLSEGLGLNS 495

Query: 1499 NYLSDIECDRGLAVLCHYYPACPQPELTLGTTEHADNDFLTVLLQDDQIGGLQVLHQKKW 1558
             +L D+    GL++  HYYPACPQPELT+GT +H+D  F+TVLLQDD IGGLQVLHQ +W
Sbjct: 496  WHLKDMNIAEGLSIGHHYYPACPQPELTIGTRKHSDCVFMTVLLQDD-IGGLQVLHQNQW 555

Query: 1559 IDIPPIPGALVLSHFVILSIELFLQLISNDGFKSVEHRVLANRDGPRVSIASFFGIGVYT 1618
            ID+PP  GALV      ++I   LQLISND + SVEHRVL+N+ GPR+S+ SFF  G + 
Sbjct: 556  IDVPPTSGALV------VNIGDMLQLISNDKYISVEHRVLSNQVGPRISVPSFFSTGAFP 615

Query: 1619 TSQVYGPIKELLSE-QNPAKYGETTLKDFYFYHNSRGLNGTSALQHFRLSLDDEGDATPI 1678
            +S++YGPIKELLSE   P    E  +K      N  G             +D+    + I
Sbjct: 616  SSKIYGPIKELLSELPTPLASNEKLMK------NDGG-----------RKVDNTLYTSII 675

Query: 1679 KDCFIQSSKQNKMANLTPFSKLDQTFDRASELKAFDQTKAGVKGLVDSGVAEIPGIFYCP 1738
                + SS  N  A      K+ +++D+ +ELKAFD TKAGVKGLVD+ + E+P IF  P
Sbjct: 676  AKKMVVSSTDNFQA------KIQKSYDKMNELKAFDDTKAGVKGLVDAEITEVPQIFILP 735

Query: 1739 PKEHSNSVPE-ETHLGIPVVDLEDIDKDPFKRREVVGKIREASETWGFFQVLNHGVPASV 1798
            P+    S    ET    PV+DLE+ID+DP K +E+V K+R+ASETWGFFQV+NHG+P  V
Sbjct: 736  PENRPESSETCETQCIFPVIDLEEIDEDPMKHKEIVDKVRDASETWGFFQVINHGIPLPV 795

Query: 1799 QEEIINGVHRFFEQDIEVKKQYYTRDNTKPFVHNCNFDLF--SAPVANWRDTFFTLMAPI 1858
             EE++ G  RFFEQDIEVKK+YYTRD+TK  ++  NFDL+  S P ANWRD+ F LMAP 
Sbjct: 796  LEEMLQGTRRFFEQDIEVKKEYYTRDSTKKVIYTSNFDLYSPSVPAANWRDSLFCLMAPT 855

Query: 1859 SPSPQDLPQVCRDILVEYSKQIMKLGELIFGLLSEALGLKSTHLVDLDCNEGLSILGHYY 1918
             P P+ LP   R+IL+EYSK +MKLG  +  LLSE LGL   HL D+DC EGL++LGHYY
Sbjct: 856  PPCPEQLPTANREILLEYSKHVMKLGCSLLELLSEGLGLNRCHLKDMDCAEGLAVLGHYY 915

Query: 1919 PPCPQPELSIGTTEHSDNTFITVLLQDGMGGLQVRQHNKWVDVPPVPGAFVINVGSLLQL 1978
            P CPQPEL+IGT +HSDN FIT+LLQD +GGLQV   N+WVDVPP PGA V+N+G LLQL
Sbjct: 916  PTCPQPELTIGTNKHSDNDFITLLLQDHIGGLQVLHQNQWVDVPPTPGALVVNIGDLLQL 975

Query: 1979 ITNDRFVSSEHRVVANRKGPRVSVAGFFSTGSLPTSKLYGPIEELLSEQNPPRYKKISVK 2035
            I+ND+++S EHRV+AN+ GPR+SVA FF TG  P+S+LYGPI ELLSE NPP+Y+  +VK
Sbjct: 976  ISNDKYISVEHRVLANKVGPRISVACFFYTGPQPSSRLYGPIPELLSEDNPPKYRATTVK 1012

BLAST of CmUC05G085020 vs. ExPASy TrEMBL
Match: A0A445CZE5 (Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_A05g022030 PE=4 SV=1)

HSP 1 Score: 983.0 bits (2540), Expect = 2.7e-282
Identity = 551/1356 (40.63%), Positives = 770/1356 (56.78%), Query Frame = 0

Query: 687  YHRPTDLKAFDDTKAGVKGLVDAGITEIPRIFYRPPETFDSDNISGETQIHIPVVDLDHI 746
            Y R  +L +F+D+K+GVKGLV++G+T+IPR+FY P     S   S      IPV+DL +I
Sbjct: 17   YDREAELISFEDSKSGVKGLVESGVTKIPRMFYSPNFDETSTTASDSNLRSIPVIDLQNI 76

Query: 747  ---NKNSLKRKYTIDIVREASEKLGFFQLVNHGIPVDVLEEMKDAVRRFNEQETESKKQY 806
               N N+L     ID +R A ++ GFFQ++NHG+PVDVL+EM   +RRF+EQ+   +  +
Sbjct: 77   HNNNNNNLLHVQVIDQIRSACKEWGFFQVINHGVPVDVLDEMISGIRRFHEQDVGERSLF 136

Query: 807  YTRDLTKPLIYNSNFDLYTAATTNWRDTFGYISAPNSHNPQDLPEICRDILVDYSKRVME 866
            Y+RD  K + Y SN  L+     NWRDT  +++ PN  NPQ++P +CRDI+++Y KR+ E
Sbjct: 137  YSRDKNKNVRYFSNGTLFKDPAANWRDTIAFVANPNPPNPQEIPHLCRDIVIEYLKRIRE 196

Query: 867  IGNLLFELLSEALGLNPNYLKNIDCNEGLALVCHYYPPCPQPNLAIGTSEHTDNDFITVL 926
            +G  +FELLSEALGLNP YLK + C++ L ++  YYPPCP+P L +GTS+HTD+DF+T+L
Sbjct: 197  LGVTIFELLSEALGLNPCYLKEMGCSQDLFMMGQYYPPCPEPQLTMGTSKHTDSDFMTIL 256

Query: 927  LQDQIGGLQIRYENKWVDVPPVAGALVVNIGDLMQLITNDKFKSVKHRVLANKKGPRRNH 986
            LQDQ+GGLQ+ ++N+WV++PPV GALVVNIGD++QL+TND F SV HRVLA   GPR   
Sbjct: 257  LQDQMGGLQVLHDNQWVNIPPVHGALVVNIGDILQLMTNDTFVSVYHRVLAQNTGPR--- 316

Query: 987  NQRLHYSIPFKRSIMAVAASEINGLNLTPLSKADDNFHRLTELKAFDDTKAGVKGLVDAK 1046
                          +++A   +N                                     
Sbjct: 317  --------------VSIATFFMN-----------------------------------FS 376

Query: 1047 VTEIPRIFYHPPEEYDSVETQIRIPLIDLDGVGKDSLKRKHIVDQIRDASEELGFFQVIN 1106
             +E     Y P +E  S E     P I  D   K+ L                       
Sbjct: 377  TSECTSKVYGPIKELLSEEN----PPIYRDITMKEFLAH--------------------- 436

Query: 1107 HGISVSVLDEIKDSVRRFHEQDTEVKKQYYTRDLMKPFIYNSNFDLYSAPTTNWRDTFSY 1166
                                      K       + PF +   F   S   TN+ D+ S 
Sbjct: 437  -----------------------NFAKGLDGNSCLHPFSFVIEF---SQTETNFYDSISG 496

Query: 1167 VSAPNPPNPQELPEICSRDILVDYSKWVMKIGKLVLELLSEALGLNPNYLNNIGCSDGLE 1226
            V  P              D+L                           YLN         
Sbjct: 497  VLTP--------------DLL---------------------------YLN--------- 556

Query: 1227 FVCHYYPACPHPKLTTGISEHTDADFITVLLQDHIGGLQIRHHNNWIDVHPVAGALVVNI 1286
              C +    P  +L +                                         VN 
Sbjct: 557  -CCTFLMKVPTMELES-----------------------------------------VN- 616

Query: 1287 GDLMQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPGDTE 1346
                     D + DR AE++ F+D+K+GVKGL+DSG+T+IP +FY   D   +    D+ 
Sbjct: 617  ---------DSNCDRKAEVQAFEDSKSGVKGLLDSGVTKIPSMFYVKLDPSDNTKQSDSN 676

Query: 1347 LSIPVIDLEAIDRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYEQ 1406
             SIP IDL+ ID+ SS R  VV+++R AS+KWGFFQ++NHGVP  V+DEM  G  RF+EQ
Sbjct: 677  FSIPAIDLQDIDKSSSLRGKVVDQIRSASQKWGFFQVINHGVPEDVMDEMINGICRFHEQ 736

Query: 1407 DTQLKKQFYTRHNTKSIVYNSNFDLFTAPAANWRDTFLCFMAPNLPNPQDLPEICRDILF 1466
            + +LKK FY+R N+K + Y SN  LF   AA WRDT      P++ NP+ LPE+CRDI+ 
Sbjct: 737  EAELKKPFYSRENSKKVRYFSNGKLFRDYAATWRDTISFVANPDISNPEQLPEVCRDIVC 796

Query: 1467 DYSKEMKKLGRILFGLLSEALGLNTNYLS-DIECDRGLAVLCHYYPACPQPELTLGTTEH 1526
            +Y+K+++ LG I+  LLSEALGL+++YL+ +++C   L ++  YYP CP+PELT+G T+H
Sbjct: 797  EYTKQVRALGIIILELLSEALGLDSSYLTKEMDCAEALYIMGQYYPQCPEPELTMGLTKH 856

Query: 1527 ADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISNDGFKS 1586
             D DF+T+LLQ DQIGGLQVLHQ +W+D+PPI GALV      ++I   LQL+SND F S
Sbjct: 857  TDCDFITILLQ-DQIGGLQVLHQNQWVDVPPIQGALV------VNIGDILQLMSNDKFIS 916

Query: 1587 VEHRVLANRDGPRVSIASFFGIGVYT--TSQVYGPIKELLSEQNPAKYGETTLKDFYFYH 1646
            V HRVL+   GPR+S++SFF     +  TS+VYGPIKELLS++NP +Y + T+K+    +
Sbjct: 917  VYHRVLSKTIGPRISVSSFFMNFTISECTSKVYGPIKELLSDENPPRYRDITMKEILTNY 976

Query: 1647 NSRGLNGTSALQHFRLSLDDEGDATPIKDCFIQSSKQNKMANLTPFSKLDQTFDRASELK 1706
             ++ L+G S L              P+                           R +E++
Sbjct: 977  YAKCLDGNSYL-------------IPL---------------------------RKAEVQ 1036

Query: 1707 AFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPEETHLGIPVVDLEDIDKDPFKRRE 1766
            AFD +K GVKGL+DSG+ +IP +FY       N+ P +++  IP++DL+DI K    R +
Sbjct: 1037 AFDDSKTGVKGLLDSGMTKIPSMFYVKLDPLENTKPSDSNFSIPIIDLQDIYKSSLLRCQ 1096

Query: 1767 VVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIEVKKQYYTRDNTKPFVHN 1826
            VV +IR AS+ WGFFQV+NHGVP  V +E+I+G+ RF EQ+ E+KK +Y+RD+ K   + 
Sbjct: 1097 VVDQIRSASQKWGFFQVINHGVPKDVMDEMISGICRFHEQEAELKKPFYSRDSNKKVRYF 1120

Query: 1827 CNFDLFSAPVANWRDTFFTLMAPISPSPQDLPQVCRDILVEYSKQIMKLGELIFGLLSEA 1886
             N  LF    A WRDT   +  P   +P+ LP VCRDI+ EY+KQ+  LG +IF LLSEA
Sbjct: 1157 SNSKLFRDYAATWRDTISFVANPDLLNPEQLPAVCRDIVFEYAKQVRALGIIIFELLSEA 1120

Query: 1887 LGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDNTFITVLLQDGMGGLQVRQ 1946
            LGL S++L D+D  E L I+G+YYP CP+P+L++G T+H+D  F+T+LLQD +GGLQV  
Sbjct: 1217 LGLNSSYLKDMDAAEALHIMGNYYPHCPEPDLTLGLTKHTDFDFMTILLQDQIGGLQVLH 1120

Query: 1947 HNKWVDVPPVPGAFVINVGSLLQLITNDRFVSSEHRVVANRKGPRVSVAGFFS--TGSLP 2006
             N+WVDVPP+ GA V+N+G +LQL++ND+FVS  HRV A   GPR+SV  FF   T S  
Sbjct: 1277 QNQWVDVPPMQGALVVNIGDILQLMSNDKFVSVYHRVKAKTVGPRISVTTFFMDLTTSEC 1120

Query: 2007 TSKLYGPIEELLSEQNPPRYKKISVKEYNLYFAEKG 2035
            TS++YGPI+ELLS++NPP Y+ ++ KE    +  KG
Sbjct: 1337 TSQVYGPIKELLSDENPPLYRDVTRKEIMENYYAKG 1120

BLAST of CmUC05G085020 vs. ExPASy TrEMBL
Match: A0A6J1AUS7 (LOW QUALITY PROTEIN: uncharacterized protein LOC110421487 OS=Herrania umbratica OX=108875 GN=LOC110421487 PE=4 SV=1)

HSP 1 Score: 884.4 bits (2284), Expect = 1.3e-252
Identity = 432/746 (57.91%), Positives = 553/746 (74.13%), Query Frame = 0

Query: 1296 FDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDS-GVSPVPGDTELSIPVIDLEAI 1355
            +DRA+ELK FD+TKAGVKGLVD+GI ++PRIFY+  D        G T++SIPVIDLE +
Sbjct: 20   YDRASELKAFDETKAGVKGLVDAGIKEVPRIFYQPRDQFETDSFSGGTQVSIPVIDLEGV 79

Query: 1356 DRDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFYEQDTQLKKQFYTR 1415
            +++   R+++V KV+ AS+ WGFFQ++NHG+PVSV+DEM  G  RF+EQ  + KKQ ++R
Sbjct: 80   EKNPITRKEIVEKVQIASKTWGFFQVLNHGIPVSVMDEMMDGVRRFFEQGVEAKKQLFSR 139

Query: 1416 HNTKSIVYNSNFDLFTAPAANWRDTFLCFMAPNLPNPQDLPEICRDILFDYSKEMKKLGR 1475
              TK +VYNSNFDLF+APAA WRDT  C MAPN P P++LP + RDI  +YSK++  LG 
Sbjct: 140  DYTKRVVYNSNFDLFSAPAAKWRDTVFCSMAPNPPKPEELPTVFRDITLEYSKQIMNLGY 199

Query: 1476 ILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLGTTEHADNDFLTVLLQD 1535
            +LF LLSEALGLN +YL DI+C +GL +LCHYYP CPQPELTLG+++HADN FLTVLLQ 
Sbjct: 200  LLFELLSEALGLNLDYLRDIDCAKGLVMLCHYYPICPQPELTLGSSKHADNGFLTVLLQ- 259

Query: 1536 DQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISNDGFKSVEHRVLANRDGP 1595
            D +GGLQVLH+  WID+PP PGALV      ++I   LQLISND F SV HRVL N  G 
Sbjct: 260  DHVGGLQVLHENHWIDVPPTPGALV------INIGDLLQLISNDSFTSVAHRVLTNSVGS 319

Query: 1596 RVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKDFYFYHNSRGLNGTSALQHF 1655
            RVS+ASFF   +   S++YGPIKELLSE+NP KY ETT+KD+  Y N++GL+GTS L HF
Sbjct: 320  RVSVASFFTTALLPDSRLYGPIKELLSEENPPKYRETTVKDYITYFNAKGLSGTSPLPHF 379

Query: 1656 RL---SLDDEGDATPIKDCFIQSSKQNKMANLTP--FSKLDQTFDRASELKAFDQTKAGV 1715
             L   +L       P+    I + ++  +   T     +L   +DR SELKAFD TKAGV
Sbjct: 380  SLICETLPYFSSDVPLFLLKIANRRKKMVIAKTDEVQFELKPEYDRTSELKAFDDTKAGV 439

Query: 1716 KGLVDSGVAEIPGIFYCPPKEHSN-SVPEETHLGIPVVDLEDIDKDPFKRREVVGKIREA 1775
            KGLVD+G+ E+P IF  PP +    SV   T + IPV+DLE + KDP  R+E+V K+R+A
Sbjct: 440  KGLVDAGIKEVPRIFQHPPDQSEKISVSGVTQVRIPVIDLEGVKKDPGTRQEIVEKVRDA 499

Query: 1776 SETWGFFQVLNHGVPASVQEEIINGVHRFFEQDIEVKKQYYTRDNTKPFVHNCNFDLFSA 1835
            S+T GFFQV+NHG+P SV EE+ +G  RFFEQD+E+KKQ++TRD TK   +N NFDL+S+
Sbjct: 500  SKTLGFFQVVNHGIPLSVLEEMKDGARRFFEQDLEIKKQFHTRDYTKRVAYNSNFDLYSS 559

Query: 1836 PVANWRDTFFTLMAPISPSPQDLPQVCRDILVEYSKQIMKLGELIFGLLSEALGLKSTHL 1895
            P ANWRDT  +LMAP  P P++LP VCRDI++EYSK +M LG L+F L SEA+GL   HL
Sbjct: 560  PAANWRDTVSSLMAPDPPMPEELPDVCRDIMMEYSKLVMHLGYLLFELFSEAVGLHPDHL 619

Query: 1896 VDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDNTFITVLLQDGMGGLQVRQHNKWVDVP 1955
             D+DC +GL +L HYYP CP+PEL++G T+H+DN F+TVLLQD +GGLQV   N+WVD+P
Sbjct: 620  KDMDCAKGLVMLSHYYPACPRPELTLGATKHADNDFLTVLLQDHIGGLQVFHENQWVDIP 679

Query: 1956 PVPGAFVINVGSLLQLITNDRFVSSEHRVVANRKGPRVSVAGFFSTGSLPTSKLYGPIEE 2015
            P PGA VIN+G LLQLI+ND FVS EHRV++N  G RVSVA FFST  LP  + YGPI+E
Sbjct: 680  PTPGALVINIGDLLQLISNDAFVSVEHRVLSNSVGARVSVACFFSTFLLPDLRPYGPIKE 739

Query: 2016 LLSEQNPPRYKKISVKEYNLYFAEKG 2035
            LLSE+NPP+Y++ +V+E+  Y   KG
Sbjct: 740  LLSEENPPKYRETTVREFVEYVHTKG 758

BLAST of CmUC05G085020 vs. TAIR 10
Match: AT1G06620.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 427.6 bits (1098), Expect = 8.3e-119
Identity = 199/346 (57.51%), Positives = 260/346 (75.14%), Query Frame = 0

Query: 25  RPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIPVIDLDDVHR 84
           R T +KAFD+TK GVKGL+DAGI EIP IF  PP    S      + + IP IDL     
Sbjct: 13  RSTLLKAFDETKTGVKGLIDAGITEIPSIFRAPPATLTSPKPPSSSDFSIPTIDLKGGGT 72

Query: 85  NSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYYTRDN 144
           +S+ R+  + ++ +A+EK GFFQ+INHGIP+ VLE++   ++ F+EQDTEVKK +Y+RD 
Sbjct: 73  DSITRRSLVEKIGDAAEKWGFFQVINHGIPMDVLEKMIDGIREFHEQDTEVKKGFYSRDP 132

Query: 145 TKPLIYNSNFDLYSASTTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVMEIGKLL 204
              ++Y+SNFDL+S+   NWRDTLG  +AP+PP P+DLP    + +++YSK VM++GKLL
Sbjct: 133 ASKMVYSSNFDLFSSPAANWRDTLGCYTAPDPPRPEDLPATCGEMMIEYSKEVMKLGKLL 192

Query: 205 FELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITVLFQDNI 264
           FELLSEALGLN N+L ++ C+  L +  HYYPPCPQP+LTLG ++HSDN F+T+L QD+I
Sbjct: 193 FELLSEALGLNTNHLKDMDCTNSLLLLGHYYPPCPQPDLTLGLTKHSDNSFLTILLQDHI 252

Query: 265 GGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRISVASFFS 324
           GGLQ+ H + WVDVPPV GALVVN+G+L+QLITND+FISV HRVLA   GPRISVA FFS
Sbjct: 253 GGLQVLHDQYWVDVPPVPGALVVNVGDLLQLITNDKFISVEHRVLANVAGPRISVACFFS 312

Query: 325 TLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGL-GTS 370
           +    + +VYGPIKE+LSEENPP YR+TTI ++   YR+ G  GTS
Sbjct: 313 SYLMANPRVYGPIKEILSEENPPNYRDTTITEYAKFYRSKGFDGTS 358

BLAST of CmUC05G085020 vs. TAIR 10
Match: AT1G06650.2 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 420.2 bits (1079), Expect = 1.3e-116
Identity = 199/353 (56.37%), Positives = 265/353 (75.07%), Query Frame = 0

Query: 1688 KLDQTFDRASELKAFDQTKAGVKGLVDSGVAEIPGIFYCPPKEHSNSVPEETHL----GI 1747
            K+D  FDRASELKAFD+TK GVKGLVDSGV+++P IF+ P  + S   P  + L     I
Sbjct: 5    KIDPLFDRASELKAFDETKTGVKGLVDSGVSQVPRIFHHPTVKLSTPKPLPSDLLHLKTI 64

Query: 1748 PVVDLEDID-KDPFKRREVVGKIREASETWGFFQVLNHGVPASVQEEIINGVHRFFEQDI 1807
            P +DL   D +D  KR   + +I+EA+  WGFFQV+NHGV   + E++  GV  F EQ  
Sbjct: 65   PTIDLGGRDFQDAIKRNNAIEEIKEAAAKWGFFQVINHGVSLELLEKMKKGVRDFHEQSQ 124

Query: 1808 EVKKQYYTRDNTKPFVHNCNFDLFSAPVANWRDTFFTLMAPISPSPQDLPQVCRDILVEY 1867
            EV+K++Y+RD ++ F++  NFDLFS+P ANWRDTF   MAP +P PQDLP++CRDI++EY
Sbjct: 125  EVRKEFYSRDFSRRFLYLSNFDLFSSPAANWRDTFSCTMAPDTPKPQDLPEICRDIMMEY 184

Query: 1868 SKQIMKLGELIFGLLSEALGLKSTHLVDLDCNEGLSILGHYYPPCPQPELSIGTTEHSDN 1927
            SKQ+M LG+ +F LLSEALGL+  HL D+DC++GL +L HYYPPCP+P+L++GT++HSDN
Sbjct: 185  SKQVMNLGKFLFELLSEALGLEPNHLNDMDCSKGLLMLSHYYPPCPEPDLTLGTSQHSDN 244

Query: 1928 TFITVLLQDGMGGLQVRQHNKWVDVPPVPGAFVINVGSLLQLITNDRFVSSEHRVVANR- 1987
            +F+TVLL D + GLQVR+   W DVP V GA +IN+G LLQLITND+F+S EHRV+ANR 
Sbjct: 245  SFLTVLLPDQIEGLQVRREGHWFDVPHVSGALIINIGDLLQLITNDKFISLEHRVLANRA 304

Query: 1988 KGPRVSVAGFFSTGSLPTSKLYGPIEELLSEQNPPRYKKISVKEYNLYFAEKG 2035
               RVSVA FF+TG  P  ++YGPI EL+SE+NPP+Y++ ++K+Y  YF  KG
Sbjct: 305  TRARVSVACFFTTGVRPNPRMYGPIRELVSEENPPKYRETTIKDYATYFNAKG 357

BLAST of CmUC05G085020 vs. TAIR 10
Match: AT1G06640.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 405.6 bits (1041), Expect = 3.4e-112
Identity = 200/376 (53.19%), Positives = 268/376 (71.28%), Query Frame = 0

Query: 1287 MQVTNADEHFDRAAELKLFDDTKAGVKGLVDSGITQIPRIFYRLPDSGVSPVPGDTEL-- 1346
            M+ T     FDRA+ELK FD+TK GVKGLVDSGI++IPRIF+       +P P  ++L  
Sbjct: 1    MESTKIAPSFDRASELKAFDETKTGVKGLVDSGISKIPRIFHHSSVELANPKPLPSDLLH 60

Query: 1347 --SIPVIDLEAID-RDSSKRRDVVNKVREASEKWGFFQLVNHGVPVSVLDEMKKGTLRFY 1406
              +IP IDL   D +D+ K ++ +  ++EA+ KWGFFQ++NHGV + +L++MK G   F+
Sbjct: 61   LKTIPTIDLGGRDFQDAIKHKNAIEGIKEAAAKWGFFQVINHGVSLELLEKMKDGVRDFH 120

Query: 1407 EQDTQLKKQFYTRHNTKSIVYNSNFDLFTAPAANWRDTFLCFMAPNLPNPQDLPEICRDI 1466
            EQ  +++K  Y+R   +  +Y SNFDL+TA AANWRDTF C+MAP+ P PQDLPEICRD+
Sbjct: 121  EQPPEVRKDLYSRDFGRKFIYLSNFDLYTAAAANWRDTFYCYMAPDPPEPQDLPEICRDV 180

Query: 1467 LFDYSKEMKKLGRILFGLLSEALGLNTNYLSDIECDRGLAVLCHYYPACPQPELTLGTTE 1526
            + +YSK++  LG  LF LLSEALGLN N+L D+EC +GL +LCHY+P CP+P+LT GT++
Sbjct: 181  MMEYSKQVMILGEFLFELLSEALGLNPNHLKDMECLKGLRMLCHYFPPCPEPDLTFGTSK 240

Query: 1527 HADNDFLTVLLQDDQIGGLQVLHQKKWIDIPPIPGALVLSHFVILSIELFLQLISNDGFK 1586
            H+D  FLTVLL D+ I GLQV  +  W D+P +PGAL      I++I   LQLI+ND F 
Sbjct: 241  HSDGSFLTVLLPDN-IEGLQVCREGYWFDVPHVPGAL------IINIGDLLQLITNDKFI 300

Query: 1587 SVEHRVLANR-DGPRVSIASFFGIGVYTTSQVYGPIKELLSEQNPAKYGETTLKDFYFYH 1646
            S++HRVLANR    RVS+A FF   V    +VYGPIKEL+SE+NP KY ETT++D+  Y 
Sbjct: 301  SLKHRVLANRATRARVSVACFFHTHVKPNPRVYGPIKELVSEENPPKYRETTIRDYATYF 360

Query: 1647 NSRGLNGTSALQHFRL 1657
            N +GL GTSAL  F++
Sbjct: 361  NGKGLGGTSALLDFKV 369

BLAST of CmUC05G085020 vs. TAIR 10
Match: AT5G59530.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 404.1 bits (1037), Expect = 9.9e-112
Identity = 203/358 (56.70%), Positives = 259/358 (72.35%), Query Frame = 0

Query: 16  LSKADENYHRPTEIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGETQYQIP 75
           ++K    + R  E KAFD+TK GVKGL+DA I EIPRIF+  P+D   D     +  +IP
Sbjct: 1   MAKNSVEFDRYIERKAFDNTKEGVKGLIDAKITEIPRIFH-VPQDTLPDKKRSVSDLEIP 60

Query: 76  VIDLDDVHRNSLKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFN-EQDTE 135
            ID   V+ ++  R+  + +V+ A E  GFFQ+INHG+P+ VLEE+K  V+RF+ E+D E
Sbjct: 61  TIDFASVNVDTPSREAIVEKVKYAVENWGFFQVINHGVPLNVLEEIKDGVRRFHEEEDPE 120

Query: 136 VKKQYYTRDNTK-PLIYNSNFDLYSAS-TTNWRDTLGYISAPNPPNPQDLPEIIRDNLVD 195
           VKK YY+ D TK    Y+SNFDLYS+S +  WRD++    AP+PP P++LPE  RD +++
Sbjct: 121 VKKSYYSLDFTKNKFAYSSNFDLYSSSPSLTWRDSISCYMAPDPPTPEELPETCRDAMIE 180

Query: 196 YSKRVMEIGKLLFELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSD 255
           YSK V+ +G LLFELLSEALGL    L  + C + L + CHYYPPCPQP+LTLG S+HSD
Sbjct: 181 YSKHVLSLGDLLFELLSEALGLKSEILKSMDCLKSLLMICHYYPPCPQPDLTLGISKHSD 240

Query: 256 NVFITVLFQDNIGGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKK 315
           N F+TVL QDNIGGLQI HQ  WVDV P+ GALVVN+G+ +QLITND+FISV HRVLA  
Sbjct: 241 NSFLTVLLQDNIGGLQILHQDSWVDVSPLPGALVVNVGDFLQLITNDKFISVEHRVLANT 300

Query: 316 EGPRISVASFFSTLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGL-GTS 370
            GPRISVASFFS+    +S VYGP+KEL+SEENPPKYR+TT+R++   Y   GL GTS
Sbjct: 301 RGPRISVASFFSSSIRENSTVYGPMKELVSEENPPKYRDTTLREYSEGYFKKGLDGTS 357

BLAST of CmUC05G085020 vs. TAIR 10
Match: AT5G59540.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 401.0 bits (1029), Expect = 8.3e-111
Identity = 195/346 (56.36%), Positives = 255/346 (73.70%), Query Frame = 0

Query: 28  EIKAFDDTKAGVKGLVDAGINEIPRIFYQPPEDYYSDNISGE-TQYQIPVIDLDDVHRNS 87
           E KAFD+TK GVKGLVDA I E+PRIF+   +   +   S   +  +IP+ID   VH ++
Sbjct: 14  ERKAFDETKQGVKGLVDAKITEVPRIFHHRQDILTNKKPSASVSDLEIPIIDFASVHADT 73

Query: 88  LKRKDTINRVREASEKLGFFQLINHGIPVGVLEELKGAVKRFNEQDTEVKKQYYTRD-NT 147
             R+  + +V+ A E  GFFQ+INH IP+ VLEE+K  V+RF+E+D EVKK +++RD   
Sbjct: 74  ASREAIVEKVKYAVENWGFFQVINHSIPLNVLEEIKDGVRRFHEEDPEVKKSFFSRDAGN 133

Query: 148 KPLIYNSNFDLYSAS-TTNWRDTLGYISAPNPPNPQDLPEIIRDNLVDYSKRVMEIGKLL 207
           K  +YNSNFDLYS+S + NWRD+     AP+PP P+++PE  RD + +YSK V+  G LL
Sbjct: 134 KKFVYNSNFDLYSSSPSVNWRDSFSCYIAPDPPAPEEIPETCRDAMFEYSKHVLSFGGLL 193

Query: 208 FELLSEALGLNPNYLNEIGCSEGLAIGCHYYPPCPQPNLTLGTSEHSDNVFITVLFQDNI 267
           FELLSEALGL    L  + C + L + CHYYPPCPQP+LTLG ++HSDN F+T+L QDNI
Sbjct: 194 FELLSEALGLKSQTLESMDCVKTLLMICHYYPPCPQPDLTLGITKHSDNSFLTLLLQDNI 253

Query: 268 GGLQIRHQKKWVDVPPVAGALVVNIGELMQLITNDRFISVAHRVLAKKEGPRISVASFFS 327
           GGLQI HQ  WVDV P+ GALVVNIG+ +QLITND+F+SV HRVLA ++GPRISVASFFS
Sbjct: 254 GGLQILHQDSWVDVSPIHGALVVNIGDFLQLITNDKFVSVEHRVLANRQGPRISVASFFS 313

Query: 328 TLAYRSSKVYGPIKELLSEENPPKYRETTIRDFHMLYRADGL-GTS 370
           +    +S+VYGP+KEL+SEENPPKYR+ TI+++  ++   GL GTS
Sbjct: 314 SSMRPNSRVYGPMKELVSEENPPKYRDITIKEYSKIFFEKGLDGTS 359
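
The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length. The following is a minimal sketch of how such a header line could be parsed and the percentages recomputed, assuming the header text format shown on this page. The helper name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of any BLAST tool.

```python
import re

# Hypothetical helper; the pattern mirrors the HSP header lines on this page.
# "Posi?tives" also accepts the "Postives" spelling seen in some page exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts.
        "identity": 100.0 * int(ident) / int(ident_len),
        "positives": 100.0 * int(pos) / int(pos_len),
        # Percentages as printed in the header, for cross-checking.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


# Header of the AT5G59540.1 hit listed above.
stats = parse_hsp_stats(
    "Identity = 195/346 (56.36%), Positives = 255/346 (73.70%), Query Frame = 0"
)
```

Comparing `stats["identity"]` with `stats["reported_identity"]` is a quick consistency check when alignment headers are copied between reports.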

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAG5588104.1 | 0.0e+00 | 45.78 | hypothetical protein H5410_048538 [Solanum commersonii]
KAG8484564.1 | 0.0e+00 | 38.57 | hypothetical protein CXB51_023069 [Gossypium anomalum]
KAG5588113.1 | 0.0e+00 | 43.19 | hypothetical protein H5410_048547 [Solanum commersonii]
AAF24827.1 | 0.0e+00 | 37.60 | F12K11.6 [Arabidopsis thaliana]
KAF9673550.1 | 0.0e+00 | 45.73 | hypothetical protein SADUNF_Sadunf10G0035800 [Salix dunnii]

Match Name | E-value | Identity | Description
Q84MB3 | 1.2e-117 | 57.51 | 1-aminocyclopropane-1-carboxylate oxidase homolog 1 OS=Arabidopsis thaliana OX=3...
Q8H1S4 | 1.9e-115 | 56.37 | 1-aminocyclopropane-1-carboxylate oxidase homolog 3 OS=Arabidopsis thaliana OX=3...
P10967 | 1.8e-113 | 54.03 | 1-aminocyclopropane-1-carboxylate oxidase homolog OS=Solanum lycopersicum OX=408...
Q9C5K7 | 4.8e-111 | 53.19 | 1-aminocyclopropane-1-carboxylate oxidase homolog 2 OS=Arabidopsis thaliana OX=3...
Q9LTH8 | 1.4e-110 | 56.70 | 1-aminocyclopropane-1-carboxylate oxidase homolog 11 OS=Arabidopsis thaliana OX=...

Match Name | E-value | Identity | Description
A0A3Q7I749 | 0.0e+00 | 42.20 | Uncharacterized protein OS=Solanum lycopersicum OX=4081 PE=3 SV=1
A0A4D6M4H9 | 0.0e+00 | 41.36 | 2-oxoglutarate-dependent dioxygenase OS=Vigna unguiculata OX=3917 GN=DEO72_LG5g3...
A0A1J6I2X8 | 0.0e+00 | 55.05 | 1-aminocyclopropane-1-carboxylate oxidase-like protein OS=Nicotiana attenuata OX...
A0A445CZE5 | 2.7e-282 | 40.63 | Uncharacterized protein OS=Arachis hypogaea OX=3818 GN=Ahy_A05g022030 PE=4 SV=1
A0A6J1AUS7 | 1.3e-252 | 57.91 | LOW QUALITY PROTEIN: uncharacterized protein LOC110421487 OS=Herrania umbratica ...

Match Name | E-value | Identity | Description
AT1G06620.1 | 8.3e-119 | 57.51 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT1G06650.2 | 1.3e-116 | 56.37 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT1G06640.1 | 3.4e-112 | 53.19 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT5G59530.1 | 9.9e-112 | 56.70 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
AT5G59540.1 | 8.3e-111 | 56.36 | 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 493..513
None | No IPR available | COILS | Coil | Coil | coord: 782..802
None | No IPR available | COILS | Coil | Coil | coord: 118..138
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 15..363
None | No IPR available | PANTHER | PTHR10209:SF653 | CME8 PROTEIN | coord: 2035..2251
None | No IPR available | PANTHER | PTHR10209:SF653 | CME8 PROTEIN | coord: 15..363
    coord: 1012..1293
    coord: 2819..2929
None | No IPR available | PANTHER | PTHR10209:SF653 | CME8 PROTEIN | coord: 2625..2799
    coord: 1687..2031
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 1687..2031
    coord: 1012..1293
None | No IPR available | PANTHER | PTHR10209:SF653 | CME8 PROTEIN | coord: 2369..2613
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 679..981
    coord: 391..661
    coord: 2819..2929
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 2369..2613
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 2035..2251
None | No IPR available | PANTHER | PTHR10209:SF653 | CME8 PROTEIN | coord: 679..981
    coord: 391..661
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 1292..1655
None | No IPR available | PANTHER | PTHR10209 | OXIDOREDUCTASE, 2OG-FE II OXYGENASE FAMILY PROTEIN | coord: 2625..2799
None | No IPR available | PANTHER | PTHR10209:SF653 | CME8 PROTEIN | coord: 1292..1655
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 36..363
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 2369..2609
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 1033..1290
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 411..661
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 1706..2030
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 2644..2930
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 2036..2252
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 700..981
None | No IPR available | SUPERFAMILY | 51197 | Clavaminate synthase-like | coord: 1309..1646
IPR026992 | Non-haem dioxygenase N-terminal domain | PFAM | PF14226 | DIOX_N | coord: 1743..1818, e-value: 3.0E-16, score: 60.2
    coord: 449..554, e-value: 8.8E-16, score: 58.7
    coord: 1346..1444, e-value: 1.1E-16, score: 61.6
    coord: 738..841, e-value: 5.5E-15, score: 56.1
    coord: 1067..1168, e-value: 2.3E-16, score: 60.6
    coord: 2685..2784, e-value: 6.6E-16, score: 59.1
    coord: 74..173, e-value: 1.1E-14, score: 55.2
IPR044861 | Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain | PFAM | PF03171 | 2OG-FeII_Oxy | coord: 2120..2211, e-value: 2.5E-20, score: 72.8
    coord: 1502..1602, e-value: 1.1E-19, score: 70.8
    coord: 894..980, e-value: 3.1E-23, score: 82.2
    coord: 605..661, e-value: 1.7E-11, score: 44.5
    coord: 1902..1993, e-value: 1.9E-24, score: 86.0
    coord: 2481..2572, e-value: 3.7E-18, score: 65.9
    coord: 2863..2929, e-value: 8.8E-15, score: 55.0
    coord: 232..324, e-value: 3.4E-23, score: 82.0
    coord: 1225..1292, e-value: 4.9E-14, score: 52.6
IPR027443 | Isopenicillin N synthase-like superfamily | GENE3D | 2.60.120.330 | - | coord: 693..994, e-value: 1.4E-89, score: 302.9
    coord: 1700..2028, e-value: 4.9E-98, score: 330.7
    coord: 2638..2931, e-value: 3.3E-73, score: 249.1
    coord: 2029..2246, e-value: 6.8E-62, score: 211.9
    coord: 405..664, e-value: 8.0E-72, score: 244.5
    coord: 1026..1294, e-value: 2.8E-76, score: 259.2
    coord: 2363..2606, e-value: 4.6E-67, score: 228.9
    coord: 1302..1639, e-value: 3.4E-90, score: 304.9
    coord: 29..359, e-value: 3.3E-99, score: 334.6
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 2861..2932, score: 9.546528
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 1218..1404, score: 9.160934
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 1498..1604, score: 13.197618
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 888..989, score: 13.944705
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 1895..1994, score: 14.691792
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 2118..2213, score: 13.330166
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 599..714, score: 7.992104
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 2479..2574, score: 12.020754
IPR005123 | Oxoglutarate/iron-dependent dioxygenase | PROSITE | PS51471 | FE2OG_OXY | coord: 224..326, score: 13.024904

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmUC05G085020.1 | CmUC05G085020.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0016706 | 2-oxoglutarate-dependent dioxygenase activity