HG10013854 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10013854
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Rop guanine nucleotide exchange factor 14
Location: Chr02: 5387823 .. 5420272 (-)
RNA-Seq Expression: HG10013854
Synteny: HG10013854
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGTTTCTTCAAGGTTTGGATTGTTTTCGTTCCTTTAATTTACTATTAAGATCTGGAAAATCCATCTCTAGGAGATTCGTTATCGAATTTCGAAGGTTTAATACTCTTCTGGGCTCTGGATTTTGAACGTTATGTTGTTGGTGAATCAATGCGTGTTTATATGACAATTGTTTTGTACGAGTAGCTGAATATTGATTACTTTGTCGAAGATTTTTTTTGTGTGTGTTCTTGAGTTTTGATTTGGCAAAAACAGAATCGGTTTATGCTATTGCCTGATCTGATGTTCTCTCTGTTCGAAAAGTAAATAGTAATGGAACTAAAAGTGCTTTCGGTCAATTTGAACAAATCGTTGTTTGTGCAGTGCGTTTACTGTGGTCCACGTTTGCGTCATTCGCTTTTCTTGATTCTTCCATTTTGCAATTATTCCATGGATTCACAGTGCGTTTCGAGTAAAATTCCATTAACAATGATTCTTGTTTCTTATTATTGTTGTTTTTTTAATCTCTGCTCTTCCTCCGGTGGCTTTTAGCATCCTCCCTCTTTATCTAAGTACTTTGCAGTCTGGCACCACACCATCGTCGAATCTTTTCGTGTTTCTTTTCCGACCCAGCTCAGTAGAGTGTACGTTTTATTCTCTATATGTAGTAACTTTTTTTCTGTGCATCTTTCTGAAGATGCTTATCCACTATCCATTCCAAATTTCCTGGTTACCCGTTGTTACTCTCTTTGTCTTTGATAGGCATTTTAGAAAACCGTTAAGGCCTTTTTTTCCTTTTCAAATGCTCGACTTGGAATCTTTCTCTGATGATCAACAACACACAGAGTCGTACTTTTTTTCTTTTTTTTTGCATTTTCTGTTTATCTTTTTCTTAGTGTTATTAATCTGTTTGTTCATCTTGGGAAATTTTATACTGAATTGCTTCTGGTTGGAAACTAGTTATGTACTTAATTATAAGTATAGCCTGATTTCTTATATGGAAGTTACCGGCTTTTTTTTTTTTTTTTTGTTTTGCAGATGAACTGAATTGGTTAAATTTGTATCAATTTCTTTCATATTAGATTTTATGCGCAATCTACTCCCTTGCATTTTGTATCTTATTACACTATTTTTACCACTAATTTTAGCTAACCGTTTCTAACGGAATCCTCAATCTCTCCCTTTTAAAAATGTTAAAGTTTTATCAATCAAATTTTTCCCAAGAAATTTTTGTAGTAGGACCTTTCGTTGTCATCCTGGAATCTGATCAACTTTGGCGACAGAAGTAAATCCAAAATGTAAGTAGCTGCTCAGCGAAAAGAATTGGTGGAAATCTAATCAAGGTTAGTCAGCTATGTTTAAAGTTTTTTTCTTTTTTCTTTTTTTGATTTGAAATATACGAAGTTGTTTCTGCTAGATTTTTATTTTTTTATTTTTAAGTTTTGAATGTTTACATTTTAGTAATAATGTTTTTACGCTTTTAGTCGGTCTTATTTTAAAAGTAATTTAGTTGTAAGAATAATGTTCTTCATACATGTCTTTTTAAGATCAAAATCCACTAATTTTTTCCTTGGGCCAATTAGAAGCTAAATAAATACATGCATTATATGTAAAAGATCGTTTTGACAGATGCCAACCAAGGCATATCGATACTCTTAGCCCCACATCAAGGCCGCTTGTGCAGCCTAGTCAGATGTCCAATTGTCCAATAAATATTGCCTTGTGGCAATTTTTTTTTCTTTTTAATTTTTTTAAAAACTGTCTGCAAGCACCTGATCATTGTCCATAACAAAAGGGTAAAGAAATGGGTTTGATTATTTAATCTTTTCTATAATATGAAATTAGGAAGAGCCTTTGTAGTGAAGATACCTAATACATACTGCTTGATTTCTTTCAAGCAAAAGAAACAAAAACAAAAACGTCAAAAAAACAATTTTAAGTAAGAAGAAAAAGAGTTTCTCACAAACAAATTGTGGTGGTAGAAGTTAATGATGCATAGAAGTATTTTTCAATTCTTTTA
TTTCACGTTAAATCAAACTTACAGAGGTAGATATTATGAACAGAATTTGAAATTACATTTTTCAGTTTAGGGAATTTGGCTACGAACCCTTTTGTGGAAGAATACAAAAACAAAACAAGTGGAGATATTTCCCCAAGTTTAGTATAATATTCCTTTCTACAGTAACAAAACTGTGGCATGAGTAGGCAGGGTATGGCTTTATAGAACACAGGATCGTGGGTGTGGGCCTGATTTTGGGCCCTTGAATGAGCTTTGTACTGTTTGGAAGGTTTGTTGGCCCAGTCTGATGGGCCACGGCTCTTCAATCAGTTATGTACAGTTCGAATTTCTAGTTGGCGCTGTTCGTATTTGGTTTTGGTTCGATAAAATTGACAACTTCAAATTGATTTTCAAACATCAATAAATAAAGTCATCTACATAGTATATAAACACAGATGAACTTAATGTTCAATATATATCATAAAACAAAAAATATAAATATATAAACGAATATTTATATCACAGTATACATCGAAAAGTTGAAGAATATATATATCACAATATATATCGAAAAATTGATCAAACCATGATGGTTAGATTTTGGCAGGTGCATTGCCGGAGGGAGAACCACGGCGGCAAGACTTGAAGAAGAAATCCATGGCAGCGAGATATGAGGAAGAAGCCAACGACGACGAGATCTTGGCAGGTTGTCAAAGGGAGGTGCCTCGTCAGAAGATTGGAAGCTTTGGTGAAGATGGAAGTGAGGTTTGTGAAGGGGAGAGAAGTCATTTTTGGAGGAAGAAGATGAGGAGAGTGAGGAGAAAGTTATTTTGAGTATTTGTCGAACACTCCAATTTTTTCTATAATGACTTATTTTTTAAATTAAATATTTAAAAATGTATTTCAAACACCATTAACTTTAGGTTTCTCGTTTTTTTATGTAACACACTAGTAGAAGTTGTAATAACGTTGTTTGTATTTTGTTTGGACGGTAATTTCTCATTCTCAATCTATCATACATAGACATATTTGGAATTTTTTTTAAACAATAAATTTGAAATTCTCATTTTAGTTTCATTATTCTTAGATTTTATTTTTGTATTTTTTTAGGAGAGATTTTATTTTTTCTTTTGTATTATTCTAAAAGTGAGGCAGCAAAAACAGAAAAGTTTGTGGCATTTTTGTTAGAGAGCAAATGCTCTTTCCACATTCACACAAGATGATGGAGAACATGTGGAGGAGAAAGTGGAAGTGGGCTGCCCACCCTTGTTTTGTTTTGTTTTATATTCCTTACAAATAAATTAATTGTAATTTTTTTTAATAAAAGAATTAAAATATCATTTTTGGTCTTTATATTTGAGTATCATTCTATTTTAATCTATATATATAATTTTAAAAGCTAAATTATAACAAGTAACAATTTTATGAATAATAACCAAGTGTGTAGCAACATTTAAAAAAAATTGCAAATATAACAAAGTCTATTGTTGACAAACTTTTATGGTCTAAGTAGTGATAGACTATCAGTGATAGAATTAATTTATTCCGATAGACCAACTGATAACTACTTAGACTACTTAGACCATAAAAGTCTATGAATGATAGAAGTCTATCACTGATAGATTTTGTTATATTTGCAATTTTTAAAAAAATATTGCTATATATGCTAATATTTTGAATCTAATTGCTATTTGCAACGGTCCTTACAACAATATCCTTAAACTTCATCATTTGTCCCAAAATACTTTTATACTTTAGAATAAAAATCGAAACTCAGTACTATGTGGTAATAACCTAAAATGTTGTAGTCATTGTCCGTCACACTATTTCATATCTTAACTTTTGTTGAAAATTCTAAAAAAAAATCTACTCTAATGGTATGTTTGAAATGTTTATGAAAACTCAAGGGTAATATTACAACTTTTGAAAGTATAGAGGAATTTTTAAAATAAATGGTCGAATATATATATATTTATTTAGCCCATGCAAATGGAAGAGTTGTTTTCAAAGGGCTAATTTAG
TGGAGAAAAAGGCTCCTCAATTTCATTGTCTTTGCCTGCCAATGGAAATGACAAAACTACACAATGCTCCCATTCTCATAAATGTGATGTTTCCATTGCACATTGTCCTTTTGAAATTCAACTTACACAATAATATATCATTGCTTTTATTGAAACCAATCTTCTATTTATGGTTCTCATTGTTTCATTTGTCTAGCATAATTGAAATATTCAGTGATACATCGATATATTTGTAAATCGATATTAAAGTTTAATTAATATCTTTTATACACTCAAACATCAAAATGAAATATTACTAATATTATCTATAATTGAATTTGGAGGTAAAAAATAGCTTACTTAAATTTCTTTATTTATAGAACATCTGTGATGTCGATAATTTGTCATATATCCATAAAATTTAAGTGTCGGTATCGATGTTGACATCGATAGGTTAATTATGGAAGATATGAATTTAGGGATGTCATCTGTTGTTGGATGATTCATGTAGTGTGGACGGTTTCTAAAATATGATTTTAATCATTTTAAATTTTATATTTGAAAATTAAACTTATAAATGTGAATGCGAGTTCGAGTCAGTGACTTGTGAGTTTATTTGTTTTGCTATGTAATCCCTAATAATGTTTTCAAGATTAATGTGAAGTTTTGAGAACTATGAGACAAACAAAAAACAATAGTTTCAAAAATAGCTTTTATTGATTTTTTTTTTTAAATCTGGACAATAATTTCTTTAGAAAATGTTAAAATTATTATAAAATAAGGGTAAAAAAACTAAACGCTTAGGGTCTATTTGATTTGCAATCCATATTTTGTTTTATGTTTTTATGTTTTCAGATTCACTGAATTCAGCGAATATGTGTTTGTCATACAACTCGAATTCTATGTCTGAAAACTATTTTCGAATTCTGTGATTAAAAAAAATGTAAATTCTGAAAACAACATTTTTATGTTTTTATCGTATCTAGAATTTGAATGTTAAAATTTAAAATGCATAACTATATTAAATAAAGATAAAAACATTTAAAAATATAGTATGTCATGTTATAAACTAAATTCGATTTTTTAAAATTAAAAGACGTATTATGTAATTATTATAAATTAGTTTAAATTTCAAATTTAGTTCCTATGGTTTGGATAAAGTTACAATTTAGTTTCTATGGTTTTTAAAATTTAGAATTTAGTTTCTATGGTTTGATATAATTTCATATATTGTCACATGGATTATTTAGGAAATTTTTATTGGAGTTTAATTTTAATTATAAACCACCTAAACTTCTTCCAAACCATAGGGACAAAACTTTCAATTTATCCCATAAATTATATAATATTACAAAATAATAATTATACAGTTTTTTAACATAAAATTATGTTCATAGATAAATTAATAATAGTTTATTGTCAATTGATAGTTAGTGGTTAACTACAACTAGTTTGCAATGGTTTAACTTAGAATCTAATTTATGAATTATGTAATCAAACACATGTATGAAAACATGAAATAACTATGTTTTCATTTAATTAGTTGTTTTCAAATTTATATTTTTAGGTTATCTACCAAATAGACTCATAATTTCTTTTTTTTTAATCAATTAATAAATTAAAGAACAACAATAATGAATGGATTATAATATTCATTGTTATTATTATAATTATTATTTGCAAAGTAGCGGTTTCTTTTTCTTTTTTGTTTTTGTTTTAAAAGATTATATTTCCCTCCTAGCAATTTATTTGATATATTTTTTACAATTCACCATGTTCAATTTTAGGAAATTAGAAAAAGTTACAACTCTTTGAACTGTTTTCTTTTTAATTTTTTTTTTTAGAAATAATCTTTTAAAACCATATTATTGGTGAAGTCAACAAATTCGATGAGTGCGGATCATATTATTATAATTAGGAGTGTTCAAAAAAATCGATGACTCAACAAATTCGATCAATCAACCTAATCCGTACGGTTTGAGTTGGATTGGGTTCAAATAAATGAAAAATTCATGGGTGTA
GTTGGTTAATGAGTTTACCTAAAATAACCCGAACTAACCCGAACATATTATTAATTTTAAAAAATATGTTTTTTGACCTATGATACAATTATATATACCTGTACTTATTTTTATTTTATTTTATTATTTTTTAGATAATTTATTCTCCAACGACTCTAAAAATAATTTGTTAACACTTTGAAAATAAATTCTTCATAATATAAATTAAAATTGAGTTGTTAATCTTAATTCAACATATGAAAACACTTAAATAAAATATGTACATATTAACATTCTACTTATTTTTCAAACAAAAATTTAAATAAATGACCAAAGTAACCCGAACCAACCCAATCCAAAATGTTTCATGGTTGGATTCATTAATTGATAAATATAATTTGGGTTGAAGAAACTTACAATCGAAACAACAGGATTGGGTCTAAAAGTACTTCAACACTGCTCAACCAATGAACACCTCTAATTCTAATCACAGCCTGTAACAATAATTTTAAAAATAATTAAGAAAGTTTGTATTGAATCCGTTCCATAATAATTTTTCTCTTTCAAACTTTTAGTCATTTCAATGAATAAGTTACTAATTACCATTTAAAAACTTGATGGTTCAATCTCCCATATTTCGATGGTTAAATTATAATTTTTTTCCAATTCTCAAAATAATAACAATATTAATAATAATTTCTTTCTCAACTTTTTATTTTTATTTTTTATTTTACAATAGGTAGATTTTTGAATTCAAATTACTTGTAGGGTTTTAGGTTTGGGTTTTAAGGCGTACCAACCAATGTAAATGCTTGTGAAAATGAAATGAACCAACAAAAGAATCAATGGGTGCTTCTGTTATATCACAATCCCAAATGAAAAGGAAATTTATTTGTCTATTTTCTTGAGAATTACTAATACATATCATTTTTAAAAAATGAAAATAAGAATTCACCTATTTCAATAATAATGAGATTTTTTTTTTTTTTTTTTTTAAAAAAACACTATTTAAACACAATAGTAAATTTTTTCTCCAAATAATAGTAATATATCACCTCTTTTTTAATTTATGTTTAAAATATTATTTTGATTTCTATACTTGACTTTGGTTAATTTTAGTCCTTATATTTTCAATATTTTTAATATGTTTATTTTGATACTGGTACTTTTAACTTTCATTTGTTTTAGTTCATGTACTTTTAAAAATATTCAATTTAGTCTTTATACTTTTCAAAACTAACTATTTTGGTTCATTAGTTTTCATTTTATTTTTAAAGGACTAAAAATGGTTATTTTTTAAATTCAACTACCAAAATGAACATTTTAAATATACAGAGACCAAAATAAATTAAAATTAAAGATATAAACACCAAAACGAAACAAATATAAAAATATAGGGCTAAATAGTATTTAAGTCTTTAATTTACTTATCCAATAATTTGAGAGTTCCTTTTAGCTTTATTTTTTGTCCCATACATATAATTTGTCTTGTTTTATATCCAGCGCACTTGATTGAAATATAGATCAAAGACTAATTTTATTTGATTGAACATAAATAATAATATTTTTTTCAAAAATCTATTGCATTGATTAACTAATTATTTTTCTTTTCTAAACATATTGTAAATTAAGAATTAGAGTTATACTTATTACATTTAGTTAGTTTTCAATTTTTATATTAATTATTGAATTTAAATAGGCCTTAAACACACATGTCATGCATCAATGAATTGTAAAATAAAATTTAAATGGAACATCTAAATGGCTTGACTCATAAAAAATTTAAAGATTAAATTGATTATTTGAAGTTTTAAGAAAAATATTCACAGATAGAAAAAATGTCAAACTATTTAAAAAGAAAAAAAGACTGAAAAACATTGATAGACTTATATCAGCATCTATCTATGATAGACTTCTATCATTTCTATCATTAATAGACCCTAATAGACTTCTATCAGTATTTATCACGACTATCTAAAAATTTTGCTATTGTGTGTAAATAGTTTTCCTTATTTTTTTAT
TTTAAAAAATTCTCCCAGTTTTAATTGTCATGACTCATATTTTAGTAGTATAAATTGATTTTTTATTAGAAAAATAAACAATTGCAATGTGTATAAATTAATAATTAAGTAAACAATATTTTTAAAGAATTGCAAATATAATAAAATCTATTGTTAATATCCTATCAACGTGATAGACTCTAAAAGTTGGATATAAATTTTGCTATATTTACCCATTATTTTGCATGATATATTTGTTAATATTTTGATCCATATATTTATATTTGCCCCTACAAAAACTTATAATTTTTTTTTTTATTTATATATTATAGAAGTATTGGTCACGGAATTGATAATTATTATTATTGCTATTACTTTCTTTTATCATTATGAAAAAAAAAAACTAAAAAAATGTTTTGAAAAAGAAAGAGGAATAAGAAACCGCCAAAACGGATCCAATAATCGGGGTTGGGCCCCGCCATTAGCTTTCGCGGGGCCCACTTTAAATCCAACTGGACTTCCAATTGCCACGTGAGAAAGCGCGTGAAAAGCCAAACGCGCAGCCCCACTTTGATCATGCTTAAACGGACGGGCCAGATCGACTAACGGCGCGTGGTCTCACGCGCAAGAACCGTATTCCTCTTCGTACAGTTCCTTTCTCTCTCTAAATAGTCACTCGTCTTCTTCTCCTTCCCCTTCTCTCACACTCTCTCTAAAATCCATCCGCCATTTTCGCTCTCCCGAAGCTCTGTAATGGGAGTGCGTTTCTTTGCTTAGGGTTCAGTGGAGCTTCCGGTGAAGTAGCTTTTGAGATTGAGTTTGTTGAGAGATGCTGTTCCGGTTCCGCAGTGAAACGCCGCTGCGAAATCCGTTTCTGGACTCAGGTTGTTTGGCTTTACTTTTGCATTTTGCCTTATTTGCGCTAGATCCTAGGGTTGGTTGGTTCGAGATTTTTCGCTCTGGTTTCTCGTTTCTGTCTTCTGTAAGACTTCACGATTCACTAGGATTGTTTTGTTTTCCACACTGCGTTGTTTGTGGGTTGCATTTCTTCTGTATAGTTTGTCTTAAGCTACAGAAATCGGAGAGAGATTGGTGAGGAAATGACGTTCGAGTTCTTGACATTTTGTTTTGTTCGAAGTTTAGGGGTTCTACGAGTCTATTGAAGTCCGTGACCTTTCTCGGCGCCCAACCTTGTGTGCTTGCCTTTTCCGGTTTTCCTCCTTAAGTTTCCTTAATGCAACTTTTGAGAACTGGTTTTTCAATTGTTTTTTTTTTTGTTGAATCATGATGGTAAAGTCCATTTTCTACCAGTTTCTCTGGACCCAAAAAATTCATATAATCCTTTTCATTTCCCCTTTTTCCTTTATCTTTCTTTTACTTTTTGGTCCTGTTGACTTTTTCGTTTTCCCGATCTTATTATTTTAAAAGCAAATCAAATGCTAATCATAAATATATAGCTGAAACATTAATGGAAGTTGAATATAGGTGTGAATGAATTAAAATTTCTACGGTATCCTTAGTCATAGACATTTTCCCCCTTCAAATCATTGACCTCGTTTCATAATTTCATCAAATTACTCTTCCTGAAAGGACTAATTCTTGATTAATGATCATGTAGTGGGCTTGTGTTGATCGGCCTGTTTGTCACATTGTCTGCTGCAAGTCTATTGTGATTTTGGTCAATCGTATTTCCTCCATACCCCCAAGCCGCCAGAGAAGATAACAAGATGATGTCGATGCGGCGAAGACTGGCTTGCTGCACAAGGGACCGAGGGGTCAGCCTGGACATGGACGAGCAAGAGAGTAAGCCTCAGCTTCCCTTTACTGTTTCACGAATCAGTACCATTTGGCTTTAAATGCATCAAATTCAACTTTATTACTACTTTTGGCTTCAGCTAGATTTTTAGTGTGTTATTTAAAAGTTTCTGAAGTAACTTTGTGCTTGATTGATATAAATCTCTCACTGTTAATTCAAAAAGTAACTACAACGGATTTGAATTTGAAAAATTAGGAATAACTCC
ACATGCCACTAGGACATTTTTTTATTCATACAGGTTTGACTAGCTATATTAGTTTGAAATACACTCACCGCGTCCACATGCTACGAAGGATTTTTTTTCCCTTCGAATTTCTTTATTGGAAAGGAAGGCTATGACAAATTTGAAAAGAAAGGGGGCGATGATGGTGGGGCAGGTAGAGAGTAAAGAACGTGTTTGTTAGGGTTTGGTAAAGAAATTAAGAGGGTTCGAGGGAAACGATGGGTCCGGAGGGTCAGATCCTGTGAAAAGGGTTAGAGTGGGTGTAGTAGGCAAATATCACTACCAAGTGTTCCCCTGTCTCTTATTAGTCTTACCTGACCCATTGGTTAGTCCCATGCCAATTCTGGTGCCTTCATTAATGACTCTGAAGGCAGTATTTATTATACTGGAGGACATCAAAGGCCGTAAATGGGTTGGGTTTCTGGTGCTATACTTAGAATGTTTTTTTATTATTATTTTTTTTTAGGTTCAACTAGGGGGTGGGAGTTGGCAGGAGATATCTTATCCATTTAGTTAGCCATGTTTGGATTGGGCTCTTTGATTTTGGTATTCGAGATTCTTTTGGGTTGGAGAAATGCTTCAAAGATTTATTATTATTATTTTTTATTTTCAAAATTTTTTACATTCAATAATTTTATTTATTTATTTATTCATATTCATTTCGAGCCCATCATCAAATGAGGGATTACATAAATACAATTTCAAATTTGATAATGAATAATATAAGCATTGTATTAAAATAAAAAGGAAGAAGAAGTTAAAAAAAAATATAAGCAAGAAAGCATGAATACCATTTTTGAGTTGTTGGACATTGATATACTTGGACAGGATTTAGGCCTTGTTCGATTCATATGAGATTAGAACATGCGAGGGAAATGGGGTACACTCGGGGCACGGGCAATTTTAGCATATAATATACGATGAAAAAGAACTGATAGTTTATTATGGACCCTTTTTTGGAAACTTGTACTTATTTTAGTCTTTAGAAACTAAAAAAACATCTTATCATCAATCAAAACATCTAAGGTATAAACTCCCACTCTTATAGGTTGAATTCTAGAAAGAAACAAAAACTACAAAAATTTATAGGAACTATATAAGACGAAGAAAATCTATTAATTCTTTTGGTAAAATCTCATGGGCTACTTTGAGAAGAATGTGAAATGCCATGTATTATCGTGTCTTATGCGAGTCTATCGTTTTTTTTCAATAATTGTGGGAATAAGGTCGAAATATTGATCTCTGAGATGGTAAAACTTTTGCACCATTTGTGTATTCCATAACCATATCCTTATTTATATGTTTTCTTTTTTCTAGAATATATCATTATTCAAACATCATGTGATTCTACTTCGATAATAATTTGTATGAGGTGGGTAGTTTAATTATTAAGATACTGCAAGCCCAAGTGAATTAAATAAACCAAACCATTGAAAGTTTAGCCTTTTCTATTAGTTTTTCTCTTTCCTGAAAAGAATGTTATTTCTTTAAATACCATGGTTATTTTGATTTTTGAGCATGGAATACGTATAAGTGATCCCCTCCAGGCTCCAGTCTCTGTTATAATTTATAAAAATAACAGGCTTTGGTCCATGTGACTTTGTTCGTGCTTTTAGCACATGGGTGATGTGTCCGGGTGAGAAATCCGTGACTTTAGATTTCCTTGGCTTCCCATTCTATCAGTTTTCCACGGCGGATGGCTCAAATCTTGGACGCGGTCTAGATTCTAAATTGAATAGAATATATTATTTATTATTTTTAAAAGTTTAACATATTAATATCATTTATTAAACGATGTTTGGGACTAAATCTTTGTAAATTTCATTGTCTTACTAGGGAATTAGTGAAGAGGGTGGTACATTTTCAAACATAATTTAGTAATTATAGCTCAGGTAGACAAAGAAAATGTAAGTGGGGACATTATACAGCTGAGAAAAGTCTGGCTGTACGACTTGTGCGTCTGATTGATTTAAAATTTTAGG
GATTACTGAGCTTCCAATATGCCAGACGAAAAGGACAAACTCAGCCCACTACTTGCTGTTTCTTTCATCCTCCTCTCTATATTTTGGTTCTGATTTTTATTTTTAGGCGAAAATCATCCAAACACATACCTTCAGACAAGTATCCAAATTGCCAGAAAAGACGGACAAACATTGCGGTGTATCTTGTTGAGCCAGTTGGCTTTGGAGCAGTTCTGGATGTCTTATTCTTTCTTGGAAGTAGTATGATAGTATCTGACTACGGTCCAAATAACAATAATTACCGAAGTTTCTGCTTTCGTTTGGGCTGTATATTTTCTGATAAAATGAACCCCATCGCTCAGAAATCACGATAAATGGTCAAAATTTGTAGGTCTTATTTGTTCTATAGTCTGTGTAAATGGTCAGTCTAAATTATTCTTACAAGAGTAGCCTTTGCATTCGGATCTGTTTGTGTATATAGTCAACGAAGAGTAAAATTTTTTTTGGAACAGCTTCGCTAGAACTAGTATGCGTAAATCTTCTCTTTCCAGCTGTGGCAGTCAGGCCTAACTAATTATTATTTTGCCTTTTCACATGCGTTATTCAACTTTTCTATAACAGGGATTTGTCTTTTGTCGTTTCTCCTTTGTTTTCAAATGGGTAACAAGATCCACATGAGAATCCAGTGAAGATCGTGCCCATCAACTAATCCCTAATTACCGCTAACTCAGACCAATAACTCCTGTTAACTATTTTGCCCTCATGTTTGATTTCCCTCGTGAACAATGTTACCTGCAAGGAATAAAAAAATTGAATATCAAAATTTTGATCCCTGGATAAGAATTATGTGTACGTTCCTTCCAGTTTTGCCACTAAGTATATGAGTGAGGAAGAAATCATTGAAATTCAGGGAAGTTCCGAGGGGAGAAAACGGAAAATATGCTGCAAAGGTTCGGGAACGTTGTCAAAACAAATGGGAAGAGGTCAAGAGGATGAAGTGTGTTTTGCAGGGAAAGCAATCTGTAAATTTTGATCCCTAGATTCATAAGATGATTACATAATAGAACTTTCAACAACAGATGGAATGATCCTTAATTGGTGCGTTTTGGAAACGAAATTTGTTTTTCTAGGAGGTCTGTTTATATTTCTGAGGAAGACCAACGATACAGTATAGTAGAAAGTCTTCAGAGTTTGTCTCCACACCAATAATTGTGAGCACAGGCAGTTAAAATCATGCTGCTTTGATATGTTATATCAAGTGCCTTTTATATTTGCTAAAAAGCGAGCATTGTTTAATTGGCATACACATGTATTAGTGACCAAGAGGTCTTGTAGTTTAAATTCCCTTATCCTTATCATATTAAAAAAACAAAAGTGCCTTTTATATTTTATCTTTCTTTTCATAATTAACTTTGAGAAGACGTAAAAGCTTCAACTCTCCTCTTGACTGCTCTCTTTGTTGTTTGCTTAAATTTAATATAAGATAACATAAAGTTATTATTGACAAATAATTTTTCCGTGGGATTCAGGGATTATGACTTTTAATGGTCTTGAGAGTTGCGTTTTAAACAATCAAACCTTCGAAAATGAAAGCAGATCGAGCAGAGCAGATGAATGTACAACTGACTCACTAGAAGATCATGATTCAAGCTCTTCTTCTAGCAAGGATGCTAGTGGATCTTTCTCCTCAAAATGGTTGGCAATGCATAGGGACGAGCAGGATTTGGATGAGTGGGAACAACCAGAAAGCCCTCAGCATTTTTACATGAAAGAGAAACATGATTACACTCTTCAGGTTTCAGACATAGAAGCAATGAAAGAAAAGTTCACAAAACTATTGCTTGGTGAAGATGTCACAGGAGGGCAGAAAGGGCTGAGCTCTGCGTTGTCACTGTCAAATGCCATCACCAACCTAGCAGGTAGAGAATACATTGCATTTCTTTTTCATCATAACGGACCCTTCTGTTTAAATATCTCATGTACTTGTCATTCATTCCCAGTCTCGACCTCGAAATAATTAT
TAAGGTTTTGCAGCGTCTGTCTTTGGAGAACTATGGAAATTGGAACCTCTTCCTGAGGAGAGGAAGAGTAAATGGAGAAGGGAAATGGACTGGTTGCTCTCTCCTACCCACTATATGGTTGAATTGGTTCCTACAAAGCAAAATGGTACGGGCGGTAGAGTGATGGAGGTTAGTTGATTTAATTGCAACCAGAATTGTGAAATATGTTTTTGAATCTCATATTAACGAGATTTTGTGCACAAATTTTCTGCAGATAATGACTCCAAAGGTTCGGGGAGACGTTCACATGAATCTTCCCGCTCTCCAGAAGTTAGACTCCATGTTAATTGTAGGTGGTCTGATTACCATACGTACTAGCTAATCTGCCTTTTTATTGTTTATATCAGCTCATCTGCCTCAATTATATGCTATGCTATTTTCTTTTATTACAGGGAACATTGGATTCCATGGTGAAGACAGAGTTTTGGTATGCAGAAGTTGGTAGCAGGGCTGAAGGAAAAAGCAAGAGTATGGGCCAAAGCACGAGATGGTGGCTTCCATTACCACAAGTACCATCCACTGGGCTATCTGAGAATGAGAGGAAGAAATTGCTCAACCATGGCAGGGTTGTACATCAAGTATTCAAAGCTGCCAAATCCATCAATGAAAGTATTTTGCATGAAATGCCCGTACCAACTGCTGTAAGGGAGGCAGTCCGAGCTGTGAGCATTCTTTATTCACTTATACCATTCAATTTTATCAGCTAAGAATCATGCATACTATAGAAATTTTTCTGCGACCACGACTAAACTGATCACAGTTTAACTATAATAACAAAGTGGATGAGTTTTAATAATTTTGATTATGGTCCTTCCAGATGAGATTAGATGTGTTGTGCCTAGTCCCATGTATATATTTTTAAGGGGCAGTTGTAAATATAGTACAATCAAGTCCAAAATATTACTAGATATAGTACAATGCAAAATAATTTGTAGATATAGCAATATTTAGATCCAGATCTCAGAGCCTATCAGTGATAGACTATATTGTTGGTAGGAGTCTATCAGTGATAGTCCTAAAAATGGTACCCATTGCAATTACCCTATTTTTAAATTGAAGACTTTTTGGCGCAATTGAACATACAACGGATATTAGTTCCCTTGCAGATTGAAATTTCAAATTTCTACTCCTTCATGTGTAATGTTATATTTAGGATAATTGCACAGGGTAGTGTTTTTAGGGACAATAATTTAGTGTATAGCAACAATTTTAAAAAATTGCAAATATAGCAAAATTTATCAGTGATAGATTCTATTATTAATAGACTCCTATCAACAATATAGTCTATCCCTGAGAGATTCTTACAAGCAATATGGCCCATCATTGATAGACCCCTACTAATGATATGGTCTATCACGGATAAACTTAGGAAGCAAATTCTAAATTTTGCAATAAAATGACACTTTTTGTGAGCATGGCATGAATAGGTAGAACCTTTTTGTGAAAATCAGAAATTCTTGGGTAATACAAATAGGGTAGAACCTGGCAAATCTAAATTTATGTCTTGTATCTGTTTTGGCTACCATGAATTGGACAGTCTGGAAAAGCAAGTATGAGCGAAGAACTCTACAAGATTTTGACATCAGAATCTGGTCCAGCTGAGAACATGCTGAATCAGCTTAATCTGAAATCCGAACATGACGTTCTCGAGGCCATAAACCGCCTTGAAGCTGCAATATTCTCCTTGAAAGAGAAATATACCGAACAAAGTGGCAATAAATCTCCAGTTCGAACTTCTTGGCCTTTTGTCAAGGACCCAACGGCTGGGATTGATAAGTTGAAATTACTCACTGATCAAGCCGAGGTTCTTCTACAGCTACTGAAAAATAAATATCCAAATCATCCCCAAACGTTTCTGGATGTTTCAAAAATCCAATATGAGAAGGTATATCTTCTTTTTGCATTCATCGTTAGTCGAGTCGGGCATCTATTGCATCTCCTTTGCTTCCTGTCTAAATAAA
AGTATGTCTCTCTTTCTCATGTCTTTTTACTTATTAGATCAGTAGATTGATAGTAGGTCATAATAACCTGAAAACATAACACTCAAGGGAAAACATAAAAATGTGGACTGAACTCTCTTAGTAAAACCATTTCCCTTTCTGGAACAAACAGTGCATTATGCAAAGAATTTTATTGTTTTCATGTACATTGACTTTCCAATCTTTGATAAACAATGATAGTGTTGGGAGTCTGTGACTGGTAGTTTTGTTCTTGTTGGCACACTAAAAACAGAAATCATACAAGTTTCTGTTTGAATTTTCATAGGATGTTGGGCATTTGATTCTGGAAGCGTATTCGCGGGTGCTCGGAAACTTAGCTTACAGCATACTGTCTAGAATTGGAGATGTTCTGCAAGTGGATGCTATGTGCAACCCAAATTCACCTGCACCAACATGTTGTTTTCCAGGGATGAGTCTGTTGAACAATCGCAGCGATCAAATGTCCGCTCTTCATTCGTGGCAGCCGCTTATCGGTCACTCGAACAGTCCCAATATGACTTTGCCATCCAGCAAAGTTAGTGGAAACTCTCCGACCGCAACCCCGAGCCGAAACCGAGCATGGTGCATTGGCAGAGAGGTTTGTAGGAGTGTCTCCTCTGGAAACAACTCACCATAGCCATTTGAAAGCACATTAGCAGTTGCATTTTGTTGTGTACATGAAGCCATCATATTATTCAGTGCCATTGAAATCAAAATTTTGTGTTCCTTTGGGAATCAGAAAAGGCATATTTTGAACTCATTGTATATTGATAGTATAGTTTACAAAGCACAGTATATTTATACTCTCCCTCTAATCTCTCTATACACATTCCCTCCTACTATCCGTATTAGTTATTTCTTCCTTTGGCAAAATACACTTTGGTCTCTTTGATTTCTTTCTTTAACAATAAGATGAGACTCACTAACATTAATGTAGGGTCCACTAGTATTTATGGAGACAAATTTATAGTGTGACCATAAAGTGTATGATACTCTCGAGGAAGGGGAGAGTTTTGTGAGAGGGAGTGGAAAGAAGAGAGAGAAGGAAAAAAAATAATGATGATTATTGAGAGTTGGAAAAAGAGAAAAGAGAGAAGAAAAAGAACAATGATGATTATTGGGATTTATGAAAAGAGATAAAAGAGCGAATAAAATTAATTATTATTTTACAAATACCAAAATGAATTTTTGAGAAAACCACTAGTTTTGGGCATTAAATGTGTAGTTACACATCACTCTTTTTCCCACCACCTATATGCACTTTTTTTTTAAAAAAAAGAATAAATACGGTAAATCATGAACGGGGTACACAATTTCATAGGACTAAATTGTGGACGGGGTACATGATTTTAGTCCACATGAAATGCCATGTGTGCATGTCTTGCCAGGTCGGATGCCACCTAATGGCACCAAAATCGTGGACTCCATCCACGATTTAGTCTTCTGAAACCGTATACCCTTTCCACGATTTACCAATCATATTTTTGTTAATAGTTTCCAATTTAACTCTATTTCATAATTTTTTTTAAACTACATTATTTTATGTCACTACCCTATTTTTGTTATTTTTTAAAAAACTACCGTATTGTTGAAATTAACCCGATTTTAAATGGTCCCTATACCTATTTGATTAGATAACTTAATGGGTTGCTGATTGAGCTATTAAAAGATTGAGGTAGACTTACTTACTTGTTAGTTTGTATGACCTGGAAATCTATTAGTTTAAAAATTGAGACAACTAAAAAGTTTTATTTTTTCTATGTAAACAATCTTTTATTTTTTTTGCCACTTTAGTTCCCTTCTTAAAGATTCACAATTAGACTTCCCTTTCAATAAACCACGATAAACAATGGTCAATCTTCAACACCAAGTTGTTAGAGAAGATAACATCTAGACATGAACGAGCGAGAGAGTAAGCCTTAGCTTCCCTCTTTTAAAGTCAAATATGTCAAAGTTCTATTAAACACAAAATTAGAAATTAA
TTTATTCGATACCTTTTGAAAGAAAAGAACTTCATTGAGAACAAAAATGGATAACATAAGCACAAAAATCTTTTCAAGCCAACGACAAAATGTATCTAATAGTTTATTTTAGATACATAATAGTTTATACAAAATTAAAAGTTTAGATATTACTAGACATAGAACCTACCAGTTTTAAAGGAATAATTGCATAAAAGGCCCATTTTAAAGGCCCAAAATGACAAATAACCTAATTTTTGAAGAAAATGTCAAATAACCGTTTGATTATGTCATGACCACGTGAAATTACAGTATTGCCCTTCTTATTTTTTTTTCTTCTCTTCTTTCTCTCGCGTTCACTCCCCCTTTTTCTTTTTTTCTTATTTTTCTCTTCTTTCCCAATATCCTCCTTCTGCCTCTTCTTCTTCTTCAGGCACGACGAGTCACATGAGCAACTCCAGCAACCAACTCCAATGAGTGATAGCGAGAGCGAGAGAGATGAGAGTGATTGTGAGTGAGGAAAGAGAGCGAGATGAGCTTTTTCGCATGTGCGAGTTCATTTTGGTAATTTCATTCGATCTTGACGTCCGTAAATGGTCATTTGGCATTTTTCAAAAATTTGAGTCATTTGACATTTTTTATCTAATTTTTGGGTTATCGTAAAAAATTTTCAGTTTTAAACCACTAACGTGAATCATAATTTTTTATTTTTTATTTTAAAAAATGTGTTAAGTAGTTTTTGCAATTTGTGCAAAATAAAACTCCAGATCAAATTCACCCGCCAACTTTTCCCTGCAATCGAATTGGTTGGTGTATTAATTGAGTTCATATTTATTTTATTTCCTTCTCCAAATCCTTCTCTTCCCCTGCAAATGGACTTGCCAAGATTCGGCCGTCCAAAGGAAGACAATGGATCCTCTTCTTCTAGCCCTAATCTTTACGTTGCAAACTGTGGACCGGCCGTCGGAATCAGCCACCGCACAGTTGCGGCGGTTTTCGGCGATTTTGGGCTCGTGAAAGGGGTTCATGCCGCCGACGAAACGGGCGCTCGCGTCATCGTATGTTTTTCAGAAGAATCGAGTGCCCGAGCCGCCCTTGAGGCGCTTCACGGCCGCCCTTGCGCTCTCCTTGGAGGCCGGACTTTGCACATACGTTATTCGATCATCAGACCATCCATTTCGCACCCCAATGATTCTGTTTCAGTTTCTTTGTCGGCTTCGGAGCTGGACATTCCCGGACTTTTCTTATTGCACGACTTCGTCAGTGCTAAAGAAGAGGAGGTGAGCGTTTTAGTTGGATTATGAGGACTGGAGAAGGATTCGGTTTTGTTTTTTTCTCCCTTTGTTTCATTGTTATTGATTTTTCGTGCTTTTGAAGGATTTGCTTATGGAAGTTGATGCTCGTCCTTGGAATAATCTGGCGAAACGTAGAGTTCAGCATTATGGGTATGAGTTTTGTTATCAAGTAAGTGGCAGAATTTTTTTTTTTTTTTGTCTGTTCCTTCTTCCTACTGTCTGTTGTCCTGAATGATTGTGCATTGTCAGGATTCACGAGTACTGAAAATCAGTTGGTTTTGTCTGTGTTTGTTTTTTCTTTTTAGACGAGGAATGTTAATACTAAACATCAGTTGGGGGAACTTCCATCATTTGTTTCCCATGTAGTTGATAGGATCTCCATGTTTCCAAACGTTGAGAATGTTGCAGATGCTTCTCTTGATCAATTGACGGTAGGCTCTCTGTTTTTGATGAATTACCCTAAACTTGGCTGCTTGTAGTTGGTTACTTTCTTGTACAATCTTTGAAAATTGAGATTGTAGCTTTTGCTATCCTGAGTCTTATGATGGTTTATTTTCCATGCTTCCTCTTTTATGCTATCTGAAGTTGTTGTCATGTTTAGTATGAGACTGAAATCAATACCACATTACAGGTTAATGAATACCCACCTGGGGTGGGTTTGTCCCCTCATATAGACACCCATTCTGCATTTGAAGGATTAATTTTCAGCCTTTCCTTAGCAGGGCC
ATGCATTATGGAGTTTAGGAGATATCCCGAAGGCACTTGGCACAAATGCCCTTCAAGTATAGATTTGAAAATGGGGAATTCTGTAAACGACTCAAATTATCTAAGGAGAGCCATTTACCTTCCACCTCGGTCTATGCTATTACTGTCTGGAGAGGCACGTTATGCTTGGCATCATTACATTCCTCACCACAAGGTATATCAGACCATCCATGAACAATTGTTGCTTTAGTGGAGTTGATTTACGAAATTTGTAGAGTTATCTAATTATACATGTTGATAGCCCCCAATCATACATGTGTTTTCTCGTAAGCAAGTGAAGGAAAGAAATGATGGCGTGAGAAATTGTAAGGGGTGTGTGGGCCCATTAGTATGGAGGTTAGTTAGCTGTTAGATGGGCTGAGACGTTTTGGGTTTGTAGGTATAAGCATTGTGAGAGGAGGGTATATTTTTGGAGAACTTTTGTGGCATTCTCTTTGAGAATTAGGAGAGGTGGGAAGCTCTCATAACTTCCTTGTTATATTGCTATCGGTTTATATATAAGTTAGCCAAGCCTTATCACATGTTTTCAGCTATGAATTGGTCTATGTTCAAAGAGTTGGGCCAATGGGTAAGGATGAGCTATACTTATGATAATCATATTCATAAAGTCTTGTTTATTTCAAATGATGTAGAGGGATTATATTATATTCTTGTTGGATAGGTTAACTGAGGGTAAATCGGGGTATAACATATTTTTGGGTGTATACCAAGGTGCCACTCCCAGCCAAGCATCCGCAGTGGTTAAAAAGATATTTAATTGTTGAATGACATTATCATTTAACTTTTGAATGAGAGGAACTGTGCACTTATATGTTCAACCATATACTTATGCTTACTCGTTCTATTCTCATTTATGATGTTGAAATTGGGTAAATGTTCTGTAATAGATTGACATGGTGAAGGACAGTGCTATCAGAAGGGGTCCTAGGAGAGTTTCTTTTACATTTCGCAAGGTGAGTAGATTTAATGCATTTCTTTTTCCCTCCCTCTCGTACAAACAGCTTACCTTATCTGCTTCTATTATTCATGATGTCAAGACAGCAGCCTACCCCTTTGCAATTCAAAGCATTATTTATGTTAAACTGGATGATATCACTAGACCATGATATTCTAGTTTGACCCAATCTCCTCTTAAAGTTCATCCTTTTTTCTCTCTTTGTGAATTTTAATGCCCAATCTCCATTACAATTAACTTTTTTTTGCCCTATCTATCGATTTTAGTCCTCAAAAAATGCGGGGCTTTTGAAGCTCATCTGATCGATATAGGTCATGCAACCACAAACCAATCATAGCAACAAAGGGTACATAAACAGACCTTTGAAACCTATTTCTTTTCAGCATAATTCATATTAAGGCTGTTCAAATCAAGAAACTTGTTACTTGAAGTGCGGTTTGTGCTTTAAAAATGCTTGTTTAGCCATCCCGCTTTATAATGTTGAAGACTTTTTGCTAATAATGGAACTGAATGTATGGAGACTGGTTTAAATGTATGTGAATGAAATGACTAAATCATGCTTCAAAGTTGAAAATCTCTGGCCAATGCCTTGCAACTTGAAGATATGATGGTAAAGAGCGTGTTTCGTAAACAATTATTCGTCTACTTCGATTTTGTGGGGATGCTATGCAAAATTCTATCATTCTCTATGGTCTAAAAGTTGATCGTTTATTTCATCTTTATCATCTCACATGCCTTTCCATGGTTGAATAATGGGAAAAACATTGTCTTGTTGTAGGTGAGAACTGATCCTTGCCAGTGCAAATTTCCTCACTATTGCGATTCTCAGAGATAAATGAGAGGCTTCAATCCTCATCTAAGAACAGGCTGAGTTTTCTTTTTTCCTCATTCATCCATTTTTTTACATATGGTTATGTAAAGTAGTTTTTATCAAGGATGAAAAATTCGCAATATTTTGGTGCCAAAATATCGTTAAATTCGTTGTACCGATAGAGACATAAGGGGT
AACATCTTTTTTCTCAAATATCTATGAAATATGTCAATATCGTCGATATTTCGGAGGGTTTTCTATACCTCTTTCCCAACCTTTATTGTTTAAATATTTTTGCCCTTATGTTTTATTCTAAATTTCTGATTATGCACGCAGCAAGCAGCGTTTTGTTGAACTCCTTGATTAATTACATGAAATGAAACTAGGGGATGAGGTTAAAATTCAAGCATTTTTTTAATTCACTATAAAATCATTTTATTTGTCTGTTTGATAATGTTATTTTTCTAAGGCTTTGTTTGACAATCAATTGTTTGATAATCAATCGTTTGATAGCTCTTTGGTTTTTTATTTTTGAAAATTAAGTCTATATACACCCTTTCTACCTCTTGGTTTCTTTGCTTTGTTATCTACTTTAAAATCAAATGGTTACTAAACAAAGTTTTTCAATTTTCAATTTAAACCTATTATTGAAAGTTCATAGACTAAATTGGTCAATTTGAAACGTCAGAGACTAAATGCACCTATTTTTAAAATTGAGGGACCAAAAAGGTATTTTTTTTTCTTATTTCTTTCAAAAAAAAAAAAAGAAAAAAAAAGGTAATATTTCTGTTTTGGGAGAAAAATATTGAAGAACAACTCATTTTTCATAATAGAGTTTGGGCCTCATGCATACTAAAATGCTCAACTTAAGCATAACTCAATTGATATGAATGCATAAAATCAATGAAGAGGTCAAATGTTTAAATTTTCATAGTTATGTAAAACATACACCTTTGATCAAAAGCAAAGAATTTTCATTATTGTTGAAAATAAAGAATATAATAATAAAAAAAAAATTAAATTACATGTTTGATCTCCAAACTTTTTGTCGAATAGGTCTAACTTTTAGTGTGTCTAATAAGTTTCTCACTTATTCAATTTTTTTTTTAAAAGTTAATTGATCTATTTGATATAAATTTGAAACTGATGTAAAACGTAACTCTAAACATAAAATTGAAAGTTCAAAATTTATTAGATAAAAAAAAATAAGACCATGTTTAGTAAACATTTAGGTCTCCCATTTAGTTTTGGGAGTTTAATTTTTGAAAATTAAGCCTACTATTCCCTTTCCACTTCTAAATTTTTTGTTTTCTTATATACTTTTTACTGATATTTTCAAAAACAAAATCCATTTTTGAAAACTACAAAAAGTAGTTTTTGTTTTTGAAATTCGGTTAGGAATTCAACTATGCTTAAGAAAAATGCAAACCATTGTAAGAAAATATAAAGAAATAGACTTAATTTTCAAAAACAAAAAATAAAAAATGAAATGGTTACCAAACAGAGAGTTAGTTTTTTGTTTTTCGTTTTTATAAATTAAGCCTATAAATAATCATTCCACGATCCATCTCTAAATTTATTGTTTTGTTATCTAATTTTTACCACTGTTTAAAAAAAATGAAGCTAAATTTTAAAAACTAAAAAATGTAGTTTTTATTTTTGAATTTTGGCTAAAAATTTAATTATTTTACTTTAAAAAAAATCATAATATAAAATTGAAAGAAAATATGCTTAATTTTTAAAAATAAAAAAAATATTATTAAATGAGACTTAAAATTTTGAAAATTTATTAGTATTTTAAAAATTGAGAGAAGGCAACAATTGAAAGTTTATAGAACATCAAATAACAAGAATAAACTTTAAAATAATCTTTCTGCTCCAATTGACATTTGACCCTCCCGGGTTGTTCTTAACGTTACATGAATAACACAAAAAATATTGGAGGTAATATAGGAAAAATGGATGGAAAATTATGGATAAATTTCTTCCATTAAATTTTCGTAGGGTATCTTTGGTTTAAATTTTTAAGTATTTAATTTTTAAAAAAAATTAAAATATTTGACAACCACTCAAAATAATTTTTAAAACATTTTAAAATTTTATTTTAAATAATTTTTATAAAAAAAGGTTTAAATAAAAATGAATTTTTTAAATAACACTTTTTTCTTTAGTCCTTTTCGAGATAGTTGATTAAA
AGACACATTTTTCAAGTTCAAAATGAATTTTGCTACACTTTTTAAAATTTAAGATTTTTGATGCTTACATCATTTTCAAACAATTTTACAAAATTATCCTTACTTCCTTTTTTTCTCTTCCTTCATCTTCTACGTTTTCCTTTTTTTTTTTTTTTTTTTATATATATATTTCATTTTTTTCTCCTCCTTCTTTTTCTCCGACTAGCTCTGGCGAGATTCTTGTTCTCTTTTTTCTTCTTCTTCTTTTTCGACTAGCTCTGGCGAGTTCACACGAGCAGCTTCGATCGACTTCTTCTTCTCTCCCTCGTTTTAGTTTGTGATGCAAAATGAGACCGAACACCGATGAGCAAGGGAGCTAGACAACGAGAATGCGGGAGAGAGAGGAAATTGACAGAAAGAGGAGGTTGAGAGAGCGAAGATCGAGAGCAAAGACCAATCCGCATGCGCAAGATTCTATTTTGGTCATTTCACCCATTCGTGATGTCAACAAAGGATCAAACAGCTTAAAATTCAAAAGATAGAGCAATTTTCATTTTCTTCTCAATTTTTAAATCATTCATCTGTCATACCCGTCGTTCCAAACATGCCCTTAGTTACTTGCTTAAAAAGGTAGAAGAGTTAAATTCTTAACTAAATTACCCTTTTTTCTCTAGTCCTTTCAAATACGTCATTAGTCACTTTTCTAGGAAATAGTTTCTTTTTTAACTTTTACATTTTACCACCCCTAATTTTTACCTATTCTAGCAGTCAAGAACACAAAGGACTTATTGGAAAATACAAATAAATTACATTTAACAAAAAACCGCTGCCTGTTCTTGTTTCAACTTCCACCGCCGTCATATAATCCATCCAATTTCCACATTTATAAATTCTTTTGATCTTCTTCAGCTGAACTATTATGCTTGCATCTTTCTTCTCTCTCCCTCGGATTCCTATTCGCTCTCTAACCAGGTACTTGGCTCTCTCTCTGGACTATGAAGCTCCAATTTAATTTCGGATTATGTTAATTGATTCTAGTATTATTATTAAATCCTTCTTCGACTATGTCCGCCTCCCAAGGCACTGCGGATTGACGTACCGATTAAAGAGTTTTTTTTTGGAAGCTTAACTTGGTTGATTTTGTTTTTTGTTTTTGCATTTGGGAGTTTTGAGAGGATTACGAAAAGAATAGCGTTCCTCTTGTTTGTACTTATGATTATTGTTTGAATGACGCCGAGGGAGTGAAAGTGAAGTGATATGAGTGAGGTTATTGCTTGTGGAGTTGTTTTGATTTCAGTTTCGATTGAGTTGTGCTAGTCAAGTAGCAGTTTATTTTTGTGGAATCAAGAATGGTATGGTTAATTTTCTAGGAGGCGAGTTTTCTTGAGAAAATTAACAAGAACCTGGAACATTCAATGATTTAGGGGGAGTGGTTTTGAAAGGGTTAAAACCACTATGTTCAAAATTACACTGAAATTTGTCTTTAGTCATTTAAAATCAATTAAATGTTTGATTTTACTCATTTTGCATTTGATTTTCATACCATCAAAACTAATTTTGAATCATTAAATGCATATTTGAGAGTGATTTTGAAAATGGATCAAAGTATATTTAATCATTTCAAAATTATTCTCAAACATGTCTTAAAGAATGATTTTGGAATGGTTAAAATCACTTTTGTGGTGTTCAAAGCCACTCTAAAACATGCATTTAGTCATTTAAAATCAATTTAATATTTGATTTTACGTTTTTAAACGTGATTTTCAAAGCATGAAAATTTGATTTTGAAGATTAAGAGTATGTTTTGGAGCAACTTTGAACATGAAAAAACTGATTTAAACCAATTTCAAATTATCCCCAAACTTGAAGTAATTGTCTAAGATGCTTTCAGAGAAATTGTTTTGAGTTGTTATTTGCTTTGGCTTATTTTTCTTCTTTTTATGTTATATTACTGGCAGAAATATTACACGTAACATGAGCGCAATACAATCACCAAAACATCCAAAATTTATCTCTGTGGA
AGGGGCTGACATTTACTCAAGAAGTAAATCTGATGGTATGTATATTCCCCTTGTTTCGATGGAACAGAAATGGCTATTTTGATCGGCGATGATCATGTTATTCTTGCTCTGGATTTTTCTATGTGGTTAAATTAGACTTGTTGATATTCATCATTCCACTGAACAGTGTGCATCAATAGATTTTTCTCATTTAAAATTTTAAACAACTTATACGTTTTCTAAGCATGACATTTCTATATTGCTTCAATCATTATCTAAACATGACATCTTTATGTTGCTACTGCAGGTATCAGATTTCGTTTTGTTTCTTACAATATTTTAGCGCAGGTGGGCCACACTTTCTGATTTATTTCATTCTTTATTCTTAAGCTTAATTTTATGCTTCTGTTGTTTGTTAAACATCCTATTGCAGTCAATAATATACAAGCAGAAATGGGTTTCTCTTGCCTCTTTGTGACGGGACATCATAAATTTGGTTCTTCACGGAATGGATTCAAGCAACTTTCTTTTTATTTTTATGCATTTTTTTTATACTATTGAAATTATTGAATATGACTCATATTCCTTGATGATTTTGGACTCTACTTTTGATGAAAAGTTCAAACCGTTGATAAGGTTGATGTATTTTCTTGGCATGCCGGTCAAAGAAAATTAATGCTGATGGAATGATCTTTTGTAGCATTTGAATTTTAATCCATAGTCAATGACAGGATAACATTCATTTTTATGGTAGCATATTCAATTGGACTAATTTATTTTGTTTTAAGAAATCGGATCACATTGCATATTGACTATTGTTTTTGTATTCCTTTTGCAGGTTTATGTGAAGAGTCGTTTTTTCCCACATTCTCCATCCTCTTGCCTTAGGTATATATCTTTTTTACTTTCTAATTTTTATCAGTCGATTCTTAAAAATTATCTTTATATCAGTAATAAAGCACAATAGATAGATTTGGTGGAAAGTTAAGCCATATATATGGTGCCATGCATCCTTCAGGATTATAGTAATTTTATACAACTGAATTCACATTTGTTACATAAAAATAAGTCACAACCAAGCTAAAAAGTCCAAGAATAACTTGTCTTACCAAATCAGTTGTAAATGTTAATAAGGTCGCCAAAATGGACATTTTCTCCTCTACCATAATTAATAGATAGCATCGTGTACCATTTAACATTTATTGTCTGATATCCAGTTCTTGGGCTGGATGAAAGATTCTATGGTGAACACAAAATGTTAATGTGTTTAATGGTTCTCACACTTGTTGTAAGAACAGCTGGGAGCCTGGGATTGTGAAAATTGTGATAAATTAAACTCTATTCTAGAGCCAAAAAAACCAGTTATGCCTCATTCATATTTTAACTTGTAGTAGGTTTAACATCTGATTTACAGGTGGAAAGCTCGATCGCAGGCAATTTTAGCAGTTCTTAAGAATCTTGAGGCTGATTTTCTCTGTCTACAGGTTTGTCCCTCTCTCTCTCTCTCAATTGAAATAGCTAAAGGCAAGTGTACGATGAAGCTATATCTGTTTTCTGAACTGAGAATAATGATGGAATTTCAAGTGCTAAATAATATTGCAATGCAATGAGAAAACTCAGATTGTTCATTGTACCAAATTTGCAAGATGTCAGATGCAGTGTAATTGGGGACCAGGGTGTACAACAAAGTGTAATTGTAGTGTAATTTAAGTTTTTTTGTTTAGAAGGATTCAAATCTCTGATTTAACATATCCTTATATAATGGCTACAAAATGAGCTATAAACTTAAAAGTAATTCTGGGTCTGTTATGTTGCACCACACAGGATATTGGGATATAAGAAGCGACCTCTTAGTATGTTACGAGCTTTTCAATAAAAACTACCCCAGCCCTTAGGCTGTTTTCATATTTTTCTTGTAATGCCATATAGTTTCTTGTCTGAAAAAACATTTTTTTTCTGGTCAACCCCAAACAGGTTCTAAGTCCAAGTTGAAAACCTGAAATACTTGCATGGAACATTTA
TAGATGGCCCTGAATTGTTTGAATAAAGACCGGCTAAAGCTCACTATAAGAGCTCCTCTATTTCCTAATGCTTTCCCTTAATACCAAATCCTAATTATGCACAAATTTTTGTGAAACCTACCCAGGAAGTTGATGAATATGATAGCTTTTACAAAGGAAATTTGGAAAAATGTGGATATTCCAGCTTATATATCCAGAGAAGTGGGCAGAAACGTGATGGATGTGGGATTTTCTTCAAGCATGAAAAGTATGCTATTTTCTTTTTCCTTGCTCTTTCTCAAAGAAAAATAGCTTCTTATCATGGGCACTATCATCTTACCCATATCTTTAATTAAAAACAAAAATTTGACATTTCTAGCTGTTGCAATTGAACTAATTTGAAAACCGATTGAGACATAAAGACAACTTCTATTCCTAGTTTTTGATTGTGTAATGCCCTATTAACTCCATTTTCCTACTGCCTTTAGACCTGTACTCCAGTCAATGTTTTAGTTGTTGACTTTCTCTATGTTTTTGGATGTTATACATACAAGTTTACAAATATACAAAAGTTCAGAACTTTTACCGATTTCTTGGCTTGCTGATAACTAATAATCATGTTCAGAGAAGTTATTCCAATTTTTACAACTTTTGAACTCAGGAACCTAGGTTTAGACTATCCCAGAATGTTTTGAACTGAAGTATACCTTTGATGCAGCGAAAGCCCTATGCACATATTTTTTAATTTTCTTGCAATAGATCCATGAACTCATCTTTGCTGGGTTATGTTATTCCTAATCCGAATTTTTTTTCCTTATAATCTGCGTCTCATCTAATATTTTTATTTCCTCCACTGTTTCAGAGCTGACTTGATCATAGAGGATAGAATTGAATACAATGATCTTGTAAACTCTATACAAGATGATGGTTGTTCTTGTGAAGATAAGTCTGAAGATGTGGTAACCAGTGCTAGTAATGATGTTGAATCAAACAAGGGTAACTTCCCTATAATAGTAATTATTATTTAAGCTTAGTCTAAAACAGCCTAAGCTAACGACAAATCATGAAAGAAAAAGAGTTATATGCCGCTTCTAGATTTCTGAGGAAGTAGGTGTAGACTGAGTAAGAAATCCTGTTTCCTTAATTCTGCAAGAGATATTACAAGATTGAAATGGGCATGGCCATTCCTTGCTCTTAGATCCAAGACGGTTGTATGAGGACAGTTTCTTTTCTTTCTCTGTTTTACCAAGCCAAAAACAGTCTTTAGGTCTCATTATTACATGCATTTGGTGGGGCGAGTAATCTTCAAAGGTGTGGGAGTTCTTATAGTATTGTTTGGTCCCTTGCTATATTCTCTGTTTCTCCCCGGAGTTTTGTTTCCAAGCTCTTTTGTAACTATTTGCTTTAGGTCTTATTTCTCTTGATTGGAGTCCTTTCTTGTAGTTTAGTTCCTTTTTGTGGGCTTTTCTTTTTGTATACTCGTGTATCCTTTCATTTTTCTCAAAATGAAAATCAAGTAATTATTACATGGATATGCATTTAAATAGTATTTAGTGATAGACTTTTTCTATCATATAATTGGCTTCCTCTCCACTGAATCTATTATTACAAAGTTCATCAAAAGTGCGTGTGTTCTAATTCCTTCTAAACTATTTCTATGGATTTTCATAAATCGCCGATGTCTTTGAAGATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTCTTTCCCAGAAGAAACTCAGAGTATGTATTCTCCTAATATGAGGGTTGAGGGGCATGCAATTGGTACTTGGTAATGATGAAGACAGCTTTTGTGATTAGTTTGTAGAATGATAGGAATGGTATTAGGGTGTTAGAGGGTTATTAAAGGTACATTAGCAATTAGTTAGGGGAGTTGGTTATTGAATTTGGATGTAAATAGAGTGAAGGAGAATAGGGAAGATAAGAAATATTTTGGCGAGTGGTTTAGAGCTTGAGAGAGATCTCAAGATTGGGAGGATCCGAATATCTCGAATTAC
ATGGTGTATCTTGTAGTTTGGTTACCATATATATCTCAATATATTTGAGTTCTATCATAGAAGTGAGAGGAGCATTACCCATAGTTTATTTTGTGTATTTCTATTTCCAATACATTTTCAAAAAGACAAATATTGGTGTTATATTTTTGGTATAGCTGACTTACTTAGTTACTTTGCCATGGTCTATGGGAGCTAATTCTACAAACACCTGAATTCAGGTTCGTCACCAAAAGCTACTGTCGCCGATCGTGGGGATCCTAACGATCCCCGAGTAAGATTAAAACGTGATTGCGTTGGAATTATGGCTGCTTTCAAACTCAAGAAGCCTTTTCATCATGTTGTAATTGTAGCAAACACTCATCTTTACTGGTAAAATTTTACTTTCTAGTTTTCTTTGCCCTTACCTGGAAAGGCTATACGGTTGCTTTTAAGACCTGTTGTATTAAAGAAATAGATATTTTGTTACAGGGATCCAGAATGGGCTGATGTCAAGATCGCCCAAGCCAAATATCTTTTATCGCGCCTAGCTCGATTCAAAACTTTAGTAGCTGAAAAGTTTGAATGCACACCCTCAATACTTTTGGCTGGCGACTTCAATTCAACCCCAGGAGATAAGGTTTGTATTCTATCTTGATTTATCATTTGCATAGGATTTTCTCTTCCTGAAGAAATGGTATTTATCTATTTCATTGTAATTATTATGTCTATATTACAAGTTTAGTCCTTAAACTTTGTGTCTATTTGGTCACTGAACTTTCAAAAGTGTCTAAAAGATCCATATTCTTTCAATTTTGCTTCAACAAGTCCCTAAACTTTAAATTCATGTGTCTATTAGGTGCTATTATGTTGGGAAAAAAATTAATATAAATTTGAATTTCATGTCTAATGGAACCCCAAACTCAATTTTGTATCGTAGATTCAAGAGTTTTAAAACCTTCAAATAGGTCAAGGATCTATTAGACACATAATTGAAAATTTAGAGATTTATTGGACATTAAATTGAAAGTTTAAAGACCTAGTAGACACTGTTTGAAGTACATGGATCTTTTCAACTCAAAACTAAAATTTTAGAGATCAAATAGGCACAACCCTCGAAGTTAAGGGAATAAATTTGTAATTTAACTATAATATTATTGAGTATGAGCTTTTTTGTGGTTAGTGATATTAGAACAAGCTGATGACCGTAAGCTAACAGGGTGACAATGTTAAAAAAAATTTATAATTTATTTTATCAATGTTAAAAAATTTCATTTGTTTCTATTTCCCAAAATGTTACCGTTTAAAAAAATGGGTAATTTAGAGTGTGACGATGATAAATGGCATATAGACAAAGGAAGAAGAGATAAGCACCGTGACATAGTTCAGTGGTCAAAAAGGGGCCCGGACTCTTTGGATAGGATATAGACATATTAATAGAGATCAAATTAAAGGGAAAAAAAGAAAAACCATTTTGGTCCCTAGTTTTTGGTCTAGTTTACATTTGGTTCCTAAAGTTCAAAGCGTTAACACTTTTATTTATGAGTTTTGAGTTTGATTTTCATTTTGGTTAAAAAATTTTAAGATATTATATTTGTAAACGGCTTTAATTCCATTTAGTCCCTAGGTTTCAAGACTCACACTTTTGCCCTTGATTTTTCACTAAATCGTAAATGCTCACTTTCGATCATTGGGGTTGATTTCTATTAATTAATTTAAAAGAATTAGATTAATTCTACTTAAATTAGCTTTCACTAATTTTCCATCGCTATTAGAGTTGAAAATTTTCACATGATAGTTATTTTAAATTAATTAACTGAAATTTAACTCCAAAATGAAAAGTGAGCATTAGTGAAAATTTGGAGGTAAAAGTGTAAGATATTAAAATCTATGGATCTAATGGGAACTAAACTTAAAACTGGGAACCTAGGGACCCAATGGTAAAAAAATGTAATATTTTAAAACATCTAAACCAAATGAAAACTAAACCCAAAACCTAGAGACCAAAATTAAAACAAAACCGT
ACTTAAAAAAATCATATCATCATCTTAACTTAACAATAAAAAAGTTTTAAAAAAAAAATCCTCATACTTCCTTTTTTACCCCTCATAAACTCTCACTATTGGAGGCCTAACACTGTGCTACTTCTCATAATTATTTAGGTATACCAATACCTTGTTTCAGGCAGCTCTTCTTCTGGATTTTCCCCTGAAAGCTTGGAAGAGCTTCCTTTACCCCTTTGTAGTGTGTATGCTTCTATACTAGGGAGTGAACCTTCCTTTACAAACTTCACTCCTGGCTTCACTGGTACTCTTGATTATATATTTCTCTCACCTTCTGACTCTATGAGACCAACTAGCTTTCTAGAACTCCCAGAATCAGAATGGCCAGAGGTTATTGGTGGGTTACCCAATTTTAACTACCCAAGTGATCATCTTCCTATTGGTGCTGAATTTGAAATCACAATGGAATAA

mRNA sequence

ATGCAGTTTCTTCAAGGTGCATTGCCGGAGGGAGAACCACGGCGGCAAGACTTGAAGAAGAAATCCATGGCAGCGAGATATGAGGAAGAAGCCAACGACGACGAGATCTTGGCAGGTTGTCAAAGGGAGGTGCCTCGTCAGAAGATTGGAAGCTTTGGTGAAGATGGAAGGTTCAGTGGAGCTTCCGGTGAAGTAGCTTTTGAGATTGAGTTTGTTGAGAGATGCTGTTCCGGTTCCGCAGTGAAACGCCGCTGCGAAATCCGTTTCTGGACTCAGGGGTTCTACGAGTCTATTGAAGTCCGTGACCTTTCTCGGCGCCCAACCTTGTGTGCTTGCCTTTTCCGGTTTTCCTCCTTAAGTTTCCTTAATGCAACTTTTGAGAACTGGCGAAAATCATCCAAACACATACCTTCAGACAAGTATCCAAATTGCCAGAAAAGACGGACAAACATTGCGGTGTATCTTGTTGAGCCAGTTGGCTTTGGAGCAGTTCTGGATGTCTTATTCTTTCTTGGAAGTAGTATGATAGTATCTGACTACGGTCCAAATAACAATAATTACCGAAGTTTCTGCTTTCGTTTGGGCTGGATTATGACTTTTAATGGTCTTGAGAGTTGCGTTTTAAACAATCAAACCTTCGAAAATGAAAGCAGATCGAGCAGAGCAGATGAATGTACAACTGACTCACTAGAAGATCATGATTCAAGCTCTTCTTCTAGCAAGGATGCTAGTGGATCTTTCTCCTCAAAATGGTTGGCAATGCATAGGGACGAGCAGGATTTGGATGAGTGGGAACAACCAGAAAGCCCTCAGCATTTTTACATGAAAGAGAAACATGATTACACTCTTCAGGTTTCAGACATAGAAGCAATGAAAGAAAAGTTCACAAAACTATTGCTTGGTGAAGATGTCACAGGAGGGCAGAAAGGGCTGAGCTCTGCGTTGTCACTGTCAAATGCCATCACCAACCTAGCAGCGTCTGTCTTTGGAGAACTATGGAAATTGGAACCTCTTCCTGAGGAGAGGAAGAGTAAATGGAGAAGGGAAATGGACTGGTTGCTCTCTCCTACCCACTATATGGTTGAATTGGTTCCTACAAAGCAAAATGGTACGGGCGGTAGAGTGATGGAGATAATGACTCCAAAGGTTCGGGGAGACGTTCACATGAATCTTCCCGCTCTCCAGAAGTTAGACTCCATGTTAATTGGAACATTGGATTCCATGGTGAAGACAGAGTTTTGGTATGCAGAAGTTGGTAGCAGGGCTGAAGGAAAAAGCAAGAGTATGGGCCAAAGCACGAGATGGTGGCTTCCATTACCACAAGTACCATCCACTGGGCTATCTGAGAATGAGAGGAAGAAATTGCTCAACCATGGCAGGGTTGTACATCAAGTATTCAAAGCTGCCAAATCCATCAATGAAAGTATTTTGCATGAAATGCCCGTACCAACTGCTGTAAGGGAGGCAGTCCGAGCTTCTGGAAAAGCAAGTATGAGCGAAGAACTCTACAAGATTTTGACATCAGAATCTGGTCCAGCTGAGAACATGCTGAATCAGCTTAATCTGAAATCCGAACATGACGTTCTCGAGGCCATAAACCGCCTTGAAGCTGCAATATTCTCCTTGAAAGAGAAATATACCGAACAAAGTGGCAATAAATCTCCAGTTCGAACTTCTTGGCCTTTTGTCAAGGACCCAACGGCTGGGATTGATAAGTTGAAATTACTCACTGATCAAGCCGAGGTTCTTCTACAGCTACTGAAAAATAAATATCCAAATCATCCCCAAACGTTTCTGGATGTTTCAAAAATCCAATATGAGAAGGATGTTGGGCATTTGATTCTGGAAGCGTATTCGCGGGTGCTCGGAAACTTAGCTTACAGCATACTGTCTAGAATTGGAGATGTTCTGCAAGTGGATGCTATGTGCAACCCAAATTCACCTGCACCAACATGTTGTTTTCCAGGGATGAGTCTGTTGAACAATCGCAGCGATCAAAT
GTCCGCTCTTCATTCGTGGCAGCCGCTTATCGGTCACTCGAACAGTCCCAATATGACTTTGCCATCCAGCAAAGTTAGTGGAAACTCTCCGACCGCAACCCCGAGCCGAAACCGAGCATGGTGCATTGGCAGAGAGATCAAATTCACCCGCCAACTTTTCCCTGCAATCGAATTGGTTGGTGTATTAATTGAGTTCATATTTATTTTATTTCCTTCTCCAAATCCTTCTCTTCCCCTGCAAATGGACTTGCCAAGATTCGGCCGTCCAAAGGAAGACAATGGATCCTCTTCTTCTAGCCCTAATCTTTACGTTGCAAACTGTGGACCGGCCGTCGGAATCAGCCACCGCACAGTTGCGGCGGTTTTCGGCGATTTTGGGCTCGTGAAAGGGGTTCATGCCGCCGACGAAACGGGCGCTCGCGTCATCGTATGTTTTTCAGAAGAATCGAGTGCCCGAGCCGCCCTTGAGGCGCTTCACGGCCGCCCTTGCGCTCTCCTTGGAGGCCGGACTTTGCACATACGTTATTCGATCATCAGACCATCCATTTCGCACCCCAATGATTCTGTTTCAGTTTCTTTGTCGGCTTCGGAGCTGGACATTCCCGGACTTTTCTTATTGCACGACTTCGTCAGTGCTAAAGAAGAGGAGGATTTGCTTATGGAAGTTGATGCTCGTCCTTGGAATAATCTGGCGAAACGTAGAGTTCAGCATTATGGGTATGAGTTTTGTTATCAAACGAGGAATGTTAATACTAAACATCAGTTGGGGGAACTTCCATCATTTGTTTCCCATGTAGTTGATAGGATCTCCATGTTTCCAAACGTTGAGAATGTTGCAGATGCTTCTCTTGATCAATTGACGTATGAGACTGAAATCAATACCACATTACAGGTTAATGAATACCCACCTGGGGTGGGTTTGTCCCCTCATATAGACACCCATTCTGCATTTGAAGGATTAATTTTCAGCCTTTCCTTAGCAGGGCCATGCATTATGGAGTTTAGGAGATATCCCGAAGGCACTTGGCACAAATGCCCTTCAAGTATAGATTTGAAAATGGGGAATTCTGTAAACGACTCAAATTATCTAAGGAGAGCCATTTACCTTCCACCTCGGTCTATGCTATTACTGTCTGGAGAGGCACGTTATGCTTGGCATCATTACATTCCTCACCACAAGATTGACATGGTGAAGGACAGTGCTATCAGAAGGGGTCCTAGGAGAGTTTCTTTTACATTTCGCAAGGTTTATGTGAAGAGTCGTTTTTTCCCACATTCTCCATCCTCTTGCCTTAGGTGGAAAGCTCGATCGCAGGCAATTTTAGCAGTTCTTAAGAATCTTGAGGCTGATTTTCTCTGTCTACAGGAAGTTGATGAATATGATAGCTTTTACAAAGGAAATTTGGAAAAATGTGGATATTCCAGCTTATATATCCAGAGAAGTGGGCAGAAACGTGATGGATGTGGGATTTTCTTCAAGCATGAAAAAGCTGACTTGATCATAGAGGATAGAATTGAATACAATGATCTTGTAAACTCTATACAAGATGATGGTTGTTCTTGTGAAGATAAGTCTGAAGATGTGGTAACCAGTGCTAGTAATGATGTTGAATCAAACAAGGGTTCGTCACCAAAAGCTACTGTCGCCGATCGTGGGGATCCTAACGATCCCCGAGTAAGATTAAAACGTGATTGCGTTGGAATTATGGCTGCTTTCAAACTCAAGAAGCCTTTTCATCATGTTGTAATTGTAGCAAACACTCATCTTTACTGGGATCCAGAATGGGCTGATGTCAAGATCGCCCAAGCCAAATATCTTTTATCGCGCCTAGCTCGATTCAAAACTTTAGTAGCTGAAAAGTTTGAATGCACACCCTCAATACTTTTGGCTGGCGACTTCAATTCAACCCCAGGAGATAAGGTATACCAATACCTTGTTTCAGGCAGCTCTTCTTCTGGATTTTCCCCTGAAAGCTTGGAAGAGCTTCCTTTACCCCTTT
GTAGTGTGTATGCTTCTATACTAGGGAGTGAACCTTCCTTTACAAACTTCACTCCTGGCTTCACTGGTACTCTTGATTATATATTTCTCTCACCTTCTGACTCTATGAGACCAACTAGCTTTCTAGAACTCCCAGAATCAGAATGGCCAGAGGTTATTGGTGGGTTACCCAATTTTAACTACCCAAGTGATCATCTTCCTATTGGTGCTGAATTTGAAATCACAATGGAATAA

Coding sequence (CDS)

ATGCAGTTTCTTCAAGGTGCATTGCCGGAGGGAGAACCACGGCGGCAAGACTTGAAGAAGAAATCCATGGCAGCGAGATATGAGGAAGAAGCCAACGACGACGAGATCTTGGCAGGTTGTCAAAGGGAGGTGCCTCGTCAGAAGATTGGAAGCTTTGGTGAAGATGGAAGGTTCAGTGGAGCTTCCGGTGAAGTAGCTTTTGAGATTGAGTTTGTTGAGAGATGCTGTTCCGGTTCCGCAGTGAAACGCCGCTGCGAAATCCGTTTCTGGACTCAGGGGTTCTACGAGTCTATTGAAGTCCGTGACCTTTCTCGGCGCCCAACCTTGTGTGCTTGCCTTTTCCGGTTTTCCTCCTTAAGTTTCCTTAATGCAACTTTTGAGAACTGGCGAAAATCATCCAAACACATACCTTCAGACAAGTATCCAAATTGCCAGAAAAGACGGACAAACATTGCGGTGTATCTTGTTGAGCCAGTTGGCTTTGGAGCAGTTCTGGATGTCTTATTCTTTCTTGGAAGTAGTATGATAGTATCTGACTACGGTCCAAATAACAATAATTACCGAAGTTTCTGCTTTCGTTTGGGCTGGATTATGACTTTTAATGGTCTTGAGAGTTGCGTTTTAAACAATCAAACCTTCGAAAATGAAAGCAGATCGAGCAGAGCAGATGAATGTACAACTGACTCACTAGAAGATCATGATTCAAGCTCTTCTTCTAGCAAGGATGCTAGTGGATCTTTCTCCTCAAAATGGTTGGCAATGCATAGGGACGAGCAGGATTTGGATGAGTGGGAACAACCAGAAAGCCCTCAGCATTTTTACATGAAAGAGAAACATGATTACACTCTTCAGGTTTCAGACATAGAAGCAATGAAAGAAAAGTTCACAAAACTATTGCTTGGTGAAGATGTCACAGGAGGGCAGAAAGGGCTGAGCTCTGCGTTGTCACTGTCAAATGCCATCACCAACCTAGCAGCGTCTGTCTTTGGAGAACTATGGAAATTGGAACCTCTTCCTGAGGAGAGGAAGAGTAAATGGAGAAGGGAAATGGACTGGTTGCTCTCTCCTACCCACTATATGGTTGAATTGGTTCCTACAAAGCAAAATGGTACGGGCGGTAGAGTGATGGAGATAATGACTCCAAAGGTTCGGGGAGACGTTCACATGAATCTTCCCGCTCTCCAGAAGTTAGACTCCATGTTAATTGGAACATTGGATTCCATGGTGAAGACAGAGTTTTGGTATGCAGAAGTTGGTAGCAGGGCTGAAGGAAAAAGCAAGAGTATGGGCCAAAGCACGAGATGGTGGCTTCCATTACCACAAGTACCATCCACTGGGCTATCTGAGAATGAGAGGAAGAAATTGCTCAACCATGGCAGGGTTGTACATCAAGTATTCAAAGCTGCCAAATCCATCAATGAAAGTATTTTGCATGAAATGCCCGTACCAACTGCTGTAAGGGAGGCAGTCCGAGCTTCTGGAAAAGCAAGTATGAGCGAAGAACTCTACAAGATTTTGACATCAGAATCTGGTCCAGCTGAGAACATGCTGAATCAGCTTAATCTGAAATCCGAACATGACGTTCTCGAGGCCATAAACCGCCTTGAAGCTGCAATATTCTCCTTGAAAGAGAAATATACCGAACAAAGTGGCAATAAATCTCCAGTTCGAACTTCTTGGCCTTTTGTCAAGGACCCAACGGCTGGGATTGATAAGTTGAAATTACTCACTGATCAAGCCGAGGTTCTTCTACAGCTACTGAAAAATAAATATCCAAATCATCCCCAAACGTTTCTGGATGTTTCAAAAATCCAATATGAGAAGGATGTTGGGCATTTGATTCTGGAAGCGTATTCGCGGGTGCTCGGAAACTTAGCTTACAGCATACTGTCTAGAATTGGAGATGTTCTGCAAGTGGATGCTATGTGCAACCCAAATTCACCTGCACCAACATGTTGTTTTCCAGGGATGAGTCTGTTGAACAATCGCAGCGATCAAAT
GTCCGCTCTTCATTCGTGGCAGCCGCTTATCGGTCACTCGAACAGTCCCAATATGACTTTGCCATCCAGCAAAGTTAGTGGAAACTCTCCGACCGCAACCCCGAGCCGAAACCGAGCATGGTGCATTGGCAGAGAGATCAAATTCACCCGCCAACTTTTCCCTGCAATCGAATTGGTTGGTGTATTAATTGAGTTCATATTTATTTTATTTCCTTCTCCAAATCCTTCTCTTCCCCTGCAAATGGACTTGCCAAGATTCGGCCGTCCAAAGGAAGACAATGGATCCTCTTCTTCTAGCCCTAATCTTTACGTTGCAAACTGTGGACCGGCCGTCGGAATCAGCCACCGCACAGTTGCGGCGGTTTTCGGCGATTTTGGGCTCGTGAAAGGGGTTCATGCCGCCGACGAAACGGGCGCTCGCGTCATCGTATGTTTTTCAGAAGAATCGAGTGCCCGAGCCGCCCTTGAGGCGCTTCACGGCCGCCCTTGCGCTCTCCTTGGAGGCCGGACTTTGCACATACGTTATTCGATCATCAGACCATCCATTTCGCACCCCAATGATTCTGTTTCAGTTTCTTTGTCGGCTTCGGAGCTGGACATTCCCGGACTTTTCTTATTGCACGACTTCGTCAGTGCTAAAGAAGAGGAGGATTTGCTTATGGAAGTTGATGCTCGTCCTTGGAATAATCTGGCGAAACGTAGAGTTCAGCATTATGGGTATGAGTTTTGTTATCAAACGAGGAATGTTAATACTAAACATCAGTTGGGGGAACTTCCATCATTTGTTTCCCATGTAGTTGATAGGATCTCCATGTTTCCAAACGTTGAGAATGTTGCAGATGCTTCTCTTGATCAATTGACGTATGAGACTGAAATCAATACCACATTACAGGTTAATGAATACCCACCTGGGGTGGGTTTGTCCCCTCATATAGACACCCATTCTGCATTTGAAGGATTAATTTTCAGCCTTTCCTTAGCAGGGCCATGCATTATGGAGTTTAGGAGATATCCCGAAGGCACTTGGCACAAATGCCCTTCAAGTATAGATTTGAAAATGGGGAATTCTGTAAACGACTCAAATTATCTAAGGAGAGCCATTTACCTTCCACCTCGGTCTATGCTATTACTGTCTGGAGAGGCACGTTATGCTTGGCATCATTACATTCCTCACCACAAGATTGACATGGTGAAGGACAGTGCTATCAGAAGGGGTCCTAGGAGAGTTTCTTTTACATTTCGCAAGGTTTATGTGAAGAGTCGTTTTTTCCCACATTCTCCATCCTCTTGCCTTAGGTGGAAAGCTCGATCGCAGGCAATTTTAGCAGTTCTTAAGAATCTTGAGGCTGATTTTCTCTGTCTACAGGAAGTTGATGAATATGATAGCTTTTACAAAGGAAATTTGGAAAAATGTGGATATTCCAGCTTATATATCCAGAGAAGTGGGCAGAAACGTGATGGATGTGGGATTTTCTTCAAGCATGAAAAAGCTGACTTGATCATAGAGGATAGAATTGAATACAATGATCTTGTAAACTCTATACAAGATGATGGTTGTTCTTGTGAAGATAAGTCTGAAGATGTGGTAACCAGTGCTAGTAATGATGTTGAATCAAACAAGGGTTCGTCACCAAAAGCTACTGTCGCCGATCGTGGGGATCCTAACGATCCCCGAGTAAGATTAAAACGTGATTGCGTTGGAATTATGGCTGCTTTCAAACTCAAGAAGCCTTTTCATCATGTTGTAATTGTAGCAAACACTCATCTTTACTGGGATCCAGAATGGGCTGATGTCAAGATCGCCCAAGCCAAATATCTTTTATCGCGCCTAGCTCGATTCAAAACTTTAGTAGCTGAAAAGTTTGAATGCACACCCTCAATACTTTTGGCTGGCGACTTCAATTCAACCCCAGGAGATAAGGTATACCAATACCTTGTTTCAGGCAGCTCTTCTTCTGGATTTTCCCCTGAAAGCTTGGAAGAGCTTCCTTTACCCCTTT
GTAGTGTGTATGCTTCTATACTAGGGAGTGAACCTTCCTTTACAAACTTCACTCCTGGCTTCACTGGTACTCTTGATTATATATTTCTCTCACCTTCTGACTCTATGAGACCAACTAGCTTTCTAGAACTCCCAGAATCAGAATGGCCAGAGGTTATTGGTGGGTTACCCAATTTTAACTACCCAAGTGATCATCTTCCTATTGGTGCTGAATTTGAAATCACAATGGAATAA

Protein sequence

MQFLQGALPEGEPRRQDLKKKSMAARYEEEANDDEILAGCQREVPRQKIGSFGEDGRFSGASGEVAFEIEFVERCCSGSAVKRRCEIRFWTQGFYESIEVRDLSRRPTLCACLFRFSSLSFLNATFENWRKSSKHIPSDKYPNCQKRRTNIAVYLVEPVGFGAVLDVLFFLGSSMIVSDYGPNNNNYRSFCFRLGWIMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHRDEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTRWWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQMSALHSWQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAIELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHRTVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIRPSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINTTLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSSIDLKMGNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVDEYDSFYKGNLEKCGYSSLYIQRSGQKRDGCGIFFKHEKADLIIEDRIEYNDLVNSIQDDGCSCEDKSEDVVTSASNDVESNKGSSPKATVADRGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIVANTHLYWDPEWADVKIAQAKYLLSRLARFKTLVAEKFECTPSILLAGDFNSTPGDKVYQYLVSGSSSSGFSPESLEELPLPLCSVYASILGSEPSFTNFTPGFTGTLDYIFLSPSDSMRPTSFLELPESEWPEVIGGLPNFNYPSDHLPIGAEFEITME
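
The protein sequence above corresponds to the translation of the coding sequence (CDS) listed before it. A minimal sketch of that translation, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database:

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into one-letter amino acids, stopping at the first stop codon."""
    # Remove whitespace so a sequence wrapped over several lines can be pasted in directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS shown above.
print(translate("ATGCAGTTTCTTCAAGGTGCATTGCCGGAG"))
```

Applied to the full CDS (with its wrapped lines joined), this should reproduce the protein sequence listed above; the terminal `TAA` stop codon is not emitted.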
Homology
BLAST of HG10013854 vs. NCBI nr
Match: CBI29214.3 (unnamed protein product, partial [Vitis vinifera])

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 586/903 (64.89%), Positives = 686/903 (75.97%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMH- 256
            IMT+NGLE+C+LN  ++ENES +SR D C TDSL++ D+S SSSKDA GSFSSKWL M+ 
Sbjct: 27   IMTYNGLENCILNGHSYENESHTSRGDGCATDSLDEDDTSCSSSKDAFGSFSSKWLTMNM 86

Query: 257  -RDEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSA 316
             +DE  LDEWE PESPQHFY+KEK  Y+++ SD+E MKE+F+KLLLGED+TGG+KGL+SA
Sbjct: 87   KKDEHGLDEWEVPESPQHFYVKEKPGYSVRFSDVEVMKERFSKLLLGEDITGGKKGLTSA 146

Query: 317  LSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGR 376
            L+LSNAITNLA SVFGELWKLEPL EERK KW+REMDWLLSPT+YMVELVP KQ+G  GR
Sbjct: 147  LALSNAITNLAVSVFGELWKLEPLSEERKVKWQREMDWLLSPTNYMVELVPAKQSGANGR 206

Query: 377  VMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTR 436
             +EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAE GSRAEG+++SM QS R
Sbjct: 207  TLEIMTPKARADIHMNLPALQKLDSMLIETLDSMVDTEFWYAEGGSRAEGRTRSMSQSKR 266

Query: 437  WWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASG 496
            WWLP PQVP+TGLS+ ERKKLL+  +VVHQVFKAA++INE++L EMPVPT +R+A+  SG
Sbjct: 267  WWLPSPQVPTTGLSDPERKKLLHQAKVVHQVFKAARAINENVLLEMPVPTLIRDALAKSG 326

Query: 497  KASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKS 556
            KA++ EELY++LT+ES  AE ML+ LNLKSEH  LEAINRLEAA+F+ KE+ TEQ   KS
Sbjct: 327  KANLGEELYRVLTAESSSAEEMLSSLNLKSEHSALEAINRLEAAVFAWKERITEQVSGKS 386

Query: 557  PVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLI 616
            PVRTSW F+KDPT  +DK++L+  +AE LLQ L+ +YPN PQ+FLDV+KIQY KD+GH I
Sbjct: 387  PVRTSWSFIKDPTTELDKMELILFRAEALLQQLRTRYPNLPQSFLDVAKIQYGKDIGHSI 446

Query: 617  LEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSP-APTCCFPGMSLLNNRSDQMSALHSW 676
            LEAYSRVLGNLA SIL R+ D+LQ D   NPNSP A T CFPG++L         +L   
Sbjct: 447  LEAYSRVLGNLASSILCRMRDILQEDVFSNPNSPIATTSCFPGINLTGMTETPTPSLRIR 506

Query: 677  QPLIGHSN-------------SPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLF 736
              LI   N             SP+     S    +S  ATPSR+R +    + K T  L 
Sbjct: 507  HSLIDQMNMVDGRFRDPNAGASPDCEASYSDSRRSSVMATPSRSRKY----KKKTTTHLK 566

Query: 737  PAIELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGI 796
               E                   L   M LPRF RPK  +G    SPNLYVANCGPAVG+
Sbjct: 567  SRFE-----------------QGLNYIMGLPRFSRPKGVDG--ELSPNLYVANCGPAVGL 626

Query: 797  SHRTVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHI 856
            S  T+A+VF  FG VKGV+ AD++GARVIV + EES+A+AAL+AL G PC  LGGR LHI
Sbjct: 627  SFDTIASVFSTFGEVKGVYPADDSGARVIVSYFEESAAQAALKALDGHPCPALGGRFLHI 686

Query: 857  RYSIIRPSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKR 916
            RYSI +P  S  NDSV VSL  SEL+IPG++LLHDFVSAKEEE+LL  VD   W +L+KR
Sbjct: 687  RYSIFQPP-SQVNDSVPVSLVDSELNIPGIYLLHDFVSAKEEEELLAAVDKMSWKSLSKR 746

Query: 917  RVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEIN 976
            RVQHYGYEFCY+TRNVNTK  LG+LPSFVS +V+RIS FPN+E+ AD  LDQLT      
Sbjct: 747  RVQHYGYEFCYETRNVNTKQYLGKLPSFVSAIVERISSFPNLESAADIVLDQLT------ 806

Query: 977  TTLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSSIDLKM 1036
                VNEYPPGVGLSPHIDTHSAFEG IFSLSLAGPCIM+FRRY EG W K  SS D+ +
Sbjct: 807  ----VNEYPPGVGLSPHIDTHSAFEGFIFSLSLAGPCIMDFRRYTEGVWPKSASSSDMSV 866

Query: 1037 GNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTF 1084
                  S++LRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDS IRRGPRRVSFTF
Sbjct: 867  EYPDKSSSFLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTF 895
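
The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal sketch for reading such a line and recomputing the values, assuming the line layout used in this report; `parse_hsp_summary` and `percent` are hypothetical helper names:

```python
import re

# Matches a summary line of the form
# "Identity = 586/903 (64.89%), <label> = 686/903 (75.97%), Query Frame = 0".
# The second label is matched with \w+ so spelling variants are tolerated.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), \w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_summary(line):
    """Extract identity and positive counts from one HSP summary line."""
    m = HSP_SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "length": int(length),
        "identity_pct_reported": float(ident_pct),
        "positive_pct_reported": float(pos_pct),
        "identity_pct": percent(int(ident), int(length)),
        "positive_pct": percent(int(pos), int(pos_length)),
    }


hsp = parse_hsp_summary(
    "Identity = 586/903 (64.89%), Positives = 686/903 (75.97%), Query Frame = 0"
)
print(hsp["identity_pct"], hsp["positive_pct"])
```

Comparing the recomputed and reported percentages is a quick consistency check when hits are copied out of the report by hand.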

BLAST of HG10013854 vs. NCBI nr
Match: CAN76404.1 (hypothetical protein VITISV_021238 [Vitis vinifera])

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 576/952 (60.50%), Positives = 683/952 (71.74%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMH- 256
            IMT+NGLE+C+LN  ++ENES +SR D C TDSL++ DSS SSSKDA GSFSSKWL M+ 
Sbjct: 108  IMTYNGLENCILNGHSYENESHTSRGDGCATDSLDEDDSSCSSSKDAFGSFSSKWLTMNM 167

Query: 257  -RDEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSA 316
             +DE  LDEWE PESPQHFY+KEK  Y+ + SD+E MKE+F+KLLLGED TGG+KGL+SA
Sbjct: 168  KKDEHGLDEWEVPESPQHFYVKEKPGYSFRFSDVEVMKERFSKLLLGEDXTGGKKGLTSA 227

Query: 317  LSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGR 376
            L+LSNAITNLA SVFGELWKLEPL EERK KW+REMDWLLSPT YMVELVP KQ+G  GR
Sbjct: 228  LALSNAITNLAVSVFGELWKLEPLSEERKVKWQREMDWLLSPTXYMVELVPAKQSGANGR 287

Query: 377  VMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTR 436
             +EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAE GSRAEG+++SM QS R
Sbjct: 288  TLEIMTPKARADIHMNLPALQKLDSMLIETLDSMVDTEFWYAEGGSRAEGRTRSMSQSKR 347

Query: 437  WWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVR--- 496
            WWLP PQVP+TGLS+ ERKKLL+  +VVHQVFKAA++INE++L EMPVPT +R+A+    
Sbjct: 348  WWLPSPQVPTTGLSDPERKKLLHQAKVVHQVFKAARAINENVLLEMPVPTLIRDALAKAS 407

Query: 497  -----------------ASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINR 556
                              SGKA++ EELY++LT+ES   E ML+ LNLKSEH  LEAINR
Sbjct: 408  KLFDLFPSNQSSCLKTLESGKANLGEELYRVLTAESSSTEEMLSSLNLKSEHSALEAINR 467

Query: 557  LEAAIFSLKEKYTEQSGNKSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNH 616
            LEAA+F+ KE+ TEQ   KSPVRTSW F+KDPT  +DK++L+  +AE LLQ L+ +YPN 
Sbjct: 468  LEAAVFAWKERITEQVSGKSPVRTSWSFIKDPTTELDKMELILFRAEALLQQLRTRYPNL 527

Query: 617  PQTFLDVSKIQYEKDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSP-APTCC 676
            PQ+FLDV+KIQY KD+GH ILEAYSRVLGNLA SIL R+ D+LQ D   NPNSP A T C
Sbjct: 528  PQSFLDVAKIQYGKDIGHSILEAYSRVLGNLASSILCRMRDILQEDVFSNPNSPIATTSC 587

Query: 677  FPGMSLLNNRSDQMSALHSWQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREI 736
            FPG++L         +L      I HS    M +   +    +  A+P    ++   R  
Sbjct: 588  FPGINLTGMTETPTPSLR-----IRHSLIDQMNMVDGRFRDPNAGASPDCEASYSDSRRS 647

Query: 737  KFTRQLFPAIELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVAN 796
                                 +  PS +  L   M LPRF RPK  +G    SPNLYVAN
Sbjct: 648  S-------------------VMATPSRSRGLNYIMGLPRFSRPKGVDG--ELSPNLYVAN 707

Query: 797  CGPAVGISHRTVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALL 856
            CGPAVG+S  T+A+VF  FG VKGV+ AD++GARVIV + EES+A+AAL+AL G PC  L
Sbjct: 708  CGPAVGLSFDTIASVFSTFGEVKGVYPADDSGARVIVSYFEESAAQAALKALDGHPCPAL 767

Query: 857  GGRTLHIRYSIIRPSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARP 916
            GGR LHIRYSI +P   +           SEL+IPG++LLHDFVSAKEEE+LL  VD   
Sbjct: 768  GGRFLHIRYSIFQPPSQY-------LWWISELNIPGIYLLHDFVSAKEEEELLAAVDKMS 827

Query: 917  WNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQL 976
            W +L+KRRVQHYGYEFCY+TRNVNTK  LG+LPSFVS +V+RIS FPN+E+ AD  LDQL
Sbjct: 828  WKSLSKRRVQHYGYEFCYETRNVNTKQYLGKLPSFVSAIVERISSFPNLESAADIVLDQL 887

Query: 977  TYETEINTTLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCP 1036
            T          VNEYPPGVGLSPHIDTHSAFEG IFSLSLAGPCIM+FRRY EG W K  
Sbjct: 888  T----------VNEYPPGVGLSPHIDTHSAFEGFIFSLSLAGPCIMDFRRYTEGVWPKSA 947

Query: 1037 SSIDLKMGNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGP 1096
            SS D+ +      S++LRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDS IRRGP
Sbjct: 948  SSSDMSVEYPDKSSSFLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGP 1007

Query: 1097 RRVSFTFRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVD 1126
            RRVSFTFRK  ++     H   +          +L V +   A+  CL  +D
Sbjct: 1008 RRVSFTFRKSDIRQMHTHHILFTMCE-------LLPVFRKHRAESFCLTRLD 1009

BLAST of HG10013854 vs. NCBI nr
Match: ESR46132.1 (hypothetical protein CICLE_v10004016mg [Citrus clementina])

HSP 1 Score: 1057.7 bits (2734), Expect = 8.5e-305
Identity = 549/895 (61.34%), Positives = 670/895 (74.86%), Query Frame = 0

Query: 198  MTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHRD 257
            MT++GLESC+LNNQ+++NESR+SR D C TDSL+D DSS SSSKDA GSFSSKW  M R 
Sbjct: 1    MTYDGLESCILNNQSYDNESRTSRGDGCLTDSLDDDDSSCSSSKDAFGSFSSKWSTMKRV 60

Query: 258  EQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALSL 317
            E   ++W   ESP H+++K+K  Y  +  D+E MKE+F+KLLLGED+TGG+ G+++AL+L
Sbjct: 61   EHGSEDWGPSESPNHYHVKQKPAYAFRFLDVETMKERFSKLLLGEDITGGRLGVTTALAL 120

Query: 318  SNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVME 377
            SNAITNLAASVFGELWKLEPL EERKS+W+REMDWLLSPT++MVELVP +QN + G+ +E
Sbjct: 121  SNAITNLAASVFGELWKLEPLLEERKSRWQREMDWLLSPTNFMVELVPARQNASNGQTLE 180

Query: 378  IMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTRWWL 437
            IMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAEVGSRAEG++KS  +S RWWL
Sbjct: 181  IMTPKARADIHMNLPALQKLDSMLIETLDSMVNTEFWYAEVGSRAEGRNKSTRESKRWWL 240

Query: 438  PLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASGKAS 497
            PL QVP++GLS++ RKK+L+  RVV+QVFKAAKSINE++L EMPVPT +++ +  SGK S
Sbjct: 241  PLAQVPASGLSDSGRKKMLSQCRVVYQVFKAAKSINENVLLEMPVPTIIKDVLPKSGKTS 300

Query: 498  MSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPVR 557
            + EELYK+LT+ES  +  M+N LNLKSEH  LEAIN+LEAA+F+ KE+ +EQ+  KSPVR
Sbjct: 301  LGEELYKVLTAESSSSGEMINFLNLKSEHSALEAINKLEAAVFTWKERISEQASGKSPVR 360

Query: 558  TSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILEA 617
            TSW F+KDP + +DK++ L ++AE L+Q LKN+YPN PQTFLD +KIQY KDVGH ILEA
Sbjct: 361  TSWSFIKDPISELDKIEFLLERAEALIQQLKNRYPNLPQTFLDATKIQYGKDVGHSILEA 420

Query: 618  YSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQMS----ALHSW 677
            YSRVL NLA+SILSRIGD+LQ DA+ NPNSP   CC PG  + ++  DQ+      L S 
Sbjct: 421  YSRVLANLAFSILSRIGDILQEDALSNPNSPVAKCCTPGSKMNDDNMDQIQMPWLRLRSR 480

Query: 678  QPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAIELVGVLIEFI 737
              LI   N  +    +S  S  S + T +         E K +                 
Sbjct: 481  HSLIDQMNKADGKYFTSDASSCSTSETSN--------SEAKSS----------------- 540

Query: 738  FILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHRTVAAVFGDFG 797
                 S N +  +     RF RPK   G    SPNL+VANCGPAVG+S+  + +VF  FG
Sbjct: 541  -----SVNSTPSVNRVCSRFRRPKA--GEDERSPNLFVANCGPAVGVSYEAIGSVFSAFG 600

Query: 798  LVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIR--PSISH 857
             VKG++AAD++GARVIV + +E SA+AA  +LH RPC  L  R LHIRYS++   P+  H
Sbjct: 601  DVKGIYAADDSGARVIVSYFDEGSAQAAFNSLHSRPCPDLANRFLHIRYSVLEDSPATRH 660

Query: 858  PNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFCY 917
               SV VSL ASEL+IPGLFL HDFVSAKEEE+LL  VD+RPWNNL+KRRVQHYGYEFCY
Sbjct: 661  ITSSVPVSLVASELNIPGLFLFHDFVSAKEEEELLAAVDSRPWNNLSKRRVQHYGYEFCY 720

Query: 918  QTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINTTLQVNEYPPG 977
              RNVNTK  LGELPSFVS +V+R+S FPN+++    +LDQLT          VNEYPPG
Sbjct: 721  DIRNVNTKQCLGELPSFVSSIVERVSSFPNLDDSTSVALDQLT----------VNEYPPG 780

Query: 978  VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSS---IDLKMGNSVNDSN 1037
            VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EG+W   P+S   +++ + N  + S+
Sbjct: 781  VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYLEGSW--LPNSTPGMNMAVENPDDYSS 840

Query: 1038 YLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRKV 1084
             LRRAIYLPPRSMLLLSGEARYAW+HYIPHHKIDMV D+ IRR  RRVSFTFRKV
Sbjct: 841  VLRRAIYLPPRSMLLLSGEARYAWNHYIPHHKIDMVNDTVIRRASRRVSFTFRKV 851

BLAST of HG10013854 vs. NCBI nr
Match: GAY53536.1 (hypothetical protein CUMW_149840, partial [Citrus unshiu])

HSP 1 Score: 1045.0 bits (2701), Expect = 5.7e-301
Identity = 545/895 (60.89%), Positives = 664/895 (74.19%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHR 256
            IMT++GLESC+LNNQ+++NESR+SR D C TDSL+D DSS SSSKDA GSFSSKW  M R
Sbjct: 87   IMTYDGLESCILNNQSYDNESRTSRGDGCLTDSLDDDDSSCSSSKDAFGSFSSKWSTMKR 146

Query: 257  DEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALS 316
             E   ++W   ESP H+++K+K  Y  +  D+E MKE+F+KLLLGED+TGG+ G+++AL+
Sbjct: 147  VEHGSEDWGPSESPNHYHLKQKPAYAFRFLDVETMKERFSKLLLGEDITGGRLGVTTALA 206

Query: 317  LSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVM 376
            LSNAITNLAASVFGELWKLEPL EERKS+W+REMDWLLSPT++MVELVP +QN   G+ +
Sbjct: 207  LSNAITNLAASVFGELWKLEPLLEERKSRWQREMDWLLSPTNFMVELVPARQNAANGQTL 266

Query: 377  EIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTRWW 436
            EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAEVGSRAEG++KS  +S RWW
Sbjct: 267  EIMTPKARADIHMNLPALQKLDSMLIETLDSMVNTEFWYAEVGSRAEGRNKSTRESKRWW 326

Query: 437  LPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASGKA 496
            LPL QVP++GLS++ RKKLL+  RVV+QVFKAAKSINE++L EMPVPT +++A+  SGKA
Sbjct: 327  LPLAQVPASGLSDSGRKKLLSQCRVVYQVFKAAKSINENVLLEMPVPTIIKDALPKSGKA 386

Query: 497  SMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPV 556
            S+ EELYK+LT+ES  +  M+N LNLKSEH  LEAIN+LEAA+F+ KE+ +EQ+  KSPV
Sbjct: 387  SLGEELYKVLTAESISSGEMINFLNLKSEHSALEAINKLEAAVFTWKERISEQASGKSPV 446

Query: 557  RTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILE 616
            RTSW F+KDP + +DK++ L ++AE L+Q LKN+YPN PQTFLD +KIQ+ KDVGH ILE
Sbjct: 447  RTSWSFIKDPISELDKIEFLLERAEALIQQLKNRYPNLPQTFLDATKIQFGKDVGHSILE 506

Query: 617  AYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQMS----ALHS 676
            AYSRVL NLA+SILSRIGD+LQ DA+ NPNSP   CC PG  + ++  DQ+      L S
Sbjct: 507  AYSRVLANLAFSILSRIGDILQEDALSNPNSPVAKCCTPGSKMNDDNMDQIQMPWLRLRS 566

Query: 677  WQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAIELVGVLIEF 736
               LI   N  +    +S  S  S + T +          +  T                
Sbjct: 567  RHSLIDQMNKADGKYFTSDASSCSTSETSNSEAK---SSSVNST---------------- 626

Query: 737  IFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHRTVAAVFGDF 796
                 PS N +                 G    SPNL+VANCGPAVG+S+  + +VF  F
Sbjct: 627  -----PSVNRA-----------------GEDERSPNLFVANCGPAVGVSYEAIGSVFSAF 686

Query: 797  GLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIR--PSIS 856
            G VKG++AAD++GARVIV + +E SA+AA  +LH RPC  L  R LHI YS++   P+  
Sbjct: 687  GDVKGIYAADDSGARVIVSYFDEGSAQAAFNSLHSRPCPDLANRFLHISYSVLEDSPAPR 746

Query: 857  HPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFC 916
            H   SV VSL ASEL+IPGL LLHDFVSAKEEE+LL  VD+RPWNNL+KRRVQHYGYEFC
Sbjct: 747  HITSSVPVSLVASELNIPGLHLLHDFVSAKEEEELLAAVDSRPWNNLSKRRVQHYGYEFC 806

Query: 917  YQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINTTLQVNEYPP 976
            Y  RNVNTK  LGELPSFVS +V+R+S FPN+++    +LDQLT          VNEYPP
Sbjct: 807  YDIRNVNTKQCLGELPSFVSSIVERVSSFPNLDDSTSVALDQLT----------VNEYPP 866

Query: 977  GVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSS---IDLKMGNSVNDS 1036
            GVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EG+W   P+S   +++ + N  + S
Sbjct: 867  GVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYLEGSW--LPNSTPGMNMAVENPDDYS 926

Query: 1037 NYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRK 1083
            + LRRAIYLPPRSMLLLSGEARYAW+HYIPHHKIDMV D+ IRR  RRVSFTFRK
Sbjct: 927  SVLRRAIYLPPRSMLLLSGEARYAWNHYIPHHKIDMVNDTGIRRASRRVSFTFRK 928

BLAST of HG10013854 vs. NCBI nr
Match: GAY53535.1 (hypothetical protein CUMW_149840, partial [Citrus unshiu])

HSP 1 Score: 1037.3 bits (2681), Expect = 1.2e-298
Identity = 545/904 (60.29%), Positives = 664/904 (73.45%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHR 256
            IMT++GLESC+LNNQ+++NESR+SR D C TDSL+D DSS SSSKDA GSFSSKW  M R
Sbjct: 87   IMTYDGLESCILNNQSYDNESRTSRGDGCLTDSLDDDDSSCSSSKDAFGSFSSKWSTMKR 146

Query: 257  DEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALS 316
             E   ++W   ESP H+++K+K  Y  +  D+E MKE+F+KLLLGED+TGG+ G+++AL+
Sbjct: 147  VEHGSEDWGPSESPNHYHLKQKPAYAFRFLDVETMKERFSKLLLGEDITGGRLGVTTALA 206

Query: 317  LSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVM 376
            LSNAITNLAASVFGELWKLEPL EERKS+W+REMDWLLSPT++MVELVP +QN   G+ +
Sbjct: 207  LSNAITNLAASVFGELWKLEPLLEERKSRWQREMDWLLSPTNFMVELVPARQNAANGQTL 266

Query: 377  EIMTPKVRGDVHMNLPALQKLDSMLI---------GTLDSMVKTEFWYAEVGSRAEGKSK 436
            EIMTPK R D+HMNLPALQKLDSMLI          TLDSMV TEFWYAEVGSRAEG++K
Sbjct: 267  EIMTPKARADIHMNLPALQKLDSMLIVGGLFFCLEETLDSMVNTEFWYAEVGSRAEGRNK 326

Query: 437  SMGQSTRWWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVR 496
            S  +S RWWLPL QVP++GLS++ RKKLL+  RVV+QVFKAAKSINE++L EMPVPT ++
Sbjct: 327  STRESKRWWLPLAQVPASGLSDSGRKKLLSQCRVVYQVFKAAKSINENVLLEMPVPTIIK 386

Query: 497  EAVRASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYT 556
            +A+  SGKAS+ EELYK+LT+ES  +  M+N LNLKSEH  LEAIN+LEAA+F+ KE+ +
Sbjct: 387  DALPKSGKASLGEELYKVLTAESISSGEMINFLNLKSEHSALEAINKLEAAVFTWKERIS 446

Query: 557  EQSGNKSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYE 616
            EQ+  KSPVRTSW F+KDP + +DK++ L ++AE L+Q LKN+YPN PQTFLD +KIQ+ 
Sbjct: 447  EQASGKSPVRTSWSFIKDPISELDKIEFLLERAEALIQQLKNRYPNLPQTFLDATKIQFG 506

Query: 617  KDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQM 676
            KDVGH ILEAYSRVL NLA+SILSRIGD+LQ DA+ NPNSP   CC PG  + ++  DQ+
Sbjct: 507  KDVGHSILEAYSRVLANLAFSILSRIGDILQEDALSNPNSPVAKCCTPGSKMNDDNMDQI 566

Query: 677  S----ALHSWQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAI 736
                  L S   LI   N  +    +S  S  S + T +          +  T       
Sbjct: 567  QMPWLRLRSRHSLIDQMNKADGKYFTSDASSCSTSETSNSEAK---SSSVNST------- 626

Query: 737  ELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHR 796
                          PS N +                 G    SPNL+VANCGPAVG+S+ 
Sbjct: 627  --------------PSVNRA-----------------GEDERSPNLFVANCGPAVGVSYE 686

Query: 797  TVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYS 856
             + +VF  FG VKG++AAD++GARVIV + +E SA+AA  +LH RPC  L  R LHI YS
Sbjct: 687  AIGSVFSAFGDVKGIYAADDSGARVIVSYFDEGSAQAAFNSLHSRPCPDLANRFLHISYS 746

Query: 857  IIR--PSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRR 916
            ++   P+  H   SV VSL ASEL+IPGL LLHDFVSAKEEE+LL  VD+RPWNNL+KRR
Sbjct: 747  VLEDSPAPRHITSSVPVSLVASELNIPGLHLLHDFVSAKEEEELLAAVDSRPWNNLSKRR 806

Query: 917  VQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINT 976
            VQHYGYEFCY  RNVNTK  LGELPSFVS +V+R+S FPN+++    +LDQLT       
Sbjct: 807  VQHYGYEFCYDIRNVNTKQCLGELPSFVSSIVERVSSFPNLDDSTSVALDQLT------- 866

Query: 977  TLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSS---IDL 1036
               VNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EG+W   P+S   +++
Sbjct: 867  ---VNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYLEGSW--LPNSTPGMNM 926

Query: 1037 KMGNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSF 1083
             + N  + S+ LRRAIYLPPRSMLLLSGEARYAW+HYIPHHKIDMV D+ IRR  RRVSF
Sbjct: 927  AVENPDDYSSVLRRAIYLPPRSMLLLSGEARYAWNHYIPHHKIDMVNDTGIRRASRRVSF 937

BLAST of HG10013854 vs. ExPASy Swiss-Prot
Match: Q56WM6 (Rop guanine nucleotide exchange factor 14 OS=Arabidopsis thaliana OX=3702 GN=ROPGEF14 PE=1 SV=1)

HSP 1 Score: 563.9 bits (1452), Expect = 5.0e-159
Identity = 306/545 (56.15%), Positives = 385/545 (70.64%), Query Frame = 0

Query: 197 IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHD-SSSSSSKDASGSFSSKWLAMH 256
           ++T+ GLE+C++NNQ++E ES +SR D C TDSL+D   SS SSSKDAS SFSSKWL M 
Sbjct: 29  MITYYGLETCIINNQSYEEESGTSRGDGCLTDSLDDDAFSSCSSSKDASSSFSSKWLPMK 88

Query: 257 RDE------------QDLDEWEQPE----SPQHFYMKEKHDYTLQVSDIEAMKEKFTKLL 316
            DE            Q  D  E+ +    S QHF  KEK  Y     D+EAMKEKF+KLL
Sbjct: 89  NDEHSCDGLNLSGRSQHFDAKEKKKQGYGSSQHFDAKEKPGYVYCHLDVEAMKEKFSKLL 148

Query: 317 LGEDVTGGQKGLSSALSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHY 376
           LGEDVTGG KG+  AL+LSNA+T+LA S+FGELWKLEPL EE+K KWRREMDWLLSPT+Y
Sbjct: 149 LGEDVTGGCKGVQVALALSNAVTHLATSIFGELWKLEPLCEEKKQKWRREMDWLLSPTNY 208

Query: 377 MVELVPTKQNGTGGRVMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVG 436
           M+ELVP+KQN   GR +EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWY+E+G
Sbjct: 209 MIELVPSKQNDANGRSLEIMTPKARADIHMNLPALQKLDSMLIETLDSMVNTEFWYSEIG 268

Query: 437 SRAEGKSKSMGQSTRWWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHE 496
           SRAEGK+KS  +S RWWLP PQVP  GLS + RKKLL+ G+VV+QVFKA K+INE+IL E
Sbjct: 269 SRAEGKNKSTSESKRWWLPSPQVPKPGLSNSGRKKLLDKGKVVYQVFKATKAINENILLE 328

Query: 497 MPVPTAVREAVRASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAI 556
           MPVP  ++EA+  SGK S+ +ELYK+L  ES   + +   LNL +EH  LE +N+LE+A+
Sbjct: 329 MPVPIVIKEAIPKSGKNSLGDELYKMLAVESATVDEIFISLNLGTEHAALETVNKLESAM 388

Query: 557 FSLKEKYTEQSGN-KSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTF 616
           F+ KE+ TEQ  N KSPVR SW F KDP + I + + L ++AE L   +K+K+PN P +F
Sbjct: 389 FAWKERITEQGSNGKSPVRASWSFAKDPLSEIGRNESLLNRAEALRTQIKSKHPNLPHSF 448

Query: 617 LDVSKIQYEKDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMS 676
           LD +KIQY+KD+GH +LEAYSR L NLA+ ILSR+G++L+ D++ NPNSPAP  CFP  S
Sbjct: 449 LDATKIQYDKDIGHAVLEAYSRTLANLAFRILSRMGEILKEDSLSNPNSPAPPSCFPS-S 508

Query: 677 LLNNRSDQMSALHSWQPLIGHSNSPNMTLPSSKVSG-----------NSPTATPSR-NRA 712
               R+ +   L S    + HS + +M       +G           NS   TPSR +R 
Sbjct: 509 RDPYRTPERPLLSS---RVRHSLTDDMNKADGTETGLDFLFADAKASNSVNTTPSRSSRL 568

BLAST of HG10013854 vs. ExPASy Swiss-Prot
Match: A8MS41 (Carbon catabolite repressor protein 4 homolog 4 OS=Arabidopsis thaliana OX=3702 GN=CCR4-4 PE=1 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 5.3e-116
Identity = 213/339 (62.83%), Positives = 260/339 (76.70%), Query Frame = 0

Query: 1074 RRVSFT-FRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVDEYDSFYK 1133
            R VS+    +VYVKS   PHSP +CL+WKARS AIL+VLKNL+ADF CLQEVDEYDSFY+
Sbjct: 93   RLVSYNILAQVYVKSALLPHSPPACLKWKARSHAILSVLKNLQADFFCLQEVDEYDSFYR 152

Query: 1134 GNLEKCGYSSLYIQRSGQ-KRDGCGIFFKHEKADLIIEDRIEYNDLVNSIQDDGCSCEDK 1193
             N++  GYS +YIQR+GQ KRDGC IF+K   A+L+ ++RIEYNDLV+SI+ D  SC   
Sbjct: 153  NNMDSLGYSGIYIQRTGQRKRDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSC--- 212

Query: 1194 SEDVVTSASNDVESNKGSSPKATVADRGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIV 1253
            SE  + +++   +S K         D  D NDP VRLKRDCVGIMAAF++ KPF H+VIV
Sbjct: 213  SEQKIETSNEGKDSRK---------DSRDLNDPLVRLKRDCVGIMAAFRINKPFQHIVIV 272

Query: 1254 ANTHLYWDPEWADVKIAQAKYLLSRLARFKTLVAEKFECTPSILLAGDFNSTPGDKVYQY 1313
            ANTHLYWDPE ADVK+AQAKYLLSRLA+FKTL++++FECTPS+LLAGDFNS PGD VY Y
Sbjct: 273  ANTHLYWDPELADVKLAQAKYLLSRLAQFKTLISDEFECTPSLLLAGDFNSIPGDMVYSY 332

Query: 1314 LVSGSSSSGFSPESLEELPLPLCSVYASILGSEPSFTNFTPGFTGTLDYIFLSPSDSMRP 1373
            LVSG++    + E  EE P+PL SVY  +   EP FTN TPGFT TLDYIF+SPSD ++P
Sbjct: 333  LVSGNAKPTETIEE-EEAPVPLSSVY-EVTRGEPKFTNCTPGFTNTLDYIFISPSDFIKP 392

Query: 1374 TSFLELPESEWPEVIGGLPNFNYPSDHLPIGAEFEITME 1411
             S L+LPE + P+V+G LPN ++PSDHLPIGAEFEI  E
Sbjct: 393  VSILQLPEPDSPDVVGFLPNHHHPSDHLPIGAEFEIRRE 417

BLAST of HG10013854 vs. ExPASy Swiss-Prot
Match: Q8RWY1 (Alkylated DNA repair protein ALKBH8 homolog OS=Arabidopsis thaliana OX=3702 GN=ALKBH8 PE=2 SV=2)

HSP 1 Score: 413.7 bits (1062), Expect = 8.4e-114
Identity = 220/339 (64.90%), Positives = 256/339 (75.52%), Query Frame = 0

Query: 751  PRFGRPKEDNGSSSS----SPNLYVANCGPAVGISHRTVAAVFGDFGLVKGVHAADETGA 810
            PRF RP + + SS S    S NLYVANCGPAVG++H  +AAVF +FG V GV+AAD++G 
Sbjct: 4    PRFVRPTQSSPSSISGEPNSSNLYVANCGPAVGLTHNAIAAVFAEFGEVNGVYAADDSGV 63

Query: 811  RVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIR-PSISHPNDSVSVSLSASEL 870
            RVIV F++  SA+AALEAL GRPC  L GR+LHIRYS+++ PS +  ND V VSL  SEL
Sbjct: 64   RVIVSFADPFSAKAALEALSGRPCPDLKGRSLHIRYSVLQLPSETQVNDCVPVSLIDSEL 123

Query: 871  DIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQLGEL 930
            +IPGLFLL DFV+  EE+ LL  VDAR W  LAKRRVQHYGYEFCY TRNV+TK +LGEL
Sbjct: 124  NIPGLFLLPDFVTVAEEQQLLAAVDARHWIGLAKRRVQHYGYEFCYGTRNVDTKKRLGEL 183

Query: 931  PSFVSHVVDRISMFPNVEN-VADASLDQLTYETEINTTLQVNEYPPGVGLSPHIDTHSAF 990
            PSFVS +++RI +FPN +N  A  +LDQLT          VNEYP GVGLSPHIDTHSAF
Sbjct: 184  PSFVSPILERIYLFPNFDNGSASLNLDQLT----------VNEYPSGVGLSPHIDTHSAF 243

Query: 991  EGLIFSLSLAGPCIMEFRRYPEGTWHKCPSSIDLKMGNSVNDSNYLRRAIYLPPRSMLLL 1050
            E  IFSLSLAGPCIMEFRRY   TW K  ++   K G    DS+ +++A+YLPPRSMLLL
Sbjct: 244  EDCIFSLSLAGPCIMEFRRYSVSTW-KASTTDAEKSG----DSSCIKKALYLPPRSMLLL 303

Query: 1051 SGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRKV 1084
            SGEARYAW+HYIPHHKID VKD  IRR  RRVSFT RKV
Sbjct: 304  SGEARYAWNHYIPHHKIDKVKDKVIRRSSRRVSFTLRKV 327

BLAST of HG10013854 vs. ExPASy Swiss-Prot
Match: Q9LZN0 (Rop guanine nucleotide exchange factor 7 OS=Arabidopsis thaliana OX=3702 GN=ROPGEF7 PE=1 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 7.4e-86
Identity = 198/503 (39.36%), Positives = 302/503 (60.04%), Query Frame = 0

Query: 214 ENESRSSRADECTTDSLEDHDSSSSSSKDASGS-FSSKWLAMHRDEQDLDEWEQPESPQH 273
           E + R S      T   E+ + S S ++D + S  SS+W   + D +             
Sbjct: 12  EEKGRESSCCSSETTRQEEEEQSPSCTEDFTASPVSSRWSVKNIDGE------------- 71

Query: 274 FYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALSLSNAITNLAASVFGEL 333
              K+K     +VS++E MKE+F+KLLLGED++G   G+ +AL++SNAITNL A++FG+L
Sbjct: 72  ---KKKIRSDSRVSEVEMMKERFSKLLLGEDMSGSGNGVCTALAISNAITNLCATLFGQL 131

Query: 334 WKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVMEIMTPKVRGDVHMNLP 393
           W+LEPLP E+K  WRREM+WLL  + ++VE+ PT Q    G  +EIMT + R D+++NLP
Sbjct: 132 WRLEPLPTEKKEMWRREMEWLLCVSDHIVEMTPTWQTFPDGTKLEIMTCRPRSDLYVNLP 191

Query: 394 ALQKLDSMLIGTLDSMVKTEFWYAEVG-----SRAEGKS----KSMGQSTRWWLPLPQVP 453
           AL+KLD+ML+  LDS  +TEFWY + G     S A+G S        Q  +WWLP+P+V 
Sbjct: 192 ALRKLDNMLLEILDSFEETEFWYVDQGIMAHESAADGSSSFRKSFQRQEDKWWLPVPRVS 251

Query: 454 STGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASGKASMSEELY 513
             GL EN RK+L +     +Q+ KAA +IN   L +M +P +  E++   G++ + + +Y
Sbjct: 252 PGGLQENSRKQLQHKRDCTNQILKAAMAINSITLADMEIPESYLESLPRKGRSCLGDLIY 311

Query: 514 KILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPVRTSWPFV 573
           + ++S+    E +L+ L+L SEH  +E  NR+E++I+   ++   +    +  +TSW  V
Sbjct: 312 RYISSDQFSPECLLDCLDLSSEHQAIEIANRVESSIYLWHKRTNSKPATNT--KTSWEMV 371

Query: 574 KDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILEAYSRVLG 633
           K+     DKL+L+ D+AE LL  LK ++P  PQT LD+SKIQY KD+G  ILE+YSRVL 
Sbjct: 372 KELMVDADKLELMADRAESLLLSLKQRFPGLPQTALDMSKIQYNKDIGKSILESYSRVLE 431

Query: 634 NLAYSILSRIGDVLQVDAMCNPNS-PAPTCCFPGMSLLNNRSDQMSALHSWQPLIGHSNS 693
           +LA++I++RI D+L VD +   +S   PT      +L NN +D   A  S    + +  +
Sbjct: 432 SLAFNIVARIDDLLFVDDLTRHSSDQIPT------TLGNNGND---APKSIAVPVSNYTT 486

Query: 694 PNMTLPSSKVSGNSPTATPSRNR 706
           P+ + PS +   +S T  PS +R
Sbjct: 492 PSYS-PSKQELRSSITVPPSPSR 486

BLAST of HG10013854 vs. ExPASy Swiss-Prot
Match: Q9LV40 (Rop guanine nucleotide exchange factor 8 OS=Arabidopsis thaliana OX=3702 GN=ROPGEF8 PE=1 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 1.3e-82
Identity = 173/426 (40.61%), Positives = 260/426 (61.03%), Query Frame = 0

Query: 284 QVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALSLSNAITNLAASVFGELWKLEPLPEERK 343
           Q +D+E MK++F KLLLGED++GG KG+SSAL+LSNAITNLAAS+FGE  KL+P+P++R+
Sbjct: 82  QQADMEMMKDRFAKLLLGEDMSGGGKGVSSALALSNAITNLAASIFGEQTKLQPMPQDRQ 141

Query: 344 SKWRREMDWLLSPTHYMVELVPTKQNGTGGRVMEIMTPKVRGDVHMNLPALQKLDSMLIG 403
           ++W++E+DWLLS T ++VE VP++Q    G   EIM  + RGD+ MN+PAL+KLD+MLI 
Sbjct: 142 ARWKKEIDWLLSVTDHIVEFVPSQQTSKDGVCTEIMVTRQRGDLLMNIPALRKLDAMLID 201

Query: 404 TLDSM-VKTEFWYAEVGSRAEGKSKSMGQSTRWWLPLPQVPSTGLSENERKKLLNHGRVV 463
           TLD+     EFWY    S    ++++   + +WWLP  +VP  GLSE  R+ L      V
Sbjct: 202 TLDNFRGHNEFWYVSRDSEEGQQARNDRTNDKWWLPPVKVPPGGLSEPSRRMLYFQKDSV 261

Query: 464 HQVFKAAKSINESILHEMPVPTAVREAVRASGKASMSEELYKILTSESGPAENMLNQLNL 523
            QV KAA +IN  +L EM +P +  +++  +G+AS+ + +YK +T E    E  L  L++
Sbjct: 262 TQVQKAAMAINAQVLSEMEIPESYIDSLPKNGRASLGDSIYKSITEEWFDPEQFLAMLDM 321

Query: 524 KSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPVRTSWPFVKDPTAGIDKLKLLTDQAEV 583
            +EH VL+  NR+EA++   K K      +    ++SW         ++K +L  ++AE 
Sbjct: 322 STEHKVLDLKNRIEASVVIWKRKL-----HTKDTKSSW----GSAVSLEKRELFEERAET 381

Query: 584 LLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAM 643
           +L LLK K+P  PQ+ LD+SKIQ+ KDVG  +LE+YSR+L +LAY+++SRI DVL  D +
Sbjct: 382 ILVLLKQKFPGLPQSSLDISKIQFNKDVGQAVLESYSRILESLAYTVMSRIEDVLYTDTL 441

Query: 644 CNPNSPAPTCCFPGMSLLNNRSDQMSALHSWQPLIGHSNSPNMTL------PSSKVSGNS 703
               +        G       S+   + +S +    H       L       +S   G+ 
Sbjct: 442 ALKQTLLAEETSDGGRTTETDSESAGSSNSGEEAEKHDPHSKTLLDFMGWNDNSSKGGDK 498

BLAST of HG10013854 vs. ExPASy TrEMBL
Match: D7TFE1 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_02s0087g00460 PE=3 SV=1)

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 586/903 (64.89%), Positives = 686/903 (75.97%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMH- 256
            IMT+NGLE+C+LN  ++ENES +SR D C TDSL++ D+S SSSKDA GSFSSKWL M+ 
Sbjct: 27   IMTYNGLENCILNGHSYENESHTSRGDGCATDSLDEDDTSCSSSKDAFGSFSSKWLTMNM 86

Query: 257  -RDEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSA 316
             +DE  LDEWE PESPQHFY+KEK  Y+++ SD+E MKE+F+KLLLGED+TGG+KGL+SA
Sbjct: 87   KKDEHGLDEWEVPESPQHFYVKEKPGYSVRFSDVEVMKERFSKLLLGEDITGGKKGLTSA 146

Query: 317  LSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGR 376
            L+LSNAITNLA SVFGELWKLEPL EERK KW+REMDWLLSPT+YMVELVP KQ+G  GR
Sbjct: 147  LALSNAITNLAVSVFGELWKLEPLSEERKVKWQREMDWLLSPTNYMVELVPAKQSGANGR 206

Query: 377  VMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTR 436
             +EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAE GSRAEG+++SM QS R
Sbjct: 207  TLEIMTPKARADIHMNLPALQKLDSMLIETLDSMVDTEFWYAEGGSRAEGRTRSMSQSKR 266

Query: 437  WWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASG 496
            WWLP PQVP+TGLS+ ERKKLL+  +VVHQVFKAA++INE++L EMPVPT +R+A+  SG
Sbjct: 267  WWLPSPQVPTTGLSDPERKKLLHQAKVVHQVFKAARAINENVLLEMPVPTLIRDALAKSG 326

Query: 497  KASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKS 556
            KA++ EELY++LT+ES  AE ML+ LNLKSEH  LEAINRLEAA+F+ KE+ TEQ   KS
Sbjct: 327  KANLGEELYRVLTAESSSAEEMLSSLNLKSEHSALEAINRLEAAVFAWKERITEQVSGKS 386

Query: 557  PVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLI 616
            PVRTSW F+KDPT  +DK++L+  +AE LLQ L+ +YPN PQ+FLDV+KIQY KD+GH I
Sbjct: 387  PVRTSWSFIKDPTTELDKMELILFRAEALLQQLRTRYPNLPQSFLDVAKIQYGKDIGHSI 446

Query: 617  LEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSP-APTCCFPGMSLLNNRSDQMSALHSW 676
            LEAYSRVLGNLA SIL R+ D+LQ D   NPNSP A T CFPG++L         +L   
Sbjct: 447  LEAYSRVLGNLASSILCRMRDILQEDVFSNPNSPIATTSCFPGINLTGMTETPTPSLRIR 506

Query: 677  QPLIGHSN-------------SPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLF 736
              LI   N             SP+     S    +S  ATPSR+R +    + K T  L 
Sbjct: 507  HSLIDQMNMVDGRFRDPNAGASPDCEASYSDSRRSSVMATPSRSRKY----KKKTTTHLK 566

Query: 737  PAIELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGI 796
               E                   L   M LPRF RPK  +G    SPNLYVANCGPAVG+
Sbjct: 567  SRFE-----------------QGLNYIMGLPRFSRPKGVDG--ELSPNLYVANCGPAVGL 626

Query: 797  SHRTVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHI 856
            S  T+A+VF  FG VKGV+ AD++GARVIV + EES+A+AAL+AL G PC  LGGR LHI
Sbjct: 627  SFDTIASVFSTFGEVKGVYPADDSGARVIVSYFEESAAQAALKALDGHPCPALGGRFLHI 686

Query: 857  RYSIIRPSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKR 916
            RYSI +P  S  NDSV VSL  SEL+IPG++LLHDFVSAKEEE+LL  VD   W +L+KR
Sbjct: 687  RYSIFQPP-SQVNDSVPVSLVDSELNIPGIYLLHDFVSAKEEEELLAAVDKMSWKSLSKR 746

Query: 917  RVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEIN 976
            RVQHYGYEFCY+TRNVNTK  LG+LPSFVS +V+RIS FPN+E+ AD  LDQLT      
Sbjct: 747  RVQHYGYEFCYETRNVNTKQYLGKLPSFVSAIVERISSFPNLESAADIVLDQLT------ 806

Query: 977  TTLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSSIDLKM 1036
                VNEYPPGVGLSPHIDTHSAFEG IFSLSLAGPCIM+FRRY EG W K  SS D+ +
Sbjct: 807  ----VNEYPPGVGLSPHIDTHSAFEGFIFSLSLAGPCIMDFRRYTEGVWPKSASSSDMSV 866

Query: 1037 GNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTF 1084
                  S++LRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDS IRRGPRRVSFTF
Sbjct: 867  EYPDKSSSFLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTF 895

BLAST of HG10013854 vs. ExPASy TrEMBL
Match: A5B3V2 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_021238 PE=3 SV=1)

HSP 1 Score: 1084.7 bits (2804), Expect = 0.0e+00
Identity = 576/952 (60.50%), Positives = 683/952 (71.74%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMH- 256
            IMT+NGLE+C+LN  ++ENES +SR D C TDSL++ DSS SSSKDA GSFSSKWL M+ 
Sbjct: 108  IMTYNGLENCILNGHSYENESHTSRGDGCATDSLDEDDSSCSSSKDAFGSFSSKWLTMNM 167

Query: 257  -RDEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSA 316
             +DE  LDEWE PESPQHFY+KEK  Y+ + SD+E MKE+F+KLLLGED TGG+KGL+SA
Sbjct: 168  KKDEHGLDEWEVPESPQHFYVKEKPGYSFRFSDVEVMKERFSKLLLGEDXTGGKKGLTSA 227

Query: 317  LSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGR 376
            L+LSNAITNLA SVFGELWKLEPL EERK KW+REMDWLLSPT YMVELVP KQ+G  GR
Sbjct: 228  LALSNAITNLAVSVFGELWKLEPLSEERKVKWQREMDWLLSPTXYMVELVPAKQSGANGR 287

Query: 377  VMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTR 436
             +EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAE GSRAEG+++SM QS R
Sbjct: 288  TLEIMTPKARADIHMNLPALQKLDSMLIETLDSMVDTEFWYAEGGSRAEGRTRSMSQSKR 347

Query: 437  WWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVR--- 496
            WWLP PQVP+TGLS+ ERKKLL+  +VVHQVFKAA++INE++L EMPVPT +R+A+    
Sbjct: 348  WWLPSPQVPTTGLSDPERKKLLHQAKVVHQVFKAARAINENVLLEMPVPTLIRDALAKAS 407

Query: 497  -----------------ASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINR 556
                              SGKA++ EELY++LT+ES   E ML+ LNLKSEH  LEAINR
Sbjct: 408  KLFDLFPSNQSSCLKTLESGKANLGEELYRVLTAESSSTEEMLSSLNLKSEHSALEAINR 467

Query: 557  LEAAIFSLKEKYTEQSGNKSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNH 616
            LEAA+F+ KE+ TEQ   KSPVRTSW F+KDPT  +DK++L+  +AE LLQ L+ +YPN 
Sbjct: 468  LEAAVFAWKERITEQVSGKSPVRTSWSFIKDPTTELDKMELILFRAEALLQQLRTRYPNL 527

Query: 617  PQTFLDVSKIQYEKDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSP-APTCC 676
            PQ+FLDV+KIQY KD+GH ILEAYSRVLGNLA SIL R+ D+LQ D   NPNSP A T C
Sbjct: 528  PQSFLDVAKIQYGKDIGHSILEAYSRVLGNLASSILCRMRDILQEDVFSNPNSPIATTSC 587

Query: 677  FPGMSLLNNRSDQMSALHSWQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREI 736
            FPG++L         +L      I HS    M +   +    +  A+P    ++   R  
Sbjct: 588  FPGINLTGMTETPTPSLR-----IRHSLIDQMNMVDGRFRDPNAGASPDCEASYSDSRRS 647

Query: 737  KFTRQLFPAIELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVAN 796
                                 +  PS +  L   M LPRF RPK  +G    SPNLYVAN
Sbjct: 648  S-------------------VMATPSRSRGLNYIMGLPRFSRPKGVDG--ELSPNLYVAN 707

Query: 797  CGPAVGISHRTVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALL 856
            CGPAVG+S  T+A+VF  FG VKGV+ AD++GARVIV + EES+A+AAL+AL G PC  L
Sbjct: 708  CGPAVGLSFDTIASVFSTFGEVKGVYPADDSGARVIVSYFEESAAQAALKALDGHPCPAL 767

Query: 857  GGRTLHIRYSIIRPSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARP 916
            GGR LHIRYSI +P   +           SEL+IPG++LLHDFVSAKEEE+LL  VD   
Sbjct: 768  GGRFLHIRYSIFQPPSQY-------LWWISELNIPGIYLLHDFVSAKEEEELLAAVDKMS 827

Query: 917  WNNLAKRRVQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQL 976
            W +L+KRRVQHYGYEFCY+TRNVNTK  LG+LPSFVS +V+RIS FPN+E+ AD  LDQL
Sbjct: 828  WKSLSKRRVQHYGYEFCYETRNVNTKQYLGKLPSFVSAIVERISSFPNLESAADIVLDQL 887

Query: 977  TYETEINTTLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCP 1036
            T          VNEYPPGVGLSPHIDTHSAFEG IFSLSLAGPCIM+FRRY EG W K  
Sbjct: 888  T----------VNEYPPGVGLSPHIDTHSAFEGFIFSLSLAGPCIMDFRRYTEGVWPKSA 947

Query: 1037 SSIDLKMGNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGP 1096
            SS D+ +      S++LRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDS IRRGP
Sbjct: 948  SSSDMSVEYPDKSSSFLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGP 1007

Query: 1097 RRVSFTFRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVD 1126
            RRVSFTFRK  ++     H   +          +L V +   A+  CL  +D
Sbjct: 1008 RRVSFTFRKSDIRQMHTHHILFTMCE-------LLPVFRKHRAESFCLTRLD 1009

BLAST of HG10013854 vs. ExPASy TrEMBL
Match: V4SZ85 (Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10004016mg PE=3 SV=1)

HSP 1 Score: 1057.7 bits (2734), Expect = 4.1e-305
Identity = 549/895 (61.34%), Positives = 670/895 (74.86%), Query Frame = 0

Query: 198  MTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHRD 257
            MT++GLESC+LNNQ+++NESR+SR D C TDSL+D DSS SSSKDA GSFSSKW  M R 
Sbjct: 1    MTYDGLESCILNNQSYDNESRTSRGDGCLTDSLDDDDSSCSSSKDAFGSFSSKWSTMKRV 60

Query: 258  EQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALSL 317
            E   ++W   ESP H+++K+K  Y  +  D+E MKE+F+KLLLGED+TGG+ G+++AL+L
Sbjct: 61   EHGSEDWGPSESPNHYHVKQKPAYAFRFLDVETMKERFSKLLLGEDITGGRLGVTTALAL 120

Query: 318  SNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVME 377
            SNAITNLAASVFGELWKLEPL EERKS+W+REMDWLLSPT++MVELVP +QN + G+ +E
Sbjct: 121  SNAITNLAASVFGELWKLEPLLEERKSRWQREMDWLLSPTNFMVELVPARQNASNGQTLE 180

Query: 378  IMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTRWWL 437
            IMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAEVGSRAEG++KS  +S RWWL
Sbjct: 181  IMTPKARADIHMNLPALQKLDSMLIETLDSMVNTEFWYAEVGSRAEGRNKSTRESKRWWL 240

Query: 438  PLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASGKAS 497
            PL QVP++GLS++ RKK+L+  RVV+QVFKAAKSINE++L EMPVPT +++ +  SGK S
Sbjct: 241  PLAQVPASGLSDSGRKKMLSQCRVVYQVFKAAKSINENVLLEMPVPTIIKDVLPKSGKTS 300

Query: 498  MSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPVR 557
            + EELYK+LT+ES  +  M+N LNLKSEH  LEAIN+LEAA+F+ KE+ +EQ+  KSPVR
Sbjct: 301  LGEELYKVLTAESSSSGEMINFLNLKSEHSALEAINKLEAAVFTWKERISEQASGKSPVR 360

Query: 558  TSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILEA 617
            TSW F+KDP + +DK++ L ++AE L+Q LKN+YPN PQTFLD +KIQY KDVGH ILEA
Sbjct: 361  TSWSFIKDPISELDKIEFLLERAEALIQQLKNRYPNLPQTFLDATKIQYGKDVGHSILEA 420

Query: 618  YSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQMS----ALHSW 677
            YSRVL NLA+SILSRIGD+LQ DA+ NPNSP   CC PG  + ++  DQ+      L S 
Sbjct: 421  YSRVLANLAFSILSRIGDILQEDALSNPNSPVAKCCTPGSKMNDDNMDQIQMPWLRLRSR 480

Query: 678  QPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAIELVGVLIEFI 737
              LI   N  +    +S  S  S + T +         E K +                 
Sbjct: 481  HSLIDQMNKADGKYFTSDASSCSTSETSN--------SEAKSS----------------- 540

Query: 738  FILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHRTVAAVFGDFG 797
                 S N +  +     RF RPK   G    SPNL+VANCGPAVG+S+  + +VF  FG
Sbjct: 541  -----SVNSTPSVNRVCSRFRRPKA--GEDERSPNLFVANCGPAVGVSYEAIGSVFSAFG 600

Query: 798  LVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIR--PSISH 857
             VKG++AAD++GARVIV + +E SA+AA  +LH RPC  L  R LHIRYS++   P+  H
Sbjct: 601  DVKGIYAADDSGARVIVSYFDEGSAQAAFNSLHSRPCPDLANRFLHIRYSVLEDSPATRH 660

Query: 858  PNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFCY 917
               SV VSL ASEL+IPGLFL HDFVSAKEEE+LL  VD+RPWNNL+KRRVQHYGYEFCY
Sbjct: 661  ITSSVPVSLVASELNIPGLFLFHDFVSAKEEEELLAAVDSRPWNNLSKRRVQHYGYEFCY 720

Query: 918  QTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINTTLQVNEYPPG 977
              RNVNTK  LGELPSFVS +V+R+S FPN+++    +LDQLT          VNEYPPG
Sbjct: 721  DIRNVNTKQCLGELPSFVSSIVERVSSFPNLDDSTSVALDQLT----------VNEYPPG 780

Query: 978  VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSS---IDLKMGNSVNDSN 1037
            VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EG+W   P+S   +++ + N  + S+
Sbjct: 781  VGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYLEGSW--LPNSTPGMNMAVENPDDYSS 840

Query: 1038 YLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRKV 1084
             LRRAIYLPPRSMLLLSGEARYAW+HYIPHHKIDMV D+ IRR  RRVSFTFRKV
Sbjct: 841  VLRRAIYLPPRSMLLLSGEARYAWNHYIPHHKIDMVNDTVIRRASRRVSFTFRKV 851

BLAST of HG10013854 vs. ExPASy TrEMBL
Match: A0A2H5PME3 (Uncharacterized protein (Fragment) OS=Citrus unshiu OX=55188 GN=CUMW_149840 PE=3 SV=1)

HSP 1 Score: 1045.0 bits (2701), Expect = 2.8e-301
Identity = 545/895 (60.89%), Positives = 664/895 (74.19%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHR 256
            IMT++GLESC+LNNQ+++NESR+SR D C TDSL+D DSS SSSKDA GSFSSKW  M R
Sbjct: 87   IMTYDGLESCILNNQSYDNESRTSRGDGCLTDSLDDDDSSCSSSKDAFGSFSSKWSTMKR 146

Query: 257  DEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALS 316
             E   ++W   ESP H+++K+K  Y  +  D+E MKE+F+KLLLGED+TGG+ G+++AL+
Sbjct: 147  VEHGSEDWGPSESPNHYHLKQKPAYAFRFLDVETMKERFSKLLLGEDITGGRLGVTTALA 206

Query: 317  LSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVM 376
            LSNAITNLAASVFGELWKLEPL EERKS+W+REMDWLLSPT++MVELVP +QN   G+ +
Sbjct: 207  LSNAITNLAASVFGELWKLEPLLEERKSRWQREMDWLLSPTNFMVELVPARQNAANGQTL 266

Query: 377  EIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVGSRAEGKSKSMGQSTRWW 436
            EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWYAEVGSRAEG++KS  +S RWW
Sbjct: 267  EIMTPKARADIHMNLPALQKLDSMLIETLDSMVNTEFWYAEVGSRAEGRNKSTRESKRWW 326

Query: 437  LPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVREAVRASGKA 496
            LPL QVP++GLS++ RKKLL+  RVV+QVFKAAKSINE++L EMPVPT +++A+  SGKA
Sbjct: 327  LPLAQVPASGLSDSGRKKLLSQCRVVYQVFKAAKSINENVLLEMPVPTIIKDALPKSGKA 386

Query: 497  SMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYTEQSGNKSPV 556
            S+ EELYK+LT+ES  +  M+N LNLKSEH  LEAIN+LEAA+F+ KE+ +EQ+  KSPV
Sbjct: 387  SLGEELYKVLTAESISSGEMINFLNLKSEHSALEAINKLEAAVFTWKERISEQASGKSPV 446

Query: 557  RTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYEKDVGHLILE 616
            RTSW F+KDP + +DK++ L ++AE L+Q LKN+YPN PQTFLD +KIQ+ KDVGH ILE
Sbjct: 447  RTSWSFIKDPISELDKIEFLLERAEALIQQLKNRYPNLPQTFLDATKIQFGKDVGHSILE 506

Query: 617  AYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQMS----ALHS 676
            AYSRVL NLA+SILSRIGD+LQ DA+ NPNSP   CC PG  + ++  DQ+      L S
Sbjct: 507  AYSRVLANLAFSILSRIGDILQEDALSNPNSPVAKCCTPGSKMNDDNMDQIQMPWLRLRS 566

Query: 677  WQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAIELVGVLIEF 736
               LI   N  +    +S  S  S + T +          +  T                
Sbjct: 567  RHSLIDQMNKADGKYFTSDASSCSTSETSNSEAK---SSSVNST---------------- 626

Query: 737  IFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHRTVAAVFGDF 796
                 PS N +                 G    SPNL+VANCGPAVG+S+  + +VF  F
Sbjct: 627  -----PSVNRA-----------------GEDERSPNLFVANCGPAVGVSYEAIGSVFSAF 686

Query: 797  GLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIR--PSIS 856
            G VKG++AAD++GARVIV + +E SA+AA  +LH RPC  L  R LHI YS++   P+  
Sbjct: 687  GDVKGIYAADDSGARVIVSYFDEGSAQAAFNSLHSRPCPDLANRFLHISYSVLEDSPAPR 746

Query: 857  HPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFC 916
            H   SV VSL ASEL+IPGL LLHDFVSAKEEE+LL  VD+RPWNNL+KRRVQHYGYEFC
Sbjct: 747  HITSSVPVSLVASELNIPGLHLLHDFVSAKEEEELLAAVDSRPWNNLSKRRVQHYGYEFC 806

Query: 917  YQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINTTLQVNEYPP 976
            Y  RNVNTK  LGELPSFVS +V+R+S FPN+++    +LDQLT          VNEYPP
Sbjct: 807  YDIRNVNTKQCLGELPSFVSSIVERVSSFPNLDDSTSVALDQLT----------VNEYPP 866

Query: 977  GVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSS---IDLKMGNSVNDS 1036
            GVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EG+W   P+S   +++ + N  + S
Sbjct: 867  GVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYLEGSW--LPNSTPGMNMAVENPDDYS 926

Query: 1037 NYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRK 1083
            + LRRAIYLPPRSMLLLSGEARYAW+HYIPHHKIDMV D+ IRR  RRVSFTFRK
Sbjct: 927  SVLRRAIYLPPRSMLLLSGEARYAWNHYIPHHKIDMVNDTGIRRASRRVSFTFRK 928

BLAST of HG10013854 vs. ExPASy TrEMBL
Match: A0A2H5PN53 (Uncharacterized protein (Fragment) OS=Citrus unshiu OX=55188 GN=CUMW_149840 PE=3 SV=1)

HSP 1 Score: 1037.3 bits (2681), Expect = 5.7e-299
Identity = 545/904 (60.29%), Positives = 664/904 (73.45%), Query Frame = 0

Query: 197  IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHDSSSSSSKDASGSFSSKWLAMHR 256
            IMT++GLESC+LNNQ+++NESR+SR D C TDSL+D DSS SSSKDA GSFSSKW  M R
Sbjct: 87   IMTYDGLESCILNNQSYDNESRTSRGDGCLTDSLDDDDSSCSSSKDAFGSFSSKWSTMKR 146

Query: 257  DEQDLDEWEQPESPQHFYMKEKHDYTLQVSDIEAMKEKFTKLLLGEDVTGGQKGLSSALS 316
             E   ++W   ESP H+++K+K  Y  +  D+E MKE+F+KLLLGED+TGG+ G+++AL+
Sbjct: 147  VEHGSEDWGPSESPNHYHLKQKPAYAFRFLDVETMKERFSKLLLGEDITGGRLGVTTALA 206

Query: 317  LSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHYMVELVPTKQNGTGGRVM 376
            LSNAITNLAASVFGELWKLEPL EERKS+W+REMDWLLSPT++MVELVP +QN   G+ +
Sbjct: 207  LSNAITNLAASVFGELWKLEPLLEERKSRWQREMDWLLSPTNFMVELVPARQNAANGQTL 266

Query: 377  EIMTPKVRGDVHMNLPALQKLDSMLI---------GTLDSMVKTEFWYAEVGSRAEGKSK 436
            EIMTPK R D+HMNLPALQKLDSMLI          TLDSMV TEFWYAEVGSRAEG++K
Sbjct: 267  EIMTPKARADIHMNLPALQKLDSMLIVGGLFFCLEETLDSMVNTEFWYAEVGSRAEGRNK 326

Query: 437  SMGQSTRWWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHEMPVPTAVR 496
            S  +S RWWLPL QVP++GLS++ RKKLL+  RVV+QVFKAAKSINE++L EMPVPT ++
Sbjct: 327  STRESKRWWLPLAQVPASGLSDSGRKKLLSQCRVVYQVFKAAKSINENVLLEMPVPTIIK 386

Query: 497  EAVRASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAIFSLKEKYT 556
            +A+  SGKAS+ EELYK+LT+ES  +  M+N LNLKSEH  LEAIN+LEAA+F+ KE+ +
Sbjct: 387  DALPKSGKASLGEELYKVLTAESISSGEMINFLNLKSEHSALEAINKLEAAVFTWKERIS 446

Query: 557  EQSGNKSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTFLDVSKIQYE 616
            EQ+  KSPVRTSW F+KDP + +DK++ L ++AE L+Q LKN+YPN PQTFLD +KIQ+ 
Sbjct: 447  EQASGKSPVRTSWSFIKDPISELDKIEFLLERAEALIQQLKNRYPNLPQTFLDATKIQFG 506

Query: 617  KDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMSLLNNRSDQM 676
            KDVGH ILEAYSRVL NLA+SILSRIGD+LQ DA+ NPNSP   CC PG  + ++  DQ+
Sbjct: 507  KDVGHSILEAYSRVLANLAFSILSRIGDILQEDALSNPNSPVAKCCTPGSKMNDDNMDQI 566

Query: 677  S----ALHSWQPLIGHSNSPNMTLPSSKVSGNSPTATPSRNRAWCIGREIKFTRQLFPAI 736
                  L S   LI   N  +    +S  S  S + T +          +  T       
Sbjct: 567  QMPWLRLRSRHSLIDQMNKADGKYFTSDASSCSTSETSNSEAK---SSSVNST------- 626

Query: 737  ELVGVLIEFIFILFPSPNPSLPLQMDLPRFGRPKEDNGSSSSSPNLYVANCGPAVGISHR 796
                          PS N +                 G    SPNL+VANCGPAVG+S+ 
Sbjct: 627  --------------PSVNRA-----------------GEDERSPNLFVANCGPAVGVSYE 686

Query: 797  TVAAVFGDFGLVKGVHAADETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYS 856
             + +VF  FG VKG++AAD++GARVIV + +E SA+AA  +LH RPC  L  R LHI YS
Sbjct: 687  AIGSVFSAFGDVKGIYAADDSGARVIVSYFDEGSAQAAFNSLHSRPCPDLANRFLHISYS 746

Query: 857  IIR--PSISHPNDSVSVSLSASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRR 916
            ++   P+  H   SV VSL ASEL+IPGL LLHDFVSAKEEE+LL  VD+RPWNNL+KRR
Sbjct: 747  VLEDSPAPRHITSSVPVSLVASELNIPGLHLLHDFVSAKEEEELLAAVDSRPWNNLSKRR 806

Query: 917  VQHYGYEFCYQTRNVNTKHQLGELPSFVSHVVDRISMFPNVENVADASLDQLTYETEINT 976
            VQHYGYEFCY  RNVNTK  LGELPSFVS +V+R+S FPN+++    +LDQLT       
Sbjct: 807  VQHYGYEFCYDIRNVNTKQCLGELPSFVSSIVERVSSFPNLDDSTSVALDQLT------- 866

Query: 977  TLQVNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSS---IDL 1036
               VNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRY EG+W   P+S   +++
Sbjct: 867  ---VNEYPPGVGLSPHIDTHSAFEGLIFSLSLAGPCIMEFRRYLEGSW--LPNSTPGMNM 926

Query: 1037 KMGNSVNDSNYLRRAIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSF 1083
             + N  + S+ LRRAIYLPPRSMLLLSGEARYAW+HYIPHHKIDMV D+ IRR  RRVSF
Sbjct: 927  AVENPDDYSSVLRRAIYLPPRSMLLLSGEARYAWNHYIPHHKIDMVNDTGIRRASRRVSF 937

BLAST of HG10013854 vs. TAIR 10
Match: AT1G31650.1 (RHO guanyl-nucleotide exchange factor 14)

HSP 1 Score: 563.9 bits (1452), Expect = 3.6e-160
Identity = 306/545 (56.15%), Positives = 385/545 (70.64%), Query Frame = 0

Query: 197 IMTFNGLESCVLNNQTFENESRSSRADECTTDSLEDHD-SSSSSSKDASGSFSSKWLAMH 256
           ++T+ GLE+C++NNQ++E ES +SR D C TDSL+D   SS SSSKDAS SFSSKWL M 
Sbjct: 29  MITYYGLETCIINNQSYEEESGTSRGDGCLTDSLDDDAFSSCSSSKDASSSFSSKWLPMK 88

Query: 257 RDE------------QDLDEWEQPE----SPQHFYMKEKHDYTLQVSDIEAMKEKFTKLL 316
            DE            Q  D  E+ +    S QHF  KEK  Y     D+EAMKEKF+KLL
Sbjct: 89  NDEHSCDGLNLSGRSQHFDAKEKKKQGYGSSQHFDAKEKPGYVYCHLDVEAMKEKFSKLL 148

Query: 317 LGEDVTGGQKGLSSALSLSNAITNLAASVFGELWKLEPLPEERKSKWRREMDWLLSPTHY 376
           LGEDVTGG KG+  AL+LSNA+T+LA S+FGELWKLEPL EE+K KWRREMDWLLSPT+Y
Sbjct: 149 LGEDVTGGCKGVQVALALSNAVTHLATSIFGELWKLEPLCEEKKQKWRREMDWLLSPTNY 208

Query: 377 MVELVPTKQNGTGGRVMEIMTPKVRGDVHMNLPALQKLDSMLIGTLDSMVKTEFWYAEVG 436
           M+ELVP+KQN   GR +EIMTPK R D+HMNLPALQKLDSMLI TLDSMV TEFWY+E+G
Sbjct: 209 MIELVPSKQNDANGRSLEIMTPKARADIHMNLPALQKLDSMLIETLDSMVNTEFWYSEIG 268

Query: 437 SRAEGKSKSMGQSTRWWLPLPQVPSTGLSENERKKLLNHGRVVHQVFKAAKSINESILHE 496
           SRAEGK+KS  +S RWWLP PQVP  GLS + RKKLL+ G+VV+QVFKA K+INE+IL E
Sbjct: 269 SRAEGKNKSTSESKRWWLPSPQVPKPGLSNSGRKKLLDKGKVVYQVFKATKAINENILLE 328

Query: 497 MPVPTAVREAVRASGKASMSEELYKILTSESGPAENMLNQLNLKSEHDVLEAINRLEAAI 556
           MPVP  ++EA+  SGK S+ +ELYK+L  ES   + +   LNL +EH  LE +N+LE+A+
Sbjct: 329 MPVPIVIKEAIPKSGKNSLGDELYKMLAVESATVDEIFISLNLGTEHAALETVNKLESAM 388

Query: 557 FSLKEKYTEQSGN-KSPVRTSWPFVKDPTAGIDKLKLLTDQAEVLLQLLKNKYPNHPQTF 616
           F+ KE+ TEQ  N KSPVR SW F KDP + I + + L ++AE L   +K+K+PN P +F
Sbjct: 389 FAWKERITEQGSNGKSPVRASWSFAKDPLSEIGRNESLLNRAEALRTQIKSKHPNLPHSF 448

Query: 617 LDVSKIQYEKDVGHLILEAYSRVLGNLAYSILSRIGDVLQVDAMCNPNSPAPTCCFPGMS 676
           LD +KIQY+KD+GH +LEAYSR L NLA+ ILSR+G++L+ D++ NPNSPAP  CFP  S
Sbjct: 449 LDATKIQYDKDIGHAVLEAYSRTLANLAFRILSRMGEILKEDSLSNPNSPAPPSCFPS-S 508

Query: 677 LLNNRSDQMSALHSWQPLIGHSNSPNMTLPSSKVSG-----------NSPTATPSR-NRA 712
               R+ +   L S    + HS + +M       +G           NS   TPSR +R 
Sbjct: 509 RDPYRTPERPLLSS---RVRHSLTDDMNKADGTETGLDFLFADAKASNSVNTTPSRSSRL 568

BLAST of HG10013854 vs. TAIR 10
Match: AT1G31500.1 (DNAse I-like superfamily protein)

HSP 1 Score: 421.0 bits (1081), Expect = 3.7e-117
Identity = 213/339 (62.83%), Positives = 260/339 (76.70%), Query Frame = 0

Query: 1074 RRVSFT-FRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVDEYDSFYK 1133
            R VS+    +VYVKS   PHSP +CL+WKARS AIL+VLKNL+ADF CLQEVDEYDSFY+
Sbjct: 64   RLVSYNILAQVYVKSALLPHSPPACLKWKARSHAILSVLKNLQADFFCLQEVDEYDSFYR 123

Query: 1134 GNLEKCGYSSLYIQRSGQ-KRDGCGIFFKHEKADLIIEDRIEYNDLVNSIQDDGCSCEDK 1193
             N++  GYS +YIQR+GQ KRDGC IF+K   A+L+ ++RIEYNDLV+SI+ D  SC   
Sbjct: 124  NNMDSLGYSGIYIQRTGQRKRDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSC--- 183

Query: 1194 SEDVVTSASNDVESNKGSSPKATVADRGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIV 1253
            SE  + +++   +S K         D  D NDP VRLKRDCVGIMAAF++ KPF H+VIV
Sbjct: 184  SEQKIETSNEGKDSRK---------DSRDLNDPLVRLKRDCVGIMAAFRINKPFQHIVIV 243

Query: 1254 ANTHLYWDPEWADVKIAQAKYLLSRLARFKTLVAEKFECTPSILLAGDFNSTPGDKVYQY 1313
            ANTHLYWDPE ADVK+AQAKYLLSRLA+FKTL++++FECTPS+LLAGDFNS PGD VY Y
Sbjct: 244  ANTHLYWDPELADVKLAQAKYLLSRLAQFKTLISDEFECTPSLLLAGDFNSIPGDMVYSY 303

Query: 1314 LVSGSSSSGFSPESLEELPLPLCSVYASILGSEPSFTNFTPGFTGTLDYIFLSPSDSMRP 1373
            LVSG++    + E  EE P+PL SVY  +   EP FTN TPGFT TLDYIF+SPSD ++P
Sbjct: 304  LVSGNAKPTETIEE-EEAPVPLSSVY-EVTRGEPKFTNCTPGFTNTLDYIFISPSDFIKP 363

Query: 1374 TSFLELPESEWPEVIGGLPNFNYPSDHLPIGAEFEITME 1411
             S L+LPE + P+V+G LPN ++PSDHLPIGAEFEI  E
Sbjct: 364  VSILQLPEPDSPDVVGFLPNHHHPSDHLPIGAEFEIRRE 388

BLAST of HG10013854 vs. TAIR 10
Match: AT1G31500.2 (DNAse I-like superfamily protein)

HSP 1 Score: 421.0 bits (1081), Expect = 3.7e-117
Identity = 213/339 (62.83%), Positives = 260/339 (76.70%), Query Frame = 0

Query: 1074 RRVSFT-FRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVDEYDSFYK 1133
            R VS+    +VYVKS   PHSP +CL+WKARS AIL+VLKNL+ADF CLQEVDEYDSFY+
Sbjct: 34   RLVSYNILAQVYVKSALLPHSPPACLKWKARSHAILSVLKNLQADFFCLQEVDEYDSFYR 93

Query: 1134 GNLEKCGYSSLYIQRSGQ-KRDGCGIFFKHEKADLIIEDRIEYNDLVNSIQDDGCSCEDK 1193
             N++  GYS +YIQR+GQ KRDGC IF+K   A+L+ ++RIEYNDLV+SI+ D  SC   
Sbjct: 94   NNMDSLGYSGIYIQRTGQRKRDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSC--- 153

Query: 1194 SEDVVTSASNDVESNKGSSPKATVADRGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIV 1253
            SE  + +++   +S K         D  D NDP VRLKRDCVGIMAAF++ KPF H+VIV
Sbjct: 154  SEQKIETSNEGKDSRK---------DSRDLNDPLVRLKRDCVGIMAAFRINKPFQHIVIV 213

Query: 1254 ANTHLYWDPEWADVKIAQAKYLLSRLARFKTLVAEKFECTPSILLAGDFNSTPGDKVYQY 1313
            ANTHLYWDPE ADVK+AQAKYLLSRLA+FKTL++++FECTPS+LLAGDFNS PGD VY Y
Sbjct: 214  ANTHLYWDPELADVKLAQAKYLLSRLAQFKTLISDEFECTPSLLLAGDFNSIPGDMVYSY 273

Query: 1314 LVSGSSSSGFSPESLEELPLPLCSVYASILGSEPSFTNFTPGFTGTLDYIFLSPSDSMRP 1373
            LVSG++    + E  EE P+PL SVY  +   EP FTN TPGFT TLDYIF+SPSD ++P
Sbjct: 274  LVSGNAKPTETIEE-EEAPVPLSSVY-EVTRGEPKFTNCTPGFTNTLDYIFISPSDFIKP 333

Query: 1374 TSFLELPESEWPEVIGGLPNFNYPSDHLPIGAEFEITME 1411
             S L+LPE + P+V+G LPN ++PSDHLPIGAEFEI  E
Sbjct: 334  VSILQLPEPDSPDVVGFLPNHHHPSDHLPIGAEFEIRRE 358

BLAST of HG10013854 vs. TAIR 10
Match: AT1G31500.4 (DNAse I-like superfamily protein)

HSP 1 Score: 421.0 bits (1081), Expect = 3.7e-117
Identity = 213/339 (62.83%), Positives = 260/339 (76.70%), Query Frame = 0

Query: 1074 RRVSFT-FRKVYVKSRFFPHSPSSCLRWKARSQAILAVLKNLEADFLCLQEVDEYDSFYK 1133
            R VS+    +VYVKS   PHSP +CL+WKARS AIL+VLKNL+ADF CLQEVDEYDSFY+
Sbjct: 93   RLVSYNILAQVYVKSALLPHSPPACLKWKARSHAILSVLKNLQADFFCLQEVDEYDSFYR 152

Query: 1134 GNLEKCGYSSLYIQRSGQ-KRDGCGIFFKHEKADLIIEDRIEYNDLVNSIQDDGCSCEDK 1193
             N++  GYS +YIQR+GQ KRDGC IF+K   A+L+ ++RIEYNDLV+SI+ D  SC   
Sbjct: 153  NNMDSLGYSGIYIQRTGQRKRDGCAIFYKPSCAELVTKERIEYNDLVDSIKADSVSC--- 212

Query: 1194 SEDVVTSASNDVESNKGSSPKATVADRGDPNDPRVRLKRDCVGIMAAFKLKKPFHHVVIV 1253
            SE  + +++   +S K         D  D NDP VRLKRDCVGIMAAF++ KPF H+VIV
Sbjct: 213  SEQKIETSNEGKDSRK---------DSRDLNDPLVRLKRDCVGIMAAFRINKPFQHIVIV 272

Query: 1254 ANTHLYWDPEWADVKIAQAKYLLSRLARFKTLVAEKFECTPSILLAGDFNSTPGDKVYQY 1313
            ANTHLYWDPE ADVK+AQAKYLLSRLA+FKTL++++FECTPS+LLAGDFNS PGD VY Y
Sbjct: 273  ANTHLYWDPELADVKLAQAKYLLSRLAQFKTLISDEFECTPSLLLAGDFNSIPGDMVYSY 332

Query: 1314 LVSGSSSSGFSPESLEELPLPLCSVYASILGSEPSFTNFTPGFTGTLDYIFLSPSDSMRP 1373
            LVSG++    + E  EE P+PL SVY  +   EP FTN TPGFT TLDYIF+SPSD ++P
Sbjct: 333  LVSGNAKPTETIEE-EEAPVPLSSVY-EVTRGEPKFTNCTPGFTNTLDYIFISPSDFIKP 392

Query: 1374 TSFLELPESEWPEVIGGLPNFNYPSDHLPIGAEFEITME 1411
             S L+LPE + P+V+G LPN ++PSDHLPIGAEFEI  E
Sbjct: 393  VSILQLPEPDSPDVVGFLPNHHHPSDHLPIGAEFEIRRE 417

BLAST of HG10013854 vs. TAIR 10
Match: AT1G31600.1 (RNA-binding (RRM/RBD/RNP motifs) family protein)

HSP 1 Score: 414.5 bits (1064), Expect = 3.5e-115
Identity = 221/343 (64.43%), Positives = 258/343 (75.22%), Query Frame = 0

Query: 747  QMDLPRFGRPKEDNGSSSS----SPNLYVANCGPAVGISHRTVAAVFGDFGLVKGVHAAD 806
            +M  PRF RP + + SS S    S NLYVANCGPAVG++H  +AAVF +FG V GV+AAD
Sbjct: 85   RMVQPRFVRPTQSSPSSISGEPNSSNLYVANCGPAVGLTHNAIAAVFAEFGEVNGVYAAD 144

Query: 807  ETGARVIVCFSEESSARAALEALHGRPCALLGGRTLHIRYSIIR-PSISHPNDSVSVSLS 866
            ++G RVIV F++  SA+AALEAL GRPC  L GR+LHIRYS+++ PS +  ND V VSL 
Sbjct: 145  DSGVRVIVSFADPFSAKAALEALSGRPCPDLKGRSLHIRYSVLQLPSETQVNDCVPVSLI 204

Query: 867  ASELDIPGLFLLHDFVSAKEEEDLLMEVDARPWNNLAKRRVQHYGYEFCYQTRNVNTKHQ 926
             SEL+IPGLFLL DFV+  EE+ LL  VDAR W  LAKRRVQHYGYEFCY TRNV+TK +
Sbjct: 205  DSELNIPGLFLLPDFVTVAEEQQLLAAVDARHWIGLAKRRVQHYGYEFCYGTRNVDTKKR 264

Query: 927  LGELPSFVSHVVDRISMFPNVEN-VADASLDQLTYETEINTTLQVNEYPPGVGLSPHIDT 986
            LGELPSFVS +++RI +FPN +N  A  +LDQLT          VNEYP GVGLSPHIDT
Sbjct: 265  LGELPSFVSPILERIYLFPNFDNGSASLNLDQLT----------VNEYPSGVGLSPHIDT 324

Query: 987  HSAFEGLIFSLSLAGPCIMEFRRYPEGTWHKCPSSIDLKMGNSVNDSNYLRRAIYLPPRS 1046
            HSAFE  IFSLSLAGPCIMEFRRY   TW K  ++   K G    DS+ +++A+YLPPRS
Sbjct: 325  HSAFEDCIFSLSLAGPCIMEFRRYSVSTW-KASTTDAEKSG----DSSCIKKALYLPPRS 384

Query: 1047 MLLLSGEARYAWHHYIPHHKIDMVKDSAIRRGPRRVSFTFRKV 1084
            MLLLSGEARYAW+HYIPHHKID VKD  IRR  RRVSFT RKV
Sbjct: 385  MLLLSGEARYAWNHYIPHHKIDKVKDKVIRRSSRRVSFTLRKV 412
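
The percentages on each HSP statistics line are the stated match counts divided by the alignment length. The sketch below recomputes them from such a line as a consistency check. `hsp_percentages` and `STAT_RE` are hypothetical names introduced here for illustration and are not part of the database or of BLAST; the pattern accepts the label spelled either "Positives" or "Postives".

```python
import re

# Hypothetical helper (not part of the database): parse one HSP statistics line.
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = STAT_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

# Statistics line of the AT1G31600.1 hit, with the two counts copied from above.
line = "Identity = 221/343 (64.43%), Positives = 258/343 (75.22%), Query Frame = 0"
print(hsp_percentages(line))  # (64.43, 75.22)
```

Applied to the AT1G31500.4 hit (213/339 and 260/339), the same function gives 62.83 and 76.7, matching the reported values.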

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
CBI29214.3	0.0e+00	64.89	unnamed protein product, partial [Vitis vinifera]
CAN76404.1	0.0e+00	60.50	hypothetical protein VITISV_021238 [Vitis vinifera]
ESR46132.1	8.5e-305	61.34	hypothetical protein CICLE_v10004016mg [Citrus clementina]
GAY53536.1	5.7e-301	60.89	hypothetical protein CUMW_149840, partial [Citrus unshiu]
GAY53535.1	1.2e-298	60.29	hypothetical protein CUMW_149840, partial [Citrus unshiu]

Match Name	E-value	Identity (%)	Description
Q56WM6	5.0e-159	56.15	Rop guanine nucleotide exchange factor 14 OS=Arabidopsis thaliana OX=3702 GN=ROP...
A8MS41	5.3e-116	62.83	Carbon catabolite repressor protein 4 homolog 4 OS=Arabidopsis thaliana OX=3702 ...
Q8RWY1	8.4e-114	64.90	Alkylated DNA repair protein ALKBH8 homolog OS=Arabidopsis thaliana OX=3702 GN=A...
Q9LZN0	7.4e-86	39.36	Rop guanine nucleotide exchange factor 7 OS=Arabidopsis thaliana OX=3702 GN=ROPG...
Q9LV40	1.3e-82	40.61	Rho guanine nucleotide exchange factor 8 OS=Arabidopsis thaliana OX=3702 GN=ROPG...

Match Name	E-value	Identity (%)	Description
D7TFE1	0.0e+00	64.89	Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_02s0087g00460 PE=3 SV=...
A5B3V2	0.0e+00	60.50	Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_021238 PE=3 SV=1
V4SZ85	4.1e-305	61.34	Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10004016mg PE=3 ...
A0A2H5PME3	2.8e-301	60.89	Uncharacterized protein (Fragment) OS=Citrus unshiu OX=55188 GN=CUMW_149840 PE=3...
A0A2H5PN53	5.7e-299	60.29	Uncharacterized protein (Fragment) OS=Citrus unshiu OX=55188 GN=CUMW_149840 PE=3...

Match Name	E-value	Identity (%)	Description
AT1G31650.1	3.6e-160	56.15	RHO guanyl-nucleotide exchange factor 14
AT1G31500.1	3.7e-117	62.83	DNAse I-like superfamily protein
AT1G31500.2	3.7e-117	62.83	DNAse I-like superfamily protein
AT1G31500.4	3.7e-117	62.83	DNAse I-like superfamily protein
AT1G31600.1	3.5e-115	64.43	RNA-binding (RRM/RBD/RNP motifs) family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 570..590
None	No IPR available	GENE3D	1.20.58.2010	PRONE domain, subdomain 1	coord: 512..648, e-value: 1.2E-16, score: 62.7; coord: 279..511, e-value: 4.0E-91, score: 306.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..24
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1198..1222
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 217..238
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1198..1212
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 217..244
None	No IPR available	PANTHER	PTHR33101:SF2	ROP GUANINE NUCLEOTIDE EXCHANGE FACTOR 14	coord: 189..713
None	No IPR available	CDD	cd00590	RRM_SF	coord: 769..841, e-value: 1.43332E-5, score: 42.2921
None	No IPR available	SUPERFAMILY	51197	Clavaminate synthase-like	coord: 867..1083
IPR005135	Endonuclease/exonuclease/phosphatase	PFAM	PF03372	Exo_endo_phos	coord: 1098..1398, e-value: 4.4E-18, score: 65.8
IPR005512	PRONE domain	PFAM	PF03759	PRONE	coord: 286..643, e-value: 5.6E-153, score: 509.1
IPR005512	PRONE domain	PROSITE	PS51334	PRONE	coord: 278..650, score: 93.803253
IPR027450	Alpha-ketoglutarate-dependent dioxygenase AlkB-like	PFAM	PF13532	2OG-FeII_Oxy_2	coord: 869..1081, e-value: 1.2E-15, score: 58.2
IPR036691	Endonuclease/exonuclease/phosphatase superfamily	GENE3D	3.60.10.10	Endonuclease/exonuclease/phosphatase	coord: 1085..1410, e-value: 1.8E-85, score: 289.1
IPR036691	Endonuclease/exonuclease/phosphatase superfamily	SUPERFAMILY	56219	DNase I-like	coord: 1094..1407
IPR037151	Alpha-ketoglutarate-dependent dioxygenase AlkB-like superfamily	GENE3D	2.60.120.590		coord: 862..1084, e-value: 4.7E-43, score: 149.0
IPR012677	Nucleotide-binding alpha-beta plait domain superfamily	GENE3D	3.30.70.330		coord: 755..858, e-value: 2.7E-7, score: 32.5
IPR038937	Rop guanine nucleotide exchange factor	PANTHER	PTHR33101	ROP GUANINE NUCLEOTIDE EXCHANGE FACTOR 1	coord: 189..713
IPR000504	RNA recognition motif domain	PROSITE	PS50102	RRM	coord: 767..845, score: 9.780305
IPR005123	Oxoglutarate/iron-dependent dioxygenase	PROSITE	PS51471	FE2OG_OXY	coord: 958..1084, score: 9.675059
IPR035979	RNA-binding domain superfamily	SUPERFAMILY	54928	RNA-binding domain, RBD	coord: 763..852
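
The alignment coordinates in the InterPro table place several unrelated domains along one predicted protein. As a rough sketch, the snippet below sorts four of the hits by start position to show their N- to C-terminal order and checks that they do not overlap. The selection of hits and the function name `domain_order` are choices made for this illustration, not part of the InterPro analysis.

```python
# Pfam/CDD hits copied from the InterPro table above: (name, start, end).
hits = [
    ("Exo_endo_phos", 1098, 1398),
    ("PRONE", 286, 643),
    ("2OG-FeII_Oxy_2", 869, 1081),
    ("RRM_SF", 769, 841),
]

def domain_order(hits):
    """Return domain names ordered by start coordinate, rejecting overlaps."""
    ordered = sorted(hits, key=lambda h: h[1])
    # Compare each hit with its successor: the next start must lie past the previous end.
    for (_, _, prev_end), (_, next_start, _) in zip(ordered, ordered[1:]):
        if next_start <= prev_end:
            raise ValueError("overlapping hits")
    return [name for name, _, _ in ordered]

print(" - ".join(domain_order(hits)))
# PRONE - RRM_SF - 2OG-FeII_Oxy_2 - Exo_endo_phos
```

This order matches the BLAST results above, where the RopGEF14, ALKBH8-like and DNAse I-like Arabidopsis hits align to successive regions of the query.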

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10013854.1	HG10013854.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0050790	regulation of catalytic activity
biological_process	GO:0080092	regulation of pollen tube growth
molecular_function	GO:0003824	catalytic activity
molecular_function	GO:0005085	guanyl-nucleotide exchange factor activity
molecular_function	GO:0003676	nucleic acid binding