CsaV3_3G020160 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_3G020160
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: protein root UVB sensitive 1, chloroplastic isoform X1
Location: chr3: 16102248 .. 16146575 (-)
RNA-Seq Expression: CsaV3_3G020160
Synteny: CsaV3_3G020160
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptide, exon, CDS
ATGTATGGGGTACTGCCGTTCTCTTATCAGGCGCCGCCGCCGGAGCCGATTCCATTTCGGCCAGTCTATGTCGATGTCTTAAACTACGTACCAGTCCGCCGTTTTCACCATTGCTTGGATTCTTCTATGCGAAGGTCATGTACAGCACTAAGACCTTCTCTTAGCGTATTTCCTCACTTTCTTAAACCCACAAAACTCTTTCAAGGTTATTCCTCTCCTTGTAATGGAACTAGAATCAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCTGGTGACGGCCATGGGTGTGATGGAAATAACAATGGTGGCTGGAATAATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAGTATGACGGTGATTCTCCCCCATGGTCGGACAATGCCTTCCTTGCTTTCTTTTTTTCCTCTGTTCTGGGTTGTTTCTGCCTCTTTCAATTGGCAGTAGCGCTAGCACGTAACAATATGAACACCGAGTCTATTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTATAGAGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAACGTTTGGCTTCGTTGCAGCGATATATTCACGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGTGTCACCAGCGACTATCTGGAATATTCCCTTTGGCGAGGAGTCCAGGGGATTGCCAGCCAAGTTAGTGGGGTGCTTGCAACTCAGGTGCCCATTTAAGTTCATCTGTGGTTCCTCTTTCCTATTTCTCATCCAACAAAAAAGTAATTTACTGAATTTTGCTTGAACCCTCGCTGGGAACCAGGCACTGCTTTATGCTGTTGGATTGGGGAAAGGAGCTATTCCGACTGCTGCTGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTAAGTAAAATTTTTCTCTCAAAATATGGACGGCACTTTGATGTTCATCCGAAGGGGTGGAGGTTGTTCGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGTTAACTCCCGCATTTCCCCTCCATTTTGTCGTGATCGGTGCTGCTGCTGGGGCCGGACGATCTGCAGCTGCCTTGATTCAGGTTATTGGAAGTTGATACGTAATTTAATTTACACAGATATGCCCCATTCCTCATGTTTGCTGCCTTATGTTTATATATTTAATCTGCTGGTAAGTTTCTTCGAATGCATAGCTTAATTTGATTTAGATAGTGAAATGCAGAAGTATTAAATGAAACCAATGGATGGGCAAATGTCTAAGCTTTTGATAGGTCGTGGAGGATCTTGGTTCATGGAATGAGTAGAAGATCATGATCACAAATGTATTAGATCGCCTTCTCTTTAACAATTGAAGTGGAGGTGGGTAGTGGAGAAGAAGTTTCGGTTTTAAATGTTTATGTTCCTTCAAATTATAGATCAAGAAAAGTGTTTTGGGGAGAACTATAGGGCGTGTCTTTAAAGATATTGTGAGTGACAATTGTAGAGAGTCCCCAAAAAACCAAGCCGAAAATGGGTTTTGCCCCATCTTTTGAACCTTAAGAATTTTGCCACTTTGCTTATGCCATCATTAGCTAAAAATACCAAATTATCCTTAATTTGTTATGCCTTCTTTCTCTTCTTGATTTTCTCTTTCTCCACTCTTTCCCTCCTCTTCAGTTGCAAGGGTCATCGAATGTGGATTTTCACCAGAGAGAGGGTCGTCGTAAAGTGGGTTTTCAGCCCTGAGTGTTCTCCTCTCGCCACTCGACATGGGCCACTTGACATGTCATGAGTTGCGAGACTTTGGGATTTGACACTTGCTGACGATAATCCTTTACTAAAGAGAAGAGGACTTGGGGAAATGCTATAACTATTTTGCAAATAAGCTTTGTTTCAAGGCCTTTAATTTCCTAACTCCAACCCTCCCAAATATACGGGCATCCTGCCGAAATTTCCAATTAATCACCGACACTCCTTTACCTCCTCCAC
ACCTTCCCATAGGAATCCTTCATGAGTTTGTTCAAGCCTTTACACACTGATCCTTGAGGCTCTAGAAAGGGAGAAAAATAAATCAGAATACTTCTCAACACCAATCAAGTTAAAGTCAGCCCACCTCCCTTTGAGGAAAAAAAAAAAACATTTCTCCCAAGAGGCCAATTTGTTTATCACCTTATCCTCAACAAAACTCCAAAAGGAATTCTAAAGCGTGTTATTCTCGACCACATTTTCATTGCCAACGAAGTGGTGAAAGACTTAGCTAAGTGGTCTAAAATCACCAAAAGCCAAAGAGATGGGGCCTCGGCCTAAATGATTTGAGATTAAGAAATATGACTCTTTTATCAAATGGGGATGGAGATTCATGTCAGAATAGAAGGCCTTGTGGTTCAAAGTAGTTAAAAGTATTCATAGTATCCACCCCTTCCTTTGGCACACTAGCAGGAAGGAGAATTTGAGTTTGAGAAGCTTGTGGGTTAGCATTTCAAGAGAATGATGCAAGATTGAAGCACTAGCCACTTTCAGAATTGGTAATGGCAGCATAATGTATTTTTAGATTGATCCATGGTTGGATAATATTCCCCTGAACAAGTTCCCTGGATTATCTCGTATTACCCTCTTGCCTAAACAGTCAATTTTTGAGCACTGGGATAGCCGAACTAGTTCCTGGGCGGTTTACTTTAGAAGATTAATGAAAGAGGAGGAGGTTGGTGATTGAGGGAAAATGGTAAACTCTATCGTAGATAAGAAAGTTTGGTCCCTAGAACCAAGTGGGTTCTTAATCCTTGGTTACTCATCTATCACTTGCTTCTCCCATTGATGTTTCTTTATATAGGGTGCTTTGGAAAACTAAGAGTCCTAAAAGAGTCAACATATCCATGTGGATTATGCTTCAAGGGAATTTAAATTGTGCCTCTTTTCTGCAAAAAATCTTTCCTCTCATTGCCTATCTCCTTATATGTGCCATTTATATTGTATGAATCAAGAAAACCTTCAAAACCTTTTTTTTTTAGTGTCCTATGCAGTGAATTGTTGGTGTAGGCTGCTGCAAATTTTTAATTTCAGATGGGTTTTTTATGGCAGTTTTAGTGGTAATGTGCTGCAAATTCTCCCTGGACCAGATTAAAGAAAACCTCATCAGTTTTATGGTGCAATGCAGCCAAGGCTTTATTAGTAGAAATATGGTTTGAAGGAAATCAATGAGTTTTCCACAATACGGCTTAGTACGGTTTGGTCGATTTGAGTATGCTCGTTTAAACACTTCATCATGGTGCTCCATGGTCACATGATTTCAAGATTACTTTACAGGACATTCACCCAACTAGCAAGCTTTTATTGTTTATTCATTTTGAGTTCAGCTAAGGAGATTTAAGCAAGGGCCTTGTTTTTGCCTGGTTTAGAGTTGTCTTTTAGTCTTTTGTTAATGATAGGCCGTTGATGTAAAGGTCTTATGTTTTGGACCATTTTTACTTTCTCAATGTCATTTCATTTCTCTATTTTTTGGATATGATGAAGACGCTAAGGGAATATTAACCTAATTGAGATATTCGGATGCGTCTACTGATCCCTAGTTCTTTAATTGCTCATTGTATAACCCTCTTGTACTTTGAGCTCTGCCTCATTTATTTTTTATAATAAATGAGATTCCTTTCCTTTTAGAAAAAAAGAAGATTCAATGTTTCAACAACAATAGATTCGATCTCATTAAGCTATCAGGAGATGAAGAGTTAACATCAATTCCTTTGAATCCTGAGATTGAAGAATTGATTGGCAGACTCTAAACATATCAGGAAACCGTCTTAGAAGCCCATGGATCAGTATCTCTAGAGAATAGGAAAAGGTGGTTGAGTTGGCAACTTTCAAAACAGGGAACGGAAGGAGAGTAGCTTTCAGGACAGATTTGTGGATAGGTGATCTACCTTTTAAATCTCAGTTTCCACATCTTTTCAGATTAGCTCAGCAGCCAAATGATTCAGTTGCTGCCCACCGGGATT
TTGTCACTAAATCTTGGTCTTTAGTATTTCGAAGATTGCTTAAAGATGAAGAAATTCAAGATTTCCAATGTCTTTTAACACTCATATCTCATAAAGGTCTAACGGACTTGGATGACAGGAGAGTTTGGTCATTAGAATCCTCAGACCATTTTTCAGTTAAGTCTCTTTCAAAGCACCTTTCTCCCTCTTCACTTTTGGAAAAAGTTTACTTTAAAGCACTTTGGAAAACCAGCAGTCCAAGGAGAATAAATGTCCTGGTTTGGATTATGGCAGTGGGTTCTTTAAACTGTTTCGAGACTATGCAAAGGAAGCTTCCCAATACGTGTTTGCTGCCTTCAGTGTGCCCCCTTTGCTTGGAAAACAGTGAGCTATTAATTCACCTATTCATTTTCTGTCCCTTCTCATCAAATTGTTGGTTTAGCATATATGCTCAAATCAGTTTGGGTCTTTGATGGTTCGTTTAGCGCCAATGTGTGCCAATTAATGTGGGGGCCTTATTTATCAAAAAAACCTTTTCTAATTTGATGATTGGAAGGACATTAAGTTATACCTAGAGGATCTCTTTCAAGTCAAAATCAATATTAACCCTTTATTTGCAGACAAAGCCATCATGAAAGTTACTCAAGGAAAGTTCGAAGATTTAATTTTTGCTCAGGTCAAATGGTTCAATTATGGAAAAAATTTCATTTGTTATTCGAAAAATGGAATGAAGCCAAACACAGCAGACCAACTCTTATTGAAGGTTTCGGAGGATGGTTGTCAATAAAAAATCTTCCTCTCAACTTATGGCGAAGAATTATCTTTGAAGCAATTGGAGCATACTTTGGTGGACTGGAAAGTATAGCCTTAGAAACACTCAATCTGATCAAATGCTCTGAAGCAAGAATTAAAGTTTGGAAAAATCTTTGCAGATTCTTACCAGCAACTATAGAATTAAAGAGTGAGAACCGATGAAATTTTTTCCTTAATTTTGGGGATTTTGAACCCTTAGAAGCTCCTGTAATAGTCCAATCTGATCTATTTCGAAGTGTTTTTTCCAATACTATTGATTTATGTAGAATTAATTCAATGATTTTAGATGAAGAGCTCCAAGGTGATGGCCTCCCAGATACCTGGAATTTTCTCAAACAGAATGCCCTGCATCTCGAGAAATTCTTTCGAAGCACTTAGGAAGTTGGGTGGTGTAAATCAAAGTAGAATTATTTCTCAAAATGAATTCTCTGCCTCTAATGGCTAAGAGAAGGGATATTTATTAAGAATGTGTAAGTATACAAGCAGTTGATATAACTGTTTTTATGAAACAGTTTTAACAGTTAACAGTTTAAACAATTAAACTGTTTTACTAACTATCTACAGCTGTTAGTTAACAGTTTAAACAATTAAACTGTTTTACTAACTATTCTATATTACATCATTGGGGGATGAAGAAAACCTTGTCCGGTCGGAAAGAGTTTCAGCCCTTGAATCCTCGCCGGAAAAAAATAATTTTCAGAAGGAAAAATTCAAATTTGGGTATAGCTGGCATCTCCACGTACTTAGAAGGGGAAAAGACTTCAACAGAACAGAGAGAAAAACAGACATCGAAAAAAAAAAAGGCAGCCAGCCGGCATTTTATTATCTCAAAAAACAGCTCACCTAGCCCTCCCCAACCAGATTTTTTCAAACTTAATGTATCTGCAGAAATTTTAAAAGATGACACACAGATGGCCCAAGAAATTTATTAATCAGAATTCCAATGGCTTTTGGTAGTCTGACGAATTAGAGGAAGAATCAGAATCCTCTCCTCCATCAAGGAACTTCACTTGCGGGCCATCCCCTGTTTTCAGTTCTCAACCAAAATTACCAGCCTCTCCCAAGACTAAAATATCCTCTGCAATGAAAGCTATTCCATTTCATGCCAGAAATTTTTCTAAGCACCTCAAGACCTATACCAAAGGGAGAACTTCTTTTCGCAACATTTGGATGAGCTTGGCTAATTCTGACTTATTAGAGGAAAGTTG
TGTGAAGATACAAATTCCAAATTCAGTGGGCAACAGGTCAGTACTGCCTTCCTCCTTAAGTAATAAATTTTCTCAATCGTCCTCCAAACTTTCTGGTTCATTCTTTAATTTTGTGCAAGATACTTCTTGTCTCAAAAAAGTTGAGAACCCAGATGATGGTTTTATGGAAGTTGAATCAGTCACCAGTGTAAGTACTGCAGAATTTGAACCAGTTGAGGAAGAAATAATTCAATCAGAGACAGAGAAGCAACAGGACAGTGACTCTCTCGGTCCAGATTTTATTAAATTGTTTTCCTCCCTAGAAAAAGAAATTTCGTGCAGGTCAGCAAGTATCCTTCCCCCTCAAGAGTTGGTTCCATTCTTGGCAGCATGTGAAATTGAATTGATATAAGGCAGAGGTAGCAAAGATGGAGATTATATCATGGAACAGTAGAGGATTCATGGGCCTTTTGAAACAATTGGCTATTAAAGATCTTTTGAAGAAAGCTAATCCAGATAGAGTTTTGTTTCAGGAGACCAAAAGAGAAGAGATTGCCACTTGTATTATGAAGGCACTATGGAGTTCAAAAGGAATTGGCTGGGTTTTTGTTTGAAGCTAGTGGCAGATTAGGAGTTTTTCTGATTATGTGGGATGAATCTAAGTTGTCGATTATTGAAGTTCTGAAACGAGACTATTCTCTTTCAGTAAAATGCAGCACAATCAATAGAAAAATATGTTAGGATTTCAAACTTCTATGGTCCCACACATTATTGAGATAGAAACCAAATTTGTCCGGAGTTGTCTCTTCTTTCAGACCACTGTACAGGAGCCTGGTGTTTGGGAGCTGATTTCAACATCATTCGAAAAGTTCAAGAGAGATTTCCAGTAGGAAGATTAACAAAGGGAATGAGAAAGATTAATAAACTCATTAGAGTGGCAAAGCTACTAGAAATTCCGTTTTCCAACGGTAAATTCACATGGTCAAGAGAAGAAAGGTCAGTTTCATGTTCTCTGCTGGATCAGTTTCAACAATAATTTATAATTTGTTTGCAAAGGTTCTCGCATAAAGATTAAAGGAAGTAATGCCAAGTATTACTGCTCCATTACAATGTGGCTTTCTGGAAGAGCGCCAAATCTTGGATCCTATTTTAATAGCAAACAAAATTGTGAAGGACTATAGAATCCGAAAAAAGAAAAGATGGATCTTTAATTAGACCTTGAAAAGGTCTTTAACAGAGTGGATTGGAATTTTTAGAAAATGTATTGCAAAAGATCTTCGACGTAGGATATTGTGTTATATTGGCCAGTCTTTACAGCATCCCCACTTTGAAGTCCTTGATGATCAGGACTTCCGGATCAGCCCTATGCTTCAAACAGTTTTTTGCTCAATCAATCAAGGCTTATTCTTCTTTGTTCTCAGCCTGCTGCCTTTGTTTTTGTTTTTGCTTTGGACAGATTCTTTTTATCTTTTTGTTTGGCTTTGTTTTTGTGGTTTTTTGTTTTGATATTGTTTTGTCGCTTTTTGGTTGTGTTGAGGGTGCTAAGGGGGTGTCAACCTAGTTGCTATTGTGTCAGCACAATCAATAGAAAAATATGTTGGATCCCTTTTAGAGAGAAGGGTTGCTTTCTTTGGGAAGTAGGGGTGTGTGCCATTCTATGGGTGATATGGCTTGAGAGAAATAACCAAGGGTGTAGAAGGTTGGAGAAGAGCCAGTGATGTTTGGTCCTTGGTGAATTATCATGTCTCGCTTTGGCAACATTTTAGTTAGTTGGACCTTGTTTATCTAATGGATTTCTGTGGGCTTGGTCTTTTTATGCCCTTGTACTCGTTCATTTTTTTCTTAATGAAAGTTGTGTTTTCTATATAAAATAAAGAAATGCCTTTCATTATTCTTAGGCATAAGTATCTTCTAATCTTTCTTGAATTGGGAGGGGGGAATCAAATTTTACCCTTAAATTTTGGGAGTTGTATCACTTAAATTCCATACTAATAATTTAAACCCTATACTTTCATATTGGA
ATCAAGTTAGACCATCTGATAGAATTATGTTCTAAAAATTGGTAATGCATAACTTTCACACATGTATGAACTTATTTAAATGTAACCATCATAGGTTGATCTAGTAGTAAAAAGGAGACATAGTCTCAGTAAATGACTAAGAGGTCAAAAGGTTCAACGCATGATGGCCACCTACTTTGGGTTTTCTTGACACCTAAATGCAGTCGATTTGTCACGTGAGATTAGTCGAGGTGCGCGTAAACTAGCTTGGACACTCACATATATAAAAGAAATAAAAAATAAACGTACTTAGATGCAGGGAGTAACTTAATATCAGGAAAAAAAAAACTAGAACCAAATTGATAGTAGTGGATTTTCATTATTTTAAGTGGTTTTTCATCTTCTTGCCTTCTTCTATTCTTTTCTGACTTATTAGGAAAATTTTCACGAGAAATTTGCTTGCTACTTTGGTAGGATTAGAAGAAAAATACTTCTATTTAGGTGATTTTGAGTTGATTTTAAATGCAATAAACTGAAAATAGATATAAATTACTTATGGAGTGTACAAATTGATTCATTTACGAAAGCTCAACCTTACAAAAATTGTTACAACTATGAGTTTGGGATTTTTGTTGATACAACCCTCAAAGATAAGTGTAAATTGACCTTTTTCTATTATTATTATTATTATTTTATTTTCTCCTTCCTGTACTTCTCTTTAAAGTTTTCTTTTCATGTGTATTAACATTTTTAATCTACTATGTACACTTGGCTTGCTAGGTCCTTTATCAAAAAAATTATATTAATAGAATACAAATATATAAGTTTTCTTGTATACATCTTTCTTATTTGACGTGTCCAACCGCACATGCATCTTTGTTTTTTGTAAGTTCAACTTTTCAAGTCATTGGTATCTCCCTCGACATCTACATTCAGCTCTTGATTTCAAGTTGAATAACATATATTGCCATTTTACTTCATTCAAGCTTTTTTGTATCAGGTGAAGTCACTGAGCCCATGAACGTTATAGTTTGCCTTTTTTCTGCCAGATTATATTGCATAAAGCAGAGAATTTTTTTATTTTTAATTTCTGTAAGTTTTTTTGAAATGGAAACAAGGCTGATAGTCTTGTTTCCGTTTTTTAAAAAAATCTGTTTTATGCAATATAATCTGGCAGAAGAAAGGCTCGTAGTCTTGTTATTGTTTCAGAAAAAAAAAAAAAAAAACTTTTCTGCTATATGCAATATAATCTGGCAGATAAGGCAAGCTATAACGTTCATGGACTCAGAGACTTCACCTGATATACCTTGTAAAATAAGTTACATTTTTCTGAAAAAATTATTATGATTTTAAGTTTTAATGGAGTTGCACAGCTTTGAAAGTTGAATACAATTTTTTTTCTTCAGGCTGCTACTAGGAGTTGTTTTTATGCTGGCTTTGCTGCTCAAAGGAATTTTGCCGAGGTGAAATAGAGTAATCTCTTCTCTGGTTAGTGTATGTTAGCAAAGTTACGATTACCATACTTTCTTGTAGCTGCAACATTCATGTTTTTCCTTTTTTTTTTATTACCTTTTTTTATTTTATAAATAGAACTCAACTAGTGCTCAAAGTACATGAGTGATATACAAAAAGCACACACATGAAGAGACTGTGGATCAGTAGGCGCATTCAGACATCTAAACTGGGTTGACACCCCCATAGGGCCTTCATCACATCCACTAAAAGGAGACTAACAAAGGTCAATCATAAAAAAAACAACTCGAGATCGTGTAGGAGGACTCAGTGATTTTTGTAGGAGAAGAATAGCTAGTATTTGGAAGAGTAAGAGAGTGGAGTGGGAGGTTCTTAAGGCTGAAGGTTTCTTTGGTGGGATTCTTGTGTTGTGGAACTCGAGGTTGTGTGTTGCTTTTGAAGTGAACTGTGACAGACACTCCATTACTCTAGCGTTTTTTGGATGGTGAGGGTCATGAGTTTTGGGTCACAGGGATTCTTGGGGGGAAGAACTGGGAGATCTTTTCGGT
TATTGTGGGGTGAGATGGTGTGTGGTGGGGGATTTTAACGTGGTCCGTTCTCCTGATGAGAAAGCTTTGCGAGGGAGAATCATTAGATCTATGAGATGTTTTAACAACTTCATTTATGTAAGTGGTTTGTTTGATCCTCCTCTTGTGGGGGGTAAATTTACATGGGCCAATTGTAGGGCTGCCTTGAGAATTGATAGGGTGCTTATGTTGGAAGCATGGATTGAGAGGTTTGGGAACCCTAGGTAGGTTAGAGGGTCTAGAATCACTTCGGATCACTGGACCCTTATTTTGACTAATGGGAGGTTGAATTGGGGTCCAGTTTCCTTCAAGTTTGAGAACATGTGCCTTGACCATCCCTCCTTTACAGCAAATATTGTTTCTTGGTGGCATACCAGTATCCAAGGTTGGGAGGGGTTTAGATTGATGGAATAGCTGAGATTTCTTAAAGGCTAGCTGAGAGCTTGGAACAAAGAGGTGTTTGGGAATATTCGGGTTAAAAAGGAGATTGTTGCTAGGATTGGCGAAATTGACAATTTTGAGTTGGATGGTCCACTTGAAAATGACCTTATTGAGGAGAGGTTTAGTTAAAAAGGAGCCTTGAAGAGGTGATTAGAAAGGTTAGTGTGAGCTGGTTTCGAAAGTGTAAAATTAAGTGGGCCAAGGAAGGGGATTGGAAAACAGTCAATAAATCATTTTAGATGGAAGCCACGGCTTAGGAGGCTTAGAGGTTAAGATTTTGGCTCCTTTGGCGAAATGTGGCTGGAGATACATTGAGGAAGAGACTTCTCTATGGTAACGTGATTAGAAGTATAAACGAAAAAGATTCCTTCAATTGGCACATTGCTGGAAAAGACGGGAAGAGCTTGAGATGTCCTTGGATAAGTATATCTAGAACATGGTTGAAAGTGGATGCTTTGGCTACTGAAAAGTTGGGAAATGGTTGTGGGATTGCTTTTTGGTTAGATACTTTGGAGAGTGAGGTTCCTTTCAATTTTTTGTTCCCAAGACTATACAAAGTGGCATTACTTCCAAAAGGGTCAGTGGCAGATTCTTGGGAGAGCTCCTTGGCTTCTTGGTCCATCGCTTTTCAGCAATTGTTAAAAGAGGAGGAAATACTTGAGTTCCAAGTGTTGCTTAGAAAAATTGCAGCAAGATTAATTTCTGATGATTTTGATAAAAGATCATGGTCTTTGATGGGTAATGGGGTCTTCTCAGTAAAGTCTTTCTCAGTTCACTTATCCTCTTCTTCTCCAATGGATAAGTTGCTATACTTGGTGCTATGGAAGTCTAACATCCCAAGGATGGTTAACATTCTTGTTTGGATCATGATGATCGGTCATCTTAATGTATCCTCTGCAATGCAGCTTAAACTTTCTAACAGCTGCTTTCTTCTTCCATTTTCCCTATATGTCATCGGAATGGAGAAGTTAACATATCTTCTTCTTCTGTTTGTACGCATCACAATGTTGCGAGAAGTTATTTTCTTTCTTTAATTTTTCTTGGGTGTTTGGTGCTGACCTTCAAGGAAATATCCAACAACTTTTTAGTTGGAGTGCATTTCAAAAAAATCCTTCAATCACTTTGGATACATGCATTAAAATCGCATTTGGCAGAACTATGGTTTGAAAGAAGTCAAAGGGTTTTCCATGATAAATTCTTTGTTTGGTTGGATCGCTTTGAAGTTGCCTGTAAGAATGCCTCCTCGTGGTGTTCCTTCTCCAAGCCTTCTGAAAGCTACAACGTTCAAGAAATCTACTCAAATTGGAGAGCTTTTATCTTTCCAGCTTAATTTTGTTTTGAGTTAAATCTCGTATAGCACTTTTCGCTGAGGCCTTCTGTATGCTTTTTGTTGAATCCTTTGTAAGGCTAGTGATCAATAGTCTTTTTGTCCTATCTGTAATGGCTGAATTTATAAGTCTCGGTCGATTTTTGATGTTATGTACAAAAAGTTCATAAAAAATTGCAGTTCAGATGTAGTCACGATTCAAGAGTTTAATAAA
GATATCTTGAATTTGGCTTTTATAAAATAACTGTGGAGTTTTAAGGATATTGGCTGGGATCTTTTGGCTTCGATTGGATTGAGTGTGATCCTAACCAATCTTGGAAATTACACTCCAAAGTGATAATGTCCATCCAAGGGTGCTTCCAAAACAATATTCTGCTGCCATCACCTAATTTGAAAAGAGCCAATGAATCAATTTTGCGCCATTGACTTGATATGCTAATCCAAGGACTTCTTAGACTCACACTCTTTTTCACTCTAATGCCAATTGAAAGTACTACTTTCATGAATGCATCTAACCACTTGACACCAAAGGGAATTTTGTTCTTTTGAGAAACGCCATCCCCACTTAGACAATAAAGACAGATTCTTGTGCTTCAGCTTGCTCAAACTAAGTCCTCCATCTTTGTAGGATTTAATTACAGCATCCCATTTTACGAGGTGGTTTAGCTTTCCTCCCTTGTTTCCTTCCCCAAAGAACTTTCTCATGATTCTTTCCAATGAACTTAGAGCAGATTCTGGCAGGAGGAAAACAGACATATAGTAAGTGGGGAGGCTGGATAGAATTGATTTGCAAAGTGTAATCCTCCCACCCTTAGAGAGATTGTATCTTCTCCATTTATGTAGCCTTCCATGGATTTTATCAATAATGGGCTGCCAAAATCGAGCACTTTGTGGATGACCTCACAAAGGAAGACCGAGATAAATGAAGGGGAGACTTTCTATCTTGCAGTTTAAGATGTTTGCTGTTGAAATAACCTTGCTGTCCTCAATATTTAAACCACAAATAGCTGATTTCTCCTAATTAATCTTTTGACCGGAACACCATTCAAAGACTAGTAATGTTTTCCTAAGATTTTCCATCATTTTCTCGCCAAATTTACAAAACAGTAGCGTATCATCCGCAAATTGGAGGATAGGAATGTGAATCCTATCTTTTCCAACTACAAAGCCTTCAAACATTTCTTTCTTATGAAGGTGATTTAGCAAACTCAAATTGGTCTATCTATTGGGAGAATTTATCATGAAAAACATGCTGGTTTCTTTCAAACCACAGCTCCACGAGTAAAGCCTTAACCGCATTGCTCCATAGCAAAAAAGGCTTTCTTCTTAAAGGCTGGTCCTACCAAAACTTGCAGCATGTTACTCCTAAACTTTTTCTTGACACACATTTTTGGCAGCAGGTTCAGATGGGTTTACAGTAGAATTCTTTATTAAATTTTGGGATCAGTTCAGATACAACTTCACTAGTCTCTTCAACGAGTTCTACGAGAATGGGAAAATAAATGTATGTGTCAAGGAGAACTTCATTTGTTTGATTAAGAAGAAGGGAGATGCTGTTACGGTGAAAGCTTTCAGACCTATTAGTCTTGCAACTTCAGTTTGCAAGCTGATTGCAAAGGTACTAATGGAAAGGCTAAAGAAGGTCATGCCAAGCATCATTGCTTCTACTAAGAGTGCTTTCTTGGAAGGCTTGAAAATTCTGGACCCAATTCTCATTGCAAATGAAGTAGTAGAGTAATAATGAGCTAAAAAGAAACAAGGCTGTATTCTTAAATTAGATCTTGAAGAAGCCTTTGATAGAGTGGATTGGGTCTTCTTGGAGAAAGTGGAGACAAAGAACTTTGATCCTTGTTGGATCCCTTGGATTATGGGATGTGTGAAAATTCCAAAATACTCATATGTCGTCCAAGAGGGAGAATTTTAGCTTCAAGGGGATTGGAGCAAGGAGACCCACTCTCTTCCTTTCTTTTCCTACTTGTTAGTGAAGTTCTTTTTTTAAAAAAGGAAACAATCTCTTTAATAAATTAATTAATAATGAGACAAAAGCTCATTACACAAGAGGATTGCACAATTAGCAAATAAACAAAGGATCAGAAGGCGCACCCAAGCATCTCCGCGCATCTCGATTAGATTGACACCCTCTTAGCATCCTCATCATGTCCAACTAAGAAAAAACGAAATAGATATTGTCTATAAGCAAGGCTAACATAAGCC
TACTTGTTGCTGAAGTTTTGAGTAGTCTCATCTTAAAATTACACACAAAAAAGATGTTTGAAGGCTTCAATTTGCAGATGACACTGGAGAGGCAATGCTGAAAAATTTGCAAAAGACATGCAAACAAAAGCTTAGCTCAACTGGCACTTGTGTATACACCTGAGGACAAGAGGTCTTGGGTTCGAATCCCCAATATCCAATTTGTACTACAAAAAATTTGCAAAAGACATTAGAATTATTTGATTCCTTTCAAACCATAATTCAACAAGTAAAAATTTAACTGCATTGTCCCAATTATATTTGTTGTTTTATAGAATTATCCATTTATAATATTTGAAGGGACCATAAATTTATATACTCATCTTTGTGAAAATTATTATCTTATTAATTTAGACCTTTGGCCCAAGTTGGGGCTCAAAAAAGTTATCATTTTAGAGAGGTTACCAGTTTATCAATTATTCAATTAAAGGTGTTCTTCTGTATTCCAATATTTCATAGCTTGTCTTTCTACATCTTTATGTTTTGTAAGTGCGGAAAAATGAGGGACATATATTAATTAAATGGTGGACATATATTAGCTTACCTTTGCTGATTTTCTGCATTAATGTATTCTATTTAACCTTCACGTGTAGGTGGATGCCTTTTAATTGCTTCTACACTTTTGTCACTTCAATCCACCTTGATAACTGTAGTTTACGTATTACTTAATCACAGAAAACATCCAAAACGTGTATCTCCTAGTTTAAGCTGGGCTTGAGGCATATGCCCATTTTCTATGTTTTTCTTACTCTTGATGCATAAGGAAAATAAGATCTTTTGTATTTCAGAATATTTTGTTTTTGGTCTAGATGGTATCCCGTTGTATTGTACTTATCTTGGAAGCCTAGAAGGATTGGGTATCTTATTCGGAGATGATGAGGTGCCTAAGGGGATGTCAACCTAGTTGAGATGGCCGGGTGCACCTCATAATTCGTTGTGCTATTCTCTTTATTCCATTTGTTCTTTGTATCCCCTTTGTACTTTTAGCATTAAGATATCTGTTTCCGTTTAAAAAAATATATACCTTGAATGAAATTGACCCTATCCTTCCAATGCATATGTGCTGAAAGAAACATAAATTATTCTAGGTGATTGCTAAAGGTGAAGCACAAGGAATGGTGAGCAAGTCTATCGGTATGATGCTTGGCATTACATTGGCCAATCGTATAAGGTCCTCAACATCACTTGCTCTTGGGTGCTTTAGCATAGTGACCTTAATCCACATGTTCTGCAATCTAAAATCATACAAATCCATTCAACTAAGGACATTAAATCCTTATCGTGCAAGTAAGTTATCTTGCCTTCTTTTGCAATTTAGGGCCAATGAGATGTATTAGTACTATTTTTATTATTTGTTTGTATTATTTGGAATGAGCTGATTATTTTCGATTATCTTTGTATTGGATCTGAACTCTTGTAATCGTTGGTCTAGGATTGAAAATAATGAGAGTGCTAAGGCGTGTCAACTTAGTTGAGATGCTCAGGTGCACTTGTTGATCTTCGAGGCTTAGGCTTAGTCTCTTCATTGTAATTTATCTATATATATTGTCTATTCATTTTCTTAATGTAATGAAGAGGCTAGTTTCCTTTTGTTTTTTTTTCAAAAACACTATATCTTGGCAATTTCCTTGCTTGGCAGTATGTTAGACAAAACCTATTTCAAACTCACCCCTTTGCACTTCTTGAGGGGACTTGTTGGCAACCTTTTACATGGCAAACCGAAGACATTTCCTACGGTTGATGGGGCACGGTTTGAGGTTGTGCACGCAACCCATTATCCAATTTATCTGTCTGGCATCAAAAGTTTTTTGCACCATTAATTTTTCTAAAAAGTTCCAATCAATTCTATCAAAGGCTTTTTCAAGGTCCAACTTAATTAACCACCCTTTCTACTTCTTTGCTCGGTACTCTTCCACCACTTCGTTGGCAATGAGAATGGGTTCGATGATTTTCCTTCCTTC
TAAAAAGGCACTTTGAGGAGGGTCAAAAATGCTTGGCATAACTTCTTAAATCTTTCAGACAAACAAGTTCTGGCAGATTGGAATCATTACAAACGCTTGCTAAGACAAAACCATAAAAGACCAACAGCATTAAAACCTGTTACTAAATAAAAGCATTCAAAGTCTTCTAATGAAGAAAGCAGCCGACCGGTAGATCTTGCATTCTGAAGCTTGAAGAGGAAACTGCAACTATAAAGCTGGTAGGGAAATGAAGACGGCCCAATTTAGACATAGATCTTGGATGGATTAGTCCTTCAATTCTTTATTTAAATAGCACCAAGTCGTGGCATTGCGTTTTGCAGTGTCTACAATATCTGACCAAACCTTTTCCTTGTCACGAAAGATCCGTTGGTTACGCTCATACCAATTTCTGCAAGAAGGGCTTTAGACATGTTTTCCCATATTAAGTGCGGTTTCTTTGCCAAAGACGGGCCCTTCAATAGTTGAAACGCATTTGAGCTCAACGATCCATCAAAACCCAAGCAACCATAAAAAGAGAGAGTATACTCCCCCAGCAATTGGTTGAAAACAGGCATGTAAGAAAGAGATGAACTAGTTCCTCACTGACCTTCAAACAGAGATGACAAAGCTGAGGGCAGAAGGCACTTGTTAGGGGACTTTCTTTGCAAGATTTTGGAACAATTAAGAAGACCAAAAGCCATAATCCAGATCTATCAAAGGCTTTTTCAAAGTACAAGAGAGTTATACAAAGAGCAATAAAAAGTAATCAAGGAAATCCTTGAGGAGATCAATAGGCGCACCTGGACATCTCAACTAGGTTGACACCCCCATAGTGCCAATCGTCATATCCCGAGCAAAACAGAAAACCAAACAAAGGAACAATATTAAGAAGTCCAGCTTATTACATGGGTTCAGAAGACAACACAGAACCAGAAAAACAGAAAGAAACAATATGGAAGGCCCATGAATAAAATCTCAAAATTAAATGCTAAACATGGGCAATTCTTCAAAGCTTCACAATGCAGCAAGGAGCTTCAACTGTGTAATCTGCAAACAATGGATCTGATGCAGCAACTATTAGGCAAGATGGGTGAGAAAAGCATCCCAATTGAGGCAAATGTCTTGGATGGAATAATTAATGAACTCCTTTTTTAAGGAACACCAAGCTGCAGCATTAAGATCTACTGTATGCATAATTTCTGCCCTTGGTCTTGTGTTATCATAGAAGATACGTTGATTATGCTCAAACCATAATTCTGTGAGTAGAGCTTTTGACAAATTTTTCCATATTATACGGGGTTTTTTGGATAAAGTCGGACCTGACAGCAGTTGAACCACACTAACACTTGGTGAACGATAAAAAACCCATTCAAAATGGAACAAGGAGAAAATTCTTACCTAGCAGAATGAAGAGACGAGGCAATTCAAAAAAATATGAGGAAGATTCTCGCTAGCCTTCAAACATAGAGGACAACCCAGCTGAAGTTCGTTATTTTTGTTGTTGTTGGTATGAAATTACATTGGTTTCTTTTTACTTGAATCTTTGCTTTGGAAACATTTAGAAGATTAAGCGTTCAGTAGCGATATTTTCCAGACCCCTAAAAGGAGCTAGAGTGTGGTTAGGGACTAAAGGAAGTGTCCTTTTTGTTTAGCTCTTTCTACAAACCTCTTGTACTTTGAGCTACATTATTATTATCAATAAAGAGGCTGGTTTCCTTTAAAAGAATAGTTTAATAACACTAGATGTAGGTGAGTTGGTAGTTTAAGGACAACCGAGGAACAAAAATAGCCCAAACAGTTATATCAAAGTAAGAAATGAAGTTACTAAGCATTAGAAAGAAAGGGTGTCATCATAATTGAGTTTCCTGGGTGTGCCTTCTTGTTTCGTTATAGGCTGAATTTTTTCCAGAATTTTTTTCTCTTCGAATAATGCTCATGTACTTTGAGCATTAGTTTAATATATTATTATTAATAAAGAGGGTTGCTTTCGTTTAAAAAAA
ATCAAAGGATCTTCCATGAAAAACTTCCACTTGGTTGGATTGGCTTGAAGCAGCTAGGCTCAACGCCGCCCACCGATGTGCCCTCTCTAAGCCATTCACAGTTTTTGTAGTGCAGGACTTAGTTCAAATTCGAAGAGCATTCCTTCTTCCAGTCTTCTAATTATTGTTTTACTAGACTTTTTGTGATTCCTTCAAGTAAAGGGATTTGTACTACTTGTTTTGTTTTGCCCTTATGATGTATCCTTTTGTTGTAGTTCTTCTTTACTTTGAGATATGATTTAATCATTGTTCATTTGTTTAGAAATGACGTGGGCACTAGGGATTGTCAACCTAGTTGAGATGCTTGGTTGTGCCTCTTGATCCTTAGTTCTTTTTGTCTTTTGTATAATTCTCTTGTACTTTAAGCATTAGTATCATATTATTTTATTAATGAAGAGACTCATTTCTGTTTAAAGAAATGAAAACACTCTGTTGGTTGTAAATAAGACAATTAGTTGTAAAATCACCAAAGATTTTATTTTATTCTCAATATAATTATTTCCTTTTATGTATTGAGCCTTTCCCCTATATAAGAAGGTTATGTATCTTAAATTTCAATTAGTGGAATACAAAAGTAATTGATTTCACTCAAATTGAGATGGTATCATAGACTATAATATTTTGAAAATCTCAAATCTCCGATTTTCTAAAACCCTAAAAAACCCTGCCTCTGTCACTGAACATCTTTGATCACTCACGATTTCCGTTTTTAGTCAATCACGTATCTCTGCTAGACATTTCATATCAGCCGCCGATCGATTCCTTTCTCATGAGACGTTCCATCTCAGGCGCTTGATCATGAGTTGTTTGTAGTTGTCACGTTAGGTTGTCTTCATTAGTGTCGTTGGTCGCCTCTATTCACCTATGTGTCAACCGCCTCTATTTGTTGATGTCATCTATTCGTGGGTTTCGTCCAGTTGGTTGTTTTTTATTCCCCTTGTTCGATCGGTTGCATCATTGTTTCTTTGTTCATTGCGGCTGGGTTCAATTAGGTCACAGGATCCAATTCATTTATAGAAGTTAGTTCGGTTGACTTTTTGGGAACTGATTCGATTTGGTCTTTAGAAACCGGTTTGGTTCGACTAACAGAAATTGGCTTGGTTCAGATTCAGTTGAATTTTTGTGAGCCTCGAAGTCTATTATTTTTTTTGAATCTCATTGCATTTTTTAGGATCAAGCCTTAGAGAGGATGTTTGAACGTGCTAGGATGCTTGATAGTGTTACTATATTAAATTTATCTTCACCTGTTGGTTTAAGCTTTTGGGTCAATTGGTGATTTAAGATGGTATTAGAGCCTATAAAAGCCTAAATGGGTGTTCGGTTCAATAAAAATAAATCGAGATCCGATCAAGATTATTGAACCCAAAGAGGGACCATCTTTAGGGGGCATGTTGAGATGTTGAGAATTGCTCATTAGAAAGATCAAGAGGACTCACAGTCCTTATAAAATAGATAGACTATTCCTCTTATTGCCAATTGGTTTTGAGATGGAACCCCATACTATCACATATAGTATTAGAGCAGGTTGCTTCAAGATGTCCTAGGTTCAAGTCCCTATAATGTTATTTCCTTTTCCATTAATATACATTTCCAATTAATGGGCTCTTCATATTACAAGCCCACAAGTGGGGGAGTGTTAGATATTGTATTAATTATACCTTCAACTACTAGCTTTAAGAGATGACATACTATTTTGATGAACCCTGTGGTATCAAAAACTCGTCTTTAGTAGTATAATTTATTCTTCTGTTAAAGACAAATTATGCTTTGATATTGTAGACTAGGGCAGCCAAGTTTTTTTTTTTTAATCTCAAAGTTCTATATCCAAATCTCTTGAAGGGAGTTGATTGTTCTCAATTAAATTGTGAAAACTGCATATTTGCAAAAAGTCATCGTTCTTGTTCATCAAAACCTTACTCATTTTCAAAACCATCTTACTTAATTCATAGTGATTTTTGGGG
GTCTCTTTAAATTTATAACCTTACAATGGAAAATGTTAATTTGTGGCTTTTATTGATGATCATACTTGTTTTTCTTCAGGTTTTTTACTAACAAAGAAATCTGACGTCAAGGATGTTTTCACATGTTTTTCTAATATGATTGAAACCCACCTTTCAACTAAAATAGGTATTCTATGATTAGATAATGGAACTGAATATTTGAGTTTTTTTTTTAAATAAAGGCATTGTTCACCGATCTACATGTTGAGACACCCGAATTGCAGAACGCAAAAATACGCATTTACTAGAGGTTGCTTGGACCAAAAGGTTTACCATGAATTCCTGAGTACCTGTGGGGGAAGCAATTCTCATGGCTATCTACCTAATAAATTGAATGCCTTCTGAAGTCTTGAATTATAAAACTCCTCTTGTTGTTTTGAAAGAGTTTTAACTTTTTGGTCTAAATTCGGATGTGACGAGAGTGCTAAGGGAGTGTTAACCTAATTGAGATGTCCAGGTGCACTTGCTGATCCCAAAGGCTTAGATCTTGTGTCTTCATTGCTCAAAACTACTTTGTCTATTCATTTTCTTAACGAAGAGGCTCGTTTCTTTTTTGTTTTTTCAAAAAAGAAAAAACTAGTTGTTTGTCCTCTGATTTATCTATTACAGTTTTTAGGTGTATTTGTCATGTGCATCTTCCTAGTCGTTTTGGATCAAAATTAGATCATTGAGCTATTAAAGGTATTTATCTAGGGTATGCTTCTAGCAAAAAGAGATATAAATGTTTTGATCCTAAAATTTAAAAAAGGTTTATGCGAATACGGATGTGTTATTTTTGGAAAACCAACCTTTTTTCACCCAAAATAATCTTCTGGGGGAGAAAACACCAAATTTTGAGGATGATACTTTTTGGGATGTATCTATCTCTCTTCCAAATATTATTGGTCCTAATCTCTCTAGTTTTTCAATGCAAGTATCTAGCCTTTTAATTCAAGGAGAGAATTACTAGAAGATCCTAATGATCTGAATCCTAAGTTTAAGTTTTATGCTAGAAAGGGGTTCAATCAATTGAATAAGGATCAAATAACTGACTCGTCCCAAATCCATTCTCGTACTCCAAGGGATGATTCTGAAACTCCAGGTAACCAACCTTCCATTCTTGTTTCTCCTGATTATTGAAAAATGTTCGATGTCTTTCCGACCATGGAAATGTTTTACCTTTTGCCTCTGATCTTGATATCCCTATAGCTTGTATTAAATATCCCATTGCAAATTATATCTTGTACAAAAATTTGTCAGATAGTCATAAAGCTTTCACTTCTCGGATGGACACCTTATTTTTCCCAAGGAATATACAAGAAGTCTTAGATGATCCAAATTGGAAACTAGCTATTATGGAGGGCTGGAATGCTCCAAAACAGAATGGTACATGGAAAATAGTAGATTTGCTGAGAGAAAAGAAACTGTAGGGTGTAAATGGGTTTTTACATTAAATGTAATGTTGGTGGTAATGATGGAAGATATAAAGCAAAATCAGTTGCAAAAGGATTCATTCAAACTTATAATTGATTATTGGGAAACTTTTGGTTCTGTTGATCAAATTAATTTTATCAGAGTATTGTTATCTCTTACAGCAAATTTAGATTAGCCCAATCACCAGCTCTATGTAAAAAATGTCTTTCTCAATGGTTGCCTCAAAGAAAAAGTGTTTATGAGCTTACCACTAGGATTTGAAATTGACCTGGGAAAGGACAAAGTATATCAACTCAAGAAGTCTTTATACAGACTGAAATGTTCCCCTAGAGCATGGTTTGGACGATTTGAAAAAGTAGTTACTAGTTTTGGTTTCAAAGTCAAATTGATGATACCATTTTTTACAAGGGTTCTACAAATAACAAAACTACGATTTTGATTGTGTATGCAGACAACATGATCCTTACAGGGGATGACACTATAGAATTGACAACATAAAAGGAAAAACTTGCAAATATATTTCAGATCAAGGACTTCAGGGCATTAAA
ATACCTCATAGGAATGGAATTTGCATGATTTGTAAAGGGTATCTTTGTGCATCAAAGAAATTATTTTACTGATTTACTCAGAGAAACAGACTTACTTAGGTATCTTCGTGTATTTTTTCCACACACTTCCAGATATTGCCTTCACACTAAGCATGGTAAGTCAATCATGCACTCACTAGGATCGATTCATTTTGAAGCAGTGTATTGAATCTTGAGATATTTGAAGGGAACTCTAGAAAAAGGCATATTTAAAAAACACAACCATCGCCAAGTTGAAGTTTATACAGATGTTGATTGGGCAAGAAGTATGATGGACAAGTCTACTTCAGACTACTGTTCTTTTATTGCCAGTAACTTAGTCACTTGCCATAGTAAAAAGCAAAATGTGGTGGCCAGAAGTAGTGCTGAAGCAGTATTTAGAGCTCTAGCTCATGGTATATGTGAAGGAATATGGATAAAAAGATTACTGGATGAACTGAAGGTCTCTCAAAAGGCACCTGTACGTGTATAGTGTGACAACAATGTTACTATCTCTATCGCCCATAATTCAGTTCTACATGATAGGACAGAACATATTGAAATCAATAAACACTTCATGAAAGAGAAACTTGAAACTGGTGTTATTTGTATCCCGTATCTTCCCAGTTCAGAACAAACTGTCGATGTGTTGACTAAACGACTATCAACGGTACAATTTGATAGATTGGTTTCCAAGCTTGCAATGGAAGATATATATAAAGTAGCTTTAGGGAGAGTGTTGGTTGTAAATAAGATAATTAGGTGTAAAATCAGCAAAGATTTTATTTTATTCTCTTTATAATTTTCCTTTTATCTATTGAGCCTTTCTCCTATATAAGAAGACTACGTATCTTGCATTTAGATTATTGGAATACAAAAATAATTGATTCCACACAAACCACAGCACCCTCATTACATGCCCTCTCAAAACCACTACCAAACATTACTTTTGTAAATTTTAAATTTGAACTTTATTTTTTAGTGGTAAATTTTACCTGAAATGTTTGACAGGTTTGGTCTTCAGTGAATATCTGTTGAGTGGTGAGGTGCCTTCGATTAAGGATGTGAACAATGAAGAACCTCTTTTTCCGGCTGTACCACTTCTTAATAGAAAGCTTGCATGTGATGTGAGTGAATTTCTTTTCTATCTCTATTTATGACGCATTGGAAAAGCTTTTGACTTGGTTAACAAATGAACGGTGATTGACTAGGTTATTTGGGACATTTTGAGAAGTAATGAAATTAAAGTGCTTTGACACAATAGAATCAAGGAATGTCAGGGCTGCATTCCTCATATAAAATTTTCTATTTTCATCTGTATCCTCCTTAAGATAGAGTTCTTGCTTCATATATTAAAAAATGGAAAGGGTATGTACTCTCTCTCTTTCTCTCTTCCCTCTCTCTTGTTTTCTCTCTCCTCCTCCCTCCCTTTAACATCATTGCTGGTCTATTACCCTCTCCAGTAGACACCGCTATACGTCGATTTGTACTTTGGCAATACTCTAGCAATCATTCTTACTTCACAAAGATGGAAGTAAAAAGCTGCAAAATTGACAACCTTTTTTCTACATCTGGCATGAAGATGAAGCTTTTCACACAGAAGATGTTGAAGCTAATATTAATATGTCAACATCAATCTTTCAACTTATGTGGTTCATTGAAATTGTGAAAGAATTGAATAGGAATTCTGAAGAAGATTATTTCTATAAGAAAGAAAAAGTTGAGAGTGGAAGTGTTGAAACTTCCAAATTTAGAACCAAAAGTGGGTGGGTTTTGAGATGCAATGCATGGCCTTGTAAGGGAGGGAGATCATTTATTCATATCTCCATCGGTGAAGATAAAACAGGGGTGGAAATGTTTTGGAAAAATGTTGGGAGACTTCAAGGACAACCATGAATACAAAATCTGGTTCTCCTTCTAAATCATCATGTTCTACGCAATTATGAAGCAGCCAAACAAAAGTATGAAGGGAAGCTATGCGG
AGATAGCAAAATCAAAAGAACATCCAAGAACTCTTTCTTCGATACCACAAAACAGAATACAAGCAAATATTGGGTGATTAAGAATCCGGTAGTCTTCAAAGCTGATTTTGATAATCTTTGGGTTGTTCCAAAACTTTTTGAATCTAATAGTTGGGTAAAAATTAATGAAACATTAGAACTCTTTTTCGACTCAAAATTTATCATCAATCAATTATTTGCAAACAATGCATTGATTGACTTGGAACGTGGTCCTTTTGAAGGAGTTTATTGAAGATCCAAGCAAATGGTAAAATTTTGATGTCTTTCATTTAAAGTTTGAAAAGTGGAACAATCTCATCCATGGAAGACCTTTATATTCGAAAGGCTATGGAATAAGGTGGAAATGTTGGCTCCATTCAAGGTTGGTAATGGCAGAAGGGCAGCTTTCTGGACTAATTTCTGGGTTGGAGAATTTCCTTTTAAAATTTAGTTCTGAAATCTATTCAGAATAGCCTAGCTGCCCAACGACTTAGTTGCTGCCCACTGGGTTTGTGTCACAAAATCTTGGTCCATAGTGTTTTAGAGATTGCTGAAAGACGAAGAAATCCAAGAATTCCAATGCCTATTAACCATCTTCGAGAAGATTGACAGAAATGGAGGATAGGAGAGTGTTAGGATCCCACCTAACAAAGATTATCTCAAGAAGAAAATATGAACTCAAGAACACTTAAAAAACATTCACATATGAAATATATACTAGAAAATCATAAGAACATGTAACAAGTAACCCTAGCCTTTTGAGAGGGCTAGACTCTCCTAAGGACCCCTTACAAGTAAATTTTTCTACAAAATTCTTCACACCTCTCTCCAACCCTTCTCCTACTATTTATAACAAAAAGACCTAACCAACTATTTACTAATATGCCTTCACTCATGACCCTCTGTTAGACCACCCATCTTACATCAATACAGTAGGAGAAAGAATAGGAAAGAGAAGCAGGAGAAGGAGTTAGTTATGTAACCGCATGTGGGGACCACAATGAGTGGAACGACAAAGGAAATAGGGGGGACCACTATAGGTGGGGAGTGAATATAAATAAGGGAGTGAAAGAGGGAACGAGGAGCTTTTCTTTTGGGGGGAGAAAGTTCTTTCTGTATATTCTTGAAAGAGAGGATAGCAGGAGAGGGAAGGGTGTCCATCGAATCGGCTATAAACCAATTCGGTGATATTGTCCTGTAGTTATTTTCGATAATTTCATCTTAGTAATATCAACAGTCCTTGATTTTCGTCTACTGGAAGTGAGAACAACTGGTATCTCATCAACTTGGTATCAGAGCACGTCGATCCGAAAAGAAGTGGGAAAACATGGTTCAAACCAGAAGTGAAGAGAGAAGGGACACGCACGAACAAGAACTCAACAAAATTCCGATGATGGAGGAGAAGCTTTCGGTGATTTCACAGAACATGGAGAACCTTCAGGCCCAAGTGGAGAAAACACATCAGATGGTGATGATATTCATGGAGACGATGGCCAAGGAACGAGTATTAGCGAGCGGTAAAGGAATCGATTCGTCGGCACAAGAAACATACGGGAAAATCGGCGGAGGGAGAGAGTTCGGTAAGTAAAGAAACTAAAAATGAGACGATGGAGAAGAAGGGTGATGGCGACGGGGATAGCAACGATCGAAACAAATTCAAAAAAGTTGAAATGCCTGTATTCAATGGAGATGACTGATGGGATACTAGCTATTCCCACTTTCAATATATAAAGATGAAGGATACTCGATATTACTAAGATGAAAATTGCAGAAATAATACAGAACACTCTCACCGAACTGGTTATGAGCCCATTCGATGGATTCCTTTTCCTCTCCTGCTATCCTATCTTTCAAGAATAAAACAGTAGGATTCTTTACTACCAAAAACAAAAGCTGCCCGTTCTCTCTTTCTCTCCCTTATTTATATTCACTCCCCCCACCTACAGTGGTCCCTCTTGTATCGTGCGTCGCTCCACTCATTT
GTGGTCCCCACCTACAGTTGGTTACATAACTAACTCCCTCTCCTGCTTCTCTTTCCTATTCTTTCTCCTACTGTATTGGTGTATGATGGGTGGTCTAACATTGCATTCCTGGTCCAGTTTCACCTTGTCCTCAAGGTGGAAATCGGGGAAGGATTGCTGAAAGTCATCATAGCTTTCCCATGTGGCTTCATGATGCGGTAGACCCTTCCAGCTCATTAAAACTTCCCATCCTCCTTTCTCGTTTTTCCGATAATCGTAAACCTCCTGAGGCACAACCTTCCACTCGTGATTTGCAGTAAAAAATGGCAAAAGCTCTTCGCTGTTCGCACTCTCCCCAAAGGTTCTTTTCAGCTGTGAAATATGGAACACAGGATGAATTGTTGCTGCCGTTGGTAACTCCAGCCGATATGCCACCTAACCAATTCTCTTCACTATTTTATACGGCCCGAAATATTTTGGTGATAGCTTCTCATTTCGCTTTCTCCGCAGGGATGCCTGCCTGTATGGTCTAATTTTTAAGAACACTTTATCTCCTTCAAATTCGACATGTCTTCTCTTCATGTCAGCATAACTTTTCATCTTGTCCTGGGCTATGCGTAGATGTTCCTTCAAAGCCCCCAATGCTACATCTCTTTCCTTCAGTTGCTCATCTAAGGTTGAGTTGGAAGTTTCCCGATCCCCATAATATACCAGGGCTGGTGGTGTTCTCCCATAAACAGCTTGGAACGGTGTCACGCCCAACGATCTTTGGAATGTGGTGTTATACCAGTATTCAGCCCAAGATAACCATTTCATCCAATCTTTCGGTTTTTCTCCGCAAAAGCATCTTAGATAAATTTCGACCGACCTGTTGACAACCTCTGTCTGTCCGTCTGTCTGGGGGTGGTAGGCTGTGCTTCGGTTTAACTTAGTACCAGCTAAACGAAAAAGTTCTTTCCAAAAATGACTCAGAAAGATTTTGTCCCGGTCAGATACAATTGACTGTGGGAAACCGTGCAGTCTTACAACCTCCTTAACGAACAATTCAGCTACCATCTTCGCGTCAAAGGGATTTTTAAGGCTGAGGAAATGAGCATATTTGCTGAAGCGATCCACCACTACGAATATGACGTCACACCCCATTGATTTAGGCAGTCCTTCAATAAAATCCATGGATATATCCTCCCATACTCTATTTGGTACTTCCAGGGGTGTCAATAATCCTGCCGGGGATAATGCTAAAGTCTTATTCCGCTGGCACGTCATACATTCTTCGCAATATTTTTGCACTTCAGCCTTCATTCCCACCCAAAACAACTCTCCTGTCATTCTTTTATACGTTCTTAAGAACCCTGAGTGACCCCCTAGGACCGAATCATGATATGTGTGCAGAATGGCTGGTACTAATGAAGAGTTCTTCGCGATTACTAATCTCCCTTTGTATCTCAATATGCCTTGTTGCAGAGTGTAGTTCTTCACTTCTTCCTCCCTCTGAATTCTGTTGATTATATCCTTCAAATAGTCATCTTTGTCAACTTCCTCTCTGATTACTTGGATGTCTACCAAAGTGTGAGCGGTTAGTTGGTTAAGATGGGCAGTTGGTGGTACTCGGGAGAGGGCATCTGCTGCCTTGTTTTCCAATCCTGGTTTGTACATCACCTCAAATGAATACCCCAGCAGCTTTGCAATCCACTTTTGATATTGCGGTTGTATGACTCTCTGTTCCAGTAGGAATTTTAGTGATCGTTGATCTGTTTTAACTATGAACGTCCTCCCTAGTAGATAGGGTCGCCAGCGTTGGACTGCCATTACCACTGCCATTAACTCCCTCTCATATACTGGTTTGGACCGGTCTCGCAGCGCTAGTGTGTGGCTATAAAACGCAACTGGTCTCTTGTTCTGCATTAGTACCGCCCCAATCCCATAGCCTGACGCGTCTATCTCTACTTCAAATGGTGCATTAAAATCGGGCAGAGCTAATATAGGCAGGGTCATCATTGCTCGTTGAAGCTTCTCAAACG
CTTCTTGTGCTTCTTCACTCCATTTAAATGATCCAAGCTTCAGTAGCTGAGTCAGGGGTGCCGCTATGGATCCATAATGCTGTACAAAACAGCGGTAATAACCAGTCAGCCCCAGAAACCCTCTAACCTCCCGAACGTTTGTTGGAGTTGGCCATTGCTTGATTGCTCTAATTTTTTCAGGGTCCACTTCTACTCCTTTTCCTGACAATATATGTCCCAAATACTCCACCCTTGAACACGCAAAGCAGCATTTCTTTCGGTTAGCAAATAACTTGTGCTTCCTCAATACTTCTAGCACTAGTTCGATATGTTGACAATGTTCCTCTAGGTTCCTGTTATAAACCAGTATATCATCAAAGAAAACTAAGACGAACTTCCTCAGATACGACCTAAAAATCGAGTTCATTAGTGATTGGAAGGTTGCTGGTGCATTTGTGAGTCCAAACGGCATCACTAAAAACTCGTAATGTCCCTCGTGAGTTCTAAAGGCCGTTTTCTCTATGTCTTGACTACACATTCTCAGTTGATGATATCCTGCTTTCAAGTCTATTTTAGAGAATAAGTTTGCACCATTTAGCTCATCAAACAGTTCTTCCACAACTGGGATAGGGAACTTATCTGGAATCGTTATGTTATTGAGTGCCCTGTAATCCACGCAGAATCGCCAACTTCCATCCTTTTTCTTGACCAATAGTACGGGGCTTGTGCTGGGGCAGATGATTCCTGACGCCATCATTTCGTCCACCAATTTTTCCAGTTCTTATTTCTGCTGAAACGCATACCGATAGGGTCGGACGTTCACCGGGTCTTCCCCTTCCCTTATATGTATATGATGTTTGATATCCCTGCTGGGAGGCAACTCCTTAGGCCAATCGAAAACATCCATATACTGTGTCAGGGTTGTCAATATGCTCTCAGGTTCGTTCTTGTTCCCTTCTGTTTCTATCTTAACTTGGCATGCTTCCAGTGTTCTGCACTCAATCAAATATCCCGTGTCTGTTTCAATCCACGATTTAGTGAGATTTTTTAGGCTCACTTGAGTTTTAGTTAAGCTTGGATCCCCTTTTAGCACTATTTTTTTGTTGTCATGGAAAAAAGGACATTGTTAAGTTCTTCCAGTCCATTTCCGTCACTCCCAAGGAATGTAACCATTGCATCCCCAGTATCAGGTCTACCCCTCCCAGTTCCAGTGGTAGGAAGTTTTTCATGACTGTCCACCCGTTGAGATCCAACTCTACTTTTTCACACACCCCCTTGCCTTTGATGGCTGTTCCTGACCCCAGTATTACTCCATAGTTAGCAGTGTCTTTTGTGGATAGTTTCAGTGTCCTCACTAGCCGGTCGGATATGAAATTGTGGGTGGCTCCGCAGTCCACTAGCACGACAACCTCTTTGCTTTGGATTGTGCCTCTTATCTTCATGGTCCCCGGGTTCGTCAATCCCACTACTGAGTTAATACATAACTCCACTACCTCTCCAGGGTCATTCTGCAGCTCCATGGCTTTCAATTCCTTTAAGTCATACTCGTCTTCCTCGATGATTTCTTCCTCCACGTCGTCCGCTCTGACCACAAACATACGTAACTCACGTATTTCCTTCGCCTTGCATTTGTGCCCGGAGTAATACTTCTCATCGTATTTGAAACAGAGTCCCTTCTCTCTCTTGGCTTGGAATTCTGCGTCTGAGAGCCGTTTAGAGGGTCCTTCTTTCTTAATCTCCTTCGCCGGTGATCCTCTCAGTGTAATTGTTCGTATTGGAAATACTGTGTTCTCCTTATTCCCCTGTTCCTTTAGAATTGAGTTTGTTTTGGCTGTAAAATAGCTACCATTTGGAACTTTCGCTCCAGAATATCCGGGTAGATTTGCTTCCCTCCTCAGGATCTCCCGCTGTTCCACCATTTGTGCATATCTCATCATCTCAGCTAATCCCACGGGATTGCAGAATTCCATCTCCACCTTAATCCACGGTAACAGCCCTCCCATGAATGTCTCTTCCACAAT
CTTTTCCGGAATATCCGATAATGGTGCCACCCACTTATCAAAGAGGTTCCTATATTCCTCCACTTGATTCCTGCTGAATACGCAAGAACCGGCCATAAAGAGAGCCTTCGCGGGATGATCGAAATCGGATCAGTAGTCGTTCCTTTAAATTTAACCAACAGGTAAACTTGTCTCTCTCCTCCTGCGATCGATACCAGTTGAGTGCGGGCCCTTCAAAGCTGGTTGTGGCCACCATAAGTTTTTCGGAATCCGTTAGTTTGTGTATTTGAAAGTATCTCTCTGCGCGGAACAGCCATGAATCTGGGTCATCTCCATCGAATACCGGCATTTCAACTTTTTTGAATTTGTTTCGATCGTTGCTATCCCCGTTGCCATCTTCCTTCTTTTTCGTCGTCCCATTTTTCGTTTCTTTACTTGTCGAACTCTCTCCCTCCGCTGTTTTTCCCGTCCATGTTTCTTGTACCGACGAATCGATTCCTTTTCCGCTTGCTAATACCCATTCCTTGGCCATCGTCTCCATGAATATCATCACCATCTGATGTGTTTTCTCCACTTGGGCCTGCAGGTTTTCCATGTTCTGTGACATCACCGTAAGTTTCTCCTCCATCACCGGAATTTTGTTGAGTTCTTGTTTGTGTGTGCCCCTTCTCTCTTCGCTTCTGGTTTGAACCATGTCTTCCCACTTCTTTTCGGATCGACGAGCTCTGATACCAAGTTGATGGGATACTAGCTATTCCCACTTTCAATATATAAAGATGAAGCATACTCGATATTACTAAGATGAAAATTGCAGAAATAATACAGAACACTCTCACCGAACTGGTTATGAGCCCATTCGATGGATTCCTTTCCCTCTCCTGCTATCCTATCTTTCAAGAATAAAACAGTAGGATTCTTTACTACCAAAAACAAAAGCTGCCCGTTCTCTCTTTCTCTCCCTTATTTATATTCACTCCCCCACCTACAGTGGTCCCTCTTGTATCGTGCGTCGCTCCACTCATTTGTGGTCCCCACCTACAGTTGGTTACATAACTAACTCCCTCTCCTGCTTCTCTTTCCTATTCTTTCTCCTACTGTATTGGTGTATAATGGGTGGTCTAACAATGACCCAAATTCATGGCTATTCCGTGCAGATAGGTACTTCTAAATACACAAACTGACGGATTCTGAAAAACTCACGGTCGCTACAATTAGTTTTGAAGGCCCCGCACTCAACTGGTATCGGTCGCAGGAGGAGAGAGACAAATTTACTTGTTGGTTAAACTTAAAAGAATGATTACTAATCCGATTTCGATCGTCCCGCGAAGGCTCCCTATATGGTCGGTTCTTACGTATTCAGCAGGAATCAAGTGTAGAGGAATAGCGGAATCTATTCGATAAGTGGGTGGCACCATTATCGGACATTCCGGAAAAGATTGTGGAAGAGACGTTCATGGGAGGGCTGTTACCATGGATTAAGGTGGAGATAGAATTCTGCAATTCCGTGGGATTAGCCGAGATGATGAGATACGCGCAAATGGTGGAACAATGGGAGATCCTGAGGAGAGAAACAAATTTCCCAGTTATTCTGGAGCGAAAGTTCCAAATTACACCTATAATACGGCCAAAACAAATTCAGTTATGAAAGAACAAGGGAACAAGGAGAACACAATATTTCCGATACAAACAATCACGTTGAGGGGATCACCGGCAAAGGAGATTAAGAAAGACGGACCATCCAAATGGCTTTCCGACGCAGAATTCCAGGCCAAGAGGGAGAAAGGACTTTGTTTCAAATGCGATGAGAAGTATTACTCCGGGCACAAATGCAGGGTGAAGGAAATACGTGAGTTACATATGTCCGTGGTAAGAGCGGACGACGTGGAGGAAGAAATTATTGAAGAAGACGAGTATGACTTGAAGGAACTGAAAACGATGGAGTTGCAGAATGACCTTGGGGAAGTAAAGGAGTTATGTATTAACTCGGTAGTGGGATTGACGAATCCAGGTACCATGAAGAT
AAGGGGAAAAGTTCAAAGCAAGGAGGTTGTCGTGTTAGTGGATTGCGGAGCCACCCACAATTTCATATCCGACAGGCTAGTGATGACACTGAAATTACCCACAAAGGAGACTTCTAACTATGGGGTAATACTGGGATCAGGAATAGCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTAGAGTTGGATCTCAATGGATGGACAGTCCTTGAAAACTTTCTACCGCTGGAACTGGGAGGGGTAGACGTGATACTTGGGATGCAATGGTTACACTCATTGGGAGTGACTGAGATGGACTGGAAGAACTTAACCATGTCATTCTTCCATGAGAACAAAAAAATAGTGATAAAAGGGGATCCAAGCTTAACCAAAACTCAAGTGAGCTTGAAGAATTTAACTAAATCGTGGACGGAGTCAGACATGGGGTACTTGATTGAGTGCAGAACCCTAGAAGCCCACATAGCCGAGATAGAACCAGAGAACAATAACGTACCTGAGAGCATACTGACAGCCCTGAATCAGTATAATGATGTTTTCGATTGGCCCAAAGAATTGCCTCCAAGAAGGGATATCGAACATCATATACATATAAAGGGAGGGGCAGAACCGGTGAATGTCCGGCCCTATCAGTATGCGTTTCAGCAGAAGGAAGAAATGGAAAAACTGGTGGACGAAATGCTAACCTCAGGAATTATCCGCCCCAGCACAAGCCCCTACTCAAGCACCGTACTATTGGTCAAAAAGAAGGACGGAAGCTGGCGATTCTACGTGGACTACAGGGCACTCAACAACATAACTATTCCAGATAAGTTTCCTATCCCGGTTGTGGAAGAGCTGTTTGACGAGCTAAATGGTGCAAACCTATTCTCTAAAATTGACCTGAAAGCGGGATATCATCAACTTAGAATGTGTAGTCAAGATATAGAGAAGACGGCCTTTAGAACTCATGAAGGACATTATGAGTTTCTGGTGATGCCGTTTGGACTCACAAACGCACCAGCAACTTTCCAATCACTAATGAACTCGATTTTTAGATCGTATTTGAGGAGGTTCGTCTTGGTATTCTTTGACGATATACTGGTTTATAGTAGAAACTTAGAGGAACATTGCCAGCACATTGAGCTAGTTCTGGAAGTATTGAGGAGACATAAGCTGTTTGCTAATCGAAAGAAATGCAGTTTTGCGTACTCAAAGGTGGAGTATTTAGGACACATATTGTCGGGAAAAGGAGTAGAAGTCGACCTGAAAAAATCAGAGCAATCAAACAATGGCCAACTCCAACAAATGTCCAGGAAGTTAGAGGGTTTCTGGGGTTGACTGGTTACTACCGCCATTTTGTACAGCACTATGGGTCCATAGCAGCACCTCTAACTCAACTACTTAAGCTGGGATCATTTAAATGGAATGAGGGAGCACAAGAAGCGTTTGAAAAGCTTCAACGAGCAATGATGACCCTGCCTATACTAGCTCTTCCAGATTTTAACGCACCATTCGAAGTAGAGACAGATGCATTAGGCTATGGGGTAGGGACAGTGCTAATGCAGAACAAGAGACCAATTGCTTTTTATAGCCATACACTAGCCTTGAGAGACCGAGCCAAACCAGTATACGAGAGGGAGTTAATGGCAGTAGTACTAGCAGTCCAACGTTGGCGACCCTATTTGTTAGGAAGAACCTTCATAGTTAAGACAGATCAGCGATCACTTAAGTTCCTGCTGGAACAGAGAGTCATACAACCGCAATATCAGAAGTGGATTGCAAAATTGTTGGGTTATTCATTTGAGGTGGTGTATAAACCGGACTTGGAAAACAAGGCAGCAGATGCCCTTTCACGAGTACCACCAACTGTCCATCTTAACCAACTAACAGCCCCCACCTTGGTAGACATAAAGGTAATCGGAGAGGAGGTTGACAAGGATGACTACTTGAAAGATATAATCAACCAGATGGGAGGAGGAGGTAAAGAATTACACTATGCAACAAGGAATACTGAGATACAAAG
GGAGATTAGTGATTGCGAAGAACTCTTCATTGATATCTGCCATTATGCACACATATCATGACTCGGTCCTAGGAGGTCATTCCGGGTTCTTAAGAACGTATAAGAGGCTGACAGGATAGTTGTTTTGGATAGGGATGAAAGCTGATGTCAAAAAATATTGCGAATAATGTATTACGTGCCAGCGGAATAAGACTTTAGCATTGTCTCCGGCAGGATTATTGACACCTCTAGAGGTACCAAATAGAGTATGGGAGGATATATCCATGGATTTTATTGAAGGACTGCCTAAATGAATGGGGGTTGAAGTAATATTCGTAGTGGTGGACCGCTTCAGTAAATATGCTCATTTCCTCGGCCTTAAACATCCTTTTGACGCTAAGATGGTAGCGGATTTGTTCGTTAAGGAGATTGTAAGACTGCATGGTTTTCCACAGTCAATTGTCTCTGACCGAGACAAAAATTTTCTGAGTCATTTTTGGAAAGAACTGTTTCGTTTAGCGGGTACGAAGTTGAACCGCAGCACAGCATACCATCCCCAGACGGATGGACAGACAGAGGTTGTCAACAGATCTGTAGAAATTTATTTAAGATGCTTCTGTGGGGAAAAACCAAAAGATTGGATGAAATGGTTGTCTTGGGCTGAATACTGGTATAGGCGTGTCACCGTTCCAAGCTGTCTATGGAAGAACACCACCAGCCCTGATATATTATGGGGATCGTGAAACTCCCAACTCAGCTTTAGATGAGCAACTTAAGGAAAGAGATGTAGCCTTGGGTGCTTTGAAGGAACATCTACGCATAGCCCAGGAAAAGATGAAGAGTTATGTCGATATGAAGAGAAGACATGTCGAATTTGAAGAAGGAGATAAGGTGTTCCTAAAGATTAGACCATACAGGCAGGTATCACTGCGGAAAAGGAGAAACGAGAAGCTGTCACCGAAGTATTTCCGGCCGTATCGAATAGTGAAGAGGATTGGTCCAATGGCATATCAGCTGGAGTTACCAGCGGCAGCAACAATTCATCCTGTATTCCATATTTCACAGCTGAAAAGAGCTTTTAGGGAGAGTGTGAACAGCGACGAGCTTTTGCCATTCTTGACTGCAAATCATGAGTGGAAGGCTGTGCCTCAGGAGGTATTCGGTTATCAGAAAAACGAGAAAGGAAGATGGGAAGTCTTAATGAGTTGGAAGGGTCTACCGCAACACGAAGCAACATGGGAAAGCTATGATGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAACTGGACCGGAAATGCAATGTTAGACCACCCATCATACATCAATACAGTAGGAGAAAGAATAGGAAAGAGAAGCATGAGAAGGAGTTAGTTATGTAACCGCATGTGGGGACCACAATGAGTGGAACGACAAAGGAAATAGGGGGGACCACCGGAGGTGGGGAGTGAATATAAATAAGGGAGTGAAAGAGGGAACGGGGAGCTTTTCTTTTGGGGGGAGAAAGTTTTTTCTGTATATTCTTGAAAGAGAGGATAGCAGGAGAGGGAAGGGTGTCCATCGAATCGGCTATAAACCAATTCGGTGATATTGTCCTGTAGTTATTTTCGATAATTTCATCTTAGTAATATCAACAGTCCTTGATTTTCGTCTACTGGAAGTGAGAACAACTGGTATCTCATCACCCTCTTCATTTTCTACTAGTAATCCACTATTTTCCTATCTAGTGTCCTTACGGTGAGTTTGGTCATTAGAGGCCTCGGGTTATTTCTCAGTTAAATTTCTATCGAATCACCTCTCCTTCTTCCCCTTTAGAAATAGATTGCTTTAAAGCCATTTGGAAAACCAGCAGCCCAACAACATTTCTGCCTAGTAACCACACGAACCATATACATTGGTAACCCATCAACGTTTCTTGCACAAAGTGACACATTTTATTGAGAGGAAATATCGCCCTCTTATGTTTCCATCACTGTGATTTTGCTTTTATCCCACAAAGTAAGAATGC
CTCTGAATACGCCAACTGAATCCACTGATTCCCAACCTATATCCATTGAACATCAAATTGTTTTAATTAAATTTGCGTCCAGATTTTCCATTTTTTATTCTTTGATCATAACCACATCCGGGTGATGGCTTTTTATGAACTTCTTGAGAGCAGAATGCTTGGATGGGTCTTTTAGGCCTCTTGTATTCCAAGAAATGATCTTCATGAGAGAAAAATCCGAGCATATGCTTTACTATGACTTACTCAAACCAGCACAATTCCATTGGAATTTCTGTTTGCTTGATTGGGGAGGAGCAAGGATTTTTACTAAAATCACCACAGTCCACTTTGAAAAGGGATATCAGATCCAATGCTTCAATTTCAAAATTTGTTTTGTCTTCTTGAATTTGCGTTAATCTGGAAGAATCTTCACTGCTGACACTATAAGGGGAGATAATGTCTGTAGTTGCAAACTTCTTCACGATTCAACTGATTGGCATGGAGACCATCGGATTAAAGTGATTTTAGTGTTAGGGTTTGTGAAGCTTGAGTAATTAAAACAAATAGGAGTAGGACGGGGTGATACTGACCTGATTTTATCTAAGACTGATTGTAGTTTAGAGGTACAGATTTCCAAAAGATCCGGATTTTCCTCTAATTAAGAAGGCTTGCTTGGCTGATTCTTTTGAGAGTCCTCTTTCAGCTTTAAGGGAGCATATTGCAAGTGATTCAGACTTCAGCAATGCAATCCAAGCATTATCAAATGAAGTTTGTTTTCTAATGAACTGCTTCAACCGAATTCTTGGGGAAATACTTTTCTTCTTTCCTTTTGCTTAACCTGTGTTTGATTGGAGTAACGTCACTGGTGGAAGAAAAGGTGGCGCCAGCTAGATGAGCGGCATTGTTGGAGGGATCATTGAGATTCTGGTGTGAGCTGCCAAGTATGGCCGAGTCACTTGAGGCAGTAAAGAGCATCCCTTCGCCAGAATCTTCAGATATCTCAATGGTTGCTGGAGTGAAACCACACAGATTCTCCTTCACTTGAATTTTGGGCTCTGATACATTAAGGAAGTTGATTGTTTCAACTTCAATATTTTCTAAACCTCCAAAATGAGCTCCAAAGTTTTATTATACCAATAATCAAGTGGTAGATTTTTTATTGTAATCCACCCTCCATAGCTTTTTGAATATAAAGGTCTTCCATGGATGAGATTGTTCCACTTTTCAAACTTTAAATGAAAGGCACCAAAATTTTGCCATTTGCCTGGATTTTCAATAAAGTCGACCACATTCCAATTCAATCAAAGCATTCTCTGCAAATAATGGATTGATGATAAATCCAATCTCTAACGGTCTCTCTTCTCATGGATCATCAAAAGTTGACACCCTTGCTTCTTTTCCAATTACTATGTTACCAGGCAAGTCAGTTAACAAAGAAGAAAGACTTTCTCCTTCCTCGTCAGCTAGGTTTAACAAGCTGCAAAAGTCTTGAAGCATTTTATAAGGAAAAAGACATCCTTCAACAATGTTTGGACTACTCTTTTCAATTCTGAGTCTTTTGGAGCTTGTTCATTCACGGTTTAAGAGAAGCATTACCAGGGTCAGCAATCATTCAGGTCTTCATTCACTGATGTTAATCCGTATCTTGTGGAAGTTTGCATCTCTCTAGTCCCAGCTCTTGGGTGCAATACCAGGTTAGTCAATCCTCCATCTAAGCTTCATTCTCTCTTTCTTCGAATTCAGACCAGTCGGCTCAAATGTGTCCTTTATGCAAGGTGCATTGAACTCACAACAAGAGTGCAAGCAAAATACAAGCAAGGAGCAATATGAGAATGCTAATCCAAAGGAAGCTGGCAGTCCCGATGCTCAACGAACTCAAGACTCAGATGTTGAATAAGTTGGAAGTCCTTTGAGTTTTATAAACTTTATCTCCAGCCGTTAAAACAGTCTTGGATGGGGTAGTGCATTGAAATTCCTTTCATGGAATACATGTGAGCTAGGAGTTCATAAAAAAGAATGGT
AGTTAGGAAGCATAAAAAAATAATTACCTTGGTTTTGGTTTTGCTACAAGTAACAAAAGTAAACAGCAAGTATCCAAGCCTATTTTGTTTTGTCCTTAATCGAAAAGGGTCAATTGCCGACCAATGGGACAGTCAAACCAGTTCTTGGTCTCTATATTTCAGAAGATTATTGAAAGATGAGGAAGTCCCCGATTCACAAGCCTTGTTGAGTTCATTAGAAGGAATCATGATAAATTCCCTAAACAGAAGAATTTGGTCCCTTGGATCAGGCGGATCTTTCACCATTAAATCATTGGTCAATCACTTATCAACCGCTTCACCCATTGACATTACAATTTTGAGTTTTTGGAAATCTAAAAGTCCAAAGCGAGTCAATATTTCTGTGTGGATCATGCTGCATGGAAGTTTGAATTGTGCTTCAATATTGCAAAAAAAGCTCCCTTCCCATTGTCTTTCACCCCATATGTGCCCTCTCTATCTTAATCACCAGGAAAACCTTCAGCACCTCTTTTTTGACTGCTTTTATGCTGCAAAATGCTGGAACCAACTGCTACAAACTTTCAATCTTTGTTGGGCTTTTGATGACAACATCAGGATCAACATGCTGTAAATTTTGACGGGGTCCAGCCTTGAAGAAAAGTACTCTACTCTTATGGTATTACGTGGTCAAGGCCTTGCTAGTAGAGTTATGGTTCAAAAGCAATCAGCGTGTATTCCATGACAAGGTTTCAAGTTGGAATGATCGTATTGAGTTTGCTCAATTAAATGCATCATCATGGTGCTCTTTATCCAAGGGTTTCCGGGATTACTCCTTGCAAGAGATTCAATTGAACTGGCAAGCTTCCATTTTCTAAGTGTCGTAAAGAGTCTGTATTCTAGTATTTTGTGTTCTACTTTCTGCTTTTAACTTTCAAGTGCTAGACTTTTGTATCTTTTCAGCTTGATATGGGCTTGAGTTTTTTTTTATGTAACTTGCTCAGCAAGTTATAGCTATGGACATGGTGAAGGCACTAAAGGTGTGTCAACCTAGTTGAGATGTCCTAGTTCGCCTACTGATCCAAAGGTTCTTTGCTATTTGTATATTCCTCATGTGCTTAGAGCTTTGTCTCTTTATTATTTTACCTATCTAATAAAGAGATTAGTATCTTTTTCAAATAAAATAAAACGAGACAAATCAATTGGATTTGAAAATTCCTTAAAATATAGATCATGGGCTTCTAGTAAATTGAAATCCTCAAAATTAAGGAAACAATTGCATTTAACAACATTTCTTAGTTCGATTGTCGTCGTCATAAATCCACACAAATTCCTTTGAACTTGAATTCTTACCTTTGAACAATTGATGAGATTGAGAGTCTCCATAGCAATGCTTTCTAACCCTCCAAAGTAAGCTCCAATGACTTGAAAGATAACTCTCCTCGAAAGATCTAAAGGGTGATTTTTTATATTAATCACCCTCCGTATCCTTTAATGACCATTGGTCTTCCATGCTTGGCCGTATCTCATTTCTCAAATAGCAAATGGTACTTTCCACAGTCAAACCATTTTCCTGGGGTGATGATCAAGTCTTCAAACTGCAAACAATGGATTAATGATAATTTTAACTTGAAAAAAGTATTCCAAGAATTCAGCTATTTTTTTCCAATCATCAAAGGAAAAAAGTCTGGACAGAACCCATAGGTAGCTAAAATCTTCCTTAAGAACGTCCTTGCTTTTTCTGACCCAGTAATTCTGCATCCTCTGTTTACTGGTGGGAAAAGCAGGGGATATTAGTTGGTTGTTTTCCATTAGAATCAGCTCTGTTTTGTTGTTCTAGAAACAGGGTAGACCTCTTTGCCGTGGTTTCTCACCTTTTCCTTATAGCTTGAGCCTGGCAAATAGGTGTATGAGTCTATGTTTCTTGCCAGAGTAATGGAGGAATGCCAATCCAAGTAGTTTTGTTTCGTAAGGAAGCTTCTTAGCATTTTTAAAACTACTGGTCAGCCCTGCTTGTCTTCCCC
TGAGAAAATACGAACATAGGAATGGTAACCAGAAGTTGGTCGAACGACACACCTTAAAACCCAACCTGAACGTGCCTGGAAATTCATCAAACTTATGGCTCCCCTCTTATCCTCACCGTTCTAATAAAGAATCTGTTTTCTAAACCCTGAATCAGGTCGTTGATGGACTCAATGAACCATTTTAACTGAGGCTTCGTGATCTTAAGTATTTTGTTGAAATCTACATCCTCAATGTTGAATAACCCATTTTCGAACCAGATGCAGTTGTAGGAATTAACCACTTTGCTGCTTACCACTTCCATTTGACCCTCCAAGTTGATGAGAATCGTGAAAGAAGAGGTGAAGGATGGACTGCTCTATCGAACAGAAAAGTTTCCGACGAGAAGGGATTTGGGAGGGGGGAGGAGAGAGAGAACAAACCTTTGACCCATAGGCCTTCACTATTTGCAATTGGAGAGTTTTTATCTTTCTAGATTTTTATTTTTTTGAAGGAAACACACTTTTCATTGATTAATAATCTCAATAATTACATGGAAAAATACAGAAAGAAATTCAGTCAATGACCAAAATAAAGGACTACGCTATAAATCAACTGGGCTTAATGCTGGATAGATCCTCTAGCTGTCTGTATATCAGTCTGGCTAACTGTTTTTTTCCCTTTTAGTTTCAACTGGACTTAATGTTAGGAAGATCTTTTATTCGATTCAATTAACTGACTTTATAATAATTTTATTTTAATAAGGAGCCAAAGTTGAGCTTACTATCTGCCGAAGCGAAGGAATCAGCAGCAAATATTGAAAAGCGACTGCAATTGGGTTCAAAGCTCAGTGATGTGGCCACTTGTGAGGAGGATGTTCTTGAACTCTTAAGTCTGTTCAATAAAGAAAATTATATTCTATCAGAGCACAGGGGAAAATATTGTGTAAGTAAAATCTAAACCCATTTCATTTTAGACCTGAATAAAACTTTTCTTTGGGAAAGGCACAAAGTTGATGGAATTATTTTTCCTGCTCAATTAAATTTGCATTTGGGTTAAATTAATTTTGTTTCTTCTGATTCATTTACTAAACTGTATTTCCACTGTGTGTATTGTTTTGGATTAAGAAGCTTTCTTCTCTTAGTCCAATAGTACTTGATATGAAGCTTTACGCCCTTGTTACCCATGGATTATTAGTTTATGATTTTAGAGAATATTGCAAAAATCATCCATAAAGCAAAAAGATGGGATAGTAAAAAAAACCTCTATACACCCCCTTTTCCATTCACCATCTTTAACCATGTTTTGATGCAAGGCCTCACCAACTCCTGTCACAGAAAGGACGCACAAGAGCACACCCAAGTAATAGGGGAAATATATCTATTGAATTTCATCACAAAAATACATCAATAACCTTAGCCTAATAAGAGGGCTTTACTCTCCACAATGAACTATTGTTCATCCACATAAAATACACATAAACCTAGATAACTAACCCTAGCACAAAACACCCAATATAGACTAACATCGTTCATCCACATAAAACACACATAAACCTTGATAACTAACCCTAGCACAAAACACCCAAATATATACTAACATTTCCTAAGAACATAGGACCACAACAACTACCCTTTTGTCGGGGTGAAAACCCCTTCCTACCCCTCCTCCTATATACTTGCTTAATAGGTGGCCTTGCAATACCCTAATGGAGAAGAGTTACCTTATCCTCAAGGTGAAAATTAGGGAACTGGGCCTTAAGGACAGCGTACGATTCCCAACTTTGGACGGGAAGGATCAGACTCACAACAATAAGGGCGATAGAGTGGCCATGGTGGAGAACATGGTGAGGCATTTGGCTATCACCAAGGACACCATAACAACTGGTCACACATCCATCCATCTCCACAGCCGCAAGCCTACGCTGTTGCTGCTTCTTCATCGTCATCATCAATGGAAAACCTTCTCTGCGAGTACATGCAAAAGAATGATGCCCTTTTGTAGAGTCGGGCTGCTACCATCC
GCAATCTGAGTTACAGATGGAGCAAATTGCCAGTGACATCTTCGAAAGACTAAGGGGATCCCTCCCTAACAGTACTGAAACGCCAAACCAAGTGGGGGATCGAAAAAGGATCAGTGTCAATCTGTGATGCTAAGAAACGCAAGGAACTTAACCATCCGCGAACTAGACCAAGAATAGTCAGTCTACTTCTTTGCCTAATACCACGAATGACCATTCACACAGTACTGACACTTCTCCTATTGTTTTTTCTTCAACTAATGATCTTTCTTCTTTGTAGAATGACAGTACTGAAAAACAAGAAAGTTGAGACTACCAGAGAAGATCTAAAATGCAGGCAGACGTCTAATTAGGCGTCATCATCTAATCTGTTGCCTCTCTTTTGACCCAGCATAGCCTTACTGCCACATCACTTGCGTTGCACTTCAGTTTGAAAGGTTGCGACCAATCAGGTGTGATGAGGATAAGCATTGAGGTCAATGCGTCTTTCAAAGTTTGGAACACCTAGTTGCATTTCTCATCAAACATATAGGGTTGGTTGGCGCTTAATAGGTTGCTCAATGGCTTAGCGAGCTGAGAGAAACCTCAAGTGAATCTTTTGTAAAAGCCAGCATGTCCCAAAAAGCTTCGTAAACATTTGATATCAGAAGGCGTTGGAAACTTGCTCACCACATTAATCTTCGCATGGTCTACTTCTAAGCCTGCTTGGAGAGCTTGTGCCCTAGCAAAATCCCCTCGATTATCATTAAATTACACTTTTCCTTGTTGTGTATTAAGCATGTCTTCTTCCATCTCTTCAATACCTCTTCCAAGTTGTTCAGACATTCGATATATGATTTCCAAAGATTGAGAAGTCATTCATAAAAATTTCTACTGATCTGTTTAAAAAGTCCGAAAATATTGCCATCATGCACCTTTAAAATGTCCATGGCAGATTACACAATCCAAAAGGCATGTGATGAAAAGCAAACGTGCCGCAAGGACAAGTAAATGTGGTTTTGTGCTAATCTTTGGGTGCAATAGTTATCTGATTATACCCATAATAGCCATTCAGAAAGCAATAATACTTGTTGCCTGCAAGTTGGTCCAGCATATGATCAATAAAAGACAAGGGAAAGTGGTTCTTCTTTGTTGCCACATTCAGCTTGTGGTAATCCGCCACCCAGTCACTGTTCTTGTTGGGATTAATTCATTATTCCATCGTTGCGGTCTTTAATTGATAATTTGTTGGTCGGTTGAGTAGCTTAGAAGTCATCTGTTTCTTCCATTATATTCTTTGTTGCTTTTTTTTTTCCTTGTTAATTTTATTTGAAATCTTAACATAAAATTTACAAGTTAAAGATAGAGCTATTCAACTAATCAATAGGGAATTAGTTTCATAATTTACTTGATTTCAAGATGCAATTGAATTAAATAATAATAACAATAACAAAAAATTAATAATCTTCCAATTATGTGTGGTAAGGATTCTTTGAATAACTTAATTTTTTTTATTTGCAACTGCCCAAACTAATGATAAAAAGAAGATTCTGTCTGATGAAGTCAATTTGCTAGATTAGTCAACATAGCGTTGGGTGGTTTAACAAACCAAAAACAAAATTTGATGAAAGATCCTCTTAAATAGAATTATTTGATGACGTAGACTGTGTAGTTTATAAGCATATGTTAAAAATTCCATTAAATCTTATCAATTATTTTACAAAGATAAATTGTTACATATGAAAGTGAATTGTCACAAAATATGAAGGTATGTGACCAAATGGTTGCAAATCAAAGTTCATGTTTCCATCAAAGCTCATTGACCTCAAAAAGTGATGTTTAAGCCAAACTATTATCACCAAAAGAAAGAAAAAAAAAGCTAGTTCATGGCAGAGTTACTAAACAGTCCATGCCAGAGATCCAAATCCCCAAAATAAAAGGTAGAGGGGTTATCTTCATTTCATTTGAAAGAAACACATCTAAGGCTAGAAAATCATTCTATTTGAAGTGACCAAGTATTATTT
GCTAAGAGTATCACTATATAAACTCAAATCATGCTGGGAATGTTATTTTATTGTGCAGGTAATGCTTAAAGAAAGTGCTTCGCCTGTAGACATGCTGAAGGCAGTATTTCATGTCAATTATTTGCATTGGTTAGAGAGAAACGCTGGAATAACAGCAAGAAGTGCTTCTAATGACTGCAGACCAGGAGGAAGGCTACAAATGTCTTTGGAGTATGTGGAGAGGGAATTCAAGCATGTCAAATATGATGGGGAATTGGCTGGTTGGTCGACTGATGGCCTAATTGCAAGGCCCTTAACTACTAGGATTTGTGAATGTCATGTAACTTAG

mRNA sequence

ATGTATGGGGTACTGCCGTTCTCTTATCAGGCGCCGCCGCCGGAGCCGATTCCATTTCGGCCAGTCTATGTCGATGTCTTAAACTACGTACCAGTCCGCCGTTTTCACCATTGCTTGGATTCTTCTATGCGAAGGTCATGTACAGCACTAAGACCTTCTCTTAGCGTATTTCCTCACTTTCTTAAACCCACAAAACTCTTTCAAGGTTATTCCTCTCCTTGTAATGGAACTAGAATCAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCTGGTGACGGCCATGGGTGTGATGGAAATAACAATGGTGGCTGGAATAATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAGTATGACGGTGATTCTCCCCCATGGTCGGACAATGCCTTCCTTGCTTTCTTTTTTTCCTCTGTTCTGGGTTGTTTCTGCCTCTTTCAATTGGCAGTAGCGCTAGCACGTAACAATATGAACACCGAGTCTATTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTATAGAGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAACGTTTGGCTTCGTTGCAGCGATATATTCACGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGTGTCACCAGCGACTATCTGGAATATTCCCTTTGGCGAGGAGTCCAGGGGATTGCCAGCCAAGTTAGTGGGGTGCTTGCAACTCAGGCACTGCTTTATGCTGTTGGATTGGGGAAAGGAGCTATTCCGACTGCTGCTGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTAAGTAAAATTTTTCTCTCAAAATATGGACGGCACTTTGATGTTCATCCGAAGGGGTGGAGGTTGTTCGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGTTAACTCCCGCATTTCCCCTCCATTTTGTCGTGATCGGTGCTGCTGCTGGGGCCGGACGATCTGCAGCTGCCTTGATTCAGGCTGCTACTAGGAGTTGTTTTTATGCTGGCTTTGCTGCTCAAAGGAATTTTGCCGAGGTGATTGCTAAAGGTGAAGCACAAGGAATGGTGAGCAAGTCTATCGGTATGATGCTTGGCATTACATTGGCCAATCGTATAAGGTCCTCAACATCACTTGCTCTTGGGTGCTTTAGCATAGTGACCTTAATCCACATGTTCTGCAATCTAAAATCATACAAATCCATTCAACTAAGGACATTAAATCCTTATCGTGCAAGTTTGGTCTTCAGTGAATATCTGTTGAGTGGTGAGGTGCCTTCGATTAAGGATGTGAACAATGAAGAACCTCTTTTTCCGGCTGTACCACTTCTTAATAGAAAGCTTGCATGTGATGAGCCAAAGTTGAGCTTACTATCTGCCGAAGCGAAGGAATCAGCAGCAAATATTGAAAAGCGACTGCAATTGGGTTCAAAGCTCAGTGATGTGGCCACTTGTGAGGAGGATGTTCTTGAACTCTTAAGTCTGTTCAATAAAGAAAATTATATTCTATCAGAGCACAGGGGAAAATATTGTGTAATGCTTAAAGAAAGTGCTTCGCCTGTAGACATGCTGAAGGCAGTATTTCATGTCAATTATTTGCATTGGTTAGAGAGAAACGCTGGAATAACAGCAAGAAGTGCTTCTAATGACTGCAGACCAGGAGGAAGGCTACAAATGTCTTTGGAGTATGTGGAGAGGGAATTCAAGCATGTCAAATATGATGGGGAATTGGCTGGTTGGTCGACTGATGGCCTAATTGCAAGGCCCTTAACTACTAGGATTTGTGAATGTCATGTAACTTAG

Coding sequence (CDS)

ATGTATGGGGTACTGCCGTTCTCTTATCAGGCGCCGCCGCCGGAGCCGATTCCATTTCGGCCAGTCTATGTCGATGTCTTAAACTACGTACCAGTCCGCCGTTTTCACCATTGCTTGGATTCTTCTATGCGAAGGTCATGTACAGCACTAAGACCTTCTCTTAGCGTATTTCCTCACTTTCTTAAACCCACAAAACTCTTTCAAGGTTATTCCTCTCCTTGTAATGGAACTAGAATCAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCTGGTGACGGCCATGGGTGTGATGGAAATAACAATGGTGGCTGGAATAATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAGTATGACGGTGATTCTCCCCCATGGTCGGACAATGCCTTCCTTGCTTTCTTTTTTTCCTCTGTTCTGGGTTGTTTCTGCCTCTTTCAATTGGCAGTAGCGCTAGCACGTAACAATATGAACACCGAGTCTATTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTATAGAGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAACGTTTGGCTTCGTTGCAGCGATATATTCACGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGTGTCACCAGCGACTATCTGGAATATTCCCTTTGGCGAGGAGTCCAGGGGATTGCCAGCCAAGTTAGTGGGGTGCTTGCAACTCAGGCACTGCTTTATGCTGTTGGATTGGGGAAAGGAGCTATTCCGACTGCTGCTGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTAAGTAAAATTTTTCTCTCAAAATATGGACGGCACTTTGATGTTCATCCGAAGGGGTGGAGGTTGTTCGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGTTAACTCCCGCATTTCCCCTCCATTTTGTCGTGATCGGTGCTGCTGCTGGGGCCGGACGATCTGCAGCTGCCTTGATTCAGGCTGCTACTAGGAGTTGTTTTTATGCTGGCTTTGCTGCTCAAAGGAATTTTGCCGAGGTGATTGCTAAAGGTGAAGCACAAGGAATGGTGAGCAAGTCTATCGGTATGATGCTTGGCATTACATTGGCCAATCGTATAAGGTCCTCAACATCACTTGCTCTTGGGTGCTTTAGCATAGTGACCTTAATCCACATGTTCTGCAATCTAAAATCATACAAATCCATTCAACTAAGGACATTAAATCCTTATCGTGCAAGTTTGGTCTTCAGTGAATATCTGTTGAGTGGTGAGGTGCCTTCGATTAAGGATGTGAACAATGAAGAACCTCTTTTTCCGGCTGTACCACTTCTTAATAGAAAGCTTGCATGTGATGAGCCAAAGTTGAGCTTACTATCTGCCGAAGCGAAGGAATCAGCAGCAAATATTGAAAAGCGACTGCAATTGGGTTCAAAGCTCAGTGATGTGGCCACTTGTGAGGAGGATGTTCTTGAACTCTTAAGTCTGTTCAATAAAGAAAATTATATTCTATCAGAGCACAGGGGAAAATATTGTGTAATGCTTAAAGAAAGTGCTTCGCCTGTAGACATGCTGAAGGCAGTATTTCATGTCAATTATTTGCATTGGTTAGAGAGAAACGCTGGAATAACAGCAAGAAGTGCTTCTAATGACTGCAGACCAGGAGGAAGGCTACAAATGTCTTTGGAGTATGTGGAGAGGGAATTCAAGCATGTCAAATATGATGGGGAATTGGCTGGTTGGTCGACTGATGGCCTAATTGCAAGGCCCTTAACTACTAGGATTTGTGAATGTCATGTAACTTAG

Protein sequence

MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHFLKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIARPLTTRICECHVT*
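
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence, with the trailing `*` marking the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First six codons and last seven codons of the CDS listed above.
print(translate("ATGTATGGGGTACTGCCG"))
print(translate("TGTGAATGTCATGTAACTTAG"))
```

Passing the full CDS string to `translate` should reproduce the listed protein, including the terminal `*`.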
Homology
BLAST of CsaV3_3G020160 vs. NCBI nr
Match: XP_011651345.1 (protein root UVB sensitive 1, chloroplastic isoform X1 [Cucumis sativus] >KAE8650562.1 hypothetical protein Csa_009558 [Cucumis sativus])

HSP 1 Score: 1243.4 bits (3216), Expect = 0.0e+00
Identity = 612/612 (100.00%), Positives = 612/612 (100.00%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF
Sbjct: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120
           LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG
Sbjct: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120

Query: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180
           DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE
Sbjct: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360
           DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360

Query: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420
           AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN
Sbjct: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420

Query: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480
           PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN
Sbjct: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480

Query: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540
           IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF
Sbjct: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540

Query: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 600
           HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR
Sbjct: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 600

Query: 601 PLTTRICECHVT 613
           PLTTRICECHVT
Sbjct: 601 PLTTRICECHVT 612
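
The percentages in each HSP header are the match count divided by the alignment length (gap columns included), expressed to two decimals. A small sketch of that arithmetic, using counts reported in the HSP headers of this section; the helper name `percent` is hypothetical.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"


# (matches, alignment length) pairs taken from HSP headers in this section.
for matches, length in [(612, 612), (593, 612), (328, 518), (396, 518)]:
    print(f"{matches}/{length} ({percent(matches, length)}%)")
```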

BLAST of CsaV3_3G020160 vs. NCBI nr
Match: XP_031738101.1 (protein root UVB sensitive 1, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 1195.6 bits (3092), Expect = 0.0e+00
Identity = 593/612 (96.90%), Positives = 593/612 (96.90%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF
Sbjct: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120
           LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG
Sbjct: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120

Query: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180
           DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE
Sbjct: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360
           DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQ                   VI
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQ-------------------VI 360

Query: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420
           AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN
Sbjct: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420

Query: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480
           PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN
Sbjct: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480

Query: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540
           IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF
Sbjct: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540

Query: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 600
           HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR
Sbjct: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 593

Query: 601 PLTTRICECHVT 613
           PLTTRICECHVT
Sbjct: 601 PLTTRICECHVT 593

BLAST of CsaV3_3G020160 vs. NCBI nr
Match: XP_008449956.1 (PREDICTED: protein root UVB sensitive 1, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 1179.1 bits (3049), Expect = 0.0e+00
Identity = 583/612 (95.26%), Positives = 593/612 (96.90%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYGVLPFSYQ PPPE IP R VYVDVL+YVPVRRFHHCLDSSMRRSC +LRP LSVFPHF
Sbjct: 1   MYGVLPFSYQ-PPPELIPLRRVYVDVLSYVPVRRFHHCLDSSMRRSCKSLRPPLSVFPHF 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120
           LKP KLF+GYSSPCNGTRIKPALVHSPLLAGDG+GCDGNNNGGWNNSNPFGGFGWWQYD 
Sbjct: 61  LKPAKLFRGYSSPCNGTRIKPALVHSPLLAGDGYGCDGNNNGGWNNSNPFGGFGWWQYDS 120

Query: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180
           DSPPWSDNAFLA FF+SVLGCFCLFQLAVALARN+M TESIWEVKGGKRIRLILDTYRDE
Sbjct: 121 DSPPWSDNAFLALFFTSVLGCFCLFQLAVALARNDMKTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVNVWLRCSDIF RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360
           DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360

Query: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420
           AKGEAQGMVSKSIGMMLGITLAN IRSSTSLALGCFSIVTLIHMF NLKSYKSIQLRTLN
Sbjct: 361 AKGEAQGMVSKSIGMMLGITLANHIRSSTSLALGCFSIVTLIHMFSNLKSYKSIQLRTLN 420

Query: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480
           PYRASLVFSEYL SGEVPSIK+VNNEEPLFPAVPLLN +L CDEPKL LLSAEAKESAAN
Sbjct: 421 PYRASLVFSEYLFSGEVPSIKEVNNEEPLFPAVPLLNTRLGCDEPKLGLLSAEAKESAAN 480

Query: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540
           I++RLQLGSKLSDVATCE DVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF
Sbjct: 481 IDQRLQLGSKLSDVATCEADVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540

Query: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 600
           HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGW TDGLIAR
Sbjct: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWLTDGLIAR 600

Query: 601 PLTTRICECHVT 613
           PLTTRICECHVT
Sbjct: 601 PLTTRICECHVT 611

BLAST of CsaV3_3G020160 vs. NCBI nr
Match: XP_038881395.1 (protein root UVB sensitive 1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1147.5 bits (2967), Expect = 0.0e+00
Identity = 566/611 (92.64%), Positives = 578/611 (94.60%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYG+LPFSYQ  PPEPIP R VY DVLNYVP   FHHC DSS RR+C AL   LSVFPHF
Sbjct: 1   MYGLLPFSYQ--PPEPIPLRRVYADVLNYVPGGHFHHCSDSSKRRACAALTLPLSVFPHF 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120
           LKPT+  QGY SPC GTRIKPALVHSPLLAGDGHGC GNNNGGWNNSNPFGGFGWWQ DG
Sbjct: 61  LKPTEQVQGYFSPCIGTRIKPALVHSPLLAGDGHGCGGNNNGGWNNSNPFGGFGWWQNDG 120

Query: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180
           DSPPWSDNAFLAFFF+SVLGCFCL Q A ALARN MN ES+WEVKGGKRIRLILDT+RDE
Sbjct: 121 DSPPWSDNAFLAFFFTSVLGCFCLLQFAAALARNEMNYESVWEVKGGKRIRLILDTFRDE 180

Query: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVNVW+RCSDIF RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWIRCSDIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360
           DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQA+TRSCFYAGFAAQRNFAEVI
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVI 360

Query: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420
           AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTL+HMFCNLKSYKSIQLRTLN
Sbjct: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLVHMFCNLKSYKSIQLRTLN 420

Query: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480
           PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVP LN +LACDEPKL +LSAEAKESAAN
Sbjct: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPFLNTRLACDEPKLGILSAEAKESAAN 480

Query: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540
           IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF
Sbjct: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540

Query: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 600
           HVNYLHWLERNAGITARSASNDC+PGGRLQMSLEYVEREF HVKYDGELAGW TDGLIAR
Sbjct: 541 HVNYLHWLERNAGITARSASNDCKPGGRLQMSLEYVEREFNHVKYDGELAGWLTDGLIAR 600

Query: 601 PLTTRICECHV 612
           PLT RICECHV
Sbjct: 601 PLTNRICECHV 609

BLAST of CsaV3_3G020160 vs. NCBI nr
Match: XP_023528607.1 (protein root UVB sensitive 1, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1015.4 bits (2624), Expect = 2.1e-292
Identity = 514/610 (84.26%), Positives = 543/610 (89.02%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYG LPFSYQ   PE IP R VYVDVL+YVP   FHH    S R SC A RP L+VFPH 
Sbjct: 25  MYG-LPFSYQL--PEQIPLRRVYVDVLDYVPGGCFHH---YSTRSSCAARRPPLNVFPHL 84

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHS---PLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQ 120
           LKP KL  GY SPC GTRIKP LVHS   P L  DGHGC GNNNGGWN+S  FGGFGWWQ
Sbjct: 85  LKPIKLAHGYFSPCIGTRIKPTLVHSHFLPPLLDDGHGCGGNNNGGWNSSYRFGGFGWWQ 144

Query: 121 YDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTY 180
              +S P   NAFLA   +S++GCFC FQLA ALARN MN+ES+WEV+GGKRIRLILDT+
Sbjct: 145 DGSNSSPRWRNAFLALVLTSIMGCFCHFQLAAALARNGMNSESVWEVRGGKRIRLILDTF 204

Query: 181 RDEFHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240
           RDEF+VATG+PSS LSFSFVN WLRCS+IF RLMLPEGFPDSVTSDYLEYSLWRGVQGIA
Sbjct: 205 RDEFYVATGVPSSPLSFSFVNFWLRCSEIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 264

Query: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWR 300
           SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDV+PKGWR
Sbjct: 265 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVNPKGWR 324

Query: 301 LFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360
           LFADLLENAA+GMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA
Sbjct: 325 LFADLLENAAFGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 384

Query: 361 EVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLR 420
           EVIAKGEAQGMVSKSIGM+LGI LANRIRSSTSLALGCFS+VT+IHMFCNLKSYKSIQLR
Sbjct: 385 EVIAKGEAQGMVSKSIGMLLGIALANRIRSSTSLALGCFSVVTVIHMFCNLKSYKSIQLR 444

Query: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKES 480
           TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVP LN +LACDEPK+ LLS EAKES
Sbjct: 445 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPFLNTRLACDEPKVGLLSTEAKES 504

Query: 481 AANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLK 540
           AA+IEKRLQLGSKLSDVA CEEDVL+LLSL+  ENYILSEHRG+YCVMLKESA P DMLK
Sbjct: 505 AASIEKRLQLGSKLSDVARCEEDVLQLLSLYKNENYILSEHRGRYCVMLKESALPKDMLK 564

Query: 541 AVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGL 600
           A+FHVNYLHWLERNAGI ARSA+NDC+PGGRLQ+SLEYVEREF HVKYDGELAGW TDGL
Sbjct: 565 ALFHVNYLHWLERNAGIEARSAANDCKPGGRLQISLEYVEREFIHVKYDGELAGWLTDGL 624

Query: 601 IARPLTTRIC 608
           IARPL  RIC
Sbjct: 625 IARPLNNRIC 628

BLAST of CsaV3_3G020160 vs. ExPASy Swiss-Prot
Match: Q7X6P3 (Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=RUS1 PE=1 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 2.3e-177
Identity = 328/518 (63.32%), Positives = 396/518 (76.45%), Query Frame = 0

Query: 98  GNNNGGWNNSNPFGGFGWWQYDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALA----- 157
           G +NG  +N N  GG G    D       D  +L F     L CF  F+L+ A A     
Sbjct: 77  GGSNGNNDNGNGGGGGGDGGGDNSDDSSFDLRYLCFLLLG-LSCFFHFRLSAASAIAKDQ 136

Query: 158 ----RNNMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVNVWLRCSDIFT 217
                 +   E++WEV+G KR RL+ D  +DEF         S S +  N+  +C ++ T
Sbjct: 137 NSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSLTPENLLAQCRNLLT 196

Query: 218 RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNW 277
           + +LPEGFP+SVTSDYL+YSLWRGVQGIASQ+SGVLATQ+LLYAVGLGKGAIPTAAA+NW
Sbjct: 197 QFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINW 256

Query: 278 VLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAA 337
           VLKDG GYLSKI LSKYGRHFDVHPKGWRLFADLLENAA+GMEMLTP FP  FV+IGAAA
Sbjct: 257 VLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAA 316

Query: 338 GAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSS 397
           GAGRSAAALIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+G++LGI +AN I +S
Sbjct: 317 GAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTS 376

Query: 398 TSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEP 457
           TSLAL  F +VT IHM+ NLKSY+ IQLRTLNPYRASLVFSEYL+SG+ P IK+VN+EEP
Sbjct: 377 TSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEP 436

Query: 458 LFPAVPLLNRKLACDEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLF 517
           LFP V   N K + ++ +  +LS+EAK +AA+IE+RLQLGSKLSDV   +E+ + L  L+
Sbjct: 437 LFPTVRFSNMK-SPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLY 496

Query: 518 NKENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGR 577
             E YIL+EH+G++CVMLKES++P DML+++F VNYL+WLE+NAGI   S  +DC+PGGR
Sbjct: 497 RNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGR 556

Query: 578 LQMSLEYVEREFKHVKYDGELAGWSTDGLIARPLTTRI 607
           L +SL+YV REF+H K D E  GW T+GLIARPL TRI
Sbjct: 557 LHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRI 592

BLAST of CsaV3_3G020160 vs. ExPASy Swiss-Prot
Match: Q84JB8 (Protein root UVB sensitive 3 OS=Arabidopsis thaliana OX=3702 GN=RUS3 PE=2 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 3.1e-44
Identity = 133/439 (30.30%), Positives = 221/439 (50.34%), Query Frame = 0

Query: 181 FHVATGMPSSSLSFS-----FVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQG 240
           F  AT   SSSLS       F +VW R    F    +PEGFP SVT DY+ + LW  +QG
Sbjct: 26  FKTATITASSSLSIQRSANRFNHVWRRVLQAF----VPEGFPGSVTPDYVGFQLWDTLQG 85

Query: 241 IASQVSGVLATQALLYAVGLG-KGAIPTAAAVNWVLKDGFGYLSKIFLSKY-GRHFDVHP 300
           +++    +L+TQALL A+G+G K A    A   W L+D  G L  I  + Y G + D + 
Sbjct: 86  LSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNA 145

Query: 301 KGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQ 360
           K WRL ADL+ +    M++L+P FP  F+V+       RS   +   ATR+     FA Q
Sbjct: 146 KMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQ 205

Query: 361 RNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALG-CFSIVTLIHMFCNLKSYK 420
            N A++ AK  +Q  ++  +GM LG+ LA R  S   +A+   F  +T+ HM+ N ++ +
Sbjct: 206 DNAADISAKEGSQETMATMMGMSLGMLLA-RFTSGNPMAIWLSFLSLTVFHMYANYRAVR 265

Query: 421 SIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSA 480
            + L +LN  R+S++ + ++ +G+V S + V++ E + P                SL S 
Sbjct: 266 CLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPLW------------ATSLRST 325

Query: 481 EAKESAANIEKRLQLGSKLSDVATCEEDVLELL-----SLFNKENYILSEHRGKYCVMLK 540
            +K     + KR+QLG ++S +     D+L+LL     S +    Y+L+  +G   V+L 
Sbjct: 326 NSKP----LHKRVQLGVRVSSLPRL--DMLQLLNGVGASSYKNAKYLLAHIKGNVSVILH 385

Query: 541 ESASPVDMLKAVFHVNYL-HWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYD 600
           + + P D+LK+  H   L + +E++    +   +              ++++ +  + + 
Sbjct: 386 KDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEA--------------WIDKHYDELLHK 427

Query: 601 GELAGWSTDGLIARPLTTR 606
               GW T+ L++  +T R
Sbjct: 446 LRSGGWKTERLLSPSITWR 427
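
Each HSP in these reports opens with two summary lines giving the bit score, raw score, E-value, and the identity and positive counts. As a minimal sketch, assuming the two-line layout shown here, the values can be extracted and the reported percentage re-derived from the counts. The function name `parse_hsp_header` and the returned dictionary keys are hypothetical, chosen for illustration only.

```python
import re

# Patterns for the two HSP summary lines as they appear in this report.
# The identity pattern accepts both "Positives" and the misspelling "Postives".
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos(?:i)?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_header(score_line, identity_line):
    """Return the numeric fields of one HSP header (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not look like an HSP summary line")
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "identical": int(i.group(1)),
        "aligned": int(i.group(2)),
        "identity_pct": float(i.group(3)),
        "positive": int(i.group(4)),
        "positive_pct": float(i.group(6)),
    }


hsp = parse_hsp_header(
    "HSP 1 Score: 181.4 bits (459), Expect = 3.1e-44",
    "Identity = 133/439 (30.30%), Postives = 221/439 (50.34%), Query Frame = 0",
)
# Percentage recomputed from the counts, for comparison with the reported value.
recomputed = 100.0 * hsp["identical"] / hsp["aligned"]
```

Comparing `recomputed` with `hsp["identity_pct"]` is a quick consistency check when copying values out of a report by hand.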

BLAST of CsaV3_3G020160 vs. ExPASy Swiss-Prot
Match: Q93YU2 (Protein root UVB sensitive 6 OS=Arabidopsis thaliana OX=3702 GN=RUS6 PE=2 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 6.0e-40
Identity = 120/436 (27.52%), Positives = 198/436 (45.41%), Query Frame = 0

Query: 184 ATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 243
           A  + S    F  V  +LR        ++PEGFP SV   Y+ Y  WR ++       GV
Sbjct: 91  AISLESPQTPFDEVGSFLR------SYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGV 150

Query: 244 LATQALLYAVGLGKGAIPTAA-AVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADL 303
             TQ LL +VG  + +  +AA A+NW+LKDG G + K+  ++ G+ FD   K  R   DL
Sbjct: 151 FTTQTLLNSVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDL 210

Query: 304 LENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAK 363
           L     G+E+ T A P  F+ +  AA   ++ AA+   +TR+  Y  FA   N  +V AK
Sbjct: 211 LMELGAGVELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 270

Query: 364 GEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPY 423
           GE  G ++  +G    I ++ R  S  +     F +++  ++  + +  +S+ L TLN  
Sbjct: 271 GECVGNIADLMGTGFSILISKRNPSLVT----TFGLLSCGYLMSSYQEVRSVVLHTLNRA 330

Query: 424 RASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAANIE 483
           R ++    +L +G VPS+++ N +E +F   P ++                        +
Sbjct: 331 RFTVAVESFLKTGRVPSLQEGNIQEKIF-TFPWVD------------------------D 390

Query: 484 KRLQLGSKLSDVATCEEDVLELLSLFNKENYIL--SEHRGKYCVMLKESASPVDMLKAVF 543
           + + LG++  D        + +   F+KE Y++  S  +GK   +LK  A+  D+LKA F
Sbjct: 391 RPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAF 450

Query: 544 HVN-YLHWLERNAGITARS--------ASNDCRPGGRLQMSLEYVEREFKHVKYDGELAG 603
           H +  LH++ ++     RS        A  +     R+  S E V   +   K      G
Sbjct: 451 HAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESRIAESCEMVSTSYGVFKSRAAEQG 491

Query: 604 WSTDGLIARPLTTRIC 608
           W     +  P   R+C
Sbjct: 511 WRMSESLLNPGRARLC 491

BLAST of CsaV3_3G020160 vs. ExPASy Swiss-Prot
Match: Q91W34 (RUS family member 1 OS=Mus musculus OX=10090 GN=Rusf1 PE=1 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 2.3e-39
Identity = 108/336 (32.14%), Positives = 172/336 (51.19%), Query Frame = 0

Query: 210 LMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNW 269
           ++LP+GFPDSV+ DYL Y LW  VQ  AS +SG LATQA+L  +G+G   A  +AA   W
Sbjct: 74  VLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTW 133

Query: 270 VLKDGFGYLSKIFLSKY-GRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAA 329
           ++KD  G L +I L+ + G   D + K WRLFAD+L + A  +E++ P +P+ F +  + 
Sbjct: 134 LVKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVST 193

Query: 330 AGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRS 389
           +   +    +   ATR+      A + N A+V AK  +Q  V    G+++ + +   +  
Sbjct: 194 SNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSD 253

Query: 390 STSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEE 449
             SL+LGCF ++T +H++ N ++ +++ L TLN  R  LV   +L  GEV      N  E
Sbjct: 254 CPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQME 313

Query: 450 PLFPAVPLLNRKLACDEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSL 509
           PL+              P LS                L LG  L  + +   ++ +L+  
Sbjct: 314 PLWTGF----------WPSLS----------------LSLGVPLHHLVSSVSELKQLVE- 373

Query: 510 FNKENYIL--SEHRGKYCVMLKESASPVDMLKAVFH 542
            + E Y+L  ++ R +  V L + A P  +L+A  H
Sbjct: 374 GHHEPYLLCWNKSRNQVQVALSQEAGPETVLRAATH 382

BLAST of CsaV3_3G020160 vs. ExPASy Swiss-Prot
Match: Q499P8 (RUS family member 1 OS=Rattus norvegicus OX=10116 GN=Rusf1 PE=2 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.5e-38
Identity = 88/243 (36.21%), Positives = 139/243 (57.20%), Query Frame = 0

Query: 210 LMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNW 269
           ++LP+GFPDSV+ DYL+Y LW  VQ  AS +SG LATQA+L  +G+G   A  +AA   W
Sbjct: 74  VLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTW 133

Query: 270 VLKDGFGYLSKIFLSKY-GRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAA 329
           ++KD  G L +I  + + G   D + K WRLFAD+L + A  +E++ P +P+ F +  + 
Sbjct: 134 LVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVST 193

Query: 330 AGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRS 389
           +   +    +   ATR+      A + N A+V AK  +Q  V    G+++ + +   +  
Sbjct: 194 SNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSD 253

Query: 390 STSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEE 449
             SL+LGCF ++T +H++ N ++ +++ L TLN  R  LV   +L  GEV      N  E
Sbjct: 254 CLSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQME 313

Query: 450 PLF 451
           PL+
Sbjct: 314 PLW 316
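
In the pairwise blocks above, the line between `Query` and `Sbjct` carries the residue letter where the two sequences are identical, a `+` where the substitution scores positively, and a blank otherwise. A minimal sketch of counting both from one such middle line follows; the function name `count_midline` is hypothetical.

```python
def count_midline(midline):
    """Count identities and positives in one BLAST alignment middle line.

    Letters mark identical residues; '+' marks a positively scoring
    substitution. Positives include the identities.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative


# Middle line of the final block above (query PLF aligned to subject PLW).
identities, positives = count_midline("PL+")
```

Summing these counts over every block of an HSP should, in principle, reproduce the totals on its `Identity` line.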

BLAST of CsaV3_3G020160 vs. ExPASy TrEMBL
Match: A0A1S3BP56 (protein root UVB sensitive 1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491686 PE=3 SV=1)

HSP 1 Score: 1179.1 bits (3049), Expect = 0.0e+00
Identity = 583/612 (95.26%), Positives = 593/612 (96.90%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYGVLPFSYQ PPPE IP R VYVDVL+YVPVRRFHHCLDSSMRRSC +LRP LSVFPHF
Sbjct: 1   MYGVLPFSYQ-PPPELIPLRRVYVDVLSYVPVRRFHHCLDSSMRRSCKSLRPPLSVFPHF 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHSPLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQYDG 120
           LKP KLF+GYSSPCNGTRIKPALVHSPLLAGDG+GCDGNNNGGWNNSNPFGGFGWWQYD 
Sbjct: 61  LKPAKLFRGYSSPCNGTRIKPALVHSPLLAGDGYGCDGNNNGGWNNSNPFGGFGWWQYDS 120

Query: 121 DSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTYRDE 180
           DSPPWSDNAFLA FF+SVLGCFCLFQLAVALARN+M TESIWEVKGGKRIRLILDTYRDE
Sbjct: 121 DSPPWSDNAFLALFFTSVLGCFCLFQLAVALARNDMKTESIWEVKGGKRIRLILDTYRDE 180

Query: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240
           FHVATGMPSSSLSFSFVNVWLRCSDIF RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV
Sbjct: 181 FHVATGMPSSSLSFSFVNVWLRCSDIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQV 240

Query: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300
           SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA
Sbjct: 241 SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFA 300

Query: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360
           DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI
Sbjct: 301 DLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 360

Query: 361 AKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLN 420
           AKGEAQGMVSKSIGMMLGITLAN IRSSTSLALGCFSIVTLIHMF NLKSYKSIQLRTLN
Sbjct: 361 AKGEAQGMVSKSIGMMLGITLANHIRSSTSLALGCFSIVTLIHMFSNLKSYKSIQLRTLN 420

Query: 421 PYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAAN 480
           PYRASLVFSEYL SGEVPSIK+VNNEEPLFPAVPLLN +L CDEPKL LLSAEAKESAAN
Sbjct: 421 PYRASLVFSEYLFSGEVPSIKEVNNEEPLFPAVPLLNTRLGCDEPKLGLLSAEAKESAAN 480

Query: 481 IEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540
           I++RLQLGSKLSDVATCE DVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF
Sbjct: 481 IDQRLQLGSKLSDVATCEADVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVF 540

Query: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIAR 600
           HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGW TDGLIAR
Sbjct: 541 HVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWLTDGLIAR 600

Query: 601 PLTTRICECHVT 613
           PLTTRICECHVT
Sbjct: 601 PLTTRICECHVT 611

BLAST of CsaV3_3G020160 vs. ExPASy TrEMBL
Match: A0A6J1F2S0 (protein root UVB sensitive 1, chloroplastic isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441619 PE=3 SV=1)

HSP 1 Score: 1012.3 bits (2616), Expect = 8.6e-292
Identity = 515/610 (84.43%), Positives = 541/610 (88.69%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYG LPFSYQ   PE IP R VYVDVL+YVP   FHH    S R SC A RP L+VFP  
Sbjct: 1   MYG-LPFSYQL--PEQIPLRRVYVDVLDYVPGGCFHH---YSTRSSCAARRPPLNVFPDL 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHS---PLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQ 120
           LKP KL QG  SPC GTRIKP LVHS   P L  DGHGC GNNNGGWN+S  FGGFGWW 
Sbjct: 61  LKPIKLAQGCFSPCIGTRIKPTLVHSHLLPPLLDDGHGCGGNNNGGWNSSYRFGGFGWWH 120

Query: 121 YDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTY 180
              +S P   NAFLA   +SVLGCFC FQLA ALARN MN+ES+WEV+GGKRIRLILDT+
Sbjct: 121 DGSNSSPGWRNAFLALVLTSVLGCFCHFQLAAALARNGMNSESVWEVRGGKRIRLILDTF 180

Query: 181 RDEFHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240
           RDEF+VATG+PSS LSFSFVN WLRCS+IF RLMLPEGFPDSVTSDYLEYSLWRGVQGIA
Sbjct: 181 RDEFYVATGVPSSPLSFSFVNFWLRCSEIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240

Query: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWR 300
           SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDV+PKGWR
Sbjct: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVNPKGWR 300

Query: 301 LFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360
           LFADLLENAA+GMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA
Sbjct: 301 LFADLLENAAFGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360

Query: 361 EVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLR 420
           EVIAKGEAQGMVSKSIGM+LGI LANRIRSSTSLALGCFS+VT+IHMFCNLKSYKSIQLR
Sbjct: 361 EVIAKGEAQGMVSKSIGMLLGIALANRIRSSTSLALGCFSVVTIIHMFCNLKSYKSIQLR 420

Query: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKES 480
           TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVP LN +LACDEPK+ LLS EAKES
Sbjct: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPFLNTRLACDEPKVGLLSTEAKES 480

Query: 481 AANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLK 540
           AANIEKRLQLGSKLSDVA CEEDVL+LLSL+  ENYILSEHRG+YCVMLKESA P DMLK
Sbjct: 481 AANIEKRLQLGSKLSDVARCEEDVLQLLSLYKNENYILSEHRGRYCVMLKESALPKDMLK 540

Query: 541 AVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGL 600
           A+FHVNYLHWLERNAGI ARSA+NDC+PGGRLQ+SLEYVEREF HVKYDGELAGW TDGL
Sbjct: 541 ALFHVNYLHWLERNAGIEARSAANDCKPGGRLQISLEYVEREFIHVKYDGELAGWLTDGL 600

Query: 601 IARPLTTRIC 608
           IARPL  RIC
Sbjct: 601 IARPLNNRIC 604

BLAST of CsaV3_3G020160 vs. ExPASy TrEMBL
Match: A0A6J1F1U0 (protein root UVB sensitive 1, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441619 PE=3 SV=1)

HSP 1 Score: 1007.7 bits (2604), Expect = 2.1e-290
Identity = 515/611 (84.29%), Positives = 541/611 (88.54%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYG LPFSYQ   PE IP R VYVDVL+YVP   FHH    S R SC A RP L+VFP  
Sbjct: 1   MYG-LPFSYQL--PEQIPLRRVYVDVLDYVPGGCFHH---YSTRSSCAARRPPLNVFPDL 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHS---PLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQ 120
           LKP KL QG  SPC GTRIKP LVHS   P L  DGHGC GNNNGGWN+S  FGGFGWW 
Sbjct: 61  LKPIKLAQGCFSPCIGTRIKPTLVHSHLLPPLLDDGHGCGGNNNGGWNSSYRFGGFGWWH 120

Query: 121 YDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTY 180
              +S P   NAFLA   +SVLGCFC FQLA ALARN MN+ES+WEV+GGKRIRLILDT+
Sbjct: 121 DGSNSSPGWRNAFLALVLTSVLGCFCHFQLAAALARNGMNSESVWEVRGGKRIRLILDTF 180

Query: 181 RDEFHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240
           RDEF+VATG+PSS LSFSFVN WLRCS+IF RLMLPEGFPDSVTSDYLEYSLWRGVQGIA
Sbjct: 181 RDEFYVATGVPSSPLSFSFVNFWLRCSEIFKRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240

Query: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWR 300
           SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDV+PKGWR
Sbjct: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVNPKGWR 300

Query: 301 LFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360
           LFADLLENAA+GMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA
Sbjct: 301 LFADLLENAAFGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360

Query: 361 EVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLR 420
           EVIAKGEAQGMVSKSIGM+LGI LANRIRSSTSLALGCFS+VT+IHMFCNLKSYKSIQLR
Sbjct: 361 EVIAKGEAQGMVSKSIGMLLGIALANRIRSSTSLALGCFSVVTIIHMFCNLKSYKSIQLR 420

Query: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACD-EPKLSLLSAEAKE 480
           TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVP LN +LACD EPK+ LLS EAKE
Sbjct: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPFLNTRLACDKEPKVGLLSTEAKE 480

Query: 481 SAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDML 540
           SAANIEKRLQLGSKLSDVA CEEDVL+LLSL+  ENYILSEHRG+YCVMLKESA P DML
Sbjct: 481 SAANIEKRLQLGSKLSDVARCEEDVLQLLSLYKNENYILSEHRGRYCVMLKESALPKDML 540

Query: 541 KAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDG 600
           KA+FHVNYLHWLERNAGI ARSA+NDC+PGGRLQ+SLEYVEREF HVKYDGELAGW TDG
Sbjct: 541 KALFHVNYLHWLERNAGIEARSAANDCKPGGRLQISLEYVEREFIHVKYDGELAGWLTDG 600

Query: 601 LIARPLTTRIC 608
           LIARPL  RIC
Sbjct: 601 LIARPLNNRIC 605

BLAST of CsaV3_3G020160 vs. ExPASy TrEMBL
Match: A0A6J1J7M2 (protein root UVB sensitive 1, chloroplastic isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482094 PE=3 SV=1)

HSP 1 Score: 1005.7 bits (2599), Expect = 8.0e-290
Identity = 511/610 (83.77%), Positives = 542/610 (88.85%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYG LPFSYQ   P  IP R VYVDVL+YVP   FHH    S R SC A R  L+VFPH 
Sbjct: 1   MYG-LPFSYQL--PGQIPLRRVYVDVLDYVPGGCFHH---YSTRSSCAARRRPLNVFPHL 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHS---PLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQ 120
           LKP KL QGY SPC GTRIKP LVHS   P L  DGHGC GNNNGGWN+S  FGGFGWWQ
Sbjct: 61  LKPIKLAQGYFSPCVGTRIKPTLVHSHLLPPLLDDGHGCGGNNNGGWNSSYRFGGFGWWQ 120

Query: 121 YDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTY 180
              +S P   NAFLA   +SVLGCFC FQLA ALARN +N+ES+WEV+GGKRIRLILDT+
Sbjct: 121 DGSNSSPGWRNAFLALVLTSVLGCFCHFQLAAALARNGINSESVWEVRGGKRIRLILDTF 180

Query: 181 RDEFHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240
           RDEF+VATG+PSS LSFSFVN WLRCS+IF RLMLPEGFPD+VTSDYLEYSLWRGVQGIA
Sbjct: 181 RDEFYVATGVPSSPLSFSFVNFWLRCSEIFKRLMLPEGFPDTVTSDYLEYSLWRGVQGIA 240

Query: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWR 300
           SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDV+PKGWR
Sbjct: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVNPKGWR 300

Query: 301 LFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360
           LFADLLENAA+GMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA
Sbjct: 301 LFADLLENAAFGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360

Query: 361 EVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLR 420
           EVIAKGEAQGMVSKSIGM+LGI LANRIRSSTSLALGCFS+VTLIHMFCNLKSYKSIQLR
Sbjct: 361 EVIAKGEAQGMVSKSIGMLLGIALANRIRSSTSLALGCFSVVTLIHMFCNLKSYKSIQLR 420

Query: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKES 480
           TLNPYRASLVFSEYLLSGEVPSIK+VN+EEPLFPAVP LN +LACDEPK+ LLS EAKES
Sbjct: 421 TLNPYRASLVFSEYLLSGEVPSIKNVNDEEPLFPAVPFLNARLACDEPKVGLLSTEAKES 480

Query: 481 AANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDMLK 540
           AANIE+RLQLGSKLSDVA CEEDVL+LLSL+  ENYILSEHRG+YCVMLKESA P DMLK
Sbjct: 481 AANIERRLQLGSKLSDVARCEEDVLQLLSLYKNENYILSEHRGRYCVMLKESALPKDMLK 540

Query: 541 AVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDGL 600
           A+FHVNYLHWLERNAGI ARSA++DC+PGGRLQ+SLEYVEREF HVKYDGELAGW TDGL
Sbjct: 541 ALFHVNYLHWLERNAGIEARSAASDCQPGGRLQISLEYVEREFIHVKYDGELAGWLTDGL 600

Query: 601 IARPLTTRIC 608
           IARPL  RIC
Sbjct: 601 IARPLNNRIC 604

BLAST of CsaV3_3G020160 vs. ExPASy TrEMBL
Match: A0A6J1J7Y8 (protein root UVB sensitive 1, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482094 PE=3 SV=1)

HSP 1 Score: 1001.1 bits (2587), Expect = 2.0e-288
Identity = 511/611 (83.63%), Positives = 542/611 (88.71%), Query Frame = 0

Query: 1   MYGVLPFSYQAPPPEPIPFRPVYVDVLNYVPVRRFHHCLDSSMRRSCTALRPSLSVFPHF 60
           MYG LPFSYQ   P  IP R VYVDVL+YVP   FHH    S R SC A R  L+VFPH 
Sbjct: 1   MYG-LPFSYQL--PGQIPLRRVYVDVLDYVPGGCFHH---YSTRSSCAARRRPLNVFPHL 60

Query: 61  LKPTKLFQGYSSPCNGTRIKPALVHS---PLLAGDGHGCDGNNNGGWNNSNPFGGFGWWQ 120
           LKP KL QGY SPC GTRIKP LVHS   P L  DGHGC GNNNGGWN+S  FGGFGWWQ
Sbjct: 61  LKPIKLAQGYFSPCVGTRIKPTLVHSHLLPPLLDDGHGCGGNNNGGWNSSYRFGGFGWWQ 120

Query: 121 YDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALARNNMNTESIWEVKGGKRIRLILDTY 180
              +S P   NAFLA   +SVLGCFC FQLA ALARN +N+ES+WEV+GGKRIRLILDT+
Sbjct: 121 DGSNSSPGWRNAFLALVLTSVLGCFCHFQLAAALARNGINSESVWEVRGGKRIRLILDTF 180

Query: 181 RDEFHVATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIA 240
           RDEF+VATG+PSS LSFSFVN WLRCS+IF RLMLPEGFPD+VTSDYLEYSLWRGVQGIA
Sbjct: 181 RDEFYVATGVPSSPLSFSFVNFWLRCSEIFKRLMLPEGFPDTVTSDYLEYSLWRGVQGIA 240

Query: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWR 300
           SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI LSKYGRHFDV+PKGWR
Sbjct: 241 SQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILLSKYGRHFDVNPKGWR 300

Query: 301 LFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360
           LFADLLENAA+GMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA
Sbjct: 301 LFADLLENAAFGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFA 360

Query: 361 EVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLR 420
           EVIAKGEAQGMVSKSIGM+LGI LANRIRSSTSLALGCFS+VTLIHMFCNLKSYKSIQLR
Sbjct: 361 EVIAKGEAQGMVSKSIGMLLGIALANRIRSSTSLALGCFSVVTLIHMFCNLKSYKSIQLR 420

Query: 421 TLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACD-EPKLSLLSAEAKE 480
           TLNPYRASLVFSEYLLSGEVPSIK+VN+EEPLFPAVP LN +LACD EPK+ LLS EAKE
Sbjct: 421 TLNPYRASLVFSEYLLSGEVPSIKNVNDEEPLFPAVPFLNARLACDKEPKVGLLSTEAKE 480

Query: 481 SAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVMLKESASPVDML 540
           SAANIE+RLQLGSKLSDVA CEEDVL+LLSL+  ENYILSEHRG+YCVMLKESA P DML
Sbjct: 481 SAANIERRLQLGSKLSDVARCEEDVLQLLSLYKNENYILSEHRGRYCVMLKESALPKDML 540

Query: 541 KAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYDGELAGWSTDG 600
           KA+FHVNYLHWLERNAGI ARSA++DC+PGGRLQ+SLEYVEREF HVKYDGELAGW TDG
Sbjct: 541 KALFHVNYLHWLERNAGIEARSAASDCQPGGRLQISLEYVEREFIHVKYDGELAGWLTDG 600

Query: 601 LIARPLTTRIC 608
           LIARPL  RIC
Sbjct: 601 LIARPLNNRIC 605

BLAST of CsaV3_3G020160 vs. TAIR 10
Match: AT3G45890.1 (Protein of unknown function, DUF647 )

HSP 1 Score: 623.6 bits (1607), Expect = 1.7e-178
Identity = 328/518 (63.32%), Positives = 396/518 (76.45%), Query Frame = 0

Query: 98  GNNNGGWNNSNPFGGFGWWQYDGDSPPWSDNAFLAFFFSSVLGCFCLFQLAVALA----- 157
           G +NG  +N N  GG G    D       D  +L F     L CF  F+L+ A A     
Sbjct: 77  GGSNGNNDNGNGGGGGGDGGGDNSDDSSFDLRYLCFLLLG-LSCFFHFRLSAASAIAKDQ 136

Query: 158 ----RNNMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVNVWLRCSDIFT 217
                 +   E++WEV+G KR RL+ D  +DEF         S S +  N+  +C ++ T
Sbjct: 137 NSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSLTPENLLAQCRNLLT 196

Query: 218 RLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNW 277
           + +LPEGFP+SVTSDYL+YSLWRGVQGIASQ+SGVLATQ+LLYAVGLGKGAIPTAAA+NW
Sbjct: 197 QFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINW 256

Query: 278 VLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAA 337
           VLKDG GYLSKI LSKYGRHFDVHPKGWRLFADLLENAA+GMEMLTP FP  FV+IGAAA
Sbjct: 257 VLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAA 316

Query: 338 GAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSS 397
           GAGRSAAALIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+G++LGI +AN I +S
Sbjct: 317 GAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTS 376

Query: 398 TSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEP 457
           TSLAL  F +VT IHM+ NLKSY+ IQLRTLNPYRASLVFSEYL+SG+ P IK+VN+EEP
Sbjct: 377 TSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEP 436

Query: 458 LFPAVPLLNRKLACDEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLF 517
           LFP V   N K + ++ +  +LS+EAK +AA+IE+RLQLGSKLSDV   +E+ + L  L+
Sbjct: 437 LFPTVRFSNMK-SPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLY 496

Query: 518 NKENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGR 577
             E YIL+EH+G++CVMLKES++P DML+++F VNYL+WLE+NAGI   S  +DC+PGGR
Sbjct: 497 RNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGR 556

Query: 578 LQMSLEYVEREFKHVKYDGELAGWSTDGLIARPLTTRI 607
           L +SL+YV REF+H K D E  GW T+GLIARPL TRI
Sbjct: 557 LHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRI 592

BLAST of CsaV3_3G020160 vs. TAIR 10
Match: AT1G13770.1 (Protein of unknown function, DUF647 )

HSP 1 Score: 181.4 bits (459), Expect = 2.2e-45
Identity = 133/439 (30.30%), Positives = 221/439 (50.34%), Query Frame = 0

Query: 181 FHVATGMPSSSLSFS-----FVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQG 240
           F  AT   SSSLS       F +VW R    F    +PEGFP SVT DY+ + LW  +QG
Sbjct: 26  FKTATITASSSLSIQRSANRFNHVWRRVLQAF----VPEGFPGSVTPDYVGFQLWDTLQG 85

Query: 241 IASQVSGVLATQALLYAVGLG-KGAIPTAAAVNWVLKDGFGYLSKIFLSKY-GRHFDVHP 300
           +++    +L+TQALL A+G+G K A    A   W L+D  G L  I  + Y G + D + 
Sbjct: 86  LSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNA 145

Query: 301 KGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQ 360
           K WRL ADL+ +    M++L+P FP  F+V+       RS   +   ATR+     FA Q
Sbjct: 146 KMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQ 205

Query: 361 RNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALG-CFSIVTLIHMFCNLKSYK 420
            N A++ AK  +Q  ++  +GM LG+ LA R  S   +A+   F  +T+ HM+ N ++ +
Sbjct: 206 DNAADISAKEGSQETMATMMGMSLGMLLA-RFTSGNPMAIWLSFLSLTVFHMYANYRAVR 265

Query: 421 SIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSA 480
            + L +LN  R+S++ + ++ +G+V S + V++ E + P                SL S 
Sbjct: 266 CLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPLW------------ATSLRST 325

Query: 481 EAKESAANIEKRLQLGSKLSDVATCEEDVLELL-----SLFNKENYILSEHRGKYCVMLK 540
            +K     + KR+QLG ++S +     D+L+LL     S +    Y+L+  +G   V+L 
Sbjct: 326 NSKP----LHKRVQLGVRVSSLPRL--DMLQLLNGVGASSYKNAKYLLAHIKGNVSVILH 385

Query: 541 ESASPVDMLKAVFHVNYL-HWLERNAGITARSASNDCRPGGRLQMSLEYVEREFKHVKYD 600
           + + P D+LK+  H   L + +E++    +   +              ++++ +  + + 
Sbjct: 386 KDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEA--------------WIDKHYDELLHK 427

Query: 601 GELAGWSTDGLIARPLTTR 606
               GW T+ L++  +T R
Sbjct: 446 LRSGGWKTERLLSPSITWR 427

BLAST of CsaV3_3G020160 vs. TAIR 10
Match: AT5G49820.1 (Protein of unknown function, DUF647 )

HSP 1 Score: 167.2 bits (422), Expect = 4.2e-41
Identity = 120/436 (27.52%), Positives = 198/436 (45.41%), Query Frame = 0

Query: 184 ATGMPSSSLSFSFVNVWLRCSDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGV 243
           A  + S    F  V  +LR        ++PEGFP SV   Y+ Y  WR ++       GV
Sbjct: 91  AISLESPQTPFDEVGSFLR------SYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGV 150

Query: 244 LATQALLYAVGLGKGAIPTAA-AVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADL 303
             TQ LL +VG  + +  +AA A+NW+LKDG G + K+  ++ G+ FD   K  R   DL
Sbjct: 151 FTTQTLLNSVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDL 210

Query: 304 LENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAK 363
           L     G+E+ T A P  F+ +  AA   ++ AA+   +TR+  Y  FA   N  +V AK
Sbjct: 211 LMELGAGVELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 270

Query: 364 GEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPY 423
           GE  G ++  +G    I ++ R  S  +     F +++  ++  + +  +S+ L TLN  
Sbjct: 271 GECVGNIADLMGTGFSILISKRNPSLVT----TFGLLSCGYLMSSYQEVRSVVLHTLNRA 330

Query: 424 RASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKLACDEPKLSLLSAEAKESAANIE 483
           R ++    +L +G VPS+++ N +E +F   P ++                        +
Sbjct: 331 RFTVAVESFLKTGRVPSLQEGNIQEKIF-TFPWVD------------------------D 390

Query: 484 KRLQLGSKLSDVATCEEDVLELLSLFNKENYIL--SEHRGKYCVMLKESASPVDMLKAVF 543
           + + LG++  D        + +   F+KE Y++  S  +GK   +LK  A+  D+LKA F
Sbjct: 391 RPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAF 450

Query: 544 HVN-YLHWLERNAGITARS--------ASNDCRPGGRLQMSLEYVEREFKHVKYDGELAG 603
           H +  LH++ ++     RS        A  +     R+  S E V   +   K      G
Sbjct: 451 HAHVLLHFMNQSKDGNPRSVEQLDPAFAPTEYELESRIAESCEMVSTSYGVFKSRAAEQG 491

Query: 604 WSTDGLIARPLTTRIC 608
           W     +  P   R+C
Sbjct: 511 WRMSESLLNPGRARLC 491

BLAST of CsaV3_3G020160 vs. TAIR 10
Match: AT2G31190.1 (Protein of unknown function, DUF647 )

HSP 1 Score: 151.8 bits (382), Expect = 1.8e-36
Identity = 86/250 (34.40%), Positives = 139/250 (55.60%), Query Frame = 0

Query: 207 FTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAV 266
           F     P G+P SV   YL Y+ +R +Q  +S    VL+TQ+LL+A GL +     A  V
Sbjct: 67  FLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQATVV 126

Query: 267 NWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGA 326
           +W+LKDG  ++ K+  S  G   D  PK WR+ AD+L +   G+E+++P  P  F+ +  
Sbjct: 127 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEMAG 186

Query: 327 AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIR 386
                +  A +   ATR   Y+ FA + N +++ AKGEA   +    G+  GI LA+ I 
Sbjct: 187 LGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLASTIC 246

Query: 387 SSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNE 446
           SS    L   SI++++H++  ++  + + + TLNP R +L+ + +L +G+VPS  D+  +
Sbjct: 247 SSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLRFQ 306

Query: 447 EPL-FPAVPL 456
           E L FP  P+
Sbjct: 307 EDLMFPERPI 315

BLAST of CsaV3_3G020160 vs. TAIR 10
Match: AT2G31190.2 (Protein of unknown function, DUF647 )

HSP 1 Score: 151.8 bits (382), Expect = 1.8e-36
Identity = 86/250 (34.40%), Positives = 139/250 (55.60%), Query Frame = 0

Query: 207 FTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAV 266
           F     P G+P SV   YL Y+ +R +Q  +S    VL+TQ+LL+A GL +     A  V
Sbjct: 66  FLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQATVV 125

Query: 267 NWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGA 326
           +W+LKDG  ++ K+  S  G   D  PK WR+ AD+L +   G+E+++P  P  F+ +  
Sbjct: 126 SWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEMAG 185

Query: 327 AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIR 386
                +  A +   ATR   Y+ FA + N +++ AKGEA   +    G+  GI LA+ I 
Sbjct: 186 LGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLASTIC 245

Query: 387 SSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNE 446
           SS    L   SI++++H++  ++  + + + TLNP R +L+ + +L +G+VPS  D+  +
Sbjct: 246 SSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLRFQ 305

Query: 447 EPL-FPAVPL 456
           E L FP  P+
Sbjct: 306 EDLMFPERPI 314

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
XP_011651345.1  0.0e+00    100.00    protein root UVB sensitive 1, chloroplastic isoform X1 [Cucumis sativus] >KAE865...
XP_031738101.1  0.0e+00    96.90     protein root UVB sensitive 1, chloroplastic isoform X2 [Cucumis sativus]
XP_008449956.1  0.0e+00    95.26     PREDICTED: protein root UVB sensitive 1, chloroplastic isoform X1 [Cucumis melo]
XP_038881395.1  0.0e+00    92.64     protein root UVB sensitive 1, chloroplastic [Benincasa hispida]
XP_023528607.1  2.1e-292   84.26     protein root UVB sensitive 1, chloroplastic isoform X2 [Cucurbita pepo subsp. pe...

Match Name      E-value    Identity  Description
Q7X6P3          2.3e-177   63.32     Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=R...
Q84JB8          3.1e-44    30.30     Protein root UVB sensitive 3 OS=Arabidopsis thaliana OX=3702 GN=RUS3 PE=2 SV=1
Q93YU2          6.0e-40    27.52     Protein root UVB sensitive 6 OS=Arabidopsis thaliana OX=3702 GN=RUS6 PE=2 SV=1
Q91W34          2.3e-39    32.14     RUS family member 1 OS=Mus musculus OX=10090 GN=Rusf1 PE=1 SV=1
Q499P8          1.5e-38    36.21     RUS family member 1 OS=Rattus norvegicus OX=10116 GN=Rusf1 PE=2 SV=1

Match Name      E-value    Identity  Description
A0A1S3BP56      0.0e+00    95.26     protein root UVB sensitive 1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 G...
A0A6J1F2S0      8.6e-292   84.43     protein root UVB sensitive 1, chloroplastic isoform X2 OS=Cucurbita moschata OX=...
A0A6J1F1U0      2.1e-290   84.29     protein root UVB sensitive 1, chloroplastic isoform X1 OS=Cucurbita moschata OX=...
A0A6J1J7M2      8.0e-290   83.77     protein root UVB sensitive 1, chloroplastic isoform X2 OS=Cucurbita maxima OX=36...
A0A6J1J7Y8      2.0e-288   83.63     protein root UVB sensitive 1, chloroplastic isoform X1 OS=Cucurbita maxima OX=36...

Match Name      E-value    Identity  Description
AT3G45890.1     1.7e-178   63.32     Protein of unknown function, DUF647
AT1G13770.1     2.2e-45    30.30     Protein of unknown function, DUF647
AT5G49820.1     4.2e-41    27.52     Protein of unknown function, DUF647
AT2G31190.1     1.8e-36    34.40     Protein of unknown function, DUF647
AT2G31190.2     1.8e-36    34.40     Protein of unknown function, DUF647
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term   IPR Description            Source   Source Term     Source Description            Alignment
IPR006968  Root UVB sensitive family  PFAM     PF04884         DUF647                        coord: 202..433; e-value: 9.1E-73; score: 244.8
IPR006968  Root UVB sensitive family  PANTHER  PTHR12770       RUS1 FAMILY PROTEIN C16ORF58  coord: 157..607
None       No IPR available           PANTHER  PTHR12770:SF22  RUS1 FAMILY PROTEIN C16ORF58  coord: 157..607
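
The `coord:` fields above give the first and last residue of each match on the protein, both inclusive. A minimal sketch for extracting them and computing the number of residues covered follows; the function name `coord_span` is hypothetical, and inclusive 1-based coordinates are assumed.

```python
import re

# Matches the "coord: START..END" field used in the InterPro rows.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for the first coord field in text."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no 'coord: START..END' field found")
    start, end = int(m.group(1)), int(m.group(2))
    # Both ends are assumed inclusive, hence the +1.
    return start, end, end - start + 1


pfam = coord_span("coord: 202..433")      # (202, 433, 232)
panther = coord_span("coord: 157..607")   # (157, 607, 451)
```

Under that assumption the Pfam DUF647 match covers 232 residues and the PANTHER match 451.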

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_3G020160.1  CsaV3_3G020160.1  mRNA