Lsi05G004180 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G004180
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: UPF0420 protein C16orf58 like
Location: chr05 : 5286941 .. 5310989 (+)
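
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this). The helper names `parse_location` and `span_length` are hypothetical and are not part of any database API.

```python
import re

# Pattern for strings such as "chr05 : 5286941 .. 5310989 (+)".
# Whitespace around the separators is treated as optional.
LOCATION_RE = re.compile(
    r"\s*(?P<seqid>\S+?)\s*:\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)\s*"
)

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return (
        match.group("seqid"),
        int(match.group("start")),
        int(match.group("end")),
        match.group("strand"),
    )

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr05 : 5286941 .. 5310989 (+)")
print(seqid, strand, span_length(start, end))  # prints: chr05 + 24049
```

Under the inclusive-coordinate assumption the gene spans 24,049 bp on the plus strand; with half-open coordinates the figure would be one less.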
The following sequences are available for this feature:

Gene sequence (with intron)

CAATTTTACCTTCCAAGTCATAATGGTCCAAGCGACTTGCTTGTTTCGCTTCATATATATAACTCCATCGATCTGCACGTCACGGACTCTCAATCGCAGAGCCACCGGCGTCTTTCTTCATTGCAATGTATGGGCTGCTGCCGTTCTCTTATCAGCCGCCGGAGCCGATTTCATTACGTCGAATCTATGCCGGTGTTTTAAACTATGTACCAGGCGGCCGTTTTCACCATTGCTCGGATTCTTCTATGCGAAGGTCATCCGCAGCACGAAGACCTCCTCTTAGCGTATTTCCCCACTTTCTTAAGTCCACAAAACTCGTCCAAGGTTACTTCTCTCCTTGTATCGGAACTAGAATGAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCTGGTGACGGCCATGGGCATGGTGGAAACAACAATGGTGGTTGGAATAATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAGGATGACGGTGATTCTTCCCCATGGTCGGACAATGCCTTCTTTGCCTTCTTCTTTACCTCCATTTTGGGCTGTTTCTGCCTTTTTCAATTGGCAGCAGCGCTAGCACGTAATGAAATGAACTATGAGTCTGTTTGGGAAGTAAAAGGAGGTAAGCGAATCCGCCTCATTCTCGATACGTTTAGGGATGAGTTCCATGTTGCAACTGGCATGCCGTCGTCTTCGTTATCCTTTTCCTTTGTCAATGTTTGGCTTCGTTGCAGCGATGTATTCAGGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGCGTTACCAGCGACTATCTGGAATATTCTCTTTGGCGAGGAGTGCAGGGGATTGCCAGCCAAGTTAGTGGGGTCCTTGCAACTCAGGTGCCCATTTACGTTCGTCTATGGTTCCTCTTTCCTATTTCTCATCCAACAAAAGAATAATTGACTGCATTTTGCTTGAACCTCTGCTGGAATCCAGGCACTGCTTTATGCTGTTGGATTGGGAAAAGGAGCTATTCCGACTGCTGCTGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTGAGTAAAATTTTATTCTCAAAATATGGACGGCACTTTGACGTTCATCCGAAGGGATGGAGGTTGTTTGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGTTAACTCCCGCATTTCCCCTGCATTTTGTCGTGATCGGTGCTGCTGCTGGGGCTGGACGATCTGCAGCCGCCTTGATTCAGGTTCTTGGTAGTTGATACGTAATTTACACACGATATTGGAAGTTGTGCTTGTGCCTCATTCCCTGTGTTTGTTACCTTATATTTATATATTTAATCTGCTGATTAGTTCTTTCGACTAGGTAGCTTAATTTGATTTAAAAAGTGAAACGCAGAAGTATTTAGGGAAACCAATGGATGGGAAACTGTCTAAGCTTTTGATAGTTTGTGGAGAATTTTGGTTACATGCAACGAGAAGTAGATCAAGATCACAAATGTATTAGAGGGAACCTTTTCTTTAACAATCAAAGTGGAGGGATAGTGGAGAAGAAGTTTGGGTTTCAAATGTTTACGGTCCTTCAAATTATAGAGAAAGAAAGGTGTTTTCGGAAGAACTATAGGGAGTGTCATCGGAGATATCGTGAGAGATAATTGTATAGAAGCCTCAAAAAATAAGTCCAAAATGGATTTTGCCCTACTTTTTGAAACTTAAGAACTTTGCCCTTTTGCTTATGTCATCATCTGCTAAAAACACCAAATTATCCTTAATTTGTTTTGCTCTCTTCCTCTTCCCGATTTTCTCTTTCTCCACTTTTTTTTTTTAATAAATTTTATATATGATTATTATTATTTTTTGGATAGAAACAACTGTTTTCATTGAGAAAAAAATGAAGGAATACAAGAGCATACAAAAAAATCAAGCTCACAGAAAAATCCCACTAGAGAAAGGGTTTTCAACTAAGTAAGATATTGCTTAGGGAATAATAAAAAAATCTTTGAAACCGAAGCCC
AAAGAAAAACGTAAAACCTCACCAAAGACCAAACCTCACAAAGATCCCCATCCACACCTCGAAGCGCTCTATTGTTCCTCTCCTCCCAAAGATTCCACAACAACACGCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACCCCCCCCCCCCCCCCCCAACCAAAGAAAACGACCTTTCTCTTCGAAAGGCGGATGAAGGAGGAACTCCCCTATCATGAACTGAACATCTTTCTGACGAGCAAGCATAAAATCAAACTCCTGAAAGAATCCATTCCACACAGATCTCTCAAACTGACACTCGCAAAGGTGATCCAGGTTTTCCTCTGCCTTCCGACAAAGGATACAACAATAAGGACCTATTAACGTAGGCATCTTTCTCAAAAGCCTATCCATCGTGTTCACATGGCCAAGAAAATAATTTGACTTTCTTTGGAATCTTAATCCTCCATAAAACATAAAAGAGAGACTCAACAAGAGGAGAGAGATCCAATAAAATCTTAAAGAACGACTTACAAGAGAAGCCCTCCAAGGGGTTAGGACTCCACACACGAACATCCTGTCTCCCAAGCTTAAAGTTAAAACCCTCAATCAAGGAGAGAAGAGAGGCCACCTCTGTCATTTTCCTATTGAACAACAACCGACGAAAATCAAAGGAGAAAGAAATAGAGTTCCCCAACTAAACCAAAAAGTTTGGCATGAGAAAATTTTTCAAGGAAGACAAATGATATAACTGAGGAAATGCATAAGAGTGTGGTCCATCCCCCATCCAATGATCTTCCCAAAAATACGTTTCCTTACCATCTCCCACAACACAAAAAACAAGATAGGAAAAGGAAGGGAGCTGAATAGAAATATCATTCCACGGATTTCGGTGCGTGCCTTTATTCCCCTTAGTTGCAAGGGTCACCGAATGTGGGATTTCATTGGAGAGGGAGGTCACCGAAATGGGTTTGTGTTGCATTGGGGGATTTTCGCTCGAATGGGTTTGTCAAATCTGTGGGTTTTTGCTCGAGAGTGAGAGAGAGGAGATCTAGAGAGTGAGAATGTGAGTGAGAGAAG
AATTGCCTTTGCGGCAGAAGTTAGAGAGCGAGAAAATTTTAACCAATCAATAAACACTCACAAAGAAGAAAGGGTTTGTCTGCAAAGAGTGAGAGAAGATCGACAGAGGGGGGCGAGATGAGAGAGCAAAAGCGAGAGGCCCAACTTGCATGTGTGACATTCCGGTTTGGTAATTTCACTCGGATGATGTAAGCAAAAGAGCAAAAATCTTAAACTTTATAAAGTAGGTTAAAAATCATTTCGACCCAAAATTTTGGGTTATCTATATAGTTTTCCCATATCGTGAGGAATTTTTGCATTTCGGTGTAACTTGAATATATATATGTATATAAAGACTTGAACTTTCATTGAGAAAAAAATGAAACAATACTAGGGCATACAAAAAATAAAAGAAAAAAGAAAGAAGGAAAAAATAAGCCCACCAAAAAGAGCCCAACTAAAGGGGGCTCTAATCAAGTGAAGTGAGACCTATTGGACAATTACAAAAGAAAATTTCGTCATTAATGTCTAAAGAGAAACTTTGAACCTAACAAGGGACCAAACATCACTAGGATCCCTCTCCAACTCTCTAAAAGTTCTATTATTCCTTGCACCCTACAAACTCCATAACAAAGCACAAACCTTCACTTGCCACAAAAACCTCTAATTCTCTCGAAAAGGTTGATGGAGAAGGAACTCCTTAATCATCTCCCAGAGCCATATGTATCTGGCAAGTTGAAAGCTAAACGTGTTAGAAAAAAGATTCCATACCATCCAAGCACAGTTGCAACACCACAAATATGATTAAGATTTTCCTTCGCCCTTTGAAAAAGAATGCAACAAAAAGGCTCAACCAAGAAGACTCTGCATTAATTCTTATGGTGATGCCCAATGAGAAACATTAAATCTCACAAGACTCCCTCTCAATCCCTCTAAGGATTCTATTATTTCTCTCCCCTATAAACCCCACAACAAAGCATACACTCTGGCCTGCCATAAAAAAACTTCCCTTCCCCAAATTGGAAGATGGATGAAGAACTCTTCGATCGTATCACTAAAAGCTTTGTGCCAAGCCAATTGAAGGCCAAACACATCAAAGAAGGCAAAGGATGCAACAGAAAGACCCAACCTATCTAATGTAATGAATTTTCCATGGATGACCTGCCAGACAAAGAACTTCACCTTTTTTTTGGTTGAGATTTTCACCTTTCAAAGACTCGAAAAGACGAACTCCCTAGCAGGGAAGGGTCCTCCAATAAATAATGAAAGAAAGAGTTACAAGAAAAGCATTTAGAAGGGTTAGGGCTCCAAAAATGAATATCCCTTCTCCTTTGAATAAGATCGAAATCTCCAATTAAATAAAGAAGAGCCATGGTCGCTGACATTTTTCTTTCTATCAACGAACAACAAAACCCAAACGAAAGAGAGAATGAGCCCCCTAAATGAACAAGGGCAGCAACAACACAATGATTTTTCATAGGGATGAGTGATAATGATTAGTGAAACTCCTGCAGTGAGGTCAATCCCCCCTCCACTTGTCCTCCCAAAAATAAGTATACTTCCTGTCACCCACCACACACCTAATCAGGTCAGAAAAAGAGGGGAGCTCATTGGAAATTTCTTTCCAAGGATTTGTGAATGTGCCTTTAACCCCACCCCACATCTAATCTAATAGATGAGAATCGTACTTGCTCATGTAATCCTATGCTGAAGAGAGAAGGACCGATGTAAGTAGAATGTGGTTAAATTGGGTAGGTAATAGGTTTAAGTTAATTCCCCTTTACCCCCACTTATACTAAAACTCTACTAAATAGGAGTCTTCCCCCCACGTATGGACAAATAATGAATAAGATTTTCCTTCAAGAATTCTCCAAGTTTCGTAGTCTACATCAAATTGGTATCAGAGCGCCTTCGTGAAAGGGCCTGATGGTGGTCAAGAACACCAACAAGCTTCCCAACCTTCAAACTCATGATGCGAAAGCATCCTTCTCCACATCTCCTCAAACCGTGAATCAAAGACT
TGGCAATTTGGAAACTTTAGCGGGTGGTTTATGAGACAATATGATTTTGATGCAAACCTCAATGGCCAGTTGCAACAAGATATTGAGAAACTATCGTTGGAAGTGGAAAAGATGACCAACAATCAATCAAGAACCAACCAAGAAAATCCAAGGATGCAAGAAGTCCACCCAGAAGACAAGTTTTAGAATTTCAACCAACAATCAATTATCAAAATCATTCGGTAGGTATTCTTCCCCAACAACAATACGATGGGATATGATTTGGACTCTTCAAGCGATGAAGGTTGTCCACAATTCCACCAAAATTATTTTTCAAGGTACCTTCCTAACCCTTTGTATCAAGAAAACCATCTTCAAGCCCAAGAACCATACCAAAAACACAAAAATTATAGTCATAATAGCCTTGTTTTCTATCAAGAGGAGAATTATGCCCTCCCAAGACCTCAAGATCAAGCACGTAATAATTCCCATGGCCAAGAAACTTATCAAACTCATCATAATGGTCAACAACATCACCCACTGTTTACACATTATGACCAAGAAGTTGTCCAAGAAACAGATGAGAATCCGGGCTAGGGTGCCTCGACTAATCTCATGGGACAACTTACCTAACCCTACAACATTTGGGTATCAAGGAAACTTATAAGATATTAATTCTTAGGTAGGTGGCCACCATAGATTGAACTCATGACCTTTTAGCCATTTATTGAGACTATGTCTCCTTTTTTACCATTAGGCCAATCCATGATGGTTGAAAACTCAAAATATTAAATCTTAGGTAGGTGGCCACCATGGATTGGGAGAAAGAGCCTTTCGTTATTCTTAGGCACAAGTATCTTCTAATCTTTCTTGAATATTTTTTGTTGGGGGGGGGGAATTTATACCCCTAAACTTTGGGAGTTGTATCAGTTAAATTCTATACTTTCATATTTGTATCATCTGATAGAATTGTGTTCTAAATGTTGGTAATGCATAATTTCACACATGTATAAGACTTCTTTAAATGCACAGAGTAACTTAATATTAGAGAAAAATGCTAGAAGCCTAGAATCAAATAGATAGTAGCGAATTTTCATTAATATGAGTGGGTTTTTTTATCTTCTTGTCTTCCTCTATTTTTTTCTATTTATTTAGGAGAATTTGCACGTTTAATTTTCTTGTTACTTTGATAGGATTTTAGAAGAATATTTTTATTTAGGTAATTTTGAGTTGATTTAAAATGCAATCAAGTGAAAATAGATAAGATTGATTCATTTATGAAAGTTTAGGGTTTAAATTGATACAATTATGAGTTTGGTGTTTTCATTGATATAACCCTCAAAGTTTATAGGTATAAACTGATTTTCCCCTTTTTATTATTATTATTTTATTTTGTCCTTCCCATATTCTACTTTAAAGTTTTCTTTTCATGTGCCCATTTGTAAAGATTTTTCCTTCTGTCTCTGATTCTTTTTTACTAGTAATCTACTATGCGCACTTGGCCTGCTGGATCCTTTATCAAAAAATATATATTAATAGAATACAAATATATAAGTTTCCTCATTCATCTTTCTTGTTTGACATATCCAACAGCATGTGCATCTATGTTTTTTGTAACTTCAACTCTTCAAGTCATTGGTATCTCCCTCAACATCTACATTTAGCTTTTTGATTTCAAGTTGAATATCATATATTGGCATTTTACCTCATTCAAGCTTTTGCCTTTCATATAAGTTCCAAAACTTTTCTGCTTTATGCCATACAATCTGGGTGAAAAAAGGCAAGCTCCAAGATAACATTCATGGGCTTAGTGACTTCACCTTAAATACCTCGTAAAATAAGTTACATCTTTTTTTTAAGAAAAATATTATTATTTTAAGTTTTAATGGAGTTGCACAATTTTTGAAAGTCGAATACAATTTTCTTCTTCCTCAGGCTGCTACTAGGAGTTGTTTTTATGCTGGCTTTGCTGCTCAAAGGAATTTTGCCGAGGTGAAATAGAGTAATCTCTTCTCTGGTTA
GTGTGTGCTAGCAAAGTTACAAACACCATACTTTCTTGTAGTTGTAGCTGCAAGATTCATGTAAGGAAAGAAATAATTACCACCGAGGTTATTAACTCGTGAATCGGAGTGCGTATTGAAATTAAAAATTAGTGTATCGTGTACGGTAAGTTTTGACTTTTAGTTAACATTTGAATATAAATTATTTGTATACATATATAAGCAAGAATTAATCTTGATTCTTGAATTCTGAGTTGTTGATGAAATCTTATAAATATGAAGGGTTTTTAAGAAAAGAAATGTAGTCATTAGTGTCTTAATATTACAGACCTATTGTTCTTTCATCTTCCTTAGAATTAGAGGCTAAGAAGAAGCACAAATACAGATATTAGACATAGAAACGATACAATACAGACACGGCGACATGTCATTTTTTAAAAATCTAGGACACCACACGACAAGGACATGTTTATTAAATTATGCATTTTTAAAAGTATATATCATTTTTATACCAAAAGAAAATTCAAAGTAAATGGGTTGATGCACTTATATGCTTAAAAAACTTTGTTTGATGTATTTCACACTAAAAAATTCCAATGCTTAAAAAACTTATGTGTCTTTCGAGTCTACTCAACAAGTTTTCTATACACATCTAACACATTTGTTGCACTAACGAGTGTCGGATACGTGTTCAACAAGTGTCGAAGTGTCCAAGTATCCAACACGTATCAGACACTGACATGTTAGCCAAACTAAAGTACTCGCTTCTTAGATTAGAAGATTCTTTAATTCTCTCAAGTCCCAGAATTAAGACCCTCTTAGTCCTTTCTCCTTCTTTAAACCTAACCTAGTCTTAAACTCGAGGAATTAACTATTTAGTGACTTCATCTATCCCTTTCATTTATTTCATTCCAAGTCCCAAGATCAACATTTAGGGGTGAGCATCGGTTGGTCGATATTGGTTTTGGGCAAAAACGGGCGCCAGCCACCAACACGTCGGTTTTCGTGGTCGTGGGATGAGTGAGAACACCAATCGGCCAACTAGTCAGGCGGGCGGCGCGGTTTCGTTGGGTTTCTAGTTTTGGAGAATTGAAAAAAAAATGAGAGGAGGGAGAAGTAAAGAAAGAAAAAAATGGAGAATAGAGAAGAAAAATAAAAAAAATAAAGGAGAAGGGAGAAGGAAGAAAATAAGGAAAATTAGAGCACGAAAAAAAAAAGATTGGGAAGAGAAAAAAGGAAGAAGAGAAAGAGAGGGGGAAAGAAAAGAAAATGGAGAAAGAGAAGGAGAAGGGGGAGGGTTTCGTGAGAGGGAGGGGAAAGAAGAGAGAGAAGGAAAAAGATAATGATGATTATTGGATAGGAAAGGAGAGAAGAGAGAGAAGGTAAAGAATTAATGATGACTATTGGTAGCTGGGAAAAGAGATGAGAGATAATAAAATTAATTATTATTTTATAAATACCAAAGTGGTTTTTGTGAAAACCATTAGTTTTGGGCATTAAATGTAATATAATTACACATTATTGACTATTTCTTTTTCCCACCATCCATATGCACATTTTTTTGAAAAAAAGTATGGAGGTTGGTCGGGGCCGGTCAGAATCCCTGGTTTTTCTTGGACGTCCCGTTTTGTAATGGACCAAACCGACCTCTGACCAGTTACATGGAAAAACCAACTGGCTACCTTCGGTTCGGCTGGTTTCGGTCGTTTGGCTCAGTTTTTCGGAATGCCTTGCACACCCCTATCAACATTTAGCCCTTCCACTTGTGCATGTGCCACAACAATCTTCATCTTCTTCAATTTTCCTTTTCATGTTGGATGAACTGTTCTGTTTTCTTTGATATCCGCATGGGCACACCAAAGTAAAAGACATGCATCGCTTGTATTAAATGCATGCCGGCAAATTCATCGCTTGAGATGAGGCGTACGCACAAAGGCTTATGCCTCAAGAGGCGTAAGCCTTGAGGCACAATATTGCCGCCTCATCTCAAGGTTTATGATAGGTTTGGAAAGTGAACTGGT
GATTTAGTTGTGTGATTAAGAGACCTACGGTAGCACAGGATGTGAGATGACATGAGGTATATGTTGAAGTTATCTATGAGGTGAAATGAATGAGAATACACACAAGGTTGTCACCCAAAACACTAGCAGAAGTATACACTATTGCTTCGGCAAGTAATACACTGAAAGTATTCCAAGATCGAACCCACAGGAAAATTTTGCAAACATCAATTATTACAGGCATGCGTGCTACAACAGGTTAGCCTCGATATGCTGGTAACTTAAAATGAATAAAAAGGTGGTGGACTAAGCATTGAAATGAAAGTAGAATAGTAAATGTAAAAGATATGGATGATGGTGAGAAGCAAGACTCAACTCATGACTCACCCCTATGTTGTGTTGGTCACGTTGCTCTAGATGTACCTTGACACATCATGAATCTCCTCATACAATGCTCTTTCGATACATTATGATGCTCAACATGGATCAATATATGTCTATTTGGTTTTTGTTGATAAAATTGGCCCAAACCCTCCTATGTTGGTAAGTGGTTGGATTATGTCTATAATGCAACCTAGTTGTTTTGAGTATTTCCACACACCTTGACCCTCTCGGTGCCTAAAGCATGCTACTATTTTATATCGGTTGATCAGACAAATAATAGTAATTTAAACAAACACTCTCAAAACAACAATTCATTTCATTAAGAAATCAAAGCGCTACTAGATCATAGCCGAAACACAATATAGAGGTTTGCCGCGCTACTAGATCATGGCTAAAACACAATATAGAGGTTTGTCAAGCAAAGTCTTGCTAATAGAACTTAGATCATGATGGTGATGAAAAACATAAAAGAAATAGTAGAAGAAAGGAAACTTAAATACATTGTTCAAGATGTTGAAGATAACTCAAATACAAGAAGAAAAAAATAAAGTAAATGCTTCAAAACTATAGAGAAATTTCTAAGGTTGCGGTAGAACAAGAATGCTGAAGAAAGATGGAGTGTAGCGCCCGTCGGTCCCTAGCAACTCTGGTGGAAAGTTGATGAAAAGTGGTGGATGAAAAGGGTAAACTGGAGAGCAAAGGAGGTGGTGACCTGAGGAGGAAGATGATCTGAGGCCCCCCAAGTCCTCGAAAATCCATTTGAAGAGCAATGTCTTCTCCTATTTATACTACTATGCTCTCTACAAGAAATTGTCATGTGTCTTCTACTTTTTCTCCTCCCACTATCCTTCTGCCAATGCCCTTTTCTACTTTCCACCAAAGATGTAAGTGGTTTTGAAATTACTTTTTACTTTTTCTGATGCAGTGGGTGCAAGGTGGGATTATGCTGGTGAGCAATGTTGGATCAAAGATACTTTTGATTGGTTGTTTAAAGGGATTAAAGGTACTCACCAGTATTTGTAGGAAGATATCTTTAAAGAGCTCCCTTCGTTTGCTCATCTGGTCCGTTGTGTGGTGGAGGAAAGGGAGAGAAACGTATTTCTGAGAAGATCATTGGGTGGGGAAGAGACCTCTATGTGTTTCATTCCCTTATTTGTATCACTTGTCTTCTCTTAAAAACCATTTCGTGGCCGACTTCTTAGTGTGGTTAGGGAGCTCTTATTCCTTTTCTTTTGGGTTCCGTCATTCTCTTTCCGATAGAGAAACGATGGAGGTGATTGCTCTTCTTTCTTTACTCGAGGGTCACCACTTTAGTTTGGGGAGAAGGGATGTCAGATTTTGGAGTTTCAATCCTTTGGAAGGGTTCTCGTGTAAGTCTTTCTTTCAGTGCTTGGTTAATCCTTCCCCTTTAGGCGAGTCTGTCTTTTCGGTGCTTTGGAGGATTAAGGTTCCTAGAAAGGTGAGGTTCTTTTCCTGGCAGGTCCTTCTCAGTCGTACTAATACTTTGGGCAGACTTGTCAGGAAGTTGCCTTCGGTTATTGGGCCTTTTTGTTGTATTCTCTGTTAGAAGGCGGAGGAAGACCTGAACCATATTCTTTGCACTATGATTTTGCGAGTGGTGTCTGAGATTCCTTTCT
CCATACGTTTGGCATGTCGAGTACTTGTCACAGAGACATCAGTGCTATGATCGAGGAGTTCTTCCTTAATCTGCCTTTTGGGGAGAAAGGTCGGTTTCTTTAGCGTGTTGGTGTTTGTGCGATTTTATGGGTTTTGTGGAGAGTGGAGAGGGACTCTAGTTGTCATATGATAAGAGACCTCAAAATCTCCATCCATCTGGATGAAAGAGTCTGAGTTTGAAGAATTAGGGACTTTTGATTACCTTGTTGCATACTCTTGAAAGCACTTACACTTGTAGAATTTGCTTTGTTCAACAATGGTAGAATATTGTTATTTGGTAGAAGTCAACTGAGAGTGAAACAGTGACATGTTGAAAGAACAATAGTTCGAGCAGAAACGAAATTTACCATTTTGGAGATGAATCGGGCAGACTTGCTGAACATCAGCACAAATAGAGGTATATAAGAACTTCTTGTAGGATAGTGGCAGTAAATCAAGTATAAGTGGCAAATTTGAAAAAGAAAAACCAATGGTGGTGATGATGATTGATGAAAACCACACATGGACCACCAAATCTGGAAGCTAGGGGTTGTTGAAGAACGAAGTCAGAACTCAGAAAAGAGATTTCATGAAATTTCACAACTATACACTCTATCAATAATCGAGCTAAAAAAAACAGGGTGTTATGGGTGTGTCAACCTAGTTGAGATAAACGGGTGCACTTGCTGATCCCTAGTTCTTGTTATTTCGTTGTTTTTTTATAATTCAAGCATTAGTCTCTTTTCATTTCATCAATGGAAAGGGTTGTTTCCGTTTTTAAAAAGACATATACCATGGTACTTTTGAATTGGTGCCTGTCACCTTGGTGTTTGGGCTGTATTATGGACTTTGATTGTTTTATTTGGTTTACTTAGTTCTTATTGGATATGATTAGGGTGCTAAGGGGATGTCGACCTAGTTGAGATGTCCGTGTGCACCTACTAATCCTTAGGTTATTACGTACTTGGTTTCCTCTTGTATCTTGAGCATTAGTCTCATTTCATTACCTTAATGAAGAGACTTGTTTCTTGTTTTAAAAAAAAAACTAGTTTAAATATATACTGTGTACGTAGTACTAGTTAAAATTTAATGCAATAAACGTATACAATGTATCCCCCTAGAAAAAAGTTCATTGTAGTACGCAGTACTAGTTAACACTTAATGCAAGTTCATGTATGGTAAGGAAATCAACTACTATATGTTGTATAAATAACATCAAACATTCCTAATTAGTCAATCCAAATCAAAATTGATCAACCATGGCTTGACATCTAAAAGGTGTTCAAAAATCAACCATGACAGATCAAATACATAAAGCATTATGTGAGTATAAGAGAGATCACCCCACATGCACTCAAAAAGATTTGCAACAGTGGATTGATGAAAATTTTCACTCAAAGATTTGTAGCTTGAACTTCTTAAGGTACAATATCGAACACACTCGAGCGATCATTTGAGTACTTTTCGGCTAACTTAGAAAAAGGAGGGGATACCAAACGGCGCAAACTTGTAAAATATCCAGAGATGGAGAAGGTTCTTTTTGAGTGGTTTCTCCAATACCAAGAATGTGTGAACATGTACGGAGAGCTAATTTTGGAGAAGGCAAGAGACACGATGGAACTTCTATACCCTCAACAACCTCTTGAGTTGCGCAAATTTTCTCAAGGTTGGCTTGAGAAGTTCAAGCTAAGACATGGTATCAAGTAATTTTGTCATTTTGGTGGTTCTATAGACATAGAGGATGTGGAGAACAAATTGGTGGCTATAAGGGAAAAAATAAACCAGTATCCTAAGAAGAATGTTTTTAATATGGATGAAACTGGTTGGTTTTATAGGTTGCAAGCTAATCATTCTCTTGCAATAAAACAACTTGAAGGAAGAAAACAAAGCAAAGAAAGGCTTACTATTGTTATTTAATGCAATGAGGACGACTCTGAAAAAATTCCTTTATGGTTTATTGGAAAGTACGCAAAGCCATGTT
GCTTCAAAAATATTAACATGAACGGCTTGAATAGTCAATGTCGTTCCAACGGACGATGGGTGACAGGTTTGCTTTTTGATGAATATGTTAATTGGCTGGACCAGAAAATGCATGGTAGAAGAATTTTTGGTAAATGGTAAATACTTGTCCAGCCCATCCGACAAATGTTGGACTACTTAATATTGTGTTGTATTTTTTGCCACCCAACATGATATCAAAAATCCAATCATGTGATGCAGGGGTAATAAGAACTTTCAAGATGCACTACTGTAGGAGATTTTATTGTAGGGTATTAGAGGGCTATGAGTTGGGAGAATTTGATTCAGAAAAGACTAAGTTTTAGGTGCCATCAATCTTGTAGTCTTAGCTTGGACGTCGGACGATAAATGTTCAGCAAGAGATAATAGCAAATTGTTTTTGACACTGTAAAATTCGTTCGCAGGATGAAGTAGCCTCAATAAATTTGAATGAAGGAACTACTAAAAAGGCATTCATGAACTTGATACGATGATTAATGGTCTTGGGCACCGCAACAAAATGGATGCCAATCACCTGTTGGATTACCCAAGTGAAAATGATACATGTTCAGAGGTCCAAAGTTTAAAAGAAATTATAGCTAGTGATGTTGAAATTCCTATTGAGGATGAAGCTGAAGATGATACAATGCCTATGCAACCAATTACGTGGAAGGAACTAGGAAGCACTCATAGCGGTAACCACTCTTCGCAATTTTTTGTTGTAGTATGAGAAAGCAACACCGGAGCTTTTTTTTGGGGATATAGTAAGAAAAGTTAGAGATGAAATACAACTAGATTTACATTTTAAGAAGAAACAAACAACAATAGAATCATATTTTACTAAGTTGTCCTAATTATATTTGTAGTTTTTTAGAATTATGAATTTATAATATTTGAAGGGACCATAAATTTATATATGCATCTTCCAGAAAATTATTATCTTATTAATTTATCGAAATTATTAATTTTGACCTTGGTCCCAAGTCGGGACCCAAAATAATTATTATTTTAGAGGTTATTAATTTATTGAGTATTAAATTATAGAGGTTCTACTGTATTCCAATATTTTATAACTTGTTCTGCATCCTTGTGTTTTGTATGTGCGGACAAATGAGGGACATATATTAGTTTACCTTTGCTGATGTCTTGCTTAAATTTATTCTATTTAACCTTCACGTGTAGGTGGATGCCTTTTAATTACTTCTACATTTTTGTCACTTCAATCCATCTTCATAACTGTAGTTTACATATTATTTCACCACACAAACCACCCAAAGCACTTATCTCCTTGTTTAAGCTGGGCTTGAGGCATGTGGCCATTTTCTATCTTTTCTTACTCCTGATGCATGTGGAAAATAAGATCTTTTGCTTTTAATAGATACCTTGAATGAATTTGACCCTATCCTTCCTATTCATATTGCTGAAAGAAACAGAAATTGATTCTAGGTGATTGCCAAAGGTGAAGCACAAGGAATGGTGAGCAAGTCTATTGGTATGATGCTTGGCATTACATTGGCTAATCGTATAAGGTCCTCAACATCTCTTGCTCTTGGATGCTTTAGCATAGTGACCTTAATCCACATGTTCTGCAATCTAAAATCATACAAGTCCATTGAACTAAGGACACTAAATCCTTATCGTGCAAGTAAGTTCTCTTTCCTGCATTTGCAATTTAGGGTTGATGAGATGTATGTGTACTGTTTTTATTACATTATCACTATCACAATATATCATGGTGATTGGCTTGCTTGGTAGTAAGTTAGATAATACCCTATTTCACCCACCCCCTTGCACTTCTTGAGGGGACTTTTTGGAATATTTTACATGGCAAACAGAAGACATTTTCTAGGGTTGATGGGGGCAAGGTTGTTAACTTATGAATCAAAACGTTAAATTAAAATTTGATGTGACGTGTATTATAAATAAAAATTATTTGTGTATATATATATACCAGGAACTTGACCTTATGTTGTTAAGGGAGTT
TTATAAATACAAGGGATTTTTAAAAACAAATTTAGTCATTAGTTTGTTTGAAAATGTCTAAATGACCCATTAGCCTTTTCACCTTTTTTTTCTTAAAAGTTAAATTGAGTCTTCAAAGTCTTTCCAGTTGAAAATTTTGAAAAAGCTCTTTGTTCTTGTAGTGCTTATATTCTTGGCTCAAGTGTATTTGTTTTTTTGCATTTCTCTTTTTCTTTCTCTCCAGCAATCACTTCTTTAAATAAGACAGCTGTCTTTATTGATGCTTAATACAACCTGTTTACTTCTTTATTTTCTACCTTTCCTTTGGTATTGTATTTGTTATCTGTTACTAGTACTTCACTCTCTTGTGGAGTTTGTTTCCCTTGAGCATTAGTCTCTTTTCATTTTATCAATGAAAAGTTTTGTCTCTTGTTAAAAAAAAAAAAAATTAAGACTTCAACCAGTTTTATCTGTCAGGTATTCAACCAGTTTCAATGTAGCTTTTGTAGTAATGGTACTTGCTATTGGGGATGGGTCAAAAGGGCTGGGGAAAAGTTGCTGGAAGTCAGATGGGGGTTATGTCCATGGTACGGATTCTTTCAAAAATGGTAGCATCTAAAATATCATCGGTATGCAGGCTAGCCTTCACCAGAAGTTTGGTTGGAGCACCATGAACTCATCATAGAGACGTTAGTGATATGATCGAGGAGTTCCTCCTTAATCCGCCTTTTGGGGAGAAAGGGCCGTTTTTTGTGGTGGGTGGGAGCGTGTGCTATTTTATGGGTTTTATGGGGTGAGCAAAACAGTAGGGTGTTTAGAGGAGTGGATGGGGATCCTTCGGATGTTTGGTCCCTCGTTCATTTCATGTTTTTTTGCGGGCTGCGATCTCGAAGATTTTTTGTAATTATTCTATAGGCATGATCTTGCATAGTTGGAGTCCCTTCTTGTAATGGGAGCTCTCTTTTTTTTGTAGGCTTGTTTTTTTGTATACCCGTGTATTCTTTCATTTTTTCTCAATGAAAACTGTTCTTTTATTAAAAAAAAAATTCAACTCCAGCACTAATTTTAAAAAATGGGTCAATCAAAACCATGTTACCCATGGCTGATGAGGAGGGAAGGTTCATCATGATATTTTGGCCTTAAACTAGTGTCTGAAGAAGTCAATTCTGATGAAAATTTTGGTGAAGATTTTGCAGTATTATTCCATAGTTCGGCTAAGGATAAATCAAGTCAGAGGCTGTGTTAGAAGTTCTCCCAGTCAGCCCCCTCTGAAATCCCCTTTATGTTTTCTTCTTTGATTGAGGAGTGTGGTTTGATACTTGGAAAAGCCTCTCTTCACGCATCTAAAGCTAGTTGTTGACAGTACTAGTTTCTTTTTAGCATTAACGGAAATCTTCAGAGGGATCACCCGTGGACTTGGGGTTAAATTCGAAGCTGTCGCATTGACAAAGCTTCTCCAACAAACCCATTTGGATGCTGTTTTAGTTCAAGAGTCAAATAAGGAGGAAGTGGTAGGTTTTCTTATTAATTCGTAATGGAGTCTAAAGTTTTTGGCTGGGCCTTTTTTGAATCTTGTGGAAAATCAGGAGGCATGTTTTCTTTGTGGGTTGAGAGTAAGCTGACAACCTTGGAATTTCTCAAAGGGTTTTTTTTTCTCCCTATTTCAGTAAAGTTTTCAACTAAAAGCATCAAGCTATTGTGAAGGCCCTTGGTATGTAGGTAGCCCGGGCACTTCACACGTTTTGCAGTTGGGACTTTGTTCACTTATTGGGGCATTGCTGCTATTTCCTTTACTGGCATTTAGCTGGCTAGTCGTTTTGATGAATTTAACAACACTGAAGGTGAGTTAGCGGTTCAAGGACAACCAATGAACAAAAATAGCCCCAACGGCTGTATCAAAGAAAGAAATGAAGTTACCAATGGCTAGAAATATATTGAAGAAAAAGTAACATAAGTAGACTGCATCAAACTTCCTAATGTAATCGCAAAAGAAAAACAAATGACACAGATATTTGCATAAA
GAGTCCTTTTTTTTTACTAATGTATTTTATCGGTAAATGTCTTATCATCATATCTGTGGCATAACCATGTTCCATGTTCTTGTCATTGTTTGATTGTAGTTTATTTAATGTGCTCCCAATTCCTTAAAGAAGTCCTAGCATTAAATGTGTTCTATACGGGTGGATGTCTGGAAGTTTAATCATTTTAATTCCTCTTTTTGGACTGTGCTCAAGTTATTTTCTTTTCTGATTTTTTACATTAAATAATCACTAGGCATTGTTCCATTCTTGTTACTATTTTTTGGCAGCTCTCCCATTCTTGTAAGCACACAAGGGGACTAAAAGATATGACACCCCAAACAGGGAGGGAACTAATTGAAAGCAATAGTATATTATAAGAAGTTGCTTCAGTTGTTAAATAGTAGAAGATAATGGATAATTAATTTTGACGAAAATGCCCTCATTTCACACGCCCTCTCAAAAGTCAAAACCACTACCAAACATTACTTTTGAAAATTTGAAATTTAAACTTTATCTTTTAGTGGTAAATCTTACCTTAAATGTTTAACAGGTTTGGTGTTTAGTGAATATCTGTTGAGTGGTGAGGTGCCTTCTATTAAAGATGTAAACAATGAAGAACCTCTTTTTCCAGCTGTACCATTTCTTAATACAAGGCTTGCTTGTGATGTGAGTGAATTTCTTTTCTATTGCTATTTATGACACTATTCCTTCTTAATTTTTCTTTTTAAATATTAAATGTGTTCATTTTAATAATACGCCAATGTAACGATTGCAATGTTGGGGCTTGGTTGGAGAGAAATCATAGAATCTTCAAGGTTATTTCTAGGGATAGCAAGAAGCGGAGTTATCAAATAAAAGGGGAAAAATATGGAGTTCTCTTTTTAAGTTAATTTTATTGGAGAATTTCTTTTTGTAAACCAAGCTTTTTGGAGCACTCGTGCGCACCCGTGCCTCCAACTTCTCACTAATAGTGATCCTCTTGATAAAACACTTCCCATAGCAATAATATGGAAAGTGCTCTGGTTTGAAGTAGGGATGCCTAATGTTCGGATGTTCACTTGGAAAGGTGGGGACAAGGATGGCTTCCATGTCATCATCCTTGCCCATTTTATTCCCTCATCACTATGAAAATTTCTGCTAAAATTTTTAAGGATCAAGTTCTTCGCAGGGAATTATTTCCCGTTTATTTCTTTTTTTCATAATCAAATAAATTTTTTTAGATTTTTTATCATTAAATAGTTACTTTTCCGTCGAAAATTTTCATATAAATGGTTAATTACACATGGAGAATCTCCATTAATAAAGAAATCTCTTAAACCTCCTAATGGAAACCAAATAATTAACCTATCAAATTATATAACTATCCATGTACATAAAAAGTTGGCTAATAATCTTGTTAAAGACAGTTATCTCAAATAGAGAAAAGTAAAAAGAACGTAATTTATTTTATTTTTATTAGATTGAAGGTGGTTTTTACTTTCCTTGAGAGAAGAAGCTTCAGAGTTTCTTGTTGGAGGGTGATGGTAGCCCTCATTGTGGATTTCTTCAAGTCCCCGGGAGGGTTTTCTTGCTGATTGTTGGACGAGTTTTGGATGTTTAGTTTGATGTTCTAATTTTTATCCTGTAGTATTTTTGTGCTTCTTTCTTTTTGGCTGTTTTGTCTTTTTCCATTTCCATTAGGAACTTGTATCCTCAAAAAAAAAAAAATTCTTTGAAAATTTGTTGTTTTTTTATAAGAAAAAAAGAAAATAAAATAAAAGGAAAGATAGGATGCCAATCAATCAAATCAGAACCAAACCTCCATCAAGTTGAGATTTGGGGACTAATTGCCATCTAACAAAATGATTTATCTTCCCACCAATGCTTTCTTCCCAAAAAGTAGATCTTTCTTTCCAATTCAAACAAATAGTAGGGGGAATAAGGAAAAGGAATATATCATGAGGGGAGAATGGATAGTACCTATTTTCATGAGGTTAATCTTCCACCTCTGGGAAGAATAA
ATCTTTTCCACCGATTTTGCTTCTTATTGATTTTGTCTACAATTGGGCTACCAAGATGAAGCATTTCTTGGATGGCCTCCAAGAGGTAGACCAATGTATAAGGAGGGTGGAGATTCAATCTTACGGACCCTTCATATTTGAATCTCCCTATAACTAATCTCATTCTTAGAATTAACCCTTCGTATTTTTCTACATGTTGGAATAGTTTGTATTTTTTTTTTATTAAAAAAAAAAAATCTGGACTTAATGCTGGATCGATCCTCTAGCTGTTCATATATCAATCCAATTAACTGGATCTTTTTTTTTTTTAAGATCATCTGGACTTAATGCTGGATAGATCCTCTAGCTGTTCATATTTCAATACGATTAACTGACTTTTGCAATATTTTATTTTAATAAGGAGCCAAAATTGGGTTTACTATCTGCTGAAGCAAAGGAATCAGCAGCTAATATTGAAAAGCGACTGCAGTTGGGATCTAAGCTCAGTGATGTGGCCACCTGTGAGGAGGATGTTCTTGAACTCTTAAGTCTGTTTAATAAAGAAAATTACATTCTGTCAGAGCACAGGGGGAAATATTGTGTAAGTAAAATCTAACCCATTTCATTTTGGACCTGCATACTACTTTTCTTTGGGAAGGGCGAAGTTGATGGATTTCTTTTTTTCTACTCTCAATTAAATTTGCATTTAGAGTCAATTTTGTTTCTTCCAGTAGCCATTTATTAACTGTATTTCCAATGTGTGAATTGTTTTGGATTATGGAGCTTGCTTCTCTCAGTCTAATAAATACTTGATATGAAAAGCTTCACAACCTTGTTCCCCTTAGATTAGTAGTGTACTTTAATTTTATCAACCGACACTTGAAACCTGTCTCATCAATTATGTTTGAGATATATAAGTGTAGAAAATAGATTGAATAATTCAAGTTAGGTTTTGAAACTTTAAGCCCTTGTTCCCCTTGGATTAGTAGTGTACTTTAATTTTATCAACCAACATTTGAAACCTGTCTCATCAATTATGTTGGAGATATGTAAGTGTAGAAAATAGATTGAATAATTCAAGTTAAGTTTTGAAACTTTAAGTTCAAATTCTGCTCCATTATTTTCTTCCTCATTTAATATAGATAGTATTTTAAACCATGAGGAAGATAGTGAGTGATTGCTTCGGAAGCAGGATGCTCCGATCTATGAAGGCAACAAGGTTTCTGATGGCAGCTAGGTTTTCTAGTGGTCATTAGGCTAGGGTCTCCGATACCATCTTAGTTTGAAGGAGTAAAATTAAATGTATTCCAATTATTTGAAGAAGAAATGCATGACTCTCTTTCGGATAGCAGCATGTAGGAGAGCCTCTTTTGGACTTTCTTTAGCATGTTTATGCTTCTTAGAGCCACATTCTAGATACAGTAATTGACTTGTTTGACTTTTCATTAAGTAAATGAAAAGAGACTAATGCTCAAAATACACAGATAAAAACCAACTGAAAAAAGAATCCCTCAGTACAAATAATAGACAGTGGGCCTTTGTAATCTCCTGCGATTCAAGATCTCGAATGGTTTCCTTACTGTGGTCCTTTGTTTTTTATGGAAATGGTCTCTCAGATTTGGTGTTTGAGGCGGTCTCAGGCAGCGTCTTTCAGTGGCCTTCTCTTTGCTGTTTGTTATTTAGTGTCTTTTCTCTCACTTGGTGGAGTGTGGTTTCTTCCTCTTGCCCTCCTTGTTCAGCTGTTGCAGAGTTCTGTTTTGTCTAGGGCTTTAAGTTTGTTCTCATTATTCCTTTTGGACTTCTTATTGAGGGATTTTTGTTCAGATTCTGTTAATTTTTTAGGTTGTTTTTTCTGTTCTATCTTTCGAACCCTTGTTAGTTTGTAACCTTTGAGCATTAGTCACTTTTCATCTCTGAAAAGTTCCTTTTCTTGTTCCACACCAAGACATCATTGTTCTGTCGCGGTCTCACATTTTCCAGCAAAGAAAATAGACAATCCCATTCATCTATTTTCTTTTTGA
TGAGACTTCCAAGTTTGGTCTATCTCTAATTCTTCAGATTATATTGGTCTTGGTCCCAAGTCCAAGTCTTAGTTGCCACTTGAAAAACAACTTGTTATTATTGTCTAAACATGTTGGCAGTATCCTTTTGTGTCACTTTCTACATTCTTATTGATGCCCCACATTATATTTAATACAGAGCATATCAATTTGGATGATCCTTTATCTTCTTTTGATAAGAATGATATTTTTCCTCTAATCTCTTAAATTTTTGTTCTATTGATTTTTGTTAAACACTGATTCATGTTTTATGCATGATGGTTATTTAAGTATGGCTTGTTCTTGTTATGACCTTGCTTGAGCAGGATTTATCTAATAGGTTGTACGTATCCTTGAGTTCTAAATAGCATGGTCCTGTTCTTTTTAGTGTGCAGTATAATTTTCCTAGGCAAAGGTTTATTTCATATACCCATTTTGTATTTCAGCATTTATTACCTGTTCCCATCATTGCAGTTTTTAAGGGGGCGTTTGGCCCAGGAGTTGGGAAGTAAGAAGTTGTGAACTCTACTCTTTGTTTGGCCCAAGGAGTTCGTGGATCTTACGACTAAAATATGTCAATTTTATACCTATTATCTCCTTACATTATGGGTCCAAGAGTTCACAACTCCCTAGACTTCACAACTCTACTCCTTGCCCTAAACACCCTTTTTTCTTTCTTTCTTTTTAATTGTGTAATCTCAACAAAATTCTACAAGAAGTTAAAAATGGTTATTCAACTAATCAAGAGGGAAATGGTTGCATCATTTATTTGATTTCAAGATTCAATTGAATTAAATAATAATAATCTTCCAATTCTACCTGGTGAGGATTCTGAGTATTTTCAATTCTAGACCGCATCAACAAGAAAGTTTATATCTTTAGCAAGATCTTTTGTGAGAGCATTTCATGTAGTTGATTATATTTAATTTTTATGTGCTTATCATGTGCTATGCTCTTGAGTATTAAGGATTTGCCTCTTTTTGGCACCCTCATTGCCTGCCCTTATGTTCTTCCGTTGTTTTTAGTTAAAAGTTAGAGGCACTTTAGGAGATATTTGTACAGATGACCTTTTCATAAGCACCTTTATGATCAATGACTCGCCTCCCACAAATATATGAAATCTAAACTTCTGCTTACATTAGAGCGCACTACCGCATTTTATAACCTATTGCGATAGGGATGGTTATGTCTATTATAGTCCTATCGCGATGGTTATCTACATCGCGATCTTCATTCACAGGTACTTTTTAACTCAAACAATTATCACCACAGAAAGGCTAATGCAGGCCAGAGTTATTACAATAAACAGTCCACGTCCTTGTTCATTTGAAGCAAAATAAGGCGCTGTTCATTTTGTGAATCATTTTATTTGAAGTGACTAAGTGTATTTGCTGAGAGTAACATCATGTAAACTCAAAATCTTCTTGGGAATGTTATGGTGCAGGTAATTCTTAAAGAAAGTGCTTCACCAGTAGACATGCTGAAGGCAGTGTTTCATGTCAATTATTTGCACTGGTTAGAGAGAAACGCTGGAATAACAGCAAGAAGTGCTTCTAATGACTGCAGACCAGGAGGAAGGCTGCAAATGTCTTTGGAATATGTGGAGAGGGAATTCAACCATATCAAATATGATGGGGAATTGGCTGGTTGGTTGACTGATGGCCTAATTGCAAGGCCGTTAACTAATAGGATTTGTGAATGTCATGTAGCCACCTAAGGACGAGTTACTACACTACCAAAGCTACCCAGGACATTTCTAATAACACTGGATTTTGAGAAATTGGTATGAAATTCATCTCACTTTAAAAAAATTCAAACAATCAAATAGGATGCATGTGACATACAATATATTTGACAGTCTAGAAATTCACCAAATCCATGATATCATACTCTCATGGATCTACATGTGTTCTACTTTTAAACAAATCATACCTTTGTTCCATAGAAATTTCAGTATATAAATTCACCGAATTCATGATCTT
AAATTAGTATCAAGAATCTAATAAGAGCTTGTTTGTTTCATTGATTTCA

mRNA sequence

CAATTTTACCTTCCAAGTCATAATGGTCCAAGCGACTTGCTTGTTTCGCTTCATATATATAACTCCATCGATCTGCACGTCACGGACTCTCAATCGCAGAGCCACCGGCGTCTTTCTTCATTGCAATGTATGGGCTGCTGCCGTTCTCTTATCAGCCGCCGGAGCCGATTTCATTACGTCGAATCTATGCCGGTGTTTTAAACTATGTACCAGGCGGCCGTTTTCACCATTGCTCGGATTCTTCTATGCGAAGGTCATCCGCAGCACGAAGACCTCCTCTTAGCGTATTTCCCCACTTTCTTAAGTCCACAAAACTCGTCCAAGGTTACTTCTCTCCTTGTATCGGAACTAGAATGAAACCTGCTCTCGTTCATTCTCCTTTGCTGGCTGGTGACGGCCATGGGCATGGTGGAAACAACAATGGTGGTTGGAATAATTCGAATCCTTTTGGGGGTTTTGGATGGTGGCAGGATGACGGTGATTCTTCCCCATGGTCGGACAATGCCTTCTTTGCCTTCTTCTTTACCTCCATTTTGGGCTGTTTCTGCCTTTTTCAATTGGCAGCAGCGCTAGCACGTAATGAAATGAACTATGAGTCTGTTTGGGAAGTAAAAGGAGCGATGTATTCAGGCGTTTGATGCTTCCGGAGGGTTTTCCAGACAGCGTTACCAGCGACTATCTGGAATATTCTCTTTGGCGAGGAGTGCAGGGGATTGCCAGCCAAGTTAGTGGGGTCCTTGCAACTCAGGCACTGCTTTATGCTGTTGGATTGGGAAAAGGAGCTATTCCGACTGCTGCTGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTGAGTAAAATTTTATTCTCAAAATATGGACGGCACTTTGACGTTCATCCGAAGGGATGGAGGTTGTTTGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGTTAACTCCCGCATTTCCCCTGCATTTTGTCGTGATCGGTGCTGCTGCTGGGGCTGGACGATCTGCAGCCGCCTTGATTCAGGCTGCTACTAGGAGTTGTTTTTATGCTGGCTTTGCTGCTCAAAGGAATTTTGCCGAGGTGATTGCCAAAGGTGAAGCACAAGGAATGGTGAGCAAGTCTATTGGTATGATGCTTGGCATTACATTGGCTAATCGTATAAGGTCCTCAACATCTCTTGCTCTTGGATGCTTTAGCATAGTGACCTTAATCCACATGTTCTGCAATCTAAAATCATACAAGTCCATTGAACTAAGGACACTAAATCCTTATCGTGCAAGTTTGGTGTTTAGTGAATATCTGTTGAGTGGTGAGGTGCCTTCTATTAAAGATGTAAACAATGAAGAACCTCTTTTTCCAGCTGTACCATTTCTTAATACAAGGCTTGCTTGTGATGAGCCAAAATTGGGTTTACTATCTGCTGAAGCAAAGGAATCAGCAGCTAATATTGAAAAGCGACTGCAGTTGGGATCTAAGCTCAGTGATGTGGCCACCTGTGAGGAGGATGTTCTTGAACTCTTAAGTCTGTTTAATAAAGAAAATTACATTCTGTCAGAGCACAGGGGGAAATATTGTGTAATTCTTAAAGAAAGTGCTTCACCAGTAGACATGCTGAAGGCAGTGTTTCATGTCAATTATTTGCACTGGTTAGAGAGAAACGCTGGAATAACAGCAAGAAGTGCTTCTAATGACTGCAGACCAGGAGGAAGGCTGCAAATGTCTTTGGAATATGTGGAGAGGGAATTCAACCATATCAAATATGATGGGGAATTGGCTGGTTGGTTGACTGATGGCCTAATTGCAAGGCCGTTAACTAATAGGATTTGTGAATGTCATGTAGCCACCTAAGGACGAGTTACTACACTACCAAAGCTACCCAGGACATTTCTAATAACACTGGATTTTGAGAAATTGGTATGAAATTCATCTCACTTTAAAAAAATTCAAACAATCAAATAGGATGCATGTGACATACAATATATTTGACAGTCTAGAAATT
CACCAAATCCATGATATCATACTCTCATGGATCTACATGTGTTCTACTTTTAAACAAATCATACCTTTGTTCCATAGAAATTTCAGTATATAAATTCACCGAATTCATGATCTTAAATTAGTATCAAGAATCTAATAAGAGCTTGTTTGTTTCATTGATTTCA

Coding sequence (CDS)

ATGCTTCCGGAGGGTTTTCCAGACAGCGTTACCAGCGACTATCTGGAATATTCTCTTTGGCGAGGAGTGCAGGGGATTGCCAGCCAAGTTAGTGGGGTCCTTGCAACTCAGGCACTGCTTTATGCTGTTGGATTGGGAAAAGGAGCTATTCCGACTGCTGCTGCAGTGAATTGGGTACTGAAAGATGGATTTGGATATCTGAGTAAAATTTTATTCTCAAAATATGGACGGCACTTTGACGTTCATCCGAAGGGATGGAGGTTGTTTGCTGATCTTCTGGAAAACGCTGCCTATGGGATGGAAATGTTAACTCCCGCATTTCCCCTGCATTTTGTCGTGATCGGTGCTGCTGCTGGGGCTGGACGATCTGCAGCCGCCTTGATTCAGGCTGCTACTAGGAGTTGTTTTTATGCTGGCTTTGCTGCTCAAAGGAATTTTGCCGAGGTGATTGCCAAAGGTGAAGCACAAGGAATGGTGAGCAAGTCTATTGGTATGATGCTTGGCATTACATTGGCTAATCGTATAAGGTCCTCAACATCTCTTGCTCTTGGATGCTTTAGCATAGTGACCTTAATCCACATGTTCTGCAATCTAAAATCATACAAGTCCATTGAACTAAGGACACTAAATCCTTATCGTGCAAGTTTGGTGTTTAGTGAATATCTGTTGAGTGGTGAGGTGCCTTCTATTAAAGATGTAAACAATGAAGAACCTCTTTTTCCAGCTGTACCATTTCTTAATACAAGGCTTGCTTGTGATGAGCCAAAATTGGGTTTACTATCTGCTGAAGCAAAGGAATCAGCAGCTAATATTGAAAAGCGACTGCAGTTGGGATCTAAGCTCAGTGATGTGGCCACCTGTGAGGAGGATGTTCTTGAACTCTTAAGTCTGTTTAATAAAGAAAATTACATTCTGTCAGAGCACAGGGGGAAATATTGTGTAATTCTTAAAGAAAGTGCTTCACCAGTAGACATGCTGAAGGCAGTGTTTCATGTCAATTATTTGCACTGGTTAGAGAGAAACGCTGGAATAACAGCAAGAAGTGCTTCTAATGACTGCAGACCAGGAGGAAGGCTGCAAATGTCTTTGGAATATGTGGAGAGGGAATTCAACCATATCAAATATGATGGGGAATTGGCTGGTTGGTTGACTGATGGCCTAATTGCAAGGCCGTTAACTAATAGGATTTGTGAATGTCATGTAGCCACCTAA

Protein sequence

MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECHVAT
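
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here for brevity.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, stop_at_stop=True):
    """Translate a nucleotide CDS in frame 1; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*" and stop_at_stop:
            break  # do not include the stop codon in the protein
        protein.append(residue)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGCTTCCGGAGGGTTTTCCAGACAGC"))  # prints: MLPEGFPDS
# Last three codons of the CDS, ending in the TAA stop codon.
print(translate("GCCACCTAA"))  # prints: AT
```

The two outputs agree with the start (`MLPEGFPDS...`) and the end (`...HVAT`) of the protein sequence given in the record.
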
BLAST of Lsi05G004180 vs. Swiss-Prot
Match: RUS1_ARATH (Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana GN=RUS1 PE=1 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 1.4e-164
Identity = 287/396 (72.47%), Positives = 342/396 (86.36%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           +LPEGFP+SVTSDYL+YSLWRGVQGIASQ+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVL
Sbjct: 198 LLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVL 257

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKI+ SKYGRHFDVHPKGWRLFADLLENAA+GMEMLTP FP  FV+IGAAAGA
Sbjct: 258 KDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGA 317

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+G++LGI +AN I +STS
Sbjct: 318 GRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTS 377

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LAL  F +VT IHM+ NLKSY+ I+LRTLNPYRASLVFSEYL+SG+ P IK+VN+EEPLF
Sbjct: 378 LALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLF 437

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           P V F N + + ++ +  +LS+EAK +AA+IE+RLQLGSKLSDV   +E+ + L  L+  
Sbjct: 438 PTVRFSNMK-SPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRN 497

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH+G++CV+LKES++P DML+++F VNYL+WLE+NAGI   S  +DC+PGGRL 
Sbjct: 498 EGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLH 557

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRI 397
           +SL+YV REF H K D E  GW+T+GLIARPL  RI
Sbjct: 558 ISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRI 592
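
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below, with a hypothetical helper name `hsp_percentages`, parses such a line and recomputes both values; the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches "Identity = 287/396" and "Positives = 342/396"
# (also tolerates the misspelling "Postives").
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(summary_line):
    """Return {'identity': pct, 'positives': pct} recomputed from the counts."""
    result = {}
    for label, matched, length in STAT_RE.findall(summary_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result

line = "Identity = 287/396 (72.47%), Positives = 342/396 (86.36%), Query Frame = 1"
print(hsp_percentages(line))  # prints: {'identity': 72.47, 'positives': 86.36}
```

For the RUS1_ARATH hit this reproduces the reported 72.47% identity and 86.36% positives over 396 aligned columns.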

BLAST of Lsi05G004180 vs. Swiss-Prot
Match: RUS3_ARATH (Protein root UVB sensitive 3 OS=Arabidopsis thaliana GN=RUS3 PE=2 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 4.8e-43
Identity = 122/403 (30.27%), Positives = 209/403 (51.86%), Query Frame = 1

Query: 2   LPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLG-KGAIPTAAAVNWVL 61
           +PEGFP SVT DY+ + LW  +QG+++    +L+TQALL A+G+G K A    A   W L
Sbjct: 58  VPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFL 117

Query: 62  KDGFGYLSKILFSKY-GRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 121
           +D  G L  ILF+ Y G + D + K WRL ADL+ +    M++L+P FP  F+V+     
Sbjct: 118 RDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGS 177

Query: 122 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 181
             RS   +   ATR+     FA Q N A++ AK  +Q  ++  +GM LG+ LA R  S  
Sbjct: 178 LSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLA-RFTSGN 237

Query: 182 SLALG-CFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEP 241
            +A+   F  +T+ HM+ N ++ + + L +LN  R+S++ + ++ +G+V S + V++ E 
Sbjct: 238 PMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEG 297

Query: 242 LFPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELL--- 301
           + P               L   S  +  S   + KR+QLG ++S +     D+L+LL   
Sbjct: 298 VLP---------------LWATSLRSTNSKP-LHKRVQLGVRVSSLPRL--DMLQLLNGV 357

Query: 302 --SLFNKENYILSEHRGKYCVILKESASPVDMLKAVFHVNYL-HWLERNAGITARSASND 361
             S +    Y+L+  +G   VIL + + P D+LK+  H   L + +E++    +   +  
Sbjct: 358 GASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEA-- 417

Query: 362 CRPGGRLQMSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNR 396
                       ++++ ++ + +     GW T+ L++  +T R
Sbjct: 418 ------------WIDKHYDELLHKLRSGGWKTERLLSPSITWR 427

BLAST of Lsi05G004180 vs. Swiss-Prot
Match: RUS6_ARATH (Protein root UVB sensitive 6 OS=Arabidopsis thaliana GN=RUS6 PE=2 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 1.5e-41
Identity = 116/409 (28.36%), Positives = 193/409 (47.19%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAA-AVNWV 60
           ++PEGFP SV   Y+ Y  WR ++       GV  TQ LL +VG  + +  +AA A+NW+
Sbjct: 112 VVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSASAAVAINWI 171

Query: 61  LKDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 120
           LKDG G + K+LF++ G+ FD   K  R   DLL     G+E+ T A P  F+ +  AA 
Sbjct: 172 LKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 231

Query: 121 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 180
             ++ AA+   +TR+  Y  FA   N  +V AKGE  G ++  +G    I ++ R  S  
Sbjct: 232 VVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSILISKRNPSLV 291

Query: 181 SLALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL 240
           +     F +++  ++  + +  +S+ L TLN  R ++    +L +G VPS+++ N +E +
Sbjct: 292 T----TFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEGNIQEKI 351

Query: 241 FPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFN 300
           F   P+++                        ++ + LG++  D        + +   F+
Sbjct: 352 F-TFPWVD------------------------DRPVMLGARFKDAFQDPSTYMAVKPFFD 411

Query: 301 KENYIL--SEHRGKYCVILKESASPVDMLKAVFHVN-YLHWLERNAGITARS-------- 360
           KE Y++  S  +GK   +LK  A+  D+LKA FH +  LH++ ++     RS        
Sbjct: 412 KERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVLLHFMNQSKDGNPRSVEQLDPAF 471

Query: 361 ASNDCRPGGRLQMSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRIC 398
           A  +     R+  S E V   +   K      GW     +  P   R+C
Sbjct: 472 APTEYELESRIAESCEMVSTSYGVFKSRAAEQGWRMSESLLNPGRARLC 491

BLAST of Lsi05G004180 vs. Swiss-Prot
Match: RUS1_RAT (RUS1 family protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.1e-39
Identity = 88/242 (36.36%), Positives = 140/242 (57.85%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVN-WV 60
           +LP+GFPDSV+ DYL+Y LW  VQ  AS +SG LATQA+L  +G+G      +AA + W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 61  LKDGFGYLSKILFSKY-GRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAA 120
           +KD  G L +I+F+ + G   D + K WRLFAD+L + A  +E++ P +P+ F +  + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194

Query: 121 GAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSS 180
              +    +   ATR+      A + N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 181 TSLALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEP 240
            SL+LGCF ++T +H++ N ++ +++ L TLN  R  LV   +L  GEV      N  EP
Sbjct: 255 LSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314

BLAST of Lsi05G004180 vs. Swiss-Prot
Match: RUS1_MOUSE (RUS1 family protein C16orf58 homolog OS=Mus musculus PE=1 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 4.2e-39
Identity = 105/335 (31.34%), Positives = 170/335 (50.75%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVN-WV 60
           +LP+GFPDSV+ DYL Y LW  VQ  AS +SG LATQA+L  +G+G      +AA + W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 61  LKDGFGYLSKILFSKY-GRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAA 120
           +KD  G L +I+ + + G   D + K WRLFAD+L + A  +E++ P +P+ F +  + +
Sbjct: 135 VKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 121 GAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSS 180
              +    +   ATR+      A + N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 181 TSLALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEP 240
            SL+LGCF ++T +H++ N ++ +++ L TLN  R  LV   +L  GEV      N  EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314

Query: 241 LFPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLF 300
           L+              P L                 L LG  L  + +   ++ +L+   
Sbjct: 315 LWTGF----------WPSLS----------------LSLGVPLHHLVSSVSELKQLVE-G 374

Query: 301 NKENYIL--SEHRGKYCVILKESASPVDMLKAVFH 332
           + E Y+L  ++ R +  V L + A P  +L+A  H
Sbjct: 375 HHEPYLLCWNKSRNQVQVALSQEAGPETVLRAATH 382

BLAST of Lsi05G004180 vs. TrEMBL
Match: A0A061GE65_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1)

HSP 1 Score: 624.8 bits (1610), Expect = 7.4e-176
Identity = 310/400 (77.50%), Positives = 354/400 (88.50%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           +LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 186 LLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 245

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKI+ SKYGRHFDV+PKGWRLFADLLENAA+G+EMLTPAFP  FV IGAAAGA
Sbjct: 246 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGA 305

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG++LGI LAN + SSTS
Sbjct: 306 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTS 365

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LAL  F +VT +HM+CNLKSY+SI+LRTLN YRASLVFSEYLLSG+ PSIK+VN+EEPLF
Sbjct: 366 LALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLF 425

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PAVPFLN  L+ +  +  +LS+EAK++AA+IE+RLQLGSKLSD+   +ED L L SL+  
Sbjct: 426 PAVPFLNL-LSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKD 485

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH GK+CV+LKES+ P DMLK++F VNYL+WLERNAGI A  AS DCRPGGRLQ
Sbjct: 486 EGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLERNAGIEASGASTDCRPGGRLQ 545

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECH 401
           +S+EYV+REFNH+K D E  GW+TDGLIARPL NRI   H
Sbjct: 546 ISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGH 584

BLAST of Lsi05G004180 vs. TrEMBL
Match: D7SX09_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00220 PE=4 SV=1)

HSP 1 Score: 621.7 bits (1602), Expect = 6.3e-175
Identity = 309/401 (77.06%), Positives = 350/401 (87.28%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           MLPEGFP SVTSDYL+Y+LWRGVQG+ASQ+SGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 87  MLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 146

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKIL SKYGRHFDVHPKGWRLFADLLENAAYG+E+LTPAFP  F++IGA AGA
Sbjct: 147 KDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGAVAGA 206

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQA+TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG+MLGI LAN I SS  
Sbjct: 207 GRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIGSSAP 266

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           L+   F++VT +HMFCNLKSY+SI+LRTLNPYRASLVFSEYLLSG+VPSIK+VN EEPLF
Sbjct: 267 LSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEEEPLF 326

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           P VP LN +    + +  +LS EAK++AA IE+RLQLGSKLS+V + +EDVL L  L+  
Sbjct: 327 PVVPLLNAK-PTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDLYRN 386

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH+G++ VILKES SP DMLK+VFHVNYL+WLERNAGI +  AS+DCRPGGRLQ
Sbjct: 387 EAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGGRLQ 446

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECHV 402
           +SLEYV+REFNH+K D E  GW TDGLIARPL NRI   H+
Sbjct: 447 ISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHI 486

BLAST of Lsi05G004180 vs. TrEMBL
Match: M1BL01_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400018482 PE=4 SV=1)

HSP 1 Score: 617.8 bits (1592), Expect = 9.0e-174
Identity = 310/396 (78.28%), Positives = 348/396 (87.88%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           +LPEGFPDSVTSDYLEY+LWRGVQG+A+Q+SGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 203 LLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 262

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKIL S YGRHFDV+PK WRLFADLLENAAYG+E+LTPAFP  FV IGA AGA
Sbjct: 263 KDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFPHLFVPIGAVAGA 322

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAA+LIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IG+MLGI LAN  RSSTS
Sbjct: 323 GRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLGIALANCTRSSTS 382

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LAL  F +VT IHMFCNLKSY SI+LRTLNPYRASLVFSEYLLSG VPS+K+VN+EEPLF
Sbjct: 383 LALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLVPSVKEVNDEEPLF 442

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PA   LN + A  E ++ +LS  AK++AA I +RLQLGSKLSDVAT  EDVL L  L+  
Sbjct: 443 PAA-ILNLK-AAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATSREDVLALFELYKN 502

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH G++C++LKES+SP DMLK++FHVNYL+WLE  AGI + S +NDCRPGGRLQ
Sbjct: 503 EGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSSSVANDCRPGGRLQ 562

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRI 397
           MSLEYVEREFNH+K DGE+AGW+TD LIARPL NRI
Sbjct: 563 MSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRI 596

BLAST of Lsi05G004180 vs. TrEMBL
Match: A0A0V0ISL3_SOLCH (Putative UPF0420 protein C16orf58-like OS=Solanum chacoense PE=4 SV=1)

HSP 1 Score: 616.7 bits (1589), Expect = 2.0e-173
Identity = 309/396 (78.03%), Positives = 349/396 (88.13%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           +LPEGFP+SVTSDYLEY+LWRGVQG+A+Q+SGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 203 LLPEGFPESVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 262

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKIL S YGRHFDV+PK WRLFADLLENAAYG+E+LTPAFP  FV IGA AGA
Sbjct: 263 KDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFPHLFVPIGAVAGA 322

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAA+LIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IG+MLGI LAN  RSSTS
Sbjct: 323 GRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLGIALANCTRSSTS 382

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           L L  F +VT IHMFCNLKSY SI+LRTLNPYRASLVFSEYLLSG VPS+ +VN+EEPLF
Sbjct: 383 LXLASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLVPSVXEVNDEEPLF 442

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PA   LN + A  E +  +LS +AK++AA I +RLQLGSKLSDVAT  EDVL L  L+  
Sbjct: 443 PAA-ILNLK-AAYEXQXEVLSVQAKQAAAGIVRRLQLGSKLSDVATSREDVLALFELYKN 502

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E+YIL+EH G++C++LKES+SP DMLK++FHVNYL+WLE NAGI +RS +NDCRPGGRLQ
Sbjct: 503 ESYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKSRSVANDCRPGGRLQ 562

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRI 397
           MSLEYVEREFNH+K DGE+AGW+TD LIARPL NRI
Sbjct: 563 MSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRI 596

BLAST of Lsi05G004180 vs. TrEMBL
Match: K4CI48_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 614.8 bits (1584), Expect = 7.7e-173
Identity = 309/395 (78.23%), Positives = 348/395 (88.10%), Query Frame = 1

Query: 2   LPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLK 61
           LPEGFP+SVTSDYLEY+LWRGVQGIA+Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVLK
Sbjct: 201 LPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAINWVLK 260

Query: 62  DGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAG 121
           DG GYLSKIL S YGRHFDV+PK WRLFADLLENAAYG+E+LTPAFP  FV IGA AGAG
Sbjct: 261 DGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFPHLFVPIGAVAGAG 320

Query: 122 RSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSL 181
           RSAA+LIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IG+MLGI LAN  RSSTSL
Sbjct: 321 RSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLGIALANYTRSSTSL 380

Query: 182 ALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFP 241
           AL  F +VT IHMFCNLKSY+SI+LRTLNPYRASLVFSEYLLSG VPS+K+VN+EEPLFP
Sbjct: 381 ALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVKEVNDEEPLFP 440

Query: 242 AVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKE 301
           A   LN + A  E +  +LS  AK++AA I +RLQLGSKLSDVAT +EDVL L  L+  E
Sbjct: 441 AA-ILNLK-AAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVATSQEDVLALFELYKNE 500

Query: 302 NYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQM 361
            YIL+EH G++C++LKES+SP DMLK++FHVNYL+WLE NAGI + S +NDCRPGGRLQM
Sbjct: 501 GYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKSSSVANDCRPGGRLQM 560

Query: 362 SLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRI 397
           SLEYVEREFNH+K DGE+AGW+TD LIARPL  RI
Sbjct: 561 SLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRI 593

BLAST of Lsi05G004180 vs. TAIR10
Match: AT3G45890.1 (AT3G45890.1 Protein of unknown function, DUF647)

HSP 1 Score: 580.5 bits (1495), Expect = 8.1e-166
Identity = 287/396 (72.47%), Positives = 342/396 (86.36%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           +LPEGFP+SVTSDYL+YSLWRGVQGIASQ+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVL
Sbjct: 198 LLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVL 257

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKI+ SKYGRHFDVHPKGWRLFADLLENAA+GMEMLTP FP  FV+IGAAAGA
Sbjct: 258 KDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGA 317

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+G++LGI +AN I +STS
Sbjct: 318 GRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTS 377

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LAL  F +VT IHM+ NLKSY+ I+LRTLNPYRASLVFSEYL+SG+ P IK+VN+EEPLF
Sbjct: 378 LALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLF 437

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           P V F N + + ++ +  +LS+EAK +AA+IE+RLQLGSKLSDV   +E+ + L  L+  
Sbjct: 438 PTVRFSNMK-SPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRN 497

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH+G++CV+LKES++P DML+++F VNYL+WLE+NAGI   S  +DC+PGGRL 
Sbjct: 498 EGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLH 557

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRI 397
           +SL+YV REF H K D E  GW+T+GLIARPL  RI
Sbjct: 558 ISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRI 592

BLAST of Lsi05G004180 vs. TAIR10
Match: AT1G13770.1 (AT1G13770.1 Protein of unknown function, DUF647)

HSP 1 Score: 176.8 bits (447), Expect = 2.7e-44
Identity = 122/403 (30.27%), Positives = 209/403 (51.86%), Query Frame = 1

Query: 2   LPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLG-KGAIPTAAAVNWVL 61
           +PEGFP SVT DY+ + LW  +QG+++    +L+TQALL A+G+G K A    A   W L
Sbjct: 58  VPEGFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFL 117

Query: 62  KDGFGYLSKILFSKY-GRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 121
           +D  G L  ILF+ Y G + D + K WRL ADL+ +    M++L+P FP  F+V+     
Sbjct: 118 RDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGS 177

Query: 122 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 181
             RS   +   ATR+     FA Q N A++ AK  +Q  ++  +GM LG+ LA R  S  
Sbjct: 178 LSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLA-RFTSGN 237

Query: 182 SLALG-CFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEP 241
            +A+   F  +T+ HM+ N ++ + + L +LN  R+S++ + ++ +G+V S + V++ E 
Sbjct: 238 PMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEG 297

Query: 242 LFPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELL--- 301
           + P               L   S  +  S   + KR+QLG ++S +     D+L+LL   
Sbjct: 298 VLP---------------LWATSLRSTNSKP-LHKRVQLGVRVSSLPRL--DMLQLLNGV 357

Query: 302 --SLFNKENYILSEHRGKYCVILKESASPVDMLKAVFHVNYL-HWLERNAGITARSASND 361
             S +    Y+L+  +G   VIL + + P D+LK+  H   L + +E++    +   +  
Sbjct: 358 GASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYSEGEA-- 417

Query: 362 CRPGGRLQMSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNR 396
                       ++++ ++ + +     GW T+ L++  +T R
Sbjct: 418 ------------WIDKHYDELLHKLRSGGWKTERLLSPSITWR 427

BLAST of Lsi05G004180 vs. TAIR10
Match: AT5G49820.1 (AT5G49820.1 Protein of unknown function, DUF647)

HSP 1 Score: 171.8 bits (434), Expect = 8.7e-43
Identity = 116/409 (28.36%), Positives = 193/409 (47.19%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAA-AVNWV 60
           ++PEGFP SV   Y+ Y  WR ++       GV  TQ LL +VG  + +  +AA A+NW+
Sbjct: 112 VVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSSASAAVAINWI 171

Query: 61  LKDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 120
           LKDG G + K+LF++ G+ FD   K  R   DLL     G+E+ T A P  F+ +  AA 
Sbjct: 172 LKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 231

Query: 121 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 180
             ++ AA+   +TR+  Y  FA   N  +V AKGE  G ++  +G    I ++ R  S  
Sbjct: 232 VVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSILISKRNPSLV 291

Query: 181 SLALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL 240
           +     F +++  ++  + +  +S+ L TLN  R ++    +L +G VPS+++ N +E +
Sbjct: 292 T----TFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEGNIQEKI 351

Query: 241 FPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFN 300
           F   P+++                        ++ + LG++  D        + +   F+
Sbjct: 352 F-TFPWVD------------------------DRPVMLGARFKDAFQDPSTYMAVKPFFD 411

Query: 301 KENYIL--SEHRGKYCVILKESASPVDMLKAVFHVN-YLHWLERNAGITARS-------- 360
           KE Y++  S  +GK   +LK  A+  D+LKA FH +  LH++ ++     RS        
Sbjct: 412 KERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVLLHFMNQSKDGNPRSVEQLDPAF 471

Query: 361 ASNDCRPGGRLQMSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRIC 398
           A  +     R+  S E V   +   K      GW     +  P   R+C
Sbjct: 472 APTEYELESRIAESCEMVSTSYGVFKSRAAEQGWRMSESLLNPGRARLC 491

BLAST of Lsi05G004180 vs. TAIR10
Match: AT2G31190.1 (AT2G31190.1 Protein of unknown function, DUF647)

HSP 1 Score: 157.1 bits (396), Expect = 2.2e-38
Identity = 112/384 (29.17%), Positives = 185/384 (48.18%), Query Frame = 1

Query: 3   PEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKD 62
           P G+P SV   YL Y+ +R +Q  +S    VL+TQ+LL+A GL +     A  V+W+LKD
Sbjct: 73  PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQATVVSWILKD 132

Query: 63  GFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGR 122
           G  ++ K++ S  G   D  PK WR+ AD+L +   G+E+++P  P  F+ +       +
Sbjct: 133 GMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEMAGLGNFAK 192

Query: 123 SAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLA 182
             A +   ATR   Y+ FA + N +++ AKGEA   +    G+  GI LA+ I SS    
Sbjct: 193 GMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLASTICSSMEGK 252

Query: 183 LGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL-FP 242
           L   SI++++H++  ++  + + + TLNP R +L+ + +L +G+VPS  D+  +E L FP
Sbjct: 253 LVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLRFQEDLMFP 312

Query: 243 AVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKE 302
             P                     + A N+    ++G  L   A    +V  L  +F +E
Sbjct: 313 ERPI--------------------QDAGNV----KVGRALHK-AVKPSEVQRLKQVFVEE 372

Query: 303 NYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQM 362
            ++LS  +    ++L+  A+  D L+         WL      +     ND      LQ 
Sbjct: 373 KFLLSHGKSWTDMVLEHDATGEDALRG--------WLVAAYVKSMTKIYND-PDDIILQD 421

Query: 363 SLEYVEREFNHIKYDGELAGWLTD 386
           + + +   FN      +  GW TD
Sbjct: 433 AYDKMNDVFNPFLSQVQAKGWYTD 421

BLAST of Lsi05G004180 vs. TAIR10
Match: AT5G01510.1 (AT5G01510.1 Protein of unknown function, DUF647)

HSP 1 Score: 143.7 bits (361), Expect = 2.5e-34
Identity = 103/397 (25.94%), Positives = 186/397 (46.85%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGK--------GAIPT 60
           + P GFP SV+ DYL+Y LW+    I   +  VL T +LL AVG+G          A  +
Sbjct: 119 VFPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAAS 178

Query: 61  AAAVNWVLKDGFGYLSKILF-SKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHF 120
           AAA+ WV KDG G L ++L   ++G  FD  PK WR++AD + +A    ++ T  +P  F
Sbjct: 179 AAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQF 238

Query: 121 VVIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITL 180
           +++ +     ++ A  ++  +       FA   N  EV AK E   + ++ IG+  GI +
Sbjct: 239 LLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILI 298

Query: 181 ANR--IRSSTSLALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPS 240
            +   +  S    L  ++ + L+H++   +S   ++  T+N  RA ++   +++   VP 
Sbjct: 299 IDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPG 358

Query: 241 IKDVNNEEPLFPAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEE 300
             D N  E +     F+  R         ++   + E  + +EK +              
Sbjct: 359 YVDCNKRENILLWQRFMKPR---------IIFGVSLEELSGLEKSV-------------S 418

Query: 301 DVLELLSLFNKENYILSEHR----GKYCVILKESASPVDMLKAVFHVNYLHWLERNAGIT 360
            V  LL ++ KE YIL+ ++     ++ V  K +A+  D+L+ ++     +WLE N   +
Sbjct: 419 KVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQA---YWLEENMEES 478

Query: 361 ARSASNDCRPGGRLQMSLEYVEREFNHIKYDGELAGW 383
            +   +       L+ SL  ++ +F+   +  + AGW
Sbjct: 479 FKDKDSVFH---WLKQSLSEMDNKFDDFLFKLDTAGW 487

BLAST of Lsi05G004180 vs. NCBI nr
Match: gi|778680559|ref|XP_011651345.1| (PREDICTED: protein root UVB sensitive 1, chloroplastic [Cucumis sativus])

HSP 1 Score: 777.3 bits (2006), Expect = 1.3e-221
Identity = 389/401 (97.01%), Positives = 393/401 (98.00%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 211 MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 270

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDGFGYLSKI  SKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA
Sbjct: 271 KDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 330

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS
Sbjct: 331 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 390

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LALGCFSIVTLIHMFCNLKSYKSI+LRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF
Sbjct: 391 LALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 450

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PAVP LN +LACDEPKL LLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK
Sbjct: 451 PAVPLLNRKLACDEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 510

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           ENYILSEHRGKYCV+LKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ
Sbjct: 511 ENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 570

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECHV 402
           MSLEYVEREF H+KYDGELAGW TDGLIARPLT RICECHV
Sbjct: 571 MSLEYVEREFKHVKYDGELAGWSTDGLIARPLTTRICECHV 611

BLAST of Lsi05G004180 vs. NCBI nr
Match: gi|659098056|ref|XP_008449956.1| (PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis melo])

HSP 1 Score: 770.0 bits (1987), Expect = 2.0e-219
Identity = 385/401 (96.01%), Positives = 391/401 (97.51%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 210 MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 269

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDGFGYLSKI  SKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA
Sbjct: 270 KDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 329

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLAN IRSSTS
Sbjct: 330 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANHIRSSTS 389

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LALGCFSIVTLIHMF NLKSYKSI+LRTLNPYRASLVFSEYL SGEVPSIK+VNNEEPLF
Sbjct: 390 LALGCFSIVTLIHMFSNLKSYKSIQLRTLNPYRASLVFSEYLFSGEVPSIKEVNNEEPLF 449

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PAVP LNTRL CDEPKLGLLSAEAKESAANI++RLQLGSKLSDVATCE DVLELLSLFNK
Sbjct: 450 PAVPLLNTRLGCDEPKLGLLSAEAKESAANIDQRLQLGSKLSDVATCEADVLELLSLFNK 509

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           ENYILSEHRGKYCV+LKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ
Sbjct: 510 ENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 569

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECHV 402
           MSLEYVEREF H+KYDGELAGWLTDGLIARPLT RICECHV
Sbjct: 570 MSLEYVEREFKHVKYDGELAGWLTDGLIARPLTTRICECHV 610

BLAST of Lsi05G004180 vs. NCBI nr
Match: gi|590680331|ref|XP_007040833.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 624.8 bits (1610), Expect = 1.1e-175
Identity = 310/400 (77.50%), Positives = 354/400 (88.50%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           +LPEGFPDSVTSDYL+YSLWRGVQG+ASQ+SGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 186 LLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 245

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKI+ SKYGRHFDV+PKGWRLFADLLENAA+G+EMLTPAFP  FV IGAAAGA
Sbjct: 246 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGA 305

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG++LGI LAN + SSTS
Sbjct: 306 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTS 365

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           LAL  F +VT +HM+CNLKSY+SI+LRTLN YRASLVFSEYLLSG+ PSIK+VN+EEPLF
Sbjct: 366 LALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLF 425

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PAVPFLN  L+ +  +  +LS+EAK++AA+IE+RLQLGSKLSD+   +ED L L SL+  
Sbjct: 426 PAVPFLNL-LSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKD 485

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH GK+CV+LKES+ P DMLK++F VNYL+WLERNAGI A  AS DCRPGGRLQ
Sbjct: 486 EGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLERNAGIEASGASTDCRPGGRLQ 545

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECH 401
           +S+EYV+REFNH+K D E  GW+TDGLIARPL NRI   H
Sbjct: 546 ISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGH 584

BLAST of Lsi05G004180 vs. NCBI nr
Match: gi|731434384|ref|XP_010645036.1| (PREDICTED: protein root UVB sensitive 1, chloroplastic [Vitis vinifera])

HSP 1 Score: 624.0 bits (1608), Expect = 1.8e-175
Identity = 311/403 (77.17%), Positives = 352/403 (87.34%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           MLPEGFP SVTSDYL+Y+LWRGVQG+ASQ+SGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 222 MLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 281

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKIL SKYGRHFDVHPKGWRLFADLLENAAYG+E+LTPAFP  F++IGA AGA
Sbjct: 282 KDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGAVAGA 341

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQA+TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG+MLGI LAN I SS  
Sbjct: 342 GRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIGSSAP 401

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           L+   F++VT +HMFCNLKSY+SI+LRTLNPYRASLVFSEYLLSG+VPSIK+VN EEPLF
Sbjct: 402 LSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEEEPLF 461

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           P VP LN +    + +  +LS EAK++AA IE+RLQLGSKLS+V + +EDVL L  L+  
Sbjct: 462 PVVPLLNAK-PTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDLYRN 521

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YIL+EH+G++ VILKES SP DMLK+VFHVNYL+WLERNAGI +  AS+DCRPGGRLQ
Sbjct: 522 EAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGGRLQ 581

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECHVAT 404
           +SLEYV+REFNH+K D E  GW TDGLIARPL NRI   HVA+
Sbjct: 582 ISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHVAS 623

BLAST of Lsi05G004180 vs. NCBI nr
Match: gi|657970594|ref|XP_008377058.1| (PREDICTED: UPF0420 protein C16orf58 homolog isoform X1 [Malus domestica])

HSP 1 Score: 623.6 bits (1607), Expect = 2.4e-175
Identity = 313/402 (77.86%), Positives = 353/402 (87.81%), Query Frame = 1

Query: 1   MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVL 60
           MLPEG+PDSVTSDYLEYSLWRGVQG+ASQ+SGVLATQALLYAVGLGKGAIPTAAAVNWVL
Sbjct: 185 MLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 244

Query: 61  KDGFGYLSKILFSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGA 120
           KDG GYLSKI+ SKYGRHFDV+PKGWRLFADLLENAA+GMEMLTPAFP  F++IGAAAGA
Sbjct: 245 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEMLTPAFPHLFLLIGAAAGA 304

Query: 121 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTS 180
           GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKS G+MLGI LAN I SS +
Sbjct: 305 GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSFGIMLGIALANHIGSSMA 364

Query: 181 LALGCFSIVTLIHMFCNLKSYKSIELRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLF 240
           L L  FS+VT IHMFCNLKSY+SI++RTLNPYRASLVFSEYLLSG+   +KDVN EEPLF
Sbjct: 365 LGLASFSMVTWIHMFCNLKSYQSIQIRTLNPYRASLVFSEYLLSGQASPVKDVNEEEPLF 424

Query: 241 PAVPFLNTRLACDEPKLGLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 300
           PAVPFLN++ A     +G LS+ AKE+AA IE+RLQLGSKLSD+   ++DVL LLSL+NK
Sbjct: 425 PAVPFLNSKSANKAHSVG-LSSNAKEAAAEIERRLQLGSKLSDLVNNKDDVLALLSLYNK 484

Query: 301 ENYILSEHRGKYCVILKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQ 360
           E YILSEH+G+YCV+LKE++S  DML+A+F VNYL+WLE+NAG  AR  S DC+PGG L 
Sbjct: 485 EGYILSEHKGRYCVVLKETSSLQDMLRALFQVNYLYWLEKNAGYEARGTSVDCKPGGWLH 544

Query: 361 MSLEYVEREFNHIKYDGELAGWLTDGLIARPLTNRICECHVA 403
           +SLEYV REFNH+K D E AGW+TDGLIARPL NRI   +VA
Sbjct: 545 LSLEYVRREFNHVKNDAESAGWVTDGLIARPLPNRIRPVYVA 585
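
The percentages reported on each HSP statistics line above appear to be plain ratios over the alignment length (for example 122/403 for the RUS3_ARATH hit). The following is a minimal Python sketch, not part of the database output, that parses one such line and recomputes both percentages. The function names `parse_hsp_stats` and `recompute_percentages` are hypothetical, and the regular expression assumes the exact line layout shown in this report.

```python
import re

# Matches lines such as:
#   Identity = 122/403 (30.27%), Positives = 209/403 (51.86%), Query Frame = 1
# The second label is matched loosely (Pos\w+) so that spelling variants
# of "Positives" in exported reports are still accepted.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, identity_pct, positive, _, positives_pct = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "length": int(length),
        "identity_pct": float(identity_pct),
        "positives_pct": float(positives_pct),
    }


def recompute_percentages(stats):
    """Recompute identity and positives as percentages of alignment length."""
    identity = 100.0 * stats["identical"] / stats["length"]
    positives = 100.0 * stats["positive"] / stats["length"]
    return identity, positives


if __name__ == "__main__":
    example = "Identity = 122/403 (30.27%), Positives = 209/403 (51.86%), Query Frame = 1"
    stats = parse_hsp_stats(example)
    identity, positives = recompute_percentages(stats)
    print(f"identity {identity:.2f}% (reported {stats['identity_pct']}%)")
    print(f"positives {positives:.2f}% (reported {stats['positives_pct']}%)")
```

Comparing the recomputed values with the reported ones is a quick sanity check when alignment lines have been copied or reflowed by hand.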

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
RUS1_ARATH  1.4e-164  72.47         Protein root UVB sensitive 1, chloroplastic OS=Arabidopsis thaliana GN=RUS1 PE=1...
RUS3_ARATH  4.8e-43   30.27         Protein root UVB sensitive 3 OS=Arabidopsis thaliana GN=RUS3 PE=2 SV=1
RUS6_ARATH  1.5e-41   28.36         Protein root UVB sensitive 6 OS=Arabidopsis thaliana GN=RUS6 PE=2 SV=1
RUS1_RAT    1.1e-39   36.36         RUS1 family protein C16orf58 homolog OS=Rattus norvegicus PE=2 SV=1
RUS1_MOUSE  4.2e-39   31.34         RUS1 family protein C16orf58 homolog OS=Mus musculus PE=1 SV=1
Match Name        E-value   Identity (%)  Description
A0A061GE65_THECC  7.4e-176  77.50         Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_016682 PE=4 SV=1
D7SX09_VITVI      6.3e-175  77.06         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00220 PE=4 SV=...
M1BL01_SOLTU      9.0e-174  78.28         Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400018482 PE=4 SV=1
A0A0V0ISL3_SOLCH  2.0e-173  78.03         Putative UPF0420 protein C16orf58-like OS=Solanum chacoense PE=4 SV=1
K4CI48_SOLLC      7.7e-173  78.23         Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
Match Name   E-value   Identity (%)  Description
AT3G45890.1  8.1e-166  72.47         Protein of unknown function, DUF647
AT1G13770.1  2.7e-44   30.27         Protein of unknown function, DUF647
AT5G49820.1  8.7e-43   28.36         Protein of unknown function, DUF647
AT2G31190.1  2.2e-38   29.17         Protein of unknown function, DUF647
AT5G01510.1  2.5e-34   25.94         Protein of unknown function, DUF647
Match Name                       E-value   Identity  Description
gi|778680559|ref|XP_011651345.1|  1.3e-221  97.01     PREDICTED: protein root UVB sensitive 1, chloroplastic [Cucumis sativus]
gi|659098056|ref|XP_008449956.1|  2.0e-219  96.01     PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis melo]
gi|590680331|ref|XP_007040833.1|  1.1e-175  77.50     Uncharacterized protein isoform 1 [Theobroma cacao]
gi|731434384|ref|XP_010645036.1|  1.8e-175  77.17     PREDICTED: protein root UVB sensitive 1, chloroplastic [Vitis vinifera]
gi|657970594|ref|XP_008377058.1|  2.4e-175  77.86     PREDICTED: UPF0420 protein C16orf58 homolog isoform X1 [Malus domestica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR006968  RUS_fam
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032502      developmental process
biological_process  GO:0010224      response to UV-B
biological_process  GO:0008150      biological_process
biological_process  GO:0007155      cell adhesion
cellular_component  GO:0009941      chloroplast envelope
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005540      hyaluronic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi05G004180.1  Lsi05G004180.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description            Source   Source Term    Source Description            Alignment
IPR006968  Root UVB sensitive family  PANTHER  PTHR12770      FAMILY NOT NAMED              coord: 267..398, score: 1.5E-277; coord: 1..251, score: 1.5E
IPR006968  Root UVB sensitive family  PFAM     PF04884        DUF647                        coord: 1..223, score: 2.6
None       No IPR available           PANTHER  PTHR12770:SF7  PROTEIN ROOT UVB SENSITIVE 1  coord: 267..398, score: 1.5E-277; coord: 1..251, score: 1.5E