Lsi01G020640 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi01G020640
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionE3 ubiquitin-protein ligase CHFR
Locationchr01 : 28861192 .. 28883630 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGAATGAGAGAGTGAGACAACGACAAATATGAGAGGGATATGTGAGAGAGGAGAGTGAGTTGTAAAAGAGAAGAGAGTGAGAGCCCATTTCTTGTGTGCAAGGTTTGTTTCGATAATTTCACTTACATTTGATATCAGCAAAGGGTAAAAAAAAATATTGTTTCCAAAAAGAAGGCAATTTTTATTTTTTATTTTCTAAAATTTGAGTCATTCGTGAAAAAAAAAAAAAAAAAAAAAAACCCTTTTGTAATTTGCCCCAATTTAGTGTAAGTATAAATTATGGGACAATTAGTTTAATACGTGCGTCCACGTACAACGTAGTCATCAGCCGTTTACTCCACCTTCCTCCGCACGCTTCACAATTTCCTCTCTGAGTTTTGAGCGTTTGTATTTGTAATTTGTTTCATTTCAAAATTCTCAGTATCAGCCATCAGTCGCTCTCTTCCCCTTCTCCTTCTCCTACTGGCGGTTATGGCGGAAATTGGAGAGTGCTCTGCTTCAAATTCCTCCATAGAAAACTGGGCGAAGCTAAGTATGTTTCCTCTTGTACTTTCTTCCCATTTCCCTCCCCCCCTGAGTTCTTCGCGGTCTGTCTCTTGTTTGACTGATCCGATGGCTGTTCCGAGTTGCATTGTTGGATTCTCTGTTTGATTTGATTGTGCTCAGCATTTGAGAGTTGGACACTTTTTTTTAAAGAAATAATTGTTCCATTCTTTGCCATCCCAAATAATTATTCTTCAGTATGAAGTCTGGGTTGAAATTACTAGGGAAGATAATGTCATCGAGGAAATGATTTTTGGCGGAACGAGCAGTTGAACTAGGTTTAACGTTTATGGAAGGTCGACGGTTCATTTGTATTTGATTTAAACTGTTAGCAGTATGATTTACATAGATGAAGGAAATTGTCTCACTAGTTTTTCTATCGGTGGTTTAAATTCATTCAGAGAAAATGGGTAGAAAGTATAAGAATTAAAGAATTACTATTTTGATGAAGATATACAAGAGGAAGAGGAGAGAGACTATAAGGAACCGTACTCCCCTACTTAGCTTAAACGATCACAGAAAGCAAGGGAAAATTTTTACAAGATCATCCATGCAAAGAACACCGCTTAGCTACCGAAAACGAATGAACCGCTCAAAGTCTATATAACCTCTCTGAATTTATAGCTCAAAATCTATATTCCTTCCAATCCAAAATTCTCAATATCTATTTACTTTACCATCAAAAGTGACCAGATAAAGCCTTTATATTCTTTGAAACATTCCCATACCCTTGTGGAAATGATTACTTTATAGATTGTCTCAATTTCTATGAACAAGGTAGTTGAAACTCGATCATGGATTTGCATGAATGAAACTTGATGATGGATTGAAGATCATTCTCGAACCTCTTTGTCATTTAAATCTATGTATATATATATATATATCTTTTTAAAAAAACAAGAAACACAAATTTTCACGAATGATTGAAAATTTATAAAAGAGTACAAGAATTCAAAGGCTACAATGTGTAGACTTAGAAAAATAAAAGTAAGGAGTGGAACAAAACTGCAAAATAGAAAGAGGGAAAGCTATAACTTGACCATACAAAAAAGCCTCTCAATTGTTTCTTGTCATTTGCAGATCTTTTAATATCAAGATTACACTTTTGGGATGATCTCCTTTCAAGAAGCCTTCCTTTCAATTTCCTATGTTGGGCAGGTATTAGGAATAGGAAACGTCATTTCATGCCATTGGAATTCTGCTTGCTACCGTGTCTCTTGGACAACTATTTTTTCTCCTCTAATACCAAAATGATGTAATCTAGCATCTTAGTGAATAATCTCGAAGAATACTGAACCAAAATGTTACTGTTATTCAAAAAGTCTTTCAATTACAAGCTAGGAAGTAAAGACACTTTAGTTTGGCTAGCGTATTAGAGTCCGACATGTATTAGACACTTGGACACTCCGACACTTGTTGGAGACGTATTGGACACTTGTTAGTGCAACAAATGTATTAGATATGCATAGAGTACTTGTTAAGTAGACTAAAAAGGCACATAAATGACGATAATAATAAATTTTTTAGTGTGAAATACATCAAACTATATTTTTAAGCATATAAATGCATCAACTCATTTACTTTGAATTTTCTTTTGGTATAAAATGATATATATTTTTAAAATGTATATTTTAATAAACGTATCCGTGTCCTAGATTTTTAAAAAATAATGTCATCGTGTCCGTGTCGTCTCATATTTGTGTCTCGTATCCGTATCCGTGCTTCTTGGATTACAAGAGACTAACCCTTTATTTTTAGAGGATGATAGTTCATCAAATAAACCAATTAACAATTGAAAACTAACAGTTAACAAATTAAAACCAACAAACTCAACCTAAATTACAAGGGTACCTAATCATGCTACATCATACTTGTTTTGTGTGTCGCTGGATTGTGCATTTTCTTACTTTATCTTTATTCTTTAGTTTTTTGAGGAGTCGTGTAGACTGTAGACATTGGTTATTAATAATTTCTATGAAAAGGCTTTTTTAAATTTATAGGAAAATATGTAAATATTTAACACATACCACTAGGTTTGTTATCGTAATTATTTAGCATTTGCTTGAGCATAAACTAAAAGTTCTGAAATTTCTTGCTTATGGTTTTTAACCAAATATAATGTTGAAAGGTTTCGATGAGTTTAGGCTTACATGTAGAGTGATTTATTTTTTTAAAAAGGCAAAAAAATGAAAAAAAAAATCGTTCAATTAAAAATTAAACTTTTCGATATGTAATAAATCTTAATCCATTATTTATTTATTATTATAAGTAACTGAATGTCTGTAAAAGGATAAACAAATGTTTATATGAATCCACATTTTTTAATATTTTCTTAGGTATGTTTGTACAATTTCTTCAAAATTTTCTTTTTGAAAGGAGATGGAAATTTTTTATTGACAAAATGAAACATTACATATAAGATGTTTGCAAGCAGAATGATTGCTAAGTCTTTCTTGGCATACAATAGAGCACTAGTTAGAGATATATTATGATTTATTAGTATGATTATGAGCAAGGGTATATTAGTAAATAGCTGGTAAGTTTATTAGGGCTTTTAGTTATAAATAGAGGGGGTGGGGTAAGGAGGAAGGTGCAAAAATTTTAGTAGAATTTTCAATGTGGGAATTTGGGAGAGAGTAGCCCTCTTGAAAGGCTATGATATATTATAGTTTCTTCATTGATATTGCAATATATATTTCTATCTTTTAATGTTCTTTGTGTTCTTTGTTGTTCTTGAGTGGTTTTGTTCGGTGGTAACCTAACATACTCAAATTTTTAAAATTTATTGCAATCTTCTATTTTTTTTTTATTATTATTATTTAAAAAAAAAACTACTCAATGTAAATTGGCATCATTTTTTGGAAATTAATTAGCTATCTTATTTTTATCTTTTCCAGTTCCACCTGATAAAATATATGCAGATATTGAGATAAACTCTGATGAGACCATTATCTTCTCGGAGACAAAATCCACATCTGTTGAAAAACACGAATGGTGCAAAATCACAAGGAATCCAGATCTAAAGTCTGCCACTCTGCAGAATAGAAGGTAATATGCTGTAAATACAAAGTAATTAGTATTTAAGTCAAATGATATCCCCATAAATCAAAGCTCCAATACATTTTGGTCGAGGCATTTAAGATCTCATATAAAATTGTAAACCTTAGAAAAAATCTCCACTTTTACCCTTATGACTCTAGGAGCTTTGGTATCGAAGTTTGGTGAAGAAAAAGGCTGAAGGTTCATGATTTTACACTGATGTTGCGGATTAGCTAGCTGATTGTAAGATCATGGGAGTTAAGGTGGAAATTTCGCCCAAGATCTTGACTTTTCTAGGTGGAATGGATGAGACAATTTTGTTTTCTCAATGTTCATTTTTTTTTGGGTGATGTTGTTGTTGTTGTTGTCGTCATCTCTTGGAGTTGGTGTATCTCTTTGGTGGACCCATGTTGGTCCCCCATCCCATAACTGAATTCATTCCATGTAATGACCTTACTTGAACACGATATGTTTTATAGGCTGTATAACTGTGTCCTTAAGTTCTAAAATGAATTCATTCCACGTCTCACTATCTTTTCCTCCTTGTGGAATTCCCTTCCTACCCTTGCGACAATAAGTATAACGAATTGGGGGTCTCACAATACCCCCATCTCCAAATTCACCTTGTCCTCAAGGTGGAAATTAGGGAATTGTTGTTTAATGAAGTCATTCCCACATTGCTTCACTCTCGGCTAAATTCACCCATTTCACCAATCATTCGTTCATGGAGGTGTCTCCATTCCATCTATTGCCCAAAAAAGTTTCAAGTTGAATCTATAATTCAAAATCTTTGGTTAAGTTTGGAGGATGGTCTTGCACCATATGTTTCTTGCCAAGCATCTTTTTGAGTTGAGAGACGTGGAACACATTATGAATGACAACCTCTTTAGACAATAACAAGTGGTAAGCGACTTCCCCAATTTTAGCTGTGATGGGATACGGGCCATAGAACTTGGGAGATAGCTTCTCACTATTCCTCTTAGTTAATGAGCACTGATGATAAAGCTTGAGCTTTAGGAATACTTCATCCCCAACTTCAAACTGAAGCTCATGTCGATTCAGATCAGCCATCTTCTTCATTTGGTTTTGAGCTATCATCAAATGCTCCTTAAGAGCTACCAACATGCCATAAGCTGTTGTTCAATTGCATTATTGGTGGTTTTCTTCTCCCCATATGATATTAACAGTGGAGGTTTCCTTCGATACACAACATTAAAGGGCATGGTGCTCTCCCCTCTCCTCTCTCCACTCTTTTCTCTCCTTCCCCTTTCCAGACCATAGCTCCAACGACCGTTTCTCACTTCTGATCAGCACTCATCCACTTTGCCAATCTCATTGCCTTTCCACCTCTATACATCCATGGAAATGAAAAGCTGCAACATACTCAACAGACACTTCTATATTTGGTTTCAAAATTCAAGCTACTTGGTTGAAGATTTGGAGAAAGATCAGATCGCAGCAATATCACCCCAGACGCTGGAGTGGCTTGTGGAAAGCCTAAACACAATGCTTAGAGACCTAATACAAGCTTTCTACTCAGAAAAAATCAAAACCGACTGTGGTATTTCTCGGCTTGCTAAGTTTCGTGCTAATGAAGGTTGGTTTGTAGAATATGCCTTTTGGTCTTCCTTGGGTGGTAGAAGAAACATTCATATTCCGGCAGGGGAAAATAAACAAGGTTGGCAATCGTTTCTCTCAATGCTCAAGGAAACTCTGATACTTAGTACAGTTAATTCACGAGACAACAAAGAAGGCATCATTGGACCTACAAGAAATAGAGCTTCTACAAGATCATATGCAGACGCCGTACATTTAGAATTTGAGAGAATAAGTCGAGATAAACCAGAGAATTTGAAGGAAAACAAGGTACATTCTATGTCCTCCTCCATTTGGGTCAAAAAGGAGAAAGATATGTTGAATGTTGACTTTAATTCCTTACTAGTGGTGACGAAATTAATGGAATTTTATTCTTGGAAAGAAGTTAAATCCACCCTTGAAGACTTCTTTCAAGCATCTATTTCCATTAATCCTTATCTAGCTGATACGGCTTTGATTAAGTTGAACAAGAAGATTGATGCATATTTTAGAAATGGAAAATGGTTTGAATATGGTAAATTCCATCTTAAGTTAGAATACTGGTCCAAAATGAACCACAATCTGCCAGAAGTCACGAATTGGATTTACATAAAAAACCTACCATTGCCTTTCGAGAAAAATTCAGTTTTTGAAGCTATCGGTGATCATTTGGGAGGATTAATGGAAATTTCTTCAAAAACATTAAATTGGTTAGATTGTTCATCAGCATTAATCAAAGTTCAAAATAACATATGTGGTTTCATCTCGACATCCTTATCCATTGGAGATTCAAACTTGGGGAAATTCTTGATTCATTTTGAAGGCAAAGATGCTCTGGTTGATCAGAGAAAAAGATTTCCCAACAAAAGCAGCTTTATGTCAAGTGACTTTTTAAATTCTCTGGACATCATTCGAATTAAAGCCACCATGATTGACGAAGGTTTTTCAACAGAGATGCTTGAGAGAGGATCATAAGTTGGAAAAGAAGATGAAGAATTGCATTAAGCTACAAATGAGATTCTGTTGATTATTTCCTATTTTGTTAATATTCTTACTGTATTTATTTCAACATATTTGCCATATTTATTTTTTCCTTATTTGTATTTGGGTATTTCTTCTATTTAAGAAATTCCTCTTTGCCCTATTTTGTCTCCAAAGTCACAAATTCTTTGGTCAAATGCAGTTAAAGCCTTAATCTCAGAAATTTGGTTTGAGCGAAACCAACGAGTCTTCCACAACAATGCTATTGATTGGTCAGACCGGTTTGGATCAGCCCATTTGTTAGCCTCTTCTTGGTGCTCTTAGACAAAATTTTTTGCTGATTATTCTATTCAAGACATTTGTCTCAACTGGAATGCTTTTATATCTTCTTTGTAATCAAGCAAGTAGTCTATTGTCTTTTGTATTCATAGCTTTTTCTTGTAAATGGATATGATGAGGGTGCTATGCGGGTGTCAATCTAGTTGAGATGTCTTGGTGCACCTACTAATCCATAGTACTTTAGTTATTACGATTATTACTATGTATCGTTGTATTTTGAAGCTTTGTAATTTGAGCATTAGTCTCTTTTCATTACATCAATGAAAAGTTCATTTTCGTTTCAAGAAAAAAAAATAAAGGGCGTGGTGTTAATGGAAATGTGGAAGCTTATGTTATACCAAAACTCAGCCCAAGGGATCCATCTACTCCATTGCTTCAGTTGTTCAATACAAAACAGTGAAGATAGGTTTTAAGGCACCTATTAACTCATTCAGTCTATCCATCAGTTTGTGGATGAAAAGTCGCACTCCTTTTTAACACCGTTCCCTGCATAGTGAATAATTCTACCCATAAATGACTAAAAATTTCTATCCCTATCCAATATGTTAGATTTAGCGATTCGATGGAGCCGAACCATTTCTTGCACAAAGACATCAACCACTTGCTTTGCTGTGTAAGGATGACATAAAGGAACAAAATGTGCATACTTACTTAAGCGATCTACTACCACCATGATGGAATTGTTACCCCTTGATTTTGGTGAACCTCAACAAAATCCATGGAAAGTTCTTCCAAACTTGATTCGGTATTGGAAGGAGTTGTAAAAGACCATATGGAGAAAGTGATTATGTTTTATTCCTCCGACAAGCAAGGCATTCTTCGACATATTTTTTGACATTTGCCTTCATTCCCTTCCAATATAGCTAACCAAGAACCAAGCAAAAGATTCTTAGAACAAAGGCGAAATAAGAACAGAAATCAAAGATAAAAGATAACTTCCCAACACAATTCGAGAGAGCTGGAATTTCCTCCTTCTTCAGCCTTCAGCTTCAGAAATCTCCAAATTCAGCATAACTTCCGGAATACCCCCAAAAACTGAAACCTCTCTAGTGGTCCCATGTTGGTCCCCCACCCCATAACTAAATTCATTCTGCATCCCACTATCCTTTCCTCTTTGTAGAATTCCCTTCCCACCCTTGCAATAATAAGTATAACGAACTGGGGGTTTTCTTCTCCATTAGTTTGGTTGCTTGTTTTTCTCACTCCTTTGAGAGTCTGTTACTTGACTTCTTTCTTTTCAATCGATCAATTTGTTTCTTGTTTACAAAAGAACATATTAGAATAAAGAAATCTCGGTTATCATTAGTTATTTCCATTATGAAATTGTTGCTATTACCTATGGAATTTGAGATTGTGTTAACGAAGTTTGTAAAGATATTTCTTGTCTTTATAGTCTCCCTTTCTTTGTCTCTCTTTTTCCTTATAGTATGTTTTATCTCTTGTGTTTTGGGTTTTTTAGTTCAAATGCAATAATTGTTGATGGGACTCTTGTTCAAAAGGATGACACTACTTTCATCACGTGTGGAAGCGAAATTGTTTCGGGTCCGGTCCGGGATGGTTAGTATTTTTTCAAACTCTTTAATATGTGTGAAAGAGAGAGTTTAACTTGGTGGATGAATTCGAATTATTTGGTTTCAAATTGTTGACGTATAATAGGATGGAAGTCCTCTTTAATTTGTTTCATGAAAGTAAATTATTTACTATATTAGTACTTAACATGCATAACTCAGGTTGGCCTTTAATAGTAGTTAAATTTGTCTCAATTGAATTTTAATCAAATGAACTCTCGTCTAATTTATAGAGTGATACAATCTAATTTGAAAGTTAAGGTTATGTGAGAGTACTAGTAAGTGATATATCGAGAATATTAGTAGGAGACTACATCAAAAACTCTTGGATTCACTAAGTTTCTCACTAACATATTCGATTGGATGGTTTTGTTGGGAAAGAACAGCCCCTATTCCATAACTTGAAGCATCAATTGCAACTTCAAAAGGATTATTATAGTCGGGTAATGCTAGGACAGGGGCATTAATTAGCTTACTCTTTAAAATACTAAAGCTTTTTGTTTGAGCTTTGGTCCAACAAAATTTTCCCTTCTTTAGACACTCGGTAATAGGAGCCGCAATTGAGCTAAAGTTATTGATAAATTTCTTGTAAAATGAAGCAAGGCCTAGGAAACTTTGGATTGCTTGGACTGAGTTATGCAAGGGCTAGTTTTGGATGGCTTGGATTTTGTTTGGATCCACAGATAACCCAAATTCATTTATGATGTAACCTAAAAAATTAATCGAATCAGTTGAAAAAAGGTAAGTTCTCTCTTCTCTTTCCTCTCTCCTCTCCCTTCTCTGGTGACCTTTGCTCTTTTCCCTGGTCTTCTCCGCCTCAAGTGCCACCAGCATGGAAGTAGTAAGTAGCAAGGTGAACGACTCTTTCTTCTACATTTGGTCCGATTTGGGTTGGTACTTCGTTGAGGATATGGAGGTCAACAAAGTCCTGACATTGGCCAAGTCTCATCTTCATTGGTTTGTGGAGCAAATAACAAAGCTTCTCCAGGGTCCGGGCAATCGTCTTTTCCTAAAAAATGAAAGGAACAATTCAGGGGCAACGAGGTTGTCCAAATTCAGAGATACTAGTGGATGGGTTATGAGATGTGTAGTATGGCCAGTCACTGGAGGGCGTTATTATATCCATGTTCTGATGGGGCTGTCGCAGCAAGGGTGGCGTTCCTTCCTTGGAATGCTTAATGATTTTTTAAGTAAGTTTAAACCCGTTGAACCCAAGTTTTCGAAGAGGCCGGTTTCAACTTCCTCGGAGCAAATCTTCACGAAGATTCAAAACTTGAGCTATGCTGATATTGTAAAGGGTCAAAGAAAACCATCATCCTCTGTTTCTTCTTCGTTAAAATAGAGTTTGAAAAAGGAAAAACAATCCTCTGTTTCAGCATAGGCAAAACAGAGCAACAATCTTTTCTGGGTTCAAAAGAGTCACGATGTGTTCAAGGAAGATTTTAATAAGTTATGGATTATATCCAAGCTATTCATGTTTAATGACTGGAAGGAGATAGCTAAGATCTTGGAAGAATACTTTCAAACAAAAGTCATCATCAATCCCCTGTTCACAGACACTGCACTGGTTAAAATCGATCAGGGTAATTTGGAAGATTTGATTGAGGTTCCGAGTAAATGGTAGGCTTTCAGACCTTTTCAACTGCTATTTGAAAAGTGGAATAAAAATAAACATAGCAGACCTTCTCTAATGAAAGGATATGGGGGTTGGTTATCGATAAAAAACCTTCCATTAGATTACTGGTGCAGAAACACTTTTGAAGCAATTGGAGCTTATTTTGGAGGCCTTGAAAACATTGAGATAAAGCATGAGATAAAGGATGAGAAGTGCGACAATATTTTTTTAAATTCCGGTGATTTTGAAATAGTTGAAGTTCCTAAGTTCACTCATGGCGATTTATTCATTAAAAACTTCACTAATCCAATTGATTTAGTTCGTCTGAATCAAGTTTCTCCGGATGAAGGCATTGACATTGAAGTTTTGAAGCCTGAATGGTCGTTCCTAACGTCTTCGCAGTCCAAGCCAGAGTTATCAAGAAATCCATTCGCGGCATTTGAAAAAGTAGAATCTAAAAAGTTACAGACCTCTTTGGTGGCAAAGAATTCTTCGCCGGTGATCGGAACTCTCCATGTCGATCATATGGATTCATCATCAAAGATGACGAATCGTCATTTTAGTGTGAACGCATTGAAGGAACCCGAAGAGTCAGGTTTAATGAGGAAAGAGAGTTTAGTTGGCGGATTGATTAATGGGCCCAATGACGTGCTCTATTTTATCGCTCTTAATAATAATATCACTAAAGTTTCTCAATCTAAGACCATTGAAATTCCCCTGAAGGCAATTGAAAGATCACCTGGTCCCACTTTGGCCGATTCAAGCTCCAAGAAAAACAACTCAACATTTTTTGTCCCTTCGAATCTGTTAGCATCCTCCCCAGTCGAAGTTGTAAAGTTCCCAACAGCCTCACAGAAGTTTTCGCACCTACTGAAGCCGTTTCTCAAGTATTTTTCCAGAAAAAAGAGATTGTTCATCAAAGCAAGTAATTGATTCTAATCCTGATTTGCTAGAAGTTTGCAGCAATCGAGTTCAAGTTCCAGCCCTTTCTAAGAGGTCAGTTATTCCTTCTTCTATATCTCAACAATTCAAGCACTCTGTTTTCACCATTCCTAATTCAAAGATTAAACTTTTGAGAGGTACTCCGTGCTGCCCATCAACTTCTTATCAGAAAAGGAAAAGTGCATCAGATTTTGATTCCCCGATCACTGTTAGCAGTGAGGATTCTGATTACTTGGAATATGGAGTGGACTAGGATGGATCAGTGCAAGAGGATAACTTAGAAATGGATCTCAATGCAATGTTTCAAAAGGAAGCTGATTTGACAGACATTGTGAAGATTGCCTCATGTTTTTCTCCATTACATCCTTCAAAAATTCCTTCTCATTTGAAATCTATTATTGAAGACTGTGGAATATCCTTGGGATGAGCCTTCCATCCTAGGCAAGTTAATTTGTGATCATTTGTCTGCAGCTTAGAAATGAAAATCATTTCTTGGAACACGAGAGGAATTAAGGATTTGTCCAAACGTTTGGCCCTTAAAAGATTCCTTAAAAAGCATAATCCAGATTTAGTCCTAATTCAAGAATCCAAAAAAGAAGAGTTTGATTTAGACTTCATAAAGTCATTGTGGAGCTCGAAGGATATTGGATGGGAGTTCGTGGAATCCTTTGGTAAATCTGGAGGTATGTTGACTACGTAGGAGGGACATGAGTAAACTCTCAGTGGTTGAAACTCTTAAAAGTGGTTACTCTTTATCAGTTAAATGCATCATAATTTGCAAGAAAACCTGTTGGGTTACAAACGTTTATGGACCAAATGATTATAAAGAGAGGAGATTTGTATGGCCAGAACTAACTTCCCTTTCAGATTACTGTCTAGAGGCTTGGTGTATTGGGGGAGATTTTAACATAACACATTGGGCACATGAACGTTTCCCCTTTTGAAGAGTCACAAAAGGAATGTGGAAGTTCACCTGTTTCATAGAATCAGCAAATTTATTGGAAGTTCCTTTATCCAATGGAAGGTATACTTGGTCTCCAGATCCTTAATTGATCGGTTTTTTGTCAATAAAGAATGGGACGATGCGTTTGAAAATACAAGAGTCAATCATCAATCTTGCACTCTATCAGATCATTCCCCTTGTTACTTGAAGCTGGCACGGTTTGTTGGGGTCCCTCACCTTTTCGCGTTTATAACAGTTGGTTGATCAATAAAGATTGCAGCAAGATTATTGAACAAGCATGGAGTAGAAGTCGAATCACTGGGTGGGCTGGATTCGTTCTTAGTGCTAAATTGAGGATAGTGAAGATTGCTATAAAGGCTTGACATGTTGAGTTTCAAGAGAGGCAGGAAAAAAAAGGAAGAGGATCTATTAGCAGAAATTGAAAAATGTGATGCTATGGCTGAATCTTTCCGTTCAGCTTTTGATGAAATTGATACTAGAGCTTCTTTGCAGGCTGAATTAATGAGTCTTTATAGACGGAAAGAAACTTAATCCAAAAAAGTAAGTTAAATTGGATTAGGTTGGGAGATGAAAATTCTAGCTTTTTTCATCGCTTTCTAGCAGCTAACAAGAGAAGAAATTTAATAACAGAGTTGATAAATGATCAAGGGGTACCAACTACATCATTCCGCAAGATCGAAAACCTTATTTTGGGCTTTATGACTCCTTATACACGGGAATTCCAGAGAATTGATATATTCCTCTAAGTCTTGATTGGCAATGCGTTACCTCAATCCAGAATCTCATGCTTATTGTTAGATTTTGAGCATCAGAAATTAGATCGGCTTTAAAGACTCTTGGGAAAAGTAAAGCTCCGTGTCCCGATGGATTCACGGCTGAATTCTTCATCAAATATTGGGACTTGGTAAAAGGAAATTTCCAATCCTTATTTGAGGATTTCTATGAGAATGGAAGGCTCAATGTCTGTGTTCAGGAAAACTTCATTTGTTTGATACGAAAGAAAGAAGATGTGGTTCTTGTTAAAGACTTTAGGCCAATAAGCCTTACTACCTTAACGTATAAGGTAGTGGCTAAGGTCCTTGTTGAAAGATTAAAGAAAGTCATACCATCAATAATAGCCCAATCTCAAAGTGCCTTTATTGGGGGAAGATAAATACTTGATCCTATTTTGATTGCCAACGAAGCAGTGGAAGATTATAGAGCTAAGAAGAAAAAAGGTTGGATACTCAAACTAGATCTTGAGAAAGCCTTTGACCGAGTTGATTGGGGTTTTTTAGAAAAGATTATGGTTTTAAAAAAATTCGACTTTCGTTGGATATCATGGATAATGGGTTGTGTCAAGAACCCCAGGTTTTCGGTCTTCATTAATGGAAGACCAAAAGGAAGAATATTTGCTTCAAGAGGAATTAGGCAAGGTGACCCATTATCACCTTTTCTTTTTCTTTTGGTAAGTGAAGTCTTGGGAGCTCTTCTAGAGAAGCTTCATGGGAATGGCATGTTTGAAGGCTTTGTGGTGGGTAAAGAAAAAATACATGTTTCTATCCTTCAGTTTGATGATGATACCCTTCTCTTTTACAAGTATGATGATGCTATGTTATTTAAATTGAGGCAGACCCTTGAGTTATTTGAATGGTGCTCAGGTCAGAAGATTAATTGGGAGAAATCTGGTTTATGTGGGGTCAATGTAGATGAAGATGAGCTGTTGGCCATGGCTGTAAAGTTGAATTGCAAACCGTGCATTTGCCACTTACTTTCCTCGGTCTTCCCTTGGGTGGATATCCCAAGCAAGTCTTTCTTTTGGCAACCAATGATAGACAAAGTTCACTCGAAACTTGACAAATGGAGAAGATTCAATTTATCTAGAGGAGGGCGAGCTACTTTGTGCAATTCTGTGCTAGATAACCTTCCCACGTATTACATGTCTCTATTTTTAATGCCTGAAAAAGTCATCTCCACACTGGAAAGAATCAGGAAAAAGTTCTTATGGGAAGGGCATAATGGAAGTAAGATAAATCATTTGGTAAAATGAGATATTGTCACACGATCTCAAGATGATGGAGGTCTCGGGTTTGGTGGCATGAAAGCAAAGAATTTGGCACTTTTAGCTAAATGGGGATGGCGGTATGTGGTTGAAGAAATTTCCTTTCGGTGCCAAGTTGTTAGAAGCATTCATGGCAAGGACTCTTATAATTGGCACACGACTAGTAGGAATGGTAATAGTTTGAGAAGCCCTTGGATAAGTATATCTAGAACTTGGTTGAAGGTAGAAGCTTTGGCTACTTATAAACTCGGAAATGGTAACCATATCGCTTTTTGGTTGGACCCTTGGATTGGTCTGGTTCCGTTGAGCATTCGATCCACAACTATACATGGTGGCTCTATTCCCTAAGGGGTCAGCAGCAGCTCATTGGGATGCCATTTTTTCCTCGTGGTCTTTAATTTTCCGTCGTTTGTTAAAGTATGAAGAGGTCCTTGAGTTTCAAGATTTGATTGGGCACATTTAAGATTTGAAAGTCTTGGAAACCTTTGATAGAAGATTATGGTCCTTACAGTCATCAGGAAATTTTTTTGTGAAATCACTTATAGCTCACTTGTCCCCTTCATCCCCAATGGACAAGACGTTCTTTATGGTTCTATGGAAGACTAAATGCCCACGGCTTGTCAACATCCTAGTGTGGATTACGTGTTTTGGTTACCTTAATTGCTCCTCGGTAATGCAAAGGAAACTTCCATCTCACTGTCTTTCTCCATCAATTTGTCCTTTATGCAACCAGGAAGGAGAGGACCTTCAGCACCTTTTCTTTGCCTGCACATATACAATTTGTTGCTGGTGGAGGTTATTTTCTATCTTCAATGTCGCTTGGGTGTTTGGGGGTGTCTTTAGAGACAACATTCAGCAGATTTTAGTTGGTCCAACTTTGAAAGAGGGCCCTCGTTTAATTTGGGTCAATGCGGTTAAGGCAATACTTTCTGAAATTTGGTTTGAACGAAATCAAAGAGTATTCCACGACAAAAATCTTGGCTGGTTGGATCGTTTTGAAGTCGCTCGTCGGAATGCTTCCTCTTGGTGTACCTTATCCAAGCTTTTCGAAGATTTCTCTATCCAAGACTTATATTTAAATTGGCATGCTTTTATTTTCCCTATATAAGTTTAGTTTTTGCTTTGTAACGGACAGCTTGGTCCTTTTGTTTTTGTTTCTGTGGACAATGAATGAATATTACTTATTTTAGGGATATGATGGGGGTGCTATGAGGGTGTCAACCTAGTTGAGATTCTCGGGTGCACCTGCTGATCCTTAGCTTTCTCTTAGCTTTCGATTAGCTCTTTGTATATCAATATTGTACTTTGAGTTTTGTCTCATTTCATTATATCAATGAAGTGGTTGGTTTCCTTTTAAAAAAAATGTAACCTAAAAAATTAATTTCAGGTACTAAGGATACACACTTCTTATGATTGATGAACAACTCATTTTTGGACAAGGTCTTGAATTGTGCAAATGGGAGAGATGTTCCTCTAGAGATTTACTATAAATAAGTATGTAATCAAAATATACCACTAGGAACTTATTAAGAAAGGGATGAAGGATTTGTGTCATTAGTCTCATGAATGTGCTAGGTGCATTAGAAAGACCAAATGACATCACTAACCATTCATAGAGCCCTCATTTGTCTTAAAGGTCGTTTTCCACTCGTCTCCGGGTCGAATTCTTATTTGGTGATATCCACTCTTCAAGTCTATCTTAGAGAAGATTTTGGCTCCTCCAAGTTGATCTAAGAGGTTCGATATTCTAGGGATTGGGAATCTATATTTTTTTGTAATTTTATTAATGGCCTGACTATGTACACACATCCTTCATGACCCGTCTTTCTTATGAGTTAGAAGAACAGGAGCGGCGCATGGGCTTAAGCTTGGTCTAATGTTACCTTTCTCCAACATCTCATTTATTTGTTCTTGTAAAATTTCATATTCTCTAGGGCTCATCGTATAATGTGGAAGATTGGGAAGCATGGAGCCAGGGATGAGGTCAATATGGTGTTGGATGTCTCTTAATGGTGGTAGGGTAGTAGGAGTTTCTAATAGTTTTGGGAATTCTTCAAAAGTCGTCGGATTTCAATGTTGTTACTCATAGAGTTGTTGTTTTGCAAAGGATCCTTTACTACAATTGCCCAAATTTCAGTGTCTTCCTTGGTGATAATAGGTCCACTGTTTACAATAGAGAAAAGTTGTTTTTTTTTGGAGCTAGATCTTATTTTAGCTTTTTTTTTTTCTAAGTGGTTATTACTAGGAAGAAGCTTAATTTTCCTACCCATCCAAGTAAATTCATAAGTATTCTCTCTACCCATATGTATTGCTTGGACATCATACTGCCATGGACGACCTAATAATACATGACAAACATCCATATCAATCACATCACAAACAATTTGATCTTTGTAAAGATTTCCAATAGTAAGAGAAATGGTACAAGTAGAACTTATATGGGCTTCGCCACCTTTCTTTATCCAGCTTACCTTGTAGAGTTGCAGGTGTGGATCGAGTTTGAGATGGAGAGCTTGTACGAGCTTGGAAGATACGCGTTTTCACTGCTTCCACTGTTACTTCCACTGTTAGTTATCACGTTGCACACTTTTTCATTGATGGTGCACCTTGTCCAAAATAAGGAGTGTCTTTGTGGATGATTTTCTGTTTTAGGATCGAGTAAAATTCTCTGCAAGACACAAGACAGCTGTCTGGTTCCAATTCATTGAACTCTTTGATAGACCCCGTATTACTAAGGTGTATGTAAGAAAGCGTGTGAAGGGAGGAGGTAAGCATGTGTGAGGAATTTGTTAGTTATTTGTATGTGGGCCCAGGGAATATAAGTTAGGAGTGAAGAGGGCCTTAGTGAGAAATATATAGGGGAGTTGAGAATTGGGAAGGGTATCTTAATTTTGATCATTGTAATAGAGAGGAAGGGAGCTCTCAAAACTCCCCTGGTTCTGTTGATTAATAAATTGAAGGTTAAGAGCGGTGTTCTAGGGTATGGTGGGTAAGATGGAAGGTAGAGTATCGGCACTTGAAGAGAAAGTATCGAAAATTGTGGATAATCAGAAACTTGTGGAAGCAAGGGTAGAGTCAGGCTTCAGAGGGTTCGATGAGAAGATTGACAGATTGGCGAGTAATATGCAAGTGATACAAGGGCAAATGACATTGTTATTGTCTCGCTTTGATGTCACCATGTCGGAAAGAGGGTCGGCATCGTTGGATCGGACGGTGACAGAGAAAGGGAAGGGTGTAGCTGAAACACCTCTCGGGGAGTCAAAGAAGGCAGAAGAACGTAAAGAGGAAATCCACGCACCTAGCAATAAAGAGGTGCCATTATTTGACATGAGGTTGAGGAAGTTAGAAGTACCTATATTCAAAGGTGAAGAAGAAGAGAATCCATACGGATGGCTACATCGGGTGGAGCGATACTTCATCGTGAATCAACTAACAGAGAGAGATAAGCTGGATGCGGCGGTTCTGTATCTGGAGGGCGAAGCCTTGGAGTGGTATTAGTGGGAAGAAGATTGGAAGCCGATTGGAAGCTGGAGCAGATTTCGAGAGCTATTTGTGGATAGATTCAGACCAGCAAACGAGGTTGACCGATATGCTTGTCAGATGCGGCTGAAGCAGGAGAAGTCGGTGAGGGAATATCAGCGACGACTCAAGCAAATATTCAACAAGCTTAAAAGATTTGGGGGAGAAAGCCCTGGAAAGTAAATTCGTATGTGGGCTTCAAGAAGAAATCCAAAGTGAGATGAGGAAACTAAATCAGGTATTAAGGCCAAAAGACCATGGCCCAATTAATTGAAGAAGGTCAAATAATTCAACAGAAGAGGCTTAAGGGTGATGGGCCGTCCAGTAGATCAGGAGGAAAATCAAGCCCAAACCCGAGAGGGTCGAGCGGGTCAACTGGAGCGGGTGGAAGAAGCGGTTTGGGTGGGTCGATGGTCTTTTACATTGAGCCCTAATCGCCAAACGAGCAACGCTGGAAGTTCGTCAACGACAGTCACACCAATACGCGATCAAAAGAGCAAGAACACAGGGGCTCTTCCGTATTGGCGTTTGACGGAGGAGGAGATGAGGATCAAGAAAGAAAAGGGGATTTGCTTTAAATGCGATGGGAAGTTCAACTTTGGGCATAGATGCAAGAAAAAAGAGCTCCAGTTCATGTTCGTGCAGGAAGGGTAGGATATGTCGCCGTGGGAAGAGGAAGATGTTGAGCACGAATGTATCGGCGGGGATGGAGGTACGATTCAGACCACGGGCAAAAAAGAGGTAGCTCACTTATCCCTAACATCAATGGCGGGATTGAACTCCACCCAGACGTTGAAGGTAAAAGGCATGATCCGAGGCCGAGTGGTAGTTGTATTGATCGATGGAGGGGCCACGCATAATTTCATTGACGAGAACTTGGTCGCTTCACTCCAACTGCCTCTGTCTGCGACAAATAGTTTTGGAGTAGTGCTCGGGGCTGGAGGATCCATCCAAGGAGCAGGAATTTGTAATAGTGTGACACTCACGATCTCTGATTTAACAATCACTCATGATTTTCTTCCGTTGCCTCTTGGAAGTGTAGATGTGATACTGAGAGTGGTGTGGTTGGAAGCTCTCGGGAAGATTGTGATTGATTACCGGAGCGTCAGTCATGGAATTCACAGTGGGAGAATGGTTGGTGGAGTTACATGGAGATCGAAGCTTTGTCAAATCTCAAATCTCTTTAAAATCCCTGATGAAATCAATGGAGGTGGAGGATCAAGCTCTTTTGATTGAGCTTAGTACAGTGGAGTATGAGCTCGGTAAGGATAAGGGTGCAGAAGTCCAGCGTTCGGATGTGTGGCCAAGGGAAATCCAAGTGTACTAGCCCACTACAAACAAGTGTTTGTGGCAAAAAATGTGTTGCCACCGTCCAGAACACGATCATGCTATCGGATTGGAGCTTGGGGCTGGTTCAGTCAACGTGTGCCCTTACAGATATCCACAGTTTCAAAAAGATGAGATTGAGAAGCTTGTAAAAGAGATGTTGATGGCAGGAATCATACAACCGAGTCGTACCTCTTTTTCCAGCCCGATTCTGTTAGTTAAGAAGAAAGATTGAAGCAATGGAGAGATGGCCGGTTCCTCGAACCATAAAGGAACTACGGGGATTTTTGGGGCTGATAGGATATTGCAAGAAGTTCGTTGCAAATTATGGTGCGATAGCTCGCATTGCCCAACTTCGAAGAAACATTTGTTGTAGAAACTGATGCATTAGGAGTAGGTGTTGACGCTGTGTTGATGCAGAACAAGCGGCCATTGGCCTTCTTTAGTCAAGCTCTTCAACTATCCCACCGCTTTAAGTTGTGTATGAACAAGAACTTATGGTTACTGTGTTTGCTGTAAAGAAATGGCGACCGTATTTATTGGGTCGACGTTTTACCGTGCGTACCGATCAGAAGAGTCTGAAATTTCTATTGGACCAACGCTTAATTGCAAGGGAATACCAACGTTGGATAGCTAAGCTGATGGGGTATGATTTTGCCATAGAATACAAGAAGGGGATGGAGAATACAGTTGCGGATGCATTATCTCGATTACCTATGTGATGTAAGAAGAATAGGGGTTAATTGGGTAATTAGTTAATATTCTAGGTTAATTCCCTTTTTACCCCCACTTGTAATAAAGCTCTATAAATAGGAACCTTCCCCTCTTGTATCAAACACATTATTCATTCTAATAAAGATCCACAATATTGATTCTTGGAGAAAATTCTCTTCGTTATCTCATTAGGCTACATTAATTTGGTATAAGAGCGACCGTCTGGGCCAACGTTGACTGCCGCCGGTGGAAGACCGAAAAACTCTTAGTGAAGACGATCACAAGAGGGAGCTTCGTGAGTTGTGCTCGTTGCTAAACAATCCACATCTAATTAGGGGGAACCCTTGAAATTCCAAGAAAGAAATAACAACCTAAAGATTTTTCAAGATTTTGCAAGAATGGATTATTACCGATCAAAGAAGACGTTCAAGAAAAAAATCAGCCGTTGTTATAGGCAAGATAATCAATTTTACTTCCAAAAACAATGTTGGGATTCAGACTTAAGTGATTCAGAAGAAGAATTTCAACATACTAATTTTGCCCATGCTCGATTTTCTCAAAGAACCTATCATATGCATTATCCACATTGGAAACTTGATTCTAGTGATTCTAGTGACTTCGAAGACGTTTCTAAGTCAGTTTGGGATGGAATAAGAAGGCGACAATATCGAAAAAAGGCTCAACGATCCAATCAAACCCATTTTTCAAGAAATCAACATAAAGATTGGGAGTTAGATGAATTAAAGAATCATCAAGAATCGTTAAGAATGGATTTTTATAGCCCAAGAAGAATGTTCAAGACTCGAAATTATTCTTTTAGTTACGAAACACAATCAAGAAAGGTTGATTCAAGAAAATCAATGCTTGAGTTCTATACCCACAACGAGATTGAAACATATCCAAGAATCCAACAAAATGACTACTTTGATCCCATAACATTGGAGAAAATAAAGAAACCAACAAAAGAAATCCAACGTTCATTGGAAAAAACGACTCAACATATTGATCATTTATCAACTCATTTGCAAGAGTTTAAGAATGGATTGATTTCATATACAGAAGAAAACACAAAAGAATCACAGTTGGAAACTTTTCAATTAGAAAATCTTGAAGATAAAATTGGAAAAAATCTTGTTGAGAACCATGTAGTTAATGGAGGTGATTGGGACAAAAATCATGAGAACGAAGTAAAAGAAGAAGAAAAAGAAGAATTTAAAGATGTTAATTTGGAAATTATCTCAAGTGAAGAAGAAAAATTCAACGAAGAAGATGGAATAGAGGTATCGAACACTGAGCAAGAATCGAGCAAGAACAATGACACATTGATTGCGTTCGGAATAAATCACCAAACTGTCGAAGAACTTCAACGAATGATAAAGAAGATGTGATTCTTGAACAATTTTCTTTGGAAGGTGGTTTTTTAGTGATTGATCCTTTGAATACGGTTTTGAATAAGCTTGAAGGAAACATCTTTTTGGGTTTATGGCCAATGGGGAAGTTCATCTTTTGGTGGCAATAATGGATTTTTTGTCTCAAATTTATCACTTTGATCATAATATTTTTGTTCAAAATGTAGGAAGTTGATTTGGTCATACTCCATTGAACAAGTTTATTTCATACCTTGTTTGCTTCATAATTTTTTTTCTTTTTAAAACTTAGGGACGATTTTTTTTTTTTTTTTTTTTGGAGGGGTAGAATGATGTAAGAAGAATAGGGGTTAATTGGGTAATTAGTTAATATTCTAGGTTAATTTCCTTTTTACCCCCCACTTGTAATAAAGCTCTATAAATAGGAACCTTCCCCTCTTGTATCAAACATATTATTCATTCTAATAAAGATCCACAATATTGATTCTTGGAGAGAATTCTCTTCGTTATCTTTTTAGGCTACATCACTACGGCAATGGAGTTTGGTTTGTTGAGTGTGGTAGAAGGATTGAACACGGTTGTGTTCAAAGAACAGATTGAAGCAGATGCGGATTTTAAAGGTATACGGTAAGCTATTGAGGAGGGGCAGTCGGCGTTTAATGGGTACTCGATCAGTAAAGGGGCGCTTGGTTCTCCCATCTCATTCTCCTACTCTTCTGCTTTTATTAAATGAATTTCATAGTAGTCCGATTGGGGGTCACCAGGGAGCTCTTAAAACGTACCAGCGGTTAGCCAAAGAAGTGTATTGGCAAGGAATGAAAGCACGGGTACGAAAATTTGTGGCTGAATGTGAAGTTTGTCAACAGGTGAAATATTTGACATTGGCGCTAGTAGGACTACTTTAGGCTATACCAATTCCTGACCGAGTGTGGGAAGACGTTGCCATGGACTTCGTGGATGGGTTACAAAATCTGATGGGTATGAGTTTTTCATGTGTCCCTTTTAAAGAAAGTGGTGGGAAAGAATATGCCTATATTACCATTGGAGAGTCCCAACTCTCTCGAATAGCTGGTTTATCTTTTCCCTTGTGATTTCTCTTGTTCTTGATCTTGATATTGTAAGAGGAACTTGTCTCTTGAATATATCATAGTATTTTCCCTCTTGTTGCTCTGTTTTGGAGTGGTTTTTGTGAGTTTTATCGCAGGGAATTGTTAGTTACCTAACAGTTTTCTCCGCTTGAAAGGCAAAGACCGCAAGCCAATTGGTTCAATATTAGGTCATTGTTTAACTTCCAGAATATAGACAAGCTAAGTAAATAAATTTATTTTTCTTAGGTTCTTTGACTTCCAACAACACTTGCCAATTCCTTTTTATTTTTTTTTTCTTTTTTGATAAGAATTTATCAATTCTTGTTTAATTTAAGCCAATTGGAGAATATCTTATGCTCACCTTTGGTTTGGCTTTGAATACTTTCAGGAATTTTTTCTTATTCAAATATTCTTATAGAGAAACACGAAAGATAACTGCACATATATTTCTTGTTTTTTTAGGCCAAATATCATTTAGCTGGCCTAAAAAAACAAGAAATATCTGTAGAACTATCTTAAGTAAAAGCTTTTGTCTTATGGTTATCTTAATTTTCAGGGCATTTGAGCTATAAATTTGAAATGTTGTCAGACTCAAGAATATGCAAGGGCTTAAAGGTGATTGGTTAATTGTCATGTTATGGTTATCTAAATGTGATATTAGATTCTACCATCCTTTTAGAAATACTCAATTTTAGCCGTTGTTGAATTTGTCCAGATTTCTATGGATGTTGAGCATGCAAAATGCAGTATTTGCTTAAACATTTGGCATGATGTTGTCACAGCCGCTCCTTGCTTTCACAACTTTTGGTGGGCCTAT

mRNA sequence

GAGAATGAGAGAGTGAGACAACGACAAATATGAGAGGGATATGTGAGAGAGGAGAGTGAGTTGTAAAAGAGAAGAGAGTGAGAGCCCATTTCTTGTGTGCAAGTATCAGCCATCAGTCGCTCTCTTCCCCTTCTCCTTCTCCTACTGGCGGTTATGGCGGAAATTGGAGAGTGCTCTGCTTCAAATTCCTCCATAGAAAACTGGGCGAAGCTAATTCCACCTGATAAAATATATGCAGATATTGAGATAAACTCTGATGAGACCATTATCTTCTCGGAGACAAAATCCACATCTGTTGAAAAACACGAATGGTGCAAAATCACAAGGAATCCAGATCTAAAGTCTGCCACTCTGCAGAATAGAAGTTCAAATGCAATAATTGTTGATGGGACTCTTGTTCAAAAGGATGACACTACTTTCATCACGTGTGGAAGCGAAATTGTTTCGGGTCCGGTCCGGGATGGGCATTTGAGCTATAAATTTGAAATGTTGTCAGACTCAAGAATATGCAAGGGCTTAAAGAAATACTCAATTTTAGCCGTTGTTGAATTTGTCCAGATTTCTATGGATGTTGAGCATGCAAAATGCAGTATTTGCTTAAACATTTGGCATGATGTTGTCACAGCCGCTCCTTGCTTTCACAACTTTTGGTGGGCCTAT

Coding sequence (CDS)

ATGGCGGAAATTGGAGAGTGCTCTGCTTCAAATTCCTCCATAGAAAACTGGGCGAAGCTAATTCCACCTGATAAAATATATGCAGATATTGAGATAAACTCTGATGAGACCATTATCTTCTCGGAGACAAAATCCACATCTGTTGAAAAACACGAATGGTGCAAAATCACAAGGAATCCAGATCTAAAGTCTGCCACTCTGCAGAATAGAAGTTCAAATGCAATAATTGTTGATGGGACTCTTGTTCAAAAGGATGACACTACTTTCATCACGTGTGGAAGCGAAATTGTTTCGGGTCCGGTCCGGGATGGGCATTTGAGCTATAAATTTGAAATGTTGTCAGACTCAAGAATATGCAAGGGCTTAAAGAAATACTCAATTTTAGCCGTTGTTGAATTTGTCCAGATTTCTATGGATGTTGAGCATGCAAAATGCAGTATTTGCTTAAACATTTGGCATGATGTTGTCACAGCCGCTCCTTGCTTTCACAACTTTTGGTGGGCCTAT

Protein sequence

MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNPDLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICKGLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNFWWAY
BLAST of Lsi01G020640 vs. TrEMBL
Match: A0A0A0KYR0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G508530 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 8.4e-73
Identity = 138/165 (83.64%), Postives = 148/165 (89.70%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAEIGECSASNSS+ENWAKLIP DK+YADIEINSDET+IFSETKSTSVEKHEWCKITRN 
Sbjct: 1   MAEIGECSASNSSMENWAKLIPTDKMYADIEINSDETVIFSETKSTSVEKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL +ATLQNRSSNAIIVD TLVQKD+TTFI CGSEIVSGPVRDG+LS+KFEMLSDS++CK
Sbjct: 61  DLNTATLQNRSSNAIIVDETLVQKDETTFIKCGSEIVSGPVRDGNLSFKFEMLSDSKLCK 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           GLK            IS+DVEHAKCSICLNIWHDVVTAAPCFHNF
Sbjct: 121 GLK------------ISVDVEHAKCSICLNIWHDVVTAAPCFHNF 153

BLAST of Lsi01G020640 vs. TrEMBL
Match: M5WHB6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005407mg PE=4 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 2.3e-46
Identity = 92/165 (55.76%), Postives = 118/165 (71.52%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAE+GE S +  S E WAKL+P D  Y D+EI+SDE +I SE   +S +K +WCKITR+ 
Sbjct: 1   MAEVGESSGAKPSCEIWAKLVPSDSRYPDVEISSDEIVICSEILFSSTDKWKWCKITRSS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           D  SAT+QN+SSNAI VDGT++Q +DT  I CGSEI SGP ++G+LSY+F +L     C+
Sbjct: 61  DHSSATIQNKSSNAIFVDGTVIQAEDTVVIRCGSEITSGPDKEGYLSYRFNVLPGPETCQ 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
                      + ++ISMDVEHAKC ICLNIWH+VVT +PCFHNF
Sbjct: 121 -----------KQLKISMDVEHAKCCICLNIWHEVVTVSPCFHNF 154

BLAST of Lsi01G020640 vs. TrEMBL
Match: B9N5A6_POPTR (Zinc finger family protein OS=Populus trichocarpa GN=POPTR_0004s14500g PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 4.4e-45
Identity = 91/166 (54.82%), Postives = 114/166 (68.67%), Query Frame = 1

Query: 3   EIGECSA---SNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRN 62
           E GECS+   S+SS E WAKL+P D  Y+D+EI S+E +I SE  STS+EKHEWCKITRN
Sbjct: 2   EFGECSSGSNSSSSNEAWAKLVPSDSRYSDVEIRSNEMVICSEITSTSLEKHEWCKITRN 61

Query: 63  PDLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRIC 122
            D  SA +QN+S N I+VD   VQ +D   I CGSEI+ GP R+G+LSY+F+++      
Sbjct: 62  SDQSSAMMQNKSLNTILVDEATVQNEDDVVINCGSEIIPGPAREGYLSYRFKLMPRESFT 121

Query: 123 KGLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           + LK            +S+D EHAKCSICLN+WHDVVT APC HNF
Sbjct: 122 RWLK------------VSIDAEHAKCSICLNVWHDVVTVAPCLHNF 155

BLAST of Lsi01G020640 vs. TrEMBL
Match: A0A067F5J8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019461mg PE=4 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 6.3e-44
Identity = 88/165 (53.33%), Postives = 118/165 (71.52%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAE+GECSAS  S E WAKL P D  +AD++I+S+E +I SE  S+S +KHEWCKITRN 
Sbjct: 1   MAEVGECSASKPSREIWAKLEPSDSRFADVDISSNEVVICSEITSSSSDKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL SA +QN+SSNAI+VD T+VQ ++   I CG+EI+ GP R+ +L+++F+++       
Sbjct: 61  DLHSAKMQNKSSNAILVDDTMVQNEEVVDIKCGTEIIPGPDREVYLNFRFKVVPVQESSN 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
                      + ++IS+D+EHAKC ICLNIWHDVVT APC HNF
Sbjct: 121 -----------QQLEISIDIEHAKCCICLNIWHDVVTVAPCLHNF 154

BLAST of Lsi01G020640 vs. TrEMBL
Match: A0A061DVR2_THECC (RING/U-box superfamily protein OS=Theobroma cacao GN=TCM_003224 PE=4 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 3.1e-43
Identity = 87/164 (53.05%), Postives = 116/164 (70.73%), Query Frame = 1

Query: 2   AEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNPD 61
           AE+GE S S  +   WAKL+P D   +D+EI S+E I+ S+  S+S EKHEWC+ITRNPD
Sbjct: 10  AEVGESSTSKPTSHFWAKLVPLDAQLSDVEICSNEMIVSSQVTSSSQEKHEWCRITRNPD 69

Query: 62  LKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICKG 121
           L +A ++N+SSN ++VD  +VQ++D   I CG+EIV GP R+G+LSYKF+++   + CK 
Sbjct: 70  LLTAMMKNKSSNDMLVDDAVVQREDVVEIKCGTEIVLGPNREGYLSYKFKLMPGPKTCK- 129

Query: 122 LKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
                       ++I +DVEHAKCSICLNIWHDVVT APC HNF
Sbjct: 130 ----------RQLKICVDVEHAKCSICLNIWHDVVTIAPCLHNF 162

BLAST of Lsi01G020640 vs. TAIR10
Match: AT1G47570.1 (AT1G47570.1 RING/U-box superfamily protein)

HSP 1 Score: 163.7 bits (413), Expect = 9.9e-41
Identity = 85/167 (50.90%), Postives = 111/167 (66.47%), Query Frame = 1

Query: 2   AEIGECSASNSSIEN-WAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 61
           AE G+ S S  S ++ WAKL+P D  ++DIEI  ++ +I SE K +S+EKHEWC+IT+N 
Sbjct: 5   AETGQSSGSKPSDDDAWAKLVPLDTRFSDIEIRCNDMVICSEIKPSSLEKHEWCRITKNL 64

Query: 62  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEML--SDSRI 121
              SAT+ N+SS+AI+VD  +V KD    I  GSEIV GP   G+L Y+F ++   +SR 
Sbjct: 65  GQSSATIHNKSSDAILVDKAVVPKDGAVDIISGSEIVPGPEEQGYLQYRFTIMPAPESR- 124

Query: 122 CKGLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
                        + +QIS+D EHAKCSICLNIWHDVVTAAPC HNF
Sbjct: 125 ------------TQLLQISIDPEHAKCSICLNIWHDVVTAAPCLHNF 158

BLAST of Lsi01G020640 vs. NCBI nr
Match: gi|778695012|ref|XP_011653910.1| (PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X3 [Cucumis sativus])

HSP 1 Score: 282.7 bits (722), Expect = 4.2e-73
Identity = 139/165 (84.24%), Postives = 150/165 (90.91%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAEIGECSASNSS+ENWAKLIP DK+YADIEINSDET+IFSETKSTSVEKHEWCKITRN 
Sbjct: 1   MAEIGECSASNSSMENWAKLIPTDKMYADIEINSDETVIFSETKSTSVEKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL +ATLQNRSSNAIIVD TLVQKD+TTFI CGSEIVSGPVRDG+LS+KFEMLSDS++CK
Sbjct: 61  DLNTATLQNRSSNAIIVDETLVQKDETTFIKCGSEIVSGPVRDGNLSFKFEMLSDSKLCK 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           GLK      V+    IS+DVEHAKCSICLNIWHDVVTAAPCFHNF
Sbjct: 121 GLKVIGNCHVL----ISVDVEHAKCSICLNIWHDVVTAAPCFHNF 161

BLAST of Lsi01G020640 vs. NCBI nr
Match: gi|778695005|ref|XP_011653909.1| (PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X1 [Cucumis sativus])

HSP 1 Score: 282.7 bits (722), Expect = 4.2e-73
Identity = 139/165 (84.24%), Postives = 150/165 (90.91%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAEIGECSASNSS+ENWAKLIP DK+YADIEINSDET+IFSETKSTSVEKHEWCKITRN 
Sbjct: 1   MAEIGECSASNSSMENWAKLIPTDKMYADIEINSDETVIFSETKSTSVEKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL +ATLQNRSSNAIIVD TLVQKD+TTFI CGSEIVSGPVRDG+LS+KFEMLSDS++CK
Sbjct: 61  DLNTATLQNRSSNAIIVDETLVQKDETTFIKCGSEIVSGPVRDGNLSFKFEMLSDSKLCK 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           GLK      V+    IS+DVEHAKCSICLNIWHDVVTAAPCFHNF
Sbjct: 121 GLKVIGNCHVL----ISVDVEHAKCSICLNIWHDVVTAAPCFHNF 161

BLAST of Lsi01G020640 vs. NCBI nr
Match: gi|659082892|ref|XP_008442085.1| (PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X1 [Cucumis melo])

HSP 1 Score: 282.0 bits (720), Expect = 7.1e-73
Identity = 138/165 (83.64%), Postives = 148/165 (89.70%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAEIGECSASNSS+ENWAKLIP DK+Y DIEINSDET+IFSETKSTSVEKHEWCKITRN 
Sbjct: 1   MAEIGECSASNSSLENWAKLIPTDKMYVDIEINSDETVIFSETKSTSVEKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL +ATLQNRSSNAIIVD TLVQKD+TTFI CGSEIVSGPVRDGHLS+KFEM SDS++CK
Sbjct: 61  DLNTATLQNRSSNAIIVDETLVQKDETTFIRCGSEIVSGPVRDGHLSFKFEMSSDSKLCK 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           GLK      V+    IS+DVEHAKCSICLNIWHDVVTAAPCFHNF
Sbjct: 121 GLKVIGNCHVL----ISVDVEHAKCSICLNIWHDVVTAAPCFHNF 161

BLAST of Lsi01G020640 vs. NCBI nr
Match: gi|659082896|ref|XP_008442087.1| (PREDICTED: uncharacterized protein LOC103486049 isoform X3 [Cucumis melo])

HSP 1 Score: 282.0 bits (720), Expect = 7.1e-73
Identity = 138/165 (83.64%), Postives = 148/165 (89.70%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAEIGECSASNSS+ENWAKLIP DK+Y DIEINSDET+IFSETKSTSVEKHEWCKITRN 
Sbjct: 1   MAEIGECSASNSSLENWAKLIPTDKMYVDIEINSDETVIFSETKSTSVEKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL +ATLQNRSSNAIIVD TLVQKD+TTFI CGSEIVSGPVRDGHLS+KFEM SDS++CK
Sbjct: 61  DLNTATLQNRSSNAIIVDETLVQKDETTFIRCGSEIVSGPVRDGHLSFKFEMSSDSKLCK 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           GLK      V+    IS+DVEHAKCSICLNIWHDVVTAAPCFHNF
Sbjct: 121 GLKVIGNCHVL----ISVDVEHAKCSICLNIWHDVVTAAPCFHNF 161

BLAST of Lsi01G020640 vs. NCBI nr
Match: gi|778695009|ref|XP_004146401.2| (PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X2 [Cucumis sativus])

HSP 1 Score: 281.2 bits (718), Expect = 1.2e-72
Identity = 138/165 (83.64%), Postives = 148/165 (89.70%), Query Frame = 1

Query: 1   MAEIGECSASNSSIENWAKLIPPDKIYADIEINSDETIIFSETKSTSVEKHEWCKITRNP 60
           MAEIGECSASNSS+ENWAKLIP DK+YADIEINSDET+IFSETKSTSVEKHEWCKITRN 
Sbjct: 1   MAEIGECSASNSSMENWAKLIPTDKMYADIEINSDETVIFSETKSTSVEKHEWCKITRNS 60

Query: 61  DLKSATLQNRSSNAIIVDGTLVQKDDTTFITCGSEIVSGPVRDGHLSYKFEMLSDSRICK 120
           DL +ATLQNRSSNAIIVD TLVQKD+TTFI CGSEIVSGPVRDG+LS+KFEMLSDS++CK
Sbjct: 61  DLNTATLQNRSSNAIIVDETLVQKDETTFIKCGSEIVSGPVRDGNLSFKFEMLSDSKLCK 120

Query: 121 GLKKYSILAVVEFVQISMDVEHAKCSICLNIWHDVVTAAPCFHNF 166
           GLK            IS+DVEHAKCSICLNIWHDVVTAAPCFHNF
Sbjct: 121 GLK------------ISVDVEHAKCSICLNIWHDVVTAAPCFHNF 153

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KYR0_CUCSA8.4e-7383.64Uncharacterized protein OS=Cucumis sativus GN=Csa_4G508530 PE=4 SV=1[more]
M5WHB6_PRUPE2.3e-4655.76Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005407mg PE=4 SV=1[more]
B9N5A6_POPTR4.4e-4554.82Zinc finger family protein OS=Populus trichocarpa GN=POPTR_0004s14500g PE=4 SV=1[more]
A0A067F5J8_CITSI6.3e-4453.33Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g019461mg PE=4 SV=1[more]
A0A061DVR2_THECC3.1e-4353.05RING/U-box superfamily protein OS=Theobroma cacao GN=TCM_003224 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G47570.19.9e-4150.90 RING/U-box superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778695012|ref|XP_011653910.1|4.2e-7384.24PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X3 [Cucumis sativus][more]
gi|778695005|ref|XP_011653909.1|4.2e-7384.24PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X1 [Cucumis sativus][more]
gi|659082892|ref|XP_008442085.1|7.1e-7383.64PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X1 [Cucumis melo][more]
gi|659082896|ref|XP_008442087.1|7.1e-7383.64PREDICTED: uncharacterized protein LOC103486049 isoform X3 [Cucumis melo][more]
gi|778695009|ref|XP_004146401.2|1.2e-7283.64PREDICTED: E3 ubiquitin-protein ligase CHFR isoform X2 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR013083Znf_RING/FYVE/PHD
IPR000253FHA_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0044267 cellular protein metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016874 ligase activity
molecular_function GO:0046872 metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi01G020640.1Lsi01G020640.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000253Forkhead-associated (FHA) domainGENE3DG3DSA:2.60.200.20coord: 52..97
score: 6.
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3DG3DSA:3.30.40.10coord: 141..167
score: 5.
NoneNo IPR availablePANTHERPTHR16079UBIQUITIN LIGASE PROTEIN CHFRcoord: 3..165
score: 1.4