Moc11g16400 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc11g16400
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Location: chr11: 12530233 .. 12551154 (+)
RNA-Seq Expression: Moc11g16400
Synteny: Moc11g16400
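The location given in the overview can be turned into a genomic span length with a little arithmetic. The sketch below is illustrative only: `parse_location` and `span_length` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("chr11: 12530233 .. 12551154 (+)")
length = span_length(start, end)
print(chrom, strand, length)
```

Under that assumption the gene spans 20,922 bp on the plus strand of chr11.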
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCGCATTTGTGTGCGCATTGCCTCATCTCTCTCTGGAGCGATCAAAGTTCTCGCTTGTTGGAGCCGGGGCTGGACCCCGGGCGCCCTTCTTAGAGGTCCCACCTCGGCCACGGGTGGCTTGGACGTCCTCGGGTGCGCAGGCGGGGGTAGGGCAGGGCAGTGATCCGTGCGACCTGCGGAGGGGTTCCGTTGCTAGGCTGTCGTGACCTTGCCCTCTACCACTGCTGCTCCGACCTCCCTCTGGTGGGCATCGCTGGCAGCTAAAGTTCTTCGATCTGTCGTATTGGTCGAGTTCGCTGGTTGAACCATGTTCGCTCGCGTGTTGGTTTTAAAGAATTGGGGATGACTGAGATTGTCGTTCCCCACAGACGGCGCCAAATGTTTATGCAAGGATATGCACCACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCTGACGATCAAGTCAGTATAAGTCGGATTCCCAGTTTAGTTCGAGGTCAGAATCATCGTACCTGATCGTGGAGTGAGGCCCTCTACTTATGGGTGGTAGTCCCTCTAATCTCGCGACGGTTACATTCGATATCTCGGGACCGTCAGTTACACCAGGTAATCTCGGGACCGACGGTTACGCCCGATAATCACGCCGCTGACTGTTTACTCACATCGGCCCCTACCGAGCTCCCCTGTAGGTCGGACCTGGGCCAGGTTCACCTCGGCCCTCATTCTTAGCATTTGTCGACGTTAGTGGCGGCCATCTCGGCTATCCGAGCTGGGCATGACTCATGAGCCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCAGGGTATTCCCTCCCCAAACACACTTGGTTTTGGATAAATAGATGGAATTATGAGGAATATATATGTTTTATAGGGAAAAGGAAGGAAAAAAAAGCAGCAATGCACCTACTGCAGATGGACACATAGGCTATCTTAAAGCTAGCAGCAATGCGTCTGGCGCATCGCCAACACTCCAGATACTTCAGCAATGCGCCTGGCGCATTCTCCACCTACAAATAAGGGTTTGTTGACCGGGCAGTTAGAAGAAATCTCAACCCATCCATTGATCCGTGCAACCAAATGCTGTTAAGCCCCATTGTAGCTCCTCCATCAAATACCCAAGATTTTCTTCCACAACTTGCAAGGTTAGCTTATGAATACCAAGTTCCATCAAGCTTCAAAGGGGGAGCTCACTTGACATTTAAATTCCAGCTTGCTTTCTCTCTAGAATTTATGCTTTATAATTTGTTTTGTCTAATTGTGTTCAACATTCCTAGAATTATTTTTATTGCAATTCCTTTTTGTTGTTATCATTTTCTTAGATTTTGGTTCTCGATTTTAATTTAGCTATTGTAATTCTCTTAACTTAATATGACATTTTATTTCACTCCATTTCTTATTATTCTATTTTTCAGTTTCTATGAGTAGCTAATTGCTTTTCAACCCTCCTACGAATGACTTAGTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGGACATTCTCCCTTATTAGATGCCCATTTCTTTCCCCACCCACAGTCCATATCAGCTTAGCAAAAATATTCTCACCTACTCATATTTTCTTTGCTACTAGTTTGGGGGTATGTTGGGAGCATTTTCATGCTTTGTCTTTAGTTGCATTGAGGAAATGCAAGCTTTTCAGTTTTGGGGTGTGCACATTCGCTTTGCTTCTAACACTTTTTTAGCTTAGTTTTGTACAATTATGAATCTTTCTAATCCTGTAATTCAGTTAAAAAAAAAAAACTCAATATCATTAGAGGCTGCGGATTAGGAAAGCTAGTAGCCCATAGTTGAGAGCTCAATCCAGTAAAAAAGAGCTAGCGGTTCAAAGCAAGTGAAAGGGA
GTTTAGAAAAGAAGTCTCTAGGTGGGTCAGGTTTGTGAAAGTAAAGTAGGAGGATGCTAATCCTAGGACAATATCCTTTTCTAGAGTTAGGCAAGATCTTAGCTACTCTCTTTCGTTGTCCAAGGTGGAGGCCCGAATGAATGAGCTCTCAAATTTGTGTGCATTGTGGGTGAACCGAGTAAGTAGAGCTAGTTCAGCATTTGATGCATTGCATATTCAGTACTTTTCTGTGGGGTGTAAAGTGTAGTCTTGCCTTGAGGATAAGCAATGATTTAAGTTTGGGGGTGTGATAACTCTACAAAAGAGAGCTATTCTAGACATTATTAGGGAAGTTTTTGTGCAATTTATTGTTAATGTGTGTCTATTTTAGCTCGAGAAAATCCTGAATAAAATAGAAAATTAATTATGAGATGAAATCTCGAGCAAATAATGTTTTGGCTTAAGGAAGTGCTGAATTCTTTCTATATATCGGCTCTATTATGCAAGATTGAAGTGAGAACCATAGAATGAGATAAACGTTGAGTCCAAAGAGCTGTTGACAAAACACAGTCAAACCTGAGTAGGAGAATGCCGGAGTGTTGCAGGGTTAGAGCATCCCGATTTTGGACAGCATCTCTTTCTGTAGAAATTTATGGGCAGCACAGTAACACGCGGGGTTCTCACTTATGTCCGTACAACGCGCGTTTTACTGATTTTTGCCCTAGTTTTTTGGGGGAGTTATAAAATCCAAGATGTGGGTTGCCTCTTCATCAATTTCCCAGGGGAAAAACCCAAGAGAGGAAATAGCTCCCAAAACGTGCTGAAGACAAGGAAAGCTACCCCCATTTCAAGAAGAAGTCGATGGGAGGAGGCTGGAACTGCTGGTTTCTCACTGGCTGTAGGTTTCGAGAATTGCTCAAGCTTTTTCACCAAATGAGTTGTTTTCTCTCTTTGCCCATTGTATTTTTTTCTCATTCTCCATGAACTAAACCCCATTTTTTGGGGAGACCTTTGGTGTAGCTTGTGAACGAGGCTCTCTATCTGTTTTGTTGCTGTTAAATAGAGTTGATATAATATTTTCATCTCACTGATCTATGCTGGCACATTTATTAGTTTGTTTGATCTTGTATGACTCGGAAGGGTATATAGGTAGCGTGTTTGGCTTTTTTTGTGGTTAGCTACTCTTTTCTAATAGATTTGTGGTGCCTAGTGATGACTCTCAAAATAGATTGTTTTGTGGGCTCTTATTAGGGGCATTTATGTGCGGAAATGTGAGAATCTAGTCCACATGGCCAAAAAAATGTGAAATTCCAATAGTGGACCCTAATTACATGTATAATTCTTAGATGCTCTCAATTATATTATTTAATGCTAAATACTAAGGATTTGTGCAGAAAGAGTAGCATGGCGTTTAGGTCGAGTAGAATGCTGCCTTGTTTAAAGAACTGCTGCTAGATGCTGAAAACGAGCGCAAACCTTGGAAGCTTGCAGTTACCTGGATGCTCTAGAGTAGAGGCTAAAGCAGTACTGCTTTAAGTTAACATCTCTTTTACCACGTTCGTACAGCGTCCCGCGGGTGCCCTAAGTTTTAAGGCCTGGTATTTAAGGAAAGGGGGTCCGAAGCCGACAAATAGGAGACCGATGACAGATTGATTCAATCATTTTTGGAATCATCAATTTCAGGGAATGTTTCTGTTTTTGACGTGGGGACTTTATCGAGGAAGGTGAAGGGTGACAGATTGTTGAGGATTTATATTGTACACCTGAGGAAGCTTTCTACTATATTTTCATTTCTTCAACAAGAACTCCTCTCTTTCATTTTCCTTCTCTTTTCTTCTTTCCATTATGAACTAAACCCCACCTTCGGGGAAGAGTTGAAAGGTTTCTGTAATATTTTTGACTTGCTACACTTTAGAACTTTGAAGTACTGATTTATAACAGAATTGTGTTTGTTTTTTCTTTCTCATTAAACTTGTGGGCCAACTCGTTTAGGGTGCTAGCTTACTTGAGGCACGGAACTC
GTTTAGGGTGCTAGCTTACTTGAGGCTCGGAGAGTCCTTATTGAGTTGCTTGATTGTTTAGCTATAGTTTCCTTTTTCCTTTCTGAAACAGGGCAGTGAATACCCAGGGATGGGGTTCACTTTATAGCACCTTTAGACCGCTTTATTTGTTTGCTTAAATTACAATGGAGTTGGGTTAGAGCAAGTGTAGTATCCTTGGAGGAGAGCCATGAAGGTTTGGGTAATCATTGTTTATGGGACTATTAGGTAGTATCTGAGAGTAACAGTATGTCAATCTTCCCCTCCCCAATCTTTTAACTAGTCGTAATCCCCTTATCATTTTCGTTGTATTCGTTGGTTGTTTTTAGGAGCAGATCTAAGTAGTTCTGGCTAGCTTTTGTTTAAATCTTTATACATCTTAAGTTCACAAATCAACGCACAACATATATTTTTTCTTGTTTTGTTTACTTGTGGTATTTCGAAATTACTAACCATTTTATGAGTTATTTTCCCTGAGGAGACGACCTTATTTGGACTACACAACTGAATATTTAGTGTACACTTGTCTGAGTTTCTCCTATTCGTGAGAGAACGAACACTAGGGATAAGATCCGCCTAACCATGCTTATAAACTGTTCTTCATGTTTAGTATTTAGCAACAGAAGTTGTAATATTCTATTTGGGTGTCTCGGAAGGAAACCACGTCAGTTTGAGCATCAGTAGTAATTTTATGCTAGAGCGTCTTCTGTGAACGTGTGAAAGCATGTCACAATCTCCCCCGGTTCCATCCCATTACTGTTTGTTTTCGTTTTTTTTCCCCGTTCTGTTTATTTTGCTACAGCATTTCTTTGACCCAGCATCTGTTAATACTATATTTCTCTCTTGGAATTTCTTTTGTGTTTTATAGTATTTTACTTAGTTTAATTGTACAAACATCGATTATCTTTTATTTGCATTTTCTTGGTATTCTTAAGTGCATGCTTAAAGTAACATTCTTAGTCCATTGAATTTCGACCTTGTTTATACTACAATCAGCATGAATATAGTCTACACTTGCACTTGCATTCACCTAGGTTTGTAGGAATGTAACAAGTTTTTGGCACTGTTGCTCGAGACTAAGCTATTTTACTTTAAGTGTATTTTGCTTAAGAATACGGTTCGTAAAGAGTTTCCTTAGCCATATTTTGGTTGCTTTGTAGGTCGATGTATGATTTCCAAGGGTTAGAATTTTCCTGCTTGATGACAAAAATGAAGGAACGCCTAATATGAGAAGAAATGAGCACAGAATACAAGCAAGAAGAGACGAATCGCAATCCGAATTCGAATCAATCACTGAATATGGTAGTTGAGACATGTATGAGGAGAGGTTGTTGGGGCAACAGCAGCCCTTCCTCTAAATGCTATCCTGTTAGCAGATGACAGGGATAGAGCAATTAGAGCATACATGACTCTGGCATTTTTATAATTTTAATCCAGCTATAGTGGAGCCGGATATAGATGCCGATGAGTTTGAGCCTACACCCGTAATGTTTCAAATGCTTCAAACTGTGGGACAGTTTGAGGGTCATGAATCAGAAGATCCTCACCTCCATCTGAGATATTTCTTGGAGGTCAGTAATTTGTTTAAGATCAGGGGAGTAACGAAGGAAGCATTGAGGCTGAAATTGTTCCCATTTTCGCTGAGAGACAGTTAGGATATGGCTACACTCCCTACCATCAGAATGCATCAGTACATGGAATGAATTGGTAGAAACTTTTTCTATGAAATATTTCCACGAGAAAAAATGCAAAGTTCAGAAACGACATCAATAACTTTCAGTCAATTTTCAGGGGAATCTGTAAGCGAATCATGGGAAAGGTTCAAGGGACTTTTGCAAAAGTGCCCATACCACAGTATCCCCAAGTGTATTTAGAGAGATATATATTTAATGGGTTGAATGAAGCTACGAGGTTAGTGGTAGATGCCTATGCAAATGGAGCTCTCCTATCGAAGTCGTATGCTGAAGCATTCAACATCCTCA
ATGAATCTCATCTAACTATCACCAATGGTCGAACCCAAGATCAGCTGTGGATAAAAATTCAAAGAGAATACATGATGCTGACGTGCTAACTTTGAAATGTAGGAATGTTGAGTCGGTGACTAAGGCCAGTCTCACCCATACAGATCAAAGGACGAGCCCTCACAGGCAGGAGTTCATAACTCACTCAGGATTGAGAATGAGTTTCCTGGTCAGGTCCTATGAAAATAGTAATCTATTAGTTAATGGTGTTACAAAAGATTGGCTATTTCACGTTTCGATCTTCTACAAACTCATTGTATAACACCTCCACTCACATGTCTCCATGAACGACTTGGATCGAGTCATCTGTAACATTTACAAAGCGGGTCGTATCCATAGTATTGCCAAGATAAGGTACCCACCTTATCCTTATACTATAGAGCCTTTAGGTTATTCCCGGTACATTATTCCATATGTATATCGAGGGTATATCAAATATATACTCAGTATATATACTATCCCGGTTACATGACAATAACCTTGGATGTTTAGTTATTGGATTTGGACAATGCAATGTTATAAATTTAACCACAATTTTTTTAATATGAAAAACACAACTTTATTAAATATAAATTTGTTTTTACATATATTACAACTACGAGTTTAGGGCAACACAACCCAACAAACTCCCACTTGAACTAAACTCTAGTGGGGTGTACAAATCATAAAGCAAAAGGAAATACAATAAACTAGAGCATACACTATACCTATCACATTTCTCTCACTTGCCAGAGACTAATTGTACAACGCTCGTAGACCTACACTTTCTAGATGGCCCTCAAATAGTTTAGCCGTGAGCGCCTTTGTAAAAGGATCAACAATGTTGTGCTCCGAAGCGATCTTTGTGACGATCACATCTTCTCGTTACTATCTCCCGTATAAGGTGGTACTTCCTTTCGATGTGCTTGCCTCTCTTGTGGCTTCGAGGTTCTTTAGAATTTGCCACAGCACCACTGTTATCACAATAAGTGTGACGGGCAAGTTCATGTTTGGAAACGACTTCCAAATCGGTTAAGCACTTCCTAAGCCAAACCGCTTCTTTTGTTGCTTCACAAGTTGCGATGTACTCAGCCTCCATGGTGGAGTCTGAGATGCATCCTCATTTGATGCTTCGCCAAACCACTGCTCCTCCCGTTAAGAGTGAATACTGATCCAACGGATTTCCTTGAATCCTGTAAGGATCAAATCCTTAGAGTCATACACGAGCTTGTAGTCCCTCGTTCTCCTAAGATACTTGAGGAATTCCCTTAACAGCAGTCCAGTGATCATATCCTGGGTTGGATTGATACGTACTGATCATCCCTATTGCATAGCAGATGATAGGCTTAGTACACAACATAGCGTACATCAAGCTGCCCACTACTGAAGCATGAAATATGCGTCTCATATTCTCAACTTCTTGAGGTATCTTAGGACATTGTTCCTTAGATAGATGAATTCCATGTCTGAAGACAATAAGCCCCTTCTTGAAATTTTGGCATTGATTATGGAACAAACATCTTGTTGATATAAGTTGTTTGAGATAAGGCTAGTGTTTTGTTTTAAGCGATCCGTAGGAATTTAGATGCCAAGAACATAATGAGCTCTTCTCCAAATCATTCATTTGGAACTCAATAGCTAGCCACTTCTTATGTCAGTAAAGTATCCTACATCATTCCCAATAAGTAGGATAGCATCCACATACAATACTAAAAAGACTATTGTGTTGTTGACGATTTTCTTGTATACACAAGGCTCATCAACATTTTGGTCAAAACCATAAAATTTGATCGCAGTATCAAAACTTATGCTCCAAGGATGAGATGCTTGTTTCTATCCATATATGAACCGTTTAAACTTGCAAACCTTTTGCTATTGACCTTGAGCTATGAACCCTTTGGGCTGAGACATATAAATACTTTCTTCAAAATTGTCATTCAGAAGGGCAGTCTTGATGTCCATTTGCCAAATCTCATAGTCAT
AATATGTGATAAATGGATAAAAGTATTCTTATGGACTTGAGTATAGCTATAGGGGAGATGTTTCTTCATAGTCCACCCCTTCCGTTGTGTGTAGCCTTTCCCACAAGTCTTGCTTGTAGGTCTGTACCTTTCCTTCTGCATCTTGTTTACTCTTATAGATCCATTTGCACCCCAAAGACCTGAATTTGAAGTACATAGAGACTCTATTTCCAAGTCCATGGCTTTGACACATTTGTCCTGTCCACATTTTTTCGTTGCCTGTTTGTATGTTAATGGATCCTCGACGCCGTCATCGGGTATAACAACTTGGGTTTCAGTTAAACCCATGTAGTGGTCAGGTTGTGACACAATCCTCCCACTACGTCGAGGCACCCTCAATTCTTGAGGCAGATGCGACTAACGTGATGTGTTGATTCATCAACACCTCTTGTTGAAGTGTCAGCGCTTGATCAACAACTCTTGTAGACGTTTGTAGCCGCTTCGAAAATCTCGCTTATCACTACCTTACTTCGTGGTTTATGATCTTGAACGTGGTCTTCTCTTCCAAGAAGTGGCGTTTGTGGATACAAATACTCTTTTTCTTGAGGATCGAAGATGAGAGATTACAGATCCATTAGCTTGTGCAATGTGATCTATAAGCTTATTGCAAAAACTTTGGCGAACAGGCTAAACTTTTTTCTTGATGAGATCATAGCTCTAATTCAATCGGTCTTTTATCCTGGGACGAAGTATTATAGATAATGCCACAATCGGGTTTGAGTGCATCCATTCGATAAACAAAAAAATTATAGGAAAGCACAGAGTGGCGGTGCTAAAACTAGACATGTCTAAAGCTTATGATAGAGTTGAGTAGGGGTATATGAGACAGGTTATGGAGAAATTGGATTTCTGCCAAAGATTGATTTCTCTTATTATGTAGTGTGTGAAAAAAGTCAAATTTTCTATATTGATCAATGGTGTAGATACCGTATTTGGTCCGGCTTCATTTTAATTTTAATTTTTACTATTTAATTTTTGTTTTATTAAGTTTTTTTTTTGGTTTAAAATAATATAATTACTTTATTATTGGTTATAAGCTAAAGTTTAAATTAAATTAAAAATATAGATTTTGTTTTAAGATTTTGATTTTATATAATAAGAATTAAAAAAAAATAAAGATTTTATGTTAAACAAATGGTTTTATTTTTAACAGAATTGATCTTTTCCTCGTCAAAGTTCCATTCCAGCCACGTAGTGTCTCCTACGCTTTCCTTTCCTTTTTTTCTCTATCGTTTTACTCGACTCACGCATTATTACTGTTATTTTTTTGTGTATTTAAAGGGCACACACACACAGAAAACACACAGCGCAGACAGTTAGAAACTCATCACCGTATCTCTTCTCTTCATCTCTCTTATTCCCACACCATCACCACTTTTCTCCTTCATACCCACGACCTCCGACGACGCAGGGAACGACGCGAACGACAGCAGCACAACCTTCTTCACAGCTTCTTCCAAGCCCGCTGGCACTCCGACTTTTCGCATCCGCGGTTGTGTTTGAGTGAGTTCGTTAAGTGTGTTTGTAAATACGTTGAAAGGATTTTTTTTTTTCTAAATAGGAAAGGCTTTGGAAATCATGCTTTTAATTTGAAACAGATTCTTTTGTTTCTATGAAAGATTTTTTTTTAGAAAAAAAATGTTTTTTTAAGAAAAAGTTTTTTTTTTTTTTTTTTTTTTAAGAAAATGCTTTTAGTAAGAAACCTTTTTTATAAGATAAAAAAAAATCTTCTAAATTAATTTTAAATAAGTTTTGAAATAAAATAAAGATTTTGATATTTTAAAATTTGTTTTAAATAAAAGGTTTTCTTTTTCTAATAAAGAAGAGTTTTTTAAATGAAATAAATATAAATTTTATTTTATTTTAATAAAATATTTACTTTAAAATTTTGGTTAAAATACGTTGAAAAAAATAAAAGTTTTATAAAATAATCTTGTTTAAAAAATTTGGATTCTTTTTAA
ATTTGAATTCAAGTTTAAGAGATAGTTTTTTATTTAAAATGATTTTTTTTTTTACAAGGTTTACTTTTTTTGTGTAACTTTGTTTTATTTTTTAATTAAAAAAGAATCTTCTGCTGCAGGATTTTAAAGGAGGAGATAAAAACAGAGTATTTTAGAATAAAATCTCATTTTTAAAATTTTTTTTAATGGTTCGGTAGTTATTGTTTTGAAAAGAATCTTAACAAAAAAAAATCATTTTTTTAGCAAATAAGTTTGTTAAAAAGTAAATTGACTTTGTTTTTAAATAAAAGTTTTTTTTCTTTTAAATAATATTATTAATTTCGAAATAAATAAAAAAAGTTATTTATTTTTCTGTTTTAATTTAACAAAATTTGTAGGTTTTTTTTAGTTTAAAAAAATAAATTAAAACTCTTGTTCTTGTAAAAGAATACAATATTTACATCTTAGGATGTTGGAATAGTGGGAAGATGTTTGTTTTTTTCAATGTACATGACCCAAGTTCGAGTCTTTGCATGCCTCATTTTATCATTGTTTTTTCTAAATAATTCTTTTAATAAAAGATTTCCATTAGAGTTTTTTTAACAAAATTCAAAAATACAAATTTGTATTTTAAAAAGGAACACAGCTTAAATCTCAAGGTGATGGCACGTTGATGAGACATTGGAGTGTTATCACTAATGTTGTAACTTCAAGTCTTAATGTTCCATAAAATTAAAATTATTTTTTTTATTTATTATTTTTTGCTAAATAATTATTTTAAAAACTCCAATTACGAAAATTATGACCTTCCAATATTCTCGGGGTGTTAAAATTTTTCTAATGGCCTTCGTGCCACTCATACCTATTAATTCAAATCTTATAACAAGGTGTGAATTAATATAATATGAGACATTTAAAATAAAATAAAACTTTTTTTGGACCAAATAAAATAATAGGATAATTTTTGACAAAATGTGGTACCGTTTTCGAACGGGTATCGTAGGGTGCTAACACCTTCCCTACGCTCAGTCGACTCCCGAACCAATAATCTATTTTCGCAGACCTTGTCTTTATTTTTAGGTGACCAATCACACCTCGTAGATGATTGGTGGCGACTCCAAACCCTTATCTCGAAAGAGACCCTTCAGGGGATGACGTCGGCCGCTCCACGTCGCGATGACGTTGCGACAGCTTGGCGACTCCACTGGGGAATATTAAGAGAGTCAAGCCATTTATTTTGTCGTTTTTATCCTAATTGTAATATGTTTTGGGTCCATTTTTGTTATTTGTTATTTTTTTCAATTTTTGTAAATATTTTTTTATGCATTATGATTTTATTTTGCATCATCACGCATCTCACACACATTCACAAGCCTCCTACCCCTACCCAGGCTTTACCCTTAAGATTAGATCGGAGACTGAGTAGTGTGACCTTCGTGGAATTTTGGTTCCCGTGCAGGCGCATGAAATTCTCCACTCAGGTCAACCATACTGCAAAAGCATATGGAAGATTTGGGGGCCATTATCCTCTAGGTGCATTAGTACATAACTTCACGCTCATATTCATATTACATTTTATTTTTGTTGTGTCATATTGTTTATTTTAAAAAATAATAATAAATAGGGTCAATTTGGCCGAGTTTTAACCATAAGGTTTTAAATGAACCACTTTTTTTTTAATCTAAAAAAAAAAGAAAAGGAAAAAAGCATTTACCTTATTTCTTCCTAACTATTTGCTTAAGTTTAGATAGAAGTGTTGCATTTCAAAGGGGTTGTAAATGTCTCTGTTGTAGGCACATAAAAACTTTTAAACTTATTCAATTATTTAGGAGTTCGTCCTTATAACCCTAAGATTTCATTTCATGCGCATATTCATGATCTTCCCTCCTGGGATGATTTATAGTCTAGCATTACATTTACATGCATTTAGATTGTATATCATATCACTTCGCGGTAATTGGTAGCACATTTTTTTTCTTAAAGTAAATTGAAACTGTTCTTTCCTTGTATCATTCTGCATG
TCCATAACATAAATTAAAAAAATATCATCTTATGCATGCATCATTTCACATGTTCATAATCATTTAAAATTTCCCTTAATTTGTGATCCAAAATTTGCATAACATTATCTCATGCATATCAAATTCTATCTTAAAAAAATCAAAATCTGTGCATTCTCCATTACATTATAATTAATATGGACTCTTAGAAAAAATAGCTAAAGAACCACATATTTACTATTCTTTATATACAAGGTACATAAAGCAGCATAACATGAAGAATTTAACGCTGAGCTAAAGAAACTTAGAATCGAAATCCACGAACGGGTCATGGAAGAATTGAATACTGAGCTAAAGAAAATAAGAGGCGAGATTCAAGAACAGATCATGGAGGTGCAGAATGTAGAGCTAAAGAAGATGAGACAAGATATTGATAAGATTCAGGGACAAATCACATAAATTTTCTATATCTTAAAAAATATTGTTGATGGCATTTCTCAACCTAACAATTTAGTCGAACCTCATAATAATTCATTACCTAAGGTGAACTCACCTCCTTGTCAGTCAATAGAAGGCTTCCCCCAGCATAATTCATCACATGAAGCTCAGCTTTCCAAGAAAGGCCAACAAGCTCCTACAATCTATCAGGTTGAATTCACATTCCAAGATGGACAAGCATCAAAGACCATTCACTCAAATATTTTAAAAGGTAAAGAAAAATAAAATGTTACCAATGGAAATGTTGAAGAGACTACAGGGAAAGAATATGCTCAGAGATGGGGAGAAGCAATGGAACAACCGCCAGTGCATTTAATTAATAAAGAGCCACACCTTGAAACGAATAGAAGCTCTTTTAAAAGGAAAAATAAAGTTCAGGCAGTAGTTTCGAAGCGCAAGGGGAAGCACCAACAAGGATTCGTGGCATACAGTCCTTCTACGGCACCCATAAACTCATTGACTCTGTAGAATGATCCACAAACATTACCTGACAATCATAGTAGTCGGAGACAACTAAAGTGAGAGAGCTTTCATTTGATCCCAATATCCTATACCAAGTTGTTATCCCAATTATTTCAAGGTAATCTATTAGCTCCCGTACCCGTAGAGCCATTACAACCACCTTATTCAAAGTGGTACGACCCGCATGTTCGTTGTGATTACCATGCAGGAGCTGTAGGTCATTCAACTGAGAACTGTACTTCACTGAAGCATAAAGTGCAAGTGCTAATGAGGGCAGGATGGCTAAAGTTTTCAAAGGAAGATGGTCAATACATGAAACACTAGAATCATGAGTGTACAAGATTATTAAAATCAATGTTGACCTATTACTTTGCATTATCAATAAAGTTTATTTATTCTTTGAAGTCATCTTTGAGATTGCTTGCTAGTGTTAAAACGACAATGAAGCTTTAGTCACCCATGATTTAGCTGAGTACTTGTAAAAAGGAATGTGAGTGTGTGGAAATTGAGCTATTTTTTTTTTAAATCTAAGTCGTCATACATTCATTCATGCATCACTCATATTATATCATTGGTTGAAAAATAAAAGAACAAAAAAACAAAAAATATTTTCTTTGAAAATGCTCTCGATAAAAATGAAATTACATGTTTTGGAGTTTTTCAATGATAATTTTTGTATGGTCTACTCTTATAATTCTTGTTAGTGGAGACCTTAGCAAGAATCATAAACCAATTATTGTAAGTCACTAGGAGGAAACAAAAAAAGAAGTGAAAAAGAAATAAAAAATGGGAAAAAATGAAAATGTGAATGCCTTCAAGGTCCTCAATGAGATATAATCTTTTTCAAAACTCAACCATTTTTCCGGACTTTGTTGTCGGGATAAATCCGTAACTCAAACCAATTTTATCTGGTCTTCCTATTGCTGCTCATGTGCTAACCAGCAAACCACCAAGTTTCATACTGGGGCAAGCATGCCTATTGAGGTGAGGAACCAAAATTACTCACTCCCTGTAAAACCTTGAGCTTAGCTTACTTTAAGCCTAATCTGCCTTGAGCCTAG
CTTACCGATATATGGTAAATATAGTCAACATTGAGAGTTGACCATTTACTGCAAAAAAAACATTTGCTCAGCCCGTCACATACAATTTTTGTGAGGAATATCAATATTTGCCTACTTACCATAAATGTCTTTTATCAAACCTGTCACATCTAATTGCGTGGTAAATATCCAAAATTATCCACTTAGTGTTACCCTAAGAACTTTGAAGGTAATAAAATAACATTGGGCTCTTATGATTGACCGCTTCATGAATTTGTACTTGTAGTTGCTCAAAATTTCCCTACTCTGACTTCCTTCTTTCACTTTCTATTGTTCTACTATCTTAATCCTTCGTCTACTTTCTTCTTTTGAGTAAAAAGTGATACCAAGACCGAGGATAGTGATTTTGAGAATTGGGAAGTTAGCGGCCTGACAAATAGTCTCAAGATCTCAGTCTCCTTTGATGAGTTTCGTTTGAGAGCAATTTTTACTAGAGCAGTAGCAAAGACATAATTTGGAATAGCAGTAGAGGCAACTACGCATTTTATACATGATTGCACCATTTTTTAGTTCAATGACTGGGGCAATCAACAGAAATCTACAACAACAATTTCAAAGGCTTCAAGGTCTTCTTTGCAAATCTCCACACTAGGGCAAATTTGATCATTACTATATTGGGGCAAAAGCCGTCATCTGCTACACTGGGGCAAAACTATCATTTGCAAGCTGGGGCAGGATCATGAGACAAAGGCCATCTTCAGACTAAGGAGATCTCATTTCTCTCTCAGGAAATCTTTCAATTTTGAAGGTCCTTCACTGCACAAAGTGGGCTGTTATACAACATTCCATAAAGAACTTCATTTTGAAATTGAAAGCTAAAAGGAGCAGTCATTTACATTCAATAAGTAGACTGCAACAATATTGCCATGAAGAACGACACTTCAGAGCTAAGGCTAAAAAGATATTTACCTTCAAAGTGGACCACGAAAACATTTTCGTGAAGAATAACATGTGGTTAGAAAGATGATTTCATCAAATTTCTCCAGCTAAGCTCAAAGAAGGAGTGAGGTACATGAGATAAACTCTTCGAGAGTTACGCAACCAATTCTCAAAGTTATTTGTCCAAAGAAATTAGGTCATTTCTACTCTTCTATTCATTTGATCCTACTCTAATCTCTCACCAAAGCTCTTAAAAATATATTTCAATCTCTTATTGTTCCAGAGATTTGACAAAATATCCATGAATTCTTGGTTTCATTTTCTTTAAAAAGGAAGCCTGCTGACTTTTATTCATTGTCAAGCTTTGTTGCAAAATCATGATTCCTATCCTCCATTACTCCTAATGTTAAATCCATTTTCTTGTAACCGATCTTTTCATGTCAATTATAAGGTTGAGTTTATGAAGCAGTTTAATCAAACAAGATGTACAATGCTATGAGTTACCTTTGATTGTTCTCTAAACTGGCACATGAGATACCTCTACAAAAAAATTCCCTTCCAAACCTGATATAGTTGAGAAGTTACCTTGTATATCTCCAATCGAGTGAAAACATTCATTAAAAAGTAAATAAATAAAAAATAAAGAAAAAAAAGCAAAAGGAAAAAAAGAACAGTTGATGTGTTTCTTCCAATGTTGATATTACCTTTTCTTATTTGATTCTTTTGCTAAATAATTGCTAATAAGAAGGATGAGAGTAGACAAAAAAGTAATATCTTTGCAAAGTATGGAAAAACTTCTTGTTCTCTCAACTATATTTTCACTAATCATGCATACACATCACTTGCATACATATTCTTCAAAAAAAAAAAAAAGGTTGTCATATCATGATTATATTATGAAAATTTAAGTTTCATGCATTGTGCATTCATAATCACATTTTATGGTTTGAGCCAAAATAAAAATGAGTATCATACATACATGATCCAACCATCTAGGATTGCATATATATTATTGCAGCATGCGTCGGGGGTCAAATTCATTGGTTAAACGAAATAAAAAACAGATTATATACTGTCACACA
TTGTCAAATGATAAGAATAAATGGAAAGTAAAATGGAGCATCATTTGTCATTCATGACATTGGGGCAATACACATTTGTAGATGCACCATCCGTCCTAAGATAATTGCACCAGAGGTTGCGCAGCAGCTTCCCATGAAGCCTTCTGTAGATGTCTTTAGCTTTTCCTCGGCCTGTAGAGTTAGAGCTAGTTCATAGGCCACACTCAATTGTTAGCGACAGGGTGCCTGCTCTTCATTAGCTATTTCTGACAGGAGCTCTTTCGTTTACAAGGGGACACAATTTAATATGAGTACAGCTTTCCACGGCCTCCATTCCTATCCCATCCTTTATGATCTTTGGGCATTCCAGTCGTTGACTCTTCACGCCGCTACGGGCCTTTCTAAGACCGACCAGCACCTCACCTGCACAGGTACCTACGTAGTGTAGTGTCGGTTTGTTAGGATGAATTGGATTGATCTAACCTCTCGCTCGTATGTAATATTTCGCTACGATAAGTACTCCATACTGCCATTGGAAGTTCGCATTAGCATGAACATGTGGTGTTAGTCTTATCGTCTTGCGCCAAGTGCATATCTCATGCTAGCAAGTTTAAAAGAAACTCTAAACCCTACACTTAGGTGGCATATCCTTACTAGAAGAGAAAAGTTTTATCAGCTTCCACTGTCTATTGAGCCAACGCCCGTTATGGCCGAGCCAACCATGATCTCCTTTGGTCTCGTTCCCATTCTTTGTTTGAATCGACCTAGCTGTGTCTTGCTTAGCCGCATATTTCAACAAACTCCACGACAGTACGCCGTAACTCACGCTTACCTTAAAGAGTCGTTGAGGCTTGGTCGGGAGTCCTGTCTGTGCCTATATATAGAAATCATTTGCTTTTATCCAAGGTATGCAAACACTGTTTTTTTCCCTCGAATACAACTCGGATTCTTCCTAACTTGGGCATCAGAGTGTTTTCTCTTGCAAGTCTCCTGCTCACCAAAATTCAGCTAATCCTCAAGGTCATCATTCCGTCCAAGCCGAGCTCGTAAGCTTGGTAATTTTTGTGATCAAACATTTATCTAAAGTTAAATCGAACCCGATCCAAGTCGAGGTCGAAGCCTGTTGTTAAAGTCAGGTCCGTCGCATTTTATCATTTGGTTAAATCAAACCCGATCCAAGTCGAGGTCGAAGCCTGTTGTGAAAGCCAGGTCCGTTGCATATTCTCATTCGGTTAAATCGAACCTGATCCAAGTCGAGGCCGAAGCCTGTTGTTAAAGCCAGGTCCGTTGCATTTTATCATTTGGTTAAATCAAACCCGATCCAAGTCAAGGTCGAAGCATGTTGTTAAAGCCAGGTCCATTGCATTATCATTCGGTTAAATCGAACCCGATCCAAGCCGAGGTCGAAGCCTGTGTGAAAGTCAGGTCCGTTGCATTTTATCATTCGATTAAATTGAACCCGATCCAAGCCGAGGTCGAAGCTTGTGTTAAAGTCAGGTCCGTTGCATTTTATCATTGGGTTAAATCTAACCTGATCCAAGATTGTGGTCGAAGCCTATGTCAAGCCAGGTCCATCGCACTTATCATGTAGATTAAATCAAACCCGATCCAAGATCGACGTTAATGCCTGTATCAAGTACGAGGTTCTTTGCATGTCGGGATCCTGCCTAGTGTCCAGTTGGGTCCATCTCTATCTTGTCTTGTCTGAATAAAATTTTCGCCTTCCCAAATACTTTGTCTTTTGCCTTTGTCTACCCTTTTAGTTCTGCAAGGAAACACAACGGGGGGCAAGCTGTAGATTCCGTGTTTGGTTCGGCTTCATTTTAATTTTAATTTTTACTATTTAATTTTTGTTTTATTAAGTTTTATTTTTTTTGGTTTAAAATACTATAAGATTACTTTATTATTGGTTATAAGCTAAAGTTTAAATTAAATTAAAAAAATATAGATTTTGTTTTAAAGATTTTGATTTTATTAATAATAAGAATTAAAAAAATAAAGATTTTATGTTAAACAAATGG
TTTGATTTTTAACAGAATTGATCTTTTCCTCGTCAAAGTTCCAGCCACGTGGTGTCTCCTACGCTTTCCTTTCCTTTTTTCTCTATGGTTTTACTCGACTCACGCATTGTTACTGTTATTTTTTTGTGTATTTAAGGGCACACAACACACAAAAAACCCATCACAGCAGCAACAACCTTCTTCACAGCTTCTTCAAAGCCCGCTGGCACTCCGACGATTTCGTTCGCAGTAGTGCGTTTTAAGTAAGTATCAAATACGTTGAAAAGATTTTTTTTTCTAAATAAGGAAAAGGCTTTGTAAATCATGCTTTTAATTTGAAACAAATTCTTTTATTTCTATCAAAAGATTTTTTTTTAATGTTTTTTTAAGAAAAAGTTTTTTTTTTTTTTTTTTTTAAAAAATGCTTTTAATAAGAAACCTTTTTTTTAAGATAAAAAAATCTTCTAAATTAATTTTAAATAAGTTTTTGAAAATAAAATAAGGATTTTGATATTTTAAATTTTGTTTTAAAATAAAAGGTTTTCTTTTTCTAATAAAGAAGAGTTTTTTTAAATGAAATAAATATAAATTTTATTTTATTTTAATAAAATATTTACTTTAAAATTTTTGTTAAAATACGTTGAAAAAAATAGGAGTTTTATAAAATAAATTTTGTTTAAAAAAATTTGGATTCTTTTAAATTTGAATTCAAAGTTTAAGAGATAGTTTTTATTTAAAATGATTTTTTTTTAACAAGGTTTACTTTTTTTGTGTTAACTTTGTTTTATTTTTTAATTAAAAAAGAATCTTCTGCCGTAGGATTTTAAAGGAGGAGATAAAAACAGAGTATTTTAGAATAAAATCTCATTTTTTAAAATTTCTTTTAATGGTTCGGTGGTTACGGTTTTGAAAAGAATCTTAACAAAAAAAAATCAGTTTTTTTAGCAAATAAGTTGTTAAAGTAAATTTGACTTTGTTTTTAAATAAAAGTTTTTTTTTCTTTTAAATAATCTTGTTAATTTCTAAATAAATAAAAAAAAAGTTATTTAATTTTCTGTTTTAATTGAACAAAACTCGTAGGTTTTTCTGAGTTTAAAAAAATAAATTAATACTTTTATTTTTGTAAAAGAATTCGATGTTTACATCTTAGGATGTTGGAATAGTGGTAAGAGGTTTATTTTTTCAATGTACATGACCCAAGGTTGGAAAGGGAGCTCCAAGTCCCTTTCACTAATCATTTGGGCCGATACCTTGGACTTCCGTCACAAGTGTCTAGAAACCGGTGCAAGGTGTTCAACTCAATCAAAGAGATAGTGTGGAAAGCACTCCAAGGTTGGAAAGGAAAATTGTTCTCGATGATAGGAAAGGAAGTCCTTATCAAAACAGCTGCTCAAGCAATTCCTGATTACACTATGAGTTGCTTCCAGCTACCCATGGGTCTGTGTAATGAGATTACAACAATTTATGCAAGATTTTTGTGGGGTCAACTCTAGAGCACGTAAAATTCACTAGCAAGGTTGGGAAAAGCTTTACGCGAGGAAGAAGCAGGGAGAAATGAGGTTCAAGGATGTTAGAACCTGCAATCAAGCAATGTTAGCTAAGCAAAGCTGGCGAATTTTAAGGAATCCTTCTACTCCCCCGTCATAAATTCTGAAAGGAAAATACTTTCAAAAAGACAACTTTTTAAAGGCTGAGTTAGGACACAACCCTTCCTTTGCCAGGCGAAGTAACTTAGTGGGAGTATGGCCTCTTTGAAAAGGGCTGTCGGTAGAGAATTGGAAATGGGCTTTCTATTTTTATTGGGAATGATCCATGGCTGACACGAGAGGACAATTTTAAACCACTTTCTTGTCCACCCTTGTTATTGAAAGAACTTGTTACTTCCTTGATGATGTCGAAATGCTATTGGAATGAGGGAAAGATAAGATCCATTTTTATGGAGTCAGAGGCCAATCTTATTTTAGAGATACCTTTACGTAAAACGAATAAGGGTGATGAGATTATTTGGGATCTTAGACAAA
AAAGGGATATTTACTGTGAAAAGTGCTTACACATTGGGGAAGGGAGCTGTTAATTTGGAGGAGACTTCTTCGTCCAAAAATGATCATCTAAAAAACAGATGGAATAGCTAGTCGAAGGCCAAAGTCCGTCCTAAATTTAGAATCTGCTGCTGGAGAATTCTTAGTGATATTATTCTTAATAAGTCTAACTTGTGCTTAAAAGGCATCACCAGTTATTCTTCTTGTGTTTTTTGTAGGAAACATGAGGAGACAACTTAGCATCTTCTTTGGGAGTGTGAAGTAACTAAATATAATTGGAATTACATTCTAAATGCTAACTCGAGTTCTGGTTGGTCTAACAGTAAAGGATGGAAGATTGCAGATTATTGGGACTAGCCTGTGAACAACGTGAAGGGGACTGAGCTTGATATAGCGTTGCTGACCTTGTGAAATACATGGTAATATAGGAACAAGGTGCTTTTCAAGTGGGAAATTCAGACACATCAACAATTGTGTTTCTTGATCGATAAACAGTGTGAGAGAGAGAGATCATCAAGTGAACCTTACCTCATACCAGCTTTTGGCAGCTTAATGTTGATGCAGCTTGGTTCCAAGAGCGTCAACACGGTGGATTAGGGTGGGTGATTCGAAAGGAAGAAGGCACAATTATTTAGCTGGTAGCTCCTTTATTCGGGAGAAAATTTGCATCAAACACCTGAAAACATTGGCTATGCTAGAAGGGATTCACAATTTGGACAATCCAAATGGTGTTGATGTTTACCTAGAAACAGATTTATTGGATGTCAAATGGCTAATGAATAAGGAGATGGATGACTTCAAAAAAGTTCTTTGGCTGGCCGAAGAAATTCGGGCGCTATTGGAAGTGAAGAAGATTGTCTCAATTTGCCATATCAAGACGAGAAGTGAATCGCGTGGCTCATGA

mRNA sequence

ATGGAGCGCATTTGTGTGCGCATTGCCTCATCTCTCTCTGGAGCGATCAAAGTTCTCGCTTGTTGGAGCCGGGGCTGGACCCCGGGCGCCCTTCTTAGAGGTCCCACCTCGGCCACGGGTGGCTTGGACGTCCTCGGGTGCGCAGGCGGGGACGGCGCCAAATGTTTATGCAAGGATATGCACCACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCTGACGATCAAGTCAGTATAAGTCGGATTCCCAGTTTAGTTCGAGGGGAAAAACCCAAGAGAGGAAATAGCTCCCAAAACGTGCTGAAGACAAGGAAAGCTACCCCCATTTCAAGAAGAAGTCGATGGGAGGAGGCTGGAACTGCTGGTTTCTCACTGGCTGTAGCTATAGTGGAGCCGGATATAGATGCCGATGAGTTTGAGCCTACACCCGTAATGTTTCAAATGCTTCAAACTGTGGGACAGTTTGAGGGTCATGAATCAGAAGATCCTCACCTCCATCTGAGATATTTCTTGGAGTTAGGATATGGCTACACTCCCTACCATCAGAATGCATCAGTACATGGAATGAATTGGGGAATCTGTAAGCGAATCATGGGAAAGGTTCAAGGGACTTTTGCAAAAGTGCCCATACCACAGTATCCCCAAGTGTATTTAGAGAGATATATATTTAATGGGTTGAATGAAGCTACGAGGTTAGTGGTAGATGCCTATGCAAATGGAGCTCTCCTATCGAAGTCGAATGTTGAGTCGGTGACTAAGGCCAGTCTCACCCATACAGATCAAAGGACGAGCCCTCACAGGCAGGAGTTCATAACTCACTCAGGATTGAGAATGAGTTTCCTGGTCAGGCGCATGAAATTCTCCACTCAGGTCAACCATACTGCAAAAGCATATGGAAGATTTGGGGGCCATTATCCTCTAGGTAATCTATTAGCTCCCGTACCCGTAGAGCCATTACAACCACCTTATTCAAAGTGGTACGACCCGCATGTTCGTTGTGATTACCATGCAGGAGCTGTAGGTCATTCAACTGAGAACTGTACTTCACTGAAGCATAAAGTGCAAGTGCTAATGAGGGCAGGATGGCTAAAATGCACCATCCGTCCTAAGATAATTGCACCAGAGGTTGCGCAGCAGCTTCCCATGAAGCCTTCTGTAGATGTCTTTAGCTTTTCCTCGGCCTGTAGAGGGACACAATTTAATATGAGTACAGCTTTCCACGGCCTCCATTCCTATCCCATCCTTTATGATCTTTGGGCATTCCAGTCGTTGACTCTTCACGCCGCTACGGGCCTTTCTAAGACCGACCAGCACCTCACCTGCACAGGGCACACAACACACAAAAAACCCATCACAGCAGCAACAACCTTCTTCACAGCTTCTTCAAAGCCCGCTGGCACTCCGACGATTTCGTTCGCAGTAGTGCGTTTTAACTTGGTTCCAAGAGCGTCAACACGGTGGATTAGGGTGGGTGATTCGAAAGGAAGAAGGCACAATTATTTAGCTGGTAGCTCCTTTATTCGGGAGAAAATTTGCATCAAACACCTGAAAACATTGGCTATGCTAGAAGGGATTCACAATTTGGACAATCCAAATGGTGTTGATGTTTACCTAGAAACAGATTTATTGGATGTCAAATGGCTAATGAATAAGGAGATGGATGACTTCAAAAAAGTTCTTTGGCTGGCCGAAGAAATTCGGGCGCTATTGGAAGTGAAGAAGATTGTCTCAATTTGCCATATCAAGACGAGAAGTGAATCGCGTGGCTCATGA

Coding sequence (CDS)

ATGGAGCGCATTTGTGTGCGCATTGCCTCATCTCTCTCTGGAGCGATCAAAGTTCTCGCTTGTTGGAGCCGGGGCTGGACCCCGGGCGCCCTTCTTAGAGGTCCCACCTCGGCCACGGGTGGCTTGGACGTCCTCGGGTGCGCAGGCGGGGACGGCGCCAAATGTTTATGCAAGGATATGCACCACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCTGACGATCAAGTCAGTATAAGTCGGATTCCCAGTTTAGTTCGAGGGGAAAAACCCAAGAGAGGAAATAGCTCCCAAAACGTGCTGAAGACAAGGAAAGCTACCCCCATTTCAAGAAGAAGTCGATGGGAGGAGGCTGGAACTGCTGGTTTCTCACTGGCTGTAGCTATAGTGGAGCCGGATATAGATGCCGATGAGTTTGAGCCTACACCCGTAATGTTTCAAATGCTTCAAACTGTGGGACAGTTTGAGGGTCATGAATCAGAAGATCCTCACCTCCATCTGAGATATTTCTTGGAGTTAGGATATGGCTACACTCCCTACCATCAGAATGCATCAGTACATGGAATGAATTGGGGAATCTGTAAGCGAATCATGGGAAAGGTTCAAGGGACTTTTGCAAAAGTGCCCATACCACAGTATCCCCAAGTGTATTTAGAGAGATATATATTTAATGGGTTGAATGAAGCTACGAGGTTAGTGGTAGATGCCTATGCAAATGGAGCTCTCCTATCGAAGTCGAATGTTGAGTCGGTGACTAAGGCCAGTCTCACCCATACAGATCAAAGGACGAGCCCTCACAGGCAGGAGTTCATAACTCACTCAGGATTGAGAATGAGTTTCCTGGTCAGGCGCATGAAATTCTCCACTCAGGTCAACCATACTGCAAAAGCATATGGAAGATTTGGGGGCCATTATCCTCTAGGTAATCTATTAGCTCCCGTACCCGTAGAGCCATTACAACCACCTTATTCAAAGTGGTACGACCCGCATGTTCGTTGTGATTACCATGCAGGAGCTGTAGGTCATTCAACTGAGAACTGTACTTCACTGAAGCATAAAGTGCAAGTGCTAATGAGGGCAGGATGGCTAAAATGCACCATCCGTCCTAAGATAATTGCACCAGAGGTTGCGCAGCAGCTTCCCATGAAGCCTTCTGTAGATGTCTTTAGCTTTTCCTCGGCCTGTAGAGGGACACAATTTAATATGAGTACAGCTTTCCACGGCCTCCATTCCTATCCCATCCTTTATGATCTTTGGGCATTCCAGTCGTTGACTCTTCACGCCGCTACGGGCCTTTCTAAGACCGACCAGCACCTCACCTGCACAGGGCACACAACACACAAAAAACCCATCACAGCAGCAACAACCTTCTTCACAGCTTCTTCAAAGCCCGCTGGCACTCCGACGATTTCGTTCGCAGTAGTGCGTTTTAACTTGGTTCCAAGAGCGTCAACACGGTGGATTAGGGTGGGTGATTCGAAAGGAAGAAGGCACAATTATTTAGCTGGTAGCTCCTTTATTCGGGAGAAAATTTGCATCAAACACCTGAAAACATTGGCTATGCTAGAAGGGATTCACAATTTGGACAATCCAAATGGTGTTGATGTTTACCTAGAAACAGATTTATTGGATGTCAAATGGCTAATGAATAAGGAGATGGATGACTTCAAAAAAGTTCTTTGGCTGGCCGAAGAAATTCGGGCGCTATTGGAAGTGAAGAAGATTGTCTCAATTTGCCATATCAAGACGAGAAGTGAATCGCGTGGCTCATGA

Protein sequence

MERICVRIASSLSGAIKVLACWSRGWTPGALLRGPTSATGGLDVLGCAGGDGAKCLCKDMHHSVFQIAARTRPPDRSEYLGGPAQKGEHSDDQVSISRIPSLVRGEKPKRGNSSQNVLKTRKATPISRRSRWEEAGTAGFSLAVAIVEPDIDADEFEPTPVMFQMLQTVGQFEGHESEDPHLHLRYFLELGYGYTPYHQNASVHGMNWGICKRIMGKVQGTFAKVPIPQYPQVYLERYIFNGLNEATRLVVDAYANGALLSKSNVESVTKASLTHTDQRTSPHRQEFITHSGLRMSFLVRRMKFSTQVNHTAKAYGRFGGHYPLGNLLAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWLKCTIRPKIIAPEVAQQLPMKPSVDVFSFSSACRGTQFNMSTAFHGLHSYPILYDLWAFQSLTLHAATGLSKTDQHLTCTGHTTHKKPITAATTFFTASSKPAGTPTISFAVVRFNLVPRASTRWIRVGDSKGRRHNYLAGSSFIREKICIKHLKTLAMLEGIHNLDNPNGVDVYLETDLLDVKWLMNKEMDDFKKVLWLAEEIRALLEVKKIVSICHIKTRSESRGS
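The relationship between the coding sequence and the protein sequence above can be checked with a small translation sketch. It assumes the standard genetic code (NCBI translation table 1) and stops at the first stop codon; `translate` is a hypothetical helper, not a tool used by the database.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 21 nt and last 12 nt of the CDS listed above.
start_peptide = translate("ATGGAGCGCATTTGTGTGCGC")
end_peptide = translate("CGTGGCTCATGA")
print(start_peptide, end_peptide)
```

The two fragments translate to MERICVR and RGS, matching the first and last residues of the protein sequence shown above.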
Homology
BLAST of Moc11g16400 vs. NCBI nr
Match: XP_022158986.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia])

HSP 1 Score: 102.8 bits (255), Expect = 1.0e-17
Identity = 40/55 (72.73%), Positives = 50/55 (90.91%), Query Frame = 0

Query: 326 NLLAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWL 381
           N LAPVPV+P+QPPY +WYD + RCDYHAGA+GHSTENCT+LK++VQ L++AGWL
Sbjct: 366 NQLAPVPVDPIQPPYPRWYDTNARCDYHAGAIGHSTENCTALKYRVQALIKAGWL 420
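The percentages in an HSP summary line follow directly from the reported counts: matching columns divided by the alignment length. A minimal sketch, using a hypothetical helper `percent` and the counts from the HSP above:

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    return round(100 * matches / aligned_length, 2)

# Counts reported for the first HSP against XP_022158986.1.
identity_pct = percent(40, 55)   # identical residues
positives_pct = percent(50, 55)  # identical or conservatively substituted residues
print(identity_pct, positives_pct)
```

This reproduces the 72.73% identity and 90.91% positives reported for that HSP; the same calculation applies to the other hits listed below.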

BLAST of Moc11g16400 vs. NCBI nr
Match: XP_031738551.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus])

HSP 1 Score: 99.4 bits (246), Expect = 1.2e-16
Identity = 39/53 (73.58%), Positives = 45/53 (84.91%), Query Frame = 0

Query: 328 LAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWL 381
           LAP+P+ P+QPPY KWYDP+ RCDYHAG VGHSTENC +LK KVQ L+ AGWL
Sbjct: 780 LAPIPINPIQPPYPKWYDPNARCDYHAGGVGHSTENCLALKRKVQSLINAGWL 832

BLAST of Moc11g16400 vs. NCBI nr
Match: XP_031737539.1 (uncharacterized protein LOC116402431 [Cucumis sativus])

HSP 1 Score: 99.4 bits (246), Expect = 1.2e-16
Identity = 39/53 (73.58%), Positives = 45/53 (84.91%), Query Frame = 0

Query: 328 LAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWL 381
           LAP+P+ P+QPPY KWYDP+ RCDYHAG VGHSTENC +LK KVQ L+ AGWL
Sbjct: 193 LAPIPINPIQPPYPKWYDPNARCDYHAGGVGHSTENCLALKRKVQSLINAGWL 245

BLAST of Moc11g16400 vs. NCBI nr
Match: XP_031741698.1 (uncharacterized protein LOC116403895 [Cucumis sativus])

HSP 1 Score: 99.4 bits (246), Expect = 1.2e-16
Identity = 39/53 (73.58%), Positives = 45/53 (84.91%), Query Frame = 0

Query: 328 LAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWL 381
           LAP+P+ P+QPPY KWYDP+ RCDYHAG VGHSTENC +LK KVQ L+ AGWL
Sbjct: 432 LAPIPINPIQPPYPKWYDPNARCDYHAGGVGHSTENCLALKRKVQSLINAGWL 484

BLAST of Moc11g16400 vs. NCBI nr
Match: XP_031741309.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC105435093 [Cucumis sativus])

HSP 1 Score: 99.4 bits (246), Expect = 1.2e-16
Identity = 39/53 (73.58%), Positives = 45/53 (84.91%), Query Frame = 0

Query: 328 LAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWL 381
           LAP+P+ P+QPPY KWYDP+ RCDYHAG VGHSTENC +LK KVQ L+ AGWL
Sbjct: 432 LAPIPINPIQPPYPKWYDPNARCDYHAGGVGHSTENCLALKRKVQSLINAGWL 484

BLAST of Moc11g16400 vs. ExPASy TrEMBL
Match: A0A6J1E2J7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1)

HSP 1 Score: 102.8 bits (255), Expect = 5.1e-18
Identity = 40/55 (72.73%), Positives = 50/55 (90.91%), Query Frame = 0

Query: 326 NLLAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWL 381
           N LAPVPV+P+QPPY +WYD + RCDYHAGA+GHSTENCT+LK++VQ L++AGWL
Sbjct: 366 NQLAPVPVDPIQPPYPRWYDTNARCDYHAGAIGHSTENCTALKYRVQALIKAGWL 420

BLAST of Moc11g16400 vs. ExPASy TrEMBL
Match: A0A6J1D099 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1)

HSP 1 Score: 97.8 bits (242), Expect = 1.6e-16
Identity = 50/109 (45.87%), Positives = 64/109 (58.72%), Query Frame = 0

Query: 273 LTHTDQRTSPHRQEFITH--SGLRMSFLVRRMKFSTQVNHTAKAYGRFGGHYPLGNLLAP 332
           LT T Q   P  Q+  T    G + +   R  +  TQ +     Y          N LAP
Sbjct: 354 LTPTSQNFQPRGQQHNTFYTQGQQNN---RGARKQTQFDPIPMTYTELLPQLFQNNQLAP 413

Query: 333 VPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGW 380
           VPV+P+QPPY +WYD + RCDYHAGA+ HSTENCT LK++VQ L++AGW
Sbjct: 414 VPVDPIQPPYPRWYDANARCDYHAGAIXHSTENCTXLKYRVQALIKAGW 459

BLAST of Moc11g16400 vs. ExPASy TrEMBL
Match: A0A6J1DM29 (LOW QUALITY PROTEIN: uncharacterized protein LOC111022231 OS=Momordica charantia OX=3673 GN=LOC111022231 PE=4 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 3.6e-16
Identity = 53/139 (38.13%), Positives = 78/139 (56.12%), Query Frame = 0

Query: 326 NLLAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWLKCTIR 385
           N LAPVPV+P+QPPY  WYD + RCDYHAGA+GHSTENCT+LK++VQ L++AG L  T +
Sbjct: 468 NQLAPVPVDPIQPPYPGWYDANXRCDYHAGAIGHSTENCTALKYRVQALIKAGXL--TFK 527

Query: 386 PKIIAPEVAQQLPMKPSVDVFSFSSACRGTQFNMSTAFHGLHSYPILYDLWAFQSLTL-H 445
            +         LP   +V + +    C+G +     +        +   LW    +++ H
Sbjct: 528 KENXPDVKNNPLPNHKNVQINAVE--CQGIESRSKVSEITTPMQXLFEILWXHGYMSMEH 587

Query: 446 AATGL--SKTDQHLTCTGH 462
               +   + D++LTC  H
Sbjct: 588 LCPDIRCERYDENLTCPYH 602

BLAST of Moc11g16400 vs. ExPASy TrEMBL
Match: A0A5A7VBN6 (Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold468G001210 PE=4 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 4.7e-16
Identity = 46/79 (58.23%), Positives = 56/79 (70.89%), Query Frame = 0

Query: 305 STQ--VNHTAKAYGRFGGHYPLGNLLAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTE 364
           STQ  V+ ++   G+    Y   N +A VP EPLQPPY KWYDP+V+C+YHAG V HSTE
Sbjct: 246 STQRVVHVSSPTVGQTNYSYSYQNGVAIVPQEPLQPPYPKWYDPNVKCEYHAGVVRHSTE 305

Query: 365 NCTSLKHKVQVLMRAGWLK 382
           NC  LK KVQ L++AGWLK
Sbjct: 306 NCFPLKAKVQSLVKAGWLK 324

BLAST of Moc11g16400 vs. ExPASy TrEMBL
Match: A0A6J1DZ90 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111024415 PE=4 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 4.7e-16
Identity = 45/95 (47.37%), Positives = 63/95 (66.32%), Query Frame = 0

Query: 328 LAPVPVEPLQPPYSKWYDPHVRCDYHAGAVGHSTENCTSLKHKVQVLMRAGWLKCTIRPK 387
           L PVPV+P+QP Y +WYD + RCDYHAGA+GHSTENCT+LK++VQ L++AGWL      K
Sbjct: 361 LTPVPVDPIQPLYPRWYDANARCDYHAGAIGHSTENCTALKYRVQALLKAGWLNF---KK 420

Query: 388 IIAPEVAQQLPMKPSVDVFSFSSACRGTQFNMSTA 423
              P+V++  P+    +V   +  C+G +     A
Sbjct: 421 ENEPDVSKN-PLSNHQNVQINAIECQGIESKSKVA 451

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_022158986.1  1.0e-17  72.73     LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]
XP_031738551.1  1.2e-16  73.58     LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]
XP_031737539.1  1.2e-16  73.58     uncharacterized protein LOC116402431 [Cucumis sativus]
XP_031741698.1  1.2e-16  73.58     uncharacterized protein LOC116403895 [Cucumis sativus]
XP_031741309.1  1.2e-16  73.58     LOW QUALITY PROTEIN: uncharacterized protein LOC105435093 [Cucumis sativus]

Match Name      E-value  Identity  Description
A0A6J1E2J7      5.1e-18  72.73     Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1
A0A6J1D099      1.6e-16  45.87     Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1
A0A6J1DM29      3.6e-16  38.13     LOW QUALITY PROTEIN: uncharacterized protein LOC111022231 OS=Momordica charantia...
A0A5A7VBN6      4.7e-16  58.23     Retrotrans_gag domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 ...
A0A6J1DZ90      4.7e-16  47.37     Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111024415 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  COILS        Coil         Coil                 coord: 605..607
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 108..123
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 74..125

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc11g16400.1  Moc11g16400.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0071897      DNA biosynthetic process
biological_process  GO:0015074      DNA integration
biological_process  GO:0006310      DNA recombination
biological_process  GO:0090502      RNA phosphodiester bond hydrolysis, endonucleolytic
cellular_component  GO:0030430      host cell cytoplasm
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003887      DNA-directed DNA polymerase activity
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity