Moc05g11980 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc05g11980
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Location: chr5: 9354077 .. 9375584 (-)
RNA-Seq Expression: Moc05g11980
Synteny: Moc05g11980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGGTTGAATATGAGCCTTTGACCTCGTCAAAGCTGCCAACCGCGAGGGACAATGGTAAATCGACAACGTCGCAAAGGACGTTACCTCGACATGTCACCAAGATATATGATAAACTAACCAGCCAAGAGCCCAGAATTCAAACCAGGATTTGGCACCTCTCTGTCTGGCATTCCCTGCGATCTGATCTAGTTCTGAGCTACTTAGAAGAGCGTAGACTATCCCTATAGACCAACATGAGTCCTTTCAGGATGCTTTGTCCTCATTGGAGTTCTTACGATTGAGTTACCGAAAAGAAAGGTGCACCTTATCATAGGTAGTAACTATTATTTCCTTAAGCTTTCTTAACTATATTTTCTATGGTCTCAGGATTCCTCTCATTCGGATGTGGTATCGATTCATTCTAGTCGGACGTTACATTTATTCTGATGTGCATTCAATTAATTACTAAACATGGTCATAGTACAAAAATTGATGGTCATAATACAAAAGTGTATGGAGCTTGATGCTACAGTCAGATTATTATTGAAATATAGAGATTTATTATCAATTGTCGACTTTGTGCACTGATCGAACTTAAATTATTGCATGACTTTGACTTTCATTGATAATAAAAAATTTTCTAAGACTAAATTTACATCTAAATTTTTGTTAAATGAGTGGGTGAAACGAAGTCGAATTTAAATAGAAAGTAAAATTTACAACATTTGTCCAAGTCTAAACATAATAATCCATAAAAGTTTGACTATAATGTTTATAGTTATTATATACCCACATGTTTAAAGATGAAGAGAGTAAAAATTAAAAAATAAATTATCATCAAAACACACCATTAGTGTAAAACCTAACTTTTTATTTTTCGACAAAACAAACACCAAGAAACTTTTTTAATTAATTGGTGTGACAAAATTGTTAAATAGAGGAGTCTCCAACAAAAGTAGTACCAAATACATATTTGATTTTTGACAATGTCATGTTACCCTCTACATGTTACACACGAGAGAGAGAGAGAGAAAATATAATAATTATCTAAGCCATGTGCATCCATTTTGTGCTAAATACAATGATACGCTCCACAATCATCTCTTAATTAAAATTTGAAAGTTTTGATTGGCTTTTTTGTCCTTTTAATTAACAATTAATTAATAAAAGTAATGTGGGTAGTTTAATTAGAGAGAAAATTAAAATACCCATTGCTTAATTAAGTTGTATATATGAACACACACATAAACCCCCATATGACGGTTTGTTGGGTCGCATGTGACAAGGACAAAAATGACTTTTCCTCATTTCTTTATTTTCTTAAACATCTCAACAATGGTCGGTTCTTGACTTAATTGACCACATCCCCTTTATCTAAACTCAACTACTCATGCATTATCATCCTTTTATTTTATTTCTTAAATTATCCCATCTTCTAATTTTGACAATATTATTAAGTTTAATTAATTTTTCTTTGCATTTACCGGTGATTTAACCTAATAACATGCAAGGTTTTAGACGTTTAGTACCTATTATTTTGACACTATATTAAATCACCACTAAATTTTTAAAAAATATAATCCGATCGTTGTATCATATCTATATATTTGACCTCAACTAATACATTTGAGTCTATGTATGAAGATATTATTAGAGGTGAAAACATAACTTCAATGAATACAGTGACATGCAGGTTAGAAATATAGAAAACTTTTAGGTATAATAGATAAGTTAGAACTTTGTTGAGATTGCAAGGAGGGCAAACAAAGTCTCACATTTGTCAGGGAAGAGAGTCATAATGGTATATAGACGAGAATAATTTTCTTTATTGATATGAAACTGTTTGAGTGAAACTAAAAGTAAAGACTTTGCAGATAATATCATATTATTGTAGACTGTTATTAACACTGGACAATGCATCTTAATCTAGAAGTAGTGTGCTGGTAATATATGGAAAATACAAATTTACGTTATTTTTCACCCATCAAGAATTAACTGTTTTTGATGCCTTGGATTT
AAACAATACTCTTAAATTATTTATAAAAGCAAGTCCTAATTAATTATTGATAGTAATAATATATATGAGAGGGAGCCGGTCGATATCCAAATCCAAGAGCATATTAGTTATAGGTTTTTTTTTTTTTTAAGTCTTTTCGCAGCTTTTCAGTATTGTATTGGTTTGGGATTGCCGGAAGATTTGACTTTTATTACGTGGGTGCTTAGCCCTCCCACCTTCCTAGAAACTCCTGCGTCCTTGACCACAACTTACCTAAATTGAAGTTGTCGTAAACTATACTAATATTTATCAGGCTCTATGTACTTCAATATCCTAACTCTCTTTAGTTTATCAATTTATATTAGATTTTATTTTTTGATCGAAACTCCTACTCATTGTTAAGGACAATGGTACTCCAAGTACTCCGAGTATCTCTACAAAAGGATTCACCGACACAGTTAGGTTTAAACCATGGTTCTGATACCATTTATTAAGGACAACGGTACTCCAAGTATCCCGAGTATCTCCACAATGGTATGATATTTATCAGGCTCTATGGACTTCAAGATCGTTACTCTAAGTATCTCCACAATTGCATGATATTGTCCGCTTTGGACATAGCTCTCACGACTTTGTTTTTTGTTTCAAACAAAAGGCCTCAACCTCACTACTTTTTTGTTCAACACTCAAGGATTCACCGACACAGCTAGATTTAGAGCATGGCTCTGATACCACTTGTTAAGGACAACGGTACCCTAAGTAGACTTTGTTTTTTGTTTCACCCAAAAGGTCTCACACCAATTAAGATAATTGTCCTCTCTTATATACCCATGATTATTCCCTTCCTTAGCCAATGTGAGACTTTGTTTGTAATCACCTTACAAACTTAACACTAATATTCATCGGGTTTCATGGCCCCTCAAGATCGTAACTCTTTTTGGCTCATCGACCTTAGCTAGATTCCATTTGAGCGTTTAGGCTCTGATACCATATTAATTCTTGGGAACGCTACTAAAGTCCTAGATTGAATAAATAAGGGGAAGATCATGGATATAAATGAAGATAATTATCTTCATTAGTATTGATGTTGGAAATTTTGCCTCCTAATTGGTTCTCGCCAAATGTGCACCAAGCTCGATAATCACGGGATCGGATACTTGCAAAACCGAACAACCCTAACTGGTGTGATTTCGGTCACACTAGCTCCAATGCTTAAGTCAGCGATGGGAGGATTACTAAATGAAGTACTTAGGGTTTATTTGCTTACCTCTAGGGTTCGGTTCCATTTATAGAAGTCTTGCCTTTTGGAACCTTCTCGTTGAGCCCATCTGTCCGTCGTTACTGGATCACAGGGATTCGTGGACCGGGGCACCTTAAACAATGGTAGGGGAGAGAAAGTGGGCTCGACCTATGGGCCGAGCCGGGGTCGGTATTGAGCCCCGGCTCGTTCGGTCTGTAGCTGGGTTTGGGCATCTCTTGTCTGTAGAGGCTCATTTCCTTGGCCGGATCCAATACCATAACAGTTGCCCCCACTCTCGAGATAATGCTCTTCCGACCGTCTGATCTCATCATTTCGAGAGTACATCTGTAGGTGTTGGAATATCTGGGATCGACTTTTGCCAAAACTACCATTGCAGGTGTTGGAGAAGGTCAATGCTACCTTCTTGTTGAGCCCATCCAATCCATCGCTGCTGGATCACAAGGGTTCATGGACCGAGGCACCTCAACCACTGGGAGAAGAGAGAAAGTGGGCTCGACCAATGGGCCGAGCCTACTTAGAGCTGAGCAGGGGTTGGAGCCGACCCGGTCGGCTCGGTGTTCCCTTCGTCGTTGCTTTCTGCTTCATGGTTAGGAGGTAAGCTATTGGTCGACTTCGGTTTCGCCCTCCCAGAACTAGGTCACGAACGTCCTAGCCAGGTCTTTAAAACTTGAAATTGACCCTCGCTTCAACTGTTGAAACCATACCCTGGCTGAGCCATTTAGGGTAAAGGAAAACACTCAGCATCTAATGGCATCGATCA
TCCCGTAGATGTCCATCCATTTTCGGTAGGCATCGAGGTGGTCTATCGGGTCGGTTGACCCATCGAATTGCTTTATGGCCGGGAGCTTAAATTTTGGAGGGCCTTTTCTCTTATTATTTCTTCCGTGAACGGAGAGTCGACCTGGTCTATCAATTCTTCGAGGTCAAATCCCTTCTCCTTTTGGCTTAACAGCTTCTCCAATGGCTCGGGTTGATCCTGTGACCTTTCCTTGTCGATGTTGATAAGTCCAAGGTCTCCTCGGTCATCTCGGCATTCATTGCCTCCCATGAGTATCAGGGACTTCGGTTGCCTCACGACTTGCTACTGTCCTGACAGATCGAAAATTCAGGTACGTTGTTTGTCATTCCCTCTAATTTTCAACTTGCTCTGGGCGTTTGCAGTCGATTCCTTTGCCCCGTTCAACTCCGAGGCTTTACCCTTCTGAGGATCCTTGTCGAGGGGTGGCGGGTTCACTTCTTATTGTTCCAATTGCTAGAGTAGTCGGGACATGCGTTGATACATTCCTTCAACCTTCTCTTCTAGAGCCGCGAACCTGCCCTAGACCTGCCCAGGAACTTCAACCGTAGAGATCTCGACCGACTCTTCTATACGTTCTGGAACTTGTCGATCATTTTCTCTAGGAGTTTGCTTGACAATCTGTCGGCTCGTGCGCCTTCCACTACTCCTAGCAGGGGGTCATCAATGTTCAGCTTCCTCCTCGTAGGGCAATTCTCATCAGAGTGCCGTGGACTTTCCTCCAGATTCATGTTGCTCAATCGTGTGGTTTGTGTCGTTCCCATAGACGGCTCCAATTGTTGATGCTGTAAATTTGGCCTCTTTGGTTCTCGCCAAATGTGCACCAAGCTCGATAATCACGGGACCGGATACCTGCAAAATTGAACAACCCTAACCCGGTCTGACTTCGGTCACACTAACTCCGATGCTTAAGTCAGCGATGGGAGTATTAATAAGTACTTAGAGTTTATTTGCTTACCTCTAGAGTTCAGTCAGTGGTTCTATTTATAGGAGTCTTGCCTTCTAGAATCTTCTCGTTGAGCCCATCGGTCCGTCGTCACTAGATCACAAGGATTCGTGGACCGGGACGCCTCAACCAATGAGAGAGGAGAGAAAGTGGGCTCGGCCAATGGGTCGTCGCGGGTGTAGGGCTCTCTAGTTTGTAGATGCCCATTTACTTGGCTAGATCTAATCCCATAACAATTGGTATGAGGTATTTTAGGTGAAATAAAAAGCAAAGTCGTGAATATTATATCCAAAATGGACAATATGATCATTGTAAAGATACTCGGATGTACAAAGGTACCATTGTTCTTTTGGATCCTTGACATAAACATTTTCCTATATTAATCAGAGGTGATAAGATTTCGGTAAATAAATAAGATTTTTATTGGCGTGGGACATTTTAACTTTCAGTAAAACTAAATTCGAAAGTAGACACTATTATATTATATAAATTCAGGCTATCCAATAAACCATGAGCCAAAATAGGAAGTGAGGGAGTCAAAGAGAGAGATTATAGGTAGGTGTGGAAAAATTATAATCCCAAATTGGGTGTGCAAGTTGGTCTATAAAATCCAAAGAATGATCTTAAGATGTGAGATAATTTGGTGCATAAAAAAATGAAAAGCAGGGTTGAATTAGAAGACACACAGGCATCGATGTTTGAAATTAAAAGAATAATAATTTTAGGGTAATTGGTCTATATGTCACCCTAATGGAATCTACAATGCCTTTCTTCAGGTGATCTCATAGTGAATATTTTTGCTTCATAGTTCTAGCCCTCGCTCTCTCTCCAAATCATCAAATTTCTATGCGCATGCACATATATAGGCTACTATTTTTTCTGTGTGACCTTCCAAAACAAAAATACAAAAATATTTAGGTACGTGGTAGGAATCGAACTCACGACCTTTTGATTTGACATAAATGTTAATTACCATCGAACTATACTCGCTTTGGCACTCTGTATTATATTTTATTGT
AATATTAAATTTCTCATTCGAATTATTAAAATTTCATCAACCTCTATACATATTTATCATTTACGAATATGATATAATATATAATGAAAGGTTTACGTGTTATCCTTATCAATGGGAAGTTACGAGGCGCCACGCGAAACTGTCTCATAAGGACGTGCCAGATCTCCACCACCTGCTAAGTATAGGTGAGTAGAGCGCATTGCAAGGGAAAGGGCGAACGCACCGTATGGACGCCCCCAATGCACTTAAAGGTATCAACAAACTCGCCCAGCAAGTGTCTAACACGGTGTAAGTATTGGTTTTATAAGGTGGATTCAAACAAAGTCTCACATTAGCTAGAGAAGGAGGTCATCATCGACATATAAGGACAATTATTCCATTGGTATAAGGCCTTCTGGTGGAACAAAAAACAAGACATGAGGGGTTATGCCCAAAGCGGACATTAAAATCGAAATAAATGTTGTTAAATAATGGGAAAGGAGAAAAAGAGAGTTCAAAAGGCATCGATTCAAAATGCCTCCCAATGGAATAATACCAAACAGAATTCAGAAGTGAAGCATGTCATTTATTAATTTGACTACCATATATTATAATCGGTAATCATAGAGAATATTTGACAGATATTGAGATTAAAATTTTGGAAATAGAGAAATAATGGATGAACCAAAGAGTCAAAGAAGAGTGGAGAGAGCGACAAGGGTGGTTTGAAAATCCCATGAATCTCCATTTCCAAGAAAAAAAAACGAATAAAAACAGATTGTACAGTGAAAGCTCAAAGAAAAAAAAAAACAAACAAACAATGGTCAAAAGGAAACTTTCCCACCCGCATAACATGGGAGAAAAGGAAATCGAAAAAGCAGAAGATTGTGCTTATAATTTGGTTCAAATTGAAATTTAATACAATTATATATTACAAGAAAAGTTATTGACTGTTCAAGGGGATGACAGACGTTTGAAATTAGGGAAGTTTTAAGATATCGAGGGAACGAATTGATTTGAATTGAATACGTGGTTGTCCTGTTTTAAGATATTCCCAAGACTTTGAAGTCTCCTCGAGTTTCACGTTTAAAAATGGATGTTAATGGAACATTTTGTGTCTATATAAGATTAAAAAGTGGAACCATTGTAAAAATATTGAACAATGTATCTATATTGAATAAATAATTTATGGTAATATTCAATCATTGGTTTACAACTAAATTGAATTAATCATTTGTAGATTTAGTTCTTTATAGTAAAAATATTCATATATTCGTACGGGGTTGAAAGATCTAAACTAACTTGCTAAAATATTATTCTATTTTTAAAGTTTCAAAATTTTTGGATCTTAGAAATTAAAATAATTAAGTGAGATGACACAAACACGTTTTTAACGTGGTTCGATTTCCCAACCTACCCCACGAGGCTACGTCCTAAGATTAACAATAAAAAATTAGTACAATTTACAATCAATATTGACTTATGCATAAAAGATCTCCCTTCTTAAATCATGCCGCAATACTTGATCGACTCCTCCTTCGATTGAATCTTTTCAAACACAAACTCCCCTTGTGTTGTAGTCCACCGCTTGAACTCCCTTGAATGGTGGATGTGTCTTCTCCAATAGTTGATAGAAATATTAAAAAAGAAAAACAACTCACATGATCGCACTTATTGAATTACATAAAAAATTCTCTCTCAAAATTTTACACTATAAACAATACTTGAACAATTTGAGAGAATAAAACAAACTTTGCTCCACACTTACTTGATATGTTCCAAAACTACTCAATAAATAGACCAGAAAAATTGACATGGAAGGAAAGTGAATTTATCACAATTCCAATTCAACCACAAATCTCTCAATATAATTAAAATACTATTTTCATATAATCAATCAGTAAAACGTTCCAAGAAAGATAACTAGCCGTTGGAAACATTTAAGAGCTCAAATTCAAATTCAAACATACAATTAAATTTAATTATACACCTAAAAAAATAGAGAAAATGATTAAATAAAA
CTTAATGTTTTTGCAAAAAAATGTCAATAAATAAAATTCCTTTCAAGGGTCAATGGTGTTATCGAGTGTTTGGGGGAAGAAACCAGGTTGTCACATGGAGGATAGGGATTCGTTGCAAGGGGACGCCCCCAAAGCAAAGTTCCATGTAGCTCGGGTTCCGAGGTGGATGATTTTATCATTCTCTCCAGAAATTAAGTTATTAATGCAGGAAAAGTGTCTAGATTCGACCTACTCAAACCCGACACGGCCGACCTCAATTCCGACCTAGGTAACCGCCTATCGGAGCCTAAACCTATAAATAGAGGATCTCATTTCACCATCAGTTATCGATTTTCTCCTCGAACTTAATATTGAGTCGGAGATTTTGTCCTCTTGTGCAGGTCTTCTCTTAAAAATTCAGGTCGGAGCAAAGATCGAGTTCGAACTTGACCAAATTTGACGTTGCGTAAATTGATGCATAAACCTTGAGAATAATGTAAATAATGTATTAGTTACATTTTGAGATTTTCTTCTTCTTTCTTTCTTTCTTTCTTTTTTTTTTTTTGAGAAAAAGAAATTAAATAGAAACTCTAAAAATCCTTAAAGGCTACAATGTTCGACAGCGAAGGTGGGAAGTTTGACAGCCATAGATAAGGCTGATTCCAACGAAGGGAAACACCGGCAAGGAGATGGCGCTCCCATTTACATCTCTGCCAACATGGATAAATTTGAAATGCACAAATGAATTTGCCAAAACTTTAATGTCTTCAACCCATGTTCCAATTTCGTTTTCTCTGATCAAATTGCCATTGATAAGATCTACAGCTTCCTATGAATCTGATTGAAAAACCACTTCCCAGAGACCTAACCGTTCGGCCAAACGTGCAGCTTCAAGGAGGGCACAAATCTCAGCCAACAAAGGAGTCTCCACATTTTGAGACTTCCTATTAGTATCAGTTTAAGTTGTTTATGTGCATTAGTTGCAGACTATTAGTATACTTACCTATAAATTAGTCATAAGCCTTCTTTATGAAGAAATAAGAAGAGCTCTCTTATCTTGATAATGTAAACAATGATACTAGTCCATCGCATAACAAAAAGGAATGTGACAAATAGCAGGTAAACAACAAGTAATCGGAAAATCATCACAACTTACAGTCTCTATGTGTGAGTTGAATACCAACTGTGATTTAGAATTATTGTTGCAGCAACCATCTCTCTTAATGGATTGATTCTTTCCTTCTTTCCTTTATGACTTCGATAATTTGGCTGATGTAAATTCAATTCATTAGTTTTCTTTTTATTTTCTTATGTAGGTCCACTAACTCTATGTATTTAGCCTAATCATGTAAAGCTTTGTAAAAACATCATCTTGAATTATATTTAGTGACTTTACCTCTGGTTTGGTTTCTCAATTCCTTTCATGGTATCAGAGGGCTATATATTGAGAGATCAAACCAAGATGGTTGCTATCTCCTTTATTGCTCACTCAATTAAAGATGAACCTACTCTCACAATCCAATCATTTCACCAGTGTTCCAGCTTGATATCAATCAAATTGAACTCCAGCAACTATTTAATTTACTCTGGAAATCTCAGGTCATTCCTCTTGTTCAAAGTTTTGGAATTGAGCATCACCTAGTCAATGAAATTCCACTAGATTCAGAAACCATAGATAAAAAAGGAAACATCACTGTTAATCCCTTATACAATACTTGGTTCCCAAATGATGGCTCGCTGACCGCATGGCTTCTAGGAATCATCACTGAGGAAGTATTGGCAATGATTGAGGGATTGGAAGCAACACGCCAAGTTCGGAATTACTAGAAGAAACTCTCCTAACCATCGTCAAGGAGAATAAAATTCACATTAGTGAATCTTTGTATAGTCTAAAGAAAGGTAATTTATCCATTGAAGAGTATGTCAAGAAATTTAAACTTTTATGTGATAAAATTGCAGCGACAAACCTATTGTTGACCTAAGCAAAGTGTTCACCTTGGCTCAAGGTTTGGGTCCTAAAA
TGATTTTAGGACAGTAATGTTGTCGAAGGCTCCTTATCCCATATTCAACTAGTTTATTCTTGCACTCAAATCCCATGAATATGAATCAAATCGAAGTAGAAGAAGAAAAGATCTCAATTCTAAACACAAAGCAAGCCTATTATAGCTAAAGAAGGAGAGGTAGAGGAAGAAACAAGCATTTTAATTCAAGGGGAGAGGTTTTCAACCTAGCAGAAACGTTTTATGATTTCCTTTGTTCATAAGGGTAAAAAGAAGCTAAATGGGTGCTATTTCTAACGTTTGATACTCAGAAACCGTGTTTTAAGCTTGGTTTAGTGTCCAAAACCTTTCATTTGACGTTATCTTAAGTTATACATGCAGCTATGACGTTTTTACATCATTTGGAGTTCATTTCTAATGGAAATTATGATTTTGTGCAGAAAAATATCGAACTAGGCGCTCGGCGCCTTCCTCAATCCCGTGTTTTGGTGTGATTTGAAGCTTACCCAAGCCTTTCCACGTGTCTTCCATGCATTGAATATTGTCAATTCAACAACAAACCAACTAAAAGGTGTTTGAAATGGATGGCTCCCCACAAAACCCTCAACCTCCTAGCCACCTTGAAGTTCTTACAAATACAGCAGTTCCTCTCTCTTCATTCATGCTTTAGACAATTATTTCTCAGCTTACCTCGTTGTTTCCCTTACTTTTTAGTTGCTATTGTTTTAAACCTTGCTTATTTCAAATTCATTCTTGTTTTTCAGCTTGTTACTTTCCAATTGTAAGCTTTAATCTTGTTCAATTAAAAATTCCTGCAAGTTCGTGCATCTTTATTTCTATCTTTCTCTCTATCTTCGAAGTTGTATCCACACAATCTTGTGGAATTGCCCGAGTCTTGCTCCTCAATCTTGAAAGAGAACGGGTTAGGGTCGTTTGGGTTACAACTCCACTATTTCAGCTTGTTTAATTCAATTCATCTTTTAATCATGTCGTCTACTTCCATCAACGTTCAATTATGTTTTTCCTTAAAATTCATGAGCTAAATTTATTTTCCCAAGCCCATTATAGATAGTTAGCTAAAGTTTTTAAAATTCAACAAGGCAAAAGGCGTTTTTCTTCTATTTTATCACTCAAAGTTGTTTTCATAAGTTAAAACTTGATTTGCCCAATTAGTGTTTTAACCAAGCATCAATACTGAAAATGTTGATACATTTTCACAACTTTAATGATTTAACTTTGGAGAGAAATACCGTAGACATAAAGATTTCGCTCGAGTAGTGGAGACATAGGACCTCGACCTAAGGTTACGGTTGGTTTTTATTAAATTAAGATCTTTTTTTCTAATAGACATATTAGGAAGGGTCTTATACCTTCAAAATCACTTAGACATAAGAAGATGACGAAGTAAAACCGAATAAAATTTGTGAGAGGCACCTGACGCCTAGTTGGAAAAAATGGTTTTATTTTTAGCTAATTCGAGTTAACTGGGTGGGTTTATTGCATTGAGTCTTTAAATATCAAAACCAACATTATCCTTATTATTTGTTATCCAAAAATCAATTACAAAGTCACTCATAAACAACATTTTCATTAAAAAGACTTCATTTGTTCTTGAGGATCGATGCTCAGAATACTCATTCTGCTTTACTACTTGTGACGGGTGTACACTTGCGCTCTATCATTTATTGTCATTCCAAAATCCTAAACAAGTTTTTGGCGTCGTTGCCGAGGTTTATTTGTTTTAGTTTGTTATTCATTTTTGTGATCGGTTTTTGTGATTGTCGTTTTTGTTCTCAAACACTTTGATTTATATCCAACCACGGAGGTCATTGTTCAGTGCATGAGTACGGATTCATTTCTGTTGCCATTTGATCCTGAAATTGAAAGAACATATCATAAGTTGAGAAAAGAATAGAGACTCAGAAAATAGAGAGAAAAGCAAAAAAAGAAAGAGAAAGAAAGAAAATTCGAAAGCGAAAGTGAAAGTACAATCACCGTTATGGACGAGATTCCACCACAAA
ATCTTATAGATCCACCAGTTGTTAACAAAAATATGGTAGGTCAACAACAAAATAATGAGTTCAATCATATCTTAATGGCAAGTAATAGAGACGTGGCTATGCGAGAATTTGTTGCCACAACATTCCAGAATTTGGATTCTATCATCCTAAATCCCATTCTAGAAGCCGCCAACTTCGAACTGAAGCCCCTTACGTTTCAAATGCTTCAGACATTAGGACAGTTCGACGGGCGTGAACACGAATATCCTCATGACCATCTCAAAAATTTTCTTCAAATTGCCGGTGTATTTAGATTTCCAAATATAACCGATGATGCATAAAGGTTAACTCTTTTCCCATTTTCTCTTAAGGATCAGGCAAGAACATGGCTCAACTCGTTCCCGCCAAGATCAATTACAACACGAGGGTCACTAGTAGAGAAGTTCCTTACAAAGCATTTTCCTCCTACTCGCCATGCTGACATAAGGGAAGAGATTGTCACCTTTAGACAATATGATCGTGAACCAGTGCACGAGGCATGGGAGAGATTCAAAGAGTTGCTGCAAAAGTGTCCATGGATTACTAACATGTATCCAGATCGAGCAGTTCTTCTGAGGTTTGGACCATCCCATTAAGATGATGCTCAACAATGCTGCAAATGGAGCCTTCACAAAGAAGACCTTCAACGAAATAGTTGATATTTTGGAGGACCTAGTATCCCACAATGAGCTATGGTGTTCTCAAAGGTCGAAACCTATACCTAAGAAGCAAGATACAGCAAGAGTGTTGGCATTAGACATTGCAACTTCGATGCAAAAAGAGATGGTGACAATGAACCAAATGCTAAGGAGATGGCCTTGGACAGGAAGAATACACCAGCCACACTGGTGCAACCTTCCCGTGTAACCTGAATATACTGCCCCTTCGCATGTTTTCCAAATTAATGAAATTGTTTGTTCCTATTGCAGTGATAACCATCTTGTAAGGGTATTTTCGTAAGTTGAAAAAAAATAAAAAAAATTGGAAATTGAAAAAAAAAGGGGCTTGTGTGTGAGCTGTGGAAGGGAACACGTGGAAAGGAAAAAAAATAAAAAAGAAAAGAAAAGGAAAAGGAAAGGGAAAGAAAGGAAAAGAAAAGAAAGAATTGAATCGAAGAGAGAAAGTTACGAAAAACAGGGGGGTTTCGATAGTTGAAGAAGAAGGGGTTGAAATCAGAAGTTGATGGTCATTTTGGAGCGGGAATTGAAGAGCCAACAACAATAGTTAATGAAAAGTGGTAATTTAACTTGTTTTTTTGAAGAATTTTGAGGAAATAGAAGTTAAGATTGGCTTGTGGGTTTGATTTTAAGTTAAGGAAAGTTCTTGAATTAGTTGAAGACAATGAGGACTTCTGATTCTTTGTGAACATATGTATTGATGGATTAGTTTCTGCTTGTGTATGTTGGTTTCAGAACACAGAGTGTGTGGGTAGGTGGAATTTACATGAAAAACAAAAGGAGAGAAGAAATGGAAGGGGAAGAGCCACAGTTAGGGAAGAAGAAAGTGAATAGTGTAGGGGAGAGAGAAAGACTAGGTGTTGTTGATTAAAAAAGAAAAAAAAAATGATTGGACATTGAAGCAAAATAAAACGTGAAATGAGTTTGGTTGGAAATAAGTTGGGGTATAACCAATGGTTAATAACCAAGTTGGGAATTTGTGTACTGTGGGTCCCACTTGTAAGGAAGATGAGAGAGCAAGATGAGAGAGCAAGATGTGTAATTGGATGGTAGAGAGAGAATGAAGTTGTTGATTAAATTTATGGTTTAATTTGAAGAATTGTAGAGTTTAAATTGGAAATTGAGTGTGAATTGAAATTTCATTGGTTATTATTTCAGGACGATCAGAAATTGGAGCAGCATACCAACCTAATTCACCTCGTTCATAGGATTTGTGAGTGATTATTTTACTAAATATTTTAAGATATCTATTATAGTATTTCAGGAAAAAAATATATTACATATGTTTGATTATTTTTGCTGAGA
AATTGAGTTTGCTGAGAATTTCTGAGAAACTGTTGTGTTTGCTGAATATGATTAGAAAGAAATATGTTTAGAAGTGATTTGTGAAAGAAATGTTCTTAGAAAGAAAAAAAAATTTTTAGAACTGTTTATGAGATTGAGAATTTTCTGAGAATCGTTTTCAAACCACCATATGGTTGTATCTAGGTGATTACCCGTATGCTATGTGAATGTGTTTCTATAGTATACCCAGAAATAGAAGGTCTTTGGGACCATTTTACTCCGCGCGTTGAGAGAAATATTCTGCTGTGTATATTGAGATTACTCTATGCCAGTGGACCAGAAATAGAATTATTCTGCTGTGCGCATTGAGATTACTCTGCACTAATAGACCAGAATAGAATGTGAATACATATATGGTGTTGAGTTTCCTTAATTTGTTTTAGGCTTGAAAGAAGATATTTGATATGTATTTAGAATGTGTTTTTTTTAGAAAGTATATGAAGTTTGGCTGAGAAAGCTTATTATTTCAATTTAGAGAAGTTTGTTATAAGAGTTTAAATCTCAGTTATTTTTATGGAAGAAGTTTTACGAAATTATTTCCTTATTAAAATGCCACTCACTGACCTTCCTAGCTCACTGTTTATCATTGTTTTTCTCCTCCAGACTGCACCTGAGTTTCATAGAGACTTGCCGACCAGTTTCTGCCACTCATCGTCTTAGAATTTGCATATCATTTTGTATGTACATCTAGAATGACTTATACTGGAATAGTTGTACCTTTTGTGATTGTACAACATGTTTAGGAATCGTGGCCGAGTAGTTATTTTTGTGATGAATTATGTTACACTTTATGTTTTATATATTGAAGAATAATCTAAATTAGTTTTAGAAATTTAGAAAGAAGTTGACTATAGACTCGGTAGTTGGTTGTGGGTCTGTGGTCGACGCTGTTTGTCCTATAGCTAGTAGGTGTTTAGGGCGGGCCGTGTCACATCTTTATGAGAATTGTCCGTATAATCCAGCTTTTGTTTATTATGTAGGTCACGGGAACAATCGGAACTTCAACCTCTATTCAAACACGTACAATCCAGGTGGAGACATCATCCCAACTTTTCATGGGGAGGTCAAGGCAGTTCTAGTGGAGCGACCCAAGGGAAGAACCAACAGTACAAGCAACCCTATGTTCCACCTACTTACCCAAATAATCAACCACCACAACAGCAGTTTACTCAGAATCAACAAGTTCCACCACAGCCAGCCAAAAATAATAATTCTAGCCTTGAGAATATGATGGAGTACATGGCTAGAAATGATATAATGATGAGAAACCTAGAGAGGCAAGTGGGACAGCTTGCCAATGATTTGAAGGTGAGACCGCAAGGCTCTTTTCTTGGACATACCGAAGTTCCAGAAAGAGACGGAAAAGATCAATTCAAATTGGTAACTTTGAGAAGTGGATTTATCTTATGAAGGACCAAAGATGCCGGTTGAGGTAGAGAAACCTTCCACATCAATCTAGAATGCCCCTAGTGAAGCAGATAAAGAAAAAAACACCTCGGATGATGGGGAGGCATCTGACGCCTCTCCACAAAAAAGTTCGTCTTTTGCCCCGATTCATCATTTCCCTCAAAGATTAGCCGAGAGAAATCAAGAAACTCGGTTTAGAAAGTTTTTAGACATCCTCAAACAGCTACACATAAACATTCCTTTAGTTGATGCCTTAGAACAAATGTCGAATTATGCTAAGTTCTTAAAGGATATTGTGTCTAGAAAAAAGAAGTTAGGAGAACATGAGATGGTGGCTATGACTAAGTGTAGTAGTGAAGCTGTAGGTAGTCCTTTGCCTATAAAATGTAAAAGATCCCGGCAGTTTCACCATCTTTTGTTCTATTGGACGAAAGAATTTAAGTAGAGCTTTATGCGACCTAGGTGCTAGTATTAACCTAATGCCCCTATCTGTTTTCAAAGAATTAGGTATAGGAGAAGCTAAAACCATGATAGTTACCCTCCAATTAGCCGAT
ATATCCATTAAGAAGCTTGAAGGAAAAATAGAAGATGTTCTAGTTCAGATAGATAAGTTTATCTTCCTTGCAGACTTCATCATCTTAGACTGTGAGATCGATTTGGATGTATCTATCATCTTAAGTAGACCCTTCCTAGCTACAATAGATACAGTGTTTAATGTCAAAATGGAGAAATAACAACGAAGGTTAATAATGAGAAGGTGAAGTTTAATGTGTTGGATGCAATGAAACTTTCAGGAGATTTAGAAGAATGTTCAGCCATTAGTACTTCGAACCCTGCTTCGTTTGATGAGTTTTATGATCTTTTAGTCGCAAGAATTGAGGAAAAACTAGAAGATGCAGAAGATGAATGCCCAAAAATAACTGAAGAGATGCATCCACCAGAGGAGAAAGAGGATATAAAGGCGAAAGTGTGCAAAATATTGCAACCATCCATTGTGGAACCGCCCACTTTGGAACAAAAGCCTTTACCTTCCCACTTGAAATATGCTTATCTTGGACTTAATGAAACTTTGCCTGTTATTATTTCTTCCAGTTTAACTATTGAAAATGAGCTTTCATTGTTGCAGATATTGACAAAATACAAGAAGGCCATTGGTTGGACCATTACCGATATTCGAGGGATAAGCCATGCATTCTGTATGCATAAGATCCTATTGGAAGAAGGTGCCAACAACTCAATAGAGAATCAGAGATGTTTAAATCCAAAAATGAAAGAAGTTGTTCGAAAAGAAATCTTGAAATGGTTAGATGCCGGAGTCATCTACTCTATATCAGATAGCAGTTGGGTGAGCCCAGTACAATGCGTACCAAAGAAGGGTGGGATGATGGTGGTGGTAAATGAGGAGAATGAAACTTATATCGACCCGAACTGTGAGCGGTTGGAGAGTTTGCATCGACTATCACAAACTGAACAAGGTAACGAAAAAAGATCATCTTCCACTCCCGTTTGTTGATTAAATGTTGGATCGTCTTGCAGGTAAGGAGTTTTACTACTTTTTAAACGGATATTTGGGATATAACTAGATTACCATCGCTCTGGAGGATCAGGAGAAGATAACGTTCACTTGTCCGTATGAGACATTTGCCTTTAGGCGTATGCCTTTCGGTCTATGCAATACACCGACGACATTTCAAAGGTGTATGATGACCATTTTTTCGGATATGGTAGAAAATATTCTTGAGGTTTTTATGGATGACTTTTCAGTTTTTGGAAATTGTTTTGATGATTGCTTATTGAATTTAGAAAATATTTTACAAAGATATAAGGACACAAATCTTGTTTTAAACTAGGAGAAGTGTCATTTTATGGTACGAGAAGGGATAGTTCTAGGCCATAAGATTTCGAAAAATGGGATTGAGGTAGTTCGTGCTAAGATGATGTGATTTCAAAATTGCCACCTCCCACAAATGTTAAAGAAATAAGAAGTTTTTTAGGCCATGGGGGTTTGTATAGACGTTTTATAAAAGATTTGCAAAGATTTCAAAATCCCTCTACCAACTTTTGGAAAATGATAGACCATTTGTGTTTGATGATGCTTGCATGATAGCTTTTGAAACTTTAAAAACTGCTTTGAGCTCGACTCTAATAATTATAGAACCTAACTGGGAATTACCCTTTCAACTTATGTGCAATGTTAGTGATTATGCTTTAGGGGTCATGTTAGGTCAAAGAAGGAATAGAAATTTACATCTAGTATATTATGCAAGTAAGACTTTAACGGGAGCACAACTAAACGACACCACTGTTGAAAAGGAATTATTAGCAGTTGTGTTGCTTTCGATAAGTTTAGGTCTTACTTGATAGGTACAAAAGTAATTGTTTATATTGATCATTCTGCTTTAAAATATTTGTTTGCAAAGAAGGATGCAAAACCACGTTTGATATGATGGATTTTGCTATTTCAAGAATTTGATATAGAAGTCAGGGACCGTAAAAGAACAAAGAACCAAGTGGCCGACCACTTGTCTAGGCTCGAGTCCCGACTTCAGGAAGG
TCAAACAGAGATTCGAAAACAATTCATAGATGAGCAGTTACTTCGTCTCAAAGATGTCCCTTGGTATGTCGACATTGCAAATTACTTGGTAAGTCATATTATCCCCCATGAATTCAGTCGGCAGCAAGTAAAAAAGTTTCTTCATAATGTGAGATATTATAGGTGGGATGAGCCTTTCTTGTATAAACTTGGTCCTGATGGAATCCTATGCAGGTGTGTAGTGGAGGAGGAGCACAAAGCAATTTTGGAGGCTTGTCATATGTCTCCATATGGAGGCCATTTCGCAGGACAAAAGACCGCAACGAAGGTACTTCAAAGCGGGTTCTTCTGGCCCACTCTATTTCGTGATACACATAGCTTCGTCCAGGTTTGTGATAAGTGTCAAAGGATAGACAATATATCTAAACTTAACGAGATGCCTCTGAACCTTATTCTAGAAGTTGAAATTTTTGATGTATGGGGTATTGATTTCATAGGCCCGTTCCCTCCTTCGTATGGTATGAATTATATTTTACTAGCTGTTGATTACGTTTCCAAGTGGATAGAAGCTATTGCTACCCCCACAAATGATGCTAGAGTAGTTGTTCGATTTTTACAAAAGCATATATTCACTTGTTATGGAACCCCTCAATGTTTGATAAGTGATGAGGGGACTCATTTTTTGAATATACCGGTAAAAAGTTTGTTGTCTAAGTATGATGTCCGTCACAGGATAGTCGCAACTTATCATCCCCAATCCAATGGGTTAGCTGAGGTGTCTAATAGAGAAATCAAATCTATTTTGGAAAAAACTGTCAATTTAAATAGAAAAGATTGGTCTTTAAAATTGGACGATTCCTTATGGGCATATAGGACATCATATAATACTCCTCTTGGCATGTCTTCTTATAGATTAATTTTTAGAAAAGTTTGTCATTTACCGTTAGAGTTAGAATATAAAGCTTTTTGGGTGACAAAGAGACTAAACTTTCATCTTTTTTTCTGCAAGAGAATCAAGGAAGCTCCAATTGATGGAACTTGAAGAGCATAGGAATTTTTCTTACGAAAATGCAAAGATTTTTAAAGAAAAAACTAAGTAGCGGCATGATAGAAGGCTTAGGCCTAAATCTTTTCAAATAGGAGACTTAGTTTTACTTTTCAACTCTCGTTTAAAATTATTTTCGGGAAAACTAAATTTTAGGTGGTCCGGACGATTTAAAATTTCAAAAATTTTCCCTCACGGTGCCATTGAACTGCAAAATTCAAAGGAAATATTTTTAAAGTCAATGGCCAACGTGTTAAAAAATATTTTGGAGAAGAAATAGATAGGGGAAGGCCTTTATCCAGCTTTCAGGAAGTTAAGAGTTTGAAAAAAAAAATTCTCTTCCTTGTTCAATTGAAAATTTCTTTTGAAGGATTTTTGTGTAAATTAGTTAGGAGTAGTTTTCTTTATTTTCTTTATTTTCTTTTCGAATTTTACTTGTTTTTAATTGTTGTTAGGTTATTTGAGCTGAGATTAAGGTGTTAACAAATTACTTGGGCAAAATTTTAGAAAATTCGGAATTCTTTTTATGGAGGGCGCCTGGCGCCTTGGTCCAATCGAAAAATTTTGATTTTAATTTTGGGTGTGAGTTTTGATTTGAAATTAATGAGGGTGCGATAATTTTTAGTTTTTAATCAAATTTAGGAGGTTTAAAAAAAAGGAAAATTTAAGTGGATGGCTAAATTTTGTCACCCGTTTTAAAAAATGTCTATTAGTTTTAAATTTTACACAAATGGCTAAAGGGTATTTTAGGAGTTTTACAAAAGAACGTAGTTTTCCAACAATACTCTTCTCTATGTCTTTTTGGCTTTTCTTATATTTGACCAAGTAACAAACTTTGTTTTGAATTTAAAATGATTTTCCTACATTCAAAGGATTTCAAGTTAATTATTTTAATTACTTTGTTTTGTATTTCAGTTAATTTAAACGATTTGTGTACAATGATTCATCTCTCATCCCATCGTTCTGCAATTGTTCT
ACTCTATTTTCTTAAAGTATATATACATATTTGCATCAATTGGTATATATACTTAACTACTAAGGAGTATATGCTGTCACAGGTTCTTTGTATTCTTGTTTTCCTATTTCTTTATCATTTAATAATATCATCTACAACAAATGTAAATTAAAAAACCAAAGACTAACAACCCATAAATCAAAACGATTAGGAAAATAGGTGATGTCTCTTAAATTTGTAGCCTTTGGGCATGTATCGTCATCAGATTCAGTTTCTTAAACTTCTTTTGACGATGAGTTATTTTTCTTATGTTTATCGGTGAATTCATTATTCACTCTTTCATTCTCTTAATTAGACTTTCTATCATACTTTAAAACATTAGAAAATTTATCAAAATTAAAAGGTGAATAAATGTATATACCAATTAAAGAAAAATCAAAATTAGAATTCAATACACGTAAATATCACCATGAAACTCAATTATGTCAATACATGTATATACCAATTCAATAATTAACAAAATATGAAAGTCAATACATGTATATACAATTATATAACTCAAATATCATAAGTAGAATGCCAATACATGTATATACCAATAATTACCTCACTAACACGTTCTTCTTCAACTATCTTGAGACCCTTTCCCTTTTGTGAAATATACGAGTACAGTTAGCTTAGTGAATCTAATTTGTAAATATTACTGTAAAAATTAACAAACAAGAAATATATAAACTACTGCATCTATACATCTTTTTCAGTTTCTAAGTTGATTTTGCACGATTCCTTTTAACGTTATGTTATACTAGTAGCTTTTAATCATTGTCGAGATGTTCAGCATCTAACTTTTACCATATCTATAGATTCTATTTAGAATTTAAGTTATATAAATATAGTAAGTCTGTTGTAGTTGTAGATATATGCTTCTCAGATCTAGTTATAAACAAAAGACACATATCGGTCCCACACTTGTATTTTTTTAAATATCCTAAAATATAGGGAAACCAAAAATATCTTTTTAAATATCCCTAAAATACGAAAAATCTGAACAAAGGGTCTTACCTTCCTCGAACAGTTGCTAGAAGATGGAAGTTGGGCGGTAGATCGGAAACATATCGAAGCGGCAGTAGGTGGATCTGGTTTGGGGCGGCTGGTGGATCGTAGGCCTTGGGTGGATCTGGTCGGGCGGGACGGTAGATCTGGTTTGGGACGTGGCGCATCAAGCAGATATGGTACGTGTTGCAGCCAGCGGGGAGGGTGGTGACGGTTGGGTGCGGCGGCGAAGGGTCTGGTTTTGGTTCGTGGACGGTGGCTTGAAGGTGAGCGGCACACGAATCTGCTTCGTGAGGTGAGCAGCGGGACGTGATGGTGGCCTTGAACGTGAGCGGCGGGATGGAGGCTTGCAGATCTACTTCATGAGGAAGACGATGACTGTGGAGCGCATGAAGTTGCGTGAGACCAAAAGGTGGGAAGAAGTGGGGACTCCTAAAACTCTAACTTCTCTTACTCAAGTCAGACGTGAGAAGTAA

mRNA sequence

ATGTGGGTTGAATATGAGCCTTTGACCTCGTCAAAGCTGCCAACCGCGAGGGACAATGGTAAATCGACAACGTCGCAAAGGACGTTACCTCGACATGTCACCAAGATATATGATAAACTAACCAGCCAAGAGCCCAGAATTCAAACCAGGATTTGGCACCTCTCTGTCTGGCATTCCCTGCGATCTGATCTAGATTCCTCTCATTCGGATGTGGGTTCGGTTCCATTTATAGAAGTCTTGCCTTTTGGAACCTTCTCGTTGAGCCCATCTGTCCGTCGTTACTGGATCACAGGGATTCGTGGACCGGGGCACCTTAAACAATGGTGTTGGAATATCTGGGATCGACTTTTGCCAAAACTACCATTGCAGGTGTTGGAGAAGGTCAATGCTACCTTCTTGTTGAGCCCATCCAATCCATCGCTGCTGGATCACAAGGGTTCATGGACCGAGGCACCTCAACCACTGGGAGAAGAGAGAAAGTGGGCTCGACCAATGGGCCGAGCCTACTTAGAGCTGAGCAGGGGTTGGAGCCGACCCGGTCGGCTCGGTGTTCCCTTCGTCGTTGCTTTCTGCTTCATGGTTAGGAGTCCAAGGTCTCCTCGGTCATCTCGGCATTCATTGCCTCCCATGAGTATCAGGGACTTCGGTTGCCTCACGACTTGCTACTGTCCTGACAGATCGAAAATTCAGAGCCGCGAACCTGCCCTAGACCTGCCCAGGAACTTCAACCGTAGAGATCTCGACCGACTCTTCTATACGTTCTGGAACTTGTCGATCATTTTCTCTAGGAGTTTGCTTGACAATCTGTCGGCTCGTGCGCCTTCCACTACTCCTAGCAGGGGGTCATCAATGTTCAGCTTCCTCCTCGTAGGGCAATTCTCATCAGAGTGCCGTGGACTTTCCTCCAGATTCATGTTGCTCAATCGTGTGGTCATTCCTCTTGTTCAAAGTTTTGGAATTGAGCATCACCTAGTCAATGAAATTCCACTAGATTCAGAAACCATAGATAAAAAAGGAAACATCACTGTTAATCCCTTATACAATACTTGGTTCCCAAATGATGGCTCGCTGACCGCATGGCTTCTAGGAATCATCACTGAGGAAGTATTGGCAATGATTGAGGGATTGGAAGCAACACGCCAAGATCAGGCAAGAACATGGCTCAACTCGTTCCCGCCAAGATCAATTACAACACGAGGGTCACTAGTAGAGAAGTTCCTTACAAAGCATTTTCCTCCTACTCGCCATGCTGACATAAGGGAAGAGATTGTCACCTTTAGACAATATGATCGTGAACCAGTGCACGAGGCATGGGAGAGATTCAAAGAGTTGCTGCAAAAGTGTCCATGGATTACTAACATGTATCCAGATCGAGCAGTTCTTCTGAGGTCGAAACCTATACCTAAGAAGCAAGATACAGCAAGAGTGTTGGCATTAGACATTGCAACTTCGATGCAAAAAGAGATGGTGACAATGAACCAAATGCTAAGGAGATGGCCTTGGACAGGAAGAATACACCAGCCACACTGTGATAACCATCTTTTGAAGAAGAAGGGGTTGAAATCAGAAGTTGATGGTCATTTTGGAGCGGGAATTGAAGAGCCAACAACAATAGTTAATGAAAAGTGGTCACGGGAACAATCGGAACTTCAACCTCTATTCAAACACGTACAATCCAGGTGGAGACATCATCCCAACTTTTCATGGGGAGGTCAAGGCAGTTCTAGTGGAGCGACCCAAGGGAAGAACCAACAGTACAAGCAACCCTATGTTCCACCTACTTACCCAAATAATCAACCACCACAACAGCAGTTTACTCAGAATCAACAAGTTCCACCACAGCCAGCCAAAAATAATAATTCTAGCCTTGAGAATATGATGGAGTACATGGCTAGAAATGATATAATGATGAGAAACCTAGAGAGGCAAGTGGGACAGCTTGCCAATGATTTGAAGGTGAGACCGCAAGGCTCTTTTCTTGGACATACCGAAGTTCCAGA
AAGAGACGGAAAAGATCAATTCAAATTGAATGCCCCTAGTGAAGCAGATAAAGAAAAAAACACCTCGGATGATGGGGAGGCATCTGACGCCTCTCCACAAAAAAGTTCGTCTTTTGCCCCGATTCATCATTTCCCTCAAAGATTAGCCGAGAGAAATCAAGAAACTCGGTTTAGAAAGTTTTTAGACATCCTCAAACAGCTACACATAAACATTCCTTTAGTTGATGCCTTAGAACAAATGTCGAATTATGCTAAGTTCTTAAAGGATATTGTGTCTAGAAAAAAGAAGTTAGGAGAACATGAGATGGTGGCTATGACTAAGTGTAGTAGTGAAGCTGTAGGAGATTTAGAAGAATGTTCAGCCATTAGTACTTCGAACCCTGCTTCGTTTGATGAGTTTTATGATCTTTTAGTCGCAAGAATTGAGGAAAAACTAGAAGATGCAGAAGATGAATGCCCAAAAATAACTGAAGAGATGCATCCACCAGAGGAGAAAGAGGATATAAAGGCGAAATTGCTAGAAGATGGAAGTTGGGCGGTAGATCGGAAACATATCGAAGCGGCAGTAGGTGGATCTGGTTTGGGGCGGCTGGTGGATCGTAGGCCTTGGGTGGATCTGGTCGGGCGGGACGGTAGATCTGGTTTGGGACGTGGCGCATCAAGCAGATATGCGGGACGTGATGGTGGCCTTGAACGTGAGCGGCGGGATGGAGGCTTGCAGATCTACTTCATGAGGAAGACGATGACTGTGGAGCGCATGAAGTTGCGTGAGACCAAAAGGTGGGAAGAAGTGGGGACTCCTAAAACTCTAACTTCTCTTACTCAAGTCAGACGTGAGAAGTAA

Coding sequence (CDS)

ATGTGGGTTGAATATGAGCCTTTGACCTCGTCAAAGCTGCCAACCGCGAGGGACAATGGTAAATCGACAACGTCGCAAAGGACGTTACCTCGACATGTCACCAAGATATATGATAAACTAACCAGCCAAGAGCCCAGAATTCAAACCAGGATTTGGCACCTCTCTGTCTGGCATTCCCTGCGATCTGATCTAGATTCCTCTCATTCGGATGTGGGTTCGGTTCCATTTATAGAAGTCTTGCCTTTTGGAACCTTCTCGTTGAGCCCATCTGTCCGTCGTTACTGGATCACAGGGATTCGTGGACCGGGGCACCTTAAACAATGGTGTTGGAATATCTGGGATCGACTTTTGCCAAAACTACCATTGCAGGTGTTGGAGAAGGTCAATGCTACCTTCTTGTTGAGCCCATCCAATCCATCGCTGCTGGATCACAAGGGTTCATGGACCGAGGCACCTCAACCACTGGGAGAAGAGAGAAAGTGGGCTCGACCAATGGGCCGAGCCTACTTAGAGCTGAGCAGGGGTTGGAGCCGACCCGGTCGGCTCGGTGTTCCCTTCGTCGTTGCTTTCTGCTTCATGGTTAGGAGTCCAAGGTCTCCTCGGTCATCTCGGCATTCATTGCCTCCCATGAGTATCAGGGACTTCGGTTGCCTCACGACTTGCTACTGTCCTGACAGATCGAAAATTCAGAGCCGCGAACCTGCCCTAGACCTGCCCAGGAACTTCAACCGTAGAGATCTCGACCGACTCTTCTATACGTTCTGGAACTTGTCGATCATTTTCTCTAGGAGTTTGCTTGACAATCTGTCGGCTCGTGCGCCTTCCACTACTCCTAGCAGGGGGTCATCAATGTTCAGCTTCCTCCTCGTAGGGCAATTCTCATCAGAGTGCCGTGGACTTTCCTCCAGATTCATGTTGCTCAATCGTGTGGTCATTCCTCTTGTTCAAAGTTTTGGAATTGAGCATCACCTAGTCAATGAAATTCCACTAGATTCAGAAACCATAGATAAAAAAGGAAACATCACTGTTAATCCCTTATACAATACTTGGTTCCCAAATGATGGCTCGCTGACCGCATGGCTTCTAGGAATCATCACTGAGGAAGTATTGGCAATGATTGAGGGATTGGAAGCAACACGCCAAGATCAGGCAAGAACATGGCTCAACTCGTTCCCGCCAAGATCAATTACAACACGAGGGTCACTAGTAGAGAAGTTCCTTACAAAGCATTTTCCTCCTACTCGCCATGCTGACATAAGGGAAGAGATTGTCACCTTTAGACAATATGATCGTGAACCAGTGCACGAGGCATGGGAGAGATTCAAAGAGTTGCTGCAAAAGTGTCCATGGATTACTAACATGTATCCAGATCGAGCAGTTCTTCTGAGGTCGAAACCTATACCTAAGAAGCAAGATACAGCAAGAGTGTTGGCATTAGACATTGCAACTTCGATGCAAAAAGAGATGGTGACAATGAACCAAATGCTAAGGAGATGGCCTTGGACAGGAAGAATACACCAGCCACACTGTGATAACCATCTTTTGAAGAAGAAGGGGTTGAAATCAGAAGTTGATGGTCATTTTGGAGCGGGAATTGAAGAGCCAACAACAATAGTTAATGAAAAGTGGTCACGGGAACAATCGGAACTTCAACCTCTATTCAAACACGTACAATCCAGGTGGAGACATCATCCCAACTTTTCATGGGGAGGTCAAGGCAGTTCTAGTGGAGCGACCCAAGGGAAGAACCAACAGTACAAGCAACCCTATGTTCCACCTACTTACCCAAATAATCAACCACCACAACAGCAGTTTACTCAGAATCAACAAGTTCCACCACAGCCAGCCAAAAATAATAATTCTAGCCTTGAGAATATGATGGAGTACATGGCTAGAAATGATATAATGATGAGAAACCTAGAGAGGCAAGTGGGACAGCTTGCCAATGATTTGAAGGTGAGACCGCAAGGCTCTTTTCTTGGACATACCGAAGTTCCAGA
AAGAGACGGAAAAGATCAATTCAAATTGAATGCCCCTAGTGAAGCAGATAAAGAAAAAAACACCTCGGATGATGGGGAGGCATCTGACGCCTCTCCACAAAAAAGTTCGTCTTTTGCCCCGATTCATCATTTCCCTCAAAGATTAGCCGAGAGAAATCAAGAAACTCGGTTTAGAAAGTTTTTAGACATCCTCAAACAGCTACACATAAACATTCCTTTAGTTGATGCCTTAGAACAAATGTCGAATTATGCTAAGTTCTTAAAGGATATTGTGTCTAGAAAAAAGAAGTTAGGAGAACATGAGATGGTGGCTATGACTAAGTGTAGTAGTGAAGCTGTAGGAGATTTAGAAGAATGTTCAGCCATTAGTACTTCGAACCCTGCTTCGTTTGATGAGTTTTATGATCTTTTAGTCGCAAGAATTGAGGAAAAACTAGAAGATGCAGAAGATGAATGCCCAAAAATAACTGAAGAGATGCATCCACCAGAGGAGAAAGAGGATATAAAGGCGAAATTGCTAGAAGATGGAAGTTGGGCGGTAGATCGGAAACATATCGAAGCGGCAGTAGGTGGATCTGGTTTGGGGCGGCTGGTGGATCGTAGGCCTTGGGTGGATCTGGTCGGGCGGGACGGTAGATCTGGTTTGGGACGTGGCGCATCAAGCAGATATGCGGGACGTGATGGTGGCCTTGAACGTGAGCGGCGGGATGGAGGCTTGCAGATCTACTTCATGAGGAAGACGATGACTGTGGAGCGCATGAAGTTGCGTGAGACCAAAAGGTGGGAAGAAGTGGGGACTCCTAAAACTCTAACTTCTCTTACTCAAGTCAGACGTGAGAAGTAA

Protein sequence

MWVEYEPLTSSKLPTARDNGKSTTSQRTLPRHVTKIYDKLTSQEPRIQTRIWHLSVWHSLRSDLDSSHSDVGSVPFIEVLPFGTFSLSPSVRRYWITGIRGPGHLKQWCWNIWDRLLPKLPLQVLEKVNATFLLSPSNPSLLDHKGSWTEAPQPLGEERKWARPMGRAYLELSRGWSRPGRLGVPFVVAFCFMVRSPRSPRSSRHSLPPMSIRDFGCLTTCYCPDRSKIQSREPALDLPRNFNRRDLDRLFYTFWNLSIIFSRSLLDNLSARAPSTTPSRGSSMFSFLLVGQFSSECRGLSSRFMLLNRVVIPLVQSFGIEHHLVNEIPLDSETIDKKGNITVNPLYNTWFPNDGSLTAWLLGIITEEVLAMIEGLEATRQDQARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWERFKELLQKCPWITNMYPDRAVLLRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDNHLLKKKGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFKHVQSRWRHHPNFSWGGQGSSSGATQGKNQQYKQPYVPPTYPNNQPPQQQFTQNQQVPPQPAKNNNSSLENMMEYMARNDIMMRNLERQVGQLANDLKVRPQGSFLGHTEVPERDGKDQFKLNAPSEADKEKNTSDDGEASDASPQKSSSFAPIHHFPQRLAERNQETRFRKFLDILKQLHINIPLVDALEQMSNYAKFLKDIVSRKKKLGEHEMVAMTKCSSEAVGDLEECSAISTSNPASFDEFYDLLVARIEEKLEDAEDECPKITEEMHPPEEKEDIKAKLLEDGSWAVDRKHIEAAVGGSGLGRLVDRRPWVDLVGRDGRSGLGRGASSRYAGRDGGLERERRDGGLQIYFMRKTMTVERMKLRETKRWEEVGTPKTLTSLTQVRREK
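
The protein sequence above appears to be a direct frame-0 translation of the CDS under the standard genetic code. The following is a minimal sketch of that check, not the pipeline used by the database: the function name `translate` and the variables `cds_start` and `cds_end` are illustrative, and the two test strings are the first 21 and last 18 nucleotides copied from the CDS listed on this page.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Fragments copied from the CDS on this page (hypothetical variable names).
cds_start = "ATGTGGGTTGAATATGAGCCT"
cds_end = "GTCAGACGTGAGAAGTAA"

print(translate(cds_start))  # MWVEYEP
print(translate(cds_end))    # VRREK*
```

The two outputs match the first seven and last five residues of the protein sequence, followed by the stop codon. Passing the full CDS to `translate` would extend the same comparison to the whole protein.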
Homology
BLAST of Moc05g11980 vs. NCBI nr
Match: XP_022142953.1 (uncharacterized protein LOC111012947 [Momordica charantia])

HSP 1 Score: 266.2 bits (679), Expect = 1.1e-66
Identity = 181/474 (38.19%), Positives = 239/474 (50.42%), Query Frame = 0

Query: 383 QARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWERFK 442
           QA  WLN+FP  +I T   +V+KFL K+FPPTR+AD+REEI++FRQ + E V+ AWERFK
Sbjct: 150 QATAWLNAFPSBTIXTXSDMVDKFLVKYFPPTRNADVREEIISFRQKENEAVNVAWERFK 209

Query: 443 ELLQKCPWI---------------------------------------------TNMYPD 502
           +L++ CP I                                              + + D
Sbjct: 210 DLIRNCPNIGIPACVQIEHFFRXCDIPTXMMLNGAANGKFTSKSFNEIVEILDQLSEHND 269

Query: 503 RAVLLRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDNHLLKK 562
           +    + +   K+ D A VLALD  TSMQK++ T+ QML+                    
Sbjct: 270 QWCSEKPRTQSKRADPAIVLALDNMTSMQKQIDTITQMLKNME----------------- 329

Query: 563 KGLKSEVDGHFGAGIEEPT---TIVNEKWSREQSELQPLFKHVQSRWRHHPNFSWGGQGS 622
              K+            P+    I        Q +  P        W+ HPNFSW GQGS
Sbjct: 330 ---KNNAAAALAPATTNPSPVYQIAESTCQMNQQKFNPYSNIYNPGWKQHPNFSWSGQGS 389

Query: 623 SSGATQGKNQQYKQPYVPPTYPNNQ--PPQQQFTQNQQVPPQPAKNNNSSLENMM----- 682
           SSG   G+NQQYKQ Y PP +PN+   PP  Q    Q+   QPA+ N S++E +M     
Sbjct: 390 SSGT--GQNQQYKQAYTPPRFPNSPAFPPTPQQYNQQKNYGQPAQQNLSNMEILMKEFIT 449

Query: 683 ------------------------EYMARNDIMMRNLERQVGQLANDLKVRPQGSFLGHT 742
                                   +YM RND+ +RNLE Q+GQLAN+++ RPQGS    T
Sbjct: 450 KNDATMKELMTRTDATIKDMKEVKDYMGRNDVTVRNLEMQLGQLANEVRTRPQGSLPSST 509

Query: 743 EVPERDGKDQFKLNAPSEADKEKNTSDDGEASDASPQKSSSFAPIHHFPQRLAERNQETR 778
           E P R           ++   +K    +   S  +PQ S+  +P   FPQRL  +NQ+  
Sbjct: 510 EEPRRIVSPSPSREKDTQVVPDKIVEPEVSVS-VAPQVSNCRSP-PPFPQRLVRKNQDNN 569

BLAST of Moc05g11980 vs. NCBI nr
Match: XP_022158408.1 (uncharacterized protein LOC111024897, partial [Momordica charantia])

HSP 1 Score: 265.4 bits (677), Expect = 1.9e-66
Identity = 169/349 (48.42%), Positives = 199/349 (57.02%), Query Frame = 0

Query: 381 QDQARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWER 440
           +DQAR  LN+FP  SITT GSLVEKFLTK FPPTRHADIREEI++FRQYDREPVHEAWER
Sbjct: 177 KDQARXXLNAFPXGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWER 236

Query: 441 FKELLQKC-----PWITNM--------YPDRAVL-------------------------- 500
           FKEL++KC     P    +        +P + +L                          
Sbjct: 237 FKELIRKCXNHGLPACXQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASH 296

Query: 501 ------LRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDNHLL 560
                  RS+  PKKQD A VLALDIATSMQKEMVTMNQ L+                  
Sbjct: 297 NELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK------------------ 356

Query: 561 KKKGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFK-HVQSRWRHHPNFSWGGQGS 620
                          GI+ P     +    +     P+ + +    WRHHPNFSWGGQG 
Sbjct: 357 -----------EMALGIKNPLATXIQPVQSDYCTHAPVCQVNDLICWRHHPNFSWGGQGG 416

Query: 621 SSGATQGKNQQYKQPYVPPTYPNNQPPQQQFTQNQQVPPQPAKNNNSSLENMM-EYMARN 676
           SSG  QG++QQ KQPYVPPT  +  PPQQQ+ Q  Q P  P +NNNS+LENMM EYMAR 
Sbjct: 417 SSGFNQGQSQQNKQPYVPPTQQHIPPPQQQYNQRTQTP--PIQNNNSNLENMMKEYMART 476

BLAST of Moc05g11980 vs. NCBI nr
Match: XP_022159235.1 (uncharacterized protein LOC111025653 [Momordica charantia])

HSP 1 Score: 256.9 bits (655), Expect = 6.7e-64
Identity = 182/499 (36.47%), Positives = 242/499 (48.50%), Query Frame = 0

Query: 383 QARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWERFK 442
           QA  WLN+FP  +ITT   +V+KFL K+FPPTR+AD+REEI++FRQ + E V+ AWERFK
Sbjct: 105 QATAWLNAFPSDTITTWSDMVDKFLVKYFPPTRNADVREEIISFRQKENEAVNVAWERFK 164

Query: 443 ELLQKCPWI------------------TNMYPDRAV------------------------ 502
           +L+  CP I                  T M  + A                         
Sbjct: 165 DLIMNCPNIGIPACVQIEHFFRGCDILTKMMLNGAANGKFTSKSFNEIVEILDQLSEHNY 224

Query: 503 ---LLRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDN----- 562
                +S+   K+ D A VLALD  TSMQK++ T+ QML+                    
Sbjct: 225 QWCSEKSRTQSKRADPAGVLALDNMTSMQKQIDTITQMLKNMEKNNAXAASAXATTNPSP 284

Query: 563 -HLLKKKGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFKHVQSRWRHHPNFSWGG 622
            + + +       D H         + +       Q +  P        W+ HPNFSW G
Sbjct: 285 VYQIAESTCYYCGDLHPSENCPSNPSSMYYVGQMNQQKFNPYSNTYNPGWKQHPNFSWSG 344

Query: 623 QGSSSGATQGKNQQYKQPYVPPTYPNNQ--PPQQQFTQNQQVPPQPAKNNNSSLENMM-- 682
           QGSS+  T G NQQYK+ Y PP +PN+   PP       Q+   QPA+ N S++E +M  
Sbjct: 345 QGSSN--TTGHNQQYKEAYTPPGFPNSPAFPPTPHQYNQQKNYVQPAQQNLSNMEILMKE 404

Query: 683 ---------------------------EYMARNDIMMRNLERQVGQLANDLKVRPQGSFL 742
                                      +YM RND+ +R LE Q+GQL N+++ RPQGS  
Sbjct: 405 LITKNDATMKELMTRTDVTMKDMKDVKDYMGRNDVTVRKLEMQLGQLVNEVRTRPQGSLP 464

Query: 743 GHTEVPERDGKDQ---------FKLNAPSEADKEKNTSDDGEASDASPQK---------- 778
             TE P R GK+           K   P   D+  ++    + + A P K          
Sbjct: 465 SSTEEPRRIGKEHCNSIATRSGLKYEGPRMPDESSHSPSREKDTQAVPDKIVEPAVSVPV 524

BLAST of Moc05g11980 vs. NCBI nr
Match: XP_030503898.1 (uncharacterized protein LOC115719117 [Cannabis sativa])

HSP 1 Score: 224.9 bits (572), Expect = 2.8e-54
Identity = 167/484 (34.50%), Positives = 235/484 (48.55%), Query Frame = 0

Query: 362 LGIITEEVLAMIEGLEATRQDQARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIRE 421
           L  ++EE L  ++    + +D+AR WLN+ PP S+T    L EKFL K+FPPTR+A  R 
Sbjct: 73  LNRVSEEAL-RLKLFPFSLRDRARAWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRS 132

Query: 422 EIVTFRQYDREPVHEAWERFKELLQKCP----------------------WITNMYPDRA 481
           EI++F+Q + E   +AWERFKELL+KCP                       + +   + A
Sbjct: 133 EIMSFQQLEDETTSDAWERFKELLRKCPHHGIPHCIQLETFYNGLNAASRMVLDASANGA 192

Query: 482 VLLRS--------------------KPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRW 541
           +L +S                       P  +  A VL +D  T++  +M +M  +L+  
Sbjct: 193 ILSKSYNEAFEILERIASNNYQWSTNRAPTSRKVAGVLEVDALTALTAQMASMTNILKNM 252

Query: 542 PWTGRIHQPHCDNHLLKKKGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFKHVQS 601
              G + QP         +  K E+             + N+ ++R  +   P       
Sbjct: 253 NMGGSV-QP--------ARHSKGEISSFL-------CYVGNQNFNRNNN---PYSNSYNP 312

Query: 602 RWRHHPNFSWGGQGSSSGATQGKNQQYKQPYVPPTYPNNQPPQQQFTQNQQVPPQPAKNN 661
            W+HHPNFSWGGQG+SS   QG+ +Q      PP +     PQQ        P QP  + 
Sbjct: 313 AWKHHPNFSWGGQGASSSGAQGQGKQ----SFPPGFSQQPRPQQ--------PHQPQGSQ 372

Query: 662 NSSLENMM-EYMARNDIM-------MRNLERQVGQLANDLKVRPQGSFLGHTEVPERDGK 721
            SSLE++M +YMA+ND +       +RNLE Q+GQLANDLK RPQG+    TE P RDGK
Sbjct: 373 TSSLESLMRDYMAKNDAVIQSQAASLRNLEVQLGQLANDLKNRPQGTLPSDTENPRRDGK 432

Query: 722 DQFK-LNAPSEADKEKNTSDDGEASDASPQKSSSF------------------------- 766
           +  K +   S    E N +  G    +S QK                             
Sbjct: 433 EHCKAVTLRSGKIIESNVAAKGSKEPSSIQKEGEMKKKPATSAAEIPQELQRSNKILCRR 492

BLAST of Moc05g11980 vs. NCBI nr
Match: XP_017233063.1 (PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus])

HSP 1 Score: 208.0 bits (528), Expect = 3.6e-49
Identity = 160/466 (34.33%), Positives = 231/466 (49.57%), Query Frame = 0

Query: 381 QDQARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWER 440
           +D+ARTWLNS P  S+TT   L EKFL+K+FPP  +A +R EI +F+Q D E +++AWER
Sbjct: 145 RDRARTWLNSLPAGSVTTWNDLTEKFLSKYFPPNMNAKLRNEINSFQQQDDESLYDAWER 204

Query: 441 FKELLQKCP----------------------WITNMYPDRAVLLRS--------KPIPKK 500
           FKELL+KCP                       + +   + A+L +S        + I  K
Sbjct: 205 FKELLRKCPHHGILHCIQMETFYNGLNAQTKMVVDASANGALLSKSYNQAYEILETIATK 264

Query: 501 ------------QDTARVLALDIATSMQKEMVTMNQMLRRWPW-TGRIHQPHCDNHLLKK 560
                       +  A +  +D  TSM+ ++ +M  ML+       +  +    + + + 
Sbjct: 265 NYQWSSSRAQTGKKVAGIYDVDSITSMKAQLASMEHMLKNLSMGNNQSKEQSLSSQINQT 324

Query: 561 KGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFKHVQSRWRHHPNFSWGGQGSSSG 620
           K +     G        P+   +  +   Q++  P        WR HPNFSW  QG++SG
Sbjct: 325 KNVSCVFCGEAHTYDSCPSNPESVFYMGNQNKAGPYSNTYNQSWRQHPNFSWSNQGANSG 384

Query: 621 ATQGKNQQYKQPYVPPTYPNNQPPQQQFTQNQQVPPQPAKN--NNSSLENMMEYMARNDI 680
            + G     K  Y PP + + Q PQ    +N  +     KN  + S  E +++  A +  
Sbjct: 385 TSTG---NVKSNY-PPGF-SQQAPQSNSLEN-MLKEYIIKNEASRSQTEALVQSQAAS-- 444

Query: 681 MMRNLERQVGQLANDLKVRPQGSFLGHTEVPERDGKDQFKLNAPSEADKEKNTSDDGEAS 740
            +RNLE QVGQLAN+L+ RP G+    TE P+  G +  K           NT  D +  
Sbjct: 445 -LRNLENQVGQLANELRNRPHGTLPSDTEKPKGVGNEHCKAMTLKSGKVLGNTVTDAKYD 504

Query: 741 DA--------------------SPQKSSS---FAPIHHFPQRLAERNQETRFRKFLDILK 778
           D+                    SP KSS      P   FPQR  ++ Q  +F+KFLD+LK
Sbjct: 505 DSVEPSGNEEIPDDKESENDKVSPPKSSKKSHIQPQPPFPQRFQKQKQNVQFKKFLDVLK 564

BLAST of Moc05g11980 vs. ExPASy TrEMBL
Match: A0A6J1CPJ3 (uncharacterized protein LOC111012947 OS=Momordica charantia OX=3673 GN=LOC111012947 PE=4 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 7.0e-67
Identity = 181/474 (38.19%), Positives = 239/474 (50.42%), Query Frame = 0

Query: 383 QARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWERFK 442
           QA  WLN+FP  +I T   +V+KFL K+FPPTR+AD+REEI++FRQ + E V+ AWERFK
Sbjct: 150 QATAWLNAFPSDTIXTXSDMVDKFLVKYFPPTRNADVREEIISFRQKENEAVNVAWERFK 209

Query: 443 ELLQKCPWI---------------------------------------------TNMYPD 502
           +L++ CP I                                              + + D
Sbjct: 210 DLIRNCPNIGIPACVQIEHFFRXCDIPTXMMLNGAANGKFTSKSFNEIVEILDQLSEHND 269

Query: 503 RAVLLRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDNHLLKK 562
           +    + +   K+ D A VLALD  TSMQK++ T+ QML+                    
Sbjct: 270 QWCSEKPRTQSKRADPAIVLALDNMTSMQKQIDTITQMLKNME----------------- 329

Query: 563 KGLKSEVDGHFGAGIEEPT---TIVNEKWSREQSELQPLFKHVQSRWRHHPNFSWGGQGS 622
              K+            P+    I        Q +  P        W+ HPNFSW GQGS
Sbjct: 330 ---KNNAAAALAPATTNPSPVYQIAESTCQMNQQKFNPYSNIYNPGWKQHPNFSWSGQGS 389

Query: 623 SSGATQGKNQQYKQPYVPPTYPNNQ--PPQQQFTQNQQVPPQPAKNNNSSLENMM----- 682
           SSG   G+NQQYKQ Y PP +PN+   PP  Q    Q+   QPA+ N S++E +M     
Sbjct: 390 SSGT--GQNQQYKQAYTPPRFPNSPAFPPTPQQYNQQKNYGQPAQQNLSNMEILMKEFIT 449

Query: 683 ------------------------EYMARNDIMMRNLERQVGQLANDLKVRPQGSFLGHT 742
                                   +YM RND+ +RNLE Q+GQLAN+++ RPQGS    T
Sbjct: 450 KNDATMKELMTRTDATIKDMKEVKDYMGRNDVTVRNLEMQLGQLANEVRTRPQGSLPSST 509

Query: 743 EVPERDGKDQFKLNAPSEADKEKNTSDDGEASDASPQKSSSFAPIHHFPQRLAERNQETR 778
           E P R           ++   +K    +   S  +PQ S+  +P   FPQRL  +NQ+  
Sbjct: 510 EEPRRIVSPSPSREKDTQVVPDKIVEPEVSVS-VAPQVSNCRSP-PPFPQRLVRKNQDNN 569

BLAST of Moc05g11980 vs. ExPASy TrEMBL
Match: A0A6J1DW02 (uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024897 PE=4 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 9.2e-67
Identity = 169/349 (48.42%), Positives = 199/349 (57.02%), Query Frame = 0

Query: 381 QDQARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWER 440
           +DQAR  LN+FP  SITT GSLVEKFLTK FPPTRHADIREEI++FRQYDREPVHEAWER
Sbjct: 177 KDQARXXLNAFPXGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWER 236

Query: 441 FKELLQKC-----PWITNM--------YPDRAVL-------------------------- 500
           FKEL++KC     P    +        +P + +L                          
Sbjct: 237 FKELIRKCXNHGLPACXQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASH 296

Query: 501 ------LRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDNHLL 560
                  RS+  PKKQD A VLALDIATSMQKEMVTMNQ L+                  
Sbjct: 297 NELWCSQRSRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLK------------------ 356

Query: 561 KKKGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFK-HVQSRWRHHPNFSWGGQGS 620
                          GI+ P     +    +     P+ + +    WRHHPNFSWGGQG 
Sbjct: 357 -----------EMALGIKNPLATXIQPVQSDYCTHAPVCQVNDLICWRHHPNFSWGGQGG 416

Query: 621 SSGATQGKNQQYKQPYVPPTYPNNQPPQQQFTQNQQVPPQPAKNNNSSLENMM-EYMARN 676
           SSG  QG++QQ KQPYVPPT  +  PPQQQ+ Q  Q P  P +NNNS+LENMM EYMAR 
Sbjct: 417 SSGFNQGQSQQNKQPYVPPTQQHIPPPQQQYNQRTQTP--PIQNNNSNLENMMKEYMART 476

BLAST of Moc05g11980 vs. ExPASy TrEMBL
Match: A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)

HSP 1 Score: 256.9 bits (655), Expect = 3.3e-64
Identity = 182/499 (36.47%), Positives = 242/499 (48.50%), Query Frame = 0

Query: 383 QARTWLNSFPPRSITTRGSLVEKFLTKHFPPTRHADIREEIVTFRQYDREPVHEAWERFK 442
           QA  WLN+FP  +ITT   +V+KFL K+FPPTR+AD+REEI++FRQ + E V+ AWERFK
Sbjct: 105 QATAWLNAFPSDTITTWSDMVDKFLVKYFPPTRNADVREEIISFRQKENEAVNVAWERFK 164

Query: 443 ELLQKCPWI------------------TNMYPDRAV------------------------ 502
           +L+  CP I                  T M  + A                         
Sbjct: 165 DLIMNCPNIGIPACVQIEHFFRGCDILTKMMLNGAANGKFTSKSFNEIVEILDQLSEHNY 224

Query: 503 ---LLRSKPIPKKQDTARVLALDIATSMQKEMVTMNQMLRRWPWTGRIHQPHCDN----- 562
                +S+   K+ D A VLALD  TSMQK++ T+ QML+                    
Sbjct: 225 QWCSEKSRTQSKRADPAGVLALDNMTSMQKQIDTITQMLKNMEKNNAXAASAXATTNPSP 284

Query: 563 -HLLKKKGLKSEVDGHFGAGIEEPTTIVNEKWSREQSELQPLFKHVQSRWRHHPNFSWGG 622
            + + +       D H         + +       Q +  P        W+ HPNFSW G
Sbjct: 285 VYQIAESTCYYCGDLHPSENCPSNPSSMYYVGQMNQQKFNPYSNTYNPGWKQHPNFSWSG 344

Query: 623 QGSSSGATQGKNQQYKQPYVPPTYPNNQ--PPQQQFTQNQQVPPQPAKNNNSSLENMM-- 682
           QGSS+  T G NQQYK+ Y PP +PN+   PP       Q+   QPA+ N S++E +M  
Sbjct: 345 QGSSN--TTGHNQQYKEAYTPPGFPNSPAFPPTPHQYNQQKNYVQPAQQNLSNMEILMKE 404

Query: 683 ---------------------------EYMARNDIMMRNLERQVGQLANDLKVRPQGSFL 742
                                      +YM RND+ +R LE Q+GQL N+++ RPQGS  
Sbjct: 405 LITKNDATMKELMTRTDVTMKDMKDVKDYMGRNDVTVRKLEMQLGQLVNEVRTRPQGSLP 464

Query: 743 GHTEVPERDGKDQ---------FKLNAPSEADKEKNTSDDGEASDASPQK---------- 778
             TE P R GK+           K   P   D+  ++    + + A P K          
Sbjct: 465 SSTEEPRRIGKEHCNSIATRSGLKYEGPRMPDESSHSPSREKDTQAVPDKIVEPAVSVPV 524

BLAST of Moc05g11980 vs. ExPASy TrEMBL
Match: A0A6J1DTH7 (uncharacterized protein LOC111022971 OS=Momordica charantia OX=3673 GN=LOC111022971 PE=4 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 6.6e-49
Identity = 116/171 (67.84%), Positives = 129/171 (75.44%), Query Frame = 0

Query: 637 MRNLERQVGQLANDLKVRPQGSFLGHTEVPERDGKDQFKL-------------------- 696
           MRNLE Q+GQL+NDLK RPQGSF  HTEVP+RDGK+Q K                     
Sbjct: 1   MRNLEVQIGQLSNDLKARPQGSFPWHTEVPKRDGKEQCKAVTLRSGLSYERPKMPVEVEK 60

Query: 697 ------NAPSEADKEKNTSDDGEASDASPQKSSSFAPIHHFPQRLAERNQETRFRKFLDI 756
                 N+PSEA+KEK TS +GEAS AS Q++SSF  I  FPQ LA++NQET+FRKFLDI
Sbjct: 61  PSTSIQNSPSEAEKEKTTSGEGEASGASSQENSSFLLIPPFPQILAKKNQETQFRKFLDI 120

Query: 757 LKQLHINIPLVDALEQMSNYAKFLKDIVSRKKKLGEHEMVAMTKCSSEAVG 782
           LKQLHINIPL+DALEQM NYAKFLKDIVSRKKKLGEHEMVAMTKCSSEAVG
Sbjct: 121 LKQLHINIPLLDALEQMPNYAKFLKDIVSRKKKLGEHEMVAMTKCSSEAVG 171

BLAST of Moc05g11980 vs. ExPASy TrEMBL
Match: A0A6J1DPM2 (uncharacterized protein LOC111022366 OS=Momordica charantia OX=3673 GN=LOC111022366 PE=4 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.5e-48
Identity = 125/205 (60.98%), Positives = 138/205 (67.32%), Query Frame = 0

Query: 637 MRNLERQVGQLANDLKVRPQGSFLGHTEVPERDGKDQFKL-------------------- 696
           MRNLERQVGQLANDLK RPQGS  GHTEVP+R GK+Q K                     
Sbjct: 1   MRNLERQVGQLANDLKARPQGSIPGHTEVPKRYGKEQCKAVTLRSGLSYEGPKMLVEVKK 60

Query: 697 ------NAPSEADKEKNTSDDGEASDASPQKSSSFAPIHHFPQRLAERNQETRFRKFLDI 756
                 NAPSEA KEK TS +GEAS +SPQK SSFAPI  FPQRLA +NQET+FRKFLDI
Sbjct: 61  PSTSIQNAPSEAKKEKTTSGEGEASGSSPQKKSSFAPIPPFPQRLAMKNQETQFRKFLDI 120

Query: 757 LKQLHINIPLVDALEQMSNYAKFLKDIVSRKKKLGEHEMVAMTKCSSEAVGDLEECSAIS 815
           LKQLHINIPLVDALEQM NYA F  DIVSRKKKLGE EM+AMTKCSSEAV        I 
Sbjct: 121 LKQLHINIPLVDALEQMLNYAMFQNDIVSRKKKLGELEMIAMTKCSSEAVE-----LGIG 180
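
The Identity and Positives percentages in the HSP headers above appear to be the matched-column counts divided by the alignment length. The following is a minimal Python sketch for checking that arithmetic, assuming the header layout shown on this page; the function name `parse_hsp_stats` and the regular expression are illustrative and not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header as printed on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one HSP."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, identity_pct, positive, _, positive_pct = match.groups()
    identical, length, positive = int(identical), int(length), int(positive)
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        "identity_reported": float(identity_pct),
        "positives_reported": float(positive_pct),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_recomputed": round(100 * identical / length, 2),
        "positives_recomputed": round(100 * positive / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 125/205 (60.98%), Postives = 138/205 (67.32%), Query Frame = 0"
)
print(stats["identity_recomputed"], stats["positives_recomputed"])
```

Comparing the recomputed values with the reported ones is a quick way to spot a header that was damaged during copying.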

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_022142953.1  1.1e-66  38.19         uncharacterized protein LOC111012947 [Momordica charantia]
XP_022158408.1  1.9e-66  48.42         uncharacterized protein LOC111024897, partial [Momordica charantia]
XP_022159235.1  6.7e-64  36.47         uncharacterized protein LOC111025653 [Momordica charantia]
XP_030503898.1  2.8e-54  34.50         uncharacterized protein LOC115719117 [Cannabis sativa]
XP_017233063.1  3.6e-49  34.33         PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]

Match Name      E-value  Identity (%)  Description
A0A6J1CPJ3      7.0e-67  38.19         uncharacterized protein LOC111012947 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6J1DW02      9.2e-67  48.42         uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1DY39      3.3e-64  36.47         uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1DTH7      6.6e-49  67.84         uncharacterized protein LOC111022971 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1DPM2      1.5e-48  60.98         uncharacterized protein LOC111022366 OS=Momordica charantia OX=3673 GN=LOC111022...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term      Source Description                    Alignment
None       No IPR available            COILS        Coil             Coil                                  coord: 805..825
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 12..27
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 568..592
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 660..712
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 600..621
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 1..27
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 664..692
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 693..707
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 564..621
None       No IPR available            PANTHER      PTHR24559        TRANSPOSON TY3-I GAG-POL POLYPROTEIN  coord: 376..449; coord: 710..774
None       No IPR available            PANTHER      PTHR24559:SF334  SUBFAMILY NOT NAMED                   coord: 376..449; coord: 710..774
IPR005162  Retrotransposon gag domain  PFAM         PF03732          Retrotrans_gag                        coord: 381..450; e-value: 1.9E-10; score: 40.9
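
The alignment coordinates in the table above can be turned into span lengths with a few lines of Python. This is a minimal sketch that assumes the `coord: start..end` values are 1-based, inclusive positions on the protein, which this page does not state; the helper name `span_lengths` is hypothetical.

```python
import re

# Pattern for the "coord: start..end" entries as they appear on this page.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    return [
        (int(start), int(end), int(end) - int(start) + 1)
        for start, end in COORD.findall(text)
    ]


# Pfam Retrotrans_gag hit (PF03732) from the table.
print(span_lengths("coord: 381..450"))
# PANTHER PTHR24559 hit, which is reported as two separate spans.
print(span_lengths("coord: 376..449 coord: 710..774"))
```

Under that assumption the Pfam hit spans 70 residues, and the two PANTHER spans cover 74 and 65 residues.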

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc05g11980.1  Moc05g11980.1  mRNA