Moc05g14790 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc05g14790
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Location: chr5: 11131627 .. 11147041 (+)
RNA-Seq Expression: Moc05g14790
Synteny: Moc05g14790
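
The Location field above can be turned into a gene span length. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (GFF-style), which this record does not state.

```python
import re

def parse_location(text):
    """Split a string like 'chr5: 11131627 .. 11147041 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("chr5: 11131627 .. 11147041 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 15,415 bases on the plus strand of chr5, introns included.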
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGCACACAACTATACCATCAACAACGCGCTTCTGCGCGCCTGCCATTACCTTTTGCAGTCTCTGCACGCACGCTAGTGCCCCGCACGAGCACTTGCACCTTGCCACACTGCGCGCGCCTCCTCCAGCCAAGTGTCAACCTCTCAACGCACCCCGTGGTGCCAGCTTCAACGCATCCTCTGTCAGCTTCAGCGCCCTTCTCTGCTCACCTGACAAGGCACCCACGCCTGTGCGTGTGCCCGTCTTCGCCTCGTCCATGCAAGCACCTGCCTACGCCGCATGCTGCTTGCCCATGCAAATGTCCAACAAGCCTCACACAGTGCCTCTATATGCCTTGCCTGCCTTCGTCCGCGTGCACGCCTACCAATCGCTCAGCGTCGGGCCTCTTCGCCCGCGCCCGCCATGCCGCCCATCCGCCCAGCGCACGCCCACCATGCCACCGCCCCAAGTGCGCATCTCCTTGGTGTTGCTTGCCGCCCACTCTATTCCTTGTAAGGAAATTATTTCCTTTTTATTTTTCTTAATTTAATTTTGTTTAGATAAGGAATGATTTCCCTTGCTTGGTTAGCAAGTTGGTGGATTTTAGGAAATATCTTTTTTTTCCTTATTTTATTTTAGGGAAAAATACATTTTTAGTCCTTTGAGATTTGGATTAAGTGTCTATTTGGTCCTTAAAGTTACAAAATAGACACTTTTAGTCCTTGAAGTTTGAAGTAGATGCCTATTTGGTCCATAAAGTTTGAAAATGAACATTTTTTGTCCTTTATGTTTGAGAAATGGTTGTAAATGGTCATTGACTTATACTTTGACCGTTAGTTGACTAACGTTAAAATGACGTATCAGTTAAATTGCTGACCGTATTTTGTTAAGTATATTAATTAAATTTTTTTATGACTCAATTTTTTTAATAATTAAATCTCCTTCCTTTCACCTTGCACCTTTTTCTTCCCTTCCTCCGGATGAAGGTCACCATGATCGTCGGCCGCCGTCTCAATCCTCGTCTGTCGCTATCATCGTCGCCAACCTCCGACGACCTTCGCTGGCCAAGAAGAGGGAACAATGACAAGAAGTAGAGCTCTCACTTGTTCTAGATCTGGAAAGAAGGAGAGAGGGATAGAGACTCCAAGCACAACTTCGCGTACTTGCCATATACAGCCCCGCCCTGATTCTGGTAAGCGTGTTGGGTGATGTTTATAGGGGTGGGTTTGTTTAGGGATTTGGCCACATTTTTACTCCCAGTTTGTTTGTATCGATGAACAGGTACGTTACAGAGACCGCACATCGCTCGGAAACTTTGTTCGTGTTCAATTCAAAACAAAATCCCAATCTTAAAACTCCATTTACTCTAAACCATCCAAAAATTTAACATCAATTTTATTATGAATTACGAATTTAAAACCTTGTTCACCATCGAAACTGTCGTACATGAATGTAATATGGTTGTCCTTTACACGACTCTTCACTTTCCAGATCGCATCTCTGAATCCTATCTTCCTGCCTAATTGATCATTGAATCATAGCACAACACGAATTTGGTTGCGAATACTTAGAGATATTAGAGAAGGAAGATGTGGGAGGGGCTTTAATTAAAAATGCCCATTTTCATTAATTTCCCTATAAACCCTGAAAAAGAATTTGAAAATCAAAATCGTGAGGGGAAGGAATAGAGAAGAAGGGAAGGGGCAGTGAAATTGAAGTAAAACTCACATGGAGGATGAAATATAATAATAGGTTTAAGAGGTTGTTTTGGAAGAAGCTCCATTAGAGAAGGAAAAGTTGTAGAGATGATAGAAGAGAAGGAGAAGCAATGAGAGAGAAGAGAGAGATGTAATGATAAGTGAGGCAAGGAGAGCTCTCTCCCTCTCTCCTTAGTTCCAAATCTGGAACGACGGAGAGCACTCCTTCGCCATGGTTCCCTCTTCTTGGTTAAAAAAGATCGTCGGAGGTTAGCGGTGACGACGATGACAGGCGAGGACTGAGACAGCGACCGATGGTCGT
GGTGTGACCTTCATCCGAAGGAGGAGGAAGAAGAAGGTGTGAGGTGAAAGGAAAGAGATTTAATTAGTAAAAAAAGATCCAGTAATAAAAAAATATTAAATTAATATATTTAACAAATTACAATCAACAATTTAACTGACACGCCATTTTTAACGATCAAAGTATAGCTCAAGGACTATTTAGAACCATTTCTCAAATATAAGGGACAAGAACTGTTCATTTTCAAACTTTAGGGACCAAATAAGCACCTACTTCTTTCTTCAGGACCAAAAATGTCAAATTTTGAAACTTTAAGGACCAAATAGACACCTATCTCAGACCTCAAGCACCAAAAGTGTATTATTCCCTTTATTTTATTTTTATCAAAATTAGTTCAAAGGAATAGTTGCTTTGCTTGATTAGCAAGTTGGCACCATTTAGGGGAAGATCTTTTAGTTATTTTGTTTATTTGTATTTTCCATAGAATTTAAGATTATTTTAATTTTTACAATTTCTTGTAAAGGTCATGGGGCTTAGCGAATTTGAAAATTGAGATTTTCTCACCCTTTGCACTATGCAAAATCACTCCCAAAATATTGATTTCCTTTGTGTAGTTGAAGTTAGGATTTCATATTTTATCCTCAAAGCATGCTTGTTTGACTGGTCATCAAATTCGTGGAGTATTTCTGACCTCGAGCATAGTGAGTATTTTGTCTACTCAAGGTAACGACTTCAAAGCTAATCCCTCCAACCCTTAACGATTGAGCCAACACCTACTAATGGCATAGGGGTGCCATCAGCAGTCACAATTGATATTGAATAATGAGAAGACAAAGAAACAAAAGAGGACAAATTGAGAGACATATGGTGAGCTGAGTCTAGGACCCATAAGGTAGAGGATATACCTGAAGCATTAGATGATGACAAACCTTTAGAAGATGAGGCAGACATAGCATGAGGGTGTGAAACTAGAAACTTCTGAAACTGCTCAAACATGTGTGGATCCAAAGATGGAGCGGTCATTCCATTATTGGACTGAAGTGATCTATATGTCCATGGAGTATGTTGTGGGGGTTGGTGTTGCTGCGATGGTGGCGGCTGATGTTGTTGTTGCTTTTGTTGTGATTTACCCAGTTTATTCAGCAACAAAGAACATTGAGCCTTCCAATGACCCTTCTTTTTTCAGAATGCACACTCATCATGAGACACCTGTGTGTGTGAGTTTTTCTGTATAATACCAGTGGAAGATCGCTGTGGAGTAGCAAAAACTGATGGATTCAGAGGAAGGATAATTTTCTTATCCTTTTTAGACTTGAGATGAATCTTCTTTGATAATAGTTCACTAACCACTGAATTAATTGATGGAATAGGAGTACGATGTAAAATTGACCCACGCAATGGCTCAAAATCATGACGAAGAGCCATAAGAAAGTGGACCAAGCGCTGTGAGTCCCTCCGAGTGATATAAGCCTCGAATGTGCTTAATTCTAAAGATTCTGTTAATGCCAATTGATCCCATAATTCTGACATAGAAGAGTAGAAGTCTTGAGTACTCATATCATTATGCTGTAATGCTTGAAATTTCGTCTCCAATTAATACTGCTTGGTAAAATTAGACTGTGTATACAATTTGGCCAAATAATCCCAAACCTCTTTAGAAGTATCATATTTTGCTAATTGTGCACCAATGGAGTGCATAACAGAATTATTTATTCATCTGATAATCTTCGAATTGTCAGACTCCCAAGTATCTAATGCGGATTCATAGTCATTTGCCTTGGTGTCTTTAAGCTTAACTCGTTTACCCGTAACATAGTTCCACATAGATTTTCCACGCAAGAAATTCTTCATAACATAACTCCAATACGAATAATTTTTACCATCCAATTGTACATTAATTGATTGAAGAGAATCATCCTTACCATTCCACATAATGGCAAAATCAAAACAATTCCCAATTTCACCCAACAAAACTATCACTAAATTTCTCCGAACAAAAAAAGAGTTGCACACTTGCACTA
TGCCGAAGACTAACACTACGCGTAAATTTCCAATCCACAATATGTAAATTTTCAATCCCCGATGCACAAAAATTAAGGATACTAGACGATTCCAAAATCAATCTAACAATCCAAAAGCAAATTAAGCTTCCAAATTTCAAGCAAATATCACCAAAATTCAAAATTATGTTGACTGACCCTTCTTCGATGTAAAAAAAATCCAAACCTTGACCCTTCCTCGATGCAATAAAAAATCCAAAACTCAAAACACATACTGATTGCGACAATGAGAATTTCGAAATCGATTTCTCCACGAAATTGCCTGCAAATTCAACATCTTCAAAAAAACTAAATTTACATGAAATTGCCTTCCTTGCCCGATCAACTAACAGATTTAGATTTAAATGATAATGGAGGCTCTGATACCATGAAATCGTAGAGAAAAAATTAGGAATTACTTTATTGATCGTGGAAAAATCAATTATGAACTCCACAATAAGATATAGGTTATCAACCCTAATCCTAAAAATATGAAACGACTAATTTACCCTCAAAGACTAATAATAATAATTATAATATTTATTCTAACAGGTTGAATCCTAAACAATAACAAAAGACTAATATACCTTGAAGGACTAACATTAATAATGGTAATAATAATTTATTCCAACAAGCGAGAGGGAGATTAGAAGATAATGAAGAAGAGAGAAAAAAAACAGATGAGAGGGAAGAATGAGGGAAACATGAAAGAGAGAGAAATTCAAGAAAAAATAATGGGAGACAAATACATAATTTCATACTAAATTGATATAAGCAAATGGTCAGAAAACTTACGTTTTAAAAAATAGGCAAAGATCAAATCTGCCTTAGAGAAATGTCTATTTTTGTTGAAATCTCATAAAAATCCCTCCAGAAATCTCAACAAAATCACCACAGGCAGCCTCACTAGTCACCATCTCCACCAACCTCACCATCTTCCTCATAGTCTTATCCTCTATCCAAATCAGGTCGCTCACAACTATCTCCTCTATCCAAGAGGTCATGTAGGATTCCTTGAGGCTTATTGCAATTTGGGTACATGGTGCCAGCAATTTGCCAGGTTGACCAAATATCTACTTTATAATATATAATATATATTACTATTTATTATTTTCTTAAAAATCCAAGACGTCTCGTCGTCCCGTGAAGAAACAACATTAGGATGCATTCCAACACATGGTCGTCCCTGTTACGACATGACTTGTCTTGGTGTCTCCTTGATGTGTCATTTATATTACTGTAGAAGTTGGTAAAGGGTAGTTAGGCTACTCTTGTTAGATCTTGTGATTCATTTATTATCTTTCCCTTTGTTGAAGTCTATTTAGTCTTTGCTTGTATTCTTTTCGATAATCAACTTGGCTAATAAAGCACATTGAACAAGTCTCTGGTTTCCTCTCTCACTAAGAACTTCAACTTGGTATCAAAGCTTCAATGGCTGACACCTTCATCGATCCAAACCCCCAACCCTACCAATCCAAAACCCAATCCCACAAATGTTCAGGTTCATCTTCCCAAGCCACAGGGAGTTGTACAGGTTCGATCTGGAACCACCCCTGTAGGTACCCCGACCTTTGTTAATCTCCTAAACTAGGCCACATCGATCAAGCTAGATCAAAATAACTTTATGCTTTTGCAGAATATTGTTTTGCCAATTCTAAGAAGCTACAAGCTCGAAGGACACCTAACAAGAAAGACTGTTGCCCTAGAACTATCCATCATTATACCTCCTAGAGGAACCTCAAGGTCTGCTATTGCCCAACCCAGAATATGATGTGTGGGGTGCTGCAGATCAACTCTTAATCGGCTGGCTTTATAACTTGATGACTCCTGAGGTTGCGTCCCAAGTCACAAGCTATGAAACTGCTCAAGAGCTCTGGACTGCTCTTCAAGACTTCTATGGCATCCAAGCCACTTCTCATAAAGACTATATCAAGAGCATGATGCAACAAACTCGAAAAGGGGGTATGAAAATGGGCGATTAT
CTAAGATTAATGAAAGGATATGTTGATAGTTTGCATTTGGTAGGTTCCCCAATGGATATCAAAGGTCTCATCTCATGTGTCATAGCTAGATTTGATTAGGAGTATAAACCCATTGTGGTTGTTATACAGAACATGAAAATGAATGGGAATGAAGTCCAATCACAGCTCCTCACCTTTGAAATGCGTCAAGCTCAGTTTCAAGCTTTGAAGAATGTGGTCTCTATCAACCAACCTTCTGCCAACCTTGCTACAACCAACTCTCAGAGCCATCAAAACTAATCTCACACCAAGAACTCTACCAGCAACTTCAATAAAAGTAATCGGATTGGTGGGAAAGGGCGCTTCTCTAGTTCTTCAAAGCCCATTTGTCAAGTCTATGGCAAAGTTGGGCATACGGTAGCTGTCTGCTATTATCGTTTTGAGAGAAGCTTCAACAATAACCAAAATCTCAACAAATGAGGAAGTAATTTTCACTCCAAGCAGACATCAGGCAATCATATTGCCATGATTGCTACACCAGAAATGATTCATGACACTGCCTGGTATCTGGATAGTGGGGCGAGCAACCACGTTATTGCTGAGCTTGGAAATTTGTCTATCAAGAGTGATTACACAGGTATGGCACGTATCACTGTTGGAAATAGTTGTCAACTCCCTATACAATCTATTGGTAGTTCAATTATACCTTCCAATACTGGATGCTTGATCTTAAAGGATCTTTTACATGTGCCTCAGATTAGTAAGAATTTGATTAGTATTTCCCTCTTAACCTTGGACAATAATGTTGTAATTGAATTTCATGACTCTTTTTGTGCTGTTAAGGACAAGGTCACGGGGAAGGTTCTTCTGGAAGGAATGCTTAAGGATGACTTGTACCAGATTCCTCCGTTTACAGCCTCTTCTCATGCTTTTGGTCAAAGATCTTCAAGGAATCAATGTATTTCTGTTGCTTTTATGTCTCAGATGACAAGGAAATGTAAGGGTGATGCTCTGTTAGTTATTTCTCCTGGTAGTAATCATATTTTGTATTGTTCAAAGAATATCTGGTATAGTAGGACTTAGACATCCCTCTTCTACAACCTTGAATCACATATTGACCTCTTGTAACTTGAAATGCAAAATAAATGAGAAAATCTCCTTTTGTGAGGCTTGTCAGTATGGGAAATCACATAAGTTACCATTCTCTCTCACAAACTAGAGCTTCTAAACATTTACAATTGATTCATACTGACCTTTGGGGACCCACCCATGTTGCATCTTCAAATGGCTACCTATACTACATTAGTTTTCTTGATGACTTCACTCGCTTTACTTGGATATTCCCACTCAAAAGAAAAAGTGATGCTATAGACATCTTTAAACAATTCAAGTCTCAAGTTGAAAACTTGTTTGAAACCACTGTTAAGACCATTCGATGTGATGAGGGAGGAGAGTTCAAACCTTTATCTCGCTATGCTTCAGAGATTGGGATACAAATTCAGTTTGCTTGTCCTTACACCTCTCCTCAAAATGGTAGAGTGGAACGGAAGCACAGACACATTGTTGAGACCGACCTTGCACTTTTAGCTCAGGCCAAAATGACACTAAACTACTAGTTTGATGCCTTTCATACTGTAGTTTACTTAATCTACTGATTGCCTACTGCTGCTCTCAATGGTCAGGTGCCATTTTTGGTTCTTCATAAGAAGCAACCTGATTACAACACCCTACGTGTTTTTGGTTATGCCTGCTACCCCTACCTTCGATCATACTAGAAACACAAGTTTGATTTTGACACTATCAAATGTGTCTTTTTGGGATACAACAACAATCATCGTGGTTATCGATGTCTCAGTCCCTCAGGGCGGATTTATGTATCTCGGCATGTTTGTGTTAATGAAGAGCAATTTCCTTTTGCTGATCACTTTATGGTAGAGTCTCAGATGACACATGATTACGCTTGTTTACATGCGTAAAAAGAAGGTTTTAGATGCTGTTTTTACAATGTTTCAGCTTTGTTT
CGGATTTTTCTCGGTCGTTTGAGTTCCAATTCATGCAAAATTCATATATTATGGTTTGTGGTTGGCTTATATTGTATTTATGACATTTTAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGACGGAAAAACCATCATGGGAGGCGCCAGGCGCCTGCGTGCCTGCAGAAAAACAGTTTTTCTTCCAACTTTGCCCTTAATGAAACGCGTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTATTCCACTATCAGTTTGAGCACGATTTGGTCAGTCAAAAGAGTTGGAAATGGGTGATTCACCATCGTTTTAGGACCTACCTAACCAATTCTGCACAGCTATAAATTCCAGCAACTTCTCCACCAAAATCATCAAGCCTTGCTTCCTTCTCTTGATCATTTTTAGTGTTTCTTTGTTGCTTTCTAAGTACTTTTGAGTTGCTTTCATTAGTTACTTTTGGTTTTTAAATCCCGAGTACTTTACATTCTTTTATTGAGCTTTGTTTTGTTTCCATTTGTACTTAGTTATTTTCTACTTATTCAAATATATAAAAATCTAGCACTTTGCTTATCTTCCCTTTGCCTTTGCTTTATTTTTAGTTAAGAGAGTTGTGTTTTTGGAGGAAAGAGCACGTGACACTCCCCCTTGCTTGAAAGAGAAGGGTGAAGCTCTCAACCCCGATAAAAACTCTCCGTTAGTTAACTTTTGTTTTTCAAACATGCATTTCAATTTCAACCACTCTACTTGCTGTATTTTCTTCAATATGTCTTTCTAAATTCCCGTCTCTAGCCCGGTTATGGATAGTTAGCATTTTATTTCTAATAAGGGTTGCTAAGAAGTAGTGTTTTTGCTTTCCATTCATTGTTCAACATGTTTTCCCAAAAAGTTAGGATTTAGAGAAGGCCACTGTAAGTCTTAACAAGTCAATATTGAAAAGGTTGCTTATTTGCATTGTTGAGTGGTTAAACCGAGGAGAGGAGCGCCTTAGACATAAGGGTTCCGCTTAAGTAGCCTAGACATAGGACCTTAACCCGAGGTTGCGACTATTTCCATCCTAAAACCAAGATATATCTCATAATAGACATATTAGGGAATGTCTTAGGACTTTGAACTTGCTTAGACATAAGTCAAGTGAGAAGCGTAGTCGTTGGTTTCTAAAGTAAAATGCATATTGTCCTTAGTATGTCCTTGATTCTTTGTTGGCTAACTTGAAGTAATTCGGGCTAGGTTTTATTATTATAAGTCCAAATCCCAAATCCCAAACCATTATTTGTGACCACTTCCCATTTTACAAGAATCACTCATTTCAACATTCACTATTCTTTAAAACTTCAGAGTTTGGAAATTATTTAGAAGTCCTTGAGAATCGATACTTGGATACTCCATCCATTTTATATTACTCTGACAGGTGTGCGCTTGCATCTATATAATTGTTACCACGCTTAGTGCGTGCGTTCAGCATAAACGCCCAGTCAACACACTTCAACTCTACGACTCACTTAAGCAATGACACCATTCTATCTTGGCTCCCCATCCACGTTTCTACTCCCACCGATTATTCTTTGGTCATTATTCATGCTCAGTCACCGTTACTCAACACCGAGCCCTTCTCCCCTACTACAACACCTCCATCACCCAATCAACTCTATACCCCTCATCCCACCTCACCTATTCTTCAGTCTTCACCCGACACCTCTTAACTCATTTCCCCTATCTCCCCTGTTTCACCCATTCCATCTTGCACTAGCAGTCACCCGATGCTTACTCGTGGCAAGGCTGGCATCACCAAACCAAAGCAATTCTTCGGTTGTTATTCCCAATCCAATCTTGGTGTTGATTAGTCCCTTACAAGACCTACAAAATTTTCAGATGCTGTTTGTAATCCCTTCTTGGAAGGATGCTATGAATGCTGAGTTCAATGTCTTACAACTTAATCAAATTTGGGTTTTGGTTCCTCCCACTGAAGCCTCTC
ATCTCATAGGCAACAAATGGGTCTACAAAATAAAACGACGTGCAGATGGGAGTATTAAGCGTTATAAAGCTCGCCTCGTTGCCAAAGGTCTCTCAAACCCTTGGCATTGATTTTCATGAGACTTATAGTCCTGTCATTAAACCGGTTTTGATTCGACTAATTCTTACTCTTGCTACTACTTGGAATTGGCCTGTTTGGCAACTAGATGTAAGCAACGCGTTCCTCAATGGTGGCCTTAATGAGACTTTGTTTATGCCTCAACCGCAAGACTATGTTGATCCGTTGTGCCCTTACCATGTTTGTCAATTGAACAAGGCTCTCTATGGTCTTAAGTAGGCCTCAAGAGCATGGTATGATAAGCTTCGTCAGGGCCTTACAGAATCGGGCTTTTGTCAAGCTCAAACCGACCCTTCCCTCTTCTACTGCCTGCACTCCCACCAGCCTACTTATCATCTTCATATATGTTGATGATATTCTCATCACTGGTTCGAATGGCACGCTTATTAATCGTCTCATTCATAATCTCAACCGTAGATTCACTCTTCGGGATTTTGGCCCTATCCACTATTTTTTAGGGATCCAAGTTGACAGAACACCTACCGGTTTTATGCTCTCTCAAGCTACTTACATCCGTGATCTCCTTTGCCGTCTTCAGATGCAACATATCAAGGCAGCCCCCACACCTATGTTCTTCTCACTCGGCCAGATATTGCTTTTGCTGTCAACCGGTTAAGTCAATTCTCTCAGGTTCCTTCCATCTCCAATTGGCAATCACTTAAACGCGTTCTTCGTTACCTTAGTGGCTCCCTACATATTGGTCGTGCTATACAAATGGCTTCGACTCTCACTTTCACTACCTTTTCTGATGCTGACTGTGCTGCATGTCCCTTTGATCACAAGTCCGTGGGAGGTTACTGTGTTTTCCGGCTCTACTTTGATTTCTTGGTCCTCCAAGAAGCAACAAGTCATAGCTCGATCCAGTACGGAGTCTGAGTACCGTGCTCTTGCACATGTATCATGCGAATTGAAGTGGATTCAATTCCTATTTTCTGAGCTTAACGTTCCTCTCTCTTCCACACCAATCATATGGTGTGATAATGTTTCCGCTGCTGCCATTGCTCGAAATCTTGTTACTCATGCCCGCACCAAGCACATTGAGATTGACATTCACTTCGTTCGTGACCAGGTGCTGCAACAACAACTAGATATTCGATATGTTCCCTCTGTAGATCAGTTGGTCGACTGCTCACCAAGCCTCTGCATATCTCCAGATTCAATGGGACATTGGGGTTTTTGACCCCACCAATAGTTCAAATTGGATAAATGAACCCCTTTCAAAAAAACCTAATTTTTGACCCCACCTCTACTTAATTTTGAACCTCTCCCTAAGCCAATTTACCATAGTACCCTCATGTTTAAAAATGAAGGGAACTCCCTCCAAGTAAATAAGTAAAATCCTAAACCCCATCTCGGACGTTTCCTGGCCCAACAAAAATTTTCACCATTTCTCGAACGTTTCATTTGTCCTAAAATCCTAAAACCCATCTCGAACGTTATCTCGGACAAAGGACGTTGGACGATTCATCCATCCCGAACCTTACGAAGTTATCTCAGACGTTTCTCTTCCATTGTTTCGCCCTACGAAGTCATCGATCGCAAGGTTAGTGGTCGGGTAGATTATAACAAAATTTTTAGAAACGTTTGTTGTTCTGATTTCATTATCGGCACTATCTGAAGCAGCCATGTAGATCTAGGGTCATTTTCATGTTGTGCTTCCAGGACTGAGAAATTTGGGGAAAAAATCAATCTCTGATTCAATCCCGTATTCAATCCCGGATGATTTTGGTCCGGGATCGAATACGAGATTGAATCCGGGATCGAATTCGAACTGGCTGCGTAAGCTTCCATCCCAGACCACCCGGACCATTTCGCCTTCGGGATGAAAGACAGAAAATGAAACTCCATTCATCCCGGAACGAAACGGTCCAGGATAAGGTCCG
GGATGGAAGGCTTTCACAACATAATGCATTCTGCCTTCCATCCCGGACCTTATCCCGGACCGTTTCGGTCCGGGATGAAAGGCAGAGAATGAAACCCCATTTTCTGCCTTTCATCCCGGAGGCGAAATGGTCCGGGATAAGGTCCGGGATGGAAGGCAGAATGCAGACACTTCGAATTTTTTTTTTAAGTTGGTGTGCACTGAACGTTTTTTTTAATTTCCTTGCTACTTTATGTAATTTTTTTTTACCTTTGTATAGGACATATAATGGATTTGAGACTAATTCTAAACCGTGATGACTGGTTTCCGGCCACGTTGACCAACCTTGCCCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCCAACCCAGTTAGACATGTTTAGTCAAACGTGTTTCGGTCCCATTTTGGACATGGACGTAGTTTTTAACGGTCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTTGACCAAATCACCGGCCTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCTCGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGCTGTCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATGATGGGGAAGGAGAGGAAGCAGTTTATAGATATGACCCTTTTAGGGGTTGTGGATAGGTGGGAGCTGTTCTGCAATCACGACTGGAGTTAGTTGATTTCGAAAGAACACTTTGGAGCCTGAAGAATGCCCTGAAGGATAAACTACCGACGTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTACGCATTTCAGGTTAGCGAGCTTTAAGTCAACTATATAATCGTATATATTATTTAAATGCATTATTTATTTGACTTCACTGGATTCTAGGTATAGGCTTACGAGACGATATCGACGTTGAGTCTGCGCGTAGCCACGAGGCTGAGCGACGACGCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTACTCTGTAGAGAGATGTGTTCGATAACACGATGGTAAGTAATTATCTTGACCCCGTACGATTCAAATTTATTTTTTCATTTGGGTCTAATCATAAGTTTTTTGCAGTCCAAGGTTAATGAATACTTGGTTTTGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCATCGGAAGCCCGCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGGATGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGCCTGCAAATTTGGAAAGGGGTGCTAAGGAAATAAGGGTGAAGGGCAAAGGAAAAAACATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACGATGTTGCATTACAGGATTCTGCATTAGACGATGCTGGACCCAGTGGAAATGACAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAAATTCAAAAATAAGATCAGTAGAAGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACACTGATTGGCTTCGAGGCCACACTGACTGGCTTCGGGGTTGCCCTGAAAGGTATCCAGAGATACCTTAAGAAAATGTCGAAGGTACGCATTTTATATGTATATAACATTAGGTATGCTTAGTTATTAATTATGATTAGTTAGGTCCGAGATAACGCTCATTGGTCCGTACTTTTCGTGCAAGGTAAATTCCGTGATCTGACCAAATATTTTGGACGTGGGGATGGACCCGATGATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACACAAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGGAA
GCGGTCCCTAAGACTGACGAGTATCAGACCATGGACGATAATCCGAAGAGTATGGACGAGGATCCGAAGAATATGGACGAGGATCCGATGTTTATGGTTGAAGACCAGGGTACGATAACGGAGCGGGACAATGCATCGAATGCTTACCTCGATCGTCCTGTCGGTTTGTTTCAGGTAAGTCCTATTTCTATAACGCTAGTATACGTTTGTGCTTCCATATACCAATATGATTAATATTGGTTGTATATTATGATGCCACTGTTGGAATGCAAGAGCCGGACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAATCATTAAGGTAAACTGTCATTACATATACATATTAGCTGTCAGCTGTCATTACATGAACTCACGTGTGTATTGTGTTTTAGGTTGAACCTTACCTTGACCAGGATGAATATGACCTTCAGCAGGCCCCAACTGGGCGTGGGCTACGCAAGGGGCATTACTCTTGGAAACTGAAGGATATATACACACCAACCGGTCAGCGTGGGATCACCGTGGATAGATACGACCCAGTATGTCCCATTCCACCGCAGTTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACGGGAGATTGCGGTCCACTGCAACTGGTTTCCAAAAGAAGGAATGGTATCGCGATCTATTGGACCCTAATGTTGAATTGAAGGACGAAGTAAGTACATGTTACTATATTCGTAGTACTATATTCGTCAATATGTTTGATTACATCAATGTGGATTGGTTAAATGTAACAGGTACTTGATGGTCTCGTCCTGTTTACAGCAAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGAAGTTTGTGATAGGTGACGTACTTCTTTCGGTAATTACTTTTTACTTCTTCCAGTTCGAACTAATAAATGCTGTTTTGAACAATTCTTTAACTAATGAATTACTTTTTAACTGCTATACAGACTCTGCTGAATCGGACAGACGGTCCATATGCGGCCATGAAGCCGGGTGTATTGTCCACTAGGATCAACTACCCCTGGCGCGAGGAGAATACAATCTGGCGATATGTCCACGGTAGGCAGTCGGACCACAACGTGCCCTGGAGTGATGCAGGCATCGTGTACACCCCCATGAACGTAGGCGGGAACCACTGGGTGATGCTCGGGATCGACCTTGTACAGGGCGACATAACCATATGGGATTCACTCCAAACGGCCACTCCACTGGATGAACTTGAGAAGGAGTTGAAGCCCATGTGTACAATCCTACCTGCACTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAGTGGTGCCGTGGAGGGTACGTCGGGTTCGCGTACCACAGCAGAGTAGCGGCGACTGA

mRNA sequence

ATGGCGCACACAACTATACCATCAACAACGCGCTTCTGCGCGCCTGCCATTACCTTTTGCAGTCTCTGCACGCACGCTAGTGCCCCGCACGAGCACTTGCACCTTGCCACACTGCGCGCGCCTCCTCCAGCCAAGTGTCAACCTCTCAACGCACCCCGTGGTGCCAGCTTCAACGCATCCTCTGTCAGCTTCAGCGCCCTTCTCTGCTCACCTGACAAGGCACCCACGCCTGTGCGTGTGCCCGTCTTCGCCTCGTCCATGCAAGCACCTGCCTACGCCGCATGCTGCTTGCCCATGCAAATGTCCAACAAGCCTCACACAGTGCCTCTATATGCCTTGCCTGCCTTCGTCCGCGTGCACGCCTACCAATCGCTCAGCGTCGGGCCTCTTCGCCCGCGCCCGCCATGCCGCCCATCCGCCCAGCGCACGCCCACCATGCCACCGCCCCAAGTGCGCATCTCCTTGGTGTTGCTTGCCGCCCACTCTATTCCTTTTGAAGTTAGGATTTCATATTTTATCCTCAAAGCATGCTTGTTTGACTGGTCATCAAATTCGTGGAGTATTTCTGACCTCGAGCATAGTGAGTATTTTGTCTACTCAAGCACATTGAACAAGTCTCTGGTTTCCTCTCTCACTAAGAACTTCAACTTGGTATCAAAGCTTCAATGGCTGACACCTTCATCGATCCAAACCCCCAACCCTACCAATCCAAAACCCAATCCCACAAATGTTCAGGTTCATCTTCCCAAGCCACAGGGAGTTGTACAGGTTCGATCTGGAACCACCCCTGTAGAGGAACCTCAAGGTCTGCTATTGCCCAACCCAGAATATGATGTGTGGGGTGCTGCAGATCAACTCTTAATCGGCTGGCTTTATAACTTGATGACTCCTGAGGTTGCGTCCCAAGTCACAAGCTATGAAACTGCTCAAGAGCTCTGGACTGCTCTTCAAGACTTCTATGGCATCCAAGCCACTTCTCATAAAGACTATATCAAGAGCATGATGCAACAAACTCGAAAAGGGGGTATGAAAATGGGCGATTATCTAAGATTAATGAAAGGATATGTTGATAGTTTGCATTTGAACATGAAAATGAATGGGAATGAAGTCCAATCACAGCTCCTCACCTTTGAAATGCGTCAAGCTCAGTTTCAAGCTTTGAAGAATGTGACATCAGGCAATCATATTGCCATGATTGCTACACCAGAAATGATTCATGACACTGCCTGGTATCTGGATAGTGGGGCGAGCAACCACGTTATTGCTGAGCTTGGAAATTTGTCTATCAAGAGTGATTACACAGGTATGGCACGTATCACTGTTGGAAATAGTTGTCAACTCCCTATACAATCTATTGGTAGTTCAATTATACCTTCCAATACTGGATGCTTGATCTTAAAGGATCTTTTACATGTGCCTCAGATTAGTAAGAATTTGATTAGTATTTCCCTCTTAACCTTGGACAATAATGTTGTAATTGAATTTCATGACTCTTTTTGTGCTGTTAAGGACAAGGTCACGGGGAAGGTTCTTCTGGAAGGAATGCTTAAGGATGACTTGTACCAGATTCCTCCGTTTACAGCCTCTTCTCATGCTTTTGGTCAAAGATCTTCAAGGAATCAATGTATTTCTGTTGCTTTTATGTCTCAGATGACAAGGAAATGTAAGGGTGATGCTCTGTTAGTTATTTCTCCTGGACATATAATGGATTTGAGACTAATTCTAAACCGTGATGACTGGTTTCCGGCCACGTTGACCAACCTTGCCCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCCAACCCAGTTAGACATGTTTAGTCAAACGTGTTTCGGTCCCATTTTGGACATGGACGTAGTTTTTAACGGTCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTTGACCAAATCACCGG
CCTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCTCGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGCTGTCAAGGTGGGAGCTGTTCTGCAATCACGACTGGAGTTAGTTGATTTCGAAAGAACACTTTGGAGCCTGAAGAATGCCCTGAAGGATAAACTACCGACGTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTACGCATTTCAGGCTTACGAGACGATATCGACGTTGAGTCTGCGCGTAGCCACGAGGCTGAGCGACGACGCCATTCCTCGACTCCTTAGTTTTTTGCAGTCCAAGGTTAATGAATACTTGGTTTTGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCATCGGAAGCCCGCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGGATGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGCCTGCAAATTTGGAAAGGGGTGCTAAGGAAATAAGGGTGAAGGGCAAAGGAAAAAACATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACGATGTTGCATTACAGGATTCTGCATTAGACGATGCTGGACCCAGTGGAAATGACAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAAATTCAAAAATAAGATCAGTAGAAGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACACTGATTGGCTTCGAGGCCACACTGACTGGCTTCGGGGTTGCCCTGAAAGGTATCCAGAGATACCTTAAGAAAATGTCGAAGGTCCGAGATAACGCTCATTGGTCCGTACTTTTCGTGCAAGGTAAATTCCGTGATCTGACCAAATATTTTGGACGTGGGGATGGACCCGATGATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACACAAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGGAAGCGGTCCCTAAGACTGACGAGTATCAGACCATGGACGATAATCCGAAGAGTATGGACGAGGATCCGAAGAATATGGACGAGGATCCGATGTTTATGGTTGAAGACCAGGGTACGATAACGGAGCGGGACAATGCATCGAATGCTTACCTCGATCGTCCTGTCGGTTTGTTTCAGCCGGACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAATCATTAAGGTTGAACCTTACCTTGACCAGGATGAATATGACCTTCAGCAGGCCCCAACTGGGCGTGGGCTACGCAAGGGGCATTACTCTTGGAAACTGAAGGATATATACACACCAACCGGTCAGCGTGGGATCACCGTGGATAGATACGACCCAGTATGTCCCATTCCACCGCAGTTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACGGGAGATTGCGGTCCACTGCAACTGGTTTCCAAAAGAAGGAATGGTATCGCGATCTATTGGACCCTAATGTTGAATTGAAGGACGAAGTACTTGATGGTCTCGTCCTGTTTACAGCAAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGAAGTTTGTGATAGGTGACGTACTTCTTTCGACTCTGCTGAATCGGACAGACGGTCCATATGCGGCCATGAAGCCGGGTGTATTGTCCACTAGGATCAACTACCCCTGGCGCGAGGAGAATACAATCTGGCGATATGTCCACGGTAGGCAGTCGGACCACAACGTGCCCTGGAGTGATGCAGGCATCGTGTACACCCCCATGAACGTAGGCGGGAACCACTGGGTGATGCTCGGGATCGACCTTGTACAGGGCGACATAACCATATGGGATTCACTCCAAACGGCCACTCCACTGGATG
AACTTGAGAAGGAGTTGAAGCCCATGTGTACAATCCTACCTGCACTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAGTGGTGCCGTGGAGGGTACGTCGGGTTCGCGTACCACAGCAGAGTAGCGGCGACTGA

Coding sequence (CDS)

ATGGCGCACACAACTATACCATCAACAACGCGCTTCTGCGCGCCTGCCATTACCTTTTGCAGTCTCTGCACGCACGCTAGTGCCCCGCACGAGCACTTGCACCTTGCCACACTGCGCGCGCCTCCTCCAGCCAAGTGTCAACCTCTCAACGCACCCCGTGGTGCCAGCTTCAACGCATCCTCTGTCAGCTTCAGCGCCCTTCTCTGCTCACCTGACAAGGCACCCACGCCTGTGCGTGTGCCCGTCTTCGCCTCGTCCATGCAAGCACCTGCCTACGCCGCATGCTGCTTGCCCATGCAAATGTCCAACAAGCCTCACACAGTGCCTCTATATGCCTTGCCTGCCTTCGTCCGCGTGCACGCCTACCAATCGCTCAGCGTCGGGCCTCTTCGCCCGCGCCCGCCATGCCGCCCATCCGCCCAGCGCACGCCCACCATGCCACCGCCCCAAGTGCGCATCTCCTTGGTGTTGCTTGCCGCCCACTCTATTCCTTTTGAAGTTAGGATTTCATATTTTATCCTCAAAGCATGCTTGTTTGACTGGTCATCAAATTCGTGGAGTATTTCTGACCTCGAGCATAGTGAGTATTTTGTCTACTCAAGCACATTGAACAAGTCTCTGGTTTCCTCTCTCACTAAGAACTTCAACTTGGTATCAAAGCTTCAATGGCTGACACCTTCATCGATCCAAACCCCCAACCCTACCAATCCAAAACCCAATCCCACAAATGTTCAGGTTCATCTTCCCAAGCCACAGGGAGTTGTACAGGTTCGATCTGGAACCACCCCTGTAGAGGAACCTCAAGGTCTGCTATTGCCCAACCCAGAATATGATGTGTGGGGTGCTGCAGATCAACTCTTAATCGGCTGGCTTTATAACTTGATGACTCCTGAGGTTGCGTCCCAAGTCACAAGCTATGAAACTGCTCAAGAGCTCTGGACTGCTCTTCAAGACTTCTATGGCATCCAAGCCACTTCTCATAAAGACTATATCAAGAGCATGATGCAACAAACTCGAAAAGGGGGTATGAAAATGGGCGATTATCTAAGATTAATGAAAGGATATGTTGATAGTTTGCATTTGAACATGAAAATGAATGGGAATGAAGTCCAATCACAGCTCCTCACCTTTGAAATGCGTCAAGCTCAGTTTCAAGCTTTGAAGAATGTGACATCAGGCAATCATATTGCCATGATTGCTACACCAGAAATGATTCATGACACTGCCTGGTATCTGGATAGTGGGGCGAGCAACCACGTTATTGCTGAGCTTGGAAATTTGTCTATCAAGAGTGATTACACAGGTATGGCACGTATCACTGTTGGAAATAGTTGTCAACTCCCTATACAATCTATTGGTAGTTCAATTATACCTTCCAATACTGGATGCTTGATCTTAAAGGATCTTTTACATGTGCCTCAGATTAGTAAGAATTTGATTAGTATTTCCCTCTTAACCTTGGACAATAATGTTGTAATTGAATTTCATGACTCTTTTTGTGCTGTTAAGGACAAGGTCACGGGGAAGGTTCTTCTGGAAGGAATGCTTAAGGATGACTTGTACCAGATTCCTCCGTTTACAGCCTCTTCTCATGCTTTTGGTCAAAGATCTTCAAGGAATCAATGTATTTCTGTTGCTTTTATGTCTCAGATGACAAGGAAATGTAAGGGTGATGCTCTGTTAGTTATTTCTCCTGGACATATAATGGATTTGAGACTAATTCTAAACCGTGATGACTGGTTTCCGGCCACGTTGACCAACCTTGCCCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCCAACCCAGTTAGACATGTTTAGTCAAACGTGTTTCGGTCCCATTTTGGACATGGACGTAGTTTTTAACGGTCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTTGACCAAATCACCGG
CCTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCTCGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGCTGTCAAGGTGGGAGCTGTTCTGCAATCACGACTGGAGTTAGTTGATTTCGAAAGAACACTTTGGAGCCTGAAGAATGCCCTGAAGGATAAACTACCGACGTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTACGCATTTCAGGCTTACGAGACGATATCGACGTTGAGTCTGCGCGTAGCCACGAGGCTGAGCGACGACGCCATTCCTCGACTCCTTAGTTTTTTGCAGTCCAAGGTTAATGAATACTTGGTTTTGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCATCGGAAGCCCGCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGGATGTTGTACCTGCCCCGGCTGCAGTACGTAACCCGCCTGCAAATTTGGAAAGGGGTGCTAAGGAAATAAGGGTGAAGGGCAAAGGAAAAAACATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACGATGTTGCATTACAGGATTCTGCATTAGACGATGCTGGACCCAGTGGAAATGACAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAAATTCAAAAATAAGATCAGTAGAAGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTATCGAGGCCACACTGATTGGCTTCGAGGCCACACTGACTGGCTTCGGGGTTGCCCTGAAAGGTATCCAGAGATACCTTAAGAAAATGTCGAAGGTCCGAGATAACGCTCATTGGTCCGTACTTTTCGTGCAAGGTAAATTCCGTGATCTGACCAAATATTTTGGACGTGGGGATGGACCCGATGATGATGATCCATCGGATCAAAGGCCTGATGAGGCCCCAACACAAGGTCCGAAGAGTATGGACGAGGACCGGAGGCCGGAAGCGGTCCCTAAGACTGACGAGTATCAGACCATGGACGATAATCCGAAGAGTATGGACGAGGATCCGAAGAATATGGACGAGGATCCGATGTTTATGGTTGAAGACCAGGGTACGATAACGGAGCGGGACAATGCATCGAATGCTTACCTCGATCGTCCTGTCGGTTTGTTTCAGCCGGACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAATCATTAAGGTTGAACCTTACCTTGACCAGGATGAATATGACCTTCAGCAGGCCCCAACTGGGCGTGGGCTACGCAAGGGGCATTACTCTTGGAAACTGAAGGATATATACACACCAACCGGTCAGCGTGGGATCACCGTGGATAGATACGACCCAGTATGTCCCATTCCACCGCAGTTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACGGGAGATTGCGGTCCACTGCAACTGGTTTCCAAAAGAAGGAATGGTATCGCGATCTATTGGACCCTAATGTTGAATTGAAGGACGAAGTACTTGATGGTCTCGTCCTGTTTACAGCAAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGAAGTTTGTGATAGGTGACGTACTTCTTTCGACTCTGCTGAATCGGACAGACGGTCCATATGCGGCCATGAAGCCGGGTGTATTGTCCACTAGGATCAACTACCCCTGGCGCGAGGAGAATACAATCTGGCGATATGTCCACGGTAGGCAGTCGGACCACAACGTGCCCTGGAGTGATGCAGGCATCGTGTACACCCCCATGAACGTAGGCGGGAACCACTGGGTGATGCTCGGGATCGACCTTGTACAGGGCGACATAACCATATGGGATTCACTCCAAACGGCCACTCCACTGGATG
AACTTGAGAAGGAGTTGAAGCCCATGTGTACAATCCTACCTGCACTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAGTGGTGCCGTGGAGGGTACGTCGGGTTCGCGTACCACAGCAGAGTAGCGGCGACTGA

Protein sequence

MAHTTIPSTTRFCAPAITFCSLCTHASAPHEHLHLATLRAPPPAKCQPLNAPRGASFNASSVSFSALLCSPDKAPTPVRVPVFASSMQAPAYAACCLPMQMSNKPHTVPLYALPAFVRVHAYQSLSVGPLRPRPPCRPSAQRTPTMPPPQVRISLVLLAAHSIPFEVRISYFILKACLFDWSSNSWSISDLEHSEYFVYSSTLNKSLVSSLTKNFNLVSKLQWLTPSSIQTPNPTNPKPNPTNVQVHLPKPQGVVQVRSGTTPVEEPQGLLLPNPEYDVWGAADQLLIGWLYNLMTPEVASQVTSYETAQELWTALQDFYGIQATSHKDYIKSMMQQTRKGGMKMGDYLRLMKGYVDSLHLNMKMNGNEVQSQLLTFEMRQAQFQALKNVTSGNHIAMIATPEMIHDTAWYLDSGASNHVIAELGNLSIKSDYTGMARITVGNSCQLPIQSIGSSIIPSNTGCLILKDLLHVPQISKNLISISLLTLDNNVVIEFHDSFCAVKDKVTGKVLLEGMLKDDLYQIPPFTASSHAFGQRSSRNQCISVAFMSQMTRKCKGDALLVISPGHIMDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVLQSRLELVDFERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQAYETISTLSLRVATRLSDDAIPRLLSFLQSKVNEYLVLTNAEAEHMVRIMRPSEARAIPAPPAVPDPPAVPDPDVVPAPAAVRNPPANLERGAKEIRVKGKGKNIIEDPVEEAETLDDVALQDSALDDAGPSGNDSEALQKRSKRKKFKNKISRRLKRLDDRVGAIEATLIGFEATLTGFGVALKGIQRYLKKMSKVRDNAHWSVLFVQGKFRDLTKYFGRGDGPDDDDPSDQRPDEAPTQGPKSMDEDRRPEAVPKTDEYQTMDDNPKSMDEDPKNMDEDPMFMVEDQGTITERDNASNAYLDRPVGLFQPDVASDTRPVSRRVRRPYKDWAPDAIIKVEPYLDQDEYDLQQAPTGRGLRKGHYSWKLKDIYTPTGQRGITVDRYDPVCPIPPQLDDKFQRWMDDPKTDGRLRSTATGFQKKEWYRDLLDPNVELKDEVLDGLVLFTAKKLEKCLHLCRKKFVIGDVLLSTLLNRTDGPYAAMKPGVLSTRINYPWREENTIWRYVHGRQSDHNVPWSDAGIVYTPMNVGGNHWVMLGIDLVQGDITIWDSLQTATPLDELEKELKPMCTILPALLHHGGIFSVRPDLPVVPWRVRRVRVPQQSSGD
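
The protein sequence is the translation of the CDS above. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first ten codons of the CDS can be translated and compared with the start of the protein. The function name `translate` is hypothetical and not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases of the CDS listed above.
cds_start = "ATGGCGCACACAACTATACCATCAACAACG"
print(translate(cds_start))
```

The result, MAHTTIPSTT, agrees with the first ten residues of the protein sequence.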
Homology
BLAST of Moc05g14790 vs. NCBI nr
Match: XP_022153201.1 (uncharacterized protein LOC111020757 [Momordica charantia])

HSP 1 Score: 511.1 bits (1315), Expect = 2.9e-140
Identity = 306/527 (58.06%), Positives = 350/527 (66.41%), Query Frame = 0

Query: 569  MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
            MDLRLI++R+DWFPATLTNLAH+DKT++R+K RLTPTQLDMF QTCFGPILD+DVVFNGP
Sbjct: 1    MDLRLIIDRNDWFPATLTNLAHIDKTSTRIKARLTPTQLDMFRQTCFGPILDIDVVFNGP 60

Query: 629  LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
            LIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFD ITGLSHRM RVDN IPGRRLRARY
Sbjct: 61   LIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGLSHRMNRVDNHIPGRRLRARY 120

Query: 689  FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL--------QSRLELVD---------- 748
            FKD VRVKCSELEKIF+E VF DDED VKV  V         + R + +D          
Sbjct: 121  FKDGVRVKCSELEKIFLEDVFYDDEDVVKVRIVYFIELAMMGKERKQFIDTALLGVVDRW 180

Query: 749  ------------FERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQ--AYE 808
                        F+RT+WSLKNALKDKL  YQQKA  DP+H ETYSLYGFPYAFQ  AYE
Sbjct: 181  EVFCNYDWSSMIFDRTIWSLKNALKDKLSVYQQKATADPSHVETYSLYGFPYAFQVWAYE 240

Query: 809  TISTLSLRVATRLSDDAIPRLLSF------------------LQSKVNEYLVLTNAEAEH 868
            TIST        LSDDAIPRLL +                   +SKV E+L+ T+A+ +H
Sbjct: 241  TIST--------LSDDAIPRLLRWSCIYSCGFRVLTSEVFDNTRSKVKEHLLATDAKEQH 300

Query: 869  MVRIMRPSEARAIPAPPAVPDPPAVPDPDVVPAPAAVRNPPANLERGAKEIRVKGKGKNI 928
            MVR++ P E R IP PPAVPD   VPDP   P  AAV +PPA++E G             
Sbjct: 301  MVRVILPPEVRVIPDPPAVPDRAVVPDPPASPERAAVPDPPADVEMGP------------ 360

Query: 929  IEDPVEEAETLDDVALQDSALDDAGPSGNDSEALQKRSKRKKFKNKISRRLKRLDDRVGA 988
            +EDPV +A           A+D+A PS ND E L+KR K+ KFK +ISRRLKRLD+ VGA
Sbjct: 361  LEDPVVDAH----------AVDEARPSANDGEGLEKRLKKNKFKKRISRRLKRLDNCVGA 420

Query: 989  IEATLIGFEATLTGFGVALKGIQRYLKKMSKVRDNAHWSVLFVQGKFRDLTKYFGRGDGP 1044
            I       E  L  FGVALKGIQ YLKK++K             GKF D +KYFG G GP
Sbjct: 421  I-------EDKLGDFGVALKGIQIYLKKLAK-------------GKFPDSSKYFGGGGGP 477
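
The percentages on each HSP summary line follow directly from the match counts: matched positions divided by alignment length. The sketch below recomputes them from a summary line. The helper `parse_hsp_stats` is hypothetical, and it matches any `name = hits/total` pair rather than relying on exact field spelling.

```python
import re

def parse_hsp_stats(line):
    """Return {field: (hits, total, percent)} for each 'name = a/b' pair."""
    stats = {}
    for name, hits, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[name] = (hits, total, round(100 * hits / total, 2))
    return stats

line = "Identity = 306/527 (58.06%), Positives = 350/527 (66.41%), Query Frame = 0"
for field, (hits, total, pct) in parse_hsp_stats(line).items():
    print(f"{field}: {hits}/{total} = {pct}%")
```

For the first hit this reproduces 58.06% identity and 66.41% positives over an alignment length of 527. "Query Frame = 0" is skipped because it has no `hits/total` pair.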

BLAST of Moc05g14790 vs. NCBI nr
Match: XP_022155476.1 (uncharacterized protein LOC111022607 [Momordica charantia])

HSP 1 Score: 413.3 bits (1061), Expect = 8.2e-111
Identity = 201/292 (68.84%), Positives = 218/292 (74.66%), Query Frame = 0

Query: 1041 PKSMDEDPKNMDEDPMFMVEDQGTITERDNASNAYL------------------------ 1100
            P S+DEDPK  D DPM M ED G IT+ D   N  +                        
Sbjct: 6    PNSVDEDPKRRDNDPMIMEEDDGMITDGDEDPNQDITIGRRPDGSEVDHTDDHVPQVAVI 65

Query: 1101 -DRPVGLFQPDVASDTRPVSRRVRRPYKDWAPDAIIKVEPYLDQDEYDLQQAPTGRGLRK 1160
             D  VG  +PD   DT+P  RRVRRPYKDWAPDAI+KVEPYLDQDE DLQ APTGRGLRK
Sbjct: 66   QDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIVKVEPYLDQDETDLQHAPTGRGLRK 125

Query: 1161 GHYSWKLKDIYTPTGQRGITVDRYDPVCPIPPQLDDKFQRWMDDPKTDGRLRSTATGFQK 1220
             HYSWKLK IYTPTG+R ITVD YDP CPIPPQLD +FQ WMDD   DGR RSTA G Q 
Sbjct: 126  HHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLDGQFQTWMDDLDIDGRTRSTAAGLQG 185

Query: 1221 KEWYRDLLDPNVELKDEVLDGLVLFTAKKLEKCLHLCRKKFVIGDVLLSTLLNRTDGPYA 1280
            KEWYRDLLDP V+LKDEV+D LVLFTAKKLEKC++LCRKKF IGDVLLSTLLNRTDGPYA
Sbjct: 186  KEWYRDLLDPTVQLKDEVVDALVLFTAKKLEKCIYLCRKKFAIGDVLLSTLLNRTDGPYA 245

Query: 1281 AMKPGVLSTRINYPWREENTIWRYVHGRQSDHNVPWSDAGIVYTPMNVGGNH 1308
            AMKPGVLSTRI YP  +ENTI+RYV GRQSD NV W+DA IVYTP+N+GGNH
Sbjct: 246  AMKPGVLSTRIEYPCSQENTIFRYVFGRQSDQNVAWTDADIVYTPINIGGNH 297

BLAST of Moc05g14790 vs. NCBI nr
Match: XP_022146372.1 (uncharacterized protein LOC111015600 [Momordica charantia])

HSP 1 Score: 365.5 bits (937), Expect = 2.0e-96
Identity = 199/314 (63.38%), Positives = 224/314 (71.34%), Query Frame = 0

Query: 569 MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
           MDLRLIL+R+DWFPATLTNLAHVDKTT+R+K RLTPTQLDMF QTCFGPILDM VVFNGP
Sbjct: 1   MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLDMFRQTCFGPILDMGVVFNGP 60

Query: 629 LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
           LIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFD ITGLSH+M RV+N IPGRRLRARY
Sbjct: 61  LIHHLLLIEVEEPRQDVISFDLFRKRVSFGKREFDLITGLSHKMNRVNNHIPGRRLRARY 120

Query: 689 FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL--------QSRLELVD---------- 748
           FKDSVRVKCSELEKIF+E +F DDED VKVG V         + R + +D          
Sbjct: 121 FKDSVRVKCSELEKIFLEDIFYDDEDVVKVGIVYFIELAMMGKERKQFIDTVGVVDRWEA 180

Query: 749 ----------FERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQAYETIST 808
                     F+RT+WSLKN LKDKL  YQQKA  DPTH ETYSLYGFPY       +  
Sbjct: 181 FCNSDWSSMIFDRTIWSLKNTLKDKLSAYQQKATADPTHVETYSLYGFPYGRMRRSRV-- 240

Query: 809 LSLRVATRLSDDAIPRLLSFLQSKVNEYLVLTNAEAEHMVRIMRPSEARAIPAPPAVPDP 855
               +A+ + D+          SKV E+L+ T+AE +HMVR++ P E R IP PPAVPD 
Sbjct: 241 ----LASEVFDNT--------WSKVKEHLLATDAEEQHMVRVILPPEVRVIPDPPAVPDR 300

BLAST of Moc05g14790 vs. NCBI nr
Match: XP_022157020.1 (uncharacterized protein LOC111023847 [Momordica charantia])

HSP 1 Score: 320.9 bits (821), Expect = 5.5e-83
Identity = 175/291 (60.14%), Positives = 205/291 (70.45%), Query Frame = 0

Query: 569 MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
           M++ L +N+DDWFPA L+NLAHV KT+SRLK RLTP+QLDMFSQTCFGPIL M+VVFNGP
Sbjct: 1   MNMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGP 60

Query: 629 LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
           L+HHLLLREVEEP+ D+ISF+LFG RVSFGKREFD ITGL H M RVD D+  RRLR  Y
Sbjct: 61  LLHHLLLREVEEPKDDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLRILY 120

Query: 689 FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL----------------QSRLELVD-- 748
           F+D   VKCSELEKIF+E  F++DEDAVK+  V                  S L +VD  
Sbjct: 121 FQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMGKERKLKMDTSLLGIVDRW 180

Query: 749 ------------FERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQ--AYE 808
                       FERTLWSLKNALKDK+  Y+QK   D +H ETYSLY FPYAFQ  AYE
Sbjct: 181 EVFCNYDWSSMIFERTLWSLKNALKDKVEXYKQKVAMDSSHVETYSLYXFPYAFQVWAYE 240

Query: 809 TISTLSLRVATRLSDDAIPRLLSFLQS-----KVNEYLVLTNAEAEHMVRI 823
           TISTLS RVA RL+DDAIPRLL +  +      V E  V  N +++ +VR+
Sbjct: 241 TISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENVKSKVVVRL 291

BLAST of Moc05g14790 vs. NCBI nr
Match: XP_022155158.1 (uncharacterized protein LOC111022300 [Momordica charantia])

HSP 1 Score: 316.2 bits (809), Expect = 1.4e-81
Identity = 163/212 (76.89%), Positives = 169/212 (79.72%), Query Frame = 0

Query: 569 MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
           MDLRLILNRDDWFP TLTNLAH DKTTSRLKGRLTPTQ+DMF QTCFGPILDMDVVFNGP
Sbjct: 1   MDLRLILNRDDWFPPTLTNLAHADKTTSRLKGRLTPTQIDMFRQTCFGPILDMDVVFNGP 60

Query: 629 LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
           LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFD ITGLS+RMIRVDNDIPGRRLRARY
Sbjct: 61  LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITGLSYRMIRVDNDIPGRRLRARY 120

Query: 689 FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL--------QSRLELVD---------- 748
           FKDSVRVKCSELEKIFME VF DDEDAVKVG V         + R + +D          
Sbjct: 121 FKDSVRVKCSELEKIFMETVFYDDEDAVKVGIVYFVELAMMGKERKQFIDATLLGVVDRW 180

Query: 749 ------------FERTLWSLKNALKDKLPTYQ 751
                       FERTLWSLKNA+ DKLP YQ
Sbjct: 181 ELFCNHDWSSLIFERTLWSLKNAVNDKLPAYQ 212

BLAST of Moc05g14790 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 3.8e-18
Identity = 60/166 (36.14%), Positives = 87/166 (52.41%), Query Frame = 0

Query: 373 QLLTFEMRQAQFQALKNVTSGNHIAMIATPEMIHDTAWYLDSGASNHVIAELGNLSIKSD 432
           QL  F+    Q Q+    T     A +A     +   W LDSGA++H+ ++  NLS    
Sbjct: 273 QLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQP 332

Query: 433 YTGMARITVGNSCQLPIQSIGSSIIPSNTGCLILKDLLHVPQISKNLISISLLTLDNNVV 492
           YTG   + + +   +PI   GS+ +P+++  L L  +L+VP I KNLIS+  L   N V 
Sbjct: 333 YTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVS 392

Query: 493 IEFHDSFCAVKDKVTGKVLLEGMLKDDLYQIPPFTASSHAFGQRSS 539
           +EF  +   VKD  TG  LL+G  KD+LY+ P   ASS A    +S
Sbjct: 393 VEFFPASFQVKDLNTGVPLLQGKTKDELYEWP--IASSQAVSMFAS 436

BLAST of Moc05g14790 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 89.0 bits (219), Expect = 4.7e-16
Identity = 47/115 (40.87%), Positives = 69/115 (60.00%), Query Frame = 0

Query: 410 WYLDSGASNHVIAELGNLSIKSDYTGMARITVGNSCQLPIQSIGSSIIPSNTGCLILKDL 469
           W LDSGA++H+ ++  NLS+   YTG   + V +   +PI   GS+ + + +  L L ++
Sbjct: 331 WLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNI 390

Query: 470 LHVPQISKNLISISLLTLDNNVVIEFHDSFCAVKDKVTGKVLLEGMLKDDLYQIP 525
           L+VP I KNLIS+  L   N V +EF  +   VKD  TG  LL+G  KD+LY+ P
Sbjct: 391 LYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWP 445

BLAST of Moc05g14790 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 48.9 bits (115), Expect = 5.3e-04
Identity = 36/121 (29.75%), Positives = 60/121 (49.59%), Query Frame = 0

Query: 407 DTAWYLDSGASNHV--IAELGNLSIKSDYTGMARITVGNSCQLPIQSIGSSIIPSNTGC- 466
           ++ W +D+ AS+H   + +L    +  D+     + +GN+    I  IG   I +N GC 
Sbjct: 291 ESEWVVDTAASHHATPVRDLFCRYVAGDF---GTVKMGNTSYSKIAGIGDICIKTNVGCT 350

Query: 467 LILKDLLHVPQISKNLISISLLTLDNNVVIEFHDSFCAVKDKVT--GKVLLEGMLKDDLY 523
           L+LKD+ HVP +  NLIS   L  D      +   F   K ++T    V+ +G+ +  LY
Sbjct: 351 LVLKDVRHVPDLRMNLISGIALDRDG-----YESYFANQKWRLTKGSLVIAKGVARGTLY 403

BLAST of Moc05g14790 vs. ExPASy TrEMBL
Match: A0A6J1DJX9 (uncharacterized protein LOC111020757 OS=Momordica charantia OX=3673 GN=LOC111020757 PE=4 SV=1)

HSP 1 Score: 511.1 bits (1315), Expect = 1.4e-140
Identity = 306/527 (58.06%), Positives = 350/527 (66.41%), Query Frame = 0

Query: 569  MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
            MDLRLI++R+DWFPATLTNLAH+DKT++R+K RLTPTQLDMF QTCFGPILD+DVVFNGP
Sbjct: 1    MDLRLIIDRNDWFPATLTNLAHIDKTSTRIKARLTPTQLDMFRQTCFGPILDIDVVFNGP 60

Query: 629  LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
            LIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFD ITGLSHRM RVDN IPGRRLRARY
Sbjct: 61   LIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGLSHRMNRVDNHIPGRRLRARY 120

Query: 689  FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL--------QSRLELVD---------- 748
            FKD VRVKCSELEKIF+E VF DDED VKV  V         + R + +D          
Sbjct: 121  FKDGVRVKCSELEKIFLEDVFYDDEDVVKVRIVYFIELAMMGKERKQFIDTALLGVVDRW 180

Query: 749  ------------FERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQ--AYE 808
                        F+RT+WSLKNALKDKL  YQQKA  DP+H ETYSLYGFPYAFQ  AYE
Sbjct: 181  EVFCNYDWSSMIFDRTIWSLKNALKDKLSVYQQKATADPSHVETYSLYGFPYAFQVWAYE 240

Query: 809  TISTLSLRVATRLSDDAIPRLLSF------------------LQSKVNEYLVLTNAEAEH 868
            TIST        LSDDAIPRLL +                   +SKV E+L+ T+A+ +H
Sbjct: 241  TIST--------LSDDAIPRLLRWSCIYSCGFRVLTSEVFDNTRSKVKEHLLATDAKEQH 300

Query: 869  MVRIMRPSEARAIPAPPAVPDPPAVPDPDVVPAPAAVRNPPANLERGAKEIRVKGKGKNI 928
            MVR++ P E R IP PPAVPD   VPDP   P  AAV +PPA++E G             
Sbjct: 301  MVRVILPPEVRVIPDPPAVPDRAVVPDPPASPERAAVPDPPADVEMGP------------ 360

Query: 929  IEDPVEEAETLDDVALQDSALDDAGPSGNDSEALQKRSKRKKFKNKISRRLKRLDDRVGA 988
            +EDPV +A           A+D+A PS ND E L+KR K+ KFK +ISRRLKRLD+ VGA
Sbjct: 361  LEDPVVDAH----------AVDEARPSANDGEGLEKRLKKNKFKKRISRRLKRLDNCVGA 420

Query: 989  IEATLIGFEATLTGFGVALKGIQRYLKKMSKVRDNAHWSVLFVQGKFRDLTKYFGRGDGP 1044
            I       E  L  FGVALKGIQ YLKK++K             GKF D +KYFG G GP
Sbjct: 421  I-------EDKLGDFGVALKGIQIYLKKLAK-------------GKFPDSSKYFGGGGGP 477

BLAST of Moc05g14790 vs. ExPASy TrEMBL
Match: A0A6J1DRS0 (uncharacterized protein LOC111022607 OS=Momordica charantia OX=3673 GN=LOC111022607 PE=4 SV=1)

HSP 1 Score: 413.3 bits (1061), Expect = 4.0e-111
Identity = 201/292 (68.84%), Positives = 218/292 (74.66%), Query Frame = 0

Query: 1041 PKSMDEDPKNMDEDPMFMVEDQGTITERDNASNAYL------------------------ 1100
            P S+DEDPK  D DPM M ED G IT+ D   N  +                        
Sbjct: 6    PNSVDEDPKRRDNDPMIMEEDDGMITDGDEDPNQDITIGRRPDGSEVDHTDDHVPQVAVI 65

Query: 1101 -DRPVGLFQPDVASDTRPVSRRVRRPYKDWAPDAIIKVEPYLDQDEYDLQQAPTGRGLRK 1160
             D  VG  +PD   DT+P  RRVRRPYKDWAPDAI+KVEPYLDQDE DLQ APTGRGLRK
Sbjct: 66   QDLTVGRQEPDAQPDTQPTRRRVRRPYKDWAPDAIVKVEPYLDQDETDLQHAPTGRGLRK 125

Query: 1161 GHYSWKLKDIYTPTGQRGITVDRYDPVCPIPPQLDDKFQRWMDDPKTDGRLRSTATGFQK 1220
             HYSWKLK IYTPTG+R ITVD YDP CPIPPQLD +FQ WMDD   DGR RSTA G Q 
Sbjct: 126  HHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLDGQFQTWMDDLDIDGRTRSTAAGLQG 185

Query: 1221 KEWYRDLLDPNVELKDEVLDGLVLFTAKKLEKCLHLCRKKFVIGDVLLSTLLNRTDGPYA 1280
            KEWYRDLLDP V+LKDEV+D LVLFTAKKLEKC++LCRKKF IGDVLLSTLLNRTDGPYA
Sbjct: 186  KEWYRDLLDPTVQLKDEVVDALVLFTAKKLEKCIYLCRKKFAIGDVLLSTLLNRTDGPYA 245

Query: 1281 AMKPGVLSTRINYPWREENTIWRYVHGRQSDHNVPWSDAGIVYTPMNVGGNH 1308
            AMKPGVLSTRI YP  +ENTI+RYV GRQSD NV W+DA IVYTP+N+GGNH
Sbjct: 246  AMKPGVLSTRIEYPCSQENTIFRYVFGRQSDQNVAWTDADIVYTPINIGGNH 297

BLAST of Moc05g14790 vs. ExPASy TrEMBL
Match: A0A6J1CZE8 (uncharacterized protein LOC111015600 OS=Momordica charantia OX=3673 GN=LOC111015600 PE=4 SV=1)

HSP 1 Score: 365.5 bits (937), Expect = 9.5e-97
Identity = 199/314 (63.38%), Positives = 224/314 (71.34%), Query Frame = 0

Query: 569 MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
           MDLRLIL+R+DWFPATLTNLAHVDKTT+R+K RLTPTQLDMF QTCFGPILDM VVFNGP
Sbjct: 1   MDLRLILDRNDWFPATLTNLAHVDKTTTRIKARLTPTQLDMFRQTCFGPILDMGVVFNGP 60

Query: 629 LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
           LIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFD ITGLSH+M RV+N IPGRRLRARY
Sbjct: 61  LIHHLLLIEVEEPRQDVISFDLFRKRVSFGKREFDLITGLSHKMNRVNNHIPGRRLRARY 120

Query: 689 FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL--------QSRLELVD---------- 748
           FKDSVRVKCSELEKIF+E +F DDED VKVG V         + R + +D          
Sbjct: 121 FKDSVRVKCSELEKIFLEDIFYDDEDVVKVGIVYFIELAMMGKERKQFIDTVGVVDRWEA 180

Query: 749 ----------FERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQAYETIST 808
                     F+RT+WSLKN LKDKL  YQQKA  DPTH ETYSLYGFPY       +  
Sbjct: 181 FCNSDWSSMIFDRTIWSLKNTLKDKLSAYQQKATADPTHVETYSLYGFPYGRMRRSRV-- 240

Query: 809 LSLRVATRLSDDAIPRLLSFLQSKVNEYLVLTNAEAEHMVRIMRPSEARAIPAPPAVPDP 855
               +A+ + D+          SKV E+L+ T+AE +HMVR++ P E R IP PPAVPD 
Sbjct: 241 ----LASEVFDNT--------WSKVKEHLLATDAEEQHMVRVILPPEVRVIPDPPAVPDR 300

BLAST of Moc05g14790 vs. ExPASy TrEMBL
Match: A0A6J1DRZ7 (uncharacterized protein LOC111023847 OS=Momordica charantia OX=3673 GN=LOC111023847 PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 2.7e-83
Identity = 175/291 (60.14%), Positives = 205/291 (70.45%), Query Frame = 0

Query: 569 MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
           M++ L +N+DDWFPA L+NLAHV KT+SRLK RLTP+QLDMFSQTCFGPIL M+VVFNGP
Sbjct: 1   MNMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGP 60

Query: 629 LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
           L+HHLLLREVEEP+ D+ISF+LFG RVSFGKREFD ITGL H M RVD D+  RRLR  Y
Sbjct: 61  LLHHLLLREVEEPKDDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLRILY 120

Query: 689 FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL----------------QSRLELVD-- 748
           F+D   VKCSELEKIF+E  F++DEDAVK+  V                  S L +VD  
Sbjct: 121 FQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMGKERKLKMDTSLLGIVDRW 180

Query: 749 ------------FERTLWSLKNALKDKLPTYQQKARNDPTHQETYSLYGFPYAFQ--AYE 808
                       FERTLWSLKNALKDK+  Y+QK   D +H ETYSLY FPYAFQ  AYE
Sbjct: 181 EVFCNYDWSSMIFERTLWSLKNALKDKVEXYKQKVAMDSSHVETYSLYXFPYAFQVWAYE 240

Query: 809 TISTLSLRVATRLSDDAIPRLLSFLQS-----KVNEYLVLTNAEAEHMVRI 823
           TISTLS RVA RL+DDAIPRLL +  +      V E  V  N +++ +VR+
Sbjct: 241 TISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENVKSKVVVRL 291

BLAST of Moc05g14790 vs. ExPASy TrEMBL
Match: A0A6J1DM82 (uncharacterized protein LOC111022300 OS=Momordica charantia OX=3673 GN=LOC111022300 PE=4 SV=1)

HSP 1 Score: 316.2 bits (809), Expect = 6.6e-82
Identity = 163/212 (76.89%), Positives = 169/212 (79.72%), Query Frame = 0

Query: 569 MDLRLILNRDDWFPATLTNLAHVDKTTSRLKGRLTPTQLDMFSQTCFGPILDMDVVFNGP 628
           MDLRLILNRDDWFP TLTNLAH DKTTSRLKGRLTPTQ+DMF QTCFGPILDMDVVFNGP
Sbjct: 1   MDLRLILNRDDWFPPTLTNLAHADKTTSRLKGRLTPTQIDMFRQTCFGPILDMDVVFNGP 60

Query: 629 LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDQITGLSHRMIRVDNDIPGRRLRARY 688
           LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFD ITGLS+RMIRVDNDIPGRRLRARY
Sbjct: 61  LIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITGLSYRMIRVDNDIPGRRLRARY 120

Query: 689 FKDSVRVKCSELEKIFMEAVFDDDEDAVKVGAVL--------QSRLELVD---------- 748
           FKDSVRVKCSELEKIFME VF DDEDAVKVG V         + R + +D          
Sbjct: 121 FKDSVRVKCSELEKIFMETVFYDDEDAVKVGIVYFVELAMMGKERKQFIDATLLGVVDRW 180

Query: 749 ------------FERTLWSLKNALKDKLPTYQ 751
                       FERTLWSLKNA+ DKLP YQ
Sbjct: 181 ELFCNHDWSSLIFERTLWSLKNAVNDKLPAYQ 212

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022153201.1 | 2.9e-140 | 58.06 | uncharacterized protein LOC111020757 [Momordica charantia]
XP_022155476.1 | 8.2e-111 | 68.84 | uncharacterized protein LOC111022607 [Momordica charantia]
XP_022146372.1 | 2.0e-96 | 63.38 | uncharacterized protein LOC111015600 [Momordica charantia]
XP_022157020.1 | 5.5e-83 | 60.14 | uncharacterized protein LOC111023847 [Momordica charantia]
XP_022155158.1 | 1.4e-81 | 76.89 | uncharacterized protein LOC111022300 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
Q9ZT94 | 3.8e-18 | 36.14 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 4.7e-16 | 40.87 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P10978 | 5.3e-04 | 29.75 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Match Name | E-value | Identity (%) | Description
A0A6J1DJX9 | 1.4e-140 | 58.06 | uncharacterized protein LOC111020757 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1DRS0 | 4.0e-111 | 68.84 | uncharacterized protein LOC111022607 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1CZE8 | 9.5e-97 | 63.38 | uncharacterized protein LOC111015600 OS=Momordica charantia OX=3673 GN=LOC111015...
A0A6J1DRZ7 | 2.7e-83 | 60.14 | uncharacterized protein LOC111023847 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DM82 | 6.6e-82 | 76.89 | uncharacterized protein LOC111022300 OS=Momordica charantia OX=3673 GN=LOC111022...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003653 | Ulp1 protease family, C-terminal catalytic domain | PFAM | PF02902 | Peptidase_C48 | coord: 1279..1379; e-value: 1.9E-12; score: 47.5
IPR003653 | Ulp1 protease family, C-terminal catalytic domain | PROSITE | PS50600 | ULP_PROTEASE | coord: 1154..1381; score: 9.156683
None | No IPR available | GENE3D | 3.40.395.10 | Adenoviral Proteinase; Chain A | coord: 1153..1381; e-value: 8.8E-11; score: 43.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 991..1055
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1014..1055
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 897..920
None | No IPR available | PANTHER | PTHR47481 | FAMILY NOT NAMED | coord: 406..496
None | No IPR available | PANTHER | PTHR47481:SF3 | GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED | coord: 272..359; coord: 406..496
None | No IPR available | PANTHER | PTHR47481 | FAMILY NOT NAMED | coord: 272..359
IPR038765 | Papain-like cysteine peptidase superfamily | SUPERFAMILY | 54001 | Cysteine proteinases | coord: 1201..1380

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc05g14790.1 | Moc05g14790.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0008234 | cysteine-type peptidase activity