MC04g1792 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC04g1792
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionthylakoid membrane protein slr0575
LocationMC04: 25241771 .. 25252482 (+)
RNA-Seq ExpressionMC04g1792
SyntenyMC04g1792
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: utr5CDSpolypeptideutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGGGATAATATGAGGACACGTAGTGTAAATTGGCGGGTTGAGCAACGAATCCTATTTTCGTTTTTCTTTCTCTAGCAACCGAAGCTTCACATTCATCTGCGTTCTTCAGAGTTTCAGAGAGAAGAATGGGTGGAATTTCACCAGCCGCTGGTCTCTGCTCCCGTGATCAATTCAGTCTCGGCAATCGCCTCTTCGTTGCTTCTCTGCATTCCTGTCGCGCAATCTCCCGCCCCGCCGGTGGTAATTGCAACGGCGGAGCAAAGTTTTTGGTCCATTTGGAACCGAGAACGCTTGTGCAGAAGCGTCAGCGTTTGGATTTCACATTCTCCGCTAGAGCTGCTGATTCAACTCAGCCGTCTTCGGTTTCAGCCTCGCCAGACAGAGCAGTAGTCACTGATGATGAATTCTCGCTCGCGAAAGTACGTTTCTAACTTTCTATGGTTCTGAATTCTTGAAGAGTATATGAATCGGACGCCCCTAGGTAGCATTTGGATTGGAATTCCTGAGTTCTGTGTTGAGAATGGTTGTGATGAAATGAGAGGAGATGTTTGAAAGTCAAGTGTAGTGTTTAAATTTTACGGGACCTCGTGTTGATTTTCCCTGCCTTTAGCATTTTTCACTAGCTTATTTCGTTTATTATATGGAGCTCTTGGTTGTTCTAGCGAGAATTGTTCATGCGCTTGGAGGAGGGGAAAGAGAAGGAAAGATACTTCTCTCCAATTTTTGTCGATAAGATATACATATATATAGTGTTGTAGAGATTTGGATTTAAAATTTCATTATCGATACTATATTAAATTATTGTTATGTTCAAATTTAATCGATTAAACTAGGACAAAATAAACATTTAGTATGTTCTCAACATAGAAAAATAAATTGTCCCAAATATTTGGAATAATCTCGTAGACATTAGATACTTTGATATTTTAATGCAAGAAAATATTTATGGGAGCTGGAGGATGGTTGGTACCCTGAATTGCAATGGTGAAGCTGTATCCAAGATAGTCGTGGCAGTATATTGATTTTAATTTTTGCCTTTAATAGCAATATACATTTATTTTCTGTTCACTAGTGGTGTTGTTAACCTCCAACATTCTTTGACATGAGTTAAGTAGCTTCCCTCTATGCATCCCTTTTCTCTTTTATGCATCGTGATATAATTTTTGTATGTCTGTTTGATCTGTTATTTTGGTATTTATGTTTCTTTTCAGGTTTCATTTGGTGTTATTGGCTTAGGTGTTGGGATTTCTTTGTTGTCGTAAGTTGTTTCCTCCTCAGGACTCAAAAATACCTTAATTCATAACTTGTACATGTGATTTATTTTCATTCTACAGGTATGGATTTGGTGCATACTTTAATATCCTCCCAGGATCTGAATGGTCTGCCATTATGTTAACCTATGGTTTCCCTCTTGCTATTATTGGCATGGCTCTCAAGGTATAACTTTTTTTTCCCTCCTGCAGTGTGATTAGGCATTTGTGTGATAAATATATGCTTTTAGTGGCTATTGTAGCCACGTATATATCCAGGTCCATTTCAGCTTTTCTATATCCTATATGAGTCATAACTCAATAGCTGATCCAAGTTCAAGATATTACCAAACAATTACTGGTAGGTGTTTTAAAAAAAATTGTCATGTTGGTGTTCATCCACTTTCTATGTTTCATTCCTTATTTACTTTCTGAAATGGTCTTGCAGTATTGCTTCATTTTTAACTTCATGAACTTGAGCTGAAAAGGAAAATAATTGGTAGAAAAAGATCAGGTAACTCATTCTTGCTTTTGAGTATTTTGCAGTATGCCGAACTCAAACCAGTGCCATGCTTGACATATTTAGATGCTCAAAAGCTGAGGGAAACATGTGCCACCCCTATTCTCAAACAGGTTCTTTCATGGAATTTGATTATTTCTAATAGCCATTTTAGTAAAATATTTTGTAATTATATTCTCTTGTGAGTGCTTGTTTTGATAGACCCCTAGCTTGTACATTTATACTTGTACGGGGAGAAGAGGGACAAAAGTGTAATTAGAATGTCTAAGAGAGAAATTCGGGCCCATAATTGTAGGAAGCTATTATACTTAGTTAGGAGGGGCAGAAGAGAGAAATGCTGAGGAATTTAAGGTATAGGAAGAAGGGAAGAGGGCAGTCTGGTTTTTTGCTGATTGTGGAGGGTTTAAGGCCTCCCGAACTCCCTCTATTTCTGTTTCTTTTTGTTTGTCACTTAGATTTCCATTTGAGACTTGTCATGTTTTATTTACTGTTTCCATGATATCTTGTTTATTATTTTGTTTAGGGTTTCTTCATCTAGTCAATTCGGTTTTTAAAGATGAAATTTCATCTGGTAAATAGTGTAATACAACTTGGGGAAGATGGAATTTATCGTTTTGTGTGAATATTATCTTTCCCGATTATTATTGAAAGTTGTCAATTTATTCACTAAGAACAAAAAGCTTGCCTTTTGTTTCACTTTGGCTGTTACTTGAGTAATATGTCACTTTCGATTTTAACTACCATTGTTTCCAATACATAGTCACTTTGTACATACACAAGTTTTGTTGTATAAATTCATATTCGAGCTTTTCTTTGGATTTTTTGGAATGAAAGGAATCATCGACTATTCCTAGATAAAGATCTGCCTTTCATCGCCCTTTTTGATAATATTACTTTTCATGCTATGTCTTGGTGTAGATCAACACACTATTTTCATTCTTATAGTTCGAGCTCTTTATTATCCAATTGGAGAAGTTTTATGTAACTCCTTTGGTTTTTGGTGGGTTCGCCCCTCTCTTTATAATTTCATCCATGAATGAAATTGTTTCTTATCAAAAAAAAAAAAAAAAAAATTAACCACCTCTTAATATTTTCCACTAGGTAAGGGATGATGTGATAAGGTACCGTTATGGGGATGAACAACATTTGGATGAGGCATTGAAACGGATTTTCCAGTATGGTCTGGTAAGTAAACTGACATAACTCTGAATATTCTTTCTTGTGATCTTTGATCACGTTTTTTGAAAACTACGTGGTAGTAATTCCATAATGCTTTTGGTGACCTTATCTCATTAGCAACATATTGATTCAAAAGACTCCTGCATTTGTGTTCGGTGTAGAAAGTTGGCTCCACTTTTGCTTAGCTGTAGCATTTGTTCTGAAACTAGAACTTCAATACACCACTAGACGCTGACTAATTTCGCTGGCTGGTCTATGCTTGTCCCATCTGAATTATGGATTCAGTAAAGATTCCAATAAATAGGTCCACGAACATGCATTTTGATGTAGTTTTGAAACTGATGATAAATAAATATTGAGATACCCTCTTACCAACCATTTGCACTCTTTTTGAAGGATGCAGCCTAGCATCCAACACTCGATGATTATTTCAATATTTATTATTATGAAAGTAGTATTTACAAAGGCATTGAAGTTGCACCCATTATACAGGGGCTACCTTTTAACTAAATTGCTAACCCCACGACATTGTAACTACAACAGCGAATTTACAAACTAATGGACCAAAACAGTAAGCTAATAAACAAATCAGCAAATTGATCAACAATGAAGAAGAGTGACTGATTGGTAAATGTGACAAACAGCAAAAGACAAAGTAATAGGTGAAACTAGCTTCTATCATGCATTATGAGGTTTTTACTTGTTAGTTTTGGATGTCTGTCACTTAATACACACTGCAAGACTCCAATCTTTTGTTGGACGCTTAAATAGACTGTTAGAGGTTTTATAATTAGTTTTGGGTGCTTGACATAACCAAATATATTACAGAGATCTATGATGTTTGGACAACATATTATATTTCTGGTTTTGAAAATATAATTTTATAGACATATTTGCTAATGGATTTTATATACTTACGAGGCATTGATCTATTTAGAAGTACATTTCTCTGATTATACATTTTCTTTACAAGCTTTGGTATGAGATGTTGCATCGTCATTATGTATTTGGTTTCTTCAACTTCTCTTAGTTATCATGTACTTTGTCTCTGCTTTGTGTTGTTTTTAAAATGTAAACATCTATTTTAATTGTAAACTTTTCAGCATGGGAAGGATTAAAAATTGAAATCCATTAAAATATACAAATAGATTTTGGTATCCAAGTCTCAACTTTTTGAAGATTTACATGTTGGGCACTTGATGAACTTGTTCTCAATATATACATGTCATGTCCATCTCCTCTGCTTGTACCCTAACTTTCTAAACAGTTAGCAGTTTGATATTTAATAAATTCATAAGACTTTAAGGTATGGGCACTTGATGAACTTGTTCTTTTGGTACTTTGGCTGGATCAATTTCTTCAACGGTTATCCATAAGTGAACAAAATGTATTGTTCTTAGATTTTATTTTAGGTTTTCAATGTACATTGCACTTAATGGCCTGTACGGATAAGGCTTCTAAACTTATAAGTTCTCTTCATGTCATCATGTTGTTACTTGTTACTTGTTATTATTATTTATTTATTTTTTCATTTGGTTATCTTGAACTTGAAAAACTTGAAGAAGATCAGATACAAAAATCCAGATAGAAGTTTCTTTCACAATCTAATGTAAAATTGCATAAGTTGTATTTGAATCTTCATTTTTATGCTTTTACATCAATTTTTGTACACTGTTCTTTATTTCATTCTACTTTTGTATACACAGGCTGGGGGAATTCCTAGAAGGAGCGCTCCTATTTTGCAAAGTATTCGTGAAGAAGTATGTTTATATGACTCTGTATTTATTTCATTGTACCCAACTGGACGTACTATGATTCTTCTCTACCAAATACAATAGTATAATATCAGCAATATCGATTGTAATGTTATGCCATTTGCTACAATTCTAGGTCACAGAAGATGGAAAGTATTGCCTGACCTTGGTGTTCGAAGCAAAAGCCTTGGCATTGTCAGATTTTGAGCAAAGACAAGTAATTGACGCATGCCTTTTCTTATATGGTTGAACAACGATTGCTGCTGGATTTTTGTACCTTTTAATTTTCTATGGGATTATATTTAATTGCTCAAAACCCAAATTTTGCTACGTTGATGTAGTCATGATGATTAAGGCTTATTGTCATTTTAAATGAAACAATAATTTACCGAAGGCTTGCAACTGCGTGAAGACATTATATTGGAAAGTGAACAATCACTAATCTAGTGTCTTTATGATGGATTTGCAGGCAAAATTTGCTTCTTTCTTTGGACCAGGGATCACTGCAGAAGTTGGTGAGCATTATGCATGACATAATTATTTTTTTTCTCCCGTCATTATACTATTGTTAACTGAATGTTTTGCGTGTTTATCCCACAAAAAAAAAGATATATATATATAATATATATATATATATATATACTGAATGTTTTGTGTGAACATTGCTTTAAAACTTTTGCAGGGAAGGGAGAGAATGAGCTGTATGAAGTCCGACTTATTTCTAACACTATTCCCAGTGCTTCACCATCATAAATTATACGGTTAAATCAAATTCTGCTACATATATTGGTTTCATTTAGTATACAAACTGCTGTCCACTTGTGAAGTTCTGGTAAGCATCAATATTTCTTTGGAAACCCATGCTATCTAAATGTTAAATCCGATCTCTAATGTTTATAAAATGTACAAAAATATAGACAAAAGGATTAGGAAGAGTGAAAAGAATGACATCTAAATGGTTTCCTTCCTAAAAGATAATGAAGCAGATGGTGCAAAAAGGGTTTGTTCTAAAAGGAAAAATGGGACACTACTATCTGTACCTTTTTGGAAACTCCTTCACGTGATGCATTCTTGGTTATTTGATTAATAAAATTTAACCTTGGAATAATATCAGAATTAAGAAATATATCATTGCATAGTGCAACCTGCACCCTAAAAAGCACCCAATGCGGCACACGTGCCCGTAAACATGGTGCATTGTGGGCTTGCATACGCCATGTTGAATGTAGCTGTTGTCACAATTTGCTGGAAGTAATAGAGCCCTACTTACACATTGATAATTTGATACATGAGGATTGACTTTCTTATTCTGGTGCTTTGAAGTCTCATTCAAAAAATAGAAGACTTCTATCTGCCAATCAGTGATTTATCATTAATTATCTTATATCAAAGATTGGTTATCTTATCGATCATTTAATTTCATTTATGGCAATGAGACCAACAAAACATTGTTCTTGTCAAAATGAGGGTAACTTAGCGGTAATTGGCATATACCTCGAGGTTGTGAGTTTGATCCCCTACCCCTTACAACTGTTATTCAAAGCTTCTCTCCTGGGCTTGATATTAAATGCTATTATGCCTTTTATTGAAGCCTGATAGGTGTCTCTCTAAGTTGCTAGATTTCTGAAAATAGGGTTTTAGAGATCTATAACTGTTGGCTCTTCCCTCTCTTGCTACTCCTAAAGTTCTCTTCAAGCTGCTGCTTTCTGGAAATATATTTCCCTTTCGATAGTCATTAGAATCTATTCTTACATATTCTGTCGAGGGTCATGTCCACGTGAAACCTCCACTACGGACATCCCATCGAGGGTCAAATTTTGCTAATCGAAGAGATGGCATTTCCTTCGAATCCCTTCTATGGTTAAGACGAGGAACTTTCATTGGACCTCCCATGGAAAGAGGGAAATCATGGGTGATAATTGAGTACATAAAAGTAAATTGTTCAACTCAACTAGACGGGCCAGGGAAATACAATTCTTTCGATTCTTTAGAAGGGGCATCTCTGAATGTTCTTTTGGACATGGACGAGCAATTGAAGATCGGCTAAAGTGCGATCCAATGGCTCAAATTCAAGTTGGAATGGAAGATATCTGCCATTGAGAGGATTTTTGTCATCATGCCGTGCACGATCTTATTTCATATATATATATGAATAATTAAAAATGTTGTTTCATATTGATTAGTTTGTTGTTCAAATATAGTTTTCGTACTTAAATTTTTACTTATTTTCTGTTTGTCTTTACATTCTGACACCTGGTAATCTCGATTTTACTTGGACACCTAGGTAATAAATAGGGTTACATCTCTTTAATTAAATTTTTACGAAGACTTTTTAGTTAACTTTAAAATAAAATATCTACGGCAGACACCAAATTTAATGATTAGTGGAAAATAGATAAAAGATTTAAAAAGTTCAATAAGCTAAACACATAATTTTAAAAATTTCACTACTGGGTATTAGCTCATGAACCACTTGCCTTTCCCTTACACTTTAGATATCTATTTATGATTTATCTGATGTATTAGAGGACCAATATCTTGGATTTTGATAGAAGATTCACTTGTTAGCGACTAATGAAAGCTTATGAAATTAGTAATCCTACTCATTAAAAGCTTGGTTTGTGTAAAGGTAAAGACAAAGGCATGCACGTGAAGTGAGAAGGAGAAAGCTGCTCCCAACTTTCACCATCCCAATAAACAAAAAATAATATCATTCGGCACCAAGACTAAAGCAGAGAACCACCAAACTAAAACCCTAACCTTCCTCCGCAGGCCAGAGGTCCCTCTCCATGATTGCCCGCCCTTAATTTCTAAACACCCCACTAACACCTGCAAAAATATGCCTGAAATACGAAGAGAATCTCCTTTTGCCATTTTCTCAGTAACTTTTCAGCACCCTTTCAAATGAGGTAAAAAATTCTTCTTTCTTGTTTTGTGTTTATCTCTCTGTCTTGTTCTCATCCATGACAGGCACCCACCTTCACCTTGCCTCCACGTTTGCTCGTTTGTATTGTTTCTATCTCCCTTTTCTCCTTTTGTCATCTCCTTTTCTTTTCCTGCTTTAATTGCCACCTGCACTCATATTAGACTCATCACTGTTATGCATGAATCTGAGGAGAATGGGAAATCTCCTCTCTTATATTTGCTAATTTAGGCAATGTCATGTGTGATCATGACCCTCCTGGAGATTTGTAGAAAATGTTCAAACTTTAAGAAAGAGAGAGATGCAATGCAATATCTCTTTGTTGGAATGAAATTTTTATGCCTAAAATGAGAAGATTAGACACGTCAAGGACTGTACCAGTACAACTAAGTGATCTTTAGAGGCTCATTTATATTTCACTATGAAAGTGAAGTTTTGATTAATTTAGAGTTGTTACTATTTTTCCTCCCCAGCTGAAAGCTATTGTTTGTGGTTACGGCCAAAAAAAGAAAAGGTTAATAATTATGACCCGTAAGCCGCAATCTTACCAAACAATGTTAAGATACTGATGAAATGATGAATCGCAGTTCAATCAGTACAAGTAAAACAAAATCCAGAATGGTCACTTCTTTTGCTCAACTATAATAATTGAAAACTGTTTTCTCGTTTTACTTTTTTGTTTCTTTTCACTTTTAGTTTTTCCCTTGAATCCCGGCCCTTTCACCTGAGGACTGATTTTGAAATTATACTTTCTGATATTGAGGTACAAGCATGTGGATGTCTTATGAATGACAGATGAAATGATTCAGAGACCCTTTTTTTTCATTATTTTGGGCATTTGACCATGTCCTTCAAATTCATAACCATTTTTTCCAGAAAGAGAGAAATAAAGCAGAAATTTCAACATAAAGCAGCAGAAACTGCGAGAACCCATTTTGTATTTTGCAGGAAAGAATCTCGATGGGCTCCACAATAAAGCAAGGGGGGATGGTTGGGGAAGTGGGATTGTGAAGTGTTTGGTACAGAGTTTATGGTAGTTGGGGACTTGGAAATATGTTTTGTTAAGTTGGCTTTGCTCATGTTACTGATATATTCTTGGTTATAGTCCACAGTTTTTCTTCCCATCTGATTGATTCTTTTGCTGTCCGGATAATGCTTCCCAGAAACAGAGACAACCACAACTGTATGAGTTGTTGGGGAGCAGACAGCCATGGGAGAAAATTATAGGTCTGTAAACTAGAGCCTACAGGAAGATGCTAAGAACCAAAGCGTGGTCCCAATTTTACTCTTACACTTCTCCATCTTGTTACAACATCAATCTGAATCAGCCAAGCCTAATCTTGACGGCCCCATTTATAGAAATGCACAATTAACGAGGAAGAGAGTGAAACATCAAAATGATTACTATTAGCACTCTTAATGTTTTGGGGAACGGGTTAATCAGCCTAAAATTTCTCCCTAGGTTTCTTGATTAGTTGGATTTGTATATTAATCTTGTCATTTCAAGCTCCTTTGATCTTCCTTATGCTAAATCATGATTATCATTTGAAGGATCAAAGAATGTACATACCCTTGCATCCTCTGACTTATTTTTTGGTTTGGAATTAGAATTTGGTGAAGCAGTTTCTTCTTTCCCACATGCCTTTAACGATCTCTCTTTAATGGGAGGTGTAGAAGCAGGGCTAGGACTTGCTGAACTTCGGTCAGGTGGTGTGCTTTGAGTTCCCACATCCTTAGTCATCCCTTCCACTTTCTTTTCTACAACTTGAACAAGGAAAACTTGTTCGATATTGAATCCAAAATCTTTCAAAGCATACTATAAGAAAGGAGGAAGAAATAAACGATGGTACCTGGAATTGGATACTGATGAACTGCTTCTTCCTGTGGAGAAGTAATTGGCCTTTCATGGAGTAGTCTGGAGTTGGTTGAATAAACTAAAGGAGCAGACATGGTGATTCTGGTTCTGAAGTTTCTGAATGACCTTGCGCTGAGTGTTTTCGCAGATAAACACTCCATTGGAACCTCCCCTGAAAGAGGATTGCAAATATATTTCTCTGCTTCGTACCACTTTGAGTTCAAAGAAAACCCATCTGGTGACAAGCTTTCCCAGCTATTATAACTTGACCTGGCTTGCAATGCTGTACAGAAACCCACCCACCCAATGTACTATCAATCCCAGCTGAAACCCAGGTTCAAGTTTTGACATTTTAGAGCCAAAAGGAGTGATTTCTTACCTTTGAGTGCAGCAGTGAATGAGTAGGTTGATGGGTCAATTTCTTCTGCAAAAGAAAGCAAGAAAAAGAAGATGAAGAGTTGAAGATCAGAAACAGAGAGGAAGTAATTTAATTGAAGTACCTCTCATGGAGTAGCCTGGAGAAGAAGCAGTGCTGGAGATGGTTATGTTTGATCTGAAGGATCTGCCATCTTCCCTAAAGCTTCTTATATAAGAACCCGAAAACCTTCTGGAACTTGTGCTCTCGCACTTGATCCTTGCCTTAACTGGCCATTCTCTCAAACTGAACTCCTCTGCTTCTGCTTCTTCTCTTCTTTGAGAACTATAATGAAACTGCTCCATTCTAGGCCTCTGGACTCTGGAGAAATGAGCAACTAGTTCATTTCATTCTCCCCTTTTAAACCAACTCTCCTCACACTAGTTTTTTAGTTTTTTAGGCTAAATATTAACCATGGAAATATTTTTATTTTATCATGTATATTCATGTTTGGATGTGGAATTTGTTTCTCTAATCTGTTGCATCAACATTCTAATTGCTATCACATCTCTAAAAAAGCTAAGTTCCTTTTTCCCAAGTAAAGCAATCATCTCTGTCACATCCTGAATTGGCCAACTTCTTTTTTTGAAAGAGAAAACAAGTCCATTTCAACAAATTTCAGTCTCAATTATTGGTGTGTATTGTTAGGATTCTGCTTCAATTAATGGGTCTCACACTTAGTTGTTGTGTAGCTCGAAATTGTTCCCCATGCAATTTCACTTTGTTTGTGGTGCTTTCCTCCTCGCAATTTCATCTTCTCTCCTTGGTTTAAAATTTTGGATCTTGATCTCATGTCATGATCAACATTGTCCCAA

mRNA sequence

GGGGGATAATATGAGGACACGTAGTGTAAATTGGCGGGTTGAGCAACGAATCCTATTTTCGTTTTTCTTTCTCTAGCAACCGAAGCTTCACATTCATCTGCGTTCTTCAGAGTTTCAGAGAGAAGAATGGGTGGAATTTCACCAGCCGCTGGTCTCTGCTCCCGTGATCAATTCAGTCTCGGCAATCGCCTCTTCGTTGCTTCTCTGCATTCCTGTCGCGCAATCTCCCGCCCCGCCGGTGGTAATTGCAACGGCGGAGCAAAGTTTTTGGTCCATTTGGAACCGAGAACGCTTGTGCAGAAGCGTCAGCGTTTGGATTTCACATTCTCCGCTAGAGCTGCTGATTCAACTCAGCCGTCTTCGGTTTCAGCCTCGCCAGACAGAGCAGTAGTCACTGATGATGAATTCTCGCTCGCGAAAGTTTCATTTGGTGTTATTGGCTTAGGTGTTGGGATTTCTTTGTTGTCGTATGGATTTGGTGCATACTTTAATATCCTCCCAGGATCTGAATGGTCTGCCATTATGTTAACCTATGGTTTCCCTCTTGCTATTATTGGCATGGCTCTCAAGTATGCCGAACTCAAACCAGTGCCATGCTTGACATATTTAGATGCTCAAAAGCTGAGGGAAACATGTGCCACCCCTATTCTCAAACAGGTAAGGGATGATGTGATAAGGTACCGTTATGGGGATGAACAACATTTGGATGAGGCATTGAAACGGATTTTCCAGTATGGTCTGGCTGGGGGAATTCCTAGAAGGAGCGCTCCTATTTTGCAAAGTATTCGTGAAGAAGTCACAGAAGATGGAAAGTATTGCCTGACCTTGGTGTTCGAAGCAAAAGCCTTGGCATTGTCAGATTTTGAGCAAAGACAAGCAAAATTTGCTTCTTTCTTTGGACCAGGGATCACTGCAGAAGTTGGGAAGGGAGAGAATGAGCTGTATGAAGTCCGACTTATTTCTAACACTATTCCCAGTGCTTCACCATCATAAATTATACGGTTAAATCAAATTCTGCTACATATATTGGTTTCATTTAGTATACAAACTGCTGTCCACTTGTGAAGTTCTGGTAAGCATCAATATTTCTTTGGAAACCCATGCTATCTAAATGTTAAATCCGATCTCTAATGTTTATAAAATGTACAAAAATATAGACAAAAGGATTAGGAAGAGTGAAAAGAATGACATCTAAATGGTTTCCTTCCTAAAAGATAATGAAGCAGATGGTGCAAAAAGGGTTTGTTCTAAAAGGAAAAATGGGACACTACTATCTGTACCTTTTTGGAAACTCCTTCACGTGATGCATTCTTGGTTATTTGATTAATAAAATTTAACCTTGGAATAATATCAGAATTAAGAAATATATCATTGCATAGTGCAACCTGCACCCTAAAAAGCACCCAATGCGGCACACGTGCCCGTAAACATGGTGCATTGTGGGCTTGCATACGCCATGTTGAATGTAGCTGTTGTCACAATTTGCTGGAAGTAATAGAGCCCTACTTACACATTGATAATTTGATACATGAGGATTGACTTTCTTATTCTGGTGCTTTGAAGTCTCATTCAAAAAATAGAAGACTTCTATCTGCCAATCAGTGATTTATCATTAATTATCTTATATCAAAGATTGGTTATCTTATCGATCATTTAATTTCATTTATGGCAATGAGACCAACAAAACATTGTTCTTGTCAAAATGAGGGTAACTTAGCGGTAATTGGCATATACCTCGAGGTTGTGAGTTTGATCCCCTACCCCTTACAACTGTTATTCAAAGCTTCTCTCCTGGGCTTGATATTAAATGCTATTATGCCTTTTATTGAAGCCTGATAGGTGTCTCTCTAAGTTGCTAGATTTCTGAAAATAGGGTTTTAGAGATCTATAACTGTTGGCTCTTCCCTCTCTTGCTACTCCTAAAGTTCTCTTCAAGCTGCTGCTTTCTGGAAATATATTTCCCTTTCGATAGTCATTAGAATCTATTCTTACATATTCTGTCGAGGGTCATGTCCACGTGAAACCTCCACTACGGACATCCCATCGAGGGTCAAATTTTGCTAATCGAAGAGATGGCATTTCCTTCGAATCCCTTCTATGGTTAAGACGAGGAACTTTCATTGGACCTCCCATGGAAAGAGGGAAATCATGGGTGATAATTGAGTACATAAAAGTAAATTGTTCAACTCAACTAGACGGGCCAGGGAAATACAATTCTTTCGATTCTTTAGAAGGGGCATCTCTGAATGTTCTTTTGGACATGGACGAGCAATTGAAGATCGGCTAAAGTGCGATCCAATGGCTCAAATTCAAGTTGGAATGGAAGATATCTGCCATTGAGAGGATTTTTGTCATCATGCCGTGCACGATCTTATTTCATATATATATATGAATAATTAAAAATGTTGTTTCATATTGATTAGTTTGTTGTTCAAATATAGTTTTCGTACTTAAATTTTTACTTATTTTCTGTTTGTCTTTACATTCTGACACCTGGTAATCTCGATTTTACTTGGACACCTAGGTAATAAATAGGGTTACATCTCTTTAATTAAATTTTTACGAAGACTTTTTAGTTAACTTTAAAATAAAATATCTACGGCAGACACCAAATTTAATGATTAGTGGAAAATAGATAAAAGATTTAAAAAGTTCAATAAGCTAAACACATAATTTTAAAAATTTCACTACTGGGTATTAGCTCATGAACCACTTGCCTTTCCCTTACACTTTAGATATCTATTTATGATTTATCTGATGTATTAGAGGACCAATATCTTGGATTTTGATAGAAGATTCACTTGTTAGCGACTAATGAAAGCTTATGAAATTAGTAATCCTACTCATTAAAAGCTTGGTTTGTGTAAAGGTAAAGACAAAGGCATGCACGTGAAGTGAGAAGGAGAAAGCTGCTCCCAACTTTCACCATCCCAATAAACAAAAAATAATATCATTCGGCACCAAGACTAAAGCAGAGAACCACCAAACTAAAACCCTAACCTTCCTCCGCAGGCCAGAGGTCCCTCTCCATGATTGCCCGCCCTTAATTTCTAAACACCCCACTAACACCTGCAAAAATATGCCTGAAATACGAAGAGAATCTCCTTTTGCCATTTTCTCAGTAACTTTTCAGCACCCTTTCAAATGAGGTAAAAAATTCTTCTTTCTTGTTTTGTGTTTATCTCTCTGTCTTGTTCTCATCCATGACAGGCACCCACCTTCACCTTGCCTCCACGTTTGCTCGTTTGTATTGTTTCTATCTCCCTTTTCTCCTTTTGTCATCTCCTTTTCTTTTCCTGCTTTAATTGCCACCTGCACTCATATTAGACTCATCACTGTTATGCATGAATCTGAGGAGAATGGGAAATCTCCTCTCTTATATTTGCTAATTTAGGCAATGTCATGTGTGATCATGACCCTCCTGGAGATTTGTAGAAAATGTTCAAACTTTAAGAAAGAGAGAGATGCAATGCAATATCTCTTTGTTGGAATGAAATTTTTATGCCTAAAATGAGAAGATTAGACACGTCAAGGACTGTACCAGTACAACTAAGTGATCTTTAGAGGCTCATTTATATTTCACTATGAAAGTGAAGTTTTGATTAATTTAGAGTTGTTACTATTTTTCCTCCCCAGCTGAAAGCTATTGTTTGTGGTTACGGCCAAAAAAAGAAAAGGTTAATAATTATGACCCGTAAGCCGCAATCTTACCAAACAATGTTAAGATACTGATGAAATGATGAATCGCAGTTCAATCAGTACAAGTAAAACAAAATCCAGAATGGTCACTTCTTTTGCTCAACTATAATAATTGAAAACTGTTTTCTCGTTTTACTTTTTTGTTTCTTTTCACTTTTAGTTTTTCCCTTGAATCCCGGCCCTTTCACCTGAGGACTGATTTTGAAATTATACTTTCTGATATTGAGGTACAAGCATGTGGATGTCTTATGAATGACAGATGAAATGATTCAGAGACCCTTTTTTTTCATTATTTTGGGCATTTGACCATGTCCTTCAAATTCATAACCATTTTTTCCAGAAAGAGAGAAATAAAGCAGAAATTTCAACATAAAGCAGCAGAAACTGCGAGAACCCATTTTGTATTTTGCAGGAAAGAATCTCGATGGGCTCCACAATAAAGCAAGGGGGGATGGTTGGGGAAGTGGGATTGTGAAGTGTTTGGTACAGAGTTTATGGTAGTTGGGGACTTGGAAATATGTTTTGTTAAGTTGGCTTTGCTCATGTTACTGATATATTCTTGGTTATAGTCCACAGTTTTTCTTCCCATCTGATTGATTCTTTTGCTGTCCGGATAATGCTTCCCAGAAACAGAGACAACCACAACTGTATGAGTTGTTGGGGAGCAGACAGCCATGGGAGAAAATTATAGGTCTGTAAACTAGAGCCTACAGGAAGATGCTAAGAACCAAAGCGTGGTCCCAATTTTACTCTTACACTTCTCCATCTTGTTACAACATCAATCTGAATCAGCCAAGCCTAATCTTGACGGCCCCATTTATAGAAATGCACAATTAACGAGGAAGAGAGTGAAACATCAAAATGATTACTATTAGCACTCTTAATGTTTTGGGGAACGGGTTAATCAGCCTAAAATTTCTCCCTAGGTTTCTTGATTAGTTGGATTTGTATATTAATCTTGTCATTTCAAGCTCCTTTGATCTTCCTTATGCTAAATCATGATTATCATTTGAAGGATCAAAGAATGTACATACCCTTGCATCCTCTGACTTATTTTTTGGTTTGGAATTAGAATTTGGTGAAGCAGTTTCTTCTTTCCCACATGCCTTTAACGATCTCTCTTTAATGGGAGGTGTAGAAGCAGGGCTAGGACTTGCTGAACTTCGGTCAGGTGGTGTGCTTTGAGTTCCCACATCCTTAGTCATCCCTTCCACTTTCTTTTCTACAACTTGAACAAGGAAAACTTGTTCGATATTGAATCCAAAATCTTTCAAAGCATACTATAAGAAAGGAGGAAGAAATAAACGATGGTACCTGGAATTGGATACTGATGAACTGCTTCTTCCTGTGGAGAAGTAATTGGCCTTTCATGGAGTAGTCTGGAGTTGGTTGAATAAACTAAAGGAGCAGACATGGTGATTCTGGTTCTGAAGTTTCTGAATGACCTTGCGCTGAGTGTTTTCGCAGATAAACACTCCATTGGAACCTCCCCTGAAAGAGGATTGCAAATATATTTCTCTGCTTCGTACCACTTTGAGTTCAAAGAAAACCCATCTGGTGACAAGCTTTCCCAGCTATTATAACTTGACCTGGCTTGCAATGCTGTACAGAAACCCACCCACCCAATGTACTATCAATCCCAGCTGAAACCCAGGTTCAAGTTTTGACATTTTAGAGCCAAAAGGAGTGATTTCTTACCTTTGAGTGCAGCAGTGAATGAGTAGGTTGATGGGTCAATTTCTTCTGCAAAAGAAAGCAAGAAAAAGAAGATGAAGAGTTGAAGATCAGAAACAGAGAGGAAGTAATTTAATTGAAGTACCTCTCATGGAGTAGCCTGGAGAAGAAGCAGTGCTGGAGATGGTTATGTTTGATCTGAAGGATCTGCCATCTTCCCTAAAGCTTCTTATATAAGAACCCGAAAACCTTCTGGAACTTGTGCTCTCGCACTTGATCCTTGCCTTAACTGGCCATTCTCTCAAACTGAACTCCTCTGCTTCTGCTTCTTCTCTTCTTTGAGAACTATAATGAAACTGCTCCATTCTAGGCCTCTGGACTCTGGAGAAATGAGCAACTAGTTCATTTCATTCTCCCCTTTTAAACCAACTCTCCTCACACTAGTTTTTTAGTTTTTTAGGCTAAATATTAACCATGGAAATATTTTTATTTTATCATGTATATTCATGTTTGGATGTGGAATTTGTTTCTCTAATCTGTTGCATCAACATTCTAATTGCTATCACATCTCTAAAAAAGCTAAGTTCCTTTTTCCCAAGTAAAGCAATCATCTCTGTCACATCCTGAATTGGCCAACTTCTTTTTTTGAAAGAGAAAACAAGTCCATTTCAACAAATTTCAGTCTCAATTATTGGTGTGTATTGTTAGGATTCTGCTTCAATTAATGGGTCTCACACTTAGTTGTTGTGTAGCTCGAAATTGTTCCCCATGCAATTTCACTTTGTTTGTGGTGCTTTCCTCCTCGCAATTTCATCTTCTCTCCTTGGTTTAAAATTTTGGATCTTGATCTCATGTCATGATCAACATTGTCCCAA

Coding sequence (CDS)

ATGGGTGGAATTTCACCAGCCGCTGGTCTCTGCTCCCGTGATCAATTCAGTCTCGGCAATCGCCTCTTCGTTGCTTCTCTGCATTCCTGTCGCGCAATCTCCCGCCCCGCCGGTGGTAATTGCAACGGCGGAGCAAAGTTTTTGGTCCATTTGGAACCGAGAACGCTTGTGCAGAAGCGTCAGCGTTTGGATTTCACATTCTCCGCTAGAGCTGCTGATTCAACTCAGCCGTCTTCGGTTTCAGCCTCGCCAGACAGAGCAGTAGTCACTGATGATGAATTCTCGCTCGCGAAAGTTTCATTTGGTGTTATTGGCTTAGGTGTTGGGATTTCTTTGTTGTCGTATGGATTTGGTGCATACTTTAATATCCTCCCAGGATCTGAATGGTCTGCCATTATGTTAACCTATGGTTTCCCTCTTGCTATTATTGGCATGGCTCTCAAGTATGCCGAACTCAAACCAGTGCCATGCTTGACATATTTAGATGCTCAAAAGCTGAGGGAAACATGTGCCACCCCTATTCTCAAACAGGTAAGGGATGATGTGATAAGGTACCGTTATGGGGATGAACAACATTTGGATGAGGCATTGAAACGGATTTTCCAGTATGGTCTGGCTGGGGGAATTCCTAGAAGGAGCGCTCCTATTTTGCAAAGTATTCGTGAAGAAGTCACAGAAGATGGAAAGTATTGCCTGACCTTGGTGTTCGAAGCAAAAGCCTTGGCATTGTCAGATTTTGAGCAAAGACAAGCAAAATTTGCTTCTTTCTTTGGACCAGGGATCACTGCAGAAGTTGGGAAGGGAGAGAATGAGCTGTATGAAGTCCGACTTATTTCTAACACTATTCCCAGTGCTTCACCATCATAA

Protein sequence

MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS
Homology
BLAST of MC04g1792 vs. ExPASy Swiss-Prot
Match: Q55403 (Thylakoid membrane protein slr0575 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=slr0575 PE=4 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 3.3e-25
Identity = 78/187 (41.71%), Postives = 107/187 (57.22%), Query Frame = 0

Query: 96  LAKVSFGVIGLGVGISLLSYGFGAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPV 155
           L K+S   +GL VG  L   GF AY   L  +  +     YG PL + G+ALK AELKP+
Sbjct: 2   LPKISLAAVGLTVGGILTITGFVAY--ALDYATLNLAGFFYGIPLVLGGLALKAAELKPI 61

Query: 156 PCLTYLDAQK---LRETCATPILKQVRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRR 215
           P  +   ++K   LR   ATP   Q+R DV RYRYG E HLDE+L+R+   GL+     R
Sbjct: 62  P-FSQPTSEKIIALRNQLATPTQNQIRKDVTRYRYGQEAHLDESLERL---GLSPTDEER 121

Query: 216 SAPILQSIREEVTEDGKYCLTLVFEAKALALSDFEQRQAKFASFFGPGITAEVGKGENEL 275
             P+L S+ E+  E GKY LTL F +  ++L  ++++Q K A FFGP +   V + E ++
Sbjct: 122 --PVLTSLLEQDWE-GKYVLTLTFTSPFISLETWQEKQEKIAKFFGPDLEVTVAEPEEKV 179

Query: 276 YEVRLIS 280
             V LIS
Sbjct: 182 VTVNLIS 179

BLAST of MC04g1792 vs. NCBI nr
Match: XP_022134099.1 (uncharacterized protein LOC111006453 [Momordica charantia])

HSP 1 Score: 568 bits (1464), Expect = 3.84e-204
Identity = 288/288 (100.00%), Postives = 288/288 (100.00%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKR 60
           MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKR
Sbjct: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKR 60

Query: 61  QRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAY 120
           QRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAY
Sbjct: 61  QRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAY 120

Query: 121 FNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRD 180
           FNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRD
Sbjct: 121 FNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRD 180

Query: 181 DVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKA 240
           DVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKA
Sbjct: 181 DVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKA 240

Query: 241 LALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           LALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS
Sbjct: 241 LALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288

BLAST of MC04g1792 vs. NCBI nr
Match: XP_022989929.1 (uncharacterized protein LOC111486968 [Cucurbita maxima])

HSP 1 Score: 486 bits (1250), Expect = 1.76e-171
Identity = 250/291 (85.91%), Postives = 263/291 (90.38%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCR---AISRPAGGNCNGGAKFLVHLEPRTLV 60
           M GISP A LCS D+F+L NRL + SL   R   A S P GGN NG A F VHLEPR  +
Sbjct: 1   MSGISPPAALCSHDKFTLPNRLSLVSLRFSRPYCATSLPGGGNRNGRANFFVHLEPRAPL 60

Query: 61  QKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120
           QK QRL FTFSARAAD+TQPSSVSASPD+AVVTDDEFSLAKVSFGVIGLGVGISLLSYGF
Sbjct: 61  QKPQRLAFTFSARAADTTQPSSVSASPDKAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120

Query: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQ 180
           GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYL+AQKLRE+CATPILKQ
Sbjct: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLEAQKLRESCATPILKQ 180

Query: 181 VRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFE 240
           VRDDVIRYRYGDEQHLDEALKRIFQ+GLAGGIPRRSAPILQSIREEVTEDGKYCL L+FE
Sbjct: 181 VRDDVIRYRYGDEQHLDEALKRIFQFGLAGGIPRRSAPILQSIREEVTEDGKYCLVLMFE 240

Query: 241 AKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           AKAL LSDFE+RQAKFASFFGPGI AEVGKG+N+LYEVRLISNTIP ASPS
Sbjct: 241 AKALTLSDFEKRQAKFASFFGPGIAAEVGKGDNDLYEVRLISNTIPGASPS 291

BLAST of MC04g1792 vs. NCBI nr
Match: XP_022956448.1 (uncharacterized protein LOC111458174 [Cucurbita moschata] >KAG6602567.1 hypothetical protein SDJN03_07800, partial [Cucurbita argyrosperma subsp. sororia] >KAG7033244.1 hypothetical protein SDJN02_07298 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 485 bits (1249), Expect = 2.50e-171
Identity = 250/291 (85.91%), Postives = 262/291 (90.03%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCR---AISRPAGGNCNGGAKFLVHLEPRTLV 60
           M GISP A LCS D+F+L NRL + SL   R   A S P GGN NG A F VH EPR  +
Sbjct: 1   MSGISPPAALCSHDKFTLPNRLSLVSLRFSRPYCATSLPGGGNRNGRANFFVHFEPRAPL 60

Query: 61  QKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120
           QK QRL FTFSARAAD+TQPSSVSASPD+AVVTDDEFSLAKVSFGVIGLGVGISLLSYGF
Sbjct: 61  QKPQRLAFTFSARAADTTQPSSVSASPDKAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120

Query: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQ 180
           GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYL+AQKLRE+CATPILKQ
Sbjct: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLEAQKLRESCATPILKQ 180

Query: 181 VRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFE 240
           VRDDVIRYRYGDEQHLDEALKRIFQ+GLAGGIPRRSAPILQSIREEVTEDGKYCL LVFE
Sbjct: 181 VRDDVIRYRYGDEQHLDEALKRIFQFGLAGGIPRRSAPILQSIREEVTEDGKYCLVLVFE 240

Query: 241 AKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           AKAL LSDFE+RQAKFASFFGPGI AEVGKG+N+LYEVRLISNTIP ASPS
Sbjct: 241 AKALTLSDFEKRQAKFASFFGPGIAAEVGKGDNDLYEVRLISNTIPGASPS 291

BLAST of MC04g1792 vs. NCBI nr
Match: XP_023524489.1 (uncharacterized protein LOC111788387 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 484 bits (1247), Expect = 5.03e-171
Identity = 248/291 (85.22%), Postives = 262/291 (90.03%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCR---AISRPAGGNCNGGAKFLVHLEPRTLV 60
           M GISP A LCS D+F+L NRL + SL   R   A S P GGN NG A F VH EPR  +
Sbjct: 1   MSGISPPAALCSHDKFTLPNRLSLVSLRFSRPYCATSLPGGGNWNGRANFFVHFEPRAPL 60

Query: 61  QKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120
           QK QRL FTFSARAAD+TQPSSVSASPD+AVVTDDEFSLAKVSFGVIGLGVGISLLSYGF
Sbjct: 61  QKPQRLAFTFSARAADTTQPSSVSASPDKAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120

Query: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQ 180
           GAYFN+LPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYL+AQKLRE+CATPILKQ
Sbjct: 121 GAYFNVLPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLEAQKLRESCATPILKQ 180

Query: 181 VRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFE 240
           VRDDV+RYRYGDEQHLDEALKRIFQ+GLAGGIPRRSAPILQSIREEVTEDGKYCL LVFE
Sbjct: 181 VRDDVLRYRYGDEQHLDEALKRIFQFGLAGGIPRRSAPILQSIREEVTEDGKYCLVLVFE 240

Query: 241 AKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           AKAL LSDFE+RQAKFASFFGPGI AEVGKG+N+LYEVRLISNTIP ASPS
Sbjct: 241 AKALTLSDFEKRQAKFASFFGPGIAAEVGKGDNDLYEVRLISNTIPGASPS 291

BLAST of MC04g1792 vs. NCBI nr
Match: XP_038890554.1 (thylakoid membrane protein slr0575-like [Benincasa hispida])

HSP 1 Score: 464 bits (1195), Expect = 5.20e-163
Identity = 243/286 (84.97%), Postives = 250/286 (87.41%), Query Frame = 0

Query: 3   GISPAAGLCS--RDQFSLGNRLFVASLHSCR---AISRPAGGNCNGGAKFLVHLEPRTLV 62
           GISP  GLCS   DQF+LGNRL V SL   R    IS P GGN  G   FLVH EPRTL+
Sbjct: 5   GISPTPGLCSCSHDQFTLGNRLSVLSLRFSRPYRTISPPGGGNFIGRPNFLVHFEPRTLL 64

Query: 63  QKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 122
           QK QRL FTFSARAADSTQPSSVSASPD+AVV+DDEFSLAKVSFGVIGLGVGISLLSYGF
Sbjct: 65  QKPQRLAFTFSARAADSTQPSSVSASPDKAVVSDDEFSLAKVSFGVIGLGVGISLLSYGF 124

Query: 123 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQ 182
           GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTY DAQKLRETCATPILKQ
Sbjct: 125 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYSDAQKLRETCATPILKQ 184

Query: 183 VRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFE 242
           VRDDVIR+RYGDEQHLDEALKRIFQYG+AGGIPRRSAPILQSIREEV EDGKYCL LVFE
Sbjct: 185 VRDDVIRFRYGDEQHLDEALKRIFQYGMAGGIPRRSAPILQSIREEVIEDGKYCLVLVFE 244

Query: 243 AKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIP 283
           AKAL LSDFEQRQAKFASFFGPGITAEVGKGE +LYE  L     P
Sbjct: 245 AKALTLSDFEQRQAKFASFFGPGITAEVGKGETDLYETGLFLTLFP 290

BLAST of MC04g1792 vs. ExPASy TrEMBL
Match: A0A6J1BXU7 (uncharacterized protein LOC111006453 OS=Momordica charantia OX=3673 GN=LOC111006453 PE=4 SV=1)

HSP 1 Score: 568 bits (1464), Expect = 1.86e-204
Identity = 288/288 (100.00%), Postives = 288/288 (100.00%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKR 60
           MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKR
Sbjct: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCRAISRPAGGNCNGGAKFLVHLEPRTLVQKR 60

Query: 61  QRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAY 120
           QRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAY
Sbjct: 61  QRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFGAY 120

Query: 121 FNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRD 180
           FNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRD
Sbjct: 121 FNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQVRD 180

Query: 181 DVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKA 240
           DVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKA
Sbjct: 181 DVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEAKA 240

Query: 241 LALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           LALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS
Sbjct: 241 LALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288

BLAST of MC04g1792 vs. ExPASy TrEMBL
Match: A0A6J1JQQ4 (uncharacterized protein LOC111486968 OS=Cucurbita maxima OX=3661 GN=LOC111486968 PE=4 SV=1)

HSP 1 Score: 486 bits (1250), Expect = 8.51e-172
Identity = 250/291 (85.91%), Postives = 263/291 (90.38%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCR---AISRPAGGNCNGGAKFLVHLEPRTLV 60
           M GISP A LCS D+F+L NRL + SL   R   A S P GGN NG A F VHLEPR  +
Sbjct: 1   MSGISPPAALCSHDKFTLPNRLSLVSLRFSRPYCATSLPGGGNRNGRANFFVHLEPRAPL 60

Query: 61  QKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120
           QK QRL FTFSARAAD+TQPSSVSASPD+AVVTDDEFSLAKVSFGVIGLGVGISLLSYGF
Sbjct: 61  QKPQRLAFTFSARAADTTQPSSVSASPDKAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120

Query: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQ 180
           GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYL+AQKLRE+CATPILKQ
Sbjct: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLEAQKLRESCATPILKQ 180

Query: 181 VRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFE 240
           VRDDVIRYRYGDEQHLDEALKRIFQ+GLAGGIPRRSAPILQSIREEVTEDGKYCL L+FE
Sbjct: 181 VRDDVIRYRYGDEQHLDEALKRIFQFGLAGGIPRRSAPILQSIREEVTEDGKYCLVLMFE 240

Query: 241 AKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           AKAL LSDFE+RQAKFASFFGPGI AEVGKG+N+LYEVRLISNTIP ASPS
Sbjct: 241 AKALTLSDFEKRQAKFASFFGPGIAAEVGKGDNDLYEVRLISNTIPGASPS 291

BLAST of MC04g1792 vs. ExPASy TrEMBL
Match: A0A6J1GWV5 (uncharacterized protein LOC111458174 OS=Cucurbita moschata OX=3662 GN=LOC111458174 PE=4 SV=1)

HSP 1 Score: 485 bits (1249), Expect = 1.21e-171
Identity = 250/291 (85.91%), Postives = 262/291 (90.03%), Query Frame = 0

Query: 1   MGGISPAAGLCSRDQFSLGNRLFVASLHSCR---AISRPAGGNCNGGAKFLVHLEPRTLV 60
           M GISP A LCS D+F+L NRL + SL   R   A S P GGN NG A F VH EPR  +
Sbjct: 1   MSGISPPAALCSHDKFTLPNRLSLVSLRFSRPYCATSLPGGGNRNGRANFFVHFEPRAPL 60

Query: 61  QKRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120
           QK QRL FTFSARAAD+TQPSSVSASPD+AVVTDDEFSLAKVSFGVIGLGVGISLLSYGF
Sbjct: 61  QKPQRLAFTFSARAADTTQPSSVSASPDKAVVTDDEFSLAKVSFGVIGLGVGISLLSYGF 120

Query: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQ 180
           GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYL+AQKLRE+CATPILKQ
Sbjct: 121 GAYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLEAQKLRESCATPILKQ 180

Query: 181 VRDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFE 240
           VRDDVIRYRYGDEQHLDEALKRIFQ+GLAGGIPRRSAPILQSIREEVTEDGKYCL LVFE
Sbjct: 181 VRDDVIRYRYGDEQHLDEALKRIFQFGLAGGIPRRSAPILQSIREEVTEDGKYCLVLVFE 240

Query: 241 AKALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASPS 288
           AKAL LSDFE+RQAKFASFFGPGI AEVGKG+N+LYEVRLISNTIP ASPS
Sbjct: 241 AKALTLSDFEKRQAKFASFFGPGIAAEVGKGDNDLYEVRLISNTIPGASPS 291

BLAST of MC04g1792 vs. ExPASy TrEMBL
Match: A0A5A7TDN5 (Thylakoid membrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001820 PE=4 SV=1)

HSP 1 Score: 461 bits (1186), Expect = 5.31e-162
Identity = 241/289 (83.39%), Postives = 252/289 (87.20%), Query Frame = 0

Query: 4   ISPAAGLC--SRDQFSLGNRLFVASL---HSCRAISRPAGGNCNGGAKFLVHLEPRTLVQ 63
           ISP  GL   S  QF+L NRL + SL      R IS P GGN        VH E +TL+ 
Sbjct: 6   ISPTPGLSTSSHHQFTLSNRLSLLSLPFSRPNRPISLPGGGNFIARTNVFVHFETKTLLH 65

Query: 64  KRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFG 123
           K  RL FTFS RAADSTQPS+VSASPD+AVVTDDEFSLAKVSFGVIGLGVG+SLLSYGFG
Sbjct: 66  KPHRLAFTFSTRAADSTQPSAVSASPDKAVVTDDEFSLAKVSFGVIGLGVGVSLLSYGFG 125

Query: 124 AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV 183
           AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV
Sbjct: 126 AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV 185

Query: 184 RDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEA 243
           RDDVIR+RYGDEQHLDEALKRIFQYGLAGGIPRRSAPIL+SIREEVTEDGKYCL LVFEA
Sbjct: 186 RDDVIRFRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILESIREEVTEDGKYCLVLVFEA 245

Query: 244 KALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASP 287
           KAL LSDFEQRQAKFASFFGPGITAEVGKGEN+LYEVRLISN+IP ASP
Sbjct: 246 KALTLSDFEQRQAKFASFFGPGITAEVGKGENDLYEVRLISNSIPGASP 294

BLAST of MC04g1792 vs. ExPASy TrEMBL
Match: A0A0A0KVX1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G644000 PE=4 SV=1)

HSP 1 Score: 460 bits (1183), Expect = 1.52e-161
Identity = 240/289 (83.04%), Postives = 250/289 (86.51%), Query Frame = 0

Query: 4   ISPAAGL--CSRDQFSLGNRLFVASL---HSCRAISRPAGGNCNGGAKFLVHLEPRTLVQ 63
           ISP  GL  CS DQF+L NRL + SL      R IS P G N        VH E  TL+ 
Sbjct: 6   ISPTPGLSTCSHDQFTLSNRLSLVSLPFSRPNRTISLPGGANFIARTNVFVHFETTTLLH 65

Query: 64  KRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFG 123
           K  RL F+FS RAADSTQPS+VSASP +AVVTDDEFSLAKVSFGVIGLGVG+SLLSYGFG
Sbjct: 66  KPHRLAFSFSTRAADSTQPSAVSASPGKAVVTDDEFSLAKVSFGVIGLGVGVSLLSYGFG 125

Query: 124 AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV 183
           AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV
Sbjct: 126 AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV 185

Query: 184 RDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEA 243
           RDDVIR+RYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCL LVFEA
Sbjct: 186 RDDVIRFRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLVLVFEA 245

Query: 244 KALALSDFEQRQAKFASFFGPGITAEVGKGENELYEVRLISNTIPSASP 287
           KAL LSDFE+RQAKFASFFGPGITAEVGKGEN+LYEVRL SNTIP ASP
Sbjct: 246 KALTLSDFEKRQAKFASFFGPGITAEVGKGENDLYEVRLTSNTIPGASP 294

BLAST of MC04g1792 vs. TAIR 10
Match: AT5G38660.1 (acclimation of photosynthesis to environment )

HSP 1 Score: 348.6 bits (893), Expect = 4.8e-96
Identity = 177/223 (79.37%), Postives = 194/223 (87.00%), Query Frame = 0

Query: 59  KRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFG 118
           KR+ L      RAADST  S   AS DR ++ DDEF+LAK+SFGVIGLG+G+SLLSYGFG
Sbjct: 54  KREVLKLDVVGRAADSTSSSPSVASGDRTLIPDDEFTLAKISFGVIGLGLGVSLLSYGFG 113

Query: 119 AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV 178
           AYF ILPG+EWSAIMLTYGFPL+IIGMALKYAELKPVPCL+Y DA KLRE+CATPIL QV
Sbjct: 114 AYFTILPGTEWSAIMLTYGFPLSIIGMALKYAELKPVPCLSYSDAVKLRESCATPILTQV 173

Query: 179 RDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEA 238
           R+DV RYRYGDEQHL+EALKRIFQYGL GGIPRRSAPILQ IREEV  DG+YC+ LVFEA
Sbjct: 174 RNDVTRYRYGDEQHLEEALKRIFQYGLGGGIPRRSAPILQLIREEVLTDGRYCVVLVFEA 233

Query: 239 KALALSDFEQRQAKFASFFGPGITAEVGKGENE-LYEVRLISN 281
           KAL LSDFE+RQAKF SFFGP ITAEVGKGE+E LYEVRLISN
Sbjct: 234 KALTLSDFEKRQAKFTSFFGPNITAEVGKGESENLYEVRLISN 276

BLAST of MC04g1792 vs. TAIR 10
Match: AT5G38660.2 (acclimation of photosynthesis to environment )

HSP 1 Score: 325.1 bits (832), Expect = 5.7e-89
Identity = 163/207 (78.74%), Postives = 179/207 (86.47%), Query Frame = 0

Query: 59  KRQRLDFTFSARAADSTQPSSVSASPDRAVVTDDEFSLAKVSFGVIGLGVGISLLSYGFG 118
           KR+ L      RAADST  S   AS DR ++ DDEF+LAK+SFGVIGLG+G+SLLSYGFG
Sbjct: 54  KREVLKLDVVGRAADSTSSSPSVASGDRTLIPDDEFTLAKISFGVIGLGLGVSLLSYGFG 113

Query: 119 AYFNILPGSEWSAIMLTYGFPLAIIGMALKYAELKPVPCLTYLDAQKLRETCATPILKQV 178
           AYF ILPG+EWSAIMLTYGFPL+IIGMALKYAELKPVPCL+Y DA KLRE+CATPIL QV
Sbjct: 114 AYFTILPGTEWSAIMLTYGFPLSIIGMALKYAELKPVPCLSYSDAVKLRESCATPILTQV 173

Query: 179 RDDVIRYRYGDEQHLDEALKRIFQYGLAGGIPRRSAPILQSIREEVTEDGKYCLTLVFEA 238
           R+DV RYRYGDEQHL+EALKRIFQYGL GGIPRRSAPILQ IREEV  DG+YC+ LVFEA
Sbjct: 174 RNDVTRYRYGDEQHLEEALKRIFQYGLGGGIPRRSAPILQLIREEVLTDGRYCVVLVFEA 233

Query: 239 KALALSDFEQRQAKFASFFGPGITAEV 266
           KAL LSDFE+RQAKF SFFGP ITAEV
Sbjct: 234 KALTLSDFEKRQAKFTSFFGPNITAEV 260

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q554033.3e-2541.71Thylakoid membrane protein slr0575 OS=Synechocystis sp. (strain PCC 6803 / Kazus... [more]
Match NameE-valueIdentityDescription
XP_022134099.13.84e-204100.00uncharacterized protein LOC111006453 [Momordica charantia][more]
XP_022989929.11.76e-17185.91uncharacterized protein LOC111486968 [Cucurbita maxima][more]
XP_022956448.12.50e-17185.91uncharacterized protein LOC111458174 [Cucurbita moschata] >KAG6602567.1 hypothet... [more]
XP_023524489.15.03e-17185.22uncharacterized protein LOC111788387 [Cucurbita pepo subsp. pepo][more]
XP_038890554.15.20e-16384.97thylakoid membrane protein slr0575-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1BXU71.86e-204100.00uncharacterized protein LOC111006453 OS=Momordica charantia OX=3673 GN=LOC111006... [more]
A0A6J1JQQ48.51e-17285.91uncharacterized protein LOC111486968 OS=Cucurbita maxima OX=3661 GN=LOC111486968... [more]
A0A6J1GWV51.21e-17185.91uncharacterized protein LOC111458174 OS=Cucurbita moschata OX=3662 GN=LOC1114581... [more]
A0A5A7TDN55.31e-16283.39Thylakoid membrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... [more]
A0A0A0KVX11.52e-16183.04Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G644000 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G38660.14.8e-9679.37acclimation of photosynthesis to environment [more]
AT5G38660.25.7e-8978.74acclimation of photosynthesis to environment [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021275Protein of unknown function DUF2854PFAMPF11016DUF2854coord: 115..265
e-value: 1.1E-48
score: 164.9
IPR021275Protein of unknown function DUF2854PANTHERPTHR35551FAMILY NOT NAMEDcoord: 50..285

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC04g1792.1MC04g1792.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane