CsGy7G003850 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy7G003850
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionprotein THYLAKOID FORMATION1, chloroplastic
LocationGy14Chr7: 2932262 .. 2944251 (-)
RNA-Seq ExpressionCsGy7G003850
SyntenyCsGy7G003850
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGCTGTTAATTCCATTTCATTCTCTACACTAAACCAATGTTCCGATAGGAGATTGCTGCTTCCCTCCTCTCGTTCGCACTCCTCCAATTTCCACGGCTTTCCTTTTCGTACTAGCGTTTTCACTCATTATTCCCGAGTACGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGGTAGCTTCTACCTCTGGTTATTTATTTTCCTTACTTGCTTAATTTGACTATTCCGATTAAGTTAGTGAATCTCTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGATTGAATGTTATAATGAGTATTAGAAGTTTTAAGTAACTATTAAGCTTCCAATAGTTCAGTTGGATCAGTTCTGTACTTGTGTAGTTTGTTGAGAGATTTCTCTAGTTCGTTTTCTTACTACTATAGGTTAGAAACTGATGCGCTATTTTGACGGCTCAGATGTGACCACTGTGGCCGAGACAAAATTGAACTTCCTTAAGGCCTATAAACGGCCTATCCCTAGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTCCAAGCGTATATTAAGGCTTTGAATGAGGATCCAGAGCAATATAGGTTTGGCCATGTATTCCCCTTTGCTTTTACCATCTCTAATCTCTATCAATTGTTGTCACTTGTCAGCTAGAAACTTCAGTTGCCTTATTGTTAAGTTTAAAAATCACCAACACGATTCCCTTTGGCTGTCTAAATGACTAAATTGGATTTGTATTGAATGTTAGTGCATGCTCGAAAAGTAGTATTAGAGACTTTTATTTGTACCTAGTAAACAAATTTACGAATGTTCATTTTTTATATTTCAAAAAGGGTTTTCTTTATTTTAAGTAATAACTTTTGTTATATCATCCAGAATTGATGCTAAAAAATTTGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATTGCAGAAAGAGCAGGGAGCAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTCCTTGAATTGGCAAATGCTACTGAGCCCAGTATCTTGGAAAAGGTTTCTTTAATATTCCTTTCTGTGATATAGCTTTTAAGTAGTTGAGATTTTGATGACATTCTTATCGAGTTGATTATAGCTAAATTAAATGACAGCCAGCCTCTCTTGTAGGACTTTTATCAATAGATGTTATCTCTGAGTGCTGCCATTAATAAAGTATAATAGTGTTTCTGGTTCATTAGTTTTCTTTTTACTCGTTAGAGAAATTGGATCTTGATGGAATATTGCCTAACATACTGTTCGAGACGAGAGAATTAGGTTCAAAGGCTTCTTTTGTGCATAAGGCTGCATTTTATTTTCTGGCTAACTAGTCTTTTGGAACATTTGTCTTTCCCCTTTCCTCCCTTGCCGCTGACCCATCACACACACACATACTATTGTCGGATTTCCTTTCACCCTTGTGAAGGCCGATTCTATTTACTGTGTCTATCACTGAAGATAAGATACTTTTGATCTCTTAAAACAGCTCAAGCTATGTAGTCGAACAGTTTATAAAACAGCAGAAGTCATATGATAGAGAATTAACCGGGTATTGAGATCTAATTTCTTGGTACTTCTGAACAGCTCTGTGCCGCTTTAAATATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGTAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCCTAAAGGAATATGTCGACAGGTAAGAGACTGTTAAGTTCATGGTTCTAGGCATGGTAGATTAATATGCTCTTTAAAGCCCTCATATTTTCAGACTTTATGAAAATCTACAATCGAATTGAAAACTGTCTCAATCTTGTATTTGTGAAATCTCCTGCAAGTCACTATCATAGCATTTATATTCATTGGATCTTATCTGAGGTCTATATGAGGTAGACAATTGGGTGGATGATGGTGGTTGTAGGAGTAAGATTTAAAGGATAGAAGTTATATGATTAACCAGCGGGTTTTGAATTGTGTAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGTTTATAAAAGGTAATTGGAGTCGCTACATAATTTGAGATAGACTTGAGAGAGTTTATAGCAACATTATTCTAAATACTTGTTAGATATGCATCTATAATGTATTATGTTGGGTCTCATGCATTTGGTAAATTTTGTATTCAGCCACGGCTCTACCTTTATTTCTCATTTGATATAACCACCACTCAAGCTATTTTTTAATCATTTGTATTACATACTTTGAGTGCAACTGTCAATACAAAATGCTTTCGCACTCTGTGCAGTAATCACTCCTCCAGGGATATTTGACCATCTGCCTTTTGTATTGTTATCCTTTCTGTATTATTCTGATATTTGGCTATTTGCTTTTGTATGATCTAAATTGTTTCAATTATAAACGTTTTACGATTTTAATTGATATAGTTAATAGTTTGGAGTTTTAACCCTAATTTAGAACCATCCTTTTGAAAATATGACCTTACTGGAACAGATCTGTTTTACAGGTGTGCTATTTCAATTGTGTAGACTTGTGAAGCTCAATACTAAAATTTATACTCTCCCGAAAAATTTTATGTTCTAATTCAGTTACCTAGTATACGTTAAAAAAAAAAAAGGATTAATATTCTCTCAAGAGTGCTCTAGGACTCCATATTTGAGAGGAATGCCCACTATGATAAGGCTGTGACTGAGGCCCAATAGAATGAGGCTATTGGTGAAGATCCAATGAGCCTATTGTGATAGACTTATGGATAGCGGTTATTGGGCCTGTTCTGGCAAGACCGGGGTCTATCCTATGGCTGTGACAATTTTCCCCAAAGTTAAATAGGTAAATTGCATTAATATATTACTTATTACTTTTAGGTATAATTATTATCTAACAACGTTTTGAAAACTTTTGTGAATGCAACAAAATCTATCTACTTTACCACTAATTGATTCGTTTGAGTGATAATCTATTGCATATTCTACCAACGACATAATGTATCACTTGTGTACTTCAATATTTGAATCAAAAAGCTTAATCTAAAATTTATTATATTAGCAAAATTTTTGTATTGAACAGATTATTGTAATGATATGAGTTTAAGAAGGGAATATGAATGACTGAGGTTGTATTGAATTGGAATCTTAAACTTGGGGTGTTATAGTTACTATATTTGTAACAACCCCAAATCTAATCAAATTTTGTATAAAAAAAATTTATACTTGGTTTTGCTTTTAGAATTGCACGTAAATTAAAAGTATAATCATTACCACTACCAAAAATTACAATTTTTTACAATAGTAACGATGAAATAAACAGCGTCTAAGTCTACAAGTATATGTTGTAGACGAATTATAAGTATATGTTGTATCTTTCATTTGAACATGAGAGTTTTTTCAAAATTAGACTTGAATTGATGGTTTAATAAAAAAAAATGGATGAAAATCATACCAACAACTCTCTTGATTAGAACACATGAAATCTAAATTAAAAACCATAAGGGTATTAATATGCCACATTTGGAAAGTTGAATTTAAAAAAAAAAAAAAAAAAGCTACATAGTTAGCCACAATATTTGAATAGAAGACACGTAATGATGTTGTATTGATACGTGTGAGTTCCCTAATGCATGAAAATATGAAAAGCCTCTCTTAGAAGACCACCAACCTCAACCCTAATTCCACTCCCACTTACGCAACCCATTCTCTACTCTGCTCTTCCTTTTTCACTCCCCTATTTTATTATAATTATTAACGATCTAATATATATTACAAGCCATTCAGAGATGTATTTTCCTTAATTATAAATACGTATGTTCAATACGAAAATTGTTTATATGACCAAGTAACCAATATTAGACATTCCAACTATATACATCATATAACTAATTAATCATATATGTTATAAACACAAATAAACATACACGAGGCTTCTAATTTAAACTGTATAATTCAAGTTATAACCTTTATGCTTAATACCATTTAGTTGTTGTTGATCTATGGATCACAGAACCATCTCGAAGTAATTATAAGATGATGATTACATTATAAAATGCAAAAAATCATATGAAGTTTCTAGATTTTGATAGAAAAAAAATGTTAGATAGTTTGTTGGGGGATTATGTTTATACATAATAACTAATATATAATTGAGAGTTTGTTAAAGGGGTTTGTATATACGTATCATTTTGTTTGAATAGACGTATATATTATTGTATTGTATTTGGTAGAAATGATAGCCATGTGGACAAACCCGAGACCAAACTGCATGGCTTTTATTTTAATATATTGAATAAAATGAATCTTTCCTTCACCAAAATTTTATTAAATAACAACCACTGTATCTATTTCTTTATATAAAATAACCTACCAAACTTGGAGACATCATATGTTAGAAAGATGACATATGCCCTACTTTGCATGTTAATATACATTCACTATTTAGTTCTTTAACTTTTATATGTTTTCCAACTGATCATTATTTAGTTCTTTAACTTTTATATGTTTTCCAACTCATTATTTTCAGTTAAGACACATTTTAGGTGGAAGAAGATGATATAATATCAAACTTATTTTTTATCCTCAATTTTGGGTGATTTAACATCTTATCATAGTAGAGGAGTCTAGGAGGTTCTATGTTTAAATTTACGTAATGTTATTTTTTTTGTATAATTAATACATCAACTTAAACTTTTGGGTTAGTTAGTGATTTAATATAGTTTTAAATTGCAATAATTTAGGAGGTCGTATATTAAAGGTCATTCATGAATGCTCTCTCATTTTGTTGGTTATGACTTCTCATTTTGACATTTTAGCTAAGCACATTCAAGAGGGGCATGTGCTTATTATATGTACTAAACGAATTTTTGAAGTTTAATTTCTCCCCGAAGTATATTATAACTAACACATCAACAAACCCTTTTAGGACCAAAGCATTACCTAGCCTTTGATGGTTAAGTGGGCAGTTGAAGTTGACTCAATTTGTATCACTCGTCACCATGCCATTCATTTGGTACAGAGAAAGAAAGAAAAAAAGTAAATCAGAAAATGGGAAGTTTTAGAAATGATGAAATCTTGTGGTCGTAAAAGTGAAGGGAATAAAAAAGGATATATTGAGTGAGATTTATGGGTTAGATTAGATAGAGAGTGGATGAAATAAGGGACATAATTTTCAATGAGTTTGAATTTAATCGATTCCAAAAGGCAAAGACACAACTTATTGGCCACAGGATTTAAACTGAACCTGATGATTTGGTTTGGTACTGTGCCACAGTAGACAAATGTCAACACAAGAAAAAGAGTTTTGATCTCTTTTTTTAATTGATTGGTTGACCTTCTGATTAGTACTGCTACGTGTAGGAATTGCAATGGTCTGTTTCATATACCAAAGCTAGCTAAAAGGCTCCAGCTTGGAAATTTGTAGGTGAGCATGTGAATATCATATCTTACACGCTATAATATAGTTTTGTGTGTTCCTTGATTCTTATTGCATTAAAATGTGAAACAAACAACATGCTTTTATGCACATAAAGGGTATGTCCTCGTTTTGTTACCAACTCATTGTATATTATAATATGTATACTCATTTTTATAATATTTTGTTGCTCAACCATCCAAAATCAACTTAGATTCTCTTTTCTATTATTTTAACTTAATTTCATGGCTCATATTTGAAAGTGTTTCATCACATATAGAGAGACTTGATGACTCATCATTGAATTTTGAAATGTTTGGTTATTAATATATATAATGGATAATAATTTGAACTTATTTGATTCTGTGGAGTTTGTGATCATTTGTTGTTATGTAATATAAATTAAAGATGATTTTTTTTACAAGTGTCTTGGCTTTTATGTAACATCATGAGTGTTGATGAGTTAATATGTAATTTTGGGATTAATTATTTAAACTTAAGATTAGTAGGTAGTAAATGAATGTATATGTTTTTAATATAAAGTTTTAAAAGAAGTATTTGACATATAATTATTTTCTTAGTTAGAGTCAAATTTTGAAAAAGAGGAGTTTATGTGTCAATATAACTTTTCTTCTTTTAGTGAAATTTGACTGACATTTGAAAAGTGTTTTACATAATTAAGCATTATTCTCCTTTTTCAAAAGTCATTATGCTTTTTGCTTTTGTTTTAGTTTAATGCCTATAGTCTTAAAATTTTCAACTTCCATAAATTAATTAGTTTTTATGATCTCTTATAATTAAAGCTTCCAAAATATTCTTCTGTCTTAAAGATGTGTAAACACTTTTTTTATATATAACACAATCAATAGAGAAAGAAAGACATCAATATGAGTTGTTGCATAAATGCATCCATAACAAAAATTGTTAGATCAATATTGTTAAAGGCTGCTCTAGATCACAATGTTTAAAATCTATGAGTGGATCCAATTCAATAAAAATCGAGAAACAATATAGAAAAAAAATGTATGTCTCAATAAAATAAGAAAAAAGAGTCAGAGTCGCGAGTATTGGACAAACCATTCAAGCCAAATAAAAAAAATAGCCACGAAACCTTCTTGAGGATTACAACCACCTCGAAAATATTGAGTATAAAAGAAAATGAAAATTGTTTTATTATTTAAAGACATAAAAACAAAGATTATAATAAACGAAGGCCTAAAAATAATTAAATAACCGACCAAAGTTTACTAAGTTTGAAAATTTAACCCCAAGTATTTGAAGGATAAGTTTGGTATCATAAAAAATGGATGAATATGACAATAATTTTTTATTTTCTAGAAGTTGACTACTCTTCACATAAAAAGTAAAAATCCCTCGAGGTCGAGGTCGAGGTCGATGGTTATGTTCTTTATCTCTATAATATAGTTGTTGTACTAAAATAACATTTAAGCTTATAGACTAATAATTGTTTGGTTATTTTACAAACTTAATTTGGTTTACGTTTTAAAATCATTTATCAAAATAGATAATAAAATTAAAAAAAAAAAGATGTTTAGAAAGTTTTAAAAATTAACATCTTTAGGTTAATGTGATCATAGTTCAACTAACATATACTATGAAATGAAAAGGCATATAATTAAAAGTCCGATCCAAAGTTTGATGTAAATAAATAAACATATCTTATTGAATGTTTTTAATTAGTGAAAATTAGAAATATCTATGCTACATTACATGCATGTGATGTATTTATTTTTGTAGAGTTTTAATATTGCACATAACTTCTAGATTAAAACTCAAAACACTATACTATTAATTCCCACCTACATCTTTAGAAAAAGAGGAACTTTAGTATTGACATTAAGGATTTAAGGTTAATTAGTTGAAGTTTATAAAACATAATTGTTTTTGTTAAAAGAAAAAAAAGTTATAATTTTAGCTTTGCTCATTTTTCTTGGGTTTAATTTCTACTTTATCCTCTGTTTGAAAAATGAGTAATTCAAGACAACAAGGGAACTGCTTTCTAATTTCAAGGCCTTATATTTCCTTCTCATACAAGGAAAAGAAATTAAGGAGTCTTTATTTCCCACTTTTAACTTTTTATGGTCAAAACTTAATACTTTCTTTAACTAATTATTCTAATATTTAGTTTTATTCCATTAAATATATATATAATATAATATAATCATTCTATTTCATTTTTCTTCTCTTTTTATTAAATCTAAAATTTAGTCATTTTGATGTGTAAAATTCATTTAAATTGGCAAAATCAAGGTAGTTCAATTCAAATTTTGTTGGTACAACTAACCTAAATAAGGAAAAAAGAAAGAGAAAAAATATCAATAAAATTATTTATGAATAATATGAAATACGATCGGAAAGAAGCCTTTGATTTGAGAATTTCGAGAAATTAAATTTTCATAATAAATAAAGTTAGAAATTTTTGTCAAACTAAAACTTTTAAAATAAATGGTAGAAATGAATGTGAAGTTTAATTTTTTTAAAAATGCTCGTAAACTTTCAAAATGAGTTTGAAACTACCGTTAATATACCATCTTATCTTTTTATTTTCCATCTTTTTCTCCCCTCCCTTCTTCAACCCTCTCTTATTCTCATCTTTGTCATCTATTTAATATTTGCTTATCTAAACATAAACACATAAATAAAACTACACACTTTAATTAAGTTATATTGAACTATATAATTAGAAGAAAGATAATAAATAAAACAAATTTTAGGTTAGGAGTTGAGGGTGTGTTGATAGTCAATTTTTTTTTTCTTTTTGTTTTGGTATGTTTTTAGAGAAGATTTTGATCTTGAAATTTTGTTGGATATTACGTTTTAACTAATTTTTTTTGTTTTTATTTTGATCATTGAAAAAATTGATGGTGTAAGCTAATTAGAATTAGAAAATTAAGTGAGAATATCTGTATGAAAAAAAATTAGAATCTTTGTATAATTTCTTTTAAAATTGGTTCTTTAATTCAAACCTAATAAAACTTCAAACATATTAGCATTTTTTTTTAATTTTAAGAATATTTTTGTCACCAAGAATTATACTAATGGTTTTTGTGCAAATACTAACATGCAATATGTTTTAATTTTTTTTATTAAATTCGCGGTAGTTTTTAAACTTTTGAAAGTTTAGCGGTATTTTTGAAACAACGTGTTAACGATTTCCATCCAAAACTAACGACATAAGTATTTTTTTTTTCTCTTTTTAGAAGTTCGAGAGTATTTTTGAAATTTTTTGACACTGTAACATTTTTTTAAGAGATAGTTGCAAATTTAGTCATTAGATTCAAAATAATTAAATATATACCAACATTCTAAAAAGTTTACAAATATAACAAAAATTATCAAATTTTATCAATGATAGAAGTTTATCATCGATAGACTATGTTACAAATATTGGTCAACCACTAATAGATTATACGAGTCTATTATCGATAATTTTGTTATATTTGTAATTTTTTAAAAATATTTATAAAAAAAATTAAAACAATGTAAAATGTGGGTAAAATTTAACATAACAGTATAAGAAGTATTAATAAGTATGAATTGGCATTCAATATAGTTTATAATATTATTAGAATAACACAAAAAAAATTTATACATTGAAATTGATATACATTATTGATCCTTCCATACTTTTATAACTAATTTACGTTTAGTTTCAATAAAGAAACGTTAATTCATATATAAAGTTGAAAGAAATATTATATATTTTGAATAGGAAGTGAGGTTACGTCCACGTTCCTAATGTTTATACAGAATGTAATAAGGTATGAATGAAAATTGTAGACATTCCTAATGTTTATACAGAATGTAATAAGGTATGATGAAAGTTGTAGACATTCCTAATGTTTATACAGAATGTAATAAGGTATGAATGAAAGTTGTAGACATTAGTTGGTTGGTTAAAGTGTTAGAGGCTAAACCCAATACTATTCATCACCATAACTAATAACTAATAATGTAATACAAAAATGGGTTAGAATTTAGCACAAGCAATTGGTAAAAACACTTTAATAAGATCATTATGATATTTTGTTGGATGTGGATGAGAGAAGTTACCAACCATGACCGTCCAATCATTCTGCTAACTCCAAATTATTAAAAATGTCAACTCCCTCCAATATTCAAACACATAGCTTCCTATAGCCATTTTACTTAGCTCCCATCAAAATTTTATTTGTGAGGGGGGATTACAAAATTTAAAAGAATTACTGGTTTATATGATATACAACAATATAATAATTATAATTCAAACAATTAAGAACCACTAAAAGTAGAAAAAAGTATTGGACACTTCGGGTGTCAACCCATATGCATCTACATATATATAACCATGCACATGCATTATGGTTATATATATAGATAGTCCATTTCTTACAAAGTCAGCCCATTAATTATAGCCAAAACTCTATAGTTTCTAGAAATAAATTAAACTCAAATAGTCAATTTAATCTCTAAATCCAAATAAGTCACACCTAAACTATATGACTTTCCTGTTTGATTTAATTCAATACACATATATATACCATATATGTATACATATATATACATACATGATTACATACAAAAACAAACACGTGGCATTTAGGACGGCACCAAATACGTTTGACTTAAAATAATATTTGGAAATTTCGGCTTTCCATTCACCTTCAAATCTGTATATAAGAACTTCATCTTCTTCGTACAATTCTCAGCACAAAAAACCAAACACTTTTTCCCTTTTTCTCTCTCTTTCTCTCTCTAAATTCAATTTTTCTTTTTCACTTTATCAAACTTTTAGTCTCTTCATTGAAAACAGTCATAAAAAACAAATGGGGAGAGCTCCATGCTGTGACAAAAATGGGCTTAAAAAAGGTCCATGGACTCCTGAGGAAGATCAGAAGCTTGTTAATTATATTCAAATTCATGGCCCTGGGAATTGGCGTAATCTTCCTAAGAATGCTGGTATGTTTCTGTTTTTCTTTTTTTTTTTTTTTTTCTTTTTATAATTATAATTATAATTAAATTATTTTTGTTTATTTATTTATAGGGCTTCAAAGATGTGGAAAAAGTTGTAGGCTTCGATGGACCAATTATTTGAGACCTGATATTAAGAGAGGCAGATTTTCTTTTGAAGAAGAAGAGACTATTATTCAATTGCACAGTGTTTTGGGAAACAAGTAAGATTATTTTTCTTTTCTTTTCTGGGTTTTTCTTTTTTGAAAAACAAAATCCAGAAAGTTTGTTGATTAAAGAATTAAATTCTTTTTTGTTTCTAGCTATGATTGGATTAACTGATTGGGAAGAATACTTAATGACTAATCCCAATTAATGATCTCTTGATATTATATCCCTCTGACGTTCATTTCTCATGGTAGACTACAAAATTAGCTTAACACATGAACCAATAATTATCAATATTAATGAACTAGACTTAAATATATCTTATTCATCTTTCTTAAGGTTTTTCTCGCTTAATGGGTAATAATTAAGACTGCAATTAGCCTAAAAGCTAAAGAAGATATATAAAGATTATGTGTTATTTAATTGATTATTAATGTTTTTGCTCGTTTCAGGTGGTCAGCAATAGCTGCTCGCTTACCTGGAAGAACAGATAACGAGATCAAGAATTATTGGAACACCCACATACGAAAAAGGCTCCTTCGAATGGGAATCGATCCGGTGACCCACGCACCTCGGATTGATCTTCTTGACCTATCGTCGATTCTCTCTGCCGCCATTCGAAGCCACTCACTCCTCAGCCTATCAACCTTATTGAACAACCACCAAACTACCGCAACCCTAAATCCTGAATCTCTTAGGCTAATTCCCACCCTTTTAGGCCTTAAACAACAAGATCCAAATGCCCATAATTTACTTCTCCAAGCTCAGGCTCAGGCTCAAATTCAAGCTCAAATGGATTCATTATCACAACTCTTACAACCCAACGATAATGTTAATAATACAAATTCTTCTTCAATAATGCCCATATCTTCTACCTTTGTTGACTGCCCAAATACTTCGCAAGAAAATCTTAATTTCTTACCTACACTTTTGAATTGTGGTGAGGATGTTTTGATGAACCAACCCAATTACATATACGGCGGCAATGGGTCGAATCCAACGGCTTCGGAAATTCTCGACATTTCAAATAACAATGCTCAAAATTTAGGGTTTGATTCTGTAAAATCAAGCCCTACGCCATTGAATTCTTCATCTACTTATTTAAACAACAGCAGCAGCAATGAGGATGAGAAAGATAGTTTTTGTAGTAACTTTTTGCAGTTTGAAATTCCTGAAGGTTTGGACTTTGCTGATTTTGTGTAA

mRNA sequence

ATGGCGGCTGTTAATTCCATTTCATTCTCTACACTAAACCAATGTTCCGATAGGAGATTGCTGCTTCCCTCCTCTCGTTCGCACTCCTCCAATTTCCACGGCTTTCCTTTTCGTACTAGCGTTTTCACTCATTATTCCCGAGTACGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACCACTGTGGCCGAGACAAAATTGAACTTCCTTAAGGCCTATAAACGGCCTATCCCTAGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTCCAAGCGTATATTAAGGCTTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTTGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATTGCAGAAAGAGCAGGGAGCAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTCCTTGAATTGGCAAATGCTACTGAGCCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGTAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCCTAAAGGAATATGTCGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGAAGTGAGGTTACGTCCACGTTCCTAATGTTTATACAGAATGTAATAAGTCATAAAAAACAAATGGGGAGAGCTCCATGCTGTGACAAAAATGGGCTTAAAAAAGGTCCATGGACTCCTGAGGAAGATCAGAAGCTTGTTAATTATATTCAAATTCATGGCCCTGGGAATTGGCGTAATCTTCCTAAGAATGCTGGGCTTCAAAGATGTGGAAAAAGTTGTAGGCTTCGATGGACCAATTATTTGAGACCTGATATTAAGAGAGGCAGATTTTCTTTTGAAGAAGAAGAGACTATTATTCAATTGCACAGTGTTTTGGGAAACAAGTGGTCAGCAATAGCTGCTCGCTTACCTGGAAGAACAGATAACGAGATCAAGAATTATTGGAACACCCACATACGAAAAAGGCTCCTTCGAATGGGAATCGATCCGGTGACCCACGCACCTCGGATTGATCTTCTTGACCTATCGTCGATTCTCTCTGCCGCCATTCGAAGCCACTCACTCCTCAGCCTATCAACCTTATTGAACAACCACCAAACTACCGCAACCCTAAATCCTGAATCTCTTAGGCTAATTCCCACCCTTTTAGGCCTTAAACAACAAGATCCAAATGCCCATAATTTACTTCTCCAAGCTCAGGCTCAGGCTCAAATTCAAGCTCAAATGGATTCATTATCACAACTCTTACAACCCAACGATAATGTTAATAATACAAATTCTTCTTCAATAATGCCCATATCTTCTACCTTTGTTGACTGCCCAAATACTTCGCAAGAAAATCTTAATTTCTTACCTACACTTTTGAATTGTGGTGAGGATGTTTTGATGAACCAACCCAATTACATATACGGCGGCAATGGGTCGAATCCAACGGCTTCGGAAATTCTCGACATTTCAAATAACAATGCTCAAAATTTAGGGTTTGATTCTGTAAAATCAAGCCCTACGCCATTGAATTCTTCATCTACTTATTTAAACAACAGCAGCAGCAATGAGGATGAGAAAGATAGTTTTTGTAGTAACTTTTTGCAGTTTGAAATTCCTGAAGGTTTGGACTTTGCTGATTTTGTGTAA

Coding sequence (CDS)

ATGGCGGCTGTTAATTCCATTTCATTCTCTACACTAAACCAATGTTCCGATAGGAGATTGCTGCTTCCCTCCTCTCGTTCGCACTCCTCCAATTTCCACGGCTTTCCTTTTCGTACTAGCGTTTTCACTCATTATTCCCGAGTACGAGCATCCACCTTCAGTTCTCGCATGGTCATTCATTGCATGTCCGCCGGAACAGATGTGACCACTGTGGCCGAGACAAAATTGAACTTCCTTAAGGCCTATAAACGGCCTATCCCTAGTATTTACAACACGGTTCTGCAAGAATTGATTGTTCAGCAGCATTTGATGAGGTATAAGAGGACATACCGTTATGATCCTGTTTTCGCTCTTGGATTTGTTACTGTATATGATCAGCTTATGGAAGGGTACCCTAGCGATGAGGATCGTGAAGCCATCTTCCAAGCGTATATTAAGGCTTTGAATGAGGATCCAGAGCAATATAGAATTGATGCTAAAAAATTTGAAGAGTGGGCTCGATCTCAGACTGCAGCTTCATTGGTTGAGTTTGCATCAAGAGAAGGAGAAGTTGAGAGTATTTTGAAGGACATTGCAGAAAGAGCAGGGAGCAAGGGGAATTTCAGTTACAGCCGATTTTTTGCTATTGGACTATTTCGTCTCCTTGAATTGGCAAATGCTACTGAGCCCAGTATCTTGGAAAAGCTCTGTGCCGCTTTAAATATCGACAAAAAAGGTGTAGACCGAGACCTTGATGTATACCGTAACCTGCTTTCAAAGTTGGTTCAGGCGAAAGAGCTCCTAAAGGAATATGTCGACAGAGAGAAGAAGAAAAGAGATGAGAGGGCTGGATCACAGACAGCTAATGAGGCCATAACTAAATGCTTGGGAGAATACAGCATGCAGACTGGAAGTGAGGTTACGTCCACGTTCCTAATGTTTATACAGAATGTAATAAGTCATAAAAAACAAATGGGGAGAGCTCCATGCTGTGACAAAAATGGGCTTAAAAAAGGTCCATGGACTCCTGAGGAAGATCAGAAGCTTGTTAATTATATTCAAATTCATGGCCCTGGGAATTGGCGTAATCTTCCTAAGAATGCTGGGCTTCAAAGATGTGGAAAAAGTTGTAGGCTTCGATGGACCAATTATTTGAGACCTGATATTAAGAGAGGCAGATTTTCTTTTGAAGAAGAAGAGACTATTATTCAATTGCACAGTGTTTTGGGAAACAAGTGGTCAGCAATAGCTGCTCGCTTACCTGGAAGAACAGATAACGAGATCAAGAATTATTGGAACACCCACATACGAAAAAGGCTCCTTCGAATGGGAATCGATCCGGTGACCCACGCACCTCGGATTGATCTTCTTGACCTATCGTCGATTCTCTCTGCCGCCATTCGAAGCCACTCACTCCTCAGCCTATCAACCTTATTGAACAACCACCAAACTACCGCAACCCTAAATCCTGAATCTCTTAGGCTAATTCCCACCCTTTTAGGCCTTAAACAACAAGATCCAAATGCCCATAATTTACTTCTCCAAGCTCAGGCTCAGGCTCAAATTCAAGCTCAAATGGATTCATTATCACAACTCTTACAACCCAACGATAATGTTAATAATACAAATTCTTCTTCAATAATGCCCATATCTTCTACCTTTGTTGACTGCCCAAATACTTCGCAAGAAAATCTTAATTTCTTACCTACACTTTTGAATTGTGGTGAGGATGTTTTGATGAACCAACCCAATTACATATACGGCGGCAATGGGTCGAATCCAACGGCTTCGGAAATTCTCGACATTTCAAATAACAATGCTCAAAATTTAGGGTTTGATTCTGTAAAATCAAGCCCTACGCCATTGAATTCTTCATCTACTTATTTAAACAACAGCAGCAGCAATGAGGATGAGAAAGATAGTTTTTGTAGTAACTTTTTGCAGTTTGAAATTCCTGAAGGTTTGGACTTTGCTGATTTTGTGTAA

Protein sequence

MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIHCMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGFVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASREGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGVDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGSEVTSTFLMFIQNVISHKKQMGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMGIDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNSSSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV*
Homology
BLAST of CsGy7G003850 vs. ExPASy Swiss-Prot
Match: Q7XAB8 (Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1 PE=2 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 5.1e-106
Identity = 201/293 (68.60%), Postives = 243/293 (82.94%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAAV S+SFS + Q ++R+  + SSRS  +    F FR++       VR+S  +SR V+H
Sbjct: 1   MAAVTSVSFSAITQSAERKSSVSSSRSIDT----FRFRSNFSFDSVNVRSSNSTSRFVVH 60

Query: 61  C-MSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C  S+  D+ TVA+TKL FL AYKRPIP++YNTVLQELIVQQHL RYK++Y+YDPVFALG
Sbjct: 61  CTSSSAADLPTVADTKLKFLTAYKRPIPTVYNTVLQELIVQQHLTRYKKSYQYDPVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFAS 180
           FVTVYDQLMEGYPS+EDR AIF+AYI+AL EDPEQYR DA+K EEWAR+Q A +LV+F+S
Sbjct: 121 FVTVYDQLMEGYPSEEDRNAIFKAYIEALKEDPEQYRADAQKLEEWARTQNANTLVDFSS 180

Query: 181 REGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKG 240
           +EGE+E+I KDIA+RAG+K  F YSR FA+GLFRLLELAN T+P+ILEKLCAALN++KK 
Sbjct: 181 KEGEIENIFKDIAQRAGTKDGFCYSRLFAVGLFRLLELANVTDPTILEKLCAALNVNKKS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEY 293
           VDRDLDVYRNLLSKLVQAKELLKEYV+REKKKR ER  +Q ANE +TKCLG+Y
Sbjct: 241 VDRDLDVYRNLLSKLVQAKELLKEYVEREKKKRGERE-TQKANETVTKCLGDY 288

BLAST of CsGy7G003850 vs. ExPASy Swiss-Prot
Match: Q9SKT0 (Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=THF1 PE=1 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 6.7e-106
Identity = 207/291 (71.13%), Postives = 247/291 (84.88%), Query Frame = 0

Query: 3   AVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFS-SRMVIHC 62
           A++S+SF  L Q SD+     SSR  +S          + T +SR+  ++ S S+ +IHC
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLASAIR-------ICTKFSRLSLNSRSTSKSLIHC 64

Query: 63  MSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 122
           MS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 65  MSNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGF 124

Query: 123 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 182
           VTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K EEWARSQT+ASLV+F+S+
Sbjct: 125 VTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSK 184

Query: 183 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 242
           EG++E++LKDIA RAGSK  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK V
Sbjct: 185 EGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSV 244

Query: 243 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE 292
           DRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Sbjct: 245 DRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGD 287

BLAST of CsGy7G003850 vs. ExPASy Swiss-Prot
Match: Q84PB7 (Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=THF1 PE=2 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 2.3e-98
Identity = 195/288 (67.71%), Postives = 236/288 (81.94%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAA++S+ F+ L + +D R   PS+ + ++         SV     R R     SR V+ 
Sbjct: 1   MAAISSLPFAALRRAADCR---PSTAAAAAGAGAGAVVLSV-----RPRR---GSRSVVR 60

Query: 61  CMSAGTDV-TTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALG 120
           C++   DV  TVAETK+NFLK+YKRPI SIY+TVLQEL+VQQHLMRYK TY+YD VFALG
Sbjct: 61  CVATAGDVPPTVAETKMNFLKSYKRPILSIYSTVLQELLVQQHLMRYKTTYQYDAVFALG 120

Query: 121 FVTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFAS 180
           FVTVYDQLMEGYPS+EDR+AIF+AYI ALNEDPEQYR DA+K EEWARSQ   SLVEF+S
Sbjct: 121 FVTVYDQLMEGYPSNEDRDAIFKAYITALNEDPEQYRADAQKMEEWARSQNGNSLVEFSS 180

Query: 181 REGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKG 240
           ++GE+E+ILKDI+ERA  KG+FSYSRFFA+GLFRLLELANATEP+IL+KLCAALNI+K+ 
Sbjct: 181 KDGEIEAILKDISERAQGKGSFSYSRFFAVGLFRLLELANATEPTILDKLCAALNINKRS 240

Query: 241 VDRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITK 288
           VDRDLDVYRN+LSKLVQAKELLKEYV+REKKKR+ER+ +  +NEA+TK
Sbjct: 241 VDRDLDVYRNILSKLVQAKELLKEYVEREKKKREERSETPKSNEAVTK 277

BLAST of CsGy7G003850 vs. ExPASy Swiss-Prot
Match: Q9LDR8 (Transcription factor MYB102 OS=Arabidopsis thaliana OX=3702 GN=MYB102 PE=2 SV=1)

HSP 1 Score: 313.2 bits (801), Expect = 7.2e-84
Identity = 183/354 (51.69%), Postives = 238/354 (67.23%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           M R+PCC+KNGLKKGPWT EEDQKLV+YIQ HG GNWR LPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MARSPCCEKNGLKKGPWTSEEDQKLVDYIQKHGYGNWRTLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHS LGNKWSAIAARLPGRTDNEIKN+WNTHIRK+LLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSFLGNKWSAIAARLPGRTDNEIKNFWNTHIRKKLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAI---RSH----SLLSLSTLLNNHQTTATLNPESLRLIP 497
           IDPVTH+PR+DLLD+SSIL++++    SH    S L + T   +HQ    +NPE L+L  
Sbjct: 121 IDPVTHSPRLDLLDISSILASSLYNSSSHHMNMSRLMMDTNRRHHQQHPLVNPEILKLAT 180

Query: 498 TLLGLKQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDC 557
           +L    Q      NL++   ++ Q +  + S + + Q   N    N+ +   + S+    
Sbjct: 181 SLFSQNQN----QNLVVDHDSRTQEKQTVYSQTGVNQYQTNQYFENTIT-QELQSSMPPF 240

Query: 558 PNTSQENLNFLPTLLNCGEDVLMNQPNY----IYGGNGSNPTASE-ILDISNN----NAQ 617
           PN +++  N        GE  L++         Y  + ++ ++S  +LD S +    N  
Sbjct: 241 PNEARQFNNMDHHFNGFGEQNLVSTSTTSVQDCYNPSFNDYSSSNFVLDPSYSDQSFNFA 300

Query: 618 NLGFDSVKSSPTPLNSSSTYLNNSS-SNEDEKDSFCSNFLQFEIPEGLDFADFV 655
           N   ++  SSP+P   +S+Y+N+SS S EDE +S+CSN ++F+IP+ LD   F+
Sbjct: 301 NSVLNTPSSSPSPTTLNSSYINSSSCSTEDEIESYCSNLMKFDIPDFLDVNGFI 349

BLAST of CsGy7G003850 vs. ExPASy Swiss-Prot
Match: Q9M0Y5 (Transcription factor MYB74 OS=Arabidopsis thaliana OX=3702 GN=MYB74 PE=2 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 4.1e-79
Identity = 184/356 (51.69%), Postives = 229/356 (64.33%), Query Frame = 0

Query: 318 MGRAPCCD-KNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTN 377
           MGR+PCC+ KNGLKKGPWTPEEDQKL++YI IHG GNWR LPKNAGLQRCGKSCRLRWTN
Sbjct: 1   MGRSPCCEKKNGLKKGPWTPEEDQKLIDYINIHGYGNWRTLPKNAGLQRCGKSCRLRWTN 60

Query: 378 YLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRM 437
           YLRPDIKRGRFSFEEEETIIQLHS++GNKWSAIAARLPGRTDNEIKNYWNTHIRKRLL+M
Sbjct: 61  YLRPDIKRGRFSFEEEETIIQLHSIMGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLKM 120

Query: 438 GIDPVTHAPRIDLLDLSSILSAAIRSHS---------LLSLSTLL---NNHQTTATLNPE 497
           GIDPVTH PR+DLLD+SSILS++I + S          +++S L+    NHQ    +NPE
Sbjct: 121 GIDPVTHTPRLDLLDISSILSSSIYNSSHHHHHHHQQHMNMSRLMMSDGNHQ--PLVNPE 180

Query: 498 SLRLIPTLLGLKQQDPNAH--NLLLQAQA-QAQIQAQMDSLSQLLQPNDNVNN-TNSSSI 557
            L+L  +L   +    N H  N + Q +  Q Q    M    +L      ++  TN   +
Sbjct: 181 ILKLATSLFSNQNHPNNTHENNTVNQTEVNQYQTGYNMPGNEELQSWFPIMDQFTNFQDL 240

Query: 558 MPISSTFVDCPNTSQENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNA 617
           MP+ +T        Q +L++     +C +   + +P Y           S+   +     
Sbjct: 241 MPMKTTV-------QNSLSYDD---DCSKSNFVLEPYY-----------SDFASV----- 300

Query: 618 QNLGFDSVKSSPTPLN-SSSTYLNNSS-SNEDEKDSFCSN-----------FLQFE 644
                 +  SSPTPLN SSSTY+N+S+ S EDEK+S+ S+           FLQF+
Sbjct: 301 ----LTTPSSSPTPLNSSSSTYINSSTCSTEDEKESYYSDNITNYSFDVNGFLQFQ 324

BLAST of CsGy7G003850 vs. NCBI nr
Match: KAE8645760.1 (hypothetical protein Csa_020345 [Cucumis sativus])

HSP 1 Score: 1286 bits (3327), Expect = 0.0
Identity = 653/654 (99.85%), Postives = 654/654 (100.00%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGSEV 300
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGSE+
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTGSEI 300

Query: 301 TSTFLMFIQNVISHKKQMGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKN 360
           TSTFLMFIQNVISHKKQMGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKN
Sbjct: 301 TSTFLMFIQNVISHKKQMGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKN 360

Query: 361 AGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNE 420
           AGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNE
Sbjct: 361 AGLQRCGKSCRLRWTNYLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNE 420

Query: 421 IKNYWNTHIRKRLLRMGIDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTAT 480
           IKNYWNTHIRKRLLRMGIDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTAT
Sbjct: 421 IKNYWNTHIRKRLLRMGIDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTAT 480

Query: 481 LNPESLRLIPTLLGLKQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSI 540
           LNPESLRLIPTLLGLKQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSI
Sbjct: 481 LNPESLRLIPTLLGLKQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSI 540

Query: 541 MPISSTFVDCPNTSQENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNA 600
           MPISSTFVDCPNTSQENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNA
Sbjct: 541 MPISSTFVDCPNTSQENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNA 600

Query: 601 QNLGFDSVKSSPTPLNSSSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654
           QNLGFDSVKSSPTPLNSSSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV
Sbjct: 601 QNLGFDSVKSSPTPLNSSSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654

BLAST of CsGy7G003850 vs. NCBI nr
Match: XP_004137011.1 (transcription factor MYB41 [Cucumis sativus])

HSP 1 Score: 677 bits (1746), Expect = 3.56e-240
Identity = 337/337 (100.00%), Postives = 337/337 (100.00%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 497
           IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ
Sbjct: 121 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 180

Query: 498 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 557
           QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN
Sbjct: 181 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 240

Query: 558 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 617
           LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS
Sbjct: 241 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 300

Query: 618 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654
           SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV
Sbjct: 301 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 337

BLAST of CsGy7G003850 vs. NCBI nr
Match: XP_008455362.1 (PREDICTED: transcription factor MYB39 [Cucumis melo])

HSP 1 Score: 642 bits (1657), Expect = 1.18e-226
Identity = 320/337 (94.96%), Postives = 327/337 (97.03%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MGRAPCCDK+GLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MGRAPCCDKSGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 497
           IDPVTHAPRIDLLDLSS+LSAAI+SH LL LSTLLNNHQTT TLNPESLRLI TLL LKQ
Sbjct: 121 IDPVTHAPRIDLLDLSSMLSAAIQSHPLLGLSTLLNNHQTTTTLNPESLRLISTLLSLKQ 180

Query: 498 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 557
           +D NAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSS+MPISSTFVDC NTSQEN
Sbjct: 181 EDQNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSVMPISSTFVDCSNTSQEN 240

Query: 558 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 617
           LNFLPT LNCGEDVLMNQ NY+YGG+GSNPTASEILDISNNNAQ LGFDSVKSSPTPLNS
Sbjct: 241 LNFLPTNLNCGEDVLMNQHNYMYGGDGSNPTASEILDISNNNAQTLGFDSVKSSPTPLNS 300

Query: 618 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654
           SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV
Sbjct: 301 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 337

BLAST of CsGy7G003850 vs. NCBI nr
Match: XP_038888637.1 (transcription factor MYB41-like [Benincasa hispida])

HSP 1 Score: 587 bits (1513), Expect = 7.60e-205
Identity = 297/337 (88.13%), Postives = 314/337 (93.18%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MGRAPCCDKNGLKKGPWTPEEDQKLV +IQ+HGPGNWRNLPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MGRAPCCDKNGLKKGPWTPEEDQKLVTFIQVHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 497
           IDPVTH PRIDLLDLSSILSAAIRSH LLSLSTLLNNHQTT TLN ESLRLI TLLGLKQ
Sbjct: 121 IDPVTHTPRIDLLDLSSILSAAIRSHPLLSLSTLLNNHQTT-TLNSESLRLISTLLGLKQ 180

Query: 498 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 557
           +DPN +N LLQAQ QAQ+QAQMDSLSQLLQP D++NNTN SS +PIS TFVD PN+SQEN
Sbjct: 181 EDPNTNNFLLQAQVQAQVQAQMDSLSQLLQPIDDINNTNKSSSIPIS-TFVDGPNSSQEN 240

Query: 558 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 617
           LNFLP+ LN GED+LMNQ NY+YG +GSNPTAS   +ISNNN QNLGFDSVKSSPT LNS
Sbjct: 241 LNFLPSNLN-GEDILMNQANYLYGDDGSNPTASNFPEISNNNVQNLGFDSVKSSPTQLNS 300

Query: 618 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654
           SSTY+N+SSSNEDEKDSFCSNFLQFEIPEGLDFADF+
Sbjct: 301 SSTYINSSSSNEDEKDSFCSNFLQFEIPEGLDFADFM 334

BLAST of CsGy7G003850 vs. NCBI nr
Match: XP_004136805.1 (protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus])

HSP 1 Score: 576 bits (1484), Expect = 5.03e-201
Identity = 297/297 (100.00%), Postives = 297/297 (100.00%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of CsGy7G003850 vs. ExPASy TrEMBL
Match: A0A0A0K293 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G046120 PE=4 SV=1)

HSP 1 Score: 677 bits (1746), Expect = 1.72e-240
Identity = 337/337 (100.00%), Postives = 337/337 (100.00%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 497
           IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ
Sbjct: 121 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 180

Query: 498 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 557
           QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN
Sbjct: 181 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 240

Query: 558 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 617
           LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS
Sbjct: 241 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 300

Query: 618 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654
           SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV
Sbjct: 301 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 337

BLAST of CsGy7G003850 vs. ExPASy TrEMBL
Match: A0A1S3C0Q7 (transcription factor MYB39 OS=Cucumis melo OX=3656 GN=LOC103495548 PE=4 SV=1)

HSP 1 Score: 642 bits (1657), Expect = 5.72e-227
Identity = 320/337 (94.96%), Postives = 327/337 (97.03%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MGRAPCCDK+GLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MGRAPCCDKSGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 497
           IDPVTHAPRIDLLDLSS+LSAAI+SH LL LSTLLNNHQTT TLNPESLRLI TLL LKQ
Sbjct: 121 IDPVTHAPRIDLLDLSSMLSAAIQSHPLLGLSTLLNNHQTTTTLNPESLRLISTLLSLKQ 180

Query: 498 QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQEN 557
           +D NAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSS+MPISSTFVDC NTSQEN
Sbjct: 181 EDQNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSVMPISSTFVDCSNTSQEN 240

Query: 558 LNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPLNS 617
           LNFLPT LNCGEDVLMNQ NY+YGG+GSNPTASEILDISNNNAQ LGFDSVKSSPTPLNS
Sbjct: 241 LNFLPTNLNCGEDVLMNQHNYMYGGDGSNPTASEILDISNNNAQTLGFDSVKSSPTPLNS 300

Query: 618 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 654
           SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV
Sbjct: 301 SSTYLNNSSSNEDEKDSFCSNFLQFEIPEGLDFADFV 337

BLAST of CsGy7G003850 vs. ExPASy TrEMBL
Match: A0A0A0K3P0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G046130 PE=3 SV=1)

HSP 1 Score: 576 bits (1484), Expect = 2.44e-201
Identity = 297/297 (100.00%), Postives = 297/297 (100.00%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of CsGy7G003850 vs. ExPASy TrEMBL
Match: A0A5D3C7D3 (Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G003760 PE=3 SV=1)

HSP 1 Score: 552 bits (1422), Expect = 6.16e-192
Identity = 286/297 (96.30%), Postives = 290/297 (97.64%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of CsGy7G003850 vs. ExPASy TrEMBL
Match: A0A1S3C0V5 (protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495547 PE=3 SV=1)

HSP 1 Score: 552 bits (1422), Expect = 6.16e-192
Identity = 286/297 (96.30%), Postives = 290/297 (97.64%), Query Frame = 0

Query: 1   MAAVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFSSRMVIH 60
           MAAVNSISFSTLNQCSDRR  +PSSRS SSNF GF FRTS+FTHYSRVR STFSSRMVIH
Sbjct: 1   MAAVNSISFSTLNQCSDRRFPVPSSRSLSSNFDGFRFRTSLFTHYSRVRPSTFSSRMVIH 60

Query: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120
           CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF
Sbjct: 61  CMSAGTDVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 120

Query: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 180
           VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDA+K EEWARSQTAASLVEFASR
Sbjct: 121 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAQKLEEWARSQTAASLVEFASR 180

Query: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240
           EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV
Sbjct: 181 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 240

Query: 241 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297
           DRDLDVYRNLLSKLVQAKELLKEY+DREKKKRDERAGSQTANEAITKCLGEYSMQTG
Sbjct: 241 DRDLDVYRNLLSKLVQAKELLKEYIDREKKKRDERAGSQTANEAITKCLGEYSMQTG 297

BLAST of CsGy7G003850 vs. TAIR 10
Match: AT2G20890.1 (photosystem II reaction center PSB29 protein )

HSP 1 Score: 386.3 bits (991), Expect = 4.8e-107
Identity = 207/291 (71.13%), Postives = 247/291 (84.88%), Query Frame = 0

Query: 3   AVNSISFSTLNQCSDRRLLLPSSRSHSSNFHGFPFRTSVFTHYSRVRASTFS-SRMVIHC 62
           A++S+SF  L Q SD+     SSR  +S          + T +SR+  ++ S S+ +IHC
Sbjct: 5   AISSLSFPALGQ-SDKISNFASSRPLASAIR-------ICTKFSRLSLNSRSTSKSLIHC 64

Query: 63  MSAGT-DVTTVAETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMRYKRTYRYDPVFALGF 122
           MS  T DV  V+ETK  FLKAYKRPIPSIYNTVLQELIVQQHLMRYK+TYRYDPVFALGF
Sbjct: 65  MSNVTADVPPVSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGF 124

Query: 123 VTVYDQLMEGYPSDEDREAIFQAYIKALNEDPEQYRIDAKKFEEWARSQTAASLVEFASR 182
           VTVYDQLMEGYPSD+DR+AIF+AYI+ALNEDP+QYRIDA+K EEWARSQT+ASLV+F+S+
Sbjct: 125 VTVYDQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSK 184

Query: 183 EGEVESILKDIAERAGSKGNFSYSRFFAIGLFRLLELANATEPSILEKLCAALNIDKKGV 242
           EG++E++LKDIA RAGSK  FSYSRFFA+GLFRLLELA+AT+P++L+KLCA+LNI+KK V
Sbjct: 185 EGDIEAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSV 244

Query: 243 DRDLDVYRNLLSKLVQAKELLKEYVDREKKKRDERAGSQTANEAITKCLGE 292
           DRDLDVYRNLLSKLVQAKELLKEYV+REKKK+ ERA SQ ANE I+KCLG+
Sbjct: 245 DRDLDVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGD 287

BLAST of CsGy7G003850 vs. TAIR 10
Match: AT4G21440.1 (MYB-like 102 )

HSP 1 Score: 313.2 bits (801), Expect = 5.1e-85
Identity = 183/354 (51.69%), Postives = 238/354 (67.23%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           M R+PCC+KNGLKKGPWT EEDQKLV+YIQ HG GNWR LPKNAGLQRCGKSCRLRWTNY
Sbjct: 1   MARSPCCEKNGLKKGPWTSEEDQKLVDYIQKHGYGNWRTLPKNAGLQRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHS LGNKWSAIAARLPGRTDNEIKN+WNTHIRK+LLRMG
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSFLGNKWSAIAARLPGRTDNEIKNFWNTHIRKKLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAI---RSH----SLLSLSTLLNNHQTTATLNPESLRLIP 497
           IDPVTH+PR+DLLD+SSIL++++    SH    S L + T   +HQ    +NPE L+L  
Sbjct: 121 IDPVTHSPRLDLLDISSILASSLYNSSSHHMNMSRLMMDTNRRHHQQHPLVNPEILKLAT 180

Query: 498 TLLGLKQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDC 557
           +L    Q      NL++   ++ Q +  + S + + Q   N    N+ +   + S+    
Sbjct: 181 SLFSQNQN----QNLVVDHDSRTQEKQTVYSQTGVNQYQTNQYFENTIT-QELQSSMPPF 240

Query: 558 PNTSQENLNFLPTLLNCGEDVLMNQPNY----IYGGNGSNPTASE-ILDISNN----NAQ 617
           PN +++  N        GE  L++         Y  + ++ ++S  +LD S +    N  
Sbjct: 241 PNEARQFNNMDHHFNGFGEQNLVSTSTTSVQDCYNPSFNDYSSSNFVLDPSYSDQSFNFA 300

Query: 618 NLGFDSVKSSPTPLNSSSTYLNNSS-SNEDEKDSFCSNFLQFEIPEGLDFADFV 655
           N   ++  SSP+P   +S+Y+N+SS S EDE +S+CSN ++F+IP+ LD   F+
Sbjct: 301 NSVLNTPSSSPSPTTLNSSYINSSSCSTEDEIESYCSNLMKFDIPDFLDVNGFI 349

BLAST of CsGy7G003850 vs. TAIR 10
Match: AT4G05100.1 (myb domain protein 74 )

HSP 1 Score: 297.4 bits (760), Expect = 2.9e-80
Identity = 184/356 (51.69%), Postives = 229/356 (64.33%), Query Frame = 0

Query: 318 MGRAPCCD-KNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTN 377
           MGR+PCC+ KNGLKKGPWTPEEDQKL++YI IHG GNWR LPKNAGLQRCGKSCRLRWTN
Sbjct: 1   MGRSPCCEKKNGLKKGPWTPEEDQKLIDYINIHGYGNWRTLPKNAGLQRCGKSCRLRWTN 60

Query: 378 YLRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRM 437
           YLRPDIKRGRFSFEEEETIIQLHS++GNKWSAIAARLPGRTDNEIKNYWNTHIRKRLL+M
Sbjct: 61  YLRPDIKRGRFSFEEEETIIQLHSIMGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLKM 120

Query: 438 GIDPVTHAPRIDLLDLSSILSAAIRSHS---------LLSLSTLL---NNHQTTATLNPE 497
           GIDPVTH PR+DLLD+SSILS++I + S          +++S L+    NHQ    +NPE
Sbjct: 121 GIDPVTHTPRLDLLDISSILSSSIYNSSHHHHHHHQQHMNMSRLMMSDGNHQ--PLVNPE 180

Query: 498 SLRLIPTLLGLKQQDPNAH--NLLLQAQA-QAQIQAQMDSLSQLLQPNDNVNN-TNSSSI 557
            L+L  +L   +    N H  N + Q +  Q Q    M    +L      ++  TN   +
Sbjct: 181 ILKLATSLFSNQNHPNNTHENNTVNQTEVNQYQTGYNMPGNEELQSWFPIMDQFTNFQDL 240

Query: 558 MPISSTFVDCPNTSQENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNA 617
           MP+ +T        Q +L++     +C +   + +P Y           S+   +     
Sbjct: 241 MPMKTTV-------QNSLSYDD---DCSKSNFVLEPYY-----------SDFASV----- 300

Query: 618 QNLGFDSVKSSPTPLN-SSSTYLNNSS-SNEDEKDSFCSN-----------FLQFE 644
                 +  SSPTPLN SSSTY+N+S+ S EDEK+S+ S+           FLQF+
Sbjct: 301 ----LTTPSSSPTPLNSSSSTYINSSTCSTEDEKESYYSDNITNYSFDVNGFLQFQ 324

BLAST of CsGy7G003850 vs. TAIR 10
Match: AT4G28110.1 (myb domain protein 41 )

HSP 1 Score: 288.1 bits (736), Expect = 1.8e-77
Identity = 147/244 (60.25%), Postives = 176/244 (72.13%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MGR+PCCDKNG+KKGPWT EEDQKL++YI+ HGPGNWR LPKNAGL RCGKSCRLRWTNY
Sbjct: 1   MGRSPCCDKNGVKKGPWTAEEDQKLIDYIRFHGPGNWRTLPKNAGLHRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRGRFSFEEEETIIQLHSV+GNKWSAIAARLPGRTDNEIKN+WNTHIRKRL+R G
Sbjct: 61  LRPDIKRGRFSFEEEETIIQLHSVMGNKWSAIAARLPGRTDNEIKNHWNTHIRKRLVRSG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNNHQTTATLNPESLRLIPTLLGLKQ 497
           IDPVTH+PR+DLLDLSS+LSA     +  +++T       ++ LNP+ LRL   LL L+ 
Sbjct: 121 IDPVTHSPRLDLLDLSSLLSALFNQPNFSAVAT-----HASSLLNPDVLRLASLLLPLQN 180

Query: 498 -------------QDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIM-PI 548
                        Q PN  +   Q QA+          S L   N  +++   + ++ P+
Sbjct: 181 PNPVYPSNLDQNLQTPNTSSESSQPQAETSTVPTNYETSSLEPMNARLDDVGLADVLPPL 239

BLAST of CsGy7G003850 vs. TAIR 10
Match: AT5G54230.1 (myb domain protein 49 )

HSP 1 Score: 258.5 bits (659), Expect = 1.5e-68
Identity = 154/327 (47.09%), Postives = 199/327 (60.86%), Query Frame = 0

Query: 318 MGRAPCCDKNGLKKGPWTPEEDQKLVNYIQIHGPGNWRNLPKNAGLQRCGKSCRLRWTNY 377
           MG++   +++ +KKGPWTPEED+KLV YIQ HGPG WR LPKNAGL+RCGKSCRLRWTNY
Sbjct: 1   MGKSSSSEESEVKKGPWTPEEDEKLVGYIQTHGPGKWRTLPKNAGLKRCGKSCRLRWTNY 60

Query: 378 LRPDIKRGRFSFEEEETIIQLHSVLGNKWSAIAARLPGRTDNEIKNYWNTHIRKRLLRMG 437
           LRPDIKRG FS +EEETIIQLH +LGNKWSAIA  LPGRTDNEIKNYWNTHI+K+LLRMG
Sbjct: 61  LRPDIKRGEFSLQEEETIIQLHRLLGNKWSAIAIHLPGRTDNEIKNYWNTHIKKKLLRMG 120

Query: 438 IDPVTHAPRIDLLDLSSILSAAIRSHSLLSLSTLLNN--HQTTATLNPESLRLIPTLLGL 497
           IDPVTH PRI+LL LSS L++++      S+S  +N     TT+ +NP+ L  +   L  
Sbjct: 121 IDPVTHCPRINLLQLSSFLTSSL----FKSMSQPMNTPFDLTTSNINPDILNHLTASLNN 180

Query: 498 KQQDPNAHNLLLQAQAQAQIQAQMDSLSQLLQPNDNVNNTNSSSIMPISSTFVDCPNTSQ 557
            Q +    N     Q Q  +     + + LL     V   N+   +     +     T  
Sbjct: 181 VQTESYQPN----QQLQNDLNTDQTTFTGLLNSTPPVQWQNNGEYL---GDYHSYTGTGD 240

Query: 558 ENLNFLPTLLNCGEDVLMNQPNYIYGGNGSNPTASEILDISNNNAQNLGFDSVKSSPTPL 617
            + N +P   N      ++     +  +G N  A        N + ++   +  SS TPL
Sbjct: 241 PSNNKVPQAGNYSSAAFVSD----HINDGENFKAGW------NFSSSMLAGTSSSSSTPL 300

Query: 618 NSSSTYLNNSSSNEDEKDSFCSNFLQF 643
           NSSST+  N  S ED+++SF S+ L F
Sbjct: 301 NSSSTFYVNGGS-EDDRESFGSDMLMF 305

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q7XAB85.1e-10668.60Protein THYLAKOID FORMATION1, chloroplastic OS=Solanum tuberosum OX=4113 GN=THF1... [more]
Q9SKT06.7e-10671.13Protein THYLAKOID FORMATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q84PB72.3e-9867.71Protein THYLAKOID FORMATION1, chloroplastic OS=Oryza sativa subsp. japonica OX=3... [more]
Q9LDR87.2e-8451.69Transcription factor MYB102 OS=Arabidopsis thaliana OX=3702 GN=MYB102 PE=2 SV=1[more]
Q9M0Y54.1e-7951.69Transcription factor MYB74 OS=Arabidopsis thaliana OX=3702 GN=MYB74 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
KAE8645760.10.099.85hypothetical protein Csa_020345 [Cucumis sativus][more]
XP_004137011.13.56e-240100.00transcription factor MYB41 [Cucumis sativus][more]
XP_008455362.11.18e-22694.96PREDICTED: transcription factor MYB39 [Cucumis melo][more]
XP_038888637.17.60e-20588.13transcription factor MYB41-like [Benincasa hispida][more]
XP_004136805.15.03e-201100.00protein THYLAKOID FORMATION1, chloroplastic [Cucumis sativus][more]
Match NameE-valueIdentityDescription
A0A0A0K2931.72e-240100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G046120 PE=4 SV=1[more]
A0A1S3C0Q75.72e-22794.96transcription factor MYB39 OS=Cucumis melo OX=3656 GN=LOC103495548 PE=4 SV=1[more]
A0A0A0K3P02.44e-201100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G046130 PE=3 SV=1[more]
A0A5D3C7D36.16e-19296.30Protein THYLAKOID FORMATION1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca... [more]
A0A1S3C0V56.16e-19296.30protein THYLAKOID FORMATION1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103495... [more]
Match NameE-valueIdentityDescription
AT2G20890.14.8e-10771.13photosystem II reaction center PSB29 protein [more]
AT4G21440.15.1e-8551.69MYB-like 102 [more]
AT4G05100.12.9e-8051.69myb domain protein 74 [more]
AT4G28110.11.8e-7760.25myb domain protein 41 [more]
AT5G54230.11.5e-6847.09myb domain protein 49 [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 247..274
NoneNo IPR availableGENE3D1.10.10.60coord: 330..383
e-value: 2.1E-20
score: 74.4
NoneNo IPR availableGENE3D1.10.10.60coord: 386..433
e-value: 2.6E-20
score: 74.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 604..630
NoneNo IPR availablePANTHERPTHR34793:SF7PHOTOSYSTEM II BIOGENESIS PROTEINcoord: 1..286
IPR001005SANT/Myb domainSMARTSM00717santcoord: 383..431
e-value: 4.8E-16
score: 69.3
coord: 330..380
e-value: 7.1E-15
score: 65.4
IPR001005SANT/Myb domainPROSITEPS50090MYB_LIKEcoord: 379..429
score: 10.615142
IPR001005SANT/Myb domainPROSITEPS50090MYB_LIKEcoord: 326..378
score: 11.753157
IPR001005SANT/Myb domainCDDcd00167SANTcoord: 333..378
e-value: 3.8609E-13
score: 61.8226
IPR001005SANT/Myb domainCDDcd00167SANTcoord: 386..429
e-value: 3.91133E-12
score: 59.1262
IPR017499Protein Thf1PFAMPF11264ThylakoidFormatcoord: 70..275
e-value: 2.6E-74
score: 249.5
IPR017499Protein Thf1TIGRFAMTIGR03060TIGR03060coord: 67..272
e-value: 9.1E-47
score: 157.5
IPR017499Protein Thf1PANTHERPTHR34793PROTEIN THYLAKOID FORMATION 1, CHLOROPLASTICcoord: 1..286
IPR017499Protein Thf1HAMAPMF_01843Thf1coord: 65..273
score: 22.235203
IPR017930Myb domainPFAMPF00249Myb_DNA-bindingcoord: 384..428
e-value: 7.4E-14
score: 51.7
coord: 331..378
e-value: 8.4E-15
score: 54.7
IPR017930Myb domainPROSITEPS51294HTH_MYBcoord: 383..433
score: 22.084002
IPR017930Myb domainPROSITEPS51294HTH_MYBcoord: 326..382
score: 26.26757
IPR009057Homeobox-like domain superfamilySUPERFAMILY46689Homeodomain-likecoord: 328..425

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy7G003850.2CsGy7G003850.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010207 photosystem II assembly
biological_process GO:0045037 protein import into chloroplast stroma
biological_process GO:0045038 protein import into chloroplast thylakoid membrane
biological_process GO:0010027 thylakoid membrane organization
biological_process GO:0015979 photosynthesis