CsGy5G009580 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy5G009580
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Thiamine-phosphate pyrophosphorylase
Location: Gy14Chr5: 8129866 .. 8149383 (-)
RNA-Seq Expression: CsGy5G009580
Synteny: CsGy5G009580
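
The location field can be split into its parts and used to work out the length of the gene span. The sketch below is illustrative only: the names `Location`, `parse_location`, and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends (as in GFF-style annotation), which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """One genomic interval (hypothetical helper type, not part of the database)."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Gy14Chr5: 8129866 .. 8149383 (-)'."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Gy14Chr5: 8129866 .. 8149383 (-)")
print(loc.seqid, loc.strand, span_length(loc))  # Gy14Chr5 - 19518
```

Under that assumption the gene spans 19,518 bp on the minus strand of Gy14Chr5, so the sequences listed for this feature would read as the reverse complement of the reference chromosome over that interval.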
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GACTTCAAAACCCTATTTCATTTTCAGCCACATTGCACTCCACCGACTCAGCGCCGCCGCCACACACTTCATTCTCAGCCGTCCGACCAAATTTTCGCCGCCTTCTCCCTCTCCCACTTTCTCGCCTCCTTGAGCCATTACAGCTACATATCTCTGTTTTATTTCTCAGAAACGCAGTTGTTCAACCCCTAGCCGGCGCCGTTGGTGAGATGGTGCCGCTGCCCCTCATTTCTCAGATCCCTAAGGTCTTATTTTTTATATTGTTTCCTTGGATAAGAAAATTTAGAACAGCCCTGTAGTTTTGTTGGCTATAAAAAGACTTGTTGGAATTATAAACTATAAAATATTCAAATTAAATTTATCAATTCGATAAACCCACCAAAATACAATGCACAGGCTAAATTGAAAACTTAACTTTGACTCTAGTTTGATTGAATAGCACTGAGTCTCGTTGAATATAGAAGAAACTTCATGGCAATTGGAGGCGGCTAGGCAGATATTATTATATTTGTAGAATTGGAAACATAATTGTTCAAGTTTACGAAGCCATCGTGGGTGTTTTATCGGTAAAAAGGAAACATACTCTTAATAACTCACCAATAGATCCTGACACTTACTTGAAAATTAATTTTTTAGGTTCAAATGTTATAGAACACGCTGTTCACTCACGAATATTCATAAAAGAAGATTTAAAATCCTCTCTTATCAGACCTATGTAAACCAATTTATTTTATTTTATTTAAACAAAAATAAGATTTCAAGATTCCTACCCCCATTTTTATGCATCCTAAATTAATTTGAAGTTCTTTCCACATTAAGGGATTCCAAGCAGAGGGTTAGCTTTATCTATTATGAAAATATATTTTATAGTTTATAGGTTTTCCTCTCTGTGCCTTAATCTTTACTGTTGTCTATTTTAGTTCAATCAAGTTTCTAGATTTTGTATGGCCATGAAGAAGCAGGATGAAATGGTTGTAGCTTCAAGTAATCGAAATGAGACGTGTATTCCCCATGTGTTGAGTGTTGCTGGTTCTGATTCGGGAGCAGGGGCTGGAATCCAAGCAGATCTTAAGACTTGTGCTGCTCGTGGAGTGTATTGCTCCACTGTGATAACTGCTATTACTGCACAGAATACTGTGGGGGTTCAGGTCATTATCACACACTCATTATTATTATTATCTATATTTATGTATTGTTTTTTTTTGTTTTGGGTTGAAAGAATAATGGGAAAGACTGGTGTATGAATAATCAAACGTTCACTTGCTATTTGTCCGACTGTTTTGGTATATTGGAGTGTAGAAAATGGTTTTACATGCTGCAATGGGCAAATTCTAATGTTTTTTTTGAGAGTTTGGGATAATTTTTGAAATCATGCTTTTAAGTGAAAGCTAATACTTCGTTAGCACTTTATTAAATGCTGTCTAGGGTGTGTTTAGGGTTGCTTTTGGCATGGTGAAAAGCACCTTTCTCATGGCTAAAGTACTAGTGATATGATCAAAGAGTTCTTCCTCCATCCATTTGGTGAGAAAAGCTAATTCTTGTGCCTTGCCGAAGTGTGCACTGTTTTGTGAGGTCTTTGATATGTGTGGAATAATAGTGTTTAGAGGGTTGGAAAGGGATCCTAATGATGTTTTGTTTTTTGTTGGATCTCACGTTTCCGTTGGGTGCGTTCAGTTTCGAAGATTTTTTGTAATTATTAGTATTTTCAGGTGACCCAAGTTGCATAATTGCAATGTTATTTACTACAGTATTTAAGGGTTATATTATTATTTAGTTAGACAAGTTGTTAGGATACTTGGTTATAAATTGAGATAGAGAGAGGGAGTAGTAAGACAATACTGTGGTGGTTGAGTTAGGGTTTGAGAGAGATCTCAAGAGGGGAGAGTCTAAGTACCTTGAATCACTTGACTATCTTGTAAGTTCTAGACACATACAAAGGTACTAGATTCGATTTGGTATAAGAACAGTTTGATTATGACCATGGGGTTGAGAGCTATATA
TAAAGTTAAGTGAAACAATATTAAACACTTTATGTGAAGCAATGGACCCTTTGAAGAATCCCTTGGATAACCAAATGAAATTGATGTTTCAAACGATATAGAGTACAATGACACTGAGTTCTGGTCTAATTAGAGAAGTACTTAACACTTATTCTACGAACACTTAAATACATGCATAGTTCTCTAAATAATTTTCAAAACCGCTTTTAAATACCTAATTAAACATTACGAAAAATTTGAGTAGTACTAGCATAAAACACTTTTCACCTTTGCAAAAATCATTCTAAACTCACTCTTAGAATATATTTGTTGTAGTTAGCTTTTGCTTGTTTTGAATTAATATTAACTTAAAATTTTGTTTAACAATAAGCTGCAATAACTTTTTCAGGATGTAAACGTTGTTCCGGAGGGCTTTGTTTCAAAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAGTATAATTAAATTAGGTTTGATGTGGTGAGTATAACTAAATTAGGGTTGAGAGTAGGCTCTTCGTAAATCCATCAATATATATATTAAATAAGAAATTTCAAGGTATGGTTGCCTACTGATTTTTCTTAGGAGCTCATGCTAACAAGAATGTGGACTGTTCATGTATGGGATATTGCACCGGGTAGCTTTTATTGTTCTTTCAGAGTACATACTAATATGTACTTTATTTAATCTGGTAAGCACGGATAGAAGATTACTAATGATTACATTAATTTAGGTGAAAACAGGAATGCTACCTTCTACTGGCATCGTTCAAGTTCTACATCAGTGCTTGAAGGAGTTTCCCGTTCGAGGTATAAAGTAATTAAGGAAGTTGTTGGTGGATTTCTTTGATTTAATCGTGGTTTAAATTTTAATGTAATTAAAGGCAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACTAGTGGAGATGTTCTGGCTGGTCCTGCTATTATTTCAGTGTTACAGTAAGTTGATGCTTTCTTGAAATGGAATATCTTCAGTCCTTCAAGGGGATTCTATTTGTATGATTATGATAATTTCTTTCGTTCCATATTTTCATTTTTTCAATATCTTCATAATTGGGACAGTGTTTTCTGGTTTAAAAGGCTTCCGAAACTTATGTGTGGAGCTCAAACTTTTAACGTTAAAGCTTACATCTGACGCACGTTTATAAGAGCTTATTGAGGCTTGAAATCCTCATGAATCACACAATAATTTGAAAAATAATACTATTTTGTTCCTCTCCCCACCATGATTTTAGATGGCATCCATTCATCATAGCAAGAGCTATATGAATCTGCCTATAATGTGATGTTTTATCTTTTATGTGGTATTGCTACTATGTTCACTTTCACTTGTTAGATTGAGGGTTTCAAAACCTCTCCGCTTGTGTAGGCTGCTTTAGAAGCTTTTCTTTTATACAATTTCTGGCATGTTTCTAATAAAAAAAGAACTCGTATTATCGCTGAAATATGAGGGACTACACGGGGTTGATGGACATTTGGAGAGCACTCATATTATCGCTGAAATGTTCAGGAGCTTTAAATAATGAGATTTCAATGTCACAGCAAGAATGTGATCCAAGTGAGTTCTGCGATTGCATTTGCCATAGCTAGATATTCTACCTCAACGCTGGATCTAGAGACAGTTGGGTGCTTCTTCGAAACCCATGAGATGAGGTTGGATCCAAGATACACACAAATGCCTGAGGTGCTCCTCCTTGTGGTTGGACAATCAACCAATCTGTATCACTAAAATCATAGTGCTTAAGGCAATTGTTTTTAAGAAAAATGATGCCATGATGCATACTATGCTTAGAGTGCATTTAACTTCCCTGGAGTGCTGCATTGTGGGTAGTTGCATCTTTTACAAGCTTTGTTGATGGCAAAGGCGATGTCAGGTCTTGTTAAGGTGAGGTATTGGATGGACCCAGCAAGGCCTCTATAAGTGGTAGGATCGACTAGTTCATCATCTCTTGGGCATTTTTTGT
TGCTTGTTGCTGTTGGAGTTGATATTGGGTAAACATTTAGTTAGCCTTAGCTAGATATCTACTGTGTGTGTATACTTAATTTGAGTGATGTGAAAACCATGGGGTATATGCTTTACTTCAACTCCTAAGAACTAGTGAAGGATCCTGAAATCTTTTAAAGCAAGTTCTTGACCTAACTTGGTCACTAAGGTGGCAATAAAGGCTGGGTCATTGCCAGTGAGAATAATATTATCAACGTAAAAAAGCATAAGAACTAAGGTACTAGAATTTTTGAAAAGGAAGAGAGATGGGTCGAAGGTGCTACATGTCCTATGTGAGTAAGGAAAGTGGACGATCTATCGAACCAAGCTCTAGGTGCTTGTTTTAAGCCATAAAGTGATCATTTCAATAAATAGACAATCAGGATGAGCAACGTAAGTGAAACCTGGAGGTTGAGCCATGTACACTTTTTCTTTGAGATAACCGTGGAGGAAGGCGTTTTTGACATATATTTGTTTCAGGGGCTAGTTGAAAGAAGCAGCAAGAGCAAGTACTAATCAAATTGTTGTTGGTTTAACAAGGCTAAAGGTTTCCTCATATTCGAGACCTTCAATTGAAGAATATCCTTGAGCTACTAGTCTCCTTTTGTGTTTCTCTATGGTTCCATCTTCTTTGAATTTTCTTTAAAGATCCACTTGGAGCCAACTATGTTGGTGTTGGGTGGCTTAGGTACAAGTTTCCATGTTTGCTTCTCATGTAAGGCTTTAATTTTTGCCTCCATTGCATCCTTCCACACGGAAATTTTGAGTGCAGAGCGAGATGTCTTTGGTTCAAGAGATGGGCTTCAAGTGGCACTAACTAGGAAGGTTGTGTAGGAATGGATTTTGTTATGTATTTTGGTTTCACTTTGGACCGAGTGACCATGGGATTTAAAGAGAGAGGTATTACTGCTGGTTTGTTTAGGATGGGCTACATGATCTTGAGTTGGTAATGGAGGCAATGAATCTGTAGCATCGTCAAAAGGTAAGTCAATAAACAGAGAATTATAAGTGTAGGCAAATCAATAAAATTAGATTTAGTTGAGGGTTGAGAGATCTCATGGGCAGAGGATGCAGTTGACTGTTTTGGTGTTGAGGTTGGCGGTTGATCTAATATCTGCCCATATGATTCGTGCTCAAGGTTTTGACATTCGTCTAGGCTATGTTCTTCCACACCAATTCGTTTAGAATTTTGATCGTCTAGTGACATGAGATTGCTTGATGGTTCTTGGCTCTTGGCATCTGGTGTCGCGGTAACAATTGCTTTATAACTGTTGGGATCTCCAAAGATTATTTTGGTCTTACATGGTCTGACAGTCTCAAAATTATTTGAGTTGAAGGTAGGATGGAGAGGAGCTTGAGAAATTGTTTCTTGGCCAATCTTGGAGTTGATTGTGGCATCTAATTTGTCACTTTCTGCATCTAAAAATGTTGTAACATCCAAGATAGGTAATTATGTGGTGTGTGTTGATTCTGTTATGTAAGGCATGGTTTCTTCATCAAATATAACATGTCGAGATATACACACACGATTCTAGTAGGGTGTAAAGGGAAATGCATTTTATCAAGTGTTTCATGACATAACTTGTATGCATGACACGAGGCAGGGATTTTGTGAGGCACTTAATAATAAAGATTATTAAGTGTGCTTGACCAAATAAGCTACTAAACATATTTATAGTGATGAATGGTGGGTATTCATGATCATAGACATCATGGGTGGTTACTCCCTAATTTGTAGGAAGGTGGCCTAGAATTATGCAAGGTGGGAAGTGGATGAAATAGGGAGAAAGTGGTCATGTTCTCCACTATCCTAAAAGGTAGGAGTTAAATGTGGAAAAATGGAAGAGTGGCCCTAATTACATGTGTATTGGGTGGTTTTAGCCCACTAGGGACCCACCTGGCCCTTCTAAAGTTATTTCTCCCACACCTGGATGAGCTTGGGGATTGTTCAAGGCATCTCAAAATAGATAGGTTATCTTA
GAGATCTCAATATCTCAAGTATTCATTAGTTCACACCTTGCAAGGGAAGATCTCTCATGGTGGGAGATTTCTAGTCTTGAGTCTTGTAAAGTGTGGTTTTCACTTTTCATAGAGCTTTGACGAATAAGGTGTGCTATATTTTCACCATGTATATGTGCATCACTCCTCTCTCTAGCTGGGTTATGTGGTGATCGTTTGAGTTGAAATTGGGAGTTCATTGTTACGATATCACCCTAAAAATAACACAAATAATCCTAACAAAGGCAACAAATGGAGGCAACACGCTGTAATCTTTACAATAATCAACCAAACCAGAAGTTGACAAGATGTTTAAGCAAGAAAATACAAGAAAACACAAAACAAAGCAAACTCCTTAGAGAAGGCTGGCAATCTCTCCCATAAAGCTCCAGCAGCCTTTTACTACAAAAAAACCTACCTACTCCAATCCCCAAACATTCTATTTATATCCCTCCTTTATTTGTGGGCCCCACTTGCCATATTCTTTATTACAATCATCTCTCCTTGATCTGTCTCCTGGTTAATGAACCTTTTTGCCTCTTCTTTTATATCTGTGAATAATTGGGGGTCTAACAATCATCTAGTGTTCTGAGTGTCCTTGTGGGTGTTTCGGGCAAAGTTCATTAAGGCTTGTCAAGTAGTGTCTTGATGTCGCATGGGATTAAATGGTGAAAAATCTATAGCAGGGATTCTCTCTTGTGACACCCGAATTCCCCAATAGAGCAATAGTTCGCTAATTGAATCAAAATAGTTTCATTTTTTAAAATTCAGAAGTCATGTTTGATCTTTTTATGTTAGTTTTGCATAAATCTTTGACATGTCTTTTGCTTGCAGGGAAAAACTTCTACCAATGGCGGACTTGGTAACTCCAAATTTGAAGGAAGCATCTGCCTTACTTGGTGATATGCCACTTACAACAATTTCTGACATGCGTCATGCTGCAATGTTAATCTATCAGATGGGATCAAAGTAAGTTCTTAATCTATTAGTCATCAGATAAGCTTGCCTTTGTAATTTCATTGGTATCAATGAAAGACACCCATTTTTCTCGTAATAAGAAAGGGAGAAGAAAAGGAAGTTTTTGGTTGCCTTGCCTCTGGCCGAAAAGGAAGTTTTAAGCTAAATTATCAAAGATACCTTTGAATTTTGCAGGTGCCTCTATTCTTTCAAAGTTGCATCTTTCATTAACGTTTTAAAACTACTCTAGGAGTACAAATCCTTTAGAATTTCGGATGAAATTTTAGGATATGAAACCATGTTGTGCTGATCTAGAATAGAAAATTGGATTGTAATATTTCAAACATTTGAAAGTATTGGGAGTACTTTTGAAACAAGTTATCTTTTATAATTTGAAGGGTGTAAATAAACCATCATGGATTGGTCGTGGTTTTATATTTTCCCCACTGTGCTGTTAGTAATTAGTTGACTGTTGGAAGCTGTAATGTTTTATATTTTCCTGATATATTAGTTGGCTATTGGGTTTAGTATAAAAACCCAATAGTTTGTCAATTTGTGGGGATAATAGAAAAATTTCCGGAACAAAGACTTTAGTCTCTTTACGTGTTCTGTCCAAGAATCGGACATACACTAACTTGCTATTAGAGCTTCACCGATGAAGGGAGCTGTGACCGGCAAGAACACCTCTGACAAGGGAAAGGTGGTAGTGCCGACAGAGGAAGAACTCCCATCTCCACGCACTTCGACTGCCCGATTGATGGCCATTGAAGAAAGCATAGCAGAAATATTCAACCAAATGGGAGTTCTTGAAACAATTATGGAGAGACTTGCTCGCTGGCTTGAAGAGGCGTTGACTGCCCTTCTTCAGCAGCAAGAGATCGGAAATGGTAACCACCAAGAACTCGGAATTCGGAACACAAAAATCGCCAGAGTTCGTTAGAATTGTCCGAGAAAGGTGGGGACCACCTCAGAGGCCGCCGACTACAAGAACCCAATATGGGTCTTCAAGTTTTAGAGGGTTTTGA
AGAGGAGGAGATGTTATTTCGGCCATGCCAAAGACGGAGGAAATTTGGGAGATGGCCGGCGACAGAGAGAGATTAGATATCAAAACCTAGTTTTCAGAAGGGGCCGAAATGGTGGAGAAGATTATTTTGAAGGCAGATTAGAACCAGCCTGATATAGAGGAGATCTAAGAAGAGGAATGTTTCAGCAAAACTTCAAGATGAAGGTCGATTTACCAAATTTCAGTGGTAAATTGGATGTAGAAGCATTTCTTGATTGGGTTAAAAATGTGGAGAGCTTCTTTGAGTATATGAAGACAGCCGAAGACAAGAAAGTCAAGATGGTCTCACTGAAGCTCAAATTGGGTGCATCGGCTTGGTGGGATCAAATCCAAGTCAATCGACGCTTAATTGGCAAAACACCCATAAGGAGCTGGCCAAGAATGCTCAAGATGATGAAAGAACACTTTCTACCCACGGATTTCGAAAAGATTCTTTATCAACAATATCAATGATGCCGTCAAGGTAATAGGAAAATGGGAGAATATGCAGAGGAGTTTCACCGTCTAAGTGCAAGAACTCAGACGAACGGGAGTGAAAATTATCAAATAGCAAGATTTGTTGATGGTCTTAAGGAAGACATATACGAACAGCTAGATTTACAGCCCATAGCCACACTACTGGCTGCAATTTTGATGGCTTTTAAAGCGGAATGGAAGCTGGAAAAAAGGCAGAAAAACAGCGGCATCGAGAAGAACCTGTGGGAGAAGGCATTCGCCCCGTACCAACGAAAGAACTATGACAATGCCAAGCAAGCTCAAGGCTCCGGTACAACAATGGCGAAAGAAGGACAACCTTCCAAAACAACTCATAGCCCAAGAACTTGAGAACTCTCTACCAAGAAGAATGGTTCAACCAACTACCCAAGACTGAATTTGGGATTTTGCTACTGGTGCAACCAAAAGGGACACTTATCCAATCAATGCCCACAACGGAAGATGGTAGCACATGTTGAGGAAGGAGGGAGCCAAGAAGACGAAGCTGAACCTAATTCCGAATAGGAAATAAATGAGTTGGAACTGGATGAAGGGGAACAACTATCTTGCATGATACAACGAATTCTCCTAACACCAAAAACGGAAACTCACCCTCAACGACATTCATTGTTCCATACTCGCTGCACAATCAACGGTAAAGTTTGCAATTAGGTAAAGTTTGCAATGTCATAATTGATGGCGGAAGTAGCGAGAACATTGTCTCCTCAAAACTTGTTCAAGAAAAGCTTGTTCTAAATTTCAAGTAGTTGCATTTGATTCTTTGCCCACTCTTTATAAAGACGACCCCGATTTTAGTCAAATTTGGTCTCTTTGTGTTGATCATGTCAATTGTAATGACTTCCATTTAACTAACGATTTTCTCTTTAAGAATAATCTTCTTTGCATTCCTAGAACATCTCTTCGTAGGATTCTTCTTCTCTTCCCTCTTCGCTTCGTCAGTGAAAGACCGCCGGCATTTCTTTGAGTGGAAAAAAATCAGTCAAATGTAAACTCGAGCAGCTTCTCGCACCGCTCCGTGGGCAGCTTCTCGATCCGGCACTCTTCCTTTCGAGGTATGGTGAACCTACCTCTCCCCACTTGCTCTGCGGAAAAAGCGGAATTCTTGGTCTTACCACTTTCCTCTCGTATTGCGGCCTTCTCCTGTTCATCAACTCGAACCGCGTAGTGGGTCTTCCGCAGTTGATGTTACGGATCAGCCCCACTCTTCAGGCCAGCAGCTTTGCTCAGCATCTTCTGCCGATTCTCAGAATGGTCTTGTGGGGCATTTTGGCATAGACAAAACTTACCAATTATTGAGTGATAGGTTTTATTGGCCTCAACTCCACCGAGATGTTACAAAGACTGTTAAGCACTGTTTTACGTGCCAAACTGCCAAAGGACAGCAGCAGAATACTGGACTGTATACACCTTTACCTATTCCCAAAGGCATTTGTTAAGTTTTTTATTAATTAGATTAAAATTAGTTCA
CTTTTGTTAAGTTTTCTCTTAGAATTAGGATTAGGATTAGTTTACTTTTTGCTATAAATTGAGTAGTTATCTTCTTGTATTCACAACTTGAAAAACATTAATACAATTCTCTTATTTGAGTTACATCAAAGGGTCTGTAGATGATGAAGATACAATTAATTTGTCGAAGCCTTTAGATCTTTGAGGAGATCGGTCTTTAGTTTATAGATGATGAAGATACAATCAATTTGGAAAAGATCAATTCGTTGAAGAGACAATTTATTCGTTTAGAAGAGATGCTCAATTTGTTGAAGACGAGACGATCTATTCATTGGGGAAGATATGCTTAATTTGTTGAGGGTCATTTTGTTGAATTCTCTAAATATTGATTTTTGGATCTCTGCCTGAATGTATCAAATGATGCGTATTCTCTGTTTATATAGTTTTGGGAGGCCCATGTGGGCTTGGTTGACTTTGGGCTCAACTTTTAGACTTGATCTTAAATTAATATGTTTTTAAATAGATTTATTGTTCAAGTTTGTCCCAATTTATTTTATTAAACCTGGGACAGCCTAGCTTCGGGCCCAATCGCCACTTAGTGTAATTCGATTGGACAAGCTAACATTGCCCGTTGGATCAATGCTTAGAAAAGTGGCGCCATTTGATTTGCCACTGTCATGTTTCTAATTTTAATTCTAGGAAGCATGCAAATTTGTAATTGGTCGCAAATTAATTACAGGTCAATTTGTGATTACATGGAATTTTGTCAATAGAAATAATTTTATCGATAGAAAACAAAGATAGGGAATTATAGAATGAGTGACGGGAGGGTGGGATGCCGAGTTCAGGTTTAGGCATGGGTTTACTCAAAGAAAAACATCCAAATTTCTCTCCTTGGATTCTTAATTTTATATATGTGTGTATATATATATATATTCTAATTTCTTGTTCTGCAAGCTATTAACTATAAAGATAATTTCGATTGGAAAATTGCATCAAATAACAAACATTTAGGAAAATACAGCCCATGACACCTCTTTTTTGCATATTGCGAATATGACAAGTAATTAGTGATATCAGAGGGTTATCATCAAATGGTTATCAACTTTTAAATTTGTTACTTTTGCAATTTTGAAAAATGCAGCAACATGAGTCCTATTATCATAAATTTTTTTGCTATTTTTGCCAGCGCTCCATTTCGGTTTGGCAAAATGAGTTAGTCCTATTCCCTTTTTCCGGGTCGGAGGTCTAGTTTCCCATCCTCAGTTTTTAATATTATATTCAAATAAAGAAACGATAACAATAATTTCGAATTCAATTTTCTAACCAATTAAAATCAACCTCACACATTTTAGAAGGCCTCAGTTTCATACGGTCATCCATAGGTTATAGAAGGTTTAATTTTGTAGGCATGACGGTCCTAAGACAATATACATTATTGGCATTTGGTTCAATGGTTCTATATTTTATTAAAAATTAACTTACGGAGTGTTGTGCTGTTTTCAGAAATGTACTTATCAAAGGTGGGGACCTGCCCGATTCATTGGATGCGGTTGATATATTCTTTGATGGTAAGTACATATTTGCAAAATTTTGACAATTTGATATTATTTCTTTTTGTTACGAAGGAAACATCCATTGAATCCAAAATTTAAAATAATTTTCACAGGCAAGGATTTGCATGAGCTACGATCTTCACGCATAAAGTCTCGCAACACACATGGTACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAACTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGTAATTGTATATGGATCTCTTCTTTAATTTACTAACTTCAGCCGTAAGAAGATCCAAGTCTTAAGGGCTCATAATACAACTTCCTATTGAATAGTTAAAGAAATCTAGTTATAATAATCTAGCTGTAGTAATAGCTCTAATTTACCGTTAGGGTGGGCCATGAGCATCATACAAGGAAAAATAGCATCGACAATCTTATTGATGAAAAAGGAGAGAAGAAAATA
ATAGAAACAAACATCTTATTATGATATATTTTCTTAACTCATTCATCAAAATATGGTTGCCCCTCAATTAAATTAGGAGTTTTGGATTCTTTGATTATTTGTCTTCTTTCTTTTTGGATGAAAAGACATCATTCTCACTCCATATCCATTTGATGACTGAATCACGCTTATAAGCTTTGTTGTTGATTAACAGACTTGATGACTGCCTACAACTAAGGTTTTACTTATCTGAATTTTTACAGGCAAGCAAACAGTTCATCGAAAGAGCATTGAGGTACAGCAAGGACATTAACATTGGACATGGACCTCAAGGCCCATTCGATCATCTATGTTGTCTCAAGAATCGAGAACCAAGTTCCTACAGTCAGGGATGTTTCAATCCAGCTGACTTGTTTTTGTATGCTGTTACGGATTCAGGTATGAATGAGCGTTGGGACCGTTCTATCACTGATGCTGTTAAAGATGCAGTGGAAGGAGGTGCTACTATTGTTCAAATAAGGTTTGCCTAATGTCTTGTATAAGTTGATATATAATTTTAAGTCATTACTTCAAAATGCATGGAATTCTTTGTTGCATTTTATATCTCCTCTCAACACCTCAGTTCAAAGGTTAAATGGCTTTAGAAATTAACCCATAAGTGTAGTCTGGATGTTCCTTGGGTGCTCATTGGGCGAGCGATAGTGAAGTTGTCATCTAGATTTGAATATCTTAGGCCTTGAACCATGAAGGTCGTGTTTGGTAGCCATTTTGTTTTTAAAAATTGAGTCTATAAACCCTTAACTATTTTGACGAAGAAAGAACACAAGATCTACGTGGAACCCTAGTACAGGGAGAAAAACCACGGAATAGATGTCCCTTATTATTTTTTGATGATAATGAAGGTACAAAGGGGAAATATTTATAGGCAACACGAGCCTAATTAAAATAATAAAAAAACTAGGGCAAAGTAAATCTCAACTACGCTTGGGCTTTCAACATGACCCAAGCCCACTATTTCTAACACTCTCCCTCAAGTTGGGATGTAAGTTTCGGAAAGATCCAACATGCTAACACATGATTGAAAGTTTTGTCTAAGGAGCCCCTTTGTGAGTATAACAGTAATCTGTTGACTCGAGGGTATGCAAACGCTGACGAATTTAGTATAGGCCCGGGGGGCTCAAACCCCCCCTCAACTTTATTGTCTTTATATAATATGTATAAACAAAACCAATTAATGCTTAATATTTGTAGACAGTTTAGTGGTAACTATGCTTGTCCGCCCCCCTTCCAACTAGATTTCAAGTCTTTCTTATGTAGTTTTTTTCAGATTTTTATTTATCCTCTAAGTTACAACTCGAAGTCCCCTGTTGATAAATTTTTTGGATCTGCCACTGTATGTAAAGGATGTAGATGCTACCAATATCTAGTCTCTCTAAGTGTCTGTCAATCTTCACATGTTTGGTTCTGTCATGTTGGATCGGGTTATTGGCAATGCTAATGGCTACCATGTTATCGCAGAATAGGCTCATAAGCATCTCTTGATTCTGATGAAGATCAGAAAACACCTTCTAAAGCCAGATTTCTTCATAGATTTCCGAACTCATAGACCTGTATTCGACTTCAACGCTACTTCGAACCATAACTCCTTGCTTCTTACTTCTTCAAGTAACAAGATTGCCCACACAAAGATGCAGTATCTTGAGGTGAATTTTCTATCATCAATAGACCCTGCCCAGTCAGAGTCAGTATAGGCCTCAATACATCTTCTGTCAGTTTTCCTAAACCTCAACCCTTTACCGGAGTTGCTTTTATATACCTTAGAATTCTATTAATGCCTCCATGTGGTCTTCATAAGGAGCCTGCATGAACTAACTGATAGTACTTATAGCATAGGAGATGTCGGGTCTAGTGTGAGGGAAATAAATCAGCTTTCCCTCAAGGCGTTGATATTTCTATTTGTCAACAGGAAACCTGTCACCTGAATTTTTGAGTTTGACATTGAACTCAATTGGCGTATCAAC
AAGATAACATCCCGTCATACTTGTCTTAGCTAATAAATCAAAGGTGTACTTTCTCTGGCCTAGCAACCTCCATCTCAAGGAAGTATTTTAGATTCCCCAAGTCTTTAATGTCAAACTCATCCCTCATCTTCCTTGTTAGTTGGATGATCTCAATGGTGTCATCCCCATACAGTATAATGTCATCAACATAGATGATCAATAATGCAATTTTCTCAGTCTTGGAGACTTTTTTTGTGAACAAGGCGTGATCGAAGTGGCCTTGATTGTATCCCTGGGACTTGACAAAAGTAGTAAACAACCTGTTAAACCAAGCTCTTGGTGACTGTTTCAACCCATACAACAACTTTTGAAGCTTGCAAACCTGTTTGTCAAATTGAACTTGAAGTCTTGGAGGATGACTTACATACACTTTCTTCTAAATCTACATTCAAGAAAACATTCTTAAAATCAAGTTGATATTGAGGCCAATCTTTATTTGCAGCAACGGACAACAAGACTCTAACAGTGTTTAACTTGACAATAGGAAAAAAGTCTCAGAATAGTCAACCCCATAAGTTTAAGTGAATCTTTTTGCAACTAACCTAGTCTTATGTCTGTCAAGGGTACCATTTGCTTTGTACTTAGTGCAAACACCCATTTGCATCCTACAATCTTGTGTCCCTTGGGAGTGTGTAGAGATCGCAGTCTTCCACTTAGGGCACTCTAAGGCAAGGTGGATATTCTTGGCTATTATAGTATAGTTAAGACGACAATGAAGGCTTTGAACTGGGGTGAAAGGTTCTCGTACGAAACATAATTAGAGATGGAGTGCTTTGTACATGACCTGGTACCTTTCCTCAGTACAATAGGAAGATCAATGGATGAATCATACTTACGAATGTTTTCTAGATGGTCTTGCTTTGTTTCGTTGTGAGTAGATTCTGCAATGACCTCATTTTCATCAATCGTATCCTCTCTGTCTGTGATGGCTTCATCAACGGCCCTTTTTAACCATATTTTCAAAGGATTGTCTCGGACCCGTCATTCTCACTCACCAGTATGTGAGTTAGTGGACTTCGTCTTGAATCATTTATTAATATGTGTCAATAGAATCAATCATACCTTAATCTCGTAAAAGTTTAGGATCTTCGACCGGAGCTGGTCCAACAGTAGGAGATCTAACTTCCTTTTTAAGATTCCTCTTATAGTAGGTTTTCCAGGGAGCTTGATTTGTAGGTAGGACTGCCCTGTGGGATTTGGGTCAGGTAAGGAACCCAGAGTAGGATAGGTAGACTTCAAAGGAACCACATAATTAGACTCTTCACTCACACTCCCCCTCTAAAATAGACTAACAGGAAAGAAAGGACGATCCTCAAGAAAGGTGACATCCATGGTAACTAAGTACGTACGTGAAGGTGAGTGGAAAAATTTGTAGTCCCTTTCATGCAGATGACATCCAACAAGCACACAAGTTGTGCCTGAGGAGAGAATTTAGTTTGGTTAGGCTCATGGCTATGAACATAAGCTATGCACCTAAACACTTGGAGAGGAACATTAGAGATGAGACTGGTGGAGGGGTAGGATTCCTTGAGACAGTCTAAAGGGTATTTGGAGGTGTAGGACATGGGAAAGCATTCGGTTGATGAGATAAGCTGCAGAGAGAACATCGCCTTATAAATAGGAAGGGAAGTAGACAACATAAGGGAACAAATAACTTTCAAAAGGTGATGGTTCTTTCACTTGACAACCCTATTTTAGAGGGTGTAAGCACAAGAACTTTGGTGAATAATTCCTTTAGACAACAAAAACTCGTTAAGGGTGTGGTTTTGAAAGTCACGACCATTATCACTTTGTAGGATTGTAATCTTTGTATGAATTACGTTTCTACGAAGTGATAGAAGTCCTGAAATGTGGAAGTGACTCTAGATTTGTCGGAGATAAGGAAAACCCAAGTACGATGGGTATGGTCATCAATCAAGGTCACAAACCATCGTTTTCCAGAAGAGGTAGTCACCTTAGAT
GGTTCCCAAACATCGCTATGAACAAGAGTGAGGTTGGGTTGACTTGTATGGTTTCGAGGGAAAAGAGACTCGATGCTGTTTTGTGCACACATCACAAGCTAAAGAAGAAAATTCGACTTTAGAAAAAAGACGATGAAATAAGTGCTTCATGTATTGAAAGTTGGGATGACCTAGACAAAAGTGCTACAACATATCGCCTTTTGAAAGTAGCAAAGTGTGAAGATAAAAGACTAATCCTAGAAATCTAGGAAGGAGATCATCATCAACTAGGTAGTCTTCTGCTGTGTCGAACAGTGCCAATCATCCTCCCGGAGCTAAAGTCTGCAAAAAAAATAAAAATAAATAATCAGTTAAGAATGTTGCTTTGTAGTTCAACGCACTAGTATTCTTGCTTATAGATAGTATATTATAGGAAACTCGAGGCACATGCAACATATTGTGTAAGGTTAATCCTTCAAAAAGAGAAATCTGCCCCTTTTCGACAATGGGAGCCAAGGAGCTATTTGCAATTCTTATTTTCTCATTTCTAGCACATGGAATGTAAGACAAAATGATCAGAAAAACCTGTCAAATGATTTGTGGCGCCAAAGTCCAGGATCCAGATGTTCTTCTCATCTACACTAATAAGACTGAAGGACTGAGATATAATTTTTTGGACAATAGCTCCTAGTGTAGTCGGGTTGGGCTCCTAGGTGGCTGAGAAGGGGTGGGAGACGACTCACTCACATAAGCTTGTCCTGGGTTCTGTTTGTCGTTGGAAGAGCATCTGTTACCTCTTGAGGAATGACCATGTAACTTCCAACACTGCTCATGGTATGCCATTATTTCTTACAATGCTCACAAATAGGGATCGGTTTTCCATTGTGCCTCTCACTACCATGACTAGGGGACTTCGCACTGAGGGCAGTAGAATTAATGGCGGGGTTGTTAGACTACTCATACCATTGGTGCGATCCTCAAGGTGAGCCTCAGAACAAACCTCGACCAGAGAGGGTATAGGTCTTTGCTTGCTCCAGTATATGTCTGCGGACAATATCAAACTTAGGGTTGAGACCAACCGGAAAAACATATATCCTACCAATCTCCTCGATCTTTGGAGTATTGTATGCCATTGTTGGGACAATTTCGAACAATTTCTCTACATAGGTCCATTTCTTGCCAGAAAAGGGAGAATCGGTTAGAAAATGATGTCACCATTCATTGTCCTCAGTTTACATTCGTGAACCTGTTTTTGGTTCTATACTATTAATCAATGAAGACTGAATAAGAGAATCCTCGCTTTTCCAGTACCATTTCTATGGGTCTCTCGGTGGAGGATGAAATATCTCACTTGTTAGAAAGCCAAATTTGTGGTGCCCTTCAAGAATCATTTTGATAGACTGGGACCACGAAATATAGTTTTGGTCATTAGGCTTTTTCACTGAAAAATATCCTGCTGAATGTGCACTGTATTAGACATATAAAAAGAGGATAAAGAGGCGAACGATGCTACAAGATTCTCATATACATCGGTGGGATTGTATGAAGTTAGGACTAGGGTTGGTAAAAGTGGCTCCTAATGTCTTGAATTGCCGCAATTTGTTGTCGAAGCATGTCCAATTGTTGTTGTGCTATTCACAGAGAGGAAGTCTGTATGTGAGGATTTGATTGTGTTGAAGACTCACCACCATTAGAAGAATAGATGTGAGCAGCAGAAATGTGGGTGATGGTTCATCAATATCTCGCTCTGATACCATATCAAAAATAGAGATGAAGGAAGAACACAAGGTTTATGTGGAAACCCTAGTACATGCAAGGCCACAGTATAAATGTTGCTTATTATTTTCTAATGAGAATAAAGGTACAAAGTAGAAATATTTATAGGAAACACGAGCCTAATTAAAATAATAAAAAAATAGGGTAAAGTAAATCTCAATTACTCTTGGACTTTCAACATTGACCTAATCCCACTATTTCTAACAACACTACTTCCACCTCTAACTTTCTTTATTTGTACCAAC
ATTGTACCAACGGTTAAAAAACCAATCCAAAATTTGAAAACTAAAAGTAGTTTTCAAAAAGTTGTTTCTGGAATTGACTAAAAATTCAATCAATTGTATTTAAGAAAAATGCAAATTATCAGAAAACATGGAAATGAAAGACTTAATTTTAAAGAACAAATAACGAAAACCAACATCAAGATGTTACCAAATGTAAACTTAGTTTTTAGTTTTTGAAATTAAGCCTGTGAAGAACCTTTTACCCTTTTCACCTCTAAATTTGTTGGTTTGGTATCTACTTTCTACTGTTGTTTTCTTTTTTAAAAAAATTAAAAAATAAAGTTCTTAAGAACTTGTTTTTTAAATTTAATTAAGAATTCAACTCTCACTTAAAAAAGATGAAAACCATGGTAAAATATTGAGAGTAAATAGACTTAAATTTCAAAAACCAAAAACCAATTAGTTACCAAACTAGGCTTAATTAACTTTTGCTAGAAGTTGAGTACTAGGCATAGTTTTGTAAACTTTTATTCAACTGTTAATTTTCTTCCAACGTAATTCCCTCTTATTTCTATGAACAGAGAAAAAGATGCTAAAACTCGTGATTTCTTGGAAGTAGCAAAGTCATGCATAAAGATTTGTCATGCACATGGAGTTCCATTGTTGATCAACGATCGTATTGACATTGCACTTGCGTGTAATGCGGATGGTGTACACGTTGGTCAGTCCGATATTCCTGCTCATGAAGTTCGCCGCCTTCTTGGCCCCAACAAGATCATCGGTGTCTCATGCAAGACAACGGAGCAAGCGGAACAGGCATGGATTGATGGTGCAGATTACATTGGGTGTGGTGGAGTTTATCCCACTAACACAAAAGCGAACAATTTAACCGTTGGGATTGATGGATTGAAAAGAGTTTGCTTAGCTTCCAAATTGCCAGTGGTAGCAATTGGCGGTATCAATCACACTAATGCAGCTGCTGTGATGGGAATTGGTATCCCAAATCTTAAAGGTGTTGCTGTTGTGTCAGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGGCCTCAAAGTTACATGCTACTTTAGTGGAGGCTACAACATGATGAAATGTTTGAATAATTTTGTAATGCTTTGCTTTTTGTAGAAAGCCTTTTGAAATAAAAATAATTGTCACAATATATATGTCATGTAAGATTCTATTCATATAAAGTAGAGCATATTTATATTCGCTATAATAAGCATTCTGATGGTAACAATAATTGTTAAGTTACCTTTCTATGTAAAAGTACTTTTTGCAACTTTTGTGAAATATAAAAGTTAGATGGAGGCTCATACCAACCGTAAAATCTTATTGCAATCTTTAATTTTTCAAAAAGAAAGATTTCAAAGGAAGAGATGCCAAGCAGGTTCCTGTCATGTCGGCAACCTCAGAAATGTTTCTCATATGTACTCAATAATTGTTGAGACCTTGGAAATTTTTTTATAACTTCTATATACACATTGAATAAAATTTTGAAGTGTATAACATTAGTTAAAGG

mRNA sequence

GACTTCAAAACCCTATTTCATTTTCAGCCACATTGCACTCCACCGACTCAGCGCCGCCGCCACACACTTCATTCTCAGCCGTCCGACCAAATTTTCGCCGCCTTCTCCCTCTCCCACTTTCTCGCCTCCTTGAGCCATTACAGCTACATATCTCTGTTTTATTTCTCAGAAACGCAGTTGTTCAACCCCTAGCCGGCGCCGTTGGTGAGATGGTGCCGCTGCCCCTCATTTCTCAGATCCCTAAGTTCAATCAAGTTTCTAGATTTTGTATGGCCATGAAGAAGCAGGATGAAATGGTTGTAGCTTCAAGTAATCGAAATGAGACGTGTATTCCCCATGTGTTGAGTGTTGCTGGTTCTGATTCGGGAGCAGGGGCTGGAATCCAAGCAGATCTTAAGACTTGTGCTGCTCGTGGAGTGTATTGCTCCACTGTGATAACTGCTATTACTGCACAGAATACTGTGGGGGTTCAGGATGTAAACGTTGTTCCGGAGGGCTTTGTTTCAAAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAAAACAGGAATGCTACCTTCTACTGGCATCGTTCAAGTTCTACATCAGTGCTTGAAGGAGTTTCCCGTTCGAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACTAGTGGAGATGTTCTGGCTGGTCCTGCTATTATTTCAGTGTTACAGGAAAAACTTCTACCAATGGCGGACTTGGTAACTCCAAATTTGAAGGAAGCATCTGCCTTACTTGGTGATATGCCACTTACAACAATTTCTGACATGCGTCATGCTGCAATGTTAATCTATCAGATGGGATCAAAAAATGTACTTATCAAAGGTGGGGACCTGCCCGATTCATTGGATGCGGTTGATATATTCTTTGATGGCAAGGATTTGCATGAGCTACGATCTTCACGCATAAAGTCTCGCAACACACATGGTACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAACTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGCAAGCAAACAGTTCATCGAAAGAGCATTGAGGTACAGCAAGGACATTAACATTGGACATGGACCTCAAGGCCCATTCGATCATCTATGTTGTCTCAAGAATCGAGAACCAAGTTCCTACAGTCAGGGATGTTTCAATCCAGCTGACTTGTTTTTGTATGCTGTTACGGATTCAGGTATGAATGAGCGTTGGGACCGTTCTATCACTGATGCTGTTAAAGATGCAGTGGAAGGAGGTGCTACTATTGTTCAAATAAGAGAAAAAGATGCTAAAACTCGTGATTTCTTGGAAGTAGCAAAGTCATGCATAAAGATTTGTCATGCACATGGAGTTCCATTGTTGATCAACGATCGTATTGACATTGCACTTGCGTGTAATGCGGATGGTGTACACGTTGGTCAGTCCGATATTCCTGCTCATGAAGTTCGCCGCCTTCTTGGCCCCAACAAGATCATCGGTGTCTCATGCAAGACAACGGAGCAAGCGGAACAGGCATGGATTGATGGTGCAGATTACATTGGGTGTGGTGGAGTTTATCCCACTAACACAAAAGCGAACAATTTAACCGTTGGGATTGATGGATTGAAAAGAGTTTGCTTAGCTTCCAAATTGCCAGTGGTAGCAATTGGCGGTATCAATCACACTAATGCAGCTGCTGTGATGGGAATTGGTATCCCAAATCTTAAAGGTGTTGCTGTTGTGTCAGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGGCCTCAAAGTTACATGCTACTTTAGTGGAGGCTACAACATGATGAAATGTTTGAATAATTTTGTAATGCTTTGCTTTTTGTAGAAAGCCTTTTGAAATAAAAATAATTGTCACAATATATATGTCATGTAAGATTCTATTCATATAAAGTAGAGCATATTTATATTCGCTATAATAAGCATTCTGATGGTAACAATAATTGTTAAGTTACCTTTCTATGTAAAAGTACTTTTTG
CAACTTTTGTGAAATATAAAAGTTAGATGGAGGCTCATACCAACCGTAAAATCTTATTGCAATCTTTAATTTTTCAAAAAGAAAGATTTCAAAGGAAGAGATGCCAAGCAGGTTCCTGTCATGTCGGCAACCTCAGAAATGTTTCTCATATGTACTCAATAATTGTTGAGACCTTGGAAATTTTTTTATAACTTCTATATACACATTGAATAAAATTTTGAAGTGTATAACATTAGTTAAAGG

Coding sequence (CDS)

ATGGTGCCGCTGCCCCTCATTTCTCAGATCCCTAAGTTCAATCAAGTTTCTAGATTTTGTATGGCCATGAAGAAGCAGGATGAAATGGTTGTAGCTTCAAGTAATCGAAATGAGACGTGTATTCCCCATGTGTTGAGTGTTGCTGGTTCTGATTCGGGAGCAGGGGCTGGAATCCAAGCAGATCTTAAGACTTGTGCTGCTCGTGGAGTGTATTGCTCCACTGTGATAACTGCTATTACTGCACAGAATACTGTGGGGGTTCAGGATGTAAACGTTGTTCCGGAGGGCTTTGTTTCAAAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAAAACAGGAATGCTACCTTCTACTGGCATCGTTCAAGTTCTACATCAGTGCTTGAAGGAGTTTCCCGTTCGAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACTAGTGGAGATGTTCTGGCTGGTCCTGCTATTATTTCAGTGTTACAGGAAAAACTTCTACCAATGGCGGACTTGGTAACTCCAAATTTGAAGGAAGCATCTGCCTTACTTGGTGATATGCCACTTACAACAATTTCTGACATGCGTCATGCTGCAATGTTAATCTATCAGATGGGATCAAAAAATGTACTTATCAAAGGTGGGGACCTGCCCGATTCATTGGATGCGGTTGATATATTCTTTGATGGCAAGGATTTGCATGAGCTACGATCTTCACGCATAAAGTCTCGCAACACACATGGTACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAACTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGCAAGCAAACAGTTCATCGAAAGAGCATTGAGGTACAGCAAGGACATTAACATTGGACATGGACCTCAAGGCCCATTCGATCATCTATGTTGTCTCAAGAATCGAGAACCAAGTTCCTACAGTCAGGGATGTTTCAATCCAGCTGACTTGTTTTTGTATGCTGTTACGGATTCAGGTATGAATGAGCGTTGGGACCGTTCTATCACTGATGCTGTTAAAGATGCAGTGGAAGGAGGTGCTACTATTGTTCAAATAAGAGAAAAAGATGCTAAAACTCGTGATTTCTTGGAAGTAGCAAAGTCATGCATAAAGATTTGTCATGCACATGGAGTTCCATTGTTGATCAACGATCGTATTGACATTGCACTTGCGTGTAATGCGGATGGTGTACACGTTGGTCAGTCCGATATTCCTGCTCATGAAGTTCGCCGCCTTCTTGGCCCCAACAAGATCATCGGTGTCTCATGCAAGACAACGGAGCAAGCGGAACAGGCATGGATTGATGGTGCAGATTACATTGGGTGTGGTGGAGTTTATCCCACTAACACAAAAGCGAACAATTTAACCGTTGGGATTGATGGATTGAAAAGAGTTTGCTTAGCTTCCAAATTGCCAGTGGTAGCAATTGGCGGTATCAATCACACTAATGCAGCTGCTGTGATGGGAATTGGTATCCCAAATCTTAAAGGTGTTGCTGTTGTGTCAGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGGCCTCAAAGTTACATGCTACTTTAGTGGAGGCTACAACATGA

Protein sequence

MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT*
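
The protein sequence is the translation of the coding sequence. As a minimal sketch, the function below translates DNA with the standard genetic code and is applied to the first 36 and last 18 nucleotides of the CDS listed above. The function name `translate` and the table construction are illustrative, not taken from the database, and the sketch assumes the standard nuclear code applies to this gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 12 codons and last 6 codons of the CDS shown on this page.
cds_start = "ATGGTGCCGCTGCCCCTCATTTCTCAGATCCCTAAG"
cds_end = "GTGGAGGCTACAACATGA"

print(translate(cds_start))  # MVPLPLISQIPK
print(translate(cds_end))    # VEATT*
```

Both fragments agree with the start (`MVPLPLISQIPK...`) and end (`...VEATT*`) of the protein sequence, where the trailing `*` corresponds to the TGA stop codon.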
Homology
BLAST of CsGy5G009580 vs. ExPASy Swiss-Prot
Match: Q5M731 (Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TH1 PE=1 SV=1)

HSP 1 Score: 689.9 bits (1779), Expect = 2.3e-197
Identity = 348/490 (71.02%), Positives = 414/490 (84.49%), Query Frame = 0

Query: 41  IPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSK 100
           +P VL+VAGSDSGAGAGIQADLK CAARGVYC++VITA+TAQNT GVQ V+++P  F+S+
Sbjct: 30  VPQVLTVAGSDSGAGAGIQADLKVCAARGVYCASVITAVTAQNTRGVQSVHLLPPEFISE 89

Query: 101 QLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAII 160
           QLKSVLSD + DVVKTGMLPST IV+VL Q L +FPVRALVVDPVMVSTSG VLAG +I+
Sbjct: 90  QLKSVLSDFEFDVVKTGMLPSTEIVEVLLQNLSDFPVRALVVDPVMVSTSGHVLAGSSIL 149

Query: 161 SVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPD 220
           S+ +E+LLP+AD++TPN+KEASALL    + T+++MR AA  +++MG + VL+KGGDLPD
Sbjct: 150 SIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHEMGPRFVLVKGGDLPD 209

Query: 221 SLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIE 280
           S D+VD++FDGK+ HELRS RI +RNTHGTGC+LASCIAAELAKGSSM SAVK +K+F++
Sbjct: 210 SSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKGSSMLSAVKVAKRFVD 269

Query: 281 RALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRS 340
            AL YSKDI IG G QGPFDH   LK ++P S     FNP DLFLYAVTDS MN++W+RS
Sbjct: 270 NALDYSKDIVIGSGMQGPFDHFFGLK-KDPQSSRCSIFNPDDLFLYAVTDSRMNKKWNRS 329

Query: 341 ITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNA 400
           I DA+K A+EGGATI+Q+REK+A+TR+FLE AK+CI IC +HGV LLINDRIDIALAC+A
Sbjct: 330 IVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVSLLINDRIDIALACDA 389

Query: 401 DGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANN 460
           DGVHVGQSD+P   VR LLGP+KIIGVSCKT EQA QAW DGADYIG GGV+PTNTKANN
Sbjct: 390 DGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADYIGSGGVFPTNTKANN 449

Query: 461 LTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEA 520
            T+G+DGLK VC ASKLPVVAIGGI  +NA +VM I  PNLKGVAVVSALFD+ CVL +A
Sbjct: 450 RTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVAVVSALFDQDCVLTQA 509

Query: 521 SKLHATLVEA 531
            KLH TL E+
Sbjct: 510 KKLHKTLKES 518
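
The percentages in each HSP summary line are the match counts divided by the alignment length (the number of aligned columns, gaps included). The sketch below recomputes them from the Q5M731 summary line as a sanity check. The function name `hsp_percentages` and the regular expression are illustrative and not part of any BLAST tool; the pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Captures "Identity = a/b" and "Positives = c/d"; tolerates the "Postives" spelling.
HSP_PATTERN = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> dict:
    """Recompute percent identity and percent positives from an HSP summary line."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identity": round(100 * identical / ident_len, 2),
        "positives": round(100 * positive / pos_len, 2),
    }


summary = "Identity = 348/490 (71.02%), Positives = 414/490 (84.49%), Query Frame = 0"
print(hsp_percentages(summary))  # {'identity': 71.02, 'positives': 84.49}
```

"Positives" counts columns where the two residues are identical or score above zero in the substitution matrix (marked `+` in the midline), so it is always at least as large as the identity count.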

BLAST of CsGy5G009580 vs. ExPASy Swiss-Prot
Match: O48881 (Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus OX=3708 GN=BTH1 PE=1 SV=1)

HSP 1 Score: 673.3 bits (1736), Expect = 2.2e-192
Identity = 339/490 (69.18%), Positives = 409/490 (83.47%), Query Frame = 0

Query: 41  IPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSK 100
           +  VL+VAGSDSGAGAGIQAD+K CAARGVYC++V TA+ A+NT  VQ V+++P   VS+
Sbjct: 32  VAQVLTVAGSDSGAGAGIQADIKVCAARGVYCASVKTAVKAKNTRAVQSVHLLPPDSVSE 91

Query: 101 QLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAII 160
           QLKSVLSD +VDVVKTGMLPS  IV+VL Q L E+PVRALVVDPVMVSTSG VLAG +I+
Sbjct: 92  QLKSVLSDFEVDVVKTGMLPSPEIVEVLLQNLSEYPVRALVVDPVMVSTSGHVLAGSSIL 151

Query: 161 SVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPD 220
           S+ +E+LLP+AD++TPN+KEASALLG + + T+++MR AA  ++QMG + VL+KGGDLPD
Sbjct: 152 SIFRERLLPLADIITPNVKEASALLGGVRIQTVAEMRSAAKSLHQMGPRFVLVKGGDLPD 211

Query: 221 SLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIE 280
           S D+VD++FDG + HEL S RI +RNTHGTGC+LASCIAAELAKGS+M SAVK +K+F++
Sbjct: 212 SSDSVDVYFDGNEFHELHSPRIATRNTHGTGCTLASCIAAELAKGSNMLSAVKVAKRFVD 271

Query: 281 RALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRS 340
            AL YSKDI IG G QGPFDH   LK  +P SY Q  F P DLFLYAVTDS MN++W+RS
Sbjct: 272 SALNYSKDIVIGSGMQGPFDHFLSLK--DPQSYRQSTFKPDDLFLYAVTDSRMNKKWNRS 331

Query: 341 ITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNA 400
           I DAVK A+EGGATI+Q+REK+A+TR+FLE AKSC+ IC ++GV LLINDR DIA+A +A
Sbjct: 332 IVDAVKAAIEGGATIIQLREKEAETREFLEEAKSCVDICRSNGVCLLINDRFDIAIALDA 391

Query: 401 DGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANN 460
           DGVHVGQSD+P   VR LLGP+KIIGVSCKT EQA QAW DGADYIG GGV+PTNTKANN
Sbjct: 392 DGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTQEQAHQAWKDGADYIGSGGVFPTNTKANN 451

Query: 461 LTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEA 520
            T+G+DGL+ VC ASKLPVVAIGGI  +NA +VM IG PNLKGVAVVSALFD++CVL +A
Sbjct: 452 RTIGLDGLREVCKASKLPVVAIGGIGISNAESVMRIGEPNLKGVAVVSALFDQECVLTQA 511

Query: 521 SKLHATLVEA 531
            KLH TL E+
Sbjct: 512 KKLHKTLTES 519

BLAST of CsGy5G009580 vs. ExPASy Swiss-Prot
Match: Q2QWK9 (Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=Os12g0192500 PE=2 SV=2)

HSP 1 Score: 668.3 bits (1723), Expect = 7.2e-191
Identity = 333/508 (65.55%), Positives = 412/508 (81.10%), Query Frame = 0

Query: 24  KKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQN 83
           ++ + +  ++S   E   PHVL+VAGSDSG GAGIQAD+K CAA G YCS+V+TA+TAQN
Sbjct: 39  RRYNRLAASASAAREMPWPHVLTVAGSDSGGGAGIQADIKACAALGAYCSSVVTAVTAQN 98

Query: 84  TVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVD 143
           T GVQ ++VVPE F+ +QL SVLSDM VDVVKTGMLPS G+V+VL + LK+FPV+ALVVD
Sbjct: 99  TAGVQGIHVVPEEFIREQLNSVLSDMSVDVVKTGMLPSIGVVRVLCESLKKFPVKALVVD 158

Query: 144 PVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAAMLI 203
           PVMVSTSGD L+  + +SV +++L  MAD+VTPN+KEAS LLG + L T+SDMR+AA  I
Sbjct: 159 PVMVSTSGDTLSESSTLSVYRDELFAMADIVTPNVKEASRLLGGVSLRTVSDMRNAAESI 218

Query: 204 YQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELA 263
           Y+ G K+VL+KGGD+ +S DA D+FFDGK+  EL + RIK+ NTHGTGC+LASCIA+ELA
Sbjct: 219 YKFGPKHVLVKGGDMLESSDATDVFFDGKEFIELHAHRIKTHNTHGTGCTLASCIASELA 278

Query: 264 KGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNPADL 323
           KG++M  AV+ +K F+E AL +SKD+ +G+GPQGPFDHL  LK    +  SQ  F P  L
Sbjct: 279 KGATMLHAVQVAKNFVESALHHSKDLVVGNGPQGPFDHLFKLKCPPYNVGSQPSFKPDQL 338

Query: 324 FLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHG 383
           FLYAVTDSGMN++W RSI +AV+ A+EGGATIVQ+REKD++TR+FLE AK+C++IC + G
Sbjct: 339 FLYAVTDSGMNKKWGRSIKEAVQAAIEGGATIVQLREKDSETREFLEAAKACMEICKSSG 398

Query: 384 VPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWIDGA 443
           VPLLINDR+DIALACNADGVHVGQ D+ AHEVR LLGP KIIGVSCKT  QA+QAW DGA
Sbjct: 399 VPLLINDRVDIALACNADGVHVGQLDMSAHEVRELLGPGKIIGVSCKTPAQAQQAWNDGA 458

Query: 444 DYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLKG 503
           DYIGCGGV+PT+TKANN T+G DGLK VCLASKLPVVAIGGIN +NA +VM +G+PNLKG
Sbjct: 459 DYIGCGGVFPTSTKANNPTLGFDGLKTVCLASKLPVVAIGGINASNAGSVMELGLPNLKG 518

Query: 504 VAVVSALFDRQCVLEEASKLHATLVEAT 532
           VAVVSALFDR  V+ E   + + L   +
Sbjct: 519 VAVVSALFDRPSVVAETRNMKSILTNTS 546

BLAST of CsGy5G009580 vs. ExPASy Swiss-Prot
Match: P61422 (Thiamine biosynthesis bifunctional protein ThiED OS=Geobacter sulfurreducens (strain ATCC 51573 / DSM 12127 / PCA) OX=243231 GN=thiDE PE=3 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 2.0e-55
Identity = 129/255 (50.59%), Positives = 168/255 (65.88%), Query Frame = 0

Query: 44  VLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLK 103
           VL+VAGSDSG GAGIQADLKT    G Y S+V+TA+TAQNT GV  ++ VP  FV+ QL 
Sbjct: 228 VLTVAGSDSGGGAGIQADLKTVTLLGSYGSSVLTALTAQNTRGVSGIHGVPPAFVADQLD 287

Query: 104 SVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVL 163
           +V SD+ VDVVKTGML S   +  +   L E+  R +VVDPVMV+  G  L     +SVL
Sbjct: 288 AVFSDIPVDVVKTGMLFSAETIVAIAAKLTEYRRRMVVVDPVMVAKGGANLIDRGAVSVL 347

Query: 164 QEKLLPMADLVTPNLKEASALLGDMPLTTISD---MRHAAMLIYQMGSKNVLIKGGDLPD 223
           +E+L P+A LVTPN+ EA  L G      ISD   MR AA  ++++G++NVL+KGG L  
Sbjct: 348 KERLFPLAYLVTPNIPEAERLTG----ANISDEESMREAARRLHRLGARNVLLKGGHLLA 407

Query: 224 SLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIE 283
             D+VDI FDG   H   S RI S+NTHGTGC+ AS IA  LA+G  +  A+  +K++I 
Sbjct: 408 G-DSVDILFDGAAFHRFVSPRILSKNTHGTGCTFASAIATYLAQGDPLREAIARAKRYIT 467

Query: 284 RALRYSKDINIGHGP 296
            A+R ++ +  GHGP
Sbjct: 468 AAIRLAQPLGRGHGP 477
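
The percentages on each HSP statistics line follow directly from the residue counts (for the hit above, 129 identical positions over 255 aligned columns). The following is a minimal sketch of how such a line could be parsed and the percentages recomputed as a sanity check. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical helpers, not part of any BLAST tooling, and the pattern assumes the exact line layout used on this page; it is lenient about a misspelled `Positives` label, since such variants occur in exported reports.

```python
import re

# Matches the statistics line printed under each "HSP 1 Score" header.
# "Posi?tives" is deliberately lenient about the spelling of the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, ident_pct, positive, pos_aligned, pos_pct = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "aligned": int(aligned),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(identical) / int(aligned), 2),
        "positives_pct": round(100 * int(positive) / int(pos_aligned), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 129/255 (50.59%), Positives = 168/255 (65.88%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a cheap way to catch lines that were damaged during export.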

BLAST of CsGy5G009580 vs. ExPASy Swiss-Prot
Match: P56904 (Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase OS=Rhizobium meliloti (strain 1021) OX=266834 GN=thiD PE=3 SV=2)

HSP 1 Score: 203.0 bits (515), Expect = 8.5e-51
Identity = 124/257 (48.25%), Positives = 164/257 (63.81%), Query Frame = 0

Query: 45  LSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKS 104
           LS+AGSDSG GAGIQADLKT +A GVY ++VITAITAQNT GV  V  V    VS Q+ +
Sbjct: 6   LSIAGSDSGGGAGIQADLKTFSALGVYGASVITAITAQNTRGVTAVEDVSAEIVSAQMDA 65

Query: 105 VLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQ 164
           V SD+ V  VK GM+     +  +   L+ F  RA VVDPVMV+TSGD L  P  ++ L 
Sbjct: 66  VFSDLDVKAVKIGMVSRRETIAAIADGLRRFGKRA-VVDPVMVATSGDALLRPDAVAALI 125

Query: 165 EKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDA 224
           E+LLP+A +VTPNL EA+ + G       ++M   A  I + G+  VL+KGG L    +A
Sbjct: 126 EELLPLALVVTPNLAEAALMTGRAIAGDEAEMARQAEAIMRTGAHAVLVKGGHLKGQ-EA 185

Query: 225 VDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALR 284
            D+FFDG  L  L + RI++RN HGTGC+L++ IAA LAKG  +  AV A+K ++  A+ 
Sbjct: 186 TDLFFDGDTLVRLPAGRIETRNDHGTGCTLSAAIAAGLAKGVPLIEAVSAAKAYLHAAIS 245

Query: 285 YSKDINIGHGPQGPFDH 302
            +  + IG G +GP  H
Sbjct: 246 AADRLEIGQG-RGPVHH 259

BLAST of CsGy5G009580 vs. NCBI nr
Match: XP_004140412.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 1038 bits (2684), Expect = 0.0
Identity = 531/532 (99.81%), Positives = 531/532 (99.81%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE
Sbjct: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLEVAKSCIKIC AHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICRAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV
Sbjct: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532

BLAST of CsGy5G009580 vs. NCBI nr
Match: KAE8648089.1 (hypothetical protein Csa_004689 [Cucumis sativus])

HSP 1 Score: 1004 bits (2597), Expect = 0.0
Identity = 519/532 (97.56%), Positives = 519/532 (97.56%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MVPLPLISQIPK            KQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPK------------KQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE
Sbjct: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLEVAKSCIKIC AHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICRAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV
Sbjct: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 520

BLAST of CsGy5G009580 vs. NCBI nr
Match: XP_031741473.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 1000 bits (2585), Expect = 0.0
Identity = 517/532 (97.18%), Positives = 517/532 (97.18%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MVPLPLISQIPK              DEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPK--------------DEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE
Sbjct: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLEVAKSCIKIC AHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICRAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV
Sbjct: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 518

BLAST of CsGy5G009580 vs. NCBI nr
Match: XP_008460243.1 (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 993 bits (2568), Expect = 0.0
Identity = 509/532 (95.68%), Positives = 520/532 (97.74%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MVPLPLISQIPKFNQVSRFCMAMKKQ+EMVVASS+R ET IPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCMAMKKQEEMVVASSDRYETRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAITAQNTVGVQDVN+VPEGFVSKQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNIVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGP IISVLQE+LLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG MPL TISDMRHAA LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLKTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RIKSRNTHGTGCSLASCI+AELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCISAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLC LK+RE SSYSQGCFNP DLFLYAVTDSGMNERWDRSITDAVK AVEGGATIVQIRE
Sbjct: 301 HLCRLKSREQSSYSQGCFNPTDLFLYAVTDSGMNERWDRSITDAVKAAVEGGATIVQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALAC+ADGVHVGQSDIPAHEVRRLLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           PNK+IGVSCKT EQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV
Sbjct: 421 PNKVIGVSCKTMEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           AIGGINHTNAAAVMGIGIPNL+GVAVVSALFDRQCVLE ASKLHATLVEATT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLRGVAVVSALFDRQCVLEAASKLHATLVEATT 532

BLAST of CsGy5G009580 vs. NCBI nr
Match: XP_038876926.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic [Benincasa hispida])

HSP 1 Score: 959 bits (2478), Expect = 0.0
Identity = 489/532 (91.92%), Positives = 513/532 (96.43%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MVPLPLISQIPKFNQVSRFC+AMKK +E VVASS+R E  IPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCVAMKKHEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAITAQNTVGVQDVN+VPEGFVS+QLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNIVPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGIV+VLHQCLKEFPV+ALVVDPVMVSTSGDVLAGP IISVLQ++LLPMADLVTPNLKE
Sbjct: 121 STGIVRVLHQCLKEFPVQALVVDPVMVSTSGDVLAGPTIISVLQKELLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG MPL TISDMRHAA LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLKTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RIK+RNTHGTGCSLASCIAAELAKGSSMFSAVK SKQFIERAL+YSKDI IG+GPQGPFD
Sbjct: 241 RIKTRNTHGTGCSLASCIAAELAKGSSMFSAVKTSKQFIERALKYSKDIGIGNGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLC LK+RE SSY QGCFNPADLFLYAVTDSGMN+RWDRSI+DAVK AVEGGATI+QIRE
Sbjct: 301 HLCRLKSREQSSYRQGCFNPADLFLYAVTDSGMNKRWDRSISDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           K+AKTRDFLE AKSCIKICHAHGVPLLINDRIDIALAC+ADGVHVGQSDIPAHEVRRLLG
Sbjct: 361 KEAKTRDFLEAAKSCIKICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           PNKIIGVSCKT EQAEQAW+DGADYIGCGGVYPTNTKANNLTVGI+GLKRVCLASKLPVV
Sbjct: 421 PNKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGIEGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           AIGGINH+NAAAVM IG+PNLKGVAVVSALFDRQCVLEEA KLHATLVEATT
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEEALKLHATLVEATT 532

BLAST of CsGy5G009580 vs. ExPASy TrEMBL
Match: A0A1S3CCI6 (Thiamine-phosphate pyrophosphorylase OS=Cucumis melo OX=3656 GN=LOC103499122 PE=3 SV=1)

HSP 1 Score: 993 bits (2568), Expect = 0.0
Identity = 509/532 (95.68%), Positives = 520/532 (97.74%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MVPLPLISQIPKFNQVSRFCMAMKKQ+EMVVASS+R ET IPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCMAMKKQEEMVVASSDRYETRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAITAQNTVGVQDVN+VPEGFVSKQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNIVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGP IISVLQE+LLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG MPL TISDMRHAA LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLKTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RIKSRNTHGTGCSLASCI+AELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCISAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLC LK+RE SSYSQGCFNP DLFLYAVTDSGMNERWDRSITDAVK AVEGGATIVQIRE
Sbjct: 301 HLCRLKSREQSSYSQGCFNPTDLFLYAVTDSGMNERWDRSITDAVKAAVEGGATIVQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALAC+ADGVHVGQSDIPAHEVRRLLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           PNK+IGVSCKT EQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV
Sbjct: 421 PNKVIGVSCKTMEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           AIGGINHTNAAAVMGIGIPNL+GVAVVSALFDRQCVLE ASKLHATLVEATT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLRGVAVVSALFDRQCVLEAASKLHATLVEATT 532

BLAST of CsGy5G009580 vs. ExPASy TrEMBL
Match: A0A1S3CDB7 (Thiamine-phosphate pyrophosphorylase OS=Cucumis melo OX=3656 GN=LOC103499122 PE=3 SV=1)

HSP 1 Score: 953 bits (2464), Expect = 0.0
Identity = 489/512 (95.51%), Positives = 500/512 (97.66%), Query Frame = 0

Query: 21  MAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAIT 80
           MAMKKQ+EMVVASS+R ET IPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAIT
Sbjct: 1   MAMKKQEEMVVASSDRYETRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAIT 60

Query: 81  AQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRAL 140
           AQNTVGVQDVN+VPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRAL
Sbjct: 61  AQNTVGVQDVNIVPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRAL 120

Query: 141 VVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAA 200
           VVDPVMVSTSGDVLAGP IISVLQE+LLPMADLVTPNLKEASALLG MPL TISDMRHAA
Sbjct: 121 VVDPVMVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAA 180

Query: 201 MLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAA 260
            LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCI+A
Sbjct: 181 TLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCISA 240

Query: 261 ELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNP 320
           ELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLC LK+RE SSYSQGCFNP
Sbjct: 241 ELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCRLKSREQSSYSQGCFNP 300

Query: 321 ADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICH 380
            DLFLYAVTDSGMNERWDRSITDAVK AVEGGATIVQIREKDAKTRDFLEVAKSCIKICH
Sbjct: 301 TDLFLYAVTDSGMNERWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEVAKSCIKICH 360

Query: 381 AHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWI 440
           AHGVPLLINDRIDIALAC+ADGVHVGQSDIPAHEVRRLLGPNK+IGVSCKT EQAEQAWI
Sbjct: 361 AHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLGPNKVIGVSCKTMEQAEQAWI 420

Query: 441 DGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPN 500
           DGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPN
Sbjct: 421 DGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPN 480

Query: 501 LKGVAVVSALFDRQCVLEEASKLHATLVEATT 532
           L+GVAVVSALFDRQCVLE ASKLHATLVEATT
Sbjct: 481 LRGVAVVSALFDRQCVLEAASKLHATLVEATT 512

BLAST of CsGy5G009580 vs. ExPASy TrEMBL
Match: A0A6J1GYV1 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC111458768 PE=3 SV=1)

HSP 1 Score: 920 bits (2378), Expect = 0.0
Identity = 469/531 (88.32%), Positives = 498/531 (93.79%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MV LPL  QIPKFNQVSRFCMAMKK +E VVASS+R+E  IPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITA+TAQNTVGVQDVN++PEGFVS+QLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIIPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGI+QV+HQ LKEFPV+ALVVDPVMVSTSGDVLA P IISVLQ++LLPMADLVTPNLKE
Sbjct: 121 STGIIQVIHQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG MPL TISDMRHAA LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RI +RNTHGTGCSLASCIAAELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLC LK+RE SSY QG FN +DLFLYAVTDSGMN+RWDRSITDAVK AVEGGATI+QIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLE AKSCI+ICH HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLG
Sbjct: 361 KDAKTRDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           P+KIIGVSCKT EQAEQAW+DGADYIGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEAT 531
           AIGGINH+NAAAVM IG+PNLKGVAVVSALFDRQCVLEE  KLHATLVEAT
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEAT 531

BLAST of CsGy5G009580 vs. ExPASy TrEMBL
Match: A0A6J1KCE1 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita maxima OX=3661 GN=LOC111492599 PE=3 SV=1)

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 466/531 (87.76%), Positives = 495/531 (93.22%), Query Frame = 0

Query: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60
           MV LPL  QIPKFNQVSRFCMAMKK +E VVASS+R E  IPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITA+TAQNTVGVQDVN++PEGFVS+QLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180
           STGI+QV+ Q LKEFPV+ALVVDPVMVSTSGDVLA P IISVLQ++LLPM DLVTPNLKE
Sbjct: 121 STGIIQVICQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMVDLVTPNLKE 180

Query: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG MPL TISDMRHAA LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300
           RI +RNTHGTGCSLASCIAAELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALNYSKDISIGNGPQGPFD 300

Query: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360
           HLC LK+RE SSY QG FN +DLFLYAVTDSGMN+RWDRSITDAVK AVEGGATI+QIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420
           KDAKTRDFLE AKSCI+ICH HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLG
Sbjct: 361 KDAKTRDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480
           P+KIIGVSCKT EQAEQAW+DGADYIGCGGVYPTNTKANNLTVG+DGLK+VCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKKVCLASKLPVV 480

Query: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEAT 531
           AIGGINH+NAAAVM IG+PNLKGVAVVSALFDRQCVLEE  KLHATLVEAT
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEAT 531

BLAST of CsGy5G009580 vs. ExPASy TrEMBL
Match: A0A6J1GZX4 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC111458768 PE=3 SV=1)

HSP 1 Score: 888 bits (2295), Expect = 0.0
Identity = 452/511 (88.45%), Positives = 481/511 (94.13%), Query Frame = 0

Query: 21  MAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAIT 80
           MAMKK +E VVASS+R+E  IPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITA+T
Sbjct: 1   MAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT 60

Query: 81  AQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRAL 140
           AQNTVGVQDVN++PEGFVS+QLKSVLSDMQVDVVKTGMLPSTGI+QV+HQ LKEFPV+AL
Sbjct: 61  AQNTVGVQDVNIIPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIHQRLKEFPVQAL 120

Query: 141 VVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAA 200
           VVDPVMVSTSGDVLA P IISVLQ++LLPMADLVTPNLKEASALLG MPL TISDMRHAA
Sbjct: 121 VVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGMPLNTISDMRHAA 180

Query: 201 MLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAA 260
            LI+QMGSKNVL+KGGDLPDSLDAVDIFFDGKDLHELRSSRI +RNTHGTGCSLASCIAA
Sbjct: 181 TLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA 240

Query: 261 ELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNP 320
           ELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFDHLC LK+RE SSY QG FN 
Sbjct: 241 ELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ 300

Query: 321 ADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICH 380
           +DLFLYAVTDSGMN+RWDRSITDAVK AVEGGATI+QIREKDAKTRDFLE AKSCI+ICH
Sbjct: 301 SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTRDFLEAAKSCIEICH 360

Query: 381 AHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWI 440
            HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLGP+KIIGVSCKT EQAEQAW+
Sbjct: 361 THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL 420

Query: 441 DGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPN 500
           DGADYIGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVVAIGGINH+NAAAVM IG+PN
Sbjct: 421 DGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPN 480

Query: 501 LKGVAVVSALFDRQCVLEEASKLHATLVEAT 531
           LKGVAVVSALFDRQCVLEE  KLHATLVEAT
Sbjct: 481 LKGVAVVSALFDRQCVLEETLKLHATLVEAT 511

BLAST of CsGy5G009580 vs. TAIR 10
Match: AT1G22940.1 (thiamin biosynthesis protein, putative)

HSP 1 Score: 689.9 bits (1779), Expect = 1.6e-198
Identity = 348/490 (71.02%), Positives = 414/490 (84.49%), Query Frame = 0

Query: 41  IPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSK 100
           +P VL+VAGSDSGAGAGIQADLK CAARGVYC++VITA+TAQNT GVQ V+++P  F+S+
Sbjct: 30  VPQVLTVAGSDSGAGAGIQADLKVCAARGVYCASVITAVTAQNTRGVQSVHLLPPEFISE 89

Query: 101 QLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAII 160
           QLKSVLSD + DVVKTGMLPST IV+VL Q L +FPVRALVVDPVMVSTSG VLAG +I+
Sbjct: 90  QLKSVLSDFEFDVVKTGMLPSTEIVEVLLQNLSDFPVRALVVDPVMVSTSGHVLAGSSIL 149

Query: 161 SVLQEKLLPMADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPD 220
           S+ +E+LLP+AD++TPN+KEASALL    + T+++MR AA  +++MG + VL+KGGDLPD
Sbjct: 150 SIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHEMGPRFVLVKGGDLPD 209

Query: 221 SLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIE 280
           S D+VD++FDGK+ HELRS RI +RNTHGTGC+LASCIAAELAKGSSM SAVK +K+F++
Sbjct: 210 SSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKGSSMLSAVKVAKRFVD 269

Query: 281 RALRYSKDINIGHGPQGPFDHLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRS 340
            AL YSKDI IG G QGPFDH   LK ++P S     FNP DLFLYAVTDS MN++W+RS
Sbjct: 270 NALDYSKDIVIGSGMQGPFDHFFGLK-KDPQSSRCSIFNPDDLFLYAVTDSRMNKKWNRS 329

Query: 341 ITDAVKDAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACNA 400
           I DA+K A+EGGATI+Q+REK+A+TR+FLE AK+CI IC +HGV LLINDRIDIALAC+A
Sbjct: 330 IVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVSLLINDRIDIALACDA 389

Query: 401 DGVHVGQSDIPAHEVRRLLGPNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANN 460
           DGVHVGQSD+P   VR LLGP+KIIGVSCKT EQA QAW DGADYIG GGV+PTNTKANN
Sbjct: 390 DGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADYIGSGGVFPTNTKANN 449

Query: 461 LTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEA 520
            T+G+DGLK VC ASKLPVVAIGGI  +NA +VM I  PNLKGVAVVSALFD+ CVL +A
Sbjct: 450 RTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVAVVSALFDQDCVLTQA 509

Query: 521 SKLHATLVEA 531
            KLH TL E+
Sbjct: 510 KKLHKTLKES 518

BLAST of CsGy5G009580 vs. TAIR 10
Match: AT5G37850.1 (pfkB-like carbohydrate kinase family protein)

HSP 1 Score: 43.1 bits (100), Expect = 8.0e-04
Identity = 31/104 (29.81%), Positives = 53/104 (50.96%), Query Frame = 0

Query: 114 VKTGMLPSTG----IVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLP 173
           V TG + S      I++V+++     P    V DPVM    G +     ++ V +EK++P
Sbjct: 125 VLTGYIGSVSFLDTILEVINKLRSVNPNLTYVCDPVM-GDEGKLYVPEELVHVYREKVVP 184

Query: 174 MADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLI 214
           +A ++TPN  EA  L G + + +  D R A  +++  G   V+I
Sbjct: 185 LASMLTPNQFEAEKLTG-LRINSEEDGREACAILHAAGPSKVVI 226

BLAST of CsGy5G009580 vs. TAIR 10
Match: AT5G37850.2 (pfkB-like carbohydrate kinase family protein)

HSP 1 Score: 43.1 bits (100), Expect = 8.0e-04
Identity = 31/104 (29.81%), Positives = 53/104 (50.96%), Query Frame = 0

Query: 114 VKTGMLPSTG----IVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLP 173
           V TG + S      I++V+++     P    V DPVM    G +     ++ V +EK++P
Sbjct: 91  VLTGYIGSVSFLDTILEVINKLRSVNPNLTYVCDPVM-GDEGKLYVPEELVHVYREKVVP 150

Query: 174 MADLVTPNLKEASALLGDMPLTTISDMRHAAMLIYQMGSKNVLI 214
           +A ++TPN  EA  L G + + +  D R A  +++  G   V+I
Sbjct: 151 LASMLTPNQFEAEKLTG-LRINSEEDGREACAILHAAGPSKVVI 192

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q5M731          2.3e-197  71.02         Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thal...
O48881          2.2e-192  69.18         Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus ...
Q2QWK9          7.2e-191  65.55         Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativ...
P61422          2.0e-55   50.59         Thiamine biosynthesis bifunctional protein ThiED OS=Geobacter sulfurreducens (st...
P56904          8.5e-51   48.25         Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase OS=Rhizobium meliloti (st...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_004140412.1  0.0       99.81         thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucumis...
KAE8648089.1    0.0       97.56         hypothetical protein Csa_004689 [Cucumis sativus]
XP_031741473.1  0.0       97.18         thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucumis...
XP_008460243.1  0.0       95.68         PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
XP_038876926.1  0.0       91.92         thiamine biosynthetic bifunctional enzyme TH1, chloroplastic [Benincasa hispida]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A1S3CCI6      0.0       95.68         Thiamine-phosphate pyrophosphorylase OS=Cucumis melo OX=3656 GN=LOC103499122 PE=...
A0A1S3CDB7      0.0       95.51         Thiamine-phosphate pyrophosphorylase OS=Cucumis melo OX=3656 GN=LOC103499122 PE=...
A0A6J1GYV1      0.0       88.32         Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC1114587...
A0A6J1KCE1      0.0       87.76         Thiamine-phosphate pyrophosphorylase OS=Cucurbita maxima OX=3661 GN=LOC111492599...
A0A6J1GZX4      0.0       88.45         Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC1114587...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT1G22940.1     1.6e-198  71.02         thiamin biosynthesis protein, putative
AT5G37850.1     8.0e-04   29.81         pfkB-like carbohydrate kinase family protein
AT5G37850.2     8.0e-04   29.81         pfkB-like carbohydrate kinase family protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                                       Source       Source Term   Source Description              Alignment
IPR013785  Aldolase-type TIM barrel                                              GENE3D       3.20.20.70    Aldolase class I                coord: 320..531; e-value: 1.3E-71; score: 242.0
IPR013749  Pyridoxamine kinase/Phosphomethylpyrimidine kinase                    PFAM         PF08543       Phos_pyr_kin                    coord: 51..295; e-value: 1.1E-88; score: 296.8
IPR034291  Thiamine phosphate synthase                                           TIGRFAM      TIGR00693     TIGR00693                       coord: 325..517; e-value: 5.4E-62; score: 206.5
IPR034291  Thiamine phosphate synthase                                           HAMAP        MF_00097      TMP_synthase                    coord: 322..529; score: 26.825001
IPR004399  Hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase domain  TIGRFAM      TIGR00097     TIGR00097                       coord: 44..301; e-value: 3.0E-98; score: 326.1
IPR004399  Hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase domain  CDD          cd01169       HMPP_kinase                     coord: 43..286; e-value: 1.12093E-117; score: 345.257
IPR029056  Ribokinase-like                                                       GENE3D       3.40.1190.20  -                               coord: 22..319; e-value: 4.8E-99; score: 333.3
IPR029056  Ribokinase-like                                                       SUPERFAMILY  53613         Ribokinase-like                 coord: 41..298
IPR022998  Thiamine phosphate synthase/TenI                                      PFAM         PF02581       TMP-TENI                        coord: 325..510; e-value: 2.3E-63; score: 212.8
IPR022998  Thiamine phosphate synthase/TenI                                      CDD          cd00564       TMP_TenI                        coord: 325..525; e-value: 8.05207E-83; score: 253.98
IPR045029  Hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase         PANTHER      PTHR20858     PHOSPHOMETHYLPYRIMIDINE KINASE  coord: 38..526
IPR036206  Thiamin phosphate synthase superfamily                                SUPERFAMILY  51391         Thiamin phosphate synthase      coord: 321..524
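
The InterPro coordinates above place the kinase-type matches toward the N-terminal part of the protein and the thiamine phosphate synthase matches toward the C-terminal part. As a rough illustration, the sketch below takes the two PFAM entries (PF08543 at 51..295 and PF02581 at 325..510), computes their lengths, and checks that they do not overlap. The helper names `parse_coord`, `span_length` and `overlaps` are hypothetical, and reading the coordinates as 1-based inclusive residue positions is an assumption rather than something this page states.

```python
def parse_coord(text):
    """Turn 'coord: 51..295' (or '51..295') into an inclusive (start, end) pair."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def span_length(span):
    """Number of residues covered, assuming inclusive coordinates."""
    start, end = span
    return end - start + 1


def overlaps(a, b):
    """True when two inclusive spans share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


kinase = parse_coord("coord: 51..295")     # PFAM PF08543 (Phos_pyr_kin)
synthase = parse_coord("coord: 325..510")  # PFAM PF02581 (TMP-TENI)

print(span_length(kinase), span_length(synthase), overlaps(kinase, synthase))
# prints: 245 186 False
```

The same helpers could be applied to the other rows to compare how different member databases delimit each region.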

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy5G009580.2  CsGy5G009580.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016310      phosphorylation
biological_process  GO:0009228      thiamine biosynthetic process
biological_process  GO:0009229      thiamine diphosphate biosynthetic process
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0008902      hydroxymethylpyrimidine kinase activity
molecular_function  GO:0008972      phosphomethylpyrimidine kinase activity
molecular_function  GO:0004789      thiamine-phosphate diphosphorylase activity
molecular_function  GO:0003824      catalytic activity