Cla97C11G207240 (gene) Watermelon (97103) v2

Name: Cla97C11G207240
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic-like
Location: Cla97Chr11 : 1092240 .. 1096230 (-)
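
The location field above can be split into chromosome, coordinates, and strand with a short script. This is a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Location value copied from the record above (label removed).
LOCATION = "Cla97Chr11 : 1092240 .. 1096230 (-)"


def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location(LOCATION)

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the feature spans 3991 bp on the minus strand of Cla97Chr11.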
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGTCTACCAGTGGAGATGTACTAGCTGGTCCTACCATTGTCTCAGTGTTACAGTAAGTTGATTCTCTCTTTAAAGGGAATATCGTCAGTCCTTCAAGCTGGTTGTATTTATATGATTATGTTAATTTCTTGTGCTCCACGTTTTCATTTTTTAAATATCTTAATAATTAGGACAGCGTTTTCTGGTTTTATTAAAAGGCTTCTGAAGCTTATGTGTGGAGCTCAAACTTTTAGCATGAAAGCTTACATCCGATGCACCTTTATTAAAACCTATTTGAGGCTTGAAATCCTTATGCCTCATACAATAATTTGAAAAATAATACTATGACGTTTCAAGCTCATGTTTGATCTTCATATGTTAGTTTTTCATAATTTTACAACACTTCTTTTGTTTGCAGGGAAGAGCTTCTACCAATGGCTGACTTGGTAACTCCAAATTTGAAGGAAGCATCTGCCTTACTTGGTGGTATGCCACTTAAAACAATTTCTGACATGCGTCATGCTGCAACATTAATCCATCAGATGGGATCAAAGTAAGTTTTTAATCCATTAGATAAGCTTGTCTTCTTTTGTAATTTCATTGTCATATTAATAATAATAATAATCCCAAATTCAATTTTCTAACCAACTAAAATCAACCCCGCACATGTTAGATTCCTCAGTTTCATACTGTCATTTTTAGGTTGTAAGGTTGTTGAAGGTTTCATTGTGTCGGCCTGACCGTTCTAAAGACAATATCCATCGTTGGTGTTTGATTTAATGGTTTTATATTTATCAAAATTTAAGCTACTCCCGATTAATGCTGAAGAGTGTTGTGTTGTTTTCAGGAATGTACTTGTCAAAGGTGGGGACCTTCCCGATTCATTGGATGCCGTCGATATATTCTTTGATGGTAAGTACGTATTGCACAATTCTGACAGTTCGATATTATTTTTTGTTTTGAAGGAAGCATATTCATTGAATCGAAATTTGAACTAATTTTCACAGGCAAGGATTTGCATGAGCTACGTTCTTCGCGCATAAAGACATGCAACACTCATGGTACTGGATGCAGCTTAGCATCATGCATCGCAGCTGAACTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGTAATTAATTGTCTATGAATATTTTCTTTAATAACTTCAGCCATAAGAAGATTCAAGTCTCTAGGGCTAATAATACAAGTTCCTACCGAATCGCTAAATAAATTAATAAATAAAAACTTATGAACCATTTTCAATTTGTTGTAGGATAATAATTGTTTAAAATAATGCTATGTTCACTATGTATGCTGCTCACTTACCCTAATGGGTGGTCAAGGCTTGGGATGTTTGGAGATACCTCCATCCTGCGTTCAACCGTGCAAGTTATGAGTTAAAAACTTAGGTCCTATTTGATAAATAGTTCTATTTGGTTTTTGAAAGTACTTGTTTTTTTGTTTTTTTTTGTTTTTCACATTTCTTTACCATGGTTTTCATCTTTCTTAGAAACATTTGATAATTCCATAACCAAATTCCAAAAACAAACACAAGGTTTGACTACTTTTTTTAGTTTTCAAAACATGACTTGAATTTTGATAACACTCCTTGAAGGTAGATAACATAATAAAGAAACTCATGGGTTGAAATAGTGTTTATAAATTGAATTTTTAGAAACAAAGAACCAGAAACAAAAGTCCCATTTTTTGGTTTTGAAACTTGTGCTTGTTTTTTCACAAATTTTTAAGGGATTTTTTCAAATATAGTAAAATGAGTCAAACTGTCTACAAATATAGAAAAATTTCACTATTTGTCAGTGATAGATTGTAATAGCGATAGACTTCTTTCACTGAAGCGATAAAAGTCTATCGCGATTTATCACTGATAGATAGTGAAATTTTTCTATATTTGTAAATAGTTTGATATTTTTTCTATTTATAAGAATTTTTCAATTTTTAATCATGATTTTCATCTTTCCTATCTAAACATTTTAATTCTTAATCAAATTCC
TAAAACAAAAGTTTTTAAATATTACTTTTTATTATTTTATTTTATTTTTGTAATTTTTAAAATTTGGTTTGAATTTTAAAAACACTCTTAAAAGTAAAGAACAAAATATAGAAACTAATTAGAGAAAGTAGCCTTGATAGGCCTAATTTTGAAAAATTAAAAATCAAATATCAAATATGAAATGGTTAACAAATGGGGTTTAATAGTTATCAAATGAGGCCTTGAATTGGAAAACTAGAGATTCAAGTGTAAAATAGAGTTAATGGTCCATATTTCATTTCTTTTTAATGAGTTTGGTAGTAGACTTTGAATATGCACTGCGTTTGGGTCATTAGAAATTAAAATGCAAAACCAGAAAATAAACCCACCGAACTGAACCTCACAGAATTTATTTGATTTTGTTAATCTAAAAACTAGAAATTGAATCTAGATTAGATATCAAATACGCTTTTAACACTTGTCTTCTCTCAAACCTAGTTAGGGTGGGAAAAGCGTCTTTGAAACTAGTTAGGAGCATAATACAAGGAATAATAGCCCCTGACCGTTGACACCCTTGATCAAAAAGGAGAAAAGAAAATAAAAATACCTGGCTTCAAATAAACATCTTGTTATGGTGTTGTGTTTTTTCGAGCCCTAATCAACATATGGTTGCCCCTCAACCCAATTAGGAGTTTTGAATTCTTTGATTATTTTTGTCTTCTTTTTGGATGAAAAGACATATCATTCTCACTCCATCCATATCCATATTTAATATGGCTCGACTCAAAACGCAAATAGGCTTTGTCGCTGATTAACGGACTTGATGACTGTCCACAATTAAGGTTTTATTTATCTGGATCTTTGCAGGCAAGCAAACAGTTCATTGAAAGAGCATTGAAGTACAGTAAGGACATCAGCATTGGAAATGGACCTCAAGGCCCATTTGATCATCTATGTCGTCTCAAGAGTCAAGAACATAGTTCCTACAGACAGGGGTATTTCAATCCAGCTGACTTATTCTTGTATGCTGTTACGGACTCAGGTATGAATAAGCGTTGGGACCGTTCTATTACCGATGCTGTTAAAGCTGCAGTGGAAGGAGGTGCTACTATTGTTCAAATAAGGTTTGTCTAATGTCTTCCATCTATCAATTTGTTATTTTGTTCGTTAAGAATGTTTTTCACAAGTAGATAGATAATACAAGTCATTACTTCAAATGTTTGGAATTCCTTGTTGCATTTTAGTATCCTCTCAACACCTCAGTTCAAAGGTTAAATGGCTTTAGGACTTACCCATAAGGTGTAGGCTGAATGCTCCTAAGGCGAGTGATAGTGAAGTAGATCTCTAGTAAATAGAACTCGACTGTAAGATTTGAATATCTTTACTTTTCAAGAAGTTAAATACTTGGCATACTTATGAAGCTTGTATTCAATTGTTAATATTCTTCCCATGATTAATGTTCTCTTACTTCTATGAGCAGGGAAAAGGATGCTAAAACTCGTGATTTCTTGGAAGCAGCAAAGGCATGTATGAAGATTTGTCACGCACATGGAGTTCCATTGTTGATCAACGATCGCATTGACATCGCACTTGCGTGTGATGCTGATGGTGTACACGTTGGTCAGTCCGATATTCCTGCTCATGAAGTTCGCAGTCTTCTGGGCCCTAATAAGATCATCGGTGTCTCGTGCAAGACACCGGAGCAAGCAGAACAGGCATGGCTTGATGGTGCAGATTACATTGGGTGCGGTGGAGTTTATCCCACAAACACAAAAGCAAACAATCTGACTGTTGGGATTGATGGATTGAAAAGAGTTTGCTTAGCTTCCAAGTTACCTGTGGTTGCAATTGGTGGTATTAATCAAAGTAATGCAGCAGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCAGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGGCCTTAAAGTTACATGCAACTTTGGTGGAGGCTACAACAACAAATATATGA

mRNA sequence

ATGGTGTCTACCAGTGGAGATGTACTAGCTGGTCCTACCATTGTCTCAGTGTTACAGGAAGAGCTTCTACCAATGGCTGACTTGGTAACTCCAAATTTGAAGGAAGCATCTGCCTTACTTGGTGGTATGCCACTTAAAACAATTTCTGACATGCGTCATGCTGCAACATTAATCCATCAGATGGGATCAAAGAATGTACTTGTCAAAGGTGGGGACCTTCCCGATTCATTGGATGCCGTCGATATATTCTTTGATGGCAAGGATTTGCATGAGCTACGTTCTTCGCGCATAAAGACATGCAACACTCATGGTACTGGATGCAGCTTAGCATCATGCATCGCAGCTGAACTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGCAAGCAAACAGTTCATTGAAAGAGCATTGAAGTACAGTAAGGACATCAGCATTGGAAATGGACCTCAAGGCCCATTTGATCATCTATGTCGTCTCAAGAGTCAAGAACATAGTTCCTACAGACAGGGGTATTTCAATCCAGCTGACTTATTCTTGTATGCTGTTACGGACTCAGGTATGAATAAGCGTTGGGACCGTTCTATTACCGATGCTGTTAAAGCTGCAGTGGAAGGAGGTGCTACTATTGTTCAAATAAGGGAAAAGGATGCTAAAACTCGTGATTTCTTGGAAGCAGCAAAGGCATGTATGAAGATTTGTCACGCACATGGAGTTCCATTGTTGATCAACGATCGCATTGACATCGCACTTGCGTGTGATGCTGATGGTGTACACGTTGGTCAGTCCGATATTCCTGCTCATGAAGTTCGCAGTCTTCTGGGCCCTAATAAGATCATCGGTGTCTCGTGCAAGACACCGGAGCAAGCAGAACAGGCATGGCTTGATGGTGCAGATTACATTGGGTGCGGTGGAGTTTATCCCACAAACACAAAAGCAAACAATCTGACTGTTGGGATTGATGGATTGAAAAGAGTTTGCTTAGCTTCCAAGTTACCTGTGGTTGCAATTGGTGGTATTAATCAAAGTAATGCAGCAGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCAGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGGCCTTAAAGTTACATGCAACTTTGGTGGAGGCTACAACAACAAATATATGA

Coding sequence (CDS)

ATGGTGTCTACCAGTGGAGATGTACTAGCTGGTCCTACCATTGTCTCAGTGTTACAGGAAGAGCTTCTACCAATGGCTGACTTGGTAACTCCAAATTTGAAGGAAGCATCTGCCTTACTTGGTGGTATGCCACTTAAAACAATTTCTGACATGCGTCATGCTGCAACATTAATCCATCAGATGGGATCAAAGAATGTACTTGTCAAAGGTGGGGACCTTCCCGATTCATTGGATGCCGTCGATATATTCTTTGATGGCAAGGATTTGCATGAGCTACGTTCTTCGCGCATAAAGACATGCAACACTCATGGTACTGGATGCAGCTTAGCATCATGCATCGCAGCTGAACTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGCAAGCAAACAGTTCATTGAAAGAGCATTGAAGTACAGTAAGGACATCAGCATTGGAAATGGACCTCAAGGCCCATTTGATCATCTATGTCGTCTCAAGAGTCAAGAACATAGTTCCTACAGACAGGGGTATTTCAATCCAGCTGACTTATTCTTGTATGCTGTTACGGACTCAGGTATGAATAAGCGTTGGGACCGTTCTATTACCGATGCTGTTAAAGCTGCAGTGGAAGGAGGTGCTACTATTGTTCAAATAAGGGAAAAGGATGCTAAAACTCGTGATTTCTTGGAAGCAGCAAAGGCATGTATGAAGATTTGTCACGCACATGGAGTTCCATTGTTGATCAACGATCGCATTGACATCGCACTTGCGTGTGATGCTGATGGTGTACACGTTGGTCAGTCCGATATTCCTGCTCATGAAGTTCGCAGTCTTCTGGGCCCTAATAAGATCATCGGTGTCTCGTGCAAGACACCGGAGCAAGCAGAACAGGCATGGCTTGATGGTGCAGATTACATTGGGTGCGGTGGAGTTTATCCCACAAACACAAAAGCAAACAATCTGACTGTTGGGATTGATGGATTGAAAAGAGTTTGCTTAGCTTCCAAGTTACCTGTGGTTGCAATTGGTGGTATTAATCAAAGTAATGCAGCAGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCAGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGGCCTTAAAGTTACATGCAACTTTGGTGGAGGCTACAACAACAAATATATGA

Protein sequence

MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEEALKLHATLVEATTTNI
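
The protein sequence above should be the translation of the coding sequence. The sketch below checks this on the first 60 and last 24 nucleotides of the CDS, using the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative, and the use of the standard (nuclear) code is an assumption rather than something the record states.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 60 nt and last 24 nt of the CDS listed above.
CDS_START = "ATGGTGTCTACCAGTGGAGATGTACTAGCTGGTCCTACCATTGTCTCAGTGTTACAGGAA"
CDS_END = "GAGGCTACAACAACAAATATATGA"


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate(CDS_START))  # MVSTSGDVLAGPTIVSVLQE
print(translate(CDS_END))    # EATTTNI
```

Both fragments agree with the start and end of the protein sequence listed in the record, and the CDS ends in the stop codon TGA.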
BLAST of Cla97C11G207240 vs. NCBI nr
Match: XP_008460243.1 (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 723.8 bits (1867), Expect = 3.2e-205
Identity = 361/390 (92.56%), Positives = 378/390 (96.92%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGPTI+SVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ
Sbjct: 146 MVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 205

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIK+ NTHGTGCSLASCI+AELAKG
Sbjct: 206 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCISAELAKG 265

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVKASKQFIERAL+YSKDI+IG+GPQGPFDHLCRLKS+E SSY QG FNP DLFL
Sbjct: 266 SSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCRLKSREQSSYSQGCFNPTDLFL 325

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMN+RWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLE AK+C+KICHAHGVP
Sbjct: 326 YAVTDSGMNERWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVP 385

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRIDIALACDADGVHVGQSDIPAHEVR LLGPNK+IGVSCKT EQAEQAW+DGADY
Sbjct: 386 LLINDRIDIALACDADGVHVGQSDIPAHEVRRLLGPNKVIGVSCKTMEQAEQAWIDGADY 445

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGIN +NAAAVM IG+PNL+GVA
Sbjct: 446 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLRGVA 505

Query: 361 VVSALFDRQCVLEEALKLHATLVEATTTNI 391
           VVSALFDRQCVLE A KLHATLVEATT+N+
Sbjct: 506 VVSALFDRQCVLEAASKLHATLVEATTSNV 535
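
The percentages in the HSP summary for this hit are the match counts divided by the alignment length. A small sketch of that arithmetic follows; the helper name `percent` is hypothetical, and two-decimal formatting is assumed because it reproduces the figures shown.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage with two decimals."""
    return f"{100 * matches / aligned_length:.2f}%"


# Counts taken from the HSP summary of the first NCBI nr hit.
print("Identity :", percent(361, 390))  # 92.56%
print("Positives:", percent(378, 390))  # 96.92%
```

The same calculation applies to every hit in this record; only the counts and the alignment length change.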

BLAST of Cla97C11G207240 vs. NCBI nr
Match: XP_008460244.1 (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 723.8 bits (1867), Expect = 3.2e-205
Identity = 361/390 (92.56%), Positives = 378/390 (96.92%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGPTI+SVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ
Sbjct: 126 MVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 185

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIK+ NTHGTGCSLASCI+AELAKG
Sbjct: 186 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCISAELAKG 245

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVKASKQFIERAL+YSKDI+IG+GPQGPFDHLCRLKS+E SSY QG FNP DLFL
Sbjct: 246 SSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCRLKSREQSSYSQGCFNPTDLFL 305

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMN+RWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLE AK+C+KICHAHGVP
Sbjct: 306 YAVTDSGMNERWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVP 365

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRIDIALACDADGVHVGQSDIPAHEVR LLGPNK+IGVSCKT EQAEQAW+DGADY
Sbjct: 366 LLINDRIDIALACDADGVHVGQSDIPAHEVRRLLGPNKVIGVSCKTMEQAEQAWIDGADY 425

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGIN +NAAAVM IG+PNL+GVA
Sbjct: 426 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLRGVA 485

Query: 361 VVSALFDRQCVLEEALKLHATLVEATTTNI 391
           VVSALFDRQCVLE A KLHATLVEATT+N+
Sbjct: 486 VVSALFDRQCVLEAASKLHATLVEATTSNV 515
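
In each alignment row, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies one row under that convention. The function name `tally_midline` is hypothetical, and the midline is the first row of the hit above.

```python
# Midline of the first alignment row (query residues 1-60) of the hit above.
MIDLINE = "MVSTSGDVLAGPTI+SVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ"


def tally_midline(midline):
    """Count identities and positives in one BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical
    conservative = midline.count("+")                 # '+' = similar residue
    return {
        "identities": identities,
        "positives": identities + conservative,
        "length": len(midline),
    }


print(tally_midline(MIDLINE))
```

For this row the tally is 59 identities and 60 positives over 60 columns. Summing the rows of a hit should reproduce the counts in its HSP summary, provided the midline whitespace has survived extraction intact.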

BLAST of Cla97C11G207240 vs. NCBI nr
Match: XP_022957338.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucurbita moschata])

HSP 1 Score: 715.3 bits (1845), Expect = 1.1e-202
Identity = 358/386 (92.75%), Positives = 370/386 (95.85%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLA PTI+SVLQ+ELLPMADLVTPNLKEASALLGGMPL TISDMRHAATLIHQ
Sbjct: 126 MVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGMPLNTISDMRHAATLIHQ 185

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRI T NTHGTGCSLASCIAAELAKG
Sbjct: 186 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKG 245

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVK SKQFIERAL YSKDISIGNGPQGPFDHLCRLKS+E SSYRQG FN +DLFL
Sbjct: 246 SSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFL 305

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMNKRWDRSITDAVKAAVEGGATI+QIREKDAKTRDFLEAAK+C++ICH HGVP
Sbjct: 306 YAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTRDFLEAAKSCIEICHTHGVP 365

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRID+ALAC ADGVH+GQSDIP H  RSLLGP+KIIGVSCKTPEQAEQAWLDGADY
Sbjct: 366 LLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADY 425

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVVAIGGIN SNAAAVMEIGVPNLKGVA
Sbjct: 426 IGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVA 485

Query: 361 VVSALFDRQCVLEEALKLHATLVEAT 387
           VVSALFDRQCVLEE LKLHATLVEAT
Sbjct: 486 VVSALFDRQCVLEETLKLHATLVEAT 511

BLAST of Cla97C11G207240 vs. NCBI nr
Match: XP_022957337.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbita moschata])

HSP 1 Score: 715.3 bits (1845), Expect = 1.1e-202
Identity = 358/386 (92.75%), Positives = 370/386 (95.85%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLA PTI+SVLQ+ELLPMADLVTPNLKEASALLGGMPL TISDMRHAATLIHQ
Sbjct: 146 MVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGMPLNTISDMRHAATLIHQ 205

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRI T NTHGTGCSLASCIAAELAKG
Sbjct: 206 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKG 265

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVK SKQFIERAL YSKDISIGNGPQGPFDHLCRLKS+E SSYRQG FN +DLFL
Sbjct: 266 SSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFL 325

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMNKRWDRSITDAVKAAVEGGATI+QIREKDAKTRDFLEAAK+C++ICH HGVP
Sbjct: 326 YAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTRDFLEAAKSCIEICHTHGVP 385

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRID+ALAC ADGVH+GQSDIP H  RSLLGP+KIIGVSCKTPEQAEQAWLDGADY
Sbjct: 386 LLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADY 445

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVVAIGGIN SNAAAVMEIGVPNLKGVA
Sbjct: 446 IGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVA 505

Query: 361 VVSALFDRQCVLEEALKLHATLVEAT 387
           VVSALFDRQCVLEE LKLHATLVEAT
Sbjct: 506 VVSALFDRQCVLEETLKLHATLVEAT 531

BLAST of Cla97C11G207240 vs. NCBI nr
Match: XP_023550227.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 713.4 bits (1840), Expect = 4.3e-202
Identity = 357/386 (92.49%), Positives = 369/386 (95.60%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLA PTI+SVLQ+ELLPMADLVTPNLKEASALLGGMPL TISDMRHAATLIHQ
Sbjct: 126 MVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGMPLNTISDMRHAATLIHQ 185

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRI T NTHGTGCSLASCIAAELAKG
Sbjct: 186 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKG 245

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVK SKQFIERAL YSKDISIGNGPQGPFDHLCRLKS+E SSYRQG FN +DLFL
Sbjct: 246 SSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFL 305

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMNKRWDRSITDAVKAAVEGGATI+QIREKDAKT DFLEAAK+C++ICH HGVP
Sbjct: 306 YAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVP 365

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRID+ALAC ADGVH+GQSDIP H  RSLLGP+KIIGVSCKTPEQAEQAWLDGADY
Sbjct: 366 LLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADY 425

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVVAIGGIN SNAAAVMEIGVPNLKGVA
Sbjct: 426 IGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVA 485

Query: 361 VVSALFDRQCVLEEALKLHATLVEAT 387
           VVSALFDRQCVLEE LKLHATLVEAT
Sbjct: 486 VVSALFDRQCVLEETLKLHATLVEAT 511

BLAST of Cla97C11G207240 vs. TrEMBL
Match: tr|A0A1S3CCI6|A0A1S3CCI6_CUCME (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499122 PE=3 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 2.1e-205
Identity = 361/390 (92.56%), Positives = 378/390 (96.92%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGPTI+SVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ
Sbjct: 146 MVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 205

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIK+ NTHGTGCSLASCI+AELAKG
Sbjct: 206 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCISAELAKG 265

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVKASKQFIERAL+YSKDI+IG+GPQGPFDHLCRLKS+E SSY QG FNP DLFL
Sbjct: 266 SSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCRLKSREQSSYSQGCFNPTDLFL 325

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMN+RWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLE AK+C+KICHAHGVP
Sbjct: 326 YAVTDSGMNERWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVP 385

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRIDIALACDADGVHVGQSDIPAHEVR LLGPNK+IGVSCKT EQAEQAW+DGADY
Sbjct: 386 LLINDRIDIALACDADGVHVGQSDIPAHEVRRLLGPNKVIGVSCKTMEQAEQAWIDGADY 445

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGIN +NAAAVM IG+PNL+GVA
Sbjct: 446 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLRGVA 505

Query: 361 VVSALFDRQCVLEEALKLHATLVEATTTNI 391
           VVSALFDRQCVLE A KLHATLVEATT+N+
Sbjct: 506 VVSALFDRQCVLEAASKLHATLVEATTSNV 535

BLAST of Cla97C11G207240 vs. TrEMBL
Match: tr|A0A1S3CDB7|A0A1S3CDB7_CUCME (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499122 PE=3 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 2.1e-205
Identity = 361/390 (92.56%), Positives = 378/390 (96.92%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGPTI+SVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ
Sbjct: 126 MVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 185

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIK+ NTHGTGCSLASCI+AELAKG
Sbjct: 186 MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCISAELAKG 245

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSMFSAVKASKQFIERAL+YSKDI+IG+GPQGPFDHLCRLKS+E SSY QG FNP DLFL
Sbjct: 246 SSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCRLKSREQSSYSQGCFNPTDLFL 305

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMN+RWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLE AK+C+KICHAHGVP
Sbjct: 306 YAVTDSGMNERWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEVAKSCIKICHAHGVP 365

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRIDIALACDADGVHVGQSDIPAHEVR LLGPNK+IGVSCKT EQAEQAW+DGADY
Sbjct: 366 LLINDRIDIALACDADGVHVGQSDIPAHEVRRLLGPNKVIGVSCKTMEQAEQAWIDGADY 425

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGIN +NAAAVM IG+PNL+GVA
Sbjct: 426 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPNLRGVA 485

Query: 361 VVSALFDRQCVLEEALKLHATLVEATTTNI 391
           VVSALFDRQCVLE A KLHATLVEATT+N+
Sbjct: 486 VVSALFDRQCVLEAASKLHATLVEATTSNV 515

BLAST of Cla97C11G207240 vs. TrEMBL
Match: tr|D7TGZ0|D7TGZ0_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_12s0035g00320 PE=3 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 2.0e-176
Identity = 311/385 (80.78%), Positives = 347/385 (90.13%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGP+I++  +EELLPMAD+VTPNLKEASALLGG+ L+T+SDM  AA LIH 
Sbjct: 108 MVSTSGDVLAGPSILAAFREELLPMADIVTPNLKEASALLGGLQLETVSDMCTAAKLIHD 167

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MG +NVLVKGGDLP SLDAVDIFFDG D +ELRSSRIKT NTHGTGC+LASCIAAELAKG
Sbjct: 168 MGPRNVLVKGGDLPSSLDAVDIFFDGDDFYELRSSRIKTRNTHGTGCTLASCIAAELAKG 227

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           S + SAVKA+K +IE AL YSKDI+IGNG QGPFDHL +LKS   +S+R+  FNPA+LFL
Sbjct: 228 SQILSAVKAAKHYIETALDYSKDIAIGNGFQGPFDHLLKLKSNIRNSFRKQAFNPANLFL 287

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMNK+W RSIT+AVKAA+EGGATIVQ+REKDA+TRDFLEAAKAC++ICH+HGVP
Sbjct: 288 YAVTDSGMNKKWGRSITEAVKAAIEGGATIVQLREKDAETRDFLEAAKACVEICHSHGVP 347

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRID+ALACDADGVHVGQSDIPA  VR+LLGP KIIGVSCKTPEQAE+AW+DGADY
Sbjct: 348 LLINDRIDVALACDADGVHVGQSDIPARVVRTLLGPEKIIGVSCKTPEQAEKAWIDGADY 407

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANN+TVG+DGLK VCLASKLPVVAIGGIN SNA  VMEIGVPNLKGVA
Sbjct: 408 IGCGGVYPTNTKANNITVGLDGLKTVCLASKLPVVAIGGINASNARTVMEIGVPNLKGVA 467

Query: 361 VVSALFDRQCVLEEALKLHATLVEA 386
           VVSALFDR+CVL E  KLH  L +A
Sbjct: 468 VVSALFDRECVLTETQKLHGDLTQA 492

BLAST of Cla97C11G207240 vs. TrEMBL
Match: tr|A0A2N9ITQ5|A0A2N9ITQ5_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS55435 PE=3 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 2.5e-174
Identity = 307/390 (78.72%), Positives = 344/390 (88.21%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGP+I++  ++ LLPMAD+VTPNLKEASALLGG  L+T++DMR AA L+H 
Sbjct: 123 MVSTSGDVLAGPSILAGFRDHLLPMADIVTPNLKEASALLGGQQLETVADMRSAAKLLHD 182

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MG +NVLVKGGDLPDSLDAVDIFFDGKD HELRSSRIKT NTHGTGCSLASCIAAELAKG
Sbjct: 183 MGPRNVLVKGGDLPDSLDAVDIFFDGKDFHELRSSRIKTRNTHGTGCSLASCIAAELAKG 242

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSM  AVK +K +IE AL YSKDI IGNGPQGPFDHL RLKS   +S RQ  F+P+DLFL
Sbjct: 243 SSMLPAVKVAKHYIETALDYSKDIVIGNGPQGPFDHLLRLKSYVQNSCRQVGFSPSDLFL 302

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTD GMNK+W RSITDAVKAA+ GGATI+Q+REKD +T+DF EAAKAC+KIC  HGVP
Sbjct: 303 YAVTDPGMNKKWGRSITDAVKAAINGGATIIQLREKDVETQDFFEAAKACLKICRFHGVP 362

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRID+ALACDADGVHVGQSD+PA   R+LLGP+KIIGVSCKTPEQA QAW++GADY
Sbjct: 363 LLINDRIDVALACDADGVHVGQSDMPARVARTLLGPDKIIGVSCKTPEQAHQAWIEGADY 422

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTKANNLTVG++GLK VC ASKLPVVAIGGI+ SNA++VMEIGVPNLKGVA
Sbjct: 423 IGCGGVYPTNTKANNLTVGLNGLKTVCAASKLPVVAIGGISASNASSVMEIGVPNLKGVA 482

Query: 361 VVSALFDRQCVLEEALKLHATLVEATTTNI 391
           VVSALFDR+C+L E  KLHA L EAT  +I
Sbjct: 483 VVSALFDRECILTETRKLHALLKEATAMSI 512

BLAST of Cla97C11G207240 vs. TrEMBL
Match: tr|A0A2P5B8J3|A0A2P5B8J3_9ROSA (Thiamine phosphate synthase OS=Trema orientalis OX=63057 GN=TorRG33x02_329480 PE=3 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 1.2e-173
Identity = 305/387 (78.81%), Positives = 344/387 (88.89%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGDVLAGP++++V +E+LLP+AD+VTPNLKEASALLGG+ L+TISDM  AA L+H 
Sbjct: 166 MVSTSGDVLAGPSVLTVFREQLLPVADIVTPNLKEASALLGGLKLETISDMHAAAKLLHD 225

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MG +NVLVKGGDLPDSLDA+DIFFDG+++ ELRSSRIKT NTHGTGCSLASCIAAELAKG
Sbjct: 226 MGPRNVLVKGGDLPDSLDAIDIFFDGENIFELRSSRIKTRNTHGTGCSLASCIAAELAKG 285

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSM  AVKA+K++IE AL YS+DI IGNGPQGPFDHL RLK   H+S RQ  F+P+DLFL
Sbjct: 286 SSMLQAVKAAKRYIEAALGYSRDIVIGNGPQGPFDHLLRLKRNIHNSSRQKGFDPSDLFL 345

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDS MNK+W  SI DAVKAA+EGGATIVQ+REKD +TRDFLEAAK+C+KIC +HGVP
Sbjct: 346 YAVTDSRMNKKWGHSIADAVKAAIEGGATIVQLREKDIETRDFLEAAKSCLKICRSHGVP 405

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDR+D+ALACDADGVHVGQSD+PAH  R+LLGP KIIGVSCKTPEQAEQAW DGADY
Sbjct: 406 LLINDRVDVALACDADGVHVGQSDMPAHIARTLLGPEKIIGVSCKTPEQAEQAWFDGADY 465

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGVYPTNTK NN+TVG+DGLK VCLASKLPVVAIGGI  SNA AVMEI VP LKGVA
Sbjct: 466 IGCGGVYPTNTKENNVTVGLDGLKTVCLASKLPVVAIGGIGLSNAHAVMEINVPQLKGVA 525

Query: 361 VVSALFDRQCVLEEALKLHATLVEATT 388
           VVSALFDRQCV  E  KLH+ L+EAT+
Sbjct: 526 VVSALFDRQCVSTETRKLHSVLIEATS 552

BLAST of Cla97C11G207240 vs. Swiss-Prot
Match: sp|Q5M731|TPS1L_ARATH (Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TH1 PE=1 SV=1)

HSP 1 Score: 557.8 bits (1436), Expect = 9.8e-158
Identity = 278/385 (72.21%), Positives = 325/385 (84.42%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSG VLAG +I+S+ +E LLP+AD++TPN+KEASALL G  ++T+++MR AA  +H+
Sbjct: 135 MVSTSGHVLAGSSILSIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHE 194

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MG + VLVKGGDLPDS D+VD++FDGK+ HELRS RI T NTHGTGC+LASCIAAELAKG
Sbjct: 195 MGPRFVLVKGGDLPDSSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKG 254

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSM SAVK +K+F++ AL YSKDI IG+G QGPFDH   LK    SS R   FNP DLFL
Sbjct: 255 SSMLSAVKVAKRFVDNALDYSKDIVIGSGMQGPFDHFFGLKKDPQSS-RCSIFNPDDLFL 314

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDS MNK+W+RSI DA+KAA+EGGATI+Q+REK+A+TR+FLE AKAC+ IC +HGV 
Sbjct: 315 YAVTDSRMNKKWNRSIVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVS 374

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRIDIALACDADGVHVGQSD+P   VRSLLGP+KIIGVSCKTPEQA QAW DGADY
Sbjct: 375 LLINDRIDIALACDADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADY 434

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IG GGV+PTNTKANN T+G+DGLK VC ASKLPVVAIGGI  SNA +VM+I  PNLKGVA
Sbjct: 435 IGSGGVFPTNTKANNRTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVA 494

Query: 361 VVSALFDRQCVLEEALKLHATLVEA 386
           VVSALFD+ CVL +A KLH TL E+
Sbjct: 495 VVSALFDQDCVLTQAKKLHKTLKES 518

BLAST of Cla97C11G207240 vs. Swiss-Prot
Match: sp|O48881|TPS1_BRANA (Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus OX=3708 GN=BTH1 PE=1 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 4.6e-155
Identity = 272/385 (70.65%), Positives = 324/385 (84.16%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSG VLAG +I+S+ +E LLP+AD++TPN+KEASALLGG+ ++T+++MR AA  +HQ
Sbjct: 137 MVSTSGHVLAGSSILSIFRERLLPLADIITPNVKEASALLGGVRIQTVAEMRSAAKSLHQ 196

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MG + VLVKGGDLPDS D+VD++FDG + HEL S RI T NTHGTGC+LASCIAAELAKG
Sbjct: 197 MGPRFVLVKGGDLPDSSDSVDVYFDGNEFHELHSPRIATRNTHGTGCTLASCIAAELAKG 256

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           S+M SAVK +K+F++ AL YSKDI IG+G QGPFDH   LK  +  SYRQ  F P DLFL
Sbjct: 257 SNMLSAVKVAKRFVDSALNYSKDIVIGSGMQGPFDHFLSLKDPQ--SYRQSTFKPDDLFL 316

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDS MNK+W+RSI DAVKAA+EGGATI+Q+REK+A+TR+FLE AK+C+ IC ++GV 
Sbjct: 317 YAVTDSRMNKKWNRSIVDAVKAAIEGGATIIQLREKEAETREFLEEAKSCVDICRSNGVC 376

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDR DIA+A DADGVHVGQSD+P   VRSLLGP+KIIGVSCKT EQA QAW DGADY
Sbjct: 377 LLINDRFDIAIALDADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTQEQAHQAWKDGADY 436

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IG GGV+PTNTKANN T+G+DGL+ VC ASKLPVVAIGGI  SNA +VM IG PNLKGVA
Sbjct: 437 IGSGGVFPTNTKANNRTIGLDGLREVCKASKLPVVAIGGIGISNAESVMRIGEPNLKGVA 496

Query: 361 VVSALFDRQCVLEEALKLHATLVEA 386
           VVSALFD++CVL +A KLH TL E+
Sbjct: 497 VVSALFDQECVLTQAKKLHKTLTES 519

BLAST of Cla97C11G207240 vs. Swiss-Prot
Match: sp|Q2QWK9|TPS1_ORYSJ (Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=Os12g0192500 PE=2 SV=2)

HSP 1 Score: 537.0 bits (1382), Expect = 1.8e-151
Identity = 265/388 (68.30%), Positives = 321/388 (82.73%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSGD L+  + +SV ++EL  MAD+VTPN+KEAS LLGG+ L+T+SDMR+AA  I++
Sbjct: 161 MVSTSGDTLSESSTLSVYRDELFAMADIVTPNVKEASRLLGGVSLRTVSDMRNAAESIYK 220

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
            G K+VLVKGGD+ +S DA D+FFDGK+  EL + RIKT NTHGTGC+LASCIA+ELAKG
Sbjct: 221 FGPKHVLVKGGDMLESSDATDVFFDGKEFIELHAHRIKTHNTHGTGCTLASCIASELAKG 280

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           ++M  AV+ +K F+E AL +SKD+ +GNGPQGPFDHL +LK   ++   Q  F P  LFL
Sbjct: 281 ATMLHAVQVAKNFVESALHHSKDLVVGNGPQGPFDHLFKLKCPPYNVGSQPSFKPDQLFL 340

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDSGMNK+W RSI +AV+AA+EGGATIVQ+REKD++TR+FLEAAKACM+IC + GVP
Sbjct: 341 YAVTDSGMNKKWGRSIKEAVQAAIEGGATIVQLREKDSETREFLEAAKACMEICKSSGVP 400

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDR+DIALAC+ADGVHVGQ D+ AHEVR LLGP KIIGVSCKTP QA+QAW DGADY
Sbjct: 401 LLINDRVDIALACNADGVHVGQLDMSAHEVRELLGPGKIIGVSCKTPAQAQQAWNDGADY 460

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IGCGGV+PT+TKANN T+G DGLK VCLASKLPVVAIGGIN SNA +VME+G+PNLKGVA
Sbjct: 461 IGCGGVFPTSTKANNPTLGFDGLKTVCLASKLPVVAIGGINASNAGSVMELGLPNLKGVA 520

Query: 361 VVSALFDRQCVLEEALKLHATLVEATTT 389
           VVSALFDR  V+ E   + + L   + T
Sbjct: 521 VVSALFDRPSVVAETRNMKSILTNTSRT 548

BLAST of Cla97C11G207240 vs. Swiss-Prot
Match: sp|A8MK92|THIE_ALKOO (Thiamine-phosphate synthase OS=Alkaliphilus oremlandii (strain OhILAs) OX=350688 GN=thiE PE=3 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 2.5e-44
Identity = 97/211 (45.97%), Positives = 139/211 (65.88%), Query Frame = 0

Query: 172 YFNPADLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACM 231
           Y +  D  LY V+D  + K   R    +++ A+ GGATIVQ+REK+A + +F + A    
Sbjct: 3   YNHTVDYGLYLVSDRDVLK--GRDFIKSLEEAILGGATIVQLREKEASSLEFYQLALKAK 62

Query: 232 KICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAE 291
            +   + VPL+INDR+DIALA DADGVHVGQSD+PAH VRS++G NKI+GVS  T E+++
Sbjct: 63  ALTEKYNVPLIINDRVDIALAVDADGVHVGQSDLPAHIVRSMIGQNKILGVSTATLEESK 122

Query: 292 QAWLDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEI 351
           +A  DGADYIG G ++PT TK +   V +D L+ +     +PVV IGGI + N   +ME+
Sbjct: 123 KAAEDGADYIGVGALFPTGTKTDANPVTLDQLRYIKENMDIPVVGIGGICEDNIKTIMEV 182

Query: 352 GVPNLKGVAVVSALFDRQCVLEEALKLHATL 383
           G+    GVA+VSA+  ++ + E A  L A++
Sbjct: 183 GI---DGVAIVSAILGKENIKEAAESLKASI 208

BLAST of Cla97C11G207240 vs. Swiss-Prot
Match: sp|Q97LQ9|THIE_CLOAB (Thiamine-phosphate synthase OS=Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) OX=272562 GN=thiE PE=3 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 8.9e-42
Identity = 91/199 (45.73%), Positives = 133/199 (66.83%), Query Frame = 0

Query: 177 DLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHA 236
           D  LY VTD  + K  +R +  +++ A++GG T+VQ+REK+  T DF E+A    KI   
Sbjct: 5   DYKLYLVTDRKVLK--ERDLYKSIEEAIKGGVTLVQLREKEMSTLDFYESALKLKKITET 64

Query: 237 HGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLD 296
           + +PL+INDRIDIALA +ADGVH+GQSD+P  + R LLG +KIIGVS  + E+A +A  +
Sbjct: 65  YKIPLIINDRIDIALAINADGVHIGQSDMPLIKARELLGKDKIIGVSAHSIEEALEAERN 124

Query: 297 GADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNL 356
           GA Y+G G +Y T+TK +   V ++ LK +  + K+PVV IGGIN+ NA  V+E GV   
Sbjct: 125 GATYLGVGAIYNTSTKGDAQAVSLEELKNIKNSVKIPVVGIGGINEENANKVIETGV--- 184

Query: 357 KGVAVVSALFDRQCVLEEA 376
            G++V+S +   Q + ++A
Sbjct: 185 DGISVISGILSAQKIKDKA 198

BLAST of Cla97C11G207240 vs. TAIR10
Match: AT1G22940.1 (thiamin biosynthesis protein, putative)

HSP 1 Score: 557.8 bits (1436), Expect = 5.4e-159
Identity = 278/385 (72.21%), Positives = 325/385 (84.42%), Query Frame = 0

Query: 1   MVSTSGDVLAGPTIVSVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAATLIHQ 60
           MVSTSG VLAG +I+S+ +E LLP+AD++TPN+KEASALL G  ++T+++MR AA  +H+
Sbjct: 135 MVSTSGHVLAGSSILSIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHE 194

Query: 61  MGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKTCNTHGTGCSLASCIAAELAKG 120
           MG + VLVKGGDLPDS D+VD++FDGK+ HELRS RI T NTHGTGC+LASCIAAELAKG
Sbjct: 195 MGPRFVLVKGGDLPDSSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKG 254

Query: 121 SSMFSAVKASKQFIERALKYSKDISIGNGPQGPFDHLCRLKSQEHSSYRQGYFNPADLFL 180
           SSM SAVK +K+F++ AL YSKDI IG+G QGPFDH   LK    SS R   FNP DLFL
Sbjct: 255 SSMLSAVKVAKRFVDNALDYSKDIVIGSGMQGPFDHFFGLKKDPQSS-RCSIFNPDDLFL 314

Query: 181 YAVTDSGMNKRWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEAAKACMKICHAHGVP 240
           YAVTDS MNK+W+RSI DA+KAA+EGGATI+Q+REK+A+TR+FLE AKAC+ IC +HGV 
Sbjct: 315 YAVTDSRMNKKWNRSIVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVS 374

Query: 241 LLINDRIDIALACDADGVHVGQSDIPAHEVRSLLGPNKIIGVSCKTPEQAEQAWLDGADY 300
           LLINDRIDIALACDADGVHVGQSD+P   VRSLLGP+KIIGVSCKTPEQA QAW DGADY
Sbjct: 375 LLINDRIDIALACDADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADY 434

Query: 301 IGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINQSNAAAVMEIGVPNLKGVA 360
           IG GGV+PTNTKANN T+G+DGLK VC ASKLPVVAIGGI  SNA +VM+I  PNLKGVA
Sbjct: 435 IGSGGVFPTNTKANNRTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVA 494

Query: 361 VVSALFDRQCVLEEALKLHATLVEA 386
           VVSALFD+ CVL +A KLH TL E+
Sbjct: 495 VVSALFDQDCVLTQAKKLHKTLKES 518
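The percentages on each HSP summary line follow directly from the match counts and the alignment length (for example, 278/385 gives 72.21%). A minimal sketch for re-deriving them from a summary line is given here; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Pattern for the HSP summary line as printed in this record.
# "Posi?tives" accepts both "Positives" and the misspelling "Postives"
# that appears in some report dumps. (Hypothetical helper, for illustration only.)
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


if __name__ == "__main__":
    line = "Identity = 278/385 (72.21%), Positives = 325/385 (84.42%), Query Frame = 0"
    print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check between the alignment blocks and the summary tables that list the same hits.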

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_008460243.1 | 3.2e-205 | 92.56 | PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
XP_008460244.1 | 3.2e-205 | 92.56 | PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
XP_022957338.1 | 1.1e-202 | 92.75 | thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucurbi...
XP_022957337.1 | 1.1e-202 | 92.75 | thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbi...
XP_023550227.1 | 4.3e-202 | 92.49 | thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucurbi...

Match Name | E-value | Identity | Description
tr|A0A1S3CCI6|A0A1S3CCI6_CUCME | 2.1e-205 | 92.56 | thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 OS=Cucum...
tr|A0A1S3CDB7|A0A1S3CDB7_CUCME | 2.1e-205 | 92.56 | thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 OS=Cucum...
tr|D7TGZ0|D7TGZ0_VITVI | 2.0e-176 | 80.78 | Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_12s0035g00320 PE=3 SV=...
tr|A0A2N9ITQ5|A0A2N9ITQ5_FAGSY | 2.5e-174 | 78.72 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS55435 PE=3 SV=1
tr|A0A2P5B8J3|A0A2P5B8J3_9ROSA | 1.2e-173 | 78.81 | Thiamine phosphate synthase OS=Trema orientalis OX=63057 GN=TorRG33x02_329480 PE...

Match Name | E-value | Identity | Description
sp|Q5M731|TPS1L_ARATH | 9.8e-158 | 72.21 | Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thal...
sp|O48881|TPS1_BRANA | 4.6e-155 | 70.65 | Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus ...
sp|Q2QWK9|TPS1_ORYSJ | 1.8e-151 | 68.30 | Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativ...
sp|A8MK92|THIE_ALKOO | 2.5e-44 | 45.97 | Thiamine-phosphate synthase OS=Alkaliphilus oremlandii (strain OhILAs) OX=350688...
sp|Q97LQ9|THIE_CLOAB | 8.9e-42 | 45.73 | Thiamine-phosphate synthase OS=Clostridium acetobutylicum (strain ATCC 824 / DSM...

Match Name | E-value | Identity | Description
AT1G22940.1 | 5.4e-159 | 72.21 | thiamin biosynthesis protein, putative
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0008972 | phosphomethylpyrimidine kinase activity
GO:0003824 | catalytic activity
GO:0004789 | thiamine-phosphate diphosphorylase activity
Vocabulary: Biological Process
Term | Definition
GO:0009228 | thiamine biosynthetic process
Vocabulary: INTERPRO
Term | Definition
IPR036206 | ThiamineP_synth_sf
IPR004399 | HMP/HMP-P_kinase
IPR013749 | PM/HMP-P_kinase-1
IPR013785 | Aldolase_TIM
IPR029056 | Ribokinase-like
IPR034291 | TMP_synthase
IPR022998 | ThiamineP_synth_TenI
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016310 phosphorylation
biological_process GO:0009228 thiamine biosynthetic process
biological_process GO:0008150 biological_process
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0005886 plasma membrane
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
cellular_component GO:0005829 cytosol
molecular_function GO:0003824 catalytic activity
molecular_function GO:0008972 phosphomethylpyrimidine kinase activity
molecular_function GO:0004789 thiamine-phosphate diphosphorylase activity
molecular_function GO:0008902 hydroxymethylpyrimidine kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C11G207240.1 | Cla97C11G207240.1 | mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR022998 | Thiamine phosphate synthase/TenI | PFAM | PF02581 | TMP-TENI | coord: 180..365; e-value: 4.3E-65; score: 218.3
IPR022998 | Thiamine phosphate synthase/TenI | CDD | cd00564 | TMP_TenI | coord: 180..380; e-value: 8.31461E-84; score: 255.521
IPR034291 | Thiamine phosphate synthase | TIGRFAM | TIGR00693 | TIGR00693 | coord: 180..369; e-value: 5.9E-63; score: 209.7
IPR034291 | Thiamine phosphate synthase | HAMAP | MF_00097 | TMP_synthase | coord: 177..384; score: 27.15
IPR029056 | Ribokinase-like | GENE3D | G3DSA:3.40.1190.20 | | coord: 1..175; e-value: 1.8E-48; score: 167.1
IPR029056 | Ribokinase-like | SUPERFAMILY | SSF53613 | Ribokinase-like | coord: 1..156
IPR013785 | Aldolase-type TIM barrel | GENE3D | G3DSA:3.20.20.70 | | coord: 176..383; e-value: 9.6E-72; score: 242.7
IPR013749 | Pyridoxamine kinase/Phosphomethylpyrimidine kinase | PFAM | PF08543 | Phos_pyr_kin | coord: 1..150; e-value: 1.6E-44; score: 152.1
None | No IPR available | PANTHER | PTHR20858 | PHOSPHOMETHYLPYRIMIDINE KINASE | coord: 1..353
None | No IPR available | PANTHER | PTHR20858:SF17 | HYDROXYMETHYLPYRIMIDINE/PHOSPHOMETHYLPYRIMIDINE KINASE THI20-RELATED | coord: 1..353
IPR004399 | Hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase | CDD | cd01169 | HMPP_kinase | coord: 1..141; e-value: 1.65725E-51; score: 173.458
IPR036206 | Thiamin phosphate synthase superfamily | SUPERFAMILY | SSF51391 | Thiamin phosphate synthase | coord: 175..378