Cp4.1LG01g16710 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g16710
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Thiamine-phosphate pyrophosphorylase
Location: Cp4.1LG01: 10460313 .. 10469592 (+)
RNA-Seq Expression: Cp4.1LG01g16710
Synteny: Cp4.1LG01g16710
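The Location field in the Overview can be read programmatically. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01: 10460313 .. 10469592 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 9280 bp on the plus strand of Cp4.1LG01.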
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGCCCATTACCAGTCACTGTACTCCGTCCCGCCGCCTCCTGCCCGCCGCCTCCTCCTCCCCCTCCCCCTCACATTCTCTCGCCCTCCCAAGGACTCAAGAGTCACATTTCTTTGTTTAATTCCCTCAGAAACACAGTGGTTCAGCCCCTAACCCGCGCCGTTCTTGAGATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAGGTCTGATCTTTTTTTGTTTCTTTTTATTCTTTTTCGTTGGGTTATGCGCTTCATCCAAGGAGAAGAAAACTAATAATGTATCCCTCGCAAATTAGACAGAAAACACCGATTGAACCAACAAGTACAGAGGAACATAAAACAAACCAATTCTTGTATGAATCAGTAAATTTATAGGAGAAGTAATGTTTAAGAATGAACAATGCTTAAAAAGACCCTGTAGTTTTGTTGGCTATGAAAAACTTGTTGGAATTGAAAAGCTGTATTTGAAAACATTGAAACTATAGCAAACTATGGACAAACAAAATAGAAAACTTAACTTTTGATTTTAGTTTGATTGAATGACGTTGAGTCTTCTTGAAAATAGAAGAAACTATCCACGGAGAGAAGAAATTATTATCTATTTTTTAAAAAAATTTCATATGGAGTAGAGTTGGAAACAGAATTTTGAAGTATTAAATCCATACAAATCAGACCTATATAGAAGTATATTTAGGGGTTCCAACACGGGTTAGCTTTATCAATTATGGAGGTATATTTTATGATTTATGGGTTTTACTTTCTGTGCCTTGATTTGTCCTTTTTAGTTCAATCAAGTCTCTAGGTTTTGTATGGCCATGAAGAAGCCGGAAGAAACAGTTGTAGCTTCAAGTGATCGCTATGAGATGAGGATTCCACATGTGTTGAGTGTTGCTGGTTCTGATTCAGGAGCAGGGGCTGGAATCCAAGCGGATCTTAAGACTTGTGCTGCTCGTGGAGTGTACTGTTCCACTGTGATAACTGCTGTTACAGCACAGAACACCGTGGGGGTTCAGGTCATTATCACACACACATAATTATTATCCATATTTATTTGTTGGTGTATTTGTTTTGGCTTGAAAGGACAATGGGGAGGACTGGTGAATAAATAAATAATGTTCCTTGCCTGCCTGCTATGGTATAATGGGGTGTATAATGTTATTGTGAGTTTAGGATGATTTTTGAAAACAAGGTGCTTCTGAGTGAAAGCTTATCCTTTGTTAGCACTTTATTAAGTGCTATCCTGGGTATGTTTAGGAATGCTTTAGGCATGGTGAAAAACACCTTTTTCTTATGCTCAAAAGCATGTATCATAAATATCATGTTTGGGAGCAACAACAGATGCTTTTAAAAAAGTTAATAGTACTTATAATACTTGGTAAAAAGCACTTTTAGAGTGCTTTTGTCTAACTGGGAAGTACTTTATAAACACTTATTGATAGTGTGAAAGCAAGCATAAAGTAATCTTTCGTAAAGCAGTTCTTTTCAAAATGCTTACAGTACTTCATCCTTATTCTATAAGCACTCCTAAATACATCCGTAGTCCTTTTAATAAGTTTTCAAGTATCACTTCTAAGTATTTAACTCAAATTATGAAAACTTTGAAAATACTTTTGACAAGCCAGAAACACTTTTTACCTTTGCGAAAATCATCCTAAACTCACTCTTAGAATATACAATGCCCTTGAAGTTAGTTTTTGCTTTGTTCTCAATGAAATTAGCTGAAATGTTTTGTCCTAAATTATACTGCTTGTAACACAAATTTTATTTAAGACGAAGATGTACTAACTTTTTCAGGATGTAAACATTATGCCGGAGGGCTTTGTTTCAGAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAGTATAATTAAATTTTAAGTTAGGATTGCCTTGCAAGTTTTTTAGGAGCTCATATGGACAAGACTGTGGACAGTTCATGTATAGGATATTGAACAGGCTAGCTGTTATTGCTTTTTCAGTGCACG
TACTAATATATACTTTATGTAAGCTGGAAAACACAGAAAGACTACTAATGATTACATTAATTCAGGTGAAAACAGGAATGCTACCTTCTACTGGCATCATTCAGGTTATACGTCAGCGCCTGAAGGAGTTTCCTGTTCAAGGTATAAAGTATTTCAGTAAGTTATTGGTGCATTTCTTTGATTTAATTATCGTTTTTTAAAGATTTTAATATGAATAAAGGCAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACCAGTGGGGATGTTCTGGCTTGTCCTACAATTATTTCAGTGTTACAGTAAGTTGATTATCTGTTTATAGGGAGTATCATTAGTCCTTCAAACTAGTTCTTATGATAATTTCCTGCGCTTCCTGTTTGCATTTTTTAAATGTCCTCAGAATTAGTATATTGTTTTTTGGTTCAAAAGGACTCTGAAGCTTTTGGTATGGACTTCAAGCTTTTAGCGTTAAGGCTTACATCGAACGAACCTTTATAAGAACCTACTGAGGCTTGAAATTCTTATGCCTCATATAATAATTAGAAAAATAATACTATTTTATTGCCTATTCTCTCAAAATGTGAATGTTCACCTGTGTAAGTAATGCTTATAGCACCACAATTTATTCATGCTTTTCGCTGATGTCCATACATCCAAGTAAGGACTACATGAAACTCCCAACAATTTAATACTGTTTTTAATGGTATTGCTACTGTTTTTACTTTCACATGTTAGATTGAGTGATTTTGCAAAACCCCTTCGTTTGCAAGGGGACATCTTGTCCGTTTTTCCATATACTGTTTTGGAAGCTTTTTTTTAATAATGAAATTTCTTGCATGTTTCTTATAAAAAGAAGAGAGAGAGAGCGTGAAAAGGAGAAAAATCTTCTTGGTAGTAGCTTCTTCTTAAAGTTGCTGGTTTTTCTTGTTGAGAGAGGCTTTTAGTTTTGACTTGGCAAAACTTGTTAGTCTTTGTTTGGCTGCAATTACTCCCACAAATTTGCCTCACATTGGCCCCAAGGTGTGTCCTTGTTGGTAAGAAATTACAATATTGGAACCCCTCCCAAATCTTTAATTTAAATAGTGGGTTCTAGGGATCGAGAGAATTAATTTCCTTGCCCATCTACTAGAGCTGGGGCTGCGGGTTTTGCTGGGGATTAGAGTTGGTGGGGCCTGGATTCCCCAATTTACAGCCATAATTTGCTTAATTGAATGAAAATAATGCTTTTTCTTTAATCAAAATTCATGTCTGATCTTCATATGTTAGTTTTTCATAATTCTATGATACGTCTTTTGCTTTCAGGGATGAACTTCTACCAATGGCTGACTTGGTAACCCCAAATTTGAAGGAAGCATCTGCCTTACTTGGCGGTATGCCCCTTAATACAATTTCTGACATGCGTCATGCTGCAACGCTAATCCATCAGATGGGATCCAAGTAAGTTTTTAATCCATCACATAGCTTGCGTTCTTTTGTAATTCCATTTTATCAATGAAATGAAAATCAATGTTTCTCATGAAAAAAGAAGAAGAAATTTTATGCTACCTCTAGTCACAAGAGACTGACTTTGACACTCAATTTTCCTCATTGCCGCCTGCCTCCTAGCAGTAATGTTAATACCATTCTACTTAATTCAAAGAAGTTTTTTTAAGTCTAAATTATAAAAATACCCCTGAATTTTACTCTTTGTTTCAAAAGTTCAGAGGTGTCTTTTATAATTTAACCATTTTTAAAACTTTAATTCTATGGTTATTCGGATTACATTAGCGATCGCTAGTCATTCTCCCTCCTTTGCCACCCAATCTTCACCTTCTTATAGACCCATCGAATCTTCGCCCCCTTGCCTTCTCAATCCCTTGACCAACCCTCCTCAACAATCGCTAGCCATCCTGACCCCAATCCCTTCTTCCTTTCCCTCCTCATCAGCCAATTAACCTACCTCACTGCATTTACCAATTGGCCTCGAGATGACTACAAATTCCATGGAGTAATCAGGGAC
AAAAAAAAGGCTCAACGAAAATCGTGATTGGCGACGAAGAAGACAGGAGGTGTATCTGATATTGGGAGAAAACCCTTCATATTTTCCTTGACCCTTATGTTAACCTCCATCATATTCATCCGAGAGAAAGATTTAGTGGCTGTATCGATGTATCCTCCATGGGCATTCCAAATTTTCTTGAAGGATTCAAAAGATCATTTCTTGAAGGATATTTATCAATTGGAAAATTTCTGATTTTCATCCATCCCCACTGGGTAGGCACTGGAGGATCTTTAAGAAAGGACTCCATACTCAAATTTTCAAATCTTACCCTATAAGTTCCCACCTTATACCACCTATATTTCCCAAAGTAGTCTCTCGTTCTTTGCCTTCCTCCACGAGGAGTGCCCTATCAGGTTGAATGGGTCTAATAAACCCAAAATCATATACTTATTGTTAAAGAGCTCATAAAATCACATGCCCGTCATTATGAAAGTGCACAAAAGATAAAAGGATTGATCTCTGCTTGTTTCAGTGATTTTTAACCCTTTTCCACCCTCAATCAGTGATTGGTGCGCACCGTCGAGTTAGGCAGCAGGAGAGCATGGTAGGGAGGGGTGGGATTGGGTTGGGTATGTGAGAAGAGAGAGAAGGGAACATTTTTTTTCTCTTAAATCAAACAGAGTGAAAGCCAATTGGAGAGCTTTGTTGTAATTCACCAATAGTTGCTTGGAGTTTTTACTCCCGTGTTTCATCAACTAGTGAAATTGTTTCTTCTTCTTCTTCTTCTTCAAAATAATATAACAATAAATTAATAAAAATCTTTCTAGTTTTCTAACAACAAAATGTTGTAGGTGCACATGGGTGTCCATGAAATTAGTGAGGTGTCCACCAGTTGGCCTAAATGCTCAATCCATGGATATCAACGAAAGGAAGAAAAACAATGGGGTTTATAAAATTTTTGGAAGGAAGAAAAAAAAAAAGGGAGCTGCAAAAAGAGTGACGGAGGGAGGTTGAGAGAAGCAAATGTTTAGTTTAGGCTTGGGTTTACTCAAATAACAGCATCCAAATTTCTCTTAGAATTATTTGTTTATTTATTTTTCTAATTTCTTATTGATTTAAAACCCTAGCTAATTGGATCTAAGAACCCTTATCAACCCTTAGATATTCTAACCCATAAATCACAAATTCTAAAACCCCCAAATGAAGATGAGTTCATCCCAAATGATTGTAAGTATCGAGAATTCAATTTGAAGAATAAGACCTTTAGATCTAAAATATCCCAGAACTCTAAAATGACAAGGAATACCCCAAGGAAAGTTAGATCCGGATTATTTTATTGATCAAATATCTCAAGTATCACAAGGCAAGAAAACTTGTTTGAGATTTGAATCACTTCACCAGCAGAATCAATCATGTCAAGCTTAAATGATTCTAAGCATACAACCTAAACTATATAGAATTGCAAATCAACTTAGCCCTTGGCTAAGAGAAAGCATGAATGCTATTTTTACTATATTTTCTAAGTCTATCTTACAAAGACAACATGCATGGCTTTATATAGCCTCAAAATGAAACCCTTGACCTTTCATGAGGCATTTCAAAAGTTGTAACCTTCATACTTCATGGCTATAATTGGCCACTAAGGAAATGTATCCTAGAAACATCATAAATTTAGATTATCATCTACCAAAGAATTTGTAACCATCCAAGAATAAATGAAGTTCCACCTGACGTAGTTTCATTGTTGCACAATTGAAGTTTCATTGTTGCACAATTGAAGTTTCATTGTTGCACAATTGAAGCTTCATTGTTGCACAATTGAAGCTTGATTCTTCTTTAATGTGACATTAATTGCAACTTGAGCTTATCTCGTGGTAATTTGAACCACATTTTTTCACATCTTTCTGACATGATTGTTAGTTCTGCATGTTACAACTATTAAGATAATTTTGAGTTCAATTTTCTACCCAATTAAATTCAACCTTGCACATTTTAGACGCCCCCTCCGCTTCATACTA
CCATTTTTATCAATTTACTCCTGACTGATGCTGAAGAGCGTTATGTTTGATTTTAGAAATGTACTTGTCAAAGGCGGGGACCTTCCAGATTCATTGGATGCTGTAGATATATTCTTTGATGGTAAGTACAGTACAGATTGCATAATTCTGACAATTTGATATATATATTTTTTTGTTATGAAGGAAGCCCATCCACTGAATCCATATTTTTTTCATAGGCAAGGATCTGCATGAGCTACGATCTTCACGCATAACGACTCGCAACACTCATGGCACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAGCTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGTAATAATCTACGGACACTTATTTTAAACCCACCAATAAGAAGATTAAAGTCTCAACGGTTCATAATACAACTTCCTATTGAATAGTTAAATAATTTTTTAAAAATAAACATCTTCAATTTGTTATAGGATAATAGTTGTTGAAAATAACGCTATATTCAATATGTATGCTCACTGACCTTGACAGATGATCAAGACGGGATGTTAAGAGTTACTTTAATGCTTTTTTTTTTCTGGCTTTTCATTGAAATGATGAAAAATAAAAAAATGTTCTAGGGATACAAACTCCCAAAGGGAGTGAAAAAGGGAAAAAGGAAAAAAAATACAAACAGTACATACAACCAAATAGAGATTAGTCAGGAAAATAAATAAATTTTTAAGGAACCAAGAACTTTCTCTGAACACTTGGAGAGCTTGCCAATGAATAATGAAACTGTGAATAAAACCTCTATCAAACACTGAAATTCGATAGATGCTTCAAAACTTCAAAAAGAAAACCAAGAATCGACCACAACCAACAACTCTTGAGCCAATTTTACTCGTGACACCTTCATCAACTCTTGAGCTTTGCAAGAATTACTAACCATTGTTAAAAGCTTTGTAAGAACGATCACCGACAAAGAAATAAAAAAATATCCTCATGCAAGTCAATTCAAATATTTCATGCAAGTCGATTCAAATATCCCAGTGAAGACAAAAAAAAACTTTGTAAAGCAAAACCATGACGTGGCTACCCTCTCGAGCTAATGGAGAGTCTTCTTGTCACAGTCCTGAAATAATAAGCCAAAAGTTTAAGAAAAAGAATCTACAAGAACATCAACCTCCCTACTGAAATCCATGGATGAATGAGTGATTCCACGCTGTTCAAACGTACCTGGGAGTCAACCTCAAACACGTCATTACATGGTGGTGAGTTCTTTTTCGAGTGAATGACTCCTCTCGTTAAACCTTGAGTGAAGTCAAAATGGTTGTTGAGGGGATAAGTGAAGGGGGAGGGTGGTAAGGCGACATTCTAATTGAATAAGAATTTAAAGAAGTTTCCTGGAAGCATGAATGGTAAGAAGAAAAAAGCACATCTTTTTTGTGAGAATAATGCAAATGATAGGCTGGAATATATAAACATTCTTCCCCTCCTTTTTAACACTTGTGGGCTTGGAAATCTCAATCCGTAATGGAGGCCTCTAGGATTCTCAATAAATGAGGGATATATCTTTCACAGTCCTATTTGATAATCAAGCTTTTAAAAGTTGTACTTATTTTTTCACCTTTCTTTACAGCATCTTTCCTAATGAAACATTTAAATTCACGACCAAATTCTATTTACAAACCCAAGGTTTTCAAAACTACTTTTTTATGCACTAATCAAAATATGGTTGCCACTCAATCCAATTTGGAGCTTCAAATTCTTTGGTTATTTGTTTTTTTCTTTTTCTTTTTTTTTGGATGAAAGAGACGTATCATTTCACTCCATATATATAATTTGAGTAATTCAAAACGCAATAGGCTTGGTTGTCGTTACCAGACTTGATGATTGTCCACAACTAAGGTTTTATTTATCTGGATCTTTACAGATAAGCAAACAGTTTATTGAAAGAGCATTGAGTTACAGCAAGGACATTAGCATTGGAAATGGACCTCAAGGCCCATTCGATCAT
CTATGTCGTCTTAAGAGTCGAGAACAAAGTTCGTATAGACAGGGGAGTTTCAATCAATCTGACTTATTCTTGTATGCTGTTACGGACTCGGGTATGAATAAGCGTTGGGACCGTTCTATCACAGATGCTGTTAAAGCTGCAGTTGAAGGAGGTGCTACTATCATTCAGATAAGGTTTGTCTACTGTTTTCCATTAATCAATTGTTGTTAGTTTGTTTGCTTAAAAATGTTTTCAGAAGTTCATACATATTCAAGTCATTACTTCAAATGTATTGAATTCCTTGTTGCATTTTAATACCTCTTAGCACCTTCTCAGTTGTTCAAAGGTTGAATGGCTTTAGAAACTTACCCATAAAGTGTAGGCTAAATATGCTCCCTCTATAGGCTAGCAATAGTGAGGTAGTAATCTTGGGTTAATAGAACCCGACTGTAAGATTTGAATCTCTTAACGTGTTGCATTATTGTTGACCGAAGTTATAAACTTGGTATAATTAAAACTTGTATTCAATTGTTAATTTGCTTCCAATGGTCGCATGATCTTATTTTTATGAACAGGGAAAAGGATGCTAAAACTCACGATTTCTTGGAAGCAGCCAAATCATGTATAGAGATTTGTCACACACACGGTGTTCCATTACTGATCAACGATCGTATTGACGTCGCACTTGCCTGTGGTGCTGATGGTGTACACATTGGTCAGTCCGATATTCCTGTTCATGCAGCTCGTAGCCTTCTTGGCCCTGATAAGATTATCGGTGTCTCATGCAAGACACCGGAGCAAGCGGAACAGGCATGGCTTGATGGTGCTGATTACATCGGGTGTGGTGGAGTTTATCCCACAAACACAAAGGCAAACAATCTGACTGTTGGGCTTGATGGATTGAAAAGGGTTTGCTTAGCTTCCAAGTTGCCTGTGGTTGCAATTGGTGGAATTAATCACAGTAATGCAGCGGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCGGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGACCTTGAAGCTACATGCAACATTAGTGGAGGCTACAGCATTATCACCTGCGAAATGAGTGAGTGAGGGACCATAGTGCAACGACACAGTTTCTTGATTTTGATGAAATCAAATGTATGAATAATTGTAATGCTTTGATTTTAGTAGAAAGGTTTTTGAAATAAATGTAATTGTCTCAATGTGATTATAATGTAACAAAAATTTAATTTCGTGATAATAAGCATGGTCATTATATATTCTTTAAAA

mRNA sequence

CGCCCATTACCAGTCACTGTACTCCGTCCCGCCGCCTCCTGCCCGCCGCCTCCTCCTCCCCCTCCCCCTCACATTCTCTCGCCCTCCCAAGGACTCAAGAGTCACATTTCTTTGTTTAATTCCCTCAGAAACACAGTGGTTCAGCCCCTAACCCGCGCCGTTCTTGAGATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAGTTCAATCAAGTCTCTAGGTTTTGTATGGCCATGAAGAAGCCGGAAGAAACAGTTGTAGCTTCAAGTGATCGCTATGAGATGAGGATTCCACATGTGTTGAGTGTTGCTGGTTCTGATTCAGGAGCAGGGGCTGGAATCCAAGCGGATCTTAAGACTTGTGCTGCTCGTGGAGTGTACTGTTCCACTGTGATAACTGCTGTTACAGCACAGAACACCGTGGGGGTTCAGGATGTAAACATTATGCCGGAGGGCTTTGTTTCAGAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAAAACAGGAATGCTACCTTCTACTGGCATCATTCAGGTTATACGTCAGCGCCTGAAGGAGTTTCCTGTTCAAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACCAGTGGGGATGTTCTGGCTTGTCCTACAATTATTTCAGTGTTACAGGATGAACTTCTACCAATGGCTGACTTGGTAACCCCAAATTTGAAGGAAGCATCTGCCTTACTTGGCGGCGGGGACCTTCCAGATTCATTGGATGCTGTAGATATATTCTTTGATGGCAAGGATCTGCATGAGCTACGATCTTCACGCATAACGACTCGCAACACTCATGGCACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAGCTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGATAAGCAAACAGTTTATTGAAAGAGCATTGAGTTACAGCAAGGACATTAGCATTGGAAATGGACCTCAAGGCCCATTCGATCATCTATGTCGTCTTAAGAGTCGAGAACAAAGTTCGTATAGACAGGGGAGTTTCAATCAATCTGACTTATTCTTGTATGCTGTTACGGACTCGGGTATGAATAAGCGTTGGGACCGTTCTATCACAGATGCTGTTAAAGCTGCAGTTGAAGGAGGTGCTACTATCATTCAGATAAGGGAAAAGGATGCTAAAACTCACGATTTCTTGGAAGCAGCCAAATCATGTATAGAGATTTGTCACACACACGGTGTTCCATTACTGATCAACGATCGTATTGACGTCGCACTTGCCTGTGGTGCTGATGGTGTACACATTGGTCAGTCCGATATTCCTGTTCATGCAGCTCGTAGCCTTCTTGGCCCTGATAAGATTATCGGTGTCTCATGCAAGACACCGGAGCAAGCGGAACAGGCATGGCTTGATGGTGCTGATTACATCGGGTGTGGTGGAGTTTATCCCACAAACACAAAGGCAAACAATCTGACTGTTGGGCTTGATGGATTGAAAAGGGTTTGCTTAGCTTCCAAGTTGCCTGTGGTTGCAATTGGTGGAATTAATCACAGTAATGCAGCGGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCGGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGACCTTGAAGCTACATGCAACATTAGTGGAGGCTACAGCATTATCACCTGCGAAATGAGTGAGTGAGGGACCATAGTGCAACGACACAGTTTCTTGATTTTGATGAAATCAAATGTATGAATAATTGTAATGCTTTGATTTTAGTAGAAAGGTTTTTGAAATAAATGTAATTGTCTCAATGTGATTATAATGTAACAAAAATTTAATTTCGTGATAATAAGCATGGTCATTATATATTCTTTAAAA

Coding sequence (CDS)

ATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAGTTCAATCAAGTCTCTAGGTTTTGTATGGCCATGAAGAAGCCGGAAGAAACAGTTGTAGCTTCAAGTGATCGCTATGAGATGAGGATTCCACATGTGTTGAGTGTTGCTGGTTCTGATTCAGGAGCAGGGGCTGGAATCCAAGCGGATCTTAAGACTTGTGCTGCTCGTGGAGTGTACTGTTCCACTGTGATAACTGCTGTTACAGCACAGAACACCGTGGGGGTTCAGGATGTAAACATTATGCCGGAGGGCTTTGTTTCAGAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAAAACAGGAATGCTACCTTCTACTGGCATCATTCAGGTTATACGTCAGCGCCTGAAGGAGTTTCCTGTTCAAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACCAGTGGGGATGTTCTGGCTTGTCCTACAATTATTTCAGTGTTACAGGATGAACTTCTACCAATGGCTGACTTGGTAACCCCAAATTTGAAGGAAGCATCTGCCTTACTTGGCGGCGGGGACCTTCCAGATTCATTGGATGCTGTAGATATATTCTTTGATGGCAAGGATCTGCATGAGCTACGATCTTCACGCATAACGACTCGCAACACTCATGGCACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAGCTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGATAAGCAAACAGTTTATTGAAAGAGCATTGAGTTACAGCAAGGACATTAGCATTGGAAATGGACCTCAAGGCCCATTCGATCATCTATGTCGTCTTAAGAGTCGAGAACAAAGTTCGTATAGACAGGGGAGTTTCAATCAATCTGACTTATTCTTGTATGCTGTTACGGACTCGGGTATGAATAAGCGTTGGGACCGTTCTATCACAGATGCTGTTAAAGCTGCAGTTGAAGGAGGTGCTACTATCATTCAGATAAGGGAAAAGGATGCTAAAACTCACGATTTCTTGGAAGCAGCCAAATCATGTATAGAGATTTGTCACACACACGGTGTTCCATTACTGATCAACGATCGTATTGACGTCGCACTTGCCTGTGGTGCTGATGGTGTACACATTGGTCAGTCCGATATTCCTGTTCATGCAGCTCGTAGCCTTCTTGGCCCTGATAAGATTATCGGTGTCTCATGCAAGACACCGGAGCAAGCGGAACAGGCATGGCTTGATGGTGCTGATTACATCGGGTGTGGTGGAGTTTATCCCACAAACACAAAGGCAAACAATCTGACTGTTGGGCTTGATGGATTGAAAAGGGTTTGCTTAGCTTCCAAGTTGCCTGTGGTTGCAATTGGTGGAATTAATCACAGTAATGCAGCGGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCGGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGACCTTGAAGCTACATGCAACATTAGTGGAGGCTACAGCATTATCACCTGCGAAATGA

Protein sequence

MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
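The relationship between the coding sequence and the protein sequence listed above can be checked with a small translation routine. This is a minimal sketch that assumes the standard nuclear genetic code; the function name `translate` is hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, enumerated in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGCTGCTGCCTCTCACTTTTCAGATT"  # first 30 nt of the CDS above
cds_end = "TCACCTGCGAAATGA"                   # last 15 nt of the CDS above
print(translate(cds_start))  # MVLLPLTFQI
print(translate(cds_end))    # SPAK
```

Both fragments reproduce the corresponding ends of the listed protein (`MVLLPLTFQI...` and `...SPAK`), with the final `TGA` read as the stop codon.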
Homology
BLAST of Cp4.1LG01g16710 vs. ExPASy Swiss-Prot
Match: Q5M731 (Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TH1 PE=1 SV=1)

HSP 1 Score: 663.7 bits (1711), Expect = 1.7e-189
Identity = 341/491 (69.45%), Positives = 397/491 (80.86%), Query Frame = 0

Query: 40  RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVS 99
           ++P VL+VAGSDSGAGAGIQADLK CAARGVYC++VITAVTAQNT GVQ V+++P  F+S
Sbjct: 29  KVPQVLTVAGSDSGAGAGIQADLKVCAARGVYCASVITAVTAQNTRGVQSVHLLPPEFIS 88

Query: 100 EQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTI 159
           EQLKSVLSD + DVVKTGMLPST I++V+ Q L +FPV+ALVVDPVMVSTSG VLA  +I
Sbjct: 89  EQLKSVLSDFEFDVVKTGMLPSTEIVEVLLQNLSDFPVRALVVDPVMVSTSGHVLAGSSI 148

Query: 160 ISVLQDELLPMADLVTPNLKEASALLG----------------------------GGDLP 219
           +S+ ++ LLP+AD++TPN+KEASALL                             GGDLP
Sbjct: 149 LSIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHEMGPRFVLVKGGDLP 208

Query: 220 DSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFI 279
           DS D+VD++FDGK+ HELRS RI TRNTHGTGC+LASCIAAELAKGSSM SAVK++K+F+
Sbjct: 209 DSSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKGSSMLSAVKVAKRFV 268

Query: 280 ERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDR 339
           + AL YSKDI IG+G QGPFDH   LK   QSS R   FN  DLFLYAVTDS MNK+W+R
Sbjct: 269 DNALDYSKDIVIGSGMQGPFDHFFGLKKDPQSS-RCSIFNPDDLFLYAVTDSRMNKKWNR 328

Query: 340 SITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACG 399
           SI DA+KAA+EGGATIIQ+REK+A+T +FLE AK+CI+IC +HGV LLINDRID+ALAC 
Sbjct: 329 SIVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVSLLINDRIDIALACD 388

Query: 400 ADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKAN 459
           ADGVH+GQSD+PV   RSLLGPDKIIGVSCKTPEQA QAW DGADYIG GGV+PTNTKAN
Sbjct: 389 ADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADYIGSGGVFPTNTKAN 448

Query: 460 NLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEE 503
           N T+GLDGLK VC ASKLPVVAIGGI  SNA +VM+I  PNLKGVAVVSALFD+ CVL +
Sbjct: 449 NRTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVAVVSALFDQDCVLTQ 508
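The percentages in each HSP summary line are the matched columns divided by the alignment length. The sketch below recomputes them for the Q5M731 hit; the helper name `parse_hsp_stats` is hypothetical, and the regular expression assumes the `label = n/m` layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, percent)} for each 'label = n/m' pair."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        n, d = int(num), int(den)
        stats[label] = (n, d, round(100 * n / d, 2))
    return stats

line = "Identity = 341/491 (69.45%), Positives = 397/491 (80.86%), Query Frame = 0"
for label, (n, d, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {n}/{d} = {pct}%")
```

The recomputed values (69.45% and 80.86%) agree with those reported for this HSP. The `Query Frame = 0` field has no fraction and is skipped by the pattern.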

BLAST of Cp4.1LG01g16710 vs. ExPASy Swiss-Prot
Match: O48881 (Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus OX=3708 GN=BTH1 PE=1 SV=1)

HSP 1 Score: 640.2 bits (1650), Expect = 2.0e-182
Identity = 328/491 (66.80%), Positives = 391/491 (79.63%), Query Frame = 0

Query: 40  RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVS 99
           ++  VL+VAGSDSGAGAGIQAD+K CAARGVYC++V TAV A+NT  VQ V+++P   VS
Sbjct: 31  KVAQVLTVAGSDSGAGAGIQADIKVCAARGVYCASVKTAVKAKNTRAVQSVHLLPPDSVS 90

Query: 100 EQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTI 159
           EQLKSVLSD +VDVVKTGMLPS  I++V+ Q L E+PV+ALVVDPVMVSTSG VLA  +I
Sbjct: 91  EQLKSVLSDFEVDVVKTGMLPSPEIVEVLLQNLSEYPVRALVVDPVMVSTSGHVLAGSSI 150

Query: 160 ISVLQDELLPMADLVTPNLKEASALLG----------------------------GGDLP 219
           +S+ ++ LLP+AD++TPN+KEASALLG                            GGDLP
Sbjct: 151 LSIFRERLLPLADIITPNVKEASALLGGVRIQTVAEMRSAAKSLHQMGPRFVLVKGGDLP 210

Query: 220 DSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFI 279
           DS D+VD++FDG + HEL S RI TRNTHGTGC+LASCIAAELAKGS+M SAVK++K+F+
Sbjct: 211 DSSDSVDVYFDGNEFHELHSPRIATRNTHGTGCTLASCIAAELAKGSNMLSAVKVAKRFV 270

Query: 280 ERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDR 339
           + AL+YSKDI IG+G QGPFDH   LK  +  SYRQ +F   DLFLYAVTDS MNK+W+R
Sbjct: 271 DSALNYSKDIVIGSGMQGPFDHFLSLK--DPQSYRQSTFKPDDLFLYAVTDSRMNKKWNR 330

Query: 340 SITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACG 399
           SI DAVKAA+EGGATIIQ+REK+A+T +FLE AKSC++IC ++GV LLINDR D+A+A  
Sbjct: 331 SIVDAVKAAIEGGATIIQLREKEAETREFLEEAKSCVDICRSNGVCLLINDRFDIAIALD 390

Query: 400 ADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKAN 459
           ADGVH+GQSD+PV   RSLLGPDKIIGVSCKT EQA QAW DGADYIG GGV+PTNTKAN
Sbjct: 391 ADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTQEQAHQAWKDGADYIGSGGVFPTNTKAN 450

Query: 460 NLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEE 503
           N T+GLDGL+ VC ASKLPVVAIGGI  SNA +VM IG PNLKGVAVVSALFD++CVL +
Sbjct: 451 NRTIGLDGLREVCKASKLPVVAIGGIGISNAESVMRIGEPNLKGVAVVSALFDQECVLTQ 510

BLAST of Cp4.1LG01g16710 vs. ExPASy Swiss-Prot
Match: Q2QWK9 (Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=Os12g0192500 PE=2 SV=2)

HSP 1 Score: 630.2 bits (1624), Expect = 2.1e-179
Identity = 319/500 (63.80%), Positives = 388/500 (77.60%), Query Frame = 0

Query: 32  ASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVN 91
           ++S   EM  PHVL+VAGSDSG GAGIQAD+K CAA G YCS+V+TAVTAQNT GVQ ++
Sbjct: 47  SASAAREMPWPHVLTVAGSDSGGGAGIQADIKACAALGAYCSSVVTAVTAQNTAGVQGIH 106

Query: 92  IMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSG 151
           ++PE F+ EQL SVLSDM VDVVKTGMLPS G+++V+ + LK+FPV+ALVVDPVMVSTSG
Sbjct: 107 VVPEEFIREQLNSVLSDMSVDVVKTGMLPSIGVVRVLCESLKKFPVKALVVDPVMVSTSG 166

Query: 152 DVLACPTIISVLQDELLPMADLVTPNLKEASALLG------------------------- 211
           D L+  + +SV +DEL  MAD+VTPN+KEAS LLG                         
Sbjct: 167 DTLSESSTLSVYRDELFAMADIVTPNVKEASRLLGGVSLRTVSDMRNAAESIYKFGPKHV 226

Query: 212 ---GGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSA 271
              GGD+ +S DA D+FFDGK+  EL + RI T NTHGTGC+LASCIA+ELAKG++M  A
Sbjct: 227 LVKGGDMLESSDATDVFFDGKEFIELHAHRIKTHNTHGTGCTLASCIASELAKGATMLHA 286

Query: 272 VKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDS 331
           V+++K F+E AL +SKD+ +GNGPQGPFDHL +LK    +   Q SF    LFLYAVTDS
Sbjct: 287 VQVAKNFVESALHHSKDLVVGNGPQGPFDHLFKLKCPPYNVGSQPSFKPDQLFLYAVTDS 346

Query: 332 GMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDR 391
           GMNK+W RSI +AV+AA+EGGATI+Q+REKD++T +FLEAAK+C+EIC + GVPLLINDR
Sbjct: 347 GMNKKWGRSIKEAVQAAIEGGATIVQLREKDSETREFLEAAKACMEICKSSGVPLLINDR 406

Query: 392 IDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGV 451
           +D+ALAC ADGVH+GQ D+  H  R LLGP KIIGVSCKTP QA+QAW DGADYIGCGGV
Sbjct: 407 VDIALACNADGVHVGQLDMSAHEVRELLGPGKIIGVSCKTPAQAQQAWNDGADYIGCGGV 466

Query: 452 YPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALF 504
           +PT+TKANN T+G DGLK VCLASKLPVVAIGGIN SNA +VME+G+PNLKGVAVVSALF
Sbjct: 467 FPTSTKANNPTLGFDGLKTVCLASKLPVVAIGGINASNAGSVMELGLPNLKGVAVVSALF 526

BLAST of Cp4.1LG01g16710 vs. ExPASy Swiss-Prot
Match: P56904 (Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase OS=Rhizobium meliloti (strain 1021) OX=266834 GN=thiD PE=3 SV=2)

HSP 1 Score: 182.2 bits (461), Expect = 1.5e-44
Identity = 118/260 (45.38%), Positives = 152/260 (58.46%), Query Frame = 0

Query: 45  LSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKS 104
           LS+AGSDSG GAGIQADLKT +A GVY ++VITA+TAQNT GV  V  +    VS Q+ +
Sbjct: 6   LSIAGSDSGGGAGIQADLKTFSALGVYGASVITAITAQNTRGVTAVEDVSAEIVSAQMDA 65

Query: 105 VLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQ 164
           V SD+ V  VK GM+     I  I   L+ F  +A VVDPVMV+TSGD L  P  ++ L 
Sbjct: 66  VFSDLDVKAVKIGMVSRRETIAAIADGLRRFGKRA-VVDPVMVATSGDALLRPDAVAALI 125

Query: 165 DELLPMADLVTPNLKEASALLG----------------------------GGDLPDSLDA 224
           +ELLP+A +VTPNL EA+ + G                            GG L    +A
Sbjct: 126 EELLPLALVVTPNLAEAALMTGRAIAGDEAEMARQAEAIMRTGAHAVLVKGGHLKGQ-EA 185

Query: 225 VDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALS 277
            D+FFDG  L  L + RI TRN HGTGC+L++ IAA LAKG  +  AV  +K ++  A+S
Sbjct: 186 TDLFFDGDTLVRLPAGRIETRNDHGTGCTLSAAIAAGLAKGVPLIEAVSAAKAYLHAAIS 245

BLAST of Cp4.1LG01g16710 vs. ExPASy Swiss-Prot
Match: P61422 (Thiamine biosynthesis bifunctional protein ThiED OS=Geobacter sulfurreducens (strain ATCC 51573 / DSM 12127 / PCA) OX=243231 GN=thiDE PE=3 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 7.4e-44
Identity = 113/251 (45.02%), Positives = 149/251 (59.36%), Query Frame = 0

Query: 44  VLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLK 103
           VL+VAGSDSG GAGIQADLKT    G Y S+V+TA+TAQNT GV  ++ +P  FV++QL 
Sbjct: 228 VLTVAGSDSGGGAGIQADLKTVTLLGSYGSSVLTALTAQNTRGVSGIHGVPPAFVADQLD 287

Query: 104 SVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVL 163
           +V SD+ VDVVKTGML S   I  I  +L E+  + +VVDPVMV+  G  L     +SVL
Sbjct: 288 AVFSDIPVDVVKTGMLFSAETIVAIAAKLTEYRRRMVVVDPVMVAKGGANLIDRGAVSVL 347

Query: 164 QDELLPMADLVTPNLKEASALLG---------------------------GGDLPDSLDA 223
           ++ L P+A LVTPN+ EA  L G                           GG L    D+
Sbjct: 348 KERLFPLAYLVTPNIPEAERLTGANISDEESMREAARRLHRLGARNVLLKGGHLLAG-DS 407

Query: 224 VDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALS 268
           VDI FDG   H   S RI ++NTHGTGC+ AS IA  LA+G  +  A+  +K++I  A+ 
Sbjct: 408 VDILFDGAAFHRFVSPRILSKNTHGTGCTFASAIATYLAQGDPLREAIARAKRYITAAIR 467

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: XP_023550218.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 972 bits (2514), Expect = 0.0
Identity = 509/537 (94.79%), Positives = 509/537 (94.79%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE
Sbjct: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: XP_022957337.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbita moschata])

HSP 1 Score: 964 bits (2492), Expect = 0.0
Identity = 505/537 (94.04%), Positives = 507/537 (94.41%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDR+EMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNI+PEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIIPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVI QRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE
Sbjct: 121 STGIIQVIHQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTRDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: XP_022997729.1 (thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 962 bits (2488), Expect = 0.0
Identity = 504/537 (93.85%), Positives = 506/537 (94.23%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVI QRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPM DLVTPNLKE
Sbjct: 121 STGIIQVICQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMVDLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERAL+YSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALNYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTRDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLK+VCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKKVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: KAG7032412.1 (Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 958 bits (2477), Expect = 0.0
Identity = 503/537 (93.67%), Positives = 505/537 (94.04%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDR+EMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE
Sbjct: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDS MNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSSMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLEAAKSC+EIC T GVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTRDFLEAAKSCLEICRTRGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: KAG6601651.1 (Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 957 bits (2474), Expect = 0.0
Identity = 503/537 (93.67%), Positives = 505/537 (94.04%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDR+EMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE
Sbjct: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDS MNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSSMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLEAAKSC+EIC T GVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTRDFLEAAKSCLEICRTCGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537
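
The percentages in the HSP header above follow from the match counts divided by the alignment length (for example, 503/537 for identity). The following is a minimal Python sketch of that consistency check, not part of the database record; the helper name `hsp_percentages` and the regular expression are illustrative assumptions.

```python
import re

# Hypothetical helper (not provided by the database): matches fields of the
# form "Name = matches/length (xx.xx%)" in a BLAST HSP header line.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_percentages(header: str) -> dict:
    """Recompute each percentage from its matches/length pair."""
    result = {}
    for name, matches, length, _reported in HSP_FIELD.findall(header):
        # Percentage = 100 * matched positions / alignment length.
        result[name] = round(100 * int(matches) / int(length), 2)
    return result


header = "Identity = 503/537 (93.67%), Positives = 505/537 (94.04%)"
print(hsp_percentages(header))
```

The alignment length (537) includes gap columns, which is why it can exceed the last query coordinate (509) when the query row contains gaps.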

BLAST of Cp4.1LG01g16710 vs. ExPASy TrEMBL
Match: A0A6J1GYV1 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC111458768 PE=3 SV=1)

HSP 1 Score: 964 bits (2492), Expect = 0.0
Identity = 505/537 (94.04%), Positives = 507/537 (94.41%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDR+EMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNI+PEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIIPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVI QRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE
Sbjct: 121 STGIIQVIHQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTRDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537

BLAST of Cp4.1LG01g16710 vs. ExPASy TrEMBL
Match: A0A6J1KCE1 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita maxima OX=3661 GN=LOC111492599 PE=3 SV=1)

HSP 1 Score: 962 bits (2488), Expect = 0.0
Identity = 504/537 (93.85%), Positives = 506/537 (94.23%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGIIQVI QRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPM DLVTPNLKE
Sbjct: 121 STGIIQVICQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMVDLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLNTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERAL+YSKDISIGNGPQGPFD
Sbjct: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALNYSKDISIGNGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE
Sbjct: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG
Sbjct: 361 KDAKTRDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLK+VCLASKLPVV
Sbjct: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKKVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 537

BLAST of Cp4.1LG01g16710 vs. ExPASy TrEMBL
Match: A0A6J1GZX4 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC111458768 PE=3 SV=1)

HSP 1 Score: 925 bits (2390), Expect = 0.0
Identity = 485/517 (93.81%), Positives = 487/517 (94.20%), Query Frame = 0

Query: 21  MAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT 80
           MAMKKPEETVVASSDR+EMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT
Sbjct: 1   MAMKKPEETVVASSDRHEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT 60

Query: 81  AQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQAL 140
           AQNTVGVQDVNI+PEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVI QRLKEFPVQAL
Sbjct: 61  AQNTVGVQDVNIIPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIHQRLKEFPVQAL 120

Query: 141 VVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGG------------- 200
           VVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGG             
Sbjct: 121 VVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGMPLNTISDMRHAA 180

Query: 201 ---------------GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA 260
                          GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA
Sbjct: 181 TLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA 240

Query: 261 ELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ 320
           ELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ
Sbjct: 241 ELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ 300

Query: 321 SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICH 380
           SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKT DFLEAAKSCIEICH
Sbjct: 301 SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTRDFLEAAKSCIEICH 360

Query: 381 THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL 440
           THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL
Sbjct: 361 THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL 420

Query: 441 DGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPN 500
           DGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPN
Sbjct: 421 DGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPN 480

Query: 501 LKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           LKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 LKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 517

BLAST of Cp4.1LG01g16710 vs. ExPASy TrEMBL
Match: A0A6J1K5X2 (Thiamine-phosphate pyrophosphorylase OS=Cucurbita maxima OX=3661 GN=LOC111492599 PE=3 SV=1)

HSP 1 Score: 923 bits (2386), Expect = 0.0
Identity = 484/517 (93.62%), Positives = 486/517 (94.00%), Query Frame = 0

Query: 21  MAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT 80
           MAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT
Sbjct: 1   MAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT 60

Query: 81  AQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQAL 140
           AQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVI QRLKEFPVQAL
Sbjct: 61  AQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVICQRLKEFPVQAL 120

Query: 141 VVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGG------------- 200
           VVDPVMVSTSGDVLACPTIISVLQDELLPM DLVTPNLKEASALLGG             
Sbjct: 121 VVDPVMVSTSGDVLACPTIISVLQDELLPMVDLVTPNLKEASALLGGMPLNTISDMRHAA 180

Query: 201 ---------------GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA 260
                          GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA
Sbjct: 181 TLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA 240

Query: 261 ELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ 320
           ELAKGSSMFSAVKISKQFIERAL+YSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ
Sbjct: 241 ELAKGSSMFSAVKISKQFIERALNYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ 300

Query: 321 SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICH 380
           SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKT DFLEAAKSCIEICH
Sbjct: 301 SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTRDFLEAAKSCIEICH 360

Query: 381 THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL 440
           THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL
Sbjct: 361 THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL 420

Query: 441 DGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPN 500
           DGADYIGCGGVYPTNTKANNLTVGLDGLK+VCLASKLPVVAIGGINHSNAAAVMEIGVPN
Sbjct: 421 DGADYIGCGGVYPTNTKANNLTVGLDGLKKVCLASKLPVVAIGGINHSNAAAVMEIGVPN 480

Query: 501 LKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 509
           LKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
Sbjct: 481 LKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK 517

BLAST of Cp4.1LG01g16710 vs. ExPASy TrEMBL
Match: A0A1S3CCI6 (Thiamine-phosphate pyrophosphorylase OS=Cucumis melo OX=3656 GN=LOC103499122 PE=3 SV=1)

HSP 1 Score: 869 bits (2246), Expect = 6.95e-316
Identity = 451/531 (84.93%), Positives = 475/531 (89.45%), Query Frame = 0

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MV LPL  QIPKFNQVSRFCMAMKK EE VVASSDRYE RIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCMAMKKQEEMVVASSDRYETRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITA+TAQNTVGVQDVNI+PEGFVS+QLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNIVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGI+QV+ Q LKEFPV+ALVVDPVMVSTSGDVLA PTIISVLQ+ELLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKE 180

Query: 181 ASALLGG----------------------------GDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLGG                            GDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLKTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RI +RNTHGTGCSLASCI+AELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCISAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSY QG FN +DLFLYAVTDSGMN+RWDRSITDAVKAAVEGGATI+QIRE
Sbjct: 301 HLCRLKSREQSSYSQGCFNPTDLFLYAVTDSGMNERWDRSITDAVKAAVEGGATIVQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLE AKSCI+ICH HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           P+K+IGVSCKT EQAEQAW+DGADYIGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVV
Sbjct: 421 PNKVIGVSCKTMEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEAT 503
           AIGGINH+NAAAVM IG+PNL+GVAVVSALFDRQCVLE   KLHATLVEAT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLRGVAVVSALFDRQCVLEAASKLHATLVEAT 531

BLAST of Cp4.1LG01g16710 vs. TAIR 10
Match: AT1G22940.1 (thiamin biosynthesis protein, putative )

HSP 1 Score: 663.7 bits (1711), Expect = 1.2e-190
Identity = 341/491 (69.45%), Positives = 397/491 (80.86%), Query Frame = 0

Query: 40  RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVS 99
           ++P VL+VAGSDSGAGAGIQADLK CAARGVYC++VITAVTAQNT GVQ V+++P  F+S
Sbjct: 29  KVPQVLTVAGSDSGAGAGIQADLKVCAARGVYCASVITAVTAQNTRGVQSVHLLPPEFIS 88

Query: 100 EQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTI 159
           EQLKSVLSD + DVVKTGMLPST I++V+ Q L +FPV+ALVVDPVMVSTSG VLA  +I
Sbjct: 89  EQLKSVLSDFEFDVVKTGMLPSTEIVEVLLQNLSDFPVRALVVDPVMVSTSGHVLAGSSI 148

Query: 160 ISVLQDELLPMADLVTPNLKEASALLG----------------------------GGDLP 219
           +S+ ++ LLP+AD++TPN+KEASALL                             GGDLP
Sbjct: 149 LSIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHEMGPRFVLVKGGDLP 208

Query: 220 DSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFI 279
           DS D+VD++FDGK+ HELRS RI TRNTHGTGC+LASCIAAELAKGSSM SAVK++K+F+
Sbjct: 209 DSSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKGSSMLSAVKVAKRFV 268

Query: 280 ERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDR 339
           + AL YSKDI IG+G QGPFDH   LK   QSS R   FN  DLFLYAVTDS MNK+W+R
Sbjct: 269 DNALDYSKDIVIGSGMQGPFDHFFGLKKDPQSS-RCSIFNPDDLFLYAVTDSRMNKKWNR 328

Query: 340 SITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACG 399
           SI DA+KAA+EGGATIIQ+REK+A+T +FLE AK+CI+IC +HGV LLINDRID+ALAC 
Sbjct: 329 SIVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVSLLINDRIDIALACD 388

Query: 400 ADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKAN 459
           ADGVH+GQSD+PV   RSLLGPDKIIGVSCKTPEQA QAW DGADYIG GGV+PTNTKAN
Sbjct: 389 ADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADYIGSGGVFPTNTKAN 448

Query: 460 NLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEE 503
           N T+GLDGLK VC ASKLPVVAIGGI  SNA +VM+I  PNLKGVAVVSALFD+ CVL +
Sbjct: 449 NRTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVAVVSALFDQDCVLTQ 508

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
Q5M731	1.7e-189	69.45	Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thal...
O48881	2.0e-182	66.80	Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus ...
Q2QWK9	2.1e-179	63.80	Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativ...
P56904	1.5e-44	45.38	Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase OS=Rhizobium meliloti (st...
P61422	7.4e-44	45.02	Thiamine biosynthesis bifunctional protein ThiED OS=Geobacter sulfurreducens (st...

Match Name	E-value	Identity (%)	Description
XP_023550218.1	0.0	94.79	thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbi...
XP_022957337.1	0.0	94.04	thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbi...
XP_022997729.1	0.0	93.85	thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucurbi...
KAG7032412.1	0.0	93.67	Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic, partial [Cucurbita...
KAG6601651.1	0.0	93.67	Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic, partial [Cucurbita...

Match Name	E-value	Identity (%)	Description
A0A6J1GYV1	0.0	94.04	Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC1114587...
A0A6J1KCE1	0.0	93.85	Thiamine-phosphate pyrophosphorylase OS=Cucurbita maxima OX=3661 GN=LOC111492599...
A0A6J1GZX4	0.0	93.81	Thiamine-phosphate pyrophosphorylase OS=Cucurbita moschata OX=3662 GN=LOC1114587...
A0A6J1K5X2	0.0	93.62	Thiamine-phosphate pyrophosphorylase OS=Cucurbita maxima OX=3661 GN=LOC111492599...
A0A1S3CCI6	6.95e-316	84.93	Thiamine-phosphate pyrophosphorylase OS=Cucumis melo OX=3656 GN=LOC103499122 PE=...

Match Name	E-value	Identity (%)	Description
AT1G22940.1	1.2e-190	69.45	thiamin biosynthesis protein, putative
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR034291	Thiamine phosphate synthase	TIGRFAM	TIGR00693	TIGR00693	coord: 297..486; e-value: 5.2E-62; score: 206.6
IPR034291	Thiamine phosphate synthase	HAMAP	MF_00097	TMP_synthase	coord: 294..501; score: 26.885986
IPR013749	Pyridoxamine kinase/Phosphomethylpyrimidine kinase	PFAM	PF08543	Phos_pyr_kin	coord: 51..186; e-value: 2.7E-51; score: 174.3
IPR022998	Thiamine phosphate synthase/TenI	PFAM	PF02581	TMP-TENI	coord: 297..482; e-value: 8.0E-64; score: 214.3
IPR022998	Thiamine phosphate synthase/TenI	CDD	cd00564	TMP_TenI	coord: 297..497; e-value: 6.55894E-84; score: 256.291
IPR029056	Ribokinase-like	GENE3D	3.40.1190.20	-	coord: 192..283; e-value: 7.3E-19; score: 70.1
IPR029056	Ribokinase-like	GENE3D	3.40.1190.20	-	coord: 22..191; e-value: 6.9E-60; score: 204.6
IPR029056	Ribokinase-like	SUPERFAMILY	53613	Ribokinase-like	coord: 39..270
IPR013785	Aldolase-type TIM barrel	GENE3D	3.20.20.70	Aldolase class I	coord: 284..502; e-value: 3.3E-73; score: 247.2
IPR045029	Hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase	PANTHER	PTHR20858	PHOSPHOMETHYLPYRIMIDINE KINASE	coord: 38..187; coord: 184..498
IPR004399	Hydroxymethylpyrimidine kinase/phosphomethylpyrimidine kinase domain	CDD	cd01169	HMPP_kinase	coord: 43..258; e-value: 1.88835E-100; score: 300.189
IPR036206	Thiamin phosphate synthase superfamily	SUPERFAMILY	51391	Thiamin phosphate synthase	coord: 293..495

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g16710.1	Cp4.1LG01g16710.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016310 phosphorylation
biological_process GO:0009228 thiamine biosynthetic process
biological_process GO:0009229 thiamine diphosphate biosynthetic process
molecular_function GO:0008972 phosphomethylpyrimidine kinase activity
molecular_function GO:0004789 thiamine-phosphate diphosphorylase activity
molecular_function GO:0003824 catalytic activity