Cp4.1LG01g16710 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g16710
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Thiamine-phosphate synthase
Location: Cp4.1LG01 : 10460313 .. 10469592 (+)
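
The location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG01 : 10460313 .. 10469592 (+)")

# Assumption: inclusive coordinates, so the span covers end - start + 1 bases.
span = end - start + 1
print(seqid, start, end, strand, span)
```

Under that assumption the feature spans 9280 bases on the plus strand of Cp4.1LG01.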
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CGCCCATTACCAGTCACTGTACTCCGTCCCGCCGCCTCCTGCCCGCCGCCTCCTCCTCCCCCTCCCCCTCACATTCTCTCGCCCTCCCAAGGACTCAAGAGTCACATTTCTTTGTTTAATTCCCTCAGAAACACAGTGGTTCAGCCCCTAACCCGCGCCGTTCTTGAGATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAGGTCTGATCTTTTTTTGTTTCTTTTTATTCTTTTTCGTTGGGTTATGCGCTTCATCCAAGGAGAAGAAAACTAATAATGTATCCCTCGCAAATTAGACAGAAAACACCGATTGAACCAACAAGTACAGAGGAACATAAAACAAACCAATTCTTGTATGAATCAGTAAATTTATAGGAGAAGTAATGTTTAAGAATGAACAATGCTTAAAAAGACCCTGTAGTTTTGTTGGCTATGAAAAACTTGTTGGAATTGAAAAGCTGTATTTGAAAACATTGAAACTATAGCAAACTATGGACAAACAAAATAGAAAACTTAACTTTTGATTTTAGTTTGATTGAATGACGTTGAGTCTTCTTGAAAATAGAAGAAACTATCCACGGAGAGAAGAAATTATTATCTATTTTTTAAAAAAATTTCATATGGAGTAGAGTTGGAAACAGAATTTTGAAGTATTAAATCCATACAAATCAGACCTATATAGAAGTATATTTAGGGGTTCCAACACGGGTTAGCTTTATCAATTATGGAGGTATATTTTATGATTTATGGGTTTTACTTTCTGTGCCTTGATTTGTCCTTTTTAGTTCAATCAAGTCTCTAGGTTTTGTATGGCCATGAAGAAGCCGGAAGAAACAGTTGTAGCTTCAAGTGATCGCTATGAGATGAGGATTCCACATGTGTTGAGTGTTGCTGGTTCTGATTCAGGAGCAGGGGCTGGAATCCAAGCGGATCTTAAGACTTGTGCTGCTCGTGGAGTGTACTGTTCCACTGTGATAACTGCTGTTACAGCACAGAACACCGTGGGGGTTCAGGTCATTATCACACACACATAATTATTATCCATATTTATTTGTTGGTGTATTTGTTTTGGCTTGAAAGGACAATGGGGAGGACTGGTGAATAAATAAATAATGTTCCTTGCCTGCCTGCTATGGTATAATGGGGTGTATAATGTTATTGTGAGTTTAGGATGATTTTTGAAAACAAGGTGCTTCTGAGTGAAAGCTTATCCTTTGTTAGCACTTTATTAAGTGCTATCCTGGGTATGTTTAGGAATGCTTTAGGCATGGTGAAAAACACCTTTTTCTTATGCTCAAAAGCATGTATCATAAATATCATGTTTGGGAGCAACAACAGATGCTTTTAAAAAAGTTAATAGTACTTATAATACTTGGTAAAAAGCACTTTTAGAGTGCTTTTGTCTAACTGGGAAGTACTTTATAAACACTTATTGATAGTGTGAAAGCAAGCATAAAGTAATCTTTCGTAAAGCAGTTCTTTTCAAAATGCTTACAGTACTTCATCCTTATTCTATAAGCACTCCTAAATACATCCGTAGTCCTTTTAATAAGTTTTCAAGTATCACTTCTAAGTATTTAACTCAAATTATGAAAACTTTGAAAATACTTTTGACAAGCCAGAAACACTTTTTACCTTTGCGAAAATCATCCTAAACTCACTCTTAGAATATACAATGCCCTTGAAGTTAGTTTTTGCTTTGTTCTCAATGAAATTAGCTGAAATGTTTTGTCCTAAATTATACTGCTTGTAACACAAATTTTATTTAAGACGAAGATGTACTAACTTTTTCAGGATGTAAACATTATGCCGGAGGGCTTTGTTTCAGAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAGTATAATTAAATTTTAAGTTAGGATTGCCTTGCAAGTTTTTTAGGAGCTCATATGGACAAGACTGTGGACAGTTCATGTATAGGATATTGAACAGGCTAGCTGTTATTGCTTTTTCAGTGCACG
TACTAATATATACTTTATGTAAGCTGGAAAACACAGAAAGACTACTAATGATTACATTAATTCAGGTGAAAACAGGAATGCTACCTTCTACTGGCATCATTCAGGTTATACGTCAGCGCCTGAAGGAGTTTCCTGTTCAAGGTATAAAGTATTTCAGTAAGTTATTGGTGCATTTCTTTGATTTAATTATCGTTTTTTAAAGATTTTAATATGAATAAAGGCAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACCAGTGGGGATGTTCTGGCTTGTCCTACAATTATTTCAGTGTTACAGTAAGTTGATTATCTGTTTATAGGGAGTATCATTAGTCCTTCAAACTAGTTCTTATGATAATTTCCTGCGCTTCCTGTTTGCATTTTTTAAATGTCCTCAGAATTAGTATATTGTTTTTTGGTTCAAAAGGACTCTGAAGCTTTTGGTATGGACTTCAAGCTTTTAGCGTTAAGGCTTACATCGAACGAACCTTTATAAGAACCTACTGAGGCTTGAAATTCTTATGCCTCATATAATAATTAGAAAAATAATACTATTTTATTGCCTATTCTCTCAAAATGTGAATGTTCACCTGTGTAAGTAATGCTTATAGCACCACAATTTATTCATGCTTTTCGCTGATGTCCATACATCCAAGTAAGGACTACATGAAACTCCCAACAATTTAATACTGTTTTTAATGGTATTGCTACTGTTTTTACTTTCACATGTTAGATTGAGTGATTTTGCAAAACCCCTTCGTTTGCAAGGGGACATCTTGTCCGTTTTTCCATATACTGTTTTGGAAGCTTTTTTTTAATAATGAAATTTCTTGCATGTTTCTTATAAAAAGAAGAGAGAGAGAGCGTGAAAAGGAGAAAAATCTTCTTGGTAGTAGCTTCTTCTTAAAGTTGCTGGTTTTTCTTGTTGAGAGAGGCTTTTAGTTTTGACTTGGCAAAACTTGTTAGTCTTTGTTTGGCTGCAATTACTCCCACAAATTTGCCTCACATTGGCCCCAAGGTGTGTCCTTGTTGGTAAGAAATTACAATATTGGAACCCCTCCCAAATCTTTAATTTAAATAGTGGGTTCTAGGGATCGAGAGAATTAATTTCCTTGCCCATCTACTAGAGCTGGGGCTGCGGGTTTTGCTGGGGATTAGAGTTGGTGGGGCCTGGATTCCCCAATTTACAGCCATAATTTGCTTAATTGAATGAAAATAATGCTTTTTCTTTAATCAAAATTCATGTCTGATCTTCATATGTTAGTTTTTCATAATTCTATGATACGTCTTTTGCTTTCAGGGATGAACTTCTACCAATGGCTGACTTGGTAACCCCAAATTTGAAGGAAGCATCTGCCTTACTTGGCGGTATGCCCCTTAATACAATTTCTGACATGCGTCATGCTGCAACGCTAATCCATCAGATGGGATCCAAGTAAGTTTTTAATCCATCACATAGCTTGCGTTCTTTTGTAATTCCATTTTATCAATGAAATGAAAATCAATGTTTCTCATGAAAAAAGAAGAAGAAATTTTATGCTACCTCTAGTCACAAGAGACTGACTTTGACACTCAATTTTCCTCATTGCCGCCTGCCTCCTAGCAGTAATGTTAATACCATTCTACTTAATTCAAAGAAGTTTTTTTAAGTCTAAATTATAAAAATACCCCTGAATTTTACTCTTTGTTTCAAAAGTTCAGAGGTGTCTTTTATAATTTAACCATTTTTAAAACTTTAATTCTATGGTTATTCGGATTACATTAGCGATCGCTAGTCATTCTCCCTCCTTTGCCACCCAATCTTCACCTTCTTATAGACCCATCGAATCTTCGCCCCCTTGCCTTCTCAATCCCTTGACCAACCCTCCTCAACAATCGCTAGCCATCCTGACCCCAATCCCTTCTTCCTTTCCCTCCTCATCAGCCAATTAACCTACCTCACTGCATTTACCAATTGGCCTCGAGATGACTACAAATTCCATGGAGTAATCAGGGAC
AAAAAAAAGGCTCAACGAAAATCGTGATTGGCGACGAAGAAGACAGGAGGTGTATCTGATATTGGGAGAAAACCCTTCATATTTTCCTTGACCCTTATGTTAACCTCCATCATATTCATCCGAGAGAAAGATTTAGTGGCTGTATCGATGTATCCTCCATGGGCATTCCAAATTTTCTTGAAGGATTCAAAAGATCATTTCTTGAAGGATATTTATCAATTGGAAAATTTCTGATTTTCATCCATCCCCACTGGGTAGGCACTGGAGGATCTTTAAGAAAGGACTCCATACTCAAATTTTCAAATCTTACCCTATAAGTTCCCACCTTATACCACCTATATTTCCCAAAGTAGTCTCTCGTTCTTTGCCTTCCTCCACGAGGAGTGCCCTATCAGGTTGAATGGGTCTAATAAACCCAAAATCATATACTTATTGTTAAAGAGCTCATAAAATCACATGCCCGTCATTATGAAAGTGCACAAAAGATAAAAGGATTGATCTCTGCTTGTTTCAGTGATTTTTAACCCTTTTCCACCCTCAATCAGTGATTGGTGCGCACCGTCGAGTTAGGCAGCAGGAGAGCATGGTAGGGAGGGGTGGGATTGGGTTGGGTATGTGAGAAGAGAGAGAAGGGAACATTTTTTTTCTCTTAAATCAAACAGAGTGAAAGCCAATTGGAGAGCTTTGTTGTAATTCACCAATAGTTGCTTGGAGTTTTTACTCCCGTGTTTCATCAACTAGTGAAATTGTTTCTTCTTCTTCTTCTTCTTCAAAATAATATAACAATAAATTAATAAAAATCTTTCTAGTTTTCTAACAACAAAATGTTGTAGGTGCACATGGGTGTCCATGAAATTAGTGAGGTGTCCACCAGTTGGCCTAAATGCTCAATCCATGGATATCAACGAAAGGAAGAAAAACAATGGGGTTTATAAAATTTTTGGAAGGAAGAAAAAAAAAAAGGGAGCTGCAAAAAGAGTGACGGAGGGAGGTTGAGAGAAGCAAATGTTTAGTTTAGGCTTGGGTTTACTCAAATAACAGCATCCAAATTTCTCTTAGAATTATTTGTTTATTTATTTTTCTAATTTCTTATTGATTTAAAACCCTAGCTAATTGGATCTAAGAACCCTTATCAACCCTTAGATATTCTAACCCATAAATCACAAATTCTAAAACCCCCAAATGAAGATGAGTTCATCCCAAATGATTGTAAGTATCGAGAATTCAATTTGAAGAATAAGACCTTTAGATCTAAAATATCCCAGAACTCTAAAATGACAAGGAATACCCCAAGGAAAGTTAGATCCGGATTATTTTATTGATCAAATATCTCAAGTATCACAAGGCAAGAAAACTTGTTTGAGATTTGAATCACTTCACCAGCAGAATCAATCATGTCAAGCTTAAATGATTCTAAGCATACAACCTAAACTATATAGAATTGCAAATCAACTTAGCCCTTGGCTAAGAGAAAGCATGAATGCTATTTTTACTATATTTTCTAAGTCTATCTTACAAAGACAACATGCATGGCTTTATATAGCCTCAAAATGAAACCCTTGACCTTTCATGAGGCATTTCAAAAGTTGTAACCTTCATACTTCATGGCTATAATTGGCCACTAAGGAAATGTATCCTAGAAACATCATAAATTTAGATTATCATCTACCAAAGAATTTGTAACCATCCAAGAATAAATGAAGTTCCACCTGACGTAGTTTCATTGTTGCACAATTGAAGTTTCATTGTTGCACAATTGAAGTTTCATTGTTGCACAATTGAAGCTTCATTGTTGCACAATTGAAGCTTGATTCTTCTTTAATGTGACATTAATTGCAACTTGAGCTTATCTCGTGGTAATTTGAACCACATTTTTTCACATCTTTCTGACATGATTGTTAGTTCTGCATGTTACAACTATTAAGATAATTTTGAGTTCAATTTTCTACCCAATTAAATTCAACCTTGCACATTTTAGACGCCCCCTCCGCTTCATACTA
CCATTTTTATCAATTTACTCCTGACTGATGCTGAAGAGCGTTATGTTTGATTTTAGAAATGTACTTGTCAAAGGCGGGGACCTTCCAGATTCATTGGATGCTGTAGATATATTCTTTGATGGTAAGTACAGTACAGATTGCATAATTCTGACAATTTGATATATATATTTTTTTGTTATGAAGGAAGCCCATCCACTGAATCCATATTTTTTTCATAGGCAAGGATCTGCATGAGCTACGATCTTCACGCATAACGACTCGCAACACTCATGGCACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAGCTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGGTAATAATCTACGGACACTTATTTTAAACCCACCAATAAGAAGATTAAAGTCTCAACGGTTCATAATACAACTTCCTATTGAATAGTTAAATAATTTTTTAAAAATAAACATCTTCAATTTGTTATAGGATAATAGTTGTTGAAAATAACGCTATATTCAATATGTATGCTCACTGACCTTGACAGATGATCAAGACGGGATGTTAAGAGTTACTTTAATGCTTTTTTTTTTCTGGCTTTTCATTGAAATGATGAAAAATAAAAAAATGTTCTAGGGATACAAACTCCCAAAGGGAGTGAAAAAGGGAAAAAGGAAAAAAAATACAAACAGTACATACAACCAAATAGAGATTAGTCAGGAAAATAAATAAATTTTTAAGGAACCAAGAACTTTCTCTGAACACTTGGAGAGCTTGCCAATGAATAATGAAACTGTGAATAAAACCTCTATCAAACACTGAAATTCGATAGATGCTTCAAAACTTCAAAAAGAAAACCAAGAATCGACCACAACCAACAACTCTTGAGCCAATTTTACTCGTGACACCTTCATCAACTCTTGAGCTTTGCAAGAATTACTAACCATTGTTAAAAGCTTTGTAAGAACGATCACCGACAAAGAAATAAAAAAATATCCTCATGCAAGTCAATTCAAATATTTCATGCAAGTCGATTCAAATATCCCAGTGAAGACAAAAAAAAACTTTGTAAAGCAAAACCATGACGTGGCTACCCTCTCGAGCTAATGGAGAGTCTTCTTGTCACAGTCCTGAAATAATAAGCCAAAAGTTTAAGAAAAAGAATCTACAAGAACATCAACCTCCCTACTGAAATCCATGGATGAATGAGTGATTCCACGCTGTTCAAACGTACCTGGGAGTCAACCTCAAACACGTCATTACATGGTGGTGAGTTCTTTTTCGAGTGAATGACTCCTCTCGTTAAACCTTGAGTGAAGTCAAAATGGTTGTTGAGGGGATAAGTGAAGGGGGAGGGTGGTAAGGCGACATTCTAATTGAATAAGAATTTAAAGAAGTTTCCTGGAAGCATGAATGGTAAGAAGAAAAAAGCACATCTTTTTTGTGAGAATAATGCAAATGATAGGCTGGAATATATAAACATTCTTCCCCTCCTTTTTAACACTTGTGGGCTTGGAAATCTCAATCCGTAATGGAGGCCTCTAGGATTCTCAATAAATGAGGGATATATCTTTCACAGTCCTATTTGATAATCAAGCTTTTAAAAGTTGTACTTATTTTTTCACCTTTCTTTACAGCATCTTTCCTAATGAAACATTTAAATTCACGACCAAATTCTATTTACAAACCCAAGGTTTTCAAAACTACTTTTTTATGCACTAATCAAAATATGGTTGCCACTCAATCCAATTTGGAGCTTCAAATTCTTTGGTTATTTGTTTTTTTCTTTTTCTTTTTTTTTGGATGAAAGAGACGTATCATTTCACTCCATATATATAATTTGAGTAATTCAAAACGCAATAGGCTTGGTTGTCGTTACCAGACTTGATGATTGTCCACAACTAAGGTTTTATTTATCTGGATCTTTACAGATAAGCAAACAGTTTATTGAAAGAGCATTGAGTTACAGCAAGGACATTAGCATTGGAAATGGACCTCAAGGCCCATTCGATCAT
CTATGTCGTCTTAAGAGTCGAGAACAAAGTTCGTATAGACAGGGGAGTTTCAATCAATCTGACTTATTCTTGTATGCTGTTACGGACTCGGGTATGAATAAGCGTTGGGACCGTTCTATCACAGATGCTGTTAAAGCTGCAGTTGAAGGAGGTGCTACTATCATTCAGATAAGGTTTGTCTACTGTTTTCCATTAATCAATTGTTGTTAGTTTGTTTGCTTAAAAATGTTTTCAGAAGTTCATACATATTCAAGTCATTACTTCAAATGTATTGAATTCCTTGTTGCATTTTAATACCTCTTAGCACCTTCTCAGTTGTTCAAAGGTTGAATGGCTTTAGAAACTTACCCATAAAGTGTAGGCTAAATATGCTCCCTCTATAGGCTAGCAATAGTGAGGTAGTAATCTTGGGTTAATAGAACCCGACTGTAAGATTTGAATCTCTTAACGTGTTGCATTATTGTTGACCGAAGTTATAAACTTGGTATAATTAAAACTTGTATTCAATTGTTAATTTGCTTCCAATGGTCGCATGATCTTATTTTTATGAACAGGGAAAAGGATGCTAAAACTCACGATTTCTTGGAAGCAGCCAAATCATGTATAGAGATTTGTCACACACACGGTGTTCCATTACTGATCAACGATCGTATTGACGTCGCACTTGCCTGTGGTGCTGATGGTGTACACATTGGTCAGTCCGATATTCCTGTTCATGCAGCTCGTAGCCTTCTTGGCCCTGATAAGATTATCGGTGTCTCATGCAAGACACCGGAGCAAGCGGAACAGGCATGGCTTGATGGTGCTGATTACATCGGGTGTGGTGGAGTTTATCCCACAAACACAAAGGCAAACAATCTGACTGTTGGGCTTGATGGATTGAAAAGGGTTTGCTTAGCTTCCAAGTTGCCTGTGGTTGCAATTGGTGGAATTAATCACAGTAATGCAGCGGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCGGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGACCTTGAAGCTACATGCAACATTAGTGGAGGCTACAGCATTATCACCTGCGAAATGAGTGAGTGAGGGACCATAGTGCAACGACACAGTTTCTTGATTTTGATGAAATCAAATGTATGAATAATTGTAATGCTTTGATTTTAGTAGAAAGGTTTTTGAAATAAATGTAATTGTCTCAATGTGATTATAATGTAACAAAAATTTAATTTCGTGATAATAAGCATGGTCATTATATATTCTTTAAAA

mRNA sequence

CGCCCATTACCAGTCACTGTACTCCGTCCCGCCGCCTCCTGCCCGCCGCCTCCTCCTCCCCCTCCCCCTCACATTCTCTCGCCCTCCCAAGGACTCAAGAGTCACATTTCTTTGTTTAATTCCCTCAGAAACACAGTGGTTCAGCCCCTAACCCGCGCCGTTCTTGAGATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAGTTCAATCAAGTCTCTAGGTTTTGTATGGCCATGAAGAAGCCGGAAGAAACAGTTGTAGCTTCAAGTGATCGCTATGAGATGAGGATTCCACATGTGTTGAGTGTTGCTGGTTCTGATTCAGGAGCAGGGGCTGGAATCCAAGCGGATCTTAAGACTTGTGCTGCTCGTGGAGTGTACTGTTCCACTGTGATAACTGCTGTTACAGCACAGAACACCGTGGGGGTTCAGGATGTAAACATTATGCCGGAGGGCTTTGTTTCAGAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAAAACAGGAATGCTACCTTCTACTGGCATCATTCAGGTTATACGTCAGCGCCTGAAGGAGTTTCCTGTTCAAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACCAGTGGGGATGTTCTGGCTTGTCCTACAATTATTTCAGTGTTACAGGATGAACTTCTACCAATGGCTGACTTGGTAACCCCAAATTTGAAGGAAGCATCTGCCTTACTTGGCGGCGGGGACCTTCCAGATTCATTGGATGCTGTAGATATATTCTTTGATGGCAAGGATCTGCATGAGCTACGATCTTCACGCATAACGACTCGCAACACTCATGGCACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAGCTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGATAAGCAAACAGTTTATTGAAAGAGCATTGAGTTACAGCAAGGACATTAGCATTGGAAATGGACCTCAAGGCCCATTCGATCATCTATGTCGTCTTAAGAGTCGAGAACAAAGTTCGTATAGACAGGGGAGTTTCAATCAATCTGACTTATTCTTGTATGCTGTTACGGACTCGGGTATGAATAAGCGTTGGGACCGTTCTATCACAGATGCTGTTAAAGCTGCAGTTGAAGGAGGTGCTACTATCATTCAGATAAGGGAAAAGGATGCTAAAACTCACGATTTCTTGGAAGCAGCCAAATCATGTATAGAGATTTGTCACACACACGGTGTTCCATTACTGATCAACGATCGTATTGACGTCGCACTTGCCTGTGGTGCTGATGGTGTACACATTGGTCAGTCCGATATTCCTGTTCATGCAGCTCGTAGCCTTCTTGGCCCTGATAAGATTATCGGTGTCTCATGCAAGACACCGGAGCAAGCGGAACAGGCATGGCTTGATGGTGCTGATTACATCGGGTGTGGTGGAGTTTATCCCACAAACACAAAGGCAAACAATCTGACTGTTGGGCTTGATGGATTGAAAAGGGTTTGCTTAGCTTCCAAGTTGCCTGTGGTTGCAATTGGTGGAATTAATCACAGTAATGCAGCGGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCGGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGACCTTGAAGCTACATGCAACATTAGTGGAGGCTACAGCATTATCACCTGCGAAATGAGTGAGTGAGGGACCATAGTGCAACGACACAGTTTCTTGATTTTGATGAAATCAAATGTATGAATAATTGTAATGCTTTGATTTTAGTAGAAAGGTTTTTGAAATAAATGTAATTGTCTCAATGTGATTATAATGTAACAAAAATTTAATTTCGTGATAATAAGCATGGTCATTATATATTCTTTAAAA

Coding sequence (CDS)

ATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAGTTCAATCAAGTCTCTAGGTTTTGTATGGCCATGAAGAAGCCGGAAGAAACAGTTGTAGCTTCAAGTGATCGCTATGAGATGAGGATTCCACATGTGTTGAGTGTTGCTGGTTCTGATTCAGGAGCAGGGGCTGGAATCCAAGCGGATCTTAAGACTTGTGCTGCTCGTGGAGTGTACTGTTCCACTGTGATAACTGCTGTTACAGCACAGAACACCGTGGGGGTTCAGGATGTAAACATTATGCCGGAGGGCTTTGTTTCAGAGCAGCTGAAATCTGTTCTCTCTGATATGCAAGTGGATGTGGTGAAAACAGGAATGCTACCTTCTACTGGCATCATTCAGGTTATACGTCAGCGCCTGAAGGAGTTTCCTGTTCAAGCTTTGGTGGTTGATCCTGTCATGGTGTCTACCAGTGGGGATGTTCTGGCTTGTCCTACAATTATTTCAGTGTTACAGGATGAACTTCTACCAATGGCTGACTTGGTAACCCCAAATTTGAAGGAAGCATCTGCCTTACTTGGCGGCGGGGACCTTCCAGATTCATTGGATGCTGTAGATATATTCTTTGATGGCAAGGATCTGCATGAGCTACGATCTTCACGCATAACGACTCGCAACACTCATGGCACTGGATGCAGCTTAGCATCATGCATTGCAGCTGAGCTTGCTAAAGGGTCTTCAATGTTCTCAGCTGTTAAGATAAGCAAACAGTTTATTGAAAGAGCATTGAGTTACAGCAAGGACATTAGCATTGGAAATGGACCTCAAGGCCCATTCGATCATCTATGTCGTCTTAAGAGTCGAGAACAAAGTTCGTATAGACAGGGGAGTTTCAATCAATCTGACTTATTCTTGTATGCTGTTACGGACTCGGGTATGAATAAGCGTTGGGACCGTTCTATCACAGATGCTGTTAAAGCTGCAGTTGAAGGAGGTGCTACTATCATTCAGATAAGGGAAAAGGATGCTAAAACTCACGATTTCTTGGAAGCAGCCAAATCATGTATAGAGATTTGTCACACACACGGTGTTCCATTACTGATCAACGATCGTATTGACGTCGCACTTGCCTGTGGTGCTGATGGTGTACACATTGGTCAGTCCGATATTCCTGTTCATGCAGCTCGTAGCCTTCTTGGCCCTGATAAGATTATCGGTGTCTCATGCAAGACACCGGAGCAAGCGGAACAGGCATGGCTTGATGGTGCTGATTACATCGGGTGTGGTGGAGTTTATCCCACAAACACAAAGGCAAACAATCTGACTGTTGGGCTTGATGGATTGAAAAGGGTTTGCTTAGCTTCCAAGTTGCCTGTGGTTGCAATTGGTGGAATTAATCACAGTAATGCAGCGGCTGTGATGGAAATTGGTGTCCCAAATCTTAAAGGTGTTGCAGTTGTGTCGGCTCTTTTTGATAGGCAATGTGTTTTAGAGGAGACCTTGAAGCTACATGCAACATTAGTGGAGGCTACAGCATTATCACCTGCGAAATGA

Protein sequence

MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLGGGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEATALSPAK
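
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (the page does not name the translation table used), and it translates only the first 36 and last 15 bases of the CDS rather than the full sequence; `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGCTGCTGCCTCTCACTTTTCAGATTCCTAAG"  # first 36 bases of the CDS
cds_end = "TCACCTGCGAAATGA"                        # last 15 bases of the CDS

print(translate(cds_start))  # MVLLPLTFQIPK
print(translate(cds_end))    # SPAK
```

Both fragments agree with the start (MVLLPLTFQIPK...) and end (...SPAK) of the protein sequence listed above.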
BLAST of Cp4.1LG01g16710 vs. Swiss-Prot
Match: TPS1L_ARATH (Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thaliana GN=TH1 PE=1 SV=1)

HSP 1 Score: 663.7 bits (1711), Expect = 1.6e-189
Identity = 341/491 (69.45%), Positives = 397/491 (80.86%), Query Frame = 1

Query: 40  RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVS 99
           ++P VL+VAGSDSGAGAGIQADLK CAARGVYC++VITAVTAQNT GVQ V+++P  F+S
Sbjct: 29  KVPQVLTVAGSDSGAGAGIQADLKVCAARGVYCASVITAVTAQNTRGVQSVHLLPPEFIS 88

Query: 100 EQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTI 159
           EQLKSVLSD + DVVKTGMLPST I++V+ Q L +FPV+ALVVDPVMVSTSG VLA  +I
Sbjct: 89  EQLKSVLSDFEFDVVKTGMLPSTEIVEVLLQNLSDFPVRALVVDPVMVSTSGHVLAGSSI 148

Query: 160 ISVLQDELLPMADLVTPNLKEASALLGG----------------------------GDLP 219
           +S+ ++ LLP+AD++TPN+KEASALL G                            GDLP
Sbjct: 149 LSIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHEMGPRFVLVKGGDLP 208

Query: 220 DSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFI 279
           DS D+VD++FDGK+ HELRS RI TRNTHGTGC+LASCIAAELAKGSSM SAVK++K+F+
Sbjct: 209 DSSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKGSSMLSAVKVAKRFV 268

Query: 280 ERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDR 339
           + AL YSKDI IG+G QGPFDH   LK   QSS R   FN  DLFLYAVTDS MNK+W+R
Sbjct: 269 DNALDYSKDIVIGSGMQGPFDHFFGLKKDPQSS-RCSIFNPDDLFLYAVTDSRMNKKWNR 328

Query: 340 SITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACG 399
           SI DA+KAA+EGGATIIQ+REK+A+T +FLE AK+CI+IC +HGV LLINDRID+ALAC 
Sbjct: 329 SIVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVSLLINDRIDIALACD 388

Query: 400 ADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKAN 459
           ADGVH+GQSD+PV   RSLLGPDKIIGVSCKTPEQA QAW DGADYIG GGV+PTNTKAN
Sbjct: 389 ADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADYIGSGGVFPTNTKAN 448

Query: 460 NLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEE 503
           N T+GLDGLK VC ASKLPVVAIGGI  SNA +VM+I  PNLKGVAVVSALFD+ CVL +
Sbjct: 449 NRTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVAVVSALFDQDCVLTQ 508
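
The percentages in the HSP summary line for this hit follow directly from the reported counts. As a minimal sketch (the helper name `percent` is hypothetical), each value is the number of matching columns divided by the alignment length, which here includes gap columns:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the TPS1L_ARATH HSP above.
identity = percent(341, 491)
positives = percent(397, 491)

print(identity)   # 69.45
print(positives)  # 80.86
```

The same calculation applies to the other hits on this page, using the counts given in their own summary lines.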

BLAST of Cp4.1LG01g16710 vs. Swiss-Prot
Match: TPS1_BRANA (Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus GN=BTH1 PE=1 SV=1)

HSP 1 Score: 640.2 bits (1650), Expect = 1.9e-182
Identity = 328/491 (66.80%), Positives = 391/491 (79.63%), Query Frame = 1

Query: 40  RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVS 99
           ++  VL+VAGSDSGAGAGIQAD+K CAARGVYC++V TAV A+NT  VQ V+++P   VS
Sbjct: 31  KVAQVLTVAGSDSGAGAGIQADIKVCAARGVYCASVKTAVKAKNTRAVQSVHLLPPDSVS 90

Query: 100 EQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTI 159
           EQLKSVLSD +VDVVKTGMLPS  I++V+ Q L E+PV+ALVVDPVMVSTSG VLA  +I
Sbjct: 91  EQLKSVLSDFEVDVVKTGMLPSPEIVEVLLQNLSEYPVRALVVDPVMVSTSGHVLAGSSI 150

Query: 160 ISVLQDELLPMADLVTPNLKEASALLGG----------------------------GDLP 219
           +S+ ++ LLP+AD++TPN+KEASALLGG                            GDLP
Sbjct: 151 LSIFRERLLPLADIITPNVKEASALLGGVRIQTVAEMRSAAKSLHQMGPRFVLVKGGDLP 210

Query: 220 DSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFI 279
           DS D+VD++FDG + HEL S RI TRNTHGTGC+LASCIAAELAKGS+M SAVK++K+F+
Sbjct: 211 DSSDSVDVYFDGNEFHELHSPRIATRNTHGTGCTLASCIAAELAKGSNMLSAVKVAKRFV 270

Query: 280 ERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDR 339
           + AL+YSKDI IG+G QGPFDH   LK  +  SYRQ +F   DLFLYAVTDS MNK+W+R
Sbjct: 271 DSALNYSKDIVIGSGMQGPFDHFLSLK--DPQSYRQSTFKPDDLFLYAVTDSRMNKKWNR 330

Query: 340 SITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACG 399
           SI DAVKAA+EGGATIIQ+REK+A+T +FLE AKSC++IC ++GV LLINDR D+A+A  
Sbjct: 331 SIVDAVKAAIEGGATIIQLREKEAETREFLEEAKSCVDICRSNGVCLLINDRFDIAIALD 390

Query: 400 ADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKAN 459
           ADGVH+GQSD+PV   RSLLGPDKIIGVSCKT EQA QAW DGADYIG GGV+PTNTKAN
Sbjct: 391 ADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTQEQAHQAWKDGADYIGSGGVFPTNTKAN 450

Query: 460 NLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEE 503
           N T+GLDGL+ VC ASKLPVVAIGGI  SNA +VM IG PNLKGVAVVSALFD++CVL +
Sbjct: 451 NRTIGLDGLREVCKASKLPVVAIGGIGISNAESVMRIGEPNLKGVAVVSALFDQECVLTQ 510

BLAST of Cp4.1LG01g16710 vs. Swiss-Prot
Match: TPS1_ORYSJ (Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativa subsp. japonica GN=Os12g0192500 PE=2 SV=2)

HSP 1 Score: 630.2 bits (1624), Expect = 2.0e-179
Identity = 319/500 (63.80%), Positives = 388/500 (77.60%), Query Frame = 1

Query: 32  ASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVN 91
           ++S   EM  PHVL+VAGSDSG GAGIQAD+K CAA G YCS+V+TAVTAQNT GVQ ++
Sbjct: 47  SASAAREMPWPHVLTVAGSDSGGGAGIQADIKACAALGAYCSSVVTAVTAQNTAGVQGIH 106

Query: 92  IMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSG 151
           ++PE F+ EQL SVLSDM VDVVKTGMLPS G+++V+ + LK+FPV+ALVVDPVMVSTSG
Sbjct: 107 VVPEEFIREQLNSVLSDMSVDVVKTGMLPSIGVVRVLCESLKKFPVKALVVDPVMVSTSG 166

Query: 152 DVLACPTIISVLQDELLPMADLVTPNLKEASALLGG------------------------ 211
           D L+  + +SV +DEL  MAD+VTPN+KEAS LLGG                        
Sbjct: 167 DTLSESSTLSVYRDELFAMADIVTPNVKEASRLLGGVSLRTVSDMRNAAESIYKFGPKHV 226

Query: 212 ----GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSA 271
               GD+ +S DA D+FFDGK+  EL + RI T NTHGTGC+LASCIA+ELAKG++M  A
Sbjct: 227 LVKGGDMLESSDATDVFFDGKEFIELHAHRIKTHNTHGTGCTLASCIASELAKGATMLHA 286

Query: 272 VKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDS 331
           V+++K F+E AL +SKD+ +GNGPQGPFDHL +LK    +   Q SF    LFLYAVTDS
Sbjct: 287 VQVAKNFVESALHHSKDLVVGNGPQGPFDHLFKLKCPPYNVGSQPSFKPDQLFLYAVTDS 346

Query: 332 GMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDR 391
           GMNK+W RSI +AV+AA+EGGATI+Q+REKD++T +FLEAAK+C+EIC + GVPLLINDR
Sbjct: 347 GMNKKWGRSIKEAVQAAIEGGATIVQLREKDSETREFLEAAKACMEICKSSGVPLLINDR 406

Query: 392 IDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGV 451
           +D+ALAC ADGVH+GQ D+  H  R LLGP KIIGVSCKTP QA+QAW DGADYIGCGGV
Sbjct: 407 VDIALACNADGVHVGQLDMSAHEVRELLGPGKIIGVSCKTPAQAQQAWNDGADYIGCGGV 466

Query: 452 YPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALF 504
           +PT+TKANN T+G DGLK VCLASKLPVVAIGGIN SNA +VME+G+PNLKGVAVVSALF
Sbjct: 467 FPTSTKANNPTLGFDGLKTVCLASKLPVVAIGGINASNAGSVMELGLPNLKGVAVVSALF 526

BLAST of Cp4.1LG01g16710 vs. Swiss-Prot
Match: THID_RHIME (Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase OS=Rhizobium meliloti (strain 1021) GN=thiD PE=3 SV=2)

HSP 1 Score: 182.2 bits (461), Expect = 1.4e-44
Identity = 118/260 (45.38%), Positives = 152/260 (58.46%), Query Frame = 1

Query: 45  LSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKS 104
           LS+AGSDSG GAGIQADLKT +A GVY ++VITA+TAQNT GV  V  +    VS Q+ +
Sbjct: 6   LSIAGSDSGGGAGIQADLKTFSALGVYGASVITAITAQNTRGVTAVEDVSAEIVSAQMDA 65

Query: 105 VLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQ 164
           V SD+ V  VK GM+     I  I   L+ F  +A VVDPVMV+TSGD L  P  ++ L 
Sbjct: 66  VFSDLDVKAVKIGMVSRRETIAAIADGLRRFGKRA-VVDPVMVATSGDALLRPDAVAALI 125

Query: 165 DELLPMADLVTPNLKEASALLG----------------------------GGDLPDSLDA 224
           +ELLP+A +VTPNL EA+ + G                            GG L    +A
Sbjct: 126 EELLPLALVVTPNLAEAALMTGRAIAGDEAEMARQAEAIMRTGAHAVLVKGGHLKGQ-EA 185

Query: 225 VDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALS 277
            D+FFDG  L  L + RI TRN HGTGC+L++ IAA LAKG  +  AV  +K ++  A+S
Sbjct: 186 TDLFFDGDTLVRLPAGRIETRNDHGTGCTLSAAIAAGLAKGVPLIEAVSAAKAYLHAAIS 245

BLAST of Cp4.1LG01g16710 vs. Swiss-Prot
Match: THIED_GEOSL (Thiamine biosynthesis bifunctional protein ThiED OS=Geobacter sulfurreducens (strain ATCC 51573 / DSM 12127 / PCA) GN=thiDE PE=3 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 7.1e-44
Identity = 113/251 (45.02%), Positives = 149/251 (59.36%), Query Frame = 1

Query: 44  VLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLK 103
           VL+VAGSDSG GAGIQADLKT    G Y S+V+TA+TAQNT GV  ++ +P  FV++QL 
Sbjct: 228 VLTVAGSDSGGGAGIQADLKTVTLLGSYGSSVLTALTAQNTRGVSGIHGVPPAFVADQLD 287

Query: 104 SVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVL 163
           +V SD+ VDVVKTGML S   I  I  +L E+  + +VVDPVMV+  G  L     +SVL
Sbjct: 288 AVFSDIPVDVVKTGMLFSAETIVAIAAKLTEYRRRMVVVDPVMVAKGGANLIDRGAVSVL 347

Query: 164 QDELLPMADLVTPNLKEASALLG---------------------------GGDLPDSLDA 223
           ++ L P+A LVTPN+ EA  L G                           GG L    D+
Sbjct: 348 KERLFPLAYLVTPNIPEAERLTGANISDEESMREAARRLHRLGARNVLLKGGHLLAG-DS 407

Query: 224 VDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALS 268
           VDI FDG   H   S RI ++NTHGTGC+ AS IA  LA+G  +  A+  +K++I  A+ 
Sbjct: 408 VDILFDGAAFHRFVSPRILSKNTHGTGCTFASAIATYLAQGDPLREAIARAKRYITAAIR 467

BLAST of Cp4.1LG01g16710 vs. TrEMBL
Match: D7TGZ0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0035g00320 PE=3 SV=1)

HSP 1 Score: 740.3 bits (1910), Expect = 1.5e-210
Identity = 377/495 (76.16%), Positives = 423/495 (85.45%), Query Frame = 1

Query: 39  MRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFV 98
           M+IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNTVGVQ VNI+PE FV
Sbjct: 1   MKIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTVGVQGVNIVPEDFV 60

Query: 99  SEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPT 158
           +EQLKSVLSDM VDVVKTGMLP+ GI++V+   LKEFPVQALVVDPVMVSTSGDVLA P+
Sbjct: 61  AEQLKSVLSDMHVDVVKTGMLPTIGIVKVLHHSLKEFPVQALVVDPVMVSTSGDVLAGPS 120

Query: 159 IISVLQDELLPMADLVTPNLKEASALLGG----------------------------GDL 218
           I++  ++ELLPMAD+VTPNLKEASALLGG                            GDL
Sbjct: 121 ILAAFREELLPMADIVTPNLKEASALLGGLQLETVSDMCTAAKLIHDMGPRNVLVKGGDL 180

Query: 219 PDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQF 278
           P SLDAVDIFFDG D +ELRSSRI TRNTHGTGC+LASCIAAELAKGS + SAVK +K +
Sbjct: 181 PSSLDAVDIFFDGDDFYELRSSRIKTRNTHGTGCTLASCIAAELAKGSQILSAVKAAKHY 240

Query: 279 IERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWD 338
           IE AL YSKDI+IGNG QGPFDHL +LKS  ++S+R+ +FN ++LFLYAVTDSGMNK+W 
Sbjct: 241 IETALDYSKDIAIGNGFQGPFDHLLKLKSNIRNSFRKQAFNPANLFLYAVTDSGMNKKWG 300

Query: 339 RSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALAC 398
           RSIT+AVKAA+EGGATI+Q+REKDA+T DFLEAAK+C+EICH+HGVPLLINDRIDVALAC
Sbjct: 301 RSITEAVKAAIEGGATIVQLREKDAETRDFLEAAKACVEICHSHGVPLLINDRIDVALAC 360

Query: 399 GADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKA 458
            ADGVH+GQSDIP    R+LLGP+KIIGVSCKTPEQAE+AW+DGADYIGCGGVYPTNTKA
Sbjct: 361 DADGVHVGQSDIPARVVRTLLGPEKIIGVSCKTPEQAEKAWIDGADYIGCGGVYPTNTKA 420

Query: 459 NNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLE 506
           NN+TVGLDGLK VCLASKLPVVAIGGIN SNA  VMEIGVPNLKGVAVVSALFDR+CVL 
Sbjct: 421 NNITVGLDGLKTVCLASKLPVVAIGGINASNARTVMEIGVPNLKGVAVVSALFDRECVLT 480

BLAST of Cp4.1LG01g16710 vs. TrEMBL
Match: A0A067EGP5_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g010260mg PE=3 SV=1)

HSP 1 Score: 728.8 bits (1880), Expect = 4.6e-207
Identity = 372/498 (74.70%), Positives = 422/498 (84.74%), Query Frame = 1

Query: 33  SSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNI 92
           ++++Y+M+IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNT GVQ VNI
Sbjct: 12  TTEQYKMKIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTAGVQGVNI 71

Query: 93  MPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGD 152
           +PE FV+ QLKSVLSDMQVDVVKTGMLPST +++V+ Q L EFPV+ALVVDPVMVSTSGD
Sbjct: 72  VPEDFVAAQLKSVLSDMQVDVVKTGMLPSTDLVKVLLQSLSEFPVRALVVDPVMVSTSGD 131

Query: 153 VLACPTIISVLQDELLPMADLVTPNLKEASALLGG------------------------- 212
           VLA P+ I+ L++ LLPMAD+VTPN+KEASALLGG                         
Sbjct: 132 VLAGPSTITGLRENLLPMADIVTPNVKEASALLGGMQVVTVADMCSAAKLLHNLGPRTVL 191

Query: 213 ---GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAV 272
              GDLPDS DAVDIFFDG+D HELRSSR+ TRNTHGTGC+LASCIAAELAKGS M SAV
Sbjct: 192 VKGGDLPDSSDAVDIFFDGEDFHELRSSRVNTRNTHGTGCTLASCIAAELAKGSPMLSAV 251

Query: 273 KISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSG 332
           K++K F+E AL YSKDI IG+GPQGPFDHL RLKS  + S+R  +FN SDLFLYAVTDSG
Sbjct: 252 KVAKCFVETALDYSKDIVIGSGPQGPFDHLLRLKSTSRQSHRAEAFNPSDLFLYAVTDSG 311

Query: 333 MNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRI 392
           MNK+W RSITDAVKAA+EGGATIIQ+REKDA T  FLEAAK+C++IC  HGVPLLINDRI
Sbjct: 312 MNKKWGRSITDAVKAALEGGATIIQLREKDADTRGFLEAAKACLQICCVHGVPLLINDRI 371

Query: 393 DVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVY 452
           D+ALAC ADGVH+GQSD+P   AR+LLGPDKIIGVSCKTPE+A QAW+DGA+YIGCGGVY
Sbjct: 372 DIALACDADGVHLGQSDMPARTARALLGPDKIIGVSCKTPEEAHQAWIDGANYIGCGGVY 431

Query: 453 PTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFD 503
           PTNTKANNLTVGLDGLK VCLASKLPVVAIGGI  SNA+ VM+IGV NLKGVAVVSALFD
Sbjct: 432 PTNTKANNLTVGLDGLKTVCLASKLPVVAIGGIGISNASDVMKIGVSNLKGVAVVSALFD 491

BLAST of Cp4.1LG01g16710 vs. TrEMBL
Match: M5XRD0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003609mg PE=3 SV=1)

HSP 1 Score: 728.0 bits (1878), Expect = 7.8e-207
Identity = 371/500 (74.20%), Positives = 421/500 (84.20%), Query Frame = 1

Query: 33  SSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNI 92
           +S++  + IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNT GVQ VN+
Sbjct: 60  TSNQSTINIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTAGVQGVNV 119

Query: 93  MPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGD 152
           +PE FV+EQ+KSVLSDM VDVVKTGMLPS GI++++ Q L+E+PV+ALVVDPVMVSTSGD
Sbjct: 120 VPEDFVAEQMKSVLSDMHVDVVKTGMLPSIGIVKILHQHLREYPVRALVVDPVMVSTSGD 179

Query: 153 VLACPTIISVLQDELLPMADLVTPNLKEASALLG-------------------------- 212
           VLA P++++  ++ELLPMA+++TPNLKEASALL                           
Sbjct: 180 VLAGPSVLAGFREELLPMANIITPNLKEASALLDGVKIKTVSDMRSAAKLLHDKGARNVL 239

Query: 213 --GGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAV 272
             GGDLPDSLDAVDIFFDG+ L+ELRSSRI TRNTHGTGC+LASCIAAELAKG+SM  AV
Sbjct: 240 VKGGDLPDSLDAVDIFFDGEHLYELRSSRIKTRNTHGTGCTLASCIAAELAKGASMLEAV 299

Query: 273 KISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSG 332
           K++K F+E AL YSK+I IGNGPQGPFDHL +LKS   +S RQ  FN SDLFLYAVTDSG
Sbjct: 300 KVAKCFVETALDYSKEIFIGNGPQGPFDHLMKLKSNAHNSGRQVRFNPSDLFLYAVTDSG 359

Query: 333 MNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRI 392
           MN+RW  SI+DAVKAAV+GGATI+Q+REKD +T DF+EAAKSC++IC  HGVPLLINDRI
Sbjct: 360 MNRRWGHSISDAVKAAVQGGATIVQLREKDIETRDFVEAAKSCLQICRAHGVPLLINDRI 419

Query: 393 DVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVY 452
           DVALAC ADGVHIGQSD+P H AR+LLGP+KIIGVSCKTPEQAEQAW+ GADYIGCGGVY
Sbjct: 420 DVALACDADGVHIGQSDMPAHTARALLGPEKIIGVSCKTPEQAEQAWIAGADYIGCGGVY 479

Query: 453 PTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFD 505
           PTNTKANNLTVGLDGLK VCLASKLPVVAIGGI  SNA  VMEIGVPNLKGVAVVSA+FD
Sbjct: 480 PTNTKANNLTVGLDGLKTVCLASKLPVVAIGGIKVSNARPVMEIGVPNLKGVAVVSAIFD 539

BLAST of Cp4.1LG01g16710 vs. TrEMBL
Match: V4T108_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000782mg PE=3 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 8.7e-206
Identity = 371/498 (74.50%), Positives = 419/498 (84.14%), Query Frame = 1

Query: 33  SSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNI 92
           ++++Y+M+IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNT GVQ VNI
Sbjct: 12  TTEQYKMKIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTAGVQGVNI 71

Query: 93  MPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGD 152
           +PE FV+ QLKSVLSDMQVDVVKTGMLPST + +V+ Q L EFPV+ALVVDPVMVSTSGD
Sbjct: 72  VPEDFVAAQLKSVLSDMQVDVVKTGMLPSTDLAKVLLQSLSEFPVRALVVDPVMVSTSGD 131

Query: 153 VLACPTIISVLQDELLPMADLVTPNLKEASALLGG------------------------- 212
           VLA P+ I+ L++ LLPMAD+VTPN+KEASALLGG                         
Sbjct: 132 VLAGPSTITGLRENLLPMADIVTPNVKEASALLGGMQVVTVADMCSAAKLLHNLGPRTVL 191

Query: 213 ---GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAV 272
              GDLPDS DAVDIFFDG+D HELRSSR+ TRNTHGTGC+LASCIAAELAKGS M SAV
Sbjct: 192 VKGGDLPDSSDAVDIFFDGEDFHELRSSRVNTRNTHGTGCTLASCIAAELAKGSPMLSAV 251

Query: 273 KISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSG 332
           K++K F+E AL YSKDI IG+GPQGPFDHL RLKS    S+R  +FN SDLFLYAVTDSG
Sbjct: 252 KVAKCFVETALDYSKDIVIGSGPQGPFDHLLRLKSTSHQSHRAEAFNPSDLFLYAVTDSG 311

Query: 333 MNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRI 392
            NK+W RSITDAVKAA+EGGATIIQ+REKDA T  FLEAAK+C++IC  HGVPLLINDRI
Sbjct: 312 TNKKWGRSITDAVKAALEGGATIIQLREKDADTRGFLEAAKACLQICCVHGVPLLINDRI 371

Query: 393 DVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVY 452
           D+ALAC ADGVH+GQSD+P   AR+LLGPDKIIGVSCKTPE+A QAW+DGA+YIGCGGVY
Sbjct: 372 DIALACDADGVHLGQSDMPARTARALLGPDKIIGVSCKTPEEAHQAWIDGANYIGCGGVY 431

Query: 453 PTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFD 503
           PTNTKANNLTVGLDGLK VCLASKLPVVAIGGI  SNA+ VM+IGV NLKGVAVVSALFD
Sbjct: 432 PTNTKANNLTVGLDGLKTVCLASKLPVVAIGGIGISNASDVMKIGVSNLKGVAVVSALFD 491

BLAST of Cp4.1LG01g16710 vs. TrEMBL
Match: V4T678_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000782mg PE=3 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 8.7e-206
Identity = 371/498 (74.50%), Positives = 419/498 (84.14%), Query Frame = 1

Query: 33  SSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNI 92
           ++++Y+M+IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNT GVQ VNI
Sbjct: 40  TTEQYKMKIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTAGVQGVNI 99

Query: 93  MPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGD 152
           +PE FV+ QLKSVLSDMQVDVVKTGMLPST + +V+ Q L EFPV+ALVVDPVMVSTSGD
Sbjct: 100 VPEDFVAAQLKSVLSDMQVDVVKTGMLPSTDLAKVLLQSLSEFPVRALVVDPVMVSTSGD 159

Query: 153 VLACPTIISVLQDELLPMADLVTPNLKEASALLGG------------------------- 212
           VLA P+ I+ L++ LLPMAD+VTPN+KEASALLGG                         
Sbjct: 160 VLAGPSTITGLRENLLPMADIVTPNVKEASALLGGMQVVTVADMCSAAKLLHNLGPRTVL 219

Query: 213 ---GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAV 272
              GDLPDS DAVDIFFDG+D HELRSSR+ TRNTHGTGC+LASCIAAELAKGS M SAV
Sbjct: 220 VKGGDLPDSSDAVDIFFDGEDFHELRSSRVNTRNTHGTGCTLASCIAAELAKGSPMLSAV 279

Query: 273 KISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSG 332
           K++K F+E AL YSKDI IG+GPQGPFDHL RLKS    S+R  +FN SDLFLYAVTDSG
Sbjct: 280 KVAKCFVETALDYSKDIVIGSGPQGPFDHLLRLKSTSHQSHRAEAFNPSDLFLYAVTDSG 339

Query: 333 MNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRI 392
            NK+W RSITDAVKAA+EGGATIIQ+REKDA T  FLEAAK+C++IC  HGVPLLINDRI
Sbjct: 340 TNKKWGRSITDAVKAALEGGATIIQLREKDADTRGFLEAAKACLQICCVHGVPLLINDRI 399

Query: 393 DVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVY 452
           D+ALAC ADGVH+GQSD+P   AR+LLGPDKIIGVSCKTPE+A QAW+DGA+YIGCGGVY
Sbjct: 400 DIALACDADGVHLGQSDMPARTARALLGPDKIIGVSCKTPEEAHQAWIDGANYIGCGGVY 459

Query: 453 PTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFD 503
           PTNTKANNLTVGLDGLK VCLASKLPVVAIGGI  SNA+ VM+IGV NLKGVAVVSALFD
Sbjct: 460 PTNTKANNLTVGLDGLKTVCLASKLPVVAIGGIGISNASDVMKIGVSNLKGVAVVSALFD 519
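
The percentages in an HSP summary line are the listed counts divided by the alignment length. As a minimal sketch, the snippet below parses the summary line of the V4T678_9ROSI hit and recomputes both values. The names `parse_hsp_summary` and `percent` are hypothetical helpers, and the pattern is an assumption about this page's line format. It tolerates the "Postives" misspelling that some outputs carry.

```python
import re

# Matches e.g. "Identity = 371/498 (74.50%), Positives = 419/498 (84.14%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract (count, length, reported percent) for identity and positives."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Percentage rounded to two decimals, as printed in the summary."""
    return round(100.0 * count / length, 2)


line = "Identity = 371/498 (74.50%), Positives = 419/498 (84.14%), Query Frame = 1"
hsp = parse_hsp_summary(line)
for key, (count, length, reported) in hsp.items():
    print(key, percent(count, length), reported)
```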

BLAST of Cp4.1LG01g16710 vs. TAIR10
Match: AT1G22940.1 (AT1G22940.1 thiamin biosynthesis protein, putative)

HSP 1 Score: 663.7 bits (1711), Expect = 9.2e-191
Identity = 341/491 (69.45%), Positives = 397/491 (80.86%), Query Frame = 1

Query: 40  RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVS 99
           ++P VL+VAGSDSGAGAGIQADLK CAARGVYC++VITAVTAQNT GVQ V+++P  F+S
Sbjct: 29  KVPQVLTVAGSDSGAGAGIQADLKVCAARGVYCASVITAVTAQNTRGVQSVHLLPPEFIS 88

Query: 100 EQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTI 159
           EQLKSVLSD + DVVKTGMLPST I++V+ Q L +FPV+ALVVDPVMVSTSG VLA  +I
Sbjct: 89  EQLKSVLSDFEFDVVKTGMLPSTEIVEVLLQNLSDFPVRALVVDPVMVSTSGHVLAGSSI 148

Query: 160 ISVLQDELLPMADLVTPNLKEASALLGG----------------------------GDLP 219
           +S+ ++ LLP+AD++TPN+KEASALL G                            GDLP
Sbjct: 149 LSIFRERLLPIADIITPNVKEASALLDGFRIETVAEMRSAAKSLHEMGPRFVLVKGGDLP 208

Query: 220 DSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFI 279
           DS D+VD++FDGK+ HELRS RI TRNTHGTGC+LASCIAAELAKGSSM SAVK++K+F+
Sbjct: 209 DSSDSVDVYFDGKEFHELRSPRIATRNTHGTGCTLASCIAAELAKGSSMLSAVKVAKRFV 268

Query: 280 ERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDR 339
           + AL YSKDI IG+G QGPFDH   LK   QSS R   FN  DLFLYAVTDS MNK+W+R
Sbjct: 269 DNALDYSKDIVIGSGMQGPFDHFFGLKKDPQSS-RCSIFNPDDLFLYAVTDSRMNKKWNR 328

Query: 340 SITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACG 399
           SI DA+KAA+EGGATIIQ+REK+A+T +FLE AK+CI+IC +HGV LLINDRID+ALAC 
Sbjct: 329 SIVDALKAAIEGGATIIQLREKEAETREFLEEAKACIDICRSHGVSLLINDRIDIALACD 388

Query: 400 ADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKAN 459
           ADGVH+GQSD+PV   RSLLGPDKIIGVSCKTPEQA QAW DGADYIG GGV+PTNTKAN
Sbjct: 389 ADGVHVGQSDMPVDLVRSLLGPDKIIGVSCKTPEQAHQAWKDGADYIGSGGVFPTNTKAN 448

Query: 460 NLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEE 503
           N T+GLDGLK VC ASKLPVVAIGGI  SNA +VM+I  PNLKGVAVVSALFD+ CVL +
Sbjct: 449 NRTIGLDGLKEVCEASKLPVVAIGGIGISNAGSVMQIDAPNLKGVAVVSALFDQDCVLTQ 508

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: gi|659120551|ref|XP_008460243.1| (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 874.4 bits (2258), Expect = 9.7e-251
Identity = 451/531 (84.93%), Positives = 475/531 (89.45%), Query Frame = 1

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MV LPL  QIPKFNQVSRFCMAMKK EE VVASSDRYE RIPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCMAMKKQEEMVVASSDRYETRIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITA+TAQNTVGVQDVNI+PEGFVS+QLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNIVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGI+QV+ Q LKEFPV+ALVVDPVMVSTSGDVLA PTIISVLQ+ELLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKE 180

Query: 181 ASALLG----------------------------GGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG                            GGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGGMPLKTISDMRHAATLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RI +RNTHGTGCSLASCI+AELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCISAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLCRLKSREQSSY QG FN +DLFLYAVTDSGMN+RWDRSITDAVKAAVEGGATI+QIRE
Sbjct: 301 HLCRLKSREQSSYSQGCFNPTDLFLYAVTDSGMNERWDRSITDAVKAAVEGGATIVQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLE AKSCI+ICH HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICHAHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           P+K+IGVSCKT EQAEQAW+DGADYIGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVV
Sbjct: 421 PNKVIGVSCKTMEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEAT 504
           AIGGINH+NAAAVM IG+PNL+GVAVVSALFDRQCVLE   KLHATLVEAT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLRGVAVVSALFDRQCVLEAASKLHATLVEAT 531

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: gi|449445302|ref|XP_004140412.1| (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic [Cucumis sativus])

HSP 1 Score: 854.7 bits (2207), Expect = 8.0e-245
Identity = 443/531 (83.43%), Positives = 469/531 (88.32%), Query Frame = 1

Query: 1   MVLLPLTFQIPKFNQVSRFCMAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQA 60
           MV LPL  QIPKFNQVSRFCMAMKK +E VVASS+R E  IPHVLSVAGSDSGAGAGIQA
Sbjct: 1   MVPLPLISQIPKFNQVSRFCMAMKKQDEMVVASSNRNETCIPHVLSVAGSDSGAGAGIQA 60

Query: 61  DLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLP 120
           DLKTCAARGVYCSTVITA+TAQNTVGVQDVN++PEGFVS+QLKSVLSDMQVDVVKTGMLP
Sbjct: 61  DLKTCAARGVYCSTVITAITAQNTVGVQDVNVVPEGFVSKQLKSVLSDMQVDVVKTGMLP 120

Query: 121 STGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKE 180
           STGI+QV+ Q LKEFPV+ALVVDPVMVSTSGDVLA P IISVLQ++LLPMADLVTPNLKE
Sbjct: 121 STGIVQVLHQCLKEFPVRALVVDPVMVSTSGDVLAGPAIISVLQEKLLPMADLVTPNLKE 180

Query: 181 ASALLG----------------------------GGDLPDSLDAVDIFFDGKDLHELRSS 240
           ASALLG                            GGDLPDSLDAVDIFFDGKDLHELRSS
Sbjct: 181 ASALLGDMPLTTISDMRHAAMLIYQMGSKNVLIKGGDLPDSLDAVDIFFDGKDLHELRSS 240

Query: 241 RITTRNTHGTGCSLASCIAAELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFD 300
           RI +RNTHGTGCSLASCIAAELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFD
Sbjct: 241 RIKSRNTHGTGCSLASCIAAELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFD 300

Query: 301 HLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIRE 360
           HLC LK+RE SSY QG FN +DLFLYAVTDSGMN+RWDRSITDAVK AVEGGATI+QIRE
Sbjct: 301 HLCCLKNREPSSYSQGCFNPADLFLYAVTDSGMNERWDRSITDAVKDAVEGGATIVQIRE 360

Query: 361 KDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLG 420
           KDAKT DFLE AKSCI+IC  HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLG
Sbjct: 361 KDAKTRDFLEVAKSCIKICRAHGVPLLINDRIDIALACNADGVHVGQSDIPAHEVRRLLG 420

Query: 421 PDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVV 480
           P+KIIGVSCKT EQAEQAW+DGADYIGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVV
Sbjct: 421 PNKIIGVSCKTTEQAEQAWIDGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVV 480

Query: 481 AIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQCVLEETLKLHATLVEAT 504
           AIGGINH+NAAAVM IG+PNLKGVAVVSALFDRQCVLEE  KLHATLVEAT
Sbjct: 481 AIGGINHTNAAAVMGIGIPNLKGVAVVSALFDRQCVLEEASKLHATLVEAT 531

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: gi|659120553|ref|XP_008460244.1| (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 843.2 bits (2177), Expect = 2.4e-241
Identity = 434/511 (84.93%), Positives = 458/511 (89.63%), Query Frame = 1

Query: 21  MAMKKPEETVVASSDRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVT 80
           MAMKK EE VVASSDRYE RIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITA+T
Sbjct: 1   MAMKKQEEMVVASSDRYETRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAIT 60

Query: 81  AQNTVGVQDVNIMPEGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQAL 140
           AQNTVGVQDVNI+PEGFVS+QLKSVLSDMQVDVVKTGMLPSTGI+QV+ Q LKEFPV+AL
Sbjct: 61  AQNTVGVQDVNIVPEGFVSKQLKSVLSDMQVDVVKTGMLPSTGIVQVLHQCLKEFPVRAL 120

Query: 141 VVDPVMVSTSGDVLACPTIISVLQDELLPMADLVTPNLKEASALLG-------------- 200
           VVDPVMVSTSGDVLA PTIISVLQ+ELLPMADLVTPNLKEASALLG              
Sbjct: 121 VVDPVMVSTSGDVLAGPTIISVLQEELLPMADLVTPNLKEASALLGGMPLKTISDMRHAA 180

Query: 201 --------------GGDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAA 260
                         GGDLPDSLDAVDIFFDGKDLHELRSSRI +RNTHGTGCSLASCI+A
Sbjct: 181 TLIHQMGSKNVLVKGGDLPDSLDAVDIFFDGKDLHELRSSRIKSRNTHGTGCSLASCISA 240

Query: 261 ELAKGSSMFSAVKISKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQ 320
           ELAKGSSMFSAVK SKQFIERAL YSKDI+IG+GPQGPFDHLCRLKSREQSSY QG FN 
Sbjct: 241 ELAKGSSMFSAVKASKQFIERALRYSKDINIGHGPQGPFDHLCRLKSREQSSYSQGCFNP 300

Query: 321 SDLFLYAVTDSGMNKRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICH 380
           +DLFLYAVTDSGMN+RWDRSITDAVKAAVEGGATI+QIREKDAKT DFLE AKSCI+ICH
Sbjct: 301 TDLFLYAVTDSGMNERWDRSITDAVKAAVEGGATIVQIREKDAKTRDFLEVAKSCIKICH 360

Query: 381 THGVPLLINDRIDVALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWL 440
            HGVPLLINDRID+ALAC ADGVH+GQSDIP H  R LLGP+K+IGVSCKT EQAEQAW+
Sbjct: 361 AHGVPLLINDRIDIALACDADGVHVGQSDIPAHEVRRLLGPNKVIGVSCKTMEQAEQAWI 420

Query: 441 DGADYIGCGGVYPTNTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPN 500
           DGADYIGCGGVYPTNTKANNLTVG+DGLKRVCLASKLPVVAIGGINH+NAAAVM IG+PN
Sbjct: 421 DGADYIGCGGVYPTNTKANNLTVGIDGLKRVCLASKLPVVAIGGINHTNAAAVMGIGIPN 480

Query: 501 LKGVAVVSALFDRQCVLEETLKLHATLVEAT 504
           L+GVAVVSALFDRQCVLE   KLHATLVEAT
Sbjct: 481 LRGVAVVSALFDRQCVLEAASKLHATLVEAT 511

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: gi|731411629|ref|XP_010658056.1| (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X1 [Vitis vinifera])

HSP 1 Score: 741.9 bits (1914), Expect = 7.5e-211
Identity = 378/499 (75.75%), Positives = 425/499 (85.17%), Query Frame = 1

Query: 35  DRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMP 94
           D  +M+IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNTVGVQ VNI+P
Sbjct: 65  DDSKMKIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTVGVQGVNIVP 124

Query: 95  EGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVL 154
           E FV+EQLKSVLSDM VDVVKTGMLP+ GI++V+   LKEFPVQALVVDPVMVSTSGDVL
Sbjct: 125 EDFVAEQLKSVLSDMHVDVVKTGMLPTIGIVKVLHHSLKEFPVQALVVDPVMVSTSGDVL 184

Query: 155 ACPTIISVLQDELLPMADLVTPNLKEASALLGG--------------------------- 214
           A P+I++  ++ELLPMAD+VTPNLKEASALLGG                           
Sbjct: 185 AGPSILAAFREELLPMADIVTPNLKEASALLGGLQLETVSDMCTAAKLIHDMGPRNVLVK 244

Query: 215 -GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKI 274
            GDLP SLDAVDIFFDG D +ELRSSRI TRNTHGTGC+LASCIAAELAKGS + SAVK 
Sbjct: 245 GGDLPSSLDAVDIFFDGDDFYELRSSRIKTRNTHGTGCTLASCIAAELAKGSQILSAVKA 304

Query: 275 SKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMN 334
           +K +IE AL YSKDI+IGNG QGPFDHL +LKS  ++S+R+ +FN ++LFLYAVTDSGMN
Sbjct: 305 AKHYIETALDYSKDIAIGNGFQGPFDHLLKLKSNIRNSFRKQAFNPANLFLYAVTDSGMN 364

Query: 335 KRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDV 394
           K+W RSIT+AVKAA+EGGATI+Q+REKDA+T DFLEAAK+C+EICH+HGVPLLINDRIDV
Sbjct: 365 KKWGRSITEAVKAAIEGGATIVQLREKDAETRDFLEAAKACVEICHSHGVPLLINDRIDV 424

Query: 395 ALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPT 454
           ALAC ADGVH+GQSDIP    R+LLGP+KIIGVSCKTPEQAE+AW+DGADYIGCGGVYPT
Sbjct: 425 ALACDADGVHVGQSDIPARVVRTLLGPEKIIGVSCKTPEQAEKAWIDGADYIGCGGVYPT 484

Query: 455 NTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQ 506
           NTKANN+TVGLDGLK VCLASKLPVVAIGGIN SNA  VMEIGVPNLKGVAVVSALFDR+
Sbjct: 485 NTKANNITVGLDGLKTVCLASKLPVVAIGGINASNARTVMEIGVPNLKGVAVVSALFDRE 544

BLAST of Cp4.1LG01g16710 vs. NCBI nr
Match: gi|731411633|ref|XP_010658058.1| (PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform X2 [Vitis vinifera])

HSP 1 Score: 741.9 bits (1914), Expect = 7.5e-211
Identity = 378/499 (75.75%), Positives = 425/499 (85.17%), Query Frame = 1

Query: 35  DRYEMRIPHVLSVAGSDSGAGAGIQADLKTCAARGVYCSTVITAVTAQNTVGVQDVNIMP 94
           D  +M+IPHVL+VAGSDSGAGAGIQADLK CAARGVYCSTVITAVTAQNTVGVQ VNI+P
Sbjct: 11  DDSKMKIPHVLTVAGSDSGAGAGIQADLKACAARGVYCSTVITAVTAQNTVGVQGVNIVP 70

Query: 95  EGFVSEQLKSVLSDMQVDVVKTGMLPSTGIIQVIRQRLKEFPVQALVVDPVMVSTSGDVL 154
           E FV+EQLKSVLSDM VDVVKTGMLP+ GI++V+   LKEFPVQALVVDPVMVSTSGDVL
Sbjct: 71  EDFVAEQLKSVLSDMHVDVVKTGMLPTIGIVKVLHHSLKEFPVQALVVDPVMVSTSGDVL 130

Query: 155 ACPTIISVLQDELLPMADLVTPNLKEASALLGG--------------------------- 214
           A P+I++  ++ELLPMAD+VTPNLKEASALLGG                           
Sbjct: 131 AGPSILAAFREELLPMADIVTPNLKEASALLGGLQLETVSDMCTAAKLIHDMGPRNVLVK 190

Query: 215 -GDLPDSLDAVDIFFDGKDLHELRSSRITTRNTHGTGCSLASCIAAELAKGSSMFSAVKI 274
            GDLP SLDAVDIFFDG D +ELRSSRI TRNTHGTGC+LASCIAAELAKGS + SAVK 
Sbjct: 191 GGDLPSSLDAVDIFFDGDDFYELRSSRIKTRNTHGTGCTLASCIAAELAKGSQILSAVKA 250

Query: 275 SKQFIERALSYSKDISIGNGPQGPFDHLCRLKSREQSSYRQGSFNQSDLFLYAVTDSGMN 334
           +K +IE AL YSKDI+IGNG QGPFDHL +LKS  ++S+R+ +FN ++LFLYAVTDSGMN
Sbjct: 251 AKHYIETALDYSKDIAIGNGFQGPFDHLLKLKSNIRNSFRKQAFNPANLFLYAVTDSGMN 310

Query: 335 KRWDRSITDAVKAAVEGGATIIQIREKDAKTHDFLEAAKSCIEICHTHGVPLLINDRIDV 394
           K+W RSIT+AVKAA+EGGATI+Q+REKDA+T DFLEAAK+C+EICH+HGVPLLINDRIDV
Sbjct: 311 KKWGRSITEAVKAAIEGGATIVQLREKDAETRDFLEAAKACVEICHSHGVPLLINDRIDV 370

Query: 395 ALACGADGVHIGQSDIPVHAARSLLGPDKIIGVSCKTPEQAEQAWLDGADYIGCGGVYPT 454
           ALAC ADGVH+GQSDIP    R+LLGP+KIIGVSCKTPEQAE+AW+DGADYIGCGGVYPT
Sbjct: 371 ALACDADGVHVGQSDIPARVVRTLLGPEKIIGVSCKTPEQAEKAWIDGADYIGCGGVYPT 430

Query: 455 NTKANNLTVGLDGLKRVCLASKLPVVAIGGINHSNAAAVMEIGVPNLKGVAVVSALFDRQ 506
           NTKANN+TVGLDGLK VCLASKLPVVAIGGIN SNA  VMEIGVPNLKGVAVVSALFDR+
Sbjct: 431 NTKANNITVGLDGLKTVCLASKLPVVAIGGINASNARTVMEIGVPNLKGVAVVSALFDRE 490

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
TPS1L_ARATH	1.6e-189	69.45	Thiamine biosynthetic bifunctional enzyme TH1, chloroplastic OS=Arabidopsis thal...
TPS1_BRANA	1.9e-182	66.80	Thiamine biosynthetic bifunctional enzyme BTH1, chloroplastic OS=Brassica napus ...
TPS1_ORYSJ	2.0e-179	63.80	Probable thiamine biosynthetic bifunctional enzyme, chloroplastic OS=Oryza sativ...
THID_RHIME	1.4e-44	45.38	Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase OS=Rhizobium meliloti (st...
THIED_GEOSL	7.1e-44	45.02	Thiamine biosynthesis bifunctional protein ThiED OS=Geobacter sulfurreducens (st...
Match Name	E-value	Identity	Description
D7TGZ0_VITVI	1.5e-210	76.16	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0035g00320 PE=3 SV=...
A0A067EGP5_CITSI	4.6e-207	74.70	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g010260mg PE=3 SV=1
M5XRD0_PRUPE	7.8e-207	74.20	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003609mg PE=3 SV=1
V4T108_9ROSI	8.7e-206	74.50	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000782mg PE=3 SV=1
V4T678_9ROSI	8.7e-206	74.50	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000782mg PE=3 SV=1
Match Name	E-value	Identity	Description
AT1G22940.1	9.2e-191	69.45	thiamin biosynthesis protein, putative
Match Name	E-value	Identity	Description
gi|659120551|ref|XP_008460243.1|	9.7e-251	84.93	PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
gi|449445302|ref|XP_004140412.1|	8.0e-245	83.43	PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic [Cucumis...
gi|659120553|ref|XP_008460244.1|	2.4e-241	84.93	PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
gi|731411629|ref|XP_010658056.1|	7.5e-211	75.75	PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
gi|731411633|ref|XP_010658058.1|	7.5e-211	75.75	PREDICTED: thiamine biosynthetic bifunctional enzyme TH1, chloroplastic isoform ...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0003824	catalytic activity
GO:0004789	thiamine-phosphate diphosphorylase activity
Vocabulary: Biological Process
Term	Definition
GO:0009228	thiamine biosynthetic process
Vocabulary: INTERPRO
Term	Definition
IPR022998	ThiamineP_synth_TenI
IPR013785	Aldolase_TIM
IPR013749	PM/HMP-P_kinase-1
IPR003733	Thiamine phosphate synthase
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0016310	phosphorylation
biological_process	GO:0009228	thiamine biosynthetic process
cellular_component	GO:0009570	chloroplast stroma
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0008902	hydroxymethylpyrimidine kinase activity
molecular_function	GO:0008972	phosphomethylpyrimidine kinase activity
molecular_function	GO:0004789	thiamine-phosphate diphosphorylase activity
molecular_function	GO:0003824	catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG01g16710.1	Cp4.1LG01g16710.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003733	Thiamine phosphate synthase	HAMAP	MF_00097	TMP_synthase	coord: 294..501; score: 26
IPR003733	Thiamine phosphate synthase	PFAM	PF02581	TMP-TENI	coord: 297..482; score: 9.7
IPR003733	Thiamine phosphate synthase	TIGRFAMs	TIGR00693	TIGR00693	coord: 297..486; score: 5.2
IPR013749	Pyridoxamine kinase/Phosphomethylpyrimidine kinase	PFAM	PF08543	Phos_pyr_kin	coord: 51..186; score: 1.1
IPR013785	Aldolase-type TIM barrel	GENE3D	G3DSA:3.20.20.70		coord: 290..498; score: 7.3
IPR022998	Thiamin phosphate synthase superfamily	unknown	SSF51391	Thiamin phosphate synthase	coord: 293..495; score: 6.8
None	No IPR available	PANTHER	PTHR20858	PHOSPHOMETHYLPYRIMIDINE KINASE	coord: 28..313; score: 2.2E
None	No IPR available	PANTHER	PTHR20858:SF17	HYDROXYMETHYLPYRIMIDINE/PHOSPHOMETHYLPYRIMIDINE KINASE THI20-RELATED	coord: 28..313; score: 2.2E
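
The coordinates above place the kinase-type Pfam hit (PF08543, 51..186) in the N-terminal part of the protein and the thiamine phosphate synthase hits (around 290..501) in the C-terminal part. The sketch below checks mechanically which hits overlap. `HITS`, `span` and `overlaps` are hypothetical names, and the coordinates are treated as closed residue intervals, which is an assumption about the listing.

```python
# (source term, start, end) copied from the coordinates listed above.
HITS = [
    ("MF_00097", 294, 501),
    ("PF02581", 297, 482),
    ("TIGR00693", 297, 486),
    ("PF08543", 51, 186),
    ("G3DSA:3.20.20.70", 290, 498),
    ("SSF51391", 293, 495),
    ("PTHR20858", 28, 313),
]


def span(term):
    """Look up the (start, end) interval for a source term."""
    for name, start, end in HITS:
        if name == term:
            return (start, end)
    raise KeyError(term)


def overlaps(a, b):
    """True if two closed intervals share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Kinase-type hit vs. thiamine phosphate synthase hit: disjoint regions.
print(overlaps(span("PF08543"), span("PF02581")))
# HAMAP and Pfam hits for the synthase region: overlapping.
print(overlaps(span("MF_00097"), span("PF02581")))
```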

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG01g16710	CmoCh04G019850	Cucurbita moschata (Rifu)	cmocpeB673
Cp4.1LG01g16710	Csa5G262270	Cucumber (Chinese Long) v2	cpecuB430
Cp4.1LG01g16710	CSPI05G12990	Wild cucumber (PI 183967)	cpecpiB428
Cp4.1LG01g16710	CsaV3_5G012850	Cucumber (Chinese Long) v3	cpecucB0555
Cp4.1LG01g16710	CsGy5G009580	Cucumber (Gy14) v2	cgybcpeB659
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cp4.1LG01g16710	Cucurbita maxima (Rimu)	cmacpeB720
Cp4.1LG01g16710	Bottle gourd (USVL1VR-Ls)	cpelsiB319
Cp4.1LG01g16710	Watermelon (Charleston Gray)	cpewcgB365
Cp4.1LG01g16710	Watermelon (Charleston Gray)	cpewcgB369
Cp4.1LG01g16710	Watermelon (97103) v1	cpewmB455