ClCG03G004230 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG03G004230
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionAICAR transformylase
LocationCG_Chr03: 4564707 .. 4572322 (-)
RNA-Seq ExpressionClCG03G004230
SyntenyClCG03G004230
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCTCCTCACACATCCACATCCGGTTGTTTCCGCTAACACTCGCCGGCCGTTCACTTTGGCGATACCTGATCTTCAAAACTTTGAATCTTCACTTCACCACTGCAGAATTTCCACAAACACAACGATTCCTCATCTCTTATGGTGAAAATAATTGCCTGAGATATTGCAATTTTAGGTATGCTTCCAACAAGTTCTCTAAATAAAGTTTTGGTGTATCTCTAAGATTGTTTGTTCCTCTTCCAAATGCCTATTTATAAACCTCATTTTCTTCCGTTTTTACACTCTCAAATCCAATATCTATTTCTCTATTAAAAACTCAAAATTTTCTTGTTTTTCTCTCACACCCGTACATAAATGTCTTTGGGTTAGATTGGGTTAACCCGAATATAAAGTTAGTCAGATTATAGTTTGGTTTTTAGTTTTGTAAAGAAGATCCACCCAACTCAACCCATATACTCCACTAATTATTTAATACATATAATCATGTCTTTCTTAGTTTACAACTTAAAAAAAAAATAAAAGGTAATTAGAAAGGAAGGAGAAAAAAAATTAAAATTTGTTCCACATAAACTCAATTCTATTAAGTTTGTAACTTTAAACATCATAAAGGATATGTGAAGGAGAAAATTATTTTCTTCTTTTACAATACATGGAGATTTGAAAATCAAAACATAGTCTAACTCTATGATTTTTCCTTTTTGAACCAAGGATTCTAACCAACTACTCTCAAAAGAGAATATAAAAAAAATACAAAATAGAATATCCTTTAAAAACTTTCAAAGTCGAAATAATATGAGCAAAAAATACAATTTTTTTTAATGAAGACTCGAATACTGACAAAAGAGATCTTCCTATTGAGATGTTTAACCGCCTCTTATGTCAATTACCATTGACCTAAACTTGCTTAGTTATTTCAGTTTATCTAAGCCCTATGAATTAAATTATATTTATGTGGGCTGAAAAATACATGTGATATTAGAAAGGAATCCATGAAAAATAAATAAATAAAAATATTTGTTTCTTATTTTATTTTTTATTTTTTTGAGAAGCGTGAAATGGCAAATGTATCATCTGCTTAAAAATTGTCCATTTTTGTAGACACTACTAAAAATCTGGCTGCCATAACCCTAACTCCGTGGCCCCGCCTCATTGACAGTGAAGTAGACGGTAAGCGTATCACCCTCCTCCGAAGAGTTTCAACTTTCAGGGTTCCTTCAACACAACTCTACTTTACGGGTAGTTTTTGATTGATTATCTTCAACCTCCATTTCTCATTTTCGTACATGGCTTTTCTTAATTCCTTCGTCGGGCGGTTGTGTGGCTTGGTTTGTTGATTTATTTATTTACTTTTAGTTCAAATAAGGCTGAGGATTCGACATCATGGAGGTCATTGATGACGTTCTAATTTGACTTCAAAACTTTTTCTTAAAAAAACTCTGTTTGTGGGTGTTCTGAAACGAGTGAAAAAATCTATGTAAGAGCGATGCAAATTTGCGAGCTCTGAACTGGGAAAACAACAATGAAGTATTCAACTATTTCTCCAGTGATGCTGTACCTCTCCAACTCAGCATTTTTTTTTTCCCACTATTTCTCTTGCTTATTCTTAATATGTTTATTTGTGTTCTGTTTGAAGTCAAAGAACATATTCTCCCCCCCTATCCCCCAAACTTTCTGCAATTTAGTGGTGGAATAGGGTATACAGTGATTTATTGGTCAAGGTTTAAACTGTAGACGATTTTGATGATAAATTCCACATTGGAATTGTAATGAAGCCATAAATGAGTCCCGTTATATTGGTATTACCTTTTAGGTTGATATTATGGGATTGCGGTTGATATGTTGTGGGTACAAACGCCAATCCACATCTTTCCTATGTGAATTTTCTTGATATGTCCAAGTCAGCGTAAAGTAGACAAAAGAAAAGTTTTACACTTAATTGTCAAAGTGTGTGTTTGTTTAAATCAGAGTAATTTGTATTTTGTTACTTATGTTATATTTGTTAAATTTCTCCTATTTGTACAAGATGAGAGTCTTGTAATTGGTGTGAGTTTTATGAATAATGTTCTGAACTCCTCATATGGCATTAGAGAGAATGTTTCTGACACTAGTGGCTTGACAGGCTGACGGAAGAAAACGGTTGGTGTCAAGAAACCAAGCAACAAAAGCGAAGGAGTGACAGTGATGTTTAGGTCTGTTGCTGCTCATTCTCCTGCCACACCTATCACTGCTATTTCGTTTGGAGAACCCCGTGCTCGGTTCTTTCTTAAGGAAGCCAATCCTTCGCCTCTTCTTGTAAAGTGCAATTCACCAATTTTTGTATTTTATTTTCTTTTTAATTTTTGGTAGCAACAATCCCATATTCTGGGTTTTTTACTATTCCTTTTTTTTTTAACAGTCTTTATTCACACGTGTCTCTCTCCATCATTCTGTGCTACGTCGGCGGTGCTCTACTCTCAAAGCCATGGCTGATGGTGAAACCATCACTTTTTCTTCAAAGATCACCATACCATCTGCTTCTGGTAGTATCCTCTATTCCTTTTATATTCAACATTGTATTTTCTTTAGAAGGCTTTGAATTTTGTCTGTATGTTGTTAGTATGTTTGAGAGATGGAATTTAGTGGATAAAAGACATGAAGGGAAATTTTGGTTGTTTCCGCTAGGGAGCTAGCTGAGTTGTTCTCTACAGAAGTGAAGACTTATTTATTTTATTTAATATTATTATTATTTTTCCATTTGGAGCTTAGGTTTCAGTTTCTTTTGTTTTATGATGTAACTTGACTCAATTGGCAACAAATTTGTATTGGTTATGGTTCCATAAGAAGTTCTCTCTTTGATCTTTTTTTAAATATGGGATCTTTTTATAGAGTTGGGTGTATCAATTTGACAGTTGTTTACTATTATGTTCAATTTCAGGAAAGAAACTAGCTTTGATATCATTGTCAGACAAGAAAGATCTTGCATTTCTAGGAAATGGCCTTCAAGAATTGGGGTGAACTCTTCTAGACTTATTATGCAATTTTGTTTAAGTTCTTTGCTTGAAGATTGATCATTTCTTCTTGAAATCACTAACATGTCATCAATGAATATTTTATGAAACATTTTGTTTGGGGTGAGTTCTAAGTTATGACACATACGTTGTATTTTTGGTACGTACTCTAGATTGTGCCTTGGAAACATAGATTATATCATGCATCTTATGAACTTGCAGCTATACAATTGTTTCAACTGGGGGAACAGCATCTACACTGGAAACTTCTGGGGTTCGTGTTACTAAGGTGGAGGAGCTTACATGTTTCCCTGAAATGGTATGCTGTCTTTTCTCTTGTTTATTCAGATTCTGAATGGAAAGAGCTCCCTAAAGCTTCCTGTAATCCATGGCTGTGCTCAGTTCACCGTTTTGCTTAGATTCTAAAAACTTATTTAAAAGTCATATGTGGCATGAAAGATTGGCAGTAATTTCCTCTGAATAGATTCTTGAAGCACTATTGTATAAAGTTAGAAGTAGATGCAGTGTATTTCTACATAATCAAATGCTTCTCTTCCTTGCTTATCTTATACACTTGTTGAATAGTTTTATCTTGTTTGCCTTTTCATTTTTATTCTTGTATGAAGTTAACGTAAAATTAGGTGGAGAATTAGCTGGCCATTTTGTATATGTTAATTATAAACTTTACTTTGTGGTGGCAGCTTGATGGCCGTGTGAAAACTTTGCATCCTTCTATACATGGGGGCATTCTTGCTAGAAGAGACCAAAGGCATCATATGGACGCCTTGAAAAAACATGGAATTGGTGAGTTTTATGCTTGTGTTTTATTCACGTATTGTATCTTTGCAATCAATCTCAGGAATTCTTATGTGGGTCCTAAGTTTTAACTGCATTTCAACTGCAAGTCTGCAATGCATGGTCTTAGTTGCTGCAAAGCTTTTTCTCCTTTCTGAAATAGTTTCTCCGCAGGCACATTTGATGTTGTTGTGGTGAACTTGTATCCCTTCTATGAAAAAGTCACCTCATCTCAAGAAATTAACTTTGAAGATGGAATCGAGAACATTGATATTGGTGGTCCGGCTATGATCAGAGCTGCTGCAAAGGTAGGAATGTGCAGAACTTTGTGGGTAAATTGTGTGTAAATTATAGGCAAAGTTGATCTTGGGATTTATATTAGCTTCTCCCCGGTTTCATATGGAAGTTGGACAGAAATATGAGAACTGTTATTGATTTGTCAAATAGTACTAGGTTTCCAAATACGGTAGTGAAATTAATGAACTAGCTTGAAGCCATTTTGGGCCTAGTGACATTGGAACATGTGAAAAAAGTAATTTTGAGGAGTGAGTTCAAATCCGTGATGGCCACACACCTAAAATAATAAAATCTTATGCAGTACTTAACAATTAAATTTTGTCCTATGAAATTAGTTGCGGCGTGTTTAAGTGGGTTTGAGGACTCATGAATTCTTAAGGGAAAAAAATCAGCATAATGGCTTAAAGACCATTTCTCTAATAGGAGGGTGCATATCATGCTTTGATTTTTAGTTTAGATAACTTTCATTTCATGATGTAAAATTTAGCATTGTTTTTAGTTTACTGTCTAGAGTCTTGTATTGATATGTGGATGAACATGCACATGCAGAATCACAAGGATGTTTTGGTGGTCGTAGATACTGAAGACTATCCTGCACTGCTTGAATTTTTGAAAGGAAGTGAGGATGATCAGCAATTTCGCAGAAAACTTGCTTGGAAGGCATTTCAACATGTAGCTTCTTATGATTCTGCAGTCTCAGAATGGCTGTGGAAGCAGACTGTTGGAGGTAAGCTCCACATTTCATCTATGTCAAGTGCATAATCGCTAGGGTTGGTTCAATGATCTTTACTATGTGCATTTATTTTGTTATTTGGTCCAAGATAAAATTAAGCATGCACTTGTCAAAACTATGAAAGAGAAAAAAAAGTTGCTTTTATTAGCCTTTTTTTATTAAAAAAAATTATTATTATTATTCTCGATATAAAATACAAAATTATTTTGTTAATCTTGTGGAATCTTCGATTAAATAGTCTTCTGTGTTCGTTGGTAGACAAATTCCCTCCCAGCTTTACCGTGCCTCTTTCCCTCAAAAGTTCTCTTCGCTACGGGGAAAATCCTCACCAGAAGGCTGCCTTTTATGTTGATAAGAGTCTTTCTGAAGTCAATGCTGGTGGTATTGCTACAGCAGTCCAACACCATGGAAAGGTGAGTAAGTGCCCCTCTTCATCGATTCATTTGTGGGTTGAAGCTATGAGAAATACACACTTGTTAATTGTTGCCTTTCTGCAGTTATTTTTGTATATTTAACACAGATCCATGGTACAAAGGATTCGTTTTATCTATTATGTCTGTGCCTTCAAATTTTCTGAAACAATAACGGTAACATTGCCCTGATGAGAAGGTGGAGATGTTTGGAATTTATGATTATGATAATGATTTATTGGAAATCGGTTGTTGCATTGTGGTTGATTTATTGTTCTAGATCACTTTTCCAAGTAAGTTCTTAGATAAGATGGACCGGACCAAGTTATCTAATCATTTTTCTAATGGCTTTGCCCCTTAAAAGATAAAGTTGATTCTGTTTGCCCTCCACAGTGAGAAGTAAATTTTAATATGCGAGTAATATGTAAAAGTGTACTTGTGAATCATACTTAAAATACTCTTCGGTCAGTATTTCTAAATTCAGAAAGGCAGTATATGTAAACTGTAAAAGAGTTTCAATTTCATTTTGAAGCCTTTTCTGTTGTCTTCTTTCCATGATGATCATGGAAATGCATCATTAATATCTAAAAAGTAAGACTTCTCCTAAGTATGCACAGGGATTTCATTTATGATATTACATTCTTTAGCCTTCTATTGAAAGTTTTCACTCTTGCTCAAGAGTTCCTTTGAATGTAAAGATTGTTATTTGGCATAAAATTATCTGAATCTGCCACATTACATTTCATTCTCTTTACCTTATATGCCATTTGAGTTTCATTTGTCTTGGAAACTTTTGTTTTGTGCAGGAGATGTCATATAATAACTACCTAGATGCCGATGCAGCTTGGAATTGTGTATCAGAGTTTAGGAATCCTACCTGTGTAATTGTGAAGCACACAAATCCATGTGGTGTAGCTTCACGTGATGATATTCTGGAAGCATATAGGCTGGCTGTGAAAGCTGATCCCGTGAGTGCATTTGGTGGCATTGTTGCCTTCAACATAGAAGTTGATGAGGTTGGTATGTTTTCTTTTCTTAAAACATGTTTTGAGGACCAGTATTCTGCCTTGATTTAAGGAATTAATCCTCTGTCAAGACTCGAGTTCGATAAGACATCAATTGGACTTGAATCTATGATAATTCTATGTTAAGTATTAATTGTTCCAAGTAGCTGCTTGCCAGAGCATAACCTCCACCTTCCAAAACATCAAACCAAATTTTCGTTTACCAAGATATATTTGTGCTTCTGTTATGTAGTGATTTTTGGTTGCTATTAGATAAATTAGATGAATGAAGAAGCTAAATAGATTCAATGCTTTACTTCTACTGCATTTAGGCTCTTGCAAGGGAACTTCGGGAGTTCAGAAGCCCTACAGATGGTGAAACTCGGATGTTTTATGAGATTGTGGTTGCACCCAAGTACACAGAGAAAGGGCTTGAGATCTTGCGTGGGAAATCAAAGACACTGCGAATTCTCGAGGCAGGAAAAAATGAGAAAGGAAAACTATCACTCAGGCAAGTTGGTGGGGGGTGGTTAGCACAGGATGCTGACGATTTGGTTCCACAAGATATAAAATTTAACGTAGTTTCTGGAAAGGCTCCTCAAGAAAGTGAGCTTCGGGATGCAGAGTTTGCATGGCTGTGCGTCAAGCATGTTAAGAGCAATGCCATTGTGATAGCAAAGGTTGGTATATTCTAACCCTGAATTTTTCTTGACATGTTTGCTTTCTGTTGGGCTATTTCTGATGCTGAACCTTTTTGGCAGAATAACTGTATGTTGGGTATGGGAAGCGGGCAACCAAATCGTCTCGAGAGTTTGCGGATAGCGTTGAGGAAAGCAGGGGATGAGGTCAAAGGAGCTGCTTTGGCTAGTGATGCGTTCTTTCCATTTGGTAATTTCTGGGAATCAAATGATGACTTTAAAGTTTCTAATGTTGCACCTCCAATCTGCTACATATGTGGCATGACGGTAACATTAACGTGATCTGTTAGTCTTAGTAGTGAATTTATTCTTGCTTGCAACTTTTGGTTGGCAGCTTGGAATGACGCAGTGGAAGAAGCATGCCAGAGCGGAGTAGGTATTATTGCAGAGCCTGGGGGCAGCATCCGGGATCCCGATGCTGTGGACTGCTGCAACAAGTACGGTGTGTCTCTCGTCTTCACCAACGTGAGGCACTTCAGGCATTGACATTGACAATGATGCTCGTTAAGCCTCCCTGATTCTCCTTTGTCCCCTAAAATCTTTTAAACTTGGTTGGGACACCACAAAATATGAAGATGTTTTTTGACTTAATGCAGAGGTTTGACGTGGTTGTAAAATAAGTAATTCAAGTCGTCGCTTGAAGTAGGTTAAGATGTATCCTATAACTTTTAATAATAATGAAACGCCTCTG

mRNA sequence

CTCTCCTCACACATCCACATCCGGTTGTTTCCGCTAACACTCGCCGGCCGTTCACTTTGGCGATACCTGATCTTCAAAACTTTGAATCTTCACTTCACCACTGCAGAATTTCCACAAACACAACGATTCCTCATCTCTTATGGTGAAAATAATTGCCTGAGATATTGCAATTTTAGACACTACTAAAAATCTGGCTGCCATAACCCTAACTCCGTGGCCCCGCCTCATTGACAGTGAAGTAGACGGTAAGCGTATCACCCTCCTCCGAAGAGTTTCAACTTTCAGGGTTCCTTCAACACAACTCTACTTTACGGGCTGACGGAAGAAAACGGTTGGTGTCAAGAAACCAAGCAACAAAAGCGAAGGAGTGACAGTGATGTTTAGGTCTGTTGCTGCTCATTCTCCTGCCACACCTATCACTGCTATTTCGTTTGGAGAACCCCGTGCTCGGTTCTTTCTTAAGGAAGCCAATCCTTCGCCTCTTCTTTCTTTATTCACACGTGTCTCTCTCCATCATTCTGTGCTACGTCGGCGGTGCTCTACTCTCAAAGCCATGGCTGATGGTGAAACCATCACTTTTTCTTCAAAGATCACCATACCATCTGCTTCTGGTAGAAAGAAACTAGCTTTGATATCATTGTCAGACAAGAAAGATCTTGCATTTCTAGGAAATGGCCTTCAAGAATTGGGCTATACAATTGTTTCAACTGGGGGAACAGCATCTACACTGGAAACTTCTGGGGTTCGTGTTACTAAGGTGGAGGAGCTTACATGTTTCCCTGAAATGCTTGATGGCCGTGTGAAAACTTTGCATCCTTCTATACATGGGGGCATTCTTGCTAGAAGAGACCAAAGGCATCATATGGACGCCTTGAAAAAACATGGAATTGGCACATTTGATGTTGTTGTGGTGAACTTGTATCCCTTCTATGAAAAAGTCACCTCATCTCAAGAAATTAACTTTGAAGATGGAATCGAGAACATTGATATTGGTGGTCCGGCTATGATCAGAGCTGCTGCAAAGAATCACAAGGATGTTTTGGTGGTCGTAGATACTGAAGACTATCCTGCACTGCTTGAATTTTTGAAAGGAAGTGAGGATGATCAGCAATTTCGCAGAAAACTTGCTTGGAAGGCATTTCAACATGTAGCTTCTTATGATTCTGCAGTCTCAGAATGGCTGTGGAAGCAGACTGTTGGAGACAAATTCCCTCCCAGCTTTACCGTGCCTCTTTCCCTCAAAAGTTCTCTTCGCTACGGGGAAAATCCTCACCAGAAGGCTGCCTTTTATGTTGATAAGAGTCTTTCTGAAGTCAATGCTGGTGGTATTGCTACAGCAGTCCAACACCATGGAAAGGAGATGTCATATAATAACTACCTAGATGCCGATGCAGCTTGGAATTGTGTATCAGAGTTTAGGAATCCTACCTGTGTAATTGTGAAGCACACAAATCCATGTGGTGTAGCTTCACGTGATGATATTCTGGAAGCATATAGGCTGGCTGTGAAAGCTGATCCCGTGAGTGCATTTGGTGGCATTGTTGCCTTCAACATAGAAGTTGATGAGGCTCTTGCAAGGGAACTTCGGGAGTTCAGAAGCCCTACAGATGGTGAAACTCGGATGTTTTATGAGATTGTGGTTGCACCCAAGTACACAGAGAAAGGGCTTGAGATCTTGCGTGGGAAATCAAAGACACTGCGAATTCTCGAGGCAGGAAAAAATGAGAAAGGAAAACTATCACTCAGGCAAGTTGGTGGGGGGTGGTTAGCACAGGATGCTGACGATTTGGTTCCACAAGATATAAAATTTAACGTAGTTTCTGGAAAGGCTCCTCAAGAAAGTGAGCTTCGGGATGCAGAGTTTGCATGGCTGTGCGTCAAGCATGTTAAGAGCAATGCCATTGTGATAGCAAAGAATAACTGTATGTTGGGTATGGGAAGCGGGCAACCAAATCGTCTCGAGAGTTTGCGGATAGCGTTGAGGAAAGCAGGGGATGAGGTCAAAGGAGCTGCTTTGGCTAGTGATGCGTTCTTTCCATTTGGTAATTTCTGGGAATCAAATGATGACTTTAAAGTTTCTAATGTTGCACCTCCAATCTGCTACATATCTTGGAATGACGCAGTGGAAGAAGCATGCCAGAGCGGAGTAGGTATTATTGCAGAGCCTGGGGGCAGCATCCGGGATCCCGATGCTGTGGACTGCTGCAACAAGTACGGTGTGTCTCTCGTCTTCACCAACGTGAGGCACTTCAGGCATTGACATTGACAATGATGCTCGTTAAGCCTCCCTGATTCTCCTTTGTCCCCTAAAATCTTTTAAACTTGGTTGGGACACCACAAAATATGAAGATGTTTTTTGACTTAATGCAGAGGTTTGACGTGGTTGTAAAATAAGTAATTCAAGTCGTCGCTTGAAGTAGGTTAAGATGTATCCTATAACTTTTAATAATAATGAAACGCCTCTG

Coding sequence (CDS)

ATGTTTAGGTCTGTTGCTGCTCATTCTCCTGCCACACCTATCACTGCTATTTCGTTTGGAGAACCCCGTGCTCGGTTCTTTCTTAAGGAAGCCAATCCTTCGCCTCTTCTTTCTTTATTCACACGTGTCTCTCTCCATCATTCTGTGCTACGTCGGCGGTGCTCTACTCTCAAAGCCATGGCTGATGGTGAAACCATCACTTTTTCTTCAAAGATCACCATACCATCTGCTTCTGGTAGAAAGAAACTAGCTTTGATATCATTGTCAGACAAGAAAGATCTTGCATTTCTAGGAAATGGCCTTCAAGAATTGGGCTATACAATTGTTTCAACTGGGGGAACAGCATCTACACTGGAAACTTCTGGGGTTCGTGTTACTAAGGTGGAGGAGCTTACATGTTTCCCTGAAATGCTTGATGGCCGTGTGAAAACTTTGCATCCTTCTATACATGGGGGCATTCTTGCTAGAAGAGACCAAAGGCATCATATGGACGCCTTGAAAAAACATGGAATTGGCACATTTGATGTTGTTGTGGTGAACTTGTATCCCTTCTATGAAAAAGTCACCTCATCTCAAGAAATTAACTTTGAAGATGGAATCGAGAACATTGATATTGGTGGTCCGGCTATGATCAGAGCTGCTGCAAAGAATCACAAGGATGTTTTGGTGGTCGTAGATACTGAAGACTATCCTGCACTGCTTGAATTTTTGAAAGGAAGTGAGGATGATCAGCAATTTCGCAGAAAACTTGCTTGGAAGGCATTTCAACATGTAGCTTCTTATGATTCTGCAGTCTCAGAATGGCTGTGGAAGCAGACTGTTGGAGACAAATTCCCTCCCAGCTTTACCGTGCCTCTTTCCCTCAAAAGTTCTCTTCGCTACGGGGAAAATCCTCACCAGAAGGCTGCCTTTTATGTTGATAAGAGTCTTTCTGAAGTCAATGCTGGTGGTATTGCTACAGCAGTCCAACACCATGGAAAGGAGATGTCATATAATAACTACCTAGATGCCGATGCAGCTTGGAATTGTGTATCAGAGTTTAGGAATCCTACCTGTGTAATTGTGAAGCACACAAATCCATGTGGTGTAGCTTCACGTGATGATATTCTGGAAGCATATAGGCTGGCTGTGAAAGCTGATCCCGTGAGTGCATTTGGTGGCATTGTTGCCTTCAACATAGAAGTTGATGAGGCTCTTGCAAGGGAACTTCGGGAGTTCAGAAGCCCTACAGATGGTGAAACTCGGATGTTTTATGAGATTGTGGTTGCACCCAAGTACACAGAGAAAGGGCTTGAGATCTTGCGTGGGAAATCAAAGACACTGCGAATTCTCGAGGCAGGAAAAAATGAGAAAGGAAAACTATCACTCAGGCAAGTTGGTGGGGGGTGGTTAGCACAGGATGCTGACGATTTGGTTCCACAAGATATAAAATTTAACGTAGTTTCTGGAAAGGCTCCTCAAGAAAGTGAGCTTCGGGATGCAGAGTTTGCATGGCTGTGCGTCAAGCATGTTAAGAGCAATGCCATTGTGATAGCAAAGAATAACTGTATGTTGGGTATGGGAAGCGGGCAACCAAATCGTCTCGAGAGTTTGCGGATAGCGTTGAGGAAAGCAGGGGATGAGGTCAAAGGAGCTGCTTTGGCTAGTGATGCGTTCTTTCCATTTGGTAATTTCTGGGAATCAAATGATGACTTTAAAGTTTCTAATGTTGCACCTCCAATCTGCTACATATCTTGGAATGACGCAGTGGAAGAAGCATGCCAGAGCGGAGTAGGTATTATTGCAGAGCCTGGGGGCAGCATCCGGGATCCCGATGCTGTGGACTGCTGCAACAAGTACGGTGTGTCTCTCGTCTTCACCAACGTGAGGCACTTCAGGCATTGA

Protein sequence

MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLHHSVLRRRCSTLKAMADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGSEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYGVSLVFTNVRHFRH
Homology
BLAST of ClCG03G004230 vs. NCBI nr
Match: XP_038876982.1 (bifunctional purine biosynthesis protein PurH [Benincasa hispida] >XP_038876983.1 bifunctional purine biosynthesis protein PurH [Benincasa hispida] >XP_038876985.1 bifunctional purine biosynthesis protein PurH [Benincasa hispida] >XP_038876986.1 bifunctional purine biosynthesis protein PurH [Benincasa hispida])

HSP 1 Score: 1179.9 bits (3051), Expect = 0.0e+00
Identity = 590/627 (94.10%), Postives = 597/627 (95.22%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLHHSVLRRRCSTLKAM 60
           MFRSVAAHSPATPITAISFGEPRAR FLKEANPSPLLS+FT V LHHSVLRRRCSTLKAM
Sbjct: 1   MFRSVAAHSPATPITAISFGEPRARLFLKEANPSPLLSIFTSVPLHHSVLRRRCSTLKAM 60

Query: 61  ADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET 120
           ADGETITFSSKITIPSASG KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET
Sbjct: 61  ADGETITFSSKITIPSASG-KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET 120

Query: 121 SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN 180
           SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN
Sbjct: 121 SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN 180

Query: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240
           LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVD+EDYPALLEFLKGS
Sbjct: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDSEDYPALLEFLKGS 240

Query: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300
           +DDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ
Sbjct: 241 KDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300

Query: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP 360
           KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP
Sbjct: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP 360

Query: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEI 420
           CG ASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDE LARELREFRSPTDGETRMFYEI
Sbjct: 361 CGAASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEVLARELREFRSPTDGETRMFYEI 420

Query: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNV 480
           VVAPKYTEKGLEILRGKSKTLRILEA +NEKGKLSLRQVGGGWLAQDADDLVPQDIK NV
Sbjct: 421 VVAPKYTEKGLEILRGKSKTLRILEASRNEKGKLSLRQVGGGWLAQDADDLVPQDIKLNV 480

Query: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540
           VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD
Sbjct: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540

Query: 541 EVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGS 600
           EVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGGS
Sbjct: 541 EVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGGS 600

Query: 601 IRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           IRDPDA+DCCNKYGVSLVFTNVRHFRH
Sbjct: 601 IRDPDAIDCCNKYGVSLVFTNVRHFRH 604

BLAST of ClCG03G004230 vs. NCBI nr
Match: XP_008440494.1 (PREDICTED: bifunctional purine biosynthesis protein PurH [Cucumis melo] >XP_008440502.1 PREDICTED: bifunctional purine biosynthesis protein PurH [Cucumis melo] >XP_008440519.1 PREDICTED: bifunctional purine biosynthesis protein PurH [Cucumis melo] >XP_016899930.1 PREDICTED: bifunctional purine biosynthesis protein PurH [Cucumis melo])

HSP 1 Score: 1166.8 bits (3017), Expect = 0.0e+00
Identity = 583/627 (92.98%), Postives = 594/627 (94.74%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLHHSVLRRRCSTLKAM 60
           MFRSV AHSPATPITAIS GEPRAR FLKEANP PL+SLFTRVSLHH +LR+RCSTLKAM
Sbjct: 1   MFRSVVAHSPATPITAISSGEPRARLFLKEANPLPLISLFTRVSLHHCLLRQRCSTLKAM 60

Query: 61  ADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET 120
           ADGETITFSSK+TIPSASG KKLALISLSDKK+LAFLGNGLQELGYTIVSTGGTASTLE+
Sbjct: 61  ADGETITFSSKLTIPSASG-KKLALISLSDKKNLAFLGNGLQELGYTIVSTGGTASTLES 120

Query: 121 SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN 180
           SGV VTKVEE+TCFPEMLDGRVKTLHPSIHGGILARRDQ HHMDALKKHGIGTFDVVVVN
Sbjct: 121 SGVHVTKVEEVTCFPEMLDGRVKTLHPSIHGGILARRDQGHHMDALKKHGIGTFDVVVVN 180

Query: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240
           LYPFYEKVTSSQ INFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS
Sbjct: 181 LYPFYEKVTSSQGINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240

Query: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300
           EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ
Sbjct: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300

Query: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP 360
           KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEF NPTCVIVKHTNP
Sbjct: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFSNPTCVIVKHTNP 360

Query: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEI 420
           CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDE LARELREFRSPTDGETRMFYEI
Sbjct: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDETLARELREFRSPTDGETRMFYEI 420

Query: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNV 480
           VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQD+DDLVPQDIKFNV
Sbjct: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDSDDLVPQDIKFNV 480

Query: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540
           VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD
Sbjct: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540

Query: 541 EVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGS 600
           EVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGGS
Sbjct: 541 EVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGGS 600

Query: 601 IRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           IRDPDA+DCCNKYGVSL+FTNVRHFRH
Sbjct: 601 IRDPDAIDCCNKYGVSLIFTNVRHFRH 604

BLAST of ClCG03G004230 vs. NCBI nr
Match: XP_011652409.1 (uncharacterized protein LOC101206006 [Cucumis sativus] >XP_011652413.1 uncharacterized protein LOC101206006 [Cucumis sativus] >XP_031740461.1 uncharacterized protein LOC101206006 [Cucumis sativus] >XP_031740464.1 uncharacterized protein LOC101206006 [Cucumis sativus] >XP_031740469.1 uncharacterized protein LOC101206006 [Cucumis sativus] >KGN64465.1 hypothetical protein Csa_013868 [Cucumis sativus])

HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 582/627 (92.82%), Postives = 594/627 (94.74%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLHHSVLRRRCSTLKAM 60
           MFRSV AHSPATPITAIS GEPRA  FLKEANP PL+SLFTRVSLHHS+LR+RCSTLKAM
Sbjct: 1   MFRSVVAHSPATPITAISSGEPRAPLFLKEANPLPLISLFTRVSLHHSLLRQRCSTLKAM 60

Query: 61  ADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET 120
           ADGETI FSSK+TIPSASG KKLALISLSDKK+LAFLGNGLQELGYTIVSTGGTASTLE+
Sbjct: 61  ADGETIAFSSKLTIPSASG-KKLALISLSDKKNLAFLGNGLQELGYTIVSTGGTASTLES 120

Query: 121 SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN 180
           SGV VTKVEE+TCFPEMLDGRVKTLHPSIHGGILARRDQ HHMDALKKHGIGTFDVVVVN
Sbjct: 121 SGVHVTKVEEVTCFPEMLDGRVKTLHPSIHGGILARRDQGHHMDALKKHGIGTFDVVVVN 180

Query: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240
           LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS
Sbjct: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240

Query: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300
           EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPL+LKSSLRYGENPHQ
Sbjct: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLALKSSLRYGENPHQ 300

Query: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP 360
           KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEF NPTCVIVKHTNP
Sbjct: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFSNPTCVIVKHTNP 360

Query: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEI 420
           CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDE LARELREFRSPTDGETRMFYEI
Sbjct: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDETLARELREFRSPTDGETRMFYEI 420

Query: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNV 480
           VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQD+DDLVPQDIKFNV
Sbjct: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDSDDLVPQDIKFNV 480

Query: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540
           VSGKAPQE+ELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD
Sbjct: 481 VSGKAPQENELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540

Query: 541 EVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGS 600
           EVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGGS
Sbjct: 541 EVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGGS 600

Query: 601 IRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           IRDPDA+DCCNKYGVSLVFTNVRHFRH
Sbjct: 601 IRDPDAIDCCNKYGVSLVFTNVRHFRH 604

BLAST of ClCG03G004230 vs. NCBI nr
Match: KAG6584189.1 (hypothetical protein SDJN03_20121, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1142.5 bits (2954), Expect = 0.0e+00
Identity = 570/628 (90.76%), Postives = 586/628 (93.31%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLH-HSVLRRRCSTLKA 60
           M  SV AHSPATPITAIS GEPRARFFLKE NPSPLL+ F+R SLH  SV RR C T K 
Sbjct: 99  MLSSVVAHSPATPITAISLGEPRARFFLKEPNPSPLLTSFSRNSLHTQSVQRRPCFTFKV 158

Query: 61  MADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120
           MADGETIT+SSKIT PS SG KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE
Sbjct: 159 MADGETITYSSKITFPSGSG-KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 218

Query: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVV 180
           TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGI TFDVVVV
Sbjct: 219 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGISTFDVVVV 278

Query: 181 NLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG 240
           NLYPFYEKVTSS+ +NFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPA+LEFLKG
Sbjct: 279 NLYPFYEKVTSSRNLNFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPAMLEFLKG 338

Query: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300
           SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH
Sbjct: 339 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 398

Query: 301 QKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTN 360
           QKAAFYVDKSLSEVNAGGIATA+QHHGKEMSYNNYLDADAAWNCVSEFRNPTCV+VKHTN
Sbjct: 399 QKAAFYVDKSLSEVNAGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVVVKHTN 458

Query: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYE 420
           PCGVASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDEALARE+REFRSPTDGETRMFYE
Sbjct: 459 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEALAREIREFRSPTDGETRMFYE 518

Query: 421 IVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFN 480
           IVVAPKYT+KGLEILRGKSKTLRILEA KNEKGKLSLRQVGGGWLAQD+DDLVPQDI+FN
Sbjct: 519 IVVAPKYTKKGLEILRGKSKTLRILEAAKNEKGKLSLRQVGGGWLAQDSDDLVPQDIQFN 578

Query: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAG 540
           VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIA+RKAG
Sbjct: 579 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIAMRKAG 638

Query: 541 DEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGG 600
           DEVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGG
Sbjct: 639 DEVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGG 698

Query: 601 SIRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           SIRDPDA+DCCNKYGVSLVFTNVRHFRH
Sbjct: 699 SIRDPDAIDCCNKYGVSLVFTNVRHFRH 703

BLAST of ClCG03G004230 vs. NCBI nr
Match: KAG7019781.1 (purH [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1142.5 bits (2954), Expect = 0.0e+00
Identity = 570/628 (90.76%), Postives = 586/628 (93.31%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLH-HSVLRRRCSTLKA 60
           M  SV AHSPATPITAIS GEPRARFFLKE NPSPLL+ F+R SLH  SV RR C T K 
Sbjct: 1   MLSSVVAHSPATPITAISLGEPRARFFLKEPNPSPLLTSFSRNSLHTQSVQRRPCFTFKV 60

Query: 61  MADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120
           MADGETIT+SSKIT PS SG KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE
Sbjct: 61  MADGETITYSSKITFPSGSG-KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120

Query: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVV 180
           TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGI TFDVVVV
Sbjct: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGISTFDVVVV 180

Query: 181 NLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG 240
           NLYPFYEKVTSS+ +NFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPA+LEFLKG
Sbjct: 181 NLYPFYEKVTSSRNLNFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPAMLEFLKG 240

Query: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300
           SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH
Sbjct: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300

Query: 301 QKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTN 360
           QKAAFYVDKSLSEVNAGGIATA+QHHGKEMSYNNYLDADAAWNCVSEFRNPTCV+VKHTN
Sbjct: 301 QKAAFYVDKSLSEVNAGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVVVKHTN 360

Query: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYE 420
           PCGVASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDEALARE+REFRSPTDGETRMFYE
Sbjct: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEALAREIREFRSPTDGETRMFYE 420

Query: 421 IVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFN 480
           IVVAPKYT+KGLEILRGKSKTLRILEA KNEKGKLSLRQVGGGWLAQD+DDLVPQDI+FN
Sbjct: 421 IVVAPKYTKKGLEILRGKSKTLRILEAAKNEKGKLSLRQVGGGWLAQDSDDLVPQDIQFN 480

Query: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAG 540
           VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIA+RKAG
Sbjct: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIAMRKAG 540

Query: 541 DEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGG 600
           DEVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGG
Sbjct: 541 DEVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGG 600

Query: 601 SIRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           SIRDPDA+DCCNKYGVSLVFTNVRHFRH
Sbjct: 601 SIRDPDAIDCCNKYGVSLVFTNVRHFRH 605

BLAST of ClCG03G004230 vs. ExPASy Swiss-Prot
Match: A9VRF5 (Bifunctional purine biosynthesis protein PurH OS=Bacillus mycoides (strain KBAB4) OX=315730 GN=purH PE=3 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 2.0e-131
Identity = 257/550 (46.73%), Postives = 342/550 (62.18%), Query Frame = 0

Query: 81  KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFPEMLDG 140
           KK AL+S+SDK  +     GL E G  ++STGGT   LE +G++V  + E+T FPE++DG
Sbjct: 2   KKRALVSVSDKTGVVEFVKGLLEQGIEVISTGGTKKLLEENGLQVIGISEVTGFPEIMDG 61

Query: 141 RVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEINFEDGI 200
           RVKTLHP+IHGG+LA RD   H+  + + GI   D VVVNLYPF E + +  ++ F D I
Sbjct: 62  RVKTLHPNIHGGLLAVRDNEMHVAQMNELGIQPIDFVVVNLYPFKETI-AKPDVTFADAI 121

Query: 201 ENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLK-GSEDDQQFRRKLAWKAFQHVA 260
           ENIDIGGP MIR+AAKNHK V V+VD  DY  +L  LK   E  ++ +RKLA K F+H A
Sbjct: 122 ENIDIGGPTMIRSAAKNHKFVSVIVDPVDYDVVLAELKENGEVTEETKRKLAAKVFRHTA 181

Query: 261 SYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVNAGGIA 320
           +YD+ +S +L KQ +G++ P + TV    K  LRYGENPHQKA FY            +A
Sbjct: 182 AYDALISNYLTKQ-MGEESPETVTVTFEKKQDLRYGENPHQKATFY---KAPFAATSSVA 241

Query: 321 TAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYRLAVKA 380
            A Q HGKE+SYNN  DADAA + V EF  P  V VKH NPCGV    DI EAY  A +A
Sbjct: 242 YAEQLHGKELSYNNINDADAALSIVKEFTEPAVVAVKHMNPCGVGVGADIHEAYTRAYEA 301

Query: 381 DPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEILRGKSK 440
           DPVS FGGI+A N E+D+A A +L E          +F EIV+AP ++++ LE+L+ K K
Sbjct: 302 DPVSIFGGIIAANREIDKATAEKLHE----------IFLEIVIAPSFSQEALEVLQSK-K 361

Query: 441 TLRILEAG--KNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDAEFA 500
            LR+L     K       L  V GG L Q+ D L   +   ++ + + P E E +D + A
Sbjct: 362 NLRLLTVNIEKATSASKKLTSVQGGLLVQEEDTLSLDEDAISIPTKREPSEQEWKDLKLA 421

Query: 501 WLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFPFGN 560
           W  VKHVKSNAIV+A +N  +G+G+GQ NR+ S +IA+ +AG++ +G+ALASDAFFP   
Sbjct: 422 WKVVKHVKSNAIVLANDNMTVGVGAGQMNRVGSAKIAITQAGEKAQGSALASDAFFPM-- 481

Query: 561 FWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYGVSL 620
                                  D VEEA ++G+  I +PGGSIRD D++   + YG+++
Sbjct: 482 ----------------------PDTVEEAAKAGITAIIQPGGSIRDEDSIKMADAYGITM 511

Query: 621 VFTNVRHFRH 628
           VFT VRHF+H
Sbjct: 542 VFTGVRHFKH 511

BLAST of ClCG03G004230 vs. ExPASy Swiss-Prot
Match: A3DEU9 (Bifunctional purine biosynthesis protein PurH OS=Hungateiclostridium thermocellum (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=purH PE=3 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 4.4e-131
Identity = 249/551 (45.19%), Postives = 340/551 (61.71%), Query Frame = 0

Query: 82  KLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFPEMLDGR 141
           K ALIS+SDK  +  +   LQ +G  I+STGGTA TL  +G++V  + ++T FPE LDGR
Sbjct: 3   KRALISVSDKTGIVEMARELQSMGVDIISTGGTAKTLSDAGIKVINISDVTGFPECLDGR 62

Query: 142 VKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEINFEDGIE 201
           VKTLHP +H GILA R    HM  LK+  I T D+V++NLYPF + +   + ++  + IE
Sbjct: 63  VKTLHPKVHAGILAIRSNEEHMRQLKELNIETIDMVIINLYPFKQTIL-KENVDLSEAIE 122

Query: 202 NIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGSED-DQQFRRKLAWKAFQHVAS 261
           NIDIGGP MIRAAAKN++DV+V+VD  DY A+LE LK ++D   + + KLA+K F+H + 
Sbjct: 123 NIDIGGPTMIRAAAKNYQDVVVIVDPSDYAAVLEELKTTKDVSLKTKFKLAYKVFEHTSH 182

Query: 262 YDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVNAGGIAT 321
           YD+ ++++L +Q   D+FP + ++       +RYGENPHQKA FY +      N G I  
Sbjct: 183 YDTLIAKYLREQIGEDEFPQTLSLTFEKVQDMRYGENPHQKAVFYKEVG---ANVGCITA 242

Query: 322 AVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYRLAVKAD 381
           A Q HGKE+SYNN  DA+ A   + EF  PT V VKH NPCGVAS  +I +AY  A +AD
Sbjct: 243 AKQLHGKELSYNNINDANGAIEIIKEFDEPTVVAVKHANPCGVASASNIYDAYIKAYEAD 302

Query: 382 PVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEILRGKSKT 441
           PVS FGGI+A N E+DE  A E+           ++F EIV+AP +TE  L+IL  K K 
Sbjct: 303 PVSIFGGIIAANREIDEKTAEEI----------NKIFVEIVIAPSFTEGALKILTQK-KN 362

Query: 442 LRILE----AGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDAEF 501
           +R+L+    + K  KG   +++V GG L Q+ +  +       VV+ K P + EL D  F
Sbjct: 363 IRLLQLEDISAKIPKGTYDMKKVPGGLLVQNYNSELLNMDDLKVVTEKKPTQEELEDLIF 422

Query: 502 AWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFPFG 561
           A   VKH KSN I +AK    +G+G GQ NR+ + +IA+   G+  KGA LASDAFFPF 
Sbjct: 423 AMKVVKHTKSNGIALAKGKQTIGVGPGQTNRVTACKIAIEYGGERTKGAVLASDAFFPFA 482

Query: 562 NFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYGVS 621
                                   D VE A  +G+  I +PGGSIRD +++D CNKYG++
Sbjct: 483 ------------------------DCVEAAAAAGITAIIQPGGSIRDQESIDACNKYGIA 514

Query: 622 LVFTNVRHFRH 628
           +VFT +RHF+H
Sbjct: 543 MVFTGMRHFKH 514

BLAST of ClCG03G004230 vs. ExPASy Swiss-Prot
Match: C1EV67 (Bifunctional purine biosynthesis protein PurH OS=Bacillus cereus (strain 03BB102) OX=572264 GN=purH PE=3 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 7.5e-131
Identity = 255/550 (46.36%), Postives = 347/550 (63.09%), Query Frame = 0

Query: 81  KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFPEMLDG 140
           KK AL+S+SDK  +     GL E G  ++STGGT   LE +G++V  + E+T FPE++DG
Sbjct: 2   KKRALVSVSDKTGVVEFVKGLLEQGIEVISTGGTKKLLEENGLQVIGISEVTGFPEIMDG 61

Query: 141 RVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEINFEDGI 200
           RVKTLHP+IHGG+LA RD   H+  + + G+   D V+VNLYPF E + +  ++ F D I
Sbjct: 62  RVKTLHPNIHGGLLAVRDNETHVAQMNELGMEPIDFVIVNLYPFKETI-AKPDVTFADAI 121

Query: 201 ENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLK-GSEDDQQFRRKLAWKAFQHVA 260
           ENIDIGGP MIR+AAKNHK V V+VD  DY  +L  LK   E  ++ +RKLA K F+H A
Sbjct: 122 ENIDIGGPTMIRSAAKNHKFVSVIVDPVDYDVVLAELKENGEVKEETKRKLAAKVFRHTA 181

Query: 261 SYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVNAGGIA 320
           +YD+ +S +L +Q +G++ P + TV    K  LRYGENPHQKA FY  K+   V +  +A
Sbjct: 182 AYDALISNYLTEQ-MGEESPETLTVTFEKKQDLRYGENPHQKATFY--KAPFTVTS-SVA 241

Query: 321 TAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYRLAVKA 380
            A Q HGKE+SYNN  DADAA + V EF  P  V VKH NPCGV    DI EAY  A +A
Sbjct: 242 YAEQLHGKELSYNNINDADAALSIVKEFTEPAVVAVKHMNPCGVGVGTDIHEAYTRAYEA 301

Query: 381 DPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEILRGKSK 440
           DPVS FGGI+A N E+D+A A +L E          +F EI++AP ++++ LE+L+ K K
Sbjct: 302 DPVSIFGGIIAANREIDKATAEKLHE----------IFLEIIIAPSFSKEALEVLQSK-K 361

Query: 441 TLRILEAG--KNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDAEFA 500
            LR+L     K       L  V GG L Q+ D L   +   ++ + + P E E +D + A
Sbjct: 362 NLRLLTVNIEKATSASKKLTSVQGGLLVQEEDTLSLDESTISIPTKREPSEQEWKDLKLA 421

Query: 501 WLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFPFGN 560
           W  VKHVKSNAIV+AK++  +G+G+GQ NR+ S +IA+ +AG++ +G+ALASDAFFP   
Sbjct: 422 WKVVKHVKSNAIVLAKDDMTIGVGAGQMNRVGSAKIAITQAGEKAQGSALASDAFFPM-- 481

Query: 561 FWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYGVSL 620
                                  D VEEA ++G+  I +PGGSIRD D++   + YG+++
Sbjct: 482 ----------------------PDTVEEAAKAGITAIIQPGGSIRDEDSIKVADTYGIAM 511

Query: 621 VFTNVRHFRH 628
           VFT VRHF+H
Sbjct: 542 VFTGVRHFKH 511

BLAST of ClCG03G004230 vs. ExPASy Swiss-Prot
Match: Q6HPA0 (Bifunctional purine biosynthesis protein PurH OS=Bacillus thuringiensis subsp. konkukian (strain 97-27) OX=281309 GN=purH PE=3 SV=1)

HSP 1 Score: 469.2 bits (1206), Expect = 7.5e-131
Identity = 255/550 (46.36%), Postives = 347/550 (63.09%), Query Frame = 0

Query: 81  KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFPEMLDG 140
           KK AL+S+SDK  +     GL E G  ++STGGT   LE +G++V  + E+T FPE++DG
Sbjct: 2   KKRALVSVSDKTGVVEFVKGLLEQGIEVISTGGTKKLLEKNGLQVIGISEVTGFPEIMDG 61

Query: 141 RVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEINFEDGI 200
           RVKTLHP+IHGG+LA RD   H+  + + G+   D V+VNLYPF E + +  ++ F D I
Sbjct: 62  RVKTLHPNIHGGLLAVRDNETHVAQMNELGMEPIDFVIVNLYPFKETI-AKPDVTFADAI 121

Query: 201 ENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLK-GSEDDQQFRRKLAWKAFQHVA 260
           ENIDIGGP MIR+AAKNHK V V+VD  DY  +L  LK   E  ++ +RKLA K F+H A
Sbjct: 122 ENIDIGGPTMIRSAAKNHKFVSVIVDPVDYDVVLAELKENGEVKEETKRKLAAKVFRHTA 181

Query: 261 SYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVNAGGIA 320
           +YD+ +S +L +Q +G++ P + TV    K  LRYGENPHQKA FY  K+   V +  +A
Sbjct: 182 AYDALISNYLTEQ-MGEESPETLTVTFEKKQDLRYGENPHQKATFY--KAPFAVTS-SVA 241

Query: 321 TAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYRLAVKA 380
            A Q HGKE+SYNN  DADAA + V EF  P  V VKH NPCGV    DI EAY  A +A
Sbjct: 242 YAEQLHGKELSYNNINDADAALSIVKEFTEPAVVAVKHMNPCGVGVGTDIHEAYTRAYEA 301

Query: 381 DPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEILRGKSK 440
           DPVS FGGI+A N E+D+A A +L E          +F EI++AP ++++ LE+L+ K K
Sbjct: 302 DPVSIFGGIIAANREIDKATAEKLHE----------IFLEIIIAPSFSKEALEVLQSK-K 361

Query: 441 TLRILEAG--KNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDAEFA 500
            LR+L     K       L  V GG L Q+ D L   +   ++ + + P E E +D + A
Sbjct: 362 NLRLLTVNIEKATSASKKLTSVQGGLLVQEEDTLSLDESTISIPTKREPSEQEWKDLKLA 421

Query: 501 WLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFPFGN 560
           W  VKHVKSNAIV+AK++  +G+G+GQ NR+ S +IA+ +AG++ +G+ALASDAFFP   
Sbjct: 422 WKVVKHVKSNAIVLAKDDMTIGVGAGQMNRVGSAKIAITQAGEKAQGSALASDAFFPM-- 481

Query: 561 FWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYGVSL 620
                                  D VEEA ++G+  I +PGGSIRD D++   + YG+++
Sbjct: 482 ----------------------PDTVEEAAKAGITAIIQPGGSIRDEDSIKVADTYGIAM 511

Query: 621 VFTNVRHFRH 628
           VFT VRHF+H
Sbjct: 542 VFTGVRHFKH 511

BLAST of ClCG03G004230 vs. ExPASy Swiss-Prot
Match: C3PBN4 (Bifunctional purine biosynthesis protein PurH OS=Bacillus anthracis (strain A0248) OX=592021 GN=purH PE=3 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 9.8e-131
Identity = 254/550 (46.18%), Postives = 343/550 (62.36%), Query Frame = 0

Query: 81  KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFPEMLDG 140
           KK AL+S+SDK  +     GL E G  ++STGGT   LE +G++V  + E+T FPE++DG
Sbjct: 2   KKRALVSVSDKTGVVEFVKGLLEQGIEVISTGGTKKLLEENGLQVIGISEVTGFPEIMDG 61

Query: 141 RVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEINFEDGI 200
           RVKTLHP+IHGG+LA RD   H+  + + G+   D VVVNLYPF E + +  ++ F D I
Sbjct: 62  RVKTLHPNIHGGLLAVRDNETHVAQMNELGMEPIDFVVVNLYPFKETI-AKPDVTFADAI 121

Query: 201 ENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLK-GSEDDQQFRRKLAWKAFQHVA 260
           ENIDIGGP MIR+AAKNHK V V+VD  DY  +L  LK   E  ++ +RKLA K F+H A
Sbjct: 122 ENIDIGGPTMIRSAAKNHKFVSVIVDPVDYDVVLAELKENGEVAEETKRKLAAKVFRHTA 181

Query: 261 SYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVNAGGIA 320
           +YD+ +S +L +Q +G++ P + TV    K  LRYGENPHQKA FY            +A
Sbjct: 182 AYDALISNYLTEQ-MGEESPETLTVTFEKKQDLRYGENPHQKATFY---KAPFAATSSVA 241

Query: 321 TAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYRLAVKA 380
            A Q HGKE+SYNN  DADAA + V EF  P  V VKH NPCGV    DI EAY  A +A
Sbjct: 242 YAEQLHGKELSYNNINDADAALSIVKEFTEPAVVAVKHMNPCGVGVGTDIHEAYTRAYEA 301

Query: 381 DPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEILRGKSK 440
           DPVS FGGI+A N E+D+A A +L E          +F EI++AP ++++ LE+L+ K K
Sbjct: 302 DPVSIFGGIIAANREIDKATAEKLHE----------IFLEIIIAPSFSKEALEVLQSK-K 361

Query: 441 TLRILEAG--KNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDAEFA 500
            LR+L     K       L  V GG L Q+ D L   +   ++ + + P E E +D + A
Sbjct: 362 NLRLLTVNIEKATSASKKLTSVQGGLLVQEEDTLSLDESTISIPTKREPSEQEWKDLKLA 421

Query: 501 WLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFPFGN 560
           W  VKHVKSNAIV+AK++  +G+G+GQ NR+ S +IA+ +AG++ +G+ALASDAFFP   
Sbjct: 422 WKVVKHVKSNAIVLAKDDMTIGVGAGQMNRVGSAKIAITQAGEKAQGSALASDAFFPM-- 481

Query: 561 FWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYGVSL 620
                                  D VEEA ++G+  I +PGGSIRD D++   + YG+++
Sbjct: 482 ----------------------PDTVEEAAKAGITAIIQPGGSIRDEDSIKVADTYGIAM 511

Query: 621 VFTNVRHFRH 628
           VFT VRHF+H
Sbjct: 542 VFTGVRHFKH 511

BLAST of ClCG03G004230 vs. ExPASy TrEMBL
Match: A0A1S3B1V0 (AICAR transformylase OS=Cucumis melo OX=3656 GN=LOC103484902 PE=3 SV=1)

HSP 1 Score: 1166.8 bits (3017), Expect = 0.0e+00
Identity = 583/627 (92.98%), Postives = 594/627 (94.74%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLHHSVLRRRCSTLKAM 60
           MFRSV AHSPATPITAIS GEPRAR FLKEANP PL+SLFTRVSLHH +LR+RCSTLKAM
Sbjct: 1   MFRSVVAHSPATPITAISSGEPRARLFLKEANPLPLISLFTRVSLHHCLLRQRCSTLKAM 60

Query: 61  ADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET 120
           ADGETITFSSK+TIPSASG KKLALISLSDKK+LAFLGNGLQELGYTIVSTGGTASTLE+
Sbjct: 61  ADGETITFSSKLTIPSASG-KKLALISLSDKKNLAFLGNGLQELGYTIVSTGGTASTLES 120

Query: 121 SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN 180
           SGV VTKVEE+TCFPEMLDGRVKTLHPSIHGGILARRDQ HHMDALKKHGIGTFDVVVVN
Sbjct: 121 SGVHVTKVEEVTCFPEMLDGRVKTLHPSIHGGILARRDQGHHMDALKKHGIGTFDVVVVN 180

Query: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240
           LYPFYEKVTSSQ INFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS
Sbjct: 181 LYPFYEKVTSSQGINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240

Query: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300
           EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ
Sbjct: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300

Query: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP 360
           KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEF NPTCVIVKHTNP
Sbjct: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFSNPTCVIVKHTNP 360

Query: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEI 420
           CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDE LARELREFRSPTDGETRMFYEI
Sbjct: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDETLARELREFRSPTDGETRMFYEI 420

Query: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNV 480
           VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQD+DDLVPQDIKFNV
Sbjct: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDSDDLVPQDIKFNV 480

Query: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540
           VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD
Sbjct: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540

Query: 541 EVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGS 600
           EVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGGS
Sbjct: 541 EVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGGS 600

Query: 601 IRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           IRDPDA+DCCNKYGVSL+FTNVRHFRH
Sbjct: 601 IRDPDAIDCCNKYGVSLIFTNVRHFRH 604

BLAST of ClCG03G004230 vs. ExPASy TrEMBL
Match: A0A0A0LRB7 (AICAR transformylase OS=Cucumis sativus OX=3659 GN=Csa_1G057010 PE=3 SV=1)

HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 582/627 (92.82%), Postives = 594/627 (94.74%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLHHSVLRRRCSTLKAM 60
           MFRSV AHSPATPITAIS GEPRA  FLKEANP PL+SLFTRVSLHHS+LR+RCSTLKAM
Sbjct: 1   MFRSVVAHSPATPITAISSGEPRAPLFLKEANPLPLISLFTRVSLHHSLLRQRCSTLKAM 60

Query: 61  ADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLET 120
           ADGETI FSSK+TIPSASG KKLALISLSDKK+LAFLGNGLQELGYTIVSTGGTASTLE+
Sbjct: 61  ADGETIAFSSKLTIPSASG-KKLALISLSDKKNLAFLGNGLQELGYTIVSTGGTASTLES 120

Query: 121 SGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVN 180
           SGV VTKVEE+TCFPEMLDGRVKTLHPSIHGGILARRDQ HHMDALKKHGIGTFDVVVVN
Sbjct: 121 SGVHVTKVEEVTCFPEMLDGRVKTLHPSIHGGILARRDQGHHMDALKKHGIGTFDVVVVN 180

Query: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240
           LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS
Sbjct: 181 LYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGS 240

Query: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPHQ 300
           EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPL+LKSSLRYGENPHQ
Sbjct: 241 EDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLALKSSLRYGENPHQ 300

Query: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNP 360
           KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEF NPTCVIVKHTNP
Sbjct: 301 KAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFSNPTCVIVKHTNP 360

Query: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEI 420
           CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDE LARELREFRSPTDGETRMFYEI
Sbjct: 361 CGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDETLARELREFRSPTDGETRMFYEI 420

Query: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNV 480
           VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQD+DDLVPQDIKFNV
Sbjct: 421 VVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDSDDLVPQDIKFNV 480

Query: 481 VSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540
           VSGKAPQE+ELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD
Sbjct: 481 VSGKAPQENELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGD 540

Query: 541 EVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGS 600
           EVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGGS
Sbjct: 541 EVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGGS 600

Query: 601 IRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           IRDPDA+DCCNKYGVSLVFTNVRHFRH
Sbjct: 601 IRDPDAIDCCNKYGVSLVFTNVRHFRH 604

BLAST of ClCG03G004230 vs. ExPASy TrEMBL
Match: A0A6J1KHE3 (AICAR transformylase OS=Cucurbita maxima OX=3661 GN=LOC111495272 PE=3 SV=1)

HSP 1 Score: 1141.7 bits (2952), Expect = 0.0e+00
Identity = 571/628 (90.92%), Postives = 586/628 (93.31%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLH-HSVLRRRCSTLKA 60
           MF SVAAHSPATPITAIS GEPRAR FLKE NPSPLL+LF+R SLH  SV RR C T K 
Sbjct: 1   MFSSVAAHSPATPITAISLGEPRARVFLKEPNPSPLLTLFSRNSLHTQSVQRRPCFTFKV 60

Query: 61  MADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120
           MADGETITFSSKIT PS SG KKLALISLSDKKDLAFLG+GLQELGYTIVSTGGTASTLE
Sbjct: 61  MADGETITFSSKITFPSGSG-KKLALISLSDKKDLAFLGHGLQELGYTIVSTGGTASTLE 120

Query: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVV 180
           TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGI TFDVVVV
Sbjct: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGISTFDVVVV 180

Query: 181 NLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG 240
           NLYPFYEKVTSSQ +NFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYP LLEFLKG
Sbjct: 181 NLYPFYEKVTSSQNLNFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPTLLEFLKG 240

Query: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300
           SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH
Sbjct: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300

Query: 301 QKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTN 360
           QKAAFYVDKSLSEVN GGIATA+QHHGKEMSYNNYLDADAAWNCVSEFRNPTCV+VKHTN
Sbjct: 301 QKAAFYVDKSLSEVNGGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVVVKHTN 360

Query: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYE 420
           PCGVASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDEALARE+REFRSPTDGETRMFYE
Sbjct: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEALAREIREFRSPTDGETRMFYE 420

Query: 421 IVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFN 480
           IVVAPKYT+KGLEILRGKSKTLRILEA KNEKGKLSLRQVGGGWLAQD+DDLVPQDI+FN
Sbjct: 421 IVVAPKYTKKGLEILRGKSKTLRILEATKNEKGKLSLRQVGGGWLAQDSDDLVPQDIQFN 480

Query: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAG 540
           VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIA+RKAG
Sbjct: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIAMRKAG 540

Query: 541 DEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGG 600
           DEVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGG
Sbjct: 541 DEVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGG 600

Query: 601 SIRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           SIRDPDA++CCNKYGVSLVFTNVRHFRH
Sbjct: 601 SIRDPDAINCCNKYGVSLVFTNVRHFRH 605

BLAST of ClCG03G004230 vs. ExPASy TrEMBL
Match: A0A6J1EDV7 (AICAR transformylase OS=Cucurbita moschata OX=3662 GN=LOC111431645 PE=3 SV=1)

HSP 1 Score: 1140.6 bits (2949), Expect = 0.0e+00
Identity = 570/628 (90.76%), Postives = 585/628 (93.15%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLH-HSVLRRRCSTLKA 60
           M  SV AHSPATPITAIS GEPRARFFLKE NPSPLL+ F+R SLH  SV RR C T K 
Sbjct: 1   MLSSVVAHSPATPITAISLGEPRARFFLKEPNPSPLLTSFSRNSLHTQSVQRRPCFTFKV 60

Query: 61  MADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120
           MADGETIT+SSKIT PS SG KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE
Sbjct: 61  MADGETITYSSKITFPSGSG-KKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120

Query: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVV 180
           TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGI TFDVVVV
Sbjct: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGISTFDVVVV 180

Query: 181 NLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG 240
           NLYPFYEKVTSS+ +NFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG
Sbjct: 181 NLYPFYEKVTSSRNLNFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG 240

Query: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300
           SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH
Sbjct: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300

Query: 301 QKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTN 360
           QKAAFYVDKSLSEVNAGGIATA+QHHGKEMSYNNYLDADAAWNCVSEFRNPTCV+VKHTN
Sbjct: 301 QKAAFYVDKSLSEVNAGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVVVKHTN 360

Query: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYE 420
           PCGVASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDEALARE+REFRSPTDGETRMFYE
Sbjct: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEALAREIREFRSPTDGETRMFYE 420

Query: 421 IVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFN 480
           IVVAPKYT+KGLEILRGKSKTLRILEA KNEKGKLSLRQVGGGWLAQD+DDLVPQD +FN
Sbjct: 421 IVVAPKYTKKGLEILRGKSKTLRILEAAKNEKGKLSLRQVGGGWLAQDSDDLVPQDRQFN 480

Query: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAG 540
           VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIA+RKAG
Sbjct: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIAMRKAG 540

Query: 541 DEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGG 600
           DEVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGG
Sbjct: 541 DEVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGG 600

Query: 601 SIRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           SIRDPDA+DCCNKYGVSLVFTNVRHFRH
Sbjct: 601 SIRDPDAIDCCNKYGVSLVFTNVRHFRH 605

BLAST of ClCG03G004230 vs. ExPASy TrEMBL
Match: A0A6J1KF89 (AICAR transformylase OS=Cucurbita maxima OX=3661 GN=LOC111495272 PE=3 SV=1)

HSP 1 Score: 1137.1 bits (2940), Expect = 0.0e+00
Identity = 569/628 (90.61%), Postives = 584/628 (92.99%), Query Frame = 0

Query: 1   MFRSVAAHSPATPITAISFGEPRARFFLKEANPSPLLSLFTRVSLH-HSVLRRRCSTLKA 60
           MF SVAAHSPATPITAIS GEPRAR FLKE NPSPLL+LF+R SLH  SV RR C T K 
Sbjct: 1   MFSSVAAHSPATPITAISLGEPRARVFLKEPNPSPLLTLFSRNSLHTQSVQRRPCFTFKV 60

Query: 61  MADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLE 120
           MADGETITFSSKIT PS    KKLALISLSDKKDLAFLG+GLQELGYTIVSTGGTASTLE
Sbjct: 61  MADGETITFSSKITFPSG---KKLALISLSDKKDLAFLGHGLQELGYTIVSTGGTASTLE 120

Query: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVV 180
           TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGI TFDVVVV
Sbjct: 121 TSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGISTFDVVVV 180

Query: 181 NLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKG 240
           NLYPFYEKVTSSQ +NFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYP LLEFLKG
Sbjct: 181 NLYPFYEKVTSSQNLNFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPTLLEFLKG 240

Query: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300
           SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH
Sbjct: 241 SEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVGDKFPPSFTVPLSLKSSLRYGENPH 300

Query: 301 QKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTN 360
           QKAAFYVDKSLSEVN GGIATA+QHHGKEMSYNNYLDADAAWNCVSEFRNPTCV+VKHTN
Sbjct: 301 QKAAFYVDKSLSEVNGGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVVVKHTN 360

Query: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYE 420
           PCGVASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDEALARE+REFRSPTDGETRMFYE
Sbjct: 361 PCGVASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEALAREIREFRSPTDGETRMFYE 420

Query: 421 IVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFN 480
           IVVAPKYT+KGLEILRGKSKTLRILEA KNEKGKLSLRQVGGGWLAQD+DDLVPQDI+FN
Sbjct: 421 IVVAPKYTKKGLEILRGKSKTLRILEATKNEKGKLSLRQVGGGWLAQDSDDLVPQDIQFN 480

Query: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAG 540
           VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIA+RKAG
Sbjct: 481 VVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIAMRKAG 540

Query: 541 DEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGG 600
           DEVKGAALASDAFFPF                      +WNDAVEEACQSGVGIIAEPGG
Sbjct: 541 DEVKGAALASDAFFPF----------------------AWNDAVEEACQSGVGIIAEPGG 600

Query: 601 SIRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           SIRDPDA++CCNKYGVSLVFTNVRHFRH
Sbjct: 601 SIRDPDAINCCNKYGVSLVFTNVRHFRH 603

BLAST of ClCG03G004230 vs. TAIR 10
Match: AT2G35040.1 (AICARFT/IMPCHase bienzyme family protein )

HSP 1 Score: 935.6 bits (2417), Expect = 2.0e-272
Identity = 461/572 (80.59%), Postives = 503/572 (87.94%), Query Frame = 0

Query: 57  LKAMADGETITFSSKITIPSASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTAS 116
           ++AMA+ +T   +   +  S S  +K ALISLSDK+DLA LGNGLQELGYTIVSTGGTAS
Sbjct: 49  VRAMAESQTAQRNQPQS--SGSSGEKQALISLSDKRDLASLGNGLQELGYTIVSTGGTAS 108

Query: 117 TLETSGVRVTKVEELTCFPEMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDV 176
           TLE +GV VTKVE+LT FPEMLDGRVKTLHP+IHGGILARRD  HHM+AL +HGIGTFDV
Sbjct: 109 TLENAGVSVTKVEKLTHFPEMLDGRVKTLHPNIHGGILARRDVEHHMEALNEHGIGTFDV 168

Query: 177 VVVNLYPFYEKVTSSQEINFEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEF 236
           VVVNLYPFYEKVT+   I+FEDGIENIDIGGPAMIRAAAKNHKDVL+VVD+ DY A+LE+
Sbjct: 169 VVVNLYPFYEKVTAPGGISFEDGIENIDIGGPAMIRAAAKNHKDVLIVVDSGDYQAVLEY 228

Query: 237 LKGSEDDQQFRRKLAWKAFQHVASYDSAVSEWLWKQTVG-DKFPPSFTVPLSLKSSLRYG 296
           LKG + DQQFRRKLAWKAFQHVA+YDSAVSEWLWKQT G +KFPPSFTVPL LKSSLRYG
Sbjct: 229 LKGGQSDQQFRRKLAWKAFQHVAAYDSAVSEWLWKQTEGKEKFPPSFTVPLVLKSSLRYG 288

Query: 297 ENPHQKAAFYVDKSLSEVNAGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIV 356
           ENPHQKAAFYVDKSL+EVNAGGIATA+QHHGKEMSYNNYLDADAAWNCVSEF NPTCV+V
Sbjct: 289 ENPHQKAAFYVDKSLAEVNAGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFENPTCVVV 348

Query: 357 KHTNPCGVASRDDILEAYRLAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETR 416
           KHTNPCGVASRDDILEAYRLAVKADPVSAFGGIVAFN+EVDE LARE+REFRSPTDGETR
Sbjct: 349 KHTNPCGVASRDDILEAYRLAVKADPVSAFGGIVAFNVEVDEVLAREIREFRSPTDGETR 408

Query: 417 MFYEIVVAPKYTEKGLEILRGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQD 476
           MFYEIVVAPKYT KGLE+L+GKSKTLRILEA KN++GKLSLRQVGGGWLAQD+DDL P+D
Sbjct: 409 MFYEIVVAPKYTAKGLEVLKGKSKTLRILEAKKNDQGKLSLRQVGGGWLAQDSDDLTPED 468

Query: 477 IKFNVVSGKAPQESELRDAEFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIAL 536
           I FN VS K P ESEL DA+FAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNR+ESLRIA 
Sbjct: 469 ISFNSVSDKTPTESELADAKFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRVESLRIAF 528

Query: 537 RKAGDEVKGAALASDAFFPFGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIA 596
           +KAG+E KGAALASDAFFPF                      +W DAVEEACQ G+G+IA
Sbjct: 529 KKAGEEAKGAALASDAFFPF----------------------AWKDAVEEACQMGIGVIA 588

Query: 597 EPGGSIRDPDAVDCCNKYGVSLVFTNVRHFRH 628
           EPGGSIRD DA+DCC KYGVSL+FTNVRHFRH
Sbjct: 589 EPGGSIRDQDAIDCCKKYGVSLLFTNVRHFRH 596

BLAST of ClCG03G004230 vs. TAIR 10
Match: AT2G35040.2 (AICARFT/IMPCHase bienzyme family protein )

HSP 1 Score: 934.1 bits (2413), Expect = 5.9e-272
Identity = 457/553 (82.64%), Postives = 493/553 (89.15%), Query Frame = 0

Query: 76  SASGRKKLALISLSDKKDLAFLGNGLQELGYTIVSTGGTASTLETSGVRVTKVEELTCFP 135
           S S  +K ALISLSDK+DLA LGNGLQELGYTIVSTGGTASTLE +GV VTKVE+LT FP
Sbjct: 15  SGSSGEKQALISLSDKRDLASLGNGLQELGYTIVSTGGTASTLENAGVSVTKVEKLTHFP 74

Query: 136 EMLDGRVKTLHPSIHGGILARRDQRHHMDALKKHGIGTFDVVVVNLYPFYEKVTSSQEIN 195
           EMLDGRVKTLHP+IHGGILARRD  HHM+AL +HGIGTFDVVVVNLYPFYEKVT+   I+
Sbjct: 75  EMLDGRVKTLHPNIHGGILARRDVEHHMEALNEHGIGTFDVVVVNLYPFYEKVTAPGGIS 134

Query: 196 FEDGIENIDIGGPAMIRAAAKNHKDVLVVVDTEDYPALLEFLKGSEDDQQFRRKLAWKAF 255
           FEDGIENIDIGGPAMIRAAAKNHKDVL+VVD+ DY A+LE+LKG + DQQFRRKLAWKAF
Sbjct: 135 FEDGIENIDIGGPAMIRAAAKNHKDVLIVVDSGDYQAVLEYLKGGQSDQQFRRKLAWKAF 194

Query: 256 QHVASYDSAVSEWLWKQTVG-DKFPPSFTVPLSLKSSLRYGENPHQKAAFYVDKSLSEVN 315
           QHVA+YDSAVSEWLWKQT G +KFPPSFTVPL LKSSLRYGENPHQKAAFYVDKSL+EVN
Sbjct: 195 QHVAAYDSAVSEWLWKQTEGKEKFPPSFTVPLVLKSSLRYGENPHQKAAFYVDKSLAEVN 254

Query: 316 AGGIATAVQHHGKEMSYNNYLDADAAWNCVSEFRNPTCVIVKHTNPCGVASRDDILEAYR 375
           AGGIATA+QHHGKEMSYNNYLDADAAWNCVSEF NPTCV+VKHTNPCGVASRDDILEAYR
Sbjct: 255 AGGIATAIQHHGKEMSYNNYLDADAAWNCVSEFENPTCVVVKHTNPCGVASRDDILEAYR 314

Query: 376 LAVKADPVSAFGGIVAFNIEVDEALARELREFRSPTDGETRMFYEIVVAPKYTEKGLEIL 435
           LAVKADPVSAFGGIVAFN+EVDE LARE+REFRSPTDGETRMFYEIVVAPKYT KGLE+L
Sbjct: 315 LAVKADPVSAFGGIVAFNVEVDEVLAREIREFRSPTDGETRMFYEIVVAPKYTAKGLEVL 374

Query: 436 RGKSKTLRILEAGKNEKGKLSLRQVGGGWLAQDADDLVPQDIKFNVVSGKAPQESELRDA 495
           +GKSKTLRILEA KN++GKLSLRQVGGGWLAQD+DDL P+DI FN VS K P ESEL DA
Sbjct: 375 KGKSKTLRILEAKKNDQGKLSLRQVGGGWLAQDSDDLTPEDISFNSVSDKTPTESELADA 434

Query: 496 EFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRLESLRIALRKAGDEVKGAALASDAFFP 555
           +FAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNR+ESLRIA +KAG+E KGAALASDAFFP
Sbjct: 435 KFAWLCVKHVKSNAIVIAKNNCMLGMGSGQPNRVESLRIAFKKAGEEAKGAALASDAFFP 494

Query: 556 FGNFWESNDDFKVSNVAPPICYISWNDAVEEACQSGVGIIAEPGGSIRDPDAVDCCNKYG 615
           F                      +W DAVEEACQ G+G+IAEPGGSIRD DA+DCC KYG
Sbjct: 495 F----------------------AWKDAVEEACQMGIGVIAEPGGSIRDQDAIDCCKKYG 545

Query: 616 VSLVFTNVRHFRH 628
           VSL+FTNVRHFRH
Sbjct: 555 VSLLFTNVRHFRH 545

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038876982.10.0e+0094.10bifunctional purine biosynthesis protein PurH [Benincasa hispida] >XP_038876983.... [more]
XP_008440494.10.0e+0092.98PREDICTED: bifunctional purine biosynthesis protein PurH [Cucumis melo] >XP_0084... [more]
XP_011652409.10.0e+0092.82uncharacterized protein LOC101206006 [Cucumis sativus] >XP_011652413.1 uncharact... [more]
KAG6584189.10.0e+0090.76hypothetical protein SDJN03_20121, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7019781.10.0e+0090.76purH [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A9VRF52.0e-13146.73Bifunctional purine biosynthesis protein PurH OS=Bacillus mycoides (strain KBAB4... [more]
A3DEU94.4e-13145.19Bifunctional purine biosynthesis protein PurH OS=Hungateiclostridium thermocellu... [more]
C1EV677.5e-13146.36Bifunctional purine biosynthesis protein PurH OS=Bacillus cereus (strain 03BB102... [more]
Q6HPA07.5e-13146.36Bifunctional purine biosynthesis protein PurH OS=Bacillus thuringiensis subsp. k... [more]
C3PBN49.8e-13146.18Bifunctional purine biosynthesis protein PurH OS=Bacillus anthracis (strain A024... [more]
Match NameE-valueIdentityDescription
A0A1S3B1V00.0e+0092.98AICAR transformylase OS=Cucumis melo OX=3656 GN=LOC103484902 PE=3 SV=1[more]
A0A0A0LRB70.0e+0092.82AICAR transformylase OS=Cucumis sativus OX=3659 GN=Csa_1G057010 PE=3 SV=1[more]
A0A6J1KHE30.0e+0090.92AICAR transformylase OS=Cucurbita maxima OX=3661 GN=LOC111495272 PE=3 SV=1[more]
A0A6J1EDV70.0e+0090.76AICAR transformylase OS=Cucurbita moschata OX=3662 GN=LOC111431645 PE=3 SV=1[more]
A0A6J1KF890.0e+0090.61AICAR transformylase OS=Cucurbita maxima OX=3661 GN=LOC111495272 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT2G35040.12.0e-27280.59AICARFT/IMPCHase bienzyme family protein [more]
AT2G35040.25.9e-27282.64AICARFT/IMPCHase bienzyme family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011607Methylglyoxal synthase-like domainSMARTSM00851MGS_2acoord: 93..209
e-value: 4.7E-44
score: 162.3
IPR011607Methylglyoxal synthase-like domainPFAMPF02142MGScoord: 98..208
e-value: 3.7E-22
score: 78.3
IPR011607Methylglyoxal synthase-like domainPROSITEPS51855MGScoord: 72..225
score: 19.575466
IPR002695Bifunctional purine biosynthesis protein PurH-likeSMARTSM00798aicarft_impchascoord: 214..538
e-value: 4.5E-168
score: 574.3
IPR002695Bifunctional purine biosynthesis protein PurH-likePIRSFPIRSF000414PurHcoord: 575..627
e-value: 1.6E-18
score: 64.3
coord: 76..574
e-value: 8.5E-199
score: 659.2
IPR002695Bifunctional purine biosynthesis protein PurH-likePFAMPF01808AICARFT_IMPCHascoord: 214..537
e-value: 9.8E-115
score: 383.3
IPR002695Bifunctional purine biosynthesis protein PurH-likeTIGRFAMTIGR00355TIGR00355coord: 82..563
e-value: 1.1E-160
score: 533.8
IPR002695Bifunctional purine biosynthesis protein PurH-likePANTHERPTHR11692BIFUNCTIONAL PURINE BIOSYNTHESIS PROTEIN PURHcoord: 54..627
IPR002695Bifunctional purine biosynthesis protein PurH-likeHAMAPMF_00139PurHcoord: 81..627
score: 38.641636
IPR024051AICAR transformylase, duplicated domain superfamilyGENE3D3.40.140.20coord: 286..453
e-value: 3.5E-60
score: 204.2
IPR024051AICAR transformylase, duplicated domain superfamilyGENE3D3.40.140.20coord: 473..627
e-value: 6.8E-48
score: 163.8
IPR036914Methylglyoxal synthase-like domain superfamilyGENE3D3.40.50.1380coord: 76..276
e-value: 4.5E-84
score: 282.7
IPR036914Methylglyoxal synthase-like domain superfamilySUPERFAMILY52335Methylglyoxal synthase-likecoord: 81..272
NoneNo IPR availablePANTHERPTHR11692:SF1AICARFT/IMPCHASE BIENZYME FAMILY PROTEINcoord: 54..627
NoneNo IPR availableCDDcd01421IMPCHcoord: 82..269
e-value: 2.73131E-97
score: 293.736
IPR016193Cytidine deaminase-likeSUPERFAMILY53927Cytidine deaminase-likecoord: 289..627

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG03G004230.1ClCG03G004230.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006189 'de novo' IMP biosynthetic process
biological_process GO:0006164 purine nucleotide biosynthetic process
cellular_component GO:0005829 cytosol
molecular_function GO:0003937 IMP cyclohydrolase activity
molecular_function GO:0004643 phosphoribosylaminoimidazolecarboxamide formyltransferase activity
molecular_function GO:0003824 catalytic activity