Cp4.1LG01g06560 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g06560
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Dihydropteroate synthase, putative
Location: Cp4.1LG01 : 130513 .. 133535 (+)
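
The location gives a sequence ID, a start, an end, and a strand. The sketch below shows how the span length follows from those coordinates, assuming they are 1-based and inclusive (a common genome-browser convention, but an assumption here). `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Cp4.1LG01 : 130513 .. 133535 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 130513 .. 133535 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 3023 bases on the forward strand.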
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
CTCAAGGGCATAAAAGACATTCCCCCGTCCTCTTCCGATGCTGCCCTAATTCCTGGTGTATAAACTTGTTGGAACTGGTACACCGTAATTTTTGCTTCATTTTCTGGGCTTCTCGCAAATGAACATTTTGAAGCATCCAATTATCAGCAGGCGAGGATTCAGATATGGTGGAGGTACTGTATGTTCATCTCTAATTTTGAGACTCTGGTTTTTGGATTCTCGTTCAAATGTTAGACGCACATTCGTTTCTGTTCCCCCTTTTTCTTTTCTGCTTACCTCATGTATTTGCTTGATCGACTTTTTTGGATTACAATGGTTTCAAAATAGAGCGAGGGGTTGGAGTAAGTAAATATTAGTTTTTGATTCCTCTGTCCAGTCATTATGATGATTCAGTAGTCAAAATTGTTCAAATCCAGCAAATGGATAGTTAATGTGCTCCAAATGACTATGGAGAAAGAATAATTTTTTCTCTGGCCTCTTGATAAGTTCTACATTCTGCTGAGCCTTTTTTTTTTAGACTAAGAATTTAAGTAGTGAAAGAAAGCTCCTCCCGCTTGAATTAGCATCTCAGCTTGTTGCTGATGAACCATTTGTACATTGTAGCATTGCAAAGTTCGTTCATTCATTCATCGCAAGACGCAGTGCTGGAAGTTTGTTCTAGAGAGCAAGAAGTAGTGATTGCTTTAGGGAGCAATGTGGGTGATAGACTGCAGAATTTCAACGGAGCTTTGCAGTTGATGAAAAAGGCAGGGATACACATTACAAGACATGCTTGTTTGTATGAGACAGCACCTGCTTATGTCACTAATCAACCTCAATTTCTCAACTCCGCTGTTAGAGCTGTCACAAAGCTTGGACCACATGAACTACTGAGTGCAGTTAAGAACATAGAGAAACAGCTGGGTCGTACTGCTGGTATACGCTACGGCCCGAGGCCGATTGACTTAGATATCTTGTTGTATGGAAGATATAAAATACACTCAGATACTCTCACTGTCCCTCATGAAAGAATCTGGGAAAGGCCGTTTGTGCTGGCTCCTTTGATTGATTTGCTGGGTTCAGATGTTGATACCGATGATGTTGCTTGCTGGCATTCTTTAGCTGCTGATCGTGGCGGGCTTTTTGAGTTATGGGAGAAAATGGGTGGTGAATCTCTTATTGGTAAAGAAGGAATGAGAAGGGTTTTGCCCATTGGAAACAACTTATGGGATTGGTCCTGCAAGACTTCCATCATGGGGGTTCTTAATTTGACACCTGACAGTTTTAGCGATGGTGGCAAGTTTCAACCTATTGAAGCTGCAGTTTCTCAGGTGCGTTCGATGGTTTCAGATGGTGCTGATATGATTGACATTGGTGCTCAGTCAACACGGCCCATGGCACCTATGATTTCTGTTGAAGAAGAATTGGATAGATTAGTTCCCGTTTTGGAAGCTGTTACAGGAATGCCAGAGATGAGTGGAAAGCTCATATCAGTGGATACGTTTTATTCAGAAGTCGCTTTGGAAGCTGTAAAGAGAGGGGCTCATATTGTAAATGACGTATCAGCGGGCCAGTTGGATCCTCAAATGCACAGGGTTGTTGCTCAGCTTAAGGTGCCTTATATTGCAATGCACATGAGAGGAGATCCATCTACAATGCAAAACAATGAGAATTTACAGTATGATGATGTTTGCAATGAAATTGCCTCTGAGCTACACTCTAGGATTAGAGATGCAGAATTATCAGGCATCCCAGCTTGGAGGATAATTGTTGATCCTGGGATTGGATTCTCGAAGAACACAAAGCAAAACCTGGAAATTCTAGGAGGCGTACCAAAGATTCGAGCAGAAATTGCAAGGAGAAGCTTGGGATTGTCTCATGCTCCCATGTTGATTGGACCTTCGAGAAAGAAATTTCTCGGCGAAGTATGTTCGCAACCGATTGCAACGAAGAGAGATCCCGCTACAGTTGCTGCAGTTACCATAGGGGTTCTCGGTGGTGCAAACATAGCTAGAGTACAT
AACGTAAGAGATAATGTGGATGCAGTGAGGCTTTGTGATGCAATGCTGAAGGAGAAAACAAGCTGAAGCATGATTGTTTCTCGTTACACATCCTTTTCATTCTCATTTTTTTAATCAAATTATTATTTTTGGATAATTTGGTACACCGCTCATGCTTTTACTGCTAGCGTCACAAGGAATAAAATGAGTTACGCGATGCCCCCTGGAATAGTATTATCTCAAGTTATTGGTTGTCATTGATCCATCGGAAGATTGTCTGTCTCTCACTAGTGCCCCTAGGAGTATTTGTTTTCAAGGTTGCCTTTAAATATCATTTGAGTAAAGGTATTGGCACTCTAAATGTCTGTTGGAATATTATTATCTGAAAACTTAACCAACAAATACAATGACATCCATTCAAAGTAATGGAAGAACTAATATAAAGAGATTAGAGAAAGAGAGAGTATACACAACTTAGTTCCTCCTTTCTTTGGTAAAAGGGAATTATCTGGTAACTGACGAAGCATGTAAATATCTGATAATTTCAAGAGGCCTAGAAAGTGATCAGGTCAGGTAAGTCTTGTTTATGTACTGGTTTAGAAACAAGGCTCAAATGGGAAGAAGAACAGCTTGCAAAAGGGGCTTGTAAAGTGGTTGATGAACAACAATCTGCATCAAATCTCTCCCATTTGTCTGTAATCACTGTCTTCAACCCCCTCTTTGATCCATTTCCAAGCTCGCTCCTCATCTCTCCTCCATTATCTTTCCCTTTTAAATTAAGGCTTCCATTGATAATTTTCTCAAGATCTTTGATATCTTCTTCTGGGATCTTCTCCAATTTTAGGCTCTGAGATGCATTCAAAACACCCATTTCTCTACATATCTCAAAGTATTGAGTCAAATCTTCTACGTGAGTTGCTGCTTTTTTGACAACTTGAAGTGCCAATGATGCTTCGGGTTTCGCTGGTGTCTCGTAAATGTTCAACAGAACTTGAGCAATTCCGTTGCAGATTTGGCTGTAAACGTCAAAAATCTCAAAGATGA

mRNA sequence

CTCAAGGGCATAAAAGACATTCCCCCGTCCTCTTCCGATGCTGCCCTAATTCCTGCATTGCAAAGTTCGTTCATTCATTCATCGCAAGACGCAGTGCTGGAAGTTTGTTCTAGAGAGCAAGAAGTAGTGATTGCTTTAGGGAGCAATGTGGGTGATAGACTGCAGAATTTCAACGGAGCTTTGCAGTTGATGAAAAAGGCAGGGATACACATTACAAGACATGCTTGTTTGTATGAGACAGCACCTGCTTATGTCACTAATCAACCTCAATTTCTCAACTCCGCTGTTAGAGCTGTCACAAAGCTTGGACCACATGAACTACTGAGTGCAGTTAAGAACATAGAGAAACAGCTGGGTCGTACTGCTGGTATACGCTACGGCCCGAGGCCGATTGACTTAGATATCTTGTTGTATGGAAGATATAAAATACACTCAGATACTCTCACTGTCCCTCATGAAAGAATCTGGGAAAGGCCGTTTGTGCTGGCTCCTTTGATTGATTTGCTGGGTTCAGATGTTGATACCGATGATGTTGCTTGCTGGCATTCTTTAGCTGCTGATCGTGGCGGGCTTTTTGAGTTATGGGAGAAAATGGGTGGTGAATCTCTTATTGGTAAAGAAGGAATGAGAAGGGTTTTGCCCATTGGAAACAACTTATGGGATTGGTCCTGCAAGACTTCCATCATGGGGGTTCTTAATTTGACACCTGACAGTTTTAGCGATGGTGGCAAGTTTCAACCTATTGAAGCTGCAGTTTCTCAGGTGCGTTCGATGGTTTCAGATGGTGCTGATATGATTGACATTGGTGCTCAGTCAACACGGCCCATGGCACCTATGATTTCTGTTGAAGAAGAATTGGATAGATTAGTTCCCGTTTTGGAAGCTGTTACAGGAATGCCAGAGATGAGTGGAAAGCTCATATCAGTGGATACGTTTTATTCAGAAGTCGCTTTGGAAGCTGTAAAGAGAGGGGCTCATATTGTAAATGACGTATCAGCGGGCCAGTTGGATCCTCAAATGCACAGGGTTGTTGCTCAGCTTAAGGTGCCTTATATTGCAATGCACATGAGAGGAGATCCATCTACAATGCAAAACAATGAGAATTTACAGTATGATGATGTTTGCAATGAAATTGCCTCTGAGCTACACTCTAGGATTAGAGATGCAGAATTATCAGGCATCCCAGCTTGGAGGATAATTGTTGATCCTGGGATTGGATTCTCGAAGAACACAAAGCAAAACCTGGAAATTCTAGGAGGCGTACCAAAGATTCGAGCAGAAATTGCAAGGAGAAGCTTGGGATTGTCTCATGCTCCCATGTTGATTGGACCTTCGAGAAAGAAATTTCTCGGCGAAGTATGTTCGCAACCGATTGCAACGAAGAGAGATCCCGCTACAGTTGCTGCAGTTACCATAGGGGTTCTCGGTGGTGCAAACATAGCTAGAGTACATAACGTAAGAGATAATGTGGATGCAGTGAGGCTTTGTGATGCAATGCTGAAGGAGAAAACAAGCTGAAGCATGATTGTTTCTCGTTACACATCCTTTTCATTCTCATTTTTTTAATCAAATTATTATTTTTGGATAATTTGGTACACCGCTCATGCTTTTACTGCTAGCGTCACAAGGAATAAAATGAGTTACGCGATGCCCCCTGGAATAGTATTATCTCAAGTTATTGGTTGTCATTGATCCATCGGAAGATTGTCTGTCTCTCACTAGTGCCCCTAGGAGTATTTGTTTTCAAGGTTGCCTTTAAATATCATTTGAGTAAAGGTATTGGCACTCTAAATGTCTGTTGGAATATTATTATCTGAAAACTTAACCAACAAATACAATGACATCCATTCAAAGTAATGGAAGAACTAATATAAAGAGATTAGAGAAAGAGAGAGTATACACAACTTAGTTCCTCCTTTCTTTGGTAAAAGGGAATTATCTGGTAACTGACGAAGCATGTAAATATCTGATAATTTCAAGAGGCCTAGAAAGTGATCAGG
TCAGGTAAGTCTTGTTTATGTACTGGTTTAGAAACAAGGCTCAAATGGGAAGAAGAACAGCTTGCAAAAGGGGCTTGTAAAGTGGTTGATGAACAACAATCTGCATCAAATCTCTCCCATTTGTCTGTAATCACTGTCTTCAACCCCCTCTTTGATCCATTTCCAAGCTCGCTCCTCATCTCTCCTCCATTATCTTTCCCTTTTAAATTAAGGCTTCCATTGATAATTTTCTCAAGATCTTTGATATCTTCTTCTGGGATCTTCTCCAATTTTAGGCTCTGAGATGCATTCAAAACACCCATTTCTCTACATATCTCAAAGTATTGAGTCAAATCTTCTACGTGAGTTGCTGCTTTTTTGACAACTTGAAGTGCCAATGATGCTTCGGGTTTCGCTGGTGTCTCGTAAATGTTCAACAGAACTTGAGCAATTCCGTTGCAGATTTGGCTGTAAACGTCAAAAATCTCAAAGATGA

Coding sequence (CDS)

CTCAAGGGCATAAAAGACATTCCCCCGTCCTCTTCCGATGCTGCCCTAATTCCTGCATTGCAAAGTTCGTTCATTCATTCATCGCAAGACGCAGTGCTGGAAGTTTGTTCTAGAGAGCAAGAAGTAGTGATTGCTTTAGGGAGCAATGTGGGTGATAGACTGCAGAATTTCAACGGAGCTTTGCAGTTGATGAAAAAGGCAGGGATACACATTACAAGACATGCTTGTTTGTATGAGACAGCACCTGCTTATGTCACTAATCAACCTCAATTTCTCAACTCCGCTGTTAGAGCTGTCACAAAGCTTGGACCACATGAACTACTGAGTGCAGTTAAGAACATAGAGAAACAGCTGGGTCGTACTGCTGGTATACGCTACGGCCCGAGGCCGATTGACTTAGATATCTTGTTGTATGGAAGATATAAAATACACTCAGATACTCTCACTGTCCCTCATGAAAGAATCTGGGAAAGGCCGTTTGTGCTGGCTCCTTTGATTGATTTGCTGGGTTCAGATGTTGATACCGATGATGTTGCTTGCTGGCATTCTTTAGCTGCTGATCGTGGCGGGCTTTTTGAGTTATGGGAGAAAATGGGTGGTGAATCTCTTATTGGTAAAGAAGGAATGAGAAGGGTTTTGCCCATTGGAAACAACTTATGGGATTGGTCCTGCAAGACTTCCATCATGGGGGTTCTTAATTTGACACCTGACAGTTTTAGCGATGGTGGCAAGTTTCAACCTATTGAAGCTGCAGTTTCTCAGGTGCGTTCGATGGTTTCAGATGGTGCTGATATGATTGACATTGGTGCTCAGTCAACACGGCCCATGGCACCTATGATTTCTGTTGAAGAAGAATTGGATAGATTAGTTCCCGTTTTGGAAGCTGTTACAGGAATGCCAGAGATGAGTGGAAAGCTCATATCAGTGGATACGTTTTATTCAGAAGTCGCTTTGGAAGCTGTAAAGAGAGGGGCTCATATTGTAAATGACGTATCAGCGGGCCAGTTGGATCCTCAAATGCACAGGGTTGTTGCTCAGCTTAAGGTGCCTTATATTGCAATGCACATGAGAGGAGATCCATCTACAATGCAAAACAATGAGAATTTACAGTATGATGATGTTTGCAATGAAATTGCCTCTGAGCTACACTCTAGGATTAGAGATGCAGAATTATCAGGCATCCCAGCTTGGAGGATAATTGTTGATCCTGGGATTGGATTCTCGAAGAACACAAAGCAAAACCTGGAAATTCTAGGAGGCGTACCAAAGATTCGAGCAGAAATTGCAAGGAGAAGCTTGGGATTGTCTCATGCTCCCATGTTGATTGGACCTTCGAGAAAGAAATTTCTCGGCGAAGTATGTTCGCAACCGATTGCAACGAAGAGAGATCCCGCTACAGTTGCTGCAGTTACCATAGGGGTTCTCGGTGGTGCAAACATAGCTAGAGTACATAACGTAAGAGATAATGTGGATGCAGTGAGGCTTTGTGATGCAATGCTGAAGGAGAAAACAAGCTGA

Protein sequence

LKGIKDIPPSSSDAALIPALQSSFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAMLKEKTS
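
The protein sequence above corresponds to a frame-1 translation of the CDS: its first residue, L, matches the first codon, CTC. The sketch below checks that relationship on the first and last few codons, assuming the standard genetic code. `translate` is a hypothetical helper written for this illustration, not a library function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate from the first base (frame 1), stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last five codons of the CDS listed above.
print(translate("CTCAAGGGCATAAAAGACATTCCCCCGTCC"))
print(translate("GAGAAAACAAGCTGA"))
```

The two fragments translate to the start (LKGIKDIPPS) and the end (EKTS, followed by the TGA stop) of the listed protein.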
BLAST of Cp4.1LG01g06560 vs. Swiss-Prot
Match: FOLM_PEA (Folate synthesis bifunctional protein, mitochondrial OS=Pisum sativum GN=MitHPPK/DHPS PE=1 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 3.5e-208
Identity = 344/489 (70.35%), Positives = 424/489 (86.71%), Query Frame = 1

Query: 17  IPALQSSFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHAC 76
           +  L  S  H++ ++ +E+ ++++EVVIALGSNVGDRL NF  AL+LM+K+GIHITRHA 
Sbjct: 21  LKVLGFSSFHTAPNSSIEIQTQDEEVVIALGSNVGDRLHNFKEALKLMRKSGIHITRHAS 80

Query: 77  LYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDIL 136
           LYETAPAYVT+QP+FLNSAVRA TKLGPHELL+A+K IEK +GRT GIRYGPRPIDLDIL
Sbjct: 81  LYETAPAYVTDQPRFLNSAVRADTKLGPHELLAALKRIEKDMGRTDGIRYGPRPIDLDIL 140

Query: 137 LYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWE 196
            YG++K+ SD LTVPHERIWERPFV+APL+DLLG+ +D+D VA WHS +   GGL  LWE
Sbjct: 141 FYGKFKVRSDILTVPHERIWERPFVMAPLMDLLGTAIDSDTVASWHSFSGHSGGLNALWE 200

Query: 197 KMGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVR 256
           K+GGESLIG+EGM RV+P+ N L DWS +T +MG+LNLTPDSFSDGG FQ +++AVSQ R
Sbjct: 201 KLGGESLIGEEGMYRVMPVANGLLDWSRRTLVMGILNLTPDSFSDGGNFQSVKSAVSQAR 260

Query: 257 SMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEV 316
            M+S+GAD+IDIGAQSTRPMA  IS EEEL RL+PVLEAV  +PE+ GKLISVDTFYSEV
Sbjct: 261 LMISEGADIIDIGAQSTRPMASRISAEEELGRLIPVLEAVMSIPEVEGKLISVDTFYSEV 320

Query: 317 ALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCN 376
           ALEAV++GAHI+NDVSAG+LD  M +V+A+L VPY+AMHMRGDPSTMQ++ENL+YD+VC 
Sbjct: 321 ALEAVRKGAHIINDVSAGKLDASMFKVMAELDVPYVAMHMRGDPSTMQDSENLKYDNVCK 380

Query: 377 EIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLS 436
           +I+SEL+SR+R+AE+SGIPAWRII+DPGIGFSK T+ NL  L G+P IR EI++RSL +S
Sbjct: 381 DISSELYSRVREAEISGIPAWRIIMDPGIGFSKKTEDNLAALTGIPDIREEISKRSLAIS 440

Query: 437 HAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLC 496
           HAP+LIGPSRK+FLGE+CS+P A  RDPAT+A+VT GVL GANI RVHNV+DN+DAV+LC
Sbjct: 441 HAPILIGPSRKRFLGEICSRPSAVDRDPATIASVTAGVLCGANIVRVHNVKDNLDAVKLC 500

Query: 497 DAMLKEKTS 506
           DA+LK+K+S
Sbjct: 501 DAILKQKSS 509
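
The percentages in each HSP summary line are the identical or positive-scoring columns divided by the alignment length. The sketch below recomputes them from the counts. The regular expression and the `summary_percentages` name are illustrative assumptions about this report's layout, not a general BLAST parser.

```python
import re

# Matches the summary line of an HSP; tolerates the "Postives" spelling variant.
SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def summary_percentages(line):
    """Return (percent identity, percent positives), rounded to two decimals."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, n_ident, pos, n_pos = map(int, m.groups())
    return round(100 * ident / n_ident, 2), round(100 * pos / n_pos, 2)

line = "Identity = 344/489 (70.35%), Positives = 424/489 (86.71%), Query Frame = 1"
print(summary_percentages(line))
```

For the FOLM_PEA hit this reproduces the reported 70.35% identity and 86.71% positives.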

BLAST of Cp4.1LG01g06560 vs. Swiss-Prot
Match: FOLM_ARATH (Folate synthesis bifunctional protein, mitochondrial OS=Arabidopsis thaliana GN=MitHPPK/DHPS PE=2 SV=1)

HSP 1 Score: 703.4 bits (1814), Expect = 1.8e-201
Identity = 336/489 (68.71%), Positives = 411/489 (84.05%), Query Frame = 1

Query: 18  PAL-QSSFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHAC 77
           PAL  S+F  S+    +EV S E EVVIALGSN+G+R+ NF  AL+LMK+ GI +TRH+C
Sbjct: 64  PALCNSAFSSSATSTTIEVQSTEHEVVIALGSNIGNRMNNFREALRLMKRGGICVTRHSC 123

Query: 78  LYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDIL 137
           LYETAP +VT+QP+FLN+AVR VTKLGPHELLS +K IE+ +GR  GIRYGPRP+DLDIL
Sbjct: 124 LYETAPVHVTDQPRFLNAAVRGVTKLGPHELLSVLKTIERDMGRKDGIRYGPRPLDLDIL 183

Query: 138 LYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWE 197
            YG+ +I SD L +PHER+WER FVLAPL+DLLGS VD D VA WHSLA   GG+F+ WE
Sbjct: 184 FYGKMRISSDKLIIPHERLWERSFVLAPLVDLLGSAVDNDTVAHWHSLAIHPGGIFQAWE 243

Query: 198 KMGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVR 257
           ++GGESLIG++G++RVLPIG+ LWD+S KT +MG+LNLTPDSFSDGGKFQ I++AVS+VR
Sbjct: 244 RLGGESLIGQDGIQRVLPIGDKLWDFSNKTHVMGILNLTPDSFSDGGKFQSIDSAVSRVR 303

Query: 258 SMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEV 317
           SM+S+GAD+IDIGAQSTRPMA  IS +EELDRL+PVLEAV GMPEM  KLISVDTF SEV
Sbjct: 304 SMISEGADIIDIGAQSTRPMASRISSQEELDRLLPVLEAVRGMPEMEEKLISVDTFNSEV 363

Query: 318 ALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCN 377
           A EA+  GA I+NDVSAG LDP MH+VVA+  VPY+AMHMRGDP TMQN ENLQYDDVC 
Sbjct: 364 ASEAISNGADILNDVSAGTLDPNMHKVVAESGVPYMAMHMRGDPCTMQNKENLQYDDVCK 423

Query: 378 EIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLS 437
           ++ASEL+ R+RDAELSGIPAWR+++DPGIGFSK+   NL+I+  +PKIR E+A+RS+ +S
Sbjct: 424 DVASELYLRVRDAELSGIPAWRVMIDPGIGFSKSVDHNLDIIMDLPKIREEMAKRSIAVS 483

Query: 438 HAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLC 497
           HAP+L+GPSRK+FLG++C +P AT RD ATVA+VT G+LGGANI RVHNVR N DA ++C
Sbjct: 484 HAPILVGPSRKRFLGDICGRPEATDRDAATVASVTAGILGGANIIRVHNVRHNADAAKVC 543

Query: 498 DAMLKEKTS 506
           DAML+ + S
Sbjct: 544 DAMLRRRRS 552

BLAST of Cp4.1LG01g06560 vs. Swiss-Prot
Match: FOLC_ARATH (Folate synthesis bifunctional protein OS=Arabidopsis thaliana GN=CytHPPK/DHPS PE=1 SV=1)

HSP 1 Score: 660.6 bits (1703), Expect = 1.4e-188
Identity = 313/468 (66.88%), Positives = 396/468 (84.62%), Query Frame = 1

Query: 40  QEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAPAYVTNQPQFLNSAVRAV 99
           +EVVIALGSNVG+R+ NF  AL+LMK  GI +TRH+CLYET P +VT+QP+FLN+A+R V
Sbjct: 12  EEVVIALGSNVGNRMNNFKEALRLMKDYGISVTRHSCLYETEPVHVTDQPRFLNAAIRGV 71

Query: 100 TKLGPHELLSAVKNIEKQLGRTA-GIRYGPRPIDLDILLYGRYKIHSDTLTVPHERIWER 159
           TKL PHELL+ +K IEK++GR   G+RYGPRP+DLDIL YG++KI SD L +PHERIWER
Sbjct: 72  TKLKPHELLNVLKKIEKEMGREENGLRYGPRPLDLDILFYGKHKIISDKLIIPHERIWER 131

Query: 160 PFVLAPLIDLLGS-DVDTDD-VACWHSLAADRGGLFELWEKMGGESLIGKEGM-RRVLPI 219
           PFVLAPL+DLLG+ D+D D  VA WHSL+   GG+F+ WE++GGESL+GK+G+ +RV+PI
Sbjct: 132 PFVLAPLVDLLGTEDIDNDKIVAYWHSLSMHSGGIFQAWERLGGESLLGKDGIIQRVIPI 191

Query: 220 GNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDGADMIDIGAQSTRP 279
           G++LWD+S KT +MG+LNLTPDSFSDGGKFQ ++ AVS+VRSM+S+G D+IDIGAQSTRP
Sbjct: 192 GDHLWDFSKKTYVMGILNLTPDSFSDGGKFQSVDTAVSRVRSMISEGVDIIDIGAQSTRP 251

Query: 280 MAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVKRGAHIVNDVSAGQ 339
           MA  IS +EE+DRL+PVL+ V GM EM GKLISVDTF SEVALEA++ GA I+NDVS G 
Sbjct: 252 MASRISSQEEIDRLIPVLKVVRGMAEMKGKLISVDTFNSEVALEAIRNGADILNDVSGGS 311

Query: 340 LDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASELHSRIRDAELSGIP 399
           LD  MH+VVA   VPY+ MHMRGDP TMQN ENL+Y+++C ++A+EL+ R+R+AELSGIP
Sbjct: 312 LDENMHKVVADSDVPYMIMHMRGDPCTMQNKENLEYNEICKDVATELYERVREAELSGIP 371

Query: 400 AWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLIGPSRKKFLGEVCS 459
           AWRI++DPGIGFSK    NL+I+  +PKIR E+A++S+GLSHAP+LIGPSRK+FLG++C 
Sbjct: 372 AWRIMIDPGIGFSKGIDHNLDIVMELPKIREEMAKKSIGLSHAPILIGPSRKRFLGDICG 431

Query: 460 QPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAMLKEK 504
           +P A++RD ATVA VT G+L GANI RVHNVRDNVDA RLCDAM+ ++
Sbjct: 432 RPEASERDAATVACVTAGILKGANIIRVHNVRDNVDAARLCDAMMTKR 479

BLAST of Cp4.1LG01g06560 vs. Swiss-Prot
Match: FOL1_PNECA (Folic acid synthesis protein fol1 OS=Pneumocystis carinii GN=fol1 PE=1 SV=1)

HSP 1 Score: 311.6 bits (797), Expect = 1.6e-83
Identity = 196/500 (39.20%), Positives = 288/500 (57.60%), Query Frame = 1

Query: 12  SDAALIPALQSSFIHSSQDAVLEVCSREQEVV-IALGSNVGDRLQNFNGALQLMKKAGIH 71
           +++A +  ++S    SS + +    S + E V I+LGSN+G+R++    A++ M   GI 
Sbjct: 265 AESAGVEIVRSRSCFSSNNYIKSENSIDNEAVYISLGSNLGNRIKFILDAIEKMSIKGIK 324

Query: 72  ITRHACLYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRP 131
           + + + LYE+ P Y  +QP F N+  +  T L P +LL  ++ IEK+LGR   I  GPR 
Sbjct: 325 VLKTSMLYESKPMYFKDQPAFYNAVCKVQTSLHPEQLLFELQLIEKELGRVKVIDKGPRC 384

Query: 132 IDLDILLYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGG 191
           IDLDI+ YGR  I+S++L +PH R+ ER FVL PL+D+ G  V        H +      
Sbjct: 385 IDLDIVFYGRKIINSESLIIPHPRVLERSFVLKPLLDISGDLV--------HPVTGL--S 444

Query: 192 LFELWEKMGGESLIGKEGMRRVLPI--GNNLWDWSCK-----TSIMGVLNLTPDSFSDGG 251
           +   +EK      I    ++ VLP    N   D+S +     T IM +LNLTPDSF DGG
Sbjct: 445 IASYFEK------IVDHDIKPVLPFLYKNKSIDFSFRSYKAPTYIMAILNLTPDSFFDGG 504

Query: 252 KFQPIEAAVSQVRSMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGM-PEM 311
                ++ +  V   ++ GA +IDIG QSTRP + +I +EEE+ R++P ++ +    P++
Sbjct: 505 -IHSYDSVLIDVEKFINAGATIIDIGGQSTRPGSYIIPLEEEIFRVIPAIKYLQKTYPDI 564

Query: 312 SGKLISVDTFYSEVALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPST 371
              LIS+DTF SEVA +AVK GA +VND+S G+ DP+M   VA+LKVP   MHMRG+   
Sbjct: 565 ---LISIDTFRSEVAEQAVKAGASLVNDISGGRYDPKMFNTVARLKVPICIMHMRGNFLN 624

Query: 372 MQNNENLQYDDVCNEIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVP 431
           M N  +    D+  +I  EL   +  AE SGIP W II+DPG+GFSK   QN+E+L    
Sbjct: 625 MDNLTDYG-TDIIEQITIELEKLLNSAEKSGIPRWNIILDPGLGFSKTLHQNIELLRRFN 684

Query: 432 KIRAEIARRSLGLSHAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIAR 491
           +++++     L     P L+GPSRK+F G +    +   R   TVAAV   + GG +I R
Sbjct: 685 ELKSKNCFNGL-----PWLLGPSRKRFTGFITGDNMPKDRIWGTVAAVVASISGGCDIIR 738

Query: 492 VHNVRDNVDAVRLCDAMLKE 503
           VH+V +     ++ DA+ KE
Sbjct: 745 VHDVYEMYKISKMSDAIWKE 738

BLAST of Cp4.1LG01g06560 vs. Swiss-Prot
Match: FOL1_SCHPO (Folic acid synthesis protein fol1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=fol1 PE=1 SV=2)

HSP 1 Score: 272.3 bits (695), Expect = 1.0e-71
Identity = 174/459 (37.91%), Positives = 255/459 (55.56%), Query Frame = 1

Query: 44  IALGSNVGDRLQNFNGALQLMKKA-GIHITRHACLYETAPAYVTNQPQFLNSAVRAVTKL 103
           ++ GSN+GD+ +    AL ++ K  GI +   + LYET P Y  +QP FLN   +  T++
Sbjct: 300 LSFGSNIGDKFEQIQTALSMLHKIEGIRVLDVSPLYETEPMYYKDQPSFLNGVCKIETRM 359

Query: 104 GPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYKIHSDTLTVPHERIWERPFVL 163
            P  LL A ++IE+++GR   I  GPR IDLDI+LY      S+ LT+PH  + ER FVL
Sbjct: 360 SPINLLRACQSIEQEMGRIKTILKGPRCIDLDIVLYEDCVYESEVLTIPHLGLQEREFVL 419

Query: 164 APLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGESLIGKEGMRRVLPIGNNLWDW 223
            PL+ L      + D+   ++       L E  +K      +  +G+R      N     
Sbjct: 420 RPLLAL------SPDLVHPYT----HQPLQEALDK------LPSQGIRLYSSFDNKKIIN 479

Query: 224 SCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDGADMIDIGAQSTRPMAPMISV 283
              T  MG+LN+TPDSFSDGGK       + + +SMV DGA ++DIG QST+P A  +SV
Sbjct: 480 GALT--MGILNVTPDSFSDGGKVSQ-NNILEKAKSMVGDGASILDIGGQSTKPGADPVSV 539

Query: 284 EEELDRLVPVLEAVTGMPEMSGKL--ISVDTFYSEVALEAVKRGAHIVNDVSAGQLDPQM 343
           EEEL R++P++  +      SG    IS+DT+YS+VA  A++ GA+I+NDV+ G  D +M
Sbjct: 540 EEELRRVIPMISLL----RSSGITVPISIDTYYSKVAKLAIEAGANIINDVTGGMGDEKM 599

Query: 344 HRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASELHSRIRDAELSGIPAWRII 403
             + A L+VP   MHMRG P TM+   ++   D+  E+A EL SR+  A  SG+  + II
Sbjct: 600 LPLAASLQVPICIMHMRGTPETMK-ALSIYEKDIVEEVAVELSSRVEAAVQSGVHRYNII 659

Query: 404 VDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLIGPSRKKFLGEVCSQPIAT 463
           +DPG GF+K  KQ+  +LG + ++  +   + +       L GPSRK F G         
Sbjct: 660 LDPGFGFAKTPKQSAGLLGRLHELMKKPQFKDM-----HWLSGPSRKGFTGYFTGDASPK 719

Query: 464 KRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAM 500
            R   T A VT  VL G +I RVH+ ++    V + +A+
Sbjct: 720 DRIWGTSACVTASVLQGVSIVRVHDTKEMSKVVGMANAI 729

BLAST of Cp4.1LG01g06560 vs. TrEMBL
Match: A0A0A0KP07_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G190480 PE=4 SV=1)

HSP 1 Score: 886.7 bits (2290), Expect = 1.3e-254
Identity = 436/488 (89.34%), Positives = 464/488 (95.08%), Query Frame = 1

Query: 19  ALQSSFIH-SSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACL 78
           ALQ SF+H SSQD V+E+CS+EQEVVIALGSNVGDRLQNFN AL+LMKKAGIHITRHACL
Sbjct: 19  ALQISFLHSSSQDKVVEICSQEQEVVIALGSNVGDRLQNFNEALRLMKKAGIHITRHACL 78

Query: 79  YETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL 138
           YETAPAYVT+QPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL
Sbjct: 79  YETAPAYVTDQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL 138

Query: 139 YGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEK 198
           YGRYK+HSDTLT+PHERIWERPFVLAPLIDLLGSDVDTDDVA WHSLAAD GGLFE WEK
Sbjct: 139 YGRYKVHSDTLTIPHERIWERPFVLAPLIDLLGSDVDTDDVASWHSLAADHGGLFESWEK 198

Query: 199 MGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRS 258
           +GGE L+GKEGMRRVL IGN+LWDWSCKTS+MGVLNLTPDSFSDGGKFQ IEAAVSQVRS
Sbjct: 199 VGGEYLVGKEGMRRVLSIGNSLWDWSCKTSVMGVLNLTPDSFSDGGKFQSIEAAVSQVRS 258

Query: 259 MVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVA 318
           MVSDGADMIDIGAQSTRPMAPMISVEEELDRL+PVLEAVT MPEMSGKLISVDTFYS+VA
Sbjct: 259 MVSDGADMIDIGAQSTRPMAPMISVEEELDRLIPVLEAVTRMPEMSGKLISVDTFYSKVA 318

Query: 319 LEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNE 378
           LEAVKRGAHIVNDVSAG LDP+MH+VVA L VPYIAMHMRGDPSTMQN ENLQYDDVCN+
Sbjct: 319 LEAVKRGAHIVNDVSAGNLDPEMHKVVADLNVPYIAMHMRGDPSTMQNKENLQYDDVCNQ 378

Query: 379 IASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSH 438
           IA ELHS+IRDAE SGIPAWRII+DPG+GFSK TKQNLEIL G+PKIRA IA+RSLGLSH
Sbjct: 379 IALELHSKIRDAESSGIPAWRIIIDPGVGFSKTTKQNLEILTGIPKIRAAIAKRSLGLSH 438

Query: 439 APMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCD 498
           APMLIGPSRKKFLGEVCS+ +AT+RDPAT+AAVT+GVLGGANI RVHNVR+NVDAVRLCD
Sbjct: 439 APMLIGPSRKKFLGEVCSRSVATERDPATIAAVTVGVLGGANIVRVHNVRNNVDAVRLCD 498

Query: 499 AMLKEKTS 506
           AM KEK S
Sbjct: 499 AMQKEKKS 506

BLAST of Cp4.1LG01g06560 vs. TrEMBL
Match: V4TD06_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031260mg PE=4 SV=1)

HSP 1 Score: 771.5 bits (1991), Expect = 6.1e-220
Identity = 367/478 (76.78%), Positives = 422/478 (88.28%), Query Frame = 1

Query: 23  SFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAP 82
           SF HSS +  +EV S+EQEVVIA+GSNVGDRL NFN ALQLMKK G++ITRH CLYET P
Sbjct: 27  SFFHSSPETTVEVQSQEQEVVIAMGSNVGDRLCNFNEALQLMKKLGVNITRHGCLYETEP 86

Query: 83  AYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYK 142
           AYVT+QP+FLNSAVR VTKLGPHELL  +K IEK +GRT GIRYGPRPIDLDIL YGR+ 
Sbjct: 87  AYVTDQPRFLNSAVRGVTKLGPHELLGVLKKIEKDMGRTNGIRYGPRPIDLDILFYGRFS 146

Query: 143 IHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGES 202
           IHSD LTVPHERIWERPFV+APL+DLLGS V++D VACWHSL+    GLFE WEK+GGES
Sbjct: 147 IHSDILTVPHERIWERPFVVAPLLDLLGSSVESDTVACWHSLSQQHNGLFETWEKLGGES 206

Query: 203 LIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDG 262
           LIGKEGM+RVLPIGN LWDWS KTS+MG+LNLTPDSFSDGGKFQ +EAAVSQVR M+S+G
Sbjct: 207 LIGKEGMKRVLPIGNLLWDWSLKTSVMGILNLTPDSFSDGGKFQSVEAAVSQVRLMISEG 266

Query: 263 ADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVK 322
           ADMIDIGAQSTRPMA  IS E+EL+RL+PVLEAV  MPEM GKL+SVDTFYS+VA EAV 
Sbjct: 267 ADMIDIGAQSTRPMATKISAEKELERLIPVLEAVLTMPEMEGKLVSVDTFYSKVASEAVG 326

Query: 323 RGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASEL 382
           +GAHI+NDVSAGQLDP M++VVA LKVPY+AMHMRGDPSTMQN ENLQYDDVC ++ASEL
Sbjct: 327 KGAHIINDVSAGQLDPDMYKVVAGLKVPYVAMHMRGDPSTMQNEENLQYDDVCKQVASEL 386

Query: 383 HSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLI 442
           +S++RDAELSGIPAWRII+DPGIGFSK  + NL+IL G+P IR  IA +SL  SHAP+LI
Sbjct: 387 YSKVRDAELSGIPAWRIIIDPGIGFSKKAEHNLDILLGLPAIRRHIAMKSLAASHAPILI 446

Query: 443 GPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAML 501
           GPSRK+FLGE+C++P A +RDPAT+A++T GVLGGANI RVHNVRDN+DAV+LCD+ML
Sbjct: 447 GPSRKRFLGEICNRPSADERDPATIASITAGVLGGANIVRVHNVRDNLDAVKLCDSML 504

BLAST of Cp4.1LG01g06560 vs. TrEMBL
Match: A0A061EBB3_THECC (Dihydropterin pyrophosphokinase / Dihydropteroate synthase OS=Theobroma cacao GN=TCM_011500 PE=4 SV=1)

HSP 1 Score: 771.5 bits (1991), Expect = 6.1e-220
Identity = 365/480 (76.04%), Positives = 425/480 (88.54%), Query Frame = 1

Query: 23  SFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAP 82
           +F+H++ D  +EV S +QEVVIALGSNVGDRL NFN ALQLM+K+GI ITRHACLYETAP
Sbjct: 27  AFLHTTTDQSVEVHSPDQEVVIALGSNVGDRLHNFNEALQLMRKSGIKITRHACLYETAP 86

Query: 83  AYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYK 142
           AYVT+QP+FLNSAVRAVTKLGPHELL  +K IEK +GRT GIRYGPRPIDLDIL YG+Y+
Sbjct: 87  AYVTDQPRFLNSAVRAVTKLGPHELLGVLKKIEKDMGRTGGIRYGPRPIDLDILFYGKYR 146

Query: 143 IHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGES 202
           I SD LTVPHERIWERPFV+APL+DLLGS +D D +ACWHS + D  GL   WEK+GGES
Sbjct: 147 IGSDILTVPHERIWERPFVMAPLMDLLGSVIDNDTIACWHSFSTDSDGLLGSWEKLGGES 206

Query: 203 LIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDG 262
           LIGKEGM+RVLPIGN LWDWS +TS+MG+LNLTPDSFSDGGKF  +E AVS V  M+S+G
Sbjct: 207 LIGKEGMKRVLPIGNRLWDWSERTSVMGILNLTPDSFSDGGKFLSVETAVSHVHLMISEG 266

Query: 263 ADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVK 322
           AD++DIGAQSTRPMA  IS EEELDRL+P+LEAV GM EM GKLISVDTFYS+VALEAVK
Sbjct: 267 ADIVDIGAQSTRPMASRISAEEELDRLIPILEAVLGMSEMEGKLISVDTFYSDVALEAVK 326

Query: 323 RGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASEL 382
           +GAHI+NDVSAGQLDP MHR+VA L VPYIAMHMRGDP+TMQ+++NLQYDDVC ++ASEL
Sbjct: 327 KGAHIINDVSAGQLDPNMHRIVASLGVPYIAMHMRGDPTTMQSSDNLQYDDVCLQVASEL 386

Query: 383 HSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLI 442
            SR+ DAELSGIPAWRII+DPGIGFSK T+ NL+IL G+P IRAEIA+RSL +SHAP+LI
Sbjct: 387 FSRVNDAELSGIPAWRIILDPGIGFSKKTEHNLDILAGLPDIRAEIAKRSLAVSHAPVLI 446

Query: 443 GPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAMLKE 502
           GPSRK+FLGE+C++P A +RDPAT+A+VT G+LGGANI RVHNV+DNVDAV++CDAMLKE
Sbjct: 447 GPSRKRFLGEICNRPAAVERDPATIASVTAGILGGANIVRVHNVKDNVDAVKVCDAMLKE 506

BLAST of Cp4.1LG01g06560 vs. TrEMBL
Match: B9HIG0_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s09700g PE=4 SV=2)

HSP 1 Score: 770.0 bits (1987), Expect = 1.8e-219
Identity = 365/479 (76.20%), Positives = 429/479 (89.56%), Query Frame = 1

Query: 27  SSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAPAYVT 86
           SS +  +E+ S+E+EVVIALGSNVG+RL NFN AL+LMKK+GI+ITRHACLYETAPAYVT
Sbjct: 31  SSPETFVEIRSQEKEVVIALGSNVGNRLHNFNEALRLMKKSGINITRHACLYETAPAYVT 90

Query: 87  NQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYKIHSD 146
           +QPQFLNSAVR VTKL PHELL  +K IEK +GRTAGIRYGPRPIDLDIL YG++++ SD
Sbjct: 91  DQPQFLNSAVRGVTKLWPHELLGVLKKIEKDMGRTAGIRYGPRPIDLDILFYGKFRVSSD 150

Query: 147 TLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGESLIGK 206
            LTVPHERIWERPFV+APL+DLLG+DV+ D VACWHSL+   GGLFE WEK+GGE +IGK
Sbjct: 151 ILTVPHERIWERPFVMAPLMDLLGADVENDTVACWHSLSIHSGGLFESWEKLGGECIIGK 210

Query: 207 EGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDGADMI 266
           +GM+RVLPIGN+LWDWS KTS+MG+LNLTPDSFSDGGKFQ +EAAVSQVR M+S+GADMI
Sbjct: 211 DGMKRVLPIGNDLWDWSLKTSVMGILNLTPDSFSDGGKFQSVEAAVSQVRLMISEGADMI 270

Query: 267 DIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVKRGAH 326
           D+GAQSTRP+A  IS +EELDRL+PVLEA+  MPEM+GKLISVDTFYSEVA EAV +GAH
Sbjct: 271 DLGAQSTRPVASRISPQEELDRLIPVLEAILKMPEMNGKLISVDTFYSEVASEAVSKGAH 330

Query: 327 IVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASELHSRI 386
           IVNDVS GQLDP M +VVA L+VPY+AMHMRGDP+TMQN+ENLQYDDVC ++ASEL+SR+
Sbjct: 331 IVNDVSGGQLDPNMTKVVAGLEVPYVAMHMRGDPATMQNSENLQYDDVCKQVASELYSRV 390

Query: 387 RDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLIGPSR 446
           +DAELSGIP WRII+DPG+GFSK T+ NLE+L G+P IRAEIAR+SL +SH+P+L+G SR
Sbjct: 391 KDAELSGIPVWRIIIDPGLGFSKKTEHNLELLMGLPSIRAEIARKSLAMSHSPVLVGSSR 450

Query: 447 KKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAMLKEKTS 506
           KKFLGE CS+P A++RDPATVA+VT GVLGGANI RVHNVRDN+DAV+LCDAMLK K S
Sbjct: 451 KKFLGETCSRPAASERDPATVASVTAGVLGGANIVRVHNVRDNLDAVKLCDAMLKYKRS 509

BLAST of Cp4.1LG01g06560 vs. TrEMBL
Match: M5XEJ3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004379mg PE=4 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 3.7e-217
Identity = 362/483 (74.95%), Positives = 423/483 (87.58%), Query Frame = 1

Query: 23  SFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAP 82
           +FIHSS +  +EV + +QEVVIALGSNVGDRL NFN ALQLM+K+GIHITRH CLYETAP
Sbjct: 27  AFIHSSPNFSVEVHAPDQEVVIALGSNVGDRLHNFNEALQLMRKSGIHITRHGCLYETAP 86

Query: 83  AYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYK 142
           AYVT+QP FLNSAVRAVT+LGPHELL A+K IEK++GRT GIRYGPRPIDLDIL YG+ +
Sbjct: 87  AYVTDQPNFLNSAVRAVTQLGPHELLGALKKIEKEMGRTDGIRYGPRPIDLDILFYGKLR 146

Query: 143 IHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGES 202
           + S+ LTVPHERIWERPFV+APL+DLLGS +D+D VACWHS +   GGLF+ WEK+GGE+
Sbjct: 147 VSSEILTVPHERIWERPFVIAPLMDLLGSTIDSDTVACWHSFSMHSGGLFDAWEKLGGET 206

Query: 203 LIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDG 262
           L GKEG++RVLPIG   WDWS KTS+MG+LNLTPDSFSDGGKFQ +EAA+SQVRSM+S+G
Sbjct: 207 LTGKEGLKRVLPIGEGFWDWSTKTSVMGILNLTPDSFSDGGKFQSVEAAISQVRSMISEG 266

Query: 263 ADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVK 322
           ADMIDIGAQSTRPMA  ISV++ELDRL+PVLEAV GMPE  GK+ISVDTFYSEVA EAV 
Sbjct: 267 ADMIDIGAQSTRPMASRISVQQELDRLIPVLEAVVGMPEAEGKIISVDTFYSEVAAEAVS 326

Query: 323 RGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASEL 382
           +GAHIVNDVSAG LD  M RVVA LKVPYIAMHMRGDPSTMQN+ENL+YD+VC ++ASEL
Sbjct: 327 KGAHIVNDVSAGLLDSNMFRVVAGLKVPYIAMHMRGDPSTMQNSENLKYDNVCKQVASEL 386

Query: 383 HSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLI 442
           +SR+R+AEL GIPAWR+I+DPGIGFSKN   NL++L G+P IRAEI   SL +SHAP+LI
Sbjct: 387 YSRVREAELIGIPAWRMIIDPGIGFSKNCDHNLDVLMGLPNIRAEIGSESLAMSHAPILI 446

Query: 443 GPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAMLKE 502
           GPSRKKFLGE+CS+   T+RDPATVA+VT  VLGGANI RVHNVRDN DAV++CDAML++
Sbjct: 447 GPSRKKFLGEICSRTAGTERDPATVASVTAAVLGGANIVRVHNVRDNADAVKVCDAMLRQ 506

Query: 503 KTS 506
           + S
Sbjct: 507 RKS 509

BLAST of Cp4.1LG01g06560 vs. TAIR10
Match: AT4G30000.2 (AT4G30000.2 Dihydropterin pyrophosphokinase / Dihydropteroate synthase)

HSP 1 Score: 690.6 bits (1781), Expect = 7.0e-199
Identity = 331/483 (68.53%), Positives = 405/483 (83.85%), Query Frame = 1

Query: 18  PAL-QSSFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHAC 77
           PAL  S+F  S+    +EV S E EVVIALGSN+G+R+ NF  AL+LMK+ GI +TRH+C
Sbjct: 64  PALCNSAFSSSATSTTIEVQSTEHEVVIALGSNIGNRMNNFREALRLMKRGGICVTRHSC 123

Query: 78  LYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDIL 137
           LYETAP +VT+QP+FLN+AVR VTKLGPHELLS +K IE+ +GR  GIRYGPRP+DLDIL
Sbjct: 124 LYETAPVHVTDQPRFLNAAVRGVTKLGPHELLSVLKTIERDMGRKDGIRYGPRPLDLDIL 183

Query: 138 LYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWE 197
            YG+ +I SD L +PHER+WER FVLAPL+DLLGS VD D VA WHSLA   GG+F+ WE
Sbjct: 184 FYGKMRISSDKLIIPHERLWERSFVLAPLVDLLGSAVDNDTVAHWHSLAIHPGGIFQAWE 243

Query: 198 KMGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVR 257
           ++GGESLIG++G++RVLPIG+ LWD+S KT +MG+LNLTPDSFSDGGKFQ I++AVS+VR
Sbjct: 244 RLGGESLIGQDGIQRVLPIGDKLWDFSNKTHVMGILNLTPDSFSDGGKFQSIDSAVSRVR 303

Query: 258 SMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEV 317
           SM+S+GAD+IDIGAQSTRPMA  IS +EELDRL+PVLEAV GMPEM  KLISVDTF SEV
Sbjct: 304 SMISEGADIIDIGAQSTRPMASRISSQEELDRLLPVLEAVRGMPEMEEKLISVDTFNSEV 363

Query: 318 ALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCN 377
           A EA+  GA I+NDVSAG LDP MH+VVA+  VPY+AMHMRGDP TMQN ENLQYDDVC 
Sbjct: 364 ASEAISNGADILNDVSAGTLDPNMHKVVAESGVPYMAMHMRGDPCTMQNKENLQYDDVCK 423

Query: 378 EIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLS 437
           ++ASEL+ R+RDAELSGIPAWR+++DPGIGFSK+   NL+I+  +PKIR E+A+RS+ +S
Sbjct: 424 DVASELYLRVRDAELSGIPAWRVMIDPGIGFSKSVDHNLDIIMDLPKIREEMAKRSIAVS 483

Query: 438 HAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLC 497
           HAP+L+GPSRK+FLG++C +P AT RD ATVA+VT G+LGGANI RVHNVR N DA ++ 
Sbjct: 484 HAPILVGPSRKRFLGDICGRPEATDRDAATVASVTAGILGGANIIRVHNVRHNADAAKID 543

Query: 498 DAM 500
            A+
Sbjct: 544 TAV 546

BLAST of Cp4.1LG01g06560 vs. TAIR10
Match: AT1G69190.1 (AT1G69190.1 Dihydropterin pyrophosphokinase / Dihydropteroate synthase)

HSP 1 Score: 660.6 bits (1703), Expect = 7.7e-190
Identity = 313/468 (66.88%), Positives = 396/468 (84.62%), Query Frame = 1

Query: 40  QEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAPAYVTNQPQFLNSAVRAV 99
           +EVVIALGSNVG+R+ NF  AL+LMK  GI +TRH+CLYET P +VT+QP+FLN+A+R V
Sbjct: 12  EEVVIALGSNVGNRMNNFKEALRLMKDYGISVTRHSCLYETEPVHVTDQPRFLNAAIRGV 71

Query: 100 TKLGPHELLSAVKNIEKQLGRTA-GIRYGPRPIDLDILLYGRYKIHSDTLTVPHERIWER 159
           TKL PHELL+ +K IEK++GR   G+RYGPRP+DLDIL YG++KI SD L +PHERIWER
Sbjct: 72  TKLKPHELLNVLKKIEKEMGREENGLRYGPRPLDLDILFYGKHKIISDKLIIPHERIWER 131

Query: 160 PFVLAPLIDLLGS-DVDTDD-VACWHSLAADRGGLFELWEKMGGESLIGKEGM-RRVLPI 219
           PFVLAPL+DLLG+ D+D D  VA WHSL+   GG+F+ WE++GGESL+GK+G+ +RV+PI
Sbjct: 132 PFVLAPLVDLLGTEDIDNDKIVAYWHSLSMHSGGIFQAWERLGGESLLGKDGIIQRVIPI 191

Query: 220 GNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDGADMIDIGAQSTRP 279
           G++LWD+S KT +MG+LNLTPDSFSDGGKFQ ++ AVS+VRSM+S+G D+IDIGAQSTRP
Sbjct: 192 GDHLWDFSKKTYVMGILNLTPDSFSDGGKFQSVDTAVSRVRSMISEGVDIIDIGAQSTRP 251

Query: 280 MAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVKRGAHIVNDVSAGQ 339
           MA  IS +EE+DRL+PVL+ V GM EM GKLISVDTF SEVALEA++ GA I+NDVS G 
Sbjct: 252 MASRISSQEEIDRLIPVLKVVRGMAEMKGKLISVDTFNSEVALEAIRNGADILNDVSGGS 311

Query: 340 LDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASELHSRIRDAELSGIP 399
           LD  MH+VVA   VPY+ MHMRGDP TMQN ENL+Y+++C ++A+EL+ R+R+AELSGIP
Sbjct: 312 LDENMHKVVADSDVPYMIMHMRGDPCTMQNKENLEYNEICKDVATELYERVREAELSGIP 371

Query: 400 AWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLIGPSRKKFLGEVCS 459
           AWRI++DPGIGFSK    NL+I+  +PKIR E+A++S+GLSHAP+LIGPSRK+FLG++C 
Sbjct: 372 AWRIMIDPGIGFSKGIDHNLDIVMELPKIREEMAKKSIGLSHAPILIGPSRKRFLGDICG 431

Query: 460 QPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAMLKEK 504
           +P A++RD ATVA VT G+L GANI RVHNVRDNVDA RLCDAM+ ++
Sbjct: 432 RPEASERDAATVACVTAGILKGANIIRVHNVRDNVDAARLCDAMMTKR 479
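
The Identity and Positives percentages in each HSP header are the match counts divided by the number of aligned columns. A minimal Python sketch, using the counts reported for the AT1G69190.1 hit above (the function name `percent` is illustrative and not part of any BLAST tool):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the AT1G69190.1 HSP header: 313 identities and
# 396 positives over 468 aligned columns.
identity = percent(313, 468)
positives = percent(396, 468)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applies to the other HSP headers on this page, for example 439/490 identities for the Cucumis melo isoform X1 hit.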

BLAST of Cp4.1LG01g06560 vs. NCBI nr
Match: gi|659114046|ref|XP_008456882.1| (PREDICTED: folic acid synthesis protein fol1-like isoform X1 [Cucumis melo])

HSP 1 Score: 888.6 bits (2295), Expect = 4.9e-255
Identity = 439/490 (89.59%), Positives = 462/490 (94.29%), Query Frame = 1

Query: 17  IPALQSSFIH-SSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHA 76
           + ALQ SF H SSQD V+E+CSREQEVVIALGSNVGDRLQNFN AL+LMKKAGIHITRHA
Sbjct: 22  VKALQISFFHSSSQDKVVEICSREQEVVIALGSNVGDRLQNFNEALRLMKKAGIHITRHA 81

Query: 77  CLYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDI 136
           CLYETAPAYVT+QPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDI
Sbjct: 82  CLYETAPAYVTDQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDI 141

Query: 137 LLYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELW 196
           LLYGRYKIHSD LT+PHERIWERPFVLAPLIDLLGSDVDTDDVA WHSLAAD GGLFE W
Sbjct: 142 LLYGRYKIHSDILTIPHERIWERPFVLAPLIDLLGSDVDTDDVASWHSLAADHGGLFESW 201

Query: 197 EKMGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQV 256
           EK+GGE L+GKEGMRRVL +GN+LWDWSCKTS+MGVLNLTPDSFSDGGKFQ IEAAVSQV
Sbjct: 202 EKVGGEYLVGKEGMRRVLSVGNSLWDWSCKTSVMGVLNLTPDSFSDGGKFQSIEAAVSQV 261

Query: 257 RSMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSE 316
           RSMVSDGADMIDIGAQSTRPMAPMISVEEELDRL+PVLEAVT MPEM GKLISVDTFYS+
Sbjct: 262 RSMVSDGADMIDIGAQSTRPMAPMISVEEELDRLIPVLEAVTRMPEMGGKLISVDTFYSK 321

Query: 317 VALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVC 376
           VALEAVKRGAHIVNDVSAG LDP+MH+VVA L VPYIAMHMRGDPSTMQNNENLQYDDVC
Sbjct: 322 VALEAVKRGAHIVNDVSAGNLDPEMHKVVADLNVPYIAMHMRGDPSTMQNNENLQYDDVC 381

Query: 377 NEIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGL 436
           N+IA ELHS+IRDAE SGIPAWRII+DPG+GFSK TKQNLEIL GVPKIR  IARRSLGL
Sbjct: 382 NQIALELHSKIRDAESSGIPAWRIIIDPGVGFSKTTKQNLEILTGVPKIRTAIARRSLGL 441

Query: 437 SHAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRL 496
           SHAPMLIGPSRKKFLGEVCS+ +AT+RDPATVAAVT+GVLGGANI RVHNVRDNVDAVRL
Sbjct: 442 SHAPMLIGPSRKKFLGEVCSRSVATERDPATVAAVTVGVLGGANIVRVHNVRDNVDAVRL 501

Query: 497 CDAMLKEKTS 506
           CDAM KEK S
Sbjct: 502 CDAMQKEKKS 511

BLAST of Cp4.1LG01g06560 vs. NCBI nr
Match: gi|659114050|ref|XP_008456884.1| (PREDICTED: folic acid synthesis protein fol1-like isoform X2 [Cucumis melo])

HSP 1 Score: 888.3 bits (2294), Expect = 6.5e-255
Identity = 439/488 (89.96%), Positives = 461/488 (94.47%), Query Frame = 1

Query: 19  ALQSSFIH-SSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACL 78
           ALQ SF H SSQD V+E+CSREQEVVIALGSNVGDRLQNFN AL+LMKKAGIHITRHACL
Sbjct: 19  ALQISFFHSSSQDKVVEICSREQEVVIALGSNVGDRLQNFNEALRLMKKAGIHITRHACL 78

Query: 79  YETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL 138
           YETAPAYVT+QPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL
Sbjct: 79  YETAPAYVTDQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL 138

Query: 139 YGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEK 198
           YGRYKIHSD LT+PHERIWERPFVLAPLIDLLGSDVDTDDVA WHSLAAD GGLFE WEK
Sbjct: 139 YGRYKIHSDILTIPHERIWERPFVLAPLIDLLGSDVDTDDVASWHSLAADHGGLFESWEK 198

Query: 199 MGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRS 258
           +GGE L+GKEGMRRVL +GN+LWDWSCKTS+MGVLNLTPDSFSDGGKFQ IEAAVSQVRS
Sbjct: 199 VGGEYLVGKEGMRRVLSVGNSLWDWSCKTSVMGVLNLTPDSFSDGGKFQSIEAAVSQVRS 258

Query: 259 MVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVA 318
           MVSDGADMIDIGAQSTRPMAPMISVEEELDRL+PVLEAVT MPEM GKLISVDTFYS+VA
Sbjct: 259 MVSDGADMIDIGAQSTRPMAPMISVEEELDRLIPVLEAVTRMPEMGGKLISVDTFYSKVA 318

Query: 319 LEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNE 378
           LEAVKRGAHIVNDVSAG LDP+MH+VVA L VPYIAMHMRGDPSTMQNNENLQYDDVCN+
Sbjct: 319 LEAVKRGAHIVNDVSAGNLDPEMHKVVADLNVPYIAMHMRGDPSTMQNNENLQYDDVCNQ 378

Query: 379 IASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSH 438
           IA ELHS+IRDAE SGIPAWRII+DPG+GFSK TKQNLEIL GVPKIR  IARRSLGLSH
Sbjct: 379 IALELHSKIRDAESSGIPAWRIIIDPGVGFSKTTKQNLEILTGVPKIRTAIARRSLGLSH 438

Query: 439 APMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCD 498
           APMLIGPSRKKFLGEVCS+ +AT+RDPATVAAVT+GVLGGANI RVHNVRDNVDAVRLCD
Sbjct: 439 APMLIGPSRKKFLGEVCSRSVATERDPATVAAVTVGVLGGANIVRVHNVRDNVDAVRLCD 498

Query: 499 AMLKEKTS 506
           AM KEK S
Sbjct: 499 AMQKEKKS 506

BLAST of Cp4.1LG01g06560 vs. NCBI nr
Match: gi|778701154|ref|XP_011654974.1| (PREDICTED: folic acid synthesis protein fol1-like isoform X1 [Cucumis sativus])

HSP 1 Score: 887.1 bits (2291), Expect = 1.4e-254
Identity = 436/490 (88.98%), Positives = 465/490 (94.90%), Query Frame = 1

Query: 17  IPALQSSFIH-SSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHA 76
           + ALQ SF+H SSQD V+E+CS+EQEVVIALGSNVGDRLQNFN AL+LMKKAGIHITRHA
Sbjct: 22  VKALQISFLHSSSQDKVVEICSQEQEVVIALGSNVGDRLQNFNEALRLMKKAGIHITRHA 81

Query: 77  CLYETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDI 136
           CLYETAPAYVT+QPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDI
Sbjct: 82  CLYETAPAYVTDQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDI 141

Query: 137 LLYGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELW 196
           LLYGRYK+HSDTLT+PHERIWERPFVLAPLIDLLGSDVDTDDVA WHSLAAD GGLFE W
Sbjct: 142 LLYGRYKVHSDTLTIPHERIWERPFVLAPLIDLLGSDVDTDDVASWHSLAADHGGLFESW 201

Query: 197 EKMGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQV 256
           EK+GGE L+GKEGMRRVL IGN+LWDWSCKTS+MGVLNLTPDSFSDGGKFQ IEAAVSQV
Sbjct: 202 EKVGGEYLVGKEGMRRVLSIGNSLWDWSCKTSVMGVLNLTPDSFSDGGKFQSIEAAVSQV 261

Query: 257 RSMVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSE 316
           RSMVSDGADMIDIGAQSTRPMAPMISVEEELDRL+PVLEAVT MPEMSGKLISVDTFYS+
Sbjct: 262 RSMVSDGADMIDIGAQSTRPMAPMISVEEELDRLIPVLEAVTRMPEMSGKLISVDTFYSK 321

Query: 317 VALEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVC 376
           VALEAVKRGAHIVNDVSAG LDP+MH+VVA L VPYIAMHMRGDPSTMQN ENLQYDDVC
Sbjct: 322 VALEAVKRGAHIVNDVSAGNLDPEMHKVVADLNVPYIAMHMRGDPSTMQNKENLQYDDVC 381

Query: 377 NEIASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGL 436
           N+IA ELHS+IRDAE SGIPAWRII+DPG+GFSK TKQNLEIL G+PKIRA IA+RSLGL
Sbjct: 382 NQIALELHSKIRDAESSGIPAWRIIIDPGVGFSKTTKQNLEILTGIPKIRAAIAKRSLGL 441

Query: 437 SHAPMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRL 496
           SHAPMLIGPSRKKFLGEVCS+ +AT+RDPAT+AAVT+GVLGGANI RVHNVR+NVDAVRL
Sbjct: 442 SHAPMLIGPSRKKFLGEVCSRSVATERDPATIAAVTVGVLGGANIVRVHNVRNNVDAVRL 501

Query: 497 CDAMLKEKTS 506
           CDAM KEK S
Sbjct: 502 CDAMQKEKKS 511

BLAST of Cp4.1LG01g06560 vs. NCBI nr
Match: gi|778701157|ref|XP_011654975.1| (PREDICTED: folic acid synthesis protein fol1-like isoform X2 [Cucumis sativus])

HSP 1 Score: 886.7 bits (2290), Expect = 1.9e-254
Identity = 436/488 (89.34%), Positives = 464/488 (95.08%), Query Frame = 1

Query: 19  ALQSSFIH-SSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACL 78
           ALQ SF+H SSQD V+E+CS+EQEVVIALGSNVGDRLQNFN AL+LMKKAGIHITRHACL
Sbjct: 19  ALQISFLHSSSQDKVVEICSQEQEVVIALGSNVGDRLQNFNEALRLMKKAGIHITRHACL 78

Query: 79  YETAPAYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL 138
           YETAPAYVT+QPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL
Sbjct: 79  YETAPAYVTDQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILL 138

Query: 139 YGRYKIHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEK 198
           YGRYK+HSDTLT+PHERIWERPFVLAPLIDLLGSDVDTDDVA WHSLAAD GGLFE WEK
Sbjct: 139 YGRYKVHSDTLTIPHERIWERPFVLAPLIDLLGSDVDTDDVASWHSLAADHGGLFESWEK 198

Query: 199 MGGESLIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRS 258
           +GGE L+GKEGMRRVL IGN+LWDWSCKTS+MGVLNLTPDSFSDGGKFQ IEAAVSQVRS
Sbjct: 199 VGGEYLVGKEGMRRVLSIGNSLWDWSCKTSVMGVLNLTPDSFSDGGKFQSIEAAVSQVRS 258

Query: 259 MVSDGADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVA 318
           MVSDGADMIDIGAQSTRPMAPMISVEEELDRL+PVLEAVT MPEMSGKLISVDTFYS+VA
Sbjct: 259 MVSDGADMIDIGAQSTRPMAPMISVEEELDRLIPVLEAVTRMPEMSGKLISVDTFYSKVA 318

Query: 319 LEAVKRGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNE 378
           LEAVKRGAHIVNDVSAG LDP+MH+VVA L VPYIAMHMRGDPSTMQN ENLQYDDVCN+
Sbjct: 319 LEAVKRGAHIVNDVSAGNLDPEMHKVVADLNVPYIAMHMRGDPSTMQNKENLQYDDVCNQ 378

Query: 379 IASELHSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSH 438
           IA ELHS+IRDAE SGIPAWRII+DPG+GFSK TKQNLEIL G+PKIRA IA+RSLGLSH
Sbjct: 379 IALELHSKIRDAESSGIPAWRIIIDPGVGFSKTTKQNLEILTGIPKIRAAIAKRSLGLSH 438

Query: 439 APMLIGPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCD 498
           APMLIGPSRKKFLGEVCS+ +AT+RDPAT+AAVT+GVLGGANI RVHNVR+NVDAVRLCD
Sbjct: 439 APMLIGPSRKKFLGEVCSRSVATERDPATIAAVTVGVLGGANIVRVHNVRNNVDAVRLCD 498

Query: 499 AMLKEKTS 506
           AM KEK S
Sbjct: 499 AMQKEKKS 506

BLAST of Cp4.1LG01g06560 vs. NCBI nr
Match: gi|567890739|ref|XP_006437890.1| (hypothetical protein CICLE_v10031260mg [Citrus clementina])

HSP 1 Score: 771.5 bits (1991), Expect = 8.8e-220
Identity = 367/478 (76.78%), Positives = 422/478 (88.28%), Query Frame = 1

Query: 23  SFIHSSQDAVLEVCSREQEVVIALGSNVGDRLQNFNGALQLMKKAGIHITRHACLYETAP 82
           SF HSS +  +EV S+EQEVVIA+GSNVGDRL NFN ALQLMKK G++ITRH CLYET P
Sbjct: 27  SFFHSSPETTVEVQSQEQEVVIAMGSNVGDRLCNFNEALQLMKKLGVNITRHGCLYETEP 86

Query: 83  AYVTNQPQFLNSAVRAVTKLGPHELLSAVKNIEKQLGRTAGIRYGPRPIDLDILLYGRYK 142
           AYVT+QP+FLNSAVR VTKLGPHELL  +K IEK +GRT GIRYGPRPIDLDIL YGR+ 
Sbjct: 87  AYVTDQPRFLNSAVRGVTKLGPHELLGVLKKIEKDMGRTNGIRYGPRPIDLDILFYGRFS 146

Query: 143 IHSDTLTVPHERIWERPFVLAPLIDLLGSDVDTDDVACWHSLAADRGGLFELWEKMGGES 202
           IHSD LTVPHERIWERPFV+APL+DLLGS V++D VACWHSL+    GLFE WEK+GGES
Sbjct: 147 IHSDILTVPHERIWERPFVVAPLLDLLGSSVESDTVACWHSLSQQHNGLFETWEKLGGES 206

Query: 203 LIGKEGMRRVLPIGNNLWDWSCKTSIMGVLNLTPDSFSDGGKFQPIEAAVSQVRSMVSDG 262
           LIGKEGM+RVLPIGN LWDWS KTS+MG+LNLTPDSFSDGGKFQ +EAAVSQVR M+S+G
Sbjct: 207 LIGKEGMKRVLPIGNLLWDWSLKTSVMGILNLTPDSFSDGGKFQSVEAAVSQVRLMISEG 266

Query: 263 ADMIDIGAQSTRPMAPMISVEEELDRLVPVLEAVTGMPEMSGKLISVDTFYSEVALEAVK 322
           ADMIDIGAQSTRPMA  IS E+EL+RL+PVLEAV  MPEM GKL+SVDTFYS+VA EAV 
Sbjct: 267 ADMIDIGAQSTRPMATKISAEKELERLIPVLEAVLTMPEMEGKLVSVDTFYSKVASEAVG 326

Query: 323 RGAHIVNDVSAGQLDPQMHRVVAQLKVPYIAMHMRGDPSTMQNNENLQYDDVCNEIASEL 382
           +GAHI+NDVSAGQLDP M++VVA LKVPY+AMHMRGDPSTMQN ENLQYDDVC ++ASEL
Sbjct: 327 KGAHIINDVSAGQLDPDMYKVVAGLKVPYVAMHMRGDPSTMQNEENLQYDDVCKQVASEL 386

Query: 383 HSRIRDAELSGIPAWRIIVDPGIGFSKNTKQNLEILGGVPKIRAEIARRSLGLSHAPMLI 442
           +S++RDAELSGIPAWRII+DPGIGFSK  + NL+IL G+P IR  IA +SL  SHAP+LI
Sbjct: 387 YSKVRDAELSGIPAWRIIIDPGIGFSKKAEHNLDILLGLPAIRRHIAMKSLAASHAPILI 446

Query: 443 GPSRKKFLGEVCSQPIATKRDPATVAAVTIGVLGGANIARVHNVRDNVDAVRLCDAML 501
           GPSRK+FLGE+C++P A +RDPAT+A++T GVLGGANI RVHNVRDN+DAV+LCD+ML
Sbjct: 447 GPSRKRFLGEICNRPSADERDPATIASITAGVLGGANIVRVHNVRDNLDAVKLCDSML 504

The following BLAST results are available for this feature:
Match Name E-value Identity (%) Description
FOLM_PEA 3.5e-208 70.35 Folate synthesis bifunctional protein, mitochondrial OS=Pisum sativum GN=MitHPPK...
FOLM_ARATH 1.8e-201 68.71 Folate synthesis bifunctional protein, mitochondrial OS=Arabidopsis thaliana GN=...
FOLC_ARATH 1.4e-188 66.88 Folate synthesis bifunctional protein OS=Arabidopsis thaliana GN=CytHPPK/DHPS PE...
FOL1_PNECA 1.6e-83 39.20 Folic acid synthesis protein fol1 OS=Pneumocystis carinii GN=fol1 PE=1 SV=1
FOL1_SCHPO 1.0e-71 37.91 Folic acid synthesis protein fol1 OS=Schizosaccharomyces pombe (strain 972 / ATC...

Match Name E-value Identity (%) Description
A0A0A0KP07_CUCSA 1.3e-254 89.34 Uncharacterized protein OS=Cucumis sativus GN=Csa_5G190480 PE=4 SV=1
V4TD06_9ROSI 6.1e-220 76.78 Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031260mg PE=4 SV=1
A0A061EBB3_THECC 6.1e-220 76.04 Dihydropterin pyrophosphokinase / Dihydropteroate synthase OS=Theobroma cacao GN...
B9HIG0_POPTR 1.8e-219 76.20 Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s09700g PE=4 SV=2
M5XEJ3_PRUPE 3.7e-217 74.95 Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004379mg PE=4 SV=1

Match Name E-value Identity (%) Description
AT4G30000.2 7.0e-199 68.53 Dihydropterin pyrophosphokinase / Dihydropteroate synthase
AT1G69190.1 7.7e-190 66.88 Dihydropterin pyrophosphokinase / Dihydropteroate synthase

Match Name E-value Identity (%) Description
gi|659114046|ref|XP_008456882.1| 4.9e-255 89.59 PREDICTED: folic acid synthesis protein fol1-like isoform X1 [Cucumis melo]
gi|659114050|ref|XP_008456884.1| 6.5e-255 89.96 PREDICTED: folic acid synthesis protein fol1-like isoform X2 [Cucumis melo]
gi|778701154|ref|XP_011654974.1| 1.4e-254 88.98 PREDICTED: folic acid synthesis protein fol1-like isoform X1 [Cucumis sativus]
gi|778701157|ref|XP_011654975.1| 1.9e-254 89.34 PREDICTED: folic acid synthesis protein fol1-like isoform X2 [Cucumis sativus]
gi|567890739|ref|XP_006437890.1| 8.8e-220 76.78 hypothetical protein CICLE_v10031260mg [Citrus clementina]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term Definition
GO:0044237 cellular metabolic process
GO:0009396 folic acid-containing compound biosynthetic process
GO:0042558 pteridine-containing compound metabolic process
Vocabulary: Molecular Function
Term Definition
GO:0004156 dihydropteroate synthase activity
GO:0003848 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine diphosphokinase activity
Vocabulary: INTERPRO
Term Definition
IPR011005 Dihydropteroate_synth-like
IPR006390 DHP_synth
IPR000550 Hppk
IPR000489 Pterin-binding_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0046656 folic acid biosynthetic process
biological_process GO:0016310 phosphorylation
biological_process GO:0044237 cellular metabolic process
biological_process GO:0009396 folic acid-containing compound biosynthetic process
biological_process GO:0042558 pteridine-containing compound metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003848 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine diphosphokinase activity
molecular_function GO:0004156 dihydropteroate synthase activity
molecular_function GO:0016301 kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG01g06560.1 | Cp4.1LG01g06560.1 | mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000489 | Pterin-binding domain | GENE3D | G3DSA:3.20.20.20 | - | coord: 225..501, score: 7.6E
IPR000489 | Pterin-binding domain | PFAM | PF00809 | Pterin_bind | coord: 229..484, score: 2.0
IPR000489 | Pterin-binding domain | PROSITE | PS00792 | DHPS_1 | coord: 228..243
IPR000489 | Pterin-binding domain | PROSITE | PS00793 | DHPS_2 | coord: 262..275
IPR000489 | Pterin-binding domain | PROFILE | PS50972 | PTERIN_BINDING | coord: 226..494, score: 73
IPR000550 | 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokinase, HPPK | GENE3D | G3DSA:3.30.70.560 | - | coord: 41..170, score: 6.4
IPR000550 | 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokinase, HPPK | PFAM | PF01288 | HPPK | coord: 43..168, score: 1.0
IPR000550 | 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokinase, HPPK | TIGRFAMs | TIGR01498 | TIGR01498 | coord: 42..168, score: 1.7
IPR000550 | 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokinase, HPPK | PROSITE | PS00794 | HPPK | coord: 125..136
IPR000550 | 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokinase, HPPK | unknown | SSF55083 | 6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase, HPPK | coord: 42..171, score: 3.4
IPR006390 | Dihydropteroate synthase | TIGRFAMs | TIGR01496 | TIGR01496 | coord: 228..498, score: 3.5
IPR011005 | Dihydropteroate synthase-like | unknown | SSF51717 | Dihydropteroate synthetase-like | coord: 215..501, score: 3.01
None | No IPR available | PANTHER | PTHR20941 | FOLATE SYNTHESIS PROTEINS | coord: 16..504, score: 3.0E
None | No IPR available | PANTHER | PTHR20941:SF1 | FOLIC ACID SYNTHESIS PROTEIN FOL1 | coord: 16..504, score: 3.0E
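
The Pfam coordinates listed above place the HPPK domain (PF01288, 43..168) and the pterin-binding domain (PF00809, 229..484) in separate regions of the protein, which fits the bifunctional HPPK/DHPS annotation. A small Python sketch of that check, assuming the coordinates are 1-based and inclusive (the `Domain` class and the `overlaps` helper are illustrative names only):

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        """Number of residues covered by the domain."""
        return self.end - self.start + 1


def overlaps(a: Domain, b: Domain) -> bool:
    """True if the two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# Pfam coordinates copied from the InterPro annotation table.
hppk = Domain("PF01288 HPPK", 43, 168)
pterin = Domain("PF00809 Pterin_bind", 229, 484)

for d in (hppk, pterin):
    print(d.name, d.length)
print("overlap:", overlaps(hppk, pterin))
```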

The following gene(s) are orthologous to this gene:
Gene | Orthologue | Organism | Block
Cp4.1LG01g06560 | CmaCh04G000250 | Cucurbita maxima (Rimu) | cmacpeB720
Cp4.1LG01g06560 | CmoCh04G000250 | Cucurbita moschata (Rifu) | cmocpeB673
Cp4.1LG01g06560 | Lsi11G000160 | Bottle gourd (USVL1VR-Ls) | cpelsiB320
Cp4.1LG01g06560 | Carg21982 | Silver-seed gourd | carcpeB0258
The following gene(s) are paralogous to this gene:

None