Cp4.1LG14g06540 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g06540
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Folylpolyglutamate synthase
Location: Cp4.1LG14 : 321551 .. 327580 (-)
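
The location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its convention). The helper names `parse_location` and `span_length` are hypothetical and are not part of any database API.

```python
import re

# Matches "SEQID : START .. END (STRAND)", e.g. "Cp4.1LG14 : 321551 .. 327580 (-)"
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(location):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    _, start, end, _ = location
    return end - start + 1


loc = parse_location("Cp4.1LG14 : 321551 .. 327580 (-)")
print(loc, span_length(loc))
```

Under the inclusive-coordinate assumption, the span covers 6030 bases on the minus strand of Cp4.1LG14.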
The following sequences are available for this feature:

Gene sequence (with intron)

TTAGCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGGTAGTTTACCCTAGAAAAACTTAATCTTTATGCTATTGTCGTAGTCGAGGAGGATTGTCGCAAGTCCAAGAGTTCCAACATTTTCGCTCAATCAGCTATTCTTCAAGCTTGCAAGTCGCATTTCCTACCATTTTTTTACTCAAGAAAGAATGCCCATTTCACAATGAATCTCTTCAAATATCATCATCACTTCCGCCCGCAGATTCACGGAAGGCTTCTCCTAAATTATTTCGTTGGTGAAGGGCCTTCTATCAGTAGCCGAATTGGGTCCAAACAATGCTTCGGCAATCACTCAGAAGATCAACAGATGACGGAATTCATGGAATACTTAGATTCCCTGAAGAACTACGAGAAGTTGGGTGTGCCCACAGGCGCGGGGACGGATTCTGAGGATGGGTTCGATCTCGGAAGGATGAAAAGACTGATGGAGCGCCTGGGTAATCCACAATCCAAGTTTAAGGTTGGTTATTTATGATGATCAACTCTTACTTTTCTTAGTTTTTACTAAATTTTGAAGTTTTGGACCGACCCATTTCCTCTGCAATTCACTAAATTTAGTGTTTTATTCATATTAATTCTCTCCTCACTTGTGATGGATATCAATACCAATCGATGGGAAATGACACTACCAATGTTTGAACATTGCATATGAAAAGTTTAAGCTGTTAGTGACTGATGGTAGGACAATTATAGGCAATTCACATTGCGGGAACCAAGGGAAAAGGGTCGACTGCGGCCTTCCTGTCTAGCATTTTACGAGCAGAGGGATACTCTGTTGGTTGTTATACTAGGTACAGCCTTGTTGCAACACTTATCAAAATTTTCGTATACTCCTACTTGGAAGTTATTTGAGAAGGCATAACTTTGTGCTTGTGAATTTCTTCATCTGCAGTCCTCATATTGAAACTATAAGGGAACGTATTTCACTCGGAAGATCTGGAGAGATGGTCTCTGGAAAGGCACTAAATTTTCTTTTCAAAAGGAACAAAGAGTTACTTGACCAATCGGTAAAACTTGAAAATGGACGTATAAGTCACTTCGAGGTATTATATCATCCAGTCTGTTGTTGTGATTGAAAATTCTATAATAACTCAAATGGCTTTTATGCTGCTTCTGATTCTTCTAGTTGATTTGATTAAGCAAAATGCTGTTCCAAGTCTAGATTTTCACTCTGTAAATTTGTTTTTCTCGTGCTCATTTATTTCACACTGGATATGTACTTCATACATGTTTGATATATCCTGCATCTTTTTGGGTTTGGTTTCATCTAGGAAGATATAAATTGTTCATGTTAATGATAAGCCTGGACCCCTCATTGGACCCATGCAGTCATTCATGTTTTCTTCGATCTTTTAGACGAGCTGATGAAATTCTCCCAGGTCCTTACTGCTATGGCATTTTCACTCTTTGCTCAAGAGAATGTTGATGTTGCAGTCATTGAGGTTCACTTTCTTTTGGACTAATTTTTCTTTGCCAAGTATTTTCGTATGTCTTAAGAAAATCGACAATATGTTCTTATTTAAACTGAAACAGGCTGGACTTGGTGGTGCACGAGATGCAACAAACATAATTTGTAGCTCTAGACTTGCTGCTGCAGTCATAACTTCAATTGGAGAGGAACACATGGCTGCTCTGGGTGGCTCTTTGGAAAGCATTGCAATGGCAAAGGCTGGAATAATTAAACATGGTTGTCCGGTTAGGTTGTTTTCTTGATGTAATTTTCATGCTGATTGACTTTGTTCAGCCTTTGATATTCATTATCCTTGCATTAGTTAAGTTTTTCCTTTTCCAATGAGGTGACATGACCTACAAGAAGAGAAGAGGGGACTTAAAGTCAGCCTGAACTTACTTAAGACTATGGAGATTACAAAGATGTAATCTAATTAGCCTTAATTATTGTTTAAAATAATAATTGTTGTTGATAGAATAAGTTTGCAATTTTGT
TGCAATGAACTTTAAGCATACGAATCCTGAACTACATTTTGTGTGAGATTCCACATCGATTGGAGAGAGGAACGAGTGTCTGCGAGGACGTTGGACCCTAAAGGGGGTAGACTGTGAGATCATGCATTGATTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCCCCTAGCAGACGCGTTTTATAATGTTGAGGGGAAGCTCGGAAGGGAAAGTCCAAATAGGACAACATCTGCTAGCGGTGGGCTTGGACTGTACGAATGATATCAAAGCCAGACCCCAGGCAGTGTGCCAATTAGGACGCTGGGCCCCGAAGGGGGTAAACACAGGGCGGTGTGCCAATGATGACGTTGGCCCCGAAGGGGGGGGGGGGGGGAATGTNGCGTGGATGTTGGACCCCGAAGGAGGTGGATTGTGAGATCCTACATCGGTTGGGGAGGAGAACGAAACATTATTTATAAGGGTGTGGAAACCTCCTCCTAGCAGACAATATGCCAACGAGGATGCTGGGCCCCCAAAGGGATGAACATCGGGCGGTGTGCTAGTGAGGACACTGGCCCCGAAAGGGGGTGGATTGTGAGATCCCATATCGATTGGAGAGAGGAACGAGTGCCAGTGAGGACGCTGGGCCCCGAAGGGGGTGAATTGTGAGATCCTACATCGGTTAGGGAGGAGAACAAAACATTCTTTATAAGGGTGTGGAAACCTCTTCTTAGCAAAACGCGTTTTAAAACCTTGAGGGGAAGCTTGGAAGGGAAAGCCCAAAAAGGACAATATCTGCTAGAGGTGGGCTTGGGCTATTATGAATGGTAAGAGCCAGACACCAGGCAATGTGTCAACGAGGGCGCTAGGCCCAAAGAAGGTGAACACTAGGCGGTGTGCCAGTGAGGATGCTGGCCCCGAAGAGGAGTGGATTGTGGGATCCCACATCAATTGGAGAGAGGAACAAGTGCCAGCGAGGATGCTAGACCCTGAAGGGGGTGGATTGTGAGATCCTACATTGGCTAGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCCTCCTAGCAGTCGCGTTTCAAAACCTTGAGGGAGGCCCGAAAGGGAAAGCCCAAAGAAGACAATATCTGCTAGCAATGGGCTCGAGCTATTACAAATGGTATCAGAGCCAGACACCAGGCGATGTGCCCCACGAGGGCGCTAGACTCCGAAGGGGGTGAACCAGTGGTGTGCTAGTGAGGACGATGGCCCCGAAGGAGGTGAGCACTGGGCGGTATGCCAGTGAGGACGTTGGCCTCGAAGGGGGGTGGATTGTGAGATCCCACATTGATTAGAGAGAGGAACGAGTGCCAGCGAGGACACTGGACCCCGAAGGAGGTGGATTGTGACATCCTACATCAGTTGGGGAGGAGAACGAAACATTCTTTACAAAGGTGTAAACCTCCTCCTAGCATACGTGTTTTAAAACCTTGAGGGAAAGCCAAAAGTGGACAATATCTGTTAGTGGTGGGCGTGGGCTATTACAAATGGTATCAGAGTCAAACACCAAGCGATGTTCCAACGAGGGCGCTAGGCCCCAAAGTGGGTGAACACTAAGTGGTGTGCCAGTGAGTAGGCTAGCCCCAAAGGGGGGTGGATTGTGAGATCCCACATCGATTAGAGAGAGGAATGAGTGCTAGCGTGGACGCTAGACCCGAAGGGAGTGGGTTGTGAGATCCTACATTGGTTGGGGAGGAGAACAAAACATTCTTTATAAGGGTGTGTAAAATTTCTCCTAGCAGACGCATTTTAAAACCTTGAGGAGAAGCTCGGAAGGGAGAGCACAAAAAGGCCAACATCTGCTGCCGTGATCTTGGGCTGTTAAAGGCTGTTTGTCAGATTCTATTGCCGTATAGAGGGGTATCAAATCAATTTAGTTCTGTAAATAACCTAAAAGCAACCTGCATGAAAAGTTCTTAGAGGCTTTCATAGTTCATTGCCTTTTATATGGTCTCCAAGGGGCA
TGAAATTCTCTCCCATAGTTGTTGTCGCAAGTTTTTGTGTTCGTGTGTCTGTCTGTGTGAGAGAAACTTATTCTCCAATTCCTATTGTAATAAGGTTGTCTTTTTTATTTGATCTGATTTCTCTTCTGAAATTTATTCCTAAAATACTCCTCAAATTCAATGTCAGACCATTCTGGGTGGTCCTTTCATTCCAAATATTGAGTGCATTCTTCGCGACAAAGCATTGTCCATGTCTTCGCCTGTAGTATCAGCCTCAGATCCTGGAAATAGAAGTACCATAAAAGGTGTAAGCCTGCTCAATGGAAGACTTTGCCAATGCTGCGACCTAATAATCCAAACAGACAATGAAGTATGACCCTTATATTCTATTAAACGAATATAAATTTGACTCCTTGAACGTCTAACAAGAGTTCCTTTCTCTCATTGTCAATGCAGTTCATTGAGTTATTTGATGTCAACCTCCGCATGCTTGGACGTCATCAACTTCAGAATGCAGCAACTGCAACTTGTGTTATTCTTACTCTCCGCAATTTAGGTAATATCATTTCGTACATTTGACTTGATATCCGTACCATCATTTAGGGAATAAACATGCGATGTGTGATGATTTCCCTGATTCATTTACTGAAGATAATAGCTGCTGTGTGTTGCATCTACACTTTCTTTTTGAAGTTTCTTAACAGTTTGTTCCCTTTGCATGTGAAACCTCTGCAGATAAGATACTTATGATTGCTTGCTTTTAGAGGATATGTGATGAAAGTTACAGTATAGTTCATTTACCCGTATACTTCCTTTTAATATTTTCCTATTGTGATGAATAATTCAGGTTGGAGAATTTCAGATGCATCTATTAGGAGTGGACTAGAGAAAACATTCTTGGTCGGTAGGAGTCATTTCTTATCAGCCAAGGAAGCTGGGATGCTTGGACTACCTGGAACAACAATATTGCTTGATGGAGGTGCATTCATACTTCTGTTTACTGACAAGTTTTGTCTGTTGGGATGGAGGTGCAGTCTTTTTAATTGGTTTCGTTAACTGTTCTCTCAGCCCACACCAAAGACTCTGCTAAAGCATTAGTGGACACCATTCAAATGAGTTTTCCCGATGCTCAATTGGCTCTCGTGGTTGCAATGGCTAGTGATAAGGATCACAATGGTTTTGCCACAGAATTTCTTCAAGGTATAGCCTTCCCCTTCGAAGAGTTGGTCACTCATTTTCTTTGTCACAGAGTTTGAAACTGAAGATGTATATTGCTTGATTGTAAATGATGGCAGTTTAAACGAAACAATCAGTTTTTGTAGAGATTCTGTTAAACAAACATTCCATATGTTATGCACTGTCATGAAATCTTGGTTGTTATATCTGTCTTTTCTGATGTTGTGAGATGTAGCGATGTAATGCTGACAGTCAGCAGTTGACTGAATTTCAGGTGGAAAATTGGAGTCCATTGTCTTAAGTGAAGCCAATATTGGTGGAGGCAAATCAAGGACAACTTCAGCTGCTCTTCTAAGGGACTGCTGGATCCAAGCATCCAATGAAATGGGGATCCCTATTTCTCTGGAAACTAAAGACGCACCAGTTTCCTCGACAAGCAAGCTAGAAAACAGACCGGTATTGACTACAGAGACCTCGTTATTACGTGCCATAAAAATCGCTGCTGAGATTCTCAAGCAGAGAATCGAAGGGCGGCGAGGCCTTGTCGTAGTAACTGGCTCCCTGCATGCTGTTTCAATGGTGTTATCTTCTCTTCATTCTTGAAGCCTTTTACTTATGAAAGCACTTCATTGGATGAAGTTTAAGCAAAGTTCCTTAGATTTGCAGATTATTTTGGTAGTATGTTTGGTATGAATTATTTGAGGTATTTTCTGTCATACTTTTTTGAGAAGAATAAATATGTTTATAAAAGTATAAATGCTGTTGTTTAATTACCCTATGCTTTCAGAACTTCAAAGCTATTCTGAACGCTATGTTGACATTCCAATTGCTGTGTGCAGAGCCAT
TAGTGAACCATAAAGAGCATTCTGAAAAAG

mRNA sequence

TTAGCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGGTAGTTTACCCTAGAAAAACTTAATCTTTATGCTATTGTCGTAGTCGAGGAGGATTGTCGCAAGTCCAAGAGTTCCAACATTTTCGCTCAATCAGCTATTCTTCAAGCTTGCAAGTCGCATTTCCTACCATTTTTTTACTCAAGAAAGAATGCCCATTTCACAATGAATCTCTTCAAATATCATCATCACTTCCGCCCGCAGATTCACGGAAGGCTTCTCCTAAATTATTTCGTTGGTGAAGGGCCTTCTATCAGTAGCCGAATTGGGTCCAAACAATGCTTCGGCAATCACTCAGAAGATCAACAGATGACGGAATTCATGGAATACTTAGATTCCCTGAAGAACTACGAGAAGTTGGGTGTGCCCACAGGCGCGGGGACGGATTCTGAGGATGGGTTCGATCTCGGAAGGATGAAAAGACTGATGGAGCGCCTGGGTAATCCACAATCCAAGTTTAAGGCAATTCACATTGCGGGAACCAAGGGAAAAGGGTCGACTGCGGCCTTCCTGTCTAGCATTTTACGAGCAGAGGGATACTCTGTTGGTTGTTATACTAGTCCTCATATTGAAACTATAAGGGAACGTATTTCACTCGGAAGATCTGGAGAGATGGTCTCTGGAAAGGCACTAAATTTTCTTTTCAAAAGGAACAAAGAGTTACTTGACCAATCGGTAAAACTTGAAAATGGACGTATAAGTCACTTCGAGGTCCTTACTGCTATGGCATTTTCACTCTTTGCTCAAGAGAATGTTGATGTTGCAGTCATTGAGGCTGGACTTGGTGGTGCACGAGATGCAACAAACATAATTTGTAGCTCTAGACTTGCTGCTGCAGTCATAACTTCAATTGGAGAGGAACACATGGCTGCTCTGGGTGGCTCTTTGGAAAGCATTGCAATGGCAAAGGCTGGAATAATTAAACATGGTTGTCCGACCATTCTGGGTGGTCCTTTCATTCCAAATATTGAGTGCATTCTTCGCGACAAAGCATTGTCCATGTCTTCGCCTGTAGTATCAGCCTCAGATCCTGGAAATAGAAGTACCATAAAAGGTGTAAGCCTGCTCAATGGAAGACTTTGCCAATGCTGCGACCTAATAATCCAAACAGACAATGAATTCATTGAGTTATTTGATGTCAACCTCCGCATGCTTGGACGTCATCAACTTCAGAATGCAGCAACTGCAACTTGTGTTATTCTTACTCTCCGCAATTTAGGTTGGAGAATTTCAGATGCATCTATTAGGAGTGGACTAGAGAAAACATTCTTGGTCGGTAGGAGTCATTTCTTATCAGCCAAGGAAGCTGGGATGCTTGGACTACCTGGAACAACAATATTGCTTGATGGAGCCCACACCAAAGACTCTGCTAAAGCATTAGTGGACACCATTCAAATGAGTTTTCCCGATGCTCAATTGGCTCTCGTGGTTGCAATGGCTAGTGATAAGGATCACAATGGTTTTGCCACAGAATTTCTTCAAGGTGGAAAATTGGAGTCCATTGTCTTAAGTGAAGCCAATATTGGTGGAGGCAAATCAAGGACAACTTCAGCTGCTCTTCTAAGGGACTGCTGGATCCAAGCATCCAATGAAATGGGGATCCCTATTTCTCTGGAAACTAAAGACGCACCAGTTTCCTCGACAAGCAAGCTAGAAAACAGACCGGTATTGACTACAGAGACCTCGTTATTACGTGCCATAAAAATCGCTGCTGAGATTCTCAAGCAGAGAATCGAAGGGCGGCGAGGCCTTGTCGTAGTAACTGGCTCCCTGCATGCTGTTTCAATGGTGTTATCTTCTCTTCATTCTTGAAGCCTTTTACTTATGAAAGCACTTCATTGGATGAAGTTTAAGCAAAGTTCCTTAGATTTGCAGATTATTTTGGTAGTATGTTTGGTATGAATTATTTGAGGTATTTTCTGTCATACTTTTTTGAGAAGAATAAA
TATGTTTATAAAAGTATAAATGCTGTTGTTTAATTACCCTATGCTTTCAGAACTTCAAAGCTATTCTGAACGCTATGTTGACATTCCAATTGCTGTGTGCAGAGCCATTAGTGAACCATAAAGAGCATTCTGAAAAAG

Coding sequence (CDS)

ATGAATCTCTTCAAATATCATCATCACTTCCGCCCGCAGATTCACGGAAGGCTTCTCCTAAATTATTTCGTTGGTGAAGGGCCTTCTATCAGTAGCCGAATTGGGTCCAAACAATGCTTCGGCAATCACTCAGAAGATCAACAGATGACGGAATTCATGGAATACTTAGATTCCCTGAAGAACTACGAGAAGTTGGGTGTGCCCACAGGCGCGGGGACGGATTCTGAGGATGGGTTCGATCTCGGAAGGATGAAAAGACTGATGGAGCGCCTGGGTAATCCACAATCCAAGTTTAAGGCAATTCACATTGCGGGAACCAAGGGAAAAGGGTCGACTGCGGCCTTCCTGTCTAGCATTTTACGAGCAGAGGGATACTCTGTTGGTTGTTATACTAGTCCTCATATTGAAACTATAAGGGAACGTATTTCACTCGGAAGATCTGGAGAGATGGTCTCTGGAAAGGCACTAAATTTTCTTTTCAAAAGGAACAAAGAGTTACTTGACCAATCGGTAAAACTTGAAAATGGACGTATAAGTCACTTCGAGGTCCTTACTGCTATGGCATTTTCACTCTTTGCTCAAGAGAATGTTGATGTTGCAGTCATTGAGGCTGGACTTGGTGGTGCACGAGATGCAACAAACATAATTTGTAGCTCTAGACTTGCTGCTGCAGTCATAACTTCAATTGGAGAGGAACACATGGCTGCTCTGGGTGGCTCTTTGGAAAGCATTGCAATGGCAAAGGCTGGAATAATTAAACATGGTTGTCCGACCATTCTGGGTGGTCCTTTCATTCCAAATATTGAGTGCATTCTTCGCGACAAAGCATTGTCCATGTCTTCGCCTGTAGTATCAGCCTCAGATCCTGGAAATAGAAGTACCATAAAAGGTGTAAGCCTGCTCAATGGAAGACTTTGCCAATGCTGCGACCTAATAATCCAAACAGACAATGAATTCATTGAGTTATTTGATGTCAACCTCCGCATGCTTGGACGTCATCAACTTCAGAATGCAGCAACTGCAACTTGTGTTATTCTTACTCTCCGCAATTTAGGTTGGAGAATTTCAGATGCATCTATTAGGAGTGGACTAGAGAAAACATTCTTGGTCGGTAGGAGTCATTTCTTATCAGCCAAGGAAGCTGGGATGCTTGGACTACCTGGAACAACAATATTGCTTGATGGAGCCCACACCAAAGACTCTGCTAAAGCATTAGTGGACACCATTCAAATGAGTTTTCCCGATGCTCAATTGGCTCTCGTGGTTGCAATGGCTAGTGATAAGGATCACAATGGTTTTGCCACAGAATTTCTTCAAGGTGGAAAATTGGAGTCCATTGTCTTAAGTGAAGCCAATATTGGTGGAGGCAAATCAAGGACAACTTCAGCTGCTCTTCTAAGGGACTGCTGGATCCAAGCATCCAATGAAATGGGGATCCCTATTTCTCTGGAAACTAAAGACGCACCAGTTTCCTCGACAAGCAAGCTAGAAAACAGACCGGTATTGACTACAGAGACCTCGTTATTACGTGCCATAAAAATCGCTGCTGAGATTCTCAAGCAGAGAATCGAAGGGCGGCGAGGCCTTGTCGTAGTAACTGGCTCCCTGCATGCTGTTTCAATGGTGTTATCTTCTCTTCATTCTTGA

Protein sequence

MNLFKYHHHFRPQIHGRLLLNYFVGEGPSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKSRTTSAALLRDCWIQASNEMGIPISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVSMVLSSLHS
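
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch that assumes the standard genetic code; the function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not the database's own method. Only the first 48 bases of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the coding sequence listed above.
cds_prefix = "ATGAATCTCTTCAAATATCATCATCACTTCCGCCCGCAGATTCACGGA"
print(translate(cds_prefix))
```

The printed peptide should agree with the first 16 residues of the protein sequence (MNLFKYHHHFRPQIHG).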
BLAST of Cp4.1LG14g06540 vs. Swiss-Prot
Match: DHFS_ARATH (Dihydrofolate synthetase OS=Arabidopsis thaliana GN=DHFS PE=1 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 6.7e-157
Identity = 305/509 (59.92%), Positives = 383/509 (75.25%), Query Frame = 1

Query: 44  SEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHI 103
           +ED ++ +F+ +L+SLKNYEK GVP GAGTDS+DGFDLGRMKRLM RL NP  K+K +H+
Sbjct: 40  TEDPELRDFVGFLESLKNYEKSGVPKGAGTDSDDGFDLGRMKRLMLRLRNPHYKYKVVHV 99

Query: 104 AGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRN 163
           AGTKGKGST+AFLS+ILRA GYSVGCY+SPHI +I+ERIS   +GE VS   LN LF   
Sbjct: 100 AGTKGKGSTSAFLSNILRAGGYSVGCYSSPHILSIKERISC--NGEPVSASTLNDLFYSV 159

Query: 164 KELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAA 223
           K +L+QS++ ENG +SHFE+LT +AFSLF +ENVD+AVIEAGLGGARDATN+I SS LAA
Sbjct: 160 KPILEQSIQEENGSLSHFEILTGIAFSLFEKENVDIAVIEAGLGGARDATNVIESSNLAA 219

Query: 224 AVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPV 283
           +VIT+IGEEHMAALGGSLESIA AK+GIIKHG P +LGGPF+P+IE ILR KA S+SS V
Sbjct: 220 SVITTIGEEHMAALGGSLESIAEAKSGIIKHGRPVVLGGPFLPHIEGILRSKAASVSSSV 279

Query: 284 VSASDPGNRSTIKGVSLLNG-RLCQCCDLIIQT---DNEFIELFDVNLRMLGRHQLQNAA 343
           + AS+ G+ S+IKG+   NG  LCQ CD++IQ    D   +EL DVNLRMLG HQLQNA 
Sbjct: 280 ILASNIGSSSSIKGIINKNGIGLCQSCDIVIQNEKDDQPIVELSDVNLRMLGHHQLQNAV 339

Query: 344 TATCVILTLRNLG-WRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHT 403
           TATCV L LR+ G  R++D +IR GLE T L+GRS FL+ KEA  L LPG T+LLDGAHT
Sbjct: 340 TATCVSLCLRDQGCGRVTDEAIRIGLENTRLLGRSQFLTPKEAETLLLPGATVLLDGAHT 399

Query: 404 KDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKS 463
           K+SA+AL + I+  FP+ +L  VVAMASDKDH  FA E L G K E+++L+EA+IGGGK 
Sbjct: 400 KESARALKEMIKKDFPEKRLVFVVAMASDKDHVSFAKELLSGLKPEAVILTEADIGGGKI 459

Query: 464 RTTSAALLRDCWIQASNEMGIPISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEI 523
           R+T +++L++ WI+A++E+G             S    EN+ V       L ++K+A +I
Sbjct: 460 RSTESSVLKESWIKAADELG-----------SRSMEASENKTV-------LGSLKLAYKI 519

Query: 524 LK-QRIEGRRGLVVVTGSLHAVSMVLSSL 547
           L         G+V+VTGSLH VS VL+SL
Sbjct: 520 LSDDTTSSDSGMVIVTGSLHIVSSVLASL 528
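
The percentages on each HSP summary line follow directly from the residue counts: identical (or positive-scoring) positions divided by the alignment length. A minimal sketch for recomputing them is shown below; the function name `hsp_percentages` is hypothetical, and the pattern is deliberately tolerant of spelling variants of the "Positives" label.

```python
import re

# Captures the counts from "Identity = a/b (...), Positives = c/d (...)".
HSP_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


# Summary line of the DHFS_ARATH hit reported above.
line = "Identity = 305/509 (59.92%), Positives = 383/509 (75.25%), Query Frame = 1"
print(hsp_percentages(line))
```

The same function can be applied to the summary line of any other hit in this report to confirm that the stated percentages match the counts.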

BLAST of Cp4.1LG14g06540 vs. Swiss-Prot
Match: FOLC_BACSU (Folylpolyglutamate synthase OS=Bacillus subtilis (strain 168) GN=folC PE=3 SV=2)

HSP 1 Score: 184.5 bits (467), Expect = 3.1e-45
Identity = 125/354 (35.31%), Positives = 190/354 (53.67%), Query Frame = 1

Query: 81  LGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRE 140
           LGRMK+LM RLG+P+ K +A H+AGT GKGST AF+ S+L+  GY+VG +TSP+I T  E
Sbjct: 24  LGRMKQLMARLGHPEKKIRAFHVAGTNGKGSTVAFIRSMLQEAGYTVGTFTSPYIITFNE 83

Query: 141 RISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQ-ENVDV 200
           RIS+  +G  +S +    L  + K  ++   + E G+ + FE++TA AF  FA+   VD 
Sbjct: 84  RISV--NGIPISDEEWTALVNQMKPHVEALDQTEYGQPTEFEIMTACAFLYFAEFHKVDF 143

Query: 201 AVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTI 260
            + E GLGG  D+TN++        VITSIG +HM  LG ++E IA  KAGIIK G P I
Sbjct: 144 VIFETGLGGRFDSTNVV---EPLLTVITSIGHDHMNILGNTIEEIAGEKAGIIKEGIP-I 203

Query: 261 LGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCD-LIIQTDNE 320
           +     P    ++R +A   ++P  S  D           + N       +    +T+ +
Sbjct: 204 VTAVTQPEALQVIRHEAERHAAPFQSLHD--------ACVIFNEEALPAGEQFSFKTEEK 263

Query: 321 FIELFDVNLRMLGRHQLQNAATATCVI--LTLRNLGWRISDASIRSGLEKTFLVGRSHFL 380
             E  D+   ++G HQ QNAA +      L   N+   ISD ++RSGL K    GR   +
Sbjct: 264 CYE--DIRTSLIGTHQRQNAALSILAAEWLNKENIA-HISDEALRSGLVKAAWPGRLELV 323

Query: 381 SAKEAGMLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDH 431
                         + LDGAH ++  + L +T++  F ++++++V +   DK +
Sbjct: 324 QEH---------PPVYLDGAHNEEGVEKLAETMKQRFANSRISVVFSALKDKPY 351

BLAST of Cp4.1LG14g06540 vs. Swiss-Prot
Match: FOLCP_HALVD (Probable bifunctional folylpolyglutamate synthase/dihydropteroate synthase OS=Haloferax volcanii (strain ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB 2012 / VKM B-1768 / DS2) GN=folCP PE=3 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 1.5e-31
Identity = 112/375 (29.87%), Positives = 175/375 (46.67%), Query Frame = 1

Query: 84  MKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERIS 143
           ++RL+  LG+P      + +AG+ GKGSTA  + ++LR  G  VG YTSPH + +RER+ 
Sbjct: 25  IRRLLSHLGDPHEGVSFVQVAGSNGKGSTARMVDAMLRESGAHVGLYTSPHFDDVRERVR 84

Query: 144 LGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIE 203
           +   G  +   AL+      K  L +    +   ++ FE +TA+A   F +  VDVAV+E
Sbjct: 85  V--DGRKIPKSALSAFVAEAKPYLVERA-ADGEPLTFFETVTALALWYFDRAGVDVAVLE 144

Query: 204 AGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGP 263
            G+GG  DAT+ +      A+ +T++  EH A LG ++  IA  KA +     P + G  
Sbjct: 145 VGMGGELDATSAVDP---VASAVTNVSLEHTAVLGDTVAEIAKTKAAVAPADAPLVTG-- 204

Query: 264 FIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSLLNGRL-CQCCDLIIQTDNEFIEL 323
                  ++R++A      V++  D    S         GR+  Q   + ++TD    E 
Sbjct: 205 TTGEALSVIREEA----GDVLTVGDADADSDADVRVSYGGRVNHQEAAVTVETD---AET 264

Query: 324 FDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAG 383
            DV + +LG +Q +NA  A  +   +R     I   +I  GL      GR   +  +   
Sbjct: 265 LDVRIPLLGAYQARNAGIAVSLARQVRP---DIDAEAIHRGLRNAHWPGRFEVMGTE--- 324

Query: 384 MLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGK 443
                  T++LDGAH  D A A V T+   F    L LV     DKDH           +
Sbjct: 325 ------PTVVLDGAHNPD-ACAQVATVLDEFDYDDLHLVYGAMHDKDHGEMVGAL---PE 368

Query: 444 LESIVLSEANIGGGK 458
           + S+V  +A+I  G+
Sbjct: 385 VASVVTCKADISRGE 368

BLAST of Cp4.1LG14g06540 vs. Swiss-Prot
Match: FOLC_LACCA (Folylpolyglutamate synthase OS=Lactobacillus casei GN=fgs PE=1 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 3.4e-31
Identity = 107/388 (27.58%), Positives = 174/388 (44.85%), Query Frame = 1

Query: 50  TEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGK 109
           TE + Y+ S     K G             D  R+  L+  LGNPQ + + IH+ GT GK
Sbjct: 4   TETVAYIHSFPRLAKTG-------------DHRRILTLLHALGNPQQQGRYIHVTGTNGK 63

Query: 110 GSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRS--GEMVSGKALNFLFKRNKELL 169
           GS A  ++ +L A G +VG YTSP I    ERI +      +     A+ F+    + L 
Sbjct: 64  GSAANAIAHVLEASGLTVGLYTSPFIMRFNERIMIDHEPIPDAALVNAVAFVRAALERLQ 123

Query: 170 DQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVIT 229
            Q        ++ FE +TA+ +  F Q  VDVAVIE G+GG  D+TN+I       +V+T
Sbjct: 124 QQQADF---NVTEFEFITALGYWYFRQRQVDVAVIEVGIGGDTDSTNVITP---VVSVLT 183

Query: 230 SIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSAS 289
            +  +H   LG ++ +IA  KAGIIK G P + G   +P+   ++  K  +  S  +   
Sbjct: 184 EVALDHQKLLGHTITAIAKHKAGIIKRGIPVVTGN-LVPDAAAVVAAKVATTGSQWLRFD 243

Query: 290 DPGNRSTIKGVSLLNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATA-TCVIL 349
                   +  S+   +L          D +   + D+ + ++G +Q +N A A     +
Sbjct: 244 --------RDFSVPKAKLHGWGQRFTYEDQDG-RISDLEVPLVGDYQQRNMAIAIQTAKV 303

Query: 350 TLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHTKDSAKALV 409
             +   W ++  +IR GL  +    R   +S             I++DGAH  D    L+
Sbjct: 304 YAKQTEWPLTPQNIRQGLAASHWPARLEKIS---------DTPLIVIDGAHNPDGINGLI 352

Query: 410 DTIQMSFPDAQLALVVAMASDKDHNGFA 435
             ++  F    + ++  + +DKD+   A
Sbjct: 364 TALKQLF-SQPITVIAGILADKDYAAMA 352

BLAST of Cp4.1LG14g06540 vs. Swiss-Prot
Match: FOLD_SCHPO (Probable dihydrofolate synthetase OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=fol3 PE=3 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 1.3e-30
Identity = 120/364 (32.97%), Positives = 175/364 (48.08%), Query Frame = 1

Query: 81  LGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRE 140
           L RM +L++ LGNPQ  F A+ IAGT GKGS  +++ + L       G YTSPH    R+
Sbjct: 7   LQRMLQLLKHLGNPQESFCAVQIAGTNGKGSICSYIYTSLLQAAIKTGRYTSPHFLEPRD 66

Query: 141 RISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVA 200
            IS+  +G++ S +  N  +K+  E +D+  +    + + FE+LTA AF  F    V VA
Sbjct: 67  TISI--NGQIASEEIFNTCWKQVIE-VDRRFRT---KATEFELLTATAFQCFHHSGVRVA 126

Query: 201 VIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTIL 260
           VIE G+GG  DATN+        ++I+ I  +H A LG +LE+IA  KAGI K   P ++
Sbjct: 127 VIETGMGGRLDATNVF--EEPVLSIISRICLDHQAFLGNTLEAIAKEKAGIFKKNVPCVV 186

Query: 261 GGPFIPNIECILRDKA-LSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNEF 320
            G    N+   L+  A  + + P   A         KG S  N       + II T N  
Sbjct: 187 DGLNEVNVLNQLKLSAEETRAHPFYLA---------KGKSGENKN-----EWIINTPNWG 246

Query: 321 IELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAK 380
              F   L+  G +Q QN A A    L + +  + I    +++G++ T   GR    S  
Sbjct: 247 TNTFSTPLK--GDYQGQNLACAV-TALDILSSSFSIMLPHVQNGVKNTSWPGRLDIRSVP 306

Query: 381 EAGMLGLPGTTILLDGAHTKDSAKALVDTI--QMSFPDAQLALVVAMASDKDHNGFATEF 440
             G        IL DGAH K++A  L   +  Q    +  ++ VVA  + KD  G     
Sbjct: 307 SLG-------DILFDGAHNKEAAIELAKFVNSQRREHNKSVSWVVAFTNTKDVTGIMKIL 338

Query: 441 LQGG 442
           L+ G
Sbjct: 367 LRKG 338

BLAST of Cp4.1LG14g06540 vs. TrEMBL
Match: A0A0A0L7A4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G354540 PE=4 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 1.3e-268
Identity = 477/548 (87.04%), Positives = 513/548 (93.61%), Query Frame = 1

Query: 1   MNLFKYHHHFRPQIHGRLLLNYFVGEGPSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLK 60
           MN+FKYHHH RPQI G LLLN+F+GE PSISSR GSKQCF  HSEDQ MT+FMEYLDSLK
Sbjct: 1   MNVFKYHHHCRPQILGSLLLNHFIGECPSISSRFGSKQCFATHSEDQHMTQFMEYLDSLK 60

Query: 61  NYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSIL 120
           NYEKLGVP G+GTDS+DGFDLGRM+RLMERLGNPQS+FKAIHIAGTKGKGSTAAFLS+IL
Sbjct: 61  NYEKLGVPRGSGTDSDDGFDLGRMRRLMERLGNPQSRFKAIHIAGTKGKGSTAAFLSNIL 120

Query: 121 RAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISH 180
           R EGYSVGCYTSPHIETIRERISLGRSG+MVSGKALN LFKRNKE+ DQSV+LENG +SH
Sbjct: 121 RVEGYSVGCYTSPHIETIRERISLGRSGDMVSGKALNSLFKRNKEVFDQSVELENGHLSH 180

Query: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGS 240
           FEVLTAMAFSLFAQE+VDVAVIEAGLGGARDATNIICSS LAAAVITSIGEEH+AALGGS
Sbjct: 181 FEVLTAMAFSLFAQEDVDVAVIEAGLGGARDATNIICSSELAAAVITSIGEEHVAALGGS 240

Query: 241 LESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSL 300
           LESIA AKAGIIK GCPTILGGPF+P IE ILRDKALSMSSPV+SASDPGNRSTIKGV+L
Sbjct: 241 LESIATAKAGIIKRGCPTILGGPFLPRIEYILRDKALSMSSPVISASDPGNRSTIKGVNL 300

Query: 301 LNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360
           LNG L QCCD++IQ DNEFIEL DVNLRMLG HQLQNAATATCVILTLRNLGWRISDASI
Sbjct: 301 LNGGLSQCCDIVIQIDNEFIELLDVNLRMLGPHQLQNAATATCVILTLRNLGWRISDASI 360

Query: 361 RSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLAL 420
           RSGLE+TFL+GRSHFL+A+EA +LGLPG TILLDGAHTKDSAKAL+DTIQM+FP+AQLAL
Sbjct: 361 RSGLEQTFLIGRSHFLAAREAEVLGLPGATILLDGAHTKDSAKALLDTIQMAFPEAQLAL 420

Query: 421 VVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKSRTTSAALLRDCWIQASNEMGIP 480
           VVAMASDK+H GFA EFLQGGKLES+VL+EA IGGGKSRTTSAA LRDCWIQASNE+GIP
Sbjct: 421 VVAMASDKNHVGFAREFLQGGKLESVVLTEALIGGGKSRTTSAAFLRDCWIQASNELGIP 480

Query: 481 ISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVS 540
           ISLETKDA V  TSKL NRPVLTTETSLL AIKIAAEILKQR +GR+GLVVV+GSLHAVS
Sbjct: 481 ISLETKDAEVFFTSKLGNRPVLTTETSLLHAIKIAAEILKQRTKGRQGLVVVSGSLHAVS 540

Query: 541 MVLSSLHS 549
           MVL+SLHS
Sbjct: 541 MVLASLHS 548

BLAST of Cp4.1LG14g06540 vs. TrEMBL
Match: A0A061G8S9_THECC (Folylpolyglutamate synthetase family protein isoform 2 OS=Theobroma cacao GN=TCM_015343 PE=4 SV=1)

HSP 1 Score: 668.7 bits (1724), Expect = 6.1e-189
Identity = 343/526 (65.21%), Positives = 425/526 (80.80%), Query Frame = 1

Query: 33  RIGSKQCFGNHSEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLG 92
           R+ SKQ F  ++E+ ++ +F++Y+DSLKNYEK GVP  AGTDS+DGFDLGRM+RLM+RLG
Sbjct: 23  RLESKQWFSTYTEEPELKDFIQYIDSLKNYEKSGVPKDAGTDSDDGFDLGRMRRLMDRLG 82

Query: 93  NPQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVS 152
           NPQS FK++HIAGTKGKGSTAA+LS+ILR EGYSVGCYTSPHI +IRER+S+GR G+ VS
Sbjct: 83  NPQSNFKSVHIAGTKGKGSTAAYLSNILRTEGYSVGCYTSPHILSIRERMSVGRLGKPVS 142

Query: 153 GKALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDA 212
              LN LF R K+ LD+++ LENG +SHFEVLTA+AF+LFAQENVD+A+IEAGLGGARDA
Sbjct: 143 SNTLNCLFHRIKQSLDEAIILENGCLSHFEVLTAVAFTLFAQENVDIAIIEAGLGGARDA 202

Query: 213 TNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECIL 272
           TNII SS LAA++IT+IGEEH+AALGGSLESIAMAKAGIIKHG P ILGGPF+P+I+CIL
Sbjct: 203 TNIISSSELAASIITTIGEEHLAALGGSLESIAMAKAGIIKHGRPLILGGPFLPHIDCIL 262

Query: 273 RDKALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNEF---IELFDVNLRM 332
           RDKALSMSSP+VSASD G R+ IKGVS   GR  Q CDL+IQ D +F   IEL D+NL M
Sbjct: 263 RDKALSMSSPIVSASDSGIRTAIKGVSTFKGRPSQSCDLMIQLDCDFQLSIELCDLNLSM 322

Query: 333 LGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGT 392
           LG HQLQNA TA C  L L N GW+ISD SIR+GLE T L GRS FL++KEA  LGLPG 
Sbjct: 323 LGTHQLQNAVTAACTALCLCNQGWKISDGSIRAGLENTCLQGRSQFLTSKEAETLGLPGA 382

Query: 393 TILLDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLS 452
           T+L+DGAHTK+SAKAL+DTIQM+FPD++LA+VVAMA DKDH  FA E L G ++E++ L+
Sbjct: 383 TVLIDGAHTKESAKALLDTIQMTFPDSRLAIVVAMACDKDHLAFAKELLSGRQVEAVFLT 442

Query: 453 EANIGGGKSRTTSAALLRDCWIQASNEMGIPISLET--------KDAPVSSTSKLENRPV 512
           E+NI GG SRTTSA++LRDCW+QAS E+GI +  +         +D  + ST  L +  +
Sbjct: 443 ESNIAGGTSRTTSASVLRDCWMQASRELGIKVLHDRIAEYRELFEDKYICSTRDLNHEIL 502

Query: 513 LTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVSMVLSSLH 548
           + TE SL  +++ A +IL++R   R G++VVTGSLH VS+VL+SL+
Sbjct: 503 VATENSLSDSLRFANQILRERTWNRSGIIVVTGSLHIVSLVLASLN 548

BLAST of Cp4.1LG14g06540 vs. TrEMBL
Match: E0CUK0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0050g02490 PE=4 SV=1)

HSP 1 Score: 665.6 bits (1716), Expect = 5.1e-188
Identity = 345/524 (65.84%), Positives = 418/524 (79.77%), Query Frame = 1

Query: 34  IGSKQCFGNHSEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGN 93
           +G K+ F   S + ++ +F+ YLD+LKNYEK GVP  AGTDS  GFDLGRM RLM+RLGN
Sbjct: 14  LGFKRSF---STEPELKDFLNYLDNLKNYEKCGVPKDAGTDSSHGFDLGRMNRLMDRLGN 73

Query: 94  PQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSG 153
           P++ FKA+HIAGTKGKGSTAAFL++ILR EGY+VGCYTSPH+ TIRERISLG+ GE VS 
Sbjct: 74  PETGFKAVHIAGTKGKGSTAAFLANILRTEGYAVGCYTSPHVRTIRERISLGKLGEPVSA 133

Query: 154 KALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDAT 213
           KALN LF R K +LD++V LENGR+SHFE+LTAMAF LFAQENVDVAVIEAGLGGARDAT
Sbjct: 134 KALNCLFHRIKPILDEAVALENGRLSHFEILTAMAFKLFAQENVDVAVIEAGLGGARDAT 193

Query: 214 NIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILR 273
           NII SS LAAAVIT+IGEEH+AALGGSLESIAMAK+GIIK GCP +LGGPF+P+IE I  
Sbjct: 194 NIISSSGLAAAVITTIGEEHLAALGGSLESIAMAKSGIIKQGCPLVLGGPFLPHIEHIFL 253

Query: 274 DKALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNE---FIELFDVNLRML 333
           DKA SM SPVVSAS PGNRS +KGVS  NG+  Q CD++I+ + +   FIELFDV L+ML
Sbjct: 254 DKASSMCSPVVSASGPGNRSAVKGVSKSNGKPFQSCDIVIEVERDFKLFIELFDVKLQML 313

Query: 334 GRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTT 393
           G HQLQNAATATCV L LR+ GWRISD SI +GLE  +L+GRS FL++ EA  LGLPG T
Sbjct: 314 GIHQLQNAATATCVALCLRDKGWRISDESIHAGLEHAYLLGRSQFLTSTEAETLGLPGAT 373

Query: 394 ILLDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLSE 453
           I+LDGAHTK+SAKALVDTIQM+FP+A+LAL+VAMASDKDH  FA EFL GG+LE++ L+E
Sbjct: 374 IMLDGAHTKESAKALVDTIQMTFPEARLALIVAMASDKDHMAFAREFLSGGQLEAVFLTE 433

Query: 454 ANIGGGKSRTTSAALLRDCWIQASNEMGIPISLE--------TKDAPVSSTSKLENRPVL 513
            NI G KSRTTSA++LRDCWIQAS E+GI    +         ++    S  + +++ +L
Sbjct: 434 VNIAGAKSRTTSASMLRDCWIQASKELGINTLHDGMEEYQKLFENQSFCSAGESKHKTIL 493

Query: 514 TTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVSMVLSSL 547
             E SLL ++++  +IL+ R   +  ++V+TGSLH VS VLSSL
Sbjct: 494 AAENSLLVSLRVGNQILRARTRDQTSIIVITGSLHIVSTVLSSL 534

BLAST of Cp4.1LG14g06540 vs. TrEMBL
Match: A0A0D2Q2P9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G270800 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 1.3e-183
Identity = 333/531 (62.71%), Positives = 422/531 (79.47%), Query Frame = 1

Query: 28  PSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRL 87
           P  + ++ SKQC  +++E+ ++  F +++DSLKNYEK GVP  AGTDS+DGFDLGRMKRL
Sbjct: 19  PLSNFQLESKQCLSSYTEEPELEGFSQFIDSLKNYEKSGVPKDAGTDSDDGFDLGRMKRL 78

Query: 88  MERLGNPQSKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRS 147
           M RLGNP S FK++HIAGTKGKGSTAA+LS+ILR+EGYSVGCYTSPH+ +IRER+S+G+ 
Sbjct: 79  MTRLGNPLSNFKSVHIAGTKGKGSTAAYLSNILRSEGYSVGCYTSPHMLSIRERMSVGKM 138

Query: 148 GEMVSGKALNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLG 207
           G+ VS  ALN LF   K  L++++ LENG +SHFEVLTA+AF+LFAQENVD+A+IEAGLG
Sbjct: 139 GKPVSSNALNCLFHSIKRSLNEAIVLENGCLSHFEVLTAVAFALFAQENVDIAIIEAGLG 198

Query: 208 GARDATNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPN 267
           GARDATN+I SS L A++IT+IGEEH+AALGGSLESIAMAKAGIIKHG P ILGGPF+P+
Sbjct: 199 GARDATNVISSSELDASIITTIGEEHLAALGGSLESIAMAKAGIIKHGRPVILGGPFLPH 258

Query: 268 IECILRDKALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDN---EFIELFD 327
           I+ ILRD+A SM SP+VSASD G R++IKG+ +  GR  QCCDL+I+ D+     IEL D
Sbjct: 259 IDRILRDRAASMFSPIVSASDAGVRTSIKGIGMFKGRPSQCCDLVIELDHGSQSSIELRD 318

Query: 328 VNLRMLGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGML 387
           +NL MLG HQLQNA TATC  L LRN GWRIS+ SIR+GLE TFL GRS FLS+KEA  L
Sbjct: 319 LNLSMLGTHQLQNAVTATCAALCLRNQGWRISNGSIRAGLENTFLPGRSQFLSSKEAEKL 378

Query: 388 GLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLE 447
           GL G+T+L+DGAHTKDSAKAL++TIQ +FPD++LA+VVAMASDKDH  FA EFL G +LE
Sbjct: 379 GLSGSTVLVDGAHTKDSAKALLETIQTTFPDSRLAIVVAMASDKDHLAFAKEFLSGKQLE 438

Query: 448 SIVLSEANIGGGKSRTTSAALLRDCWIQASNEMGIPISLET--------KDAPVSSTSKL 507
           ++ L+EA+I GG SRTTSA  LRDCWIQAS E+GI +  +         +D  +SST   
Sbjct: 439 AVFLTEADIAGGTSRTTSATALRDCWIQASRELGIEVLHDRMTRYRELFEDNFISSTRDS 498

Query: 508 ENRPVLTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVSMVLSSLH 548
           ++  ++  + SL  +++ A +IL++R     G++VVTGSLH VS+VL+SL+
Sbjct: 499 KHETIVAAQNSLSDSLRFANQILRERTRNELGILVVTGSLHIVSLVLASLN 549

BLAST of Cp4.1LG14g06540 vs. TrEMBL
Match: U5FM82_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s06150g PE=4 SV=1)

HSP 1 Score: 650.2 bits (1676), Expect = 2.2e-183
Identity = 340/513 (66.28%), Positives = 418/513 (81.48%), Query Frame = 1

Query: 40  FGNHSEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFK 99
           F  ++E+ +  EF++YLDSLKNYEKLGVP  AGTDS+DG DLGRM+RLM+RLGNPQSKFK
Sbjct: 6   FSKYTEEPEPKEFIDYLDSLKNYEKLGVPKDAGTDSDDGLDLGRMRRLMDRLGNPQSKFK 65

Query: 100 AIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFL 159
           A+H+AGTKGKGSTAA+LS+ILRAEGYSVGCYTSPH+ +IRERISLG+SG  VS K LN L
Sbjct: 66  AVHVAGTKGKGSTAAYLSNILRAEGYSVGCYTSPHMMSIRERISLGQSGNPVSTKTLNKL 125

Query: 160 FKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSS 219
           F   K  LD++++LENG ++HFEVLTA AF+L A+E VD+AVIEAGLGGARDATNI+CSS
Sbjct: 126 FHMIKPKLDEAIQLENGSLTHFEVLTATAFTLMAEEKVDIAVIEAGLGGARDATNILCSS 185

Query: 220 RLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSM 279
            LAA+VIT+IGEEH+AALGGSLESIA+AK+GIIK+G P +LGGPF+ +++ ILRDKA  M
Sbjct: 186 ELAASVITTIGEEHLAALGGSLESIAVAKSGIIKYGRPVVLGGPFLSHVDRILRDKASVM 245

Query: 280 SSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNE---FIELFDVNLRMLGRHQLQ 339
            SPVVSASD G R++IKG+ +L+GR CQ  D++IQ + +   FIEL DV LRMLGRHQL 
Sbjct: 246 CSPVVSASDAGIRTSIKGLIILDGRPCQLSDIMIQVERDFPLFIELSDVKLRMLGRHQLH 305

Query: 340 NAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGA 399
           NA++A CV L LR+ G RISD SIR+GLE TFL+GRS FLS+KE  +LGLPG TILLDGA
Sbjct: 306 NASSAACVALCLRDQGCRISDRSIRAGLENTFLLGRSQFLSSKETEVLGLPGATILLDGA 365

Query: 400 HTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGG 459
           HTKDSAKALVDT++M+FPDA++ALVVAMASDKDH  FA EFL G +LE++ L+EA+I GG
Sbjct: 366 HTKDSAKALVDTVRMAFPDARVALVVAMASDKDHLAFAREFLSGLQLEAVFLTEADIAGG 425

Query: 460 KSRTTSAALLRDCWIQASNEMGIPISLETKDAPVSSTSKL--ENRPVLTTETSLLRAIKI 519
           KSRTTSA+LL DCWIQAS E+GI     T    +    +L  EN+ +L TE S   A++ 
Sbjct: 426 KSRTTSASLLMDCWIQASEELGI----NTLHDGMEKNRELLEENKIILATEKSPEVAMRA 485

Query: 520 AAEILKQRIEGRRGLVVVTGSLHAVSMVLSSLH 548
           A E L++R   R  ++VVTGSLH VS++L+SLH
Sbjct: 486 ANETLRRRAGNRSSVIVVTGSLHIVSLLLASLH 514

BLAST of Cp4.1LG14g06540 vs. TAIR10
Match: AT5G41480.1 (AT5G41480.1 Folylpolyglutamate synthetase family protein)

HSP 1 Score: 555.4 bits (1430), Expect = 3.8e-158
Identity = 305/509 (59.92%), Positives = 383/509 (75.25%), Query Frame = 1

Query: 44  SEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHI 103
           +ED ++ +F+ +L+SLKNYEK GVP GAGTDS+DGFDLGRMKRLM RL NP  K+K +H+
Sbjct: 40  TEDPELRDFVGFLESLKNYEKSGVPKGAGTDSDDGFDLGRMKRLMLRLRNPHYKYKVVHV 99

Query: 104 AGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRN 163
           AGTKGKGST+AFLS+ILRA GYSVGCY+SPHI +I+ERIS   +GE VS   LN LF   
Sbjct: 100 AGTKGKGSTSAFLSNILRAGGYSVGCYSSPHILSIKERISC--NGEPVSASTLNDLFYSV 159

Query: 164 KELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAA 223
           K +L+QS++ ENG +SHFE+LT +AFSLF +ENVD+AVIEAGLGGARDATN+I SS LAA
Sbjct: 160 KPILEQSIQEENGSLSHFEILTGIAFSLFEKENVDIAVIEAGLGGARDATNVIESSNLAA 219

Query: 224 AVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPV 283
           +VIT+IGEEHMAALGGSLESIA AK+GIIKHG P +LGGPF+P+IE ILR KA S+SS V
Sbjct: 220 SVITTIGEEHMAALGGSLESIAEAKSGIIKHGRPVVLGGPFLPHIEGILRSKAASVSSSV 279

Query: 284 VSASDPGNRSTIKGVSLLNG-RLCQCCDLIIQT---DNEFIELFDVNLRMLGRHQLQNAA 343
           + AS+ G+ S+IKG+   NG  LCQ CD++IQ    D   +EL DVNLRMLG HQLQNA 
Sbjct: 280 ILASNIGSSSSIKGIINKNGIGLCQSCDIVIQNEKDDQPIVELSDVNLRMLGHHQLQNAV 339

Query: 344 TATCVILTLRNLG-WRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHT 403
           TATCV L LR+ G  R++D +IR GLE T L+GRS FL+ KEA  L LPG T+LLDGAHT
Sbjct: 340 TATCVSLCLRDQGCGRVTDEAIRIGLENTRLLGRSQFLTPKEAETLLLPGATVLLDGAHT 399

Query: 404 KDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKS 463
           K+SA+AL + I+  FP+ +L  VVAMASDKDH  FA E L G K E+++L+EA+IGGGK 
Sbjct: 400 KESARALKEMIKKDFPEKRLVFVVAMASDKDHVSFAKELLSGLKPEAVILTEADIGGGKI 459

Query: 464 RTTSAALLRDCWIQASNEMGIPISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEI 523
           R+T +++L++ WI+A++E+G             S    EN+ V       L ++K+A +I
Sbjct: 460 RSTESSVLKESWIKAADELG-----------SRSMEASENKTV-------LGSLKLAYKI 519

Query: 524 LK-QRIEGRRGLVVVTGSLHAVSMVLSSL 547
           L         G+V+VTGSLH VS VL+SL
Sbjct: 520 LSDDTTSSDSGMVIVTGSLHIVSSVLASL 528

BLAST of Cp4.1LG14g06540 vs. TAIR10
Match: AT3G55630.3 (AT3G55630.3 DHFS-FPGS homolog D)

HSP 1 Score: 119.8 bits (299), Expect = 5.3e-27
Identity = 105/317 (33.12%), Positives = 147/317 (46.37%), Query Frame = 1

Query: 96  SKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKA 155
           S+ K IH+AGTKGKGST  F  SILR  G   G +TSPH+  +RER  L    E+   K 
Sbjct: 56  SQMKIIHVAGTKGKGSTCTFAESILRCYGLRTGLFTSPHLIDVRERFRL-NGIEISQEKF 115

Query: 156 LNFL---FKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDA 215
           +N+    F + KE     V +     ++F  L  +AF +F  E VDV ++E GLGG  DA
Sbjct: 116 VNYFWCCFHKLKEKTSNEVPMP----TYFCFLALLAFKIFTTEQVDVVILEVGLGGRFDA 175

Query: 216 TNIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTI-LGGPFIPNIECI 275
           TN+I   +     I+S+G +HM  LG +L  IA  KAGI K G P   +  P       +
Sbjct: 176 TNVI--QKPVVCGISSLGYDHMEILGYTLAEIAAEKAGIFKSGVPAFTVAQP--DEAMRV 235

Query: 276 LRDKA--LSMSSPVVSASDPGNRSTIKGV-SLLNGRLCQCCDLIIQTDNEFIELFDVNLR 335
           L +KA  L ++  VV   D   R  ++G    LN  L              + L    L+
Sbjct: 236 LNEKASKLEVNLQVVEPLDSSQRLGLQGEHQYLNAGLA-------------VALCSTFLK 295

Query: 336 MLGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPG 395
            +G    +N    T  +                SGL   +L+GR+  +   E     LP 
Sbjct: 296 EIGIED-KNGLDQTNGL-----------PEKFISGLSNAYLMGRAMIVPDSE-----LPE 333

Query: 396 TTI-LLDGAHTKDSAKA 405
             +  LDGAH+ +S +A
Sbjct: 356 EIVYYLDGAHSPESMEA 333

BLAST of Cp4.1LG14g06540 vs. TAIR10
Match: AT5G05980.1 (AT5G05980.1 DHFS-FPGS homolog B)

HSP 1 Score: 115.2 bits (287), Expect = 1.3e-25
Identity = 92/317 (29.02%), Positives = 146/317 (46.06%), Query Frame = 1

Query: 97  KFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKAL 156
           K   IH+AGTKGKGST  F  SI+R  G+  G +TSPH+  +RER  L    ++   K L
Sbjct: 111 KMNVIHVAGTKGKGSTCTFTESIIRNYGFRTGLFTSPHLIDVRERFRLD-GVDISEEKFL 170

Query: 157 NFL---FKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDAT 216
            +    + R KE  ++ + +     ++F  L  +AF +FA E VD A++E GLGG  DAT
Sbjct: 171 GYFWWCYNRLKERTNEEIPMP----TYFRFLALLAFKIFAAEEVDAAILEVGLGGKFDAT 230

Query: 217 NIICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILR 276
           N +   +     I+S+G +HM  LG +L  IA  KAGI K G P       +P       
Sbjct: 231 NAV--QKPVVCGISSLGYDHMEILGDTLGKIAGEKAGIFKLGVPAFT----VPQ-----P 290

Query: 277 DKALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNEF------IELFDVNL 336
           D+A+ +     S ++      ++ V  L  RL     L +  ++++      + L  + L
Sbjct: 291 DEAMRVLEEKASETE----VNLEVVQPLTARLLSGQKLGLDGEHQYVNAGLAVSLASIWL 350

Query: 337 RMLGRHQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLP 396
           + +G+ ++ +    +            I       GL    L GR+  +  +        
Sbjct: 351 QQIGKLEVPSRTQMS------------ILPEKFIKGLATASLQGRAQVVPDQYTESRTSG 395

Query: 397 GTTILLDGAHTKDSAKA 405
                LDGAH+ +S +A
Sbjct: 411 DLVFYLDGAHSPESMEA 395

BLAST of Cp4.1LG14g06540 vs. TAIR10
Match: AT3G10160.1 (AT3G10160.1 DHFS-FPGS homolog C)

HSP 1 Score: 110.2 bits (274), Expect = 4.2e-24
Identity = 79/208 (37.98%), Positives = 114/208 (54.81%), Query Frame = 1

Query: 97  KFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKAL 156
           + K IH+AGTKGKGST  F  +ILR  G+  G +TSPH+  +RER  +    ++   K L
Sbjct: 130 ELKVIHVAGTKGKGSTCVFSEAILRNCGFRTGMFTSPHLIDVRERFRID-GLDISEEKFL 189

Query: 157 NFLFKRNKELLDQSVKLENGRISH--FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATN 216
            + ++  K L +++V   +G      F+ LT +AF +F  E VDVAVIE GLGG  D+TN
Sbjct: 190 QYFWECWKLLKEKAV---DGLTMPPLFQFLTVLAFKIFVCEKVDVAVIEVGLGGKLDSTN 249

Query: 217 IICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRD 276
           +I   +     I S+G +HM  LG +L  IA  KAGI K   P     P +     +L+ 
Sbjct: 250 VI--QKPVVCGIASLGMDHMDILGNTLADIAFHKAGIFKPQIPAFT-VPQLSEAMDVLQK 309

Query: 277 KALSMSSP--VVSASDPGNRSTIKGVSL 301
            A ++  P  VV+  +P     + GV+L
Sbjct: 310 TANNLEVPLEVVAPLEP---KKLDGVTL 327

BLAST of Cp4.1LG14g06540 vs. NCBI nr
Match: gi|659130858|ref|XP_008465388.1| (PREDICTED: dihydrofolate synthetase isoform X1 [Cucumis melo])

HSP 1 Score: 936.8 bits (2420), Expect = 1.7e-269
Identity = 479/548 (87.41%), Positives = 514/548 (93.80%), Query Frame = 1

Query: 1   MNLFKYHHHFRPQIHGRLLLNYFVGEGPSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLK 60
           MN+FKYHHH RPQI G LLLN+F+GE PSISSR GSKQCF  HSEDQ MTEFMEYLDSLK
Sbjct: 1   MNVFKYHHHCRPQILGSLLLNHFIGECPSISSRNGSKQCFATHSEDQHMTEFMEYLDSLK 60

Query: 61  NYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSIL 120
           NYEKLGVP GAGTDS+DGFDLGRM+RLMERLGNPQSKFKAIHIAGTKGKGS AAFLS+IL
Sbjct: 61  NYEKLGVPRGAGTDSDDGFDLGRMRRLMERLGNPQSKFKAIHIAGTKGKGSIAAFLSNIL 120

Query: 121 RAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISH 180
           RAEGYSVGCYTSPHIET+RERISLGRSGEMVSGKALN LFKRNKE+ DQSV+LE+GRISH
Sbjct: 121 RAEGYSVGCYTSPHIETLRERISLGRSGEMVSGKALNSLFKRNKEVFDQSVELEHGRISH 180

Query: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGS 240
           FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSS LAAAVITSIGEEHMAALGGS
Sbjct: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSGLAAAVITSIGEEHMAALGGS 240

Query: 241 LESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSL 300
           LESIAMAKAGIIK GCPTILGG F+P IE ILRDKALSMSSPV++ SDPGNRSTIKGV++
Sbjct: 241 LESIAMAKAGIIKRGCPTILGGSFLPRIEYILRDKALSMSSPVIAVSDPGNRSTIKGVNM 300

Query: 301 LNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360
           LNG LCQCCD++IQ DNEF+EL DVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI
Sbjct: 301 LNGGLCQCCDIVIQIDNEFVELLDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360

Query: 361 RSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLAL 420
           RSGLE+TFL+GRSHFL+AKEA +LGLPG TILLDGAHTKDSAKALVDTIQM+FP+A+LAL
Sbjct: 361 RSGLEQTFLIGRSHFLAAKEAEVLGLPGATILLDGAHTKDSAKALVDTIQMAFPEARLAL 420

Query: 421 VVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKSRTTSAALLRDCWIQASNEMGIP 480
           V+AMASDK+H  FA EFLQGGKLE +VL+EA+IGGGKSRTTSAA LRDCWIQAS E+GIP
Sbjct: 421 VIAMASDKNHVDFAREFLQGGKLECVVLTEAHIGGGKSRTTSAAFLRDCWIQASIELGIP 480

Query: 481 ISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVS 540
           ISLETKDA VS TSKLENRPVLTTETSLL AIKIAAEILKQR +G++ LVVV+GSLHAVS
Sbjct: 481 ISLETKDAEVSFTSKLENRPVLTTETSLLHAIKIAAEILKQRTKGQQSLVVVSGSLHAVS 540

Query: 541 MVLSSLHS 549
           MVLSSLHS
Sbjct: 541 MVLSSLHS 548

BLAST of Cp4.1LG14g06540 vs. NCBI nr
Match: gi|449463470|ref|XP_004149457.1| (PREDICTED: probable dihydrofolate synthetase isoform X2 [Cucumis sativus])

HSP 1 Score: 933.3 bits (2411), Expect = 1.9e-268
Identity = 477/548 (87.04%), Positives = 513/548 (93.61%), Query Frame = 1

Query: 1   MNLFKYHHHFRPQIHGRLLLNYFVGEGPSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLK 60
           MN+FKYHHH RPQI G LLLN+F+GE PSISSR GSKQCF  HSEDQ MT+FMEYLDSLK
Sbjct: 1   MNVFKYHHHCRPQILGSLLLNHFIGECPSISSRFGSKQCFATHSEDQHMTQFMEYLDSLK 60

Query: 61  NYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSIL 120
           NYEKLGVP G+GTDS+DGFDLGRM+RLMERLGNPQS+FKAIHIAGTKGKGSTAAFLS+IL
Sbjct: 61  NYEKLGVPRGSGTDSDDGFDLGRMRRLMERLGNPQSRFKAIHIAGTKGKGSTAAFLSNIL 120

Query: 121 RAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISH 180
           R EGYSVGCYTSPHIETIRERISLGRSG+MVSGKALN LFKRNKE+ DQSV+LENG +SH
Sbjct: 121 RVEGYSVGCYTSPHIETIRERISLGRSGDMVSGKALNSLFKRNKEVFDQSVELENGHLSH 180

Query: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGS 240
           FEVLTAMAFSLFAQE+VDVAVIEAGLGGARDATNIICSS LAAAVITSIGEEH+AALGGS
Sbjct: 181 FEVLTAMAFSLFAQEDVDVAVIEAGLGGARDATNIICSSELAAAVITSIGEEHVAALGGS 240

Query: 241 LESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSL 300
           LESIA AKAGIIK GCPTILGGPF+P IE ILRDKALSMSSPV+SASDPGNRSTIKGV+L
Sbjct: 241 LESIATAKAGIIKRGCPTILGGPFLPRIEYILRDKALSMSSPVISASDPGNRSTIKGVNL 300

Query: 301 LNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360
           LNG L QCCD++IQ DNEFIEL DVNLRMLG HQLQNAATATCVILTLRNLGWRISDASI
Sbjct: 301 LNGGLSQCCDIVIQIDNEFIELLDVNLRMLGPHQLQNAATATCVILTLRNLGWRISDASI 360

Query: 361 RSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLAL 420
           RSGLE+TFL+GRSHFL+A+EA +LGLPG TILLDGAHTKDSAKAL+DTIQM+FP+AQLAL
Sbjct: 361 RSGLEQTFLIGRSHFLAAREAEVLGLPGATILLDGAHTKDSAKALLDTIQMAFPEAQLAL 420

Query: 421 VVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKSRTTSAALLRDCWIQASNEMGIP 480
           VVAMASDK+H GFA EFLQGGKLES+VL+EA IGGGKSRTTSAA LRDCWIQASNE+GIP
Sbjct: 421 VVAMASDKNHVGFAREFLQGGKLESVVLTEALIGGGKSRTTSAAFLRDCWIQASNELGIP 480

Query: 481 ISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVS 540
           ISLETKDA V  TSKL NRPVLTTETSLL AIKIAAEILKQR +GR+GLVVV+GSLHAVS
Sbjct: 481 ISLETKDAEVFFTSKLGNRPVLTTETSLLHAIKIAAEILKQRTKGRQGLVVVSGSLHAVS 540

Query: 541 MVLSSLHS 549
           MVL+SLHS
Sbjct: 541 MVLASLHS 548

BLAST of Cp4.1LG14g06540 vs. NCBI nr
Match: gi|778680896|ref|XP_011651415.1| (PREDICTED: probable dihydrofolate synthetase isoform X1 [Cucumis sativus])

HSP 1 Score: 922.5 bits (2383), Expect = 3.4e-265
Identity = 471/541 (87.06%), Positives = 506/541 (93.53%), Query Frame = 1

Query: 1   MNLFKYHHHFRPQIHGRLLLNYFVGEGPSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLK 60
           MN+FKYHHH RPQI G LLLN+F+GE PSISSR GSKQCF  HSEDQ MT+FMEYLDSLK
Sbjct: 1   MNVFKYHHHCRPQILGSLLLNHFIGECPSISSRFGSKQCFATHSEDQHMTQFMEYLDSLK 60

Query: 61  NYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSIL 120
           NYEKLGVP G+GTDS+DGFDLGRM+RLMERLGNPQS+FKAIHIAGTKGKGSTAAFLS+IL
Sbjct: 61  NYEKLGVPRGSGTDSDDGFDLGRMRRLMERLGNPQSRFKAIHIAGTKGKGSTAAFLSNIL 120

Query: 121 RAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISH 180
           R EGYSVGCYTSPHIETIRERISLGRSG+MVSGKALN LFKRNKE+ DQSV+LENG +SH
Sbjct: 121 RVEGYSVGCYTSPHIETIRERISLGRSGDMVSGKALNSLFKRNKEVFDQSVELENGHLSH 180

Query: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGS 240
           FEVLTAMAFSLFAQE+VDVAVIEAGLGGARDATNIICSS LAAAVITSIGEEH+AALGGS
Sbjct: 181 FEVLTAMAFSLFAQEDVDVAVIEAGLGGARDATNIICSSELAAAVITSIGEEHVAALGGS 240

Query: 241 LESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSL 300
           LESIA AKAGIIK GCPTILGGPF+P IE ILRDKALSMSSPV+SASDPGNRSTIKGV+L
Sbjct: 241 LESIATAKAGIIKRGCPTILGGPFLPRIEYILRDKALSMSSPVISASDPGNRSTIKGVNL 300

Query: 301 LNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360
           LNG L QCCD++IQ DNEFIEL DVNLRMLG HQLQNAATATCVILTLRNLGWRISDASI
Sbjct: 301 LNGGLSQCCDIVIQIDNEFIELLDVNLRMLGPHQLQNAATATCVILTLRNLGWRISDASI 360

Query: 361 RSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDGAHTKDSAKALVDTIQMSFPDAQLAL 420
           RSGLE+TFL+GRSHFL+A+EA +LGLPG TILLDGAHTKDSAKAL+DTIQM+FP+AQLAL
Sbjct: 361 RSGLEQTFLIGRSHFLAAREAEVLGLPGATILLDGAHTKDSAKALLDTIQMAFPEAQLAL 420

Query: 421 VVAMASDKDHNGFATEFLQGGKLESIVLSEANIGGGKSRTTSAALLRDCWIQASNEMGIP 480
           VVAMASDK+H GFA EFLQGGKLES+VL+EA IGGGKSRTTSAA LRDCWIQASNE+GIP
Sbjct: 421 VVAMASDKNHVGFAREFLQGGKLESVVLTEALIGGGKSRTTSAAFLRDCWIQASNELGIP 480

Query: 481 ISLETKDAPVSSTSKLENRPVLTTETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVS 540
           ISLETKDA V  TSKL NRPVLTTETSLL AIKIAAEILKQR +GR+GLVVV+GSLHAVS
Sbjct: 481 ISLETKDAEVFFTSKLGNRPVLTTETSLLHAIKIAAEILKQRTKGRQGLVVVSGSLHAVS 540

Query: 541 M 542
           M
Sbjct: 541 M 541

BLAST of Cp4.1LG14g06540 vs. NCBI nr
Match: gi|659130860|ref|XP_008465389.1| (PREDICTED: dihydrofolate synthetase isoform X2 [Cucumis melo])

HSP 1 Score: 698.0 bits (1800), Expect = 1.3e-197
Identity = 350/395 (88.61%), Positives = 372/395 (94.18%), Query Frame = 1

Query: 1   MNLFKYHHHFRPQIHGRLLLNYFVGEGPSISSRIGSKQCFGNHSEDQQMTEFMEYLDSLK 60
           MN+FKYHHH RPQI G LLLN+F+GE PSISSR GSKQCF  HSEDQ MTEFMEYLDSLK
Sbjct: 1   MNVFKYHHHCRPQILGSLLLNHFIGECPSISSRNGSKQCFATHSEDQHMTEFMEYLDSLK 60

Query: 61  NYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQSKFKAIHIAGTKGKGSTAAFLSSIL 120
           NYEKLGVP GAGTDS+DGFDLGRM+RLMERLGNPQSKFKAIHIAGTKGKGS AAFLS+IL
Sbjct: 61  NYEKLGVPRGAGTDSDDGFDLGRMRRLMERLGNPQSKFKAIHIAGTKGKGSIAAFLSNIL 120

Query: 121 RAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKALNFLFKRNKELLDQSVKLENGRISH 180
           RAEGYSVGCYTSPHIET+RERISLGRSGEMVSGKALN LFKRNKE+ DQSV+LE+GRISH
Sbjct: 121 RAEGYSVGCYTSPHIETLRERISLGRSGEMVSGKALNSLFKRNKEVFDQSVELEHGRISH 180

Query: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSRLAAAVITSIGEEHMAALGGS 240
           FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSS LAAAVITSIGEEHMAALGGS
Sbjct: 181 FEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNIICSSGLAAAVITSIGEEHMAALGGS 240

Query: 241 LESIAMAKAGIIKHGCPTILGGPFIPNIECILRDKALSMSSPVVSASDPGNRSTIKGVSL 300
           LESIAMAKAGIIK GCPTILGG F+P IE ILRDKALSMSSPV++ SDPGNRSTIKGV++
Sbjct: 241 LESIAMAKAGIIKRGCPTILGGSFLPRIEYILRDKALSMSSPVIAVSDPGNRSTIKGVNM 300

Query: 301 LNGRLCQCCDLIIQTDNEFIELFDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360
           LNG LCQCCD++IQ DNEF+EL DVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI
Sbjct: 301 LNGGLCQCCDIVIQIDNEFVELLDVNLRMLGRHQLQNAATATCVILTLRNLGWRISDASI 360

Query: 361 RSGLEKTFLVGRSHFLSAKEAGMLGLPGTTILLDG 396
           RSGLE+TFL+GRSHFL+AKEA +LGLPG TILLDG
Sbjct: 361 RSGLEQTFLIGRSHFLAAKEAEVLGLPGATILLDG 395

BLAST of Cp4.1LG14g06540 vs. NCBI nr
Match: gi|1009122795|ref|XP_015878195.1| (PREDICTED: dihydrofolate synthetase [Ziziphus jujuba])

HSP 1 Score: 694.1 bits (1790), Expect = 1.9e-196
Identity = 357/523 (68.26%), Positives = 429/523 (82.03%), Query Frame = 1

Query: 36  SKQCFGNHSEDQQMTEFMEYLDSLKNYEKLGVPTGAGTDSEDGFDLGRMKRLMERLGNPQ 95
           S++     SED  + +FMEYLD+LKNYEK GVP  AGTDS++GFDLGRM+RLME LGNPQ
Sbjct: 35  SRKSLCTRSEDTVLKDFMEYLDALKNYEKSGVPKSAGTDSDEGFDLGRMRRLMELLGNPQ 94

Query: 96  SKFKAIHIAGTKGKGSTAAFLSSILRAEGYSVGCYTSPHIETIRERISLGRSGEMVSGKA 155
           S FKA+HIAG+KGKGSTAAFLSSILRAEGYSVGCYTSPHI+TIRERISLGR GE V+ KA
Sbjct: 95  SNFKAVHIAGSKGKGSTAAFLSSILRAEGYSVGCYTSPHIQTIRERISLGRFGEPVAAKA 154

Query: 156 LNFLFKRNKELLDQSVKLENGRISHFEVLTAMAFSLFAQENVDVAVIEAGLGGARDATNI 215
           LN LF R K+++DQ+V+LENG ISHFEVLTAMAF+LFAQENVD+AVIEAGLGGARDATN+
Sbjct: 155 LNSLFNRTKKIIDQAVELENGCISHFEVLTAMAFTLFAQENVDIAVIEAGLGGARDATNV 214

Query: 216 ICSSRLAAAVITSIGEEHMAALGGSLESIAMAKAGIIKHGCPTILGGPFIPNIECILRDK 275
           I SS LA +VIT+IGEEHMAALGGSLESIA+AK+GIIKHGCP +LGGPF+P+IECILR+K
Sbjct: 215 ISSSGLALSVITTIGEEHMAALGGSLESIAVAKSGIIKHGCPVVLGGPFLPHIECILRNK 274

Query: 276 ALSMSSPVVSASDPGNRSTIKGVSLLNGRLCQCCDLIIQTDNE---FIELFDVNLRMLGR 335
           A SM SPV+SA D GN+S IKG+S+ NGR CQ CD++IQ ++E   FIELFDV L MLG 
Sbjct: 275 ASSMHSPVISAYDTGNQSKIKGISIHNGRPCQSCDIVIQVESEINLFIELFDVQLYMLGS 334

Query: 336 HQLQNAATATCVILTLRNLGWRISDASIRSGLEKTFLVGRSHFLSAKEAGMLGLPGTTIL 395
           HQLQNAATATC  L LRNLGW+ISD SI+ GL+ T L+GRS FL++KEA  LG+    I+
Sbjct: 335 HQLQNAATATCAALCLRNLGWKISDRSIKDGLQHTHLLGRSQFLTSKEAEALGVSKPMIM 394

Query: 396 LDGAHTKDSAKALVDTIQMSFPDAQLALVVAMASDKDHNGFATEFLQGGKLESIVLSEAN 455
           LDGAHTK+SAKALV+TIQM+FP A+LALVVAMASDKDH GFA EFL GG+LE ++L+EA+
Sbjct: 395 LDGAHTKESAKALVETIQMTFPRARLALVVAMASDKDHVGFAREFLSGGQLEGVLLTEAD 454

Query: 456 IGGGKSRTTSAALLRDCWIQASNEMGIPISLET--------KDAPVSSTSKLENRPVLTT 515
           I GGKSRT +A+ LRDCWI+AS E+GI +  +          D    S S ++NR +L  
Sbjct: 455 IAGGKSRTAAASFLRDCWIKASEELGITLVHDKMSDYQELFNDQLDCSASTIQNRTILAV 514

Query: 516 ETSLLRAIKIAAEILKQRIEGRRGLVVVTGSLHAVSMVLSSLH 548
           E S L ++KIA +IL++R     G++VVTGSLH VS+VL++LH
Sbjct: 515 EASFLSSMKIADQILRKRTGNGFGIIVVTGSLHIVSLVLATLH 557

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
DHFS_ARATH    6.7e-157   59.92         Dihydrofolate synthetase OS=Arabidopsis thaliana GN=DHFS PE=1 SV=1
FOLC_BACSU    3.1e-45    35.31         Folylpolyglutamate synthase OS=Bacillus subtilis (strain 168) GN=folC PE=3 SV=2
FOLCP_HALVD   1.5e-31    29.87         Probable bifunctional folylpolyglutamate synthase/dihydropteroate synthase OS=Ha...
FOLC_LACCA    3.4e-31    27.58         Folylpolyglutamate synthase OS=Lactobacillus casei GN=fgs PE=1 SV=1
FOLD_SCHPO    1.3e-30    32.97         Probable dihydrofolate synthetase OS=Schizosaccharomyces pombe (strain 972 / ATC...
Match Name        E-value    Identity (%)  Description
A0A0A0L7A4_CUCSA  1.3e-268   87.04         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G354540 PE=4 SV=1
A0A061G8S9_THECC  6.1e-189   65.21         Folylpolyglutamate synthetase family protein isoform 2 OS=Theobroma cacao GN=TCM...
E0CUK0_VITVI      5.1e-188   65.84         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0050g02490 PE=4 SV=...
A0A0D2Q2P9_GOSRA  1.3e-183   62.71         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G270800 PE=4 SV=1
U5FM82_POPTR      2.2e-183   66.28         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s06150g PE=4 SV=1
Match Name    E-value    Identity (%)  Description
AT5G41480.1   3.8e-158   59.92         Folylpolyglutamate synthetase family protein
AT3G55630.3   5.3e-27    33.12         DHFS-FPGS homolog D
AT5G05980.1   1.3e-25    29.02         DHFS-FPGS homolog B
AT3G10160.1   4.2e-24    37.98         DHFS-FPGS homolog C
Match Name                          E-value    Identity (%)  Description
gi|659130858|ref|XP_008465388.1|    1.7e-269   87.41         PREDICTED: dihydrofolate synthetase isoform X1 [Cucumis melo]
gi|449463470|ref|XP_004149457.1|    1.9e-268   87.04         PREDICTED: probable dihydrofolate synthetase isoform X2 [Cucumis sativus]
gi|778680896|ref|XP_011651415.1|    3.4e-265   87.06         PREDICTED: probable dihydrofolate synthetase isoform X1 [Cucumis sativus]
gi|659130860|ref|XP_008465389.1|    1.3e-197   88.61         PREDICTED: dihydrofolate synthetase isoform X2 [Cucumis melo]
gi|1009122795|ref|XP_015878195.1|   1.9e-196   68.26         PREDICTED: dihydrofolate synthetase [Ziziphus jujuba]
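
The identity percentages in the summary tables are the matched-residue counts from each HSP statistics line divided by the alignment length. A minimal Python sketch of that arithmetic follows, assuming the statistics lines have the layout shown in the reports above. The names `parse_hsp_stats`, `percent`, and `is_consistent` are hypothetical helpers written for this illustration and are not part of BLAST or of this database. The pattern also tolerates the "Postives" spelling that some exports of this report use.

```python
import re

# Layout assumed from the HSP statistics lines in the reports above.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, align_len, ident_pct, pos, _pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(align_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(matches, align_len):
    """Percentage of aligned positions that match, to two decimals."""
    return round(100.0 * matches / align_len, 2)


def is_consistent(stats, tolerance=0.01):
    """Check that the reported percentages agree with the raw counts."""
    ident_ok = abs(percent(stats["identities"], stats["align_len"])
                   - stats["identity_pct"]) <= tolerance
    pos_ok = abs(percent(stats["positives"], stats["align_len"])
                 - stats["positives_pct"]) <= tolerance
    return ident_ok and pos_ok


if __name__ == "__main__":
    # Statistics line of the AT5G41480.1 hit from the TAIR10 report.
    line = ("Identity = 305/509 (59.92%), Positives = 383/509 (75.25%), "
            "Query Frame = 1")
    stats = parse_hsp_stats(line)
    print(stats["identities"], stats["align_len"], is_consistent(stats))
```

For the AT5G41480.1 hit, 305 identical positions over an alignment of 509 give the 59.92 listed in the TAIR10 summary table.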
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0016874   ligase activity
GO:0005524   ATP binding
GO:0004326   tetrahydrofolylpolyglutamate synthase activity
Vocabulary: Biological Process
Term         Definition
GO:0009058   biosynthetic process
GO:0009396   folic acid-containing compound biosynthetic process
Vocabulary: INTERPRO
Term         Definition
IPR018109    Folylpolyglutamate_synth_CS
IPR013221    Mur_ligase_cen
IPR004101    Mur_ligase_C
IPR001645    Folylpolyglutamate_synth
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006761       dihydrofolate biosynthetic process
biological_process   GO:0009793       embryo development ending in seed dormancy
biological_process   GO:0046656       folic acid biosynthetic process
biological_process   GO:0006536       glutamate metabolic process
biological_process   GO:0006730       one-carbon metabolic process
biological_process   GO:0046901       tetrahydrofolylpolyglutamate biosynthetic process
biological_process   GO:0009058       biosynthetic process
biological_process   GO:0009396       folic acid-containing compound biosynthetic process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0005524       ATP binding
molecular_function   GO:0008841       dihydrofolate synthase activity
molecular_function   GO:0004326       tetrahydrofolylpolyglutamate synthase activity
molecular_function   GO:0016874       ligase activity

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG14g06540.1   Cp4.1LG14g06540.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                Source    Source Term         Source Description                                 Alignment
IPR001645  Folylpolyglutamate synthetase                  PANTHER   PTHR11136           FOLYLPOLYGLUTAMATE SYNTHASE-RELATED                coord: 305..548, score: 3.2E-198; coord: 44..288, score: 3.2E...
IPR001645  Folylpolyglutamate synthetase                  TIGRFAMs  TIGR01499           TIGR01499                                          coord: 81..544, score: 2.4...
IPR004101  Mur ligase, C-terminal                         GENE3D    G3DSA:3.90.190.20   -                                                  coord: 387..547, score: 4.2...
IPR004101  Mur ligase, C-terminal                         unknown   SSF53244            MurD-like peptide ligases, peptide-binding domain  coord: 525..546, score: 6.93E-12; coord: 386..484, score: 6.93...
IPR013221  Mur ligase, central                            GENE3D    G3DSA:3.40.1190.10  -                                                  coord: 55..367, score: 2.7...
IPR013221  Mur ligase, central                            PFAM      PF08245             Mur_ligase_M                                       coord: 103..344, score: 5.0...
IPR013221  Mur ligase, central                            unknown   SSF53623            MurD-like peptide ligases, catalytic domain        coord: 78..368, score: 1.26...
IPR018109  Folylpolyglutamate synthetase, conserved site  PROSITE   PS01011             FOLYLPOLYGLU_SYNT_1                                coord: 101..124, scor...
IPR018109  Folylpolyglutamate synthetase, conserved site  PROSITE   PS01012             FOLYLPOLYGLU_SYNT_2                                coord: 201..216, scor...
None       No IPR available                               PANTHER   PTHR11136:SF0       DIHYDROFOLATE SYNTHETASE-RELATED                   coord: 305..548, score: 3.2E-198; coord: 44..288, score: 3.2E...

The following gene(s) are paralogous to this gene:

None