Cp4.1LG02g16580 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g16580
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Bifunctional protein FolD
Location: Cp4.1LG02 : 12774935 .. 12778865 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

TAGCCCAAACAATGCAAGCGATTTTCATAAACGGATCCACTTCTCATGAGCGCAGTCCATCGCCCAAAATCGAAGCCCATCAGATTCCTCAACTGCGATTTCATGGTTGCACCTATTCAGCGAGTCTAAAATCAATCCTTCTCGATCTCAGCTCATCCATTACAACCTTCTGCAAGAGACATTGTCACTGGTCGCAATTCAGAAGGGATAAGCATTTTCGCCTTGTCACCTGCAATAATGGCGTCCGAATCCGAGCACAAAGCTACAATAATCGACGGCAAACAAATTGCTCAGACTGTTAGATCTGAAATAGCGGAGGAAGTGAAAAAACTCTCTGAGAAATACGGAAAGGTTCAATTTCCTGTGCTTGAACGTTGTTTTTAGTATTTTGATCTATTATTGTGACAAATTATGCGTTATATCTGTCTGGTTTTAGTATTTTTCTTGTTCATTGTGGTGTGGGATTTGTGAAGGTTCCGGGGTTGGCGGTTGTGATTGTGGGGAGTAGAAAGGATTCGCAGAGTTATGTGAACATGAAGAGAAAGGCTTGCGCGGAAGTTGGAATCAAGTCCTTTGATTATGATCTTCCCGAGCAAGTGTCTGAGGCCGAATTGATCAGCAAGATCCATGAGTTGAATGCGAATCCTGAAGTGCATGGTTAATCTCTTCTGTGCTTTTATGCTTATGCTTACTTGATTTTTTAGTGCTACACTACAAATTTTATGGAGCATCCACTGTGTTTGAAATATCATAGTGAAATTGCATCTTTGATAGATAAACGTTACTTGGATTCATTTTTGTGATGTGTGAAAAAATGTTAATTAAAGTACAAGCACCCCTTTAGTGCTTGTAGAAGCATTTCAAGCATTTTAGGTGTTTATAGAAAATTTACAAAGTTACTCTTGTTTATGTGTCAACATTTGAAAAACTCTTCAAATTGCCTAAAAGTACTTTTTTAGTGTTCTAAGAGTAACTTACTGTTAATAATTTGTTCAAAATTTTAATGCTTAGAGTTTTGAATTTTGAATTTTTTTGCTTTCACAGGGATATTGGTTCAACTTCCTTTGCCAAAGCACATAAATGAGGAAAAAGTTTTATCTGAAATCAACATTGAGAAGGATGTAGATGGGTTTCATCCTCTGAACATTGGGAAACTTGCAATGAAAGGCAGAGATCCCTTATTCCTACCTTGCACTCCAAAGGCAATAACAAATGTTTAATCTTCTGTTAATTGGAAGAGCAATAGCTCCTTATTTAGCAATGTTCAAGATTTGGTTTTATGTACAGGGGTGTCTTGAGCTACTGTCACGAAGTGGAATAAGCATTAGGGGAAAGAAGGCAGTTGTTATGGGGCGAAGCAACATCGTTGGATTACCAGTTTCATTGCTGCTCCTTAAAGCAGATGCAACTGTGACGATTGTCCATTCCCGTTCTGTCAATCCAGAAAGTGTTATCCGCGAAGCTGACATTGTTATTGCTGCTGCAGGACAGGCACAAATGGTATGCTTGGCTTTACAATCACTTGTTTTCGCTTTCCCTATATTTCTATTTGAATATCTGTTTGGCTTCACCGAGAATATAGACTTTGATTCCTAAGTTTACTTGAAAGCAAACAATGCTATGAAATATGCTTTAGGTGCTAGAATAGTATTGGAAAAGAAAAAAAAGGGTTAGCATTGGAATATAATCCCTGAAGTTGGAGAATATAGACTTTGATTCCTAAGTTCTACCTATTTAAACTTTTCTTGATAAACACACTAAATCCATTTGGCCAATAAAAGGATGTTCCCCTTGGGAATGTCAAAAGGCGCCTGTTGGATGAAAGTCCCACATCGGCTAATTTAGGGAATGATCATGGATTTATAATCAAAGAATACTATCTCTTGGGGAATCCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATCATGGAAATCGTGTTTATCTAACGTGGTATCAGAGCCATGCCCTAAACTTAGCCAT
GTCAATAGAATCCTCAAAGGTCGAACAAAAGTATTGTGAGCCTTGAAGGCATAGTAAAAAATGACTAAGACTCCAAAAGAAAAGGAGTCGAGCCTCTATTAAGGGGAGGCGTACTTTGTTTGAGGGGAAGTGTTGGATGAAAGTCGCACATCGGCTAATTTAGGGAATGATCATGGGTTTATAATCAAAGAATACTATCTCTATTGGTATGAGGCCTTTTGGGGAAGCCCAAAGCAAAACCATGAGAGTTTATGCTCAAAGTAGACAGTATCATACCATTGTGGAGAGTCGTGTTCAATTAACAGCGCCAATCCTCCTTCAATAGGGGTTTGTATAATAGACCACCTCCTGGCTACCCTTTTTGTCAATGCCTCCCTTCCAAAGGAAAGTCTTACACAACTTCTCTATAGCTTTTGTGATTTTGGATAAGGGCGTGCCTTCCTCCTCCTGATATATGAGATGCCCTCCAATACTGCCACCGTTTCTCCATTTTTTCATTAATTGGCGCCCAAAAATTCAGAGTGCCAAATAGGTAGGAGGTCAGTAACCAACCTGGCAGCCAAAAGAAGATGTGACGAACTCAGCTCGCTGTGAATCTAACCCAACACCCAAAAGTTCCAATTTCTGACAATTAATATTTAGAGCCGAAGCTACCTCAAACTCATGGATGATATGGAAGAAGAATCGGAATTGAGGAGGTTGAGAAGAAGACTGGTGGGATTGTTGGGAAGACATGGCTGGTGGTGTAGGGTAGGAGAGAGGAAGTCGTGAGGAAACTTAAGCACAAGCATTAAACTCACCATATATCCACAAGTCCTCCAAGCTTGAGTACCATAAGTAATCATAATGCTTCCATTTGGATCCAAGCGATCATAACAATCTTTAAAGAGACCAAATGTGGTGGTCTGAACTCCACTGGAGCAGGCAAAAGAAGATGTTTGAAAGGTAATGCTTATCCTGTTTCAGCTACAATGCTGGTTGGGAGTGGGTGTTTTAAGTAGACAAAGTTACAAAGGAATGTTTGAAATTGGAAGTTCTTAAAAACCTAGATTTTGGAAGAAATGGCTAATGGGGATTTGAGTGAAGTTGAGTAAATGGGGGAAATAAAAATGGTGGGTTTTGTTGATAATTGCTACGTGTTTCTTCGGATTAATGAAACGAATGGTTTCTTCTATCGGAAGTGAGCTTGACTTTACTTTATCCAATTTTCTACCTTCTTCTATCTCTAGCTCCCACATAGTGTTGAATCAGCTCGTTCACCCAATAATCATACTGCCTCATTATTTAGTTAACAATATCGATAAAGCTAAATAGAATCATAAATCAAAGTTCAGGAACATAACAGAAAGATGTCGAAGTCGAAGAGATTAAATGCGCACACAAACCTTAAACTTTGCCAGAAGCATAAAGAAACAGGCATAGTCATTGCAGGCCAGTAATATCTGCTAGTAGAGAGCATTAGTTCTGTTAGCTTATTGATTTTGTTGGTTGATGTTAGATCAAGGGCAGTTGGATAAAACCAGGGGCTGCAGTGATTGATGTTGGTACAAACGCAGTTGATGATCCCACCCGAAAGTCGGGCTACCGGCTGGTTGGGGATGTAGATTTCCAGGAAGCTTGTAAAGTGGCTGGTTGGGTAACTCCTGTTCCCGGTGGTGTGGGTCCTATGACTGTTGCCATGCTGCTTAGGAACACTTTAGATGGAGCTAAGCGTGTGATCGAGCAGTGATACCATACCCATATCAAGTGGCTTCATTGTTGTGTTTCTAAGGTCTTGCAAATTTGGAGACAATTTTGTAGGGAAAATAAAAACCTTAATAATCGTGTGGATATCTTGAAGAATGAATAATTTTAGCTTGTTGAATGTTAATATGGATCCTTGTGGGACTTTCTTAGTAATTTATAAATAAATGTTCTACCTATTTTGTGCC

mRNA sequence

TAGCCCAAACAATGCAAGCGATTTTCATAAACGGATCCACTTCTCATGAGCGCAGTCCATCGCCCAAAATCGAAGCCCATCAGATTCCTCAACTGCGATTTCATGGTTGCACCTATTCAGCGAGTCTAAAATCAATCCTTCTCGATCTCAGCTCATCCATTACAACCTTCTGCAAGAGACATTGTCACTGGTCGCAATTCAGAAGGGATAAGCATTTTCGCCTTGTCACCTGCAATAATGGCGTCCGAATCCGAGCACAAAGCTACAATAATCGACGGCAAACAAATTGCTCAGACTGTTAGATCTGAAATAGCGGAGGAAGTGAAAAAACTCTCTGAGAAATACGGAAAGGTTCCGGGGTTGGCGGTTGTGATTGTGGGGAGTAGAAAGGATTCGCAGAGTTATGTGAACATGAAGAGAAAGGCTTGCGCGGAAGTTGGAATCAAGTCCTTTGATTATGATCTTCCCGAGCAAGTGTCTGAGGCCGAATTGATCAGCAAGATCCATGAGTTGAATGCGAATCCTGAAGTGCATGGGATATTGGTTCAACTTCCTTTGCCAAAGCACATAAATGAGGAAAAAGTTTTATCTGAAATCAACATTGAGAAGGATGTAGATGGGTTTCATCCTCTGAACATTGGGAAACTTGCAATGAAAGGCAGAGATCCCTTATTCCTACCTTGCACTCCAAAGGCAATAACAAATGGGTGTCTTGAGCTACTGTCACGAAGTGGAATAAGCATTAGGGGAAAGAAGGCAGTTGTTATGGGGCGAAGCAACATCGTTGGATTACCAGTTTCATTGCTGCTCCTTAAAGCAGATGCAACTGTGACGATTGTCCATTCCCGTTCTGTCAATCCAGAAAGTGTTATCCGCGAAGCTGACATTGTTATTGCTGCTGCAGGACAGGCACAAATGATCAAGGGCAGTTGGATAAAACCAGGGGCTGCAGTGATTGATGTTGGTACAAACGCAGTTGATGATCCCACCCGAAAGTCGGGCTACCGGCTGGTTGGGGATGTAGATTTCCAGGAAGCTTGTAAAGTGGCTGGTTGGGTAACTCCTGTTCCCGGTGGTGTGGGTCCTATGACTGTTGCCATGCTGCTTAGGAACACTTTAGATGGAGCTAAGCGTGTGATCGAGCAGTGATACCATACCCATATCAAGTGGCTTCATTGTTGTGTTTCTAAGGTCTTGCAAATTTGGAGACAATTTTGTAGGGAAAATAAAAACCTTAATAATCGTGTGGATATCTTGAAGAATGAATAATTTTAGCTTGTTGAATGTTAATATGGATCCTTGTGGGACTTTCTTAGTAATTTATAAATAAATGTTCTACCTATTTTGTGCC

Coding sequence (CDS)

ATGGCGTCCGAATCCGAGCACAAAGCTACAATAATCGACGGCAAACAAATTGCTCAGACTGTTAGATCTGAAATAGCGGAGGAAGTGAAAAAACTCTCTGAGAAATACGGAAAGGTTCCGGGGTTGGCGGTTGTGATTGTGGGGAGTAGAAAGGATTCGCAGAGTTATGTGAACATGAAGAGAAAGGCTTGCGCGGAAGTTGGAATCAAGTCCTTTGATTATGATCTTCCCGAGCAAGTGTCTGAGGCCGAATTGATCAGCAAGATCCATGAGTTGAATGCGAATCCTGAAGTGCATGGGATATTGGTTCAACTTCCTTTGCCAAAGCACATAAATGAGGAAAAAGTTTTATCTGAAATCAACATTGAGAAGGATGTAGATGGGTTTCATCCTCTGAACATTGGGAAACTTGCAATGAAAGGCAGAGATCCCTTATTCCTACCTTGCACTCCAAAGGCAATAACAAATGGGTGTCTTGAGCTACTGTCACGAAGTGGAATAAGCATTAGGGGAAAGAAGGCAGTTGTTATGGGGCGAAGCAACATCGTTGGATTACCAGTTTCATTGCTGCTCCTTAAAGCAGATGCAACTGTGACGATTGTCCATTCCCGTTCTGTCAATCCAGAAAGTGTTATCCGCGAAGCTGACATTGTTATTGCTGCTGCAGGACAGGCACAAATGATCAAGGGCAGTTGGATAAAACCAGGGGCTGCAGTGATTGATGTTGGTACAAACGCAGTTGATGATCCCACCCGAAAGTCGGGCTACCGGCTGGTTGGGGATGTAGATTTCCAGGAAGCTTGTAAAGTGGCTGGTTGGGTAACTCCTGTTCCCGGTGGTGTGGGTCCTATGACTGTTGCCATGCTGCTTAGGAACACTTTAGATGGAGCTAAGCGTGTGATCGAGCAGTGA

Protein sequence

MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRVIEQ
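The protein sequence above is consistent with a standard-genetic-code translation of the CDS. A minimal Python sketch of that check follows; it assumes the standard nuclear codon table, and the names `CODON_TABLE` and `translate` are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGCGTCCGAATCCGAGCACAAAGCTACA"))
print(translate("GTGATCGAGCAGTGA"))
```

Passing the full CDS string to `translate` should reproduce the complete protein sequence, provided the CDS is in frame from its first base.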
BLAST of Cp4.1LG02g16580 vs. Swiss-Prot
Match: FOLD2_ARATH (Bifunctional protein FolD 2 OS=Arabidopsis thaliana GN=FOLD2 PE=2 SV=1)

HSP 1 Score: 487.3 bits (1253), Expect = 1.2e-136
Identity = 245/300 (81.67%), Positives = 275/300 (91.67%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS S+H A IIDGK IA T+RSEIAEEV+ LSEK+GKVPGLAVVIVGSRKDSQ+YVN K
Sbjct: 1   MASSSDHTAKIIDGKAIAHTIRSEIAEEVRGLSEKHGKVPGLAVVIVGSRKDSQTYVNTK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKACAEVGIKSFD  LPE+VSEA+LISK+HELN+NP+VHGILVQLPLPKHINEE +L  I
Sbjct: 61  RKACAEVGIKSFDVGLPEEVSEADLISKVHELNSNPDVHGILVQLPLPKHINEEHILGAI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +I+KDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GCLELL+RSG+ I+G++AVV+GRS
Sbjct: 121 SIDKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLARSGVKIKGQRAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVT VHS + +PE++IREADIVIAA GQA MIKG+WIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTTVHSHTKDPEAIIREADIVIAACGQAHMIKGNWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAV DP++KSGYRLVGDVDF EA KVAG++TPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAVSDPSKKSGYRLVGDVDFAEASKVAGFITPVPGGVGPMTVAMLLRNTVDGAKRV 296

BLAST of Cp4.1LG02g16580 vs. Swiss-Prot
Match: FOLD4_ARATH (Bifunctional protein FolD 4, chloroplastic OS=Arabidopsis thaliana GN=FOLD4 PE=1 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 9.0e-103
Identity = 180/298 (60.40%), Positives = 233/298 (78.19%), Query Frame = 1

Query: 3   SESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRK 62
           ++SE  A +IDGK +A+ +R EI  EV ++ E  G +PGLAV++VG RKDS +YV  K+K
Sbjct: 63  TKSEGGAIVIDGKAVAKKIRDEITIEVSRMKESIGVIPGLAVILVGDRKDSATYVRNKKK 122

Query: 63  ACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINI 122
           AC  VGIKSF+  L E  SE E++  +   N +P VHGILVQLPLP H++E+ +L+ ++I
Sbjct: 123 ACDSVGIKSFEVRLAEDSSEEEVLKSVSGFNDDPSVHGILVQLPLPSHMDEQNILNAVSI 182

Query: 123 EKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNI 182
           EKDVDGFHPLNIG+LAM+GR+PLF+PCTPK    GC+ELL R  I I+GK+AVV+GRSNI
Sbjct: 183 EKDVDGFHPLNIGRLAMRGREPLFVPCTPK----GCIELLHRYNIEIKGKRAVVIGRSNI 242

Query: 183 VGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDV 242
           VG+P +LLL + DATV+I+HSR+ NPE + READI+I+A GQ  M++GSWIKPGA +IDV
Sbjct: 243 VGMPAALLLQREDATVSIIHSRTKNPEEITREADIIISAVGQPNMVRGSWIKPGAVLIDV 302

Query: 243 GTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 301
           G N V+DP+   GYRLVGD+ ++EA KVA  +TPVPGGVGPMT+AMLL NTL  AKR+
Sbjct: 303 GINPVEDPSAARGYRLVGDICYEEASKVASAITPVPGGVGPMTIAMLLSNTLTSAKRI 356

BLAST of Cp4.1LG02g16580 vs. Swiss-Prot
Match: FOLD1_ARATH (Bifunctional protein FolD 1, mitochondrial OS=Arabidopsis thaliana GN=FOLD1 PE=2 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 3.3e-97
Identity = 175/297 (58.92%), Positives = 230/297 (77.44%), Query Frame = 1

Query: 4   ESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRKA 63
           E+E K  +IDG  IA+ +R++I  EV K+ +  GKVPGLAVV+VG ++DSQ+YV  K KA
Sbjct: 58  ETEQKTVVIDGNVIAEEIRTKIISEVGKMKKAVGKVPGLAVVLVGEQRDSQTYVRNKIKA 117

Query: 64  CAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINIE 123
           C E GIKS   +LPE  +E ++IS + + N +  +HGILVQLPLP+H+NE K+L+ + +E
Sbjct: 118 CEETGIKSVLAELPEDCTEGQIISVLRKFNEDTSIHGILVQLPLPQHLNESKILNMVRLE 177

Query: 124 KDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNIV 183
           KDVDGFHPLN+G LAM+GR+PLF+ CTPK    GC+ELL R+G+ I GK AVV+GRSNIV
Sbjct: 178 KDVDGFHPLNVGNLAMRGREPLFVSCTPK----GCVELLIRTGVEIAGKNAVVIGRSNIV 237

Query: 184 GLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDVG 243
           GLP+SLLL + DATV+ VH+ + +PE + R+ADIVIAAAG   +++GSW+KPGA VIDVG
Sbjct: 238 GLPMSLLLQRHDATVSTVHAFTKDPEHITRKADIVIAAAGIPNLVRGSWLKPGAVVIDVG 297

Query: 244 TNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 301
           T  V+D + + GYRLVGDV ++EA  VA  +TPVPGGVGPMT+ MLL NTL+ AKR+
Sbjct: 298 TTPVEDSSCEFGYRLVGDVCYEEALGVASAITPVPGGVGPMTITMLLCNTLEAAKRI 350

BLAST of Cp4.1LG02g16580 vs. Swiss-Prot
Match: FOLD_MAGMM (Bifunctional protein FolD OS=Magnetococcus marinus (strain ATCC BAA-1437 / JCM 17883 / MC-1) GN=folD PE=3 SV=1)

HSP 1 Score: 295.0 bits (754), Expect = 9.1e-79
Identity = 158/291 (54.30%), Positives = 203/291 (69.76%), Query Frame = 1

Query: 9   ATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRKACAEVG 68
           A +IDGK IAQ+VR E+  EV++L   +   PGLAVV+VG+   SQ YV  K++AC   G
Sbjct: 2   AHVIDGKAIAQSVREELRMEVERLKLNHQLTPGLAVVLVGADPASQVYVRNKKRACETAG 61

Query: 69  IKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINIEKDVDG 128
           I SF ++L    S+AEL++ I +LN +  VHGILVQLPLPKHI+E+KVL  I+  KD DG
Sbjct: 62  IASFSHELAATTSQAELLALIEQLNQDDAVHGILVQLPLPKHIDEQKVLEAISPSKDADG 121

Query: 129 FHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNIVGLPVS 188
           FHP N+G+L     DP F PCTP     G +E+L  SG+  +GK AVV+GRSNIVG PV+
Sbjct: 122 FHPYNVGRLVT--GDPTFQPCTPW----GVMEMLKVSGVDPKGKHAVVIGRSNIVGKPVA 181

Query: 189 LLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDVGTNAVD 248
           L+LL A ATVTI HSR+ +    ++ ADIV+AA G+A M+ GSWIK GA VIDVG N  +
Sbjct: 182 LMLLAAHATVTICHSRTPDLAETVKRADIVVAAVGRANMVPGSWIKKGAVVIDVGINRGE 241

Query: 249 DPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKR 300
           D       +L GDVD+    + A  +TPVPGGVGPMT+AMLL+NT++GAKR
Sbjct: 242 DG------KLCGDVDYASCFEHASAITPVPGGVGPMTIAMLLKNTVEGAKR 280

BLAST of Cp4.1LG02g16580 vs. Swiss-Prot
Match: FOLD_GEOUR (Bifunctional protein FolD OS=Geobacter uraniireducens (strain Rf4) GN=folD PE=3 SV=1)

HSP 1 Score: 294.3 bits (752), Expect = 1.5e-78
Identity = 158/291 (54.30%), Positives = 205/291 (70.45%), Query Frame = 1

Query: 9   ATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRKACAEVG 68
           A IIDGK IA  +R EI  EV KL+ K G  PGLAVV+VG    S+ YV+MK KAC +VG
Sbjct: 2   AKIIDGKAIAAKIRGEITAEVAKLASK-GVTPGLAVVLVGEDPASKVYVSMKEKACKDVG 61

Query: 69  IKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINIEKDVDG 128
           I S +Y LP   SEA+L+  IH+LN++P++HGIL+QLPLPK I+ EKVL  I+ EKD DG
Sbjct: 62  IFSDEYKLPVDTSEADLLLLIHKLNSDPKIHGILIQLPLPKQIDTEKVLEAISPEKDADG 121

Query: 129 FHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNIVGLPVS 188
           FHP N+G+L +    PLF PCTP     G + +L  +G+ + GK+ VV+GRSNIVG PV+
Sbjct: 122 FHPYNVGRLVI--GKPLFQPCTP----YGVMVMLKEAGVELAGKEVVVVGRSNIVGKPVA 181

Query: 189 LLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDVGTNAVD 248
            + L+ +ATVT+ HS++ +  + +  AD+VIAA GQ +MIKG+WIK GA VIDVG N V 
Sbjct: 182 FMCLQQNATVTLCHSKTRDLAAKVGMADVVIAAVGQPEMIKGAWIKEGAVVIDVGVNRVG 241

Query: 249 DPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKR 300
           +       +LVGDV+F  A + A  +TPVPGGVGPMT+ MLL NTL+ AKR
Sbjct: 242 EK------KLVGDVEFDAAAERASAITPVPGGVGPMTITMLLYNTLEAAKR 279

BLAST of Cp4.1LG02g16580 vs. TrEMBL
Match: A0A0A0KWH1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G001540 PE=3 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 5.1e-153
Identity = 279/303 (92.08%), Positives = 293/303 (96.70%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MASES+HKATIIDGK+IAQTVRSEI EEV KLS+KYGK+PGLAVVIVG+RKDS +YVNMK
Sbjct: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKAC EVGIKSF+ DLPEQVSEAELISK+HELNANPEVHGILVQLPLP HINEEKVLSEI
Sbjct: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 194

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +IEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCIELLSRSGISIRGKKAVVMGRS 254

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVTIVHSRSV+PESVIREADI+IAAAGQAQMIKGSWIKPGAAVI
Sbjct: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAVDDPT+KSGYRLVGDVDFQEACKVAGW+TPVPGGVGPMTVAMLLRNTLDGAKRV
Sbjct: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 373

Query: 301 IEQ 304
           IEQ
Sbjct: 375 IEQ 373

BLAST of Cp4.1LG02g16580 vs. TrEMBL
Match: A0A067K8J8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13900 PE=3 SV=1)

HSP 1 Score: 520.4 bits (1339), Expect = 1.5e-144
Identity = 258/303 (85.15%), Positives = 287/303 (94.72%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS S+HKA +IDGK IAQT+RSEIA+EV++LSEKYGKVPGLAVVIVG RKDSQSYV+MK
Sbjct: 1   MASPSDHKAAVIDGKAIAQTIRSEIADEVRQLSEKYGKVPGLAVVIVGHRKDSQSYVSMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKAC EVGIKSF  DLPEQ+SEAELISK+HELNANP+VHGILVQLPLPKHINEE +LSEI
Sbjct: 61  RKACVEVGIKSFGVDLPEQISEAELISKVHELNANPDVHGILVQLPLPKHINEENILSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           ++EKDVDGFHPLNIGKLAMKGR+PLF+PCTPK    GCLELLSRSGISI+GK AVV+GRS
Sbjct: 121 SLEKDVDGFHPLNIGKLAMKGREPLFVPCTPK----GCLELLSRSGISIKGKNAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVTIVHSR+ +PES+IREADI+IAAAGQA+M+KGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRTDDPESIIREADIIIAAAGQAKMVKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNA+DDP+RKSGYRLVGDVD++EACKVAGW+TPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAIDDPSRKSGYRLVGDVDYEEACKVAGWITPVPGGVGPMTVAMLLRNTVDGAKRV 299

Query: 301 IEQ 304
             Q
Sbjct: 301 FGQ 299

BLAST of Cp4.1LG02g16580 vs. TrEMBL
Match: M5VKQ0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009269mg PE=3 SV=1)

HSP 1 Score: 519.6 bits (1337), Expect = 2.5e-144
Identity = 263/303 (86.80%), Positives = 285/303 (94.06%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS+S+HKA IIDGK IAQT+R+EIAEEV+ LS+KYGKVPGLAVVIVG+RKDSQSYV+MK
Sbjct: 1   MASQSDHKAAIIDGKAIAQTIRNEIAEEVRHLSQKYGKVPGLAVVIVGNRKDSQSYVSMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKACAEVGI S D DLPE VS+ +LI+K+HELNANP+VHGILVQLPLPKHINEEKVLSEI
Sbjct: 61  RKACAEVGILSLDIDLPEDVSQVDLIAKVHELNANPDVHGILVQLPLPKHINEEKVLSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +IEKDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GCLELLSRSGISI+GKKAVV+GRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLSRSGISIKGKKAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVT+VHS S +PES+IREADIVIAAAGQA MIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTVVHSHSHDPESIIREADIVIAAAGQAMMIKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNA+DD +RKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLL NTLDGAKRV
Sbjct: 241 DVGTNAIDDSSRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLNNTLDGAKRV 299

Query: 301 IEQ 304
           I Q
Sbjct: 301 IAQ 299

BLAST of Cp4.1LG02g16580 vs. TrEMBL
Match: F6HZ29_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g00070 PE=3 SV=1)

HSP 1 Score: 518.8 bits (1335), Expect = 4.3e-144
Identity = 262/303 (86.47%), Positives = 286/303 (94.39%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS   HKA+IIDGK IAQ +RSEIAEEV+ LSEKYGKVPGLAVVIVG+RKDSQSYV+MK
Sbjct: 1   MASPDGHKASIIDGKAIAQAIRSEIAEEVRHLSEKYGKVPGLAVVIVGNRKDSQSYVSMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKACAEVGIKSFD DLPEQV E+ELI K+HELNA P+VHGILVQLPLPKHINEEKVLSEI
Sbjct: 61  RKACAEVGIKSFDVDLPEQVLESELIGKVHELNALPDVHGILVQLPLPKHINEEKVLSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           ++EKDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GCLELLSRSGIS++GKKAVV+GRS
Sbjct: 121 SLEKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLSRSGISVKGKKAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPV+LLLLKADATVT+VHS + +PES+IR+ADIVIAAAGQA MIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVALLLLKADATVTVVHSHTQDPESIIRDADIVIAAAGQAMMIKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAV+DP++KSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLL+NTLDGAKRV
Sbjct: 241 DVGTNAVNDPSKKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLKNTLDGAKRV 299

Query: 301 IEQ 304
           IEQ
Sbjct: 301 IEQ 299

BLAST of Cp4.1LG02g16580 vs. TrEMBL
Match: A0A068UM70_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00030020001 PE=3 SV=1)

HSP 1 Score: 516.5 bits (1329), Expect = 2.1e-143
Identity = 260/310 (83.87%), Positives = 287/310 (92.58%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS S+HKAT+IDGK +A T+RSE+A+EV++LS+KYGKVPGLAVVIVG RKDSQSYV+MK
Sbjct: 1   MASSSDHKATVIDGKAVAHTIRSEVADEVRQLSQKYGKVPGLAVVIVGHRKDSQSYVSMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKACAEVGIKSFD +LPEQVSEAELISK+HELNANP+VHGILVQLPLPKH+NEEKVL EI
Sbjct: 61  RKACAEVGIKSFDINLPEQVSEAELISKVHELNANPDVHGILVQLPLPKHVNEEKVLGEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAI-------TNGCLELLSRSGISIRGKK 180
           ++EKDVDGFHPLNIGKLAMKGR+PLFLPCTPKAI         GCLELLSRSGISI+GKK
Sbjct: 121 SLEKDVDGFHPLNIGKLAMKGREPLFLPCTPKAIYFEALFLEQGCLELLSRSGISIKGKK 180

Query: 181 AVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWI 240
           AVV+GRSNIVGLPVSLLLLK DATVTIVHSR+  PE +IREADIVIAAAGQA MI+GSWI
Sbjct: 181 AVVVGRSNIVGLPVSLLLLKEDATVTIVHSRTKEPEKIIREADIVIAAAGQANMIQGSWI 240

Query: 241 KPGAAVIDVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNT 300
           K GAAVIDVGTNAVDD T+KSGYRLVGDVDF+EA KVAGW+TPVPGGVGPMTVAMLL+NT
Sbjct: 241 KSGAAVIDVGTNAVDDRTKKSGYRLVGDVDFKEASKVAGWITPVPGGVGPMTVAMLLKNT 300

Query: 301 LDGAKRVIEQ 304
           +DGAKRVIEQ
Sbjct: 301 VDGAKRVIEQ 310

BLAST of Cp4.1LG02g16580 vs. TAIR10
Match: AT3G12290.1 (AT3G12290.1 Amino acid dehydrogenase family protein)

HSP 1 Score: 487.3 bits (1253), Expect = 7.0e-138
Identity = 245/300 (81.67%), Positives = 275/300 (91.67%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS S+H A IIDGK IA T+RSEIAEEV+ LSEK+GKVPGLAVVIVGSRKDSQ+YVN K
Sbjct: 1   MASSSDHTAKIIDGKAIAHTIRSEIAEEVRGLSEKHGKVPGLAVVIVGSRKDSQTYVNTK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKACAEVGIKSFD  LPE+VSEA+LISK+HELN+NP+VHGILVQLPLPKHINEE +L  I
Sbjct: 61  RKACAEVGIKSFDVGLPEEVSEADLISKVHELNSNPDVHGILVQLPLPKHINEEHILGAI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +I+KDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GCLELL+RSG+ I+G++AVV+GRS
Sbjct: 121 SIDKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLARSGVKIKGQRAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVT VHS + +PE++IREADIVIAA GQA MIKG+WIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTTVHSHTKDPEAIIREADIVIAACGQAHMIKGNWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAV DP++KSGYRLVGDVDF EA KVAG++TPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAVSDPSKKSGYRLVGDVDFAEASKVAGFITPVPGGVGPMTVAMLLRNTVDGAKRV 296

BLAST of Cp4.1LG02g16580 vs. TAIR10
Match: AT4G00620.1 (AT4G00620.1 Amino acid dehydrogenase family protein)

HSP 1 Score: 374.8 bits (961), Expect = 5.1e-104
Identity = 180/298 (60.40%), Positives = 233/298 (78.19%), Query Frame = 1

Query: 3   SESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRK 62
           ++SE  A +IDGK +A+ +R EI  EV ++ E  G +PGLAV++VG RKDS +YV  K+K
Sbjct: 63  TKSEGGAIVIDGKAVAKKIRDEITIEVSRMKESIGVIPGLAVILVGDRKDSATYVRNKKK 122

Query: 63  ACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINI 122
           AC  VGIKSF+  L E  SE E++  +   N +P VHGILVQLPLP H++E+ +L+ ++I
Sbjct: 123 ACDSVGIKSFEVRLAEDSSEEEVLKSVSGFNDDPSVHGILVQLPLPSHMDEQNILNAVSI 182

Query: 123 EKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNI 182
           EKDVDGFHPLNIG+LAM+GR+PLF+PCTPK    GC+ELL R  I I+GK+AVV+GRSNI
Sbjct: 183 EKDVDGFHPLNIGRLAMRGREPLFVPCTPK----GCIELLHRYNIEIKGKRAVVIGRSNI 242

Query: 183 VGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDV 242
           VG+P +LLL + DATV+I+HSR+ NPE + READI+I+A GQ  M++GSWIKPGA +IDV
Sbjct: 243 VGMPAALLLQREDATVSIIHSRTKNPEEITREADIIISAVGQPNMVRGSWIKPGAVLIDV 302

Query: 243 GTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 301
           G N V+DP+   GYRLVGD+ ++EA KVA  +TPVPGGVGPMT+AMLL NTL  AKR+
Sbjct: 303 GINPVEDPSAARGYRLVGDICYEEASKVASAITPVPGGVGPMTIAMLLSNTLTSAKRI 356

BLAST of Cp4.1LG02g16580 vs. TAIR10
Match: AT2G38660.1 (AT2G38660.1 Amino acid dehydrogenase family protein)

HSP 1 Score: 356.3 bits (913), Expect = 1.9e-98
Identity = 175/297 (58.92%), Positives = 230/297 (77.44%), Query Frame = 1

Query: 4   ESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMKRKA 63
           E+E K  +IDG  IA+ +R++I  EV K+ +  GKVPGLAVV+VG ++DSQ+YV  K KA
Sbjct: 58  ETEQKTVVIDGNVIAEEIRTKIISEVGKMKKAVGKVPGLAVVLVGEQRDSQTYVRNKIKA 117

Query: 64  CAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINIE 123
           C E GIKS   +LPE  +E ++IS + + N +  +HGILVQLPLP+H+NE K+L+ + +E
Sbjct: 118 CEETGIKSVLAELPEDCTEGQIISVLRKFNEDTSIHGILVQLPLPQHLNESKILNMVRLE 177

Query: 124 KDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNIV 183
           KDVDGFHPLN+G LAM+GR+PLF+ CTPK    GC+ELL R+G+ I GK AVV+GRSNIV
Sbjct: 178 KDVDGFHPLNVGNLAMRGREPLFVSCTPK----GCVELLIRTGVEIAGKNAVVIGRSNIV 237

Query: 184 GLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDVG 243
           GLP+SLLL + DATV+ VH+ + +PE + R+ADIVIAAAG   +++GSW+KPGA VIDVG
Sbjct: 238 GLPMSLLLQRHDATVSTVHAFTKDPEHITRKADIVIAAAGIPNLVRGSWLKPGAVVIDVG 297

Query: 244 TNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 301
           T  V+D + + GYRLVGDV ++EA  VA  +TPVPGGVGPMT+ MLL NTL+ AKR+
Sbjct: 298 TTPVEDSSCEFGYRLVGDVCYEEALGVASAITPVPGGVGPMTITMLLCNTLEAAKRI 350

BLAST of Cp4.1LG02g16580 vs. TAIR10
Match: AT4G00600.1 (AT4G00600.1 Amino acid dehydrogenase family protein)

HSP 1 Score: 285.0 bits (728), Expect = 5.3e-77
Identity = 134/223 (60.09%), Positives = 176/223 (78.92%), Query Frame = 1

Query: 78  EQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEINIEKDVDGFHPLNIGKL 137
           E  SE E++  +   N +P VHG+LVQLPLP H++E+ +L+ ++IEKDVDGFHPLNIG+L
Sbjct: 88  EDSSEEEVLKYVSGFNDDPSVHGVLVQLPLPSHMDEQNILNAVSIEKDVDGFHPLNIGRL 147

Query: 138 AMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRSNIVGLPVSLLLLKADAT 197
           AM+GR+PLF+PCTPK    GC+ELL R  I  +GK+AVV+GRSNIVG+P +LLL K DAT
Sbjct: 148 AMRGREPLFVPCTPK----GCIELLHRYNIEFKGKRAVVIGRSNIVGMPAALLLQKEDAT 207

Query: 198 VTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVIDVGTNAVDDPTRKSGYR 257
           V+I+HSR++NPE + R+ADI+I+A G+  M++GSWIKPGA +IDVG   V+DP+   G R
Sbjct: 208 VSIIHSRTMNPEELTRQADILISAVGKPNMVRGSWIKPGAVLIDVGIKPVEDPSAAGGER 267

Query: 258 LVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 301
           LVGD+ + EA K+A  +TPVPG VGPMT+AMLL NTL  AKR+
Sbjct: 268 LVGDICYVEASKIASAITPVPGDVGPMTIAMLLSNTLTSAKRI 306

BLAST of Cp4.1LG02g16580 vs. NCBI nr
Match: gi|659108749|ref|XP_008454369.1| (PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis melo])

HSP 1 Score: 550.4 bits (1417), Expect = 1.9e-153
Identity = 281/303 (92.74%), Positives = 293/303 (96.70%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MASES+HKATIIDGK+IAQTVRSE+AEEV KLSEKYGKVPGLAVVIVG+RKDS +YVNMK
Sbjct: 1   MASESDHKATIIDGKKIAQTVRSEVAEEVNKLSEKYGKVPGLAVVIVGNRKDSLTYVNMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKAC EVGIKSF+ DLPEQVSEAELISK+HELNANPEVHGILVQLPLP HINEEKVLSEI
Sbjct: 61  RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +IEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GCLELLSRSGISIRGKKAVVMGRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCLELLSRSGISIRGKKAVVMGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVTIVHSRSV+PESVIREADI+IAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAVDDPT+KSGYRLVGDVDFQEACKVAGW+TPVPGGVGPMTVAMLLRNTLDGAKR 
Sbjct: 241 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRA 299

Query: 301 IEQ 304
           IEQ
Sbjct: 301 IEQ 299

BLAST of Cp4.1LG02g16580 vs. NCBI nr
Match: gi|449469104|ref|XP_004152261.1| (PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 548.5 bits (1412), Expect = 7.3e-153
Identity = 279/303 (92.08%), Positives = 293/303 (96.70%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MASES+HKATIIDGK+IAQTVRSEI EEV KLS+KYGK+PGLAVVIVG+RKDS +YVNMK
Sbjct: 1   MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKAC EVGIKSF+ DLPEQVSEAELISK+HELNANPEVHGILVQLPLP HINEEKVLSEI
Sbjct: 61  RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +IEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCIELLSRSGISIRGKKAVVMGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVTIVHSRSV+PESVIREADI+IAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAVDDPT+KSGYRLVGDVDFQEACKVAGW+TPVPGGVGPMTVAMLLRNTLDGAKRV
Sbjct: 241 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 299

Query: 301 IEQ 304
           IEQ
Sbjct: 301 IEQ 299

BLAST of Cp4.1LG02g16580 vs. NCBI nr
Match: gi|700197629|gb|KGN52787.1| (hypothetical protein Csa_4G001540 [Cucumis sativus])

HSP 1 Score: 548.5 bits (1412), Expect = 7.3e-153
Identity = 279/303 (92.08%), Positives = 293/303 (96.70%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MASES+HKATIIDGK+IAQTVRSEI EEV KLS+KYGK+PGLAVVIVG+RKDS +YVNMK
Sbjct: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKAC EVGIKSF+ DLPEQVSEAELISK+HELNANPEVHGILVQLPLP HINEEKVLSEI
Sbjct: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 194

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +IEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCIELLSRSGISIRGKKAVVMGRS 254

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVTIVHSRSV+PESVIREADI+IAAAGQAQMIKGSWIKPGAAVI
Sbjct: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNAVDDPT+KSGYRLVGDVDFQEACKVAGW+TPVPGGVGPMTVAMLLRNTLDGAKRV
Sbjct: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 373

Query: 301 IEQ 304
           IEQ
Sbjct: 375 IEQ 373

BLAST of Cp4.1LG02g16580 vs. NCBI nr
Match: gi|802688671|ref|XP_012082722.1| (PREDICTED: bifunctional protein FolD 2 [Jatropha curcas])

HSP 1 Score: 520.4 bits (1339), Expect = 2.1e-144
Identity = 258/303 (85.15%), Positives = 287/303 (94.72%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS S+HKA +IDGK IAQT+RSEIA+EV++LSEKYGKVPGLAVVIVG RKDSQSYV+MK
Sbjct: 1   MASPSDHKAAVIDGKAIAQTIRSEIADEVRQLSEKYGKVPGLAVVIVGHRKDSQSYVSMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKAC EVGIKSF  DLPEQ+SEAELISK+HELNANP+VHGILVQLPLPKHINEE +LSEI
Sbjct: 61  RKACVEVGIKSFGVDLPEQISEAELISKVHELNANPDVHGILVQLPLPKHINEENILSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           ++EKDVDGFHPLNIGKLAMKGR+PLF+PCTPK    GCLELLSRSGISI+GK AVV+GRS
Sbjct: 121 SLEKDVDGFHPLNIGKLAMKGREPLFVPCTPK----GCLELLSRSGISIKGKNAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVTIVHSR+ +PES+IREADI+IAAAGQA+M+KGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRTDDPESIIREADIIIAAAGQAKMVKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNA+DDP+RKSGYRLVGDVD++EACKVAGW+TPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAIDDPSRKSGYRLVGDVDYEEACKVAGWITPVPGGVGPMTVAMLLRNTVDGAKRV 299

Query: 301 IEQ 304
             Q
Sbjct: 301 FGQ 299

BLAST of Cp4.1LG02g16580 vs. NCBI nr
Match: gi|645258763|ref|XP_008235037.1| (PREDICTED: bifunctional protein FolD 2-like [Prunus mume])

HSP 1 Score: 520.0 bits (1338), Expect = 2.8e-144
Identity = 262/303 (86.47%), Positives = 286/303 (94.39%), Query Frame = 1

Query: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60
           MAS+S+HKA IIDGK IAQT+R+EIAEEV+ LS+KYGKVPGLAVVIVG+RKDSQSYV+MK
Sbjct: 1   MASQSDHKAAIIDGKAIAQTIRNEIAEEVRHLSQKYGKVPGLAVVIVGNRKDSQSYVSMK 60

Query: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120
           RKACAEVGI S D DLPE VS+ +LI+K+HELNANP+VHGILVQLPLPKHINEEKVLSEI
Sbjct: 61  RKACAEVGILSLDIDLPEDVSQVDLIAKVHELNANPDVHGILVQLPLPKHINEEKVLSEI 120

Query: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAITNGCLELLSRSGISIRGKKAVVMGRS 180
           +IEKDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GCLELLSRSGISI+GKKAVV+GRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLSRSGISIKGKKAVVVGRS 180

Query: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240
           NIVGLPVSLLLLKADATVT+VHS S +PES+IREADI+IAAAGQA MIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTVVHSHSHDPESIIREADIIIAAAGQAMMIKGSWIKPGAAVI 240

Query: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 300
           DVGTNA+DD +RKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLL+NTLDGAKRV
Sbjct: 241 DVGTNAIDDSSRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLKNTLDGAKRV 299

Query: 301 IEQ 304
           I Q
Sbjct: 301 IAQ 299
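
The Identity and Positives figures in each HSP header can be read off the middle line of the alignment. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter for an identical residue, "+" for a conservative substitution, a space otherwise). The function names hsp_block_stats and percent are illustrative and are not part of any BLAST tool.

```python
def hsp_block_stats(query: str, midline: str, sbjct: str) -> dict:
    """Count identities, positives and gaps in one aligned BLAST block.

    Assumed midline convention: a letter marks an identical residue,
    '+' marks a conservative substitution, a space marks anything else.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("query, midline and subject must have equal length")
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives include the identical residues plus the '+' columns.
    positives = identities + midline.count("+")
    gaps = query.count("-") + sbjct.count("-")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
    }


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * count / total, 2)


if __name__ == "__main__":
    # Last block of the Prunus mume alignment (3 columns).
    print(hsp_block_stats("IEQ", "I Q", "IAQ"))
    # Header counts reported for the Jatropha curcas and Prunus mume HSPs.
    print(percent(258, 303), percent(287, 303))
    print(percent(262, 303), percent(286, 303))
```

Summing the per-block counts over all blocks of an HSP should give the numerators in its header line; the denominator (303 here) is the alignment length including gap columns.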

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
FOLD2_ARATH	1.2e-136	81.67	Bifunctional protein FolD 2 OS=Arabidopsis thaliana GN=FOLD2 PE=2 SV=1
FOLD4_ARATH	9.0e-103	60.40	Bifunctional protein FolD 4, chloroplastic OS=Arabidopsis thaliana GN=FOLD4 PE=1...
FOLD1_ARATH	3.3e-97	58.92	Bifunctional protein FolD 1, mitochondrial OS=Arabidopsis thaliana GN=FOLD1 PE=2...
FOLD_MAGMM	9.1e-79	54.30	Bifunctional protein FolD OS=Magnetococcus marinus (strain ATCC BAA-1437 / JCM 1...
FOLD_GEOUR	1.5e-78	54.30	Bifunctional protein FolD OS=Geobacter uraniireducens (strain Rf4) GN=folD PE=3 ...

Match Name	E-value	Identity	Description
A0A0A0KWH1_CUCSA	5.1e-153	92.08	Uncharacterized protein OS=Cucumis sativus GN=Csa_4G001540 PE=3 SV=1
A0A067K8J8_JATCU	1.5e-144	85.15	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_13900 PE=3 SV=1
M5VKQ0_PRUPE	2.5e-144	86.80	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009269mg PE=3 SV=1
F6HZ29_VITVI	4.3e-144	86.47	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g00070 PE=3 SV=...
A0A068UM70_COFCA	2.1e-143	83.87	Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00030020001 PE=3 SV=1

Match Name	E-value	Identity	Description
AT3G12290.1	7.0e-138	81.67	Amino acid dehydrogenase family protein
AT4G00620.1	5.1e-104	60.40	Amino acid dehydrogenase family protein
AT2G38660.1	1.9e-98	58.92	Amino acid dehydrogenase family protein
AT4G00600.1	5.3e-77	60.09	Amino acid dehydrogenase family protein

Match Name	E-value	Identity	Description
gi|659108749|ref|XP_008454369.1|	1.9e-153	92.74	PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis melo]
gi|449469104|ref|XP_004152261.1|	7.3e-153	92.08	PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis sativus]
gi|700197629|gb|KGN52787.1|	7.3e-153	92.08	hypothetical protein Csa_4G001540 [Cucumis sativus]
gi|802688671|ref|XP_012082722.1|	2.1e-144	85.15	PREDICTED: bifunctional protein FolD 2 [Jatropha curcas]
gi|645258763|ref|XP_008235037.1|	2.8e-144	86.47	PREDICTED: bifunctional protein FolD 2-like [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term	Definition
GO:0055114	oxidation-reduction process
GO:0009396	folic acid-containing compound biosynthetic process

Vocabulary: Molecular Function
Term	Definition
GO:0004488	methylenetetrahydrofolate dehydrogenase (NADP+) activity
GO:0003824	catalytic activity

Vocabulary: INTERPRO
Term	Definition
IPR020867	THF_DH/CycHdrlase_CS
IPR020631	THF_DH/CycHdrlase_NAD-bd_dom
IPR020630	THF_DH/CycHdrlase_cat_dom
IPR016040	NAD(P)-bd_dom
IPR000672	THF_DH/CycHdrlase

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030244 cellulose biosynthetic process
biological_process GO:0019344 cysteine biosynthetic process
biological_process GO:0009396 folic acid-containing compound biosynthetic process
biological_process GO:0046487 glyoxylate metabolic process
biological_process GO:0048193 Golgi vesicle transport
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005829 cytosol
cellular_component GO:0005575 cellular_component
molecular_function GO:0004477 methenyltetrahydrofolate cyclohydrolase activity
molecular_function GO:0004488 methylenetetrahydrofolate dehydrogenase (NADP+) activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG02g16580.1	Cp4.1LG02g16580.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000672	Tetrahydrofolate dehydrogenase/cyclohydrolase	PRINTS	PR00085	THFDHDRGNASE	coord: 116..137 score: 3.6E-82; coord: 40..62 score: 3.6E-82; coord: 216..245 score: 3.6E-82; coord: 81..108 score: 3.6E-82; coord: 274..292 score: 3.6E-82; coord: 167..187 score: 3.6E-82; coord: 257..273 score: 3.6
IPR000672	Tetrahydrofolate dehydrogenase/cyclohydrolase	HAMAP	MF_01576	THF_DHG_CYH	coord: 9..301 score: 36
IPR016040	NAD(P)-binding domain	GENE3D	G3DSA:3.40.50.720		coord: 148..282 score: 3.2
IPR016040	NAD(P)-binding domain	unknown	SSF51735	NAD(P)-binding Rossmann-fold domains	coord: 129..296 score: 2.77
IPR020630	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain	PFAM	PF00763	THF_DHG_CYH	coord: 11..127 score: 6.3
IPR020631	Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain	PFAM	PF02882	THF_DHG_CYH_C	coord: 130..299 score: 5.1
IPR020867	Tetrahydrofolate dehydrogenase/cyclohydrolase, conserved site	PROSITE	PS00766	THF_DHG_CYH_1	coord: 82..107 scor
IPR020867	Tetrahydrofolate dehydrogenase/cyclohydrolase, conserved site	PROSITE	PS00767	THF_DHG_CYH_2	coord: 278..286 scor
None	No IPR available	GENE3D	G3DSA:3.40.192.10		coord: 13..147 score: 6.6
None	No IPR available	PANTHER	PTHR10025	TETRAHYDROFOLATE DEHYDROGENASE/CYCLOHYDROLASE FAMILY MEMBER	coord: 1..303 score: 7.3E
None	No IPR available	PANTHER	PTHR10025:SF33	BIFUNCTIONAL PROTEIN FOLD 2	coord: 1..303 score: 7.3E
None	No IPR available	unknown	SSF53223	Aminoacid dehydrogenase-like, N-terminal domain	coord: 9..128 score: 8.15