CsGy4G000520 (gene) Cucumber (Gy14) v2

Name: CsGy4G000520
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Bifunctional protein FolD 2
Location: Chr4 : 314587 .. 318419 (+)
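
The location field can be read programmatically. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which is common for GFF-style annotations but is not stated in this record.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr4 : 314587 .. 318419 (+)'
    into (chromosome, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr4 : 314587 .. 318419 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 3833 bases on the plus strand of Chr4.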
The following sequences are available for this feature:

Gene sequence (with intron)

TGGGAAGAGTATAGTAATAAATATGTTTTGGTTGCGGTCTTGGGATACCAATCTGGAACCTTAGTCCCCGTAGCTTTAGAAAGTCACGCTCATCGTAATTACCCAAAACAAATGACTTAGATTTGGAATCCTTTCTCTTCGGCTGAAGAAAATGGGCCACACAAAATTGTGCACGGACGTACCCATTTCTCGCCAAACACAGTCAGATTGCCCAAATCCAAGCCCATCCACCTATTCCCATGCCTGCATCAACTCACTGATTATACACAATCAATCCTTCTCAATTTCAGCTTCTCTATTACATGCCATTGCCACTGGCAGCTGCAAATTCAATTCGGAAGTAGTAACCATTTTCTTCTTCTCACCTGCAAAAATGGCGTCCGAATCCGACCACAAGGCTACAATCATCGACGGCAAAAAGATTGCACAAACTGTCAGATCTGAAATCACGGAGGAAGTGAACAAACTCTCTCAGAAATATGGAAAGGTTTAGTTTCCTTTGCTCCAGTCAAATTCTTACAATTTCCATCAACTCTTTTGAGATATTATGTTTTATATCTATCTGATTTTAGCATTTTTCTTGTTTCATATGGTGTGGGATTTGTGAAGATTCCAGGACTGGCGGTGGTGATTGTGGGGAATAGAAAAGATTCGCTTACCTATGTAAATATGAAGAGGAAGGCGTGCCTGGAAGTAGGGATCAAGTCTTTTGAGATTGATCTTCCTGAGCAAGTGTCTGAGGCTGAATTGATCAGCAAAGTTCACGAGCTTAATGCCAATCCTGAAGTGCATGGTTAATCTCTTCTTTACTTTTATGCTCAGGATTTCTTCTAATTTTAGTGCTGTAGTTCAAATTCGATGGAGTCTCCACTGTTTTTGGAATAGTACAATGAAGATGCATTTCTGATATATAATTGTTCTTTGATTCTTTTTACCTTTCTAAAGCAATGTGATTCATTGAAGAAAAAGCACCCTTTTAGTGCTTCTAGAGTGATTTAAAATATTATAAGTGTTTATAGTAACTTTCAAAGTTATTTTCTGTGTCAATTTGTTTTTTTTTTTAAGAAAAAAGTTCTTCAAATGGCTCAAAAGTACTTTTTCAGAGTTTTAAAGGTACCAATTCGTGCATAACTCTGTTGATAATGATATCAATTACCATCTCTAAAGTTGTTGAACTCAAAAGTAATTTACAATTTTTGAAGAACTTTGTTTCTTTGACAGGGATATTGGTTCAGCTTCCTTTGCCGAACCACATAAATGAAGAGAAAGTTCTATCTGAAATTAGCATTGAAAAGGATGTAGATGGCTTTCATCCCCTGAACATTGGGAAGCTTGCAATGAAAGGCAGAGATCCCTTGTTCCTACCTTGCACTCCGAAGGCAATAATGAATGTTTTATTAGCCTCAGAAAATCATTCAACTCATTCTTGATAGATCTCTTTTAGCTAGAAGAGCAATATATCCTTGTTTATCTATGTTCAATATTTGGTTTCATGCATATCTTAGCCCAAAAAAGGCAATTCATTTCATAGTCTGAGATATCGTTTTATGTGCAGGGGTGTATTGAACTGCTATCGCGAAGTGGAATAAGCATCAGAGGAAAGAAAGCAGTTGTAATGGGGCGAAGCAACATTGTTGGATTACCAGTTTCATTGTTGCTTCTTAAAGCAGATGCAACTGTGACAATCGTCCATTCACGTTCCGTCGATCCAGAAAGTGTTATCCGCGAAGCAGACATTATAATTGCTGCAGCAGGACAGGCACAAATGGTATAATTTGGCTATACAACCTCAGTGTTTTCTCTTTCCCCTTTACTTTTATCTGAATATCAGCATTTGTCTCTAAACTTAGGGAATATGTATAGGTGCTAGAATAGTCCTGGAAACGAAAAAAGGTCAGATCCATAGGTATAATTAAAACTTTTCTTAGAAATTTTAAATTTTATTCTACATAGTGCATGTTGTTAACTAAACGATGACACATCTTCAATGATTTGATGCA
AAGTGGTCAGCAAAATTTTCTTATATCATCTCTCTCTCTCTTTGTGTTATTGGAATCCGCCCATTTTTCCCCCAATAAGCATCTACCTCATTATTCAGTCAACAATGGTAAATCAAACAAGTTAGGGAAAAATATTTGAGAGAATTAAGCAAAAGCACAATCATCAAAGTTGGTACTTCAAGCAATACTTTGCCATTTGCATATGCATGATAGAACAATAACTGCAAAAGCATTAAGAAACAGGACTGTAGGGAATATGATTTAGTTGACTTGTTGAAACATCCTAGTTTTGAGACTAACTTGTGCACATGGTGATTTCGTTTGGTTGATGTTAGATCAAGGGGAGTTGGATTAAACCAGGTGCTGCAGTTATCGATGTCGGTACAAATGCAGTTGATGATCCAACCAAGAAGTCAGGCTACCGGCTGGTCGGTGATGTAGATTTCCAGGAAGCTTGTAAAGTAGCTGGTTGGATAACTCCGGTTCCCGGTGGCGTAGGTCCTATGACCGTTGCCATGCTGCTTAGGAATACTCTGGATGGGGCTAAGCGTGTGATCGAGCAGTGAAAAAGATACAGCCAGCTTGTTACCGATCCTCGTGGGACTTTTTCGGAGTAATTTCAAAATAAATGGTTCGTTTGTTTTGTGCTATCTTTTTTTTGTACTAATAATAGTTGATTTTTCATTGAAAATAATGGAGTAATTAAAATCAAATCATAACACCCCAATCTTGAAAAATCTAAGATAAGTTTTCTTAAAGTACCAACATAAATCATCAATAATCATGGTGTATGATTTGATTTGATTTGATTTCCCTGTAATCTAGAAAAGCATTTGAGGTGGAAGTTATGAGTTCACAAAATGAGAGAAGCTGTTGATTTTGCAGAAAAACCTCCATTTCCTTCTTGTCTAAGCTGACTATGGCCTCAATCCCTTCACCATCTTTGGAATCCATCAACAAAATCATATTCTTCCATGGAAACTCAGGTACAGTGATCCATACAGGTTTCCCCCTTCCAAAATCCGCTTCGTAAATTGGAAATTTACACCAACTACTACATGTATACAAATTATGATCCTCTTGATTTATCATCCTGTCTTCCATTGATTCTTTGGCATGTAATTTGTGAAGCAAACCCCATTCTTCAGCTCTGTAATTTGTTGGAAACGTTTTGCAGAACTCTTCAAAGTTTCTCTTCGTCTCACCCACTAAGTTCCATAGCTCCATTTCCTTCTTCTCCGGCATCCTCGACAATCCTATGAACCAAGAGATTACATTTCCTGATAATGTCGCCGGCAATGGCGGATCCACTCTGTTTCGTAAGTTGATGATTTGAAGTAGAATTGTTGCTGCCACATTGCCTTTGATTCAAGAAATTAACCACAAAGGTTTCGTTAATCCCAATAGAAATCCAAATGGGAAAATACTCCAATAATTGACCCCAAAACCAAAAACTTTGAACATAGAATATTCTTTCGATATGTGATGCTTTGTTTTGGTAACATAAGAGAGAAACACAGATAGGAGAAACCCAAAATCTAAAGGGAAAAGAAGTGAGAAGAGATTCATTTGAAATTGAGGGCTAAATGGCTATTCCATACCTTCAAAATATTAATTATAAACAAACATGTTTCCTAAAAGAAGTCTTTTTTTTTTTCTTAACAAAATCCATTTTTCTTGCAAGTTTATTTGAAAAAAAATTATAGATAAGGGATTGAAGTACTATTAAAGTTCGCATATTTATCAAATAACTCCCTCTTACAACCGTACACCTAAACATAAATTTACTAACTCCAACATATTAACTACTCTAGATTTATAAACCACCCGTAT

mRNA sequence

TGGGAAGAGTATAGTAATAAATATGTTTTGGTTGCGGTCTTGGGATACCAATCTGGAACCTTAGTCCCCGTAGCTTTAGAAAGTCACGCTCATCGTAATTACCCAAAACAAATGACTTAGATTTGGAATCCTTTCTCTTCGGCTGAAGAAAATGGGCCACACAAAATTGTGCACGGACGTACCCATTTCTCGCCAAACACAGTCAGATTGCCCAAATCCAAGCCCATCCACCTATTCCCATGCCTGCATCAACTCACTGATTATACACAATCAATCCTTCTCAATTTCAGCTTCTCTATTACATGCCATTGCCACTGGCAGCTGCAAATTCAATTCGGAAGTAGTAACCATTTTCTTCTTCTCACCTGCAAAAATGGCGTCCGAATCCGACCACAAGGCTACAATCATCGACGGCAAAAAGATTGCACAAACTGTCAGATCTGAAATCACGGAGGAAGTGAACAAACTCTCTCAGAAATATGGAAAGATTCCAGGACTGGCGGTGGTGATTGTGGGGAATAGAAAAGATTCGCTTACCTATGTAAATATGAAGAGGAAGGCGTGCCTGGAAGTAGGGATCAAGTCTTTTGAGATTGATCTTCCTGAGCAAGTGTCTGAGGCTGAATTGATCAGCAAAGTTCACGAGCTTAATGCCAATCCTGAACTTCCTTTGCCGAACCACATAAATGAAGAGAAAGTTCTATCTGAAATTAGCATTGAAAAGGATGTAGATGGCTTTCATCCCCTGAACATTGGGAAGCTTGCAATGAAAGGCAGAGATCCCTTGTTCCTACCTTGCACTCCGAAGGCAATAATGAATGGGTGTATTGAACTGCTATCGCGAAGTGGAATAAGCATCAGAGGAAAGAAAGCAGTTGTAATGGGGCGAAGCAACATTGTTGGATTACCAGTTTCATTGTTGCTTCTTAAAGCAGATGCAACTGTGACAATCGTCCATTCACGTTCCGTCGATCCAGAAAGTGTTATCCGCGAAGCAGACATTATAATTGCTGCAGCAGGACAGGCACAAATGATCAAGGGGAGTTGGATTAAACCAGGTGCTGCAGTTATCGATGTCGGTACAAATGCAGTTGATGATCCAACCAAGAAGTCAGGCTACCGGCTGGTCGGTGATGTAGATTTCCAGGAAGCTTGTAAAGTAGCTGGTTGGATAACTCCGGTTCCCGGTGGCGTAGGTCCTATGACCGTTGCCATGCTGCTTAGGAATACTCTGGATGGGGCTAAGCGTGTGATCGAGCAGTGAAAAAGATACAGCCAGCTTGTTACCGATCCTCGTGGGACTTTTTCGGAGTAATTTCAAAATAAATGAAAAACCTCCATTTCCTTCTTGTCTAAGCTGACTATGGCCTCAATCCCTTCACCATCTTTGGAATCCATCAACAAAATCATATTCTTCCATGGAAACTCAGAACTCTTCAAAGTTTCTCTTCGTCTCACCCACTAAGTTCCATAGCTCCATTTCCTTCTTCTCCGGCATCCTCGACAATCCTATGAACCAAGAGATTACATTTCCTGATAATGTCGCCGGCAATGGCGGATCCACTCTGTTTCGTAAGTTGATGATTTGAAGTAGAATTGTTGCTGCCACATTGCCTTTGATTCAAGAAATTAACCACAAAGGTTTCGTTAATCCCAATAGAAATCCAAATGGGAAAATACTCCAATAATTGACCCCAAAACCAAAAACTTTGAACATAGAATATTCTTTCGATATGTGATGCTTTGTTTTGGTAACATAAGAGAGAAACACAGATAGGAGAAACCCAAAATCTAAAGGGAAAAGAAGTGAGAAGAGATTCATTTGAAATTGAGGGCTAAATGGCTATTCCATACCTTCAAAATATTAATTATAAACAAACATGTTTCCTAAAAGAAGTCTTTTTTTTTTTCTTAACAAAATCCATTTTTCTTGCAAGTTTATTTGAAAAAAAATTATAGATAAGGGATTGAAGTACTATTAAAGTTCGCATATTTATCAA
ATAACTCCCTCTTACAACCGTACACCTAAACATAAATTTACTAACTCCAACATATTAACTACTCTAGATTTATAAACCACCCGTAT

Coding sequence (CDS)

ATGGGCCACACAAAATTGTGCACGGACGTACCCATTTCTCGCCAAACACAGTCAGATTGCCCAAATCCAAGCCCATCCACCTATTCCCATGCCTGCATCAACTCACTGATTATACACAATCAATCCTTCTCAATTTCAGCTTCTCTATTACATGCCATTGCCACTGGCAGCTGCAAATTCAATTCGGAAGTAGTAACCATTTTCTTCTTCTCACCTGCAAAAATGGCGTCCGAATCCGACCACAAGGCTACAATCATCGACGGCAAAAAGATTGCACAAACTGTCAGATCTGAAATCACGGAGGAAGTGAACAAACTCTCTCAGAAATATGGAAAGATTCCAGGACTGGCGGTGGTGATTGTGGGGAATAGAAAAGATTCGCTTACCTATGTAAATATGAAGAGGAAGGCGTGCCTGGAAGTAGGGATCAAGTCTTTTGAGATTGATCTTCCTGAGCAAGTGTCTGAGGCTGAATTGATCAGCAAAGTTCACGAGCTTAATGCCAATCCTGAACTTCCTTTGCCGAACCACATAAATGAAGAGAAAGTTCTATCTGAAATTAGCATTGAAAAGGATGTAGATGGCTTTCATCCCCTGAACATTGGGAAGCTTGCAATGAAAGGCAGAGATCCCTTGTTCCTACCTTGCACTCCGAAGGCAATAATGAATGGGTGTATTGAACTGCTATCGCGAAGTGGAATAAGCATCAGAGGAAAGAAAGCAGTTGTAATGGGGCGAAGCAACATTGTTGGATTACCAGTTTCATTGTTGCTTCTTAAAGCAGATGCAACTGTGACAATCGTCCATTCACGTTCCGTCGATCCAGAAAGTGTTATCCGCGAAGCAGACATTATAATTGCTGCAGCAGGACAGGCACAAATGATCAAGGGGAGTTGGATTAAACCAGGTGCTGCAGTTATCGATGTCGGTACAAATGCAGTTGATGATCCAACCAAGAAGTCAGGCTACCGGCTGGTCGGTGATGTAGATTTCCAGGAAGCTTGTAAAGTAGCTGGTTGGATAACTCCGGTTCCCGGTGGCGTAGGTCCTATGACCGTTGCCATGCTGCTTAGGAATACTCTGGATGGGGCTAAGCGTGTGATCGAGCAGTGA

Protein sequence

MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKFNSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPELPLPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRVIEQ
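
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch, not the pipeline used to produce this record: it assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first and last few codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGGGCCACACAAAATTGTGCACGGACGTA"))
# Last six codons of the CDS, ending in the TGA stop codon.
print(translate("CGTGTGATCGAGCAGTGA"))
```

The two calls give `MGHTKLCTDV` and `RVIEQ`, which match the start and end of the protein sequence shown above.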
BLAST of CsGy4G000520 vs. NCBI nr
Match: KGN52787.1 (hypothetical protein Csa_4G001540 [Cucumis sativus])

HSP 1 Score: 705.3 bits (1819), Expect = 1.1e-199
Identity = 366/377 (97.08%), Positives = 366/377 (97.08%), Query Frame = 0

Query: 1   MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKF 60
           MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKF
Sbjct: 1   MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKF 60

Query: 61  NSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVI 120
           NSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVI
Sbjct: 61  NSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVI 120

Query: 121 VGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LP 180
           VGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE       LP
Sbjct: 121 VGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLP 180

Query: 181 LPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSG 240
           LPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GCIELLSRSG
Sbjct: 181 LPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCIELLSRSG 240

Query: 241 ISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQ 300
           ISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQ
Sbjct: 241 ISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQ 300

Query: 301 MIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTV 360
           MIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTV
Sbjct: 301 MIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTV 360

Query: 361 AMLLRNTLDGAKRVIEQ 371
           AMLLRNTLDGAKRVIEQ
Sbjct: 361 AMLLRNTLDGAKRVIEQ 373
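
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the summary text. It is illustrative only: `parse_hsp_summary` is a hypothetical helper, not part of any BLAST toolkit, and the regular expression accepts both "Positives" and the "Postives" spelling that appears in some exports of this page.

```python
import re

# Matches e.g. "Identity = 286/303 (94.39%), Positives = 290/303 (95.71%)".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp_summary(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

line = "Identity = 366/377 (97.08%), Positives = 366/377 (97.08%), Query Frame = 0"
print(parse_hsp_summary(line))
```

For the KGN52787.1 hit above, 366 of 377 aligned positions gives 97.08% for both identity and positives.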

BLAST of CsGy4G000520 vs. NCBI nr
Match: XP_004152261.1 (PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 557.8 bits (1436), Expect = 2.8e-155
Identity = 292/303 (96.37%), Positives = 292/303 (96.37%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK
Sbjct: 1   MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LPLPNHINEEKVLSEI 194
           RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE       LPLPNHINEEKVLSEI
Sbjct: 61  RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GCIELLSRSGISIRGKKAVVMGRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCIELLSRSGISIRGKKAVVMGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV
Sbjct: 241 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 299

BLAST of CsGy4G000520 vs. NCBI nr
Match: XP_008454369.1 (PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis melo])

HSP 1 Score: 551.6 bits (1420), Expect = 2.0e-153
Identity = 286/303 (94.39%), Positives = 290/303 (95.71%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MASESDHKATIIDGKKIAQTVRSE+ EEVNKLS+KYGK+PGLAVVIVGNRKDSLTYVNMK
Sbjct: 1   MASESDHKATIIDGKKIAQTVRSEVAEEVNKLSEKYGKVPGLAVVIVGNRKDSLTYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LPLPNHINEEKVLSEI 194
           RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE       LPLPNHINEEKVLSEI
Sbjct: 61  RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCLELLSRSGISIRGKKAVVMGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKR 
Sbjct: 241 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRA 299

BLAST of CsGy4G000520 vs. NCBI nr
Match: XP_023524684.1 (bifunctional protein FolD 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 529.6 bits (1363), Expect = 8.3e-147
Identity = 272/303 (89.77%), Positives = 286/303 (94.39%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MASES+HKATIIDGK+IAQTVRSEI EEV KLS+KYGK+PGLAVVIVG+RKDS +YVNMK
Sbjct: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LPLPNHINEEKVLSEI 194
           RKAC EVGIKSF+ DLPEQVSEAELISK+HELNANPE       LPLP HINEEKVLSEI
Sbjct: 61  RKACAEVGIKSFDYDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           +IEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCLELLSRSGISIRGKKAVVMGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSRSV+PESVIREADI+IAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNAVDDPT+KSGYRLVGDVDFQEACKVAGW+TPVPGGVGPMTVAMLLRNTLDGAKRV
Sbjct: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEACKVAGWVTPVPGGVGPMTVAMLLRNTLDGAKRV 299

BLAST of CsGy4G000520 vs. NCBI nr
Match: XP_022997744.1 (bifunctional protein FolD 2 isoform X2 [Cucurbita maxima])

HSP 1 Score: 526.6 bits (1355), Expect = 7.0e-146
Identity = 272/303 (89.77%), Positives = 285/303 (94.06%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MASES+HKATIIDGK+IAQTVRSEI EEV KLS+KYGK+PGLAVVIVG+RKDS +YVNMK
Sbjct: 1   MASESEHKATIIDGKQIAQTVRSEIAEEVKKLSEKYGKVPGLAVVIVGSRKDSQSYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LPLPNHINEEKVLSEI 194
           RKAC EVGIKSF+ DLPEQVSEAELISK+HELNANPE       LPLP HINEEKVLSEI
Sbjct: 61  RKACAEVGIKSFDFDLPEQVSEAELISKIHELNANPEVHGILVQLPLPKHINEEKVLSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           +IEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 121 NIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCLELLSRSGISIRGKKAVVMGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSRSV+PESVIREADI+IAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVNPESVIREADIVIAAAGQAQMIKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNAVDDPT+KSGYRLVGDVDFQEA KVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV
Sbjct: 241 DVGTNAVDDPTRKSGYRLVGDVDFQEASKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 299

BLAST of CsGy4G000520 vs. TAIR10
Match: AT3G12290.1 (Amino acid dehydrogenase family protein)

HSP 1 Score: 463.8 bits (1192), Expect = 1.0e-130
Identity = 235/300 (78.33%), Positives = 264/300 (88.00%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MAS SDH A IIDGK IA T+RSEI EEV  LS+K+GK+PGLAVVIVG+RKDS TYVN K
Sbjct: 1   MASSSDHTAKIIDGKAIAHTIRSEIAEEVRGLSEKHGKVPGLAVVIVGSRKDSQTYVNTK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEI 194
           RKAC EVGIKSF++ LPE+VSEA+LISKVHELN+NP       +LPLP HINEE +L  I
Sbjct: 61  RKACAEVGIKSFDVGLPEEVSEADLISKVHELNSNPDVHGILVQLPLPKHINEEHILGAI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           SI+KDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GC+ELL+RSG+ I+G++AVV+GRS
Sbjct: 121 SIDKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLARSGVKIKGQRAVVVGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVT VHS + DPE++IREADI+IAA GQA MIKG+WIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTTVHSHTKDPEAIIREADIVIAACGQAHMIKGNWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 368
           DVGTNAV DP+KKSGYRLVGDVDF EA KVAG+ITPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAVSDPSKKSGYRLVGDVDFAEASKVAGFITPVPGGVGPMTVAMLLRNTVDGAKRV 296

BLAST of CsGy4G000520 vs. TAIR10
Match: AT4G00620.1 (Amino acid dehydrogenase family protein)

HSP 1 Score: 362.5 bits (929), Expect = 3.2e-100
Identity = 179/298 (60.07%), Positives = 232/298 (77.85%), Query Frame = 0

Query: 77  SESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMKRK 136
           ++S+  A +IDGK +A+ +R EIT EV+++ +  G IPGLAV++VG+RKDS TYV  K+K
Sbjct: 63  TKSEGGAIVIDGKAVAKKIRDEITIEVSRMKESIGVIPGLAVILVGDRKDSATYVRNKKK 122

Query: 137 ACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEISI 196
           AC  VGIKSFE+ L E  SE E++  V   N +P       +LPLP+H++E+ +L+ +SI
Sbjct: 123 ACDSVGIKSFEVRLAEDSSEEEVLKSVSGFNDDPSVHGILVQLPLPSHMDEQNILNAVSI 182

Query: 197 EKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRSNI 256
           EKDVDGFHPLNIG+LAM+GR+PLF+PCTPK    GCIELL R  I I+GK+AVV+GRSNI
Sbjct: 183 EKDVDGFHPLNIGRLAMRGREPLFVPCTPK----GCIELLHRYNIEIKGKRAVVIGRSNI 242

Query: 257 VGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVIDV 316
           VG+P +LLL + DATV+I+HSR+ +PE + READIII+A GQ  M++GSWIKPGA +IDV
Sbjct: 243 VGMPAALLLQREDATVSIIHSRTKNPEEITREADIIISAVGQPNMVRGSWIKPGAVLIDV 302

Query: 317 GTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 368
           G N V+DP+   GYRLVGD+ ++EA KVA  ITPVPGGVGPMT+AMLL NTL  AKR+
Sbjct: 303 GINPVEDPSAARGYRLVGDICYEEASKVASAITPVPGGVGPMTIAMLLSNTLTSAKRI 356

BLAST of CsGy4G000520 vs. TAIR10
Match: AT2G38660.1 (Amino acid dehydrogenase family protein)

HSP 1 Score: 340.1 bits (871), Expect = 1.7e-93
Identity = 169/303 (55.78%), Positives = 225/303 (74.26%), Query Frame = 0

Query: 72  PAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYV 131
           P  ++ E++ K  +IDG  IA+ +R++I  EV K+ +  GK+PGLAVV+VG ++DS TYV
Sbjct: 52  PPPVSFETEQKTVVIDGNVIAEEIRTKIISEVGKMKKAVGKVPGLAVVLVGEQRDSQTYV 111

Query: 132 NMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVL 191
             K KAC E GIKS   +LPE  +E ++IS + + N +        +LPLP H+NE K+L
Sbjct: 112 RNKIKACEETGIKSVLAELPEDCTEGQIISVLRKFNEDTSIHGILVQLPLPQHLNESKIL 171

Query: 192 SEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVM 251
           + + +EKDVDGFHPLN+G LAM+GR+PLF+ CTPK    GC+ELL R+G+ I GK AVV+
Sbjct: 172 NMVRLEKDVDGFHPLNVGNLAMRGREPLFVSCTPK----GCVELLIRTGVEIAGKNAVVI 231

Query: 252 GRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGA 311
           GRSNIVGLP+SLLL + DATV+ VH+ + DPE + R+ADI+IAAAG   +++GSW+KPGA
Sbjct: 232 GRSNIVGLPMSLLLQRHDATVSTVHAFTKDPEHITRKADIVIAAAGIPNLVRGSWLKPGA 291

Query: 312 AVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGA 368
            VIDVGT  V+D + + GYRLVGDV ++EA  VA  ITPVPGGVGPMT+ MLL NTL+ A
Sbjct: 292 VVIDVGTTPVEDSSCEFGYRLVGDVCYEEALGVASAITPVPGGVGPMTITMLLCNTLEAA 350

BLAST of CsGy4G000520 vs. TAIR10
Match: AT4G00600.1 (Amino acid dehydrogenase family protein)

HSP 1 Score: 275.8 bits (704), Expect = 3.9e-74
Identity = 145/298 (48.66%), Positives = 194/298 (65.10%), Query Frame = 0

Query: 77  SESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMKRK 136
           S S   A +IDGK  A+ +R +I  EV+++ +  G +P                      
Sbjct: 49  SSSSPSAIVIDGKAEAKKIRDDIKIEVSRMKESIGVVPA--------------------- 108

Query: 137 ACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEISI 196
                          E  SE E++  V   N +P       +LPLP+H++E+ +L+ +SI
Sbjct: 109 ---------------EDSSEEEVLKYVSGFNDDPSVHGVLVQLPLPSHMDEQNILNAVSI 168

Query: 197 EKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRSNI 256
           EKDVDGFHPLNIG+LAM+GR+PLF+PCTPK    GCIELL R  I  +GK+AVV+GRSNI
Sbjct: 169 EKDVDGFHPLNIGRLAMRGREPLFVPCTPK----GCIELLHRYNIEFKGKRAVVIGRSNI 228

Query: 257 VGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVIDV 316
           VG+P +LLL K DATV+I+HSR+++PE + R+ADI+I+A G+  M++GSWIKPGA +IDV
Sbjct: 229 VGMPAALLLQKEDATVSIIHSRTMNPEELTRQADILISAVGKPNMVRGSWIKPGAVLIDV 288

Query: 317 GTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 368
           G   V+DP+   G RLVGD+ + EA K+A  ITPVPG VGPMT+AMLL NTL  AKR+
Sbjct: 289 GIKPVEDPSAAGGERLVGDICYVEASKIASAITPVPGDVGPMTIAMLLSNTLTSAKRI 306

BLAST of CsGy4G000520 vs. Swiss-Prot
Match: sp|Q9LHH7|FOLD2_ARATH (Bifunctional protein FolD 2 OS=Arabidopsis thaliana OX=3702 GN=FOLD2 PE=2 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 1.8e-129
Identity = 235/300 (78.33%), Positives = 264/300 (88.00%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MAS SDH A IIDGK IA T+RSEI EEV  LS+K+GK+PGLAVVIVG+RKDS TYVN K
Sbjct: 1   MASSSDHTAKIIDGKAIAHTIRSEIAEEVRGLSEKHGKVPGLAVVIVGSRKDSQTYVNTK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEI 194
           RKAC EVGIKSF++ LPE+VSEA+LISKVHELN+NP       +LPLP HINEE +L  I
Sbjct: 61  RKACAEVGIKSFDVGLPEEVSEADLISKVHELNSNPDVHGILVQLPLPKHINEEHILGAI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           SI+KDVDGFHPLNIGKLAMKGR+PLFLPCTPK    GC+ELL+RSG+ I+G++AVV+GRS
Sbjct: 121 SIDKDVDGFHPLNIGKLAMKGREPLFLPCTPK----GCLELLARSGVKIKGQRAVVVGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVT VHS + DPE++IREADI+IAA GQA MIKG+WIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTTVHSHTKDPEAIIREADIVIAACGQAHMIKGNWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 368
           DVGTNAV DP+KKSGYRLVGDVDF EA KVAG+ITPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAVSDPSKKSGYRLVGDVDFAEASKVAGFITPVPGGVGPMTVAMLLRNTVDGAKRV 296

BLAST of CsGy4G000520 vs. Swiss-Prot
Match: sp|O65271|FOLD4_ARATH (Bifunctional protein FolD 4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=FOLD4 PE=1 SV=1)

HSP 1 Score: 362.5 bits (929), Expect = 5.7e-99
Identity = 179/298 (60.07%), Positives = 232/298 (77.85%), Query Frame = 0

Query: 77  SESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMKRK 136
           ++S+  A +IDGK +A+ +R EIT EV+++ +  G IPGLAV++VG+RKDS TYV  K+K
Sbjct: 63  TKSEGGAIVIDGKAVAKKIRDEITIEVSRMKESIGVIPGLAVILVGDRKDSATYVRNKKK 122

Query: 137 ACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEISI 196
           AC  VGIKSFE+ L E  SE E++  V   N +P       +LPLP+H++E+ +L+ +SI
Sbjct: 123 ACDSVGIKSFEVRLAEDSSEEEVLKSVSGFNDDPSVHGILVQLPLPSHMDEQNILNAVSI 182

Query: 197 EKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRSNI 256
           EKDVDGFHPLNIG+LAM+GR+PLF+PCTPK    GCIELL R  I I+GK+AVV+GRSNI
Sbjct: 183 EKDVDGFHPLNIGRLAMRGREPLFVPCTPK----GCIELLHRYNIEIKGKRAVVIGRSNI 242

Query: 257 VGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVIDV 316
           VG+P +LLL + DATV+I+HSR+ +PE + READIII+A GQ  M++GSWIKPGA +IDV
Sbjct: 243 VGMPAALLLQREDATVSIIHSRTKNPEEITREADIIISAVGQPNMVRGSWIKPGAVLIDV 302

Query: 317 GTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 368
           G N V+DP+   GYRLVGD+ ++EA KVA  ITPVPGGVGPMT+AMLL NTL  AKR+
Sbjct: 303 GINPVEDPSAARGYRLVGDICYEEASKVASAITPVPGGVGPMTIAMLLSNTLTSAKRI 356

BLAST of CsGy4G000520 vs. Swiss-Prot
Match: sp|A2RVV7|FOLD1_ARATH (Bifunctional protein FolD 1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=FOLD1 PE=2 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 3.1e-92
Identity = 169/303 (55.78%), Positives = 225/303 (74.26%), Query Frame = 0

Query: 72  PAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYV 131
           P  ++ E++ K  +IDG  IA+ +R++I  EV K+ +  GK+PGLAVV+VG ++DS TYV
Sbjct: 52  PPPVSFETEQKTVVIDGNVIAEEIRTKIISEVGKMKKAVGKVPGLAVVLVGEQRDSQTYV 111

Query: 132 NMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVL 191
             K KAC E GIKS   +LPE  +E ++IS + + N +        +LPLP H+NE K+L
Sbjct: 112 RNKIKACEETGIKSVLAELPEDCTEGQIISVLRKFNEDTSIHGILVQLPLPQHLNESKIL 171

Query: 192 SEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVM 251
           + + +EKDVDGFHPLN+G LAM+GR+PLF+ CTPK    GC+ELL R+G+ I GK AVV+
Sbjct: 172 NMVRLEKDVDGFHPLNVGNLAMRGREPLFVSCTPK----GCVELLIRTGVEIAGKNAVVI 231

Query: 252 GRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGA 311
           GRSNIVGLP+SLLL + DATV+ VH+ + DPE + R+ADI+IAAAG   +++GSW+KPGA
Sbjct: 232 GRSNIVGLPMSLLLQRHDATVSTVHAFTKDPEHITRKADIVIAAAGIPNLVRGSWLKPGA 291

Query: 312 AVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGA 368
            VIDVGT  V+D + + GYRLVGDV ++EA  VA  ITPVPGGVGPMT+ MLL NTL+ A
Sbjct: 292 VVIDVGTTPVEDSSCEFGYRLVGDVCYEEALGVASAITPVPGGVGPMTITMLLCNTLEAA 350

BLAST of CsGy4G000520 vs. Swiss-Prot
Match: sp|P07245|C1TC_YEAST (C-1-tetrahydrofolate synthase, cytoplasmic OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=ADE3 PE=1 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 4.1e-73
Identity = 150/302 (49.67%), Positives = 197/302 (65.23%), Query Frame = 0

Query: 85  IIDGKKIAQTVRSEITEEVNKLSQKY-GKIPGLAVVIVGNRKDSLTYVNMKRKACLEVGI 144
           ++DGK  AQ  RS I  E+  +     G  P LA++ VGNR DS TYV MKRKA  E GI
Sbjct: 5   VLDGKACAQQFRSNIANEIKSIQGHVPGFAPNLAIIQVGNRPDSATYVRMKRKAAEEAGI 64

Query: 145 KSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEISIEKDVDGF 204
            +  I L E  +E E++  V +LN +P       +LPLP H++E+++ S +  EKDVDGF
Sbjct: 65  VANFIHLDESATEFEVLRYVDQLNEDPHTHGIIVQLPLPAHLDEDRITSRVLAEKDVDGF 124

Query: 205 HPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRSNIVGLPVSL 264
            P NIG+L  K   P FLPCTPK    G IELL ++ ++I G ++VV+GRS+IVG PV+ 
Sbjct: 125 GPTNIGELNKKNGHPFFLPCTPK----GIIELLHKANVTIEGSRSVVIGRSDIVGSPVAE 184

Query: 265 LLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKP--------GAAVID 324
           LL   ++TVTI HS++ D  S + +ADI++ A GQ + +KG W KP           VID
Sbjct: 185 LLKSLNSTVTITHSKTRDIASYLHDADIVVVAIGQPEFVKGEWFKPRDGTSSDKKTVVID 244

Query: 325 VGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRVI 371
           VGTN V DP+KKSG++ VGDV+F EA K    ITPVPGGVGPMTVAML++NTL  AKR +
Sbjct: 245 VGTNYVADPSKKSGFKCVGDVEFNEAIKYVHLITPVPGGVGPMTVAMLMQNTLIAAKRQM 302

BLAST of CsGy4G000520 vs. Swiss-Prot
Match: sp|Q2RIB4|FOLD_MOOTA (Bifunctional protein FolD OS=Moorella thermoacetica (strain ATCC 39073 / JCM 9320) OX=264732 GN=folD PE=3 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 4.1e-73
Identity = 154/291 (52.92%), Positives = 201/291 (69.07%), Query Frame = 0

Query: 83  ATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMKRKACLEVG 142
           A I+DGKKIA  VR+E+ EEV++L  + G  PGLAVV+VG    S  YV  K +AC EVG
Sbjct: 3   AQILDGKKIAAEVRAEVKEEVSRLKAE-GINPGLAVVLVGEDPASQVYVRNKHRACEEVG 62

Query: 143 IKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEISIEKDVDG 202
           I S    LP   S+AEL+  + +LN +P       +LPLP+HI+E+KV+  I++EKDVDG
Sbjct: 63  IYSEVHRLPAATSQAELLKLIDQLNKDPKIHGILVQLPLPDHIDEKKVIDAIALEKDVDG 122

Query: 203 FHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRSNIVGLPVS 262
           F P N+G L +   D  F PCTP    +GC+ LL ++GI  +GKKAVV+GRSNIVG PV+
Sbjct: 123 FSPANVGNLVI--GDKCFYPCTP----HGCMVLLEKAGIDPKGKKAVVVGRSNIVGKPVA 182

Query: 263 LLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVIDVGTNAVD 322
           ++LL   ATVTI HSR+ D  +  R+ADI+IAA G+ ++I G  IK GA VIDVG N V 
Sbjct: 183 MMLLARHATVTICHSRTRDLAAECRQADILIAAVGKPELITGDMIKEGAVVIDVGINRVG 242

Query: 323 DPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKR 367
           +       +LVGDV F+ A + AGWITPVPGGVGPMT+AMLL+NT++ A+R
Sbjct: 243 EK------KLVGDVHFESAAQKAGWITPVPGGVGPMTIAMLLKNTVEAARR 280

BLAST of CsGy4G000520 vs. TrEMBL
Match: tr|A0A0A0KWH1|A0A0A0KWH1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G001540 PE=3 SV=1)

HSP 1 Score: 705.3 bits (1819), Expect = 7.3e-200
Identity = 366/377 (97.08%), Positives = 366/377 (97.08%), Query Frame = 0

Query: 1   MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKF 60
           MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKF
Sbjct: 1   MGHTKLCTDVPISRQTQSDCPNPSPSTYSHACINSLIIHNQSFSISASLLHAIATGSCKF 60

Query: 61  NSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVI 120
           NSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVI
Sbjct: 61  NSEVVTIFFFSPAKMASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVI 120

Query: 121 VGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LP 180
           VGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE       LP
Sbjct: 121 VGNRKDSLTYVNMKRKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLP 180

Query: 181 LPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSG 240
           LPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GCIELLSRSG
Sbjct: 181 LPNHINEEKVLSEISIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCIELLSRSG 240

Query: 241 ISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQ 300
           ISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQ
Sbjct: 241 ISIRGKKAVVMGRSNIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQ 300

Query: 301 MIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTV 360
           MIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTV
Sbjct: 301 MIKGSWIKPGAAVIDVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTV 360

Query: 361 AMLLRNTLDGAKRVIEQ 371
           AMLLRNTLDGAKRVIEQ
Sbjct: 361 AMLLRNTLDGAKRVIEQ 373

BLAST of CsGy4G000520 vs. TrEMBL
Match: tr|A0A1S3BZ79|A0A1S3BZ79_CUCME (bifunctional protein FolD 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494790 PE=3 SV=1)

HSP 1 Score: 551.6 bits (1420), Expect = 1.3e-153
Identity = 286/303 (94.39%), Positives = 290/303 (95.71%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MASESDHKATIIDGKKIAQTVRSE+ EEVNKLS+KYGK+PGLAVVIVGNRKDSLTYVNMK
Sbjct: 1   MASESDHKATIIDGKKIAQTVRSEVAEEVNKLSEKYGKVPGLAVVIVGNRKDSLTYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE-------LPLPNHINEEKVLSEI 194
           RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPE       LPLPNHINEEKVLSEI
Sbjct: 61  RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANPEVHGILVQLPLPNHINEEKVLSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK    GC+ELLSRSGISIRGKKAVVMGRS
Sbjct: 121 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPK----GCLELLSRSGISIRGKKAVVMGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKR 
Sbjct: 241 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRA 299

BLAST of CsGy4G000520 vs. TrEMBL
Match: tr|A0A2C9WN05|A0A2C9WN05_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_01G216600 PE=3 SV=1)

HSP 1 Score: 500.4 bits (1287), Expect = 3.6e-138
Identity = 256/303 (84.49%), Positives = 276/303 (91.09%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MAS SDHKATIIDGK IAQTVRSEI +EV +LS+KYGK+PGLAVVIVGNRKDS +YVNMK
Sbjct: 1   MASPSDHKATIIDGKAIAQTVRSEIADEVRQLSEKYGKVPGLAVVIVGNRKDSQSYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEI 194
           RKAC EVGIKSF+IDLPEQ+SEAELISKVHELNANP       +LPLP HINEEKVLSEI
Sbjct: 61  RKACAEVGIKSFDIDLPEQISEAELISKVHELNANPYIHGILVQLPLPKHINEEKVLSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
            +EKDVDGFHPLNIGKLAMKGR+PLF+PCTPK    GC+ELLSRSGISI+GK AVV+GRS
Sbjct: 121 HLEKDVDGFHPLNIGKLAMKGREPLFVPCTPK----GCLELLSRSGISIKGKNAVVVGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSRS D E +IR ADIIIAAAGQA MIKGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRSDDQERIIRGADIIIAAAGQAMMIKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNA+DDP+KKSGYRLVGDVD++EACKVAGWITPVPGGVGPMTVAMLL+NTLDGAKRV
Sbjct: 241 DVGTNAIDDPSKKSGYRLVGDVDYKEACKVAGWITPVPGGVGPMTVAMLLKNTLDGAKRV 299

BLAST of CsGy4G000520 vs. TrEMBL
Match: tr|A0A2P4KXZ8|A0A2P4KXZ8_QUESU (Bifunctional protein fold 2 OS=Quercus suber OX=58331 GN=CFP56_27282 PE=3 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 1.4e-137
Identity = 254/302 (84.11%), Positives = 276/302 (91.39%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MAS SDHKA IIDGK IAQT+R+EI  EV++LSQK+GK+PGLAVVIVGNRKDS +YVNMK
Sbjct: 1   MASPSDHKANIIDGKAIAQTIRNEIAAEVHQLSQKHGKVPGLAVVIVGNRKDSQSYVNMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEI 194
           RKAC EVGIKSF+IDLPEQVSEAELI+KV ELNANP       +LPLP HINEE VL+EI
Sbjct: 61  RKACAEVGIKSFDIDLPEQVSEAELIAKVDELNANPDVHGILVQLPLPKHINEENVLTEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           SIEKDVDGFHPLNIG+LAMKGRDPLFLPCTPK    GC+ELLSRSGI+++GKKAVV+GRS
Sbjct: 121 SIEKDVDGFHPLNIGRLAMKGRDPLFLPCTPK----GCLELLSRSGITVKGKKAVVVGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHS S DPE VIREADIIIAAAGQA M+KGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSHSQDPEKVIREADIIIAAAGQAMMVKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 370
           DVGTNA+DDP+KKSGYRLVGDV +QEACKVAGWITPVPGGVGPMTVAMLL+NTLDGAKRV
Sbjct: 241 DVGTNAIDDPSKKSGYRLVGDVHYQEACKVAGWITPVPGGVGPMTVAMLLKNTLDGAKRV 298

BLAST of CsGy4G000520 vs. TrEMBL
Match: tr|A0A067K8J8|A0A067K8J8_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_13900 PE=3 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 1.8e-137
Identity = 249/303 (82.18%), Positives = 279/303 (92.08%), Query Frame = 0

Query: 75  MASESDHKATIIDGKKIAQTVRSEITEEVNKLSQKYGKIPGLAVVIVGNRKDSLTYVNMK 134
           MAS SDHKA +IDGK IAQT+RSEI +EV +LS+KYGK+PGLAVVIVG+RKDS +YV+MK
Sbjct: 1   MASPSDHKAAVIDGKAIAQTIRSEIADEVRQLSEKYGKVPGLAVVIVGHRKDSQSYVSMK 60

Query: 135 RKACLEVGIKSFEIDLPEQVSEAELISKVHELNANP-------ELPLPNHINEEKVLSEI 194
           RKAC+EVGIKSF +DLPEQ+SEAELISKVHELNANP       +LPLP HINEE +LSEI
Sbjct: 61  RKACVEVGIKSFGVDLPEQISEAELISKVHELNANPDVHGILVQLPLPKHINEENILSEI 120

Query: 195 SIEKDVDGFHPLNIGKLAMKGRDPLFLPCTPKAIMNGCIELLSRSGISIRGKKAVVMGRS 254
           S+EKDVDGFHPLNIGKLAMKGR+PLF+PCTPK    GC+ELLSRSGISI+GK AVV+GRS
Sbjct: 121 SLEKDVDGFHPLNIGKLAMKGREPLFVPCTPK----GCLELLSRSGISIKGKNAVVVGRS 180

Query: 255 NIVGLPVSLLLLKADATVTIVHSRSVDPESVIREADIIIAAAGQAQMIKGSWIKPGAAVI 314
           NIVGLPVSLLLLKADATVTIVHSR+ DPES+IREADIIIAAAGQA+M+KGSWIKPGAAVI
Sbjct: 181 NIVGLPVSLLLLKADATVTIVHSRTDDPESIIREADIIIAAAGQAKMVKGSWIKPGAAVI 240

Query: 315 DVGTNAVDDPTKKSGYRLVGDVDFQEACKVAGWITPVPGGVGPMTVAMLLRNTLDGAKRV 371
           DVGTNA+DDP++KSGYRLVGDVD++EACKVAGWITPVPGGVGPMTVAMLLRNT+DGAKRV
Sbjct: 241 DVGTNAIDDPSRKSGYRLVGDVDYEEACKVAGWITPVPGGVGPMTVAMLLRNTVDGAKRV 299
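
The Identity and Positives percentages in the HSP summary lines above are the match counts divided by the alignment length. As a minimal sketch (the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool), the figures can be recomputed from a summary line like this:

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# The optional "i" also tolerates the misspelling "Postives"
# that appears in some rendered reports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

print(hsp_percentages("Identity = 249/303 (82.18%), Positives = 279/303 (92.08%)"))
```

Recomputing from the counts rather than trusting the printed percentage gives a quick consistency check when hits are copied between reports.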

The following BLAST results are available for this feature:

Match Name	E-value	Identity	Description
KGN52787.1	1.1e-199	97.08	hypothetical protein Csa_4G001540 [Cucumis sativus]
XP_004152261.1	2.8e-155	96.37	PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis sativus]
XP_008454369.1	2.0e-153	94.39	PREDICTED: bifunctional protein FolD 2 isoform X1 [Cucumis melo]
XP_023524684.1	8.3e-147	89.77	bifunctional protein FolD 2 [Cucurbita pepo subsp. pepo]
XP_022997744.1	7.0e-146	89.77	bifunctional protein FolD 2 isoform X2 [Cucurbita maxima]

Match Name	E-value	Identity	Description
AT3G12290.1	1.0e-130	78.33	Amino acid dehydrogenase family protein
AT4G00620.1	3.2e-100	60.07	Amino acid dehydrogenase family protein
AT2G38660.1	1.7e-93	55.78	Amino acid dehydrogenase family protein
AT4G00600.1	3.9e-74	48.66	Amino acid dehydrogenase family protein

Match Name	E-value	Identity	Description
sp|Q9LHH7|FOLD2_ARATH	1.8e-129	78.33	Bifunctional protein FolD 2 OS=Arabidopsis thaliana OX=3702 GN=FOLD2 PE=2 SV=1
sp|O65271|FOLD4_ARATH	5.7e-99	60.07	Bifunctional protein FolD 4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=FO...
sp|A2RVV7|FOLD1_ARATH	3.1e-92	55.78	Bifunctional protein FolD 1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=FO...
sp|P07245|C1TC_YEAST	4.1e-73	49.67	C-1-tetrahydrofolate synthase, cytoplasmic OS=Saccharomyces cerevisiae (strain A...
sp|Q2RIB4|FOLD_MOOTA	4.1e-73	52.92	Bifunctional protein FolD OS=Moorella thermoacetica (strain ATCC 39073 / JCM 932...

Match Name	E-value	Identity	Description
tr|A0A0A0KWH1|A0A0A0KWH1_CUCSA	7.3e-200	97.08	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G001540 PE=3 SV=1
tr|A0A1S3BZ79|A0A1S3BZ79_CUCME	1.3e-153	94.39	bifunctional protein FolD 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494790 P...
tr|A0A2C9WN05|A0A2C9WN05_MANES	3.6e-138	84.49	Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_01G216600 PE=3 SV=...
tr|A0A2P4KXZ8|A0A2P4KXZ8_QUESU	1.4e-137	84.11	Bifunctional protein fold 2 OS=Quercus suber OX=58331 GN=CFP56_27282 PE=3 SV=1
tr|A0A067K8J8|A0A067K8J8_JATCU	1.8e-137	82.18	Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_13900 PE=3 SV=1

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term	Definition
GO:0003824	catalytic activity
GO:0004488	methylenetetrahydrofolate dehydrogenase (NADP+) activity

Vocabulary: Biological Process
Term	Definition
GO:0055114	oxidation-reduction process

Vocabulary: INTERPRO
Term	Definition
IPR036291	NAD(P)-bd_dom_sf
IPR020867	THF_DH/CycHdrlase_CS
IPR020630	THF_DH/CycHdrlase_cat_dom
IPR020631	THF_DH/CycHdrlase_NAD-bd_dom
IPR000672	THF_DH/CycHdrlase

GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0055114	oxidation-reduction process
biological_process	GO:0009396	folic acid-containing compound biosynthetic process
biological_process	GO:0046487	glyoxylate metabolic process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0016787	hydrolase activity
molecular_function	GO:0004488	methylenetetrahydrofolate dehydrogenase (NADP+) activity
molecular_function	GO:0003824	catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy4G000520.1	CsGy4G000520.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000672	Tetrahydrofolate dehydrogenase/cyclohydrolase	PRINTS	PR00085	THFDHDRGNASE	coord: 283..312, score: 52.82; coord: 341..359, score: 85.22; coord: 234..254, score: 55.68; coord: 324..340, score: 48.64; coord: 114..136, score: 52.34
IPR000672	Tetrahydrofolate dehydrogenase/cyclohydrolase	PANTHER	PTHR10025	TETRAHYDROFOLATE DEHYDROGENASE/CYCLOHYDROLASE FAMILY MEMBER	coord: 78..370
IPR000672	Tetrahydrofolate dehydrogenase/cyclohydrolase	HAMAP	MF_01576	THF_DHG_CYH	coord: 83..368, score: 33.652
IPR020631	Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain	PFAM	PF02882	THF_DHG_CYH_C	coord: 197..366, e-value: 2.2E-63, score: 212.2
None	No IPR available	GENE3D	G3DSA:3.40.50.720		coord: 86..87, e-value: 2.1E-99, score: 333.9; coord: 215..344, e-value: 2.1E-99, score: 333.9
None	No IPR available	GENE3D	G3DSA:3.40.50.10860		coord: 88..214, e-value: 2.1E-99, score: 333.9; coord: 345..361, e-value: 2.1E-99, score: 333.9
None	No IPR available	PANTHER	PTHR10025:SF33	METHENYL TETRAHYDROFOLATE CYCLOHYDROLASE / NADP-DEPENDENT METHYLENE H4F DEHYDROGENASE	coord: 78..370
None	No IPR available	CDD	cd01080	NAD_bind_m-THF_DH_Cyclohyd	coord: 189..365, e-value: 3.28174E-89, score: 267.884
None	No IPR available	SUPERFAMILY	SSF53223	Aminoacid dehydrogenase-like, N-terminal domain	coord: 83..195
IPR020630	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain	PFAM	PF00763	THF_DHG_CYH	coord: 85..194, e-value: 1.0E-30, score: 106.3
IPR020867	Tetrahydrofolate dehydrogenase/cyclohydrolase, conserved site	PROSITE	PS00767	THF_DHG_CYH_2	coord: 345..353
IPR036291	NAD(P)-binding domain superfamily	SUPERFAMILY	SSF51735	NAD(P)-binding Rossmann-fold domains	coord: 196..363