Cla97C09G172900 (gene) Watermelon (97103) v2

Name: Cla97C09G172900
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: G-box-binding factor 4-like
Location: Cla97Chr09 : 9473812 .. 9477042 (+)
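The span covered by the location field can be checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive, as is usual for GFF-style annotation; the page itself does not state this. The function name `parse_location` is illustrative only.

```python
import re

# Location string as it appears on this page.
LOCATION = "Cla97Chr09 : 9473812 .. 9477042 (+)"

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its four parts."""
    m = re.fullmatch(
        r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(LOCATION)
# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, strand, span)  # Cla97Chr09 + 3231
```

Under that assumption the gene spans 3231 bp on the forward strand, introns included.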
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAACCAAATCAGGTATTTAGAACAACAACCCTCACTGCAGCACCAAAATTGGATACCCTTAATTTCCACAACTCCAATGTCGATTCTGTAGTAGCTTTAATCGGCAATCGCAATCCCTCTCATTTCCGCACTGTGGACATGGATGCTAAGTTCGATACTTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAAGGAGTTGAAGGAGGAGGCTGTTGGAGAGATGATCTTGGAGGGTTTTCTTCAAGCCAAACCACAGGATCAGGATGTGAGGATTTTGAATCCGTTTAGTTGTTTAAAGGATTTCGATAGGGTTTATGTTGAAGAAGAGACTGTTGGGTTTGGGAATGGAGTTGACATTAGTGGGAGAGGGAAGAGAAGGCGCGCAGCTATGGAACCAATGGATGAAGCTGCACTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCTGCTGCTAGGTCCAGAGAAAGGAAACAAGTGAGCTTTTTCTGTTGATTAAATCATTTAATCTCTTAAGTTTTAATTAGGTGGTTGAATTTCTAACTTTAGTTTTAATTTGTTGTTAGCTGTGTTGGATTCTTTATTATAACATTATGTTTCATCCAAAACTAATTGAGATGGAAATAGCCCATGTATCCTATAAATACTCATTGATTTTTTTCATTGTGGATACTCAACATGCTCGATGGATCCTATAAACAAGTGAAACCCTTACTAATCTTGTTGTGTGAGTGGTATCGTGCTTCTTTTTTATAATTGAGATCAGAGAGAGAATTCTACATGATTGTACAACATTTTCATATAGAATTTCTTTCTAGTGAATCATGGTATTCCTCTAAGAAGCACAAATACGGATATAACATGGACTTGGCGACATGCCATATCATTTTATACTAAGAAAATTCAAAGTAATGGGTTGATGTATTTATATGCAAAAAAATTAGTTTGATATATTTCATGCTCAAAATTTATTTATTATTGTCATATATGTGTCTTTTTAGTCTACTCAACAAGTGTTCTATGCATGTCTAACACATTTATTATACTAACAAGTATCCAATACATGTCTAATAAGTGTCAGAATGTTAAGTGTTTGACATGAGTTGAACATGGACACACTAGCCTAACTAACGTGTTCGTGCTTCTTAGGTTTTCCTCAATTTTCATGTAAATATCGTGTTTCTTATTTTTGTTGCTTTCTTTTAATTTTAAGAGTTGGATAGGTTAAAAAATTTATGTAACATAGGATAGATTTTAAATTGGGTTTTTAGCTCAACCACCACTATTTCATTCAACTAAGTATTTAGCTGCTAGTTTAATCATTGTCTCATGCCAATATTTAAGTATTTGTGTTAATGATAGAGTTATTAGACTCAAAATCCAAGGTTTAAACATATATTGTGTGTGGTTCTGTGGTTTTTAGGCACATCAAGTTGAGTTAGAGTTAATAGCTTCGAGACTTGAGGAAGAGAACGAGCTATTATTGAAAGAGAAGGTACTGTGTACTTTTTGCTTTGCTTTATAATGCTACTTGCTCGACCATTTACAGCCACAAAATGGCAAAGTCATTTCAATGAGTTTTGAAATCTTCCAAAATTTACCTCGGTTTAAATATCAGGTTCAGTTTTCCTGCTGAGTTAATCATCCTGTTTACTCTTCGTTTATTACATAATTGTTGACTTTTTAATACTTGGACAAATATATGCAAGATAATGAAAAATGGTTTAACTACAAAATAAATCTTTTTCACTATCTTGATATATATGTATCACAAATAGCCATTTGAAATTGTTACTTTTATTTTGAGTTCAAAGTTTTTTGCCATCGTTATTCACACTTAATTGCTTGACTTGTAAACTCTCTTGTCATATTTTCGAGTCTCAAAACCCAAGATTGTGAGTGTCATTGTATTTATGTCATCTTATATTTCCTTCAACCATCTGACGTTTTTCTTTTTTTCTTCTCTCATAGGCCGAGAGATCTAAGGA
ACGACTAAAGCAGGTAGCAATCTTTACTCAATCTGCTTTTCTGGAATAATTACTAAAGGAAAACAGAAAAAATATAAATTAATCCTTTTAGCCTGCAGTTTCTCCTTAGCAAGGGATGAGGAAATTAATAATCTAAATTAGAAGCCAATCCTTTAATATGTATACTCCCTCTTTTGCGGGTCGGACAGTAATAAACCATACTTAGAAGCATTTTTTACACTAGATTTGTGTGTGAATACTAATCGGGGATATATTGAAAATAATAGTAGGATGTTATATGGGTATAGTATAGTAGGGATTTATTTTATTTATTTATTTATTTATTTTTGGTATGGATGACCTTAGTTACATTGTTGTTGCCCCAAAATTTGGAACTAATGAAATATGACCTTCTTGCCATTGTTAGAACGTACTAAAAACGTGTTAATTATCCATACTGCTCCTTGTGATTTTAGCGAGTGTGAAAGTTCATCTGTACAATATTGCTCTCACATGCTAGGGCTCTTCCTCCCTTTGTTGCGTTTGCACTCACATTCAACTTACCTAGTTCAATAATGCTCTCACTCATCGGGCTCTTCCTCCCTCACACTCCCACGCTCGCGCTCCTCCCTTTTCTCACTTACACTCCTCCCCACTCTCCCTATCGCTCATGCTCCTCAATCTCTCACTCACATTTGTATTATTTCTCTTAATCGTGCTTGCGCTCGCACTCGCACTCGCACTCGCCACTCTGAACACTTATTAATGGTGCGGAGGGGCAATATAGATAATTAACACATTAGTACCCCTTTAACAATGGCATAGGGTCATCCTTCACTATTTAAAAAAATTGGAGCAAACATTAACCATCCGAAATTTAGGTCATTCACATAAAAAACTCAATTAACTTGGGATTGTCTAGTGAGGCATATCCCTCTCAAAAGACTATTGAAATTGTAATTTATGAAAGATATTGCAACATATTTCTATTCGGGATCCTAACAATCTGCCATTAACAAGATGCTTGTTTATAGGTTAGATGCTAGCGGTCTTCTTGTTGGTTTAAGTTCTATCTCAAATACTATCTTATAGTATTATCCTTGAAAACATTGTTTTGAATTGAAACTTAGTTCTAATATTGACCATCATGCACATTGTTTAACATATATAGTTGATGGAAAAAGTAATTCCAGTTGTGGAGAAACAAAGACTGCCACGTGTCATATGCTGGGGTCACTCCTTCGAGTGGTAG

mRNA sequence

ATGAAGAAACCAAATCAGGTATTTAGAACAACAACCCTCACTGCAGCACCAAAATTGGATACCCTTAATTTCCACAACTCCAATGTCGATTCTGTAGTAGCTTTAATCGGCAATCGCAATCCCTCTCATTTCCGCACTGTGGACATGGATGCTAAGTTCGATACTTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAAGGAGTTGAAGGAGGAGGCTGTTGGAGAGATGATCTTGGAGGGTTTTCTTCAAGCCAAACCACAGGATCAGGATGTGAGGATTTTGAATCCGTTTAGTTGTTTAAAGGATTTCGATAGGGTTTATGTTGAAGAAGAGACTGTTGGGTTTGGGAATGGAGTTGACATTAGTGGGAGAGGGAAGAGAAGGCGCGCAGCTATGGAACCAATGGATGAAGCTGCACTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCTGCTGCTAGGTCCAGAGAAAGGAAACAAGCACATCAAGTTGAGTTAGAGTTAATAGCTTCGAGACTTGAGGAAGAGAACGAGCTATTATTGAAAGAGAAGGCCGAGAGATCTAAGGAACGACTAAAGCAGTTGATGGAAAAAGTAATTCCAGTTGTGGAGAAACAAAGACTGCCACGTGTCATATGCTGGGGTCACTCCTTCGAGTGGTAG

Coding sequence (CDS)

ATGAAGAAACCAAATCAGGTATTTAGAACAACAACCCTCACTGCAGCACCAAAATTGGATACCCTTAATTTCCACAACTCCAATGTCGATTCTGTAGTAGCTTTAATCGGCAATCGCAATCCCTCTCATTTCCGCACTGTGGACATGGATGCTAAGTTCGATACTTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAAGGAGTTGAAGGAGGAGGCTGTTGGAGAGATGATCTTGGAGGGTTTTCTTCAAGCCAAACCACAGGATCAGGATGTGAGGATTTTGAATCCGTTTAGTTGTTTAAAGGATTTCGATAGGGTTTATGTTGAAGAAGAGACTGTTGGGTTTGGGAATGGAGTTGACATTAGTGGGAGAGGGAAGAGAAGGCGCGCAGCTATGGAACCAATGGATGAAGCTGCACTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCTGCTGCTAGGTCCAGAGAAAGGAAACAAGCACATCAAGTTGAGTTAGAGTTAATAGCTTCGAGACTTGAGGAAGAGAACGAGCTATTATTGAAAGAGAAGGCCGAGAGATCTAAGGAACGACTAAAGCAGTTGATGGAAAAAGTAATTCCAGTTGTGGAGAAACAAAGACTGCCACGTGTCATATGCTGGGGTCACTCCTTCGAGTGGTAG

Protein sequence

MKKPNQVFRTTTLTAAPKLDTLNFHNSNVDSVVALIGNRNPSHFRTVDMDAKFDTSSSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFEW
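The relationship between the coding sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the standard nuclear genetic code and uses only the first and last few codons of the CDS as input; `translate` is a hypothetical helper, not a tool used by the database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS give the first six residues of the protein.
print(translate("ATGAAGAAACCAAATCAG"))  # MKKPNQ
# Last four codons of the CDS: three residues followed by the TAG stop.
print(translate("TTCGAGTGGTAG"))        # FEW
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, provided the standard code applies.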
BLAST of Cla97C09G172900 vs. NCBI nr
Match: XP_008461927.1 (PREDICTED: G-box-binding factor 4-like isoform X3 [Cucumis melo])

HSP 1 Score: 256.1 bits (653), Expect = 1.1e-64
Identity = 153/229 (66.81%), Positives = 168/229 (73.36%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARSR  KQAHQ+ELE IAS
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSRXXKQAHQIELESIAS 180

Query: 181 RLEEENELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFEW 224
           RLEEENE L           LKQLM KVIPV+EKQRLP+VICWG SFEW
Sbjct: 181 RLEEENERLXXXXXXXXXXXLKQLMAKVIPVMEKQRLPQVICWGRSFEW 208
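The percentages in each HSP summary line are the match counts divided by the alignment length (gaps included). A small sketch, using the figures from the first hit above, shows the arithmetic; the regular expression and function name are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Summary line of the first HSP on this page.
HSP_LINE = "Identity = 153/229 (66.81%), Positives = 168/229 (73.36%), Query Frame = 0"

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = re.search(r"Identity = (\d+)/(\d+).*?= (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

print(hsp_percentages(HSP_LINE))  # (66.81, 73.36)
```

"Positives" counts identical residues plus conservative substitutions, so it is never smaller than the identity count.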

BLAST of Cla97C09G172900 vs. NCBI nr
Match: XP_008461925.1 (PREDICTED: G-box-binding factor 4-like isoform X1 [Cucumis melo])

HSP 1 Score: 238.0 bits (606), Expect = 3.0e-59
Identity = 145/206 (70.39%), Positives = 159/206 (77.18%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IAS
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSRERKQAHQIELESIAS 180

Query: 181 RLEEENELLLKEKAERSKERLKQLME 201
           RLEEENE LLKEKAERSKERLKQL +
Sbjct: 181 RLEEENERLLKEKAERSKERLKQLSQ 185

BLAST of Cla97C09G172900 vs. NCBI nr
Match: XP_008461926.1 (PREDICTED: G-box-binding factor 4-like isoform X2 [Cucumis melo])

HSP 1 Score: 228.8 bits (582), Expect = 1.8e-56
Identity = 141/204 (69.12%), Positives = 154/204 (75.49%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARS    QAHQ+ELE IAS
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSXXXXQAHQIELESIAS 180

Query: 181 RLEEENELLLKEKAERSKERLKQL 199
           RLEEENE LLKEKAERSKERLKQL
Sbjct: 181 RLEEENERLLKEKAERSKERLKQL 183

BLAST of Cla97C09G172900 vs. NCBI nr
Match: XP_023521221.1 (G-box-binding factor 4-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 222.6 bits (566), Expect = 1.3e-54
Identity = 141/224 (62.95%), Positives = 158/224 (70.54%), Query Frame = 0

Query: 1   MKKPNQVFRTTTLTAAPKLDTLNFHNSNVDSVVALIGNRNPSHFRTVDMDAKFDTSSSAS 60
           MKKPNQ+FRT    AA               VVA     NP H   +DMD+K D  S+ S
Sbjct: 1   MKKPNQIFRTAAAAAA------------TAKVVA-----NPIHMSAMDMDSKLDGPSTPS 60

Query: 61  KTVDDLWKELKEEAVGEMI-LEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGN 120
             VDD+W   +E+AV EM+  E F+  K Q +DVRILNP +C   F+    EE  VGFGN
Sbjct: 61  GGVDDVW---REKAVEEMMRWEDFIGVKAQ-EDVRILNPLNCFPQFE----EEMIVGFGN 120

Query: 121 GVDISGR-GKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEE 180
           G +ISGR GKRRRA MEPMDEAALQRQRRMIKNRESAARSRERK AHQVELELIA+RLEE
Sbjct: 121 GGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEE 180

Query: 181 ENELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFE 223
           EN  LLK+KAER KERLKQLME VIPVVEK+R PR +C GHSFE
Sbjct: 181 ENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRALCPGHSFE 199

BLAST of Cla97C09G172900 vs. NCBI nr
Match: XP_022926188.1 (G-box-binding factor 4-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 221.5 bits (563), Expect = 2.9e-54
Identity = 143/224 (63.84%), Positives = 160/224 (71.43%), Query Frame = 0

Query: 1   MKKPNQVFRTTTLTAAPKLDTLNFHNSNVDSVVALIGNRNPSHFRTVDMDAKFDTSSSAS 60
           MKKPNQ+FRT    AA K             VVA     NP H   +DMD+K D  S+ S
Sbjct: 1   MKKPNQIFRT---AAAAK-------------VVA-----NPIHMSAMDMDSKLDGPSTPS 60

Query: 61  KTVDDLWKELKEEAVGEMI-LEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGN 120
             VDD+W   +E+AV EM+  E F+  K Q +DVRILNP +C   F+    EE  VGFGN
Sbjct: 61  GGVDDVW---REKAVEEMMRWEDFIGVKAQ-EDVRILNPLNCFPQFE----EEMIVGFGN 120

Query: 121 GVDISGR-GKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEE 180
           G +ISGR GKRRRA MEPMDEAALQRQRRMIKNRESAARSRERK AHQVELELIA+RLEE
Sbjct: 121 GGEISGRSGKRRRAPMEPMDEAALQRQRRMIKNRESAARSRERKHAHQVELELIAARLEE 180

Query: 181 ENELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFE 223
           EN  LLK+KAER KERLKQLME VIPVVEK+R PRV+C GHSFE
Sbjct: 181 ENHRLLKQKAERGKERLKQLMETVIPVVEKRRPPRVLCPGHSFE 195

BLAST of Cla97C09G172900 vs. TrEMBL
Match: tr|A0A1S3CH58|A0A1S3CH58_CUCME (G-box-binding factor 4-like isoform X3 OS=Cucumis melo OX=3656 GN=LOC103500414 PE=4 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 7.1e-65
Identity = 153/229 (66.81%), Positives = 168/229 (73.36%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARSR  KQAHQ+ELE IAS
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSRXXKQAHQIELESIAS 180

Query: 181 RLEEENELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFEW 224
           RLEEENE L           LKQLM KVIPV+EKQRLP+VICWG SFEW
Sbjct: 181 RLEEENERLXXXXXXXXXXXLKQLMAKVIPVMEKQRLPQVICWGRSFEW 208

BLAST of Cla97C09G172900 vs. TrEMBL
Match: tr|A0A1S3CFQ6|A0A1S3CFQ6_CUCME (G-box-binding factor 4-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103500414 PE=4 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 2.0e-59
Identity = 145/206 (70.39%), Positives = 159/206 (77.18%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IAS
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSRERKQAHQIELESIAS 180

Query: 181 RLEEENELLLKEKAERSKERLKQLME 201
           RLEEENE LLKEKAERSKERLKQL +
Sbjct: 181 RLEEENERLLKEKAERSKERLKQLSQ 185

BLAST of Cla97C09G172900 vs. TrEMBL
Match: tr|A0A1S3CGA7|A0A1S3CGA7_CUCME (G-box-binding factor 4-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500414 PE=4 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 1.2e-56
Identity = 141/204 (69.12%), Positives = 154/204 (75.49%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARS    QAHQ+ELE IAS
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSXXXXQAHQIELESIAS 180

Query: 181 RLEEENELLLKEKAERSKERLKQL 199
           RLEEENE LLKEKAERSKERLKQL
Sbjct: 181 RLEEENERLLKEKAERSKERLKQL 183

BLAST of Cla97C09G172900 vs. TrEMBL
Match: tr|A0A1S3CFR1|A0A1S3CFR1_CUCME (G-box-binding factor 4-like isoform X6 OS=Cucumis melo OX=3656 GN=LOC103500414 PE=4 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 1.5e-51
Identity = 133/229 (58.08%), Positives = 147/229 (64.19%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIAS 180
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARS                
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARS---------------- 180

Query: 181 RLEEENELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFEW 224
                               LKQLM KVIPV+EKQRLP+VICWG SFEW
Sbjct: 181 --------XXXXXXXXXXXXLKQLMAKVIPVMEKQRLPQVICWGRSFEW 184

BLAST of Cla97C09G172900 vs. TrEMBL
Match: tr|A0A1S3CFP6|A0A1S3CFP6_CUCME (uncharacterized protein LOC103500414 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103500414 PE=4 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 3.8e-42
Identity = 109/165 (66.06%), Positives = 121/165 (73.33%), Query Frame = 0

Query: 1   MKKPNQVFRTTT--LTAAPKLDTLNFHNSNVDSVVALIGNRNP-SHFRTVDMDAKFDTS- 60
           MKK NQ+FRT T    AA K DT+NFHNSNVDS+VALI NRNP SH     +D +F TS 
Sbjct: 1   MKKSNQIFRTITPASAAAAKFDTINFHNSNVDSLVALIDNRNPLSH-----LDGEFHTSS 60

Query: 61  -SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EET 120
            SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E 
Sbjct: 61  PSSVSKTVDDLWRQLKEESVEDL----------------ILNPLSCLKDFDRVYVEDQEN 120

Query: 121 VGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSR 160
           VGFGN VDI  RGKRRR AMEPMD+AALQRQRRMIKNRESAARSR
Sbjct: 121 VGFGNKVDIRARGKRRRVAMEPMDDAALQRQRRMIKNRESAARSR 144

BLAST of Cla97C09G172900 vs. Swiss-Prot
Match: sp|P42777|GBF4_ARATH (G-box-binding factor 4 OS=Arabidopsis thaliana OX=3702 GN=GBF4 PE=1 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 1.1e-23
Identity = 91/205 (44.39%), Positives = 112/205 (54.63%), Query Frame = 0

Query: 47  VDMDAKFDTSSSAS--KTVDDLWKEL-----------KEEAVGEMILEGFLQAKPQDQ-- 106
           +D+D      +S +  K+VDD+WKE+           +EE    M LE FL     D+  
Sbjct: 69  IDVDRSIGDRNSVNNGKSVDDVWKEIVSGEQKTIMMKEEEPEDIMTLEDFLAKAEMDEGA 128

Query: 107 ----DV-----RILNPFSCLKDFD---RVYVEEETVGFGNGVDISGRGKRRRAAMEPMDE 166
               DV     R+ N  S   DF        +      G GV    RGKR R  ME MD+
Sbjct: 129 SDEIDVKIPTERLNNDGSYTFDFPMQRHSSFQMVEGSMGGGVT---RGKRGRVMMEAMDK 188

Query: 167 AALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLM 224
           AA QRQ+RMIKNRESAARSRERKQA+QVELE +A++LEEENE         +KER K+LM
Sbjct: 189 AAAQRQKRMIKNRESAARSRERKQAYQVELETLAAKLEEENEQXXXXXXXSTKERYKKLM 248

BLAST of Cla97C09G172900 vs. Swiss-Prot
Match: sp|Q0JHF1|BZP12_ORYSJ (bZIP transcription factor 12 OS=Oryza sativa subsp. japonica OX=39947 GN=BZIP12 PE=2 SV=1)

HSP 1 Score: 76.6 bits (187), Expect = 3.8e-13
Identity = 58/152 (38.16%), Positives = 82/152 (53.95%), Query Frame = 0

Query: 77  EMILEGFLQAK-PQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVD----ISGRGKRRR 136
           EM LE FL  +    +D  ++   S  K        +  +GF NG +    ++G   R+R
Sbjct: 122 EMTLEDFLAREGAVKEDEAVVTDPSAAKG-------QVVMGFLNGAEVTGGVTGGRSRKR 181

Query: 137 AAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERS 196
             M+PMD AA+QRQ+RMIKNRESAARSRERKQA+  ELE + ++LEEEN  +        
Sbjct: 182 HLMDPMDRAAMQRQKRMIKNRESAARSRERKQAYIAELESLVTQLEEENAKMFXXXXXXX 241

Query: 197 KERLKQLMEKVIPVVEKQRLPRVICWGHSFEW 224
                     V+PV+ ++   R +   +S EW
Sbjct: 242 XXXXXXXXXXVVPVIIRKTSARDLRRTNSMEW 266

BLAST of Cla97C09G172900 vs. Swiss-Prot
Match: sp|Q9SJN0|ABI5_ARATH (Protein ABSCISIC ACID-INSENSITIVE 5 OS=Arabidopsis thaliana OX=3702 GN=ABI5 PE=1 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 8.7e-10
Identity = 45/112 (40.18%), Positives = 67/112 (59.82%), Query Frame = 0

Query: 91  QDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIK 150
           Q + ++ P S +      + + + +G   GVD+ G   R+R    P+++   +RQRRMIK
Sbjct: 305 QQMGMVGPLSPVSSDGLGHGQVDNIGGQYGVDMGGLRGRKRVVDGPVEKVVERRQRRMIK 364

Query: 151 NRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLMEKV 203
           NRESAARSR RKQA+ VELE   ++L+EEN  L    AE  ++R +Q  E +
Sbjct: 365 NRESAARSRARKQAYTVELEAELNQLKEENAQLKHALAELERKRKQQYFESL 416

BLAST of Cla97C09G172900 vs. Swiss-Prot
Match: sp|Q9C5Q2|AI5L3_ARATH (ABSCISIC ACID-INSENSITIVE 5-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=DPBF4 PE=1 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 1.6e-08
Identity = 60/180 (33.33%), Positives = 88/180 (48.89%), Query Frame = 0

Query: 59  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRI 118
           + KTVD++W+++               K+  +GE+ LE  L           PQ+  V I
Sbjct: 74  SKKTVDEVWRDIQQDKNGNGTSTTTTHKQPTLGEITLEDLLLRAGVVTETVVPQENVVNI 133

Query: 119 LNPFSCLKDFDR-----------VYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQR 178
            +    ++   +           V   ++ V  G   D      R+R A E +++   +R
Sbjct: 134 ASNGQWVEYHHQPQQQQGFMTYPVCEMQDMVMMGGLSDTPQAPGRKRVAGEIVEKTVERR 193

Query: 179 QRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLMEKVIP 205
           Q+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE L          RLK+ +EK++P
Sbjct: 194 QKRMIKNRESAARSRARKQAYTHELEIKVSRLEEENEKL---------RRLKE-VEKILP 243

BLAST of Cla97C09G172900 vs. Swiss-Prot
Match: sp|Q9LES3|AI5L2_ARATH (ABSCISIC ACID-INSENSITIVE 5-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=DPBF3 PE=1 SV=1)

HSP 1 Score: 59.3 bits (142), Expect = 6.3e-08
Identity = 43/92 (46.74%), Positives = 61/92 (66.30%), Query Frame = 0

Query: 122 DISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENE 181
           D    G++R A+ E +++   +RQ+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE
Sbjct: 206 DTQTPGRKRVASGEVVEKTVERRQKRMIKNRESAARSRARKQAYTHELEIKVSRLEEENE 265

Query: 182 LLLKEKAERSKERLKQLMEKVIPVVEKQRLPR 214
            L K+K       +++++  V P   K++L R
Sbjct: 266 RLRKQK------EVEKILPSVPPPDPKRQLRR 291

BLAST of Cla97C09G172900 vs. TAIR10
Match: AT1G03970.1 (G-box binding factor 4)

HSP 1 Score: 111.7 bits (278), Expect = 5.9e-25
Identity = 91/205 (44.39%), Positives = 112/205 (54.63%), Query Frame = 0

Query: 47  VDMDAKFDTSSSAS--KTVDDLWKEL-----------KEEAVGEMILEGFLQAKPQDQ-- 106
           +D+D      +S +  K+VDD+WKE+           +EE    M LE FL     D+  
Sbjct: 69  IDVDRSIGDRNSVNNGKSVDDVWKEIVSGEQKTIMMKEEEPEDIMTLEDFLAKAEMDEGA 128

Query: 107 ----DV-----RILNPFSCLKDFD---RVYVEEETVGFGNGVDISGRGKRRRAAMEPMDE 166
               DV     R+ N  S   DF        +      G GV    RGKR R  ME MD+
Sbjct: 129 SDEIDVKIPTERLNNDGSYTFDFPMQRHSSFQMVEGSMGGGVT---RGKRGRVMMEAMDK 188

Query: 167 AALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLM 224
           AA QRQ+RMIKNRESAARSRERKQA+QVELE +A++LEEENE         +KER K+LM
Sbjct: 189 AAAQRQKRMIKNRESAARSRERKQAYQVELETLAAKLEEENEQXXXXXXXSTKERYKKLM 248

BLAST of Cla97C09G172900 vs. TAIR10
Match: AT5G44080.1 (Basic-leucine zipper (bZIP) transcription factor family protein)

HSP 1 Score: 70.9 bits (172), Expect = 1.2e-12
Identity = 71/206 (34.47%), Positives = 94/206 (45.63%), Query Frame = 0

Query: 56  SSSASKTVDDLWKE--------LKEEAVGE-MILEGFL-----------QAKPQDQDVRI 115
           ++   K+VD++W+E        +KEE   E M LE FL            A  +D DV+I
Sbjct: 115 TTRGGKSVDEIWREMVSGEGKGMKEETSEEIMTLEDFLAKAAVEDETAVTASAEDLDVKI 174

Query: 116 -------------LNPFSCLKDFDRVYVEEETVGFGNGVDISG---RGKRRRAAMEPMDE 175
                         NPF  +       VE   V FGNG+D+ G   RGKR R  +EP+D+
Sbjct: 175 PVTNYGFDHSAPPHNPFQMIDK-----VEGSIVAFGNGLDVYGGGARGKRARVMVEPLDK 234

Query: 176 AALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLM 224
           AA                               A++LEEENELL KE  ++ KER ++LM
Sbjct: 235 AAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAAKLEEENELLSKEIEDKRKERYQKLM 294

BLAST of Cla97C09G172900 vs. TAIR10
Match: AT2G36270.1 (Basic-leucine zipper (bZIP) transcription factor family protein)

HSP 1 Score: 65.5 bits (158), Expect = 4.8e-11
Identity = 45/112 (40.18%), Positives = 67/112 (59.82%), Query Frame = 0

Query: 91  QDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIK 150
           Q + ++ P S +      + + + +G   GVD+ G   R+R    P+++   +RQRRMIK
Sbjct: 305 QQMGMVGPLSPVSSDGLGHGQVDNIGGQYGVDMGGLRGRKRVVDGPVEKVVERRQRRMIK 364

Query: 151 NRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLMEKV 203
           NRESAARSR RKQA+ VELE   ++L+EEN  L    AE  ++R +Q  E +
Sbjct: 365 NRESAARSRARKQAYTVELEAELNQLKEENAQLKHALAELERKRKQQYFESL 416

BLAST of Cla97C09G172900 vs. TAIR10
Match: AT2G41070.1 (Basic-leucine zipper (bZIP) transcription factor family protein)

HSP 1 Score: 61.2 bits (147), Expect = 9.1e-10
Identity = 60/180 (33.33%), Positives = 88/180 (48.89%), Query Frame = 0

Query: 59  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRI 118
           + KTVD++W+++               K+  +GE+ LE  L           PQ+  V I
Sbjct: 74  SKKTVDEVWRDIQQDKNGNGTSTTTTHKQPTLGEITLEDLLLRAGVVTETVVPQENVVNI 133

Query: 119 LNPFSCLKDFDR-----------VYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQR 178
            +    ++   +           V   ++ V  G   D      R+R A E +++   +R
Sbjct: 134 ASNGQWVEYHHQPQQQQGFMTYPVCEMQDMVMMGGLSDTPQAPGRKRVAGEIVEKTVERR 193

Query: 179 QRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKAERSKERLKQLMEKVIP 205
           Q+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE L          RLK+ +EK++P
Sbjct: 194 QKRMIKNRESAARSRARKQAYTHELEIKVSRLEEENEKL---------RRLKE-VEKILP 243

BLAST of Cla97C09G172900 vs. TAIR10
Match: AT3G56850.1 (ABA-responsive element binding protein 3)

HSP 1 Score: 59.3 bits (142), Expect = 3.5e-09
Identity = 43/92 (46.74%), Positives = 61/92 (66.30%), Query Frame = 0

Query: 122 DISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENE 181
           D    G++R A+ E +++   +RQ+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE
Sbjct: 206 DTQTPGRKRVASGEVVEKTVERRQKRMIKNRESAARSRARKQAYTHELEIKVSRLEEENE 265

Query: 182 LLLKEKAERSKERLKQLMEKVIPVVEKQRLPR 214
            L K+K       +++++  V P   K++L R
Sbjct: 266 RLRKQK------EVEKILPSVPPPDPKRQLRR 291

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_008461927.1 | 1.1e-64 | 66.81 | PREDICTED: G-box-binding factor 4-like isoform X3 [Cucumis melo]
XP_008461925.1 | 3.0e-59 | 70.39 | PREDICTED: G-box-binding factor 4-like isoform X1 [Cucumis melo]
XP_008461926.1 | 1.8e-56 | 69.12 | PREDICTED: G-box-binding factor 4-like isoform X2 [Cucumis melo]
XP_023521221.1 | 1.3e-54 | 62.95 | G-box-binding factor 4-like isoform X1 [Cucurbita pepo subsp. pepo]
XP_022926188.1 | 2.9e-54 | 63.84 | G-box-binding factor 4-like isoform X1 [Cucurbita moschata]

Match Name | E-value | Identity | Description
tr|A0A1S3CH58|A0A1S3CH58_CUCME | 7.1e-65 | 66.81 | G-box-binding factor 4-like isoform X3 OS=Cucumis melo OX=3656 GN=LOC103500414 P...
tr|A0A1S3CFQ6|A0A1S3CFQ6_CUCME | 2.0e-59 | 70.39 | G-box-binding factor 4-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103500414 P...
tr|A0A1S3CGA7|A0A1S3CGA7_CUCME | 1.2e-56 | 69.12 | G-box-binding factor 4-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500414 P...
tr|A0A1S3CFR1|A0A1S3CFR1_CUCME | 1.5e-51 | 58.08 | G-box-binding factor 4-like isoform X6 OS=Cucumis melo OX=3656 GN=LOC103500414 P...
tr|A0A1S3CFP6|A0A1S3CFP6_CUCME | 3.8e-42 | 66.06 | uncharacterized protein LOC103500414 isoform X4 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name | E-value | Identity | Description
sp|P42777|GBF4_ARATH | 1.1e-23 | 44.39 | G-box-binding factor 4 OS=Arabidopsis thaliana OX=3702 GN=GBF4 PE=1 SV=1
sp|Q0JHF1|BZP12_ORYSJ | 3.8e-13 | 38.16 | bZIP transcription factor 12 OS=Oryza sativa subsp. japonica OX=39947 GN=BZIP12 ...
sp|Q9SJN0|ABI5_ARATH | 8.7e-10 | 40.18 | Protein ABSCISIC ACID-INSENSITIVE 5 OS=Arabidopsis thaliana OX=3702 GN=ABI5 PE=1...
sp|Q9C5Q2|AI5L3_ARATH | 1.6e-08 | 33.33 | ABSCISIC ACID-INSENSITIVE 5-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=DP...
sp|Q9LES3|AI5L2_ARATH | 6.3e-08 | 46.74 | ABSCISIC ACID-INSENSITIVE 5-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=DP...

Match Name | E-value | Identity | Description
AT1G03970.1 | 5.9e-25 | 44.39 | G-box binding factor 4
AT5G44080.1 | 1.2e-12 | 34.47 | Basic-leucine zipper (bZIP) transcription factor family protein
AT2G36270.1 | 4.8e-11 | 40.18 | Basic-leucine zipper (bZIP) transcription factor family protein
AT2G41070.1 | 9.1e-10 | 33.33 | Basic-leucine zipper (bZIP) transcription factor family protein
AT3G56850.1 | 3.5e-09 | 46.74 | ABA-responsive element binding protein 3
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term | Definition
GO:0006355 | regulation of transcription, DNA-templated
Vocabulary: Molecular Function
Term | Definition
GO:0003700 | transcription factor activity, sequence-specific DNA binding
Vocabulary: INTERPRO
Term | Definition
IPR004827 | bZIP
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C09G172900.1 | Cla97C09G172900.1 | mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 166..201
None | No IPR available | GENE3D | G3DSA:1.20.5.170 |  | coord: 143..202; e-value: 7.5E-11; score: 43.9
None | No IPR available | PANTHER | PTHR22952 | CAMP-RESPONSE ELEMENT BINDING PROTEIN-RELATED | coord: 30..223
None | No IPR available | PANTHER | PTHR22952:SF184 | BASIC LEUCINE ZIPPER TRANSCRIPTION FACTOR-RELATED | coord: 30..223
None | No IPR available | SUPERFAMILY | SSF57959 | Leucine zipper domain | coord: 143..194
IPR004827 | Basic-leucine zipper domain | SMART | SM00338 | brlzneu | coord: 139..201; e-value: 8.9E-8; score: 41.8
IPR004827 | Basic-leucine zipper domain | PFAM | PF00170 | bZIP_1 | coord: 139..201; e-value: 2.9E-11; score: 43.2
IPR004827 | Basic-leucine zipper domain | PROSITE | PS00036 | BZIP_BASIC | coord: 146..161
IPR004827 | Basic-leucine zipper domain | PROSITE | PS50217 | BZIP | coord: 141..193; score: 10.335
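
The alignment coordinates in the InterPro table refer to residue positions in the protein sequence. A minimal sketch for pulling out an annotated region follows; it assumes the coordinates are 1-based and inclusive, which the page does not state explicitly, and `region` is a hypothetical helper name.

```python
# Protein sequence of Cla97C09G172900 as listed on this page.
PROTEIN = (
    "MKKPNQVFRTTTLTAAPKLDTLNFHNSNVDSVVALIGNRNPSHFRTVDMDAKFDTSSSAS"
    "KTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNG"
    "VDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEEN"
    "ELLLKEKAERSKERLKQLMEKVIPVVEKQRLPRVICWGHSFEW"
)

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]

# Pfam bZIP_1 (PF00170) hit, coord: 139..201
bzip = region(PROTEIN, 139, 201)
print(len(bzip), bzip)
```

The same call with 146 and 161 would extract the span reported for the PROSITE BZIP_BASIC pattern.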