Clc03G19040 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc03G19040
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: GATA transcription factor 16-like
Location: ClcChr03: 31270946 .. 31272619 (+)
RNA-Seq Expression: Clc03G19040
Synteny: Clc03G19040
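
The Location field gives the chromosome, start, end and strand of the gene. As a minimal sketch, the span length can be derived from it as follows. The helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive (GFF-style), which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Parse 'Chrom: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end, strand = parse_location("ClcChr03: 31270946 .. 31272619 (+)")
gene_length = span_length(start, end)
print(chrom, strand, gene_length)
```

Under that assumption the gene spans 1674 bp on the plus strand of ClcChr03.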
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTGGAGCGGTGGGTGATGTGAAAAATAGGGAGGATATCGTGTCAGCAGAAGCCCTTCTCCTTCGCTGCAAAATCAGTGGAATGATAAAATGGCAATTTCGTCAATCCGCCCTTAAACCCCAATTCTCCGTTCAAGGGTTTTGCAGTGCCGCCTAGGGTTTCTCCATCATCTTCATTCATTCTTATCTCTATCCGCTCTCTCTCTCTCTGTCGGTTCTGCTCTCTTTTTTGCAGTAATGATCTTGCATTCTCCCCCTCTTTTTCTCCCTCATTCAATTTTCTTCCTAAATCGTGAAATGGGTTTGATGGATTTGAGGCAAAAGGTCAGAATCACCAGCCCCAAAAGCCCCTTTTAGTCCAATCTTTGTTATTTTTACTGATTTTCTTAATTCCACTCCTCTGTTTCAGGGACTGTTGCTTCCGGACACTAAATATTGTGTTGATTGTAAGACAACCAAGACTCCTTTATGGCGTGTAGGCCCTACTGGACCTAAGGTTCTTTCCATCCTTTGGCCTTTCTTTCTCTTTTCGTTTTTGCTTCTGTTTGCCTTGATTTCCTAGGTTGTTTCTGTTTGCCTTGATTTCCTTGATTTCCTAATGTTCTGTTTTTGTTTGATCTTGAGCTTATGAGTTTTGATTCTTGATCTATTTATGGTTTATAATTTTGCCTCGTAATTGGGTTTCTCATCTATTCATGAACCTGATCATCGTCTGTTTCTTCTACTGTTATCATGTGTTCTGTTTTGGCACTATGTCACTCGAATTTGGGTTTCTCCTCTTTTATCTTAGATGATGAGCTTTCCCTTTCATTTACTTTCCTCTCGAATTGATTTTATGATCTTGAGTCTGATTTGGATTCACATTTGTTTTTTTTTTTTTTGTGTTCTGTTTTTAGTGGGGTTCTCCTCTGTTTTATCAATCCAACCGTCCATTTCGTGTTGGAACTCGAAATTCTTATATGGTCAATTAAACAAACCTTTCTTCCTATTTGGGATTTTTGCAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAAGAATATCCACCACAGGAACGAACAAAGGATATGACAGGAAGAGAGGAGTTCATAACAATGGCTCCACGGCCATGACCACCGTGTCAGCCGCCACTTCCTCGGCCACCACCACGACCTCTGGCAGTGGTGGTGGAGATGGGGATGAGAATTTAGGGGAATGTGAGTCATTGAGGATGACACTGATGATGGCATTGGAGGAGGAGGAGGTGAAGAATTTACCGTCAGCAGTGAAGAAACAGCGGTGTCAGCGGCCGAAGAAGCTCGGGGAGGAGGAGAAGCAGGCAGCAGTGTCGTTAATGGAGTTGTCCTGTGGCTCTGTGTTTTCCTGAGGATGAAGAAGAAGAAGAAGGTAGATAAGTAGTATTTGAGTATGTTACTCAGCATAGTTCTAACCTATAATACTAGTCACTTATTTTCTTTTTGAAACTCAAAATTAATGAAATTGGAAGAAGACCATAAAAGTAGGAAGTGGGTTTCTTTCCAAAATGGTAGTTTTAGGCAATCAAGGAAGGAGATTAGGGAATTCAAAATGATATTTTGTGTAGTGGTAGTTTTGTTCTTCTCTTTTCTGAATAACTCATATGTGTACATATGCCAACAAAGTATTACAAATTTACATGCCTATTCTCTCT

mRNA sequence

GTGGAGCGGTGGGTGATGTGAAAAATAGGGAGGATATCGTGTCAGCAGAAGCCCTTCTCCTTCGCTGCAAAATCAGTGGAATGATAAAATGGCAATTTCGTCAATCCGCCCTTAAACCCCAATTCTCCGTTCAAGGGTTTTGCAGTGCCGCCTAGGGTTTCTCCATCATCTTCATTCATTCTTATCTCTATCCGCTCTCTCTCTCTCTGTCGGTTCTGCTCTCTTTTTTGCAGTAATGATCTTGCATTCTCCCCCTCTTTTTCTCCCTCATTCAATTTTCTTCCTAAATCGTGAAATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTTCCGGACACTAAATATTGTGTTGATTGTAAGACAACCAAGACTCCTTTATGGCGTGTAGGCCCTACTGGACCTAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAAGAATATCCACCACAGGAACGAACAAAGGATATGACAGGAAGAGAGGAGTTCATAACAATGGCTCCACGGCCATGACCACCGTGTCAGCCGCCACTTCCTCGGCCACCACCACGACCTCTGGCAGTGGTGGTGGAGATGGGGATGAGAATTTAGGGGAATGTGAGTCATTGAGGATGACACTGATGATGGCATTGGAGGAGGAGGAGGTGAAGAATTTACCGTCAGCAGTGAAGAAACAGCGGTGTCAGCGGCCGAAGAAGCTCGGGGAGGAGGAGAAGCAGGCAGCAGTGTCGTTAATGGAGTTGTCCTGTGGCTCTGTGTTTTCCTGAGGATGAAGAAGAAGAAGAAGGTAGATAAGTAGTATTTGAGTATGTTACTCAGCATAGTTCTAACCTATAATACTAGTCACTTATTTTCTTTTTGAAACTCAAAATTAATGAAATTGGAAGAAGACCATAAAAGTAGGAAGTGGGTTTCTTTCCAAAATGGTAGTTTTAGGCAATCAAGGAAGGAGATTAGGGAATTCAAAATGATATTTTGTGTAGTGGTAGTTTTGTTCTTCTCTTTTCTGAATAACTCATATGTGTACATATGCCAACAAAGTATTACAAATTTACATGCCTATTCTCTCT

Coding sequence (CDS)

ATGATCTTGCATTCTCCCCCTCTTTTTCTCCCTCATTCAATTTTCTTCCTAAATCGTGAAATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTTCCGGACACTAAATATTGTGTTGATTGTAAGACAACCAAGACTCCTTTATGGCGTGTAGGCCCTACTGGACCTAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAAGAATATCCACCACAGGAACGAACAAAGGATATGACAGGAAGAGAGGAGTTCATAACAATGGCTCCACGGCCATGACCACCGTGTCAGCCGCCACTTCCTCGGCCACCACCACGACCTCTGGCAGTGGTGGTGGAGATGGGGATGAGAATTTAGGGGAATGTGAGTCATTGAGGATGACACTGATGATGGCATTGGAGGAGGAGGAGGTGAAGAATTTACCGTCAGCAGTGAAGAAACAGCGGTGTCAGCGGCCGAAGAAGCTCGGGGAGGAGGAGAAGCAGGCAGCAGTGTCGTTAATGGAGTTGTCCTGTGGCTCTGTGTTTTCCTGA
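
The protein sequence listed in this record should correspond to a frame-0 translation of the CDS. The sketch below illustrates this with the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 30 and last 21 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATCTTGCATTCTCCCCCTCTTTTTCTC"  # first 30 nt of the CDS above
cds_end = "TGTGGCTCTGTGTTTTCCTGA"             # last 21 nt, including the TGA stop

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues
```

The two fragments translate to MILHSPPLFL and CGSVFS, which match the start and end of the protein sequence in this record.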

Protein sequence

MILHSPPLFLPHSIFFLNREMGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGSTAMTTVSAATSSATTTTSGSGGGDGDENLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS
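
GATA-type zinc fingers are commonly described by the cysteine spacing C-X2-C-X17-20-C-X2-C. As a rough sketch, the protein above can be scanned for that spacing with a regular expression. The pattern and the helper `find_gata_fingers` are illustrative assumptions, not the method used by the domain databases cited in this record.

```python
import re

# Protein sequence copied from this record.
PROTEIN = (
    "MILHSPPLFLPHSIFFLNREMGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSL"
    "CNACGIRFRKRRISTTGTNKGYDRKRGVHNNGSTAMTTVSAATSSATTTTSGSGGGDGDE"
    "NLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS"
)

# Assumed consensus spacing for a GATA-type zinc finger: C-X2-C-X17-20-C-X2-C.
GATA_FINGER = re.compile(r"C.{2}C.{17,20}C.{2}C")


def find_gata_fingers(protein):
    """Return (start, end, motif) tuples with 1-based inclusive coordinates."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in GATA_FINGER.finditer(protein)
    ]


hits = find_gata_fingers(PROTEIN)
for start, end, motif in hits:
    print(start, end, motif)
```

The single match spans residues 39 to 64, which lies inside the GATA domain coordinates reported in the InterPro section of this record (for example Pfam PF00320 at 39..72).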
Homology
BLAST of Clc03G19040 vs. NCBI nr
Match: XP_038880207.1 (GATA transcription factor 17-like [Benincasa hispida])

HSP 1 Score: 276.2 bits (705), Expect = 2.0e-70
Identity = 153/186 (82.26%), Positives = 158/186 (84.95%), Query Frame = 0

Query: 1   MILHSPPLFLPHSIFFLNREMGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSL 60
           MIL SP L LPHSI FLNREMG+MDLRQKGLLL DTK CVDCKTTKTPLWR GPTGPKSL
Sbjct: 1   MILRSPRLSLPHSICFLNREMGMMDLRQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSL 60

Query: 61  CNACGIRFRKRRISTTGTNKGYDRKR-GVHNNGSTAMTTVSAATSS---ATTTTSGSGGG 120
           CNACGIRFRKRRIST GTN+GYDRKR  VHNNGST  TTVSA TSS    TTTTSGSG G
Sbjct: 61  CNACGIRFRKRRISTIGTNRGYDRKRERVHNNGSTITTTVSATTSSTGTTTTTTSGSGDG 120

Query: 121 DGDENLGECESLRMTLMMALEEE--EVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELS 180
           DGDENLGEC SL M LMMALEEE   V+NLPS+VKKQR QR +KLGEEEKQAAVSLM LS
Sbjct: 121 DGDENLGECGSLGMRLMMALEEEVMVVQNLPSSVKKQRFQRERKLGEEEKQAAVSLMALS 180

BLAST of Clc03G19040 vs. NCBI nr
Match: XP_008450587.1 (PREDICTED: GATA transcription factor 16-like [Cucumis melo])

HSP 1 Score: 201.4 bits (511), Expect = 6.4e-48
Identity = 120/185 (64.86%), Positives = 137/185 (74.05%), Query Frame = 0

Query: 1   MILHSPPLFLPHSIFFLNREMGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSL 60
           MIL SP L LP SIFFLN EMG +DL QKGLLL DTK CVDCKTTKTPLWR GPTGPKSL
Sbjct: 1   MILCSPLLSLPRSIFFLNPEMGFVDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSL 60

Query: 61  CNACGIRFRKRRISTTGTNK-GYDRKR-GVHNNGSTAMTTVSAATSSA---TTTTSGSGG 120
           CNACGIRFRKR+I T  TN+ G D+KR  V +N S+ +  VSA T+S+   TTTT+ + G
Sbjct: 61  CNACGIRFRKRKIFTRRTNRGGRDKKRERVRDNHSSTVAIVSATTTSSSGTTTTTTTTSG 120

Query: 121 GDGDENLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSC 180
            DGDEN GEC S RM +MM LEE+ +      VKK R Q  +K+GEEEKQAAVSLM LS 
Sbjct: 121 VDGDENSGECGSSRMKIMMGLEEDVM-----VVKKHRWQWQRKVGEEEKQAAVSLMALSN 180

BLAST of Clc03G19040 vs. NCBI nr
Match: XP_011659732.1 (GATA transcription factor 16 [Cucumis sativus] >KGN66031.1 hypothetical protein Csa_006937 [Cucumis sativus])

HSP 1 Score: 188.0 bits (476), Expect = 7.3e-44
Identity = 106/164 (64.63%), Positives = 126/164 (76.83%), Query Frame = 0

Query: 21  MGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNK 80
           MG MDL QKGLLL DTK CVDCKTTKTPLWR GPTGPKSLCNACGIRFRKRRIST GTN+
Sbjct: 1   MGFMDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRRISTRGTNR 60

Query: 81  GYDRKRGVHNNGSTAMTTVSAATSSAT----TTTSGSGGGDGDENLGECESLRMTLMMAL 140
              ++  V++N S+A+ TVSA T+S++    TTT+ S G DGDEN GEC SLRM LMM+L
Sbjct: 61  RDKKREKVNDNHSSAVATVSATTTSSSGTTITTTTSSSGVDGDENSGECGSLRMRLMMSL 120

Query: 141 EEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           EE+ +      VKKQ+ Q  +K+GEEEKQAA+SL+ LS  S+ S
Sbjct: 121 EEDVM-----VVKKQQWQWQRKVGEEEKQAAMSLIALSNDSLIS 159

BLAST of Clc03G19040 vs. NCBI nr
Match: XP_022135613.1 (GATA transcription factor 17-like isoform X2 [Momordica charantia])

HSP 1 Score: 173.7 bits (439), Expect = 1.4e-39
Identity = 109/177 (61.58%), Positives = 128/177 (72.32%), Query Frame = 0

Query: 16  FLNR-EMGLMDL---RQKGLLLPDT-KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRK 75
           FLN+ EMG+MD+   + K  +  DT KYCVDCKTTKTPLWR GP GPKSLCNACGIRFRK
Sbjct: 49  FLNKAEMGMMDVLRRKNKERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRK 108

Query: 76  RRISTTGTNKGYDRKR-GVHNNGSTAMTTVSAATSSATTTT---SGSGGGDG---DENLG 135
           RR+ST GTN+G DRKR   H++G +    +SA TSS+ T     S +GG DG   +E+LG
Sbjct: 109 RRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLG 168

Query: 136 ECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           EC SLRM LMMAL EE V  +   + KQR   P+KLGEEE QAAVSLM LSCGSVF+
Sbjct: 169 ECGSLRMRLMMALGEEVV--VQQNISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA 220

BLAST of Clc03G19040 vs. NCBI nr
Match: XP_022135612.1 (GATA transcription factor 17-like isoform X1 [Momordica charantia])

HSP 1 Score: 172.6 bits (436), Expect = 3.2e-39
Identity = 111/178 (62.36%), Positives = 129/178 (72.47%), Query Frame = 0

Query: 16  FLNR-EMGLMD-LRQKG---LLLPDT-KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFR 75
           FLN+ EMG+MD LR+K     +  DT KYCVDCKTTKTPLWR GP GPKSLCNACGIRFR
Sbjct: 49  FLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFR 108

Query: 76  KRRISTTGTNKGYDRKR-GVHNNGSTAMTTVSAATSSATTTT---SGSGGGDG---DENL 135
           KRR+ST GTN+G DRKR   H++G +    +SA TSS+ T     S +GG DG   +E+L
Sbjct: 109 KRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDL 168

Query: 136 GECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           GEC SLRM LMMAL EE V  +   + KQR   P+KLGEEE QAAVSLM LSCGSVF+
Sbjct: 169 GECGSLRMRLMMALGEEVV--VQQNISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA 221

BLAST of Clc03G19040 vs. ExPASy Swiss-Prot
Match: Q9FJ10 (GATA transcription factor 16 OS=Arabidopsis thaliana OX=3702 GN=GATA16 PE=2 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 5.7e-15
Identity = 64/145 (44.14%), Positives = 76/145 (52.41%), Query Frame = 0

Query: 37  KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGSTAM 96
           K C DC T+KTPLWR GP GPKSLCNACGIR RK+R   T  NK   +            
Sbjct: 36  KTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGGTEDNKKLKK------------ 95

Query: 97  TTVSAATSSATTTTSGSGGGDGDENLGECESLRMTLM-MALEEEEVKNLPSAVKKQRCQR 156
                         S SGGG    N    ESL+ +LM + + +       S V+KQR   
Sbjct: 96  --------------SSSGGG----NRKFGESLKQSLMDLGIRKR------STVEKQR--- 139

Query: 157 PKKLGEEEKQAAVSLMELSCGSVFS 181
            +KLGEEE QAAV LM LS GSV++
Sbjct: 156 -QKLGEEE-QAAVLLMALSYGSVYA 139

BLAST of Clc03G19040 vs. ExPASy Swiss-Prot
Match: Q9LIB5 (GATA transcription factor 17 OS=Arabidopsis thaliana OX=3702 GN=GATA17 PE=2 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 1.5e-12
Identity = 61/158 (38.61%), Positives = 84/158 (53.16%), Query Frame = 0

Query: 35  DTK-YCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGS 94
           DTK  CVDC T +TPLWR GP GPKSLCNACGI+ RK+R +  G  +  ++K+   +N +
Sbjct: 39  DTKRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGM-RSEEKKKNRKSNCN 98

Query: 95  TAMTTVSAATSSATTTTSGSGGGDGDENLGECESLRMT-----------LMMALEEEEVK 154
             +                 G  D D++   C + R +           L +  +   +K
Sbjct: 99  NDLNLDHRNAKKYKINIVDDGKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMK 158

Query: 155 NLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
              SAV+K+R  R  KLGEEE+ AAV LM LSC SV++
Sbjct: 159 R--SAVEKKRLWR--KLGEEER-AAVLLMALSCSSVYA 190

BLAST of Clc03G19040 vs. ExPASy Swiss-Prot
Match: Q8LG10 (GATA transcription factor 15 OS=Arabidopsis thaliana OX=3702 GN=GATA15 PE=2 SV=2)

HSP 1 Score: 71.6 bits (174), Expect = 1.0e-11
Identity = 58/143 (40.56%), Positives = 69/143 (48.25%), Query Frame = 0

Query: 35  DTKYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGST 94
           + K C  C T+KTPLWR GP GPKSLCNACGIR RK+R  T  +N+  D+K+  HN    
Sbjct: 39  EKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR-RTLISNRSEDKKKKSHN---- 98

Query: 95  AMTTVSAATSSATTTTSGSGGGDGDENLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQ 154
                                     N    +SL+  L M L  E +    +A      Q
Sbjct: 99  -------------------------RNPKFGDSLKQRL-MELGREVMMQRSTAEN----Q 145

Query: 155 RPKKLGEEEKQAAVSLMELSCGS 178
           R  KLGEEE QAAV LM LS  S
Sbjct: 159 RRNKLGEEE-QAAVLLMALSYAS 145

BLAST of Clc03G19040 vs. ExPASy Swiss-Prot
Match: Q9SZI6 (Putative GATA transcription factor 22 OS=Arabidopsis thaliana OX=3702 GN=GATA22 PE=1 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 1.9e-10
Identity = 29/42 (69.05%), Positives = 31/42 (73.81%), Query Frame = 0

Query: 37  KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGT 79
           + C DC TTKTPLWR GP GPKSLCNACGIR RK R +   T
Sbjct: 199 RICSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAT 240

BLAST of Clc03G19040 vs. ExPASy Swiss-Prot
Match: Q6YW48 (Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 OS=Oryza sativa subsp. japonica OX=39947 GN=CGA1 PE=2 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 2.5e-10
Identity = 31/46 (67.39%), Positives = 32/46 (69.57%), Query Frame = 0

Query: 37  KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRK-RRISTTGTNKG 82
           + C DC TTKTPLWR GP GPKSLCNACGIR RK RR      N G
Sbjct: 176 RVCSDCNTTKTPLWRSGPCGPKSLCNACGIRQRKARRAMAAAANGG 221

BLAST of Clc03G19040 vs. ExPASy TrEMBL
Match: A0A1S3BQ71 (GATA transcription factor 16-like OS=Cucumis melo OX=3656 GN=LOC103492133 PE=4 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 3.1e-48
Identity = 120/185 (64.86%), Positives = 137/185 (74.05%), Query Frame = 0

Query: 1   MILHSPPLFLPHSIFFLNREMGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSL 60
           MIL SP L LP SIFFLN EMG +DL QKGLLL DTK CVDCKTTKTPLWR GPTGPKSL
Sbjct: 1   MILCSPLLSLPRSIFFLNPEMGFVDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSL 60

Query: 61  CNACGIRFRKRRISTTGTNK-GYDRKR-GVHNNGSTAMTTVSAATSSA---TTTTSGSGG 120
           CNACGIRFRKR+I T  TN+ G D+KR  V +N S+ +  VSA T+S+   TTTT+ + G
Sbjct: 61  CNACGIRFRKRKIFTRRTNRGGRDKKRERVRDNHSSTVAIVSATTTSSSGTTTTTTTTSG 120

Query: 121 GDGDENLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSC 180
            DGDEN GEC S RM +MM LEE+ +      VKK R Q  +K+GEEEKQAAVSLM LS 
Sbjct: 121 VDGDENSGECGSSRMKIMMGLEEDVM-----VVKKHRWQWQRKVGEEEKQAAVSLMALSN 180

BLAST of Clc03G19040 vs. ExPASy TrEMBL
Match: A0A0A0M1G9 (GATA-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G569090 PE=4 SV=1)

HSP 1 Score: 188.0 bits (476), Expect = 3.5e-44
Identity = 106/164 (64.63%), Positives = 126/164 (76.83%), Query Frame = 0

Query: 21  MGLMDLRQKGLLLPDTKYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNK 80
           MG MDL QKGLLL DTK CVDCKTTKTPLWR GPTGPKSLCNACGIRFRKRRIST GTN+
Sbjct: 1   MGFMDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRRISTRGTNR 60

Query: 81  GYDRKRGVHNNGSTAMTTVSAATSSAT----TTTSGSGGGDGDENLGECESLRMTLMMAL 140
              ++  V++N S+A+ TVSA T+S++    TTT+ S G DGDEN GEC SLRM LMM+L
Sbjct: 61  RDKKREKVNDNHSSAVATVSATTTSSSGTTITTTTSSSGVDGDENSGECGSLRMRLMMSL 120

Query: 141 EEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           EE+ +      VKKQ+ Q  +K+GEEEKQAA+SL+ LS  S+ S
Sbjct: 121 EEDVM-----VVKKQQWQWQRKVGEEEKQAAMSLIALSNDSLIS 159

BLAST of Clc03G19040 vs. ExPASy TrEMBL
Match: A0A6J1C5A4 (GATA transcription factor 17-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111007526 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 6.9e-40
Identity = 109/177 (61.58%), Positives = 128/177 (72.32%), Query Frame = 0

Query: 16  FLNR-EMGLMDL---RQKGLLLPDT-KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRK 75
           FLN+ EMG+MD+   + K  +  DT KYCVDCKTTKTPLWR GP GPKSLCNACGIRFRK
Sbjct: 49  FLNKAEMGMMDVLRRKNKERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRK 108

Query: 76  RRISTTGTNKGYDRKR-GVHNNGSTAMTTVSAATSSATTTT---SGSGGGDG---DENLG 135
           RR+ST GTN+G DRKR   H++G +    +SA TSS+ T     S +GG DG   +E+LG
Sbjct: 109 RRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLG 168

Query: 136 ECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           EC SLRM LMMAL EE V  +   + KQR   P+KLGEEE QAAVSLM LSCGSVF+
Sbjct: 169 ECGSLRMRLMMALGEEVV--VQQNISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA 220

BLAST of Clc03G19040 vs. ExPASy TrEMBL
Match: A0A6J1C373 (GATA transcription factor 17-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007526 PE=4 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 1.5e-39
Identity = 111/178 (62.36%), Positives = 129/178 (72.47%), Query Frame = 0

Query: 16  FLNR-EMGLMD-LRQKG---LLLPDT-KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFR 75
           FLN+ EMG+MD LR+K     +  DT KYCVDCKTTKTPLWR GP GPKSLCNACGIRFR
Sbjct: 49  FLNKAEMGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFR 108

Query: 76  KRRISTTGTNKGYDRKR-GVHNNGSTAMTTVSAATSSATTTT---SGSGGGDG---DENL 135
           KRR+ST GTN+G DRKR   H++G +    +SA TSS+ T     S +GG DG   +E+L
Sbjct: 109 KRRVSTIGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDL 168

Query: 136 GECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           GEC SLRM LMMAL EE V  +   + KQR   P+KLGEEE QAAVSLM LSCGSVF+
Sbjct: 169 GECGSLRMRLMMALGEEVV--VQQNISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA 221

BLAST of Clc03G19040 vs. ExPASy TrEMBL
Match: A0A6J1C1I3 (GATA transcription factor 16-like isoform X4 OS=Momordica charantia OX=3673 GN=LOC111007526 PE=4 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 1.7e-38
Identity = 105/171 (61.40%), Positives = 123/171 (71.93%), Query Frame = 0

Query: 21  MGLMDL---RQKGLLLPDT-KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTT 80
           MG+MD+   + K  +  DT KYCVDCKTTKTPLWR GP GPKSLCNACGIRFRKRR+ST 
Sbjct: 1   MGMMDVLRRKNKERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI 60

Query: 81  GTNKGYDRKR-GVHNNGSTAMTTVSAATSSATTTT---SGSGGGDG---DENLGECESLR 140
           GTN+G DRKR   H++G +    +SA TSS+ T     S +GG DG   +E+LGEC SLR
Sbjct: 61  GTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLR 120

Query: 141 MTLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
           M LMMAL EE V  +   + KQR   P+KLGEEE QAAVSLM LSCGSVF+
Sbjct: 121 MRLMMALGEEVV--VQQNISKQR--PPRKLGEEE-QAAVSLMALSCGSVFA 166

BLAST of Clc03G19040 vs. TAIR 10
Match: AT5G49300.1 (GATA transcription factor 16 )

HSP 1 Score: 82.4 bits (202), Expect = 4.0e-16
Identity = 64/145 (44.14%), Positives = 76/145 (52.41%), Query Frame = 0

Query: 37  KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGSTAM 96
           K C DC T+KTPLWR GP GPKSLCNACGIR RK+R   T  NK   +            
Sbjct: 36  KTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRRGGTEDNKKLKK------------ 95

Query: 97  TTVSAATSSATTTTSGSGGGDGDENLGECESLRMTLM-MALEEEEVKNLPSAVKKQRCQR 156
                         S SGGG    N    ESL+ +LM + + +       S V+KQR   
Sbjct: 96  --------------SSSGGG----NRKFGESLKQSLMDLGIRKR------STVEKQR--- 139

Query: 157 PKKLGEEEKQAAVSLMELSCGSVFS 181
            +KLGEEE QAAV LM LS GSV++
Sbjct: 156 -QKLGEEE-QAAVLLMALSYGSVYA 139

BLAST of Clc03G19040 vs. TAIR 10
Match: AT3G16870.1 (GATA transcription factor 17 )

HSP 1 Score: 74.3 bits (181), Expect = 1.1e-13
Identity = 61/158 (38.61%), Positives = 84/158 (53.16%), Query Frame = 0

Query: 35  DTK-YCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGS 94
           DTK  CVDC T +TPLWR GP GPKSLCNACGI+ RK+R +  G  +  ++K+   +N +
Sbjct: 39  DTKRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGM-RSEEKKKNRKSNCN 98

Query: 95  TAMTTVSAATSSATTTTSGSGGGDGDENLGECESLRMT-----------LMMALEEEEVK 154
             +                 G  D D++   C + R +           L +  +   +K
Sbjct: 99  NDLNLDHRNAKKYKINIVDDGKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKVPVMK 158

Query: 155 NLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCGSVFS 181
              SAV+K+R  R  KLGEEE+ AAV LM LSC SV++
Sbjct: 159 R--SAVEKKRLWR--KLGEEER-AAVLLMALSCSSVYA 190

BLAST of Clc03G19040 vs. TAIR 10
Match: AT3G06740.1 (GATA transcription factor 15 )

HSP 1 Score: 71.6 bits (174), Expect = 7.1e-13
Identity = 58/143 (40.56%), Positives = 69/143 (48.25%), Query Frame = 0

Query: 35  DTKYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGST 94
           + K C  C T+KTPLWR GP GPKSLCNACGIR RK+R  T  +N+  D+K+  HN    
Sbjct: 39  EKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR-RTLISNRSEDKKKKSHN---- 98

Query: 95  AMTTVSAATSSATTTTSGSGGGDGDENLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQ 154
                                     N    +SL+  L M L  E +    +A      Q
Sbjct: 99  -------------------------RNPKFGDSLKQRL-MELGREVMMQRSTAEN----Q 145

Query: 155 RPKKLGEEEKQAAVSLMELSCGS 178
           R  KLGEEE QAAV LM LS  S
Sbjct: 159 RRNKLGEEE-QAAVLLMALSYAS 145

BLAST of Clc03G19040 vs. TAIR 10
Match: AT4G16141.1 (GATA type zinc finger transcription factor family protein )

HSP 1 Score: 68.2 bits (165), Expect = 7.9e-12
Identity = 57/166 (34.34%), Positives = 74/166 (44.58%), Query Frame = 0

Query: 37  KYCVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTG-------------TNKGYD 96
           K CVDC T++TPLWR GP GPKSLCNACGI+ RK+R +  G              N G +
Sbjct: 37  KTCVDCGTSRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIRQDDIKIKSKSNNNLGLE 96

Query: 97  RKRGVHNNGSTAMTTVSAATSSATTTTSGSGG-------------GDGDENLGECESLRM 156
            +      G      ++           G  G                + N    + +  
Sbjct: 97  SRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKIKRDPENSSSSNNNKKNVKRVGR 156

Query: 157 TLMMALEEEEVKNLPSAVKKQRCQRPKKLGEEEKQAAVSLMELSCG 177
            L    +   +K   SAV+K+R  R  KLGEEE+ AAV LM LSCG
Sbjct: 157 FLDFGFKVPAMKR--SAVEKKRLWR--KLGEEER-AAVLLMALSCG 197

BLAST of Clc03G19040 vs. TAIR 10
Match: AT5G26930.1 (GATA transcription factor 23 )

HSP 1 Score: 65.5 bits (158), Expect = 5.1e-11
Identity = 52/142 (36.62%), Positives = 65/142 (45.77%), Query Frame = 0

Query: 39  CVDCKTTKTPLWRVGPTGPKSLCNACGIRFRKRRISTTGTNKGYDRKRGVHNNGSTAMTT 98
           C +CKTTKTP+WR GPTGPKSLCNACGIR RK+R S            G+H      + +
Sbjct: 28  CSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQRRS---------ELLGIH-----IIRS 87

Query: 99  VSAATSSATTTTSGSGGGDGDENLGECESLRMTLMMALEEEEVKNLPSAVKKQRCQRPKK 158
             +  S      S S GG                              AVKK+R  +   
Sbjct: 88  HKSLASKKINLLSSSHGG-----------------------------VAVKKRRSLK--- 120

Query: 159 LGEEEKQAAVSLMELSCGSVFS 181
              EE+QAA+ L+ LSC SV +
Sbjct: 148 ---EEEQAALCLLLLSCSSVLA 120

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)   Description
XP_038880207.1    2.0e-70    82.26          GATA transcription factor 17-like [Benincasa hispida]
XP_008450587.1    6.4e-48    64.86          PREDICTED: GATA transcription factor 16-like [Cucumis melo]
XP_011659732.1    7.3e-44    64.63          GATA transcription factor 16 [Cucumis sativus] >KGN66031.1 hypothetical protein ...
XP_022135613.1    1.4e-39    61.58          GATA transcription factor 17-like isoform X2 [Momordica charantia]
XP_022135612.1    3.2e-39    62.36          GATA transcription factor 17-like isoform X1 [Momordica charantia]

ExPASy Swiss-Prot
Match Name        E-value    Identity (%)   Description
Q9FJ10            5.7e-15    44.14          GATA transcription factor 16 OS=Arabidopsis thaliana OX=3702 GN=GATA16 PE=2 SV=1
Q9LIB5            1.5e-12    38.61          GATA transcription factor 17 OS=Arabidopsis thaliana OX=3702 GN=GATA17 PE=2 SV=1
Q8LG10            1.0e-11    40.56          GATA transcription factor 15 OS=Arabidopsis thaliana OX=3702 GN=GATA15 PE=2 SV=2
Q9SZI6            1.9e-10    69.05          Putative GATA transcription factor 22 OS=Arabidopsis thaliana OX=3702 GN=GATA22 ...
Q6YW48            2.5e-10    67.39          Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 OS=Oryza sativa subsp. ...

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A1S3BQ71        3.1e-48    64.86          GATA transcription factor 16-like OS=Cucumis melo OX=3656 GN=LOC103492133 PE=4 S...
A0A0A0M1G9        3.5e-44    64.63          GATA-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G569090 P...
A0A6J1C5A4        6.9e-40    61.58          GATA transcription factor 17-like isoform X2 OS=Momordica charantia OX=3673 GN=L...
A0A6J1C373        1.5e-39    62.36          GATA transcription factor 17-like isoform X1 OS=Momordica charantia OX=3673 GN=L...
A0A6J1C1I3        1.7e-38    61.40          GATA transcription factor 16-like isoform X4 OS=Momordica charantia OX=3673 GN=L...

TAIR 10
Match Name        E-value    Identity (%)   Description
AT5G49300.1       4.0e-16    44.14          GATA transcription factor 16
AT3G16870.1       1.1e-13    38.61          GATA transcription factor 17
AT3G06740.1       7.1e-13    40.56          GATA transcription factor 15
AT4G16141.1       7.9e-12    34.34          GATA type zinc finger transcription factor family protein
AT5G26930.1       5.1e-11    36.62          GATA transcription factor 23
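
The identity values in the BLAST summaries are the number of identical positions divided by the alignment length, as given on each HSP statistics line. A minimal sketch of recovering them from such a line is shown below. `parse_hsp_stats` and `percent` are hypothetical helpers, and the regular expression is written against the line layout used in this record only.

```python
import re

# Matches the HSP statistics line as printed in this record.
# "Posi?tives" also tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, _, positive, _, _ = m.groups()
    identical, length, positive = int(identical), int(length), int(positive)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        "identity_pct": percent(identical, length),
        "positive_pct": percent(positive, length),
    }


stats = parse_hsp_stats(
    "Identity = 153/186 (82.26%), Positives = 158/186 (84.95%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the top NCBI nr hit this reproduces the reported 82.26% identity over an alignment of 186 positions.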
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term    IPR Description               Source        Source Term      Source Description                                  Alignment
IPR000679   Zinc finger, GATA-type        SMART         SM00401          GATA_3                                              coord: 33..95; e-value: 8.0E-12; score: 55.3
IPR000679   Zinc finger, GATA-type        PFAM          PF00320          GATA                                                coord: 39..72; e-value: 2.5E-17; score: 62.2
IPR000679   Zinc finger, GATA-type        PROSITE       PS50114          GATA_ZN_FINGER_2                                    coord: 39..69; score: 12.492275
IPR000679   Zinc finger, GATA-type        CDD           cd00202          ZnF_GATA                                            coord: 39..73; e-value: 6.63943E-13; score: 58.5382
IPR013088   Zinc finger, NHR/GATA-type    GENE3D        3.30.50.10       -                                                   coord: 35..97; e-value: 3.4E-16; score: 61.0
None        No IPR available              MOBIDB_LITE   mobidb-lite      disorder_prediction                                 coord: 90..116
None        No IPR available              MOBIDB_LITE   mobidb-lite      disorder_prediction                                 coord: 75..123
None        No IPR available              PANTHER       PTHR47172:SF9    GATA TRANSCRIPTION FACTOR 16                        coord: 36..179
None        No IPR available              PANTHER       PTHR47172        OS01G0976800 PROTEIN                                coord: 36..179
None        No IPR available              SUPERFAMILY   57716            Glucocorticoid receptor-like (DNA-binding domain)   coord: 36..73

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Clc03G19040.1    Clc03G19040.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006355       regulation of transcription, DNA-templated
molecular_function   GO:0043565       sequence-specific DNA binding
molecular_function   GO:0008270       zinc ion binding