HG10007039 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10007039
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: GATA transcription factor 16-like
Location: Chr10: 640036 .. 641206 (-)
RNA-Seq Expression: HG10007039
Synteny: HG10007039
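
The Location field can be split into chromosome, start, end and strand. The sketch below is illustrative only: `parse_location` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Location string as shown in the Overview section.
location = "Chr10: 640036 .. 641206 (-)"

def parse_location(text):
    """Split 'Chr: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return {"chrom": chrom, "start": int(start), "end": int(end), "strand": strand}

loc = parse_location(location)

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = loc["end"] - loc["start"] + 1
print(loc, span_length)
```

Under that assumption the annotated gene spans 1171 bp on the minus strand of Chr10.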
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTTGATGGATTTGAGGCAAAAGGTCAGAATCACCAGCCCCAAAAGCCCCTTTCAGACGAATCTTTCTTATTTTTTTACTGATTTTCTTAATTAGACTCCTCTGTTTCAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTTGATTGTAAGACAACCAAGACTCCTTTGTGGCGTGGAGGCCCTACTGGACCTAAGGTTCTTCTCTTTCTTTATATTTTCGTTTTTGCTTATGTTTGTCTTGATTTCCTGGGTTTGTTTCTGTTGACAAGATTTGGATTCATTCTCCATTTCTTCTCGACTCTGTTCTAATGTTTTCTTGTTTGATCTTTGCCAGTCTGTTTTTGTTTGATCTTGAGCTTATGAGTTTTGATTCTTGATCTATTTATGGTTTATAAGTTTCCCTTGTAATTGGGTTTCATTAACCTGATCATCGTCTGTTTCTTCTACTGTTTGATTTTGAGCTTTTCTGTTACGATCATGTGTTCTGTTTTGGCACTGAGTTCACTAGAATTTGGGTTTCTCCTCTGTTCATGAAGATGATCCTACTTTTCATTTATGTTCTATTTTTACTGGGTTTCTCCTCTTTTCACTTAGATGATGAGCTTTCCCTTTCATTTACTTTCCTCTCAAATTCTTTGTATGATCTTGAGCCTCTGATTTGGATTTACATTTGTTTTTTTTAGTGGGGTTCTTCTCTGTTTATCAATCTAACCGTCCATTTCTTGTTGGAACTCGAAATCTTTATATTGTCAATTAAACAAACCCCTCTTCCCATTTGGGTTTTTTGCAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAGGAATATCCACGATAGGAACGAACAGAGGATGTGACAGGAAGAGAGAAGGAGTTCATAACAATGGCTCCTCCACCATGACCACCGTGTCAGCCACCACTTCATCGAGTGAGACAACAGCCACCACCACCTCTGGAGATGGGGATGAGAATTTGGGGGAATGTGGGTCATTGAGGATGAGATTGATGATGGCATCGGAGGAGGAGGTGATGGTGGTGCAGAATTTACCGTCGTCGGTGAAGAAACAGCGTTGTCGACGGCAGAGGAAGCTTGGGGAGGAGGAGAAGCAGGCAGCAGTGTCATTAATGGCGCTGTCATGTGGCTCTCTTTTTGCCTGA

mRNA sequence

ATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTTGATTGTAAGACAACCAAGACTCCTTTGTGGCGTGGAGGCCCTACTGGACCTAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAGGAATATCCACGATAGGAACGAACAGAGGATGTGACAGGAAGAGAGAAGGAGTTCATAACAATGGCTCCTCCACCATGACCACCGTGTCAGCCACCACTTCATCGAGTGAGACAACAGCCACCACCACCTCTGGAGATGGGGATGAGAATTTGGGGGAATGTGGGTCATTGAGGATGAGATTGATGATGGCATCGGAGGAGGAGGTGATGGTGGTGCAGAATTTACCGTCGTCGGTGAAGAAACAGCGTTGTCGACGGCAGAGGAAGCTTGGGGAGGAGGAGAAGCAGGCAGCAGTGTCATTAATGGCGCTGTCATGTGGCTCTCTTTTTGCCTGA

Coding sequence (CDS)

ATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTTGATTGTAAGACAACCAAGACTCCTTTGTGGCGTGGAGGCCCTACTGGACCTAAGTCACTGTGCAATGCATGCGGGATCAGGTTTAGAAAGAGAGGAATATCCACGATAGGAACGAACAGAGGATGTGACAGGAAGAGAGAAGGAGTTCATAACAATGGCTCCTCCACCATGACCACCGTGTCAGCCACCACTTCATCGAGTGAGACAACAGCCACCACCACCTCTGGAGATGGGGATGAGAATTTGGGGGAATGTGGGTCATTGAGGATGAGATTGATGATGGCATCGGAGGAGGAGGTGATGGTGGTGCAGAATTTACCGTCGTCGGTGAAGAAACAGCGTTGTCGACGGCAGAGGAAGCTTGGGGAGGAGGAGAAGCAGGCAGCAGTGTCATTAATGGCGCTGTCATGTGGCTCTCTTTTTGCCTGA

Protein sequence

MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA
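
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal illustration, not the pipeline used to produce this record: it uses only the first 60 nucleotides of the CDS and the first 20 residues of the protein, and `translate` is a hypothetical helper. The full sequences can be substituted in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt of the CDS and first 20 residues of the protein from this record.
cds_prefix = "ATGGGTTTGATGGATTTGAGGCAAAAGGGACTGTTGCTGCCGGACACTAAATGTTGTGTT"
protein_prefix = "MGLMDLRQKGLLLPDTKCCV"

translated = translate(cds_prefix)
print(translated, translated == protein_prefix)
```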
Homology
BLAST of HG10007039 vs. NCBI nr
Match: XP_038880207.1 (GATA transcription factor 17-like [Benincasa hispida])

HSP 1 Score: 270.4 bits (690), Expect = 1.0e-68
Identity = 143/166 (86.14%), Positives = 150/166 (90.36%), Query Frame = 0

Query: 1   MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR 60
           MG+MDLRQKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR ISTIGTNR
Sbjct: 21  MGMMDLRQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRRISTIGTNR 80

Query: 61  GCDRKREGVHNNGSSTMTTVSATTSSSETTATTTS----GDGDENLGECGSLRMRLMMAS 120
           G DRKRE VHNNGS+  TTVSATTSS+ TT TTTS    GDGDENLGECGSL MRLMMA 
Sbjct: 81  GYDRKRERVHNNGSTITTTVSATTSSTGTTTTTTSGSGDGDGDENLGECGSLGMRLMMAL 140

Query: 121 EEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
           EEEVMVVQNLPSSVKKQR +R+RKLGEEEKQAAVSLMALSCGS+ +
Sbjct: 141 EEEVMVVQNLPSSVKKQRFQRERKLGEEEKQAAVSLMALSCGSVLS 186
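
The percentages in each HSP header are the ratio of matching (or positively scoring) columns to the alignment length. The following sketch recomputes them from the header of the first hit; `parse_hsp` is a hypothetical helper written for this text layout, not part of any BLAST library.

```python
import re

# HSP header of the first NCBI nr hit, as printed in this record.
hsp_header = (
    "HSP 1 Score: 270.4 bits (690), Expect = 1.0e-68\n"
    "Identity = 143/166 (86.14%), Positives = 150/166 (90.36%), Query Frame = 0"
)

def parse_hsp(text):
    """Extract score, E-value and recomputed percentages (hypothetical helper)."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect = (\S+)", text)
    ident = re.search(r"Identity = (\d+)/(\d+)", text)
    # Tolerates a misspelt label such as 'Postives' in raw page output.
    posit = re.search(r"Pos\w*tives = (\d+)/(\d+)", text)
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "identity_pct": round(100 * int(ident.group(1)) / int(ident.group(2)), 2),
        "positives_pct": round(100 * int(posit.group(1)) / int(posit.group(2)), 2),
    }

stats = parse_hsp(hsp_header)
print(stats)
```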

BLAST of HG10007039 vs. NCBI nr
Match: XP_011659732.1 (GATA transcription factor 16 [Cucumis sativus] >KGN66031.1 hypothetical protein Csa_006937 [Cucumis sativus])

HSP 1 Score: 206.5 bits (524), Expect = 1.8e-49
Identity = 120/167 (71.86%), Positives = 135/167 (80.84%), Query Frame = 0

Query: 1   MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR 60
           MG MDL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR IST GTNR
Sbjct: 1   MGFMDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRRISTRGTNR 60

Query: 61  GCDRKREGVHNNGSSTMTTVSATTSSSE----TTATTTSG-DGDENLGECGSLRMRLMMA 120
             D+KRE V++N SS + TVSATT+SS     TT T++SG DGDEN GECGSLRMRLMM+
Sbjct: 61  R-DKKREKVNDNHSSAVATVSATTTSSSGTTITTTTSSSGVDGDENSGECGSLRMRLMMS 120

Query: 121 SEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
            EE+VMV       VKKQ+ + QRK+GEEEKQAA+SL+ALS  SL +
Sbjct: 121 LEEDVMV-------VKKQQWQWQRKVGEEEKQAAMSLIALSNDSLIS 159

BLAST of HG10007039 vs. NCBI nr
Match: XP_008450587.1 (PREDICTED: GATA transcription factor 16-like [Cucumis melo])

HSP 1 Score: 204.5 bits (519), Expect = 6.8e-49
Identity = 120/167 (71.86%), Positives = 131/167 (78.44%), Query Frame = 0

Query: 1   MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR 60
           MG +DL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR I T  TNR
Sbjct: 21  MGFVDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRKIFTRRTNR 80

Query: 61  -GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG-DGDENLGECGSLRMRLMMA 120
            G D+KRE V +N SST+  VSATT+SS    TT TTTSG DGDEN GECGS RM++MM 
Sbjct: 81  GGRDKKRERVRDNHSSTVAIVSATTTSSSGTTTTTTTTSGVDGDENSGECGSSRMKIMMG 140

Query: 121 SEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
            EE+VMV       VKK R + QRK+GEEEKQAAVSLMALS GSL +
Sbjct: 141 LEEDVMV-------VKKHRWQWQRKVGEEEKQAAVSLMALSNGSLIS 180

BLAST of HG10007039 vs. NCBI nr
Match: TYK10277.1 (GATA transcription factor 16-like [Cucumis melo var. makuwa])

HSP 1 Score: 193.0 bits (489), Expect = 2.0e-45
Identity = 120/186 (64.52%), Positives = 131/186 (70.43%), Query Frame = 0

Query: 1   MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPK-------------------SLC 60
           MG +DL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPK                   SLC
Sbjct: 1   MGFVDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKEIFISSIKQTSLPIWGFLQSLC 60

Query: 61  NACGIRFRKRGISTIGTNR-GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG- 120
           NACGIRFRKR I T  TNR G D+KRE V +N SST+  VSATT+SS    TT TTTSG 
Sbjct: 61  NACGIRFRKRKIFTRRTNRGGRDKKRERVRDNHSSTVAIVSATTTSSSGTTTTTTTTSGV 120

Query: 121 DGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALS 163
           DGDEN GECGS RM++MM  EE+VMV       VKK R + QRK+GEEEKQAAVSLMALS
Sbjct: 121 DGDENSGECGSSRMKIMMGLEEDVMV-------VKKHRWQWQRKVGEEEKQAAVSLMALS 179

BLAST of HG10007039 vs. NCBI nr
Match: XP_022135615.1 (GATA transcription factor 16-like isoform X4 [Momordica charantia])

HSP 1 Score: 191.0 bits (484), Expect = 7.8e-45
Identity = 115/173 (66.47%), Positives = 129/173 (74.57%), Query Frame = 0

Query: 1   MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTI 60
           MG+MD+   + K  +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STI
Sbjct: 1   MGMMDVLRRKNKERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI 60

Query: 61  GTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG-------DENLGECGSLR 120
           GTNRGCDRKRE  H++G ST   +SATTSSS T A   S +G       +E+LGECGSLR
Sbjct: 61  GTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLR 120

Query: 121 MRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
           MRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Sbjct: 121 MRLMMALGEEVVVQQN----ISKQ--RPPRKLGEEE-QAAVSLMALSCGSVFA 166

BLAST of HG10007039 vs. ExPASy Swiss-Prot
Match: Q9FJ10 (GATA transcription factor 16 OS=Arabidopsis thaliana OX=3702 GN=GATA16 PE=2 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 1.5e-14
Identity = 64/146 (43.84%), Positives = 77/146 (52.74%), Query Frame = 0

Query: 17  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSST 76
           K C DC T+KTPLWRGGP GPKSLCNACGIR RK              KR G        
Sbjct: 36  KTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRK--------------KRRG-------- 95

Query: 77  MTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCR 136
                  T  ++    ++SG G+   GE  SL+  LM     +        S+V+KQR  
Sbjct: 96  ------GTEDNKKLKKSSSGGGNRKFGE--SLKQSLMDLGIRK-------RSTVEKQR-- 139

Query: 137 RQRKLGEEEKQAAVSLMALSCGSLFA 163
             +KLGEEE QAAV LMALS GS++A
Sbjct: 156 --QKLGEEE-QAAVLLMALSYGSVYA 139

BLAST of HG10007039 vs. ExPASy Swiss-Prot
Match: Q8LG10 (GATA transcription factor 15 OS=Arabidopsis thaliana OX=3702 GN=GATA15 PE=2 SV=2)

HSP 1 Score: 76.3 bits (186), Expect = 3.7e-13
Identity = 62/145 (42.76%), Positives = 75/145 (51.72%), Query Frame = 0

Query: 15  DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGS 74
           + K C  C T+KTPLWRGGP GPKSLCNACGIR RK+   T+ +NR  D+K++  HN   
Sbjct: 39  EKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR-RTLISNRSEDKKKKS-HNRNP 98

Query: 75  STMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQR 134
                                  GD       SL+ RLM    E +M      S+ + Q 
Sbjct: 99  KF---------------------GD-------SLKQRLMELGREVMM----QRSTAENQ- 145

Query: 135 CRRQRKLGEEEKQAAVSLMALSCGS 160
             R+ KLGEEE QAAV LMALS  S
Sbjct: 159 --RRNKLGEEE-QAAVLLMALSYAS 145

BLAST of HG10007039 vs. ExPASy Swiss-Prot
Match: Q9LIB5 (GATA transcription factor 17 OS=Arabidopsis thaliana OX=3702 GN=GATA17 PE=2 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 6.2e-13
Identity = 63/162 (38.89%), Positives = 89/162 (54.94%), Query Frame = 0

Query: 15  DTK-CCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNG 74
           DTK  CVDC T +TPLWRGGP GPKSLCNACGI+ RK+  + +G      R  E   N  
Sbjct: 39  DTKRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGM-----RSEEKKKNRK 98

Query: 75  SSTMTTVSATTSSSETTATTTSGDG----DENLGECGSLRMRLMMASEE---------EV 134
           S+    ++    +++        DG    D++   C + R     +++          +V
Sbjct: 99  SNCNNDLNLDHRNAKKYKINIVDDGKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKV 158

Query: 135 MVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
            V++   S+V+K+R    RKLGEEE+ AAV LMALSC S++A
Sbjct: 159 PVMKR--SAVEKKRL--WRKLGEEER-AAVLLMALSCSSVYA 190

BLAST of HG10007039 vs. ExPASy Swiss-Prot
Match: Q8LC59 (GATA transcription factor 23 OS=Arabidopsis thaliana OX=3702 GN=GATA23 PE=2 SV=2)

HSP 1 Score: 74.7 bits (182), Expect = 1.1e-12
Identity = 29/35 (82.86%), Positives = 33/35 (94.29%), Query Frame = 0

Query: 17 KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR 52
          +CC +CKTTKTP+WRGGPTGPKSLCNACGIR RK+
Sbjct: 26 RCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQ 60

BLAST of HG10007039 vs. ExPASy Swiss-Prot
Match: Q6YW48 (Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 OS=Oryza sativa subsp. japonica OX=39947 GN=CGA1 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 6.5e-10
Identity = 27/34 (79.41%), Positives = 28/34 (82.35%), Query Frame = 0

Query: 17  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRK 51
           + C DC TTKTPLWR GP GPKSLCNACGIR RK
Sbjct: 176 RVCSDCNTTKTPLWRSGPCGPKSLCNACGIRQRK 209

BLAST of HG10007039 vs. ExPASy TrEMBL
Match: A0A0A0M1G9 (GATA-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G569090 PE=4 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 8.7e-50
Identity = 120/167 (71.86%), Positives = 135/167 (80.84%), Query Frame = 0

Query: 1   MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR 60
           MG MDL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR IST GTNR
Sbjct: 1   MGFMDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRRISTRGTNR 60

Query: 61  GCDRKREGVHNNGSSTMTTVSATTSSSE----TTATTTSG-DGDENLGECGSLRMRLMMA 120
             D+KRE V++N SS + TVSATT+SS     TT T++SG DGDEN GECGSLRMRLMM+
Sbjct: 61  R-DKKREKVNDNHSSAVATVSATTTSSSGTTITTTTSSSGVDGDENSGECGSLRMRLMMS 120

Query: 121 SEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
            EE+VMV       VKKQ+ + QRK+GEEEKQAA+SL+ALS  SL +
Sbjct: 121 LEEDVMV-------VKKQQWQWQRKVGEEEKQAAMSLIALSNDSLIS 159

BLAST of HG10007039 vs. ExPASy TrEMBL
Match: A0A1S3BQ71 (GATA transcription factor 16-like OS=Cucumis melo OX=3656 GN=LOC103492133 PE=4 SV=1)

HSP 1 Score: 204.5 bits (519), Expect = 3.3e-49
Identity = 120/167 (71.86%), Positives = 131/167 (78.44%), Query Frame = 0

Query: 1   MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR 60
           MG +DL QKGLLL DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR I T  TNR
Sbjct: 21  MGFVDLSQKGLLLADTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRKIFTRRTNR 80

Query: 61  -GCDRKREGVHNNGSSTMTTVSATTSSSE---TTATTTSG-DGDENLGECGSLRMRLMMA 120
            G D+KRE V +N SST+  VSATT+SS    TT TTTSG DGDEN GECGS RM++MM 
Sbjct: 81  GGRDKKRERVRDNHSSTVAIVSATTTSSSGTTTTTTTTSGVDGDENSGECGSSRMKIMMG 140

Query: 121 SEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
            EE+VMV       VKK R + QRK+GEEEKQAAVSLMALS GSL +
Sbjct: 141 LEEDVMV-------VKKHRWQWQRKVGEEEKQAAVSLMALSNGSLIS 180

BLAST of HG10007039 vs. ExPASy TrEMBL
Match: A0A6J1C1I3 (GATA transcription factor 16-like isoform X4 OS=Momordica charantia OX=3673 GN=LOC111007526 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 3.8e-45
Identity = 115/173 (66.47%), Positives = 129/173 (74.57%), Query Frame = 0

Query: 1   MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTI 60
           MG+MD+   + K  +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STI
Sbjct: 1   MGMMDVLRRKNKERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI 60

Query: 61  GTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG-------DENLGECGSLR 120
           GTNRGCDRKRE  H++G ST   +SATTSSS T A   S +G       +E+LGECGSLR
Sbjct: 61  GTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLR 120

Query: 121 MRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
           MRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Sbjct: 121 MRLMMALGEEVVVQQN----ISKQ--RPPRKLGEEE-QAAVSLMALSCGSVFA 166

BLAST of HG10007039 vs. ExPASy TrEMBL
Match: A0A6J1C5A4 (GATA transcription factor 17-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111007526 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 3.8e-45
Identity = 115/173 (66.47%), Positives = 129/173 (74.57%), Query Frame = 0

Query: 1   MGLMDL---RQKGLLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTI 60
           MG+MD+   + K  +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +STI
Sbjct: 55  MGMMDVLRRKNKERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVSTI 114

Query: 61  GTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG-------DENLGECGSLR 120
           GTNRGCDRKRE  H++G ST   +SATTSSS T A   S +G       +E+LGECGSLR
Sbjct: 115 GTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSLR 174

Query: 121 MRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
           MRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Sbjct: 175 MRLMMALGEEVVVQQN----ISKQ--RPPRKLGEEE-QAAVSLMALSCGSVFA 220

BLAST of HG10007039 vs. ExPASy TrEMBL
Match: A0A6J1C373 (GATA transcription factor 17-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007526 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 8.4e-45
Identity = 117/174 (67.24%), Positives = 130/174 (74.71%), Query Frame = 0

Query: 1   MGLMD-LRQKG---LLLPDT-KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGIST 60
           MG+MD LR+K     +  DT K CVDCKTTKTPLWRGGP GPKSLCNACGIRFRKR +ST
Sbjct: 55  MGMMDVLRRKNKVERVEDDTKKYCVDCKTTKTPLWRGGPAGPKSLCNACGIRFRKRRVST 114

Query: 61  IGTNRGCDRKREGVHNNGSSTMTTVSATTSSSETTATTTSGDG-------DENLGECGSL 120
           IGTNRGCDRKRE  H++G ST   +SATTSSS T A   S +G       +E+LGECGSL
Sbjct: 115 IGTNRGCDRKREKAHSHGGSTTAAMSATTSSSATAADAKSNNGGADGEEEEEDLGECGSL 174

Query: 121 RMRLMMASEEEVMVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
           RMRLMMA  EEV+V QN    + KQ  R  RKLGEEE QAAVSLMALSCGS+FA
Sbjct: 175 RMRLMMALGEEVVVQQN----ISKQ--RPPRKLGEEE-QAAVSLMALSCGSVFA 221

BLAST of HG10007039 vs. TAIR 10
Match: AT5G49300.1 (GATA transcription factor 16 )

HSP 1 Score: 80.9 bits (198), Expect = 1.1e-15
Identity = 64/146 (43.84%), Positives = 77/146 (52.74%), Query Frame = 0

Query: 17  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSST 76
           K C DC T+KTPLWRGGP GPKSLCNACGIR RK              KR G        
Sbjct: 36  KTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRK--------------KRRG-------- 95

Query: 77  MTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQRCR 136
                  T  ++    ++SG G+   GE  SL+  LM     +        S+V+KQR  
Sbjct: 96  ------GTEDNKKLKKSSSGGGNRKFGE--SLKQSLMDLGIRK-------RSTVEKQR-- 139

Query: 137 RQRKLGEEEKQAAVSLMALSCGSLFA 163
             +KLGEEE QAAV LMALS GS++A
Sbjct: 156 --QKLGEEE-QAAVLLMALSYGSVYA 139

BLAST of HG10007039 vs. TAIR 10
Match: AT3G06740.1 (GATA transcription factor 15 )

HSP 1 Score: 76.3 bits (186), Expect = 2.6e-14
Identity = 62/145 (42.76%), Positives = 75/145 (51.72%), Query Frame = 0

Query: 15  DTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGS 74
           + K C  C T+KTPLWRGGP GPKSLCNACGIR RK+   T+ +NR  D+K++  HN   
Sbjct: 39  EKKSCAICGTSKTPLWRGGPAGPKSLCNACGIRNRKKR-RTLISNRSEDKKKKS-HNRNP 98

Query: 75  STMTTVSATTSSSETTATTTSGDGDENLGECGSLRMRLMMASEEEVMVVQNLPSSVKKQR 134
                                  GD       SL+ RLM    E +M      S+ + Q 
Sbjct: 99  KF---------------------GD-------SLKQRLMELGREVMM----QRSTAENQ- 145

Query: 135 CRRQRKLGEEEKQAAVSLMALSCGS 160
             R+ KLGEEE QAAV LMALS  S
Sbjct: 159 --RRNKLGEEE-QAAVLLMALSYAS 145

BLAST of HG10007039 vs. TAIR 10
Match: AT3G16870.1 (GATA transcription factor 17 )

HSP 1 Score: 75.5 bits (184), Expect = 4.4e-14
Identity = 63/162 (38.89%), Positives = 89/162 (54.94%), Query Frame = 0

Query: 15  DTK-CCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNG 74
           DTK  CVDC T +TPLWRGGP GPKSLCNACGI+ RK+  + +G      R  E   N  
Sbjct: 39  DTKRTCVDCGTIRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGM-----RSEEKKKNRK 98

Query: 75  SSTMTTVSATTSSSETTATTTSGDG----DENLGECGSLRMRLMMASEE---------EV 134
           S+    ++    +++        DG    D++   C + R     +++          +V
Sbjct: 99  SNCNNDLNLDHRNAKKYKINIVDDGKIDIDDDPKICNNKRSSSSSSNKGVSKFLDLGFKV 158

Query: 135 MVVQNLPSSVKKQRCRRQRKLGEEEKQAAVSLMALSCGSLFA 163
            V++   S+V+K+R    RKLGEEE+ AAV LMALSC S++A
Sbjct: 159 PVMKR--SAVEKKRL--WRKLGEEER-AAVLLMALSCSSVYA 190

BLAST of HG10007039 vs. TAIR 10
Match: AT5G26930.1 (GATA transcription factor 23 )

HSP 1 Score: 74.7 bits (182), Expect = 7.6e-14
Identity = 29/35 (82.86%), Positives = 33/35 (94.29%), Query Frame = 0

Query: 17 KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKR 52
          +CC +CKTTKTP+WRGGPTGPKSLCNACGIR RK+
Sbjct: 26 RCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQ 60

BLAST of HG10007039 vs. TAIR 10
Match: AT4G16141.1 (GATA type zinc finger transcription factor family protein )

HSP 1 Score: 70.9 bits (172), Expect = 1.1e-12
Identity = 57/163 (34.97%), Positives = 78/163 (47.85%), Query Frame = 0

Query: 17  KCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNRGCDRKREGVHNNGSST 76
           K CVDC T++TPLWRGGP GPKSLCNACGI+ RK+  + +G  +   + +   +NN    
Sbjct: 37  KTCVDCGTSRTPLWRGGPAGPKSLCNACGIKSRKKRQAALGIRQDDIKIKSKSNNNLGLE 96

Query: 77  MTTVSATTSSSETTATTTSGDGDENL--GECGSLRMRLMMASEEEVMVVQNLPSSVKK-- 136
              V                 G   +  GE G+++ ++     E      N   +VK+  
Sbjct: 97  SRNVKTGKGEPVNVKIAKCEPGIVKIAKGEPGNVKNKI-KRDPENSSSSNNNKKNVKRVG 156

Query: 137 -----------------QRCRRQRKLGEEEKQAAVSLMALSCG 159
                            ++ R  RKLGEEE+ AAV LMALSCG
Sbjct: 157 RFLDFGFKVPAMKRSAVEKKRLWRKLGEEER-AAVLLMALSCG 197

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038880207.1 | 1.0e-68 | 86.14 | GATA transcription factor 17-like [Benincasa hispida]
XP_011659732.1 | 1.8e-49 | 71.86 | GATA transcription factor 16 [Cucumis sativus] >KGN66031.1 hypothetical protein ...
XP_008450587.1 | 6.8e-49 | 71.86 | PREDICTED: GATA transcription factor 16-like [Cucumis melo]
TYK10277.1 | 2.0e-45 | 64.52 | GATA transcription factor 16-like [Cucumis melo var. makuwa]
XP_022135615.1 | 7.8e-45 | 66.47 | GATA transcription factor 16-like isoform X4 [Momordica charantia]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9FJ10 | 1.5e-14 | 43.84 | GATA transcription factor 16 OS=Arabidopsis thaliana OX=3702 GN=GATA16 PE=2 SV=1
Q8LG10 | 3.7e-13 | 42.76 | GATA transcription factor 15 OS=Arabidopsis thaliana OX=3702 GN=GATA15 PE=2 SV=2
Q9LIB5 | 6.2e-13 | 38.89 | GATA transcription factor 17 OS=Arabidopsis thaliana OX=3702 GN=GATA17 PE=2 SV=1
Q8LC59 | 1.1e-12 | 82.86 | GATA transcription factor 23 OS=Arabidopsis thaliana OX=3702 GN=GATA23 PE=2 SV=2
Q6YW48 | 6.5e-10 | 79.41 | Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 OS=Oryza sativa subsp. ...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0M1G9 | 8.7e-50 | 71.86 | GATA-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G569090 P...
A0A1S3BQ71 | 3.3e-49 | 71.86 | GATA transcription factor 16-like OS=Cucumis melo OX=3656 GN=LOC103492133 PE=4 S...
A0A6J1C1I3 | 3.8e-45 | 66.47 | GATA transcription factor 16-like isoform X4 OS=Momordica charantia OX=3673 GN=L...
A0A6J1C5A4 | 3.8e-45 | 66.47 | GATA transcription factor 17-like isoform X2 OS=Momordica charantia OX=3673 GN=L...
A0A6J1C373 | 8.4e-45 | 67.24 | GATA transcription factor 17-like isoform X1 OS=Momordica charantia OX=3673 GN=L...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G49300.1 | 1.1e-15 | 43.84 | GATA transcription factor 16
AT3G06740.1 | 2.6e-14 | 42.76 | GATA transcription factor 15
AT3G16870.1 | 4.4e-14 | 38.89 | GATA transcription factor 17
AT5G26930.1 | 7.6e-14 | 82.86 | GATA transcription factor 23
AT4G16141.1 | 1.1e-12 | 34.97 | GATA type zinc finger transcription factor family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000679 | Zinc finger, GATA-type | SMART | SM00401 | GATA_3 | coord: 13..69; e-value: 2.6E-13; score: 60.2
IPR000679 | Zinc finger, GATA-type | PFAM | PF00320 | GATA | coord: 19..52; e-value: 8.0E-18; score: 63.8
IPR000679 | Zinc finger, GATA-type | PROSITE | PS00344 | GATA_ZN_FINGER_1 | coord: 19..44
IPR000679 | Zinc finger, GATA-type | PROSITE | PS50114 | GATA_ZN_FINGER_2 | coord: 19..54; score: 12.980124
IPR000679 | Zinc finger, GATA-type | CDD | cd00202 | ZnF_GATA | coord: 19..53; e-value: 1.34027E-12; score: 57.3826
IPR013088 | Zinc finger, NHR/GATA-type | GENE3D | 3.30.50.10 | (none) | coord: 15..61; e-value: 7.0E-15; score: 56.4
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 71..100
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 64..102
None | No IPR available | PANTHER | PTHR47172:SF9 | GATA TRANSCRIPTION FACTOR 16 | coord: 16..162
None | No IPR available | PANTHER | PTHR47172 | OS01G0976800 PROTEIN | coord: 16..162
None | No IPR available | SUPERFAMILY | 57716 | Glucocorticoid receptor-like (DNA-binding domain) | coord: 16..53
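
The domain coordinates above can be mapped back onto the protein sequence. The sketch below extracts the Pfam GATA region (19..52) and looks for the four-cysteine zinc-finger spacing typical of GATA-type domains. It assumes the InterPro coordinates are 1-based and inclusive; `region` is a hypothetical helper, and the regular expression is an illustrative pattern rather than the PROSITE or Pfam model itself.

```python
import re

# First 60 residues of the protein sequence from this record.
protein_prefix = "MGLMDLRQKGLLLPDTKCCVDCKTTKTPLWRGGPTGPKSLCNACGIRFRKRGISTIGTNR"

def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

# Pfam PF00320 (GATA) hit reported at coord 19..52.
pfam_gata = region(protein_prefix, 19, 52)

# Illustrative C-X2-C-X(17..20)-C-X2-C pattern; the spacer is captured to measure it.
motif = re.search(r"C.{2}C(.{17,20})C.{2}C", pfam_gata)
spacer_len = len(motif.group(1)) if motif else None

print(pfam_gata)
print(motif.group(0) if motif else None, spacer_len)
```

For this sequence the pattern matches with an 18-residue spacer between the two cysteine pairs.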

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10007039.1 | HG10007039.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0008270 | zinc ion binding