CsaV3_4G023740 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_4G023740
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: GATA transcription factor 29-like
Location: chr4:13807043..13807480 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

GTGATGGCGAACAATCAAGGGTTAACCAGAAATAATGAAATGATGAGTGAACCAAATTACAATAACCCTCACCAAATTGTATGGTCAAAAGATCTAAAAACCTATGTTCTTAAGTCAAATAATAATTTTGGGCAGCAGCTTCATCCAAGTTCATACCCTAATACTCATCAAGCAAAGCCCATACCCACCAGTACTCCAAATAACACCCCAAATCTTCCTCTTCCAAAACGAGGACACAATAATGACCTATACACACTCCTCAACCCTGCTTCTAGAGTAGCAGATGATCATGAAGCCGAAGCTTCCTCCGCTGGTCGGCGAAAAGGGTCGCGACGACGTCGAGTTTCAGCCACCAATGATGTCGAGAGAAGGTGTACCAATTACAATTGCAACACCAACTTCACACCCATGTGGCGTAAAGGTCCTCTTGGTCCTAAG

mRNA sequence

GTGATGGCGAACAATCAAGGGTTAACCAGAAATAATGAAATGATGAGTGAACCAAATTACAATAACCCTCACCAAATTGTATGGTCAAAAGATCTAAAAACCTATGTTCTTAAGTCAAATAATAATTTTGGGCAGCAGCTTCATCCAAGTTCATACCCTAATACTCATCAAGCAAAGCCCATACCCACCAGTACTCCAAATAACACCCCAAATCTTCCTCTTCCAAAACGAGGACACAATAATGACCTATACACACTCCTCAACCCTGCTTCTAGAGTAGCAGATGATCATGAAGCCGAAGCTTCCTCCGCTGGTCGGCGAAAAGGGTCGCGACGACGTCGAGTTTCAGCCACCAATGATGTCGAGAGAAGGTGTACCAATTACAATTGCAACACCAACTTCACACCCATGTGGCGTAAAGGTCCTCTTGGTCCTAAG

Coding sequence (CDS)

GTGATGGCGAACAATCAAGGGTTAACCAGAAATAATGAAATGATGAGTGAACCAAATTACAATAACCCTCACCAAATTGTATGGTCAAAAGATCTAAAAACCTATGTTCTTAAGTCAAATAATAATTTTGGGCAGCAGCTTCATCCAAGTTCATACCCTAATACTCATCAAGCAAAGCCCATACCCACCAGTACTCCAAATAACACCCCAAATCTTCCTCTTCCAAAACGAGGACACAATAATGACCTATACACACTCCTCAACCCTGCTTCTAGAGTAGCAGATGATCATGAAGCCGAAGCTTCCTCCGCTGGTCGGCGAAAAGGGTCGCGACGACGTCGAGTTTCAGCCACCAATGATGTCGAGAGAAGGTGTACCAATTACAATTGCAACACCAACTTCACACCCATGTGGCGTAAAGGTCCTCTTGGTCCTAAG

Protein sequence

VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKPIPTSTPNNTPNLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATNDVERRCTNYNCNTNFTPMWRKGPLGPK
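
As a consistency check on the record above, the following minimal sketch derives the feature length from the listed location and translates the first codons of the CDS with the standard genetic code. It assumes the coordinates are 1-based and inclusive (the record does not state this), and the helper names `feature_length` and `translate` are hypothetical, not part of any database API.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate codon by codon; a trailing partial codon is ignored,
    and codons with non-ACGT letters become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# Location from the record: chr4:13807043..13807480 (+)
length = feature_length(13807043, 13807480)
print(length, length // 3)  # 438 nucleotides, 146 codons

# First 21 nucleotides of the CDS listed above.
print(translate("GTGATGGCGAACAATCAAGGG"))  # VMANNQG
```

Under these assumptions the 438-nucleotide span corresponds to 146 codons, which agrees with the 146-residue protein sequence, and the translated prefix matches the first seven residues of that sequence.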
BLAST of CsaV3_4G023740 vs. NCBI nr
Match: XP_011654362.1 (PREDICTED: GATA transcription factor 29 [Cucumis sativus] >KGN54112.1 hypothetical protein Csa_4G286370 [Cucumis sativus])

HSP 1 Score: 285.8 bits (730), Expect = 8.2e-74
Identity = 146/146 (100.00%), Positives = 146/146 (100.00%), Query Frame = 0

Query: 1   VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKP 60
           VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKP
Sbjct: 4   VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKP 63

Query: 61  IPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATND 120
           IPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATND
Sbjct: 64  IPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATND 123

Query: 121 VERRCTNYNCNTNFTPMWRKGPLGPK 147
           VERRCTNYNCNTNFTPMWRKGPLGPK
Sbjct: 124 VERRCTNYNCNTNFTPMWRKGPLGPK 149

BLAST of CsaV3_4G023740 vs. NCBI nr
Match: XP_022985849.1 (GATA zinc finger domain-containing protein 8-like [Cucurbita maxima])

HSP 1 Score: 188.0 bits (476), Expect = 2.3e-44
Identity = 104/147 (70.75%), Positives = 111/147 (75.51%), Query Frame = 0

Query: 4   NNQGLT----RNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAK 63
           NN GL     RN EMMSE NY NPHQIVWSK+LKTYVLKS NNFG QLHPS Y NT+ AK
Sbjct: 97  NNHGLASSGGRNGEMMSEANY-NPHQIVWSKELKTYVLKS-NNFG-QLHPSLYCNTYPAK 156

Query: 64  PIPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATN 123
              T          LP + +NND YTLLNPA R ADDHEAEASS  RR+G +RRRVSA N
Sbjct: 157 ---TVGSTANSTANLPLQNYNNDQYTLLNPACRKADDHEAEASSGSRRRGLQRRRVSACN 216

Query: 124 DVERRCTNYNCNTNFTPMWRKGPLGPK 147
           +VERRCTNYNCNTNFTPMWRKGPLGPK
Sbjct: 217 EVERRCTNYNCNTNFTPMWRKGPLGPK 237

BLAST of CsaV3_4G023740 vs. NCBI nr
Match: XP_022943672.1 (GATA zinc finger domain-containing protein 8-like [Cucurbita moschata])

HSP 1 Score: 169.5 bits (428), Expect = 8.6e-39
Identity = 98/148 (66.22%), Positives = 106/148 (71.62%), Query Frame = 0

Query: 4   NNQGLT----RNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAK 63
           NN GL     R+ EMMSE NY NPHQIVWSK+LKTYVLKS NNFG QLHPS Y NT+ AK
Sbjct: 97  NNHGLASSGGRSGEMMSEANY-NPHQIVWSKELKTYVLKS-NNFG-QLHPSLYCNTYPAK 156

Query: 64  PIPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEA-SSAGRRKGSRRRRVSAT 123
            I +          LP + +NND YTLLNPA R  DDHEAEA        G +RRRVSA 
Sbjct: 157 TIGSTANGMPN---LPLQNYNNDQYTLLNPACRKTDDHEAEAXXXXXXXXGLQRRRVSAC 216

Query: 124 NDVERRCTNYNCNTNFTPMWRKGPLGPK 147
           N+VERRCTNYNCNTNFTPMWRKGPLGPK
Sbjct: 217 NEVERRCTNYNCNTNFTPMWRKGPLGPK 238

BLAST of CsaV3_4G023740 vs. NCBI nr
Match: XP_023522473.1 (GATA transcription factor 21-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 169.5 bits (428), Expect = 8.6e-39
Identity = 98/148 (66.22%), Positives = 106/148 (71.62%), Query Frame = 0

Query: 4   NNQGLT----RNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAK 63
           NN GL     R+ EMMSE NY NPHQIVWSK+LKTYVLKS NNFG QLHPS Y NT+ AK
Sbjct: 4   NNHGLASSGGRSGEMMSEANY-NPHQIVWSKELKTYVLKS-NNFG-QLHPSLYCNTYPAK 63

Query: 64  PIPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEA-SSAGRRKGSRRRRVSAT 123
            I +          LP + +NND YTLLNPA R  DDHEAEA        G +RRRVSA 
Sbjct: 64  TIGSTANSMAN---LPLQNYNNDQYTLLNPACRKTDDHEAEAXXXXXXXXGLQRRRVSAC 123

Query: 124 NDVERRCTNYNCNTNFTPMWRKGPLGPK 147
           N+VERRCTNYNCNTNFTPMWRKGPLGPK
Sbjct: 124 NEVERRCTNYNCNTNFTPMWRKGPLGPK 145

BLAST of CsaV3_4G023740 vs. NCBI nr
Match: XP_023513295.1 (putative GATA transcription factor 22 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 169.5 bits (428), Expect = 8.6e-39
Identity = 98/148 (66.22%), Positives = 106/148 (71.62%), Query Frame = 0

Query: 4   NNQGLT----RNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAK 63
           NN GL     R+ EMMSE NY NPHQIVWSK+LKTYVLKS NNFG QLHPS Y NT+ AK
Sbjct: 97  NNHGLASSGGRSGEMMSEANY-NPHQIVWSKELKTYVLKS-NNFG-QLHPSLYCNTYPAK 156

Query: 64  PIPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEA-SSAGRRKGSRRRRVSAT 123
            I +          LP + +NND YTLLNPA R  DDHEAEA        G +RRRVSA 
Sbjct: 157 TIGSTANSMAN---LPLQNYNNDQYTLLNPACRKTDDHEAEAXXXXXXXXGLQRRRVSAC 216

Query: 124 NDVERRCTNYNCNTNFTPMWRKGPLGPK 147
           N+VERRCTNYNCNTNFTPMWRKGPLGPK
Sbjct: 217 NEVERRCTNYNCNTNFTPMWRKGPLGPK 238

BLAST of CsaV3_4G023740 vs. TAIR10
Match: AT3G20750.1 (GATA transcription factor 29)

HSP 1 Score: 43.5 bits (101), Expect = 1.3e-04
Identity = 23/73 (31.51%), Positives = 39/73 (53.42%), Query Frame = 0

Query: 81  NDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATN-------DVERRCTNYNCNTN 140
           +D Y L++  +R A  + +   +   ++ +  +R+           +  ++CTN NCN  
Sbjct: 108 SDEYVLIDVPARRARRNNSTVMTNSWKENATPKRIRGCGGFCGGRIEGMKKCTNMNCNAL 167

Query: 141 FTPMWRKGPLGPK 147
            TPMWR+GPLGPK
Sbjct: 168 NTPMWRRGPLGPK 180

BLAST of CsaV3_4G023740 vs. TrEMBL
Match: tr|A0A0A0L0J0|A0A0A0L0J0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G286370 PE=4 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 5.4e-74
Identity = 146/146 (100.00%), Positives = 146/146 (100.00%), Query Frame = 0

Query: 1   VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKP 60
           VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKP
Sbjct: 4   VMANNQGLTRNNEMMSEPNYNNPHQIVWSKDLKTYVLKSNNNFGQQLHPSSYPNTHQAKP 63

Query: 61  IPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATND 120
           IPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATND
Sbjct: 64  IPTXXXXXXXXLPLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATND 123

Query: 121 VERRCTNYNCNTNFTPMWRKGPLGPK 147
           VERRCTNYNCNTNFTPMWRKGPLGPK
Sbjct: 124 VERRCTNYNCNTNFTPMWRKGPLGPK 149

BLAST of CsaV3_4G023740 vs. TrEMBL
Match: tr|A0A1R3K8P1|A0A1R3K8P1_9ROSI (Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_10405 PE=4 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 6.6e-11
Identity = 42/74 (56.76%), Positives = 47/74 (63.51%), Query Frame = 0

Query: 73  PLPKRGHNNDLYTLLNPASRVADDHEAEASSAGRRKGSRRRRVSATNDVERRCTNYNCNT 132
           P P  G  N    L  PASR AD     +S +GRR G R+R V+  ND  +RCTNYNC T
Sbjct: 155 PPPAPGPANTYVLLDVPASRRADGDVGSSSGSGRRGGRRQRGVN-YNDPNKRCTNYNCGT 214

Query: 133 NFTPMWRKGPLGPK 147
           N TPMWRKGPLGPK
Sbjct: 215 NNTPMWRKGPLGPK 227

BLAST of CsaV3_4G023740 vs. TrEMBL
Match: tr|A0A061GSP9|A0A061GSP9_THECC (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_040387 PE=4 SV=1)

HSP 1 Score: 75.1 bits (183), Expect = 1.5e-10
Identity = 53/132 (40.15%), Positives = 63/132 (47.73%), Query Frame = 0

Query: 22  NPHQIVWSKD---LKTYVLKSNNNFGQQLHPSSYPNTHQAKPIPTXXXXXXXXLPLPKRG 81
           N  Q  W  D      + +   N F Q   PS    ++               LP     
Sbjct: 94  NASQSAWPSDQLASNVHSMNHENTFNQISGPSMGCPSNSTNTYSNFAPTHHHQLPPSSVP 153

Query: 82  HNNDLYTLLN-PASRVADDHE---AEASSAGRRKGSRRRRVSATNDVERRCTNYNCNTNF 141
            NN  YTLL+ P  R AD  E   + AS  G+R G RR+R    ND  +RC+NYNCNTN 
Sbjct: 154 TNN--YTLLDVPPRRTADQRELGNSAASGLGKR-GQRRQRGGNYNDPNKRCSNYNCNTND 213

Query: 142 TPMWRKGPLGPK 147
           TPMWRKGPLGPK
Sbjct: 214 TPMWRKGPLGPK 222

BLAST of CsaV3_4G023740 vs. TrEMBL
Match: tr|B9SQ61|B9SQ61_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0147050 PE=4 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 1.6e-09
Identity = 58/147 (39.46%), Positives = 75/147 (51.02%), Query Frame = 0

Query: 11  NNEMMSEPNYN-NPHQIVWSKDLKTYVLKSNNN-FGQQLH------PSSYPNTHQAKPIP 70
           NN  + E N+N    QI W +  + Y+  +N++ +G          P S PN   A PI 
Sbjct: 6   NNPDIGEQNFNLTDSQIAWLR--QNYMNSTNHHRYGGGAEGSVVGIPISSPNYFNA-PI- 65

Query: 71  TXXXXXXXXLPLPKRGHNNDLYTLLNPA-SRVADDHEAEASSA--GRRKGSRRRRVSATN 130
                           H  + YTLL+    RVA   +   SS+   RR+ SRR+R  + N
Sbjct: 66  ----------------HTMNDYTLLDSTPRRVAHMEDVGGSSSINFRRRDSRRQRAGSYN 125

Query: 131 DVERRCTNYNCNTNFTPMWRKGPLGPK 147
           D  +RCTNYNCNTN TPMWRKGPLGPK
Sbjct: 126 DPTKRCTNYNCNTNDTPMWRKGPLGPK 132

BLAST of CsaV3_4G023740 vs. TrEMBL
Match: tr|A0A0D2TL01|A0A0D2TL01_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_009G070200 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 6.8e-08
Identity = 36/65 (55.38%), Positives = 42/65 (64.62%), Query Frame = 0

Query: 84  YTLLN-PASRVADDHEAE-ASSAGRRKGSRRRRVSATNDVERRCTNYNCNTNFTPMWRKG 143
           YTLL+ P  R A   + E  SS+G   G  +R     ND  +RCTNYNCNTN TPMWR+G
Sbjct: 137 YTLLDVPPRRAAQLQQREFESSSGLSLGQGQRGYGLYNDPNKRCTNYNCNTNDTPMWRRG 196

Query: 144 PLGPK 147
           PLGPK
Sbjct: 197 PLGPK 201

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name        E-value    Identity (%)    Description
XP_011654362.1    8.2e-74    100.00          PREDICTED: GATA transcription factor 29 [Cucumis sativus] >KGN54112.1 hypothetic...
XP_022985849.1    2.3e-44    70.75           GATA zinc finger domain-containing protein 8-like [Cucurbita maxima]
XP_022943672.1    8.6e-39    66.22           GATA zinc finger domain-containing protein 8-like [Cucurbita moschata]
XP_023522473.1    8.6e-39    66.22           GATA transcription factor 21-like [Cucurbita pepo subsp. pepo]
XP_023513295.1    8.6e-39    66.22           putative GATA transcription factor 22 [Cucurbita pepo subsp. pepo]

vs. TAIR10
Match Name        E-value    Identity (%)    Description
AT3G20750.1       1.3e-04    31.51           GATA transcription factor 29

vs. TrEMBL
Match Name                        E-value    Identity (%)    Description
tr|A0A0A0L0J0|A0A0A0L0J0_CUCSA    5.4e-74    100.00          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G286370 PE=4 SV=1
tr|A0A1R3K8P1|A0A1R3K8P1_9ROSI    6.6e-11    56.76           Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_10405 PE=4 SV=1
tr|A0A061GSP9|A0A061GSP9_THECC    1.5e-10    40.15           Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_040387 PE=4 SV=1
tr|B9SQ61|B9SQ61_RICCO            1.6e-09    39.46           Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0147050 PE=4 SV=1
tr|A0A0D2TL01|A0A0D2TL01_GOSRA    6.8e-08    55.38           Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_009G070200 PE=4 ...

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0006355        regulation of transcription, DNA-templated
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0005667        transcription factor complex
cellular_component    GO:0016021        integral component of membrane
molecular_function    GO:0003674        molecular_function
molecular_function    GO:0043565        sequence-specific DNA binding
molecular_function    GO:0003700        transcription factor activity, sequence-specific DNA binding
molecular_function    GO:0008270        zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CsaV3_4G023740.1    CsaV3_4G023740.1    mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term    IPR Description     Source         Source Term    Source Description     Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 42..75
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction    coord: 42..118

The following gene(s) are paralogous to this gene:

None