Sgr027271 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr027271
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: GATA transcription factor 21
Location: tig00153048: 2601949 .. 2603281 (-)
RNA-Seq Expression: Sgr027271
Synteny: Sgr027271
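
The location string above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3); the helper name `parse_location` and the regular expression are illustrative and not part of the database.

```python
import re

# Hypothetical helper: parses a "contig: start .. end (strand)" string.
# Assumption: coordinates are 1-based and inclusive, as in GFF3.
LOCATION_RE = re.compile(
    r"(?P<contig>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)


def parse_location(text: str) -> dict:
    """Return contig, start, end, strand and span length for a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "contig": match.group("contig"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Inclusive coordinates: both endpoints count towards the span.
        "length": end - start + 1,
    }


location = parse_location("tig00153048: 2601949 .. 2603281 (-)")
print(location["contig"], location["strand"], location["length"])
```

Under the inclusive-coordinate assumption the gene spans 1333 bp on the minus strand of contig tig00153048.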
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCCACCTTATCACCACTCCTTCCCCCCCGACCATCTCGATCATCTTCGCTATTCTTCCGATCACCTCTTCTTCCCCACCACCCTTCAAGCTTCTTCCTCTTCTTCTTCTTCTTCTCTTTCTTTCCTCGATCATGCCCACTCTCACGATTCTCACCCGCTAGAGCTCAAACAAGATCAGGTAAGCGAAGAGAGAGCAAAATTTTACCTTCTAACTTTATATATATATATATATATACACTCTTTCGATAGAAAAGGTATAAAATCTTTTAAGTATTTTATATATTTTTTAAGAAATTTATGATGGCCCCCTCTTACTTCTTCGCTCTAGGCTTACTTTTACGAACTTTTATCATCTGGGTCTCACTTGAAGATTGCTGGAAAAGAATTAAAAGTGTTAATTATTTGATATATATTGATCATGGTTTGATTTTCATCTTCCTAATTTTTTGTTTCTTCATAGGGTGGAGGGATCGTGCAGGCTTGTAATCACGATCAAACTGTTGGGAACGATCATCTGGAAACTGGGATTAGGTTTACTATTTGGAAGCAGATTGATAAATTAAGTGAAACTTCGAGCTGCTGCGAGAATATTAATGATTCGGTGAAGTGGGCCGCTTCGTCTTCTTCCAAGATGAGATTCGTGATAAATTCTAATCAAACGGAGACGCCCACCACCCAGATGATCGACGGCGGCCGGAATTTCCAAGATCTCAACCAGACGTCGCCGTCGCCGTCATCGTCACCGTCCGATCAGACCAATAAACGAAGCAGCTTAAACGACGGCGGCAGCAGCGCCGTCGTCATCCGGACCTGCTCCGACTGTAACACCACCAAAACTCCCCTCTGGAGGAGCGGACCCAGAGGTCCAAAGGTAATTCTGAAGAATATAAGATTTTCTTTTTTCTTAAATTCACTGCAAAACGATCTCTAGGGTTCGCTAATTTGGACGCATATACGTACTGATTTTCTATTTGGTGCAGTCACTTTGCAACGCCTGCGGAATCCGACAAAGGAAAGCGAGAAGAGCCATGGCGGCAGCGGCGGCGAACGGCCCCATTAATCCATGTGGGGGAAAGCCGCCGGCCGCAGTAGTTCTGAAAACCAACAAAGTGCAACACAAGATAATCAAGCCGGCCGCGGCGAACAGAAAGTGCAAAGACGTCGCCGGCGGCCGCGGCGGAGGGAGAAAAAAGCTTTGCTTCGAAGACATAACGATCAGCAAGCGATTGAGCGAGAGTTCATCTTCTTACCAACGAGTTTTCCCGCAAGACGAGAGAGAAGCCGCCATCTTGCTCATGACCCTATCTTATGGCCTTCTTCATGGTTGA

mRNA sequence

ATGGCTCCACCTTATCACCACTCCTTCCCCCCCGACCATCTCGATCATCTTCGCTATTCTTCCGATCACCTCTTCTTCCCCACCACCCTTCAAGCTTCTTCCTCTTCTTCTTCTTCTTCTCTTTCTTTCCTCGATCATGCCCACTCTCACGATTCTCACCCGCTAGAGCTCAAACAAGATCAGGGTGGAGGGATCGTGCAGGCTTGTAATCACGATCAAACTGTTGGGAACGATCATCTGGAAACTGGGATTAGGTTTACTATTTGGAAGCAGATTGATAAATTAAGTGAAACTTCGAGCTGCTGCGAGAATATTAATGATTCGGTGAAGTGGGCCGCTTCGTCTTCTTCCAAGATGAGATTCGTGATAAATTCTAATCAAACGGAGACGCCCACCACCCAGATGATCGACGGCGGCCGGAATTTCCAAGATCTCAACCAGACGTCGCCGTCGCCGTCATCGTCACCGTCCGATCAGACCAATAAACGAAGCAGCTTAAACGACGGCGGCAGCAGCGCCGTCGTCATCCGGACCTGCTCCGACTGTAACACCACCAAAACTCCCCTCTGGAGGAGCGGACCCAGAGGTCCAAAGTCACTTTGCAACGCCTGCGGAATCCGACAAAGGAAAGCGAGAAGAGCCATGGCGGCAGCGGCGGCGAACGGCCCCATTAATCCATGTGGGGGAAAGCCGCCGGCCGCAGTAGTTCTGAAAACCAACAAAGTGCAACACAAGATAATCAAGCCGGCCGCGGCGAACAGAAAGTGCAAAGACGTCGCCGGCGGCCGCGGCGGAGGGAGAAAAAAGCTTTGCTTCGAAGACATAACGATCAGCAAGCGATTGAGCGAGAGTTCATCTTCTTACCAACGAGTTTTCCCGCAAGACGAGAGAGAAGCCGCCATCTTGCTCATGACCCTATCTTATGGCCTTCTTCATGGTTGA

Coding sequence (CDS)

ATGGCTCCACCTTATCACCACTCCTTCCCCCCCGACCATCTCGATCATCTTCGCTATTCTTCCGATCACCTCTTCTTCCCCACCACCCTTCAAGCTTCTTCCTCTTCTTCTTCTTCTTCTCTTTCTTTCCTCGATCATGCCCACTCTCACGATTCTCACCCGCTAGAGCTCAAACAAGATCAGGGTGGAGGGATCGTGCAGGCTTGTAATCACGATCAAACTGTTGGGAACGATCATCTGGAAACTGGGATTAGGTTTACTATTTGGAAGCAGATTGATAAATTAAGTGAAACTTCGAGCTGCTGCGAGAATATTAATGATTCGGTGAAGTGGGCCGCTTCGTCTTCTTCCAAGATGAGATTCGTGATAAATTCTAATCAAACGGAGACGCCCACCACCCAGATGATCGACGGCGGCCGGAATTTCCAAGATCTCAACCAGACGTCGCCGTCGCCGTCATCGTCACCGTCCGATCAGACCAATAAACGAAGCAGCTTAAACGACGGCGGCAGCAGCGCCGTCGTCATCCGGACCTGCTCCGACTGTAACACCACCAAAACTCCCCTCTGGAGGAGCGGACCCAGAGGTCCAAAGTCACTTTGCAACGCCTGCGGAATCCGACAAAGGAAAGCGAGAAGAGCCATGGCGGCAGCGGCGGCGAACGGCCCCATTAATCCATGTGGGGGAAAGCCGCCGGCCGCAGTAGTTCTGAAAACCAACAAAGTGCAACACAAGATAATCAAGCCGGCCGCGGCGAACAGAAAGTGCAAAGACGTCGCCGGCGGCCGCGGCGGAGGGAGAAAAAAGCTTTGCTTCGAAGACATAACGATCAGCAAGCGATTGAGCGAGAGTTCATCTTCTTACCAACGAGTTTTCCCGCAAGACGAGAGAGAAGCCGCCATCTTGCTCATGACCCTATCTTATGGCCTTCTTCATGGTTGA

Protein sequence

MAPPYHHSFPPDHLDHLRYSSDHLFFPTTLQASSSSSSSSLSFLDHAHSHDSHPLELKQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSCCENINDSVKWAASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSSLNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCGGKPPAAVVLKTNKVQHKIIKPAAANRKCKDVAGGRGGGRKKLCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG
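
The relationship between the coding sequence and the protein sequence can be checked by translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is illustrative, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGGCTCCACCTTATCACCACTCCTTCCCC"
cds_end = "GGCCTTCTTCATGGTTGA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues, stop codon excluded
```

The two fragments translate to `MAPPYHHSFP` and `GLLHG`, which match the start and end of the protein sequence above. The same function can be applied to the full CDS.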
Homology
BLAST of Sgr027271 vs. NCBI nr
Match: XP_038878562.1 (GATA transcription factor 21 [Benincasa hispida])

HSP 1 Score: 366.7 bits (940), Expect = 2.0e-97
Identity = 227/342 (66.37%), Positives = 252/342 (73.68%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLD-HLRYSSD-HLFFPTTLQASSSSSSSSLSFLDHAHSHDSHPLELK 60
           MAPPY  SFP DH D  LRYSS  HLFFP T Q  SSSSSSSLSF    HS D   +ELK
Sbjct: 1   MAPPYRDSFPSDHDDLDLRYSSSHHLFFPITPQ-PSSSSSSSLSFPILDHSDDPRSIELK 60

Query: 61  QDQGGGIVQACNHDQTVGNDH---LETGIRFTIWKQIDKLSETSSCCEN------INDSV 120
            + GG  + ACN+DQ +GN+H   +ETG+RFTIWKQIDK  E+SSCCEN       ND V
Sbjct: 61  HEGGG--IMACNNDQIIGNNHEDDVETGLRFTIWKQIDK-RESSSCCENNNNNNTHNDLV 120

Query: 121 KW-AASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSS--L 180
           KW ++SSSSK++F+INSNQTET  T+ ID GRNFQDLNQTSP+PS S  DQTNKR+S  L
Sbjct: 121 KWSSSSSSSKIKFLINSNQTET-ATRTIDSGRNFQDLNQTSPTPSPSSFDQTNKRTSTAL 180

Query: 181 NDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINP 240
            DGG+   +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA AAA      
Sbjct: 181 QDGGA---IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA---- 240

Query: 241 CGGKPPAAVVLKTNK-VQHKIIKPAAA-------NRKCKDV-------AGGRGGGRKKLC 300
             G+ PAAVVLK+NK VQHKI+  +A         RKCKD         G  GGGRK LC
Sbjct: 241 ANGEKPAAVVLKSNKAVQHKIMTKSAVATTTTTLKRKCKDAVVQGEGGGGDSGGGRKNLC 300

Query: 301 FEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           FE+I I +RLSE SSSYQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 FEEIKIGRRLSEISSSYQRVFPQDEREAAILLMTLSYGLLHG 330
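
The identity percentage reported for an HSP is the number of identical aligned positions divided by the alignment length, gaps included. The sketch below reproduces the figures for the hit above and counts identities in its final alignment block; the function names are illustrative and are not part of BLAST.

```python
from typing import Tuple


def percent(matches: int, alignment_length: int) -> float:
    """Percentage of matching positions, rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


def count_identities(query: str, sbjct: str) -> Tuple[int, int]:
    """Count identical, non-gap columns in one aligned block.

    Both strings must be the aligned rows of the same block, with '-'
    marking gaps, so they have equal length.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# Figures from the HSP header of the Benincasa hispida hit.
print(percent(227, 342))  # identity
print(percent(252, 342))  # positives

# Last alignment block of the same hit.
query_block = "FEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG"
sbjct_block = "FEEIKIGRRLSEISSSYQRVFPQDEREAAILLMTLSYGLLHG"
print(count_identities(query_block, sbjct_block))
```

Summing the per-block counts over every block of an HSP should give the numerator and denominator shown in its header.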

BLAST of Sgr027271 vs. NCBI nr
Match: XP_004135818.1 (putative GATA transcription factor 22 [Cucumis sativus] >KGN66237.1 hypothetical protein Csa_007289 [Cucumis sativus])

HSP 1 Score: 345.5 bits (885), Expect = 4.8e-91
Identity = 226/344 (65.70%), Positives = 254/344 (73.84%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLD-HLRYSSD-HLFFPTTLQASSSSSSSSLSF--LDHAHSHD---SH 60
           MAPPY  SFP DH D  L YSS  HLFFP     +SSSSSSSLSF  LDH+   D   + 
Sbjct: 1   MAPPYRDSFPSDHDDLDLHYSSSHHLFFPILTPQASSSSSSSLSFTALDHSMISDDPLAR 60

Query: 61  PLELKQDQGGGIVQACNHDQTVGN--DHL-ETGIRFTIWKQIDKLSETSSCCEN------ 120
            +ELK +  GG++  CN+DQ++GN  DH+ ETG+RFTIWKQIDK  ETSSCCEN      
Sbjct: 61  SIELKHE--GGVIMGCNNDQSIGNHEDHMEETGLRFTIWKQIDK-RETSSCCENNNNDST 120

Query: 121 INDSVKW-AASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKR 180
            NDSVKW ++SSSSK++F+INSNQTET  T+ I+ GRN QDLN  SPSPSS   +QTNKR
Sbjct: 121 HNDSVKWSSSSSSSKIKFMINSNQTETTLTRTIESGRNVQDLN-NSPSPSS--FEQTNKR 180

Query: 181 SS---LNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM---AA 240
           +S   L+DGG+   +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM   AA
Sbjct: 181 TSTTTLHDGGA---IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA 240

Query: 241 AAANGPINPCGGKPPAAVVLKTNK-VQHKI-IKPAAA-NRKCKD---VAGG--RGGGRKK 300
           AAANG           AVV+KTNK VQHKI  KPA    RK KD   V GG  +GGGRKK
Sbjct: 241 AAANG----------GAVVVKTNKVVQHKITTKPATTLKRKYKDEVVVVGGDKKGGGRKK 300

Query: 301 LCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           LCFE+I +  RLSE SSSYQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 LCFEEIKMGGRLSEISSSYQRVFPQDEREAAILLMTLSYGLLHG 325

BLAST of Sgr027271 vs. NCBI nr
Match: XP_008450852.1 (PREDICTED: GATA transcription factor 21 [Cucumis melo])

HSP 1 Score: 339.0 bits (868), Expect = 4.5e-89
Identity = 226/354 (63.84%), Positives = 248/354 (70.06%), Query Frame = 0

Query: 1   MAPPYHHSFPPDH--LDHLRYSSD--HLFFPTTLQA-SSSSSSSSLSF--LDHAH-SHDS 60
           MAPPY  SFP DH  LDHL YSS   HLFFP    A +SSSSSSSLSF  LDH+  S D 
Sbjct: 1   MAPPYRDSFPSDHDDLDHLHYSSSHHHLFFPIVTPAQASSSSSSSLSFTALDHSMISDDP 60

Query: 61  HPLELKQDQGGGIVQACNHDQTVGN--DHL-ETGIRFTIWKQIDKLSETSSCCEN----- 120
             +ELK + GG  +  CN+DQ++GN  DH+ ETG+RFTIWKQIDK  ETSSCCEN     
Sbjct: 61  RSVELKHEGGG--IMGCNNDQSIGNHEDHIEETGLRFTIWKQIDK-RETSSCCENNNNDN 120

Query: 121 -INDSVKW--AASSSSKMRFVINSN-QTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQT 180
             NDSVKW  ++SSSSK++F+INSN QTET  T+ ID GRN QDLN  SPSPSS   +QT
Sbjct: 121 THNDSVKWSSSSSSSSKIKFMINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSS--IEQT 180

Query: 181 NKRSSLNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM---AA 240
           NKR+S         +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM   AA
Sbjct: 181 NKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA 240

Query: 241 AAANGPINPCGGKPPAAVVLKTNK-VQHKI-IKPA-------AANRKCKDV--------A 300
           AA NG           AVVLKTNK VQHKI  KPA       A  RK KD          
Sbjct: 241 AATNG----------GAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGG 300

Query: 301 GGRGGGRK-KLCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           G +GGGRK KLCFE+I +  RLSE SSSYQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 GDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAILLMTLSYGLLHG 339

BLAST of Sgr027271 vs. NCBI nr
Match: XP_022967871.1 (GATA transcription factor 21-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 313.5 bits (802), Expect = 2.0e-81
Identity = 203/333 (60.96%), Positives = 228/333 (68.47%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLDHLRY---SSD-HLFFPTT-LQASSSSSSSSLSFLDHAHSHDSHPL 60
           MAPPY  SFP +H + +RY   SSD HLFFPTT L +S SS  S   F D   S+  HP 
Sbjct: 1   MAPPYRDSFPSNHDNLIRYPSSSSDRHLFFPTTPLDSSPSSPLSFPLFPDLHRSNPDHPH 60

Query: 61  EL---KQDQGGGIVQACNHDQT-VGNDHLETGIRFTIWKQIDKLSETSSCCENINDSVKW 120
            L    Q+ GG     C +DQ    N  +ETG+ FTIWK     SETSS   N NDSVKW
Sbjct: 61  SLGFHHQEDGG--FMGCENDQVHESNQEVETGLSFTIWK-----SETSSNDHNHNDSVKW 120

Query: 121 ---AASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSSLND 180
              ++SSSSK+R VIN NQTET   + ID  RNFQDLN  SPSPS SPSDQTNKR++LND
Sbjct: 121 SSSSSSSSSKIRLVINYNQTET-LAKTIDAHRNFQDLNPMSPSPSPSPSDQTNKRNALND 180

Query: 181 GGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCG 240
           GG +  +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA AA         
Sbjct: 181 GGGA--IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA--------N 240

Query: 241 GKPPAAVVLKTNKVQHKIIKPAAA-NRKCKDVAGGR-------GGGRKKLCFEDITISKR 300
           G  P AVVLKTNK    IIKPAA   RK K+V           GGGR+KLC ED+ + +R
Sbjct: 241 GGNPTAVVLKTNKA---IIKPAATMKRKHKEVVAATTATTAAGGGGRRKLCVEDVKMGRR 300

Query: 301 LSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           L+E +S+YQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 LNEIASTYQRVFPQDEREAAILLMTLSYGLLHG 312

BLAST of Sgr027271 vs. NCBI nr
Match: KAG6588037.1 (GATA transcription factor 21, partial [Cucurbita argyrosperma subsp. sororia] >KAG7021934.1 GATA transcription factor 21, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 313.5 bits (802), Expect = 2.0e-81
Identity = 202/332 (60.84%), Positives = 223/332 (67.17%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLDHLRYSSD---HLFFPTT-LQASSSSSSSSLSFLDHAHSHDSHPLE 60
           MAPPY  SFP +H D LRYSS    HLFFPTT L +S SS  S   F D   S+  HP  
Sbjct: 1   MAPPYRDSFPSNHDDLLRYSSSSDRHLFFPTTPLDSSPSSPLSFPLFPDLHRSNPDHPHS 60

Query: 61  LKQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSCCENINDSVKW---AA 120
           L      G     +      N  +ETG+ FTIWK     SETSS   N NDSVKW   ++
Sbjct: 61  L------GFHHQEDDQVHESNQEVETGLSFTIWK-----SETSSNDHNHNDSVKWSSSSS 120

Query: 121 SSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSSLNDGGSSA 180
           SSSSK+R VIN NQTETP T+ ID  RNFQDLN  SPSPS SPSDQTNKR++LNDGG + 
Sbjct: 121 SSSSKIRLVINYNQTETP-TKTIDAHRNFQDLNPMSPSPSPSPSDQTNKRNTLNDGGGA- 180

Query: 181 VVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCGGKPPA 240
            +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA AA         G    
Sbjct: 181 -IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA--------NGGNST 240

Query: 241 AVVLKTNKVQHKIIKPAAA-NRKCKDV-----------AGGRGGGRKKLCFEDITISKRL 300
           AVVLKTNK    IIKPAA   RK K+V           +   GGGR+KLC ED+ + +RL
Sbjct: 241 AVVLKTNKA---IIKPAATMKRKHKEVVAATTTTAAAASAAGGGGRRKLCVEDVKMGRRL 300

Query: 301 SESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           SE SS+YQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 SEISSTYQRVFPQDEREAAILLMTLSYGLLHG 307

BLAST of Sgr027271 vs. ExPASy Swiss-Prot
Match: Q5HZ36 (GATA transcription factor 21 OS=Arabidopsis thaliana OX=3702 GN=GATA21 PE=1 SV=2)

HSP 1 Score: 137.9 bits (346), Expect = 2.0e-31
Identity = 134/379 (35.36%), Positives = 177/379 (46.70%), Query Frame = 0

Query: 16  HLRYSSDHLFFPTTLQASSSSSSSSLSFLD-----------------HA-HSHDSHPLEL 75
           H  +   H   P+   +SSSS SS  S+L                  HA H H S PL+ 
Sbjct: 33  HHHHHHHHHQVPSNSSSSSSSISSLSSYLPFLINSQEDQHVAYNNTYHADHLHLSQPLKA 92

Query: 76  KQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSCCEN----INDSVKWAA 135
           K     G   AC+H         ET ++ TI K+ D   +     +N     +DS KW  
Sbjct: 93  KMFVANGGSSACDHMV----PKKETRLKLTIRKK-DHEDQPHPLHQNPTKPDSDSDKWL- 152

Query: 136 SSSSKMRFVINSNQTETPTTQMIDGGRN---------------------FQDLN-QTSPS 195
             S KMR +    +T T   Q+ID   N                      +DLN +   +
Sbjct: 153 -MSPKMRLI---KKTITNNKQLIDQTNNNNHKESDHYPLNHKTNFDEDHHEDLNFKNVLT 212

Query: 196 PSSSPSDQTNKRSSLNDGG--SSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQR 255
             ++ +   N+ +++N+ G  ++  VIR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQR
Sbjct: 213 RKTTAATTENRYNTINENGYSNNNGVIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQR 272

Query: 256 KARR-AMAAAAANG-------------PI---------NPCGGK-----PPAAVVLKTNK 314
           KARR AMAAAAA G             P+            GG+     PP     K  K
Sbjct: 273 KARRAAMAAAAAAGDQEVAVAPRVQQLPLKKKLQNKKKRSNGGEKYNHSPPMVAKAKKCK 332

BLAST of Sgr027271 vs. ExPASy Swiss-Prot
Match: Q9SZI6 (Putative GATA transcription factor 22 OS=Arabidopsis thaliana OX=3702 GN=GATA22 PE=1 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 9.2e-29
Identity = 109/299 (36.45%), Positives = 143/299 (47.83%), Query Frame = 0

Query: 42  SFLDHAHSHDSHPLELKQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSC 101
           +F D   +H S PLE K     G   + + DQ V     ET ++ TI K+ +   +T   
Sbjct: 80  TFHDVLDTHISQPLETKNFVSDG--GSSSSDQMVPKK--ETRLKLTIKKKDNHQDQTDLP 139

Query: 102 CENIND-----SVKWAASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSP 161
              I D     S+KW    SSK+R +       T +        N Q  N ++       
Sbjct: 140 QSPIKDMTGTNSLKWI---SSKVRLMKKKKAIITTSDSSKQHTNNDQSSNLSN------- 199

Query: 162 SDQTNKRSSLNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA 221
                  S   +G ++  VIR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA  
Sbjct: 200 -------SERQNGYNNDCVIRICSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAM 259

Query: 222 AAAANGPINPCGGKPPAAVVLKTNKVQ-----HKIIKPAAAN-RKCKDVA---------- 281
           A A    ++  G  PP       NK +     +KI+ P       CK +           
Sbjct: 260 ATATATAVS--GVSPPVMKKKMQNKNKISNGVYKILSPLPLKVNTCKRMITLEETALAED 319

Query: 282 ------GGRGGGRKKLCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
                          + F+D+ +   L   SS+YQ+VFPQDE+EAAILLM LS+G++HG
Sbjct: 320 LETQSNSTMLSSSDNIYFDDLAL---LLSKSSAYQQVFPQDEKEAAILLMALSHGMVHG 352

BLAST of Sgr027271 vs. ExPASy Swiss-Prot
Match: Q6YW48 (Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 OS=Oryza sativa subsp. japonica OX=39947 GN=CGA1 PE=2 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 1.7e-19
Identity = 75/180 (41.67%), Positives = 90/180 (50.00%), Query Frame = 0

Query: 175 VIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANG---------PIN 234
           V+R CSDCNTTKTPLWRSGP GPKSLCNACGIRQRKARRAMAAAA  G            
Sbjct: 174 VVRVCSDCNTTKTPLWRSGPCGPKSLCNACGIRQRKARRAMAAAANGGAAVAPAKSVAAA 233

Query: 235 PCGGKPPAAVVLKTNKVQHKI------------------IKPAAANR------KCKD--- 294
           P   KP A    +   V   +                   KP AA        K +D   
Sbjct: 234 PVNNKPAAKKEKRAADVDRSLPFKKRCKMVDHVAAAVAATKPTAAGEVVAAAPKDQDHVI 293

Query: 295 VAGGRGGGRKKLCFEDITISK-----RLSESSSSYQRVFPQDE-REAAILLMTLSYGLLH 313
           V GG       +  ++  ISK       + +S ++    P+DE  +AA+LLMTLS GL+H
Sbjct: 294 VVGGENAAATSMPAQN-PISKAAATAAAAAASPAFFHGLPRDEITDAAMLLMTLSCGLVH 352

BLAST of Sgr027271 vs. ExPASy Swiss-Prot
Match: Q9FJ10 (GATA transcription factor 16 OS=Arabidopsis thaliana OX=3702 GN=GATA16 PE=2 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 4.9e-14
Identity = 57/141 (40.43%), Positives = 69/141 (48.94%), Query Frame = 0

Query: 177 RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCGGKPPAAVV 236
           +TC+DC T+KTPLWR GP GPKSLCNACGIR RK RR              GG       
Sbjct: 36  KTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRR--------------GG------- 95

Query: 237 LKTNKVQHKIIKPAAANRKCKDVAGGRGGGRK-----KLCFEDITISKRLSESSSSYQRV 296
                           N+K K  + G GG RK     K    D+ I KR   S+   QR 
Sbjct: 96  -------------TEDNKKLKKSSSG-GGNRKFGESLKQSLMDLGIRKR---STVEKQRQ 138

Query: 297 FPQDEREAAILLMTLSYGLLH 313
              +E +AA+LLM LSYG ++
Sbjct: 156 KLGEEEQAAVLLMALSYGSVY 138

BLAST of Sgr027271 vs. ExPASy Swiss-Prot
Match: Q8LC79 (GATA transcription factor 18 OS=Arabidopsis thaliana OX=3702 GN=GATA18 PE=1 SV=2)

HSP 1 Score: 70.1 bits (170), Expect = 5.1e-11
Identity = 43/96 (44.79%), Positives = 53/96 (55.21%), Query Frame = 0

Query: 141 NFQDLNQTSPSPSSS------PSDQTNKRS---------SLNDGGSSAVVIRTCSDCNTT 200
           NF DL  T  + S +      PS   NK S             GG  +++ R C++C+TT
Sbjct: 101 NFWDLIHTKNNNSKTAPYNNVPSFSANKPSRGCSGGGGGGGGGGGGDSLLARRCANCDTT 160

Query: 201 KTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAN 222
            TPLWR+GPRGPKSLCNACGIR +K  R   AA  N
Sbjct: 161 STPLWRNGPRGPKSLCNACGIRFKKEERRTTAATGN 196

BLAST of Sgr027271 vs. ExPASy TrEMBL
Match: A0A0A0LZE4 (GATA-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G587970 PE=4 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 2.3e-91
Identity = 226/344 (65.70%), Positives = 254/344 (73.84%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLD-HLRYSSD-HLFFPTTLQASSSSSSSSLSF--LDHAHSHD---SH 60
           MAPPY  SFP DH D  L YSS  HLFFP     +SSSSSSSLSF  LDH+   D   + 
Sbjct: 1   MAPPYRDSFPSDHDDLDLHYSSSHHLFFPILTPQASSSSSSSLSFTALDHSMISDDPLAR 60

Query: 61  PLELKQDQGGGIVQACNHDQTVGN--DHL-ETGIRFTIWKQIDKLSETSSCCEN------ 120
            +ELK +  GG++  CN+DQ++GN  DH+ ETG+RFTIWKQIDK  ETSSCCEN      
Sbjct: 61  SIELKHE--GGVIMGCNNDQSIGNHEDHMEETGLRFTIWKQIDK-RETSSCCENNNNDST 120

Query: 121 INDSVKW-AASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKR 180
            NDSVKW ++SSSSK++F+INSNQTET  T+ I+ GRN QDLN  SPSPSS   +QTNKR
Sbjct: 121 HNDSVKWSSSSSSSKIKFMINSNQTETTLTRTIESGRNVQDLN-NSPSPSS--FEQTNKR 180

Query: 181 SS---LNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM---AA 240
           +S   L+DGG+   +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM   AA
Sbjct: 181 TSTTTLHDGGA---IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA 240

Query: 241 AAANGPINPCGGKPPAAVVLKTNK-VQHKI-IKPAAA-NRKCKD---VAGG--RGGGRKK 300
           AAANG           AVV+KTNK VQHKI  KPA    RK KD   V GG  +GGGRKK
Sbjct: 241 AAANG----------GAVVVKTNKVVQHKITTKPATTLKRKYKDEVVVVGGDKKGGGRKK 300

Query: 301 LCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           LCFE+I +  RLSE SSSYQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 LCFEEIKMGGRLSEISSSYQRVFPQDEREAAILLMTLSYGLLHG 325

BLAST of Sgr027271 vs. ExPASy TrEMBL
Match: A0A1S3BPL1 (GATA transcription factor 21 OS=Cucumis melo OX=3656 GN=LOC103492321 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 2.2e-89
Identity = 226/354 (63.84%), Positives = 248/354 (70.06%), Query Frame = 0

Query: 1   MAPPYHHSFPPDH--LDHLRYSSD--HLFFPTTLQA-SSSSSSSSLSF--LDHAH-SHDS 60
           MAPPY  SFP DH  LDHL YSS   HLFFP    A +SSSSSSSLSF  LDH+  S D 
Sbjct: 1   MAPPYRDSFPSDHDDLDHLHYSSSHHHLFFPIVTPAQASSSSSSSLSFTALDHSMISDDP 60

Query: 61  HPLELKQDQGGGIVQACNHDQTVGN--DHL-ETGIRFTIWKQIDKLSETSSCCEN----- 120
             +ELK + GG  +  CN+DQ++GN  DH+ ETG+RFTIWKQIDK  ETSSCCEN     
Sbjct: 61  RSVELKHEGGG--IMGCNNDQSIGNHEDHIEETGLRFTIWKQIDK-RETSSCCENNNNDN 120

Query: 121 -INDSVKW--AASSSSKMRFVINSN-QTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQT 180
             NDSVKW  ++SSSSK++F+INSN QTET  T+ ID GRN QDLN  SPSPSS   +QT
Sbjct: 121 THNDSVKWSSSSSSSSKIKFMINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSS--IEQT 180

Query: 181 NKRSSLNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM---AA 240
           NKR+S         +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM   AA
Sbjct: 181 NKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA 240

Query: 241 AAANGPINPCGGKPPAAVVLKTNK-VQHKI-IKPA-------AANRKCKDV--------A 300
           AA NG           AVVLKTNK VQHKI  KPA       A  RK KD          
Sbjct: 241 AATNG----------GAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGG 300

Query: 301 GGRGGGRK-KLCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           G +GGGRK KLCFE+I +  RLSE SSSYQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 GDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAILLMTLSYGLLHG 339

BLAST of Sgr027271 vs. ExPASy TrEMBL
Match: A0A6J1HT96 (GATA transcription factor 21-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467247 PE=4 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 9.7e-82
Identity = 203/333 (60.96%), Positives = 228/333 (68.47%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLDHLRY---SSD-HLFFPTT-LQASSSSSSSSLSFLDHAHSHDSHPL 60
           MAPPY  SFP +H + +RY   SSD HLFFPTT L +S SS  S   F D   S+  HP 
Sbjct: 1   MAPPYRDSFPSNHDNLIRYPSSSSDRHLFFPTTPLDSSPSSPLSFPLFPDLHRSNPDHPH 60

Query: 61  EL---KQDQGGGIVQACNHDQT-VGNDHLETGIRFTIWKQIDKLSETSSCCENINDSVKW 120
            L    Q+ GG     C +DQ    N  +ETG+ FTIWK     SETSS   N NDSVKW
Sbjct: 61  SLGFHHQEDGG--FMGCENDQVHESNQEVETGLSFTIWK-----SETSSNDHNHNDSVKW 120

Query: 121 ---AASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSSLND 180
              ++SSSSK+R VIN NQTET   + ID  RNFQDLN  SPSPS SPSDQTNKR++LND
Sbjct: 121 SSSSSSSSSKIRLVINYNQTET-LAKTIDAHRNFQDLNPMSPSPSPSPSDQTNKRNALND 180

Query: 181 GGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCG 240
           GG +  +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA AA         
Sbjct: 181 GGGA--IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA--------N 240

Query: 241 GKPPAAVVLKTNKVQHKIIKPAAA-NRKCKDVAGGR-------GGGRKKLCFEDITISKR 300
           G  P AVVLKTNK    IIKPAA   RK K+V           GGGR+KLC ED+ + +R
Sbjct: 241 GGNPTAVVLKTNKA---IIKPAATMKRKHKEVVAATTATTAAGGGGRRKLCVEDVKMGRR 300

Query: 301 LSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           L+E +S+YQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 LNEIASTYQRVFPQDEREAAILLMTLSYGLLHG 312

BLAST of Sgr027271 vs. ExPASy TrEMBL
Match: A0A6J1ELP1 (GATA transcription factor 21-like OS=Cucurbita moschata OX=3662 GN=LOC111433707 PE=4 SV=1)

HSP 1 Score: 312.8 bits (800), Expect = 1.7e-81
Identity = 202/334 (60.48%), Positives = 223/334 (66.77%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLDHLRYSSD---HLFFPTT-LQASSSSSSSSLSFLDHAHSHDSHPLE 60
           MAPPY  SFP +H D LRYSS    HLFFPTT L +S SS  S   F D   S+  HP  
Sbjct: 1   MAPPYRDSFPSNHDDLLRYSSSSDRHLFFPTTPLDSSPSSPLSFPLFPDLHRSNPDHPHS 60

Query: 61  LKQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSCCENINDSVKW---AA 120
           L      G     +      N  +ETG+ FTIWK     SETSS   N NDSVKW   ++
Sbjct: 61  L------GFHHQEDDQVHESNQEVETGLSFTIWK-----SETSSNDHNHNDSVKWSSSSS 120

Query: 121 SSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSSLNDGGSSA 180
           SSSSK+R VIN NQTETP T+ ID  RNFQDLN  SPSPS SPSDQTNKR++LNDGG + 
Sbjct: 121 SSSSKIRLVINYNQTETP-TKTIDAHRNFQDLNPMSPSPSPSPSDQTNKRNTLNDGGGA- 180

Query: 181 VVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCGGKPPA 240
            +IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA AA         G    
Sbjct: 181 -IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA--------NGGNST 240

Query: 241 AVVLKTNKVQHKIIKPAAA-NRKCKDV-------------AGGRGGGRKKLCFEDITISK 300
           AVVLKTNK    IIKPAA   RK K+V             +   GGGR+KLC ED+ + +
Sbjct: 241 AVVLKTNKA---IIKPAATMKRKHKEVVAATTTTAAAAAASAAGGGGRRKLCVEDVKMGR 300

Query: 301 RLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
           RLSE SS+YQRVFPQDEREAAILLMTLSYGLLHG
Sbjct: 301 RLSEISSTYQRVFPQDEREAAILLMTLSYGLLHG 309

BLAST of Sgr027271 vs. ExPASy TrEMBL
Match: A0A6J1D336 (GATA transcription factor 21-like OS=Momordica charantia OX=3673 GN=LOC111016549 PE=4 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 1.8e-80
Identity = 213/324 (65.74%), Positives = 230/324 (70.99%), Query Frame = 0

Query: 1   MAPPYHHSFPPDHLDHLRYSSDHLFFPTTLQASSSSSSSSLSF---LDHA-HSHD-SHPL 60
           MAPPY +SFP  H     +  DHLFFPTT Q  SSSSSS+LSF   LDHA HS D SHPL
Sbjct: 1   MAPPYQNSFPSHH----HHDLDHLFFPTTPQ-PSSSSSSTLSFPLHLDHATHSDDHSHPL 60

Query: 61  ELKQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSCCENINDSVKWAASS 120
            L+QDQGG IV  CN D     D +ETG+RFTIWKQIDK    SS  +++  ++KW  SS
Sbjct: 61  NLRQDQGGRIV-GCNSD-----DGVETGLRFTIWKQIDK--SGSSIDDDL--ALKW-DSS 120

Query: 121 SSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQTNKRSSLNDGGSSAVV 180
           SS +R VINSN TETP    I+GGRNFQDLN     PS S  DQTNKR +LN   +++V 
Sbjct: 121 SSNIRMVINSNPTETPAP--INGGRNFQDLN-----PSLS-FDQTNKRRTLNVDDNNSVT 180

Query: 181 IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAM------AAAAANGPINPCGG 240
            RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA        AAAANG I P GG
Sbjct: 181 -RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAAAAEEAAAANGTI-PYGG 240

Query: 241 KPPAAVVLKTNK--VQHKIIKPAAA-NRKCKDVAGGR-GGGRKKLCFEDITISKRLSESS 300
           KP AAVVLK NK  VQHKIIKPAA   RKCKDVA GR GGGRKKL FED           
Sbjct: 241 KPAAAVVLKPNKTAVQHKIIKPAATLKRKCKDVAAGRGGGGRKKLHFED----------- 287

Query: 301 SSYQRVFPQDEREAAILLMTLSYG 310
           SSYQRVFP DEREAAILLMTLSYG
Sbjct: 301 SSYQRVFPPDEREAAILLMTLSYG 287

BLAST of Sgr027271 vs. TAIR 10
Match: AT5G56860.1 (GATA type zinc finger transcription factor family protein )

HSP 1 Score: 137.9 bits (346), Expect = 1.4e-32
Identity = 134/379 (35.36%), Positives = 177/379 (46.70%), Query Frame = 0

Query: 16  HLRYSSDHLFFPTTLQASSSSSSSSLSFLD-----------------HA-HSHDSHPLEL 75
           H  +   H   P+   +SSSS SS  S+L                  HA H H S PL+ 
Sbjct: 33  HHHHHHHHHQVPSNSSSSSSSISSLSSYLPFLINSQEDQHVAYNNTYHADHLHLSQPLKA 92

Query: 76  KQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSCCEN----INDSVKWAA 135
           K     G   AC+H         ET ++ TI K+ D   +     +N     +DS KW  
Sbjct: 93  KMFVANGGSSACDHMV----PKKETRLKLTIRKK-DHEDQPHPLHQNPTKPDSDSDKWL- 152

Query: 136 SSSSKMRFVINSNQTETPTTQMIDGGRN---------------------FQDLN-QTSPS 195
             S KMR +    +T T   Q+ID   N                      +DLN +   +
Sbjct: 153 -MSPKMRLI---KKTITNNKQLIDQTNNNNHKESDHYPLNHKTNFDEDHHEDLNFKNVLT 212

Query: 196 PSSSPSDQTNKRSSLNDGG--SSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQR 255
             ++ +   N+ +++N+ G  ++  VIR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQR
Sbjct: 213 RKTTAATTENRYNTINENGYSNNNGVIRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQR 272

Query: 256 KARR-AMAAAAANG-------------PI---------NPCGGK-----PPAAVVLKTNK 314
           KARR AMAAAAA G             P+            GG+     PP     K  K
Sbjct: 273 KARRAAMAAAAAAGDQEVAVAPRVQQLPLKKKLQNKKKRSNGGEKYNHSPPMVAKAKKCK 332

BLAST of Sgr027271 vs. TAIR 10
Match: AT4G26150.1 (cytokinin-responsive gata factor 1 )

HSP 1 Score: 129.0 bits (323), Expect = 6.5e-30
Identity = 109/299 (36.45%), Positives = 143/299 (47.83%), Query Frame = 0

Query: 42  SFLDHAHSHDSHPLELKQDQGGGIVQACNHDQTVGNDHLETGIRFTIWKQIDKLSETSSC 101
           +F D   +H S PLE K     G   + + DQ V     ET ++ TI K+ +   +T   
Sbjct: 80  TFHDVLDTHISQPLETKNFVSDG--GSSSSDQMVPKK--ETRLKLTIKKKDNHQDQTDLP 139

Query: 102 CENIND-----SVKWAASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSP 161
              I D     S+KW    SSK+R +       T +        N Q  N ++       
Sbjct: 140 QSPIKDMTGTNSLKWI---SSKVRLMKKKKAIITTSDSSKQHTNNDQSSNLSN------- 199

Query: 162 SDQTNKRSSLNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMA 221
                  S   +G ++  VIR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA  
Sbjct: 200 -------SERQNGYNNDCVIRICSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAM 259

Query: 222 AAAANGPINPCGGKPPAAVVLKTNKVQ-----HKIIKPAAAN-RKCKDVA---------- 281
           A A    ++  G  PP       NK +     +KI+ P       CK +           
Sbjct: 260 ATATATAVS--GVSPPVMKKKMQNKNKISNGVYKILSPLPLKVNTCKRMITLEETALAED 319

Query: 282 ------GGRGGGRKKLCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG 314
                          + F+D+ +   L   SS+YQ+VFPQDE+EAAILLM LS+G++HG
Sbjct: 320 LETQSNSTMLSSSDNIYFDDLAL---LLSKSSAYQQVFPQDEKEAAILLMALSHGMVHG 352

BLAST of Sgr027271 vs. TAIR 10
Match: AT5G49300.1 (GATA transcription factor 16 )

HSP 1 Score: 80.1 bits (196), Expect = 3.5e-15
Identity = 57/141 (40.43%), Positives = 69/141 (48.94%), Query Frame = 0

Query: 177 RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCGGKPPAAVV 236
           +TC+DC T+KTPLWR GP GPKSLCNACGIR RK RR              GG       
Sbjct: 36  KTCADCGTSKTPLWRGGPVGPKSLCNACGIRNRKKRR--------------GG------- 95

Query: 237 LKTNKVQHKIIKPAAANRKCKDVAGGRGGGRK-----KLCFEDITISKRLSESSSSYQRV 296
                           N+K K  + G GG RK     K    D+ I KR   S+   QR 
Sbjct: 96  -------------TEDNKKLKKSSSG-GGNRKFGESLKQSLMDLGIRKR---STVEKQRQ 138

Query: 297 FPQDEREAAILLMTLSYGLLH 313
              +E +AA+LLM LSYG ++
Sbjct: 156 KLGEEEQAAVLLMALSYGSVY 138

BLAST of Sgr027271 vs. TAIR 10
Match: AT5G26930.1 (GATA transcription factor 23 )

HSP 1 Score: 71.6 bits (174), Expect = 1.2e-12
Identity = 30/39 (76.92%), Positives = 33/39 (84.62%), Query Frame = 0

Query: 176 IRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA 215
           IR CS+C TTKTP+WR GP GPKSLCNACGIR RK RR+
Sbjct: 25  IRCCSECKTTKTPMWRGGPTGPKSLCNACGIRHRKQRRS 63

BLAST of Sgr027271 vs. TAIR 10
Match: AT3G50870.1 (GATA type zinc finger transcription factor family protein )

HSP 1 Score: 70.1 bits (170), Expect = 3.6e-12
Identity = 43/96 (44.79%), Positives = 53/96 (55.21%), Query Frame = 0

Query: 141 NFQDLNQTSPSPSSS------PSDQTNKRS---------SLNDGGSSAVVIRTCSDCNTT 200
           NF DL  T  + S +      PS   NK S             GG  +++ R C++C+TT
Sbjct: 101 NFWDLIHTKNNNSKTAPYNNVPSFSANKPSRGCSGGGGGGGGGGGGDSLLARRCANCDTT 160

Query: 201 KTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAN 222
            TPLWR+GPRGPKSLCNACGIR +K  R   AA  N
Sbjct: 161 STPLWRNGPRGPKSLCNACGIRFKKEERRTTAATGN 196

The following BLAST results are available for this feature:

Database: NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038878562.1 | 2.0e-97 | 66.37 | GATA transcription factor 21 [Benincasa hispida]
XP_004135818.1 | 4.8e-91 | 65.70 | putative GATA transcription factor 22 [Cucumis sativus] >KGN66237.1 hypothetical...
XP_008450852.1 | 4.5e-89 | 63.84 | PREDICTED: GATA transcription factor 21 [Cucumis melo]
XP_022967871.1 | 2.0e-81 | 60.96 | GATA transcription factor 21-like isoform X1 [Cucurbita maxima]
KAG6588037.1 | 2.0e-81 | 60.84 | GATA transcription factor 21, partial [Cucurbita argyrosperma subsp. sororia] >K...

Database: ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q5HZ36 | 2.0e-31 | 35.36 | GATA transcription factor 21 OS=Arabidopsis thaliana OX=3702 GN=GATA21 PE=1 SV=2
Q9SZI6 | 9.2e-29 | 36.45 | Putative GATA transcription factor 22 OS=Arabidopsis thaliana OX=3702 GN=GATA22 ...
Q6YW48 | 1.7e-19 | 41.67 | Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 1 OS=Oryza sativa subsp. ...
Q9FJ10 | 4.9e-14 | 40.43 | GATA transcription factor 16 OS=Arabidopsis thaliana OX=3702 GN=GATA16 PE=2 SV=1
Q8LC79 | 5.1e-11 | 44.79 | GATA transcription factor 18 OS=Arabidopsis thaliana OX=3702 GN=GATA18 PE=1 SV=2

Database: ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0LZE4 | 2.3e-91 | 65.70 | GATA-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G587970 P...
A0A1S3BPL1 | 2.2e-89 | 63.84 | GATA transcription factor 21 OS=Cucumis melo OX=3656 GN=LOC103492321 PE=4 SV=1
A0A6J1HT96 | 9.7e-82 | 60.96 | GATA transcription factor 21-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A6J1ELP1 | 1.7e-81 | 60.48 | GATA transcription factor 21-like OS=Cucurbita moschata OX=3662 GN=LOC111433707 ...
A0A6J1D336 | 1.8e-80 | 65.74 | GATA transcription factor 21-like OS=Momordica charantia OX=3673 GN=LOC111016549...

Database: TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G56860.1 | 1.4e-32 | 35.36 | GATA type zinc finger transcription factor family protein
AT4G26150.1 | 6.5e-30 | 36.45 | cytokinin-responsive gata factor 1
AT5G49300.1 | 3.5e-15 | 40.43 | GATA transcription factor 16
AT5G26930.1 | 1.2e-12 | 76.92 | GATA transcription factor 23
AT3G50870.1 | 3.6e-12 | 44.79 | GATA type zinc finger transcription factor family protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000679 | Zinc finger, GATA-type | SMART | SM00401 | GATA_3 | coord: 173..224; e-value: 4.9E-20; score: 82.6
IPR000679 | Zinc finger, GATA-type | PFAM | PF00320 | GATA | coord: 179..212; e-value: 5.0E-17; score: 61.2
IPR000679 | Zinc finger, GATA-type | PROSITE | PS00344 | GATA_ZN_FINGER_1 | coord: 179..204
IPR000679 | Zinc finger, GATA-type | PROSITE | PS50114 | GATA_ZN_FINGER_2 | coord: 177..209; score: 12.650496
IPR000679 | Zinc finger, GATA-type | CDD | cd00202 | ZnF_GATA | coord: 178..206; e-value: 7.71521E-12; score: 57.3826
IPR013088 | Zinc finger, NHR/GATA-type | GENE3D | 3.30.50.10 | - | coord: 176..255; e-value: 3.2E-16; score: 61.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 129..170
None | No IPR available | PANTHER | PTHR47255 | GATA TRANSCRIPTION FACTOR 22-RELATED | coord: 1..313
None | No IPR available | PANTHER | PTHR47255:SF10 | GATA TRANSCRIPTION FACTOR 21-LIKE | coord: 1..313
None | No IPR available | SUPERFAMILY | 57716 | Glucocorticoid receptor-like (DNA-binding domain) | coord: 176..216
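
The alignment coordinates in the InterPro table refer to positions in the protein sequence. The sketch below extracts the Pfam GATA region (179..212) and checks the PROSITE PS00344 span (179..204) against a simplified zinc-finger motif. It assumes the coordinates are 1-based and inclusive; the helper `extract_region` and the pattern `C..C.{18}C..C` are illustrative and are not the PROSITE or Pfam definitions themselves.

```python
import re

# Protein sequence of Sgr027271, copied from the record above.
PROTEIN = (
    "MAPPYHHSFPPDHLDHLRYSSDHLFFPTTLQASSSSSSSSLSFLDHAHSHDSHPLELKQDQGGGIVQACNHDQTVGNDHL"
    "ETGIRFTIWKQIDKLSETSSCCENINDSVKWAASSSSKMRFVINSNQTETPTTQMIDGGRNFQDLNQTSPSPSSSPSDQT"
    "NKRSSLNDGGSSAVVIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAANGPINPCGGKPPAAVVLKTN"
    "KVQHKIIKPAAANRKCKDVAGGRGGGRKKLCFEDITISKRLSESSSSYQRVFPQDEREAAILLMTLSYGLLHG"
)


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Pfam PF00320 (GATA), coord: 179..212 in the table above.
pfam_gata = extract_region(PROTEIN, 179, 212)
print(pfam_gata)

# Simplified motif (an assumption for illustration): four cysteines
# spaced as C-X2-C-X18-C-X2-C.
ZINC_FINGER_RE = re.compile(r"C.{2}C.{18}C.{2}C")
match = ZINC_FINGER_RE.search(PROTEIN)
if match is not None:
    # Convert the 0-based match span back to 1-based inclusive coordinates.
    print(match.start() + 1, match.end())
```

The motif match covers residues 179 to 204, the same span reported for PROSITE PS00344, which is consistent with reading the table coordinates as 1-based and inclusive.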

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr027271.1 | Sgr027271.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0008270 | zinc ion binding