HG10023533 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10023533
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGATA transcription factor
LocationChr05: 35072601 .. 35073754 (+)
RNA-Seq ExpressionHG10023533
SyntenyHG10023533
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGTGAACAAGTTTCTGATCGGTGGATATTTCGACGGTGGAGTGGGAGAATTTTCGCCGGAGAAGATGAAGGCTGCGGAGCATTTCACCATTGATGACCTACTTGACTTTTCAAATGAAGATGCAATAATGACAGACGGTTTCTTCGATAATGTGGCGGGAAGTTCGACGGATTGCTCAACTGTTACTGCAGTTGATAGCTGTAATTCCTCTGTCTCTGGTGGTGATCATCATCATTTCCATGGAAATATCGGCTCTCGGAGTTTCGACGAGTCTCAATTCTCCGGAGACCTTTGCGTTCCGGTAATGTCTCGCATTTAATCAGTTTCTACTGTTCTTCAGAACTTCAATTCAGTTGATTGAACTCTGAAATTTAAAATTCATTGCGAGGGTTGAAACAATTTGAAGTTAGATTAGTTAGATTCTGATTCAATTTCTGGATTCTACGTTTACAGTGCGATGATTTAGCGGAACTCGAATGGCTCTCGAACTTCGTTGAAGATTCATTCTCCACGGAAGGGAAAGATCTTCAAACGCTTAATTACCTCTCTAATAGTCATTCAATTTCCAAGCCTCAAACTCCAGAAACTTCATCTTCCTCGGAATTACCGCCTTCCGTATCAATTCCCTCCGATTCCTCCAAGAACTCGCCGCGTTTCCCCGCCGAAACGCCGCTCCCTTGCAAAGCTCGAAGCAAGCGATCGCGAACCGCTCCTTGTGACTGGACCACACGCCTCCTCCACCTCCTCTCTCCGGCTGATCCTAAACCGCCGAAATCATCTTCGTCGAAGAAGAAAGACGCTCCGAACGGAGATTCCTCCGGCCGGAAATGCTTGCATTGTCAGTCGGAGAAGACTCCTCAATGGCGGACCGGACCTATGGGCCCTAAAACACTCTGCAACGCTTGTGGCGTCCGGTACAAGTCCGGACGGTTAGTTCCGGAGTATCGGCCGGCAGCGAGTCCGACGTTCATATCGGCGAAACACTCGAATTCTCACCGGAAAGTTCTGGAGCTGAGAAGGCAGAAGGAGCTTCAAATTGCGCAACAGCAACAGTTCATGAATCAGAGTTCAATTTTCGGAGTAACGAACGGTTGTGATGAGTACTTGATTTCTCATCACATGGGCCCCAATGTTAGGCATATGATCTAG

mRNA sequence

ATGGAGGTGAACAAGTTTCTGATCGGTGGATATTTCGACGGTGGAGTGGGAGAATTTTCGCCGGAGAAGATGAAGGCTGCGGAGCATTTCACCATTGATGACCTACTTGACTTTTCAAATGAAGATGCAATAATGACAGACGGTTTCTTCGATAATGTGGCGGGAAGTTCGACGGATTGCTCAACTGTTACTGCAGTTGATAGCTGTAATTCCTCTGTCTCTGGTGGTGATCATCATCATTTCCATGGAAATATCGGCTCTCGGAGTTTCGACGAGTCTCAATTCTCCGGAGACCTTTGCGTTCCGTGCGATGATTTAGCGGAACTCGAATGGCTCTCGAACTTCGTTGAAGATTCATTCTCCACGGAAGGGAAAGATCTTCAAACGCTTAATTACCTCTCTAATAGTCATTCAATTTCCAAGCCTCAAACTCCAGAAACTTCATCTTCCTCGGAATTACCGCCTTCCGTATCAATTCCCTCCGATTCCTCCAAGAACTCGCCGCGTTTCCCCGCCGAAACGCCGCTCCCTTGCAAAGCTCGAAGCAAGCGATCGCGAACCGCTCCTTGTGACTGGACCACACGCCTCCTCCACCTCCTCTCTCCGGCTGATCCTAAACCGCCGAAATCATCTTCGTCGAAGAAGAAAGACGCTCCGAACGGAGATTCCTCCGGCCGGAAATGCTTGCATTGTCAGTCGGAGAAGACTCCTCAATGGCGGACCGGACCTATGGGCCCTAAAACACTCTGCAACGCTTGTGGCGTCCGGTACAAGTCCGGACGGTTAGTTCCGGAGTATCGGCCGGCAGCGAGTCCGACGTTCATATCGGCGAAACACTCGAATTCTCACCGGAAAGTTCTGGAGCTGAGAAGGCAGAAGGAGCTTCAAATTGCGCAACAGCAACAGTTCATGAATCAGAGTTCAATTTTCGGAGTAACGAACGGTTGTGATGAGTACTTGATTTCTCATCACATGGGCCCCAATGTTAGGCATATGATCTAG

Coding sequence (CDS)

ATGGAGGTGAACAAGTTTCTGATCGGTGGATATTTCGACGGTGGAGTGGGAGAATTTTCGCCGGAGAAGATGAAGGCTGCGGAGCATTTCACCATTGATGACCTACTTGACTTTTCAAATGAAGATGCAATAATGACAGACGGTTTCTTCGATAATGTGGCGGGAAGTTCGACGGATTGCTCAACTGTTACTGCAGTTGATAGCTGTAATTCCTCTGTCTCTGGTGGTGATCATCATCATTTCCATGGAAATATCGGCTCTCGGAGTTTCGACGAGTCTCAATTCTCCGGAGACCTTTGCGTTCCGTGCGATGATTTAGCGGAACTCGAATGGCTCTCGAACTTCGTTGAAGATTCATTCTCCACGGAAGGGAAAGATCTTCAAACGCTTAATTACCTCTCTAATAGTCATTCAATTTCCAAGCCTCAAACTCCAGAAACTTCATCTTCCTCGGAATTACCGCCTTCCGTATCAATTCCCTCCGATTCCTCCAAGAACTCGCCGCGTTTCCCCGCCGAAACGCCGCTCCCTTGCAAAGCTCGAAGCAAGCGATCGCGAACCGCTCCTTGTGACTGGACCACACGCCTCCTCCACCTCCTCTCTCCGGCTGATCCTAAACCGCCGAAATCATCTTCGTCGAAGAAGAAAGACGCTCCGAACGGAGATTCCTCCGGCCGGAAATGCTTGCATTGTCAGTCGGAGAAGACTCCTCAATGGCGGACCGGACCTATGGGCCCTAAAACACTCTGCAACGCTTGTGGCGTCCGGTACAAGTCCGGACGGTTAGTTCCGGAGTATCGGCCGGCAGCGAGTCCGACGTTCATATCGGCGAAACACTCGAATTCTCACCGGAAAGTTCTGGAGCTGAGAAGGCAGAAGGAGCTTCAAATTGCGCAACAGCAACAGTTCATGAATCAGAGTTCAATTTTCGGAGTAACGAACGGTTGTGATGAGTACTTGATTTCTCATCACATGGGCCCCAATGTTAGGCATATGATCTAG

Protein sequence

MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQQQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI
Homology
BLAST of HG10023533 vs. NCBI nr
Match: XP_038899153.1 (GATA transcription factor 9-like [Benincasa hispida])

HSP 1 Score: 644.8 bits (1662), Expect = 4.0e-181
Identity = 320/333 (96.10%), Postives = 325/333 (97.60%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           ME+NKFLIGGYFDGGVGEFSPEK KAAEHFTIDDLLDFSNEDAIMTDGFFD VAGSSTD 
Sbjct: 1   MELNKFLIGGYFDGGVGEFSPEKTKAAEHFTIDDLLDFSNEDAIMTDGFFDYVAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSN HSISKPQTPETSSSSE+PPSVSIPSDSSKNSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQALNYLSNGHSISKPQTPETSSSSEVPPSVSIPSDSSKNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSR APCDWTTRLLHLLSPAD KPPKSSSSKKKDA NGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRIAPCDWTTRLLHLLSPADSKPPKSSSSKKKDASNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQSSIFGVTNGCDEYLISHHMGP+VRHMI
Sbjct: 301 QQFINQSSIFGVTNGCDEYLISHHMGPSVRHMI 333

BLAST of HG10023533 vs. NCBI nr
Match: KAA0051035.1 (GATA transcription factor 9-like [Cucumis melo var. makuwa] >TYK03830.1 GATA transcription factor 9-like [Cucumis melo var. makuwa])

HSP 1 Score: 644.4 bits (1661), Expect = 5.3e-181
Identity = 319/333 (95.80%), Postives = 324/333 (97.30%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFDGGVGEFS E  KAAEHFTIDDLLDFSNED IMTDG FDNVAGSSTD 
Sbjct: 1   MEVNKFLIGGYFDGGVGEFSQEMTKAAEHFTIDDLLDFSNEDTIMTDGCFDNVAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           ST+TAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STITAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSNSHSISKPQTPETSSSS LPPSVSIPSDSS NSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQVLNYLSNSHSISKPQTPETSSSSALPPSVSIPSDSSNNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQS+IFGVTNGCDEYLISHHMGP+VRHMI
Sbjct: 301 QQFVNQSAIFGVTNGCDEYLISHHMGPSVRHMI 333

BLAST of HG10023533 vs. NCBI nr
Match: XP_008462442.1 (PREDICTED: GATA transcription factor 9-like [Cucumis melo])

HSP 1 Score: 642.1 bits (1655), Expect = 2.6e-180
Identity = 318/333 (95.50%), Postives = 323/333 (97.00%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFDGGVGEFS E  KAAEHFTIDDLLDFSNED IMTDG FDNVAGSSTD 
Sbjct: 1   MEVNKFLIGGYFDGGVGEFSQEMTKAAEHFTIDDLLDFSNEDTIMTDGCFDNVAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           ST+TAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STITAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSNSHSISKPQTPETSSSS LPPSVSIPSDSS NSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQVLNYLSNSHSISKPQTPETSSSSALPPSVSIPSDSSNNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQS+IFGVTNGCDEYLISHH GP+VRHMI
Sbjct: 301 QQFVNQSAIFGVTNGCDEYLISHHTGPSVRHMI 333

BLAST of HG10023533 vs. NCBI nr
Match: XP_004141657.1 (GATA transcription factor 9 [Cucumis sativus] >KGN45592.1 hypothetical protein Csa_016079 [Cucumis sativus])

HSP 1 Score: 631.3 bits (1627), Expect = 4.6e-177
Identity = 311/333 (93.39%), Postives = 320/333 (96.10%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFDGGVGEFSPE  KAA+HFTIDDLLDFSNED IMTDG FDN+AGSSTD 
Sbjct: 1   MEVNKFLIGGYFDGGVGEFSPEMTKAADHFTIDDLLDFSNEDTIMTDGLFDNMAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           ST+TAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STITAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSNSHS SKPQTPETSSSS LP S+SIPS+SS NSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQVLNYLSNSHSTSKPQTPETSSSSALPASLSIPSNSSNNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDA NGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDASNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKEL IAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELHIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQ +IFGVTNGCDEYLISHHMGP+VRHMI
Sbjct: 301 QQFVNQGAIFGVTNGCDEYLISHHMGPSVRHMI 333

BLAST of HG10023533 vs. NCBI nr
Match: XP_022144687.1 (GATA transcription factor 9-like [Momordica charantia])

HSP 1 Score: 603.2 bits (1554), Expect = 1.3e-168
Identity = 300/334 (89.82%), Postives = 319/334 (95.51%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFD G G+FSPEK KAAEHFTIDDLLDFSNEDA++TDGFFDNVAG+STD 
Sbjct: 1   MEVNKFLIGGYFDAGAGQFSPEKAKAAEHFTIDDLLDFSNEDAMVTDGFFDNVAGASTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCV-PCDDLAELEWLSNFVEDS 120
           STVTAVDSCNSSVSGGD HHFHGNIGS+SF ESQ S DLC+ P DDLAELEWLSNFVEDS
Sbjct: 61  STVTAVDSCNSSVSGGD-HHFHGNIGSQSFGESQLSSDLCIDPYDDLAELEWLSNFVEDS 120

Query: 121 FSTEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCK 180
           FSTEGKDLQ L+YLS+SHSISKPQTPETSSSSELPPSVSIPSD+SKN+PRFPAETPLPCK
Sbjct: 121 FSTEGKDLQALHYLSSSHSISKPQTPETSSSSELPPSVSIPSDTSKNAPRFPAETPLPCK 180

Query: 181 ARSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQW 240
           ARSKRSRTAPCDWTTRLLHLLSPADPKPPKSS+SKKK+A N +SSGRKCLHCQ+EKTPQW
Sbjct: 181 ARSKRSRTAPCDWTTRLLHLLSPADPKPPKSSTSKKKEASNSESSGRKCLHCQAEKTPQW 240

Query: 241 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQ 300
           RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQK+LQ+AQ
Sbjct: 241 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKDLQMAQ 300

Query: 301 QQQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQQF++ SSIFGVTNGCDEYLISHHMGP +RHMI
Sbjct: 301 QQQFISHSSIFGVTNGCDEYLISHHMGPTIRHMI 333

BLAST of HG10023533 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 1.7e-60
Identity = 158/344 (45.93%), Postives = 200/344 (58.14%), Query Frame = 0

Query: 30  FTIDDLL-DFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGSR 89
           F +DDLL DFSN+D    D   D VA S+T   T T  DS  S+ S  D   FHG++   
Sbjct: 14  FAVDDLLVDFSNDD----DEENDVVADSTT---TTTITDS--SNFSAADLPSFHGDVQ-- 73

Query: 90  SFDESQFSGDLCVPCDDLA-ELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPET 149
             D + FSGDLC+P DDLA ELEWLSN V++S S E  D+  L  +S   S   P++ +T
Sbjct: 74  --DGTSFSGDLCIPSDDLADELEWLSNIVDESLSPE--DVHKLELISGFKSRPDPKS-DT 133

Query: 150 SSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLL---------- 209
            S          P + + +SP F  +  +P KARSKRSR A C+W +R L          
Sbjct: 134 GS----------PENPNSSSPIFTTDVSVPAKARSKRSRAAACNWASRGLLKETFYDSPF 193

Query: 210 ----------HLLSPADP----------KPPKSSSSKKKDAPNGDSSG---RKCLHCQSE 269
                     HL  P  P          +       +KKD  + +S G   R+CLHC ++
Sbjct: 194 TGETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERRCLHCATD 253

Query: 270 KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKE 329
           KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTF+ AKHSNSHRKV+ELRRQKE
Sbjct: 254 KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKE 313

Query: 330 LQIAQQQ-----QFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           +  A  +        + + IF V++  D+YLI H++GP+ R +I
Sbjct: 314 MSRAHHEFIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of HG10023533 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.4e-56
Identity = 142/318 (44.65%), Postives = 188/318 (59.12%), Query Frame = 0

Query: 28  EHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGS 87
           + F +DDLLDFSN+D  + DG   N    S+  ST T  DS NSS              S
Sbjct: 16  DSFVVDDLLDFSNDDGEVDDGL--NTLPDSSTLSTGTLTDSSNSS--------------S 75

Query: 88  RSFDESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPET 147
              D + FS DL +P DD+AELEWLSNFVE+SF+  G+D   L+  S    +  PQT  +
Sbjct: 76  LFTDGTGFS-DLYIPNDDIAELEWLSNFVEESFA--GEDQDKLHLFS---GLKNPQTTGS 135

Query: 148 SSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLLHLLSPADPKP 207
           + +  + P    P    +      +   +P KARSKRSR+A   W +RLL L    +  P
Sbjct: 136 TLTHLIKPE---PELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDETNP 195

Query: 208 PKSSSSKKKDAPNGD--------SSGRKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYKS 267
            K     K+    GD          GR+CLHC +EKTPQWRTGPMGPKTLCNACGVRYKS
Sbjct: 196 KKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVRYKS 255

Query: 268 GRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQ-QQQFMNQSSIFGVTNGCDE 327
           GRLVPEYRPA+SPTF+ A+HSNSHRKV+ELRRQKE++      Q   ++ +  + +  ++
Sbjct: 256 GRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSNGED 308

Query: 328 YLI---SHHMGPNVRHMI 334
           +L+   ++H+ P+ RH+I
Sbjct: 316 FLMHNNTNHVAPDFRHLI 308

BLAST of HG10023533 vs. ExPASy Swiss-Prot
Match: O49741 (GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 2.5e-40
Identity = 115/283 (40.64%), Postives = 140/283 (49.47%), Query Frame = 0

Query: 32  IDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGSRSFD 91
           IDDLLDFSNED           + SS+  ST     S          HH H      S D
Sbjct: 14  IDDLLDFSNEDIF---------SASSSGGSTAATSSSSFPPPQNPSFHHHH---LPSSAD 73

Query: 92  ESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPETSSSS 151
              F  D+CVP DD A LEWLS FV+DSF+                    P  P   + +
Sbjct: 74  HHSFLHDICVPSDDAAHLEWLSQFVDDSFA------------------DFPANPLGGTMT 133

Query: 152 ELPPSVSIPSDSSKNSPRFPAE-----TPLPCKARSKRSRTAPCDWTTRLLHLLSPADPK 211
            +    S P        R PA      +P+P ++  ++            LH  +   PK
Sbjct: 134 SVKTETSFPGKPRSKRSRAPAPFAGTWSPMPLESEHQQ------------LHSAAKFKPK 193

Query: 212 PPKSSSSKKKDAPNGDSSG--------RKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYK 271
             +S         +  SS         R+C HC SEKTPQWRTGP+GPKTLCNACGVR+K
Sbjct: 194 KEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFK 253

Query: 272 SGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQQ 302
           SGRLVPEYRPA+SPTF+  +HSNSHRKV+ELRRQKE+    QQ
Sbjct: 254 SGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQ 254

BLAST of HG10023533 vs. ExPASy Swiss-Prot
Match: O49743 (GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 2.5e-40
Identity = 120/273 (43.96%), Postives = 146/273 (53.48%), Query Frame = 0

Query: 24  MKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHG 83
           M + +   IDDLLDFSN          D +  SS   STVT+  + +S+ S  +   F  
Sbjct: 6   MSSPDLLRIDDLLDFSN----------DEIFSSS---STVTS-SAASSAASSENPFSFPS 65

Query: 84  NIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQ 143
           +  +     + F+ DLCVP DD A LEWLS FV+DSFS                    P 
Sbjct: 66  STYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS------------------DFPA 125

Query: 144 TPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLLHLLSPA 203
            P T +   + P +S          R PA         S     AP    + L H  S A
Sbjct: 126 NPLTMT---VRPEISFTGKPRSRRSRAPAP--------SVAGTWAPMS-ESELCH--SVA 185

Query: 204 DPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYKSGRLV 263
            PKP K  +++   A       R+C HC SEKTPQWRTGP+GPKTLCNACGVRYKSGRLV
Sbjct: 186 KPKPKKVYNAESVTA----DGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLV 228

Query: 264 PEYRPAASPTFISAKHSNSHRKVLELRRQKELQ 297
           PEYRPA+SPTF+  +HSNSHRKV+ELRRQKE Q
Sbjct: 246 PEYRPASSPTFVLTQHSNSHRKVMELRRQKEQQ 228

BLAST of HG10023533 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 2.3e-33
Identity = 110/306 (35.95%), Postives = 144/306 (47.06%), Query Frame = 0

Query: 26  AAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNI 85
           + + F++DDLLD SN+D           A   TD      +   +S     D      + 
Sbjct: 37  SVDDFSVDDLLDLSNDDVF---------ADEETDLKAQHEMVRVSSEEPNDDGDALRRSS 96

Query: 86  GSRSFDE--SQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQ 145
                D+  S  + +L +P DDLA LEWLS+FVEDSF+          Y   + + +  +
Sbjct: 97  DFSGCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFT---------EYSGPNLTGTPTE 156

Query: 146 TPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRL------- 205
            P   +     P  ++  ++   S       P+P KARSKR+R     W+          
Sbjct: 157 KPAWLTGDRKHPVTAVTEETCFKS-------PVPAKARSKRNRNGLKVWSLGSSSSSGPS 216

Query: 206 -------------------LHLLSP--ADPKPPKSSSSKKKDAPNGDSS-------GRKC 265
                                LL P     +PP     KK+ A +  S         RKC
Sbjct: 217 SSGSTSSSSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKC 276

Query: 266 LHCQSEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLE 295
            HC  +KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E
Sbjct: 277 SHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIE 317

BLAST of HG10023533 vs. ExPASy TrEMBL
Match: A0A5A7UBN6 (GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G00220 PE=3 SV=1)

HSP 1 Score: 644.4 bits (1661), Expect = 2.6e-181
Identity = 319/333 (95.80%), Postives = 324/333 (97.30%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFDGGVGEFS E  KAAEHFTIDDLLDFSNED IMTDG FDNVAGSSTD 
Sbjct: 1   MEVNKFLIGGYFDGGVGEFSQEMTKAAEHFTIDDLLDFSNEDTIMTDGCFDNVAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           ST+TAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STITAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSNSHSISKPQTPETSSSS LPPSVSIPSDSS NSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQVLNYLSNSHSISKPQTPETSSSSALPPSVSIPSDSSNNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQS+IFGVTNGCDEYLISHHMGP+VRHMI
Sbjct: 301 QQFVNQSAIFGVTNGCDEYLISHHMGPSVRHMI 333

BLAST of HG10023533 vs. ExPASy TrEMBL
Match: A0A1S3CIH2 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103500789 PE=3 SV=1)

HSP 1 Score: 642.1 bits (1655), Expect = 1.3e-180
Identity = 318/333 (95.50%), Postives = 323/333 (97.00%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFDGGVGEFS E  KAAEHFTIDDLLDFSNED IMTDG FDNVAGSSTD 
Sbjct: 1   MEVNKFLIGGYFDGGVGEFSQEMTKAAEHFTIDDLLDFSNEDTIMTDGCFDNVAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           ST+TAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STITAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSNSHSISKPQTPETSSSS LPPSVSIPSDSS NSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQVLNYLSNSHSISKPQTPETSSSSALPPSVSIPSDSSNNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQS+IFGVTNGCDEYLISHH GP+VRHMI
Sbjct: 301 QQFVNQSAIFGVTNGCDEYLISHHTGPSVRHMI 333

BLAST of HG10023533 vs. ExPASy TrEMBL
Match: A0A0A0KB38 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_7G452960 PE=3 SV=1)

HSP 1 Score: 631.3 bits (1627), Expect = 2.2e-177
Identity = 311/333 (93.39%), Postives = 320/333 (96.10%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFDGGVGEFSPE  KAA+HFTIDDLLDFSNED IMTDG FDN+AGSSTD 
Sbjct: 1   MEVNKFLIGGYFDGGVGEFSPEMTKAADHFTIDDLLDFSNEDTIMTDGLFDNMAGSSTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           ST+TAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STITAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYLSNSHS SKPQTPETSSSS LP S+SIPS+SS NSPRFPAETPLPCKA
Sbjct: 121 STEGKDLQVLNYLSNSHSTSKPQTPETSSSSALPASLSIPSNSSNNSPRFPAETPLPCKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDA NGDSSGRKCLHCQ+EKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDASNGDSSGRKCLHCQAEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQ 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKEL IAQQ
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELHIAQQ 300

Query: 301 QQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQF+NQ +IFGVTNGCDEYLISHHMGP+VRHMI
Sbjct: 301 QQFVNQGAIFGVTNGCDEYLISHHMGPSVRHMI 333

BLAST of HG10023533 vs. ExPASy TrEMBL
Match: A0A6J1CUD7 (GATA transcription factor OS=Momordica charantia OX=3673 GN=LOC111014311 PE=3 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 6.5e-169
Identity = 300/334 (89.82%), Postives = 319/334 (95.51%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKFLIGGYFD G G+FSPEK KAAEHFTIDDLLDFSNEDA++TDGFFDNVAG+STD 
Sbjct: 1   MEVNKFLIGGYFDAGAGQFSPEKAKAAEHFTIDDLLDFSNEDAMVTDGFFDNVAGASTDS 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCV-PCDDLAELEWLSNFVEDS 120
           STVTAVDSCNSSVSGGD HHFHGNIGS+SF ESQ S DLC+ P DDLAELEWLSNFVEDS
Sbjct: 61  STVTAVDSCNSSVSGGD-HHFHGNIGSQSFGESQLSSDLCIDPYDDLAELEWLSNFVEDS 120

Query: 121 FSTEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCK 180
           FSTEGKDLQ L+YLS+SHSISKPQTPETSSSSELPPSVSIPSD+SKN+PRFPAETPLPCK
Sbjct: 121 FSTEGKDLQALHYLSSSHSISKPQTPETSSSSELPPSVSIPSDTSKNAPRFPAETPLPCK 180

Query: 181 ARSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQW 240
           ARSKRSRTAPCDWTTRLLHLLSPADPKPPKSS+SKKK+A N +SSGRKCLHCQ+EKTPQW
Sbjct: 181 ARSKRSRTAPCDWTTRLLHLLSPADPKPPKSSTSKKKEASNSESSGRKCLHCQAEKTPQW 240

Query: 241 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQ 300
           RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQK+LQ+AQ
Sbjct: 241 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKDLQMAQ 300

Query: 301 QQQFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           QQQF++ SSIFGVTNGCDEYLISHHMGP +RHMI
Sbjct: 301 QQQFISHSSIFGVTNGCDEYLISHHMGPTIRHMI 333

BLAST of HG10023533 vs. ExPASy TrEMBL
Match: A0A6J1GN14 (GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111455890 PE=3 SV=1)

HSP 1 Score: 599.7 bits (1545), Expect = 7.2e-168
Identity = 301/329 (91.49%), Postives = 309/329 (93.92%), Query Frame = 0

Query: 1   MEVNKFLIGGYFDGGVGEFSPEKMKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDC 60
           MEVNKF+IGGYFD GVG+FSPEK KA EHF IDDL DFSNEDAIM DGFFDNVAG+ST+ 
Sbjct: 1   MEVNKFMIGGYFDAGVGQFSPEKTKATEHFAIDDLFDFSNEDAIMMDGFFDNVAGTSTES 60

Query: 61  STVTAVDSCNSSVSGGDHHHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120
           STVTAVDSCNSSVSGGDH HFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF
Sbjct: 61  STVTAVDSCNSSVSGGDHQHFHGNIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSF 120

Query: 121 STEGKDLQTLNYLSNSHSISKPQTPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKA 180
           STEGKDLQ LNYL+NSHSISKPQTPETSSSSELP     PSDSSKN+PRFPAETPLP KA
Sbjct: 121 STEGKDLQALNYLTNSHSISKPQTPETSSSSELP-----PSDSSKNTPRFPAETPLPSKA 180

Query: 181 RSKRSRTAPCDWTTRLLHLLSPADPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWR 240
           RSKRSRTAPCDWTTRLLHLLSP D KPPKSSSSKKKDAPNGDSS RKCLHCQSEKTPQWR
Sbjct: 181 RSKRSRTAPCDWTTRLLHLLSPTDRKPPKSSSSKKKDAPNGDSSSRKCLHCQSEKTPQWR 240

Query: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIA-- 300
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQ+A  
Sbjct: 241 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQMAQQ 300

Query: 301 QQQQFMNQSSIFGVTNGCDEYLISHHMGP 328
           QQQQF+NQSSIFGVTNGCDEYLISHHMGP
Sbjct: 301 QQQQFINQSSIFGVTNGCDEYLISHHMGP 324

BLAST of HG10023533 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12 )

HSP 1 Score: 234.6 bits (597), Expect = 1.2e-61
Identity = 158/344 (45.93%), Postives = 200/344 (58.14%), Query Frame = 0

Query: 30  FTIDDLL-DFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGSR 89
           F +DDLL DFSN+D    D   D VA S+T   T T  DS  S+ S  D   FHG++   
Sbjct: 14  FAVDDLLVDFSNDD----DEENDVVADSTT---TTTITDS--SNFSAADLPSFHGDVQ-- 73

Query: 90  SFDESQFSGDLCVPCDDLA-ELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPET 149
             D + FSGDLC+P DDLA ELEWLSN V++S S E  D+  L  +S   S   P++ +T
Sbjct: 74  --DGTSFSGDLCIPSDDLADELEWLSNIVDESLSPE--DVHKLELISGFKSRPDPKS-DT 133

Query: 150 SSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLL---------- 209
            S          P + + +SP F  +  +P KARSKRSR A C+W +R L          
Sbjct: 134 GS----------PENPNSSSPIFTTDVSVPAKARSKRSRAAACNWASRGLLKETFYDSPF 193

Query: 210 ----------HLLSPADP----------KPPKSSSSKKKDAPNGDSSG---RKCLHCQSE 269
                     HL  P  P          +       +KKD  + +S G   R+CLHC ++
Sbjct: 194 TGETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERRCLHCATD 253

Query: 270 KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKE 329
           KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTF+ AKHSNSHRKV+ELRRQKE
Sbjct: 254 KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKE 313

Query: 330 LQIAQQQ-----QFMNQSSIFGVTNGCDEYLISHHMGPNVRHMI 334
           +  A  +        + + IF V++  D+YLI H++GP+ R +I
Sbjct: 314 MSRAHHEFIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of HG10023533 vs. TAIR 10
Match: AT4G32890.1 (GATA transcription factor 9 )

HSP 1 Score: 221.5 bits (563), Expect = 1.0e-57
Identity = 142/318 (44.65%), Postives = 188/318 (59.12%), Query Frame = 0

Query: 28  EHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGS 87
           + F +DDLLDFSN+D  + DG   N    S+  ST T  DS NSS              S
Sbjct: 16  DSFVVDDLLDFSNDDGEVDDGL--NTLPDSSTLSTGTLTDSSNSS--------------S 75

Query: 88  RSFDESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPET 147
              D + FS DL +P DD+AELEWLSNFVE+SF+  G+D   L+  S    +  PQT  +
Sbjct: 76  LFTDGTGFS-DLYIPNDDIAELEWLSNFVEESFA--GEDQDKLHLFS---GLKNPQTTGS 135

Query: 148 SSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLLHLLSPADPKP 207
           + +  + P    P    +      +   +P KARSKRSR+A   W +RLL L    +  P
Sbjct: 136 TLTHLIKPE---PELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDETNP 195

Query: 208 PKSSSSKKKDAPNGD--------SSGRKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYKS 267
            K     K+    GD          GR+CLHC +EKTPQWRTGPMGPKTLCNACGVRYKS
Sbjct: 196 KKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVRYKS 255

Query: 268 GRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQ-QQQFMNQSSIFGVTNGCDE 327
           GRLVPEYRPA+SPTF+ A+HSNSHRKV+ELRRQKE++      Q   ++ +  + +  ++
Sbjct: 256 GRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSNGED 308

Query: 328 YLI---SHHMGPNVRHMI 334
           +L+   ++H+ P+ RH+I
Sbjct: 316 FLMHNNTNHVAPDFRHLI 308

BLAST of HG10023533 vs. TAIR 10
Match: AT2G45050.1 (GATA transcription factor 2 )

HSP 1 Score: 167.5 bits (423), Expect = 1.8e-41
Identity = 115/283 (40.64%), Postives = 140/283 (49.47%), Query Frame = 0

Query: 32  IDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNIGSRSFD 91
           IDDLLDFSNED           + SS+  ST     S          HH H      S D
Sbjct: 14  IDDLLDFSNEDIF---------SASSSGGSTAATSSSSFPPPQNPSFHHHH---LPSSAD 73

Query: 92  ESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQTPETSSSS 151
              F  D+CVP DD A LEWLS FV+DSF+                    P  P   + +
Sbjct: 74  HHSFLHDICVPSDDAAHLEWLSQFVDDSFA------------------DFPANPLGGTMT 133

Query: 152 ELPPSVSIPSDSSKNSPRFPAE-----TPLPCKARSKRSRTAPCDWTTRLLHLLSPADPK 211
            +    S P        R PA      +P+P ++  ++            LH  +   PK
Sbjct: 134 SVKTETSFPGKPRSKRSRAPAPFAGTWSPMPLESEHQQ------------LHSAAKFKPK 193

Query: 212 PPKSSSSKKKDAPNGDSSG--------RKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYK 271
             +S         +  SS         R+C HC SEKTPQWRTGP+GPKTLCNACGVR+K
Sbjct: 194 KEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFK 253

Query: 272 SGRLVPEYRPAASPTFISAKHSNSHRKVLELRRQKELQIAQQQ 302
           SGRLVPEYRPA+SPTF+  +HSNSHRKV+ELRRQKE+    QQ
Sbjct: 254 SGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQ 254

BLAST of HG10023533 vs. TAIR 10
Match: AT3G60530.1 (GATA transcription factor 4 )

HSP 1 Score: 167.5 bits (423), Expect = 1.8e-41
Identity = 120/273 (43.96%), Postives = 146/273 (53.48%), Query Frame = 0

Query: 24  MKAAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHG 83
           M + +   IDDLLDFSN          D +  SS   STVT+  + +S+ S  +   F  
Sbjct: 6   MSSPDLLRIDDLLDFSN----------DEIFSSS---STVTS-SAASSAASSENPFSFPS 65

Query: 84  NIGSRSFDESQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQ 143
           +  +     + F+ DLCVP DD A LEWLS FV+DSFS                    P 
Sbjct: 66  STYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS------------------DFPA 125

Query: 144 TPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRLLHLLSPA 203
            P T +   + P +S          R PA         S     AP    + L H  S A
Sbjct: 126 NPLTMT---VRPEISFTGKPRSRRSRAPAP--------SVAGTWAPMS-ESELCH--SVA 185

Query: 204 DPKPPKSSSSKKKDAPNGDSSGRKCLHCQSEKTPQWRTGPMGPKTLCNACGVRYKSGRLV 263
            PKP K  +++   A       R+C HC SEKTPQWRTGP+GPKTLCNACGVRYKSGRLV
Sbjct: 186 KPKPKKVYNAESVTA----DGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLV 228

Query: 264 PEYRPAASPTFISAKHSNSHRKVLELRRQKELQ 297
           PEYRPA+SPTF+  +HSNSHRKV+ELRRQKE Q
Sbjct: 246 PEYRPASSPTFVLTQHSNSHRKVMELRRQKEQQ 228

BLAST of HG10023533 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5 )

HSP 1 Score: 144.4 bits (363), Expect = 1.6e-34
Identity = 110/306 (35.95%), Postives = 144/306 (47.06%), Query Frame = 0

Query: 26  AAEHFTIDDLLDFSNEDAIMTDGFFDNVAGSSTDCSTVTAVDSCNSSVSGGDHHHFHGNI 85
           + + F++DDLLD SN+D           A   TD      +   +S     D      + 
Sbjct: 37  SVDDFSVDDLLDLSNDDVF---------ADEETDLKAQHEMVRVSSEEPNDDGDALRRSS 96

Query: 86  GSRSFDE--SQFSGDLCVPCDDLAELEWLSNFVEDSFSTEGKDLQTLNYLSNSHSISKPQ 145
                D+  S  + +L +P DDLA LEWLS+FVEDSF+          Y   + + +  +
Sbjct: 97  DFSGCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFT---------EYSGPNLTGTPTE 156

Query: 146 TPETSSSSELPPSVSIPSDSSKNSPRFPAETPLPCKARSKRSRTAPCDWTTRL------- 205
            P   +     P  ++  ++   S       P+P KARSKR+R     W+          
Sbjct: 157 KPAWLTGDRKHPVTAVTEETCFKS-------PVPAKARSKRNRNGLKVWSLGSSSSSGPS 216

Query: 206 -------------------LHLLSP--ADPKPPKSSSSKKKDAPNGDSS-------GRKC 265
                                LL P     +PP     KK+ A +  S         RKC
Sbjct: 217 SSGSTSSSSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKC 276

Query: 266 LHCQSEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFISAKHSNSHRKVLE 295
            HC  +KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E
Sbjct: 277 SHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIE 317

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038899153.14.0e-18196.10GATA transcription factor 9-like [Benincasa hispida][more]
KAA0051035.15.3e-18195.80GATA transcription factor 9-like [Cucumis melo var. makuwa] >TYK03830.1 GATA tra... [more]
XP_008462442.12.6e-18095.50PREDICTED: GATA transcription factor 9-like [Cucumis melo][more]
XP_004141657.14.6e-17793.39GATA transcription factor 9 [Cucumis sativus] >KGN45592.1 hypothetical protein C... [more]
XP_022144687.11.3e-16889.82GATA transcription factor 9-like [Momordica charantia][more]
Match NameE-valueIdentityDescription
P697811.7e-6045.93GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1[more]
O826321.4e-5644.65GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1[more]
O497412.5e-4040.64GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1[more]
O497432.5e-4043.96GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1[more]
Q9FH572.3e-3335.95GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7UBN62.6e-18195.80GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo... [more]
A0A1S3CIH21.3e-18095.50GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103500789 PE=3 SV=1[more]
A0A0A0KB382.2e-17793.39GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_7G452960 PE=3 SV=1[more]
A0A6J1CUD76.5e-16989.82GATA transcription factor OS=Momordica charantia OX=3673 GN=LOC111014311 PE=3 SV... [more]
A0A6J1GN147.2e-16891.49GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111455890 PE=3 SV=... [more]
Match NameE-valueIdentityDescription
AT5G25830.11.2e-6145.93GATA transcription factor 12 [more]
AT4G32890.11.0e-5744.65GATA transcription factor 9 [more]
AT2G45050.11.8e-4140.64GATA transcription factor 2 [more]
AT3G60530.11.8e-4143.96GATA transcription factor 4 [more]
AT5G66320.11.6e-3435.95GATA transcription factor 5 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000679Zinc finger, GATA-typeSMARTSM00401GATA_3coord: 222..272
e-value: 2.6E-17
score: 73.5
IPR000679Zinc finger, GATA-typePFAMPF00320GATAcoord: 228..261
e-value: 1.9E-14
score: 53.0
IPR000679Zinc finger, GATA-typePROSITEPS00344GATA_ZN_FINGER_1coord: 228..253
IPR000679Zinc finger, GATA-typePROSITEPS50114GATA_ZN_FINGER_2coord: 222..258
score: 12.386794
IPR000679Zinc finger, GATA-typeCDDcd00202ZnF_GATAcoord: 227..278
e-value: 1.93135E-13
score: 62.005
IPR013088Zinc finger, NHR/GATA-typeGENE3D3.30.50.10coord: 222..272
e-value: 1.3E-15
score: 58.7
IPR016679Transcription factor, GATA, plantPIRSFPIRSF016992Txn_fac_GATA_plantcoord: 15..314
e-value: 3.5E-79
score: 264.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 136..170
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 209..223
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 196..229
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 136..182
NoneNo IPR availablePANTHERPTHR45658:SF46GATA TRANSCRIPTION FACTOR 9coord: 1..333
NoneNo IPR availablePANTHERPTHR45658GATA TRANSCRIPTION FACTORcoord: 1..333
NoneNo IPR availableSUPERFAMILY57716Glucocorticoid receptor-like (DNA-binding domain)coord: 223..286

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10023533.1HG10023533.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030154 cell differentiation
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding