Lag0005038 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0005038
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: GATA transcription factor
Location: chr6: 9819987 .. 9822050 (-)
RNA-Seq Expression: Lag0005038
Synteny: Lag0005038
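The Location field can be split into chromosome, coordinates, and strand with a small parser. The following is a minimal sketch; the function names `parse_location` and `span_length` are illustrative, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re

# Pattern for a location such as "chr6: 9819987 .. 9822050 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (chromosome, start, end, strand) from a location string."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("chr6: 9819987 .. 9822050 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 2064 bp on the minus strand of chr6.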
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCTCCAGAATATTTCCAGAACAATGGCTACTGCTCCCAATTCGCCGCCGACAACGACGCCCCCGCCGCCGTCGGAGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACCACGACGCCGACGCCGGAGGATTGTTCAACAATGCCACCGCCTGCTTCCTCAATAATAATGGAATTAACTCCGCCTCCGCCGACTCCTCCGCCCTCACCGTCGTCGAGAGCTGCAATTCCTCCAATTCCTTCTCGGTTTCCGAACCCAATTCCTTTCTCGAAGACATTAGTGCCTCTAATTTAGCCGACGCCCATTTCTCCGACGAACTCTGCATTCCGGTAATTGTTCTTATTATCGTTAATTATTATTTTTTTCTTTAAATAGGATGACGAGGAACTTTTCTTTTGGGGATTTCTTGCTTTTTTTGTTTTTTTTTACGTACATTGAACATTTTTCTGTGTATTAATTATATAATTTTGTTGCGGAATAGTAATTAATGACTAAATTCATTTGAAATTAAATTACACTCCAATGGCCTTTTTTTTTTTTTTTTTTGGGTGTGTGTTTTCTTCTTAATAGTAAGCTTAGAGTATTTACCTTATTTGTGATATATAAAAAGTTAAGATGGTTAAAAATACTTTTCAATTGGGGTTTATTTTGTTTTTGGTGAATGTATAAAATGGTCTCTTTTTTTTTTTTTTGCTTAAATTTCTTTTAGTAATTTGATATTTAATCTTCTAAAGTTCACTATTCTAGTTTGTACATTGGAAAACGATTGATGTGTCATTCAATTGTTGTTTTATTAACATATTTGAACATAAAAGCATGTGACTTTGATATACCACTTGAAGAAGTAGTTTTTGTTGATGAAAGGCAAAACTCTACGGAATAAACTAGTTGGATTCTGTGTTTTCCATTAGAATCTAAGTGACATATTTGTGCGTCCAAATATGTTTTGTGGTGGGAGATTTGAAACACCAGAAAAAATGGACCTACGTGAACACACTAACTTTTAACAGGCTCTTTTTTGAAATTTCAACCATATAGTAATATATATTTTTTTTAATTCAACAACATTTGAGGTGGGAGTTTCGAACCTCTGACTTTTTGATCGGAAGTACATGTCAATTACTGCTAAGCTGCTTACTTTGGCAACCATATAGTATTAATATAGTAAAATAAGATGTAAATTTTAACCATAGTAAATAAGATGGTAAATTTTTGTAATGAAGATTATTATTAGTATTATTAACACATTAGCTTTTTTAATTTTTGTGTTTTTACAGTTAGATGATTTAGCTGAGTTGGAATGGCTTTCAAATTTCGTAGAGGAATCATTTTCCAGTGAGGACATGCAAAAGTTAGAACTCATCTCCGGAGTCAAAGTCAAATCCGACGGACCCTCCCACTCCCGACAACCCACCACCGCCGCCGTCTCCACCACCACCCACGCCCGAAACGCCGCCGAAATCTTCAAACCCGACATTGTCTCAGTTCCGGCGAAGGCCCGCAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTCCCCCTCTCTCCGACCACCTCCTCGTCGGAATCCGAGATCGCCCCCGCCGGACCACCGCAGCCGGTCAAAAAAGTCCCGCCGAAGGTGGCGGCGACGGTGAAGAAGAAGGACTGCCCGGAGGCCGGAGCGTCCGCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACCGGCCCAATGGGCCCAAAGACGCTGTGTAACGCCTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAATACCGCCCCGCCGCCAGCCCCACCTTCGTCCTCACCAAACACTCGAACTCCCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAGGAGCTTCTCAGAGCCCAACAACAGCAACAGCAACAAGTGCTTTTGGATCACCATCACCATCACCGTCATCAGGATATGATGTTTGATTC
ATCCAACGGTGACGACTATCTCATCCATCAACACGTGGGCCCCGATTTCCGGCAGCTGATCTGA

mRNA sequence

ATGGAAGCTCCAGAATATTTCCAGAACAATGGCTACTGCTCCCAATTCGCCGCCGACAACGACGCCCCCGCCGCCGTCGGAGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACCACGACGCCGACGCCGGAGGATTGTTCAACAATGCCACCGCCTGCTTCCTCAATAATAATGGAATTAACTCCGCCTCCGCCGACTCCTCCGCCCTCACCGTCGTCGAGAGCTGCAATTCCTCCAATTCCTTCTCGGTTTCCGAACCCAATTCCTTTCTCGAAGACATTAGTGCCTCTAATTTAGCCGACGCCCATTTCTCCGACGAACTCTGCATTCCGTTAGATGATTTAGCTGAGTTGGAATGGCTTTCAAATTTCGTAGAGGAATCATTTTCCAGTGAGGACATGCAAAAGTTAGAACTCATCTCCGGAGTCAAAGTCAAATCCGACGGACCCTCCCACTCCCGACAACCCACCACCGCCGCCGTCTCCACCACCACCCACGCCCGAAACGCCGCCGAAATCTTCAAACCCGACATTGTCTCAGTTCCGGCGAAGGCCCGCAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTCCCCCTCTCTCCGACCACCTCCTCGTCGGAATCCGAGATCGCCCCCGCCGGACCACCGCAGCCGGTCAAAAAAGTCCCGCCGAAGGTGGCGGCGACGGTGAAGAAGAAGGACTGCCCGGAGGCCGGAGCGTCCGCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACCGGCCCAATGGGCCCAAAGACGCTGTGTAACGCCTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAATACCGCCCCGCCGCCAGCCCCACCTTCGTCCTCACCAAACACTCGAACTCCCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAGGAGCTTCTCAGAGCCCAACAACAGCAACAGCAACAAGTGCTTTTGGATCACCATCACCATCACCGTCATCAGGATATGATGTTTGATTCATCCAACGGTGACGACTATCTCATCCATCAACACGTGGGCCCCGATTTCCGGCAGCTGATCTGA

Coding sequence (CDS)

ATGGAAGCTCCAGAATATTTCCAGAACAATGGCTACTGCTCCCAATTCGCCGCCGACAACGACGCCCCCGCCGCCGTCGGAGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACCACGACGCCGACGCCGGAGGATTGTTCAACAATGCCACCGCCTGCTTCCTCAATAATAATGGAATTAACTCCGCCTCCGCCGACTCCTCCGCCCTCACCGTCGTCGAGAGCTGCAATTCCTCCAATTCCTTCTCGGTTTCCGAACCCAATTCCTTTCTCGAAGACATTAGTGCCTCTAATTTAGCCGACGCCCATTTCTCCGACGAACTCTGCATTCCGTTAGATGATTTAGCTGAGTTGGAATGGCTTTCAAATTTCGTAGAGGAATCATTTTCCAGTGAGGACATGCAAAAGTTAGAACTCATCTCCGGAGTCAAAGTCAAATCCGACGGACCCTCCCACTCCCGACAACCCACCACCGCCGCCGTCTCCACCACCACCCACGCCCGAAACGCCGCCGAAATCTTCAAACCCGACATTGTCTCAGTTCCGGCGAAGGCCCGCAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTCCCCCTCTCTCCGACCACCTCCTCGTCGGAATCCGAGATCGCCCCCGCCGGACCACCGCAGCCGGTCAAAAAAGTCCCGCCGAAGGTGGCGGCGACGGTGAAGAAGAAGGACTGCCCGGAGGCCGGAGCGTCCGCCGGAGAGGGGCGGAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACCGGCCCAATGGGCCCAAAGACGCTGTGTAACGCCTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAATACCGCCCCGCCGCCAGCCCCACCTTCGTCCTCACCAAACACTCGAACTCCCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAGGAGCTTCTCAGAGCCCAACAACAGCAACAGCAACAAGTGCTTTTGGATCACCATCACCATCACCGTCATCAGGATATGATGTTTGATTCATCCAACGGTGACGACTATCTCATCCATCAACACGTGGGCCCCGATTTCCGGCAGCTGATCTGA

Protein sequence

MEAPEYFQNNGYCSQFAADNDAPAAVGDHFIVEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQDMMFDSSNGDDYLIHQHVGPDFRQLI
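The protein sequence above corresponds to a translation of the CDS with the standard genetic code. The following is a minimal sketch of that translation in plain Python; the `translate` helper is illustrative and is not the pipeline that produced this record. It is applied here only to the first 30 and last 15 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS shown above.
print(translate("ATGGAAGCTCCAGAATATTTCCAGAACAAT"))
print(translate("CGGCAGCTGATCTGA"))
```

The two fragments give MEAPEYFQNN and RQLI, matching the start and end of the protein sequence listed above.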
Homology
BLAST of Lag0005038 vs. NCBI nr
Match: XP_038886306.1 (GATA transcription factor 12-like [Benincasa hispida])

HSP 1 Score: 523.1 bits (1346), Expect = 2.0e-144
Identity = 295/385 (76.62%), Positives = 313/385 (81.30%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQF----AADNDAPAAVG----DHFIVEELLDFSNHD----ADAGGL 60
           MEAPEYFQ NGYCSQF    ++D D   A      +HFIVEELLDFSN D     D GGL
Sbjct: 1   MEAPEYFQINGYCSQFSTHSSSDTDTTTATATAGPEHFIVEELLDFSNDDDGVVGDGGGL 60

Query: 61  FNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPN-SFLEDISASNLADAHFS 120
           F N      N N  N+ S +SSA+TV+ESCNSS SFS  EPN SFLEDIS SNLADAHFS
Sbjct: 61  FYNTN----NGNNNNNNSTESSAVTVIESCNSS-SFSGCEPNSSFLEDISGSNLADAHFS 120

Query: 121 DELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTT 180
            ELC+P DDLAELEWLS+FVEESFSSEDMQKLELISGVKV+SD P++SRQPT        
Sbjct: 121 SELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVRSDEPTNSRQPTA------- 180

Query: 181 HARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQP 240
             RNAA IFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTT   E EI A AGPP P
Sbjct: 181 -TRNAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTT---EPEITATAGPPHP 240

Query: 241 VKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300
           +KK PPK AAT KKKD PE G S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Sbjct: 241 IKKNPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300

Query: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQD 360
           SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQ +LLDH     HQD
Sbjct: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDH-----HQD 360

Query: 361 MMFDSSNGDDYLIHQHVGPDFRQLI 372
           M+FD+SNGDDYLIHQH+GPDFRQLI
Sbjct: 361 MIFDASNGDDYLIHQHMGPDFRQLI 363
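The percentages on each HSP summary line are the match counts divided by the alignment length (gaps included). The following is a minimal sketch that parses such a line and recomputes them; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 295/385 (76.62%), Positives = 313/385 (81.30%)".
# "Pos\w*tives" tolerates the misspelling "Postives" seen in some outputs.
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w*tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    match = HSP_STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, positive, length2 = map(int, match.groups())
    if length != length2:
        raise ValueError("identity and positives use different lengths")
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 295/385 (76.62%), Positives = 313/385 (81.30%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the Benincasa hispida hit this reproduces the reported values: 295/385 is 76.62% identity and 313/385 is 81.30% positives.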

BLAST of Lag0005038 vs. NCBI nr
Match: XP_023002390.1 (GATA transcription factor 12-like [Cucurbita maxima])

HSP 1 Score: 494.6 bits (1272), Expect = 7.5e-136
Identity = 276/386 (71.50%), Positives = 307/386 (79.53%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF NN YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAASTATATATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFLN N     S++SSA T VES NSS SFS  E  SF +D+S S+LAD  FSD++
Sbjct: 61  NVT-CFLNGN-----SSESSAATAVESSNSS-SFSGCERTSFFDDVSGSSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHAR 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S+ PTT AVS  +H R
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPLQSQHPTT-AVSAASHGR 180

Query: 181 N-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQPVK 240
           N AAEIFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE +I A   PP PVK
Sbjct: 181 NAAAEIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSEQDIPATEPPPHPVK 240

Query: 241 KVPPKVAATVKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY 300
           KVPPKVAA VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
Sbjct: 241 KVPPKVAAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY 300

Query: 301 KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQ 360
           KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH HQ
Sbjct: 301 KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHHHQ 360

Query: 361 DMMFDSSNGDDYLIHQHVGPDFRQLI 372
           +MMFDSSNG+DYL+ Q+V  D+  LI
Sbjct: 361 EMMFDSSNGEDYLMKQNVAHDYLHLI 376

BLAST of Lag0005038 vs. NCBI nr
Match: XP_022951637.1 (GATA transcription factor 12-like [Cucurbita moschata])

HSP 1 Score: 492.3 bits (1266), Expect = 3.7e-135
Identity = 278/385 (72.21%), Positives = 307/385 (79.74%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPAA-VGDHFIVEELLDFSNHD----ADAGGLFNNATAC 60
           MEAPEYF NN YCSQF +D DA AA   DHFIVEELLDFSN D    AD+GG FNN T C
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAAATTADHFIVEELLDFSNDDDSAIADSGGFFNNVT-C 60

Query: 61  FLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLD 120
           FLN N     SA+SSA T VES NSS SFS SE  SF +D+SAS+LAD  FSD++ IP +
Sbjct: 61  FLNGN-----SAESSAATAVESSNSS-SFSGSERTSFFDDVSASSLADVRFSDDIFIPYN 120

Query: 121 DLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHARN-AAE 180
           +L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S  PT  AVS  +H RN AA 
Sbjct: 121 ELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSHHPTN-AVSALSHGRNAAAA 180

Query: 181 IFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQPVKKVPPK 240
           IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE +I A   PP PVKKVPPK
Sbjct: 181 IFKPDIVAVPAKARSKRSRTIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHPVKKVPPK 240

Query: 241 VAAT----VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300
           VAAT    VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Sbjct: 241 VAATATAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300

Query: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQD 360
           SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH HQ+
Sbjct: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHHHQE 360

Query: 361 MMFDSSNGDDYLIHQHVGPDFRQLI 372
           MMFDSSNG+DYL+ Q+V  D+  LI
Sbjct: 361 MMFDSSNGEDYLMKQNVAHDYLHLI 375

BLAST of Lag0005038 vs. NCBI nr
Match: KAG6585379.1 (GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020295.1 GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 491.1 bits (1263), Expect = 8.3e-135
Identity = 280/397 (70.53%), Positives = 309/397 (77.83%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF NN YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAAATTASTATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFLN N     SA+SSA T VES NSS SFS SE  SF +D+SAS+LAD  FSD++
Sbjct: 61  NVT-CFLNGN-----SAESSAATAVESSNSS-SFSGSERTSFFDDVSASSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTA--AVSTTTH 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S  PTTA  AVS  +H
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSHHPTTAVSAVSAASH 180

Query: 181 ARN-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQP 240
            RN AA IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE +I A   PP P
Sbjct: 181 GRNAAAAIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHP 240

Query: 241 VKKVPPKVAAT---------VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGP 300
           VKKVPPKVAAT         VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGP
Sbjct: 241 VKKVPPKVAATATATASTAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGP 300

Query: 301 KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQV 360
           KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +
Sbjct: 301 KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQAL 360

Query: 361 LLDHHHHHRHQDMMFDSSNGDDYLIHQHVGPDFRQLI 372
           ++DHHHHH HQ+MMFDSSNG+DYL+ Q+V  D+  LI
Sbjct: 361 MMDHHHHHHHQEMMFDSSNGEDYLMKQNVAHDYLHLI 388

BLAST of Lag0005038 vs. NCBI nr
Match: XP_023538437.1 (GATA transcription factor 12-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 488.8 bits (1257), Expect = 4.1e-134
Identity = 278/394 (70.56%), Positives = 308/394 (78.17%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF  N YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHKNAYCSQFTSDKDAAASTATASATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFLN N     SA+SSA T VES NSS SFS SE  SF +D+S S+LAD  FSD++
Sbjct: 61  NVT-CFLNGN-----SAESSAATAVESSNSS-SFSGSERTSFFDDVSGSSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTA--AVSTTTH 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S+ PTTA  AVS  +H
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSQHPTTAPSAVSAASH 180

Query: 181 ARN-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQP 240
            RN AA IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE +I A   PP P
Sbjct: 181 GRNAAAAIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHP 240

Query: 241 VKKVPPKVAAT------VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTL 300
           VKKVPPKVAAT      VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTL
Sbjct: 241 VKKVPPKVAATATATAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTL 300

Query: 301 CNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLD 360
           CNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++D
Sbjct: 301 CNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMD 360

Query: 361 HHHHHRHQDMMFDSSNGDDYLIHQHVGPDFRQLI 372
           HHHHH HQ+MMFDSSNG+DYL+ Q+V  D+  LI
Sbjct: 361 HHHHHHHQEMMFDSSNGEDYLMKQNVAHDYLHLI 385

BLAST of Lag0005038 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 267.3 bits (682), Expect = 2.6e-70
Identity = 180/357 (50.42%), Positives = 213/357 (59.66%), Query Frame = 0

Query: 30  FIVEELL-DFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSE 89
           F V++LL DFSN D +                  N   ADS+  T +     S++FS ++
Sbjct: 14  FAVDDLLVDFSNDDDEE-----------------NDVVADSTTTTTI---TDSSNFSAAD 73

Query: 90  PNSFLEDISASNLADAHFSDELCIPLDDLA-ELEWLSNFVEESFSSEDMQKLELISGVKV 149
             SF  D+         FS +LCIP DDLA ELEWLSN V+ES S ED+ KLELISG K 
Sbjct: 74  LPSFHGDVQDG----TSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKS 133

Query: 150 KSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL-- 209
           + D  S +  P         +  +++ IF  D VSVPAKARSKRSRA   NW +  LL  
Sbjct: 134 RPDPKSDTGSP--------ENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKE 193

Query: 210 -----PLSPTT--SSSESEIAPAGPP---QPVKKVPPKVAATVKKKDCPEAGASAGEGRK 269
                P +  T  SS +    P  PP    P+ K         +KKD     +   E R+
Sbjct: 194 TFYDSPFTGETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERR 253

Query: 270 CMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVL 329
           C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+
Sbjct: 254 CLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVM 313

Query: 330 ELRRQKELLRAQQQQQQQVLLDHHHHHRHQDMMFD-SSNGDDYLIHQHVGPDFRQLI 372
           ELRRQKE+ RA  +        HHHH     M+FD SS+GDDYLIH +VGPDFRQLI
Sbjct: 314 ELRRQKEMSRAHHE------FIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of Lag0005038 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 230.3 bits (586), Expect = 3.5e-59
Identity = 157/348 (45.11%), Positives = 203/348 (58.33%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVS 87
           D F+V++LLDFSN D +              ++G+N+   DSS L+     +SSNS S+ 
Sbjct: 16  DSFVVDDLLDFSNDDGEV-------------DDGLNTL-PDSSTLSTGTLTDSSNSSSL- 75

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
                          D     +L IP DD+AELEWLSNFVEESF+ ED  KL L SG+K 
Sbjct: 76  -------------FTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLK- 135

Query: 148 KSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL 207
               P  +    T  +       +         V+VPAKARSKRSR+  S W  SRLL L
Sbjct: 136 ---NPQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTW-ASRLLSL 195

Query: 208 SPTTSSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQW 267
           + +  ++        P +  ++V  +  A     DC E+G     GR+C+HCAT+KTPQW
Sbjct: 196 ADSDETN--------PKKKQRRVKEQDFAGDMDVDCGESGG----GRRCLHCATEKTPQW 255

Query: 268 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQ 327
           RTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+    
Sbjct: 256 RTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM---- 308

Query: 328 QQQQQQVLLDHHHHHRHQDMMFD-SSNGDDYLIH---QHVGPDFRQLI 372
              + + LL      R ++++ D  SNG+D+L+H    HV PDFR LI
Sbjct: 316 ---RDEHLLS---QLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of Lag0005038 vs. ExPASy Swiss-Prot
Match: O49741 (GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 3.5e-43
Identity = 128/311 (41.16%), Positives = 164/311 (52.73%), Query Frame = 0

Query: 32  VEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNS 91
           +++LLDFSN D  +                  S+S  S+A T      SS+SF   +  S
Sbjct: 14  IDDLLDFSNEDIFSA-----------------SSSGGSTAAT------SSSSFPPPQNPS 73

Query: 92  FLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG 151
           F      S+     F  ++C+P DD A LEWLS FV++SF                 +D 
Sbjct: 74  FHHHHLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDSF-----------------ADF 133

Query: 152 PSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTT 211
           P++    T  +V T T              S P K RSKRSRA          +PL    
Sbjct: 134 PANPLGGTMTSVKTET--------------SFPGKPRSKRSRAPAPFAGTWSPMPL---- 193

Query: 212 SSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGASAGEG-RKCMHCATDKTPQWRTG 271
            S   ++  A   +P K+          +     +  + G G R+C HCA++KTPQWRTG
Sbjct: 194 ESEHQQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTG 253

Query: 272 PMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ 331
           P+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R    Q
Sbjct: 254 PLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMR----Q 262

Query: 332 QQQVLLDHHHH 342
            QQV L HHHH
Sbjct: 314 PQQVQLHHHHH 262

BLAST of Lag0005038 vs. ExPASy Swiss-Prot
Match: O49743 (GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 1.5e-38
Identity = 111/253 (43.87%), Positives = 137/253 (54.15%), Query Frame = 0

Query: 70  SALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEE 129
           S+ + V S  +S++ S   P SF      S      F+ +LC+P DD A LEWLS FV++
Sbjct: 27  SSSSTVTSSAASSAASSENPFSFPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDD 86

Query: 130 SFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARS 189
           SF                 SD P++   P T  V             +P+I S   K RS
Sbjct: 87  SF-----------------SDFPAN---PLTMTV-------------RPEI-SFTGKPRS 146

Query: 190 KRSRAVPSNWNNSRLLPLSPTTSSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGAS 249
           +RSRA              P  S     +A    P    ++   VA    KK       +
Sbjct: 147 RRSRA--------------PAPS-----VAGTWAPMSESELCHSVAKPKPKKVYNAESVT 206

Query: 250 AGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 309
           A   R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSN
Sbjct: 207 ADGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSN 226

Query: 310 SHRKVLELRRQKE 323
           SHRKV+ELRRQKE
Sbjct: 267 SHRKVMELRRQKE 226

BLAST of Lag0005038 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 7.3e-33
Identity = 116/312 (37.18%), Positives = 148/312 (47.44%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVS 87
           D F V++LLD SN D             F +      A  +   ++  E  +  ++   S
Sbjct: 39  DDFSVDDLLDLSNDDV------------FADEETDLKAQHEMVRVSSEEPNDDGDALRRS 98

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
              S  +D  +        + EL +P DDLA LEWLS+FVE+SF+      L   +G   
Sbjct: 99  SDFSGCDDFGSLP------TSELSLPADDLANLEWLSHFVEDSFTEYSGPNL---TGTPT 158

Query: 148 KSDG--PSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL 207
           +         + P TA    T         FK     VPAKARSKR+R     W+     
Sbjct: 159 EKPAWLTGDRKHPVTAVTEET--------CFKS---PVPAKARSKRNRNGLKVWSLGSSS 218

Query: 208 PLSPTTSSSESEIAPAGPPQP-------VKKVPPKVAATVKKKDCPEAGASAGEG----- 267
              P++S S S  + +GP  P       ++ V         KK    +  S   G     
Sbjct: 219 SSGPSSSGSTSS-SSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQL 278

Query: 268 ---RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNS 323
              RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN 
Sbjct: 279 QPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNH 317

BLAST of Lag0005038 vs. ExPASy TrEMBL
Match: A0A6J1KNT0 (GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111496245 PE=3 SV=1)

HSP 1 Score: 494.6 bits (1272), Expect = 3.6e-136
Identity = 276/386 (71.50%), Positives = 307/386 (79.53%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPA------AVGDHFIVEELLDFSNHD----ADAGGLFN 60
           MEAPEYF NN YCSQF +D DA A      A  DHFIVEELLDFSN D    AD+GG FN
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAASTATATATADHFIVEELLDFSNDDDSAIADSGGFFN 60

Query: 61  NATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDEL 120
           N T CFLN N     S++SSA T VES NSS SFS  E  SF +D+S S+LAD  FSD++
Sbjct: 61  NVT-CFLNGN-----SSESSAATAVESSNSS-SFSGCERTSFFDDVSGSSLADVRFSDDI 120

Query: 121 CIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHAR 180
            IP ++L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S+ PTT AVS  +H R
Sbjct: 121 FIPYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPLQSQHPTT-AVSAASHGR 180

Query: 181 N-AAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQPVK 240
           N AAEIFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE +I A   PP PVK
Sbjct: 181 NAAAEIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSEQDIPATEPPPHPVK 240

Query: 241 KVPPKVAATVKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY 300
           KVPPKVAA VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY
Sbjct: 241 KVPPKVAAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY 300

Query: 301 KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQ 360
           KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH HQ
Sbjct: 301 KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHHHQ 360

Query: 361 DMMFDSSNGDDYLIHQHVGPDFRQLI 372
           +MMFDSSNG+DYL+ Q+V  D+  LI
Sbjct: 361 EMMFDSSNGEDYLMKQNVAHDYLHLI 376

BLAST of Lag0005038 vs. ExPASy TrEMBL
Match: A0A6J1GI87 (GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111454392 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 1.8e-135
Identity = 278/385 (72.21%), Positives = 307/385 (79.74%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDAPAA-VGDHFIVEELLDFSNHD----ADAGGLFNNATAC 60
           MEAPEYF NN YCSQF +D DA AA   DHFIVEELLDFSN D    AD+GG FNN T C
Sbjct: 1   MEAPEYFHNNAYCSQFTSDKDAAAATTADHFIVEELLDFSNDDDSAIADSGGFFNNVT-C 60

Query: 61  FLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLD 120
           FLN N     SA+SSA T VES NSS SFS SE  SF +D+SAS+LAD  FSD++ IP +
Sbjct: 61  FLNGN-----SAESSAATAVESSNSS-SFSGSERTSFFDDVSASSLADVRFSDDIFIPYN 120

Query: 121 DLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHARN-AAE 180
           +L ELEWL++F EE FSSEDMQKLELI+GVKVK D P  S  PT  AVS  +H RN AA 
Sbjct: 121 ELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPPQSHHPTN-AVSALSHGRNAAAA 180

Query: 181 IFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAGPPQPVKKVPPK 240
           IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT+SSSE +I A   PP PVKKVPPK
Sbjct: 181 IFKPDIVAVPAKARSKRSRTIPSNWNNSRLLPLSPTSSSSELDIPATEPPPHPVKKVPPK 240

Query: 241 VAAT----VKKKD---CPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300
           VAAT    VKKK+     E G SAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Sbjct: 241 VAATATAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300

Query: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHHRHQD 360
           SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +A  Q+QQ +++DHHHHH HQ+
Sbjct: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKA--QEQQALMMDHHHHHHHQE 360

Query: 361 MMFDSSNGDDYLIHQHVGPDFRQLI 372
           MMFDSSNG+DYL+ Q+V  D+  LI
Sbjct: 361 MMFDSSNGEDYLMKQNVAHDYLHLI 375

BLAST of Lag0005038 vs. ExPASy TrEMBL
Match: A0A5A7VCX1 (GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003790 PE=3 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 1.7e-133
Identity = 275/389 (70.69%), Positives = 302/389 (77.63%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDA-----PAAVGDHFIVEELLDFSNHDADA---------- 60
           MEAPEYFQ N Y SQF++ + A      AA  +HFIVEELLDFSN++ DA          
Sbjct: 1   MEAPEYFQINAYSSQFSSPDHADASTTAAAAPEHFIVEELLDFSNNEDDAVFTDAGGGGG 60

Query: 61  -GGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADA 120
            GGLF N      N++  N+ SA+SSA+TV+ESCNSS        +SF EDIS SNL DA
Sbjct: 61  GGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSS--------SSFFEDISGSNLGDA 120

Query: 121 HFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG-PSHSRQPTTAAV 180
           HFS ELC+P DDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSD  P+ S QPT    
Sbjct: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTA--- 180

Query: 181 STTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAG 240
                 R AA IFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT   +E EI AP G
Sbjct: 181 -----TRTAAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPT---AEPEITAPIG 240

Query: 241 PPQPVKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACG 300
            P  +KK  PKVAAT KKKD P+ G S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACG
Sbjct: 241 QPYSIKKPLPKVAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACG 300

Query: 301 VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHH 360
           VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQ +LLDH    
Sbjct: 301 VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDH---- 360

Query: 361 RHQDMMFDSSNGDDYLIHQHVGPDFRQLI 372
             QDM+FD+SNGDDYLIHQHVGPDFRQ+I
Sbjct: 361 -RQDMIFDASNGDDYLIHQHVGPDFRQMI 365

BLAST of Lag0005038 vs. ExPASy TrEMBL
Match: A0A1S3BBN7 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103488171 PE=3 SV=1)

HSP 1 Score: 485.7 bits (1249), Expect = 1.7e-133
Identity = 275/389 (70.69%), Positives = 302/389 (77.63%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADNDA-----PAAVGDHFIVEELLDFSNHDADA---------- 60
           MEAPEYFQ N Y SQF++ + A      AA  +HFIVEELLDFSN++ DA          
Sbjct: 1   MEAPEYFQINAYSSQFSSPDHADASTTAAAAPEHFIVEELLDFSNNEDDAVFTDAGGGGG 60

Query: 61  -GGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDISASNLADA 120
            GGLF N      N++  N+ SA+SSA+TV+ESCNSS        +SF EDIS SNL DA
Sbjct: 61  GGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSS--------SSFFEDISGSNLGDA 120

Query: 121 HFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG-PSHSRQPTTAAV 180
           HFS ELC+P DDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSD  P+ S QPT    
Sbjct: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTA--- 180

Query: 181 STTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSESEI-APAG 240
                 R AA IFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT   +E EI AP G
Sbjct: 181 -----TRTAAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPT---AEPEITAPIG 240

Query: 241 PPQPVKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACG 300
            P  +KK  PKVAAT KKKD P+ G S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACG
Sbjct: 241 QPYSIKKPLPKVAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACG 300

Query: 301 VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVLLDHHHHH 360
           VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQ +LLDH    
Sbjct: 301 VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDH---- 360

Query: 361 RHQDMMFDSSNGDDYLIHQHVGPDFRQLI 372
             QDM+FD+SNGDDYLIHQHVGPDFRQ+I
Sbjct: 361 -RQDMIFDASNGDDYLIHQHVGPDFRQMI 365

BLAST of Lag0005038 vs. ExPASy TrEMBL
Match: A0A0A0LPR5 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_2G373450 PE=3 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 1.1e-132
Identity = 273/396 (68.94%), Positives = 299/396 (75.51%), Query Frame = 0

Query: 1   MEAPEYFQNNGYCSQFAADND-------APAAVGDHFIVEELLDFSNHDADA-------- 60
           MEAPEYFQ N Y SQF++ +D       A AA  DHFIVEELLDFSN++ DA        
Sbjct: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60

Query: 61  ---------GGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNSFLEDI 120
                    GGLF N      N++  N+ S +SSA+TV+ESCNSS        +SF EDI
Sbjct: 61  GGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSS--------SSFFEDI 120

Query: 121 SASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSD-GPSHS 180
           S SNL DAHFS ELC+P DDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSD  P+ S
Sbjct: 121 SGSNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQS 180

Query: 181 RQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTSSSE 240
            QPT          R+AA IFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLS  T+ SE
Sbjct: 181 PQPTA--------TRSAAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAESE 240

Query: 241 SEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQWRTGPMGPK 300
           +   P   P P+KK  PK AAT KKKD P+ G S+GEGRKCMHCATDKTPQWRTGPMGPK
Sbjct: 241 T-TPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPK 300

Query: 301 TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQQVL 360
           TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ Q +L
Sbjct: 301 TLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLL 360

Query: 361 LDHHHHHRHQDMMFDSSNGDDYLIHQHVGPDFRQLI 372
           LDH      QDM+FD+SNGDDYLIHQHVGPDFRQLI
Sbjct: 361 LDH-----RQDMIFDASNGDDYLIHQHVGPDFRQLI 374

BLAST of Lag0005038 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12 )

HSP 1 Score: 267.3 bits (682), Expect = 1.8e-71
Identity = 180/357 (50.42%), Positives = 213/357 (59.66%), Query Frame = 0

Query: 30  FIVEELL-DFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSE 89
           F V++LL DFSN D +                  N   ADS+  T +     S++FS ++
Sbjct: 14  FAVDDLLVDFSNDDDEE-----------------NDVVADSTTTTTI---TDSSNFSAAD 73

Query: 90  PNSFLEDISASNLADAHFSDELCIPLDDLA-ELEWLSNFVEESFSSEDMQKLELISGVKV 149
             SF  D+         FS +LCIP DDLA ELEWLSN V+ES S ED+ KLELISG K 
Sbjct: 74  LPSFHGDVQDG----TSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKS 133

Query: 150 KSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL-- 209
           + D  S +  P         +  +++ IF  D VSVPAKARSKRSRA   NW +  LL  
Sbjct: 134 RPDPKSDTGSP--------ENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKE 193

Query: 210 -----PLSPTT--SSSESEIAPAGPP---QPVKKVPPKVAATVKKKDCPEAGASAGEGRK 269
                P +  T  SS +    P  PP    P+ K         +KKD     +   E R+
Sbjct: 194 TFYDSPFTGETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERR 253

Query: 270 CMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVL 329
           C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+
Sbjct: 254 CLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVM 313

Query: 330 ELRRQKELLRAQQQQQQQVLLDHHHHHRHQDMMFD-SSNGDDYLIHQHVGPDFRQLI 372
           ELRRQKE+ RA  +        HHHH     M+FD SS+GDDYLIH +VGPDFRQLI
Sbjct: 314 ELRRQKEMSRAHHE------FIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of Lag0005038 vs. TAIR 10
Match: AT4G32890.1 (GATA transcription factor 9 )

HSP 1 Score: 230.3 bits (586), Expect = 2.5e-60
Identity = 157/348 (45.11%), Positives = 203/348 (58.33%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVS 87
           D F+V++LLDFSN D +              ++G+N+   DSS L+     +SSNS S+ 
Sbjct: 16  DSFVVDDLLDFSNDDGEV-------------DDGLNTL-PDSSTLSTGTLTDSSNSSSL- 75

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
                          D     +L IP DD+AELEWLSNFVEESF+ ED  KL L SG+K 
Sbjct: 76  -------------FTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLK- 135

Query: 148 KSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL 207
               P  +    T  +       +         V+VPAKARSKRSR+  S W  SRLL L
Sbjct: 136 ---NPQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTW-ASRLLSL 195

Query: 208 SPTTSSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGASAGEGRKCMHCATDKTPQW 267
           + +  ++        P +  ++V  +  A     DC E+G     GR+C+HCAT+KTPQW
Sbjct: 196 ADSDETN--------PKKKQRRVKEQDFAGDMDVDCGESGG----GRRCLHCATEKTPQW 255

Query: 268 RTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQ 327
           RTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+    
Sbjct: 256 RTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM---- 308

Query: 328 QQQQQQVLLDHHHHHRHQDMMFD-SSNGDDYLIH---QHVGPDFRQLI 372
              + + LL      R ++++ D  SNG+D+L+H    HV PDFR LI
Sbjct: 316 ---RDEHLLS---QLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of Lag0005038 vs. TAIR 10
Match: AT2G45050.1 (GATA transcription factor 2)

HSP 1 Score: 177.2 bits (448), Expect = 2.5e-44
Identity = 128/311 (41.16%), Positives = 164/311 (52.73%), Query Frame = 0

Query: 32  VEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVSEPNS 91
           +++LLDFSN D  +                  S+S  S+A T      SS+SF   +  S
Sbjct: 14  IDDLLDFSNEDIFSA-----------------SSSGGSTAAT------SSSSFPPPQNPS 73

Query: 92  FLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDG 151
           F      S+     F  ++C+P DD A LEWLS FV++SF                 +D 
Sbjct: 74  FHHHHLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDSF-----------------ADF 133

Query: 152 PSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTT 211
           P++    T  +V T T              S P K RSKRSRA          +PL    
Sbjct: 134 PANPLGGTMTSVKTET--------------SFPGKPRSKRSRAPAPFAGTWSPMPL---- 193

Query: 212 SSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGASAGEG-RKCMHCATDKTPQWRTG 271
            S   ++  A   +P K+          +     +  + G G R+C HCA++KTPQWRTG
Sbjct: 194 ESEHQQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTG 253

Query: 272 PMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ 331
           P+GPKTLCNACGVR+KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R    Q
Sbjct: 254 PLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMR----Q 262

Query: 332 QQQVLLDHHHH 342
            QQV L HHHH
Sbjct: 314 PQQVQLHHHHH 262

BLAST of Lag0005038 vs. TAIR 10
Match: AT3G60530.1 (GATA transcription factor 4)

HSP 1 Score: 161.8 bits (408), Expect = 1.1e-39
Identity = 111/253 (43.87%), Positives = 137/253 (54.15%), Query Frame = 0

Query: 70  SALTVVESCNSSNSFSVSEPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEE 129
           S+ + V S  +S++ S   P SF      S      F+ +LC+P DD A LEWLS FV++
Sbjct: 27  SSSSTVTSSAASSAASSENPFSFPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDD 86

Query: 130 SFSSEDMQKLELISGVKVKSDGPSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARS 189
           SF                 SD P++   P T  V             +P+I S   K RS
Sbjct: 87  SF-----------------SDFPAN---PLTMTV-------------RPEI-SFTGKPRS 146

Query: 190 KRSRAVPSNWNNSRLLPLSPTTSSSESEIAPAGPPQPVKKVPPKVAATVKKKDCPEAGAS 249
           +RSRA              P  S     +A    P    ++   VA    KK       +
Sbjct: 147 RRSRA--------------PAPS-----VAGTWAPMSESELCHSVAKPKPKKVYNAESVT 206

Query: 250 AGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 309
           A   R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+HSN
Sbjct: 207 ADGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSN 226

Query: 310 SHRKVLELRRQKE 323
           SHRKV+ELRRQKE
Sbjct: 267 SHRKVMELRRQKE 226

BLAST of Lag0005038 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5)

HSP 1 Score: 142.9 bits (359), Expect = 5.2e-34
Identity = 116/312 (37.18%), Positives = 148/312 (47.44%), Query Frame = 0

Query: 28  DHFIVEELLDFSNHDADAGGLFNNATACFLNNNGINSASADSSALTVVESCNSSNSFSVS 87
           D F V++LLD SN D             F +      A  +   ++  E  +  ++   S
Sbjct: 39  DDFSVDDLLDLSNDDV------------FADEETDLKAQHEMVRVSSEEPNDDGDALRRS 98

Query: 88  EPNSFLEDISASNLADAHFSDELCIPLDDLAELEWLSNFVEESFSSEDMQKLELISGVKV 147
              S  +D  +        + EL +P DDLA LEWLS+FVE+SF+      L   +G   
Sbjct: 99  SDFSGCDDFGSLP------TSELSLPADDLANLEWLSHFVEDSFTEYSGPNL---TGTPT 158

Query: 148 KSDG--PSHSRQPTTAAVSTTTHARNAAEIFKPDIVSVPAKARSKRSRAVPSNWNNSRLL 207
           +         + P TA    T         FK     VPAKARSKR+R     W+     
Sbjct: 159 EKPAWLTGDRKHPVTAVTEET--------CFKS---PVPAKARSKRNRNGLKVWSLGSSS 218

Query: 208 PLSPTTSSSESEIAPAGPPQP-------VKKVPPKVAATVKKKDCPEAGASAGEG----- 267
              P++S S S  + +GP  P       ++ V         KK    +  S   G     
Sbjct: 219 SSGPSSSGSTSS-SSSGPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQL 278

Query: 268 ---RKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNS 323
              RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN 
Sbjct: 279 QPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNH 317
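
The two statistics lines that head each HSP can be read mechanically, and the reported percentages can be rechecked from the residue counts. The following is a minimal Python sketch, not part of the original report; the function name `parse_hsp` and the returned dictionary keys are illustrative assumptions.

```python
import re

# Pattern for the "HSP n Score: ... bits (...), Expect = ..." line.
SCORE_RE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
# 'Posi?tives' accepts "Positives" as well as the "Postives" spelling
# found in some exports of this kind of report.
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP header (hypothetical helper) and recompute percentages."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("not an HSP statistics line pair")
    ident, length, pos, pos_len = (int(g) for g in i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        "aligned_length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * ident / length, 2),
        "positives_pct": round(100.0 * pos / pos_len, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 267.3 bits (682), Expect = 1.8e-71",
    "Identity = 180/357 (50.42%), Postives = 213/357 (59.66%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"], hsp["expect"])
```

For the AT5G25830.1 hit, the recomputed values agree with the reported 50.42% identity and 59.66% positives.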

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038886306.1 | 2.0e-144 | 76.62 | GATA transcription factor 12-like [Benincasa hispida]
XP_023002390.1 | 7.5e-136 | 71.50 | GATA transcription factor 12-like [Cucurbita maxima]
XP_022951637.1 | 3.7e-135 | 72.21 | GATA transcription factor 12-like [Cucurbita moschata]
KAG6585379.1 | 8.3e-135 | 70.53 | GATA transcription factor 12, partial [Cucurbita argyrosperma subsp. sororia] >K...
XP_023538437.1 | 4.1e-134 | 70.56 | GATA transcription factor 12-like [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity (%) | Description
P69781 | 2.6e-70 | 50.42 | GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1
O82632 | 3.5e-59 | 45.11 | GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1
O49741 | 3.5e-43 | 41.16 | GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1
O49743 | 1.5e-38 | 43.87 | GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1
Q9FH57 | 7.3e-33 | 37.18 | GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1
Match Name | E-value | Identity (%) | Description
A0A6J1KNT0 | 3.6e-136 | 71.50 | GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111496245 PE=3 SV=1
A0A6J1GI87 | 1.8e-135 | 72.21 | GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111454392 PE=3 SV=...
A0A5A7VCX1 | 1.7e-133 | 70.69 | GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffo...
A0A1S3BBN7 | 1.7e-133 | 70.69 | GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103488171 PE=3 SV=1
A0A0A0LPR5 | 1.1e-132 | 68.94 | GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_2G373450 PE=3 SV=1
Match Name | E-value | Identity (%) | Description
AT5G25830.1 | 1.8e-71 | 50.42 | GATA transcription factor 12
AT4G32890.1 | 2.5e-60 | 45.11 | GATA transcription factor 9
AT2G45050.1 | 2.5e-44 | 41.16 | GATA transcription factor 2
AT3G60530.1 | 1.1e-39 | 43.87 | GATA transcription factor 4
AT5G66320.1 | 5.2e-34 | 37.18 | GATA transcription factor 5
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 314..334
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 193..219
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 187..232
None | No IPR available | PANTHER | PTHR45658:SF43 | GATA TRANSCRIPTION FACTOR | coord: 1..371
None | No IPR available | PANTHER | PTHR45658 | GATA TRANSCRIPTION FACTOR | coord: 1..371
None | No IPR available | SUPERFAMILY | 57716 | Glucocorticoid receptor-like (DNA-binding domain) | coord: 252..314
IPR000679 | Zinc finger, GATA-type | SMART | SM00401 | GATA_3 | coord: 250..300; e-value: 7.5E-18; score: 75.3
IPR000679 | Zinc finger, GATA-type | PFAM | PF00320 | GATA | coord: 256..289; e-value: 1.9E-15; score: 56.1
IPR000679 | Zinc finger, GATA-type | PROSITE | PS00344 | GATA_ZN_FINGER_1 | coord: 256..281
IPR000679 | Zinc finger, GATA-type | PROSITE | PS50114 | GATA_ZN_FINGER_2 | coord: 250..286; score: 12.545015
IPR000679 | Zinc finger, GATA-type | CDD | cd00202 | ZnF_GATA | coord: 255..302; e-value: 3.75827E-15; score: 67.0126
IPR016679 | Transcription factor, GATA, plant | PIRSF | PIRSF016992 | Txn_fac_GATA_plant | coord: 14..347; e-value: 3.0E-73; score: 245.0
IPR013088 | Zinc finger, NHR/GATA-type | GENE3D | 3.30.50.10 |  | coord: 248..334; e-value: 6.7E-16; score: 59.7
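
The five IPR000679 (Zinc finger, GATA-type) hits come from different member databases and give slightly different boundaries. One simple way to summarize them, sketched here in Python as an illustration rather than as InterPro's own method, is to report the residues every source agrees on (the intersection) alongside the widest extent any source reports (the union). The helper name `core_and_span` is an assumption introduced for this example; the coordinates are copied from the table above.

```python
# (source, start, end) for the hits annotated with IPR000679.
GATA_HITS = [
    ("SMART SM00401", 250, 300),
    ("PFAM PF00320", 256, 289),
    ("PROSITE PS00344", 256, 281),
    ("PROSITE PS50114", 250, 286),
    ("CDD cd00202", 255, 302),
]


def core_and_span(hits):
    """Return ((core_start, core_end), (span_start, span_end)), 1-based inclusive.

    core: residues covered by every hit (intersection of the intervals)
    span: outermost boundaries reported by any hit (union extent)
    """
    starts = [start for _, start, _ in hits]
    ends = [end for _, _, end in hits]
    core = (max(starts), min(ends))
    if core[0] > core[1]:
        raise ValueError("hits share no common residues")
    span = (min(starts), max(ends))
    return core, span


core, span = core_and_span(GATA_HITS)
print("core:", core, "length", core[1] - core[0] + 1)
print("span:", span, "length", span[1] - span[0] + 1)
```

With these inputs the shared core is residues 256..281 and the widest extent is 250..302.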

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0005038.1 | Lag0005038.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0030154 | cell differentiation
biological_process | GO:0045893 | positive regulation of transcription, DNA-templated
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0008270 | zinc ion binding
molecular_function | GO:0003677 | DNA binding
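
For downstream use, the GO assignments are often easier to handle grouped by category. A minimal Python sketch follows, assuming the rows are available as whitespace-separated text (category, accession, term name); the function name `group_go_terms` is hypothetical and the rows are copied from the list above.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0030154 cell differentiation
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding
"""


def group_go_terms(text):
    """Group 'category accession name...' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # The term name may contain spaces, so split only on the first two gaps.
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_by_category = group_go_terms(GO_ROWS)
for category, terms in go_by_category.items():
    print(category, [accession for accession, _ in terms])
```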