Cla97C08G154610 (gene) Watermelon (97103) v2.5

Overview
NameCla97C08G154610
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionGATA transcription factor
LocationCla97Chr08: 22716499 .. 22736605 (-)
RNA-Seq ExpressionCla97C08G154610
SyntenyCla97C08G154610
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTCGGGGCATATTTCGTAAATTCCGCCACCATTTAGGATCCCCTTTCTTCCTTCAGTCCACAGTTCTTTGTCCTCAGTCGAATCCCCATTCTCTCGTTTCTCTAGACTTCCCACGAACACAGCGACGCCTACTGTAACTACAAACTTCCCGAAGTCTTCACATTCTCACAACCCTCTCTTATCTTCTCATCTTCCTCCTTTCTCTTTTGCTCGGCTTCGATTCTCATCTATATATTCTCTCAATTCTCTCTCTGCCCATTCCTTCTTTCCCCATTCTTGCAATGCTCCAAACCCATTTGATTTTGTTCTGCCAGAGTTAATGTACGCACGCGCTCAACCGTTGACCATGGGCGACCAGATCGCCTCTGTCCCCGACTGCGGCGACAGCGGCGAGCCCTTGGATAATCGTCTCGTTCGATACGGAGCTCACTCTCTCGAAGATGGTGGTGGAGTAGGCGGCATGATCGAGGATCTCAACCCTGACGCCGTTTATGCTTCCGCCGGTGATGGTTCGGACTTGGCTGTTCAGCGCAACGATGGTTCAAGCCAGCTCACACTTTCATTCCGTGGTCAGGTTTATCTTTTCGATGCAGTCTCTCCTGAGAAGGTATAGTGTTCATGAGATTTAGGGTTTTTGGGTTTTCAAATTTTATTTTCTTCTGGGTGTTGGTGAAGTTTTGTAATGGGACCGTGTTTATAAAACATGAAATTTGTTGGACTTGTGAGATTTTCAGAATGGAATACATATTTTTCCTCAACTTCCATGGCGGCCGCAAGGGGTGGAATTGTAAAATAATAACATTGCGTATCCTTTGTAGGTTCAAGCAGTATTGTTGTTGTTGGGTGGTTGCGAACTATCTTCTGGTCAGCAAAGCGTGGATTTGGTTAATCAAAATCAGAGGGTACTACTAGAAACTTGTATCACATTGTTTTCGGAGATATTTATGCTTTTCTTTTTAATTTATTCCCTTTCATGTTTTGTAGAATGCTGTGGACTTCCCTGGACGCAGTAGTCAACCCCAGAGGGCAGCCTCGTTAAATCGGTTTCGACAGAAAAGGAAGGAGCGATGTTATGATAAAAAAGTTAGATACAGTGTTCGTCAGGAGGTTGCTCTCAGGTCTGGCGATGAACATGTTAAGTTACTGGCTGGATTTTGTATTTTCCCTGTTTGCACATTCTTTGATTTCAGTTTTTCCTTCTCTTTGACATTTTCACTTAGATCTAACCCTCTGTTGTACATTAGCAGTTTTTTTTGGTTCGTGCATGTTGTGTATTCACCCAGCTTTTGCTTTGAAAGAACTCTGAGTTTTTGATCTCTTATTTAAATTGACAAAATAATTTGCAGGATGCAACGCAACAAAGGACAGTTTACTTCTTCCAAGAAGCCTGATGGTTCGTATAGTCATGGCAGTGTCTCGGAGTTAGGTCAAGATGAAAGTCCACCAGAAACTTCGTGAGTTTTCATGGTCTTATAAATTATAACAATCTGAAACAACTATTTTAACACTTAGGGGCTGTTTGGGGGCTTGTAATGGAATGGATTTGGATTGTAATGTAATCCAAAATCCATGTTTGGATTGACCATTTTGGGCCCGATTTGTAATATTAAACTCATTCCGTTCCGACAGTTTTTAACTAAACCCTCTTTCCTATCGTTTTCATGTTTCACAATCCCATTTTTGCTCCCCCCTCAATACTTCAAGCATTCCAACACTGTTTTGATTCTTAGTAATCTAATTACATTCCAGCCCCTCCCAAACCACCACTTAAATACTGTCTGTGGAATATGGCATTTCATTTCAGTATCAGAATTTGTATTCTGTTTTATGTTAGAACTCACATTTATATCTTATTTATATCCATGCAATGTTGTGCTTTCTCTAGATGCACAAATTGTGGCATAAGTTCGATGGCAACCCCAATGATGCGGCGAGGACCATCTGGGCCTAGATCACTTTGCAATGCCTGTGGACTCTTTTGGGCAAACCGGGTAAGTACGCTCAAAGTTGATAATTTTGTACTGCTACTTATATATTAAGGAAAGCTTCACTTTCCCTAGATGGTCAGGGTGATTGGATTTGCCAGACCTTTGCATGGAACCTATTGCATCTTATGATGTAAGATGTATCATTCTTATTGCTATTTTCTACTTGTGTGTGGACTTCTGTTTTTGAGTGTTTGTTTCCTTGGATGTTTTGAGTTATAGATTTGTCATTTTTATGGATCTCTATGGGCACAATGATTCTAGGCTCTAGCTGCACTTAGGACGATGGGAATAGAAGAAGTGATGATGCACAGTGTCTAGGTTACTTTGGGCCTTTTTGGTAGTAATTGGTCTTTAAGGAAAACTGTTGATGATGTTAACAATTTCTGTATCAACACCAGTTGCATTTTTAATTCTTATTCTTGCACAAATTTTCTGTTTCTCCATTACCTAACTAACCATCTTAGTAAATATCACTCTCTTCGCCCTTTCTTATTCCATTTTGCTGCTTGTTGATGCAGTGTAATTAGAAGTAGTTTTGTAGTTTTATTTGACATGTCATTTTATTTTTCCAGTTAGTTAGTTGGTATATTTGGATTATTAGGCTATAGCTTCTCTACAAGTAGAGCTGATGAACTTTTGATCATTAATAGAGGTTTAGTTAAAGTTTCTTGAGATATTTTCTCTTTCATTTCTTGGTTGCCTTACCTCCCTTGATTGTTGGTTCATTTTTCTTGTTCGATAATTATTTCTTCTCTCCACCCTTGCTCTTCTCTTGCTTAGTCGTGCTGCTCTCTTTTCTTCTTTCATTTTCTGTTCATCTCTTCCATGCAGTTTTGTTAGATGTTCCTTTTAAACTTGGTGGTTGAGTTATGGAAGTTATTTTCTTCCCTCGACCTTCCACCCCCTTGCATCCTAAATCTTGTGGAAGGATTCCCAACTAAATTTGATTAAATGATTGCACCCTCCAATCTAGCACTATCCCAATGAAATTTCCCAATAAAATCTCCCAATTCTTATCTTGATTTCCTTGCAACTTTCTGCATTGATTTTGAACTTGCATAAATAGCGACGAATTTTACTGAAGTTTATTGCTTCATCCTCCTCAACTCTCTAAAAAAGAGTTGAAATGAGGAGAAATCTTGAGAGGAAGCCCAAGGTAAAGCGAAGGTATCTTCTCAACATTGCACCAAAAATTTGTAGGGGTCCTCTATGTAAGTAAAGCTAAACAATATACTGAAATTAATGGACACACGGGAGGTACAGTGAAAGACATTGGGCAGTTTCTTACAAGTTACAATCGAACAACTTCTGCTTCTTCTGCAACACATTGTACTAGAGACGACTAGCGAGTTAGATATATCACTCCTCTCGAACACAGAAGCCTTCTATTTGACCATTTACCTTACTCTTATGGGCAATGAGTTTAAGTCTCTCTCACTCTCTCTCTTTTTTTTTGGGGGGGGGGGGTAAGAAATCGAACTTACATTGAAAAAAGGAGGAAAGAATACAAGTGTGTCCAAAAACAGCCACTCAACTAAAATTTCCCTTGACCACCACAACCATTTGGCCAAAAGAGGCTCATTGCGTATTCTTAAATTTCCAAAACCCAAACCTCCAAGGTCAACATGCTTTAAGACCACCTCCTGCTCAACTAGATGAGAACCTCCTCCTTCGTCAAGCCTTTCCCTCGAAAAACTCCTCATAAAATTCCTCATCAACATCTTAAGATTCATACCAATCAAGTTCATGATCCTAAACAATATTCTATGAGTTTCCTTGGAAACCAATTTTAGTAAAGGGTCTGGTAGTTGCCTCATAGATACTAGTCAAAATGTCAAATGTATGTAAGCTGACTTAGACACACATGTAGATTAACATAATTTGCCAATTATATGTAGATTAACATAACTCGCCAATAAGTCTATTAATGAGTCAACCACCATGGTAAAGAGAAAAGGGGCTGTATCTTTGATTGGGACCTTGAGAAGTATGAAGAACTTGTCAACATGCTTTATCATAGGTTTTTCCAAGGCTTATACAACTAGCTTTATTTTAACTTTCAAATCCTCCACAGCTTCCTGTCTTTGATTGGAGAGTTTAGTTTTAGGATCAAGAGAAGAGATTTGTGTCTTTGGAGCCCTAACCCTTCTGGAGCTTACTCCTATAGTTCCTTCTTCCATAGTTTGTTGGAACCTTCTCTTGATAAGGATTCTATTTACCAGTTCTTTGGAAGATTGAGATCTCAAAGAAGGTTAAGTTCTTTGTCTTGCAGGTTTCGCTCAGTCGGGTTAACACTTTGGGTTGGCTATCGAGAAAATGATCTTTGCTAATAGGCTTGTTTTGTTGTATCCTCTGACGGAAGGCGAAGGAAGACTTGGATTATATCCTTTGAAGTTGTGTATATGTGGTCCATGTGGAGTTTCTTTATGGAGGCATTTGGTTTGCAATTTGTAGTCATAAAGATTTACTTTACAAGGTATCCCTCCATTCGCCTTTTATGTGGATAAGGGTTGTTTCTTGTGGCTTGTTGGGGGTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTGGAATTTGTGGGGTGAGAGGAACAATGAAGTGGTTATGGTTTGGTTGGAAGGGATCCAGAGATGTTTGGTTTCTTATTAGATTCCGTGTTTCTTTTTGGGCTTTGGTTGAAAATGAATCCACAATTTAGCTGCATAAAATTTGAGGATATAGAATAATAGATAACCCCTAAACGTGTTTAGGAGCTCATTATAGGCACTGTGCAAAAGGTTGTTTGTTGTAATATTAAAGAACAAACACTCGTGTCTGACTTGAGTGCCAACTTTGGGCTTTGGTGATAAAGACATTTTGTAATTTTTCGACCAATCCTATAAAAATAGGATTGGAGGTCATTCCTTTAGTAGTCACATTCTTTTTCCCTTTAGTAGCCTCATATATATATTGCATCTAAGTTCAATTGGCTTTTTTCGATCATGCGAGGGAGCTATTTCTTTAATCTCTCAGGGTGCCTTTGGTGCGCAGAGTTGAGTTGAGTTGGTAATAATTGTCTCTGTTTGGAGTATAAAGTTGAGCTAAGTTGGTAATAATTGTCTATGTTTGGGAGAGTTGAGTTGTATTGAGTTATGAATATTTGGGGTAGTTTTTTGTAACCTAAAAAAACATGCATTTTTTATGAGTTAATTTTGTTGTTGTTATTATTATTATCTTGTTTTTGTTTTTGATTTTTTTTTTGCTATACTTTTTGTTCCACCATTCATCTACACATATTTGCGAAATTCTAAACACTCATTTTAGATGTGAACTCAATAAATAAAAATATAGTTTACAAGTTTCTACCTATAAATATTGCACTTAATGCTGACAAAATACTTTAATTTATATAATCAATCACCCTACAACATACTTTAACGGTCGTGGAACGTATGAGTTGAACATACATAAAAGATAGGGTTGCCATTAGGGTATATTTACATGTTTACAAAAAAAAAAAAACATGGCTAACAATTAGTCGATTTCATGCCTACAAAAAATATTGTTCATTAAGATAGTCAAGCTACTTGGTCAAACCAAGTAGACGCAAACAATACTTCTTCCCAGGCTGTGGTGAGACTATTAAGAAGCAGCAAATCTTTTGGATGCCTGTGACAGGTAGGTATATAAGGATGACTCGATAATCCTCGGATAATCCCTCAATATCAAATTACTGTAAAAGATGACTATTTAATGCTATAAACTAATAATTTCAAATTTATATTTAAAAATACTTTCGCCAGCCAAGATTCCCCCCTACTTTTTCTTTCTTTCTTTTCCCCTCTTCTAGTATATGATTGGTCAAGTTCGTCGATAGAGTTCTTCGTTGGAGATGAACGACAAAGTTGATGTTTGGAATTGGTCGTTTGAGGTGATCATCGAAGTTGGCCGCCAGAAGTGGTGGTCGTCGAAGTTGATCATTGGAGGTGGTTTTCTTCGGAGTTGGTCACCTAAGGTGAATGCCAATGTTGGTCGTTAGAGGTAGTTGCACCAGAGTTGATCATCGAAGGAGGTCTTCGCTAGAGTTGTTGTCGAAGGTGGTTGCTAGAGCCCAGAGTTGGTCGTTGAAGTTGATTGCGCGAAGGTTGTTGTCGGAGTATATCACTGGAGTGCGGCAACGGTAGTCGTCGATAGTAACCAACGGGTGAAGCCATTGAGAACATTGAAAGGAGGGAGAATTGTGGTGAGTTGGTAAAAATAACCAACTCAACTCCACCATCATTGCCAAACATTTAGTTGGTAATTTTGGTTTGGTTCTACGTGTCCTTGTGGGTTTCTGTGACCAAGCCTTTTTGTAATTTTCTGTTAGGTCTTATTTTGTTCGATAGGAGCATGCAGCCTGTTTTTGTAGTTTTTCTCCTTTCACTTTTCGTGCACTCAATGTTTGTATGATATTGTATTCTTTCCTATTTTCTCAATAAGGTTTGGTTATTCATAAAAAGGAAAAAGATTCCAATATTCCTCTCCCTCAAAGACCCCTGTAAAATAGCACACCCCACCTGCCCAAAAACAATCTCCCACAGAAAGGAAACATAATTTTTATGGTTGCATTTTGCAACGATCCCTTTCTCAGCAACTTTTTTTTTTTTTCCCCCCTCAAGTTGAGAGAACCTTCTCAACAAAATCTTAGAACGACTTAAGCACATTCATGATGTTCCAGTCTTTTTTTGAGAATTTTCAACCAAAGCCCATTTTCAGGCTATAGTATTGATGTCTTTATCACAGACCATATCTTTGTTCAAAGGGCCTCTCTGTACCAATTTTTCCAGGGTTCACCCAATTGGTTTCTCTCTTCAATATATAAAAGAGAACTGTCAGTGTGTTTTGTAATGGATTATTTAGAGTTAAAACCAATTCAAACTGAGTATTTTTGGTGGAGAAACGACAATGGTTCCCTTAACTACTGCTGATTGATTATGGTAGACATAAGATTTGGAAATGTGTGCTGTGGTGGGTTCAGTTGAAAGTAGTCTATGGTCAGGGAACTGAACCGGAATCCCGGAATCACCCTTGGGTCGGATAAACCTGTTCTCTTGCTCCTGTATACTTGACATAGCATGCAGATGTGCACAGACTGTGTGCGGTGACATGCATATCTGCTAATATGTCATTATGATTATTGCTATTATGTTGTGGTCTTTTAAAAATACCCTAATACATACTCATGTACAAATTGATATATGCATCTTGAGGCTATATTATATTCTATACATGATTATGCATGTATAGTGTCAAATTGTTACTTCAATGTTACAGTCATTGTGAGATCGTTTATATTTGATTATGATTCTTTTATTTGCATTTACAAGGTGAGCAGGGGCTGATCCAAACCCTCTTAACTTGTGGTAGATTTCAAAATTCATATATAATAGTTGATATTTTTTTTGAAAAGGAAACAAGACTTCATTCACTAAGAAAATGAAATGAGTCTAATGCTCAAATTACGATAATAACCAAGAAAGGAGAATGCCAATGAAATCAAATTAAATTAAATTACTAATGAGGCCAATACTTAAACTAACAATTGATAAAACAAGAGTTTCCAAAATGCCAAAATTCCAGCCAAGTAAGTTGCTTGTATCTTGAGGTCTAGAGAGGTTTGAATGCAAAAATTCGAACCATAATAAAACCAACTTCTTCTTCACAAACAACCGCCTAGCTAATTACCAACAACAATCGTAGGAAGATACTTCTCCAAGAGTTTGACCGATAAATCATTCATTCTGACCTATAGAGTTTCTTGAACAAAGACCACGATCGAAGTTCACCACCAGAGCCTTTAAGAGAGCAACTTATTTTTAACTTATGAGAATACTTGGAAATTAAAATTTCAACAGTTGGAGAATGGTGAGAAAACTATACAAGTACCCGGCCACATTCTTTAATTAGAGAGGAAAATTTTGAAGGAATTTGAGAAGGGGATAATGGAGTAAAAAGCTCACTAGGATTCTGATCAAGCTTTTTAAGAGCCGGACTTTGAAAGAGTGCAACAAGTTCTGTCCCTTGGACTTCTTCCTTGACAAGAATGGATTTGAATGCTAAGATTTCTGTATCGTCGTCACTGCTTATGCTAACTGAAGAGTCATCTTCTTAAAAAACTTGAGACTTAGGAGTTCTTGGAGAAGGAAGCCTTGGTAATTCTTTAACGAAAGTTACCGATGAATCCGGGAATGGAAGAATAGACCTTGACATAGCAATAGAGGAAAAGGAAGTTGATGAGCTGGAGCTGGAATATTTTCCAGCCAATGAAGGTAGTTGCAAACGAGTACATCTTTCTTCTAGGACGTCCTATAATAGTTGATATTTTATAGTTTTCCCTTTAATTTGTCCTTAAATTCTCAAGAACATAAAGATTATTTACTAAAAGAATCAGCATGTTAAACTTTACCCTATCATAAGAAAATTTCTAGCTATTCCCCTGGGGAGTCAAGTTCATTTAGGGGGCGTTTGGAGTGCTGAGTTGATTATTATAGTCTGTAGGTTATAATAGTTAGTATTTGGGGGTGCAAACTATTTTAGTATGGGTTATAGCAGTTTGTGTTTGGGGTATACACTATTTTTGTTTGGGTAGGAAATAGTAAACACTATAGCTAAGAAGAAAAGAGAGGATGAGTGAAAAATAACAGATACTATAGCAAACAGTAAACATTGTAACAAGAAAGGTTTTGAAATAGTATTTACTATACCATTTCTACAATGGAAAATGGATATTGTGGATGCATTTTATGAGAGCATTTCTTTGGCAATCATGGAAGGAAAGAAACAACTGACTCTACAATGACAAAGAGAGAACATATGACACTTTTTTTGAGAACATCTTCCTTACTATGTCATGGAGCAAATGTACCTCCTTTTACAATTCTTAGTCTCTCATCTGTTTTGACCAATTGGAGAAATCTTATGCAACTCCAATGGTTGGGCCATTTTGTTGTCTTTTGTAAATTTCATACATCAATGAAATTCTCTTATCTAAAAAATTTAGTATTTACTATACATACTTAATTGTAGGTTATAAATAGTAATATACACCACTCCACCAACTTCTAGTTCGTGTCCCAAACATGAAGTAGGCTATAATAACTCACTCCACCAACTTCTAGTTGGTGTCCTAGACGGCCCTTTAGATTTCTATTTTATCGAGTTTGGACTTTTTGATACCTTGATGATCCATGTTACTTTTACAATTGGCTTCAAATCCTATTATGGTTGTTACAGTTTCTTTACAATTGGCTTCAAATCCTATTATGGTTGTTACAGTTTCTTCAGGTTATGTCGTTGTGTATACTGTGACATTGTGAATGCATGCTTGACCTCTTTTCTCGTTTATTTGTCAATTTCCTTTGCTTTCACAGTTTGAGCTAAGACATCGCAACTGCCAATAATTATTATTCAACATGCTGTTAAATTATCAAAATTTCAATTGTTTCGTTCATGCAACATTGCAGGGAACTTTAAGGGATCTTTCAAAGAGAAGTCAAGACCATCCGGTGACCCCAGCCGAGCAGGTATGTGCCTATTATGAAATTTTTAAAACTCATTCTGGGTAGTGTCATGTATTCCATGTTCCAACTCAATTAATTTCTTCATTTTAGTGTGAAAGTGATGGTGGCAAGGACTTAGACTGCAGACATGGTAACCATGCGCCTAGCAATCTGGTTTCTTTCTCAAACGGTGATAACTCTGCTTTAATGGCTGAGCATTAGAATTACGTTTGTCAATACAGATAAAAAAAAGGCCATTATGGCTGTTTCTGGCTTCATTCAATCAGGCCCTCGTTTTGGTAATTTTACAACGTGGGTTGTAGAAGAATGTACAGAATATAGCATTGCCTTCTATTAAATACGAGAAAGAGATCGTCTCCTCATTACTACTAATTGCCCTTCTTTTTTCCTCACAATCACGCCAACTTCTTCGTGTGAGTGGAAATTTGAACTTCCGACCTCAAAGGAAGAAATATATGTCATGTCAATTAACACTGAGTTATGCTCATGTCGGCAATGACATTGTGACTTTCTTCATTATGTTTTGTTTTGGTGTACGTACTAATTTATTTTCAAGGTTTAAGGTGAAAACAAGGAGATAGATAATAATTTGATGTCAAATTAGATAATAATTTTCTTGCTGAATTTCAGCATGAAATCAGTTCTGCTTTTAATAATTAAATTTTTCTTAAGTTTAAGATTATATGTGGTAATGTGGCTGAGAGTGGACCAACTAGTTAATATATACTCAATTTGGTCAATTGGAGGATGCTTGAACATAGCATGATTTTATTGGATTGTATTATTTTCAAGCAATCACTCAATAATGAAAATAAAAAAGTTTGAGGAGGTACTATATGTATCCCTCCTGACTCAGTTCTCACCATTGCAGTTCTGTTTTTGTCCTTCAAAGAAAATGTAACGAAGGGTTGGAAGTAAAACAAAATTTGGGTTACCGACAGCAAAAAATTATCTGCAATTGTACAAATTTACGGTTTTTTTTTATTTTTTTATTTTTATTTATTTATTTATTTATTTTTACATTATTATTTTTAGACAAGTGGACAATTTACGATTGGTTTGGAGCAATCGTCGAGGACTCTACATTTGTATGTACCATAAATTACCTTTAAATAATAGCCTTGGAATAATCGATCATACAAATTAATTCTAGCATATGAATGTATTTACATATTTTCAAGCTTTTGAAAAATAAAAATGAAGTTTTAGGGTTCTAAGAACACGTCATGCTTTCTTGCTTTCATTCTTCTTTTAGCAAGTAAAAATTATAGTGGCCAAATTATTACAGGTAAGTTAGAATTGCATGCTTAGAAGGGTCAACTTGATGATTACAGCTATGGCTTGTCTTAAATTTCACTAAAGTTGTAGCTGGACAAGGATGTGATGTGTTTTAGGGTGGTACAATATGTAGCAGCGGTATTAATTGTATATTTCTTGCACCTAGTACATATTGTACAATATCTTATCACTACTAATTGAGTAACATTTGACCTTATTTGGGTATTATAGACCAACAATTTTAATGTCATCTAATCCTGACATGTACACTCACTAAAACTCGAGATCTTTGCCTCTCATTAGGGTAATGCTAGATTCATCTATGGAAGATCCCTCCATTCTCATACTGATTTGACTGACTACTAAACAATCGATCTTACTTGATATCACATTGGTCATGCTATGGCCTCTGTGGCCCCTGAAGACTTGCATCAAATGATTCTCTAATTTCAATTAGCCAAGCTATCAAAGAAGGGGACTCATGTTGGTTTTCTTCTAAAACAACATATATTAGTAGTGCTTGACCAACAATTTACCAGAACAAATATCACACTGTCAAACATGTTAAACTAATTTTTAGATCTAACCAAATTGTTTTAGTTATAAACTTCTTCCATCTAAATAACCCTAATTAAAAAAACAAACCCAACCCAACAATATATTTGGGTTGGATTGGTTTGGGTCATTTATTTAAATTTTGATTCTAAAAAGAAGTAAAATGTTAATATGTAACAATTTTTTCTTTTTTTTTCTTTTTCTTTTTTTTTTGGATGATTTTTAGATATTGAATTAAGATTAGCAACTCAATTTGAAGTTATATTATGAAATATTCTTTCCAAAGTACAATTTTTTTAGAGTTGGTTGGAGAATAAACAATCCAAAAAATCATGAAGTTAAATAAAATTAAAATAAATATATGTGTATATGATTGTATCGGAAGTAAAAAATATAGGATTGTTTTTAAATATAGCAAAATAAGCCAAAATATTTACAAATATAGAAAAATATCATTGTCTATTATTATCATCTATCATTATCTATTAGTGATAGACACTGGTAGATGTTTATTGGTGTTTATCATACTCTATCGTTGATAAATAATGACATTTTACTATGTTTAAAAATATTTTCAGCAGTTTTTTCATATAAAATAATTACCAAAATATATATATATTTTAAAATTAATGATAAGTTTAAGTTAATCTAAATTATTTTAGGGAAACCCACAAACTAATTTGACACATAAAATTTTCATTCATTTGAACCTAACCCAAATGAGTTGATAATATAACCCAAACCTATGGGTTGAGTCGAGTTAATCGAGTCCTTTGAGTCATCGAATATTTTGAACACTCTTAATTGGTAGTGGTTTATTATTATTTTTGTAATAATAGTAATATTATTATTTTTAGATAGACAAGAATAGAAAAAGGGGCCTAAAAGCGCCCTTTGAATATGAAAGCGAAAATCTCCAAAGACAAAAAAAGGGTAGGAGGCTGGGGGGAGGGCAAATTCACGAGGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTATTATTATTTTTTTAAAAGTAAATTAAAAAAAGAAATACTTAAAAGTGAAAATCGCAAAATGTAGTCAACGATAACCAAACACAGAAAAGTCGTTATTTATCATAGAACAGGGCCCACTTGTATACCCTGAATAATTTCAGAAACATTATCCAATGACTTCAGTAGTTGTTACCAGCCGTTGGATCTGATTATAATACTAGAAAATTAAGTTATAAACTTAAAAAATCTCCGTCGCCGTCATTATTTTTCGTTATTTATTATTATTATTTTTTAAAAAAGAAAAACAAATAAAAGACTGAAAATTTGAAAATTACCCGTTTGAGTCTCTCCAAGGAAACACAAGGGCACATCTTGCAAATCCTTTAACCATTTCTGGTAGCCTTTTTAAAATTTTCATTTTATTTTTATTTTTCTATCTTCTTGGTTTCTGAAACTTTGTTCGAGCTCGAATTTTCGATTGAGTATGGATTTTCGGAATTTCATGGAATTATTCATCTTTGCCCCATGAAGCTGGTTGACGGTTAGGGCTTGGTTTCTGGTGCAGTTAGGGCTTTCGAAAAGTTGATGGCGGCTGCAAATCCTCAGCCTTTGCAAGCGCGTCCCTTCCAGGAACATGTCCAAGTTTCCACAATGATGGGGGATGATGACGGTGAATATGAAGATGGTGGTGGTGGTGGTGGTGGCGGCGGCGGCGGCGGCGGCGGCGATGTTATGGATGATGTCGAAGAAGCTCATATGACTTCAGTGAGCGTAGCGAATCATGGGGGGTTGGTCATGGCGTCTAGAACTAGTGAACTTACGCTCTCTTTTGAGGGCGAGGTTTATGTGTTCCCCGCAGTTACTCCCGAAAAGGTACCTATTCGCCTGTCGTCTGGCATTTCTGTCATAAATATGCTTCTATTCTCGTAGTTATGTTAATTTGAACAAGCTGGAGTTGTTGAGACTTTTCATGAACCTGCACTCGTGTCTTTATTTTACTTTGAGTGCCTGAAATTATTCAAAACAGGCTATAGTAGTTAAAAAGGAATCTCAATCTTAATCTTCTGGAGGATTGTGTTGGAATGTAATGCTTTTGATTTATATTCTTTTTTGCTGAGAGAGAAGGAGAGACAGAGAGACAGACAGAGAGAGGCAGAGACTTTGGGTAGACATCCATTAATGTTTTTGTTCAAATTGTCTGACTAGTTAGAATATTCGTACTTTTTTCAATGGGTATATTACATTCATGTGCAGATCTCAGAATTTAGTTTCTCTCTCTGTTTTCCTGGTTATGCCATGTTTTTTCTATCATTGTGTTGTAGCATGAAGTTCTGTTGTCAGACGCTTTATCCACAGAGCTTTATGCTGTGGCAATTAGCTATTTTTCATAACAATTCCCAATTTCTTGGGGCAACAGCTCTTGTGTTTCATTGGGTGGGAACATGCACATCTGGTGACCACCCACAGAATTGTGGGCTGCCTTCATGATCACATGGTCAACACCTATTACACCTGGGATTGCGATAACTACCATACCATTGTCAATTAGTCAACCGTGTCTATTTCCTTTCATTGGAATTTGAGAAGATTGGTTTTGCTCTTTATGTCTATTTGAGTAAGTTTGATTTCCCGGATTTTTTTTGAAATAGCCTGTCAATCCTTTCTTTTAACTAAGCACCACAATAATGGGGTGTGTTGGCAGTACTTCCTTATTGGGACTTGTCTTATGCATGTATCTATCTCTCTTTGCCTGACTGTTGTGTTATAGCAGCTGATCTGTTCATTTTTTTAAAGAACTATTTGTGTTCCATCTTAAAGAAACAATTTGGTTAAGATAAGGAATGCTTGCCTTTCTTTGAGCAGGGATTATTAGTTTTGGAATCCAGTATACTCTGGACTCTGGAGTCTGTTTTCTTTGGAAGAATTACCGGGTTTTAATATGACAGGTACAAGCCGTGCTCTTGCTTCTTGGAGGGCGTGATGTGCCAACCGGTGTACCTACAATGGAAGTACCATACGATCATAATAACAGGGTAATGCAAATCATGGAAACTTTACTCCATATTATGTGGCTTTGTTCATATATTAGATCTTGCCTACCATCTTCAGGGTATGGTTGACACACCAAAGCGCTCCAACTTATCACGGAGAATAGCCTCCCTGGTTAGGTTTCGTGAAAAACGGAAAGAGAGATGTTTTGACAAGAAAATTAGGTACACTGTACGGAAAGAGGTTGCACAGAGGTAAGATGACTTAGGCTACTATTATGACATGGAAGTAAAAGTTCAGTGTGAATATTTTACTTTGCGCATTAGCATTAGCATCTGTGGTTGTTTCAATCCTCTTCTTCTCTCACTGTTTGGTACAAGATGCTTATGGATACAGTTTACAAAATTCTAGGTCTCAACTGCAGGATGATTTTTAGCTCATATTATTAAGGCAGCTGATTTTGCCTCTAAATATTAATAGCTAATACGCTATCTTTGTTTCTATTCTGCTGAATGATCATAAGTTTGAGGCCCTTGGATTTTGTGCCGCTTCTGTTAGAATTTTCTCATTTGAACAAAACTTGATAATTATAAAGTAATTTTTACGTAGCAGACATTTATGTACTCTAATAAAAAGCAAGAACTATGTTGATAACTTCTTATGATAATTATAAAGTAATTTTTACGCAGCAGACATTTATGTACTCTAATAAAAAGCAAGAGCTATATCGATAACTTCTTATTCCCCTCTCTATCTCTCTCTGTCTTTTTTTGTTTTTATTTTTTTTGAGAAACTAATATTACTCTGATCTCTTCGGAAAGTTAAAAGTACATATCACTTCTTAATGGAAACAGTTTTAATAGTATGGACGTCTAATTATTATTTTCTTTTTTTCATTGTTGTTTCAGGATGCACCGCAAGAATGGCCAATTTGCATCCTTAAAAGAAAGTTCAGGTGCTTCAAGTTGGGAGTCAGCACATAGTTGCCTCCAAGATGGTACTCGTTCAGAAACTGTGTGAGTTCACAGTTTTAGTCTGTTGTCCTTTAGTATTTGAAACTACTTTTCAACAATATATTGTTTGGATACATATATATTTTTTTTATTATTGCTGTTTTATACCTCTATCAATATTTCTTTGTTTTAAAATATTTTTCAGTTTGCGGAAGTGTCAACATTGTGGTGTTAGTGAGAACAATACACCTGCAATGCGTCGTGGGCCTGCTGGACCAAGGACTTTATGTAATGCATGTGGCTTGATGTGGGCGAACAAGGTTTTATGCTAAGAATTATCGATATAAATTTTTAACTTAAGATGTCCTCGTACATCTTGAATACCTTTTCCTATTATTATTATTTTTATTATTATGTCAACAGAATAAGATATTAAAAATGGAAGAAACTATGTGCCAAGAAAGTTACAAATGTCTTTTAATTAGTGCAGTAGAGATATTGCTGTAATTACCAAAACGCTTAAAGTGTTTACACCAAGAATCAATGACGTTATATAGATTCTTCAAATCTGATCTACAGTGAGGTTTCACTGTTAAGAGACACTGTAAAGTCTTGGCAAGGCCACCGGGAAAGGAGGAATTAACCTTCCAAAAATGTTCCTGGGATTTGACATTAGATCCATTTTCTATACCGAATGAAACATGATCATTCATAAAATTTTGAAGCTTATCATTTTGTTTTTCACTTTGCTTCAACGAGAAACCTCTTCCTGAAGAGCAGCCTCCCTAAATTAAGCATTAGTCTGTTTTTCATTACTTTAATGAAATTTAATAAGATTCATGGAGAGCAGGTATCAGGGTTATGTGCAATAATATTTTATGGATTTAAATTCTTAGTATATTCTATTCTTCATGCAGTATTTGTTGGTAATTTCTTCTGCAACATTTTCTTTAGCATTTTGCCTTTCTTTCACTGGCTTATTGGTTGTACAGGGCACGTTGAGAGACCTCAGCAAGGGAGGGAGGAATGTGTCTCTGGATCATATGGAACCTGTATGTTAAAGAAAGACTGCTTTTCATTTTGAAATTGTTTGTCTCATTGTTTAAGGCTTACTATACTCTGATATTCATCAAACTTTCCTTGTTAGAACGCATTTTCTGGTGGGTACTGGCTTAGGATGCCACTTTTTCATTTTTTAAGTAGAAGTCTTAGATGGATGAGGGAAAAAAGAAAGTACAACTAAGATGGGCAATGCCTTTTTTCTAGATAAGAGGCCCATGCATTGAGAGACAGATGGCACACAACTGTGGTGGACTGGATGACCATCCCTTACTGAAGAGGATTAGGAAATAACAAAAAATTCTCTATTCAGCCATATTAGGGAAAATACAAGAAAAGGAAAAGAAAAAGGCAAATGGGAAATCTCTTTGGTGGTGCCACCTTGAGAGACTTGATTTTTCTTCTAACAATGGCCATATTCCAACTGAAGTAACGCGTCATTCGTGCAAGAGGGAAATATTACATTTCCTATTCATATCTACTAAGAAAGCAATGAATGAATTTTTTGTACTTCCACACATAATATATGGTAGATTCCTTGGCGTATAATCTTATGCTTTTATGTGTTTTGAGTATTGACCTATCTGTTGGTTTAGGGTATTGGCATCATGCTATTCTTAGTGTATGTATGCCAACTTTTCCTTGAGAACCAAATTTTCTCTTTTAATATGATCTCCTCTCTACTAATAAATTATTTTTATTATGAAATATAACTATCCTTGTTAAATAGAGTAATAGGCATATTGTATCCCTTTTTCTATGCTATGATAACACAATTTAAATGTGTTCAAGCTCTTACTTTCCTGAATGGCTGACATTCTATTGTTCTGATCCTTTGGCCTCTAATTGTAGACTGCTTTTCAAAATTATATTCCTATAGCTTCTGTTCATCTCCAGTACCTTCTCACAGTAAATATTTCCTAAAAGCTGAATACGAAACATACTAGGTTTTTTGATAGGCTGTTTCTGGCTATTGTGTGAAATGTGAATGTGTTCACTGCATGATCTTCTACTCCTGCATCAATTTGTTAGTGACTGAAGTCTTGCAATTTTTTTTTCTGTTCCACATTGATTTATTTCAGTGTAGATTGAGGTTGCGGAGTTATGCATGTCTGTCAATTTGTGTGTGCACACTTTTCAGTTATTTATCAATGATACATTATTTCCCATTAAAAAGATCAATGATATCTATCTATATTCATCTCGTGTTTTCTTTGCATGCCTAGTTTGTCTTCTTTCCCAATTATCATAATTACAGTACTTATGATTCATGGAAAGCTTATATGCTTTTATTCATGTTGATGTATTGACATACATTGTTTTTTCCAATGACTTCAATGGCTTTGCAGGAAACCCCAATGGATGTCAAGCCCACAATCATGGAGGGAGAATTTTCGGGCATCCAGGATGAACATGTACTACTTCCCTTTTCCCCCATTTAACTAATTTCCAGTTCGCTAGCTTGTGTTATCTGGAAGCCAATGCATACAATATACATGCAGGGAACTCCTGAGGATCCTAGTAAAACCATGACTGAAGGCTCCAGTAATCCTTCCATTGACCCGGATGAGGAAGTATGCCCATATCTTAGAGGATTTTAAGCCTCCTATTTTTATATGACTTTTTTTTACCTGCAGTTGAAAGTTTTTCTGCTGGATCTTTATGTACCCTCGTTTTCTTTTACCTTGGAGTAGGTAACTCCGCCCATTTGCTAAGATATCCTAAATCACCAAACCCCCGTTATGAAAACCAGCACTTTCACCCAGTTAGTGTTGACAAATTGGTTCATACCAGATTGGTGCTCTAATAGCTGTTCCTAGACTTTTGTCAATAAAGGTGCTTGGCGGATGTATTCCTCCTCACTTGAAAAGGCTTATCCTAGATTTGCAGTTCGTTGGTTTATCTTAGTCACCCTCTTGCATAATTTCATAAGTTAGTTTGCTCTTGAATTGAACGCTTAATCAATAATCGAGGATGCTGGGGTATATGCTTTGTTTGTGACTTATGGGGAGAAATAAATAATAAAGTGTTTAGAGGAGTGGAGAGAGACCCTAGTGATGTTAGTCTGTGAAGTTTCATTTGCCCCTTTAGACCTTTTGTAACTACACTTTAGGCAATATTTTACTTTGTTGGCCCCTTTTCTTTAATGGGGGTTCTGTGGGCTTGAGTTTTTTGTACGCCCTTGTATTCTTTTATTTTTTTTTCAAAAATAAATGGTGGATGTTTGATTGACGAGAATCTTGTTAACCTGCACATATGTTGTGTTGCATCATTCTGCACATTGCTATTCTTGATGCCTGGCAGTATCAGTGGTTACCCCCAGAGTTCAGTTTTTGTAGTGTTTTTGAAGGATTCAAAGTTGAAAACAAAACAAATCCTTGAGCACTTTGTTTCGAACTAGACAAGTAACTTAACGTTCCTTCTGGTGCAGGACATAAATGAAACCACTGGAGACCTTACAAATTCGTTGCGCATGCGAACTGTTAATCATTCAACAAATGACGATGAGCAGGTATTGCTTATTTTTATTTCAGAAAAATAAATAAAATGAGTTATCTTTTATTTTAAATAGTCAGATATTCACCCAAATTTAAATACCATTGCTGCTCCCCTTCCCTTTCATTTCATAGGACAACGTCACTTGAGGCCTCTGACTAGTAACCTTGTCTTCTCGTTAGGAAAACCTCCATGACAGAAATATGAGAATCAAATAAAGTAAGCCCCAAGGCGCTCAGCTGACCTTTACATAAGATGTTAAAAATTCTTGCAAGCCTCATATTAATTGAATTGAAACTTTGTGAAAAGTTGTTGGTACAAGTTTAGCCTAAGCTATTAGTTATTACTATTCTTATTTGTTTTTCTCCAATGATTCTCCAAGGAGACTGCTTGTATAACTATCTGATGGCGGCTGAGCCATTTGGCAACCTTAACTTTTTTAGATTTGGCGCTGTAGATATAGACGAAGGACCTTGATTGCGTTAACATGTGGCTTCATAGAGAAACCTTTGCTAATATTTCCTTCCCTTTTGCAACTAGTAAAATCACTAAGTGGTATGGATTTGCTGCATTTTTTCAGTTGATATTCAGTTACTTATACTAGTCTTTATGTTCAGGAACCTCTTGTCGAACTTGCTAATCCTTCGGATACCGATATAGACATCCCCACTAACTTTGATTAGAAGGTAATCGTTTGCTTAGGCAATGGGACTCGGAGATTTCAAATAATACTTTGGCTGTTAATAATTTCTTGTGCCCACATAATTCCTAGATGTCAAAAGTTACTATCAACAGTGGAGTGGCAGTTAGGACTTGTATAATTACAAAAAAATGTCCAGTGACATAAGGTTCAAGTGTGCAGTGGTCATCTTTTGGCCGGGACGTTGCACGCTGGTCTCAGCTGACGTTAGATCTTCTAATGATGTTGATATTGCTGCCAGAAACTGTTTGTATTCAGTGTACGAGTTGAGTTAGGGGATACAGTTGAGTTCTTCCATCAGTGTATGATCTGAGTGATTCATTTGGTAGGGTTACTCTTGATTCAATTTTATTGGAAGGTCTGCTTCCATATTTTATCGTCATAATGATGAAGGGATCAAATGCTTGTGGTGTTTGTATCTTAAGATAGACCTTTGTTTTGCTTCTAATTGACAAGATTATATACAATGAATTGAGCAGTTGCTAATAGAATAAACTTGCACATTATGAACCCTGTAATTTTAGTGATGGTCCCTTCCCTGTAATTTTAAGTTTGTACTGTTCG

mRNA sequence

ATTCGGGGCATATTTCGTAAATTCCGCCACCATTTAGGATCCCCTTTCTTCCTTCAGTCCACAGTTCTTTGTCCTCAGTCGAATCCCCATTCTCTCGTTTCTCTAGACTTCCCACGAACACAGCGACGCCTACTGTAACTACAAACTTCCCGAAGTCTTCACATTCTCACAACCCTCTCTTATCTTCTCATCTTCCTCCTTTCTCTTTTGCTCGGCTTCGATTCTCATCTATATATTCTCTCAATTCTCTCTCTGCCCATTCCTTCTTTCCCCATTCTTGCAATGCTCCAAACCCATTTGATTTTGTTCTGCCAGAGTTAATGTACGCACGCGCTCAACCGTTGACCATGGGCGACCAGATCGCCTCTGTCCCCGACTGCGGCGACAGCGGCGAGCCCTTGGATAATCGTCTCGTTCGATACGGAGCTCACTCTCTCGAAGATGGTGGTGGAGTAGGCGGCATGATCGAGGATCTCAACCCTGACGCCGTTTATGCTTCCGCCGGTGATGGTTCGGACTTGGCTGTTCAGCGCAACGATGGTTCAAGCCAGCTCACACTTTCATTCCGTGGTCAGGTTTATCTTTTCGATGCAGTCTCTCCTGAGAAGGTTCAAGCAGTATTGTTGTTGTTGGGTGGTTGCGAACTATCTTCTGGTCAGCAAAGCGTGGATTTGGTTAATCAAAATCAGAGGAATGCTGTGGACTTCCCTGGACGCAGTAGTCAACCCCAGAGGGCAGCCTCGTTAAATCGGTTTCGACAGAAAAGGAAGGAGCGATGTTATGATAAAAAAGTTAGATACAGTGTTCGTCAGGAGGTTGCTCTCAGGATGCAACGCAACAAAGGACAGTTTACTTCTTCCAAGAAGCCTGATGGTTCGTATAGTCATGGCAGTGTCTCGGAGTTAGGTCAAGATGAAAGTCCACCAGAAACTTCATGCACAAATTGTGGCATAAGTTCGATGGCAACCCCAATGATGCGGCGAGGACCATCTGGGCCTAGATCACTTTGCAATGCCTGTGGACTCTTTTGGGCAAACCGGGGAACTTTAAGGGATCTTTCAAAGAGAAGTCAAGACCATCCGGTGACCCCAGCCGAGCAGTGTGAAAGTGATGGTGGCAAGGACTTAGACTGCAGACATGATAAAAAAAAGGCCATTATGGCTGTTTCTGGCTTCATTCAATCAGGCCCTCGTTTTGTTAGGGCTTTCGAAAAGTTGATGGCGGCTGCAAATCCTCAGCCTTTGCAAGCGCGTCCCTTCCAGGAACATGTCCAAGTTTCCACAATGATGGGGGATGATGACGGTGAATATGAAGATGGTGGTGGTGGTGGTGGTGGCGGCGGCGGCGGCGGCGGCGGCGATGTTATGGATGATGTCGAAGAAGCTCATATGACTTCAGTGAGCGTAGCGAATCATGGGGGGTTGGTCATGGCGTCTAGAACTAGTGAACTTACGCTCTCTTTTGAGGGCGAGGTTTATGTGTTCCCCGCAGTTACTCCCGAAAAGGTACAAGCCGTGCTCTTGCTTCTTGGAGGGCGTGATGTGCCAACCGGTGTACCTACAATGGAAGTACCATACGATCATAATAACAGGGGTATGGTTGACACACCAAAGCGCTCCAACTTATCACGGAGAATAGCCTCCCTGGTTAGGTTTCGTGAAAAACGGAAAGAGAGATGTTTTGACAAGAAAATTAGGTACACTGTACGGAAAGAGGTTGCACAGAGGATGCACCGCAAGAATGGCCAATTTGCATCCTTAAAAGAAAGTTCAGGTGCTTCAAGTTGGGAGTCAGCACATAGTTGCCTCCAAGATGGTACTCGTTCAGAAACTGTTTTGCGGAAGTGTCAACATTGTGGTGTTAGTGAGAACAATACACCTGCAATGCGTCGTGGGCCTGCTGGACCAAGGACTTTATGTAATGCATGTGGCTTGATGTGGGCGAACAAGGGCACGTTGAGAGACCTCAGCAAGGGAGGGAGGAATGTGTCTCTGGATCATATGGAACCTGAAACCCCAATGGATGTCAAGCCCACAATCATGGAGGGAGAATTTTCGGGCATCCAGGATGAACATGGAACTCCTGAGGATCCTAGTAAAACCATGACTGAAGGCTCCAGTAATCCTTCCATTGACCCGGATGAGGAAGACATAAATGAAACCACTGGAGACCTTACAAATTCGTTGCGCATGCGAACTGTTAATCATTCAACAAATGACGATGAGCAGGAACCTCTTGTCGAACTTGCTAATCCTTCGGATACCGATATAGACATCCCCACTAACTTTGATTAGAAGATGTCAAAAGTTACTATCAACAGTGGAGTGGCAGTTAGGACTTGTATAATTACAAAAAAATGTCCAGTGACATAAGGTTCAAGTGTGCAGTGGTCATCTTTTGGCCGGGACGTTGCACGCTGGTCTCAGCTGACGTTAGATCTTCTAATGATGTTGATATTGCTGCCAGAAACTGTTTGTATTCAGTGTACGAGTTGAGTTAGGGGATACAGTTGAGTTCTTCCATCAGTGTATGATCTGAGTGATTCATTTGGTAGGGTTACTCTTGATTCAATTTTATTGGAAGGTCTGCTTCCATATTTTATCGTCATAATGATGAAGGGATCAAATGCTTGTGGTGTTTGTATCTTAAGATAGACCTTTGTTTTGCTTCTAATTGACAAGATTATATACAATGAATTGAGCAGTTGCTAATAGAATAAACTTGCACATTATGAACCCTGTAATTTTAGTGATGGTCCCTTCCCTGTAATTTTAAGTTTGTACTGTTCG

Coding sequence (CDS)

ATGTACGCACGCGCTCAACCGTTGACCATGGGCGACCAGATCGCCTCTGTCCCCGACTGCGGCGACAGCGGCGAGCCCTTGGATAATCGTCTCGTTCGATACGGAGCTCACTCTCTCGAAGATGGTGGTGGAGTAGGCGGCATGATCGAGGATCTCAACCCTGACGCCGTTTATGCTTCCGCCGGTGATGGTTCGGACTTGGCTGTTCAGCGCAACGATGGTTCAAGCCAGCTCACACTTTCATTCCGTGGTCAGGTTTATCTTTTCGATGCAGTCTCTCCTGAGAAGGTTCAAGCAGTATTGTTGTTGTTGGGTGGTTGCGAACTATCTTCTGGTCAGCAAAGCGTGGATTTGGTTAATCAAAATCAGAGGAATGCTGTGGACTTCCCTGGACGCAGTAGTCAACCCCAGAGGGCAGCCTCGTTAAATCGGTTTCGACAGAAAAGGAAGGAGCGATGTTATGATAAAAAAGTTAGATACAGTGTTCGTCAGGAGGTTGCTCTCAGGATGCAACGCAACAAAGGACAGTTTACTTCTTCCAAGAAGCCTGATGGTTCGTATAGTCATGGCAGTGTCTCGGAGTTAGGTCAAGATGAAAGTCCACCAGAAACTTCATGCACAAATTGTGGCATAAGTTCGATGGCAACCCCAATGATGCGGCGAGGACCATCTGGGCCTAGATCACTTTGCAATGCCTGTGGACTCTTTTGGGCAAACCGGGGAACTTTAAGGGATCTTTCAAAGAGAAGTCAAGACCATCCGGTGACCCCAGCCGAGCAGTGTGAAAGTGATGGTGGCAAGGACTTAGACTGCAGACATGATAAAAAAAAGGCCATTATGGCTGTTTCTGGCTTCATTCAATCAGGCCCTCGTTTTGTTAGGGCTTTCGAAAAGTTGATGGCGGCTGCAAATCCTCAGCCTTTGCAAGCGCGTCCCTTCCAGGAACATGTCCAAGTTTCCACAATGATGGGGGATGATGACGGTGAATATGAAGATGGTGGTGGTGGTGGTGGTGGCGGCGGCGGCGGCGGCGGCGGCGATGTTATGGATGATGTCGAAGAAGCTCATATGACTTCAGTGAGCGTAGCGAATCATGGGGGGTTGGTCATGGCGTCTAGAACTAGTGAACTTACGCTCTCTTTTGAGGGCGAGGTTTATGTGTTCCCCGCAGTTACTCCCGAAAAGGTACAAGCCGTGCTCTTGCTTCTTGGAGGGCGTGATGTGCCAACCGGTGTACCTACAATGGAAGTACCATACGATCATAATAACAGGGGTATGGTTGACACACCAAAGCGCTCCAACTTATCACGGAGAATAGCCTCCCTGGTTAGGTTTCGTGAAAAACGGAAAGAGAGATGTTTTGACAAGAAAATTAGGTACACTGTACGGAAAGAGGTTGCACAGAGGATGCACCGCAAGAATGGCCAATTTGCATCCTTAAAAGAAAGTTCAGGTGCTTCAAGTTGGGAGTCAGCACATAGTTGCCTCCAAGATGGTACTCGTTCAGAAACTGTTTTGCGGAAGTGTCAACATTGTGGTGTTAGTGAGAACAATACACCTGCAATGCGTCGTGGGCCTGCTGGACCAAGGACTTTATGTAATGCATGTGGCTTGATGTGGGCGAACAAGGGCACGTTGAGAGACCTCAGCAAGGGAGGGAGGAATGTGTCTCTGGATCATATGGAACCTGAAACCCCAATGGATGTCAAGCCCACAATCATGGAGGGAGAATTTTCGGGCATCCAGGATGAACATGGAACTCCTGAGGATCCTAGTAAAACCATGACTGAAGGCTCCAGTAATCCTTCCATTGACCCGGATGAGGAAGACATAAATGAAACCACTGGAGACCTTACAAATTCGTTGCGCATGCGAACTGTTAATCATTCAACAAATGACGATGAGCAGGAACCTCTTGTCGAACTTGCTAATCCTTCGGATACCGATATAGACATCCCCACTAACTTTGATTAG

Protein sequence

MYARAQPLTMGDQIASVPDCGDSGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDAVYASAGDGSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVDLVNQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLFWANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRAFEKLMAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHMTSVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPYDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFASLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEGSSNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDIDIPTNFD
Homology
BLAST of Cla97C08G154610 vs. NCBI nr
Match: KAG7029665.1 (GATA transcription factor 25 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1050.8 bits (2716), Expect = 4.8e-303
Identity = 545/637 (85.56%), Postives = 562/637 (88.23%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIASVPDCGDSGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDAVYAS 60
           MY RA+PLTMGDQIAS PDCGD+GE LDNRLVRYGAHSLEDG GVG MIEDLNPDAVY S
Sbjct: 1   MYGRAEPLTMGDQIASTPDCGDTGEALDNRLVRYGAHSLEDGAGVGSMIEDLNPDAVYVS 60

Query: 61  AGDGSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVDLVN 120
           AGDGSD+AVQR+DGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQ SVDLVN
Sbjct: 61  AGDGSDMAVQRSDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQPSVDLVN 120

Query: 121 QNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQFTSS 180
           QNQRNA+D PGRSSQPQRAASL RFRQKRKERC+DKKVRY VRQEVALRMQRNKGQFTSS
Sbjct: 121 QNQRNAIDLPGRSSQPQRAASLYRFRQKRKERCFDKKVRYGVRQEVALRMQRNKGQFTSS 180

Query: 181 KKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLFWANR 240
           KKPDG YSH SVSE  +DE+P ETSCTNCG+SSM TPMMRRGPSGPRSLCNACGLFWANR
Sbjct: 181 KKPDGLYSHDSVSESRRDENPLETSCTNCGVSSMVTPMMRRGPSGPRSLCNACGLFWANR 240

Query: 241 GTLRDLSKRSQDHPVTPA-EQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRAFEKL 300
           GTL+DLSKRSQDH VTPA EQCESD  KD DCR                           
Sbjct: 241 GTLKDLSKRSQDHRVTPATEQCESDAAKDFDCRRGNH----------------------- 300

Query: 301 MAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHMTS 360
             A +  PLQARPFQEHVQV TMMG+DDGEYED     GGGGGGGGGD MDDVE AHMTS
Sbjct: 301 --APSNLPLQARPFQEHVQVPTMMGNDDGEYED----DGGGGGGGGGDFMDDVEGAHMTS 360

Query: 361 VSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPY 420
           VSV  HGGLVMASRTSELTLSFEGEVYVFPAV+PEKVQAVLLLLGGRDVPTGVPTMEVPY
Sbjct: 361 VSVVKHGGLVMASRTSELTLSFEGEVYVFPAVSPEKVQAVLLLLGGRDVPTGVPTMEVPY 420

Query: 421 DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS 480
           D NNRGM DTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS
Sbjct: 421 DDNNRGMFDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS 480

Query: 481 LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM 540
           LKE SGASSWESAHSCLQDG+RSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM
Sbjct: 481 LKEGSGASSWESAHSCLQDGSRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM 540

Query: 541 WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEGS 600
           WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDP KTMTEGS
Sbjct: 541 WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPGKTMTEGS 600

Query: 601 SNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQ 637
           SNPS+DP EED NETT +LTNSL  +  NHSTN DEQ
Sbjct: 601 SNPSLDPVEEDTNETTLELTNSLPKQIANHSTNGDEQ 608

BLAST of Cla97C08G154610 vs. NCBI nr
Match: KAG7020257.1 (GATA transcription factor 24 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 922.2 bits (2382), Expect = 2.6e-264
Identity = 515/677 (76.07%), Postives = 528/677 (77.99%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIASVPDCGDSGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDAVYAS 60
           MY R QPLT+GDQIASVPDCGD GEPLDNRLVRYGAH+LEDG GVGGMIEDLNPDAVYAS
Sbjct: 88  MYGRPQPLTVGDQIASVPDCGDGGEPLDNRLVRYGAHALEDGAGVGGMIEDLNPDAVYAS 147

Query: 61  AGDGSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVDLVN 120
           AGDGSD+A+QR+DGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGG ELSSGQQSVDLV 
Sbjct: 148 AGDGSDMAIQRSDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGSELSSGQQSVDLVT 207

Query: 121 QNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQFTSS 180
           Q+QRNAVDFPGRSSQPQRAASLN                         RMQRNKGQFTSS
Sbjct: 208 QSQRNAVDFPGRSSQPQRAASLN-------------------------RMQRNKGQFTSS 267

Query: 181 KKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLFWANR 240
           KKPDGSYSHGSVSE GQ+ESP ET CTNCG SSMATPMMRRGPSGPRSLCNACGLFWANR
Sbjct: 268 KKPDGSYSHGSVSEQGQEESPLETLCTNCGTSSMATPMMRRGPSGPRSLCNACGLFWANR 327

Query: 241 GT-LRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRAFEKL 300
              L D+            + C              K  I    GFI      VRAF+K 
Sbjct: 328 IVWLMDM-----------PDLC--------------KGLIGFYDGFISGA---VRAFDKF 387

Query: 301 MAAANPQPLQARPFQEHVQVSTMMGDDDGEYED-GGGGGGGGGGGGGGDVMDDVEEAHMT 360
           MAAANPQPLQARPFQEHVQV  MMGDDDGEYED GGGGGGGGGGGGGGDVMDDVEEAHMT
Sbjct: 388 MAAANPQPLQARPFQEHVQVPAMMGDDDGEYEDGGGGGGGGGGGGGGGDVMDDVEEAHMT 447

Query: 361 SVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVP 420
           SVSVAN GGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPT VPTMEVP
Sbjct: 448 SVSVANLGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTSVPTMEVP 507

Query: 421 YDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFA 480
           YDH +RGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFA
Sbjct: 508 YDHMSRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFA 567

Query: 481 SLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGL 540
           SLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGL
Sbjct: 568 SLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGL 627

Query: 541 MWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEH----------GTP 600
           MWANK                    ETPMDVKPTIMEGEFSGI DEH           TP
Sbjct: 628 MWANK--------------------ETPMDVKPTIMEGEFSGIPDEHVLLLYFSLITNTP 687

Query: 601 E--------DPSKTMTEGSSNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLV 658
                     P          P I     DINETTG  TNSL M+ VN STNDDEQEPL+
Sbjct: 688 SIWLACVICKPMHIQYACRELPRI----LDINETTGGQTNSLPMQIVNDSTNDDEQEPLI 687

BLAST of Cla97C08G154610 vs. NCBI nr
Match: GAV66855.1 (GATA domain-containing protein/tify domain-containing protein/CCT domain-containing protein [Cephalotus follicularis])

HSP 1 Score: 794.7 bits (2051), Expect = 6.2e-226
Identity = 432/667 (64.77%), Postives = 495/667 (74.21%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIASVPDCGD---SGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDAV 60
           MYA  Q + +  QIAS  D  D   SGEP+DN       H   + G V   +ED+  D V
Sbjct: 1   MYAHPQAMNIHSQIASSVDDDDGSGSGEPIDNH-----THIAYENGVV---VEDIASDGV 60

Query: 61  YASAGDGSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVD 120
           Y      SD+A+QR DGSSQL+LSFRGQVY+FD+V+P+KVQAVLLLLGGCEL+SG   ++
Sbjct: 61  YIPGAAKSDMAIQRADGSSQLSLSFRGQVYVFDSVTPDKVQAVLLLLGGCELTSGPNGME 120

Query: 121 LVNQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQF 180
           ++ QNQR  V+ P R SQPQR ASLNRFRQKRKERC+DKKVRYSVRQEVALRMQRNKGQF
Sbjct: 121 VMQQNQRGVVNLPARCSQPQRVASLNRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQF 180

Query: 181 TSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLFW 240
           TS+KKP+G YS G + + GQD+ P + +C +CGISS ATPMMRRGPSGPRSLCNACGL+W
Sbjct: 181 TSAKKPEGGYSWGDIQDSGQDDIPLDIACMHCGISSKATPMMRRGPSGPRSLCNACGLYW 240

Query: 241 ANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRAFE 300
           ANRG+LRDLSKR+QDH +TP EQ +                                   
Sbjct: 241 ANRGSLRDLSKRTQDHALTPMEQID----------------------------------- 300

Query: 301 KLMAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHM 360
                                      DDDGEYEDGG G  G         MDDVEEAHM
Sbjct: 301 ---------------------------DDDGEYEDGGVGDDG---------MDDVEEAHM 360

Query: 361 TSVSVANH---GGLVMASRTSELTLSFEGEVYVFPAVTPEKVQ---AVLLLLGGRDVPTG 420
           +SV+VA H   GG+VMASRTSELTL+FEGEV+VFPAVTPEK +    +LLLLGGRD+PT 
Sbjct: 361 SSVNVAEHGDGGGVVMASRTSELTLAFEGEVFVFPAVTPEKARIYSLLLLLLGGRDIPTA 420

Query: 421 VPTMEVPYDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMH 480
           VPT+E+PYD NNR   DT KRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMH
Sbjct: 421 VPTIELPYDQNNRSAGDTQKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMH 480

Query: 481 RKNGQFASLKESSGASSWESAHSCLQDGTRSE-TVLRKCQHCGVSENNTPAMRRGPAGPR 540
           RKNGQFAS+KESSGASSW+SA SCLQDGT S  T++R+CQHCGV+ENNTPAMRRGPAGPR
Sbjct: 481 RKNGQFASIKESSGASSWDSARSCLQDGTPSAITIVRRCQHCGVTENNTPAMRRGPAGPR 540

Query: 541 TLCNACGLMWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPED 600
           TLCNACGLMWANKGTLRDLSKGGRN+S+DH+EPETP+DVKP IMEGEFSG  +EHGTPED
Sbjct: 541 TLCNACGLMWANKGTLRDLSKGGRNLSMDHIEPETPIDVKPAIMEGEFSGNHEEHGTPED 587

Query: 601 PSKTMTEGSSNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDI 658
            S+ +T G SNPSI+P+EED+ E   DLTN+LR   VN S +DDE+EPLVELA PSDTD 
Sbjct: 601 TSRAIT-GGSNPSINPEEEDLQEGAEDLTNALRTGMVNSSADDDEREPLVELATPSDTDF 587

BLAST of Cla97C08G154610 vs. NCBI nr
Match: RXH92614.1 (hypothetical protein DVH24_033510 [Malus domestica])

HSP 1 Score: 747.7 bits (1929), Expect = 8.7e-212
Identity = 431/687 (62.74%), Postives = 500/687 (72.78%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIASVPDCGD----SGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDA 60
           MY  +Q +TM + I   P C D    + + +DN  ++Y +H+LEDGG V  ++ED + D 
Sbjct: 1   MYGHSQDMTMPNPI---PACDDDDAGAADSIDNAHIQYDSHTLEDGGIV--VVEDGSSDG 60

Query: 61  VYASAGD-GSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQS 120
           VY   G   S+L  Q  DG+SQLTLSFRGQV++FDAV+PEKVQAVLLLLGG ELS   Q 
Sbjct: 61  VYVQGGSASSELRGQPYDGASQLTLSFRGQVFVFDAVTPEKVQAVLLLLGGNELSPNAQG 120

Query: 121 VDLVNQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKG 180
            +L +QN R   DFP R SQP RAASL RFRQKRKERC+DKKVRY VRQEVALRMQRNKG
Sbjct: 121 TELASQNPRATEDFP-RCSQPHRAASLFRFRQKRKERCFDKKVRYGVRQEVALRMQRNKG 180

Query: 181 QFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGL 240
           QF+SSKK DG  +  +  E GQ+++  ET C +CGISS +TPMMRRGPSGPRSLCNACGL
Sbjct: 181 QFSSSKKSDGDGNWSNGQESGQEDTHAETFCKHCGISSKSTPMMRRGPSGPRSLCNACGL 240

Query: 241 FWANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRA 300
           FWANRG++R+LSKRS  H V   EQ      KDLD       ++ A+       P     
Sbjct: 241 FWANRGSMRELSKRS--HDVKRTEQGRDSDTKDLD-------SVTAIDAHNNLVP----- 300

Query: 301 FEKLMAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGG------------------G 360
                 A NPQPL+A PF+EH +    + DDDG+YEDGG G                  G
Sbjct: 301 ---FSNAVNPQPLEAGPFEEHGRGQIQVEDDDGDYEDGGDGMEDMEEVHANPVSVAEREG 360

Query: 361 GGGGGGGGGDVMDDVEEAHMTSVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQ 420
           GGGGGGGGG                   GG+VMASRTSELTLSFEGEVYVFPAVT EKVQ
Sbjct: 361 GGGGGGGGGG------------------GGVVMASRTSELTLSFEGEVYVFPAVTHEKVQ 420

Query: 421 AVLLLLGGRDVPTGVPTMEVPYDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKK 480
           AVLLLLGGRDVPTGVPT+EV YD N RG+ DTPKRSNLSRRIASLVRFREKRKERCFDKK
Sbjct: 421 AVLLLLGGRDVPTGVPTVEVSYDQNPRGVGDTPKRSNLSRRIASLVRFREKRKERCFDKK 480

Query: 481 IRYTVRKEVAQRMHRKNGQFASLKESSGASSWESAHSCLQDGT-RSETVLRKCQHCGVSE 540
           IRYTVRKEVAQRM RKNGQFASLK++SGASSW+S  SC QDGT + ETV+R+CQHCGVSE
Sbjct: 481 IRYTVRKEVAQRMLRKNGQFASLKQTSGASSWDSTQSCPQDGTPQPETVVRRCQHCGVSE 540

Query: 541 NNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEG 600
           NNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRN+++D++EP TP +VKP+++EG
Sbjct: 541 NNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRNLTMDNIEPGTPTEVKPSVVEG 600

Query: 601 EFSGIQDEHGTPEDPSKTMTEGSSNPSIDPDEE------DINETTGDLTNSLRMRTVNHS 658
           EFS  QDE G  +  SK +TEGS++ S++PDEE      D++ET  DLTNS  M  V  S
Sbjct: 601 EFSRNQDECGILDGLSKNITEGSNDASVNPDEEECIAFKDLHETAEDLTNSYPMGIV--S 644

BLAST of Cla97C08G154610 vs. NCBI nr
Match: XP_038884827.1 (GATA transcription factor 19-like isoform X1 [Benincasa hispida] >XP_038884828.1 GATA transcription factor 19-like isoform X1 [Benincasa hispida] >XP_038884829.1 GATA transcription factor 19-like isoform X1 [Benincasa hispida])

HSP 1 Score: 680.2 bits (1754), Expect = 1.7e-191
Identity = 352/361 (97.51%), Postives = 353/361 (97.78%), Query Frame = 0

Query: 300 MAAANPQPLQARPFQEHVQVSTMMGDDDGEYED-GGGGGGGGGGGGGGDVMDDVEEAHMT 359
           MAAANPQPLQARPFQ HVQV TMMGDDD EYED GGGGGGGGGGGGGGDVMDDVEEAHMT
Sbjct: 1   MAAANPQPLQARPFQGHVQVPTMMGDDDAEYEDGGGGGGGGGGGGGGGDVMDDVEEAHMT 60

Query: 360 SVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVP 419
           SVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVP
Sbjct: 61  SVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVP 120

Query: 420 YDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFA 479
           YDHNNRG+VDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFA
Sbjct: 121 YDHNNRGIVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFA 180

Query: 480 SLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGL 539
           SLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGL
Sbjct: 181 SLKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGL 240

Query: 540 MWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEG 599
           MWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEG
Sbjct: 241 MWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEG 300

Query: 600 SSNPSI--DPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDIDIPTNF 658
           SSNPSI  DPDEEDINETTGDLTNSL MR VNHSTNDDEQEPLVELANPSDTDIDIPTNF
Sbjct: 301 SSNPSIDPDPDEEDINETTGDLTNSLPMRIVNHSTNDDEQEPLVELANPSDTDIDIPTNF 360

BLAST of Cla97C08G154610 vs. ExPASy Swiss-Prot
Match: Q9LRH6 (GATA transcription factor 25 OS=Arabidopsis thaliana OX=3702 GN=GATA25 PE=1 SV=2)

HSP 1 Score: 245.4 bits (625), Expect = 1.8e-63
Identity = 136/233 (58.37%), Postives = 169/233 (72.53%), Query Frame = 0

Query: 47  GMIEDLNPDA--VYASAGDGSDLAVQR-NDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLL 106
           G   DL PD   + A   DGS+L V R  +G++QLT+SFRGQVY+FDAV  +KV AVL L
Sbjct: 50  GAASDLIPDGSQLVAHRSDGSELLVSRPPEGANQLTISFRGQVYVFDAVGADKVDAVLSL 109

Query: 107 LGG-CELSSGQQSVDLV-NQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYS 166
           LGG  EL+ G Q ++L   QN    V++  R S PQRA SL+RFR+KR  RC++KKVRY 
Sbjct: 110 LGGSTELAPGPQVMELAQQQNHMPVVEYQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYG 169

Query: 167 VRQEVALRMQRNKGQFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRR 226
           VRQEVALRM RNKGQFTSSK  DG+Y+ G+  +  QD++ PE SCT+CGISS  TPMMRR
Sbjct: 170 VRQEVALRMARNKGQFTSSKMTDGAYNSGTDQDSAQDDAHPEISCTHCGISSKCTPMMRR 229

Query: 227 GPSGPRSLCNACGLFWANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHD 275
           GPSGPR+LCNACGLFWANRGTLRDLSK+++++ +  A     DGG   D  ++
Sbjct: 230 GPSGPRTLCNACGLFWANRGTLRDLSKKTEENQL--ALMKPDDGGSVADAANN 280

BLAST of Cla97C08G154610 vs. ExPASy Swiss-Prot
Match: Q8GXL7 (GATA transcription factor 24 OS=Arabidopsis thaliana OX=3702 GN=GATA24 PE=1 SV=2)

HSP 1 Score: 209.5 bits (532), Expect = 1.1e-52
Identity = 115/183 (62.84%), Postives = 133/183 (72.68%), Query Frame = 0

Query: 376 ELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPYDHNNR--GMVDTPKRS 435
           +LTLSF+G+VYVF  V+PEKVQAVLLLLGGR+VP  +PT       NNR  G+  TP+R 
Sbjct: 79  QLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRL 138

Query: 436 NLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFASLKES-----SGASS 495
           ++ +R+ASL+RFREKRK R FDK IRYTVRKEVA RM RK GQF S K S     S  S 
Sbjct: 139 SVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSD 198

Query: 496 WESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLMWANKGTLRD 552
           W S  S   +GT ++     C+HCG SE +TP MRRGP GPRTLCNACGLMWANKGTLRD
Sbjct: 199 WGSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRD 258

BLAST of Cla97C08G154610 vs. ExPASy Swiss-Prot
Match: Q8H1G0 (GATA transcription factor 28 OS=Arabidopsis thaliana OX=3702 GN=GATA28 PE=1 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 4.9e-48
Identity = 127/261 (48.66%), Postives = 161/261 (61.69%), Query Frame = 0

Query: 302 AANPQPLQARPFQEHV--QVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHMTS 361
           A +P  +Q   F+ H    +    G  D + +DG  GG   G      V  D+  +H  +
Sbjct: 16  AQDPMHVQ---FEHHALHHIHNGSGMVDDQADDGNAGGMSEG------VETDI-PSHPGN 75

Query: 362 VSVANHGGLV--MASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEV 421
           V+  N G +V   + +  +LTLSF+G+VYVF +V PEKVQAVLLLLGGR++P   P    
Sbjct: 76  VT-DNRGEVVDRGSEQGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGGRELPQAAPPGLG 135

Query: 422 PYDHNNR--GMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNG 481
               NNR   +  TP+R ++ +R+ASLVRFREKRK R FDKKIRYTVRKEVA RM R  G
Sbjct: 136 SPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGRNFDKKIRYTVRKEVALRMQRNKG 195

Query: 482 QFASLKE-----SSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPR 541
           QF S K      +S  SSW S  +   + + ++     C+HCG+ E +TP MRRGPAGPR
Sbjct: 196 QFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQEISCRHCGIGEKSTPMMRRGPAGPR 255

Query: 542 TLCNACGLMWANKGTLRDLSK 552
           TLCNACGLMWANKG  RDLSK
Sbjct: 256 TLCNACGLMWANKGAFRDLSK 265

BLAST of Cla97C08G154610 vs. ExPASy Swiss-Prot
Match: Q5Z4U5 (GATA transcription factor 20 OS=Oryza sativa subsp. japonica OX=39947 GN=GATA20 PE=2 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 1.1e-47
Identity = 110/196 (56.12%), Postives = 135/196 (68.88%), Query Frame = 0

Query: 75  SSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVDLVNQNQRNAVDFPGRSS 134
           S+QLTLSF+G+VY+FD+VSP+KVQAVLLLLGG EL+ G      +     ++  +  R +
Sbjct: 125 SNQLTLSFQGEVYVFDSVSPDKVQAVLLLLGGRELNPG------LGSGASSSAPYSKRLN 184

Query: 135 QPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQFTSSK-----------KP 194
            P R ASL RFR+KRKER +DKK+RYSVR+EVALRMQRN+GQFTSSK             
Sbjct: 185 FPHRVASLMRFREKRKERNFDKKIRYSVRKEVALRMQRNRGQFTSSKPKGDEATSELTAS 244

Query: 195 DGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLFWANRGTL 254
           DGS + GSV    +   P    C +CGI++ ATPMMRRGP GPR+LCNACGL WAN+G L
Sbjct: 245 DGSPNWGSV----EGRPPSAAECHHCGINAKATPMMRRGPDGPRTLCNACGLMWANKGML 304

Query: 255 RDLSKRSQDHPVTPAE 260
           RDLSK     P TP +
Sbjct: 305 RDLSKA----PPTPIQ 306

BLAST of Cla97C08G154610 vs. ExPASy Swiss-Prot
Match: A2XKR7 (GATA transcription factor 18 OS=Oryza sativa subsp. indica OX=39946 GN=GATA18 PE=3 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 5.1e-45
Identity = 120/242 (49.59%), Postives = 148/242 (61.16%), Query Frame = 0

Query: 322 MMGDDDGEYEDGGGGGGGGGGGGG---GDVMDDVEEAHMTSVSVANHG------GLVMAS 381
           +M D   +   GGG      G  G    +  DD EE     +  A           ++  
Sbjct: 17  VMRDAPADAAAGGGDNNDDDGDDGTEEDEEEDDDEEGDEEELPPAEDPAAPEPVSALLPG 76

Query: 382 RTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPYDHNNRGMVDTPKR 441
             ++LTL F+GEVYVF +VTPEKVQAVLLLLG  ++P G+  M +P    NRG  D  +R
Sbjct: 77  SPNQLTLLFQGEVYVFESVTPEKVQAVLLLLGSCEMPPGLANMVLPNQRENRGYDDLLQR 136

Query: 442 SNL-SRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFASLKESSGASSWES 501
           +++ ++R+ASL+RFREKRKER FDKKIRY VRKEVA RM R+ GQFA      G S    
Sbjct: 137 TDIPAKRVASLIRFREKRKERNFDKKIRYAVRKEVALRMQRRKGQFAGRANMEGESLSPG 196

Query: 502 AHSCLQDGTRSETVLR--KCQHCGVSENNTPAMRRGPAGPRTLCNACGLMWANKGTLRDL 552
                Q G+  + + R  KCQ+CG SE  TPAMRRGPAGPRTLCNACGLMWANKGTLR+ 
Sbjct: 197 CELASQ-GSGQDFLSRESKCQNCGTSEKMTPAMRRGPAGPRTLCNACGLMWANKGTLRNC 256

BLAST of Cla97C08G154610 vs. ExPASy TrEMBL
Match: A0A1Q3BG01 (GATA domain-containing protein/tify domain-containing protein/CCT domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_10365 PE=3 SV=1)

HSP 1 Score: 794.7 bits (2051), Expect = 3.0e-226
Identity = 432/667 (64.77%), Postives = 495/667 (74.21%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIASVPDCGD---SGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDAV 60
           MYA  Q + +  QIAS  D  D   SGEP+DN       H   + G V   +ED+  D V
Sbjct: 1   MYAHPQAMNIHSQIASSVDDDDGSGSGEPIDNH-----THIAYENGVV---VEDIASDGV 60

Query: 61  YASAGDGSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVD 120
           Y      SD+A+QR DGSSQL+LSFRGQVY+FD+V+P+KVQAVLLLLGGCEL+SG   ++
Sbjct: 61  YIPGAAKSDMAIQRADGSSQLSLSFRGQVYVFDSVTPDKVQAVLLLLGGCELTSGPNGME 120

Query: 121 LVNQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQF 180
           ++ QNQR  V+ P R SQPQR ASLNRFRQKRKERC+DKKVRYSVRQEVALRMQRNKGQF
Sbjct: 121 VMQQNQRGVVNLPARCSQPQRVASLNRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQF 180

Query: 181 TSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLFW 240
           TS+KKP+G YS G + + GQD+ P + +C +CGISS ATPMMRRGPSGPRSLCNACGL+W
Sbjct: 181 TSAKKPEGGYSWGDIQDSGQDDIPLDIACMHCGISSKATPMMRRGPSGPRSLCNACGLYW 240

Query: 241 ANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRAFE 300
           ANRG+LRDLSKR+QDH +TP EQ +                                   
Sbjct: 241 ANRGSLRDLSKRTQDHALTPMEQID----------------------------------- 300

Query: 301 KLMAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHM 360
                                      DDDGEYEDGG G  G         MDDVEEAHM
Sbjct: 301 ---------------------------DDDGEYEDGGVGDDG---------MDDVEEAHM 360

Query: 361 TSVSVANH---GGLVMASRTSELTLSFEGEVYVFPAVTPEKVQ---AVLLLLGGRDVPTG 420
           +SV+VA H   GG+VMASRTSELTL+FEGEV+VFPAVTPEK +    +LLLLGGRD+PT 
Sbjct: 361 SSVNVAEHGDGGGVVMASRTSELTLAFEGEVFVFPAVTPEKARIYSLLLLLLGGRDIPTA 420

Query: 421 VPTMEVPYDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMH 480
           VPT+E+PYD NNR   DT KRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMH
Sbjct: 421 VPTIELPYDQNNRSAGDTQKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMH 480

Query: 481 RKNGQFASLKESSGASSWESAHSCLQDGTRSE-TVLRKCQHCGVSENNTPAMRRGPAGPR 540
           RKNGQFAS+KESSGASSW+SA SCLQDGT S  T++R+CQHCGV+ENNTPAMRRGPAGPR
Sbjct: 481 RKNGQFASIKESSGASSWDSARSCLQDGTPSAITIVRRCQHCGVTENNTPAMRRGPAGPR 540

Query: 541 TLCNACGLMWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPED 600
           TLCNACGLMWANKGTLRDLSKGGRN+S+DH+EPETP+DVKP IMEGEFSG  +EHGTPED
Sbjct: 541 TLCNACGLMWANKGTLRDLSKGGRNLSMDHIEPETPIDVKPAIMEGEFSGNHEEHGTPED 587

Query: 601 PSKTMTEGSSNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDI 658
            S+ +T G SNPSI+P+EED+ E   DLTN+LR   VN S +DDE+EPLVELA PSDTD 
Sbjct: 601 TSRAIT-GGSNPSINPEEEDLQEGAEDLTNALRTGMVNSSADDDEREPLVELATPSDTDF 587

BLAST of Cla97C08G154610 vs. ExPASy TrEMBL
Match: A0A498JEZ9 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_033510 PE=3 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 4.2e-212
Identity = 431/687 (62.74%), Postives = 500/687 (72.78%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIASVPDCGD----SGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDA 60
           MY  +Q +TM + I   P C D    + + +DN  ++Y +H+LEDGG V  ++ED + D 
Sbjct: 1   MYGHSQDMTMPNPI---PACDDDDAGAADSIDNAHIQYDSHTLEDGGIV--VVEDGSSDG 60

Query: 61  VYASAGD-GSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQS 120
           VY   G   S+L  Q  DG+SQLTLSFRGQV++FDAV+PEKVQAVLLLLGG ELS   Q 
Sbjct: 61  VYVQGGSASSELRGQPYDGASQLTLSFRGQVFVFDAVTPEKVQAVLLLLGGNELSPNAQG 120

Query: 121 VDLVNQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKG 180
            +L +QN R   DFP R SQP RAASL RFRQKRKERC+DKKVRY VRQEVALRMQRNKG
Sbjct: 121 TELASQNPRATEDFP-RCSQPHRAASLFRFRQKRKERCFDKKVRYGVRQEVALRMQRNKG 180

Query: 181 QFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGL 240
           QF+SSKK DG  +  +  E GQ+++  ET C +CGISS +TPMMRRGPSGPRSLCNACGL
Sbjct: 181 QFSSSKKSDGDGNWSNGQESGQEDTHAETFCKHCGISSKSTPMMRRGPSGPRSLCNACGL 240

Query: 241 FWANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRA 300
           FWANRG++R+LSKRS  H V   EQ      KDLD       ++ A+       P     
Sbjct: 241 FWANRGSMRELSKRS--HDVKRTEQGRDSDTKDLD-------SVTAIDAHNNLVP----- 300

Query: 301 FEKLMAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGG------------------G 360
                 A NPQPL+A PF+EH +    + DDDG+YEDGG G                  G
Sbjct: 301 ---FSNAVNPQPLEAGPFEEHGRGQIQVEDDDGDYEDGGDGMEDMEEVHANPVSVAEREG 360

Query: 361 GGGGGGGGGDVMDDVEEAHMTSVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQ 420
           GGGGGGGGG                   GG+VMASRTSELTLSFEGEVYVFPAVT EKVQ
Sbjct: 361 GGGGGGGGGG------------------GGVVMASRTSELTLSFEGEVYVFPAVTHEKVQ 420

Query: 421 AVLLLLGGRDVPTGVPTMEVPYDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKK 480
           AVLLLLGGRDVPTGVPT+EV YD N RG+ DTPKRSNLSRRIASLVRFREKRKERCFDKK
Sbjct: 421 AVLLLLGGRDVPTGVPTVEVSYDQNPRGVGDTPKRSNLSRRIASLVRFREKRKERCFDKK 480

Query: 481 IRYTVRKEVAQRMHRKNGQFASLKESSGASSWESAHSCLQDGT-RSETVLRKCQHCGVSE 540
           IRYTVRKEVAQRM RKNGQFASLK++SGASSW+S  SC QDGT + ETV+R+CQHCGVSE
Sbjct: 481 IRYTVRKEVAQRMLRKNGQFASLKQTSGASSWDSTQSCPQDGTPQPETVVRRCQHCGVSE 540

Query: 541 NNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEG 600
           NNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRN+++D++EP TP +VKP+++EG
Sbjct: 541 NNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLSKGGRNLTMDNIEPGTPTEVKPSVVEG 600

Query: 601 EFSGIQDEHGTPEDPSKTMTEGSSNPSIDPDEE------DINETTGDLTNSLRMRTVNHS 658
           EFS  QDE G  +  SK +TEGS++ S++PDEE      D++ET  DLTNS  M  V  S
Sbjct: 601 EFSRNQDECGILDGLSKNITEGSNDASVNPDEEECIAFKDLHETAEDLTNSYPMGIV--S 644

BLAST of Cla97C08G154610 vs. ExPASy TrEMBL
Match: A0A2H5P729 (Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_109570 PE=3 SV=1)

HSP 1 Score: 672.9 bits (1735), Expect = 1.3e-189
Identity = 380/662 (57.40%), Postives = 446/662 (67.37%), Query Frame = 0

Query: 1   MYARAQPLTMGDQIA---SVPDCGDSGEPLDNRLVRYGAHSLEDGGGVGGMIEDLNPDAV 60
           MY ++Q + +  Q++   +  D  D     D+  + Y  HS  + G V  ++ED+  D+ 
Sbjct: 117 MYGQSQSMNISSQMSGGGAAADEDDVSVAADDHHLSYDPHSALENGIV--VVEDVAHDSG 176

Query: 61  YASAGDGSDLAVQRNDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLLLGGCELSSGQQSVD 120
           YA+ G+           SSQLTLSFRGQVY+FD+V+P+KVQAVLLLLGGCELSS  Q ++
Sbjct: 177 YATGGN-------ELSNSSQLTLSFRGQVYVFDSVTPDKVQAVLLLLGGCELSSSPQGME 236

Query: 121 LVNQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYSVRQEVALRMQRNKGQF 180
           ++  +QR   D+P + +QPQRAASL+RFRQKRKERC+DKKVRYSVRQEVALRMQRNKGQF
Sbjct: 237 VIPHSQRGIADYPAKCTQPQRAASLDRFRQKRKERCFDKKVRYSVRQEVALRMQRNKGQF 296

Query: 181 TSSKKPD-GSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRRGPSGPRSLCNACGLF 240
           TS+KK + G+    +  + GQD+SP ETSCT+CGISS +TPMMRRGPSGPRSLCNACGLF
Sbjct: 297 TSAKKCEGGALGWSNAQDPGQDDSPSETSCTHCGISSKSTPMMRRGPSGPRSLCNACGLF 356

Query: 241 WANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHDKKKAIMAVSGFIQSGPRFVRAF 300
           WAN+G LRDL K+ +D P+TPAEQ E                                  
Sbjct: 357 WANKGALRDLGKKMEDQPLTPAEQGE---------------------------------- 416

Query: 301 EKLMAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAH 360
                                          GE  D   G                    
Sbjct: 417 -------------------------------GEVNDSDCG-------------------- 476

Query: 361 MTSVSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTME 420
                         A+ T                   E VQAVLLLLGGRD+PTGVPT+E
Sbjct: 477 -------------TAAHTDN-----------------ELVQAVLLLLGGRDIPTGVPTIE 536

Query: 421 VPYDHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQ 480
           VPYD +NRG+VDTPKRSNLSRRIASLVRFREKRKERCFDKKIRY+VRKEVAQRMHRKNGQ
Sbjct: 537 VPYDQSNRGVVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYSVRKEVAQRMHRKNGQ 596

Query: 481 FASLKESSGASSWESAHSCLQDGT-RSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNA 540
           FASLKESSGAS W+S+   +QDGT R ETV+R+CQHCGVSENNTPAMRRGPAGPRTLCNA
Sbjct: 597 FASLKESSGASPWDSSQDGIQDGTPRPETVVRRCQHCGVSENNTPAMRRGPAGPRTLCNA 654

Query: 541 CGLMWANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTM 600
           CGLMWANKGTLRDLSKGGR++S+D +EPETPMDVKP+IMEGEFSG QDE GTPEDP+K +
Sbjct: 657 CGLMWANKGTLRDLSKGGRSLSMDQLEPETPMDVKPSIMEGEFSGNQDELGTPEDPAKAV 654

Query: 601 TEGSSNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDIDIPTN 658
            +GS NPSIDPDEED++    DLTNSL M  V+ S +DDEQEPLVELANPSDTDIDIP+N
Sbjct: 717 NQGSDNPSIDPDEEDMHGAAEDLTNSLPMGLVHSSADDDEQEPLVELANPSDTDIDIPSN 654

BLAST of Cla97C08G154610 vs. ExPASy TrEMBL
Match: A0A1S3BAZ6 (GATA transcription factor 24-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488115 PE=3 SV=1)

HSP 1 Score: 671.4 bits (1731), Expect = 3.9e-189
Identity = 344/358 (96.09%), Postives = 346/358 (96.65%), Query Frame = 0

Query: 300 MAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHMTS 359
           MAAANPQPLQARPFQEHVQV +MMGDDDGEYED      GGGGGGGGDVMDDVEEAHMTS
Sbjct: 1   MAAANPQPLQARPFQEHVQVPSMMGDDDGEYED------GGGGGGGGDVMDDVEEAHMTS 60

Query: 360 VSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPY 419
           VSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVP GVPTMEVPY
Sbjct: 61  VSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPAGVPTMEVPY 120

Query: 420 DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS 479
           DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS
Sbjct: 121 DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS 180

Query: 480 LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM 539
           LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM
Sbjct: 181 LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM 240

Query: 540 WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEGS 599
           WANKGTLRDLSKGGRNVSLDHMEPETPMDVKP IMEGEFSGIQDEHGTPEDPSKTMTEGS
Sbjct: 241 WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPVIMEGEFSGIQDEHGTPEDPSKTMTEGS 300

Query: 600 SNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDIDIPTNFD 658
           SNPSID DEEDINETTGDLTNSL MR VNHS+NDDEQEPLVELANPSDTDIDIPTNFD
Sbjct: 301 SNPSIDLDEEDINETTGDLTNSLPMRIVNHSSNDDEQEPLVELANPSDTDIDIPTNFD 352

BLAST of Cla97C08G154610 vs. ExPASy TrEMBL
Match: A0A0A0LLS6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G370420 PE=3 SV=1)

HSP 1 Score: 661.0 bits (1704), Expect = 5.2e-186
Identity = 339/358 (94.69%), Postives = 342/358 (95.53%), Query Frame = 0

Query: 300 MAAANPQPLQARPFQEHVQVSTMMGDDDGEYEDGGGGGGGGGGGGGGDVMDDVEEAHMTS 359
           MAAANPQPLQARPFQEHVQV +MM DDDGEYED      GGGGGGGGDVMDDVEEAHMTS
Sbjct: 1   MAAANPQPLQARPFQEHVQVPSMMADDDGEYED------GGGGGGGGDVMDDVEEAHMTS 60

Query: 360 VSVANHGGLVMASRTSELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPY 419
           VSVANHGGLVMASR SELTLSFEGEVYVFP VTPEKVQAVLLLLGGRDVP  VPTMEVPY
Sbjct: 61  VSVANHGGLVMASRASELTLSFEGEVYVFPEVTPEKVQAVLLLLGGRDVPADVPTMEVPY 120

Query: 420 DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS 479
           DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS
Sbjct: 121 DHNNRGMVDTPKRSNLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFAS 180

Query: 480 LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM 539
           LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM
Sbjct: 181 LKESSGASSWESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLM 240

Query: 540 WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPTIMEGEFSGIQDEHGTPEDPSKTMTEGS 599
           WANKGTLRDLSKGGRNVSLDHMEPETPMDVKP IMEGEFSGIQDEHGTPEDPSKTMTEGS
Sbjct: 241 WANKGTLRDLSKGGRNVSLDHMEPETPMDVKPVIMEGEFSGIQDEHGTPEDPSKTMTEGS 300

Query: 600 SNPSIDPDEEDINETTGDLTNSLRMRTVNHSTNDDEQEPLVELANPSDTDIDIPTNFD 658
           SNPSID DEEDINETTG+LTNSL MR VNHS+NDDEQEPLVELANPSDTDIDIPTNFD
Sbjct: 301 SNPSIDLDEEDINETTGELTNSLPMRIVNHSSNDDEQEPLVELANPSDTDIDIPTNFD 352

BLAST of Cla97C08G154610 vs. TAIR 10
Match: AT4G24470.3 (GATA-type zinc finger protein with TIFY domain )

HSP 1 Score: 246.9 bits (629), Expect = 4.5e-65
Identity = 137/239 (57.32%), Postives = 171/239 (71.55%), Query Frame = 0

Query: 47  GMIEDLNPDA--VYASAGDGSDLAVQR-NDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLL 106
           G   DL PD   + A   DGS+L V R  +G++QLT+SFRGQVY+FDAV  +KV AVL L
Sbjct: 50  GAASDLIPDGSQLVAHRSDGSELLVSRPPEGANQLTISFRGQVYVFDAVGADKVDAVLSL 109

Query: 107 LGG-CELSSGQQSVDLV-NQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYS 166
           LGG  EL+ G Q ++L   QN    V++  R S PQRA SL+RFR+KR  RC++KKVRY 
Sbjct: 110 LGGSTELAPGPQVMELAQQQNHMPVVEYQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYG 169

Query: 167 VRQEVALRMQRNKGQFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRR 226
           VRQEVALRM RNKGQFTSSK  DG+Y+ G+  +  QD++ PE SCT+CGISS  TPMMRR
Sbjct: 170 VRQEVALRMARNKGQFTSSKMTDGAYNSGTDQDSAQDDAHPEISCTHCGISSKCTPMMRR 229

Query: 227 GPSGPRSLCNACGLFWANRGTLRDLSKRSQDH------PVTPAEQCESDGGKDLDCRHD 275
           GPSGPR+LCNACGLFWANRGTLRDLSK+++++      PV+  +    DGG   D  ++
Sbjct: 230 GPSGPRTLCNACGLFWANRGTLRDLSKKTEENQLALMKPVSSYKYHPDDGGSVADAANN 288

BLAST of Cla97C08G154610 vs. TAIR 10
Match: AT4G24470.1 (GATA-type zinc finger protein with TIFY domain )

HSP 1 Score: 245.4 bits (625), Expect = 1.3e-64
Identity = 136/233 (58.37%), Postives = 169/233 (72.53%), Query Frame = 0

Query: 47  GMIEDLNPDA--VYASAGDGSDLAVQR-NDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLL 106
           G   DL PD   + A   DGS+L V R  +G++QLT+SFRGQVY+FDAV  +KV AVL L
Sbjct: 50  GAASDLIPDGSQLVAHRSDGSELLVSRPPEGANQLTISFRGQVYVFDAVGADKVDAVLSL 109

Query: 107 LGG-CELSSGQQSVDLV-NQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYS 166
           LGG  EL+ G Q ++L   QN    V++  R S PQRA SL+RFR+KR  RC++KKVRY 
Sbjct: 110 LGGSTELAPGPQVMELAQQQNHMPVVEYQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYG 169

Query: 167 VRQEVALRMQRNKGQFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRR 226
           VRQEVALRM RNKGQFTSSK  DG+Y+ G+  +  QD++ PE SCT+CGISS  TPMMRR
Sbjct: 170 VRQEVALRMARNKGQFTSSKMTDGAYNSGTDQDSAQDDAHPEISCTHCGISSKCTPMMRR 229

Query: 227 GPSGPRSLCNACGLFWANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHD 275
           GPSGPR+LCNACGLFWANRGTLRDLSK+++++ +  A     DGG   D  ++
Sbjct: 230 GPSGPRTLCNACGLFWANRGTLRDLSKKTEENQL--ALMKPDDGGSVADAANN 280

BLAST of Cla97C08G154610 vs. TAIR 10
Match: AT4G24470.2 (GATA-type zinc finger protein with TIFY domain )

HSP 1 Score: 245.4 bits (625), Expect = 1.3e-64
Identity = 136/233 (58.37%), Postives = 169/233 (72.53%), Query Frame = 0

Query: 47  GMIEDLNPDA--VYASAGDGSDLAVQR-NDGSSQLTLSFRGQVYLFDAVSPEKVQAVLLL 106
           G   DL PD   + A   DGS+L V R  +G++QLT+SFRGQVY+FDAV  +KV AVL L
Sbjct: 50  GAASDLIPDGSQLVAHRSDGSELLVSRPPEGANQLTISFRGQVYVFDAVGADKVDAVLSL 109

Query: 107 LGG-CELSSGQQSVDLV-NQNQRNAVDFPGRSSQPQRAASLNRFRQKRKERCYDKKVRYS 166
           LGG  EL+ G Q ++L   QN    V++  R S PQRA SL+RFR+KR  RC++KKVRY 
Sbjct: 110 LGGSTELAPGPQVMELAQQQNHMPVVEYQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYG 169

Query: 167 VRQEVALRMQRNKGQFTSSKKPDGSYSHGSVSELGQDESPPETSCTNCGISSMATPMMRR 226
           VRQEVALRM RNKGQFTSSK  DG+Y+ G+  +  QD++ PE SCT+CGISS  TPMMRR
Sbjct: 170 VRQEVALRMARNKGQFTSSKMTDGAYNSGTDQDSAQDDAHPEISCTHCGISSKCTPMMRR 229

Query: 227 GPSGPRSLCNACGLFWANRGTLRDLSKRSQDHPVTPAEQCESDGGKDLDCRHD 275
           GPSGPR+LCNACGLFWANRGTLRDLSK+++++ +  A     DGG   D  ++
Sbjct: 230 GPSGPRTLCNACGLFWANRGTLRDLSKKTEENQL--ALMKPDDGGSVADAANN 280

BLAST of Cla97C08G154610 vs. TAIR 10
Match: AT3G21175.2 (ZIM-like 1 )

HSP 1 Score: 214.5 bits (545), Expect = 2.5e-55
Identity = 115/181 (63.54%), Postives = 133/181 (73.48%), Query Frame = 0

Query: 376 ELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPYDHNNRGMVDTPKRSNL 435
           +LTLSF+G+VYVF  V+PEKVQAVLLLLGGR+VP  +PT       NNRG+  TP+R ++
Sbjct: 79  QLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRGLSGTPQRLSV 138

Query: 436 SRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFASLKES-----SGASSWE 495
            +R+ASL+RFREKRK R FDK IRYTVRKEVA RM RK GQF S K S     S  S W 
Sbjct: 139 PQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSDWG 198

Query: 496 SAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLMWANKGTLRDLS 552
           S  S   +GT ++     C+HCG SE +TP MRRGP GPRTLCNACGLMWANKGTLRDLS
Sbjct: 199 SNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRDLS 258

BLAST of Cla97C08G154610 vs. TAIR 10
Match: AT3G21175.1 (ZIM-like 1 )

HSP 1 Score: 209.5 bits (532), Expect = 8.0e-54
Identity = 115/183 (62.84%), Postives = 133/183 (72.68%), Query Frame = 0

Query: 376 ELTLSFEGEVYVFPAVTPEKVQAVLLLLGGRDVPTGVPTMEVPYDHNNR--GMVDTPKRS 435
           +LTLSF+G+VYVF  V+PEKVQAVLLLLGGR+VP  +PT       NNR  G+  TP+R 
Sbjct: 79  QLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRL 138

Query: 436 NLSRRIASLVRFREKRKERCFDKKIRYTVRKEVAQRMHRKNGQFASLKES-----SGASS 495
           ++ +R+ASL+RFREKRK R FDK IRYTVRKEVA RM RK GQF S K S     S  S 
Sbjct: 139 SVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSD 198

Query: 496 WESAHSCLQDGTRSETVLRKCQHCGVSENNTPAMRRGPAGPRTLCNACGLMWANKGTLRD 552
           W S  S   +GT ++     C+HCG SE +TP MRRGP GPRTLCNACGLMWANKGTLRD
Sbjct: 199 WGSNQSWAVEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRD 258

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7029665.14.8e-30385.56GATA transcription factor 25 [Cucurbita argyrosperma subsp. argyrosperma][more]
KAG7020257.12.6e-26476.07GATA transcription factor 24 [Cucurbita argyrosperma subsp. argyrosperma][more]
GAV66855.16.2e-22664.77GATA domain-containing protein/tify domain-containing protein/CCT domain-contain... [more]
RXH92614.18.7e-21262.74hypothetical protein DVH24_033510 [Malus domestica][more]
XP_038884827.11.7e-19197.51GATA transcription factor 19-like isoform X1 [Benincasa hispida] >XP_038884828.1... [more]
Match NameE-valueIdentityDescription
Q9LRH61.8e-6358.37GATA transcription factor 25 OS=Arabidopsis thaliana OX=3702 GN=GATA25 PE=1 SV=2[more]
Q8GXL71.1e-5262.84GATA transcription factor 24 OS=Arabidopsis thaliana OX=3702 GN=GATA24 PE=1 SV=2[more]
Q8H1G04.9e-4848.66GATA transcription factor 28 OS=Arabidopsis thaliana OX=3702 GN=GATA28 PE=1 SV=1[more]
Q5Z4U51.1e-4756.12GATA transcription factor 20 OS=Oryza sativa subsp. japonica OX=39947 GN=GATA20 ... [more]
A2XKR75.1e-4549.59GATA transcription factor 18 OS=Oryza sativa subsp. indica OX=39946 GN=GATA18 PE... [more]
Match NameE-valueIdentityDescription
A0A1Q3BG013.0e-22664.77GATA domain-containing protein/tify domain-containing protein/CCT domain-contain... [more]
A0A498JEZ94.2e-21262.74Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_033510 PE=3 SV=1[more]
A0A2H5P7291.3e-18957.40Uncharacterized protein OS=Citrus unshiu OX=55188 GN=CUMW_109570 PE=3 SV=1[more]
A0A1S3BAZ63.9e-18996.09GATA transcription factor 24-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC10348... [more]
A0A0A0LLS65.2e-18694.69Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G370420 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G24470.34.5e-6557.32GATA-type zinc finger protein with TIFY domain [more]
AT4G24470.11.3e-6458.37GATA-type zinc finger protein with TIFY domain [more]
AT4G24470.21.3e-6458.37GATA-type zinc finger protein with TIFY domain [more]
AT3G21175.22.5e-5563.54ZIM-like 1 [more]
AT3G21175.18.0e-5462.84ZIM-like 1 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000679Zinc finger, GATA-typeSMARTSM00401GATA_3coord: 503..556
e-value: 4.9E-8
score: 42.7
coord: 200..253
e-value: 2.7E-15
score: 66.8
IPR000679Zinc finger, GATA-typePFAMPF00320GATAcoord: 206..242
e-value: 9.1E-14
score: 50.8
coord: 509..545
e-value: 2.9E-13
score: 49.2
IPR000679Zinc finger, GATA-typePROSITEPS00344GATA_ZN_FINGER_1coord: 509..536
IPR000679Zinc finger, GATA-typePROSITEPS00344GATA_ZN_FINGER_1coord: 206..233
IPR000679Zinc finger, GATA-typePROSITEPS50114GATA_ZN_FINGER_2coord: 200..259
score: 11.002358
IPR000679Zinc finger, GATA-typePROSITEPS50114GATA_ZN_FINGER_2coord: 503..551
score: 8.52356
IPR000679Zinc finger, GATA-typeCDDcd00202ZnF_GATAcoord: 205..252
e-value: 2.69646E-15
score: 68.5534
IPR000679Zinc finger, GATA-typeCDDcd00202ZnF_GATAcoord: 508..552
e-value: 1.6716E-15
score: 68.9386
IPR010399Tify domainSMARTSM00979tify_2coord: 370..405
e-value: 4.2E-7
score: 39.6
coord: 71..106
e-value: 2.1E-11
score: 53.9
IPR010399Tify domainPFAMPF06200tifycoord: 74..105
e-value: 1.7E-11
score: 43.2
coord: 373..404
e-value: 4.5E-10
score: 38.7
IPR010399Tify domainPROSITEPS51320TIFYcoord: 370..405
score: 12.941822
IPR010399Tify domainPROSITEPS51320TIFYcoord: 71..106
score: 14.217342
IPR013088Zinc finger, NHR/GATA-typeGENE3D3.30.50.10coord: 501..559
e-value: 1.3E-13
score: 52.4
coord: 196..255
e-value: 5.6E-15
score: 56.8
IPR010402CCT domainPFAMPF06203CCTcoord: 138..180
e-value: 2.1E-14
score: 53.3
coord: 438..479
e-value: 3.0E-13
score: 49.6
IPR010402CCT domainPROSITEPS51017CCTcoord: 138..180
score: 12.371778
IPR010402CCT domainPROSITEPS51017CCTcoord: 438..480
score: 12.649593
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 127..146
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 248..268
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 324..346
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 127..142
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 579..657
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 171..208
NoneNo IPR availablePANTHERPTHR46125GATA TRANSCRIPTION FACTOR 28coord: 67..259
NoneNo IPR availablePANTHERPTHR46125GATA TRANSCRIPTION FACTOR 28coord: 300..657
NoneNo IPR availablePANTHERPTHR46125:SF7GATA TRANSCRIPTION FACTOR 28-LIKE ISOFORM X1coord: 67..259
NoneNo IPR availablePANTHERPTHR46125:SF7GATA TRANSCRIPTION FACTOR 28-LIKE ISOFORM X1coord: 300..657
NoneNo IPR availableSUPERFAMILY57716Glucocorticoid receptor-like (DNA-binding domain)coord: 200..250
NoneNo IPR availableSUPERFAMILY57716Glucocorticoid receptor-like (DNA-binding domain)coord: 504..553

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C08G154610.2Cla97C08G154610.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding