Cla97C01G010480 (gene) Watermelon (97103) v2

Name: Cla97C01G010480
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: trihelix transcription factor GT-2
Location: Cla97Chr01 : 15901010 .. 15903963 (-)
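
The location field can be unpacked programmatically. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotations), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr01 : 15901010 .. 15903963 (-)")
print(seqid, strand, span_length(start, end))  # Cla97Chr01 - 2954
```

Under that assumption the feature spans 2954 bp on the minus strand of Cla97Chr01.
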
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACCTCTTCACCGGCGACCACCGGATACCGAGCTCCGACAACTTCCCACAGCACGTTGCTCCATTTCCCGATCCGACGGACCTTCTCTACGCTGCTCCGTCCGCTGTATTTCCCCCTGCCGACATCATCACCCACCTCTCGAACCCTCCACCGCCGCCGCAGAAGCTTCGTCCCATCCGCTGCAACGGTAGGTCCCCGGCGGGATCTCAGGCCGAAAATATCTTCGATGGAGCCCTAAGGAGTTTCCAATGCGTTTCGTCGTCGCCTGAGGGTGGATTTTCTGGCGATCAGCTCTGTGTGGCTAATATTGACCCTTGTCAGTACTTCGACTCCTCGGCGAAAGATGATAAGCCCGAAGTCAAGGATAATGGCGGCTTCGGCGATATCATTGGAAATGATTACTTCTCGGAGGAAGAAACGAAGAACGGCGGATCCGGTGCTGCTATCGCTGCGGAGAATTTGAGCCGGAGCCGCGAAGGACCTCAATTGGATGACGATTCATGGTGAGTTGTAACTCTGAACTGAACATCGCCTCATGAATTGGGGGTTTTTGTTTTTCAATTGATCTATTGAAGTTTATTTACTTTAATTATTCATTTATTAAATATATTTTTAGAGTTGTTGGGTGCCTGGAATTTGTTTAGGAAGTGCCGAAATGCCAATTATTTTATGATGGTAAACTCCGAATCCTTATGGAGAAGTGAGAAGATTAAAGAAAAGCTTTCCTGGAAAGTGATCCTTATAATATGTGAAAAAGAAATTGTTCTTTCAGTTTGGTGACTTCCTTGGCCTCTCTTCATTCAAAGCCTGTATGATTTCTTTTAATTCTAAGAAAAGAAAAACAGGAAAAAATAAACATTATTAGTTATTTCACCAGATTTTTCAAGTATTACTTTCATCGTTTAGGCCTCTCTCTCTCTCTCTCCTTTAAATAGAATAGAAAAGTTAAAGATGAATCGTGTATAGCTTTCAACATTTAAGGTTCGAATTGAAGTATTGGAAATTCTTATAGGAGAAACATTTTTTTTTTCTTTTATCATTATTAAGTGAAATGAAAAGAAAAAGATACAAACAGGAGATTCCCAAAAAAGTCCCCAATAGCATTAATAAGAACAATAAGTAATTTTACAAAACCCCTTTTAGAGAGTGCACAAATTGAAGCAAGAAAATGACAAGGTCAACAGAGTCATAAAAGATTCTTCTTTTTCTCTCCTTTTTTAAAGTTCCCATCACTAGCAATCATCATATTGTCCCAGAAAATTCTAGCTCTCCTTTTGAAGGAAAGAGATCGTTCGAAATGGATTTGTACAAAAGAGTTCAGGACTACTTAATGCTTGTTTGAGAATTACGGTAATTACATTACATGTCTCTTTATATAGATAATGGGAGCAATCAGAAGACACGTACCTAAGAGGAGAATTGGAAAGAAGAGGTGAGGAGAGGAAGAAACATTATACATTGACCTCCTTTATATGGTATTGGATATAAAATCTAGAGGTGCAAGAATCTGTTTTTCATTTCAAAACTGATAGGTTTTTGGTTCTTGAAAGTTGAAATCTTGTGAATTCTTTTTGGTCCTGGTCAAATGAAGCATTTTCTGATACAAGTAGTATGGTTAGTTATGGTTCTTCCTTTTGTTGTGGTCACTGGCTGATATTGTGACGATGGGTTTAGGATAGAGTGAATTGAACAAAAAGTTCTTGCTCTGATATGAGGGTTTCTTGTCATTGCAGTAATTTGAGACTCGTGACTGTTGCTTTTGTATTACTTTTTTCTTTCCTGAAAAAGAAACCAAAATTTTCATTGATAATGAAGAAGTTACAAAATTTATAAGAACCAATTCTTCATTTTAGAAGAGCCCCCCCCCCCCCCCCCCCCCCCCCCCCCAATTCTTCAGCAAATTCCTTCTCATGGCTGTTTCTTTTCAAGCTTAACGCAAGTTCTATGGAATTCCTTGCCTTTAAGGTTTAAGTATTTAATACATGCACAGCAGCTTA
TGTTATCATGATTGAAGTTTTATCAGACCAAATAAGTAGAAAGCATGGCATGGATACATGCATAGATACAGAGGCTATATATAGTGCAACGATTAATGACAAATTACTTGATTACTTGTTTCCAGCTCAACGTCAGATGGTGTTGATGGTGTTTTTAGCACCAAAAAACATTTAAGCCATAAGAGAAAAAGAACAAGAAGGTCACTCGAGCATTTTGTGGAAAATTTGGTAATGAAGGTAATGGATAAACAGGAGGAGATGCATCGGCAGTTGATCGATATGATAGAAAAGAAGGAGAAAGAGAGAACAGTCAGAGAAGAAGCTTGGAAGCAGAGGGAGATTGAAAGAATGAGAAGGGATGAGGAGTTAAGAGCCCAAGAAACGTCTCGCAGTTTAGCAATTATTTCCTTGATCCAAAATTTGCTGGGCCATGAAATTCAAATCTCCCGACCAGTTGAAAACCAATGTACAGAAGAAGATGGAGGTGAAAGCAGCATTCAGAAGGAGCTAAAAAGTGATCCTAGTGGTAGGAGATGGCCTCAGGCTGAAGTACAATCTTTAATATCACTTCGAACGTCACTGGAACATAAATTTCGTGCTACAGGCTCGAAAGGATCGATATGGGAGGAGATATCGGTTGAGATGCAGAAGATGGGTTACAAGCGTTCAGCAAAGAAGTGCAAAGAAAAATGGGAAAACATGAACAAATATTTCAAAAGGACAATTGTAACTGGTAAGGCTAGTATTGCAAATGGTAAGACATGTCCATATTTTCAAGAATTAGATATTCTTTATAGAAATGGAGTGGTAAATACTGGAGCTGTCTTTGATAGTACAAATACTGAAAATAATTCCAAGGCTGAAAGAAGTATAGACCCTTTTCATGAAGAAGCCTTTGTAGAAGGTGAAAGCGAGCATATAAAACAGGAGGCCCTGGACATGGTACAATTTTAA

mRNA sequence

ATGGACCTCTTCACCGGCGACCACCGGATACCGAGCTCCGACAACTTCCCACAGCACGTTGCTCCATTTCCCGATCCGACGGACCTTCTCTACGCTGCTCCGTCCGCTGTATTTCCCCCTGCCGACATCATCACCCACCTCTCGAACCCTCCACCGCCGCCGCAGAAGCTTCGTCCCATCCGCTGCAACGGTAGGTCCCCGGCGGGATCTCAGGCCGAAAATATCTTCGATGGAGCCCTAAGGAGTTTCCAATGCGTTTCGTCGTCGCCTGAGGGTGGATTTTCTGGCGATCAGCTCTGTGTGGCTAATATTGACCCTTGTCAGTACTTCGACTCCTCGGCGAAAGATGATAAGCCCGAAGTCAAGGATAATGGCGGCTTCGGCGATATCATTGGAAATGATTACTTCTCGGAGGAAGAAACGAAGAACGGCGGATCCGGTGCTGCTATCGCTGCGGAGAATTTGAGCCGGAGCCGCGAAGGACCTCAATTGGATGACGATTCATGCTCAACGTCAGATGGTGTTGATGGTGTTTTTAGCACCAAAAAACATTTAAGCCATAAGAGAAAAAGAACAAGAAGGTCACTCGAGCATTTTGTGGAAAATTTGGTAATGAAGGTAATGGATAAACAGGAGGAGATGCATCGGCAGTTGATCGATATGATAGAAAAGAAGGAGAAAGAGAGAACAGTCAGAGAAGAAGCTTGGAAGCAGAGGGAGATTGAAAGAATGAGAAGGGATGAGGAGTTAAGAGCCCAAGAAACGTCTCGCAGTTTAGCAATTATTTCCTTGATCCAAAATTTGCTGGGCCATGAAATTCAAATCTCCCGACCAGTTGAAAACCAATGTACAGAAGAAGATGGAGGTGAAAGCAGCATTCAGAAGGAGCTAAAAAGTGATCCTAGTGGTAGGAGATGGCCTCAGGCTGAAGTACAATCTTTAATATCACTTCGAACGTCACTGGAACATAAATTTCGTGCTACAGGCTCGAAAGGATCGATATGGGAGGAGATATCGGTTGAGATGCAGAAGATGGGTTACAAGCGTTCAGCAAAGAAGTGCAAAGAAAAATGGGAAAACATGAACAAATATTTCAAAAGGACAATTGTAACTGGTAAGGCTAGTATTGCAAATGGTAAGACATGTCCATATTTTCAAGAATTAGATATTCTTTATAGAAATGGAGTGGTAAATACTGGAGCTGTCTTTGATAGTACAAATACTGAAAATAATTCCAAGGCTGAAAGAAGTATAGACCCTTTTCATGAAGAAGCCTTTGTAGAAGGTGAAAGCGAGCATATAAAACAGGAGGCCCTGGACATGGTACAATTTTAA

Coding sequence (CDS)

ATGGACCTCTTCACCGGCGACCACCGGATACCGAGCTCCGACAACTTCCCACAGCACGTTGCTCCATTTCCCGATCCGACGGACCTTCTCTACGCTGCTCCGTCCGCTGTATTTCCCCCTGCCGACATCATCACCCACCTCTCGAACCCTCCACCGCCGCCGCAGAAGCTTCGTCCCATCCGCTGCAACGGTAGGTCCCCGGCGGGATCTCAGGCCGAAAATATCTTCGATGGAGCCCTAAGGAGTTTCCAATGCGTTTCGTCGTCGCCTGAGGGTGGATTTTCTGGCGATCAGCTCTGTGTGGCTAATATTGACCCTTGTCAGTACTTCGACTCCTCGGCGAAAGATGATAAGCCCGAAGTCAAGGATAATGGCGGCTTCGGCGATATCATTGGAAATGATTACTTCTCGGAGGAAGAAACGAAGAACGGCGGATCCGGTGCTGCTATCGCTGCGGAGAATTTGAGCCGGAGCCGCGAAGGACCTCAATTGGATGACGATTCATGCTCAACGTCAGATGGTGTTGATGGTGTTTTTAGCACCAAAAAACATTTAAGCCATAAGAGAAAAAGAACAAGAAGGTCACTCGAGCATTTTGTGGAAAATTTGGTAATGAAGGTAATGGATAAACAGGAGGAGATGCATCGGCAGTTGATCGATATGATAGAAAAGAAGGAGAAAGAGAGAACAGTCAGAGAAGAAGCTTGGAAGCAGAGGGAGATTGAAAGAATGAGAAGGGATGAGGAGTTAAGAGCCCAAGAAACGTCTCGCAGTTTAGCAATTATTTCCTTGATCCAAAATTTGCTGGGCCATGAAATTCAAATCTCCCGACCAGTTGAAAACCAATGTACAGAAGAAGATGGAGGTGAAAGCAGCATTCAGAAGGAGCTAAAAAGTGATCCTAGTGGTAGGAGATGGCCTCAGGCTGAAGTACAATCTTTAATATCACTTCGAACGTCACTGGAACATAAATTTCGTGCTACAGGCTCGAAAGGATCGATATGGGAGGAGATATCGGTTGAGATGCAGAAGATGGGTTACAAGCGTTCAGCAAAGAAGTGCAAAGAAAAATGGGAAAACATGAACAAATATTTCAAAAGGACAATTGTAACTGGTAAGGCTAGTATTGCAAATGGTAAGACATGTCCATATTTTCAAGAATTAGATATTCTTTATAGAAATGGAGTGGTAAATACTGGAGCTGTCTTTGATAGTACAAATACTGAAAATAATTCCAAGGCTGAAAGAAGTATAGACCCTTTTCATGAAGAAGCCTTTGTAGAAGGTGAAAGCGAGCATATAAAACAGGAGGCCCTGGACATGGTACAATTTTAA

Protein sequence

MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPIRCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPEVKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFSTKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWKQREIERMRRDEELRAQETSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSDPSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSKAERSIDPFHEEAFVEGESEHIKQEALDMVQF
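
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the relationship can be checked with a small translator; the names `CODON_TABLE` and `translate` are hypothetical, and only the first and last codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGACCTCTTCACCGGCGACCAC"))  # first 8 codons  -> MDLFTGDH
print(translate("GACATGGTACAATTTTAA"))        # last 6 codons   -> DMVQF
```

Both fragments agree with the start (MDLFTGDH) and end (DMVQF) of the listed protein sequence; passing the full CDS string to `translate` would perform the same check over the whole sequence.
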
BLAST of Cla97C01G010480 vs. NCBI nr
Match: XP_004139609.1 (PREDICTED: trihelix transcription factor GT-2 [Cucumis sativus] >KGN65026.1 hypothetical protein Csa_1G181390 [Cucumis sativus])

HSP 1 Score: 776.9 bits (2005), Expect = 3.6e-221
Identity = 412/445 (92.58%), Positives = 422/445 (94.83%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFT DHRIP+SDNFPQHVAPFPDPTDLLYAAPS+VFPP DII HLSNPPPPPQKLRPI
Sbjct: 1   MDLFTADHRIPTSDNFPQHVAPFPDPTDLLYAAPSSVFPPTDIINHLSNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQAENIFDG+LRSFQCVSSSPEGGFSGDQLCVANIDPCQYF+SSAKD+KPE
Sbjct: 61  RCNGRSPAGSQAENIFDGSLRSFQCVSSSPEGGFSGDQLCVANIDPCQYFNSSAKDEKPE 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
           VK NG FGDII NDYFSEEETKNGGSGAAIAAENLSRSRE PQLDDDSCSTSDG D VFS
Sbjct: 121 VKHNGSFGDIIANDYFSEEETKNGGSGAAIAAENLSRSREEPQLDDDSCSTSDGGDAVFS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           +KKHLSHKRKRTRRSLEHFVE LVMKVMDKQEEMHRQLIDMIEKKE ERTVRE   XXXX
Sbjct: 181 SKKHLSHKRKRTRRSLEHFVEKLVMKVMDKQEEMHRQLIDMIEKKENERTVREXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
           XXXXXXXXXXXX   SRSLAIISLIQNLLGHEIQISRP ENQC E+DGGESSIQKELK D
Sbjct: 241 XXXXXXXXXXXXQETSRSLAIISLIQNLLGHEIQISRPAENQCAEDDGGESSIQKELKCD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWEN 360
           PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEIS+EMQKMGYKRSAKKCKEKWEN
Sbjct: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISIEMQKMGYKRSAKKCKEKWEN 360

Query: 361 MNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSKAERSIDP 420
           MNKYFKRT+VTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNS AERSIDP
Sbjct: 361 MNKYFKRTVVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSNAERSIDP 420

Query: 421 FHEEAFVEGESEHIKQ-EALDMVQF 445
           FHE+AFVEGE EHIKQ EALDMVQF
Sbjct: 421 FHEDAFVEGEREHIKQEEALDMVQF 445
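
The percentages in each HSP summary line are simple ratios of matching (or similar) positions to the reported alignment length. The sketch below recomputes them from the summary line of the hit above; `parse_hsp_stats` is a hypothetical helper, and the pattern accepts both the "Positives" and "Postives" spellings in case the raw page text is parsed.

```python
import re

def parse_hsp_stats(line):
    """Recompute identity/positives percentages from a BLAST HSP summary line."""
    stats = {}
    for label, num, den in re.findall(r"(Identity|Posi?tives) = (\d+)/(\d+)", line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100 * int(num) / int(den), 2)
    return stats

line = "Identity = 412/445 (92.58%), Positives = 422/445 (94.83%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 92.58, 'positives': 94.83}
```

The recomputed values match the percentages printed in the summary line.
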

BLAST of Cla97C01G010480 vs. NCBI nr
Match: XP_023001567.1 (trihelix transcription factor GT-2 isoform X1 [Cucurbita maxima])

HSP 1 Score: 741.1 bits (1912), Expect = 2.2e-210
Identity = 380/447 (85.01%), Positives = 401/447 (89.71%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFTGDHRIPSSD+FPQHVAPFPD TDLLYAAPSAVFP ADII HL NPPPPPQKLRPI
Sbjct: 1   MDLFTGDHRIPSSDSFPQHVAPFPDSTDLLYAAPSAVFPSADIIAHLPNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQA+NIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYF+SS KDDKP+
Sbjct: 61  RCNGRSPAGSQADNIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFNSSEKDDKPD 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
            KDNGGF DIIGN++FSEEETKNGG+ AAIAAENLSRS EGPQLDDDSCSTSDG D V S
Sbjct: 121 AKDNGGFSDIIGNNFFSEEETKNGGTDAAIAAENLSRSLEGPQLDDDSCSTSDGGDDVLS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           TKKHL+HKRKRT RSLE FVENL+MKVM+KQEEMHRQLIDMIEK EKER VREEAW    
Sbjct: 181 TKKHLNHKRKRTTRSLELFVENLIMKVMNKQEEMHRQLIDMIEKNEKERIVREEAWKQRE 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
                          SRSLAIIS IQNLLGHEIQIS+PVEN CTE+DGGESSIQKELKSD
Sbjct: 241 IERMRRDEELRAQETSRSLAIISFIQNLLGHEIQISQPVENHCTEDDGGESSIQKELKSD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWEN 360
           PS RRWP+AEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQK+GY RSAKKCKEKWEN
Sbjct: 301 PSSRRWPRAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKVGYNRSAKKCKEKWEN 360

Query: 361 MNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSKAERSIDP 420
           MNKYFKRTI TGKASIANGKTCPYFQELD LYRNGVVN+GAV DST+TE+NS+AERSIDP
Sbjct: 361 MNKYFKRTIGTGKASIANGKTCPYFQELDTLYRNGVVNSGAVIDSTSTEHNSQAERSIDP 420

Query: 421 FHE-EAFVEGES--EHIKQEALDMVQF 445
           FHE EAFV+GES  EH+KQEAL+M QF
Sbjct: 421 FHEDEAFVQGESEREHVKQEALEMTQF 447

BLAST of Cla97C01G010480 vs. NCBI nr
Match: XP_023520409.1 (trihelix transcription factor GT-2 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 738.8 bits (1906), Expect = 1.1e-209
Identity = 379/446 (84.98%), Positives = 399/446 (89.46%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFTGDHRIPSSD+FPQHVAPFPD TDLLYAAPSAVFP ADII HL NPPPPPQKLRPI
Sbjct: 1   MDLFTGDHRIPSSDSFPQHVAPFPDSTDLLYAAPSAVFPSADIIAHLPNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQA+NIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYF+SS KDDKP+
Sbjct: 61  RCNGRSPAGSQADNIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFNSSEKDDKPD 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
            KDNGGF DIIGN++FSEEETKNGG+ AA AAENLSRS EGPQLDDDSCSTSDG D V S
Sbjct: 121 AKDNGGFSDIIGNNFFSEEETKNGGTDAATAAENLSRSLEGPQLDDDSCSTSDGGDDVLS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           TKKHL+HKRKRT RSLE FVENL+MKVMDKQEEMHRQLIDMIEK EKER VREEAW    
Sbjct: 181 TKKHLNHKRKRTTRSLELFVENLIMKVMDKQEEMHRQLIDMIEKNEKERIVREEAWKQRE 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
                          SRSLAIIS IQNLLGHEIQIS+PVEN CTE+DGGESSIQKELKSD
Sbjct: 241 IERMRRDEELRAQETSRSLAIISFIQNLLGHEIQISQPVENHCTEDDGGESSIQKELKSD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWEN 360
           PS RRWP+AEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQK+GY RSAKKCKEKWEN
Sbjct: 301 PSSRRWPRAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKVGYNRSAKKCKEKWEN 360

Query: 361 MNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSKAERSIDP 420
           MNKYFKRTI TGKASIANGKTCPYFQELD LYRNGVVN+GAV DST+TE+NS+AERSIDP
Sbjct: 361 MNKYFKRTIGTGKASIANGKTCPYFQELDTLYRNGVVNSGAVIDSTSTEHNSQAERSIDP 420

Query: 421 FH-EEAFVEGES--EHIKQEALDMVQ 444
           FH EEAFV+GES  EH+KQEAL+M Q
Sbjct: 421 FHEEEAFVQGESEREHVKQEALEMTQ 446

BLAST of Cla97C01G010480 vs. NCBI nr
Match: XP_022927335.1 (trihelix transcription factor GT-2 isoform X1 [Cucurbita moschata])

HSP 1 Score: 736.9 bits (1901), Expect = 4.1e-209
Identity = 378/447 (84.56%), Positives = 398/447 (89.04%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFTGDHRIPSSD+FPQHVAPFPD TDLLYAAPSAVFP ADII HL NPPPPPQKLRPI
Sbjct: 1   MDLFTGDHRIPSSDSFPQHVAPFPDSTDLLYAAPSAVFPSADIIAHLPNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQA+NIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYF+SS KDDKP+
Sbjct: 61  RCNGRSPAGSQADNIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFNSSEKDDKPD 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
            KDNGGF DIIGN++F EEETKNGG+ AAIAAENLSRS EGPQLDDDSCSTSDG D V S
Sbjct: 121 AKDNGGFSDIIGNNFFPEEETKNGGTDAAIAAENLSRSLEGPQLDDDSCSTSDGGDDVLS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           TKKHL+HKRKRT RSLE FVENL+MKVMDKQEEMHRQLIDMIEK EKER VREEAW    
Sbjct: 181 TKKHLNHKRKRTTRSLELFVENLIMKVMDKQEEMHRQLIDMIEKNEKERIVREEAWKQRE 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
                          SRSLAIIS IQNLLGHEIQIS+PVEN CTE+DGGESSIQKELKSD
Sbjct: 241 IERMRRDEELRAQETSRSLAIISFIQNLLGHEIQISQPVENHCTEDDGGESSIQKELKSD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWEN 360
           PS RRWP+AEVQSLISLRTSLEHKFRATGSKGSIWEEISVEM K+GY RSAKKCKEKWEN
Sbjct: 301 PSSRRWPRAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMHKVGYNRSAKKCKEKWEN 360

Query: 361 MNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSKAERSIDP 420
           MNKYFKRTI TGKASIANGKTCPYFQELD LYRNGVVN+GAV DST+TE+NS+AERSIDP
Sbjct: 361 MNKYFKRTIGTGKASIANGKTCPYFQELDTLYRNGVVNSGAVIDSTSTEHNSQAERSIDP 420

Query: 421 FH-EEAFVEGES--EHIKQEALDMVQF 445
           FH EEAFV+GES  EH+KQEAL+  QF
Sbjct: 421 FHEEEAFVQGESEREHVKQEALETTQF 447

BLAST of Cla97C01G010480 vs. NCBI nr
Match: XP_023001568.1 (trihelix transcription factor GTL1 isoform X2 [Cucurbita maxima])

HSP 1 Score: 637.1 bits (1642), Expect = 4.4e-179
Identity = 325/379 (85.75%), Positives = 339/379 (89.45%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFTGDHRIPSSD+FPQHVAPFPD TDLLYAAPSAVFP ADII HL NPPPPPQKLRPI
Sbjct: 1   MDLFTGDHRIPSSDSFPQHVAPFPDSTDLLYAAPSAVFPSADIIAHLPNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQA+NIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYF+SS KDDKP+
Sbjct: 61  RCNGRSPAGSQADNIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFNSSEKDDKPD 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
            KDNGGF DIIGN++FSEEETKNGG+ AAIAAENLSRS EGPQLDDDSCSTSDG D V S
Sbjct: 121 AKDNGGFSDIIGNNFFSEEETKNGGTDAAIAAENLSRSLEGPQLDDDSCSTSDGGDDVLS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           TKKHL+HKRKRT RSLE FVENL+MKVM+KQEEMHRQLIDMIEK EKER VREEAW    
Sbjct: 181 TKKHLNHKRKRTTRSLELFVENLIMKVMNKQEEMHRQLIDMIEKNEKERIVREEAWKQRE 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
                          SRSLAIIS IQNLLGHEIQIS+PVEN CTE+DGGESSIQKELKSD
Sbjct: 241 IERMRRDEELRAQETSRSLAIISFIQNLLGHEIQISQPVENHCTEDDGGESSIQKELKSD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWEN 360
           PS RRWP+AEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQK+GY RSAKKCKEKWEN
Sbjct: 301 PSSRRWPRAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKVGYNRSAKKCKEKWEN 360

Query: 361 MNKYFKRTIVTGKASIANG 380
           MNKYFKRTI TGKASIANG
Sbjct: 361 MNKYFKRTIGTGKASIANG 379

BLAST of Cla97C01G010480 vs. TrEMBL
Match: tr|A0A0A0LYK7|A0A0A0LYK7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G181390 PE=4 SV=1)

HSP 1 Score: 776.9 bits (2005), Expect = 2.4e-221
Identity = 412/445 (92.58%), Positives = 422/445 (94.83%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFT DHRIP+SDNFPQHVAPFPDPTDLLYAAPS+VFPP DII HLSNPPPPPQKLRPI
Sbjct: 1   MDLFTADHRIPTSDNFPQHVAPFPDPTDLLYAAPSSVFPPTDIINHLSNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQAENIFDG+LRSFQCVSSSPEGGFSGDQLCVANIDPCQYF+SSAKD+KPE
Sbjct: 61  RCNGRSPAGSQAENIFDGSLRSFQCVSSSPEGGFSGDQLCVANIDPCQYFNSSAKDEKPE 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
           VK NG FGDII NDYFSEEETKNGGSGAAIAAENLSRSRE PQLDDDSCSTSDG D VFS
Sbjct: 121 VKHNGSFGDIIANDYFSEEETKNGGSGAAIAAENLSRSREEPQLDDDSCSTSDGGDAVFS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           +KKHLSHKRKRTRRSLEHFVE LVMKVMDKQEEMHRQLIDMIEKKE ERTVRE   XXXX
Sbjct: 181 SKKHLSHKRKRTRRSLEHFVEKLVMKVMDKQEEMHRQLIDMIEKKENERTVREXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
           XXXXXXXXXXXX   SRSLAIISLIQNLLGHEIQISRP ENQC E+DGGESSIQKELK D
Sbjct: 241 XXXXXXXXXXXXQETSRSLAIISLIQNLLGHEIQISRPAENQCAEDDGGESSIQKELKCD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWEN 360
           PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEIS+EMQKMGYKRSAKKCKEKWEN
Sbjct: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISIEMQKMGYKRSAKKCKEKWEN 360

Query: 361 MNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSKAERSIDP 420
           MNKYFKRT+VTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNS AERSIDP
Sbjct: 361 MNKYFKRTVVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSNAERSIDP 420

Query: 421 FHEEAFVEGESEHIKQ-EALDMVQF 445
           FHE+AFVEGE EHIKQ EALDMVQF
Sbjct: 421 FHEDAFVEGEREHIKQEEALDMVQF 445

BLAST of Cla97C01G010480 vs. TrEMBL
Match: tr|A0A1S3CJS0|A0A1S3CJS0_CUCME (trihelix transcription factor GT-2 OS=Cucumis melo OX=3656 GN=LOC103501749 PE=4 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 4.4e-151
Identity = 297/339 (87.61%), Positives = 304/339 (89.68%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRPI 60
           MDLFT DHRIP+SDNFPQHVAPFPDPTDLLYAAPSAVFPP DII HLSNPPPPPQKLRPI
Sbjct: 1   MDLFTADHRIPTSDNFPQHVAPFPDPTDLLYAAPSAVFPPTDIINHLSNPPPPPQKLRPI 60

Query: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKPE 120
           RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKD+KPE
Sbjct: 61  RCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDEKPE 120

Query: 121 VKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGVDGVFS 180
           VK NG FGDII NDYFSEEETKNGGSGAAIAAENLSRSRE PQLD+DSCSTSDG D VFS
Sbjct: 121 VKHNGSFGDIIANDYFSEEETKNGGSGAAIAAENLSRSREEPQLDNDSCSTSDGGDAVFS 180

Query: 181 TKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXX 240
           +KKHLSHKRKRTRRSLEHFVE LV+KVM KQEEMHRQ                   XXXX
Sbjct: 181 SKKHLSHKRKRTRRSLEHFVEKLVLKVMHKQEEMHRQXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSIQKELKSD 300
           XXXXXXXXXXX    SRSLAIISLIQNLLG+EIQISRPVENQCTE+DGGESSIQKELK D
Sbjct: 241 XXXXXXXXXXXAQETSRSLAIISLIQNLLGNEIQISRPVENQCTEDDGGESSIQKELKCD 300

Query: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEIS 340
           PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEIS
Sbjct: 301 PSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEIS 339

BLAST of Cla97C01G010480 vs. TrEMBL
Match: tr|A0A2N9GR49|A0A2N9GR49_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29855 PE=4 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 1.3e-70
Identity = 194/423 (45.86%), Positives = 253/423 (59.81%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDN-FPQHVAPFPDPTDLLYAAPSAVF--PPADIITHLSNPPPPPQKL 60
           M++FTGD  IPS +  FP HVAPFPD T+L+Y+ P+A     PA++ITH  +   PPQKL
Sbjct: 1   MEVFTGDGEIPSPEEAFPGHVAPFPDTTELIYSNPTASIHSSPAELITHRHS--FPPQKL 60

Query: 61  RPIRCNGRSPAGSQAEN--IFDGALRSFQCVSSSPEGGFSGDQ-LCVANIDPCQYFDSSA 120
           RPIR +   PA SQ     +  G L          + GF   Q +C  N    +Y ++  
Sbjct: 61  RPIRRS--PPASSQLPETPMIAGNL-------EEDDVGFLRSQGVCAVNRVTEEYLEA-- 120

Query: 121 KDDKPEV-KDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSD 180
               PE+ K    FG   G D +                      R              
Sbjct: 121 ----PEMEKCTDRFGSGTGTDGWDPN----------------CEVRVEXXXXXXXXXXXX 180

Query: 181 GVDGVFSTKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVRE 240
                                 L+ F+ +LV KVM++QEEMH++LI++IEK+ +ER +RE
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXKLKLFLGSLVAKVMERQEEMHKELIEIIEKRGRERIIRE 240

Query: 241 EAWXXXXXXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQCTEEDGGESSI 300
           EAW    XXXXXXXXXXXXXXXSRSLA+ISLIQN+LG EIQI +P+  QC EEDGGE  +
Sbjct: 241 EAWRQQEXXXXXXXXXXXXXXXSRSLALISLIQNILGDEIQIPQPLITQCKEEDGGEIGM 300

Query: 301 QKELKSDPSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKK 360
           Q ++K DP+ +RWP+AEVQ+LI+LR +LEHKFR TGS+GSIWEEISV M  MGY R AKK
Sbjct: 301 QNDVKCDPNNKRWPEAEVQALITLRATLEHKFRLTGSRGSIWEEISVGMYNMGYNRPAKK 360

Query: 361 CKEKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAVFDSTNTENNSK 417
           CKEKWEN+NKYF+R++ +GK   +N K C YF +L+ILYRNG++N G    +T  E  +K
Sbjct: 361 CKEKWENINKYFRRSMESGKKPSSNAKACQYFHDLNILYRNGLINPGFNVKNTTIEIEAK 390

BLAST of Cla97C01G010480 vs. TrEMBL
Match: tr|A0A2P5EXN0|A0A2P5EXN0_9ROSA (Octamer-binding transcription factor OS=Trema orientalis OX=63057 GN=TorRG33x02_139510 PE=4 SV=1)

HSP 1 Score: 257.3 bits (656), Expect = 6.3e-65
Identity = 190/426 (44.60%), Positives = 247/426 (57.98%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPT-DLLYAAPSAVFPPADIITHLSNPPPPPQKLRP 60
           M++F  D +IP+ D FPQ +APFP+PT DL+YA P+AV  P D+I H   P  PP KLRP
Sbjct: 1   MEVFAADRQIPNPDEFPQLIAPFPEPTDDLIYAHPTAVIHPPDVINH-HRPISPPHKLRP 60

Query: 61  IRCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDKP 120
           IR + RSP     ++   G L            GF G Q     +D     D + K +  
Sbjct: 61  IRYHARSPVADFPDH---GTLGK----------GFLGHQ--PGEVD--GELDCAVKLEAV 120

Query: 121 EVKDNGGFGDIIGNDYFSEEETKN----GGSGAAIAAENLSRSREGPQLDDDSCSTSDGV 180
           E          I ND  +  E  N     GSG  +    ++    G   ++   S+ DGV
Sbjct: 121 EAPK-------ISNDVCNVSEMMNYPIGFGSGMFLEGWEMNGEEHGVLENESGSSSDDGV 180

Query: 181 D-GVFSTKKHLSHKRKRTRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREE 240
           D    + K  ++ KR+++  +LEHF+ENLVMKVM+KQE+                     
Sbjct: 181 DCTAANLKGPMNRKRRKSTTNLEHFLENLVMKVMEKQEQXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 AWXXXXXXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQ-CTEEDGGESSI 300
             XXXXXXXXXXXXXXXXXXXSRSLA+I+ +QN+LG EIQI  PV  + C E +G E   
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXSRSLALITFLQNMLGEEIQIPEPVVREPCMEGNGVEIDT 300

Query: 301 QKELKSDPSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKK 360
           Q + K DP+ +RWP+AEVQ+LI+LR ++EHKFR   SKGS WEEISV M  +GY R+AKK
Sbjct: 301 QTDTKCDPNSKRWPEAEVQALIALRNAMEHKFRLACSKGSTWEEISVSMHSLGYNRTAKK 360

Query: 361 CKEKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTG--AVFDSTNTENN 418
           C+EKW+N+NKYFK++  +     ANGKTC YFQ LD+LY+NG+ NTG    F S    NN
Sbjct: 361 CREKWDNINKYFKKSRESANKRSANGKTCLYFQNLDLLYKNGLANTGIDGTFLSNGVTNN 401

BLAST of Cla97C01G010480 vs. TrEMBL
Match: tr|A0A2I4HJQ6|A0A2I4HJQ6_9ROSI (trihelix transcription factor GTL1 isoform X1 OS=Juglans regia OX=51240 GN=LOC109018704 PE=4 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 2.7e-63
Identity = 189/433 (43.65%), Positives = 255/433 (58.89%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDN-FPQHVAPFPDPTDLLYAAPSAVFPPADIITHLSNPPPPPQKLRP 60
           M+ F GD  +P  D  FP HV PFPD  DL+Y  P+A     +++ HL N   PPQKLRP
Sbjct: 1   MEPFAGDRGVPDPDEAFPDHVTPFPDTMDLIYDHPTAAVHSPELVAHLQN--LPPQKLRP 60

Query: 61  IRC-NGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAKDDK 120
           IRC N RSP     E + +   R       SPE     + + + +I              
Sbjct: 61  IRCFNFRSP-----EKLEETQRR------CSPE-----EAVALGSI-------------- 120

Query: 121 PEVKDNGGFGDIIGNDYFSEEETKNGGS---GAAIAAENLSR--SREGPQLDDDSCSTSD 180
                NG   ++ G  +    + + G +   G  +AA++  R  S  G      +C   D
Sbjct: 121 -----NGPVAEVPGECFGHPVKVEVGEAFKIGKGLAADDSERFGSVTGWGEWGPNCVEED 180

Query: 181 GVDGVF----------STKKHLSHKRK-RTRRSLEHFVENLVMKVMDKQEEMHRQLIDMI 240
              G            + K+ ++ KRK ++ R LE F+E+LV KV++KQE+MH+QLI+  
Sbjct: 181 VESGTXXXXXXXXXXANMKEPMNRKRKGKSSRKLELFLESLVNKVLEKQEQMHKQLIEXX 240

Query: 241 EKKEKERTVREEAWXXXXXXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVENQ 300
                         XXXXXXXXXXXXXXXXXXXSRSLA+IS IQNL G EI + +PV  Q
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSRSLALISFIQNLSGQEIPVPQPVNTQ 300

Query: 301 CTEEDGGESSIQKELKSDPSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEM 360
           C EEDG E  ++ ++K DP+G+RWP+AEVQ+LI+LR +LEHK   TGSK S+WEEISV M
Sbjct: 301 CKEEDGSEIGMEHDIKCDPNGKRWPEAEVQALITLRAALEHKSCLTGSKRSMWEEISVGM 360

Query: 361 QKMGYKRSAKKCKEKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILYRNGVVNTGAV 416
             MGY R+AKKCKEKWEN+NKYF+R++ +GK   ANGKTC YF EL++LY NG+++ G +
Sbjct: 361 WGMGYNRTAKKCKEKWENINKYFRRSMESGKKHSANGKTCHYFHELNMLYSNGLMDPGLL 396

BLAST of Cla97C01G010480 vs. Swiss-Prot
Match: sp|Q39117|TGT2_ARATH (Trihelix transcription factor GT-2 OS=Arabidopsis thaliana OX=3702 GN=GT-2 PE=2 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 9.8e-29
Identity = 76/217 (35.02%), Positives = 116/217 (53.46%), Query Frame = 0

Query: 203 LVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXXXXXXXXXXXXXXXXXS----RS 262
           L  ++M+KQE+M ++ ++ +E +EKER  REEAW                   S    + 
Sbjct: 268 LTKELMEKQEKMQKRFLETLEYREKERISREEAWRVQEIGRINREHETLIHERSNAAAKD 327

Query: 263 LAIISLIQNLLGHEIQ-----ISRPVENQCTEEDGGESSIQKELKS-------------- 322
            AIIS +  + G + Q       +P + +  + D   +   KE ++              
Sbjct: 328 AAIISFLHKISGGQPQQPQQHNHKPSQRKQYQSDHSITFESKEPRAVLLDTTIKMGNYDN 387

Query: 323 ----DPSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCK 382
                PS  RWP+ EV++LI +R +LE  ++  G+KG +WEEIS  M+++GY RSAK+CK
Sbjct: 388 NHSVSPSSSRWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCK 447

Query: 383 EKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILY 393
           EKWEN+NKYFK+   + K    + KTCPYF +L+ LY
Sbjct: 448 EKWENINKYFKKVKESNKKRPLDSKTCPYFHQLEALY 484

BLAST of Cla97C01G010480 vs. Swiss-Prot
Match: sp|Q9C882|GTL1_ARATH (Trihelix transcription factor GTL1 OS=Arabidopsis thaliana OX=3702 GN=GTL1 PE=1 SV=2)

HSP 1 Score: 111.3 bits (277), Expect = 2.8e-23
Identity = 97/273 (35.53%), Positives = 130/273 (47.62%), Query Frame = 0

Query: 201 ENLVMKVMDKQEEMHRQLIDMIEKKEKER----TVREEAWXXXXXXXXXXXXXXXXXXXS 260
           E LV +VM KQ  M R  ++ +EK+E+ER           XXXXXXXXXXXXXXXXXX S
Sbjct: 271 EGLVRQVMQKQAAMQRSFLEALEKREQERLXXXXXXXXXXXXXXXXXXXXXXXXXXXXAS 330

Query: 261 RSLAIISLIQNLLGHEIQI----------------------------------------- 320
           R  AIISLIQ + GH IQ+                                         
Sbjct: 331 RDAAIISLIQKITGHTIQLPPSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 390

Query: 321 ---------------SRPVENQCTEEDGGESSIQKELKSDPSGRRWPQAEVQSLISLRTS 380
                                                    S  RWP+AE+ +LI+LR+ 
Sbjct: 391 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSSSRWPKAEILALINLRSG 450

Query: 381 LEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVTGKASIANGK 414
           +E +++    KG +WEEIS  M++MGY R+AK+CKEKWEN+NKY+K+   + K    + K
Sbjct: 451 MEPRYQDNVPKGLLWEEISTSMKRMGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAK 510

BLAST of Cla97C01G010480 vs. Swiss-Prot
Match: sp|Q8H181|GTL2_ARATH (Trihelix transcription factor GTL2 OS=Arabidopsis thaliana OX=3702 GN=At5g28300 PE=2 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 6.0e-18
Identity = 82/261 (31.42%), Positives = 120/261 (45.98%), Query Frame = 0

Query: 196 LEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXXXXXXXXXXXXXXXXX 255
           L+ F E LV  ++ +QEEMH++L++ + KKE+E+  R    XXXX               
Sbjct: 299 LKGFCEGLVRNMIAQQEEMHKKLLEDMVKKEEEKIARXXXXXXXXIERVNKEVEIRAQEQ 358

Query: 256 S----RSLAIISLIQNLLGHEIQISRPVENQCTEEDGGES-------------------- 315
           +    R+  II  I     H++ +   V+N  +      S                    
Sbjct: 359 AMASDRNTNIIKFISKFTDHDLDV---VQNPTSPSQDSSSLALRKTQGRRKFQTSSSLLP 418

Query: 316 ---------SIQKEL------------------KSDPS---GRRWPQAEVQSLISLRTSL 375
                    +I K L                  KSD     G+RWP+ EV +LI++R S+
Sbjct: 419 QTLTPHNLLTIDKSLEPFSTKTLKPKNQNPKPPKSDDKSDLGKRWPKDEVLALINIRRSI 478

Query: 376 ------EHKFR---ATGSKG-SIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVT 393
                 +HK     +T SK   +WE IS +M ++GYKRSAK+CKEKWEN+NKYF++T   
Sbjct: 479 SNMNDDDHKDENSLSTSSKAVPLWERISKKMLEIGYKRSAKRCKEKWENINKYFRKTKDV 538

BLAST of Cla97C01G010480 vs. Swiss-Prot
Match: sp|Q9LZS0|PTL_ARATH (Trihelix transcription factor PTL OS=Arabidopsis thaliana OX=3702 GN=PTL PE=2 SV=1)

HSP 1 Score: 85.1 bits (209), Expect = 2.1e-15
Identity = 38/89 (42.70%), Positives = 63/89 (70.79%), Query Frame = 0

Query: 305 RWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEIS-VEMQKMGYKRSAKKCKEKWENMNK 364
           RWP+ E  +L+ +R+ L+HKF+    KG +W+E+S +  ++ GY+RS KKC+EK+EN+ K
Sbjct: 119 RWPRQETLTLLEIRSRLDHKFKEANQKGPLWDEVSRIMSEEHGYQRSGKKCREKFENLYK 178

Query: 365 YFKRTIVTGKASIANGKTCPYFQELDILY 393
           Y+++T   GKA   +GK   +F++L+ LY
Sbjct: 179 YYRKT-KEGKAGRQDGKHYRFFRQLEALY 206

BLAST of Cla97C01G010480 vs. Swiss-Prot
Match: sp|Q9SDW0|TGT3A_ARATH (Trihelix transcription factor GT-3a OS=Arabidopsis thaliana OX=3702 GN=GT-3A PE=1 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 1.5e-05
Identity = 24/88 (27.27%), Positives = 47/88 (53.41%), Query Frame = 0

Query: 305 RWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWENMNKY 364
           +W   E + L+++R  L+  F  T     +WE ++ +M   G+ RSA++CK KW+N+   
Sbjct: 51  QWSIEETKELLAIREELDQTFMETKRNKLLWEVVAAKMADKGFVRSAEQCKSKWKNLVTR 110

Query: 365 FKRTIVTGKASIANGKTCPYFQELDILY 393
           +K    T   +I   +  P++ E+  ++
Sbjct: 111 YKACETTEPDAIR--QQFPFYNEIQSIF 136

BLAST of Cla97C01G010480 vs. TAIR10
Match: AT5G47660.1 (Homeodomain-like superfamily protein)

HSP 1 Score: 164.1 bits (414), Expect = 2.0e-40
Identity = 161/421 (38.24%), Positives = 227/421 (53.92%), Query Frame = 0

Query: 1   MDLFTGDHRIPSSDNFPQHVAPFPDPTD-----LLYAAPSAVFPPADIITHLSNPPPPPQ 60
           M+L  GD R    D+F + + PF D +D     +            D +  L++   PPQ
Sbjct: 1   MELLAGDCRKRVGDDFEEDINPF-DGSDGGCGWMYGTRQMGSNGNDDALATLADLASPPQ 60

Query: 61  KLRPIRCNGRSPAGSQAENIFDGALRSFQCVSSSPEGGFSGDQLCVANIDPCQYFDSSAK 120
           KL+PIRC  + P+ S+  +  D    +   +   PE GF               F++   
Sbjct: 61  KLKPIRCGVKLPSSSEDRHPLDILAGT---LDRLPEMGFG-------------CFEAPLG 120

Query: 121 DDKPEVKDNGGFGDIIGNDYFSEEETKNGGSGAAIAAENLSRSREGPQLDDDSCSTSDGV 180
               +V+++G          FS+EE     S   +  E  +R+R    +  D  S S  V
Sbjct: 121 SKIADVEESGQL-----TRGFSKEE---DDSLPPLQMEFQARNR----ISWDGLSLSSSV 180

Query: 181 DGVFS-----TKKHLSHKRKR-TRRSLEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKER 240
           D   S      +K ++ KRKR TR  LEHF+E LV  +M +QE+MH QLI+++EK E ER
Sbjct: 181 DSSDSDSSPDVRKTVTGKRKRETRVKLEHFLEKLVGSMMKRQEKMHNQLINVMEKMEVER 240

Query: 241 TVREEAWXXXXXXXXXXXXXXXXXXXSRSLAIISLIQNLLGHEIQISRPVE--------- 300
             RE   XXXXXXXXXXXXXXXXXXX  +L++IS I+++ G EI+I +  E         
Sbjct: 241 IRREXXXXXXXXXXXXXXXXXXXXXXXXNLSLISFIRSVTGDEIEIPKQCEFPQPLQQIL 300

Query: 301 -NQCTEEDGGESSIQKELK------SDPSGRRWPQAEVQSLISLRTSLEHKFRATG-SKG 360
             QC +E    +  ++E+K      S  SGRRWPQ EVQ+LIS R+ +E K   TG +KG
Sbjct: 301 PEQCKDEKCESAQREREIKFRYSSGSGSSGRRWPQEEVQALISSRSDVEEK---TGINKG 360

Query: 361 SIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILY 394
           +IW+EIS  M++ GY+RSAKKCKEKWENMNKY++R    G+    + KT  YF++L   Y
Sbjct: 361 AIWDEISARMKERGYERSAKKCKEKWENMNKYYRRVTEGGQKQPEHSKTRSYFEKLGNFY 389

BLAST of Cla97C01G010480 vs. TAIR10
Match: AT1G76890.2 (Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 129.4 bits (324), Expect = 5.4e-30
Identity = 76/217 (35.02%), Positives = 116/217 (53.46%), Query Frame = 0

Query: 203 LVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXXXXXXXXXXXXXXXXXS----RS 262
           L  ++M+KQE+M ++ ++ +E +EKER  REEAW                   S    + 
Sbjct: 268 LTKELMEKQEKMQKRFLETLEYREKERISREEAWRVQEIGRINREHETLIHERSNAAAKD 327

Query: 263 LAIISLIQNLLGHEIQ-----ISRPVENQCTEEDGGESSIQKELKS-------------- 322
            AIIS +  + G + Q       +P + +  + D   +   KE ++              
Sbjct: 328 AAIISFLHKISGGQPQQPQQHNHKPSQRKQYQSDHSITFESKEPRAVLLDTTIKMGNYDN 387

Query: 323 ----DPSGRRWPQAEVQSLISLRTSLEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCK 382
                PS  RWP+ EV++LI +R +LE  ++  G+KG +WEEIS  M+++GY RSAK+CK
Sbjct: 388 NHSVSPSSSRWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCK 447

Query: 383 EKWENMNKYFKRTIVTGKASIANGKTCPYFQELDILY 393
           EKWEN+NKYFK+   + K    + KTCPYF +L+ LY
Sbjct: 448 EKWENINKYFKKVKESNKKRPLDSKTCPYFHQLEALY 484

BLAST of Cla97C01G010480 vs. TAIR10
Match: AT1G76880.1 (Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 125.9 bits (315), Expect = 6.0e-29
Identity = 76/242 (31.40%), Positives = 119/242 (49.17%), Query Frame = 0

Query: 199 FVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXXXXXXXXXXXXXXXXXSRS 258
           F E L+ +V+DKQEE+ R+ ++ +EK+E ER VREE+W                   S S
Sbjct: 257 FFERLMKQVVDKQEELQRKFLEAVEKREHERLVREESWRVQEIARINREHEILAQERSMS 316

Query: 259 ----LAIISLIQNLLGHE------------------------------------------ 318
                A+++ +Q L   +                                          
Sbjct: 317 AAKDAAVMAFLQKLSEKQPNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 376

Query: 319 -IQISRPVENQCTEEDGGESSIQKELKSDPSGRRWPQAEVQSLISLRTSLEHKFRATGSK 378
             Q      +    ++GG+ ++     +  S  RWP+ E+++LI LRT+L+ K++  G K
Sbjct: 377 XXQAVVSTLDTTKTDNGGDQNMTP--AASASSSRWPKVEIEALIKLRTNLDSKYQENGPK 436

Query: 379 GSIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVTGKASIANGKTCPYFQELDIL 394
           G +WEEIS  M+++G+ R++K+CKEKWEN+NKYFK+   + K    + KTCPYF +LD L
Sbjct: 437 GPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKRPEDSKTCPYFHQLDAL 496

BLAST of Cla97C01G010480 vs. TAIR10
Match: AT1G33240.1 (GT-2-like 1)

HSP 1 Score: 111.3 bits (277), Expect = 1.5e-24
Identity = 97/273 (35.53%), Positives = 130/273 (47.62%), Query Frame = 0

Query: 201 ENLVMKVMDKQEEMHRQLIDMIEKKEKER----TVREEAWXXXXXXXXXXXXXXXXXXXS 260
           E LV +VM KQ  M R  ++ +EK+E+ER           XXXXXXXXXXXXXXXXXX S
Sbjct: 271 EGLVRQVMQKQAAMQRSFLEALEKREQERLXXXXXXXXXXXXXXXXXXXXXXXXXXXXAS 330

Query: 261 RSLAIISLIQNLLGHEIQI----------------------------------------- 320
           R  AIISLIQ + GH IQ+                                         
Sbjct: 331 RDAAIISLIQKITGHTIQLPPSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 390

Query: 321 ---------------SRPVENQCTEEDGGESSIQKELKSDPSGRRWPQAEVQSLISLRTS 380
                                                    S  RWP+AE+ +LI+LR+ 
Sbjct: 391 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSSSRWPKAEILALINLRSG 450

Query: 381 LEHKFRATGSKGSIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVTGKASIANGK 414
           +E +++    KG +WEEIS  M++MGY R+AK+CKEKWEN+NKY+K+   + K    + K
Sbjct: 451 MEPRYQDNVPKGLLWEEISTSMKRMGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAK 510

BLAST of Cla97C01G010480 vs. TAIR10
Match: AT5G28300.1 (Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 93.6 bits (231), Expect = 3.3e-19
Identity = 82/261 (31.42%), Positives = 120/261 (45.98%), Query Frame = 0

Query: 196 LEHFVENLVMKVMDKQEEMHRQLIDMIEKKEKERTVREEAWXXXXXXXXXXXXXXXXXXX 255
           L+ F E LV  ++ +QEEMH++L++ + KKE+E+  R    XXXX               
Sbjct: 299 LKGFCEGLVRNMIAQQEEMHKKLLEDMVKKEEEKIARXXXXXXXXIERVNKEVEIRAQEQ 358

Query: 256 S----RSLAIISLIQNLLGHEIQISRPVENQCTEEDGGES-------------------- 315
           +    R+  II  I     H++ +   V+N  +      S                    
Sbjct: 359 AMASDRNTNIIKFISKFTDHDLDV---VQNPTSPSQDSSSLALRKTQGRRKFQTSSSLLP 418

Query: 316 ---------SIQKEL------------------KSDPS---GRRWPQAEVQSLISLRTSL 375
                    +I K L                  KSD     G+RWP+ EV +LI++R S+
Sbjct: 419 QTLTPHNLLTIDKSLEPFSTKTLKPKNQNPKPPKSDDKSDLGKRWPKDEVLALINIRRSI 478

Query: 376 ------EHKFR---ATGSKG-SIWEEISVEMQKMGYKRSAKKCKEKWENMNKYFKRTIVT 393
                 +HK     +T SK   +WE IS +M ++GYKRSAK+CKEKWEN+NKYF++T   
Sbjct: 479 SNMNDDDHKDENSLSTSSKAVPLWERISKKMLEIGYKRSAKRCKEKWENINKYFRKTKDV 538

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004139609.1  3.6e-221  92.58         PREDICTED: trihelix transcription factor GT-2 [Cucumis sativus] >KGN65026.1 hypo...
XP_023001567.1  2.2e-210  85.01         trihelix transcription factor GT-2 isoform X1 [Cucurbita maxima]
XP_023520409.1  1.1e-209  84.98         trihelix transcription factor GT-2 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022927335.1  4.1e-209  84.56         trihelix transcription factor GT-2 isoform X1 [Cucurbita moschata]
XP_023001568.1  4.4e-179  85.75         trihelix transcription factor GTL1 isoform X2 [Cucurbita maxima]

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0LYK7|A0A0A0LYK7_CUCSA  2.4e-221  92.58         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G181390 PE=4 SV=1
tr|A0A1S3CJS0|A0A1S3CJS0_CUCME  4.4e-151  87.61         trihelix transcription factor GT-2 OS=Cucumis melo OX=3656 GN=LOC103501749 PE=4 ...
tr|A0A2N9GR49|A0A2N9GR49_FAGSY  1.3e-70   45.86         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29855 PE=4 SV=1
tr|A0A2P5EXN0|A0A2P5EXN0_9ROSA  6.3e-65   44.60         Octamer-binding transcription factor OS=Trema orientalis OX=63057 GN=TorRG33x02_...
tr|A0A2I4HJQ6|A0A2I4HJQ6_9ROSI  2.7e-63   43.65         trihelix transcription factor GTL1 isoform X1 OS=Juglans regia OX=51240 GN=LOC10...

Match Name             E-value  Identity (%)  Description
sp|Q39117|TGT2_ARATH   9.8e-29  35.02         Trihelix transcription factor GT-2 OS=Arabidopsis thaliana OX=3702 GN=GT-2 PE=2 ...
sp|Q9C882|GTL1_ARATH   2.8e-23  35.53         Trihelix transcription factor GTL1 OS=Arabidopsis thaliana OX=3702 GN=GTL1 PE=1 ...
sp|Q8H181|GTL2_ARATH   6.0e-18  31.42         Trihelix transcription factor GTL2 OS=Arabidopsis thaliana OX=3702 GN=At5g28300 ...
sp|Q9LZS0|PTL_ARATH    2.1e-15  42.70         Trihelix transcription factor PTL OS=Arabidopsis thaliana OX=3702 GN=PTL PE=2 SV...
sp|Q9SDW0|TGT3A_ARATH  1.5e-05  27.27         Trihelix transcription factor GT-3a OS=Arabidopsis thaliana OX=3702 GN=GT-3A PE=...

Match Name   E-value  Identity (%)  Description
AT5G47660.1  2.0e-40  38.24         Homeodomain-like superfamily protein
AT1G76890.2  5.4e-30  35.02         Duplicated homeodomain-like superfamily protein
AT1G76880.1  6.0e-29  31.40         Duplicated homeodomain-like superfamily protein
AT1G33240.1  1.5e-24  35.53         GT-2-like 1
AT5G28300.1  3.3e-19  31.42         Duplicated homeodomain-like superfamily protein
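
As a small illustration of working with these summary rows, the Python sketch below keeps only the TAIR10 hits at or below an E-value cutoff and orders them from most to least significant. The cutoff of 1e-25 and the helper name `best_hits` are arbitrary assumptions chosen for the example, not thresholds used by the database.

```python
# (match name, E-value, percent identity) copied from the TAIR10 summary rows
tair10_hits = [
    ("AT5G47660.1", 2.0e-40, 38.24),
    ("AT1G76890.2", 5.4e-30, 35.02),
    ("AT1G76880.1", 6.0e-29, 31.40),
    ("AT1G33240.1", 1.5e-24, 35.53),
    ("AT5G28300.1", 3.3e-19, 31.42),
]

def best_hits(hits, max_evalue=1e-25):
    """Keep hits with E-value <= max_evalue, smallest E-value first."""
    kept = [h for h in hits if h[1] <= max_evalue]
    return sorted(kept, key=lambda h: h[1])

for name, evalue, identity in best_hits(tair10_hits):
    print(name, evalue, identity)
```

With the assumed cutoff, AT5G47660.1, AT1G76890.2 and AT1G76880.1 are retained; a smaller E-value means a match of that score is less likely to arise by chance.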
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR017877  Myb-like_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0046983      protein dimerization activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G010480.1  Cla97C01G010480.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description   Source       Source Term       Source Description                Alignment
None       No IPR available  COILS        Coil              Coil                              coord: 208..228
None       No IPR available  PFAM         PF13837           Myb_DNA-bind_4                    coord: 304..389; e-value: 1.8E-20; score: 73.0
None       No IPR available  GENE3D       G3DSA:1.10.10.60                                    coord: 306..368; e-value: 6.4E-26; score: 92.1
None       No IPR available  MOBIDB_LITE  mobidb-lite       disorder_prediction               coord: 289..303
None       No IPR available  MOBIDB_LITE  mobidb-lite       disorder_prediction               coord: 282..305
None       No IPR available  PANTHER      PTHR21654         FAMILY NOT NAMED                  coord: 2..123
None       No IPR available  PANTHER      PTHR21654:SF7     DNA-BINDING PROTEIN-LIKE PROTEIN  coord: 2..123
None       No IPR available  PANTHER      PTHR21654         FAMILY NOT NAMED                  coord: 164..413
None       No IPR available  PANTHER      PTHR21654:SF7     DNA-BINDING PROTEIN-LIKE PROTEIN  coord: 164..413
None       No IPR available  CDD          cd12203           GT1                               coord: 304..368; e-value: 1.59684E-28; score: 105.825
IPR017877  Myb-like domain   PROSITE      PS50090           MYB_LIKE                          coord: 298..362; score: 7.213
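
The Pfam, Gene3D, CDD and PROSITE predictions all fall in the same C-terminal region of the protein. The Python sketch below computes the length of each predicted span and the interval shared by all four. It assumes the `coord` values are 1-based and inclusive, which the page does not state; the helper names `span_length` and `shared_core` are illustrative.

```python
# (source, source term, start, end) copied from the InterPro annotations
domain_calls = [
    ("PFAM", "PF13837", 304, 389),
    ("GENE3D", "G3DSA:1.10.10.60", 306, 368),
    ("CDD", "cd12203", 304, 368),
    ("PROSITE", "PS50090", 298, 362),
]

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

def shared_core(calls):
    """Return the interval covered by every call, or None if there is none."""
    start = max(c[2] for c in calls)
    end = min(c[3] for c in calls)
    return (start, end) if start <= end else None

for source, term, start, end in domain_calls:
    print(source, term, span_length(start, end))
print(shared_core(domain_calls))  # (306, 362)
```

Under that assumption, the Pfam Myb_DNA-bind_4 call covers 86 residues, and residues 306..362 are common to all four predictions.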

The following gene(s) are paralogous to this gene:

None