Tan0012817 (gene) Snake gourd v1

Overview
NameTan0012817
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor EMB1444-like isoform X1
LocationLG06: 10028439 .. 10036727 (+)
RNA-Seq ExpressionTan0012817
SyntenyTan0012817
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTGTGTGTGAGTTTCTGTTCTGTTCTGTTATTTTGCCATCACCAATTGGAAAAAAGCGAACAAAATCCAACAGAAATCTCAGAGCTTTCTCTCTCTTTCAAGCATTTCAACCTTTTTTGCTCTCTCTCTAGAGTTCTACTGCTTCCCAGTGTTGTGTGAAGAGGAAAGTGAATTAAGAAAAAGAAAGAGACAATGCAAGGAAATGTAGATCAGGGCGTTGAATTTCTCTGTGATTTTCATCGTTTCTTTCGGTCAAAAAGTTTCAAAACTGTTTCTCTCTGTAAAGGGGCTTGTTGAAATCCATCCCTTTTCTCTTTGACGCTTTTTCGGTTTCTGGGTTTTGGTTTTGGGTTGAGCCTCGAAAGCACTTCTTCTTAGACCCTTCTGTTGTTTTTGTTGTTGTTCTCTAGTTTTATGACGATTTCATAAGTTCATCCATGTCGTCCGACATAAGCTAGGGTTTAATTTTCCCCTTGTTTGAGTGATCCGTGGTGGGTCTGAGAGAAAAAGAGCTATTACTTGAGTGTTTGTTTGAGATTCTTCTCTGCTGCTGTTGTTGTTGTCCATGGATTGTTATACTTATTCCAACTCTAATGGTGTCCACTGTGTTAGAAGCTATGTTGTTGTTGCTGAAATTTCATTGATATCATCTTATTTGTACTGAATTTGTTGAAGAATTCTTGTATTTGAGTTGGGGGTTTTTTTGGGGGGCTGTTTGAGTTTTAAGGGGGATGGGAACTGATCGTCTGCTATTACCAACAGTTGGACCGCCAATAAAACGGCGTGCAGGATTGAGGAGAAAGCAGGCTGGTAGAGGCTCGTATAGGGGAAGCTAGGAAAATTGGGTGTTTTCTTTTCAAGGTGTTTAGAGTTGGGGGTTTTCAAGGTGGTTTTAGGAACAAGCCCTGTTTTGTAGGGTATTCTTTTTAGGGGGGCGTTTAGACAAAATTGGTTCAGTGAGAGATGGGTTCTACTGACTTGCACCAAATACTCAAAAGCCTTTGTTGCAACTCGGAATGGAAGTATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGCATGTAAGTCATTTTCCTTTCCACTTTCTTGTTATTTCATTGTGAAAGTTGGGGATTTGAATAATGAACCCTTTTGTCCTTTCTGATTCTTGGATGTATACTGTGAAGTGAAAAAAAACACTTGTTTTGTTATTCTGATTCTTGGATGTATACTAAGTAGAGAAAGTTGATGACCAACCTGAGCATAACTTATCTGGATAAGACATTATAACCTCGACCAAAAGGTTAAAGGTTTAAATACTTACCCCCACTTATCATCAAACTAAATTAAAATTAGAGAAATTTGATGGTTATTCAATTCAAGGAAGAATTTACTAGTATTTCTCTGTTAATTTTTGCCACCATGCTTTGCATTTTGTGGTTCAATTACACTGGAGTATCCCTTCTTTCTCTTTCCTCTCTTTTTTGGATCAATGGTCGTTTATTTATATTATGTGCAATGCTAAAATTTTTGAATGGCTATTCCTAGTGTTTGTTAGTCAGTGGATGATGGCTTTATGCTGTCCAACAATTCCCTTCCCCTGTTTTGATCTTTCTATTGGTATACTGTATACCAGTTCAGGACAGGCAATTGATATAAATGTTTGGTCGTTATCATTTAAATGGATGTTGAATTCAGGGTGTTGACTTGGGAAGATGGCTACTATGACAATTCTGAACAACATGATCCCTCAGAGAGCAAGTTTTACAGAAAAACGCTCGAGAAGTTTCATGATGGACATTATTCACATGACCCTCTTGGATTAGCTGTGGCAAAAATGTCATATCATGTATATTCTCTTGGGGAAGGGTAAACATCTTAACATTTGTCATTCTATATGGCTATTTTACGATTAGTTCAAGCCAATTTTGGTTTAAACGCCATTAATTTTGGATATAATTTTATAATTTATCCTTATTTCAAGTTTGCTTAAAGAGGAAACATAATGGACAATAACTTTCATTAATTAAATTTAATTTTTTTTTCTGCTTTATGTGACTTCAATTAGCTTATATTATGATTAAAAGTTTGAGACAATGGATTAAATCAGTTTCTCTGGGAATCTGTATAAAGTTCTAAGTGGCTTCACCTTGTTGTAATTATTATTTCCTTTTGCAGGATTGTTGGACAAGTAGCGGTTAATGGAAAACATCAGTGGATTACTGCAGATGAACAAATACCAAATTTCTCTTCAACAATTGAGGTGCTACACTTTCCCGTTCTGCAATAATTTAAGTTAATGAAGTTTGTCTTATCATTTCAGGGATATAATAAAATGCTTGTTTTTCGATTATTTGGGATTGTGTTTCATTACTCATTTTCTGATCATTTTTTTTAATAGTACTGCGATGGTTGGCAAACACAATTTTCAGCTGGCATTAAGGTATGGCAGCCTTTCTTTCCCTCTCTCACACATTCTCTCTCTCTCCCTCTCTCTCTCTCTCTCTCACACACACACACACAGATGCTTACACCAAGAAAACTAAGAAAAAGTATTTTCCTTGATTGCCCACATTTAGTGACTAAGAGAGTATTTCTCCATCGGATTGTAATTTCGACTTCGCACGTTTGAGTGTACATGTTGTGCTACTCGACACGTTGAATTTCTTCTCTTTAGTGTGAACCCCCCTTTCTTAAAGGGCAGCAGTGTATTATTACCCATAATATATGATTGCAATAATATTTGTAGGTCTTTTGAATTAAGATAACATTTATTTTTTCTCATTATCATATTCATCAACTCAAATCAGCATTTTTTTTTGTTTTACACCTCGTAATATCTGGTTTATTTCATCAGCCATATACCTGAAGTCTGGATACTACATTCTAAATGCTATTCCTTTTGACCTGATCAAACTTTGTAGATCTACTTTGCCTAACTAGGTCAAGGAATCAAATAGTGTACTTTGATAAAACAGGGTGGGTGTTTGGTCATGTATTCAACTGTTCATATGCTGTCTTGATGAGAAGTTCGAGAATAGATCCAGCTTACTTAGATTCTATAGTTTCCTTTGGGCTTCTGTTTCTAAACCGTTCTGTAATTATTTGCTTAGTCTGATTTTGTTAGATCAAACTTTGTAGATCTACTTTGCCTAACTAAGTCAAGGAATCAAATAGTCTACTTTGATAAAACAGGGTGGGTTTGGTCATGTATTCAACTGTTCATATGCTGTCTTGATGAGAAGTTCGAGAATAGATCCAGCTTACTTAGATTCTATAGTTTCCTTTGGGCTTCGGTTTCTAACCTGTTCTGTAATTATTTGCTTAGTCTGATTTTGTTAGATTGGAGCCCGTTTATTTAGGTGGCTCCCTTTTTTTGTGGGCTGTGGTTTTTTGTTTCCCTTGTATTTTTTCATTTTTTCTCAATGAAAGTGTGATTTTTCATTAAAAAAAAAGGATCCAGCTTACTCAAGTTTTCTTTCTGCCAAATTTCTTTATATATTTTGCTAACTCATTACTTAATTGTTATTTATATTCTTTGTAGACTATTGTTGTTGTAGCAGTTGTCCCACATGGTGTTTTACAGCTTGGATCTTTAGATAAAGTAAGAGCATCTTCCTTCTGGTAGATTGTGAAAGTTTGTGGATTATGATCCTTGCTGAACGTCAACTTTCTGTTTGGGTTTGTAGCATATATAATTATTTTCAATCCTAACTGTAACCGAACTTGTTTCATACAATAGGCCTTTCTGACATTATAGTTTATTTTAATTTCTTGTAGCCTTACATTCCTTACAATTTTTAGTATTTGGTATCTGTTATATTTTAGTAGCTTTGCTACCGATGCATTAAATAATAGGTTAAAAACATGGACGCTTTTCTATGTGTAAATTGGAACAATAAAGAGTCTGGAGCCTACATTTTGTTTCCATTATTAGTGGTGGAAATATATATTATGTTTCAGTAAATTTTTTACCCATGCAATAAATAACGGGGTTAAAAATACACGTGTTTTTTCTATTTGTAAATTGGAGCAGCGCTGCTGGAGATTGCATTTTGTTTCATCACATTTTAGTATTTCTTTCATTAAGTTGAATGGAGTCTACTTGATGGGTTCTTCCTAGATTAGTCAATTTGTGCATTCTACTAGAAGTCCCAAAAAACCCATGTCGTGGATTCATGTGTCTTATTATTGCTATTTGGGCTTCTTATAATAGAGTATCTTTTTGTAGTATTTCCCAGTTTACGATCTCATGTGACTGGAATACCAATTTCTAAGCGCCACCGTCTAGATTTAGGGTAGGGTGGGCACCTGGTACCTAATTTCACCCTTTTGTAACTACTATACTTGTTTATAAAACTGCGTCATCTGAAAACAATTACTCAATCTACTTGGAGAAAATACTTGATTTAACTTGCATGGATCAGTGCAGAGATATCCTATAAATTTCAGTTCGTGGCTTCCTAGATTTGAAGGTCATCTTACTCTAGTTTATTGTATCTGTGTGCATTTTATTATTGCAATTAGGTCACTGAGGATGTCAATCTTGTGGCTCGCATCAGAAATGTCTTTCTAACTCTTCAGGAGTCTTCAGCTGGGCATATGAAGCCAATGCATTCTTGTAAGAGCTCAGGATACGTGGTACCTCTCTTTCTCCTATCTCTTACCTTTCCTCTTCCCCCTTTTCCCTCGTTTAGACAATTAAAGAATTTGTCTGTCTTGGTTTGATATGATAGGGTATGTACTAGTTACGCTAATGATATCAAAATTTTCTTTTCTGTAGGCAGACATTCCATCTAGAAGCTTAGCAACAGAGGTGAGTCATGGTTTCTTTATCAACTTAGACACTTGTGAAAACAGTGATAGTACAGATATTTGTTACCTAAAATTTCCTTCCTGTGATAAAACTTGGAACTTTTCTTCAAATAATCTACTTCCTGTACAGAAAAATGAAGTTGCAAGGGTCAGCAAGAATGTGGGAATGGAGTTGTCAGGAACTGGGGGTATTGAATCTCTACAATCTAAACCGGATGCTATCAATGTAGAGAGTTTGAAGTCACAAGTGAGGTTGCTTGATGACAGGATATGTGGAGGGGAGCCTAGTGGGTGCAAAGATATGTCAGTTGGTTTGAAACACAAAGTCAATGTACAATCGCAAAACTCTATGGTTAGCATATGTGGTAATTTACTTCCAGCTGAAAAGATCATAACAAATGAAGCATACTACCCTATGAATCCTCACGCCTCTTCTGTCTATGATGGAGTTAATCACGATGGGATGTTTAGCCGAACCAACCCTTCAGAAATGTACTTGCAAAATGACGTGGAAGCATCGGAAACTATTGATATGTATCCATCAAACACATCATTGAAGTTCCCTGCTGGTTATGAGCTACATGAAGTACTCGGACCTGCTTTTCTGAAAGATGCTCTTTATCTTGACTGGCAAACAGAGTACGTTTTTGGTGGTAAAACTTTCGAGTTATCTGAGGGAATGAGTGGTAGCCAGTTGACATCCGATTCACCAACGGAGCATCTTCTAGAGGCAGTAGTGGCTGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAGTCTGGACAGTCTCTGTTAACGACTGAAAAAATTCCTGAGCCTTCCACAAATGTGACAACATCTGCTTATTCAGAAGGTTACTCTATGGGTCAAAGTCAAACATCTTTTATTGGAGAGGACATGCAGAACTCTTTAAGCTCATCAGGAGTATGTGGGGTGATGTCCCCAAAAGGGTTTTCGTCGACGTATTCTGGTACCGGCAGTGAGCACCTGGAGCGGTCTTCAGAACCTGCAAAGAACAGTAAAAGAAGGGCAAGACCTGGTGAGAGTTGTAGGCCTAGGCCGAGGGACAGACAATTGATCCAGGATCGCATCAAAGAACTGCGGGAGCTAGTACCAAATGGAGCAAAAGTAACCTCTGAGAAGCCATTCATTCATCTTTCCTTTTTGATAGAAATAAAATGTTTCATTACTATTTTGCATCATCCACTAAAATTTTCTTGCTTGTATCACAGTGTAGTATTGATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCACCAAGCATGCTGACAAGCTAAACAAATGCGCCAACATGAAGGTAATTGGCTAGAAACCCGAGTGTAACAGGAAAGTTATTTATTTTTTCTACATGATTAGAAAGCCTGTTCCACTTTATACCTTTTCCGTTGTTGTGCCAATTTGGCTCAAGCAAATGCCAATGTCATGCCTATTATCCAAAGAAAGTCAGTCTGTAATTACGATGAGTAATAACAATTAGGTTAAACTGCAAGTTTAGTCCCTGAACTTTTAGGTTTGTGTCTATTTGGTCCATGAACTTTAAAAGTGTCTTATAGGTCCTTGAACTTTTAATTTTGTGTCCAACACAACTTTAGAAAGTATCTAATAGATCCTTAAATTTTCAATTTTGTGTCCAATAGATCTCTAAATTTTCAATTGTGTCAATAAGTCCTTGACTTATTCAATATTTTTTAAAATTCACGAACCTACTAGACACAAAATTGCAAGTTTAGGGTCCCATTAGACATAAAATTCAATTTTATATCTAATAGATAAGATATTTTTTTTTAAAATTGGAATATGTCAAGAGCCTATTAGACACAAAATTGAAGTTCGAGGATCTATTAAATATTTTTAATATCTATTAGACAAAAAATCGAAAGTTCAGTTCATGGACCAAACAAACACAAACATGAAAAAGTTCAGGGACTTAACTTGAAATATAGCTTGACAACAATTATAAAAATGACCCTTAATGCCACACCTCGTGTCCCCTTGTGATAGTAGGCTTAGACTTCCAACTTGTCTGCACTTGAAGTAACATTTTTACGTGAATTTCAAAAAAAAAAAAAAAAAGAAAGAGAGAAAGAAAGAAAGTTACATTTTTATGTTATAATCTTGAACAGCGTTTTCCTCTACTTACATGTTATGTATTATTAGTTGCATCAGAAGGAAAATGGCATGCTAGGATCCTCAAATACTGATCAGGGTTCAAGCTGGGCAGTGGAGGTCGGTGGTCAACTTAAAGTTTGTTCGATAATTGTGGAGAATCTAAACAAGAACGGACAGATTCTTGTAGAGGTATGAATTATGAAAATTCATCTTTTATTTATCATGAAGTTTCTTATTTAAAAAAAGAAAAGAAAAGAAACTTTCATTTAATATAGAGCAAAAGATTGGTCAATTTCTCTTGGGCAAAACAAGTATGTATTTGTTGTCCATCTTGTTTCTTCAGTTTAACTCATTAATTGGCTTTAGTTTTCCGTCATTGGTTAGTTACTTTTTCCTTGGTTTAACATAATGTTAAGTGATATTCCTGTATGCAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATACTAAAAGGCATAACAGAAGCTCATGGCGAGAAGACATGGATTTGTTTTGTGGTTGAGGTATAAATAAGTCAAATTACCAGCACATGAGAGTTATTCTTTGCCTGCAACACGTTACATGTTTTGAAAGAACTTGATAGATAGATTTGACTTTATATGCTAGCATTTCGGTAAAAGATAAAGATATTTAGTTTACTGGATGAATATCTGTATCTTGGGGCATCCCTAATTGCATAGAATAAAACCATTTTTAATCTGTAGGGTAGTGAGAAAGCCAACTGATATTCATTGTACTTGACAATCATTGTTTGATATCTGGTTTTTAACTATTGTAGGGGGAGAATAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAAATACTACAGCGCAGTAATACAATGTGACAGCTTCAACTGCTGAGCTTTTTGCAACTGTTTGTGTAGATTCCTCTGTAAAATATAGTGTGAGTTGCTTGTTGTGACTATTGATTCATCTGTCTCCTGCTTTATGGAGTGCAGCACTTTTCTCAGAGATTCACCCTTCTCCATACAGTCCTCAAAAGTTGAACAAGGAAAAAGCGCAAAAAAAAAAAAAAAAATCTATAGAAGGGTGCACTAAAAAGAAAACCAAGTCTCTGTAAAGAGCACTATTGTTTCTAAGTGTGAAAAACTATTTTTGTTTTTTGGTGTGTTTTCTAGCCAGTGTTGTCATTTGTTTGAAAAAGCTTTCTACTTGATATAGTGCAACTATTCTGTATTTGTAGAACCCAATTTTAATAGTATAAGTTCATTTGAACAATACAAGTGTTTTATTTTTTTCC

mRNA sequence

GTTGTGTGTGAGTTTCTGTTCTGTTCTGTTATTTTGCCATCACCAATTGGAAAAAAGCGAACAAAATCCAACAGAAATCTCAGAGCTTTCTCTCTCTTTCAAGCATTTCAACCTTTTTTGCTCTCTCTCTAGAGTTCTACTGCTTCCCAGTGTTGTGTGAAGAGGAAAGTGAATTAAGAAAAAGAAAGAGACAATGCAAGGAAATGTAGATCAGGGCGTTGAATTTCTCTGTGATTTTCATCGTTTCTTTCGGTCAAAAAGTTTCAAAACTGTTTCTCTCTGTAAAGGGGCTTGTTGAAATCCATCCCTTTTCTCTTTGACGCTTTTTCGGTTTCTGGGTTTTGGTTTTGGGTTGAGCCTCGAAAGCACTTCTTCTTAGACCCTTCTGTTGTTTTTGTTGTTGTTCTCTAGTTTTATGACGATTTCATAAGTTCATCCATGTCGTCCGACATAAGCTAGGGTTTAATTTTCCCCTTGTTTGAGTGATCCGTGGTGGGTCTGAGAGAAAAAGAGCTATTACTTGAGTGTTTGTTTGAGATTCTTCTCTGCTGCTGTTGTTGTTGTCCATGGATTGTTATACTTATTCCAACTCTAATGGTGTCCACTGTGTTAGAAGCTATGTTGTTGTTGCTGAAATTTCATTGATATCATCTTATTTGTACTGAATTTGTTGAAGAATTCTTGTATTTGAGTTGGGGGTTTTTTTGGGGGGCTGTTTGAGTTTTAAGGGGGATGGGAACTGATCGTCTGCTATTACCAACAGTTGGACCGCCAATAAAACGGCGTGCAGGATTGAGGAGAAAGCAGGCTGGTAGAGGCTCGTATAGGGGAAGCTAGGAAAATTGGGTGTTTTCTTTTCAAGGTGTTTAGAGTTGGGGGTTTTCAAGGTGGTTTTAGGAACAAGCCCTGTTTTGTAGGGTATTCTTTTTAGGGGGGCGTTTAGACAAAATTGGTTCAGTGAGAGATGGGTTCTACTGACTTGCACCAAATACTCAAAAGCCTTTGTTGCAACTCGGAATGGAAGTATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGCATGGTGTTGACTTGGGAAGATGGCTACTATGACAATTCTGAACAACATGATCCCTCAGAGAGCAAGTTTTACAGAAAAACGCTCGAGAAGTTTCATGATGGACATTATTCACATGACCCTCTTGGATTAGCTGTGGCAAAAATGTCATATCATGTATATTCTCTTGGGGAAGGGATTGTTGGACAAGTAGCGGTTAATGGAAAACATCAGTGGATTACTGCAGATGAACAAATACCAAATTTCTCTTCAACAATTGAGTACTGCGATGGTTGGCAAACACAATTTTCAGCTGGCATTAAGGTCACTGAGGATGTCAATCTTGTGGCTCGCATCAGAAATGTCTTTCTAACTCTTCAGGAGTCTTCAGCTGGGCATATGAAGCCAATGCATTCTTGTAAGAGCTCAGGATACGTGGCAGACATTCCATCTAGAAGCTTAGCAACAGAGAAAAATGAAGTTGCAAGGGTCAGCAAGAATGTGGGAATGGAGTTGTCAGGAACTGGGGGTATTGAATCTCTACAATCTAAACCGGATGCTATCAATGTAGAGAGTTTGAAGTCACAAGTGAGGTTGCTTGATGACAGGATATGTGGAGGGGAGCCTAGTGGGTGCAAAGATATGTCAGTTGGTTTGAAACACAAAGTCAATGTACAATCGCAAAACTCTATGGTTAGCATATGTGGTAATTTACTTCCAGCTGAAAAGATCATAACAAATGAAGCATACTACCCTATGAATCCTCACGCCTCTTCTGTCTATGATGGAGTTAATCACGATGGGATGTTTAGCCGAACCAACCCTTCAGAAATGTACTTGCAAAATGACGTGGAAGCATCGGAAACTATTGATATGTATCCATCAAACACATCATTGAAGTTCCCTGCTGGTTATGAGCTACATGAAGTACTCGGACCTGCTTTTCTGAAAGATGCTCTTTATCTTGACTGGCAAACAGAGTACGTTTTTGGTGGTAAAACTTTCGAGTTATCTGAGGGAATGAGTGGTAGCCAGTTGACATCCGATTCACCAACGGAGCATCTTCTAGAGGCAGTAGTGGCTGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAGTCTGGACAGTCTCTGTTAACGACTGAAAAAATTCCTGAGCCTTCCACAAATGTGACAACATCTGCTTATTCAGAAGGTTACTCTATGGGTCAAAGTCAAACATCTTTTATTGGAGAGGACATGCAGAACTCTTTAAGCTCATCAGGAGTATGTGGGGTGATGTCCCCAAAAGGGTTTTCGTCGACGTATTCTGGTACCGGCAGTGAGCACCTGGAGCGGTCTTCAGAACCTGCAAAGAACAGTAAAAGAAGGGCAAGACCTGGTGAGAGTTGTAGGCCTAGGCCGAGGGACAGACAATTGATCCAGGATCGCATCAAAGAACTGCGGGAGCTAGTACCAAATGGAGCAAAATGTAGTATTGATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCACCAAGCATGCTGACAAGCTAAACAAATGCGCCAACATGAAGTTGCATCAGAAGGAAAATGGCATGCTAGGATCCTCAAATACTGATCAGGGTTCAAGCTGGGCAGTGGAGGTCGGTGGTCAACTTAAAGTTTGTTCGATAATTGTGGAGAATCTAAACAAGAACGGACAGATTCTTGTAGAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATACTAAAAGGCATAACAGAAGCTCATGGCGAGAAGACATGGATTTGTTTTGTGGTTGAGGGGGAGAATAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAAATACTACAGCGCAGTAATACAATGTGACAGCTTCAACTGCTGAGCTTTTTGCAACTGTTTGTGTAGATTCCTCTGTAAAATATAGTGTGAGTTGCTTGTTGTGACTATTGATTCATCTGTCTCCTGCTTTATGGAGTGCAGCACTTTTCTCAGAGATTCACCCTTCTCCATACAGTCCTCAAAAGTTGAACAAGGAAAAAGCGCAAAAAAAAAAAAAAAAATCTATAGAAGGGTGCACTAAAAAGAAAACCAAGTCTCTGTAAAGAGCACTATTGTTTCTAAGTGTGAAAAACTATTTTTGTTTTTTGGTGTGTTTTCTAGCCAGTGTTGTCATTTGTTTGAAAAAGCTTTCTACTTGATATAGTGCAACTATTCTGTATTTGTAGAACCCAATTTTAATAGTATAAGTTCATTTGAACAATACAAGTGTTTTATTTTTTTCC

Coding sequence (CDS)

ATGGGTTCTACTGACTTGCACCAAATACTCAAAAGCCTTTGTTGCAACTCGGAATGGAAGTATGCTGTCTTTTGGAAACTTAAACATCGAGCTCGCATGGTGTTGACTTGGGAAGATGGCTACTATGACAATTCTGAACAACATGATCCCTCAGAGAGCAAGTTTTACAGAAAAACGCTCGAGAAGTTTCATGATGGACATTATTCACATGACCCTCTTGGATTAGCTGTGGCAAAAATGTCATATCATGTATATTCTCTTGGGGAAGGGATTGTTGGACAAGTAGCGGTTAATGGAAAACATCAGTGGATTACTGCAGATGAACAAATACCAAATTTCTCTTCAACAATTGAGTACTGCGATGGTTGGCAAACACAATTTTCAGCTGGCATTAAGGTCACTGAGGATGTCAATCTTGTGGCTCGCATCAGAAATGTCTTTCTAACTCTTCAGGAGTCTTCAGCTGGGCATATGAAGCCAATGCATTCTTGTAAGAGCTCAGGATACGTGGCAGACATTCCATCTAGAAGCTTAGCAACAGAGAAAAATGAAGTTGCAAGGGTCAGCAAGAATGTGGGAATGGAGTTGTCAGGAACTGGGGGTATTGAATCTCTACAATCTAAACCGGATGCTATCAATGTAGAGAGTTTGAAGTCACAAGTGAGGTTGCTTGATGACAGGATATGTGGAGGGGAGCCTAGTGGGTGCAAAGATATGTCAGTTGGTTTGAAACACAAAGTCAATGTACAATCGCAAAACTCTATGGTTAGCATATGTGGTAATTTACTTCCAGCTGAAAAGATCATAACAAATGAAGCATACTACCCTATGAATCCTCACGCCTCTTCTGTCTATGATGGAGTTAATCACGATGGGATGTTTAGCCGAACCAACCCTTCAGAAATGTACTTGCAAAATGACGTGGAAGCATCGGAAACTATTGATATGTATCCATCAAACACATCATTGAAGTTCCCTGCTGGTTATGAGCTACATGAAGTACTCGGACCTGCTTTTCTGAAAGATGCTCTTTATCTTGACTGGCAAACAGAGTACGTTTTTGGTGGTAAAACTTTCGAGTTATCTGAGGGAATGAGTGGTAGCCAGTTGACATCCGATTCACCAACGGAGCATCTTCTAGAGGCAGTAGTGGCTGATGTTTGTCACAGTGGTTCTGATGTTAAAAGTGACACATCTCTATGCAAGTCTGGACAGTCTCTGTTAACGACTGAAAAAATTCCTGAGCCTTCCACAAATGTGACAACATCTGCTTATTCAGAAGGTTACTCTATGGGTCAAAGTCAAACATCTTTTATTGGAGAGGACATGCAGAACTCTTTAAGCTCATCAGGAGTATGTGGGGTGATGTCCCCAAAAGGGTTTTCGTCGACGTATTCTGGTACCGGCAGTGAGCACCTGGAGCGGTCTTCAGAACCTGCAAAGAACAGTAAAAGAAGGGCAAGACCTGGTGAGAGTTGTAGGCCTAGGCCGAGGGACAGACAATTGATCCAGGATCGCATCAAAGAACTGCGGGAGCTAGTACCAAATGGAGCAAAATGTAGTATTGATTCATTGCTGGAGCGCACAATCAAGCACATGTTATTCTTGCAAGGCATCACCAAGCATGCTGACAAGCTAAACAAATGCGCCAACATGAAGTTGCATCAGAAGGAAAATGGCATGCTAGGATCCTCAAATACTGATCAGGGTTCAAGCTGGGCAGTGGAGGTCGGTGGTCAACTTAAAGTTTGTTCGATAATTGTGGAGAATCTAAACAAGAACGGACAGATTCTTGTAGAGATGTTGTGTGAAGAATGCAGCCATTTTTTGGAGATAGCAGAGGCTATTAGAAGCTTGGGACTGACAATACTAAAAGGCATAACAGAAGCTCATGGCGAGAAGACATGGATTTGTTTTGTGGTTGAGGGGGAGAATAACAGAAACATACACAGGATGGATATCTTATGGTCACTTGTTCAAATACTACAGCGCAGTAATACAATGTGA

Protein sequence

MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQVRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGENNRNIHRMDILWSLVQILQRSNTM
Homology
BLAST of Tan0012817 vs. ExPASy Swiss-Prot
Match: P0C7P8 (Transcription factor EMB1444 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 9.8e-129
Identity = 319/750 (42.53%), Postives = 424/750 (56.53%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG T L QIL+S+C N++W YAVFWKL H + MVLT ED Y  N E+    ES       
Sbjct: 1   MGYT-LQQILRSICSNTDWNYAVFWKLNHHSPMVLTLEDVYCVNHERGLMPES------- 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
              H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+HQWI + E + +  ST++  
Sbjct: 61  --LHGGRHAHDPLGLAVAKMSYHVHSLGEGIVGQVAISGQHQWIFS-EYLNDSHSTLQVH 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           +GW++Q SAGI                    KV ED  LV  IR++FL L +  A H   
Sbjct: 121 NGWESQISAGIKTILIVAVGSCGVVQLGSLCKVEEDPALVTHIRHLFLALTDPLADHASN 180

Query: 181 MHSC--KSSGYVADIPSRSL-------------ATEKNEVARVSKNVG------------ 240
           +  C   S      IPS+ L             A +   +  VS+N              
Sbjct: 181 LMQCDINSPSDRPKIPSKCLHEASPDFSGEFDKAMDMEGLNIVSQNTSNRSNDLPYNFTP 240

Query: 241 ----MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS 300
               ME +    GG+E++Q      N           V++  K+QV + D  ++   E +
Sbjct: 241 TYFHMERTAQVIGGLEAVQPSMFGSNDCVTSGFSVGVVDTKHKNQVDISDMSKVIYDEET 300

Query: 301 GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHD 360
           G    S  L       S+N + +  G     + ++++    +Y  ++   S+V   +  D
Sbjct: 301 GGYRYSRELDPNFQHYSRNHVRNSGGTSALAMESDRLKAGSSYPQLD---STVLTALKTD 360

Query: 361 GMFSRTN----PSE----MYLQN----DVEASETIDMYPSNTSLKFPAGYELHEVLGPAF 420
             +SR N    PSE    +++++      E SE+  +     SL   +G EL E LGPAF
Sbjct: 361 KDYSRRNEVFQPSESQGSIFVKDTEHRQEEKSESSQLDALTASLCSFSGSELLEALGPAF 420

Query: 421 LKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTS 480
            K +       ++         +  MS S LT +S +E+LL+AVVA + +   +V+ + S
Sbjct: 421 SKTSTDYGELAKFE-SAAAIRRTNDMSHSHLTFESSSENLLDAVVASMSNGDGNVRREIS 480

Query: 481 LCKSGQSLLTTEKI--PEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 540
             +S QSLLTT ++   EP  +   +  S   S+        G   QN    S +CG  S
Sbjct: 481 SSRSTQSLLTTAEMAQAEPFGHNKQNIVSTVDSVISQPPLADGLIQQN---PSNICGAFS 540

Query: 541 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 600
             GFSST   + S+    S E  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 541 SIGFSSTCLSSSSDQFPTSLEIPKKNKKRAKPGESSRPRPRDRQLIQDRIKELRELVPNG 600

Query: 601 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 660
           +KCSIDSLLE TIKHMLFLQ +++HADKL K A+ K+  K+ G LG S+T+QGSSWAVE+
Sbjct: 601 SKCSIDSLLECTIKHMLFLQSVSQHADKLTKSASSKMQHKDTGTLGISSTEQGSSWAVEI 660

Query: 661 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 668
           GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IRSL L IL+G TE  GEKTWI
Sbjct: 661 GGHLQVCSIMVENLDKEGVMLIEMLCEECSHFLEIANVIRSLELIILRGTTEKQGEKTWI 720

BLAST of Tan0012817 vs. ExPASy Swiss-Prot
Match: Q58G01 (Transcription factor bHLH155 OS=Arabidopsis thaliana OX=3702 GN=BHLH155 PE=1 SV=1)

HSP 1 Score: 459.1 bits (1180), Expect = 8.3e-128
Identity = 320/756 (42.33%), Postives = 413/756 (54.63%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKT 60
           MGST   +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                
Sbjct: 1   MGSTS-QEILKSFCFNTDWDYAVFWQLNHRGSRMVLTLEDAYYD---------------- 60

Query: 61  LEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSST 120
               H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVAV+G+HQW+   E   N +S 
Sbjct: 61  ----HHGTNMHGAHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGEHQWV-FPENYNNCNSA 120

Query: 121 IEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAG 180
            E+ + W++Q SAGI                    KV EDVN V  IR++FL L++  A 
Sbjct: 121 FEFHNVWESQISAGIKTILVVAVGPCGVVQLGSLCKVNEDVNFVNHIRHLFLALRDPLAD 180

Query: 181 HMKPMHSC--KSSGYVADIPSRSLATE--KNEVARVSKNVGMELSG------TGGIESL- 240
           H   +  C   +S  +  +PS  L  E   +    V K + +E S       T   +S+ 
Sbjct: 181 HAANLRQCNMNNSLCLPKMPSEGLHAEAFPDCSGEVDKAMDVEESNILTQYKTRRSDSMP 240

Query: 241 QSKPDAINVESLKSQV----RLLDDRICG------------------------------- 300
            + P +  V    +QV     ++    CG                               
Sbjct: 241 YNTPSSCLVMEKAAQVVGGREVVQGSTCGSYSGVTFGFPVDLVGAKHENQVGTNIIRDAP 300

Query: 301 --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVY 360
             G  SGCKD S  L   +++  +N ++   S     + AE++IT+++Y    P   S +
Sbjct: 301 HVGMTSGCKD-SRDLDPNLHLYMKNHVLNDTSTSALAIEAERLITSQSY----PRLDSTF 360

Query: 361 DGVN--------HDGMFSRT--------NPSEMYLQNDVEASETIDMYPSNTSLKFPAGY 420
              +        H+ +F  +          +E  L  + E+S+   +  S  +    AG 
Sbjct: 361 QATSRTDKESSYHNEVFQLSENQGNKYIKETERMLGRNCESSQFDALISSGYTF---AGS 420

Query: 421 ELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADV 480
           EL E LG AF +       Q E +    G T   ++ MS SQLT D   E+LL+AVVA+V
Sbjct: 421 ELLEALGSAFKQTN---TGQEELLKSEHGSTMRPTDDMSHSQLTFDPGPENLLDAVVANV 480

Query: 481 CHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSL 540
           C    + + D    +S QSLLT  ++ EPS     +  +   +   +Q      D Q   
Sbjct: 481 CQRDGNARDDMMSSRSVQSLLTNMELAEPSGQKKHNIVNP-INSAMNQPPMAEVDTQQ-- 540

Query: 541 SSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI 600
           +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRI
Sbjct: 541 NSSDICGAFSSIGFSSTYPSSSSDQFQTSLDIPKKNKKRAKPGESSRPRPRDRQLIQDRI 600

Query: 601 KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNT 660
           KELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K AN K+ QKE GM      
Sbjct: 601 KELRELVPNGSKCSIDSLLERTIKHMLFLQNVTKHAEKLSKSANEKMQQKETGM------ 660

Query: 661 DQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI 664
            QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC HFLEIA  IRSL L IL+G 
Sbjct: 661 -QGSSCAVEVGGHLQVSSIIVENLNKQGMVLIEMLCEECGHFLEIANVIRSLDLVILRGF 713

BLAST of Tan0012817 vs. ExPASy Swiss-Prot
Match: Q9XIN0 (Transcription factor LHW OS=Arabidopsis thaliana OX=3702 GN=LHW PE=1 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 7.9e-62
Identity = 220/703 (31.29%), Postives = 332/703 (47.23%), Query Frame = 0

Query: 6   LHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTLEKFHD 65
           L + L+S+C N++W YAVFWK+  +   +L WE+ Y +     +P       + L     
Sbjct: 5   LREALRSMCVNNQWSYAVFWKIGCQNSSLLIWEECYNETESSSNP-------RRLCGLGV 64

Query: 66  GHYSHDPLGLAVAKM--SYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTI---EYC 125
               ++ + L   +M  +  +  +GEG+VG+ A  G HQWI A+    +F+  +   E  
Sbjct: 65  DTQGNEKVQLLTNRMMLNNRIILVGEGLVGRAAFTGHHQWILAN----SFNRDVHPPEVI 124

Query: 126 DGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT 185
           +    QFSAGI+             VF  +          +   ++ G+V D+    L  
Sbjct: 125 NEMLLQFSAGIQTVA----------VFPVVPHGVVQLGSSLPIMENLGFVNDVKGLILQL 184

Query: 186 EKNEVARVSKN----------VGMELS---GTGGIESLQSKPDAINVESLKSQVRLL--- 245
                A +S+N          +G+ +S    + G + LQS   A   E+ K         
Sbjct: 185 GCVPGALLSENYRTYEPAADFIGVPVSRIIPSQGHKILQS--SAFVAETSKQHFNSTGSS 244

Query: 246 DDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSV 305
           D ++    P    D     +H+   QS    ++              E   P NP A   
Sbjct: 245 DHQMVEESPCNLVD-----EHEGGWQSTTGFLT------------AGEVAVPSNPDA--- 304

Query: 306 YDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGY-ELHEVLGPAFLKDA 365
                    +   N S M   ++V+A+E   +   + S K   G  +L ++LG       
Sbjct: 305 ---------WLNQNFSCM---SNVDAAEQQQIPCEDISSKRSLGSDDLFDMLGLDDKNKG 364

Query: 366 LYLDW-----QTEYVFGGKTFELSE-----------GMSGSQLTSDSPTEHLLEAVVADV 425
               W     +TE +    T ELS+           G SG +L   S T+HLL+AVV+  
Sbjct: 365 CDNSWGVSQMRTEVL----TRELSDFRIIQEMDPEFGSSGYEL---SGTDHLLDAVVSGA 424

Query: 426 CHSGSDVKSDTS-LCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNS 485
           C S   +  +TS  CK+  + ++   +  PS           +S  Q    F  +  Q  
Sbjct: 425 CSSTKQISDETSESCKTTLTKVSNSSVTTPS-----------HSSPQGSQLFEKKHGQ-P 484

Query: 486 LSSSGVCGVMSPKGFSSTYS--GTGSEHLERSSEPAK--NSKRRARPGESCRPRPRDRQL 545
           L  S V G          +S    GS  +   +E AK  N+++R +PGE+ RPRP+DRQ+
Sbjct: 485 LGPSSVYGSQISSWVEQAHSLKREGSPRMVNKNETAKPANNRKRLKPGENPRPRPKDRQM 544

Query: 546 IQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGML 605
           IQDR+KELRE++PNGAKCSID+LLERTIKHMLFLQ ++KH+DKL +    K+ +++ G  
Sbjct: 545 IQDRVKELREIIPNGAKCSIDALLERTIKHMLFLQNVSKHSDKLKQTGESKIMKEDGG-- 604

Query: 606 GSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLT 665
                  G++WA EVG +  VC I+VE++N      VEMLCE+   FLEIA+ IRSLGLT
Sbjct: 605 -------GATWAFEVGSKSMVCPIVVEDINPPRIFQVEMLCEQRGFFLEIADWIRSLGLT 622

BLAST of Tan0012817 vs. ExPASy Swiss-Prot
Match: Q7XJU0 (Transcription factor bHLH157 OS=Arabidopsis thaliana OX=3702 GN=BHLH157 PE=1 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 8.8e-45
Identity = 193/678 (28.47%), Postives = 290/678 (42.77%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MGS   H ILKSLC +  W YAVFW+      M+L +E+ Y D                 
Sbjct: 1   MGSEYKH-ILKSLCLSHGWSYAVFWRYDPINSMILRFEEAYNDEQSV------------- 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
                           V  M      LG+GIVG+VA +G HQW+ +D       +  ++ 
Sbjct: 61  --------------ALVDDMVLQAPILGQGIVGEVASSGNHQWLFSD-------TLFQWE 120

Query: 121 DGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSCKSSGYVADIPSRSLAT 180
             +Q QF  G K+         IR    T                    +A IP      
Sbjct: 121 HEFQNQFLCGFKIL--------IRQFTYTQT------------------IAIIP------ 180

Query: 181 EKNEVARVSKNVGMELSGTGGIESLQSKPDAI-NVESLKSQVRLLDDRICGGEPSGCKDM 240
                            G+ G+  L S    + + E L+   R L +       SG    
Sbjct: 181 ----------------LGSSGVVQLGSTQKILESTEILEQTTRALQETCLKPHDSG---- 240

Query: 241 SVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPHASSVYDGVNHDGMFSRTNP 300
                   ++ +    +  C  + PAE                  + G + D +F+  NP
Sbjct: 241 --------DLDTLFESLGDC-EIFPAES-----------------FQGFSFDDIFAEDNP 300

Query: 301 SEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFGGKTF 360
             + L  ++ +SE      SN  L     Y   ++L  ++  D LY              
Sbjct: 301 PSL-LSPEMISSEAAS---SNQDLTNGDDYGF-DIL-QSYSLDDLY-------------- 360

Query: 361 ELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDV---KSDTSLCKSGQSLLTTEKIPEP 420
                    QL +D P ++    V+  V     D+    S T         L +E I   
Sbjct: 361 ---------QLLADPPEQNCSSMVIQGVDKDLFDILGMNSQTPTMALPPKGLFSELISSS 420

Query: 421 STNVTTSA---YSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHL 480
            +N T S+     + YS G +Q+     D  ++ SSS     + P+  + T      +  
Sbjct: 421 LSNNTCSSSLTNVQEYS-GVNQSKRRKLDTSSAHSSS-----LFPQEETVTSRSLWIDDD 480

Query: 481 ERSS------EPAKN--SKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSL 540
           ERSS      +P +    K+RA+ GES RPRP+DRQ+IQDRIKELR ++PNGAKCSID+L
Sbjct: 481 ERSSIGGNWKKPHEEGVKKKRAKAGESRRPRPKDRQMIQDRIKELRGMIPNGAKCSIDTL 517

Query: 541 LERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCS 600
           L+ TIKHM+F+Q + K+A++L +    KL +++             +WA+EVG +  VC 
Sbjct: 541 LDLTIKHMVFMQSLAKYAERLKQPYESKLVKEKE-----------RTWALEVGEEGVVCP 517

Query: 601 IIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFVVEGEN 660
           I+VE LN+ G++ +EM+CEE   FLEI + +R LGL ILKG+ E    + W  F+V+ + 
Sbjct: 601 IMVEELNREGEMQIEMVCEEREEFLEIGQVVRGLGLKILKGVMETRKGQIWAHFIVQAK- 517

Query: 661 NRNIHRMDILWSLVQILQ 664
              + R+ +L+SLVQ+ Q
Sbjct: 661 -PQVTRIQVLYSLVQLFQ 517

BLAST of Tan0012817 vs. ExPASy Swiss-Prot
Match: K4PW38 (Protein RICE SALT SENSITIVE 3 OS=Oryza sativa subsp. japonica OX=39947 GN=RSS3 PE=1 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 8.9e-13
Identity = 54/203 (26.60%), Postives = 91/203 (44.83%), Query Frame = 0

Query: 2   GSTDLHQILKSLCCNSEWKYAVFWKLKHRAR---------------MVLTWEDGYYDNSE 61
           G   LH+ L+++C NS+W Y+VFW ++ R R               ++L WEDG      
Sbjct: 27  GMMALHEALRNVCLNSDWTYSVFWTIRPRPRCRGGNGCKVGDDNGSLMLMWEDG------ 86

Query: 62  QHDPSESKFYRKTLEKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWI-- 121
                   F R  + +  +     DP+  A +KMS  +Y+ GEG++G+VA +  H+W+  
Sbjct: 87  --------FCRPRVAECLEDIDGEDPVRKAFSKMSIQLYNYGEGLMGKVASDKCHKWVFK 146

Query: 122 ---TADEQIPNF--SSTIEYCDGWQTQFSAGIK-------------------VTEDVNLV 164
                +  I N+  SS       W  QF++GI+                   + ED++ V
Sbjct: 147 EPSECEPNIANYWQSSFDALPPEWTDQFASGIQTIAVIQAGHGLLQLGSCKIIPEDLHFV 206

BLAST of Tan0012817 vs. NCBI nr
Match: XP_011659263.1 (transcription factor bHLH155 isoform X1 [Cucumis sativus] >KGN44749.1 hypothetical protein Csa_015656 [Cucumis sativus])

HSP 1 Score: 1214.1 bits (3140), Expect = 0.0e+00
Identity = 609/691 (88.13%), Postives = 637/691 (92.19%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTL
Sbjct: 1   MGTTDLHQILKSFCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTL 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           E F+DGHYSHD LGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSSTIEYC
Sbjct: 61  ETFYDGHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTIEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTEDVNLV RIRNVFLTLQESSAG +KP
Sbjct: 121 DGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVTRIRNVFLTLQESSAGEIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSCKSSGY+ADIPSRSLATEK EVA VSKNVG+ELSG+   ESL +KPD INVE+ KSQ
Sbjct: 181 MHSCKSSGYMADIPSRSLATEKGEVASVSKNVGLELSGSEAFESLTTKPDGINVENFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM 300
           VRLLDDR+CGGEPSGCKD +VGLK K+NVQSQNS   MV+ICGNLLPAEKI+TN+AY+ M
Sbjct: 241 VRLLDDRMCGGEPSGCKDKAVGLKQKINVQSQNSTMDMVNICGNLLPAEKIMTNDAYFSM 300

Query: 301 NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGP 360
           NPH SS YDGVNH+GMF RTN +EMYLQND+EASETI+MYPSNTSLKFPAGYELHEVLGP
Sbjct: 301 NPHPSSAYDGVNHNGMFIRTNHTEMYLQNDMEASETIEMYPSNTSLKFPAGYELHEVLGP 360

Query: 361 AFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSD 420
           AFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE LLEAVVADVCHSGSDVKSD
Sbjct: 361 AFLKDALYLDWQTEYVLGGKAFELSEGMSGSQLTSDSPTERLLEAVVADVCHSGSDVKSD 420

Query: 421 TSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 480
           TSLCKSGQSLLTTE+IPEPSTNVTTSA SEGYSMGQSQTSF GEDMQNSLSSSGVCGVMS
Sbjct: 421 TSLCKSGQSLLTTERIPEPSTNVTTSACSEGYSMGQSQTSFTGEDMQNSLSSSGVCGVMS 480

Query: 481 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 540
           PKGFSSTYSGTGSEHL++SSEPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 481 PKGFSSTYSGTGSEHLDKSSEPAKNSKRRARPGESSRPRPRDRQLIQDRIKELRELVPNG 540

Query: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 600
           AKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQK +GMLG+S+TDQGSSWAVEV
Sbjct: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLTKCANMKLHQKGSGMLGTSDTDQGSSWAVEV 600

Query: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660
           GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI
Sbjct: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660

Query: 661 CFVVEGENNRNIHRMDILWSLVQILQRSNTM 669
           CFVVEGENNRNIHRMDILWSLVQILQRS+TM
Sbjct: 661 CFVVEGENNRNIHRMDILWSLVQILQRSSTM 691

BLAST of Tan0012817 vs. NCBI nr
Match: XP_016901084.1 (PREDICTED: transcription factor EMB1444 [Cucumis melo])

HSP 1 Score: 1192.9 bits (3085), Expect = 0.0e+00
Identity = 601/691 (86.98%), Postives = 628/691 (90.88%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDL QILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTL
Sbjct: 1   MGTTDLRQILKSFCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTL 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           E F+DGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSSTIEYC
Sbjct: 61  ETFYDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTIEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTEDVNLV RIRN FLTLQESSAG +KP
Sbjct: 121 DGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVTRIRNAFLTLQESSAGEIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           +HSCKSSGYV           K E A VSKNVG+ELSG+GG ESL++KPDAINVES KSQ
Sbjct: 181 LHSCKSSGYV-----------KGEDASVSKNVGIELSGSGGFESLKTKPDAINVESFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM 300
           VRLLDDRICGGEPSGCKD +VGLK K+NVQSQ+S   M++ICGNLLPAEKI+TN AY+PM
Sbjct: 241 VRLLDDRICGGEPSGCKDTAVGLKQKINVQSQDSAMDMLNICGNLLPAEKIMTNGAYFPM 300

Query: 301 NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGP 360
           NPH SSVYDGVNH+GMF+RTN +EMYLQND+EAS+TIDMYPSN SLKFPAGYELHEVLGP
Sbjct: 301 NPHPSSVYDGVNHNGMFTRTNHTEMYLQNDMEASKTIDMYPSNASLKFPAGYELHEVLGP 360

Query: 361 AFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSD 420
           AFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE LLEAVVADVCHS SDVKSD
Sbjct: 361 AFLKDALYLDWQTEYVLGGKAFELSEGMSGSQLTSDSPTERLLEAVVADVCHSSSDVKSD 420

Query: 421 TSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 480
           TSLCKSGQSLLTTE+IPEPSTN TTSA SEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS
Sbjct: 421 TSLCKSGQSLLTTERIPEPSTNATTSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 480

Query: 481 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 540
           PKGFSSTYSGTGSEHL++S EPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 481 PKGFSSTYSGTGSEHLDKSLEPAKNSKRRARPGESSRPRPRDRQLIQDRIKELRELVPNG 540

Query: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 600
           AKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQKENGMLG+SNTDQGSSWAVEV
Sbjct: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLTKCANMKLHQKENGMLGTSNTDQGSSWAVEV 600

Query: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660
           GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI
Sbjct: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660

Query: 661 CFVVEGENNRNIHRMDILWSLVQILQRSNTM 669
           CFVVEGENNRNIHRMDILWSLVQILQRS+TM
Sbjct: 661 CFVVEGENNRNIHRMDILWSLVQILQRSSTM 680

BLAST of Tan0012817 vs. NCBI nr
Match: XP_022155842.1 (transcription factor EMB1444-like isoform X1 [Momordica charantia])

HSP 1 Score: 1192.2 bits (3083), Expect = 0.0e+00
Identity = 598/692 (86.42%), Postives = 633/692 (91.47%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDP ESKF+RKTL
Sbjct: 1   MGTTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPPESKFFRKTL 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           EKFHDGH+SHDPLGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSST+EYC
Sbjct: 61  EKFHDGHFSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTLEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTED+NLV RIRN+FLTLQESSAGH+KP
Sbjct: 121 DGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDINLVTRIRNIFLTLQESSAGHIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSC SSGY+ADI ++SL TEKNEV  VSK+VG+ELSG+GG ESL++KPDA  V+ LKSQ
Sbjct: 181 MHSCTSSGYMADISTKSLVTEKNEVEMVSKSVGIELSGSGGNESLKTKPDATIVQGLKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM 300
           VR +DDR+C GEPSGCKDM+VGLKHKV+V+ QNS   MV+ICGNLLPAEKI+TN+A +PM
Sbjct: 241 VRSIDDRMCVGEPSGCKDMAVGLKHKVHVRLQNSTMDMVNICGNLLPAEKIMTNDACFPM 300

Query: 301 NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASE---TIDMYPSNTSLKFPAGYELHEV 360
           N HASS  DGVNH+GM  RTNP+EM L+NDVEA E     DMYPSNTSLKFPAGYELHEV
Sbjct: 301 NSHASSACDGVNHNGMLRRTNPTEMCLENDVEALEIRNETDMYPSNTSLKFPAGYELHEV 360

Query: 361 LGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDV 420
           LGPAFLKDALYLDWQ EY FG K FELSEGMSGSQLTSDSP E LLEAVVADVCHSGSDV
Sbjct: 361 LGPAFLKDALYLDWQVEYGFGSKAFELSEGMSGSQLTSDSPMERLLEAVVADVCHSGSDV 420

Query: 421 KSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCG 480
           KS+TSLCKSGQSLLTTE+IPEPSTN+TTSA SEGYSMGQSQ+SFIGEDMQNSLSSSGVCG
Sbjct: 421 KSNTSLCKSGQSLLTTERIPEPSTNITTSACSEGYSMGQSQSSFIGEDMQNSLSSSGVCG 480

Query: 481 VMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELV 540
           VMSPKGFSSTYSGTGSEHLERSSEPAKN+KRRA+PGESCRPRPRDRQLIQDRIKELRELV
Sbjct: 481 VMSPKGFSSTYSGTGSEHLERSSEPAKNNKRRAKPGESCRPRPRDRQLIQDRIKELRELV 540

Query: 541 PNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWA 600
           PNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSN DQGSSWA
Sbjct: 541 PNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNNDQGSSWA 600

Query: 601 VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEK 660
           VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEK
Sbjct: 601 VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEK 660

Query: 661 TWICFVVEGENNRNIHRMDILWSLVQILQRSN 667
           TWICFVVEGENNR+IHRMDILWSLVQILQRSN
Sbjct: 661 TWICFVVEGENNRSIHRMDILWSLVQILQRSN 692

BLAST of Tan0012817 vs. NCBI nr
Match: XP_022959660.1 (transcription factor EMB1444-like [Cucurbita moschata])

HSP 1 Score: 1189.1 bits (3075), Expect = 0.0e+00
Identity = 601/688 (87.35%), Postives = 634/688 (92.15%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHD  ESKFY KT+
Sbjct: 1   MGTTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDLPESKFYSKTI 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           EKFHDG YSHDPL LAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSST+EYC
Sbjct: 61  EKFHDGRYSHDPLELAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTLEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KV EDVNLVARIRNVFLTLQESSAGH+KP
Sbjct: 121 DGWQTQFSAGIKTIVVAAVVPHGVLQLGSLDKVIEDVNLVARIRNVFLTLQESSAGHIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSC+SSGYVADIPSRSLATEKNEV  VSK+VG+ELSG+GGI+SL+ KPDAINV+S KSQ
Sbjct: 181 MHSCESSGYVADIPSRSLATEKNEVEMVSKDVGIELSGSGGIKSLEIKPDAINVDSFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH 300
           VRLLDDRICGGEPSGCKD++VGLK K+NV SQNS + +     PAEKIIT+EAY+PMNP 
Sbjct: 241 VRLLDDRICGGEPSGCKDIAVGLKQKINVGSQNSEMGMVNLSGPAEKIITDEAYFPMNPQ 300

Query: 301 ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFL 360
           ASSV  G+ H GM S+TNPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFL
Sbjct: 301 ASSVC-GLKHYGMCSQTNPSEMYLQNDVEASETINVYPSNSSLKFPAGYELHEVLGPAFL 360

Query: 361 KDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420
           KDALYLDW+TEYVFGGK FELSEG +GS LTSDSPTEHLLEAVVADVCHSGSDVKSDTSL
Sbjct: 361 KDALYLDWKTEYVFGGKAFELSEGTNGSHLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420

Query: 421 CKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKG 480
           CKSGQSLLTTE+IPEPSTNV TSA SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KG
Sbjct: 421 CKSGQSLLTTERIPEPSTNV-TSACSEGYTMGQSQTSFIGEEMQNSLSSSGVCGVMSTKG 480

Query: 481 FSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540
           FSSTYSGTGSEHLE+ SEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC
Sbjct: 481 FSSTYSGTGSEHLEQFSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540

Query: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ 600
           SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Sbjct: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKKNGMLGSSSTDQGSSWAVEVGGQ 600

Query: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFV 660
           LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHG+KTWICFV
Sbjct: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLIILKGITEAHGDKTWICFV 660

Query: 661 VEGENNRNIHRMDILWSLVQILQRSNTM 669
           VEGENNRN+HRMDILWSLVQILQRSNTM
Sbjct: 661 VEGENNRNVHRMDILWSLVQILQRSNTM 686

BLAST of Tan0012817 vs. NCBI nr
Match: KAG7025488.1 (Transcription factor EMB-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1188.7 bits (3074), Expect = 0.0e+00
Identity = 602/688 (87.50%), Postives = 634/688 (92.15%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYY NSEQHD  ESKFY KT+
Sbjct: 1   MGTTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYGNSEQHDLPESKFYSKTI 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           EKFHDG YSHD LGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSST+EYC
Sbjct: 61  EKFHDGCYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTLEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGW TQFSAGI                    KVTEDVNLVARIRNVFLTLQESSAGH+KP
Sbjct: 121 DGWLTQFSAGIKTIVVAAVVPHGVLQLGSLDKVTEDVNLVARIRNVFLTLQESSAGHIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSC+SSGYVADIPSRSLATEKNEV  VSK+VG+ELSG+GGI+SL+ KPDAINV+S KSQ
Sbjct: 181 MHSCESSGYVADIPSRSLATEKNEVEMVSKDVGIELSGSGGIKSLEIKPDAINVDSFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH 300
           VRLLDDRICGGEPSGCKD++VGLK K+NV SQNS + +     PAEKIIT+EAY+PMNPH
Sbjct: 241 VRLLDDRICGGEPSGCKDIAVGLKQKINVGSQNSEMGMVNLSGPAEKIITDEAYFPMNPH 300

Query: 301 ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFL 360
           ASSV  G+ H GM S+ NPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFL
Sbjct: 301 ASSVC-GLKHYGMCSQANPSEMYLQNDVEASETINVYPSNSSLKFPAGYELHEVLGPAFL 360

Query: 361 KDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420
           KDALYLDW+TEYVFGGK FELSEGM+GS LTSDSPTEHLLEAVVADVCHSGSDVKSDTSL
Sbjct: 361 KDALYLDWKTEYVFGGKAFELSEGMNGSHLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420

Query: 421 CKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKG 480
           CKSGQSLLTTE+IPEPSTNV TSA SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KG
Sbjct: 421 CKSGQSLLTTERIPEPSTNV-TSACSEGYTMGQSQTSFIGEEMQNSLSSSGVCGVMSTKG 480

Query: 481 FSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540
           FSSTYSGTGSEHLE+ SEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC
Sbjct: 481 FSSTYSGTGSEHLEQFSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540

Query: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ 600
           SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Sbjct: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKKNGMLGSSSTDQGSSWAVEVGGQ 600

Query: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFV 660
           LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHG+KTWICFV
Sbjct: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLIILKGITEAHGDKTWICFV 660

Query: 661 VEGENNRNIHRMDILWSLVQILQRSNTM 669
           VEGENNRNIHRMDILWSLVQILQRSNTM
Sbjct: 661 VEGENNRNIHRMDILWSLVQILQRSNTM 686

BLAST of Tan0012817 vs. ExPASy TrEMBL
Match: A0A0A0K751 (BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G378380 PE=4 SV=1)

HSP 1 Score: 1214.1 bits (3140), Expect = 0.0e+00
Identity = 609/691 (88.13%), Postives = 637/691 (92.19%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTL
Sbjct: 1   MGTTDLHQILKSFCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTL 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           E F+DGHYSHD LGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSSTIEYC
Sbjct: 61  ETFYDGHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTIEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTEDVNLV RIRNVFLTLQESSAG +KP
Sbjct: 121 DGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVTRIRNVFLTLQESSAGEIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSCKSSGY+ADIPSRSLATEK EVA VSKNVG+ELSG+   ESL +KPD INVE+ KSQ
Sbjct: 181 MHSCKSSGYMADIPSRSLATEKGEVASVSKNVGLELSGSEAFESLTTKPDGINVENFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM 300
           VRLLDDR+CGGEPSGCKD +VGLK K+NVQSQNS   MV+ICGNLLPAEKI+TN+AY+ M
Sbjct: 241 VRLLDDRMCGGEPSGCKDKAVGLKQKINVQSQNSTMDMVNICGNLLPAEKIMTNDAYFSM 300

Query: 301 NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGP 360
           NPH SS YDGVNH+GMF RTN +EMYLQND+EASETI+MYPSNTSLKFPAGYELHEVLGP
Sbjct: 301 NPHPSSAYDGVNHNGMFIRTNHTEMYLQNDMEASETIEMYPSNTSLKFPAGYELHEVLGP 360

Query: 361 AFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSD 420
           AFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE LLEAVVADVCHSGSDVKSD
Sbjct: 361 AFLKDALYLDWQTEYVLGGKAFELSEGMSGSQLTSDSPTERLLEAVVADVCHSGSDVKSD 420

Query: 421 TSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 480
           TSLCKSGQSLLTTE+IPEPSTNVTTSA SEGYSMGQSQTSF GEDMQNSLSSSGVCGVMS
Sbjct: 421 TSLCKSGQSLLTTERIPEPSTNVTTSACSEGYSMGQSQTSFTGEDMQNSLSSSGVCGVMS 480

Query: 481 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 540
           PKGFSSTYSGTGSEHL++SSEPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 481 PKGFSSTYSGTGSEHLDKSSEPAKNSKRRARPGESSRPRPRDRQLIQDRIKELRELVPNG 540

Query: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 600
           AKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQK +GMLG+S+TDQGSSWAVEV
Sbjct: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLTKCANMKLHQKGSGMLGTSDTDQGSSWAVEV 600

Query: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660
           GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI
Sbjct: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660

Query: 661 CFVVEGENNRNIHRMDILWSLVQILQRSNTM 669
           CFVVEGENNRNIHRMDILWSLVQILQRS+TM
Sbjct: 661 CFVVEGENNRNIHRMDILWSLVQILQRSSTM 691

BLAST of Tan0012817 vs. ExPASy TrEMBL
Match: A0A1S4DYM6 (transcription factor EMB1444 OS=Cucumis melo OX=3656 GN=LOC103492612 PE=4 SV=1)

HSP 1 Score: 1192.9 bits (3085), Expect = 0.0e+00
Identity = 601/691 (86.98%), Postives = 628/691 (90.88%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDL QILKS CCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQH+P E KF+RKTL
Sbjct: 1   MGTTDLRQILKSFCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHEPPEGKFFRKTL 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           E F+DGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSSTIEYC
Sbjct: 61  ETFYDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTIEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTEDVNLV RIRN FLTLQESSAG +KP
Sbjct: 121 DGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDVNLVTRIRNAFLTLQESSAGEIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           +HSCKSSGYV           K E A VSKNVG+ELSG+GG ESL++KPDAINVES KSQ
Sbjct: 181 LHSCKSSGYV-----------KGEDASVSKNVGIELSGSGGFESLKTKPDAINVESFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM 300
           VRLLDDRICGGEPSGCKD +VGLK K+NVQSQ+S   M++ICGNLLPAEKI+TN AY+PM
Sbjct: 241 VRLLDDRICGGEPSGCKDTAVGLKQKINVQSQDSAMDMLNICGNLLPAEKIMTNGAYFPM 300

Query: 301 NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGP 360
           NPH SSVYDGVNH+GMF+RTN +EMYLQND+EAS+TIDMYPSN SLKFPAGYELHEVLGP
Sbjct: 301 NPHPSSVYDGVNHNGMFTRTNHTEMYLQNDMEASKTIDMYPSNASLKFPAGYELHEVLGP 360

Query: 361 AFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSD 420
           AFLKDALYLDWQTEYV GGK FELSEGMSGSQLTSDSPTE LLEAVVADVCHS SDVKSD
Sbjct: 361 AFLKDALYLDWQTEYVLGGKAFELSEGMSGSQLTSDSPTERLLEAVVADVCHSSSDVKSD 420

Query: 421 TSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 480
           TSLCKSGQSLLTTE+IPEPSTN TTSA SEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS
Sbjct: 421 TSLCKSGQSLLTTERIPEPSTNATTSACSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 480

Query: 481 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 540
           PKGFSSTYSGTGSEHL++S EPAKNSKRRARPGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 481 PKGFSSTYSGTGSEHLDKSLEPAKNSKRRARPGESSRPRPRDRQLIQDRIKELRELVPNG 540

Query: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 600
           AKCSIDSLLERTIKHMLFLQGITKHADKL KCANMKLHQKENGMLG+SNTDQGSSWAVEV
Sbjct: 541 AKCSIDSLLERTIKHMLFLQGITKHADKLTKCANMKLHQKENGMLGTSNTDQGSSWAVEV 600

Query: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660
           GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI
Sbjct: 601 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 660

Query: 661 CFVVEGENNRNIHRMDILWSLVQILQRSNTM 669
           CFVVEGENNRNIHRMDILWSLVQILQRS+TM
Sbjct: 661 CFVVEGENNRNIHRMDILWSLVQILQRSSTM 680

BLAST of Tan0012817 vs. ExPASy TrEMBL
Match: A0A6J1DQG8 (transcription factor EMB1444-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022869 PE=4 SV=1)

HSP 1 Score: 1192.2 bits (3083), Expect = 0.0e+00
Identity = 598/692 (86.42%), Postives = 633/692 (91.47%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDP ESKF+RKTL
Sbjct: 1   MGTTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPPESKFFRKTL 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           EKFHDGH+SHDPLGLAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSST+EYC
Sbjct: 61  EKFHDGHFSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTLEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTED+NLV RIRN+FLTLQESSAGH+KP
Sbjct: 121 DGWQTQFSAGIKTIVVVAVVPHGVLQLGSLDKVTEDINLVTRIRNIFLTLQESSAGHIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSC SSGY+ADI ++SL TEKNEV  VSK+VG+ELSG+GG ESL++KPDA  V+ LKSQ
Sbjct: 181 MHSCTSSGYMADISTKSLVTEKNEVEMVSKSVGIELSGSGGNESLKTKPDATIVQGLKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNS---MVSICGNLLPAEKIITNEAYYPM 300
           VR +DDR+C GEPSGCKDM+VGLKHKV+V+ QNS   MV+ICGNLLPAEKI+TN+A +PM
Sbjct: 241 VRSIDDRMCVGEPSGCKDMAVGLKHKVHVRLQNSTMDMVNICGNLLPAEKIMTNDACFPM 300

Query: 301 NPHASSVYDGVNHDGMFSRTNPSEMYLQNDVEASE---TIDMYPSNTSLKFPAGYELHEV 360
           N HASS  DGVNH+GM  RTNP+EM L+NDVEA E     DMYPSNTSLKFPAGYELHEV
Sbjct: 301 NSHASSACDGVNHNGMLRRTNPTEMCLENDVEALEIRNETDMYPSNTSLKFPAGYELHEV 360

Query: 361 LGPAFLKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDV 420
           LGPAFLKDALYLDWQ EY FG K FELSEGMSGSQLTSDSP E LLEAVVADVCHSGSDV
Sbjct: 361 LGPAFLKDALYLDWQVEYGFGSKAFELSEGMSGSQLTSDSPMERLLEAVVADVCHSGSDV 420

Query: 421 KSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCG 480
           KS+TSLCKSGQSLLTTE+IPEPSTN+TTSA SEGYSMGQSQ+SFIGEDMQNSLSSSGVCG
Sbjct: 421 KSNTSLCKSGQSLLTTERIPEPSTNITTSACSEGYSMGQSQSSFIGEDMQNSLSSSGVCG 480

Query: 481 VMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELV 540
           VMSPKGFSSTYSGTGSEHLERSSEPAKN+KRRA+PGESCRPRPRDRQLIQDRIKELRELV
Sbjct: 481 VMSPKGFSSTYSGTGSEHLERSSEPAKNNKRRAKPGESCRPRPRDRQLIQDRIKELRELV 540

Query: 541 PNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWA 600
           PNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSN DQGSSWA
Sbjct: 541 PNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNNDQGSSWA 600

Query: 601 VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEK 660
           VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEK
Sbjct: 601 VEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEK 660

Query: 661 TWICFVVEGENNRNIHRMDILWSLVQILQRSN 667
           TWICFVVEGENNR+IHRMDILWSLVQILQRSN
Sbjct: 661 TWICFVVEGENNRSIHRMDILWSLVQILQRSN 692

BLAST of Tan0012817 vs. ExPASy TrEMBL
Match: A0A6J1H6X5 (transcription factor EMB1444-like OS=Cucurbita moschata OX=3662 GN=LOC111460673 PE=4 SV=1)

HSP 1 Score: 1189.1 bits (3075), Expect = 0.0e+00
Identity = 601/688 (87.35%), Postives = 634/688 (92.15%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHD  ESKFY KT+
Sbjct: 1   MGTTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDLPESKFYSKTI 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           EKFHDG YSHDPL LAVAKMSYHVYSLGEGIVGQVAV GKHQWITADEQIPNFSST+EYC
Sbjct: 61  EKFHDGRYSHDPLELAVAKMSYHVYSLGEGIVGQVAVTGKHQWITADEQIPNFSSTLEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KV EDVNLVARIRNVFLTLQESSAGH+KP
Sbjct: 121 DGWQTQFSAGIKTIVVAAVVPHGVLQLGSLDKVIEDVNLVARIRNVFLTLQESSAGHIKP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           MHSC+SSGYVADIPSRSLATEKNEV  VSK+VG+ELSG+GGI+SL+ KPDAINV+S KSQ
Sbjct: 181 MHSCESSGYVADIPSRSLATEKNEVEMVSKDVGIELSGSGGIKSLEIKPDAINVDSFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH 300
           VRLLDDRICGGEPSGCKD++VGLK K+NV SQNS + +     PAEKIIT+EAY+PMNP 
Sbjct: 241 VRLLDDRICGGEPSGCKDIAVGLKQKINVGSQNSEMGMVNLSGPAEKIITDEAYFPMNPQ 300

Query: 301 ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFL 360
           ASSV  G+ H GM S+TNPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFL
Sbjct: 301 ASSVC-GLKHYGMCSQTNPSEMYLQNDVEASETINVYPSNSSLKFPAGYELHEVLGPAFL 360

Query: 361 KDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420
           KDALYLDW+TEYVFGGK FELSEG +GS LTSDSPTEHLLEAVVADVCHSGSDVKSDTSL
Sbjct: 361 KDALYLDWKTEYVFGGKAFELSEGTNGSHLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420

Query: 421 CKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKG 480
           CKSGQSLLTTE+IPEPSTNV TSA SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KG
Sbjct: 421 CKSGQSLLTTERIPEPSTNV-TSACSEGYTMGQSQTSFIGEEMQNSLSSSGVCGVMSTKG 480

Query: 481 FSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540
           FSSTYSGTGSEHLE+ SEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC
Sbjct: 481 FSSTYSGTGSEHLEQFSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540

Query: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ 600
           SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Sbjct: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKKNGMLGSSSTDQGSSWAVEVGGQ 600

Query: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFV 660
           LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL ILKGITEAHG+KTWICFV
Sbjct: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLIILKGITEAHGDKTWICFV 660

Query: 661 VEGENNRNIHRMDILWSLVQILQRSNTM 669
           VEGENNRN+HRMDILWSLVQILQRSNTM
Sbjct: 661 VEGENNRNVHRMDILWSLVQILQRSNTM 686

BLAST of Tan0012817 vs. ExPASy TrEMBL
Match: A0A6J1L0P6 (transcription factor EMB1444-like OS=Cucurbita maxima OX=3661 GN=LOC111498031 PE=4 SV=1)

HSP 1 Score: 1171.8 bits (3030), Expect = 0.0e+00
Identity = 593/688 (86.19%), Postives = 630/688 (91.57%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG+TDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHD  ESKFY KT+
Sbjct: 1   MGTTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDLPESKFYSKTI 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
           EKFHDG YSHD LGLAVAKMSYHVYSLGEGIVGQVAV GK+QWITADEQIPNFSST+EYC
Sbjct: 61  EKFHDGCYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKYQWITADEQIPNFSSTLEYC 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           DGWQTQFSAGI                    KVTEDVNLVA IRNVFLTLQESSAGH++P
Sbjct: 121 DGWQTQFSAGIKTIVVAAVVPHGVLQLGSLDKVTEDVNLVACIRNVFLTLQESSAGHIQP 180

Query: 181 MHSCKSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVESLKSQ 240
           M SC+SSGYVADIPSRSLATEKNEV  VSK+VG+ELSG+GGI+SL+ KPDAIN++S KSQ
Sbjct: 181 MRSCESSGYVADIPSRSLATEKNEVEMVSKDVGIELSGSGGIKSLEIKPDAINMDSFKSQ 240

Query: 241 VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMVSICGNLLPAEKIITNEAYYPMNPH 300
           VR+LDDRICGGEPS CKD++VGLK K+NV SQN  +       PAEKIIT+EAY+PMNPH
Sbjct: 241 VRVLDDRICGGEPSECKDIAVGLKQKINVGSQNFEMGTVNLSGPAEKIITDEAYFPMNPH 300

Query: 301 ASSVYDGVNHDGMFSRTNPSEMYLQNDVEASETIDMYPSNTSLKFPAGYELHEVLGPAFL 360
           ASSV  G+ HDGM S+TNPSEMYLQNDVEASETI++YPSN+SLKFPAGYELHEVLGPAFL
Sbjct: 301 ASSVC-GLKHDGMCSQTNPSEMYLQNDVEASETINVYPSNSSLKFPAGYELHEVLGPAFL 360

Query: 361 KDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420
           KDALYLDW+T+YVFGGK FELSEG +GS LTSDSPTEHLLEAVVADVCHSGSDVKSDTSL
Sbjct: 361 KDALYLDWKTKYVFGGKAFELSEGTNGSHLTSDSPTEHLLEAVVADVCHSGSDVKSDTSL 420

Query: 421 CKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMSPKG 480
           CKSGQSLLTTE+IPEPSTNV TS  SEGY+MGQSQTSFIGE+MQNSLSSSGVCGVMS KG
Sbjct: 421 CKSGQSLLTTERIPEPSTNV-TSTCSEGYTMGQSQTSFIGEEMQNSLSSSGVCGVMSTKG 480

Query: 481 FSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540
           FSSTYSGTGSEHLE+ SEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC
Sbjct: 481 FSSTYSGTGSEHLEQFSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNGAKC 540

Query: 541 SIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEVGGQ 600
           SIDSLLERTIKHMLFLQGITKHA KLNKCANMKLHQK+NGMLGSS+TDQGSSWAVEVGGQ
Sbjct: 541 SIDSLLERTIKHMLFLQGITKHAGKLNKCANMKLHQKKNGMLGSSSTDQGSSWAVEVGGQ 600

Query: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWICFV 660
           LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGL IL+GITEAHG+KTWICFV
Sbjct: 601 LKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLIILRGITEAHGDKTWICFV 660

Query: 661 VEGENNRNIHRMDILWSLVQILQRSNTM 669
           VEGENNRNIHRMDILWSLVQILQRSNTM
Sbjct: 661 VEGENNRNIHRMDILWSLVQILQRSNTM 686

BLAST of Tan0012817 vs. TAIR 10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 462.2 bits (1188), Expect = 6.9e-130
Identity = 319/750 (42.53%), Postives = 424/750 (56.53%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG T L QIL+S+C N++W YAVFWKL H + MVLT ED Y  N E+    ES       
Sbjct: 1   MGYT-LQQILRSICSNTDWNYAVFWKLNHHSPMVLTLEDVYCVNHERGLMPES------- 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
              H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+HQWI + E + +  ST++  
Sbjct: 61  --LHGGRHAHDPLGLAVAKMSYHVHSLGEGIVGQVAISGQHQWIFS-EYLNDSHSTLQVH 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           +GW++Q SAGI                    KV ED  LV  IR++FL L +  A H   
Sbjct: 121 NGWESQISAGIKTILIVAVGSCGVVQLGSLCKVEEDPALVTHIRHLFLALTDPLADHASN 180

Query: 181 MHSC--KSSGYVADIPSRSL-------------ATEKNEVARVSKNVG------------ 240
           +  C   S      IPS+ L             A +   +  VS+N              
Sbjct: 181 LMQCDINSPSDRPKIPSKCLHEASPDFSGEFDKAMDMEGLNIVSQNTSNRSNDLPYNFTP 240

Query: 241 ----MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS 300
               ME +    GG+E++Q      N           V++  K+QV + D  ++   E +
Sbjct: 241 TYFHMERTAQVIGGLEAVQPSMFGSNDCVTSGFSVGVVDTKHKNQVDISDMSKVIYDEET 300

Query: 301 GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHD 360
           G    S  L       S+N + +  G     + ++++    +Y  ++   S+V   +  D
Sbjct: 301 GGYRYSRELDPNFQHYSRNHVRNSGGTSALAMESDRLKAGSSYPQLD---STVLTALKTD 360

Query: 361 GMFSRTN----PSE----MYLQN----DVEASETIDMYPSNTSLKFPAGYELHEVLGPAF 420
             +SR N    PSE    +++++      E SE+  +     SL   +G EL E LGPAF
Sbjct: 361 KDYSRRNEVFQPSESQGSIFVKDTEHRQEEKSESSQLDALTASLCSFSGSELLEALGPAF 420

Query: 421 LKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTS 480
            K +       ++         +  MS S LT +S +E+LL+AVVA + +   +V+ + S
Sbjct: 421 SKTSTDYGELAKFE-SAAAIRRTNDMSHSHLTFESSSENLLDAVVASMSNGDGNVRREIS 480

Query: 481 LCKSGQSLLTTEKI--PEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 540
             +S QSLLTT ++   EP  +   +  S   S+        G   QN    S +CG  S
Sbjct: 481 SSRSTQSLLTTAEMAQAEPFGHNKQNIVSTVDSVISQPPLADGLIQQN---PSNICGAFS 540

Query: 541 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 600
             GFSST   + S+    S E  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 541 SIGFSSTCLSSSSDQFPTSLEIPKKNKKRAKPGESSRPRPRDRQLIQDRIKELRELVPNG 600

Query: 601 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 660
           +KCSIDSLLE TIKHMLFLQ +++HADKL K A+ K+  K+ G LG S+T+QGSSWAVE+
Sbjct: 601 SKCSIDSLLECTIKHMLFLQSVSQHADKLTKSASSKMQHKDTGTLGISSTEQGSSWAVEI 660

Query: 661 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 668
           GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IRSL L IL+G TE  GEKTWI
Sbjct: 661 GGHLQVCSIMVENLDKEGVMLIEMLCEECSHFLEIANVIRSLELIILRGTTEKQGEKTWI 720

BLAST of Tan0012817 vs. TAIR 10
Match: AT1G06150.2 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 462.2 bits (1188), Expect = 6.9e-130
Identity = 319/750 (42.53%), Postives = 424/750 (56.53%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHRARMVLTWEDGYYDNSEQHDPSESKFYRKTL 60
           MG T L QIL+S+C N++W YAVFWKL H + MVLT ED Y  N E+    ES       
Sbjct: 1   MGYT-LQQILRSICSNTDWNYAVFWKLNHHSPMVLTLEDVYCVNHERGLMPES------- 60

Query: 61  EKFHDGHYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTIEYC 120
              H G ++HDPLGLAVAKMSYHV+SLGEGIVGQVA++G+HQWI + E + +  ST++  
Sbjct: 61  --LHGGRHAHDPLGLAVAKMSYHVHSLGEGIVGQVAISGQHQWIFS-EYLNDSHSTLQVH 120

Query: 121 DGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAGHMKP 180
           +GW++Q SAGI                    KV ED  LV  IR++FL L +  A H   
Sbjct: 121 NGWESQISAGIKTILIVAVGSCGVVQLGSLCKVEEDPALVTHIRHLFLALTDPLADHASN 180

Query: 181 MHSC--KSSGYVADIPSRSL-------------ATEKNEVARVSKNVG------------ 240
           +  C   S      IPS+ L             A +   +  VS+N              
Sbjct: 181 LMQCDINSPSDRPKIPSKCLHEASPDFSGEFDKAMDMEGLNIVSQNTSNRSNDLPYNFTP 240

Query: 241 ----MELSG--TGGIESLQSKPDAIN-----------VESL-KSQVRLLD-DRICGGEPS 300
               ME +    GG+E++Q      N           V++  K+QV + D  ++   E +
Sbjct: 241 TYFHMERTAQVIGGLEAVQPSMFGSNDCVTSGFSVGVVDTKHKNQVDISDMSKVIYDEET 300

Query: 301 GCKDMSVGLKHKVNVQSQNSMVSICGN---LLPAEKIITNEAYYPMNPHASSVYDGVNHD 360
           G    S  L       S+N + +  G     + ++++    +Y  ++   S+V   +  D
Sbjct: 301 GGYRYSRELDPNFQHYSRNHVRNSGGTSALAMESDRLKAGSSYPQLD---STVLTALKTD 360

Query: 361 GMFSRTN----PSE----MYLQN----DVEASETIDMYPSNTSLKFPAGYELHEVLGPAF 420
             +SR N    PSE    +++++      E SE+  +     SL   +G EL E LGPAF
Sbjct: 361 KDYSRRNEVFQPSESQGSIFVKDTEHRQEEKSESSQLDALTASLCSFSGSELLEALGPAF 420

Query: 421 LKDALYLDWQTEYVFGGKTFELSEGMSGSQLTSDSPTEHLLEAVVADVCHSGSDVKSDTS 480
            K +       ++         +  MS S LT +S +E+LL+AVVA + +   +V+ + S
Sbjct: 421 SKTSTDYGELAKFE-SAAAIRRTNDMSHSHLTFESSSENLLDAVVASMSNGDGNVRREIS 480

Query: 481 LCKSGQSLLTTEKI--PEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSLSSSGVCGVMS 540
             +S QSLLTT ++   EP  +   +  S   S+        G   QN    S +CG  S
Sbjct: 481 SSRSTQSLLTTAEMAQAEPFGHNKQNIVSTVDSVISQPPLADGLIQQN---PSNICGAFS 540

Query: 541 PKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRIKELRELVPNG 600
             GFSST   + S+    S E  K +K+RA+PGES RPRPRDRQLIQDRIKELRELVPNG
Sbjct: 541 SIGFSSTCLSSSSDQFPTSLEIPKKNKKRAKPGESSRPRPRDRQLIQDRIKELRELVPNG 600

Query: 601 AKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNTDQGSSWAVEV 660
           +KCSIDSLLE TIKHMLFLQ +++HADKL K A+ K+  K+ G LG S+T+QGSSWAVE+
Sbjct: 601 SKCSIDSLLECTIKHMLFLQSVSQHADKLTKSASSKMQHKDTGTLGISSTEQGSSWAVEI 660

Query: 661 GGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGITEAHGEKTWI 668
           GG L+VCSI+VENL+K G +L+EMLCEECSHFLEIA  IRSL L IL+G TE  GEKTWI
Sbjct: 661 GGHLQVCSIMVENLDKEGVMLIEMLCEECSHFLEIANVIRSLELIILRGTTEKQGEKTWI 720

BLAST of Tan0012817 vs. TAIR 10
Match: AT2G31280.1 (conserved peptide upstream open reading frame 7 )

HSP 1 Score: 459.1 bits (1180), Expect = 5.9e-129
Identity = 320/756 (42.33%), Postives = 413/756 (54.63%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKT 60
           MGST   +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                
Sbjct: 1   MGSTS-QEILKSFCFNTDWDYAVFWQLNHRGSRMVLTLEDAYYD---------------- 60

Query: 61  LEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSST 120
               H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVAV+G+HQW+   E   N +S 
Sbjct: 61  ----HHGTNMHGAHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGEHQWV-FPENYNNCNSA 120

Query: 121 IEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAG 180
            E+ + W++Q SAGI                    KV EDVN V  IR++FL L++  A 
Sbjct: 121 FEFHNVWESQISAGIKTILVVAVGPCGVVQLGSLCKVNEDVNFVNHIRHLFLALRDPLAD 180

Query: 181 HMKPMHSC--KSSGYVADIPSRSLATE--KNEVARVSKNVGMELSG------TGGIESL- 240
           H   +  C   +S  +  +PS  L  E   +    V K + +E S       T   +S+ 
Sbjct: 181 HAANLRQCNMNNSLCLPKMPSEGLHAEAFPDCSGEVDKAMDVEESNILTQYKTRRSDSMP 240

Query: 241 QSKPDAINVESLKSQV----RLLDDRICG------------------------------- 300
            + P +  V    +QV     ++    CG                               
Sbjct: 241 YNTPSSCLVMEKAAQVVGGREVVQGSTCGSYSGVTFGFPVDLVGAKHENQVGTNIIRDAP 300

Query: 301 --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVY 360
             G  SGCKD S  L   +++  +N ++   S     + AE++IT+++Y    P   S +
Sbjct: 301 HVGMTSGCKD-SRDLDPNLHLYMKNHVLNDTSTSALAIEAERLITSQSY----PRLDSTF 360

Query: 361 DGVN--------HDGMFSRT--------NPSEMYLQNDVEASETIDMYPSNTSLKFPAGY 420
              +        H+ +F  +          +E  L  + E+S+   +  S  +    AG 
Sbjct: 361 QATSRTDKESSYHNEVFQLSENQGNKYIKETERMLGRNCESSQFDALISSGYTF---AGS 420

Query: 421 ELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADV 480
           EL E LG AF +       Q E +    G T   ++ MS SQLT D   E+LL+AVVA+V
Sbjct: 421 ELLEALGSAFKQTN---TGQEELLKSEHGSTMRPTDDMSHSQLTFDPGPENLLDAVVANV 480

Query: 481 CHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSL 540
           C    + + D    +S QSLLT  ++ EPS     +  +   +   +Q      D Q   
Sbjct: 481 CQRDGNARDDMMSSRSVQSLLTNMELAEPSGQKKHNIVNP-INSAMNQPPMAEVDTQQ-- 540

Query: 541 SSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI 600
           +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRI
Sbjct: 541 NSSDICGAFSSIGFSSTYPSSSSDQFQTSLDIPKKNKKRAKPGESSRPRPRDRQLIQDRI 600

Query: 601 KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNT 660
           KELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K AN K+ QKE GM      
Sbjct: 601 KELRELVPNGSKCSIDSLLERTIKHMLFLQNVTKHAEKLSKSANEKMQQKETGM------ 660

Query: 661 DQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI 664
            QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC HFLEIA  IRSL L IL+G 
Sbjct: 661 -QGSSCAVEVGGHLQVSSIIVENLNKQGMVLIEMLCEECGHFLEIANVIRSLDLVILRGF 713

BLAST of Tan0012817 vs. TAIR 10
Match: AT2G31280.3 (conserved peptide upstream open reading frame 7 )

HSP 1 Score: 448.4 bits (1152), Expect = 1.0e-125
Identity = 320/773 (41.40%), Postives = 413/773 (53.43%), Query Frame = 0

Query: 1   MGSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKT 60
           MGST   +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                
Sbjct: 1   MGSTS-QEILKSFCFNTDWDYAVFWQLNHRGSRMVLTLEDAYYD---------------- 60

Query: 61  LEKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSST 120
               H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVAV+G+HQW+   E   N +S 
Sbjct: 61  ----HHGTNMHGAHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGEHQWV-FPENYNNCNSA 120

Query: 121 IEYCDGWQTQFSAGI--------------------KVTEDVNLVARIRNVFLTLQESSAG 180
            E+ + W++Q SAGI                    KV EDVN V  IR++FL L++  A 
Sbjct: 121 FEFHNVWESQISAGIKTILVVAVGPCGVVQLGSLCKVNEDVNFVNHIRHLFLALRDPLAD 180

Query: 181 HMKPMHSC--KSSGYVADIPSRSLATE--KNEVARVSKNVGMELSG------TGGIESL- 240
           H   +  C   +S  +  +PS  L  E   +    V K + +E S       T   +S+ 
Sbjct: 181 HAANLRQCNMNNSLCLPKMPSEGLHAEAFPDCSGEVDKAMDVEESNILTQYKTRRSDSMP 240

Query: 241 QSKPDAINVESLKSQV----RLLDDRICG------------------------------- 300
            + P +  V    +QV     ++    CG                               
Sbjct: 241 YNTPSSCLVMEKAAQVVGGREVVQGSTCGSYSGVTFGFPVDLVGAKHENQVGTNIIRDAP 300

Query: 301 --GEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKIITNEAYYPMNPHASSVY 360
             G  SGCKD S  L   +++  +N ++   S     + AE++IT+++Y    P   S +
Sbjct: 301 HVGMTSGCKD-SRDLDPNLHLYMKNHVLNDTSTSALAIEAERLITSQSY----PRLDSTF 360

Query: 361 DGVN--------HDGMFSRT--------NPSEMYLQNDVEASETIDMYPSNTSLKFPAGY 420
              +        H+ +F  +          +E  L  + E+S+   +  S  +    AG 
Sbjct: 361 QATSRTDKESSYHNEVFQLSENQGNKYIKETERMLGRNCESSQFDALISSGYTF---AGS 420

Query: 421 ELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQLTSDSPTEHLLEAVVADV 480
           EL E LG AF +       Q E +    G T   ++ MS SQLT D   E+LL+AVVA+V
Sbjct: 421 ELLEALGSAFKQTN---TGQEELLKSEHGSTMRPTDDMSHSQLTFDPGPENLLDAVVANV 480

Query: 481 CHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYSMGQSQTSFIGEDMQNSL 540
           C    + + D    +S QSLLT  ++ EPS     +  +   +   +Q      D Q   
Sbjct: 481 CQRDGNARDDMMSSRSVQSLLTNMELAEPSGQKKHNIVNP-INSAMNQPPMAEVDTQQ-- 540

Query: 541 SSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPGESCRPRPRDRQLIQDRI 600
           +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PGES RPRPRDRQLIQDRI
Sbjct: 541 NSSDICGAFSSIGFSSTYPSSSSDQFQTSLDIPKKNKKRAKPGESSRPRPRDRQLIQDRI 600

Query: 601 KELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCANMKLHQKENGMLGSSNT 660
           KELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K AN K+ QKE GM      
Sbjct: 601 KELRELVPNGSKCSIDSLLERTIKHMLFLQNVTKHAEKLSKSANEKMQQKETGM------ 660

Query: 661 DQGSSWAVEVGGQLKVCSIIVENLNKNGQILVEMLCEECSHFLEIAEAIRSLGLTILKGI 664
            QGSS AVEVGG L+V SIIVENLNK G +L+EMLCEEC HFLEIA  IRSL L IL+G 
Sbjct: 661 -QGSSCAVEVGGHLQVSSIIVENLNKQGMVLIEMLCEECGHFLEIANVIRSLDLVILRGF 720

BLAST of Tan0012817 vs. TAIR 10
Match: AT2G31280.2 (conserved peptide upstream open reading frame 7 )

HSP 1 Score: 385.6 bits (989), Expect = 8.3e-107
Identity = 283/716 (39.53%), Postives = 368/716 (51.40%), Query Frame = 0

Query: 2   GSTDLHQILKSLCCNSEWKYAVFWKLKHR-ARMVLTWEDGYYDNSEQHDPSESKFYRKTL 61
           G     +ILKS C N++W YAVFW+L HR +RMVLT ED YYD                 
Sbjct: 27  GRGSYREILKSFCFNTDWDYAVFWQLNHRGSRMVLTLEDAYYD----------------- 86

Query: 62  EKFHDG---HYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVNGKHQWITADEQIPNFSSTI 121
              H G   H +HDPLGLAVAKMSYHVYSLGEGIVGQVAV+G+HQW+        F    
Sbjct: 87  ---HHGTNMHGAHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGEHQWV--------FPENY 146

Query: 122 EYCDGWQTQFSAGIKVTEDVNLVARIRNVFLTLQESSAGHMKPMHSC------------- 181
             C+   + F   + V      V ++ ++      S   H +    C             
Sbjct: 147 NNCN---SAFETILVVAVGPCGVVQLGSLCKPKMPSEGLHAEAFPDCSGEVDKAMDVEES 206

Query: 182 ---------KSSGYVADIPSRSLATEKNEVARVSKNVGMELSGTGGIESLQSKPDAINVE 241
                    +S     + PS  L  EK   A+V     +    T G  S  +    +++ 
Sbjct: 207 NILTQYKTRRSDSMPYNTPSSCLVMEK--AAQVVGGREVVQGSTCGSYSGVTFGFPVDLV 266

Query: 242 SLKSQ----VRLLDDRICGGEPSGCKDMSVGLKHKVNVQSQNSMV---SICGNLLPAEKI 301
             K +      ++ D    G  SGCKD S  L   +++  +N ++   S     + AE++
Sbjct: 267 GAKHENQVGTNIIRDAPHVGMTSGCKD-SRDLDPNLHLYMKNHVLNDTSTSALAIEAERL 326

Query: 302 ITNEAYYPMNPHASSVYDGVN--------HDGMFSRT--------NPSEMYLQNDVEASE 361
           IT+++Y    P   S +   +        H+ +F  +          +E  L  + E+S+
Sbjct: 327 ITSQSY----PRLDSTFQATSRTDKESSYHNEVFQLSENQGNKYIKETERMLGRNCESSQ 386

Query: 362 TIDMYPSNTSLKFPAGYELHEVLGPAFLKDALYLDWQTEYVFG--GKTFELSEGMSGSQL 421
              +  S  +    AG EL E LG AF +       Q E +    G T   ++ MS SQL
Sbjct: 387 FDALISSGYTF---AGSELLEALGSAFKQTN---TGQEELLKSEHGSTMRPTDDMSHSQL 446

Query: 422 TSDSPTEHLLEAVVADVCHSGSDVKSDTSLCKSGQSLLTTEKIPEPSTNVTTSAYSEGYS 481
           T D   E+LL+AVVA+VC    + + D    +S QSLLT  ++ EPS     +  +   +
Sbjct: 447 TFDPGPENLLDAVVANVCQRDGNARDDMMSSRSVQSLLTNMELAEPSGQKKHNIVNP-IN 506

Query: 482 MGQSQTSFIGEDMQNSLSSSGVCGVMSPKGFSSTYSGTGSEHLERSSEPAKNSKRRARPG 541
              +Q      D Q   +SS +CG  S  GFSSTY  + S+  + S +  K +K+RA+PG
Sbjct: 507 SAMNQPPMAEVDTQQ--NSSDICGAFSSIGFSSTYPSSSSDQFQTSLDIPKKNKKRAKPG 566

Query: 542 ESCRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQGITKHADKLNKCA 601
           ES RPRPRDRQLIQDRIKELRELVPNG+KCSIDSLLERTIKHMLFLQ +TKHA+KL+K A
Sbjct: 567 ESSRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQNVTKHAEKLSKSA 626

Query: 602 NMKLHQKENGMLGSSNTDQGSSWAVEVGGQLKVCSIIVENLNKNGQILVE---------- 643
           N K+ QKE GM       QGSS AVEVGG L+V SIIVENLNK G +L+E          
Sbjct: 627 NEKMQQKETGM-------QGSSCAVEVGGHLQVSSIIVENLNKQGMVLIEFNLCLNSSPK 686

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P0C7P89.8e-12942.53Transcription factor EMB1444 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=... [more]
Q58G018.3e-12842.33Transcription factor bHLH155 OS=Arabidopsis thaliana OX=3702 GN=BHLH155 PE=1 SV=... [more]
Q9XIN07.9e-6231.29Transcription factor LHW OS=Arabidopsis thaliana OX=3702 GN=LHW PE=1 SV=1[more]
Q7XJU08.8e-4528.47Transcription factor bHLH157 OS=Arabidopsis thaliana OX=3702 GN=BHLH157 PE=1 SV=... [more]
K4PW388.9e-1326.60Protein RICE SALT SENSITIVE 3 OS=Oryza sativa subsp. japonica OX=39947 GN=RSS3 P... [more]
Match NameE-valueIdentityDescription
XP_011659263.10.0e+0088.13transcription factor bHLH155 isoform X1 [Cucumis sativus] >KGN44749.1 hypothetic... [more]
XP_016901084.10.0e+0086.98PREDICTED: transcription factor EMB1444 [Cucumis melo][more]
XP_022155842.10.0e+0086.42transcription factor EMB1444-like isoform X1 [Momordica charantia][more]
XP_022959660.10.0e+0087.35transcription factor EMB1444-like [Cucurbita moschata][more]
KAG7025488.10.0e+0087.50Transcription factor EMB-like protein [Cucurbita argyrosperma subsp. argyrosperm... [more]
Match NameE-valueIdentityDescription
A0A0A0K7510.0e+0088.13BHLH domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G378380 PE=4 S... [more]
A0A1S4DYM60.0e+0086.98transcription factor EMB1444 OS=Cucumis melo OX=3656 GN=LOC103492612 PE=4 SV=1[more]
A0A6J1DQG80.0e+0086.42transcription factor EMB1444-like isoform X1 OS=Momordica charantia OX=3673 GN=L... [more]
A0A6J1H6X50.0e+0087.35transcription factor EMB1444-like OS=Cucurbita moschata OX=3662 GN=LOC111460673 ... [more]
A0A6J1L0P60.0e+0086.19transcription factor EMB1444-like OS=Cucurbita maxima OX=3661 GN=LOC111498031 PE... [more]
Match NameE-valueIdentityDescription
AT1G06150.16.9e-13042.53basic helix-loop-helix (bHLH) DNA-binding superfamily protein [more]
AT1G06150.26.9e-13042.53basic helix-loop-helix (bHLH) DNA-binding superfamily protein [more]
AT2G31280.15.9e-12942.33conserved peptide upstream open reading frame 7 [more]
AT2G31280.31.0e-12541.40conserved peptide upstream open reading frame 7 [more]
AT2G31280.28.3e-10739.53conserved peptide upstream open reading frame 7 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025610Transcription factor MYC/MYB N-terminalPFAMPF14215bHLH-MYC_Ncoord: 6..131
e-value: 2.4E-22
score: 80.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 464..502
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 474..502
NoneNo IPR availablePANTHERPTHR46196:SF1TRANSCRIPTION FACTOR EMB1444-RELATEDcoord: 183..666
coord: 4..193
IPR043561Transcription factor LHW-likePANTHERPTHR46196TRANSCRIPTION FACTOR BHLH155-LIKE ISOFORM X1-RELATEDcoord: 183..666
coord: 4..193
IPR011598Myc-type, basic helix-loop-helix (bHLH) domainPROSITEPS50888BHLHcoord: 487..536
score: 10.772322
IPR036638Helix-loop-helix DNA-binding domain superfamilySUPERFAMILY47459HLH, helix-loop-helix DNA-binding domaincoord: 500..544

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012817.1Tan0012817.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0046983 protein dimerization activity