Tan0014193 (gene) Snake gourd v1

Overview
Name: Tan0014193
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: transcriptional activator DEMETER-like
Location: LG04: 5792803 .. 5809529 (+)
RNA-Seq Expression: Tan0014193
Synteny: Tan0014193
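The Location field packs the linkage group, the start and end coordinates, and the strand into one string. The following is a minimal sketch of how that string could be split and the gene span computed. It assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state explicitly; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG04: 5792803 .. 5809529 (+)'.

    Returns (seqid, start, end, strand). The layout is inferred from the
    single example on this page, so other records may need adjustments.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("LG04: 5792803 .. 5809529 (+)")
span = end - start + 1  # assumes inclusive coordinates
print(seqid, strand, span)  # LG04 + 16727
```

Under the inclusive-coordinate assumption the gene spans 16,727 bp on the forward strand of LG04.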
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GGTCCATCTCTCTCTCTCTTTCTCTCTCTCTCTCTTTCTTCTCTCTCTCTCTCTCTCTCCTGGAAACTCAAGCTAACAACGTAACTCTCTCTCCCCTTCCTCTTTCCATCAGCATTTGGCATTTGATCGACCAAGAAACTCGGTCAGAGTACGGAGTAGCTGATGGGAGGTAATTACAGAATAACAAGCATTGGATCGGTGAGGAGATTGGCGAAAGAGGTAATTCGAATTCATCTTTGACTGACTCTCTCTCATCTTCTGTGCTAATGGACGCAAGAATATCAGAATTAGGGCTCAAATTGAGGCCTCATTATTTTGAACTGTATATTTCTGTTATTTGCTTCGATTGATTCGGTCGAAGCTCAGCTAATTTCTGCAATTCTCTGGTGTTCTATTAATTTTCTTGTTCTCTCTCTGATTGAAGAATTTGAGCCATTGGTATGTACTTAAAGAACTTGATATTTTATCGAACTGGATTTGTTTTGTCATTTACAATTTCGATTCCTTTTGAAATAATCGTAGTAGATGCCTTGGAGGAGGTTGATCCAGACCTCAGTATGACGTTTTTCAGGGGATTAGTCCTGTAAATTTGGTTCTGTATAGAAAATAATCGTACAATTGGGAGGAATTGAAGCTGAAATGATAACTAGACAAATTTTAAGGTGCATATCCTTAATTGAATTGCTCTCTTGATTCTGAGTATATAGTCTTGGTCATTCGGTGCTAATTTCTTACTTAGTGAACATTGAAATGAATTATTTATGGAGTTCTAATCTTCTACTCGGCTTCAATTGTGCGTGAAATTGTTCTGTCATTGTTCTCTGTATCTAGAGGATTGTTGCTTCGGGATATCTTCACATAGCGATACTATTATTTGATTTCTAATTTCTTTTGAATTTACCATCTATGACTTTATTCGAAGTGCGGTTAAAAATATTAATGTCGACTGTTCTTAAATGCTTGATTGTTTAAGTCAAAGTTAACCGTAGAATTTTCCTACTCTCTTCCAATTTTAAACAAATTGGTTCAGAATTTCATTTTTTAAGTTTAGCTTTAACGTGCCAAGGCGCTTGGACAAATCTGCGATATGAACCTATACGATTAATTTATATTCTCTCTCTGGTAAGAATAACTGCAGTAGTAAACTGGCCTAGAAGATTAAACGTTGTTGAGCTGCTTCTTTTTTTTTTTCCTTCTTCCTTTTTTGGAGTTAGCATAAAGTGTAAAACTTTAGTATAAAACGAGTAACTAATACGAAGAAATTTCCCCTTCCATGTTTTTATAATATTATAATGTATTGTTACATGAAAATGGAGGAAACGTGATCTGCCACCGCAAGTTTGAATTTAACAGTCGGCTCTTGAACTATACTTGCGTTTAAGTTATGGTTTCTTCAGAATGTGTTTCATCGCTGTCATATTTTTCCAAGCAAGATTTATAATTTGCCTTTCATTGACGTTATGCCTTTAACAAGAAGAAATTGTGCATTTTTTTAGCAGCTTGGCTCAAAGCAGTTTGGAGAATTGTCTTAATGTTCGTTGCATGCGTTGACAGCATTGAACATGAATTCTCAAGTTAACAGCAGTGGGGATTTCTATGCGGGCAATTTATTGGTTAGAAACCAAAATCTGTATCCAAGTTCGAGGCCATCTAGTAACAATAGCTACGCCCAACATGTACGCCCATGTAAGTTCACGAATTATCTAGCAATAGTTTTATACGATCAGCCTCTTTGTAATTATTTCAAATAATATTTGTATGTTCAGTAAATTTCAAATTGATGTCTTCAGGTAATAGCGTTTTTCCCATCCCAATGACACTTCATATACACACACTTGAGAAATCTTGATTGCTATTGTTACCGTATAAAGAACTGATCCATGATAGCGTCAGTCACAAAAGGAAATAGTGATCTTGTTCGTTACTTTAGACCGACTTATTCTCATCCTTTTTCGTAACCAAATGAAAGTTTACTGTATTTGGCTGAAATTGCATTGGGT
AATGTAACTATTACATGTCATGCATAGATTTTCTTTCCCGTTGTTTCTTATTTCGCTACTAATTGTGATGAGTAATGAAATTGTGTTCTGTGAATTTTACAGATGGACTCCCCATGTATCAACCCAATTATAACTTGAACCCAGCATCAATGACACAAATGAACCAAATGTCGATTTTCACCAACTCAATCCATCCTCCACCGGTCTCCTCGCACCTGGAAAATTTTGCATTTGATCCCATCTCTACACCATCTTTTCTCGTAAGAGATGAAAGTTCAAGCTTCAGAAGGGATGGCGAGGACGACTTCATCAGAATGTTTCAAGATGAACCACCTCGTCAACACTGTGACGAACTTCTACAAAGCATTGTGGAATCATCATGTGTTGGAAATTCTACTCCATTCAAGAGAACGACGGACTTTGGGAAGCAGAGAGATCTTGAAATTGATCTCAACAGGACACCAGAGCAGAGACCACCAAAAAGAAGACAGCATACGCCCATGGTATTTTCAGAAAGGTTTACTGATTTACTTAATCTTCCATTGGCTGAAAGTTTAAGCCTTTACGAGGAAACACAGGAGAACTTTGTCACAGTTCCACTTGATGAAGCAACTCAGAAACGTCATGATGAACTCTTGAAAGATCTCACAGATACATTATCTGCAGCCATTTCTGCACCAACACCGACGAAGGAAGTGGAAAAGGGCAGCGATCAAGTAATTGATCTTAATAAGACACCAGAGCAGAAGACGCCTAGGCGAAGAAAACATAGGCCCAAGGTTATAAAAGAAGGAAAACCCAAAAAGTCTCCTAAACCTGTGACGCCGAAGATTCCCAAGGAAACCCCATCAGGAAAGAGAAAGTATGTACGGAAGAAGAACATCAAAGAAGCAGCTACTCCACCTGCGAATATTGTGGAGATTAAAGATTCGAGCACTGCTACTAAAACAAAATCCTGCCGGAGAGTAATAAATTTTGAGATGGAAAAAACTGGAGATGAGGAGCAGGAAAAGAAACACAATGAGAAGGATGTGCAAGAGGAGAACATGGGGAACTTTTGCTCCATCACGAGACCAAACGTTCCAGACTTCTGCACCCAAAGTAATGGTGTTTGTGGAACAAGTCCAGATGTTCATATCAGTCATCGACTGAGCACAATGGTGGCTGAAAATGTGCGACCTACACTACAAAGTAACCTTGCTCATATGAATCACATGACGACCTCTCTCACATCACAATCCGAAAGGGAGGCAGCTGGAGGCCCATTCAACAAATCAGCATACAACACAGCAGAAGATTTGCTCAATGTTGGAAGAATTATAGATCAGGGAAAAGCAGATCAATATCAAAATGGATTCAGCAACGGATACACACCTGTTCAGCAACACATCCGTGCAGAAGATATGGAGCAATTTGCAAATCATGCTAAAAGAAGTACTTCTTTTAAGGAGCTGATGGGGATGAACTTTGAATATTCTCAAACAATCCCAAATCATCAGTCCAACATCAATGAAGCAAGGGGCTCAAAGAGAGGCCGCCCGCTTACAATTCAGCCAACACCGTCGTGCTCGATAACTACACTGAACTCTTCAGTGTTGTGTCAAGAGGTACTTCAAACGGGTGAGTCCCATAGACAAGGCAGCAGTGTAAACATAGGGCCCTTAGAAATTCCTGCGAAGAAATTTGAGTCTGGACTCTATGCAACCCTCTATAAAAGATACACTACTATTCAAGCAAATGAGGGTTGCCCAAGCCACCTCAACACAAGCGGTTGCAATCCTATCAATTCTGTTGGATTTACAACAGAAATGAAGCAAGCCATGCTAAATAGTCATATTAGAAGTAACCAAAGCAGAGACAGACAAACTAATTGGACTAAAGAAACTATTGGTGACAGGCATATTCACTCGGTCGTCCATGAAAACAATTTTCAAAGGCGACAAATCTCACACAATCTGCATCCAGAAATAGATAGGACGTGTGAGACCACTGGGTTGAAT
AAGGTTACTAGCTATCGCTCATTGATTACTGGTGACAAATGCAACGTGCTCCGACCATATCCGCATCTGAAAGCCTCAGAACAAGGTTATGCATACAGACAATCTGATAACAGTATGCTGACAATAAGGCAGGCTTGCCAACCCATGATATCAGGTTCCCTAACAACAAATCAAGTACAAAAACTAGGCTACTCGTTTGGCTTTCACCAATTCCCTGATAAAACAGCAGGTAAGTATATTTAATTCTTCAGATTAGGAATTCGATCTTGCTGTGAAGTACTAACTATAGCCTCTTGATATTGATGTAGGTCTACTAGAAAATGAGATCATACACAAATTGAAGGGTCTCAATCTTAACGACGATAAAGGAACCACCAGGACAGAGCAAAATGCTATTGTTCCATATAAAGGAAATGGTGCAGTAGTTCCATATGTGGAGTCTGAATATTTAAGGAAACGAAAGGCCCGACCTAGAGTTGACCTTGACCCGGAAACGGAGAGAATATGGAATTTATTAATGGGAAAGGAAGGAAGCGAAGGCATTGAAAATCATGAGAAAGACAAGGAGAAATGGTGGGAAGAGGAACGGAAAGTTTTTCGTGGTCGAGCTGATTCGTTCATTGCGAGGATGCATCTAGTGCAAGGTTCATTTCTTATATCTTCCTATCAAATTGCATGAATCCCAGAGGGAACGACACTGGTTTCCTCACTTGGAAATTTCTATTTGATTTAAATGCCATATCTTTTCCAAAAGGAGGGTTTACAAAATATTTACATTAACAATTCTCAAATCCATATTCAAAAGTTCCTTTTTTTCTTTAAACAGAATGCATAGTTCAAATTAAACAGGACAAAACCAAAATGGTTTTATCAGTTCTTTTCTTAAGTTTTAATCAGGTTGGTTCAGTTTTTGTCTTTTAAATGTTTGGAGTAAATTAAAAAAAAAAACCACCAGACCGGAATAAGAAAAGTATAAATTAAAATATTTCATTATTTGCACGTTTTTTTCCAATAAATTCACATGATTTACAATTACAAGTTTTATTTAATTAATAAATCAACAGGAATTAGATTTCTTTCAAAAAGAAATATTAATTTATTAGAGTTTATTCATGTACATATTTTAATCAAATTTTTAATTCACTTTTTTTTAAGATATGTGAGGTGAGAAGATTTGAATCTTTAATCTCTTAGTCAAGAGTACATATTTATCTCATTTGAACTACGTTGATGTTAGCAATTCACATTTTTTTTTAATTACAAACTTGCAATACACTTATATAATTAAATATGAGGAAATGAGCGTTTTGAAATGGTTGAAAAAATCTGTGATAGGCAAAGAATTTAACCAATTAATTCAGTCTAGTTCCACTTCTGGTATATAAAAAATCAATTTAATTGAATTTATCGTATAAAACCTCAAGCAGCCAAAAGTATTGCAGTTCCAAGTTGACAGGTATTTTTTGAACAAGGATTCTGCATGGCATCAGCAATTAAAGAAAACTATTCTTTAGTATATTTATGGTACATGTATAACAAAATTCTCTTGTTCCCTTCGCAGGGGACAGAAGATTCTCACGATGGAAAGGATCAGTTGTTGACTCAGTTATAGGGGTTTTCCTAACCCAGAATGTTTCGGATCATCTTTCAAGGTACGCTTATCATTTTGGTTAAAACTCTAGATCAAGATTATCTTTGAAGCAAATAATGAACAAGTTGATCACTTTTTTTTTTCTCGTTCTTTACAGCTCTGCGTTCATGTCTCTAGCAGCACGTTTTCCTTTAAAATCTACCAGCAACATTAGAACTCAGGGTGACGTTGAAACGAGCATGGTGGCCAACGAATCAGCAGCCTGTCTACTATATCCGGCAGATTCTATAAGATGGGATAGTCAGGTACTATCCCTGCCAAGGTTTGAGATGCCCCAGACTTCAATAAACCATCAAAACCACAGAGTAAAGTCGGGGACTGAATTTTTTTTCACAGAAGTAGGTAGTCA
AATTGTGGAGGAAGAAGTCATATCCTCACAAGATTCCTTCGACTCCACAATCACACAAGGTACTGGAGGAGCCAGATCATGTTCTGGATCTAACTCAGATGCAGAAGAACCCATTGTAAGTTACAACTCTAGCAGCACTCATTGTTCAAATTTCACAGATATCAAACAAATGGAGACAACCACCTCACTACAGAAGTCCTTCAGTGACTTGAATAGAAGTTCAGTTTTTGATGAAGTCTCAGAACATAAACATTGGCAATTATCAGATGGTAAACAGGATTCACTAACCGAATGGAATGAGATTGACAATCTCAACGGCCATTCCTTAATTAATTTCCTTGTAAATATTGAAAACCAACACAAGCAAGTACCAGTTGCTCCTTCAAACAATCAGTTGCATATGACCCCCGACTGTAGGGTATTGGAGGTTGAAGGCCGTGAAGCATTCAGTGAAGAGAGCATATCTTCTGGGCCATCAATTGTATCTGGATGCTCTACAGAAAAGAATATGACTTGTCATAGCTTAAACATCGGGGATCCCGAGCGAACTTTGGATAAAATCAGCGGTGAAGAGATTGGACGACAAGCAAGATCTCAAGAAAGAATCAGGATGGAGCATAGCGAATCTATCAGTGAGCACTCGGTGCACCTGCAGGGTAATGGTATTCAATTGGGATCTCATTGTGAATATAGGCTTCATGACAATTATGAACCATGTGAGAGGAACAAGACTTCCCCAATAGAAAGCACGTCAGTTACCAATCCTTCCCCAGAATTGGATGCACCAGCCAAAATGCAGCAAAGTGCCCTATCAAATGTCGTAAACGCAACTACGCACACAGAAAAGTTGCTGCCTGGAAACGACAATCAAATAAACTTTTCAAATAATGAGGTCCATTCTCTATCTCAGGCAGATAATGAGGGAAATGTTGTTAGCACTTCAAAAGCAAAAAGAAGAAAGGTCAATAGTGAGAAAAAGAGTGCAGTCGATTGGGATATTTTGAGAAAGCAGGTGGAAGCCAATGGACAAATAAAAGAAAAAGGCAAGGATGCCATGGATTCAATAGACTACGAAGCAATCAGACTAGCCAACGTTCATGAAATTTCAAGTGCTATCAAGGAACGAGGAATGAACAACATGCTAGCTGAACGAATTAAGGTATGTACAAGCAATTGAAGTCTTTAAAGGCCTACTCTTACTTACATTAGCTAATATACCAAGCAGTAACAACTACTCTAAATAATGTTGCAGGAGTTTTTGAATCGTCTGGTAAAAGATCATGGGAGCATTGATCTTGAATGGTTAAGAGATGTTCCCCCAGACAAAGCAAAGTATGACATTAATTTCTTTTTTTTTTAATCAATTCTAGCTTCTTTAGCAGATTTATGTATTAGATATATATTTATATAATATAAAAATTACATACCTGCTGCAGGGATTATCTACTGAGTGTACGAGGATTGGGTTTAAAAAGTGTGGAGTGTGTTCGGCTATTAACACTTCATCACCTTGCTTTCCCAGTATGATTTCTCACAGAAAACCAATCATTTATCTAAAAGCTTAATAGAGGGGATTCTAATATCATTTTAAATCACTTTCTACATAGGTTGACACAAATGTTGGAAGAATAGCCGTTCGGCTTGGTTGGGTTCCTCTCCAACCATTACCCGAGTCACTTCAATTACATCTTCTAGAACTGTAAGTGATACCGGACTGATGAATGGTCGGTTTTCTTGAATCCGAAACTTCTAATCTTCGTTCTTTGTAGGTATCCAGTGCTGGAGTCCATTCAAAAATATCTATGGCCAAGATTATGCAAACTTGATCAGCGAACACTGTAACCCCTCAATACATTCATTATTCATGTTTGATGACTAAAAAATAAGCAACAAATTGAAACCAAATACTTCTGAATTGAATTAAGGTTATACTTATAACTGACCAAGTGTCTATTGCACAGATATGAACTACACTACCAGTTAATCACATTTGGAAAGGT
GAGTCATAAGACTGTAGATGCAATTCGTCTTTTTAAAGTACAAAATGGATATGCCTGATGGAAGTGTTGATTGTAACTGTGATATCAGGTGTTCTGCACAAAGAGCAAGCCAAATTGCAATGCATGTCCAATGAGAGGAGAGTGCAAGCACTTCGCAAGTGCTTTTGCAAGGTTTGTCAATTTCTTTAAAATTGCACGATTTTACAATCCTGTCTATAGCTGACACCATCAGAAACTGGCTTTCCAACAATTTGGTATATCATTGAGTATTGATATGGAGCTACAGTCTAATCTAGGTAAATTGACGAAAACTAATATCATATGTATCAAAGGAAACTAATATTATAACATAAAAGTGTGAAAGTACTATTATGAAATATGTGGGGGAGATTAGTAGAATAGTAGAGTAAGAGCATAGTAGTAATTAGTTAAAAGTACCTTGGTTATAAGGAGTGTTAGGGACCTTAGAGGGTGTAGATCATTATTTTAGTGGAATTGTCCATTATTTGGGAGATAGCATTCTTGAAAAACTATTAATATATTGTAGTTTCCTTTGATATTGCAATATATTACTATCTTGTGTTTTCTAGGTTTGGGTGCCTAATAGAAAGTTCCAATGCGAGAAAATATGTTCATAAATATGGTCCACTCGTTTGTATATAGCCTATAGGCTTACCTTTTCAACACATTACTAAAACATGAACTGTATTTAGTGCAGTGCCCGACTTGCTCTTCCAGCACCGGATGAAAAGCGCATTGTGACTTCGACCAATCCTGTTGCCATGGAGAAGCAGCCAGCTGTGGTCTCGAATCCTTTGCCAATTCTTCCTCCTGAAGGAAGCACTTACACAGAAAGTACCTTGGGCACCAGCAAGTGTGAGCCAATAGTTGAAGTACCAGCGACGCCCGAACCCGAACCTGAACCCAATGAGATAACTGAAAGTGATATTGAAGATTCATTTTATGAGGATCCTGATGAAATTCCTACTATTAAACTCAGCATGGAAGAGTTCAAAACAACTCTACAAAATTACATCCCAGAAGGCGACATGTCCAGAGCTTTAGTTGCCTTGAACCAAGAAGCTGCCTCTATCCCAACGCCAAAATTGAAGAATGTGAGCAGGCTACGGACAGAGCATCAAGTGTAAGTTGTTTTTCTAGCTTGCTCATTTCTTTTTTAATATAAACGCAATAAAGTATTTAAAATAAATAAATAAATAAAGCAACATTATTTTAACACTAGGATAGTAATAAAATAACCAATCTATTTGGTTCTTTGCAGTTTTATGGAGTATTTAGTTAGGTAGAAACAATAGAATCATATAGGGCGAGAGCTATTGGAAGGAGACTTGGAAAGGTCATTACATTAGATTTAATATATTGGCGTAGGCTCCGATTTCTCGCCTTTTTTTTAATTATGAGTTAGGCTTGATCCTTTTGTTGGACTTTTTTTTTTATGCCCTCGTTCATTCTTTCGATTTAAAACCTCGGTTTCTCATTAAAAAAATCTAAAAATAATAATTAATAATTAATGCAGTCTATAACTCCTGTCTCTATCAAAATGCAGGTATGAACTTCCAGATTCACATCCGCTCTTAAAAGAGGTAACATTTCCATTTTACTTTGATATGCCCACATAACGCGCCAGCATTAAATTTATCGTAATAATTTTTTTTTAGCATAACAATTTTATTTAAACATACAGTTGGATAGACGAGAACCTGATGATCCAAGCCCATATCTTCTTGCGATATGGACGCCAGGTAAGCAGTAGTCAGCTCTATAAAGCAATGATTTTCTCACTTCTGTTCCTTTTATGCCTCCTTCCTTTTCCCCTTTTCCATTTTGGTAAACCAGAGAGAGAGTGAACCATCAAAGTTTCATGGTTGATGGAAAATAATAGTTTTAAGACTAAGTTACAAATCTAGTCCCTAAGGCCCCATTTAATAATTATTTGTTTTTTGAAAATTGTACTTATTTTCTCACTATTTCTTTGTCAT
GTTGTCATCTTTCTTATGAAATCATTTGAGTTTTTAGCCAAATTCTAAAAAAAAAAATAGGTTTTTAAAAACTACTTTTTTTTTTGTTTTACAAAACTTGGCTTAGTTTTTTAAAACATGTGTATAAAGTAGATAACAAAATAAGAAAAGTCATATGTAGAAATAGTATCTATAAGCTAACTTTCAAAAACCAAAAACTTAAAATCAAATGATGATCAATCGAAGCCTAAATTTTTTTGATCTTTTAAAATTTATGGTTCTACACATATTCAAATTCATATCTAATAGGTCAATTAATTTTAAAAAAATTTAAATTGTTAGAGACCTATTAGACACGAAATTTAAAATTTAAGAACCAAATAAATACAAATTTCAAAATTCAAGGACAATAGTAGAATTTAACCTAGTTTTAAGCAGGAAAAAAGAGAGAATTATCATCATTAACAATAATTTCAATTTTTCCAAAATCTCTAGGTGAAACAGCTAACTCGATTCAACCACCAGAACAAAGTTGCGGATCCCAAGACCCGGGCAGGCTCTGCAATGAGAAAACATGCTTCACATGCAACAGTAGAAGAGAAGCTAACTCTCAAACAGTCAGAGGAACGCTCCTGGTAAAATCTAATGAATTTAGTAAAAATGATTTGTTTATAAAGTACTTTATCAAAAATGATTTGCATAGTCATTTTAATATTTAGTTTCATGATTTTCATACCATCACAATTGACCTTGATTGATTAAATACATGTTTGAAAGTAATTTTGGACATGACAACTGATTTTAGCTATTTCAAAATCACTTTCAAACATGTTCTAATTGAAATTCATTGCAATGAGATAATTTTTAGTCTCTAAGCCTTTCTATATTTCTCCCTTGCAAAATTGCAGATACCTTGCAGAACCGCAATGCGAGGGAGCTTTCCGCTCAATGGAACATATTTTCAGGTCAACGAGGTAAATCGTGTCCAACACCATCATCTTGGGTAGAAAAAGTAGATGATCTTTAAGTGATTTAATTTGAAAATTCTCTATGCCAATTAAAATGGAGTAATTTCAAATCCAATACATGTTCAAATATCATTTTGGTTCCTAAACTTTCACATTTGTACTATTTTATTCCTTGAACTTTTAAAATATATATTTTATTCCTTAAACTTTCGAGTAAAATAATCATTTTGATCCTTATCTTTATTTTGTTTTTAACAAGTTAGTGACACAACTTTGAATGTGTATACTAACTTGTTAATATGACATGATTTCAAGTTATTTAGATTACAAACATGCATTTAGCAAGTGGATATTAGACATGAAAATAAGTTAGCAACAACACTTATTGTAGAATTTTTAAAATTGATTTCAAGCCATTTAGATTACAAACATGCATTTAGCAAGTGAATATTAGACATGAAAATAATTTAGCAACAACACTTATTGTTGATTTTTAAATTTTAAAAATTCAAGAACTATAATAGACATTTTAAAAGTTCAAAGATGAGATGAAAATTGAACCAACACATTCAAAGGCCATTAGGATCAATCTAGCTCATTAGATTTCAATTTTTCATTCATAAATTATGTTAGTCAAATTCTTATTTTCTGTTTACTTTTAACACAAACATCCGAACAAAATATTTCAATAACATCCAAATAGCCATTTTAACTTTGTTCTCTTTGATTCCAAAGGGGGAAAAAAACAATGAATTTTAACATCAAGATAATATGATTCTGCATATTTGCTCTTTTTAACTTTTTCTTCAAATTTACAGATGTTTGCAGATCATGAATCTAGTACAAATCCTATCGACGTTCCAAGAAAATGGCTATGGAACTTACCTAGACGAACCGTCTACTTCGGAACATCAGTATCAACAATATTTAAGGGTAAGAATACGTAAAATTCAAGGACATGCTGATTGCCGATTTTAAATAATGAAAATCTACTAAATATGACTTGTATGCCACTAGGCCTGGTGACGGAGGAGATCCAGCAATGCTTTTGGA
GAGGTATGTAATTTATTAATTTATTAATTTTCTTGCTTAGAGATCTCTATCCATCTAAATACGAAAAGTAGCAGTGGCCAAAAATTTTAACCTGGTAAGATTTTAAAATATAAAATTCACAGAAAGTACAGGATCCTCTTACTCAAGATAAAATTTCTATGCCACAATCCTCCAAATTAGGCCGCCAAACACCAAAATTTTATCACCATGGCCATCCAATGGTCAGATATGGTTGTATTTGCCCTTCATTACGCATTTTTTAAATAGAAAAATTAAAAATAAAATTAACAAAACCATTTTTTTTAACACAACAAAACCATTTTAACGGGAGGAACAAATGATTTTAATCGCATCCGAAGAGCCACAGAATTTCTTGTTATCTTGAATTGACAGGCTTTTAGGATCTAATTTAATGATTCTTTTTTTATTCTTTGTACCATTAAAATTCCATTGTACAACCAACTGCGTACTATTGAAATAAAACCAAGCATTTCTGGACTTAATTTTGAAGAGGACAAATTGGTTTTTAGCTAACAAGATTATGGTTGTTTTACAATCGACCGTTATAATATACTTAAAGGGCATGTTAGAAAATGGTTTTTCAAGGGTTGAAAACAATTTTGTCATATTAGAAATTACTCTAAAATATGTTTTTAATTATTCAAAAAATACTTTTATAATATTAAATTCAGTAAGAAGTTAATTTCAAATAACTAAATGCATTTTTGGAATGATTCAAACATGACAAAAGCAATTTAATCATTTCAAAATCACTCTCAAATAGGTTTAGGTTTACTCTATGATTTCTTTTGTTTTATTTCATGTTTTTTGGATTATATTTGCTATGTTCATTTTTTAAATAATTAGAATACAAAAAAATAAATAAATAGATATAAAATGTTTTAACAAAAAAAGTGAAAATGCTACCGATTATATAATTATGGTTCAACTTTTCAGAAACATAAATAAAAATACCATCATGCCATTTTTTACTAAAATTGATCAATTCAGTATATTGTTTTGAAATTATTAGCTTTTTTTAATCAAGAAATCAAGAAATAATCTAAAAGTTTAAAAAGAACCAGAATATTCATCTCATGTTTTTCCCTTATTTTAAAAATATAACAAAAGACATAAACATAACGATAACAAACTTGCCGAGCCGTTTGAAAATAAAAATTACAAAAAACATATTCTTTTGCCTAAAAAGTGTATAAATATTGTGATTTTAAAAAATAACTTTGAAAACTGAAGTAATAATTTTTTATTTTAGTCATTAATATTTATAAATCAACTATTCATAGTCATAGGATATCATTTCAAGTACAACTGACTTTACGTATAGAGGAAAAATTGTTTGGCTTAAAACAAATCATTCATAACTTTTAAAGTAATAATTTTTTATTTTAATTATTAATATTTTTAAATCAACTATTCATAGACATAGGGTATCATTTCAAATACAATTGACTTTGCATAGAGGAAAAAATCTTTGGCTTAAAACAAATCATTCATTACTTTTAATAATTTCAAACATGGTGGAAAAATTTTATTCATTTTAATACATTAAATTAAAGTAATCATCCAAATGTGAATAAAAAAAGTGTTTTACGAATCAATTTAATGAAACTAATTTTTCAACGGTATTAATTTTAGCCGCTGATACTTTCTCAATGTCACCATGCTTTGATGATCTGGAAAAAAAATAGAATGAAAATTTAAAACGAAAAATAGCAATATTTGGAGCAGCCAGAGTACAGTCACCAAAAAACATATCATAGTTCTTTAATTATTTTACTCGAGAAAATATTATTTTATATATATACCTCGTCATTGCCTATTGTTAACATTAATACTTAAGACACGTTATAAACTTCATCAATGAAAGTTTATAAAGAGACAACTATGAGAACTGATCAAAACTAGAATTATCTTAGAATCAATGTCTAAATAATGAAATTTTGAAAAAGTAGAGAAAGAATAGGAACTTTTATCCAGAC
CACTTCATCTTAAAAACAAGATCGTTAACTTTTTCGGAATAAAATGCCTAATCTTAAATAGTCTAAATTAAAATTTGAAAAAAACTTAAAAAGTCTAAATTAAAATTTGAAAACATAAAAACAGAACAAAACGCAATCTGCTTTACTAAATAATGGGAGGTTTCGGCGGTATTTATAAAATACGGCAGAAAAGATCGCTAATGCACGTTATTGAATTCTTAAGCCCTTTTTCTTTGTTTCTCACGTCTACTTTTATTTTTATTTTTTATGTTCTTTTTTAGTATCTGAGCACATAGGAAATTACCCAAAAAAAAATGCATTTTTTTTAGGAAAAAAAATGCATATTATGATATAAAGCTTCGAAATAGTAGGATATCTTTTTTTTTTTTTTTGAAACTTGAATAGTAGGACATCTTATTTAGGCTCTTTCTGATCAACTCATCATGAATGAGCTAAGATTTTGTGACTAAATGTTATAGGAACAATCATTTTGAAGATATATTAGGTTTTTTTTAGTTCAACATGTGAAGGAGAAAAATACAAACTTTTAATCTCATGGTCAAAGACATGCTTTAACTAGTAGAGCCATACTCGAGTTAGGAGAAGCTTGAGAGAAAGAAACTCAAACTAAGTTGTACTACCAGAAGTTGGAAAAAGAGAATGGAGAATGCTGGCAAGAAAATCTCCAAGGGCAATGAGGGGAAATGTTATTCGCAGGGCAAAGTCTAGAATCATGTTACTTAAGAGACTAGTACCTTGAGTCATATAAATAAGTTCAACATCCAATGAGTTACCTAAATAACCAAATATGGTAAGGTCTATCAAGATAGAATTTGCATAAGAGAAACAGAGAGACGAGAAGCTATGGTTTCAACCAAAAGACCCAAGCACAATGAAACCAATGTTCGAGGAGCCAAGAAAAAAGGGGAAAAAAGATTCAAGGAGGAATGAGAACTGTATAGATCAACTAGAGAACACGAGTTGTAAAGAGATATTGCTAGTTGTTTCTAACGGGCCAATAGGTTGAATCACATTTGGATTCTTTATACTCGACAGATTTTGAGAGGCTAGAGTTTGAGTTTAACCTGGAAGTGCTTCTTCAATGGGTCGGAGTTTGGAAGAGGTCCAGTGTATTGAGTGGTTATCCAACTACTACTAAACCCACCCTCAAAGACAAGATATGTTTCAAAGGCATAGTAAAGTTAGGACTGCTATTGGAATTAAATAATAATGATCAATGAGTTGCAAAGAAGGTTTGGGATTTAGGCACAATTTACTTTTAGTTTTGAAAAAAGAAATAAGCCTCTCGGCTACTTCCTAACCATTTGTGAAATTCATCTCGTGTAGCAATTTTTCCTTGGCCTGTTTCCATCAGATTCCTTTCCAATTTCCCACACCCAAGAACTCTGGAATGCTTCTTATCTGAACTCAATGTGATCATCACCAAAAAAAAAAAAAAGAAAAAGAAAAAAGAAAAAAGAAAAAGGATTTGGTAGCTGAATGTTGTTAGAGCAGTCTTGTGGGAGATATGGAAGAGGAAACAAAAGGATTTTACAAGAGATTGGGAGACCTTTCGGATATAATTGGGAGGAGACGGCAACTTCGGCGTCTTCCATAAACATTTTTGAAATCATAATACTTTACATCCATCCGTGCCAAATGCAGAGGTCTCTCGTAATACTTTTGAGCCTCCTTTTGGGGATTTCTCGTCCCCCCTCTTTTGTTTTGTTATTTTCTTCAACATTAAGCCTCGGTTTCTTATCAAAGAAATACTGTTTTGGGGAGTTCTTCCTACCTCTCTGTTTTTCATTTTATCTATCCATCAGTGTTCCTTAATAAAGTAGTTAAGAAATCCAATAAGATTTGGGAGCTCAGAGAAGCGAAGGCCTTCCGCACACCACACCAATCAGTGAGAAGAGGTATCAGCCTGTAATGGTCTAGTCTACGTGCTTATGATGAGGTTTGTCTCATTATTTAGAAGAAACATACAAAGGATTTCA
TACCCAAAGTGAGAAGATGATCACCTTGATAATTGGATATGCATTCCCAATTTCCCCTGATGTACCAAGGGTTCTGATCATCCGTAACTGAGGTAGAAAAAAATCAAAGAGATCAAAAAAACATGAGAAATATCTGAGTGAATAAAATTTTCTAACTCAAGGTATTTTTGCATGCTATATGGATAATAATGACAACATGGCTAGATTAGGGTGCATCAAATATCTAGACTTTCTTATTTTCTTATGGTGCGGTTTTGTCGCACTATCAACAATGATTGAGTCTCAATCTATATTTCCGTTTGAGTTTGTGCACGAGCTAAATTTTGAGTTATATTGCAGGCTTTGTTTGTGTCAGAGGATTTGAACGAAAAACACGGGCACCTCGACCTCTGATTGCAAGATTGCACTTTCCAGCAAGCAAGCTGGCCAAGATGAAAAATGGACATACAGAATAGCCAACTTTTTATGCAAAGGGCAGAAGGTAAGAGCAAGAGCAACAGAAAGGATTACTGACTGCTAAAGGAGATTTTCGACTAACCTATTGTTCTAGATTTTTAATTATTCTTGGTCTAACGTGGAATGCATTCACTGCATGTTAGCCAAGCGGCAGAGGAAAAAGTTACCAAATTGAAATTTTAGAATCCATAAGTGAGATGTAGTCTCTACCCTATTGATATTCTTATGATTATTCTCAATTCATGCTCGAAACAGAGCCCATTCTTCAAAA

mRNA sequence

GGTCCATCTCTCTCTCTCTTTCTCTCTCTCTCTCTTTCTTCTCTCTCTCTCTCTCTCTCCTGGAAACTCAAGCTAACAACGTAACTCTCTCTCCCCTTCCTCTTTCCATCAGCATTTGGCATTTGATCGACCAAGAAACTCGGTCAGAGTACGGAGTAGCTGATGGGAGGTAATTACAGAATAACAAGCATTGGATCGGTGAGGAGATTGGCGAAAGAGCTTGGCTCAAAGCAGTTTGGAGAATTGTCTTAATGTTCGTTGCATGCGTTGACAGCATTGAACATGAATTCTCAAGTTAACAGCAGTGGGGATTTCTATGCGGGCAATTTATTGGTTAGAAACCAAAATCTGTATCCAAGTTCGAGGCCATCTAGTAACAATAGCTACGCCCAACATGTACGCCCATATGGACTCCCCATGTATCAACCCAATTATAACTTGAACCCAGCATCAATGACACAAATGAACCAAATGTCGATTTTCACCAACTCAATCCATCCTCCACCGGTCTCCTCGCACCTGGAAAATTTTGCATTTGATCCCATCTCTACACCATCTTTTCTCGTAAGAGATGAAAGTTCAAGCTTCAGAAGGGATGGCGAGGACGACTTCATCAGAATGTTTCAAGATGAACCACCTCGTCAACACTGTGACGAACTTCTACAAAGCATTGTGGAATCATCATGTGTTGGAAATTCTACTCCATTCAAGAGAACGACGGACTTTGGGAAGCAGAGAGATCTTGAAATTGATCTCAACAGGACACCAGAGCAGAGACCACCAAAAAGAAGACAGCATACGCCCATGGTATTTTCAGAAAGGTTTACTGATTTACTTAATCTTCCATTGGCTGAAAGTTTAAGCCTTTACGAGGAAACACAGGAGAACTTTGTCACAGTTCCACTTGATGAAGCAACTCAGAAACGTCATGATGAACTCTTGAAAGATCTCACAGATACATTATCTGCAGCCATTTCTGCACCAACACCGACGAAGGAAGTGGAAAAGGGCAGCGATCAAGTAATTGATCTTAATAAGACACCAGAGCAGAAGACGCCTAGGCGAAGAAAACATAGGCCCAAGGTTATAAAAGAAGGAAAACCCAAAAAGTCTCCTAAACCTGTGACGCCGAAGATTCCCAAGGAAACCCCATCAGGAAAGAGAAAGTATGTACGGAAGAAGAACATCAAAGAAGCAGCTACTCCACCTGCGAATATTGTGGAGATTAAAGATTCGAGCACTGCTACTAAAACAAAATCCTGCCGGAGAGTAATAAATTTTGAGATGGAAAAAACTGGAGATGAGGAGCAGGAAAAGAAACACAATGAGAAGGATGTGCAAGAGGAGAACATGGGGAACTTTTGCTCCATCACGAGACCAAACGTTCCAGACTTCTGCACCCAAAGTAATGGTGTTTGTGGAACAAGTCCAGATGTTCATATCAGTCATCGACTGAGCACAATGGTGGCTGAAAATGTGCGACCTACACTACAAAGTAACCTTGCTCATATGAATCACATGACGACCTCTCTCACATCACAATCCGAAAGGGAGGCAGCTGGAGGCCCATTCAACAAATCAGCATACAACACAGCAGAAGATTTGCTCAATGTTGGAAGAATTATAGATCAGGGAAAAGCAGATCAATATCAAAATGGATTCAGCAACGGATACACACCTGTTCAGCAACACATCCGTGCAGAAGATATGGAGCAATTTGCAAATCATGCTAAAAGAAGTACTTCTTTTAAGGAGCTGATGGGGATGAACTTTGAATATTCTCAAACAATCCCAAATCATCAGTCCAACATCAATGAAGCAAGGGGCTCAAAGAGAGGCCGCCCGCTTACAATTCAGCCAACACCGTCGTGCTCGATAACTACACTGAACTCTTCAGTGTTGTGTCAAGAGGTACTTCAAACGGGTGAGTCCCATAGACAAGGCAGCAGTGTAAACATAGGGCCCTTAGAAATTCCTGCGAAGAAATTTGAGTCTGGACT
CTATGCAACCCTCTATAAAAGATACACTACTATTCAAGCAAATGAGGGTTGCCCAAGCCACCTCAACACAAGCGGTTGCAATCCTATCAATTCTGTTGGATTTACAACAGAAATGAAGCAAGCCATGCTAAATAGTCATATTAGAAGTAACCAAAGCAGAGACAGACAAACTAATTGGACTAAAGAAACTATTGGTGACAGGCATATTCACTCGGTCGTCCATGAAAACAATTTTCAAAGGCGACAAATCTCACACAATCTGCATCCAGAAATAGATAGGACGTGTGAGACCACTGGGTTGAATAAGGTTACTAGCTATCGCTCATTGATTACTGGTGACAAATGCAACGTGCTCCGACCATATCCGCATCTGAAAGCCTCAGAACAAGGTTATGCATACAGACAATCTGATAACAGTATGCTGACAATAAGGCAGGCTTGCCAACCCATGATATCAGGTTCCCTAACAACAAATCAAGTACAAAAACTAGGCTACTCGTTTGGCTTTCACCAATTCCCTGATAAAACAGCAGGTCTACTAGAAAATGAGATCATACACAAATTGAAGGGTCTCAATCTTAACGACGATAAAGGAACCACCAGGACAGAGCAAAATGCTATTGTTCCATATAAAGGAAATGGTGCAGTAGTTCCATATGTGGAGTCTGAATATTTAAGGAAACGAAAGGCCCGACCTAGAGTTGACCTTGACCCGGAAACGGAGAGAATATGGAATTTATTAATGGGAAAGGAAGGAAGCGAAGGCATTGAAAATCATGAGAAAGACAAGGAGAAATGGTGGGAAGAGGAACGGAAAGTTTTTCGTGGTCGAGCTGATTCGTTCATTGCGAGGATGCATCTAGTGCAAGGGGACAGAAGATTCTCACGATGGAAAGGATCAGTTGTTGACTCAGTTATAGGGGTTTTCCTAACCCAGAATGTTTCGGATCATCTTTCAAGCTCTGCGTTCATGTCTCTAGCAGCACGTTTTCCTTTAAAATCTACCAGCAACATTAGAACTCAGGGTGACGTTGAAACGAGCATGGTGGCCAACGAATCAGCAGCCTGTCTACTATATCCGGCAGATTCTATAAGATGGGATAGTCAGGTACTATCCCTGCCAAGGTTTGAGATGCCCCAGACTTCAATAAACCATCAAAACCACAGAGTAAAGTCGGGGACTGAATTTTTTTTCACAGAAGTAGGTAGTCAAATTGTGGAGGAAGAAGTCATATCCTCACAAGATTCCTTCGACTCCACAATCACACAAGGTACTGGAGGAGCCAGATCATGTTCTGGATCTAACTCAGATGCAGAAGAACCCATTGTAAGTTACAACTCTAGCAGCACTCATTGTTCAAATTTCACAGATATCAAACAAATGGAGACAACCACCTCACTACAGAAGTCCTTCAGTGACTTGAATAGAAGTTCAGTTTTTGATGAAGTCTCAGAACATAAACATTGGCAATTATCAGATGGTAAACAGGATTCACTAACCGAATGGAATGAGATTGACAATCTCAACGGCCATTCCTTAATTAATTTCCTTGTAAATATTGAAAACCAACACAAGCAAGTACCAGTTGCTCCTTCAAACAATCAGTTGCATATGACCCCCGACTGTAGGGTATTGGAGGTTGAAGGCCGTGAAGCATTCAGTGAAGAGAGCATATCTTCTGGGCCATCAATTGTATCTGGATGCTCTACAGAAAAGAATATGACTTGTCATAGCTTAAACATCGGGGATCCCGAGCGAACTTTGGATAAAATCAGCGGTGAAGAGATTGGACGACAAGCAAGATCTCAAGAAAGAATCAGGATGGAGCATAGCGAATCTATCAGTGAGCACTCGGTGCACCTGCAGGGTAATGGTATTCAATTGGGATCTCATTGTGAATATAGGCTTCATGACAATTATGAACCATGTGAGAGGAACAAGACTTCCCCAATAGAAAGCACGTCAGTTACCAATCCTTCCCCAGAATTGGATGCACCAG
CCAAAATGCAGCAAAGTGCCCTATCAAATGTCGTAAACGCAACTACGCACACAGAAAAGTTGCTGCCTGGAAACGACAATCAAATAAACTTTTCAAATAATGAGGTCCATTCTCTATCTCAGGCAGATAATGAGGGAAATGTTGTTAGCACTTCAAAAGCAAAAAGAAGAAAGGTCAATAGTGAGAAAAAGAGTGCAGTCGATTGGGATATTTTGAGAAAGCAGGTGGAAGCCAATGGACAAATAAAAGAAAAAGGCAAGGATGCCATGGATTCAATAGACTACGAAGCAATCAGACTAGCCAACGTTCATGAAATTTCAAGTGCTATCAAGGAACGAGGAATGAACAACATGCTAGCTGAACGAATTAAGGAGTTTTTGAATCGTCTGGTAAAAGATCATGGGAGCATTGATCTTGAATGGTTAAGAGATGTTCCCCCAGACAAAGCAAAGGATTATCTACTGAGTGTACGAGGATTGGGTTTAAAAAGTGTGGAGTGTGTTCGGCTATTAACACTTCATCACCTTGCTTTCCCAGTTGACACAAATGTTGGAAGAATAGCCGTTCGGCTTGGTTGGGTTCCTCTCCAACCATTACCCGAGTCACTTCAATTACATCTTCTAGAACTGTATCCAGTGCTGGAGTCCATTCAAAAATATCTATGGCCAAGATTATGCAAACTTGATCAGCGAACACTATATGAACTACACTACCAGTTAATCACATTTGGAAAGGTGTTCTGCACAAAGAGCAAGCCAAATTGCAATGCATGTCCAATGAGAGGAGAGTGCAAGCACTTCGCAAGTGCTTTTGCAAGTGCCCGACTTGCTCTTCCAGCACCGGATGAAAAGCGCATTGTGACTTCGACCAATCCTGTTGCCATGGAGAAGCAGCCAGCTGTGGTCTCGAATCCTTTGCCAATTCTTCCTCCTGAAGGAAGCACTTACACAGAAAGTACCTTGGGCACCAGCAAGTGTGAGCCAATAGTTGAAGTACCAGCGACGCCCGAACCCGAACCTGAACCCAATGAGATAACTGAAAGTGATATTGAAGATTCATTTTATGAGGATCCTGATGAAATTCCTACTATTAAACTCAGCATGGAAGAGTTCAAAACAACTCTACAAAATTACATCCCAGAAGGCGACATGTCCAGAGCTTTAGTTGCCTTGAACCAAGAAGCTGCCTCTATCCCAACGCCAAAATTGAAGAATGTGAGCAGGCTACGGACAGAGCATCAAGTGTATGAACTTCCAGATTCACATCCGCTCTTAAAAGAGTTGGATAGACGAGAACCTGATGATCCAAGCCCATATCTTCTTGCGATATGGACGCCAGGTGAAACAGCTAACTCGATTCAACCACCAGAACAAAGTTGCGGATCCCAAGACCCGGGCAGGCTCTGCAATGAGAAAACATGCTTCACATGCAACAGTAGAAGAGAAGCTAACTCTCAAACAGTCAGAGGAACGCTCCTGATACCTTGCAGAACCGCAATGCGAGGGAGCTTTCCGCTCAATGGAACATATTTTCAGGTCAACGAGATGTTTGCAGATCATGAATCTAGTACAAATCCTATCGACGTTCCAAGAAAATGGCTATGGAACTTACCTAGACGAACCGTCTACTTCGGAACATCAGTATCAACAATATTTAAGGGCCTGGTGACGGAGGAGATCCAGCAATGCTTTTGGAGAGGCTTTGTTTGTGTCAGAGGATTTGAACGAAAAACACGGGCACCTCGACCTCTGATTGCAAGATTGCACTTTCCAGCAAGCAAGCTGGCCAAGATGAAAAATGGACATACAGAATAGCCAACTTTTTATGCAAAGGGCAGAAGGTAAGAGCAAGAGCAACAGAAAGGATTACTGACTGCTAAAGGAGATTTTCGACTAACCTATTGTTCTAGATTTTTAATTATTCTTGGTCTAACGTGGAATGCATTCACTGCATGTTAGCCAAGCGGCAGAGGAAAAAGTTACCAAATTGAAATTTTAGAA
TCCATAAGTGAGATGTAGTCTCTACCCTATTGATATTCTTATGATTATTCTCAATTCATGCTCGAAACAGAGCCCATTCTTCAAAA

Coding sequence (CDS)

ATGAATTCTCAAGTTAACAGCAGTGGGGATTTCTATGCGGGCAATTTATTGGTTAGAAACCAAAATCTGTATCCAAGTTCGAGGCCATCTAGTAACAATAGCTACGCCCAACATGTACGCCCATATGGACTCCCCATGTATCAACCCAATTATAACTTGAACCCAGCATCAATGACACAAATGAACCAAATGTCGATTTTCACCAACTCAATCCATCCTCCACCGGTCTCCTCGCACCTGGAAAATTTTGCATTTGATCCCATCTCTACACCATCTTTTCTCGTAAGAGATGAAAGTTCAAGCTTCAGAAGGGATGGCGAGGACGACTTCATCAGAATGTTTCAAGATGAACCACCTCGTCAACACTGTGACGAACTTCTACAAAGCATTGTGGAATCATCATGTGTTGGAAATTCTACTCCATTCAAGAGAACGACGGACTTTGGGAAGCAGAGAGATCTTGAAATTGATCTCAACAGGACACCAGAGCAGAGACCACCAAAAAGAAGACAGCATACGCCCATGGTATTTTCAGAAAGGTTTACTGATTTACTTAATCTTCCATTGGCTGAAAGTTTAAGCCTTTACGAGGAAACACAGGAGAACTTTGTCACAGTTCCACTTGATGAAGCAACTCAGAAACGTCATGATGAACTCTTGAAAGATCTCACAGATACATTATCTGCAGCCATTTCTGCACCAACACCGACGAAGGAAGTGGAAAAGGGCAGCGATCAAGTAATTGATCTTAATAAGACACCAGAGCAGAAGACGCCTAGGCGAAGAAAACATAGGCCCAAGGTTATAAAAGAAGGAAAACCCAAAAAGTCTCCTAAACCTGTGACGCCGAAGATTCCCAAGGAAACCCCATCAGGAAAGAGAAAGTATGTACGGAAGAAGAACATCAAAGAAGCAGCTACTCCACCTGCGAATATTGTGGAGATTAAAGATTCGAGCACTGCTACTAAAACAAAATCCTGCCGGAGAGTAATAAATTTTGAGATGGAAAAAACTGGAGATGAGGAGCAGGAAAAGAAACACAATGAGAAGGATGTGCAAGAGGAGAACATGGGGAACTTTTGCTCCATCACGAGACCAAACGTTCCAGACTTCTGCACCCAAAGTAATGGTGTTTGTGGAACAAGTCCAGATGTTCATATCAGTCATCGACTGAGCACAATGGTGGCTGAAAATGTGCGACCTACACTACAAAGTAACCTTGCTCATATGAATCACATGACGACCTCTCTCACATCACAATCCGAAAGGGAGGCAGCTGGAGGCCCATTCAACAAATCAGCATACAACACAGCAGAAGATTTGCTCAATGTTGGAAGAATTATAGATCAGGGAAAAGCAGATCAATATCAAAATGGATTCAGCAACGGATACACACCTGTTCAGCAACACATCCGTGCAGAAGATATGGAGCAATTTGCAAATCATGCTAAAAGAAGTACTTCTTTTAAGGAGCTGATGGGGATGAACTTTGAATATTCTCAAACAATCCCAAATCATCAGTCCAACATCAATGAAGCAAGGGGCTCAAAGAGAGGCCGCCCGCTTACAATTCAGCCAACACCGTCGTGCTCGATAACTACACTGAACTCTTCAGTGTTGTGTCAAGAGGTACTTCAAACGGGTGAGTCCCATAGACAAGGCAGCAGTGTAAACATAGGGCCCTTAGAAATTCCTGCGAAGAAATTTGAGTCTGGACTCTATGCAACCCTCTATAAAAGATACACTACTATTCAAGCAAATGAGGGTTGCCCAAGCCACCTCAACACAAGCGGTTGCAATCCTATCAATTCTGTTGGATTTACAACAGAAATGAAGCAAGCCATGCTAAATAGTCATATTAGAAGTAACCAAAGCAGAGACAGACAAACTAATTGGACTAAAGAAACTATTGGTGACAGGCATATTCACTCGGTCGTCCATGAAAACAATTTTCAAAGGCGACAAATCTCACACAATCTGCATCCAGAAATAGATAGGAC
GTGTGAGACCACTGGGTTGAATAAGGTTACTAGCTATCGCTCATTGATTACTGGTGACAAATGCAACGTGCTCCGACCATATCCGCATCTGAAAGCCTCAGAACAAGGTTATGCATACAGACAATCTGATAACAGTATGCTGACAATAAGGCAGGCTTGCCAACCCATGATATCAGGTTCCCTAACAACAAATCAAGTACAAAAACTAGGCTACTCGTTTGGCTTTCACCAATTCCCTGATAAAACAGCAGGTCTACTAGAAAATGAGATCATACACAAATTGAAGGGTCTCAATCTTAACGACGATAAAGGAACCACCAGGACAGAGCAAAATGCTATTGTTCCATATAAAGGAAATGGTGCAGTAGTTCCATATGTGGAGTCTGAATATTTAAGGAAACGAAAGGCCCGACCTAGAGTTGACCTTGACCCGGAAACGGAGAGAATATGGAATTTATTAATGGGAAAGGAAGGAAGCGAAGGCATTGAAAATCATGAGAAAGACAAGGAGAAATGGTGGGAAGAGGAACGGAAAGTTTTTCGTGGTCGAGCTGATTCGTTCATTGCGAGGATGCATCTAGTGCAAGGGGACAGAAGATTCTCACGATGGAAAGGATCAGTTGTTGACTCAGTTATAGGGGTTTTCCTAACCCAGAATGTTTCGGATCATCTTTCAAGCTCTGCGTTCATGTCTCTAGCAGCACGTTTTCCTTTAAAATCTACCAGCAACATTAGAACTCAGGGTGACGTTGAAACGAGCATGGTGGCCAACGAATCAGCAGCCTGTCTACTATATCCGGCAGATTCTATAAGATGGGATAGTCAGGTACTATCCCTGCCAAGGTTTGAGATGCCCCAGACTTCAATAAACCATCAAAACCACAGAGTAAAGTCGGGGACTGAATTTTTTTTCACAGAAGTAGGTAGTCAAATTGTGGAGGAAGAAGTCATATCCTCACAAGATTCCTTCGACTCCACAATCACACAAGGTACTGGAGGAGCCAGATCATGTTCTGGATCTAACTCAGATGCAGAAGAACCCATTGTAAGTTACAACTCTAGCAGCACTCATTGTTCAAATTTCACAGATATCAAACAAATGGAGACAACCACCTCACTACAGAAGTCCTTCAGTGACTTGAATAGAAGTTCAGTTTTTGATGAAGTCTCAGAACATAAACATTGGCAATTATCAGATGGTAAACAGGATTCACTAACCGAATGGAATGAGATTGACAATCTCAACGGCCATTCCTTAATTAATTTCCTTGTAAATATTGAAAACCAACACAAGCAAGTACCAGTTGCTCCTTCAAACAATCAGTTGCATATGACCCCCGACTGTAGGGTATTGGAGGTTGAAGGCCGTGAAGCATTCAGTGAAGAGAGCATATCTTCTGGGCCATCAATTGTATCTGGATGCTCTACAGAAAAGAATATGACTTGTCATAGCTTAAACATCGGGGATCCCGAGCGAACTTTGGATAAAATCAGCGGTGAAGAGATTGGACGACAAGCAAGATCTCAAGAAAGAATCAGGATGGAGCATAGCGAATCTATCAGTGAGCACTCGGTGCACCTGCAGGGTAATGGTATTCAATTGGGATCTCATTGTGAATATAGGCTTCATGACAATTATGAACCATGTGAGAGGAACAAGACTTCCCCAATAGAAAGCACGTCAGTTACCAATCCTTCCCCAGAATTGGATGCACCAGCCAAAATGCAGCAAAGTGCCCTATCAAATGTCGTAAACGCAACTACGCACACAGAAAAGTTGCTGCCTGGAAACGACAATCAAATAAACTTTTCAAATAATGAGGTCCATTCTCTATCTCAGGCAGATAATGAGGGAAATGTTGTTAGCACTTCAAAAGCAAAAAGAAGAAAGGTCAATAGTGAGAAAAAGAGTGCAGTCGATTGGGATATTTTGAGAAAGCAGGTGGAAGCCAATGGACAAATAAAAGAAAAAGGCAAGGATGCCATGGATTCAATAGACT
ACGAAGCAATCAGACTAGCCAACGTTCATGAAATTTCAAGTGCTATCAAGGAACGAGGAATGAACAACATGCTAGCTGAACGAATTAAGGAGTTTTTGAATCGTCTGGTAAAAGATCATGGGAGCATTGATCTTGAATGGTTAAGAGATGTTCCCCCAGACAAAGCAAAGGATTATCTACTGAGTGTACGAGGATTGGGTTTAAAAAGTGTGGAGTGTGTTCGGCTATTAACACTTCATCACCTTGCTTTCCCAGTTGACACAAATGTTGGAAGAATAGCCGTTCGGCTTGGTTGGGTTCCTCTCCAACCATTACCCGAGTCACTTCAATTACATCTTCTAGAACTGTATCCAGTGCTGGAGTCCATTCAAAAATATCTATGGCCAAGATTATGCAAACTTGATCAGCGAACACTATATGAACTACACTACCAGTTAATCACATTTGGAAAGGTGTTCTGCACAAAGAGCAAGCCAAATTGCAATGCATGTCCAATGAGAGGAGAGTGCAAGCACTTCGCAAGTGCTTTTGCAAGTGCCCGACTTGCTCTTCCAGCACCGGATGAAAAGCGCATTGTGACTTCGACCAATCCTGTTGCCATGGAGAAGCAGCCAGCTGTGGTCTCGAATCCTTTGCCAATTCTTCCTCCTGAAGGAAGCACTTACACAGAAAGTACCTTGGGCACCAGCAAGTGTGAGCCAATAGTTGAAGTACCAGCGACGCCCGAACCCGAACCTGAACCCAATGAGATAACTGAAAGTGATATTGAAGATTCATTTTATGAGGATCCTGATGAAATTCCTACTATTAAACTCAGCATGGAAGAGTTCAAAACAACTCTACAAAATTACATCCCAGAAGGCGACATGTCCAGAGCTTTAGTTGCCTTGAACCAAGAAGCTGCCTCTATCCCAACGCCAAAATTGAAGAATGTGAGCAGGCTACGGACAGAGCATCAAGTGTATGAACTTCCAGATTCACATCCGCTCTTAAAAGAGTTGGATAGACGAGAACCTGATGATCCAAGCCCATATCTTCTTGCGATATGGACGCCAGGTGAAACAGCTAACTCGATTCAACCACCAGAACAAAGTTGCGGATCCCAAGACCCGGGCAGGCTCTGCAATGAGAAAACATGCTTCACATGCAACAGTAGAAGAGAAGCTAACTCTCAAACAGTCAGAGGAACGCTCCTGATACCTTGCAGAACCGCAATGCGAGGGAGCTTTCCGCTCAATGGAACATATTTTCAGGTCAACGAGATGTTTGCAGATCATGAATCTAGTACAAATCCTATCGACGTTCCAAGAAAATGGCTATGGAACTTACCTAGACGAACCGTCTACTTCGGAACATCAGTATCAACAATATTTAAGGGCCTGGTGACGGAGGAGATCCAGCAATGCTTTTGGAGAGGCTTTGTTTGTGTCAGAGGATTTGAACGAAAAACACGGGCACCTCGACCTCTGATTGCAAGATTGCACTTTCCAGCAAGCAAGCTGGCCAAGATGAAAAATGGACATACAGAATAG

Protein sequence

MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQMNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPRQHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFSERFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKEVEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRKKNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGNFCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTSQSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQFANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSVLCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGCNPINSVGFTTEMKQAMLNSHIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISHNLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLTEWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE
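
The protein sequence above is the conceptual translation of the coding sequence. As a rough sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies, the relationship can be checked with a small translator. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last nine codons of the CDS listed above.
print(translate("ATGAATTCTCAAGTTAACAGCAGTGGGGAT"))  # MNSQVNSSGD
print(translate("AAGATGAAAAATGGACATACAGAATAG"))     # KMKNGHTE
```

Both fragments reproduce the corresponding ends of the protein sequence (`MNSQVNSSGD...` and `...KMKNGHTE`), with the final `TAG` read as the stop codon.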
Homology
BLAST of Tan0014193 vs. ExPASy Swiss-Prot
Match: Q8LK56 (Transcriptional activator DEMETER OS=Arabidopsis thaliana OX=3702 GN=DME PE=1 SV=2)

HSP 1 Score: 1040.4 bits (2689), Expect = 2.4e-302
Identity = 758/1794 (42.25%), Positives = 982/1794 (54.74%), Query Frame = 0
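
The percentages in the HSP summary are the match counts divided by the alignment length (1794 columns, gaps included). A minimal sketch of that arithmetic, with the hypothetical helper `percent`:

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of alignment columns, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP summary line above.
print(percent(758, 1794))  # identical residues   -> 42.25
print(percent(982, 1794))  # positive-scoring pairs -> 54.74
```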

Query: 201  ENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKEVEKGSDQVI--DLNKTPEQK- 260
            E  VT    E  + + D+ ++ + D  S+A++A   T++ +     V+  DLNKTP+QK 
Sbjct: 240  EQIVTTTGHEIPEPKSDKSMQSIMD--SSAVNATEATEQNDGSRQDVLEFDLNKTPQQKP 299

Query: 261  TPRRRKHRPKVIKEGKPKKSP-KPV-TPKI-----PKETP----------SGKRKYVRKK 320
            + R+RK  PKV+ EGKPK+ P KP   PK+     PK  P          S +    +KK
Sbjct: 300  SKRKRKFMPKVVVEGKPKRKPRKPAELPKVVVEGKPKRKPRKAATQEKVKSKETGSAKKK 359

Query: 321  NIKEAAT-PPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 380
            N+KE+AT  PAN+ ++ + S     KSCR+ +NF++E  GD  Q    +E         +
Sbjct: 360  NLKESATKKPANVGDMSNKSPEVTLKSCRKALNFDLENPGDARQGDSESEIVQNSSGANS 419

Query: 381  FCSI------TRPNVPDFCTQ---SNGVCGTSPDVHIS-----HRLST--MVAENVRPTL 440
            F  I      T  +  D  +Q   +NG+   +  + +S      +LST   +A + +P L
Sbjct: 420  FSEIRDAIGGTNGSFLDSVSQIDKTNGLGAMNQPLEVSMGNQPDKLSTGAKLARDQQPDL 479

Query: 441  -------QSNLAHMNHMTTSLTSQS----EREAAGGPFN----KSAYNTAEDLLNVGR-- 500
                   Q  +A  N        Q+    + +  G PF     +      +  L +G   
Sbjct: 480  LTRNQQCQFPVATQNTQFPMENQQAWLQMKNQLIGFPFGNQQPRMTIRNQQPCLAMGNQQ 539

Query: 501  ---IIDQGK-----ADQYQNGFSNGYTPVQQHIRAEDMEQFANHAKRSTSFKELMGMNFE 560
               +I   +      +Q   G      P+           F NH     +  +L G   +
Sbjct: 540  PMYLIGTPRPALVSGNQQLGGPQGNKRPI-----------FLNHQTCLPAGNQLYGSPTD 599

Query: 561  YSQTIPN-----------HQSNINEARGSKRGRPL-TIQPTPSCSITTLNSSVL------ 620
              Q + +           +Q   +  RG +   PL   QP      T LN  V       
Sbjct: 600  MHQLVMSTGGQQHGLLIKNQQPGSLIRGQQPCVPLIDQQPATPKGFTHLNQMVATSMSSP 659

Query: 621  -----CQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATL---------YKRYTTIQA 680
                  Q  + T   H +  S  +       ++  +  Y +L         Y     I  
Sbjct: 660  GLRPHSQSQVPTTYLHVESVSRILNGTTGTCQRSRAPAYDSLQQDIHQGNKYILSHEISN 719

Query: 681  NEGCPSHLNTSGCNPINSVGFTTEMKQAMLNSHIRSNQSRDRQTNWTKETIG----DRHI 740
              GC   L  +   P   +    E + +    H    Q+     N  ++       +RH 
Sbjct: 720  GNGCKKALPQNSSLPTPIMAKLEEARGSKRQYHRAMGQTEKHDLNLAQQIAQSQDVERHN 779

Query: 741  HSVVHE------NNFQRRQISHNLH---PEIDRTCE--TTGLNKVTSYRSLITG------ 800
             S   E          ++ +  NLH   PE+    +  T G  K  +  S+  G      
Sbjct: 780  SSTCVEYLDAAKKTKIQKVVQENLHGMPPEVIEIEDDPTDGARKGKNTASISKGASKGNS 839

Query: 801  ---------DKCNVLR-PYPHLKASEQGYAYRQSDNSMLTIRQACQPM--ISGSLTTNQV 860
                     +KC V + P    +A  +      +  S + + Q   P   +S S    + 
Sbjct: 840  SPVKKTAEKEKCIVPKTPAKKGRAGRKKSVPPPAHASEIQLWQPTPPKTPLSRSKPKGKG 899

Query: 861  QKLGYSFGFHQFPDKTAGLLEN--EIIHKLKGLNLNDDKGTTRTEQNAIVPYKGNGAVVP 920
            +K     G  + P       ++  EII++++ L L D +     EQNA+V YKG+GA+VP
Sbjct: 900  RKSIQDSGKARGPSGELLCQDSIAEIIYRMQNLYLGDKE--REQEQNAMVLYKGDGALVP 959

Query: 921  YVESEYLRKRKARPRVDLDPETERIWNLLMGK-EGSEGIENHEKDKEKWWEEERKVFRGR 980
            Y ES   +KRK RP+VD+D ET RIWNLLMGK +  EG E  +K KEKWWEEER+VFRGR
Sbjct: 960  Y-ES---KKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGR 1019

Query: 981  ADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSTSN 1040
            ADSFIARMHLVQGDRRFS WKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP K +S+
Sbjct: 1020 ADSFIARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSS 1079

Query: 1041 IRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINHQNHR---VKSGT 1100
               + +V  S+V  +   C+L   +   W  +V      E+       +        SG 
Sbjct: 1080 REDERNVR-SVVVEDPEGCILNLNEIPSWQEKVQHPSDMEVSGVDSGSKEQLRDCSNSGI 1139

Query: 1101 E-FFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSYNSSSTHCS 1160
            E F F E   Q +EEEV+SSQDSFD  I Q  G   SCS S SDAE P       +T C 
Sbjct: 1140 ERFNFLEKSIQNLEEEVLSSQDSFDPAIFQSCGRVGSCSCSKSDAEFP-------TTRCE 1199

Query: 1161 NFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDG---KQDSLTEWNEIDNL-- 1220
              T      T+ S+Q    +L+   +  + +E  H     G   KQ++     +  +L  
Sbjct: 1200 TKT---VSGTSQSVQTGSPNLS-DEICLQGNERPHLYEGSGDVQKQETTNVAQKKPDLEK 1259

Query: 1221 --NGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSIVS 1280
              N    + F     + + Q   + S  Q   T    VL++E      E    S  SI  
Sbjct: 1260 TMNWKDSVCFGQPRNDTNWQTTPSSSYEQC-ATRQPHVLDIEDFGMQGEGLGYSWMSISP 1319

Query: 1281 GCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSES-ISEH---SVHLQ 1340
                 KN                + +G+ I         + +  S S + EH   + H Q
Sbjct: 1320 RVDRVKNKNVPRRFFRQGGSVPREFTGQIIPSTPHELPGMGLSGSSSAVQEHQDDTQHNQ 1379

Query: 1341 GNGIQLGSHCEYRLHD---NYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVN 1400
             + +   SH +    D   + E C   ++S     ++T+     D  A+     LSN   
Sbjct: 1380 QDEMNKASHLQKTFLDLLNSSEECLTRQSS--TKQNITDGCLPRDRTAEDVVDPLSN--- 1439

Query: 1401 ATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILR 1460
              +  + +L     + N SN E  ++   +    ++   K     +   KK    WD LR
Sbjct: 1440 -NSSLQNILV----ESNSSNKEQTAVEYKETNATILREMKG---TLADGKKPTSQWDSLR 1499

Query: 1461 KQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKD 1520
            K VE N   +E+ K+ MDSIDYEAIR A++ EIS AIKERGMNNMLA RIK+FL R+VKD
Sbjct: 1500 KDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERIVKD 1559

Query: 1521 HGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGW 1580
            HG IDLEWLR+ PPDKAKDYLLS+RGLGLKSVECVRLLTLH+LAFPVDTNVGRIAVR+GW
Sbjct: 1560 HGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGW 1619

Query: 1581 VPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKP 1640
            VPLQPLPESLQLHLLELYPVLESIQK+LWPRLCKLDQRTLYELHYQLITFGKVFCTKS+P
Sbjct: 1620 VPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTKSRP 1679

Query: 1641 NCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPI-LPPE 1700
            NCNACPMRGEC+HFASA+ASARLALPAP+E+ + ++T PV  E  P V    + + LP E
Sbjct: 1680 NCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIELPLPLE 1739

Query: 1701 GSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFY-EDPDEIPTIKLSMEEF 1760
             S  + +      CEPI+E PA+  P  E  EITESDIED++Y EDPDEIPTIKL++E+F
Sbjct: 1740 KSLASGAPSNRENCEPIIEEPAS--PGQECTEITESDIEDAYYNEDPDEIPTIKLNIEQF 1799

Query: 1761 KTTLQNY------IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLL 1820
              TL+ +      + EGDMS+ALVAL+    SIPTPKLKN+SRLRTEHQVYELPDSH LL
Sbjct: 1800 GMTLREHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHRLL 1859

Query: 1821 KELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANS 1839
              +D+REPDDPSPYLLAIWTPGETANS QPPEQ CG +  G++C ++TC  CNS REANS
Sbjct: 1860 DGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREANS 1919

BLAST of Tan0014193 vs. ExPASy Swiss-Prot
Match: C7IW64 (Protein ROS1A OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1A PE=1 SV=2)

HSP 1 Score: 941.0 bits (2431), Expect = 2.0e-272
Identity = 568/1151 (49.35%), Positives = 717/1151 (62.29%), Query Frame = 0

Query: 757  IIHKLKGLNLNDDKGTTRTE-QNAIVPYKGN-GAVVPYVESEYLRKRKARPRVDLDPETE 816
            +I K+K L++N  +     E   A+VPY G  G +VP+ E +  RKR +R +VDLDP T 
Sbjct: 817  VIQKIKVLDINKSEDPVTAEPHGALVPYNGEFGPIVPF-EGKVKRKR-SRAKVDLDPVTA 876

Query: 817  RIWNLLMGKEGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSV 876
             +W LLMG + S+  E  +KDKEKW  EERK+F+GR DSFIARMHLVQGDRRFS WKGSV
Sbjct: 877  LMWKLLMGPDMSDCAEGMDKDKEKWLNEERKIFQGRVDSFIARMHLVQGDRRFSPWKGSV 936

Query: 877  VDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPA 936
            VDSV+GVFLTQNVSDHLSSSAFM+LAA+FP+K  ++ +    +  ++  +E+  C     
Sbjct: 937  VDSVVGVFLTQNVSDHLSSSAFMALAAKFPVKPEASEKPANVMFHTI--SENGDCSGLFG 996

Query: 937  DSIRWDSQVLSLPRFEMPQTSINHQNHRVKSGTEFFFTEVGSQI---------VEEEVIS 996
            +S++   ++L         + I  ++    +  E   +  G  +         + E + +
Sbjct: 997  NSVKLQGEILVQEASNTAASFITTEDKEGSNSVELLGSSFGDGVDGAAGVYSNIYENLPA 1056

Query: 997  SQDSFDSTITQGTGGA-----RSCSGSNSDAEEPIVSYNSS-----------STHCSNFT 1056
               +    + Q TG A      S  G  S     I S NSS           S+   NFT
Sbjct: 1057 RLHATRRPVVQ-TGNAVEAEDGSLEGVVSSENSTISSQNSSDYLFHMSDHMFSSMLLNFT 1116

Query: 1057 --DIKQMETTTSLQKSFSDL-------NRSSVFDEVSEHKHWQLSDGKQDSLTEWNEIDN 1116
              DI       + + ++++L       N+S+   E SE+    +S          N I  
Sbjct: 1117 AEDIGSRNMPKATRTTYTELLRMQELKNKSNETIESSEYHGVPVSCS--------NNIQV 1176

Query: 1117 LNGHSLINFLVNIENQHKQVPVAPS---NNQLHMTPDCRVLEVE-----GREAFSEESI- 1176
            LNG      + NI ++H+ +  + S     Q+H+       ++E     G     + ++ 
Sbjct: 1177 LNG------IQNIGSKHQPLHSSISYHQTGQVHLPDIVHASDLEQSVYTGLNRVLDSNVT 1236

Query: 1177 ------SSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGR-----QARSQERIR 1236
                  S  P I     T+K  +  ++  G           E   R     Q  S E++ 
Sbjct: 1237 QTSYYPSPHPGIACNNETQKADSLSNMLYGIDRSDKTTSLSEPTPRIDNCFQPLSSEKMS 1296

Query: 1237 MEHSESISEHSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPA 1296
                +S SE+  +L  N  +  +  +     N +     +T      +  +   + D   
Sbjct: 1297 FAREQSSSEN--YLSRNEAE-AAFVKQHGTSNVQGDNTVRTEQNGGENSQSGYSQQDDNV 1356

Query: 1297 KMQQSALSNVV------NATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAK 1356
              Q +  SN+       N   ++E L   + N I  S ++  +  +   +G     SKAK
Sbjct: 1357 GFQTATTSNLYSSNLCQNQKANSEVLHGVSSNLIENSKDDKKTSPKVPVDG-----SKAK 1416

Query: 1357 RRKVNSEKKSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGM 1416
            R +V + KK   DWD+LRK+V  +   KE+ ++A DSID+E IR A V EIS  I+ERGM
Sbjct: 1417 RPRVGAGKKKTYDWDMLRKEVLYSHGNKERSQNAKDSIDWETIRQAEVKEISDTIRERGM 1476

Query: 1417 NNMLAERIKEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHH 1476
            NNMLAERIK+FLNRLV+DHGSIDLEWLR V  DKAKDYLLS+RGLGLKSVECVRLLTLHH
Sbjct: 1477 NNMLAERIKDFLNRLVRDHGSIDLEWLRYVDSDKAKDYLLSIRGLGLKSVECVRLLTLHH 1536

Query: 1477 LAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYE 1536
            +AFPVDTNVGRI VRLGWVPLQPLPESLQLHLLE+YP+LE+IQKYLWPRLCKLDQRTLYE
Sbjct: 1537 MAFPVDTNVGRICVRLGWVPLQPLPESLQLHLLEMYPMLENIQKYLWPRLCKLDQRTLYE 1596

Query: 1537 LHYQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAM 1596
            LHYQ+ITFGKVFCTKSKPNCNACPMR ECKHFASAFASARLALP P+EK +VTS  P+A 
Sbjct: 1597 LHYQMITFGKVFCTKSKPNCNACPMRAECKHFASAFASARLALPGPEEKSLVTSGTPIAA 1656

Query: 1597 EKQPAVVSNPLPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFY 1656
            E       +  P++            G +  +PI+E PA+PEPE E  E+ E  IEDSF 
Sbjct: 1657 ETFHQTYISSRPVVSQLEWNSNTCHHGMNNRQPIIEEPASPEPEHETEEMKECAIEDSFV 1716

Query: 1657 EDPDEIPTIKLSMEEFKTTLQNY-------IPEGDMSRALVALNQEAASIPTPKLKNVSR 1716
            +DP+EIPTIKL+ EEF   L++Y       I + DMS+ALVA+  E ASIPTPKLKNVSR
Sbjct: 1717 DDPEEIPTIKLNFEEFTQNLKSYMQANNIEIEDADMSKALVAITPEVASIPTPKLKNVSR 1776

Query: 1717 LRTEHQVYELPDSHPLLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRL 1776
            LRTEHQVYELPDSHPLL+  ++REPDDP PYLL+IWTPGETA S   P+  C SQ+ G L
Sbjct: 1777 LRTEHQVYELPDSHPLLEGFNQREPDDPCPYLLSIWTPGETAQSTDAPKSVCNSQENGEL 1836

Query: 1777 CNEKTCFTCNSRREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPID 1836
            C   TCF+CNS REA +Q VRGTLLIPCRTAMRGSFPLNGTYFQVNE+FADH+SS NPID
Sbjct: 1837 CASNTCFSCNSIREAQAQKVRGTLLIPCRTAMRGSFPLNGTYFQVNEVFADHDSSRNPID 1896

Query: 1837 VPRKWLWNLPRRTVYFGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARL 1839
            VPR W+WNLPRRTVYFGTS+ TIFKGL TEEIQ CFWRGFVCVRGF+R +RAPRPL ARL
Sbjct: 1897 VPRSWIWNLPRRTVYFGTSIPTIFKGLTTEEIQHCFWRGFVCVRGFDRTSRAPRPLYARL 1940

BLAST of Tan0014193 vs. ExPASy Swiss-Prot
Match: Q9SJQ6 (DNA glycosylase/AP lyase ROS1 OS=Arabidopsis thaliana OX=3702 GN=ROS1 PE=1 SV=2)

HSP 1 Score: 881.7 bits (2277), Expect = 1.4e-254
Identity = 542/1125 (48.18%), Positives = 679/1125 (60.36%), Query Frame = 0

Query: 744  QFPDKTAGLLENE----------IIHKLKGLNLNDDKGTTRTEQNAIVPYK--------- 803
            +FP    GL  +E          I   L+ L++N +   T     A+VPY          
Sbjct: 454  RFPPSFTGLSPDELWKRRNSIETISELLRLLDINREHSET-----ALVPYTMNSQIVLFG 513

Query: 804  -GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEKWWEEE 863
             G GA+VP      ++K + RP+VDLD ET+R+W LL+    SEG++  ++ K KWWEEE
Sbjct: 514  GGAGAIVPVTP---VKKPRPRPKVDLDDETDRVWKLLLENINSEGVDGSDEQKAKWWEEE 573

Query: 864  RKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 923
            R VFRGRADSFIARMHLVQGDRRF+ WKGSVVDSV+GVFLTQNVSDHLSSSAFMSLA++F
Sbjct: 574  RNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLASQF 633

Query: 924  PLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINHQNHRV 983
            P+                              S  +D+   S+P  ++            
Sbjct: 634  PVPF--------------------------VPSSNFDAGTSSMPSIQI------------ 693

Query: 984  KSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSYNSSST 1043
                    T + S   EE + S  D   S++T           +  D E+  V  N +S 
Sbjct: 694  --------TYLDS---EETMSSPPDHNHSSVT--------LKNTQPDEEKDYVPSNETSR 753

Query: 1044 HCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLTEWNEIDNLNG 1103
              S        E   S  +S          D+ ++ K +  SD K  S+    E+D  + 
Sbjct: 754  SSS--------EIAISAHES---------VDKTTDSKEYVDSDRKGSSV----EVDKTD- 813

Query: 1104 HSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSIVSGCST 1163
                                           CRVL +     F  E              
Sbjct: 814  -----------------------------EKCRVLNL-----FPSE-------------- 873

Query: 1164 EKNMTC-HSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGNGIQLG 1223
            +  +TC HS+    P+ T           +A S   I +E  E  +     LQG  + L 
Sbjct: 874  DSALTCQHSMVSDAPQNT----------ERAGSSSEIDLE-GEYRTSFMKLLQGVQVSLE 933

Query: 1224 SHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHTEKLLP 1283
                          + N+ SP  + S  + S E+     M++   S+V            
Sbjct: 934  --------------DSNQVSP--NMSPGDCSSEIKGFQSMKEPTKSSV------------ 993

Query: 1284 GNDNQINFSNNEVHSLSQADNEGNVVS----TSKAKRRKVNSEKKSAVDWDILRKQVEAN 1343
                     ++E    SQ D  G+V+S    T K K +KV  E+K A DWD LR++ +A 
Sbjct: 994  --------DSSEPGCCSQQD--GDVLSCQKPTLKEKGKKVLKEEKKAFDWDCLRREAQAR 1053

Query: 1344 GQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSIDL 1403
              I+EK +  MD++D++AIR A+V E++  IK RGMN+ LAERI+ FL+RLV DHGSIDL
Sbjct: 1054 AGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFLDRLVNDHGSIDL 1113

Query: 1404 EWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPL 1463
            EWLRDVPPDKAK+YLLS  GLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPL
Sbjct: 1114 EWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPL 1173

Query: 1464 PESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACP 1523
            PESLQLHLLE+YP+LESIQKYLWPRLCKLDQ+TLYELHYQ+ITFGKVFCTKSKPNCNACP
Sbjct: 1174 PESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACP 1233

Query: 1524 MRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGS---TY 1583
            M+GEC+HFASAFASARLALP+  EK + T       +K P  +  P P    +GS    +
Sbjct: 1234 MKGECRHFASAFASARLALPS-TEKGMGTP------DKNPLPLHLPEPFQREQGSEVVQH 1293

Query: 1584 TESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQ 1643
            +E     + CEPI+E PA+PEPE    E++ +DIE++F+EDP+EIPTI+L+M+ F + L+
Sbjct: 1294 SEPAKKVTCCEPIIEEPASPEPETA--EVSIADIEEAFFEDPEEIPTIRLNMDAFTSNLK 1353

Query: 1644 NY------IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDR 1703
                    + +G+MS ALVAL  E AS+P PKLKN+S+LRTEH+VYELPD HPLL +L++
Sbjct: 1354 KIMEHNKELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVYELPDEHPLLAQLEK 1385

Query: 1704 REPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRG 1763
            REPDDP  YLLAIWTPGETA+SIQP   +C  Q  G LC+E+TCF+CNS +E  SQ VRG
Sbjct: 1414 REPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFSCNSIKETRSQIVRG 1385

Query: 1764 TLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVST 1823
            T+LIPCRTAMRGSFPLNGTYFQVNE+FADH SS NPI+VPR+ +W LPRRTVYFGTSV T
Sbjct: 1474 TILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWELPRRTVYFGTSVPT 1385

Query: 1824 IFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKL 1835
            IFKGL TE+IQ CFW+G+VCVRGF+RKTR P+PLIARLHFPASKL
Sbjct: 1534 IFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASKL 1385


HSP 2 Score: 58.5 bits (140), Expect = 9.0e-07
Identity = 63/181 (34.81%), Positives = 98/181 (54.14%), Query Frame = 0

Query: 223 LTDTLSAAISAPTP-----TKEVEKGSDQVIDLN-----------KTPEQKTPRRRKHRP 282
           L +T S   S  TP     T+ ++KG+++V  L+           KTPE+  P+R+KHRP
Sbjct: 67  LANTASLIFSGQTPIPTRNTEVMQKGTEEVESLSSVSNNVAEQILKTPEK--PKRKKHRP 126

Query: 283 KVIKEGKPKKSPKPVTPKIP-----KETPSGKRKYVRKK---NIKEAATPPANIVEIKDS 342
           KV +E KPK+ PKP  P+       +E+ + KRKYVRKK   +  + ATP  +   + ++
Sbjct: 127 KVRREAKPKREPKPRAPRKSVVTDGQESKTPKRKYVRKKVEVSKDQDATPVESSAAV-ET 186

Query: 343 STATKTKSCRRVINFEME----KTGDEEQEKKHNEKDVQEENM--GN----FCSITRPNV 370
           ST  K + CRRV++FE E    +T  + +E    E  +QE+ +  GN     C ++ P+ 
Sbjct: 187 STRPK-RLCRRVLDFEAENGENQTNGDIREAGEMESALQEKQLDSGNQELKDCLLSAPST 243

BLAST of Tan0014193 vs. ExPASy Swiss-Prot
Match: B8YIE8 (Protein ROS1C OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1C PE=2 SV=2)

HSP 1 Score: 850.1 bits (2195), Expect = 4.6e-245
Identity = 650/1699 (38.26%), Positives = 878/1699 (51.68%), Query Frame = 0

Query: 222  DLTDTLSAAISAPTPTKEVEKGSDQVIDLNKT--------PEQK-TPRRRKHRPKVIKEG 281
            D  D  + A+SA    K + +   Q+ D  +T        P QK   RRRKHRPKVI+E 
Sbjct: 317  DKIDLPTQAVSA-CKEKTITQIEMQIADAERTEALKGEDAPAQKLKTRRRKHRPKVIRED 376

Query: 282  KPKKSPKPVTPKIPKETPSGKRKYVRK----KNIKEAATPPANIVEIKDSSTATKTK--S 341
            +P K     T +        KRKYVRK     ++++ A P ++    ++S T  ++   S
Sbjct: 377  RPAKKQMATTSEEKPLNQKPKRKYVRKNRNPSSLEKCAEPFSDHSISRESRTTVRSSIAS 436

Query: 342  CRRVINFEMEKTGDEEQEK-------KHNEKDVQEENMGNFCSITRPNVPDFCTQSNGVC 401
             RR + FE  + G +  +        ++ EK V  E+  + CS+T+ +V     Q   + 
Sbjct: 437  VRRRLQFEFGEHGVQRDQSSMTNSWYQNQEKPVNAES--SLCSVTKSSVQVEHGQELHM- 496

Query: 402  GTSPD---VHISHRLSTMVAENV-------RPTLQSNLAHMNHMTTSLTSQSEREAAGGP 461
              SP+     I+ +L+ ++ E +       +P+ Q  LA   H++  L  +         
Sbjct: 497  ENSPEGLFFGINSKLNKILDEYIHLPEAAPKPSEQIPLAASGHVSEELARKQYDVRHTHD 556

Query: 462  FNKSAYNTAEDLLNVGRIIDQGKADQYQNGFS--NGYTPVQQHIRAEDMEQFANHAKRST 521
             + ++YN        G I  +G        +S  NG+          +M+       + +
Sbjct: 557  PDSTSYNIERS----GLITTKGHKKDLDLNYSNTNGFQMYCSASLLPEMDSTKGSMTKVS 616

Query: 522  SFKELMGMNFEYSQTIPNHQSNI-----------NEARGSKRGRPLTIQPTPSCSITTLN 581
               +    ++    ++   QS+I            +A G K+ R   ++     S+  L 
Sbjct: 617  KMDKNKKRHYGGESSLAGTQSSIIMRTAAEMLAVYQACGIKKKRSARVRRNSFLSVMDLE 676

Query: 582  SSVLCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNT 641
             +        + ES R           +P    E+ LY + Y ++ T + ++        
Sbjct: 677  KNT-------SQESTR-----------LPRSCMEA-LYESSYIKFMTKKRSQ-------- 736

Query: 642  SGCNPINSVGFTTEMKQAMLNSHIRSNQSRDRQTNWTKETIGDRHIHSVVH-ENNFQRRQ 701
                            +A LNS      + D++  ++ ET+     + +   E  FQ+  
Sbjct: 737  ----------------KARLNSPNSIQPNIDQKNRFSSETVFSGGFNGLKRSEETFQKTL 796

Query: 702  ISHNLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLT 761
                  P+I                                             D+  + 
Sbjct: 797  ------PQI--------------------------------------------PDDKRIN 856

Query: 762  IRQACQPMISGSLTTNQVQKLGYSFG----FHQFPDKTAGLLENEIIHKLKGLNLNDDKG 821
            +   C+  +  S  T+    + Y  G    F  F   T  + + E +H  + +      G
Sbjct: 857  LDIHCKVPVESSPNTSTPPYMDYLQGVTSKFRYFDLNTEQVHKTE-MHLSQTMPSLSSLG 916

Query: 822  TTRTEQNAIVPYKGNGAVVPY-VESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIE 881
             T    NA+VPY G GAVVPY  +   ++K++ R +VDLD ET R+WNLLMGK  ++ ++
Sbjct: 917  ATNYLPNALVPYVG-GAVVPYQTQFHLVKKQRPRAKVDLDFETTRVWNLLMGK-AADPVD 976

Query: 882  NHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDH 941
              + DKE+WW++ER+VF+GRA+SFIARM LVQGDRRFS WKGSVVDSV+GVFLTQNV+DH
Sbjct: 977  GTDVDKERWWKQEREVFQGRANSFIARMRLVQGDRRFSPWKGSVVDSVVGVFLTQNVADH 1036

Query: 942  LSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFE 1001
            LSSSA+M+LAA FP  S       G+    +   ++        + I   S V     FE
Sbjct: 1037 LSSSAYMALAASFPTGS------HGNCNDGIAGQDN--------EEIISTSAVGDRGTFE 1096

Query: 1002 MPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSC---SGS 1061
                   +   R   G  F F+    +I  E      ++  + +T+G   +  C   +GS
Sbjct: 1097 -----FFYNGSRPDIGLNFEFSMACEKIHME---PKDNTTVNELTKGENYSLHCKESAGS 1156

Query: 1062 NSDAEEPIVSYNSSSTHCS--NFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLS 1121
              D E  I     S +  S    T   +    T  QK  S L++S V  E        LS
Sbjct: 1157 LCDHETEIDHKAKSISDFSAVELTACMKNLHATQFQKEIS-LSQSVVTSESILQPGLPLS 1216

Query: 1122 DGKQDSLTEWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPD---CRVLEVEG 1181
             G                H+  NF+ +I +   Q   +  ++   +T +       E  G
Sbjct: 1217 SGMD--------------HARRNFVGSISDTASQQVGSNFDDGKSLTGNDVTANETEYHG 1276

Query: 1182 -REAFSEESISSGPSIVSGCSTE---KNMTCHSLNIGDPERTLDKISGEEIGRQARSQER 1241
             + A +   +   P I SG S       + CH L+ G  +  +   S       A S  +
Sbjct: 1277 IKAAATNNYVVDEPGIPSGSSLYPFFSAIDCHQLD-GRNDTHVSSTSPNCSICSASSNFK 1336

Query: 1242 IRM--EHSESISEHSVHL-QGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPE 1301
            I    E+S        HL Q NG  +               + N +S +EST        
Sbjct: 1337 IGTIEENSSLFMPFDAHLAQRNGNMI--------------VDTNLSSALEST-------- 1396

Query: 1302 LDAPAKMQQSALSNVVNATTH--------TEKLLPGNDNQINFSNNEVHSLSQADNEGNV 1361
             + P K+      +   A+          T  ++P    + + S  +    S        
Sbjct: 1397 -ELPVKLLHCGKRSCYEASEFQDHESLYATGGVIPETATKADDSTLKSGFASFNGLPDTA 1456

Query: 1362 VSTSKAKRRKVNSEKKSA-VDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEIS 1421
               SK K+ +  S+K S   DWD LR+Q   N Q+KE+  D  DS+D+EA+R A+V  IS
Sbjct: 1457 AQASKPKKSRTTSKKNSENFDWDKLRRQACGNYQMKERIFDRRDSVDWEAVRCADVQRIS 1516

Query: 1422 SAIKERGMNNMLAERIKEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVEC 1481
             AI+ERGMNN+LAERI++FLNRLV DHGSIDLEWLRDVPPD AKDYLLS+RGLGLKSVEC
Sbjct: 1517 HAIRERGMNNVLAERIQKFLNRLVTDHGSIDLEWLRDVPPDSAKDYLLSIRGLGLKSVEC 1576

Query: 1482 VRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCK 1541
            VRLLTLHHLAFPVDTNVGRI VRLGWVP+QPLPESLQLHLLELYPVLE+IQKYLWPRLCK
Sbjct: 1577 VRLLTLHHLAFPVDTNVGRICVRLGWVPIQPLPESLQLHLLELYPVLETIQKYLWPRLCK 1636

Query: 1542 LDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKRIV 1601
            LDQ+TLYELHYQ+ITFGKVFCTKSKPNCNACPMR EC+HFASAFASARLALP+P +KR+V
Sbjct: 1637 LDQQTLYELHYQMITFGKVFCTKSKPNCNACPMRSECRHFASAFASARLALPSPQDKRLV 1696

Query: 1602 TSTNPVAMEKQPAVVSNPLPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITE 1661
              +N  A         N  P+   EGS +    +  +   PI+E PA+P  E E  E+ E
Sbjct: 1697 NLSNQFAFHNGTMPTPNSTPLPQLEGSIHARD-VHANNTNPIIEEPASPR-EEECRELLE 1756

Query: 1662 SDIEDSFYEDPDEIPTIKLSMEEFKTTLQNYIPEG-------DMSRALVALNQEAASIPT 1721
            +DIED F ED DEIP IKL+ME F   L+N I E        D+++ALVA++ EAASIP 
Sbjct: 1757 NDIED-FDEDTDEIPIIKLNMEAFSQNLENCIKESNKDFQSDDITKALVAISNEAASIPV 1816

Query: 1722 PKLKNVSRLRTEHQVYELPDSHPLLKE--LDRREPDDPSPYLLAIWTPGETANSIQPPEQ 1781
            PKLKNV RLRTEH VYELPDSHPL+++  LD+REPDDPSPYLLAIWTP E  ++ + P+ 
Sbjct: 1817 PKLKNVHRLRTEHYVYELPDSHPLMQQLALDQREPDDPSPYLLAIWTPDELKDTREAPKP 1847

Query: 1782 SCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFA 1837
             C  Q  G LC+ + C  C S RE   + VRGT+L+PCRTAMRGSFPLNGTYFQVNE+FA
Sbjct: 1877 CCNPQTEGGLCSNEMCHNCVSERENQYRYVRGTVLVPCRTAMRGSFPLNGTYFQVNEVFA 1847

BLAST of Tan0014193 vs. ExPASy Swiss-Prot
Match: Q9SR66 (DEMETER-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=DML2 PE=3 SV=2)

HSP 1 Score: 719.9 bits (1857), Expect = 7.2e-206
Identity = 456/1077 (42.34%), Positives = 585/1077 (54.32%), Query Frame = 0

Query: 776  EQNAIVPYKGNGAVVPYVESEYLRK------RKARPRVDLDPETERIWNLLMGKEGSEGI 835
            ++   +P+    A++ Y +S   +K      +K +P+V LDPET R+W LLM     +G+
Sbjct: 461  KEGLCLPHNRETALILYKKSYEEQKAIVKYSKKQKPKVQLDPETSRVWKLLMSSIDCDGV 520

Query: 836  ENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSD 895
            +  +++K KWWEEER +F GRA+SFIARM +VQG+R FS WKGSVVDSV+GVFLTQNV+D
Sbjct: 521  DGSDEEKRKWWEEERNMFHGRANSFIARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVAD 580

Query: 896  HLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRF 955
            H SSSA+M LAA FP++   N  +  +   S V  E+                +L+L   
Sbjct: 581  HSSSSAYMDLAAEFPVEWNFNKGSCHEEWGSSVTQET----------------ILNLD-- 640

Query: 956  EMPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNS 1015
              P+T ++    R+++ T         +++ EE+   ++  D+                 
Sbjct: 641  --PRTGVS--TPRIRNPT---------RVIIEEIDDDENDIDA----------------- 700

Query: 1016 DAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQ 1075
                 + S  SS T  S+ T   Q  + T L   F+ +  +   D        Q+  GK 
Sbjct: 701  -----VCSQESSKTSDSSITSADQ--SKTMLLDPFNTVLMNEQVDS-------QMVKGK- 760

Query: 1076 DSLTEWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEE 1135
                         GH               +P     N L                    
Sbjct: 761  -------------GH---------------IPYTDDLNDL-------------------- 820

Query: 1136 SISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISE 1195
              S G S+VS  ST                                              
Sbjct: 821  --SQGISMVSSAST---------------------------------------------- 880

Query: 1196 HSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSN 1255
                          HCE  L         N+  P          PE     + QQ     
Sbjct: 881  --------------HCELNL---------NEVPPEVELCSHQQDPESTIQTQDQQE---- 940

Query: 1256 VVNATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEK---KSAV 1315
                +T TE +                            +TSK K++   S K   K +V
Sbjct: 941  ----STRTEDVKKNRKKP---------------------TTSKPKKKSKESAKSTQKKSV 1000

Query: 1316 DWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFL 1375
            DWD LRK+ E+ G+ +E+ +  MD++D++A+R  +VH+I++ I +RGMNNMLAERIK FL
Sbjct: 1001 DWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAFL 1060

Query: 1376 NRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1435
            NRLVK HGSIDLEWLRDVPPDKAK+YLLS+ GLGLKSVECVRLL+LH +AFPVDTNVGRI
Sbjct: 1061 NRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGRI 1120

Query: 1436 AVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVF 1495
            AVRLGWVPLQPLP+ LQ+HLLELYPVLES+QKYLWPRLCKLDQ+TLYELHY +ITFGKVF
Sbjct: 1121 AVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKVF 1180

Query: 1496 CTKSKPNCNACPMRGECKHFASAFASARLALPAPDEK-RIVTSTNPVAMEKQPAVVS-NP 1555
            CTK KPNCNACPM+ EC+H++SA ASARLALP P+E  R     +    +++P VV+  P
Sbjct: 1181 CTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFRP 1240

Query: 1556 LPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIED------------S 1615
               L  E     +    +  CEPI+E PA+PEP     E  E DIED             
Sbjct: 1241 SLFLYQEKE---QEAQRSQNCEPIIEEPASPEP-----EYIEHDIEDYPRDKNNVGTSED 1300

Query: 1616 FYEDPDEIPTIKLSMEEFKTTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEH 1675
             +E+ D IPTI L+ E   +       E   S  LV L+  AA+IP  KLK   +LRTEH
Sbjct: 1301 PWENKDVIPTIILNKEAGTSHDLVVNKEAGTSHDLVVLSTYAAAIPRRKLKIKEKLRTEH 1318

Query: 1676 QVYELPDSHPLLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCG-SQDPGRLCNEK 1735
             V+ELPD H +L+  +RRE +D  PYLLAIWTPGET NSIQPP+Q C   +    LCNE 
Sbjct: 1361 HVFELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNEN 1318

Query: 1736 TCFTCNSRREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRK 1795
             CF CN  RE  SQTVRGT+LIPCRTAMRG FPLNGTYFQ NE+FADH+SS NPIDVP +
Sbjct: 1421 KCFQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTE 1318

Query: 1796 WLWNLPRRTVYFGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLH 1829
             +W+L RR  Y G+SVS+I KGL  E I+  F  G+VCVRGF+R+ R P+ L+ RLH
Sbjct: 1481 LIWDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLH 1318

BLAST of Tan0014193 vs. NCBI nr
Match: XP_011655842.1 (protein ROS1A isoform X1 [Cucumis sativus] >XP_011655844.1 protein ROS1A isoform X1 [Cucumis sativus] >KGN52209.1 hypothetical protein Csa_008069 [Cucumis sativus])

HSP 1 Score: 3036.9 bits (7872), Expect = 0.0e+00
Identity = 1559/1861 (83.77%), Positives = 1662/1861 (89.31%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQVNSSGDFYAGNLL+RNQN+Y  SRPS+NNS+AQHV  YGLPM+QPNYNLNP SMTQ
Sbjct: 1    MNSQVNSSGDFYAGNLLLRNQNIYSGSRPSTNNSFAQHVLTYGLPMFQPNYNLNPVSMTQ 60

Query: 61   MNQMSIFTNSIH-PPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPP 120
             NQ  IFTNS+H  PPVSS++E+ A++ +STPSFLVRDESS FR++  DDFIRMFQDE P
Sbjct: 61   TNQ--IFTNSVHTTPPVSSNVESVAYNQVSTPSFLVRDESSCFRKNA-DDFIRMFQDEAP 120

Query: 121  RQHCDELLQSI--------------VESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQR 180
            RQHCDELLQSI              VESSCVGNSTPFK T DF KQ+DLEIDLNRTPEQR
Sbjct: 121  RQHCDELLQSIVESSCVGNSTPFKGVESSCVGNSTPFKGTKDFVKQKDLEIDLNRTPEQR 180

Query: 181  PPKRRQHTPMVFS-ERFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLT 240
            PPKRRQHTP VFS ERFTDLLNLPL  +LSLYEETQENFVTVPLDEATQKRHDELLKDLT
Sbjct: 181  PPKRRQHTPTVFSGERFTDLLNLPLDGNLSLYEETQENFVTVPLDEATQKRHDELLKDLT 240

Query: 241  DTLSAAISAPTPTKEVEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPK 300
            DTLSAAIS   PTKEVEKGSDQ IDLNKTPEQKTP+RRKHRPKVIKEGKPKKSPKPVTPK
Sbjct: 241  DTLSAAIS--EPTKEVEKGSDQAIDLNKTPEQKTPKRRKHRPKVIKEGKPKKSPKPVTPK 300

Query: 301  IPKETPSGKRKYVRKKNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQE 360
            I KETPSGKRKYVRKKNIKEA TPPAN+VEIKDS+TATKTKSCRRVI+FEMEKTGDEEQE
Sbjct: 301  ISKETPSGKRKYVRKKNIKEATTPPANVVEIKDSNTATKTKSCRRVIHFEMEKTGDEEQE 360

Query: 361  KKHNEKDVQEENMGNFCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQ 420
            KK NEKDV EENMGNFC +TRPNVPDFC+QS  VCGTS DVH S +L  MVAENVRPT+ 
Sbjct: 361  KKQNEKDVSEENMGNFCFMTRPNVPDFCSQSTSVCGTSQDVHDSTQLGPMVAENVRPTIP 420

Query: 421  SNLAHMNHMTTSLTSQSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGY 480
            SN  HMNHMTTS   QSEREAA  P NKS YN AE+ LNV RI+ QG+A+QYQ GFSNGY
Sbjct: 421  SNPTHMNHMTTSHILQSEREAAEVPLNKSGYNKAENWLNVLRILHQGRANQYQTGFSNGY 480

Query: 481  TPVQQHIRAEDMEQFANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTI 540
             PVQQ+I AEDM+QFAN AKR+T +KE+MG+N  Y QT+PNHQSNINEARGSKRGRPLT 
Sbjct: 481  APVQQNICAEDMQQFANQAKRNTYYKEVMGINSGYCQTVPNHQSNINEARGSKRGRPLTT 540

Query: 541  QPTPSCSITTLNSSVLCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTI 600
             PT  CSITTL+SS+ CQEV Q GE  RQGS++NIGPLE P KKFESGLYATL+KRY+TI
Sbjct: 541  YPTQPCSITTLDSSMTCQEVRQIGEFQRQGSNINIGPLENPGKKFESGLYATLHKRYSTI 600

Query: 601  QANEGCPSHLNTSGCNPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIH 660
            Q+NEGC SHLNT GCNP NSVGFT EMKQAMLN  HIRSNQ         KE IGDRHIH
Sbjct: 601  QSNEGCSSHLNTIGCNPTNSVGFTAEMKQAMLNGHHIRSNQIT------AKEIIGDRHIH 660

Query: 661  SVVHENNFQRRQISHNLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQG 720
            SVVHEN+FQR+Q+SHNLHP +DRT   +GLNKV SYRSL+TGDKCN+++P+PH KA EQG
Sbjct: 661  SVVHENHFQRQQVSHNLHPAVDRTSVASGLNKVASYRSLMTGDKCNMIQPFPHPKAPEQG 720

Query: 721  YAYRQSDNSMLTIRQACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKG 780
            YA RQSDNS+LT+RQA QPMISGSL TN+V K GYSFGF +FP KT  LLENEI+HK+K 
Sbjct: 721  YACRQSDNSILTVRQAYQPMISGSLATNEVHKQGYSFGFQKFPAKTTSLLENEILHKMKR 780

Query: 781  LNLNDDKGTTRTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGK 840
            L+LND + + R+EQNAIVPYKGNGAVVPYVESEYLRKRKARPRVD+DPETERIWNLLMGK
Sbjct: 781  LSLNDHEVSIRSEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDIDPETERIWNLLMGK 840

Query: 841  EGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL 900
            EGSEGIE+HEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL
Sbjct: 841  EGSEGIESHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL 900

Query: 901  TQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQV 960
            TQNVSDHLSSSAFMSLAARFP+KS SN+RTQG+VETS+VANESAAC+LYPA+SIRW  Q 
Sbjct: 901  TQNVSDHLSSSAFMSLAARFPVKSASNLRTQGEVETSIVANESAACVLYPAESIRWHVQE 960

Query: 961  LSLPRFEMPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARS 1020
            LS+PRFEMPQTSINHQN    SGTE  FTE+G QIVEEEVISSQDSFDSTITQGT GARS
Sbjct: 961  LSVPRFEMPQTSINHQNQIANSGTEKIFTELGGQIVEEEVISSQDSFDSTITQGTAGARS 1020

Query: 1021 CSGSNSDAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQ 1080
            CSGSNS+AEEPIVSYNSSSTH SNFTDIKQMETT ++QKSFSDLNRSSV DEVSEHKHWQ
Sbjct: 1021 CSGSNSEAEEPIVSYNSSSTHYSNFTDIKQMETTATIQKSFSDLNRSSVSDEVSEHKHWQ 1080

Query: 1081 LSDGKQDSLT-EWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEG 1140
            L DGKQ SLT EWNEIDNL+GHSLINFLVNIENQ KQVP APSNNQLH+TPDC VLEVEG
Sbjct: 1081 LPDGKQGSLTSEWNEIDNLSGHSLINFLVNIENQPKQVPDAPSNNQLHITPDCGVLEVEG 1140

Query: 1141 REAFSEESISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRME 1200
            REAFSEES SSGPSIVSGCSTEKNMT H LNIG  E+ LDK S E+   QARS E  RME
Sbjct: 1141 REAFSEESTSSGPSIVSGCSTEKNMTFHRLNIGALEQRLDKTSAED-NVQARSHETTRME 1200

Query: 1201 HSESISEHSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKM 1260
            HSES+SEHSVHLQGNGIQ  SHCEY LH  YEPCERN TSP+ES SVTNP PELD PA  
Sbjct: 1201 HSESVSEHSVHLQGNGIQFRSHCEYNLHGKYEPCERNNTSPVESVSVTNPPPELDTPA-- 1260

Query: 1261 QQSALSNVVNATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEK 1320
            ++SA+SNVV+   HTEKLLPG  N INFSNNE HSLSQA NEGN +S SKAKRRKVNSEK
Sbjct: 1261 EKSAVSNVVHVHAHTEKLLPGKGNLINFSNNEAHSLSQAHNEGN-ISPSKAKRRKVNSEK 1320

Query: 1321 KSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERI 1380
            K  +DWD LRKQVEANGQIKEKGKDAMDSIDYEAIRLA+V EIS+AIKERGMNNMLAERI
Sbjct: 1321 KGGMDWDSLRKQVEANGQIKEKGKDAMDSIDYEAIRLADVREISNAIKERGMNNMLAERI 1380

Query: 1381 KEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN 1440
            KEFLNRLV DHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN
Sbjct: 1381 KEFLNRLVTDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN 1440

Query: 1441 VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF 1500
            VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF
Sbjct: 1441 VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF 1500

Query: 1501 GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVS 1560
            GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEK IV STNP++ EKQP +V+
Sbjct: 1501 GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKGIVASTNPMSTEKQPPIVT 1560

Query: 1561 NPLPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPT 1620
            NPLPILPPEGSTY E+T G SKCEPIVEVPAT  PEPEPNEITESDIED+FYEDPDEIPT
Sbjct: 1561 NPLPILPPEGSTYAENTSGPSKCEPIVEVPAT--PEPEPNEITESDIEDAFYEDPDEIPT 1620

Query: 1621 IKLSMEEFKTTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHP 1680
            IKLSMEEFKTTLQ+YIPEGDMS+ALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHP
Sbjct: 1621 IKLSMEEFKTTLQHYIPEGDMSKALVALNPEAAFIPTPKLKNVSRLRTEHQVYELPDSHP 1680

Query: 1681 LLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREA 1740
            LL+E+DRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDP RLCNE TCFTCNSRREA
Sbjct: 1681 LLREMDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPNRLCNEITCFTCNSRREA 1740

Query: 1741 NSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVY 1800
            NSQTVRGTLL+PCRTAMRGSFPLNGTYFQVNEMFADHESS  PIDVPRKWLWNLPRRTVY
Sbjct: 1741 NSQTVRGTLLVPCRTAMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRKWLWNLPRRTVY 1800

Query: 1801 FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHT 1844
            FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNG T
Sbjct: 1801 FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGQT 1844

BLAST of Tan0014193 vs. NCBI nr
Match: KAG6601206.1 (Transcriptional activator DEMETER, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 3022.3 bits (7834), Expect = 0.0e+00
Identity = 1553/1846 (84.13%), Positives = 1652/1846 (89.49%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQ NSSGDFYAGNLL+RNQNLY  SRPSSN+SYAQHVR YGLPMYQPN+N NP SMTQ
Sbjct: 65   MNSQYNSSGDFYAGNLLLRNQNLYSGSRPSSNDSYAQHVRTYGLPMYQPNHN-NPVSMTQ 124

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMSIF NS H PPVSSHLENFA+D I+T SFLVRDESSSFR+DGEDDFIR+FQ E PR
Sbjct: 125  ANQMSIFMNSAHTPPVSSHLENFAYDHIATSSFLVRDESSSFRKDGEDDFIRIFQAEAPR 184

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            Q CDELLQSIVESSCVGNSTPFK T DFGKQRDLEIDLN+TPEQRPPKRRQHTP+VFS E
Sbjct: 185  QPCDELLQSIVESSCVGNSTPFKGTKDFGKQRDLEIDLNKTPEQRPPKRRQHTPLVFSGE 244

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKE 240
             FTDLLNLPL E+LSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSA IS   PT E
Sbjct: 245  SFTDLLNLPLDENLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAGIS--EPTNE 304

Query: 241  VEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRK 300
             EKGSDQVID +KT EQKTP+RRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR+
Sbjct: 305  AEKGSDQVID-HKTTEQKTPKRRKHRPKVIKEGKPKKSPKPVTPKISKETPSGKRKYVRR 364

Query: 301  KNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 360
            KNIKEA TPP NI+EIKDS+ A KTKSCRRVINFEMEKTGDEE+EK+ NEKD+Q ENMGN
Sbjct: 365  KNIKEAVTPPENIMEIKDSNPAAKTKSCRRVINFEMEKTGDEEREKERNEKDMQ-ENMGN 424

Query: 361  FCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTS 420
             C ITR NVP F TQSNG+CGTSPDV  +HRL T+VAE+V+P++QS +A MNHM TS  S
Sbjct: 425  SCFITRSNVPGFSTQSNGICGTSPDVQDNHRLGTLVAESVQPSIQSYIARMNHMMTSHIS 484

Query: 421  QSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQF 480
            QSEREAA  P N S YN AE L NV RI+DQGK  QYQ GFSNGYTPV+Q+IRAE+ME+F
Sbjct: 485  QSEREAAESPLNSSGYNKAESLFNVLRILDQGKGYQYQTGFSNGYTPVEQNIRAEEMEKF 544

Query: 481  ANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSV 540
            A  AKR+T +KE+MG+N  YSQT+PNHQSNINEARGSKRG PLT QPT  CSITTL+SSV
Sbjct: 545  ATTAKRNTYYKEMMGINSAYSQTVPNHQSNINEARGSKRGCPLTAQPTQLCSITTLDSSV 604

Query: 541  LCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGC 600
            LCQE LQTGE HR GSS N+G LEIP K FESGLY+TL+KRY+TIQ NE C  HLNT+GC
Sbjct: 605  LCQEALQTGEFHRLGSSTNVGSLEIPGKNFESGLYSTLHKRYSTIQPNEDCSRHLNTTGC 664

Query: 601  NPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
            +P  S GFT EMKQAMLN  HIRSNQ  DRQ++WTKE IGD +IHSVVH NNFQRRQ+SH
Sbjct: 665  SPTISAGFTAEMKQAMLNGYHIRSNQITDRQSSWTKEIIGDGYIHSVVHGNNFQRRQVSH 724

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
            NLHPEI+R CET+GLN V S+RSLI  DKCN+L+P+PH KASEQ YA RQ +NS+LT+RQ
Sbjct: 725  NLHPEINRMCETSGLNTVNSHRSLIIRDKCNMLQPFPHPKASEQWYACRQPNNSILTVRQ 784

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
            ACQPMISGSL TN VQK GYSFG  QF  KT  LLE EI  KLK L+L DD+G TRTEQN
Sbjct: 785  ACQPMISGSLATN-VQKQGYSFGMQQFSAKTTSLLEYEITRKLKSLSLKDDEGATRTEQN 844

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPY GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEG EGIENHEKDKEK
Sbjct: 845  AIVPYNGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGIEGIENHEKDKEK 904

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 905  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 964

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KSTSN RT  +VETS+VANE AACL YPADSIRW+ Q LS+P FEMPQTSI H
Sbjct: 965  LAARFPVKSTSNFRTPDEVETSIVANELAACLQYPADSIRWEGQELSVPSFEMPQTSIIH 1024

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            QNHRV SGTE FFTE G QIVEEEVISSQ SFDSTITQGT GARSCSGSNS+AEEPIVS 
Sbjct: 1025 QNHRVNSGTENFFTERGGQIVEEEVISSQGSFDSTITQGTAGARSCSGSNSEAEEPIVSN 1084

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NSSSTH SNFTDIKQMETTT ++KSFSD NR+SVFDEVSEHKHWQL DGKQDSLT EWNE
Sbjct: 1085 NSSSTHYSNFTDIKQMETTTVIEKSFSDKNRTSVFDEVSEHKHWQLPDGKQDSLTSEWNE 1144

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            IDNL+GHSL NFLVNIENQ KQ+P APSNNQL MTPDC VLEVEGREAFSEESISSGPSI
Sbjct: 1145 IDNLSGHSLFNFLVNIENQQKQLPDAPSNNQLRMTPDCGVLEVEGREAFSEESISSGPSI 1204

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
            +SGCSTEKN TCHSLN  DP+R+ DKIS EE  R AR+QE  RMEHSES+SEHSVH QGN
Sbjct: 1205 ISGCSTEKNTTCHSLNTEDPDRSSDKISAEE-NRPARTQETTRMEHSESVSEHSVHRQGN 1264

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
            GIQL S CEY LHD Y+PCERN TSP+ES SV+NP PELD PAK  +SALSNVV+   HT
Sbjct: 1265 GIQLTSRCEYSLHDKYKPCERNNTSPLESASVSNPPPELDTPAK--KSALSNVVHVHAHT 1324

Query: 1261 EKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEA 1320
            EKLLPG  N INFSNNE HSLSQADNEGN +S SKAKRRKVNSEK SA+DWD LRKQVEA
Sbjct: 1325 EKLLPGKGNLINFSNNEAHSLSQADNEGN-ISPSKAKRRKVNSEKNSAIDWDSLRKQVEA 1384

Query: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSID 1380
            NGQIKEKGKDAMDSIDYEAIRLANV EISSAIKERGMNNMLAERI+EFLNRLV DHGSID
Sbjct: 1385 NGQIKEKGKDAMDSIDYEAIRLANVQEISSAIKERGMNNMLAERIQEFLNRLVTDHGSID 1444

Query: 1381 LEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440
            LEWLRDVPPD+AKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP
Sbjct: 1445 LEWLRDVPPDQAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1504

Query: 1441 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500
            LPESLQLHLLELYPVLE+IQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC
Sbjct: 1505 LPESLQLHLLELYPVLETIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1564

Query: 1501 PMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTE 1560
            PMRGECKHFASAFASARLALPAPDEK IV STNP+A EKQP +V++ LPILPPE STYTE
Sbjct: 1565 PMRGECKHFASAFASARLALPAPDEKGIVASTNPIATEKQPPIVTSHLPILPPE-STYTE 1624

Query: 1561 STLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNY 1620
            +TL TSKCEPIVEVPAT  PEPEPNE+TESDIED FYEDPDEIPTIKLSMEEFKTTLQNY
Sbjct: 1625 NTLETSKCEPIVEVPAT--PEPEPNEMTESDIEDLFYEDPDEIPTIKLSMEEFKTTLQNY 1684

Query: 1621 IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1680
            IPEGDMSRALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHPLL+E+D REPDDPSP
Sbjct: 1685 IPEGDMSRALVALNPEAAYIPTPKLKNVSRLRTEHQVYELPDSHPLLREMDTREPDDPSP 1744

Query: 1681 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1740
            YLLAIWTPGETA+SIQPPEQSCGSQDP RLCNEKTCFTCNSRREANSQTVRGTLL+PCRT
Sbjct: 1745 YLLAIWTPGETADSIQPPEQSCGSQDPDRLCNEKTCFTCNSRREANSQTVRGTLLVPCRT 1804

Query: 1741 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1800
            AMRGSFPLNGTYFQVNEMFADHESS  PIDVPR WLWNLPRRTVYFGTSVS+IFKGLVTE
Sbjct: 1805 AMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRTWLWNLPRRTVYFGTSVSSIFKGLVTE 1864

Query: 1801 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            EIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK++NGHTE
Sbjct: 1865 EIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVRNGHTE 1897

BLAST of Tan0014193 vs. NCBI nr
Match: KAG7032001.1 (Transcriptional activator DEMETER, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 3021.5 bits (7832), Expect = 0.0e+00
Identity = 1553/1846 (84.13%), Positives = 1652/1846 (89.49%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQ NSSGDFYAGNLL+RNQNLY  SRPSSN+SYAQHVR YGLPMYQPN+N NP SMTQ
Sbjct: 70   MNSQYNSSGDFYAGNLLLRNQNLYSGSRPSSNDSYAQHVRTYGLPMYQPNHN-NPVSMTQ 129

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMSIF NS H PPVSSHLENFA+D I+T SFLVRDESSSFR+DGEDDFIR+FQ E PR
Sbjct: 130  ANQMSIFMNSAHTPPVSSHLENFAYDHIATSSFLVRDESSSFRKDGEDDFIRIFQAEAPR 189

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            Q CDELLQSIVESSCVGNSTPFK T DFGKQRDLEIDLN+TPEQRPPKRRQHTP+VFS E
Sbjct: 190  QPCDELLQSIVESSCVGNSTPFKGTKDFGKQRDLEIDLNKTPEQRPPKRRQHTPLVFSGE 249

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKE 240
             FTDLLNLPL E+LSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSA IS   PT E
Sbjct: 250  SFTDLLNLPLDENLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAGIS--EPTNE 309

Query: 241  VEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRK 300
             EKGSDQVID +KT EQKTP+RRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR+
Sbjct: 310  AEKGSDQVID-HKTTEQKTPKRRKHRPKVIKEGKPKKSPKPVTPKISKETPSGKRKYVRR 369

Query: 301  KNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 360
            KNIKEA TPP NI+EIKDS+ A KTKSCRRVINFEMEKTGDEE+EK+ NEKD+Q ENMGN
Sbjct: 370  KNIKEAVTPPENIMEIKDSNPAAKTKSCRRVINFEMEKTGDEEREKERNEKDMQ-ENMGN 429

Query: 361  FCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTS 420
             C ITR NVP F TQSNG+CGTSPDV  +HRL T+VAE+V+P++QS +A MNHM TS  S
Sbjct: 430  SCFITRSNVPGFSTQSNGICGTSPDVQDNHRLGTLVAESVQPSIQSYIARMNHMMTSHIS 489

Query: 421  QSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQF 480
            QSEREAA  P N S YN AE L NV RI+DQGK  QYQ GFSNGYTPV+Q+IRAE+ME+F
Sbjct: 490  QSEREAAESPLNSSGYNKAESLFNVLRILDQGKGYQYQTGFSNGYTPVEQNIRAEEMEKF 549

Query: 481  ANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSV 540
            A  AKR+T +KE+MG+N  YSQT+PNHQSNINEARGSKRG PLT QPT  CSITTL+SSV
Sbjct: 550  ATTAKRNTYYKEMMGINSAYSQTVPNHQSNINEARGSKRGCPLTAQPTQLCSITTLDSSV 609

Query: 541  LCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGC 600
            LCQE LQTGE HR GSS N+G LEIP K FESGLY+TL+KRY+TIQ NE C  HLNT+GC
Sbjct: 610  LCQEALQTGEFHRLGSSTNVGSLEIPGKNFESGLYSTLHKRYSTIQPNEDCSRHLNTTGC 669

Query: 601  NPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
            +P  S GFT EMKQAMLN  HIRSNQ  DRQ++WTKE IGD +IHSVVH NNFQRRQ+SH
Sbjct: 670  SPTISAGFTAEMKQAMLNGYHIRSNQITDRQSSWTKEIIGDGYIHSVVHGNNFQRRQVSH 729

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
            NLHPEI+R CET+GLN V S+RSLI  DKCN+L+P+PH KASEQ YA RQ +NS+LT+RQ
Sbjct: 730  NLHPEINRMCETSGLNTVNSHRSLIIRDKCNMLQPFPHPKASEQWYACRQPNNSILTVRQ 789

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
            ACQPMISGSL TN VQK GYSFG  QF  KT  LLE EI  KLK L+L DD+G TRTEQN
Sbjct: 790  ACQPMISGSLATN-VQKQGYSFGMQQFSAKTTSLLEYEITRKLKSLSLKDDEGATRTEQN 849

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPY GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEG EGIENHEKDKEK
Sbjct: 850  AIVPYNGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGIEGIENHEKDKEK 909

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 910  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 969

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KSTSN RT  +VETS+VANE AACL YPADSIRW+ Q LS+P FEMPQTSI H
Sbjct: 970  LAARFPVKSTSNFRTPDEVETSIVANELAACLQYPADSIRWEGQELSVPSFEMPQTSIIH 1029

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            QNHRV SGTE FFTE G QIVEEEVISSQ SFDSTITQGT GARSCSGSNS+AEEPIVS 
Sbjct: 1030 QNHRVNSGTENFFTERGGQIVEEEVISSQGSFDSTITQGTAGARSCSGSNSEAEEPIVSN 1089

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NSSSTH SNFTDIKQMETTT ++KSFSD NR+SVFDEVSEHKHWQL DGKQDSLT EWNE
Sbjct: 1090 NSSSTHYSNFTDIKQMETTTVIEKSFSDKNRTSVFDEVSEHKHWQLPDGKQDSLTSEWNE 1149

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            IDNL+GHSL NFLVNIENQ KQ+P APSNNQL MTPDC VLEVEGREAFSEESISSGPSI
Sbjct: 1150 IDNLSGHSLFNFLVNIENQQKQLPDAPSNNQLRMTPDCGVLEVEGREAFSEESISSGPSI 1209

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
            +SGCSTEKN TCHSLN  DP+R+ DKIS EE  R AR+QE  RMEHSES+SEHSVH QGN
Sbjct: 1210 ISGCSTEKNTTCHSLNTEDPDRSSDKISAEE-NRPARTQEITRMEHSESVSEHSVHRQGN 1269

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
            GIQL S CEY LHD Y+PCERN TSP+ES SV+NP PELD PAK  +SALSNVV+   HT
Sbjct: 1270 GIQLTSRCEYSLHDKYKPCERNNTSPLESASVSNPPPELDTPAK--KSALSNVVHVHAHT 1329

Query: 1261 EKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEA 1320
            EKLLPG  N INFSNNE HSLSQADNEGN +S SKAKRRKVNSEK SA+DWD LRKQVEA
Sbjct: 1330 EKLLPGKGNLINFSNNEAHSLSQADNEGN-ISPSKAKRRKVNSEKNSAIDWDSLRKQVEA 1389

Query: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSID 1380
            NGQIKEKGKDAMDSIDYEAIRLANV EISSAIKERGMNNMLAERI+EFLNRLV DHGSID
Sbjct: 1390 NGQIKEKGKDAMDSIDYEAIRLANVQEISSAIKERGMNNMLAERIQEFLNRLVTDHGSID 1449

Query: 1381 LEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440
            LEWLRDVPPD+AKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP
Sbjct: 1450 LEWLRDVPPDQAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1509

Query: 1441 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500
            LPESLQLHLLELYPVLE+IQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC
Sbjct: 1510 LPESLQLHLLELYPVLETIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1569

Query: 1501 PMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTE 1560
            PMRGECKHFASAFASARLALPAPDEK IV STNP+A EKQP +V++ LPILPPE STYTE
Sbjct: 1570 PMRGECKHFASAFASARLALPAPDEKGIVASTNPIATEKQPPIVTSHLPILPPE-STYTE 1629

Query: 1561 STLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNY 1620
            +TL TSKCEPIVEVPAT  PEPEPNE+TESDIED FYEDPDEIPTIKLSMEEFKTTLQNY
Sbjct: 1630 NTLETSKCEPIVEVPAT--PEPEPNEMTESDIEDLFYEDPDEIPTIKLSMEEFKTTLQNY 1689

Query: 1621 IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1680
            IPEGDMSRALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHPLL+E+D REPDDPSP
Sbjct: 1690 IPEGDMSRALVALNPEAAYIPTPKLKNVSRLRTEHQVYELPDSHPLLREMDTREPDDPSP 1749

Query: 1681 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1740
            YLLAIWTPGETA+SIQPPEQSCGSQDP RLCNEKTCFTCNSRREANSQTVRGTLL+PCRT
Sbjct: 1750 YLLAIWTPGETADSIQPPEQSCGSQDPDRLCNEKTCFTCNSRREANSQTVRGTLLVPCRT 1809

Query: 1741 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1800
            AMRGSFPLNGTYFQVNEMFADHESS  PIDVPR WLWNLPRRTVYFGTSVS+IFKGLVTE
Sbjct: 1810 AMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRTWLWNLPRRTVYFGTSVSSIFKGLVTE 1869

Query: 1801 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            EIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK++NGHTE
Sbjct: 1870 EIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVRNGHTE 1902

BLAST of Tan0014193 vs. NCBI nr
Match: XP_023518039.1 (transcriptional activator DEMETER-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023518047.1 transcriptional activator DEMETER-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 3019.6 bits (7827), Expect = 0.0e+00
Identity = 1553/1846 (84.13%), Positives = 1648/1846 (89.27%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQ NSSGDFYAGNLL+RNQNLY  SRPSSN+SYAQHVR YGLPMYQPN+N NP SMTQ
Sbjct: 1    MNSQYNSSGDFYAGNLLLRNQNLYSGSRPSSNDSYAQHVRTYGLPMYQPNHN-NPVSMTQ 60

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMSIF NS H PPVSSHLENFA+D I+T SFLVRDESSSFR+DGEDDFIRMFQ E PR
Sbjct: 61   GNQMSIFMNSAHTPPVSSHLENFAYDHIATSSFLVRDESSSFRKDGEDDFIRMFQAEAPR 120

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            Q CDELLQSIVESSCVGNSTPFK T DFGKQRDLEIDLN+TPEQRPPKRRQHTPMVFS E
Sbjct: 121  QPCDELLQSIVESSCVGNSTPFKGTKDFGKQRDLEIDLNKTPEQRPPKRRQHTPMVFSGE 180

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKE 240
             FTDLLNLPL E+LSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSA IS   PT E
Sbjct: 181  SFTDLLNLPLDENLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAGIS--EPTNE 240

Query: 241  VEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRK 300
             EKGSDQVID +KT EQKTP+RRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR+
Sbjct: 241  AEKGSDQVID-HKTTEQKTPKRRKHRPKVIKEGKPKKSPKPVTPKISKETPSGKRKYVRR 300

Query: 301  KNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 360
            KNIKEA TPP NI+EIKDS+   KTKSCRRVINFEMEKTGDEEQEK+ NEKD+Q ENMGN
Sbjct: 301  KNIKEAVTPPENIMEIKDSNPEAKTKSCRRVINFEMEKTGDEEQEKERNEKDMQ-ENMGN 360

Query: 361  FCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTS 420
             C ITR NVP F TQSNG+CGTSPDV  +HRL T+VAE+V+P++QS +A MNHM TS  S
Sbjct: 361  SCFITRSNVPGFSTQSNGICGTSPDVQDNHRLGTLVAESVQPSIQSYIARMNHMMTSHIS 420

Query: 421  QSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQF 480
            QSEREAA  P N S YN AE L NV RI+DQGK  QYQ GFSNGYTPV+Q+IRAE+ME+F
Sbjct: 421  QSEREAAESPLNSSGYNKAESLFNVLRILDQGKGYQYQTGFSNGYTPVEQNIRAEEMEKF 480

Query: 481  ANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSV 540
            A  AKR+T +KE+MGMN  YSQT+PNHQSNINEARGSKRG PLT QPT  CSIT+L+SSV
Sbjct: 481  ATTAKRNTYYKEMMGMNSAYSQTVPNHQSNINEARGSKRGCPLTAQPTQLCSITSLDSSV 540

Query: 541  LCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGC 600
            LCQE LQTGE HR GSS N+G LEIP KKFESGLY+TL+ RY+TIQ NE C  HLNT+GC
Sbjct: 541  LCQEALQTGEFHRLGSSTNVGSLEIPGKKFESGLYSTLHNRYSTIQPNEDCSRHLNTTGC 600

Query: 601  NPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
            +P  SVGFT EMKQAMLN  HIRSNQ  DRQ++WTKE IGD HIHSVVH NNFQRRQ+SH
Sbjct: 601  SPTISVGFTAEMKQAMLNGYHIRSNQITDRQSSWTKEIIGDGHIHSVVHGNNFQRRQVSH 660

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
            NLHPEI+R CET+GLN V S+RSLI  DKCN+L+P+PH KA EQ YA RQ +NS+LT+RQ
Sbjct: 661  NLHPEINRMCETSGLNTVNSHRSLIIRDKCNMLQPFPHPKAPEQWYACRQPNNSILTVRQ 720

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
            ACQPMISGSL TN VQK GYSFG  QF  KT  LLE EI  KLK L+L DD+G TRTEQN
Sbjct: 721  ACQPMISGSLATN-VQKQGYSFGMQQFSAKTTSLLEYEITRKLKSLSLKDDEGATRTEQN 780

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPY GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEG EGIENHEKDKEK
Sbjct: 781  AIVPYNGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGIEGIENHEKDKEK 840

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KSTSN RT  +VETS+VANE A CL YPADSIRWD Q LS+P FEMPQTSI H
Sbjct: 901  LAARFPVKSTSNFRTPDEVETSIVANELATCLQYPADSIRWDGQELSVPSFEMPQTSIIH 960

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            QNHRV SGTE FFTE G QIVEEEVISSQ SFDSTITQGT GARSCSGSNS+AEEPIVS 
Sbjct: 961  QNHRVNSGTENFFTERGGQIVEEEVISSQGSFDSTITQGTAGARSCSGSNSEAEEPIVSN 1020

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NS+STH SNFTDIKQ ETTT ++K FSD NR+SVFDEVSEHKHWQL DGKQDSLT EWNE
Sbjct: 1021 NSNSTHYSNFTDIKQTETTTVIEKPFSDKNRTSVFDEVSEHKHWQLPDGKQDSLTSEWNE 1080

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            IDNL+GHSL NFLVNIENQ K++P APSNNQL MT DC VLEVEGREAFSEESISSGPSI
Sbjct: 1081 IDNLSGHSLFNFLVNIENQQKKLPDAPSNNQLRMTSDCGVLEVEGREAFSEESISSGPSI 1140

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
            +SGCSTEKN TCHSLN  DP+R+ DKIS EE  R AR+QE  RMEHSES+SEHSVH QGN
Sbjct: 1141 ISGCSTEKNTTCHSLNTEDPDRSSDKISAEE-NRPARTQETTRMEHSESVSEHSVHRQGN 1200

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
            GIQL S CEY LHDNY+PCERN TSP+ES SV+NP PELD PAK  +SALSNVV+   HT
Sbjct: 1201 GIQLTSRCEYSLHDNYKPCERNNTSPLESASVSNPPPELDTPAK--KSALSNVVHVHAHT 1260

Query: 1261 EKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEA 1320
            EKLLPG  N INFSNNE HSLSQADNEGN +S SKAKRRKVNSEK SA+DWD LRKQVEA
Sbjct: 1261 EKLLPGKGNLINFSNNEAHSLSQADNEGN-ISPSKAKRRKVNSEKNSAIDWDSLRKQVEA 1320

Query: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSID 1380
            NGQIKEKGKDAMDSIDYEAIRLANV EISSAIKERGMNNMLAERIKEFLNRLV DHGSID
Sbjct: 1321 NGQIKEKGKDAMDSIDYEAIRLANVQEISSAIKERGMNNMLAERIKEFLNRLVTDHGSID 1380

Query: 1381 LEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440
            LEWLRDVPPD+AKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP
Sbjct: 1381 LEWLRDVPPDQAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440

Query: 1441 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500
            LPESLQLHLLELYPVLE+IQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC
Sbjct: 1441 LPESLQLHLLELYPVLETIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500

Query: 1501 PMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTE 1560
            PMRGECKHFASAFASARLALPAPDEK IV STNP++ EKQP +V++ LPILPPE STYTE
Sbjct: 1501 PMRGECKHFASAFASARLALPAPDEKGIVASTNPISTEKQPPIVTSHLPILPPE-STYTE 1560

Query: 1561 STLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNY 1620
            +TL TSKCEPIVEVPAT  PEPEPNE+TESDIED FYEDPDEIPTIKLSMEEFKTTLQNY
Sbjct: 1561 NTLETSKCEPIVEVPAT--PEPEPNEMTESDIEDLFYEDPDEIPTIKLSMEEFKTTLQNY 1620

Query: 1621 IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1680
            IPEGDMSRALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHPLL+E+DRREPDDPSP
Sbjct: 1621 IPEGDMSRALVALNPEAAYIPTPKLKNVSRLRTEHQVYELPDSHPLLREMDRREPDDPSP 1680

Query: 1681 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1740
            YLLAIWTPGETA+SIQPPEQSCGSQDP RLCNEKTCFTCNSRREANSQTVRGTLL+PCRT
Sbjct: 1681 YLLAIWTPGETADSIQPPEQSCGSQDPDRLCNEKTCFTCNSRREANSQTVRGTLLVPCRT 1740

Query: 1741 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1800
            AMRGSFPLNGTYFQVNEMFADHESS  PIDVPR WLWNLPRRTVYFGTSVS+IFKGLVTE
Sbjct: 1741 AMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRTWLWNLPRRTVYFGTSVSSIFKGLVTE 1800

Query: 1801 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            EIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNGH E
Sbjct: 1801 EIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGHRE 1833

BLAST of Tan0014193 vs. NCBI nr
Match: XP_022997004.1 (transcriptional activator DEMETER-like isoform X1 [Cucurbita maxima] >XP_022997011.1 transcriptional activator DEMETER-like isoform X1 [Cucurbita maxima] >XP_022997018.1 transcriptional activator DEMETER-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 3019.2 bits (7826), Expect = 0.0e+00
Identity = 1554/1846 (84.18%), Positives = 1653/1846 (89.54%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQ NSSGDFYAGNLL+RNQNLY  SRPSSN+SYAQHVR YGLPMYQ N+N NP SMTQ
Sbjct: 1    MNSQYNSSGDFYAGNLLLRNQNLYSGSRPSSNDSYAQHVRTYGLPMYQANHN-NPVSMTQ 60

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMSIF NS+H PPVSSHLENFA+D I+T SFLVRDESSSFR+DGEDDFIRMFQ E PR
Sbjct: 61   ANQMSIFMNSVHAPPVSSHLENFAYDHIATSSFLVRDESSSFRKDGEDDFIRMFQAEAPR 120

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            Q CDELLQSIVESS VGNSTPFK T DFGKQRDLEIDLN+TPEQRPPKRRQHTP+VFS E
Sbjct: 121  QPCDELLQSIVESSGVGNSTPFKGTKDFGKQRDLEIDLNKTPEQRPPKRRQHTPVVFSGE 180

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKE 240
             FTDLLNLPL E+LSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSA IS   PT E
Sbjct: 181  SFTDLLNLPLDENLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAGIS--EPTNE 240

Query: 241  VEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRK 300
             EKGSDQVID +KT EQKTP+RRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR+
Sbjct: 241  AEKGSDQVID-HKTTEQKTPKRRKHRPKVIKEGKPKKSPKPVTPKISKETPSGKRKYVRR 300

Query: 301  KNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 360
            KNIKEA TPP NI+EIKDS+ A KTKSCRRVINFEMEKTGDEE+EK+ NEKD+Q ENMGN
Sbjct: 301  KNIKEAVTPPENIMEIKDSNPAAKTKSCRRVINFEMEKTGDEEREKERNEKDMQ-ENMGN 360

Query: 361  FCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTS 420
             C ITR NVP F TQSNG+CGTSPDV  +HRL  +VAE+V+P++QS +A MNHM TS  S
Sbjct: 361  SCFITRSNVPGFSTQSNGICGTSPDVQDNHRLGALVAESVQPSIQSYIARMNHMMTSRIS 420

Query: 421  QSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQF 480
            QSEREAA  P N S YN AE L NV RI+DQGK  QYQ GFSNGYTPV+Q+IRAE+ME+F
Sbjct: 421  QSEREAAESPLNSSGYNKAESLFNVLRILDQGKGYQYQAGFSNGYTPVEQNIRAEEMEKF 480

Query: 481  ANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSV 540
            A  AKR+T ++E++G+N  YSQT+PNHQSNINEARGSKRG PLT QPT  CSITTL+SSV
Sbjct: 481  ATTAKRNTCYQEMIGINSAYSQTVPNHQSNINEARGSKRGCPLTAQPTQLCSITTLDSSV 540

Query: 541  LCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGC 600
            LCQE LQTGE H  GSS N+G LEIP KKFESGLY+TL+KRY+TIQ NE C  HLNT+GC
Sbjct: 541  LCQEALQTGEFHGLGSSTNVGSLEIPGKKFESGLYSTLHKRYSTIQPNEDCSRHLNTTGC 600

Query: 601  NPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
            +P  SVGFT EMKQAMLN  HIRSNQ  DRQ++WTKE IGD HIHSVVH NNFQRRQ+SH
Sbjct: 601  SPTISVGFTAEMKQAMLNGYHIRSNQITDRQSSWTKEIIGDGHIHSVVHGNNFQRRQVSH 660

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
            NLHPEI+R CET+GLNKV S+RSLI  DKCN+L+P+PH KASEQ YA +Q +NS+LT+RQ
Sbjct: 661  NLHPEINRMCETSGLNKVNSHRSLIIRDKCNMLQPFPHPKASEQWYACKQPNNSILTLRQ 720

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
            ACQPMISGSL TN VQK GYSFG  QF  KT GLLE EI  KLK L+L DD+G TRTEQN
Sbjct: 721  ACQPMISGSLATN-VQKQGYSFGMQQFSAKTTGLLEYEITRKLKSLSLKDDEGATRTEQN 780

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPY GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEG EGIENHEKDKEK
Sbjct: 781  AIVPYNGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGIEGIENHEKDKEK 840

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KSTSN RT  +VETS+VANE AACL YPADSIRW+ Q LS+P FEMPQTSI H
Sbjct: 901  LAARFPVKSTSNFRTPDEVETSIVANELAACLQYPADSIRWEGQELSVPSFEMPQTSIIH 960

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            QNHRV SGTE FFTE G QIVEEEVISSQ SFDSTITQGT GARSCSGSNS+AEEPIVS 
Sbjct: 961  QNHRVNSGTENFFTERGGQIVEEEVISSQGSFDSTITQGTAGARSCSGSNSEAEEPIVSN 1020

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NSSSTH SNFTDIKQMETTT ++KSFSD NR+SVFDEVSEHKHWQL DGKQDSLT EWNE
Sbjct: 1021 NSSSTHYSNFTDIKQMETTTVIEKSFSDKNRTSVFDEVSEHKHWQLPDGKQDSLTSEWNE 1080

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            IDNL+GHSL NFLVNIENQ KQ+P APSNNQL MTPDC VLEVEGREAFSEESISSGPSI
Sbjct: 1081 IDNLSGHSLFNFLVNIENQQKQLPDAPSNNQLRMTPDCGVLEVEGREAFSEESISSGPSI 1140

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
            +SGCSTEKN TCHSLN  DP+R+ DKIS EE  R AR+QE  RMEHSES+SEHSVH QGN
Sbjct: 1141 ISGCSTEKNTTCHSLNTKDPDRSSDKISAEE-NRPARTQETTRMEHSESVSEHSVHRQGN 1200

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
            GIQL S C Y LHDNY+PC RN TSP+ES SV+NP PELDAPAK  +SALSNVV+   HT
Sbjct: 1201 GIQLTSLCGYSLHDNYKPCARNNTSPLESASVSNPPPELDAPAK--KSALSNVVHVHAHT 1260

Query: 1261 EKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEA 1320
            EKLLPG  N INFSNNE HSLSQADN GN +S SKAKRRKVNSEK SA+DWD LRKQVEA
Sbjct: 1261 EKLLPGKGNLINFSNNEAHSLSQADNGGN-ISPSKAKRRKVNSEKISAIDWDSLRKQVEA 1320

Query: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSID 1380
            NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLV DHGSID
Sbjct: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVTDHGSID 1380

Query: 1381 LEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440
            LEWLRDVPPD+AKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP
Sbjct: 1381 LEWLRDVPPDQAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440

Query: 1441 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500
            LPESLQLHLLELYPVLE+IQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC
Sbjct: 1441 LPESLQLHLLELYPVLETIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500

Query: 1501 PMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTE 1560
            PMRGECKHFASAFASARLALPAPDEK IV STNP+A +KQP +V+N LPILPPE STYTE
Sbjct: 1501 PMRGECKHFASAFASARLALPAPDEKGIVASTNPIATQKQPPIVTNHLPILPPE-STYTE 1560

Query: 1561 STLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNY 1620
            + L TSKCEPIVEVPAT  PEPEPNE+TESDIED FYEDPDEIPTIKLSMEEFKTTLQNY
Sbjct: 1561 NALETSKCEPIVEVPAT--PEPEPNEMTESDIEDLFYEDPDEIPTIKLSMEEFKTTLQNY 1620

Query: 1621 IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1680
            IPEGDMSRALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHPLL+E+DRREPDDPSP
Sbjct: 1621 IPEGDMSRALVALNPEAAYIPTPKLKNVSRLRTEHQVYELPDSHPLLREMDRREPDDPSP 1680

Query: 1681 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1740
            YLLAIWTPGETA+SIQPPEQSCGSQDP RLCNEKTCFTCNSRREANSQTVRGTLL+PCRT
Sbjct: 1681 YLLAIWTPGETADSIQPPEQSCGSQDPDRLCNEKTCFTCNSRREANSQTVRGTLLVPCRT 1740

Query: 1741 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1800
            AMRGSFPLNGTYFQVNEMFADHESS  PIDVPR WLWNLPRRTVYFGTSVS+IFKGLVTE
Sbjct: 1741 AMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRTWLWNLPRRTVYFGTSVSSIFKGLVTE 1800

Query: 1801 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            EIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNGHTE
Sbjct: 1801 EIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGHTE 1833

BLAST of Tan0014193 vs. ExPASy TrEMBL
Match: A0A0A0KTG6 (ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G615310 PE=3 SV=1)

HSP 1 Score: 3036.9 bits (7872), Expect = 0.0e+00
Identity = 1559/1861 (83.77%), Positives = 1662/1861 (89.31%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQVNSSGDFYAGNLL+RNQN+Y  SRPS+NNS+AQHV  YGLPM+QPNYNLNP SMTQ
Sbjct: 1    MNSQVNSSGDFYAGNLLLRNQNIYSGSRPSTNNSFAQHVLTYGLPMFQPNYNLNPVSMTQ 60

Query: 61   MNQMSIFTNSIH-PPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPP 120
             NQ  IFTNS+H  PPVSS++E+ A++ +STPSFLVRDESS FR++  DDFIRMFQDE P
Sbjct: 61   TNQ--IFTNSVHTTPPVSSNVESVAYNQVSTPSFLVRDESSCFRKNA-DDFIRMFQDEAP 120

Query: 121  RQHCDELLQSI--------------VESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQR 180
            RQHCDELLQSI              VESSCVGNSTPFK T DF KQ+DLEIDLNRTPEQR
Sbjct: 121  RQHCDELLQSIVESSCVGNSTPFKGVESSCVGNSTPFKGTKDFVKQKDLEIDLNRTPEQR 180

Query: 181  PPKRRQHTPMVFS-ERFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLT 240
            PPKRRQHTP VFS ERFTDLLNLPL  +LSLYEETQENFVTVPLDEATQKRHDELLKDLT
Sbjct: 181  PPKRRQHTPTVFSGERFTDLLNLPLDGNLSLYEETQENFVTVPLDEATQKRHDELLKDLT 240

Query: 241  DTLSAAISAPTPTKEVEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPK 300
            DTLSAAIS   PTKEVEKGSDQ IDLNKTPEQKTP+RRKHRPKVIKEGKPKKSPKPVTPK
Sbjct: 241  DTLSAAIS--EPTKEVEKGSDQAIDLNKTPEQKTPKRRKHRPKVIKEGKPKKSPKPVTPK 300

Query: 301  IPKETPSGKRKYVRKKNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQE 360
            I KETPSGKRKYVRKKNIKEA TPPAN+VEIKDS+TATKTKSCRRVI+FEMEKTGDEEQE
Sbjct: 301  ISKETPSGKRKYVRKKNIKEATTPPANVVEIKDSNTATKTKSCRRVIHFEMEKTGDEEQE 360

Query: 361  KKHNEKDVQEENMGNFCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQ 420
            KK NEKDV EENMGNFC +TRPNVPDFC+QS  VCGTS DVH S +L  MVAENVRPT+ 
Sbjct: 361  KKQNEKDVSEENMGNFCFMTRPNVPDFCSQSTSVCGTSQDVHDSTQLGPMVAENVRPTIP 420

Query: 421  SNLAHMNHMTTSLTSQSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGY 480
            SN  HMNHMTTS   QSEREAA  P NKS YN AE+ LNV RI+ QG+A+QYQ GFSNGY
Sbjct: 421  SNPTHMNHMTTSHILQSEREAAEVPLNKSGYNKAENWLNVLRILHQGRANQYQTGFSNGY 480

Query: 481  TPVQQHIRAEDMEQFANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTI 540
             PVQQ+I AEDM+QFAN AKR+T +KE+MG+N  Y QT+PNHQSNINEARGSKRGRPLT 
Sbjct: 481  APVQQNICAEDMQQFANQAKRNTYYKEVMGINSGYCQTVPNHQSNINEARGSKRGRPLTT 540

Query: 541  QPTPSCSITTLNSSVLCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTI 600
             PT  CSITTL+SS+ CQEV Q GE  RQGS++NIGPLE P KKFESGLYATL+KRY+TI
Sbjct: 541  YPTQPCSITTLDSSMTCQEVRQIGEFQRQGSNINIGPLENPGKKFESGLYATLHKRYSTI 600

Query: 601  QANEGCPSHLNTSGCNPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIH 660
            Q+NEGC SHLNT GCNP NSVGFT EMKQAMLN  HIRSNQ         KE IGDRHIH
Sbjct: 601  QSNEGCSSHLNTIGCNPTNSVGFTAEMKQAMLNGHHIRSNQIT------AKEIIGDRHIH 660

Query: 661  SVVHENNFQRRQISHNLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQG 720
            SVVHEN+FQR+Q+SHNLHP +DRT   +GLNKV SYRSL+TGDKCN+++P+PH KA EQG
Sbjct: 661  SVVHENHFQRQQVSHNLHPAVDRTSVASGLNKVASYRSLMTGDKCNMIQPFPHPKAPEQG 720

Query: 721  YAYRQSDNSMLTIRQACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKG 780
            YA RQSDNS+LT+RQA QPMISGSL TN+V K GYSFGF +FP KT  LLENEI+HK+K 
Sbjct: 721  YACRQSDNSILTVRQAYQPMISGSLATNEVHKQGYSFGFQKFPAKTTSLLENEILHKMKR 780

Query: 781  LNLNDDKGTTRTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGK 840
            L+LND + + R+EQNAIVPYKGNGAVVPYVESEYLRKRKARPRVD+DPETERIWNLLMGK
Sbjct: 781  LSLNDHEVSIRSEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDIDPETERIWNLLMGK 840

Query: 841  EGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL 900
            EGSEGIE+HEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL
Sbjct: 841  EGSEGIESHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL 900

Query: 901  TQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQV 960
            TQNVSDHLSSSAFMSLAARFP+KS SN+RTQG+VETS+VANESAAC+LYPA+SIRW  Q 
Sbjct: 901  TQNVSDHLSSSAFMSLAARFPVKSASNLRTQGEVETSIVANESAACVLYPAESIRWHVQE 960

Query: 961  LSLPRFEMPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARS 1020
            LS+PRFEMPQTSINHQN    SGTE  FTE+G QIVEEEVISSQDSFDSTITQGT GARS
Sbjct: 961  LSVPRFEMPQTSINHQNQIANSGTEKIFTELGGQIVEEEVISSQDSFDSTITQGTAGARS 1020

Query: 1021 CSGSNSDAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQ 1080
            CSGSNS+AEEPIVSYNSSSTH SNFTDIKQMETT ++QKSFSDLNRSSV DEVSEHKHWQ
Sbjct: 1021 CSGSNSEAEEPIVSYNSSSTHYSNFTDIKQMETTATIQKSFSDLNRSSVSDEVSEHKHWQ 1080

Query: 1081 LSDGKQDSLT-EWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEG 1140
            L DGKQ SLT EWNEIDNL+GHSLINFLVNIENQ KQVP APSNNQLH+TPDC VLEVEG
Sbjct: 1081 LPDGKQGSLTSEWNEIDNLSGHSLINFLVNIENQPKQVPDAPSNNQLHITPDCGVLEVEG 1140

Query: 1141 REAFSEESISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRME 1200
            REAFSEES SSGPSIVSGCSTEKNMT H LNIG  E+ LDK S E+   QARS E  RME
Sbjct: 1141 REAFSEESTSSGPSIVSGCSTEKNMTFHRLNIGALEQRLDKTSAED-NVQARSHETTRME 1200

Query: 1201 HSESISEHSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKM 1260
            HSES+SEHSVHLQGNGIQ  SHCEY LH  YEPCERN TSP+ES SVTNP PELD PA  
Sbjct: 1201 HSESVSEHSVHLQGNGIQFRSHCEYNLHGKYEPCERNNTSPVESVSVTNPPPELDTPA-- 1260

Query: 1261 QQSALSNVVNATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEK 1320
            ++SA+SNVV+   HTEKLLPG  N INFSNNE HSLSQA NEGN +S SKAKRRKVNSEK
Sbjct: 1261 EKSAVSNVVHVHAHTEKLLPGKGNLINFSNNEAHSLSQAHNEGN-ISPSKAKRRKVNSEK 1320

Query: 1321 KSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERI 1380
            K  +DWD LRKQVEANGQIKEKGKDAMDSIDYEAIRLA+V EIS+AIKERGMNNMLAERI
Sbjct: 1321 KGGMDWDSLRKQVEANGQIKEKGKDAMDSIDYEAIRLADVREISNAIKERGMNNMLAERI 1380

Query: 1381 KEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN 1440
            KEFLNRLV DHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN
Sbjct: 1381 KEFLNRLVTDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN 1440

Query: 1441 VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF 1500
            VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF
Sbjct: 1441 VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF 1500

Query: 1501 GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVS 1560
            GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEK IV STNP++ EKQP +V+
Sbjct: 1501 GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKGIVASTNPMSTEKQPPIVT 1560

Query: 1561 NPLPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPT 1620
            NPLPILPPEGSTY E+T G SKCEPIVEVPAT  PEPEPNEITESDIED+FYEDPDEIPT
Sbjct: 1561 NPLPILPPEGSTYAENTSGPSKCEPIVEVPAT--PEPEPNEITESDIEDAFYEDPDEIPT 1620

Query: 1621 IKLSMEEFKTTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHP 1680
            IKLSMEEFKTTLQ+YIPEGDMS+ALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHP
Sbjct: 1621 IKLSMEEFKTTLQHYIPEGDMSKALVALNPEAAFIPTPKLKNVSRLRTEHQVYELPDSHP 1680

Query: 1681 LLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREA 1740
            LL+E+DRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDP RLCNE TCFTCNSRREA
Sbjct: 1681 LLREMDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPNRLCNEITCFTCNSRREA 1740

Query: 1741 NSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVY 1800
            NSQTVRGTLL+PCRTAMRGSFPLNGTYFQVNEMFADHESS  PIDVPRKWLWNLPRRTVY
Sbjct: 1741 NSQTVRGTLLVPCRTAMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRKWLWNLPRRTVY 1800

Query: 1801 FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHT 1844
            FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNG T
Sbjct: 1801 FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGQT 1844

BLAST of Tan0014193 vs. ExPASy TrEMBL
Match: A0A6J1KCN5 (transcriptional activator DEMETER-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492060 PE=3 SV=1)

HSP 1 Score: 3019.2 bits (7826), Expect = 0.0e+00
Identity = 1554/1846 (84.18%), Positives = 1653/1846 (89.54%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQ NSSGDFYAGNLL+RNQNLY  SRPSSN+SYAQHVR YGLPMYQ N+N NP SMTQ
Sbjct: 1    MNSQYNSSGDFYAGNLLLRNQNLYSGSRPSSNDSYAQHVRTYGLPMYQANHN-NPVSMTQ 60

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMSIF NS+H PPVSSHLENFA+D I+T SFLVRDESSSFR+DGEDDFIRMFQ E PR
Sbjct: 61   ANQMSIFMNSVHAPPVSSHLENFAYDHIATSSFLVRDESSSFRKDGEDDFIRMFQAEAPR 120

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            Q CDELLQSIVESS VGNSTPFK T DFGKQRDLEIDLN+TPEQRPPKRRQHTP+VFS E
Sbjct: 121  QPCDELLQSIVESSGVGNSTPFKGTKDFGKQRDLEIDLNKTPEQRPPKRRQHTPVVFSGE 180

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKE 240
             FTDLLNLPL E+LSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSA IS   PT E
Sbjct: 181  SFTDLLNLPLDENLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAGIS--EPTNE 240

Query: 241  VEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRK 300
             EKGSDQVID +KT EQKTP+RRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR+
Sbjct: 241  AEKGSDQVID-HKTTEQKTPKRRKHRPKVIKEGKPKKSPKPVTPKISKETPSGKRKYVRR 300

Query: 301  KNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 360
            KNIKEA TPP NI+EIKDS+ A KTKSCRRVINFEMEKTGDEE+EK+ NEKD+Q ENMGN
Sbjct: 301  KNIKEAVTPPENIMEIKDSNPAAKTKSCRRVINFEMEKTGDEEREKERNEKDMQ-ENMGN 360

Query: 361  FCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTS 420
             C ITR NVP F TQSNG+CGTSPDV  +HRL  +VAE+V+P++QS +A MNHM TS  S
Sbjct: 361  SCFITRSNVPGFSTQSNGICGTSPDVQDNHRLGALVAESVQPSIQSYIARMNHMMTSRIS 420

Query: 421  QSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQF 480
            QSEREAA  P N S YN AE L NV RI+DQGK  QYQ GFSNGYTPV+Q+IRAE+ME+F
Sbjct: 421  QSEREAAESPLNSSGYNKAESLFNVLRILDQGKGYQYQAGFSNGYTPVEQNIRAEEMEKF 480

Query: 481  ANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSV 540
            A  AKR+T ++E++G+N  YSQT+PNHQSNINEARGSKRG PLT QPT  CSITTL+SSV
Sbjct: 481  ATTAKRNTCYQEMIGINSAYSQTVPNHQSNINEARGSKRGCPLTAQPTQLCSITTLDSSV 540

Query: 541  LCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGC 600
            LCQE LQTGE H  GSS N+G LEIP KKFESGLY+TL+KRY+TIQ NE C  HLNT+GC
Sbjct: 541  LCQEALQTGEFHGLGSSTNVGSLEIPGKKFESGLYSTLHKRYSTIQPNEDCSRHLNTTGC 600

Query: 601  NPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
            +P  SVGFT EMKQAMLN  HIRSNQ  DRQ++WTKE IGD HIHSVVH NNFQRRQ+SH
Sbjct: 601  SPTISVGFTAEMKQAMLNGYHIRSNQITDRQSSWTKEIIGDGHIHSVVHGNNFQRRQVSH 660

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
            NLHPEI+R CET+GLNKV S+RSLI  DKCN+L+P+PH KASEQ YA +Q +NS+LT+RQ
Sbjct: 661  NLHPEINRMCETSGLNKVNSHRSLIIRDKCNMLQPFPHPKASEQWYACKQPNNSILTLRQ 720

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
            ACQPMISGSL TN VQK GYSFG  QF  KT GLLE EI  KLK L+L DD+G TRTEQN
Sbjct: 721  ACQPMISGSLATN-VQKQGYSFGMQQFSAKTTGLLEYEITRKLKSLSLKDDEGATRTEQN 780

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPY GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEG EGIENHEKDKEK
Sbjct: 781  AIVPYNGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGIEGIENHEKDKEK 840

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KSTSN RT  +VETS+VANE AACL YPADSIRW+ Q LS+P FEMPQTSI H
Sbjct: 901  LAARFPVKSTSNFRTPDEVETSIVANELAACLQYPADSIRWEGQELSVPSFEMPQTSIIH 960

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            QNHRV SGTE FFTE G QIVEEEVISSQ SFDSTITQGT GARSCSGSNS+AEEPIVS 
Sbjct: 961  QNHRVNSGTENFFTERGGQIVEEEVISSQGSFDSTITQGTAGARSCSGSNSEAEEPIVSN 1020

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NSSSTH SNFTDIKQMETTT ++KSFSD NR+SVFDEVSEHKHWQL DGKQDSLT EWNE
Sbjct: 1021 NSSSTHYSNFTDIKQMETTTVIEKSFSDKNRTSVFDEVSEHKHWQLPDGKQDSLTSEWNE 1080

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            IDNL+GHSL NFLVNIENQ KQ+P APSNNQL MTPDC VLEVEGREAFSEESISSGPSI
Sbjct: 1081 IDNLSGHSLFNFLVNIENQQKQLPDAPSNNQLRMTPDCGVLEVEGREAFSEESISSGPSI 1140

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
            +SGCSTEKN TCHSLN  DP+R+ DKIS EE  R AR+QE  RMEHSES+SEHSVH QGN
Sbjct: 1141 ISGCSTEKNTTCHSLNTKDPDRSSDKISAEE-NRPARTQETTRMEHSESVSEHSVHRQGN 1200

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
            GIQL S C Y LHDNY+PC RN TSP+ES SV+NP PELDAPAK  +SALSNVV+   HT
Sbjct: 1201 GIQLTSLCGYSLHDNYKPCARNNTSPLESASVSNPPPELDAPAK--KSALSNVVHVHAHT 1260

Query: 1261 EKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEA 1320
            EKLLPG  N INFSNNE HSLSQADN GN +S SKAKRRKVNSEK SA+DWD LRKQVEA
Sbjct: 1261 EKLLPGKGNLINFSNNEAHSLSQADNGGN-ISPSKAKRRKVNSEKISAIDWDSLRKQVEA 1320

Query: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSID 1380
            NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLV DHGSID
Sbjct: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVTDHGSID 1380

Query: 1381 LEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440
            LEWLRDVPPD+AKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP
Sbjct: 1381 LEWLRDVPPDQAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440

Query: 1441 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500
            LPESLQLHLLELYPVLE+IQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC
Sbjct: 1441 LPESLQLHLLELYPVLETIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500

Query: 1501 PMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTE 1560
            PMRGECKHFASAFASARLALPAPDEK IV STNP+A +KQP +V+N LPILPPE STYTE
Sbjct: 1501 PMRGECKHFASAFASARLALPAPDEKGIVASTNPIATQKQPPIVTNHLPILPPE-STYTE 1560

Query: 1561 STLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNY 1620
            + L TSKCEPIVEVPAT  PEPEPNE+TESDIED FYEDPDEIPTIKLSMEEFKTTLQNY
Sbjct: 1561 NALETSKCEPIVEVPAT--PEPEPNEMTESDIEDLFYEDPDEIPTIKLSMEEFKTTLQNY 1620

Query: 1621 IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1680
            IPEGDMSRALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHPLL+E+DRREPDDPSP
Sbjct: 1621 IPEGDMSRALVALNPEAAYIPTPKLKNVSRLRTEHQVYELPDSHPLLREMDRREPDDPSP 1680

Query: 1681 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1740
            YLLAIWTPGETA+SIQPPEQSCGSQDP RLCNEKTCFTCNSRREANSQTVRGTLL+PCRT
Sbjct: 1681 YLLAIWTPGETADSIQPPEQSCGSQDPDRLCNEKTCFTCNSRREANSQTVRGTLLVPCRT 1740

Query: 1741 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1800
            AMRGSFPLNGTYFQVNEMFADHESS  PIDVPR WLWNLPRRTVYFGTSVS+IFKGLVTE
Sbjct: 1741 AMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRTWLWNLPRRTVYFGTSVSSIFKGLVTE 1800

Query: 1801 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            EIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNGHTE
Sbjct: 1801 EIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGHTE 1833

BLAST of Tan0014193 vs. ExPASy TrEMBL
Match: A0A1S3BGJ5 (transcriptional activator DEMETER isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489408 PE=3 SV=1)

HSP 1 Score: 3013.8 bits (7812), Expect = 0.0e+00
Identity = 1554/1861 (83.50%), Positives = 1654/1861 (88.88%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQVNS GDFYAGNLL RNQN+Y  SRPS+NNS+AQHV  YGLPM+QPNYNLNP SMTQ
Sbjct: 1    MNSQVNSGGDFYAGNLLPRNQNIYSGSRPSTNNSFAQHVLTYGLPMFQPNYNLNPVSMTQ 60

Query: 61   MNQMSIFTNSIH-PPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPP 120
             NQ  IFTNS+H  PPVSSHLE+FA++ +STPSFLVRDE+S FR+D  DDFIRMFQDE P
Sbjct: 61   TNQ--IFTNSVHTTPPVSSHLESFAYNQVSTPSFLVRDENSCFRKDA-DDFIRMFQDEAP 120

Query: 121  RQHCDELLQSI--------------VESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQR 180
            RQHCDELLQSI              VESSCVGNSTPFK T DF KQ+DLEIDLNRTPEQR
Sbjct: 121  RQHCDELLQSIVESSCVSNSTPFNGVESSCVGNSTPFKGTKDFVKQKDLEIDLNRTPEQR 180

Query: 181  PPKRRQHTPMVFS-ERFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLT 240
            P KRRQHTPMVFS ERFTDLLNLPL  +LSLYEETQENFVT  LDEATQKRHDELLKDLT
Sbjct: 181  PAKRRQHTPMVFSGERFTDLLNLPLDGNLSLYEETQENFVTAALDEATQKRHDELLKDLT 240

Query: 241  DTLSAAISAPTPTKEVEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPK 300
            DTLSAAIS   PTKE+EKGSDQ IDLNKTP+QKTP+RRKHRPKVIKEGKPKKSPKPVTPK
Sbjct: 241  DTLSAAIS--EPTKEMEKGSDQAIDLNKTPDQKTPKRRKHRPKVIKEGKPKKSPKPVTPK 300

Query: 301  IPKETPSGKRKYVRKKNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQE 360
            I KETPSGKRKYVRKKNIKEA TPPAN+VEIKDS+TATKTKSCRRVI+FEMEKTGDEEQE
Sbjct: 301  ISKETPSGKRKYVRKKNIKEATTPPANVVEIKDSNTATKTKSCRRVIHFEMEKTGDEEQE 360

Query: 361  KKHNEKDVQEENMGNFCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQ 420
            KK NE DV EENMGNF  + RPNVPDFC+QS  VCGTS DVH S +L  MVAENV+PT+ 
Sbjct: 361  KKQNETDVVEENMGNFSFMMRPNVPDFCSQSTSVCGTSQDVHDSPQLRPMVAENVQPTIP 420

Query: 421  SNLAHMNHMTTSLTSQSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGY 480
            SN AHMNHM TS   QSER+AA  P NKS YN AE+ LNV RI+ QG+A+QYQ GFSNGY
Sbjct: 421  SNPAHMNHMMTSHILQSERDAAEVPLNKSGYNKAENWLNVLRILHQGRANQYQTGFSNGY 480

Query: 481  TPVQQHIRAEDMEQFANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTI 540
             PVQQ+IRAE+MEQFAN AKR+T +KE+MG+N  YSQT+PNHQSNINEARGSKRGRPLT 
Sbjct: 481  APVQQNIRAEEMEQFANQAKRNTYYKEVMGINSGYSQTVPNHQSNINEARGSKRGRPLTT 540

Query: 541  QPTPSCSITTLNSSVLCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTI 600
             PT  CSITTL+SS  CQEV Q GE  RQGS++NIG LE P KKFE GLYATL+KRY+TI
Sbjct: 541  HPTQPCSITTLDSSKSCQEVRQIGEFQRQGSNINIGSLENPGKKFEPGLYATLHKRYSTI 600

Query: 601  QANEGCPSHLNTSGCNPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIH 660
            Q+NE C SHLNT GCN  NSVGFT EMKQAMLN  HIRSNQ         KE IGDRHIH
Sbjct: 601  QSNEVCSSHLNTIGCNSTNSVGFTAEMKQAMLNGHHIRSNQIT------AKEIIGDRHIH 660

Query: 661  SVVHENNFQRRQISHNLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQG 720
            SVVHEN+FQR+Q+SHNLHP I+RT   +GLNKV SYRSL+TGDK N+++P+PH KA EQG
Sbjct: 661  SVVHENHFQRQQVSHNLHPAIERTSVASGLNKVASYRSLMTGDKRNMIQPFPHPKAPEQG 720

Query: 721  YAYRQSDNSMLTIRQACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKG 780
            YA RQSDNS+LT+RQA QPMISGSL TN+V K GYSFGF +FP KT  LLENEI+HK+K 
Sbjct: 721  YACRQSDNSILTVRQAYQPMISGSLATNEVHKQGYSFGFQKFPAKTTSLLENEILHKMKR 780

Query: 781  LNLNDDKGTTRTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGK 840
            L+LND + + RTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVD+DPETERIWNLLMGK
Sbjct: 781  LSLNDHEVSIRTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDIDPETERIWNLLMGK 840

Query: 841  EGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL 900
            EGSEGIE+HEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL
Sbjct: 841  EGSEGIESHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFL 900

Query: 901  TQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQV 960
            TQNVSDHLSSSAFMSLAARFP+KS SN+RTQG+VETS+VANESAAC+LYPA+SI W  Q 
Sbjct: 901  TQNVSDHLSSSAFMSLAARFPVKSASNLRTQGEVETSIVANESAACVLYPAESITWHVQE 960

Query: 961  LSLPRFEMPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARS 1020
            LS+PRFEMPQTSINHQN  V SGTE  FTE+G QIVEEEVISSQDSFDSTITQGT GARS
Sbjct: 961  LSVPRFEMPQTSINHQNRIVNSGTEKNFTELGGQIVEEEVISSQDSFDSTITQGTAGARS 1020

Query: 1021 CSGSNSDAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQ 1080
            CSGSNS+AEEPIVSYNSSSTH SNFTDIKQ ETT ++QKSFSDLNRSSV DEVSEHKHWQ
Sbjct: 1021 CSGSNSEAEEPIVSYNSSSTHYSNFTDIKQTETTATIQKSFSDLNRSSVSDEVSEHKHWQ 1080

Query: 1081 LSDGKQDSLT-EWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEG 1140
            L+DGKQ SLT EWNEID+L+GHSLINFLVNIENQ KQVP APSNNQLH+TPDC VLEVEG
Sbjct: 1081 LADGKQGSLTSEWNEIDDLSGHSLINFLVNIENQPKQVPDAPSNNQLHITPDCGVLEVEG 1140

Query: 1141 REAFSEESISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRME 1200
            REAFSEES SSGPSIVSGCSTEKNMT H LNIG  E+ LDK S EE   QARSQE  RME
Sbjct: 1141 REAFSEESTSSGPSIVSGCSTEKNMTFHRLNIGALEQRLDKTSAEE-NVQARSQETTRME 1200

Query: 1201 HSESISEHSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKM 1260
            HSES+SEHSVHLQGNGIQ  SHCEY LH+ YEPCERN TSP+ES SVTNP PELD PA  
Sbjct: 1201 HSESVSEHSVHLQGNGIQFSSHCEYNLHEKYEPCERNNTSPLESVSVTNPPPELDTPA-- 1260

Query: 1261 QQSALSNVVNATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEK 1320
            ++SA+SNVV+   HTEKLLPG  N INFSNNE HSLSQADNEGN +S SKAKRRKVNSEK
Sbjct: 1261 EKSAVSNVVHVHAHTEKLLPGKGNLINFSNNEAHSLSQADNEGN-ISPSKAKRRKVNSEK 1320

Query: 1321 KSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERI 1380
            K   DWD LRKQVEANGQIKEKGKDAMDSIDYEAIRLA+VHEIS+AIKERGMNNMLAERI
Sbjct: 1321 KGGNDWDSLRKQVEANGQIKEKGKDAMDSIDYEAIRLADVHEISNAIKERGMNNMLAERI 1380

Query: 1381 KEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN 1440
            KEFLNRLV DHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN
Sbjct: 1381 KEFLNRLVTDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTN 1440

Query: 1441 VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF 1500
            VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF
Sbjct: 1441 VGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITF 1500

Query: 1501 GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVS 1560
            GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEK IV STNP+A EKQP VV+
Sbjct: 1501 GKVFCTKSKPNCNACPMRGECKHFASAFASARLALPAPDEKGIVASTNPMATEKQPPVVT 1560

Query: 1561 NPLPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPT 1620
            NPLPILPPEGSTYTE+TL    CEPIVEVPAT  PEPEPNEITESDIED+FYEDPDEIPT
Sbjct: 1561 NPLPILPPEGSTYTENTLAPGNCEPIVEVPAT--PEPEPNEITESDIEDAFYEDPDEIPT 1620

Query: 1621 IKLSMEEFKTTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHP 1680
            IKLSMEEFKTTLQNYIPEGDMS+ALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHP
Sbjct: 1621 IKLSMEEFKTTLQNYIPEGDMSKALVALNPEAAFIPTPKLKNVSRLRTEHQVYELPDSHP 1680

Query: 1681 LLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREA 1740
            LL+E+DRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDP RLCNE TCFTCNSRREA
Sbjct: 1681 LLREMDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPNRLCNEITCFTCNSRREA 1740

Query: 1741 NSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVY 1800
            NSQTVRGTLL+PCRTAMRGSFPLNGTYFQVNEMFADHESS  PIDVPRKWLWNLPRRTVY
Sbjct: 1741 NSQTVRGTLLVPCRTAMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRKWLWNLPRRTVY 1800

Query: 1801 FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHT 1844
            FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNG T
Sbjct: 1801 FGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGQT 1844

BLAST of Tan0014193 vs. ExPASy TrEMBL
Match: A0A6J1GZH0 (transcriptional activator DEMETER-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458277 PE=3 SV=1)

HSP 1 Score: 3008.0 bits (7797), Expect = 0.0e+00
Identity = 1549/1846 (83.91%), Positives = 1647/1846 (89.22%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNSQ NSSGDFYAGNLL+RNQNLY  SRPSSN+SYAQHVR YGLPMYQPN+N NP SMTQ
Sbjct: 1    MNSQYNSSGDFYAGNLLLRNQNLYSGSRPSSNDSYAQHVRTYGLPMYQPNHN-NPVSMTQ 60

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMSIF NS H PPVSSHLENFA+D I+T SFL RDESSSFR+DGEDDFIRMFQ E PR
Sbjct: 61   ANQMSIFMNSAHTPPVSSHLENFAYDHIATSSFLARDESSSFRKDGEDDFIRMFQAEAPR 120

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            Q CDELLQSIVESSCVG STPFK T DFGKQRDLEIDLN+TPEQRPPKRRQHTPMVFS E
Sbjct: 121  QPCDELLQSIVESSCVGISTPFKGTKDFGKQRDLEIDLNKTPEQRPPKRRQHTPMVFSGE 180

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKE 240
             FTDLLNLPL E+LSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSA IS   PT E
Sbjct: 181  SFTDLLNLPLDENLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAGIS--EPTNE 240

Query: 241  VEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVRK 300
             EKGSDQVID +KT EQKTP+RRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR+
Sbjct: 241  AEKGSDQVID-HKTTEQKTPKRRKHRPKVIKEGKPKKSPKPVTPKISKETPSGKRKYVRR 300

Query: 301  KNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 360
            KNIKEA TPP NI+EIKDS+ A KTKSCRRVINFEMEKTGDEE+EK+ NEKD+Q ENMGN
Sbjct: 301  KNIKEAVTPPENIMEIKDSNPAAKTKSCRRVINFEMEKTGDEEREKERNEKDMQ-ENMGN 360

Query: 361  FCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLTS 420
             C ITR NVP F TQSNG+CGTSPDV  +HRL T+VAE+V+P++QS +A MNHM TS  S
Sbjct: 361  SCFITRSNVPGFSTQSNGICGTSPDVQDNHRLGTLVAESVQPSIQSYIARMNHMMTSHIS 420

Query: 421  QSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQF 480
            QSEREAA  P N S YN AE L NV RI+DQGK  QYQ GFSNGYTPV+Q+IRAE+ME+F
Sbjct: 421  QSEREAAESPLNSSGYNKAESLFNVLRILDQGKGYQYQTGFSNGYTPVEQNIRAEEMEKF 480

Query: 481  ANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSSV 540
            ++ AKR+T +KE+MG+N  YSQT+PNHQSNINEARGSKRG PLT QPT  CSITTL+SSV
Sbjct: 481  SSTAKRNTYYKEMMGINSAYSQTVPNHQSNINEARGSKRGCPLTAQPTQLCSITTLDSSV 540

Query: 541  LCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSGC 600
            LCQE LQTGE HR GSS N+G LEIP KKFESGLY+TL+KRY+TIQ NE C  HLNT+GC
Sbjct: 541  LCQEALQTGEFHRLGSSTNVGSLEIPGKKFESGLYSTLHKRYSTIQPNEDCSRHLNTTGC 600

Query: 601  NPINSVGFTTEMKQAMLNS-HIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
            +P  SVGFT EMKQAMLN  HIRSNQ  DRQ++WTKE IGD HIHSVVH NNFQRRQ+SH
Sbjct: 601  SPTISVGFTAEMKQAMLNGYHIRSNQITDRQSSWTKEIIGDGHIHSVVHGNNFQRRQVSH 660

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
            NLHPEI+R CET+GLN V S+ SLI  DKCN+L+P+PH KASEQ YA RQ +NS+LT+RQ
Sbjct: 661  NLHPEINRMCETSGLNTVNSHHSLIIRDKCNMLQPFPHPKASEQWYACRQPNNSILTVRQ 720

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
            ACQPMIS SL TN VQK GYSFG  QF  KT  LLE EI  KLK L+L DD+G TRTEQN
Sbjct: 721  ACQPMISSSLATN-VQKQGYSFGMQQFSPKTTSLLECEITRKLKSLSLKDDEGATRTEQN 780

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPY GNGAVVPYVE EYLRKRKARPRVDLDPETERIWNLLMGKEG EGIENHEKDKEK
Sbjct: 781  AIVPYNGNGAVVPYVEPEYLRKRKARPRVDLDPETERIWNLLMGKEGIEGIENHEKDKEK 840

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KSTSN RT  +VETS+VANE AACL YPADSIRW+ Q LS+P FEMPQTSI H
Sbjct: 901  LAARFPVKSTSNFRTPDEVETSIVANELAACLQYPADSIRWEGQELSVPSFEMPQTSIIH 960

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            QNHRV SGTE FFTE G QIVEEEVISSQ SFDSTITQGT GARSCSGSNS+AEEPIVS 
Sbjct: 961  QNHRVNSGTENFFTERGGQIVEEEVISSQGSFDSTITQGTAGARSCSGSNSEAEEPIVSN 1020

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NSSST  SNFTDIKQMETTT ++KSFSD NR+SVFDEVSEHKHWQL DGKQDSLT EWNE
Sbjct: 1021 NSSSTPYSNFTDIKQMETTTVIEKSFSDKNRTSVFDEVSEHKHWQLPDGKQDSLTSEWNE 1080

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            IDNL+GHSL NFLVNIENQ KQ+P APSNNQL MTPDC VLEVEGREAFSEESISSGPSI
Sbjct: 1081 IDNLSGHSLFNFLVNIENQQKQLPDAPSNNQLRMTPDCGVLEVEGREAFSEESISSGPSI 1140

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
            +SGCSTEKN TCHSLN  DP+R+ DKIS EE  R AR+QE  RMEHSES+SEHSVH QGN
Sbjct: 1141 ISGCSTEKNTTCHSLNTEDPDRSSDKISAEE-NRPARTQETTRMEHSESVSEHSVHRQGN 1200

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
            G QL S CEY LHD Y+P ERN TSP+ES SV+NP PELD PAK  +SALSNV++   HT
Sbjct: 1201 GTQLTSRCEYSLHDKYKPRERNNTSPLESASVSNPPPELDTPAK--KSALSNVLHVHAHT 1260

Query: 1261 EKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEA 1320
            EKLLPG  N INFSNNE HSLSQADNEGN +S SKAKRRKVNSEK SA+DWD LRKQVEA
Sbjct: 1261 EKLLPGKGNLINFSNNEAHSLSQADNEGN-ISPSKAKRRKVNSEKNSAIDWDSLRKQVEA 1320

Query: 1321 NGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSID 1380
            NGQIKEKGKDAMDSIDYEAIRLANV EISSAIKERGMNNMLAERIKEFLNRLV DHGSID
Sbjct: 1321 NGQIKEKGKDAMDSIDYEAIRLANVQEISSAIKERGMNNMLAERIKEFLNRLVTDHGSID 1380

Query: 1381 LEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440
            LEWLRDVPPD+AKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP
Sbjct: 1381 LEWLRDVPPDQAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQP 1440

Query: 1441 LPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500
            LPESLQLHLLELYPVLE+IQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC
Sbjct: 1441 LPESLQLHLLELYPVLETIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNAC 1500

Query: 1501 PMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGSTYTE 1560
            PMRGECKHFASAFASARLALPAPDEK IV STNP+A EKQP +V++ LPILPPE STYTE
Sbjct: 1501 PMRGECKHFASAFASARLALPAPDEKGIVASTNPIATEKQPPIVTSHLPILPPE-STYTE 1560

Query: 1561 STLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQNY 1620
            +TL TSKCEPIVEVPAT  PEPEPNE+TESDIED FYEDPDEIPTIKLSMEEFKTTLQNY
Sbjct: 1561 NTLETSKCEPIVEVPAT--PEPEPNEMTESDIEDLFYEDPDEIPTIKLSMEEFKTTLQNY 1620

Query: 1621 IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1680
            IPEGDMSRALVALN EAA IPTPKLKNVSRLRTEHQVYELPDSHPLL+E+D REPDDPSP
Sbjct: 1621 IPEGDMSRALVALNPEAAYIPTPKLKNVSRLRTEHQVYELPDSHPLLREMDTREPDDPSP 1680

Query: 1681 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1740
            YLLAIWTPGETA+SIQPPEQSCGSQDP RLC+EKTCFTCNSRREANSQTVRGTLL+PCRT
Sbjct: 1681 YLLAIWTPGETADSIQPPEQSCGSQDPDRLCDEKTCFTCNSRREANSQTVRGTLLVPCRT 1740

Query: 1741 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1800
            AMRGSFPLNGTYFQVNEMFADHESS  PIDVPR WLWNLPRRTVYFGTSVS+IFKGLVTE
Sbjct: 1741 AMRGSFPLNGTYFQVNEMFADHESSMKPIDVPRTWLWNLPRRTVYFGTSVSSIFKGLVTE 1800

Query: 1801 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            EIQQCFWRGFVCVRGF++KTRAPRPLIARLHFPASKLAK+KNGHTE
Sbjct: 1801 EIQQCFWRGFVCVRGFDQKTRAPRPLIARLHFPASKLAKVKNGHTE 1833

BLAST of Tan0014193 vs. ExPASy TrEMBL
Match: A0A6J1CCU9 (transcriptional activator DEMETER isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010054 PE=3 SV=1)

HSP 1 Score: 2971.8 bits (7703), Expect = 0.0e+00
Identity = 1524/1852 (82.29%), Positives = 1619/1852 (87.42%), Query Frame = 0

Query: 1    MNSQVNSSGDFYAGNLLVRNQNLYPSSRPSSNNSYAQHVRPYGLPMYQPNYNLNPASMTQ 60
            MNS+VNS GDFYA NLL+RNQ+LY  SRPSSN+SY QHVR YGLP+YQPNYNLNPA+ TQ
Sbjct: 1    MNSRVNSGGDFYAANLLLRNQHLYSGSRPSSNSSYPQHVRQYGLPIYQPNYNLNPATTTQ 60

Query: 61   MNQMSIFTNSIHPPPVSSHLENFAFDPISTPSFLVRDESSSFRRDGEDDFIRMFQDEPPR 120
             NQMS+FTNS+H   VSSHLEN  +D IS  SFLVRDESSSFR+DGEDDFIRMFQDE P 
Sbjct: 61   SNQMSVFTNSVHSSLVSSHLENLGYDHISASSFLVRDESSSFRKDGEDDFIRMFQDEAPH 120

Query: 121  QHCDELLQSIVESSCVGNSTPFKRTTDFGKQRDLEIDLNRTPEQRPPKRRQHTPMVFS-E 180
            QHCDELLQSIV+SSC GNSTPFK   D GKQRDLEIDLN+TPEQRP KRRQHTPM FS E
Sbjct: 121  QHCDELLQSIVDSSCAGNSTPFKGMKDSGKQRDLEIDLNKTPEQRPQKRRQHTPMAFSGE 180

Query: 181  RFTDLLNLPLAESLSLYEETQENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPT-PTK 240
            +FTDLLNLPL ESLSLYEETQENFV + LDEATQKRHDELLKD TDTLSAAISAP    K
Sbjct: 181  KFTDLLNLPLDESLSLYEETQENFVPILLDEATQKRHDELLKDFTDTLSAAISAPCGEMK 240

Query: 241  EVEKGSDQVIDLNKTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKIPKETPSGKRKYVR 300
            EVEKGSDQVIDLN TPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKI KETPSGKRKYVR
Sbjct: 241  EVEKGSDQVIDLNMTPEQKTPRRRKHRPKVIKEGKPKKSPKPVTPKITKETPSGKRKYVR 300

Query: 301  KKNIKEAATPPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMG 360
            KKNIKEAATPP +IVEIKD   ATKTKSCRRVI+FEMEKTGDE+QEKK NEKD Q++  G
Sbjct: 301  KKNIKEAATPPPDIVEIKDKHAATKTKSCRRVIHFEMEKTGDEDQEKKQNEKDTQDD-PG 360

Query: 361  NFCSITRPNVPDFCTQSNGVCGTSPDVHISHRLSTMVAENVRPTLQSNLAHMNHMTTSLT 420
            NFC ITRPNV DFCTQSNG C  S +VH SH   T+VAEN++PT+QSNLAHMNHM TS  
Sbjct: 361  NFCFITRPNVSDFCTQSNGFCEISSEVHSSHWHGTVVAENIQPTIQSNLAHMNHMMTSFI 420

Query: 421  SQSEREAAGGPFNKSAYNTAEDLLNVGRIIDQGKADQYQNGFSNGYTPVQQHIRAEDMEQ 480
            SQSEREAA    NKSAYN A DL NVGRI+DQGKADQYQNG S GYTPVQQH+RAE  EQ
Sbjct: 421  SQSEREAAKDSLNKSAYNKAVDLXNVGRILDQGKADQYQNGLSKGYTPVQQHVRAEGKEQ 480

Query: 481  FANHAKRSTSFKELMGMNFEYSQTIPNHQSNINEARGSKRGRPLTIQPTPSCSITTLNSS 540
            F N A+R+T +K L+ +N EYSQT+PNH SNINEARGSKRGRP T QPT S  I TL+SS
Sbjct: 481  FVNQAERNTYYKGLIELNSEYSQTVPNHPSNINEARGSKRGRPHTTQPTHSFPINTLDSS 540

Query: 541  VLCQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATLYKRYTTIQANEGCPSHLNTSG 600
            +LCQEVL+TGE HRQGSS+N GPLEIP KKFESGLYATLYKRY+T+ ANEGC SHL    
Sbjct: 541  LLCQEVLRTGECHRQGSSLNAGPLEIPGKKFESGLYATLYKRYSTLDANEGCSSHLIKRS 600

Query: 601  CNPINSVGFTTEMKQAMLNSHIRSNQSRDRQTNWTKETIGDRHIHSVVHENNFQRRQISH 660
             N  NSVGFT EMKQAMLN HIRS+Q+ DRQ NWTKE  GDR+++SVVH N FQR+QISH
Sbjct: 601  YNSTNSVGFTAEMKQAMLNGHIRSSQTTDRQNNWTKEVSGDRYVNSVVHGNKFQRQQISH 660

Query: 661  NLHPEIDRTCETTGLNKVTSYRSLITGDKCNVLRPYPHLKASEQGYAYRQSDNSMLTIRQ 720
             LHP+ID TCETTGLNKVTSYRSLITGD+CN L+P+PH KA EQGYA R S+        
Sbjct: 661  KLHPQIDGTCETTGLNKVTSYRSLITGDRCNELQPFPHPKAPEQGYACRYSNQ------- 720

Query: 721  ACQPMISGSLTTNQVQKLGYSFGFHQFPDKTAGLLENEIIHKLKGLNLNDDKGTTRTEQN 780
                              G SFGF QFPDKT GLLEN II KL+GLNLNDD  T+R EQN
Sbjct: 721  ------------------GNSFGFQQFPDKTKGLLENGIIRKLEGLNLNDDGRTSRPEQN 780

Query: 781  AIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840
            AIVPYKGNGAVVPYVES+Y+RKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK
Sbjct: 781  AIVPYKGNGAVVPYVESDYVRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEK 840

Query: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900
            WWEEERKVFRGRADSFIARMHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFMS
Sbjct: 841  WWEEERKVFRGRADSFIARMHLVQGDRRFSQWKGSVVDSVIGVFLTQNVSDHLSSSAFMS 900

Query: 901  LAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINH 960
            LAARFP+KS SN RTQ +V TS+V NESAACLLYP DSIRWD QV+S+PRF MPQTSINH
Sbjct: 901  LAARFPVKSFSNNRTQDEVGTSIVTNESAACLLYPVDSIRWDGQVVSVPRFPMPQTSINH 960

Query: 961  QNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSY 1020
            Q+HR   G+E  FTEV SQIVEEEVISSQDSFDSTITQGTGGARSCSGSNS+AEEP+VSY
Sbjct: 961  QSHRENLGSEKIFTEVRSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSEAEEPLVSY 1020

Query: 1021 NSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLT-EWNE 1080
            NSS+ H SNFT++KQ ETTT LQKSFS+LN +SVFDEVSEHKHWQL +GKQDSLT EWNE
Sbjct: 1021 NSSNIHYSNFTNVKQTETTTMLQKSFSNLNSNSVFDEVSEHKHWQLPNGKQDSLTSEWNE 1080

Query: 1081 IDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSI 1140
            ID+LNGHSL NFLVNIE Q KQV  APSNNQLHMTPDC  LEVEGREAFSEESISSGPSI
Sbjct: 1081 IDDLNGHSLFNFLVNIETQQKQVXGAPSNNQLHMTPDCGALEVEGREAFSEESISSGPSI 1140

Query: 1141 VSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGN 1200
             SGCSTEKNM+CHSLN+ D   TLDK S EE G  ARS E  RMEHSES+ EHSVHLQ N
Sbjct: 1141 ASGCSTEKNMSCHSLNVADLVGTLDKTSAEENG-HARSLETTRMEHSESVGEHSVHLQNN 1200

Query: 1201 GIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHT 1260
             IQ GSHCEY LHDNY+ CERNKTSPIES SVTNP  E+D PAKMQ+S LSNVV+   HT
Sbjct: 1201 CIQSGSHCEYGLHDNYDQCERNKTSPIESASVTNPPQEIDTPAKMQKSTLSNVVHVPVHT 1260

Query: 1261 EKLL------PGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDIL 1320
            EKLL      PG  NQINFSNNEVHSLS   NEG+ +S SKAKRRKVNSEKKSA+DWD L
Sbjct: 1261 EKLLDVEDRMPGKGNQINFSNNEVHSLSHGGNEGD-ISPSKAKRRKVNSEKKSAMDWDSL 1320

Query: 1321 RKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVK 1380
            RKQVE NGQ KE+ KDAMDSIDYEAIRL NVHEIS+AIKERGMNNMLAERIKEFLNRLV 
Sbjct: 1321 RKQVETNGQRKERSKDAMDSIDYEAIRLTNVHEISNAIKERGMNNMLAERIKEFLNRLVT 1380

Query: 1381 DHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLG 1440
            DHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLG
Sbjct: 1381 DHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLG 1440

Query: 1441 WVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSK 1500
            WVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSK
Sbjct: 1441 WVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSK 1500

Query: 1501 PNCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPE 1560
            PNCNACP+RGECKHFASAFASARLALPAPDEKRIVTSTNP+AMEKQP  V++ LP+LPP 
Sbjct: 1501 PNCNACPVRGECKHFASAFASARLALPAPDEKRIVTSTNPIAMEKQPGAVTSTLPMLPPA 1560

Query: 1561 GSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFK 1620
             ST TE+ LGTS CEPIVEVPAT  PEPEPNEITESDIEDSFYEDPDEIPTIKLS+EEF+
Sbjct: 1561 ASTCTENALGTSNCEPIVEVPAT--PEPEPNEITESDIEDSFYEDPDEIPTIKLSLEEFR 1620

Query: 1621 TTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRRE 1680
            TTLQNYIPEGDMS+ALVALN EAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKE+DRRE
Sbjct: 1621 TTLQNYIPEGDMSKALVALNPEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKEMDRRE 1680

Query: 1681 PDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTL 1740
            PDDPSPYLLAIWTPGETANSIQPPEQSCGSQDP RLCNE TCFTCNSRREANSQTVRGTL
Sbjct: 1681 PDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPNRLCNENTCFTCNSRREANSQTVRGTL 1740

Query: 1741 LIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIF 1800
            LIPCRTAMRGSFPLNGTYFQVNEMFADH SS  PIDVPR WLWNLPRRTVYFGTSVSTIF
Sbjct: 1741 LIPCRTAMRGSFPLNGTYFQVNEMFADHXSSLKPIDVPRNWLWNLPRRTVYFGTSVSTIF 1800

Query: 1801 KGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKLAKMKNGHTE 1844
            KGLVTEEIQQCFWRGFVCVRGF++K+RAPRPLIARLHFPASKLAKMKNGHTE
Sbjct: 1801 KGLVTEEIQQCFWRGFVCVRGFDQKSRAPRPLIARLHFPASKLAKMKNGHTE 1822

BLAST of Tan0014193 vs. TAIR 10
Match: AT5G04560.2 (HhH-GPD base excision DNA repair family protein )

HSP 1 Score: 1040.4 bits (2689), Expect = 1.7e-303
Identity = 758/1794 (42.25%), Positives = 982/1794 (54.74%), Query Frame = 0

Query: 201  ENFVTVPLDEATQKRHDELLKDLTDTLSAAISAPTPTKEVEKGSDQVI--DLNKTPEQK- 260
            E  VT    E  + + D+ ++ + D  S+A++A   T++ +     V+  DLNKTP+QK 
Sbjct: 240  EQIVTTTGHEIPEPKSDKSMQSIMD--SSAVNATEATEQNDGSRQDVLEFDLNKTPQQKP 299

Query: 261  TPRRRKHRPKVIKEGKPKKSP-KPV-TPKI-----PKETP----------SGKRKYVRKK 320
            + R+RK  PKV+ EGKPK+ P KP   PK+     PK  P          S +    +KK
Sbjct: 300  SKRKRKFMPKVVVEGKPKRKPRKPAELPKVVVEGKPKRKPRKAATQEKVKSKETGSAKKK 359

Query: 321  NIKEAAT-PPANIVEIKDSSTATKTKSCRRVINFEMEKTGDEEQEKKHNEKDVQEENMGN 380
            N+KE+AT  PAN+ ++ + S     KSCR+ +NF++E  GD  Q    +E         +
Sbjct: 360  NLKESATKKPANVGDMSNKSPEVTLKSCRKALNFDLENPGDARQGDSESEIVQNSSGANS 419

Query: 381  FCSI------TRPNVPDFCTQ---SNGVCGTSPDVHIS-----HRLST--MVAENVRPTL 440
            F  I      T  +  D  +Q   +NG+   +  + +S      +LST   +A + +P L
Sbjct: 420  FSEIRDAIGGTNGSFLDSVSQIDKTNGLGAMNQPLEVSMGNQPDKLSTGAKLARDQQPDL 479

Query: 441  -------QSNLAHMNHMTTSLTSQS----EREAAGGPFN----KSAYNTAEDLLNVGR-- 500
                   Q  +A  N        Q+    + +  G PF     +      +  L +G   
Sbjct: 480  LTRNQQCQFPVATQNTQFPMENQQAWLQMKNQLIGFPFGNQQPRMTIRNQQPCLAMGNQQ 539

Query: 501  ---IIDQGK-----ADQYQNGFSNGYTPVQQHIRAEDMEQFANHAKRSTSFKELMGMNFE 560
               +I   +      +Q   G      P+           F NH     +  +L G   +
Sbjct: 540  PMYLIGTPRPALVSGNQQLGGPQGNKRPI-----------FLNHQTCLPAGNQLYGSPTD 599

Query: 561  YSQTIPN-----------HQSNINEARGSKRGRPL-TIQPTPSCSITTLNSSVL------ 620
              Q + +           +Q   +  RG +   PL   QP      T LN  V       
Sbjct: 600  MHQLVMSTGGQQHGLLIKNQQPGSLIRGQQPCVPLIDQQPATPKGFTHLNQMVATSMSSP 659

Query: 621  -----CQEVLQTGESHRQGSSVNIGPLEIPAKKFESGLYATL---------YKRYTTIQA 680
                  Q  + T   H +  S  +       ++  +  Y +L         Y     I  
Sbjct: 660  GLRPHSQSQVPTTYLHVESVSRILNGTTGTCQRSRAPAYDSLQQDIHQGNKYILSHEISN 719

Query: 681  NEGCPSHLNTSGCNPINSVGFTTEMKQAMLNSHIRSNQSRDRQTNWTKETIG----DRHI 740
              GC   L  +   P   +    E + +    H    Q+     N  ++       +RH 
Sbjct: 720  GNGCKKALPQNSSLPTPIMAKLEEARGSKRQYHRAMGQTEKHDLNLAQQIAQSQDVERHN 779

Query: 741  HSVVHE------NNFQRRQISHNLH---PEIDRTCE--TTGLNKVTSYRSLITG------ 800
             S   E          ++ +  NLH   PE+    +  T G  K  +  S+  G      
Sbjct: 780  SSTCVEYLDAAKKTKIQKVVQENLHGMPPEVIEIEDDPTDGARKGKNTASISKGASKGNS 839

Query: 801  ---------DKCNVLR-PYPHLKASEQGYAYRQSDNSMLTIRQACQPM--ISGSLTTNQV 860
                     +KC V + P    +A  +      +  S + + Q   P   +S S    + 
Sbjct: 840  SPVKKTAEKEKCIVPKTPAKKGRAGRKKSVPPPAHASEIQLWQPTPPKTPLSRSKPKGKG 899

Query: 861  QKLGYSFGFHQFPDKTAGLLEN--EIIHKLKGLNLNDDKGTTRTEQNAIVPYKGNGAVVP 920
            +K     G  + P       ++  EII++++ L L D +     EQNA+V YKG+GA+VP
Sbjct: 900  RKSIQDSGKARGPSGELLCQDSIAEIIYRMQNLYLGDKE--REQEQNAMVLYKGDGALVP 959

Query: 921  YVESEYLRKRKARPRVDLDPETERIWNLLMGK-EGSEGIENHEKDKEKWWEEERKVFRGR 980
            Y ES   +KRK RP+VD+D ET RIWNLLMGK +  EG E  +K KEKWWEEER+VFRGR
Sbjct: 960  Y-ES---KKRKPRPKVDIDDETTRIWNLLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGR 1019

Query: 981  ADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSTSN 1040
            ADSFIARMHLVQGDRRFS WKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP K +S+
Sbjct: 1020 ADSFIARMHLVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSS 1079

Query: 1041 IRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINHQNHR---VKSGT 1100
               + +V  S+V  +   C+L   +   W  +V      E+       +        SG 
Sbjct: 1080 REDERNVR-SVVVEDPEGCILNLNEIPSWQEKVQHPSDMEVSGVDSGSKEQLRDCSNSGI 1139

Query: 1101 E-FFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSYNSSSTHCS 1160
            E F F E   Q +EEEV+SSQDSFD  I Q  G   SCS S SDAE P       +T C 
Sbjct: 1140 ERFNFLEKSIQNLEEEVLSSQDSFDPAIFQSCGRVGSCSCSKSDAEFP-------TTRCE 1199

Query: 1161 NFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDG---KQDSLTEWNEIDNL-- 1220
              T      T+ S+Q    +L+   +  + +E  H     G   KQ++     +  +L  
Sbjct: 1200 TKT---VSGTSQSVQTGSPNLS-DEICLQGNERPHLYEGSGDVQKQETTNVAQKKPDLEK 1259

Query: 1221 --NGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSIVS 1280
              N    + F     + + Q   + S  Q   T    VL++E      E    S  SI  
Sbjct: 1260 TMNWKDSVCFGQPRNDTNWQTTPSSSYEQC-ATRQPHVLDIEDFGMQGEGLGYSWMSISP 1319

Query: 1281 GCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSES-ISEH---SVHLQ 1340
                 KN                + +G+ I         + +  S S + EH   + H Q
Sbjct: 1320 RVDRVKNKNVPRRFFRQGGSVPREFTGQIIPSTPHELPGMGLSGSSSAVQEHQDDTQHNQ 1379

Query: 1341 GNGIQLGSHCEYRLHD---NYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVN 1400
             + +   SH +    D   + E C   ++S     ++T+     D  A+     LSN   
Sbjct: 1380 QDEMNKASHLQKTFLDLLNSSEECLTRQSS--TKQNITDGCLPRDRTAEDVVDPLSN--- 1439

Query: 1401 ATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILR 1460
              +  + +L     + N SN E  ++   +    ++   K     +   KK    WD LR
Sbjct: 1440 -NSSLQNILV----ESNSSNKEQTAVEYKETNATILREMKG---TLADGKKPTSQWDSLR 1499

Query: 1461 KQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKD 1520
            K VE N   +E+ K+ MDSIDYEAIR A++ EIS AIKERGMNNMLA RIK+FL R+VKD
Sbjct: 1500 KDVEGNEGRQERNKNNMDSIDYEAIRRASISEISEAIKERGMNNMLAVRIKDFLERIVKD 1559

Query: 1521 HGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGW 1580
            HG IDLEWLR+ PPDKAKDYLLS+RGLGLKSVECVRLLTLH+LAFPVDTNVGRIAVR+GW
Sbjct: 1560 HGGIDLEWLRESPPDKAKDYLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGW 1619

Query: 1581 VPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKP 1640
            VPLQPLPESLQLHLLELYPVLESIQK+LWPRLCKLDQRTLYELHYQLITFGKVFCTKS+P
Sbjct: 1620 VPLQPLPESLQLHLLELYPVLESIQKFLWPRLCKLDQRTLYELHYQLITFGKVFCTKSRP 1679

Query: 1641 NCNACPMRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPI-LPPE 1700
            NCNACPMRGEC+HFASA+ASARLALPAP+E+ + ++T PV  E  P V    + + LP E
Sbjct: 1680 NCNACPMRGECRHFASAYASARLALPAPEERSLTSATIPVPPESYPPVAIPMIELPLPLE 1739

Query: 1701 GSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFY-EDPDEIPTIKLSMEEF 1760
             S  + +      CEPI+E PA+  P  E  EITESDIED++Y EDPDEIPTIKL++E+F
Sbjct: 1740 KSLASGAPSNRENCEPIIEEPAS--PGQECTEITESDIEDAYYNEDPDEIPTIKLNIEQF 1799

Query: 1761 KTTLQNY------IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLL 1820
              TL+ +      + EGDMS+ALVAL+    SIPTPKLKN+SRLRTEHQVYELPDSH LL
Sbjct: 1800 GMTLREHMERNMELQEGDMSKALVALHPTTTSIPTPKLKNISRLRTEHQVYELPDSHRLL 1859

Query: 1821 KELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANS 1839
              +D+REPDDPSPYLLAIWTPGETANS QPPEQ CG +  G++C ++TC  CNS REANS
Sbjct: 1860 DGMDKREPDDPSPYLLAIWTPGETANSAQPPEQKCGGKASGKMCFDETCSECNSLREANS 1919

BLAST of Tan0014193 vs. TAIR 10
Match: AT5G04560.1 (HhH-GPD base excision DNA repair family protein )

HSP 1 Score: 1035.8 bits (2677), Expect = 4.2e-302
Identity = 752/1767 (42.56%), Positives = 970/1767 (54.90%), Query Frame = 0

Query: 228  SAAISAPTPTKEVEKGSDQVI--DLNKTPEQK-TPRRRKHRPKVIKEGKPKKSP-KPV-T 287
            S+A++A   T++ +     V+  DLNKTP+QK + R+RK  PKV+ EGKPK+ P KP   
Sbjct: 7    SSAVNATEATEQNDGSRQDVLEFDLNKTPQQKPSKRKRKFMPKVVVEGKPKRKPRKPAEL 66

Query: 288  PKI-----PKETP----------SGKRKYVRKKNIKEAAT-PPANIVEIKDSSTATKTKS 347
            PK+     PK  P          S +    +KKN+KE+AT  PAN+ ++ + S     KS
Sbjct: 67   PKVVVEGKPKRKPRKAATQEKVKSKETGSAKKKNLKESATKKPANVGDMSNKSPEVTLKS 126

Query: 348  CRRVINFEMEKTGDEEQEKKHNEKDVQEENMGNFCSI------TRPNVPDFCTQ---SNG 407
            CR+ +NF++E  GD  Q    +E         +F  I      T  +  D  +Q   +NG
Sbjct: 127  CRKALNFDLENPGDARQGDSESEIVQNSSGANSFSEIRDAIGGTNGSFLDSVSQIDKTNG 186

Query: 408  VCGTSPDVHIS-----HRLST--MVAENVRPTL-------QSNLAHMNHMTTSLTSQS-- 467
            +   +  + +S      +LST   +A + +P L       Q  +A  N        Q+  
Sbjct: 187  LGAMNQPLEVSMGNQPDKLSTGAKLARDQQPDLLTRNQQCQFPVATQNTQFPMENQQAWL 246

Query: 468  --EREAAGGPFN----KSAYNTAEDLLNVGR-----IIDQGK-----ADQYQNGFSNGYT 527
              + +  G PF     +      +  L +G      +I   +      +Q   G      
Sbjct: 247  QMKNQLIGFPFGNQQPRMTIRNQQPCLAMGNQQPMYLIGTPRPALVSGNQQLGGPQGNKR 306

Query: 528  PVQQHIRAEDMEQFANHAKRSTSFKELMGMNFEYSQTIPN-----------HQSNINEAR 587
            P+           F NH     +  +L G   +  Q + +           +Q   +  R
Sbjct: 307  PI-----------FLNHQTCLPAGNQLYGSPTDMHQLVMSTGGQQHGLLIKNQQPGSLIR 366

Query: 588  GSKRGRPL-TIQPTPSCSITTLNSSVL-----------CQEVLQTGESHRQGSSVNIGPL 647
            G +   PL   QP      T LN  V             Q  + T   H +  S  +   
Sbjct: 367  GQQPCVPLIDQQPATPKGFTHLNQMVATSMSSPGLRPHSQSQVPTTYLHVESVSRILNGT 426

Query: 648  EIPAKKFESGLYATL---------YKRYTTIQANEGCPSHLNTSGCNPINSVGFTTEMKQ 707
                ++  +  Y +L         Y     I    GC   L  +   P   +    E + 
Sbjct: 427  TGTCQRSRAPAYDSLQQDIHQGNKYILSHEISNGNGCKKALPQNSSLPTPIMAKLEEARG 486

Query: 708  AMLNSHIRSNQSRDRQTNWTKETIG----DRHIHSVVHE------NNFQRRQISHNLH-- 767
            +    H    Q+     N  ++       +RH  S   E          ++ +  NLH  
Sbjct: 487  SKRQYHRAMGQTEKHDLNLAQQIAQSQDVERHNSSTCVEYLDAAKKTKIQKVVQENLHGM 546

Query: 768  -PEIDRTCE--TTGLNKVTSYRSLITG---------------DKCNVLR-PYPHLKASEQ 827
             PE+    +  T G  K  +  S+  G               +KC V + P    +A  +
Sbjct: 547  PPEVIEIEDDPTDGARKGKNTASISKGASKGNSSPVKKTAEKEKCIVPKTPAKKGRAGRK 606

Query: 828  GYAYRQSDNSMLTIRQACQPM--ISGSLTTNQVQKLGYSFGFHQFPDKTAGLLEN--EII 887
                  +  S + + Q   P   +S S    + +K     G  + P       ++  EII
Sbjct: 607  KSVPPPAHASEIQLWQPTPPKTPLSRSKPKGKGRKSIQDSGKARGPSGELLCQDSIAEII 666

Query: 888  HKLKGLNLNDDKGTTRTEQNAIVPYKGNGAVVPYVESEYLRKRKARPRVDLDPETERIWN 947
            ++++ L L D +     EQNA+V YKG+GA+VPY ES   +KRK RP+VD+D ET RIWN
Sbjct: 667  YRMQNLYLGDKE--REQEQNAMVLYKGDGALVPY-ES---KKRKPRPKVDIDDETTRIWN 726

Query: 948  LLMGK-EGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDS 1007
            LLMGK +  EG E  +K KEKWWEEER+VFRGRADSFIARMHLVQGDRRFS WKGSVVDS
Sbjct: 727  LLMGKGDEKEGDEEKDKKKEKWWEEERRVFRGRADSFIARMHLVQGDRRFSPWKGSVVDS 786

Query: 1008 VIGVFLTQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSI 1067
            VIGVFLTQNVSDHLSSSAFMSLAARFP K +S+   + +V  S+V  +   C+L   +  
Sbjct: 787  VIGVFLTQNVSDHLSSSAFMSLAARFPPKLSSSREDERNVR-SVVVEDPEGCILNLNEIP 846

Query: 1068 RWDSQVLSLPRFEMPQTSINHQNHR---VKSGTE-FFFTEVGSQIVEEEVISSQDSFDST 1127
             W  +V      E+       +        SG E F F E   Q +EEEV+SSQDSFD  
Sbjct: 847  SWQEKVQHPSDMEVSGVDSGSKEQLRDCSNSGIERFNFLEKSIQNLEEEVLSSQDSFDPA 906

Query: 1128 ITQGTGGARSCSGSNSDAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVF 1187
            I Q  G   SCS S SDAE P       +T C   T      T+ S+Q    +L+   + 
Sbjct: 907  IFQSCGRVGSCSCSKSDAEFP-------TTRCETKT---VSGTSQSVQTGSPNLS-DEIC 966

Query: 1188 DEVSEHKHWQLSDG---KQDSLTEWNEIDNL----NGHSLINFLVNIENQHKQVPVAPSN 1247
             + +E  H     G   KQ++     +  +L    N    + F     + + Q   + S 
Sbjct: 967  LQGNERPHLYEGSGDVQKQETTNVAQKKPDLEKTMNWKDSVCFGQPRNDTNWQTTPSSSY 1026

Query: 1248 NQLHMTPDCRVLEVEGREAFSEESISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISG 1307
             Q   T    VL++E      E    S  SI       KN                + +G
Sbjct: 1027 EQC-ATRQPHVLDIEDFGMQGEGLGYSWMSISPRVDRVKNKNVPRRFFRQGGSVPREFTG 1086

Query: 1308 EEIGRQARSQERIRMEHSES-ISEH---SVHLQGNGIQLGSHCEYRLHD---NYEPCERN 1367
            + I         + +  S S + EH   + H Q + +   SH +    D   + E C   
Sbjct: 1087 QIIPSTPHELPGMGLSGSSSAVQEHQDDTQHNQQDEMNKASHLQKTFLDLLNSSEECLTR 1146

Query: 1368 KTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHTEKLLPGNDNQINFSNNEVHSLS 1427
            ++S     ++T+     D  A+     LSN     +  + +L     + N SN E  ++ 
Sbjct: 1147 QSS--TKQNITDGCLPRDRTAEDVVDPLSN----NSSLQNILV----ESNSSNKEQTAVE 1206

Query: 1428 QADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEAIRL 1487
              +    ++   K     +   KK    WD LRK VE N   +E+ K+ MDSIDYEAIR 
Sbjct: 1207 YKETNATILREMKG---TLADGKKPTSQWDSLRKDVEGNEGRQERNKNNMDSIDYEAIRR 1266

Query: 1488 ANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGL 1547
            A++ EIS AIKERGMNNMLA RIK+FL R+VKDHG IDLEWLR+ PPDKAKDYLLS+RGL
Sbjct: 1267 ASISEISEAIKERGMNNMLAVRIKDFLERIVKDHGGIDLEWLRESPPDKAKDYLLSIRGL 1326

Query: 1548 GLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQKY 1607
            GLKSVECVRLLTLH+LAFPVDTNVGRIAVR+GWVPLQPLPESLQLHLLELYPVLESIQK+
Sbjct: 1327 GLKSVECVRLLTLHNLAFPVDTNVGRIAVRMGWVPLQPLPESLQLHLLELYPVLESIQKF 1386

Query: 1608 LWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLALPA 1667
            LWPRLCKLDQRTLYELHYQLITFGKVFCTKS+PNCNACPMRGEC+HFASA+ASARLALPA
Sbjct: 1387 LWPRLCKLDQRTLYELHYQLITFGKVFCTKSRPNCNACPMRGECRHFASAYASARLALPA 1446

Query: 1668 PDEKRIVTSTNPVAMEKQPAVVSNPLPI-LPPEGSTYTESTLGTSKCEPIVEVPATPEPE 1727
            P+E+ + ++T PV  E  P V    + + LP E S  + +      CEPI+E PA+  P 
Sbjct: 1447 PEERSLTSATIPVPPESYPPVAIPMIELPLPLEKSLASGAPSNRENCEPIIEEPAS--PG 1506

Query: 1728 PEPNEITESDIEDSFY-EDPDEIPTIKLSMEEFKTTLQNY------IPEGDMSRALVALN 1787
             E  EITESDIED++Y EDPDEIPTIKL++E+F  TL+ +      + EGDMS+ALVAL+
Sbjct: 1507 QECTEITESDIEDAYYNEDPDEIPTIKLNIEQFGMTLREHMERNMELQEGDMSKALVALH 1566

Query: 1788 QEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSPYLLAIWTPGETANS 1839
                SIPTPKLKN+SRLRTEHQVYELPDSH LL  +D+REPDDPSPYLLAIWTPGETANS
Sbjct: 1567 PTTTSIPTPKLKNISRLRTEHQVYELPDSHRLLDGMDKREPDDPSPYLLAIWTPGETANS 1626

BLAST of Tan0014193 vs. TAIR 10
Match: AT2G36490.1 (demeter-like 1 )

HSP 1 Score: 881.7 bits (2277), Expect = 1.0e-255
Identity = 542/1125 (48.18%), Positives = 679/1125 (60.36%), Query Frame = 0

Query: 744  QFPDKTAGLLENE----------IIHKLKGLNLNDDKGTTRTEQNAIVPYK--------- 803
            +FP    GL  +E          I   L+ L++N +   T     A+VPY          
Sbjct: 454  RFPPSFTGLSPDELWKRRNSIETISELLRLLDINREHSET-----ALVPYTMNSQIVLFG 513

Query: 804  -GNGAVVPYVESEYLRKRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEKWWEEE 863
             G GA+VP      ++K + RP+VDLD ET+R+W LL+    SEG++  ++ K KWWEEE
Sbjct: 514  GGAGAIVPVTP---VKKPRPRPKVDLDDETDRVWKLLLENINSEGVDGSDEQKAKWWEEE 573

Query: 864  RKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 923
            R VFRGRADSFIARMHLVQGDRRF+ WKGSVVDSV+GVFLTQNVSDHLSSSAFMSLA++F
Sbjct: 574  RNVFRGRADSFIARMHLVQGDRRFTPWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLASQF 633

Query: 924  PLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRFEMPQTSINHQNHRV 983
            P+                              S  +D+   S+P  ++            
Sbjct: 634  PVPF--------------------------VPSSNFDAGTSSMPSIQI------------ 693

Query: 984  KSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNSDAEEPIVSYNSSST 1043
                    T + S   EE + S  D   S++T           +  D E+  V  N +S 
Sbjct: 694  --------TYLDS---EETMSSPPDHNHSSVT--------LKNTQPDEEKDYVPSNETSR 753

Query: 1044 HCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQDSLTEWNEIDNLNG 1103
              S        E   S  +S          D+ ++ K +  SD K  S+    E+D  + 
Sbjct: 754  SSS--------EIAISAHES---------VDKTTDSKEYVDSDRKGSSV----EVDKTD- 813

Query: 1104 HSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEESISSGPSIVSGCST 1163
                                           CRVL +     F  E              
Sbjct: 814  -----------------------------EKCRVLNL-----FPSE-------------- 873

Query: 1164 EKNMTC-HSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISEHSVHLQGNGIQLG 1223
            +  +TC HS+    P+ T           +A S   I +E  E  +     LQG  + L 
Sbjct: 874  DSALTCQHSMVSDAPQNT----------ERAGSSSEIDLE-GEYRTSFMKLLQGVQVSLE 933

Query: 1224 SHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSNVVNATTHTEKLLP 1283
                          + N+ SP  + S  + S E+     M++   S+V            
Sbjct: 934  --------------DSNQVSP--NMSPGDCSSEIKGFQSMKEPTKSSV------------ 993

Query: 1284 GNDNQINFSNNEVHSLSQADNEGNVVS----TSKAKRRKVNSEKKSAVDWDILRKQVEAN 1343
                     ++E    SQ D  G+V+S    T K K +KV  E+K A DWD LR++ +A 
Sbjct: 994  --------DSSEPGCCSQQD--GDVLSCQKPTLKEKGKKVLKEEKKAFDWDCLRREAQAR 1053

Query: 1344 GQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSIDL 1403
              I+EK +  MD++D++AIR A+V E++  IK RGMN+ LAERI+ FL+RLV DHGSIDL
Sbjct: 1054 AGIREKTRSTMDTVDWKAIRAADVKEVAETIKSRGMNHKLAERIQGFLDRLVNDHGSIDL 1113

Query: 1404 EWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPL 1463
            EWLRDVPPDKAK+YLLS  GLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPL
Sbjct: 1114 EWLRDVPPDKAKEYLLSFNGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPL 1173

Query: 1464 PESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACP 1523
            PESLQLHLLE+YP+LESIQKYLWPRLCKLDQ+TLYELHYQ+ITFGKVFCTKSKPNCNACP
Sbjct: 1174 PESLQLHLLEMYPMLESIQKYLWPRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACP 1233

Query: 1524 MRGECKHFASAFASARLALPAPDEKRIVTSTNPVAMEKQPAVVSNPLPILPPEGS---TY 1583
            M+GEC+HFASAFASARLALP+  EK + T       +K P  +  P P    +GS    +
Sbjct: 1234 MKGECRHFASAFASARLALPS-TEKGMGTP------DKNPLPLHLPEPFQREQGSEVVQH 1293

Query: 1584 TESTLGTSKCEPIVEVPATPEPEPEPNEITESDIEDSFYEDPDEIPTIKLSMEEFKTTLQ 1643
            +E     + CEPI+E PA+PEPE    E++ +DIE++F+EDP+EIPTI+L+M+ F + L+
Sbjct: 1294 SEPAKKVTCCEPIIEEPASPEPETA--EVSIADIEEAFFEDPEEIPTIRLNMDAFTSNLK 1353

Query: 1644 NY------IPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEHQVYELPDSHPLLKELDR 1703
                    + +G+MS ALVAL  E AS+P PKLKN+S+LRTEH+VYELPD HPLL +L++
Sbjct: 1354 KIMEHNKELQDGNMSSALVALTAETASLPMPKLKNISQLRTEHRVYELPDEHPLLAQLEK 1385

Query: 1704 REPDDPSPYLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRG 1763
            REPDDP  YLLAIWTPGETA+SIQP   +C  Q  G LC+E+TCF+CNS +E  SQ VRG
Sbjct: 1414 REPDDPCSYLLAIWTPGETADSIQPSVSTCIFQANGMLCDEETCFSCNSIKETRSQIVRG 1385

Query: 1764 TLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVST 1823
            T+LIPCRTAMRGSFPLNGTYFQVNE+FADH SS NPI+VPR+ +W LPRRTVYFGTSV T
Sbjct: 1474 TILIPCRTAMRGSFPLNGTYFQVNEVFADHASSLNPINVPRELIWELPRRTVYFGTSVPT 1385

Query: 1824 IFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLHFPASKL 1835
            IFKGL TE+IQ CFW+G+VCVRGF+RKTR P+PLIARLHFPASKL
Sbjct: 1534 IFKGLSTEKIQACFWKGYVCVRGFDRKTRGPKPLIARLHFPASKL 1385


HSP 2 Score: 58.5 bits (140), Expect = 6.4e-08
Identity = 63/181 (34.81%), Positives = 98/181 (54.14%), Query Frame = 0

Query: 223 LTDTLSAAISAPTP-----TKEVEKGSDQVIDLN-----------KTPEQKTPRRRKHRP 282
           L +T S   S  TP     T+ ++KG+++V  L+           KTPE+  P+R+KHRP
Sbjct: 67  LANTASLIFSGQTPIPTRNTEVMQKGTEEVESLSSVSNNVAEQILKTPEK--PKRKKHRP 126

Query: 283 KVIKEGKPKKSPKPVTPKIP-----KETPSGKRKYVRKK---NIKEAATPPANIVEIKDS 342
           KV +E KPK+ PKP  P+       +E+ + KRKYVRKK   +  + ATP  +   + ++
Sbjct: 127 KVRREAKPKREPKPRAPRKSVVTDGQESKTPKRKYVRKKVEVSKDQDATPVESSAAV-ET 186

Query: 343 STATKTKSCRRVINFEME----KTGDEEQEKKHNEKDVQEENM--GN----FCSITRPNV 370
           ST  K + CRRV++FE E    +T  + +E    E  +QE+ +  GN     C ++ P+ 
Sbjct: 187 STRPK-RLCRRVLDFEAENGENQTNGDIREAGEMESALQEKQLDSGNQELKDCLLSAPST 243

BLAST of Tan0014193 vs. TAIR 10
Match: AT3G10010.1 (demeter-like 2 )

HSP 1 Score: 719.9 bits (1857), Expect = 5.1e-207
Identity = 456/1077 (42.34%), Positives = 585/1077 (54.32%), Query Frame = 0

Query: 776  EQNAIVPYKGNGAVVPYVESEYLRK------RKARPRVDLDPETERIWNLLMGKEGSEGI 835
            ++   +P+    A++ Y +S   +K      +K +P+V LDPET R+W LLM     +G+
Sbjct: 461  KEGLCLPHNRETALILYKKSYEEQKAIVKYSKKQKPKVQLDPETSRVWKLLMSSIDCDGV 520

Query: 836  ENHEKDKEKWWEEERKVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSD 895
            +  +++K KWWEEER +F GRA+SFIARM +VQG+R FS WKGSVVDSV+GVFLTQNV+D
Sbjct: 521  DGSDEEKRKWWEEERNMFHGRANSFIARMRVVQGNRTFSPWKGSVVDSVVGVFLTQNVAD 580

Query: 896  HLSSSAFMSLAARFPLKSTSNIRTQGDVETSMVANESAACLLYPADSIRWDSQVLSLPRF 955
            H SSSA+M LAA FP++   N  +  +   S V  E+                +L+L   
Sbjct: 581  HSSSSAYMDLAAEFPVEWNFNKGSCHEEWGSSVTQET----------------ILNLD-- 640

Query: 956  EMPQTSINHQNHRVKSGTEFFFTEVGSQIVEEEVISSQDSFDSTITQGTGGARSCSGSNS 1015
              P+T ++    R+++ T         +++ EE+   ++  D+                 
Sbjct: 641  --PRTGVS--TPRIRNPT---------RVIIEEIDDDENDIDA----------------- 700

Query: 1016 DAEEPIVSYNSSSTHCSNFTDIKQMETTTSLQKSFSDLNRSSVFDEVSEHKHWQLSDGKQ 1075
                 + S  SS T  S+ T   Q  + T L   F+ +  +   D        Q+  GK 
Sbjct: 701  -----VCSQESSKTSDSSITSADQ--SKTMLLDPFNTVLMNEQVDS-------QMVKGK- 760

Query: 1076 DSLTEWNEIDNLNGHSLINFLVNIENQHKQVPVAPSNNQLHMTPDCRVLEVEGREAFSEE 1135
                         GH               +P     N L                    
Sbjct: 761  -------------GH---------------IPYTDDLNDL-------------------- 820

Query: 1136 SISSGPSIVSGCSTEKNMTCHSLNIGDPERTLDKISGEEIGRQARSQERIRMEHSESISE 1195
              S G S+VS  ST                                              
Sbjct: 821  --SQGISMVSSAST---------------------------------------------- 880

Query: 1196 HSVHLQGNGIQLGSHCEYRLHDNYEPCERNKTSPIESTSVTNPSPELDAPAKMQQSALSN 1255
                          HCE  L         N+  P          PE     + QQ     
Sbjct: 881  --------------HCELNL---------NEVPPEVELCSHQQDPESTIQTQDQQE---- 940

Query: 1256 VVNATTHTEKLLPGNDNQINFSNNEVHSLSQADNEGNVVSTSKAKRRKVNSEK---KSAV 1315
                +T TE +                            +TSK K++   S K   K +V
Sbjct: 941  ----STRTEDVKKNRKKP---------------------TTSKPKKKSKESAKSTQKKSV 1000

Query: 1316 DWDILRKQVEANGQIKEKGKDAMDSIDYEAIRLANVHEISSAIKERGMNNMLAERIKEFL 1375
            DWD LRK+ E+ G+ +E+ +  MD++D++A+R  +VH+I++ I +RGMNNMLAERIK FL
Sbjct: 1001 DWDSLRKEAESGGRKRERTERTMDTVDWDALRCTDVHKIANIIIKRGMNNMLAERIKAFL 1060

Query: 1376 NRLVKDHGSIDLEWLRDVPPDKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRI 1435
            NRLVK HGSIDLEWLRDVPPDKAK+YLLS+ GLGLKSVECVRLL+LH +AFPVDTNVGRI
Sbjct: 1061 NRLVKKHGSIDLEWLRDVPPDKAKEYLLSINGLGLKSVECVRLLSLHQIAFPVDTNVGRI 1120

Query: 1436 AVRLGWVPLQPLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVF 1495
            AVRLGWVPLQPLP+ LQ+HLLELYPVLES+QKYLWPRLCKLDQ+TLYELHY +ITFGKVF
Sbjct: 1121 AVRLGWVPLQPLPDELQMHLLELYPVLESVQKYLWPRLCKLDQKTLYELHYHMITFGKVF 1180

Query: 1496 CTKSKPNCNACPMRGECKHFASAFASARLALPAPDEK-RIVTSTNPVAMEKQPAVVS-NP 1555
            CTK KPNCNACPM+ EC+H++SA ASARLALP P+E  R     +    +++P VV+  P
Sbjct: 1181 CTKVKPNCNACPMKAECRHYSSARASARLALPEPEESDRTSVMIHERRSKRKPVVVNFRP 1240

Query: 1556 LPILPPEGSTYTESTLGTSKCEPIVEVPATPEPEPEPNEITESDIED------------S 1615
               L  E     +    +  CEPI+E PA+PEP     E  E DIED             
Sbjct: 1241 SLFLYQEKE---QEAQRSQNCEPIIEEPASPEP-----EYIEHDIEDYPRDKNNVGTSED 1300

Query: 1616 FYEDPDEIPTIKLSMEEFKTTLQNYIPEGDMSRALVALNQEAASIPTPKLKNVSRLRTEH 1675
             +E+ D IPTI L+ E   +       E   S  LV L+  AA+IP  KLK   +LRTEH
Sbjct: 1301 PWENKDVIPTIILNKEAGTSHDLVVNKEAGTSHDLVVLSTYAAAIPRRKLKIKEKLRTEH 1318

Query: 1676 QVYELPDSHPLLKELDRREPDDPSPYLLAIWTPGETANSIQPPEQSCG-SQDPGRLCNEK 1735
             V+ELPD H +L+  +RRE +D  PYLLAIWTPGET NSIQPP+Q C   +    LCNE 
Sbjct: 1361 HVFELPDHHSILEGFERREAEDIVPYLLAIWTPGETVNSIQPPKQRCALFESNNTLCNEN 1318

Query: 1736 TCFTCNSRREANSQTVRGTLLIPCRTAMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRK 1795
             CF CN  RE  SQTVRGT+LIPCRTAMRG FPLNGTYFQ NE+FADH+SS NPIDVP +
Sbjct: 1421 KCFQCNKTREEESQTVRGTILIPCRTAMRGGFPLNGTYFQTNEVFADHDSSINPIDVPTE 1318

Query: 1796 WLWNLPRRTVYFGTSVSTIFKGLVTEEIQQCFWRGFVCVRGFERKTRAPRPLIARLH 1829
             +W+L RR  Y G+SVS+I KGL  E I+  F  G+VCVRGF+R+ R P+ L+ RLH
Sbjct: 1481 LIWDLKRRVAYLGSSVSSICKGLSVEAIKYNFQEGYVCVRGFDRENRKPKSLVKRLH 1318

BLAST of Tan0014193 vs. TAIR 10
Match: AT4G34060.1 (demeter-like protein 3 )

HSP 1 Score: 499.2 bits (1284), Expect = 1.4e-140
Identity = 278/573 (48.52%), Positives = 372/573 (64.92%), Query Frame = 0

Query: 1277 SLSQADNEGNVVSTSKAKRRKVNSEKKSAVDWDILRKQVEANGQIKEKGKDAMDSIDYEA 1336
            S+S+ ++  N   T+K K  K    +   VDW+ LR+     G   E     MDS+++  
Sbjct: 474  SISKVEDHEN---TAKRKNEKTGIIEDEIVDWNNLRRMYTKEGSRPEM---HMDSVNWSD 533

Query: 1337 IRLANVHEISSAIKERGMNNMLAERIKEFLNRLVKDHGSIDLEWLRDVPPDKAKDYLLSV 1396
            +RL+  + + + IK+RG   +L+ERI +FLN  V  +G+IDLEWLR+ P    K YLL +
Sbjct: 534  VRLSGQNVLETTIKKRGQFRILSERILKFLNDEVNQNGNIDLEWLRNAPSHLVKRYLLEI 593

Query: 1397 RGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESI 1456
             G+GLKS ECVRLL L H AFPVDTNVGRIAVRLG VPL+PLP  +Q+H L  YP ++SI
Sbjct: 594  EGIGLKSAECVRLLGLKHHAFPVDTNVGRIAVRLGLVPLEPLPNGVQMHQLFEYPSMDSI 653

Query: 1457 QKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECKHFASAFASARLA 1516
            QKYLWPRLCKL Q TLYELHYQ+ITFGKVFCTK+ PNCNACPM+ ECK+FASA+ S+++ 
Sbjct: 654  QKYLWPRLCKLPQETLYELHYQMITFGKVFCTKTIPNCNACPMKSECKYFASAYVSSKVL 713

Query: 1517 LPAPDEKRIVTSTNPVAMEKQPAV--VSNPLPILPPEGSTYTESTLGTSKC-EPIVEVPA 1576
            L +P+EK    +T   A  +  AV   SN   +     S  ++  +    C +P+VE P+
Sbjct: 714  LESPEEKMHEPNTFMNAHSQDVAVDMTSNINLVEECVSSGCSDQAI----CYKPLVEFPS 773

Query: 1577 TPEPE-PEPNEITESDIED----SFYEDPDEIPTIKLSMEEFKTTLQNYI--------PE 1636
            +P  E PE      +DIED    + Y+    +P I   ++  K ++++ +         +
Sbjct: 774  SPRAEIPE-----STDIEDVPFMNLYQSYASVPKIDFDLDALKKSVEDALVISGRMSSSD 833

Query: 1637 GDMSRALVALNQEAASIPTP---KLKNVSRLRTEHQVYELPDSHPLLKELDRREPDDPSP 1696
             ++S+ALV    E A IP     K+K  +RLRTEH VY LPD+H LL + +RR+ DDPSP
Sbjct: 834  EEISKALVIPTPENACIPIKPPRKMKYYNRLRTEHVVYVLPDNHELLHDFERRKLDDPSP 893

Query: 1697 YLLAIWTPGETANSIQPPEQSCGSQDPGRLCNEKTCFTCNSRREANSQTVRGTLLIPCRT 1756
            YLLAIW PGET++S  PP++ C S D  +LC  K C  C + RE NS   RGT+LIPCRT
Sbjct: 894  YLLAIWQPGETSSSFVPPKKKC-SSDGSKLCKIKNCSYCWTIREQNSNIFRGTILIPCRT 953

Query: 1757 AMRGSFPLNGTYFQVNEMFADHESSTNPIDVPRKWLWNLPRRTVYFGTSVSTIFKGLVTE 1816
            AMRG+FPLNGTYFQ NE+FADHE+S NPI   R+    L +R +Y G++V++IFK L T 
Sbjct: 954  AMRGAFPLNGTYFQTNEVFADHETSLNPIVFRRELCKGLEKRALYCGSTVTSIFKLLDTR 1013

Query: 1817 EIQQCFWRGFVCVRGFERKTRAPRPLIARLHFP 1831
             I+ CFW GF+C+R F+RK R P+ L+ RLH P
Sbjct: 1014 RIELCFWTGFLCLRAFDRKQRDPKELVRRLHTP 1030


HSP 2 Score: 115.5 bits (288), Expect = 4.4e-25
Identity = 62/138 (44.93%), Positives = 95/138 (68.84%), Query Frame = 0

Query: 800 KRKARPRVDLDPETERIWNLLMGKEGSEGIENHEKDKEKWWEEERKVFRGRADSFIARMH 859
           K+    +V+LDPET + W++LM  + S      +K+ E  W++ER++F+ R D FI RMH
Sbjct: 342 KKLVTAKVNLDPETIKEWDVLMVND-SPSRSYDDKETEAKWKKEREIFQTRIDLFINRMH 401

Query: 860 LVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSTSNIRTQGDVET 919
            +QG+R+F +WKGSVVDSV+GVFLTQN +D+LSS+AFMS+AA+FP+ +   +     +E 
Sbjct: 402 RLQGNRKFKQWKGSVVDSVVGVFLTQNTTDYLSSNAFMSVAAKFPVDAREGLSYY--IEE 461

Query: 920 SMVANESAACLLYPADSI 938
              A +S+ C++   +SI
Sbjct: 462 PQDA-KSSECIILSDESI 475
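Each HSP above reports identity and positives as a count over the alignment length, followed by the rounded percentage. The sketch below shows one way to extract those numbers from a summary line and recompute the percentages. The names `HSP_STATS`, `parse_hsp_stats`, and `percent` are illustrative only and are not part of any BLAST tooling; the pattern is an assumption based on the line layout used in this report.

```python
import re

# Pattern for the HSP summary line as laid out in this report.
# The optional "i" lets it match both "Positives" and the
# misspelled "Postives" label that some report versions print.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity/positives counts and reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "positives_length": int(pos_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 758/1794 (42.25%), Positives = 982/1794 (54.74%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Compare the reported percentage with the recomputed one.
    print(stats["identity_pct"], percent(stats["identity"], stats["length"]))
```

Recomputing the percentage from the counts is a quick consistency check when the summary lines are copied out of a report by hand.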

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8LK56 | 2.4e-302 | 42.25 | Transcriptional activator DEMETER OS=Arabidopsis thaliana OX=3702 GN=DME PE=1 SV...
C7IW64 | 2.0e-272 | 49.35 | Protein ROS1A OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1A PE=1 SV=2
Q9SJQ6 | 1.4e-254 | 48.18 | DNA glycosylase/AP lyase ROS1 OS=Arabidopsis thaliana OX=3702 GN=ROS1 PE=1 SV=2
B8YIE8 | 4.6e-245 | 38.26 | Protein ROS1C OS=Oryza sativa subsp. japonica OX=39947 GN=ROS1C PE=2 SV=2
Q9SR66 | 7.2e-206 | 42.34 | DEMETER-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=DML2 PE=3 SV=2
Match Name | E-value | Identity | Description
XP_011655842.1 | 0.0e+00 | 83.77 | protein ROS1A isoform X1 [Cucumis sativus] >XP_011655844.1 protein ROS1A isoform...
KAG6601206.1 | 0.0e+00 | 84.13 | Transcriptional activator DEMETER, partial [Cucurbita argyrosperma subsp. sorori...
KAG7032001.1 | 0.0e+00 | 84.13 | Transcriptional activator DEMETER, partial [Cucurbita argyrosperma subsp. argyro...
XP_023518039.1 | 0.0e+00 | 84.13 | transcriptional activator DEMETER-like isoform X1 [Cucurbita pepo subsp. pepo] >...
XP_022997004.1 | 0.0e+00 | 84.18 | transcriptional activator DEMETER-like isoform X1 [Cucurbita maxima] >XP_0229970...
Match Name | E-value | Identity | Description
A0A0A0KTG6 | 0.0e+00 | 83.77 | ENDO3c domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G615310 PE=3...
A0A6J1KCN5 | 0.0e+00 | 84.18 | transcriptional activator DEMETER-like isoform X1 OS=Cucurbita maxima OX=3661 GN...
A0A1S3BGJ5 | 0.0e+00 | 83.50 | transcriptional activator DEMETER isoform X1 OS=Cucumis melo OX=3656 GN=LOC10348...
A0A6J1GZH0 | 0.0e+00 | 83.91 | transcriptional activator DEMETER-like isoform X1 OS=Cucurbita moschata OX=3662 ...
A0A6J1CCU9 | 0.0e+00 | 82.29 | transcriptional activator DEMETER isoform X1 OS=Momordica charantia OX=3673 GN=L...
Match Name | E-value | Identity | Description
AT5G04560.2 | 1.7e-303 | 42.25 | HhH-GPD base excision DNA repair family protein
AT5G04560.1 | 4.2e-302 | 42.56 | HhH-GPD base excision DNA repair family protein
AT2G36490.1 | 1.0e-255 | 48.18 | demeter-like 1
AT3G10010.1 | 5.1e-207 | 42.34 | demeter-like 2
AT4G34060.1 | 1.4e-140 | 48.52 | demeter-like protein 3
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003651 | Endonuclease III-like, iron-sulphur cluster loop motif | SMART | SM00525 | ccc3 | coord: 1486..1506; e-value: 9.5E-4; score: 28.5
IPR003265 | HhH-GPD domain | SMART | SM00478 | endo3end | coord: 1314..1485; e-value: 2.8E-4; score: 17.2
IPR003265 | HhH-GPD domain | CDD | cd00056 | ENDO3c | coord: 1335..1451; e-value: 1.74169E-14; score: 70.7334
IPR023170 | Helix-hairpin-helix, base-excision DNA repair, C-terminal | GENE3D | 1.10.1670.10 | - | coord: 1385..1514; e-value: 3.9E-32; score: 112.4
IPR028924 | Permuted single zf-CXXC unit | PFAM | PF15629 | Perm-CXXC | coord: 1695..1726; e-value: 3.1E-14; score: 52.9
IPR028925 | Demeter, RRM-fold domain | PFAM | PF15628 | RRM_DME | coord: 1729..1829; e-value: 1.5E-54; score: 182.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1279..1299
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 257..272
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 273..292
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 231..296
None | No IPR available | PANTHER | PTHR46213:SF13 | TRANSCRIPTIONAL ACTIVATOR DEMETER | coord: 210..1839
IPR044811 | DNA glycosylase, plant | PANTHER | PTHR46213 | TRANSCRIPTIONAL ACTIVATOR DEMETER | coord: 210..1839
IPR011257 | DNA glycosylase | SUPERFAMILY | 48150 | DNA-glycosylase | coord: 849..1508

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0014193.1	Tan0014193.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006284	base-excision repair
biological_process	GO:0080111	DNA demethylation
biological_process	GO:0006306	DNA methylation
biological_process	GO:0009793	embryo development ending in seed dormancy
biological_process	GO:0090305	nucleic acid phosphodiester bond hydrolysis
biological_process	GO:0006349	regulation of gene expression by genetic imprinting
biological_process	GO:0006281	DNA repair
cellular_component	GO:0043078	polar nucleus
molecular_function	GO:0051539	4 iron, 4 sulfur cluster binding
molecular_function	GO:0003906	DNA-(apurinic or apyrimidinic site) endonuclease activity
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0035514	DNA demethylase activity
molecular_function	GO:0019104	DNA N-glycosylase activity
molecular_function	GO:0046872	metal ion binding
molecular_function	GO:0003824	catalytic activity