Tan0008767 (gene) Snake gourd v1

Overview
Name: Tan0008767
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: KH domain-containing protein
Location: LG06: 529051 .. 550580 (+)
RNA-Seq Expression: Tan0008767
Synteny: Tan0008767
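
As a quick check on the Location field, the span implied by the two coordinates can be computed directly. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state); the function name span_length is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (LG06, plus strand).
gene_length = span_length(529051, 550580)
print(gene_length)  # 21530
```

Under a 0-based, half-open convention the same coordinates would give a length one base shorter, so the assumption matters when comparing against the gene sequence listed below.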
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACACCAACTTCAGCCCCAAACGTCCATCCGACGCCATCTCCGATCCCAACCTCTCCGCCGGTCGTTCCGTCCGCCCCAGACAATTACCTCCGCCACCGCCTGTATCCACCACTGCACCTGCGACATTAGGTTTAAACTCTACCTCCAGAGATTCCCCTGTTCTTAAACCCTCGTCTCCCTCCGATACCCTCTTTCGCCTCCTTTGCCCAGCCTCCAAGGCCGCCTCCATCCTTCGCCATCTCTGCGATATTCCGGGCGCTAGGATCCACATCGATGAACCTCTTCCTTCATGCGACGAGTGCGTCATTGTCATCCTCGCCGGCTCCCCATCCAAACCCGCGCTCACTAATCCAGGAAATGATAGAGAATTCCGTGAGCACGATATCAGTCGCAACGTCTCTAGTGATGCGGTTGCTGGAAATTCGGACGAGCGGTCGCAGCTGCTGCTTCGAATTTTTGAGAGTATGATTAGGATGAACGAGGATAGTGGAGAGAACCAAGACGATCGGATTACCGGTGGAGAAACTGACGGGTTGGTGGTCTGCAGGTTGCTGGCGCCGAGTCATCAGGTAGGGCGCGTGCTTGGTAGAGGAGGCAAAACTGTGGAGAAGATTAGACAGGAGAGTACGGCTCAAGTAAAGATTTTTCCCAAGGATCAAATTCCCGCGTGTGCGTCACCACGTGACGAGTTAATTCAAGTAATACGCATGTGTTTCTTCGGCCTTTGACGTCGCACTTGAAATCAAAAGTTTCAGTTCTTATGATCTTCTGTCTTTTCAATTATCTTTCCTGCGTGGTTTGCAGATATCAGGAAGCTTTACGGCGGTGAAGAAGGCCCTTGTTTCTGTTTCCGCCTGTCTTCAGGATAGCCTGAGGGTGGACTCGAGCAACTCTTCCACCGCAAAACCATTAGGGCCCACCTCCCATGCAAGTTGTTGTTTGCCGGTACAAGACGAAGAACCTTCTCCTATGAGGAGATATATTAGCCATCATAATGCAGATTATCGCCCAAGGGGTTATTCTTCCATACCAGGACACGATAATGTTGGAGCTGGTCAAAGAGCGGCTATGGAAGAGGATGTAGTGTTTAGATTGTTGTGTCAACCCGACAAAGTTGGAAGTCTAATTGGGAAAGGTGGCACTATAGTACGAGCTTTGCAAAGTGAAACGGGTGCATCTATAAAGATAGTTGATACACCCGACCTGGATGAACGCGTGGTTATGATATCTGCAAGAGAGGTACGATTTTGTAGAAGCTTCTATTTCCTCTGTGATCGTTGGTGTTTTTGCACGTGAAAGTCAATTCCTAGTAATTTATTCGCATTAGTTTACCCATTCCCCTATTTTCCATTTTTTTTTTAATTTTTATTTTGGGTCATATGAAAAGGATCAAGACATTGCTGTTACGGAGATTTGCATGGAACCCTCTCCCTTATGATCAGCGTGTAGTCATTCCTTGTCACCAATATCTTCCACATAGATCAAGAAACAAACTAAATCATTGTTTCTTTTAGGAACAAGGAGAAAGATGAATCAGATTTGGAATTTGAAAGCCCTACTTCAGTGAAGTAAATGTATTCCTGAATTTTCTAAACCAAGATCTTAGAACTTGTTTGAGTTCATAGAATACTTTCCTTAGCTTACACACATAATTAGGTTTTTTTTGGTCTATTGTAACGACCCTTAAAACTTGGAATGAACTTACTTTAGGGCATAACGGTATTGCTAGGAGAAATCCAAAATTTTCCATGGAACCTATAAAAACAGATGTCACAGAGAATGCTCAACCTCCACCAATCTTCATTCAGAGAATAGGTTGGGCTTCAACCAGTGTTCCAACAGGCAAAACTTTTCAAGCAGACATGCAACAGAGCTAAGAAGCACGGACACAGACACGGACACGGCGACACGCCAATTTCTAAAAATGTAGGACACGGACACGTTGGGGGACACGTTCTTTATTTTATTTTTTTAGGTAATATGTGTATATATATTA
AATAATATGTGCTTCATTAATACATAAATGACAAACAATACTAAGAAGTTCAATATCCAACATGTGAATACAACAAAATATTACAATACAAATGAAAGTACTAGAAAGTTAGAAAGTCATTCTCATTCTTCCATACAAAAAAAAAAAGACTAAAACTCTCCTCTCAAACTTCTACTCTGTCTTCATCTCGATCAGTATCACATTTGTCAAACCCACTACCACTACCATCATCAGTAAAGACCACAACCTCTAAATCTGGTTCATCCAATGATAAATTTGCTACTTCAAGCATTCCAACTTCCTCAAAAGAATCAAAACAATCTCCAAAGAATATCCCATGATTTCGCTCTCTCCTTGTGAATATTCCGAGTTCTTCTTGACAAAAGACGCAAATTACTATGGACAAATACCAAATCATCTCACGCACGTTGAGGCATCATCTTATTTCGTCTAACAGAGTTGATAAATGAGTAGGTGCTCCAATTTCTCTCACAACATGAAGATGATGAAGGTTGTACTAGTAGCTTGAAAGCAATTGATTGAAGGATTGGTGCATAGACACCATGGACAACCCACCAATCTTTAGGATTTGAACGATACCTCTCACTTATGGAATCAATGTCGGAAAAATCTTTTCCCTTTGTTGAAAAATTAGCAAACTCGTGTTTCACTTTTATACGGTCTTACTGGATCGGAAAATATCTCTTGAGACACCTCATTCTCTCACGAGTTACTTCAATATCTTTATGTGGAGGTACTCGATTAGGGCCTTCTTTAAGCCATTGCTCACTATAGTACCTAAATATTTATATAAGTTAAGAAAATGTTGAACTTTAGAAATTATAATACAAATGGAATAAGAATGGAAAAAATGTAAGGATCAAAAAGTACCTTGGGTTCAAAGAATGTGCCAAGCAATGGAGTGGGGTATTGTTCTTGTTCCAACGATCAATAAGAATATTATGCACCACGTCATAAAAGGAAGAGAATTCATCTCCATGTTTTCCTTCATGTCTAAATATTGCCATCTTCACCTTTTCAATCATAGTGTCCCACATGTCATAAACCAAATGAAGGCTAGGTGCATCAGTATCACAAACCCGAATCATATCATATATAGGTGAGGTGAAGGAAAGGATATAGTCGATTTTGTCCCACCATACATCATCCAACACCAATTCTTTCACATGTCTTGCTTTTCCTAAATCATCCTCTCTATAACATGTCCATTGGTCACTAATAACCATAGCTTGCAAACCTCCTTTAATGAGTTTAAACCTCTTGAGCATAATAATGACGGAAGCATAACGTGTTTTTCACCACGTAAAGTAATTTCAAAGATACAAACTCATTAAAAATCGCAAAGTCTCATTGAATGATTCATGATAAAGTTCTTCACAACCATCACATCACTGGCAATCTCAAAATCCAACAACATTCCTCATATACTAGCCGGTTGTTCTCAACATTCTTTGCTTGCACATATATTCTTTAAGGCAAGATTAAGAGTATGCACCACACATGGTGTCCATACAATTGTGGGAAATTGTGCTTCAATAATTTGCCCCGCACCCTTGCAATTAGGAGCATTATCAGTTATCACTTGAATAACATTTTGAGGGCCAACTTCATTAATGACTTCCTTCAACAAGTTTGCGATAAAATACTTATCTTTGACTTCACCTGAACAATCTACTGCTTTTAGAAACATTGGTTGACCGTCTGTAATTGCCATAAAGTTTATCAAAGGTCTTCTTTGTGAGTCACTCCATCCATCACTCACAATACTCACACCCTTTTCGCTCCATGTGTCTTTAATGGGTTGCAATAATCTTTGAACATTTGCCTTCTCTCTCAAGAAGAGTTGTTCTCAACAAATTGTATCCAGTGGCACATATCCCGACAATAAATTATTTGCAAATATAAGTAAATGCACCTATAAAATGGGGATTTCTTGCTAAATGAAAAGGCAACCTCAAGAATAGAACATCCGAGCAATA
AGAGCGTGTAATTGGTCCTTGATGCCATGTTAAACGACTTCTCTATTGCACTTGGAGCACCTTTCCTTTTCTTTTGTTCAAAATTAATTGTACTAGGAGGATTACTTACACTAACTCCACCAATAGATATGATTGGAGGTGGTAAAGGAATGCTTTTAGGGGTTTTTTTAGCCATACGATTCTTTGCCTCGTCTTCCAATCTATGCATATCAACCATATCTTGGGGGGTAACTTTTAGACATGGTCCAATTCCATGACCATTTAACTTCAATAAATGAGCTCTTACCCTTGTATAAGAACTCTTCTTTATGCAATGACAAAAATTACATTGCCATGAGAAATTGCCTCCCCTTCATTCAATTTTTGATGCTTGGTCACATATTTCCAAAGTGGCTTTGATTCATCCTCAATATTATCACTAGAGATTGAAGGGAGTGACGATTGACTACAATTACTAGATGAGGAAGAGGCCATCTAATAATGAAAAAGAAAACCAAACTTTTTTTGTTACTAAACAGTAAACACTTAAACAATAAACAGAGAACATTTTTTTTTTTAGTAATAAACAGTAAAGCACGAAGTAGCAGCAGAAAACACAACAATACTAAGAAACTATTTTAAAAAACTCATAACCAAGAAAACACAACAAACCTTTGGGCTTTGAGGTGGTGCGATGCGATGGTTTAGGAAGCACGGCAGTGCAGGTCGCGGCTCGGAAGCACGGCGGCGAGATAGAGAGGTTGCGTCGGCGGCTGAGAGCGTAGCCGCCGCGTCGGCGCAGGTCGTCGCCATTCGTTGGAGGCGCAGAGTCGCCGGGCAAAGCAGCAGAAGGGAGACGAAGGCAAGCAGCAGTCGTCGTCGCGATAAGTTTTTGTTTACTATTTAGTATCGGTTTGTTTTTTTTATTTTATTTTTATTATTATTATTTACTTTATTTTAAACGTAAAGTTGGGCCAAGTTAGGCCCAACGGTTTTTTTTTCTCTCTTTTAAATGGTAAGCGTGTCCCCAACGTGTCTGAAAATAAAAAAAAAAAAAAAAAGGACACGAAAATTGCGTGTCGGACACGTGTCCGAAGCGTGTCCGGACGTGTCCGTGTCTGACACGGACACTTCACCAAATATGAAGTGTCCGTGCTTCCTAGCAACAGAGTTTAACAAAAGAAAAAGAGTCTATCAAGCTCTGAGGCAACCACAGAAAACTTTATTCAAACATCCAGAAACTAACAACCAACAACCTACACAAAAGAGAACTAGAGTCGACGAACAACTAAGGTAAAACAGGCCGTTACAAAATCAATCACAAAAATGGAAACTAGGCCTAAAGACAAAAAAGGGAAACCATAGCCTCGCTAGGTCGTCGTCATCGTCGGCCGTCTCATGGTTTCTTACCTGCAACCAAAAACATCAAAGTAGACGGGTGAGTATAAACATACTCAGTAAGCAACTCACTCGCGTTCAAACAAGCTAGCAAGTGTTGTTGTAACTAGTGAGAGAGAAATGAGAAAAATGGGAAGTGTGTTTATTTCATTCTGAAATGCTACATATAAATACACATCTAAAGTCTAAAAATAGAGTCCACAATGTTAGCAACTAATCCCACTAACTCCACAAAAATAAGAGCTTAGACTTGACTACACTAAAAATAACCTATTTCCTAGAAACTAGTAATTGACTAGACCGACTCAAAACTAATTTGTGTTACTAACATTCTCCTCCTAACTCAAATACGACAAACACCTAACTTGCTCCCAAGAGCTTGAAAACGACCCACACAAAAAGACTTAGTGAAGATATCTGCTATTTGCTCTTCTGAGCTGCAATGCACCAACTGCGCTTCTTCTACCTTATGCATCTCTCTCAGAACAGAATACTTGATCTTAAAATATTTGGTCCTTCCATGAAAAACTGGATTCAATGAGCTTGCTAGGGTGACCTGATTATCCACAAACACCTTCACGCTCTCTTCTTGTTTCAGATGCAAGTCATGCATCAATTTCTTC
AACCAAACAGCTTGGTTGACTGTTGCCGTCACTGCAATGAACTCAGGTTCTGCAGTTGACTGAGCTACTATATCTTATTTTTTTAAACACCATGAGAAACACCCTGAGCTAAAAGTAAAACAATAGTCAGAGGTGCTCTTCATATCATCCAATGATCCAACCTAATCACTATCTAAGTAACCTTGCAACTCAAAATTTTCAACTTTACTAAATTTGATCCCAAAAGACCGTGTGCCTTTCAGATATCTCATAACCCTTTTTCCAGCAACCATATGATTCTTGCTTGGGCAGTGAAGAAATCTAGATAAGACACTTACAGTGTACATAATGTCAGGCCTAGTGGATGTGAGATACATGAGACATCCAACTAGACTTTTGTACGCATTCTCATCAGTAGGTTCCTCTCCATCATCCTTTTGCAACTTCTCCTTCTGAATCATTGGAGTGTTCACACTCTTACACTCAGTGTTTTAAAAAGCTCATTCGGGCGCGCGCTTAGGCTCAAGGCCCACCTTTAGCGCCTCGCCTCTTATCGCGAGCGCCTTCCATGAAACCCAAGAGGCGCACTAGGCGCGCACCTCTGGGGCTTTTTCAATTTATTATTATTTTTTTTCATCTTTAAAACTAAAACTTACCGTTTTACTTATTTTAAATATATTTAGTAAAGAAGAGATTGATATTTAAGGCTTTTGATCTTGTATTGTTTATTAGTGTTGCATTATTTCATGTTTTTTAATGGAGTATTGACTCTATGTTCAAAAACTAATACTTGAACCTTATTTCTTTTACCTATTTTTCTTGGTTAAAACTTAATTTCTTTCCATGAAAGGTTTTTTCTTATTGTGTATGTATACATATATATTTTAAAATATATATTTTTTATATATAGTGCCTAAGAAGCACGGACACTTTAATTTGGCTAGCGTGTGCGTGTCCGACACGTGTCGGACACTTGGACACTCCGACACTTGTTGGACACGTATCAGACACTTGTTAGCATTATAGATGTGTTAGAAACTAGTTGTACAAAGTCAATATAGGTCCAATATTTGTTAGGCACATAATGAACACTTGTCAAGTATACTAAATAGGTACATGTAATAAAAAAAAGATATTGTAATAAAGAAAAGAACTCATTATAATTTTGCAACACCTAAGGAATGAGCGGAAATGCTTTCTCCTTCACCTGAGCATCTCCGACCACCATGTCTGTTGCTCCCCAACAATCCTCTTCTATTTGGAATTACTCCACTCGATCGGTCCTCATTGGTCGAAAAACTTTCACGATAGCTTTCGATGAGTCTTCTAGGGGAAGTAGAGCAAGGATCACTTAACTTGGAAGATTCTCACCTTACTCTTTATCCCTCTCTTGGAGCTCTCTCAAATGGCTTTCTACCTCTTTTGAGGCTCTTTACAAAGACCCTTTCTCTTACAAATTTTTTAAGAAACATCAGACTACCAATGATACTTTTTGGCTGGAGAAACTTAGTAACAAGTATGGTTTTTATGTGGAGTTCACCCAACTTAGTCACACTGGAGAAAGGAAGAAACTTCTTATTCCATCGAAGGACAAGAAGCAAGGATGGTTTTCTTTTTTCTTCTTTTTCGATTACTCCAGTGAGGCTCATAAAAAGGATTCGTCTATGTCTTTTAGTGCGGCTATCACGAAGCCATTTCTTAAAGAAAACTATGTTGATTGTGGTGACCTACCAACCAACTCCCATCAACAAACGACAAATTCAACAAAGATTTGGAGTGGAGTTCTCATTTGTCAGCTTTTTTCCCAAAAAGATTCTTGGCCAAATATTCGAATTGCCCTCCAAGAAGGCTTTTCTCCTCGATGCACTATCAATCCATTTCAGGATAACAAGGCGGTGATTCATATTTATGATGCTTATCTTCCTGTGCAGTTTCGTACTAGTTCCTCTTGGCATTTGGTTGGGGATTTCAAGCTCAAATTTCATGCTTTCTCTACTTCTTCCTTTTTGCAAGACAGGATG
ATTGCCCCTTATGGTGATTGGATTGAGGTGTTGGATCTCCCTCTCAACCTATGGACAGAGGAAATTTTTAGGTTCATTGGAGATCAATGTGGCGGCTTTGTTCAAGTTTCAAATCACACAGAAAGGTGGTTATCCTTCATGGCGGCGCGGTTGAAAGTCAGAAGAAATTCAATTGGTTTCATTCCAGCAGTGATTGTTGTACCGGAATCCATTGTCAGAGTCTTGATCTCCGTTCACATCCGATCGATAACGGCTAGCGAACAATTAAAGGAAAATCCTACTATTATGGAATCTTTAATTGCAGTTAATGTTGACTCGTCGGTTTCCTATGATCACGCAGATAAAATAGGCAACCAACCTTTAATTTCGAAAGATCTTGTTGTTGAAGGAAATCGTAATTCGTGCTTGAGTGATTTGGTTGTTGATTCTCCTCCTAAAATTTCGGTTTTGAGTAATTTGGTTGTTGATTCTCCTCCTAAAATTTCGATTTTGAATGATTTGGTTGCTGATTCTCCTCCTAAAATTTCGGTTTTGAGTGATTTGGTTGTTAATTCTCCTCCTAAAATTTCGGTTTTGATTGATTTACCAAATCCCCCGATTCTGGTGGACCCCCCCTTACAAGGTATGAATCATTTGGGTATCTCTAAAGCATCTTATTCTTCGGTTTTAATTGGGGAACCATCGAACACGAGCTCCCCTGCTTTGATACAACCAACTTTCTCCTCCATTTCATTTATTGAGGGGCCCACAGTTGACCGGATCCCTTCAACACTCCATATTGGATCGCAACACTCTACATCAGATGGAAAGCCAAATCTACCCTTCAATGCTTCGGATACAGAGGCGTATCTCTCTAGTCCCAATTCTATTCGATCAGGCCATGATGTTGTTTTCCAACCAGAACCCTTGGATTTTGCCATTTTCGGGGATAATCCATCCATTGATTCTACTATTCCCTTAGACACTTTACCTCCAATTCCACAGGACTTGAATTCAAGTGTCACTGTGAATCAATCTCAGTTTTTAAACAAACCACCTTTACCTTTTTCTCCCTCCTTGAAACCTTCCATTGAACCATCTTCTCCAACCTTCTCTACGAAACATCATATCAACACTTTTCCGTGTGAGTTGCGTGACATTGCTCACCTTCTCACTAAGCATGGGTTATGTATCATGCCCATTCCAACCCTACCCCCACCTACCAAGCCTAAAAAGACTTATAATAGAAATGGGAAGAAGGACAAATTAAAGAGGGAGTTACTGAAATTTACAAACCACTGCTTCACTATGACAAATCGCCACATTGGCTCTAATGGAAGGATCTTCTATTGTAAAATGAAATTTATCACTTGGAATGTTCGGGTTTAGTTTCTTGGAAGAAACAATTCTTGTTTAAGGAATTCATTCTCAAACAAAATTCGAATTGTTCGAAACTAAGTATCTTCATGTCATCACCAACCGATAAACTCTATTTGGAGCTTGGTCGGGCTTTCCTTGATGCACTTATTTCATTTGGTGGATTTTAGTTTTGGAGTGAACCAAGGGCTCTTTGTTAAAGAAGTCACTCGAGATTGGTAGCTTTACTTTGGTTTGAGAGAAATAGTTGTCTTTTCAGGGTTTTTTCTGTTCCTGTTGGGGTTTTATGGACAAAGTTTCTTCTTTTTGTTTATTCATAGTGCAAGATTGTTCATCCATTTAGCCTTTTTGGTTGTTCTTTTTTATTTCCTTTTGGTATTTTCTTGTAATACACCTTTTGGAATTTTTCTCCATTTCATTCATCAATGAAATTGTTTCTTATACCAAAAAAAAATAGGTACATGTAATATAGGACAAAAATAATGAAGTTTGGGAACGAAATACATCAAAATCATTTTTTTAAGCATATAGATACATAAACTTATTAACTTTAAATCTCTTTCTGATATAAAAATGATATATATTTTTAAAATGTATATTTAATAAACGTGTCCTAGCCGTATCCATGTCCTAGATTTTTAAGA
AATGACGTGTCGTCGTGTCCGTGTCGTATCGTATCCCTGTCTCGTATCCGTATCCGTGCTTCTTAGTATAGTGCGCCTTAGAAAAAAAAGCCTGCGCCTTGTGCCTAGGCTCCAAAGGGCTATTGCGCCTTAGTGCACCTTGAGCCTTTAAAAACACTGCTTACACTCCTCAATATTGAACCTCTTCAATATCTCTCTTACGTATTTCTTCTGACATAGGAACACTTCATTCTGCGTTTGTTGAATCTCCATACCTAGGAAGTAATGCATTAGACCTAGATCAGTCATCTCAAATACTTTCATCATTTCTTGCTTGAACATCTCTATCTGAACATCATCACTTCCTGTCATCAGTAGGTCATCTACATAGGGAGATACAATAACAATAGCAACACTTGATTTCTTCACATAGAGTGTGGATTCACTCAGACTCTTCACAAAACCCAGGTTCAACAAATGATCATCCATCTTACTGTACCATGCCCTAGGAGCTTGCTTCAACCCATAGAGAGCCTTCTTCAACAAATAAACCTTCTGCTCTTGTCCTGGTATGATGAATCCGTTTGGTTGCTCAACATAGATCTCTTTTTCTAACATTCCATTTAGGAACGTTGACTTCACATCTAGCTGATACACCTTCCATCCATGCTGGGCTGACACAACCAATAGCAACCTGATTGTGTCCATTCTTGCAAGCAATCGTGCAAAAGTCTCGAAAAAATCAATTCCCAGTTTGTGCATACCCTTTCACCACTAACTCGGCCTTGACTCGCTCACTTTCTGTGCATCGCTTACTTTCTTCACCAAATTTCTCCCAAGTATTGTTCTTCTCTATCATACTCAACTCCTCATTCATTGCCTCCATCCACTTCGTATCTCTTTTAGCTTCTTCATATCCTGCAAGCTCTAATACAGCTACATTACTTCTTTCATACACCTCACTCAATAGTCTAGTCCCTCTCATTGATGCATCATCAACTAATTCATCAGGATCTAGAGAAACTTCTTGTTGCTTCACCTCTGTTTCTTCTGTCCAGTCCCACCTCTCATCCTCCAAGAACTATCTCTGCTAATCACCATCTTCTTTGTGTGAGGTTGATAGATCTTGTAGGCCTTGGACACCATGCTATAGCCTACAAAGATTCCTGGTTCGGCCTTCTCACTCAACTTGTCTCTCTTTGCCTGTGGAACATAAAAAAAAAACACAAACAACCAAACACCTTCAAATTTTTCAGTTTTGGTTTAGTATCAAACCAAACTTCATAAGGTGTCTTCTTCTCCAATACCCTCGTAGGTAATCTATTCAATAGAAAGACTGTTGTGTTTGCTGCTTCAACCCAAAATTCCTTTGGTAAGTTTTTCTCATGCAATAAGCATCTTGTCATCTCCATAGTACTTCTGTTCTTCCTCTCGCTGACTCCATTTTGTTGTGGAGAATATGGAGCGATGAGTTGATGCTCTATCCCACTTTCTTCTTCACGAAGAATTGAACCTTTGCGATGTATATTCAAATTCCATTATCGCTCTCACCACCAATCCTATAATCACTTTGATTCTCAACCCATCTCTTAAACTTCAGAAACACTGTAGCTACTTCAGACTTATGCTTCATGAAATATATCCACAACACATTCTGGTATAATCATCAATAAAGATAATGAAATACATGCTACCATTTAGTGAGAGAGTTGGTTGAGGTCCACACATGTCAGTGTGGATGAGCTGCAACTTTTCAGTTGTTTTCCATTGTGACTTCTTGAATGGCAGCCTTGTTTGCTTCCCAGTTAAACAAGTTATGCAGTTTGTTTTCATCTCCTCCAGAATTGGTACTCCTATTGCCATCTCAGTTTTCTTCAGATACTGCAACCCTTTCTGATGAAAATGCCCCAACCTCTTGTGCTATAACTCAGTATAATTCACTTGACACTTGAAAGCTATCTGCTCATTCTCAAGTGGATCCAACGAGAAGCTTTTGTGTTGTATCTTTATTTTGAACAAGT
CATTCCCATTGGAATCAGAAATGAGGCATTTTCCTTCTTCAAACAACACTTTGAATCTCTTCTCTACTAATTGACCAACGCTTGTCTCTTCGTCGTCGTTTTCCGATAAGGTCTTTCGCTCCTTTTCGGCGAGCAACCTCCAACCTTCCTCTGCTTACCCAACCTCGTTTTCGTCAGTCAAGCTTTGTCTCCAGTGTCTCGTTTTCCCTCTTCGGTGGCCTCTGTCTTTTCGTCTCCTCCTTTGGTCTTCGGTATCTTCCGTGTCGCTTTGTCTTCTCGTTTGGTCGCCGATCTTCGTCCTTGTCTGAACCTTCCGTTTTCTCTCCTTTCTCTTCTCTTTTTCTTCTTCTTCTTCGTCTTCTCGTCTCCTTTTCAATCGGTCGTTTCTTCGTGTCTTCTCTGCATTTTTTCTTTACGTTTCTTTTCCCTGTTCATTGCTTCTCTTCTTTTGTCCCTCGTTTTCTTTAAGTGAATAAGATAGTGTGTGTGTTTGGCTTCTGGGGTTTTGGTTTCCGACATTGTCTGGCTTTGTTGGTAGGATTGGTGGTGGTATGTGGGGCCGTTGTTGTTGGTCGTCTTGGAGGGGACGACATGAGGCAGCTTAGTTGTTGTGTGGGGGACAATGTCTTTTGGGTTTTTTTGGATGGGTTGAGGGTGGTGATAGAAGATTGTGCTCGGAAGGAGACCTCTTTTTTCTCTGTCGACCACATCCGATGGTTTGCGGATCATTTTGGTACGTTGGGTCGGGATTTCAGACTTTTAAACGGAAGAAGTTTAAAGATGAAAACACTACGCTGGGCTAAGAAGCACGAACACGGACACGGACACGATACGGACACGGCGACACGCCAATTTCTAAAAATGTAGGACACGGACACGTTGGGGGACACGTTCTTTATTTTATTTTTTTAGGTAATATGTGTATATATTTAAATAATATGTGCTTCATTAATACATAAATGACAATACTAAGAAGTTCAATATCCAACATGTGAATACAACAAAATATTACAATACAAATGAAAGTACTAGAAAGTTAGAAAGTCATTCTTCCATACAAAAAAAAAAAAAAAGACTAAAACTCTCCTCTCAAACTTCTACTATGTCTTCATCTCGATCAGTATCACATTTGTCAAACCCACTACCACTACCATCATCAGTAAAGACCACAACCTCTAAATCTGAGTTTTTTTTTTTCTCTTTTAAATGGTAAGCGTGTCCCCCAACGTGTCTCCAGCGTGTCCTTGGAGTGTCCGAAAATTAAAAAAAAAAAAAAAGGACACGAAAATTGCGTGTCGGACACGTGTCCGAAGCGTGTCCGGACGTGTCCGTGTCCGACACGGACACTTCGCCAAATATGAAGTGTCTGTGCTTCCTAGACGCTGGGTTTGTTTAAGGTAAGAAGGACTGGCGGTTGGTTTGCTGAATGTTGTCTTTGGCCACCTTCGGGGGGGAGAAGTTCTTTCAGAGTGCCGGTGGGAAGTTTCGGGATGGGGTGGAGGGTGTTCAAGGACTTGTTAGAGGATCTTTTGAAGCTGGTGGACGGTCGGGTAGAGCAGGAGTTTTCTGAAGTTTCTGGCAGCAATGAAATAGCTGATGCGAATGGTCTGACCAGAAATGTCGTTGGTGCTCGGCCTTCTAGAACGTCGTGGGGTGGTGTTGCTAGCTATTGGGTTAGGAAGGGGGAGGAAGTGGGTGATTTAAATCTATCAGAGTGTGGTCTCCAAGTTGTTTGCGCAGATTGGTTGGGGTGTGGTGAAAAGGGATTTGGAGGCTTTTTTTCAAGAAAGAGGTGAGGTTGAATCCGTTTCAGGTAGATAAGGCTATTCTGGCCTTTGAGGATGATTTTGATGAGGGTTTGGCACCGCTTTGTGGAAAGTGGCAGATTGTAGGGGGCCTCCATCTGAAGCTTGAAAAATGGAACCATGAAGTGCATAATTTGCCTGAGTTTATTGGGAGCTATGGTGAGTGGATAGCCATTAGAAATCTTCCTCTTAGGC
TTTGGAAAAAGCAAGTGTTTGAAGCAATTGGGGAATGCTTGGGGGGCTAGAGGTTATCGCGTCAGACACTCTGAACTTGATTGATATTCAAGTGGCCAAGTTGAAAGTAGCTAAAAATTTGTGTGGGTTTCTTCCCGCTTTCATTCTTATTAATGATCCGCTGGTGGGGGAGGTGAAGTTGAATTTTGGTGATGTGGATGCCTTGGATCCTCCGGTTGTTTTGTCACAACCATTAGTCTTTAGGGACGTGTTCAATTCTTTGGATGTGGCAAGAGTGAGGCAGGTCATGGAAGATGAACGGGTGGGGTGGGTGGATCATGGGTTGGTTGTTATTGAAGGGGTAGCTGGTTTGCCTTATGGGGAGCCTCTTTTGGTTGATGGTCTGGATGAGTTGGAGAAGGTTGATAGCTTGGGAGAGATTTCAGGTGGGAATTTCAGTGAGGGAGCAGGGGCTTGTCCTGATTTGGGGGTTGGCTTGAACGCAGTCTTTGGGGCATTGAAGGACGACTCGATTTTGATGGGTCAGTTGGGGGCTGAGAAGGCATCGTTTTTTCCCCTTTACCGCAACCTGTAGAGGTGGTAAGGGACAGGTTTTGAGATAAGGTTTATGTAAGAAGATCAAGGAGGAAATTGGGGGTGGAGAAGGGGTTTCCATCTAGGGAGCTCTTTGAGCAAGCAGATGAAAGTTTGGAGTTTTGCATTCCAGCTACTCCGGTAAAGCCTTGGTCGAAAGTCAGTCCTTGTCGTCCCTCAAGAGTTACGACTGCCGCAGTTGAAAAGTGTCAGTTGGGAGTTAAGGAGGTTTCTTTTGTTAAAGTCTCTTTCCGGACTTCATTGGTCGATAATAAAGGGAGTTGCTTGTTCTCTCCAGTTTTCGGGCAAGACTCTACTGCTTGTTCGGAGGTAAGTCTCAGTAGCCTGGATTTGAGCTCGATAAAGAAGGGAGGTCAGATTATTTCGAATAAGTTGTCCCCAGTTATTAACCTCTCTAAGTTGTTTGATTCTGTTGACTGTGTCTCTCCCTCTGGAAGCAATGCCGAGGATGGTTCTTTTAATTGCCCTGTTGGTTTGTCTCAGTTTGATGCCTTAATTAAGGCGAGCGGCTTGCAATTTAAGGAAATCCCTGCAAGGGGGACTCCTGTGCTATCAAATTGAAATTTTGTTCTTTTGATAGGGAGATGGTTCAAAGAGGATTAGTTTAGAGATGGTCCTTGAAAAAGTTGTTTCAGAGGTTGCCCCGGTTGCCCCCATTCAAGCTTTGTGGTGTTGTGGTTTGATGGAAGGTGTTTATACCTTTTTGGAAATTGATGCTTTTGGCTAGGTTTTCGGTGCAGTTGTTGGTAGTTCAGGGGGTTTGTTATTTTTGTGACAGGATGCTAAAGTTTTGGTCACGGTTTTGGTAAAAGAAGGGTGTCTGTTGAAGGATAATGATTGTTTTCTGATAAGCAAGATGAAAGAGCTTCGAAGATTGGTTTTTTCTGGTGGTGGCGTTGGGTCTCTTTTGGAGGAGTTATGGGGCTTTGGGAATTCAGTGCTCCCTCGGTATTTTGTTCGTGTGTTTGTGCGATGCAGTTTTTATTTGTTGAGGTTCTCTATGTTTTGTATTCTTTTGCTTATCTTCATTGGAGCTTGTATTTTGAGCAGTAGTCTCTTTTCATTATTTCAATGAAATTCTCGTCTCCTTTTCAAAAAAAAAAAAAACTAATTGACCAACGCTTAAAAAGTTGTGATCAATCTCTGGGACAAACAACACTTCGGTAATCAACTTGGTTCTAGCACAACTCTCTATTGACACTGTGTCTTTCCCCTTTACTTCTAGATACTCACCGTTTCCTATCTTCACCCTTGACTTGAACGATTTGTCAAGGTCCTTGAACAACTCTTTGCCACTTGTCATGTGATTGGTACACCCACTATCAATCAACCAACCAACCATCACATTGAGTGACTGATGAGAAACAAGTAGCCACAAAGAGTTGATCTTCTTCTTGATGTAGCAG
TATGTGCTCCTCCTTGTTGTTGAGTTTTTGCTTCTTTACAGAATCGTTCAATGTGCCTCAATAAAGTACTCTTTAATGGACTCATAATCCTTCATTTGCATTCTCTCGAATTCTCGCACCAAGTTCAACACCTTCATGCCTTTAATCCTCTCGTCACCTTCATACTCACTTTTGAGAAGGATGTCAAACTAATTCCCTTTTGATTTAGAAATGTGGCAAAAGATCTATATTTTCTCCCTTAATCGACTTGCACACACTTCATATTTATTTCAAATCTATTTTGAACATGAAATTTGAATTGAATAAAGGCTCTGCTTCTTATTTCTTATTGTCAAAGAAGGATGAAGCTCCTTAGGATTATCATTTTTTGCGATTATCTTTGCTTTTGTAATCCGTAATTACCTCTCTTTTGTTTCTGTCTTTGCTATTTGTGAAAATAGAAAGTGTGTTGAAAAGGAGGACAAACTTGCTGAGATAAGTTGGTATTGGTTATTCTTGCTCGGGTATCTATGGAACATGGCACAAGCCTCAAACGAACTTTAATATTTCATTTATTCTATCTCAAAAGATCTAATACAAACGTGAGGCTATATCAAAGATGGTATTAGGTTAACTGCCAATAAGAGACAAGGTTAAGGGTACAGAAGGTGTTTCTTCACCTTATTGTAGTTTCTACAAAATCTGACTTTGATGAAGAATATTTCTGAAAGAGTGAAATAGTGGCAATCAATTTGAGGCCAAACATCCACATGGAGGGTAGGCATAATTATGTATTAGTTATAACTTGGATTAATCTTTTAGAAGTTTATCCATCTATGTGGTTATATATTTTATATTACTTTTTGGATCTTTGAAGCCATCTTGTCCTTGACCTAAGAATATTGATTCTTGCAGTACGCCTCTTGTATTCAAACAATTGTATTAAACTTCTTAGAACTGTTCAAATTACATATTGCACATTTGCTTTTTTTTTTTCCTTTTTTTCCTTAATTTCATCTCAGAGTTTGGAGCAAACATATTCTCCTGCACAAGAAGCCGTTATACGTGTACATTGTCGAATTGCAGAAAATGGATATGAACCAGGTGTTGCAGTTGTTGCTAGACTACTTGTACATGCACACCAGATAGGTTATTTGGTGGGTAGAGGAGGCCATATTATTAATGAAATGAGAAGAGGGACCGGTACTAGCATACAGATATTTCCCCGGGAGCAAATTCAAAACAGTGGGGCTGTGAATGACGAAGTGGTGCAGGTAATTTACATCTTCTAACATGACTGTGTATCTCTCCTCAGTACTAACTATTACCGAGTGGTTTGGTTCGGTTTTTAAGAATTTTTTAGCTTTTTCGGTTTAACCTTTTAAAAAACCGAATTTTTCGGTTCGGCTTACGGTTTATAAGAAATCCAAAAAAAACCGAATCGAACCGAACTTTTTACTTAACAATAACATAAGCCCACCACCAACCAATAAAATACCCAAAGCCAAAGCCCAATTTCACATTCAAACCCTAAGTCACATCTTCTCTTTCCCTCACTCTCTTTCTTTCAATCACCATCACGTCTAATCTTTCACTCACTCACTCTCCTTTTTCTTTCCTTCTTTTTCAAGCTTATGAGTTTTCTCTCTACTCCATGGTCGTAGCTCACACACTCTTTTTTCTTTCCTCCTCCTTCAAGCTTATTATTTTTCTCTCTACTCCTTCAAGTTATGGACGTAGCCCAACCCTATCACCGTCGATGATTCTCGTTTTCATTAAGGGGACGTTTGGATCGCAGAGTTGGGTTGAGTTGTGCAGAGTTGTTTTGTCAGTGAGTTCTAGAAGTCAGTGCTTGGTTTGCAGAGTTGAGTTATCATGTCATGGTTATCTGGTGCAGTTTTTTGTAAAGTGAATAAGTCTGATTATTTGTGACTTTTTTTGTTTCCACCAAATCAATTGCAAACATTTTTGTCTACAATTTCAAAACTTCATTATAAGATCAATTACAAATTTACAATATA
TATTACCATTTGCAAGATGTATGAGATATTTAGAACATTACTGGATATTGAGATTGATATCAATTTTTCTTCTCTTCTAAAAATCATTTTTATCTTATGTAAAGTGAAAAGCCTGATTTTTTTGTGATTTTTTTCCTTACTTTGTTTTCACAACAAATCAAATACAAACACATTTGTCTACAATTTCAAAACTCAATTATGTAATCAATTATAAAATTACAATATATATTACCATTTGTCAAATGTATGTTAGGGATATTTGCAAAAGTAATGGATATTGGGTTTGATATCAATATTTCTTTTCCAATAAAATGCATTATTTAGAACTTTTACTCTTCAATATGAACTTAATTCCTTTTTTTATGCACATGTTTACCACTACTTTTTTATTTTAATTTAGGTAAGCATTAATTTAGTGTTACTACTTTGCATATGAAGAATGTTTATTTAAATTTATCTAAACACATGCATTGTCTATGGACATCTTTAGAATGGAATAAATAGTTTACTTTGTGAAGATGTTAAAAATATGCCAAGATACTTAAATATTTTATGTCATATTGAAGTTGCATATCTACAAAATATTAGATTGCGTTGTTTGGCCTCCTTCTAGTGGTCGAAAGCGGATCAGGGTTTCATTGGGTTTGTATAACAGAGGTTGGTTAGTTTTGGGGGAGATGTTGTTTGATTTTTTTGCAGCGGTTTGATCCTCATTTTAATGTCATTTGCCCTGTTGTTTCTACTTTCTCCTATGGGGATGGAAGGTGGGGATGTGGAGATGACTTTTCCTGCTAGAAACATCTTGGGTATCCATTTTGGGTTGTCTTTGAGTCAAGCCTTTTGGGTTAGAAAAGAAGATGAAGTCCTTGTTCTTGATTTTGGAGGTCAGTTGATGGTATCTCGTCGCTCATAGTTCTTGGGCATATATCAAGTTGATGATTGAGCATTTGGTGTGAAGATTCAGATTAATCACTTTATGGTTGGTAAGGCATTAATTTTGGTGGTGGAGGGTTCTTCTGAAGCACTTTTGTCTCGACTTGGAAAATGGCAATGGGTGGATGTTTCTTTTCTCAAGTTTGAATTTTGGTCGAATAAAGATTATAGTTTGTCGGATTTCATTTGTAGTTATGGGGGTTGGGTTTCTATTAGAAATCTTCCCCTCAAATTTTGGAAGAAATCGACTTTTGAGACAGTAGGATTACATCTGGGAGGCTTGATAGAAATCTTCCATGATACTTTGAACTTGATTGACATTTCGGCCGCTCATATTAAGGTAAATTCTAACCTTTGTGGATTCCTTCCCGCTTCTATTGATATTTTTGTTGAGTCTCTTGGTAAGTTTGACTTGAAGTTTGGGGATATTGAGGCAATCTCTCCTCCTGAAAAGGTTTTTGATTCCCTAGTGTTGCAAGATTTTTCTAATTCAGTTGATCTGGCTAGAGTTGCTCAAGGCGGAGTCGATGGAAAGGGAAATGGCTCTTCGGTAATCCATGAGAAGGGGTTTGTTGAATCGGGTACTGTTTTGTCTTGGGAAGTTCAAGGTGACTCTTTGGTGGTCGGTTTGTTTAATAAGGATAGGATCATGGTGGTTTTTCGGATCTGTCTCTTGATGGCAACGATGAGAAGCAAGGGGGCTTTCCCTTAATTTCAGAGCATCTGATGCATTGATAATGGTGAGAAGGGGGTTGCCAATTTATTTTCATCCACGATTCCTTCCTTAGTTGAGGAGGTAAGGGATAAGTTATGTGCTAAATATTACACGAGAAGGGGTGATAAAGAGTTGCATGCGAAGAAAGAAAGTGCTTTTGTGCAATTCTCACTTGATTCTTTTGAAGAAGCTTTGTCGTTGGGAAGGTTTCTTCTTTGGTTTCCAAGCAAGTTGATGAGGTTCCTTTGAAGGAAAATTTTGTTCTTATTGATGATTGTCAATTAGGTCCAAAAAGGGTTTACTTCTTTAAGGCTACTTTTCAGCCTTCGGGTGGTGCTTTAGATTCTTTAGA
TAGCTTCTCTCCAGTGCAAGAGAAGGATTATGATTTTAGTTCGGAAGTCACCTTAAGTAGTCATGATTTGGGTTCTTGTAGGAATTCCATACAACATGTTTCTTTGGCGGGTTTTCCTTGTTTTTATCTGAATAATATTTTTGCAACTCTAAGAGAGGGTTTTTCCTCAAAGGTGTTCAATGGTTTAGTTGATGCCTCCTATCCTGAATTGAGTCGTTTTAGTTATTTAATCAAAGCTTTTGGTCTACAATTTAAAGAAGTGCTTGTTGGTGGAGGCATTCACTTTCCTCTTATGTTATTTCTTCGTTATTATATATTGAAGTTGTTTCCAAGGAATGGATCTATTGCAATATGTATGACAGCTGAGCCTTTAATCTTGTTAGCAGTTTGAAGGCTTGTAATGACCGGTCAAAGAAGTTTCTATCACCTTTAATTGATATTTGTTCATTAAATTAGTTATTTCTTTTAAAAATTCACTGCTTCTCGTCCAATTTTCTATTATGAGTTATGATTAATGTTCGTGGCAGGTCATTGGAAGTTTGCAGTCTGTACAGGATGCTCTGTTTCATATAACAAATCGACTTAGAGATACTTTATTCCCAATGAGACCACACATGCCAAATTTCAATACTCCATCTTACTTGTCTCCCCACCCTGAAACACCTCCCCCCTTATTTAGGCCGGGAAATAATGCACATTCTCCTGGATGCTATCCTTCTCAAGCTGGAGCTCTTCGTGGGACTGAACGTCTTCATTTCCATTCCCACCCTCTCGATCATCAACCTGTATATTCTCATAGTACGAATTTTGGTGGTAATAACATGGATGGAGTACATTATCCCCATGGCATTGAGAGACCATCTCCTAGATCGTGGATGAGTCAGGTCGTTGCTTGAACTTTCAAGATAGTTTCTTTGTCATGGGAAACTGATTGGAAATTATTTTTGTTTGTTTGTTGAATTTCATTCAGGTTAGTAGCGAGATACCTAAGGCAGCAACTGATGTTGGCTTTGGCATGGTTTCAAGGAATGAGTCCTACAGCAGGTAATTATTATTTTTTTTTTTTATCAACGTTAATTCAGATTGCCAGAGTGGTCATTATGATCAACAATGTTGAATTTGAGCTGCCCGGTGACATAGGAAGACAAGTGTATATAGAGGTTGTATGTTTTAAGGGTTTATTAATTTCTTTGAACAGCGGGGGTCCAGCTCATTTTCTGGGAGGCACATCTATGGAGATAGTGATTCCACAAACTTTAATCTGCCACATTTATGGAGAGAACAACACTAACATTGCTCATGTTCAGCAGGTTAGTTGGGTACAGATTGTGTTTTTGCCTTTAAATGTGTTTGTAATTTGTTGTATGTTACTGTGGGCAGATATCAGGGGCAATGGTGGTGGTTCATGATGCTAAACCTGGGATGTTTGATGGTAAAGTGATCGTGTCTGGCACGCCAGATCAGATTCGGGCTGCCCAGAGACTTGTCCACGCCTTCATCCTGTGCGGGAGGACACAACAATCCTCATGA

mRNA sequence

ATGGACACCAACTTCAGCCCCAAACGTCCATCCGACGCCATCTCCGATCCCAACCTCTCCGCCGGTCGTTCCGTCCGCCCCAGACAATTACCTCCGCCACCGCCTGTATCCACCACTGCACCTGCGACATTAGGTTTAAACTCTACCTCCAGAGATTCCCCTGTTCTTAAACCCTCGTCTCCCTCCGATACCCTCTTTCGCCTCCTTTGCCCAGCCTCCAAGGCCGCCTCCATCCTTCGCCATCTCTGCGATATTCCGGGCGCTAGGATCCACATCGATGAACCTCTTCCTTCATGCGACGAGTGCGTCATTGTCATCCTCGCCGGCTCCCCATCCAAACCCGCGCTCACTAATCCAGGAAATGATAGAGAATTCCGTGAGCACGATATCAGTCGCAACGTCTCTAGTGATGCGGTTGCTGGAAATTCGGACGAGCGGTCGCAGCTGCTGCTTCGAATTTTTGAGAGTATGATTAGGATGAACGAGGATAGTGGAGAGAACCAAGACGATCGGATTACCGGTGGAGAAACTGACGGGTTGGTGGTCTGCAGGTTGCTGGCGCCGAGTCATCAGGTAGGGCGCGTGCTTGGTAGAGGAGGCAAAACTGTGGAGAAGATTAGACAGGAGAGTACGGCTCAAGTAAAGATTTTTCCCAAGGATCAAATTCCCGCGTGTGCGTCACCACGTGACGAGTTAATTCAAATATCAGGAAGCTTTACGGCGGTGAAGAAGGCCCTTGTTTCTGTTTCCGCCTGTCTTCAGGATAGCCTGAGGGTGGACTCGAGCAACTCTTCCACCGCAAAACCATTAGGGCCCACCTCCCATGCAAGTTGTTGTTTGCCGGTACAAGACGAAGAACCTTCTCCTATGAGGAGATATATTAGCCATCATAATGCAGATTATCGCCCAAGGGGTTATTCTTCCATACCAGGACACGATAATGTTGGAGCTGGTCAAAGAGCGGCTATGGAAGAGGATGTAGTGTTTAGATTGTTGTGTCAACCCGACAAAGTTGGAAGTCTAATTGGGAAAGGTGGCACTATAGTACGAGCTTTGCAAAGTGAAACGGGTGCATCTATAAAGATAGTTGATACACCCGACCTGGATGAACGCGTGGTTATGATATCTGCAAGAGAGAGTTTGGAGCAAACATATTCTCCTGCACAAGAAGCCGTTATACGTGTACATTGTCGAATTGCAGAAAATGGATATGAACCAGGTGTTGCAGTTGTTGCTAGACTACTTGTACATGCACACCAGATAGGTTATTTGGTGGGTAGAGGAGGCCATATTATTAATGAAATGAGAAGAGGGACCGGTACTAGCATACAGATATTTCCCCGGGAGCAAATTCAAAACAGTGGGGCTGTGAATGACGAAGTGGTGCAGGTCATTGGAAGTTTGCAGTCTGTACAGGATGCTCTGTTTCATATAACAAATCGACTTAGAGATACTTTATTCCCAATGAGACCACACATGCCAAATTTCAATACTCCATCTTACTTGTCTCCCCACCCTGAAACACCTCCCCCCTTATTTAGGCCGGGAAATAATGCACATTCTCCTGGATGCTATCCTTCTCAAGCTGGAGCTCTTCGTGGGACTGAACGTCTTCATTTCCATTCCCACCCTCTCGATCATCAACCTGTATATTCTCATAGTACGAATTTTGGTGGTAATAACATGGATGGAGTACATTATCCCCATGGCATTGAGAGACCATCTCCTAGATCGTGGATGAGTCAGGTTAGTAGCGAGATACCTAAGGCAGCAACTGATGTTGGCTTTGGCATGGTTTCAAGGAATGAGTCCTACAGCAGCGGGGGTCCAGCTCATTTTCTGGGAGGCACATCTATGGAGATAGTGATTCCACAAACTTTAATCTGCCACATTTATGGAGAGAACAACACTAACATTGCTCATGTTCAGCAGATATCAGGGGCAATGGTGGTGGTTCATGATGCTAAACCTGGGATGTTTGATGGTAAAGTGATCGTGTC
TGGCACGCCAGATCAGATTCGGGCTGCCCAGAGACTTGTCCACGCCTTCATCCTGTGCGGGAGGACACAACAATCCTCATGA

Coding sequence (CDS)

ATGGACACCAACTTCAGCCCCAAACGTCCATCCGACGCCATCTCCGATCCCAACCTCTCCGCCGGTCGTTCCGTCCGCCCCAGACAATTACCTCCGCCACCGCCTGTATCCACCACTGCACCTGCGACATTAGGTTTAAACTCTACCTCCAGAGATTCCCCTGTTCTTAAACCCTCGTCTCCCTCCGATACCCTCTTTCGCCTCCTTTGCCCAGCCTCCAAGGCCGCCTCCATCCTTCGCCATCTCTGCGATATTCCGGGCGCTAGGATCCACATCGATGAACCTCTTCCTTCATGCGACGAGTGCGTCATTGTCATCCTCGCCGGCTCCCCATCCAAACCCGCGCTCACTAATCCAGGAAATGATAGAGAATTCCGTGAGCACGATATCAGTCGCAACGTCTCTAGTGATGCGGTTGCTGGAAATTCGGACGAGCGGTCGCAGCTGCTGCTTCGAATTTTTGAGAGTATGATTAGGATGAACGAGGATAGTGGAGAGAACCAAGACGATCGGATTACCGGTGGAGAAACTGACGGGTTGGTGGTCTGCAGGTTGCTGGCGCCGAGTCATCAGGTAGGGCGCGTGCTTGGTAGAGGAGGCAAAACTGTGGAGAAGATTAGACAGGAGAGTACGGCTCAAGTAAAGATTTTTCCCAAGGATCAAATTCCCGCGTGTGCGTCACCACGTGACGAGTTAATTCAAATATCAGGAAGCTTTACGGCGGTGAAGAAGGCCCTTGTTTCTGTTTCCGCCTGTCTTCAGGATAGCCTGAGGGTGGACTCGAGCAACTCTTCCACCGCAAAACCATTAGGGCCCACCTCCCATGCAAGTTGTTGTTTGCCGGTACAAGACGAAGAACCTTCTCCTATGAGGAGATATATTAGCCATCATAATGCAGATTATCGCCCAAGGGGTTATTCTTCCATACCAGGACACGATAATGTTGGAGCTGGTCAAAGAGCGGCTATGGAAGAGGATGTAGTGTTTAGATTGTTGTGTCAACCCGACAAAGTTGGAAGTCTAATTGGGAAAGGTGGCACTATAGTACGAGCTTTGCAAAGTGAAACGGGTGCATCTATAAAGATAGTTGATACACCCGACCTGGATGAACGCGTGGTTATGATATCTGCAAGAGAGAGTTTGGAGCAAACATATTCTCCTGCACAAGAAGCCGTTATACGTGTACATTGTCGAATTGCAGAAAATGGATATGAACCAGGTGTTGCAGTTGTTGCTAGACTACTTGTACATGCACACCAGATAGGTTATTTGGTGGGTAGAGGAGGCCATATTATTAATGAAATGAGAAGAGGGACCGGTACTAGCATACAGATATTTCCCCGGGAGCAAATTCAAAACAGTGGGGCTGTGAATGACGAAGTGGTGCAGGTCATTGGAAGTTTGCAGTCTGTACAGGATGCTCTGTTTCATATAACAAATCGACTTAGAGATACTTTATTCCCAATGAGACCACACATGCCAAATTTCAATACTCCATCTTACTTGTCTCCCCACCCTGAAACACCTCCCCCCTTATTTAGGCCGGGAAATAATGCACATTCTCCTGGATGCTATCCTTCTCAAGCTGGAGCTCTTCGTGGGACTGAACGTCTTCATTTCCATTCCCACCCTCTCGATCATCAACCTGTATATTCTCATAGTACGAATTTTGGTGGTAATAACATGGATGGAGTACATTATCCCCATGGCATTGAGAGACCATCTCCTAGATCGTGGATGAGTCAGGTTAGTAGCGAGATACCTAAGGCAGCAACTGATGTTGGCTTTGGCATGGTTTCAAGGAATGAGTCCTACAGCAGCGGGGGTCCAGCTCATTTTCTGGGAGGCACATCTATGGAGATAGTGATTCCACAAACTTTAATCTGCCACATTTATGGAGAGAACAACACTAACATTGCTCATGTTCAGCAGATATCAGGGGCAATGGTGGTGGTTCATGATGCTAAACCTGGGATGTTTGATGGTAAAGTGATCGTGTC
TGGCACGCCAGATCAGATTCGGGCTGCCCAGAGACTTGTCCACGCCTTCATCCTGTGCGGGAGGACACAACAATCCTCATGA

Protein sequence

MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPPPPPVSTTAPATLGLNSTSRDSPVLKPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPALTNPGNDREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGENQDDRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPVQDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAENGYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHGIERPSPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQQSS
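
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below assumes the standard nuclear genetic code (the page does not name a translation table); the helper name translate is hypothetical, and only short excerpts from the start and end of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the CDS and the last 12 bases (including the TGA stop).
print(translate("ATGGACACCAACTTCAGCCCCAAACGT"))  # MDTNFSPKR
print(translate("CAATCCTCATGA"))                 # QSS
```

Both outputs agree with the first nine and last three residues of the protein sequence listed above.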
Homology
BLAST of Tan0008767 vs. ExPASy Swiss-Prot
Match: F4KDN0 (KH domain-containing protein HEN4 OS=Arabidopsis thaliana OX=3702 GN=HEN4 PE=1 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 2.8e-62
Identity = 223/837 (26.64%), Positives = 348/837 (41.58%), Query Frame = 0

Query: 66  FRLLCPAS-------KAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL-- 125
           FRLLCP S       K+ ++++ L    GA+I ++EP     + VI I+A + SK  +  
Sbjct: 49  FRLLCPLSHVGAVIGKSGNVIKQLQQSTGAKIRVEEPPSGSPDRVITIIAQADSKSRVKL 108

Query: 126 --TNPGN-DREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGENQDDRIT 185
              N GN + E +E ++  + +  A           L+++FE ++    DS         
Sbjct: 109 GANNNGNAEGEKKEEEVEVSKAQGA-----------LIKVFE-LLAAEADS--------- 168

Query: 186 GGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACASPRDELI 245
                  VVCRLL  S   G V+G+GG+ V  IR+E+  ++ I   + +P CA   DE++
Sbjct: 169 -----DTVVCRLLTESSHAGAVIGKGGQMVGSIRKETGCKISI-RIENLPICADTDDEMV 228

Query: 246 QISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPL----------------------- 305
           ++ G+  AVKKALVS+S CLQ+   +D       +PL                       
Sbjct: 229 EVEGNAIAVKKALVSISRCLQNCQSIDKVRMVGNRPLEKEFQASLHRPIETIIQESLPRS 288

Query: 306 ------------------GPTSHASCCLP--------------------VQDEEPSPMRR 365
                             G  + A+  +P                    ++ +    +RR
Sbjct: 289 VEVNPYDYRLRNDEIFPRGTVARANDVIPHDTLHLRRIEAVPQGALRMHIEADRQDVLRR 348

Query: 366 YI----------------------------------------SH---------------- 425
           ++                                        SH                
Sbjct: 349 HVEADRQDALRRRIDVVPQETLYMPSDVLRGDCFRQHRERDDSHDSLHRPFEMVPRDAMG 408

Query: 426 -------------------------HNADYRPRGYSSIPGH-----DNVGAGQRAAME-- 485
                                     +ADY    YS++  H      +      A M+  
Sbjct: 409 MPFESFPRDAYGRPIETMTQETLRGQSADYLAHRYSTLDTHPHSFTTSASMANTATMKPP 468

Query: 486 --------EDVVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDT-PDLDERVVMI 545
                   +DVVF++LC  +  G +IG GG +VR L SETGA I + +T  D +ER++ +
Sbjct: 469 PSEVEVGNQDVVFKILCSTENAGGVIGTGGKVVRMLHSETGAFINVGNTLDDCEERLIAV 528

Query: 546 SARESLEQTYSPAQEAVIRVHCR--------IAENGYEPGVAVVARLLVHAHQIGYLVGR 605
           +A E+ E   SPAQ+A++ +  R        I +NG  P  ++ ARL+V   QIG ++G+
Sbjct: 529 TASENPECQSSPAQKAIMLIFSRLFELATNKILDNG--PRSSITARLVVPTSQIGCVLGK 588

Query: 606 GGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLF 665
           GG I++EMR+ TG +IQI   EQ     + ND+VVQ+ G   +V++A+FHIT+RLRD++F
Sbjct: 589 GGVIVSEMRKTTGAAIQILKVEQNPKCISENDQVVQITGEFPNVREAIFHITSRLRDSVF 648

Query: 666 PMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSH---- 693
                     + S L+    T     R  +N  S G + S +     +  LH  S     
Sbjct: 649 SNSMKNSLAKSSSALT----TERFYDRQSDNPLSIGSHQSVSNPATNSSSLHRRSEDSFL 708
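
The percentages on each HSP summary line follow from the reported counts: identical (or positive-scoring) positions divided by the alignment length. The sketch below parses such a line and recomputes both values. The function name parse_hsp_stats and the returned keys are hypothetical, and the pattern is written to accept either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 223/837 (26.64%), Positives = 348/837 (41.58%)".
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length, positives, _ = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 223/837 (26.64%), Positives = 348/837 (41.58%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # 26.64 41.58
```

The recomputed values match the percentages reported for the HEN4 hit, which confirms that both are taken over the full alignment length of 837 columns, gaps included.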

BLAST of Tan0008767 vs. ExPASy Swiss-Prot
Match: Q8W4B1 (RNA-binding KH domain-containing protein RCF3 OS=Arabidopsis thaliana OX=3702 GN=RCF3 PE=1 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 9.1e-45
Identity = 167/656 (25.46%), Positives = 296/656 (45.12%), Query Frame = 0

Query: 58  PSSPSDTLFRLLCPASKA-------ASILRHLCDIPGARIHIDEPLPSCDECVIVILAGS 117
           P + + T +R+LC  +KA        +I++ +    GA I++ E +P   E +I I    
Sbjct: 62  PETMATTTYRILCHDAKAGGVIGKSGTIIKSIRQHTGAWINVHELVPGDAERIIEISDNR 121

Query: 118 PSKP----ALTNPGNDREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGE 177
              P       +P  +  F  HD                      RI ES  +       
Sbjct: 122 RRDPDGRMPSFSPAQEALFSVHD----------------------RILESEAQFGYGGPP 181

Query: 178 NQDDRITGG--ETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKD-QIP 237
            +++   GG     G VV RL+     VG +LG+GGK +E++R E+   ++I P++  +P
Sbjct: 182 PEEEEDYGGVRPGGGRVVTRLVVSRMHVGCLLGKGGKIIEQMRIETKTHIRILPRESNLP 241

Query: 238 ACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPVQ 297
            C S  +E++QI G   AVK AL  VS+ L++S   D SN          S A+      
Sbjct: 242 RCVSLSEEIVQIVGELNAVKNALAIVSSRLRESQHRDRSNFQGRSHSPERSFAA----AG 301

Query: 298 DEEPSPMRRYISHH--NADYRPRGYSS-----------IPGHDNVGAGQRAAMEEDVVFR 357
           D+    +RR  S      ++R   +SS           +P  +NV         E++VF+
Sbjct: 302 DDYMPQLRRQSSDRFPRGNFRNNNFSSRQSNYAEEAPAVPVGENV-------YSEELVFQ 361

Query: 358 LLCQPDKVGSLIGKGGTIVRALQSETGASIKIVD-TPDLDERVVMISARESLEQTYSPAQ 417
           +LC  DK+  ++G+   I+  LQ+E G  +++ D     DE+++ IS+ E+ +  + PAQ
Sbjct: 362 ILCPADKIVRVVGESQGIIDLLQNEIGVDVRVSDPVAGSDEQIITISSEEAPDDPFFPAQ 421

Query: 418 EAVIRVHCRIAENGYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPRE 477
           EA++ +  +I +   +    +  RLLV +     L G+ G  ++E+ R TGTS+QI  RE
Sbjct: 422 EALLHIQTQIIDLIPDKDNLITTRLLVPSRDSICLEGKAGS-VSEISRLTGTSVQILARE 481

Query: 478 QIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETP 537
           +I    ++ND V+Q+ G +++ ++AL  +T  LR  +F              LS   ETP
Sbjct: 482 EIPRCASINDVVIQITGEIRAAREALVELTLLLRSHMF------------KELS-QKETP 541

Query: 538 PPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPH 597
           P       +  + G     AG +   E    ++     +   S + N    +     +  
Sbjct: 542 PA------STSTTGPLEGVAGVM---EVASSNNTIQSREGPTSSNLNLQQVSTILPQFKE 601

Query: 598 GIERPSPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLIC 657
           G    + ++  S+   E+P   + +   +V+R               +++E+V+P+ ++ 
Sbjct: 602 GFGSVA-KAGESEHREEVPVTTSRMAVPLVTR---------------STLEVVLPEAVVP 645

Query: 658 HIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFIL 686
            +  ++   +A + + SGA V + + +P      + +SGTP+Q   AQ L+  FIL
Sbjct: 662 KLVTKSRNKLAQISEWSGASVTIVEDRPEETQNIIRISGTPEQAERAQSLLQGFIL 645

BLAST of Tan0008767 vs. ExPASy Swiss-Prot
Match: P58223 (KH domain-containing protein At4g18375 OS=Arabidopsis thaliana OX=3702 GN=At4g18375 PE=2 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 5.0e-35
Identity = 159/661 (24.05%), Positives = 289/661 (43.72%), Query Frame = 0

Query: 65  LFRLLCP-------ASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPALT 124
           ++R+LCP         K+  ++  +     A+I + + L  C + VI I      K    
Sbjct: 37  VYRILCPIDVVGGVIGKSGKVINAIRHNTKAKIKVFDQLHGCSQRVITIYCSVKEK---- 96

Query: 125 NPGNDREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGENQDDRITGGET 184
                    + +I    S +     + +    LL+++++++  +E++        T  + 
Sbjct: 97  ---------QEEIGFTKSENEPLCCAQD---ALLKVYDAIVASDEENNTK-----TNVDR 156

Query: 185 DGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPK---DQIPACASPRDELIQ 244
           D    CRLL P  Q   ++G+ G+ +++IR+ + A VK+  K   D    CA   D ++ 
Sbjct: 157 DDNKECRLLVPFSQSSSLIGKAGENIKRIRRRTRASVKVVSKDVSDPSHVCAMEYDNVVV 216

Query: 245 ISGSFTAVKKALVSVSACL-----QDSLRVDSSNSS--TAKPLGPTSHASCCLPV----- 304
           ISG   +VK+AL +VSA +     ++++ +DS++     A  + P+  ++   P      
Sbjct: 217 ISGEPESVKQALFAVSAIMYKINPRENIPLDSTSQDVPAASVIVPSDLSNSVYPQTGFYS 276

Query: 305 -QDEEPSPMRRYISHHNA----DYR----------PRGYSSIPGHDNVGAGQRAAMEEDV 364
            QD          S+ NA    D++          P   SS+P     G   R+   E++
Sbjct: 277 NQDHILQQGAGVPSYFNALSVSDFQGYAETAANPVPVFASSLPVTHGFGGSSRS---EEL 336

Query: 365 VFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDTPDL---DERVVMISARESLEQT 424
           VF++LC    +  +IGKGG+ ++ ++  +G+ I++ D+      DE V++++A ES +  
Sbjct: 337 VFKVLCPLCNIMRVIGKGGSTIKRIREASGSCIEVNDSRTKCGDDECVIIVTATESPDDM 396

Query: 425 YSPAQEAVIRVHCRIAENGYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQ 484
            S A EAV+ +   I +   E    V  +LLV +  IG ++G+ G +INE+R+ T  +I 
Sbjct: 397 KSMAVEAVLLLQEYIND---EDAENVKMQLLVSSKVIGCVIGKSGSVINEIRKRTNANIC 456

Query: 485 IFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSP 544
           I        S    D++V+V G + SV+DAL  I  RLR+ +   +  +     P+    
Sbjct: 457 I--------SKGKKDDLVEVSGEVSSVRDALIQIVLRLREDVLGDKDSVATRKPPA---- 516

Query: 545 HPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDG 604
                                         T+   F S               G +N   
Sbjct: 517 -----------------------------RTDNCSFLS---------------GSSNA-- 576

Query: 605 VHYPHGIERPSPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIP 664
                G   PS  S M+  S      +   G  ++     YS G        +++EI+IP
Sbjct: 577 -----GYTLPSFMSSMASTSGFHGYGSFPAGDNVLGSTGPYSYG---RLPSSSALEILIP 604

Query: 665 QTLICHIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFI 686
              +  + G+   N+ ++++ISGAM+ +  +K    D   ++SGT +Q+R A+ LV AF+
Sbjct: 637 AHAMSKVMGKGGGNLENIRRISGAMIEISASKTSHGDHIALLSGTLEQMRCAENLVQAFV 604

BLAST of Tan0008767 vs. ExPASy Swiss-Prot
Match: Q15366 (Poly(rC)-binding protein 2 OS=Homo sapiens OX=9606 GN=PCBP2 PE=1 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 4.9e-14
Identity = 83/360 (23.06%), Positives = 154/360 (42.78%), Query Frame = 0

Query: 327 VVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYS 386
           +  RLL    +VGS+IGK G  V+ ++ E+GA I I +    +  + +     ++ + ++
Sbjct: 14  LTIRLLMHGKEVGSIIGKKGESVKKMREESGARINISEGNCPERIITLAGPTNAIFKAFA 73

Query: 387 PAQEAVIR-VHCRIAENGYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQI 446
              + +   +   +  +       V  RL+V A Q G L+G+GG  I E+R  TG  +Q+
Sbjct: 74  MIIDKLEEDISSSMTNSTAASRPPVTLRLVVPASQCGSLIGKGGCKIKEIRESTGAQVQV 133

Query: 447 FPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPH 506
              + + NS    +  + + G  QS+ + +  I   + +TL       P         P 
Sbjct: 134 -AGDMLPNS---TERAITIAGIPQSIIECVKQICVVMLETL----SQSPPKGVTIPYRPK 193

Query: 507 PETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGV 566
           P + P +F  G + +S G   S + +   T      +  L+  P+ +++          +
Sbjct: 194 PSSSPVIFAGGQDRYSTG---SDSASFPHTTPSMCLNPDLEGPPLEAYT----------I 253

Query: 567 HYPHGIERP--SPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSG--GPAHFLGGTSMEI 626
              + I +P  +    ++   S  P    + GF  +  +     G  G       TS E+
Sbjct: 254 QGQYAIPQPDLTKLHQLAMQQSHFPMTHGNTGFSGIESSSPEVKGYWGLDASAQTTSHEL 313

Query: 627 VIPQTLICHIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVH 682
            IP  LI  I G     I  ++Q+SGA + + +   G  D +V ++G+   I  AQ L++
Sbjct: 314 TIPNDLIGCIIGRQGAKINEIRQMSGAQIKIANPVEGSTDRQVTITGSAASISLAQYLIN 352

BLAST of Tan0008767 vs. ExPASy Swiss-Prot
Match: P57721 (Poly(rC)-binding protein 3 OS=Homo sapiens OX=9606 GN=PCBP3 PE=1 SV=2)

HSP 1 Score: 80.1 bits (196), Expect = 1.1e-13
Identity = 90/361 (24.93%), Positives = 158/361 (43.77%), Query Frame = 0

Query: 327 VVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDTPDLDERVVMISA-RESLEQTY 386
           +  RLL    +VGS+IGK G  V+ ++ E+GA I I +  +  ER+V I+   +++ + +
Sbjct: 46  LTIRLLMHGKEVGSIIGKKGETVKKMREESGARINISE-GNCPERIVTITGPTDAIFKAF 105

Query: 387 S----PAQEAVIRVHCRIAENGYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGT 446
           +      +E +I            P   V  RL+V A Q G L+G+GG  I E+R  TG 
Sbjct: 106 AMIAYKFEEDIINSMSNSPATSKPP---VTLRLVVPASQCGSLIGKGGSKIKEIRESTGA 165

Query: 447 SIQIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSY 506
            +Q+   + + NS    +  V + G+     DA+     ++      M    P   T  Y
Sbjct: 166 QVQV-AGDMLPNS---TERAVTISGT----PDAIIQCVKQI---CVVMLESPPKGATIPY 225

Query: 507 LSPHPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNN 566
             P P + P +F  G           QA  ++G   +    HP          T      
Sbjct: 226 -RPKPASTPVIFAGG-----------QAYTIQGQYAI---PHP-------DQLTKLHQLA 285

Query: 567 MDGVHYPHGIERPSPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEI 626
           M    +P     P  ++  +    ++P  +++    ++ ++    +  PA     ++ E+
Sbjct: 286 MQQTPFP-----PLGQTNPAFPGEKLPLHSSEEAQNLMGQSSGLDASPPA-----STHEL 345

Query: 627 VIPQTLICHIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVH 683
            IP  LI  I G   T I  ++Q+SGA + + +A  G  + ++ ++GTP  I  AQ L++
Sbjct: 346 TIPNDLIGCIIGRQGTKINEIRQMSGAQIKIANATEGSSERQITITGTPANISLAQYLIN 359

BLAST of Tan0008767 vs. NCBI nr
Match: XP_038898275.1 (KH domain-containing protein HEN4 [Benincasa hispida])

HSP 1 Score: 1205.7 bits (3118), Expect = 0.0e+00
Identity = 614/717 (85.63%), Positives = 644/717 (89.82%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPPPPPV----STTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRPSDAISDPNL  GRSVRPRQLPPPPP+    S TAPATLGLNSTSRD+PVL
Sbjct: 1   MDTNFSPKRPSDAISDPNLPTGRSVRPRQLPPPPPLPPPASATAPATLGLNSTSRDTPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK  SILRHLCDIPG RIHIDEPLPSCDEC+IVILA SPSKP+L
Sbjct: 61  KPSSPSDTLFRLLCPASKVPSILRHLCDIPGTRIHIDEPLPSCDECIIVILADSPSKPSL 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDER---SQLLLRIFESMIRMNEDSGENQD---- 180
           TN GNDREF EHDISRNVSSDAVAG+SDER    Q LLR FE+++RMNEDS ENQ+    
Sbjct: 121 TNRGNDREFSEHDISRNVSSDAVAGDSDERLQAQQALLRTFENIVRMNEDSEENQEMQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGET+GLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETEGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISGS  AV KAL SVSACLQD+ R+DSSNSS+ KPLGPTSH++ C+P+
Sbjct: 241 PACASPQDELIQISGSLPAVMKALSSVSACLQDNPRMDSSNSSSTKPLGPTSHSN-CMPL 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP R+Y SHHNADYR R YSSIPGH+NVGAG RAAMEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPKRKYASHHNADYRSRSYSSIPGHENVGAGPRAAMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGTIVRALQ+ETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAE 
Sbjct: 361 GKGGTIVRALQNETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG  VVARLLVH  QIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSG VNDEVV
Sbjct: 421 GYEPGAPVVARLLVHGQQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGPVNDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSP 540
           QVIG+LQSVQDALFHITNRLRDTLFPMRPH+PNFN PSYLSP P TPPPLFRPGNNAHSP
Sbjct: 481 QVIGNLQSVQDALFHITNRLRDTLFPMRPHVPNFNNPSYLSPLPGTPPPLFRPGNNAHSP 540

Query: 541 GCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHGIERPS------- 600
           GCYPSQAGAL G ERL FHSHPLDHQP YSH+ +FGGNNMDGV YPHGIERP        
Sbjct: 541 GCYPSQAGALHGMERLPFHSHPLDHQPPYSHNISFGGNNMDGVPYPHGIERPGPGSFERP 600

Query: 601 PRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYGEN 660
           PRSW  QVSSEIPK  TDVGFGMVSRNESYSSGGPAHF+GGTSME+VIPQTLICHIYGEN
Sbjct: 601 PRSWTPQVSSEIPKGPTDVGFGMVSRNESYSSGGPAHFMGGTSMEMVIPQTLICHIYGEN 660

Query: 661 NTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQQS 693
           + NIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCG+TQ S
Sbjct: 661 SNNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGKTQPS 716

BLAST of Tan0008767 vs. NCBI nr
Match: KAA0059338.1 (KH domain-containing protein [Cucumis melo var. makuwa] >TYK03987.1 KH domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1158.3 bits (2995), Expect = 0.0e+00
Identity = 593/716 (82.82%), Positives = 637/716 (88.97%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPP----PPPVSTTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRP DAISD NLSAGRSVRPRQLPP    PPPVSTTAPATLGL++ SRD+PVL
Sbjct: 1   MDTNFSPKRPPDAISDHNLSAGRSVRPRQLPPPPPLPPPVSTTAPATLGLDTISRDAPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK +SILRHL DI GAR+HIDE LPSCDECV+VILAGSPSKPA 
Sbjct: 61  KPSSPSDTLFRLLCPASKVSSILRHLRDISGARVHIDESLPSCDECVLVILAGSPSKPAH 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD---- 180
           TNPGNDREFREHD++RNVSSD VAG+SDERS   Q LLR FES++RMNEDSGENQ+    
Sbjct: 121 TNPGNDREFREHDVNRNVSSDTVAGDSDERSQAQQALLRTFESIVRMNEDSGENQEIQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGETDGLVVCRLLAPSHQVGRVLGRGG+TVEKIRQES A VKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETDGLVVCRLLAPSHQVGRVLGRGGRTVEKIRQESMAHVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISG+F AV KAL SVS+CLQD+ RVDSSNSS+ K  GPTSHAS  +PV
Sbjct: 241 PACASPQDELIQISGNFPAVMKALASVSSCLQDNPRVDSSNSSSTKSSGPTSHAS-SMPV 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP RRY SHHNADYR RGYSSIPGH+NVGAG RA+MEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPRRRYGSHHNADYRSRGYSSIPGHENVGAGPRASMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGT+VRALQ+ETGASIKIVDTPDLDER+V+ISARESLEQTYSPAQEAVIRVHCRIAE 
Sbjct: 361 GKGGTVVRALQNETGASIKIVDTPDLDERLVVISARESLEQTYSPAQEAVIRVHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG AVVARLLVH  QIGYLVGRGGHIIN+MRRGTGTSIQIFPR+QIQN+G +NDEVV
Sbjct: 421 GYEPGAAVVARLLVHGQQIGYLVGRGGHIINDMRRGTGTSIQIFPRDQIQNNGPMNDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSP 540
           QVIG+L SVQDALFHITNR+RDT FPMRPH+PNFN P YLSPHPET PPLFRPG+NAHSP
Sbjct: 481 QVIGNLPSVQDALFHITNRIRDTFFPMRPHVPNFNNPPYLSPHPET-PPLFRPGSNAHSP 540

Query: 541 GCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHGI--------ERP 600
           G YPSQAG LRGTERL +HSHPLDHQP Y H+  FGGNNMDGV YPHG+        ERP
Sbjct: 541 GYYPSQAGTLRGTERLPYHSHPLDHQPAYPHNVGFGGNNMDGVPYPHGMERPGPGPFERP 600

Query: 601 SPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYGE 660
           SPRSW SQVSSEIPK  TD G+GMVSRNESY SGG  HF+GGTSME+VIPQTLICHIYGE
Sbjct: 601 SPRSWTSQVSSEIPKGPTD-GYGMVSRNESYGSGG-THFMGGTSMEMVIPQTLICHIYGE 660

Query: 661 NNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           NN NIAHVQQISGAM+VVHDAKPGMFDGKVI+SGTP+QIRAAQRLVHAFILCG+TQ
Sbjct: 661 NNNNIAHVQQISGAMLVVHDAKPGMFDGKVIMSGTPEQIRAAQRLVHAFILCGKTQ 712

BLAST of Tan0008767 vs. NCBI nr
Match: XP_004141829.1 (KH domain-containing protein HEN4 [Cucumis sativus] >KGN45417.1 hypothetical protein Csa_015586 [Cucumis sativus])

HSP 1 Score: 1157.5 bits (2993), Expect = 0.0e+00
Identity = 594/718 (82.73%), Positives = 638/718 (88.86%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPP----PPPVSTTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRP DAISDP+LSAGRSVRPRQLPP    PPPVSTTAPATLGL++T+RD+PVL
Sbjct: 1   MDTNFSPKRPPDAISDPSLSAGRSVRPRQLPPPPPLPPPVSTTAPATLGLDTTTRDAPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK +SILRHL DIPGARIH+DEPLPSC+ECV+VILAGSPSKPA 
Sbjct: 61  KPSSPSDTLFRLLCPASKVSSILRHLRDIPGARIHVDEPLPSCEECVLVILAGSPSKPAH 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD---- 180
           TNPGNDREFREHD+ RNVSSD VAG+SDERS   Q LLR FES++RMNEDSGENQ+    
Sbjct: 121 TNPGNDREFREHDVHRNVSSDTVAGDSDERSQAQQALLRTFESIVRMNEDSGENQEIQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQES A VKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESMAHVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISG+F+AV KAL SVS+CLQDS RVDSSNSS+ K LGPTSHAS  + V
Sbjct: 241 PACASPQDELIQISGNFSAVMKALSSVSSCLQDSPRVDSSNSSSTKSLGPTSHAS-SMSV 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP RRY SHHNADYR R YSSIPGH+N GAG RAAMEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPRRRYGSHHNADYRSRSYSSIPGHENAGAGPRAAMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGT+VRALQ+ETGASIKIVDTPDLDER+V+ISARE+LEQTYSPAQEAVIR HCRIAE 
Sbjct: 361 GKGGTVVRALQNETGASIKIVDTPDLDERLVVISARETLEQTYSPAQEAVIRAHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG AVVARLLVH  QIGYLVGRGGHIIN+MRRGTGTSIQIFPR+QIQN G ++DEVV
Sbjct: 421 GYEPGAAVVARLLVHGQQIGYLVGRGGHIINDMRRGTGTSIQIFPRDQIQNGGPMSDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNT-PSYLSPHPETPPPLFRPGNNAHS 540
           QVIG+L SVQDALFHITNR+RDT FPMRPH+PNFN  P YLSPHPETPPPLFRPG+NAHS
Sbjct: 481 QVIGNLPSVQDALFHITNRIRDTFFPMRPHVPNFNNHPPYLSPHPETPPPLFRPGSNAHS 540

Query: 541 PGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNF-GGNNMDGVHYPHGI--------E 600
           PG YPSQAG LRGTER  +HSHPLDHQP Y H+ +F GGNNMDGV YPHG+        E
Sbjct: 541 PGYYPSQAGGLRGTERPPYHSHPLDHQPAYPHNVSFGGGNNMDGVPYPHGMERPGPGSFE 600

Query: 601 RPSPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIY 660
           RPSPRSW SQVSSEIPK  TD GFGMVSRNE Y SGGP HF+GGTSME+VIPQTLICHIY
Sbjct: 601 RPSPRSWTSQVSSEIPKGPTD-GFGMVSRNEPYGSGGP-HFMGGTSMEMVIPQTLICHIY 660

Query: 661 GENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           GENN NIAHVQQISGAM+VVHDAKPGMFDGKVI+SGTPDQIRAAQRLVHAFILCG+TQ
Sbjct: 661 GENNNNIAHVQQISGAMLVVHDAKPGMFDGKVIMSGTPDQIRAAQRLVHAFILCGKTQ 715

BLAST of Tan0008767 vs. NCBI nr
Match: XP_008462207.1 (PREDICTED: KH domain-containing protein At4g18375 [Cucumis melo])

HSP 1 Score: 1156.4 bits (2990), Expect = 0.0e+00
Identity = 594/717 (82.85%), Positives = 639/717 (89.12%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPP----PPPVSTTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRP DAISD NLSAGRSVRPRQLPP    PPPVSTTAPATLGL++ SRD+PVL
Sbjct: 1   MDTNFSPKRPPDAISDHNLSAGRSVRPRQLPPPPPLPPPVSTTAPATLGLDTISRDAPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK +SILRHL DIPGAR+HIDE LPSCDECV+VILAGSPSKPA 
Sbjct: 61  KPSSPSDTLFRLLCPASKVSSILRHLRDIPGARVHIDESLPSCDECVLVILAGSPSKPAH 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD---- 180
           TNPGNDREFREHD++RNVSSD VAG+SDERS   Q LLR FES++RMNEDSGENQ+    
Sbjct: 121 TNPGNDREFREHDVNRNVSSDTVAGDSDERSQAQQALLRTFESIVRMNEDSGENQEIQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGETDGLVVCRLLAPSHQVGRVLGRGG+TVEKIRQES A VKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETDGLVVCRLLAPSHQVGRVLGRGGRTVEKIRQESMAHVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISG+F AV KAL SVS+CLQD+ RVDSSNSS+ K  GPTSHAS  +PV
Sbjct: 241 PACASPQDELIQISGNFPAVMKALSSVSSCLQDNPRVDSSNSSSTKSSGPTSHAS-SMPV 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP RRY SHHNADYR RGYSSIPGH+NVGAG RA+MEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPRRRYGSHHNADYRSRGYSSIPGHENVGAGPRASMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGT+VRALQ+ETGASIKIVDTPDLDER+V+ISARESLEQTYSPAQEAVIRVHCRIAE 
Sbjct: 361 GKGGTVVRALQNETGASIKIVDTPDLDERLVVISARESLEQTYSPAQEAVIRVHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG AVVARLLVH  QIGYLVGRGGHIIN+MRRGTGTSIQIFPR+QIQN+G + DEVV
Sbjct: 421 GYEPGAAVVARLLVHGQQIGYLVGRGGHIINDMRRGTGTSIQIFPRDQIQNNGPMIDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSP 540
           QVIG+L SVQDALFHITNR+RDT FPMRPH+PNFN P YLSPHPET PPLFRPG+NAHSP
Sbjct: 481 QVIGNLPSVQDALFHITNRIRDTFFPMRPHVPNFNNPPYLSPHPET-PPLFRPGSNAHSP 540

Query: 541 GCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHGI--------ERP 600
           G YPSQ GALRGTERL +HSHPLDHQP Y H+  FGGNNMDGV YPHG+        ERP
Sbjct: 541 GYYPSQTGALRGTERLPYHSHPLDHQPAYPHNVGFGGNNMDGVPYPHGMERPGPGSFERP 600

Query: 601 SPRSWMSQV-SSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYG 660
           SPRSW SQV SSEIPK +TD G+GMVSRNESY SGGP HF+GGTSME+VIPQTLICHIYG
Sbjct: 601 SPRSWTSQVSSSEIPKGSTD-GYGMVSRNESYGSGGP-HFMGGTSMEMVIPQTLICHIYG 660

Query: 661 ENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           ENN NIAHVQQISGAM+VVHDAKPGMFDGKVI+SGTP+QIRAAQRLVHAFILCG+TQ
Sbjct: 661 ENNNNIAHVQQISGAMLVVHDAKPGMFDGKVIMSGTPEQIRAAQRLVHAFILCGKTQ 713

BLAST of Tan0008767 vs. NCBI nr
Match: KAG6575395.1 (KH domain-containing protein HEN4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1155.2 bits (2987), Expect = 0.0e+00
Identity = 599/717 (83.54%), Positives = 626/717 (87.31%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPPPPPVSTTAPATLGLNSTSRDSPVLKPSS 60
           MDT FSPKR SDAISDPNLSA RSVRPRQLP         PATLG NSTSRDSPVLKPSS
Sbjct: 1   MDTGFSPKRSSDAISDPNLSAARSVRPRQLP---------PATLGFNSTSRDSPVLKPSS 60

Query: 61  PSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPALTNPG 120
           PS+T+FRLLCPASKAASILRHLCDIPGARIHID+PLPSCDECVIVIL GSPSKPALTNPG
Sbjct: 61  PSETVFRLLCPASKAASILRHLCDIPGARIHIDDPLPSCDECVIVILTGSPSKPALTNPG 120

Query: 121 NDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD-------- 180
           NDREFREHD SRNV SDAVAG+ DERS   Q LLR FESM+RMNEDSGENQD        
Sbjct: 121 NDREFREHDNSRNVPSDAVAGDLDERSQAQQALLRTFESMVRMNEDSGENQDIQKKNADS 180

Query: 181 ---DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACA 240
              DRI+GGETDG VVCRLLAPSHQVGRVLGRGGK VEKIRQESTA VKIFPKDQIPACA
Sbjct: 181 APSDRISGGETDGSVVCRLLAPSHQVGRVLGRGGKNVEKIRQESTAHVKIFPKDQIPACA 240

Query: 241 SPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPVQDEE 300
           SPRDELIQISG F AVKKAL S+SACLQD+ RVDSSNSST KPLGP S AS  +P  DEE
Sbjct: 241 SPRDELIQISGRFPAVKKALSSISACLQDTPRVDSSNSSTTKPLGPPSQAS-SMPGLDEE 300

Query: 301 PSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLIGKGG 360
           PSP RRY SHHNAD+R RGYSSIPGH+N GAG R AMEE+VVFRLLCQPDKVGSLIGKGG
Sbjct: 301 PSPKRRYASHHNADHRSRGYSSIPGHENAGAGPRPAMEEEVVFRLLCQPDKVGSLIGKGG 360

Query: 361 TIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAENGYEP 420
           TIVRALQ+ETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAV+RVH RIAE GYEP
Sbjct: 361 TIVRALQNETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVMRVHGRIAELGYEP 420

Query: 421 GVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIG 480
           G AVVARLLVHA QIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSG VNDE+VQVIG
Sbjct: 421 GAAVVARLLVHAQQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGPVNDEIVQVIG 480

Query: 481 SLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYP 540
           +LQ VQDALFHITNRLRDTLFPMRPH+PNFN PSYLSPHPET PP+FRPGNNAHSPG YP
Sbjct: 481 NLQCVQDALFHITNRLRDTLFPMRPHVPNFNAPSYLSPHPET-PPVFRPGNNAHSPGYYP 540

Query: 541 SQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHG------------IERP 600
           SQAGA    +RL FHSHPLDHQP YSH  +F GNNMDGV YPHG            IERP
Sbjct: 541 SQAGA----QRLPFHSHPLDHQPAYSHGMSFSGNNMDGVPYPHGIDRPGPGPGPGSIERP 600

Query: 601 SPRSWMSQVSSEIPKA-ATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYG 660
           SPRSW SQ S++IPK   TDVGFGM SRNESYSSGGPAHF+GGTSME+VIPQTLICHIYG
Sbjct: 601 SPRSWTSQASNDIPKGPTTDVGFGMASRNESYSSGGPAHFVGGTSMEMVIPQTLICHIYG 660

Query: 661 ENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           ENN NIAHVQQISGAMVVVHDAKPG+FDGKV+VSGTPDQIRAAQRLVHAFILCG+TQ
Sbjct: 661 ENNANIAHVQQISGAMVVVHDAKPGIFDGKVVVSGTPDQIRAAQRLVHAFILCGKTQ 702

BLAST of Tan0008767 vs. ExPASy TrEMBL
Match: A0A5A7UW07 (KH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001850 PE=4 SV=1)

HSP 1 Score: 1158.3 bits (2995), Expect = 0.0e+00
Identity = 593/716 (82.82%), Positives = 637/716 (88.97%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPP----PPPVSTTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRP DAISD NLSAGRSVRPRQLPP    PPPVSTTAPATLGL++ SRD+PVL
Sbjct: 1   MDTNFSPKRPPDAISDHNLSAGRSVRPRQLPPPPPLPPPVSTTAPATLGLDTISRDAPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK +SILRHL DI GAR+HIDE LPSCDECV+VILAGSPSKPA 
Sbjct: 61  KPSSPSDTLFRLLCPASKVSSILRHLRDISGARVHIDESLPSCDECVLVILAGSPSKPAH 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD---- 180
           TNPGNDREFREHD++RNVSSD VAG+SDERS   Q LLR FES++RMNEDSGENQ+    
Sbjct: 121 TNPGNDREFREHDVNRNVSSDTVAGDSDERSQAQQALLRTFESIVRMNEDSGENQEIQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGETDGLVVCRLLAPSHQVGRVLGRGG+TVEKIRQES A VKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETDGLVVCRLLAPSHQVGRVLGRGGRTVEKIRQESMAHVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISG+F AV KAL SVS+CLQD+ RVDSSNSS+ K  GPTSHAS  +PV
Sbjct: 241 PACASPQDELIQISGNFPAVMKALASVSSCLQDNPRVDSSNSSSTKSSGPTSHAS-SMPV 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP RRY SHHNADYR RGYSSIPGH+NVGAG RA+MEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPRRRYGSHHNADYRSRGYSSIPGHENVGAGPRASMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGT+VRALQ+ETGASIKIVDTPDLDER+V+ISARESLEQTYSPAQEAVIRVHCRIAE 
Sbjct: 361 GKGGTVVRALQNETGASIKIVDTPDLDERLVVISARESLEQTYSPAQEAVIRVHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG AVVARLLVH  QIGYLVGRGGHIIN+MRRGTGTSIQIFPR+QIQN+G +NDEVV
Sbjct: 421 GYEPGAAVVARLLVHGQQIGYLVGRGGHIINDMRRGTGTSIQIFPRDQIQNNGPMNDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSP 540
           QVIG+L SVQDALFHITNR+RDT FPMRPH+PNFN P YLSPHPET PPLFRPG+NAHSP
Sbjct: 481 QVIGNLPSVQDALFHITNRIRDTFFPMRPHVPNFNNPPYLSPHPET-PPLFRPGSNAHSP 540

Query: 541 GCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHGI--------ERP 600
           G YPSQAG LRGTERL +HSHPLDHQP Y H+  FGGNNMDGV YPHG+        ERP
Sbjct: 541 GYYPSQAGTLRGTERLPYHSHPLDHQPAYPHNVGFGGNNMDGVPYPHGMERPGPGPFERP 600

Query: 601 SPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYGE 660
           SPRSW SQVSSEIPK  TD G+GMVSRNESY SGG  HF+GGTSME+VIPQTLICHIYGE
Sbjct: 601 SPRSWTSQVSSEIPKGPTD-GYGMVSRNESYGSGG-THFMGGTSMEMVIPQTLICHIYGE 660

Query: 661 NNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           NN NIAHVQQISGAM+VVHDAKPGMFDGKVI+SGTP+QIRAAQRLVHAFILCG+TQ
Sbjct: 661 NNNNIAHVQQISGAMLVVHDAKPGMFDGKVIMSGTPEQIRAAQRLVHAFILCGKTQ 712

BLAST of Tan0008767 vs. ExPASy TrEMBL
Match: A0A0A0KAK5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447790 PE=4 SV=1)

HSP 1 Score: 1157.5 bits (2993), Expect = 0.0e+00
Identity = 594/718 (82.73%), Positives = 638/718 (88.86%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPP----PPPVSTTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRP DAISDP+LSAGRSVRPRQLPP    PPPVSTTAPATLGL++T+RD+PVL
Sbjct: 1   MDTNFSPKRPPDAISDPSLSAGRSVRPRQLPPPPPLPPPVSTTAPATLGLDTTTRDAPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK +SILRHL DIPGARIH+DEPLPSC+ECV+VILAGSPSKPA 
Sbjct: 61  KPSSPSDTLFRLLCPASKVSSILRHLRDIPGARIHVDEPLPSCEECVLVILAGSPSKPAH 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD---- 180
           TNPGNDREFREHD+ RNVSSD VAG+SDERS   Q LLR FES++RMNEDSGENQ+    
Sbjct: 121 TNPGNDREFREHDVHRNVSSDTVAGDSDERSQAQQALLRTFESIVRMNEDSGENQEIQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQES A VKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESMAHVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISG+F+AV KAL SVS+CLQDS RVDSSNSS+ K LGPTSHAS  + V
Sbjct: 241 PACASPQDELIQISGNFSAVMKALSSVSSCLQDSPRVDSSNSSSTKSLGPTSHAS-SMSV 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP RRY SHHNADYR R YSSIPGH+N GAG RAAMEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPRRRYGSHHNADYRSRSYSSIPGHENAGAGPRAAMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGT+VRALQ+ETGASIKIVDTPDLDER+V+ISARE+LEQTYSPAQEAVIR HCRIAE 
Sbjct: 361 GKGGTVVRALQNETGASIKIVDTPDLDERLVVISARETLEQTYSPAQEAVIRAHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG AVVARLLVH  QIGYLVGRGGHIIN+MRRGTGTSIQIFPR+QIQN G ++DEVV
Sbjct: 421 GYEPGAAVVARLLVHGQQIGYLVGRGGHIINDMRRGTGTSIQIFPRDQIQNGGPMSDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNT-PSYLSPHPETPPPLFRPGNNAHS 540
           QVIG+L SVQDALFHITNR+RDT FPMRPH+PNFN  P YLSPHPETPPPLFRPG+NAHS
Sbjct: 481 QVIGNLPSVQDALFHITNRIRDTFFPMRPHVPNFNNHPPYLSPHPETPPPLFRPGSNAHS 540

Query: 541 PGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNF-GGNNMDGVHYPHGI--------E 600
           PG YPSQAG LRGTER  +HSHPLDHQP Y H+ +F GGNNMDGV YPHG+        E
Sbjct: 541 PGYYPSQAGGLRGTERPPYHSHPLDHQPAYPHNVSFGGGNNMDGVPYPHGMERPGPGSFE 600

Query: 601 RPSPRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIY 660
           RPSPRSW SQVSSEIPK  TD GFGMVSRNE Y SGGP HF+GGTSME+VIPQTLICHIY
Sbjct: 601 RPSPRSWTSQVSSEIPKGPTD-GFGMVSRNEPYGSGGP-HFMGGTSMEMVIPQTLICHIY 660

Query: 661 GENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           GENN NIAHVQQISGAM+VVHDAKPGMFDGKVI+SGTPDQIRAAQRLVHAFILCG+TQ
Sbjct: 661 GENNNNIAHVQQISGAMLVVHDAKPGMFDGKVIMSGTPDQIRAAQRLVHAFILCGKTQ 715

BLAST of Tan0008767 vs. ExPASy TrEMBL
Match: A0A1S3CGC9 (KH domain-containing protein At4g18375 OS=Cucumis melo OX=3656 GN=LOC103500622 PE=4 SV=1)

HSP 1 Score: 1156.4 bits (2990), Expect = 0.0e+00
Identity = 594/717 (82.85%), Positives = 639/717 (89.12%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPP----PPPVSTTAPATLGLNSTSRDSPVL 60
           MDTNFSPKRP DAISD NLSAGRSVRPRQLPP    PPPVSTTAPATLGL++ SRD+PVL
Sbjct: 1   MDTNFSPKRPPDAISDHNLSAGRSVRPRQLPPPPPLPPPVSTTAPATLGLDTISRDAPVL 60

Query: 61  KPSSPSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL 120
           KPSSPSDTLFRLLCPASK +SILRHL DIPGAR+HIDE LPSCDECV+VILAGSPSKPA 
Sbjct: 61  KPSSPSDTLFRLLCPASKVSSILRHLRDIPGARVHIDESLPSCDECVLVILAGSPSKPAH 120

Query: 121 TNPGNDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD---- 180
           TNPGNDREFREHD++RNVSSD VAG+SDERS   Q LLR FES++RMNEDSGENQ+    
Sbjct: 121 TNPGNDREFREHDVNRNVSSDTVAGDSDERSQAQQALLRTFESIVRMNEDSGENQEIQKK 180

Query: 181 -------DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQI 240
                  DRI+GGETDGLVVCRLLAPSHQVGRVLGRGG+TVEKIRQES A VKIFPKDQ 
Sbjct: 181 NADSAPNDRISGGETDGLVVCRLLAPSHQVGRVLGRGGRTVEKIRQESMAHVKIFPKDQN 240

Query: 241 PACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPV 300
           PACASP+DELIQISG+F AV KAL SVS+CLQD+ RVDSSNSS+ K  GPTSHAS  +PV
Sbjct: 241 PACASPQDELIQISGNFPAVMKALSSVSSCLQDNPRVDSSNSSSTKSSGPTSHAS-SMPV 300

Query: 301 QDEEPSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLI 360
           QDEEPSP RRY SHHNADYR RGYSSIPGH+NVGAG RA+MEEDVVFRLLCQPDKVGSLI
Sbjct: 301 QDEEPSPRRRYGSHHNADYRSRGYSSIPGHENVGAGPRASMEEDVVFRLLCQPDKVGSLI 360

Query: 361 GKGGTIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAEN 420
           GKGGT+VRALQ+ETGASIKIVDTPDLDER+V+ISARESLEQTYSPAQEAVIRVHCRIAE 
Sbjct: 361 GKGGTVVRALQNETGASIKIVDTPDLDERLVVISARESLEQTYSPAQEAVIRVHCRIAEI 420

Query: 421 GYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVV 480
           GYEPG AVVARLLVH  QIGYLVGRGGHIIN+MRRGTGTSIQIFPR+QIQN+G + DEVV
Sbjct: 421 GYEPGAAVVARLLVHGQQIGYLVGRGGHIINDMRRGTGTSIQIFPRDQIQNNGPMIDEVV 480

Query: 481 QVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSP 540
           QVIG+L SVQDALFHITNR+RDT FPMRPH+PNFN P YLSPHPET PPLFRPG+NAHSP
Sbjct: 481 QVIGNLPSVQDALFHITNRIRDTFFPMRPHVPNFNNPPYLSPHPET-PPLFRPGSNAHSP 540

Query: 541 GCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHGI--------ERP 600
           G YPSQ GALRGTERL +HSHPLDHQP Y H+  FGGNNMDGV YPHG+        ERP
Sbjct: 541 GYYPSQTGALRGTERLPYHSHPLDHQPAYPHNVGFGGNNMDGVPYPHGMERPGPGSFERP 600

Query: 601 SPRSWMSQV-SSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYG 660
           SPRSW SQV SSEIPK +TD G+GMVSRNESY SGGP HF+GGTSME+VIPQTLICHIYG
Sbjct: 601 SPRSWTSQVSSSEIPKGSTD-GYGMVSRNESYGSGGP-HFMGGTSMEMVIPQTLICHIYG 660

Query: 661 ENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           ENN NIAHVQQISGAM+VVHDAKPGMFDGKVI+SGTP+QIRAAQRLVHAFILCG+TQ
Sbjct: 661 ENNNNIAHVQQISGAMLVVHDAKPGMFDGKVIMSGTPEQIRAAQRLVHAFILCGKTQ 713

BLAST of Tan0008767 vs. ExPASy TrEMBL
Match: A0A6J1GQJ7 (KH domain-containing protein HEN4 OS=Cucurbita moschata OX=3662 GN=LOC111456570 PE=4 SV=1)

HSP 1 Score: 1154.8 bits (2986), Expect = 0.0e+00
Identity = 598/713 (83.87%), Positives = 625/713 (87.66%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPPPPPVSTTAPATLGLNSTSRDSPVLKPSS 60
           MDT FSPKR SDAISDPNLSA RSVRPRQLP         PATLG NSTSRDSPVLKPSS
Sbjct: 1   MDTGFSPKRSSDAISDPNLSAARSVRPRQLP---------PATLGFNSTSRDSPVLKPSS 60

Query: 61  PSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPALTNPG 120
           PS+T+FRLLCPASKAASILRHLCDIPGARIHID+PLPSCDECVIVIL GSPSKPALTNPG
Sbjct: 61  PSETVFRLLCPASKAASILRHLCDIPGARIHIDDPLPSCDECVIVILTGSPSKPALTNPG 120

Query: 121 NDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD-------- 180
           NDREFREHD SRNV SDAVAG+ DERS   Q LLR FESM+RMNEDSGENQD        
Sbjct: 121 NDREFREHDNSRNVPSDAVAGDLDERSQAQQALLRTFESMVRMNEDSGENQDIQKKNADS 180

Query: 181 ---DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACA 240
              DRI+GGETDG VVCRLLAPSHQVGRVLGRGGK VEKIRQESTA VKIFPKDQIPACA
Sbjct: 181 APSDRISGGETDGSVVCRLLAPSHQVGRVLGRGGKNVEKIRQESTAHVKIFPKDQIPACA 240

Query: 241 SPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPVQDEE 300
           SPRDELIQISG F AVKKAL S+SACLQD+ RVDSSNSST KPLGP S AS  +P  DEE
Sbjct: 241 SPRDELIQISGRFPAVKKALSSISACLQDTPRVDSSNSSTTKPLGPPSQAS-SMPGLDEE 300

Query: 301 PSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLIGKGG 360
           PSP RRY SHHNAD+R RGYSSIPGH+N GAG R AMEE+VVFRLLCQPDKVGSLIGKGG
Sbjct: 301 PSPKRRYASHHNADHRSRGYSSIPGHENAGAGPRPAMEEEVVFRLLCQPDKVGSLIGKGG 360

Query: 361 TIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAENGYEP 420
           TIVRALQ+ETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAV+RVH RIAE GYEP
Sbjct: 361 TIVRALQNETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVMRVHGRIAELGYEP 420

Query: 421 GVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIG 480
           G AVVARLLVHA QIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSG VNDE+VQVIG
Sbjct: 421 GAAVVARLLVHAQQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGPVNDEIVQVIG 480

Query: 481 SLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYP 540
           +LQ VQDALFHITNRLRDTLFPMRPH+PNFN PSYLSPHPET PP+FRPGNNAHSPG YP
Sbjct: 481 NLQCVQDALFHITNRLRDTLFPMRPHVPNFNAPSYLSPHPET-PPVFRPGNNAHSPGYYP 540

Query: 541 SQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHG--------IERPSPRS 600
           SQAGA    +RL FHSHPLDHQP YSH  +F GNNMDGV YPHG        IERPSPRS
Sbjct: 541 SQAGA----QRLPFHSHPLDHQPAYSHGMSFSGNNMDGVPYPHGIDRPGPGSIERPSPRS 600

Query: 601 WMSQVSSEIPKA-ATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYGENNT 660
           W SQ S++IPK   TDVGFGM SRNESYSSGGP HF+GGTSME+VIPQTLICHIYGENN 
Sbjct: 601 WTSQASNDIPKGPTTDVGFGMASRNESYSSGGPPHFVGGTSMEMVIPQTLICHIYGENNA 660

Query: 661 NIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           NIAHVQQISGAMVVVHDAKPG+FDGKV+VSGTPDQIRAAQRLVHAFILCG+TQ
Sbjct: 661 NIAHVQQISGAMVVVHDAKPGIFDGKVVVSGTPDQIRAAQRLVHAFILCGKTQ 698
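
The percentages in an HSP summary line are the match counts divided by the alignment length (713 columns here, gaps included), not by the query length. A minimal sketch of that arithmetic, using the counts reported for the A0A6J1GQJ7 hit; the names `HSP_LINE` and `parse_hsp_stats` are illustrative and not part of any database tool:

```python
import re

# Summary line for the A0A6J1GQJ7 hit; counts are taken from this page.
HSP_LINE = "Identity = 598/713 (83.87%), Positives = 625/713 (87.66%), Query Frame = 0"

def parse_hsp_stats(line):
    """Recompute identity and positives percentages from their raw counts."""
    stats = {}
    # "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
    for key, label in (("identity", "Identity"), ("positives", "Posi?tives")):
        match = re.search(label + r"\s*=\s*(\d+)/(\d+)", line)
        if match:
            count, length = int(match.group(1)), int(match.group(2))
            stats[key] = (count, length, round(100.0 * count / length, 2))
    return stats

if __name__ == "__main__":
    for name, (count, length, pct) in parse_hsp_stats(HSP_LINE).items():
        print(f"{name}: {count}/{length} = {pct}%")
```

The same function can be applied to the summary line of any other hit on this page to check its reported percentages.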

BLAST of Tan0008767 vs. ExPASy TrEMBL
Match: A0A6J1JYF4 (KH domain-containing protein HEN4 OS=Cucurbita maxima OX=3661 GN=LOC111488582 PE=4 SV=1)

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 592/713 (83.03%), Positives = 622/713 (87.24%), Query Frame = 0

Query: 1   MDTNFSPKRPSDAISDPNLSAGRSVRPRQLPPPPPVSTTAPATLGLNSTSRDSPVLKPSS 60
           MDT FSP R SDAI DPNLSA RSVRPRQLP         PATLG NSTSRDSPVLKPSS
Sbjct: 1   MDTGFSPTRSSDAIPDPNLSAARSVRPRQLP---------PATLGFNSTSRDSPVLKPSS 60

Query: 61  PSDTLFRLLCPASKAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPALTNPG 120
           PS+T+FRLLCPASKAASI RHLCD+PGARIHID+PLPSCDECVIVIL GSPSKPA TNPG
Sbjct: 61  PSETVFRLLCPASKAASIHRHLCDVPGARIHIDDPLPSCDECVIVILTGSPSKPAPTNPG 120

Query: 121 NDREFREHDISRNVSSDAVAGNSDERS---QLLLRIFESMIRMNEDSGENQD-------- 180
           NDREFREHD SRNVSSDAVAG+ DERS   Q LLR FESM+RMNEDSGENQD        
Sbjct: 121 NDREFREHDNSRNVSSDAVAGDLDERSQAQQALLRTFESMVRMNEDSGENQDIQKKNADS 180

Query: 181 ---DRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACA 240
              +RI+GGETDG VVCRLLAPSHQVGRVLGRGGK VEKIRQESTA VKIFPKDQIPACA
Sbjct: 181 APSERISGGETDGSVVCRLLAPSHQVGRVLGRGGKNVEKIRQESTAHVKIFPKDQIPACA 240

Query: 241 SPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPVQDEE 300
           SPRDELIQISG F AVKKAL S+SACLQD  R+DSSNSST KPLGP S AS  +P  DEE
Sbjct: 241 SPRDELIQISGRFPAVKKALSSISACLQDGPRLDSSNSSTTKPLGPPSQAS-SMPALDEE 300

Query: 301 PSPMRRYISHHNADYRPRGYSSIPGHDNVGAGQRAAMEEDVVFRLLCQPDKVGSLIGKGG 360
           PSP RRY SHHNAD+R RGYSSIPGH+N GAG R AMEE+VVFRLLCQPDKVGSLIGKGG
Sbjct: 301 PSPKRRYASHHNADHRSRGYSSIPGHENAGAGPRPAMEEEVVFRLLCQPDKVGSLIGKGG 360

Query: 361 TIVRALQSETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRIAENGYEP 420
           TIVRALQ+ETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAV+RVH RIAE GYEP
Sbjct: 361 TIVRALQNETGASIKIVDTPDLDERVVMISARESLEQTYSPAQEAVMRVHGRIAELGYEP 420

Query: 421 GVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIG 480
           G AVVARLLVHA QIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSG V+DE+VQVIG
Sbjct: 421 GAAVVARLLVHAQQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSGPVSDEIVQVIG 480

Query: 481 SLQSVQDALFHITNRLRDTLFPMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYP 540
           +LQ VQDALFHITNRLRDTLFPMRPH+PNFN PSYLSPHPET PP+FRPGNNAHSPG YP
Sbjct: 481 NLQCVQDALFHITNRLRDTLFPMRPHVPNFNAPSYLSPHPET-PPVFRPGNNAHSPGYYP 540

Query: 541 SQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPHG--------IERPSPRS 600
           SQAGA    +RL FHSHPLDHQP YSH  +F GNNMDGV YPHG        IERPSPRS
Sbjct: 541 SQAGA----QRLPFHSHPLDHQPAYSHGMSFSGNNMDGVPYPHGIDRPGPGSIERPSPRS 600

Query: 601 WMSQVSSEIPKA-ATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQTLICHIYGENNT 660
           W SQVS++IPK   TDVGFGM SRNE YSSGGPAHF+GGTSME+VIPQTLICHIYGENN 
Sbjct: 601 WTSQVSNDIPKGPTTDVGFGMASRNEPYSSGGPAHFVGGTSMEMVIPQTLICHIYGENNA 660

Query: 661 NIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFILCGRTQ 691
           NIAHVQQISGAMVVVHDAKPG+FDGKV+VSGTPDQIRAAQRLVHAFILCG+TQ
Sbjct: 661 NIAHVQQISGAMVVVHDAKPGIFDGKVVVSGTPDQIRAAQRLVHAFILCGKTQ 698

BLAST of Tan0008767 vs. TAIR 10
Match: AT1G51580.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 420.6 bits (1080), Expect = 2.4e-117
Identity = 277/674 (41.10%), Positives = 386/674 (57.27%), Query Frame = 0

Query: 48  STSRDSPVLKPSSPSDTLFRLLCPAS-------KAASILRHLCDIPGARIHI--DEPLPS 107
           STS+  P    ++     FRLLCPA+       K  S++RHL  + G++I +  D P+PS
Sbjct: 4   STSK-RPATTATAAESVHFRLLCPATRTGAIIGKGGSVIRHLQSVTGSKIRVIDDIPVPS 63

Query: 108 CDECVIVILAGSPSKPALT------NPGNDREFREHDISRNVSSDAVAGNSDERS----- 167
            +E V++I+A S  K   +      NPG++   +E    +       +G  DE +     
Sbjct: 64  -EERVVLIIAPSGKKKDESNVCDSENPGSEEPKQE----KGSECAGTSGGDDEEAPSSAQ 123

Query: 168 QLLLRIFESMIRMNEDSGENQDDRITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIR 227
             LLR+FE ++   +D+     D +  GE++GL  CR++   +QV  ++ +GGK ++KIR
Sbjct: 124 MALLRVFERIV-FGDDAATVDGDELDKGESEGL--CRMIVRGNQVDYLMSKGGKMIQKIR 183

Query: 228 QESTAQVKIFPKDQIPACASPRDELIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTA 287
           ++S A V+I   DQIP CA P D +IQ++G F++VKKAL+ V+ CLQ+S           
Sbjct: 184 EDSGAIVRISSTDQIPPCAFPGDVVIQMNGKFSSVKKALLLVTNCLQES----------G 243

Query: 288 KPLGPTSHASCCLPVQDEEPSPMRRY-ISHHNADYRPRGYSSIPG--HDNVGAGQRAAME 347
            P           P  DE P P   Y   +H+ +Y P+     P    ++VG   R  +E
Sbjct: 244 AP-----------PTWDECPFPQPGYPPEYHSMEYHPQWDHPPPNPMPEDVGPFNRPVVE 303

Query: 348 EDVVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVD-TPDLDERVVMISARESLEQ 407
           E+V FRLLC  DKVGSLIGKGG +VRALQ+E+GASIK+ D T D +ER+++ISARE+LE+
Sbjct: 304 EEVAFRLLCPADKVGSLIGKGGAVVRALQNESGASIKVSDPTHDSEERIIVISARENLER 363

Query: 408 TYSPAQEAVIRVHCRIAENGYEPGVAVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSI 467
            +S AQ+ V+RVH RI E G+EP  AVVARLLVH+  IG L+G+GGH+I+EMRR TG SI
Sbjct: 364 RHSLAQDGVMRVHNRIVEIGFEPSAAVVARLLVHSPYIGRLLGKGGHLISEMRRATGASI 423

Query: 468 QIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLFPMRPHMPNFN--TPSY 527
           ++F ++Q     + +DE+VQVIG+L++VQDALF I  RLR+ +FP R          P +
Sbjct: 424 RVFAKDQATKYESQHDEIVQVIGNLKTVQDALFQILCRLREAMFPGRLPFQGMGGPPPPF 483

Query: 528 LSPHPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNN 587
           + P+PE PPP F P     SP  Y S  G     ER H H    D  P         G  
Sbjct: 484 MGPYPEPPPP-FGPRQYPASPDRYHSPVGPFH--ER-HCHGPGFDRPP---------GPG 543

Query: 588 MDGVHYPHGIERPSPRSWMSQVSSE------IPKAATDVGFGMVSRNESYSSGGPAHFLG 647
            D          PSP SW  Q   +      +P    DV  G   RNE   S  P   + 
Sbjct: 544 FD--------RPPSPMSWTPQPGIDGHPGGMVP---PDVNHGFALRNEPIGSENPV--MT 603

Query: 648 GTSMEIVIPQTLICHIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRA 690
             ++EIVIPQ  + H+YGEN +N+ +++Q+SGA VVVHD K G  +G V+VSGT DQ   
Sbjct: 604 SANVEIVIPQAYLGHVYGENCSNLNYIKQVSGANVVVHDPKAGTTEGLVVVSGTSDQAHF 621

BLAST of Tan0008767 vs. TAIR 10
Match: AT5G64390.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 241.5 bits (615), Expect = 2.0e-63
Identity = 223/837 (26.64%), Positives = 348/837 (41.58%), Query Frame = 0

Query: 66  FRLLCPAS-------KAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL-- 125
           FRLLCP S       K+ ++++ L    GA+I ++EP     + VI I+A + SK  +  
Sbjct: 49  FRLLCPLSHVGAVIGKSGNVIKQLQQSTGAKIRVEEPPSGSPDRVITIIAQADSKSRVKL 108

Query: 126 --TNPGN-DREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGENQDDRIT 185
              N GN + E +E ++  + +  A           L+++FE ++    DS         
Sbjct: 109 GANNNGNAEGEKKEEEVEVSKAQGA-----------LIKVFE-LLAAEADS--------- 168

Query: 186 GGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACASPRDELI 245
                  VVCRLL  S   G V+G+GG+ V  IR+E+  ++ I   + +P CA   DE++
Sbjct: 169 -----DTVVCRLLTESSHAGAVIGKGGQMVGSIRKETGCKISI-RIENLPICADTDDEMV 228

Query: 246 QISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPL----------------------- 305
           ++ G+  AVKKALVS+S CLQ+   +D       +PL                       
Sbjct: 229 EVEGNAIAVKKALVSISRCLQNCQSIDKVRMVGNRPLEKEFQASLHRPIETIIQESLPRS 288

Query: 306 ------------------GPTSHASCCLP--------------------VQDEEPSPMRR 365
                             G  + A+  +P                    ++ +    +RR
Sbjct: 289 VEVNPYDYRLRNDEIFPRGTVARANDVIPHDTLHLRRIEAVPQGALRMHIEADRQDVLRR 348

Query: 366 YI----------------------------------------SH---------------- 425
           ++                                        SH                
Sbjct: 349 HVEADRQDALRRRIDVVPQETLYMPSDVLRGDCFRQHRERDDSHDSLHRPFEMVPRDAMG 408

Query: 426 -------------------------HNADYRPRGYSSIPGH-----DNVGAGQRAAME-- 485
                                     +ADY    YS++  H      +      A M+  
Sbjct: 409 MPFESFPRDAYGRPIETMTQETLRGQSADYLAHRYSTLDTHPHSFTTSASMANTATMKPP 468

Query: 486 --------EDVVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDT-PDLDERVVMI 545
                   +DVVF++LC  +  G +IG GG +VR L SETGA I + +T  D +ER++ +
Sbjct: 469 PSEVEVGNQDVVFKILCSTENAGGVIGTGGKVVRMLHSETGAFINVGNTLDDCEERLIAV 528

Query: 546 SARESLEQTYSPAQEAVIRVHCR--------IAENGYEPGVAVVARLLVHAHQIGYLVGR 605
           +A E+ E   SPAQ+A++ +  R        I +NG  P  ++ ARL+V   QIG ++G+
Sbjct: 529 TASENPECQSSPAQKAIMLIFSRLFELATNKILDNG--PRSSITARLVVPTSQIGCVLGK 588

Query: 606 GGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLF 665
           GG I++EMR+ TG +IQI   EQ     + ND+VVQ+ G   +V++A+FHIT+RLRD++F
Sbjct: 589 GGVIVSEMRKTTGAAIQILKVEQNPKCISENDQVVQITGEFPNVREAIFHITSRLRDSVF 648

Query: 666 PMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSH---- 693
                     + S L+    T     R  +N  S G + S +     +  LH  S     
Sbjct: 649 SNSMKNSLAKSSSALT----TERFYDRQSDNPLSIGSHQSVSNPATNSSSLHRRSEDSFL 708

BLAST of Tan0008767 vs. TAIR 10
Match: AT5G64390.3 (RNA-binding KH domain-containing protein )

HSP 1 Score: 240.0 bits (611), Expect = 5.8e-63
Identity = 224/846 (26.48%), Positives = 349/846 (41.25%), Query Frame = 0

Query: 66  FRLLCPAS-------KAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL-- 125
           FRLLCP S       K+ ++++ L    GA+I ++EP     + VI I+A + SK  +  
Sbjct: 49  FRLLCPLSHVGAVIGKSGNVIKQLQQSTGAKIRVEEPPSGSPDRVITIIAQADSKSRVKL 108

Query: 126 --TNPGN-DREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGENQDDRIT 185
              N GN + E +E ++  + +  A           L+++FE ++    DS         
Sbjct: 109 GANNNGNAEGEKKEEEVEVSKAQGA-----------LIKVFE-LLAAEADS--------- 168

Query: 186 GGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACASPRDELI 245
                  VVCRLL  S   G V+G+GG+ V  IR+E+  ++ I   + +P CA   DE++
Sbjct: 169 -----DTVVCRLLTESSHAGAVIGKGGQMVGSIRKETGCKISI-RIENLPICADTDDEMV 228

Query: 246 QISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPL----------------------- 305
           ++ G+  AVKKALVS+S CLQ+   +D       +PL                       
Sbjct: 229 EVEGNAIAVKKALVSISRCLQNCQSIDKVRMVGNRPLEKEFQASLHRPIETIIQESLPRS 288

Query: 306 ------------------GPTSHASCCLP--------------------VQDEEPSPMRR 365
                             G  + A+  +P                    ++ +    +RR
Sbjct: 289 VEVNPYDYRLRNDEIFPRGTVARANDVIPHDTLHLRRIEAVPQGALRMHIEADRQDVLRR 348

Query: 366 YI----------------------------------------SH---------------- 425
           ++                                        SH                
Sbjct: 349 HVEADRQDALRRRIDVVPQETLYMPSDVLRGDCFRQHRERDDSHDSLHRPFEMVPRDAMG 408

Query: 426 -------------------------HNADYRPRGYSSIPGH-----DNVGAGQRAAME-- 485
                                     +ADY    YS++  H      +      A M+  
Sbjct: 409 MPFESFPRDAYGRPIETMTQETLRGQSADYLAHRYSTLDTHPHSFTTSASMANTATMKPP 468

Query: 486 --------EDVVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDT-PDLDERVVMI 545
                   +DVVF++LC  +  G +IG GG +VR L SETGA I + +T  D +ER++ +
Sbjct: 469 PSEVEVGNQDVVFKILCSTENAGGVIGTGGKVVRMLHSETGAFINVGNTLDDCEERLIAV 528

Query: 546 SARESLEQTYSPAQEAVIRVHCR--------IAENGYEPGVAVVARLLVHAHQIGYLVGR 605
           +A E+ E   SPAQ+A++ +  R        I +NG  P  ++ ARL+V   QIG ++G+
Sbjct: 529 TASENPECQSSPAQKAIMLIFSRLFELATNKILDNG--PRSSITARLVVPTSQIGCVLGK 588

Query: 606 GGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLF 665
           GG I++EMR+ TG +IQI   EQ     + ND+VVQ+ G   +V++A+FHIT+RLRD++F
Sbjct: 589 GGVIVSEMRKTTGAAIQILKVEQNPKCISENDQVVQITGEFPNVREAIFHITSRLRDSVF 648

Query: 666 PMRPHMPNFNTPSYLSPHPETPPPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSH---- 693
                     + S L+    T     R  +N  S G + S +     +  LH  S     
Sbjct: 649 SNSMKNSLAKSSSALT----TERFYDRQSDNPLSIGSHQSVSNPATNSSSLHRRSEDSFL 708

BLAST of Tan0008767 vs. TAIR 10
Match: AT5G64390.2 (RNA-binding KH domain-containing protein )

HSP 1 Score: 191.8 bits (486), Expect = 1.8e-48
Identity = 198/791 (25.03%), Positives = 316/791 (39.95%), Query Frame = 0

Query: 66  FRLLCPAS-------KAASILRHLCDIPGARIHIDEPLPSCDECVIVILAGSPSKPAL-- 125
           FRLLCP S       K+ ++++ L    GA+I ++EP     + VI I+A + SK  +  
Sbjct: 49  FRLLCPLSHVGAVIGKSGNVIKQLQQSTGAKIRVEEPPSGSPDRVITIIAQADSKSRVKL 108

Query: 126 --TNPGN-DREFREHDISRNVSSDAVAGNSDERSQLLLRIFESMIRMNEDSGENQDDRIT 185
              N GN + E +E ++  + +  A           L+++FE ++    DS         
Sbjct: 109 GANNNGNAEGEKKEEEVEVSKAQGA-----------LIKVFE-LLAAEADS--------- 168

Query: 186 GGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACASPRDELI 245
                  VVCRLL  S   G V+G+GG+ V  IR+E+  ++ I   + +P CA   DE++
Sbjct: 169 -----DTVVCRLLTESSHAGAVIGKGGQMVGSIRKETGCKISI-RIENLPICADTDDEMV 228

Query: 246 QISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPL----------------------- 305
           ++ G+  AVKKALVS+S CLQ+   +D       +PL                       
Sbjct: 229 EVEGNAIAVKKALVSISRCLQNCQSIDKVRMVGNRPLEKEFQASLHRPIETIIQESLPRS 288

Query: 306 ------------------GPTSHASCCLP--------------------VQDEEPSPMRR 365
                             G  + A+  +P                    ++ +    +RR
Sbjct: 289 VEVNPYDYRLRNDEIFPRGTVARANDVIPHDTLHLRRIEAVPQGALRMHIEADRQDVLRR 348

Query: 366 YI----------------------------------------SH---------------- 425
           ++                                        SH                
Sbjct: 349 HVEADRQDALRRRIDVVPQETLYMPSDVLRGDCFRQHRERDDSHDSLHRPFEMVPRDAMG 408

Query: 426 -------------------------HNADYRPRGYSSIPGH-----DNVGAGQRAAME-- 485
                                     +ADY    YS++  H      +      A M+  
Sbjct: 409 MPFESFPRDAYGRPIETMTQETLRGQSADYLAHRYSTLDTHPHSFTTSASMANTATMKPP 468

Query: 486 --------EDVVFRLLCQPDKVGSLIGKGGTIVRALQSETGASIKIVDT-PDLDERVVMI 545
                   +DVVF++LC  +  G +IG GG +VR L SETGA I + +T  D +ER++ +
Sbjct: 469 PSEVEVGNQDVVFKILCSTENAGGVIGTGGKVVRMLHSETGAFINVGNTLDDCEERLIAV 528

Query: 546 SARESLEQTYSPAQEAVIRVHCR--------IAENGYEPGVAVVARLLVHAHQIGYLVGR 605
           +A E+ E   SPAQ+A++ +  R        I +NG  P  ++ ARL+V   QIG ++G+
Sbjct: 529 TASENPECQSSPAQKAIMLIFSRLFELATNKILDNG--PRSSITARLVVPTSQIGCVLGK 588

Query: 606 GGHIINEMRRGTGTSIQIFPREQIQNSGAVNDEVVQVIGSLQSVQDALFHITNRLRDTLF 647
           GG I++EMR+ TG +IQI   EQ     + ND+VVQ+ G   +V++A+FHIT+RLRD++F
Sbjct: 589 GGVIVSEMRKTTGAAIQILKVEQNPKCISENDQVVQITGEFPNVREAIFHITSRLRDSVF 648

BLAST of Tan0008767 vs. TAIR 10
Match: AT2G22600.1 (RNA-binding KH domain-containing protein )

HSP 1 Score: 188.3 bits (477), Expect = 2.0e-47
Identity = 186/659 (28.22%), Positives = 287/659 (43.55%), Query Frame = 0

Query: 61  PSDTLFRLLCPASKAASIL-------RHLCDIPGARIHIDEPLPSCDECVIVILAGSPSK 120
           P +T  R++C AS    I+         L    G +IH + P+   D  V+ I+  +   
Sbjct: 22  PDETAIRVVCHASVIGGIIGSNGYVVSKLRRETGTKIHCESPVNGSDHWVVFIVGSTAVN 81

Query: 121 PALTNPGNDREFREHDISRNVSSDAVAGNSDERSQLLLRIFES--MIRMNEDSGENQDDR 180
            ++       EF     S     D V          L+R+ E   ++   +D G      
Sbjct: 82  QSILLTDRVGEF-----SGGEHEDWVTCEVSAAQTALIRVLERSWVVLAAKDGG-----G 141

Query: 181 ITGGETDGLVVCRLLAPSHQVGRVLGRGGKTVEKIRQESTAQVKIFPKDQIPACASPRDE 240
           +  GE D    C +LA  +Q+G VLG GGK VE +R+ S A +++ P    P C +  DE
Sbjct: 142 VVDGE-DEEAYCGILADRNQIGAVLGLGGKNVEWMRRNSGAMIRVLPP---PICGTKNDE 201

Query: 241 LIQISGSFTAVKKALVSVSACLQDSLRVDSSNSSTAKPLGPTSHASCCLPVQDEEPSPMR 300
           LIQI+G   AVKKALV VS+ +Q++  +    +    PL    + S       E+P    
Sbjct: 202 LIQITGDVLAVKKALVMVSSYIQNNAPL----NGYPPPLSIKGYESLSTDGNSEDP---- 261

Query: 301 RYISHHNADYRPRGYSSIPGHDNVGAGQR-------AAMEEDVVFRLLCQPDKVGSLIGK 360
                H+  +     SS+     + A  R        + E  VVF+++      G +IGK
Sbjct: 262 -----HSEFFPNLRSSSLSNATEIVASNRHLPYDGGNSTERKVVFKIIFTSVVAGGIIGK 321

Query: 361 GGTIVRALQSETGASIKI-VDTPDLDERVVMISARESLEQTYSPAQEAVIRVHCRI---- 420
            GTI+RALQ+ETGASI +        ERVV +SARE+LE  YS AQ A+  V  R     
Sbjct: 322 QGTIIRALQNETGASISVGAPLKVSGERVVTVSARENLESRYSHAQNALALVFARSVEID 381

Query: 421 AENGYEPGV----AVVARLLVHAHQIGYLVGRGGHIINEMRRGTGTSIQIFPREQIQNSG 480
            E G  PG+     V  +LLV +H      G G     E    TG  + I    Q+    
Sbjct: 382 VEKGLRPGLHNGAIVKTKLLVPSHFANSFNGNGN---REAIIATGADVHISVGNQVLEWI 441

Query: 481 AVNDEVVQVIGSLQSVQDALFHITNRLRDTLFP------MRPHMPNFNTPSYLSPHPETP 540
           + N+ V+++ G    VQ AL H++++LR+ L P      MR  + N              
Sbjct: 442 SENEVVIEIKGEYSHVQKALTHVSSKLRENLLPKKVLGEMRARVSN-------------- 501

Query: 541 PPLFRPGNNAHSPGCYPSQAGALRGTERLHFHSHPLDHQPVYSHSTNFGGNNMDGVHYPH 600
            P    G  +      PSQ  A RG + L   +   D + V S +     N++       
Sbjct: 502 -PYESAGGRSQIYNLQPSQQDASRG-DSLSVSAAVPDLKMVRSGAEVLKSNSVMHTEVLK 561

Query: 601 GIERPS----PRSWMSQVSSEIPKAATDVGFGMVSRNESYSSGGPAHFLGGTSMEIVIPQ 660
            ++  +    P+S + +  ++  K       G VS     S G     +   ++E+ + +
Sbjct: 562 EVDELNDFTLPQSLLEEDLTQGMKQLQMSSNGDVSSLPPRSKGVSVRKI---TLELAVEK 621

Query: 661 TLICHIYGENNTNIAHVQQISGAMVVVHDAKPGMFDGKVIVSGTPDQIRAAQRLVHAFI 685
             +  +YG + T + ++QQISGA V V D  P   +  V++SG P+Q R A  L+ + +
Sbjct: 622 DALGSLYGRDGTGVDNLQQISGANVDVKD--PTGTEATVLISGNPEQARTAMSLIESIL 629

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
F4KDN0	2.8e-62	26.64	KH domain-containing protein HEN4 OS=Arabidopsis thaliana OX=3702 GN=HEN4 PE=1 S...
Q8W4B1	9.1e-45	25.46	RNA-binding KH domain-containing protein RCF3 OS=Arabidopsis thaliana OX=3702 GN...
P58223	5.0e-35	24.05	KH domain-containing protein At4g18375 OS=Arabidopsis thaliana OX=3702 GN=At4g18...
Q15366	4.9e-14	23.06	Poly(rC)-binding protein 2 OS=Homo sapiens OX=9606 GN=PCBP2 PE=1 SV=1
P57721	1.1e-13	24.93	Poly(rC)-binding protein 3 OS=Homo sapiens OX=9606 GN=PCBP3 PE=1 SV=2
Match Name	E-value	Identity (%)	Description
XP_038898275.1	0.0e+00	85.63	KH domain-containing protein HEN4 [Benincasa hispida]
KAA0059338.1	0.0e+00	82.82	KH domain-containing protein [Cucumis melo var. makuwa] >TYK03987.1 KH domain-co...
XP_004141829.1	0.0e+00	82.73	KH domain-containing protein HEN4 [Cucumis sativus] >KGN45417.1 hypothetical pro...
XP_008462207.1	0.0e+00	82.85	PREDICTED: KH domain-containing protein At4g18375 [Cucumis melo]
KAG6575395.1	0.0e+00	83.54	KH domain-containing protein HEN4, partial [Cucurbita argyrosperma subsp. sorori...
Match Name	E-value	Identity (%)	Description
A0A5A7UW07	0.0e+00	82.82	KH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A0A0KAK5	0.0e+00	82.73	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447790 PE=4 SV=1
A0A1S3CGC9	0.0e+00	82.85	KH domain-containing protein At4g18375 OS=Cucumis melo OX=3656 GN=LOC103500622 P...
A0A6J1GQJ7	0.0e+00	83.87	KH domain-containing protein HEN4 OS=Cucurbita moschata OX=3662 GN=LOC111456570 ...
A0A6J1JYF4	0.0e+00	83.03	KH domain-containing protein HEN4 OS=Cucurbita maxima OX=3661 GN=LOC111488582 PE...
Match Name	E-value	Identity (%)	Description
AT1G51580.1	2.4e-117	41.10	RNA-binding KH domain-containing protein
AT5G64390.1	2.0e-63	26.64	RNA-binding KH domain-containing protein
AT5G64390.3	5.8e-63	26.48	RNA-binding KH domain-containing protein
AT5G64390.2	1.8e-48	25.03	RNA-binding KH domain-containing protein
AT2G22600.1	2.0e-47	28.22	RNA-binding KH domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004087	K Homology domain	SMART	SM00322	kh_6	coord: 179..254; e-value: 1.5E-9; score: 47.7
IPR004087	K Homology domain	SMART	SM00322	kh_6	coord: 325..400; e-value: 6.7E-9; score: 45.6
IPR004087	K Homology domain	SMART	SM00322	kh_6	coord: 615..685; e-value: 6.3E-5; score: 32.4
IPR004087	K Homology domain	SMART	SM00322	kh_6	coord: 408..483; e-value: 2.4E-9; score: 47.1
None	No IPR available	GENE3D	3.30.310.210		coord: 325..486; e-value: 3.8E-34; score: 119.7
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 40..60
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..60
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 499..530
None	No IPR available	PANTHER	PTHR10288:SF257	OS09G0498600 PROTEIN	coord: 61..547
None	No IPR available	PANTHER	PTHR10288	KH DOMAIN CONTAINING RNA BINDING PROTEIN	coord: 61..547
None	No IPR available	PROSITE	PS50084	KH_TYPE_1	coord: 616..680; score: 11.99262
None	No IPR available	PROSITE	PS50084	KH_TYPE_1	coord: 409..478; score: 14.319535
None	No IPR available	PROSITE	PS50084	KH_TYPE_1	coord: 326..392; score: 14.654946
None	No IPR available	PROSITE	PS50084	KH_TYPE_1	coord: 180..249; score: 14.969395
None	No IPR available	CDD	cd02396	PCBP_like_KH	coord: 328..396; e-value: 2.21673E-17; score: 74.8203
None	No IPR available	CDD	cd02396	PCBP_like_KH	coord: 411..479; e-value: 1.54311E-16; score: 72.5091
None	No IPR available	CDD	cd00105	KH-I	coord: 618..681; e-value: 9.09659E-9; score: 50.2511
None	No IPR available	CDD	cd02396	PCBP_like_KH	coord: 182..250; e-value: 2.34577E-15; score: 69.0423
IPR036612	K Homology domain, type 1 superfamily	GENE3D	3.30.1370.10	K Homology domain, type 1	coord: 180..256; e-value: 1.4E-18; score: 68.4
IPR036612	K Homology domain, type 1 superfamily	GENE3D	3.30.1370.10	K Homology domain, type 1	coord: 616..685; e-value: 1.2E-11; score: 46.1
IPR036612	K Homology domain, type 1 superfamily	SUPERFAMILY	54791	Eukaryotic type KH-domain (KH-domain type I)	coord: 323..380
IPR036612	K Homology domain, type 1 superfamily	SUPERFAMILY	54791	Eukaryotic type KH-domain (KH-domain type I)	coord: 616..684
IPR036612	K Homology domain, type 1 superfamily	SUPERFAMILY	54791	Eukaryotic type KH-domain (KH-domain type I)	coord: 178..257
IPR036612	K Homology domain, type 1 superfamily	SUPERFAMILY	54791	Eukaryotic type KH-domain (KH-domain type I)	coord: 409..486
IPR004088	K Homology domain, type 1	PFAM	PF00013	KH_1	coord: 183..247; e-value: 1.1E-11; score: 44.4
IPR004088	K Homology domain, type 1	PFAM	PF00013	KH_1	coord: 619..681; e-value: 2.4E-6; score: 27.3
IPR004088	K Homology domain, type 1	PFAM	PF00013	KH_1	coord: 412..479; e-value: 9.3E-12; score: 44.6
IPR004088	K Homology domain, type 1	PFAM	PF00013	KH_1	coord: 329..377; e-value: 6.7E-11; score: 41.8
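
The four PROSITE KH_TYPE_1 (PS50084) hits above are listed in score order rather than sequence order. As a rough sketch, the coordinates can be sorted to show the domain layout along the protein, with each domain's length and the spacing between neighbours. The names `KH_HITS` and `summarize_domains` are illustrative, and the coordinates are assumed to be 1-based and inclusive:

```python
# PROSITE PS50084 (KH_TYPE_1) hits as (start, end), copied from the table above.
# Assumption: coordinates are 1-based and inclusive.
KH_HITS = [(616, 680), (409, 478), (326, 392), (180, 249)]

def summarize_domains(hits):
    """Return the hits in sequence order, their lengths, and the linker lengths between them."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    # Residues strictly between the end of one domain and the start of the next.
    linkers = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return ordered, lengths, linkers

if __name__ == "__main__":
    ordered, lengths, linkers = summarize_domains(KH_HITS)
    for (start, end), length in zip(ordered, lengths):
        print(f"KH domain {start}..{end} ({length} aa)")
    print("linker lengths:", linkers)
```

Swapping in the SMART, CDD, or Pfam coordinates gives slightly different boundaries for the same four KH domains.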

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Tan0008767.1	Tan0008767.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0010468	regulation of gene expression
cellular_component	GO:0005737	cytoplasm
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003729	mRNA binding
molecular_function	GO:0003676	nucleic acid binding
molecular_function	GO:0003723	RNA binding