Tan0005393 (gene) Snake gourd v1

Overview
Name: Tan0005393
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG06: 11453785 .. 11479016 (+)
RNA-Seq Expression: Tan0005393
Synteny: Tan0005393
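
The Location field packs the linkage group, start, end, and strand into one string. A minimal Python sketch for splitting it apart is shown here; the function name parse_location is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'LG06: 11453785 .. 11479016 (+)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("LG06: 11453785 .. 11479016 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 25,232 bp on the plus strand of LG06.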
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGCTCTCACTCTCACACAGACTCCGGACTCTGTCCTCATCTCTATGCTCCAAATCTCATCAATTTCGATCGGTCCGAACGGCCACCGGACCGTCCAAGCGGCGATCAAAGGCTGCCGCATTCACCGTAAAGAGGCCCGACGAGAAGTCCGAATGGTGGGTTGTTGACGGCGAAATGCACGAAATCGGCGACAATGTGCCCCCTCGCGAGCGCTTCGTTATACCCAGAGAAAATCTCCCCAATCGGCGTCGAAAGCAGCTCAGGGAGCAGTTCATGCGCCGGACTCGCCTCGTTCTTAAGGAATCTGTTAGTCCCCTTTCTTCTTTTCTTTTTCCTTTTCTTTTTCTTCCCTGCGTTTGATTGGAGGGCTTTTATCTATAATTCCAGTTGGGATTGATGTTCAGAGTCATGAATTTAGCTGCTGTTGTCTTGAATTGTGTATGAAGAAGGAAGTTGGTTCTTATTGGTATTTGAATTGGGTGTTTGAAGGAACACGAGCCTTGGTGCAAAAGGTACATGGAGCTTTATCAGGAGCTAAGGGAGAACTGGGAGAGGCTGTACTGGGATGAGGGTTACTCTAAAAAACTTGCCCAGGAACATGCAAATTATGAGTCTTCTGAAGACGAGGATTTTTCCCCTTATAGGTTGGATTTTTTGTGATTCTATGTATATTTGGTATAGCTACTGACTACTGAAAATGATCTTAAATTGAAAGCTTTGTTCTATCCATTAAACTTTTGTGATTTAAAATTCTAAGTTTCTTTTTGCAAATGAGGAAAATGGATAAAATTATAATTTAAAAAGTGTTCAAAGTTTTTCTTTTAAATTCTAAGTTTCTGTAAGAATAGATTTGTAATGTGACTGCAATGTCTTAGTTTTACTGCTTTTGCCTGCGTTTCAGGAATAGGCAGTTCACTGCTGATCGAAGTAAGGTAAATATAAAATTTTCTTTGATAATTTTTTTAACACATTTTAATTTCAGAGCTTGGTATATCCGTTTTGAATATGAATACGTCTCTATAGAAGTTGATTTACCATTTTATCTCTTAAGATTTGTCAAGTATAACTTCTGTAGTGCCGTGTCTGATATAATTATGTTAATCCAGAGAAATGCACGAGAATTTTGAGGAGAATGGAATAGAAAATAAATAATCCTGAGAATATTCTTCTATTTATGAAGAATAGTACAGTTGCTTAAACTTGATATTTCTTTGTGAGTAGATGACTTATTGAGGAGCAATATTTGGTATAAAAGTAAGACTAGGAAGAATACTAGTGATCGGTTTTATCTTCTTCACTGGCACAAACTCCAAAGTTGGGTTTTCTATGTGCTTGTGTATTACTAGGACCATATGACCAATTTGAAGTTACACTGCTAGTCCAGCATCATCCTCATACCAAATCGAATATTTTGCCTCCTTTTCATGCACTTTAAATTAAGTCGCACTTTTTAGCAAGTTAAATTATGTTGAAAAATGGTTTGAGTGTGGTTCTTTCAACTCTCCTCTTCTCTTCCCTTCCCTCCCTATCCATGGATTCCCTTTTTGAGTCCAAGCGAACATGATGGGTTAAGGACTTGGTGGAGAACTCGCTAACCTTGTTTCTTAAAAAAAAAATTTAAGAACAAGGAAAGCTTCTTTTGGGCGCAGAAGGTCACAAGTAAAAGTGTTTCTTTGCAGAAATCAGAAAGGTTTGAATCGATGGGAAGAAAATCAACATGATAGTTTCATAGAGGTTAAGTTTGAGGGGTTGGAAAGCTTTCACGCTTTTTAGTCAATGAGTTCTTGAGTGTAGGAAGTAGGAAAGTGGAGACAAACAAAGGAGAAGAATCTTCTTGGAAGTAGAAGTAGTTCATTTAACAAATTATCCCTCACTCCTACGAGAAGTAAAATCTCTAGTTTCAGGTGTCATGCAGATGTAGCTAATACAAACCTTGTAGAGGTGGCTTCTTTTGATAAGAACTTCAGTGGTGGCAAGATGATGCTTTCTCAATGGATGGA
TTAACATCCTGATTCATAATCTTATAACTGGGTGATGTATTAACCTTTTTCACTCTAAAAAAGCCCCTTAATGCTTAAATCTAAATGAAACTCATTTAATTAGATTTCATAAGGGACTTTAGGAGGGCATCCAATTTCTCACTCTTTTTGGCAACTGGCGCTAGACAAGATTGGTAAAAATTTAGATAGGTGGAAGCGTTTTAATCTTTCATGTGGAGGACGTTTGACACTTTGTTCTTCGGTATTATCCAATATTCGTCTCTTCTATTTGTCTTTTTTGTAATGCCTTCTCAAGTGAGCCCTGAGCTAGAGCAAATGATGTGTAATTTTTTCCAGCAAGGTAATGAAGGAGCTAAGATTAATCATTTAATTCGATGGAAATTGGCGACTGAGGCTCAAAGTGATGGTGGATTAGGCATTGGAGCTCTTAAACATAGAAACATACCTTTGGTTGCTAATTGGGGATGGCATTTTATGAATGTGCCACATGCTTTTTAGCAAAAGTTGATTATTAGTATCCACATACGATCTTCCTTTGATTGGCATATGACAGGTAAACATGTTTCTAGTTTAAGGCCTTGGATGAATATTTCTAAGTACTGGAAGAGGTTTGAATATTTTGCAGTCTTTAAAATGGGAAATGGTAGGTATATTTTCTTTTGGCATGACATTTGGTGTGGTGACGTTTCTCTAAAATTCAAGTTCCCTCGACTTTTTACTGTTTCTTCTTATCCTAATGGTTCAATTCGGATTTTTGGGGGAATATTTTCTCTAGAGGGTTTTTAAAAGATGAAGAAATCTTGGATTTTCAATATTTGTTGAGTTTGCTGGATCCATCGGTTGTTGTCTCTTCAGAGGATGTTCGAAGTTGGTCACTTGAACTTCTGGTTGGTTTTTGGTCAGTTCTTTATCATATCAGCTTGCCTTGGCTTCTCCTTTAAATTCAGATTTTTTTGGGGGGCCATGTAGAAATCTAAAAGCCCTTAAAGAATCAATATTCTAATTTGGATCATTTTAAATGGAAGTTTGAAGGAAGTTATCATTCTTCAATTTGCTTCCTTCAGTTTGTCATTTATGTTATATGGAAGGCGATTTCCTCCTCCACATTTTCTTTGGCAGGGTATATTCGTAAAAGTGTTGGTCCAAATTGTTCTCCATTTTCAACTTGCAGTGGGTGTTTTCTAATTCAGTAAAAGATATTGTTCTCTAGCTTGTTATTGGTCCTTCATTAGTCTCAAGACCCAGACTTCTTTGGATTAATAGGATTAAAGCTTTATTATCAGAACTTTGGTTGGAAAGAAACCATAGAATTTTTTAAGACAAGGTTATGCCTTGAACAGATTGTTTTGATTTAGCGTGTTTAAGGGCTTCATTTTGATGCACTCTTTCAAAGTTTTTTGCTAGTTTTTCTTTGCAAGATATTTGTTTTAATTGACATGCTTATATTTTTCCGTTGAATTGTTTCTTTGTTTCTCTTTTCTTTTCTTTTGCTTTCATTTTCCCCTCCTCTATGGAGTTTGTATCTTGAGCAGTAGTCTCTTTTCATGACATCAATGAAAAGTTCTATTTTTTGTTTTAATAAAAAATCATAAGGGATGTTGCACACTAGGTGATTTTGCTTGAAGGTTTGAATGTTGGTTGAAGTGTGTCATGAAAGAATGAGGGCCGTTCCATCCCATGGAGTTTGGATAAAAATATAAATCTTCTTTCTATAGTTTGCACTCTTGTACAAAACAAAAGATTGCCTCCCACAACTACACCTTACTTCCAACTATGTATGGATACGCCTGATAAACTATTCCATGCTCACTCCCATGCGCCCCTTTCTACAATATTTTCTAGCAAAACACTATAAGGACCTCAGTCTCGTGCTAATTTCATTGATGTTCCTCAATTAAAGACTAATTGCATTTACATGCTGACCCTCAAGCCTTCCCTTTCTCCACCGTGGTATGTTTGTGATAGGCCCCCAATTACGAAAGTATATACTCGTAAAAAGG
GGGAGGAAGGTTGAGGGGGTATGAGTGTTTGTTGTACGGGGTGTGGGGCCCAAGGAGTTTTCGTCTGTTTGTTGAGTGGGCTGGTGAGTGGTCCTTTTTAGTGTGGTGTTGAGTCTGAGAGGTTATCTTTTTGCTAGTCTGAATTTTGTAATTCTCTATGAGAATCTGGGAGAGGAGGGAGCTCTTGAAATCTTCCCTTGCTACTGGTTTTGTTCATAAATAAAATTGTTTAGGCAAGAGCCTATCAATTTGATGCGGATAGGTGGCTTGCCCAAAATTTCACTTGTCTTGGAGTTGGAAAGAAGGGGGAAGCCCATGCAATTAGTTCGAAGGTGTCAACCACAATTCTAACCTTCCACCCTGTCACAGGTCGAAGCCACAATTTTAAGTTTTTAACTTAGGCATTTGTTGGGTGGTAATGAGACATGCTTTAACATGGGCATTAACTTAGTCCCTTAGCTGGTTGCAATCAAGAAACATGAGAAATCGAATGAATGGTGGTACTAGGAGGTTTTTAACTTAGGCATCAGTGGTGAGAAAGTCCTTGATATCCTAGATTCCCAAGTTTATGATATATATTAATATCCCAAGTTTCCTTAAAAAATGATATGTATTTTAAATGATATATATAAATATGTTTCATTTTTATTACAAGAAGGAAATTCAAGAAAAGAGAACAAAACTAAGAATTAACAACAAAATTACATTGAAAAGATAAAAGCATGCAAATTATAACAAATATCCTGCAAAGAAAAACCAGTAAAAGAACCAGTATAACAAATATACTGCATTTATATGCTTAAAAGAATCAGAATGAGTTTGATGTATTTCGCTCTCAAAATTTATTATTTTTGTCTTATATTGTTGTGTGTCTATTTAGTATACTTATCAAGTGTTTGATACTTGTATAACAAATATTGGACCTACATTGACTTTGTACAATTAGTGTCTAAAAGACTAAAACGTATCTATTGTACTACCAAGTGTTTGATACATGTCCAATAAGTGTTGGCATGCCGGACACTTGGACATGCTAGCCAAACTAAAATGTTTGTGCTTCTTAGTGTGCCACTTGATGACTTCATGTCACCACCTTAAATGGACCAGACCAATTATTTGGGGTCTAATTTCCTATTCCACTGGTAGCTAAAAACCATTGTTTGTATGGTTGTAATCTGAGGAACACCTTATCCCAATTACCAACTTGAGACTGAACATATCACCTTCCTTCACCTGTTGACTTTTAAGACGTTGAGGTCGAGAGAGTTGAAATTCCAAAGCACCAAAGATAGCATCGTGGTTGGTCCATTCGAAATTGGGGAAGTGGATCACACATATTTAATGAGAGAGAGAAAGAGGACAAGTGGGGCTGGAGGTTTTCAGGTGAATCATCTTCTCTCTGGCGAAGGGTGATTGTTAGCATTCACGGCCTGGCGGATAATGGATGGGATCCTAGCAGCTCTGGAAAGTGCGACAGAAGTCCATGGGTCAACATTACAAGGGTTTGGAGGCGCTCTGACATTTGGCCCCTGCTCAAGTTAGGAAATGGCGGTCGCTTAGGTTCTGGTGGGAGCCGTGGATAGGGGGCGTTCCTCTCAAAGACAAGTTCCCTGGACTTTTTGGGGTAGCAAGGAACAAGTTGATTACGGTGGATGAAGTCTGGGACAGTCAGTTGGGAGTCTGGGCTTTAATTTTTAGGAGGCCTTTGAAGGATGAAGAGATTGAAAGCCTTGCTGAGCTTTTGGGGGTCCTCGGTTGTGCAAAGCCTTCCCGGGAGCAGGACTCTAGATGCTGGGCTCTTGAGTCCTCGGGGAATTTTTCTGTAAAATCTCTTTCAAGGCACTTGTGCCCCTACCCCTCGATGGATAAGATGGAGTATAGATTGATATGGAAGGTGAAGTGTCCAAAGAGGGTCAAAGTGCTGACTTGGATCTTGCTGTTTGGAAGCTTAAATACAATTGATGTTTTACAGAAAAGGTGGCCGCAAATGTGTCTTCAGC
CTTCGGTATGCCCCCTTTGTTTTGAAGAAGCAGAAACAGGATATCATCTTTTTGTAAGTTGCTCTTTTGTTGGGAATTCTTTGGTCAAAGTCATATATGGAATTTGGAGTAGGTGGGTCTTTGCAGAAATCTAAAGGAGGGTGTGTTTCAGTTGTTGGCAGGGCCCCCACTTGCGCGAAGGGCCTCAATTTTGTGGGGAAACGCGGTAAGGGCATTGCTGGCTGATATTTGGTAGGAGCGAAATCAGAGAATTTTTCGCGATATCAGGAAATCCTGGGAATTATGCTTCGAAGCGGCTCATCTCAAGGCTTCGGCTTGGAGTTCTCTTTCAGGGGAGTTTGGTGGTTTCACTATGTCAGACATCCAGGCCAATTGGAACGCTTTTATTTTCCCTTTTTAGGATTCACTGCTGTCTTGTTTTTTTTTTATGTTATGGATGTTAGGGTTGCCTTAACTTTTGTACTTCTTCATTATCTTAATGAATTCATATTGTTATCTTTTCAAAAAAAAGAGGACAATCTCACAACAATTTGAAAGATGTGACGGTTAGGAATCCAAACACATAAAATAACAAAGAACACTAAGAAACTAATATATTGCAATAACTATGAAGAAATTACAATAGCCAATAGTCTTTCGAGAGGGTTATTTCTCTCCTAAGTCCTACAATGGACTCTGCCAAAATGATCCACACCCAAAAAGAGGACCCTAACACTACTATTTATAACCAAGACACCTTAAACTAATTACCAAAGTACCCCTCACTGATATCCAAATAATCTACCACATATACTCCATAGAATACCATACTAGTACTCTCACACTACTCCGCCTTCAAAGACACCTTGTCCTCAAGGTGTGCTCAACAAAACAAAGTCACAAAAAGAGACAAACAACTTTTTTTTTTAAATACTCACAGAACATAACAACTTCTAAAAAAGAAAACAAGCATTCGGTAACAAAAACTCAGTTGGTTAGCGCTTCATCCTCCATGTGCGACGGCGGTAGTTGGCAGGGAACACGATAGAGGTGACAAATAGTGGTTGATGGTGGCAGTCAATTGAGGTTGCCCAACAACCATGTAAGAAAATAAGAAGCTCACAATGACGACTGGGGTTTGTTTTCTTCCCAAATTGTCAAGCTGTTATGGGTTTTAAGTTGAAAACATGGACAAGCCCAACCCAAATTTTGTTTAACTCCTTTAAACAACAATGGGCCTCACCCAAGCTTTTTAAAGACTACCCATTAAAAGAAAGCATATCGAGCCCCTGCCCCATAAGTCCACAGTCCAATCTTCTGGAACCATCTAATTTCCCATTAGGTTTAGTGTTTTTTACCTATTCTTCTTCCTTTTTCCTTCTGTCACTAAGCATCCCAACACATAACAACGCACATAACTTGCAACGACAACTTCTGTAGGGATTCATCTCCGACCGTATTTGCTTCCGCCACCTACGTTGGTCTCCTTCGACCGATTCTCTCCTTTTTCTACTCGAATGACGATTTTCTTTGATTTTTCTTCCCCTACTCGAAATTATCTTAAGCCGTCGATGACGATTTTTCCCTTGATATTCCGACTTGGACACCATCCCCTTGAACTTTTTTCTAGCCCACTCGTGCACGATCTTCGATCTTCCGCAGAATACAGGTTGGGCTCCCTTCGATCACTTCTCTCTACCCCTTTTTTTGTCAATCGAACTGGTTTCGTCTTCCAACTTCGTCGTTTCACGACTCTCCTTTTCCCCGATCGTTGGGACTTTATATTTCTCCAATTCGAGAATCCTATATCTTATGGTTCATTCTCTATCTCTCTTGCCCCCTTTCTTTCTAATCTTTTTCTTTTTCTTCTGCTTCTTCCTCATTGTTTTTGTCTTCCTTCTTTGGGCGAATGTTTCCTTGTGATGACTCTGTTCTCACTCTGGACAAACTCTCTTTCTTCATCTTTCTTTGCTTGTGTTTTTCTTTTACAGTCTTGGAGTTTTTCTTGGTACTGCACCGTG
ATCGTGGTGGAACTATCTTCTTGTAACTTCCCTTTTCCGGCGATCATCATCATTAGTGCTTTATGGTTCTCTCGCATTTCTTCTGTTTGTCTCCGCATACTTTCTTTTGCTTCTTCCAAACTCCTCTGGTTCTCATTCACTCGGTTTGATAAAATTCTCATGTACTTCGTCAGTTCTAGAACCGTTGCCCGAAGGTCGCTAACGTCTCTCTCTGTTCCTTATACTCTTTCTTCCACCTGTCTATGTGCCATACTCCTTGTTCTTCCCAAGATGTGATATGCTTTGATACCAATTTGTTAGGGATCCAAACACATAAAATAACAAAAAACACCAAGAAACTAATATATTGCAATAACTATGAAGAAATTACATTAACCAATAGCCTTTTGAGAGGGCTATCTCTCTCCTAAGTCCCACAATGGACTCTCTACCAAAATGATCCACACCCAAAAAGAGGACCCTAACACTATTTATGACCAAGACACCTTAAATAATCTACATATACTCCAAAAAGAGGACCCTAATTACCAAAGTACCCCTCACTGATATCCAAATAATCTACATATACTCCATAGAATACCATACTAGTACTTTTTTTTGACAAGGAAACAAAGATTTCATTGATATAATGAAAAGAGACTAACTACCATACTAGTACTCTCACAGTGGCTTTCAGAGTTGCATGATACGTAGTATTGTACCCAATATTTATTTCAAGACAGCCATTCAACCCAACTTCATGGTTTCTCACAAGCAAAACATTTAAAATAAGTTTCCACCTTGCTATTCACAACCTATCACGTGAATGGTAAGATGCGTTTCTCTTCAAAATCATTCCCAGTAGTTCAAGTATTTTTTTTTCCCCAGAAGCGGCTCATGAATATCTTGTCACAGACCACAGTCTAAGATTAAGGACTTAGGAACGCCGACGAATTTGATAAATGGATTAGATACAGAGGCAGTTGTATAGAGGTACTAAGAGGATGGAAATGGGTGGGGCCTGGTTAAGCGGTCTAAATTGGACTTGGTTCCTTATTTCAGCCAACCTTTCAATGAAATCTAATCGTGAGATCATTCCAAGTGATTGAGGATAGATTAGAGTTGTAGTAAACCGTCAGCAGACATAACCATACACTTGCTTTATTAACAAACAATTGGATATATAATCTTGTACTACTGAGAAGTGTCAGGGAGCTATCCCCCTTTCTTATTAGTGTCTTTGAGTTGTCATATGTAGGGTCTAGCATTATTCCTAGCCACTAAGTTTCTTTTCTTTTTTGTTTTTGTTTTTTCCACCAACGAAAGAACAATAGTAGATCTTCTGCCTATAATACTTCCTGTCCTCTCAACATTGGTATGGACTTAGATAGTTGTAGTGGTGACTTTTAAACGATATGTTAATGCACTTTATTTGAAACTCACTTGCAAGCTTTTTCTTTAGAGTAGGTAATCCTATTTCATCATTAGATGTGGAGTATTATGTCATCAAACAATCAAATTCCAAAATTGTCATTGTTATTAAATGACAGAGGGTCTCATCAGCTTGATTTAACAATCAAATTTAAATTTTGTAATTTTGTCATTCCCAAATCCTTGATTCGAAACACTATCAATGACTTTCGTGTTTTACATTTATTAACTTTAAATTTCACTTGGAGGAATGGGGTTTTGCGTGCAGGAACAGGATTTTAGGAGAAACATGCAAGGTGGTAGCTGGGAGAAGGTCAGCCAAATTAGAGATAAGTTTGAATACGACAGGGAGAGAAGAATGAGAGAGAGAGGTTAGTTTAAATGAATAAATTGCTTGATTTGTTTTATTTTTATTTATTTATTTATTTCTTGCTATCTGCAATCCATTGATCGCTTGGTTCTTGATTTCACAATCCACATCATGATATTCATTAGCTCCATATAATGGATGTTTGATGGAATTGTTCCTTCAGGTGCAATCTGGCTTTCTCATAGCTCGAACAAAGTAGTTGTACTTTTTAATAAGATTG
AATGAAAGTAATAAATCTGAGCTTAATTTACATGGCTGTACGTCAATTTCACTCAATTTCAGAATGACTTTGATTCTGGTTTTTCAATGTATCATTTAGTTAGCATTTTTTACTTTTGCCCTTTTCAATGCTAGTCAGTCTAGCACGTATAAACTTTTGTATATGCCTAATTGCCTATTCAAAGTAAGGGTTTTACTGTTAAATGCCCTGTTTCCCTAGCTCAGATGCTTGCTAGTGGCAACATTTCCTTATACTGTTCTTATGCCTCTTGGCCTTGGAGTAAATTTAACATTTGGTTTCTATCGATCAACCAGATCTTGAAGATAAGGTGTGTTTTGGAGGTGGGGTAATGTTAAGATTATTAGTAGGATACGCAGTAGAAATGTTATAAGATACTATGGCACATTAGTCGCTAGGTAAGGAATTTGTTAGTAGGAGGTGAGTACAAATAGAGGGAGTGAGAGGGACAATTTTTTATTTTTTAGTTTGTGAATTTTACTGGAGAGAGGTGTCCAAGCTCCTCAAACTATTTGGGGCCAGTTGTTTTGGCACCTTTGGATATGCTCTTTAGTTTTGGCCCCTTCCTCTTTCATTTGGGTTTATGAATTCACAATTGCAGTTGAATTGCAAGTACTGACAAATAGCTTAACTTTAACTTTCAGCATTTGCACCCATGCATGGAGATCAAGCTTTTGCCTCACAGAATCCAAATTCCCAAAGGCAACGAGCAGATGTGCAGTCGCAAAGATACTTCTCCGAGACCGACAGTGACTGACAAATAAACTAAAATTTATCTGCATCTATGTGAAGAAGAGGTTAGAAATGTACTTAAAGTTCATCTCTCCCTTCGTTGTTGCCAATATTTTCAGTTGATAGGGTGTCTTCTTTATCTCAAGTTTAAAACAACCAGAGTCATTTTGGTTGTTTAGATAGATTCAACTCAAGATCTACGGCTGGTTTCAATATTGTCTTAATATGCTCCTCCATAGAGTGAACACAGATAAACTGAACTTTTGTTATCACTACATCAGATAATGTTAACGGTTGAAAGAGCTTGGTGATGGATATCTGTGTGTAGGATCATTGTCCCCTACCCTCCCTCCGATGGGGGAATTTTTTGGCCTTCCTTTTAGGTCTGGAAATGAATACCAATGGAGAAAGCTTGGTGATGGATATATGTGTACACCTTTAGATCTTAGGCTTTTGTTACAATTGCATCACATGTGTGAATGTTGCTGTCAAAAACACTATCAAAGAACTGAAATGCCAATATGATGAACATTGAGGATTGGTTTGAAATTACTTTTCAACCCCTTAAAAAGGGATTTTAAGTGATACAAAAATATTTTTGAACCTTGAAAAGTCACTCCAATCAGGCCCTGGACTGCTTATGTTGTATTCAATTTGATATTTCTTCTTGCTACTATTGAAGTCTGAAGTCTAAAAGACAGTCTAAAGATTACTCTAGGACTGTTTTGTGAAAATGTTTTTTTTAGTACAACAATATAGGGAGGGAATTTCAAATCTTTGACCTCTTAATTGATTGTACATACTTTATATTAGTTAAACTATGCTCGTGTTGGAACTGTTTGTGAAATTGTTGGGTTGTAGTTGGTAACCATGGTAACTTGAGGGGTATTTTTTGTTTGGTCAAATTACAAGCAGGAGTCGAGTGAAAAAACAAATGCCCCCTCTTAGATATATATATATATATATATAATAGTTTCCTACTAATTTGGGTTGTTATTGTATTGTATAATATTGGTATGATTATAAGCATCCTAACAAAATTATAACCTAATTGGTTCTCTAAAAATCGCTATCAACTTCATAAATATCATGCCATATAGATTTAATTTTGGTCAATTTCAACCTTCACTAATAATGGATGTGACAAAAAAATGTTATAAAATGGAAGGATTTGGTTAAAACCTTAAACTAATATGGACTAAATTTTCAAAACACCTAAATGTTAGTTTAGTTTTTATTTTTTTTTGAAA
TAATGTTACACAATTCTTTAGTTTATATTTAATGAAACGTTAAATTATACCAACTATTCCTGAAATTTTGAAATATGCTTTAATTATGTTCTTAGATTTGGAAAATTTTCATTTTTAACTTTGAACTTTGTTTTAAAAAAGACTATTTTTTTTAGTACTTCAACAATTGGGAGTGGGGGATTCGAACCCATGACCTTTTGGTCAAAAGGTCATAAATCATACTTGATGCCAATTGAACAATGCTCTTGTCGGCTTTTAAAAAAATATTGTTAAAATAATTTATACAATTAATTTTTTACACATCAAATTTCATGTCGGATCTAAAATTTTTATCAAGTAAGGCTTAAATATGAAATTTTAAAAAATATTTAAAGTTATCTATACTAGAACTTGAACCTGAGACTTGAGACTTTAGGGGGCTAAAATCATACTAACCAAGTTAGAGGCTTTTTCGTGTTTTAATTTGCTTTTTCTTATATATATTATATAAAGTTACAAATTAGCTCCCTCTTGGCCTTAACTAAATCCACTCATGTTCATGTCAATTTAGGTGAATGATGTTATCCATTAGTTATCGCGTCAATTTTATCTTATTTTATTGAAAAAAAAAAACACTCTAATGATATCTTTTGAAATAAATTTTAAAATTCAAGAATAAAAAAAAATGAAAAATTTTAAAGTCACAAGGATAATCCAGAATTAGGGATATTTGTACGGAAGACCCCATTTTATTATCATTAATGTTGAATGACTCAGCGCTCAAAAATAATGAAAATCGACCTATTCTCGACGGCAACACGCGCAAAATTACGCCGAAGGGTAGCTTCTCGCCTCGCGAGAACGCCACGCCTTCTCGAAAGGCACGACGGCTACTCGTAGGAACGCTAGAGAACGCCGCACGCCACGCCTTCTCGAAGGAACGCGCACGCCTTACTCGTAGGAACGCTCACGGTTACTCGTAGGAACGCTATGAACGCTTCCAAGGAACGCTCCCGCTTAATCGTAGGAACGCTACCGCTTACTCGTAAGAACGCTCATCCTTACTCGAAACGGTGACTCGCTCATGCATACATACAGTAACGCTCACGTCTTTCGCTCACGGTTACCTTCAGAAACGCCGCGAGAAATACGCGCGTTATGCGATAAGACATCGTATGAAATGACATGCCAATTTATTAGCCGAATACTCTGACAAAGGAATGATGATAATAAAATTGTAAAATAATATAATACGTACAAAACTCCATTGTGTGTTCGATAATTACTACTATTGTTCCCATTAAGCTTAAGATACTTGTTCTATACTCTTCTCTCATTCTTCTTCGTGGAAGCCTTCTCGTAAAACCCTAAATTGGTGCGATCTCTCGTTAAGTAAGAGTAATGCGTTACAATGTTAATGGGCGCATACTTTATTCCCTTATACGTTTCAGTCAGTATGAAAACATCCATAGGATCACACCGTTGTTTTCGAAGAATTTAATTGTAAAAAACAGTAAATGAAAGTGTTTGAAGCTTGAAGCATTGAGTGGTGACATTGTTAGTAACGTGTGGAGGTTATTATGGGATTACCGAACGTGATAGCGAGGCGTGGCAGCGAGAAGTAGTAACAAATGGTAAAAGTGCGTGAGCGAGTGTGCTCTGAGAGCGAGATGACGGGAGCGGGGGACGACGGGAGCGAGGCAGCGTGGCGTTCTGAGATCTATAGTGACACCCAATTTCCAAAGATATCTAGGTTCATTCTTGTCGTTTATTAATGTCGTTGACAATCGAATTACCGCTCTATTTCAACTTGTTTTATTGCGGGATGGAGAAGAAATCTCTATTCATAAGATATGGTGGATGTTGGGATGAAATTCAAAATTGTTACATTGGAGGTTGCCTAACCGGGTGTTGTTGTGTCCGACACCATTAGGTTAGTTGAGTTGAAAAACCTTATATATGATGTTACGAGAACTAGCCGGACGATGGTGGACATTATAATAAGGGTGAAGATGCCGTTG
TTCGAGGAAGCACCTCCAATGTACATAAGGACCGGCCAGGGACCTTGAGTTTCTTCTACTTGAGGAAAATGTATCCGGGATGCAACTATTTATATCGACAAGATGTGAAAATGATGGTAATGTACATCCGAATGAGGTCGTCCCCATATACGAGTCATGTGGTGCACCAATCATTAGATGCTGACACACTCAATATACCACGAGTAGAGGTGGATATATCACCCAACGAGATGGGAGTATCATTAGTGAACGATGATGTGAACCCTTGGGAGTCAGAACATGGTCACGATGTTTTTGGACAATATGAGATGGATAATTGTGAAGCATGGTCGATTCCAAATCCCACCCAAATCCAACACTAACTCCAACTCCAACTCCATATGGAAATCCAAATCATTCACCCAATCCAACTCCAACGCCTATTCCAACTCCAACTCCCACTCCCACTCCCATTCCCACTCATACCCCTACTCATACTCCAATTCTTTATCCCACTGCAAATCCAAATCCAAATCCAAATCCAACATCCAATCCAACTCCATCTCTAAATCCAACTCCATCTATAAATCCAACTACAACTCCAACTCCAACTTCAACTCCAACTTCAACTCCAACTCCAAATCCAATTCCAACCTCAACCTTAACTCTGATTGGACAATCTTCATCTGTTAATAACTGTAATGAAGATGTTGGGGTTGGTCAGATGTTTGTAAATAAGAAAGAGTTGAAGAAGCGACTGTCCATGCTGGCAATGAATAATAATTTTGAATTTAGGTTAAAAATCAACGAAGGATTTGTACACGAGTTGGTTGTTTGGAGAAGACTTGCACATGGAGGTTACGTGCGCATGCAGATGGAAGGAACAAGCTTGTTTAGAATAAACAAGTATGTCGCCAACACTCATGCTCAATTGAATTGTTGAATCATGATCATAGGCAAGCAAGTCGTGAAGTCATAGGTCATTTAATAAGAAGCAAGTTTGCGGGAGTGGGTCGAATTTACAAGCCAAGCACATCATCGAGGATGTTAGACAACAATATGGTGTGAACATTAGTTATGACAAAGCATGGCGCGCTTTGGGGGCATGCATACGAGGCTTTTCACGCATGCGGGATGTTGTCATTGTTGATGGCTCACACACGAAGGGGAAATATAAAGGTTGTATGTTGGTTGCGCAGTGGGGAGGGATGGAAACAATCAAATTTATCCACCGCCTATGCGATAGTGGATAGTGAGAATGATCGGTCATGGACATGGTTCATGACGAAACCAAAAACTTTGTGAGTGATTCCGAAAATTTGGTGGTTGTATCCGATCGCAATGCAAGCATAGCCAACAGATGTTAAGTCTGTATTCCCACATGCATTCCATGGAGTGTGTACTTATCACTGGAGCTAAACATCATGAATAATTTCAAGGATAAGGCGCAGATTGAATTATTTAAGAGTCTGCGCGCGAGCGATCCATGATCAATTTAGAAGGTATTGGGATCAACTTGTTCTATTGCGTGGAGGTGCGATGAGCTTCGATATCTAGAGGCCATAGGTATGGAGCGTTGGGCGAAGATGTTTTCGTGTAAATGAGACGATATGACAACATGACGTCCAACGGTATACCGAGTGTTTCAATTCACTCACGGTTGAGGCACGCAGTTGCTCCCATTGTTGCGTTATTTGATTTTGTGCGAGAACACCTACAAAGATGGTTTTACTGACGACGTAATTATTGGGGCACGCACACACAACCTTGCTTTCGACTCACGCAGAGAAGCGACTTGCCGTGAGTGTGAGAAAGGTAGACGTTACTGCGTTGACCACTATTGATTGTTACAATTATCACGTGAGAGATGGTCATTTAGGGGGTATCGTCAATCTTCAAACACGCGTCGATGTACATGCAGGAGTGGGATTTGTATGAGATACCTTGTGCCCATGCTATCATGGCGGCTAGGGAGCGCAAACATTGGTCCATCAACCCTTTGCAACCAGATCTTATTTCGT
GGAAGCGCTCCAAGCGGTATATGAAGAACCAATTTTTCCACTTGGTCATATATCCGAATGGCCAAGCCACCGGTTTTGTGGATGTCCTGGATACAACCCCAAACGTGTTGTAAGGGGTTGGTCGAAGACGTGTACAACGGATCCCATCGGGTGGAGAGCATAGGCGTGGTAGAAAATGTGGTCGATGTGGTAATGCTCGGACACAATCGACAATCATGTAATCAACCCCCGAATCGATGAACATGTATGAAATCGAATGTAATTGCACAATGTGTTATTAGTATCACATCTTAATTTGTATACTGTGTTGCCTTGTAGGAACATGTGTGAAATCTGACGCTCGTTTTATCGTGCTCGTAGGAACAAACTTGATGGATCGCTCACGTTGGCTCGCTCACGTTGGCTCGCTCACGTTGGATCGCTCACGTTTGATCGGTCGCGTTGGATCGCTTTCGCGTTGGATCGCTTTCGCGTTGTGGCGCCGCGTTGGATCGCTCATGTTGGGTCGCTCGAGTTGGATCGCTCATGTTGGATCGCACACACTGTGGAACATTGTATCTCCTAATTAACTACCATTCACAGTATATGCATTGTACAAAACATTCGTTTATATGAACTGTATTCAATTAATCAATGAATTAATGTTCTACAATCGGTCAGATTTACATTAAATAGTTATACAGAAAACATATATACATCAATGAATCAATTAATCGTATGTGCAAATGAGAATTAGTGTCAAAAAATTCATTTCGGACCATAATCGACAAGCGTAGTGTCCCACGAAGTCCGCTCATTCGGTCTTGGATGAATCGGATTTGTTTATCGGCGCCCGGTGACGAGGTATTCAATCCACTTCGCGCAAAAATATGCCGCAATCAAGTGAGTTCCGCTGCAGATTTGCGCCTTGAACGTAATAATCTCCACCTATCCGTTTGTATCTCGGGCTTGTGACGTTGGACATCGCAATAATAAAGCAAAGACGGCAGAGTATACGTCAATGGCTCTAAGAACCTGTCCACTTCTTGTGGCTTGAGATACGATGGATAGGAGTCAAATATCATTATGCAACCTCTGTTAATGTCCAAACATAGTAGAAACCAATGCTCCTTCATGTTTGTTGGACAAAAAACGAAATCCACCTCATGCCAGCCCGGTTTGTATGCATCGTGCTCACCTCGTCGTGCGTCATCTCGCGCGTCGTGTGTCTTCTCATACTCCAAACCATGGTGATTTGATTTTCGCTTGCCTTTGATCCTCTGGAAGAGACTATCCTTCCTCGTGCATTTAGGTGAGACTGTATATGGTACATAGTAGTAAGACATAGTGAGTGAAAAACCGTAAATAACTTATATAATTAACTCACCAATATCCCAGTTGGAAGCACCGTGAACTTGTTTATACACATCTCTGGTCGCGTTGACAACTTGTTCTTCACGAACATGCACAACGTGTTCATAGTCTATGAACAACAACATTGAAAACATATTAAGGTGTACATTAGTTTAAAAAAAGAATATACGTTAGCAGATGTGAGTAACGAACGAATGAAGTAACACTTACGTCACACTCAACCCATGAGCTAGGTGTAAGGAGATCGGTGAAGAACTTTTTCCCCATTGGGACTGACGTCGTCCCCAACCCTGGCCTAACATCTCCCGCTGTGGATGGATCCTGAATCCATGACATCATCGACTCGAACACATCCTTATCAACACCATGTGCAGGATTGTACTTCACGGCCGGGTTATGAAGTATAGGTACCCTACCGGGAATAGGATATTTGGCCTTCTTGCGCTCTGGGGAGTGCTCGCCCTCGACATATTCTATGATAGTCTTACCACTGGGAGAGTAACGAACTCGTGGTACTCTCTTGCGACTACCCTTCCGCAAACCCTCTTGTTCGTCGGGTTGTTGGACCGCAAGTGGGACGGTGACGTCAATATCCTCGATCTCCAACGCCGGGTTGTGAGTTCCCCCAGCCAAGGGGGGCAGAGATG
GCATTTTCTTCAAACTATCCAATGTCGTGGCTGCATCTGCAACGACATTAGATAGAATATCCTCCCGATCATCGTTAAATCGGATTGGATGGGATGGGAGACATCCGCCAATGGCGATAGTGCGCGATCGGTTTGTGGAAGGGTCCGCTGCTCGTCGTGATGTGGATGGCTTCGGATGGAATGGGAGATGCCTCGGGTGGAGTGTGGATGGTGGATGGCTCGGGTTGAACGGAAGGGTCGGATGGGGTGAGGGTGATAGATATCTCGGGTGGCTCATTAGATATCTCTGTGGGAGTGAGGGTTGGTGTGGGTGTGGTGGATGGCTCGGATGGCTCATGGGGAATGATAGTGAGTGTGTCGGTGGTGGATGGATCGGGAAGGAGAGTGGGTGTGTGGGTGGTGGATGTGGTGGACGTCGTGGGTGGAATGAGAGTATGTGATGGCTCGGGTGGAACGGAAGGGTCGGACGGGGTGAGAGTGGTAGATGACTCGGGTGGGTCATTAGGAATGTGAGTGGGAGTGGGGGTGTCGGTGGTGGATGGATCGGGAATGAGAGTGGGGGTGGGATATGGCACATGTGAAGTGGGAGAAGGCTCGGGTGCATGATGGCTTGTTGTGCAATCATCGTCCTGCAATGTATTGAATCCATGTAAGACATACTATTTCCAAATAAGGATGACAGAAAGCAAGATACTACGTGATGTAATGTACCTTCATACTCCTAGACATGAACTCGCGTAGCCATAGCGAGATGGGTTTTAACCTCCCCGCCTTCCGTTCTCATTTACATCCTCATGTCCAAAGATATATACTGTTTTGGGCGATAACGAACATCTCGACGCCTTGAACGCGCATTTCTATACCCCGAATGTAGTCAACAGTGGTTGATGCGGTGGGCATATCAATGGGTTGATCGTGTGAAGAAGGTCGGACAATAGGGTCGTGGATCGGACCGGTGATCGAATGGGGATCGGTCGGAGAGCTAATATGCGAACGAATGGGGAGCGGTCGGTGTTCCGAGACTCTCACTAATGTTCTCCGATCCTCGATCATGCGGCATCCTCAACATCGACATCCATATCTTGGACCTCCTCAAACAAATCGTCATCTTCCGCAGATCCCAACTCATCTTCTTCATCATCGGACGCGTTAAGAGTCTATCGGTATGCAGTGCATCACGACATCGCATTTCCGATTCTGATGGGATAAGGTTCGCGACGACGACCAACTAATAACAAAGAATGCACAAACACATAGAGTGGGTAAGTAAGATGAGTCATGCATACACATTAACGTGAATCATATCATATTAGAGGTTATAGACCTTTTTTGATGCGAAGACATCTCGTTCCAAGGCCTTGTACGTAACAGAGTGTGTGCACGACCATCTACGAATACGTGGCACCGCGTCGTCACTGACGCGAGTAGCTATCCGATCTGCCACTGGTAGTAGGGTCTCGTATGCCCACACCTATTAGATATTCATTAGCTTCGTAGCAGTACTAGAATGTGATAATAAGTACATTTTACAACATGTATTACCCAAAAGGGGAGGGAATCCAGGAAGGTTATACTTAAAAGTATAACGTTTGTTAGTCTTCGACTTCATTTTATACGTTTCGAGCTTCCCCTGCAAGGCGGCCTTCAACCCACGTATGGTCCTTTCCCATATCCTAGAACCCCAATCAATACCGTTGAAGTATTCGAGGTCCTCCACGTCACCAAACAAAGAACTGTCGACGACGGTTTTTGACTTGTTCTTCCCCATCATTACTGCCTCACAGTAATATATAAGCGACACCTTCACCGCATCTTCGTCGGTGTCAAATTTTATCGTCTTGTACGCAGACTCAAAAGCATCTACATGGATGTCCGTCCCCATGCCATCGAAATACTTCTCCCGAAGAGCAACTGACGGTTCGGGCCGTTCACGTGACTGTGGGGATTGCCACAATCCTGTCATTAGGATGAACTCATCCTTACCAAACTTGGCTACTGTT
CCATTGATAGAGAACAGCATCGATTCATTGCCTTCGCCTCCACTCACCTCCCTCAGAAGAATGTGATGGACTAGTGGACTATTGAATACGAGGTCCACATCAACGAACGAACCGAATACTGTCCGCCTAAATAGTCCAAGCTGTGTAGATGTCAGCTTCTGTTTCAATACTTTGTTAGCACTGGTGATGTGTGCTAAACTAGATGCTTGACACGAAACCGATCTTTTCATGCAAATCTTGAGCGATTTGTCATTCTAAACCTGTGCATAATAGTTGAACATCATTTTAACAAAATGCACGAACACTGATATATTAAAAACAGTGTGTATTACAGTCAGTATATACAAAATGAAATGGATCAATCGCGCCCGTCAACTCGCTCCCGCTGACTCGCTCCCGTTCACTCGCACACGCCCAATCGCTCACGTATTCTCGTCCTTGCGTTCGATGAACGTAACAACCCCAAACTAGACAGATATAACTTAAACCCTAGCAACCTATCGTCGATTTCCAAATAAAACTAGACAGATATAGTACATGCACCCTTTCGACCGCTCGACTGTAACAATCAATATCGCAGTTACGAATATAAAAAAAAAAAAAACAATACAGTGAAACCTGTCGACGGTGGAATATCGAGGGCTAACAGCGGTAGAAGGTCGACGTAAAACCCCGATGGACTGTCGACGACAAGCGGCGGTGGAACGGAGTGCGACCGAACGCCTGGGAACTGAGAGAGAGTGATAGTGTCGAGAACTTAGAGAGAGAGTAAGAACTTGTTAGTGAGATGTAGAAATGAAAAACCCTTTTTATATGAAATGAAAAAAAAACGTTGGCTCGCTCACGCAGGATCGCTCTCGTTGGATCGCTCTCGTTATATCGCTCTCGTTGGATCGCTCTCGTTGGATCGCTCTCGTAGATCGCTCTCGTCGGATCGCTCTCGTAGATAGTCTCTCACTTTCTTGATATCGCCTATTTCAACCCGCATTATTGCTCCCCAAATGTCGTTCATTGCTCGCTTCTCGAAGACTCGCTCTGCGTTTCTCCATTACTCATTGGCGTGGATACACTCACGCCGATCGAGTATTTTCATCATTACAAAGACAAAGAAGTAATTTCTAATTGGAGAAACAAATAGTCGCATTAATTAGTGATCGGCGCCTTTAAGTTCGACCAGTTGAGTTTTTTAGTATTAATATAGGGTGTGGCTTATACAAATTTCATCCTAAACCACAATTAACCTCTATTCATCTTCTTCGCCTTTCTTTCTTTTATTCTTCCTTCAAAATGAATTCAAACACAACCGAAACACAGATTCGGATAAATCGATTTACGACAATGAAGAGGTAGAAAAACCGTTTTCTGCCTCCACACCACAATCATCGGAGGATAGTTTTTTCAACTATCACCGGGCGACCACCGAGCAAGTCCTCGGATGACTCCTCCGGGAGGAATCGAGGAGTCCTCGAGAGTCCTCCGAGAGTCCTCCGAGACCCCATTGAGGCAACCGCGCCCACGGTTCTTGATTCCGACATCCCTCCCCACAACGGGCGTGCCTTACCGAATGTTCCGATAACATTCCTGCTGAAGAGCGCCCAAGAAGAGGCTGCATACCGGCCCCGATCGTGTCGACTCCTCCAAAGTGAGTCAGTTAGGAGATGCGAGCTTCCTTTCCCTTCCTTATAATGTGTTTTTATTTTTCTTAATGTACTTTTTTTGTGACAAGAAAATGAATGAATATATTATGGTTGTTTAGATATTTGCATCTTCTGCTTTCATTATTTCCAAGCGCGAGAGCGAGTTTAAGCGTGAGGTTCCTACGAGTAAGCATGCATGAGCGTTCATCAGATGTATGCATGAGCGAGTGACCGCTTTGAGTAAGGATGAGCGTTCTACGAGTAAGCATGAGCGTTCCTACGAGTAAGCGGGAGCGTTCCTACGAGTAAGCATGGCGTTCCTACGGAGCGTTCTCCTACGAGTAACCGTGAGCGTTCTACGAG
TAAGGCGTGCGCGTTCCTTCCGAGAAGGCGTGGCGTCGCGCTGTTCTAGCGCTCCTAGCGTTCCTACGAGTAACCGCGCGTTCCTTTCGAGAAGGCGTGGCGTTCTGCGAGGCGGCAAAGCTACCCTTCTACGTAATTTTGCGCGCGCGTGTTGCCCACGTCGAGAATAGGTCGATTTTCATTATTTTCAGCGCACGCGTCATTCAACATTAATGATAATAAAATGGGGTCTTCCGTACAAATATCCCCCAGAATTAGAGAAATATTTTGTATAATTTAATTTTTTTTAAAAAAATCGAGAATTACACTATTTTAATAGTAATATTATTAGAATAATAATATCAACAAAAGAAATGCTGGAATGGTGTGGTTGACTTAGTTCGCCGTTCGAGCCAACCAGCGCTACAAGCACGGCGGCGATGAAGGCGCTCCAGCTCCACTGTTACAGTCACTTCAAACCCACCGCCACCGCCACCGCCGCCGCACATCGTCACTTCGCCACCAAATACACCGCCAAAATCACTTCTTCTTCTCCCACCGGACGTTCCGTTTCCGTCGAGGTCACTCCGCCGGCTCCTCTTCCCGTCGACTCTCGCGGCCATTCTCTTCCCCGCCGAGATCTCATCTGCAGAGCCATTCAGATACTCCTCGATCGCAAACACCGTTCCTCTTCATCCAGCATTGATGATCGCTTCTCCGATTTATCCTCCTACTTTCAATCTCTCTCTGTCTCCCTAACTCCAGCTGAAGCCTCTGAAATTCTCAAATCCCTAAACCCCGATCTCGCTCTGCAATTCTTCCAGCTTTGCCCCTCCCTTTGCCCTAAGTTCCGCCACGATGTCTTCACTTACAGCCGCATCATTCTCATACTCTCCCATTCATCTTCCCCGAAACGGTTCGATCACGTTCGTGAGATTCTGTCGCAGATGGATAGAGATCAAATACGTGGTACGATTTCTACTGTTAATATCTTGATTAGAATTTTTGGTAGCAAAGAGGACTTGGAGTTATGTACTGGTTTGATTAAGAAATGGGACTTGAGACTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTAAGGTCCCATGATTCAGATAGGGCTTTCAATGTGTATATGGAAATGCGGGGTCGGGGGTATAAACTGGATATCTTTGCCTACAATATGCTGTTGGATGCTCTGGCTAAAGATGAAAAGGTTTGATTTTAGAATGTTGAAACGATTTTTAATGTTTGATTCTTTATGAAATAGCTGTTCACTTGTTCCTTCTGATCTAATTTTATGTTTTGTTTTAACAGCTTGATCGAACTTACAAAGTTTTTAAGGATATGAAACTGAAGCACTGTAATCCAGATGAGTATACGTATAGTATTATGATTAGAATGACTGGAAAAGGGGGTAGAACTGAAGAGTCTTTGGCGTTCTTTGAAGAAATGCTAACAAAGGGCTGTACTCCGAATTTGATTGTATATAATACAATGATTGAGGCACTTTCTAGGAGCAGAATGGTCGACAAGGCAATTCTTCTTTTTTCTAATATGGTTAAGAATAATTGTAGGCCGAATGAGTTTACATATAGTGTCATTTTGAATGTTTTGGTCGCAGAAGGACAGTTGGGTAGATTGGATGAAGTTCTGGGAGTGTCTGATAAATTTATGAACAAATCGATATATGCATATCTTGTTAGGACTCTAAGCAAACTAGGCCATGCAAGTGAAGCTCATCGTCTTTTCTGCAACATGTGGAGCTTTCATGATAGAGGAGATAGAGATGCTTACATTTCCATGTTGGAGAGCCTATGCAGTGCAGGTAAAACTGTAGAAGCTATCGACCTGCTCGGTAAGGTTCATGAGAAGGGAATTAGTTGTGATACTATGATGTATAACATGGTGTTATCTACTTTGGGGAAGCTGAAGCAAGTAAGTCATCTTCATGATCTTTATCAGAAGATGAAACAAGATGGGCCGTTGCCCGACATATTCACATATAATATTC
TTATATCAAGCTTAGGACGTGCTGGGAAAGTTAAGGAGGCTGTTAAAGTTTTTGAAGAACTTGAGAGTAGTAGTTGTAAACCAGATATTATATCCTACAATTCTTTGATCAATTGCCTTGGGAAAAATGGGGATGTTGATGAAGCTCACATGATATTTCTCGAGATGCAAGAGAAGGGATTGAATCCTGATGTTGTAACGTACAGCACACTCATTGAATGTTTTGGGAAAACAGATAAAGTTGAGATGGCTCGCAGTTTGTTTGATAAAATGATGGCTCAAGGATGCTGTCCGAATATTGTAACATACAACATACTACTTGACTGTCTTGAAAGAGCAGGGAGAACTGGTGAAACAGTTGATCTTTATGCAAAGCTTAAACAGCAGGGATTAACACCAGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCTAATAGAAAATTTAGAGTCCGGAGGCAAAATCCAATTACTGGTTGGGTCGTTAGTCCTTTAAGGTAATATTACAAAAAAGTTTTGAAGGAAGAGAAAAGGTCTATCTATTTATCTGAGATGCATCTCTCTTATGTCTACTTCTGACATATTTTTACCAAGTAGTTGCCACAGACGATGCATTTGATGTATTACAAAGTTTTAGAGGTCTAGAGTAAAACCATCCTATTTTCTGATTCTTGGGTAAAGAAAAAATGCCAAAGTTTATATCTTCAGAATATTGATGGTGATTTTTTCAGAGCATCAGTGGTCAGCAATGAAAATATCGACGTTCTTGCATTACGCAGTGCGAACGATGTCGTTACACAGGCACGTTGAATGTAAATAAAGCCAAACCCAGTGAATCCTCCTCCTGCTGAACTTGAAATCCTGAGACTGGATCTTTGTTCAATAGCAAGAAATCATATCTGATTAAATTTCTCTACCCTCACAGAAATTTTCTAATGAGAAATCAAAAAGGGTATTTTCCAAAGGTGAGAATACTCTGTTGTCTAGTGGAGGCTTCGCCAACTTCAAGTGAGTTCAAGTTTTTGAGTTCTCCACCGGAAGGCTTTAGGGTAAGCAAAGTCACTCGTTTATCTTTTATCTGTTTCAGGGAAGATTATTGTCATTGTTTTATATAACATGTACATTACCTGTTAGAAAGGCGAACAATTTCTTTCCTTAGTAGGATTTTCATGAAAGAAGTCTATATTCTCTTTTGCCTATTTAAGTCATCAATTTTGGATACCA

mRNA sequence

ATGCTCTCACTCTCACACAGACTCCGGACTCTGTCCTCATCTCTATGCTCCAAATCTCATCAATTTCGATCGGTCCGAACGGCCACCGGACCGTCCAAGCGGCGATCAAAGGCTGCCGCATTCACCGTAAAGAGGCCCGACGAGAAGTCCGAATGGTGGGTTGTTGACGGCGAAATGCACGAAATCGGCGACAATGTGCCCCCTCGCGAGCGCTTCGTTATACCCAGAGAAAATCTCCCCAATCGGCGTCGAAAGCAGCTCAGGGAGCAGTTCATGCGCCGGACTCGCCTCGTTCTTAAGGAATCTGAACACGAGCCTTGGTGCAAAAGGTACATGGAGCTTTATCAGGAGCTAAGGGAGAACTGGGAGAGGCTGTACTGGGATGAGGGTTACTCTAAAAAACTTGCCCAGGAACATGCAAATTATGAGTCTTCTGAAGACGAGGATTTTTCCCCTTATAGGAATAGGCAGTTCACTGCTGATCGAAGTAAGGAACAGGATTTTAGGAGAAACATGCAAGGTGGTAGCTGGGAGAAGGTCAGCCAAATTAGAGATAAGTTTGAATACGACAGGGAGAGAAGAATGAGAGAGAGAGGTTATTCGCCGTTCGAGCCAACCAGCGCTACAAGCACGGCGGCGATGAAGGCGCTCCAGCTCCACTGTTACAGTCACTTCAAACCCACCGCCACCGCCACCGCCGCCGCACATCGTCACTTCGCCACCAAATACACCGCCAAAATCACTTCTTCTTCTCCCACCGGACGTTCCGTTTCCGTCGAGGTCACTCCGCCGGCTCCTCTTCCCGTCGACTCTCGCGGCCATTCTCTTCCCCGCCGAGATCTCATCTGCAGAGCCATTCAGATACTCCTCGATCGCAAACACCGTTCCTCTTCATCCAGCATTGATGATCGCTTCTCCGATTTATCCTCCTACTTTCAATCTCTCTCTGTCTCCCTAACTCCAGCTGAAGCCTCTGAAATTCTCAAATCCCTAAACCCCGATCTCGCTCTGCAATTCTTCCAGCTTTGCCCCTCCCTTTGCCCTAAGTTCCGCCACGATGTCTTCACTTACAGCCGCATCATTCTCATACTCTCCCATTCATCTTCCCCGAAACGGTTCGATCACGTTCGTGAGATTCTGTCGCAGATGGATAGAGATCAAATACGTGGTACGATTTCTACTGTTAATATCTTGATTAGAATTTTTGGTAGCAAAGAGGACTTGGAGTTATGTACTGGTTTGATTAAGAAATGGGACTTGAGACTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTAAGGTCCCATGATTCAGATAGGGCTTTCAATGTGTATATGGAAATGCGGGGTCGGGGGTATAAACTGGATATCTTTGCCTACAATATGCTGTTGGATGCTCTGGCTAAAGATGAAAAGCTTGATCGAACTTACAAAGTTTTTAAGGATATGAAACTGAAGCACTGTAATCCAGATGAGTATACGTATAGTATTATGATTAGAATGACTGGAAAAGGGGGTAGAACTGAAGAGTCTTTGGCGTTCTTTGAAGAAATGCTAACAAAGGGCTGTACTCCGAATTTGATTGTATATAATACAATGATTGAGGCACTTTCTAGGAGCAGAATGGTCGACAAGGCAATTCTTCTTTTTTCTAATATGGTTAAGAATAATTGTAGGCCGAATGAGTTTACATATAGTGTCATTTTGAATGTTTTGGTCGCAGAAGGACAGTTGGGTAGATTGGATGAAGTTCTGGGAGTGTCTGATAAATTTATGAACAAATCGATATATGCATATCTTGTTAGGACTCTAAGCAAACTAGGCCATGCAAGTGAAGCTCATCGTCTTTTCTGCAACATGTGGAGCTTTCATGATAGAGGAGATAGAGATGCTTACATTTCCATGTTGGAGAGCCTATGCAGTGCAGGTAAAACTGTAGAAGCTATCGACCTGCTCGGTAAGGTTCATGAGAAGGGAATTAGTTGTGATACTAT
GATGTATAACATGGTGTTATCTACTTTGGGGAAGCTGAAGCAAGTAAGTCATCTTCATGATCTTTATCAGAAGATGAAACAAGATGGGCCGTTGCCCGACATATTCACATATAATATTCTTATATCAAGCTTAGGACGTGCTGGGAAAGTTAAGGAGGCTGTTAAAGTTTTTGAAGAACTTGAGAGTAGTAGTTGTAAACCAGATATTATATCCTACAATTCTTTGATCAATTGCCTTGGGAAAAATGGGGATGTTGATGAAGCTCACATGATATTTCTCGAGATGCAAGAGAAGGGATTGAATCCTGATGTTGTAACGTACAGCACACTCATTGAATGTTTTGGGAAAACAGATAAAGTTGAGATGGCTCGCAGTTTGTTTGATAAAATGATGGCTCAAGGATGCTGTCCGAATATTGTAACATACAACATACTACTTGACTGTCTTGAAAGAGCAGGGAGAACTGGTGAAACAGTTGATCTTTATGCAAAGCTTAAACAGCAGGGATTAACACCAGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCTAATAGAAAATTTAGAGTCCGGAGGCAAAATCCAATTACTGGTTGGGTCGTTAGTCCTTTAAGGTAATATTACAAAAAAGTTTTGAAGGAAGAGAAAAGGTCTATCTATTTATCTGAGATGCATCTCTCTTATGTCTACTTCTGACATATTTTTACCAAGTAGTTGCCACAGACGATGCATTTGATGTATTACAAAGTTTTAGAGGTCTAGAGTAAAACCATCCTATTTTCTGATTCTTGGGTAAAGAAAAAATGCCAAAGTTTATATCTTCAGAATATTGATGGTGATTTTTTCAGAGCATCAGTGGTCAGCAATGAAAATATCGACGTTCTTGCATTACGCAGTGCGAACGATGTCGTTACACAGGCACGTTGAATGTAAATAAAGCCAAACCCAGTGAATCCTCCTCCTGCTGAACTTGAAATCCTGAGACTGGATCTTTGTTCAATAGCAAGAAATCATATCTGATTAAATTTCTCTACCCTCACAGAAATTTTCTAATGAGAAATCAAAAAGGGTATTTTCCAAAGGTGAGAATACTCTGTTGTCTAGTGGAGGCTTCGCCAACTTCAAGTGAGTTCAAGTTTTTGAGTTCTCCACCGGAAGGCTTTAGGGTAAGCAAAGTCACTCGTTTATCTTTTATCTGTTTCAGGGAAGATTATTGTCATTGTTTTATATAACATGTACATTACCTGTTAGAAAGGCGAACAATTTCTTTCCTTAGTAGGATTTTCATGAAAGAAGTCTATATTCTCTTTTGCCTATTTAAGTCATCAATTTTGGATACCA

Coding sequence (CDS)

ATGCTCTCACTCTCACACAGACTCCGGACTCTGTCCTCATCTCTATGCTCCAAATCTCATCAATTTCGATCGGTCCGAACGGCCACCGGACCGTCCAAGCGGCGATCAAAGGCTGCCGCATTCACCGTAAAGAGGCCCGACGAGAAGTCCGAATGGTGGGTTGTTGACGGCGAAATGCACGAAATCGGCGACAATGTGCCCCCTCGCGAGCGCTTCGTTATACCCAGAGAAAATCTCCCCAATCGGCGTCGAAAGCAGCTCAGGGAGCAGTTCATGCGCCGGACTCGCCTCGTTCTTAAGGAATCTGAACACGAGCCTTGGTGCAAAAGGTACATGGAGCTTTATCAGGAGCTAAGGGAGAACTGGGAGAGGCTGTACTGGGATGAGGGTTACTCTAAAAAACTTGCCCAGGAACATGCAAATTATGAGTCTTCTGAAGACGAGGATTTTTCCCCTTATAGGAATAGGCAGTTCACTGCTGATCGAAGTAAGGAACAGGATTTTAGGAGAAACATGCAAGGTGGTAGCTGGGAGAAGGTCAGCCAAATTAGAGATAAGTTTGAATACGACAGGGAGAGAAGAATGAGAGAGAGAGGTTATTCGCCGTTCGAGCCAACCAGCGCTACAAGCACGGCGGCGATGAAGGCGCTCCAGCTCCACTGTTACAGTCACTTCAAACCCACCGCCACCGCCACCGCCGCCGCACATCGTCACTTCGCCACCAAATACACCGCCAAAATCACTTCTTCTTCTCCCACCGGACGTTCCGTTTCCGTCGAGGTCACTCCGCCGGCTCCTCTTCCCGTCGACTCTCGCGGCCATTCTCTTCCCCGCCGAGATCTCATCTGCAGAGCCATTCAGATACTCCTCGATCGCAAACACCGTTCCTCTTCATCCAGCATTGATGATCGCTTCTCCGATTTATCCTCCTACTTTCAATCTCTCTCTGTCTCCCTAACTCCAGCTGAAGCCTCTGAAATTCTCAAATCCCTAAACCCCGATCTCGCTCTGCAATTCTTCCAGCTTTGCCCCTCCCTTTGCCCTAAGTTCCGCCACGATGTCTTCACTTACAGCCGCATCATTCTCATACTCTCCCATTCATCTTCCCCGAAACGGTTCGATCACGTTCGTGAGATTCTGTCGCAGATGGATAGAGATCAAATACGTGGTACGATTTCTACTGTTAATATCTTGATTAGAATTTTTGGTAGCAAAGAGGACTTGGAGTTATGTACTGGTTTGATTAAGAAATGGGACTTGAGACTTAATGCTTACACCTATAGGTGTTTGCTTCAAGCTCATGTAAGGTCCCATGATTCAGATAGGGCTTTCAATGTGTATATGGAAATGCGGGGTCGGGGGTATAAACTGGATATCTTTGCCTACAATATGCTGTTGGATGCTCTGGCTAAAGATGAAAAGCTTGATCGAACTTACAAAGTTTTTAAGGATATGAAACTGAAGCACTGTAATCCAGATGAGTATACGTATAGTATTATGATTAGAATGACTGGAAAAGGGGGTAGAACTGAAGAGTCTTTGGCGTTCTTTGAAGAAATGCTAACAAAGGGCTGTACTCCGAATTTGATTGTATATAATACAATGATTGAGGCACTTTCTAGGAGCAGAATGGTCGACAAGGCAATTCTTCTTTTTTCTAATATGGTTAAGAATAATTGTAGGCCGAATGAGTTTACATATAGTGTCATTTTGAATGTTTTGGTCGCAGAAGGACAGTTGGGTAGATTGGATGAAGTTCTGGGAGTGTCTGATAAATTTATGAACAAATCGATATATGCATATCTTGTTAGGACTCTAAGCAAACTAGGCCATGCAAGTGAAGCTCATCGTCTTTTCTGCAACATGTGGAGCTTTCATGATAGAGGAGATAGAGATGCTTACATTTCCATGTTGGAGAGCCTATGCAGTGCAGGTAAAACTGTAGAAGCTATCGACCTGCTCGGTAAGGTTCATGAGAAGGGAATTAGTTGTGATACTAT
GATGTATAACATGGTGTTATCTACTTTGGGGAAGCTGAAGCAAGTAAGTCATCTTCATGATCTTTATCAGAAGATGAAACAAGATGGGCCGTTGCCCGACATATTCACATATAATATTCTTATATCAAGCTTAGGACGTGCTGGGAAAGTTAAGGAGGCTGTTAAAGTTTTTGAAGAACTTGAGAGTAGTAGTTGTAAACCAGATATTATATCCTACAATTCTTTGATCAATTGCCTTGGGAAAAATGGGGATGTTGATGAAGCTCACATGATATTTCTCGAGATGCAAGAGAAGGGATTGAATCCTGATGTTGTAACGTACAGCACACTCATTGAATGTTTTGGGAAAACAGATAAAGTTGAGATGGCTCGCAGTTTGTTTGATAAAATGATGGCTCAAGGATGCTGTCCGAATATTGTAACATACAACATACTACTTGACTGTCTTGAAAGAGCAGGGAGAACTGGTGAAACAGTTGATCTTTATGCAAAGCTTAAACAGCAGGGATTAACACCAGATTCAATTACATATGCTATACTTGACAGATTACAAAGTGGCTCTAATAGAAAATTTAGAGTCCGGAGGCAAAATCCAATTACTGGTTGGGTCGTTAGTCCTTTAAGGTAA

Protein sequence

MLSLSHRLRTLSSSLCSKSHQFRSVRTATGPSKRRSKAAAFTVKRPDEKSEWWVVDGEMHEIGDNVPPRERFVIPRENLPNRRRKQLREQFMRRTRLVLKESEHEPWCKRYMELYQELRENWERLYWDEGYSKKLAQEHANYESSEDEDFSPYRNRQFTADRSKEQDFRRNMQGGSWEKVSQIRDKFEYDRERRMRERGYSPFEPTSATSTAAMKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRGHSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNPDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTISTVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPLR
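
The protein sequence corresponds to the CDS read codon by codon. As a rough check, the sketch below translates the first and last few codons of the CDS with the standard genetic code; it assumes the standard nuclear code applies, and the helper name translate is hypothetical rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 21 nucleotides of the CDS listed above.
cds_start = "ATGCTCTCACTCTCACACAGACTCCGGACT"
cds_end = "GTCGTTAGTCCTTTAAGGTAA"

print(translate(cds_start))  # expected to match the protein's first residues
print(translate(cds_end))    # expected to match the protein's last residues
```

The two fragments translate to MLSLSHRLRT and VVSPLR, which agree with the beginning and end of the protein sequence above.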
Homology
BLAST of Tan0005393 vs. ExPASy Swiss-Prot
Match: Q9ZU27 (Pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g51965 PE=2 SV=1)

HSP 1 Score: 874.0 bits (2257), Expect = 1.4e-252
Identity = 427/651 (65.59%), Positives = 530/651 (81.41%), Query Frame = 0

Query: 225 FKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRGHSLPRRDLICR 284
           F    T T    RH+ATKY AK+TSSSP+GRS+S EV+ P PLP D RG+ LPRR LICR
Sbjct: 9   FNSVNTITRPNRRHYATKYVAKVTSSSPSGRSLSAEVSLPNPLPADVRGYPLPRRHLICR 68

Query: 285 AIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN-PDLALQFFQLC 344
           A  ++      + +S++ D FSDLS Y  SLS+SLTP EASEILKSLN P LA++FF+L 
Sbjct: 69  ATNLI------TGASNLSDAFSDLSDYLSSLSLSLTPDEASEILKSLNSPLLAVEFFKLV 128

Query: 345 PSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTISTVNILIRIFG 404
           PSLCP  ++D F Y+RIILILS S+ P RFD VR IL  M +  + G ISTVNILI  FG
Sbjct: 129 PSLCPYSQNDPFLYNRIILILSRSNLPDRFDRVRSILDSMVKSNVHGNISTVNILIGFFG 188

Query: 405 SKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAYN 464
           + EDL++C  L+KKWDL++N++TY+CLLQA++RS D  +AF+VY E+R  G+KLDIFAYN
Sbjct: 189 NTEDLQMCLRLVKKWDLKMNSFTYKCLLQAYLRSRDYSKAFDVYCEIRRGGHKLDIFAYN 248

Query: 465 MLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLTK 524
           MLLDALAKDEK     +VF+DMK +HC  DEYTY+IMIR  G+ G+ +E++  F EM+T+
Sbjct: 249 MLLDALAKDEK---ACQVFEDMKKRHCRRDEYTYTIMIRTMGRIGKCDEAVGLFNEMITE 308

Query: 525 GCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRL 584
           G T N++ YNT+++ L++ +MVDKAI +FS MV+  CRPNE+TYS++LN+LVAEGQL RL
Sbjct: 309 GLTLNVVGYNTLMQVLAKGKMVDKAIQVFSRMVETGCRPNEYTYSLLLNLLVAEGQLVRL 368

Query: 585 DEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCS 644
           D V+ +S ++M + IY+YLVRTLSKLGH SEAHRLFC+MWSF  +G+RD+Y+SMLESLC 
Sbjct: 369 DGVVEISKRYMTQGIYSYLVRTLSKLGHVSEAHRLFCDMWSFPVKGERDSYMSMLESLCG 428

Query: 645 AGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFT 704
           AGKT+EAI++L K+HEKG+  DTMMYN V S LGKLKQ+SH+HDL++KMK+DGP PDIFT
Sbjct: 429 AGKTIEAIEMLSKIHEKGVVTDTMMYNTVFSALGKLKQISHIHDLFEKMKKDGPSPDIFT 488

Query: 705 YNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQ 764
           YNILI+S GR G+V EA+ +FEELE S CKPDIISYNSLINCLGKNGDVDEAH+ F EMQ
Sbjct: 489 YNILIASFGRVGEVDEAINIFEELERSDCKPDIISYNSLINCLGKNGDVDEAHVRFKEMQ 548

Query: 765 EKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERAGRTG 824
           EKGLNPDVVTYSTL+ECFGKT++VEMA SLF++M+ +GC PNIVTYNILLDCLE+ GRT 
Sbjct: 549 EKGLNPDVVTYSTLMECFGKTERVEMAYSLFEEMLVKGCQPNIVTYNILLDCLEKNGRTA 608

Query: 825 ETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 875
           E VDLY+K+KQQGLTPDSITY +L+RLQS S+ K R+RR+NPITGWVVSPL
Sbjct: 609 EAVDLYSKMKQQGLTPDSITYTVLERLQSVSHGKSRIRRKNPITGWVVSPL 650

BLAST of Tan0005393 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 5.9e-57
Identity = 145/551 (26.32%), Positives = 275/551 (49.91%), Query Frame = 0

Query: 315 LSVSLTPAEASEILKSLNPDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFD 374
           LS + TP  AS +L     D AL    +   L     H  FT  R   I  H  +  +  
Sbjct: 42  LSANFTPEAASNLLLKSQNDQAL----ILKFLNWANPHQFFTL-RCKCITLHILTKFKLY 101

Query: 375 HVREILSQMDRDQIRGTISTVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAH 434
              +IL++   D    T+      +     +E  +LC      +DL + +Y+   L+   
Sbjct: 102 KTAQILAE---DVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLI--- 161

Query: 435 VRSHDSDRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEK-LDRTYKVFKDMKLKHCNPD 494
                 D+A ++    +  G+   + +YN +LDA  + ++ +     VFK+M     +P+
Sbjct: 162 ------DKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPN 221

Query: 495 EYTYSIMIRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFS 554
            +TY+I+IR     G  + +L  F++M TKGC PN++ YNT+I+   + R +D    L  
Sbjct: 222 VFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLR 281

Query: 555 NMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGVSDK---FMNKSIYAYLVRTLSKLG 614
           +M      PN  +Y+V++N L  EG++  +  VL   ++    +++  Y  L++   K G
Sbjct: 282 SMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEG 341

Query: 615 HASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYN 674
           +  +A  +   M           Y S++ S+C AG    A++ L ++  +G+  +   Y 
Sbjct: 342 NFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYT 401

Query: 675 MVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESS 734
            ++    +   ++  + + ++M  +G  P + TYN LI+     GK+++A+ V E+++  
Sbjct: 402 TLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEK 461

Query: 735 SCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMA 794
              PD++SY+++++   ++ DVDEA  +  EM EKG+ PD +TYS+LI+ F +  + + A
Sbjct: 462 GLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEA 521

Query: 795 RSLFDKMMAQGCCPNIVTYNILLDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRL 854
             L+++M+  G  P+  TY  L++     G   + + L+ ++ ++G+ PD +TY++   L
Sbjct: 522 CDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSV---L 572

Query: 855 QSGSNRKFRVR 862
            +G N++ R R
Sbjct: 582 INGLNKQSRTR 572

BLAST of Tan0005393 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 223.8 bits (569), Expect = 7.7e-57
Identity = 152/552 (27.54%), Positives = 267/552 (48.37%), Query Frame = 0

Query: 302 DDRFSDLSSYFQSLSVSLTPAEASEILKSLNPDLALQFFQLCPSLCPK--FRHDVFTYSR 361
           D  FS   S   +L++  T    + +L++L  D  L+       L  K   + D  TY  
Sbjct: 99  DSSFSYFKSVAGNLNLVHTTETCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLT 158

Query: 362 IILILS----HSSSPKRFDHVREILSQMDRDQIRGTISTVNILIRIFGSKEDLELCTGLI 421
           I   LS       +P     +RE    ++     G I   ++L++     E +E+   +I
Sbjct: 159 IFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYNGLI---HLLLKSRFCTEAMEVYRRMI 218

Query: 422 KKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEKL 481
            +   R +  TY  L+    +  D D    +  EM   G K +++ + + +  L +  K+
Sbjct: 219 LE-GFRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKI 278

Query: 482 DRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTM 541
           +  Y++ K M  + C PD  TY+++I       + + +   FE+M T    P+ + Y T+
Sbjct: 279 NEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITL 338

Query: 542 IEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGV-SDKFM 601
           ++  S +R +D     +S M K+   P+  T++++++ L   G  G   + L V  D+ +
Sbjct: 339 LDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGI 398

Query: 602 NKSIYAY--LVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAID 661
             +++ Y  L+  L ++    +A  LF NM S   +     YI  ++    +G +V A++
Sbjct: 399 LPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFIDYYGKSGDSVSALE 458

Query: 662 LLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFTYNILISSLG 721
              K+  KGI+ + +  N  L +L K  +      ++  +K  G +PD  TYN+++    
Sbjct: 459 TFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYS 518

Query: 722 RAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQEKGLNPDVV 781
           + G++ EA+K+  E+  + C+PD+I  NSLIN L K   VDEA  +F+ M+E  L P VV
Sbjct: 519 KVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVV 578

Query: 782 TYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERAGRTGETVDLYAKL 841
           TY+TL+   GK  K++ A  LF+ M+ +GC PN +T+N L DCL +       + +  K+
Sbjct: 579 TYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKM 638

Query: 842 KQQGLTPDSITY 845
              G  PD  TY
Sbjct: 639 MDMGCVPDVFTY 646

BLAST of Tan0005393 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 5.0e-56
Identity = 120/429 (27.97%), Positives = 229/429 (53.38%), Query Frame = 0

Query: 425 YTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEKLDRTYKVFKD 484
           Y Y  ++  +  +   D A+++    R +G    + AYN +L  L K  K+D   KVF++
Sbjct: 309 YAYNTMIMGYGSAGKFDEAYSLLERQRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEE 368

Query: 485 MKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRM 544
           MK K   P+  TY+I+I M  + G+ + +    + M   G  PN+   N M++ L +S+ 
Sbjct: 369 MK-KDAAPNLSTYNILIDMLCRAGKLDTAFELRDSMQKAGLFPNVRTVNIMVDRLCKSQK 428

Query: 545 VDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGVSDKFM------NKSI 604
           +D+A  +F  M    C P+E T+  +++ L   G++GR+D+   V +K +      N  +
Sbjct: 429 LDEACAMFEEMDYKVCTPDEITFCSLIDGL---GKVGRVDDAYKVYEKMLDSDCRTNSIV 488

Query: 605 YAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVH 664
           Y  L++     G   + H+++ +M + +   D     + ++ +  AG+  +   +  ++ 
Sbjct: 489 YTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDLQLLNTYMDCMFKAGEPEKGRAMFEEIK 548

Query: 665 EKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFTYNILISSLGRAGKVK 724
            +    D   Y++++  L K    +  ++L+  MK+ G + D   YNI+I    + GKV 
Sbjct: 549 ARRFVPDARSYSILIHGLIKAGFANETYELFYSMKEQGCVLDTRAYNIVIDGFCKCGKVN 608

Query: 725 EAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQEKGLNPDVVTYSTLI 784
           +A ++ EE+++   +P +++Y S+I+ L K   +DEA+M+F E + K +  +VV YS+LI
Sbjct: 609 KAYQLLEEMKTKGFEPTVVTYGSVIDGLAKIDRLDEAYMLFEEAKSKRIELNVVIYSSLI 668

Query: 785 ECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERAGRTGETVDLYAKLKQQGLT 844
           + FGK  +++ A  + +++M +G  PN+ T+N LLD L +A    E +  +  +K+   T
Sbjct: 669 DGFGKVGRIDEAYLILEELMQKGLTPNLYTWNSLLDALVKAEEINEALVCFQSMKELKCT 728

Query: 845 PDSITYAIL 848
           P+ +TY IL
Sbjct: 729 PNQVTYGIL 733

BLAST of Tan0005393 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 3.9e-53
Identity = 134/448 (29.91%), Positives = 228/448 (50.89%), Query Frame = 0

Query: 403 GSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAY 462
           G   D+ LCT LIK +      +T R + +A VR           ME+  +  + D+FAY
Sbjct: 119 GYNPDVILCTKLIKGF------FTLRNIPKA-VR----------VMEILEKFGQPDVFAY 178

Query: 463 NMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLT 522
           N L++   K  ++D   +V   M+ K  +PD  TY+IMI      G+ + +L    ++L+
Sbjct: 179 NALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLS 238

Query: 523 KGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGR 582
             C P +I Y  +IEA      VD+A+ L   M+    +P+ FTY+ I+  +  EG + R
Sbjct: 239 DNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDR 298

Query: 583 LDE-VLGVSDKFMNKSIYAY--LVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLE 642
             E V  +  K     + +Y  L+R L   G   E  +L   M+S     +   Y  ++ 
Sbjct: 299 AFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILIT 358

Query: 643 SLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLP 702
           +LC  GK  EA++LL  + EKG++ D   Y+ +++   +  ++    +  + M  DG LP
Sbjct: 359 TLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLP 418

Query: 703 DIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIF 762
           DI  YN ++++L + GK  +A+++F +L    C P+  SYN++ + L  +GD   A  + 
Sbjct: 419 DIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMI 478

Query: 763 LEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERA 822
           LEM   G++PD +TY+++I C  +   V+ A  L   M +    P++VTYNI+L    +A
Sbjct: 479 LEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKA 538

Query: 823 GRTGETVDLYAKLKQQGLTPDSITYAIL 848
            R  + +++   +   G  P+  TY +L
Sbjct: 539 HRIEDAINVLESMVGNGCRPNETTYTVL 549

BLAST of Tan0005393 vs. NCBI nr
Match: KAG7025459.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1215.7 bits (3144), Expect = 0.0e+00
Identity = 608/675 (90.07%), Positives = 642/675 (95.11%), Query Frame = 0

Query: 201 SPFEPTSATSTAAMKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVE 260
           S FEPTSAT+TAAMK L+L+ YS  KP  +ATAA++RHFATKYTAKITSSSPTGRSVSVE
Sbjct: 20  SLFEPTSATNTAAMKVLRLYYYSLLKP--SATAASYRHFATKYTAKITSSSPTGRSVSVE 79

Query: 261 VTPPAPLPVDSRGHSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLT 320
           VTPPAPLP+D RG+SLPRRDLICRAIQILLDRK  SSSS++DDRFSDLSSYFQSLSVSLT
Sbjct: 80  VTPPAPLPIDPRGYSLPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLT 139

Query: 321 PAEASEILKSLNPDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREIL 380
           PAEASEIL+SLNPDLALQFFQLCPSLCPKFRHDVFTYSRI+LILSHSSSPKRFD VREIL
Sbjct: 140 PAEASEILRSLNPDLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREIL 199

Query: 381 SQMDRDQIRGTISTVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDS 440
           SQM+RDQIRGTISTVNILI IFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDS
Sbjct: 200 SQMERDQIRGTISTVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDS 259

Query: 441 DRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIM 500
           D AFNVYMEMR RG+KLDIFAYNMLLDALAKDE+LDR YKVFKDMKLKHCNPD YTY+IM
Sbjct: 260 DGAFNVYMEMRNRGFKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYTIM 319

Query: 501 IRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNC 560
           IRMTGK GRTEESLAFFEEML  G TPNLIVYNTMIEALS+SRMVDKAILLFSNM+KNNC
Sbjct: 320 IRMTGKRGRTEESLAFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNC 379

Query: 561 RPNEFTYSVILNVLVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFC 620
           RPNEFTYSVILNVLVAEGQ GRLDEVL +S+KF+NKSIYAYLVRTLSKLGHA+EAHRLFC
Sbjct: 380 RPNEFTYSVILNVLVAEGQCGRLDEVLEMSNKFLNKSIYAYLVRTLSKLGHANEAHRLFC 439

Query: 621 NMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLK 680
           NMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGIS DTMMYNMVLSTLGKLK
Sbjct: 440 NMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLK 499

Query: 681 QVSHLHDLYQKMKQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYN 740
           QVSHLHDLY+KMKQDGPLPD+FTYNILISS GR GKV+EAV+VFEELE+SSCKPDIISYN
Sbjct: 500 QVSHLHDLYEKMKQDGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYN 559

Query: 741 SLINCLGKNGDVDEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQ 800
           SLINCLGKNGDVDEAHM FLEM+EKGL PDVVTYSTLIECFGKTDKVEMARSLFDKM+AQ
Sbjct: 560 SLINCLGKNGDVDEAHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQ 619

Query: 801 GCCPNIVTYNILLDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRV 860
           GCCPNIVTYNILLDCLE+ GRT E VDLYA+LKQ+GLTPDSITYA+LDRLQSGS +KFRV
Sbjct: 620 GCCPNIVTYNILLDCLEKTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRV 679

Query: 861 RRQNPITGWVVSPLR 876
           RRQNPITGWVVSPLR
Sbjct: 680 RRQNPITGWVVSPLR 692

BLAST of Tan0005393 vs. NCBI nr
Match: XP_038898111.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Benincasa hispida])

HSP 1 Score: 1203.0 bits (3111), Expect = 0.0e+00
Identity = 601/665 (90.38%), Positives = 631/665 (94.89%), Query Frame = 0

Query: 212 AAMKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDS 271
           AAMK L+L CY H +P  TATAA +RHFATKYTAKITSSSPTGRSVSVEVTPPA LPVDS
Sbjct: 2   AAMKVLRLPCYYHLQP--TATAATYRHFATKYTAKITSSSPTGRSVSVEVTPPATLPVDS 61

Query: 272 RGHSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSL 331
           RG+SLPRRDLICRA+ ILL RK  SSS +IDDRFSDL+SYFQSLSVSLTPAEASEILKSL
Sbjct: 62  RGYSLPRRDLICRAVDILLHRKPHSSSITIDDRFSDLASYFQSLSVSLTPAEASEILKSL 121

Query: 332 N-PDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRG 391
           N PDLALQFFQLCPSLCPKFRHD FTYSRI+L+LSHSSS KRFD VREILSQMDRDQIRG
Sbjct: 122 NCPDLALQFFQLCPSLCPKFRHDAFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRG 181

Query: 392 TISTVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEM 451
           TISTVNILI+IFGSKEDLE+CTGLIKKWDLR NAYTYRCLLQAH+RSHDSDRAFNVYMEM
Sbjct: 182 TISTVNILIKIFGSKEDLEVCTGLIKKWDLRFNAYTYRCLLQAHLRSHDSDRAFNVYMEM 241

Query: 452 RGRGYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRT 511
           RG+GY+LDIFAYNMLLDALAK+E+LDR+Y+VFKDMKLKHCNPDEYTY+IMIRMTGK GRT
Sbjct: 242 RGKGYQLDIFAYNMLLDALAKNEQLDRSYRVFKDMKLKHCNPDEYTYTIMIRMTGKMGRT 301

Query: 512 EESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVI 571
           EESL  FEEMLTKGCTPNLI YNTMI+AL +SRMVDKAILLFSNM+KNNCRPNEFTYSVI
Sbjct: 302 EESLVLFEEMLTKGCTPNLIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVI 361

Query: 572 LNVLVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGD 631
           LNVLVAEGQLGRLDEVLGVS+KFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGD
Sbjct: 362 LNVLVAEGQLGRLDEVLGVSNKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGD 421

Query: 632 RDAYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQ 691
           RDAYISMLESLCS GKTVEAIDLL KVHE+GIS DTMMYN VLSTLGKLKQVSHLHDLY+
Sbjct: 422 RDAYISMLESLCSTGKTVEAIDLLSKVHERGISSDTMMYNTVLSTLGKLKQVSHLHDLYE 481

Query: 692 KMKQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNG 751
           KMK+DGP PDIFTYNILISSLGR GKVKEAV+VFEELE+S CKPDIISYNSLINCLGKNG
Sbjct: 482 KMKRDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELENSDCKPDIISYNSLINCLGKNG 541

Query: 752 DVDEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYN 811
           DVDEAHM FLEMQ+KGLNPDVVTYSTLIECFGKTDKVEMA SLFDKM+ QGCCPNIVTYN
Sbjct: 542 DVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMAHSLFDKMITQGCCPNIVTYN 601

Query: 812 ILLDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV 871
           ILLDCLERAGRT ETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV
Sbjct: 602 ILLDCLERAGRTAETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWV 661

Query: 872 VSPLR 876
           VSPLR
Sbjct: 662 VSPLR 664

BLAST of Tan0005393 vs. NCBI nr
Match: XP_023550137.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1200.3 bits (3104), Expect = 0.0e+00
Identity = 598/662 (90.33%), Positives = 629/662 (95.02%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+L+ YS  KP  +ATAA+HRHFATKYTAKITSSSPTGRSVSVEVTPPAPLP+D RG
Sbjct: 1   MKVLRLYYYSLLKP--SATAASHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPIDPRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNP 333
           +SLPRRDLICRAIQILLDRK  SSSS++DDRFSDLSSYFQSLSVSLTPAEASEIL+SLNP
Sbjct: 61  YSLPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRSLNP 120

Query: 334 DLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTIS 393
           DLALQFFQLCPSLCPKFRHDVFTYSRI+LILSHSSSPKRFD VREILSQM+RDQIRGTIS
Sbjct: 121 DLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIS 180

Query: 394 TVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGR 453
           TVNILI IFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFNVYMEMR R
Sbjct: 181 TVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNR 240

Query: 454 GYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEES 513
           G+KLDIFAYNMLLDALAKDE+LDR YKVFKDMKLK CNPD YTY+IMIRMTGK GRTEES
Sbjct: 241 GFKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKQCNPDVYTYTIMIRMTGKRGRTEES 300

Query: 514 LAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNV 573
           LAFFEEML  GCTPNLIVYNTMIEALS+SRMVDKAILLFSNM+KNNCRPNEFTYSV+LNV
Sbjct: 301 LAFFEEMLKNGCTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVVLNV 360

Query: 574 LVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDA 633
           LVAEGQ GRLDEVL +S+KFMNKSIYAYLVRTLSKLGH +EAHRLFCNMWSFHDRGDR+A
Sbjct: 361 LVAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHVNEAHRLFCNMWSFHDRGDREA 420

Query: 634 YISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMK 693
           YISMLESLCSAGKTVEAIDLLGKVHEKGIS DTMMYNMVLSTLGKLKQVSHLHDLY+KMK
Sbjct: 421 YISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMK 480

Query: 694 QDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVD 753
           QDGPLPD+FTYNILISS GR GKV+EAV+VFEELE+SSCKPDIISYNSLINCLGKNGDVD
Sbjct: 481 QDGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVD 540

Query: 754 EAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILL 813
           EAHM FLEMQEKGL PDVVTYSTLIECFGKTDKVEMARSLFDKM+AQGCCPNIVTYNILL
Sbjct: 541 EAHMRFLEMQEKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILL 600

Query: 814 DCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSP 873
           DCLER GRT E VDLYA+LKQ+GLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWVVSP
Sbjct: 601 DCLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSP 660

Query: 874 LR 876
           LR
Sbjct: 661 LR 660

BLAST of Tan0005393 vs. NCBI nr
Match: XP_023004732.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1198.3 bits (3099), Expect = 0.0e+00
Identity = 596/662 (90.03%), Positives = 629/662 (95.02%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+L+ YS  KP  +ATAA+HRHFATKYTAKITSSSPTGRSVSVEVT PAPLP+D RG
Sbjct: 1   MKVLRLYYYSLLKP--SATAASHRHFATKYTAKITSSSPTGRSVSVEVTSPAPLPIDPRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNP 333
           +SLPRRDLICRAIQILLDRK  SSSS++DDRF+DLSSYFQSLS+SLTPAEASEIL+SLNP
Sbjct: 61  YSLPRRDLICRAIQILLDRKRHSSSSTVDDRFTDLSSYFQSLSISLTPAEASEILRSLNP 120

Query: 334 DLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTIS 393
           DLALQFFQLCPSLCPKFRHDVFTYSRI+LILSHSSSPKRFD VREILSQM+RDQIRGTIS
Sbjct: 121 DLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIS 180

Query: 394 TVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGR 453
           TVNILI IFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSH SD AFNVYMEMR R
Sbjct: 181 TVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHHSDGAFNVYMEMRNR 240

Query: 454 GYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEES 513
           G+KLDIFAYNMLLDALAKDE+LDR YK+FKDMKLKHCNPD YTY++MIRMTGK GRTEES
Sbjct: 241 GFKLDIFAYNMLLDALAKDEQLDRAYKIFKDMKLKHCNPDVYTYTVMIRMTGKRGRTEES 300

Query: 514 LAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNV 573
           LAFFEEML  GCTPNLIVYNTMIEALS+SRMVDKAILLFSNM+KNNCRPNEFTYSVILNV
Sbjct: 301 LAFFEEMLKNGCTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNV 360

Query: 574 LVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDA 633
           LVAEGQ GRLDEVL +S+KFMNKSIYAYLVRTLSKLGHA+EAHRLFCNMWSFHDRGDRDA
Sbjct: 361 LVAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDA 420

Query: 634 YISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMK 693
           YISMLESLCSAGKTVEAIDLLGKVHEKGIS DTMMYNMVLSTLGKLKQVSHLHDLY+KMK
Sbjct: 421 YISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMK 480

Query: 694 QDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVD 753
           QDGPLPD+FTYNILISSLGR GKV+EAV+VFEELE+SSCKPDIISYNSLINC GKNGDVD
Sbjct: 481 QDGPLPDVFTYNILISSLGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCHGKNGDVD 540

Query: 754 EAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILL 813
           EAHM FLEM+EKGL PDVVTYSTLIECFGKTDKVEMARSLFDKM+AQGCCPNIVTYNILL
Sbjct: 541 EAHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILL 600

Query: 814 DCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSP 873
           DCLER GRT E VDLYA+LKQQGLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWVVSP
Sbjct: 601 DCLERTGRTAEAVDLYAELKQQGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSP 660

Query: 874 LR 876
           LR
Sbjct: 661 LR 660

BLAST of Tan0005393 vs. NCBI nr
Match: XP_022960041.1 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita moschata])

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 598/662 (90.33%), Positives = 629/662 (95.02%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+L+ YS  KP  +ATAA+HRHFATKYTAKITSSSPTGRSV VEVTPPAPLP+D RG
Sbjct: 1   MKVLRLYYYSLLKP--SATAASHRHFATKYTAKITSSSPTGRSVYVEVTPPAPLPIDPRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNP 333
           +SLPRRDLICRAIQILLDRK  SSSS++DDRFSDLSSYFQSLSVSLTPAEASEIL++LNP
Sbjct: 61  YSLPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRALNP 120

Query: 334 DLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTIS 393
           DLALQFFQLCPSLCPKFRHDVFTYSRI+LILSHSSSPKRFD VREILSQM+RDQIRGTIS
Sbjct: 121 DLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIS 180

Query: 394 TVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGR 453
           TVNILI IFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFNVYMEMR R
Sbjct: 181 TVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNR 240

Query: 454 GYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEES 513
           G+KLDIFAYNMLLDALAKDE+LDR YKVFKDMKLKHCNPD YTY+IMIRMTGK GRTEES
Sbjct: 241 GFKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYTIMIRMTGKRGRTEES 300

Query: 514 LAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNV 573
           LAFFEEML  G TPNLIVYNTMIEALS+SRMVDKAILLFSNM+KNNCRPNEFTYSVILNV
Sbjct: 301 LAFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNV 360

Query: 574 LVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDA 633
           LVAEGQ GRLDEVL +S+KFMNKSIYAYLVRTLSKLGHA+EAHRLFCNMWSFHDRGDRDA
Sbjct: 361 LVAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDA 420

Query: 634 YISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMK 693
           YISMLESLCSAGKTVEAIDLLGKVHEKGIS DTMMYNMVLSTLGKLKQVSHLHDLY+KMK
Sbjct: 421 YISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMK 480

Query: 694 QDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVD 753
           QDGPLPD+FTYNILISS GR GKV+EAV+VFEELE+SSCKPDIISYNSLINCLGKNGDVD
Sbjct: 481 QDGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVD 540

Query: 754 EAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILL 813
           EAHM FLEM+EKGL PDVVTYSTLIECFGKTDKVEMARSLFDKM+AQGCCPNIVTYNILL
Sbjct: 541 EAHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILL 600

Query: 814 DCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSP 873
           DCLER GRT E VDLYA+LKQ+GLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWVVSP
Sbjct: 601 DCLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSP 660

Query: 874 LR 876
           LR
Sbjct: 661 LR 660

BLAST of Tan0005393 vs. ExPASy TrEMBL
Match: A0A6J1KR93 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111497945 PE=3 SV=1)

HSP 1 Score: 1198.3 bits (3099), Expect = 0.0e+00
Identity = 596/662 (90.03%), Positives = 629/662 (95.02%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+L+ YS  KP  +ATAA+HRHFATKYTAKITSSSPTGRSVSVEVT PAPLP+D RG
Sbjct: 1   MKVLRLYYYSLLKP--SATAASHRHFATKYTAKITSSSPTGRSVSVEVTSPAPLPIDPRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNP 333
           +SLPRRDLICRAIQILLDRK  SSSS++DDRF+DLSSYFQSLS+SLTPAEASEIL+SLNP
Sbjct: 61  YSLPRRDLICRAIQILLDRKRHSSSSTVDDRFTDLSSYFQSLSISLTPAEASEILRSLNP 120

Query: 334 DLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTIS 393
           DLALQFFQLCPSLCPKFRHDVFTYSRI+LILSHSSSPKRFD VREILSQM+RDQIRGTIS
Sbjct: 121 DLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIS 180

Query: 394 TVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGR 453
           TVNILI IFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSH SD AFNVYMEMR R
Sbjct: 181 TVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHHSDGAFNVYMEMRNR 240

Query: 454 GYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEES 513
           G+KLDIFAYNMLLDALAKDE+LDR YK+FKDMKLKHCNPD YTY++MIRMTGK GRTEES
Sbjct: 241 GFKLDIFAYNMLLDALAKDEQLDRAYKIFKDMKLKHCNPDVYTYTVMIRMTGKRGRTEES 300

Query: 514 LAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNV 573
           LAFFEEML  GCTPNLIVYNTMIEALS+SRMVDKAILLFSNM+KNNCRPNEFTYSVILNV
Sbjct: 301 LAFFEEMLKNGCTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNV 360

Query: 574 LVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDA 633
           LVAEGQ GRLDEVL +S+KFMNKSIYAYLVRTLSKLGHA+EAHRLFCNMWSFHDRGDRDA
Sbjct: 361 LVAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDA 420

Query: 634 YISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMK 693
           YISMLESLCSAGKTVEAIDLLGKVHEKGIS DTMMYNMVLSTLGKLKQVSHLHDLY+KMK
Sbjct: 421 YISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMK 480

Query: 694 QDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVD 753
           QDGPLPD+FTYNILISSLGR GKV+EAV+VFEELE+SSCKPDIISYNSLINC GKNGDVD
Sbjct: 481 QDGPLPDVFTYNILISSLGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCHGKNGDVD 540

Query: 754 EAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILL 813
           EAHM FLEM+EKGL PDVVTYSTLIECFGKTDKVEMARSLFDKM+AQGCCPNIVTYNILL
Sbjct: 541 EAHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILL 600

Query: 814 DCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSP 873
           DCLER GRT E VDLYA+LKQQGLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWVVSP
Sbjct: 601 DCLERTGRTAEAVDLYAELKQQGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSP 660

Query: 874 LR 876
           LR
Sbjct: 661 LR 660

BLAST of Tan0005393 vs. ExPASy TrEMBL
Match: A0A6J1H6J4 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111460908 PE=3 SV=1)

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 598/662 (90.33%), Positives = 629/662 (95.02%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+L+ YS  KP  +ATAA+HRHFATKYTAKITSSSPTGRSV VEVTPPAPLP+D RG
Sbjct: 1   MKVLRLYYYSLLKP--SATAASHRHFATKYTAKITSSSPTGRSVYVEVTPPAPLPIDPRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNP 333
           +SLPRRDLICRAIQILLDRK  SSSS++DDRFSDLSSYFQSLSVSLTPAEASEIL++LNP
Sbjct: 61  YSLPRRDLICRAIQILLDRKPHSSSSTVDDRFSDLSSYFQSLSVSLTPAEASEILRALNP 120

Query: 334 DLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTIS 393
           DLALQFFQLCPSLCPKFRHDVFTYSRI+LILSHSSSPKRFD VREILSQM+RDQIRGTIS
Sbjct: 121 DLALQFFQLCPSLCPKFRHDVFTYSRILLILSHSSSPKRFDQVREILSQMERDQIRGTIS 180

Query: 394 TVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGR 453
           TVNILI IFG KEDLELC GLIKKWDLRLNAYTYRCLLQAHVRSHDSD AFNVYMEMR R
Sbjct: 181 TVNILIGIFGRKEDLELCLGLIKKWDLRLNAYTYRCLLQAHVRSHDSDGAFNVYMEMRNR 240

Query: 454 GYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEES 513
           G+KLDIFAYNMLLDALAKDE+LDR YKVFKDMKLKHCNPD YTY+IMIRMTGK GRTEES
Sbjct: 241 GFKLDIFAYNMLLDALAKDEQLDRAYKVFKDMKLKHCNPDVYTYTIMIRMTGKRGRTEES 300

Query: 514 LAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNV 573
           LAFFEEML  G TPNLIVYNTMIEALS+SRMVDKAILLFSNM+KNNCRPNEFTYSVILNV
Sbjct: 301 LAFFEEMLKNGFTPNLIVYNTMIEALSKSRMVDKAILLFSNMIKNNCRPNEFTYSVILNV 360

Query: 574 LVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDA 633
           LVAEGQ GRLDEVL +S+KFMNKSIYAYLVRTLSKLGHA+EAHRLFCNMWSFHDRGDRDA
Sbjct: 361 LVAEGQCGRLDEVLEMSNKFMNKSIYAYLVRTLSKLGHANEAHRLFCNMWSFHDRGDRDA 420

Query: 634 YISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMK 693
           YISMLESLCSAGKTVEAIDLLGKVHEKGIS DTMMYNMVLSTLGKLKQVSHLHDLY+KMK
Sbjct: 421 YISMLESLCSAGKTVEAIDLLGKVHEKGISSDTMMYNMVLSTLGKLKQVSHLHDLYEKMK 480

Query: 694 QDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVD 753
           QDGPLPD+FTYNILISS GR GKV+EAV+VFEELE+SSCKPDIISYNSLINCLGKNGDVD
Sbjct: 481 QDGPLPDVFTYNILISSFGRVGKVEEAVQVFEELENSSCKPDIISYNSLINCLGKNGDVD 540

Query: 754 EAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILL 813
           EAHM FLEM+EKGL PDVVTYSTLIECFGKTDKVEMARSLFDKM+AQGCCPNIVTYNILL
Sbjct: 541 EAHMRFLEMREKGLTPDVVTYSTLIECFGKTDKVEMARSLFDKMIAQGCCPNIVTYNILL 600

Query: 814 DCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSP 873
           DCLER GRT E VDLYA+LKQ+GLTPDSITYA+LDRLQSGS +KFRVRRQNPITGWVVSP
Sbjct: 601 DCLERTGRTAEAVDLYAELKQRGLTPDSITYAVLDRLQSGSTKKFRVRRQNPITGWVVSP 660

Query: 874 LR 876
           LR
Sbjct: 661 LR 660

BLAST of Tan0005393 vs. ExPASy TrEMBL
Match: A0A0A0K6Z6 (PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G372880 PE=3 SV=1)

HSP 1 Score: 1164.4 bits (3011), Expect = 0.0e+00
Identity = 584/669 (87.29%), Positives = 617/669 (92.23%), Query Frame = 0

Query: 208 ATSTAAMKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPL 267
           ++ +AAMK L+L CYSH KP      AAHRHFATKYTAKITSSSPTGRSV+V VTPPA L
Sbjct: 27  SSRSAAMKVLRLPCYSHLKP-----PAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATL 86

Query: 268 PVDSRGHSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEI 327
           PVDSRG++LPRRDLICR I +LL R   SS  +IDDRFSDLSSYFQSLSVSLTPAEASEI
Sbjct: 87  PVDSRGYALPRRDLICRVIDMLLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEI 146

Query: 328 LKSLN-PDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRD 387
           LKSLN PDLALQFF  C SLCPKFRHD FTYSRI+L+LSHSSS KR D VREILSQMDRD
Sbjct: 147 LKSLNSPDLALQFFHRCSSLCPKFRHDAFTYSRILLMLSHSSSSKRIDQVREILSQMDRD 206

Query: 388 QIRGTISTVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNV 447
           QIRGTISTVNILI+IF S EDLELCTGLIKKWDLRLNAYTYRCLLQAH+RS DSDRAFNV
Sbjct: 207 QIRGTISTVNILIKIFSSNEDLELCTGLIKKWDLRLNAYTYRCLLQAHIRSRDSDRAFNV 266

Query: 448 YMEMRGRGYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGK 507
           YMEM  +GY+LDIFAYNMLLDALAKDE+LDR+YKVFKDMKLKHCNPDEYTY+IMIRMTGK
Sbjct: 267 YMEMWSKGYQLDIFAYNMLLDALAKDEQLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGK 326

Query: 508 GGRTEESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFT 567
            GR EESLA FEEMLTKGCTPNLI YNTMI+ALS+S MVDKAILLF NM+KNNCRPNEFT
Sbjct: 327 MGRAEESLALFEEMLTKGCTPNLIAYNTMIQALSKSGMVDKAILLFCNMIKNNCRPNEFT 386

Query: 568 YSVILNVLVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFH 627
           YS+ILNVLVAEGQLGRLDEVL VS+KF+NKSIYAYLVRTLSKLGH+SEAHRLFCNMWSFH
Sbjct: 387 YSIILNVLVAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFH 446

Query: 628 DRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLH 687
           D GDRDAYISMLESLC  GKTVEAI+LL KVHEKGIS DTMMYN VLSTLGKLKQVSHLH
Sbjct: 447 DGGDRDAYISMLESLCRGGKTVEAIELLSKVHEKGISTDTMMYNTVLSTLGKLKQVSHLH 506

Query: 688 DLYQKMKQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCL 747
           DLY+KMKQDGP PDIFTYNILISSLGR GKVKEAV+VFEELESS CKPDIISYNSLINCL
Sbjct: 507 DLYEKMKQDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCL 566

Query: 748 GKNGDVDEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNI 807
           GKNGDVDEAHM FLEMQ+KGLNPDVVTYSTLIECFGKTDKVEMARSLFD+M+ QGCCPNI
Sbjct: 567 GKNGDVDEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQGCCPNI 626

Query: 808 VTYNILLDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPI 867
           VTYNILLDCLERAGRT ETVDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPI
Sbjct: 627 VTYNILLDCLERAGRTAETVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPI 686

Query: 868 TGWVVSPLR 876
           TGWVVSPLR
Sbjct: 687 TGWVVSPLR 690
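
The percentages in each HSP header follow directly from the fraction printed beside them (for example, 584/669 gives 87.29%). A minimal sketch, assuming the header keeps the layout shown above; the function name and the returned dictionary keys are illustrative only, and the pattern accepts the label with or without the "i" after "Pos".

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)"; the optional "i"
# tolerates the "Postives" spelling that some reports print.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 584/669 (87.29%), Positives = 617/669 (92.23%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```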

BLAST of Tan0005393 vs. ExPASy TrEMBL
Match: A0A1S3BGX8 (pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103489714 PE=3 SV=1)

HSP 1 Score: 1164.1 bits (3010), Expect = 0.0e+00
Identity = 586/663 (88.39%), Positives = 615/663 (92.76%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+  CYSH KP  TATAAAHRHFATKYTAKITSSSPTGRSV+V VTPPA L VDSRG
Sbjct: 1   MKVLRFPCYSHLKP--TATAAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDSRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN- 333
           +SLPRRDLICR I ILL R   SS  +IDDRFSDLSSYFQSLSVSLTPAEASEILKSLN 
Sbjct: 61  YSLPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNC 120

Query: 334 PDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTI 393
           PDLALQFF  CPSLC KFRHDVFTYSRI+L+LSHSSS KRFD VREILSQMDRDQIRGTI
Sbjct: 121 PDLALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTI 180

Query: 394 STVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRG 453
           STVNILI+IF S EDLELCTGLIKKWDLR NAYTYRCLLQAHVRS DSDRAF+VYMEM  
Sbjct: 181 STVNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEMWS 240

Query: 454 RGYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEE 513
           +GY+LDIFAYNMLLDALAKDEKLDR+YKVFKDMKLKHCNPDEYTY+IMIRMTGK GRTEE
Sbjct: 241 KGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEE 300

Query: 514 SLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILN 573
           SLA FEEMLTKGCTPN+I YNTMI+AL +SRMVDKAILLFSNM+KNNCRPNEFTYSVILN
Sbjct: 301 SLALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILN 360

Query: 574 VLVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRD 633
           VLVAEGQLGRLDEVL VS+KF+NKSIYAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRD
Sbjct: 361 VLVAEGQLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRD 420

Query: 634 AYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKM 693
           AYISMLESLC  GKTVEAI+LL KVHEKGIS +TMMYN VLSTLGKLKQVSHLHDLY+KM
Sbjct: 421 AYISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYEKM 480

Query: 694 KQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDV 753
           K+DGP PDIFTYNILISSLGR GKVKEAV+VFEELESS CKPDIISYNSLINCLGKNGDV
Sbjct: 481 KRDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDV 540

Query: 754 DEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNIL 813
           DEAHM FLEMQ+KGLNPDVVTYSTLIECFGKTDKVEMARSLFD+M+ Q CCPNIVTYNIL
Sbjct: 541 DEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYNIL 600

Query: 814 LDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVS 873
           LDCLERAGRT E VDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVS
Sbjct: 601 LDCLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVS 660

Query: 874 PLR 876
           PLR
Sbjct: 661 PLR 661

BLAST of Tan0005393 vs. ExPASy TrEMBL
Match: A0A5A7TJ34 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G002280 PE=3 SV=1)

HSP 1 Score: 1161.4 bits (3003), Expect = 0.0e+00
Identity = 585/663 (88.24%), Positives = 614/663 (92.61%), Query Frame = 0

Query: 214 MKALQLHCYSHFKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRG 273
           MK L+  CYSH KP  TATAAAHRHFATKYTAKITSSSPTGRSV+V VTPPA L VDSRG
Sbjct: 1   MKVLRFACYSHLKP--TATAAAHRHFATKYTAKITSSSPTGRSVAVVVTPPATLSVDSRG 60

Query: 274 HSLPRRDLICRAIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN- 333
           +SLPRRDLICR I ILL R   SS  +IDDRFSDLSSYFQSLSVSLTPAEASEILKSLN 
Sbjct: 61  YSLPRRDLICRVIDILLHRNPHSSLITIDDRFSDLSSYFQSLSVSLTPAEASEILKSLNC 120

Query: 334 PDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTI 393
           PDLALQFF  CPSLC KFRHDVFTYSRI+L+LSHSSS KRFD VREILSQMDRDQIRGTI
Sbjct: 121 PDLALQFFHRCPSLCSKFRHDVFTYSRILLMLSHSSSSKRFDQVREILSQMDRDQIRGTI 180

Query: 394 STVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRG 453
           STVNILI+IF S EDLELCTGLIKKWDLR NAYTYRCLLQAHVRS DSDRAF+VYMEM  
Sbjct: 181 STVNILIKIFSSNEDLELCTGLIKKWDLRFNAYTYRCLLQAHVRSRDSDRAFHVYMEMWS 240

Query: 454 RGYKLDIFAYNMLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEE 513
           +GY+LDIFAYNMLLDALAKDEKLDR+YKVFKDMKLKHCNPDEYTY+IMIRMTGK GRTEE
Sbjct: 241 KGYQLDIFAYNMLLDALAKDEKLDRSYKVFKDMKLKHCNPDEYTYTIMIRMTGKMGRTEE 300

Query: 514 SLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILN 573
           SLA FEEMLTKGCTPN+I YNTMI+AL +SRMVDKAILLFSNM+KNNCRPNEFTYSVILN
Sbjct: 301 SLALFEEMLTKGCTPNVIAYNTMIQALCKSRMVDKAILLFSNMIKNNCRPNEFTYSVILN 360

Query: 574 VLVAEGQLGRLDEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRD 633
           VLVAEG LGRLDEVL VS+KF+NKSIYAYLVRTLSKLGH+SEAHRLFCNMWSFHD GDRD
Sbjct: 361 VLVAEGLLGRLDEVLEVSNKFINKSIYAYLVRTLSKLGHSSEAHRLFCNMWSFHDGGDRD 420

Query: 634 AYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKM 693
           AYISMLESLC  GKTVEAI+LL KVHEKGIS +TMMYN VLSTLGKLKQVSHLHDLY+KM
Sbjct: 421 AYISMLESLCRGGKTVEAIELLSKVHEKGISTNTMMYNTVLSTLGKLKQVSHLHDLYEKM 480

Query: 694 KQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDV 753
           K+DGP PDIFTYNILISSLGR GKVKEAV+VFEELESS CKPDIISYNSLINCLGKNGDV
Sbjct: 481 KRDGPFPDIFTYNILISSLGRVGKVKEAVEVFEELESSDCKPDIISYNSLINCLGKNGDV 540

Query: 754 DEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNIL 813
           DEAHM FLEMQ+KGLNPDVVTYSTLIECFGKTDKVEMARSLFD+M+ Q CCPNIVTYNIL
Sbjct: 541 DEAHMRFLEMQDKGLNPDVVTYSTLIECFGKTDKVEMARSLFDRMITQRCCPNIVTYNIL 600

Query: 814 LDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVS 873
           LDCLERAGRT E VDLYAKL++QGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVS
Sbjct: 601 LDCLERAGRTAEAVDLYAKLREQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVS 660

Query: 874 PLR 876
           PLR
Sbjct: 661 PLR 661

BLAST of Tan0005393 vs. TAIR 10
Match: AT1G51965.1 (ABA Overly-Sensitive 5)

HSP 1 Score: 874.0 bits (2257), Expect = 1.0e-253
Identity = 427/651 (65.59%), Positives = 530/651 (81.41%), Query Frame = 0

Query: 225 FKPTATATAAAHRHFATKYTAKITSSSPTGRSVSVEVTPPAPLPVDSRGHSLPRRDLICR 284
           F    T T    RH+ATKY AK+TSSSP+GRS+S EV+ P PLP D RG+ LPRR LICR
Sbjct: 9   FNSVNTITRPNRRHYATKYVAKVTSSSPSGRSLSAEVSLPNPLPADVRGYPLPRRHLICR 68

Query: 285 AIQILLDRKHRSSSSSIDDRFSDLSSYFQSLSVSLTPAEASEILKSLN-PDLALQFFQLC 344
           A  ++      + +S++ D FSDLS Y  SLS+SLTP EASEILKSLN P LA++FF+L 
Sbjct: 69  ATNLI------TGASNLSDAFSDLSDYLSSLSLSLTPDEASEILKSLNSPLLAVEFFKLV 128

Query: 345 PSLCPKFRHDVFTYSRIILILSHSSSPKRFDHVREILSQMDRDQIRGTISTVNILIRIFG 404
           PSLCP  ++D F Y+RIILILS S+ P RFD VR IL  M +  + G ISTVNILI  FG
Sbjct: 129 PSLCPYSQNDPFLYNRIILILSRSNLPDRFDRVRSILDSMVKSNVHGNISTVNILIGFFG 188

Query: 405 SKEDLELCTGLIKKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAYN 464
           + EDL++C  L+KKWDL++N++TY+CLLQA++RS D  +AF+VY E+R  G+KLDIFAYN
Sbjct: 189 NTEDLQMCLRLVKKWDLKMNSFTYKCLLQAYLRSRDYSKAFDVYCEIRRGGHKLDIFAYN 248

Query: 465 MLLDALAKDEKLDRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLTK 524
           MLLDALAKDEK     +VF+DMK +HC  DEYTY+IMIR  G+ G+ +E++  F EM+T+
Sbjct: 249 MLLDALAKDEK---ACQVFEDMKKRHCRRDEYTYTIMIRTMGRIGKCDEAVGLFNEMITE 308

Query: 525 GCTPNLIVYNTMIEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRL 584
           G T N++ YNT+++ L++ +MVDKAI +FS MV+  CRPNE+TYS++LN+LVAEGQL RL
Sbjct: 309 GLTLNVVGYNTLMQVLAKGKMVDKAIQVFSRMVETGCRPNEYTYSLLLNLLVAEGQLVRL 368

Query: 585 DEVLGVSDKFMNKSIYAYLVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCS 644
           D V+ +S ++M + IY+YLVRTLSKLGH SEAHRLFC+MWSF  +G+RD+Y+SMLESLC 
Sbjct: 369 DGVVEISKRYMTQGIYSYLVRTLSKLGHVSEAHRLFCDMWSFPVKGERDSYMSMLESLCG 428

Query: 645 AGKTVEAIDLLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFT 704
           AGKT+EAI++L K+HEKG+  DTMMYN V S LGKLKQ+SH+HDL++KMK+DGP PDIFT
Sbjct: 429 AGKTIEAIEMLSKIHEKGVVTDTMMYNTVFSALGKLKQISHIHDLFEKMKKDGPSPDIFT 488

Query: 705 YNILISSLGRAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQ 764
           YNILI+S GR G+V EA+ +FEELE S CKPDIISYNSLINCLGKNGDVDEAH+ F EMQ
Sbjct: 489 YNILIASFGRVGEVDEAINIFEELERSDCKPDIISYNSLINCLGKNGDVDEAHVRFKEMQ 548

Query: 765 EKGLNPDVVTYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERAGRTG 824
           EKGLNPDVVTYSTL+ECFGKT++VEMA SLF++M+ +GC PNIVTYNILLDCLE+ GRT 
Sbjct: 549 EKGLNPDVVTYSTLMECFGKTERVEMAYSLFEEMLVKGCQPNIVTYNILLDCLEKNGRTA 608

Query: 825 ETVDLYAKLKQQGLTPDSITYAILDRLQSGSNRKFRVRRQNPITGWVVSPL 875
           E VDLY+K+KQQGLTPDSITY +L+RLQS S+ K R+RR+NPITGWVVSPL
Sbjct: 609 EAVDLYSKMKQQGLTPDSITYTVLERLQSVSHGKSRIRRKNPITGWVVSPL 650

BLAST of Tan0005393 vs. TAIR 10
Match: AT3G07440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48530.1); Has 37 Blast hits to 37 proteins in 13 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 35; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink).)

HSP 1 Score: 277.3 bits (708), Expect = 4.2e-74
Identity = 139/201 (69.15%), Positives = 166/201 (82.59%), Query Frame = 0

Query: 8   LRTLSSSLCSKSH----QFRSVRTATGPSKRRSKAAAFTVKRPDEKSEWWVVDGEMHEIG 67
           +R + S L S  H    Q R  RT  G  +RR+K  +  +K+ +EKSEWW+VDGEMHEIG
Sbjct: 5   VRRVGSILPSIRHGGVSQIRLARTEAGQPRRRNKLPSLPLKKKEEKSEWWIVDGEMHEIG 64

Query: 68  DNVPPRERFVIPRENLPNRRRKQLREQFMRRTRLVLKESEHEPWCKRYMELYQELRENWE 127
           D+VPPRERF IPR+N+PN+RRKQLR+QFMRRTRLVLKESEHEPWCK+YMELY ELRENWE
Sbjct: 65  DHVPPRERFTIPRDNIPNKRRKQLRDQFMRRTRLVLKESEHEPWCKKYMELYNELRENWE 124

Query: 128 RLYWDEGYSKKLAQEHANYESSE--DEDFSPYRNRQFTADRSKEQDFRRNMQGGSWEKVS 187
           RLYWDEGYSKKLA +HANYES+E  DEDF+PYRNR+  +D++KEQ F R  QG +WEKVS
Sbjct: 125 RLYWDEGYSKKLASDHANYESAEEDDEDFNPYRNRRSFSDQTKEQGFNRTTQGDNWEKVS 184

Query: 188 QIRDKFEYDRERRMRERGYSP 203
           QIRDKFEYDRERRMR++ ++P
Sbjct: 185 QIRDKFEYDRERRMRDKAFAP 205

BLAST of Tan0005393 vs. TAIR 10
Match: AT5G48530.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G07440.1); Has 32 Blast hits to 32 proteins in 11 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 243.0 bits (619), Expect = 8.7e-64
Identity = 130/195 (66.67%), Positives = 158/195 (81.03%), Query Frame = 0

Query: 1   MLSLSHRLRTLSSSLCSKSHQFRSVRTATGPSKRRSKAAAFT-VKRPDEKSEWWVVDGEM 60
           MLS++ RL + + SL + +   R +RT     +RR+K  + + +K+ +EKSEWW+VDGEM
Sbjct: 1   MLSVARRLGSATPSLQNGASLLRFMRTEASQPRRRNKFPSLSPLKKKEEKSEWWIVDGEM 60

Query: 61  HEIGDNVPPRERFVIPRENLPNRRRKQLREQFMRRTRLVLKESEHEPWCKRYMELYQELR 120
           HEIGD+VP RERF IPR+N+PN+RRKQLREQFMRRTRLVLKESEHEPWCK+YMELY E+R
Sbjct: 61  HEIGDHVPLRERFTIPRDNIPNKRRKQLREQFMRRTRLVLKESEHEPWCKKYMELYNEVR 120

Query: 121 ENWERLYWDEGYSKKLAQEHANYESSE--DEDFSPYRNRQFTADRSK-EQDFRRNMQG-G 180
           ENWERLYWDEGYSKK+A++HANYES+E  DEDF+PYRNR+   D  K EQ F R  QG  
Sbjct: 121 ENWERLYWDEGYSKKIARDHANYESAEEDDEDFNPYRNRRPYNDSIKQEQGFNRTTQGDD 180

Query: 181 SWEKVSQIRDKFEYD 191
           +WEKVSQIRDKFEYD
Sbjct: 181 NWEKVSQIRDKFEYD 195

BLAST of Tan0005393 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 224.2 bits (570), Expect = 4.2e-58
Identity = 145/551 (26.32%), Positives = 275/551 (49.91%), Query Frame = 0

Query: 315 LSVSLTPAEASEILKSLNPDLALQFFQLCPSLCPKFRHDVFTYSRIILILSHSSSPKRFD 374
           LS + TP  AS +L     D AL    +   L     H  FT  R   I  H  +  +  
Sbjct: 42  LSANFTPEAASNLLLKSQNDQAL----ILKFLNWANPHQFFTL-RCKCITLHILTKFKLY 101

Query: 375 HVREILSQMDRDQIRGTISTVNILIRIFGSKEDLELCTGLIKKWDLRLNAYTYRCLLQAH 434
              +IL++   D    T+      +     +E  +LC      +DL + +Y+   L+   
Sbjct: 102 KTAQILAE---DVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLI--- 161

Query: 435 VRSHDSDRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEK-LDRTYKVFKDMKLKHCNPD 494
                 D+A ++    +  G+   + +YN +LDA  + ++ +     VFK+M     +P+
Sbjct: 162 ------DKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPN 221

Query: 495 EYTYSIMIRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTMIEALSRSRMVDKAILLFS 554
            +TY+I+IR     G  + +L  F++M TKGC PN++ YNT+I+   + R +D    L  
Sbjct: 222 VFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLR 281

Query: 555 NMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGVSDK---FMNKSIYAYLVRTLSKLG 614
           +M      PN  +Y+V++N L  EG++  +  VL   ++    +++  Y  L++   K G
Sbjct: 282 SMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEG 341

Query: 615 HASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAIDLLGKVHEKGISCDTMMYN 674
           +  +A  +   M           Y S++ S+C AG    A++ L ++  +G+  +   Y 
Sbjct: 342 NFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYT 401

Query: 675 MVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFTYNILISSLGRAGKVKEAVKVFEELESS 734
            ++    +   ++  + + ++M  +G  P + TYN LI+     GK+++A+ V E+++  
Sbjct: 402 TLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEK 461

Query: 735 SCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQEKGLNPDVVTYSTLIECFGKTDKVEMA 794
              PD++SY+++++   ++ DVDEA  +  EM EKG+ PD +TYS+LI+ F +  + + A
Sbjct: 462 GLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEA 521

Query: 795 RSLFDKMMAQGCCPNIVTYNILLDCLERAGRTGETVDLYAKLKQQGLTPDSITYAILDRL 854
             L+++M+  G  P+  TY  L++     G   + + L+ ++ ++G+ PD +TY++   L
Sbjct: 522 CDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSV---L 572

Query: 855 QSGSNRKFRVR 862
            +G N++ R R
Sbjct: 582 INGLNKQSRTR 572

BLAST of Tan0005393 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3 )

HSP 1 Score: 223.8 bits (569), Expect = 5.4e-58
Identity = 152/552 (27.54%), Positives = 267/552 (48.37%), Query Frame = 0

Query: 302 DDRFSDLSSYFQSLSVSLTPAEASEILKSLNPDLALQFFQLCPSLCPK--FRHDVFTYSR 361
           D  FS   S   +L++  T    + +L++L  D  L+       L  K   + D  TY  
Sbjct: 99  DSSFSYFKSVAGNLNLVHTTETCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLT 158

Query: 362 IILILS----HSSSPKRFDHVREILSQMDRDQIRGTISTVNILIRIFGSKEDLELCTGLI 421
           I   LS       +P     +RE    ++     G I   ++L++     E +E+   +I
Sbjct: 159 IFKSLSVKGGLKQAPYALRKMREFGFVLNAYSYNGLI---HLLLKSRFCTEAMEVYRRMI 218

Query: 422 KKWDLRLNAYTYRCLLQAHVRSHDSDRAFNVYMEMRGRGYKLDIFAYNMLLDALAKDEKL 481
            +   R +  TY  L+    +  D D    +  EM   G K +++ + + +  L +  K+
Sbjct: 219 LE-GFRPSLQTYSSLMVGLGKRRDIDSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKI 278

Query: 482 DRTYKVFKDMKLKHCNPDEYTYSIMIRMTGKGGRTEESLAFFEEMLTKGCTPNLIVYNTM 541
           +  Y++ K M  + C PD  TY+++I       + + +   FE+M T    P+ + Y T+
Sbjct: 279 NEAYEILKRMDDEGCGPDVVTYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITL 338

Query: 542 IEALSRSRMVDKAILLFSNMVKNNCRPNEFTYSVILNVLVAEGQLGRLDEVLGV-SDKFM 601
           ++  S +R +D     +S M K+   P+  T++++++ L   G  G   + L V  D+ +
Sbjct: 339 LDRFSDNRDLDSVKQFWSEMEKDGHVPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGI 398

Query: 602 NKSIYAY--LVRTLSKLGHASEAHRLFCNMWSFHDRGDRDAYISMLESLCSAGKTVEAID 661
             +++ Y  L+  L ++    +A  LF NM S   +     YI  ++    +G +V A++
Sbjct: 399 LPNLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFIDYYGKSGDSVSALE 458

Query: 662 LLGKVHEKGISCDTMMYNMVLSTLGKLKQVSHLHDLYQKMKQDGPLPDIFTYNILISSLG 721
              K+  KGI+ + +  N  L +L K  +      ++  +K  G +PD  TYN+++    
Sbjct: 459 TFEKMKTKGIAPNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYS 518

Query: 722 RAGKVKEAVKVFEELESSSCKPDIISYNSLINCLGKNGDVDEAHMIFLEMQEKGLNPDVV 781
           + G++ EA+K+  E+  + C+PD+I  NSLIN L K   VDEA  +F+ M+E  L P VV
Sbjct: 519 KVGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVV 578

Query: 782 TYSTLIECFGKTDKVEMARSLFDKMMAQGCCPNIVTYNILLDCLERAGRTGETVDLYAKL 841
           TY+TL+   GK  K++ A  LF+ M+ +GC PN +T+N L DCL +       + +  K+
Sbjct: 579 TYNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKM 638

Query: 842 KQQGLTPDSITY 845
              G  PD  TY
Sbjct: 639 MDMGCVPDVFTY 646

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9ZU27 | 1.4e-252 | 65.59 | Pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Arabidop...
Q9FIX3 | 5.9e-57 | 26.32 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9SZ52 | 7.7e-57 | 27.54 | Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop...
Q9M907 | 5.0e-56 | 27.97 | Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX...
Q9SR00 | 3.9e-53 | 29.91 | Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...

Match Name | E-value | Identity (%) | Description
KAG7025459.1 | 0.0e+00 | 90.07 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_038898111.1 | 0.0e+00 | 90.38 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Benincasa ...
XP_023550137.1 | 0.0e+00 | 90.33 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita ...
XP_023004732.1 | 0.0e+00 | 90.03 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita ...
XP_022960041.1 | 0.0e+00 | 90.33 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial [Cucurbita ...

Match Name | E-value | Identity (%) | Description
A0A6J1KR93 | 0.0e+00 | 90.03 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbit...
A0A6J1H6J4 | 0.0e+00 | 90.33 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucurbit...
A0A0A0K6Z6 | 0.0e+00 | 87.29 | PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G372880 PE...
A0A1S3BGX8 | 0.0e+00 | 88.39 | pentatricopeptide repeat-containing protein At1g51965, mitochondrial OS=Cucumis ...
A0A5A7TJ34 | 0.0e+00 | 88.24 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity (%) | Description
AT1G51965.1 | 1.0e-253 | 65.59 | ABA Overly-Sensitive 5
AT3G07440.1 | 4.2e-74 | 69.15 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G48530.1 | 8.7e-64 | 66.67 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G39710.1 | 4.2e-58 | 26.32 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G31850.1 | 5.4e-58 | 27.54 | proton gradient regulation 3
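
When working with hit summaries like these programmatically, the E-values can be read as ordinary floating-point numbers and used to filter and rank hits. The sketch below uses the five TAIR 10 hits listed above; the cutoff of 1e-60 is an arbitrary illustrative threshold, not a value taken from this record, and `rank_hits` is a hypothetical helper name.

```python
# (match name, E-value as printed, percent identity) for the TAIR 10 hits above.
TAIR_HITS = [
    ("AT1G51965.1", "1.0e-253", 65.59),
    ("AT3G07440.1", "4.2e-74", 69.15),
    ("AT5G48530.1", "8.7e-64", 66.67),
    ("AT5G39710.1", "4.2e-58", 26.32),
    ("AT4G31850.1", "5.4e-58", 27.54),
]


def rank_hits(hits, max_evalue=1e-60):
    """Keep hits at or below max_evalue and sort them, most significant first."""
    kept = [
        (name, float(evalue), identity)
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue
    ]
    return sorted(kept, key=lambda hit: hit[1])


ranked = rank_hits(TAIR_HITS)
for name, evalue, identity in ranked:
    print(name, evalue, identity)
```

Note that an E-value printed as `0.0e+00` parses to exactly zero, so ties among such hits would need a secondary key such as bit score or percent identity.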
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
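IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 475..597; e-value: 6.8E-33; score: 116.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 630..729; e-value: 3.2E-22; score: 81.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 730..798; e-value: 5.5E-23; score: 83.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 799..850; e-value: 2.2E-10; score: 42.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 416..474; e-value: 2.0E-11; score: 45.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 284..415; e-value: 9.5E-9; score: 36.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 734..781; e-value: 5.7E-16; score: 58.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 804..847; e-value: 1.2E-9; score: 38.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 495..528; e-value: 8.4E-8; score: 29.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 668..700; e-value: 1.8E-4; score: 19.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 772..806; e-value: 1.3E-10; score: 38.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 530..564; e-value: 3.4E-9; score: 34.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 807..841; e-value: 4.1E-6; score: 24.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 633..666; e-value: 1.5E-4; score: 19.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 702..736; e-value: 1.4E-10; score: 38.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 425..458; e-value: 2.1E-4; score: 19.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 737..771; e-value: 1.4E-11; score: 41.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 461..494; e-value: 6.7E-6; score: 23.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 696..727; e-value: 3.0E-11; score: 42.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 633..662; e-value: 0.0097; score: 16.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 805..839; score: 11.443655
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 528..562; score: 12.134216
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 458..492; score: 10.972319
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 770..804; score: 12.594591
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 700..734; score: 13.537263
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 493..527; score: 12.495939
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 630..664; score: 9.251395
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 735..769; score: 13.822257
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 665..699; score: 9.437737
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 423..457; score: 10.205028
IPR033443 | Pentacotripeptide-repeat region of PRORP | PFAM | PF17177 | PPR_long | coord: 427..587; e-value: 1.2E-17; score: 63.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 142..175
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 252..272
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 142..172
None | No IPR available | PANTHER | PTHR46128:SF166 | BNAA05G15090D PROTEIN | coord: 226..869
None | No IPR available | PANTHER | PTHR46128 | MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1 | coord: 226..869
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 646..801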
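
The ten PROSITE PS51375 (PPR) matches above are listed out of order, which hides how they are arranged along the protein. A minimal sketch, using only the coordinates printed in the table: it sorts the intervals, checks their lengths, and groups back-to-back intervals into tandem runs. The function name `tandem_runs` is illustrative and not part of InterProScan.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table above.
PPR_COORDS = [
    (805, 839), (528, 562), (458, 492), (770, 804), (700, 734),
    (493, 527), (630, 664), (735, 769), (665, 699), (423, 457),
]


def tandem_runs(coords):
    """Group directly adjacent intervals into runs of (start, end, repeat_count)."""
    runs = []
    for start, end in sorted(coords):
        if runs and start == runs[-1][1] + 1:
            # This interval begins right after the previous one: extend the run.
            first, _, count = runs[-1]
            runs[-1] = (first, end, count + 1)
        else:
            runs.append((start, end, 1))
    return runs


lengths = {end - start + 1 for start, end in PPR_COORDS}
runs = tandem_runs(PPR_COORDS)
print(lengths)
print(runs)
```

On these coordinates every motif spans 35 residues, and the motifs fall into two uninterrupted blocks separated by a stretch (residues 563 to 629) without a PS51375 match.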

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0005393.1 | Tan0005393.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding