Tan0020092 (gene) Snake gourd v1

Overview
NameTan0020092
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG09: 51325172 .. 51335847 (+)
RNA-Seq ExpressionTan0020092
SyntenyTan0020092
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAGAGAAACCCTAATTTCATCTTCCTCTCACTCTCACGTGCCCTGCCTTCTTCTTCTTCACACCGGCTCCTCCCCACTTTCGGCAGCTCAGCGTTGCCGTCGCCTCCGCCGATTATCTCCCGCCATCACCGCTTTCTTCTTTCCGTTCGTGCCGCCGCAGCCTCCTTCGATTATCTACTTTCTTGTTGTTACTGGTACTCTCCCAATCTCTATCTATGTCTTTCTTTTCACTTGAAAACACAACAGGGCTTGTCAAATAAAAGAATGGAGGAGTTGTCTGATTTATTTATTTCTTTTAATAACAATAGCCGAATAGCAACGAAGGATTTGTATATCTCTTTGGCCTGGGCTGAAAATTGCCTTGTTTCGGCTGCCATGAATTTTCCAGCATTTCCCTTCTTCTTATTTATCAGTCAAAGTAATCTATTGTCTTAACCAATTCTGTTCTAATTCATAGTTTATAGTTGCGGAGAAGACAAGGATTTTTTTTTTTTTTTGAAAATTGAAATTCTCTCCCTTAGATTAACCAATACAATAACCAATGATACTTCAAATTTATGGAAATAAGTGTGGTCTATATGCTTTGTAAAATAGATGAAGATTAAAGTTTGAGAACTAAACTTACAATTTAACCCAATTAAATTCAAACTTTTTTCACACCAAATTAGCACAAAGTTGCGTTCTAAGTATGTCTCTTCACTTTGGCAGGGGGGACCGAGTTTTGCAACCTCCGCCGGCCTTAGAAATCCTGCAGAACGCCTTCACTCAGTACATCAACTTCAAACATAGCAATTCATCTTCCTGCTGCACCATTGGCAAGTATATCTCTCTCTATTCTCAGTTTTCTTTGTGCTTTTGCTCGATTCCAGCACGTTTCTGTTCAAACCCTTTTTGCCCCTCTCTGCCGCTATCAAACTGAGTGCATATTCTTTCACATTTTATTTGGTTTTCGAAACTAGACTTTGTTGACATGTGGACATGTGGACATGTTTCTTCTTGTTTACAATCTGTTTTTTTAGGCTCTTTTTTTATTTGATTCTAGCTCTCAGGATACCCAATTGGATGCATAGTTTTCTATCCAGTATTAGCGAACTCTTCATTGATAAATGGAAAAACAACAAAAAAATTTTAAGGATACAAACTCCCTTCGAAGTAGCAAGCTAACAGCCTAGAAAGTAGAAACAAATGTTCCCAATTCTTACAAATATCATAAGGGTAACTTGCAAATAAAACAAGAAGAGAACACCAATAAGAATATTTAATATTGACTAGATCAAACCGTTCTAAGCTAAATGCACATCTTAAAAAATTGTCCTATTTTTGGACTTTTTGGTTTCAGAGATTTGAGATATTGCCCACAATTTTGTTCTCAGTTCGTGAAGTGGTATTCTTATTATCTGAGGGCTTTTTTTACTTCATATTGATATTTGTAATCTGTTAGTTGGAAGTTAGAATTATGTTACCGTTCCTGTTGTTTTTGTAATGCTTTATGTTCTTTGTCCTAATGTTTTTGTAATGCTTTTTTTTTTTTTCATTTTTAACATTCAGGTTGTCTTTGGCTTGCCTATGTTGGTATAGCATTCGATCAAGCTATATTCTCTTTTTGGCTTTTCTCCTCCACAGTTTCTCATACATTATCCTGGGTTGTCTTTAACAATTGAACAATTAATGTCCAAGAAGGGAGCTGCAATTCTTGCTAGACGCTCCTTGATAGCACTGTGTGATGCACAATCTTCATGTAGCTCCTCAGTCTCAAACAAAGCCCTAAATCTAGTGAGAAATTTAAGCATTGCCTCTGAAAGCGAGGAATATCAGAATGATAATGGATATCATGCAGATAATTCTTCGCAGAGTTACCAAAGCCTTGGGTACTACCAGCATCATGCCGAAAATGCTAGTTTGCAGAGTTCTTCTAGGCCTTATGATGGGTTTTATACAGAAAACTCCATGCAGGGCCAGTACAGACCATCCACTAGCTCAGTTGATGGGCAGACACCTAGGAGTAGTTTTGCAAATACTTCTTTTATGCATGAAAGTGCTAGTGGATCATATGGACAAAACTATTGTGGTTTGCCTCCCAATTCTTATGGCTTTAACCAGAACCATGATGAGGCTTATAGAGACACCTATCAAAATACTCATCATGCTAACCTAGTGTCTCAGAATGGTAATTTTATTCAAAGTGGCCATAATAAAGGAATGATGGCTCAAGATCTTAATAGTTACAATGCAAATAGTCCAAGAAATTTTGTACACATAAGTAATAATGAGGTTCGTGGAGTTGATAGATCTGTGTCTCAAAATATTCTATTTGAGCATAGAGAAAATTTAACTGCATACAATGGATCTAACAATGGAGAACTACAGCAGGACAATTATGAGGTTTATGGACAGAATTCATGGGGCACACGAACAGAATCCAGACAATATGAACATGACACTGGCCTAAATTACCACAATCCTATGTCAGGACCAAACAATCATATTCCCATCTCAAGACAGTATGAGCAAAACTCTATTTCTCAGCAATATCCACAGGGACAATATCAACAAGGTGCCAGTCTTGAACAATATCAGCCGATCCCAGATACTACTCAGAGTAGCATGATTGGTACTCAAGTATTAAACAACGCCAATGCTGAGGAGGAAACTAGAGTGACTAAGGATCGCCCATATGGTGGTTGTACCCTTGAGGAGCTCGATGAATTCTGCAAGGAAGGTAAATTAAAGGAGGCTGTCGAAATTTTGGAAGTGTTAGAGAAACAACACATTCCTGTTGATTTAACCCGGTATTTAGAGTTGATGAATGCATGTGGGGAAGCAAGGTCCCTAGAAGAAGCCAAAATTGTTTGTAATTACGTAATAAAATCTCAAGCCCCTCTAAAAGTCAGCACCTATAACAAAATTCTGGAGATGTACTCTAAATGTGGTTCCATGGATGATGCCTATACGATATTCAATAAAATGCCTAGCCGCAACCTAACATCTTGGGATACTATGATTACGTGGCTTGCTAAGAACGGCCTTGGGGAAGATGCTGTTGATCTTTTTTATGAGTTCAAAAAAGCTGGGTTGAGACCTGATGGAAAAATTTTTATTGGAGTTTTCTCCGCGTGTAGTGTCTTGGGAGATGTTGATGAGGGGATGCTTCACTTCGAATCAATGACAAAGAATTATGGCATTACTCCTTCCATGCATCATTATGTTAGTATAGTAGACATGCTTGGAAGTACAGGATATGTTGATGAAGCTTTGGAGTTCATTGAAATGATGCCATTGGAGCCAGGGGTAGACATTTGGGAAACGGTGATGAATATCTCTAGAGCTCATGGGTTGATGGAGCTTGGAGATCGTTGTTTTGAGCTTGTTGAGCAGCTTGATCCCTCTCGCCTAAATGAGCAGTCAAAAGCTGGCCTTCTACCTGTAACAGCTTCAGACCTTGCAAAAGAGAGAGAGAAGAAAAAATTAGCTAACCGGAATCTTTTAGAAGTCAGGAGCCGAGTACATGAATATCGAGCTGGAGATACGTCTCATCCTGAAAATGACAAGATCTATACCTTGCTTAGGGGTTTGAGGGAACAGATGAAGGAGGCTGGTTATATTCCAGAGACTCGATTTGTACTCCATGATATAGATCAGGAAGGCAAAAATGACGCTCTACTTGGTCATAGTGAGAGACTTGCTGTTGCATATGGTCTAATCAGTAGCTCAGCCCGCTCACCCATAAGAATAATTAAGAACCTTCGTGTTTGTGGTGATTGCCATAGTGCACTAAAGATAATTTCAAAAATTGTAGGTCGAGAGCTCATCATTCGTGATGCTAAGAGATTTCACCATTTTAAAGATGGATTATGCTCCTGCCGTGATTACTGGTGAACAAAATGAAACTTGTGGTATGTCTAAATTTATCTCTCCACTTGTTTTATTTGAATATTTAAGCTTTTGTCAGGTTTGGACATGTTCATAAATTGCTTTAATTTATATGCCTCGTTCCCATTAAAAATTGAATTACAAATGGTATTCCTGTTGAATTGTATGCTATATATATTTTTTTTTTGTTGGTATAAGAAATAATTTCATTGATGTATGAAATAAAGGGAGAAAACTCCGAGTACCTATAGGTGAGTTATAATAAGACTCTCCAATTGGAGATTAAAAAAGATAAGTTATAAAGACTAAAAGGTTGAAAAGATTTACACCACAAGAAGGCATTAGAAAATACAGTTTCCATAAAGCAATCAAAAGTATGAGCAACATCCCGAAAAATTTGATCATTCCTCTCCAATCAAAGGTTCCAAAGAAAAGCACGAATAAAAGCAAGCCAAAGAATCTTCTTGACGACCTTGTAAAGATGACCCACCAAAACAGAGGAGAGGATATCGAGAATATTGTTGGAAAGAGGCCGTGACCAACCAAAAGCTTTTAAAATGCACATCAAAAAACGTGAAGCAAAACAACAATGCACAAATAAATGATCTGCAGATTCATAATTCACGCCACAAATAATGCACCAAGACGGAGAAAGAGCCATATATGACATCCTTCTTTGGAGTCGATCATTCGTATTAATTGCTCTCAACCTAAGTTCCCAAAGGAAGATTTTAATCTTCTTTGGACATGCATCCTTCCAAATAATCGAATATAAATCTTTTAGAAGAAGATCATCAATACCAACCAAGTTTGTTGTAAGCAATTTGATAGAGAACTTTTGAGTCGCTTCTAAGGGCCTTGACCAAGTATCAGGTGTGTTCAACAAAGTGATTGATGATAAAAGATGTGAAAGATTAGCCTACTCAATCACTTCCAAATCGTTCAAATTTCAACGAAGTCTCAAATTCCAAGCTAAATTAGAAACATTTCAAACTTTTTCCACAATAAGATCAGGGCTCGAAGTAAGGCGATAAAGTGGAGGAAAAGCTGTAGCAATAGGACCACAATTTAACCATGAATCTATCCAAAATAGAGTAGAACAACCATCACCAACAATACGACAAGAATGAGTAGAGACCAAATCTCGAACATTACATATATATCGCCAAGGAGATTTATGAGGAGTCAAAATAGGTTGAGGCCATCCATATCCAAGCATATTACCATAATATTTTGCAGCAATAAAAATCATTTTGTTCATGAACAAAACACCAAACCCACTTAGCCATTAAAGTCGCATTTCGATGTTGAAAATTACCTATACCAAGGCCGCCCATTATTTAGGAAGTTGTATGGTGTTGCAATTCAACTCCACCATCACCTTGAGCTCCTTTCCAAAAGAAATCACGAATAATCTTGTCAAACGTCTTGATCACTCTGGATGGAACTTTGAACAATGATAAAATGTATGTTGGCAAACTAGAAAGTGTGGCTTGTATAAGAGGCTTTGGAAATGAAAACATATTTCCAATTATGGAGTTTCTGTTGCATTCACTCTACAACAGGTTGCCAGAAAGATACAGACTTAGAGTTTCCCCCATGGTAAACCAAGGTAAGTAGATGACCAAATACCTTTTATACAACAAAAAGTTGTCAACATCCACTCCATATCAGAATCATCAATGTGTATTCCCAACAGCTCGCTCTTACTTAGATTAATCTTCAAACCAGAGGCTCTTTCAAAAATTTTAACCACCTCAAAAAGATGTATCATTGCAGTTTGATCAAAAGTAGAGAAAAGAAGAGTGTCATCCGCAGATTGAAAGTGATGAGGGCAAAGAGATGAATAATCAATAGGATGTGTCATAGCCAACAAGAGCATAGCTCAACTGGCATTAAATAAGACATATGACCAAGAGATCATGAGTTCGAATCTCCCACCCCCAATTGTTGATGTGCTCAAAAAATAATCAATAAGATGTGTCATAATATTACCCAAGCTTGTGCTATGATTCAAAATATGGCTAAGGCGATTGGAGACTGGAATAAAAAAGAAAGGGGATAGAGGATCACCTTATCGAATGCCACGAGATGGAATAATTTTTCCATGAGGCTGGCTATTGATAATAATAGAGTAGTTTGCACTAGAAATACAGCCTCTGATCCATTTCCTCCATAAAAGTCCAAAACCTTTTGCGTGCAGAATTGCCTCTAAATGTCCCAATCAACTAGTGACAGCCACATTTTATTTGATTACATCACGATAACAGCATATAAATTCAGATTCAATGTCGCCAGCTGTTACAACAAAGAAAACAAAGGGGCAACCTTTTACATGCAGAATTGCCCCTTTTAGGTTGCCCCTTTTATTCTTGGCCTTTGTTTTCATTGTTGTATTTAAGTTCGTTGTCTTTGTTCTCATTCCATCGTCGCATTTGCATCTTGTTCAAAGTTTGGCTAGACATTGAATAATATCTTGTTCATTGGAGTCTAAATTTTTTATTAGAAGTTAGCAAATCCATATTTAAAGTTTGGCTAGACATTGAATAACCTCTTGCTTGTTGGGCTCTAATTTTGAATAGAAGTAGCACAGACGGTTATCTGTCTGTCATCACCAAAAGAAAGTGGAAAAAAATCCTTGATCACCTCCTAGTTGAAACCTGAATGTGTGCTTGATCATAGAATCAAGTAATGACCGGCATTCTCAGTGCAAAATTCAAAATGAAAGCTGGTTGTCATCCACTCATTTTCAAATTGCCGAGGTGAATAATGCTTTTTTTTTCCTGCCTCCTCGAGGTACTGATTTTGACAATGATGTGTTGTCTAAATTGACATACCAAAAAAATAAAATCAAGTATAGTGATATTCTGATTTAGGTTAGGTATTTTGTATAGGATCCTGGATTTTGAAAAAGTAACAGCTTACTATTGTAAGTTAATAAATTGTCGATATTCCTTAATGGTGGATTTGTGCTTCTAGAAGGCTTTATCAAGGTGAACTATGATCTCCATTTCTATTCTTGCTAGTTGGAGAAGTTCTAACTGGCCACATTGATAAAGATGTGTCTAATGGCAGTTTTGAAGACAATTTTGTTGGCAAAGATGGGATTTCTTTTTTTTAAAAAAAGAATCGAAATCTTTTATTGATCAAATGAAAAATCAGGGTATCAAACAACTTACACAGAAGTACCAACAAAGACCATAAAAAAATCTTACAACACTGACGAAACAAACTATACTAACTCTTGCTCGAGTAGAAAAGCTTGACAAAAACTAGGACAGCACACTACGGCGTCAACAACTGAAAGTTTCAAACTCTTAAACCACCAACAGTTTTGTTTTTGGTAAGAAACGAAAAAAATTGAAAGAAAAACAAGGGCATACAAAAATGAAAAAACAAGCCCAGAAAAAAAGGGAGTTCAAGAACTAACTACAAAAACGAACTCCAATGCAACAAAATCAAATCAAGATCATAATTACAAAATGGCCTAGTGACTAAAACCCCACCTCTCAAACCTCATTACAAGACCTCTCAACCTCACTGAAAATCCTATTATTTCGCTCCAGCCAAATATCCCACAAATGACAAAGAAATTAAGGTGCCATAAGACTTTGCCTTTGCCACAAGAAGGCGAATTCAGAAGCACCTCCCCCATCATAGCAAGACAGTCTCTATTTCAAGCCAAACATAAACCAAACGAATTCAACCACCGAGCTCCCACATTAGTCAAAACAAACTACCGAGAGCAAAAGCTTGGCAGACAAATCTTTCCTCCCAAAACAACGGATCAGGCACTGCAACTTGAGCACCGTCTATACCTTTGAAACCAAGTAACCAACAACAAATAGACAAAACCAAACATGAAGAATCTCTCCAGAACTTTGCCAACAACGAAAACTCTAAAGGAAAGACAGAATACCTTGAAATGAGCCGTTGCCTCCAGAAACTTCGAAACAATTTCGTAAAAAACCAAAACTTCACCATAGAATGCAGCAGCATTTCAACCCACAATTTCGTACAAAACCTTATGTGCATAAACTTCAACCCACAAAAAGCAGCAACATTTCAGAACGTGTAAGATTTAATAACTAAAGGCACGTTTGGGAGTGATTTTAGGCTTAAAATCACTTTTTTCATATTCAAATTCAAAATCACTCCCTCCAAAATCATTTTAATGCTTGGTTTTACACTTTTAAATGCGATTTACATACCATCAAAATTGATTTTGAATGGTTAGATAGATGTTTTGGAGTGATTTTCATTATAACAAAAGTGATTTTTAACCATTTAAAATCACTCCCAAGTGTGCCCTAAACCAAAGATATTTTTCAAGAAACTAAATTCATCTCATTGAATTATTCCACAAATTTAGAAGATACTTCAAATTCTCTCTTTTTTTTCTTTTTTTATCTCCCTCTCCGATCTCCCCCTAGTAAGAAAAACTGAGCTTTCATTGAGAAAAATGAAAGAATATATAGGGGATACAAAAATCAAGTCCATGAAGAAGAAGTCCACCTGCTAACTACAAAAAATAACTCCAATCCAAAAGTATCAAACCAAGTTCCCTCTTTCTCTCTTTTACTACAACCCTTCTCCTCTCTATTTTCTCTCACTCACTTCTTTGTTTTTTCCCGCTCTCTCTCTTTCTTCTACTTATCCCTATCTTCTTCCTACTTTTTTTTCTAGGCTTCTTCTCTTCGTTCTTTTCTCTTTCTTCTACTTTTCCCTATCTTCTTCCTACTTTTTCCCCCCACCACCTATATATGTAAAATGAAATGAAAAAAAAAATGTACAAATACATCAAACAATTCACATAATATAAACTTTCTCGTCTTTCAAAAATTTTCATCTTACCTAATCATTTTTTCTCCCCATCTTTTGCACCTAGTACTTCTTCCTGCAACCCTTTTAACCGTCCTCTTTTCCCCTCCTTTTCTTCTTCTACTTCCCCATTCTCTTTTTCATTTTTCCTTCTTTTTCTTTTTTTTTTTCCTTTTATTTCTTTTATTTCTTTGTTCTTTCTTTCCTTACTCCCAGTACACTTACAATTGCTCCTCCAGCTTATCAAGTTCCCTTTCCATAGAAGGAGTTCTAGTTATAAAGAGATTATAACTACAATCTTTGCTATGTAACCCATTCCTGTTGATTTGAAATTATCAAAGTTTTTTAACATGGCTCAAAAAGTGCGTGAGTTCAGAACCTTTACCTTTGGGGGAAATTTTGAAAAAAAAGGAGTTGATAGATGGAACGAGTTTTGCTTGTCTAGGGGAGGGATGATGATACTTTGTCAATGTCGTTCTTACAAATTTCCTAATGCTGTCTTGTCTTTATTCTTAATACTGAATGGCTGAATGAATCTTTACTGTTACAATATTCTGTCATTTGTGAACAATACCAATCGAGGGCTCTCTTCATTAGAGAATGTCAGGGATGATCTTGTAGTTGCTACCTAAATAGAAAATGAGTGAGAGAGAGAGAGAAAAGGGAGAGGAGGGGGAGGGGGTGTGGAAATGGATGGCATTGCCAATCGGGTAAGATAAAAATGAAATTTGATAGCTTAAGCTTCGTTTGATAACCATTTTGTTTTTAGATTTTAACAATTAAGCCTATAAACATTACTTCCACATGGGTTTTCTTGTTTTGTTATCTACTTTTTGCATGTGTTAAAAAACGAAGTCAAGTTTTGAAAATTTGTTTTTATTTTTGGAATTTGGCTATGAATTCAAATGTTTTTTTTTTTTAAAAAAAAATATGAATCCCACGGTAGAGAAATGATGAGAAAAAAAATACAATTTTCAAAAATCAATAATTAAAAACCAAATGGTTATCAAATGAGGCCTTAAAGATTTCTTTGATGTAAGATTTAGGGGAAGTTGAGTAAATATTGAGGAAAGAAGTAGAGTTGTGACTTCAATTTTTATCCAATTGGTCTCAGCTATTGGAACAATCGTTTAACAATAAAGGTGCTGTGATGGAAATTTTCCTAATTGGTTCACATGGAGTAAACGCGAGTAACTTTATACTCAAGCTGAGGTGGAACTGAAGCATGAAGAGGATGAAACTTGAGTGGTAGAAAAGAGAACTTTGGGTTCGGTAGGAGGCATTGTGTTGTTAATAAGCTTTGTAATCATCAAATTGATCATACTTGACCAAAGTGGTCTCTAAGCTTTCTAATCATCTCATTGATCCTTGTTTGGACTACGTAGAGGCATTGTGATGACAGATCTGACATTTCTTACCTTCCTCTGGTATTGTAGAAGCACTGTGACAGATAAATGTACAACTCTCGTCGCATATGTGGCTTCTGCAGCTCCATTACTAGGTTCGAAAGGGCGTCGAGGTTGCACACCGGGAGAACCAGTGAAGTGAAATCAACAGAAAGATCAGAATCATTGCTGCCAGGCAAAGTCACCAAAGGCCTCACTCGGACGAGACAGAATGTTTAAGAGTATATGGAACAAATCACTGATGGCTTATGCATCGATCAGATCAAAGCTAATTTATTCTTCCTGAATTGCAGCTATTATTTCTTGACATCTTTTAATTATTTAGTAATGTAATAACCATTAACATCTCCAATTTTGATGTAGAAGTTAGCAAGAGAAATAGTGATACAAAATTGAACCAAGCCAGTAGTCCAAACTAAACAAAGTCGAACCGAAAGATTTGTATTTTCTTTGTTCAAAGATTTATTGATTTAATTCTATTTAGTTCTGTTTTTTATTTGGATTGATTCTATCAAATTTTTACAGTCAAACCAATGTGTAAATGTACATGTGTTAGTGAACAAATACTGCATAGGTC

mRNA sequence

GAAAGAGAAACCCTAATTTCATCTTCCTCTCACTCTCACGTGCCCTGCCTTCTTCTTCTTCACACCGGCTCCTCCCCACTTTCGGCAGCTCAGCGTTGCCGTCGCCTCCGCCGATTATCTCCCGCCATCACCGCTTTCTTCTTTCCGTTCGTGCCGCCGCAGCCTCCTTCGATTATCTACTTTCTTGTTGTTACTGGGGGGACCGAGTTTTGCAACCTCCGCCGGCCTTAGAAATCCTGCAGAACGCCTTCACTCAGTACATCAACTTCAAACATAGCAATTCATCTTCCTGCTGCACCATTGGTTGTCTTTGGCTTGCCTATGTTGGTATAGCATTCGATCAAGCTATATTCTCTTTTTGGCTTTTCTCCTCCACAGTTTCTCATACATTATCCTGGGTTGTCTTTAACAATTGAACAATTAATGTCCAAGAAGGGAGCTGCAATTCTTGCTAGACGCTCCTTGATAGCACTGTGTGATGCACAATCTTCATGTAGCTCCTCAGTCTCAAACAAAGCCCTAAATCTAGTGAGAAATTTAAGCATTGCCTCTGAAAGCGAGGAATATCAGAATGATAATGGATATCATGCAGATAATTCTTCGCAGAGTTACCAAAGCCTTGGGTACTACCAGCATCATGCCGAAAATGCTAGTTTGCAGAGTTCTTCTAGGCCTTATGATGGGTTTTATACAGAAAACTCCATGCAGGGCCAGTACAGACCATCCACTAGCTCAGTTGATGGGCAGACACCTAGGAGTAGTTTTGCAAATACTTCTTTTATGCATGAAAGTGCTAGTGGATCATATGGACAAAACTATTGTGGTTTGCCTCCCAATTCTTATGGCTTTAACCAGAACCATGATGAGGCTTATAGAGACACCTATCAAAATACTCATCATGCTAACCTAGTGTCTCAGAATGGTAATTTTATTCAAAGTGGCCATAATAAAGGAATGATGGCTCAAGATCTTAATAGTTACAATGCAAATAGTCCAAGAAATTTTGTACACATAAGTAATAATGAGGTTCGTGGAGTTGATAGATCTGTGTCTCAAAATATTCTATTTGAGCATAGAGAAAATTTAACTGCATACAATGGATCTAACAATGGAGAACTACAGCAGGACAATTATGAGGTTTATGGACAGAATTCATGGGGCACACGAACAGAATCCAGACAATATGAACATGACACTGGCCTAAATTACCACAATCCTATGTCAGGACCAAACAATCATATTCCCATCTCAAGACAGTATGAGCAAAACTCTATTTCTCAGCAATATCCACAGGGACAATATCAACAAGGTGCCAGTCTTGAACAATATCAGCCGATCCCAGATACTACTCAGAGTAGCATGATTGGTACTCAAGTATTAAACAACGCCAATGCTGAGGAGGAAACTAGAGTGACTAAGGATCGCCCATATGGTGGTTGTACCCTTGAGGAGCTCGATGAATTCTGCAAGGAAGGTAAATTAAAGGAGGCTGTCGAAATTTTGGAAGTGTTAGAGAAACAACACATTCCTGTTGATTTAACCCGGTATTTAGAGTTGATGAATGCATGTGGGGAAGCAAGGTCCCTAGAAGAAGCCAAAATTGTTTGTAATTACGTAATAAAATCTCAAGCCCCTCTAAAAGTCAGCACCTATAACAAAATTCTGGAGATGTACTCTAAATGTGGTTCCATGGATGATGCCTATACGATATTCAATAAAATGCCTAGCCGCAACCTAACATCTTGGGATACTATGATTACGTGGCTTGCTAAGAACGGCCTTGGGGAAGATGCTGTTGATCTTTTTTATGAGTTCAAAAAAGCTGGGTTGAGACCTGATGGAAAAATTTTTATTGGAGTTTTCTCCGCGTGTAGTGTCTTGGGAGATGTTGATGAGGGGATGCTTCACTTCGAATCAATGACAAAGAATTATGGCATTACTCCTTCCATGCATCATTATGTTAGTATAGTAGACATGCTTGGAAGTACAGGATATGTTGATGAAGCTTTGGAGTTCATTGAAATGATGCCATTGGAGCCAGGGGTAGACATTTGGGAAACGGTGATGAATATCTCTAGAGCTCATGGGTTGATGGAGCTTGGAGATCGTTGTTTTGAGCTTGTTGAGCAGCTTGATCCCTCTCGCCTAAATGAGCAGTCAAAAGCTGGCCTTCTACCTGTAACAGCTTCAGACCTTGCAAAAGAGAGAGAGAAGAAAAAATTAGCTAACCGGAATCTTTTAGAAGTCAGGAGCCGAGTACATGAATATCGAGCTGGAGATACGTCTCATCCTGAAAATGACAAGATCTATACCTTGCTTAGGGGTTTGAGGGAACAGATGAAGGAGGCTGGTTATATTCCAGAGACTCGATTTGTACTCCATGATATAGATCAGGAAGGCAAAAATGACGCTCTACTTGGTCATAGTGAGAGACTTGCTGTTGCATATGGTCTAATCAGTAGCTCAGCCCGCTCACCCATAAGAATAATTAAGAACCTTCGTGTTTGTGGTGATTGCCATAGTGCACTAAAGATAATTTCAAAAATTGTAGGTCGAGAGCTCATCATTCGTGATGCTAAGAGATTTCACCATTTTAAAGATGGATTATGCTCCTGCCGTGATTACTGGTGAACAAAATGAAACTTGTGAAGCACTGTGACAGATAAATGTACAACTCTCGTCGCATATGTGGCTTCTGCAGCTCCATTACTAGGTTCGAAAGGGCGTCGAGGTTGCACACCGGGAGAACCAGTGAAGTGAAATCAACAGAAAGATCAGAATCATTGCTGCCAGGCAAAGTCACCAAAGGCCTCACTCGGACGAGACAGAATGTTTAAGAGTATATGGAACAAATCACTGATGGCTTATGCATCGATCAGATCAAAGCTAATTTATTCTTCCTGAATTGCAGCTATTATTTCTTGACATCTTTTAATTATTTAGTAATGTAATAACCATTAACATCTCCAATTTTGATGTAGAAGTTAGCAAGAGAAATAGTGATACAAAATTGAACCAAGCCAGTAGTCCAAACTAAACAAAGTCGAACCGAAAGATTTGTATTTTCTTTGTTCAAAGATTTATTGATTTAATTCTATTTAGTTCTGTTTTTTATTTGGATTGATTCTATCAAATTTTTACAGTCAAACCAATGTGTAAATGTACATGTGTTAGTGAACAAATACTGCATAGGTC

Coding sequence (CDS)

ATGTCCAAGAAGGGAGCTGCAATTCTTGCTAGACGCTCCTTGATAGCACTGTGTGATGCACAATCTTCATGTAGCTCCTCAGTCTCAAACAAAGCCCTAAATCTAGTGAGAAATTTAAGCATTGCCTCTGAAAGCGAGGAATATCAGAATGATAATGGATATCATGCAGATAATTCTTCGCAGAGTTACCAAAGCCTTGGGTACTACCAGCATCATGCCGAAAATGCTAGTTTGCAGAGTTCTTCTAGGCCTTATGATGGGTTTTATACAGAAAACTCCATGCAGGGCCAGTACAGACCATCCACTAGCTCAGTTGATGGGCAGACACCTAGGAGTAGTTTTGCAAATACTTCTTTTATGCATGAAAGTGCTAGTGGATCATATGGACAAAACTATTGTGGTTTGCCTCCCAATTCTTATGGCTTTAACCAGAACCATGATGAGGCTTATAGAGACACCTATCAAAATACTCATCATGCTAACCTAGTGTCTCAGAATGGTAATTTTATTCAAAGTGGCCATAATAAAGGAATGATGGCTCAAGATCTTAATAGTTACAATGCAAATAGTCCAAGAAATTTTGTACACATAAGTAATAATGAGGTTCGTGGAGTTGATAGATCTGTGTCTCAAAATATTCTATTTGAGCATAGAGAAAATTTAACTGCATACAATGGATCTAACAATGGAGAACTACAGCAGGACAATTATGAGGTTTATGGACAGAATTCATGGGGCACACGAACAGAATCCAGACAATATGAACATGACACTGGCCTAAATTACCACAATCCTATGTCAGGACCAAACAATCATATTCCCATCTCAAGACAGTATGAGCAAAACTCTATTTCTCAGCAATATCCACAGGGACAATATCAACAAGGTGCCAGTCTTGAACAATATCAGCCGATCCCAGATACTACTCAGAGTAGCATGATTGGTACTCAAGTATTAAACAACGCCAATGCTGAGGAGGAAACTAGAGTGACTAAGGATCGCCCATATGGTGGTTGTACCCTTGAGGAGCTCGATGAATTCTGCAAGGAAGGTAAATTAAAGGAGGCTGTCGAAATTTTGGAAGTGTTAGAGAAACAACACATTCCTGTTGATTTAACCCGGTATTTAGAGTTGATGAATGCATGTGGGGAAGCAAGGTCCCTAGAAGAAGCCAAAATTGTTTGTAATTACGTAATAAAATCTCAAGCCCCTCTAAAAGTCAGCACCTATAACAAAATTCTGGAGATGTACTCTAAATGTGGTTCCATGGATGATGCCTATACGATATTCAATAAAATGCCTAGCCGCAACCTAACATCTTGGGATACTATGATTACGTGGCTTGCTAAGAACGGCCTTGGGGAAGATGCTGTTGATCTTTTTTATGAGTTCAAAAAAGCTGGGTTGAGACCTGATGGAAAAATTTTTATTGGAGTTTTCTCCGCGTGTAGTGTCTTGGGAGATGTTGATGAGGGGATGCTTCACTTCGAATCAATGACAAAGAATTATGGCATTACTCCTTCCATGCATCATTATGTTAGTATAGTAGACATGCTTGGAAGTACAGGATATGTTGATGAAGCTTTGGAGTTCATTGAAATGATGCCATTGGAGCCAGGGGTAGACATTTGGGAAACGGTGATGAATATCTCTAGAGCTCATGGGTTGATGGAGCTTGGAGATCGTTGTTTTGAGCTTGTTGAGCAGCTTGATCCCTCTCGCCTAAATGAGCAGTCAAAAGCTGGCCTTCTACCTGTAACAGCTTCAGACCTTGCAAAAGAGAGAGAGAAGAAAAAATTAGCTAACCGGAATCTTTTAGAAGTCAGGAGCCGAGTACATGAATATCGAGCTGGAGATACGTCTCATCCTGAAAATGACAAGATCTATACCTTGCTTAGGGGTTTGAGGGAACAGATGAAGGAGGCTGGTTATATTCCAGAGACTCGATTTGTACTCCATGATATAGATCAGGAAGGCAAAAATGACGCTCTACTTGGTCATAGTGAGAGACTTGCTGTTGCATATGGTCTAATCAGTAGCTCAGCCCGCTCACCCATAAGAATAATTAAGAACCTTCGTGTTTGTGGTGATTGCCATAGTGCACTAAAGATAATTTCAAAAATTGTAGGTCGAGAGCTCATCATTCGTGATGCTAAGAGATTTCACCATTTTAAAGATGGATTATGCTCCTGCCGTGATTACTGGTGA

Protein sequence

MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSSQSYQSLGYYQHHAENASLQSSSRPYDGFYTENSMQGQYRPSTSSVDGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQNGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYNGSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Homology
BLAST of Tan0020092 vs. ExPASy Swiss-Prot
Match: Q680H3 (Pentatricopeptide repeat-containing protein At2g25580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H75 PE=2 SV=2)

HSP 1 Score: 452.6 bits (1163), Expect = 8.5e-126
Identity = 216/400 (54.00%), Postives = 290/400 (72.50%), Query Frame = 0

Query: 336 YGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVC 395
           Y    +EE D FCK GK+K+A+  +++L   +  VDL+R L L   CGEA  L+EAK V 
Sbjct: 218 YTDIMIEEYDAFCKHGKVKKALYTIDILASMNYVVDLSRLLRLAKICGEAEGLQEAKTVH 277

Query: 396 NYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGE 455
             +  S + L +S+ + +LEMYS CG  ++A ++F KM  +NL +W  +I   AKNG GE
Sbjct: 278 GKISASVSHLDLSSNHVLLEMYSNCGLANEAASVFEKMSEKNLETWCIIIRCFAKNGFGE 337

Query: 456 DAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSI 515
           DA+D+F  FK+ G  PDG++F G+F AC +LGDVDEG+LHFESM+++YGI PS+  YVS+
Sbjct: 338 DAIDMFSRFKEEGNIPDGQLFRGIFYACGMLGDVDEGLLHFESMSRDYGIAPSIEDYVSL 397

Query: 516 VDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRL 575
           V+M    G++DEALEF+E MP+EP VD+WET+MN+SR HG +ELGD C E+VE LDP+RL
Sbjct: 398 VEMYALPGFLDEALEFVERMPMEPNVDVWETLMNLSRVHGNLELGDYCAEVVEFLDPTRL 457

Query: 576 NEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGL 635
           N+QS+ G +PV ASD+ KE  KK+  +  L  V+S + E+RAGDT+ PEND+++ LLR L
Sbjct: 458 NKQSREGFIPVKASDVEKESLKKR--SGILHGVKSSMQEFRAGDTNLPENDELFQLLRNL 517

Query: 636 REQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVC 695
           +  M E GY+ ETR  LHDIDQE K   LLGHSER+A A  +++S+ R P  +IKNLRVC
Sbjct: 518 KMHMVEVGYVAETRMALHDIDQESKETLLLGHSERIAFARAVLNSAPRKPFTVIKNLRVC 577

Query: 696 GDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
            DCH+ALKI+S IVGRE+I RD KRFH  K+G C+C+DYW
Sbjct: 578 VDCHNALKIMSDIVGREVITRDIKRFHQMKNGACTCKDYW 615

BLAST of Tan0020092 vs. ExPASy Swiss-Prot
Match: Q9SUU7 (Pentatricopeptide repeat-containing protein At4g32450, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H63 PE=2 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 5.2e-123
Identity = 249/546 (45.60%), Postives = 340/546 (62.27%), Query Frame = 0

Query: 194 FVHISNNEVR-GVDRSVSQNILFEHRENLTAYNGSNNGELQQDNYEVYGQNSWGTRTESR 253
           F ++S   +R G +   + N +     ++   NG N GE     ++   QNS        
Sbjct: 23  FSYLSTAALRLGFENPTNGNPMDNSSHHIGYVNGFNGGEQSLGGFQ---QNS-------- 82

Query: 254 QYEHDTGLNYHNPMSGPNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSS 313
            YE        NP+SG N   P +R Y QN  ++    G++ +  + ++ Q    +   S
Sbjct: 83  -YEQSL-----NPVSGQN---PTNRFY-QNGYNRNQSYGEHSEIIN-QRNQNWQSSDGCS 142

Query: 314 MIGTQVLNNANAEEET---RVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIP 373
             GT   N    E  T      +D   G  +L+ELD  C+EGK+K+AVEI++    +   
Sbjct: 143 SYGT-TGNGVPQENNTGGNHFQQDHS-GHSSLDELDSICREGKVKKAVEIIKSWRNEGYV 202

Query: 374 VDLTRYLELMNACGEARSLEEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTI 433
           VDL R   +   CG+A++L+EAK+V  ++  S     +S YN I+EMYS CGS++DA T+
Sbjct: 203 VDLPRLFWIAQLCGDAQALQEAKVVHEFITSSVGISDISAYNSIIEMYSGCGSVEDALTV 262

Query: 434 FNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDV 493
           FN MP RNL +W  +I   AKNG GEDA+D F  FK+ G +PDG++F  +F AC VLGD+
Sbjct: 263 FNSMPERNLETWCGVIRCFAKNGQGEDAIDTFSRFKQEGNKPDGEMFKEIFFACGVLGDM 322

Query: 494 DEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMN 553
           +EG+LHFESM K YGI P M HYVS+V ML   GY+DEAL F+E M  EP VD+WET+MN
Sbjct: 323 NEGLLHFESMYKEYGIIPCMEHYVSLVKMLAEPGYLDEALRFVESM--EPNVDLWETLMN 382

Query: 554 ISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVR 613
           +SR HG + LGDRC ++VEQLD SRLN++SKAGL+PV +SDL KE+ ++     N     
Sbjct: 383 LSRVHGDLILGDRCQDMVEQLDASRLNKESKAGLVPVKSSDLVKEKLQRMAKGPNY---- 442

Query: 614 SRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSE 673
             +    AGD S PEN ++Y  L+ L+E M E GY+P ++  LHD+DQE K++ L  H+E
Sbjct: 443 -GIRYMAAGDISRPENRELYMALKSLKEHMIEIGYVPLSKLALHDVDQESKDENLFNHNE 502

Query: 674 RLAVAYGLISSSARSPIRIIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLC 733
           R A     + + ARS IR++KNLRVC DCH+ALK++SKIVGRELI RDAKRFHH KDG+C
Sbjct: 503 RFAFISTFLDTPARSLIRVMKNLRVCADCHNALKLMSKIVGRELISRDAKRFHHMKDGVC 537

Query: 734 SCRDYW 736
           SCR+YW
Sbjct: 563 SCREYW 537

BLAST of Tan0020092 vs. ExPASy Swiss-Prot
Match: Q8S8Q7 (Pentatricopeptide repeat-containing protein At2g34370, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H25 PE=2 SV=1)

HSP 1 Score: 367.9 bits (943), Expect = 2.8e-100
Identity = 178/397 (44.84%), Postives = 268/397 (67.51%), Query Frame = 0

Query: 340 TLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVI 399
           T+E  D  CK+ K++EA+E++++LE +   VD  R L L   CGE  +LEEA++V + + 
Sbjct: 80  TIETFDALCKQVKIREALEVIDILEDKGYIVDFPRLLGLAKLCGEVEALEEARVVHDCI- 139

Query: 400 KSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVD 459
               PL   +Y+ ++EMYS C S DDA  +FN+MP RN  +W TMI  LAKNG GE A+D
Sbjct: 140 ---TPLDARSYHTVIEMYSGCRSTDDALNVFNEMPKRNSETWGTMIRCLAKNGEGERAID 199

Query: 460 LFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDML 519
           +F  F + G +PD +IF  VF AC  +GD++EG+LHFESM ++YG+  SM  YV++++ML
Sbjct: 200 MFTRFIEEGNKPDKEIFKAVFFACVSIGDINEGLLHFESMYRDYGMVLSMEDYVNVIEML 259

Query: 520 GSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQS 579
            + G++DEAL+F+E M +EP V++WET+MN+    G +ELGDR  EL+++LD SR++++S
Sbjct: 260 AACGHLDEALDFVERMTVEPSVEMWETLMNLCWVQGYLELGDRFAELIKKLDASRMSKES 319

Query: 580 KAGLLPVTASDLAKEREKK-KLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQ 639
            AGL+   ASD A E+ K+ +       + + R+HE+RAGDTSH       +  R L+ Q
Sbjct: 320 NAGLVAAKASDSAMEKLKELRYCQMIRDDPKKRMHEFRAGDTSHLGT---VSAFRSLKVQ 379

Query: 640 MKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDC 699
           M + G++P TR     +++E K + LL  S +LA A+ +I+S AR P+ +++N+R C D 
Sbjct: 380 MLDIGFVPATRVCFVTVEEEEKEEQLLFRSNKLAFAHAIINSEARRPLTVLQNMRTCIDG 439

Query: 700 HSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           H+  K+IS I GR LI RD K++H +K+G+CSC+DYW
Sbjct: 440 HNTFKMISLITGRALIQRDKKKYHFYKNGVCSCKDYW 469

BLAST of Tan0020092 vs. ExPASy Swiss-Prot
Match: Q9C6G2 (Pentatricopeptide repeat-containing protein At1g29710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H67 PE=3 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 6.4e-97
Identity = 191/470 (40.64%), Postives = 281/470 (59.79%), Query Frame = 0

Query: 269 PNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEET 328
           P++H+ I ++Y  + I++     +Y++  +          TQ+SM+G         + +T
Sbjct: 34  PSHHLHILKKYGSSEITEMI--NRYKRNVAGH------TLTQNSMVG---------QYKT 93

Query: 329 RVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSL 388
            V+        T+E  D  C +G  +EAVE+L+ LE +   +DL R L L   CG+  +L
Sbjct: 94  TVSPSVAQ-NVTIETFDSLCIQGNWREAVEVLDYLENKGYAMDLIRLLGLAKLCGKPEAL 153

Query: 389 EEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWL 448
           E A++V   +I   +P  V   N I+EMYS C S+DDA  +F +MP  N  +   M+   
Sbjct: 154 EAARVVHECIIALVSPCDVGARNAIIEMYSGCCSVDDALKVFEEMPEWNSGTLCVMMRCF 213

Query: 449 AKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPS 508
             NG GE+A+DLF  FK+ G +P+G+IF  VFS C++ GDV EG L F++M + YGI PS
Sbjct: 214 VNNGYGEEAIDLFTRFKEEGNKPNGEIFNQVFSTCTLTGDVKEGSLQFQAMYREYGIVPS 273

Query: 509 MHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVE 568
           M HY S+  ML ++G++DEAL F+E MP+EP VD+WET+MN+SR HG +ELGDRC ELVE
Sbjct: 274 MEHYHSVTKMLATSGHLDEALNFVERMPMEPSVDVWETLMNLSRVHGDVELGDRCAELVE 333

Query: 569 QLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEY---RAGDTSHPEN 628
           +LD +RL++ S AGL+   ASD  K+              RS  + Y   R  D+SHP+ 
Sbjct: 334 KLDATRLDKVSSAGLVATKASDFVKKEP----------STRSEPYFYSTFRPVDSSHPQM 393

Query: 629 DKIYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSP 688
           + IY  L  LR Q+KE GY+P+TR+    I      + + G+ E +AV   L+ S  RS 
Sbjct: 394 NIIYETLMSLRSQLKEMGYVPDTRYYRSLIMAMENKEQIFGYREEIAVVESLLKSKPRSA 453

Query: 689 IRIIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           I ++ N+R+ GDCH  +K++S I GR++I RDAK +H FK+G+C C + W
Sbjct: 454 ITLLTNIRIVGDCHDMMKLMSVITGRDMIKRDAKIYHLFKNGVCRCNNLW 475

BLAST of Tan0020092 vs. ExPASy Swiss-Prot
Match: Q9ZQE5 (Pentatricopeptide repeat-containing protein At2g15690, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H66 PE=1 SV=2)

HSP 1 Score: 294.3 bits (752), Expect = 3.9e-78
Identity = 201/588 (34.18%), Postives = 298/588 (50.68%), Query Frame = 0

Query: 156 NTHHANLVSQNGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILF 215
           N +H N   Q+G+   S H +    Q  +S N  +    V  S N+              
Sbjct: 56  NDYHQN--PQSGS--PSQHQRPYPPQSFDSQNQTNTNQRVPQSPNQWS-----------T 115

Query: 216 EHRENLTAYNGSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPM--------S 275
           +H   +  Y G N     Q      GQN       S+   H+     H P          
Sbjct: 116 QHGGQIPQYGGQNPQHGGQ-RPPYGGQNPQQGGQMSQYGGHNPQHGGHRPQYGGQRPQYG 175

Query: 276 GPNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEE 335
           GP N+        QN   QQ  Q QY      +Q QP   + QS     +V    + EE 
Sbjct: 176 GPGNNY-------QNQNVQQSNQSQYYTPQQQQQPQPPRSSNQSPNQMNEVAPPPSVEEV 235

Query: 336 TRVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARS 395
            R+ + R Y                 K+A+E+   L+K  +P D   ++ L  +C   +S
Sbjct: 236 MRLCQRRLY-----------------KDAIEL---LDKGAMP-DRECFVLLFESCANLKS 295

Query: 396 LEEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITW 455
           LE +K V ++ ++S+        N ++ M+ +C S+ DA  +F+ M  +++ SW  M+  
Sbjct: 296 LEHSKKVHDHFLQSKFRGDPKLNNMVISMFGECSSITDAKRVFDHMVDKDMDSWHLMMCA 355

Query: 456 LAKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITP 515
            + NG+G+DA+ LF E  K GL+P+ + F+ VF AC+ +G ++E  LHF+SM   +GI+P
Sbjct: 356 YSDNGMGDDALHLFEEMTKHGLKPNEETFLTVFLACATVGGIEEAFLHFDSMKNEHGISP 415

Query: 516 SMHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELV 575
              HY+ ++ +LG  G++ EA ++I  +P EP  D WE + N +R HG ++L D   EL+
Sbjct: 416 KTEHYLGVLGVLGKCGHLVEAEQYIRDLPFEPTADFWEAMRNYARLHGDIDLEDYMEELM 475

Query: 576 EQLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDK 635
             +DPS+    +    +P       KE         N++  +SR+ E+R   T + +  K
Sbjct: 476 VDVDPSK----AVINKIPTPPPKSFKE--------TNMVTSKSRILEFR-NLTFYKDEAK 535

Query: 636 IYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIR 695
                +G+        Y+P+TRFVLHDIDQE K  ALL HSERLA+AYG+I +  R  + 
Sbjct: 536 EMAAKKGV-------VYVPDTRFVLHDIDQEAKEQALLYHSERLAIAYGIICTPPRKTLT 579

Query: 696 IIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IIKNLRVCGDCH+ +KI+SKI+GR LI+RD KRFHHFKDG CSC DYW
Sbjct: 596 IIKNLRVCGDCHNFIKIMSKIIGRVLIVRDNKRFHHFKDGKCSCGDYW 579

BLAST of Tan0020092 vs. NCBI nr
Match: XP_038876765.1 (pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Benincasa hispida] >XP_038876766.1 pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 1216.1 bits (3145), Expect = 0.0e+00
Identity = 625/749 (83.44%), Postives = 673/749 (89.85%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAI+ARRSL+ALC A SSCSSS SNK LNLVRNLS+ASE EE QNDNGYHAD S 
Sbjct: 1   MCKKRAAIIARRSLVALCTAGSSCSSSASNKVLNLVRNLSVASEREECQNDNGYHADTSL 60

Query: 61  QSY----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSVD 120
           QSY          Q+ GYYQH++++ASLQ  SRPY    D F  ENSMQGQ++ STSSV 
Sbjct: 61  QSYQTHGGFSSDNQNSGYYQHYSQSASLQ--SRPYEDNIDSFDAENSMQGQHKLSTSSVY 120

Query: 121 GQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQN 180
           GQ P SSFANTS MHE+AS SYGQNY G+PPNSYGFNQNHDEAYR+TYQNTHHAN VS N
Sbjct: 121 GQRPGSSFANTSPMHETASRSYGQNYGGMPPNSYGFNQNHDEAYRETYQNTHHANPVSLN 180

Query: 181 GNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYNG 240
           GNF+++G+N G++AQD NSYN N+ RNFVHISNNE+ GVDRS+SQNI  E RE  +AYNG
Sbjct: 181 GNFVENGYN-GVVAQDHNSYNGNNRRNFVHISNNELCGVDRSMSQNIQLERREIFSAYNG 240

Query: 241 SNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNSISQ 300
            NN ELQQ+NY V GQNSWGT TES+QY H TGLNYHNP+SG NNHIP+SR YEQNSISQ
Sbjct: 241 YNNEELQQNNYGVSGQNSWGTWTESKQYVHGTGLNYHNPISGLNNHIPLSRHYEQNSISQ 300

Query: 301 QYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEELDE 360
           Q+PQGQYQQGAS+EQY+P PDT QS+MIGTQV+NNAN E E  + KDRP+ G TLEELDE
Sbjct: 301 QHPQGQYQQGASVEQYKPNPDTNQSNMIGTQVVNNANGEGEIGMAKDRPH-GATLEELDE 360

Query: 361 FCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQAPLK 420
           FC+EGKLKEAV+ILEVLEKQHIP++L+RYL+LMNACGEARSLEEAK+VCNYVIKSQ PLK
Sbjct: 361 FCREGKLKEAVQILEVLEKQHIPINLSRYLDLMNACGEARSLEEAKVVCNYVIKSQTPLK 420

Query: 421 VSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYEFKK 480
           VSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDA+DLFYEFKK
Sbjct: 421 VSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAIDLFYEFKK 480

Query: 481 AGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTGYVD 540
           AGLRPDGK+FIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGS GYVD
Sbjct: 481 AGLRPDGKMFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSIGYVD 540

Query: 541 EALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGLLPV 600
           EALEFIE MPLEPGVDIWET+MNISRAHGLM+LGDRC ELVE LD SRLNEQSKAGLLP+
Sbjct: 541 EALEFIEKMPLEPGVDIWETMMNISRAHGLMDLGDRCCELVEHLDSSRLNEQSKAGLLPI 600

Query: 601 TASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP 660
            ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAGYIP
Sbjct: 601 KASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAGYIP 660

Query: 661 ETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALKIIS 720
           ETRFVLHDIDQEGK DALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALKIIS
Sbjct: 661 ETRFVLHDIDQEGKYDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALKIIS 720

Query: 721 KIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           KIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 KIVGRELIIRDAKRFHHFKDGLCSCRDYW 745

BLAST of Tan0020092 vs. NCBI nr
Match: XP_008443720.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Cucumis melo] >XP_008443721.1 PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Cucumis melo] >XP_008443722.1 PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Cucumis melo] >XP_008443723.1 PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Cucumis melo] >XP_008443724.1 PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Cucumis melo] >KAA0038293.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 610/752 (81.12%), Postives = 654/752 (86.97%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL  A+SSCSSSVS+KALNLVRNLSIASE EE Q+DNGYHADNS 
Sbjct: 1   MCKKKAAILARRSLIALYTARSSCSSSVSHKALNLVRNLSIASEREECQDDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
           +SY           QS GYYQHHA++ SLQ  SRP+    DGFYT NS+QG  RPSTSS 
Sbjct: 61  RSYQTHGGSVSSYNQSPGYYQHHAQSTSLQ--SRPHQDILDGFYTGNSLQGPDRPSTSSA 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
             Q P SSFANTS MHE AS SYGQ+Y G+PPNS GFN+NH EA R+TYQNTHH + V+ 
Sbjct: 121 YRQKPGSSFANTSHMHEIASRSYGQHYSGMPPNSCGFNENHHEACRETYQNTHHTSPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+N G++AQD NSYN N+PRNFV ISNN VR VDRS S N     RE  +AYN
Sbjct: 181 NGNFIENGYN-GVVAQDHNSYNGNTPRNFVEISNNVVREVDRSTSPNNQLGPREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G +N   QQ+ Y + GQNSWGT  ES+QY H TGL +HNPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYSNEATQQNIYGISGQNSWGTWRESKQYLHGTGLKHHNPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I QQYPQGQY QG+S+EQYQP PDT Q+ MIG QVL N NA EE   T+DR  GG  LE+
Sbjct: 301 IPQQYPQGQYHQGSSVEQYQPNPDTNQNYMIGNQVLYNVNANEEIGKTRDRQQGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEG LKEAVEILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK VCNYVIKSQ 
Sbjct: 361 LDEFCKEGNLKEAVEILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKAVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEALEFIE MPLEPGVDIWET+MNI+RAHGLMELGDRCFELVE LDPSRLNEQSKAGL
Sbjct: 541 FVDEALEFIEKMPLEPGVDIWETMMNIARAHGLMELGDRCFELVEHLDPSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LP+ ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPIKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 748

BLAST of Tan0020092 vs. NCBI nr
Match: TYK19847.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1162.5 bits (3006), Expect = 0.0e+00
Identity = 609/752 (80.98%), Postives = 653/752 (86.84%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL  A+SSCSSSVS+KALNLVRNLSIASE EE Q+DNGYHADNS 
Sbjct: 1   MCKKKAAILARRSLIALYTARSSCSSSVSHKALNLVRNLSIASEREECQDDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
           +SY           QS GYYQHHA++ SLQ  SRP+    DGFYT NS+QG  RPSTSS 
Sbjct: 61  RSYQTHGGSVSSYNQSPGYYQHHAQSTSLQ--SRPHQDILDGFYTGNSLQGPDRPSTSSA 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
             Q P SSFANTS MHE AS SYGQ+Y G+PPNS GFN+NH EA R+TYQNTHH + V+ 
Sbjct: 121 YRQKPGSSFANTSHMHEIASRSYGQHYSGMPPNSCGFNENHHEACRETYQNTHHTSPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+N G++AQD NSYN N+PRNFV ISNN VR VDRS S N     RE  +AYN
Sbjct: 181 NGNFIENGYN-GVVAQDHNSYNGNTPRNFVEISNNVVREVDRSTSPNNQLGPREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G +N   QQ+ Y + GQNSWGT  ES+QY H TGL +HNPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYSNEATQQNIYGISGQNSWGTWRESKQYLHGTGLKHHNPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I QQYPQGQY QG+S+EQYQP PDT Q+ MIG QVL N NA EE   T+DR  GG  LE+
Sbjct: 301 IPQQYPQGQYHQGSSVEQYQPNPDTNQNYMIGNQVLYNVNANEEIGKTRDRQQGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEG LKEAVEILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK VCNYVIKSQ 
Sbjct: 361 LDEFCKEGNLKEAVEILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKAVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEALEFIE MPLEPGVDIWET+MNI+RAHGLMELGDRC ELVE LDPSRLNEQSKAGL
Sbjct: 541 FVDEALEFIEKMPLEPGVDIWETMMNIARAHGLMELGDRCLELVEHLDPSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LP+ ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPIKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 748

BLAST of Tan0020092 vs. NCBI nr
Match: XP_022139059.1 (pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Momordica charantia] >XP_022139061.1 pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Momordica charantia] >XP_022139062.1 pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Momordica charantia])

HSP 1 Score: 1151.7 bits (2978), Expect = 0.0e+00
Identity = 598/749 (79.84%), Postives = 655/749 (87.45%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           MSKK AAILARRS+IALC+A+SSCSS VSNK LNL+RNLSI+SE EEYQNDNGYHADNSS
Sbjct: 1   MSKKRAAILARRSVIALCNARSSCSSYVSNKPLNLLRNLSISSEREEYQNDNGYHADNSS 60

Query: 61  QSYQS----------LGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSVD 120
           QSYQS           GYYQHHA++ASL+SSSRPY    DGFYTE+  +GQ+RPSTSSV 
Sbjct: 61  QSYQSHGGFNCDNQNPGYYQHHAQSASLKSSSRPYQENLDGFYTEDLTRGQHRPSTSSVY 120

Query: 121 GQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQN 180
           GQ P S F NTS MHE+AS S     C LP  SYGFNQNH EA+R+TY+NTH+ANLV+ N
Sbjct: 121 GQAPGSGFTNTSPMHENASRS-----C-LPTTSYGFNQNHYEAFRETYENTHNANLVAHN 180

Query: 181 GNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYNG 240
           GNFI  G NKGMMAQDL SYNA+S      IS NEVRG DRS+SQN+L EHRENLTAY+G
Sbjct: 181 GNFIHHGLNKGMMAQDLKSYNADS-----QISYNEVRGDDRSLSQNVLLEHRENLTAYSG 240

Query: 241 SNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNSISQ 300
            NNG  +++   VYGQ+SWGT  ESRQY+H TGLNYHNP SGPNN IPIS   EQNSISQ
Sbjct: 241 FNNGVPRENGNGVYGQSSWGTSMESRQYKHVTGLNYHNPTSGPNNQIPISGHCEQNSISQ 300

Query: 301 QYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEELDE 360
           QY Q Q QQGA++EQYQP P+T QSSM+ TQ++NN NA++ET VTKD    G T+EELDE
Sbjct: 301 QYLQQQCQQGATVEQYQPSPNTIQSSMMDTQLVNNINADKETGVTKDH-QNGDTIEELDE 360

Query: 361 FCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQAPLK 420
           FCKEGKLKEAV++LE LEKQHI VDL RYL+LMNAC EARSLEEAK+VC+Y+ +S +PLK
Sbjct: 361 FCKEGKLKEAVQVLEALEKQHILVDLPRYLQLMNACAEARSLEEAKVVCDYISRSHSPLK 420

Query: 421 VSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYEFKK 480
           VSTYNKILEMYSKCGSM DAYTIFN MP+RNLTSWDTMITWLAKNGLGEDA+DLFYEFKK
Sbjct: 421 VSTYNKILEMYSKCGSMGDAYTIFNNMPNRNLTSWDTMITWLAKNGLGEDAIDLFYEFKK 480

Query: 481 AGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTGYVD 540
           AGL+PDGK+FIG+FSACSVLGD+DEGMLH ESMTKNYGI PSMHHYVSIVDMLGS GYVD
Sbjct: 481 AGLKPDGKMFIGLFSACSVLGDIDEGMLHLESMTKNYGIIPSMHHYVSIVDMLGSVGYVD 540

Query: 541 EALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGLLPV 600
           EALEFIE MPLEPGV+IWE +MNISRAHG MELGDRCFELVEQLDPSRLNE+SKAGLLPV
Sbjct: 541 EALEFIEKMPLEPGVEIWEMMMNISRAHGFMELGDRCFELVEQLDPSRLNEESKAGLLPV 600

Query: 601 TASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP 660
            ASDLAKEREKKKLAN+NLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP
Sbjct: 601 RASDLAKEREKKKLANQNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP 660

Query: 661 ETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALKIIS 720
           ETRFVLHDIDQEGKNDALL HSERLAVAYGLISSSAR+PIR+IKNLRVCGDCH+ALKIIS
Sbjct: 661 ETRFVLHDIDQEGKNDALLAHSERLAVAYGLISSSARAPIRVIKNLRVCGDCHNALKIIS 720

Query: 721 KIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           KIVGRELI+RDAKRFHHFKDGLCSCRDYW
Sbjct: 721 KIVGRELIMRDAKRFHHFKDGLCSCRDYW 737

BLAST of Tan0020092 vs. NCBI nr
Match: XP_011660234.1 (pentatricopeptide repeat-containing protein At4g32450, mitochondrial [Cucumis sativus] >KGN66689.1 hypothetical protein Csa_007195 [Cucumis sativus])

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 602/752 (80.05%), Postives = 645/752 (85.77%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL   +SSCSSSVSNKALNLVRNLSIASE EE QNDNGYHADNS 
Sbjct: 1   MCKKRAAILARRSLIALYTPRSSCSSSVSNKALNLVRNLSIASEREECQNDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
            SY           QS GYYQHHA++ S  S SRP+    DGFYTENS+QG +RPSTSSV
Sbjct: 61  PSYQTHGGSVSSYNQSPGYYQHHAQSTS--SQSRPHQDILDGFYTENSLQGLHRPSTSSV 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
            GQ P  SFANTS MHESAS SYGQ+Y G+PPNS GFNQNH EAYR+T+QNTHHA+ V+ 
Sbjct: 121 YGQKPGGSFANTSPMHESASRSYGQHYSGVPPNSCGFNQNHHEAYRETFQNTHHASPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+ KG +AQD NSYN ++PRNFV ++NN V GVDRS+SQN    HRE  +AYN
Sbjct: 181 NGNFIENGY-KGGVAQDHNSYNGSTPRNFVDMNNNVVCGVDRSMSQNNQLGHREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G NN   QQ+NY V GQN            HD      NPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYNNEATQQNNYGVSGQNL-----------HD------NPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I  Q+PQGQY QG+S+EQYQP  DT Q+SMIGTQ+LNN NA EE    KD   GG  LE+
Sbjct: 301 IPLQHPQGQYHQGSSVEQYQPNTDTNQNSMIGTQLLNNVNANEEIGEPKDCQDGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEGKLKEAV+ILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK+VCNYVIKSQ 
Sbjct: 361 LDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGD DEGMLHFESMTKNYGITPSMHHYVSIVDMLGS G
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDMLGSIG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEA+EFIE MPLEPGVDIWET+MNISRAHGLMELGDRCFELVE LD SRLNEQSKAGL
Sbjct: 541 FVDEAVEFIEKMPLEPGVDIWETMMNISRAHGLMELGDRCFELVEHLDSSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LPV ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPVKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 731

BLAST of Tan0020092 vs. ExPASy TrEMBL
Match: A0A1S3B9H3 (pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103487241 PE=3 SV=1)

HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 610/752 (81.12%), Postives = 654/752 (86.97%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL  A+SSCSSSVS+KALNLVRNLSIASE EE Q+DNGYHADNS 
Sbjct: 1   MCKKKAAILARRSLIALYTARSSCSSSVSHKALNLVRNLSIASEREECQDDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
           +SY           QS GYYQHHA++ SLQ  SRP+    DGFYT NS+QG  RPSTSS 
Sbjct: 61  RSYQTHGGSVSSYNQSPGYYQHHAQSTSLQ--SRPHQDILDGFYTGNSLQGPDRPSTSSA 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
             Q P SSFANTS MHE AS SYGQ+Y G+PPNS GFN+NH EA R+TYQNTHH + V+ 
Sbjct: 121 YRQKPGSSFANTSHMHEIASRSYGQHYSGMPPNSCGFNENHHEACRETYQNTHHTSPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+N G++AQD NSYN N+PRNFV ISNN VR VDRS S N     RE  +AYN
Sbjct: 181 NGNFIENGYN-GVVAQDHNSYNGNTPRNFVEISNNVVREVDRSTSPNNQLGPREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G +N   QQ+ Y + GQNSWGT  ES+QY H TGL +HNPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYSNEATQQNIYGISGQNSWGTWRESKQYLHGTGLKHHNPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I QQYPQGQY QG+S+EQYQP PDT Q+ MIG QVL N NA EE   T+DR  GG  LE+
Sbjct: 301 IPQQYPQGQYHQGSSVEQYQPNPDTNQNYMIGNQVLYNVNANEEIGKTRDRQQGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEG LKEAVEILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK VCNYVIKSQ 
Sbjct: 361 LDEFCKEGNLKEAVEILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKAVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEALEFIE MPLEPGVDIWET+MNI+RAHGLMELGDRCFELVE LDPSRLNEQSKAGL
Sbjct: 541 FVDEALEFIEKMPLEPGVDIWETMMNIARAHGLMELGDRCFELVEHLDPSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LP+ ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPIKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 748

BLAST of Tan0020092 vs. ExPASy TrEMBL
Match: A0A5A7T5S8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold270G001570 PE=3 SV=1)

HSP 1 Score: 1164.8 bits (3012), Expect = 0.0e+00
Identity = 610/752 (81.12%), Postives = 654/752 (86.97%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL  A+SSCSSSVS+KALNLVRNLSIASE EE Q+DNGYHADNS 
Sbjct: 1   MCKKKAAILARRSLIALYTARSSCSSSVSHKALNLVRNLSIASEREECQDDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
           +SY           QS GYYQHHA++ SLQ  SRP+    DGFYT NS+QG  RPSTSS 
Sbjct: 61  RSYQTHGGSVSSYNQSPGYYQHHAQSTSLQ--SRPHQDILDGFYTGNSLQGPDRPSTSSA 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
             Q P SSFANTS MHE AS SYGQ+Y G+PPNS GFN+NH EA R+TYQNTHH + V+ 
Sbjct: 121 YRQKPGSSFANTSHMHEIASRSYGQHYSGMPPNSCGFNENHHEACRETYQNTHHTSPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+N G++AQD NSYN N+PRNFV ISNN VR VDRS S N     RE  +AYN
Sbjct: 181 NGNFIENGYN-GVVAQDHNSYNGNTPRNFVEISNNVVREVDRSTSPNNQLGPREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G +N   QQ+ Y + GQNSWGT  ES+QY H TGL +HNPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYSNEATQQNIYGISGQNSWGTWRESKQYLHGTGLKHHNPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I QQYPQGQY QG+S+EQYQP PDT Q+ MIG QVL N NA EE   T+DR  GG  LE+
Sbjct: 301 IPQQYPQGQYHQGSSVEQYQPNPDTNQNYMIGNQVLYNVNANEEIGKTRDRQQGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEG LKEAVEILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK VCNYVIKSQ 
Sbjct: 361 LDEFCKEGNLKEAVEILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKAVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEALEFIE MPLEPGVDIWET+MNI+RAHGLMELGDRCFELVE LDPSRLNEQSKAGL
Sbjct: 541 FVDEALEFIEKMPLEPGVDIWETMMNIARAHGLMELGDRCFELVEHLDPSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LP+ ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPIKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 748

BLAST of Tan0020092 vs. ExPASy TrEMBL
Match: A0A5D3D8E5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold811G00560 PE=3 SV=1)

HSP 1 Score: 1162.5 bits (3006), Expect = 0.0e+00
Identity = 609/752 (80.98%), Postives = 653/752 (86.84%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL  A+SSCSSSVS+KALNLVRNLSIASE EE Q+DNGYHADNS 
Sbjct: 1   MCKKKAAILARRSLIALYTARSSCSSSVSHKALNLVRNLSIASEREECQDDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
           +SY           QS GYYQHHA++ SLQ  SRP+    DGFYT NS+QG  RPSTSS 
Sbjct: 61  RSYQTHGGSVSSYNQSPGYYQHHAQSTSLQ--SRPHQDILDGFYTGNSLQGPDRPSTSSA 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
             Q P SSFANTS MHE AS SYGQ+Y G+PPNS GFN+NH EA R+TYQNTHH + V+ 
Sbjct: 121 YRQKPGSSFANTSHMHEIASRSYGQHYSGMPPNSCGFNENHHEACRETYQNTHHTSPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+N G++AQD NSYN N+PRNFV ISNN VR VDRS S N     RE  +AYN
Sbjct: 181 NGNFIENGYN-GVVAQDHNSYNGNTPRNFVEISNNVVREVDRSTSPNNQLGPREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G +N   QQ+ Y + GQNSWGT  ES+QY H TGL +HNPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYSNEATQQNIYGISGQNSWGTWRESKQYLHGTGLKHHNPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I QQYPQGQY QG+S+EQYQP PDT Q+ MIG QVL N NA EE   T+DR  GG  LE+
Sbjct: 301 IPQQYPQGQYHQGSSVEQYQPNPDTNQNYMIGNQVLYNVNANEEIGKTRDRQQGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEG LKEAVEILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK VCNYVIKSQ 
Sbjct: 361 LDEFCKEGNLKEAVEILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKAVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEALEFIE MPLEPGVDIWET+MNI+RAHGLMELGDRC ELVE LDPSRLNEQSKAGL
Sbjct: 541 FVDEALEFIEKMPLEPGVDIWETMMNIARAHGLMELGDRCLELVEHLDPSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LP+ ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPIKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 748

BLAST of Tan0020092 vs. ExPASy TrEMBL
Match: A0A6J1CCY6 (pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like OS=Momordica charantia OX=3673 GN=LOC111010074 PE=3 SV=1)

HSP 1 Score: 1151.7 bits (2978), Expect = 0.0e+00
Identity = 598/749 (79.84%), Postives = 655/749 (87.45%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           MSKK AAILARRS+IALC+A+SSCSS VSNK LNL+RNLSI+SE EEYQNDNGYHADNSS
Sbjct: 1   MSKKRAAILARRSVIALCNARSSCSSYVSNKPLNLLRNLSISSEREEYQNDNGYHADNSS 60

Query: 61  QSYQS----------LGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSVD 120
           QSYQS           GYYQHHA++ASL+SSSRPY    DGFYTE+  +GQ+RPSTSSV 
Sbjct: 61  QSYQSHGGFNCDNQNPGYYQHHAQSASLKSSSRPYQENLDGFYTEDLTRGQHRPSTSSVY 120

Query: 121 GQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQN 180
           GQ P S F NTS MHE+AS S     C LP  SYGFNQNH EA+R+TY+NTH+ANLV+ N
Sbjct: 121 GQAPGSGFTNTSPMHENASRS-----C-LPTTSYGFNQNHYEAFRETYENTHNANLVAHN 180

Query: 181 GNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYNG 240
           GNFI  G NKGMMAQDL SYNA+S      IS NEVRG DRS+SQN+L EHRENLTAY+G
Sbjct: 181 GNFIHHGLNKGMMAQDLKSYNADS-----QISYNEVRGDDRSLSQNVLLEHRENLTAYSG 240

Query: 241 SNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNSISQ 300
            NNG  +++   VYGQ+SWGT  ESRQY+H TGLNYHNP SGPNN IPIS   EQNSISQ
Sbjct: 241 FNNGVPRENGNGVYGQSSWGTSMESRQYKHVTGLNYHNPTSGPNNQIPISGHCEQNSISQ 300

Query: 301 QYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEELDE 360
           QY Q Q QQGA++EQYQP P+T QSSM+ TQ++NN NA++ET VTKD    G T+EELDE
Sbjct: 301 QYLQQQCQQGATVEQYQPSPNTIQSSMMDTQLVNNINADKETGVTKDH-QNGDTIEELDE 360

Query: 361 FCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQAPLK 420
           FCKEGKLKEAV++LE LEKQHI VDL RYL+LMNAC EARSLEEAK+VC+Y+ +S +PLK
Sbjct: 361 FCKEGKLKEAVQVLEALEKQHILVDLPRYLQLMNACAEARSLEEAKVVCDYISRSHSPLK 420

Query: 421 VSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYEFKK 480
           VSTYNKILEMYSKCGSM DAYTIFN MP+RNLTSWDTMITWLAKNGLGEDA+DLFYEFKK
Sbjct: 421 VSTYNKILEMYSKCGSMGDAYTIFNNMPNRNLTSWDTMITWLAKNGLGEDAIDLFYEFKK 480

Query: 481 AGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTGYVD 540
           AGL+PDGK+FIG+FSACSVLGD+DEGMLH ESMTKNYGI PSMHHYVSIVDMLGS GYVD
Sbjct: 481 AGLKPDGKMFIGLFSACSVLGDIDEGMLHLESMTKNYGIIPSMHHYVSIVDMLGSVGYVD 540

Query: 541 EALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGLLPV 600
           EALEFIE MPLEPGV+IWE +MNISRAHG MELGDRCFELVEQLDPSRLNE+SKAGLLPV
Sbjct: 541 EALEFIEKMPLEPGVEIWEMMMNISRAHGFMELGDRCFELVEQLDPSRLNEESKAGLLPV 600

Query: 601 TASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP 660
            ASDLAKEREKKKLAN+NLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP
Sbjct: 601 RASDLAKEREKKKLANQNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIP 660

Query: 661 ETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALKIIS 720
           ETRFVLHDIDQEGKNDALL HSERLAVAYGLISSSAR+PIR+IKNLRVCGDCH+ALKIIS
Sbjct: 661 ETRFVLHDIDQEGKNDALLAHSERLAVAYGLISSSARAPIRVIKNLRVCGDCHNALKIIS 720

Query: 721 KIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           KIVGRELI+RDAKRFHHFKDGLCSCRDYW
Sbjct: 721 KIVGRELIMRDAKRFHHFKDGLCSCRDYW 737

BLAST of Tan0020092 vs. ExPASy TrEMBL
Match: A0A0A0M061 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G659600 PE=3 SV=1)

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 602/752 (80.05%), Postives = 645/752 (85.77%), Query Frame = 0

Query: 1   MSKKGAAILARRSLIALCDAQSSCSSSVSNKALNLVRNLSIASESEEYQNDNGYHADNSS 60
           M KK AAILARRSLIAL   +SSCSSSVSNKALNLVRNLSIASE EE QNDNGYHADNS 
Sbjct: 1   MCKKRAAILARRSLIALYTPRSSCSSSVSNKALNLVRNLSIASEREECQNDNGYHADNSL 60

Query: 61  QSY-----------QSLGYYQHHAENASLQSSSRPY----DGFYTENSMQGQYRPSTSSV 120
            SY           QS GYYQHHA++ S  S SRP+    DGFYTENS+QG +RPSTSSV
Sbjct: 61  PSYQTHGGSVSSYNQSPGYYQHHAQSTS--SQSRPHQDILDGFYTENSLQGLHRPSTSSV 120

Query: 121 DGQTPRSSFANTSFMHESASGSYGQNYCGLPPNSYGFNQNHDEAYRDTYQNTHHANLVSQ 180
            GQ P  SFANTS MHESAS SYGQ+Y G+PPNS GFNQNH EAYR+T+QNTHHA+ V+ 
Sbjct: 121 YGQKPGGSFANTSPMHESASRSYGQHYSGVPPNSCGFNQNHHEAYRETFQNTHHASPVAP 180

Query: 181 NGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILFEHRENLTAYN 240
           NGNFI++G+ KG +AQD NSYN ++PRNFV ++NN V GVDRS+SQN    HRE  +AYN
Sbjct: 181 NGNFIENGY-KGGVAQDHNSYNGSTPRNFVDMNNNVVCGVDRSMSQNNQLGHREIFSAYN 240

Query: 241 --GSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPMSGPNNHIPISRQYEQNS 300
             G NN   QQ+NY V GQN            HD      NPMSGPNNHIP+SRQYEQNS
Sbjct: 241 GYGYNNEATQQNNYGVSGQNL-----------HD------NPMSGPNNHIPLSRQYEQNS 300

Query: 301 ISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEETRVTKDRPYGGCTLEE 360
           I  Q+PQGQY QG+S+EQYQP  DT Q+SMIGTQ+LNN NA EE    KD   GG  LE+
Sbjct: 301 IPLQHPQGQYHQGSSVEQYQPNTDTNQNSMIGTQLLNNVNANEEIGEPKDCQDGG-PLEK 360

Query: 361 LDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVIKSQA 420
           LDEFCKEGKLKEAV+ILEVLEKQHIPVDL+RYL+LMNACGEARSLEEAK+VCNYVIKSQ 
Sbjct: 361 LDEFCKEGKLKEAVQILEVLEKQHIPVDLSRYLDLMNACGEARSLEEAKVVCNYVIKSQT 420

Query: 421 PLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYE 480
            +KVSTYNKILEMYSKCGSMDDAYTIFNKMPSRN+TSWDTMITWLAKNGLGEDA+DLFYE
Sbjct: 421 HVKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNITSWDTMITWLAKNGLGEDAIDLFYE 480

Query: 481 FKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTG 540
           FKKAGLRPDGK+FIGVFSACSVLGD DEGMLHFESMTKNYGITPSMHHYVSIVDMLGS G
Sbjct: 481 FKKAGLRPDGKMFIGVFSACSVLGDADEGMLHFESMTKNYGITPSMHHYVSIVDMLGSIG 540

Query: 541 YVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGL 600
           +VDEA+EFIE MPLEPGVDIWET+MNISRAHGLMELGDRCFELVE LD SRLNEQSKAGL
Sbjct: 541 FVDEAVEFIEKMPLEPGVDIWETMMNISRAHGLMELGDRCFELVEHLDSSRLNEQSKAGL 600

Query: 601 LPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAG 660
           LPV ASDL KEREKKKLANRNLLEVRSRVHEYRAGDTSHPEND+IYTLLRGLREQMKEAG
Sbjct: 601 LPVKASDLEKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDRIYTLLRGLREQMKEAG 660

Query: 661 YIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDCHSALK 720
           YIPETRFVLHDIDQE KNDALLGHSERLAVAYGLISSSARSPIR+IKNLRVCGDCHSALK
Sbjct: 661 YIPETRFVLHDIDQEAKNDALLGHSERLAVAYGLISSSARSPIRVIKNLRVCGDCHSALK 720

Query: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW
Sbjct: 721 IISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 731

BLAST of Tan0020092 vs. TAIR 10
Match: AT2G25580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 452.6 bits (1163), Expect = 6.1e-127
Identity = 216/400 (54.00%), Postives = 290/400 (72.50%), Query Frame = 0

Query: 336 YGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVC 395
           Y    +EE D FCK GK+K+A+  +++L   +  VDL+R L L   CGEA  L+EAK V 
Sbjct: 218 YTDIMIEEYDAFCKHGKVKKALYTIDILASMNYVVDLSRLLRLAKICGEAEGLQEAKTVH 277

Query: 396 NYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGE 455
             +  S + L +S+ + +LEMYS CG  ++A ++F KM  +NL +W  +I   AKNG GE
Sbjct: 278 GKISASVSHLDLSSNHVLLEMYSNCGLANEAASVFEKMSEKNLETWCIIIRCFAKNGFGE 337

Query: 456 DAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSI 515
           DA+D+F  FK+ G  PDG++F G+F AC +LGDVDEG+LHFESM+++YGI PS+  YVS+
Sbjct: 338 DAIDMFSRFKEEGNIPDGQLFRGIFYACGMLGDVDEGLLHFESMSRDYGIAPSIEDYVSL 397

Query: 516 VDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRL 575
           V+M    G++DEALEF+E MP+EP VD+WET+MN+SR HG +ELGD C E+VE LDP+RL
Sbjct: 398 VEMYALPGFLDEALEFVERMPMEPNVDVWETLMNLSRVHGNLELGDYCAEVVEFLDPTRL 457

Query: 576 NEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGL 635
           N+QS+ G +PV ASD+ KE  KK+  +  L  V+S + E+RAGDT+ PEND+++ LLR L
Sbjct: 458 NKQSREGFIPVKASDVEKESLKKR--SGILHGVKSSMQEFRAGDTNLPENDELFQLLRNL 517

Query: 636 REQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVC 695
           +  M E GY+ ETR  LHDIDQE K   LLGHSER+A A  +++S+ R P  +IKNLRVC
Sbjct: 518 KMHMVEVGYVAETRMALHDIDQESKETLLLGHSERIAFARAVLNSAPRKPFTVIKNLRVC 577

Query: 696 GDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
            DCH+ALKI+S IVGRE+I RD KRFH  K+G C+C+DYW
Sbjct: 578 VDCHNALKIMSDIVGREVITRDIKRFHQMKNGACTCKDYW 615

BLAST of Tan0020092 vs. TAIR 10
Match: AT4G32450.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 443.4 bits (1139), Expect = 3.7e-124
Identity = 249/546 (45.60%), Postives = 340/546 (62.27%), Query Frame = 0

Query: 194 FVHISNNEVR-GVDRSVSQNILFEHRENLTAYNGSNNGELQQDNYEVYGQNSWGTRTESR 253
           F ++S   +R G +   + N +     ++   NG N GE     ++   QNS        
Sbjct: 23  FSYLSTAALRLGFENPTNGNPMDNSSHHIGYVNGFNGGEQSLGGFQ---QNS-------- 82

Query: 254 QYEHDTGLNYHNPMSGPNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSS 313
            YE        NP+SG N   P +R Y QN  ++    G++ +  + ++ Q    +   S
Sbjct: 83  -YEQSL-----NPVSGQN---PTNRFY-QNGYNRNQSYGEHSEIIN-QRNQNWQSSDGCS 142

Query: 314 MIGTQVLNNANAEEET---RVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIP 373
             GT   N    E  T      +D   G  +L+ELD  C+EGK+K+AVEI++    +   
Sbjct: 143 SYGT-TGNGVPQENNTGGNHFQQDHS-GHSSLDELDSICREGKVKKAVEIIKSWRNEGYV 202

Query: 374 VDLTRYLELMNACGEARSLEEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTI 433
           VDL R   +   CG+A++L+EAK+V  ++  S     +S YN I+EMYS CGS++DA T+
Sbjct: 203 VDLPRLFWIAQLCGDAQALQEAKVVHEFITSSVGISDISAYNSIIEMYSGCGSVEDALTV 262

Query: 434 FNKMPSRNLTSWDTMITWLAKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDV 493
           FN MP RNL +W  +I   AKNG GEDA+D F  FK+ G +PDG++F  +F AC VLGD+
Sbjct: 263 FNSMPERNLETWCGVIRCFAKNGQGEDAIDTFSRFKQEGNKPDGEMFKEIFFACGVLGDM 322

Query: 494 DEGMLHFESMTKNYGITPSMHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMN 553
           +EG+LHFESM K YGI P M HYVS+V ML   GY+DEAL F+E M  EP VD+WET+MN
Sbjct: 323 NEGLLHFESMYKEYGIIPCMEHYVSLVKMLAEPGYLDEALRFVESM--EPNVDLWETLMN 382

Query: 554 ISRAHGLMELGDRCFELVEQLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVR 613
           +SR HG + LGDRC ++VEQLD SRLN++SKAGL+PV +SDL KE+ ++     N     
Sbjct: 383 LSRVHGDLILGDRCQDMVEQLDASRLNKESKAGLVPVKSSDLVKEKLQRMAKGPNY---- 442

Query: 614 SRVHEYRAGDTSHPENDKIYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSE 673
             +    AGD S PEN ++Y  L+ L+E M E GY+P ++  LHD+DQE K++ L  H+E
Sbjct: 443 -GIRYMAAGDISRPENRELYMALKSLKEHMIEIGYVPLSKLALHDVDQESKDENLFNHNE 502

Query: 674 RLAVAYGLISSSARSPIRIIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLC 733
           R A     + + ARS IR++KNLRVC DCH+ALK++SKIVGRELI RDAKRFHH KDG+C
Sbjct: 503 RFAFISTFLDTPARSLIRVMKNLRVCADCHNALKLMSKIVGRELISRDAKRFHHMKDGVC 537

Query: 734 SCRDYW 736
           SCR+YW
Sbjct: 563 SCREYW 537

BLAST of Tan0020092 vs. TAIR 10
Match: AT2G34370.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 367.9 bits (943), Expect = 2.0e-101
Identity = 178/397 (44.84%), Postives = 268/397 (67.51%), Query Frame = 0

Query: 340 TLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSLEEAKIVCNYVI 399
           T+E  D  CK+ K++EA+E++++LE +   VD  R L L   CGE  +LEEA++V + + 
Sbjct: 80  TIETFDALCKQVKIREALEVIDILEDKGYIVDFPRLLGLAKLCGEVEALEEARVVHDCI- 139

Query: 400 KSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWLAKNGLGEDAVD 459
               PL   +Y+ ++EMYS C S DDA  +FN+MP RN  +W TMI  LAKNG GE A+D
Sbjct: 140 ---TPLDARSYHTVIEMYSGCRSTDDALNVFNEMPKRNSETWGTMIRCLAKNGEGERAID 199

Query: 460 LFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPSMHHYVSIVDML 519
           +F  F + G +PD +IF  VF AC  +GD++EG+LHFESM ++YG+  SM  YV++++ML
Sbjct: 200 MFTRFIEEGNKPDKEIFKAVFFACVSIGDINEGLLHFESMYRDYGMVLSMEDYVNVIEML 259

Query: 520 GSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVEQLDPSRLNEQS 579
            + G++DEAL+F+E M +EP V++WET+MN+    G +ELGDR  EL+++LD SR++++S
Sbjct: 260 AACGHLDEALDFVERMTVEPSVEMWETLMNLCWVQGYLELGDRFAELIKKLDASRMSKES 319

Query: 580 KAGLLPVTASDLAKEREKK-KLANRNLLEVRSRVHEYRAGDTSHPENDKIYTLLRGLREQ 639
            AGL+   ASD A E+ K+ +       + + R+HE+RAGDTSH       +  R L+ Q
Sbjct: 320 NAGLVAAKASDSAMEKLKELRYCQMIRDDPKKRMHEFRAGDTSHLGT---VSAFRSLKVQ 379

Query: 640 MKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIRIIKNLRVCGDC 699
           M + G++P TR     +++E K + LL  S +LA A+ +I+S AR P+ +++N+R C D 
Sbjct: 380 MLDIGFVPATRVCFVTVEEEEKEEQLLFRSNKLAFAHAIINSEARRPLTVLQNMRTCIDG 439

Query: 700 HSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           H+  K+IS I GR LI RD K++H +K+G+CSC+DYW
Sbjct: 440 HNTFKMISLITGRALIQRDKKKYHFYKNGVCSCKDYW 469

BLAST of Tan0020092 vs. TAIR 10
Match: AT1G29710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 356.7 bits (914), Expect = 4.5e-98
Identity = 191/470 (40.64%), Postives = 281/470 (59.79%), Query Frame = 0

Query: 269 PNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEET 328
           P++H+ I ++Y  + I++     +Y++  +          TQ+SM+G         + +T
Sbjct: 34  PSHHLHILKKYGSSEITEMI--NRYKRNVAGH------TLTQNSMVG---------QYKT 93

Query: 329 RVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARSL 388
            V+        T+E  D  C +G  +EAVE+L+ LE +   +DL R L L   CG+  +L
Sbjct: 94  TVSPSVAQ-NVTIETFDSLCIQGNWREAVEVLDYLENKGYAMDLIRLLGLAKLCGKPEAL 153

Query: 389 EEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITWL 448
           E A++V   +I   +P  V   N I+EMYS C S+DDA  +F +MP  N  +   M+   
Sbjct: 154 EAARVVHECIIALVSPCDVGARNAIIEMYSGCCSVDDALKVFEEMPEWNSGTLCVMMRCF 213

Query: 449 AKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITPS 508
             NG GE+A+DLF  FK+ G +P+G+IF  VFS C++ GDV EG L F++M + YGI PS
Sbjct: 214 VNNGYGEEAIDLFTRFKEEGNKPNGEIFNQVFSTCTLTGDVKEGSLQFQAMYREYGIVPS 273

Query: 509 MHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELVE 568
           M HY S+  ML ++G++DEAL F+E MP+EP VD+WET+MN+SR HG +ELGDRC ELVE
Sbjct: 274 MEHYHSVTKMLATSGHLDEALNFVERMPMEPSVDVWETLMNLSRVHGDVELGDRCAELVE 333

Query: 569 QLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEY---RAGDTSHPEN 628
           +LD +RL++ S AGL+   ASD  K+              RS  + Y   R  D+SHP+ 
Sbjct: 334 KLDATRLDKVSSAGLVATKASDFVKKEP----------STRSEPYFYSTFRPVDSSHPQM 393

Query: 629 DKIYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSP 688
           + IY  L  LR Q+KE GY+P+TR+    I      + + G+ E +AV   L+ S  RS 
Sbjct: 394 NIIYETLMSLRSQLKEMGYVPDTRYYRSLIMAMENKEQIFGYREEIAVVESLLKSKPRSA 453

Query: 689 IRIIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           I ++ N+R+ GDCH  +K++S I GR++I RDAK +H FK+G+C C + W
Sbjct: 454 ITLLTNIRIVGDCHDMMKLMSVITGRDMIKRDAKIYHLFKNGVCRCNNLW 475

BLAST of Tan0020092 vs. TAIR 10
Match: AT2G15690.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 294.3 bits (752), Expect = 2.8e-79
Identity = 201/588 (34.18%), Postives = 298/588 (50.68%), Query Frame = 0

Query: 156 NTHHANLVSQNGNFIQSGHNKGMMAQDLNSYNANSPRNFVHISNNEVRGVDRSVSQNILF 215
           N +H N   Q+G+   S H +    Q  +S N  +    V  S N+              
Sbjct: 56  NDYHQN--PQSGS--PSQHQRPYPPQSFDSQNQTNTNQRVPQSPNQWS-----------T 115

Query: 216 EHRENLTAYNGSNNGELQQDNYEVYGQNSWGTRTESRQYEHDTGLNYHNPM--------S 275
           +H   +  Y G N     Q      GQN       S+   H+     H P          
Sbjct: 116 QHGGQIPQYGGQNPQHGGQ-RPPYGGQNPQQGGQMSQYGGHNPQHGGHRPQYGGQRPQYG 175

Query: 276 GPNNHIPISRQYEQNSISQQYPQGQYQQGASLEQYQPIPDTTQSSMIGTQVLNNANAEEE 335
           GP N+        QN   QQ  Q QY      +Q QP   + QS     +V    + EE 
Sbjct: 176 GPGNNY-------QNQNVQQSNQSQYYTPQQQQQPQPPRSSNQSPNQMNEVAPPPSVEEV 235

Query: 336 TRVTKDRPYGGCTLEELDEFCKEGKLKEAVEILEVLEKQHIPVDLTRYLELMNACGEARS 395
            R+ + R Y                 K+A+E+   L+K  +P D   ++ L  +C   +S
Sbjct: 236 MRLCQRRLY-----------------KDAIEL---LDKGAMP-DRECFVLLFESCANLKS 295

Query: 396 LEEAKIVCNYVIKSQAPLKVSTYNKILEMYSKCGSMDDAYTIFNKMPSRNLTSWDTMITW 455
           LE +K V ++ ++S+        N ++ M+ +C S+ DA  +F+ M  +++ SW  M+  
Sbjct: 296 LEHSKKVHDHFLQSKFRGDPKLNNMVISMFGECSSITDAKRVFDHMVDKDMDSWHLMMCA 355

Query: 456 LAKNGLGEDAVDLFYEFKKAGLRPDGKIFIGVFSACSVLGDVDEGMLHFESMTKNYGITP 515
            + NG+G+DA+ LF E  K GL+P+ + F+ VF AC+ +G ++E  LHF+SM   +GI+P
Sbjct: 356 YSDNGMGDDALHLFEEMTKHGLKPNEETFLTVFLACATVGGIEEAFLHFDSMKNEHGISP 415

Query: 516 SMHHYVSIVDMLGSTGYVDEALEFIEMMPLEPGVDIWETVMNISRAHGLMELGDRCFELV 575
              HY+ ++ +LG  G++ EA ++I  +P EP  D WE + N +R HG ++L D   EL+
Sbjct: 416 KTEHYLGVLGVLGKCGHLVEAEQYIRDLPFEPTADFWEAMRNYARLHGDIDLEDYMEELM 475

Query: 576 EQLDPSRLNEQSKAGLLPVTASDLAKEREKKKLANRNLLEVRSRVHEYRAGDTSHPENDK 635
             +DPS+    +    +P       KE         N++  +SR+ E+R   T + +  K
Sbjct: 476 VDVDPSK----AVINKIPTPPPKSFKE--------TNMVTSKSRILEFR-NLTFYKDEAK 535

Query: 636 IYTLLRGLREQMKEAGYIPETRFVLHDIDQEGKNDALLGHSERLAVAYGLISSSARSPIR 695
                +G+        Y+P+TRFVLHDIDQE K  ALL HSERLA+AYG+I +  R  + 
Sbjct: 536 EMAAKKGV-------VYVPDTRFVLHDIDQEAKEQALLYHSERLAIAYGIICTPPRKTLT 579

Query: 696 IIKNLRVCGDCHSALKIISKIVGRELIIRDAKRFHHFKDGLCSCRDYW 736
           IIKNLRVCGDCH+ +KI+SKI+GR LI+RD KRFHHFKDG CSC DYW
Sbjct: 596 IIKNLRVCGDCHNFIKIMSKIIGRVLIVRDNKRFHHFKDGKCSCGDYW 579

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q680H38.5e-12654.00Pentatricopeptide repeat-containing protein At2g25580 OS=Arabidopsis thaliana OX... [more]
Q9SUU75.2e-12345.60Pentatricopeptide repeat-containing protein At4g32450, mitochondrial OS=Arabidop... [more]
Q8S8Q72.8e-10044.84Pentatricopeptide repeat-containing protein At2g34370, mitochondrial OS=Arabidop... [more]
Q9C6G26.4e-9740.64Pentatricopeptide repeat-containing protein At1g29710, mitochondrial OS=Arabidop... [more]
Q9ZQE53.9e-7834.18Pentatricopeptide repeat-containing protein At2g15690, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_038876765.10.0e+0083.44pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Benin... [more]
XP_008443720.10.0e+0081.12PREDICTED: pentatricopeptide repeat-containing protein At4g32450, mitochondrial-... [more]
TYK19847.10.0e+0080.98pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_022139059.10.0e+0079.84pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like [Momor... [more]
XP_011660234.10.0e+0080.05pentatricopeptide repeat-containing protein At4g32450, mitochondrial [Cucumis sa... [more]
Match NameE-valueIdentityDescription
A0A1S3B9H30.0e+0081.12pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like OS=Cuc... [more]
A0A5A7T5S80.0e+0081.12Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5D3D8E50.0e+0080.98Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1CCY60.0e+0079.84pentatricopeptide repeat-containing protein At4g32450, mitochondrial-like OS=Mom... [more]
A0A0A0M0610.0e+0080.05DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G6596... [more]
Match NameE-valueIdentityDescription
AT2G25580.16.1e-12754.00Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G32450.13.7e-12445.60Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G34370.12.0e-10144.84Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G29710.14.5e-9840.64Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G15690.12.8e-7934.18Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 591..611
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 77..115
NoneNo IPR availablePANTHERPTHR24015:SF1801OS01G0737900 PROTEINcoord: 4..730
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 4..730
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 347..365
e-value: 0.97
score: 9.8
coord: 440..469
e-value: 7.4E-5
score: 22.7
coord: 408..437
e-value: 2.3E-4
score: 21.2
coord: 511..535
e-value: 0.21
score: 11.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 440..472
e-value: 8.4E-6
score: 23.6
coord: 409..437
e-value: 4.3E-5
score: 21.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 406..436
score: 9.569272
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 437..471
score: 10.621557
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 341..502
e-value: 7.4E-30
score: 106.4
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 606..725
e-value: 3.2E-40
score: 136.9

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020092.1Tan0020092.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding