Tan0006998 (gene) Snake gourd v1

Overview
Name: Tan0006998
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Location: LG10: 43132832 .. 43141377 (-)
RNA-Seq Expression: Tan0006998
Synteny: Tan0006998
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCGTGTCGCTCTTCTCGATTTCCCTCCCTACTTCACCTCTCGCATTCTTTCGCAACACCGAGGATTTGCTCGTCTAGTCGTCGTTCGCCGATTGTCGCTCGTCTAGTCGCTACTCGTCAATCATTTGGTCGCCGCTCCTCGCCTGGTCGTCGATCATTTGGTTTACTCATCGATCGTCTGGTTGTCGCTGCATGCTCACTCGTCAGGTCGTTGTCATTTCCTTTTCACGAAACTCGTGCGGCGTCGCATTTGCTAGCTGATTGGGTTCAATCGTTGGATCAATATATTTGCGTTGGTGCGATTGAATCTTCAAGCTCCATCAAACTTGGTTGGTTCAGAGTGAGTTTTAATGGTAATTCTTTTGAATGCTGGATTTTCAATGTTCTCAGGGCATCTGTATAATTTATTGATGTTGTATTATTTGCAAAGTGCGATACAATTATATCTTCTGGGATTATGATTTGTAGGTTTTTATATTTAACCAATTGCTTTTCAGGGTTGAATGGAACAAATGTTGTGGAGCTTACGTGTCTGATTGACTATTAGGGCAAGTCGCCTAAAGATCTATAAAGTGAAGGGTTAAGGGTTAAGGGTCGCTCTGCAAGTACTCAGGTAGACTGATTCGAAGGTATTTTTACTTTTGAGTTCTCTACACCAGGTCTCTTCCGTACTATGGGATTTTTATTTTTCTCCACATCAAAATAAATCTGCATGAACAGGACTAAATTTGAGTAATTGAGAATTTGAGAATTTTAAGAAATTAATGTGTACTTCTAATTTGCAATATGTTTTTTTGGAAACAAACTGCAATGTCTTTTCTCTTTATTATCATTCAAATAGATTTGCCTGTTCTACTTCAGAATAATTGAGATATTACACTTACTGATGAGAATATTTAGGAGTTTTAATGAGTTACAGTCACAGATGATGATTCATTAGTCATTACTTTGGAATGAATATTGTTCTTCACTCTGTTTAAGTATCAAACTGGCTATGAAATTGTTATATTCGATTGCTTATATTACGGTGGACTACAAAATTTATATTTGCCTTTCTTTCTTTTAAGCCTATCTTCTTTTCTTCTCCATCTTCCCATTCACTGAAGTTTAAAGTTGTAGAACTTATGTTTTAATAAATTGATATTTTATGGCAGGTTTGCTTAGGGATATGGATAGCCCACATCTGCTGGTTGTTGCATGTTCATGCTATCTTTCCTAAAGTTTGATTCTTTTGAAGATTCTTTTGCTTCCATAAGTAAACTATGAAACTGGGTAACATTTTCTTTTGTCTTGTATTCTACTTGAAAGTTCTGCATTTTTATTACTGTTATTATTCTCTCTTTGCTTTTTTCACTCAAAAATCTTATATTGTTTTGTAAAAGAAGAGATTATTTTAATCTCTATCATTTTGGTGCTATTTACACTTGTCAATCTACCGCTTTTTTCATATTGACAGAATAAAATGGCTTTGTTTTAGATGTTGGCCAATAATAAGTTGAAAATAAATTTAATGGTTTATACTATTGTATTCAAGTGCTAAAGCTTTTTGCAAGAGCTTTTTCGAAGGAGTTATAATATCGATAAAAGAAGTATTTCTGAAGGCTTTAAAGTCTATGAAAATAGTATTAAAAGCCTTTAAAAATAATAAAAACCTTCGAAAAAGATAATTAACAATGAATAATTATCGATAAACAATTTGACAATTTTCGAAGACATATGACCTTTGGAAAATAATCATTTTCGAAGGTTAATGTACCTTTGATAAAAGCTTTTTCGACGGATCTATCTCGACACTTTTTGAAGGTTCCCTTAACCTTTAAAAAAGGATTCATCGAAAGTTTGTTTAGTACTTTTTCGAAGGTTGTGACCTTCGAAAATAGTCAATTTTCTTATAGTATGGGTTTAATCTTAGTTTTTTTTTTTGTTTTGTTTTTTAGGTTGAATCTTTTAGATTTGGGTTGGGTTGGGTTGGTCCAATTTTTTCAAGATCGTAGGTGATA
ATTTTGAACTCTAAATAGATTTTGAGTTGTTTGTTTGTTTGTGTGACTTTTCAAGTTCTCTCTATTGTTTTGATTTGATTTTTTGAAATTTTGTATTCATATTACTTGTAAGTGTCTCGATGTTACTCTGCATCTTGACATTTTGTCTGTTAGCATCGAGCAAGGGGGCTTCGTGTTTTTTAGTGTTGAGTAGGGGCTCTGTGGAGCCCTCGACACTGTACTCTGGGGCAACTTGGTAAATTTCTCCTCTTTATCTCAAATTAGGTTTCTCGCCTTTGGGTAAGAATCAACCCCCACTACAAGAAAAACTACATTTCTTGACGCTTGCAACTTTTAATTACTTGACGTTTTTATGAGAAAATGTCAAGTAAAATGGATTTTAAACAAAAAAAACGTCAAGAAAATGAAAAAAACGTGGAGGTTGACAGATTTTTTTAGATTTTTTTCACGCTACTGTAAAACTTGACACTTTAAAAACGTCAAATTAAAAAAAAAGGTTGGCAAAAAGCTATTAAAACCTATTAATTTTAAGAGTTATCTAACATGAAAAGAAAAGAGGGAATTTTTATTTCATTTTGCCGCTTAACCTCATTCTCTTTCGCGCACAAAAGAAGCAAATTAACCTTGTTTTCATCCTCACGCCATTCTTCTTCTTCGAAAATCTTCGTCAACCTCACATCTTCTTCTTCTTCTTTCGCACATCCTCTCAACCTCACACCAACATCGGCAACGACGCGAAAAATCCCTGCACTGACAGTACCAAGGGATGCCGAAGATTTCGAGCGCCAAACCATCGAAACCAACCTCGAAGAAGCCCTATATCTCGTTGGAGCATTACCAGAGTTGGACCGAGGACTCTTGAAGCGCTAGTTTCAAACCTAATTTCAACTTTTTCGATTTCTTGTAGGAAAGTAAGACTTCCGCTCTTGATTTTCTTTTAATTTCGCACACAGGGGGTTTCATTTTCATTTCAATGTCAATTTGATTTAACACACAGAAATGTTGTTGACGGGAGTTCACCGAAGGGCCTTCATCGATTGAAAGGGATTACCCACAACGTTCAGAAAGGAAAAATCAAATCGACCTTCAAATCGAGTTCGACTTTGTGTTTCGCGTGTTGGATTCTGCTTTATCTTCGATTCCTAAGTAAGTTCCAACCGCGATTCCTCTTCTTTAACTACTCTTGTTCGACTTTGTGTTTCGCGTGTTGGATTCTGCGTGATGGCCACAAGTGAAGTTGGGAAAAGTGTCGACTTGTTGGATTTCTTTTGTTGTTGACTTCATTTGGTTGGTCTCTTTTAGTTATTACTCTGCGGTTGTAATCTGTAAGAAAGAGTTCTCTTTTCACAGATTGTACAAAAATTATTTTCACTGATTATAAGTTCTTCGTTTCTAATGTTGTGTTATTATATCATCGTAGGTTGAGTTTGTGATTTTTGACATTTGAATTTGGGATTTGAGCCTCTGCTTTTCTTCGATCTTTTATAATTATAAATTATGTTCTTAGTTCTTATTTTTGTGGTAACTCCAATCTCTTGCTAAAATAAAATGTTTCAATTTCATTTCTAGACTTTTAATGGGAGAAGGTAATGGGAGAGTGGAGCTAAGGATGATGTCTTTCAGTCCCTATAACTATGAAGACTTTAATCCATACCTACCCTAAATCCTTACCTCATCCCAAAATTTGAGCCTTTTTTTTAGGTACATTTGAGGCTAACTTTAACCTCTCTACGTAAAAGTAGAAGGTTTTTTTTTTTAGGTACAGGTCTAGAAGTCCTAGTCAACGGAGACGGTAATTTCCTCATCCAAATCCCTTAGAATGTCTTCTCAGACGATACTGAAATTTATATATTAGGATCAAATGGTTGCCTCGAAGAATCTAGTCTGGTCATGGAGTTCCATAGTGGAATTATTATTTCATAAGCTTTAGATGTTCAAGGGTGGAAGTTTCTGTTGAAATTATTGGAGGAATATGATGCATCCAAATTTTTTTGAACCCT
CAAATTATTGTGATTGTTGTTTGTGCAGAGGATTATTTAGCAAACTCTTATTTTTGCTATCTCAGATCTGAGGTTCGAATATGTTGTTTTGTCCAATGAAATCATTCTTTTTCAGGGTAGCTATAGTATGTTGCAGAAAGGAGGAAGAAAATTCCAAAGGAGAAGACAATTTTTGCATCTGTGTTTTAGTTAACCATATTTTATGATCTTTATCGTGGAGATTAGGAGTTATTTTGGTGTTTTCAAATTTATCAATAACGTGTTGGACTTTGTTGGCGATTCAGGACCAAACATTTGAGTTCTAAGGCCGCCTTGAATCTTGGAGTAAATTGGTAAGTAGTCATTTGGATTTAAGCTCTTTTAAGTTCCTTATTTTGAATTAATGGATGTTTGAGAACTTGCGAAAATAGCATGCTTAAAAATTATGTTTATGAAGCATGGAATGTTTAGTGATCGGTGTGGAGGTTGTTTAAATTGATAATGGCCTAATTATGTAGGCATAATGGAAATAGGGGCTTAAAGACTCGAGCCCCTATTTTCCTTATCTATGAGCCTTGTTGTGGTATTTAGAGGGATTGCAGTCATGTGTATGTTGTTGAAAGTAATCAATATGGAAGAACTCGTCGAGCTTGGTAATAGAATTGAGCCTTGGGATACGTGAGTGACTAGGATTGTTTACTATAGGCATGAATGTCTGGATGCTTGATTTAAGCTGATTGTTTGGGTTGATTGAGTATGCTCTTTCAATTATGCGCTAGTAAACTCCTCAAGTGTGAGGTTGACATCTTGTTGGTTTGTCATGATTTTTGGTACGGTGGATGTGTTTTCGTTTGAGTTTTGTGAATGAGTAAGCATACTTAATGTGTTTTTCGAGCACGTGGTAGTGGGTTGAACGGTTTTGGAGTCATTTTTATAGAATAGTGTTGAGGTGTGTTAGCTTGAGTTGTTTGGGTGATTAACCGAGCATAGCATGCCTAAAAAGATTGAATGATTGATCAAGGCTTAGTTTTCTTGACTTGGGGGTGATTTTCCCAACGTGATGAGTTATGGGGCGATTTTCCTAGAGTGATTAGTGTGATGATGATGATAAAATTTTACATTCTAACATTTTCTATGCATGATGTTTCTATTTTAAATTTTTTTAGAGCAAGTTAGCTGGAAAAAATTTACAATCTTAATTCCTTTATGTATATATATTTTGCTTCAACCCAACTAAGTAGGACAATTCGTCCCCTCCCCTAATAATGCCTTGTATTTACTTTACACTTAGACTGAACATAATTTTTCGACTTTATTTCACCGGCAATGAACATACAAAGCTCAGAAAGTAATGAAACTTCTATTGATCTGAACACCAAGTTACAATCTTAATGAAACTTTTTATATATATTATTTTGGTGGGTTTGGCTTGTTAATTCATTGTTTTTGTTCCACTGTCACTCTCTCTCTCTCTTCTTTTTTCTGAAATCAATGTTGTTGGATGCTATTTTTCGAAGGGTTGACAATGTTGCAACTTTTTGGCAATGAAGTGAGTTTTCACATTGTGTCTCTTCCTTGGTTTGCAAAATAAACCAATTTTGAGTTTTACATTTCATCTCTTCCTTGACTTTGTATGATCCGATTGTGTTTGTAAAATGCTTTATGAAGTTAATCTGACAATGTGGCAATTGTTTCTTGTTTGATTCTCAACAGGGTTTCACTTCTTTCTGCAATGAAGTGAGTTTGATATTGTAGAACTGACAGACTTTAATGGAGGGTATGGCTTTCTCAATTTCACTGGAGAGATATGATTGAGTCTTGTAACGAACCATGCTTTTGCTACAACGAGCAGCCAGAGTAGAATCCAAAACCAAAAGTGGGATTTTTGTTTCCTCATTTAAGGACATCTTCAATGAAGGTCTTGTATCTGCCTCCTCTTCTTGTCCCAATTTATATTCCTTTTAAATCGATTCTTTGGAATTATCGGTAATGGAAACAGGGATATTCCTATGTTTTTCCCT
TGGATGTCCACGAAAATTACTACAAGCTTGACAGCTGCAACTGGTGCAGATGGGATGATCACTAAGGAAGTGGCACTGTCTTTTAAGGAGTGGTTCAAATCTGGAAGCAACTCTTTGTATGATCAAATCTTCCAAATCCTTCAGGGGGCTAGAGATGAACAAGAAGTGCCATATGGTCCTTCCACTGCTGATCTAGCTCTTTCTAGTCTTGGCCTTCGCCTTAATGAGCCATTTGTCTTAGATGTCCTCCGTTTTGGCTCCAAGAATGTTTTGTCTTGCCTCAAGTTCTTTGATTGGGCTGGACGCCAACCAAAGTTCTTCCATACACGTGCCACATTCAATGCCATCTTTAAGATTCTCTCCAAGGCCAAGCTCATGTCCCTCTTGTTTGATTTCATTGACAACTATGTGCAACAGAGAATGGTCCACAAGGTTCGCTTTTACAGTACATTGGTGATTGGCTATCTTTATTCTTTGGGAAGCCCATGTTTGCTCTTCAGCTGTTTGGTAAAATGCGCTTTCAAGGTCTTGATCTCGATTCTTTTGCCTACCATGTTCTTTTGAACTCTCTTGTTGAGGAGAATTGCTTTGATGCCGTGCATGTTATTGTCAAGCAGATCTCTTTGAGGGGATTTGAGAATGAGGTCACGCACTACTTAATGCTAAAAAATTTCTGCAAGCAGAATCAGTTGGATGAGGCAGAAACCTTCTTGCATGACTTAGTAGGTAGGGGGGAAGCAGTGAATGGGCGTATGCTGGGTTTTCTTATTGGTGCACTTTGCAAAAGGGGAAACTTTGAGCGGTCATGGAAGTTGGTTGAAGGGTTTAGGGACTTGGAGTTAGTTCCAATGGAGCATGTGTATGGTGTGTGGATAACAGAACTTATTCAGGCTGGGAAGCTGGAGAATGCTCTACAGTTCTTATATAGCAGAAAGTCAGATGAAAGTTACATTCCTGATGTCTTTCGTTATAATATGTTGATTCATAGACTTCTAAGAGAAAACCGGCTTCAGGAGGTGTTTGACTTGCTTACGGAGATGATGGAGGAACATATTTCCCCTGATAAAATAACTATGAATGCTGCCATGTGTTTCCTCTGCAAAGTTGGGATGGTGGATGTTGCACTTGATTTATACAACTCAAGATCAGAATTTGGGCTTTCCCCCAATGGTATGGCATATAACTATTTGATCAATACTTTATGTGGGGATGGAAGCACTGATGAAGCATACCACATCCTGAAAAGCTCCATAGATCAAGGCTACTTTCCAGGAAAAAGAACATTTTCTATACTTGCAGATGCTTTATGTCGAGAGGGAAAACTTGATAAGATGAAGGAGATGGTTATTTTTGCCTTAGAGAGGAACTTTATGCCCAGTGATTCCACATATGACAAGTTTATACTTGCTTTATGTAGGGCTAGGAGAGTTGAAGATGGATATTTGATTCATGGTGAACTTAATAGAATAAATAAAGTAGCTATAAAGAGCACCTATTTTGCTTTGATCGATGGTTTTAACAAGTCAAGGAGAGGTGATATTGCTGCAAGACTACTCATTGAGATGCAGGAAAAGGGTCACACTCCAACTAGGAAACTATTTAGAGCAGTTATCTGCTGTCTCAATGAAATGGAGAATATGGAAAAACAATTCTTTAACCTGCTTGAGTTACAGTTATCTCGTCAAGAACCCAATTGTGCGGTGTACAATAACTTCCTTTATGGAGCTGCACTTGCAAAAAAGCCTGAGCTTGCTAGAGAAGTATATCAGATGATGTTGAGGAGTGGAATTCAACCCAATTTGAGTTCTGACATTCTTTTGTTAAAGTGCTACTTATGTAGTGAACGCATTTCTGATGCTTTGAATTTTTTAAATGATTTGTATCCGACAAGAACTATTGGGAGAAAAATATCCAACACCATGGTTGTTGGTCTATGCAAAGTCAGTAAGGGTGATGTTGCACTTGATTTTTTGAGGGGCATAAGGGATAAGGGTTTAATA
CCTAGTATTGAATGCTACGAGGAGCTAGCCAAGCACTTCTGTCAGAATGAAAGATATGATTTGGTGGTAAATCTTATTAATGATCTAGATAAAGTTGGACGTCCAATAACATCCTTTCTTGGTAATATACTTCTATATAGTTCATTGAAGACTAAAAAGCTCTATGAAGCCTGGGTTAATTCAAGAGAGGGACAAGTGGAGACTTCTCAAAGTTCTATGCTTGGCCTGCTAATTGGGGCATTTTCTGGCCATATTAGAGTCAGCCAGTCTATCAAGAACTTGGAAGAAGCGATTGCCAAGTGCTTCCCACTTGACATCTATACATACAACCTATTATTGAGGAGGCTAAGCACAAATGATCTGCAACAAGCGTTTGAGTTGTTCAATCGATTGTGTCAAAAAGGGTATGAGCCTAATAGTTGGACTTATGATATATTGGTTCATGGTCTTTTCAAGCATGGGAGGACATCGGAGGCTAAGCTATTGTTGGAAGTAATGTATCGAAAAGGGTTCTGTCCGTCGGAGCGCACTAAAGCATTTATTTAA

mRNA sequence

CTCGTGTCGCTCTTCTCGATTTCCCTCCCTACTTCACCTCTCGCATTCTTTCGCAACACCGAGGATTTGCTCGTCTAGTCGTCGTTCGCCGATTGTCGCTCGTCTAGTCGCTACTCGTCAATCATTTGGTCGCCGCTCCTCGCCTGGTCGTCGATCATTTGGTTTACTCATCGATCGTCTGGTTGTCGCTGCATGCTCACTCGTCAGGTCGTTGTCATTTCCTTTTCACGAAACTCGTGCGGCGTCGCATTTGCTAGCTGATTGGGTTCAATCGTTGGATCAATATATTTGCGTTGGTGCGATTGAATCTTCAAGCTCCATCAAACTTGGTTGGTTCAGAGTGAGTTTTAATGGGTTGAATGGAACAAATGTTGTGGAGCTTACGTGTCTGATTGACTATTAGGGCAAGTCGCCTAAAGATCTATAAAGTGAAGGGTTAAGGGTTAAGGGTCGCTCTGCAAGTACTCAGGTTTGCTTAGGGATATGGATAGCCCACATCTGCTGGTTGTTGCATGTTCATGCTATCTTTCCTAAAGTTTGATTCTTTTGAAGATTCTTTTGCTTCCATAAGTAAACTATGAAACTGGGGTAGCTATAGTATGTTGCAGAAAGGAGGAAGAAAATTCCAAAGGAGAAGACAATTTTTGCATCTGTGTTTTAGTTAACCATATTTTATGATCTTTATCGTGGAGATTAGGAGTTATTTTGGTGTTTTCAAATTTATCAATAACGTGTTGGACTTTGTTGGCGATTCAGGACCAAACATTTGAGTTCTAAGGCCGCCTTGAATCTTGGAGTAAATTGGGTTTCACTTCTTTCTGCAATGAAGTGAGTTTGATATTGTAGAACTGACAGACTTTAATGGAGGGTATGGCTTTCTCAATTTCACTGGAGAGATATGATTGAGTCTTGTAACGAACCATGCTTTTGCTACAACGAGCAGCCAGAGTAGAATCCAAAACCAAAAGTGGGATTTTTGTTTCCTCATTTAAGGACATCTTCAATGAAGGTCTTGTATCTGCCTCCTCTTCTTCTGCAACTGGTGCAGATGGGATGATCACTAAGGAAGTGGCACTGTCTTTTAAGGAGTGGTTCAAATCTGGAAGCAACTCTTTGTATGATCAAATCTTCCAAATCCTTCAGGGGGCTAGAGATGAACAAGAAGTGCCATATGGTCCTTCCACTGCTGATCTAGCTCTTTCTAGTCTTGGCCTTCGCCTTAATGAGCCATTTGTCTTAGATGTCCTCCGTTTTGGCTCCAAGAATGTTTTGTCTTGCCTCAAGTTCTTTGATTGGGCTGGACGCCAACCAAAGTTCTTCCATACACGTGCCACATTCAATGCCATCTTTAAGATTCTCTCCAAGGCCAAGCTCATGTCCCTCTTGTTTGATTTCATTGACAACTATGTGCAACAGAGAATGGTCCACAAGGTTCGCTTTTACAGTACATTGCCCATGTTTGCTCTTCAGCTGTTTGGTAAAATGCGCTTTCAAGGTCTTGATCTCGATTCTTTTGCCTACCATGTTCTTTTGAACTCTCTTGTTGAGGAGAATTGCTTTGATGCCGTGCATGTTATTGTCAAGCAGATCTCTTTGAGGGGATTTGAGAATGAGGTCACGCACTACTTAATGCTAAAAAATTTCTGCAAGCAGAATCAGTTGGATGAGGCAGAAACCTTCTTGCATGACTTAGTAGGTAGGGGGGAAGCAGTGAATGGGCGTATGCTGGGTTTTCTTATTGGTGCACTTTGCAAAAGGGGAAACTTTGAGCGGTCATGGAAGTTGGTTGAAGGGTTTAGGGACTTGGAGTTAGTTCCAATGGAGCATGTGTATGGTGTGTGGATAACAGAACTTATTCAGGCTGGGAAGCTGGAGAATGCTCTACAGTTCTTATATAGCAGAAAGTCAGATGAAAGTTACATTCCTGATGTCTTTCGTTATAATATGTTGATTCATAGACTTCTAAGAGAAAACCGGCTTCAGGAGGTGTTTGACTTGCT
TACGGAGATGATGGAGGAACATATTTCCCCTGATAAAATAACTATGAATGCTGCCATGTGTTTCCTCTGCAAAGTTGGGATGGTGGATGTTGCACTTGATTTATACAACTCAAGATCAGAATTTGGGCTTTCCCCCAATGGTATGGCATATAACTATTTGATCAATACTTTATGTGGGGATGGAAGCACTGATGAAGCATACCACATCCTGAAAAGCTCCATAGATCAAGGCTACTTTCCAGGAAAAAGAACATTTTCTATACTTGCAGATGCTTTATGTCGAGAGGGAAAACTTGATAAGATGAAGGAGATGGTTATTTTTGCCTTAGAGAGGAACTTTATGCCCAGTGATTCCACATATGACAAGTTTATACTTGCTTTATGTAGGGCTAGGAGAGTTGAAGATGGATATTTGATTCATGGTGAACTTAATAGAATAAATAAAGTAGCTATAAAGAGCACCTATTTTGCTTTGATCGATGGTTTTAACAAGTCAAGGAGAGGTGATATTGCTGCAAGACTACTCATTGAGATGCAGGAAAAGGGTCACACTCCAACTAGGAAACTATTTAGAGCAGTTATCTGCTGTCTCAATGAAATGGAGAATATGGAAAAACAATTCTTTAACCTGCTTGAGTTACAGTTATCTCGTCAAGAACCCAATTGTGCGGTGTACAATAACTTCCTTTATGGAGCTGCACTTGCAAAAAAGCCTGAGCTTGCTAGAGAAGTATATCAGATGATGTTGAGGAGTGGAATTCAACCCAATTTGAGTTCTGACATTCTTTTGTTAAAGTGCTACTTATGTAGTGAACGCATTTCTGATGCTTTGAATTTTTTAAATGATTTGTATCCGACAAGAACTATTGGGAGAAAAATATCCAACACCATGGTTGTTGGTCTATGCAAAGTCAGTAAGGGTGATGTTGCACTTGATTTTTTGAGGGGCATAAGGGATAAGGGTTTAATACCTAGTATTGAATGCTACGAGGAGCTAGCCAAGCACTTCTGTCAGAATGAAAGATATGATTTGGTGGTAAATCTTATTAATGATCTAGATAAAGTTGGACGTCCAATAACATCCTTTCTTGGTAATATACTTCTATATAGTTCATTGAAGACTAAAAAGCTCTATGAAGCCTGGGTTAATTCAAGAGAGGGACAAGTGGAGACTTCTCAAAGTTCTATGCTTGGCCTGCTAATTGGGGCATTTTCTGGCCATATTAGAGTCAGCCAGTCTATCAAGAACTTGGAAGAAGCGATTGCCAAGTGCTTCCCACTTGACATCTATACATACAACCTATTATTGAGGAGGCTAAGCACAAATGATCTGCAACAAGCGTTTGAGTTGTTCAATCGATTGTGTCAAAAAGGGTATGAGCCTAATAGTTGGACTTATGATATATTGGTTCATGGTCTTTTCAAGCATGGGAGGACATCGGAGGCTAAGCTATTGTTGGAAGTAATGTATCGAAAAGGGTTCTGTCCGTCGGAGCGCACTAAAGCATTTATTTAA

Coding sequence (CDS)

ATGCTTTTGCTACAACGAGCAGCCAGAGTAGAATCCAAAACCAAAAGTGGGATTTTTGTTTCCTCATTTAAGGACATCTTCAATGAAGGTCTTGTATCTGCCTCCTCTTCTTCTGCAACTGGTGCAGATGGGATGATCACTAAGGAAGTGGCACTGTCTTTTAAGGAGTGGTTCAAATCTGGAAGCAACTCTTTGTATGATCAAATCTTCCAAATCCTTCAGGGGGCTAGAGATGAACAAGAAGTGCCATATGGTCCTTCCACTGCTGATCTAGCTCTTTCTAGTCTTGGCCTTCGCCTTAATGAGCCATTTGTCTTAGATGTCCTCCGTTTTGGCTCCAAGAATGTTTTGTCTTGCCTCAAGTTCTTTGATTGGGCTGGACGCCAACCAAAGTTCTTCCATACACGTGCCACATTCAATGCCATCTTTAAGATTCTCTCCAAGGCCAAGCTCATGTCCCTCTTGTTTGATTTCATTGACAACTATGTGCAACAGAGAATGGTCCACAAGGTTCGCTTTTACAGTACATTGCCCATGTTTGCTCTTCAGCTGTTTGGTAAAATGCGCTTTCAAGGTCTTGATCTCGATTCTTTTGCCTACCATGTTCTTTTGAACTCTCTTGTTGAGGAGAATTGCTTTGATGCCGTGCATGTTATTGTCAAGCAGATCTCTTTGAGGGGATTTGAGAATGAGGTCACGCACTACTTAATGCTAAAAAATTTCTGCAAGCAGAATCAGTTGGATGAGGCAGAAACCTTCTTGCATGACTTAGTAGGTAGGGGGGAAGCAGTGAATGGGCGTATGCTGGGTTTTCTTATTGGTGCACTTTGCAAAAGGGGAAACTTTGAGCGGTCATGGAAGTTGGTTGAAGGGTTTAGGGACTTGGAGTTAGTTCCAATGGAGCATGTGTATGGTGTGTGGATAACAGAACTTATTCAGGCTGGGAAGCTGGAGAATGCTCTACAGTTCTTATATAGCAGAAAGTCAGATGAAAGTTACATTCCTGATGTCTTTCGTTATAATATGTTGATTCATAGACTTCTAAGAGAAAACCGGCTTCAGGAGGTGTTTGACTTGCTTACGGAGATGATGGAGGAACATATTTCCCCTGATAAAATAACTATGAATGCTGCCATGTGTTTCCTCTGCAAAGTTGGGATGGTGGATGTTGCACTTGATTTATACAACTCAAGATCAGAATTTGGGCTTTCCCCCAATGGTATGGCATATAACTATTTGATCAATACTTTATGTGGGGATGGAAGCACTGATGAAGCATACCACATCCTGAAAAGCTCCATAGATCAAGGCTACTTTCCAGGAAAAAGAACATTTTCTATACTTGCAGATGCTTTATGTCGAGAGGGAAAACTTGATAAGATGAAGGAGATGGTTATTTTTGCCTTAGAGAGGAACTTTATGCCCAGTGATTCCACATATGACAAGTTTATACTTGCTTTATGTAGGGCTAGGAGAGTTGAAGATGGATATTTGATTCATGGTGAACTTAATAGAATAAATAAAGTAGCTATAAAGAGCACCTATTTTGCTTTGATCGATGGTTTTAACAAGTCAAGGAGAGGTGATATTGCTGCAAGACTACTCATTGAGATGCAGGAAAAGGGTCACACTCCAACTAGGAAACTATTTAGAGCAGTTATCTGCTGTCTCAATGAAATGGAGAATATGGAAAAACAATTCTTTAACCTGCTTGAGTTACAGTTATCTCGTCAAGAACCCAATTGTGCGGTGTACAATAACTTCCTTTATGGAGCTGCACTTGCAAAAAAGCCTGAGCTTGCTAGAGAAGTATATCAGATGATGTTGAGGAGTGGAATTCAACCCAATTTGAGTTCTGACATTCTTTTGTTAAAGTGCTACTTATGTAGTGAACGCATTTCTGATGCTTTGAATTTTTTAAATGATTTGTATCCGACAAGAACTATTGGGAGAAAAATATCCAACACCATGGTTGTTGGTCTATGCAAAGTCAGTAAGGG
TGATGTTGCACTTGATTTTTTGAGGGGCATAAGGGATAAGGGTTTAATACCTAGTATTGAATGCTACGAGGAGCTAGCCAAGCACTTCTGTCAGAATGAAAGATATGATTTGGTGGTAAATCTTATTAATGATCTAGATAAAGTTGGACGTCCAATAACATCCTTTCTTGGTAATATACTTCTATATAGTTCATTGAAGACTAAAAAGCTCTATGAAGCCTGGGTTAATTCAAGAGAGGGACAAGTGGAGACTTCTCAAAGTTCTATGCTTGGCCTGCTAATTGGGGCATTTTCTGGCCATATTAGAGTCAGCCAGTCTATCAAGAACTTGGAAGAAGCGATTGCCAAGTGCTTCCCACTTGACATCTATACATACAACCTATTATTGAGGAGGCTAAGCACAAATGATCTGCAACAAGCGTTTGAGTTGTTCAATCGATTGTGTCAAAAAGGGTATGAGCCTAATAGTTGGACTTATGATATATTGGTTCATGGTCTTTTCAAGCATGGGAGGACATCGGAGGCTAAGCTATTGTTGGAAGTAATGTATCGAAAAGGGTTCTGTCCGTCGGAGCGCACTAAAGCATTTATTTAA

Protein sequence

MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSASSSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTLPMFALQLFGKMRFQGLDLDSFAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPSERTKAFI
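
The protein sequence above should correspond to an in-frame translation of the coding sequence. The following is a minimal sketch of that check, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the two short fragments (copied from the start and end of the CDS listed above) are illustrative choices, not part of the database record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: this nuclear gene is translated with the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 and last 18 bases of the CDS shown above (hypothetical variable names).
cds_start = "ATGCTTTTGCTACAACGAGCAGCCAGAGTAGAATCC"
cds_end = "ACTAAAGCATTTATTTAA"

print(translate(cds_start))  # MLLLQRAARVES
print(translate(cds_end))    # TKAFI
```

Both fragments agree with the first and last residues of the protein sequence (MLLLQRAARVES... and ...TKAFI). Passing the full CDS to the same function would check the whole sequence.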
Homology
BLAST of Tan0006998 vs. ExPASy Swiss-Prot
Match: Q8GZA6 (Pentatricopeptide repeat-containing protein At1g71210, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g71210 PE=1 SV=1)

HSP 1 Score: 699.9 bits (1805), Expect = 3.6e-200
Identity = 385/844 (45.62%), Positives = 525/844 (62.20%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSASSSSATGADGMITKEVALSFKEWFK- 60
           MLL +R   + + +       +  D       +  SSS    D ++ +     +K+WFK 
Sbjct: 16  MLLRRRILSLSASSFRNFTSGNNGDAIPFSTFTKPSSSIAPGDFLVRE-----WKDWFKH 75

Query: 61  ---SGSNSLYDQIFQILQGARDEQEVPYGPSTADLALSSLGLRLNEPFVLDVLRFGSKNV 120
                S+ L D+IF IL+   ++ +         L LS+L LRL E FVLDVL     ++
Sbjct: 76  RDVKQSHQLIDRIFDILRAPSNDGD----DRAFYLHLSNLRLRLTEKFVLDVLSHTRYDI 135

Query: 121 LSCLKFFDWAGRQPKFFHTRATFNAIFKILSKAKLMSLLFDFIDNYVQ-QRMVHKVRFYS 180
           L CLKFFDWA RQP F HTRATF+AIFKIL  AKL++L+ DF+D  V  +   H +R   
Sbjct: 136 LCCLKFFDWAARQPGFHHTRATFHAIFKILRGAKLVTLMIDFLDRSVGFESCRHSLRLCD 195

Query: 181 TLPM---------FALQLFGKMRFQGLDLDSFAYHVLLNSLVEENCFDAVHVIVKQISLR 240
            L +          ALQ FG MRF+GLDLDSF YHVLLN+LVEE CFD+  VI  QIS+R
Sbjct: 196 ALVVGYAVAGRTDIALQHFGNMRFRGLDLDSFGYHVLLNALVEEKCFDSFDVIFDQISVR 255

Query: 241 GFENEVTHYLMLKNFCKQNQLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSW 300
           GF   VTH +++K FCKQ +LDEAE +L  L+    A  G  LG L+ ALC +  F+ + 
Sbjct: 256 GFVCAVTHSILVKKFCKQGKLDEAEDYLRALLPNDPAGCGSGLGILVDALCSKRKFQEAT 315

Query: 301 KLVEGFRDLELVPMEHVYGVWITELIQAGKLENALQFLYSRKSDESYIPDVFRYNMLIHR 360
           KL++  + +  V M+  Y +WI  LI+AG L N   FL      E    +VFRYN ++ +
Sbjct: 316 KLLDEIKLVGTVNMDRAYNIWIRALIKAGFLNNPADFLQKISPLEGCELEVFRYNSMVFQ 375

Query: 361 LLRENRLQEVFDLLTEMMEEHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPN 420
           LL+EN L  V+D+LTEMM   +SP+K TMNAA+CF CK G VD AL+LY SRSE G +P 
Sbjct: 376 LLKENNLDGVYDILTEMMVRGVSPNKKTMNAALCFFCKAGFVDEALELYRSRSEIGFAPT 435

Query: 421 GMAYNYLINTLCGDGSTDEAYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVI 480
            M+YNYLI+TLC + S ++AY +LK +ID+G+F G +TFS L +ALC +GK D  +E+VI
Sbjct: 436 AMSYNYLIHTLCANESVEQAYDVLKGAIDRGHFLGGKTFSTLTNALCWKGKPDMARELVI 495

Query: 481 FALERNFMPSDSTYDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFALIDGFNKSR 540
            A ER+ +P      K I ALC   +VED  +I+   N+         + +LI G     
Sbjct: 496 AAAERDLLPKRIAGCKIISALCDVGKVEDALMINELFNKSGVDTSFKMFTSLIYGSITLM 555

Query: 541 RGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEMENMEKQFF-NLLELQLSRQEPNCAVY 600
           RGDIAA+L+I MQEKG+TPTR L+R VI C+ EME+ EK FF  LL+ QLS  E     Y
Sbjct: 556 RGDIAAKLIIRMQEKGYTPTRSLYRNVIQCVCEMESGEKNFFTTLLKFQLSLWEHKVQAY 615

Query: 601 NNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDLYP 660
           N F+ GA  A KP+LAR VY MM R GI P ++S+IL+L+ YL +E+I+DAL+F +DL  
Sbjct: 616 NLFIEGAGFAGKPKLARLVYDMMDRDGITPTVASNILMLQSYLKNEKIADALHFFHDLRE 675

Query: 661 TRTIGRKISNTMVVGLCKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERYDLV 720
                +++   M+VGLCK +K D A+ FL  ++ +GL PSIECYE   +  C  E+YD  
Sbjct: 676 QGKTKKRLYQVMIVGLCKANKLDDAMHFLEEMKGEGLQPSIECYEVNIQKLCNEEKYDEA 735

Query: 721 VNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEAWVNSREGQVETSQSSMLGLLIGAFS 780
           V L+N+  K GR IT+F+GN+LL++++K+K +YEAW   R  + +  +   LG LIG FS
Sbjct: 736 VGLVNEFRKSGRRITAFIGNVLLHNAMKSKGVYEAWTRMRNIEDKIPEMKSLGELIGLFS 795

Query: 781 GHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLSTNDLQQAFELFNRLCQKGYEPNSWT 830
           G I +   +K L+E I KC+PLD+YTYN+LLR +  N  + A+E+  R+ ++GY PN  T
Sbjct: 796 GRIDMEVELKRLDEVIEKCYPLDMYTYNMLLRMIVMNQAEDAYEMVERIARRGYVPNERT 850
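
The identity and positives percentages reported for this HSP are the matched-column counts divided by the alignment length (844 columns, gaps included). A small sketch of that arithmetic follows; the helper name `percent` is hypothetical and not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / alignment_length:.2f}"


# Counts taken from the HSP statistics of the Q8GZA6 hit above.
print(percent(385, 844))  # identity: 45.62
print(percent(525, 844))  # positives: 62.20
```

The same calculation reproduces the figures of the other hits, for example 176/795 for Q9M907.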

BLAST of Tan0006998 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 3.0e-45
Identity = 176/795 (22.14%), Positives = 338/795 (42.52%), Query Frame = 0

Query: 83  PYGPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAI 142
           P+GPS A+  LS+L  +    FV+ VLR   K+V   +++F W  R+ +  H   ++N++
Sbjct: 47  PWGPS-AENTLSALSFKPQPEFVIGVLR-RLKDVNRAIEYFRWYERRTELPHCPESYNSL 106

Query: 143 FKILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTLPMFALQLFG-----KMRFQGLDLDS 202
             ++++ +     FD +D  + +  V    F  ++      + G     K+R +G D+  
Sbjct: 107 LLVMARCR----NFDALDQILGEMSV--AGFGPSVNTCIEMVLGCVKANKLR-EGYDVVQ 166

Query: 203 F-----------AYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHY-LMLKNFCKQN 262
                       AY  L+ +    N  D +  + +Q+   G+E  V  +  +++ F K+ 
Sbjct: 167 MMRKFKFRPAFSAYTTLIGAFSAVNHSDMMLTLFQQMQELGYEPTVHLFTTLIRGFAKEG 226

Query: 263 QLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYG 322
           ++D A + L ++       +  +    I +  K G  + +WK         L P E  Y 
Sbjct: 227 RVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIEANGLKPDEVTYT 286

Query: 323 VWITELIQAGKLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMME 382
             I  L +A +L+ A++ ++        +P  + YN +I       +  E + LL     
Sbjct: 287 SMIGVLCKANRLDEAVE-MFEHLEKNRRVPCTYAYNTMIMGYGSAGKFDEAYSLLERQRA 346

Query: 383 EHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDE 442
           +   P  I  N  +  L K+G VD AL ++    +   +PN   YN LI+ LC  G  D 
Sbjct: 347 KGSIPSVIAYNCILTCLRKMGKVDEALKVFEEMKK-DAAPNLSTYNILIDMLCRAGKLDT 406

Query: 443 AYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFIL 502
           A+ +  S    G FP  RT +I+ D LC+  KLD+   M      +   P + T+   I 
Sbjct: 407 AFELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFCSLID 466

Query: 503 ALCRARRVEDGYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTP 562
            L +  RV+D Y ++ ++   +       Y +LI  F    R +   ++  +M  +  +P
Sbjct: 467 GLGKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQNCSP 526

Query: 563 TRKLFRAVICCLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVY 622
             +L    + C+ +    EK      E++  R  P+   Y+  ++G   A       E++
Sbjct: 527 DLQLLNTYMDCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFANETYELF 586

Query: 623 QMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKIS----NTMVVGL 682
             M   G   +  +  +++  +    +++ A   L ++   +T G + +     +++ GL
Sbjct: 587 YSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEM---KTKGFEPTVVTYGSVIDGL 646

Query: 683 CKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITS 742
            K+ + D A       + K +  ++  Y  L   F +  R D    ++ +L + G     
Sbjct: 647 AKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPNL 706

Query: 743 FLGNILLYSSLKTKKLYEAWV--NSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEE 802
           +  N LL + +K +++ EA V   S +    T      G+LI       + +++    +E
Sbjct: 707 YTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQE 766

Query: 803 AIAKCFPLDIYTYNLLLRRLS-TNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGR 854
              +       +Y  ++  L+   ++ +A  LF+R    G  P+S  Y+ ++ GL    R
Sbjct: 767 MQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGNR 826

BLAST of Tan0006998 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 4.3e-44
Identity = 148/636 (23.27%), Positives = 271/636 (42.61%), Query Frame = 0

Query: 233 THYLMLKNFCKQNQLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLV-EG 292
           T+ +++   C+  +LD     L +++ +G  V+      L+  LC       +  +V   
Sbjct: 89  TYGILIGCCCRAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLCADKRTSDAMDIVLRR 148

Query: 293 FRDLELVPMEHVYGVWITELIQAGKLENALQFLYSRKSDE--SYIPDVFRYNMLIHRLLR 352
             +L  +P    Y + +  L    + + AL+ L+    D      PDV  Y  +I+   +
Sbjct: 149 MTELGCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYTTVINGFFK 208

Query: 353 ENRLQEVFDLLTEMMEEHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMA 412
           E    + +    EM++  I PD +T N+ +  LCK   +D A+++ N+  + G+ P+ M 
Sbjct: 209 EGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKNGVMPDCMT 268

Query: 413 YNYLINTLCGDGSTDEAYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVIFAL 472
           YN +++  C  G   EA   LK     G  P   T+S+L D LC+ G+  + +++     
Sbjct: 269 YNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEARKIFDSMT 328

Query: 473 ERNFMPSDSTYDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFA---LIDGFNKSR 532
           +R   P  +TY   +        + +   +HG L+ + +  I   ++    LI  + K  
Sbjct: 329 KRGLKPEITTYGTLLQGYATKGALVE---MHGLLDLMVRNGIHPDHYVFSILICAYAKQG 388

Query: 533 RGDIAARLLIEMQEKGHTPTRKLFRAVI---CCLNEMENMEKQFFNLLELQLSRQEPNCA 592
           + D A  +  +M+++G  P    + AVI   C    +E+    F  +++  LS   P   
Sbjct: 389 KVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLS---PGNI 448

Query: 593 VYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDL 652
           VYN+ ++G     K E A E+   ML  GI  N      ++  +    R+ ++      +
Sbjct: 449 VYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELM 508

Query: 653 YPTRTIGRKIS-NTMVVGLCKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERY 712
                    I+ NT++ G C   K D A+  L G+   GL P+   Y  L   +C+  R 
Sbjct: 509 VRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRM 568

Query: 713 DLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEAWVNSREGQVETSQSSMLGLLIG 772
           +  + L  +++  G        NI+L    +T++              T+ +  L     
Sbjct: 569 EDALVLFKEMESSGVSPDIITYNIILQGLFQTRR--------------TAAAKEL----- 628

Query: 773 AFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLSTNDL-QQAFELFNRLCQKGYEP 832
               ++R+++S   +E          + TYN++L  L  N L   A ++F  LC    + 
Sbjct: 629 ----YVRITESGTQIE----------LSTYNIILHGLCKNKLTDDALQMFQNLCLMDLKL 685

Query: 833 NSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPS 858
            + T++I++  L K GR  EAK L       G  P+
Sbjct: 689 EARTFNIMIDALLKVGRNDEAKDLFVAFSSNGLVPN 685

BLAST of Tan0006998 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 1.7e-40
Identity = 148/677 (21.86%), Positives = 294/677 (43.43%), Query Frame = 0

Query: 92  ALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILSKAKL 151
           ALSS  ++L     LD LR    +  + L+ F+ A ++P F    A +  I   L +   
Sbjct: 45  ALSSTDVKL-----LDSLR-SQPDDSAALRLFNLASKKPNFSPEPALYEEILLRLGR--- 104

Query: 152 MSLLFDFIDNYVQQRMVHKVRFYSTLPMFALQLFGKMRFQ--------------GLDLDS 211
            S  FD +   ++     +    ++  +  ++ + +   Q              GL  D+
Sbjct: 105 -SGSFDDMKKILEDMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDT 164

Query: 212 FAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEV-THYLMLKNFCKQNQLDEAETFLHD 271
             Y+ +LN LV+ N    V +   ++S+ G + +V T  +++K  C+ +QL  A   L D
Sbjct: 165 HFYNRMLNLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLED 224

Query: 272 LVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGK 331
           +   G   + +    ++    + G+ + + ++ E   +           V +    + G+
Sbjct: 225 MPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGR 284

Query: 332 LENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMN 391
           +E+AL F+    + + + PD + +N L++ L +   ++   +++  M++E   PD  T N
Sbjct: 285 VEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYN 344

Query: 392 AAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILK----- 451
           + +  LCK+G V  A+++ +       SPN + YN LI+TLC +   +EA  + +     
Sbjct: 345 SVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSK 404

Query: 452 ---------SSIDQGYF---------------------PGKRTFSILADALCREGKLDK- 511
                    +S+ QG                       P + T+++L D+LC +GKLD+ 
Sbjct: 405 GILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEA 464

Query: 512 ---MKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFA 571
              +K+M +    R+ +    TY+  I   C+A +  +   I  E+          TY  
Sbjct: 465 LNMLKQMELSGCARSVI----TYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNT 524

Query: 572 LIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEMENMEKQFFNLLELQLSR 631
           LIDG  KSRR + AA+L+ +M  +G  P +  + +++       +++K    +  +  + 
Sbjct: 525 LIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNG 584

Query: 632 QEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDAL 691
            EP+   Y   + G   A + E+A ++ + +   GI     +   +++      + ++A+
Sbjct: 585 CEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAI 644

Query: 692 NFLNDLYPTRTIGRKISNTMVV--GLCKVSKGDV--ALDFLRGIRDKGLIPSIECYEELA 711
           N   ++           +  +V  GLC    G +  A+DFL  + +KG +P       LA
Sbjct: 645 NLFREMLEQNEAPPDAVSYRIVFRGLCN-GGGPIREAVDFLVELLEKGFVPEFSSLYMLA 704

BLAST of Tan0006998 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 1.0e-37
Identity = 127/535 (23.74%), Positives = 217/535 (40.56%), Query Frame = 0

Query: 200 YHVLLNSLVEENCFDAVHVIVKQISLRGFENEV-THYLMLKNFCKQNQLDEAETFLHDLV 259
           ++ L +++     +D V    K + L G E+++ T  +M+  +C++ +L  A + L    
Sbjct: 73  FNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMTIMINCYCRKKKLLFAFSVLGRAW 132

Query: 260 GRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLE 319
             G   +      L+   C  G    +  LV+   +++  P        I  L   G++ 
Sbjct: 133 KLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVS 192

Query: 320 NALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAA 379
            AL  L  R  +  + PD   Y  +++RL +        DL  +M E +I    +  +  
Sbjct: 193 EAL-VLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEERNIKASVVQYSIV 252

Query: 380 MCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGY 439
           +  LCK G  D AL L+N     G+  + + Y+ LI  LC DG  D+   +L+  I +  
Sbjct: 253 IDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNI 312

Query: 440 FPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYL 499
            P   TFS L D   +EGKL + KE+    + R   P   TY+  I   C+   + +   
Sbjct: 313 IPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQ 372

Query: 500 IHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLN 559
           +   +          TY  LI+ + K++R D   RL  E+  KG                
Sbjct: 373 MFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLI-------------- 432

Query: 560 EMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLS 619
                                PN   YN  + G   + K   A+E++Q M+  G+ P++ 
Sbjct: 433 ---------------------PNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVV 492

Query: 620 SDILLLKCYLCSERISDALNFLNDLYPTR-TIGRKISNTMVVGLCKVSKGDVALDFLRGI 679
           +  +LL     +  ++ AL     +  +R T+G  I N ++ G+C  SK D A      +
Sbjct: 493 TYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSL 552

Query: 680 RDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSL 733
            DKG+ P +  Y  +    C+         L   + + G     F  NIL+ + L
Sbjct: 553 SDKGVKPDVVTYNVMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHL 571

BLAST of Tan0006998 vs. NCBI nr
Match: XP_022946522.1 (pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita moschata] >XP_022946523.1 pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita moschata])

HSP 1 Score: 1510.4 bits (3909), Expect = 0.0e+00
Identity = 760/909 (83.61%), Positives = 808/909 (88.89%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGL----------------------------- 60
           M+LLQR ARVESKTK+GIFVSSFKDIFNE L                             
Sbjct: 1   MILLQRVARVESKTKTGIFVSSFKDIFNEALGSSSPCPNLYSFSSVSGISDNGNRIVPMF 60

Query: 61  -------VSASSSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPY 120
                  V  SS++A G D M+T+EVALSFKEWFKSGSN+LYDQIFQILQ ARD+QE+PY
Sbjct: 61  SPWMSTGVGTSSTAAAGEDWMVTQEVALSFKEWFKSGSNALYDQIFQILQMARDDQEMPY 120

Query: 121 GPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFK 180
           G STADLALSSLGLRLNE FVLDVLR+GSK+VLSCLKFFDWAG QP FFHTRATF AIFK
Sbjct: 121 GHSTADLALSSLGLRLNELFVLDVLRYGSKDVLSCLKFFDWAGHQPGFFHTRATFVAIFK 180

Query: 181 ILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDL 240
           ILSKAKLMSL+FDF++NYVQQ+ VHK RFY+TL         P+FALQLFGKMRFQGLDL
Sbjct: 181 ILSKAKLMSLMFDFLENYVQQKFVHKARFYNTLVMGYAVAGKPIFALQLFGKMRFQGLDL 240

Query: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLH 300
           DSFAYHVLLNSLVEENCFDAVHVIVKQI+LRGF NE+THYLMLKNFCKQ+QLDEAETFLH
Sbjct: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQITLRGFVNEITHYLMLKNFCKQSQLDEAETFLH 300

Query: 301 DLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAG 360
           DLVG G+ +NGRMLGFL+ ALCK GNFER+WKLVEGFRDLELV M+HVYGVWITELI+AG
Sbjct: 301 DLVGSGKGLNGRMLGFLVSALCKSGNFERAWKLVEGFRDLELVSMDHVYGVWITELIRAG 360

Query: 361 KLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITM 420
            LE ALQFLYSRKSDESYIPDVFRYNMLIHRLLR+NRLQEVFDLLTEMMEEHISPDK+TM
Sbjct: 361 MLERALQFLYSRKSDESYIPDVFRYNMLIHRLLRDNRLQEVFDLLTEMMEEHISPDKVTM 420

Query: 421 NAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSID 480
           NAAMCFLCK GMVDVALDLYNSRSE+ LSPN MAYNYL+NTLCGDGSTDEAYHILK SID
Sbjct: 421 NAAMCFLCKAGMVDVALDLYNSRSEYRLSPNSMAYNYLVNTLCGDGSTDEAYHILKHSID 480

Query: 481 QGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVED 540
           QGYFPGK+TFSILADALCREGKLDKMKE+VIF+LERNFMPS STYDKFI ALC+ARRVED
Sbjct: 481 QGYFPGKKTFSILADALCREGKLDKMKELVIFSLERNFMPSGSTYDKFISALCKARRVED 540

Query: 541 GYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVIC 600
           GYLIHGELNRIN VAIKSTYF LIDGFNK RRGDI+ARLLIEMQEKGH PTRK+FR VI 
Sbjct: 541 GYLIHGELNRINVVAIKSTYFVLIDGFNKLRRGDISARLLIEMQEKGHNPTRKIFRTVIH 600

Query: 601 CLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQP 660
           CLNEMENMEKQFFNLLELQLSRQEP+  VYNNF+YGAALAKK ELAREVYQMMLRSGIQP
Sbjct: 601 CLNEMENMEKQFFNLLELQLSRQEPSPEVYNNFIYGAALAKKSELAREVYQMMLRSGIQP 660

Query: 661 NLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLR 720
           NLSSDILLLK YL SERISDALNFL+DLY TRTIGRKISN MVVGLCK +K DVALD LR
Sbjct: 661 NLSSDILLLKSYLHSERISDALNFLSDLYQTRTIGRKISNVMVVGLCKANKADVALDVLR 720

Query: 721 GIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTK 780
            +RD+GLIPSIECYEELAKH C NERYDLVVNLINDLDKVGRPITSFLGN LLYSS+KT+
Sbjct: 721 DMRDRGLIPSIECYEELAKHLCHNERYDLVVNLINDLDKVGRPITSFLGNTLLYSSMKTQ 780

Query: 781 KLYEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840
           KLYEAWV+SREGQVETS+SSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL
Sbjct: 781 KLYEAWVSSREGQVETSRSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840

Query: 841 LRRLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFC 865
           LRRLS NDLQQAFELFNRLC+KGY PN WTYDILVH LFKHGRTSEAK LLEVMYRKGF 
Sbjct: 841 LRRLSANDLQQAFELFNRLCEKGYVPNRWTYDILVHALFKHGRTSEAKRLLEVMYRKGFA 900
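
Note: the HSP summary lines in these reports can be read programmatically. The following is a minimal Python sketch, assuming the summary line keeps the layout shown above ("Identity = a/b (x%), Positives = c/d (y%)"). The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of any BLAST tool. The pattern also accepts the "Postives" spelling that some report versions emit.

```python
import re

# Matches the identity / positives counts on an HSP summary line.
# "Posi?tives" tolerates both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if no match."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identities, ident_len, positives, pos_len = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percentages are recomputed from the counts rather than trusted
        # from the text, so they can be checked against the report.
        "identity_pct": round(100 * identities / ident_len, 2),
        "positive_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    example = (
        "Identity = 760/909 (83.61%), Postives = 808/909 (88.89%), "
        "Query Frame = 0"
    )
    print(parse_hsp_stats(example))
```

Recomputing the percentages from the raw counts gives a quick consistency check on each reported hit.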

BLAST of Tan0006998 vs. NCBI nr
Match: XP_023545233.1 (pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023545234.1 pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1508.4 bits (3904), Expect = 0.0e+00
Identity = 758/909 (83.39%), Positives = 811/909 (89.22%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGL----------------------------- 60
           M+LLQR ARVESKTK+GIFVSSFKDIFNE L                             
Sbjct: 1   MILLQRVARVESKTKTGIFVSSFKDIFNEALGSSSPCPNLYSFSSVGGISDNGNRIVPMF 60

Query: 61  -------VSASSSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPY 120
                  V  SS++A GAD M+T+EVALSFKEWFKSGSN+LYDQIFQILQ ARD+QE+PY
Sbjct: 61  SPWMSTGVGTSSTAAGGADWMVTQEVALSFKEWFKSGSNALYDQIFQILQMARDDQEMPY 120

Query: 121 GPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFK 180
           G STADLALSSLGLRLNE FVLDVLR+GSK+VLSCLKFFDWAG QP FFHTRATF AIFK
Sbjct: 121 GHSTADLALSSLGLRLNELFVLDVLRYGSKDVLSCLKFFDWAGHQPGFFHTRATFVAIFK 180

Query: 181 ILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDL 240
           ILSKAKLMSL+FDF++NYVQQ+ VHK RFY+TL         P+FALQLFGKMRFQGLDL
Sbjct: 181 ILSKAKLMSLMFDFLENYVQQKFVHKARFYNTLVMGYAVAGKPIFALQLFGKMRFQGLDL 240

Query: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLH 300
           DSFAYHVLLNSLVEENCFDAVHVIVKQI+LRGF NE+THYLMLKNFCKQ+QLDEAETFLH
Sbjct: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQITLRGFVNEITHYLMLKNFCKQSQLDEAETFLH 300

Query: 301 DLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAG 360
           DLVG G+ +NGRMLGFL+ ALCK GNFER+WKLVE FRDLELV M+HVYGVWITELI+AG
Sbjct: 301 DLVGSGKGLNGRMLGFLVSALCKSGNFERAWKLVEEFRDLELVSMDHVYGVWITELIRAG 360

Query: 361 KLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITM 420
           KLE ALQFLYSRKSDESYIPDVFRYNMLIHRLLR+NRLQEVFDLLTEMMEEHISPDK+TM
Sbjct: 361 KLERALQFLYSRKSDESYIPDVFRYNMLIHRLLRDNRLQEVFDLLTEMMEEHISPDKVTM 420

Query: 421 NAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSID 480
           NAAMCFLCK GMVDVALDLYNSRSE+ LSPN MAYNYL+NTLCGDGSTDEAYHILK SID
Sbjct: 421 NAAMCFLCKAGMVDVALDLYNSRSEYRLSPNSMAYNYLVNTLCGDGSTDEAYHILKHSID 480

Query: 481 QGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVED 540
           QGYFPGK+TFSILADALCREGKLDKMKE+VIF+LERNFMPS STYDKFI ALC+A+RVED
Sbjct: 481 QGYFPGKKTFSILADALCREGKLDKMKELVIFSLERNFMPSGSTYDKFISALCKAKRVED 540

Query: 541 GYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVIC 600
           GYLIHGELNRIN VAIKSTYF LIDGFNK RRGDI+ARLLIEMQEKGH PTRK+FR+VI 
Sbjct: 541 GYLIHGELNRINVVAIKSTYFVLIDGFNKLRRGDISARLLIEMQEKGHNPTRKIFRSVIH 600

Query: 601 CLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQP 660
           CLNEMENMEKQFFNLLELQLSRQEP+  VYNNF+YGAALAKKPELAREVYQMMLRSGI+P
Sbjct: 601 CLNEMENMEKQFFNLLELQLSRQEPSPEVYNNFIYGAALAKKPELAREVYQMMLRSGIRP 660

Query: 661 NLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLR 720
           NLSSDILLLK YL SERISDALNF++DLY TRTIGRKISN MVVGLCK +K DVALD LR
Sbjct: 661 NLSSDILLLKSYLHSERISDALNFVSDLYQTRTIGRKISNVMVVGLCKANKADVALDVLR 720

Query: 721 GIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTK 780
            +RD+G+IPSIECYEELAKH C NERYDLVVNLINDLDKVGRPITSFLGN LLYSSLKT+
Sbjct: 721 DMRDRGVIPSIECYEELAKHLCHNERYDLVVNLINDLDKVGRPITSFLGNTLLYSSLKTQ 780

Query: 781 KLYEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840
           KLY+AWV+SREGQVETS+SSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL
Sbjct: 781 KLYDAWVSSREGQVETSRSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840

Query: 841 LRRLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFC 865
           LRRLS NDLQQAFELFNRLC+KGY PN WTYDILVH LFKHGRTSEAK LLEVMYRKGF 
Sbjct: 841 LRRLSANDLQQAFELFNRLCEKGYVPNRWTYDILVHALFKHGRTSEAKRLLEVMYRKGFT 900

BLAST of Tan0006998 vs. NCBI nr
Match: KAG7030102.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1504.2 bits (3893), Expect = 0.0e+00
Identity = 758/909 (83.39%), Positives = 807/909 (88.78%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGL----------------------------- 60
           M+LLQR ARVESKTK+GIFVSSFKDIFNE L                             
Sbjct: 1   MILLQRVARVESKTKTGIFVSSFKDIFNEALGSSSPCPNLYFFSSVSGISDNENRIVPMF 60

Query: 61  -------VSASSSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPY 120
                  V  SS++A GAD M+T+EVALSFKEWFKSGSN+LYDQIFQILQ ARD++E+ Y
Sbjct: 61  SPWMSTGVGTSSTAAAGADWMVTQEVALSFKEWFKSGSNALYDQIFQILQMARDDKEMSY 120

Query: 121 GPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFK 180
           G STADLALSSLGLRLNE FVLDVLR+GSK+VLSCLKFFDWAG QP FFHTRATF AIFK
Sbjct: 121 GHSTADLALSSLGLRLNELFVLDVLRYGSKDVLSCLKFFDWAGHQPGFFHTRATFVAIFK 180

Query: 181 ILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDL 240
           ILSKAKLMSL+FDF++NYVQQ+ VHK RFY+TL         P+FALQLFGKMRFQGLDL
Sbjct: 181 ILSKAKLMSLMFDFLENYVQQKFVHKARFYNTLVMGYAVAGKPIFALQLFGKMRFQGLDL 240

Query: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLH 300
           DSFAYHVLLNSLVEENCFDAVHVIVKQI+LRGF NE+THYLMLKNFCKQ+QLDEAETFLH
Sbjct: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQITLRGFVNEITHYLMLKNFCKQSQLDEAETFLH 300

Query: 301 DLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAG 360
           DLVG G+ +NGRMLGFL+ ALCK GNFER+WKLVEGFRDLELV M+HVYGVWITELI+AG
Sbjct: 301 DLVGSGKGLNGRMLGFLVSALCKSGNFERAWKLVEGFRDLELVSMDHVYGVWITELIRAG 360

Query: 361 KLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITM 420
            LE ALQFLYSRKSDESYIPDVFRYNMLIHRLLR+NRLQEVFDLLTEMMEEHISPDK+TM
Sbjct: 361 MLERALQFLYSRKSDESYIPDVFRYNMLIHRLLRDNRLQEVFDLLTEMMEEHISPDKVTM 420

Query: 421 NAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSID 480
           NAAMCFLCK GMVDVALDLYNSRSE+ LSPN MAYNYL+NTLCGDGSTDEAYHILK SID
Sbjct: 421 NAAMCFLCKAGMVDVALDLYNSRSEYRLSPNSMAYNYLVNTLCGDGSTDEAYHILKHSID 480

Query: 481 QGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVED 540
           QGYFP K+TFSILADALCREGKLDKMKE+VIF+LERNFMPS STYDKFI ALC+ARRVED
Sbjct: 481 QGYFPRKKTFSILADALCREGKLDKMKELVIFSLERNFMPSGSTYDKFISALCKARRVED 540

Query: 541 GYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVIC 600
           GYLIHGELNRIN VAIKSTYF LIDGFNK RRGDI+ARLLIEMQEKGH PTRK+FR+VI 
Sbjct: 541 GYLIHGELNRINVVAIKSTYFVLIDGFNKLRRGDISARLLIEMQEKGHNPTRKIFRSVIH 600

Query: 601 CLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQP 660
           CLNEMENMEKQFFNLLELQLSRQEP   VYNNF+YGAALAKK ELAREVYQMMLRSGIQP
Sbjct: 601 CLNEMENMEKQFFNLLELQLSRQEPGPEVYNNFIYGAALAKKSELAREVYQMMLRSGIQP 660

Query: 661 NLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLR 720
           NLSSDILLLK YL SERISDALNFL+DLY TRTIGRKISN MVVGLCK +K DVALD LR
Sbjct: 661 NLSSDILLLKSYLHSERISDALNFLSDLYQTRTIGRKISNVMVVGLCKANKADVALDVLR 720

Query: 721 GIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTK 780
            +RD+GLIPSIECYEELAKH C NERYDLVVNLINDLDKVGRPITSFLGN LLYSS+KT+
Sbjct: 721 DMRDRGLIPSIECYEELAKHLCHNERYDLVVNLINDLDKVGRPITSFLGNTLLYSSMKTQ 780

Query: 781 KLYEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840
           KLYEAWV+SREGQVETS+SSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL
Sbjct: 781 KLYEAWVSSREGQVETSRSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840

Query: 841 LRRLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFC 865
           LRRLS NDLQQAFELFNRLC+KGY PN WTYDILVH LFKHGRTSEAK LLEVMYRKGF 
Sbjct: 841 LRRLSANDLQQAFELFNRLCEKGYVPNRWTYDILVHALFKHGRTSEAKRLLEVMYRKGFT 900

BLAST of Tan0006998 vs. NCBI nr
Match: XP_022999627.1 (pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita maxima] >XP_022999628.1 pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita maxima] >XP_022999629.1 pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1499.6 bits (3881), Expect = 0.0e+00
Identity = 754/907 (83.13%), Positives = 802/907 (88.42%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSAS------------------------- 60
           M+LLQR ARVESKTK+GIFVSSFKDIFNE L S+S                         
Sbjct: 1   MILLQRVARVESKTKTGIFVSSFKDIFNEALGSSSPCPNLYSFSSVAGISDNGNRIVPMF 60

Query: 61  ---------SSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGP 120
                    +S   GAD M+T+EVAL FKEWFKSGSN+LYDQIFQILQ ARD+QE+PYG 
Sbjct: 61  SPWMSTGVGTSLTAGADWMVTQEVALPFKEWFKSGSNALYDQIFQILQMARDDQEMPYGH 120

Query: 121 STADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKIL 180
           STADLALSSLGLRLNE FVLDVLR+GSK+VLSCLKFFDWAG QP FFHTRATF AIFKIL
Sbjct: 121 STADLALSSLGLRLNELFVLDVLRYGSKDVLSCLKFFDWAGHQPGFFHTRATFVAIFKIL 180

Query: 181 SKAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDLDS 240
           SKAKLMSL+FDF++NYVQQ+ VHK RFY+TL         P+FALQLFGKMRFQGLDLDS
Sbjct: 181 SKAKLMSLMFDFLENYVQQKFVHKARFYNTLVMGYAVAGKPIFALQLFGKMRFQGLDLDS 240

Query: 241 FAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDL 300
           FAYHVLLNSLVEENCFDAVHV+VKQI+LRGF NE+THYLMLKNFCKQ+QLDEAETFLHDL
Sbjct: 241 FAYHVLLNSLVEENCFDAVHVVVKQITLRGFVNEITHYLMLKNFCKQSQLDEAETFLHDL 300

Query: 301 VGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKL 360
           VG G+ +NGRMLGFL+ ALCK GNFER+WKLVEGFRDLELV M+H YG WITELI+AGKL
Sbjct: 301 VGSGKGLNGRMLGFLVSALCKSGNFERAWKLVEGFRDLELVSMDHAYGAWITELIRAGKL 360

Query: 361 ENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNA 420
           E ALQFLYSRKSDESYIPDVFRYNMLIHRLLR+NRLQEVFDLLTEMMEEHISPDK+T+N 
Sbjct: 361 ERALQFLYSRKSDESYIPDVFRYNMLIHRLLRDNRLQEVFDLLTEMMEEHISPDKVTLNV 420

Query: 421 AMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQG 480
           AMCFLCK GMVDVALDLYNSRSE+ LSPN MAYNYL+NTLCGDGSTDEAYHILK SIDQG
Sbjct: 421 AMCFLCKAGMVDVALDLYNSRSEYRLSPNSMAYNYLVNTLCGDGSTDEAYHILKHSIDQG 480

Query: 481 YFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGY 540
           YFPGKRTFSILADALCREGKLDKMKE+VIF+LERNFMPS STYDKFI ALC+ARRVEDGY
Sbjct: 481 YFPGKRTFSILADALCREGKLDKMKELVIFSLERNFMPSGSTYDKFISALCKARRVEDGY 540

Query: 541 LIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCL 600
           LIH ELNRIN VAIKSTYF LIDGFNK RRGDI+ARLLIEMQEKGH PTRKLFR+VI CL
Sbjct: 541 LIHDELNRINVVAIKSTYFVLIDGFNKLRRGDISARLLIEMQEKGHNPTRKLFRSVIHCL 600

Query: 601 NEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNL 660
            EMENMEKQFFNLLELQLSRQEP+  VYNNF+YGAALAKK  LAREVYQMMLRSGIQPNL
Sbjct: 601 TEMENMEKQFFNLLELQLSRQEPSPEVYNNFIYGAALAKKSALAREVYQMMLRSGIQPNL 660

Query: 661 SSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGI 720
           SSDILLLKCYL SERISDALNFL+DLY TRTIGRKISN MVVGLCK +K DVALD  R I
Sbjct: 661 SSDILLLKCYLHSERISDALNFLSDLYQTRTIGRKISNVMVVGLCKANKADVALDVFRDI 720

Query: 721 RDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKL 780
           RD+G+IPSIECYEELAKH C NERYDLVVNLINDLDKVGRPITSFLGN LLYSSLKT+KL
Sbjct: 721 RDRGVIPSIECYEELAKHLCHNERYDLVVNLINDLDKVGRPITSFLGNTLLYSSLKTQKL 780

Query: 781 YEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR 840
           YEAWV+ REGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR
Sbjct: 781 YEAWVSLREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR 840

Query: 841 RLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPS 865
           RLS NDLQQAFELFNRLC+KGY PN WTYDILVH LFKHGRTSEAK LLEVMYRKGF P+
Sbjct: 841 RLSANDLQQAFELFNRLCEKGYVPNRWTYDILVHALFKHGRTSEAKRLLEVMYRKGFTPT 900

BLAST of Tan0006998 vs. NCBI nr
Match: XP_022156362.1 (pentatricopeptide repeat-containing protein At1g71210 [Momordica charantia] >XP_022156363.1 pentatricopeptide repeat-containing protein At1g71210 [Momordica charantia] >XP_022156364.1 pentatricopeptide repeat-containing protein At1g71210 [Momordica charantia])

HSP 1 Score: 1494.9 bits (3869), Expect = 0.0e+00
Identity = 755/904 (83.52%), Positives = 804/904 (88.94%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSASSS----------------------- 60
           MLLLQR  RVESKTKSGIFVSSF+DIFNE LVS  SS                       
Sbjct: 1   MLLLQRIVRVESKTKSGIFVSSFRDIFNEALVSDLSSFSSVAGISGNGNRDIPIFFPWMS 60

Query: 61  --------SATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGPSTA 120
                   +A G DGMI+KEVALSFKEWFKSGSNSL+DQIFQILQGARD+QE  Y PSTA
Sbjct: 61  EKIATTLTAAAGGDGMISKEVALSFKEWFKSGSNSLFDQIFQILQGARDDQETTYRPSTA 120

Query: 121 DLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILSKA 180
           DLALSSLGLRLNE FVLDVLRFGS +VLSCLKFFDWAGRQP FFHTRATFNAIFKILSKA
Sbjct: 121 DLALSSLGLRLNELFVLDVLRFGSNDVLSCLKFFDWAGRQPGFFHTRATFNAIFKILSKA 180

Query: 181 KLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDLDSFAY 240
           KLMSL+FDF+DNYVQQ+ VHKVRFY+TL         P+FALQLFG+MRFQG DLDSFAY
Sbjct: 181 KLMSLMFDFLDNYVQQKFVHKVRFYNTLVMGYAVAGKPIFALQLFGQMRFQGHDLDSFAY 240

Query: 241 HVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDLVGR 300
           HVLLNSLVEENCFDAVHVIVKQISL GFENEVTH++MLKNFCKQ+QL EAETFLH LV  
Sbjct: 241 HVLLNSLVEENCFDAVHVIVKQISLMGFENEVTHHIMLKNFCKQSQLAEAETFLHGLVSS 300

Query: 301 GEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLENA 360
           G+AV+GRMLG L+GALCK GNFER+WKLVE FR+ ELV +EHVYGVWIT+L++AGKLE+A
Sbjct: 301 GQAVSGRMLGILVGALCKSGNFERAWKLVEEFRN-ELVSVEHVYGVWITQLVRAGKLESA 360

Query: 361 LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAAMC 420
           LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLL EM +EHISPDK+TMNAAMC
Sbjct: 361 LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLMEMKKEHISPDKVTMNAAMC 420

Query: 421 FLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGYFP 480
           FLCK GMVDVALDLYNSRS FGLSPN MAYNYLINTLCGDGSTDEAYHILK+SIDQGYFP
Sbjct: 421 FLCKAGMVDVALDLYNSRSGFGLSPNSMAYNYLINTLCGDGSTDEAYHILKNSIDQGYFP 480

Query: 481 GKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYLIH 540
           GK+TFSILADALCRE KLDKMKE+VIFALERNFMPSDSTYDKFI ALCRA+RVEDGYLIH
Sbjct: 481 GKKTFSILADALCRERKLDKMKELVIFALERNFMPSDSTYDKFISALCRAKRVEDGYLIH 540

Query: 541 GELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEM 600
           GELNRINKVA++STYFALIDGFNKS RGDIAARLLIEMQEKGH PTRKLFRAVI CLNEM
Sbjct: 541 GELNRINKVAVQSTYFALIDGFNKSNRGDIAARLLIEMQEKGHVPTRKLFRAVIRCLNEM 600

Query: 601 ENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSD 660
           ENMEKQFFNLLELQLSRQEP+C VYNNF+YGAA AKKPELAREVYQMMLRSGIQPNLSSD
Sbjct: 601 ENMEKQFFNLLELQLSRQEPSCEVYNNFIYGAAHAKKPELAREVYQMMLRSGIQPNLSSD 660

Query: 661 ILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGIRDK 720
           IL+LKCYLCSERISDALNFL DL  +R IGRKI NTMVVGLCK +K D+ALDFLR +RDK
Sbjct: 661 ILILKCYLCSERISDALNFLEDLSQSRIIGRKIFNTMVVGLCKANKADIALDFLRDMRDK 720

Query: 721 GLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEA 780
            L PSIECYE LAK FCQ ERYDLV NL+NDL+ VGR +TSFLGNILLY+SLKT+KLYEA
Sbjct: 721 SLTPSIECYEVLAKQFCQIERYDLVANLVNDLENVGRHLTSFLGNILLYNSLKTRKLYEA 780

Query: 781 WVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS 840
           WV+SREG +ETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS
Sbjct: 781 WVHSREGLMETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS 840

Query: 841 TNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPSERT 865
            ND+Q AFELFNRLCQKGYEPN WTYDILVHGLFKHGRTSEAK LLEVMYRKGF P+E T
Sbjct: 841 INDMQLAFELFNRLCQKGYEPNKWTYDILVHGLFKHGRTSEAKRLLEVMYRKGFDPTECT 900

BLAST of Tan0006998 vs. ExPASy TrEMBL
Match: A0A6J1G442 (pentatricopeptide repeat-containing protein At1g71210, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111450559 PE=4 SV=1)

HSP 1 Score: 1510.4 bits (3909), Expect = 0.0e+00
Identity = 760/909 (83.61%), Positives = 808/909 (88.89%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGL----------------------------- 60
           M+LLQR ARVESKTK+GIFVSSFKDIFNE L                             
Sbjct: 1   MILLQRVARVESKTKTGIFVSSFKDIFNEALGSSSPCPNLYSFSSVSGISDNGNRIVPMF 60

Query: 61  -------VSASSSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPY 120
                  V  SS++A G D M+T+EVALSFKEWFKSGSN+LYDQIFQILQ ARD+QE+PY
Sbjct: 61  SPWMSTGVGTSSTAAAGEDWMVTQEVALSFKEWFKSGSNALYDQIFQILQMARDDQEMPY 120

Query: 121 GPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFK 180
           G STADLALSSLGLRLNE FVLDVLR+GSK+VLSCLKFFDWAG QP FFHTRATF AIFK
Sbjct: 121 GHSTADLALSSLGLRLNELFVLDVLRYGSKDVLSCLKFFDWAGHQPGFFHTRATFVAIFK 180

Query: 181 ILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDL 240
           ILSKAKLMSL+FDF++NYVQQ+ VHK RFY+TL         P+FALQLFGKMRFQGLDL
Sbjct: 181 ILSKAKLMSLMFDFLENYVQQKFVHKARFYNTLVMGYAVAGKPIFALQLFGKMRFQGLDL 240

Query: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLH 300
           DSFAYHVLLNSLVEENCFDAVHVIVKQI+LRGF NE+THYLMLKNFCKQ+QLDEAETFLH
Sbjct: 241 DSFAYHVLLNSLVEENCFDAVHVIVKQITLRGFVNEITHYLMLKNFCKQSQLDEAETFLH 300

Query: 301 DLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAG 360
           DLVG G+ +NGRMLGFL+ ALCK GNFER+WKLVEGFRDLELV M+HVYGVWITELI+AG
Sbjct: 301 DLVGSGKGLNGRMLGFLVSALCKSGNFERAWKLVEGFRDLELVSMDHVYGVWITELIRAG 360

Query: 361 KLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITM 420
            LE ALQFLYSRKSDESYIPDVFRYNMLIHRLLR+NRLQEVFDLLTEMMEEHISPDK+TM
Sbjct: 361 MLERALQFLYSRKSDESYIPDVFRYNMLIHRLLRDNRLQEVFDLLTEMMEEHISPDKVTM 420

Query: 421 NAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSID 480
           NAAMCFLCK GMVDVALDLYNSRSE+ LSPN MAYNYL+NTLCGDGSTDEAYHILK SID
Sbjct: 421 NAAMCFLCKAGMVDVALDLYNSRSEYRLSPNSMAYNYLVNTLCGDGSTDEAYHILKHSID 480

Query: 481 QGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVED 540
           QGYFPGK+TFSILADALCREGKLDKMKE+VIF+LERNFMPS STYDKFI ALC+ARRVED
Sbjct: 481 QGYFPGKKTFSILADALCREGKLDKMKELVIFSLERNFMPSGSTYDKFISALCKARRVED 540

Query: 541 GYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVIC 600
           GYLIHGELNRIN VAIKSTYF LIDGFNK RRGDI+ARLLIEMQEKGH PTRK+FR VI 
Sbjct: 541 GYLIHGELNRINVVAIKSTYFVLIDGFNKLRRGDISARLLIEMQEKGHNPTRKIFRTVIH 600

Query: 601 CLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQP 660
           CLNEMENMEKQFFNLLELQLSRQEP+  VYNNF+YGAALAKK ELAREVYQMMLRSGIQP
Sbjct: 601 CLNEMENMEKQFFNLLELQLSRQEPSPEVYNNFIYGAALAKKSELAREVYQMMLRSGIQP 660

Query: 661 NLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLR 720
           NLSSDILLLK YL SERISDALNFL+DLY TRTIGRKISN MVVGLCK +K DVALD LR
Sbjct: 661 NLSSDILLLKSYLHSERISDALNFLSDLYQTRTIGRKISNVMVVGLCKANKADVALDVLR 720

Query: 721 GIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTK 780
            +RD+GLIPSIECYEELAKH C NERYDLVVNLINDLDKVGRPITSFLGN LLYSS+KT+
Sbjct: 721 DMRDRGLIPSIECYEELAKHLCHNERYDLVVNLINDLDKVGRPITSFLGNTLLYSSMKTQ 780

Query: 781 KLYEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840
           KLYEAWV+SREGQVETS+SSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL
Sbjct: 781 KLYEAWVSSREGQVETSRSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLL 840

Query: 841 LRRLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFC 865
           LRRLS NDLQQAFELFNRLC+KGY PN WTYDILVH LFKHGRTSEAK LLEVMYRKGF 
Sbjct: 841 LRRLSANDLQQAFELFNRLCEKGYVPNRWTYDILVHALFKHGRTSEAKRLLEVMYRKGFA 900

BLAST of Tan0006998 vs. ExPASy TrEMBL
Match: A0A6J1KBC6 (pentatricopeptide repeat-containing protein At1g71210, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111493931 PE=4 SV=1)

HSP 1 Score: 1499.6 bits (3881), Expect = 0.0e+00
Identity = 754/907 (83.13%), Positives = 802/907 (88.42%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSAS------------------------- 60
           M+LLQR ARVESKTK+GIFVSSFKDIFNE L S+S                         
Sbjct: 1   MILLQRVARVESKTKTGIFVSSFKDIFNEALGSSSPCPNLYSFSSVAGISDNGNRIVPMF 60

Query: 61  ---------SSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGP 120
                    +S   GAD M+T+EVAL FKEWFKSGSN+LYDQIFQILQ ARD+QE+PYG 
Sbjct: 61  SPWMSTGVGTSLTAGADWMVTQEVALPFKEWFKSGSNALYDQIFQILQMARDDQEMPYGH 120

Query: 121 STADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKIL 180
           STADLALSSLGLRLNE FVLDVLR+GSK+VLSCLKFFDWAG QP FFHTRATF AIFKIL
Sbjct: 121 STADLALSSLGLRLNELFVLDVLRYGSKDVLSCLKFFDWAGHQPGFFHTRATFVAIFKIL 180

Query: 181 SKAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDLDS 240
           SKAKLMSL+FDF++NYVQQ+ VHK RFY+TL         P+FALQLFGKMRFQGLDLDS
Sbjct: 181 SKAKLMSLMFDFLENYVQQKFVHKARFYNTLVMGYAVAGKPIFALQLFGKMRFQGLDLDS 240

Query: 241 FAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDL 300
           FAYHVLLNSLVEENCFDAVHV+VKQI+LRGF NE+THYLMLKNFCKQ+QLDEAETFLHDL
Sbjct: 241 FAYHVLLNSLVEENCFDAVHVVVKQITLRGFVNEITHYLMLKNFCKQSQLDEAETFLHDL 300

Query: 301 VGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKL 360
           VG G+ +NGRMLGFL+ ALCK GNFER+WKLVEGFRDLELV M+H YG WITELI+AGKL
Sbjct: 301 VGSGKGLNGRMLGFLVSALCKSGNFERAWKLVEGFRDLELVSMDHAYGAWITELIRAGKL 360

Query: 361 ENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNA 420
           E ALQFLYSRKSDESYIPDVFRYNMLIHRLLR+NRLQEVFDLLTEMMEEHISPDK+T+N 
Sbjct: 361 ERALQFLYSRKSDESYIPDVFRYNMLIHRLLRDNRLQEVFDLLTEMMEEHISPDKVTLNV 420

Query: 421 AMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQG 480
           AMCFLCK GMVDVALDLYNSRSE+ LSPN MAYNYL+NTLCGDGSTDEAYHILK SIDQG
Sbjct: 421 AMCFLCKAGMVDVALDLYNSRSEYRLSPNSMAYNYLVNTLCGDGSTDEAYHILKHSIDQG 480

Query: 481 YFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGY 540
           YFPGKRTFSILADALCREGKLDKMKE+VIF+LERNFMPS STYDKFI ALC+ARRVEDGY
Sbjct: 481 YFPGKRTFSILADALCREGKLDKMKELVIFSLERNFMPSGSTYDKFISALCKARRVEDGY 540

Query: 541 LIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCL 600
           LIH ELNRIN VAIKSTYF LIDGFNK RRGDI+ARLLIEMQEKGH PTRKLFR+VI CL
Sbjct: 541 LIHDELNRINVVAIKSTYFVLIDGFNKLRRGDISARLLIEMQEKGHNPTRKLFRSVIHCL 600

Query: 601 NEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNL 660
            EMENMEKQFFNLLELQLSRQEP+  VYNNF+YGAALAKK  LAREVYQMMLRSGIQPNL
Sbjct: 601 TEMENMEKQFFNLLELQLSRQEPSPEVYNNFIYGAALAKKSALAREVYQMMLRSGIQPNL 660

Query: 661 SSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGI 720
           SSDILLLKCYL SERISDALNFL+DLY TRTIGRKISN MVVGLCK +K DVALD  R I
Sbjct: 661 SSDILLLKCYLHSERISDALNFLSDLYQTRTIGRKISNVMVVGLCKANKADVALDVFRDI 720

Query: 721 RDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKL 780
           RD+G+IPSIECYEELAKH C NERYDLVVNLINDLDKVGRPITSFLGN LLYSSLKT+KL
Sbjct: 721 RDRGVIPSIECYEELAKHLCHNERYDLVVNLINDLDKVGRPITSFLGNTLLYSSLKTQKL 780

Query: 781 YEAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR 840
           YEAWV+ REGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR
Sbjct: 781 YEAWVSLREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR 840

Query: 841 RLSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPS 865
           RLS NDLQQAFELFNRLC+KGY PN WTYDILVH LFKHGRTSEAK LLEVMYRKGF P+
Sbjct: 841 RLSANDLQQAFELFNRLCEKGYVPNRWTYDILVHALFKHGRTSEAKRLLEVMYRKGFTPT 900

BLAST of Tan0006998 vs. ExPASy TrEMBL
Match: A0A6J1DT81 (pentatricopeptide repeat-containing protein At1g71210 OS=Momordica charantia OX=3673 GN=LOC111023275 PE=4 SV=1)

HSP 1 Score: 1494.9 bits (3869), Expect = 0.0e+00
Identity = 755/904 (83.52%), Positives = 804/904 (88.94%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSASSS----------------------- 60
           MLLLQR  RVESKTKSGIFVSSF+DIFNE LVS  SS                       
Sbjct: 1   MLLLQRIVRVESKTKSGIFVSSFRDIFNEALVSDLSSFSSVAGISGNGNRDIPIFFPWMS 60

Query: 61  --------SATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGPSTA 120
                   +A G DGMI+KEVALSFKEWFKSGSNSL+DQIFQILQGARD+QE  Y PSTA
Sbjct: 61  EKIATTLTAAAGGDGMISKEVALSFKEWFKSGSNSLFDQIFQILQGARDDQETTYRPSTA 120

Query: 121 DLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILSKA 180
           DLALSSLGLRLNE FVLDVLRFGS +VLSCLKFFDWAGRQP FFHTRATFNAIFKILSKA
Sbjct: 121 DLALSSLGLRLNELFVLDVLRFGSNDVLSCLKFFDWAGRQPGFFHTRATFNAIFKILSKA 180

Query: 181 KLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDLDSFAY 240
           KLMSL+FDF+DNYVQQ+ VHKVRFY+TL         P+FALQLFG+MRFQG DLDSFAY
Sbjct: 181 KLMSLMFDFLDNYVQQKFVHKVRFYNTLVMGYAVAGKPIFALQLFGQMRFQGHDLDSFAY 240

Query: 241 HVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDLVGR 300
           HVLLNSLVEENCFDAVHVIVKQISL GFENEVTH++MLKNFCKQ+QL EAETFLH LV  
Sbjct: 241 HVLLNSLVEENCFDAVHVIVKQISLMGFENEVTHHIMLKNFCKQSQLAEAETFLHGLVSS 300

Query: 301 GEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLENA 360
           G+AV+GRMLG L+GALCK GNFER+WKLVE FR+ ELV +EHVYGVWIT+L++AGKLE+A
Sbjct: 301 GQAVSGRMLGILVGALCKSGNFERAWKLVEEFRN-ELVSVEHVYGVWITQLVRAGKLESA 360

Query: 361 LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAAMC 420
           LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLL EM +EHISPDK+TMNAAMC
Sbjct: 361 LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLMEMKKEHISPDKVTMNAAMC 420

Query: 421 FLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGYFP 480
           FLCK GMVDVALDLYNSRS FGLSPN MAYNYLINTLCGDGSTDEAYHILK+SIDQGYFP
Sbjct: 421 FLCKAGMVDVALDLYNSRSGFGLSPNSMAYNYLINTLCGDGSTDEAYHILKNSIDQGYFP 480

Query: 481 GKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYLIH 540
           GK+TFSILADALCRE KLDKMKE+VIFALERNFMPSDSTYDKFI ALCRA+RVEDGYLIH
Sbjct: 481 GKKTFSILADALCRERKLDKMKELVIFALERNFMPSDSTYDKFISALCRAKRVEDGYLIH 540

Query: 541 GELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEM 600
           GELNRINKVA++STYFALIDGFNKS RGDIAARLLIEMQEKGH PTRKLFRAVI CLNEM
Sbjct: 541 GELNRINKVAVQSTYFALIDGFNKSNRGDIAARLLIEMQEKGHVPTRKLFRAVIRCLNEM 600

Query: 601 ENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSD 660
           ENMEKQFFNLLELQLSRQEP+C VYNNF+YGAA AKKPELAREVYQMMLRSGIQPNLSSD
Sbjct: 601 ENMEKQFFNLLELQLSRQEPSCEVYNNFIYGAAHAKKPELAREVYQMMLRSGIQPNLSSD 660

Query: 661 ILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGIRDK 720
           IL+LKCYLCSERISDALNFL DL  +R IGRKI NTMVVGLCK +K D+ALDFLR +RDK
Sbjct: 661 ILILKCYLCSERISDALNFLEDLSQSRIIGRKIFNTMVVGLCKANKADIALDFLRDMRDK 720

Query: 721 GLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEA 780
            L PSIECYE LAK FCQ ERYDLV NL+NDL+ VGR +TSFLGNILLY+SLKT+KLYEA
Sbjct: 721 SLTPSIECYEVLAKQFCQIERYDLVANLVNDLENVGRHLTSFLGNILLYNSLKTRKLYEA 780

Query: 781 WVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS 840
           WV+SREG +ETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS
Sbjct: 781 WVHSREGLMETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS 840

Query: 841 TNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPSERT 865
            ND+Q AFELFNRLCQKGYEPN WTYDILVHGLFKHGRTSEAK LLEVMYRKGF P+E T
Sbjct: 841 INDMQLAFELFNRLCQKGYEPNKWTYDILVHGLFKHGRTSEAKRLLEVMYRKGFDPTECT 900

BLAST of Tan0006998 vs. ExPASy TrEMBL
Match: A0A0A0LM57 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G034830 PE=4 SV=1)

HSP 1 Score: 1370.1 bits (3545), Expect = 0.0e+00
Identity = 692/904 (76.55%), Positives = 768/904 (84.96%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSAS------------------------- 60
           MLLL R ARV+SKTK+GIFVSSFKDIFN+ LVSAS                         
Sbjct: 1   MLLLHRVARVKSKTKNGIFVSSFKDIFNDALVSASLCPNLHSVSSAAGTSGNGNRDIPRF 60

Query: 61  ------SSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGPSTA 120
                 S+ + GADGMITKEVA SFKEWFKSGSN LY +IFQIL+GARD+QE+PY PS A
Sbjct: 61  FPWKIASTLSAGADGMITKEVASSFKEWFKSGSNPLYGKIFQILRGARDDQEIPYRPSAA 120

Query: 121 DLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILSKA 180
           DLALS LGLRLNE FVLDVLRFGSK+VLSCLKFFDWAGRQ +FFHTRATFNAI KILSKA
Sbjct: 121 DLALSRLGLRLNESFVLDVLRFGSKDVLSCLKFFDWAGRQERFFHTRATFNAILKILSKA 180

Query: 181 KLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDLDSFAY 240
           KL+SL+FDF++N VQ ++ H   FY+ L         P+FAL LFGKMRFQGLDLD F+Y
Sbjct: 181 KLVSLMFDFLENCVQHKLYHMPCFYNILVMGYAAAGKPIFALHLFGKMRFQGLDLDPFSY 240

Query: 241 HVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDLVGR 300
           HVLLNSLVEENCFDAV+VI+KQI+LRGF NE+THYLMLK+FCKQNQLDEAETFLHDLV  
Sbjct: 241 HVLLNSLVEENCFDAVNVIIKQITLRGFVNEITHYLMLKSFCKQNQLDEAETFLHDLVDS 300

Query: 301 GEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLENA 360
           G+ +NGRML  L+GA C+ GNFER+WKLVE FRDL++V MEHVYGVWITELI+AGKLE+A
Sbjct: 301 GKKLNGRMLDLLVGAFCQSGNFERAWKLVEWFRDLQIVSMEHVYGVWITELIRAGKLESA 360

Query: 361 LQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAAMC 420
           LQFL S K D  YIPDVFRYNMLIHRLLRENRLQEVFDLLTEMM++HISPDK+TM+AAMC
Sbjct: 361 LQFLNSSKLDGRYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMDQHISPDKVTMDAAMC 420

Query: 421 FLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGYFP 480
           FLCK GMV+VAL+LYNS  EFG+SPN MAYNYLIN LC DGSTDEAY ILK SI +GYFP
Sbjct: 421 FLCKAGMVEVALELYNSNFEFGISPNTMAYNYLINALCRDGSTDEAYRILKCSIYEGYFP 480

Query: 481 GKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYLIH 540
           GK+TFSILA ALCREGKLDKMKE+VIFALERN MP+DSTYDKFI ALCRARRVEDGYLIH
Sbjct: 481 GKKTFSILASALCREGKLDKMKELVIFALERNCMPNDSTYDKFIYALCRARRVEDGYLIH 540

Query: 541 GELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEM 600
            ELNRIN VA +STYF LI+GF KS RGDIAARLLIEM EKGH P R LFR+VI CL EM
Sbjct: 541 CELNRINVVATRSTYFVLIEGFIKSGRGDIAARLLIEMLEKGHNPPRGLFRSVILCLIEM 600

Query: 601 ENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSD 660
           ENMEKQFFNLLELQLS QEPN  VYNNF+Y A  AKKPELA EVY MMLR+GIQPNLSSD
Sbjct: 601 ENMEKQFFNLLELQLSCQEPNSEVYNNFIYAAGRAKKPELANEVYHMMLRNGIQPNLSSD 660

Query: 661 ILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGIRDK 720
           ILLL+ YL SERISDAL FL++L  TRTIGRKISN +VVGLCK +K ++A DF + +RDK
Sbjct: 661 ILLLRGYLYSERISDALIFLSNLSQTRTIGRKISNVVVVGLCKANKTNLAFDFWKHLRDK 720

Query: 721 GLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEA 780
           G +PSIECYEELAKHFCQNERYD VVNL+NDLDKVGRP+TSFLGN+LLYSSLKT+KLY+A
Sbjct: 721 GTVPSIECYEELAKHFCQNERYDAVVNLLNDLDKVGRPLTSFLGNVLLYSSLKTQKLYKA 780

Query: 781 WVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLS 840
           WVNSR GQVETSQSSMLGLLI AFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR L 
Sbjct: 781 WVNSRVGQVETSQSSMLGLLIKAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRTLI 840

Query: 841 TNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPSERT 865
           T+D+++AFELF+RLC+KGY PN WTYDILVHGLFK GRT EAK LLE+M++KGF  +E T
Sbjct: 841 TSDMERAFELFDRLCEKGYVPNKWTYDILVHGLFKQGRTVEAKRLLEIMHKKGFSLTECT 900

BLAST of Tan0006998 vs. ExPASy TrEMBL
Match: A0A5D3BBD3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold546G00600 PE=4 SV=1)

HSP 1 Score: 1323.5 bits (3424), Expect = 0.0e+00
Identity = 680/906 (75.06%), Positives = 748/906 (82.56%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSAS------------------------- 60
           MLLL R ARV+SKTK+GIFVS    IFN+ LVSAS                         
Sbjct: 1   MLLLHRVARVKSKTKNGIFVS----IFNDALVSASLCPNSHSVSSVAGTSGNGNRDIPKF 60

Query: 61  --------SSSATGADGMITKEVALSFKEWFKSGSNSLYDQIFQILQGARDEQEVPYGPS 120
                      + GADGMI KEVA SFKEWFKSGS  LY  IFQIL+G RD+Q +P  PS
Sbjct: 61  FRWKIGSTLKESAGADGMIIKEVASSFKEWFKSGSKPLYGIIFQILRGDRDDQGMPSVPS 120

Query: 121 TADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILS 180
            ADLALS LGLRLNE FVLDVLR+GSK++LSCLKFFDWAG Q  FFHTRATFNAI KILS
Sbjct: 121 PADLALSRLGLRLNEAFVLDVLRYGSKDILSCLKFFDWAGHQQGFFHTRATFNAILKILS 180

Query: 181 KAKLMSLLFDFIDNYVQQRMVHKVRFYSTL---------PMFALQLFGKMRFQGLDLDSF 240
           +AKL  L+ DF++N VQQR  H   F +TL         P+FAL LFGKMRFQGLDLD F
Sbjct: 181 QAKLFPLMLDFLENCVQQRTYHTACFTNTLVMGYAAAGKPIFALHLFGKMRFQGLDLDPF 240

Query: 241 AYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHYLMLKNFCKQNQLDEAETFLHDLV 300
           +YHVLLNSLVEENCFDAV+VI+KQI+LRGF NE+THYLMLKN CKQNQLDEAETFLHDLV
Sbjct: 241 SYHVLLNSLVEENCFDAVNVIIKQITLRGFVNELTHYLMLKNLCKQNQLDEAETFLHDLV 300

Query: 301 GRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLE 360
             G+ ++GRML FL+GA C+ GNFER+WKLVE FRDLE+V ME+VYGVW TELI+AGKLE
Sbjct: 301 DSGKKLSGRMLDFLVGAFCQSGNFERAWKLVEWFRDLEIVSMEYVYGVWTTELIRAGKLE 360

Query: 361 NALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAA 420
           +ALQFL S K D  YIPDVFRYNMLIHRLLRENRLQEVFDLLTEMME+HI PDK+TM+AA
Sbjct: 361 SALQFLNSSKLDGRYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEQHIVPDKVTMHAA 420

Query: 421 MCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGY 480
            CFLCK GMV+VAL+LYNS  EFG+SPN MAYNYLIN LC DG TDEAY ILK SI +GY
Sbjct: 421 TCFLCKAGMVEVALELYNSNFEFGISPNTMAYNYLINALCWDGGTDEAYRILKRSIHEGY 480

Query: 481 FPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYL 540
           FPGK+TFSILA ALCREGKLDKMKE+VIFALERN MPSDSTYDKFI ALCRARRVEDGYL
Sbjct: 481 FPGKKTFSILASALCREGKLDKMKELVIFALERNCMPSDSTYDKFITALCRARRVEDGYL 540

Query: 541 IHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLN 600
           IH ELNRIN VA +STY  LIDGF KS RGDIAARLLIEM EKGH P R  FR VI CL 
Sbjct: 541 IHSELNRINVVATRSTYVLLIDGFIKSGRGDIAARLLIEMLEKGHNPRRSEFRYVIRCLI 600

Query: 601 EMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLS 660
           EMENMEKQFFNLLELQLS QEPN  VYNNF+Y AA AKKPELA EVYQMMLR+GIQPNLS
Sbjct: 601 EMENMEKQFFNLLELQLSCQEPNTEVYNNFIYAAARAKKPELANEVYQMMLRNGIQPNLS 660

Query: 661 SDILLLKCYLCSERISDALNFLNDLYPTRTIGRKISNTMVVGLCKVSKGDVALDFLRGIR 720
           SDILLL+ YL SERISDAL FL++L  TRTIGRKISN +VVGLCK +K ++A DF + +R
Sbjct: 661 SDILLLRGYLYSERISDALIFLSNLSQTRTIGRKISNVVVVGLCKANKSNLAFDFWKHLR 720

Query: 721 DKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSLKTKKLY 780
           +KG IPSIECYEELAKHFCQ ERYD+VVNLINDLDKVGRP+TSFLGNILLYSSLKT+KLY
Sbjct: 721 NKGTIPSIECYEELAKHFCQIERYDVVVNLINDLDKVGRPLTSFLGNILLYSSLKTQKLY 780

Query: 781 EAWVNSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRR 840
           +AWVNSREG VETSQSSMLGLLI AFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLR+
Sbjct: 781 KAWVNSREGLVETSQSSMLGLLIKAFSGHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRK 840

Query: 841 LSTNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGRTSEAKLLLEVMYRKGFCPSE 865
           LS ND++QAFELF+RLC++GY PN WTYDILVHGLFK GRT EAK LLE+M+++GF  +E
Sbjct: 841 LSPNDMEQAFELFDRLCERGYVPNKWTYDILVHGLFKQGRTVEAKRLLEIMHQEGFTLTE 900

BLAST of Tan0006998 vs. TAIR 10
Match: AT1G71210.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 699.9 bits (1805), Expect = 2.6e-201
Identity = 385/844 (45.62%), Positives = 525/844 (62.20%), Query Frame = 0

Query: 1   MLLLQRAARVESKTKSGIFVSSFKDIFNEGLVSASSSSATGADGMITKEVALSFKEWFK- 60
           MLL +R   + + +       +  D       +  SSS    D ++ +     +K+WFK 
Sbjct: 16  MLLRRRILSLSASSFRNFTSGNNGDAIPFSTFTKPSSSIAPGDFLVRE-----WKDWFKH 75

Query: 61  ---SGSNSLYDQIFQILQGARDEQEVPYGPSTADLALSSLGLRLNEPFVLDVLRFGSKNV 120
                S+ L D+IF IL+   ++ +         L LS+L LRL E FVLDVL     ++
Sbjct: 76  RDVKQSHQLIDRIFDILRAPSNDGD----DRAFYLHLSNLRLRLTEKFVLDVLSHTRYDI 135

Query: 121 LSCLKFFDWAGRQPKFFHTRATFNAIFKILSKAKLMSLLFDFIDNYVQ-QRMVHKVRFYS 180
           L CLKFFDWA RQP F HTRATF+AIFKIL  AKL++L+ DF+D  V  +   H +R   
Sbjct: 136 LCCLKFFDWAARQPGFHHTRATFHAIFKILRGAKLVTLMIDFLDRSVGFESCRHSLRLCD 195

Query: 181 TLPM---------FALQLFGKMRFQGLDLDSFAYHVLLNSLVEENCFDAVHVIVKQISLR 240
            L +          ALQ FG MRF+GLDLDSF YHVLLN+LVEE CFD+  VI  QIS+R
Sbjct: 196 ALVVGYAVAGRTDIALQHFGNMRFRGLDLDSFGYHVLLNALVEEKCFDSFDVIFDQISVR 255

Query: 241 GFENEVTHYLMLKNFCKQNQLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSW 300
           GF   VTH +++K FCKQ +LDEAE +L  L+    A  G  LG L+ ALC +  F+ + 
Sbjct: 256 GFVCAVTHSILVKKFCKQGKLDEAEDYLRALLPNDPAGCGSGLGILVDALCSKRKFQEAT 315

Query: 301 KLVEGFRDLELVPMEHVYGVWITELIQAGKLENALQFLYSRKSDESYIPDVFRYNMLIHR 360
           KL++  + +  V M+  Y +WI  LI+AG L N   FL      E    +VFRYN ++ +
Sbjct: 316 KLLDEIKLVGTVNMDRAYNIWIRALIKAGFLNNPADFLQKISPLEGCELEVFRYNSMVFQ 375

Query: 361 LLRENRLQEVFDLLTEMMEEHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPN 420
           LL+EN L  V+D+LTEMM   +SP+K TMNAA+CF CK G VD AL+LY SRSE G +P 
Sbjct: 376 LLKENNLDGVYDILTEMMVRGVSPNKKTMNAALCFFCKAGFVDEALELYRSRSEIGFAPT 435

Query: 421 GMAYNYLINTLCGDGSTDEAYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVI 480
            M+YNYLI+TLC + S ++AY +LK +ID+G+F G +TFS L +ALC +GK D  +E+VI
Sbjct: 436 AMSYNYLIHTLCANESVEQAYDVLKGAIDRGHFLGGKTFSTLTNALCWKGKPDMARELVI 495

Query: 481 FALERNFMPSDSTYDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFALIDGFNKSR 540
            A ER+ +P      K I ALC   +VED  +I+   N+         + +LI G     
Sbjct: 496 AAAERDLLPKRIAGCKIISALCDVGKVEDALMINELFNKSGVDTSFKMFTSLIYGSITLM 555

Query: 541 RGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEMENMEKQFF-NLLELQLSRQEPNCAVY 600
           RGDIAA+L+I MQEKG+TPTR L+R VI C+ EME+ EK FF  LL+ QLS  E     Y
Sbjct: 556 RGDIAAKLIIRMQEKGYTPTRSLYRNVIQCVCEMESGEKNFFTTLLKFQLSLWEHKVQAY 615

Query: 601 NNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDLYP 660
           N F+ GA  A KP+LAR VY MM R GI P ++S+IL+L+ YL +E+I+DAL+F +DL  
Sbjct: 616 NLFIEGAGFAGKPKLARLVYDMMDRDGITPTVASNILMLQSYLKNEKIADALHFFHDLRE 675

Query: 661 TRTIGRKISNTMVVGLCKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERYDLV 720
                +++   M+VGLCK +K D A+ FL  ++ +GL PSIECYE   +  C  E+YD  
Sbjct: 676 QGKTKKRLYQVMIVGLCKANKLDDAMHFLEEMKGEGLQPSIECYEVNIQKLCNEEKYDEA 735

Query: 721 VNLINDLDKVGRPITSFLGNILLYSSLKTKKLYEAWVNSREGQVETSQSSMLGLLIGAFS 780
           V L+N+  K GR IT+F+GN+LL++++K+K +YEAW   R  + +  +   LG LIG FS
Sbjct: 736 VGLVNEFRKSGRRITAFIGNVLLHNAMKSKGVYEAWTRMRNIEDKIPEMKSLGELIGLFS 795

Query: 781 GHIRVSQSIKNLEEAIAKCFPLDIYTYNLLLRRLSTNDLQQAFELFNRLCQKGYEPNSWT 830
           G I +   +K L+E I KC+PLD+YTYN+LLR +  N  + A+E+  R+ ++GY PN  T
Sbjct: 796 GRIDMEVELKRLDEVIEKCYPLDMYTYNMLLRMIVMNQAEDAYEMVERIARRGYVPNERT 850

BLAST of Tan0006998 vs. TAIR 10
Match: AT3G06920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 185.3 bits (469), Expect = 2.1e-46
Identity = 176/795 (22.14%), Positives = 338/795 (42.52%), Query Frame = 0

Query: 83  PYGPSTADLALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAI 142
           P+GPS A+  LS+L  +    FV+ VLR   K+V   +++F W  R+ +  H   ++N++
Sbjct: 47  PWGPS-AENTLSALSFKPQPEFVIGVLR-RLKDVNRAIEYFRWYERRTELPHCPESYNSL 106

Query: 143 FKILSKAKLMSLLFDFIDNYVQQRMVHKVRFYSTLPMFALQLFG-----KMRFQGLDLDS 202
             ++++ +     FD +D  + +  V    F  ++      + G     K+R +G D+  
Sbjct: 107 LLVMARCR----NFDALDQILGEMSV--AGFGPSVNTCIEMVLGCVKANKLR-EGYDVVQ 166

Query: 203 F-----------AYHVLLNSLVEENCFDAVHVIVKQISLRGFENEVTHY-LMLKNFCKQN 262
                       AY  L+ +    N  D +  + +Q+   G+E  V  +  +++ F K+ 
Sbjct: 167 MMRKFKFRPAFSAYTTLIGAFSAVNHSDMMLTLFQQMQELGYEPTVHLFTTLIRGFAKEG 226

Query: 263 QLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYG 322
           ++D A + L ++       +  +    I +  K G  + +WK         L P E  Y 
Sbjct: 227 RVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIEANGLKPDEVTYT 286

Query: 323 VWITELIQAGKLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMME 382
             I  L +A +L+ A++ ++        +P  + YN +I       +  E + LL     
Sbjct: 287 SMIGVLCKANRLDEAVE-MFEHLEKNRRVPCTYAYNTMIMGYGSAGKFDEAYSLLERQRA 346

Query: 383 EHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDE 442
           +   P  I  N  +  L K+G VD AL ++    +   +PN   YN LI+ LC  G  D 
Sbjct: 347 KGSIPSVIAYNCILTCLRKMGKVDEALKVFEEMKK-DAAPNLSTYNILIDMLCRAGKLDT 406

Query: 443 AYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFIL 502
           A+ +  S    G FP  RT +I+ D LC+  KLD+   M      +   P + T+   I 
Sbjct: 407 AFELRDSMQKAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFCSLID 466

Query: 503 ALCRARRVEDGYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTP 562
            L +  RV+D Y ++ ++   +       Y +LI  F    R +   ++  +M  +  +P
Sbjct: 467 GLGKVGRVDDAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQNCSP 526

Query: 563 TRKLFRAVICCLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVY 622
             +L    + C+ +    EK      E++  R  P+   Y+  ++G   A       E++
Sbjct: 527 DLQLLNTYMDCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFANETYELF 586

Query: 623 QMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKIS----NTMVVGL 682
             M   G   +  +  +++  +    +++ A   L ++   +T G + +     +++ GL
Sbjct: 587 YSMKEQGCVLDTRAYNIVIDGFCKCGKVNKAYQLLEEM---KTKGFEPTVVTYGSVIDGL 646

Query: 683 CKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITS 742
            K+ + D A       + K +  ++  Y  L   F +  R D    ++ +L + G     
Sbjct: 647 AKIDRLDEAYMLFEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPNL 706

Query: 743 FLGNILLYSSLKTKKLYEAWV--NSREGQVETSQSSMLGLLIGAFSGHIRVSQSIKNLEE 802
           +  N LL + +K +++ EA V   S +    T      G+LI       + +++    +E
Sbjct: 707 YTWNSLLDALVKAEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQE 766

Query: 803 AIAKCFPLDIYTYNLLLRRLS-TNDLQQAFELFNRLCQKGYEPNSWTYDILVHGLFKHGR 854
              +       +Y  ++  L+   ++ +A  LF+R    G  P+S  Y+ ++ GL    R
Sbjct: 767 MQKQGMKPSTISYTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGNR 826

BLAST of Tan0006998 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 169.5 bits (428), Expect = 1.2e-41
Identity = 148/677 (21.86%), Positives = 294/677 (43.43%), Query Frame = 0

Query: 92  ALSSLGLRLNEPFVLDVLRFGSKNVLSCLKFFDWAGRQPKFFHTRATFNAIFKILSKAKL 151
           ALSS  ++L     LD LR    +  + L+ F+ A ++P F    A +  I   L +   
Sbjct: 45  ALSSTDVKL-----LDSLR-SQPDDSAALRLFNLASKKPNFSPEPALYEEILLRLGR--- 104

Query: 152 MSLLFDFIDNYVQQRMVHKVRFYSTLPMFALQLFGKMRFQ--------------GLDLDS 211
            S  FD +   ++     +    ++  +  ++ + +   Q              GL  D+
Sbjct: 105 -SGSFDDMKKILEDMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEFGLKPDT 164

Query: 212 FAYHVLLNSLVEENCFDAVHVIVKQISLRGFENEV-THYLMLKNFCKQNQLDEAETFLHD 271
             Y+ +LN LV+ N    V +   ++S+ G + +V T  +++K  C+ +QL  A   L D
Sbjct: 165 HFYNRMLNLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLED 224

Query: 272 LVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGK 331
           +   G   + +    ++    + G+ + + ++ E   +           V +    + G+
Sbjct: 225 MPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGR 284

Query: 332 LENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMN 391
           +E+AL F+    + + + PD + +N L++ L +   ++   +++  M++E   PD  T N
Sbjct: 285 VEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYN 344

Query: 392 AAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILK----- 451
           + +  LCK+G V  A+++ +       SPN + YN LI+TLC +   +EA  + +     
Sbjct: 345 SVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSK 404

Query: 452 ---------SSIDQGYF---------------------PGKRTFSILADALCREGKLDK- 511
                    +S+ QG                       P + T+++L D+LC +GKLD+ 
Sbjct: 405 GILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEA 464

Query: 512 ---MKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFA 571
              +K+M +    R+ +    TY+  I   C+A +  +   I  E+          TY  
Sbjct: 465 LNMLKQMELSGCARSVI----TYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNT 524

Query: 572 LIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLNEMENMEKQFFNLLELQLSR 631
           LIDG  KSRR + AA+L+ +M  +G  P +  + +++       +++K    +  +  + 
Sbjct: 525 LIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNG 584

Query: 632 QEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDAL 691
            EP+   Y   + G   A + E+A ++ + +   GI     +   +++      + ++A+
Sbjct: 585 CEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAI 644

Query: 692 NFLNDLYPTRTIGRKISNTMVV--GLCKVSKGDV--ALDFLRGIRDKGLIPSIECYEELA 711
           N   ++           +  +V  GLC    G +  A+DFL  + +KG +P       LA
Sbjct: 645 NLFREMLEQNEAPPDAVSYRIVFRGLCN-GGGPIREAVDFLVELLEKGFVPEFSSLYMLA 704

BLAST of Tan0006998 vs. TAIR 10
Match: AT3G22470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 160.2 bits (404), Expect = 7.3e-39
Identity = 127/535 (23.74%), Positives = 217/535 (40.56%), Query Frame = 0

Query: 200 YHVLLNSLVEENCFDAVHVIVKQISLRGFENEV-THYLMLKNFCKQNQLDEAETFLHDLV 259
           ++ L +++     +D V    K + L G E+++ T  +M+  +C++ +L  A + L    
Sbjct: 73  FNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMTIMINCYCRKKKLLFAFSVLGRAW 132

Query: 260 GRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVPMEHVYGVWITELIQAGKLE 319
             G   +      L+   C  G    +  LV+   +++  P        I  L   G++ 
Sbjct: 133 KLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVS 192

Query: 320 NALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDLLTEMMEEHISPDKITMNAA 379
            AL  L  R  +  + PD   Y  +++RL +        DL  +M E +I    +  +  
Sbjct: 193 EAL-VLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEERNIKASVVQYSIV 252

Query: 380 MCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCGDGSTDEAYHILKSSIDQGY 439
           +  LCK G  D AL L+N     G+  + + Y+ LI  LC DG  D+   +L+  I +  
Sbjct: 253 IDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNI 312

Query: 440 FPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDSTYDKFILALCRARRVEDGYL 499
            P   TFS L D   +EGKL + KE+    + R   P   TY+  I   C+   + +   
Sbjct: 313 IPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQ 372

Query: 500 IHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQEKGHTPTRKLFRAVICCLN 559
           +   +          TY  LI+ + K++R D   RL  E+  KG                
Sbjct: 373 MFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLI-------------- 432

Query: 560 EMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPELAREVYQMMLRSGIQPNLS 619
                                PN   YN  + G   + K   A+E++Q M+  G+ P++ 
Sbjct: 433 ---------------------PNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVV 492

Query: 620 SDILLLKCYLCSERISDALNFLNDLYPTR-TIGRKISNTMVVGLCKVSKGDVALDFLRGI 679
           +  +LL     +  ++ AL     +  +R T+G  I N ++ G+C  SK D A      +
Sbjct: 493 TYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSL 552

Query: 680 RDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVGRPITSFLGNILLYSSL 733
            DKG+ P +  Y  +    C+         L   + + G     F  NIL+ + L
Sbjct: 553 SDKGVKPDVVTYNVMIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHL 571

BLAST of Tan0006998 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 158.3 bits (399), Expect = 2.8e-38
Identity = 126/538 (23.42%), Positives = 232/538 (43.12%), Query Frame = 0

Query: 181 ALQLFGKMRFQGLDLDSFAYHVLLNSLVEENCFDAVHVIVKQISLRGF-ENEVTHYLMLK 240
           A+ LFG+M           +  LL+++ + N FD V  + +Q+   G   N  T+ +++ 
Sbjct: 65  AVALFGEMVKSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILIN 124

Query: 241 NFCKQNQLDEAETFLHDLVGRGEAVNGRMLGFLIGALCKRGNFERSWKLVEGFRDLELVP 300
            FC+++QL  A   L  ++  G   N   L  L+   C       +  LV+        P
Sbjct: 125 CFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQP 184

Query: 301 MEHVYGVWITELIQAGKLENALQFLYSRKSDESYIPDVFRYNMLIHRLLRENRLQEVFDL 360
               +   I  L    K   A+  L  R   +   PD+  Y ++++ L +       F+L
Sbjct: 185 NTVTFNTLIHGLFLHNKASEAMA-LIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNL 244

Query: 361 LTEMMEEHISPDKITMNAAMCFLCKVGMVDVALDLYNSRSEFGLSPNGMAYNYLINTLCG 420
           L +M +  + P  +  N  +  LCK   +D AL+L+      G+ PN + Y+ LI+ LC 
Sbjct: 245 LNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCN 304

Query: 421 DGSTDEAYHILKSSIDQGYFPGKRTFSILADALCREGKLDKMKEMVIFALERNFMPSDST 480
            G   +A  +L   I++   P   TFS L DA  +EGKL + +++    ++R+  PS  T
Sbjct: 305 YGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVT 364

Query: 481 YDKFILALCRARRVEDGYLIHGELNRINKVAIKSTYFALIDGFNKSRRGDIAARLLIEMQ 540
           Y   I   C   R+++   +   +   +      TY  LI GF K +R +    +  EM 
Sbjct: 365 YSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMS 424

Query: 541 EKGHTPTRKLFRAVICCLNEMENMEKQFFNLLELQLSRQEPNCAVYNNFLYGAALAKKPE 600
           ++G       +  +I  L +  + +       E+      PN   YN  L G     K E
Sbjct: 425 QRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLE 484

Query: 601 LAREVYQMMLRSGIQPNLSSDILLLKCYLCSERISDALNFLNDLYPTRTIGRKIS-NTMV 660
            A  V++ + RS ++P + +  ++++    + ++ D  +   +L         ++ NTM+
Sbjct: 485 KAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMI 544

Query: 661 VGLCKVSKGDVALDFLRGIRDKGLIPSIECYEELAKHFCQNERYDLVVNLINDLDKVG 717
            G C+    + A    + +++ G +P+  CY  L +   ++   +    LI ++   G
Sbjct: 545 SGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIKEMRSCG 601

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q8GZA6      3.6e-200  45.62     Pentatricopeptide repeat-containing protein At1g71210, mitochondrial OS=Arabidop...
Q9M907      3.0e-45   22.14     Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX...
Q76C99      4.3e-44   23.27     Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...
Q9LFF1      1.7e-40   21.86     Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q6NQ83      1.0e-37   23.74     Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...

Match Name      E-value  Identity  Description
XP_022946522.1  0.0e+00  83.61     pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita ...
XP_023545233.1  0.0e+00  83.39     pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita ...
KAG7030102.1    0.0e+00  83.39     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022999627.1  0.0e+00  83.13     pentatricopeptide repeat-containing protein At1g71210, mitochondrial [Cucurbita ...
XP_022156362.1  0.0e+00  83.52     pentatricopeptide repeat-containing protein At1g71210 [Momordica charantia] >XP_...

Match Name  E-value  Identity  Description
A0A6J1G442  0.0e+00  83.61     pentatricopeptide repeat-containing protein At1g71210, mitochondrial OS=Cucurbit...
A0A6J1KBC6  0.0e+00  83.13     pentatricopeptide repeat-containing protein At1g71210, mitochondrial OS=Cucurbit...
A0A6J1DT81  0.0e+00  83.52     pentatricopeptide repeat-containing protein At1g71210 OS=Momordica charantia OX=...
A0A0A0LM57  0.0e+00  76.55     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G034830 PE=4 SV=1
A0A5D3BBD3  0.0e+00  75.06     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name   E-value   Identity  Description
AT1G71210.1  2.6e-201  45.62     Pentatricopeptide repeat (PPR) superfamily protein
AT3G06920.1  2.1e-46   22.14     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53700.1  1.2e-41   21.86     Pentatricopeptide repeat (PPR) superfamily protein
AT3G22470.1  7.3e-39   23.74     Pentatricopeptide repeat (PPR) superfamily protein
AT1G62670.1  2.8e-38   23.42     rna processing factor 2
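
The percentages in the HSP header lines can be checked from the counts they report: each is the matched count divided by the alignment length. The following is a minimal Python sketch for illustration, not part of the database output. The helper name `recompute_percentages` is hypothetical, and the pattern accepts the positives label with or without its second "i".

```python
import re

# Matches e.g. "Identity = 680/906 (75.06%), Positives = 748/906 (82.56%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


trembl = "Identity = 680/906 (75.06%), Positives = 748/906 (82.56%), Query Frame = 0"
tair = "Identity = 385/844 (45.62%), Positives = 525/844 (62.20%), Query Frame = 0"
print(recompute_percentages(trembl))
print(recompute_percentages(tair))
```

The recomputed identity values agree with the Identity column of the summary tables (75.06 for A0A5D3BBD3 and 45.62 for AT1G71210.1).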
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 111..307, e-value: 7.5E-19, score: 70.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 308..469, e-value: 1.2E-31, score: 112.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 753..862, e-value: 3.5E-16, score: 61.5
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 650..752, e-value: 1.8E-7, score: 32.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 570..647, e-value: 2.9E-8, score: 35.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain  coord: 470..569, e-value: 8.7E-11, score: 43.5
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756  coord: 825..858, e-value: 2.5E-6, score: 25.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756  coord: 374..406, e-value: 2.0E-4, score: 19.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756  coord: 584..616, e-value: 6.1E-5, score: 20.9
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756  coord: 340..371, e-value: 4.0E-6, score: 24.7
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2  coord: 370..418, e-value: 1.1E-8, score: 35.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2  coord: 788..835, e-value: 2.8E-9, score: 37.0
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR  coord: 517..542, e-value: 0.075, score: 13.3
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR  coord: 340..367, e-value: 0.048, score: 13.9
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR  coord: 233..261, e-value: 0.79, score: 10.1
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR  coord: 444..465, e-value: 0.18, score: 12.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3  coord: 579..619, e-value: 0.0089, score: 16.1
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 581..615, score: 9.711769
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 822..856, score: 11.268274
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 511..545, score: 9.262356
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 441..475, score: 8.582755
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 336..370, score: 10.610596
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 406..440, score: 9.481582
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR  coord: 371..405, score: 9.218511
None  No IPR available  PANTHER  PTHR47938  RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED  coord: 71..858
None  No IPR available  PANTHER  PTHR47938:SF19  OS02G0127600 PROTEIN  coord: 71..858
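
Each `coord: a..b` entry above gives the start and end of a hit on the protein. The minimal Python sketch below shows how the GENE3D spans could be turned into lengths and sorted by position. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of output but is not stated on this page; the helper name `span_lengths` is hypothetical.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return sorted (start, end, length) tuples for every 'coord: a..b'."""
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        # Assumption: both endpoints are included in the hit.
        spans.append((start, end, end - start + 1))
    return sorted(spans)


# The six GENE3D 1.25.40.10 hits listed in the InterPro table above.
gene3d = """coord: 111..307
coord: 308..469
coord: 753..862
coord: 650..752
coord: 570..647
coord: 470..569"""

spans = span_lengths(gene3d)
for start, end, length in spans:
    print(start, end, length)
total = sum(length for _, _, length in spans)
print("residues covered:", total)
```

Sorting by start position makes it easy to see that the six GENE3D hits run nearly back to back from residue 111 to residue 862, with a single two-residue gap at 648..649.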

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0006998.1  Tan0006998.1  mRNA
Tan0006998.2  Tan0006998.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding