Tan0006148 (gene) Snake gourd v1

Overview
NameTan0006148
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG01: 56624947 .. 56641884 (-)
RNA-Seq ExpressionTan0006148
SyntenyTan0006148
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAGAGTTGGAAAAAAGGCGGAAGTGTTCTAAAAAATTGGGCGCCTAAAATTTTTGGAAAATTCGCGTCATTCTCTATCACACATAATTTTCTTTCACTTTTCCCTTTCCCTTCTCATCTCGTTTTCTTTACTTTTCTTCAGATTTTCCAATCTTCACTGCCCATCTCTACAATTTCGACGGTACTTACATAAAACGCCCATAATCATGCAAGCAACCCACGAACGATGGTGTCGTCGACGTCTTGGATAGCAGATGCAACGGCGAGACTCCTTTGAGGGACGACAGTGGCGTGAGCGAACAGATTTGAGCACCCATCGTTTGTACCCAACGACGCATTGACAAGTTCTTGAACGACGGCGACTTGAGGTCTCTATTCTCCGATGGCCATGGGAGCAACCTCCCGCTAATTCTTAAGGTATGTTTGTAAACTTTCATTAGTTCATTTACAAATCTTAAAAATGAATGTTAACCAAAATTTCTTTCTCTAAGTTGTTAGGGTTCTTTAAATCATTTTTTTGGTTAAGTACCCATCCAAGTTACAGCTCTTTTGTCATGCGTCTTTTACACTTACAACTCTTTTGTGCGGTCCTCAAGATCACCTTCCTTAGTGTGAAATGGAAGTGTCTCGAGTCATAGTTGCACTGTAGCGAATACTCTTTTTTTGGAGAAGTCCATTAGGGTTTTCCTTTTGGATTTCTCTCTATTTCTCACTTGATCTTGTTCTTGAGGCATTTGTTCTCCACCTTTGGATCTCCTACCGTCGATTATCTGAAACTCTAGCGATTTGGCTTTTCTGCTATTTTGTTAGTTTAGTTTCGGCTAATCAGAGTTTCATATCCGTATGGGTATTGTCTATGTCTCTCTCTAAGCTGGGTTACATTTCGTATTCGTTTTTGGCAATTTTACATATTCAACTCTTTGATTATTGTAATTGTAGTATGCAAACATGAAGGGCATCATTTATAGTCTTAAGATGAGAAACTAGCACGATCAAATCGATAAATTAATTTACATGTCCCAACCCCATTTTTCAGGTACTTTTAAGTTCATTACATATTGTCCATGAAGGCCACAATTTTTCGTTCTCCACAAATCTTAACTTCCAAAAAGTTTCGAGGCTTGGTTTTTGCGACCCACAAACATTGTGATAGTTCTAAAGTTGAATTTTTTTCCAAAGAAGTGTGTGAGATAATCCTGGACCGGTATCCAGATAGAAAAACATTGAATAAACTCCACTCCAAGATTGTCGTCAATGAACATCTTCATATTGATCCGACTCTATCTATCAAGCTTATGAGAGCTTACTCTGCGTGTGGGGAAACAAGAGTTGCACGATTCATATTTGATAGAACTTTTGAAAAGAATGTTGTCTTTTTTAATGTCATGATAAGAAGTTATGTGAACAACAACTTGTATGTTGAAGCTCTTTCCGTTTATCAAGTCATGTCAAGTTGCGCTTTTAATCCTGATCATTACACATTCCCTTGTGTTTTGAAGGCATGTTCTGGATTGGATAATTTGAAGGTTGGGTTTCAAGTTCATGGTGCAATAGTGAAAATTGGTCTTGACTCAAATTTATTTATTGGTAATGCTTTGGTAGCCATGTATGGTAAATGTGGTTGTGTAAGGGAGGCTCGAAAAGTCCTTGATCAAATGCCAAATAGAGATGTGGTTTCTTGGAATTCTATGGTTGCAGGGTATGCACAAAGTGGGCAATTTGATGAAGCATTGGAAATTTGTAAGGAGATGGATTCCCTAAAGCTGAATCATGATGCTGGTACTATGGCTAGCTTGTTGCCAGCAGTGAGTAATACATCGCCTGAAAATGTTCAGTATATACATAAGATGTTTGAGAAGATGGCTATGAAGAGTTTGGTTTCGTGGAATGTGATGATAGCAATATATGTGAATAATTCTTTGCCAAATGAAGCCGTCAATATATTTTTTCAGATGGAGGAATGTGGGATGAAACCAGACGCGGTTACAATTGCGAGCCTTCTTCCGGCTTGTGGGGACCTTTCAGCACTATTTCTAGGAAAGCGTCTTCATGAATATATAGAGAAGGACAATCTTCGACCAAATTTGTTATTGGAGAATGCATTACTCGACATGTATGCTAAGTGTGGATGTTTAGAAGAAGCTAGGGATGTATTTGATAAAATGAAATTTCGAGATGTTGCGTCATGGACATCAATGATGTCTGCATATGGAATAAGTGGTCAAGGGTATGATGCGGTGGCACTCTTTGCAAAAATGCAAGAGTCAGGTCTAAATCCGGATTCCATTGCTTTTGTTTCTGTACTCTCTGCATGTAGCCATGCAGGATTGTTGGACCAAGGGCGCCATTACTTCCATGTGATGACTGAACAATGTAGGATAGTTCCTAGGATAGAACACTTTGCCTGCATGGTAGATCTATTAGGACGAGCTGGTGAAGTGGAAGAAGCATATAGTTTCATCAAGCAGATGCCTATGGAGCCAAATGAAAGAGTTTGGGGTGCACTTTTGAGTGCTTGTCGAGTGCACTCGAAGATGGATATTGGACTCCTAGCTGCTGATTGTCTTTTTCAGTTGGCCCCTAAGCAATCAGGCTACTATGTCTTGTTATCAAACATTTATGCAAAGGCTGGTATGTGGAGAGATGTAGGGAATGTTAGATATGCAATGAAGAAGAAAGGAATTAAGAAAGTGCCAGGCATTAGTAATGTTGAGATTAAGGGTCAAGTTCATACATTTCTTGCCGGTGATCAATATCATCCACAGGCAAAGAATATATATGAAGAGTTGGATGTATTAGTTGGGAAGATGAAGGAGTTAGGTTACATTCCTCAAACTGAGTCGGCTCTCCATGACGTGGAGGTTGAGGAGAAAGAATGTCATCTAGCTATTCATAGTGAAAAGTTGGCAATTGTTTTTGCTATTTTAAACACGAAGCCGGGTACACCAATTCGAATTACCAAGAACCTTCGTGTTTGTGGGGATTGCCATATTGCAATTAAATTTATCTCCAAAATAGTGTCACGTGACATTATTGTTAGAGATTGCAATCGTTTCCACCATTTTAGTAATGGAATTTGCTCATGTGGAGATTACTGGTGAGCCTCCTAGATGTTTAAGTAATGATGGAGATCATTCGAGGAATGGACAAGCTTGTGGTGCACCATTATCCTTGCCTTTGAAGCTATAGGTGATATAAGTTTTCGTCTTTAATCATCTATTCTCACAGTTCTCATTTGTATTTTTGGTGTGAATTGTGAAAATTAATGTATTTAATTTCAACTCCTTCCACAAGGGCGGGACTTCATTTGGAAAGGAGCTAATTATGATCATGACTCTCCCTTACTTTTTTGAAAGGGTCACCAGTGAAAATGGAGAATTAAAGGCTATATAGGCTTTAGGTAAGGCTACATACTTTCCGAAGGCAGAGCAATGAGACTAGAAGTCCAATAGCTATATGTGAAGGAGCAATTGATGAACAAATCTCCATGTGTTTCAGCCTCCTTCCTACAAAGAACGCACCATCCCAGAGAGAGAGGAGGGGTAAGGTGTATCCTCTGTATCAAATCATGCATTTTTATATTTTCGTGGCTTATTTTCCTTGGAACTCAGGCTACATACTTTCCGAAGGCAGAGCAATGAGACTAGAAGTCCAATAGCTATATGTGAAGGAGCAATTGATGAACAAATCTCCATGTGTTTCAGCCTCCTTCCTACAAAGAACGCATCATCCCAGAGAGAGAGGAGGGGTAAGGTGTATCCTCTGTATCAAATCATGCATTTTTATATTTTCGTGGCTTATTTTCCTTGGAACTCAGGGACACAGGTTGTAAACTGTACTGTTCAAGAGTTTGACTTTTTGGGGAACTTTCCATTCCAATTGTTGTGGTAAGTTCTTTTGTGGTGACCTTTGTGATATTTTTTTTTTTTGAGGGAGCCAACTGAAAAAGAGTTGGATGCATCTAGGGGCTATTATCATACATCTTGTTGGTCCCTTCTTGCTCATCGTACCTTTTTTGGTTCAAGGCCATTCGATCAAGTTGGCGAGCCATATTTTCCATTAACTACTGCATCTCCTTCATGTCGTTTATTATTTTGTCCATTGGAACTTCAACTGATTGCAATCAATAGTTCAAAGCCTGTGGAGAGGGGTGCTTTCGCATCGAAGTTGGGTTGAACCTGTCAAGTCCCAATGTTTCTTCCTTCCTGTCAATAAGGTATAGGCCCTTTGACAAGGTGCTCTGATACCACTTTTGGTGTAAACAAATATAGGAAATGATGTTTTACTTTTTTTCTGAAACAAGAAACAAAATGTTGAAGAGATGAAAAGAGACTATTGCTAAAAAATACAAAACTCCAAAAAGGAGATTAGAATCCAACAAGATGAGTACAACTTAAAAAAAACTAATAAAAAACATGCCAATTTAAATTTATATAAAAAATAGAATAATCCTCAAATGCTTTGGAGAGAGTGCACTAGGAAGAACCCTTGAGAAGAATAACATCGAATCTAACGCCCCAAACAAAAAACTCGTCTTCCAAAGTTCTTTAATTTCTTTCAAACCAAACATCATAAAAAATAGCTTTTACCTCATTCACCGAAAGGAGAGAAGCTTTGCCATTTAACTAAGGCAGAATGATAAGTTGTACAATAGTCATTGACACCATTGGAGAACACCAAATTTAAGGGCCTATTTAAGGCACAAGTTATAATAACCGGTGGAATACTCTGCACCCCAAACACAAACTATTATAACTCCAAACTATACTACTCTGTATTTTAAATAATATTATATGATTCTACATATTATAATAACCTATAGACTATTATAACCCCTTTTATTATAACCCACTCAACGCCCCAAACAACCCCTAAAAGAACTTGAACCAACACTTAGAAGCAAAACCAAATTAAAAAAAAGAGGTAAAACATGGATTCAAAGGAATAACCAATTAATTAGCTAAAGCCTTCAAGCGCTATTTCTTAACCAGCTAGCCTCCAACAGCCCGTTTATCTTCTAAAAGGAATTTTTACAATGAAAATCTACATCGTGTTCTTAGTTTTTGGTTGGTGTTCTTTTCAATTTAAAAGTTTTAATTTGATGGGCTGTTAACTTTTTCCTGCTAGTTTCTTCTGCATCTTAGATTATTTCTTCCCAATTATGATGGAATATTTTGTGTTATTTTTGTTGTATCATTTGTTTTGGTTTATGGGTTTTATCTGTATCTTTGTTGGATTTTGGTGGGCTGTTTTGAAATCTCTGACCGGGAATTTAGTGTAATTGGTAAATTTTCATATTATCAATGAAAACTTTTGTTCCTTTATCAAAAATAAAAATCATAATGGTTAGATTAATGTAAAATTTTCTACTAATGTTATACTGCTGGAAAAAATCACATCCTTATAATAGCAAACCTTATTATTGATATAGATGAGTCAGGAAACAAGCATCTCTCGCTGGAAGAACTACTATTCCTCGTTGTATTTCACAATATTCTTTACCCTCTTATGCTAAGCACTTTTCCCTTTGTCAACCCTCCCAAGTCTCAAATCAAGGTCTCTAACTCAAGTTTTTGTTTTGTTTTATTTTTATGATAGAACGGTGGGAATGGAACAAGACAGGGAAGAGGCGCGATAGACGTTAAGAGCTTGGGGTATGTGGATCAATTGGTGGAGGCATGCGAGAGTTTAGGCTGGAGAAATCCTTCTAAGATTCAGGCTGAGGCCATTCGTCATGCACTAGAAGGTACGAGAAGGTTCTTTCTTTAGCTTATGCTTTTTTTCCTACACAAGATACAAAACTTTTTGTAAATCACCAATCAACCCAAAAACTTAAGTTTATAGGTTATGACAAATTTAATTATATCAATACCAACACCTCCTCACTTGTGAGCTTAGAAATTTGAACAAAGGCCCAACAAGTGGAAATCAAACACTCCCTCTAACTTGTGGCTTGGTAATTTGAACAAAGGCCCAACAAGTACAGATCGATATTAATTGGGAGAAAACGACTTCACCAGGGTTCGAATACAGGACCTCCTGCTCTGATACCATGTAAATCACCAATCAACCCAAAAAGTATAAGGTTAAGCTTATAGGTTATGACAAATTTAATTATATCAATACCAACACTTTTCATTGATGTAATGAAAAGAGACTAATGCTCAAGATATGAACTCCTAAAAGGAGTGAAAACAGAAAACAAAAAAATAAGAAAAGTGATATAAAAAACAATTACAATGGAACAATAAAAGCATTCCAATTAGAACAAATGTGCTGTAAAGATAAACTAGCAAAAGACTTAGAAAGGGTGTGCCAAGAGGAAGCCTTTAAACATGCTGACCAAGGAAGCCTTTCCTCGGCTCATTAACTGTTGGCAATGATTTCAAAAGCAGTTGAGGCCACGGCTTAGGCAGTTGTGAGGTGTGTCAAAGCTTACGCCTATGCTGAACAAAAGGAGGCTTACGCCTTTGATGCGAGAGATACACACCTCCATGCTTACGCCTTTAAGGCCTGAAAGGCATAGGCCTCCCTGCATAAGTAGTGGGCTGGTAGGGAGTTTATGTTTAATTGGATGAGAGTACCTTGACATTAATTCTCTTGCAGACTTAAACAATATGGCACATTATCTACGCGAAGGTCTCACTATTCTAGTCCTTAAAAGTAAGTGGTGGCACACTAAAAAAAACAGCTATTTGTGTCCATTTGGAGAAGAAACAAGTGTCTGCCACCACACACCACAAAGTTCTCGTTACTATTTTTCCTTTTGTCTTTTCTTGGATTATTATTTATTGCCACCTTAACCATTTTTTAAGAAGAAAAAGAGTCTGATAGGTCTATGGCCATTGCTTTTCTATCTTTTCTGTGGCCGCAAGGCAAAAAGTTATTTTTCTTTCCAGTTTTGTTTCTCTAAGGATGAGCTTTAGATTTTTAACTATCATTTTTTTCCTTTTTAATGTTTAGAAGGTTCCCTCCAGCATGTTCTCTTTAACAATTTCTTCTTTCTCCACACCTTGGAACCATTCTACTCGATCAATTTCAATTGGCCGCAAGACTTTCTCCATAGCTTTTAATGTTCCATCGCGGAGAAGTAAAGCAAAAATCACAGAAATTGGGAAACTTTCCTCCCACTCGCTTTCACTATCATGGCACACTCTACATGGGATGGTTTCCTCTTTTGAATCCCTCTTTCGCGATCCTTGCTTCTACAAATTCTTTAGAAAACATCAAACAATTGAGGACACGTTTTGGATAGAAAAGCTTAGTAACAAATTTGGCTTCTATGTGGAAGTTACTCACCTATGTCAATTCGGTACAAGGAAGAAGCTTATTATCCCTTCTGAAGACAATAACAAGGTTGGTTTTTTTTGCCTCTCTCTTATTTCAGATTATCCGAGTGAGGCGCACAAAGCTGTTGATCTTGCTCACACCTCTTTCAGCAACGCTCTCAAAAAGAACAATCACCAGACTATTAATTCACAAGTTGGTAATGCACACGACTCTTTGGACTGATCACCCACCACATCACCTTATCCGAATCAGTTTGAGAATTGGAGCAACATTCTAATTTGTCAAAAGATGATGGAATCCGATTCTTGGTTGAACATTAGCACAACTATCCAAGAAAACTTCTCATCGTGATGTACCATCAATCTTTTTCACGACAACCAGGCCTTGATTCACCTCTACGATCCCTTATTGTCTGCACATTTATGTCAAAACTCTTCTTGGCAACTGATTGGAGCTTTCAAATTGAGATTTAAATCGGTCAGTACTAACTCTCTGATGCAGGATAGGAAAATAACCTCATATGGTGGTTGGATAGATATTTTAGATCTTCCTCTCACTCTCTAGAAGATGGAGGTTTTTTGTTACATTGGTGATCAATGTGGTGGTTTTATCCAAACAGTCAACCATACCGAGAGATGGTTGAACTTGGTAGCGGCTCGCATAAAGGTCAAGGAAAATGGCATGGGTTTCCTTCTGGAGAGATTCTTGTTCCAACTTCCATCGCCAGTGAAGCAGCCACCGTGAGGATCCAATCGTTTTCGGTGAAGAACCGTAGGGGTGCATTTAATTCGAATCTGGATTCTCCGACAACAATTATTGCGATTGATGATACTTTGAATTTGGCAGATTCTTCCAGAAATCTTAATGGAATTGAAGGCCTTAATTGCGATGATTTGCCAGATTTAAAAAGGAAGGAATTCTCTGATCTTGCGGCTGATCCTCGCGATATTCACGGAAAATCTGGCGTTGAATTCCCTTCTCGGATTCCCTTGGTCACGAATGGAAATTGGAGGAAGATACGCGGTGATTTATCTCTTCCGAAATCGAATCCTTTATCCAACCAACCTTCTCCCTCCACGTGTCCAGACACCGATACTAATACAACCCCACCAAATCTCGCACGACCTTTTCCCAAACCAATCCCTTTAGTCAACCAGCCTTCTTTGGCCACGTGTACTAAAGCCCTAATCAATTCGCTCTCACCAAACCTCACACAACCTACTTTATCGACAATTTCTTTTGCGGCTGGCTCCTCCATTGTTGACCACCCCACCCTTCATTTAAAAGGGTCAACATCACTAGAAAACCCCAATCTACCCTTCACTGTCTCTGATACAGAAGCTTACCTTTCTAGCCCCACTTCTCTCAAATCTAATCCTGATTTCGAGTTTCACCCTTTACCCATTGACCCGGCTGTATTTGGCGATAACCCATCCATATCTTCTGCCCTCCCCATCAACATTTTACCACCTGTCCCTATCAACATTCAACCTTCACCAACCCGACCTGAACCTGACTTGCTTTCATCAAAATCCCCTTCTGCCTTACTTCACTCTCCACCGAACATACATTTTCACCCTCCAACCTCACTTCCCTTCGATTTTGGTGATATTGTCAATTATCTAATGACACATGGTTTGTGTATAATGGCCATTCCACCGTCTAAATAAAAAAAATCATACAATCCGGTGACTAAGAAAAATAAATTGGAGAGGGAATTGCAAAATTTGAAATCTGATATTCACCATGATAAATCAGTTGCTTTGGCCTTATTGGAGGATCATCAGGGGATTAATGAAATTGGTATCTTGGAACGTTAGAGGGTTGGGCTCGTGGAAAAAACAAGCTTTGATTAAACGTTGTATTACTCAACAAACCCCGGGTATAGTTCTCTTACAGGAAACAAAATTATCTGCAGTTGATCATCACTTGATTAAATCTATCTGGAGTTCAACACATATTGGTTGGGCAACCCTAAACGCGGTTAACTCCTCTGGTGGCATTTTAATTCTTTGGAGTGAACCCGATTACACAATCAAAGAAGTGATAACAGGTTTGTACTCTTTATCGGTCCACATCTTTTTGGTTGATGGTTTCCCTTTTTGGCTCACAACAGTATATGGTCCATCTGAGAATGGTTTTCATGCTGATTTTTGGAATGAATTGGACGACTTGGCTGGTTTGGATGGAGATATTGGATTATTGGCGAGGATTTCAATGTCACTAGATGGACTTGGGAAAAATCACATGACAATTCAATCACTAGAAGTATGAATACTTTCAATCAGTGGATTTCAACTTATGATTTGATCGATATTCCGCAACAGAATGGTCGTTTCACTTGTTCCAACACCCATTATTGCTCTCTGTTGGGCAGATTTTTAATTACTAAAGGTTGTTCCAACAAGTTCGGCTCAGTTGCTCTCAATCGTTTGGCTCGAATTACATTAGACCATTATCCGCTAACTCTAATTTTTGGTAATGTTGTTTGGGGTCCCGCTCCTTTCCGTTTTGAGAATTCATGGCTCCAGATTGAATCAATTAAAGATGTGGTATCAACTTGGTGGAATCAGAATCACTTTGAAGGCTAGCCGGGACATGCTTTGATGCAAAAATTAAAAGGTCTCAAACGAGTCTTATTAGACTGGAGTAAAGGTAACTGTTTTGTAGCTTCTAATTTCCATTTCCTTGTTTTGCAATTGCAAACTCTGGACAATTTGGAGGATAGTGGGCCTTTAACTATCCCCCAGATTGAAAACCGTCAGCTTCTACGAGATCAAATCGAATCCTTGATTGCACAAGACCAGGTTTATTGGCATCAGAGATGCAAGCTAAAATGGCTTAAGGAAGGTGATGAAAATACCAATTTCTTTCATCGGATCTTGGCTGCAAGGAAGAGAAAAAGCTCGATTTCTGAAATCTTGTCAAGAGAAGGTGTTAGTCTTACTTCTGCATTGGATATTGAAAAGGAGTTTATTGATTTCTACTCAAAGTTGTATACCAAAGATGTTAACTCTCGATTTCTACCCACTAATGTTGAATGGAGTGCTATCTCCTCAGAGGAAGCAACGACTCTTGAAGTTGCCTTTACAGAAGAGGAAGTTTATCACGTTGTTTCTTCTCTGGGTTCCAATAAATCGCCAGGACCTGATGGCTTTACAGCTGAGTTCTTCATATTCTTTTGGGATATCCTTAAACCTGACATTATGAACCTGGTTCAAGACTTTCATTCTTCTGGTATCATCAACGTTTCATTGAATGAAACTTATATTTGCTTAATACCAAAAAAGTTGGGATCAAAATCAGTGACTGAGTTCTGCCCTATAAGCCTGATCTCTTGCGTTTATAAAATAATAGCGCGCGTCTTATCTAATCGTTTGAAATCTGTTTTGTCATCAACAATAGCAGAAAACCAGCTTGCTTTTGTGGCTAACCGACAGATTCTCGATGCGTCTTTATTGGCAAATGAACTAATAGATGACTGGAATATCCGAAATATCAAAGGAGTAGTTATAAAACTTGACCTTGAAAAGGCGTTTGATACAGTTGACTGGGACTTTGTAGATGTCGTGCTGGAGGTTAAAGGTTTTGGGGCAACCTGGAGGAAGTGGATTCGGGGTTGTATTACGAGTACCAATTATTCCATCATTATTAATGGTCGCCCTCGTGAGAAGATTATTCCATCTCGTGGTATTCGGTAAGGTGATCCTCTATCCCCATTTCTTTTTATTTTGGTGTCTGATTGTTTGAGCTGACTTCTAGACTATGGTGGTAGATTGGGTAGTATATCCTCTCACTTTATTGGTAACTCATCTTTTGCCTTAACTCATTTGCAATTTGCGGATGATACACTTCTTTTTTCAACTTCTGATCATTTTGCCTTGCAACGTCTCTATGAACTAATCAAGATTTTTTAGAGTGCTTTTGGTTTAAGGATTAATCTGGCGAAGAGTGAGATGATTGGCATACATATTGATCAATCTGAGAAGGACAAACTACTGGCAATCTTTGGTTGTCAACAAGGGTTTTGGCCAATCAGCTATTTGGGCTTGTCATTGGGAGATAATTCTTGTACAGTTTGTTTTTGGCAACCTATTATAGAGCGGTTGAAACAGAAACTTCATAATTGGAAATATGCATTTATTTCTAAAGGTGGTAGGCATACTTTTATTCAGGCCACACTCTCTAGTATGCCTACTTACTACATGTCTCTTTTCAAACTTCCATCGAAAATCATTTCAATGCTGAATAAAATTATACGTGATTTCTTCTGGGAAGGAGCTAAGGGTGATGGTGGAGTACATAATGTAAATTGGGCAACTGTTCAGCTTCCTAAGCTCATGGGTGGTCTTGGGCTTGGTAACTTTGCACATCGAAACTCGGCTTTGCTGGCTAAGTGGATTTGGAGATTCATGCATGAACCTAACTCCTTGTGGCGGAAATTTATAGTAGCTAAATATTATGGTTCTTTGAATTCTGTTGACTGGCCCCCGATTATTATGAATACTAAACACAAATCACCATGGAGATCTATTTGCCTTTCACGTGATTTGGTGATTCGGCGTTCTAAGCGTTGTATTGGGAATGGATGCTCCACAACTTTTTGGAAGGATCATTGGCTTAGTTGTGGACCTATTGGTTAGGCTTTTCCACGACTTTTTAGATTGACTTTGAACTCTGATTTGACGGTGACAGATATGTGGAATGTGACCGAGTTGGCTTGGGACCTGCGTCTTCGGCGTAATTTAACAAATTTAGAAATTAATGAGTGGACATCTCTTTCTCATTTATTATCGAATGTTCGTTTGTTGAATAGACTGGACTCGTGGGAATGGTTATTAAAGCCTTCTGGTTTCTTCTCTGTGAGATCTCTCACTGTTGATTTGTTGGATTGTGATATCTTGGGGTGAAAGGATACCTATTCAGTCATTTGGAAGGATTCTTACCCAAAAAAGATTAAAATCCTCATCTGGGAGCTCAGTTTAGGAGCTGTTAATATTGTTGATCGTCTTAAAAAACGAATGCCCTACATGAACTTATCCCCCTCTTGGTGTCTTATGTGCAAAAAGAACTCTGAATATGCGGGTCATCTTTTCGTGCATTACCCCTTTTCTTTAAATTTTTGGTGGAAGATTCTGAATGCTTTTGGTTGGTCATCGTCGCTTCCTATTAATATCTTTGATTTCCTATCCTCTATTTTAGTTGGACATCCTTTCAAGGGCATTAGAAAAGTCATTTGGTTGACTTTAGTTCGAGTCTTCATTTGGAATCTATGGATCGAGAGGAATAGCCGATTATTCAGGGATGTTTCTTCTACTATTGATCGCTTTATGGATAATATCCTTTTGAATGTTTATCTTTGGTGTAAGGATGTTTATCCCATTACCTTGTTTAACTCTTCTTTTTTTCATTTATAATTATAATCTTTCTTGTAACACACCTTTTGGTGTTTGAAGTTTCCTCCATTTCATTCATCAATGAAATTGTTTCTTATATCTAATTTTTTTTTTTTAATTAGAAACAAAACTTTTCGTTGAAGAAATGAGAAAAGGCTAATGCTCAAGAAGATACAAACCCCACAAGAGAATTGAAAAAAAAAACAAAATTACAAAGAAAAAATAAAAACATCCCAATTCAAAACAATATTTTGTAACGGAAAACCAATACAAAGCTTAGAAAGAGAACACCATGAAGAAACCTTTAATTAGCAAACCCAAAACGGTCAATCCAGTGTAAGTGTAAGGGCTTGTTTTCAAAAATCCTTTGATTCCTCCCCAACCATATTTTTGAAAATAAAACTTTAACCATAATCTCTCATAATAATCTTGACTTGGGAGCCAAAGAAACCCCATAAGAAGTTGAAGAACATTGCCCCTGGGCCCTTATGGTGTTGTAGAAAACCCAATGAACTTGGATATAAGAGAATAACTTGCGCAAGAGAATAGCTTGTACCAGTAGGCCCAGGAATAAGAACGATTAAGGACCCGTTTGATAACGTTCCCGTTTTCTGTTTCTGTTTCTCATTTTTTAAGAAACAAACTTGTTTGATAATCCATCATGTTTGTTATTCCCAAAATTTGAGAAACGTTTCTAAAATTGAGACTAAATTTTAGAAACTACAAATAGTAGTTTTTTTCTGTTTCGTTTTTTTTTAATATTTAAATGTCCCTAAACTGTTCTAAATTCCACCATTGCTCTGTTGGTGAAGTGCACGCCTACATCTCTATATCTTGCGAGTTACGTATTCGAATTCCAAGTAGAGCAACTGTGTTCTTTTTTTTTTTTGAAATTATATTTTTTATATATACATCGGGGTTCGTGTTCACTCTTAATGGAAGAGTTGTAGTGTGGCGAAGCATCAAGTAGGGATGCATCATTGATTGCATGATGGAGGTCGAGTATGTTGCGGCTTGTGAAGCTGTTTGGCTCAGGAAGTTCATGTTAGATTTGGAAGTTGTTCCAGATATGAACTTGTCAGTGATCCTGTTTTGTGACAACAATGGTGCAGTAACCAACTCGAGAGAACCTCAGAGTCACAAGAGGGGCAAGCATATTGAACGCAAGTATCACCTGATACGGGAGATTGTGCACCGTGGAGACGTGACTGTCACGCAGATAGCTTCGGAGCACAATGTTGTTGATCCGTTTACAAAGGCCCTCATGACTAAGGTGTTTTAGGGTCACCTAGAGAGTCTAAGTATTCGAGTCCGAGACTAGAGCAAGTGAGAGTATTGTAAAGGGTATTCAATGCCCTAGTTTATTGTATTACTTTACATTTTGGAAATGTATTTGTACTCCACTAGCTTAGTTCAAGTGGGAGTTTGTTGGGGTGTATGCCCTAATCTCATGTAGTTTGTAATACAGTTGTTTTAATAAATAAATGTTGTTTATTTATCAAACTTTGTATCTCGCATTTTATAAACTTAATTCAATAAACTAAGCTCTAGGGTCATTATTATGAGAGAACTTGAATGGTGTGTAGTGGACACACGGGGAGGATCACGTTCGAGTTAATGACCCGAATGGTCTACAATATATGAATAGGGTTGGGAACTTCATCCTAACAACACTGTGGATGCGACCCGCTTTGTATTGGATATAAACGAGGTGATCCAGCTCGTTCATGTAATTGACATGCGAGTGAGGGCATCTTGTGCAATGTGATTGTACAAGATCGAATCGTGAATTATTAGATCTACTGTTGTAGCACCGTTAACAGTATAGGACTAATAACTTCGTAAATGACCATTAGTGACTCGTCTTTAATCTTGAGCGTGTTACAGGCTCCTGTCTGTGGGGGTCTGTCCTTTGACTAGTAAGGGTGAGGGTGACCTGAGTTGCTCGCCCAATATGCCTATATTTTTGGGAGCTGGGAACTTAACTACGCAAGAAGGAATTCACTCGTTCCTGAGCTAGGGAAATTAGAGGGGTTGTTCCCTTAAGTCAGTCTCTAAGGCTTGGACAACAAGCACTCACCCTCTCACTGACTCGAGATGGATGTTGTTTATGGTTGGACCATGCAACATAGTTGTTCATTAGAGGAGTAGTGGGACTTAAGGAACAAGAAGTAGACACATGGGTAAAAAGGTAATTTGACTCAGTTGTGACTACGAACAACCTATGAAGGGTCGATTCACAGACATTGGTTATATCGATGGACACAACTTGTCTTACAGTGAGGAGAGTGCAACTACGGGCTTTAGTGGAGTGTTACTGTAGTTAATGAATGTTGATTAACCGGGTCAAAAAGTTTGATCGGTTAATCTCGTATCATTGGAGCTCGTCATCTATAGATTCATTAGGTCCCTCGGTTGACTCAAATTTGACTAAATGAAGGTTTACAGTTTGGAGTGGATTTGAAGCGTTCAAATCTATTAAAAGGGTATTTATGTCATTTATATATTATATAATGGACATAATATATAGTTTAATGTGAATGAGATTCATAATAAACTATAGTGTAGCCTAATTATTTAATTAAGGTTAATTTGGAGAAATTGATTTGAGAGAAATTAAATTGAATATGATTCAATTTAAAATTTCACAATTATACAAATGTGATTTGTATAATAAGGTATGAGATATCTTATATTATGTGATATGCAAATATGATTTGCATATTATGTAGTTGTGGTAAAATGGTAAATGGTGGAGATTTTAACTCCACCACTTGACATTATAAATAGGTGGAATGTGAGACATTCCATAAGGTTGGTTCTCATTTTTTGGAATTCAAGCCAAAGAGAACAAGTTTCCTCTCTCAATTCTCTCAATCTCAAAACACTCTCCAATCCGAAATCCAAAAGGAGTCTCACCACTCCCCATCATCCTCCCGAGAATAATCGAGAAGATCAAGTGGAGGTGTCCCCTTGCATCAGTTCGTCACTTTCGTTGGAGAGCTCGTTGTCGTGACCGAAGAAGGGAAAACTTGAAGAATGGTTCTTCAAGAGGTAGGTTTTCGAAACCTTACTATTTGTCTTCTCTAAATTGCATGCTTAGCCTAAACTGTATGTTGATACTTTCAATTTTAAGAAAATTAGAACAACAACGAAACAAACGGTGATCCGCATCGCAATTTCGCTGCAGGAAATCCATTCCCTTCATTAACAGTTTTCAACCTTCCAAAAACTCATGTGAATGACCAAAATAACAAAATCCAGCAAGACAAAATACTTTAATGACGTTAAGGTACTTCACAACTATTTTGTGACAATTAGCACACAAAGGTACCTAATCAAGGAGTTTCTAAAGCTAAAATAGTAAAAGAAAGTCCAAAATACCAAGAAATAATCATAATAAAGGCTCAATCATAACTCTTATTCTAAGAGTTAACAATGGGTTACATAAATGAAACACTATTTTAGAAATTAAAAAATTAGAAGTATATATAAGAAATTAGGATATTTAAAACTTCATTTCATAAAGTATTCAAAAATCTATTTATAGAAGAAATAAAAAAATTGAAGTTTATTTAAGAAATTAGCAAATTTAAAACTCTATTTCATAAAGTATTCAAAATCCTATATTTAAGATTCTATTTTAGAAATTAAAAAACTGAAAGTTGATAGGACTCAAAATCGTCCTATAAGTTAGGCCTAATTCTTGTTAGTTTGGTGATTTTAAGAGAATTTTTGTTTATTTTGTAGGAAATTCGAGCAAAAGTGCAATTTGGAGAGAATGCATCCAAATAGAGTCAAAACGAGCCAAAACAAAGGTCGTTCGAACAATCGAGCAACCAAAAGAAGTGAAAATATCATTTTGCTCCTGGCGAGAGCGTCTAGACACTGGGCGTAACGTCTAGACTCGGGCGCATTCACAAAAGGTGCGGATGCAGGGTAGCGTCGAGATGTTACGTACTTTGATGGGACCGGGCACGCTTCCACAACATCGAGATGCTTTCACAATGTATAAATGCTTTTTTTTGTTTAGGGTTTTGAAGAGAGAGACACTTTAGGGACCAATTCTCAATAAGCTCTCGAGTTATTTTTAGCTTAGGATATGTAGTTTTATGTTTTTCATTGTGATTATATGTGTATCGGGCTTGAGTTTTTCGACTCTGGTTAAGCGAGAATGTCGATTCAAAGGGAAGCATTAGGCTATGTAACGCTTTGAGGTTTTTTTTTACAATGCAATTTCTATTTCATTTT

mRNA sequence

GAAAGAGTTGGAAAAAAGGCGGAAGTGTTCTAAAAAATTGGGCGCCTAAAATTTTTGGAAAATTCGCGTCATTCTCTATCACACATAATTTTCTTTCACTTTTCCCTTTCCCTTCTCATCTCGTTTTCTTTACTTTTCTTCAGATTTTCCAATCTTCACTGCCCATCTCTACAATTTCGACGGTACTTACATAAAACGCCCATAATCATGCAAGCAACCCACGAACGATGGTGTCGTCGACGTCTTGGATAGCAGATGCAACGGCGAGACTCCTTTGAGGGACGACAGTGGCGTGAGCGAACAGATTTGAGCACCCATCGTTTGTACCCAACGACGCATTGACAAGTTCTTGAACGACGGCGACTTGAGGTCTCTATTCTCCGATGGCCATGGGAGCAACCTCCCGCTAATTCTTAAGGTACTTTTAAGTTCATTACATATTGTCCATGAAGGCCACAATTTTTCGTTCTCCACAAATCTTAACTTCCAAAAAGTTTCGAGGCTTGGTTTTTGCGACCCACAAACATTGTGATAGTTCTAAAGTTGAATTTTTTTCCAAAGAAGTGTGTGAGATAATCCTGGACCGGTATCCAGATAGAAAAACATTGAATAAACTCCACTCCAAGATTGTCGTCAATGAACATCTTCATATTGATCCGACTCTATCTATCAAGCTTATGAGAGCTTACTCTGCGTGTGGGGAAACAAGAGTTGCACGATTCATATTTGATAGAACTTTTGAAAAGAATGTTGTCTTTTTTAATGTCATGATAAGAAGTTATGTGAACAACAACTTGTATGTTGAAGCTCTTTCCGTTTATCAAGTCATGTCAAGTTGCGCTTTTAATCCTGATCATTACACATTCCCTTGTGTTTTGAAGGCATGTTCTGGATTGGATAATTTGAAGGTTGGGTTTCAAGTTCATGGTGCAATAGTGAAAATTGGTCTTGACTCAAATTTATTTATTGGTAATGCTTTGGTAGCCATGTATGGTAAATGTGGTTGTGTAAGGGAGGCTCGAAAAGTCCTTGATCAAATGCCAAATAGAGATGTGGTTTCTTGGAATTCTATGGTTGCAGGGTATGCACAAAGTGGGCAATTTGATGAAGCATTGGAAATTTGTAAGGAGATGGATTCCCTAAAGCTGAATCATGATGCTGGTACTATGGCTAGCTTGTTGCCAGCAGTGAGTAATACATCGCCTGAAAATGTTCAGTATATACATAAGATGTTTGAGAAGATGGCTATGAAGAGTTTGGTTTCGTGGAATGTGATGATAGCAATATATGTGAATAATTCTTTGCCAAATGAAGCCGTCAATATATTTTTTCAGATGGAGGAATGTGGGATGAAACCAGACGCGGTTACAATTGCGAGCCTTCTTCCGGCTTGTGGGGACCTTTCAGCACTATTTCTAGGAAAGCGTCTTCATGAATATATAGAGAAGGACAATCTTCGACCAAATTTGTTATTGGAGAATGCATTACTCGACATGTATGCTAAGTGTGGATGTTTAGAAGAAGCTAGGGATGTATTTGATAAAATGAAATTTCGAGATGTTGCGTCATGGACATCAATGATGTCTGCATATGGAATAAGTGGTCAAGGGTATGATGCGGTGGCACTCTTTGCAAAAATGCAAGAGTCAGGTCTAAATCCGGATTCCATTGCTTTTGTTTCTGTACTCTCTGCATGTAGCCATGCAGGATTGTTGGACCAAGGGCGCCATTACTTCCATGTGATGACTGAACAATGTAGGATAGTTCCTAGGATAGAACACTTTGCCTGCATGGTAGATCTATTAGGACGAGCTGGTGAAGTGGAAGAAGCATATAGTTTCATCAAGCAGATGCCTATGGAGCCAAATGAAAGAGTTTGGGGTGCACTTTTGAGTGCTTGTCGAGTGCACTCGAAGATGGATATTGGACTCCTAGCTGCTGATTGTCTTTTTCAGTTGGCCCCTAAGCAATCAGGCTACTATGTCTTGTTATCAAACATTTATGCAAAGGCTGGTATGTGGAGAGATGTAGGGAATGTTAGATATGCAATGAAGAAGAAAGGAATTAAGAAAGTGCCAGGCATTAGTAATGTTGAGATTAAGGGTCAAGTTCATACATTTCTTGCCGGTGATCAATATCATCCACAGGCAAAGAATATATATGAAGAGTTGGATGTATTAGTTGGGAAGATGAAGGAGTTAGGTTACATTCCTCAAACTGAGTCGGCTCTCCATGACGTGGAGGTTGAGGAGAAAGAATGTCATCTAGCTATTCATAGTGAAAAGTTGGCAATTGTTTTTGCTATTTTAAACACGAAGCCGGGTACACCAATTCGAATTACCAAGAACCTTCGTGTTTGTGGGGATTGCCATATTGCAATTAAATTTATCTCCAAAATAGTGTCACGTGACATTATTGTTAGAGATTGCAATCGTTTCCACCATTTTAGTAATGGAATTTGCTCATGTGGAGATTACTGGTGAGCCTCCTAGATGTTTAAGTAATGATGGAGATCATTCGAGGAATGGACAAGCTTGTGGTGCACCATTATCCTTGCCTTTGAAGCTATAGGGTCACCAGTGAAAATGGAGAATTAAAGGCTATATAGGCTTTAGGTAAGGCTACATACTTTCCGAAGGCAGAGCAATGAGACTAGAAGTCCAATAGCTATATGTGAAGGAGCAATTGATGAACAAATCTCCATGTGTTTCAGCCTCCTTCCTACAAAGAACGCACCATCCCAGAGAGAGAGGAGGGGCTACATACTTTCCGAAGGCAGAGCAATGAGACTAGAAGTCCAATAGCTATATGTGAAGGAGCAATTGATGAACAAATCTCCATGTGTTTCAGCCTCCTTCCTACAAAGAACGCATCATCCCAGAGAGAGAGGAGGGGGACACAGGTTGTAAACTGTACTGTTCAAGAGTTTGACTTTTTGGGGAACTTTCCATTCCAATTGTTGTGTTCAAAGCCTGTGGAGAGGGGTGCTTTCGCATCGAAGTTGGGTTGAACCTGTCAAGTCCCAATGTTTCTTCCTTCCTGTCAATAAGAACGGTGGGAATGGAACAAGACAGGGAAGAGGCGCGATAGACGTTAAGAGCTTGGGGTATGTGGATCAATTGGTGGAGGCATGCGAGAGTTTAGGCTGGAGAAATCCTTCTAAGATTCAGGCTGAGGCCATTCGTCATGCACTAGAAGGAAATTCGAGCAAAAGTGCAATTTGGAGAGAATGCATCCAAATAGAGTCAAAACGAGCCAAAACAAAGGTCGTTCGAACAATCGAGCAACCAAAAGAAGTGAAAATATCATTTTGCTCCTGGCGAGAGCGTCTAGACACTGGGCGTAACGTCTAGACTCGGGCGCATTCACAAAAGGTGCGGATGCAGGGTAGCGTCGAGATGTTACGTACTTTGATGGGACCGGGCACGCTTCCACAACATCGAGATGCTTTCACAATGTATAAATGCTTTTTTTTGTTTAGGGTTTTGAAGAGAGAGACACTTTAGGGACCAATTCTCAATAAGCTCTCGAGTTATTTTTAGCTTAGGATATGTAGTTTTATGTTTTTCATTGTGATTATATGTGTATCGGGCTTGAGTTTTTCGACTCTGGTTAAGCGAGAATGTCGATTCAAAGGGAAGCATTAGGCTATGTAACGCTTTGAGGTTTTTTTTTACAATGCAATTTCTATTTCATTTT

Coding sequence (CDS)

ATGAAGGCCACAATTTTTCGTTCTCCACAAATCTTAACTTCCAAAAAGTTTCGAGGCTTGGTTTTTGCGACCCACAAACATTGTGATAGTTCTAAAGTTGAATTTTTTTCCAAAGAAGTGTGTGAGATAATCCTGGACCGGTATCCAGATAGAAAAACATTGAATAAACTCCACTCCAAGATTGTCGTCAATGAACATCTTCATATTGATCCGACTCTATCTATCAAGCTTATGAGAGCTTACTCTGCGTGTGGGGAAACAAGAGTTGCACGATTCATATTTGATAGAACTTTTGAAAAGAATGTTGTCTTTTTTAATGTCATGATAAGAAGTTATGTGAACAACAACTTGTATGTTGAAGCTCTTTCCGTTTATCAAGTCATGTCAAGTTGCGCTTTTAATCCTGATCATTACACATTCCCTTGTGTTTTGAAGGCATGTTCTGGATTGGATAATTTGAAGGTTGGGTTTCAAGTTCATGGTGCAATAGTGAAAATTGGTCTTGACTCAAATTTATTTATTGGTAATGCTTTGGTAGCCATGTATGGTAAATGTGGTTGTGTAAGGGAGGCTCGAAAAGTCCTTGATCAAATGCCAAATAGAGATGTGGTTTCTTGGAATTCTATGGTTGCAGGGTATGCACAAAGTGGGCAATTTGATGAAGCATTGGAAATTTGTAAGGAGATGGATTCCCTAAAGCTGAATCATGATGCTGGTACTATGGCTAGCTTGTTGCCAGCAGTGAGTAATACATCGCCTGAAAATGTTCAGTATATACATAAGATGTTTGAGAAGATGGCTATGAAGAGTTTGGTTTCGTGGAATGTGATGATAGCAATATATGTGAATAATTCTTTGCCAAATGAAGCCGTCAATATATTTTTTCAGATGGAGGAATGTGGGATGAAACCAGACGCGGTTACAATTGCGAGCCTTCTTCCGGCTTGTGGGGACCTTTCAGCACTATTTCTAGGAAAGCGTCTTCATGAATATATAGAGAAGGACAATCTTCGACCAAATTTGTTATTGGAGAATGCATTACTCGACATGTATGCTAAGTGTGGATGTTTAGAAGAAGCTAGGGATGTATTTGATAAAATGAAATTTCGAGATGTTGCGTCATGGACATCAATGATGTCTGCATATGGAATAAGTGGTCAAGGGTATGATGCGGTGGCACTCTTTGCAAAAATGCAAGAGTCAGGTCTAAATCCGGATTCCATTGCTTTTGTTTCTGTACTCTCTGCATGTAGCCATGCAGGATTGTTGGACCAAGGGCGCCATTACTTCCATGTGATGACTGAACAATGTAGGATAGTTCCTAGGATAGAACACTTTGCCTGCATGGTAGATCTATTAGGACGAGCTGGTGAAGTGGAAGAAGCATATAGTTTCATCAAGCAGATGCCTATGGAGCCAAATGAAAGAGTTTGGGGTGCACTTTTGAGTGCTTGTCGAGTGCACTCGAAGATGGATATTGGACTCCTAGCTGCTGATTGTCTTTTTCAGTTGGCCCCTAAGCAATCAGGCTACTATGTCTTGTTATCAAACATTTATGCAAAGGCTGGTATGTGGAGAGATGTAGGGAATGTTAGATATGCAATGAAGAAGAAAGGAATTAAGAAAGTGCCAGGCATTAGTAATGTTGAGATTAAGGGTCAAGTTCATACATTTCTTGCCGGTGATCAATATCATCCACAGGCAAAGAATATATATGAAGAGTTGGATGTATTAGTTGGGAAGATGAAGGAGTTAGGTTACATTCCTCAAACTGAGTCGGCTCTCCATGACGTGGAGGTTGAGGAGAAAGAATGTCATCTAGCTATTCATAGTGAAAAGTTGGCAATTGTTTTTGCTATTTTAAACACGAAGCCGGGTACACCAATTCGAATTACCAAGAACCTTCGTGTTTGTGGGGATTGCCATATTGCAATTAAATTTATCTCCAAAATAGTGTCACGTGACATTATTGTTAGAGATTGCAATCGTTTCCACCATTTTAGTAATGGAATTTGCTCATGTGGAGATTACTGGTGA

Protein sequence

MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVAMYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGTMASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEECGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCGDYW
Homology
BLAST of Tan0006148 vs. ExPASy Swiss-Prot
Match: P0C899 (Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H77 PE=3 SV=1)

HSP 1 Score: 872.1 bits (2252), Expect = 4.2e-252
Identity = 418/640 (65.31%), Postives = 510/640 (79.69%), Query Frame = 0

Query: 44  ILDRYPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVV 103
           +LD YPD +TL  +HS+I++ E L  + +L +KLMRAY++  +   AR +FD   E+NV+
Sbjct: 48  VLDTYPDIRTLRTVHSRIIL-EDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVI 107

Query: 104 FFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAI 163
             NVMIRSYVNN  Y E + V+  M  C   PDHYTFPCVLKACS    + +G ++HG+ 
Sbjct: 108 IINVMIRSYVNNGFYGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSA 167

Query: 164 VKIGLDSNLFIGNALVAMYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEAL 223
            K+GL S LF+GN LV+MYGKCG + EAR VLD+M  RDVVSWNS+V GYAQ+ +FD+AL
Sbjct: 168 TKVGLSSTLFVGNGLVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDAL 227

Query: 224 EICKEMDSLKLNHDAGTMASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVN 283
           E+C+EM+S+K++HDAGTMASLLPAVSNT+ ENV Y+  MF KM  KSLVSWNVMI +Y+ 
Sbjct: 228 EVCREMESVKISHDAGTMASLLPAVSNTTTENVMYVKDMFFKMGKKSLVSWNVMIGVYMK 287

Query: 284 NSLPNEAVNIFFQMEECGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLL 343
           N++P EAV ++ +ME  G +PDAV+I S+LPACGD SAL LGK++H YIE+  L PNLLL
Sbjct: 288 NAMPVEAVELYSRMEADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLL 347

Query: 344 ENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGL 403
           ENAL+DMYAKCGCLE+ARDVF+ MK RDV SWT+M+SAYG SG+G DAVALF+K+Q+SGL
Sbjct: 348 ENALIDMYAKCGCLEKARDVFENMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGL 407

Query: 404 NPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAY 463
            PDSIAFV+ L+ACSHAGLL++GR  F +MT+  +I PR+EH ACMVDLLGRAG+V+EAY
Sbjct: 408 VPDSIAFVTTLAACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAY 467

Query: 464 SFIKQMPMEPNERVWGALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAG 523
            FI+ M MEPNERVWGALL ACRVHS  DIGLLAAD LFQLAP+QSGYYVLLSNIYAKAG
Sbjct: 468 RFIQDMSMEPNERVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAG 527

Query: 524 MWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMK 583
            W +V N+R  MK KG+KK PG SNVE+   +HTFL GD+ HPQ+  IY ELDVLV KMK
Sbjct: 528 RWEEVTNIRNIMKSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRELDVLVKKMK 587

Query: 584 ELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTK-----PGTPIRITKNLRVC 643
           ELGY+P +ESALHDVE E+KE HLA+HSEKLAIVFA++NTK         IRITKNLR+C
Sbjct: 588 ELGYVPDSESALHDVEEEDKETHLAVHSEKLAIVFALMNTKEEEEDSNNTIRITKNLRIC 647

Query: 644 GDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCGDYW 679
           GDCH+A K IS+I SR+II+RD NRFH F  G+CSCGDYW
Sbjct: 648 GDCHVAAKLISQITSREIIIRDTNRFHVFRFGVCSCGDYW 686

BLAST of Tan0006148 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 514.6 bits (1324), Expect = 1.7e-144
Identity = 253/670 (37.76%), Postives = 413/670 (61.64%), Query Frame = 0

Query: 44  ILDRYPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVV 103
           ++D    +  L ++H++++V   L     L  KL+ A S+ G+   AR +FD      + 
Sbjct: 27  LIDSATHKAQLKQIHARLLV-LGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIF 86

Query: 104 FFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAI 163
            +N +IR Y  NN + +AL +Y  M     +PD +TFP +LKACSGL +L++G  VH  +
Sbjct: 87  PWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQV 146

Query: 164 VKIGLDSNLFIGNALVAMYGKCGCVREARKVLD--QMPNRDVVSWNSMVAGYAQSGQFDE 223
            ++G D+++F+ N L+A+Y KC  +  AR V +   +P R +VSW ++V+ YAQ+G+  E
Sbjct: 147 FRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPME 206

Query: 224 ALEICKEMDSLKLNHDAGTMASLLPAVS--------------------NTSPENVQYIHK 283
           ALEI  +M  + +  D   + S+L A +                       P+ +  ++ 
Sbjct: 207 ALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNT 266

Query: 284 M-------------FEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEECGMKPDAVT 343
           M             F+KM   +L+ WN MI+ Y  N    EA+++F +M    ++PD ++
Sbjct: 267 MYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTIS 326

Query: 344 IASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMK 403
           I S + AC  + +L   + ++EY+ + + R ++ + +AL+DM+AKCG +E AR VFD+  
Sbjct: 327 ITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTL 386

Query: 404 FRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRH 463
            RDV  W++M+  YG+ G+  +A++L+  M+  G++P+ + F+ +L AC+H+G++ +G  
Sbjct: 387 DRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWW 446

Query: 464 YFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGALLSACRVH 523
           +F+ M +  +I P+ +H+AC++DLLGRAG +++AY  IK MP++P   VWGALLSAC+ H
Sbjct: 447 FFNRMADH-KINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKH 506

Query: 524 SKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISN 583
             +++G  AA  LF + P  +G+YV LSN+YA A +W  V  VR  MK+KG+ K  G S 
Sbjct: 507 RHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSW 566

Query: 584 VEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLA 643
           VE++G++  F  GD+ HP+ + I  +++ +  ++KE G++   +++LHD+  EE E  L 
Sbjct: 567 VEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLC 626

Query: 644 IHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFS 679
            HSE++AI + +++T  GTP+RITKNLR C +CH A K ISK+V R+I+VRD NRFHHF 
Sbjct: 627 SHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFK 686

BLAST of Tan0006148 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 511.1 bits (1315), Expect = 1.9e-143
Identity = 260/702 (37.04%), Postives = 409/702 (58.26%), Query Frame = 0

Query: 47  RYPDRKTLNKLHSKIVVNEHL-HIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFF 106
           R   +    +LH++ +  + L H   ++ I +   Y+       A  +F       V+ +
Sbjct: 17  RIKSKSQAKQLHAQFIRTQSLSHTSASIVISI---YTNLKLLHEALLLFKTLKSPPVLAW 76

Query: 107 NVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVK 166
             +IR + + +L+ +AL+ +  M +    PDH  FP VLK+C+ + +L+ G  VHG IV+
Sbjct: 77  KSVIRCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVR 136

Query: 167 IGLDSNLFIGNALVAMYGKC-------------------------------GCVR----- 226
           +G+D +L+ GNAL+ MY K                                 C+      
Sbjct: 137 LGMDCDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGID 196

Query: 227 EARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGTMASLLPAVS 286
             R+V + MP +DVVS+N+++AGYAQSG +++AL + +EM +  L  D+ T++S+LP  S
Sbjct: 197 SVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFS 256

Query: 287 N---------------------------------TSPENVQYIHKMFEKMAMKSLVSWNV 346
                                                  ++   ++F ++  +  +SWN 
Sbjct: 257 EYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNS 316

Query: 347 MIAIYVNNSLPNEAVNIFFQMEECGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDN 406
           ++A YV N   NEA+ +F QM    +KP AV  +S++PAC  L+ L LGK+LH Y+ +  
Sbjct: 317 LVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGG 376

Query: 407 LRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFA 466
              N+ + +AL+DMY+KCG ++ AR +FD+M   D  SWT+++  + + G G++AV+LF 
Sbjct: 377 FGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFE 436

Query: 467 KMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRA 526
           +M+  G+ P+ +AFV+VL+ACSH GL+D+   YF+ MT+   +   +EH+A + DLLGRA
Sbjct: 437 EMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRA 496

Query: 527 GEVEEAYSFIKQMPMEPNERVWGALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLS 586
           G++EEAY+FI +M +EP   VW  LLS+C VH  +++    A+ +F +  +  G YVL+ 
Sbjct: 497 GKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMC 556

Query: 587 NIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELD 646
           N+YA  G W+++  +R  M+KKG++K P  S +E+K + H F++GD+ HP    I E L 
Sbjct: 557 NMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLK 616

Query: 647 VLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLR 679
            ++ +M++ GY+  T   LHDV+ E K   L  HSE+LA+ F I+NT+PGT IR+TKN+R
Sbjct: 617 AVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIR 676

BLAST of Tan0006148 vs. ExPASy Swiss-Prot
Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 495.0 bits (1273), Expect = 1.4e-138
Identity = 251/663 (37.86%), Postives = 380/663 (57.32%), Query Frame = 0

Query: 48  YPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNV 107
           + D +    +H + VV +    +  L   +++ Y        AR +FDR  EK+ + +N 
Sbjct: 132 FRDDRAGRVIHGQAVV-DGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNT 191

Query: 108 MIRSYVNNNLYVEALSVYQ--VMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVK 167
           MI  Y  N +YVE++ V++  +  SC    D  T   +L A + L  L++G Q+H    K
Sbjct: 192 MISGYRKNEMYVESIQVFRDLINESCT-RLDTTTLLDILPAVAELQELRLGMQIHSLATK 251

Query: 168 IGLDSNLFIGNALVAMYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEI 227
            G  S+ ++    +++Y KCG ++    +  +    D+V++N+M+ GY  +G+ + +L +
Sbjct: 252 TGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSL 311

Query: 228 CKEMDSLKLNHDAGTMASLLPAVSN------------------------------TSPEN 287
            KE+        + T+ SL+P   +                              +    
Sbjct: 312 FKELMLSGARLRSSTLVSLVPVSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNE 371

Query: 288 VQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEECGMKPDAVTIASLLPA 347
           ++   K+F++   KSL SWN MI+ Y  N L  +A+++F +M++    P+ VTI  +L A
Sbjct: 372 IESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSA 431

Query: 348 CGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASW 407
           C  L AL LGK +H+ +   +   ++ +  AL+ MYAKCG + EAR +FD M  ++  +W
Sbjct: 432 CAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTW 491

Query: 408 TSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTE 467
            +M+S YG+ GQG +A+ +F +M  SG+ P  + F+ VL ACSHAGL+ +G   F+ M  
Sbjct: 492 NTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIH 551

Query: 468 QCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGALLSACRVHSKMDIGL 527
           +    P ++H+ACMVD+LGRAG ++ A  FI+ M +EP   VW  LL ACR+H   ++  
Sbjct: 552 RYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLAR 611

Query: 528 LAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQV 587
             ++ LF+L P   GY+VLLSNI++    +     VR   KK+ + K PG + +EI    
Sbjct: 612 TVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETP 671

Query: 588 HTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLA 647
           H F +GDQ HPQ K IYE+L+ L GKM+E GY P+TE ALHDVE EE+E  + +HSE+LA
Sbjct: 672 HVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLA 731

Query: 648 IVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCG 679
           I F ++ T+PGT IRI KNLRVC DCH   K ISKI  R I+VRD NRFHHF +G+CSCG
Sbjct: 732 IAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCG 791

BLAST of Tan0006148 vs. ExPASy Swiss-Prot
Match: Q7Y211 (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 493.0 bits (1268), Expect = 5.3e-138
Identity = 261/649 (40.22%), Postives = 385/649 (59.32%), Query Frame = 0

Query: 77  LMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPD 136
           L+  Y   G+   ++ +      +++V +N ++ S   N   +EAL   + M      PD
Sbjct: 242 LVAMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPD 301

Query: 137 HYTFPCVLKACSGLDNLKVGFQVHGAIVKIG-LDSNLFIGNALVAMYGKCGCVREARKVL 196
            +T   VL ACS L+ L+ G ++H   +K G LD N F+G+ALV MY  C  V   R+V 
Sbjct: 302 EFTISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVF 361

Query: 197 DQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEM-DSLKLNHDAGTMASLLPA------- 256
           D M +R +  WN+M+AGY+Q+    EAL +   M +S  L  ++ TMA ++PA       
Sbjct: 362 DGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAF 421

Query: 257 --------------------VSNTSPE------NVQYIHKMFEKMAMKSLVSWNVMIAIY 316
                               V NT  +       +    ++F KM  + LV+WN MI  Y
Sbjct: 422 SRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGY 481

Query: 317 VNNSLPNEAVNIFFQME-----------ECGMKPDAVTIASLLPACGDLSALFLGKRLHE 376
           V +    +A+ +  +M+              +KP+++T+ ++LP+C  LSAL  GK +H 
Sbjct: 482 VFSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHA 541

Query: 377 YIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYD 436
           Y  K+NL  ++ + +AL+DMYAKCGCL+ +R VFD++  ++V +W  ++ AYG+ G G +
Sbjct: 542 YAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQE 601

Query: 437 AVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMV 496
           A+ L   M   G+ P+ + F+SV +ACSH+G++D+G   F+VM     + P  +H+AC+V
Sbjct: 602 AIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVV 661

Query: 497 DLLGRAGEVEEAYSFIKQMPMEPNER-VWGALLSACRVHSKMDIGLLAADCLFQLAPKQS 556
           DLLGRAG ++EAY  +  MP + N+   W +LL A R+H+ ++IG +AA  L QL P  +
Sbjct: 662 DLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVA 721

Query: 557 GYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAK 616
            +YVLL+NIY+ AG+W     VR  MK++G++K PG S +E   +VH F+AGD  HPQ++
Sbjct: 722 SHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSE 781

Query: 617 NIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTKPGTPI 676
            +   L+ L  +M++ GY+P T   LH+VE +EKE  L  HSEKLAI F ILNT PGT I
Sbjct: 782 KLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTII 841

Query: 677 RITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCGDYW 679
           R+ KNLRVC DCH+A KFISKIV R+II+RD  RFH F NG CSCGDYW
Sbjct: 842 RVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 890

BLAST of Tan0006148 vs. NCBI nr
Match: XP_022157138.1 (putative pentatricopeptide repeat-containing protein At3g49142 [Momordica charantia])

HSP 1 Score: 1265.8 bits (3274), Expect = 0.0e+00
Identity = 613/678 (90.41%), Postives = 645/678 (95.13%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKATIF S QIL SK FRGLVFATHKHCDSS++E  +KE+CEIILDRYPD KTL+KLHSK
Sbjct: 1   MKATIFLSRQILASKMFRGLVFATHKHCDSSRIELVAKEMCEIILDRYPDIKTLSKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           IVVNEHL +DPTL+IKLMRAYSACG+TRVAR+IFD TFEKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IVVNEHLKVDPTLAIKLMRAYSACGKTRVARYIFDGTFEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSC FNPDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSN+FIGNALVA
Sbjct: 121 ALSIYQVMSSCGFNPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNVFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMPNRDVVSWNSMVAGYAQ GQFD+ALEICKEMD+L LNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPNRDVVSWNSMVAGYAQRGQFDDALEICKEMDTLNLNHDAGT 240

Query: 241 MASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEEC 300
           MASLLPAVSNTS ENVQY+HKMFEKM  KSLVSWNVMI IYVNNS+PNEAVN+F QMEEC
Sbjct: 241 MASLLPAVSNTSSENVQYVHKMFEKMDRKSLVSWNVMIGIYVNNSMPNEAVNLFSQMEEC 300

Query: 301 GMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360
            MKPDAVTIASLLPACGDLSALFLG+RLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA
Sbjct: 301 RMKPDAVTIASLLPACGDLSALFLGRRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360

Query: 361 RDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHA 420
           RDVF+KMKFRDV SWTSMMSAYG+SGQG+D V LFAKMQESGLNPDSIAFVSVLSACSHA
Sbjct: 361 RDVFNKMKFRDVVSWTSMMSAYGVSGQGHDVVVLFAKMQESGLNPDSIAFVSVLSACSHA 420

Query: 421 GLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGA 480
           GLLDQGRHYFHVMTEQ  IVPRIEHFACMVDLLGRAGEVEEAY+FIK MPMEPNERVWGA
Sbjct: 421 GLLDQGRHYFHVMTEQYGIVPRIEHFACMVDLLGRAGEVEEAYAFIKHMPMEPNERVWGA 480

Query: 481 LLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGI 540
           LLSACRVH KMDIGLLAA+CLFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMK KGI
Sbjct: 481 LLSACRVHLKMDIGLLAANCLFQLAPKQSGYYVLLSNIYAKAGMWKDVANVRYAMKNKGI 540

Query: 541 KKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600
           KKVPGISNVE+KGQVHTFLAGDQYHPQ+K IYEELDVLVGKMKELGYIP+TESALHDVEV
Sbjct: 541 KKVPGISNVELKGQVHTFLAGDQYHPQSKKIYEELDVLVGKMKELGYIPETESALHDVEV 600

Query: 601 EEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRD 660
           E+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH AIKF+SKIVSR IIVRD
Sbjct: 601 EDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHTAIKFMSKIVSRSIIVRD 660

Query: 661 CNRFHHFSNGICSCGDYW 679
           CNRFH+FSNGICSCGDYW
Sbjct: 661 CNRFHYFSNGICSCGDYW 678

BLAST of Tan0006148 vs. NCBI nr
Match: XP_022946769.1 (putative pentatricopeptide repeat-containing protein At3g49142 [Cucurbita moschata] >XP_022946770.1 putative pentatricopeptide repeat-containing protein At3g49142 [Cucurbita moschata])

HSP 1 Score: 1236.9 bits (3199), Expect = 0.0e+00
Identity = 609/679 (89.69%), Postives = 641/679 (94.40%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF S QILTSK+F G   A HK  +SSKVE FSKEVCEIILD+YPD KTLNKLHSK
Sbjct: 1   MKALIFCSRQILTSKRFGG--SAAHKRFNSSKVELFSKEVCEIILDKYPDLKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           I+VNEHL IDPTL+IKLMRAYSA GETRVAR+IFDRT EKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IIVNEHLRIDPTLAIKLMRAYSANGETRVARYIFDRTLEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSCAF PDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSNLFIGNALVA
Sbjct: 121 ALSIYQVMSSCAFYPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCG +REARKVLDQMPNRDVVSWNSMVAGYAQSGQFD+ALEICKEMDSLKLNHDAGT
Sbjct: 181 MYGKCGRLREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLKLNHDAGT 240

Query: 241 MASLLPAV-SNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEE 300
           MASLLP V + +SPENV+Y+H MFEKMA KSLVSWNVMIAIYVNNS+PNEAV++F QMEE
Sbjct: 241 MASLLPVVKAESSPENVRYVHNMFEKMARKSLVSWNVMIAIYVNNSMPNEAVSLFLQMEE 300

Query: 301 CGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEE 360
           CGMKPDAVTIASLLPACGDLSALFLG+RLH+YIEKDNL PNLLLENALLDMYAKCGCLEE
Sbjct: 301 CGMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKDNLLPNLLLENALLDMYAKCGCLEE 360

Query: 361 ARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSH 420
           ARDVFDKMK RDV SWTSMMSAYG SGQGYDAVALFAKM +SGLNPDSI+FVSVLSACSH
Sbjct: 361 ARDVFDKMKLRDVVSWTSMMSAYGKSGQGYDAVALFAKMLDSGLNPDSISFVSVLSACSH 420

Query: 421 AGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWG 480
           AGLL+QGR YFH+MTEQ RIVPRIEHFACMVDL GRAGEVEEAYSFIKQMPMEPNERVWG
Sbjct: 421 AGLLEQGRRYFHIMTEQYRIVPRIEHFACMVDLFGRAGEVEEAYSFIKQMPMEPNERVWG 480

Query: 481 ALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKG 540
           ALLSACRVHSKMDIGL+AADCLFQLAPKQSGYYVLLSNIYAKAGMW+DV NVR AMKK G
Sbjct: 481 ALLSACRVHSKMDIGLIAADCLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRNAMKKIG 540

Query: 541 IKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVE 600
           IKKVPGISNVE+KGQVHTFLAGDQYHPQAK+IYEELD+LVGKMKELGYIPQTESALHDVE
Sbjct: 541 IKKVPGISNVELKGQVHTFLAGDQYHPQAKSIYEELDLLVGKMKELGYIPQTESALHDVE 600

Query: 601 VEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVR 660
           VE+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH+AIK ISKIVSRDII+R
Sbjct: 601 VEDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHVAIKLISKIVSRDIIIR 660

Query: 661 DCNRFHHFSNGICSCGDYW 679
           DCNRFHHFSNGICSCGDYW
Sbjct: 661 DCNRFHHFSNGICSCGDYW 677

BLAST of Tan0006148 vs. NCBI nr
Match: XP_022999640.1 (putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 [Cucurbita maxima] >XP_022999641.1 putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1233.4 bits (3190), Expect = 0.0e+00
Identity = 607/679 (89.40%), Postives = 639/679 (94.11%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF S QILT  +F G   A HK  +SSKVE FSKEVCEIILD+YPD KTLNKLHSK
Sbjct: 1   MKAIIFCSRQILTFTRFGG--SAAHKRFNSSKVELFSKEVCEIILDQYPDLKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           I+VNEHL IDPTL+IKLMRAYSA G TRVAR+IFDRT EKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IIVNEHLRIDPTLAIKLMRAYSANGVTRVARYIFDRTLEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSCAFNPDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSNLFIGNALVA
Sbjct: 121 ALSIYQVMSSCAFNPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMPNRDVVSWNSMVAGYAQSGQFD+ALEICKEMDSLKLNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLKLNHDAGT 240

Query: 241 MASLLPAV-SNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEE 300
           MASLLP V + +SPENV+Y+H MFEKMA KSLVSWNVMIAIYVNNS+PNEAV++F QMEE
Sbjct: 241 MASLLPVVKAESSPENVRYVHNMFEKMARKSLVSWNVMIAIYVNNSMPNEAVSLFLQMEE 300

Query: 301 CGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEE 360
            GMKPDAVTIASLLPACGDLSALFLG+RLH+YIEKDNL PNLLLENALLDMYAKCGCLEE
Sbjct: 301 RGMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKDNLLPNLLLENALLDMYAKCGCLEE 360

Query: 361 ARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSH 420
           ARDVFDKMK RDV SWTSMMSAYG SGQGYDAVALFAKM +SGLNPDSI+FVSVLSACSH
Sbjct: 361 ARDVFDKMKLRDVVSWTSMMSAYGKSGQGYDAVALFAKMLDSGLNPDSISFVSVLSACSH 420

Query: 421 AGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWG 480
           AGLL+QG  YFH+MTEQ RIVPRIEHFACMVDL GRAGEVEEAYSFIKQMPMEPNERVWG
Sbjct: 421 AGLLEQGSRYFHIMTEQYRIVPRIEHFACMVDLFGRAGEVEEAYSFIKQMPMEPNERVWG 480

Query: 481 ALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKG 540
           ALLSACRVHSKMDIGL+AADCLFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMKK G
Sbjct: 481 ALLSACRVHSKMDIGLIAADCLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRYAMKKIG 540

Query: 541 IKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVE 600
           IKKVPGISNVE+KGQVHTFLAGDQYHPQAK+IYEELD+LVGKMKELGYIPQTESALHDVE
Sbjct: 541 IKKVPGISNVELKGQVHTFLAGDQYHPQAKSIYEELDLLVGKMKELGYIPQTESALHDVE 600

Query: 601 VEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVR 660
           VE+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH+AIK ISKIVSRDII+R
Sbjct: 601 VEDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHVAIKLISKIVSRDIIIR 660

Query: 661 DCNRFHHFSNGICSCGDYW 679
           DCNRFHHFSNGICSCGDYW
Sbjct: 661 DCNRFHHFSNGICSCGDYW 677

BLAST of Tan0006148 vs. NCBI nr
Match: XP_023546028.1 (putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1227.2 bits (3174), Expect = 0.0e+00
Identity = 605/679 (89.10%), Postives = 638/679 (93.96%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF S QI TSK+F G   A H+  +SSKVE FSKEVCEIILD+YPD KTLNKLHSK
Sbjct: 1   MKAIIFCSRQISTSKRFGG--SAAHERFNSSKVELFSKEVCEIILDKYPDLKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           I+VNEHL IDPTL+IKLMRAYSA G TRVAR+IFDRT EKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IIVNEHLRIDPTLAIKLMRAYSANGVTRVARYIFDRTLEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSCAFNPDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSNLFIGNALVA
Sbjct: 121 ALSIYQVMSSCAFNPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCG +REARKVLDQMPNRDVVSWNSMVAGYAQSGQFD+ALEICKEMDSLKLNHDAGT
Sbjct: 181 MYGKCGRLREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLKLNHDAGT 240

Query: 241 MASLLPAV-SNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEE 300
           MASLLP V + +SPENV+Y+H MFEKMA KSLVSWNVMIAIYVNNS+PNEAV++F QMEE
Sbjct: 241 MASLLPVVKAESSPENVRYVHNMFEKMARKSLVSWNVMIAIYVNNSMPNEAVSLFSQMEE 300

Query: 301 CGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEE 360
           CGMKPDAVTIASLLPACGDLSALFLG+RLH+YIEKDNL PNLLLENALLDMYAKCGCLEE
Sbjct: 301 CGMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKDNLLPNLLLENALLDMYAKCGCLEE 360

Query: 361 ARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSH 420
           ARDVFDKMK RDV SWTSMMS YG SGQGYDAVALFAKM +SGLNPDSI+FVSVLSACSH
Sbjct: 361 ARDVFDKMKLRDVVSWTSMMSVYGKSGQGYDAVALFAKMLDSGLNPDSISFVSVLSACSH 420

Query: 421 AGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWG 480
           AGLL+QGR YFH+MTEQ RIVPRIEHFACMVDL GRAGEVEEAYSFIKQMPMEPNERVWG
Sbjct: 421 AGLLEQGRRYFHIMTEQYRIVPRIEHFACMVDLFGRAGEVEEAYSFIKQMPMEPNERVWG 480

Query: 481 ALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKG 540
           ALLSACRVHSKMDIGL+AAD LFQLAPKQSGYYVLLSNIYAKAGMW+DV NVR AMKK G
Sbjct: 481 ALLSACRVHSKMDIGLIAADSLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRNAMKKIG 540

Query: 541 IKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVE 600
           IKKVPGISNVE+KGQVHTFLAGDQYHPQAK+IYEELD+LVGKMKELGYIPQTESALHDVE
Sbjct: 541 IKKVPGISNVELKGQVHTFLAGDQYHPQAKSIYEELDLLVGKMKELGYIPQTESALHDVE 600

Query: 601 VEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVR 660
           VE+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH+AIK ISKIVSRDII+R
Sbjct: 601 VEDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHVAIKLISKIVSRDIIIR 660

Query: 661 DCNRFHHFSNGICSCGDYW 679
           DCNRFHHFSNGICSCGDYW
Sbjct: 661 DCNRFHHFSNGICSCGDYW 677

BLAST of Tan0006148 vs. NCBI nr
Match: XP_008456467.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Cucumis melo])

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 600/678 (88.50%), Postives = 627/678 (92.48%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF + QILTS KFRG+V +T  H D  KVE FSKEVCEIILD+YP  KTLNKLHSK
Sbjct: 1   MKAIIFCTRQILTSNKFRGIVSSTRIHFDRLKVEVFSKEVCEIILDQYPGIKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           IVVNEHL IDPTL+IKLMRAYSACGET VAR+IFDR+ EKNVVFFNVMIRSYVNNNLY E
Sbjct: 61  IVVNEHLRIDPTLAIKLMRAYSACGETSVARYIFDRSLEKNVVFFNVMIRSYVNNNLYFE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS++QVM SCAFNPDHYTFPCVLKACSGLDNL+VG QVH AI+K+GLDSNLFIGNALVA
Sbjct: 121 ALSIFQVMLSCAFNPDHYTFPCVLKACSGLDNLRVGLQVHDAILKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMP RDVVSWNSMVAGYAQSGQFD+ALEICKEMDSL LNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPYRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLNLNHDAGT 240

Query: 241 MASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEEC 300
           MASL P V  TS ENVQYIH MFE+MA K+LVSWNVMIAIYVNNS+PNEAV +F QMEEC
Sbjct: 241 MASLSPVVCYTSLENVQYIHSMFERMAKKNLVSWNVMIAIYVNNSMPNEAVGLFLQMEEC 300

Query: 301 GMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360
           GMKPDAVTIASLLPACGDLSALFLG+RLH+YIEK NLRPNLLLENALLDMYAKCGCLEEA
Sbjct: 301 GMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKGNLRPNLLLENALLDMYAKCGCLEEA 360

Query: 361 RDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHA 420
           RDVFDKMK RDV SWTSMMSAYG SGQGYDAVALFAKM +SG  PDSIAFVSVLSACSH 
Sbjct: 361 RDVFDKMKLRDVVSWTSMMSAYGRSGQGYDAVALFAKMLDSGQIPDSIAFVSVLSACSHT 420

Query: 421 GLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGA 480
           GLLDQGRHYF +MTEQ  IVPRIEHFACMVDL GRAGEVEEAYSF+KQMPMEPNERVWGA
Sbjct: 421 GLLDQGRHYFRMMTEQYGIVPRIEHFACMVDLFGRAGEVEEAYSFVKQMPMEPNERVWGA 480

Query: 481 LLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGI 540
           LLSACRVH KMDIGL+AAD LFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMKK GI
Sbjct: 481 LLSACRVHLKMDIGLVAADRLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRYAMKKMGI 540

Query: 541 KKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600
           KKVPGISNVE+ GQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV
Sbjct: 541 KKVPGISNVELNGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600

Query: 601 EEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRD 660
           E+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCHIAIK ISKI SRDIIVRD
Sbjct: 601 EDKECHLAIHSEKLAIVFAILNTKQGTPIRITKNLRVCGDCHIAIKLISKIASRDIIVRD 660

Query: 661 CNRFHHFSNGICSCGDYW 679
           CNRFHHFSNGICSCGDYW
Sbjct: 661 CNRFHHFSNGICSCGDYW 678

BLAST of Tan0006148 vs. ExPASy TrEMBL
Match: A0A6J1DSL6 (putative pentatricopeptide repeat-containing protein At3g49142 OS=Momordica charantia OX=3673 GN=LOC111023932 PE=3 SV=1)

HSP 1 Score: 1265.8 bits (3274), Expect = 0.0e+00
Identity = 613/678 (90.41%), Postives = 645/678 (95.13%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKATIF S QIL SK FRGLVFATHKHCDSS++E  +KE+CEIILDRYPD KTL+KLHSK
Sbjct: 1   MKATIFLSRQILASKMFRGLVFATHKHCDSSRIELVAKEMCEIILDRYPDIKTLSKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           IVVNEHL +DPTL+IKLMRAYSACG+TRVAR+IFD TFEKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IVVNEHLKVDPTLAIKLMRAYSACGKTRVARYIFDGTFEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSC FNPDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSN+FIGNALVA
Sbjct: 121 ALSIYQVMSSCGFNPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNVFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMPNRDVVSWNSMVAGYAQ GQFD+ALEICKEMD+L LNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPNRDVVSWNSMVAGYAQRGQFDDALEICKEMDTLNLNHDAGT 240

Query: 241 MASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEEC 300
           MASLLPAVSNTS ENVQY+HKMFEKM  KSLVSWNVMI IYVNNS+PNEAVN+F QMEEC
Sbjct: 241 MASLLPAVSNTSSENVQYVHKMFEKMDRKSLVSWNVMIGIYVNNSMPNEAVNLFSQMEEC 300

Query: 301 GMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360
            MKPDAVTIASLLPACGDLSALFLG+RLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA
Sbjct: 301 RMKPDAVTIASLLPACGDLSALFLGRRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360

Query: 361 RDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHA 420
           RDVF+KMKFRDV SWTSMMSAYG+SGQG+D V LFAKMQESGLNPDSIAFVSVLSACSHA
Sbjct: 361 RDVFNKMKFRDVVSWTSMMSAYGVSGQGHDVVVLFAKMQESGLNPDSIAFVSVLSACSHA 420

Query: 421 GLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGA 480
           GLLDQGRHYFHVMTEQ  IVPRIEHFACMVDLLGRAGEVEEAY+FIK MPMEPNERVWGA
Sbjct: 421 GLLDQGRHYFHVMTEQYGIVPRIEHFACMVDLLGRAGEVEEAYAFIKHMPMEPNERVWGA 480

Query: 481 LLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGI 540
           LLSACRVH KMDIGLLAA+CLFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMK KGI
Sbjct: 481 LLSACRVHLKMDIGLLAANCLFQLAPKQSGYYVLLSNIYAKAGMWKDVANVRYAMKNKGI 540

Query: 541 KKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600
           KKVPGISNVE+KGQVHTFLAGDQYHPQ+K IYEELDVLVGKMKELGYIP+TESALHDVEV
Sbjct: 541 KKVPGISNVELKGQVHTFLAGDQYHPQSKKIYEELDVLVGKMKELGYIPETESALHDVEV 600

Query: 601 EEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRD 660
           E+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH AIKF+SKIVSR IIVRD
Sbjct: 601 EDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHTAIKFMSKIVSRSIIVRD 660

Query: 661 CNRFHHFSNGICSCGDYW 679
           CNRFH+FSNGICSCGDYW
Sbjct: 661 CNRFHYFSNGICSCGDYW 678

BLAST of Tan0006148 vs. ExPASy TrEMBL
Match: A0A6J1G4Z8 (putative pentatricopeptide repeat-containing protein At3g49142 OS=Cucurbita moschata OX=3662 GN=LOC111450738 PE=3 SV=1)

HSP 1 Score: 1236.9 bits (3199), Expect = 0.0e+00
Identity = 609/679 (89.69%), Postives = 641/679 (94.40%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF S QILTSK+F G   A HK  +SSKVE FSKEVCEIILD+YPD KTLNKLHSK
Sbjct: 1   MKALIFCSRQILTSKRFGG--SAAHKRFNSSKVELFSKEVCEIILDKYPDLKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           I+VNEHL IDPTL+IKLMRAYSA GETRVAR+IFDRT EKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IIVNEHLRIDPTLAIKLMRAYSANGETRVARYIFDRTLEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSCAF PDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSNLFIGNALVA
Sbjct: 121 ALSIYQVMSSCAFYPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCG +REARKVLDQMPNRDVVSWNSMVAGYAQSGQFD+ALEICKEMDSLKLNHDAGT
Sbjct: 181 MYGKCGRLREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLKLNHDAGT 240

Query: 241 MASLLPAV-SNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEE 300
           MASLLP V + +SPENV+Y+H MFEKMA KSLVSWNVMIAIYVNNS+PNEAV++F QMEE
Sbjct: 241 MASLLPVVKAESSPENVRYVHNMFEKMARKSLVSWNVMIAIYVNNSMPNEAVSLFLQMEE 300

Query: 301 CGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEE 360
           CGMKPDAVTIASLLPACGDLSALFLG+RLH+YIEKDNL PNLLLENALLDMYAKCGCLEE
Sbjct: 301 CGMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKDNLLPNLLLENALLDMYAKCGCLEE 360

Query: 361 ARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSH 420
           ARDVFDKMK RDV SWTSMMSAYG SGQGYDAVALFAKM +SGLNPDSI+FVSVLSACSH
Sbjct: 361 ARDVFDKMKLRDVVSWTSMMSAYGKSGQGYDAVALFAKMLDSGLNPDSISFVSVLSACSH 420

Query: 421 AGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWG 480
           AGLL+QGR YFH+MTEQ RIVPRIEHFACMVDL GRAGEVEEAYSFIKQMPMEPNERVWG
Sbjct: 421 AGLLEQGRRYFHIMTEQYRIVPRIEHFACMVDLFGRAGEVEEAYSFIKQMPMEPNERVWG 480

Query: 481 ALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKG 540
           ALLSACRVHSKMDIGL+AADCLFQLAPKQSGYYVLLSNIYAKAGMW+DV NVR AMKK G
Sbjct: 481 ALLSACRVHSKMDIGLIAADCLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRNAMKKIG 540

Query: 541 IKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVE 600
           IKKVPGISNVE+KGQVHTFLAGDQYHPQAK+IYEELD+LVGKMKELGYIPQTESALHDVE
Sbjct: 541 IKKVPGISNVELKGQVHTFLAGDQYHPQAKSIYEELDLLVGKMKELGYIPQTESALHDVE 600

Query: 601 VEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVR 660
           VE+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH+AIK ISKIVSRDII+R
Sbjct: 601 VEDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHVAIKLISKIVSRDIIIR 660

Query: 661 DCNRFHHFSNGICSCGDYW 679
           DCNRFHHFSNGICSCGDYW
Sbjct: 661 DCNRFHHFSNGICSCGDYW 677

BLAST of Tan0006148 vs. ExPASy TrEMBL
Match: A0A6J1KDN5 (putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493940 PE=3 SV=1)

HSP 1 Score: 1233.4 bits (3190), Expect = 0.0e+00
Identity = 607/679 (89.40%), Postives = 639/679 (94.11%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF S QILT  +F G   A HK  +SSKVE FSKEVCEIILD+YPD KTLNKLHSK
Sbjct: 1   MKAIIFCSRQILTFTRFGG--SAAHKRFNSSKVELFSKEVCEIILDQYPDLKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           I+VNEHL IDPTL+IKLMRAYSA G TRVAR+IFDRT EKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IIVNEHLRIDPTLAIKLMRAYSANGVTRVARYIFDRTLEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS+YQVMSSCAFNPDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLDSNLFIGNALVA
Sbjct: 121 ALSIYQVMSSCAFNPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMPNRDVVSWNSMVAGYAQSGQFD+ALEICKEMDSLKLNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLKLNHDAGT 240

Query: 241 MASLLPAV-SNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEE 300
           MASLLP V + +SPENV+Y+H MFEKMA KSLVSWNVMIAIYVNNS+PNEAV++F QMEE
Sbjct: 241 MASLLPVVKAESSPENVRYVHNMFEKMARKSLVSWNVMIAIYVNNSMPNEAVSLFLQMEE 300

Query: 301 CGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEE 360
            GMKPDAVTIASLLPACGDLSALFLG+RLH+YIEKDNL PNLLLENALLDMYAKCGCLEE
Sbjct: 301 RGMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKDNLLPNLLLENALLDMYAKCGCLEE 360

Query: 361 ARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSH 420
           ARDVFDKMK RDV SWTSMMSAYG SGQGYDAVALFAKM +SGLNPDSI+FVSVLSACSH
Sbjct: 361 ARDVFDKMKLRDVVSWTSMMSAYGKSGQGYDAVALFAKMLDSGLNPDSISFVSVLSACSH 420

Query: 421 AGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWG 480
           AGLL+QG  YFH+MTEQ RIVPRIEHFACMVDL GRAGEVEEAYSFIKQMPMEPNERVWG
Sbjct: 421 AGLLEQGSRYFHIMTEQYRIVPRIEHFACMVDLFGRAGEVEEAYSFIKQMPMEPNERVWG 480

Query: 481 ALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKG 540
           ALLSACRVHSKMDIGL+AADCLFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMKK G
Sbjct: 481 ALLSACRVHSKMDIGLIAADCLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRYAMKKIG 540

Query: 541 IKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVE 600
           IKKVPGISNVE+KGQVHTFLAGDQYHPQAK+IYEELD+LVGKMKELGYIPQTESALHDVE
Sbjct: 541 IKKVPGISNVELKGQVHTFLAGDQYHPQAKSIYEELDLLVGKMKELGYIPQTESALHDVE 600

Query: 601 VEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVR 660
           VE+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCH+AIK ISKIVSRDII+R
Sbjct: 601 VEDKECHLAIHSEKLAIVFAILNTKHGTPIRITKNLRVCGDCHVAIKLISKIVSRDIIIR 660

Query: 661 DCNRFHHFSNGICSCGDYW 679
           DCNRFHHFSNGICSCGDYW
Sbjct: 661 DCNRFHHFSNGICSCGDYW 677

BLAST of Tan0006148 vs. ExPASy TrEMBL
Match: A0A1S3C3E6 (putative pentatricopeptide repeat-containing protein At3g49142 OS=Cucumis melo OX=3656 GN=LOC103496411 PE=3 SV=1)

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 600/678 (88.50%), Postives = 627/678 (92.48%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF + QILTS KFRG+V +T  H D  KVE FSKEVCEIILD+YP  KTLNKLHSK
Sbjct: 1   MKAIIFCTRQILTSNKFRGIVSSTRIHFDRLKVEVFSKEVCEIILDQYPGIKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           IVVNEHL IDPTL+IKLMRAYSACGET VAR+IFDR+ EKNVVFFNVMIRSYVNNNLY E
Sbjct: 61  IVVNEHLRIDPTLAIKLMRAYSACGETSVARYIFDRSLEKNVVFFNVMIRSYVNNNLYFE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS++QVM SCAFNPDHYTFPCVLKACSGLDNL+VG QVH AI+K+GLDSNLFIGNALVA
Sbjct: 121 ALSIFQVMLSCAFNPDHYTFPCVLKACSGLDNLRVGLQVHDAILKVGLDSNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMP RDVVSWNSMVAGYAQSGQFD+ALEICKEMDSL LNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPYRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLNLNHDAGT 240

Query: 241 MASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEEC 300
           MASL P V  TS ENVQYIH MFE+MA K+LVSWNVMIAIYVNNS+PNEAV +F QMEEC
Sbjct: 241 MASLSPVVCYTSLENVQYIHSMFERMAKKNLVSWNVMIAIYVNNSMPNEAVGLFLQMEEC 300

Query: 301 GMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360
           GMKPDAVTIASLLPACGDLSALFLG+RLH+YIEK NLRPNLLLENALLDMYAKCGCLEEA
Sbjct: 301 GMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKGNLRPNLLLENALLDMYAKCGCLEEA 360

Query: 361 RDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHA 420
           RDVFDKMK RDV SWTSMMSAYG SGQGYDAVALFAKM +SG  PDSIAFVSVLSACSH 
Sbjct: 361 RDVFDKMKLRDVVSWTSMMSAYGRSGQGYDAVALFAKMLDSGQIPDSIAFVSVLSACSHT 420

Query: 421 GLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGA 480
           GLLDQGRHYF +MTEQ  IVPRIEHFACMVDL GRAGEVEEAYSF+KQMPMEPNERVWGA
Sbjct: 421 GLLDQGRHYFRMMTEQYGIVPRIEHFACMVDLFGRAGEVEEAYSFVKQMPMEPNERVWGA 480

Query: 481 LLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGI 540
           LLSACRVH KMDIGL+AAD LFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMKK GI
Sbjct: 481 LLSACRVHLKMDIGLVAADRLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRYAMKKMGI 540

Query: 541 KKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600
           KKVPGISNVE+ GQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV
Sbjct: 541 KKVPGISNVELNGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600

Query: 601 EEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRD 660
           E+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCHIAIK ISKI SRDIIVRD
Sbjct: 601 EDKECHLAIHSEKLAIVFAILNTKQGTPIRITKNLRVCGDCHIAIKLISKIASRDIIVRD 660

Query: 661 CNRFHHFSNGICSCGDYW 679
           CNRFHHFSNGICSCGDYW
Sbjct: 661 CNRFHHFSNGICSCGDYW 678

BLAST of Tan0006148 vs. ExPASy TrEMBL
Match: A0A0A0KE24 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G404230 PE=3 SV=1)

HSP 1 Score: 1210.7 bits (3131), Expect = 0.0e+00
Identity = 594/678 (87.61%), Postives = 626/678 (92.33%), Query Frame = 0

Query: 1   MKATIFRSPQILTSKKFRGLVFATHKHCDSSKVEFFSKEVCEIILDRYPDRKTLNKLHSK 60
           MKA IF + QIL S KFRG+V +T    D  KVE FSKE CE+ILD+YP  KTLNKLHSK
Sbjct: 1   MKAIIFCTRQILNSNKFRGIVSSTRIRFDRLKVEVFSKEACEVILDQYPGIKTLNKLHSK 60

Query: 61  IVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVE 120
           IV+NEHL IDPTL+IKLMRAYSA GET VAR+IFDR+ EKNVVFFNVMIRSYVNNNLYVE
Sbjct: 61  IVINEHLRIDPTLAIKLMRAYSAQGETSVARYIFDRSLEKNVVFFNVMIRSYVNNNLYVE 120

Query: 121 ALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVKIGLDSNLFIGNALVA 180
           ALS++QVM SCAFNPDHYTFPCVLKACSGLDNL+VG QVH AIVK+GLD+NLFIGNALVA
Sbjct: 121 ALSIFQVMLSCAFNPDHYTFPCVLKACSGLDNLRVGLQVHDAIVKVGLDTNLFIGNALVA 180

Query: 181 MYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGT 240
           MYGKCGC+REARKVLDQMP RDVVSWNSMVAGYAQSGQFD+ALEICKEMDSL LNHDAGT
Sbjct: 181 MYGKCGCLREARKVLDQMPYRDVVSWNSMVAGYAQSGQFDDALEICKEMDSLNLNHDAGT 240

Query: 241 MASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEEC 300
           MASL P V  TS ENVQYIH MFE+M  K+L+SWNVMIAIYVNNS+PNEAV++F QMEEC
Sbjct: 241 MASLSPVVCYTSLENVQYIHNMFERMTKKNLISWNVMIAIYVNNSMPNEAVSLFLQMEEC 300

Query: 301 GMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEA 360
           GMKPDAVTIASLLPACGDLSALFLG+RLH+YIEK NLRPNLLLENALLDMYAKCGCLEEA
Sbjct: 301 GMKPDAVTIASLLPACGDLSALFLGRRLHKYIEKGNLRPNLLLENALLDMYAKCGCLEEA 360

Query: 361 RDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHA 420
           RDVFDKM+ RDV SWTSMMSAYG SGQGYDAVALFAKM +SG NPDSIAFVSVLSACSH 
Sbjct: 361 RDVFDKMRLRDVVSWTSMMSAYGRSGQGYDAVALFAKMLDSGQNPDSIAFVSVLSACSHT 420

Query: 421 GLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGA 480
           GLLDQGRHYF +MTEQ  IVPRIEHFACMVDL GRAGEVEEAYSFIKQMPMEPNERVWGA
Sbjct: 421 GLLDQGRHYFRMMTEQYGIVPRIEHFACMVDLFGRAGEVEEAYSFIKQMPMEPNERVWGA 480

Query: 481 LLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGI 540
           LLSACRVHSKMDIGL+AAD LFQLAPKQSGYYVLLSNIYAKAGMW+DV NVRYAMKK GI
Sbjct: 481 LLSACRVHSKMDIGLVAADLLFQLAPKQSGYYVLLSNIYAKAGMWKDVMNVRYAMKKIGI 540

Query: 541 KKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEV 600
           KKVPGISNVE+ GQVHTFLAGDQYHPQAKNIY ELDVLVGKMKELGYIPQTESALHDVEV
Sbjct: 541 KKVPGISNVELNGQVHTFLAGDQYHPQAKNIYGELDVLVGKMKELGYIPQTESALHDVEV 600

Query: 601 EEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRD 660
           E+KECHLAIHSEKLAIVFAILNTK GTPIRITKNLRVCGDCHIAIK ISKIVSR+IIVRD
Sbjct: 601 EDKECHLAIHSEKLAIVFAILNTKQGTPIRITKNLRVCGDCHIAIKLISKIVSRNIIVRD 660

Query: 661 CNRFHHFSNGICSCGDYW 679
           CNRFHHFSNGICSCGDYW
Sbjct: 661 CNRFHHFSNGICSCGDYW 678

BLAST of Tan0006148 vs. TAIR 10
Match: AT3G49142.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 872.1 bits (2252), Expect = 3.0e-253
Identity = 418/640 (65.31%), Postives = 510/640 (79.69%), Query Frame = 0

Query: 44  ILDRYPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVV 103
           +LD YPD +TL  +HS+I++ E L  + +L +KLMRAY++  +   AR +FD   E+NV+
Sbjct: 48  VLDTYPDIRTLRTVHSRIIL-EDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVI 107

Query: 104 FFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAI 163
             NVMIRSYVNN  Y E + V+  M  C   PDHYTFPCVLKACS    + +G ++HG+ 
Sbjct: 108 IINVMIRSYVNNGFYGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSA 167

Query: 164 VKIGLDSNLFIGNALVAMYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEAL 223
            K+GL S LF+GN LV+MYGKCG + EAR VLD+M  RDVVSWNS+V GYAQ+ +FD+AL
Sbjct: 168 TKVGLSSTLFVGNGLVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDAL 227

Query: 224 EICKEMDSLKLNHDAGTMASLLPAVSNTSPENVQYIHKMFEKMAMKSLVSWNVMIAIYVN 283
           E+C+EM+S+K++HDAGTMASLLPAVSNT+ ENV Y+  MF KM  KSLVSWNVMI +Y+ 
Sbjct: 228 EVCREMESVKISHDAGTMASLLPAVSNTTTENVMYVKDMFFKMGKKSLVSWNVMIGVYMK 287

Query: 284 NSLPNEAVNIFFQMEECGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLL 343
           N++P EAV ++ +ME  G +PDAV+I S+LPACGD SAL LGK++H YIE+  L PNLLL
Sbjct: 288 NAMPVEAVELYSRMEADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLL 347

Query: 344 ENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFAKMQESGL 403
           ENAL+DMYAKCGCLE+ARDVF+ MK RDV SWT+M+SAYG SG+G DAVALF+K+Q+SGL
Sbjct: 348 ENALIDMYAKCGCLEKARDVFENMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGL 407

Query: 404 NPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAY 463
            PDSIAFV+ L+ACSHAGLL++GR  F +MT+  +I PR+EH ACMVDLLGRAG+V+EAY
Sbjct: 408 VPDSIAFVTTLAACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAY 467

Query: 464 SFIKQMPMEPNERVWGALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAG 523
            FI+ M MEPNERVWGALL ACRVHS  DIGLLAAD LFQLAP+QSGYYVLLSNIYAKAG
Sbjct: 468 RFIQDMSMEPNERVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAG 527

Query: 524 MWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMK 583
            W +V N+R  MK KG+KK PG SNVE+   +HTFL GD+ HPQ+  IY ELDVLV KMK
Sbjct: 528 RWEEVTNIRNIMKSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRELDVLVKKMK 587

Query: 584 ELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTK-----PGTPIRITKNLRVC 643
           ELGY+P +ESALHDVE E+KE HLA+HSEKLAIVFA++NTK         IRITKNLR+C
Sbjct: 588 ELGYVPDSESALHDVEEEDKETHLAVHSEKLAIVFALMNTKEEEEDSNNTIRITKNLRIC 647

Query: 644 GDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCGDYW 679
           GDCH+A K IS+I SR+II+RD NRFH F  G+CSCGDYW
Sbjct: 648 GDCHVAAKLISQITSREIIIRDTNRFHVFRFGVCSCGDYW 686

BLAST of Tan0006148 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 514.6 bits (1324), Expect = 1.2e-145
Identity = 253/670 (37.76%), Postives = 413/670 (61.64%), Query Frame = 0

Query: 44  ILDRYPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVV 103
           ++D    +  L ++H++++V   L     L  KL+ A S+ G+   AR +FD      + 
Sbjct: 27  LIDSATHKAQLKQIHARLLV-LGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIF 86

Query: 104 FFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAI 163
            +N +IR Y  NN + +AL +Y  M     +PD +TFP +LKACSGL +L++G  VH  +
Sbjct: 87  PWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQV 146

Query: 164 VKIGLDSNLFIGNALVAMYGKCGCVREARKVLD--QMPNRDVVSWNSMVAGYAQSGQFDE 223
            ++G D+++F+ N L+A+Y KC  +  AR V +   +P R +VSW ++V+ YAQ+G+  E
Sbjct: 147 FRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPME 206

Query: 224 ALEICKEMDSLKLNHDAGTMASLLPAVS--------------------NTSPENVQYIHK 283
           ALEI  +M  + +  D   + S+L A +                       P+ +  ++ 
Sbjct: 207 ALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNT 266

Query: 284 M-------------FEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEECGMKPDAVT 343
           M             F+KM   +L+ WN MI+ Y  N    EA+++F +M    ++PD ++
Sbjct: 267 MYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTIS 326

Query: 344 IASLLPACGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMK 403
           I S + AC  + +L   + ++EY+ + + R ++ + +AL+DM+AKCG +E AR VFD+  
Sbjct: 327 ITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTL 386

Query: 404 FRDVASWTSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRH 463
            RDV  W++M+  YG+ G+  +A++L+  M+  G++P+ + F+ +L AC+H+G++ +G  
Sbjct: 387 DRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWW 446

Query: 464 YFHVMTEQCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGALLSACRVH 523
           +F+ M +  +I P+ +H+AC++DLLGRAG +++AY  IK MP++P   VWGALLSAC+ H
Sbjct: 447 FFNRMADH-KINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKH 506

Query: 524 SKMDIGLLAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISN 583
             +++G  AA  LF + P  +G+YV LSN+YA A +W  V  VR  MK+KG+ K  G S 
Sbjct: 507 RHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSW 566

Query: 584 VEIKGQVHTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLA 643
           VE++G++  F  GD+ HP+ + I  +++ +  ++KE G++   +++LHD+  EE E  L 
Sbjct: 567 VEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLC 626

Query: 644 IHSEKLAIVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFS 679
            HSE++AI + +++T  GTP+RITKNLR C +CH A K ISK+V R+I+VRD NRFHHF 
Sbjct: 627 SHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFK 686

BLAST of Tan0006148 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 511.1 bits (1315), Expect = 1.3e-144
Identity = 260/702 (37.04%), Postives = 409/702 (58.26%), Query Frame = 0

Query: 47  RYPDRKTLNKLHSKIVVNEHL-HIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFF 106
           R   +    +LH++ +  + L H   ++ I +   Y+       A  +F       V+ +
Sbjct: 17  RIKSKSQAKQLHAQFIRTQSLSHTSASIVISI---YTNLKLLHEALLLFKTLKSPPVLAW 76

Query: 107 NVMIRSYVNNNLYVEALSVYQVMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVK 166
             +IR + + +L+ +AL+ +  M +    PDH  FP VLK+C+ + +L+ G  VHG IV+
Sbjct: 77  KSVIRCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVR 136

Query: 167 IGLDSNLFIGNALVAMYGKC-------------------------------GCVR----- 226
           +G+D +L+ GNAL+ MY K                                 C+      
Sbjct: 137 LGMDCDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGID 196

Query: 227 EARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEMDSLKLNHDAGTMASLLPAVS 286
             R+V + MP +DVVS+N+++AGYAQSG +++AL + +EM +  L  D+ T++S+LP  S
Sbjct: 197 SVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFS 256

Query: 287 N---------------------------------TSPENVQYIHKMFEKMAMKSLVSWNV 346
                                                  ++   ++F ++  +  +SWN 
Sbjct: 257 EYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNS 316

Query: 347 MIAIYVNNSLPNEAVNIFFQMEECGMKPDAVTIASLLPACGDLSALFLGKRLHEYIEKDN 406
           ++A YV N   NEA+ +F QM    +KP AV  +S++PAC  L+ L LGK+LH Y+ +  
Sbjct: 317 LVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGG 376

Query: 407 LRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYDAVALFA 466
              N+ + +AL+DMY+KCG ++ AR +FD+M   D  SWT+++  + + G G++AV+LF 
Sbjct: 377 FGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFE 436

Query: 467 KMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMVDLLGRA 526
           +M+  G+ P+ +AFV+VL+ACSH GL+D+   YF+ MT+   +   +EH+A + DLLGRA
Sbjct: 437 EMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRA 496

Query: 527 GEVEEAYSFIKQMPMEPNERVWGALLSACRVHSKMDIGLLAADCLFQLAPKQSGYYVLLS 586
           G++EEAY+FI +M +EP   VW  LLS+C VH  +++    A+ +F +  +  G YVL+ 
Sbjct: 497 GKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMC 556

Query: 587 NIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAKNIYEELD 646
           N+YA  G W+++  +R  M+KKG++K P  S +E+K + H F++GD+ HP    I E L 
Sbjct: 557 NMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLK 616

Query: 647 VLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTKPGTPIRITKNLR 679
            ++ +M++ GY+  T   LHDV+ E K   L  HSE+LA+ F I+NT+PGT IR+TKN+R
Sbjct: 617 AVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIR 676

BLAST of Tan0006148 vs. TAIR 10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 495.0 bits (1273), Expect = 9.8e-140
Identity = 251/663 (37.86%), Postives = 380/663 (57.32%), Query Frame = 0

Query: 48  YPDRKTLNKLHSKIVVNEHLHIDPTLSIKLMRAYSACGETRVARFIFDRTFEKNVVFFNV 107
           + D +    +H + VV +    +  L   +++ Y        AR +FDR  EK+ + +N 
Sbjct: 132 FRDDRAGRVIHGQAVV-DGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNT 191

Query: 108 MIRSYVNNNLYVEALSVYQ--VMSSCAFNPDHYTFPCVLKACSGLDNLKVGFQVHGAIVK 167
           MI  Y  N +YVE++ V++  +  SC    D  T   +L A + L  L++G Q+H    K
Sbjct: 192 MISGYRKNEMYVESIQVFRDLINESCT-RLDTTTLLDILPAVAELQELRLGMQIHSLATK 251

Query: 168 IGLDSNLFIGNALVAMYGKCGCVREARKVLDQMPNRDVVSWNSMVAGYAQSGQFDEALEI 227
            G  S+ ++    +++Y KCG ++    +  +    D+V++N+M+ GY  +G+ + +L +
Sbjct: 252 TGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSL 311

Query: 228 CKEMDSLKLNHDAGTMASLLPAVSN------------------------------TSPEN 287
            KE+        + T+ SL+P   +                              +    
Sbjct: 312 FKELMLSGARLRSSTLVSLVPVSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNE 371

Query: 288 VQYIHKMFEKMAMKSLVSWNVMIAIYVNNSLPNEAVNIFFQMEECGMKPDAVTIASLLPA 347
           ++   K+F++   KSL SWN MI+ Y  N L  +A+++F +M++    P+ VTI  +L A
Sbjct: 372 IESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSA 431

Query: 348 CGDLSALFLGKRLHEYIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASW 407
           C  L AL LGK +H+ +   +   ++ +  AL+ MYAKCG + EAR +FD M  ++  +W
Sbjct: 432 CAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTW 491

Query: 408 TSMMSAYGISGQGYDAVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTE 467
            +M+S YG+ GQG +A+ +F +M  SG+ P  + F+ VL ACSHAGL+ +G   F+ M  
Sbjct: 492 NTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIH 551

Query: 468 QCRIVPRIEHFACMVDLLGRAGEVEEAYSFIKQMPMEPNERVWGALLSACRVHSKMDIGL 527
           +    P ++H+ACMVD+LGRAG ++ A  FI+ M +EP   VW  LL ACR+H   ++  
Sbjct: 552 RYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLAR 611

Query: 528 LAADCLFQLAPKQSGYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQV 587
             ++ LF+L P   GY+VLLSNI++    +     VR   KK+ + K PG + +EI    
Sbjct: 612 TVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETP 671

Query: 588 HTFLAGDQYHPQAKNIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLA 647
           H F +GDQ HPQ K IYE+L+ L GKM+E GY P+TE ALHDVE EE+E  + +HSE+LA
Sbjct: 672 HVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLA 731

Query: 648 IVFAILNTKPGTPIRITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCG 679
           I F ++ T+PGT IRI KNLRVC DCH   K ISKI  R I+VRD NRFHHF +G+CSCG
Sbjct: 732 IAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCG 791

BLAST of Tan0006148 vs. TAIR 10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 493.0 bits (1268), Expect = 3.7e-139
Identity = 261/649 (40.22%), Postives = 385/649 (59.32%), Query Frame = 0

Query: 77  LMRAYSACGETRVARFIFDRTFEKNVVFFNVMIRSYVNNNLYVEALSVYQVMSSCAFNPD 136
           L+  Y   G+   ++ +      +++V +N ++ S   N   +EAL   + M      PD
Sbjct: 242 LVAMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPD 301

Query: 137 HYTFPCVLKACSGLDNLKVGFQVHGAIVKIG-LDSNLFIGNALVAMYGKCGCVREARKVL 196
            +T   VL ACS L+ L+ G ++H   +K G LD N F+G+ALV MY  C  V   R+V 
Sbjct: 302 EFTISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVF 361

Query: 197 DQMPNRDVVSWNSMVAGYAQSGQFDEALEICKEM-DSLKLNHDAGTMASLLPA------- 256
           D M +R +  WN+M+AGY+Q+    EAL +   M +S  L  ++ TMA ++PA       
Sbjct: 362 DGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAF 421

Query: 257 --------------------VSNTSPE------NVQYIHKMFEKMAMKSLVSWNVMIAIY 316
                               V NT  +       +    ++F KM  + LV+WN MI  Y
Sbjct: 422 SRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGY 481

Query: 317 VNNSLPNEAVNIFFQME-----------ECGMKPDAVTIASLLPACGDLSALFLGKRLHE 376
           V +    +A+ +  +M+              +KP+++T+ ++LP+C  LSAL  GK +H 
Sbjct: 482 VFSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHA 541

Query: 377 YIEKDNLRPNLLLENALLDMYAKCGCLEEARDVFDKMKFRDVASWTSMMSAYGISGQGYD 436
           Y  K+NL  ++ + +AL+DMYAKCGCL+ +R VFD++  ++V +W  ++ AYG+ G G +
Sbjct: 542 YAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQE 601

Query: 437 AVALFAKMQESGLNPDSIAFVSVLSACSHAGLLDQGRHYFHVMTEQCRIVPRIEHFACMV 496
           A+ L   M   G+ P+ + F+SV +ACSH+G++D+G   F+VM     + P  +H+AC+V
Sbjct: 602 AIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVV 661

Query: 497 DLLGRAGEVEEAYSFIKQMPMEPNER-VWGALLSACRVHSKMDIGLLAADCLFQLAPKQS 556
           DLLGRAG ++EAY  +  MP + N+   W +LL A R+H+ ++IG +AA  L QL P  +
Sbjct: 662 DLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVA 721

Query: 557 GYYVLLSNIYAKAGMWRDVGNVRYAMKKKGIKKVPGISNVEIKGQVHTFLAGDQYHPQAK 616
            +YVLL+NIY+ AG+W     VR  MK++G++K PG S +E   +VH F+AGD  HPQ++
Sbjct: 722 SHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSE 781

Query: 617 NIYEELDVLVGKMKELGYIPQTESALHDVEVEEKECHLAIHSEKLAIVFAILNTKPGTPI 676
            +   L+ L  +M++ GY+P T   LH+VE +EKE  L  HSEKLAI F ILNT PGT I
Sbjct: 782 KLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTII 841

Query: 677 RITKNLRVCGDCHIAIKFISKIVSRDIIVRDCNRFHHFSNGICSCGDYW 679
           R+ KNLRVC DCH+A KFISKIV R+II+RD  RFH F NG CSCGDYW
Sbjct: 842 RVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 890

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P0C8994.2e-25265.31Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis th... [more]
Q9LTV81.7e-14437.76Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9LW631.9e-14337.04Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SUH61.4e-13837.86Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX... [more]
Q7Y2115.3e-13840.22Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_022157138.10.0e+0090.41putative pentatricopeptide repeat-containing protein At3g49142 [Momordica charan... [more]
XP_022946769.10.0e+0089.69putative pentatricopeptide repeat-containing protein At3g49142 [Cucurbita moscha... [more]
XP_022999640.10.0e+0089.40putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 [Cucur... [more]
XP_023546028.10.0e+0089.10putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 [Cucur... [more]
XP_008456467.10.0e+0088.50PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Cucum... [more]
Match NameE-valueIdentityDescription
A0A6J1DSL60.0e+0090.41putative pentatricopeptide repeat-containing protein At3g49142 OS=Momordica char... [more]
A0A6J1G4Z80.0e+0089.69putative pentatricopeptide repeat-containing protein At3g49142 OS=Cucurbita mosc... [more]
A0A6J1KDN50.0e+0089.40putative pentatricopeptide repeat-containing protein At3g49142 isoform X1 OS=Cuc... [more]
A0A1S3C3E60.0e+0088.50putative pentatricopeptide repeat-containing protein At3g49142 OS=Cucumis melo O... [more]
A0A0A0KE240.0e+0087.61DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G4042... [more]
Match NameE-valueIdentityDescription
AT3G49142.13.0e-25365.31Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G12770.11.2e-14537.76mitochondrial editing factor 22 [more]
AT3G23330.11.3e-14437.04Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G30700.19.8e-14037.86Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G57430.13.7e-13940.22Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 270..316
e-value: 1.6E-8
score: 34.6
coord: 371..418
e-value: 4.0E-8
score: 33.3
coord: 100..147
e-value: 3.6E-9
score: 36.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 105..136
e-value: 7.8E-5
score: 20.6
coord: 176..203
e-value: 0.0011
score: 17.0
coord: 204..234
e-value: 7.1E-7
score: 27.0
coord: 374..407
e-value: 6.6E-6
score: 24.0
coord: 345..372
e-value: 2.4E-5
score: 22.2
coord: 272..306
e-value: 1.7E-6
score: 25.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 448..470
e-value: 0.057
score: 13.7
coord: 204..231
e-value: 6.0E-8
score: 32.4
coord: 345..370
e-value: 1.3E-5
score: 25.1
coord: 176..202
e-value: 0.0022
score: 18.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..370
score: 8.801982
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 202..236
score: 11.454616
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 371..405
score: 11.312119
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 270..304
score: 10.840783
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 101..135
score: 9.591195
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 544..667
e-value: 1.4E-42
score: 144.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 257..321
e-value: 1.4E-11
score: 46.1
coord: 154..256
e-value: 3.2E-21
score: 77.5
coord: 322..422
e-value: 1.8E-25
score: 91.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 52..153
e-value: 4.6E-11
score: 44.7
coord: 423..617
e-value: 2.3E-12
score: 49.0
NoneNo IPR availablePANTHERPTHR47924:SF26PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN-RELATEDcoord: 20..671
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 20..671

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0006148.1Tan0006148.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding