MS018773 (gene) Bitter gourd (TR) v1

Overview
Name: MS018773
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pentatricopeptide repeat-containing protein putative isoform 1
Location: scaffold313: 1855763 .. 1880078 (+)
RNA-Seq Expression: MS018773
Synteny: MS018773
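The Location field above packs scaffold, start, end, and strand into one string. The sketch below shows one way to split it and derive the genomic span. It assumes the coordinates are 1-based and inclusive (the common GFF convention), which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(text):
    """Split 'scaffold: start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold313: 1855763 .. 1880078 (+)")
print(seqid, strand, span_length(start, end))
```

If the database instead uses 0-based half-open coordinates, the `+ 1` in `span_length` should be dropped.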
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTAGCTTTCAAGTTCTTCAAATGGGCTGGAAGCCAGGTTGGTTTCCGCCATACCACCGAGGCTTACTGCATTGTAGTTCACATACTGTTTCGTGCGAGAATGTATGCAAATGCCCACGATATTATCAAGGAAGTGATTTTGAAGAGCCAGAACGACTTGGTTTTGCCAGTTTGTAAGATATTTGATATACTTTGGTCGACTAGGAATATTTTTGTGTTAGGAACAGGGGTCTTTGACGTTTTATTTAGTGTTTTGGTAGAGTTGGGGCTGCTTGAGGAAGCTAATGAATGTTTCTTGAGAATGAGGAAGTTTAGAACTCTTCCCAAAGCCCGTTCTTGCAATTTTTTTTTGCATAGGTTATCAAAGTCAGGGAAAGGACAGTTGGTGAGGAAGTTTTTCCATGACATGATTGGGGCTGGTATTGCACCTTCGGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGAGATTTGGAAAATGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTTCTCCTGATGTTGTGACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTGGAAGAATCTGTGTATTTATTTAATGAAATGAAAGGTGCAGGCTGTGTTCCTGATGTAATTACCTACAATGCTTTAATCAATTGTTTCTGCAAGTTTGAGAAGATGCCTCGAGCTTTTGAGTATCTCTCTGAGATGAAGAACAATGGGTTGAAACCAAATGTTGTAACCTATAGCACCTTGATTGATGCTTTTTGCAAGGAGGGAATAATGCAAGGTGCCATAAAACTTTTTGTTGATATGAGAAGAGTAGGCCTTGTACCTAATGAATTCACCTACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTGACAGAAGCATGGAAGTTGTCTCACGATATGTTGCAAGCAGGAGTTAACTTAAACATAGTCACTTATACTGCTCTAATGGATGGCCTTTGTGAAGATGGAAGAATGACGGAGGCAGAAGAAGTGTACAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAAGTTTACACTGCTTTGGTTCATGGCTATATCAAGGCGGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGAACCATTATTTGGGGTCTCTGTAGTCAAAATAAACTTGAAGAAACTAAGCTTATAATTAAAGAAATGAAGAGTCGGGGTATTAACGCAAATCCAGTTATATACACAACAATTATAGATGCTTATTTTAAGGCTGGAGAAAGCTCAGATGCAATAAATCTTCTTCAGGAGATGCAGGATGCAGGTATTGAGGCTACTGTTGTAACCTACTGTGTATTAATTGATGGTTTGTGCAAAACAGGTAAGGTCGAACTAGCAGTTGATTATTTTGGTAGAATGTCTGCTGTTGGTTTACAACCTAATGTTGCAGTTTATACAGCCCTCATTGATGGTCTTTGTAAAACAAATTGCGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGGCCCCGGATAAAACAGCTTTCACTGCTCTGATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTTGAATTTGAGTAGTAGAATGACAGAATTAGCTATCGAGTTTGATTTGCATGCTTACACTTCCTTGGTTTCGGGATTTTCTCAATGCGGTGAGCTGCACCAAGCGAGGAAGTTTTTTGATGAGATGGTTGAGAAGGGCATACTTCCTGAGGAGATTTTATGTATATGTCTATTGAGAGAGTATTACAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAGTGAAATGCAAAGAAGGGGTTTGATTACTGAAAAGTGCAGCCATGCAGTTCCTAGTCTAACAAGAGGACTCCAATGACTAATCTGATCATCTTTGGCGTCTGGAAGCTGAGGTTGTATCTTAATCGCAAG
CACATGCGGCATTTATTTGGAATTCAGAAGTATGATGACCCAAGAAACCCACCATTCAAATGGGTTCTGTGTTTGATGGGAAGAACTTGCAAGCGATGCATGATGTATCTGATGTTGGTTTTGGAAGCTCTGATGCAAAATCTTCCATTTGATAAGATGTAGCTTTCCAGTTTTCAAATGACATTCTGCAGAAAAATCCCATAATGTCATGTGTTTCATTAATCATCAATCTTCTAAAAATGTGAGTGACAGTCTTGTTCCTCCTCTTCCTCCTTTTTCCGAAAAGAAGATTTTCTTTTTAGATATGAGATACAGGCGTGGAATTTTTGGCAGTAATAGCCATTTACGTGAGATTGAAAGTAATAACTTTTTCTACAACGATTATGCATGATGTATCCATCTTGCTTGTCTTCTGTCAACTAATTTGGCATGTTGGTTTACAGGTCCAAAGAGGATTCGTGTGGGCAGAGATATTGATTCCCATGGAGGCATTCTAGTGAAAATACCTATCTGTTATTGGTTCCGAGGGAAGCATATTCATTCAAGTTTGAAGCTCGGCACTGCTTATTATGAGCAAGGTGTTCCATTCTTCATGCTTCTTTATATTTCTGAAACTCTTGGAATCCAAGCCTTTGAATGGCCCTTAGAGACTCTGGATTGGAAGAGTGGTAGGTTCTTAGGGAGATTATTGGCAACATTCAACCTAATTCTAGAGAGAACTCTTTGAAATGGATCCTTGATCCTTTTCAGTAGTTTATGAGCGTCTCTCTATTTTCCTTTTTCATGAATGGTTATAGCCTTTTGCCAACAAACCTTGCTAGGAAGATTTGAATGGCAAAATTCCTATGGAAGTCAGAACATCCCTCTGTTCTCTCAATTTCTATTAAGAAGAATAGAAAAGCAAATTACACCAATTCTCTTCTTTAGAGATAGATTGAGCCAACGACCCAAAACCCATCTAAAATTGACAAAACTGACTGCAAAAATCAGTGAAACCAACCAGTTCGTTAAAAACCAAAGCTATTGAAATATTTGCGATTTTGAAACTATATAAAATTGAATAGACCGACACTTTTACCAAATTGACCCAACTAGCATGATAAGTGCCAAGGTTGCTCGGGTTTACAAAACTAGAGGTCCTTCTAAGATCCTTGCAAACATTCACAAAAACGACTAACTTACCAAATGATTCCAACCAAGAGTAAGGTTAGAAGATGATTCACTCTTAGATCAGACTTAGAATCTACGATGGTCACTCCTAGATCCTAAGATGACTTACTCCAGAATTAGCTTGATCGAGACTGATCCAATGAATCGAGTTTAATATCAAACCTAAGCAAGATTTGAAAGAAGCAACACCAAATGTGTTAAGGAACAGAAGTTCTTGAATTTCAATATGTTGAATCTAAAACAAGGTGATTACAAGCCTTTAAATAGGCTAAATGGTCGTTTAACATAGGAACACCAAAAGTCTTTCAAAATACTACTTAATGCAAATTGAAGAAAATAAGACATGCAAATTAAAACTTGATTGCACATCATTCTAGACACTTAAATGCGGATCAGAAAACGTAGTCACGCTTCAAATGGTCTTACCTATGTGCTGTAGGAGGTGCTTCACATTTTGATATGGTTTGACAACTTTTGATTTCACAAACTCCCAAGAGGTGCAATCTGCAAAGGCACCTGTTCACGAATCTGCACACTTTTTTTGTTCTCCGTGCCATCTCTCAAAATAGACAACTTTTCCTGTGCCATCTTTGGTCAATTCTCTTCTTGACTCATTGAAGTAACATTGTAAATAAATTTGGGATAAAACTCTTCGTTGGTTCTTGTGTATTAATCATCCTTTGAAGATGTAGTGTGAAAGCCTCTTGAATCTTCTTTGCCTTTCTTGTAATAATTGTTCCTTTGATACATGTAATTGGTCACAAATGGATTCGATCATAGCACGACTTCATTGTTTTCATTGGTCCCATACCTAAGAATGTTTCAGCCTC
TCTCCTAAGTCCTGAAAGCCTGGGTGGCCTCCCTTTGGGTGCTAAGGCTTGTTCTCCAGTGTGTGGGAAATGGGAATGTTGTTCTAGTTTTTTGTTGGAACATTTAGTCAGGGAAAAATACAAGGGTCTTTCAAGTTCTTCATATAGAGATTAGGGGATCCTTTTGGTTATAGTAATCTTCTTGTCTCCTGTTGATGGTTCTTTCAGGAGCTGCTACAGCTATTGTACTACTACTATAGTTGGCAGCTGGACAACTTTTTGTAGTTTTTAGTTGCTCTGGGGCTTGAGAGATGCCTTATATTTCTCCCTTATTTTTACAGGATTTTTCTTCCTCTGGAAAAAGCTCATCAACTTCCTTTTCCTCCAAATGGAAGGTCTTCACGGATGAGGAGCCACTTCTAAATCATCATGGGTTGACTTAGTGTTCATAAATGTAAATAGTAAAGGGTTGAGAAAAAATAAGCTCAAAGTTCAAACCATAGTGGCCACCTACCTATGATTTAATATCCTACCAGTTAACTTGGCAATCCAATATAGTAGGGTCATGTAGTTGTTTTGTGAGATTAATTGAGGTGCGTTCAAGCTGGTCTGGACACTCATGGATATCAAAAGATAAAAGTAAATAAATAAGTAGGCACTTCTCTGTTATGTTTTTATTTTAAAATTTTTATGTACCTTAAAGCAGTGTATTTTTAGTTATTTATCATAACACTGTAGTTATCCTTATGTGGTTGTTCTCGTTTCTATTGTTAGATATTTAGTTCGTTACAACATGATGATGGTGATTAAGAAGGCTATAGCATTGGTTAAGCATTCCAAAGCTGCTTTCTTTTTTCAAGTTAATTTGTCAAGTGAAACTGATTATCAGTACAATTTTCACTTCTGATTAGTGATTTTTGGCACCTCCACTAATAGTGACTTGCAAAAGCTTTAGATGGTTCCCATTTCTGTTCATTACCATCTTCAGGAGTCCTACTGTAGGGAACAAAACTTAGTCAGATGCCCAAGAACAGAACTTAGTGACATAATGAATATTTTATGTCGGTGTTCCGATCATTTGAATATTTGTCAAATTTATTTATGGCATGCATATATTAGTTTTACTATTGAAGCCTGAGGGTGAATGATTTTTATAAGATAAGTTTCTTTGAATTTCTCAGGTGGTTGGTTTTGGAGAGTATATCTGGCAAATCATTGGCAGAATCAGAACCCATCCATATTTGTTGAGGGTGCTTTGAGAGGTTCAATTGTTGAATAATTGCGACTTGTTTGGCTCCTTCTAACACATTATATGCAAGTCCCAAATCTCAACAGTTCTGGAAGAGAGAACCTGCTCAATAATTTGTATTACCCTTGCGGAATACAAATGAGGTGATGTGATGTACAAAATTTTATTTGTTTTGAAAATCTGTACATGATTTTAAGAAATGAAGCAGGAGAAACATAGAGAACAGAGTGTTATTGTCTACAAACAACCATAAGCTATCTGTTTTTCGGTTGCGTGAAAAACAAGGAACCAAATCATTGCTATTTTTTATATGGAATCGTACGCTACATTTTCGGGCATATTTGATAGTAATGTCAAAATTATTGTCTATCTTTTAAAATCACTTGTATATCATTGCATACAAATATCAATATTTAAAAATAAGGCTAGAGTAATTNNNNNNNNNNNNNNNNNNNNNNNNNNAGTTAACCATGGTATGGTTATCATTTTATGTGACTAGGTGTGTTACTCTGAACACTCGTAGATGATGATGAAAGTCTTGGAGGTAAATCTTAATTCATCTCTGAGTTAGAAGACCTAATTAATCTCAAGTCAGAATACCTAATTCGTTCCTAAAATATAAATTAAAAGATAATAATAGTAAAGTATATGTATACATATTTATTCACCGGTTTTGCCACTAACAAATGTTCAATAGATACTTATTAAACTAATCTAAATTGTCTGCTATATTTCTTGGTAACAAATATAATTAAGGGTGTGTTTGAGAGTG
GCCAAAATCACTAATATTTTCATTGATCATGTTTGGTAAATTAGTATCATTGGAGGATGATTTTGAAAAATTAATTATACTTATATCAATTTTTTCAAAATAACTTCTTCTACCCTAGCTTCTAAACATATGTTAAAGCGAGCAAACAATCAGACTAAATAGATGTTTTAATTCTTAATGTTGGAGTATTTTTTAATTTATTTTTGATGTTTTAAAAGTTTTAATTTAGTTGTCCACCATCAATAAATATTAAAATTTTTTGATGATAAACTTTATAAATGTAATATTTAATGAATCAATAAAAATTTAAAAAAGAAAAATGTTAATGTAAATTTCTAAGAAATTAGTGAAATAAAAAATTTGAGTTTGATAAATTTTTCCCCTCTAAATTTCACAGATTTATTATATATTATGTTGATGTAAAACACCAATAAGTAGAGACCAAATTTTAAAATGTTAAAATATAAAAATTAAATTAAAAAATGTTTAAATAATGGAGACAAACATTCATTAAAGACTAAAACGAAGAGTGCACGGATATAAAGGTGAGGTCGGCCGCACGGAATCTGTACCACTGTAGTGGACTTGTTTGGGTTTTTGCTTTGGGAGGTGGTGCGGATATCAGAGTTAGGTTTTGAAACTGGCTTTGGTCGTCCCGGAAGCTCAGCTACGGTCTCAGCCTCGGGTATGGATTCCCCGCCCTCTCCCCCTCTTCCCATCTCTTCGCATAGCCGTTAAAAACTTTGCTTCTCATTTTCGAATTCACAGTCGCAGCAGAAATTGACTTCAATCGTCCTCCATCAATGTCTTCTTCTCATCTTTCTCTGTTCCCTGTCACAGGCAATGAGCTTCTCGCGAACTGGTTCAGGTTTAGACGAAGAAGAAGACGGTATTTTTGTGTTGGGAGCTGTTCTCTGTGTTTGATGCTTGTACTTGTTTCACCGATATGACTACCAGAGTTCCAGTGCAGCACTACGATCTCAGAACGGCGAATTCGTTCATCGGCAGCGCTTTGCATGATCTCAACACCGTGGATGGAAGCCCTTCTGATATTGCAGCCATCAGCGACGTTGATCGCGACGCCGTCACTGAAGATCGCTTGGACGATGACCATGATTCCAGTGCTGTTGTTAGTGTTCTTGTCTCCCTTTACTCTTCGTTCTCTGTCGTATGGCTATTTCTCGTTTACCTCTGAATTCTAGTAGTCACGAAGTTAATTATCACTTAATGATATGACGGAAAGTTGGGATGTTCATAAATTCAAGGTCAATGTTTTGTTTCCCTCCATTATTTGTTCTAACATAAAAGCCCTTATGCAGTTACATGCTTCCATCTTAATCAGCACGTCCTCTCGTTGTCTTAGAGCTAAATTCCGCTTCATCAACTGTTAAATAACGAATTTTTTTGCTGAGTTGCGGCTGCTTTCTTGTAACTGTACAGTTGTTCATGCCTTTTGATTGCATTTTGTAGGATTGCATACACGAATCCTACAGAACTTCATTACCCCTTCACGATGTGGGAGTAGAAGAAGATCGCTCCAGTCTCGAGAATAGTGGGTCTTCCAGGTTGCCATATGACTCTTTAGCAATAGAGGGTATTGTCGACATATCTGAAACTTATTACTTATCTTAAAAGGAATTCTGGACTTCAATCTCTGCTAGGCCTTCACGGGCTTTTGTTTAGACCAAATATAAATTGATAAAAGCTTAGTTTTGGTCCAGATATTGCACCTATTGAAGCAGCACGAGCAAGATTTCTGCAGATCATTGTGGATCATTTTATTGATGATCATGTGACAGAAGTGGCTGAGACTGATAATGATTATCTCTCTCAGTCTGGACAGGATAAGTTGACAAAGAGGAAGACAAAGGAGGTCCAGTATGAGGGGGATCCAAAATATGTCTTACCCTTGATGTATGTGGCAAATATGTATGAAACGCTTGTTAGTGAAGCAAACCTGAGGCTTGCTTCCTTGAGTGGCATCCGTGATAAAACTATTG
GGGTAGCCCTTGAAGCAGCTGGTGGTTTGTACAGAAAACTGGCTCAGAAATTCCCCAAAAAAGGTACAAACTATTACTCTATAGCAGTGTTTCAGACGAACAGCTTAGGATAGAAGGGGGGGAGGGGCAGTGGGGGGAGTATTCTCCTTTCCTTAGGTTTACCTCTCTCCAGTGATTATTTTGTTGTGAAAAAATCCTGCAGGGGAGAACAATATGGTAGTGGTAGTATTTCTGGAAATTGAGGGTTTGTATGGCAACTACTCTCATAATGGTTTTCTGTTTTTGAAGTTCAATGAAAATGTGTTTACATTGCTCTTTTTTTAACCTGTTTTTTGTTAAACAAATGAAGTTGTTATTTCCAAAATTGTAGAGTTGTAAATAGTGGTTTTAATGGGCATATTAACTTTTACAGATTATCCAATAGCAATTCTTAAAATAGTTTTGATAATAAAACAGAAAACTGTTTTCAAACAAGCCACAATGCAGTTATTTTTTGGATAGCACTATGCAGTTATTTTTTAAGAGTGTAATACTTGGAAATTGGCTTCTTTGATTGAGTGGCAGCCAGATGAAACTAGTTTCTATATATATTATTTTTTAAGGAATAGTAATAAAATTTTCATTTTTAATGAAGTTACTTCTTGTATCTTAATGACATTGTATTGCGAACAGGCCCTTGCACATATAAGAGAAGAGAACTTGCCACTTCTCTTGAAACAAGGACTAGGTTTCCAGAGCTAGTAATTCAAGAAGAAAAGCGGGTTCGTTTTGTGGTAGTTAATGGTTTAGACATTGTTGAAAAACCAAATAGAATGCCTATGGAAGATGCTGAATGGTAAGTTAAATAGTATAATTTTACATAATTTTCTAAATTTTTTATTAGTGTAAAAGTTCCCAGTATCATTATTTACATCGCCATCGGTTAGCTTCCCAGAATCTGTTTAATTTACAAATTGTACAGAACTTTTTCTTTAGTTGTATTATACTAGCTCTTTTGCCTGGCAAAATATTTATTCCTGAGTTCTAATTGTGATTGAATAAGAATTAAGCAATGGGTTGCTGGAGAGTGAAGACCATGTCCTAAGAAAATCTTGTTTTTTCTAATCAAATAAAAAAAAATTGTGATGGCATTGTGTTAAAAGACATAATATATCTATTATGTCAACAGTTCACAGTGAAAAGAAGTAAAATATTCGATATCAGAACTATTTTTTTAATAGAAAAGGTACAAGAAAACTTTTTTTGAAGAACAAACTAAACAATTTTTATCCAGTCAAACTGAAAAGATTCCACAGTCCTTACAATCAATAACATGGCATCTACATTATCATCAAAAGGTTGAATTAAGATGACCGTGTAATATGAAGGTCCACCATTGTAAGGATGTTGGGGTTGTGGTATCTAACCACTTAACCATATTAGGAGCCCATCTGAAGGGCAGTGAAGTTGATATTATTAATGCAGATGGAGTCTACAAACCAATCTGCAGAGTCGCTTGCCTTGTTCCTCCATTTAGTTTAACATCCTGGACTACTTGAGAGAGTGAGCAATAAAATCCTTTCAATCCTTGGGAAAAATGTAACTCAAGATCTTCAGAATACCCTTTTCCTTTTATTTGGTCAAATTTTGTTTTGTTGATAATCAAGACTTGGCCATCCCTGCAAAGAAAAATCTAGATGCGAACAGAGTTTATTCACCTGTCTCCAAGAATTGGTAGCAAGTATTTTGATTTCTATTTCCAGTTTAGGTCAACATTTTGCTGATTAAATTGCTTCATTGTGTAGGTTTAGACGATTGACAGGTCGCAGTGAGGTAGCTGTGTCTACTCAGGACTACAAGTTCTATTCACCAAGACACAAATATAGGCGAGTTGTGGCAAACTCTGTGTCCAGCATTTCCAGTTTGAATGTAAGCATAGTAATTTTTCCAATATTTTTTATCCATTTTTATATCTCCCGGAAATATTTTGCATAAATGCATTTTGCAGGTGATTTTCCATA
ATGTATTCTCGCACTGTGAAAAAAAATTGGATCATAGCTATAGCTTTGGTATTATCTAAATGATGTCAAACAAGTTATATCGTTTTTTCTTTATTTTGTTGATAAATTAAGAGATGTTAGACCTGTATGTATTTATTTCAAGGACAAATGTTTTTGTCATGCCTGACTGAATTATTCATGGAGAAAATAATGTCTGGTGTCTGGTGTATTTTTTTCAGACATTTTCCAGCAGTGACAATTCTTCCACTCTGGCTACTGGTCAAACATTCCGCTCTCTAAGTGAAGTAAGAATTTTGTTCTCACCAAATTTTTTATGACTGGTACCTTTTCTCTTTGGGGATTAATCTGGTTTAATTTTGATCATTATGCCCTCTTACAGAATTCATCCTATTCTTTACTTATTGAGATGCAAAGAAGAAAGTAAAAGATTTAAGAGTCTGGTTTTTGTTTAAAGAACATTTAGTTTAGGAAATGGTGGGGCTGTGTCCATATTCTTGTCGGAGAGAGTAATTTATATTGTGTGCATTTCTGTGATGTCCTAACAACCAAAAGCCTGACTCTTCCGCAATTTTAAATTCAGCAACAGACACCCTGCAAACATCATATCCAACAACTGCCGCATCAGCCTCAATTTCAGTCTATCCAGCAGAATCATCATCAGTCCATGCACCCAAGTCAACATACCGCCCACTTCGCTCACAATCATCAGTGTGGTCAACCTTCACAGTTACCGGATATTTCTCATACTCATCATTCTCCAACAATGTCACAGCACATTGCTTGCTTACAACCTCTTTCTGGTGTTGTTGGTGGGCGCTTGCATCATGGGCTGGTAATTTCAAATCCCCCTCCCCCCAAATTAAGATCATCCGCTTCCCTATTTTCGTCAGCTGGATGTTGGGCAGTTTTTCATTTTTCAATTTCTTATGAAATATTTTAAATTTCTTTTTTGAAAAATTTTATTTGGGGGTAAGATCTTCTAAGAATTGTTAGCTCTCACAGTAATTGTCTGGTCTTAATTAATATGATATTGGATTACCCCTTTTATGAGAACTTCTGGTCGTGGTCCCATAAATGTTATTAAGATACTATCACAACTAAGTTCTTGGATGACGCAACGGAAAGAAAAAACTCTCTTTATTTTGGACAATTTTAAACATAAAAATAAACTACTTTTGCAACTTCGTTTCCAGATTGTAAGCCGGACATCCAAAAATTTTTTTATGCGAAACTGTGCAGTTTTTATCTACATGCTGACTGATAAGACAGGTTAAACTTTAAGCTGCCCCATAATGCTTTATTGGTTCCTGCATAGCTTGGGCAGTCGACTGCCAGAAGAGTGATTTCGAGAATCGTACAATGTTTCTCAAGATTTCAAATGTTCTAAAAAATAATTCTCAGCATTTCTCCGGGTGCCCCTGCTTATTTTAATGTAGTATCCTGACGTCTTAGTGCCATAAAATTCTTTACTTGATGTTGTGCTACTTATATTTTGATAATATGAACTCTGGATTTCTTCGAGTCCATTTTACGTTAATTATTGATTGATGTTGGCCACATAATTCAACTTCTCTGCTAAAATATCTAATAGAAGGGAAGCAGCTTAGGTTTCTATTATCCAGTAATATTTTCTGAAATACTCATATTTGCAGCCATCAAGCCCTGCCAAGTTCTGCGATGAATGTGGAGCTCCATATTTAAGAGAAACCTCTAAGTTCTGCTCAGAGTGCGGTGTTAAGAGGTTAGGAATTTGACGTGTGCTGTATAATGATCGTTGTTAGTTAGTTCCAAAAAGCTCATTGTTGTATCTAGTTTCTCGTCAAGTTAGTGGAAATTAAGTTTTCTGCAGTCCTCGTACATAAAATTTAGATCTCAAATATACAGATATACATAGCTATGATGTCAGCTTTTGAAAGGAATGAATATAGAAATTACTCTTACCCATAAACGTGCACTGTTATTGGAGCTGTATATTTCTTAAAGATTTCAGAAGAGAATTC
GGATGCCAAAAAAGTATAGCGAAAGTTTCCTTCCTAAAAACACTACATACAGGCGGCATCATCACCTTCTTGGCTTCTCTTGCAGCCCAAGGACTTCATGGAAGAACCTTTCTTAACGTAACTTGCTGCAAGAGCATTCTTGAAAACATACTGAATGCCTTGATCTGATCGTTGAAAATCGAACCGAAAAAAATGTTGTCATATCATTGAAATGCCTCTGGATCTTATTTTGTGGATCTGCTAGTTCGTCCGATTTATTTGGAATCATTTCTTGGAGGTGTTTGGGGTTTCTTTAGCGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNGTGAAAGCTTTGCTTTCCTATACCAAAGGGCACCCACACTTTGCTTTAAGGCTTGGATCCAAGAGTTTTTTGTATTAAGTCCATCGGAAAGTTCGAACTTGAGACCTCTAGGCCAATGTGACCAAGAGATCTCAAGCTCTTGCCAATCAAGTTGACCCTTGAGGGTGGACTTTCCTATACCAAATGGCACCCGCACTTGCTCTAGGGCTTGGATCCAAGAGTTTTTGGTATTAAGCCCATCGGAAAATTCGAATTTGAGACCTCTTAGCCAGTGTGACCAAGAGATCTCAAGCTCTTGCCAACCAGGTTGACCCTTGAGGGTCGGGTTTCTTTGGTTCTCAATAGAGAGTGTTGCACGATGATGGAGGAAATGTTGCTCTTTCCCCCTTTTCGAGATAGAAAAAGTTTTTTGTGGCAGGCTTATCCATTTTTATTTATTTATATATTTTTGGCTATTTTGTACAGCTTTTGGCTCATGAGGAGTAGCAGAATTTTTAGGGGTTGAGAGGCCTTGGGAGGAAGTTAGAAATATTATTATTCCTCCCTGTGAGTGTCGGTCTCGAAGGTTTTTTGTAATTATTAATTTAGAGATTATTCTTTTGGACCGAGGCCTTTTATATAGGCCGGTCTCCTTTTGTCGGGCTTATTTTTTTTTATGTCCTGTAGACCTTCTTTCATTTCTTTTCAATGAAAGCATTCTTTTCTTATTGAAAGAAAAAAGAAAAAACACAATGTTGTCAATATCATATCAGAGTTTTGACACTTGGCAATCATTTTGAAGAACAGGAACATGAAGGATATCTTCCTAGAGTTAGAGAGTTAATCTGAGGAGGGACTGTCGTTATTTGAAATCCAAATGGTTTTGCCATTTCTTGCCAAATAGTTTCTCATACTCATTAAAGAGAGTATAGAGTGAGTTGCTTAGAAAGTTGATGTCAGGAATTTAGGTGGCCTGGAATTGGATCCTCCACAGTCTGTTTTTCCAGCTTCAAAGTAAATACAGGGATAGAGGAGATATCTTATATCCCTATATAGGAGTGAAAAGTAGTTTTTTCAAAAATCACTCTAAAATTTTTTCTGCTTACTTCCATTGTTGTCTGTTCGAATGAGTATTGTTAAAGAATTTGGTTCTTGTTGCTGTATGCTCGGTGAACATTATGGATCTCCTAAGTCAACTCTTGAAGGGCACCTACTTCAAAATGAAAGCTATATCTATATTTGGTGGTATGCTTGCAAAACCTTCATTTGGGAAGTTTGGGTTGAAAAAAAAATGGAAGAATTTTTCTGATAGTTGTGGATTTCTTGATCTGATAAATTTGATCGGTTTGTGGCTTTTCTAAGGCACTCATTTGATAGAATCTTAGTCTAGAGACGGTTTTGATTCCTCTACTTTTGGTTTTGATTCATTTTGATTATTGTATTTCTAACTTTTGTATATTTAGTCCTTATACTTTTAAAACATTTGTTTTAGTCCCTATATTTTTAAAGGCGAGACTTCTTTTTACTATTTTTTAGTATTGTTTATTATTAAATTTATTCTAAAAATATATCATCCACTAAATTATTTCCAAGAATAATTCTCACAAAATAGGTGGTCGATGATTTATGAGGTGGCAGCTACAAATCTCTTGATATATTTTTTT
TAAATCCATAAGTTGTCATCTTCGATAAGTTGTGTATCTTTTTTTTTTTTTTAATTCACAACATGTCTCTTGGCCACTTATTGTAGGTCATCTCTAACGGACGATTTTTTATTGTGTTTTTTAATCTCTATATTGACATGGTAATTAAAAAGAAAAATCAAATTGAATTTAACTAAAAAAATATCTAACAAAATCTCAAAACAAACCAAAAAATGTATTTTAGAAAACAAACTTTAAATTAAATTTAACTATAAAAAACCTAACAAATTCTTAAAAAAAATATTATAAAAAAAATCAATCTAAAAAACAAACTTTAAATTATGTTTAGGTATGAAAAATCTAACAAATTCTTATTAAAGATTAAAATTATAAACAAACTCTAATTTTTTAAAAATTTGATCGGATGATTGTACCTTAGATTTTAGAAAAAGGTCTGCAATCCAATAGTGTTACGGCAATCACTCACAACTTATGAATAACAATTTCGATGAAAATTTATTTTCTGCTTGTGATATTTATACTTGAATTCTAAGTCTCCACATGACAATTTCTCATCATCGAGTCATTGTTTTCTGATTTTTTATTAATATTTATGGGGGAGGTATCATGATATTAAAATGTTTAATGATAAATTTAATTTCAAATTTGAACGCTTAATTAATTAAATTAATTTATTTTAAGATTTTGTTTTTAAATAGATTGTTAGATTTTATTTTAAATATATTTTTAAAGATTTTGTTAATTAAATTTAATTTGAATTTTTTTAAAAATAAATATTAAGAAAATCTTGAGAGTACCCTATAGGTAAAGAGATTTTATACGTGGTCAAAATACAGAGACACAACTTGTGGATTAAAAAAAAATCAATAACAAAAATAGTTTGGTAAGTGATATATTTTTTGAATAAATTTAATAATAAGTAATATTTAGAAAAAAGTCCTTTGAAAGTGACTATTAGGATGTTTTCATTTTCATTTGTACATCATTTCTTCTTACCATAATTTTTTTTTTTTTTTGCAAAAGTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGAACATGACACTTCACTCATAAAATTTTTTGAATTTTTTGAAGTGTAGCCATAAAATTGTATTAATTGAAGTGTAGTAATAAAATTGTATTAAACAATTTTTACTGTGTATAAAATTTATGTTAAATTTTACCCATAAGAATTAAGATAGTCATTTTTTAAAAGTACATGAACTGAATTAAAGCAGATTTTTTTGTAAGTATAAAGACTAAAATAAACAAAACTTGAAAATATATAGACTAAAATTGATGTTTTGAAGAGTATATTGATTAAAATGAATAAAAATCAAAAGCACAGGAGTCAAAATAGTATTTAAATTTATAATTTTTTTTCTTATTATTTTGAGCCAATTATTGTAGTAGTAGTATTGGACCCTCCCATTTTTTCACATCCTTATCAGTCATACATAGTTTTCGTTTTTCTATCTAAAAGATAAGATAAAATTATAGAACATGCAGCACGCTCCCCAATACCTTCAAATTTTTCCTAAATAATAATATATTTTTCCCTTTTAAAAAAAATAATGTGTGTATATATATTTCATTGGTAAAGACTAGAAATTAAATTGTGTATTCCAAATTATTGAACTTAAAAAATATGTATAGTTGTAGTAATTAAAATAATGTTATTTATGTTTCAATTTTAAATGGTATACTTAAAAAAAAAAAAATAACAACTTAAACTCATTTACGAAAAGACTAAAATAGTAAATTGGCGGCAATTTCTTCTGTCTCACAACGTGTCAACTCACATAGTCATAGACAATGATAAAATGTCTAAACAGCTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTACAATTTTTTTTTTCCTTTTTTCTTTTACAAAATCATAAAATACAATTGTCATCCACTAAAATTTCAAATTTGAATATCACCAAATATTC
TAAACGAAATGTTTTTGTACTTTATTATACTCTTGTTAATCTAATTGGGTCAAAGTACATTTAAATATAACATAATCTTTCTTATCTTAGTTTGTGACCAAGGTAATAAAAAATGTTATTTAAATAACGTAATTTGAAATATTTAAAATGATTCACTTGAAGAAAAATATATGTTGCTTTACTTTGGAGTCATAAGAATTTCTTTATAACCTAATCAAAGTTTTAACTGTCGACATTGATGAAAATGTCGAAATCTCAATTTCATGAAAACGTCGATAGAAATATCGATAAAATATCAACGTTAATGGTTATTTCTAGAAATGTTATAAAAATTTGTAAAAAAATAATTAAATTAACAAATAAACATTTTAAATAAATTTTAAATAATTAATATTAATATCACGTCTATATAGGTTTTTTAATATTTTTTATATCTTATCGATTTTTTTTAATAGTATAGAAATGTCAATTTATTTGATANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTAATTTGCAAAAAAAAAAAGAAAAAAACCTTTAATTTTCGTCTGAATGTTAAAAAAAAATTTCATAGCAGGAGATCAAATGAAACGGATCAAAGAGAGATTCAGTGGCAAAATTATGCGAGTAATAATATTTATATTTTGAGAACCAAAAATGCCACACGCGAGAAAAACAGTGGAACCAACTAGCCCTAGCAGATAACAAAACGACATCGTTTTAGTAGTCGGAATTCCAACGGCGGCGCCGCCGGAGGTAGGTTTCGATAACTTGCGATCGGGGTCCAAAAGGTTCCCTCTTGCTCCCTCCAAAGTGTACAGTTTACTTGCTCTTGAAAATATCGTCTACCAAACTCAACCACCAGAAATCCTCCGACCGGCCATAAACCCCTCTTTCTCTCGCTCTCTTCCTTCGTAGCCGCTCGCCGGAACCATGAGCTTCACATTCGCCGCCATGTCTTCACCCTCTTCCTCCTCACTTTCATCCATCTTCCATTTCAATGCGTCTAAACCTAAACCCGTCTCACCCCCAAATCTTCGTTTTGCTGCTCCCCTTTCTCGTAATTCTGTATATGTGCAACGTTTTAGGGTTAGGGCTTCCTCTGCTCCCCTTCCGGACGCTTCCCGGAACCACCCTGTTGTTCAATGCCTCCGGAACTATGCCAGAGCTGCGATTCTCATTGGTGCTGCGGCCACCATGGTGGGGAAGTTCTCGCATTTGCCTGCAAGAGCCGAGTCTCCGGCGGCGGTGGCGGAGGAAGCTTCCAGAATGGAGGAAGATCGGCAGGTGGTTGAGGACTCGGACCGGGACAGGCAGCAATCCTCGCCGTTGAACGATTTTCTCGAATCGAATTCTGAAGCTGTCGAGGCACTCAAGTCTCTCCTGCAGCAAAAGCTCGAAAATGGTGAGGATGAGGAGGCTCTGAAAATCTTGAAGCAGTTGGTGTCTGCTCAGCCGTCGGTGACCGAGTGGAAATTTCTGATGGCCAGATTACTCGGCGAGATGGGCAAAACGGAAAATGCACGAGATGTGTTTGAAGAAATTTTGGCTGTGAATCCATTGTCTTTCGAAGCATTGTTTGAAAACGCATTGTTAATGGACCGTTGTGGGGAAGGAGAGGCAGTGCTTCGGCGGTTAGAAGAGGCTCTGAGAATTGCTGAGGATGAAAACAAAGCGAAGGAAGCTAGGGATGTGAAGTTGATAATGGCTCAAATACAGTTCTTGCAGAAGAATGTAGAGGAGGCCTTGAAGAGTTATAAAGAATTGGTGAAGGAGGATCCTAACGACTTTAGGCCTTACTTTTGTCAGGGAATGATCTATAGCTTGCTTGATAAGAATGTGGAGGCCAGAGAGCAATTCTCCAAGTACAGGGAGCTCTCTCCAAAGAAATTTGAGGTTGATGGCTACCTACGGACTCCATTGTCAAGGATGAAAATTTTTGGATCTGATGAGAAGTA
AAGTCATTCCACGATGATCAGATAAGTGGGCTCTTTGGAAAATAGAAGATGAGTAAACGAATTTAAGGTGATACTTTTTTTTTTTCCTTTTTTTTCTTTTGCTGATTATTGTCATTTAGCACTTCTTACTCTTAGTCTGATCTGCCGTAAGCATATGTAGTTCATTTGCGGGATTAAATAGGAGTAAATTGAATCTTATGAGGACAGGTGTTTTCCCTTTCATGAAGCATAAATTTTGCTAATATATGCAGTTGTTTTGATGATACTGTATTAGACTACTCACATTAGTTGGAATAAATGCAGAAAGGTGCTCCGTTAGTAGTTTATCATGTTGTGTGTTGACTTCTGTTTCAAAATAATAATAGCTATAGTTCGTTTGTATTGTTACAATTGTTCTTGCACTTCGATGTATAGAAATGATTATTTCCTTCATGAAAATAAGATAGAATAAGAATGAAAAATCATCTCCTGAGAGAAGGGCAAATCTTTCTTGAAAGAGAGAGAGAGAACATCCTACTTGCTCATCCTTATGGGAGAGTTCCACATGGACTTTTCTGCCTAATTTCAAATGGTAGTTTCAAAGGAAGGGTTGTGAGAAGAGTATGCTTATTTTTTCCTTAAAAGCATAGCGATTTTCAAAGAAGCCATTAATATGTTTTACTATAAAAAGTGAAAATAAAATTGTTAGATCTGATTTTGTCTCTTTCTTTTAGATCTGTGTCAATATGTGAGATTGAGGAAATTAAAGTATTTTTTTACCTAGAGTACAAGTACGTGCTTGCATCTGTCTTGCACTAATAGTCTTTTTGGGATATATCTTGTGATCTTAGATTCTTAATTGATTATCCAAAGCTTTAGACAATTCTGGATGCTTTAGAGTAGGTTTATGGTAAGGGGGTTCATTTTCTTTTTCAAATTTGATGAGGAAGAATCCATCGCTTTCACTTCCTCTGCAAGTTTCCACGATGATTTGAGAAAGAAGGTTGGTTGGGGTAAGCAATTCAATCTTTATCACTATTTATCAGTTTGATAAGAGCAATTTTGCTTGAAATAAATGAGAGTGATCATTAGTACAAGGATACAGTGTCTAAAATTATGTCTGTGCTTTGAAAATGATGATGATGGTTGGTGGCTGATTTTAACTTAGATATTTAACTGCTGCTAAATTATACGCCAACTGTAGTGACGTTGTATGATTTTCTTACTATGATTATGGTTATGATGATTTATTAATGTCTGTTGGAGGAACTTTTGTAACATCCTTGCTCTCTGTCCCCTTTTTGTGCTTTGTTCTATTGAATTGATATAACATTTCTACGTTTCTGATTGTTAGAAAATTATATATTGTTGGGATCAACTTTCATGATTTTGCAAAATTTTCTACTTCGTATGCAACTGGAAGTCAACCCGAGCTTAGTGAATAGTACTTATGATATTCATGGGTTTCTATCTCAACACAATTGGTAATGAGATGAATAGCCCACTTATATTATAAATTCTTGGTAGGTTCCTTATCTTTTCAATGTGTGATCTTTAACATCCCTTCAAGATTTCTTCTGGCTCATCGATTTTGGTCAGATTCCAATTTTGGACCGCTTGTTCGTTTGAGCATTTTGGGCTCTAATACCATATTTATATTCATGGGGTTCCATCTCAAAACCAATTGACGATGAAAGGAGTAACTCACTTGTATTATAAACTCTTGTCGGATCCCTTCTTTTCAATGGGGAATCCTTAACAGTACTATCTTTAAAGTCGATGTTTGATTCTCGCTCCTACAATTGTTAAACTTAAAAAACAAAATGATTAAACAACACCCATACAAAGGGGATAATCTTGGCAGTTTGTTCCTTTCACGATTGCTTGCTTTAGAGAAAAATATTTTTACTCTTTGCTAAAAGAAATTCGGGAAATAATTAAAACTTTTATACTCATGTCAGTTTTTTCTTTCGTTCAATTTTTGCTTCATTGTTAAATGGGTGAGGTTTGAGATCGAATTT
TCATTCTTAAAGAAGATAATTGAGGCTATAAATTACACGAAATGATATAATTGGATTTAATTTTGTACTTGGCATATTAAAGCTTCGGTTTTATCCTTGACTTTCTAATTTTATTTTTGAAAAGATCTTACATTTGCAATTTTTGTTAAAATGTTTGGAGGAAATAAAAATTGGGATTAATGTTGAAAATTTTAGAGCTAGGGCGGAATTAAAATCAACCACATATGTTAGGGACATTTTGTTATCATTTTATTGGTTTCAATTTAGTCCGTTACCTACTAATAATTTAAACTTTACCAAAGATCATAGTGAAACTTGTAATTTTTTGAATACTAAAGACAAATTTGAAAACAACTACATAATTTAAGGACATTTTATATAGTTTATCCCAGTAAAAATTAAGATATTTTAAATCTAAAATTTAATGGTTATGTTGAGCAAAAGGCTTCATTGTCAACCCAAACATCGAATAATGGATAAAACTCTTGCTTAAAATTGATAATTTAATTCCTAACTCTTACGTCTCTTCAGGATGACACAATGACAAGGACTTGGGGCCTTGTGGTATAACCTACCATAAGATCTCTTTTCCTTGCAGTATCACCTATCATGAGATCTTGTGTTTGAGTTATACGTTAGACATAGTGCTGTTGGCAATGAGTCTCGATTATAGAGATCTATAAGCGTAAGCCATATGACATCGATTTTGGGAAATAAAAAATCTCAACTTCTACCATTATTGAACTAAAAAAAATACATCATGCATATCTCGACATCATAAATTAAATCATTATTTTAACTTAGAGTATAAAAAAGAGAAGATACTGCCCAACAATTAATTATGCCTACCTCTTCATAGGTTAAATCGCTATTTTAACTTAGAGTATAAAAAACAAAAGTGTTGTTTAAGCTTATGGAAAAGAAAAAGAACTATTCAACCAAAAGATAATTTGTACCTAAACTATTTAAGACTAGGGAGTAGGATATTAAATTTCATATGTCCTATATTTTTCATCGACATCTTTATAGCATCATAAAAATTTAGACAAAATATAAAAATATTAACGAATGTTATTAATATAGACATAATATTCATATTAATAAATTTTAACTCATTTAAAAAAATGAAATGCTTGATCCTTATGTTTTAATATTTGTATTGATTTTTTCTTAACATATATATCTTTGATGAAAAAAGTATTAAGAAATCCACGTCCCTTCTAAAAAATAAAAATAAAATTCACTGCCGTTGAACTTGGGAAATTGAGGTTTCTAACCCCGTTATATAACTTATCTGGATTTTTGACCCCTTTTTAAAAAAAACTAATTTGTGACTTTAATTTGTAACCCCTTTCCACTTTTTATCTTTGTAAAATTACCAAAATATCCTTGTTTATTTTAAGTAAAATTATCCACCAATCTCTTCTCTCAACCCTTGACATTTTTTTTTTAATTTTGAATCTAATATAAATCTAATATAGATTAATTTATTACTTAACAAACGAATCCACACAAAAAAATCTGTAGATTAAATTCTCCCATAAGTAAATTAGGGTGAAAAAAAGATTTGCATAATCCAAGAATAAATGCATAAATTTGTTCGGAATTAATTACAAAAAAAAAATTAAGGTTTCATAAATTTCGTGTCAAAATTGCATTAAATGCCCGAGATTTACTTGACAATAAAGGCTAAAAGAAACTGGAATTCAAAGGGAGATAATGCAAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTATTTTTTCCATATTAAACTTTTAATTTACTATTTGAGTTAGATATTTTTCTTAAAAGTTAGATTTTTTAAGGTTGGAATTAATTATTGTGAGAAATTTGTGGGGTGAATAAAGATTTGTAAAGAGATTTGTGGAGGGAGATGTTGATTTTGAAATATGAGGATACTACGGTAATTTTACATTTAAAAATTATGAAATTATTAATAAATAAGGGAGAGTGAGGTCATAGATTAGTTTTTTTTTTAAATAGAATTATAAATCCAAAGCGAATTATTGGAGGGGTCAAAATTTCAAATTTGCAATTGAAGTTAATATATATATATATTCATTCTATTTTTATTATTTGTGAGGCCACATGGCCCAGGAATGAGGCTCGCGCTTTTGATGGAGGAATTACAATGCTTACGTGGAACATCCACTCCTTGTGTCCAAAAACTTCCCACTTGGCAGAATCCCAGTGGCTGACAAGATGTTCCCGGATAAGGTATATATATTATGTGGCACTCTGTTCTTCCTCCCCTGTCCTCCGTCGGAGCTTATCAGTCTCCCATGGCCGCCACTGCCTCTCCCGATTGCTCCCTCCCTCCCATTAACTCCTTCACAAAATCCCATCTCATCACTTTCCCCGCCTCCAACCTCCCCCTCCTCTTCTCTCTCCCCTCTTCAAATCTTCGATCCCTTCACCTCAATTCCTCCTCCTGCCCCTCCCCAATCCTCGATCAATCTCCCATCGCCCCTCCCGCCTCCAATCCCCAAGATTCGAACCATTTGTTATCTGGGTTTTCGTCCCAAGATCCTGGAACCGAGGATTCGATCTACGATTGCTACGTCAAGGCGAAGGAGAGGGCAGGGTTCAGACCCGAGAAGTCGACTTTGCGGCATCTGATCAGGTACTTGGTGCAATCCAAGAAGTGGGATCTGATTTTTTCGGTTTCTAGAGATTTTAGGGATTATGGGGTTTGCCCTGATAGAGATACGTGTTGTAGATTGGTTAGTAGTTGCGTTAGAGGTAGGAAATTCAAAATTGTTAGGGCTCTGCTTGAGGTTTTTGAAACGGATGGTGATGTTGCTGCGGCTGCTTTTGAGGCCGCTATGAGAGGCTACAATAAGCTTCATATGTTCAAGAGCACTATCCTCGTTTTCCAGCGGTTGAAATCGGCGAAAATTGAAGCCGATTCTGGATGCTATTGTAGGGTCATGGAAGCTTACCTTAAACTTGGGGATTCTCAGAGAGTTAGGGAACTGTTTGATGAAGTTGAGAGTAGGATATCGGATTTAACGCCCTTTTCGAGTAAGATTTATGGGATTCTTTGCGAGTCGTTGGCGAAATCGGGGCGTGTTTTCGAGTCGCTTGAGTTCTTCAGAGATATGAGAAAGAAAGGGATTGCAGAAGACTACACCATTTACTCTTCTTTGATATGCACTTTTGCTAGCATTAGGGAAGTCAAATTGGCTGAAGACCTTTTCAAAGAGGCCAAAAGCAAGAATTTGTTGAGAGACCCTGCAGTTTTTCTAAAGCTAATATTGATGTATATTCAACAAGGGTCATTAGAGAAGGCACTTGAGGTTGTGGAAATGATGAAGGGCTCGAAAATCGGAGTGTCTGACTGTATTTTCTGCGCAATCGTCAATGGCTACGCTGCGAGAAGGGGCTATAACGCAGCAGTTACGGTTTATGAGAAGCTGATCGGCGACGGGTGTGAGCCAGGACAAGTGACGTACGCCTCAGCGATCAACGCCTACTGCCGCGTCGGGCTGTACTCGAAAGCAGAGGACATATTTGGAGAAATGGAGGAAAAGGGGTTTGAGAAATGTGTAGTAGCTTACTCTAGTTTGATATCAATGTATGGGAAAACAGGGAGGTTGAAGGATGCAATGAGGGTGTTGGCAAAGATGAAAGAAAGAGGGTGTGAGCCAAATGTTTGGATTTACAACATTCTGATGGAGATGCATGG
AAAAGCCAAAAATTTAAAGCAAGTTGAGAAGCTATGGAAGGAAATGAAGCGCAGAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTGCTTATGTGAAGGCAAAAGAATTCGAGACATGCGAGAGATATTACGTCGAGTTTCGGATGAACGGGGGCGCCATCGATAAGGCGATGGCGGGAATCATGGTCAGCGTGTTCTCGAAGACGAGTCGGGTCGATGAGCTGGTGAAGCTTCTGAGGGAGATGAACTTAGAAGGAACAAGGTTGGATGGGAGGCTGTATAGGTCAGCCTTGAATGCTTTGATG

mRNA sequence

CTAGCTTTCAAGTTCTTCAAATGGGCTGGAAGCCAGGTTGGTTTCCGCCATACCACCGAGGCTTACTGCATTGTAGTTCACATACTGTTTCGTGCGAGAATGTATGCAAATGCCCACGATATTATCAAGGAAGTGATTTTGAAGAGCCAGAACGACTTGGTTTTGCCAGTTTGTAAGATATTTGATATACTTTGGTCGACTAGGAATATTTTTGTGTTAGGAACAGGGGTCTTTGACGTTTTATTTAGTGTTTTGGTAGAGTTGGGGCTGCTTGAGGAAGCTAATGAATGTTTCTTGAGAATGAGGAAGTTTAGAACTCTTCCCAAAGCCCGTTCTTGCAATTTTTTTTTGCATAGGTTATCAAAGTCAGGGAAAGGACAGTTGGTGAGGAAGTTTTTCCATGACATGATTGGGGCTGGTATTGCACCTTCGGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGAGATTTGGAAAATGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTTCTCCTGATGTTGTGACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTGGAAGAATCTGTGTATTTATTTAATGAAATGAAAGGTGCAGGCTGTGTTCCTGATGTAATTACCTACAATGCTTTAATCAATTGTTTCTGCAAGTTTGAGAAGATGCCTCGAGCTTTTGAGTATCTCTCTGAGATGAAGAACAATGGGTTGAAACCAAATGTTGTAACCTATAGCACCTTGATTGATGCTTTTTGCAAGGAGGGAATAATGCAAGGTGCCATAAAACTTTTTGTTGATATGAGAAGAGTAGGCCTTGTACCTAATGAATTCACCTACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTGACAGAAGCATGGAAGTTGTCTCACGATATGTTGCAAGCAGGAGTTAACTTAAACATAGTCACTTATACTGCTCTAATGGATGGCCTTTGTGAAGATGGAAGAATGACGGAGGCAGAAGAAGTGTACAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAAGTTTACACTGCTTTGGTTCATGGCTATATCAAGGCGGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGAACCATTATTTGGGGTCTCTGTAGTCAAAATAAACTTGAAGAAACTAAGCTTATAATTAAAGAAATGAAGAGTCGGGGTATTAACGCAAATCCAGTTATATACACAACAATTATAGATGCTTATTTTAAGGCTGGAGAAAGCTCAGATGCAATAAATCTTCTTCAGGAGATGCAGGATGCAGGTATTGAGGCTACTGTTGTAACCTACTGTGTATTAATTGATGGTTTGTGCAAAACAGGTAAGGTCGAACTAGCAGTTGATTATTTTGGTAGAATGTCTGCTGTTGGTTTACAACCTAATGTTGCAGTTTATACAGCCCTCATTGATGGTCTTTGTAAAACAAATTGCGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGGCCCCGGATAAAACAGCTTTCACTGCTCTGATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTTGAATTTGAGTAGTAGAATGACAGAATTAGCTATCGAGTTTGATTTGCATGCTTACACTTCCTTGGTTTCGGGATTTTCTCAATGCGGTGAGCTGCACCAAGCGAGGAAGTTTTTTGATGAGATGGTTGAGAAGGGCATACTTCCTGAGGAGATTTTATGTATATGTCTATTGAGAGAGTATTACAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAGTGAAATGCAAAGAAGGGGTTTGATTACTGAAAAGTGCAGCCATGCACCAGGACAAGTGACGTACGCCTCAGCGATCAACGCCTACTGCCGCGTCGGGCTGTACTCGAAAGCAGAGGACATATTTGGAGA
AATGGAGGAAAAGGGGTTTGAGAAATGTGTAGTAGCTTACTCTAGTTTGATATCAATGTATGGGAAAACAGGGAGGTTGAAGGATGCAATGAGGGTGTTGGCAAAGATGAAAGAAAGAGGGTGTGAGCCAAATGTTTGGATTTACAACATTCTGATGGAGATGCATGGAAAAGCCAAAAATTTAAAGCAAGTTGAGAAGCTATGGAAGGAAATGAAGCGCAGAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTGCTTATGTGAAGGCAAAAGAATTCGAGACATGCGAGAGATATTACGTCGAGTTTCGGATGAACGGGGGCGCCATCGATAAGGCGATGGCGGGAATCATGGTCAGCGTGTTCTCGAAGACGAGTCGGGTCGATGAGCTGGTGAAGCTTCTGAGGGAGATGAACTTAGAAGGAACAAGGTTGGATGGGAGGCTGTATAGGTCAGCCTTGAATGCTTTGATG

Coding sequence (CDS)

CTAGCTTTCAAGTTCTTCAAATGGGCTGGAAGCCAGGTTGGTTTCCGCCATACCACCGAGGCTTACTGCATTGTAGTTCACATACTGTTTCGTGCGAGAATGTATGCAAATGCCCACGATATTATCAAGGAAGTGATTTTGAAGAGCCAGAACGACTTGGTTTTGCCAGTTTGTAAGATATTTGATATACTTTGGTCGACTAGGAATATTTTTGTGTTAGGAACAGGGGTCTTTGACGTTTTATTTAGTGTTTTGGTAGAGTTGGGGCTGCTTGAGGAAGCTAATGAATGTTTCTTGAGAATGAGGAAGTTTAGAACTCTTCCCAAAGCCCGTTCTTGCAATTTTTTTTTGCATAGGTTATCAAAGTCAGGGAAAGGACAGTTGGTGAGGAAGTTTTTCCATGACATGATTGGGGCTGGTATTGCACCTTCGGTTTTTACCTACAATGTAATGATAGATTACTTGTGCAAAGAAGGAGATTTGGAAAATGCTAGACGTTTGTTTGTGCAAATGAGGCAGATGGGCTTTTCTCCTGATGTTGTGACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTGGAAGAATCTGTGTATTTATTTAATGAAATGAAAGGTGCAGGCTGTGTTCCTGATGTAATTACCTACAATGCTTTAATCAATTGTTTCTGCAAGTTTGAGAAGATGCCTCGAGCTTTTGAGTATCTCTCTGAGATGAAGAACAATGGGTTGAAACCAAATGTTGTAACCTATAGCACCTTGATTGATGCTTTTTGCAAGGAGGGAATAATGCAAGGTGCCATAAAACTTTTTGTTGATATGAGAAGAGTAGGCCTTGTACCTAATGAATTCACCTACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTGACAGAAGCATGGAAGTTGTCTCACGATATGTTGCAAGCAGGAGTTAACTTAAACATAGTCACTTATACTGCTCTAATGGATGGCCTTTGTGAAGATGGAAGAATGACGGAGGCAGAAGAAGTGTACAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAAGTTTACACTGCTTTGGTTCATGGCTATATCAAGGCGGAGAGAATGGAGGATGCAATGGAAATATTGAAGCAAATGACAGAATGTAACATCAAACCAGATTTAATACTCTATGGAACCATTATTTGGGGTCTCTGTAGTCAAAATAAACTTGAAGAAACTAAGCTTATAATTAAAGAAATGAAGAGTCGGGGTATTAACGCAAATCCAGTTATATACACAACAATTATAGATGCTTATTTTAAGGCTGGAGAAAGCTCAGATGCAATAAATCTTCTTCAGGAGATGCAGGATGCAGGTATTGAGGCTACTGTTGTAACCTACTGTGTATTAATTGATGGTTTGTGCAAAACAGGTAAGGTCGAACTAGCAGTTGATTATTTTGGTAGAATGTCTGCTGTTGGTTTACAACCTAATGTTGCAGTTTATACAGCCCTCATTGATGGTCTTTGTAAAACAAATTGCGTTGAATCTGCCAAAAAGTTGTTTGATGAAATGCAATGTAGGGGTATGGCCCCGGATAAAACAGCTTTCACTGCTCTGATTGATGGCAACTTGAAGCTTGGAAATCTTCAGGAAGCTTTGAATTTGAGTAGTAGAATGACAGAATTAGCTATCGAGTTTGATTTGCATGCTTACACTTCCTTGGTTTCGGGATTTTCTCAATGCGGTGAGCTGCACCAAGCGAGGAAGTTTTTTGATGAGATGGTTGAGAAGGGCATACTTCCTGAGGAGATTTTATGTATATGTCTATTGAGAGAGTATTACAAGCTTGGACAGTTGGATGAAGCCATTGAATTGAAGAGTGAAATGCAAAGAAGGGGTTTGATTACTGAAAAGTGCAGCCATGCACCAGGACAAGTGACGTACGCCTCAGCGATCAACGCCTACTGCCGCGTCGGGCTGTACTCGAAAGCAGAGGACATATTTGGAGA
AATGGAGGAAAAGGGGTTTGAGAAATGTGTAGTAGCTTACTCTAGTTTGATATCAATGTATGGGAAAACAGGGAGGTTGAAGGATGCAATGAGGGTGTTGGCAAAGATGAAAGAAAGAGGGTGTGAGCCAAATGTTTGGATTTACAACATTCTGATGGAGATGCATGGAAAAGCCAAAAATTTAAAGCAAGTTGAGAAGCTATGGAAGGAAATGAAGCGCAGAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTGCTTATGTGAAGGCAAAAGAATTCGAGACATGCGAGAGATATTACGTCGAGTTTCGGATGAACGGGGGCGCCATCGATAAGGCGATGGCGGGAATCATGGTCAGCGTGTTCTCGAAGACGAGTCGGGTCGATGAGCTGGTGAAGCTTCTGAGGGAGATGAACTTAGAAGGAACAAGGTTGGATGGGAGGCTGTATAGGTCAGCCTTGAATGCTTTGATG

Protein sequence

LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKIFDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRLSKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGNLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRGINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAVDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGNLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEEILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFEKCVVAYSSLISMYGKTGRLKDAMRVLAKMKERGCEPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRRKIAPDKVSYTSIISAYVKAKEFETCERYYVEFRMNGGAIDKAMAGIMVSVFSKTSRVDELVKLLREMNLEGTRLDGRLYRSALNALM
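The protein sequence above reads as a frame-0 translation of the CDS (for example, the leading codons CTA GCT TTC AAG give L A F K). The following minimal sketch reproduces that mapping. It assumes the standard nuclear genetic code applies; `translate` is an illustrative helper, not the tool the database used to produce the sequence.

```python
# Standard genetic code, laid out with bases ordered T, C, A, G
# at each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, frame=0):
    """Translate a nucleotide string codon by codon.

    Codons containing anything other than A/C/G/T (e.g. N) become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(frame, len(cds) - 2, 3):
        protein.append(CODON_TABLE.get(cds[pos:pos + 3], "X"))
    return "".join(protein)


# First 24 nucleotides of the CDS shown on this page.
print(translate("CTAGCTTTCAAGTTCTTCAAATGG"))
```

Running the full CDS through `translate` and comparing the result with the listed protein is a quick consistency check on a downloaded record.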
Homology
BLAST of MS018773 vs. NCBI nr
Match: XP_022146419.1 (putative pentatricopeptide repeat-containing protein At2g02150 [Momordica charantia] >XP_022146420.1 putative pentatricopeptide repeat-containing protein At2g02150 [Momordica charantia] >XP_022146421.1 putative pentatricopeptide repeat-containing protein At2g02150 [Momordica charantia])

HSP 1 Score: 1286.2 bits (3327), Expect = 0.0e+00
Identity = 638/644 (99.07%), Positives = 639/644 (99.22%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI
Sbjct: 140 LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 199

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNF LHRL
Sbjct: 200 FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFLLHRL 259

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV
Sbjct: 260 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 319

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE
Sbjct: 320 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 379

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN
Sbjct: 380 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 439

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA
Sbjct: 440 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 499

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG
Sbjct: 500 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 559

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV
Sbjct: 560 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 619

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMSA GLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN
Sbjct: 620 DYFGRMSAFGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 679

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE
Sbjct: 680 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 739

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAPGQVT 645
           ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHA   +T
Sbjct: 740 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAVPSLT 783
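
The percentages on each HSP summary line follow from the two counts and the aligned length. The Python sketch below shows one way to recompute them from a summary line like the one for this hit. The helper name `parse_hsp_summary` and the returned dictionary keys are illustrative assumptions, not part of BLAST or of this database, and the pattern only covers the line layout shown on this page.

```python
import re

# Hypothetical helper, not part of any BLAST tool: it reads the counts from an
# HSP summary line in the layout used on this page. The pattern accepts both
# the "Positives" and "Postives" spellings of the label.
_HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return the reported counts plus percentages recomputed from them."""
    match = _HSP_LINE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed values, rounded to two decimals like the reported ones.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    summary = (
        "Identity = 638/644 (99.07%), Positives = 639/644 (99.22%), "
        "Query Frame = 0"
    )
    print(parse_hsp_summary(summary))
```

Comparing the recomputed and reported fields is a quick consistency check when the records are copied out of the page by hand.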

BLAST of MS018773 vs. NCBI nr
Match: KAG6601913.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1183.3 bits (3060), Expect = 0.0e+00
Identity = 571/638 (89.50%), Positives = 609/638 (95.45%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAG+ +GFRHTTE+YCI+VH+LFRARMY NAHDI+KE++LKS+ DL+LPVC +
Sbjct: 143 LALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPVCNV 202

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FDILWSTRN  V GTGVFDVLFSVLVELGLLEEANECF +MRKFRTLPKARSCNF LHRL
Sbjct: 203 FDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLLHRL 262

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLVRKFFHDMIGAGIAPSVFTYNVMID+LCKEGD+ENAR LFVQMR MGFSPDV
Sbjct: 263 SKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDVENARSLFVQMRTMGFSPDV 322

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLL+ESVYLFNEMK  GCVPDVITYNALINCFCKFEKMP+AFEYLSE
Sbjct: 323 VTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEYLSE 382

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEG+MQGAIKLFVDMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 383 MKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGN 442

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLNIVTYTALMDGLCEDGRM EAEEV+RAMLKDGISPNQQVYTA
Sbjct: 443 LTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTA 502

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAE+MEDA+EILKQMTEC IKPDL+LYGTIIWGLC+QNKLEETKLIIKEMK RG
Sbjct: 503 LVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKKRG 562

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I ANPVIYTTIIDAYFKAG+SSDA++LLQEMQ+ G+EATVVTYCVLIDGLCKTG VE+AV
Sbjct: 563 IRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAV 622

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMS  G+QPNVAVYTALIDGLCK NC+ESAKKLFDEMQCRGM PDKTAFTALIDGN
Sbjct: 623 DYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKTAFTALIDGN 682

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEALNL S+MTEL IEFDLHAYT+LVSGFSQCGELHQARKFF+EM+EKGILP+E
Sbjct: 683 LKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDE 742

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSH 639
           ILCICLLREY KLG LDEAIELK+EMQRRGLITEKCSH
Sbjct: 743 ILCICLLREYNKLGHLDEAIELKNEMQRRGLITEKCSH 780

BLAST of MS018773 vs. NCBI nr
Match: KAG7032608.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1182.9 bits (3059), Expect = 0.0e+00
Identity = 570/638 (89.34%), Positives = 609/638 (95.45%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAG+ +GFRHTTE+YCI+VH+LFRARMY NAHDI+KE++LKS+ DL+LPVC +
Sbjct: 127 LALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPVCNV 186

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FDILWSTRN  V GTGVFDVLFSVLVELGLLEEANECF +MRKFRTLPKARSCNF LHRL
Sbjct: 187 FDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLLHRL 246

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLVRKFFHDMIGAGIAPSVFTYNVMID+LCKEGD+ENAR LFVQMR MGFSPDV
Sbjct: 247 SKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDVENARSLFVQMRTMGFSPDV 306

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLL+ESVYLFNEMK  GCVPDVITYNALINCFCKFEKMP+AFEYLSE
Sbjct: 307 VTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEYLSE 366

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEG+MQGAIKLFVDMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 367 MKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGN 426

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLNIVTYTALMDGLCEDGRM EAEEV+RAMLKDGISPNQQVYTA
Sbjct: 427 LTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTA 486

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAE+MEDA+EILKQMTEC IKPDL+LYGTIIWGLC+QNKLEETKLIIKEMK RG
Sbjct: 487 LVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKKRG 546

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I ANPVIYTTIIDAYFKAG+SSDA++LLQEMQ+ G+EATVVTYCVLIDGLCKTG VE+AV
Sbjct: 547 IRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAV 606

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMS  G+QPNVAVYTALIDGLCK NC+ESAKKLFDEMQCRGM PDKTAFTALIDGN
Sbjct: 607 DYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKTAFTALIDGN 666

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEALNL S+MTEL IEFDLHAYT+LVSGFSQCGELHQARKFF+EM+EKGILP+E
Sbjct: 667 LKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDE 726

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSH 639
           ILCICLLREY KLG LDEAIELK+EMQRRGL+TEKCSH
Sbjct: 727 ILCICLLREYNKLGHLDEAIELKNEMQRRGLVTEKCSH 764

BLAST of MS018773 vs. NCBI nr
Match: XP_023534824.1 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023534833.1 putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1179.9 bits (3051), Expect = 0.0e+00
Identity = 568/638 (89.03%), Positives = 609/638 (95.45%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAG+ +GFRHTTE+YCI+VH+LFRARMY NAHDI+KE++LKS+ DL+LPVC +
Sbjct: 143 LALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPVCNV 202

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FDILWSTRN  V GTGVFDVLFSVLVELGLLEEANECF +MRKFRTLPKARSCNF LHRL
Sbjct: 203 FDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLLHRL 262

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SK+G GQLVRKFFHDM+GAGIAPSVFTYNVMID+LCKEGDLENAR LFVQMR MGFSPDV
Sbjct: 263 SKAGNGQLVRKFFHDMVGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMRTMGFSPDV 322

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLL+ESVYLFNEMK  GCVPDVITYNALINCFCKFEKMP+AFEYLSE
Sbjct: 323 VTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEYLSE 382

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEG+MQGAIKLFVDMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 383 MKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGN 442

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLNIVTYTALMDGLCEDGRM EAEEV+RAMLKDGISPNQQVYTA
Sbjct: 443 LTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTA 502

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAE+MEDA+EILKQ+T+C IKPDL+LYGTIIWGLC+QNKLEETKLIIKEMKSRG
Sbjct: 503 LVHGYIKAEKMEDALEILKQITKCGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRG 562

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I ANPVIYTTIIDAYFKAG+ SDA++LLQEMQ+ G+EATVVTYCVLIDGLCKTG VE+AV
Sbjct: 563 IRANPVIYTTIIDAYFKAGKGSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAV 622

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMS  G+QPNVAVYTALIDGLCK NC+ESAKKLFDEMQCRGM PDKTAFTALIDGN
Sbjct: 623 DYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAKKLFDEMQCRGMTPDKTAFTALIDGN 682

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEALNL S+MTEL IEFDLHAYT+LVSGFSQCGELHQARKFF+EM+EKGILP+E
Sbjct: 683 LKLGNLQEALNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDE 742

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSH 639
           ILCICLLREY KLG LDEAIELK+EMQRRGLITEKCSH
Sbjct: 743 ILCICLLREYNKLGHLDEAIELKNEMQRRGLITEKCSH 780

BLAST of MS018773 vs. NCBI nr
Match: XP_038906984.1 (putative pentatricopeptide repeat-containing protein At2g02150 [Benincasa hispida])

HSP 1 Score: 1176.8 bits (3043), Expect = 0.0e+00
Identity = 574/639 (89.83%), Positives = 607/639 (94.99%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAGS +GF HTTE+YCIVVH+LFRARMY NAHDI+KE+I+KS+ D+  PVC I
Sbjct: 127 LALKFFKWAGSHIGFHHTTESYCIVVHMLFRARMYTNAHDIVKEMIVKSRIDVGFPVCNI 186

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+LWSTRNI + G GVFDVLFSVLV+LG+LEEANECF RMR FRT PKARSCNF LHRL
Sbjct: 187 FDVLWSTRNICMSGPGVFDVLFSVLVDLGMLEEANECFSRMRNFRTFPKARSCNFLLHRL 246

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLVRKFF DMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMR MGFSPDV
Sbjct: 247 SKSGNGQLVRKFFKDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRHMGFSPDV 306

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLLEE+VYLFNEMK  GCVPDVITYN LINCFCKFEKMPRAF YLSE
Sbjct: 307 VTYNSLIDGYGKVGLLEEAVYLFNEMKDVGCVPDVITYNGLINCFCKFEKMPRAFHYLSE 366

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEG+MQGAIKLF DMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 367 MKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFFDMRRVGLLPNEFTYTSLIDANCKAGN 426

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLNIVTYTALMDGLCE GRM EAEEV+R+MLKDGISPNQQVYTA
Sbjct: 427 LTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEYGRMMEAEEVFRSMLKDGISPNQQVYTA 486

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAERMEDA+EILKQMTE NIKPDLILYGTIIWGLCSQ+KLEETKLIIKEMKSRG
Sbjct: 487 LVHGYIKAERMEDAIEILKQMTEYNIKPDLILYGTIIWGLCSQSKLEETKLIIKEMKSRG 546

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I+ANPVIYTTIIDAYFKAG+SSDAINLLQEMQDAG+EATVVTYCVLIDGLCKTG VELAV
Sbjct: 547 ISANPVIYTTIIDAYFKAGKSSDAINLLQEMQDAGVEATVVTYCVLIDGLCKTGLVELAV 606

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMS +GLQPNVAVYTALIDGLCKTNC+ESAKKLFDEMQCRGM PD TAFTAL+DGN
Sbjct: 607 DYFGRMSNLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQCRGMTPDITAFTALVDGN 666

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEAL+L SRMTELA EFDLHAYTSLVSGFSQCGELHQARK+F+EM+EKGILPEE
Sbjct: 667 LKLGNLQEALDLISRMTELATEFDLHAYTSLVSGFSQCGELHQARKYFNEMIEKGILPEE 726

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHA 640
           ILCICLLREYYKLG+LDEAIE+K+EMQRRGLITEKCSHA
Sbjct: 727 ILCICLLREYYKLGKLDEAIEMKNEMQRRGLITEKCSHA 765

BLAST of MS018773 vs. ExPASy Swiss-Prot
Match: P0C894 (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana OX=3702 GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 780.8 bits (2015), Expect = 1.5e-224
Identity = 371/640 (57.97%), Positives = 489/640 (76.41%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LAFKFFKW+ ++ GF+H+ E+YCIV HILF ARMY +A+ ++KE++L   +      C +
Sbjct: 124 LAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVLSKAD------CDV 183

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+LWSTRN+ V G GVFD LFSVL++LG+LEEA +CF +M++FR  PK RSCN  LHR 
Sbjct: 184 FDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRF 243

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           +K GK   V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+  G  PD 
Sbjct: 244 AKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRGLVPDT 303

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNS+IDG+GKVG L+++V  F EMK   C PDVITYNALINCFCKF K+P   E+  E
Sbjct: 304 VTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGLEFYRE 363

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MK NGLKPNVV+YSTL+DAFCKEG+MQ AIK +VDMRRVGLVPNE+TYTSLIDANCK GN
Sbjct: 364 MKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDANCKIGN 423

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           L++A++L ++MLQ GV  N+VTYTAL+DGLC+  RM EAEE++  M   G+ PN   Y A
Sbjct: 424 LSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNLASYNA 483

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           L+HG++KA+ M+ A+E+L ++    IKPDL+LYGT IWGLCS  K+E  K+++ EMK  G
Sbjct: 484 LIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVMNEMKECG 543

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I AN +IYTT++DAYFK+G  ++ ++LL EM++  IE TVVT+CVLIDGLCK   V  AV
Sbjct: 544 IKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKNKLVSKAV 603

Query: 481 DYFGRMS-AVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDG 540
           DYF R+S   GLQ N A++TA+IDGLCK N VE+A  LF++M  +G+ PD+TA+T+L+DG
Sbjct: 604 DYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTAYTSLMDG 663

Query: 541 NLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPE 600
           N K GN+ EAL L  +M E+ ++ DL AYTSLV G S C +L +AR F +EM+ +GI P+
Sbjct: 664 NFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMIGEGIHPD 723

Query: 601 EILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHA 640
           E+LCI +L+++Y+LG +DEA+EL+S + +  L+T    +A
Sbjct: 724 EVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNA 757

BLAST of MS018773 vs. ExPASy Swiss-Prot
Match: Q9ZUA2 (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX=3702 GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 4.5e-107
Identity = 207/547 (37.84%), Positives = 318/547 (58.14%), Query Frame = 0

Query: 90  LLEEANECFLRMRKFRTLPKARSCNFFLHRLSKSGKGQLVRKFFHDMIGAGIAPSVFTYN 149
           ++ EA +   R+RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 150 VMIDYLCKEGDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFNEMK-- 209
            ++ ++CK G ++ A  +   M + G  PDV++YNSLIDG+ + G +  +  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 210 -GAGCVPDVITYNALINCFCKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGIM 269
            G  C PD++++N+L N F K + +   F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVML-KCCSPNVVTYSTWIDTFCKSGEL 180

Query: 270 QGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGNLTEAWKLSHDMLQAGVNLNIVTYTAL 329
           Q A+K F  M+R  L PN  T+T LID  CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 330 MDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNI 389
           +DG C+ G M  AEE+Y  M++D + PN  VYT ++ G+ +    ++AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 390 KPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRGINANPVIYTTIIDAYFKAGESSDAIN 449
           + D+  YG II GLC   KL+E   I+++M+   +  + VI+TT+++AYFK+G    A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 450 LLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAVDYFGRMSAVGLQPNVAVYTALIDGLC 509
           +  ++ + G E  VV    +IDG+ K G++  A+ YF    A     N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIEKA-----NDVMYTVLIDALC 420

Query: 510 KTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGNLKLGNLQEALNLSSRMTELAIEFDLH 569
           K       ++LF ++   G+ PDK  +T+ I G  K GNL +A  L +RM +  +  DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 570 AYTSLVSGFSQCGELHQARKFFDEMVEKGILPEEILCICLLREYYKLGQLDEAIELKSEM 629
           AYT+L+ G +  G + +AR+ FDEM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 630 QRRGLIT 634
           QRRGL+T
Sbjct: 541 QRRGLVT 541

BLAST of MS018773 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 383.6 bits (984), Expect = 5.5e-105
Identity = 236/818 (28.85%), Positives = 401/818 (49.02%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFR--HTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVC 60
           LA KF KW   Q G    H  +  CI  HIL RARMY  A  I+KE+ L S         
Sbjct: 52  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF---- 111

Query: 61  KIFDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLH 120
            +F  L +T  +      V+D+L  V +  G+++++ E F  M  +   P   +CN  L 
Sbjct: 112 -VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 171

Query: 121 RLSKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSP 180
            + KSG+   V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M + G++P
Sbjct: 172 SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 231

Query: 181 DVVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYL 240
            +VTYN+++  Y K G  + ++ L + MK  G   DV TYN LI+  C+  ++ + +  L
Sbjct: 232 TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 291

Query: 241 SEMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKA 300
            +M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID +   
Sbjct: 292 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 351

Query: 301 GNLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVY 360
           GN  EA K+ + M   G+  + V+Y  L+DGLC++     A   Y  M ++G+   +  Y
Sbjct: 352 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 411

Query: 361 TALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKS 420
           T ++ G  K   +++A+ +L +M++  I PD++ Y  +I G C   + +  K I+  +  
Sbjct: 412 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 471

Query: 421 RGINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVEL 480
            G++ N +IY+T+I    + G   +AI + + M   G      T+ VL+  LCK GKV  
Sbjct: 472 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 531

Query: 481 AVDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALID 540
           A ++   M++ G+ PN   +  LI+G   +     A  +FDEM   G  P    + +L+ 
Sbjct: 532 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 591

Query: 541 GNLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILP 600
           G  K G+L+EA      +  +    D   Y +L++   + G L +A   F EMV++ ILP
Sbjct: 592 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 651

Query: 601 EEILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAPGQVTYASAINAYCRVGLY 660
           +      L+    + G+   AI    E + RG +       P +V Y   ++   + G +
Sbjct: 652 DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNV------LPNKVMYTCFVDGMFKAGQW 711

Query: 661 SKAEDIFGEMEEKGFEKCVVAYSSLISMYGKTGRLKDAMRVLAKMKERGCEPNVWIYNIL 720
                   +M+  G    +V  +++I  Y + G+++    +L +M  +   PN+  YNIL
Sbjct: 712 KAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNIL 771

Query: 721 MEMHGKAKNLKQVEKLWKEMKRRKIAPDKVSYTSIISAYVKAKEFETCERYYVEFRMNGG 780
           +  + K K++     L++ +    I PDK++  S++    ++   E   +    F   G 
Sbjct: 772 LHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGV 831

Query: 781 AIDKAMAGIMVSVFSKTSRVDELVKLLREMNLEGTRLD 817
            +D+    +++S       ++    L++ M   G  LD
Sbjct: 832 EVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGISLD 858

BLAST of MS018773 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 3.7e-101
Identity = 203/636 (31.92%), Positives = 341/636 (53.62%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           L   FF WA S+       E+ CIV+H+   ++    A  +I     + + ++     + 
Sbjct: 103 LVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSFVQF 162

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+L  T   +     VFDV F VLV+ GLL EA   F +M  +  +    SCN +L RL
Sbjct: 163 FDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRL 222

Query: 121 SKS-GKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPD 180
           SK   K       F +    G+  +V +YN++I ++C+ G ++ A  L + M   G++PD
Sbjct: 223 SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPD 282

Query: 181 VVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLS 240
           V++Y+++++GY + G L++   L   MK  G  P+   Y ++I   C+  K+  A E  S
Sbjct: 283 VISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFS 342

Query: 241 EMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAG 300
           EM   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+  TYT++I   C+ G
Sbjct: 343 EMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIG 402

Query: 301 NLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYT 360
           ++ EA KL H+M   G+  + VT+T L++G C+ G M +A  V+  M++ G SPN   YT
Sbjct: 403 DMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYT 462

Query: 361 ALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSR 420
            L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC    +EE   ++ E ++ 
Sbjct: 463 TLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAA 522

Query: 421 GINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELA 480
           G+NA+ V YTT++DAY K+GE   A  +L+EM   G++ T+VT+ VL++G C  G +E  
Sbjct: 523 GLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDG 582

Query: 481 VDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDG 540
                 M A G+ PN   + +L+   C  N +++A  ++ +M  RG+ PD   +  L+ G
Sbjct: 583 EKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKG 642

Query: 541 NLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPE 600
           + K  N++EA  L   M        +  Y+ L+ GF +  +  +AR+ FD+M  +G+  +
Sbjct: 643 HCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLAAD 702

Query: 601 EILCICLLREYYKLGQLDEAIELKSEMQRRGLITEK 636
           + +        YK  + D  ++   E+    L+ E+
Sbjct: 703 KEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDEQ 736

BLAST of MS018773 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 2.4e-92
Identity = 217/803 (27.02%), Positives = 380/803 (47.32%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           L  +FF + G   GF H+T ++CI++H L +A ++  A  +++ ++L++     L    +
Sbjct: 86  LGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLLRA-----LKPSDV 145

Query: 61  FDILWST-RNIFVLGTGVFDVLFSVLVELGLLEEANECF-LRMRKFRTLPKARSCNFFLH 120
           F++L+S      +  +  FD+L    V    + +    F + + K   LP+ R+ +  LH
Sbjct: 146 FNVLFSCYEKCKLSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLH 205

Query: 121 RLSKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSP 180
            L K     L  + F+DM+  GI P V+ Y  +I  LC+  DL  A+ +   M   G   
Sbjct: 206 GLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDV 265

Query: 181 DVVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYL 240
           ++V YN LIDG  K   + E+V +  ++ G    PDV+TY  L+   CK ++     E +
Sbjct: 266 NIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMM 325

Query: 241 SEMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKA 300
            EM      P+    S+L++   K G ++ A+ L   +   G+ PN F Y +LID+ CK 
Sbjct: 326 DEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKG 385

Query: 301 GNLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVY 360
               EA  L   M + G+  N VTY+ L+D  C  G++  A      M+  G+  +   Y
Sbjct: 386 RKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPY 445

Query: 361 TALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKS 420
            +L++G+ K   +  A   + +M    ++P ++ Y +++ G CS+ K+ +   +  EM  
Sbjct: 446 NSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTG 505

Query: 421 RGINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVEL 480
           +GI  +   +TT++   F+AG   DA+ L  EM +  ++   VTY V+I+G C+ G +  
Sbjct: 506 KGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSK 565

Query: 481 AVDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALID 540
           A ++   M+  G+ P+   Y  LI GLC T     AK   D +       ++  +T L+ 
Sbjct: 566 AFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLH 625

Query: 541 GNLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFF----DEMVEK 600
           G  + G L+EAL++   M +  ++ DL  Y  L+ G  +    H+ RK F     EM ++
Sbjct: 626 GFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLK----HKDRKLFFGLLKEMHDR 685

Query: 601 GILPEEILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAPGQVTYASAINAYCR 660
           G+ P++++   ++    K G   EA  +   M     I E C   P +VTY + IN  C+
Sbjct: 686 GLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLM-----INEGC--VPNEVTYTAVINGLCK 745

Query: 661 VGLYSKAEDIFGEMEE-----------------------------------KGFEKCVVA 720
            G  ++AE +  +M+                                    KG       
Sbjct: 746 AGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLLANTAT 805

Query: 721 YSSLISMYGKTGRLKDAMRVLAKMKERGCEPNVWIYNILMEMHGKAKNLKQVEKLWKEMK 763
           Y+ LI  + + GR+++A  ++ +M   G  P+   Y  ++    +  ++K+  +LW  M 
Sbjct: 806 YNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMT 865

BLAST of MS018773 vs. ExPASy TrEMBL
Match: A0A6J1CX77 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Momordica charantia OX=3673 GN=LOC111015641 PE=4 SV=1)

HSP 1 Score: 1286.2 bits (3327), Expect = 0.0e+00
Identity = 638/644 (99.07%), Positives = 639/644 (99.22%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI
Sbjct: 140 LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 199

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNF LHRL
Sbjct: 200 FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFLLHRL 259

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV
Sbjct: 260 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 319

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE
Sbjct: 320 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 379

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN
Sbjct: 380 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 439

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA
Sbjct: 440 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 499

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG
Sbjct: 500 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 559

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV
Sbjct: 560 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 619

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMSA GLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN
Sbjct: 620 DYFGRMSAFGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 679

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE
Sbjct: 680 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 739

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAPGQVT 645
           ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHA   +T
Sbjct: 740 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAVPSLT 783

BLAST of MS018773 vs. ExPASy TrEMBL
Match: A0A6J1H589 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460688 PE=4 SV=1)

HSP 1 Score: 1175.2 bits (3039), Expect = 0.0e+00
Identity = 567/638 (88.87%), Positives = 608/638 (95.30%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAG+ +GFRHTTE+YCI+VH+LFRARMY NAHDI+KE++LKS+ DL+LPVC +
Sbjct: 143 LALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPVCNV 202

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FDILWSTRN  V GTGVFDVLFSVLVELGLLEEANECF +MRKFRTLPKARSCNF LHRL
Sbjct: 203 FDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLLHRL 262

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLVRKFFHDMIGAGIAPSVFTYNVMID+LCKEGDLENAR LFVQMR MGFSPDV
Sbjct: 263 SKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMRTMGFSPDV 322

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLL+ESVYLFNEMK  GCVPDVITYNALINCFCKFEKMP+AFEYLSE
Sbjct: 323 VTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEYLSE 382

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKN GLKPNVVTYSTLIDAFCKEG+MQGAIKLFVDMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 383 MKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGN 442

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLNIVTYTALMDGLCEDGRM EAEEV+RAMLKDGISPNQQVYTA
Sbjct: 443 LTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTA 502

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAE+MEDA+EILKQMTEC IKPDL+LYGTIIWGLC+QNKLEETKLIIKEMKSRG
Sbjct: 503 LVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRG 562

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I ANPVIYTTIIDAYFKAG+SSDA++LLQEMQ+ G+EATVVTYCVLIDGLCKTG VE+AV
Sbjct: 563 IRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAV 622

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMS  G+QPNVAVYTALIDGLCK NC+ESA+KLF+EMQCRGM PDKTAFTALIDGN
Sbjct: 623 DYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFTALIDGN 682

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQE LNL S+MTEL IEFDLHAYT+LVSGFSQCGELHQARKFF+EM+EKGILP+E
Sbjct: 683 LKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDE 742

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSH 639
           ILCICLL+EY KLG LDEAI+LK+EMQRRGLITEKCSH
Sbjct: 743 ILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSH 780

BLAST of MS018773 vs. ExPASy TrEMBL
Match: A0A6J1FET4 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita moschata OX=3662 GN=LOC111444847 PE=4 SV=1)

HSP 1 Score: 1172.5 bits (3032), Expect = 0.0e+00
Identity = 570/637 (89.48%), Positives = 608/637 (95.45%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAGSQ+GF HTTE+YCI+ H+LF ARMY NAHDIIKEVILK + D++ PVC I
Sbjct: 136 LALKFFKWAGSQIGFCHTTESYCIIAHMLFCARMYTNAHDIIKEVILKCRIDMIFPVCNI 195

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+LWSTRN+ V GTGVFD+LFSVLVELGLLEEANECF RMRKFRTLPKARSCNF LHRL
Sbjct: 196 FDMLWSTRNVCVSGTGVFDILFSVLVELGLLEEANECFSRMRKFRTLPKARSCNFLLHRL 255

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLV+ FF+DMIGAGIAPSVFTYNVMIDYLCKEGDLE+ARRLFVQMRQMGFSPDV
Sbjct: 256 SKSGNGQLVKNFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLESARRLFVQMRQMGFSPDV 315

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLLEESVYLF EMK  GCVPDVITYNALINCFCKFEKMPRAFEYLSE
Sbjct: 316 VTYNSLIDGYGKVGLLEESVYLFKEMKDVGCVPDVITYNALINCFCKFEKMPRAFEYLSE 375

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKN+GLKPNVVTYSTLIDAFCKEG+MQ AIKLFVDMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 376 MKNSGLKPNVVTYSTLIDAFCKEGMMQYAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGN 435

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLN+V+YTALMDGLCEDGRM EAEEV++AMLKDG+SPNQQVYTA
Sbjct: 436 LTEAWKLSNDMLQAGVNLNVVSYTALMDGLCEDGRMMEAEEVFKAMLKDGLSPNQQVYTA 495

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKS+G
Sbjct: 496 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSQG 555

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I+ANPVIYTTI+DAYFKAG+SSDAINLL +MQD G+EATVVTYCVLIDGLCKTG VELAV
Sbjct: 556 ISANPVIYTTIMDAYFKAGKSSDAINLLHKMQDMGVEATVVTYCVLIDGLCKTGMVELAV 615

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYFGRMS +GLQPNVAVYTALIDGLCKTNC+ESAKKLFDEMQ RGM PDKTAFTALIDGN
Sbjct: 616 DYFGRMSDLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQYRGMTPDKTAFTALIDGN 675

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEAL+L SRMT+LAIEFDLHAYTS+VSGFSQCG+LHQARKFF+EM+EKGILPEE
Sbjct: 676 LKLGNLQEALDLISRMTDLAIEFDLHAYTSMVSGFSQCGDLHQARKFFNEMIEKGILPEE 735

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCS 638
           ILC CLLREYYKLGQLDEAIELK+EM+RRGLITE CS
Sbjct: 736 ILCTCLLREYYKLGQLDEAIELKNEMRRRGLITENCS 772
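
The percentages in each HSP summary line follow from the residue counts: for the A0A6J1FET4 hit above, 570 identical residues over 637 aligned positions gives 89.48% identity. The Python sketch below recomputes these values from a summary line. It is an illustration only, not part of the database output; the helper name parse_hsp_summary and the regular expression are assumptions made for this example.

```python
import re

# Hypothetical helper (not part of the database output): parse one HSP
# summary line as printed on this page. The pattern accepts any spelling
# of the "Positives" label that starts with "Pos".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts plus reported and recomputed percentages for one HSP line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = int(ident), int(ident_len), int(pos), int(pos_len)
    return {
        "identical": ident,
        "positive": pos,
        "aligned_length": ident_len,
        "identity_reported": float(ident_pct),
        "positives_reported": float(pos_pct),
        # Percentage = matching positions / aligned positions * 100
        "identity_recomputed": round(100.0 * ident / ident_len, 2),
        "positives_recomputed": round(100.0 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 570/637 (89.48%), Positives = 608/637 (95.45%), Query Frame = 0"
    print(parse_hsp_summary(line))
```

The same function can be applied to any other HSP summary line on this page to check that the reported and recomputed values agree.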

BLAST of MS018773 vs. ExPASy TrEMBL
Match: A0A6J1K035 (putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489432 PE=4 SV=1)

HSP 1 Score: 1158.3 bits (2995), Expect = 0.0e+00
Identity = 562/635 (88.50%), Positives = 603/635 (94.96%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAGSQ+GF H TE+YCI+ H+LF ARMY NAHDIIKEVILK + D++ PVC I
Sbjct: 125 LALKFFKWAGSQIGFCHATESYCIIAHMLFCARMYTNAHDIIKEVILKCRIDMIFPVCNI 184

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+LWSTRN+ V GTGVFD+LFSVLVELGLLEEANECF RMRKFRTLPKARSCNF LHRL
Sbjct: 185 FDMLWSTRNVCVSGTGVFDILFSVLVELGLLEEANECFSRMRKFRTLPKARSCNFLLHRL 244

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLV+KFF+DMIGAGIAPSVFTYNVM+DYLCKEGDLENARRLFVQMRQMGFSPDV
Sbjct: 245 SKSGNGQLVKKFFNDMIGAGIAPSVFTYNVMVDYLCKEGDLENARRLFVQMRQMGFSPDV 304

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVGLLEESVYLF EMK  GCVPDVITYNALINCFCKFEKMPRAFEYLSE
Sbjct: 305 VTYNSLIDGYGKVGLLEESVYLFKEMKDVGCVPDVITYNALINCFCKFEKMPRAFEYLSE 364

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKN+GLKPNVVTYSTLIDAFCK G+MQ AIKLFVDMRRVGL+PNEFTYTSLIDANCKAGN
Sbjct: 365 MKNSGLKPNVVTYSTLIDAFCKGGMMQYAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGN 424

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKLS+DMLQAGVNLN+V+YTALMDGLCEDGRM EAEEV++AMLKDG+SPNQQ+YTA
Sbjct: 425 LTEAWKLSNDMLQAGVNLNVVSYTALMDGLCEDGRMMEAEEVFKAMLKDGLSPNQQLYTA 484

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGT+IWGLCSQNKLEETKLIIKEMKS+G
Sbjct: 485 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTVIWGLCSQNKLEETKLIIKEMKSQG 544

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I+ANPVIYTTI+DAYFKAG+SSDAINLL +MQD G+EATVVTYCVLIDGLCKTG VELA 
Sbjct: 545 ISANPVIYTTIMDAYFKAGKSSDAINLLHKMQDMGVEATVVTYCVLIDGLCKTGLVELAF 604

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYF RMS +GLQPNVAVYTALIDGLCKTNC+ESAKKLFDEMQ RGM PDKTAFTALIDGN
Sbjct: 605 DYFSRMSDLGLQPNVAVYTALIDGLCKTNCIESAKKLFDEMQYRGMTPDKTAFTALIDGN 664

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LKLGNLQEAL+L SRMT+LAIEFDLHAYTS+VSGFSQCG+LHQARKF +EM+EKGILPEE
Sbjct: 665 LKLGNLQEALDLISRMTDLAIEFDLHAYTSMVSGFSQCGDLHQARKFLNEMIEKGILPEE 724

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEK 636
           ILC CLLREYYKLGQLDEAIELK+EM+RRGLITE+
Sbjct: 725 ILCTCLLREYYKLGQLDEAIELKNEMRRRGLITEQ 759

BLAST of MS018773 vs. ExPASy TrEMBL
Match: A0A1S3CT40 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucumis melo OX=3656 GN=LOC103503999 PE=4 SV=1)

HSP 1 Score: 1115.9 bits (2885), Expect = 0.0e+00
Identity = 542/637 (85.09%), Positives = 588/637 (92.31%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LA KFFKWAGSQVGFRHTTE+YCI+VH++FRARMY +AHD +KEVI+K++ D+  PVC I
Sbjct: 146 LALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYTDAHDTVKEVIMKNRIDMGFPVCNI 205

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+LWSTRNI V G+GVFDVLFSV VELGLLEEANECF RMR FRTLPKARSCNF LHRL
Sbjct: 206 FDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANECFSRMRNFRTLPKARSCNFLLHRL 265

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           SKSG GQLVRKFF+DMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMR+MG SPDV
Sbjct: 266 SKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMREMGLSPDV 325

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNSLIDGYGKVG LEE+V  FNEMK  GCVPD+ITYN LINC+CKFEKMPRAFEY SE
Sbjct: 326 VTYNSLIDGYGKVGSLEEAVSFFNEMKDVGCVPDIITYNGLINCYCKFEKMPRAFEYFSE 385

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MKNNGLKPNVVTYSTLIDAFCKEG+MQGA+KLFVDM+R GL+PNEFTYTSLIDANCKAGN
Sbjct: 386 MKNNGLKPNVVTYSTLIDAFCKEGMMQGAVKLFVDMKRAGLLPNEFTYTSLIDANCKAGN 445

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           LTEAWKL +DMLQAGV LNIVTYTAL+DGLCEDGRM EAEEV+R+MLKDGISPNQQVYTA
Sbjct: 446 LTEAWKLLNDMLQAGVKLNIVTYTALVDGLCEDGRMIEAEEVFRSMLKDGISPNQQVYTA 505

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           LVHGYIKAERMEDAM+ILKQM ECNIKPDLILYG++IWGLCSQ+KLEETKLI+KEMKSRG
Sbjct: 506 LVHGYIKAERMEDAMKILKQMKECNIKPDLILYGSVIWGLCSQSKLEETKLILKEMKSRG 565

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I+ANPVIYTTIIDAYFKAG+SSDAINL QEMQD G+EATVVTYCVLIDGLCK G VELAV
Sbjct: 566 ISANPVIYTTIIDAYFKAGKSSDAINLFQEMQDVGVEATVVTYCVLIDGLCKAGIVELAV 625

Query: 481 DYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGN 540
           DYF RM ++GLQPNVAVYT+LIDGL KTNC++SA KLFDEMQCRGM PD TAFTALIDGN
Sbjct: 626 DYFCRMFSLGLQPNVAVYTSLIDGLSKTNCIKSANKLFDEMQCRGMTPDITAFTALIDGN 685

Query: 541 LKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPEE 600
           LK GNLQEAL   SRMTELAIEFDLH YTSLV+GFS+CGEL QARKFF+EM++KGILPEE
Sbjct: 686 LKHGNLQEALVFISRMTELAIEFDLHFYTSLVAGFSKCGELRQARKFFNEMIKKGILPEE 745

Query: 601 ILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCS 638
           +LCICLLREY K GQLDEAIELK+EMQ  GLITE  +
Sbjct: 746 VLCICLLREYCKRGQLDEAIELKNEMQGMGLITESAA 782

BLAST of MS018773 vs. TAIR 10
Match: AT2G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 780.8 bits (2015), Expect = 1.1e-225
Identity = 371/640 (57.97%), Positives = 489/640 (76.41%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           LAFKFFKW+ ++ GF+H+ E+YCIV HILF ARMY +A+ ++KE++L   +      C +
Sbjct: 124 LAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVLSKAD------CDV 183

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+LWSTRN+ V G GVFD LFSVL++LG+LEEA +CF +M++FR  PK RSCN  LHR 
Sbjct: 184 FDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRF 243

Query: 121 SKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPDV 180
           +K GK   V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M+  G  PD 
Sbjct: 244 AKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEMKFRGLVPDT 303

Query: 181 VTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLSE 240
           VTYNS+IDG+GKVG L+++V  F EMK   C PDVITYNALINCFCKF K+P   E+  E
Sbjct: 304 VTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKLPIGLEFYRE 363

Query: 241 MKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGN 300
           MK NGLKPNVV+YSTL+DAFCKEG+MQ AIK +VDMRRVGLVPNE+TYTSLIDANCK GN
Sbjct: 364 MKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSLIDANCKIGN 423

Query: 301 LTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTA 360
           L++A++L ++MLQ GV  N+VTYTAL+DGLC+  RM EAEE++  M   G+ PN   Y A
Sbjct: 424 LSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGVIPNLASYNA 483

Query: 361 LVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRG 420
           L+HG++KA+ M+ A+E+L ++    IKPDL+LYGT IWGLCS  K+E  K+++ EMK  G
Sbjct: 484 LIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKVVMNEMKECG 543

Query: 421 INANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAV 480
           I AN +IYTT++DAYFK+G  ++ ++LL EM++  IE TVVT+CVLIDGLCK   V  AV
Sbjct: 544 IKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLCKNKLVSKAV 603

Query: 481 DYFGRMS-AVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDG 540
           DYF R+S   GLQ N A++TA+IDGLCK N VE+A  LF++M  +G+ PD+TA+T+L+DG
Sbjct: 604 DYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDRTAYTSLMDG 663

Query: 541 NLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPE 600
           N K GN+ EAL L  +M E+ ++ DL AYTSLV G S C +L +AR F +EM+ +GI P+
Sbjct: 664 NFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEEMIGEGIHPD 723

Query: 601 EILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHA 640
           E+LCI +L+++Y+LG +DEA+EL+S + +  L+T    +A
Sbjct: 724 EVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNA 757

BLAST of MS018773 vs. TAIR 10
Match: AT2G01740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 390.6 bits (1002), Expect = 3.2e-108
Identity = 207/547 (37.84%), Positives = 318/547 (58.14%), Query Frame = 0

Query: 90  LLEEANECFLRMRKFRTLPKARSCNFFLHRLSKSGKGQLVRKFFHDMIGAGIAPSVFTYN 149
           ++ EA +   R+RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 150 VMIDYLCKEGDLENARRLFVQMRQMGFSPDVVTYNSLIDGYGKVGLLEESVYLFNEMK-- 209
            ++ ++CK G ++ A  +   M + G  PDV++YNSLIDG+ + G +  +  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 210 -GAGCVPDVITYNALINCFCKFEKMPRAFEYLSEMKNNGLKPNVVTYSTLIDAFCKEGIM 269
            G  C PD++++N+L N F K + +   F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVML-KCCSPNVVTYSTWIDTFCKSGEL 180

Query: 270 QGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAGNLTEAWKLSHDMLQAGVNLNIVTYTAL 329
           Q A+K F  M+R  L PN  T+T LID  CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 330 MDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYTALVHGYIKAERMEDAMEILKQMTECNI 389
           +DG C+ G M  AEE+Y  M++D + PN  VYT ++ G+ +    ++AM+ L +M    +
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 390 KPDLILYGTIIWGLCSQNKLEETKLIIKEMKSRGINANPVIYTTIIDAYFKAGESSDAIN 449
           + D+  YG II GLC   KL+E   I+++M+   +  + VI+TT+++AYFK+G    A+N
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 450 LLQEMQDAGIEATVVTYCVLIDGLCKTGKVELAVDYFGRMSAVGLQPNVAVYTALIDGLC 509
           +  ++ + G E  VV    +IDG+ K G++  A+ YF    A     N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYFCIEKA-----NDVMYTVLIDALC 420

Query: 510 KTNCVESAKKLFDEMQCRGMAPDKTAFTALIDGNLKLGNLQEALNLSSRMTELAIEFDLH 569
           K       ++LF ++   G+ PDK  +T+ I G  K GNL +A  L +RM +  +  DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 570 AYTSLVSGFSQCGELHQARKFFDEMVEKGILPEEILCICLLREYYKLGQLDEAIELKSEM 629
           AYT+L+ G +  G + +AR+ FDEM+  GI P+  +   L+R Y K G +  A +L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 630 QRRGLIT 634
           QRRGL+T
Sbjct: 541 QRRGLVT 541

BLAST of MS018773 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 383.6 bits (984), Expect = 3.9e-106
Identity = 236/818 (28.85%), Positives = 401/818 (49.02%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFR--HTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVC 60
           LA KF KW   Q G    H  +  CI  HIL RARMY  A  I+KE+ L S         
Sbjct: 92  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF---- 151

Query: 61  KIFDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLH 120
            +F  L +T  +      V+D+L  V +  G+++++ E F  M  +   P   +CN  L 
Sbjct: 152 -VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 211

Query: 121 RLSKSGKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSP 180
            + KSG+   V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M + G++P
Sbjct: 212 SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 271

Query: 181 DVVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYL 240
            +VTYN+++  Y K G  + ++ L + MK  G   DV TYN LI+  C+  ++ + +  L
Sbjct: 272 TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 331

Query: 241 SEMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKA 300
            +M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID +   
Sbjct: 332 RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 391

Query: 301 GNLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVY 360
           GN  EA K+ + M   G+  + V+Y  L+DGLC++     A   Y  M ++G+   +  Y
Sbjct: 392 GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 451

Query: 361 TALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKS 420
           T ++ G  K   +++A+ +L +M++  I PD++ Y  +I G C   + +  K I+  +  
Sbjct: 452 TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 511

Query: 421 RGINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVEL 480
            G++ N +IY+T+I    + G   +AI + + M   G      T+ VL+  LCK GKV  
Sbjct: 512 VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 571

Query: 481 AVDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALID 540
           A ++   M++ G+ PN   +  LI+G   +     A  +FDEM   G  P    + +L+ 
Sbjct: 572 AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 631

Query: 541 GNLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILP 600
           G  K G+L+EA      +  +    D   Y +L++   + G L +A   F EMV++ ILP
Sbjct: 632 GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 691

Query: 601 EEILCICLLREYYKLGQLDEAIELKSEMQRRGLITEKCSHAPGQVTYASAINAYCRVGLY 660
           +      L+    + G+   AI    E + RG +       P +V Y   ++   + G +
Sbjct: 692 DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNV------LPNKVMYTCFVDGMFKAGQW 751

Query: 661 SKAEDIFGEMEEKGFEKCVVAYSSLISMYGKTGRLKDAMRVLAKMKERGCEPNVWIYNIL 720
                   +M+  G    +V  +++I  Y + G+++    +L +M  +   PN+  YNIL
Sbjct: 752 KAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNIL 811

Query: 721 MEMHGKAKNLKQVEKLWKEMKRRKIAPDKVSYTSIISAYVKAKEFETCERYYVEFRMNGG 780
           +  + K K++     L++ +    I PDK++  S++    ++   E   +    F   G 
Sbjct: 812 LHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGV 871

Query: 781 AIDKAMAGIMVSVFSKTSRVDELVKLLREMNLEGTRLD 817
            +D+    +++S       ++    L++ M   G  LD
Sbjct: 872 EVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGISLD 898

BLAST of MS018773 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 370.9 bits (951), Expect = 2.6e-102
Identity = 203/636 (31.92%), Positives = 341/636 (53.62%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           L   FF WA S+       E+ CIV+H+   ++    A  +I     + + ++     + 
Sbjct: 103 LVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSFVQF 162

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+L  T   +     VFDV F VLV+ GLL EA   F +M  +  +    SCN +L RL
Sbjct: 163 FDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRL 222

Query: 121 SKS-GKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPD 180
           SK   K       F +    G+  +V +YN++I ++C+ G ++ A  L + M   G++PD
Sbjct: 223 SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPD 282

Query: 181 VVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLS 240
           V++Y+++++GY + G L++   L   MK  G  P+   Y ++I   C+  K+  A E  S
Sbjct: 283 VISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFS 342

Query: 241 EMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAG 300
           EM   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+  TYT++I   C+ G
Sbjct: 343 EMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIG 402

Query: 301 NLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYT 360
           ++ EA KL H+M   G+  + VT+T L++G C+ G M +A  V+  M++ G SPN   YT
Sbjct: 403 DMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYT 462

Query: 361 ALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSR 420
            L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC    +EE   ++ E ++ 
Sbjct: 463 TLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAA 522

Query: 421 GINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELA 480
           G+NA+ V YTT++DAY K+GE   A  +L+EM   G++ T+VT+ VL++G C  G +E  
Sbjct: 523 GLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDG 582

Query: 481 VDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDG 540
                 M A G+ PN   + +L+   C  N +++A  ++ +M  RG+ PD   +  L+ G
Sbjct: 583 EKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKG 642

Query: 541 NLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPE 600
           + K  N++EA  L   M        +  Y+ L+ GF +  +  +AR+ FD+M  +G+  +
Sbjct: 643 HCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLAAD 702

Query: 601 EILCICLLREYYKLGQLDEAIELKSEMQRRGLITEK 636
           + +        YK  + D  ++   E+    L+ E+
Sbjct: 703 KEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDEQ 736

BLAST of MS018773 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 370.9 bits (951), Expect = 2.6e-102
Identity = 203/636 (31.92%), Positives = 341/636 (53.62%), Query Frame = 0

Query: 1   LAFKFFKWAGSQVGFRHTTEAYCIVVHILFRARMYANAHDIIKEVILKSQNDLVLPVCKI 60
           L   FF WA S+       E+ CIV+H+   ++    A  +I     + + ++     + 
Sbjct: 103 LVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSFVQF 162

Query: 61  FDILWSTRNIFVLGTGVFDVLFSVLVELGLLEEANECFLRMRKFRTLPKARSCNFFLHRL 120
           FD+L  T   +     VFDV F VLV+ GLL EA   F +M  +  +    SCN +L RL
Sbjct: 163 FDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRL 222

Query: 121 SKS-GKGQLVRKFFHDMIGAGIAPSVFTYNVMIDYLCKEGDLENARRLFVQMRQMGFSPD 180
           SK   K       F +    G+  +V +YN++I ++C+ G ++ A  L + M   G++PD
Sbjct: 223 SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPD 282

Query: 181 VVTYNSLIDGYGKVGLLEESVYLFNEMKGAGCVPDVITYNALINCFCKFEKMPRAFEYLS 240
           V++Y+++++GY + G L++   L   MK  G  P+   Y ++I   C+  K+  A E  S
Sbjct: 283 VISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFS 342

Query: 241 EMKNNGLKPNVVTYSTLIDAFCKEGIMQGAIKLFVDMRRVGLVPNEFTYTSLIDANCKAG 300
           EM   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+  TYT++I   C+ G
Sbjct: 343 EMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIG 402

Query: 301 NLTEAWKLSHDMLQAGVNLNIVTYTALMDGLCEDGRMTEAEEVYRAMLKDGISPNQQVYT 360
           ++ EA KL H+M   G+  + VT+T L++G C+ G M +A  V+  M++ G SPN   YT
Sbjct: 403 DMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYT 462

Query: 361 ALVHGYIKAERMEDAMEILKQMTECNIKPDLILYGTIIWGLCSQNKLEETKLIIKEMKSR 420
            L+ G  K   ++ A E+L +M +  ++P++  Y +I+ GLC    +EE   ++ E ++ 
Sbjct: 463 TLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAA 522

Query: 421 GINANPVIYTTIIDAYFKAGESSDAINLLQEMQDAGIEATVVTYCVLIDGLCKTGKVELA 480
           G+NA+ V YTT++DAY K+GE   A  +L+EM   G++ T+VT+ VL++G C  G +E  
Sbjct: 523 GLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDG 582

Query: 481 VDYFGRMSAVGLQPNVAVYTALIDGLCKTNCVESAKKLFDEMQCRGMAPDKTAFTALIDG 540
                 M A G+ PN   + +L+   C  N +++A  ++ +M  RG+ PD   +  L+ G
Sbjct: 583 EKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKG 642

Query: 541 NLKLGNLQEALNLSSRMTELAIEFDLHAYTSLVSGFSQCGELHQARKFFDEMVEKGILPE 600
           + K  N++EA  L   M        +  Y+ L+ GF +  +  +AR+ FD+M  +G+  +
Sbjct: 643 HCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLAAD 702

Query: 601 EILCICLLREYYKLGQLDEAIELKSEMQRRGLITEK 636
           + +        YK  + D  ++   E+    L+ E+
Sbjct: 703 KEIFDFFSDTKYKGKRPDTIVDPIDEIIENYLVDEQ 736

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022146419.1 | 0.0e+00 | 99.07 | putative pentatricopeptide repeat-containing protein At2g02150 [Momordica charan...
KAG6601913.1 | 0.0e+00 | 89.50 | putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
KAG7032608.1 | 0.0e+00 | 89.34 | putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros...
XP_023534824.1 | 0.0e+00 | 89.03 | putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 [Cucur...
XP_038906984.1 | 0.0e+00 | 89.83 | putative pentatricopeptide repeat-containing protein At2g02150 [Benincasa hispid...

Match Name | E-value | Identity | Description
P0C894 | 1.5e-224 | 57.97 | Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
Q9ZUA2 | 4.5e-107 | 37.84 | Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana OX...
Q9LVQ5 | 5.5e-105 | 28.85 | Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Q0WVK7 | 3.7e-101 | 31.92 | Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9FJE6 | 2.4e-92 | 27.02 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...

Match Name | E-value | Identity | Description
A0A6J1CX77 | 0.0e+00 | 99.07 | putative pentatricopeptide repeat-containing protein At2g02150 OS=Momordica char...
A0A6J1H589 | 0.0e+00 | 88.87 | putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cuc...
A0A6J1FET4 | 0.0e+00 | 89.48 | putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita mosc...
A0A6J1K035 | 0.0e+00 | 88.50 | putative pentatricopeptide repeat-containing protein At2g02150 isoform X1 OS=Cuc...
A0A1S3CT40 | 0.0e+00 | 85.09 | putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucumis melo O...

Match Name | E-value | Identity | Description
AT2G02150.1 | 1.1e-225 | 57.97 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G01740.1 | 3.2e-108 | 37.84 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1 | 3.9e-106 | 28.85 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G05670.1 | 2.6e-102 | 31.92 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2 | 2.6e-102 | 31.92 | Pentatricopeptide repeat (PPR-like) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
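IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 673..724, e-value: 4.3E-12, score: 45.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 280..311, e-value: 5.4E-9, score: 35.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 640..669, e-value: 1.7E-5, score: 24.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 79..104, e-value: 0.26, score: 11.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 748..777, e-value: 0.0033, score: 17.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 567..596, e-value: 9.0E-7, score: 28.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 787..812, e-value: 0.91, score: 9.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 605..631, e-value: 0.0074, score: 16.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 392..421, e-value: 0.0039, score: 17.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 357..389, e-value: 9.2E-8, score: 29.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 216..250, e-value: 4.4E-8, score: 30.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 286..319, e-value: 1.5E-5, score: 22.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 678..712, e-value: 8.2E-10, score: 36.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 426..459, e-value: 1.0E-7, score: 29.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 714..746, e-value: 1.9E-7, score: 28.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 497..529, e-value: 1.2E-9, score: 35.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 748..777, e-value: 1.6E-4, score: 19.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 567..598, e-value: 4.6E-7, score: 27.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 321..354, e-value: 2.1E-10, score: 38.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 146..180, e-value: 1.6E-10, score: 38.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 251..285, e-value: 1.1E-8, score: 32.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 181..215, e-value: 1.8E-11, score: 41.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 643..674, e-value: 5.3E-8, score: 30.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 461..495, e-value: 8.2E-9, score: 33.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 213..262, e-value: 4.0E-19, score: 68.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 143..190, e-value: 6.5E-17, score: 61.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 424..472, e-value: 4.9E-15, score: 55.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 319..364, e-value: 5.6E-12, score: 45.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 493..539, e-value: 5.4E-14, score: 52.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 284..318, score: 11.32308
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 424..458, score: 11.4875
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 676..710, score: 12.901507
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 249..283, score: 12.901507
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 144..178, score: 13.723605
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 599..633, score: 8.933517
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 641..675, score: 11.673842
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 354..388, score: 11.761533
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 319..353, score: 13.734567
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 459..493, score: 11.882107
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 494..528, score: 12.397287
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 564..598, score: 12.967276
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 711..745, score: 10.972319
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 214..248, score: 12.945353
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 389..423, score: 9.711769
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 179..213, score: 13.822257
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 558..631, e-value: 2.2E-14, score: 55.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 415..485, e-value: 6.3E-20, score: 73.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 486..557, e-value: 1.3E-19, score: 72.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 137..206, e-value: 1.0E-23, score: 85.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 1..136, e-value: 5.2E-12, score: 47.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 279..346, e-value: 2.7E-20, score: 74.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 347..414, e-value: 9.5E-18, score: 66.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 207..278, e-value: 7.5E-26, score: 92.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 724..828, e-value: 1.5E-17, score: 65.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 632..723, e-value: 2.4E-25, score: 91.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 285..519
None | No IPR available | PANTHER | PTHR45613 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 639..815
None | No IPR available | PANTHER | PTHR45613:SF358 | OS06G0565000 PROTEIN | coord: 639..815
None | No IPR available | PANTHER | PTHR45613:SF358 | OS06G0565000 PROTEIN | coord: 3..636
None | No IPR available | PANTHER | PTHR45613 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 3..636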
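
Each alignment entry above gives start and end residue positions in the form "coord: start..end". Assuming these are 1-based and inclusive, as is usual for InterProScan output, the length of a hit is end - start + 1, so a PROSITE PS51375 hit such as 284..318 spans 35 residues. The Python sketch below extracts the coordinates from the annotation text and computes the hit lengths. The function name hit_lengths is hypothetical and the sample string is a shortened excerpt of the entries listed above.

```python
import re

# Matches entries of the form "coord: 284..318" as they appear on this page.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def hit_lengths(text):
    """Return sorted (start, end, length) tuples for every 'coord: a..b' entry.

    Coordinates are assumed to be 1-based and inclusive, so the length of a
    hit is end - start + 1.
    """
    hits = []
    for m in COORD_RE.finditer(text):
        start, end = int(m.group(1)), int(m.group(2))
        hits.append((start, end, end - start + 1))
    return sorted(hits)


if __name__ == "__main__":
    # Shortened excerpt of the PROSITE PS51375 entries listed above.
    sample = "coord: 284..318 score: 11.32308 coord: 144..178 score: 13.723605"
    for start, end, length in hit_lengths(sample):
        print(start, end, length)
```

Because the function only looks for the "coord:" pattern, it can be run over the whole annotation block regardless of how the other columns are laid out.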

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MS018773.1 | MS018773.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding