Sgr013451 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr013451
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Location: tig00153871: 44055 .. 65179 (+)
RNA-Seq Expression: Sgr013451
Synteny: Sgr013451
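
The Location field above packs the contig name, start, end, and strand into one string. A minimal sketch of pulling those parts apart and deriving the gene span is shown here. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (common for GFF-style displays, but not stated on this page).

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its parts.

    Hypothetical helper for illustration; assumes 1-based, inclusive
    coordinates, which this record does not explicitly confirm.
    """
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "contig": contig,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive span: both endpoints count toward the length.
        "length": end - start + 1,
    }

loc = parse_location("tig00153871: 44055 .. 65179 (+)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span works out to 21,125 bp on the plus strand of tig00153871.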
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGAATCGACGCTCACCTATGAGGAATGCCGACGCCAGAGGTTGGAGGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAGCTTGCCGATGCGCTAAAAGCTTCCAGCCCTAAGTCCTCTCCGGTAGGTTTCCGTTGTTCGGAATACTTTCTGTTGCAATATCTTCCAGCATGATAATGGATTTGAAGTTTGATCTCTCTTTCTCTCTTTGGGTTCTGGAAGACGAAACAGCTAAAGCGTCCTCGCCAGCCACTCGATATCACGTCCGTCCGTGTGAGAAGGTCCAGCCGTTTTGCGGATAAGCCCCCTCCGAACTATAAGGAGGTAAGCTTTGAGCTCTCCGTTTTGAATATGACACCATTTTTTGATTGGAAATGTTCAACGGGAAAAGTAATGCTTCAGTTCCTAAACAGTAACCAGTTTTGATGTTAGTTATTTTTGTTCTCACTGAAATGAGAAATGGCACACTAATTTAAGCATTTCTCTTCTTCCCCCCACCCCACAAAATTGCTATTGAAATTCTTGCACAAATTGTAACGAATATTTGAAAATTTTAATCAATATTATGAAGAACTTTTTGAATATCGATACCACAACAACCTCGATCATTGGGACCTGACTGAAAAAAGTAGCTTCTTGTTTTTTGAGTGGTCCTTGTTCTTTCTTGGTGGGCTTTCACAATGCAGCTTAATCCTCCGATCTTATTGGCCTTTGCAGGTCCCCATTGAACCACTTCCAGGTGTCAGAAGGTATGCACGAAATTCAGTTTCAAACTACTTTATCTTGGATCGTTTTTCTAACTTCAAGAGCAACAATAATTGAACTAATAGGATCTATCAACGGAGAGATTTGTTGAATCGAGTTTATGCTTCACACGAAGAAAGACGATATGCTATTGACAGAGCAGAAGAACTTCAGTCTAGCCTGGAATCTAGGTACCCAAGTTTCGTGAAGCCCATGCTTCAGTCACATGTCACCGGGGGATTTTGGCTGGTCAGTTTATGATATTTCTGATCAAACTGCTTGTCTTTTAGAAGATCGTTTTACTGAATTTGTAAGCTTTGATTGGTTATATAAAGCCGATAATTTGATTAAGTATATTAGGTGCCCAGTGCACTGATATTGTCTTGAGAAATTTTGTCTAGGGTCTTCCAGTTCAATTTTGCAAGACACACCTTCCCCATCATGATGAAATGGTTACTCTGGTGGATGAGGATGCGAATGAGTTCCAAACAAAATACCTTGCGGAGAAAACGGGTCTTAGCGGTGGTTGGAGAGGGTTTTCACTTGATCACCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTGATTACATTGAAAATTTTTTTCCTGATTGAAATATTCTGCATTGGCCAAGCATCTTGTCATGTGACTCGAATTTTATGGGTATATGCACAATTTTTCTAGCACATTTAATTTGAGGTTGTGCGCACAGTTGTATGGTATATTACATATCGCATACTTCATATCCAAGAGAGTTCAATTCAAGACACTTGAACAGTATTAGCTTCTTCAAATCTATAGGAGAAGATAAAACTTTCTATTGCCATGGGCAGCACCTAAAGAGAAGGGGAGAAGTACATTTTGGTAGTGTAGGGGAGGCAGCATGGATACTCGAGACGGTTTGTGAAGATTCTTGAGCATTTCTGTATTTCTAGGTCTAACTCATTTTTTTCCTTCAAGAAGCATGATGGAACTGACTTAAAATTGTTGATAATAAAACGAGGCCTTAGGGTGGGTGAAGCCTCCCCTGGACTAGAAGGGGTGGGCTATGACCATAATGCAAGGAATGGTGCCGGTTTATTAAAAAAAAAAGAAGTCGACAGAATGATTATCCTTTCTTTTTTCATGGCACTAATCATTTGTCAAGTTCAAAACCTTGGAATCTCACAGAAATGACTGAAAAAACTACTTCCTCTTTCATCTCTTGCTCATAATGGAAAT
CCTGTCTGAAATTTGAGTAATGAATGTCTTTACTAATGACCTTAGCTTTGCCACTCTTAGGTATATATCATCAGGGCATATAATTCCGAAGACAAAGCAGACACCAACGAGGATTCCGATGTCTCTCAACGGGAAAGTAGTGGCAAAAGAATTACTAGATCAGCAAGTAAAGGTTTCATTTTCGTATAGACCTTCTAGTATTATTGTATCATAATCAGCTTTCAATTTTCATTTCTCAATTTCACTGGTAATTTGTTCCTTTTTCTGGCAGGGAGAAAGTAACTCAAGCGTTTGCCTCCCACATGTTTTATTAGAAGTATCCAAGGCTTTTTGACAAAAAATGGCAGCTTTCAGGATATTTTGTCAGTCTTAGCTCTCTCTGGACTGATGTAACATTAGGGCGGATATGTTAATGAAATGATTCCCCCTTCTTTTGATGATCTGCAGCAACACCTCTCTCTCTTTCTCTGTTGAGTTCTGGATGACTCAGCTTCTGGAAATCATGCTTAAAATTTGCAGGTTAGCCAAAGAAAAAAGAACTTGTTTCATTTCACTGTACTTAACCGTTCATTATTAATGTCTACTATGGAGGGTAAATATGACTTCATCAATCTTGGGCTATCAAATTTCTCTGCAACTGATAAATTCACTAGGTTGAACTTAGAAGTAGAACGTGCAATATGAGAGATTGGGGGGAAAACTTGCAGTGTTTATTTATTTTATTTATTTTCTAAATATTTAAAACAAGGGATAAATTAGTATTTGTGCTTTCAAAATAAGGGTAGGGTTAAATTGTATTAAAAGTTCAATTTACCCTAAAAACGAAGTTCATTTTTAGATGTTATGAAAATAAGAATTCTAAAATAAGAATGTTGTTAATGTTTTAAACATTTAAATAATGCAAAGTTATTGCATATTTTGATTTCTATTTTGCACAGTATAAAACTTTTTGAAGATAGTTCTGTCAAGGTAGTTCTTATGAAATTGTTTGTTTGTTCTAAATATTAAAAAAATGTGCATGTATTAATATGGGGCAACCTTCAAAAATGAAAAAAAAAATATATATATATATATTGGTCCAACAATTAAAAGTCAAGTTATTAATGCATTCTACTTATTCGATTGTTTATATAATTCTTAATTTAAAGAAATTTTGCTCATAAGTGGCTCGAAATCAAGAGAGGATACAAAAATAGTATAAACGGTAGGGGATGCTATAAATTTGGATTGAGCCTTTCTCGCAGTCTAAGAGTAGAGAAAGGAGCACTTTAAATCAACGAGTCATTTACTATGACAAAAGAAATCTTTGCTCATATTTTTCAAACAAATTCTTCTTTAAAAATATCTTTTAAAGGTGAATAATGAGACTATAAATTCATCTAATCAATTAAAATCAATGTGACTTTGTCCCACTTTAAAGGAATATTATAAAGGGAGCAAGGAAAATAAAATATTTGATGTTATATTTTATTTATCATGATTTAATATTTTACTAGGGCTTGAGCATAACTCAGCGGTTAATGTATCTATACCTTTTCTAAAGATTGTTGATTCTCACACCCCACATTTTTTGTACTAAAAAATATATATAAATGTATATATTATCATTAAATTTTTAATTTACTAAAATATTAGTTAATAATTGGGCCACTTCCTAAAGCACTTATACATTTCTTAAAGGTCAAAGGTTCGATTCCCACCCCCATATTTGTTGTACTAAAATATATATATATATTTTTACCATTAAATATTTAATATTACTAAAATATTATTTAATAATTGGGCCACTTTTCAAAATTGGGCCCAGGTCTGAGCCCACACCTGAGCCGAATGTAAGTTTCTGGAAGATTGCTGTCTAGTTTTCCATCTTCTGCTATCCTCCTTAAGCTTGCTGTAACAGAAGTTTCTCTGAACATTTCCCTAGGGTTTAGGCCTTTTCAGATCAGAAATGGCGTCTCTCATGACGGTCCGGCGTGCTCGAAGCCCCATACTCATCTCCT
CATTCTTCAAGGTACGGTCTCCACTCCCTTCTCGTTTCACGTTCTCTTGTGGAAACCAGACAGAGACTCTGATCAAAGCCCTAAGCACCTCTGCGGTCCCTAATGATTACTCAAATTTTCCTCCTCCACCACAACAACATCCTTCGTCTGATCCTAGAACTCTTCAGGGCCGGGAAACGCCTGGCCAGTGGGGCACGCCAAGCCAGGTTCATCATCGAGTTGGAAACTTTAATAACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAGCCCTAGTAATCAATTGAATTATCAGAATCAGAACCAAAGCTCTCATCCGAATCCTGGATTTGCCAGGCAGGGTCAGAGCTATACTCAAGCCGGTAACCCTAATTCATGGAATCCTCCGAACCAGAGCTACCCGCAGTATCAAAATCCTTCACAGCCGAACACTCAAAATTTCAATTATCAGCAACAAAGAGGCCCAAACCAATGGAACAATCAAATCAGGGATACCCACAATTTGGAAAGCCTCCGCAGCGGAACCCACAAGTTGAGAATTCTAATCAGCCGAATAATCAGGTTGGGATTCAAGGACACGGTGCTCAAAATCAAGCATCAAATGCCCTTGTATCTCCTATCGATGAACTGCGGCGCTTTTGTGGAGAGGGAAAGATTAAAGAAGCTGTTGAATTGTTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATGTGTTGTTTGAACTATGCGGGAAATCAAAGTCATTTGACAATGCAAAAATAGTTCATGATTACTTCTTACAGTCAACTTATAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGAAAATGTGGAAGCATGAGTGATGCACGGAGAGTGTTTGACCATATGCCTGATAGGAATATTGATTCTTGGCATTTGATGATAAAAGGATATGCGGATAATGGATTTGGTGATGAGGGGCTAGAGTTATTTGAGAATATGAAGAAGCTGGGATTGCAACCCAATTCACGAACTTTCCTTTTTATAATGTCAGCTTGTGCTAGTGCGAGTGCTGTAGAAGAAGGATTTATGTACTTTGAATCGATGAAAAATGATTATCATATCACCCCAGACATGGATCATTATTTAGCGCTTTTAGGTGTTCTTGGAGAACCTGGACACATCAACGAGGCTTTCGAGTATGTTGAAAAACTGCCCATGGAACCCACAGTTGAAATCTGGGAGACTTTGAAAAACTATGCTAGAATTCATGGAGATGTTGATCTTGAGGACTACGCCGAGGAGCTAATTGTTGCTCTGGACCCGACAAAAGCTGCTTCTAATAAGATAACCACACCACCTCCCAAAAAACGGTCTGCCATTAGCATGCTTGATGGGAAGAACAGGATTGGTGAGTTCAGAAATCCGACTCTCTACAAAGATGATGAGAAGCTAAAGGCTTTGAAGGCAATGAAAGAACAAGGTTATGTGCCAGATACTAGATATGTACTTCATGATATCGATCAGGAGGCCAAAGAGCAGGCATTGTTGTATCATAGTGAACGATTGGCTATTGCATATGGCCTGATCAGTACCCCAGCACGGACACCTCTTAGGATCATTAAGAACTTACGGATCTGTGGTGACTGTCACAATGCGATTAAGATCATGTCTAGGATTGTTGGGAGAGAGTTGATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGGTAAATGTTCTTGTGGGGATTACTGGTGAACCCATCATTCTTCAACACTCCACAACATGTACTTGATGAGTTTATCCTCGCAATAGCAGCTTGATGTAAAGTGTTAAACCATCAAAATCATTATGCGAACCCAGGGACAAGGTACCGATCGGAGATGAGCTAGCCATTGATTTATAATGTACGGTCTGCTTATCCTGATGTTTAATTGTTCGTCTTATTATTATATTTTTTTTTCTTCTTTTCCTGCTGTTCCCTGTTGAACTGAATTGAAAAGTG
CCAGTTTCCTTCTCCCTGAAAGATGACTTGTAAGCTTTTAGCAAAAGAAAACATCCGGTGGCATACCTTGAATTCTTTCTTAAATTTTTCCTGGTTCCAAAGGTGGTAGATTGTCAAACAAATCAAGAGAGATGCTTTGGATTTGTTAGTAGGTATAGTAATTTCTTTTATCCATCTCTTCTATCTAGTAGATAGAGTCCTGCAAAGTTTGCATATAATTCCTTCTTTAGCATCCATTTCTGATTCTCTTTACTTTACACTACAGTTCTAAAGAGCATGAACAGTACTTTCTGTTGCGTTTCTATAAATATTACAGGTCGCATCCTCTATGATTCTCTTCTTCCACTGAGTAGGGATTGTATAACTGGACGATCTCCCCACAAAAATTTTACATTTTGTTGGTGCACTGAGTTTCCATGGTCTTCCACCAAGTCTATTGTAACAGTTTAACCATGTATGGGAAGTTTTTCCTTTCCTTGTTCATTTACTGCAAGTCTGTAGCTGTTCTTTAATTATGGGTTTAAATCTTATATTAATCCATTTATTTAATTTTGCTCTTCAAACTATAATTGGACTCTTGGTATCGAATCTTCAATCTTAAAAAAGTAATAGATGTCTCATTTATCGAGCATGATTAGCCTATTTCTTTTTATCAATTATTTTAAGAATATCGTTAGTGCTCACCTGTTTTGAAAAATGACCTTATGCTTGATTGCCATGTTTGAAAAATGATATTAGAGCATCTAATGAAAATGGCAAACAGTGGAATTTTGACTTGTTTACAAAACTATTATGCAAAAATATTTTTCTATGAGTTCTATTTAGGAAAATTAAATGGAATATGAAGTAATACCTTATGCTTGATTGACGCGATTATAAAAAGACATTAGGACATTTAATGATAATGGTAAACAATGGAATTTGACTTGTGTACAAGATTCCTATGCAATAGAATCTCCCCCTAAGATTTCCTCTATTTATAAGAACAACTCTTGGATTAAAGGATGAAGGCCAATCTATCATCTCCATTAGAGACTTAGAGTTGGAGGTGCTGAGTTAGCTTTGGTTTTGAGAGTAACAAAAAGATCCAAGTAAAAGAGAATCCATGAACCGCTATGGACCTTTCAAGCTAGCTACTTGCATGGACCGAAAGTTAGTTTGAATTATATTTGTTGGTTGATTTTGCTTTTTAAATTATAAAATCGAAAAGAATAGTTGGTCAATTCAGTTCGATTCTCATAAAGTTTGGTTTCAGTGTTGAACTGATTCAATTTTATCCACTTGCACCCTTAAAATGCAAATTTACCTCATAGTGTAGGTTAGTGTATTTCAAAGTGAAGAAGTTTTTTATTACTTTTTCTCGAAGAAATTAATTTTGTATTTGATATAGGGTTTATTTATTTCAAGTTAGGATTTTGCTTACAAATTAAATAATTGTAATTCATTATTTGCAAAATTTAACACACAAATTCACAAGTTTGTGCTTTTGAACTTTAGCCTTTTTAAATTAAATTTTATTTCAATTGTGAGCTAAAACATTTCCTAAATATACTTTTAGAATGAAATGACTTTTAAACCTACCAATTTCTTTTAGATGTTTAAATGCTTTTTAAAACATGTCAGCCAAACATAATAAATATGTTATGATATTGAAACTTTTAAAACAAAACAATTTATGTGACAAAAAATATATTTCTTTTTTTGCCCTTTCGTAATACCATCTATTATTATTTTGAAATTACGATAACCATTTTCATCATTTTTCATGAAAAACTGACTGAAAGTTTATTATTTTTGTTAAATTAAAAATTTAGTCTTTATAATTTGTGCATAGTCTTAATTTAGTCCTTATGATTTTAAAAGTTTCAATTTAGTCCCTATAATTTAGACAAATCCCACAAATCGTCCTTGTTGTTACCATGTTCACATTTTTTTACATGATAATGAGCTTATATGTTCTATTTGAGTTATTATGTTAGTTGACTATATTAAAGTTTAAA
ATTGAGTAAAAAATAAAAAGAAATAGGAGAGTTTTTCGAGTGGATGAAAGACAATTTATCGACAATTTGTGAGGACTAAATTGAAACTTTTAAAATTCTATAGTAACCAAGTTGAAACTAAACTTAATCACAAAATCTACATTAAAAGTTTTAAAATTATAGAGATTAAATTAAAATTGCGCCCAAACTAAAAAATTAAATTTATAATTTAATCCTTTTGTTCCCATAAAAAAAAAAAAGGTTAAAGACTTAAAAATGAAATGAGAGTAGCTGCTTTAGTTGCAAATTAACACAAAGGCCTCTCTTTACTTTATACCACACTGTTACAATCCACAATCAAACTCCATTAAAGAACGAGTCTCAACTCAGTGTTTGAACTCAACCCCTAAACATACTTTCCAGAGCGAGTCATCATGGTCATGAACTCGTCGATGTCCCCCATGCCGTCGCCGTCGCTGTCCACCGCCCTCACCATCCGCCGGCAGTCCTCAAGGCTGCAGCTCTCCCCCAACCCCTCAACACCCTCATCACCTCCTCCGCACTTATCTTCTTATCGCCGTTCAAATCGAACGTCCTGAAAGCAAACTCCACGTCCCTCGCCTGAACTCCCCCGCCGTTCCGGTAAACCTCCATGAACTCGCTCAGGTTTATGTACCCATCGCCGTCGGCGTCCACCGCCCTGAATATCTTCTGCACCTCTTCCATAGAGTTCCCTCTTCCCAGCGCCTTCAGAATCCCCCTGTACTCGTGCTTCGAGATCCTCCCGTCTCTGTTCGAGTCGAACTTGTTGAAGATCTGCTTTATCTCCTCCGCGCTCGGCTGCAGCGCCCTTCTCAGGCCGGAGCTCTGCCGGTCCTTGAACGAAAACAGCCGGGAGGGCTCCCGGAGGAACTTCTTCTTGGAGACGTTGTATTGGAAGTCCAGGAGGTTCAAGTTCGTCGGCATTCTTTCTGTTGGGTTGCTCTGGTTTTTTCTTGAATCAGAAGAAGATGATCGATCTTCAAAGTTCAGAGGAACCCAGATGGATTAAACTGATTAAATACTTGAATTTGGAGTTGGGAAAGATGGGTTTGATTCCTTGAAAACTTGTGGTGGTGTGCGAAGGAACCTGGAAGTTTGCAGAGTGTGCGGTAGAGGGAAGGGACAAGCAAACACAGAATATGTAATAAAGGATTGGAAAAGAGTAAAGAGGCCTAAAAAACAGAGGTAGAAGAAAGAAAAGAATGTGATTCTCTTCAGAAAGTTTCCACTTTTCTCTGGCCTTGATCCTCATAACTACTTGGTTTCTATGGCTGATTTTACCCACATTCAAATTTGATTTTTACCTCCATACTTAGCTCGATAAATCAAATTTTAACCTTGGGACCTTTGGTTGGTGATTCTCATAATTTCATCTGGGGTGTTGGGATTTTATAAAGGGGTAGATTATAGAAATTTGAATTTGCGTAATCTTAGTGACGGTAAAAATTAATATATGATATAAAAAATGAATTTCACAAATTTATGAGTTTATATAGCTCACGAGATTAACAAACTTATAGTTGGAGATAAATAAACTCCTGTACTAAATATAGGCTCGACTTCTTAATTTGAATTATACAAGCTTAATCTATGAATCAAACACTCTTAAAGTTTTAAATTCATCAAATTCAACTATTAAAATTAATGTCCTTCCTATGTTGAGTTACTGTGTTTGGAATTGGAGATTTGTGACCAAGGGGTGGGCGTTTTGTTTCTAGGCAAGATTTTTTTTTTTTTTTCTATTTGATTTTGGGGCGAATTTAGATGTTCTTATACTTGGATTTGGTTATTTTACTTACTAAAATTTAAGTATTTTATCGGCAAGGTTTTATATGATTTATCAATCCATTTTGGGTAATTTTTAATTAGAATTTAAAGTTTAATTTAATGAAATTAAGTTTTTAATATTAAATTTGACTCACGGAATTAAGGAGGGAATATATCATAAATTTTTAAACTAACAAAAACCTGCCATAAGC
TAAGAGGTTTAGAAGAGGAAGTTGATAAGGAATTATGAGAAAGTAAGAGAGTGTTTTTGGTGATTTTAAAGGAAGGTTTGGGAATAAAGTTTACCTTTTTTGAAGAATGTTTGAATAAAATATTCTTGTGCTTATCCTTCTTTCATATGCACCAACAAGAAGGAATGTTTTTTTTGGCTTTTGTTAAATTAATTGATGAAAGTTAGCCAAGTGTATCATTTCAAAGATATCGAATGTATTCGTTTGCTATGTGAAAAGAATAATGGTCAATTACATCTTAGTAGTTCGTGGAATTTATAAATCTTGTCCTTCAATTTTAGAGATAGTTTTAAGGTCAAATTACAAATTTAGTATATATATTATGAACATTGCGTCTATTTAATCCATAAACTTTAAAAAAGTATCAATTATGTGATGCGCCCTTAGACTTTGAGTTATGTTTATTAGATAAAATTAGATGAGATAATGTGATCTAATATAATAGCTAGATGAACTAATTACGAATGAACCATTGGATACAATTAAGACTACATTAGATTTAAGAAGGGTGGTAGAATTAGATGGTTGAGTTGGGCTAGTGGTACGAATCGATGCTAAAAGCAATGAGTTCAAGTGAATAGTGAAGAAGTGCCTACTTTCTCATTTGATTTGGTGGTTGAAATTAGGGTTAAAGCGTAATTGTTGATTTATACTCCTTTATCCAAAATTTATCTAATAGAATATCATGTCAATGCTTCATCAAATAGTGTTAACAACTAAAACGCATGATAAGCAAAATTCAAAATTTAGGTATGTAATTAAAATTTTTTTAAATTTAAGAACTAAGTAAACACAATATTTAGAACCAAAAGATAATTAAAATTCAAAGATTTAAGGATGTCAAAAATTAAATAGACATAATATTTAAAAATAAAAGATAAGCAAAATTCAAAGTTTCGAAATATAATTTATACACTATAATAGTATGCTTGAGATGGCTTTTGAAAAAAATAGTTTTAAGTAAAGCACTATTCTATAAGCATCTTTTAAGAAACACTAATTAAGTGCTTTCTTGAAAAGCACTTTAAGTACTAGAGTATTTTACAATTTTAAAATCACTTTTAATTATCTAATCAAAACTATGAAAATTTTGAAAATAGCTTTAGATTGGCTAAAAGTACTTTTTACCTTTTCAAAATTCATCCCAAACTCACTTCATCTTCCTATGAGAAGGAACTCAATCAGTTCTTTATCTCAACTTGGGTTAAAAAAAATATCCAAACAGCCCCTTAAAAAGAAACTTAATTGGCTTGGCTTCTACATAAAGTTTCAAGTTTTATTCCTTAGAAACTCCAAACATGTGTCAAAAAAATTACCCAACAAAGCACCAGCAAATGAAACTATATGCATCTTATACTTTCTATAAATTTCCAAATGGCAATTAGGCAATAATCAATCAATCATTTAGATGATAATATGGAGATAATATGGATTCTCAATTTCTCATACTACAGCAAATTCTGCAACAATTATTGCAGAAATTCTGCTAATTACAAAGCCCTTTTACCTTTAAATTACACTCTAAGCTTTATATATTCTCTTCCATCCTTTCAGTAGCCATATGGATTTCTGATTCTACAACCTAAAAGTCAAGAAGGGAGAGAAACAAAAAATGAATTCCATTAGTTAAAACTTATCTTCTGAGAGAATTTGATGACGTTTAATCTGTAGAGAGAGAGGGAGAGAGGCTGATAGAGAAGCATTTGTTCATGGAACCAAACTTGAAGAACTAAGTTTCTTTTTCTTACTCGACGTGTATGCCTCGCTTCTGCAGTGCTTCGAGAACTTTCGACTTCAGATCGTTATTCTGGAGAAATCTGTCGACACAAAATCATCAATCGAAACCATTAGATCATCTCTAAAAACTGTGCGGAGGGAAAGGGAACACTGTATTTTACTCCTGCAGAGTTCAAAGTTTCAGTGTTATGAATATGGCCTCCTCAGATTCCAGAACAAAAATA
AACCCAGTCGTCATCAAGCCATTGAAAACGGATGAAAAACAAGTAAATGTGCAAACTTATTTCTATTTTAAGCTCTATTTTATACATTGACAATGTTTGAGGATCGAGGTGAAGGCCAAAAACAAGTCGGGAGTAAGATAGTTTAGTTCTTTACTTGGGTGAGGGCATTTCAGATAAAGAATGAAGATCATGTATCTGTAATAATCAGAAGTATGGTAGCTACCGTGACACCACTGAGTTCTTTCATCTACTCAGCTAATCCTTTTACGTCGTTATATCGTATGTGCGAAAAGAAGCAATAATTTCAATGCCCAAACTCACATTTCACGAATATGTGGAAGCTTGGATAATGAATGAGGAATTGGCCCTTCAAATTGGTTTTTTTCCAGATGCCTTAATCAGCAAAACAGAGTTAGAAAACAGGTAAGAGAGTTCCAGATTAAGTAAACGAGGCATTGATTGGAAAACGTACAAAGTCTGTAACTCCTTCAACGAACCCATTTCTGGAATAGAGCCCGAGAGTTTATTGTTACCGAGCCAACTGCAAAAGCAAATGCAGCCATTCAAGAAGTGGGTTTCAAGTATGGAGACCCAGATAGCTAAGTTTAACAATGGAAAAAACTTCACTTACAGATGAGAGAGGGCAGTTAAATTGTTTATGCTTGAAGGAAGTGTGCCAGATATCCCAAAGTTTGTCAAGTTCCTACACAAATATTAAAGTTAGGGAGAAAAAGTTTAATCTTAGTTCATTTGAAACATACATAAAACAGAGTCGCTATGAAATAAATCACAAATTCCTGGGTTGGCTTCTTGAGTTGAGAACACTTACAAAGTGACAACTCGAGCAAGTTTTCCCCCGGAACAAATAACTCCGGTCCAGGAATTCTCCTTAGGCAAGCAAGGATCGCCACTCCAATCATGAGGTGGGTTGTTGAAGCTTCTTGCCAAGTCCTCCATTGCCATCACTGCCAAATTATTCAAAGAAAACAACAGAAACTTTGTCAAACCAGGAAGAAGTAGAAGAAGAAGAAGAGCATGGCTTCTTTTCTGCTTCCTCGCATTAATCTCAACTCGAGTGAACGGTTGATATTTTATGAATATAAAAGAAAATGAAAACATTTGTCAAAGTTGATTCTTTCAAGTTGCATTATCCCATGGAGGAGCATATGTTTATGAAGCAGAAAAGTACCATCTCTAGTTAATGTCCTTCCACCAAGCGGAACGATCTGAAGGATCTCAGCAGCATTAATCACAGGCCCAACAGGCATGTCATCTGCAGGAGTTAGTTCTATCTGGGTCTGGCCAGATAAAGGCCATTTCGTACCATAAACAGACACACCATTAGCAGTGACATTCAGGTTTGTAAAGAAATTCTTGCCGTTTACTGCAACATTAAAGACCCTCCAGCTAAACGGACTCGGAGAACGGTTGTCCTGAAAGTAGAATGAGATGTAGTAGTGAGCTGCTGGAAGTGTAAATGGAGGCCAGTTGATTTTGAGCGACTTCCCACGGCTGGTTGTCATGGCGGTGTAAAAAGCCTTCGTTGGCGGGAGGTTCCAGAAGTTTGAGGGAGTTATGTTGGCATGGCATGTTACCAACGGGTTTTCGTCCACGAACGGGTGCCACAGCCGGTTGAATGCATCGTCTGGAAAGCTAACTCCACATTGATATTTCCAGTGTTAGTTCTCAATTGCTTTCTAGTTTCTTGGATCAAAGAGTGATCAAACGGTGAGATAGAGGTTCTTCATTGAATTCATAAGTTTGTTGAAGTTAGATTCCTTTTTTCAAGTTCATTCATCAAACCTATACATAAACAACCTCCACAACCCAACACAAAAAGAAAAGAAAGAAACAACTTTTAAGCACATAAAACAGTTTAAATCCTATTTGGCTCCTACACTTTCATACTTATTTTATTTCCGTCCCTCAACTTTCAACCATTTTGTTTTAGTCTAAACTTTACCTGAAAAACTATTTTAGTCAAAAAGATAAACTTGTAC
AAACATGTTTAACAACAATATACACAGACAATTTCCAAATGTTTTAGATTCATGAACATCTATGGTTTCTAACCAAACAATGCCATTTTAAATCATACAAATTCTTAGCTGGGTCTTCAACCTGAGAAAAAATCAATCATATCATAATCCAGAAAATCTTACAAAAGAAGTTTGCCTACCCAATGACGCTTTCGTCGTGTCCAAAGCTGGTTCTGGCAACCGTGCTCAGCGCATAGGTCTTGAAATCCGTCGAGTTATATACAGAATCCTCCAAAAATTCCAGCTCCAGAGCCGAAATGAAGGGGCTCGAAACCGTGTGCTGGTTCCTCCCCAAGCACACGCTCATCATCTTCCCCATGGCGACCACCACAATCTCGTAATACGACGACAGCCCCTTGGCGAAATCCTCCGTCGTGTTGACGGTGCTCCATTTGGTCCCCTCCACAATCTGGTCGAACACCGGCGGCTCCTTCCCGCCGTCGAACCCGCCATAGTAATACGTCGTTCTCACGAGGTATTTCCCGCCCTTCACTACGGGAATCGCGTAGCAATATTTCCGGGCCGACTTGTCGGGAAAATAGCGCAATGTGGACAGTATCGGCACCAGGTTTGGATCGTCGAGCTTCGTGGTGTTCCCGACGGAGGTGAACCCCTCGTCGGTGATGTACTGCAGACTGCCGACGGTTACCTTGGCGGTCGATTCGCTGGCCCCACAATTGAGAAGATAGCCTGCATTTTGGTGGGTAAGACACAAAAAACGCAATGTGATAAAGATTTAGATGAAATCATGCATTTAAGAAGAATTATGTGAAATGGGTATCGTTTGTTGATGGGAAAAAACATGGAAAACGGGAAGATGAAGGAGAAAGAGTACATGGGGATGAAGAAATGATACCTCTCGGGCCAGGATGAGAGGCGGAGGAGGAGGAGGAGGAGGAGGAGGAGGGAAGAGGGACGGTGGCGAGGCAGAGGAGGAGGAGGAAGATGGGATTGGGAAGGGTCATTTTGATTCTTTTTTTTGTGTATTTTTTGAATCTTTTTTTGGTGGAGGGGGAAATTTTACTCCACCAGAAAAAGATGGACTGCTTTTTTGCAGGTTTTGTTGAAGAGAACCAGACAGAAAGGTGAAAGGAAACGCCGGGGACTCGGCCGAGGGAGCGGTGACACGTGGTTGTTTGTTCTTTAGAAATTATTTACTAAAAATATATTTTATATTTTAATACCAAAAGCGCCTGTAGCTCAGTGGATAGAGCGTCTGTTTCCTAAGCAGAAAGTCGTAGGTTCGACCCCTACCTGGCGCGAAATATGATCATTTTTTTCCTTTTCCTCTTTTTTGGTCAATTTGAAATAGTTTTCATAATTTAAGAAAATATGATTACTTTAATAAGTTAAATTGATTAAAATTATAAAAATGTCCAATTTGATGGAATCAAATTTTCTAAATTCACTTTTATTACGTTAATTTAAGCCACGAGTTTGGCAAAGTTTTGGTGCAGCTTTCATAAGATTGAATGATATTTGATGTGATGTTGTTAATCTTTGTGCCCTCTTGAACTTGTTTTGAGTGTTAGGATACAAGCCACTTGACCTTAGCTAGGTGGCTTCTCATTGTACCTTTGTCTGCCCATGATCCACCGAAGTGGCTTATCACTTAGTCGAACTATGATTCAATTATGTTGGGTAGTTTCTTTCCCCTGGTTAGCCTACAACCCATCTCATTTAATTGGGGAACGACTATTTGATGACTTTTCATCTCACTTGGTTGAGCTATGACCCACCTGGTAGCTTCTAATCTCAGTCAGTCAATTCGACCCACTTAGTCTTCTTTGGCCAAGTAGCATCCTACTTTGTTTCACCTAGTTTGGGCACTACACACCTCATGGGTTCATTAAGGTTGGCAACATTACTATAAGAAAATAGACTTTTAGCGACGCAAAAAAAAAAAAAAACATCACTGAAAGGCAAAAATCGTCGCTAAAACTATTAGCAGCGTTTAGGCA
TTTGTCGCAAGAATCATCAGTAATACTGCGTTTTCTATAGTTTTAGCGAGGCAATGCAAGACATCACTAAAATTGACATTTCAGCGACAGCTGGGATGTTGCTAAAACGACACTTTCTACGACATATGTACAGACATCGCTAAAGCTTGATTTTTATGCGACATATGTATAGACGTCACTTAAAGTTTAGTTTTAGCGACACAGCCTTCGTCGCTCAATATGGCTGAAGATTTTTGTTGGTATTTTTAGTGACATCAGACAACGTCGCTAAATATGTTGTAAACGTCTCTAAAAAACAAACTTCAAAACTTAATTTTTCCATTTCATATCATTCACCAACATATATAAAACGTCACTAAAAGTTATTCATTGTTAAAAAAAAAAACCAAGTCTTAATTTATAAAATAAAATCTATTATAAACTAAACATCATCAATCATCTTACAAAACTTAATGTAGCCACTTTTACAAAACAAGCACAAAATTGTCATATAAATTTGCACAAGAAAAAAAAAGTTCCCACCTCTTCAAAACGTTCCATAAAGCCATGTAAGTAGTACTCGAAGCATCAATACATAGGAAAGTTATCTGGTCTACGTCATGGCTAAGGCCTCTTGGCAATTGTGATAGTAATGGTGTTTTCATGTGCCTAAAATTGAGAAAAGATGTCCAGTCTATGTCCTGTCTTAGGCCTCTCCCACTTCTGCAACCAAGCAATCTCAGTCAGCATTGAAGGCAAAAAACACTAGCTATAACCTATCCAAGCACTTCATACACATGATAAAATTTCTTATTGACTACTTAATTTCGTAAGAACATCAATAATTTCATATCACAAAATGAATTGATTAATTTGTAACCTTCTCCATGAATGTTGGAAAAACTCATGCAAAAACTGAAAACAATTCAAAGATCATACTGAATGCTTAAGAATGAGAAGTTTAATTAAGAATAAATTTGACGAACAAGTAAGAAATCAAAGTAAACAAACAATCAAAACCTAACAAACATGACTTTCAAAACCAAAAACTTGATAAAGTTATGAAATGCCTAAATGCTTGAGCATTGGCCTTCTGGTCAACTTCGTCATGAAACAAACATAAAGCCAAAGCAGCAACTTCTCTAGAGAGAGAAAAAATGACAGAAAATAAAGGAGTTCAATGAAAACAGTTAAGATCCAGCAAGATCAAAGAAATGCCCCTTGATTTCACTGGACAAAAATTTGGTGCAAAAAGATTAAACTATGACTACCATAACTACATGACAAATGCACATCCCAGATATAACTCTTCTTCTACTTCATGGGCATTCGACGTAATTCATTTGTCTCATTTTGCTCATTAGACTCTTTAGTAATGTCTTCAAACACTGGATCATGCAAATCATTCATCATTTCAAACATTTATCCTCCGTATTTTGAACTCTAATTAACTTAATTTATAAGATAAACACCATCTTCTTTTGAACTGAATTTGAAAATGTTGCATCAAAAAACCTTGAATTAGGTCTTAAAAATGGAGAAAGATCTACCTTATCTCTATGAAATATCTATTGGCTATAAGTACGAAACATGTCATATTTAAATCAATGACGTTCTATGTCATCTAATTTATTCCAATCAATGTTCAAACAGTTTGAACAAGGACATCTTGTTCTCCCTTCTTTATCCAAATAATTCTTGGCAACTTCAATAAATTACTAACACCGTCTGAATATTGACTAGACCTTCGATCTTTCAAAGTTATCCAACTTTTGTCCATTGCAATTTCGTCTTCAAAATCCACTTTTCAAGTACCTAGTCAGAAATCAAATTTTAACTATAAACGTTTGGAAATATTATAGCATAATTTACAAAAAATTTCAAACTTCAATTCAAGCCAAAAAGACAAAAAATTAAGCCCATTTCAATTCTCACACTTTGGTGCACTTTTCCCTTTCGAATTCTCGCCCAACAACCACAATAACCAGCTCCTAAAGGCATTAATTTAATCAAAAAAATTT
GCAGCAACGCCCCTCTCAGTGATCTATGCAAAATAATTTAAAGGACTCATCAATGGAGTCGACCTCTACAGCCTAGCACAAATTATAATGGAAATTGAAGTAGAGAATAGAGGTGGTTACCGATGGCGAGATTGAAGGGGGCGTAAAGGCTTCTACGGGGGGCAGCGTTATGGCGGCGGACGGCGGCTAAGGGGAGAAATAGGAGAAGAAAAACACCAGCTGGCGGTGTCACAAGAAAAGAAAGAGGGGGCGTAACGGCGGTGTAAGACGGTGGCTAGGGAGGGGGGCAGCGTAAGGCTGAGAGGAAGAAGAAGGGGAAGGGGGACGGCATGAATCTCTGAATGTCGAAGAAGGTGGCGTCAGACGGCTTAGAGAATATGCCTTAATTCTTCAACCACGCTATTATAATAGCATCGCTGAAGATGTCTTTCTGCGACTCGCGTACTAATTATAGGCATTACTTTCAGCGTATTCGTCGCTAATACATAACTTTCAACGACGTTCTGTTTATATATGTCGCTGAAAATTATTTTTTAGTGTCGTTTTTTTGTTGCATCGCTAAAAAAAAAAGGGTAGCTGAATGCCCCATTTCTTGTAGTGCATTCTCGTCTCTGCCTTCAAATGGTAAAGTAATCCTCGTCTCCACCATAGATCTATTCTTAGCATGGCAGAATCAGAATCAATATGGAAAGGAAGGAAGAAGGAGCATGTAGTCGGTAGGGCCAAAAGAGAAGGGGGAGAAGATAGATGGATCTAAACTTTTCACTTCATATTTTAATTTAAATATTAAAAATAAAGAAAATATAATATTGATAATAAAATTTTACTTCAAATGTTGGCTGAATTTCTTAAAATAGGCTTGACGTTGAAAATTGGACTTCCATAAAAAAAAAATGTAAATATAAATAATAAATTATATATATTTTTTGGGGTCGGGAACGGGTGTGGGTGGGGATTCCATTCCTTGTCCCCAATTGGGGATATAAATTTTATCCATGATTTTGATTTAGAACCCAATCAAATCAGGGATTTCCTGCCCCAATTGGGTTGGGGACTAGTAGGGATTTGATATACGCCCTACTAGTGTGGGCACCACTTTGATATAGGCCCAACCATAGGCCATTATCACCCGCCAAAGAACTCGGATAGCCTAGGATATAGTCTATAGGTTGGCTGAGGAGACAACTAGGTATGTCAAAAGGTGAAGTAATCATCACGACTCCTTCTATAAATAAGAGAGGATGTTGAGGTACGCTATCTTATTCCCTACTCAATTCGACACAAGTTGAGATCTAACTTAAGTATTGGAGTGTGTTGGGCAAACACCACACTGGTGTACAATTTCTTTCTATGGAGGCTACTATCCCAGTCACCATGTACTCGACCAGAAGCTCAGTCTCCACCTCGTTGCTCAAGTCGAAATTCCACATCAATAGAATCTACCAAAAATTATTTTAATTTTTTTTTTAAAAAAACTCATACAAATAAAAATAAGAAAAAATTTGCGGTGAAAGAAACATATTTTGTTTGTGTTGAAAATAGTTTGGAAGAAACGTGGGAGTGGCTATCGTTTGGAAAAGTGGAAGACGAGAGAGATGATGATAAGGCAAAAGGCGTATCGCGAAACGGAACCTCAAACTGTCGCTGACAGCTGACTGAACCGCGTACTTGCCAAAACTATTCAACGTCACCGAGTACAAGAAACCCCCAAGAAAAACCCCACCCTCTTCACTTCCATTTGACCGACCAATCATGATCACTCTCTCTCTCTCTCTTCAGGTGTTTTTCTTTATTAAATTAGATCCTCCCTCCCCAACTCTCCTTTTTTAAAATAAAAAAAATTTAAGCACGGGCTAAATATCAATTTCGTGTTAATCTTTATTTATTATCTACGAATTAGTCCATTTTAATTAAACACGTGTTCTTTTTCATTGTACACCTGGCACCATGGGGTTGCATGGTTTTGGATATATGGAGTATTATAATATTACTCACTTTGA
GAGCTAGCTCGTGGGGTTTTAAAAGAATGGATTTTTATTTGATTCTGATTGCCTATTGTACAATTTCTCGACAATACTTGGAAAATTCTTTCCTAATTTAAAAAAATATATATTGAAAGAATTTTCTTATCACTTATTTTTTTTTAAATAATGATTTGATTTTACCAACTCCGAGATTGACCGGTAAGATCCATTATCCATCTCTGAATCTCGTCATCCTCCTCGAACGTCTCCGGCAATTCGCCGTCGTTTCCGATCTCCGTTTTCGGCGAGAATGAGAACTCGTTGTTCGTCGTCTTCGGAACTTCATTGTCGTACAGAGTTTTCTGTCGATTCTCTGCAGTGGCAGTGGCGGCGGTGGTTGCAGACGCCGCCGCCAGAGAGGAACTTGGGTTGAATGTTGTCAACGAGTTGTCGGGGAACGGGAAATTGAGTTTGGCTCGTGGGCCGCGGAAATTGATGGCGGCTTCATCGTAGGCACGGGCGGCGTCCTCCGCCGTGTTGAAAGTGCCGAGCCAGACCCGGGTCGCCCTCTTCGGGTCACGGATTTCGGCGGCCCATTTCCCCCAAGGGCGCTGCCTCACGCCTCTGTAATTCTTCTTCAACCGCTTATTCCTGCGGCCGCCGCCCTTCTTCTGGTCTTCCGGCATGAAGAAGTTGCAACCCAGGCAGCCTTTAATCCTGCACACTTGACATGTGTCGAAATCGGAAGGAGGGAAAACGGAGCCGTTGGTGGGAGTGGAGAAAGACGAAGCCGAAGCACATTCCGAGATGGGCGAGAGGAGGTGAAGAAAATGGTCATGGCGAAATTCGAGTCCGGAGGAGGCGGCGCCGGAAACTACTTGGGTGAGGGCGTCGACGATGACGGAAACCTCCTGCTCCTCGGAGATGCGGCGGAAGGGGTGGCCGAGGAAGGCGGGGGGGTCGGAGGACATTTGCATGCCGGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGAGAGAAGGGGAGGGTTTGAGAATGAGTTGGTCAGAGGGGGGAGGAGTGAGAAAGCGAGTATAAAAGGAGGGAGAGGGGTGACGTGGCAGACAGCTGGAACACCGCCGAGACACGGGTTGAAGAGGGGGGCACGTGACGAATAG

mRNA sequence

ATGGTGGAATCGACGCTCACCTATGAGGAATGCCGACGCCAGAGGTTGGAGGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAGCTTGCCGATGCGCTAAAAGCTTCCAGCCCTAAGTCCTCTCCGACGAAACAGCTAAAGCGTCCTCGCCAGCCACTCGATATCACGTCCGTCCGTGTGAGAAGGTCCAGCCGTTTTGCGGATAAGCCCCCTCCGAACTATAAGGAGGTCCCCATTGAACCACTTCCAGGTGTCAGAAGGATCTATCAACGGAGAGATTTGTTGAATCGAGTTTATGCTTCACACGAAGAAAGACGATATGCTATTGACAGAGCAGAAGAACTTCAGTCTAGCCTGGAATCTAGGTACCCAAGTTTCGTGAAGCCCATGCTTCAGTCACATGTCACCGGGGGATTTTGGCTGGGTCTTCCAGTTCAATTTTGCAAGACACACCTTCCCCATCATGATGAAATGGTTACTCTGGTGGATGAGGATGCGAATGAGTTCCAAACAAAATACCTTGCGGAGAAAACGGGTCTTAGCGGTGGTTGGAGAGGGTTTTCACTTGATCACCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGGGCATATAATTCCGAAGACAAAGCAGACACCAACGAGGATTCCGATGTCTCTCAACGGGAAAGTAGTGGCAAAAGAATTACTAGATCAGCAAGTAAAGGCCTTTTCAGATCAGAAATGGCGTCTCTCATGACGGTCCGGCGTGCTCGAAGCCCCATACTCATCTCCTCATTCTTCAAGGTACGGTCTCCACTCCCTTCTCGTTTCACGTTCTCTTGTGGAAACCAGACAGAGACTCTGATCAAAGCCCTAAGCACCTCTGCGGTCCCTAATGATTACTCAAATTTTCCTCCTCCACCACAACAACATCCTTCGTCTGATCCTAGAACTCTTCAGGGCCGGGAAACGCCTGGCCAGTGGGGCACGCCAAGCCAGGTTCATCATCGAGTTGGAAACTTTAATAACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAGCCCTAGTAATCAATTGAATTATCAGAATCAGAACCAAAGCTCTCATCCGAATCCTGGATTTGCCAGGCAGGGTCAGAGCTATACTCAAGCCGCAACAAAGAGGCCCAAACCAATGGAACAATCAAATCAGGGATACCCACAATTTGGAAAGCCTCCGCAGCGGAACCCACAAGTTGAGAATTCTAATCAGCCGAATAATCAGGTTGGGATTCAAGGACACGGTGCTCAAAATCAAGCATCAAATGCCCTTGTATCTCCTATCGATGAACTGCGGCGCTTTTGTGGAGAGGGAAAGATTAAAGAAGCTGTTGAATTGTTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATGTGTTGTTTGAACTATGCGGGAAATCAAAGTCATTTGACAATGCAAAAATAGTTCATGATTACTTCTTACAGTCAACTTATAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGAAAATGTGGAAGCATGAGTGATGCACGGAGAGTGTTTGACCATATGCCTGATAGGAATATTGATTCTTGGCATTTGATGATAAAAGGATATGCGGATAATGGATTTGGTGATGAGGGGCTAGAGTTATTTGAGAATATGAAGAAGCTGGGATTGCAACCCAATTCACGAACTTTCCTTTTTATAATGTCAGCTTGTGCTAGTGCGAGTGCTGTAGAAGAAGGATTTATGTACTTTGAATCGATGAAAAATGATTATCATATCACCCCAGACATGGATCATTATTTAGCGCTTTTAGGTGTTCTTGGAGAACCTGGACACATCAACGAGGCTTTCGAGTATGTTGAAAAACTGCCCATGGAACCCACAGTTGAAATCTGGGAGACTTTGAAAAACTATGCTAGAATTCATGGAGATGTTGA
TCTTGAGGACTACGCCGAGGAGCTAATTGTTGCTCTGGACCCGACAAAAGCTGCTTCTAATAAGATAACCACACCACCTCCCAAAAAACGGTCTGCCATTAGCATGCTTGATGGGAAGAACAGGATTGGTGAGTTCAGAAATCCGACTCTCTACAAAGATGATGAGAAGCTAAAGGCTTTGAAGGCAATGAAAGAACAAGGTTATGTGCCAGATACTAGATATGTACTTCATGATATCGATCAGGAGGCCAAAGAGCAGGCATTGTTGTATCATAGTGAACGATTGGCTATTGCATATGGCCTGATCAGTACCCCAGCACGGACACCTCTTAGGATCATTAAGAACTTACGGATCTGTGGTGACTGTCACAATGCGATTAAGATCATGTCTAGGATTGTTGGGAGAGAGTTGATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGGTAAATGTTCTTGTGGGGATTACTGTCCTCAAGGCTGCAGCTCTCCCCCAACCCCTCAACACCCTCATCACCTCCTCCGCACTTATCTTCTTATCGCCGTTCAAATCGAACGTCCTGAAAGCAAACTCCACGTCCCTCGCCTGAACTCCCCCGCCGTTCCGGTAAACCTCCATGAACTCGCTCAGGTTTATGTACCCATCGCCGTCGGCGTCCACCGCCCTGAATATCTTCTGCACCTCTTCCATAGAGTTCCCTCTTCCCAGCGCCTTCAGAATCCCCCTGTACTCGTGCTTCGAGATCCTCCCGTCTCTGTTCGAGTCGAACTTGTTGAAGATCTGCTTTATCTCCTCCGCGCTCGGCTGCAGCGCCCTTCTCAGGCCGGAGCTCTGCCGGTCCTTGAACGAAAACAGCCGGGAGGGCTCCCGGAGGAACTTCTTCTTGGAGACGTTGTATTGGAAGTCCAGGAGGTTCAAGTTCGTCGGCATTCTTTCTGTTGGGTTGCTCTGTGCTTCGAGAACTTTCGACTTCAGATCGTTATTCTGGAGAAATCTGTCGACACAAAATCATCAATCGAAACCATTAGATCATCTCTAAAAACTATAAAGAATGAAGATCATGTATCTGTAATAATCAGAAGTATGGTAGCTACCGTGACACCACTGAGTTCTTTCATCTACTCAGCTAATCCTTTTACGTCGTTATATCATGCCTTAATCAGCAAAACAGAGTTAGAAAACAGAGCCCGAGAGTTTATTGTTACCGAGCCAACTGCAAAAGCAAATGCAGCCATTCAAGAAGTGGGTTTCAAACACACCATTAGCAGTGACATTCAGGTTTGTAAAGAAATTCTTGCCGTTTACTGCAACATTAAAGACCCTCCAGCTAAACGGACTCGGAGAACGGTTGTCCTGAAACCTTCGTTGGCGGGAGGTTCCAGAAGTTTGAGGGAGTTATGTTGGCATGGCATGTTACCAACGGGTTTTCGTCCACGAACGGGTGCCACAGCCGAAAATCTTACAAAAGAAGTTTGCCTACCCAATGACGCTTTCGTCGTGTCCAAAGCTGGTTCTGGCAACCGTGCTCAGCGCATAGGTCTTGAAATCCGTCGAGTTATATACAGAATCCTCCAAAAATTCCAGCTCCAGAGCCGAAATGAAGGGGCTCGAAACCGTGTGCTGGTTCCTCCCCAAGCACACGCTCATCATCTTCCCCATGGCGACCACCACAATCTCGTAATACGACGACAGCCCCTTGGCGAAATCCTCCGTCGTGTTGACGGTGCTCCATTTGGTCCCCTCCACAATCTGGTCGAACACCGGCGGCTCCTTCCCGCCGTCGAACCCGCCATAGTAATACGTCGTTCTCACGAGGTATTTCCCGCCCTTCACTACGGGAATCGCGTAGCAATATTTCCGGGCCGACTTGTCGGGAAAATAGCGCAATGTGGACAGTATCGGCACCAGGTTTGGATCGTCGAGCTTCGTGGTGTTCCCGACGGAGGTGAACCCCTCGTCGGTGATGTACTGCAGACTGCCGACGGTTACCTTG
GCGGTCGATTCGCTGGCCCCACAATTGAGAAGATAGCCTGCATTTTGGTGGCGTTATGGCGGCGGACGGCGGCTAAGGGGAGAAATAGGAGAAGAAAAACACCAGCTGGCGGTGTCACAAGAAAAGAAAGAGGGGGCGTAACGGCGGTTGGCAGTGGCGGCGGTGGTTGCAGACGCCGCCGCCAGAGAGGAACTTGGGTTGAATGTTGTCAACGAGTTGTCGGGGAACGGGAAATTGAGTTTGGCTCGTGGGCCGCGGAAATTGATGGCGGCTTCATCGTAGGCACGGGCGGCGTCCTCCGCCGTGTTGAAAGTGCCGAGCCAGACCCGGGTCGCCCTCTTCGGGTCACGGATTTCGGCGGCCCATTTCCCCCAAGGGCGCTGCCTCACGCCTCTGTAATTCTTCTTCAACCGCTTATTCCTGCGGCCGCCGCCCTTCTTCTGGTCTTCCGGCATGAAGAAGTTGCAACCCAGGCAGCCTTTAATCCTGCACACTTGACATGTGTCGAAATCGGAAGGAGGGAAAACGGAGCCGTTGGTGGGAGTGGAGAAAGACGAAGCCGAAGCACATTCCGAGATGGGCGAGAGGAGGTGAAGAAAATGGTCATGGCGAAATTCGAGTCCGGAGGAGGCGGCGCCGGAAACTACTTGGGTGAGGGCGTCGACGATGACGGAAACCTCCTGCTCCTCGGAGATGCGGCGGAAGGGGTGGCCGAGGAAGGCGGGGGGGTCGGAGGACATTTGCATGCCGGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGAGAGAAGGGGAGGACAGCTGGAACACCGCCGAGACACGGGTTGAAGAGGGGGGCACGTGACGAATAG

Coding sequence (CDS)

ATGGTGGAATCGACGCTCACCTATGAGGAATGCCGACGCCAGAGGTTGGAGGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAGCTTGCCGATGCGCTAAAAGCTTCCAGCCCTAAGTCCTCTCCGACGAAACAGCTAAAGCGTCCTCGCCAGCCACTCGATATCACGTCCGTCCGTGTGAGAAGGTCCAGCCGTTTTGCGGATAAGCCCCCTCCGAACTATAAGGAGGTCCCCATTGAACCACTTCCAGGTGTCAGAAGGATCTATCAACGGAGAGATTTGTTGAATCGAGTTTATGCTTCACACGAAGAAAGACGATATGCTATTGACAGAGCAGAAGAACTTCAGTCTAGCCTGGAATCTAGGTACCCAAGTTTCGTGAAGCCCATGCTTCAGTCACATGTCACCGGGGGATTTTGGCTGGGTCTTCCAGTTCAATTTTGCAAGACACACCTTCCCCATCATGATGAAATGGTTACTCTGGTGGATGAGGATGCGAATGAGTTCCAAACAAAATACCTTGCGGAGAAAACGGGTCTTAGCGGTGGTTGGAGAGGGTTTTCACTTGATCACCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGGGCATATAATTCCGAAGACAAAGCAGACACCAACGAGGATTCCGATGTCTCTCAACGGGAAAGTAGTGGCAAAAGAATTACTAGATCAGCAAGTAAAGGCCTTTTCAGATCAGAAATGGCGTCTCTCATGACGGTCCGGCGTGCTCGAAGCCCCATACTCATCTCCTCATTCTTCAAGGTACGGTCTCCACTCCCTTCTCGTTTCACGTTCTCTTGTGGAAACCAGACAGAGACTCTGATCAAAGCCCTAAGCACCTCTGCGGTCCCTAATGATTACTCAAATTTTCCTCCTCCACCACAACAACATCCTTCGTCTGATCCTAGAACTCTTCAGGGCCGGGAAACGCCTGGCCAGTGGGGCACGCCAAGCCAGGTTCATCATCGAGTTGGAAACTTTAATAACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAGCCCTAGTAATCAATTGAATTATCAGAATCAGAACCAAAGCTCTCATCCGAATCCTGGATTTGCCAGGCAGGGTCAGAGCTATACTCAAGCCGCAACAAAGAGGCCCAAACCAATGGAACAATCAAATCAGGGATACCCACAATTTGGAAAGCCTCCGCAGCGGAACCCACAAGTTGAGAATTCTAATCAGCCGAATAATCAGGTTGGGATTCAAGGACACGGTGCTCAAAATCAAGCATCAAATGCCCTTGTATCTCCTATCGATGAACTGCGGCGCTTTTGTGGAGAGGGAAAGATTAAAGAAGCTGTTGAATTGTTGAAAGAAGGTGTTAAAGCTGATGCTGATTGTTTCCATGTGTTGTTTGAACTATGCGGGAAATCAAAGTCATTTGACAATGCAAAAATAGTTCATGATTACTTCTTACAGTCAACTTATAGAAGTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGAAAATGTGGAAGCATGAGTGATGCACGGAGAGTGTTTGACCATATGCCTGATAGGAATATTGATTCTTGGCATTTGATGATAAAAGGATATGCGGATAATGGATTTGGTGATGAGGGGCTAGAGTTATTTGAGAATATGAAGAAGCTGGGATTGCAACCCAATTCACGAACTTTCCTTTTTATAATGTCAGCTTGTGCTAGTGCGAGTGCTGTAGAAGAAGGATTTATGTACTTTGAATCGATGAAAAATGATTATCATATCACCCCAGACATGGATCATTATTTAGCGCTTTTAGGTGTTCTTGGAGAACCTGGACACATCAACGAGGCTTTCGAGTATGTTGAAAAACTGCCCATGGAACCCACAGTTGAAATCTGGGAGACTTTGAAAAACTATGCTAGAATTCATGGAGATGTTGA
TCTTGAGGACTACGCCGAGGAGCTAATTGTTGCTCTGGACCCGACAAAAGCTGCTTCTAATAAGATAACCACACCACCTCCCAAAAAACGGTCTGCCATTAGCATGCTTGATGGGAAGAACAGGATTGGTGAGTTCAGAAATCCGACTCTCTACAAAGATGATGAGAAGCTAAAGGCTTTGAAGGCAATGAAAGAACAAGGTTATGTGCCAGATACTAGATATGTACTTCATGATATCGATCAGGAGGCCAAAGAGCAGGCATTGTTGTATCATAGTGAACGATTGGCTATTGCATATGGCCTGATCAGTACCCCAGCACGGACACCTCTTAGGATCATTAAGAACTTACGGATCTGTGGTGACTGTCACAATGCGATTAAGATCATGTCTAGGATTGTTGGGAGAGAGTTGATTGTAAGGGACAACAAACGGTTCCATCATTTTAAGGATGGTAAATGTTCTTGTGGGGATTACTGTCCTCAAGGCTGCAGCTCTCCCCCAACCCCTCAACACCCTCATCACCTCCTCCGCACTTATCTTCTTATCGCCGTTCAAATCGAACGTCCTGAAAGCAAACTCCACGTCCCTCGCCTGAACTCCCCCGCCGTTCCGGTAAACCTCCATGAACTCGCTCAGGTTTATGTACCCATCGCCGTCGGCGTCCACCGCCCTGAATATCTTCTGCACCTCTTCCATAGAGTTCCCTCTTCCCAGCGCCTTCAGAATCCCCCTGTACTCGTGCTTCGAGATCCTCCCGTCTCTGTTCGAGTCGAACTTGTTGAAGATCTGCTTTATCTCCTCCGCGCTCGGCTGCAGCGCCCTTCTCAGGCCGGAGCTCTGCCGGTCCTTGAACGAAAACAGCCGGGAGGGCTCCCGGAGGAACTTCTTCTTGGAGACGTTGTATTGGAAGTCCAGGAGGTTCAAGTTCGTCGGCATTCTTTCTGTTGGGTTGCTCTGTGCTTCGAGAACTTTCGACTTCAGATCGTTATTCTGGAGAAATCTGTCGACACAAAATCATCAATCGAAACCATTAGATCATCTCTAAAAACTATAAAGAATGAAGATCATGTATCTGTAATAATCAGAAGTATGGTAGCTACCGTGACACCACTGAGTTCTTTCATCTACTCAGCTAATCCTTTTACGTCGTTATATCATGCCTTAATCAGCAAAACAGAGTTAGAAAACAGAGCCCGAGAGTTTATTGTTACCGAGCCAACTGCAAAAGCAAATGCAGCCATTCAAGAAGTGGGTTTCAAACACACCATTAGCAGTGACATTCAGGTTTGTAAAGAAATTCTTGCCGTTTACTGCAACATTAAAGACCCTCCAGCTAAACGGACTCGGAGAACGGTTGTCCTGAAACCTTCGTTGGCGGGAGGTTCCAGAAGTTTGAGGGAGTTATGTTGGCATGGCATGTTACCAACGGGTTTTCGTCCACGAACGGGTGCCACAGCCGAAAATCTTACAAAAGAAGTTTGCCTACCCAATGACGCTTTCGTCGTGTCCAAAGCTGGTTCTGGCAACCGTGCTCAGCGCATAGGTCTTGAAATCCGTCGAGTTATATACAGAATCCTCCAAAAATTCCAGCTCCAGAGCCGAAATGAAGGGGCTCGAAACCGTGTGCTGGTTCCTCCCCAAGCACACGCTCATCATCTTCCCCATGGCGACCACCACAATCTCGTAATACGACGACAGCCCCTTGGCGAAATCCTCCGTCGTGTTGACGGTGCTCCATTTGGTCCCCTCCACAATCTGGTCGAACACCGGCGGCTCCTTCCCGCCGTCGAACCCGCCATAGTAATACGTCGTTCTCACGAGGTATTTCCCGCCCTTCACTACGGGAATCGCGTAGCAATATTTCCGGGCCGACTTGTCGGGAAAATAGCGCAATGTGGACAGTATCGGCACCAGGTTTGGATCGTCGAGCTTCGTGGTGTTCCCGACGGAGGTGAACCCCTCGTCGGTGATGTACTGCAGACTGCCGACGGTTACCTTG
GCGGTCGATTCGCTGGCCCCACAATTGAGAAGATAGCCTGCATTTTGGTGGCGTTATGGCGGCGGACGGCGGCTAAGGGGAGAAATAGGAGAAGAAAAACACCAGCTGGCGGTGTCACAAGAAAAGAAAGAGGGGGCGTAACGGCGGTTGGCAGTGGCGGCGGTGGTTGCAGACGCCGCCGCCAGAGAGGAACTTGGGTTGAATGTTGTCAACGAGTTGTCGGGGAACGGGAAATTGAGTTTGGCTCGTGGGCCGCGGAAATTGATGGCGGCTTCATCGTAGGCACGGGCGGCGTCCTCCGCCGTGTTGAAAGTGCCGAGCCAGACCCGGGTCGCCCTCTTCGGGTCACGGATTTCGGCGGCCCATTTCCCCCAAGGGCGCTGCCTCACGCCTCTGTAATTCTTCTTCAACCGCTTATTCCTGCGGCCGCCGCCCTTCTTCTGGTCTTCCGGCATGAAGAAGTTGCAACCCAGGCAGCCTTTAATCCTGCACACTTGACATGTGTCGAAATCGGAAGGAGGGAAAACGGAGCCGTTGGTGGGAGTGGAGAAAGACGAAGCCGAAGCACATTCCGAGATGGGCGAGAGGAGGTGAAGAAAATGGTCATGGCGAAATTCGAGTCCGGAGGAGGCGGCGCCGGAAACTACTTGGGTGAGGGCGTCGACGATGACGGAAACCTCCTGCTCCTCGGAGATGCGGCGGAAGGGGTGGCCGAGGAAGGCGGGGGGGTCGGAGGACATTTGCATGCCGGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGAGAGAAGGGGAGGACAGCTGGAACACCGCCGAGACACGGGTTGAAGAGGGGGGCACGTGACGAATAG

Protein sequence

MVESTLTYEECRRQRLEENKKRMEELNLNKLADALKASSPKSSPTKQLKRPRQPLDITSVRVRRSSRFADKPPPNYKEVPIEPLPGVRRIYQRRDLLNRVYASHEERRYAIDRAEELQSSLESRYPSFVKPMLQSHVTGGFWLGLPVQFCKTHLPHHDEMVTLVDEDANEFQTKYLAEKTGLSGGWRGFSLDHQLVDGDALVFQLTKPTEFKVYIIRAYNSEDKADTNEDSDVSQRESSGKRITRSASKGLFRSEMASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDYCPQGCSSPPTPQHPHHLLRTYLLIAVQIERPESKLHVPRLNSPAVPVNLHELAQVYVPIAVGVHRPEYLLHLFHRVPSSQRLQNPPVLVLRDPPVSVRVELVEDLLYLLRARLQRPSQAGALPVLERKQPGGLPEELLLGDVVLEVQEVQVRRHSFCWVALCFENFRLQIVILEKSVDTKSSIETIRSSLKTIKNEDHVSVIIRSMVATVTPLSSFIYSANPFTSLYHALISKTELENRAREFIVTEPTAKANAAIQEVGFKHTISSDIQVCKEILAVYCNIKDPPAKRTRRTVVLKPSLAGGSRSLRELCWHGMLPTGFRPRTGATAENLTKEVCLPNDAFVVSKAGSGNRAQRIGLEIRRVIYRILQKFQLQSRNEGARNRVLVPPQAHAHHLPHGDHHNLVIRRQPLGEILRRVDGAPFGPLHNLVEHRRLLPAVEPAIVIRRSHEVFPALHYGNRVAIFPGRLVGKIAQCGQYRHQVWIVELRGVPDGGEPLVGDVLQTADGYLGGRFAGPTIEKIACILVALWRRTAAKGRNRRRKTPAGGVTRKERGGVTAVGSGGGGCRRRRQRGTWVECCQRVVGEREIEFGSWAAEIDGGFIVGTGGVLRRVESAEPDPGRPLRVTDFGGPFPPRALPHASVILLQPLIPAAAALLLVFRHEEVATQAAFNPAHLTCVEIGRRENGAVGGSGERRSRSTFRDGREEVKKMVMAKFESGGGGAGNYLGEGVDDDGNLLLLGDAAEGVAEEGGGVGGHLHAGGERERERERERREKGRTAGTPPRHGLKRGARDE
Homology
BLAST of Sgr013451 vs. NCBI nr
Match: KAG6571981.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1181.4 bits (3055), Expect = 0.0e+00
Identity = 626/825 (75.88%), Positives = 668/825 (80.97%), Query Frame = 0

Query: 1   MVESTLTYEECRRQRLEENKKRMEELNLNKLADALKASSPKSSPTKQLKRPRQPLDITSV 60
           MVES LTYEECRRQRLEENKKRMEELNLNKLADALK+SSPKSSPTKQLKRPRQPLDI+S+
Sbjct: 48  MVESNLTYEECRRQRLEENKKRMEELNLNKLADALKSSSPKSSPTKQLKRPRQPLDISSL 107

Query: 61  RVRRSSRFADKPPPNYKEVPIEPLPGVRRIYQRRDLLNRVYASHEERRYAIDRAEELQSS 120
            VRRSSRFADKPPP+YKE PIEPL G+RR YQRRDLLNRVYAS  ER+YAIDRA +LQSS
Sbjct: 108 SVRRSSRFADKPPPSYKEEPIEPLAGLRRTYQRRDLLNRVYASDVERQYAIDRARDLQSS 167

Query: 121 LESRYPSFVKPMLQSHVTGGFWLGLPVQFCKTHLPHHDEMVTLVDEDANEFQTKYLAEKT 180
           LESRYPSFVKPMLQSHVTGGFWLGLPV FCK HLP  DEM+TLVDED NEFQTKYLAEKT
Sbjct: 168 LESRYPSFVKPMLQSHVTGGFWLGLPVHFCKAHLPLEDEMLTLVDEDENEFQTKYLAEKT 227

Query: 181 GLSGGWRGFSLDHQLVDGDALVFQLTKPTEFKVYIIRAYNSEDKADTNEDSDVSQRESSG 240
           GLSGGWRGFS+DHQLVDGD LVFQLTKPTEFKVYIIRAYN ED+ +T+EDSDV+Q ES+G
Sbjct: 228 GLSGGWRGFSIDHQLVDGDTLVFQLTKPTEFKVYIIRAYNLEDRENTHEDSDVTQLESNG 287

Query: 241 KRITRSASKGLFRSEMASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKAL 300
           KR T S   G    ++  LM    AR P+   S+ +V   L  +       Q E   K+ 
Sbjct: 288 KRNTGS---GQINLKIKQLM----ARIPV-SPSWKQVAKELQDQ-------QVEG--KSS 347

Query: 301 STSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNR 360
           S SA       F P                           +  + G  N  + S +   
Sbjct: 348 SRSAF------FSP---------------------------IRIKYGTINVLADSLY--- 407

Query: 361 DYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ 420
                      ++Y      S PNP      Q++     + P      NQG PQFGKP Q
Sbjct: 408 ----------SISYPQYPNPSQPNP------QNFNYQQQRAPNQWSNQNQGLPQFGKPGQ 467

Query: 421 RNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKAD 480
           RN Q ENS Q NNQ GIQ H AQN A NALVSPIDELRRFCGEGK+KEAVELLKEGVKAD
Sbjct: 468 RNLQAENSYQLNNQAGIQRHAAQNHAPNALVSPIDELRRFCGEGKLKEAVELLKEGVKAD 527

Query: 481 ADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFD 540
           ADCFH LFELCGKSKSF+NAK+VHDYFLQST RSDLQLNNKVLEMYGKCGSMSDA+RVFD
Sbjct: 528 ADCFHELFELCGKSKSFENAKVVHDYFLQSTCRSDLQLNNKVLEMYGKCGSMSDAQRVFD 587

Query: 541 HMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEE 600
           HMPDR+I+SWHLMIKGYADNG GDEGLELFENMKKLGLQP+S+TFLF+MSACASASAVEE
Sbjct: 588 HMPDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLQPDSQTFLFVMSACASASAVEE 647

Query: 601 GFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYA 660
           GFMYFESMKNDYHI P+MDHYL LLG+LGEPGHINEAFEYVEKLPMEPTVE+WETLKNYA
Sbjct: 648 GFMYFESMKNDYHINPNMDHYLGLLGILGEPGHINEAFEYVEKLPMEPTVEVWETLKNYA 707

Query: 661 RIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKD 720
           RIHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKD
Sbjct: 708 RIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRSAISMLDGKNRIVEFRNPTLYKD 767

Query: 721 DEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRII 780
           DEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRII
Sbjct: 768 DEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRII 803

Query: 781 KNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           KNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Sbjct: 828 KNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 803
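
The percentages in each HSP summary are the match counts divided by the alignment length (for the hit above, 626/825 and 668/825). As a minimal sketch, assuming the two summary lines are available as plain strings, they could be parsed and re-derived as follows. The helper name `parse_hsp` and the returned dictionary keys are hypothetical and not part of this database; the pattern accepts the label spelled either "Positives" or "Postives".

```python
import re

# Hypothetical helper, not part of the database page: it parses the two
# summary lines that precede each alignment block in this report.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_COUNTS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def parse_hsp(score_line, counts_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    stats = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, hits, length in HSP_COUNTS.findall(counts_line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matching columns / alignment length, to 2 decimals.
        stats[key + "_pct"] = round(100.0 * int(hits) / int(length), 2)
        stats["alignment_length"] = int(length)
    return stats


if __name__ == "__main__":
    hsp = parse_hsp(
        "HSP 1 Score: 1181.4 bits (3055), Expect = 0.0e+00",
        "Identity = 626/825 (75.88%), Postives = 668/825 (80.97%), Query Frame = 0",
    )
    print(hsp)
```

Recomputing the percentages from the counts, rather than reading the bracketed values, gives a quick consistency check on each report line.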

BLAST of Sgr013451 vs. NCBI nr
Match: XP_022136002.1 (pentatricopeptide repeat-containing protein At2g15690 [Momordica charantia])

HSP 1 Score: 1007.3 bits (2603), Expect = 1.5e-289
Identity = 504/603 (83.58%), Positives = 524/603 (86.90%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRRAR PIL SSFFKVR PLPS F+FSCGNQTET IKALSTSA+PNDYSNF P 
Sbjct: 1   MASLMAVRRARIPILASSFFKVRPPLPSHFSFSCGNQTETPIKALSTSAIPNDYSNFSPS 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ+P+SDPR LQGR TPGQWGTPSQVH   GNFNNQSFSEFQNRDYVQQGS  NQ+NYQ
Sbjct: 61  PQQNPASDPRFLQGRRTPGQWGTPSQVHPPSGNFNNQSFSEFQNRDYVQQGSAGNQMNYQ 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT---------------------------------KRP 435
           +QN+ SHPNPGF++QGQ YTQA                                   + P
Sbjct: 121 SQNRRSHPNPGFSQQGQGYTQAGNPNSWNPPNQSYPQNQNPSLPSLPNPQNFNYQQQRGP 180

Query: 436 KPMEQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCG 495
                 NQGYPQ G P QRNPQVEN NQ NNQ G+QGHGAQ QA NALV PIDELRR CG
Sbjct: 181 NQWNNQNQGYPQVGNPAQRNPQVENYNQLNNQGGVQGHGAQTQAPNALVPPIDELRRLCG 240

Query: 496 EGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKV 555
           +GKIKEAVELLKEGVKADADCFHV+FELCGKSKSFDNAKIVHDYFLQST R DLQLNNKV
Sbjct: 241 DGKIKEAVELLKEGVKADADCFHVMFELCGKSKSFDNAKIVHDYFLQSTCRGDLQLNNKV 300

Query: 556 LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNS 615
           LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNG GDEGLELFENMKKLGLQPNS
Sbjct: 301 LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGLGDEGLELFENMKKLGLQPNS 360

Query: 616 RTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVE 675
           +TFL++MSACAS SAVEEGFMYFESMKNDYHI P+MDHYL LLG+LGEPGHINEAFEYVE
Sbjct: 361 QTFLYVMSACASVSAVEEGFMYFESMKNDYHIVPEMDHYLGLLGILGEPGHINEAFEYVE 420

Query: 676 KLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISM 735
           KLPMEPTVE+WETLKNYARIHG+VDLEDYAEELIVALDPTKA  NKI TPPPKKRSAISM
Sbjct: 421 KLPMEPTVEVWETLKNYARIHGNVDLEDYAEELIVALDPTKAPPNKIPTPPPKKRSAISM 480

Query: 736 LDGKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL 795
           LDGKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL
Sbjct: 481 LDGKNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL 540

Query: 796 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC 826
           AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC
Sbjct: 541 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC 600

BLAST of Sgr013451 vs. NCBI nr
Match: XP_022952757.1 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 985.7 bits (2547), Expect = 4.7e-283
Identity = 499/600 (83.17%), Positives = 519/600 (86.50%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRR R+PI ISSF KVRSPLPS FTFSCGN+TETLIKALSTSA P+D+SNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ  SSDPR LQ     GQWG+PSQVH   GNFNNQSFSEFQNRDYVQQGSPSNQ+NY+
Sbjct: 61  PQQPSSSDPRYLQ-----GQWGSPSQVHRPSGNFNNQSFSEFQNRDYVQQGSPSNQMNYR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           +QNQSS+PNPGF RQGQSYTQ                                 + P   
Sbjct: 121 SQNQSSYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQG PQFGKP QRN Q ENS Q NNQ GIQGHGAQN   NALVSPIDELRRFCGEGK
Sbjct: 181 SNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHTPNALVSPIDELRRFCGEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           +KEAVELLKEGVKADADCFH  FELCGKSKSF+NAK+VHDYFLQST RSDLQLNNKVLEM
Sbjct: 241 LKEAVELLKEGVKADADCFHEFFELCGKSKSFENAKVVHDYFLQSTCRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGSMSDA+RVFDHM DR+I+SWHLMIKGYADNG GDEGLELFENMKKLGL PNS+TF
Sbjct: 301 YGKCGSMSDAQRVFDHMLDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLHPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           LF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LLG+LGEPGHINEAFEYVEKLP
Sbjct: 361 LFVMSACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDG 735
           MEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKR AISMLDG
Sbjct: 421 MEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRFAISMLDG 480

Query: 736 KNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 795
           KNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA
Sbjct: 481 KNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 540

Query: 796 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Sbjct: 541 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 595

BLAST of Sgr013451 vs. NCBI nr
Match: XP_022972422.1 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 983.0 bits (2540), Expect = 3.0e-282
Identity = 500/600 (83.33%), Positives = 520/600 (86.67%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRR R+PI ISSF KVRSPLPS FTFSCGNQTETLIKALSTSA P+D+SNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNQTETLIKALSTSAFPDDFSNFPTP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ  SS PR LQ     GQ G+PSQVH   GNFNNQSFSEFQNRDYVQ GSPSNQ+N +
Sbjct: 61  PQQPSSSHPRYLQ-----GQRGSPSQVHRPSGNFNNQSFSEFQNRDYVQLGSPSNQMNNR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           +QNQSS+PNPGF RQGQSYTQ                                 + P   
Sbjct: 121 SQNQSSYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQG PQFGKP QRN Q ENS Q NNQ GIQGHGAQN A NALVSPIDELRRFCGEGK
Sbjct: 181 SNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRRFCGEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           +KEAVELLKEGVKADADCFH LFELCGKSKSFDNAK+VHDYFLQST RSDLQLNNKVLEM
Sbjct: 241 LKEAVELLKEGVKADADCFHELFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGSMSDA+RVFDHMPDR+I+SWHLMIKGYADNG GDEGLELFENMKKLGLQPNS+TF
Sbjct: 301 YGKCGSMSDAQRVFDHMPDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLQPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           LF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LLG+LGEPGHINEAFEYVEKLP
Sbjct: 361 LFVMSACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDG 735
           +EPTVE+WETLKNYA+IHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKRSAISMLDG
Sbjct: 421 IEPTVEVWETLKNYAKIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRSAISMLDG 480

Query: 736 KNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 795
           KNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA
Sbjct: 481 KNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 540

Query: 796 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Sbjct: 541 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 595

BLAST of Sgr013451 vs. NCBI nr
Match: XP_023530084.1 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 979.5 bits (2531), Expect = 3.4e-281
Identity = 494/601 (82.20%), Positives = 522/601 (86.86%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRRAR+PILIS FFKVRS LPSRFTFSCGNQTETLIKAL TSA PND+SNFPPP
Sbjct: 1   MASLMAVRRARTPILISFFFKVRSSLPSRFTFSCGNQTETLIKALCTSATPNDFSNFPPP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ  SSDPR LQ     GQWG+P+QVH R GNFNNQSFSEFQNRDYV QGS +NQ+NY+
Sbjct: 61  PQQRSSSDPRFLQ-----GQWGSPNQVHSRSGNFNNQSFSEFQNRDYVPQGSSNNQMNYR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           ++NQSSHPNPGF+RQGQ Y+QA                                + P   
Sbjct: 121 SENQSSHPNPGFSRQGQGYSQAGNPNSWNPPNQSYPQYQNPPQPNAQHFNYQQQRSPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQGYPQFG+P Q NPQVENS Q NNQ G+QGHG+QNQA N  VSP DELRRFC EGK
Sbjct: 181 NNQNQGYPQFGRPGQGNPQVENSYQLNNQAGVQGHGSQNQALNPSVSPTDELRRFCEEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           IKEAVELLKEGVKADADCF VLF+LCGKS SFDNAK+VHDYFLQSTYRSDLQLNNKVLEM
Sbjct: 241 IKEAVELLKEGVKADADCFLVLFDLCGKSNSFDNAKVVHDYFLQSTYRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYA+NG GDEGLELFE+MKKLGLQPNS+TF
Sbjct: 301 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYAENGLGDEGLELFESMKKLGLQPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           L++M ACASASA+EEGFMYFESMK+DY I PD+DHYL LLGVLGEPGH+NEAFEYVEKLP
Sbjct: 361 LYVMRACASASAIEEGFMYFESMKSDYQINPDLDHYLGLLGVLGEPGHMNEAFEYVEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLD 735
           MEPTVEIWETLKNYARIHGDVDLEDYAEELIV L PTKAAS+ KI TPPPK+RSAISMLD
Sbjct: 421 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVDLGPTKAASSKKIPTPPPKRRSAISMLD 480

Query: 736 GKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 795
           GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI
Sbjct: 481 GKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 540

Query: 796 AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 826
           AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
Sbjct: 541 AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 596

BLAST of Sgr013451 vs. ExPASy Swiss-Prot
Match: Q9ZQE5 (Pentatricopeptide repeat-containing protein At2g15690, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H66 PE=1 SV=2)

HSP 1 Score: 506.9 bits (1304), Expect = 8.4e-142
Identity = 290/592 (48.99%), Positives = 371/592 (62.67%), Query Frame = 0

Query: 256 MASLMTVRRARSP--ILISSFFKVRSPLP---SRFTFSCGNQTETLIKALSTSAVPNDYS 315
           M+SLM +R AR+   + I S  ++RS  P   S+F FS G      IK LSTSA  NDY 
Sbjct: 1   MSSLMAIRCARTQNIVTIGSLLQLRSSFPRLSSQFHFS-GTLNSIPIKHLSTSAAANDYH 60

Query: 316 NFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEF-----QNRDYVQQ 375
             P          P   Q  ++  Q  T  +V      ++ Q   +      QN  +  Q
Sbjct: 61  QNPQSGSPSQHQRPYPPQSFDSQNQTNTNQRVPQSPNQWSTQHGGQIPQYGGQNPQHGGQ 120

Query: 376 GSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ--RNP 435
             P    N Q   Q S       + G    Q    RP    Q     PQ+G P    +N 
Sbjct: 121 RPPYGGQNPQQGGQMS-------QYGGHNPQHGGHRP----QYGGQRPQYGGPGNNYQNQ 180

Query: 436 QVENSNQ-----PNNQVGIQGHGAQNQASNAL--VSP---IDELRRFCGEGKIKEAVELL 495
            V+ SNQ     P  Q   Q   + NQ+ N +  V+P   ++E+ R C     K+A+ELL
Sbjct: 181 NVQQSNQSQYYTPQQQQQPQPPRSSNQSPNQMNEVAPPPSVEEVMRLCQRRLYKDAIELL 240

Query: 496 KEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMS 555
            +G   D +CF +LFE C   KS +++K VHD+FLQS +R D +LNN V+ M+G+C S++
Sbjct: 241 DKGAMPDRECFVLLFESCANLKSLEHSKKVHDHFLQSKFRGDPKLNNMVISMFGECSSIT 300

Query: 556 DARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACA 615
           DA+RVFDHM D+++DSWHLM+  Y+DNG GD+ L LFE M K GL+PN  TFL +  ACA
Sbjct: 301 DAKRVFDHMVDKDMDSWHLMMCAYSDNGMGDDALHLFEEMTKHGLKPNEETFLTVFLACA 360

Query: 616 SASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIW 675
           +   +EE F++F+SMKN++ I+P  +HYL +LGVLG+ GH+ EA +Y+  LP EPT + W
Sbjct: 361 TVGGIEEAFLHFDSMKNEHGISPKTEHYLGVLGVLGKCGHLVEAEQYIRDLPFEPTADFW 420

Query: 676 ETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFR 735
           E ++NYAR+HGD+DLEDY EEL+V +DP+KA  NKI TPPPK     +M+  K+RI EFR
Sbjct: 421 EAMRNYARLHGDIDLEDYMEELMVDVDPSKAVINKIPTPPPKSFKETNMVTSKSRILEFR 480

Query: 736 NPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPA 795
           N T YKD+ K  A K  K   YVPDTR+VLHDIDQEAKEQALLYHSERLAIAYG+I TP 
Sbjct: 481 NLTFYKDEAKEMAAK--KGVVYVPDTRFVLHDIDQEAKEQALLYHSERLAIAYGIICTPP 540

Query: 796 RTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           R  L IIKNLR+CGDCHN IKIMS+I+GR LIVRDNKRFHHFKDGKCSCGDY
Sbjct: 541 RKTLTIIKNLRVCGDCHNFIKIMSKIIGRVLIVRDNKRFHHFKDGKCSCGDY 578
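
The Swiss-Prot and TrEMBL match lines carry UniProt-style header fields after the protein name: `OS` (organism), `OX` (taxonomy identifier), `GN` (gene name), `PE` (protein existence level) and `SV` (sequence version). A minimal sketch of splitting such a line into its parts is shown below, assuming the line has the `Match: ACCESSION (description)` layout used on this page. The function name `parse_match` is hypothetical.

```python
import re

# Field tags that follow the protein name in a UniProt-style description.
FIELD = re.compile(r"\s(OS|OX|GN|PE|SV)=")


def parse_match(line):
    """Split a 'Match: ACC (description)' line into accession, name, fields."""
    m = re.match(r"Match:\s*(\S+)\s*\((.*)\)\s*$", line)
    if m is None:
        raise ValueError("not a match line: %r" % line)
    accession, description = m.groups()
    # re.split with a capturing group yields [name, tag, value, tag, value, ...].
    parts = FIELD.split(description)
    record = {"accession": accession, "name": parts[0].strip()}
    for tag, value in zip(parts[1::2], parts[2::2]):
        record[tag] = value.strip()
    return record


if __name__ == "__main__":
    rec = parse_match(
        "Match: Q9ZQE5 (Pentatricopeptide repeat-containing protein At2g15690, "
        "mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H66 PE=1 SV=2)"
    )
    print(rec)
```

NCBI nr match lines have no such tags, so for those the whole bracketed description ends up in `name`.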

BLAST of Sgr013451 vs. ExPASy Swiss-Prot
Match: Q9SUU7 (Pentatricopeptide repeat-containing protein At4g32450, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H63 PE=2 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 9.4e-85
Identity = 191/528 (36.17%), Positives = 282/528 (53.41%), Query Frame = 0

Query: 329 GRETPGQWGTPSQVHHRVG-----NFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHP 388
           G E P          H +G     N   QS   FQ   Y Q  +P +  N         P
Sbjct: 34  GFENPTNGNPMDNSSHHIGYVNGFNGGEQSLGGFQQNSYEQSLNPVSGQN---------P 93

Query: 389 NPGFARQGQSYTQAATKRPKPMEQSNQ------GYPQFGKPPQRNPQVENSNQPNNQVGI 448
              F + G +  Q+  +  + + Q NQ      G   +G      PQ  N+   + Q   
Sbjct: 94  TNRFYQNGYNRNQSYGEHSEIINQRNQNWQSSDGCSSYGTTGNGVPQENNTGGNHFQQDH 153

Query: 449 QGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLK----EGVKADADCFHVLFELCGK 508
            GH           S +DEL   C EGK+K+AVE++K    EG   D      + +LCG 
Sbjct: 154 SGH-----------SSLDELDSICREGKVKKAVEIIKSWRNEGYVVDLPRLFWIAQLCGD 213

Query: 509 SKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLM 568
           +++   AK+VH++   S   SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +
Sbjct: 214 AQALQEAKVVHEFITSSVGISDISAYNSIIEMYSGCGSVEDALTVFNSMPERNLETWCGV 273

Query: 569 IKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYH 628
           I+ +A NG G++ ++ F   K+ G +P+   F  I  AC     + EG ++FESM  +Y 
Sbjct: 274 IRCFAKNGQGEDAIDTFSRFKQEGNKPDGEMFKEIFFACGVLGDMNEGLLHFESMYKEYG 333

Query: 629 ITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAE 688
           I P M+HY++L+ +L EPG+++EA  +VE   MEP V++WETL N +R+HGD+ L D  +
Sbjct: 334 IIPCMEHYVSLVKMLAEPGYLDEALRFVES--MEPNVDLWETLMNLSRVHGDLILGDRCQ 393

Query: 689 ELIVALDPTKAASNKITTPPPKKRSAI------SMLDGKN------RIGEFRNPTLYKDD 748
           +++  LD ++          P K S +       M  G N        G+   P   ++ 
Sbjct: 394 DMVEQLDASRLNKESKAGLVPVKSSDLVKEKLQRMAKGPNYGIRYMAAGDISRP---ENR 453

Query: 749 EKLKALKAMKEQ----GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPL 808
           E   ALK++KE     GYVP ++  LHD+DQE+K++ L  H+ER A     + TPAR+ +
Sbjct: 454 ELYMALKSLKEHMIEIGYVPLSKLALHDVDQESKDENLFNHNERFAFISTFLDTPARSLI 513

Query: 809 RIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           R++KNLR+C DCHNA+K+MS+IVGRELI RD KRFHH KDG CSC +Y
Sbjct: 514 RVMKNLRVCADCHNALKLMSKIVGRELISRDAKRFHHMKDGVCSCREY 536

BLAST of Sgr013451 vs. ExPASy Swiss-Prot
Match: Q680H3 (Pentatricopeptide repeat-containing protein At2g25580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H75 PE=2 SV=2)

HSP 1 Score: 311.6 bits (797), Expect = 5.2e-83
Identity = 162/394 (41.12%), Positives = 241/394 (61.17%), Query Frame = 0

Query: 454 IDELRRFCGEGKIKEAVE----LLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQ 513
           I+E   FC  GK+K+A+     L       D      L ++CG+++    AK VH     
Sbjct: 223 IEEYDAFCKHGKVKKALYTIDILASMNYVVDLSRLLRLAKICGEAEGLQEAKTVHGKISA 282

Query: 514 STYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLEL 573
           S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++W ++I+ +A NGFG++ +++
Sbjct: 283 SVSHLDLSSNHVLLEMYSNCGLANEAASVFEKMSEKNLETWCIIIRCFAKNGFGEDAIDM 342

Query: 574 FENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLG 633
           F   K+ G  P+ + F  I  AC     V+EG ++FESM  DY I P ++ Y++L+ +  
Sbjct: 343 FSRFKEEGNIPDGQLFRGIFYACGMLGDVDEGLLHFESMSRDYGIAPSIEDYVSLVEMYA 402

Query: 634 EPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTK------ 693
            PG ++EA E+VE++PMEP V++WETL N +R+HG+++L DY  E++  LDPT+      
Sbjct: 403 LPGFLDEALEFVERMPMEPNVDVWETLMNLSRVHGNLELGDYCAEVVEFLDPTRLNKQSR 462

Query: 694 -----AASNKITTPPPKKRSAISMLDG-KNRIGEFR--NPTLYKDDEKLKALKAMK---- 753
                  ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Sbjct: 463 EGFIPVKASDVEKESLKKRSGI--LHGVKSSMQEFRAGDTNLPENDELFQLLRNLKMHMV 522

Query: 754 EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHN 813
           E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHN
Sbjct: 523 EVGYVAETRMALHDIDQESKETLLLGHSERIAFARAVLNSAPRKPFTVIKNLRVCVDCHN 582

Query: 814 AIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           A+KIMS IVGRE+I RD KRFH  K+G C+C DY
Sbjct: 583 ALKIMSDIVGREVITRDIKRFHQMKNGACTCKDY 614

BLAST of Sgr013451 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 287.3 bits (734), Expect = 1.0e-75
Identity = 148/405 (36.54%), Positives = 234/405 (57.78%), Query Frame = 0

Query: 448 NALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYF 507
           NAL++     RR   E  ++    +L++G +     +  LF  C  +   +  K VH Y 
Sbjct: 231 NALIA--GHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYM 290

Query: 508 LQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGL 567
           ++S  +      N +L+MY K GS+ DAR++FD +  R++ SW+ ++  YA +GFG E +
Sbjct: 291 IKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAV 350

Query: 568 ELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGV 627
             FE M+++G++PN  +FL +++AC+ +  ++EG+ Y+E MK D  I P+  HY+ ++ +
Sbjct: 351 WWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPEAWHYVTVVDL 410

Query: 628 LGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDP------ 687
           LG  G +N A  ++E++P+EPT  IW+ L N  R+H + +L  YA E +  LDP      
Sbjct: 411 LGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPH 470

Query: 688 ---------------TKAASNKITTPPPKKRSAISMLDGKNRIGEF-RNPTLYKDDEKL- 747
                                K+     KK  A S ++ +N I  F  N   +   E++ 
Sbjct: 471 VILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIA 530

Query: 748 ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRII 807
               + L  +KE GYVPDT +V+  +DQ+ +E  L YHSE++A+A+ L++TP  + + I 
Sbjct: 531 RKWEEVLAKIKELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIK 590

Query: 808 KNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           KN+R+CGDCH AIK+ S++VGRE+IVRD  RFHHFKDG CSC DY
Sbjct: 591 KNIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDGNCSCKDY 632

BLAST of Sgr013451 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 282.7 bits (722), Expect = 2.6e-74
Identity = 148/398 (37.19%), Positives = 225/398 (56.53%), Query Frame = 0

Query: 460 FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSD 519
           F   GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +
Sbjct: 197 FAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRN 256

Query: 520 LQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKK 579
           L  +N +L++Y +CG + +A+ +FD M D+N  SW  +I G A NGFG E +ELF+ M+ 
Sbjct: 257 LHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMES 316

Query: 580 L-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHI 639
             GL P   TF+ I+ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G +
Sbjct: 317 TEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQV 376

Query: 640 NEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAAS-------- 699
            +A+EY++ +PM+P V IW TL     +HGD DL ++A   I+ L+P  +          
Sbjct: 377 KKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMY 436

Query: 700 -------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK 759
                         ++     KK    S+++  NR+ EF      +P       KLK + 
Sbjct: 437 ASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMT 496

Query: 760 A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICG 819
             ++ +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C 
Sbjct: 497 GRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 556

Query: 820 DCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           DCH AIK++S++  RE++VRD  RFHHFK+G CSC DY
Sbjct: 557 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDY 594

BLAST of Sgr013451 vs. ExPASy TrEMBL
Match: A0A6J1C4C3 (pentatricopeptide repeat-containing protein At2g15690 OS=Momordica charantia OX=3673 GN=LOC111007809 PE=3 SV=1)

HSP 1 Score: 1007.3 bits (2603), Expect = 7.3e-290
Identity = 504/603 (83.58%), Positives = 524/603 (86.90%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRRAR PIL SSFFKVR PLPS F+FSCGNQTET IKALSTSA+PNDYSNF P 
Sbjct: 1   MASLMAVRRARIPILASSFFKVRPPLPSHFSFSCGNQTETPIKALSTSAIPNDYSNFSPS 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ+P+SDPR LQGR TPGQWGTPSQVH   GNFNNQSFSEFQNRDYVQQGS  NQ+NYQ
Sbjct: 61  PQQNPASDPRFLQGRRTPGQWGTPSQVHPPSGNFNNQSFSEFQNRDYVQQGSAGNQMNYQ 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT---------------------------------KRP 435
           +QN+ SHPNPGF++QGQ YTQA                                   + P
Sbjct: 121 SQNRRSHPNPGFSQQGQGYTQAGNPNSWNPPNQSYPQNQNPSLPSLPNPQNFNYQQQRGP 180

Query: 436 KPMEQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCG 495
                 NQGYPQ G P QRNPQVEN NQ NNQ G+QGHGAQ QA NALV PIDELRR CG
Sbjct: 181 NQWNNQNQGYPQVGNPAQRNPQVENYNQLNNQGGVQGHGAQTQAPNALVPPIDELRRLCG 240

Query: 496 EGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKV 555
           +GKIKEAVELLKEGVKADADCFHV+FELCGKSKSFDNAKIVHDYFLQST R DLQLNNKV
Sbjct: 241 DGKIKEAVELLKEGVKADADCFHVMFELCGKSKSFDNAKIVHDYFLQSTCRGDLQLNNKV 300

Query: 556 LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNS 615
           LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNG GDEGLELFENMKKLGLQPNS
Sbjct: 301 LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGLGDEGLELFENMKKLGLQPNS 360

Query: 616 RTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVE 675
           +TFL++MSACAS SAVEEGFMYFESMKNDYHI P+MDHYL LLG+LGEPGHINEAFEYVE
Sbjct: 361 QTFLYVMSACASVSAVEEGFMYFESMKNDYHIVPEMDHYLGLLGILGEPGHINEAFEYVE 420

Query: 676 KLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISM 735
           KLPMEPTVE+WETLKNYARIHG+VDLEDYAEELIVALDPTKA  NKI TPPPKKRSAISM
Sbjct: 421 KLPMEPTVEVWETLKNYARIHGNVDLEDYAEELIVALDPTKAPPNKIPTPPPKKRSAISM 480

Query: 736 LDGKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL 795
           LDGKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL
Sbjct: 481 LDGKNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERL 540

Query: 796 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC 826
           AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC
Sbjct: 541 AIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC 600

BLAST of Sgr013451 vs. ExPASy TrEMBL
Match: A0A6J1GL94 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111455360 PE=3 SV=1)

HSP 1 Score: 985.7 bits (2547), Expect = 2.3e-283
Identity = 499/600 (83.17%), Positives = 519/600 (86.50%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRR R+PI ISSF KVRSPLPS FTFSCGN+TETLIKALSTSA P+D+SNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNRTETLIKALSTSAFPDDFSNFPTP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ  SSDPR LQ     GQWG+PSQVH   GNFNNQSFSEFQNRDYVQQGSPSNQ+NY+
Sbjct: 61  PQQPSSSDPRYLQ-----GQWGSPSQVHRPSGNFNNQSFSEFQNRDYVQQGSPSNQMNYR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           +QNQSS+PNPGF RQGQSYTQ                                 + P   
Sbjct: 121 SQNQSSYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQG PQFGKP QRN Q ENS Q NNQ GIQGHGAQN   NALVSPIDELRRFCGEGK
Sbjct: 181 SNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHTPNALVSPIDELRRFCGEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           +KEAVELLKEGVKADADCFH  FELCGKSKSF+NAK+VHDYFLQST RSDLQLNNKVLEM
Sbjct: 241 LKEAVELLKEGVKADADCFHEFFELCGKSKSFENAKVVHDYFLQSTCRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGSMSDA+RVFDHM DR+I+SWHLMIKGYADNG GDEGLELFENMKKLGL PNS+TF
Sbjct: 301 YGKCGSMSDAQRVFDHMLDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLHPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           LF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LLG+LGEPGHINEAFEYVEKLP
Sbjct: 361 LFVMSACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDG 735
           MEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKR AISMLDG
Sbjct: 421 MEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRFAISMLDG 480

Query: 736 KNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 795
           KNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA
Sbjct: 481 KNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 540

Query: 796 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Sbjct: 541 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 595

BLAST of Sgr013451 vs. ExPASy TrEMBL
Match: A0A6J1I4R8 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111470986 PE=3 SV=1)

HSP 1 Score: 983.0 bits (2540), Expect = 1.5e-282
Identity = 500/600 (83.33%), Positives = 520/600 (86.67%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRR R+PI ISSF KVRSPLPS FTFSCGNQTETLIKALSTSA P+D+SNFP P
Sbjct: 1   MASLMAVRRVRTPIHISSFIKVRSPLPSTFTFSCGNQTETLIKALSTSAFPDDFSNFPTP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQ  SS PR LQ     GQ G+PSQVH   GNFNNQSFSEFQNRDYVQ GSPSNQ+N +
Sbjct: 61  PQQPSSSHPRYLQ-----GQRGSPSQVHRPSGNFNNQSFSEFQNRDYVQLGSPSNQMNNR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           +QNQSS+PNPGF RQGQSYTQ                                 + P   
Sbjct: 121 SQNQSSYPNPGFPRQGQSYTQGGNPNSWNPPNQSYPQYQNPSQPNPQNFNYQQQRAPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQG PQFGKP QRN Q ENS Q NNQ GIQGHGAQN A NALVSPIDELRRFCGEGK
Sbjct: 181 SNQNQGLPQFGKPGQRNLQAENSYQLNNQAGIQGHGAQNHAPNALVSPIDELRRFCGEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           +KEAVELLKEGVKADADCFH LFELCGKSKSFDNAK+VHDYFLQST RSDLQLNNKVLEM
Sbjct: 241 LKEAVELLKEGVKADADCFHELFELCGKSKSFDNAKVVHDYFLQSTCRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGSMSDA+RVFDHMPDR+I+SWHLMIKGYADNG GDEGLELFENMKKLGLQPNS+TF
Sbjct: 301 YGKCGSMSDAQRVFDHMPDRSIESWHLMIKGYADNGLGDEGLELFENMKKLGLQPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           LF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LLG+LGEPGHINEAFEYVEKLP
Sbjct: 361 LFVMSACASASAVEEGFMYFESMKNDYHINPDMDHYLGLLGILGEPGHINEAFEYVEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDG 735
           +EPTVE+WETLKNYA+IHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKRSAISMLDG
Sbjct: 421 IEPTVEVWETLKNYAKIHGDVDLEDYAEELIVDLDPTKAASNKIPTPPPKKRSAISMLDG 480

Query: 736 KNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 795
           KNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA
Sbjct: 481 KNRIVEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIA 540

Query: 796 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Sbjct: 541 YGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 595

BLAST of Sgr013451 vs. ExPASy TrEMBL
Match: A0A6J1JDU2 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485971 PE=3 SV=1)

HSP 1 Score: 967.2 bits (2499), Expect = 8.4e-278
Identity = 489/601 (81.36%), Positives = 517/601 (86.02%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRRAR+PILIS FFKVRS LPSRFTFSCGNQTETLIKAL TSA PND+SNFPPP
Sbjct: 1   MASLMAVRRARTPILISFFFKVRSSLPSRFTFSCGNQTETLIKALCTSATPNDFSNFPPP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQH SSDPR LQ      QWG+PSQVH R GNFNNQSFSEF NRDYV QGS +NQ+NY+
Sbjct: 61  PQQHSSSDPRFLQ-----AQWGSPSQVHSRSGNFNNQSFSEFHNRDYVPQGSSNNQINYR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           ++NQSSHPNPG +RQGQ Y+ A                                + P   
Sbjct: 121 SENQSSHPNPGVSRQGQGYSHAGNPNSWSPLNQSYPQYQNPPQPNAQHFNYQQQRGPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQGYPQFG+P Q  PQVENS Q NNQ G+QGHG+QNQA N  VS  DELRRFC EGK
Sbjct: 181 NNQNQGYPQFGRPGQGKPQVENSYQLNNQAGVQGHGSQNQALNPSVSLTDELRRFCEEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           IKEAVELLKEGVKADADCF VLF+LCGKS SFDNAK+VHDYFLQSTYRSDLQLNNKVLEM
Sbjct: 241 IKEAVELLKEGVKADADCFLVLFDLCGKSNSFDNAKVVHDYFLQSTYRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGS+SDARRVFDHMPDRNIDSWHLMI+GYA+NG GDEGLELFE+MKKLGLQPNS+TF
Sbjct: 301 YGKCGSISDARRVFDHMPDRNIDSWHLMIEGYAENGLGDEGLELFESMKKLGLQPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           LF+M ACASASA+EEGFMYFESMKNDY I PD+DHYL LLGVLGEPGH+NEAFEYVEKLP
Sbjct: 361 LFVMRACASASAIEEGFMYFESMKNDYQINPDLDHYLGLLGVLGEPGHMNEAFEYVEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLD 735
           MEPTVE+WETLKNYARIHGDVDLED AEELIV LDPTKAAS+ KI TPPPK+RSAISMLD
Sbjct: 421 MEPTVEVWETLKNYARIHGDVDLEDCAEELIVDLDPTKAASSKKIPTPPPKRRSAISMLD 480

Query: 736 GKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 795
           GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI
Sbjct: 481 GKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 540

Query: 796 AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 826
           AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
Sbjct: 541 AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 596

BLAST of Sgr013451 vs. ExPASy TrEMBL
Match: A0A6J1EIR6 (pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111434643 PE=3 SV=1)

HSP 1 Score: 952.6 bits (2461), Expect = 2.1e-273
Identity = 486/601 (80.87%), Positives = 512/601 (85.19%), Query Frame = 0

Query: 256 MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPP 315
           MASLM VRRAR+PILIS F KVRS LPSRF FSCGNQTETLIKAL TSA PND+SNFPPP
Sbjct: 1   MASLMAVRRARTPILISFFLKVRSSLPSRFIFSCGNQTETLIKALCTSATPNDFSNFPPP 60

Query: 316 PQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQ 375
           PQQH SSDPR LQ     GQWG+P+QVH R GNFNNQSFSEFQNRDYVQQGS +NQ+NY+
Sbjct: 61  PQQHSSSDPRFLQ-----GQWGSPNQVHSRSGNFNNQSFSEFQNRDYVQQGSSNNQMNYR 120

Query: 376 NQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPM 435
           ++NQSSHPNPGF+RQGQ Y+QA                                + P   
Sbjct: 121 SENQSSHPNPGFSRQGQGYSQAGNPNSWNPPNQSYPQYQNPPQPNAQHFNYQQQRGPNQW 180

Query: 436 EQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGK 495
              NQGYPQFG+P Q NPQVENS Q N           NQA N  VSPIDELRRFC EGK
Sbjct: 181 NNQNQGYPQFGRPGQGNPQVENSYQLN-----------NQAPNPSVSPIDELRRFCEEGK 240

Query: 496 IKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM 555
           IKEAVELLKEGVKADADCF VLF+LCGKS SFDNAK+VHDYFLQST RSDLQLNNKVLEM
Sbjct: 241 IKEAVELLKEGVKADADCFLVLFDLCGKSNSFDNAKVVHDYFLQSTCRSDLQLNNKVLEM 300

Query: 556 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTF 615
           YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYA+NG GDEGLELFE+M+KLGLQPNS+TF
Sbjct: 301 YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYAENGLGDEGLELFESMQKLGLQPNSQTF 360

Query: 616 LFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLP 675
           LF+M ACASASA+EEGFMYFESMKNDY I PD+DHYL LLGVLGEPGH+NEAFEY EKLP
Sbjct: 361 LFVMRACASASAIEEGFMYFESMKNDYQINPDLDHYLGLLGVLGEPGHMNEAFEYGEKLP 420

Query: 676 MEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLD 735
           MEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAAS+ KI TPPPK+RSAISMLD
Sbjct: 421 MEPTVEVWETLKNYARIHGDVDLEDYAEELIVDLDPTKAASSKKIPTPPPKRRSAISMLD 480

Query: 736 GKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 795
           GKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI
Sbjct: 481 GKNRISEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAI 540

Query: 796 AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 826
           AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
Sbjct: 541 AYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD 585

BLAST of Sgr013451 vs. TAIR 10
Match: AT2G15690.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 506.9 bits (1304), Expect = 6.0e-143
Identity = 290/592 (48.99%), Positives = 371/592 (62.67%), Query Frame = 0

Query: 256 MASLMTVRRARSP--ILISSFFKVRSPLP---SRFTFSCGNQTETLIKALSTSAVPNDYS 315
           M+SLM +R AR+   + I S  ++RS  P   S+F FS G      IK LSTSA  NDY 
Sbjct: 1   MSSLMAIRCARTQNIVTIGSLLQLRSSFPRLSSQFHFS-GTLNSIPIKHLSTSAAANDYH 60

Query: 316 NFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEF-----QNRDYVQQ 375
             P          P   Q  ++  Q  T  +V      ++ Q   +      QN  +  Q
Sbjct: 61  QNPQSGSPSQHQRPYPPQSFDSQNQTNTNQRVPQSPNQWSTQHGGQIPQYGGQNPQHGGQ 120

Query: 376 GSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ--RNP 435
             P    N Q   Q S       + G    Q    RP    Q     PQ+G P    +N 
Sbjct: 121 RPPYGGQNPQQGGQMS-------QYGGHNPQHGGHRP----QYGGQRPQYGGPGNNYQNQ 180

Query: 436 QVENSNQ-----PNNQVGIQGHGAQNQASNAL--VSP---IDELRRFCGEGKIKEAVELL 495
            V+ SNQ     P  Q   Q   + NQ+ N +  V+P   ++E+ R C     K+A+ELL
Sbjct: 181 NVQQSNQSQYYTPQQQQQPQPPRSSNQSPNQMNEVAPPPSVEEVMRLCQRRLYKDAIELL 240

Query: 496 KEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMS 555
            +G   D +CF +LFE C   KS +++K VHD+FLQS +R D +LNN V+ M+G+C S++
Sbjct: 241 DKGAMPDRECFVLLFESCANLKSLEHSKKVHDHFLQSKFRGDPKLNNMVISMFGECSSIT 300

Query: 556 DARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACA 615
           DA+RVFDHM D+++DSWHLM+  Y+DNG GD+ L LFE M K GL+PN  TFL +  ACA
Sbjct: 301 DAKRVFDHMVDKDMDSWHLMMCAYSDNGMGDDALHLFEEMTKHGLKPNEETFLTVFLACA 360

Query: 616 SASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIW 675
           +   +EE F++F+SMKN++ I+P  +HYL +LGVLG+ GH+ EA +Y+  LP EPT + W
Sbjct: 361 TVGGIEEAFLHFDSMKNEHGISPKTEHYLGVLGVLGKCGHLVEAEQYIRDLPFEPTADFW 420

Query: 676 ETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFR 735
           E ++NYAR+HGD+DLEDY EEL+V +DP+KA  NKI TPPPK     +M+  K+RI EFR
Sbjct: 421 EAMRNYARLHGDIDLEDYMEELMVDVDPSKAVINKIPTPPPKSFKETNMVTSKSRILEFR 480

Query: 736 NPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPA 795
           N T YKD+ K  A K  K   YVPDTR+VLHDIDQEAKEQALLYHSERLAIAYG+I TP 
Sbjct: 481 NLTFYKDEAKEMAAK--KGVVYVPDTRFVLHDIDQEAKEQALLYHSERLAIAYGIICTPP 540

Query: 796 RTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           R  L IIKNLR+CGDCHN IKIMS+I+GR LIVRDNKRFHHFKDGKCSCGDY
Sbjct: 541 RKTLTIIKNLRVCGDCHNFIKIMSKIIGRVLIVRDNKRFHHFKDGKCSCGDY 578

BLAST of Sgr013451 vs. TAIR 10
Match: AT4G32450.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 317.4 bits (812), Expect = 6.7e-86
Identity = 191/528 (36.17%), Positives = 282/528 (53.41%), Query Frame = 0

Query: 329 GRETPGQWGTPSQVHHRVG-----NFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHP 388
           G E P          H +G     N   QS   FQ   Y Q  +P +  N         P
Sbjct: 34  GFENPTNGNPMDNSSHHIGYVNGFNGGEQSLGGFQQNSYEQSLNPVSGQN---------P 93

Query: 389 NPGFARQGQSYTQAATKRPKPMEQSNQ------GYPQFGKPPQRNPQVENSNQPNNQVGI 448
              F + G +  Q+  +  + + Q NQ      G   +G      PQ  N+   + Q   
Sbjct: 94  TNRFYQNGYNRNQSYGEHSEIINQRNQNWQSSDGCSSYGTTGNGVPQENNTGGNHFQQDH 153

Query: 449 QGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLK----EGVKADADCFHVLFELCGK 508
            GH           S +DEL   C EGK+K+AVE++K    EG   D      + +LCG 
Sbjct: 154 SGH-----------SSLDELDSICREGKVKKAVEIIKSWRNEGYVVDLPRLFWIAQLCGD 213

Query: 509 SKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLM 568
           +++   AK+VH++   S   SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +
Sbjct: 214 AQALQEAKVVHEFITSSVGISDISAYNSIIEMYSGCGSVEDALTVFNSMPERNLETWCGV 273

Query: 569 IKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYH 628
           I+ +A NG G++ ++ F   K+ G +P+   F  I  AC     + EG ++FESM  +Y 
Sbjct: 274 IRCFAKNGQGEDAIDTFSRFKQEGNKPDGEMFKEIFFACGVLGDMNEGLLHFESMYKEYG 333

Query: 629 ITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAE 688
           I P M+HY++L+ +L EPG+++EA  +VE   MEP V++WETL N +R+HGD+ L D  +
Sbjct: 334 IIPCMEHYVSLVKMLAEPGYLDEALRFVES--MEPNVDLWETLMNLSRVHGDLILGDRCQ 393

Query: 689 ELIVALDPTKAASNKITTPPPKKRSAI------SMLDGKN------RIGEFRNPTLYKDD 748
           +++  LD ++          P K S +       M  G N        G+   P   ++ 
Sbjct: 394 DMVEQLDASRLNKESKAGLVPVKSSDLVKEKLQRMAKGPNYGIRYMAAGDISRP---ENR 453

Query: 749 EKLKALKAMKEQ----GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPL 808
           E   ALK++KE     GYVP ++  LHD+DQE+K++ L  H+ER A     + TPAR+ +
Sbjct: 454 ELYMALKSLKEHMIEIGYVPLSKLALHDVDQESKDENLFNHNERFAFISTFLDTPARSLI 513

Query: 809 RIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           R++KNLR+C DCHNA+K+MS+IVGRELI RD KRFHH KDG CSC +Y
Sbjct: 514 RVMKNLRVCADCHNALKLMSKIVGRELISRDAKRFHHMKDGVCSCREY 536

BLAST of Sgr013451 vs. TAIR 10
Match: AT2G25580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 311.6 bits (797), Expect = 3.7e-84
Identity = 162/394 (41.12%), Positives = 241/394 (61.17%), Query Frame = 0

Query: 454 IDELRRFCGEGKIKEAVE----LLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQ 513
           I+E   FC  GK+K+A+     L       D      L ++CG+++    AK VH     
Sbjct: 223 IEEYDAFCKHGKVKKALYTIDILASMNYVVDLSRLLRLAKICGEAEGLQEAKTVHGKISA 282

Query: 514 STYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLEL 573
           S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++W ++I+ +A NGFG++ +++
Sbjct: 283 SVSHLDLSSNHVLLEMYSNCGLANEAASVFEKMSEKNLETWCIIIRCFAKNGFGEDAIDM 342

Query: 574 FENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLG 633
           F   K+ G  P+ + F  I  AC     V+EG ++FESM  DY I P ++ Y++L+ +  
Sbjct: 343 FSRFKEEGNIPDGQLFRGIFYACGMLGDVDEGLLHFESMSRDYGIAPSIEDYVSLVEMYA 402

Query: 634 EPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTK------ 693
            PG ++EA E+VE++PMEP V++WETL N +R+HG+++L DY  E++  LDPT+      
Sbjct: 403 LPGFLDEALEFVERMPMEPNVDVWETLMNLSRVHGNLELGDYCAEVVEFLDPTRLNKQSR 462

Query: 694 -----AASNKITTPPPKKRSAISMLDG-KNRIGEFR--NPTLYKDDEKLKALKAMK---- 753
                  ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Sbjct: 463 EGFIPVKASDVEKESLKKRSGI--LHGVKSSMQEFRAGDTNLPENDELFQLLRNLKMHMV 522

Query: 754 EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHN 813
           E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHN
Sbjct: 523 EVGYVAETRMALHDIDQESKETLLLGHSERIAFARAVLNSAPRKPFTVIKNLRVCVDCHN 582

Query: 814 AIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           A+KIMS IVGRE+I RD KRFH  K+G C+C DY
Sbjct: 583 ALKIMSDIVGREVITRDIKRFHQMKNGACTCKDY 614

BLAST of Sgr013451 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 282.7 bits (722), Expect = 1.8e-75
Identity = 148/398 (37.19%), Positives = 225/398 (56.53%), Query Frame = 0

Query: 460 FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSD 519
           F   GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +
Sbjct: 197 FAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRN 256

Query: 520 LQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKK 579
           L  +N +L++Y +CG + +A+ +FD M D+N  SW  +I G A NGFG E +ELF+ M+ 
Sbjct: 257 LHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMES 316

Query: 580 L-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHI 639
             GL P   TF+ I+ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G +
Sbjct: 317 TEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQV 376

Query: 640 NEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAAS-------- 699
            +A+EY++ +PM+P V IW TL     +HGD DL ++A   I+ L+P  +          
Sbjct: 377 KKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMY 436

Query: 700 -------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK 759
                         ++     KK    S+++  NR+ EF      +P       KLK + 
Sbjct: 437 ASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMT 496

Query: 760 A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICG 819
             ++ +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C 
Sbjct: 497 GRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 556

Query: 820 DCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           DCH AIK++S++  RE++VRD  RFHHFK+G CSC DY
Sbjct: 557 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDY 594

BLAST of Sgr013451 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 282.7 bits (722), Expect = 1.8e-75
Identity = 148/398 (37.19%), Positives = 225/398 (56.53%), Query Frame = 0

Query: 460 FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSD 519
           F   GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +
Sbjct: 64  FAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRN 123

Query: 520 LQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKK 579
           L  +N +L++Y +CG + +A+ +FD M D+N  SW  +I G A NGFG E +ELF+ M+ 
Sbjct: 124 LHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMES 183

Query: 580 L-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHI 639
             GL P   TF+ I+ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G +
Sbjct: 184 TEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQV 243

Query: 640 NEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAAS-------- 699
            +A+EY++ +PM+P V IW TL     +HGD DL ++A   I+ L+P  +          
Sbjct: 244 KKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMY 303

Query: 700 -------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK 759
                         ++     KK    S+++  NR+ EF      +P       KLK + 
Sbjct: 304 ASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMT 363

Query: 760 A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICG 819
             ++ +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C 
Sbjct: 364 GRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 423

Query: 820 DCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY 826
           DCH AIK++S++  RE++VRD  RFHHFK+G CSC DY
Sbjct: 424 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDY 461
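
The percentages on each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length, for example 489/601 = 81.36%. The following is a minimal Python sketch for checking this, not part of the database's own tooling. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions; the pattern accepts both "Positives" and the "Postives" spelling that appears in some exports of this page.

```python
import re

# Pattern for the HSP summary line as printed on this page.
# "Posi?tives" matches both "Positives" and the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the raw counts, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


line = "Identity = 489/601 (81.36%), Postives = 517/601 (86.02%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positive_pct"])  # 81.36 86.02
```

Comparing the recomputed and reported values is a quick sanity check when the alignments are copied or re-exported.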

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
KAG6571981.1	0.0e+00	75.88	Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a...
XP_022136002.1	1.5e-289	83.58	pentatricopeptide repeat-containing protein At2g15690 [Momordica charantia]
XP_022952757.1	4.7e-283	83.17	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucur...
XP_022972422.1	3.0e-282	83.33	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucur...
XP_023530084.1	3.4e-281	82.20	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucur...

Match Name	E-value	Identity (%)	Description
Q9ZQE5	8.4e-142	48.99	Pentatricopeptide repeat-containing protein At2g15690, mitochondrial OS=Arabidop...
Q9SUU7	9.4e-85	36.17	Pentatricopeptide repeat-containing protein At4g32450, mitochondrial OS=Arabidop...
Q680H3	5.2e-83	41.12	Pentatricopeptide repeat-containing protein At2g25580 OS=Arabidopsis thaliana OX...
Q9LIQ7	1.0e-75	36.54	Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
A8MQA3	2.6e-74	37.19	Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity (%)	Description
A0A6J1C4C3	7.3e-290	83.58	pentatricopeptide repeat-containing protein At2g15690 OS=Momordica charantia OX=...
A0A6J1GL94	2.3e-283	83.17	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cuc...
A0A6J1I4R8	1.5e-282	83.33	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cuc...
A0A6J1JDU2	8.4e-278	81.36	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isofor...
A0A6J1EIR6	2.1e-273	80.87	pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like OS=Cuc...

Match Name	E-value	Identity (%)	Description
AT2G15690.1	6.0e-143	48.99	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G32450.1	6.7e-86	36.17	Pentatricopeptide repeat (PPR) superfamily protein
AT2G25580.1	3.7e-84	41.12	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1	1.8e-75	37.19	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.2	1.8e-75	37.19	Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 6..33
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1571..1617
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1361..1387
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 364..442
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 31..80
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 225..243
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 322..346
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 225..246
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1582..1617
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1509..1528
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 306..346
None	No IPR available	PANTHER	PTHR24015:SF1958	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 257..821
None	No IPR available	PANTHER	PTHR24015	OS07G0578800 PROTEIN-RELATED	coord: 257..821
IPR003340	B3 DNA binding domain	SMART	SM01019	B3_2	coord: 128..219; e-value: 1.1E-19; score: 81.4
IPR003340	B3 DNA binding domain	PFAM	PF02362	B3	coord: 128..218; e-value: 3.0E-14; score: 52.7
IPR003340	B3 DNA binding domain	PROSITE	PS50863	B3	coord: 128..219; score: 11.927698
IPR003340	B3 DNA binding domain	CDD	cd10017	B3_DNA	coord: 126..217; e-value: 2.19502E-25; score: 99.7096
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 723..816; e-value: 1.5E-32; score: 112.2
IPR015300	DNA-binding pseudobarrel domain superfamily	GENE3D	2.40.330.10	-	coord: 103..220; e-value: 2.3E-26; score: 93.9
IPR015300	DNA-binding pseudobarrel domain superfamily	SUPERFAMILY	101936	DNA-binding pseudobarrel domain	coord: 121..218
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 549..581; e-value: 1.7E-5; score: 22.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 546..593; e-value: 5.7E-8; score: 32.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 546..580; score: 12.068449
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 455..689; e-value: 4.8E-30; score: 107.0
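
The `coord: start..end` values in the InterPro annotations can be parsed and sorted to read off the order of domains along the protein. The following is a small Python sketch of that step, using the three Pfam hits listed for this feature. The helper names `parse_coord` and `overlaps` are hypothetical and are not part of InterProScan or of this database.

```python
import re

# Matches the "coord: start..end" field used in the InterPro annotations.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) as integers from a 'coord: a..b' field."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range in: {text!r}")
    return int(match.group(1)), int(match.group(2))


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


# Pfam rows copied from the InterPro annotations of this feature.
pfam_hits = {
    "B3 (PF02362)": "coord: 128..218",
    "PPR_2 (PF13041)": "coord: 546..593",
    "DYW_deaminase (PF14432)": "coord: 723..816",
}

# Sort by start position to obtain the N- to C-terminal domain order.
domains = sorted((parse_coord(field), name) for name, field in pfam_hits.items())
for (start, end), name in domains:
    print(f"{start:>4}-{end:<4} {name}")
```

Sorted this way, the Pfam hits run B3, then PPR_2, then DYW_deaminase, with no overlap between them.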

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Sgr013451.1	Sgr013451.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding