CmaCh05G003410.1 (mRNA) Cucurbita maxima (Rimu)

Name: CmaCh05G003410.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cma_Chr05 : 1491372 .. 1498040 (-)
Sequence length: 3621
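
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this); the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the genomic
    span is end - start + 1.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError("unrecognised location string: %r" % text)
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "span": end - start + 1,  # genomic span, introns included
    }

loc = parse_location("Cma_Chr05 : 1491372 .. 1498040 (-)")
print(loc)
```

Under that assumption the genomic span is longer than the 3621-nt mRNA because it includes the introns; the `(-)` marks a feature on the reverse strand.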
The following sequences are available for this feature:

Gene sequence (with intron)

ATCTCGCGCGAGGAGGCAATGGCGAACGACGAAGAGAGAGTTCTGGAGCACCAGCTAGAGGTTCAATTGCATGAGCAGAGAGAATCTCTCGCCGCCTTGCAAGATGCCTTAGCCTCCGATGCCTCCAATCCGGAGCTTCTCGAGGTACGTTCTTTTGTCCAATATGAATTTCCGTTTGGCTTTGAGAATATTCATGAGAGGAATGCGTGATCGCCTCTTATTGAATTTGATTATTATTTTTGAAGTTTTGAAGACCTCTGCTTGTTGATTTATTTGAATATAATCTGTTGATGTCAAGGTTCATGATGAGCTTGTCCAAGCAATTAAAGATGCCGAGGAAGGGCTGCTTCACCTTAAGCGTTCTAGATTACTAAGAGAAGCAGATTTGGTGTTGTGTGGTCGTGATAGTAACGCAGCGGAGGATGTTAAGGTGGAGCCTCTTCATTCTACGGACGTCGAACCTGAATCACCAGAGGATCAGAGTTTCGTCGTTGGATCGAAATGCAGATTTCGGCACACTGATGGACGTTGGTATGACGGTGAAATTGTTGGATTGGATGGTTCTAATTCTGCGAAAATTTCTTTCCTCACTCCTACAACTGAAAATATGTTGGTAAGCAATCTGGGACTCTATTCGTCGGCTTTTGCGAAACTGTTAGTTCTAATTTCAAACCACTCCTCATTTTGTCTTTTGGATTGTATTTATGCACATATTATTTTGACTTGAATTTACCGTTTTCTGTCTGATTGTGGCTGTCTCGGTCTTCTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGAGGGAATTCGATTAGCTTCTTCCCTTTAATAGTACCGAGGATCTGTTTGTGCCTACTTACTAGATGAGTTTTTAAGTGTTGGCAGTGACTTGGGTTTTGATTCACCTAACTAAACTATTGAGCAATCACTATGGTTTTTTCTCACATCATGATACTAATGCCATAACACCACCAAAGTGCAGATATGCAAGTTCTTCTTACAGCAAAGGTGTCGGTTTGGCACTAGCTGCCGCTTATCGCATGGTATGTCATATTCTTTCACGTGGACCATGTAGATCCAACTACTCATTAAGTAAAGCCTTCCCCTCGTTTCAATGGCTGTTCCCACCAGCAATACTATCTTGAGATGCTAGAATGGATTATAAACTCTTTAATAAAGGAACCTGCAGAATTAAAATAATGGTGAAGTTCCTGCATAGGTCTAGGACTAAAAATGTTTTCCTTCATAAAGAGTTCAGTATGAAAAAAATGCTGGTGCTGTTCGGTTTTCTTGTTGATGAATCTTCAATCATTTGTATTCCTAATTAGAAGAATGTGAGTAAAACTTAGATCTTCTACAACTCGTTACTTCTTATTCCCCGTTGAGACTAGAAACAGTTTTGTTTCTATTTAAAGAAGAATGCAAGAGGTAATGGGGGACTAGTTTACCAATCCTAGAACTAAAGAAAATGGAGATTACACTGAAGAAAAACTGTCAAGGAACTGTGAATTAAGCTATTTCACCTAAGAGGAGAGCAAAGAGAAACTGTCTTGAATTTCTCTTTCTA
ATATCTTGGAAAATTCAATTGCCACATTTCATGCATTGGGCAGATGTTTGTGAATATCGCATTTCTAGAACTCCCCTTCATTTCAGCTGTCTCTATGTGATACATTCTAAGACTGCCTTATGTAATGCAAGGTAGTCTTGCCTTTCAACATTTTTGGAAAATACATGTTGCATTGCATTTTACCTCCATTCCAATAGTACCACATGTATTTCTTTCCCCTTGATTTAATGTTTCCATTTTATTTAAAATGAACTTTCCCTAGTTTTTTATTCACCTTTTTAGAGACTGGTCTTTTTCCTTCCATGTAAAAGAACAGGCAATAGCTAAGCTGCTGTTCTCAACTATATTTGTATACTTCTGTATATCATTAAAGTCAATAGCTTTGCTGCTATTCTCCATTGGGTTTGTCGACCAATTGGGAGTTTTGAGGAGGAAAGAATTTTTACATTTATTTCTTTGATTAGGTTTTTGTCCTTCTGTATAAAGTATATCTTAACAGGCAATAGCTATGCTTTTATTCTCAATTATCTGAGAGAGTTTTTCATGCACAGGAGTTGATATCCCTTTAACCTCTCTTAGGAGATATGCGCCAACAATTTGGAATCAGTCACTGACAGGGTCCAGTATCTGGGCTCTCTCGTCCAGGAATGGCATTTGGAGGCATGCTGAACTTGAATCTTGGGATGATGCACTACAAATTGCACAAGTTGTTTTTAAAGGTGATGGATCCTCTCAAAAGCTTGGACCGGAGGACATAGCGTTATCTGTGCGTGCTCAAATTAGTGATGGAGAAGAAAGTGATTCCAGCTTGGAAAAGTCTGACTCAAGTGATTATGAAGATGATGATTTGCAGGGTTTGGGATTCCTCGAAAGCTCTACACAGCAGAGGGGCATTCAGATGGAAACCACCATATTTGCAAAATGGGAGAACCATACCCGGGGAATCGCCTCCAAGATGATGGCTAATATGGGCTATCGAGAAGGAATGGGTTTGGGTGCATCTGGGCAGGGGATGCTAAATCCTATCCCTGTCAAAGTTCTTCCAGCAAAACAATCTCTTGATCATGCTCTAGAATCACAAAAGGAGAATAATACTAACGACGAGAATAATGGCAAGAAACGAAGTAGAGGCGGTAAGAGGAAACGTGATAAGAAGTTTGCTGCAGCAATGCAGGCAGCTAAAGAGGAAGAAGACTCAAGACCTGATGTCTTTAATCTTATCAACAACCACCTTGCAATGCATAATGGAGCACCGAATGATGGATCTGTTAAGAAACAGAAAGATAAAGGTTCAGCAGATGGAAAGAAGGTAGATAGACGAACTCTAATCGCGTACGATGGTGAGGTGAAAGACCTGCGAGTACGAATAGAGAAGCTTGAAGAAATGGTGAACAGAAATAAGAATGAGAAGGTTGTTTTCGAGGCTGCCTTAAGAAAGCTGAACGAGACTCGGAAAGCTTTGGCCGAGGCCGAGGCAGCTCATGCGTCTGCATCAAATGCAGTTACCAGCAGAGAAAAGGAAAAAAAATGGTTGAAGTTTTAGGGCTTGACTTGATGTTATTGTATAATGATAGCTCCACCTGTAGAAGATTCATGTCAATCTCTGAAACTGAACCCCACGTGAATTATTGAAGGAAATCGTGTGATGATTTGTGTTTCATGGTGGCTGGCCGTTGGTGTATATTCATCCTAATATGAAGTCCAAATCTTAATGTAGGCGACAGGCTGATTCTTGTTCCTTTTTCTCAATTAGTACTTCTTATAGTTCGGTCGTTTTTGTTTCTTTTGTGATTTGCATACTTTTCCCTAATAATGTATAAATTACTTTTTGGGATAGGATGAGCATTCTAGATGGAATTGGTTAAATATGTCATTTCCTACTCGTGCAAATTAGAATTTCATTCAATTTTATCGGTTTGATCCTGTTTGAAAATTTTTGCCTTTTTCACAATCGCACTTTTCTTACCACGAACTCTGCGATTGTGTGGCACTCGCTGTTT
TTCAATAGCAAGTGAGTCAAGTCGATTTGTATTTGCGTTGCAATGCTCAAGTCTCTGACTCCTATTTGAAAAGAAGGTTTAGAAAACAGGGGCAGAGAAAGATTGAGTGAGAGAATTTGAAAACAAGGTTATATGACATACAAGCAAAAGGTAATACAACGGCTTAAGTAAAATTCATCCAATTCTTCACGTTCAAACAATTTAATTTTTTCTTCTCGCGTATTCCATTGGGTATAACTCCTCATCTTCATCCAATTAAAATTCTGAAATGTTTAAAAAATATTTTTAAAAAATTGATGTAATTTAATAAGTTGGTAAAATTAGTACATATCCCCACCACCGGTGAAAAAACATGTATTTGAATGTTGCTTTCGGGGAAACTATGAATTGAAGCTTCATCTGTGCAGGTAGGAGTTAAATGTGCTATTTTCGGTGTCCTCTAGTTTGAGAAAGAGAGCTGATATCCCGTTCAGGTACTCATTCCTCTCAATTTGACTCGATTACTTTCTGTTTCTTTTCCTGGGAATAGAAAATTTGGCTACCTATGAATGAGGTCATTGTACTCGGTAACTGGGACATCGTTTCGAGCTTTGTCCAATCTTTTGTTAACCCATTCTCTGGCCATCAAGTTGGGTACCATAGCAGACGTTTACACTTGCAACAATATCCTAAGTGGGTATTGGAAATGCAAAGAGTTTCGATCTGCAGACGTACTGTTCGACGAAATGCCGCTGAGAGACTCTGTATCTTGGAACACGATGATCGCGGGGTATATTAACTCTGGAAACTTGGAGAATTCATGGGAAGTTCTTAAATGCATGAGAAGATTTGGTTTTGATCAAGATGAGTACACCTTTGGAAGCATGCTGAAGGGCATTGCTTGTGCTGGTATGCTTGATTTGGGTCAGCAAATACATTCTATGATCATTAAGATGGGTTTTGCTGGAAATGTATATGCAGGGAGTGCTCTTCTGGATATGTATGCGAAATGTGAGAGACTTGAGGATGCATATTTGACATTCCTAAATATATCTAAGCAGAACACTGTTTCGTGGAATGCAATGATTGCTGGATACGCACAAACGGGTGATCGCGAGACCGCGTTTTTGTTGTTAGATTGTATGGAGCAAGAAGGTGAGAAGGTTGATGATGGCTCATTTGCTCCTCTTTTGCCTTTACTAGATGATGCTGAGTTTTGTAGATTGACAAGGCAAGTTCATGGAAAAGTCATAAAACATGGATTGGAGTCTGCTAATACAATGTGTAATGCTTTGATCACTTCTTATTCAGAATGTGGATCCCTTGTCGATGCCAAAAGGGTTTTCAATTGTTCGGCGGGCGTTCGAGATTTGGTGTCGTGGAACTCCCTGTTGGGTGCTTTTTTGGTGCATAATCAGGAAGATCTTGCTTTTAAACTCTTGATTGATATGCAAGAACATGGTTTTGAACCAGATTTGTACTCTTACACAAGCATTATCAGTGCTTGTTTCAACAAAGAGCTTAGCAATAATGGGAAATCCCTGCATGGGATGGTCATTAAAAGAGGATTAGAACAATCAGTGCCAATTTCAAATGCATTGATATCTATGTATCTTAAATCAGACGGTGGTTCGATGAAGGAAGCTTTATGTATATTCGAATCCTTGGAGATTAAGGATCGTGTGTCGTGGAACTCGATCTTGACGGGATTATCACAAATGGGGTCGAGCGAAGATGCTGTGAAGTCGTTTCTGCATATGAGATCTTTAGCAATGGATATTGATCGGTATTCGTTTTCTGCTGTGCTCAAATCATGCTCAGATTTGGCCACCTTTCAATTGGGACAACAATTTCATGTCTTGGCGCTGAAATATGGTATGGATTCCAATGAGTTTGTTTCAAGTTCATTAATCTTCATGTATTCAAAGTGTGGGATTATGGAAGATGCTAAAAGATCATTTGAAGGAGCTTCAAAAAGCTCTTCAATCACCTGGAATGCACTCATGTTTGGCTATGCACAA
CATGGGCAATGCCATGTTGCATTAGACCTCTTCTTTCTAATGGAAGAGAAGAAGGTGAAAATGGATCACATAACATTCGTTGCAGTTCTGACCGCTTGTAGCCATATCGGTTTAGTCGAACGGGGCTGCGAATTCTTACGATGTATGGAATCTGATTATGGGGTTCCTCCACGAATGGAGCATTATGCTTGTGCAGTTGATCTATATGGCCGTTCTGGGCGTCTTGATGAAGCCAAGGCCTTGATTGAGGCAATGCCATTCAAGCCGAACGCGATGGTGTGGAAGACGTTCTTGGGGGCATGTCGTTCTTGTGGGAACGTTGAGTTAGCTTGTCAGGTTGCAAGGCATCTACTAGAGATGGAGCCTGAAGAGCATTGCACTTATGTTCTTCTCTCAAACATGTATGGAGATCTAATGAGATGGGAGGAGAAGGCTCAGGTGAAGAGGTTAATGAAGGAAAGAGGAGTTAAGAAAACGCCTGGTTGGAGTTGGATTGAAGTTAAGAACAAGGTTCATGCTTTCATTGCTGAAGATCGTTCTCATCCCAGTTGCCAACAGATATACTTTTTGCTGGAAGTTCTTATGGAGGAAATCACAAGAATTGAAGCTGCTGCTGATGGTTTTGAGAGTTTTTTGGAGCAGGAAGAGCTAAGTTATGCATATGCATAA

mRNA sequence

ATCTCGCGCGAGGAGGCAATGGCGAACGACGAAGAGAGAGTTCTGGAGCACCAGCTAGAGGTTCAATTGCATGAGCAGAGAGAATCTCTCGCCGCCTTGCAAGATGCCTTAGCCTCCGATGCCTCCAATCCGGAGCTTCTCGAGGTTCATGATGAGCTTGTCCAAGCAATTAAAGATGCCGAGGAAGGGCTGCTTCACCTTAAGCGTTCTAGATTACTAAGAGAAGCAGATTTGGTGTTGTGTGGTCGTGATAGTAACGCAGCGGAGGATGTTAAGGTGGAGCCTCTTCATTCTACGGACGTCGAACCTGAATCACCAGAGGATCAGAGTTTCGTCGTTGGATCGAAATGCAGATTTCGGCACACTGATGGACGTTGGTATGACGGTGAAATTGTTGGATTGGATGGTTCTAATTCTGCGAAAATTTCTTTCCTCACTCCTACAACTGAAAATATGTTGATATGCAAGTTCTTCTTACAGCAAAGGTGTCGGTTTGGCACTAGCTGCCGCTTATCGCATGGAGTTGATATCCCTTTAACCTCTCTTAGGAGATATGCGCCAACAATTTGGAATCAGTCACTGACAGGGTCCAGTATCTGGGCTCTCTCGTCCAGGAATGGCATTTGGAGGCATGCTGAACTTGAATCTTGGGATGATGCACTACAAATTGCACAAGTTGTTTTTAAAGGTGATGGATCCTCTCAAAAGCTTGGACCGGAGGACATAGCGTTATCTGTGCGTGCTCAAATTAGTGATGGAGAAGAAAGTGATTCCAGCTTGGAAAAGTCTGACTCAAGTGATTATGAAGATGATGATTTGCAGGGTTTGGGATTCCTCGAAAGCTCTACACAGCAGAGGGGCATTCAGATGGAAACCACCATATTTGCAAAATGGGAGAACCATACCCGGGGAATCGCCTCCAAGATGATGGCTAATATGGGCTATCGAGAAGGAATGGGTTTGGGTGCATCTGGGCAGGGGATGCTAAATCCTATCCCTGTCAAAGTTCTTCCAGCAAAACAATCTCTTGATCATGCTCTAGAATCACAAAAGGAGAATAATACTAACGACGAGAATAATGGCAAGAAACGAAGTAGAGGCGGTAAGAGGAAACGTGATAAGAAGTTTGCTGCAGCAATGCAGGCAGCTAAAGAGGAAGAAGACTCAAGACCTGATGTCTTTAATCTTATCAACAACCACCTTGCAATGCATAATGGAGCACCGAATGATGGATCTGTTAAGAAACAGAAAGATAAAGGTTCAGCAGATGGAAAGAAGGTAGATAGACGAACTCTAATCGCGTACGATGGTGAGGTGAAAGACCTGCGAGTACGAATAGAGAAGCTTGAAGAAATGGTGAACAGAAATAAGAATGAGAAGGTTGTTTTCGAGGCTGCCTTAAGAAAGCTGAACGAGACTCGGAAAGCTTTGGCCGAGGCCGAGGCAGCTCATGCGTCTGCATCAAATGCAGTTACCAGCAGAGAAAAGGAAAAAAAATGGTTGAAGTCATTGTACTCGGTAACTGGGACATCGTTTCGAGCTTTGTCCAATCTTTTGTTAACCCATTCTCTGGCCATCAAGTTGGGTACCATAGCAGACGTTTACACTTGCAACAATATCCTAAGTGGGTATTGGAAATGCAAAGAGTTTCGATCTGCAGACGTACTGTTCGACGAAATGCCGCTGAGAGACTCTGTATCTTGGAACACGATGATCGCGGGGTATATTAACTCTGGAAACTTGGAGAATTCATGGGAAGTTCTTAAATGCATGAGAAGATTTGGTTTTGATCAAGATGAGTACACCTTTGGAAGCATGCTGAAGGGCATTGCTTGTGCTGGTATGCTTGATTTGGGTCAGCAAATACATTCTATGATCATTAAGATGGGTTTTGCTGGAAATGTATATGCAGGGAGTGCTCTTCTGGATATGTATGCGAAATGTGAGAGACTTGAGGATGCATATTTGACATTCCTAAATATATCTAAGCAGAACACTGT
TTCGTGGAATGCAATGATTGCTGGATACGCACAAACGGGTGATCGCGAGACCGCGTTTTTGTTGTTAGATTGTATGGAGCAAGAAGGTGAGAAGGTTGATGATGGCTCATTTGCTCCTCTTTTGCCTTTACTAGATGATGCTGAGTTTTGTAGATTGACAAGGCAAGTTCATGGAAAAGTCATAAAACATGGATTGGAGTCTGCTAATACAATGTGTAATGCTTTGATCACTTCTTATTCAGAATGTGGATCCCTTGTCGATGCCAAAAGGGTTTTCAATTGTTCGGCGGGCGTTCGAGATTTGGTGTCGTGGAACTCCCTGTTGGGTGCTTTTTTGGTGCATAATCAGGAAGATCTTGCTTTTAAACTCTTGATTGATATGCAAGAACATGGTTTTGAACCAGATTTGTACTCTTACACAAGCATTATCAGTGCTTGTTTCAACAAAGAGCTTAGCAATAATGGGAAATCCCTGCATGGGATGGTCATTAAAAGAGGATTAGAACAATCAGTGCCAATTTCAAATGCATTGATATCTATGTATCTTAAATCAGACGGTGGTTCGATGAAGGAAGCTTTATGTATATTCGAATCCTTGGAGATTAAGGATCGTGTGTCGTGGAACTCGATCTTGACGGGATTATCACAAATGGGGTCGAGCGAAGATGCTGTGAAGTCGTTTCTGCATATGAGATCTTTAGCAATGGATATTGATCGGTATTCGTTTTCTGCTGTGCTCAAATCATGCTCAGATTTGGCCACCTTTCAATTGGGACAACAATTTCATGTCTTGGCGCTGAAATATGGTATGGATTCCAATGAGTTTGTTTCAAGTTCATTAATCTTCATGTATTCAAAGTGTGGGATTATGGAAGATGCTAAAAGATCATTTGAAGGAGCTTCAAAAAGCTCTTCAATCACCTGGAATGCACTCATGTTTGGCTATGCACAACATGGGCAATGCCATGTTGCATTAGACCTCTTCTTTCTAATGGAAGAGAAGAAGGTGAAAATGGATCACATAACATTCGTTGCAGTTCTGACCGCTTGTAGCCATATCGGTTTAGTCGAACGGGGCTGCGAATTCTTACGATGTATGGAATCTGATTATGGGGTTCCTCCACGAATGGAGCATTATGCTTGTGCAGTTGATCTATATGGCCGTTCTGGGCGTCTTGATGAAGCCAAGGCCTTGATTGAGGCAATGCCATTCAAGCCGAACGCGATGGTGTGGAAGACGTTCTTGGGGGCATGTCGTTCTTGTGGGAACGTTGAGTTAGCTTGTCAGGTTGCAAGGCATCTACTAGAGATGGAGCCTGAAGAGCATTGCACTTATGTTCTTCTCTCAAACATGTATGGAGATCTAATGAGATGGGAGGAGAAGGCTCAGGTGAAGAGGTTAATGAAGGAAAGAGGAGTTAAGAAAACGCCTGGTTGGAGTTGGATTGAAGTTAAGAACAAGGTTCATGCTTTCATTGCTGAAGATCGTTCTCATCCCAGTTGCCAACAGATATACTTTTTGCTGGAAGTTCTTATGGAGGAAATCACAAGAATTGAAGCTGCTGCTGATGGTTTTGAGAGTTTTTTGGAGCAGGAAGAGCTAAGTTATGCATATGCATAA

Coding sequence (CDS)

ATGGCGAACGACGAAGAGAGAGTTCTGGAGCACCAGCTAGAGGTTCAATTGCATGAGCAGAGAGAATCTCTCGCCGCCTTGCAAGATGCCTTAGCCTCCGATGCCTCCAATCCGGAGCTTCTCGAGGTTCATGATGAGCTTGTCCAAGCAATTAAAGATGCCGAGGAAGGGCTGCTTCACCTTAAGCGTTCTAGATTACTAAGAGAAGCAGATTTGGTGTTGTGTGGTCGTGATAGTAACGCAGCGGAGGATGTTAAGGTGGAGCCTCTTCATTCTACGGACGTCGAACCTGAATCACCAGAGGATCAGAGTTTCGTCGTTGGATCGAAATGCAGATTTCGGCACACTGATGGACGTTGGTATGACGGTGAAATTGTTGGATTGGATGGTTCTAATTCTGCGAAAATTTCTTTCCTCACTCCTACAACTGAAAATATGTTGATATGCAAGTTCTTCTTACAGCAAAGGTGTCGGTTTGGCACTAGCTGCCGCTTATCGCATGGAGTTGATATCCCTTTAACCTCTCTTAGGAGATATGCGCCAACAATTTGGAATCAGTCACTGACAGGGTCCAGTATCTGGGCTCTCTCGTCCAGGAATGGCATTTGGAGGCATGCTGAACTTGAATCTTGGGATGATGCACTACAAATTGCACAAGTTGTTTTTAAAGGTGATGGATCCTCTCAAAAGCTTGGACCGGAGGACATAGCGTTATCTGTGCGTGCTCAAATTAGTGATGGAGAAGAAAGTGATTCCAGCTTGGAAAAGTCTGACTCAAGTGATTATGAAGATGATGATTTGCAGGGTTTGGGATTCCTCGAAAGCTCTACACAGCAGAGGGGCATTCAGATGGAAACCACCATATTTGCAAAATGGGAGAACCATACCCGGGGAATCGCCTCCAAGATGATGGCTAATATGGGCTATCGAGAAGGAATGGGTTTGGGTGCATCTGGGCAGGGGATGCTAAATCCTATCCCTGTCAAAGTTCTTCCAGCAAAACAATCTCTTGATCATGCTCTAGAATCACAAAAGGAGAATAATACTAACGACGAGAATAATGGCAAGAAACGAAGTAGAGGCGGTAAGAGGAAACGTGATAAGAAGTTTGCTGCAGCAATGCAGGCAGCTAAAGAGGAAGAAGACTCAAGACCTGATGTCTTTAATCTTATCAACAACCACCTTGCAATGCATAATGGAGCACCGAATGATGGATCTGTTAAGAAACAGAAAGATAAAGGTTCAGCAGATGGAAAGAAGGTAGATAGACGAACTCTAATCGCGTACGATGGTGAGGTGAAAGACCTGCGAGTACGAATAGAGAAGCTTGAAGAAATGGTGAACAGAAATAAGAATGAGAAGGTTGTTTTCGAGGCTGCCTTAAGAAAGCTGAACGAGACTCGGAAAGCTTTGGCCGAGGCCGAGGCAGCTCATGCGTCTGCATCAAATGCAGTTACCAGCAGAGAAAAGGAAAAAAAATGGTTGAAGTCATTGTACTCGGTAACTGGGACATCGTTTCGAGCTTTGTCCAATCTTTTGTTAACCCATTCTCTGGCCATCAAGTTGGGTACCATAGCAGACGTTTACACTTGCAACAATATCCTAAGTGGGTATTGGAAATGCAAAGAGTTTCGATCTGCAGACGTACTGTTCGACGAAATGCCGCTGAGAGACTCTGTATCTTGGAACACGATGATCGCGGGGTATATTAACTCTGGAAACTTGGAGAATTCATGGGAAGTTCTTAAATGCATGAGAAGATTTGGTTTTGATCAAGATGAGTACACCTTTGGAAGCATGCTGAAGGGCATTGCTTGTGCTGGTATGCTTGATTTGGGTCAGCAAATACATTCTATGATCATTAAGATGGGTTTTGCTGGAAATGTATATGCAGGGAGTGCTCTTCTGGATATGTATGCGAAATGTGAGAGACTTGAGGATGCATATTTGACATTCCTAAATATATCTAAGCAGAACACTGTTTCGTGGAATGCAATGAT
TGCTGGATACGCACAAACGGGTGATCGCGAGACCGCGTTTTTGTTGTTAGATTGTATGGAGCAAGAAGGTGAGAAGGTTGATGATGGCTCATTTGCTCCTCTTTTGCCTTTACTAGATGATGCTGAGTTTTGTAGATTGACAAGGCAAGTTCATGGAAAAGTCATAAAACATGGATTGGAGTCTGCTAATACAATGTGTAATGCTTTGATCACTTCTTATTCAGAATGTGGATCCCTTGTCGATGCCAAAAGGGTTTTCAATTGTTCGGCGGGCGTTCGAGATTTGGTGTCGTGGAACTCCCTGTTGGGTGCTTTTTTGGTGCATAATCAGGAAGATCTTGCTTTTAAACTCTTGATTGATATGCAAGAACATGGTTTTGAACCAGATTTGTACTCTTACACAAGCATTATCAGTGCTTGTTTCAACAAAGAGCTTAGCAATAATGGGAAATCCCTGCATGGGATGGTCATTAAAAGAGGATTAGAACAATCAGTGCCAATTTCAAATGCATTGATATCTATGTATCTTAAATCAGACGGTGGTTCGATGAAGGAAGCTTTATGTATATTCGAATCCTTGGAGATTAAGGATCGTGTGTCGTGGAACTCGATCTTGACGGGATTATCACAAATGGGGTCGAGCGAAGATGCTGTGAAGTCGTTTCTGCATATGAGATCTTTAGCAATGGATATTGATCGGTATTCGTTTTCTGCTGTGCTCAAATCATGCTCAGATTTGGCCACCTTTCAATTGGGACAACAATTTCATGTCTTGGCGCTGAAATATGGTATGGATTCCAATGAGTTTGTTTCAAGTTCATTAATCTTCATGTATTCAAAGTGTGGGATTATGGAAGATGCTAAAAGATCATTTGAAGGAGCTTCAAAAAGCTCTTCAATCACCTGGAATGCACTCATGTTTGGCTATGCACAACATGGGCAATGCCATGTTGCATTAGACCTCTTCTTTCTAATGGAAGAGAAGAAGGTGAAAATGGATCACATAACATTCGTTGCAGTTCTGACCGCTTGTAGCCATATCGGTTTAGTCGAACGGGGCTGCGAATTCTTACGATGTATGGAATCTGATTATGGGGTTCCTCCACGAATGGAGCATTATGCTTGTGCAGTTGATCTATATGGCCGTTCTGGGCGTCTTGATGAAGCCAAGGCCTTGATTGAGGCAATGCCATTCAAGCCGAACGCGATGGTGTGGAAGACGTTCTTGGGGGCATGTCGTTCTTGTGGGAACGTTGAGTTAGCTTGTCAGGTTGCAAGGCATCTACTAGAGATGGAGCCTGAAGAGCATTGCACTTATGTTCTTCTCTCAAACATGTATGGAGATCTAATGAGATGGGAGGAGAAGGCTCAGGTGAAGAGGTTAATGAAGGAAAGAGGAGTTAAGAAAACGCCTGGTTGGAGTTGGATTGAAGTTAAGAACAAGGTTCATGCTTTCATTGCTGAAGATCGTTCTCATCCCAGTTGCCAACAGATATACTTTTTGCTGGAAGTTCTTATGGAGGAAATCACAAGAATTGAAGCTGCTGCTGATGGTTTTGAGAGTTTTTTGGAGCAGGAAGAGCTAAGTTATGCATATGCATAA

Protein sequence

MANDEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLHLKRSRLLREADLVLCGRDSNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRWYDGEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYAPTIWNQSLTGSSIWALSSRNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALSVRAQISDGEESDSSLEKSDSSDYEDDDLQGLGFLESSTQQRGIQMETTIFAKWENHTRGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNTNDENNGKKRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHLAMHNGAPNDGSVKKQKDKGSADGKKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETRKALAEAEAAHASASNAVTSREKEKKWLKSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYAYA
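
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only, assuming the standard nuclear genetic code; the function name `translate` is hypothetical, and only the first seven codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with N etc.
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nt of the CDS listed in this record.
prefix = translate("ATGGCGAACGACGAAGAGAGA")
print(prefix)
```

The printed prefix can be compared with the start of the protein sequence listed above; passing the full CDS would translate the complete open reading frame in the same way.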
BLAST of CmaCh05G003410.1 vs. Swiss-Prot
Match: PP255_ARATH (Putative pentatricopeptide repeat-containing protein At3g25970 OS=Arabidopsis thaliana GN=PCMP-E46 PE=3 SV=2)

HSP 1 Score: 796.6 bits (2056), Expect = 3.8e-229
Identity = 387/686 (56.41%), Positives = 505/686 (73.62%), Query Frame = 1

Query: 498  LYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLR 557
            L S+  +S  +   L LTH  AIK G+I+D+Y  N IL  Y K      A++LFDEMP R
Sbjct: 5    LASLLESSLNSFQKLSLTHCYAIKCGSISDIYVSNRILDSYIKFGFLGYANMLFDEMPKR 64

Query: 558  DSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIH 617
            DSVSWNTMI+GY + G LE++W +  CM+R G D D Y+F  +LKGIA     DLG+Q+H
Sbjct: 65   DSVSWNTMISGYTSCGKLEDAWCLFTCMKRSGSDVDGYSFSRLLKGIASVKRFDLGEQVH 124

Query: 618  SMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRE 677
             ++IK G+  NVY GS+L+DMYAKCER+EDA+  F  IS+ N+VSWNA+IAG+ Q  D +
Sbjct: 125  GLVIKGGYECNVYVGSSLVDMYAKCERVEDAFEAFKEISEPNSVSWNALIAGFVQVRDIK 184

Query: 678  TAFLLLDCMEQEGE-KVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNAL 737
            TAF LL  ME +    +D G+FAPLL LLDD  FC L +QVH KV+K GL+   T+CNA+
Sbjct: 185  TAFWLLGLMEMKAAVTMDAGTFAPLLTLLDDPMFCNLLKQVHAKVLKLGLQHEITICNAM 244

Query: 738  ITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPD 797
            I+SY++CGS+ DAKRVF+   G +DL+SWNS++  F  H  ++ AF+L I MQ H  E D
Sbjct: 245  ISSYADCGSVSDAKRVFDGLGGSKDLISWNSMIAGFSKHELKESAFELFIQMQRHWVETD 304

Query: 798  LYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCI 857
            +Y+YT ++SAC  +E    GKSLHGMVIK+GLEQ    +NALISMY++   G+M++AL +
Sbjct: 305  IYTYTGLLSACSGEEHQIFGKSLHGMVIKKGLEQVTSATNALISMYIQFPTGTMEDALSL 364

Query: 858  FESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATF 917
            FESL+ KD +SWNSI+TG +Q G SEDAVK F ++RS  + +D Y+FSA+L+SCSDLAT 
Sbjct: 365  FESLKSKDLISWNSIITGFAQKGLSEDAVKFFSYLRSSEIKVDDYAFSALLRSCSDLATL 424

Query: 918  QLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEG-ASKSSSITWNALMFG 977
            QLGQQ H LA K G  SNEFV SSLI MYSKCGI+E A++ F+  +SK S++ WNA++ G
Sbjct: 425  QLGQQIHALATKSGFVSNEFVISSLIVMYSKCGIIESARKCFQQISSKHSTVAWNAMILG 484

Query: 978  YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1037
            YAQHG   V+LDLF  M  + VK+DH+TF A+LTACSH GL++ G E L  ME  Y + P
Sbjct: 485  YAQHGLGQVSLDLFSQMCNQNVKLDHVTFTAILTACSHTGLIQEGLELLNLMEPVYKIQP 544

Query: 1038 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1097
            RMEHYA AVDL GR+G +++AK LIE+MP  P+ MV KTFLG CR+CG +E+A QVA HL
Sbjct: 545  RMEHYAAAVDLLGRAGLVNKAKELIESMPLNPDPMVLKTFLGVCRACGEIEMATQVANHL 604

Query: 1098 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1157
            LE+EPE+H TYV LS+MY DL +WEEKA VK++MKERGVKK PGWSWIE++N+V AF AE
Sbjct: 605  LEIEPEDHFTYVSLSHMYSDLKKWEEKASVKKMMKERGVKKVPGWSWIEIRNQVKAFNAE 664

Query: 1158 DRSHPSCQQIYFLLEVLMEEITRIEA 1182
            DRS+P CQ IY +++ L +E+  +++
Sbjct: 665  DRSNPLCQDIYMMIKDLTQEMQWLDS 690
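
The percentages in each HSP summary line are the identical and positive-scoring column counts divided by the alignment length. A minimal sketch of parsing such a line and recomputing both values follows; the helper name `parse_hsp_summary` is hypothetical, and the pattern accepts either the "Positives" or "Postives" spelling.

```python
import re

# Matches the counts and reported percentages of an HSP summary line.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    identical, length = int(m.group(1)), int(m.group(2))
    positives, length_pos = int(m.group(4)), int(m.group(5))
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positives / length_pos, 2),
        # Percentages as printed in the report, kept for comparison.
        "reported": (float(m.group(3)), float(m.group(6))),
    }

summary = parse_hsp_summary(
    "Identity = 387/686 (56.41%), Postives = 505/686 (73.62%), Query Frame = 1"
)
print(summary)
```

For the first Swiss-Prot hit, 387/686 and 505/686 reproduce the reported 56.41% and 73.62%.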

BLAST of CmaCh05G003410.1 vs. Swiss-Prot
Match: C3H22_ARATH (Zinc finger CCCH domain-containing protein 22 OS=Arabidopsis thaliana GN=At2g24830 PE=2 SV=1)

HSP 1 Score: 537.0 bits (1382), Expect = 5.4e-151
Identity = 296/508 (58.27%), Positives = 378/508 (74.41%), Query Frame = 1

Query: 1   MANDEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLH 60
           MA++E   LE+ L++QL EQ+ESL+++ +AL SD SNPELL VH+EL+ AIK+ EEGLLH
Sbjct: 1   MASEENNDLENLLDIQLIEQKESLSSIDEALLSDPSNPELLSVHEELLSAIKEVEEGLLH 60

Query: 61  LKRSRLLREADLVLCGRDSNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRW 120
           LKR+RLL EAD+VL G + +A     V+P H   +EPE  E++  + GSKCRFRHTDGRW
Sbjct: 61  LKRARLLEEADIVLNGLNHDAG----VKPEH---LEPEKTEEKKDLDGSKCRFRHTDGRW 120

Query: 121 YDGEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYA 180
           Y+G I+G +GS+SAKISFLTPT+E+M+ICKFF+QQRCRFG+SCR SHG+D+P++SL+ Y 
Sbjct: 121 YNGRIIGFEGSDSAKISFLTPTSESMMICKFFMQQRCRFGSSCRSSHGLDVPISSLKNYE 180

Query: 181 PTIWNQSLTGSSIWALS-SRNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALS 240
            T W Q + GS IWA+S S+  IWR AELESWDD LQ+  VVF+ D SS KLG + +ALS
Sbjct: 181 QTEWKQLMVGSKIWAVSGSKYDIWRKAELESWDDELQVGGVVFRDDKSSAKLGSDSLALS 240

Query: 241 VRAQISD--GEESD--------SSLEKSDSSDYEDDDLQGLGFLESSTQQRGIQMETTIF 300
             AQ++D  GEE +        S  E S SSDY++   QG+GFLES+   RG+Q +T +F
Sbjct: 241 EYAQMTDDDGEEEEEEDEQQSASDSEDSVSSDYDEGSPQGIGFLESTNLPRGVQTDTALF 300

Query: 301 AKWENHTRGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNT 360
           AKWENHTRGIASKMMA+MGYREGMGLG SGQG+LNPI VKVLPAK+SLD+ALE  +    
Sbjct: 301 AKWENHTRGIASKMMASMGYREGMGLGVSGQGILNPILVKVLPAKRSLDYALEHIRNGEC 360

Query: 361 NDENNGKKRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHL-AMHNGAPNDGSVK 420
             E   KKRSRGGKRKR KKFA A +AAK+EE+S+PD+F+LIN  +    +   +  SVK
Sbjct: 361 KSEKQKKKRSRGGKRKRGKKFAEAAKAAKQEEESKPDLFSLINEQIFPTRHEKVHSESVK 420

Query: 421 KQKDKGSADGKKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETR 480
            +++KG      VDR+ L+ Y  EV+DL++ + KLE+MVNRNK + VV EAA R+L E R
Sbjct: 421 NRQNKG-----PVDRKALVEYQDEVRDLKLEMLKLEQMVNRNKKDLVVSEAATRRLKEVR 480

Query: 481 KALAEAEAAHASASNAVTSREKEKKWLK 497
           KALA   A  A+ASNA+ S+E EKKWLK
Sbjct: 481 KALASTLACQAAASNAIVSKENEKKWLK 496

BLAST of CmaCh05G003410.1 vs. Swiss-Prot
Match: C3H18_ORYSJ (Zinc finger CCCH domain-containing protein 18 OS=Oryza sativa subsp. japonica GN=Os02g0793000 PE=2 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 4.3e-148
Identity = 296/502 (58.96%), Positives = 372/502 (74.10%), Query Frame = 1

Query: 4   DEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLHLKR 63
           DE   +E QLE  L EQR SL A+ +ALA+D SN +LLEVH+EL+ AIKDAEEGLLHLKR
Sbjct: 8   DEAASIELQLEHHLQEQRASLTAVDEALAADPSNADLLEVHEELLAAIKDAEEGLLHLKR 67

Query: 64  SRLLREADLVLCGRD-SNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRWYD 123
           SRL+++ D +   ++ ++ A +V V+P    DVEPE  E Q F VGSKCRFRH DGRWY+
Sbjct: 68  SRLVKQIDEIFPNQEPTSEAPEVAVDP--PDDVEPEPLEPQEFSVGSKCRFRHKDGRWYN 127

Query: 124 GEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYAPT 183
           G ++GL+GS+ A+ISFLTPT+ENM +CKFFLQQRCRFG++CRLSHG+ IP+ SL+++ PT
Sbjct: 128 GCVIGLEGSSDARISFLTPTSENMSMCKFFLQQRCRFGSNCRLSHGIVIPILSLKQFTPT 187

Query: 184 IWNQSLTGSSIWALSS-RNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALSVR 243
            W QSL GSSI A S   +G+WR AELESWDD L++ QVVF+ DGSS +L  + +++S  
Sbjct: 188 RWQQSLVGSSILAASGHHSGLWRRAELESWDDDLKVGQVVFQDDGSSARLPSDSLSISEY 247

Query: 244 AQISD----GEESDSSLEKSDSSDYEDDDL-QGLGFLESSTQQRGIQMETTIFAKWENHT 303
           A  SD    G  SD   + S+  D ED+ + QGLG LES     G+Q ET IFAKWE+HT
Sbjct: 248 ADESDEDGEGSSSDEGSDFSEDGDQEDESVHQGLGLLESKNLS-GVQTETAIFAKWEHHT 307

Query: 304 RGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNTNDENNGK 363
           RG+ASKMMA MGYREGMGLG SGQGML+PIPVKVLP KQSLDHA+ + + N++     GK
Sbjct: 308 RGVASKMMAKMGYREGMGLGVSGQGMLDPIPVKVLPPKQSLDHAVAASEVNDS--VGPGK 367

Query: 364 KRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHLAMHNGAPNDGSVKKQKDKGSA 423
           KRSRGGKRKR+KKFA   +AAK EE+ R  VF+ IN+ L   + A       K+   G A
Sbjct: 368 KRSRGGKRKREKKFAEQARAAKAEEEER-SVFSFINSQLVGQDVAEGSAVKSKKDSSGEA 427

Query: 424 DG--KKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETRKALAEA 483
           +G  KK DRR+L+AYD EVK+LR R+EKLEEM+ RN+ +K  +EAA +KL +TRKALA+A
Sbjct: 428 NGHAKKEDRRSLLAYDDEVKELRSRVEKLEEMMKRNRKDKAFYEAASKKLKQTRKALADA 487

Query: 484 EAAHASASNAVTSREKEKKWLK 497
           EA HASA+NAV  +EKEKKWLK
Sbjct: 488 EATHASATNAVARKEKEKKWLK 503

BLAST of CmaCh05G003410.1 vs. Swiss-Prot
Match: PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 471.1 bits (1211), Expect = 3.7e-131
Identity = 238/671 (35.47%), Positives = 373/671 (55.59%), Query Frame = 1

Query: 516  HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
            H L +KLG  +D Y CN ++S Y+      SA+ +F  M  RD+V++NT+I G    G  
Sbjct: 311  HGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYG 370

Query: 576  ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
            E + E+ K M   G + D  T  S++   +  G L  GQQ+H+   K+GFA N     AL
Sbjct: 371  EKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGAL 430

Query: 636  LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
            L++YAKC  +E A   FL    +N V WN M+  Y    D   +F +   M+ E    + 
Sbjct: 431  LNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQ 490

Query: 696  GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
             ++  +L          L  Q+H ++IK   +    +C+ LI  Y++ G L  A  +   
Sbjct: 491  YTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIR 550

Query: 756  SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
             AG +D+VSW +++  +  +N +D A      M + G   D    T+ +SAC   +    
Sbjct: 551  FAG-KDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKE 610

Query: 816  GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
            G+ +H      G    +P  NAL+++Y +   G ++E+   FE  E  D ++WN++++G 
Sbjct: 611  GQQIHAQACVSGFSSDLPFQNALVTLYSRC--GKIEESYLAFEQTEAGDNIAWNALVSGF 670

Query: 876  SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQFHVLALKYGMDSNE 935
             Q G++E+A++ F+ M    +D + ++F + +K+ S+ A  + G+Q H +  K G DS  
Sbjct: 671  QQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSET 730

Query: 936  FVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEEK 995
             V ++LI MY+KCG + DA++ F   S  + ++WNA++  Y++HG    ALD F  M   
Sbjct: 731  EVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHS 790

Query: 996  KVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLDE 1055
             V+ +H+T V VL+ACSHIGLV++G  +   M S+YG+ P+ EHY C VD+  R+G L  
Sbjct: 791  NVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSR 850

Query: 1056 AKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYGD 1115
            AK  I+ MP KP+A+VW+T L AC    N+E+    A HLLE+EPE+  TYVLLSN+Y  
Sbjct: 851  AKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAV 910

Query: 1116 LMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVLMEE 1175
              +W+ +   ++ MKE+GVKK PG SWIEVKN +H+F   D++HP   +I+   + L + 
Sbjct: 911  SKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKR 970

Query: 1176 ITRIEAAADGF 1187
             + I    D F
Sbjct: 971  ASEIGYVQDCF 978

BLAST of CmaCh05G003410.1 vs. Swiss-Prot
Match: PP172_ARATH (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 3.9e-125
Identity = 235/658 (35.71%), Positives = 384/658 (58.36%), Query Frame = 1

Query: 516  HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
            H   IK G + DV    +++  Y K   F+    +FDEM  R+ V+W T+I+GY  +   
Sbjct: 116  HCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMN 175

Query: 576  ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
            +    +   M+  G   + +TF + L  +A  G+   G Q+H++++K G    +   ++L
Sbjct: 176  DEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSL 235

Query: 636  LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
            +++Y KC  +  A + F     ++ V+WN+MI+GYA  G    A  +   M     ++ +
Sbjct: 236  INLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSE 295

Query: 696  GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
             SFA ++ L  + +  R T Q+H  V+K+G      +  AL+ +YS+C +++DA R+F  
Sbjct: 296  SSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKE 355

Query: 756  SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
               V ++VSW +++  FL ++ ++ A  L  +M+  G  P+ ++Y+ I++A      S  
Sbjct: 356  IGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVISPSE- 415

Query: 816  GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
               +H  V+K   E+S  +  AL+  Y+K   G ++EA  +F  ++ KD V+W+++L G 
Sbjct: 416  ---VHAQVVKTNYERSSTVGTALLDAYVKL--GKVEEAAKVFSGIDDKDIVAWSAMLAGY 475

Query: 876  SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDL-ATFQLGQQFHVLALKYGMDSN 935
            +Q G +E A+K F  +    +  + ++FS++L  C+   A+   G+QFH  A+K  +DS+
Sbjct: 476  AQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSS 535

Query: 936  EFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEE 995
              VSS+L+ MY+K G +E A+  F+   +   ++WN+++ GYAQHGQ   ALD+F  M++
Sbjct: 536  LCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKK 595

Query: 996  KKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLD 1055
            +KVKMD +TF+ V  AC+H GLVE G ++   M  D  + P  EH +C VDLY R+G+L+
Sbjct: 596  RKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLE 655

Query: 1056 EAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYG 1115
            +A  +IE MP    + +W+T L ACR     EL    A  ++ M+PE+   YVLLSNMY 
Sbjct: 656  KAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYA 715

Query: 1116 DLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVL 1173
            +   W+E+A+V++LM ER VKK PG+SWIEVKNK ++F+A DRSHP   QIY  LE L
Sbjct: 716  ESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDL 767

BLAST of CmaCh05G003410.1 vs. TrEMBL
Match: A0A0A0LMK6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G382810 PE=4 SV=1)

HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 605/701 (86.31%), Positives = 655/701 (93.44%), Query Frame = 1

Query: 500  SVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDS 559
            S  GTSFRAL+NLLL HSLA+KLGTIADVYTCNNIL+GYWKCKE RSADVLFDEMP+RDS
Sbjct: 5    SAVGTSFRALANLLLNHSLAVKLGTIADVYTCNNILNGYWKCKELRSADVLFDEMPMRDS 64

Query: 560  VSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSM 619
            VSWNTMIAG+IN GNLE SW+VL+CMR  GF+ D YTFGSMLKGIA AGM  LGQQ+HS+
Sbjct: 65   VSWNTMIAGHINCGNLEASWDVLRCMRSCGFELDRYTFGSMLKGIAFAGMFHLGQQVHSI 124

Query: 620  IIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETA 679
            IIKMG+A NVYAGSALLDMYAKCE+LEDAYL+FL+ISK NTVSWNAMI GYAQ GDRETA
Sbjct: 125  IIKMGYAENVYAGSALLDMYAKCEKLEDAYLSFLSISKHNTVSWNAMINGYAQAGDRETA 184

Query: 680  FLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITS 739
            F LLDCMEQEGEKVDDG++APLLPLLDDA+FC LT Q+HGK+IKHGLE  NTMCNALITS
Sbjct: 185  FWLLDCMEQEGEKVDDGTYAPLLPLLDDADFCNLTSQLHGKIIKHGLELVNTMCNALITS 244

Query: 740  YSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYS 799
            YS+CGSL DAKR+F+ SAG+RDLV+WNSLL A+L+ +QEDLAFKLLIDMQEHGFEPDLYS
Sbjct: 245  YSKCGSLDDAKRIFDSSAGIRDLVTWNSLLAAYLLRSQEDLAFKLLIDMQEHGFEPDLYS 304

Query: 800  YTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFES 859
            YTSIISACFN+ +SNNG+SLHG+VIKRG EQSVPISNALISMYLKSD GSMKEALCIFES
Sbjct: 305  YTSIISACFNENISNNGRSLHGLVIKRGFEQSVPISNALISMYLKSDYGSMKEALCIFES 364

Query: 860  LEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLG 919
            LE KDRVSWNSILTGLSQ GSSEDAVKSFLHMRS AMDID YSFSAVL+SCSDLATFQLG
Sbjct: 365  LEFKDRVSWNSILTGLSQTGSSEDAVKSFLHMRSAAMDIDHYSFSAVLRSCSDLATFQLG 424

Query: 920  QQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQH 979
            QQ HVLALKYG++SNEFVSSSLIFMYSKCGI+EDA+RSFE ASK+SSITWNALMFGYAQH
Sbjct: 425  QQIHVLALKYGLESNEFVSSSLIFMYSKCGIIEDARRSFEEASKNSSITWNALMFGYAQH 484

Query: 980  GQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEH 1039
            GQC+VALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVE+GC+FLRCMESDYGVPPRMEH
Sbjct: 485  GQCNVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVEQGCKFLRCMESDYGVPPRMEH 544

Query: 1040 YACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEME 1099
            YACAVDLYGRSGRL+EAKALIE MPFKP+  VWKTFLGACRSCGN+ELACQVA HLLEME
Sbjct: 545  YACAVDLYGRSGRLEEAKALIEEMPFKPDTTVWKTFLGACRSCGNIELACQVAGHLLEME 604

Query: 1100 PEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSH 1159
            PEEHCTYVLLSNMYG+LMRW+EKA+VKRLMKERGVKK PGWSWIEV N VHAFIA+D SH
Sbjct: 605  PEEHCTYVLLSNMYGNLMRWDEKAKVKRLMKERGVKKVPGWSWIEVNNNVHAFIAQDHSH 664

Query: 1160 PSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYAYA 1201
            PSCQQIYFLLEVL+EEITR+E  ADGF+SFLEQEELSYA A
Sbjct: 665  PSCQQIYFLLEVLLEEITRME-DADGFKSFLEQEELSYANA 704

BLAST of CmaCh05G003410.1 vs. TrEMBL
Match: A0A061GHG1_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_036872 PE=4 SV=1)

HSP 1 Score: 964.9 bits (2493), Expect = 9.0e-278
Identity = 462/704 (65.62%), Positives = 574/704 (81.53%), Query Frame = 1

Query: 496  KSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMP 555
            + L S+  +S  A   +L TH  AIKLGT+ADVYT N IL+ Y +CKE   A  LF E+ 
Sbjct: 3    RPLNSLLESSAYAFYKVLTTHCCAIKLGTLADVYTANKILNAYARCKELHVARKLFAEVL 62

Query: 556  LRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQ 615
             RD+VSWNTMIAGY+N GNLE ++E++K M+R GFD D YTFGS+LKG+A A  L +GQQ
Sbjct: 63   HRDTVSWNTMIAGYVNCGNLETAFEIMKDMKRCGFDFDGYTFGSLLKGVASAYRLQVGQQ 122

Query: 616  IHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGD 675
            +HSMI+KMG+  NVYAGSALLDMYAKCE++ DAY+ F  + + N+VSWNA+IAG++Q GD
Sbjct: 123  LHSMIVKMGYEENVYAGSALLDMYAKCEKVGDAYMVFECLPEPNSVSWNALIAGFSQMGD 182

Query: 676  RETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNA 735
            R T F LLDCME+EG KVDDG++APLL LLDD EF +LT Q+HGK+IK GL   NT+CNA
Sbjct: 183  RSTVFWLLDCMEKEGVKVDDGTYAPLLTLLDDIEFYKLTIQIHGKIIKRGLACDNTVCNA 242

Query: 736  LITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEP 795
            +ITSYSECGS+ DA++VF+ + G+RDLV+WNS+L A+LVH +E+L FKL +DMQ  GFEP
Sbjct: 243  MITSYSECGSIGDARKVFDDAVGMRDLVTWNSMLAAYLVHEKEELGFKLFLDMQRLGFEP 302

Query: 796  DLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALC 855
            D+Y+YTSI+SACF K   ++GKS+H +VIKRGLE SVPISNALI+MYLKS+  SM+EAL 
Sbjct: 303  DIYTYTSILSACFEKAHKSHGKSVHAVVIKRGLEYSVPISNALIAMYLKSNSTSMEEALS 362

Query: 856  IFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLAT 915
            +FES+E+KDRVSWNSILTG SQ+G SEDA+  F  MR   ++ID Y+ SAVL+SCSDLAT
Sbjct: 363  LFESMELKDRVSWNSILTGFSQIGLSEDALNFFGKMRGFMVEIDHYALSAVLRSCSDLAT 422

Query: 916  FQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFG 975
             QLG+Q HVLA+K G ++N+FV+S+LIFMYSKCGI++DA++SFE   K  SI WN+++FG
Sbjct: 423  LQLGRQVHVLAIKLGFETNDFVASALIFMYSKCGIIQDARKSFEETPKDISIAWNSIIFG 482

Query: 976  YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1035
            YAQ+GQ + ALDLFFLM + KV++DHITFVAVLTACSHIGLVE G  FL+ MESDYG+PP
Sbjct: 483  YAQNGQGNDALDLFFLMRDTKVRLDHITFVAVLTACSHIGLVEEGLNFLKSMESDYGIPP 542

Query: 1036 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1095
            RMEHYACAVDL+GR+GRLDEAK LIE+MPFKP+AMVWKT LGACR CG++ELA QVA HL
Sbjct: 543  RMEHYACAVDLFGRAGRLDEAKPLIESMPFKPDAMVWKTLLGACRVCGDIELAAQVASHL 602

Query: 1096 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1155
            L++EPEEHCTYV+LSNMYG L RW EKA V RLM+ERGVKK PGWSWIE+KN+VHAF AE
Sbjct: 603  LDLEPEEHCTYVILSNMYGHLRRWGEKASVTRLMRERGVKKVPGWSWIEIKNQVHAFNAE 662

Query: 1156 DRSHPSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYAY 1200
            D+SHP C++IY +L  LMEEIT ++A   G ++     + +Y Y
Sbjct: 663  DQSHPHCKEIYQMLGGLMEEITWLDADT-GLDALTSDFDETYGY 705
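
The percentages on each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch, assuming the summary line has exactly the layout shown in this report; the names `parse_hsp_summary` and `percent` are hypothetical helpers introduced for illustration and are not part of any BLAST tool.

```python
import re

# Pattern for the per-HSP summary line as printed in this report.
# It accepts both "Positives" and the misspelling "Postives".
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract match counts and reported percentages from one summary line."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "length": int(length),
        "positive_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Recompute a percentage from a count and the alignment length."""
    return 100.0 * count / length


if __name__ == "__main__":
    line = ("Identity = 462/704 (65.62%), Postives = 574/704 (81.53%), "
            "Query Frame = 1")
    stats = parse_hsp_summary(line)
    # Compare the recomputed values with those printed in the report.
    print(stats["identity_pct"], percent(stats["identical"], stats["length"]))
    print(stats["positive_pct"], percent(stats["positive"], stats["length"]))
```

Recomputing the percentages this way is a quick consistency check when the summary lines are copied out of the report by hand.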

BLAST of CmaCh05G003410.1 vs. TrEMBL
Match: F6H3K3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0008g07050 PE=4 SV=1)

HSP 1 Score: 960.3 bits (2481), Expect = 2.2e-276
Identity = 462/702 (65.81%), Positives = 569/702 (81.05%), Query Frame = 1

Query: 495  LKSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEM 554
            ++ L+S++ +SF AL    + H LAIK GT A +YT NNI+SGY KC E R A  +F E 
Sbjct: 1    MRPLHSLSQSSFTALYRASVNHCLAIKSGTTASIYTANNIISGYAKCGEIRIASKMFGET 60

Query: 555  PLRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQ 614
              RD+VSWNTMIAG++N GN E + E LK M+R+GF  D Y+FGS+LKG+AC G +++GQ
Sbjct: 61   SQRDAVSWNTMIAGFVNLGNFETALEFLKSMKRYGFAVDGYSFGSILKGVACVGYVEVGQ 120

Query: 615  QIHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTG 674
            Q+HSM++KMG+ GNV+AGSALLDMYAKCER+EDA+  F +I+ +N+V+WNA+I+GYAQ G
Sbjct: 121  QVHSMMVKMGYEGNVFAGSALLDMYAKCERVEDAFEVFKSINIRNSVTWNALISGYAQVG 180

Query: 675  DRETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCN 734
            DR TAF LLDCME EG ++DDG+FAPLL LLDD +  +LT QVH K++KHGL S  T+CN
Sbjct: 181  DRGTAFWLLDCMELEGVEIDDGTFAPLLTLLDDPDLHKLTTQVHAKIVKHGLASDTTVCN 240

Query: 735  ALITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFE 794
            A+IT+YSECGS+ DA+RVF+ +   RDLV+WNS+L A+LV+NQE+ AF+L ++MQ  GFE
Sbjct: 241  AIITAYSECGSIEDAERVFDGAIETRDLVTWNSMLAAYLVNNQEEEAFQLFLEMQVLGFE 300

Query: 795  PDLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEAL 854
            PD+Y+YTS+ISA F       GKSLHG+VIKRGLE  VPISN+LI+MYLKS   SM EAL
Sbjct: 301  PDIYTYTSVISAAFEGSHQGQGKSLHGLVIKRGLEFLVPISNSLIAMYLKSHSKSMDEAL 360

Query: 855  CIFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLA 914
             IFESLE KD VSWNSILTG SQ G SEDA+K F +MRS  + ID Y+FSAVL+SCSDLA
Sbjct: 361  NIFESLENKDHVSWNSILTGFSQSGLSEDALKFFENMRSQYVVIDHYAFSAVLRSCSDLA 420

Query: 915  TFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMF 974
            T QLGQQ HVL LK G + N FV+SSLIFMYSKCG++EDA++SF+   K SSI WN+L+F
Sbjct: 421  TLQLGQQVHVLVLKSGFEPNGFVASSLIFMYSKCGVIEDARKSFDATPKDSSIAWNSLIF 480

Query: 975  GYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVP 1034
            GYAQHG+  +ALDLFFLM++++VK+DHITFVAVLTACSHIGLVE G  FL+ MESDYG+P
Sbjct: 481  GYAQHGRGKIALDLFFLMKDRRVKLDHITFVAVLTACSHIGLVEEGWSFLKSMESDYGIP 540

Query: 1035 PRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARH 1094
            PRMEHYAC +DL GR+GRLDEAKALIEAMPF+P+AMVWKT LGACR+CG++ELA QVA H
Sbjct: 541  PRMEHYACMIDLLGRAGRLDEAKALIEAMPFEPDAMVWKTLLGACRTCGDIELASQVASH 600

Query: 1095 LLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIA 1154
            LLE+EPEEHCTYVLLS+M+G L RW EKA +KRLMKERGVKK PGWSWIEVKN+V +F A
Sbjct: 601  LLELEPEEHCTYVLLSSMFGHLRRWNEKASIKRLMKERGVKKVPGWSWIEVKNEVRSFNA 660

Query: 1155 EDRSHPSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELS 1197
            EDRSHP+C++IY  L  LMEEI R++  A+    FL+   LS
Sbjct: 661  EDRSHPNCEEIYLRLGELMEEIRRLDYVAN--SEFLQNNLLS 700

BLAST of CmaCh05G003410.1 vs. TrEMBL
Match: W9S7C5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007144 PE=4 SV=1)

HSP 1 Score: 954.1 bits (2465), Expect = 1.6e-274
Identity = 455/683 (66.62%), Positives = 556/683 (81.41%), Query Frame = 1

Query: 503  GTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSW 562
            G S  + + +L+THS AIK GT++D+Y  NNIL GY + +E   A  LFDEM  RDSVSW
Sbjct: 9    GRSPNSFAKVLITHSWAIKSGTLSDIYIANNILCGYSRRQESWLAHKLFDEMSQRDSVSW 68

Query: 563  NTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIK 622
            NTMIAG +N GN EN+WE  K M++ GF+ D YTFGS+LKG+A A    +GQQ+HSMI+K
Sbjct: 69   NTMIAGNVNRGNFENAWEFFKNMKKCGFELDGYTFGSLLKGVASAHQWSIGQQVHSMIVK 128

Query: 623  MGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLL 682
            MG+A NVY GSALLDMYAKC R+EDA+L    + ++N VSWNA+I+GY Q GDR+TAFLL
Sbjct: 129  MGYAENVYCGSALLDMYAKCGRVEDAFLVLEGMPERNPVSWNALISGYVQLGDRDTAFLL 188

Query: 683  LDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSE 742
              CMEQEG K++DG+ APLL LLDDAEF   T Q+HGK IKHGLE  N +CNA ITSYSE
Sbjct: 189  FACMEQEGLKIEDGTIAPLLTLLDDAEFYLSTMQMHGKAIKHGLEFENKVCNATITSYSE 248

Query: 743  CGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTS 802
            CGS+ DAK+VF+ S G RDLV+WNS+LGA+LVHN+E+ AF L IDMQ  GFEPD+YSYTS
Sbjct: 249  CGSIADAKKVFDGSFGTRDLVTWNSMLGAYLVHNKEECAFNLFIDMQRFGFEPDIYSYTS 308

Query: 803  IISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEI 862
            IISACF +E   +GKSLHG++IKRGLEQSVP+ NALI+MYLKS+  SM E L IFES+E 
Sbjct: 309  IISACFEEEHKKHGKSLHGLIIKRGLEQSVPVCNALIAMYLKSNTRSMVEPLSIFESMEF 368

Query: 863  KDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQF 922
            KDRVSWNSILTGLSQ+G SEDA+K F HM+   ++ID YSFSAVL+SC+DLAT QLGQQ 
Sbjct: 369  KDRVSWNSILTGLSQVGLSEDALKFFGHMQFAILEIDHYSFSAVLRSCADLATLQLGQQV 428

Query: 923  HVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQC 982
            HVLA+K G++SNEFV SSLIFMY+KCGI+EDA++SFE   K SSITWN+++F YAQHGQ 
Sbjct: 429  HVLAIKSGLNSNEFVVSSLIFMYAKCGIIEDARKSFEENPKDSSITWNSIIFAYAQHGQG 488

Query: 983  HVALDLFFLME-EKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYA 1042
            ++ALD F  M+  ++VK+DHITFVAVLTACSH+GLVE GC+ L+ ME  +G+PPR+EHYA
Sbjct: 489  YIALDFFSQMKMREEVKLDHITFVAVLTACSHMGLVEEGCKLLKSMEFKHGIPPRVEHYA 548

Query: 1043 CAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPE 1102
            CAVD+YGR+GRLDEAKAL+E+MPF+P+AMVWKT L ACR+CGN+E A QVA HLL++EPE
Sbjct: 549  CAVDMYGRAGRLDEAKALVESMPFEPDAMVWKTLLSACRACGNIEFASQVASHLLDVEPE 608

Query: 1103 EHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPS 1162
            EHCTYV+LS++Y  L RW+E A VKRLM++RGVKK PGWSWIE+KN+VHAF AEDR HP+
Sbjct: 609  EHCTYVILSDLYRLLRRWDESASVKRLMRQRGVKKVPGWSWIEIKNEVHAFKAEDRLHPN 668

Query: 1163 CQQIYFLLEVLMEEITRIEAAAD 1185
               IYFLL V M+EI R++   D
Sbjct: 669  SDDIYFLLGVFMDEIRRLDDVDD 691

BLAST of CmaCh05G003410.1 vs. TrEMBL
Match: A0A0D2N583_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G039300 PE=4 SV=1)

HSP 1 Score: 942.2 bits (2434), Expect = 6.2e-271
Identity = 446/685 (65.11%), Positives = 564/685 (82.34%), Query Frame = 1

Query: 496  KSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMP 555
            + L S+  +S  A   +L TH  A+KLGT+ADVYT N IL+ Y + KE   A  LFDE+P
Sbjct: 3    RPLNSLIQSSAYAFYKVLTTHCHALKLGTLADVYTANKILNAYTRWKELHIARKLFDEIP 62

Query: 556  LRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQ 615
             RD+VSWNTMIAG++N GNLE + ++LK MR   FD D Y+FGS+LKG+A A  L++GQQ
Sbjct: 63   HRDTVSWNTMIAGFVNCGNLETACKILKNMRICDFDFDGYSFGSLLKGVASAYRLEVGQQ 122

Query: 616  IHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGD 675
            +HS++IKMG+  NVYAGSALLDMYAKCE++EDAY  F  + + N+VSWNA+IAG+++ GD
Sbjct: 123  LHSIVIKMGYEENVYAGSALLDMYAKCEKVEDAYTVFEYLPEPNSVSWNALIAGFSKVGD 182

Query: 676  RETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNA 735
            R TAF LL CME+EG + +DG+FAPLL LLDD EF +LT Q+HGK++KHGL   NT+CNA
Sbjct: 183  RSTAFWLLHCMEKEGVRAEDGTFAPLLTLLDDIEFYKLTIQIHGKIVKHGLAFDNTVCNA 242

Query: 736  LITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEP 795
            +IT+YSECGS+ D ++VF+ + G+RDLV+WNS+L A+LVH +E+L F+L +DMQ  GFEP
Sbjct: 243  MITAYSECGSIRDGRKVFDGAVGMRDLVTWNSMLAAYLVHEEEELGFQLFLDMQRLGFEP 302

Query: 796  DLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALC 855
            D+Y+YTSI+S CF K   ++G+SLH +VIKRGLE  VPISNALI+MYLKS+  SM EAL 
Sbjct: 303  DIYTYTSILSGCFEKAHKSHGQSLHAVVIKRGLEYLVPISNALIAMYLKSNNTSMGEALK 362

Query: 856  IFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLAT 915
            +FES+E+KDRVSWNSILTG SQ+G +EDA+K F  MRSL ++ID Y+FSAVL+SC+DLAT
Sbjct: 363  LFESMELKDRVSWNSILTGFSQIGLNEDALKLFGQMRSLMVEIDHYAFSAVLRSCADLAT 422

Query: 916  FQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFG 975
             QLG+Q HVLA+K G ++N+FV+S+LIF+YSKCGI+EDA++SFE     SSI WN+L+FG
Sbjct: 423  LQLGRQVHVLAIKSGFETNDFVASALIFLYSKCGIIEDARKSFEETPNDSSIAWNSLIFG 482

Query: 976  YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1035
            YAQ+GQ  +ALDLFFLM ++KV++DHITFVAVLTACSHIGLVE G  FL+ MESDYG+PP
Sbjct: 483  YAQNGQGSIALDLFFLMRDRKVRLDHITFVAVLTACSHIGLVEEGLNFLKSMESDYGIPP 542

Query: 1036 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1095
            RMEHYACAVDL GR+ RL EA+ LIE+MPFKP+AMVWKT LGACR CG++ELA QVA HL
Sbjct: 543  RMEHYACAVDLLGRARRLGEARTLIESMPFKPDAMVWKTLLGACRVCGDIELATQVASHL 602

Query: 1096 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1155
            LE+EPEEHCTYVLLS++YG L RW+EKA + RLM+ERGVKK PGWSWIE+KN+VHAF AE
Sbjct: 603  LELEPEEHCTYVLLSHLYGHLRRWDEKANLTRLMRERGVKKVPGWSWIEIKNQVHAFNAE 662

Query: 1156 DRSHPSCQQIYFLLEVLMEEITRIE 1181
            D+SHP C++IY +L  LMEEIT ++
Sbjct: 663  DQSHPLCKEIYQMLGELMEEITWLD 687

BLAST of CmaCh05G003410.1 vs. TAIR10
Match: AT3G25970.1 (AT3G25970.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 796.6 bits (2056), Expect = 2.1e-230
Identity = 387/686 (56.41%), Positives = 505/686 (73.62%), Query Frame = 1

Query: 498  LYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLR 557
            L S+  +S  +   L LTH  AIK G+I+D+Y  N IL  Y K      A++LFDEMP R
Sbjct: 5    LASLLESSLNSFQKLSLTHCYAIKCGSISDIYVSNRILDSYIKFGFLGYANMLFDEMPKR 64

Query: 558  DSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIH 617
            DSVSWNTMI+GY + G LE++W +  CM+R G D D Y+F  +LKGIA     DLG+Q+H
Sbjct: 65   DSVSWNTMISGYTSCGKLEDAWCLFTCMKRSGSDVDGYSFSRLLKGIASVKRFDLGEQVH 124

Query: 618  SMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRE 677
             ++IK G+  NVY GS+L+DMYAKCER+EDA+  F  IS+ N+VSWNA+IAG+ Q  D +
Sbjct: 125  GLVIKGGYECNVYVGSSLVDMYAKCERVEDAFEAFKEISEPNSVSWNALIAGFVQVRDIK 184

Query: 678  TAFLLLDCMEQEGE-KVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNAL 737
            TAF LL  ME +    +D G+FAPLL LLDD  FC L +QVH KV+K GL+   T+CNA+
Sbjct: 185  TAFWLLGLMEMKAAVTMDAGTFAPLLTLLDDPMFCNLLKQVHAKVLKLGLQHEITICNAM 244

Query: 738  ITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPD 797
            I+SY++CGS+ DAKRVF+   G +DL+SWNS++  F  H  ++ AF+L I MQ H  E D
Sbjct: 245  ISSYADCGSVSDAKRVFDGLGGSKDLISWNSMIAGFSKHELKESAFELFIQMQRHWVETD 304

Query: 798  LYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCI 857
            +Y+YT ++SAC  +E    GKSLHGMVIK+GLEQ    +NALISMY++   G+M++AL +
Sbjct: 305  IYTYTGLLSACSGEEHQIFGKSLHGMVIKKGLEQVTSATNALISMYIQFPTGTMEDALSL 364

Query: 858  FESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATF 917
            FESL+ KD +SWNSI+TG +Q G SEDAVK F ++RS  + +D Y+FSA+L+SCSDLAT 
Sbjct: 365  FESLKSKDLISWNSIITGFAQKGLSEDAVKFFSYLRSSEIKVDDYAFSALLRSCSDLATL 424

Query: 918  QLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEG-ASKSSSITWNALMFG 977
            QLGQQ H LA K G  SNEFV SSLI MYSKCGI+E A++ F+  +SK S++ WNA++ G
Sbjct: 425  QLGQQIHALATKSGFVSNEFVISSLIVMYSKCGIIESARKCFQQISSKHSTVAWNAMILG 484

Query: 978  YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1037
            YAQHG   V+LDLF  M  + VK+DH+TF A+LTACSH GL++ G E L  ME  Y + P
Sbjct: 485  YAQHGLGQVSLDLFSQMCNQNVKLDHVTFTAILTACSHTGLIQEGLELLNLMEPVYKIQP 544

Query: 1038 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1097
            RMEHYA AVDL GR+G +++AK LIE+MP  P+ MV KTFLG CR+CG +E+A QVA HL
Sbjct: 545  RMEHYAAAVDLLGRAGLVNKAKELIESMPLNPDPMVLKTFLGVCRACGEIEMATQVANHL 604

Query: 1098 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1157
            LE+EPE+H TYV LS+MY DL +WEEKA VK++MKERGVKK PGWSWIE++N+V AF AE
Sbjct: 605  LEIEPEDHFTYVSLSHMYSDLKKWEEKASVKKMMKERGVKKVPGWSWIEIRNQVKAFNAE 664

Query: 1158 DRSHPSCQQIYFLLEVLMEEITRIEA 1182
            DRS+P CQ IY +++ L +E+  +++
Sbjct: 665  DRSNPLCQDIYMMIKDLTQEMQWLDS 690

BLAST of CmaCh05G003410.1 vs. TAIR10
Match: AT2G24830.1 (AT2G24830.1 zinc finger (CCCH-type) family protein / D111/G-patch domain-containing protein)

HSP 1 Score: 537.0 bits (1382), Expect = 3.1e-152
Identity = 296/508 (58.27%), Positives = 378/508 (74.41%), Query Frame = 1

Query: 1   MANDEERVLEHQLEVQLHEQRESLAALQDALASDASNPELLEVHDELVQAIKDAEEGLLH 60
           MA++E   LE+ L++QL EQ+ESL+++ +AL SD SNPELL VH+EL+ AIK+ EEGLLH
Sbjct: 1   MASEENNDLENLLDIQLIEQKESLSSIDEALLSDPSNPELLSVHEELLSAIKEVEEGLLH 60

Query: 61  LKRSRLLREADLVLCGRDSNAAEDVKVEPLHSTDVEPESPEDQSFVVGSKCRFRHTDGRW 120
           LKR+RLL EAD+VL G + +A     V+P H   +EPE  E++  + GSKCRFRHTDGRW
Sbjct: 61  LKRARLLEEADIVLNGLNHDAG----VKPEH---LEPEKTEEKKDLDGSKCRFRHTDGRW 120

Query: 121 YDGEIVGLDGSNSAKISFLTPTTENMLICKFFLQQRCRFGTSCRLSHGVDIPLTSLRRYA 180
           Y+G I+G +GS+SAKISFLTPT+E+M+ICKFF+QQRCRFG+SCR SHG+D+P++SL+ Y 
Sbjct: 121 YNGRIIGFEGSDSAKISFLTPTSESMMICKFFMQQRCRFGSSCRSSHGLDVPISSLKNYE 180

Query: 181 PTIWNQSLTGSSIWALS-SRNGIWRHAELESWDDALQIAQVVFKGDGSSQKLGPEDIALS 240
            T W Q + GS IWA+S S+  IWR AELESWDD LQ+  VVF+ D SS KLG + +ALS
Sbjct: 181 QTEWKQLMVGSKIWAVSGSKYDIWRKAELESWDDELQVGGVVFRDDKSSAKLGSDSLALS 240

Query: 241 VRAQISD--GEESD--------SSLEKSDSSDYEDDDLQGLGFLESSTQQRGIQMETTIF 300
             AQ++D  GEE +        S  E S SSDY++   QG+GFLES+   RG+Q +T +F
Sbjct: 241 EYAQMTDDDGEEEEEEDEQQSASDSEDSVSSDYDEGSPQGIGFLESTNLPRGVQTDTALF 300

Query: 301 AKWENHTRGIASKMMANMGYREGMGLGASGQGMLNPIPVKVLPAKQSLDHALESQKENNT 360
           AKWENHTRGIASKMMA+MGYREGMGLG SGQG+LNPI VKVLPAK+SLD+ALE  +    
Sbjct: 301 AKWENHTRGIASKMMASMGYREGMGLGVSGQGILNPILVKVLPAKRSLDYALEHIRNGEC 360

Query: 361 NDENNGKKRSRGGKRKRDKKFAAAMQAAKEEEDSRPDVFNLINNHL-AMHNGAPNDGSVK 420
             E   KKRSRGGKRKR KKFA A +AAK+EE+S+PD+F+LIN  +    +   +  SVK
Sbjct: 361 KSEKQKKKRSRGGKRKRGKKFAEAAKAAKQEEESKPDLFSLINEQIFPTRHEKVHSESVK 420

Query: 421 KQKDKGSADGKKVDRRTLIAYDGEVKDLRVRIEKLEEMVNRNKNEKVVFEAALRKLNETR 480
            +++KG      VDR+ L+ Y  EV+DL++ + KLE+MVNRNK + VV EAA R+L E R
Sbjct: 421 NRQNKG-----PVDRKALVEYQDEVRDLKLEMLKLEQMVNRNKKDLVVSEAATRRLKEVR 480

Query: 481 KALAEAEAAHASASNAVTSREKEKKWLK 497
           KALA   A  A+ASNA+ S+E EKKWLK
Sbjct: 481 KALASTLACQAAASNAIVSKENEKKWLK 496

BLAST of CmaCh05G003410.1 vs. TAIR10
Match: AT4G13650.1 (AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 471.1 bits (1211), Expect = 2.1e-132
Identity = 238/671 (35.47%), Positives = 373/671 (55.59%), Query Frame = 1

Query: 516  HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
            H L +KLG  +D Y CN ++S Y+      SA+ +F  M  RD+V++NT+I G    G  
Sbjct: 311  HGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYG 370

Query: 576  ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
            E + E+ K M   G + D  T  S++   +  G L  GQQ+H+   K+GFA N     AL
Sbjct: 371  EKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGAL 430

Query: 636  LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
            L++YAKC  +E A   FL    +N V WN M+  Y    D   +F +   M+ E    + 
Sbjct: 431  LNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQ 490

Query: 696  GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
             ++  +L          L  Q+H ++IK   +    +C+ LI  Y++ G L  A  +   
Sbjct: 491  YTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIR 550

Query: 756  SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
             AG +D+VSW +++  +  +N +D A      M + G   D    T+ +SAC   +    
Sbjct: 551  FAG-KDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKE 610

Query: 816  GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
            G+ +H      G    +P  NAL+++Y +   G ++E+   FE  E  D ++WN++++G 
Sbjct: 611  GQQIHAQACVSGFSSDLPFQNALVTLYSRC--GKIEESYLAFEQTEAGDNIAWNALVSGF 670

Query: 876  SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLGQQFHVLALKYGMDSNE 935
             Q G++E+A++ F+ M    +D + ++F + +K+ S+ A  + G+Q H +  K G DS  
Sbjct: 671  QQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSET 730

Query: 936  FVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEEK 995
             V ++LI MY+KCG + DA++ F   S  + ++WNA++  Y++HG    ALD F  M   
Sbjct: 731  EVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHS 790

Query: 996  KVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLDE 1055
             V+ +H+T V VL+ACSHIGLV++G  +   M S+YG+ P+ EHY C VD+  R+G L  
Sbjct: 791  NVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSR 850

Query: 1056 AKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYGD 1115
            AK  I+ MP KP+A+VW+T L AC    N+E+    A HLLE+EPE+  TYVLLSN+Y  
Sbjct: 851  AKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAV 910

Query: 1116 LMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVLMEE 1175
              +W+ +   ++ MKE+GVKK PG SWIEVKN +H+F   D++HP   +I+   + L + 
Sbjct: 911  SKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKR 970

Query: 1176 ITRIEAAADGF 1187
             + I    D F
Sbjct: 971  ASEIGYVQDCF 978

BLAST of CmaCh05G003410.1 vs. TAIR10
Match: AT2G27610.1 (AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 451.1 bits (1159), Expect = 2.2e-126
Identity = 235/658 (35.71%), Positives = 384/658 (58.36%), Query Frame = 1

Query: 516  HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
            H   IK G + DV    +++  Y K   F+    +FDEM  R+ V+W T+I+GY  +   
Sbjct: 116  HCQCIKFGFLDDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVVTWTTLISGYARNSMN 175

Query: 576  ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
            +    +   M+  G   + +TF + L  +A  G+   G Q+H++++K G    +   ++L
Sbjct: 176  DEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVVVKNGLDKTIPVSNSL 235

Query: 636  LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
            +++Y KC  +  A + F     ++ V+WN+MI+GYA  G    A  +   M     ++ +
Sbjct: 236  INLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEALGMFYSMRLNYVRLSE 295

Query: 696  GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
             SFA ++ L  + +  R T Q+H  V+K+G      +  AL+ +YS+C +++DA R+F  
Sbjct: 296  SSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDALRLFKE 355

Query: 756  SAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYSYTSIISACFNKELSNN 815
               V ++VSW +++  FL ++ ++ A  L  +M+  G  P+ ++Y+ I++A      S  
Sbjct: 356  IGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPVISPSE- 415

Query: 816  GKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFESLEIKDRVSWNSILTGL 875
               +H  V+K   E+S  +  AL+  Y+K   G ++EA  +F  ++ KD V+W+++L G 
Sbjct: 416  ---VHAQVVKTNYERSSTVGTALLDAYVKL--GKVEEAAKVFSGIDDKDIVAWSAMLAGY 475

Query: 876  SQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDL-ATFQLGQQFHVLALKYGMDSN 935
            +Q G +E A+K F  +    +  + ++FS++L  C+   A+   G+QFH  A+K  +DS+
Sbjct: 476  AQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKSRLDSS 535

Query: 936  EFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQHGQCHVALDLFFLMEE 995
              VSS+L+ MY+K G +E A+  F+   +   ++WN+++ GYAQHGQ   ALD+F  M++
Sbjct: 536  LCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVFKEMKK 595

Query: 996  KKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEHYACAVDLYGRSGRLD 1055
            +KVKMD +TF+ V  AC+H GLVE G ++   M  D  + P  EH +C VDLY R+G+L+
Sbjct: 596  RKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSRAGQLE 655

Query: 1056 EAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEMEPEEHCTYVLLSNMYG 1115
            +A  +IE MP    + +W+T L ACR     EL    A  ++ M+PE+   YVLLSNMY 
Sbjct: 656  KAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLLSNMYA 715

Query: 1116 DLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSHPSCQQIYFLLEVL 1173
            +   W+E+A+V++LM ER VKK PG+SWIEVKNK ++F+A DRSHP   QIY  LE L
Sbjct: 716  ESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKLEDL 767

BLAST of CmaCh05G003410.1 vs. TAIR10
Match: AT3G02330.1 (AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 416.0 bits (1068), Expect = 7.9e-116
Identity = 224/683 (32.80%), Positives = 375/683 (54.90%), Query Frame = 1

Query: 516  HSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDSVSWNTMIAGYINSGNL 575
            H + +++G   DV   + +L  Y K K F  +  +F  +P ++SVSW+ +IAG + +  L
Sbjct: 203  HGIVVRVGCDTDVVAASALLDMYAKGKRFVESLRVFQGIPEKNSVSWSAIIAGCVQNNLL 262

Query: 576  ENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSMIIKMGFAGNVYAGSAL 635
              + +  K M++      +  + S+L+  A    L LG Q+H+  +K  FA +    +A 
Sbjct: 263  SLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHALKSDFAADGIVRTAT 322

Query: 636  LDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETAFLLLDCMEQEGEKVDD 695
            LDMYAKC+ ++DA + F N    N  S+NAMI GY+Q      A LL   +   G   D+
Sbjct: 323  LDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQEEHGFKALLLFHRLMSSGLGFDE 382

Query: 696  GSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITSYSECGSLVDAKRVFNC 755
             S + +       +      Q++G  IK  L     + NA I  Y +C +L +A RVF+ 
Sbjct: 383  ISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMYGKCQALAEAFRVFD- 442

Query: 756  SAGVRDLVSWNSLLGAFLVHNQEDLAFK---LLIDMQEHGFEPDLYSYTSIISACFNKEL 815
                RD VSWN+++ A   H Q    ++   L + M     EPD +++ SI+ AC    L
Sbjct: 443  EMRRRDAVSWNAIIAA---HEQNGKGYETLFLFVSMLRSRIEPDEFTFGSILKACTGGSL 502

Query: 816  SNNGKSLHGMVIKRGLEQSVPISNALISMYLKSD------------------GGSMKEAL 875
               G  +H  ++K G+  +  +  +LI MY K                     G+M+E  
Sbjct: 503  GY-GMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRANVSGTMEELE 562

Query: 876  CIFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLA 935
             +      +  VSWNSI++G      SEDA   F  M  + +  D+++++ VL +C++LA
Sbjct: 563  KMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPDKFTYATVLDTCANLA 622

Query: 936  TFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMF 995
            +  LG+Q H   +K  + S+ ++ S+L+ MYSKCG + D++  FE + +   +TWNA++ 
Sbjct: 623  SAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLRRDFVTWNAMIC 682

Query: 996  GYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVP 1055
            GYA HG+   A+ LF  M  + +K +H+TF+++L AC+H+GL+++G E+   M+ DYG+ 
Sbjct: 683  GYAHHGKGEEAIQLFERMILENIKPNHVTFISILRACAHMGLIDKGLEYFYMMKRDYGLD 742

Query: 1056 PRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACR-SCGNVELACQVAR 1115
            P++ HY+  VD+ G+SG++  A  LI  MPF+ + ++W+T LG C     NVE+A +   
Sbjct: 743  PQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIHRNNVEVAEEATA 802

Query: 1116 HLLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFI 1175
             LL ++P++   Y LLSN+Y D   WE+ + ++R M+   +KK PG SW+E+K+++H F+
Sbjct: 803  ALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCSWVELKDELHVFL 862

Query: 1176 AEDRSHPSCQQIYFLLEVLMEEI 1177
              D++HP  ++IY  L ++  E+
Sbjct: 863  VGDKAHPRWEEIYEELGLIYSEM 880

BLAST of CmaCh05G003410.1 vs. NCBI nr
Match: gi|449442142|ref|XP_004138841.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Cucumis sativus])

HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 605/701 (86.31%), Positives = 655/701 (93.44%), Query Frame = 1

Query: 500  SVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDS 559
            S  GTSFRAL+NLLL HSLA+KLGTIADVYTCNNIL+GYWKCKE RSADVLFDEMP+RDS
Sbjct: 5    SAVGTSFRALANLLLNHSLAVKLGTIADVYTCNNILNGYWKCKELRSADVLFDEMPMRDS 64

Query: 560  VSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSM 619
            VSWNTMIAG+IN GNLE SW+VL+CMR  GF+ D YTFGSMLKGIA AGM  LGQQ+HS+
Sbjct: 65   VSWNTMIAGHINCGNLEASWDVLRCMRSCGFELDRYTFGSMLKGIAFAGMFHLGQQVHSI 124

Query: 620  IIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETA 679
            IIKMG+A NVYAGSALLDMYAKCE+LEDAYL+FL+ISK NTVSWNAMI GYAQ GDRETA
Sbjct: 125  IIKMGYAENVYAGSALLDMYAKCEKLEDAYLSFLSISKHNTVSWNAMINGYAQAGDRETA 184

Query: 680  FLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITS 739
            F LLDCMEQEGEKVDDG++APLLPLLDDA+FC LT Q+HGK+IKHGLE  NTMCNALITS
Sbjct: 185  FWLLDCMEQEGEKVDDGTYAPLLPLLDDADFCNLTSQLHGKIIKHGLELVNTMCNALITS 244

Query: 740  YSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYS 799
            YS+CGSL DAKR+F+ SAG+RDLV+WNSLL A+L+ +QEDLAFKLLIDMQEHGFEPDLYS
Sbjct: 245  YSKCGSLDDAKRIFDSSAGIRDLVTWNSLLAAYLLRSQEDLAFKLLIDMQEHGFEPDLYS 304

Query: 800  YTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFES 859
            YTSIISACFN+ +SNNG+SLHG+VIKRG EQSVPISNALISMYLKSD GSMKEALCIFES
Sbjct: 305  YTSIISACFNENISNNGRSLHGLVIKRGFEQSVPISNALISMYLKSDYGSMKEALCIFES 364

Query: 860  LEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLG 919
            LE KDRVSWNSILTGLSQ GSSEDAVKSFLHMRS AMDID YSFSAVL+SCSDLATFQLG
Sbjct: 365  LEFKDRVSWNSILTGLSQTGSSEDAVKSFLHMRSAAMDIDHYSFSAVLRSCSDLATFQLG 424

Query: 920  QQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQH 979
            QQ HVLALKYG++SNEFVSSSLIFMYSKCGI+EDA+RSFE ASK+SSITWNALMFGYAQH
Sbjct: 425  QQIHVLALKYGLESNEFVSSSLIFMYSKCGIIEDARRSFEEASKNSSITWNALMFGYAQH 484

Query: 980  GQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEH 1039
            GQC+VALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVE+GC+FLRCMESDYGVPPRMEH
Sbjct: 485  GQCNVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVEQGCKFLRCMESDYGVPPRMEH 544

Query: 1040 YACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEME 1099
            YACAVDLYGRSGRL+EAKALIE MPFKP+  VWKTFLGACRSCGN+ELACQVA HLLEME
Sbjct: 545  YACAVDLYGRSGRLEEAKALIEEMPFKPDTTVWKTFLGACRSCGNIELACQVAGHLLEME 604

Query: 1100 PEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSH 1159
            PEEHCTYVLLSNMYG+LMRW+EKA+VKRLMKERGVKK PGWSWIEV N VHAFIA+D SH
Sbjct: 605  PEEHCTYVLLSNMYGNLMRWDEKAKVKRLMKERGVKKVPGWSWIEVNNNVHAFIAQDHSH 664

Query: 1160 PSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYAYA 1201
            PSCQQIYFLLEVL+EEITR+E  ADGF+SFLEQEELSYA A
Sbjct: 665  PSCQQIYFLLEVLLEEITRME-DADGFKSFLEQEELSYANA 704

BLAST of CmaCh05G003410.1 vs. NCBI nr
Match: gi|659089006|ref|XP_008445278.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Cucumis melo])

HSP 1 Score: 1239.2 bits (3205), Expect = 0.0e+00
Identity = 601/699 (85.98%), Positives = 652/699 (93.28%), Query Frame = 1

Query: 500  SVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMPLRDS 559
            S  GTSFRALSNLLL HSLA+KLGTIADVYTCNNIL+GYWKCKE RSAD+LFDEMPLRDS
Sbjct: 5    SAVGTSFRALSNLLLNHSLAVKLGTIADVYTCNNILNGYWKCKELRSADILFDEMPLRDS 64

Query: 560  VSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQIHSM 619
            VSWNTM+AG+IN GNLE SW+VLKCMRR GF+ D YTFGSMLKGIA AGM DLGQQ+HSM
Sbjct: 65   VSWNTMVAGHINCGNLEASWDVLKCMRRCGFEMDRYTFGSMLKGIAFAGMFDLGQQVHSM 124

Query: 620  IIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGDRETA 679
            IIKM +AGNVYAGSALLDMYAKCERLEDAYL+FL+ISK+NTVSWNAMIAGYAQ GDRETA
Sbjct: 125  IIKMDYAGNVYAGSALLDMYAKCERLEDAYLSFLSISKKNTVSWNAMIAGYAQAGDRETA 184

Query: 680  FLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNALITS 739
            F LLDCMEQEGEKVDDG++APLLPLLDDA+FC LT Q+HGK+IKHGLE  NTMCNALITS
Sbjct: 185  FWLLDCMEQEGEKVDDGTYAPLLPLLDDADFCNLTSQLHGKIIKHGLEFVNTMCNALITS 244

Query: 740  YSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEPDLYS 799
            YS+CGSL DAKR+F+ SAG+ DLV+WNSLL A+L+ ++EDLAFKLLIDMQEHGFEPDLYS
Sbjct: 245  YSKCGSLDDAKRIFDSSAGIWDLVTWNSLLAAYLLRSREDLAFKLLIDMQEHGFEPDLYS 304

Query: 800  YTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALCIFES 859
            YTSIISACFN++LSNNGKSLHG+VIKRG EQSVPISNALISMYLKSD GSMKEALCIFES
Sbjct: 305  YTSIISACFNEKLSNNGKSLHGLVIKRGFEQSVPISNALISMYLKSDYGSMKEALCIFES 364

Query: 860  LEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLATFQLG 919
            LE KDRVSWNSILTGLSQ G SEDAVKSFL+MRS AMDID YSFSAVL+SCSDLATFQLG
Sbjct: 365  LEFKDRVSWNSILTGLSQTGLSEDAVKSFLYMRSAAMDIDHYSFSAVLRSCSDLATFQLG 424

Query: 920  QQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFGYAQH 979
            QQ HVLALKYG++S+EFVSSSLIFMYSKCG +EDA+RSFE ASK+SSITWNALMFGYAQH
Sbjct: 425  QQIHVLALKYGLESDEFVSSSLIFMYSKCGFIEDARRSFEEASKNSSITWNALMFGYAQH 484

Query: 980  GQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPPRMEH 1039
            GQC+VALDLFFLMEEK+VKMDHITFVAVLTACSHIGLVE+G +FL+CMESDYGVPPRMEH
Sbjct: 485  GQCNVALDLFFLMEEKRVKMDHITFVAVLTACSHIGLVEQGRKFLQCMESDYGVPPRMEH 544

Query: 1040 YACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHLLEME 1099
            YACAVDLYGRSG L+EAKALIE MPFKP+  VWKTFLGACRSCGNVELACQVA HLLE+E
Sbjct: 545  YACAVDLYGRSGHLEEAKALIEEMPFKPDVTVWKTFLGACRSCGNVELACQVASHLLEIE 604

Query: 1100 PEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAEDRSH 1159
            PEEHCTYVLLSNMYG+LMRWEEKA+VKRLMKERGVKK PGWSWIEV N VHAFIA+D SH
Sbjct: 605  PEEHCTYVLLSNMYGNLMRWEEKAKVKRLMKERGVKKVPGWSWIEVNNNVHAFIAQDHSH 664

Query: 1160 PSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYA 1199
            PSC+QIYFLLEVL+EEITR+E A  GFES LEQEELSYA
Sbjct: 665  PSCRQIYFLLEVLLEEITRMEDAY-GFESSLEQEELSYA 702

BLAST of CmaCh05G003410.1 vs. NCBI nr
Match: gi|1009134051|ref|XP_015884237.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Ziziphus jujuba])

HSP 1 Score: 973.0 bits (2514), Expect = 4.7e-280
Identity = 463/684 (67.69%), Positives = 575/684 (84.06%), Query Frame = 1

Query: 495  LKSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEM 554
            ++SL+S   +S  AL  +L+THS A+K GTIAD+Y  NNIL+GY + ++F  A  LFD+M
Sbjct: 1    MRSLHSAIESSTNALLKVLITHSHAVKSGTIADIYIANNILNGYSRNQQFGLAHKLFDKM 60

Query: 555  PLRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQ 614
              RD+VSWNTMIAGY+N GN   +WE L+ MRR GF+ D YTFGS+LKGIA +   D+G+
Sbjct: 61   LNRDTVSWNTMIAGYVNCGNFGIAWEFLRNMRRSGFELDGYTFGSILKGIAGSHQWDIGE 120

Query: 615  QIHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTG 674
            ++HSMIIKMG+AGNVY+GSALLDMYAKCER+E+AY+ F ++ ++NTVSWNA+IAG+ Q G
Sbjct: 121  EVHSMIIKMGYAGNVYSGSALLDMYAKCERVEEAYVVFEHMPERNTVSWNALIAGFVQVG 180

Query: 675  DRETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCN 734
            DR TAF L  CME++  K DDG+ APLL LLDD+EF   T Q+HGK+ K GLE +NT+CN
Sbjct: 181  DRRTAFWLFGCMEKDAVKPDDGTIAPLLTLLDDSEFYWTTMQIHGKITKLGLEFSNTVCN 240

Query: 735  ALITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFE 794
            A+ITSYSECGS+ +AK+VF+ S   RD+V+WNS+L A+L+H +E LAF + +DMQ  GFE
Sbjct: 241  AIITSYSECGSIENAKKVFDRSFDTRDVVTWNSMLAAYLIHGKEGLAFNIFMDMQWLGFE 300

Query: 795  PDLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEAL 854
            PD+YSYTSI+SACF +   N G+SLHG++IKRGLEQSVPI+NALI+MYLKS   SM+EAL
Sbjct: 301  PDIYSYTSIVSACFEEAHKNLGQSLHGLIIKRGLEQSVPIANALIAMYLKSINKSMEEAL 360

Query: 855  CIFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLA 914
             IFE LE+KD+VSWNSILTGLSQ G SEDA+K F+HMR +A++ID Y+ SAV++SCSDLA
Sbjct: 361  HIFECLEMKDKVSWNSILTGLSQFGLSEDALKFFVHMRYVAVEIDHYTLSAVIRSCSDLA 420

Query: 915  TFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMF 974
            T QLGQQ HVLALK G++SNEFV+SSLIFMY+KCGI+EDA++SFE     S ITWN+L+F
Sbjct: 421  TLQLGQQVHVLALKSGLESNEFVASSLIFMYAKCGIIEDARKSFEENPNDSPITWNSLIF 480

Query: 975  GYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVP 1034
            GYAQHG  ++ALD+FF ME++KVK+DHITFVAVLTACSHIGLVE+GCE L+ MES+YG+ 
Sbjct: 481  GYAQHGLGYIALDIFFEMEKRKVKLDHITFVAVLTACSHIGLVEQGCELLKSMESNYGIT 540

Query: 1035 PRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARH 1094
            PRMEHYACAVDLYGR+GRL+EAKALIE MPF+P+A+VWKTFLGACR+CGN+ELA QVA  
Sbjct: 541  PRMEHYACAVDLYGRAGRLNEAKALIETMPFEPDAIVWKTFLGACRACGNIELASQVASR 600

Query: 1095 LLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIA 1154
            LL++EPEEHCTYVLLS+MYG L RW+EKA VKRLMKERGVKK PGWSWIE+KN+VHAF A
Sbjct: 601  LLDLEPEEHCTYVLLSDMYGYLRRWDEKASVKRLMKERGVKKVPGWSWIEIKNQVHAFKA 660

Query: 1155 EDRSHPSCQQIYFLLEVLMEEITR 1179
            EDR HP+C +IYF+L  LM+EI+R
Sbjct: 661  EDRLHPNCGEIYFVLGGLMDEISR 684
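
The percentages in an HSP summary line are the identical (or positive-scoring) column count divided by the alignment length. As a minimal sketch, the Python below parses one summary line and recomputes both values. The function names and the 0.01 tolerance are illustrative assumptions, not part of any BLAST tool.

```python
import re

# "label = count/length (percent%)", as printed in the HSP summary lines above.
_FRACTION = r"\s*=\s*(\d+)/(\d+)\s*\((\d+(?:\.\d+)?)%\)"


def parse_fraction(label_pattern, line):
    """Return (count, length, reported_percent) for one labelled fraction."""
    match = re.search(label_pattern + _FRACTION, line)
    if match is None:
        raise ValueError("label not found in line: " + label_pattern)
    return int(match.group(1)), int(match.group(2)), float(match.group(3))


def check_hsp_summary(line, tolerance=0.01):
    """Recompute identity and positive percentages and compare with the reported ones."""
    result = {}
    # "Posi?tives" also accepts the "Postives" spelling that some report pages emit.
    for key, pattern in (("identity", r"Identity"), ("positives", r"Posi?tives")):
        count, length, reported = parse_fraction(pattern, line)
        recomputed = 100.0 * count / length
        result[key] = {
            "count": count,
            "length": length,
            "reported": reported,
            "recomputed": recomputed,
            # The tolerance (an assumed value) absorbs two-decimal rounding.
            "consistent": abs(recomputed - reported) <= tolerance,
        }
    return result


summary = "Identity = 463/684 (67.69%), Positives = 575/684 (84.06%), Query Frame = 1"
report = check_hsp_summary(summary)
print(report["identity"]["consistent"], report["positives"]["consistent"])
```

The same check can be run on the other HSP summary lines on this page. A small tolerance is used instead of exact equality because the reported values are rounded to two decimals.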

BLAST of CmaCh05G003410.1 vs. NCBI nr
Match: gi|590571697|ref|XP_007011666.1| (Pentatricopeptide repeat-containing protein, putative [Theobroma cacao])

HSP 1 Score: 964.9 bits (2493), Expect = 1.3e-277
Identity = 462/704 (65.62%), Positives = 574/704 (81.53%), Query Frame = 1

Query: 496  KSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEMP 555
            + L S+  +S  A   +L TH  AIKLGT+ADVYT N IL+ Y +CKE   A  LF E+ 
Sbjct: 3    RPLNSLLESSAYAFYKVLTTHCCAIKLGTLADVYTANKILNAYARCKELHVARKLFAEVL 62

Query: 556  LRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQQ 615
             RD+VSWNTMIAGY+N GNLE ++E++K M+R GFD D YTFGS+LKG+A A  L +GQQ
Sbjct: 63   HRDTVSWNTMIAGYVNCGNLETAFEIMKDMKRCGFDFDGYTFGSLLKGVASAYRLQVGQQ 122

Query: 616  IHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTGD 675
            +HSMI+KMG+  NVYAGSALLDMYAKCE++ DAY+ F  + + N+VSWNA+IAG++Q GD
Sbjct: 123  LHSMIVKMGYEENVYAGSALLDMYAKCEKVGDAYMVFECLPEPNSVSWNALIAGFSQMGD 182

Query: 676  RETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCNA 735
            R T F LLDCME+EG KVDDG++APLL LLDD EF +LT Q+HGK+IK GL   NT+CNA
Sbjct: 183  RSTVFWLLDCMEKEGVKVDDGTYAPLLTLLDDIEFYKLTIQIHGKIIKRGLACDNTVCNA 242

Query: 736  LITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFEP 795
            +ITSYSECGS+ DA++VF+ + G+RDLV+WNS+L A+LVH +E+L FKL +DMQ  GFEP
Sbjct: 243  MITSYSECGSIGDARKVFDDAVGMRDLVTWNSMLAAYLVHEKEELGFKLFLDMQRLGFEP 302

Query: 796  DLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEALC 855
            D+Y+YTSI+SACF K   ++GKS+H +VIKRGLE SVPISNALI+MYLKS+  SM+EAL 
Sbjct: 303  DIYTYTSILSACFEKAHKSHGKSVHAVVIKRGLEYSVPISNALIAMYLKSNSTSMEEALS 362

Query: 856  IFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLAT 915
            +FES+E+KDRVSWNSILTG SQ+G SEDA+  F  MR   ++ID Y+ SAVL+SCSDLAT
Sbjct: 363  LFESMELKDRVSWNSILTGFSQIGLSEDALNFFGKMRGFMVEIDHYALSAVLRSCSDLAT 422

Query: 916  FQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMFG 975
             QLG+Q HVLA+K G ++N+FV+S+LIFMYSKCGI++DA++SFE   K  SI WN+++FG
Sbjct: 423  LQLGRQVHVLAIKLGFETNDFVASALIFMYSKCGIIQDARKSFEETPKDISIAWNSIIFG 482

Query: 976  YAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVPP 1035
            YAQ+GQ + ALDLFFLM + KV++DHITFVAVLTACSHIGLVE G  FL+ MESDYG+PP
Sbjct: 483  YAQNGQGNDALDLFFLMRDTKVRLDHITFVAVLTACSHIGLVEEGLNFLKSMESDYGIPP 542

Query: 1036 RMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARHL 1095
            RMEHYACAVDL+GR+GRLDEAK LIE+MPFKP+AMVWKT LGACR CG++ELA QVA HL
Sbjct: 543  RMEHYACAVDLFGRAGRLDEAKPLIESMPFKPDAMVWKTLLGACRVCGDIELAAQVASHL 602

Query: 1096 LEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIAE 1155
            L++EPEEHCTYV+LSNMYG L RW EKA V RLM+ERGVKK PGWSWIE+KN+VHAF AE
Sbjct: 603  LDLEPEEHCTYVILSNMYGHLRRWGEKASVTRLMRERGVKKVPGWSWIEIKNQVHAFNAE 662

Query: 1156 DRSHPSCQQIYFLLEVLMEEITRIEAAADGFESFLEQEELSYAY 1200
            D+SHP C++IY +L  LMEEIT ++A   G ++     + +Y Y
Sbjct: 663  DQSHPHCKEIYQMLGGLMEEITWLDADT-GLDALTSDFDETYGY 705

BLAST of CmaCh05G003410.1 vs. NCBI nr
Match: gi|731386173|ref|XP_010648772.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Vitis vinifera])

HSP 1 Score: 960.3 bits (2481), Expect = 3.2e-276
Identity = 461/696 (66.24%), Positives = 567/696 (81.47%), Query Frame = 1

Query: 495  LKSLYSVTGTSFRALSNLLLTHSLAIKLGTIADVYTCNNILSGYWKCKEFRSADVLFDEM 554
            ++ L+S++ +SF AL    + H LAIK GT A +YT NNI+SGY KC E R A  +F E 
Sbjct: 1    MRPLHSLSQSSFTALYRASVNHCLAIKSGTTASIYTANNIISGYAKCGEIRIASKMFGET 60

Query: 555  PLRDSVSWNTMIAGYINSGNLENSWEVLKCMRRFGFDQDEYTFGSMLKGIACAGMLDLGQ 614
              RD+VSWNTMIAG++N GN E + E LK M+R+GF  D Y+FGS+LKG+AC G +++GQ
Sbjct: 61   SQRDAVSWNTMIAGFVNLGNFETALEFLKSMKRYGFAVDGYSFGSILKGVACVGYVEVGQ 120

Query: 615  QIHSMIIKMGFAGNVYAGSALLDMYAKCERLEDAYLTFLNISKQNTVSWNAMIAGYAQTG 674
            Q+HSM++KMG+ GNV+AGSALLDMYAKCER+EDA+  F +I+ +N+V+WNA+I+GYAQ G
Sbjct: 121  QVHSMMVKMGYEGNVFAGSALLDMYAKCERVEDAFEVFKSINIRNSVTWNALISGYAQVG 180

Query: 675  DRETAFLLLDCMEQEGEKVDDGSFAPLLPLLDDAEFCRLTRQVHGKVIKHGLESANTMCN 734
            DR TAF LLDCME EG ++DDG+FAPLL LLDD +  +LT QVH K++KHGL S  T+CN
Sbjct: 181  DRGTAFWLLDCMELEGVEIDDGTFAPLLTLLDDPDLHKLTTQVHAKIVKHGLASDTTVCN 240

Query: 735  ALITSYSECGSLVDAKRVFNCSAGVRDLVSWNSLLGAFLVHNQEDLAFKLLIDMQEHGFE 794
            A+IT+YSECGS+ DA+RVF+ +   RDLV+WNS+L A+LV+NQE+ AF+L ++MQ  GFE
Sbjct: 241  AIITAYSECGSIEDAERVFDGAIETRDLVTWNSMLAAYLVNNQEEEAFQLFLEMQVLGFE 300

Query: 795  PDLYSYTSIISACFNKELSNNGKSLHGMVIKRGLEQSVPISNALISMYLKSDGGSMKEAL 854
            PD+Y+YTS+ISA F       GKSLHG+VIKRGLE  VPISN+LI+MYLKS   SM EAL
Sbjct: 301  PDIYTYTSVISAAFEGSHQGQGKSLHGLVIKRGLEFLVPISNSLIAMYLKSHSKSMDEAL 360

Query: 855  CIFESLEIKDRVSWNSILTGLSQMGSSEDAVKSFLHMRSLAMDIDRYSFSAVLKSCSDLA 914
             IFESLE KD VSWNSILTG SQ G SEDA+K F +MRS  + ID Y+FSAVL+SCSDLA
Sbjct: 361  NIFESLENKDHVSWNSILTGFSQSGLSEDALKFFENMRSQYVVIDHYAFSAVLRSCSDLA 420

Query: 915  TFQLGQQFHVLALKYGMDSNEFVSSSLIFMYSKCGIMEDAKRSFEGASKSSSITWNALMF 974
            T QLGQQ HVL LK G + N FV+SSLIFMYSKCG++EDA++SF+   K SSI WN+L+F
Sbjct: 421  TLQLGQQVHVLVLKSGFEPNGFVASSLIFMYSKCGVIEDARKSFDATPKDSSIAWNSLIF 480

Query: 975  GYAQHGQCHVALDLFFLMEEKKVKMDHITFVAVLTACSHIGLVERGCEFLRCMESDYGVP 1034
            GYAQHG+  +ALDLFFLM++++VK+DHITFVAVLTACSHIGLVE G  FL+ MESDYG+P
Sbjct: 481  GYAQHGRGKIALDLFFLMKDRRVKLDHITFVAVLTACSHIGLVEEGWSFLKSMESDYGIP 540

Query: 1035 PRMEHYACAVDLYGRSGRLDEAKALIEAMPFKPNAMVWKTFLGACRSCGNVELACQVARH 1094
            PRMEHYAC +DL GR+GRLDEAKALIEAMPF+P+AMVWKT LGACR+CG++ELA QVA H
Sbjct: 541  PRMEHYACMIDLLGRAGRLDEAKALIEAMPFEPDAMVWKTLLGACRTCGDIELASQVASH 600

Query: 1095 LLEMEPEEHCTYVLLSNMYGDLMRWEEKAQVKRLMKERGVKKTPGWSWIEVKNKVHAFIA 1154
            LLE+EPEEHCTYVLLS+M+G L RW EKA +KRLMKERGVKK PGWSWIEVKN+V +F A
Sbjct: 601  LLELEPEEHCTYVLLSSMFGHLRRWNEKASIKRLMKERGVKKVPGWSWIEVKNEVRSFNA 660

Query: 1155 EDRSHPSCQQIYFLLEVLMEEITRIEAAADGFESFL 1191
            EDRSHP+C++IY  L  LMEEI R++  A+  E FL
Sbjct: 661  EDRSHPNCEEIYLRLGELMEEIRRLDYVANS-EVFL 695

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PP255_ARATH	3.8e-229	56.41	Putative pentatricopeptide repeat-containing protein At3g25970 OS=Arabidopsis th...
C3H22_ARATH	5.4e-151	58.27	Zinc finger CCCH domain-containing protein 22 OS=Arabidopsis thaliana GN=At2g248...
C3H18_ORYSJ	4.3e-148	58.96	Zinc finger CCCH domain-containing protein 18 OS=Oryza sativa subsp. japonica GN...
PP307_ARATH	3.7e-131	35.47	Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN...
PP172_ARATH	3.9e-125	35.71	Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana GN...

Match Name	E-value	Identity	Description
A0A0A0LMK6_CUCSA	0.0e+00	86.31	Uncharacterized protein OS=Cucumis sativus GN=Csa_2G382810 PE=4 SV=1
A0A061GHG1_THECC	9.0e-278	65.63	Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_...
F6H3K3_VITVI	2.2e-276	65.81	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0008g07050 PE=4 SV=...
W9S7C5_9ROSA	1.6e-274	66.62	Uncharacterized protein OS=Morus notabilis GN=L484_007144 PE=4 SV=1
A0A0D2N583_GOSRA	6.2e-271	65.11	Uncharacterized protein OS=Gossypium raimondii GN=B456_001G039300 PE=4 SV=1

Match Name	E-value	Identity	Description
AT3G25970.1	2.1e-230	56.41	Pentatricopeptide repeat (PPR) superfamily protein
AT2G24830.1	3.1e-152	58.27	zinc finger (CCCH-type) family protein / D111/G-patch domain-contain...
AT4G13650.1	2.1e-132	35.47	Pentatricopeptide repeat (PPR) superfamily protein
AT2G27610.1	2.2e-126	35.71	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G02330.1	7.9e-116	32.80	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity	Description
gi|449442142|ref|XP_004138841.1|	0.0e+00	86.31	PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Cucum...
gi|659089006|ref|XP_008445278.1|	0.0e+00	85.98	PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Cucum...
gi|1009134051|ref|XP_015884237.1|	4.7e-280	67.69	PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Zizip...
gi|590571697|ref|XP_007011666.1|	1.3e-277	65.63	Pentatricopeptide repeat-containing protein, putative [Theobroma cacao]
gi|731386173|ref|XP_010648772.1|	3.2e-276	66.24	PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Vitis...

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term	Definition
IPR000467	G_patch_dom
IPR000571	Znf_CCCH
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0003676	nucleic acid binding
GO:0046872	metal ion binding
GO:0005515	protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CmaCh05G003410	CmaCh05G003410	gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
CmaCh05G003410.1	CmaCh05G003410.1-protein	polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmaCh05G003410.1.CDS.5	CmaCh05G003410.1.CDS.5	CDS
CmaCh05G003410.1.CDS.4	CmaCh05G003410.1.CDS.4	CDS
CmaCh05G003410.1.CDS.3	CmaCh05G003410.1.CDS.3	CDS
CmaCh05G003410.1.CDS.2	CmaCh05G003410.1.CDS.2	CDS
CmaCh05G003410.1.CDS.1	CmaCh05G003410.1.CDS.1	CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmaCh05G003410.1.five_prime_UTR.1	CmaCh05G003410.1.five_prime_UTR.1	five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CmaCh05G003410.1.exon.5	CmaCh05G003410.1.exon.5	exon
CmaCh05G003410.1.exon.4	CmaCh05G003410.1.exon.4	exon
CmaCh05G003410.1.exon.3	CmaCh05G003410.1.exon.3	exon
CmaCh05G003410.1.exon.2	CmaCh05G003410.1.exon.2	exon
CmaCh05G003410.1.exon.1	CmaCh05G003410.1.exon.1	exon


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000467	G-patch domain	PFAM	PF01585	G-patch	coord: 296..338, score: 2.3
IPR000467	G-patch domain	SMART	SM00443	G-patch_5	coord: 294..340, score: 4.1
IPR000467	G-patch domain	PROFILE	PS50174	G_PATCH	coord: 296..342, score: 13
IPR000571	Zinc finger, CCCH-type	SMART	SM00356	c3hfinal6	coord: 143..169, score: 7.
IPR000571	Zinc finger, CCCH-type	PROFILE	PS50103	ZF_C3H1	coord: 143..170, score: 13
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 866..893, score: 0.018; coord: 1043..1063, score: 0.73; coord: 529..555, score: 0.024; coord: 661..690, score: 3.0E-5; coord: 732..754, score:
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 966..1012, score: 1.7E-7; coord: 761..807, score: 7.1E-11; coord: 558..604, score: 1.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 967..1000, score: 1.2E-5; coord: 560..591, score: 6.1E-4; coord: 763..796, score: 2.9E-4; coord: 661..694, score: 0.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 1000..1034, score: 7.366; coord: 899..933, score: 5.612; coord: 659..693, score: 10.315; coord: 1068..1098, score: 6.16; coord: 761..795, score: 10.665; coord: 864..898, score: 8.901; coord: 593..627, score: 8.013; coord: 628..658, score: 6.095; coord: 527..557, score: 8.396; coord: 965..999, score: 9.931; coord: 831..863, score: 5.766; coord: 558..592, score: 11.29; coord: 1036..1066, score: 6.96; coord: 796..830, score: 7.432; coord: 934..964, score: 6.204; coord: 729..759, score: 5.152; coord: 1102..1136, score: 7
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 633..676, score: 1.2E-6; coord: 867..898, score: 1.2E-6; coord: 1037..1190, score: 1.
None	No IPR available	unknown	Coil	Coil	coord: 1..30, score: -; coord: 426..484, scor
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 832..1143, score: 0.0; coord: 519..796, score:
None	No IPR available	PANTHER	PTHR24015:SF77	SUBFAMILY NOT NAMED	coord: 832..1143, score: 0.0; coord: 519..796, score:
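
Each "coord: start..end" entry in the InterPro annotations gives the extent of one domain match. A minimal Python sketch for pulling those ranges out of the annotation text is shown below. It assumes the coordinates are 1-based, inclusive positions on the polypeptide, which this page does not state, and the function names are illustrative.

```python
import re

# One "coord: start..end" entry, as written in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_ranges(text):
    """Return every (start, end) pair found in the text, in order of appearance."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a match, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# First three PF01535 (PPR) matches, copied from the annotation table.
sample = "coord: 866..893 score: 0.018 coord: 1043..1063 score: 0.73 coord: 529..555 score: 0.024"

for start, end in sorted(coord_ranges(sample)):
    print(start, end, span_length(start, end))
```

Sorting the ranges by start position makes it easier to see how the repeats tile the protein. The pattern looks only at the "coord:" entries, so truncated score values do not affect it.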