CcUC01G004300 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC01G004300
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCicolChr01: 4387754 .. 4396544 (-)
RNA-Seq ExpressionCcUC01G004300
SyntenyCcUC01G004300
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGAAGGAAAAAGTCATTATCACTCTCTTGCAAGGCTGCAACAATCTCAACAAGCTTCGCAAAATCCACGCACATGTTATTGTAAGCGGCCTCCGCGATCATGTCGCCATTGGCAACAAGCTTTTGAACTTCTGTGCCATCTCTGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGGAGTGCCCACAAACCGAAGCCTGGAACTCCATCATCAGAGGTTTTGCCCAGAGCTCATCTCCCATTGAGGCTATTGTTTTCTACAATCGAATGGTTTCGGCCTCTTTCTCTTCTCCTGACACTTTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGAATCAAGGCTGAGCGTAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGTTGCGGTTATGATGGGGATGTGATTGTCTGCACCAATCTTGTCAAATGCTATTCGGTGATGGGGTCCGTTTGTAGTGCCCAACAGGTGTTTGACGAAATGCCTGCAAGAGACTTGGTGGCTTGGAATGCTATGATTTCCTGCTTTTCTCAACAGGGTTTGCACCTGGAGTCACTAGAGACATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGTTTTACACTCGTTGGGTTGATTTCGTCTTGTGCCCATCTTGGAGCTTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTGCACAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGCCGTTTAGATCAGGCCATTCTTATCTTTGATAGAATGCAGAAGAAGGACATTTTCACTTGGAACTCGATGATTGTTGGGTATGGAGTTCATGGTCGAGGTAGTGAAGCTATATATTGCTTTCAACAGATGTTAGAAGCAAGAGTGCAACCGAACTCTATCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTCTTAAATACTTCCATTTGATGAGCTCTGAGTTTAGGCTAAGACCCGAGGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAGAAGGCACTTGAAATCGTATCAAATTCATCACAGAATGATCCAGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAGATTCACAAAAATGTGACAATAGGAGAAATTGCCATGAACAGTCTGTGTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGTTGGCTACAATCTATGCGGGAAAAAATGATACAGCTGGTGTTGCAAGAATGAGAAAAATGATCAAGAGGCAAGGGATAAAGACTACCCCAGGTTGGAGTTGGATTGAAATTGAGGATCAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTTATTCCATTGAAGTTTATGAGAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATGTAGGAGATGAGTCTATTTCATCACTGGATGTGCTTTCTACCACAGAGACCTTAAAGACTTCATGTACATATCATAGTGAGAAACTGGCAATTGCATTTGGATTGGCAAGAACTTCAGATGGGACACAGATACGCATTGTTAAAAACCTTAGAGTTTGTATAGATTGTCATTCATTCATAAAAGCTGTCTCGGCGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTCGATTCCACCATTTCAATGGTGGTCAATGTTCCTGCAATGACTACTGGTGAAAAGTGAAAAATGTACTTTCAATTTTGCCTGTTTCAAGTCTGCAGAGCCTCCTTTCGTTGTAATGGAGTTTTGGCTCTTTCCATACCTCTTTTATTAGCATAGTACAGTCGTTGCTTACAGTCATGCAAGGAGATATAGCTGAGTTTCAAGTTATTATGCTCTTCTTTGATGCAACTTCTAACCTACTCCGCACGACAAGTTTATGTAATTAGTTATACATTCTTTGAAATTATACAGGCTCCACAAGGCCGCGTTGCAAGCACAAACTAAGCTCAAGAAGGATAGTTTACTAGTTGAAATAAAACTTCCGAGCTTCCACGGTATGTTAGGAAATTGAGTTTCCCTACATTTTAAGGTGGGGAATTGTATTGGATAGCAAATTTTGATTATATATTAGTATTCGAGTCATCAATTTTTAAAATATTGCAAATATGGCAAAATTTGTTTCAAATAATATATATTGATAAAAATCATTTTATTTGATTTTTTTCTTTTTCCAAAAACACCCTCATCTCTATTTTCTTCTTTGACAAGCTATGAGTTTCTGTACAACATCTTGAACTCCTTCCCAACAAGTTTGAACTTACCAGGGTGGTTTTCCCATAAAGTTGGTCCTCAATTTCTTCCTAATAAGTTCAAACTTACCAAGAAGGAAGATCACTAGAACTTTATGGGAAAAAAGATAGAGAGCTTTCATTCCTTTCGGATAAGTTCAAAAAAGACAAAGAGCTACTTTAGGAAGGAAAAGAAGGAAAAGAAAGAAGAAGAAGAAAAAAAGGAAAAATTGATTCTGTCAAATTTACGGAAAGTTCTTGTTATTAGTCAACTGAGCTGCTTTTGACAAGTATATCAGTCAAGTGTATATTAGTAGTGTGTCATGTGTATATCAATAGTATATCAACTATACCAGTCAAATGCATCATTATCAAGTTTATCAATAATATGATATTAACTAATATAATGAAAATGAAGTATATCAATAGTATAGTTAAAATATTAATAGTAGAGGGTGTTTTAGACTTTTATTCTTTGAATGCTTGGGTTGGATCACAATTTTGCTAAATGTGCAAAAGGAAAAAGTTTGAGACACGTGTGCAAATTTCATTTCATCTTTTGCCATACATGTAATTGCCTTTTTTTAAAAAAAAACTTTTTTTTTCTAACATCTTTTTCACTTTCCAGCCTTTTAGAGAAGGACCCACAATATAAAATGACGTTTTTATCCTCATTAATTCTTCATTCAATTCCCTTCCAATTTTGCTTGAATTATACTCATGTTTCCATTTTGCTTTCACAAAAAATGCACAAGGGTCTCGTTTTCACCTTGCTTAGAGAGAATTTATTGGTTCCATTCTTAGAGACTTCTTTCCTTTTCCTCTTGGTACAAAATTTTGTGTGGAGGAAAGGTTCTTTATTGGAAATGTTTAAATTTATTGGAATTCTTAGGAAACTAATATTTTCCAGTCAACGTTACGAGAGGAAAAAAATGATTCTGTCAATTTTTTTCGTAAAGCTGACTGAATATATATATATATATATATATATATATATATATATATATATATATTTTACATAAAGCTAACTGGAAATTTTATGGAAAATTTGATAGAATTTTTTTTTCTTCTTTTTTTTTTCTTTCAAGAAAACAACTATTATTCAATTTACGAGGTTATTTACAATAAAGTTGATTGGAAAATATTATTTTCCCAAAAATTTCAATAAATCTAAACATTTTTTGGAAAGATCTTTCCCCCACACAAAATTAAGCAACGGCTATTTCAACTTTCAACAACATAAAACACATGAAGAAAAGAGAAAAAACCAAAACAATGTCCCTTACTTCTTCTGTACGATAGGAAACGCAGCAAAAGGAAAAAAGTCTTTGAGAATGGGACTAATATTTACAACATAAAACACATGAATAAAAGAGAAGAAAACCAAAACAAGGTACGATAGGAAATGCTACAAAAGGGGAAAAATGCTGAAAATTTTTGTTAGGTAAGTTAGGTCGAGATAAAATTGAATGATAAATTAACGAGGATAAAAATGTTATTTTATATTGTGTATGCTTCCAAATTTTTTTTTTTTGGTTAGAAATAACAATTTCCTGCAATCGCTTATAAAACATAGAAAAAGAAAAAAAGTAAGAAAAAAATTGCTGTTAACCGTTGTTTGGAGTCTCACCGTCGTCCGAGGCGAAAACGAAACCCTAGCAGTGGCAAGTGCCGAGAATTAGCAGGTTGCTTCAAATCTCTCCACTTGTTTATTTTTCTATTGTTATTCATTGAGTCTTCAGTTATCACCAAACTTTAGCCCCAAAATCTTTGGAGTCACCGATCAATTTACAGAGACGGTGAAGGCGGAGCATCATACATAGTGTAGAAACAATCTTACAAAGTTCAAACATCAGCAACTCAATTACCCGAAACAATTAGATAATGCGTCAATAGATACGATGGCGACTCTGGCAGGTCAACCAACTCACCATCAACCAACTACGGTAGAGGAAGTTAGAACACTTTGGATAGGGGATTTGCAGTACTGGGTCGATGAGTCTTACCTTAATTCTTGCTTTGCTCACACTGGCGAGGTTCGTTTGTTAGTTTTTTCTTAATTTATTTTGCAACTTTGGTGTTTAATGATGATGTTTTTAATCATAGACCCATTAGGATTTCACCAGTTTCTCAAGCTGATGACACTGTTTATGAAACCTAGATGTGTTTAAGTGGATTTTTTAGGTCTTCTACTTTTAGGTTGGCCTTTCCAATAATTTGTCCTGAAATTGGTTAGTTTTCTTCGGGTGTTTGGAAGTAAATTTGGGTGTATAAAATTAGAATGTGTTTGTATTTTGCTCTATTGCTACTGAGATCTGAACTGTAGGGAGAGGAAGGGCCTCTCTTATGACCTTTCATGCCACTGATGGGGATGATTTTGATGCTTGAGAGCCGTGGTAGTGGAAACGCCAATGTTTAAAATTATAGTTATTTTATAGTTGTTATATTACAATAGCCCATACAAATGTACTTCTGATGTCTAGTTGTTGTTGTTGTTTGTATTCTAGTATTTGAGTAAGCGCTTGTAACTTAGCTGCATTCAGAGAATAGCATTCTGTTGCTTGTTGACTATGCCATATTTCCTAATACAGGTAATATCAATTAAAATAATTCGCAACAAGATCACTGGCCAGCCTGAGGGTTATGGGTTTGTGGAGTTTGTATCTCATGCCGCAGCAGAAAGAATTTTGCAGACATACAATGGGACCCAGATGCCTGGAACAGAGCAAACTTTCAGATTGAATTGGGCCTCCTTTGGAATTGGAGAAAGGCGCCCGGACGCTGGCCCTGAGCACTCTATTTTTGTGGGGGATTTGGCTCCTGATGTTACAGACTATCTGTTGCAAGAGACCTTTAGAGTGCAATATCCATCTGTTAGGGGTGCTAAAGTTGTGACTGATCCAAACACTGGACGTTCAAAGGGATACGGGTTTGTTAAATTTGCTGATGAAAATGAAAGGAATCGAGCTATGTCAGAAATGAATGGTATTTATTGCTCAACTAGGCCTATGCGTATTAGTGCAGCAACACCCAAAAAGACCATTGGTGTTCAGCAGCAATATAGTCTAGGTAAAGGTAATAAATAATGAATATGCAGTACTGCCTTTTCGTTTATTCATTTTATCCCTTTTCATGTGTTATAATATATATGATCCTTTTGGAGTGCGTGCAATCTTGTTTATTATTCATCCAAGTTTCCAGCCCTTCTAGCTTCTCTATTTACATGAAACCGGTTTGGTTCCCATCTGTGCTAGTACTCATAGATATGAAACTGCTGGTTTTAAAATTTGAATTTCACCTTTGTCAATGAAAGTTTTATTTTCATCATCAAACCTAGAGCCAGCAGTTAATAGGTACAAATGATCATTGTGAAATTTTAGTTGATATTAGCTAAATTAGTGTTCTTGGACCTATGATGAAAACTTATCAAGTTAATTGTTATGCATATTTAATGTATTTGTCAAAATACAATTTGTCTCTTGATTAATTTTTTGGATAAATGCGTTAATCGATGTCAAAGATATTTTCTAAATGCCTTGATGGGGGAGGATATTTGCAAATTTCAAATGAAAGAGAATTCATCGTTATTAGTACTTTTCCTTCACCTATGAGAAAATCTTAGGAGGGGATGGTAATTTACTATTTTTAAACTTGTACTTATCAATGTGCCTCTTGTCTCAGCAATGTACCCAGTTCCAGCCTACACTACATCCGTGCCTGTGCTTCCAGCAGATTATGATGCAAATAACACAACAGTAAGTCAAATGTCAAGTGACAGTTTTACATCTTATGGTTTCTATGCTCAGCTCTTACTATATTGTGCTCTGTGCAGATCTTTGTTGGTAACTTGGATCCTAATATTACAGAGGAGGAGTTGAAGCAAACTTTTTTGCAGTTTGGTGAGATTGCTTATGTGAAAATTCCTTCTGGGAAAGGCTGTGGTTTTGTACAGTTTGGGACAAGGTGTGTTTTGCATGATTAATTTTATTGATGTAAGTTGAGTAAATTGCATCTCTGCTTGCCTTGAAAGAAGGTTGTGAATAGAAATAATAGTCACGAAGTGGTAGAAAAAAAAATCCTGTGGAATAGAAGGAAAGCTCGTGTATGTATGATTCTCAAAGTTATGTTATTGACTTCTTATTTCACATGAACATTTGGACTTGGATTTAATATGTTCTCCTTATTGTGTCCATGCAGGGCTTCAGCTGAAGAAGCCATCCAAAAGATGCAAGGAAAAATAATTGGTCAACAAGTGGTTCGTACTTCTTGGGGTAGAAATCCGGCTGTCAAGCAGGTATTTAACCTTCCAATTGCAGTTCAGAACAAATATTTCCTAGATTTAGTTAAATCCCTTGTCCTCTGTGGATGTGATGTGTAATTATGTGGTTTTTAAATTAATGCTTTTTCTTGGATTCAAATTACAAAAACCTGATTGCTCGATGAGTTTTTGCTTAGAACATTAGGTGTATTGTCTCTTATTTAGCCTGTGAATTTGAAATTTTATCTATAGGGAGAAATGGTTCAGACTTCATCCTTGCAAATAATTTATAATTTATGGTTGAAAATACCATTTTGGTCCTATGTGCTGAATTTTGTTTTATTTTAGTCCCTATACTTTTAAATGTCTAATTTAAATTTTTGTACTTTCAAGAAATCTTAGATTTAGTTCCTACTACTTGCTTATTTATTGTTGACTTTTTTCAAAAACTTTTTATTTATTTATTAGCATTTTTATTCTAAATTTTGAAAACATATTCATATATTATATTTCCATGCCAATTTCGATAAGAATTAACTTTGAGAGACTAAGTTTAAGATTTATTAAAAGTACAGGGATTAAAATTGGACATTTATAGGTACATGGACTAAAATTGAACTTACATCACAGTATTGGGACCAAAAATGTTATTTTAACCGTAATTTTCTTTTGCTTTTTTGCTTTATTTATTTATTTATTAATTATTATTATTATTTTTTCTTTTTGGTTTTGTTGGTGGTGGTGGTGTTGGTGAACGTTAAAGCACCCTAGTACGAAATAGGATCTAAGGAAAAGGTCTGGGAATTGGGAATGAGGCTTACTTCAGAAAAGTATAACAGACAAAATCATCTACAAACCTATAGCAGACAATCATTTACCTAAATTATAATACATTGTTTTATATTTTCTAGCTTCGCTAAATGGATATTTTGGTACAGAAAAAAGGACTCCTCGGTAAAACTGGTTATGCGTGCTATCTTTCTGTTAGCAAGCTTTCATAAGTCGCAATAACTTGTGTAATGCTCCAGTGAATTACACTGATGAAATGGTGATGAAATCTTAGGACTTGGCTACTTGGGGTCAGCAAGTTGATCCAAACCAGTGGAGTGCGTACTATGGGTATGGAGGGACTTATGATGCTTATGGATATGGAGTTGTACAAGATCCATCTTTATATGCTTATGGTGCATATTCGGGTTATGCCTCATATCCTCAACAGGTAAGTTTTATTCTCCTTATAAGGCAAGAGCTCCATACTCTTCTGTGGTAGATAGTGGGCAAAGTGATTGTTATAATCTATTTGTTTGCAGCCATTTTTCAATTGGAGTCTATTGGGTGAATTCTAATTGCCCTTTCAGGATGCTAAGGCTTAGAATCAGAATAAAATGAGTGGGAATCGAGAGCAATATATTTATAGAAAAAGGTTGATCTATCCAGAAAATTTGTTGCCTTTTTGATCTTGGAACTATTTGTTTGAACACTTCTCTTTTAGGTTGATGGTGTACAAGATTTGGCCGCTGTAGCTGGTGCAGTCCCTTCTGTGGAACAGGGAGAGGAATGGAATGATACGCTGGATACGCCAGATGTTGATTAGTAAGTATCTCTCTCTTTTGTCCCTAAGGAGCTACAATACAAGATTGGAAATGTACAAATCTATTATTATTTGATCTTGCATTGACATCAAGTTGCAAATTCTGCAGTTTAAATGATGCATACCTCTCAAAACATGAGAGTGCGATTTTAGGCTGGCCATTATGGTTGACTACCTCATCACTGGTTAGGCAAACATGAGCTCGGTCGTTGTTTGTAGTCAATAGGTTGTCAGTGCATGAAGGCAGCTGGAGGAAAATTATGGTATGTGCTTTATATGCTTGACCATACTCTTAGTCGATGATATCGACACAATAATGTTGCCTCGTACTACAGCGTGTTTGATTTGATAATTTATTTTTTACTATAGGAACGGGATTGCCAGGGTTAGGTGCTTTGATGCTGTATGTATTATGATGCTCTCTGCCATTTGGAAATGGAGTTTTCATATCAGGAGAACATCTGCTCTAATCTGTACCTTGTTTCATATTCATAGTTGTGTTTTAAAACAGGGAGCTTGAACTGATAATTGTTTGTCAGTTGAGATTTAGCAAAAAGAAAATGAAGCTGAAATGAAATGAAGTTTTTAATGAATGGAGTGATGAAGACGGTTGTTCCTAAAACAAATTTCTCAATTTTCCCCTCTTC

mRNA sequence

ATGTCGAAGGAAAAAGTCATTATCACTCTCTTGCAAGGCTGCAACAATCTCAACAAGCTTCGCAAAATCCACGCACATGTTATTGTAAGCGGCCTCCGCGATCATGTCGCCATTGGCAACAAGCTTTTGAACTTCTGTGCCATCTCTGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGGAGTGCCCACAAACCGAAGCCTGGAACTCCATCATCAGAGGTTTTGCCCAGAGCTCATCTCCCATTGAGGCTATTGTTTTCTACAATCGAATGGTTTCGGCCTCTTTCTCTTCTCCTGACACTTTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGAATCAAGGCTGAGCGTAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGTTGCGGTTATGATGGGGATGTGATTGTCTGCACCAATCTTGTCAAATGCTATTCGGTGATGGGGTCCGTTTGTAGTGCCCAACAGGTGTTTGACGAAATGCCTGCAAGAGACTTGGTGGCTTGGAATGCTATGATTTCCTGCTTTTCTCAACAGGGTTTGCACCTGGAGTCACTAGAGACATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGTTTTACACTCGTTGGGTTGATTTCGTCTTGTGCCCATCTTGGAGCTTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTGCACAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGCCGTTTAGATCAGGCCATTCTTATCTTTGATAGAATGCAGAAGAAGGACATTTTCACTTGGAACTCGATGATTGTTGGGTATGGAGTTCATGGTCGAGGTAGTGAAGCTATATATTGCTTTCAACAGATGTTAGAAGCAAGAGTGCAACCGAACTCTATCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTCTTAAATACTTCCATTTGATGAGCTCTGAGTTTAGGCTAAGACCCGAGGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAGAAGGCACTTGAAATCGTATCAAATTCATCACAGAATGATCCAGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAGATTCACAAAAATGTGACAATAGGAGAAATTGCCATGAACAGTCTGTGTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGTTGGCTACAATCTATGCGGGAAAAAATGATACAGCTGGTGTTGCAAGAATGAGAAAAATGATCAAGAGGCAAGGGATAAAGACTACCCCAGGTTGGAGTTGGATTGAAATTGAGGATCAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTTATTCCATTGAAGTTTATGAGAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATGTAGGAGATGAGTCTATTTCATCACTGGATGTGCTTTCTACCACAGAGACCTTAAAGACTTCATGTACATATCATAGTGAGAAACTGGCAATTGCATTTGGATTGGCAAGAACTTCAGATGGGACACAGATACGCATTGTTAAAAACCTTAGAGTTTGTATAGATTGTCATTCATTCATAAAAGCTGTCTCGGCGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTCGATTCCACCATTTCAATGGTGGTCAATGTTCCTGCAATGACTACTGGCTCCACAAGGCCGCGTTGCAAGCACAAACTAAGCTCAAGAAGGATAGTTTACTAGTTGAAATAAAACTTCCGAGCTTCCACGGTCAACCAACTCACCATCAACCAACTACGGTAGAGGAAGTTAGAACACTTTGGATAGGGGATTTGCAGTACTGGGTCGATGAGTCTTACCTTAATTCTTGCTTTGCTCACACTGGCGAGGTAATATCAATTAAAATAATTCGCAACAAGATCACTGGCCAGCCTGAGGGTTATGGGTTTGTGGAGTTTGTATCTCATGCCGCAGCAGAAAGAATTTTGCAGACATACAATGGGACCCAGATGCCTGGAACAGAGCAAACTTTCAGATTGAATTGGGCCTCCTTTGGAATTGGAGAAAGGCGCCCGGACGCTGGCCCTGAGCACTCTATTTTTGTGGGGGATTTGGCTCCTGATGTTACAGACTATCTGTTGCAAGAGACCTTTAGAGTGCAATATCCATCTGTTAGGGGTGCTAAAGTTGTGACTGATCCAAACACTGGACGTTCAAAGGGATACGGGTTTGTTAAATTTGCTGATGAAAATGAAAGGAATCGAGCTATGTCAGAAATGAATGGTATTTATTGCTCAACTAGGCCTATGCGTATTAGTGCAGCAACACCCAAAAAGACCATTGGTGTTCAGCAGCAATATAGTCTAGGTAAAGCAATGTACCCAGTTCCAGCCTACACTACATCCGTGCCTGTGCTTCCAGCAGATTATGATGCAAATAACACAACAATCTTTGTTGGTAACTTGGATCCTAATATTACAGAGGAGGAGTTGAAGCAAACTTTTTTGCAGTTTGGTGAGATTGCTTATGTGAAAATTCCTTCTGGGAAAGGCTGTGGTTTTGTACAGTTTGGGACAAGGGCTTCAGCTGAAGAAGCCATCCAAAAGATGCAAGGAAAAATAATTGGTCAACAAGTGGTTCGTACTTCTTGGGGTAGAAATCCGGCTGTCAAGCAGGACTTGGCTACTTGGGGTCAGCAAGTTGATCCAAACCAGTGGAGTGCGTACTATGGGTATGGAGGGACTTATGATGCTTATGGATATGGAGTTGTACAAGATCCATCTTTATATGCTTATGGTGCATATTCGGGTTATGCCTCATATCCTCAACAGGTTGATGGTGTACAAGATTTGGCCGCTGTAGCTGGTGCAGTCCCTTCTGTGGAACAGGGAGAGGAATGGAATGATACGCTGGATACGCCAGATGTTGATTATTTAAATGATGCATACCTCTCAAAACATGAGAGTGCGATTTTAGGCTGGCCATTATGGTTGACTACCTCATCACTGGTTAGGCAAACATGAGCTCGGTCGTTGTTTGTAGTCAATAGGTTGTCAGTGCATGAAGGCAGCTGGAGGAAAATTATGGAACGGGATTGCCAGGGTTAGGTGCTTTGATGCTGTATGTATTATGATGCTCTCTGCCATTTGGAAATGGAGTTTTCATATCAGGAGAACATCTGCTCTAATCTGTACCTTGTTTCATATTCATAGTTGTGTTTTAAAACAGGGAGCTTGAACTGATAATTGTTTGTCAGTTGAGATTTAGCAAAAAGAAAATGAAGCTGAAATGAAATGAAGTTTTTAATGAATGGAGTGATGAAGACGGTTGTTCCTAAAACAAATTTCTCAATTTTCCCCTCTTC

Coding sequence (CDS)

ATGTCGAAGGAAAAAGTCATTATCACTCTCTTGCAAGGCTGCAACAATCTCAACAAGCTTCGCAAAATCCACGCACATGTTATTGTAAGCGGCCTCCGCGATCATGTCGCCATTGGCAACAAGCTTTTGAACTTCTGTGCCATCTCTGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGGAGTGCCCACAAACCGAAGCCTGGAACTCCATCATCAGAGGTTTTGCCCAGAGCTCATCTCCCATTGAGGCTATTGTTTTCTACAATCGAATGGTTTCGGCCTCTTTCTCTTCTCCTGACACTTTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGAATCAAGGCTGAGCGTAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGTTGCGGTTATGATGGGGATGTGATTGTCTGCACCAATCTTGTCAAATGCTATTCGGTGATGGGGTCCGTTTGTAGTGCCCAACAGGTGTTTGACGAAATGCCTGCAAGAGACTTGGTGGCTTGGAATGCTATGATTTCCTGCTTTTCTCAACAGGGTTTGCACCTGGAGTCACTAGAGACATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGTTTTACACTCGTTGGGTTGATTTCGTCTTGTGCCCATCTTGGAGCTTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTGCACAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGCCGTTTAGATCAGGCCATTCTTATCTTTGATAGAATGCAGAAGAAGGACATTTTCACTTGGAACTCGATGATTGTTGGGTATGGAGTTCATGGTCGAGGTAGTGAAGCTATATATTGCTTTCAACAGATGTTAGAAGCAAGAGTGCAACCGAACTCTATCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTCTTAAATACTTCCATTTGATGAGCTCTGAGTTTAGGCTAAGACCCGAGGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAGAAGGCACTTGAAATCGTATCAAATTCATCACAGAATGATCCAGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAGATTCACAAAAATGTGACAATAGGAGAAATTGCCATGAACAGTCTGTGTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGTTGGCTACAATCTATGCGGGAAAAAATGATACAGCTGGTGTTGCAAGAATGAGAAAAATGATCAAGAGGCAAGGGATAAAGACTACCCCAGGTTGGAGTTGGATTGAAATTGAGGATCAAGTTCATAAATTTGTGGTTGATGACAAGTCCCATCGTTATTCCATTGAAGTTTATGAGAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATGTAGGAGATGAGTCTATTTCATCACTGGATGTGCTTTCTACCACAGAGACCTTAAAGACTTCATGTACATATCATAGTGAGAAACTGGCAATTGCATTTGGATTGGCAAGAACTTCAGATGGGACACAGATACGCATTGTTAAAAACCTTAGAGTTTGTATAGATTGTCATTCATTCATAAAAGCTGTCTCGGCGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTCGATTCCACCATTTCAATGGTGGTCAATGTTCCTGCAATGACTACTGGCTCCACAAGGCCGCGTTGCAAGCACAAACTAAGCTCAAGAAGGATAGTTTACTAGTTGAAATAAAACTTCCGAGCTTCCACGGTCAACCAACTCACCATCAACCAACTACGGTAGAGGAAGTTAGAACACTTTGGATAGGGGATTTGCAGTACTGGGTCGATGAGTCTTACCTTAATTCTTGCTTTGCTCACACTGGCGAGGTAATATCAATTAAAATAATTCGCAACAAGATCACTGGCCAGCCTGAGGGTTATGGGTTTGTGGAGTTTGTATCTCATGCCGCAGCAGAAAGAATTTTGCAGACATACAATGGGACCCAGATGCCTGGAACAGAGCAAACTTTCAGATTGAATTGGGCCTCCTTTGGAATTGGAGAAAGGCGCCCGGACGCTGGCCCTGAGCACTCTATTTTTGTGGGGGATTTGGCTCCTGATGTTACAGACTATCTGTTGCAAGAGACCTTTAGAGTGCAATATCCATCTGTTAGGGGTGCTAAAGTTGTGACTGATCCAAACACTGGACGTTCAAAGGGATACGGGTTTGTTAAATTTGCTGATGAAAATGAAAGGAATCGAGCTATGTCAGAAATGAATGGTATTTATTGCTCAACTAGGCCTATGCGTATTAGTGCAGCAACACCCAAAAAGACCATTGGTGTTCAGCAGCAATATAGTCTAGGTAAAGCAATGTACCCAGTTCCAGCCTACACTACATCCGTGCCTGTGCTTCCAGCAGATTATGATGCAAATAACACAACAATCTTTGTTGGTAACTTGGATCCTAATATTACAGAGGAGGAGTTGAAGCAAACTTTTTTGCAGTTTGGTGAGATTGCTTATGTGAAAATTCCTTCTGGGAAAGGCTGTGGTTTTGTACAGTTTGGGACAAGGGCTTCAGCTGAAGAAGCCATCCAAAAGATGCAAGGAAAAATAATTGGTCAACAAGTGGTTCGTACTTCTTGGGGTAGAAATCCGGCTGTCAAGCAGGACTTGGCTACTTGGGGTCAGCAAGTTGATCCAAACCAGTGGAGTGCGTACTATGGGTATGGAGGGACTTATGATGCTTATGGATATGGAGTTGTACAAGATCCATCTTTATATGCTTATGGTGCATATTCGGGTTATGCCTCATATCCTCAACAGGTTGATGGTGTACAAGATTTGGCCGCTGTAGCTGGTGCAGTCCCTTCTGTGGAACAGGGAGAGGAATGGAATGATACGCTGGATACGCCAGATGTTGATTATTTAAATGATGCATACCTCTCAAAACATGAGAGTGCGATTTTAGGCTGGCCATTATGGTTGACTACCTCATCACTGGTTAGGCAAACATGA

Protein sequence

MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLLFHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSLYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCIDCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYWLHKAALQAQTKLKKDSLLVEIKLPSFHGQPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFVEFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTDYLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMRISAATPKKTIGVQQQYSLGKAMYPVPAYTTSVPVLPADYDANNTTIFVGNLDPNITEEELKQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGRNPAVKQDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVDGVQDLAAVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVRQT
Homology
BLAST of CcUC01G004300 vs. NCBI nr
Match: XP_038890323.1 (pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890324.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890325.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida])

HSP 1 Score: 1101.3 bits (2847), Expect = 0.0e+00
Identity = 534/579 (92.23%), Postives = 557/579 (96.20%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEK I+TLLQGCN+LN+LRKIHAHVIVSGLR HVAIGNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLNRLRKIHAHVIVSGLRHHVAIGNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQMECPQTEAWNSIIRGFAQSSSPI+AI+FYN+MV ASFSSPDTFTFSFVLKACERIKA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIIFYNQMVWASFSSPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKC EVHGSVIRCGYDGDVIVCTNLVKCYS MGS+C AQQVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCNEVHGSVIRCGYDGDVIVCTNLVKCYSAMGSICIAQQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+TYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLV S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LD+AI IFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQ+MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDEAIFIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQKMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNS+TFLGLLCGCSHQGLVQEG+KYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIV NSSQNDPVLWRILLGSCKIHKN+ IGEIAM SL ELGATNAGDCILLATIYAG
Sbjct: 361 KALEIVLNSSQNDPVLWRILLGSCKIHKNMKIGEIAMKSLSELGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           + DT GVARMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKSHRYSIEVYEKLREVIHQ
Sbjct: 421 EKDTVGVARMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           ASLFGYVGD S+SSLDVLSTTETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASLFGYVGDASVSSLDVLSTTETLKTSCTYHSEKLAIAFGLARTTDGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFIKAVSEAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of CcUC01G004300 vs. NCBI nr
Match: XP_004152881.1 (pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus] >XP_011648994.1 pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus] >XP_031737318.1 pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus])

HSP 1 Score: 1072.0 bits (2771), Expect = 3.1e-309
Identity = 520/579 (89.81%), Postives = 545/579 (94.13%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EK I+ LLQGCN+L +LRKIHAHVIVSGL  HV I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQMECPQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS PDTFTFSFVLKACERIKA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHGSVIRCGYD DVIVCTNLVKCYS MGSVC A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL  S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPN +TFLGLLCGCSHQGLVQEG+KYF+LMSS+FRL+PEVKHYGCLVDLYGRAGKL+
Sbjct: 301 ARIQPNPVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLKPEVKHYGCLVDLYGRAGKLD 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWRILLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRILLGSCKIHKNVTIGEIAMNRLSELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           + D AGVARMRKMIK QG KTTPGWSWIEI +QVHKFVVDDKSHRYS+EVYEKLREVIHQ
Sbjct: 421 EKDKAGVARMRKMIKSQGKKTTPGWSWIEIGEQVHKFVVDDKSHRYSVEVYEKLREVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           AS FGYVGDESISSLD+LST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASFFGYVGDESISSLDMLSTMETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGECSCNDYW 579

BLAST of CcUC01G004300 vs. NCBI nr
Match: XP_016899519.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899520.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899521.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899522.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899523.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >KAA0047714.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK08368.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1063.9 bits (2750), Expect = 8.6e-307
Identity = 521/579 (89.98%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEK I+TLLQGCN+L +LRKIHAHVIVSGL  HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQ E PQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS  DTFTFSFVLKACERIKA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVIVCTNLVKCYS MGSV  A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL  S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEG+KYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWR LLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           +ND AGV+RMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKS+RYSIEVYEKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           AS FGYVGDES+SSLDVLST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of CcUC01G004300 vs. NCBI nr
Match: XP_022149932.1 (pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149933.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149934.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149935.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia])

HSP 1 Score: 1042.3 bits (2694), Expect = 2.7e-300
Identity = 497/579 (85.84%), Postives = 542/579 (93.61%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EK I+TLLQGCN+LNKLRKIHAHVI+SGLR H AIGNKLLNFCAISVSGSL YA+LL
Sbjct: 1   MSNEKAILTLLQGCNSLNKLRKIHAHVILSGLRHHAAIGNKLLNFCAISVSGSLPYARLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           F  M+CPQTEAWNSIIRGFAQS+SPIEA+V+YN+MV AS S PDTFTFSFVLKACER+KA
Sbjct: 61  FRHMDCPQTEAWNSIIRGFAQSASPIEAVVYYNQMVWASLSPPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKC+EVHGSVIR GYDGDVI+CTNL+KCY+ MG +C AQQVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCREVHGSVIRWGYDGDVIICTNLMKCYAAMGFICVAQQVFDKMPTRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           +SQQGLH E+LETYNQMRS NVDVDGFTLVGL+SSCAHLGALNIGVQMHRFAREKGLV S
Sbjct: 181 YSQQGLHQEALETYNQMRSGNVDVDGFTLVGLLSSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KD+FTWNSMIVGYGVHGRG+EAI+CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDVFTWNSMIVGYGVHGRGTEAIFCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QP+S+TFLGLLCGCSHQGLVQEG+K+F+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPSSVTFLGLLCGCSHQGLVQEGVKFFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEI+ NSSQNDPVLWRILLGSCKIHKNV IGEIAMN+L +LGATNAGDCILLATIYAG
Sbjct: 361 KALEIILNSSQNDPVLWRILLGSCKIHKNVGIGEIAMNNLSQLGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
             +T+GV RMRKMI+ QGIKTTPGWSWIEI +QVHKFVVDDKSHRY IEVYEKL+EVIHQ
Sbjct: 421 VKNTSGVVRMRKMIRSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYYIEVYEKLKEVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           ASLFGY+GD   S+ DV ST+E L+TSC+YHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASLFGYIGDGYFSTTDVFSTSEILETSCSYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AFNREIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFVKAVSLAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of CcUC01G004300 vs. NCBI nr
Match: XP_023537237.1 (pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1039.3 bits (2686), Expect = 2.3e-299
Identity = 504/579 (87.05%), Postives = 536/579 (92.57%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEK IITLLQGCN+LNKLRKIHAHV+VSGLR HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQMEC QTEAWNSIIRGFAQSSSPI+A+V+YN+MV ASFSSPDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKE+HG++IRCGYDGDVI+CTNLVKCY+ MGSVC AQQVFDEMP RDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+ YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLV S
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAI IFDRM +KDIFTWNSMIVGYGVHGRG+EAI+CF++MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAIFIFDRMHRKDIFTWNSMIVGYGVHGRGTEAIFCFERMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEG+KYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALE + NSS NDPVLWRILLGSCKIHKNV +GEIAMN+L ELGATNAGDCILLATIYAG
Sbjct: 361 KALETIQNSSPNDPVLWRILLGSCKIHKNVGVGEIAMNNLSELGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
            NDTAGVA MRK IK QGIKT+PGWSWIEI +QVHKFVVDDKSHR SIEVYEKLREV+HQ
Sbjct: 421 VNDTAGVASMRKTIKSQGIKTSPGWSWIEIGEQVHKFVVDDKSHRDSIEVYEKLREVLHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           ASLFGYV D            ETLKTS TYHSEKLAIAFGLART+DGT IRIVKNLRVC 
Sbjct: 481 ASLFGYVRD-----------AETLKTSSTYHSEKLAIAFGLARTADGTPIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AF+REIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFMKAVSVAFDREIIVRDRVRFHHFKGGQCSCNDYW 568

BLAST of CcUC01G004300 vs. ExPASy Swiss-Prot
Match: Q9LXY5 (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 2.9e-204
Identity = 346/579 (59.76%), Postives = 440/579 (75.99%), Query Frame = 0

Query: 3   KEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLLFH 62
           K +VI+ +LQGCN++ KLRKIH+HVI++GL+ H +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  QMEC-PQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAE 122
             +  P T  WN +IRGF+ SSSP+ +I+FYNRM+ +S S PD FTF+F LK+CERIK+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCF 182
            KC E+HGSVIR G+  D IV T+LV+CYS  GSV  A +VFDEMP RDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSL 242
           S  GLH ++L  Y +M +E V  D +TLV L+SSCAH+ ALN+GV +HR A +      +
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCV 243

Query: 243 YVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEA 302
           +V NALIDMYAKCG L+ AI +F+ M+K+D+ TWNSMI+GYGVHG G EAI  F++M+ +
Sbjct: 244 FVSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVAS 303

Query: 303 RVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 362
            V+PN+ITFLGLL GCSHQGLV+EG+++F +MSS+F L P VKHYGC+VDLYGRAG+LE 
Sbjct: 304 GVRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLEN 363

Query: 363 ALEIV-SNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 422
           +LE++ ++S   DPVLWR LLGSCKIH+N+ +GE+AM  L +L A NAGD +L+ +IY+ 
Sbjct: 364 SLEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSA 423

Query: 423 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 482
            ND    A MRK+I+   ++T PGWSWIEI DQVHKFVVDDK H  S  +Y +L EVI++
Sbjct: 424 ANDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINR 483

Query: 483 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 542
           A L GY  ++S  +   LS    L ++ T HSEKLAIA+GL RT+ GT +RI KNLRVC 
Sbjct: 484 AILAGYKPEDSNRTAPTLS-DRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCR 543

Query: 543 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF K VS AFNREIIVRDRVRFHHF  G CSCNDYW
Sbjct: 544 DCHSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of CcUC01G004300 vs. ExPASy Swiss-Prot
Match: Q8VXZ9 (Polyadenylate-binding protein RBP47B' OS=Arabidopsis thaliana OX=3702 GN=RBP47B' PE=2 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.2e-163
Identity = 292/421 (69.36%), Postives = 332/421 (78.86%), Query Frame = 0

Query: 608  QPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFV 667
            Q ++H P T+EEVRTLWIGDLQYWVDE+YL SCF+ TGE++S+K+IRNKITGQPEGYGF+
Sbjct: 11   QGSYHHPQTLEEVRTLWIGDLQYWVDENYLTSCFSQTGELVSVKVIRNKITGQPEGYGFI 70

Query: 668  EFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTD 727
            EF+SHAAAER LQTYNGTQMPGTE TFRLNWASFG G+ + DAGP+HSIFVGDLAPDVTD
Sbjct: 71   EFISHAAAERTLQTYNGTQMPGTELTFRLNWASFGSGQ-KVDAGPDHSIFVGDLAPDVTD 130

Query: 728  YLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMR 787
            YLLQETFRV Y SVRGAKVVTDP+TGRSKGYGFVKFA+E+ERNRAM+EMNG+YCSTRPMR
Sbjct: 131  YLLQETFRVHYSSVRGAKVVTDPSTGRSKGYGFVKFAEESERNRAMAEMNGLYCSTRPMR 190

Query: 788  ISAATPKKTIGVQQQYSLGKAMYPVP-----AYTTSVPVLPADYDANNTTIFVGNLDPNI 847
            ISAATPKK +GVQQQY + KA+YPV      A      V P + D   TTI V NLD N+
Sbjct: 191  ISAATPKKNVGVQQQY-VTKAVYPVTVPSAVAAPVQAYVAPPESDVTCTTISVANLDQNV 250

Query: 848  TEEELKQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGR 907
            TEEELK+ F Q GE+ YVKIP+ KG G+VQF TR SAEEA+Q+MQG++IGQQ VR SW +
Sbjct: 251  TEEELKKAFSQLGEVIYVKIPATKGYGYVQFKTRPSAEEAVQRMQGQVIGQQAVRISWSK 310

Query: 908  NPAVKQDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVD 967
            NP   QD   W  Q DPNQW+ YYGYG  YDAY YG  QDPS+YAYG Y GY  YPQQ +
Sbjct: 311  NPG--QD--GWVTQADPNQWNGYYGYGQGYDAYAYGATQDPSVYAYGGY-GYPQYPQQGE 370

Query: 968  GVQDLA-AVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVR 1023
            G QD++ + AG V   EQ  E  D L TPDVD LN AYLS H SAILG P+W  TSSL  
Sbjct: 371  GTQDISNSAAGGVAGAEQ--ELYDPLATPDVDKLNAAYLSVHASAILGRPMWQRTSSLTS 422

BLAST of CcUC01G004300 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 460.3 bits (1183), Expect = 5.7e-128
Identity = 240/579 (41.45%), Postives = 360/579 (62.18%), Query Frame = 0

Query: 8   ITLLQ--GCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSG--SLAYAQLLFHQ 67
           I LLQ  G +++ KLR+IHA  I  G+    A   K L F  +S+     ++YA  +F +
Sbjct: 19  INLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSK 78

Query: 68  MECP-QTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAER 127
           +E P     WN++IRG+A+  + I A   Y  M  +    PDT T+ F++KA   +   R
Sbjct: 79  IEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVR 138

Query: 128 KCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFS 187
             + +H  VIR G+   + V  +L+  Y+  G V SA +VFD+MP +DLVAWN++I+ F+
Sbjct: 139 LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 198

Query: 188 QQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSLY 247
           + G   E+L  Y +M S+ +  DGFT+V L+S+CA +GAL +G ++H +  + GL  +L+
Sbjct: 199 ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 258

Query: 248 VGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEAR 307
             N L+D+YA+CGR+++A  +FD M  K+  +W S+IVG  V+G G EAI  F+ M    
Sbjct: 259 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 318

Query: 308 -VQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 367
            + P  ITF+G+L  CSH G+V+EG +YF  M  E+++ P ++H+GC+VDL  RAG+++K
Sbjct: 319 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 378

Query: 368 ALE-IVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 427
           A E I S   Q + V+WR LLG+C +H +  + E A   + +L   ++GD +LL+ +YA 
Sbjct: 379 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 438

Query: 428 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 487
           +   + V ++RK + R G+K  PG S +E+ ++VH+F++ DKSH  S  +Y KL+E+  +
Sbjct: 439 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 498

Query: 488 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 547
               GYV    IS++ V    E  + +  YHSEK+AIAF L  T + + I +VKNLRVC 
Sbjct: 499 LRSEGYV--PQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 558

Query: 548 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCH  IK VS  +NREI+VRDR RFHHF  G CSC DYW
Sbjct: 559 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CcUC01G004300 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 2.0e-117
Identity = 230/621 (37.04%), Postives = 361/621 (58.13%), Query Frame = 0

Query: 11  LQGCNNLNKLRKIHAHVIVSG-LRDHVAIGNKLLNFCAIS--VSGSLAYAQLLFHQMECP 70
           +  C  +  L +IHA  I SG +RD +A   ++L FCA S      L YA  +F+QM   
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAA-EILRFCATSDLHHRDLDYAHKIFNQMPQR 89

Query: 71  QTEAWNSIIRGFAQS--SSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCK 130
              +WN+IIRGF++S     + AI  +  M+S  F  P+ FTF  VLKAC +    ++ K
Sbjct: 90  NCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGK 149

Query: 131 EVHGSVIRCGYDGDVIVCTNLVKCYSV--------------------------------- 190
           ++HG  ++ G+ GD  V +NLV+ Y +                                 
Sbjct: 150 QIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEI 209

Query: 191 ------------MGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGLHLESLETYNQMRSE 250
                       +G   +A+ +FD+M  R +V+WN MIS +S  G   +++E + +M+  
Sbjct: 210 VLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKG 269

Query: 251 NVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSLYVGNALIDMYAKCGRLDQA 310
           ++  +  TLV ++ + + LG+L +G  +H +A + G+     +G+ALIDMY+KCG +++A
Sbjct: 270 DIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKA 329

Query: 311 ILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARVQPNSITFLGLLCGCSHQ 370
           I +F+R+ ++++ TW++MI G+ +HG+  +AI CF +M +A V+P+ + ++ LL  CSH 
Sbjct: 330 IHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHG 389

Query: 371 GLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIVSNSS-QNDPVLWRI 430
           GLV+EG +YF  M S   L P ++HYGC+VDL GR+G L++A E + N   + D V+W+ 
Sbjct: 390 GLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKA 449

Query: 431 LLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTAGVARMRKMIKRQGI 490
           LLG+C++  NV +G+   N L ++   ++G  + L+ +YA + + + V+ MR  +K + I
Sbjct: 450 LLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDI 509

Query: 491 KTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFGYVGDESISSLDVLS 550
           +  PG S I+I+  +H+FVV+D SH  + E+   L E+  +  L GY     I++  +L+
Sbjct: 510 RKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGY---RPITTQVLLN 569

Query: 551 TTETLKTSCT-YHSEKLAIAFGLARTSDGTQIRIVKNLRVCIDCHSFIKAVSAAFNREII 580
             E  K +   YHSEK+A AFGL  TS G  IRIVKNLR+C DCHS IK +S  + R+I 
Sbjct: 570 LEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKIT 629

BLAST of CcUC01G004300 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 424.5 bits (1090), Expect = 3.5e-117
Identity = 223/574 (38.85%), Postives = 343/574 (59.76%), Query Frame = 0

Query: 8   ITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAIS-VSGSLAYAQLLFHQMEC 67
           I L+  CN+L +L +I A+ I S + D V+   KL+NFC  S    S++YA+ LF  M  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHIED-VSFVAKLINFCTESPTESSMSYARHLFEAMSE 92

Query: 68  PQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCKE 127
           P    +NS+ RG+++ ++P+E    +  ++      PD +TF  +LKAC   KA  + ++
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 128 VHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGL 187
           +H   ++ G D +V VC  L+  Y+    V SA+ VFD +    +V +NAMI+ ++++  
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 188 HLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSLYVGNA 247
             E+L  + +M+ + +  +  TL+ ++SSCA LG+L++G  +H++A++      + V  A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 248 LIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARVQPN 307
           LIDM+AKCG LD A+ IF++M+ KD   W++MIV Y  HG+  +++  F++M    VQP+
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 308 SITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIV 367
            ITFLGLL  CSH G V+EG KYF  M S+F + P +KHYG +VDL  RAG LE A E +
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 368 SN-SSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTA 427
                   P+LWRILL +C  H N+ + E     + EL  ++ GD ++L+ +YA      
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 428 GVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFG 487
            V  +RK++K +     PG S IE+ + VH+F   D     + +++  L E++ +  L G
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSG 512

Query: 488 YVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCIDCHSF 547
           YV D S+     ++  E  + +  YHSEKLAI FGL  T  GT IR+VKNLRVC DCH+ 
Sbjct: 513 YVPDTSMVVHANMNDQEK-EITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

Query: 548 IKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
            K +S  F R++++RD  RFHHF  G+CSC D+W
Sbjct: 573 AKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CcUC01G004300 vs. ExPASy TrEMBL
Match: A0A0A0LH20 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074120 PE=3 SV=1)

HSP 1 Score: 1072.4 bits (2772), Expect = 1.1e-309
Identity = 520/580 (89.66%), Postives = 546/580 (94.14%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EK I+ LLQGCN+L +LRKIHAHVIVSGL  HV I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQMECPQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS PDTFTFSFVLKACERIKA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHGSVIRCGYD DVIVCTNLVKCYS MGSVC A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL  S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPN +TFLGLLCGCSHQGLVQEG+KYF+LMSS+FRL+PEVKHYGCLVDLYGRAGKL+
Sbjct: 301 ARIQPNPVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLKPEVKHYGCLVDLYGRAGKLD 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWRILLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRILLGSCKIHKNVTIGEIAMNRLSELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           + D AGVARMRKMIK QG KTTPGWSWIEI +QVHKFVVDDKSHRYS+EVYEKLREVIHQ
Sbjct: 421 EKDKAGVARMRKMIKSQGKKTTPGWSWIEIGEQVHKFVVDDKSHRYSVEVYEKLREVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           AS FGYVGDESISSLD+LST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASFFGYVGDESISSLDMLSTMETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYWL 581
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW+
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGECSCNDYWV 580

BLAST of CcUC01G004300 vs. ExPASy TrEMBL
Match: A0A1S4DU66 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN=LOC103485901 PE=3 SV=1)

HSP 1 Score: 1063.9 bits (2750), Expect = 4.2e-307
Identity = 521/579 (89.98%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEK I+TLLQGCN+L +LRKIHAHVIVSGL  HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQ E PQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS  DTFTFSFVLKACERIKA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVIVCTNLVKCYS MGSV  A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL  S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEG+KYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWR LLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           +ND AGV+RMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKS+RYSIEVYEKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           AS FGYVGDES+SSLDVLST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of CcUC01G004300 vs. ExPASy TrEMBL
Match: A0A5A7TXJ9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G001800 PE=3 SV=1)

HSP 1 Score: 1063.9 bits (2750), Expect = 4.2e-307
Identity = 521/579 (89.98%), Postives = 546/579 (94.30%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEK I+TLLQGCN+L +LRKIHAHVIVSGL  HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQ E PQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS  DTFTFSFVLKACERIKA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVIVCTNLVKCYS MGSV  A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL  S
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEG+KYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWR LLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           +ND AGV+RMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKS+RYSIEVYEKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           AS FGYVGDES+SSLDVLST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of CcUC01G004300 vs. ExPASy TrEMBL
Match: A0A6J1D832 (pentatricopeptide repeat-containing protein At3g56550 OS=Momordica charantia OX=3673 GN=LOC111018226 PE=3 SV=1)

HSP 1 Score: 1042.3 bits (2694), Expect = 1.3e-300
Identity = 497/579 (85.84%), Postives = 542/579 (93.61%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EK I+TLLQGCN+LNKLRKIHAHVI+SGLR H AIGNKLLNFCAISVSGSL YA+LL
Sbjct: 1   MSNEKAILTLLQGCNSLNKLRKIHAHVILSGLRHHAAIGNKLLNFCAISVSGSLPYARLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           F  M+CPQTEAWNSIIRGFAQS+SPIEA+V+YN+MV AS S PDTFTFSFVLKACER+KA
Sbjct: 61  FRHMDCPQTEAWNSIIRGFAQSASPIEAVVYYNQMVWASLSPPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKC+EVHGSVIR GYDGDVI+CTNL+KCY+ MG +C AQQVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCREVHGSVIRWGYDGDVIICTNLMKCYAAMGFICVAQQVFDKMPTRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           +SQQGLH E+LETYNQMRS NVDVDGFTLVGL+SSCAHLGALNIGVQMHRFAREKGLV S
Sbjct: 181 YSQQGLHQEALETYNQMRSGNVDVDGFTLVGLLSSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAILIFDRMQ+KD+FTWNSMIVGYGVHGRG+EAI+CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDVFTWNSMIVGYGVHGRGTEAIFCFQQMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QP+S+TFLGLLCGCSHQGLVQEG+K+F+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPSSVTFLGLLCGCSHQGLVQEGVKFFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEI+ NSSQNDPVLWRILLGSCKIHKNV IGEIAMN+L +LGATNAGDCILLATIYAG
Sbjct: 361 KALEIILNSSQNDPVLWRILLGSCKIHKNVGIGEIAMNNLSQLGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
             +T+GV RMRKMI+ QGIKTTPGWSWIEI +QVHKFVVDDKSHRY IEVYEKL+EVIHQ
Sbjct: 421 VKNTSGVVRMRKMIRSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYYIEVYEKLKEVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           ASLFGY+GD   S+ DV ST+E L+TSC+YHSEKLAIAFGLART+DGTQIRIVKNLRVC 
Sbjct: 481 ASLFGYIGDGYFSTTDVFSTSEILETSCSYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AFNREIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFVKAVSLAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of CcUC01G004300 vs. ExPASy TrEMBL
Match: A0A6J1FBZ0 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3662 GN=LOC111444025 PE=3 SV=1)

HSP 1 Score: 1034.6 bits (2674), Expect = 2.7e-298
Identity = 501/579 (86.53%), Postives = 535/579 (92.40%), Query Frame = 0

Query: 1   MSKEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEK I+TLLQGCN+LNKLRKIHAHV+VSGLR HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMECPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQMEC QTEAWNSIIRGFAQSSSPI+A+V+YN+MV ASFSSPDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKE+HG++IRCGYDGDVI+CTNLVKCY+ MGSVC A QVFDEMP RDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAHQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHS 240
           FSQQGLH E+L+ YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLV S
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCG LDQAI IFDRM +KDIFTWNSMIVGYGVHGRG+EAI+CF++MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAIFIFDRMHRKDIFTWNSMIVGYGVHGRGTEAIFCFERMLE 300

Query: 301 ARVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNS+TFLGLLCGCSHQGLVQEG+KYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALE + NSS NDPVLWRILLGSCKIHKNV +GEIAMN+L ELGATNAGDCILLATIYAG
Sbjct: 361 KALETIRNSSPNDPVLWRILLGSCKIHKNVGVGEIAMNNLNELGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
            NDTAGVA MRK IK QGIKT+PGWSWIEI +QVHKFVVDDKSHR SIEVYEKLREV+HQ
Sbjct: 421 VNDTAGVASMRKTIKSQGIKTSPGWSWIEIGEQVHKFVVDDKSHRDSIEVYEKLREVLHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 540
           ASLFGYV D            ETLKTS TYHSEKLAIAFGLART+DGT IRIVKNLRVC 
Sbjct: 481 ASLFGYVID-----------AETLKTSSTYHSEKLAIAFGLARTADGTPIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AF+REIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFMKAVSVAFDREIIVRDRVRFHHFKGGQCSCNDYW 568

BLAST of CcUC01G004300 vs. TAIR 10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 713.8 bits (1841), Expect = 2.0e-205
Identity = 346/579 (59.76%), Postives = 440/579 (75.99%), Query Frame = 0

Query: 3   KEKVIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLLFH 62
           K +VI+ +LQGCN++ KLRKIH+HVI++GL+ H +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  QMEC-PQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAE 122
             +  P T  WN +IRGF+ SSSP+ +I+FYNRM+ +S S PD FTF+F LK+CERIK+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCF 182
            KC E+HGSVIR G+  D IV T+LV+CYS  GSV  A +VFDEMP RDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSL 242
           S  GLH ++L  Y +M +E V  D +TLV L+SSCAH+ ALN+GV +HR A +      +
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCV 243

Query: 243 YVGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEA 302
           +V NALIDMYAKCG L+ AI +F+ M+K+D+ TWNSMI+GYGVHG G EAI  F++M+ +
Sbjct: 244 FVSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVAS 303

Query: 303 RVQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 362
            V+PN+ITFLGLL GCSHQGLV+EG+++F +MSS+F L P VKHYGC+VDLYGRAG+LE 
Sbjct: 304 GVRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLEN 363

Query: 363 ALEIV-SNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 422
           +LE++ ++S   DPVLWR LLGSCKIH+N+ +GE+AM  L +L A NAGD +L+ +IY+ 
Sbjct: 364 SLEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSA 423

Query: 423 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 482
            ND    A MRK+I+   ++T PGWSWIEI DQVHKFVVDDK H  S  +Y +L EVI++
Sbjct: 424 ANDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINR 483

Query: 483 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 542
           A L GY  ++S  +   LS    L ++ T HSEKLAIA+GL RT+ GT +RI KNLRVC 
Sbjct: 484 AILAGYKPEDSNRTAPTLS-DRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCR 543

Query: 543 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF K VS AFNREIIVRDRVRFHHF  G CSCNDYW
Sbjct: 544 DCHSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of CcUC01G004300 vs. TAIR 10
Match: AT5G19350.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 577.0 bits (1486), Expect = 3.0e-164
Identity = 292/421 (69.36%), Postives = 332/421 (78.86%), Query Frame = 0

Query: 608  QPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFV 667
            Q ++H P T+EEVRTLWIGDLQYWVDE+YL SCF+ TGE++S+K+IRNKITGQPEGYGF+
Sbjct: 11   QGSYHHPQTLEEVRTLWIGDLQYWVDENYLTSCFSQTGELVSVKVIRNKITGQPEGYGFI 70

Query: 668  EFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTD 727
            EF+SHAAAER LQTYNGTQMPGTE TFRLNWASFG G+ + DAGP+HSIFVGDLAPDVTD
Sbjct: 71   EFISHAAAERTLQTYNGTQMPGTELTFRLNWASFGSGQ-KVDAGPDHSIFVGDLAPDVTD 130

Query: 728  YLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMR 787
            YLLQETFRV Y SVRGAKVVTDP+TGRSKGYGFVKFA+E+ERNRAM+EMNG+YCSTRPMR
Sbjct: 131  YLLQETFRVHYSSVRGAKVVTDPSTGRSKGYGFVKFAEESERNRAMAEMNGLYCSTRPMR 190

Query: 788  ISAATPKKTIGVQQQYSLGKAMYPVP-----AYTTSVPVLPADYDANNTTIFVGNLDPNI 847
            ISAATPKK +GVQQQY + KA+YPV      A      V P + D   TTI V NLD N+
Sbjct: 191  ISAATPKKNVGVQQQY-VTKAVYPVTVPSAVAAPVQAYVAPPESDVTCTTISVANLDQNV 250

Query: 848  TEEELKQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGR 907
            TEEELK+ F Q GE+ YVKIP+ KG G+VQF TR SAEEA+Q+MQG++IGQQ VR SW +
Sbjct: 251  TEEELKKAFSQLGEVIYVKIPATKGYGYVQFKTRPSAEEAVQRMQGQVIGQQAVRISWSK 310

Query: 908  NPAVKQDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVD 967
            NP   QD   W  Q DPNQW+ YYGYG  YDAY YG  QDPS+YAYG Y GY  YPQQ +
Sbjct: 311  NPG--QD--GWVTQADPNQWNGYYGYGQGYDAYAYGATQDPSVYAYGGY-GYPQYPQQGE 370

Query: 968  GVQDLA-AVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVR 1023
            G QD++ + AG V   EQ  E  D L TPDVD LN AYLS H SAILG P+W  TSSL  
Sbjct: 371  GTQDISNSAAGGVAGAEQ--ELYDPLATPDVDKLNAAYLSVHASAILGRPMWQRTSSLTS 422

BLAST of CcUC01G004300 vs. TAIR 10
Match: AT5G19350.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 573.2 bits (1476), Expect = 4.3e-163
Identity = 287/416 (68.99%), Postives = 326/416 (78.37%), Query Frame = 0

Query: 608  QPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFV 667
            Q ++H P T+EEVRTLWIGDLQYWVDE+YL SCF+ TGE++S+K+IRNKITGQPEGYGF+
Sbjct: 11   QGSYHHPQTLEEVRTLWIGDLQYWVDENYLTSCFSQTGELVSVKVIRNKITGQPEGYGFI 70

Query: 668  EFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTD 727
            EF+SHAAAER LQTYNGTQMPGTE TFRLNWASFG G+ + DAGP+HSIFVGDLAPDVTD
Sbjct: 71   EFISHAAAERTLQTYNGTQMPGTELTFRLNWASFGSGQ-KVDAGPDHSIFVGDLAPDVTD 130

Query: 728  YLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMR 787
            YLLQETFRV Y SVRGAKVVTDP+TGRSKGYGFVKFA+E+ERNRAM+EMNG+YCSTRPMR
Sbjct: 131  YLLQETFRVHYSSVRGAKVVTDPSTGRSKGYGFVKFAEESERNRAMAEMNGLYCSTRPMR 190

Query: 788  ISAATPKKTIGVQQQYSLGKAMYPVPAYTTSVPVLPADYDANNTTIFVGNLDPNITEEEL 847
            ISAATPKK +GVQQQY     +    A      V P + D   TTI V NLD N+TEEEL
Sbjct: 191  ISAATPKKNVGVQQQYVTKVTVPSAVAAPVQAYVAPPESDVTCTTISVANLDQNVTEEEL 250

Query: 848  KQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGRNPAVK 907
            K+ F Q GE+ YVKIP+ KG G+VQF TR SAEEA+Q+MQG++IGQQ VR SW +NP   
Sbjct: 251  KKAFSQLGEVIYVKIPATKGYGYVQFKTRPSAEEAVQRMQGQVIGQQAVRISWSKNPG-- 310

Query: 908  QDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVDGVQDL 967
            QD   W  Q DPNQW+ YYGYG  YDAY YG  QDPS+YAYG Y GY  YPQQ +G QD+
Sbjct: 311  QD--GWVTQADPNQWNGYYGYGQGYDAYAYGATQDPSVYAYGGY-GYPQYPQQGEGTQDI 370

Query: 968  A-AVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVRQ 1023
            + + AG V   EQ  E  D L TPDVD LN AYLS H SAILG P+W  TSSL  Q
Sbjct: 371  SNSAAGGVAGAEQ--ELYDPLATPDVDKLNAAYLSVHASAILGRPMWQRTSSLTSQ 418

BLAST of CcUC01G004300 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 460.3 bits (1183), Expect = 4.0e-129
Identity = 240/579 (41.45%), Postives = 360/579 (62.18%), Query Frame = 0

Query: 8   ITLLQ--GCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSG--SLAYAQLLFHQ 67
           I LLQ  G +++ KLR+IHA  I  G+    A   K L F  +S+     ++YA  +F +
Sbjct: 19  INLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSK 78

Query: 68  MECP-QTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAER 127
           +E P     WN++IRG+A+  + I A   Y  M  +    PDT T+ F++KA   +   R
Sbjct: 79  IEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVR 138

Query: 128 KCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFS 187
             + +H  VIR G+   + V  +L+  Y+  G V SA +VFD+MP +DLVAWN++I+ F+
Sbjct: 139 LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 198

Query: 188 QQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSLY 247
           + G   E+L  Y +M S+ +  DGFT+V L+S+CA +GAL +G ++H +  + GL  +L+
Sbjct: 199 ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 258

Query: 248 VGNALIDMYAKCGRLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEAR 307
             N L+D+YA+CGR+++A  +FD M  K+  +W S+IVG  V+G G EAI  F+ M    
Sbjct: 259 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 318

Query: 308 -VQPNSITFLGLLCGCSHQGLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 367
            + P  ITF+G+L  CSH G+V+EG +YF  M  E+++ P ++H+GC+VDL  RAG+++K
Sbjct: 319 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 378

Query: 368 ALE-IVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 427
           A E I S   Q + V+WR LLG+C +H +  + E A   + +L   ++GD +LL+ +YA 
Sbjct: 379 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 438

Query: 428 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 487
           +   + V ++RK + R G+K  PG S +E+ ++VH+F++ DKSH  S  +Y KL+E+  +
Sbjct: 439 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 498

Query: 488 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCI 547
               GYV    IS++ V    E  + +  YHSEK+AIAF L  T + + I +VKNLRVC 
Sbjct: 499 LRSEGYV--PQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 558

Query: 548 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCH  IK VS  +NREI+VRDR RFHHF  G CSC DYW
Sbjct: 559 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CcUC01G004300 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 425.2 bits (1092), Expect = 1.4e-118
Identity = 230/621 (37.04%), Postives = 361/621 (58.13%), Query Frame = 0

Query: 11  LQGCNNLNKLRKIHAHVIVSG-LRDHVAIGNKLLNFCAIS--VSGSLAYAQLLFHQMECP 70
           +  C  +  L +IHA  I SG +RD +A   ++L FCA S      L YA  +F+QM   
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAA-EILRFCATSDLHHRDLDYAHKIFNQMPQR 89

Query: 71  QTEAWNSIIRGFAQS--SSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCK 130
              +WN+IIRGF++S     + AI  +  M+S  F  P+ FTF  VLKAC +    ++ K
Sbjct: 90  NCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGK 149

Query: 131 EVHGSVIRCGYDGDVIVCTNLVKCYSV--------------------------------- 190
           ++HG  ++ G+ GD  V +NLV+ Y +                                 
Sbjct: 150 QIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEI 209

Query: 191 ------------MGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGLHLESLETYNQMRSE 250
                       +G   +A+ +FD+M  R +V+WN MIS +S  G   +++E + +M+  
Sbjct: 210 VLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKG 269

Query: 251 NVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVHSLYVGNALIDMYAKCGRLDQA 310
           ++  +  TLV ++ + + LG+L +G  +H +A + G+     +G+ALIDMY+KCG +++A
Sbjct: 270 DIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKA 329

Query: 311 ILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARVQPNSITFLGLLCGCSHQ 370
           I +F+R+ ++++ TW++MI G+ +HG+  +AI CF +M +A V+P+ + ++ LL  CSH 
Sbjct: 330 IHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHG 389

Query: 371 GLVQEGLKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIVSNSS-QNDPVLWRI 430
           GLV+EG +YF  M S   L P ++HYGC+VDL GR+G L++A E + N   + D V+W+ 
Sbjct: 390 GLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKA 449

Query: 431 LLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTAGVARMRKMIKRQGI 490
           LLG+C++  NV +G+   N L ++   ++G  + L+ +YA + + + V+ MR  +K + I
Sbjct: 450 LLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDI 509

Query: 491 KTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFGYVGDESISSLDVLS 550
           +  PG S I+I+  +H+FVV+D SH  + E+   L E+  +  L GY     I++  +L+
Sbjct: 510 RKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGY---RPITTQVLLN 569

Query: 551 TTETLKTSCT-YHSEKLAIAFGLARTSDGTQIRIVKNLRVCIDCHSFIKAVSAAFNREII 580
             E  K +   YHSEK+A AFGL  TS G  IRIVKNLR+C DCHS IK +S  + R+I 
Sbjct: 570 LEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKIT 629

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890323.10.0e+0092.23pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_03... [more]
XP_004152881.13.1e-30989.81pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativu... [more]
XP_016899519.18.6e-30789.98PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] ... [more]
XP_022149932.12.7e-30085.84pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_... [more]
XP_023537237.12.3e-29987.05pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q9LXY52.9e-20459.76Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX... [more]
Q8VXZ94.2e-16369.36Polyadenylate-binding protein RBP47B' OS=Arabidopsis thaliana OX=3702 GN=RBP47B'... [more]
A8MQA35.7e-12841.45Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9FI802.0e-11737.04Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Q8LK933.5e-11738.85Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LH201.1e-30989.66DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0741... [more]
A0A1S4DU664.2e-30789.98pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN... [more]
A0A5A7TXJ94.2e-30789.98Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1D8321.3e-30085.84pentatricopeptide repeat-containing protein At3g56550 OS=Momordica charantia OX=... [more]
A0A6J1FBZ02.7e-29886.53pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT3G56550.12.0e-20559.76Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G19350.13.0e-16469.36RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT5G19350.24.3e-16368.99RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT4G21065.14.0e-12941.45Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.11.4e-11837.04Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000504RNA recognition motif domainSMARTSM00360rrm1_1coord: 622..697
e-value: 2.8E-14
score: 63.4
coord: 832..899
e-value: 7.1E-23
score: 92.0
coord: 715..789
e-value: 2.0E-20
score: 83.9
IPR000504RNA recognition motif domainPFAMPF00076RRM_1coord: 623..691
e-value: 7.4E-14
score: 51.4
coord: 833..896
e-value: 3.3E-18
score: 65.3
coord: 716..784
e-value: 1.1E-15
score: 57.2
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 621..701
score: 15.408587
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 831..903
score: 17.438179
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 714..793
score: 17.114126
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 823..909
e-value: 2.5E-24
score: 87.4
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 703..808
e-value: 1.1E-21
score: 79.2
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 620..700
e-value: 1.1E-23
score: 85.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 245..476
e-value: 3.3E-35
score: 123.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 127..226
e-value: 2.4E-16
score: 61.6
coord: 5..126
e-value: 5.5E-13
score: 50.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 270..318
e-value: 1.1E-9
score: 38.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 172..205
e-value: 2.5E-5
score: 22.1
coord: 71..104
e-value: 7.3E-4
score: 17.5
coord: 273..306
e-value: 1.2E-7
score: 29.4
coord: 245..272
e-value: 4.1E-5
score: 21.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 346..365
e-value: 0.061
score: 13.6
coord: 172..202
e-value: 1.6E-5
score: 24.8
coord: 71..98
e-value: 0.0027
score: 17.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 68..102
score: 8.506026
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 240..270
score: 9.415814
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..204
score: 10.226951
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 11.432693
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 443..568
e-value: 4.1E-33
score: 114.0
NoneNo IPR availablePANTHERPTHR47928:SF8SUBFAMILY NOT NAMEDcoord: 10..553
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 10..553
NoneNo IPR availableCDDcd12346RRM3_NGR1_NAM8_likecoord: 830..901
e-value: 1.39114E-37
score: 133.198
NoneNo IPR availableCDDcd12344RRM1_SECp43_likecoord: 622..702
e-value: 1.019E-46
score: 159.35
NoneNo IPR availableCDDcd12345RRM2_SECp43_likecoord: 713..792
e-value: 7.36356E-51
score: 171.308
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 242..374
IPR035979RNA-binding domain superfamilySUPERFAMILY54928RNA-binding domain, RBDcoord: 713..906
IPR035979RNA-binding domain superfamilySUPERFAMILY54928RNA-binding domain, RBDcoord: 608..689

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC01G004300.1CcUC01G004300.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding