CaUC01G004240 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC01G004240
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCiama_Chr01: 4625023 .. 4633940 (-)
RNA-Seq ExpressionCaUC01G004240
SyntenyCaUC01G004240
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGAAGGAAAAAGCCATTATCACTCTCTTGCAAGGCTGCAACAACCTTAACAAGCTTCGCAAAATCCACGCACATGTTATTGTAAGCGGCCTCCGCGATCATGTCGCCATTGGCAACAAGCTTTTGAACTTCTGTGCCATCTCTGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGCAGTGCCCACAAACCGAAGCCTGGAACTCCATCATCAGAGGTTTTGCCCAGAGCTCATCTCCCATTGAGGCTATTGTTTTCTACAATCGAATGGTTTCGGCCTCTTTCTCTTCCCCTGACACTTTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGAATCAAGGCTGAGCGTAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGTTGCGGTTATGATGGGGATGTGATTGTCTGCACCAATCTTGTCAAATGCTATTCGGTGATGGGGTCCGTTTGTAGTGCCCAACAGGTGTTTGACGAAATGCCTGCAAGAGACTTGGTGGCTTGGAATGCTATGATTTCCTGCTTTTCTCAACAGGGTTTGCACCTGGAGTCACTGGAGACATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGTTTTACACTCGTTGGGTTGATTTCGTCTTGTGCCCATCTTGGAGCTTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTGCAGAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGCAGTTTAGATCAGGCCATTCTTATCTTTGATAGAATGCAGAAGAAGGACATTTTCACTTGGAACTCGATGATTGTTGGGTATGGAGTTCATGGTCGAGGTAGTGAAGCTATATATTGCTTTCAACAGATGTTAGAAGCAAGAATGCAACCGAACTCTATCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTGTTAAATACTTCCATTTGATGAGCTCTGAGTTTAGGCTAAGACCTGAGGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAGAAGGCACTTGAAATTGTATCAAATTCATCACAGAATGATCCAGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAGATTCACAAAAATGTGACAATAGGAGAAATTGCCATGAACAGTCTGTGTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGTTGGCTACAATCTATGCGGGAAAAAATGATACAGCTGGTGTTGCAAGAATGAGAAAAATGATAAAGAGGCAAGGGATAAAGACTACCCCAGGTTGGAGTTGGATTGAAATTGAGGATCAAGTTCATAAATTTGTGGTTGATGACAAGTCCCACCGTTATTCCATTGAAGTTTATGAGAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATGTAGGAGATGAGTCTATTTCATCACTGGATGTGCTTTCTACCACAGAGACCTTAAAGACTTCATGTACATATCATAGTGAGAAACTTGCAATTGCATTTGGATTGGCAAGAACTTCAGATGGGACACAGATACGCATTGTTAAAAACCTTAGAGTTTGTAGAGATTGTCATTCATTCATAAAAGCTGTCTCGGCGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTAGATTCCACCATTTCAATGGTGGTCAATGTTCCTGCAATGACTACTGGTGAAAAGTGAAAAATGTACTTTCAATTTTGCCTGTTTCAAGTCTGCAGAGCCTCCTTTCGTTGTAATGGAGTTTTGGCTCTTTCCATACCTCTTTTATTAGCATAGTAGAGTTGTTGCTTACAGTCATGCAAGAAGATATAGCTGAGTTTCAAGTTATTATGCTCTTCTTTGATGTAACTTCTAACCTACTCCGCACGACAAGTTTATGTAATTAGTTATACATTCTTTGAAATTATACAGGCTCCATAAGGCCGCGTTGCAAGCACAAACTAAGCTCAAGAAGGATAGTTTACTAGTTGAAACAAAACTTCCGAGCTTCCACGGTATGTTAGGAAATTGAGTTTCCCTACATTTTAAGGGAAAATTGTATTGGATAGCAAATTTTGATTATATATTAGTATCTGAGTCATCAATTTTTAAAATATTGTAAATATGGCAAAATTTGTTTGAAATAATATATATTGATAAAAATCATTTTATTTGATTTTTTTCTCTTTCCAAAAACACCCTCATCTCTTCAAATGTGTAAAATCGGCTTCAACTACACGTGTTTTCTTCTTTGACAATCTGTGAGTTTCTCTACAACATCTTCAAACTCATAAGGAAGGAATTAGCTCTATGTCGCTTTTTCATTAAATTCAAACTTATTCAGAACTTTACGGGAAAAGTTACAAAGGGGCTCAACTCCTTCCCAACAAGTTTGAATTTACCAGGAATTGAGGCCTATGTGGTTTTCCTGTAAAGTTGGTCATAAAGTACAAGAGCTTTCATTCATTTCAGACAAGTTCAAAAAAGACAAAGAGCTACTTTAGGAAGGTAGTAGGAAGGAAAAGAAAGAAAAGGAAAAAAAAGGGAAAAATTGATTCTGTCAAATTTACGGGAAATTCTTATTATTAGTCGACTAAGCTGTTTTGACAAGTATATCAATCAAGTGTATATTAGTAGTGTGTCATGTGTATATCAGTAGTATATTAACTATACCAGTCAAATGCATCATTATCAAGTTTATCAATAATATGATATTAACTAATATAATGAAAATGAAGTATATCAATAGTATATTTAGAATGTTAATAGTAGAGGGTGTTTTAGACTTTTATCCTTTTAATGCTTGGGTTGGATCACAATTTCGCTAAATCTGCAAAAGGAAAAAGTTTGAAACACGCGTGCAAATTTCATTCATCTTTTGCCATACATGTAATTGCCTTTTTTTAAAAAACGTTTTTTTTCTACAATCTTTTTTACTTTTTCAGCATTTTAGTGAAGTACCCCATAATATAAAATGACGTTTTTGTCCCCATTAATTCCCTTTCAATTTTACCTCGACCTAACTTACCTAACAAAAATTTTCAGCTTCTTAACTTCAGTTTGCTTGAATTATACTCATGTTTCCATTTTGCTTTCACCAAAAAGGCACAAGTGTCTCGTTTTCACCTTGCTTATAGAGAATCTATTGGTCCCATGTTTAGGGACTTCTTTCATTTTGCAGCTTTTCCTCTATTAGAGAAAAAGGTATGAATGTTGTTTTAATTTTAAGTGGGGGAAAGGTTCTTTATTGGAAATGTTTAAATTTATTGGAATTCTTGAGAAAATAATATTTTTCAGTCAACTTTACGAAAAAAAAGGGAAAAAAATGATTCTGTCAATTTTCCCGTAAAGCTGACTAAATTTATTTATTTATTATTGTTGTTTTTTTTACATAAAGCTGGAAACTTTATGGAAATTTTGACAGAATTTTTTTTCCTTTTTTTTTTTTTTCTTTCAAGAAAACAACTATTATCCTTCAATTGATTGGAAAATATTATTTTCCCAAGAATTCCAATAAATCTAAATATTTCTTGGAAAGACCTTTTCCTCACACAAAATTAAGCAACTGCTATTTTAACTTTAAACAACATAAAACACATGAATAAAAGAGAAAAAACCAAAACAATGTCCATTCCTTCTTTTGTATAGAGAAAAAAACCAAAACAAGGTATGATAGGAAATGATGCAAAAGGAAAAAAGTCCTGAAAATTTTTGTTAAATAAGTTAGGTCGAGATAAAATTGAAGGGAATTGAATGAGAAATTAATGAGGATAAAAATGTCATTTTATATTGTGGAGGCTTCCAAAACTTTTTTTAGGTTAGAAATAACAATTTAGTGATAAAACATATGAAAAAGGAAAAGTAAGAAAAAAATTGCTGTTAACCGTAGTTTGAGTCTCACCGTCGTCCGAGGCGAAAACGAAACCCTAGCAGTGGCAAGTGCCGAGAATTCGCAGGTTTCTTCAAATCTCTCCACTTGTTTATTTTTCTATTGTTATTCATTGAGTCTTCAGTTATCACCAAACTTTAGCCCCAAAATCTTTGGAGTCACCGATCAATTTACAGAGACGGTAAAGTCGGAGCATCATACATAGTGTAGAAACAATCTTACAAAGTTCAAACATCAGCAACTCAATTACCAGAAGCGATTAGATAATGCGTCAATAGATACGATGGCGACTCTGGCAGGTCAACCAACTCACCATCAACCAACTACGGTAGAGGAAGTTAGAACACTTTGGATAGGGGATTTGCAGTATTGGGTCGATGAGTCTTACCTTAATTCTTGCTTTGCTCACACTGGCGAGGTTCGTTTGTTAGTTTTTTCTTAATTTATTTTGCAACTTTGGTGTTTAATGATGATGTTTTTAATCATAGACCCATTAGGATTTCACCAGTTTCTCAAGCTGATGACACTGTTTTATGAAACCTAGATGTGTTTAAGTGGATTTTTTAGGTCTTCTACTTTTAGGTTGGCCTTTCCAATAATTTGTCCTGAAATAATTGGTTAGTTTTCTTCGGGTGTTTGGAAGTAAATTTGGGTGTATAAAATTAGAATGTGTTTGTATTTTGCTCTATTGCTACTGAGATCTGAACTGTAGGGAGAGGAAGGGCCTCTCTTATGACCTTTCATGCCACTGATGGGGATGATTTTGATGCTTGAGAGCCGTGGTAGTGGAAACGCCAATGTTTAAAATTATAGTTATTTTATAGTTGTTATATTACAATAGCCCATACAAATGTACTTCTGATGTCTAGTTGTTGTTGTTTGTATTCTAGTATTTGAGTAAGAGCTTGTAACTTAGCTGCATTCAGAGAATAGCATTCTGTTGTTTGTTGACTATGCCATATTTCCTAATACAGGTAATATCAATTAAAATAATTCGCAACAAGATCACTGGCCAGCCTGAGGGTTATGGGTTTGTGGAGTTTGTATCTCATGCCGCAGCAGAAAGAATTTTGCAGACATACAATGGGACCCAGATGCCTGGAACGGAGCAAACTTTCAGATTGAATTGGGCCTCCTTTGGAATTGGAGAAAGGCGCCCGGACGCTGGCCCTGAGCACTCTATTTTTGTGGGGGATTTGGCTCCTGATGTTACAGACTATCTGTTGCAAGAGACCTTTAGAGTGCAATATCCATCTGTTAGGGGTGCTAAAGTTGTGACTGATCCAAACACTGGACGTTCAAAGGGATATGGGTTTGTTAAATTTGCTGATGAAAATGAAAGGAATCGAGCTATGTCAGAAATGAATGGTATTTATTGCTCAACTAGGCCTATGCGTATTAGTGCAGCAACACCCAAAAAGACCATTGGTGTTCAGCAGCAATATAGTCTAGGTAAAGGTAATAAATAATGAATATGCAGTACTGCCTTTTCGTTTATTCATTTTATCCCTTTTCATGTGTTATAATATATATGATCCTTTTGGAGTGAGTGCAATCATTTTTTTATTCATCCAAGTTTCCAACCCTCCTAGCTTCTCTATTTACATGAAACCGGTTTGGTTCCTATCTGTGCTAGTACTCATAGATATGAAACTGCTGGCTTTAAAATTTGAATTTCACCTTTGTCAATGAAAGTTTTATTTTCATCATCAAACCTACAGCCAGCAGCTAATAGGTACAAATGATCATTGTGAAATTTTAGTTGATATTAGCTAAACTAGTGTTCTTGGACCTATGATGAAAACTTATCAAGTTAATTGTTATGCATATTTAATGTATTTGTCAAAATACAATTTGTCTCTTGATTAATTTTTTGGATAAATGCGTTAATCGATGTCAAAGATATTTTCTAAATGCGTTGATGGGGGAGGATATTTGCAAATTTCAAATGAAAGAGAATTCACTGTTATTAGTACTTTTCCTTCACCTATGAGAAAATCTTAGGAGGGGATGGTAATTTACTATTTTTAAACTTGTACTTATCAATGTGCCTCTTGTCTCAGCAATGTACCCAGTTCCAGCCTACACTACATCCGTGCCTGTGCTTCCAGCAGATTATGATGCAAATAACACAACAGTAAGTCAAATGTCAAGTGACAGTTTTACATCTTATGGTTTCTATGCTCAGCTCTTACTATATTGTACTCTGTGCAGATCTTTGTCGGTAACTTGGATCCTAATATTACAGAGGAGGAGTTGAAGCAAACTTTTTTGCAGTTTGGTGAGATTGCTTATGTGAAAATTCCTTCCGGGAAAGGCTGTGGTTTTGTACAGTTTGGGACAAGGTGTGTTTTGCATGATTAATTTTATTGATGTAAGTTGAGTAAATTGCATCTCTGCTTGCCTTGAAAGAAGGCTGTGAATAGAAATAATGGTCACGAAGTGGTAGAAAAAAAAAATCTTGTGGAATAGAAGGAAAGCTTGTGTATGTATGATTCTCAAAGTTATGTTATTGACTTCTTATTTCACATGAACATTTGGACTTGGATTTAATATGTTCTCCTTATTGTGTCCATGCAGGGCTTCAGCTGAAGAAGCCATCCAAAAGATGCAAGGAAAAATAATTGGTCAACAAGTGGTTCGTACTTCTTGGGGTAGAAATCCGGCTGCCAAGCAGGTATTTGACCTTCCAATTGCAGTTCAGAACAAATATTTTCCTAGATTTAGTTAAATCCCTTTTCCTCTGTGGATGTGATGTGCAATTATATGGTTTTTAAATTAATGCTTTTTCTTGGATTCAAATTACAAAAACCTGATTGCTCGATGAGTTTTTGCATAGAACATTAGGTGTATTGTCTCTTATTTAGCCTGTGAATTTGAAATTTCATCTATAGGGAGAAATGGTTCAGACTTCATCCTTGCAAATAATTTATAATTTATGGTTAAAAATACCATTTTGGTCCTATGTGCTGAATTTTGTTTTATTTTAGTCCCTATACTTTTCAATGTCCTGTTTAAATTTTTGTACTTTCAAGAAATCTTAGATTTAGTTCCTACTACTTGCTTATTTATTGTTGACTTTTTCAAAAACTTTTTATTTATTTATTAGCATTTTTATTTTAAATTTTTAAAACATATTCATATATTATATTTTCATGCTTGAAAATTATTGTTATTATTTAGCCAATTCCGATAAGAATTAACTTTGAGAGACTAAGTTTACGATTTATTAAAAGTACAGGGATTAAAATTGGACATTTATAGGTACATGGACTAAAATTGAACTTACATCACAGTACTGGGACCAAAAATGGTATTTTAACCGTAATTTTCTTTTGCTTTTTTGCTTTATTTTTTTATTTTTGTATTTTTTTAAAATATATTATTATTTTTTTTTTCTTTTTGTTTTTGTTGTTGGTGGTGGTGGTGGTGTTGGTGAACGTTAAAACACCCTAGTACAAAATAGGATCTAAGGAAAAGGTCTGGGAATTGGGAATGAGGCTTACTTCAGAAAAGTATAACAGACAAAATCATCTACAAACCTATAGCAGACAATCATTTACCTAAATTATAATACATTGTTTTATATTTTCTAGCTTGGCTAAATGGGTATTTTGGTACAGAAAAAAGGACTCCTCGGTAAAATTGGTTATGCGTGCTATCTTTCTGTTAGCAAGCTTTCATAAGTCGCAATAACTTGTGTAATGCTCACAGTGAATTACACTGATGAAATGGTGATGAAATCTTAGGACTTGGCTACTTGGGGTCAGCAAGTTGATCCAAACCAGTGGAGTGCGTACTATGGGTATGGAGGGACTTATGATGCTTATGGATATGGAGTTGTACAAGATCCATCTTTATATGCTTATGGTGCATATTCGGGTTATGCCTCGTATCCTCAACAGGTAAGTTTTATTCTCCTTATAAGGCAAGAGCTCCATACTCTTCTGTGGTAGATAGTGGGCAAAGTGATTGTTATAATCTATTTGTTTGCAGCCATTTTTCAATTGGAGTCTATTGGGTGAATTCTAATTGCCCTTTCAGGATGCTAAGGCTTAGAATCAGAATAAAATGAGTGGGAATCAAGAGCAATATATTTATAGAAAAAGGTTGATCTATCCAGAAAATTTGTTGCCTTTTTGATCTTGGAACTATTTATTTGAACACTTCTCTTTTAGGTTGATGGTGTACAAGATTTGGCCGCTGTAGCTGGTGCAGTCCCTTCTGTGGAACAGGGAGAGGAATGGAATGATACGCTTGATACGCCAGATGTTGATTAGTAAGTATCTCTCTCTTTTGTCCCTAAGGAGCTACAATACAAGATTGGAAATGTACAAATCTATTATTATTTGATCTTGCATTGACATCAAGTTGCAAATTTTGCAGTTTAAATGATGCATACCTCTCAAAACATGAAAGTGCGATTTTAGGCTGGCCATTATGGTTGACTACCTCATCACTGGTTAGGCAAACATGAGCTCGGTGTCTGCACCTCGGTCGTTGTTTGTAGTCAATAGGTTGTCAGTGCATGAAGGCAGCTGGAGGAAAATTATGGTATGTGCTTTATATGCTTGACCATACTCTTAGTCGATGATATCGACACAATACTGTTGCCGCGTACTACAGCGTGTTTGATTTAATCATTCATTTTTTACTATAGGAACGGGATTGCCAGGGTTAGGTGCTTTGATGCTGTATGTATTATGATGCTCTCTGCCATTTGGAAATGGAGTTTTCATATCAGGAGAACATCTGCTTTAATCTGTACCTTGTTTCATATTCATAGTTGTGTTTTAAAACAGGGAGATTGAGCTGATAATTGTTTGTCAGTTGAGATTTAGCAAAAAGAAAATGAAGCTGAAATGAAACGAAGTTTTTAATGAATGGAGTGATGAAGACGGTTGTTCCTAAAACAAATTTCTCAATTTTCCCCTCTTCATGAGTGAAATATAAATTTTAGTAACTGCATCAATCTTGTATCCA

mRNA sequence

ATGTCGAAGGAAAAAGCCATTATCACTCTCTTGCAAGGCTGCAACAACCTTAACAAGCTTCGCAAAATCCACGCACATGTTATTGTAAGCGGCCTCCGCGATCATGTCGCCATTGGCAACAAGCTTTTGAACTTCTGTGCCATCTCTGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGCAGTGCCCACAAACCGAAGCCTGGAACTCCATCATCAGAGGTTTTGCCCAGAGCTCATCTCCCATTGAGGCTATTGTTTTCTACAATCGAATGGTTTCGGCCTCTTTCTCTTCCCCTGACACTTTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGAATCAAGGCTGAGCGTAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGTTGCGGTTATGATGGGGATGTGATTGTCTGCACCAATCTTGTCAAATGCTATTCGGTGATGGGGTCCGTTTGTAGTGCCCAACAGGTGTTTGACGAAATGCCTGCAAGAGACTTGGTGGCTTGGAATGCTATGATTTCCTGCTTTTCTCAACAGGGTTTGCACCTGGAGTCACTGGAGACATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGTTTTACACTCGTTGGGTTGATTTCGTCTTGTGCCCATCTTGGAGCTTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTGCAGAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGCAGTTTAGATCAGGCCATTCTTATCTTTGATAGAATGCAGAAGAAGGACATTTTCACTTGGAACTCGATGATTGTTGGGTATGGAGTTCATGGTCGAGGTAGTGAAGCTATATATTGCTTTCAACAGATGTTAGAAGCAAGAATGCAACCGAACTCTATCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTGTTAAATACTTCCATTTGATGAGCTCTGAGTTTAGGCTAAGACCTGAGGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAGAAGGCACTTGAAATTGTATCAAATTCATCACAGAATGATCCAGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAGATTCACAAAAATGTGACAATAGGAGAAATTGCCATGAACAGTCTGTGTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGTTGGCTACAATCTATGCGGGAAAAAATGATACAGCTGGTGTTGCAAGAATGAGAAAAATGATAAAGAGGCAAGGGATAAAGACTACCCCAGGTTGGAGTTGGATTGAAATTGAGGATCAAGTTCATAAATTTGTGGTTGATGACAAGTCCCACCGTTATTCCATTGAAGTTTATGAGAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATGTAGGAGATGAGTCTATTTCATCACTGGATGTGCTTTCTACCACAGAGACCTTAAAGACTTCATGTACATATCATAGTGAGAAACTTGCAATTGCATTTGGATTGGCAAGAACTTCAGATGGGACACAGATACGCATTGTTAAAAACCTTAGAGTTTGTAGAGATTGTCATTCATTCATAAAAGCTGTCTCGGCGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTAGATTCCACCATTTCAATGGTGGTCAATGTTCCTGCAATGACTACTGGCTCCATAAGGCCGCGTTGCAAGCACAAACTAAGCTCAAGAAGGATAGTTTACTAGTTGAAACAAAACTTCCGAGCTTCCACGGTCAACCAACTCACCATCAACCAACTACGGTAGAGGAAGTTAGAACACTTTGGATAGGGGATTTGCAGTATTGGGTCGATGAGTCTTACCTTAATTCTTGCTTTGCTCACACTGGCGAGGTAATATCAATTAAAATAATTCGCAACAAGATCACTGGCCAGCCTGAGGGTTATGGGTTTGTGGAGTTTGTATCTCATGCCGCAGCAGAAAGAATTTTGCAGACATACAATGGGACCCAGATGCCTGGAACGGAGCAAACTTTCAGATTGAATTGGGCCTCCTTTGGAATTGGAGAAAGGCGCCCGGACGCTGGCCCTGAGCACTCTATTTTTGTGGGGGATTTGGCTCCTGATGTTACAGACTATCTGTTGCAAGAGACCTTTAGAGTGCAATATCCATCTGTTAGGGGTGCTAAAGTTGTGACTGATCCAAACACTGGACGTTCAAAGGGATATGGGTTTGTTAAATTTGCTGATGAAAATGAAAGGAATCGAGCTATGTCAGAAATGAATGGTATTTATTGCTCAACTAGGCCTATGCGTATTAGTGCAGCAACACCCAAAAAGACCATTGGTGTTCAGCAGCAATATAGTCTAGGTAAAGCAATGTACCCAGTTCCAGCCTACACTACATCCGTGCCTGTGCTTCCAGCAGATTATGATGCAAATAACACAACAATCTTTGTCGGTAACTTGGATCCTAATATTACAGAGGAGGAGTTGAAGCAAACTTTTTTGCAGTTTGGTGAGATTGCTTATGTGAAAATTCCTTCCGGGAAAGGCTGTGGTTTTGTACAGTTTGGGACAAGGGCTTCAGCTGAAGAAGCCATCCAAAAGATGCAAGGAAAAATAATTGGTCAACAAGTGGTTCGTACTTCTTGGGGTAGAAATCCGGCTGCCAAGCAGGACTTGGCTACTTGGGGTCAGCAAGTTGATCCAAACCAGTGGAGTGCGTACTATGGGTATGGAGGGACTTATGATGCTTATGGATATGGAGTTGTACAAGATCCATCTTTATATGCTTATGGTGCATATTCGGGTTATGCCTCGTATCCTCAACAGGTTGATGGTGTACAAGATTTGGCCGCTGTAGCTGGTGCAGTCCCTTCTGTGGAACAGGGAGAGGAATGGAATGATACGCTTGATACGCCAGATGTTGATTATTTAAATGATGCATACCTCTCAAAACATGAAAGTGCGATTTTAGGCTGGCCATTATGGTTGACTACCTCATCACTGGTTAGGCAAACATGAGCTCGGTGTCTGCACCTCGGTCGTTGTTTGTAGTCAATAGGTTGTCAGTGCATGAAGGCAGCTGGAGGAAAATTATGGAACGGGATTGCCAGGGTTAGGTGCTTTGATGCTGTATGTATTATGATGCTCTCTGCCATTTGGAAATGGAGTTTTCATATCAGGAGAACATCTGCTTTAATCTGTACCTTGTTTCATATTCATAGTTGTGTTTTAAAACAGGGAGATTGAGCTGATAATTGTTTGTCAGTTGAGATTTAGCAAAAAGAAAATGAAGCTGAAATGAAACGAAGTTTTTAATGAATGGAGTGATGAAGACGGTTGTTCCTAAAACAAATTTCTCAATTTTCCCCTCTTCATGAGTGAAATATAAATTTTAGTAACTGCATCAATCTTGTATCCA

Coding sequence (CDS)

ATGTCGAAGGAAAAAGCCATTATCACTCTCTTGCAAGGCTGCAACAACCTTAACAAGCTTCGCAAAATCCACGCACATGTTATTGTAAGCGGCCTCCGCGATCATGTCGCCATTGGCAACAAGCTTTTGAACTTCTGTGCCATCTCTGTTTCAGGTTCCCTTGCTTATGCCCAGCTTCTCTTCCATCAAATGCAGTGCCCACAAACCGAAGCCTGGAACTCCATCATCAGAGGTTTTGCCCAGAGCTCATCTCCCATTGAGGCTATTGTTTTCTACAATCGAATGGTTTCGGCCTCTTTCTCTTCCCCTGACACTTTCACTTTCTCATTTGTGCTCAAAGCCTGTGAAAGAATCAAGGCTGAGCGTAAGTGTAAAGAAGTTCATGGCTCTGTAATCCGTTGCGGTTATGATGGGGATGTGATTGTCTGCACCAATCTTGTCAAATGCTATTCGGTGATGGGGTCCGTTTGTAGTGCCCAACAGGTGTTTGACGAAATGCCTGCAAGAGACTTGGTGGCTTGGAATGCTATGATTTCCTGCTTTTCTCAACAGGGTTTGCACCTGGAGTCACTGGAGACATACAATCAGATGAGAAGTGAAAATGTGGATGTAGATGGTTTTACACTCGTTGGGTTGATTTCGTCTTGTGCCCATCTTGGAGCTTTGAATATTGGGGTTCAGATGCATAGATTTGCTCGTGAAAAGGGTCTTGTGCAGAGTCTTTATGTTGGAAATGCGTTGATAGATATGTATGCTAAATGTGGCAGTTTAGATCAGGCCATTCTTATCTTTGATAGAATGCAGAAGAAGGACATTTTCACTTGGAACTCGATGATTGTTGGGTATGGAGTTCATGGTCGAGGTAGTGAAGCTATATATTGCTTTCAACAGATGTTAGAAGCAAGAATGCAACCGAACTCTATCACATTTTTGGGTTTACTTTGTGGATGTAGTCATCAAGGCTTGGTTCAAGAAGGTGTTAAATACTTCCATTTGATGAGCTCTGAGTTTAGGCTAAGACCTGAGGTTAAACACTATGGATGCCTTGTGGATTTATATGGTCGAGCTGGGAAGCTTGAGAAGGCACTTGAAATTGTATCAAATTCATCACAGAATGATCCAGTTTTGTGGCGAATCTTACTTGGCTCTTGCAAGATTCACAAAAATGTGACAATAGGAGAAATTGCCATGAACAGTCTGTGTGAGCTTGGAGCTACAAATGCAGGGGATTGTATATTGTTGGCTACAATCTATGCGGGAAAAAATGATACAGCTGGTGTTGCAAGAATGAGAAAAATGATAAAGAGGCAAGGGATAAAGACTACCCCAGGTTGGAGTTGGATTGAAATTGAGGATCAAGTTCATAAATTTGTGGTTGATGACAAGTCCCACCGTTATTCCATTGAAGTTTATGAGAAGTTGAGGGAAGTTATTCATCAAGCTTCCTTGTTTGGATATGTAGGAGATGAGTCTATTTCATCACTGGATGTGCTTTCTACCACAGAGACCTTAAAGACTTCATGTACATATCATAGTGAGAAACTTGCAATTGCATTTGGATTGGCAAGAACTTCAGATGGGACACAGATACGCATTGTTAAAAACCTTAGAGTTTGTAGAGATTGTCATTCATTCATAAAAGCTGTCTCGGCGGCATTCAACCGAGAAATAATTGTTAGAGATCGGGTTAGATTCCACCATTTCAATGGTGGTCAATGTTCCTGCAATGACTACTGGCTCCATAAGGCCGCGTTGCAAGCACAAACTAAGCTCAAGAAGGATAGTTTACTAGTTGAAACAAAACTTCCGAGCTTCCACGGTCAACCAACTCACCATCAACCAACTACGGTAGAGGAAGTTAGAACACTTTGGATAGGGGATTTGCAGTATTGGGTCGATGAGTCTTACCTTAATTCTTGCTTTGCTCACACTGGCGAGGTAATATCAATTAAAATAATTCGCAACAAGATCACTGGCCAGCCTGAGGGTTATGGGTTTGTGGAGTTTGTATCTCATGCCGCAGCAGAAAGAATTTTGCAGACATACAATGGGACCCAGATGCCTGGAACGGAGCAAACTTTCAGATTGAATTGGGCCTCCTTTGGAATTGGAGAAAGGCGCCCGGACGCTGGCCCTGAGCACTCTATTTTTGTGGGGGATTTGGCTCCTGATGTTACAGACTATCTGTTGCAAGAGACCTTTAGAGTGCAATATCCATCTGTTAGGGGTGCTAAAGTTGTGACTGATCCAAACACTGGACGTTCAAAGGGATATGGGTTTGTTAAATTTGCTGATGAAAATGAAAGGAATCGAGCTATGTCAGAAATGAATGGTATTTATTGCTCAACTAGGCCTATGCGTATTAGTGCAGCAACACCCAAAAAGACCATTGGTGTTCAGCAGCAATATAGTCTAGGTAAAGCAATGTACCCAGTTCCAGCCTACACTACATCCGTGCCTGTGCTTCCAGCAGATTATGATGCAAATAACACAACAATCTTTGTCGGTAACTTGGATCCTAATATTACAGAGGAGGAGTTGAAGCAAACTTTTTTGCAGTTTGGTGAGATTGCTTATGTGAAAATTCCTTCCGGGAAAGGCTGTGGTTTTGTACAGTTTGGGACAAGGGCTTCAGCTGAAGAAGCCATCCAAAAGATGCAAGGAAAAATAATTGGTCAACAAGTGGTTCGTACTTCTTGGGGTAGAAATCCGGCTGCCAAGCAGGACTTGGCTACTTGGGGTCAGCAAGTTGATCCAAACCAGTGGAGTGCGTACTATGGGTATGGAGGGACTTATGATGCTTATGGATATGGAGTTGTACAAGATCCATCTTTATATGCTTATGGTGCATATTCGGGTTATGCCTCGTATCCTCAACAGGTTGATGGTGTACAAGATTTGGCCGCTGTAGCTGGTGCAGTCCCTTCTGTGGAACAGGGAGAGGAATGGAATGATACGCTTGATACGCCAGATGTTGATTATTTAAATGATGCATACCTCTCAAAACATGAAAGTGCGATTTTAGGCTGGCCATTATGGTTGACTACCTCATCACTGGTTAGGCAAACATGA

Protein sequence

MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLLFHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSLYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCRDCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYWLHKAALQAQTKLKKDSLLVETKLPSFHGQPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFVEFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTDYLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMRISAATPKKTIGVQQQYSLGKAMYPVPAYTTSVPVLPADYDANNTTIFVGNLDPNITEEELKQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGRNPAAKQDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVDGVQDLAAVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVRQT
Homology
BLAST of CaUC01G004240 vs. NCBI nr
Match: XP_038890323.1 (pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890324.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_038890325.1 pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida])

HSP 1 Score: 1110.9 bits (2872), Expect = 0.0e+00
Identity = 539/579 (93.09%), Postives = 561/579 (96.89%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCN+LN+LRKIHAHVIVSGLR HVAIGNKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLNRLRKIHAHVIVSGLRHHVAIGNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQM+CPQTEAWNSIIRGFAQSSSPI+AI+FYN+MV ASFSSPDTFTFSFVLKACERIKA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIIFYNQMVWASFSSPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKC EVHGSVIRCGYDGDVIVCTNLVKCYS MGS+C AQQVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCNEVHGSVIRCGYDGDVIVCTNLVKCYSAMGSICIAQQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+TYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLD+AI IFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQ+MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDEAIFIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQKMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           ARMQPNS+TFLGLLCGCSHQGLVQEGVKYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIV NSSQNDPVLWRILLGSCKIHKN+ IGEIAM SL ELGATNAGDCILLATIYAG
Sbjct: 361 KALEIVLNSSQNDPVLWRILLGSCKIHKNMKIGEIAMKSLSELGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           + DT GVARMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKSHRYSIEVYEKLREVIHQ
Sbjct: 421 EKDTVGVARMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           ASLFGYVGD S+SSLDVLSTTETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASLFGYVGDASVSSLDVLSTTETLKTSCTYHSEKLAIAFGLARTTDGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFIKAVSEAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of CaUC01G004240 vs. NCBI nr
Match: XP_004152881.1 (pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus] >XP_011648994.1 pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus] >XP_031737318.1 pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativus])

HSP 1 Score: 1079.3 bits (2790), Expect = 0.0e+00
Identity = 524/579 (90.50%), Postives = 549/579 (94.82%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+ LLQGCN+L +LRKIHAHVIVSGL  HV I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQM+CPQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS PDTFTFSFVLKACERIKA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHGSVIRCGYD DVIVCTNLVKCYS MGSVC A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL QS
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPN +TFLGLLCGCSHQGLVQEGVKYF+LMSS+FRL+PEVKHYGCLVDLYGRAGKL+
Sbjct: 301 ARIQPNPVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLKPEVKHYGCLVDLYGRAGKLD 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWRILLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRILLGSCKIHKNVTIGEIAMNRLSELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           + D AGVARMRKMIK QG KTTPGWSWIEI +QVHKFVVDDKSHRYS+EVYEKLREVIHQ
Sbjct: 421 EKDKAGVARMRKMIKSQGKKTTPGWSWIEIGEQVHKFVVDDKSHRYSVEVYEKLREVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           AS FGYVGDESISSLD+LST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESISSLDMLSTMETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGECSCNDYW 579

BLAST of CaUC01G004240 vs. NCBI nr
Match: XP_016899519.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899520.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899521.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899522.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >XP_016899523.1 PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] >KAA0047714.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK08368.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1071.2 bits (2769), Expect = 5.5e-309
Identity = 525/579 (90.67%), Postives = 550/579 (94.99%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCN+L +LRKIHAHVIVSGL  HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQ + PQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS  DTFTFSFVLKACERIKA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVIVCTNLVKCYS MGSV  A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL QS
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEGVKYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWR LLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           +ND AGV+RMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKS+RYSIEVYEKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           AS FGYVGDES+SSLDVLST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of CaUC01G004240 vs. NCBI nr
Match: XP_022149932.1 (pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149933.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149934.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_022149935.1 pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia])

HSP 1 Score: 1048.9 bits (2711), Expect = 2.9e-302
Identity = 501/579 (86.53%), Postives = 545/579 (94.13%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+TLLQGCN+LNKLRKIHAHVI+SGLR H AIGNKLLNFCAISVSGSL YA+LL
Sbjct: 1   MSNEKAILTLLQGCNSLNKLRKIHAHVILSGLRHHAAIGNKLLNFCAISVSGSLPYARLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           F  M CPQTEAWNSIIRGFAQS+SPIEA+V+YN+MV AS S PDTFTFSFVLKACER+KA
Sbjct: 61  FRHMDCPQTEAWNSIIRGFAQSASPIEAVVYYNQMVWASLSPPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKC+EVHGSVIR GYDGDVI+CTNL+KCY+ MG +C AQQVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCREVHGSVIRWGYDGDVIICTNLMKCYAAMGFICVAQQVFDKMPTRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           +SQQGLH E+LETYNQMRS NVDVDGFTLVGL+SSCAHLGALNIGVQMHRFAREKGLV+S
Sbjct: 181 YSQQGLHQEALETYNQMRSGNVDVDGFTLVGLLSSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KD+FTWNSMIVGYGVHGRG+EAI+CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDVFTWNSMIVGYGVHGRGTEAIFCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QP+S+TFLGLLCGCSHQGLVQEGVK+F+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPSSVTFLGLLCGCSHQGLVQEGVKFFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEI+ NSSQNDPVLWRILLGSCKIHKNV IGEIAMN+L +LGATNAGDCILLATIYAG
Sbjct: 361 KALEIILNSSQNDPVLWRILLGSCKIHKNVGIGEIAMNNLSQLGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
             +T+GV RMRKMI+ QGIKTTPGWSWIEI +QVHKFVVDDKSHRY IEVYEKL+EVIHQ
Sbjct: 421 VKNTSGVVRMRKMIRSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYYIEVYEKLKEVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           ASLFGY+GD   S+ DV ST+E L+TSC+YHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASLFGYIGDGYFSTTDVFSTSEILETSCSYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AFNREIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFVKAVSLAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of CaUC01G004240 vs. NCBI nr
Match: XP_023537237.1 (pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1047.7 bits (2708), Expect = 6.4e-302
Identity = 508/579 (87.74%), Postives = 540/579 (93.26%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAIITLLQGCN+LNKLRKIHAHV+VSGLR HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIITLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQM+C QTEAWNSIIRGFAQSSSPI+A+V+YN+MV ASFSSPDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKE+HG++IRCGYDGDVI+CTNLVKCY+ MGSVC AQQVFDEMP RDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAQQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+ YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLV+S
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRM +KDIFTWNSMIVGYGVHGRG+EAI+CF++MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAIFIFDRMHRKDIFTWNSMIVGYGVHGRGTEAIFCFERMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           ARMQPNSITFLGLLCGCSHQGLVQEGVKYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALE + NSS NDPVLWRILLGSCKIHKNV +GEIAMN+L ELGATNAGDCILLATIYAG
Sbjct: 361 KALETIQNSSPNDPVLWRILLGSCKIHKNVGVGEIAMNNLSELGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
            NDTAGVA MRK IK QGIKT+PGWSWIEI +QVHKFVVDDKSHR SIEVYEKLREV+HQ
Sbjct: 421 VNDTAGVASMRKTIKSQGIKTSPGWSWIEIGEQVHKFVVDDKSHRDSIEVYEKLREVLHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           ASLFGYV D            ETLKTS TYHSEKLAIAFGLART+DGT IRIVKNLRVCR
Sbjct: 481 ASLFGYVRD-----------AETLKTSSTYHSEKLAIAFGLARTADGTPIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AF+REIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFMKAVSVAFDREIIVRDRVRFHHFKGGQCSCNDYW 568

BLAST of CaUC01G004240 vs. ExPASy Swiss-Prot
Match: Q9LXY5 (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 716.8 bits (1849), Expect = 3.4e-205
Identity = 348/579 (60.10%), Postives = 441/579 (76.17%), Query Frame = 0

Query: 3   KEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLLF- 62
           K + I+ +LQGCN++ KLRKIH+HVI++GL+ H +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  HQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAE 122
           H    P T  WN +IRGF+ SSSP+ +I+FYNRM+ +S S PD FTF+F LK+CERIK+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCF 182
            KC E+HGSVIR G+  D IV T+LV+CYS  GSV  A +VFDEMP RDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSL 242
           S  GLH ++L  Y +M +E V  D +TLV L+SSCAH+ ALN+GV +HR A +      +
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCV 243

Query: 243 YVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEA 302
           +V NALIDMYAKCGSL+ AI +F+ M+K+D+ TWNSMI+GYGVHG G EAI  F++M+ +
Sbjct: 244 FVSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVAS 303

Query: 303 RMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 362
            ++PN+ITFLGLL GCSHQGLV+EGV++F +MSS+F L P VKHYGC+VDLYGRAG+LE 
Sbjct: 304 GVRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLEN 363

Query: 363 ALEIV-SNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 422
           +LE++ ++S   DPVLWR LLGSCKIH+N+ +GE+AM  L +L A NAGD +L+ +IY+ 
Sbjct: 364 SLEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSA 423

Query: 423 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 482
            ND    A MRK+I+   ++T PGWSWIEI DQVHKFVVDDK H  S  +Y +L EVI++
Sbjct: 424 ANDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINR 483

Query: 483 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 542
           A L GY  ++S  +   LS    L ++ T HSEKLAIA+GL RT+ GT +RI KNLRVCR
Sbjct: 484 AILAGYKPEDSNRTAPTLS-DRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCR 543

Query: 543 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF K VS AFNREIIVRDRVRFHHF  G CSCNDYW
Sbjct: 544 DCHSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of CaUC01G004240 vs. ExPASy Swiss-Prot
Match: Q8VXZ9 (Polyadenylate-binding protein RBP47B' OS=Arabidopsis thaliana OX=3702 GN=RBP47B' PE=2 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 5.4e-163
Identity = 292/421 (69.36%), Postives = 332/421 (78.86%), Query Frame = 0

Query: 608  QPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFV 667
            Q ++H P T+EEVRTLWIGDLQYWVDE+YL SCF+ TGE++S+K+IRNKITGQPEGYGF+
Sbjct: 11   QGSYHHPQTLEEVRTLWIGDLQYWVDENYLTSCFSQTGELVSVKVIRNKITGQPEGYGFI 70

Query: 668  EFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTD 727
            EF+SHAAAER LQTYNGTQMPGTE TFRLNWASFG G+ + DAGP+HSIFVGDLAPDVTD
Sbjct: 71   EFISHAAAERTLQTYNGTQMPGTELTFRLNWASFGSGQ-KVDAGPDHSIFVGDLAPDVTD 130

Query: 728  YLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMR 787
            YLLQETFRV Y SVRGAKVVTDP+TGRSKGYGFVKFA+E+ERNRAM+EMNG+YCSTRPMR
Sbjct: 131  YLLQETFRVHYSSVRGAKVVTDPSTGRSKGYGFVKFAEESERNRAMAEMNGLYCSTRPMR 190

Query: 788  ISAATPKKTIGVQQQYSLGKAMYPVP-----AYTTSVPVLPADYDANNTTIFVGNLDPNI 847
            ISAATPKK +GVQQQY + KA+YPV      A      V P + D   TTI V NLD N+
Sbjct: 191  ISAATPKKNVGVQQQY-VTKAVYPVTVPSAVAAPVQAYVAPPESDVTCTTISVANLDQNV 250

Query: 848  TEEELKQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGR 907
            TEEELK+ F Q GE+ YVKIP+ KG G+VQF TR SAEEA+Q+MQG++IGQQ VR SW +
Sbjct: 251  TEEELKKAFSQLGEVIYVKIPATKGYGYVQFKTRPSAEEAVQRMQGQVIGQQAVRISWSK 310

Query: 908  NPAAKQDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVD 967
            NP   QD   W  Q DPNQW+ YYGYG  YDAY YG  QDPS+YAYG Y GY  YPQQ +
Sbjct: 311  NPG--QD--GWVTQADPNQWNGYYGYGQGYDAYAYGATQDPSVYAYGGY-GYPQYPQQGE 370

Query: 968  GVQDLA-AVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVR 1023
            G QD++ + AG V   EQ  E  D L TPDVD LN AYLS H SAILG P+W  TSSL  
Sbjct: 371  GTQDISNSAAGGVAGAEQ--ELYDPLATPDVDKLNAAYLSVHASAILGRPMWQRTSSLTS 422

BLAST of CaUC01G004240 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 456.4 bits (1173), Expect = 8.2e-127
Identity = 238/579 (41.11%), Postives = 360/579 (62.18%), Query Frame = 0

Query: 8   ITLLQ--GCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSG--SLAYAQLLFHQ 67
           I LLQ  G +++ KLR+IHA  I  G+    A   K L F  +S+     ++YA  +F +
Sbjct: 19  INLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSK 78

Query: 68  MQCP-QTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAER 127
           ++ P     WN++IRG+A+  + I A   Y  M  +    PDT T+ F++KA   +   R
Sbjct: 79  IEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVR 138

Query: 128 KCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFS 187
             + +H  VIR G+   + V  +L+  Y+  G V SA +VFD+MP +DLVAWN++I+ F+
Sbjct: 139 LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 198

Query: 188 QQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSLY 247
           + G   E+L  Y +M S+ +  DGFT+V L+S+CA +GAL +G ++H +  + GL ++L+
Sbjct: 199 ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 258

Query: 248 VGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEAR 307
             N L+D+YA+CG +++A  +FD M  K+  +W S+IVG  V+G G EAI  F+ M    
Sbjct: 259 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 318

Query: 308 -MQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 367
            + P  ITF+G+L  CSH G+V+EG +YF  M  E+++ P ++H+GC+VDL  RAG+++K
Sbjct: 319 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 378

Query: 368 ALE-IVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 427
           A E I S   Q + V+WR LLG+C +H +  + E A   + +L   ++GD +LL+ +YA 
Sbjct: 379 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 438

Query: 428 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 487
           +   + V ++RK + R G+K  PG S +E+ ++VH+F++ DKSH  S  +Y KL+E+  +
Sbjct: 439 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 498

Query: 488 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 547
               GYV    IS++ V    E  + +  YHSEK+AIAF L  T + + I +VKNLRVC 
Sbjct: 499 LRSEGYV--PQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 558

Query: 548 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCH  IK VS  +NREI+VRDR RFHHF  G CSC DYW
Sbjct: 559 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CaUC01G004240 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 428.3 bits (1100), Expect = 2.4e-118
Identity = 224/574 (39.02%), Postives = 346/574 (60.28%), Query Frame = 0

Query: 8   ITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAIS-VSGSLAYAQLLFHQMQC 67
           I L+  CN+L +L +I A+ I S + D V+   KL+NFC  S    S++YA+ LF  M  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHIED-VSFVAKLINFCTESPTESSMSYARHLFEAMSE 92

Query: 68  PQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCKE 127
           P    +NS+ RG+++ ++P+E    +  ++      PD +TF  +LKAC   KA  + ++
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 128 VHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGL 187
           +H   ++ G D +V VC  L+  Y+    V SA+ VFD +    +V +NAMI+ ++++  
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 188 HLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSLYVGNA 247
             E+L  + +M+ + +  +  TL+ ++SSCA LG+L++G  +H++A++    + + V  A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 248 LIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARMQPN 307
           LIDM+AKCGSLD A+ IF++M+ KD   W++MIV Y  HG+  +++  F++M    +QP+
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 308 SITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIV 367
            ITFLGLL  CSH G V+EG KYF  M S+F + P +KHYG +VDL  RAG LE A E +
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 368 SN-SSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTA 427
                   P+LWRILL +C  H N+ + E     + EL  ++ GD ++L+ +YA      
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 428 GVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFG 487
            V  +RK++K +     PG S IE+ + VH+F   D     + +++  L E++ +  L G
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSG 512

Query: 488 YVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCRDCHSF 547
           YV D S+     ++  E  + +  YHSEKLAI FGL  T  GT IR+VKNLRVCRDCH+ 
Sbjct: 513 YVPDTSMVVHANMNDQEK-EITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

Query: 548 IKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
            K +S  F R++++RD  RFHHF  G+CSC D+W
Sbjct: 573 AKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CaUC01G004240 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 2.7e-117
Identity = 230/630 (36.51%), Postives = 364/630 (57.78%), Query Frame = 0

Query: 2   SKEKAIITLLQGCNNLNKLRKIHAHVIVSG-LRDHVAIGNKLLNFCAIS--VSGSLAYAQ 61
           S   ++   +  C  +  L +IHA  I SG +RD +A   ++L FCA S      L YA 
Sbjct: 21  SHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAA-EILRFCATSDLHHRDLDYAH 80

Query: 62  LLFHQMQCPQTEAWNSIIRGFAQS--SSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACE 121
            +F+QM      +WN+IIRGF++S     + AI  +  M+S  F  P+ FTF  VLKAC 
Sbjct: 81  KIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACA 140

Query: 122 RIKAERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSV------------------------ 181
           +    ++ K++HG  ++ G+ GD  V +NLV+ Y +                        
Sbjct: 141 KTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMT 200

Query: 182 ---------------------MGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGLHLESL 241
                                +G   +A+ +FD+M  R +V+WN MIS +S  G   +++
Sbjct: 201 DRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAV 260

Query: 242 ETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSLYVGNALIDMY 301
           E + +M+  ++  +  TLV ++ + + LG+L +G  +H +A + G+     +G+ALIDMY
Sbjct: 261 EVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMY 320

Query: 302 AKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARMQPNSITFL 361
           +KCG +++AI +F+R+ ++++ TW++MI G+ +HG+  +AI CF +M +A ++P+ + ++
Sbjct: 321 SKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYI 380

Query: 362 GLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIVSNSS- 421
            LL  CSH GLV+EG +YF  M S   L P ++HYGC+VDL GR+G L++A E + N   
Sbjct: 381 NLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPI 440

Query: 422 QNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTAGVARM 481
           + D V+W+ LLG+C++  NV +G+   N L ++   ++G  + L+ +YA + + + V+ M
Sbjct: 441 KPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEM 500

Query: 482 RKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFGYVGDE 541
           R  +K + I+  PG S I+I+  +H+FVV+D SH  + E+   L E+  +  L GY    
Sbjct: 501 RLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGY---R 560

Query: 542 SISSLDVLSTTETLKTSCT-YHSEKLAIAFGLARTSDGTQIRIVKNLRVCRDCHSFIKAV 580
            I++  +L+  E  K +   YHSEK+A AFGL  TS G  IRIVKNLR+C DCHS IK +
Sbjct: 561 PITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLI 620

BLAST of CaUC01G004240 vs. ExPASy TrEMBL
Match: A0A0A0LH20 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G074120 PE=3 SV=1)

HSP 1 Score: 1079.7 bits (2791), Expect = 0.0e+00
Identity = 524/580 (90.34%), Postives = 550/580 (94.83%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+ LLQGCN+L +LRKIHAHVIVSGL  HV I NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSNEKAILALLQGCNSLKRLRKIHAHVIVSGLHHHVPIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQM+CPQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS PDTFTFSFVLKACERIKA
Sbjct: 61  FHQMECPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVCDSFSIPDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHGSVIRCGYD DVIVCTNLVKCYS MGSVC A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGSVIRCGYDADVIVCTNLVKCYSAMGSVCIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL QS
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPN +TFLGLLCGCSHQGLVQEGVKYF+LMSS+FRL+PEVKHYGCLVDLYGRAGKL+
Sbjct: 301 ARIQPNPVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLKPEVKHYGCLVDLYGRAGKLD 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWRILLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRILLGSCKIHKNVTIGEIAMNRLSELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           + D AGVARMRKMIK QG KTTPGWSWIEI +QVHKFVVDDKSHRYS+EVYEKLREVIHQ
Sbjct: 421 EKDKAGVARMRKMIKSQGKKTTPGWSWIEIGEQVHKFVVDDKSHRYSVEVYEKLREVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           AS FGYVGDESISSLD+LST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESISSLDMLSTMETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYWL 581
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW+
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGECSCNDYWV 580

BLAST of CaUC01G004240 vs. ExPASy TrEMBL
Match: A0A1S4DU66 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN=LOC103485901 PE=3 SV=1)

HSP 1 Score: 1071.2 bits (2769), Expect = 2.7e-309
Identity = 525/579 (90.67%), Postives = 550/579 (94.99%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCN+L +LRKIHAHVIVSGL  HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQ + PQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS  DTFTFSFVLKACERIKA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVIVCTNLVKCYS MGSV  A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL QS
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEGVKYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWR LLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           +ND AGV+RMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKS+RYSIEVYEKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           AS FGYVGDES+SSLDVLST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of CaUC01G004240 vs. ExPASy TrEMBL
Match: A0A5A7TXJ9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold648G001800 PE=3 SV=1)

HSP 1 Score: 1071.2 bits (2769), Expect = 2.7e-309
Identity = 525/579 (90.67%), Postives = 550/579 (94.99%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCN+L +LRKIHAHVIVSGL  HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAILTLLQGCNSLKRLRKIHAHVIVSGLHHHVAIANKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQ + PQTEAWNSIIRGFAQSSSPI+AIVFYN+MV  SFS  DTFTFSFVLKACERIKA
Sbjct: 61  FHQTEFPQTEAWNSIIRGFAQSSSPIDAIVFYNQMVWDSFSMRDTFTFSFVLKACERIKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKEVHG+VIRCGYD DVIVCTNLVKCYS MGSV  A+QVFD+MPARDLVAWNAMISC
Sbjct: 121 ERKCKEVHGTVIRCGYDADVIVCTNLVKCYSAMGSVYIARQVFDKMPARDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+TYNQMRSENVD+DGFTLVGLISSCAHLGALNIGVQMHRFARE GL QS
Sbjct: 181 FSQQGLHQEALQTYNQMRSENVDIDGFTLVGLISSCAHLGALNIGVQMHRFARENGLDQS 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QPNSITFLGLLCGCSHQGLVQEGVKYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPNSITFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEIVSNSS ND VLWR LLGSCKIHKNVTIGEIAMN L ELGAT+AGDCILLATIYAG
Sbjct: 361 KALEIVSNSSHNDSVLWRTLLGSCKIHKNVTIGEIAMNRLFELGATSAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
           +ND AGV+RMRKMIK QGIKTTPGWSWIEI +QVHKFVVDDKS+RYSIEVYEKLREVI+Q
Sbjct: 421 ENDKAGVSRMRKMIKSQGIKTTPGWSWIEIGEQVHKFVVDDKSNRYSIEVYEKLREVIYQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           AS FGYVGDES+SSLDVLST ETLKTSCTYHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASFFGYVGDESVSSLDVLSTIETLKTSCTYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSFIKAVS AFNREIIVRDRVRFHHF GG+CSCNDYW
Sbjct: 541 DCHSFIKAVSVAFNREIIVRDRVRFHHFKGGKCSCNDYW 579

BLAST of CaUC01G004240 vs. ExPASy TrEMBL
Match: A0A6J1D832 (pentatricopeptide repeat-containing protein At3g56550 OS=Momordica charantia OX=3673 GN=LOC111018226 PE=3 SV=1)

HSP 1 Score: 1048.9 bits (2711), Expect = 1.4e-302
Identity = 501/579 (86.53%), Postives = 545/579 (94.13%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MS EKAI+TLLQGCN+LNKLRKIHAHVI+SGLR H AIGNKLLNFCAISVSGSL YA+LL
Sbjct: 1   MSNEKAILTLLQGCNSLNKLRKIHAHVILSGLRHHAAIGNKLLNFCAISVSGSLPYARLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           F  M CPQTEAWNSIIRGFAQS+SPIEA+V+YN+MV AS S PDTFTFSFVLKACER+KA
Sbjct: 61  FRHMDCPQTEAWNSIIRGFAQSASPIEAVVYYNQMVWASLSPPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKC+EVHGSVIR GYDGDVI+CTNL+KCY+ MG +C AQQVFD+MP RDLVAWNAMISC
Sbjct: 121 ERKCREVHGSVIRWGYDGDVIICTNLMKCYAAMGFICVAQQVFDKMPTRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           +SQQGLH E+LETYNQMRS NVDVDGFTLVGL+SSCAHLGALNIGVQMHRFAREKGLV+S
Sbjct: 181 YSQQGLHQEALETYNQMRSGNVDVDGFTLVGLLSSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAILIFDRMQ+KD+FTWNSMIVGYGVHGRG+EAI+CFQQMLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQRKDVFTWNSMIVGYGVHGRGTEAIFCFQQMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           AR+QP+S+TFLGLLCGCSHQGLVQEGVK+F+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARIQPSSVTFLGLLCGCSHQGLVQEGVKFFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALEI+ NSSQNDPVLWRILLGSCKIHKNV IGEIAMN+L +LGATNAGDCILLATIYAG
Sbjct: 361 KALEIILNSSQNDPVLWRILLGSCKIHKNVGIGEIAMNNLSQLGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
             +T+GV RMRKMI+ QGIKTTPGWSWIEI +QVHKFVVDDKSHRY IEVYEKL+EVIHQ
Sbjct: 421 VKNTSGVVRMRKMIRSQGIKTTPGWSWIEIGEQVHKFVVDDKSHRYYIEVYEKLKEVIHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           ASLFGY+GD   S+ DV ST+E L+TSC+YHSEKLAIAFGLART+DGTQIRIVKNLRVCR
Sbjct: 481 ASLFGYIGDGYFSTTDVFSTSEILETSCSYHSEKLAIAFGLARTADGTQIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AFNREIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFVKAVSLAFNREIIVRDRVRFHHFKGGQCSCNDYW 579

BLAST of CaUC01G004240 vs. ExPASy TrEMBL
Match: A0A6J1FBZ0 (pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3662 GN=LOC111444025 PE=3 SV=1)

HSP 1 Score: 1043.1 bits (2696), Expect = 7.6e-301
Identity = 505/579 (87.22%), Postives = 539/579 (93.09%), Query Frame = 0

Query: 1   MSKEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLL 60
           MSKEKAI+TLLQGCN+LNKLRKIHAHV+VSGLR HVAI NKLLNFCAISVSGSLAYAQLL
Sbjct: 1   MSKEKAIVTLLQGCNSLNKLRKIHAHVLVSGLRHHVAINNKLLNFCAISVSGSLAYAQLL 60

Query: 61  FHQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKA 120
           FHQM+C QTEAWNSIIRGFAQSSSPI+A+V+YN+MV ASFSSPDTFTFSFVLKACER+KA
Sbjct: 61  FHQMECLQTEAWNSIIRGFAQSSSPIDAVVYYNQMVCASFSSPDTFTFSFVLKACERLKA 120

Query: 121 ERKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISC 180
           ERKCKE+HG++IRCGYDGDVI+CTNLVKCY+ MGSVC A QVFDEMP RDLVAWNAMISC
Sbjct: 121 ERKCKEIHGTIIRCGYDGDVIICTNLVKCYAAMGSVCIAHQVFDEMPVRDLVAWNAMISC 180

Query: 181 FSQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQS 240
           FSQQGLH E+L+ YNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLV+S
Sbjct: 181 FSQQGLHGEALQVYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVES 240

Query: 241 LYVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLE 300
           LYVGNALIDMYAKCGSLDQAI IFDRM +KDIFTWNSMIVGYGVHGRG+EAI+CF++MLE
Sbjct: 241 LYVGNALIDMYAKCGSLDQAIFIFDRMHRKDIFTWNSMIVGYGVHGRGTEAIFCFERMLE 300

Query: 301 ARMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLE 360
           ARMQPNS+TFLGLLCGCSHQGLVQEGVKYF+LMSS+FRLRPEVKHYGCLVDLYGRAGKLE
Sbjct: 301 ARMQPNSVTFLGLLCGCSHQGLVQEGVKYFNLMSSKFRLRPEVKHYGCLVDLYGRAGKLE 360

Query: 361 KALEIVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 420
           KALE + NSS NDPVLWRILLGSCKIHKNV +GEIAMN+L ELGATNAGDCILLATIYAG
Sbjct: 361 KALETIRNSSPNDPVLWRILLGSCKIHKNVGVGEIAMNNLNELGATNAGDCILLATIYAG 420

Query: 421 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 480
            NDTAGVA MRK IK QGIKT+PGWSWIEI +QVHKFVVDDKSHR SIEVYEKLREV+HQ
Sbjct: 421 VNDTAGVASMRKTIKSQGIKTSPGWSWIEIGEQVHKFVVDDKSHRDSIEVYEKLREVLHQ 480

Query: 481 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 540
           ASLFGYV D            ETLKTS TYHSEKLAIAFGLART+DGT IRIVKNLRVCR
Sbjct: 481 ASLFGYVID-----------AETLKTSSTYHSEKLAIAFGLARTADGTPIRIVKNLRVCR 540

Query: 541 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF+KAVS AF+REIIVRDRVRFHHF GGQCSCNDYW
Sbjct: 541 DCHSFMKAVSVAFDREIIVRDRVRFHHFKGGQCSCNDYW 568

BLAST of CaUC01G004240 vs. TAIR 10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 716.8 bits (1849), Expect = 2.4e-206
Identity = 348/579 (60.10%), Postives = 441/579 (76.17%), Query Frame = 0

Query: 3   KEKAIITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSGSLAYAQLLF- 62
           K + I+ +LQGCN++ KLRKIH+HVI++GL+ H +I N LL FCA+SV+GSL++AQLLF 
Sbjct: 4   KARVIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFD 63

Query: 63  HQMQCPQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAE 122
           H    P T  WN +IRGF+ SSSP+ +I+FYNRM+ +S S PD FTF+F LK+CERIK+ 
Sbjct: 64  HFDSDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSI 123

Query: 123 RKCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCF 182
            KC E+HGSVIR G+  D IV T+LV+CYS  GSV  A +VFDEMP RDLV+WN MI CF
Sbjct: 124 PKCLEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCF 183

Query: 183 SQQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSL 242
           S  GLH ++L  Y +M +E V  D +TLV L+SSCAH+ ALN+GV +HR A +      +
Sbjct: 184 SHVGLHNQALSMYKRMGNEGVCGDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCV 243

Query: 243 YVGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEA 302
           +V NALIDMYAKCGSL+ AI +F+ M+K+D+ TWNSMI+GYGVHG G EAI  F++M+ +
Sbjct: 244 FVSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVAS 303

Query: 303 RMQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 362
            ++PN+ITFLGLL GCSHQGLV+EGV++F +MSS+F L P VKHYGC+VDLYGRAG+LE 
Sbjct: 304 GVRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLEN 363

Query: 363 ALEIV-SNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 422
           +LE++ ++S   DPVLWR LLGSCKIH+N+ +GE+AM  L +L A NAGD +L+ +IY+ 
Sbjct: 364 SLEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSA 423

Query: 423 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 482
            ND    A MRK+I+   ++T PGWSWIEI DQVHKFVVDDK H  S  +Y +L EVI++
Sbjct: 424 ANDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINR 483

Query: 483 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 542
           A L GY  ++S  +   LS    L ++ T HSEKLAIA+GL RT+ GT +RI KNLRVCR
Sbjct: 484 AILAGYKPEDSNRTAPTLS-DRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCR 543

Query: 543 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCHSF K VS AFNREIIVRDRVRFHHF  G CSCNDYW
Sbjct: 544 DCHSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of CaUC01G004240 vs. TAIR 10
Match: AT5G19350.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 576.6 bits (1485), Expect = 3.9e-164
Identity = 292/421 (69.36%), Postives = 332/421 (78.86%), Query Frame = 0

Query: 608  QPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFV 667
            Q ++H P T+EEVRTLWIGDLQYWVDE+YL SCF+ TGE++S+K+IRNKITGQPEGYGF+
Sbjct: 11   QGSYHHPQTLEEVRTLWIGDLQYWVDENYLTSCFSQTGELVSVKVIRNKITGQPEGYGFI 70

Query: 668  EFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTD 727
            EF+SHAAAER LQTYNGTQMPGTE TFRLNWASFG G+ + DAGP+HSIFVGDLAPDVTD
Sbjct: 71   EFISHAAAERTLQTYNGTQMPGTELTFRLNWASFGSGQ-KVDAGPDHSIFVGDLAPDVTD 130

Query: 728  YLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMR 787
            YLLQETFRV Y SVRGAKVVTDP+TGRSKGYGFVKFA+E+ERNRAM+EMNG+YCSTRPMR
Sbjct: 131  YLLQETFRVHYSSVRGAKVVTDPSTGRSKGYGFVKFAEESERNRAMAEMNGLYCSTRPMR 190

Query: 788  ISAATPKKTIGVQQQYSLGKAMYPVP-----AYTTSVPVLPADYDANNTTIFVGNLDPNI 847
            ISAATPKK +GVQQQY + KA+YPV      A      V P + D   TTI V NLD N+
Sbjct: 191  ISAATPKKNVGVQQQY-VTKAVYPVTVPSAVAAPVQAYVAPPESDVTCTTISVANLDQNV 250

Query: 848  TEEELKQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGR 907
            TEEELK+ F Q GE+ YVKIP+ KG G+VQF TR SAEEA+Q+MQG++IGQQ VR SW +
Sbjct: 251  TEEELKKAFSQLGEVIYVKIPATKGYGYVQFKTRPSAEEAVQRMQGQVIGQQAVRISWSK 310

Query: 908  NPAAKQDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVD 967
            NP   QD   W  Q DPNQW+ YYGYG  YDAY YG  QDPS+YAYG Y GY  YPQQ +
Sbjct: 311  NPG--QD--GWVTQADPNQWNGYYGYGQGYDAYAYGATQDPSVYAYGGY-GYPQYPQQGE 370

Query: 968  GVQDLA-AVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVR 1023
            G QD++ + AG V   EQ  E  D L TPDVD LN AYLS H SAILG P+W  TSSL  
Sbjct: 371  GTQDISNSAAGGVAGAEQ--ELYDPLATPDVDKLNAAYLSVHASAILGRPMWQRTSSLTS 422

BLAST of CaUC01G004240 vs. TAIR 10
Match: AT5G19350.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 572.8 bits (1475), Expect = 5.6e-163
Identity = 287/416 (68.99%), Postives = 326/416 (78.37%), Query Frame = 0

Query: 608  QPTHHQPTTVEEVRTLWIGDLQYWVDESYLNSCFAHTGEVISIKIIRNKITGQPEGYGFV 667
            Q ++H P T+EEVRTLWIGDLQYWVDE+YL SCF+ TGE++S+K+IRNKITGQPEGYGF+
Sbjct: 11   QGSYHHPQTLEEVRTLWIGDLQYWVDENYLTSCFSQTGELVSVKVIRNKITGQPEGYGFI 70

Query: 668  EFVSHAAAERILQTYNGTQMPGTEQTFRLNWASFGIGERRPDAGPEHSIFVGDLAPDVTD 727
            EF+SHAAAER LQTYNGTQMPGTE TFRLNWASFG G+ + DAGP+HSIFVGDLAPDVTD
Sbjct: 71   EFISHAAAERTLQTYNGTQMPGTELTFRLNWASFGSGQ-KVDAGPDHSIFVGDLAPDVTD 130

Query: 728  YLLQETFRVQYPSVRGAKVVTDPNTGRSKGYGFVKFADENERNRAMSEMNGIYCSTRPMR 787
            YLLQETFRV Y SVRGAKVVTDP+TGRSKGYGFVKFA+E+ERNRAM+EMNG+YCSTRPMR
Sbjct: 131  YLLQETFRVHYSSVRGAKVVTDPSTGRSKGYGFVKFAEESERNRAMAEMNGLYCSTRPMR 190

Query: 788  ISAATPKKTIGVQQQYSLGKAMYPVPAYTTSVPVLPADYDANNTTIFVGNLDPNITEEEL 847
            ISAATPKK +GVQQQY     +    A      V P + D   TTI V NLD N+TEEEL
Sbjct: 191  ISAATPKKNVGVQQQYVTKVTVPSAVAAPVQAYVAPPESDVTCTTISVANLDQNVTEEEL 250

Query: 848  KQTFLQFGEIAYVKIPSGKGCGFVQFGTRASAEEAIQKMQGKIIGQQVVRTSWGRNPAAK 907
            K+ F Q GE+ YVKIP+ KG G+VQF TR SAEEA+Q+MQG++IGQQ VR SW +NP   
Sbjct: 251  KKAFSQLGEVIYVKIPATKGYGYVQFKTRPSAEEAVQRMQGQVIGQQAVRISWSKNPG-- 310

Query: 908  QDLATWGQQVDPNQWSAYYGYGGTYDAYGYGVVQDPSLYAYGAYSGYASYPQQVDGVQDL 967
            QD   W  Q DPNQW+ YYGYG  YDAY YG  QDPS+YAYG Y GY  YPQQ +G QD+
Sbjct: 311  QD--GWVTQADPNQWNGYYGYGQGYDAYAYGATQDPSVYAYGGY-GYPQYPQQGEGTQDI 370

Query: 968  A-AVAGAVPSVEQGEEWNDTLDTPDVDYLNDAYLSKHESAILGWPLWLTTSSLVRQ 1023
            + + AG V   EQ  E  D L TPDVD LN AYLS H SAILG P+W  TSSL  Q
Sbjct: 371  SNSAAGGVAGAEQ--ELYDPLATPDVDKLNAAYLSVHASAILGRPMWQRTSSLTSQ 418

BLAST of CaUC01G004240 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 456.4 bits (1173), Expect = 5.8e-128
Identity = 238/579 (41.11%), Postives = 360/579 (62.18%), Query Frame = 0

Query: 8   ITLLQ--GCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAISVSG--SLAYAQLLFHQ 67
           I LLQ  G +++ KLR+IHA  I  G+    A   K L F  +S+     ++YA  +F +
Sbjct: 19  INLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSK 78

Query: 68  MQCP-QTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAER 127
           ++ P     WN++IRG+A+  + I A   Y  M  +    PDT T+ F++KA   +   R
Sbjct: 79  IEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVR 138

Query: 128 KCKEVHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFS 187
             + +H  VIR G+   + V  +L+  Y+  G V SA +VFD+MP +DLVAWN++I+ F+
Sbjct: 139 LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 198

Query: 188 QQGLHLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSLY 247
           + G   E+L  Y +M S+ +  DGFT+V L+S+CA +GAL +G ++H +  + GL ++L+
Sbjct: 199 ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 258

Query: 248 VGNALIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEAR 307
             N L+D+YA+CG +++A  +FD M  K+  +W S+IVG  V+G G EAI  F+ M    
Sbjct: 259 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 318

Query: 308 -MQPNSITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEK 367
            + P  ITF+G+L  CSH G+V+EG +YF  M  E+++ P ++H+GC+VDL  RAG+++K
Sbjct: 319 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 378

Query: 368 ALE-IVSNSSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAG 427
           A E I S   Q + V+WR LLG+C +H +  + E A   + +L   ++GD +LL+ +YA 
Sbjct: 379 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 438

Query: 428 KNDTAGVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQ 487
           +   + V ++RK + R G+K  PG S +E+ ++VH+F++ DKSH  S  +Y KL+E+  +
Sbjct: 439 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 498

Query: 488 ASLFGYVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCR 547
               GYV    IS++ V    E  + +  YHSEK+AIAF L  T + + I +VKNLRVC 
Sbjct: 499 LRSEGYV--PQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCA 558

Query: 548 DCHSFIKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
           DCH  IK VS  +NREI+VRDR RFHHF  G CSC DYW
Sbjct: 559 DCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CaUC01G004240 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 428.3 bits (1100), Expect = 1.7e-119
Identity = 224/574 (39.02%), Postives = 346/574 (60.28%), Query Frame = 0

Query: 8   ITLLQGCNNLNKLRKIHAHVIVSGLRDHVAIGNKLLNFCAIS-VSGSLAYAQLLFHQMQC 67
           I L+  CN+L +L +I A+ I S + D V+   KL+NFC  S    S++YA+ LF  M  
Sbjct: 33  ILLISKCNSLRELMQIQAYAIKSHIED-VSFVAKLINFCTESPTESSMSYARHLFEAMSE 92

Query: 68  PQTEAWNSIIRGFAQSSSPIEAIVFYNRMVSASFSSPDTFTFSFVLKACERIKAERKCKE 127
           P    +NS+ RG+++ ++P+E    +  ++      PD +TF  +LKAC   KA  + ++
Sbjct: 93  PDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGI-LPDNYTFPSLLKACAVAKALEEGRQ 152

Query: 128 VHGSVIRCGYDGDVIVCTNLVKCYSVMGSVCSAQQVFDEMPARDLVAWNAMISCFSQQGL 187
           +H   ++ G D +V VC  L+  Y+    V SA+ VFD +    +V +NAMI+ ++++  
Sbjct: 153 LHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNR 212

Query: 188 HLESLETYNQMRSENVDVDGFTLVGLISSCAHLGALNIGVQMHRFAREKGLVQSLYVGNA 247
             E+L  + +M+ + +  +  TL+ ++SSCA LG+L++G  +H++A++    + + V  A
Sbjct: 213 PNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTA 272

Query: 248 LIDMYAKCGSLDQAILIFDRMQKKDIFTWNSMIVGYGVHGRGSEAIYCFQQMLEARMQPN 307
           LIDM+AKCGSLD A+ IF++M+ KD   W++MIV Y  HG+  +++  F++M    +QP+
Sbjct: 273 LIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPD 332

Query: 308 SITFLGLLCGCSHQGLVQEGVKYFHLMSSEFRLRPEVKHYGCLVDLYGRAGKLEKALEIV 367
            ITFLGLL  CSH G V+EG KYF  M S+F + P +KHYG +VDL  RAG LE A E +
Sbjct: 333 EITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFI 392

Query: 368 SN-SSQNDPVLWRILLGSCKIHKNVTIGEIAMNSLCELGATNAGDCILLATIYAGKNDTA 427
                   P+LWRILL +C  H N+ + E     + EL  ++ GD ++L+ +YA      
Sbjct: 393 DKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWE 452

Query: 428 GVARMRKMIKRQGIKTTPGWSWIEIEDQVHKFVVDDKSHRYSIEVYEKLREVIHQASLFG 487
            V  +RK++K +     PG S IE+ + VH+F   D     + +++  L E++ +  L G
Sbjct: 453 YVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSG 512

Query: 488 YVGDESISSLDVLSTTETLKTSCTYHSEKLAIAFGLARTSDGTQIRIVKNLRVCRDCHSF 547
           YV D S+     ++  E  + +  YHSEKLAI FGL  T  GT IR+VKNLRVCRDCH+ 
Sbjct: 513 YVPDTSMVVHANMNDQEK-EITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNA 572

Query: 548 IKAVSAAFNREIIVRDRVRFHHFNGGQCSCNDYW 580
            K +S  F R++++RD  RFHHF  G+CSC D+W
Sbjct: 573 AKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890323.10.0e+0093.09pentatricopeptide repeat-containing protein At3g56550 [Benincasa hispida] >XP_03... [more]
XP_004152881.10.0e+0090.50pentatricopeptide repeat-containing protein At3g56550 isoform X1 [Cucumis sativu... [more]
XP_016899519.15.5e-30990.67PREDICTED: pentatricopeptide repeat-containing protein At3g56550 [Cucumis melo] ... [more]
XP_022149932.12.9e-30286.53pentatricopeptide repeat-containing protein At3g56550 [Momordica charantia] >XP_... [more]
XP_023537237.16.4e-30287.74pentatricopeptide repeat-containing protein At3g56550 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q9LXY53.4e-20560.10Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX... [more]
Q8VXZ95.4e-16369.36Polyadenylate-binding protein RBP47B' OS=Arabidopsis thaliana OX=3702 GN=RBP47B'... [more]
A8MQA38.2e-12741.11Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q8LK932.4e-11839.02Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q9FI802.7e-11736.51Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0LH200.0e+0090.34DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G0741... [more]
A0A1S4DU662.7e-30990.67pentatricopeptide repeat-containing protein At3g56550 OS=Cucumis melo OX=3656 GN... [more]
A0A5A7TXJ92.7e-30990.67Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1D8321.4e-30286.53pentatricopeptide repeat-containing protein At3g56550 OS=Momordica charantia OX=... [more]
A0A6J1FBZ07.6e-30187.22pentatricopeptide repeat-containing protein At3g56550 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT3G56550.12.4e-20660.10Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G19350.13.9e-16469.36RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT5G19350.25.6e-16368.99RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT4G21065.15.8e-12841.11Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.11.7e-11939.02Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000504RNA recognition motif domainSMARTSM00360rrm1_1coord: 622..697
e-value: 2.8E-14
score: 63.4
coord: 832..899
e-value: 7.1E-23
score: 92.0
coord: 715..789
e-value: 2.0E-20
score: 83.9
IPR000504RNA recognition motif domainPFAMPF00076RRM_1coord: 833..896
e-value: 3.3E-18
score: 65.3
coord: 623..691
e-value: 7.4E-14
score: 51.4
coord: 716..784
e-value: 1.1E-15
score: 57.2
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 621..701
score: 15.408587
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 831..903
score: 17.438179
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 714..793
score: 17.114126
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 703..810
e-value: 1.1E-21
score: 79.3
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 823..910
e-value: 1.9E-24
score: 87.8
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 620..700
e-value: 1.1E-23
score: 85.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 4..117
e-value: 1.4E-10
score: 43.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 223..328
e-value: 4.8E-26
score: 93.2
coord: 120..222
e-value: 2.3E-16
score: 61.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 172..202
e-value: 1.6E-5
score: 24.8
coord: 346..365
e-value: 0.061
score: 13.6
coord: 71..98
e-value: 0.0027
score: 17.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 172..205
e-value: 2.5E-5
score: 22.1
coord: 71..104
e-value: 7.3E-4
score: 17.5
coord: 245..272
e-value: 1.4E-4
score: 19.8
coord: 273..306
e-value: 1.9E-7
score: 28.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 270..318
e-value: 3.4E-9
score: 36.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 240..270
score: 8.977363
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 68..102
score: 8.506026
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 11.301158
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..204
score: 10.226951
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 443..568
e-value: 6.9E-34
score: 116.5
NoneNo IPR availablePANTHERPTHR47928:SF8SUBFAMILY NOT NAMEDcoord: 10..553
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 10..553
NoneNo IPR availableCDDcd12346RRM3_NGR1_NAM8_likecoord: 830..901
e-value: 1.40472E-37
score: 133.198
NoneNo IPR availableCDDcd12344RRM1_SECp43_likecoord: 622..702
e-value: 1.019E-46
score: 159.35
NoneNo IPR availableCDDcd12345RRM2_SECp43_likecoord: 713..792
e-value: 8.35755E-51
score: 170.923
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 243..374
IPR035979RNA-binding domain superfamilySUPERFAMILY54928RNA-binding domain, RBDcoord: 713..905
IPR035979RNA-binding domain superfamilySUPERFAMILY54928RNA-binding domain, RBDcoord: 608..689

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC01G004240.1CaUC01G004240.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding