CsaV3_4G031220 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_4G031220
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein family
Location: chr4 : 21460864 .. 21475543 (-)
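
The location field can be split into chromosome, start, end, and strand to work out the length of the genomic span. The sketch below is illustrative only: `parse_location` and `span_length` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("chr4 : 21460864 .. 21475543 (-)")
length = span_length(start, end)
print(chrom, start, end, strand, length)
```

Under that assumption the gene spans 14,680 bp on the minus strand of chr4.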
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAAATCACACTTTTTCAAGGAATTAAAAAACCGACGTGATTCGTTAATATGTTGGTTTCTAAAAATTGAAACAAACGAACCCTCCTTTTGTAGCACTCGACGTTTTTGTTATTCCTCAACCTTGCGTGCGACGGCAAAAAACTTCGAAGCCTCGCAGCAGCTTGAGCTTGGCAGCTGCAGACAACCAATCTGTCCAACGGCGACGAGATTCTCGACGGCTGTCCCAATTCCCTTTCCAGCAGCGCCAAGAAAAAGGTACATATGTTCAATATATATTTTCAAACTTCAAACTTCTTCACCAAGTGCAACTTTCACTTCAAGCACCCCCTTTTTATTCGTTGCATCCATGGCATTGCGCATTATTCATCCAATCTCGACTCCAATCAGCTTCTTAGTGAGTTATCTAAAAATGGTCGAGTTGATGAAGCTCGTAAGTTGTTTGATCAAATGCCTTATCGGGACAAGTACACATGGAACATTATGATTTCTGCTTATGCCAATTTAGGAAATTTAGTTGAAGCTCGGAAGCTCTTTAATGAAACTCCAATTAAAAATTCTATCACTTGGTCTTCCCTGGTATCCGGATATTGCAAAAATGGGTGTGAAGTTGAAGGCTTGAGGCAGTTCAGCCAAATGTGGAGTGATGGGCAGAAGCCAAGTCAATACACGTTGGGCAGTGTTCTAAGAGCATGTTCAACTTTGAGTTTGCTCCATACTGGCAAAATGATTCATTGCTATGCAATAAAGATCCAATTAGAAGCGAATATATTTGTTGCAACTGGTCTTGTTGACATGTATTCCAAGTGTAAGTGTCTTCTGGAGGCTGAATACCTCTTCTTTTCACTGCCTGATAGGAAGAACTATGTTCAATGGACTGCTATGCTCACTGGATATGCTCAAAATGGCGAGAGTTTGAAGGCAATTCAGTGTTTTAAGGAGATGAGAAATCAGGGAATGGAGTCTAACCATTTCACATTTCCCAGCATATTGACAGCATGTACATCAATTTCAGCTTATGCTTTTGGTCGTCAAGTACATGGATGTATTATTTGGAGTGGCTTTGGTCCTAACGTATATGTTCAAAGTGCATTAGTTGATATGTATGCCAAATGTGGAGACTTAGCTAGTGCGAGAATGATATTGGATACCATGGAAATTGATGATGTTGTGTGTTGGAACTCGATGATTGTTGGGTGTGTTACACATGGATATATGGAGGAAGCTCTAGTTTTGTTCCATAAGATGCATAATCGGGATATAAGAATTGATGATTTCACATATCCGTCTGTTTTGAAATCTCTGGCTTCTTGTAAGAACCTGAAAATTGGAGAATCAGTTCATTCTCTGACTATTAAAACTGGTTTTGATGCTTGCAAAACGGTGAGCAATGCACTTGTTGACATGTATGCTAAACAAGGAAACTTGAGTTGTGCATTAGACGTTTTCAATAAGATATTAGATAAAGATGTAATATCGTGGACCTCCTTGGTCACGGGATATGTTCACAATGGCTTCCACGAAAAGGCTCTGCAGTTATTTTGTGACATGAGAACAGCAAGGGTTGATCTTGACCAATTTGTAGTTGCCTGTGTTTTTAGTGCATGTGCTGAACTAACAGTTATAGAGTTTGGTCGACAGGTTCATGCAAACTTTATCAAATCTAGTGCTGGTTCATTGTTGTCTGCGGAAAACTCTCTCATAACAATGTACGCCAAATGTGGATGCTTAGAAGATGCAATTAGAGTCTTTGACTCAATGGAAACTCGAAATGTCATATCATGGACTGCCATAATAGTTGGTTATGCACAGAATGGGAGAGGGAAGGACTCTCTTCATTTTTATGAACAAATGATAATTGATGGCATAAAGCCAGACGGTGTTACTTTTATTGGTTTGTTATTTGCTTGCAGCCATGCAGGTCTTGTGGAAACTGGTCAATCTTACTTTGAATCAATGGAAAAAGTTTATGGAATAAAGCCAGCTTCTGATCATTATGCT
TGCATGATTGATCTACTGGGACGTGCAGGAAAAATCAATGAGGCAGAGCATTTATTGAACCGAATGGACGTTGAACCCGATGCAACCATATGGAAGTCATTACTTTCTGCATGTAGGGTTCATGGCAATTTAGAACTTGGAGAAAGGGCTGGAAAAAATCTCATTAAATTGGAACCTTCAAATTCTTTGCCTTACGTTTTATTGTCCAATATGTTTTCTGTTGCTGGTAGATGGGAAGATGCAGCCCATATTCGTAGAGCAATGAAGACAATGGGTATTAACAAGGAGCCCGGATATAGTTGGATTGAAATGAAGAGCCAAGTGCATACATTTATATCTGAAGATAGAAGCCATCCTTTGGCGGCTGAAATATATTCAAAGATTGATGAAATGATGATATTAATAAAGGAAGCTGGACATGTTCCAGATATGAACTTTGCATTACGTGACATGGATGAAGAGGCTAAAGAACGTAGTCTAGCATATCATAGTGAGAAGTTGGCAGTTGCATTTGGTCTTCTCACAGTTGCGAAAGGAGCACCAATTCGGATTTTCAAGAATCTGAGAGTATGTGGGGACTGCCACTCAGCAATGAAATATATATCTAGCATTTTTAAGCGGCATATTATTTTGAGAGATTTAAATTGTTTCCATCACTTTATAGAGGGGAAATGTTCTTGTGGAGACTTCTGGTAGGTAGGGTGTTCAGCTTCTTGATTTACTTATCTATATTGATCCACCCTGGAGAATGAAACACCTAATTCCTTGGAGTTATTCTATTGGCACAACCCAAGCAAGATAAGAATGGTGGTAGTCTTTTCCCAATCACGTCAAGAATTACTCTATATTCTTATCGAAAAGTAACAACTATGTTTCTTGGCTCTCTTCATTCTCTACTGGCTGAAGGAGATACAAACTCCATTCAATCGACAAGGGCATCACTTGACCTTGCTCCATTCGTTGTTTCCAATTTATAAGCTTCACAGACTTCAGAGAGCCAGAAAATGAACTCCCAAAACTATGTTAATGCTTGCCAAAGGAAATTATGGTCGTGACTTCACTCCAACTCTTCCTTCTCTCAATTCTTCACCTCAAAGCTCAATTCTTACATCTCTCTAGCAAGCTACTCTTTTCAGATGAAGCCACAGTGATGGTTCTTGTTTTAGATTCTGTCGGTCAGACAATTTTTCACTAGGTGTGGCAAGCTGCTCAAGGAGGTCAAAAGTGATGTCAAGAGGATTTTCTCAGGCTTCTGAAGGCTCGAGAGAACTTTGCTATTATCAATAAGCTTCTTCTGTACCGGGAAAATCGGTAAGAGTTATCAATGAACTCGAAGAGGTCTGTCTTATATTCCTTAACTGCTGTCTTTGTTTTATTGTCTCAAGGAAAACATCTCTTGCCAAGGAAAACAGTTCTCTTAGCGAATTTGTTAGGCCTCGTATTATATTCCAATATCCAAGGAGGGGTAAAAAGGGAATGGCTTATGCTAGTGGGACTGTGGCTTCTACCCTCAATAGTAATAGAATGGGGGACCACAGATGGTTAGAGAGGAGGAAAGAGAGTTATTTGGGAGATCGAAGTTAGGAGAATTTTTCAGTTTTCATTTTGGTAAGTCTTGAGAGTGAGAGGAGGGAATCCTTGCTTCTATGAGAGGATGGACCTTGCAAACAAGTACTTTCAATTCAGTAGGAATTCCTTGTCACTATACCTATCAGAATTTTTGGATTATTTGAACTCAAACTTGTCAAAACCAATCCATGATTCCACTATAGTTTTTAGCTTTCTTTTCCTGAATGTGGATTCAACTAAGGTGTAATTCAATTGTAATTTAACATCTGTCTCTTCTCTTCAACTTTGCACCATTCTTTATTTCAGAGAATGAACTTAAATTTATACTCTTTTAAAAATGAATTTGGGTGTCAAGAAAATGAGTAGGAAATTAATTCCAGGTAGGTGACCACGAACCCATGCCCTTCTAGTTATTGAGACTATGTCTTCCT
TTTACCACTAGGCCAACTTGTGATGGTTTGGTAGGTCTTTTTTTGTTTCTTTTGAATAAGAAATTGATAAATTCATGAGAACAAAAGTACGAATTTTCTTGACACTCAAATGTTGTAGGGTCATGTAGTTTGTTCCGTGAGATTAGTCGAGGTAAGCTGACCCAGACACTCACAAATATATATATAAAAAAGAAGAACTACTAAACCAACCCATATCAATATATAAGAAATATGCAAATGACACAACAAGCAAGCATAAAAAATTTTATTTTTGAAAGAGTATAAATTTATTGTAGTTCTTTTAATGCTAGAGTTTTTTAAAATGTTATCCATAAGCTAAATGAAAAACTATGATTAATTGCAGTGTTGAAAACTTTGTGGGTGTCGTTGGTGTCTGATTTTGGGCCAAGGAACACCTATCAAGAGAGTGGAGAGCTTGCATGATGAAATATTGGCGGGGAAGCTCAAATCAGGTGTGAAATAAAGATTACCAGTTCACAAATTGTTTCATTACTCTGCCTTCTAAAGTAAAAAGGTTTTATCTCTTGGAAATGAATTAATAATTACTACATTTACTTGCTCAACTATGAAGGATAAATTAACATGCAAGCGGAAGCACGAATAAAAACACTCTTAATTTCATGATTTTTGAGGATTAAAGCATTATGAAAATTTAGGGAAAAATAGAAATTCAAGAACACTTATCTTTGTAACTCAAAATTCTCCCTCTATAGTCGTTGAAAATGAACACAACCAACCTCTATAATATCGAATATTACCAACAATTTATTTTCTCAAACATTTCGAACAATTCAATTAGATTAACTGGCTAAACTTTTAATCGAATTAATCAACATTTGTTAACTAATTGGGACTAGCCACTATAACTCGTAACTATACTCTCCTTAGTGTATCTATATTTGTGTCCATTTGTTATAATCATGATTAGTAAGTCAACCCTAGTAAGTCAACCCTTCAGAGTTGTTCATAATCTCGGCTAGGTCAATTTACCGTTTTACCCCCGAACATTTTGTTCCTTAAGTTTCAGCTAATTGAAAGGTATTGATCTTTATCGTAATAGTAATGGATTGACAAGTACAATTCCTAGGGCACAATCATAGACGATCGTCCCTAACTCCGTGTCTCTCACTCTTGAGCAGATACTCCAAAGAAATACTACTTACTCTCAACACACATAGAGAATATATATACTATTCTCCTAAGTGGTCCCTACCCCTTACAATTCCTATGCCCGGATTACTCTTTCCTCTCCTCCTATACTGATGCACGATTGGTGGCCTAACATCACACTCCCCTTCTAAAGTCACCTTGTCCTCAAGGTGGAATGTAGGAAACTGTTGGCGAAATAATCCACAACCTCCCAAGTAGCATCTTGAGCTGATAACCTTTTCCAAGCTACCAATGCCTTCCATTCCCCCAGTGCAGGATTATTGTGATATCCACAAACTTCCTCGGGCACCGTCTCCCACTCAAACCTCTTGGAAAGTGGAGGTAATGTAGGTTGCACCACCTGCTGTCCTAAAGCCTTCTTCAGCTGAGATGCATGAAATGCCAAATGAGTGGAAGCTTCTTCTAGGATAGACGATATGCTACCTATCCCACACTCGCCTCAATATAGTATGGGCCAAGATATTTATGAATTTAGGAGACACGTTCTCATTTCTTCTTACTCACATGGAAGTATGCCTGTACGGGCGTAGCTCGAGGAAACCATAATCACCCACCTCATATTCTACTTCACGTCTTTTTCAATCAGCAAATTTCTTCATCCTTTCGTGTGCCATCCTTAAATGTTCTTTCAACATCCCCAAAGTGATATCTATTTCAAGCAGTTGCTAATCTAGAGTGGAATTTGATGTGGACTACTCACCATGTGCAACCAAAGCAGGAGGTACATAGCCATACAAGGCTTGAAATGGTATTATACCAATACTCCACCCAATGCAGCCTCATATACCAGAGTAGGATGTTCACCATAG
AAACACCGCATATACAGTTTTACACTTTTAGAACTTTTATTCACTACTTTGGTCTGCCCATCCGATTGCGGGTGGTATGTTGTACTTCTATTAAGTTGAGTTCCTCCTAATTGAAATAACTTCTGCCAGAAATGACTTAAAAACACCTTGTCTCTATCTGAAACTATCGGTTTAGGAAAGCCATGAAGATGCACTTGTTCTTCTATAAAAACATTTGCCACTCCTTTAGTCATGGATGAATGTTCAAACCACTTAAAGGACTATATATGGTCAGTCGATCTACTACCACCAATACCACATTAATTTCCTGCGACTTCAGTAACCCTTCAATGAAATCCATGGAAGTACCTTCCCACATTGTGTCCAGAGTTGCTAATGGTAACAATAACCAAGTCGGAGTCATGGCCAATAATTTGTTTTGTTGACATACTGAACACTCCTCGACATACTTTTGTGCATCTTTTTTCATGCCTTTCCAATACAATTCTCAAGCAACCTTTTATAGGGGCGTAAGAAGCCGGAATGCCTCCCAAACACACAATCATGATAAATGTGTAATATGGATGGCAACAGGGAGGAGGTCTTTGAAATTACTAATTGTCTCCTATACTTCAATACACGTTGCTGAAGAGAGAAATTTGAATTACTGTCTTCATCTATCTTCAATTTCTCAATTCTCTCTCTCAATTTAGAATCTCCATAAACTTCCTCATTAATTACAGCCACATTCTACAATGTTGGTACCACCAAATGAGCTAACTCAACACCCTCCGACCATTGATAAGCTCCCTTTTTACAGTTGCATAAATGGAGCAGCCAGACTTCCCTGACTTTGAACGAATCTTCGGTAATAGCCTGTTAATCCCAAGAAGCTCCGCACCTCCCGAAAACATGTAGGAGTCGGTCATTCTAACACAGCTCATATCTTGTTGACATTAGCTTCTACTCCTTCACCAGAAATAATATGCCTCAAATATTCTATTCTCCCCTGCAGAAATTGGCATTTATTTCTATTGGCATATAATTCACTACCCCGCAACACAACCAACACTACCTCCAAATGCTCTTCTAGGTGCTTTTCTAAATTCTTACTCTAGACCAAAATATCATCAAAGAATACCACAACAAATTTCCTCAAATATGGCTTGAAGATCTAATTCATCATAGCTTGAAACGTAGATGGCGTGTTGGTCAATCCAAAAGGCATTACCAAGAGCTTGTAAATTTGTAATGACCCTCATGTGTACAAAAAGCAGTCTTCTCTATATCAACAACATGCATCCTAATTTGATAATAATCGGACTTCAAGTTAATCTTTGAAAACTAGACTGCCTCATTTAATTCATCAAACAACTCATCTACTACTGGTATTTGAAATTTATTCGCAACAGTAACATTGTTCAACGATCGGTAATCCACACAAAATCTCTATCCTTTGTCCTTTTTAACCAACAACATTGGACTGGAACAAGGACTAGCACTCGATAGTGTGATTTCGGAAGCTATCATGTCAACTAATATCATTCTCCATTTCACACTTTTGATTAGGTGTATCTATATGGTCTCACATTCACTGGACCTTCTCCTTCTTTAAGGTGAATGTGATGGTCTACTCCCCGGCTTGGAGGTAATTCCTCTGGCCAATCGAACACATCATCATATTTCAGCAAGACTTTTGTCAAAGCTTCAGTCAGATTGACGGCTGCTTCCACTCCATAGAACTCTTCCCATGCTTCCATTTCTTGCAAAGATCTACATTCTACTAGGAATCCTTGATCCCCTCCTCCCAAGTTTTCGCCAAACTCTTCAAGCTTACTTGTAATTTCTTTAGACTCGGGTCCCCTCTCAATGCAATTGCCTTCCCTTTATGTAGAATCTTCATTGTTAGTGTTCTCCAATCTACTTCAGTAACCCCTAAGGTGTGTAACCACTGCATTCCCAAAATAACATCTACTCCACTCAACTCTAATGGTAAGAATTCCTCTTTAATGACTAGTTCT
CCTAGATGTAATCGCACTCCCTTACAACCCTTCTTTCCCTTTTACAGCAGTGCCAATATTTATTATCACACACCATAATTAGTGGTCATCTCTACTTTGATATTCTCTTTCTCCACTACTTGTTGTGCAATAAAATTGTGCATTGCTCGAGAATCGATAAGCACCACAACATCTTGATCTCCCAATTTCCCTATCAATTTCATCATACCTAGATTCAATAAACCAACTACTGAATTCATCAATAACTGTACTGTTTCCTGTACTTTTGTGGTCTGGAGTTCCACCAGCTCGAATTTTTCCACTGCATCTTCAAACACTTCTAATTCATCAACATCTTTTGTATCAACAGAACCCAAAGCTCACGGTGATCTTTAACTTCACATTAATTCCCTATAGTGTATCATTTATCAACGGGAAAACATAGTCCCTTCTCCATCTTCTCTTTAAATTTATCGTTAGACAAACGCTTAAATGTACCTTCCTTTTGAACAATAGTCGGATTTACCCCTTGCAATGTGATTGTTCACATTGGAACTGGCTTAGTTATCTTCGGCATCTTCTTTGCTGTAATAAGAACTGTCCCTTTAGGGTTATGCACAACATTCAAACCCTTATTTTTGTCGATAGTCGTGCCCTTCTCTCTGTTCTTCACTATTGAATCAGATCAGACTACTTCCCTATTTTATACTCTTTGAGCCACTTTCATCATATTCACAAGCCCAATTGGCTCTCAGCATTCTGCCTCTGCCCTAACCCATGGTATAAGCCCATTCATGAATGTATTCTCCAAGATTTTGTCCGACAAACGTGGTAAAGGTGCAACTAGTTTATCAAAAAGATTTATATATTTCTCCACAGTTGATTCTTGTTTAATCTCAAGGAATTGACTACATATCGTCCCTTGCCGAGGAGATCAGAATCGTTGCAAGAGACACATCTTCAATGCCTTCCAATCAGAAAACGGTTCTCGGTCTTCCTTTGCACGGTACTAGTCCAGAGCTACTGCTTCAAAACTTATAACCATCACCGTCATTTTCTCTTCCTCCATCAATTTATGAATCTGAAAATACCTTTCAGCTCTGAATAACCAAGAATCAGGATTAATGCCATAGAAAATCGACATCTCTACCTTCTTAAACTTGCTCAATTCCATCTTTGCATCTTCATTTCTTTTAACCTGCGTTGTTGACTATCCACTTGCACTTTCAATCATTCCACACTTTTTGATAAGGATTTCATGGATTCCTCCAACTTTGGTAGTTTTTGAATTTCTACACAAATCACTGAAATCTTAGGGCTTGTTTATGTTAGGTACCTAGATTAGTATAAGGTTAAGGGTATAGGGGTAATTAGCTATTTAGGAAGTTACTAGTAGTCATTGTGTAAGTGTGGTTACTAGGGTGGTTACATCTTGTTATAAATGGAGGGAGGGGGAGGCTATTCGGTGGAGTGATCTAGGGCTTGGGTGACAGTACTCAAGAGAGAGGTTCCAAGTGCCTTATACTTGGTTTTATCTTGTATTTTCTTATAGTTACATTATAATAAATTCAGATCTATCCTAACAGTTTCATAACATTCATGTTTTTTGTTTTTGTTTCTCATTTTTTAAAAACGGATTTATTTGATAAATGTTCATATTTCTTGTTTCCAAATTTTAGAAAACCTTTTTTTGGAAATAAGGGAAATTTTTGGAACCACAAAATTAAGCTTCATCATTCCATTTCTTATCCCTCGTTCTGTTTCACTCTAAACTTTGGGATCATTGATTCCTTCCTCTCAGAGCAACTTCACACATGCCATAGCTACTCTCTCCTCCTTCCTCTCATCGACTGGATCTAATCTCAAGTCCAGGGCAAACATGAGGTTTGATTCTTCTTCTTCCATCATATTAGTTGAGATGGAAACTGCGGCTTAATTTTTTTGCATTTCAAAATTGTTAGTCTTGAAAGAACTAGTGAAGTTTTAGAAAGAACAACATTCACACATAGAGTGGATTACTT
TCTTTACCTTCAGGAATTACATCAATTGCATTAATTCAAAATAATTGAGAATTTCAGCTCGTATCCATGAAATAATTAGCCAAAATTTCCAATTATTTGGTTCTCTTTATGGGCAGGGCAAAGTTTTGCTATTGTATGACCTATCCTATTTCTAGCTGTTACAATTACAAAATGCTTTAGAATTTCAAGTCTTTGAGTTTGTTCATCTAAGAAGCACGGACATGAACACGATACACAGACATGACACGACATGGATATGGCGACACGTCATTTTTTAAAAATCTAAAACACGACACAACAAGGATACTTTTATTAAAATATT

mRNA sequence

ATGTTCAATATATATTTTCAAACTTCAAACTTCTTCACCAAGTGCAACTTTCACTTCAAGCACCCCCTTTTTATTCGTTGCATCCATGGCATTGCGCATTATTCATCCAATCTCGACTCCAATCAGCTTCTTAGTGAGTTATCTAAAAATGGTCGAGTTGATGAAGCTCGTAAGTTGTTTGATCAAATGCCTTATCGGGACAAGTACACATGGAACATTATGATTTCTGCTTATGCCAATTTAGGAAATTTAGTTGAAGCTCGGAAGCTCTTTAATGAAACTCCAATTAAAAATTCTATCACTTGGTCTTCCCTGGTATCCGGATATTGCAAAAATGGGTGTGAAGTTGAAGGCTTGAGGCAGTTCAGCCAAATGTGGAGTGATGGGCAGAAGCCAAGTCAATACACGTTGGGCAGTGTTCTAAGAGCATGTTCAACTTTGAGTTTGCTCCATACTGGCAAAATGATTCATTGCTATGCAATAAAGATCCAATTAGAAGCGAATATATTTGTTGCAACTGGTCTTGTTGACATGTATTCCAAGTGTAAGTGTCTTCTGGAGGCTGAATACCTCTTCTTTTCACTGCCTGATAGGAAGAACTATGTTCAATGGACTGCTATGCTCACTGGATATGCTCAAAATGGCGAGAGTTTGAAGGCAATTCAGTGTTTTAAGGAGATGAGAAATCAGGGAATGGAGTCTAACCATTTCACATTTCCCAGCATATTGACAGCATGTACATCAATTTCAGCTTATGCTTTTGGTCGTCAAGTACATGGATGTATTATTTGGAGTGGCTTTGGTCCTAACGTATATGTTCAAAGTGCATTAGTTGATATGTATGCCAAATGTGGAGACTTAGCTAGTGCGAGAATGATATTGGATACCATGGAAATTGATGATGTTGTGTGTTGGAACTCGATGATTGTTGGGTGTGTTACACATGGATATATGGAGGAAGCTCTAGTTTTGTTCCATAAGATGCATAATCGGGATATAAGAATTGATGATTTCACATATCCGTCTGTTTTGAAATCTCTGGCTTCTTGTAAGAACCTGAAAATTGGAGAATCAGTTCATTCTCTGACTATTAAAACTGGTTTTGATGCTTGCAAAACGGTGAGCAATGCACTTGTTGACATGTATGCTAAACAAGGAAACTTGAGTTGTGCATTAGACGTTTTCAATAAGATATTAGATAAAGATGTAATATCGTGGACCTCCTTGGTCACGGGATATGTTCACAATGGCTTCCACGAAAAGGCTCTGCAGTTATTTTGTGACATGAGAACAGCAAGGGTTGATCTTGACCAATTTGTAGTTGCCTGTGTTTTTAGTGCATGTGCTGAACTAACAGTTATAGAGTTTGGTCGACAGGTTCATGCAAACTTTATCAAATCTAGTGCTGGTTCATTGTTGTCTGCGGAAAACTCTCTCATAACAATGTACGCCAAATGTGGATGCTTAGAAGATGCAATTAGAGTCTTTGACTCAATGGAAACTCGAAATGTCATATCATGGACTGCCATAATAGTTGGTTATGCACAGAATGGGAGAGGGAAGGACTCTCTTCATTTTTATGAACAAATGATAATTGATGGCATAAAGCCAGACGGTGTTACTTTTATTGGTTTGTTATTTGCTTGCAGCCATGCAGGTCTTGTGGAAACTGGTCAATCTTACTTTGAATCAATGGAAAAAGTTTATGGAATAAAGCCAGCTTCTGATCATTATGCTTGCATGATTGATCTACTGGGACGTGCAGGAAAAATCAATGAGGCAGAGCATTTATTGAACCGAATGGACGTTGAACCCGATGCAACCATATGGAAGTCATTACTTTCTGCATGTAGGGTTCATGGCAATTTAGAACTTGGAGAAAGGGCTGGAAAAAATCTCATTAAATTGGAACCTTCAAATTCTTTGCCTTACGTTTTATTGTCCAATATGTTTTCTGTTGCTGGTAGATGGGAAGATGCAGCCCATATTCGTAGAGCAAT
GAAGACAATGGGTATTAACAAGGAGCCCGGATATAGTTGGATTGAAATGAAGAGCCAAGTGCATACATTTATATCTGAAGATAGAAGCCATCCTTTGGCGGCTGAAATATATTCAAAGATTGATGAAATGATGATATTAATAAAGGAAGCTGGACATGTTCCAGATATGAACTTTGCATTACGTGACATGGATGAAGAGGCTAAAGAACGTAGTCTAGCATATCATAGTGAGAAGTTGGCAGTTGCATTTGGTCTTCTCACAGTTGCGAAAGGAGCACCAATTCGGATTTTCAAGAATCTGAGAGTATGTGGGGACTGCCACTCAGCAATGAAATATATATCTAGCATTTTTAAGCGGCATATTATTTTGAGAGATTTAAATTGTTTCCATCACTTTATAGAGGGGAAATGTTCTTGTGGAGACTTCTGGTAG

Coding sequence (CDS)

ATGTTCAATATATATTTTCAAACTTCAAACTTCTTCACCAAGTGCAACTTTCACTTCAAGCACCCCCTTTTTATTCGTTGCATCCATGGCATTGCGCATTATTCATCCAATCTCGACTCCAATCAGCTTCTTAGTGAGTTATCTAAAAATGGTCGAGTTGATGAAGCTCGTAAGTTGTTTGATCAAATGCCTTATCGGGACAAGTACACATGGAACATTATGATTTCTGCTTATGCCAATTTAGGAAATTTAGTTGAAGCTCGGAAGCTCTTTAATGAAACTCCAATTAAAAATTCTATCACTTGGTCTTCCCTGGTATCCGGATATTGCAAAAATGGGTGTGAAGTTGAAGGCTTGAGGCAGTTCAGCCAAATGTGGAGTGATGGGCAGAAGCCAAGTCAATACACGTTGGGCAGTGTTCTAAGAGCATGTTCAACTTTGAGTTTGCTCCATACTGGCAAAATGATTCATTGCTATGCAATAAAGATCCAATTAGAAGCGAATATATTTGTTGCAACTGGTCTTGTTGACATGTATTCCAAGTGTAAGTGTCTTCTGGAGGCTGAATACCTCTTCTTTTCACTGCCTGATAGGAAGAACTATGTTCAATGGACTGCTATGCTCACTGGATATGCTCAAAATGGCGAGAGTTTGAAGGCAATTCAGTGTTTTAAGGAGATGAGAAATCAGGGAATGGAGTCTAACCATTTCACATTTCCCAGCATATTGACAGCATGTACATCAATTTCAGCTTATGCTTTTGGTCGTCAAGTACATGGATGTATTATTTGGAGTGGCTTTGGTCCTAACGTATATGTTCAAAGTGCATTAGTTGATATGTATGCCAAATGTGGAGACTTAGCTAGTGCGAGAATGATATTGGATACCATGGAAATTGATGATGTTGTGTGTTGGAACTCGATGATTGTTGGGTGTGTTACACATGGATATATGGAGGAAGCTCTAGTTTTGTTCCATAAGATGCATAATCGGGATATAAGAATTGATGATTTCACATATCCGTCTGTTTTGAAATCTCTGGCTTCTTGTAAGAACCTGAAAATTGGAGAATCAGTTCATTCTCTGACTATTAAAACTGGTTTTGATGCTTGCAAAACGGTGAGCAATGCACTTGTTGACATGTATGCTAAACAAGGAAACTTGAGTTGTGCATTAGACGTTTTCAATAAGATATTAGATAAAGATGTAATATCGTGGACCTCCTTGGTCACGGGATATGTTCACAATGGCTTCCACGAAAAGGCTCTGCAGTTATTTTGTGACATGAGAACAGCAAGGGTTGATCTTGACCAATTTGTAGTTGCCTGTGTTTTTAGTGCATGTGCTGAACTAACAGTTATAGAGTTTGGTCGACAGGTTCATGCAAACTTTATCAAATCTAGTGCTGGTTCATTGTTGTCTGCGGAAAACTCTCTCATAACAATGTACGCCAAATGTGGATGCTTAGAAGATGCAATTAGAGTCTTTGACTCAATGGAAACTCGAAATGTCATATCATGGACTGCCATAATAGTTGGTTATGCACAGAATGGGAGAGGGAAGGACTCTCTTCATTTTTATGAACAAATGATAATTGATGGCATAAAGCCAGACGGTGTTACTTTTATTGGTTTGTTATTTGCTTGCAGCCATGCAGGTCTTGTGGAAACTGGTCAATCTTACTTTGAATCAATGGAAAAAGTTTATGGAATAAAGCCAGCTTCTGATCATTATGCTTGCATGATTGATCTACTGGGACGTGCAGGAAAAATCAATGAGGCAGAGCATTTATTGAACCGAATGGACGTTGAACCCGATGCAACCATATGGAAGTCATTACTTTCTGCATGTAGGGTTCATGGCAATTTAGAACTTGGAGAAAGGGCTGGAAAAAATCTCATTAAATTGGAACCTTCAAATTCTTTGCCTTACGTTTTATTGTCCAATATGTTTTCTGTTGCTGGTAGATGGGAAGATGCAGCCCATATTCGTAGAGCAAT
GAAGACAATGGGTATTAACAAGGAGCCCGGATATAGTTGGATTGAAATGAAGAGCCAAGTGCATACATTTATATCTGAAGATAGAAGCCATCCTTTGGCGGCTGAAATATATTCAAAGATTGATGAAATGATGATATTAATAAAGGAAGCTGGACATGTTCCAGATATGAACTTTGCATTACGTGACATGGATGAAGAGGCTAAAGAACGTAGTCTAGCATATCATAGTGAGAAGTTGGCAGTTGCATTTGGTCTTCTCACAGTTGCGAAAGGAGCACCAATTCGGATTTTCAAGAATCTGAGAGTATGTGGGGACTGCCACTCAGCAATGAAATATATATCTAGCATTTTTAAGCGGCATATTATTTTGAGAGATTTAAATTGTTTCCATCACTTTATAGAGGGGAAATGTTCTTGTGGAGACTTCTGGTAG

Protein sequence

MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLFDQMPYRDKYTWNIMISAYANLGNLVEARKLFNETPIKNSITWSSLVSGYCKNGCEVEGLRQFSQMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIEGKCSCGDFW
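
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last six codons of the CDS shown above.
cds_start = "ATGTTCAATATATATTTTCAAACTTCA"
cds_end = "TGTGGAGACTTCTGGTAG"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first nine and last five residues of the protein sequence listed above.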
BLAST of CsaV3_4G031220 vs. NCBI nr
Match: XP_011653924.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial [Cucumis sativus] >XP_011653925.1 PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial [Cucumis sativus] >XP_011653926.1 PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial [Cucumis sativus] >XP_011653927.1 PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial [Cucumis sativus])

HSP 1 Score: 1533.9 bits (3970), Expect = 0.0e+00
Identity = 810/810 (100.00%), Positives = 810/810 (100.00%), Query Frame = 0

Query: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60
           MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF
Sbjct: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60

Query: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
           DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180
           XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS
Sbjct: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180

Query: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240
           KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP
Sbjct: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240

Query: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300
           SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID
Sbjct: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300

Query: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360
           DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH
Sbjct: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360

Query: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420
           SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE
Sbjct: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420

Query: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480
           KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI
Sbjct: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480

Query: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540
           TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV
Sbjct: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540

Query: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600
           TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR
Sbjct: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600

Query: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660
           MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA
Sbjct: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660

Query: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720
           AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV
Sbjct: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720

Query: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780
           PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI
Sbjct: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780

Query: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 810
           SSIFKRHIILRDLNCFHHFIEGKCSCGDFW
Sbjct: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 810

BLAST of CsaV3_4G031220 vs. NCBI nr
Match: XP_008442211.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo] >XP_008442212.1 PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo] >XP_008442213.1 PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo] >XP_016899536.1 PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo] >XP_016899537.1 PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 [Cucumis melo])

HSP 1 Score: 1438.7 bits (3723), Expect = 0.0e+00
Identity = 769/810 (94.94%), Positives = 783/810 (96.67%), Query Frame = 0

Query: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60
           MFNIYF+TSN   KCNFHFK  LFIRCIH IAHYSSN+ SNQLLSELSKNGRVDEARKLF
Sbjct: 1   MFNIYFRTSN---KCNFHFKLTLFIRCIHDIAHYSSNVVSNQLLSELSKNGRVDEARKLF 60

Query: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
                    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180
           XXXXMWSDGQKPSQYTLGSVLRACSTLSLLH+GKMIHCYAIKIQLE NIFVATGLVDMYS
Sbjct: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHSGKMIHCYAIKIQLEENIFVATGLVDMYS 180

Query: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240
           KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMR QGMESNHFTFP
Sbjct: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRIQGMESNHFTFP 240

Query: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300
           SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASAR+IL+TMEID
Sbjct: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARVILNTMEID 300

Query: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360
           DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPS LKSLAS KNLKIG+SVH
Sbjct: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSALKSLASSKNLKIGQSVH 360

Query: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420
           SL IKTGFDACKTVSNALVDMYAKQGNLSCALDVFN+ILDKDVISWTSLVTGYVHNGFHE
Sbjct: 361 SLIIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNRILDKDVISWTSLVTGYVHNGFHE 420

Query: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480
           KAL+LFCDMR ARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSS GSLLSAENSLI
Sbjct: 421 KALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLI 480

Query: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540
           TMYAKCGCLEDAIRVFDSME RNVISWTAIIVGYAQNGRGKDSLHFY+QMI++GIKPD V
Sbjct: 481 TMYAKCGCLEDAIRVFDSMEIRNVISWTAIIVGYAQNGRGKDSLHFYDQMIMNGIKPDDV 540

Query: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600
           TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGK+NEAEHLLNR
Sbjct: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEHLLNR 600

Query: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660
           MDVEPDATIWKSLLSACRVHGNLELGERAG+NLIKLEPSNSLPYVLLSNMFSVAGRWEDA
Sbjct: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGRNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660

Query: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720
           AHIR AMKTMGINKEPGYSWIE+KSQVH FISEDRSHPLAAEIYSKIDEMMILIKEAGHV
Sbjct: 661 AHIRIAMKTMGINKEPGYSWIEVKSQVHRFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720

Query: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780
           PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI
Sbjct: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780

Query: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 810
           SSIFKRHIILRDLNCFHHFIEGKCSCGDFW
Sbjct: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 807
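
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. The sketch below parses one such line and recomputes them; `parse_hsp_stats` is a hypothetical helper written for the line layout shown in this record, not part of any BLAST toolkit.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)".
# "Pos\w+" is deliberately loose so spelling variants of the label still match.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w+\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, id_len, positives, pos_len = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": id_len,
        "pct_identity": round(100 * identities / id_len, 2),
        "pct_positives": round(100 * positives / pos_len, 2),
    }

line = "Identity = 769/810 (94.94%), Positives = 783/810 (96.67%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

For the Cucumis melo isoform X1 hit above, the recomputed values agree with the reported 94.94% identity and 96.67% positives.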

BLAST of CsaV3_4G031220 vs. NCBI nr
Match: KGN54859.1 (hypothetical protein Csa_4G554180 [Cucumis sativus])

HSP 1 Score: 1436.4 bits (3717), Expect = 0.0e+00
Identity = 768/768 (100.00%), Positives = 768/768 (100.00%), Query Frame = 0

Query: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60
           MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF
Sbjct: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60

Query: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
           DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180
           XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS
Sbjct: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180

Query: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240
           KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP
Sbjct: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240

Query: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300
           SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID
Sbjct: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300

Query: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360
           DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH
Sbjct: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360

Query: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420
           SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE
Sbjct: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420

Query: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480
           KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI
Sbjct: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480

Query: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540
           TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV
Sbjct: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540

Query: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600
           TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR
Sbjct: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600

Query: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660
           MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA
Sbjct: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660

Query: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720
           AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV
Sbjct: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720

Query: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLR 768
           PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLR
Sbjct: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLR 768

BLAST of CsaV3_4G031220 vs. NCBI nr
Match: XP_016899538.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X2 [Cucumis melo])

HSP 1 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 663/686 (96.65%), Positives = 675/686 (98.40%), Query Frame = 0

Query: 125 MWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKC 184
           MWSDGQKPSQYTLGSVLRACSTLSLLH+GKMIHCYAIKIQLE NIFVATGLVDMYSKCKC
Sbjct: 63  MWSDGQKPSQYTLGSVLRACSTLSLLHSGKMIHCYAIKIQLEENIFVATGLVDMYSKCKC 122

Query: 185 LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILT 244
           LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMR QGMESNHFTFPSILT
Sbjct: 123 LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRIQGMESNHFTFPSILT 182

Query: 245 ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVC 304
           ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASAR+IL+TMEIDDVVC
Sbjct: 183 ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARVILNTMEIDDVVC 242

Query: 305 WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTI 364
           WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPS LKSLAS KNLKIG+SVHSL I
Sbjct: 243 WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSALKSLASSKNLKIGQSVHSLII 302

Query: 365 KTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQ 424
           KTGFDACKTVSNALVDMYAKQGNLSCALDVFN+ILDKDVISWTSLVTGYVHNGFHEKAL+
Sbjct: 303 KTGFDACKTVSNALVDMYAKQGNLSCALDVFNRILDKDVISWTSLVTGYVHNGFHEKALK 362

Query: 425 LFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYA 484
           LFCDMR ARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSS GSLLSAENSLITMYA
Sbjct: 363 LFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYA 422

Query: 485 KCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIG 544
           KCGCLEDAIRVFDSME RNVISWTAIIVGYAQNGRGKDSLHFY+QMI++GIKPD VTFIG
Sbjct: 423 KCGCLEDAIRVFDSMEIRNVISWTAIIVGYAQNGRGKDSLHFYDQMIMNGIKPDDVTFIG 482

Query: 545 LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVE 604
           LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGK+NEAEHLLNRMDVE
Sbjct: 483 LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEHLLNRMDVE 542

Query: 605 PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR 664
           PDATIWKSLLSACRVHGNLELGERAG+NLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR
Sbjct: 543 PDATIWKSLLSACRVHGNLELGERAGRNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR 602

Query: 665 RAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN 724
            AMKTMGINKEPGYSWIE+KSQVH FISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN
Sbjct: 603 IAMKTMGINKEPGYSWIEVKSQVHRFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN 662

Query: 725 FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF 784
           FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF
Sbjct: 663 FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF 722

Query: 785 KRHIILRDLNCFHHFIEGKCSCGDFW 810
           KRHIILRDLNCFHHFIEGKCSCGDFW
Sbjct: 723 KRHIILRDLNCFHHFIEGKCSCGDFW 748

BLAST of CsaV3_4G031220 vs. NCBI nr
Match: XP_022967715.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g03880, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 1344.7 bits (3479), Expect = 0.0e+00
Identity = 720/810 (88.89%), Positives = 753/810 (92.96%), Query Frame = 0

Query: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60
           MF IYFQTSN FTKC F F      RCIH + + SSN  SNQ LSELSK+GRVDEARKLF
Sbjct: 27  MFTIYFQTSNSFTKC-FXF------RCIHNLVYDSSNFVSNQRLSELSKDGRVDEARKLF 86

Query: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
           D M YRD Y  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 87  DHMSYRDTYTWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 146

Query: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180
           XXXX WS+GQKPSQYTLGSVLRACSTL LLH+GKMIH Y IKIQLEANIFVATGLVDMYS
Sbjct: 147 XXXXXWSEGQKPSQYTLGSVLRACSTLGLLHSGKMIHGYVIKIQLEANIFVATGLVDMYS 206

Query: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240
           KCKCLLEAEYLF SL DRKNYV  TAMLTGYAQNGESLKA+QCFKEMR QGMESNHFTFP
Sbjct: 207 KCKCLLEAEYLFVSLSDRKNYVLSTAMLTGYAQNGESLKAMQCFKEMRIQGMESNHFTFP 266

Query: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300
           SILTACT+ISAY+FG+QVHGCII SGFG NVYVQSALVDMYAKCGDL SARM+L+ MEID
Sbjct: 267 SILTACTAISAYSFGQQVHGCIILSGFGANVYVQSALVDMYAKCGDLNSARMLLNIMEID 326

Query: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360
           DVVCWNSMIVGCVTHG+MEEALVLFHKMHNRDI IDDFTYPSVLKSL +C++LK GESVH
Sbjct: 327 DVVCWNSMIVGCVTHGHMEEALVLFHKMHNRDIVIDDFTYPSVLKSLGTCRDLKNGESVH 386

Query: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420
           SL +KTGFDACKTVSNALVDMYAKQGNL+CAL+VFNKI DKDVISWTSLVTGYVHNGFHE
Sbjct: 387 SLIMKTGFDACKTVSNALVDMYAKQGNLNCALEVFNKISDKDVISWTSLVTGYVHNGFHE 446

Query: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480
           KAL+LFCDMR A VDLDQFV+ACVFSACAELT+IEFGRQVH NFIKSS GSLLSAENSLI
Sbjct: 447 KALKLFCDMRIAGVDLDQFVIACVFSACAELTIIEFGRQVHGNFIKSSVGSLLSAENSLI 506

Query: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540
           TMYAKCGCLEDA RVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFY++MIIDG+KPD V
Sbjct: 507 TMYAKCGCLEDATRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYDRMIIDGVKPDPV 566

Query: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600
           TFIGLLFACSHAGLVETG+SYFESMEKVYGIKP SDHYACMIDLLGRAGK+NEAE LLNR
Sbjct: 567 TFIGLLFACSHAGLVETGRSYFESMEKVYGIKPGSDHYACMIDLLGRAGKLNEAEELLNR 626

Query: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660
           MDVEPDAT+WKSLLSACRVHGNLELGERAGKNLIKLEP NSLPYVLLSNMFSVAGRWEDA
Sbjct: 627 MDVEPDATVWKSLLSACRVHGNLELGERAGKNLIKLEPLNSLPYVLLSNMFSVAGRWEDA 686

Query: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720
            HIR +MK MGINKEPGYSWIEMKSQVH+FISEDRSHP+AAEIYSKIDEMMILIKEAG+V
Sbjct: 687 THIRNSMKRMGINKEPGYSWIEMKSQVHSFISEDRSHPMAAEIYSKIDEMMILIKEAGYV 746

Query: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780
           PDMNFALRDMDEEAKERSL YHSEKLAVAFGLL V   APIRIFKNLRVCGDCHSAMKYI
Sbjct: 747 PDMNFALRDMDEEAKERSLTYHSEKLAVAFGLLAVPNRAPIRIFKNLRVCGDCHSAMKYI 806

Query: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 811
           SS+FKRH+ILRDLNCFHHF EGKCSCGDFW
Sbjct: 807 SSVFKRHVILRDLNCFHHFKEGKCSCGDFW 829

BLAST of CsaV3_4G031220 vs. TAIR10
Match: AT3G61170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 743.0 bits (1917), Expect = 1.9e-214
Identity = 436/758 (57.52%), Positives = 544/758 (71.77%), Query Frame = 0

Query: 24  FIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLFDQMPYRDKYXXXXXXXXXXXXXX 83
           F  CIH  A   + L SN LL +LSK+GRVDEAR++FD+MP     XXXXXXXXXXXXXX
Sbjct: 16  FGSCIHSYAD-RTKLHSNLLLGDLSKSGRVDEARQMFDKMPEXXXXXXXXXXXXXXXXXX 75

Query: 84  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMWSDGQKPSQYTLGSVLRA 143
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  M SDG KP++YTLGSVLR 
Sbjct: 76  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWEMQSDGIKPNEYTLGSVLRM 135

Query: 144 CSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQ 203
           C++L LL  G+ IH + IK   + ++ V  GL+ MY++CK + EAEYLF ++   KN V 
Sbjct: 136 CTSLVLLLRGEQIHGHTIKTGFDLDVNVVNGLLAMYAQCKRISEAEYLFETMEGEKNNVT 195

Query: 204 WTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCII 263
           WT+MLTGY+QNG + KAI+CF+++R +G +SN +TFPS+LTAC S+SA   G QVH CI+
Sbjct: 196 WTSMLTGYSQNGFAFKAIECFRDLRREGNQSNQYTFPSVLTACASVSACRVGVQVHCCIV 255

Query: 264 WSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEALV 323
            SGF  N+YVQSAL+DMYAKC ++ SAR +L+ ME+DDVV WNSMIVGCV  G + EAL 
Sbjct: 256 KSGFKTNIYVQSALIDMYAKCREMESARALLEGMEVDDVVSWNSMIVGCVRQGLIGEALS 315

Query: 324 LFHKMHNRDIRIDDFTYPSVLKSLA-SCKNLKIGESVHSLTIKTGFDACKTVSNALVDMY 383
           +F +MH RD++IDDFT PS+L   A S   +KI  S H L +KTG+   K V+NALVDMY
Sbjct: 316 MFGRMHERDMKIDDFTIPSILNCFALSRTEMKIASSAHCLIVKTGYATYKLVNNALVDMY 375

Query: 384 AKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQLFCDMRTARVDLDQFVVA 443
           AK+G +  AL VF  +++KDVISWT+LVTG  HNG +++AL+LFC+MR   +  D+ V A
Sbjct: 376 AKRGIMDSALKVFEGMIEKDVISWTALVTGNTHNGSYDEALKLFCNMRVGGITPDKIVTA 435

Query: 444 CVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDAIRVFDSMETR 503
            V SA AELT++EFG+QVH N+IKS   S LS  NSL+TMY KCG LEDA  +F+SME R
Sbjct: 436 SVLSASAELTLLEFGQQVHGNYIKSGFPSSLSVNNSLVTMYTKCGSLEDANVIFNSMEIR 495

Query: 504 NVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLFACSHAGLVETGQSYF 563
           ++I+WT                                                      
Sbjct: 496 DLITWTC-----------------------------------XXXXXXXXXXXXXXXXXX 555

Query: 564 ESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKSLLSACRVHGN 623
                VYGI P  +HYACMIDL GR+G   + E LL++M+VEPDAT+WK++L+A R HGN
Sbjct: 556 XXXXTVYGITPGPEHYACMIDLFGRSGDFVKVEQLLHQMEVEPDATVWKAILAASRKHGN 615

Query: 624 LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIE 683
           +E GERA K L++LEP+N++PYV LSNM+S AGR ++AA++RR MK+  I+KEPG SW+E
Sbjct: 616 IENGERAAKTLMELEPNNAVPYVQLSNMYSAAGRQDEAANVRRLMKSRNISKEPGCSWVE 675

Query: 684 MKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYH 743
            K +VH+F+SEDR HP   EIYSK+DEMM+LIKEAG+  DM+FAL D+D+E KE  LAYH
Sbjct: 676 EKGKVHSFMSEDRRHPRMVEIYSKVDEMMLLIKEAGYFADMSFALHDLDKEGKELGLAYH 735

Query: 744 SEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 781
           SEKLAVAFGLL V  GAPIRI KNLRVCGDCHSAMK +
Sbjct: 736 SEKLAVAFGLLVVPSGAPIRIIKNLRVCGDCHSAMKLL 737

BLAST of CsaV3_4G031220 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 556.2 bits (1432), Expect = 3.3e-158
Identity = 274/678 (40.41%), Positives = 425/678 (62.68%), Query Frame = 0

Query: 134 QYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFF 193
           Q T   +L     +  L  G+ +HC A+K+ L+  + V+  L++MY K +    A  +F 
Sbjct: 315 QVTFILMLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFD 374

Query: 194 SLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSI-SAY 253
           ++ +R + + W +++ G AQNG  ++A+  F ++   G++ + +T  S+L A +S+    
Sbjct: 375 NMSER-DLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGL 434

Query: 254 AFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGC 313
           +  +QVH   I      + +V +AL+D Y++   +  A ++ +     D+V WN+M+ G 
Sbjct: 435 SLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGY 494

Query: 314 VTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACK 373
                  + L LF  MH +  R DDFT  +V K+      +  G+ VH+  IK+G+D   
Sbjct: 495 TQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDL 554

Query: 374 TVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQLFCDMRTA 433
            VS+ ++DMY K G++S A   F+ I   D ++WT++++G + NG  E+A  +F  MR  
Sbjct: 555 WVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLM 614

Query: 434 RVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDA 493
            V  D+F +A +  A + LT +E GRQ+HAN +K +  +      SL+ MYAKCG ++DA
Sbjct: 615 GVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDA 674

Query: 494 IRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLFACSHA 553
             +F  +E  N+ +W A++VG AQ+G GK++L  ++QM   GIKPD VTFIG+L ACSH+
Sbjct: 675 YCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHS 734

Query: 554 GLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKS 613
           GLV     +  SM   YGIKP  +HY+C+ D LGRAG + +AE+L+  M +E  A+++++
Sbjct: 735 GLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRT 794

Query: 614 LLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGI 673
           LL+ACRV G+ E G+R    L++LEP +S  YVLLSNM++ A +W++    R  MK   +
Sbjct: 795 LLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKV 854

Query: 674 NKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDE 733
            K+PG+SWIE+K+++H F+ +DRS+     IY K+ +M+  IK+ G+VP+ +F L D++E
Sbjct: 855 KKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEE 914

Query: 734 EAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIFKRHIILRD 793
           E KER+L YHSEKLAVAFGLL+     PIR+ KNLRVCGDCH+AMKYI+ ++ R I+LRD
Sbjct: 915 EEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRD 974

Query: 794 LNCFHHFIEGKCSCGDFW 811
            N FH F +G CSCGD+W
Sbjct: 975 ANRFHRFKDGICSCGDYW 990

BLAST of CsaV3_4G031220 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 542.7 bits (1397), Expect = 3.8e-154
Identity = 276/703 (39.26%), Positives = 425/703 (60.46%), Query Frame = 0

Query: 125 MWSDGQKPSQYTLGSVLRACSTLSL---LHTGKMIHCYAIKIQLEANIFVATGLVDMYSK 184
           M  +  +PS +TL SV+ ACS L +   L  GK +H Y ++ + E N F+   LV MY K
Sbjct: 190 MLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLR-KGELNSFIINTLVAMYGK 249

Query: 185 CKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPS 244
              L  ++ L  S   R + V W  +L+   QN + L+A++  +EM  +G+E + FT  S
Sbjct: 250 LGKLASSKVLLGSFGGR-DLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISS 309

Query: 245 ILTACTSISAYAFGRQVHGCIIWSG-FGPNVYVQSALVDMYAKCGDLASARMILDTMEID 304
           +L AC+ +     G+++H   + +G    N +V SALVDMY  C  + S R + D M   
Sbjct: 310 VLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDR 369

Query: 305 DVVCWNSMIVGCVTHGYMEEALVLFHKM-HNRDIRIDDFTYPSVLKSLASCKNLKIGESV 364
            +  WN+MI G   + + +EAL+LF  M  +  +  +  T   V+ +          E++
Sbjct: 370 KIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAI 429

Query: 365 HSLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFH 424
           H   +K G D  + V N L+DMY++ G +  A+ +F K+ D+D+++W +++TGYV +  H
Sbjct: 430 HGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHH 489

Query: 425 EKALQLFCDMR---------TARVDL--DQFVVACVFSACAELTVIEFGRQVHANFIKSS 484
           E AL L   M+          +RV L  +   +  +  +CA L+ +  G+++HA  IK++
Sbjct: 490 EDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNN 549

Query: 485 AGSLLSAENSLITMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYE 544
             + ++  ++L+ MYAKCGCL+ + +VFD +  +NVI+W  II+ Y  +G G++++    
Sbjct: 550 LATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLR 609

Query: 545 QMIIDGIKPDGVTFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRA 604
            M++ G+KP+ VTFI +  ACSH+G+V+ G   F  M+  YG++P+SDHYAC++DLLGRA
Sbjct: 610 MMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRA 669

Query: 605 GKINEAEHLLNRMDVE-PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLL 664
           G+I EA  L+N M  +   A  W SLL A R+H NLE+GE A +NLI+LEP+ +  YVLL
Sbjct: 670 GRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLL 729

Query: 665 SNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKI 724
           +N++S AG W+ A  +RR MK  G+ KEPG SWIE   +VH F++ D SHP + ++   +
Sbjct: 730 ANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYL 789

Query: 725 DEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNL 784
           + +   +++ G+VPD +  L +++E+ KE  L  HSEKLA+AFG+L  + G  IR+ KNL
Sbjct: 790 ETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNL 849

Query: 785 RVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIEGKCSCGDFW 811
           RVC DCH A K+IS I  R IILRD+  FH F  G CSCGD+W
Sbjct: 850 RVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 890

BLAST of CsaV3_4G031220 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 540.4 bits (1391), Expect = 1.9e-153
Identity = 278/669 (41.55%), Positives = 414/669 (61.88%), Query Frame = 0

Query: 150 LHTGKMIHCYAIKIQL-EANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQWTAML 209
           L  G+ +H + I   L +  + +  GLV+MY+KC  + +A  +F+ + D K+ V W +M+
Sbjct: 329 LKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTD-KDSVSWNSMI 388

Query: 210 TGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCIIWSGFG 269
           TG  QNG  ++A++ +K MR   +    FT  S L++C S+     G+Q+HG  +  G  
Sbjct: 389 TGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGID 448

Query: 270 PNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHG--YMEEALVLFH 329
            NV V +AL+ +YA+ G L   R I  +M   D V WNS I+G +      + EA+V F 
Sbjct: 449 LNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNS-IIGALARSERSLPEAVVCFL 508

Query: 330 KMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACKTVSNALVDMYAKQG 389
                  +++  T+ SVL +++S    ++G+ +H L +K       T  NAL+  Y K G
Sbjct: 509 NAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCG 568

Query: 390 NLSCALDVFNKILD-KDVISWTSLVTGYVHNGFHEKALQLFCDMRTARVDLDQFVVACVF 449
            +     +F+++ + +D ++W S+++GY+HN    KAL L   M      LD F+ A V 
Sbjct: 569 EMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVL 628

Query: 450 SACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDAIRVFDSMETRNVI 509
           SA A +  +E G +VHA  +++   S +   ++L+ MY+KCG L+ A+R F++M  RN  
Sbjct: 629 SAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSY 688

Query: 510 SWTAIIVGYAQNGRGKDSLHFYEQMIIDG-IKPDGVTFIGLLFACSHAGLVETGQSYFES 569
           SW ++I GYA++G+G+++L  +E M +DG   PD VTF+G+L ACSHAGL+E G  +FES
Sbjct: 689 SWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFES 748

Query: 570 MEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKSLLSA-CRVHG-N 629
           M   YG+ P  +H++CM D+LGRAG++++ E  + +M ++P+  IW+++L A CR +G  
Sbjct: 749 MSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRK 808

Query: 630 LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIE 689
            ELG++A + L +LEP N++ YVLL NM++  GRWED    R+ MK   + KE GYSW+ 
Sbjct: 809 AELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVT 868

Query: 690 MKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYH 749
           MK  VH F++ D+SHP A  IY K+ E+   +++AG+VP   FAL D+++E KE  L+YH
Sbjct: 869 MKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEILSYH 928

Query: 750 SEKLAVAFGLLTVAKGA-PIRIFKNLRVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIE 809
           SEKLAVAF L        PIRI KNLRVCGDCHSA KYIS I  R IILRD N FHHF +
Sbjct: 929 SEKLAVAFVLAAQRSSTLPIRIMKNLRVCGDCHSAFKYISKIEGRQIILRDSNRFHHFQD 988

Query: 810 GKCSCGDFW 811
           G CSC DFW
Sbjct: 989 GACSCSDFW 995

BLAST of CsaV3_4G031220 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 536.2 bits (1380), Expect = 3.5e-152
Identity = 263/686 (38.34%), Positives = 408/686 (59.48%), Query Frame = 0

Query: 125  MWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKC 184
            M  DG +P   TL S++ ACS    L  G+ +H Y  K+   +N  +   L+++Y+KC  
Sbjct: 380  MHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKC-A 439

Query: 185  LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILT 244
             +E    +F   + +N V W  ML  Y    +   + + F++M+ + +  N +T+PSIL 
Sbjct: 440  DIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILK 499

Query: 245  ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVC 304
             C  +     G Q+H  II + F  N YV S L+DMYAK G L +A  IL      DVV 
Sbjct: 500  TCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVS 559

Query: 305  WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTI 364
            W +MI G   + + ++AL  F +M +R IR D+    + + + A  + LK G+ +H+   
Sbjct: 560  WTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQAC 619

Query: 365  KTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQ 424
             +GF +     NALV +Y++ G +  +   F +    D I+W +LV+G+  +G +E+AL+
Sbjct: 620  VSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALR 679

Query: 425  LFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYA 484
            +F  M    +D + F       A +E   ++ G+QVHA   K+   S     N+LI+MYA
Sbjct: 680  VFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYA 739

Query: 485  KCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIG 544
            KCG + DA + F  + T+N +SW AII  Y+++G G ++L  ++QMI   ++P+ VT +G
Sbjct: 740  KCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVG 799

Query: 545  LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVE 604
            +L ACSH GLV+ G +YFESM   YG+ P  +HY C++D+L RAG ++ A+  +  M ++
Sbjct: 800  VLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIK 859

Query: 605  PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR 664
            PDA +W++LLSAC VH N+E+GE A  +L++LEP +S  YVLLSN+++V+ +W+     R
Sbjct: 860  PDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTR 919

Query: 665  RAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN 724
            + MK  G+ KEPG SWIE+K+ +H+F   D++HPLA EI+    ++     E G+V D  
Sbjct: 920  QKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCF 979

Query: 725  FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF 784
              L ++  E K+  +  HSEKLA++FGLL++    PI + KNLRVC DCH+ +K++S + 
Sbjct: 980  SLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVS 1039

Query: 785  KRHIILRDLNCFHHFIEGKCSCGDFW 811
             R II+RD   FHHF  G CSC D+W
Sbjct: 1040 NREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of CsaV3_4G031220 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 556.2 bits (1432), Expect = 5.9e-157
Identity = 274/678 (40.41%), Positives = 425/678 (62.68%), Query Frame = 0

Query: 134 QYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFF 193
           Q T   +L     +  L  G+ +HC A+K+ L+  + V+  L++MY K +    A  +F 
Sbjct: 315 QVTFILMLATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFD 374

Query: 194 SLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSI-SAY 253
           ++ +R + + W +++ G AQNG  ++A+  F ++   G++ + +T  S+L A +S+    
Sbjct: 375 NMSER-DLISWNSVIAGIAQNGLEVEAVCLFMQLLRCGLKPDQYTMTSVLKAASSLPEGL 434

Query: 254 AFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGC 313
           +  +QVH   I      + +V +AL+D Y++   +  A ++ +     D+V WN+M+ G 
Sbjct: 435 SLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGY 494

Query: 314 VTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACK 373
                  + L LF  MH +  R DDFT  +V K+      +  G+ VH+  IK+G+D   
Sbjct: 495 TQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDL 554

Query: 374 TVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQLFCDMRTA 433
            VS+ ++DMY K G++S A   F+ I   D ++WT++++G + NG  E+A  +F  MR  
Sbjct: 555 WVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLM 614

Query: 434 RVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDA 493
            V  D+F +A +  A + LT +E GRQ+HAN +K +  +      SL+ MYAKCG ++DA
Sbjct: 615 GVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDA 674

Query: 494 IRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLFACSHA 553
             +F  +E  N+ +W A++VG AQ+G GK++L  ++QM   GIKPD VTFIG+L ACSH+
Sbjct: 675 YCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHS 734

Query: 554 GLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKS 613
           GLV     +  SM   YGIKP  +HY+C+ D LGRAG + +AE+L+  M +E  A+++++
Sbjct: 735 GLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRT 794

Query: 614 LLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGI 673
           LL+ACRV G+ E G+R    L++LEP +S  YVLLSNM++ A +W++    R  MK   +
Sbjct: 795 LLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKV 854

Query: 674 NKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDE 733
            K+PG+SWIE+K+++H F+ +DRS+     IY K+ +M+  IK+ G+VP+ +F L D++E
Sbjct: 855 KKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEE 914

Query: 734 EAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIFKRHIILRD 793
           E KER+L YHSEKLAVAFGLL+     PIR+ KNLRVCGDCH+AMKYI+ ++ R I+LRD
Sbjct: 915 EEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRD 974

Query: 794 LNCFHHFIEGKCSCGDFW 811
            N FH F +G CSCGD+W
Sbjct: 975 ANRFHRFKDGICSCGDYW 990

BLAST of CsaV3_4G031220 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 542.7 bits (1397), Expect = 6.8e-153
Identity = 276/703 (39.26%), Positives = 425/703 (60.46%), Query Frame = 0

Query: 125 MWSDGQKPSQYTLGSVLRACSTLSL---LHTGKMIHCYAIKIQLEANIFVATGLVDMYSK 184
           M  +  +PS +TL SV+ ACS L +   L  GK +H Y ++ + E N F+   LV MY K
Sbjct: 190 MLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLR-KGELNSFIINTLVAMYGK 249

Query: 185 CKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPS 244
              L  ++ L  S   R + V W  +L+   QN + L+A++  +EM  +G+E + FT  S
Sbjct: 250 LGKLASSKVLLGSFGGR-DLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISS 309

Query: 245 ILTACTSISAYAFGRQVHGCIIWSG-FGPNVYVQSALVDMYAKCGDLASARMILDTMEID 304
           +L AC+ +     G+++H   + +G    N +V SALVDMY  C  + S R + D M   
Sbjct: 310 VLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDR 369

Query: 305 DVVCWNSMIVGCVTHGYMEEALVLFHKM-HNRDIRIDDFTYPSVLKSLASCKNLKIGESV 364
            +  WN+MI G   + + +EAL+LF  M  +  +  +  T   V+ +          E++
Sbjct: 370 KIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAI 429

Query: 365 HSLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFH 424
           H   +K G D  + V N L+DMY++ G +  A+ +F K+ D+D+++W +++TGYV +  H
Sbjct: 430 HGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHH 489

Query: 425 EKALQLFCDMR---------TARVDL--DQFVVACVFSACAELTVIEFGRQVHANFIKSS 484
           E AL L   M+          +RV L  +   +  +  +CA L+ +  G+++HA  IK++
Sbjct: 490 EDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNN 549

Query: 485 AGSLLSAENSLITMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYE 544
             + ++  ++L+ MYAKCGCL+ + +VFD +  +NVI+W  II+ Y  +G G++++    
Sbjct: 550 LATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLR 609

Query: 545 QMIIDGIKPDGVTFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRA 604
            M++ G+KP+ VTFI +  ACSH+G+V+ G   F  M+  YG++P+SDHYAC++DLLGRA
Sbjct: 610 MMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRA 669

Query: 605 GKINEAEHLLNRMDVE-PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLL 664
           G+I EA  L+N M  +   A  W SLL A R+H NLE+GE A +NLI+LEP+ +  YVLL
Sbjct: 670 GRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLL 729

Query: 665 SNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKI 724
           +N++S AG W+ A  +RR MK  G+ KEPG SWIE   +VH F++ D SHP + ++   +
Sbjct: 730 ANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYL 789

Query: 725 DEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNL 784
           + +   +++ G+VPD +  L +++E+ KE  L  HSEKLA+AFG+L  + G  IR+ KNL
Sbjct: 790 ETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNL 849

Query: 785 RVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIEGKCSCGDFW 811
           RVC DCH A K+IS I  R IILRD+  FH F  G CSCGD+W
Sbjct: 850 RVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 890

BLAST of CsaV3_4G031220 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 540.4 bits (1391), Expect = 3.4e-152
Identity = 278/669 (41.55%), Positives = 414/669 (61.88%), Query Frame = 0

Query: 150 LHTGKMIHCYAIKIQL-EANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQWTAML 209
           L  G+ +H + I   L +  + +  GLV+MY+KC  + +A  +F+ + D K+ V W +M+
Sbjct: 329 LKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTD-KDSVSWNSMI 388

Query: 210 TGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCIIWSGFG 269
           TG  QNG  ++A++ +K MR   +    FT  S L++C S+     G+Q+HG  +  G  
Sbjct: 389 TGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGID 448

Query: 270 PNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHG--YMEEALVLFH 329
            NV V +AL+ +YA+ G L   R I  +M   D V WNS I+G +      + EA+V F 
Sbjct: 449 LNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNS-IIGALARSERSLPEAVVCFL 508

Query: 330 KMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACKTVSNALVDMYAKQG 389
                  +++  T+ SVL +++S    ++G+ +H L +K       T  NAL+  Y K G
Sbjct: 509 NAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCG 568

Query: 390 NLSCALDVFNKILD-KDVISWTSLVTGYVHNGFHEKALQLFCDMRTARVDLDQFVVACVF 449
            +     +F+++ + +D ++W S+++GY+HN    KAL L   M      LD F+ A V 
Sbjct: 569 EMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVL 628

Query: 450 SACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDAIRVFDSMETRNVI 509
           SA A +  +E G +VHA  +++   S +   ++L+ MY+KCG L+ A+R F++M  RN  
Sbjct: 629 SAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSY 688

Query: 510 SWTAIIVGYAQNGRGKDSLHFYEQMIIDG-IKPDGVTFIGLLFACSHAGLVETGQSYFES 569
           SW ++I GYA++G+G+++L  +E M +DG   PD VTF+G+L ACSHAGL+E G  +FES
Sbjct: 689 SWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFES 748

Query: 570 MEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKSLLSA-CRVHG-N 629
           M   YG+ P  +H++CM D+LGRAG++++ E  + +M ++P+  IW+++L A CR +G  
Sbjct: 749 MSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRK 808

Query: 630 LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIE 689
            ELG++A + L +LEP N++ YVLL NM++  GRWED    R+ MK   + KE GYSW+ 
Sbjct: 809 AELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVT 868

Query: 690 MKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYH 749
           MK  VH F++ D+SHP A  IY K+ E+   +++AG+VP   FAL D+++E KE  L+YH
Sbjct: 869 MKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEEILSYH 928

Query: 750 SEKLAVAFGLLTVAKGA-PIRIFKNLRVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIE 809
           SEKLAVAF L        PIRI KNLRVCGDCHSA KYIS I  R IILRD N FHHF +
Sbjct: 929 SEKLAVAFVLAAQRSSTLPIRIMKNLRVCGDCHSAFKYISKIEGRQIILRDSNRFHHFQD 988

Query: 810 GKCSCGDFW 811
           G CSC DFW
Sbjct: 989 GACSCSDFW 995

BLAST of CsaV3_4G031220 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 536.2 bits (1380), Expect = 6.4e-151
Identity = 263/686 (38.34%), Positives = 408/686 (59.48%), Query Frame = 0

Query: 125  MWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKC 184
            M  DG +P   TL S++ ACS    L  G+ +H Y  K+   +N  +   L+++Y+KC  
Sbjct: 380  MHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKC-A 439

Query: 185  LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILT 244
             +E    +F   + +N V W  ML  Y    +   + + F++M+ + +  N +T+PSIL 
Sbjct: 440  DIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILK 499

Query: 245  ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVC 304
             C  +     G Q+H  II + F  N YV S L+DMYAK G L +A  IL      DVV 
Sbjct: 500  TCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVS 559

Query: 305  WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTI 364
            W +MI G   + + ++AL  F +M +R IR D+    + + + A  + LK G+ +H+   
Sbjct: 560  WTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQAC 619

Query: 365  KTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQ 424
             +GF +     NALV +Y++ G +  +   F +    D I+W +LV+G+  +G +E+AL+
Sbjct: 620  VSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALR 679

Query: 425  LFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYA 484
            +F  M    +D + F       A +E   ++ G+QVHA   K+   S     N+LI+MYA
Sbjct: 680  VFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYA 739

Query: 485  KCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIG 544
            KCG + DA + F  + T+N +SW AII  Y+++G G ++L  ++QMI   ++P+ VT +G
Sbjct: 740  KCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVG 799

Query: 545  LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVE 604
            +L ACSH GLV+ G +YFESM   YG+ P  +HY C++D+L RAG ++ A+  +  M ++
Sbjct: 800  VLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIK 859

Query: 605  PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR 664
            PDA +W++LLSAC VH N+E+GE A  +L++LEP +S  YVLLSN+++V+ +W+     R
Sbjct: 860  PDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTR 919

Query: 665  RAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN 724
            + MK  G+ KEPG SWIE+K+ +H+F   D++HPLA EI+    ++     E G+V D  
Sbjct: 920  QKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCF 979

Query: 725  FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF 784
              L ++  E K+  +  HSEKLA++FGLL++    PI + KNLRVC DCH+ +K++S + 
Sbjct: 980  SLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVS 1039

Query: 785  KRHIILRDLNCFHHFIEGKCSCGDFW 811
             R II+RD   FHHF  G CSC D+W
Sbjct: 1040 NREIIVRDAYRFHHFEGGACSCKDYW 1064

BLAST of CsaV3_4G031220 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 535.4 bits (1378), Expect = 1.1e-150
Identity = 260/683 (38.07%), Positives = 410/683 (60.03%), Query Frame = 0

Query: 128 DGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLE 187
           D  +P  Y    +L+ C   + L  GK IH   +K     ++F  TGL +MY+KC+ + E
Sbjct: 129 DDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNE 188

Query: 188 AEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACT 247
           A  +F  +P+R + V W  ++ GY+QNG +  A++  K M  + ++ +  T  S+L A +
Sbjct: 189 ARKVFDRMPER-DLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVS 248

Query: 248 SISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNS 307
           ++   + G+++HG  + SGF   V + +ALVDMYAKCG L +AR + D M   +VV WNS
Sbjct: 249 ALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNS 308

Query: 308 MIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTG 367
           MI   V +   +EA+++F KM +  ++  D +    L + A   +L+ G  +H L+++ G
Sbjct: 309 MIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELG 368

Query: 368 FDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQLFC 427
            D   +V N+L+ MY K   +  A  +F K+  + ++SW +++ G+  NG    AL  F 
Sbjct: 369 LDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFS 428

Query: 428 DMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCG 487
            MR+  V  D F    V +A AEL++    + +H   ++S     +    +L+ MYAKCG
Sbjct: 429 QMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCG 488

Query: 488 CLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLF 547
            +  A  +FD M  R+V +W A+I GY  +G GK +L  +E+M    IKP+GVTF+ ++ 
Sbjct: 489 AIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVIS 548

Query: 548 ACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDA 607
           ACSH+GLVE G   F  M++ Y I+ + DHY  M+DLLGRAG++NEA   + +M V+P  
Sbjct: 549 ACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAV 608

Query: 608 TIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAM 667
            ++ ++L AC++H N+   E+A + L +L P +   +VLL+N++  A  WE    +R +M
Sbjct: 609 NVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSM 668

Query: 668 KTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFAL 727
              G+ K PG S +E+K++VH+F S   +HP + +IY+ +++++  IKEAG+VPD N  L
Sbjct: 669 LRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL 728

Query: 728 RDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIFKRH 787
             ++ + KE+ L+ HSEKLA++FGLL    G  I + KNLRVC DCH+A KYIS +  R 
Sbjct: 729 -GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGRE 788

Query: 788 IILRDLNCFHHFIEGKCSCGDFW 811
           I++RD+  FHHF  G CSCGD+W
Sbjct: 789 IVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CsaV3_4G031220 vs. TrEMBL
Match: tr|A0A1S3B568|A0A1S3B568_CUCME (putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486132 PE=4 SV=1)

HSP 1 Score: 1438.7 bits (3723), Expect = 0.0e+00
Identity = 769/810 (94.94%), Positives = 783/810 (96.67%), Query Frame = 0

Query: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60
           MFNIYF+TSN   KCNFHFK  LFIRCIH IAHYSSN+ SNQLLSELSKNGRVDEARKLF
Sbjct: 1   MFNIYFRTSN---KCNFHFKLTLFIRCIHDIAHYSSNVVSNQLLSELSKNGRVDEARKLF 60

Query: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
                    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180
           XXXXMWSDGQKPSQYTLGSVLRACSTLSLLH+GKMIHCYAIKIQLE NIFVATGLVDMYS
Sbjct: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHSGKMIHCYAIKIQLEENIFVATGLVDMYS 180

Query: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240
           KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMR QGMESNHFTFP
Sbjct: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRIQGMESNHFTFP 240

Query: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300
           SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASAR+IL+TMEID
Sbjct: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARVILNTMEID 300

Query: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360
           DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPS LKSLAS KNLKIG+SVH
Sbjct: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSALKSLASSKNLKIGQSVH 360

Query: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420
           SL IKTGFDACKTVSNALVDMYAKQGNLSCALDVFN+ILDKDVISWTSLVTGYVHNGFHE
Sbjct: 361 SLIIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNRILDKDVISWTSLVTGYVHNGFHE 420

Query: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480
           KAL+LFCDMR ARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSS GSLLSAENSLI
Sbjct: 421 KALKLFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLI 480

Query: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540
           TMYAKCGCLEDAIRVFDSME RNVISWTAIIVGYAQNGRGKDSLHFY+QMI++GIKPD V
Sbjct: 481 TMYAKCGCLEDAIRVFDSMEIRNVISWTAIIVGYAQNGRGKDSLHFYDQMIMNGIKPDDV 540

Query: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600
           TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGK+NEAEHLLNR
Sbjct: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEHLLNR 600

Query: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660
           MDVEPDATIWKSLLSACRVHGNLELGERAG+NLIKLEPSNSLPYVLLSNMFSVAGRWEDA
Sbjct: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGRNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660

Query: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720
           AHIR AMKTMGINKEPGYSWIE+KSQVH FISEDRSHPLAAEIYSKIDEMMILIKEAGHV
Sbjct: 661 AHIRIAMKTMGINKEPGYSWIEVKSQVHRFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720

Query: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780
           PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI
Sbjct: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYI 780

Query: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 811
           SSIFKRHIILRDLNCFHHFIEGKCSCGDFW
Sbjct: 781 SSIFKRHIILRDLNCFHHFIEGKCSCGDFW 807

BLAST of CsaV3_4G031220 vs. TrEMBL
Match: tr|A0A0A0L1C4|A0A0A0L1C4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G554180 PE=4 SV=1)

HSP 1 Score: 1436.4 bits (3717), Expect = 0.0e+00
Identity = 768/768 (100.00%), Positives = 768/768 (100.00%), Query Frame = 0

Query: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60
           MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF
Sbjct: 1   MFNIYFQTSNFFTKCNFHFKHPLFIRCIHGIAHYSSNLDSNQLLSELSKNGRVDEARKLF 60

Query: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
           DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 61  DQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180
           XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS
Sbjct: 121 XXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYS 180

Query: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240
           KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP
Sbjct: 181 KCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFP 240

Query: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300
           SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID
Sbjct: 241 SILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEID 300

Query: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360
           DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH
Sbjct: 301 DVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVH 360

Query: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420
           SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE
Sbjct: 361 SLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHE 420

Query: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480
           KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI
Sbjct: 421 KALQLFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLI 480

Query: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540
           TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV
Sbjct: 481 TMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGV 540

Query: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600
           TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR
Sbjct: 541 TFIGLLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNR 600

Query: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660
           MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA
Sbjct: 601 MDVEPDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDA 660

Query: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720
           AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV
Sbjct: 661 AHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHV 720

Query: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLR 769
           PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLR
Sbjct: 721 PDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLR 768

BLAST of CsaV3_4G031220 vs. TrEMBL
Match: tr|A0A1S4DU93|A0A1S4DU93_CUCME (putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial isoform X2 OS=Cucumis melo OX=3656 GN=LOC103486132 PE=4 SV=1)

HSP 1 Score: 1368.2 bits (3540), Expect = 0.0e+00
Identity = 663/686 (96.65%), Positives = 675/686 (98.40%), Query Frame = 0

Query: 125 MWSDGQKPSQYTLGSVLRACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKC 184
           MWSDGQKPSQYTLGSVLRACSTLSLLH+GKMIHCYAIKIQLE NIFVATGLVDMYSKCKC
Sbjct: 63  MWSDGQKPSQYTLGSVLRACSTLSLLHSGKMIHCYAIKIQLEENIFVATGLVDMYSKCKC 122

Query: 185 LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILT 244
           LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMR QGMESNHFTFPSILT
Sbjct: 123 LLEAEYLFFSLPDRKNYVQWTAMLTGYAQNGESLKAIQCFKEMRIQGMESNHFTFPSILT 182

Query: 245 ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVC 304
           ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASAR+IL+TMEIDDVVC
Sbjct: 183 ACTSISAYAFGRQVHGCIIWSGFGPNVYVQSALVDMYAKCGDLASARVILNTMEIDDVVC 242

Query: 305 WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTI 364
           WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPS LKSLAS KNLKIG+SVHSL I
Sbjct: 243 WNSMIVGCVTHGYMEEALVLFHKMHNRDIRIDDFTYPSALKSLASSKNLKIGQSVHSLII 302

Query: 365 KTGFDACKTVSNALVDMYAKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQ 424
           KTGFDACKTVSNALVDMYAKQGNLSCALDVFN+ILDKDVISWTSLVTGYVHNGFHEKAL+
Sbjct: 303 KTGFDACKTVSNALVDMYAKQGNLSCALDVFNRILDKDVISWTSLVTGYVHNGFHEKALK 362

Query: 425 LFCDMRTARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYA 484
           LFCDMR ARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSS GSLLSAENSLITMYA
Sbjct: 363 LFCDMRIARVDLDQFVVACVFSACAELTVIEFGRQVHANFIKSSVGSLLSAENSLITMYA 422

Query: 485 KCGCLEDAIRVFDSMETRNVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIG 544
           KCGCLEDAIRVFDSME RNVISWTAIIVGYAQNGRGKDSLHFY+QMI++GIKPD VTFIG
Sbjct: 423 KCGCLEDAIRVFDSMEIRNVISWTAIIVGYAQNGRGKDSLHFYDQMIMNGIKPDDVTFIG 482

Query: 545 LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVE 604
           LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGK+NEAEHLLNRMDVE
Sbjct: 483 LLFACSHAGLVETGQSYFESMEKVYGIKPASDHYACMIDLLGRAGKLNEAEHLLNRMDVE 542

Query: 605 PDATIWKSLLSACRVHGNLELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR 664
           PDATIWKSLLSACRVHGNLELGERAG+NLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR
Sbjct: 543 PDATIWKSLLSACRVHGNLELGERAGRNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIR 602

Query: 665 RAMKTMGINKEPGYSWIEMKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN 724
            AMKTMGINKEPGYSWIE+KSQVH FISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN
Sbjct: 603 IAMKTMGINKEPGYSWIEVKSQVHRFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMN 662

Query: 725 FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF 784
           FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF
Sbjct: 663 FALRDMDEEAKERSLAYHSEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIF 722

Query: 785 KRHIILRDLNCFHHFIEGKCSCGDFW 811
           KRHIILRDLNCFHHFIEGKCSCGDFW
Sbjct: 723 KRHIILRDLNCFHHFIEGKCSCGDFW 748

BLAST of CsaV3_4G031220 vs. TrEMBL
Match: tr|A0A2I4DT49|A0A2I4DT49_9ROSI (pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Juglans regia OX=51240 GN=LOC108983215 PE=4 SV=1)

HSP 1 Score: 1050.0 bits (2714), Expect = 2.6e-303
Identity = 558/788 (70.81%), Positives = 666/788 (84.52%), Query Frame = 0

Query: 26  RCIHGIAHY---SSNLDSNQLLSELSKNGRVDEARKLFDQMPYRDKYXXXXXXXXXXXXX 85
           R +H I +    S+ L SN+LL++LSK+GR+DEARK+FD M  RD++ XXXXXXXXXXXX
Sbjct: 27  RHVHSIVNSKLDSNRLLSNRLLNDLSKSGRIDEARKMFDNMFNRDEFSXXXXXXXXXXXX 86

Query: 86  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMWSDGQKPSQYTLGSVLR 145
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +GQK SQYTLGS LR
Sbjct: 87  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLYEGQKLSQYTLGSALR 146

Query: 146 ACSTLSLLHTGKMIHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYV 205
            CS L LL  G++IH Y I+I  ++N+FV T LVDMY+KCKC+LEAEYLF +LP R+N+V
Sbjct: 147 GCSVLGLLQGGEIIHGYLIRIGFDSNVFVVTALVDMYAKCKCILEAEYLFDTLPGRRNHV 206

Query: 206 QWTAMLTGYAQNGESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCI 265
            WTAM++GY+QNG+  KAI+CF+ M+ +G+ESN FTFPSILTAC S+SA  FG QVHGCI
Sbjct: 207 LWTAMVSGYSQNGDEFKAIECFRGMQAEGVESNQFTFPSILTACASVSAGDFGAQVHGCI 266

Query: 266 IWSGFGPNVYVQSALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEAL 325
           + SGFG NV+VQSALV+MYAKCG+L SAR  L+ ME DDVV WNSMIVGCV HG+ +EAL
Sbjct: 267 VRSGFGANVFVQSALVNMYAKCGNLNSARRALENMEFDDVVSWNSMIVGCVRHGFEQEAL 326

Query: 326 VLFHKMHNRDIRIDDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACKTVSNALVDMY 385
            LF KMH RD++IDDFTYPSVL S  S  ++K  +SVH + IKTGF+A K VSNALVDMY
Sbjct: 327 SLFKKMHARDMKIDDFTYPSVLNSFTSTMDMKNAKSVHCMIIKTGFEAYKLVSNALVDMY 386

Query: 386 AKQGNLSCALDVFNKILDKDVISWTSLVTGYVHNGFHEKALQLFCDMRTARVDLDQFVVA 445
           AKQG L CA +VF++++D+DVISWTSLVTGY HNG HE+A++LFCDMRT  +  D+FV+A
Sbjct: 387 AKQGYLECAFEVFSRMVDRDVISWTSLVTGYAHNGSHEEAIKLFCDMRTTGICPDEFVIA 446

Query: 446 CVFSACAELTVIEFGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDAIRVFDSMETR 505
            + SACAELT+++FG+QVHANFIK    S LS +NSL+TMYAKCGC+EDA  VF+SM+ +
Sbjct: 447 SILSACAELTLLKFGQQVHANFIKFGLVSSLSIDNSLVTMYAKCGCIEDANGVFNSMQVQ 506

Query: 506 NVISWTAIIVGYAQNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLFACSHAGLVETGQSYF 565
           +V++WTA+IVGYAQNGRGKDS+ FY +MI  G KPD +TFIGLLFACSHAGLV+ G+ +F
Sbjct: 507 DVVTWTALIVGYAQNGRGKDSIQFYNRMIASGTKPDFITFIGLLFACSHAGLVDDGRWFF 566

Query: 566 ESMEKVYGIKPASDHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKSLLSACRVHGN 625
           ESM +V+GIKP ++HYACMIDLLGR+GK+NEA+ LLN MDV+PDAT+WK+LL+ACRVH N
Sbjct: 567 ESMNQVFGIKPGAEHYACMIDLLGRSGKLNEAKELLNEMDVKPDATVWKALLAACRVHRN 626

Query: 626 LELGERAGKNLIKLEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIE 685
           LELGE+A KNL +LEPSN++PYVLLSNM+  AGRWEDAA IRR MK+MGI+KEPG SWIE
Sbjct: 627 LELGEKAAKNLFELEPSNAVPYVLLSNMYFSAGRWEDAARIRRLMKSMGISKEPGCSWIE 686

Query: 686 MKSQVHTFISEDRSHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYH 745
           + S VH F+SEDR HP  AEIYSKIDE+MILIKEAG+VPDMNFAL DMDEE KE  LAYH
Sbjct: 687 LNSHVHRFMSEDRGHPRTAEIYSKIDEIMILIKEAGYVPDMNFALHDMDEEGKELGLAYH 746

Query: 746 SEKLAVAFGLLTVAKGAPIRIFKNLRVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIEG 805
           SEKLA+AFG+LTV  GAPIRIFKNLRVCGDCH+AMKYIS +F RHIILRD NCFHHF +G
Sbjct: 747 SEKLAIAFGILTVPPGAPIRIFKNLRVCGDCHTAMKYISRVFLRHIILRDSNCFHHFRDG 806

Query: 806 KCSCGDFW 811
            CSCGD+W
Sbjct: 807 NCSCGDYW 814

BLAST of CsaV3_4G031220 vs. TrEMBL
Match: tr|A0A2P5CNS6|A0A2P5CNS6_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_278530 PE=4 SV=1)

HSP 1 Score: 1044.3 bits (2699), Expect = 1.5e-301
Identity = 561/775 (72.39%), Positives = 653/775 (84.26%), Query Frame = 0

Query: 36  SNLDSNQLLSELSKNGRVDEARKLFDQMPYRDKYXXXXXXXXXXXXXXXXXXXXXXXXXX 95
           S  +SN+LL+ELSK+GRVDEAR+LFD+M  RD  XXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 39  SRFESNRLLNELSKSGRVDEARQLFDKMLVRDXXXXXXXXXXXXXXXXXXXXXXXXXXXX 98

Query: 96  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXMWSDGQKPSQYTLGSVLRACSTLSLLHTGKM 155
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXX    GQ PSQ+TLGSVLR CS L LL  G+ 
Sbjct: 99  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQMPSQFTLGSVLRLCSMLGLLQRGEQ 158

Query: 156 IHCYAIKIQLEANIFVATGLVDMYSKCKCLLEAEYLFFSLPDRKNYVQWTAMLTGYAQNG 215
           IH Y IK   +++ FV TGLVDMY+KCK +LEAEYLF   PD KN V WTAM+TGY+QNG
Sbjct: 159 IHGYTIKTGFDSSDFVLTGLVDMYAKCKHILEAEYLFGMSPDSKNNVMWTAMVTGYSQNG 218

Query: 216 ESLKAIQCFKEMRNQGMESNHFTFPSILTACTSISAYAFGRQVHGCIIWSGFGPNVYVQS 275
           E  KA++CF+ MR++G+ESN FTFP ILTAC ++SA+ FG QVHGCI+ SGFG NV+VQS
Sbjct: 219 ECFKAMKCFRAMRSEGVESNQFTFPGILTACAAVSAHGFGAQVHGCIVRSGFGANVFVQS 278

Query: 276 ALVDMYAKCGDLASARMILDTMEIDDVVCWNSMIVGCVTHGYMEEALVLFHKMHNRDIRI 335
           ALVDMYAKCGDLASA+  L+ ME DDVV WNSMIVGCV  G++E+AL+LF KMH RD++ 
Sbjct: 279 ALVDMYAKCGDLASAKTALENMEADDVVSWNSMIVGCVRQGHLEDALILFEKMHARDMKS 338

Query: 336 DDFTYPSVLKSLASCKNLKIGESVHSLTIKTGFDACKTVSNALVDMYAKQGNLSCALDVF 395
           D FTYPSVL S A  K ++  ++VH L IK+GF+A   V+NALVDMYAKQGNL  A  +F
Sbjct: 339 DSFTYPSVLNSFAVLKEIENAKAVHCLIIKSGFEAYVLVANALVDMYAKQGNLDWAFRMF 398

Query: 396 NKILDKDVISWTSLVTGYVHNGFHEKALQLFCDMRTARVDLDQFVVACVFSACAELTVIE 455
           + I DKDVISWTSLVTGY HNG H KAL LFCDMR A +DLDQFVVA V SACAELTV+E
Sbjct: 399 DLIQDKDVISWTSLVTGYAHNGSHGKALGLFCDMRIAGIDLDQFVVASVLSACAELTVLE 458

Query: 456 FGRQVHANFIKSSAGSLLSAENSLITMYAKCGCLEDAIRVFDSMETRNVISWTAIIVGYA 515
           FG+Q+HAN  K    S LS +NSL+TMYAKCGC+E+A RVFD+M  RNVISWTA+IVGYA
Sbjct: 459 FGQQIHANCTKFGLQSSLSVDNSLVTMYAKCGCIEEANRVFDAMRVRNVISWTALIVGYA 518

Query: 516 QNGRGKDSLHFYEQMIIDGIKPDGVTFIGLLFACSHAGLVETGQSYFESMEKVYGIKPAS 575
           QNGRG+DSL FY++MI  G  PD +TFIGLLFACSHAGLVE G++YF+SM+KV+GIKP  
Sbjct: 519 QNGRGRDSLKFYDKMIATGTNPDFITFIGLLFACSHAGLVENGRTYFKSMDKVFGIKPGP 578

Query: 576 DHYACMIDLLGRAGKINEAEHLLNRMDVEPDATIWKSLLSACRVHGNLELGERAGKNLIK 635
           +HYACMIDLLGR+GK+ EAE L+N+M +EPDAT+WK+LL+ACRVHGN+ELGE+A KNL++
Sbjct: 579 EHYACMIDLLGRSGKLKEAEGLVNQMTMEPDATVWKALLAACRVHGNVELGEKAAKNLLE 638

Query: 636 LEPSNSLPYVLLSNMFSVAGRWEDAAHIRRAMKTMGINKEPGYSWIEMKSQVHTFISEDR 695
           LEP N++PYVLLSNM+S AGRWEDAA +RR MK+MGI+KEPG SWIE+ SQV+ F+SEDR
Sbjct: 639 LEPFNAVPYVLLSNMYSAAGRWEDAARVRRLMKSMGISKEPGCSWIEINSQVNRFMSEDR 698

Query: 696 SHPLAAEIYSKIDEMMILIKEAGHVPDMNFALRDMDEEAKERSLAYHSEKLAVAFGLLTV 755
            HP  AEIYSK+DE+MILIKEAG+VPDMNFAL+DMDEE KE  LAYHSEKLA+AFGLLTV
Sbjct: 699 GHPRTAEIYSKLDEIMILIKEAGYVPDMNFALQDMDEEGKEIGLAYHSEKLAIAFGLLTV 758

Query: 756 AKGAPIRIFKNLRVCGDCHSAMKYISSIFKRHIILRDLNCFHHFIEGKCSCGDFW 811
             GAPIRIFKNLRVCGDCH+AMKYIS +F RHIILRD NCFHHF EG CSCGD+W
Sbjct: 759 PPGAPIRIFKNLRVCGDCHTAMKYISRVFLRHIILRDPNCFHHFKEGNCSCGDYW 813
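
The Identity and Positives percentages in the HSP summary lines above are the matched-residue count divided by the alignment length, rounded to two decimals. A minimal Python sketch for re-checking them is shown here; the helper name `check_hsp_line` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern also tolerates the field name being spelled without its first "i".

```python
import re

# Hypothetical pattern for fields shaped like "Identity = 769/810 (94.94%)".
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_line(line):
    """Map each field name to (matches, length, reported_pct, recomputed_pct)."""
    out = {}
    for name, num, den, pct in HSP_FIELD.findall(line):
        num, den = int(num), int(den)
        # Recompute the percentage from the raw counts.
        out[name] = (num, den, float(pct), round(100.0 * num / den, 2))
    return out


# Summary line of the Cucumis melo isoform X1 hit (A0A1S3B568).
line = "Identity = 769/810 (94.94%), Positives = 783/810 (96.67%), Query Frame = 0"
result = check_hsp_line(line)
for name, (num, den, reported, recomputed) in result.items():
    print(f"{name}: {num}/{den} reported={reported} recomputed={recomputed}")
```

A reported value that differs from the recomputed one would point to a transcription problem in the summary line rather than in the alignment itself.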

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_011653924.1	0.0e+00	100.00	PREDICTED: pentatricopeptide repeat-containing protein At2g03880, mitochondrial ...
XP_008442211.1	0.0e+00	94.94	PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitoc...
KGN54859.1	0.0e+00	100.00	hypothetical protein Csa_4G554180 [Cucumis sativus]
XP_016899538.1	0.0e+00	96.65	PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitoc...
XP_022967715.1	0.0e+00	88.89	LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g03880, mito...

Match Name	E-value	Identity (%)	Description
AT3G61170.1	1.9e-214	57.52	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33170.1	3.3e-158	40.41	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1	3.8e-154	39.26	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G09950.1	1.9e-153	41.55	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1	3.5e-152	38.34	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity (%)	Description
sp|Q9SMZ2|PP347_ARATH	5.9e-157	40.41	Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...
sp|Q7Y211|PP285_ARATH	6.8e-153	39.26	Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
sp|Q9FIB2|PP373_ARATH	3.4e-152	41.55	Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
sp|Q9SVP7|PP307_ARATH	6.4e-151	38.34	Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
sp|Q3E6Q1|PPR32_ARATH	1.1e-150	38.07	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name	E-value	Identity (%)	Description
tr|A0A1S3B568|A0A1S3B568_CUCME	0.0e+00	94.94	putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial is...
tr|A0A0A0L1C4|A0A0A0L1C4_CUCSA	0.0e+00	100.00	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G554180 PE=4 SV=1
tr|A0A1S4DU93|A0A1S4DU93_CUCME	0.0e+00	96.65	putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial is...
tr|A0A2I4DT49|A0A2I4DT49_9ROSI	2.6e-303	70.81	pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Juglans ...
tr|A0A2P5CNS6|A0A2P5CNS6_9ROSA	1.5e-301	72.39	DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_278530 ...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR032867	DYW_dom
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsaV3_4G031220.1	CsaV3_4G031220.1	mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 69..93; e-value: 1.4E-5; score: 23.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 404..436; e-value: 8.8E-6; score: 23.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 578..602; e-value: 5.1E-4; score: 18.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 41..63; e-value: 0.0013; score: 16.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 505..538; e-value: 1.1E-5; score: 23.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 303..336; e-value: 3.4E-7; score: 28.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 100..133; e-value: 1.6E-4; score: 19.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 204..235; e-value: 4.3E-6; score: 24.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 477..504; e-value: 8.6E-5; score: 20.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 41..64; e-value: 0.0021; score: 18.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 376..402; e-value: 0.013; score: 15.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 69..93; e-value: 3.5E-6; score: 26.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 404..431; e-value: 1.4E-6; score: 28.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 477..504; e-value: 2.5E-5; score: 24.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 100..129; e-value: 7.4E-5; score: 22.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13812	PPR_3	coord: 528..586; e-value: 1.7E-4; score: 21.5
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 199..246; e-value: 6.5E-9; score: 35.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 301..348; e-value: 2.3E-12; score: 46.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 301..335; score: 10.83
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 235..269; score: 5.47
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 371..401; score: 6.862
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 402..436; score: 10.337
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 168..198; score: 5.371
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 67..97; score: 9.372
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 640..674; score: 7.826
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 503..537; score: 11.213
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 574..604; score: 7.509
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 36..66; score: 7.903
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 98..132; score: 11.126
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 606..636; score: 5.294
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 200..234; score: 10.337
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 336..370; score: 6.862
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 472..502; score: 7.859
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 270..300; score: 6.96
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 538..573; score: 6.61
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 475..708; e-value: 2.9E-41; score: 143.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 253..358; e-value: 3.5E-21; score: 77.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 359..474; e-value: 8.9E-17; score: 63.0
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 151..252; e-value: 2.0E-15; score: 58.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 29..150; e-value: 1.0E-28; score: 101.9
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 43..405
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 589..662
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 381..426
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 676..799; e-value: 1.4E-38; score: 131.5
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 193..302
None	No IPR available	PANTHER	PTHR24015:SF1029	PENTATRICOPEPTIDE PPR REPEAT-CONTAINING PROTEIN	coord: 31..192
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 31..192
None	No IPR available	PANTHER	PTHR24015:SF1029	PENTATRICOPEPTIDE PPR REPEAT-CONTAINING PROTEIN	coord: 193..302
None	No IPR available	PANTHER	PTHR24015:SF1029	PENTATRICOPEPTIDE PPR REPEAT-CONTAINING PROTEIN	coord: 298..729
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 298..729