Csa2G004730 (gene) Cucumber (Chinese Long) v2

Name: Csa2G004730
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: DNA mismatch repair protein mutS; contains IPR000432 (DNA mismatch repair protein MutS, C-terminal), IPR007695 (DNA mismatch repair protein MutS-like, N-terminal), IPR007696 (DNA mismatch repair protein MutS, core), IPR007860 (DNA mismatch repair protein MutS, connector domain), IPR015536 (DNA mismatch repair protein MutS-homologue MSH6), IPR027417 (P-loop containing nucleoside triphosphate hydrolase)
Location: Chr2 : 774237 .. 782062 (+)
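
The location field above can be read programmatically. The following is a minimal Python sketch, not part of the original record: the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr2 : 774237 .. 782062 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    pattern = r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("Chr2 : 774237 .. 782062 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 7826 bp on the plus strand of Chr2.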
The following sequences are available for this feature:

Gene sequence (with intron)

TTCAAAAGGGCGCAAAAGTTTTGGCGCCAATTTCAACATCTGCAATTTTTCTCTATACAGCCATCTCCAGTTCTAGCGAAGCTTCCGCAATTCATCGCGAAAAGATGCAGCGCCAGAAATCTTTGTTATCCTTCTTCCAAAAATCTCCGTCCGATAATCGGAGCTCCGATGGCTGTGCCTCCTCCGTCGGCCAGCGGCTCACTCGCTTTCAAACGAAACCAAGCGCAGCCGGTTTGGAGCAGCCGGCTATCCAGACCACTGCGGATTCCTCCCTGGAGATTAGAGGAACCGACACTCCGCCGGAGAAGGTGCCTCGCCAGATTTTGCCGGTGATTGAGAAGAACAGAGGTTCTTCTCTCTTTTCAAGCATCATGCATAAATTTGTGCGAGTCGATGATAAACGTAAGGCGAACGAGAGGTAATGTCGATACTAACTAAACCAGTAGGTTTTTCGATTCTTAGTCGATGATAAACTGCATTTTGCTCTCTCCATTCGACTATGTTTACTGTTCACGAACTGCTCTTGCATCGACGTTTCTTTTTCTGTCAACGCTTTTATATTCGTGGCTTGTTCTATGGTAGAGTTGCTTTTTGATCTTGATGTATGAATCACGCAGGGACGAAGTTCAAAAAGATTCATCTCAGAATGAGGTTGGAAAAGATTCTCCTCAGTTACCTTCCATTTCTGGTAAGGTAAATGATCCGACAGAGTTTTCGAAACTAGATGTAGCTTCTAGACGTCACGGTAAATTCGACGTTGCAAATTTAAACGGACATAGAGGACCTGTATTGAATATTGAAAGCAATGAGGACATTGCTGGACCAGAAACACCTGGCATGCGACCTTCTGTCTCTCGTTTGAAGAGATCTCAAGAGGTTTCTCTTGTGAATTGTAGCGGTGATTCTTTGCAAGATAGTACAAAAAGAATCAAACTCCTTCAGGACTCGATTAACTTGAACAAAATTCACAATGAAATTTCCGATGCAACTAGCAAATTTGAGTGGCTCAATCCCTCTCAAGTTCGAGATGCCAATCGTAGAAGGCCCGACCATCCTCTTTATGATAAAAAAACACTATACATACCACCCGACGTGCTGAAGAAAATGTCAGCATCGCAAAAACAATACTGGAATGTGAAATGTCAATATATGGACATCTTGCTCTTCTTCAAAGTGGTTAGTTTATATTCTTCTTTTTCAACAGCACCATTGCATAAACTTAGATCCTGCGTTCAATATATGTCTTTATATGCTATGAATACTGTTGATTTTTGTGCTGAAGTAGGCATCTCCACGACCACAATCTGAAACTATATGAAATATTAGGATCGTGTTTATGAAGGTTACATTGGAACTGTAATAGAAATGCGTTGTATTGTTGTGGATTTTGTTTTGAATTGTCCAAGTGAGGTCAAGTTTAAGTCAAATTTGGATCCAATATAGAAGTAGACAGGGAAAGAATTTAGGAATCGTAATTGCAGTCTCTACTCCAGATTGCATTTAAATTCAGAAACAATTTGTGCTGAGATTCACATTTTCTGATCCATTCATGTAAACACATGCACTAATAAATTAATAAGGTATTGAACTTATTATTTGCTAGGCTTGATTTTTCTCTATCTCAGTGAGAGATTTTCCTCTTCTACACAGTTTATACTACTGTAGATTTTGCTTTGGATGCTTTCACTGTGTAATTGGTACCTTACCTCGTTTAATTAACATAAGACTATGAAATTTCTTTTTTACATCACAGGGGAAGTTCTATGAGCTATATGAACAAGATGCTGAAATTGGTCACAAGGAGCTTGACTGGAAAATGACATTAAGCGGCGTTGGGAAATGTAGGCAGGTTAAATAAATTTCATCAATTTGTTATCTCTAGTCTATATATTTCTTCATCTTCGAGTTATCCGTTTATTCATGTTATTCCCTATTATACTTAGGTTGGTGTACCTGAGAGTGGAATTGATGAAGCCGTTCAAAAACTTGTTGCTCGTGGG
TAAGGATAGAGAACTTTGCTGTTTGTTCACCTAACCTCTCATTTATGTTTCTCCAGATTGCATTCTATATGTGTGCATGCTTTTGTTCTCACTTGATGCATCATCTGAAGTAGAGAGAGGGTGAAGGGGAGTAAAACATAAAATGTGTAAACTAGAGGCACTGGAACTGATAAAAAACCTGAACGAAAATTTACTTCAGCATCGTGTTTGTGTGCTATTTATTTTGAGTATGTTAAGTGACACATTTATGTGATATTCTTTATGAAGATATAAAGTTGGGAGAGTCGAGCAGTTGGAATCTGCAGAGCAAACAAAAAGCAGAGGTGCAAATTCTGTAAGTATGTCACTTCAAATATCAGAAGGACATTGATTCTAGTTTTAGATTTTCAAAAGATCATTGTGTCTCTATTTTTTTGTTCTATGTATTTCGGCATCTGGATGTTATACTTGCAATTTAATTTCTGACTTGTTAATTGCAGGTGATACCTAGGAAATTGGTACAGGTGACTACTCCATCAACCAAGGCTGACGGTGACATTGGGCCTGATGCCGTTCATTTGCTTGCCATCAAAGAGGTATTAGAAATTGATTAAACTCTTGTATGTAATTGATATTTGGACTAGTTAGATAAATGAAAATTGTGGTTATAGTTAATTCAAATTGTCCCTTTTTCTTTCATTTTTACAAACAAAAGTTAAATAGTTCTTACCCATTGAAGAAGTTCACACTTAATCAACAGTTGATGGATCTATTCCATGGCATTACCATTTGAAATTGTATCATGGATTGCAATATTACTTTTAAATGATGGTCATGTTGGAAATAAACATTACATGACAAGCTACAACGAGAGTATGCTTCATGGGAATTTCTTTTCTATAGGAGAGCTGCGGGCTGGATAATAATTCAATTTCATATGGGTTTGCTTTTGTCGATTGTGCTGCTTTAAAATTTTGGACTGGTTCTATCAAAGATGATGCTTCCTGCGCTGCTCTGGGTGCTCTCTTGATGCAAGTAGGCAATCCAAAATTGTCATTTTTGCATTTTTGTCATTTGGTTTTGCAAAGTTGATAGTAATATGTGCCACTGTGATTTTTCTAGGTGTCCCCAAAGGAAATAATATATGAAGCTAGAGGTGAAAAGAAAATCTTTCATCCAATCTTGGACTTCAAATTTTCATTTAATTGATTGATATATTTAACTTTGGCAAACTCCTCAATGTCCACTATGTAACTTTGTGCAGGACTATCTAAAGAAACACATAAAGTTCTCAAGAAGTACTCGCCTACTGGTTTGTTATATACTAAGAAATTTCCTTTGATGAGTGGTTGCCTTTTTATTTATCACATATGTCTGCCTTCTGAGTTTAAATGTCTTTCCTCTTTTGCTACTGTCAAATTCTTAATAGCTACTATTTACTAGTCTAACTACAAATTCAATTATCTAGTTAATACAATTTTATTTTACATTGTTCTCAAGTCACTGAAAGTGATCGACCGAAGTAATGAGATTTAAATGAATGCAATTGACTCAAGTCTTGAGATTTACTTGAAAATTTGACCATCGATTTCTCAAATATCTCTCATGTTGTTAATAACCGATGATCATCTGAATTGTTTAACAAATTTTGATGTTTTATTCACAATGTTTGAGCTGTACTATTTATTCTCAAAGTTCTTAGTCTTCTCATCTTGCAGGCTCCACTGCTCTAGAACTCACATCAGGGTCGCCAGTTACAAATTTTCTAGAAGCTTCAGAGGTTAAACTTTTGGTCCAGTCTAAAGCATACTTTAAAGGTTCCTTGAATTTGTGGAATCACGAAAGCACAGTCCATGATGACATTGCTCTATGTGCGCTTGGAGGACTTATCAATCACATGTCAAGGCTGATGGTAAATATGATTCTCCAAGATACAGTTATATGTCTTATCACATAACTTTTGTGATTTATGATCAAACATTTCCTACAGTTAGACGATGTTTTGCGGAATGGTGATTTAC
TGCCATACCAAGTATACAGAGGCTGCCTAAGAATGGATGGACAAACAATGGTTAATCTTGAAATTTTCAGAAACAATGATGACGGTGGTCTGTCAGGTAGGTATTTGTTTCAAATTCAAGGTCTGAAAAACTGGACCGATATCTTTTTTTATTAATTTTGATACATATGTTGCAGGTACACTGTACAAGTATCTTGATAACTGCGTGACATCATCGGGAAAGAGGCTCTTAAGGTTATGGATCTGTCATCCTCTTAAAGATGTTGAAGAGATAAACAACAGACTTAATGTGGTTGAAGAACTGATGGCGCAATCTGACATTATGGTACTTCTAGGTACCACCTATCTTCGCAAGCTTCCAGACTTGGAGAGGTTGCTTGGGCAGATTAAGGCTACTGTCCAGTCATCTGCTTCTCTTGTATTACCATTGATTCGCAAAAAGTTACAGAAACGACGGGTAAGTATAATATAACATATGTACAAACAGCAGATCTAATGTCATTGCTTACACTCTCAACAAGTCGAACGGGTGGTCATCACTCAAGGTTATTGCAAAATCATTAAACTGAGATTGTCTTAAATTAGATTAGTTCTTAAATGAAATTGTCTCACATAATAATTAAACTGTAATTAGTTATACAAGGTTTCTCCGTGCATTTGTTTTACATGAAGTGGCAATTTTGAAGATGATTCATTCCTAAATTATATCAAGATATACTAACTGAATACTGATGACATAATATGATGCAGGTGAAACTATTTGGGTCTCTGGTGAAGGGCCTCAGGACTGGATTGGATCTATTAATTCAAGTCCAGAAGGAAGGTCTCATTATTTCTCTACCAAAAGTAGTTAAACTTCCACAGCTCAGCGGCAATGGCGGGCTTGATCAATTCCTCACTCAATTTGAAGCAGCTGTAGATAGCGAGTTTCCAGATTATCAGGTGTGTCCTTGACTTTCTTTTTTATTCTATAGTTGTTTATATATATATGACAAAGAGAGTATTATTCCAACGGCAAAACCCTCACTAAAATAACCTAAAAAGATAAAGGGATAGCTGATAAGGCGTGCTCTTCCCTGAAGGAAAGATTACAAAAAGGCTTCCTATCTAAGCTAATAGTGTCCGTGTAGTTACTAAATAGCATATGCTGAGCCCTACAACTAGAAGCCATAAACCGAAAAAAATCACAAAACGACGAAAAAAAAACCCAATTATCTAGGAATAGACCCTCACTATTTCTTTCCTTCTAAAAGGAACCATGGATTAGCCCTACTGCCATTTTTCCAGAGAACTTCAGGTTTATTTTTCAAGAACCAACCATAGAGAGCTTCATGCATAGCCACAAACGGAGGGAACTCGGCTTGCAGCTCTGAAGGCCAAAACATTCTACAGTTTTATAATCCTTTATACTTTATATCTTTTCTTGGCTGGAGATCTCGTCTGAAAATGTTTATATCCCTCACTTTTGTTTGCAATTTTTTATCAGAACCATGATGTAACGGATTCTGGTGCTGAAAGACTCTCTATACTGATCGAGTTATTTGTAGAAAAAGCTACTGAATGGTCTGAAGTAATCCATGCCCTCAATTGTGTAGACGTGCTGAGATCATTTGCAATCATTGCTCATTCATCTAGAGGGTCCATGTCTAGGCCTCTAATTTTGCCTCAATCGAACAATTCGATGTTAAGTCCAGAAAAGCAAGGGCCAGTTCTTAAAATTAATGGGCTATGGCATCCATATGCCCTTGTGGAGAGTGGAGAAACACCAGTCCCTAATGATATGATCCTTGGTCTTGATCAAGATAGCTACCACCCTCGTACTCTACTTCTAACAGGACCAAACATGGGCGGGAAATCCACACTTCTCCGTTCCACTTGTCTAGCTGTTGTTCTTGCACAGGTTTGTTCCATTATCGTTTAAGTCCGTTTGATAATCACTTTGTTTTCTTATTTTTTTTAAAAATCAAGTCGATCGTATCATTTTTAAAATCTGAAGCTTAT
CAAACAAGCCTAAAATATAATGTTATTTCAATCCACTCTTGTTGAATTGTAGTTGGGTTGCTACGTACCTTGTGAGACATGTACGCTCTCGGTCGTCGATACCATCTTTACACGACTTGGTGCCACTGATCGAATCATGACAGGAGAAAGTAAGTTCATTTATCTTTTTGTGATAAGAAACAATTTTGTGTTAGGATAGAGAGACCAAGCGAGGAGAAGAAATTGGAGGATACAGCTGGTGAAAAATCAATTTTTAAATATAGTGTCTTTTGAATAATGCTTTTTAAATATAATAAATTTATCTATTAACTTCAAGTACCTTTGAAAAAAATCACACATATATATTTAAATTTTATTAAATAAAAAATAGTTGAAAAACTTGAGTTAATTTAAAACTTATTGTTTAAGATGTTATGTATATATATTGAAACAAGACTTTATCTAAAGTTGATGTGAAACAATAAAGAGATTCCATGTAGGTACGTTCCTTGTTGAATGTTCGGAAACAGCTTCAGTTCTCCAACATGCGACTCAGGACTCTCTTGTTATTCTTGATGAACTCGGCCGAGGAACCAGCACCTTTGATGGCTATGCCATTGCATATGCTGTGAGTTACCTTCTTCTTTTTAATTTAATCCATTTATCTCATTCACTATCATTCCCGAGCATATATAATTGATTTTGAATTAAAGGTTCGTGTTGCATTATATTAATTGAGAATTTTGAACATTACAAGGACATGCTAGTAGTGATTTTAAAGCAACAAAAATCACTCGTCACATTTAAAATTAATATCACTAAAAGTGGTTATGAATGTTTAAAAAGGCATATAACAAGCATATGTCAGTAGTGATGTTTAATTTTACATGTTTAGCAGCAATTTTAGTCATTTTCAAAATCAGACGTTTTTAATTTTGGAAAACTTGTTTTGATGATAAGAAAATTGTACCCTGAACTTGAATATATGTTGAGTATTTAAACTTTTGTTTTTGAGTAATGTTAACATGACAAGTGATTTTAATAAACTCTTAAACATTTACTAAAACATGTATTGGAGTGATTTTGAAATGAAAAATGATTTTAATTATTTTAAAATGACTCCTAAACATATTAGGATCAAATAATTCCTTATTGATCAAAAGATATTAAAATGACTACAAATCGGTTCTTCAGGTGTTTCGCCATTTGATAGAAAAGGTTAACTGTAGACTTCTATTTGCTACTCACTATCACCCTTTAACAAAAGAGTTTGCTTCCCATCCTCACGTCATGCTTCAACACATGGCCTGCACATTCAAAGACCACGAGCTCATTTTTCTATATCGCCTTCGCTCCGGAGCTTGCCCTGAGAGCTATGGCCTAAAAGTAGCAACCATGGCTGGAATCCCAGGACGAGTTGTTGAAGCAGCTTCAAGAGCTAGCCAAATGATGAAGCAAACCATCAAAGAGAACTTCAAATCGAGCGAGCAACGATCGGAGTTCTCAACACTTCATGAAGAGTGGTTGAAAACTCTAATCACAGTCTTAGAGTTTAAAGGTAACAATCTCGGTGAAAATGATGCTTTTGATACATTATTTTGTTTATGGTATGAGCTTAAAAGAGATCGTATCACTGCTAGGATATGACATTGTAGGGAAAAAGCATGACATAGGAAATTCCTTGTGTGAAGTTTTTATCATCTAATTGGCGTAGTTTTAGGTTAGAGAGTACTTTTGTAGTTGAGACATGAATGCATTTAGGGTTCTACCGTAGATCCTTAAATCTACCAAGGCATGTGTTTGTACAGAAAAATACCCTAATATATTTTAATCTCAATATACATAAA

mRNA sequence

ATGCAGCGCCAGAAATCTTTGTTATCCTTCTTCCAAAAATCTCCGTCCGATAATCGGAGCTCCGATGGCTGTGCCTCCTCCGTCGGCCAGCGGCTCACTCGCTTTCAAACGAAACCAAGCGCAGCCGGTTTGGAGCAGCCGGCTATCCAGACCACTGCGGATTCCTCCCTGGAGATTAGAGGAACCGACACTCCGCCGGAGAAGGTGCCTCGCCAGATTTTGCCGGTGATTGAGAAGAACAGAGGTTCTTCTCTCTTTTCAAGCATCATGCATAAATTTGTGCGAGTCGATGATAAACGTAAGGCGAACGAGAGGGACGAAGTTCAAAAAGATTCATCTCAGAATGAGGTTGGAAAAGATTCTCCTCAGTTACCTTCCATTTCTGGTAAGGTAAATGATCCGACAGAGTTTTCGAAACTAGATGTAGCTTCTAGACGTCACGGTAAATTCGACGTTGCAAATTTAAACGGACATAGAGGACCTGTATTGAATATTGAAAGCAATGAGGACATTGCTGGACCAGAAACACCTGGCATGCGACCTTCTGTCTCTCGTTTGAAGAGATCTCAAGAGGTTTCTCTTGTGAATTGTAGCGGTGATTCTTTGCAAGATAGTACAAAAAGAATCAAACTCCTTCAGGACTCGATTAACTTGAACAAAATTCACAATGAAATTTCCGATGCAACTAGCAAATTTGAGTGGCTCAATCCCTCTCAAGTTCGAGATGCCAATCGTAGAAGGCCCGACCATCCTCTTTATGATAAAAAAACACTATACATACCACCCGACGTGCTGAAGAAAATGTCAGCATCGCAAAAACAATACTGGAATGTGAAATGTCAATATATGGACATCTTGCTCTTCTTCAAAGTGGGGAAGTTCTATGAGCTATATGAACAAGATGCTGAAATTGGTCACAAGGAGCTTGACTGGAAAATGACATTAAGCGGCGTTGGGAAATGTAGGCAGGTTGGTGTACCTGAGAGTGGAATTGATGAAGCCGTTCAAAAACTTGTTGCTCGTGGATATAAAGTTGGGAGAGTCGAGCAGTTGGAATCTGCAGAGCAAACAAAAAGCAGAGGTGCAAATTCTGTGATACCTAGGAAATTGGTACAGGTGACTACTCCATCAACCAAGGCTGACGGTGACATTGGGCCTGATGCCGTTCATTTGCTTGCCATCAAAGAGGAGAGCTGCGGGCTGGATAATAATTCAATTTCATATGGGTTTGCTTTTGTCGATTGTGCTGCTTTAAAATTTTGGACTGGTTCTATCAAAGATGATGCTTCCTGCGCTGCTCTGGGTGCTCTCTTGATGCAAGTGTCCCCAAAGGAAATAATATATGAAGCTAGAGGACTATCTAAAGAAACACATAAAGTTCTCAAGAAGTACTCGCCTACTGGCTCCACTGCTCTAGAACTCACATCAGGGTCGCCAGTTACAAATTTTCTAGAAGCTTCAGAGGTTAAACTTTTGGTCCAGTCTAAAGCATACTTTAAAGGTTCCTTGAATTTGTGGAATCACGAAAGCACAGTCCATGATGACATTGCTCTATGTGCGCTTGGAGGACTTATCAATCACATGTCAAGGCTGATGTTAGACGATGTTTTGCGGAATGGTGATTTACTGCCATACCAAGTATACAGAGGCTGCCTAAGAATGGATGGACAAACAATGGTTAATCTTGAAATTTTCAGAAACAATGATGACGGTGGTCTGTCAGGTACACTGTACAAGTATCTTGATAACTGCGTGACATCATCGGGAAAGAGGCTCTTAAGGTTATGGATCTGTCATCCTCTTAAAGATGTTGAAGAGATAAACAACAGACTTAATGTGGTTGAAGAACTGATGGCGCAATCTGACATTATGGTACTTCTAGGTACCACCTATCTTCGCAAGCTTCCAGACTTGGAGAGGTTGCTTGGGCAGATTAAGGCTACTGTCCAGTCATCTGCTTCTCTTGTATTACCATTGATTCGCAAAAAGTTACAGAAACG
ACGGGTGAAACTATTTGGGTCTCTGGTGAAGGGCCTCAGGACTGGATTGGATCTATTAATTCAAGTCCAGAAGGAAGGTCTCATTATTTCTCTACCAAAAGTAGTTAAACTTCCACAGCTCAGCGGCAATGGCGGGCTTGATCAATTCCTCACTCAATTTGAAGCAGCTGTAGATAGCGAGTTTCCAGATTATCAGAACCATGATGTAACGGATTCTGGTGCTGAAAGACTCTCTATACTGATCGAGTTATTTGTAGAAAAAGCTACTGAATGGTCTGAAGTAATCCATGCCCTCAATTGTGTAGACGTGCTGAGATCATTTGCAATCATTGCTCATTCATCTAGAGGGTCCATGTCTAGGCCTCTAATTTTGCCTCAATCGAACAATTCGATGTTAAGTCCAGAAAAGCAAGGGCCAGTTCTTAAAATTAATGGGCTATGGCATCCATATGCCCTTGTGGAGAGTGGAGAAACACCAGTCCCTAATGATATGATCCTTGGTCTTGATCAAGATAGCTACCACCCTCGTACTCTACTTCTAACAGGACCAAACATGGGCGGGAAATCCACACTTCTCCGTTCCACTTGTCTAGCTGTTGTTCTTGCACAGTTGGGTTGCTACGTACCTTGTGAGACATGTACGCTCTCGGTCGTCGATACCATCTTTACACGACTTGGTGCCACTGATCGAATCATGACAGGAGAAAGTACGTTCCTTGTTGAATGTTCGGAAACAGCTTCAGTTCTCCAACATGCGACTCAGGACTCTCTTGTTATTCTTGATGAACTCGGCCGAGGAACCAGCACCTTTGATGGCTATGCCATTGCATATGCTGTGTTTCGCCATTTGATAGAAAAGGTTAACTGTAGACTTCTATTTGCTACTCACTATCACCCTTTAACAAAAGAGTTTGCTTCCCATCCTCACGTCATGCTTCAACACATGGCCTGCACATTCAAAGACCACGAGCTCATTTTTCTATATCGCCTTCGCTCCGGAGCTTGCCCTGAGAGCTATGGCCTAAAAGTAGCAACCATGGCTGGAATCCCAGGACGAGTTGTTGAAGCAGCTTCAAGAGCTAGCCAAATGATGAAGCAAACCATCAAAGAGAACTTCAAATCGAGCGAGCAACGATCGGAGTTCTCAACACTTCATGAAGAGTGGTTGAAAACTCTAATCACAGTCTTAGAGTTTAAAGGTAACAATCTCGGTGAAAATGATGCTTTTGATACATTATTTTGTTTATGGTATGAGCTTAAAAGAGATCGTATCACTGCTAGGATATGA

Coding sequence (CDS)

ATGCAGCGCCAGAAATCTTTGTTATCCTTCTTCCAAAAATCTCCGTCCGATAATCGGAGCTCCGATGGCTGTGCCTCCTCCGTCGGCCAGCGGCTCACTCGCTTTCAAACGAAACCAAGCGCAGCCGGTTTGGAGCAGCCGGCTATCCAGACCACTGCGGATTCCTCCCTGGAGATTAGAGGAACCGACACTCCGCCGGAGAAGGTGCCTCGCCAGATTTTGCCGGTGATTGAGAAGAACAGAGGTTCTTCTCTCTTTTCAAGCATCATGCATAAATTTGTGCGAGTCGATGATAAACGTAAGGCGAACGAGAGGGACGAAGTTCAAAAAGATTCATCTCAGAATGAGGTTGGAAAAGATTCTCCTCAGTTACCTTCCATTTCTGGTAAGGTAAATGATCCGACAGAGTTTTCGAAACTAGATGTAGCTTCTAGACGTCACGGTAAATTCGACGTTGCAAATTTAAACGGACATAGAGGACCTGTATTGAATATTGAAAGCAATGAGGACATTGCTGGACCAGAAACACCTGGCATGCGACCTTCTGTCTCTCGTTTGAAGAGATCTCAAGAGGTTTCTCTTGTGAATTGTAGCGGTGATTCTTTGCAAGATAGTACAAAAAGAATCAAACTCCTTCAGGACTCGATTAACTTGAACAAAATTCACAATGAAATTTCCGATGCAACTAGCAAATTTGAGTGGCTCAATCCCTCTCAAGTTCGAGATGCCAATCGTAGAAGGCCCGACCATCCTCTTTATGATAAAAAAACACTATACATACCACCCGACGTGCTGAAGAAAATGTCAGCATCGCAAAAACAATACTGGAATGTGAAATGTCAATATATGGACATCTTGCTCTTCTTCAAAGTGGGGAAGTTCTATGAGCTATATGAACAAGATGCTGAAATTGGTCACAAGGAGCTTGACTGGAAAATGACATTAAGCGGCGTTGGGAAATGTAGGCAGGTTGGTGTACCTGAGAGTGGAATTGATGAAGCCGTTCAAAAACTTGTTGCTCGTGGATATAAAGTTGGGAGAGTCGAGCAGTTGGAATCTGCAGAGCAAACAAAAAGCAGAGGTGCAAATTCTGTGATACCTAGGAAATTGGTACAGGTGACTACTCCATCAACCAAGGCTGACGGTGACATTGGGCCTGATGCCGTTCATTTGCTTGCCATCAAAGAGGAGAGCTGCGGGCTGGATAATAATTCAATTTCATATGGGTTTGCTTTTGTCGATTGTGCTGCTTTAAAATTTTGGACTGGTTCTATCAAAGATGATGCTTCCTGCGCTGCTCTGGGTGCTCTCTTGATGCAAGTGTCCCCAAAGGAAATAATATATGAAGCTAGAGGACTATCTAAAGAAACACATAAAGTTCTCAAGAAGTACTCGCCTACTGGCTCCACTGCTCTAGAACTCACATCAGGGTCGCCAGTTACAAATTTTCTAGAAGCTTCAGAGGTTAAACTTTTGGTCCAGTCTAAAGCATACTTTAAAGGTTCCTTGAATTTGTGGAATCACGAAAGCACAGTCCATGATGACATTGCTCTATGTGCGCTTGGAGGACTTATCAATCACATGTCAAGGCTGATGTTAGACGATGTTTTGCGGAATGGTGATTTACTGCCATACCAAGTATACAGAGGCTGCCTAAGAATGGATGGACAAACAATGGTTAATCTTGAAATTTTCAGAAACAATGATGACGGTGGTCTGTCAGGTACACTGTACAAGTATCTTGATAACTGCGTGACATCATCGGGAAAGAGGCTCTTAAGGTTATGGATCTGTCATCCTCTTAAAGATGTTGAAGAGATAAACAACAGACTTAATGTGGTTGAAGAACTGATGGCGCAATCTGACATTATGGTACTTCTAGGTACCACCTATCTTCGCAAGCTTCCAGACTTGGAGAGGTTGCTTGGGCAGATTAAGGCTACTGTCCAGTCATCTGCTTCTCTTGTATTACCATTGATTCGCAAAAAGTTACAGAAACG
ACGGGTGAAACTATTTGGGTCTCTGGTGAAGGGCCTCAGGACTGGATTGGATCTATTAATTCAAGTCCAGAAGGAAGGTCTCATTATTTCTCTACCAAAAGTAGTTAAACTTCCACAGCTCAGCGGCAATGGCGGGCTTGATCAATTCCTCACTCAATTTGAAGCAGCTGTAGATAGCGAGTTTCCAGATTATCAGAACCATGATGTAACGGATTCTGGTGCTGAAAGACTCTCTATACTGATCGAGTTATTTGTAGAAAAAGCTACTGAATGGTCTGAAGTAATCCATGCCCTCAATTGTGTAGACGTGCTGAGATCATTTGCAATCATTGCTCATTCATCTAGAGGGTCCATGTCTAGGCCTCTAATTTTGCCTCAATCGAACAATTCGATGTTAAGTCCAGAAAAGCAAGGGCCAGTTCTTAAAATTAATGGGCTATGGCATCCATATGCCCTTGTGGAGAGTGGAGAAACACCAGTCCCTAATGATATGATCCTTGGTCTTGATCAAGATAGCTACCACCCTCGTACTCTACTTCTAACAGGACCAAACATGGGCGGGAAATCCACACTTCTCCGTTCCACTTGTCTAGCTGTTGTTCTTGCACAGTTGGGTTGCTACGTACCTTGTGAGACATGTACGCTCTCGGTCGTCGATACCATCTTTACACGACTTGGTGCCACTGATCGAATCATGACAGGAGAAAGTACGTTCCTTGTTGAATGTTCGGAAACAGCTTCAGTTCTCCAACATGCGACTCAGGACTCTCTTGTTATTCTTGATGAACTCGGCCGAGGAACCAGCACCTTTGATGGCTATGCCATTGCATATGCTGTGTTTCGCCATTTGATAGAAAAGGTTAACTGTAGACTTCTATTTGCTACTCACTATCACCCTTTAACAAAAGAGTTTGCTTCCCATCCTCACGTCATGCTTCAACACATGGCCTGCACATTCAAAGACCACGAGCTCATTTTTCTATATCGCCTTCGCTCCGGAGCTTGCCCTGAGAGCTATGGCCTAAAAGTAGCAACCATGGCTGGAATCCCAGGACGAGTTGTTGAAGCAGCTTCAAGAGCTAGCCAAATGATGAAGCAAACCATCAAAGAGAACTTCAAATCGAGCGAGCAACGATCGGAGTTCTCAACACTTCATGAAGAGTGGTTGAAAACTCTAATCACAGTCTTAGAGTTTAAAGGTAACAATCTCGGTGAAAATGATGCTTTTGATACATTATTTTGTTTATGGTATGAGCTTAAAAGAGATCGTATCACTGCTAGGATATGA

Protein sequence

MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIRGTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMRPSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLFCLWYELKRDRITARI*
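
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this in Python, assuming the standard nuclear genetic code (an assumption; the record does not name the translation table). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First six codons and last four codons of the CDS listed in this record.
print(translate("ATGCAGCGCCAGAAATCT"))
print(translate("GCTAGGATATGA"))
```

The two fragments translate to the start (`MQRQKS`) and end (`ARI*`) of the protein sequence shown above.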
BLAST of Csa2G004730 vs. Swiss-Prot
Match: MSH7_ARATH (DNA mismatch repair protein MSH7 OS=Arabidopsis thaliana GN=MSH7 PE=1 SV=1)

HSP 1 Score: 1309.3 bits (3387), Expect = 0.0e+00
Identity = 691/1120 (61.70%), Positives = 841/1120 (75.09%), Query Frame = 1
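
The percentages in this line are the match counts divided by the alignment length (1120 columns, including gaps). A small Python sketch recomputing them from the counts reported above; the helper name `percent` is hypothetical.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP summary line for the MSH7_ARATH match.
identity = percent(691, 1120)
positives = percent(841, 1120)
print(f"identity={identity:.2f}% positives={positives:.2f}%")
```

The same calculation applies to the summary lines of the other HSPs in this report.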

Query: 1    MQRQKSLLSFFQKSPSDNR----SSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSS 60
            MQRQ+S+LSFFQK  +       S D  +   G    RF  K   A  +       + S 
Sbjct: 1    MQRQRSILSFFQKPTAATTKGLVSGDAASGGGGSGGPRFNVKEGDAKGDASVRFAVSKSV 60

Query: 61   LEIRGTDTPPEKVPRQILPVIEK-----NRGSSLFSSIMHKFVRVDDKRKANERDEVQ-- 120
             E+RGTDTPPEKVPR++LP   K        SSLFS+IMHKFV+VDD+  + ER      
Sbjct: 61   DEVRGTDTPPEKVPRRVLPSGFKPAESAGDASSLFSNIMHKFVKVDDRDCSGERSREDVV 120

Query: 121  --KDSSQNEVGKDS-PQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIE 180
               DSS      D  PQ  S +GK  +            R+  F  +     R  V +I 
Sbjct: 121  PLNDSSLCMKANDVIPQFRSNNGKTQE------------RNHAFSFSGRAELRS-VEDIG 180

Query: 181  SNEDIAGPETPGMRPSVSRLKRSQEVSLVNCSGD-SLQDSTKRIKLLQDSINLNKIHNEI 240
             + D+ GPETPGMRP  SRLKR  E  +        + DS KR+K+LQD +   K   E+
Sbjct: 181  VDGDVPGPETPGMRPRASRLKRVLEDEMTFKEDKVPVLDSNKRLKMLQDPVCGEK--KEV 240

Query: 241  SDATSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDI 300
            ++ T KFEWL  S++RDANRRRPD PLYD+KTL+IPPDV KKMSASQKQYW+VK +YMDI
Sbjct: 241  NEGT-KFEWLESSRIRDANRRRPDDPLYDRKTLHIPPDVFKKMSASQKQYWSVKSEYMDI 300

Query: 301  LLFFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKV 360
            +LFFKVGKFYELYE DAE+GHKELDWKMT+SGVGKCRQVG+ ESGIDEAVQKL+ARGYKV
Sbjct: 301  VLFFKVGKFYELYELDAELGHKELDWKMTMSGVGKCRQVGISESGIDEAVQKLLARGYKV 360

Query: 361  GRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNS 420
            GR+EQLE+++Q K+RGAN++IPRKLVQV TPST ++G+IGPDAVHLLAIKE    L   S
Sbjct: 361  GRIEQLETSDQAKARGANTIIPRKLVQVLTPSTASEGNIGPDAVHLLAIKEIKMELQKCS 420

Query: 421  ISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYS 480
              YGFAFVDCAAL+FW GSI DDASCAALGALLMQVSPKE++Y+++GLS+E  K L+KY+
Sbjct: 421  TVYGFAFVDCAALRFWVGSISDDASCAALGALLMQVSPKEVLYDSKGLSREAQKALRKYT 480

Query: 481  PTGSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWN--HESTVHDDIALCALGG 540
             TGSTA++L     V    +A+ V+ +++S  YFKGS   WN   +     D+AL ALG 
Sbjct: 481  LTGSTAVQLAPVPQVMGDTDAAGVRNIIESNGYFKGSSESWNCAVDGLNECDVALSALGE 540

Query: 541  LINHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDN 600
            LINH+SRL L+DVL++GD+ PYQVYRGCLR+DGQTMVNLEIF N+ DGG SGTLYKYLDN
Sbjct: 541  LINHLSRLKLEDVLKHGDIFPYQVYRGCLRIDGQTMVNLEIFNNSCDGGPSGTLYKYLDN 600

Query: 601  CVTSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLG 660
            CV+ +GKRLLR WICHPLKDVE IN RL+VVEE  A S+ M + G  YL KLPDLERLLG
Sbjct: 601  CVSPTGKRLLRNWICHPLKDVESINKRLDVVEEFTANSESMQITG-QYLHKLPDLERLLG 660

Query: 661  QIKATVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIIS-LPKVV 720
            +IK++V+SSAS++  L+ KK+ K+RVK FG +VKG R+G+DLL+ +QKE  ++S L K+ 
Sbjct: 661  RIKSSVRSSASVLPALLGKKVLKQRVKAFGQIVKGFRSGIDLLLALQKESNMMSLLYKLC 720

Query: 721  KLPQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVI 780
            KLP L G  GL+ FL+QFEAA+DS+FP+YQN DVTD  AE L+ILIELF+E+AT+WSEVI
Sbjct: 721  KLPILVGKSGLELFLSQFEAAIDSDFPNYQNQDVTDENAETLTILIELFIERATQWSEVI 780

Query: 781  HALNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVES 840
            H ++C+DVLRSFAI A  S GSM+RP+I P+S  +  + + +GP+LKI GLWHP+A+   
Sbjct: 781  HTISCLDVLRSFAIAASLSAGSMARPVIFPESEATDQNQKTKGPILKIQGLWHPFAVAAD 840

Query: 841  GETPVPNDMILG---LDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCET 900
            G+ PVPND++LG       S HPR+LLLTGPNMGGKSTLLR+TCLAV+ AQLGCYVPCE+
Sbjct: 841  GQLPVPNDILLGEARRSSGSIHPRSLLLTGPNMGGKSTLLRATCLAVIFAQLGCYVPCES 900

Query: 901  CTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDG 960
            C +S+VDTIFTRLGA+DRIMTGESTFLVEC+ETASVLQ+ATQDSLVILDELGRGTSTFDG
Sbjct: 901  CEISLVDTIFTRLGASDRIMTGESTFLVECTETASVLQNATQDSLVILDELGRGTSTFDG 960

Query: 961  YAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK----------DH 1020
            YAIAY+VFRHL+EKV CR+LFATHYHPLTKEFASHP V  +HMAC FK          D 
Sbjct: 961  YAIAYSVFRHLVEKVQCRMLFATHYHPLTKEFASHPRVTSKHMACAFKSRSDYQPRGCDQ 1020

Query: 1021 ELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFS 1080
            +L+FLYRL  GACPESYGL+VA MAGIP +VVE AS A+Q MK++I ENFKSSE RSEFS
Sbjct: 1021 DLVFLYRLTEGACPESYGLQVALMAGIPNQVVETASGAAQAMKRSIGENFKSSELRSEFS 1080

Query: 1081 TLHEEWLKTLITVLEFKGNN--LGENDAFDTLFCLWYELK 1088
            +LHE+WLK+L+ +     NN  +GE+D +DTLFCLW+E+K
Sbjct: 1081 SLHEDWLKSLVGISRVAHNNAPIGEDD-YDTLFCLWHEIK 1102

BLAST of Csa2G004730 vs. Swiss-Prot
Match: MUTS_NEIMB (DNA mismatch repair protein MutS OS=Neisseria meningitidis serogroup B (strain MC58) GN=mutS PE=3 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 2.9e-66
Identity = 244/854 (28.57%), Positives = 385/854 (45.08%), Query Frame = 1

Query: 268  MSASQKQYWNVKCQYMDILLFFKVGKFYELYEQDAEIGHKELDWKMTLSGV---GKCRQV 327
            +S   +QY  +K Q+ D L+F+++G FYE++  DA    K LD  +T  G       +  
Sbjct: 6    VSPMMQQYLGIKAQHTDKLVFYRMGDFYEMFFDDAVEAAKLLDITLTTRGQVDGEPVKMA 65

Query: 328  GVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDI 387
            GVP    ++ + +LV  G  V   EQ+      K       + RK+V++ TP T  D   
Sbjct: 66   GVPFHAAEQYLARLVKLGKSVAICEQVGEVGAGKGP-----VERKVVRIVTPGTLTDSA- 125

Query: 388  GPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKFWTGSIKDDASCA-ALGALLMQVSP 447
                  LL  KE +  +   ++S    ++  A     +G  K   +    L   L ++  
Sbjct: 126  ------LLEDKETNRIV---AVSPDKKYIGLAWASLQSGEFKTKLTTVDKLDDELARLQA 185

Query: 448  KEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPVTN-----FLEASEVKLLVQSKAY 507
             EI+                  P    A +L + S VT      F   +  KLL +   Y
Sbjct: 186  AEILL-----------------PDSKNAPQLQTASGVTRLNAWQFAADAGEKLLTE---Y 245

Query: 508  FK-GSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQ 567
            F    L  +  +   H  +A+ A G L+N++ RL  + + ++ D L  +     + MD  
Sbjct: 246  FGCQDLRGFGLDGKEHA-VAIGAAGALLNYI-RLTQNLMPQHLDGLSLETDSQYIGMDAA 305

Query: 568  TMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHPLKDVEEINNRLNVVEEL 627
            T  NLEI +    G  S TL   LD C T  G RLL LW+ HPL++   I  R   V  L
Sbjct: 306  TRRNLEITQTLS-GKKSPTLMSTLDLCATHMGSRLLALWLHHPLRNRAHIRARQEAVAAL 365

Query: 628  MAQSDIMVLLGTTYLRKLPDLERLLGQI---KATVQSSASL---VLPLIRKKLQKRRVKL 687
             +Q   +       L+ + D+ER+  +I    A  +  A+L   +  L   +L      L
Sbjct: 366  ESQYKPL----QCRLKSIADIERIAARIAVGNARPRDLAALRDSLFALSEIELSAECSSL 425

Query: 688  FGSLVKGLRTGLDLLIQVQ-----------KEGLIIS---LPKVVKLPQLSGNGGLDQFL 747
             G+L       L    Q++           K+G +I+    P++ +L ++  +G  D+FL
Sbjct: 426  LGTLKAVFPENLSTAEQLRQAILPEPSVWLKDGNVINHGFHPELDELRRIQNHG--DEFL 485

Query: 748  TQFEA---------AVDSEF-------------------PDYQNHDVTDSGAERLSILIE 807
               EA          +  EF                    DYQ      +    ++  ++
Sbjct: 486  LDLEAKERERTGLSTLKVEFNRVHGFYIELSKTQAEQAPADYQRRQTLKNAERFITPELK 545

Query: 808  LFVEKATEWSEVIHALN---CVDVLRSF-AIIAHSSRGSMSRPLILPQSNNSMLSPEKQG 867
             F +K     E   AL       VL++    +    + + +   +   S  S L+ E+  
Sbjct: 546  AFEDKVLTAQEQALALEKQLFDGVLKNLQTALPQLQKAAKAAAALDVLSTFSALAKERN- 605

Query: 868  PVLKINGLWHPYALVESGETPVPNDMILGL-----DQDSYHPRTLLLTGPNMGGKSTLLR 927
              ++     +P   +E+G  PV    +        D D  H R +LLTGPNMGGKST +R
Sbjct: 606  -FVRPEFADYPVIHIENGRHPVVEQQVRHFTANHTDLDHKH-RLMLLTGPNMGGKSTYMR 665

Query: 928  STCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHAT 987
               L V+LA  GC+VP +  T+  +D IFTR+GA+D + +  STF+VE SETA +L HAT
Sbjct: 666  QVALIVLLAHTGCFVPADAATIGPIDQIFTRIGASDDLASNRSTFMVEMSETAYILHHAT 725

Query: 988  QDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQ 1047
            + SLV++DE+GRGTSTFDG A+A+AV  HL++K     LFATHY  LT    +H   +  
Sbjct: 726  EQSLVLMDEVGRGTSTFDGLALAHAVAEHLLQKNKSFSLFATHYFELTYLPEAHTAAVNM 785

Query: 1048 HMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFK 1055
            H++   +  +++FL++++ G   +SYG+ VA +AG+P R +++A +    ++     N  
Sbjct: 786  HLSALEQGQDIVFLHQIQPGPAGKSYGIAVAKLAGLPVRALKSAQKHLNGLENQAAAN-- 809

BLAST of Csa2G004730 vs. Swiss-Prot
Match: MSH6_HUMAN (DNA mismatch repair protein Msh6 OS=Homo sapiens GN=MSH6 PE=1 SV=2)

HSP 1 Score: 248.8 bits (634), Expect = 2.7e-64
Identity = 138/328 (42.07%), Positives = 196/328 (59.76%), Query Frame = 1

Query: 751  FVEKATEWSEVIHALNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKI 810
            F +   +W   +  +  +DVL   A  +    G M RP+IL         PE   P L++
Sbjct: 1040 FDKNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMCRPVIL--------LPEDTPPFLEL 1099

Query: 811  NGLWHP-YALVESGETPVPNDMILGLD---QDSYHPRTLLLTGPNMGGKSTLLRSTCLAV 870
             G  HP       G+  +PND+++G +   Q++     +L+TGPNMGGKSTL+R   L  
Sbjct: 1100 KGSRHPCITKTFFGDDFIPNDILIGCEEEEQENGKAYCVLVTGPNMGGKSTLMRQAGLLA 1159

Query: 871  VLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVI 930
            V+AQ+GCYVP E C L+ +D +FTRLGA+DRIM+GESTF VE SETAS+L HAT  SLV+
Sbjct: 1160 VMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVL 1219

Query: 931  LDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTF 990
            +DELGRGT+TFDG AIA AV + L E + CR LF+THYH L ++++ +  V L HMAC  
Sbjct: 1220 VDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMV 1279

Query: 991  KD-------HELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRAS---QMMKQTIK 1050
            ++         + FLY+   GACP+SYG   A +A +P  V++   R +   + M Q+++
Sbjct: 1280 ENECEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEKMNQSLR 1339

Query: 1051 ENFKSSEQRSEFSTLHEEWLKTLITVLE 1065
              F+     SE ST+  E +  L+T+++
Sbjct: 1340 -LFREVCLASERSTVDAEAVHKLLTLIK 1358


HSP 2 Score: 222.6 bits (566), Expect = 2.1e-56
Identity = 151/466 (32.40%), Positives = 230/466 (49.36%), Query Frame = 1

Query: 233 EWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVG 292
           EWL   + RD +RRRPDHP +D  TLY+P D L   +   +++W +K Q  D+++ +KVG
Sbjct: 371 EWLKEEKRRDEHRRRPDHPDFDASTLYVPEDFLNSCTPGMRKWWQIKSQNFDLVICYKVG 430

Query: 293 KFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLE 352
           KFYELY  DA IG  EL     +   G     G PE         LV +GYKV RVEQ E
Sbjct: 431 KFYELYHMDALIGVSELG---LVFMKGNWAHSGFPEIAFGRYSDSLVQKGYKVARVEQTE 490

Query: 353 SAEQTKSR--------GANSVIPRKLVQVTTPSTKA----DGDIGPD-AVHLLAIKEESC 412
           + E  ++R          + V+ R++ ++ T  T+     +GD   + + +LL++KE+  
Sbjct: 491 TPEMMEARCRKMAHISKYDRVVRREICRIITKGTQTYSVLEGDPSENYSKYLLSLKEKEE 550

Query: 413 GLDNNSISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHK 472
               ++ +YG  FVD +  KF+ G   DD  C+    L+    P ++++E   LSKET  
Sbjct: 551 DSSGHTRAYGVCFVDTSLGKFFIGQFSDDRHCSRFRTLVAHYPPVQVLFEKGNLSKETKT 610

Query: 473 VLKKYSPTGSTALELTSG-SPVTNFLEASEVKLLVQSKAYFKGSLN---------LWNHE 532
           +LK      S +  L  G  P + F +AS+    +  + YF+  L+         +    
Sbjct: 611 ILK-----SSLSCSLQEGLIPGSQFWDASKTLRTLLEEEYFREKLSDGIGVMLPQVLKGM 670

Query: 533 STVHD----------DIALCALGGLINHMSRLMLDDVL--------------------RN 592
           ++  D          ++AL ALGG + ++ + ++D  L                    R+
Sbjct: 671 TSESDSIGLTPGEKSELALSALGGCVFYLKKCLIDQELLSMANFEEYIPLDSDTVSTTRS 730

Query: 593 GDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICH 646
           G +      R  + +D  T+ NLEIF N  +G   GTL + +D C T  GKRLL+ W+C 
Sbjct: 731 GAIFTKAYQR--MVLDAVTLNNLEIFLNGTNGSTEGTLLERVDTCHTPFGKRLLKQWLCA 790

BLAST of Csa2G004730 vs. Swiss-Prot
Match: MUTS_CHLCH (DNA mismatch repair protein MutS OS=Chlorobium chlorochromatii (strain CaD3) GN=mutS PE=3 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 6.0e-64
Identity = 243/859 (28.29%), Positives = 386/859 (44.94%), Query Frame = 1

Query: 266  KKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVG 325
            K+ S   +QY  VK +Y D LL F+VG FYE +  DA      L+  +T          G
Sbjct: 9    KEHSPMMRQYLEVKERYPDYLLLFRVGDFYETFFDDAITVSTALNIVLT-KRTADIPMAG 68

Query: 326  VPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIG 385
             P    +  + KL+ +GYKV   +Q+E     K      ++ R++  + TP       + 
Sbjct: 69   FPYHASEGYIAKLIKKGYKVAVCDQVEDPADAKG-----IVRREITDIVTPGVTYSDKLL 128

Query: 386  PDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKE 445
             D  H   +   +   +  ++  G AF+D    +F   ++  +     L   L  + P E
Sbjct: 129  DDR-HNNYLAGVAFLKEGKTLMAGVAFIDVTTAEFRITTLLPEE----LPHFLAGLHPSE 188

Query: 446  IIYEARGLSKETHKVLKKYSPTGSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNL 505
            I++  +   KE   +LKK  P+  T + L    P     E S+  LL   K +   SL  
Sbjct: 189  ILFSTQ--EKERTLLLKKSLPS-ETLISLLE--PWMFSEEQSQTVLLRHFKTH---SLKG 248

Query: 506  WNHESTVHDDIALCALGGLINHMSRLM---LDDVLRNGDLLPYQVYRGCLRMDGQTMVNL 565
            +  E+   +  AL A G ++ ++       L  + R G+L   +     + +D QT  NL
Sbjct: 249  FGIETAGGNRAALVAAGVILQYLEETRQNSLSYITRIGELHHTEF----MSLDQQTKRNL 308

Query: 566  EIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHPLKDV------------------ 625
            EI  +  DG LSG+L + +D      G RLLR W+  PLK +                  
Sbjct: 309  EIISSMQDGSLSGSLLQVMDRTRNPMGARLLRRWLQRPLKKLTNIQERHNAVEELVENRT 368

Query: 626  --EEINNRLNVVEELMAQSDIMVLLGTT---------YLRKLPDLERLLGQIKAT---VQ 685
              E +  +L  + +L      +  L T           L  +P L+ LL  + A      
Sbjct: 369  LRESVAEQLAAINDLERSLARIATLRTIPREVRQLGISLAAIPTLQALLSDVTAPRLQAL 428

Query: 686  SSASLVLPLIRKKLQKRRVKLFGSLVKG---LRTGL---------------DLLIQVQKE 745
            ++A   LP + ++++       G+ ++    +R G                D L+Q+Q+E
Sbjct: 429  TAALQPLPKLAEQIESAIDPDAGATMRDGGYIRAGYNEELDDLRSIASTAKDRLMQIQQE 488

Query: 746  --------GLIISLPKVVKLPQLSGNGGLDQFLTQFE---AAVDSE---FPDYQNHDVTD 805
                     L +S  KV            D+    +E     V++E    P  + ++   
Sbjct: 489  EREATAISSLKVSYNKVFGYYIEISRANSDKVPAYYEKKQTLVNAERYTIPALKEYEEKI 548

Query: 806  SGAERLSILIE------LFVEKATEWSEV---IHALNCVDVLRSFAIIAHSSRGSMSRPL 865
              AE  S+L+E      L  + ATE + V      L  +D L SFA  A +     ++P 
Sbjct: 549  LHAEEKSLLLEAELFRNLCQQIATEAATVQANAALLAELDALCSFAECAVAF--DYTKPT 608

Query: 866  ILPQSNNSMLSPEKQGPVLKINGLWHPYA--LVESGETPVPNDMILGLDQDSYHPRTLLL 925
            +             +G  L I    HP    L+ + E+ +PND      Q       L++
Sbjct: 609  M------------HEGTTLSITAGRHPVLERLLGAEESYIPNDCHFDDKQTM-----LII 668

Query: 926  TGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLV 985
            TGPNM GKS+ LR   L V+LAQ G +VP E+ +L VVD IFTR+GA+D + +GESTFLV
Sbjct: 669  TGPNMAGKSSYLRQIGLIVLLAQAGSFVPAESASLGVVDRIFTRVGASDNLTSGESTFLV 728

Query: 986  ECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPL 1045
            E +E A++L +AT+ SL++LDE+GRGTSTFDG +IA+++  +++  +  + LFATHYH L
Sbjct: 729  EMNEAANILNNATERSLLLLDEIGRGTSTFDGMSIAWSMCEYIVHTIGAKTLFATHYHEL 788

Query: 1046 TKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRA 1047
             +       V+  +         +IFL ++  GA   SYG++VA MAG+P  V+   SRA
Sbjct: 789  AELEERLKGVVNYNATVVETAERVIFLRKIVRGATDNSYGIEVAKMAGMPNDVI---SRA 822

BLAST of Csa2G004730 vs. Swiss-Prot
Match: MUTS_LACH4 (DNA mismatch repair protein MutS OS=Lactobacillus helveticus (strain DPC 4571) GN=mutS PE=3 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 3.3e-62
Identity = 231/843 (27.40%), Positives = 392/843 (46.50%), Query Frame = 1

Query: 273  KQYWNVKCQYMDILLFFKVGKFYELYEQDAEIGHKELDWKMTLSG---VGKCRQVGVPES 332
            +QY+ +K QY D  LF++VG FYEL+E DA  G + L+  +T             GVP  
Sbjct: 3    EQYYEIKKQYPDAFLFYRVGDFYELFEDDAVKGAQILELTLTHRSNKTKNPIPMAGVPHM 62

Query: 333  GIDEAVQKLVARGYKVGRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGP--- 392
             +D  V  LV +GYKV   EQLE  ++ K      ++ R ++Q+ TP T  +   GP   
Sbjct: 63   AVDTYVNTLVEKGYKVALCEQLEDPKKAKG-----MVKRGIIQLVTPGTMMNE--GPNDA 122

Query: 393  -DAVHLLAIKEESCGLDNNSISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKE 452
             D+ +L ++     G       +G A+ D +  + ++  +K   S AA+   L+ +  +E
Sbjct: 123  KDSNYLTSVVTTKSG-------FGLAYSDLSTGEIYSTHLK---SFAAVSNELLSLRTRE 182

Query: 453  IIYEARGLSKETHKVLKKYSPTGS--TALE------------LTSGSP------VTNFLE 512
            ++Y    L++++   + K + T S  T +E            LT+G+       +  +L 
Sbjct: 183  VVYNGP-LTEQSKDFMHKANITVSAPTPIEGEHAEISYVEQNLTNGAEKSATRQLVGYLL 242

Query: 513  ASEVKLLVQSKAYFKGSLNLWNHES-TVHDDIALCALGGLINHMSRL--MLDDV--LRNG 572
            +++ + L   +      +N +   S TV +++ L A       M  L  +LD       G
Sbjct: 243  STQKRSLAHLQIAKSYEVNQYLQMSHTVQNNLELVASAKTGKKMGSLFWVLDKTHTAMGG 302

Query: 573  DLLPYQVYRGCLRMD--------GQTMVNLEIFRNNDDGGLSGT--LYKYLDNCVTSSGK 632
             LL   + R  L +D         Q +++    R N    L G   L +        +  
Sbjct: 303  RLLKQWLARPLLNVDIINHREKMVQALLDGYFTRENTIDALKGVYDLERLTGRIAFGNVN 362

Query: 633  RLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQ 692
                L +   L+ V  I + LN       QSD  VL  T + +K+  L+ +   I  T+ 
Sbjct: 363  ARELLQLSRSLQAVPVILDALN-------QSDSDVL--TDFAKKIDPLKGVAELISTTLV 422

Query: 693  SSASLVLP---LIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQK----EGLIISLPKVVK 752
                L+     LIR  + K+  +   ++  G +  + +    ++    E L +   KV  
Sbjct: 423  KDPPLLTTEGGLIRDGVDKQLDRYRDAMNNGKKWLVQMETDERQKTGIENLKVGFNKVFG 482

Query: 753  LPQLSGNGG-----LDQF-----LTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVE 812
                  NG      LD++     LT  E  +  E  +++N  + ++      +  +LFV+
Sbjct: 483  YYIQVSNGNKSKVPLDRYTRKQTLTNAERYITPELKEHENL-ILEAQTRSTDLEYDLFVQ 542

Query: 813  KATEWSEVIHALN-------CVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGP 872
               E  + I AL         +DV   FA +A  +  +  RP     S +          
Sbjct: 543  LRDEVKKYIPALQKLGNQLAALDVYCGFATVAEQN--NYCRPSFHTDSQD---------- 602

Query: 873  VLKINGLWHPYALVESGETPVPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAV 932
            +  +NG       V +  + +PND+ +    + +     L+TGPNM GKST +R   L  
Sbjct: 603  IDVVNGRHPVVEKVMTAGSYIPNDVKMDSATNIF-----LITGPNMSGKSTYMRQMALIA 662

Query: 933  VLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVI 992
            ++AQ+G +VP ++  L + D IFTR+GA D +++G+STF+VE SE  + LQ+AT+ SLV+
Sbjct: 663  IMAQVGSFVPADSAALPIFDQIFTRIGAADDLISGQSTFMVEMSEANAALQYATKRSLVL 722

Query: 993  LDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTF 1050
             DE+GRGT+T+DG A+A A+ ++L +KV  + LFATHYH LT    +  ++   H+  T 
Sbjct: 723  FDEIGRGTATYDGMALAGAIVKYLHDKVGAKALFATHYHELTSLDETLDYLKNIHVGATE 782

BLAST of Csa2G004730 vs. TrEMBL
Match: A0A0A0LHY3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G004730 PE=4 SV=1)

HSP 1 Score: 2175.2 bits (5635), Expect = 0.0e+00
Identity = 1095/1095 (100.00%), Positives = 1095/1095 (100.00%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR 60
            MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR
Sbjct: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR 60

Query: 61   GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD 120
            GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD
Sbjct: 61   GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD 120

Query: 121  SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR 180
            SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR
Sbjct: 121  SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR 180

Query: 181  PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV 240
            PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV
Sbjct: 181  PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV 240

Query: 241  RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ 300
            RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ
Sbjct: 241  RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ 300

Query: 301  DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR 360
            DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR
Sbjct: 301  DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR 360

Query: 361  GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF 420
            GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF
Sbjct: 361  GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF 420

Query: 421  WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV 480
            WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV
Sbjct: 421  WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV 480

Query: 481  TNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNG 540
            TNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNG
Sbjct: 481  TNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNG 540

Query: 541  DLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHP 600
            DLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHP
Sbjct: 541  DLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHP 600

Query: 601  LKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLI 660
            LKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLI
Sbjct: 601  LKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLI 660

Query: 661  RKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQF 720
            RKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQF
Sbjct: 661  RKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQF 720

Query: 721  EAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHS 780
            EAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHS
Sbjct: 721  EAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHS 780

Query: 781  SRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSY 840
            SRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSY
Sbjct: 781  SRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSY 840

Query: 841  HPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMT 900
            HPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMT
Sbjct: 841  HPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMT 900

Query: 901  GESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLF 960
            GESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLF
Sbjct: 901  GESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLF 960

Query: 961  ATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRV 1020
            ATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRV
Sbjct: 961  ATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRV 1020

Query: 1021 VEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLF 1080
            VEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLF
Sbjct: 1021 VEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLF 1080

Query: 1081 CLWYELKRDRITARI 1096
            CLWYELKRDRITARI
Sbjct: 1081 CLWYELKRDRITARI 1095

BLAST of Csa2G004730 vs. TrEMBL
Match: A0A067KMZ7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_07221 PE=4 SV=1)

HSP 1 Score: 1440.6 bits (3728), Expect = 0.0e+00
Identity = 749/1112 (67.36%), Positives = 879/1112 (79.05%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTT---ADSSL 60
            MQRQKS+LSFFQK    ++ +D   +   ++   F +K     +  P    T    DSSL
Sbjct: 1    MQRQKSILSFFQKPSPASQKADSGGTLNERKAPHFSSKQENQKVVSPGKPDTHGSVDSSL 60

Query: 61   EIRGTDTPPEKVPRQILP----VIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSS 120
            E+RGTDTPPEKVPRQ+LP    V E   GSSLFSSIMHKFV+VD K K  ER +V   S+
Sbjct: 61   EVRGTDTPPEKVPRQVLPGSYSVNENTTGSSLFSSIMHKFVKVDSKEKPLERVQVHHPSN 120

Query: 121  QNEVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNG--HRGPVLNIESNEDI 180
                      + S+SG++ D   +SK         K +  + NG    G VL ++S+ D+
Sbjct: 121  D---------ICSVSGRLIDTKGWSKQRTDVLHLEKNNAYSSNGMVDHGDVLLLKSSNDV 180

Query: 181  AGPETPGMRPSVSRLKRSQEVS--LVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDAT 240
             GPETPG++P V RLKR Q+ S    + SG SL +++KR+KLL DS   +K    I D+T
Sbjct: 181  PGPETPGVQPLVPRLKRIQDDSSKFDDRSGCSLLNASKRMKLLLDSTASSKNQGVIFDST 240

Query: 241  SKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFF 300
            SKFEWL+P ++RDAN RR   PLYDKKTLYIPPD LKKMSASQKQYW++K QYMDILLFF
Sbjct: 241  SKFEWLDPLRIRDANGRRLSDPLYDKKTLYIPPDTLKKMSASQKQYWSIKSQYMDILLFF 300

Query: 301  KVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVE 360
            KVGKFYELYE DAEIGHKELDWKMTLSGVGKCRQVG+ ESGID+AV+KLVARGYKVGR+E
Sbjct: 301  KVGKFYELYELDAEIGHKELDWKMTLSGVGKCRQVGISESGIDDAVEKLVARGYKVGRIE 360

Query: 361  QLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYG 420
            QLE++ Q K+RGANSVIPRKLVQV TPST  DG+IGPDAVHLLAIKE +CGLDN + SYG
Sbjct: 361  QLETSGQAKARGANSVIPRKLVQVVTPSTATDGNIGPDAVHLLAIKEGNCGLDNGATSYG 420

Query: 421  FAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGS 480
            FAFVDCAAL+FW GSI DD S AALGALLMQVSPKE+IYE+ G+SKE  K L+KYS TGS
Sbjct: 421  FAFVDCAALRFWVGSINDDTSYAALGALLMQVSPKEVIYESGGMSKEAQKALRKYSLTGS 480

Query: 481  TALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNHE--STVHDDIALCALGGLINH 540
             AL+LT     T+FL  SEV+ L+QSK YF GS N WN+   S +H DIAL ALGGL+ H
Sbjct: 481  -ALQLTPVQSTTDFLHGSEVRNLIQSKGYFSGSSNPWNNAIVSVLHHDIALSALGGLVGH 540

Query: 541  MSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTS 600
            +SRLMLDDVLRNGD+ PYQVY GCLRMDGQT++NLEIF NN DGGLSGTL+ +LDNCVTS
Sbjct: 541  LSRLMLDDVLRNGDIQPYQVYTGCLRMDGQTLINLEIFNNNADGGLSGTLFNHLDNCVTS 600

Query: 601  SGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKA 660
            SGKRLLR WICHPLK V+ IN+RLNVVEEL+ +S+IM+++   YLRKLPD+ER+LG++KA
Sbjct: 601  SGKRLLRKWICHPLKCVKGINDRLNVVEELINRSEIMLVIAQ-YLRKLPDIERMLGRVKA 660

Query: 661  TVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISL-PKVVKLPQ 720
            + Q+SASL LPLI KK+ K+RVK+FG LVKGLRTG+DLL+ +QKE  I+ L  K+ KLP+
Sbjct: 661  SFQASASLALPLIGKKMLKQRVKVFGCLVKGLRTGMDLLLLLQKESQIMLLFLKIFKLPE 720

Query: 721  LSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALN 780
            L+G+ GLD+FL QFEAAVDSEFPDYQNHDVTDS AE LS+LIELF+EKAT+WSE+IHA+N
Sbjct: 721  LNGSAGLDKFLAQFEAAVDSEFPDYQNHDVTDSEAETLSVLIELFIEKATQWSEIIHAIN 780

Query: 781  CVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETP 840
            C+DVLRSFA+ A  S GSMSRP+IL  S  +  S E  GPVLKI GLWHP+AL E+G  P
Sbjct: 781  CIDVLRSFAVTASMSSGSMSRPVILSDSKTTTFSREAAGPVLKIKGLWHPFALGENGGLP 840

Query: 841  VPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVD 900
            VPND+ LG    SYHP TLLLTGPNMGGKSTLLR+TCLAV+LAQLGC+VP E C LS+VD
Sbjct: 841  VPNDLNLGEHPGSYHPHTLLLTGPNMGGKSTLLRATCLAVILAQLGCFVPSEMCILSLVD 900

Query: 901  TIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAV 960
             IFTRLGA DRIMTGESTF +EC+ETASVLQ+ATQDSLVILDELGRGTSTFDGYAIAYAV
Sbjct: 901  VIFTRLGAIDRIMTGESTFYIECTETASVLQNATQDSLVILDELGRGTSTFDGYAIAYAV 960

Query: 961  FRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK---------DHELIFLYRL 1020
            FRHL+EKVNCRLLFATHYHPLTKEFASHPHV LQHMAC FK         D EL+FLYRL
Sbjct: 961  FRHLVEKVNCRLLFATHYHPLTKEFASHPHVTLQHMACAFKPKSGSYSKDDEELVFLYRL 1020

Query: 1021 RSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLK 1080
             SGACPESYGL+VA MAGIP +VVEAAS+A Q+MK++I ENF+SSEQRSEFS+LHE+WLK
Sbjct: 1021 ASGACPESYGLQVAAMAGIPEKVVEAASKAGQIMKKSIGENFQSSEQRSEFSSLHEDWLK 1080

Query: 1081 TLITVLEFKGNNLGEN--DAFDTLFCLWYELK 1088
            TL+   + +  N+  N  D +DTLFCLW+ELK
Sbjct: 1081 TLLNASQIEDCNVDNNDDDVYDTLFCLWHELK 1101

BLAST of Csa2G004730 vs. TrEMBL
Match: V4WD14_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007291mg PE=4 SV=1)

HSP 1 Score: 1431.0 bits (3703), Expect = 0.0e+00
Identity = 749/1112 (67.36%), Positives = 875/1112 (78.69%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQR----LTRFQTKPSAAGLEQPAIQTTADSS 60
            MQRQ+S+ SFFQK    N+S  G A   G R     T  Q  P      QP +  T DSS
Sbjct: 1    MQRQQSIHSFFQKCSPANKS--GAADMSGARKDSNFTTKQRNPVGDSSGQPTVSATEDSS 60

Query: 61   LEIRGTDTPPEKVPRQILPVIEK-----NRGSSLFSSIMHKFVRVDDKRKANERDEVQKD 120
            LEIRGTDTPPEKVPRQILP   K     + GSSLFSSIMHKFV+VD ++ AN+R+E   +
Sbjct: 61   LEIRGTDTPPEKVPRQILPSGFKANEGTSGGSSLFSSIMHKFVKVDARQNANKRNEQHGN 120

Query: 121  SSQNEVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNG--HRGPVLNIESNE 180
            SS          + S+ GK  D    S+   AS    K +V N NG  ++G V   E NE
Sbjct: 121  SST---------VCSVFGKTGDLEASSQQGTASLYSEKDNVFNCNGLANQGCVSCTEMNE 180

Query: 181  DIAGPETPGMRPSVSRLKRSQE--VSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISD 240
            D++GP+TPGM   V RLKR  E  +++ +    SL DS+KR++LLQDS+   K   E +D
Sbjct: 181  DVSGPDTPGMHRVVPRLKRILEDNLNIGDKKNSSLLDSSKRMRLLQDSVAGVKNCEEEAD 240

Query: 241  ATSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILL 300
             TSKFEWL+PS++RDANRRRPD PLYDK+TLYIPP+ LKKMSASQKQYWNVK QYMD+LL
Sbjct: 241  TTSKFEWLDPSKIRDANRRRPDDPLYDKRTLYIPPEALKKMSASQKQYWNVKSQYMDVLL 300

Query: 301  FFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGR 360
            FFKVGKFYELYE DAEIGHKELDWK+TLSGVGKCRQVG+ ESGID+AV+KLVARGYKVGR
Sbjct: 301  FFKVGKFYELYELDAEIGHKELDWKITLSGVGKCRQVGISESGIDDAVEKLVARGYKVGR 360

Query: 361  VEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSIS 420
            +EQLE++EQ K+R  NSVI RKLV V TPST  DG IGPDAVHLLAIKE +CG DN S+ 
Sbjct: 361  IEQLETSEQAKARHTNSVISRKLVNVVTPSTTVDGTIGPDAVHLLAIKEGNCGPDNGSVV 420

Query: 421  YGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPT 480
            YGFAFVDCAAL+ W G+I DDASCAALGALLMQVSPKE+IYE RGL KE  K L+K+S  
Sbjct: 421  YGFAFVDCAALRVWVGTINDDASCAALGALLMQVSPKEVIYENRGLCKEAQKALRKFS-A 480

Query: 481  GSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNH--ESTVHDDIALCALGGLI 540
            GS ALELT    VT+FL+ASEVK LVQ   YF GS + W+   E+ +  DI   ALGGLI
Sbjct: 481  GSAALELTPAMAVTDFLDASEVKKLVQLNGYFNGSSSPWSKALENVMQHDIGFSALGGLI 540

Query: 541  NHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCV 600
            +H+SRLMLDDVLRNGD+LPY+VYR CLRMDGQT+VNLEIF NN D G SGTL+KYLD+CV
Sbjct: 541  SHLSRLMLDDVLRNGDILPYKVYRDCLRMDGQTLVNLEIFNNNADSGSSGTLFKYLDSCV 600

Query: 601  TSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQI 660
            TSSGKRLLR WICHPLKDVE INNRL+VVE LM  S++++++   YLRKLPDLERLLG++
Sbjct: 601  TSSGKRLLRSWICHPLKDVEGINNRLDVVEYLMKNSEVVMVVAQ-YLRKLPDLERLLGRV 660

Query: 661  KATVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLII-SLPKVVKL 720
            KA VQ+S+ +VLPLI KK+ K++VK+FGSLVKGLR  +DLL+ + KEG II SL ++ K 
Sbjct: 661  KARVQASSCIVLPLIGKKVLKQQVKVFGSLVKGLRIAMDLLMLMHKEGHIIPSLSRIFKP 720

Query: 721  PQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHA 780
            P   G+ GLD+FLTQFEAA+DS+FPDYQNHDVTD  AE LSILIELF+EKA++WSEVIHA
Sbjct: 721  PIFDGSDGLDKFLTQFEAAIDSDFPDYQNHDVTDLDAETLSILIELFIEKASQWSEVIHA 780

Query: 781  LNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGE 840
            ++C+DVLRSFA+ A  S G+M RPLILPQS N  +  +  GPVLKI GLWHP+AL E+G 
Sbjct: 781  ISCIDVLRSFAVTASMSSGAMHRPLILPQSKNPAVRKDNGGPVLKIKGLWHPFALGENGG 840

Query: 841  TPVPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSV 900
             PVPND++LG D D   PRTLLLTGPNMGGKSTLLR+TCLAV+LAQLGC+VPCE C LS+
Sbjct: 841  LPVPNDILLGEDSDDCLPRTLLLTGPNMGGKSTLLRATCLAVILAQLGCFVPCEMCVLSL 900

Query: 901  VDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAY 960
             DTIFTRLGATDRIMTGESTFLVEC+ETASVLQ ATQDSLVILDELGRGTSTFDGYAIAY
Sbjct: 901  ADTIFTRLGATDRIMTGESTFLVECTETASVLQKATQDSLVILDELGRGTSTFDGYAIAY 960

Query: 961  AVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK---------DHELIFLY 1020
            AVFR L+E++NCRLLFATHYHPLTKEFASHPHV LQHMAC FK         D EL+FLY
Sbjct: 961  AVFRQLVERINCRLLFATHYHPLTKEFASHPHVTLQHMACAFKSNSENYSKGDQELVFLY 1020

Query: 1021 RLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEW 1080
            RL SGACPESYGL+VA MAG+P +VVEAAS A+  MK++I E+FKSSEQRSEFS+LHEEW
Sbjct: 1021 RLTSGACPESYGLQVAVMAGVPQKVVEAASHAALAMKKSIGESFKSSEQRSEFSSLHEEW 1080

Query: 1081 LKTLITVLEFKGNNLGENDAFDTLFCLWYELK 1088
            LKT++ V     N+  ++DA+DTLFCLW+ELK
Sbjct: 1081 LKTIVNVSRVDCNS-DDDDAYDTLFCLWHELK 1098

BLAST of Csa2G004730 vs. TrEMBL
Match: A0A061GMS3_THECC (MUTS isoform 1 OS=Theobroma cacao GN=TCM_037911 PE=4 SV=1)

HSP 1 Score: 1431.0 bits (3703), Expect = 0.0e+00
Identity = 737/1108 (66.52%), Positives = 884/1108 (79.78%), Query Frame = 1

Query: 1    MQRQKSLLSFFQK-SPSDNRSSDGCASSV-GQRLTRFQTKPSAAGLEQPAIQTTADSSLE 60
            MQRQKS+LSF QK SP+   S DG    V GQ  ++F +K      +         SSLE
Sbjct: 1    MQRQKSILSFLQKPSPA---SQDGIGGKVKGQEASQFPSKQ-----QNQNAAAVCGSSLE 60

Query: 61   IRGTDTPPEKVPRQILPV-IEKNRG----SSLFSSIMHKFVRVDDKRKANERDEVQKDSS 120
            + GTDTPPEKVPR++LP     N G    SS+FSSIMHKFVRVDDK  A++ +  + +SS
Sbjct: 61   VTGTDTPPEKVPRKVLPASFAANTGTRDSSSMFSSIMHKFVRVDDKENASQSNRARTNSS 120

Query: 121  QNEVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAG 180
              E      +LP +      PTE +K                      VL+IE+++D+ G
Sbjct: 121  NIE------ELPKVE-LTAQPTEMAK----------------------VLSIETDDDL-G 180

Query: 181  PETPGMRPSVSRLKRSQE--VSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSK 240
            PETP  RP VSRLKR Q       +    SL DS KR+KLLQDS   NK H +++D  SK
Sbjct: 181  PETPVTRPGVSRLKRIQGDLPKFGDKKDSSLLDSGKRVKLLQDSNVGNKNHKDVADIASK 240

Query: 241  FEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKV 300
            F+WL+PS+++D+NRRRP   LYDKKTLYIPPD LKKMSASQKQYW+VKCQYMD++LFFKV
Sbjct: 241  FDWLDPSRIKDSNRRRPGDSLYDKKTLYIPPDALKKMSASQKQYWSVKCQYMDVVLFFKV 300

Query: 301  GKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQL 360
            GKFYELYE DAEIGHKELDWKMT+SGVGKCRQVG+ ESGID+AVQKLVARGYKVGR+EQL
Sbjct: 301  GKFYELYEIDAEIGHKELDWKMTVSGVGKCRQVGISESGIDDAVQKLVARGYKVGRMEQL 360

Query: 361  ESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFA 420
            E++EQ K+RGANSVIPRKLVQV TPST  DG+IGPDAVHLLAIKE + G++  S  YGFA
Sbjct: 361  ETSEQAKARGANSVIPRKLVQVITPSTIVDGNIGPDAVHLLAIKEGNYGVEKGSTVYGFA 420

Query: 421  FVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTA 480
            FVDCAALKFW GSI DD++C+ALGALLMQVSPKE++YE+ GL +E HK LKKYS TGSTA
Sbjct: 421  FVDCAALKFWVGSISDDSTCSALGALLMQVSPKEVVYESAGLPREAHKALKKYSFTGSTA 480

Query: 481  LELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNH--ESTVHDDIALCALGGLINHMS 540
            ++L+    VT+FL+ASEV+ ++QS  YFKGSLN + +  +  +H D+ALCALGGL++H+S
Sbjct: 481  VQLSPALSVTDFLDASEVRNMIQSNGYFKGSLNSYINALDGVMHPDVALCALGGLVSHLS 540

Query: 541  RLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSG 600
            RLMLDD+LR+G++LPYQVY+GCLR+DGQT+VNLEIF N+ DGG SGTLYKYLD CVTSSG
Sbjct: 541  RLMLDDILRSGEVLPYQVYQGCLRIDGQTLVNLEIFNNSADGGSSGTLYKYLDYCVTSSG 600

Query: 601  KRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATV 660
            KRLLR WICHPLKDV+ INNRL+VVEELM+ S+ M+L+   YLRKLPDLERL+G++KA++
Sbjct: 601  KRLLRSWICHPLKDVDSINNRLDVVEELMSHSEKMLLIAQ-YLRKLPDLERLIGRVKASI 660

Query: 661  QSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISL-PKVVKLPQLS 720
            QSSASLVLP+I KK+ K+ VK FG+LVKGLR G+DLL  +QK+  ++SL  KV KLP LS
Sbjct: 661  QSSASLVLPMIGKKVLKQLVKAFGTLVKGLRIGMDLLKLLQKDADVVSLLSKVFKLPMLS 720

Query: 721  GNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCV 780
            G  GLD+FL QFEAA+DS+FP+YQNHD+TD+ AE LSILIELF+EKA +WS+VIHALNC+
Sbjct: 721  GTNGLDEFLGQFEAAIDSDFPNYQNHDLTDTDAETLSILIELFIEKAAQWSQVIHALNCI 780

Query: 781  DVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVP 840
            DVLRSFA+ A  S G+M+RPL+LPQS    L+ E  GP+LKI GLWHP+AL E+G  PVP
Sbjct: 781  DVLRSFAVTASLSFGAMARPLVLPQSKTVTLNQETGGPILKIKGLWHPFALGENGGLPVP 840

Query: 841  NDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTI 900
            ND+ +G D ++YHPR LLLTGPNMGGKSTLLR+TCLAV+LAQLG YVPCETC LS+VD I
Sbjct: 841  NDIFVGEDVNAYHPRALLLTGPNMGGKSTLLRATCLAVILAQLGSYVPCETCVLSLVDII 900

Query: 901  FTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFR 960
            FTRLGATDRIMTGESTFLVEC+ETASVLQ+ATQDSLV+LDELGRGTSTFDGYAIAYAVFR
Sbjct: 901  FTRLGATDRIMTGESTFLVECTETASVLQNATQDSLVLLDELGRGTSTFDGYAIAYAVFR 960

Query: 961  HLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK---------DHELIFLYRLRS 1020
            HL+EKV+CRLLFATHYHPLTKEFASHPHV LQHMAC+FK         + EL+FLYRL +
Sbjct: 961  HLVEKVHCRLLFATHYHPLTKEFASHPHVTLQHMACSFKLKSESCSKGEQELVFLYRLTN 1020

Query: 1021 GACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTL 1080
            G CPESYGL+VA MAGIP  VV+AAS A+Q+MK+++ E+F++SEQRSEFSTLHEEWLKTL
Sbjct: 1021 GPCPESYGLQVAIMAGIPEHVVDAASGAAQVMKRSVGESFRASEQRSEFSTLHEEWLKTL 1069

Query: 1081 ITVLEFKGNNLGENDAFDTLFCLWYELK 1088
            + V +    NL E DA+DTLFCLW+ELK
Sbjct: 1081 VNVSQVGNRNLDEGDAYDTLFCLWHELK 1069

BLAST of Csa2G004730 vs. TrEMBL
Match: F6HH29_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0016g04090 PE=4 SV=1)

HSP 1 Score: 1419.4 bits (3673), Expect = 0.0e+00
Identity = 736/1118 (65.83%), Positives = 874/1118 (78.18%), Query Frame = 1

Query: 1    MQRQKSLLSFFQK-SPSDNR--SSDGCASSVGQRLTRFQTKPSAAGL---EQPAIQTTAD 60
            MQRQKS+LSFFQK SP D +   S    +S G+ +++F  K         +QP  Q    
Sbjct: 1    MQRQKSILSFFQKPSPEDQKCGGSSAADTSAGRSVSQFPAKQRNQNFAVGDQPTFQIPKH 60

Query: 61   SSLEIRGTDTPPEKVPRQILPVI--------EKNRGSSLFSSIMHKFVRVDDKRKANERD 120
            SS+EI GTDTPPEKVPRQ++P            +  SSLFSSIMHKFV+VD++  + ER 
Sbjct: 61   SSMEITGTDTPPEKVPRQMIPASFTANDDRKAASSSSSLFSSIMHKFVKVDERESSCERK 120

Query: 121  EVQKDSSQN---EVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVL 180
            E+   SS      V  D   LP      +   + S  +   + +    V +L+   G   
Sbjct: 121  EMHSGSSNTCSTSVNSDCEVLPKEGNVFHSDAKESGFNSTKQVN---QVCSLHSESG--- 180

Query: 181  NIESNEDIAGPETPGMRPSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHN 240
                ++DI GPETPGMRP V RLKR QE +  N +  SL DS+KR+KLLQ+S   NK + 
Sbjct: 181  ----DDDIIGPETPGMRPFVPRLKRIQEDNFENKNECSLLDSSKRLKLLQNSTTGNKNYG 240

Query: 241  EISDATSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYM 300
            E+SD TSKFEWL+PS+ RDANRRRP   LYDK+TLYIPPD L+KMSASQKQYW++KCQYM
Sbjct: 241  EVSDTTSKFEWLDPSRKRDANRRRPGDALYDKRTLYIPPDALQKMSASQKQYWSIKCQYM 300

Query: 301  DILLFFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGY 360
            D++LFFKVGKFYELYE DAEIGHKELDWKMT SGVGKCRQVG+ ESGIDEAVQKL+ARGY
Sbjct: 301  DVVLFFKVGKFYELYELDAEIGHKELDWKMTFSGVGKCRQVGISESGIDEAVQKLIARGY 360

Query: 361  KVGRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDN 420
            KVGR+EQLE++EQ K+RG+ SVI RKLV V TPST  DG+IGPDAVHLL++KE +  L+N
Sbjct: 361  KVGRMEQLETSEQAKARGSTSVIQRKLVHVVTPSTACDGNIGPDAVHLLSVKEGNNILEN 420

Query: 421  NSISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKK 480
             S+ YGFAFVDCAALKFW GSI DDASCAALGALLMQVSPKE+IYE + LSKE  K LKK
Sbjct: 421  GSVIYGFAFVDCAALKFWIGSISDDASCAALGALLMQVSPKEVIYENQELSKEAQKALKK 480

Query: 481  YSPTGSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNH--ESTVHDDIALCAL 540
            YS +G TAL+LT     T+F++AS+V+ L+  K YFKGS N W+H  +  +H D+ALCAL
Sbjct: 481  YSLSGFTALKLTPLPLCTDFVDASKVRNLIHLKGYFKGSDNSWDHALDGVMHHDLALCAL 540

Query: 541  GGLINHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSG--TLYK 600
            GGL+ H+SRL LDD LRNGD+LPYQVY GCLRMDGQT+VNLEIF NN DGG SG  TLYK
Sbjct: 541  GGLLGHLSRLKLDDTLRNGDILPYQVYSGCLRMDGQTLVNLEIFSNNADGGSSGKCTLYK 600

Query: 601  YLDNCVTSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLE 660
            YLDNCVTSSGKRLLR WICHPLKDV+ INNRLNVVE LM  ++ M  +    LRKLPDLE
Sbjct: 601  YLDNCVTSSGKRLLRNWICHPLKDVQGINNRLNVVEHLMTNTETMSFIAQC-LRKLPDLE 660

Query: 661  RLLGQIKATVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLII-SL 720
            RLLGQ+KA+VQSSA L+LP   KKL K+RVK+FG LVKGLR  +DLL+Q+QKEG I+ SL
Sbjct: 661  RLLGQVKASVQSSALLLLPFFGKKLLKQRVKVFGLLVKGLRVAIDLLVQLQKEGHIMPSL 720

Query: 721  PKVVKLPQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEW 780
             +V+KLP LSG+ G+D+ LTQFEAA+DS+FP+Y+NHDVTDS AE LSILIELF+EK T+W
Sbjct: 721  SEVLKLPMLSGSSGVDKLLTQFEAAIDSDFPNYENHDVTDSDAEILSILIELFIEKTTQW 780

Query: 781  SEVIHALNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYA 840
             +VIHA+N +DVLRSFA+IA+ S G+MSRP+ILP S  + LS E +GP+LKI GLWHP+A
Sbjct: 781  LQVIHAINHIDVLRSFAVIANFSCGAMSRPVILPHSEPATLSGETRGPLLKIRGLWHPFA 840

Query: 841  LVESGETPVPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCE 900
            + E+G  PVPND+ LG D D  HPRTLLLTGPNMGGKSTLLR+TCLAV+LAQLG YVPC+
Sbjct: 841  IGENGGLPVPNDIHLGEDTDGNHPRTLLLTGPNMGGKSTLLRATCLAVILAQLGSYVPCK 900

Query: 901  TCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFD 960
             C LS+VD +FTRLGATDRIMTGESTF +EC+ETASVL++ATQDSLV+LDELGRGTSTFD
Sbjct: 901  MCILSLVDVVFTRLGATDRIMTGESTFFIECTETASVLRNATQDSLVLLDELGRGTSTFD 960

Query: 961  GYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK---------DH 1020
            GYAIAYAVFRHL+EKVNCRLLFATHYHPLTKEFASHPHV LQHMACTF          + 
Sbjct: 961  GYAIAYAVFRHLVEKVNCRLLFATHYHPLTKEFASHPHVTLQHMACTFNLKGEKSSGGEQ 1020

Query: 1021 ELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFS 1080
            EL+FLY+L SGACPESYGL+VA MAG+P  VVEAAS A +MMKQ+I E+F++SEQRSEFS
Sbjct: 1021 ELVFLYQLTSGACPESYGLQVALMAGVPKEVVEAASTAGRMMKQSIGESFRTSEQRSEFS 1080

Query: 1081 TLHEEWLKTLITVLEFKGNNLGENDAFDTLFCLWYELK 1088
            TLHEEWLK L+TV     +N  ++DA+DTLFCLW+E+K
Sbjct: 1081 TLHEEWLKALLTVSRLGEHNF-DDDAWDTLFCLWHEMK 1106

BLAST of Csa2G004730 vs. TAIR10
Match: AT3G24495.1 (MUTS homolog 7)

HSP 1 Score: 1309.3 bits (3387), Expect = 0.0e+00
Identity = 691/1120 (61.70%), Positives = 841/1120 (75.09%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNR----SSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSS 60
            MQRQ+S+LSFFQK  +       S D  +   G    RF  K   A  +       + S 
Sbjct: 1    MQRQRSILSFFQKPTAATTKGLVSGDAASGGGGSGGPRFNVKEGDAKGDASVRFAVSKSV 60

Query: 61   LEIRGTDTPPEKVPRQILPVIEK-----NRGSSLFSSIMHKFVRVDDKRKANERDEVQ-- 120
             E+RGTDTPPEKVPR++LP   K        SSLFS+IMHKFV+VDD+  + ER      
Sbjct: 61   DEVRGTDTPPEKVPRRVLPSGFKPAESAGDASSLFSNIMHKFVKVDDRDCSGERSREDVV 120

Query: 121  --KDSSQNEVGKDS-PQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIE 180
               DSS      D  PQ  S +GK  +            R+  F  +     R  V +I 
Sbjct: 121  PLNDSSLCMKANDVIPQFRSNNGKTQE------------RNHAFSFSGRAELRS-VEDIG 180

Query: 181  SNEDIAGPETPGMRPSVSRLKRSQEVSLVNCSGD-SLQDSTKRIKLLQDSINLNKIHNEI 240
             + D+ GPETPGMRP  SRLKR  E  +        + DS KR+K+LQD +   K   E+
Sbjct: 181  VDGDVPGPETPGMRPRASRLKRVLEDEMTFKEDKVPVLDSNKRLKMLQDPVCGEK--KEV 240

Query: 241  SDATSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDI 300
            ++ T KFEWL  S++RDANRRRPD PLYD+KTL+IPPDV KKMSASQKQYW+VK +YMDI
Sbjct: 241  NEGT-KFEWLESSRIRDANRRRPDDPLYDRKTLHIPPDVFKKMSASQKQYWSVKSEYMDI 300

Query: 301  LLFFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKV 360
            +LFFKVGKFYELYE DAE+GHKELDWKMT+SGVGKCRQVG+ ESGIDEAVQKL+ARGYKV
Sbjct: 301  VLFFKVGKFYELYELDAELGHKELDWKMTMSGVGKCRQVGISESGIDEAVQKLLARGYKV 360

Query: 361  GRVEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNS 420
            GR+EQLE+++Q K+RGAN++IPRKLVQV TPST ++G+IGPDAVHLLAIKE    L   S
Sbjct: 361  GRIEQLETSDQAKARGANTIIPRKLVQVLTPSTASEGNIGPDAVHLLAIKEIKMELQKCS 420

Query: 421  ISYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYS 480
              YGFAFVDCAAL+FW GSI DDASCAALGALLMQVSPKE++Y+++GLS+E  K L+KY+
Sbjct: 421  TVYGFAFVDCAALRFWVGSISDDASCAALGALLMQVSPKEVLYDSKGLSREAQKALRKYT 480

Query: 481  PTGSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWN--HESTVHDDIALCALGG 540
             TGSTA++L     V    +A+ V+ +++S  YFKGS   WN   +     D+AL ALG 
Sbjct: 481  LTGSTAVQLAPVPQVMGDTDAAGVRNIIESNGYFKGSSESWNCAVDGLNECDVALSALGE 540

Query: 541  LINHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDN 600
            LINH+SRL L+DVL++GD+ PYQVYRGCLR+DGQTMVNLEIF N+ DGG SGTLYKYLDN
Sbjct: 541  LINHLSRLKLEDVLKHGDIFPYQVYRGCLRIDGQTMVNLEIFNNSCDGGPSGTLYKYLDN 600

Query: 601  CVTSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLG 660
            CV+ +GKRLLR WICHPLKDVE IN RL+VVEE  A S+ M + G  YL KLPDLERLLG
Sbjct: 601  CVSPTGKRLLRNWICHPLKDVESINKRLDVVEEFTANSESMQITG-QYLHKLPDLERLLG 660

Query: 661  QIKATVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIIS-LPKVV 720
            +IK++V+SSAS++  L+ KK+ K+RVK FG +VKG R+G+DLL+ +QKE  ++S L K+ 
Sbjct: 661  RIKSSVRSSASVLPALLGKKVLKQRVKAFGQIVKGFRSGIDLLLALQKESNMMSLLYKLC 720

Query: 721  KLPQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVI 780
            KLP L G  GL+ FL+QFEAA+DS+FP+YQN DVTD  AE L+ILIELF+E+AT+WSEVI
Sbjct: 721  KLPILVGKSGLELFLSQFEAAIDSDFPNYQNQDVTDENAETLTILIELFIERATQWSEVI 780

Query: 781  HALNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVES 840
            H ++C+DVLRSFAI A  S GSM+RP+I P+S  +  + + +GP+LKI GLWHP+A+   
Sbjct: 781  HTISCLDVLRSFAIAASLSAGSMARPVIFPESEATDQNQKTKGPILKIQGLWHPFAVAAD 840

Query: 841  GETPVPNDMILG---LDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCET 900
            G+ PVPND++LG       S HPR+LLLTGPNMGGKSTLLR+TCLAV+ AQLGCYVPCE+
Sbjct: 841  GQLPVPNDILLGEARRSSGSIHPRSLLLTGPNMGGKSTLLRATCLAVIFAQLGCYVPCES 900

Query: 901  CTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDG 960
            C +S+VDTIFTRLGA+DRIMTGESTFLVEC+ETASVLQ+ATQDSLVILDELGRGTSTFDG
Sbjct: 901  CEISLVDTIFTRLGASDRIMTGESTFLVECTETASVLQNATQDSLVILDELGRGTSTFDG 960

Query: 961  YAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK----------DH 1020
            YAIAY+VFRHL+EKV CR+LFATHYHPLTKEFASHP V  +HMAC FK          D 
Sbjct: 961  YAIAYSVFRHLVEKVQCRMLFATHYHPLTKEFASHPRVTSKHMACAFKSRSDYQPRGCDQ 1020

Query: 1021 ELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFS 1080
            +L+FLYRL  GACPESYGL+VA MAGIP +VVE AS A+Q MK++I ENFKSSE RSEFS
Sbjct: 1021 DLVFLYRLTEGACPESYGLQVALMAGIPNQVVETASGAAQAMKRSIGENFKSSELRSEFS 1080

Query: 1081 TLHEEWLKTLITVLEFKGNN--LGENDAFDTLFCLWYELK 1088
            +LHE+WLK+L+ +     NN  +GE+D +DTLFCLW+E+K
Sbjct: 1081 SLHEDWLKSLVGISRVAHNNAPIGEDD-YDTLFCLWHEIK 1102

BLAST of Csa2G004730 vs. TAIR10
Match: AT4G02070.1 (MUTS homolog 6)

HSP 1 Score: 241.1 bits (614), Expect = 3.2e-63
Identity = 157/395 (39.75%), Positives = 225/395 (56.96%), Query Frame = 1

Query: 660  IRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQ 719
            ++K L+++R  L  + +  +  G D  +    E L  S+P   +L   S   G+ ++ T 
Sbjct: 905  LKKHLKEQRKLLGDASINYVTVGKDEYLLEVPESLSGSVPHDYEL--CSSKKGVSRYWTP 964

Query: 720  FEAAVDSEFPDYQNHDVTDSGAERLSI-LIELFVEKATEWSEVIHALNCVDVLRSFAIIA 779
                +  E    ++    +S  + +S  LI  F E   +W +++ A   +DVL S A  +
Sbjct: 965  TIKKLLKELSQAKSEK--ESALKSISQRLIGRFCEHQEKWRQLVSATAELDVLISLAFAS 1024

Query: 780  HSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVES--GETPVPNDM-ILGL 839
             S  G   RP+I   +++ +       P L   GL HP    +S    + VPN++ I G 
Sbjct: 1025 DSYEGVRCRPVISGSTSDGV-------PHLSATGLGHPVLRGDSLGRGSFVPNNVKIGGA 1084

Query: 840  DQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGAT 899
            ++ S+    +LLTGPNMGGKSTLLR  CLAV+LAQ+G  VP ET  +S VD I  R+GA 
Sbjct: 1085 EKASF----ILLTGPNMGGKSTLLRQVCLAVILAQIGADVPAETFEVSPVDKICVRMGAK 1144

Query: 900  DRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVN 959
            D IM G+STFL E SETA +L  AT++SLV+LDELGRGT+T DG AIA +V  H IEKV 
Sbjct: 1145 DHIMAGQSTFLTELSETAVMLTSATRNSLVVLDELGRGTATSDGQAIAESVLEHFIEKVQ 1204

Query: 960  CRLLFATHYHPLTKEFASHPHVMLQHMACTFKD-----HELIFLYRLRSGACPESYGLKV 1019
            CR  F+THYH L+ ++ ++P V L HMAC   +      E+ FLYRL  GACP+SYG+ V
Sbjct: 1205 CRGFFSTHYHRLSVDYQTNPKVSLCHMACQIGEGIGGVEEVTFLYRLTPGACPKSYGVNV 1264

Query: 1020 ATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQR 1046
            A +AG+P  V++ A   SQ  +    +N + ++ +
Sbjct: 1265 ARLAGLPDYVLQRAVIKSQEFEALYGKNHRKTDHK 1284


HSP 2 Score: 241.1 bits (614), Expect = 3.2e-63
Identity = 165/493 (33.47%), Positives = 259/493 (52.54%), Query Frame = 1

Query: 229 TSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLF 288
           + KF +L   + RDA RRRP    YD +TLY+PPD +KK++  Q+Q+W  K ++MD ++F
Sbjct: 341 SEKFRFLGVDR-RDAKRRRPTDENYDPRTLYLPPDFVKKLTGGQRQWWEFKAKHMDKVVF 400

Query: 289 FKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRV 348
           FK+GKFYEL+E DA +G KELD +        C   G PE      ++KLV +GY+V  V
Sbjct: 401 FKMGKFYELFEMDAHVGAKELDIQYMKGEQPHC---GFPEKNFSVNIEKLVRKGYRVLVV 460

Query: 349 EQLESAEQTKSR-----GANSVIPRKLVQVTTPSTKADGDI---GPDAVHLLAIKEESCG 408
           EQ E+ +Q + R       + V+ R++  V T  T  DG++    PDA +L+A+ E    
Sbjct: 461 EQTETPDQLEQRRKETGSKDKVVKREVCAVVTKGTLTDGEMLLTNPDASYLMALTEGGES 520

Query: 409 LDNNSI--SYGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETH 468
           L N +   ++G   VD A  K   G  KDD  C+AL  LL ++ P EII  A+ LS  T 
Sbjct: 521 LTNPTAEHNFGVCLVDVATQKIILGQFKDDQDCSALSCLLSEMRPVEIIKPAKVLSYATE 580

Query: 469 KV---------------LKKYSPTGSTALEL------TSGSPVTNFLEASEVKLLVQSKA 528
           +                L ++  +  T  E+       +  P + +  +SE K+L    +
Sbjct: 581 RTIVRQTRNPLVNNLVPLSEFWDSEKTIYEVGIIYKRINCQPSSAY--SSEGKILGDGSS 640

Query: 529 YFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVL---RNGDLLPYQVYRGC--- 588
           +    L+    E   +  +AL ALGG I ++ +  LD+ L      + LPY  +      
Sbjct: 641 FLPKMLSELATEDK-NGSLALSALGGAIYYLRQAFLDESLLRFAKFESLPYCDFSNVNEK 700

Query: 589 --LRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHPLKDVEEINN 648
             + +D   + NLEIF N+ +GG SGTLY  L+ C+T+SGKRLL+ W+  PL + E I  
Sbjct: 701 QHMVLDAAALENLEIFENSRNGGYSGTLYAQLNQCITASGKRLLKTWLARPLYNTELIKE 760

Query: 649 RLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSA-SLVLPLIRKKLQKRR 682
           R + V  L  ++    L     L +LPD+ERL+ ++ +++++S  +    ++ +   K++
Sbjct: 761 RQDAVAILRGENLPYSLEFRKSLSRLPDMERLIARMFSSIEASGRNGDKVVLYEDTAKKQ 820

BLAST of Csa2G004730 vs. TAIR10
Match: AT3G18524.1 (AT3G18524.1 MUTS homolog 2)

HSP 1 Score: 172.9 bits (437), Expect = 1.1e-42
Identity = 130/419 (31.03%), Positives = 204/419 (48.69%), Query Frame = 1

Query: 630  TYLRKLPDLERLLGQIKATVQSSASLVLPL-IRKKLQKRRVKLFGSL----------VKG 689
            T L  L D + LL Q    +    ++ L L + K L+  +   FG +          ++ 
Sbjct: 471  TKLASLKDQKELLEQQIHELHKKTAIELDLQVDKALKLDKAAQFGHVFRITKKEEPKIRK 530

Query: 690  LRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTD 749
              T   ++++ +K+G+  +  K+ KL              Q+++ VD    DY++     
Sbjct: 531  KLTTQFIVLETRKDGVKFTNTKLKKLGD------------QYQSVVD----DYRSCQKE- 590

Query: 750  SGAERLSILIELFVEKATEWSEVIH----ALNCVDVLRSFAIIAHSSRGSMSRPLILPQS 809
                    L++  VE  T +SEV       L+ +DVL SFA +A S      RP I    
Sbjct: 591  --------LVDRVVETVTSFSEVFEDLAGLLSEMDVLLSFADLAASCPTPYCRPEITSSD 650

Query: 810  NNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSYHPRTLLLTGPNMGG 869
               ++          + G  HP    +     +PND  L   +  +     ++TGPNMGG
Sbjct: 651  AGDIV----------LEGSRHPCVEAQDWVNFIPNDCRLMRGKSWFQ----IVTGPNMGG 710

Query: 870  KSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETAS 929
            KST +R   + V++AQ+G +VPC+  ++S+ D IF R+GA D  + G STF+ E  ETAS
Sbjct: 711  KSTFIRQVGVIVLMAQVGSFVPCDKASISIRDCIFARVGAGDCQLRGVSTFMQEMLETAS 770

Query: 930  VLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASH 989
            +L+ A+  SL+I+DELGRGTST+DG+ +A+A+  HL++      LFATH+H LT    ++
Sbjct: 771  ILKGASDKSLIIIDELGRGTSTYDGFGLAWAICEHLVQVKRAPTLFATHFHELTALAQAN 830

Query: 990  PHVMLQ-------HMACTF--KDHELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAA 1025
              V          H++     +  +L  LY++  GAC +S+G+ VA  A  P  VV  A
Sbjct: 831  SEVSGNTVGVANFHVSAHIDTESRKLTMLYKVEPGACDQSFGIHVAEFANFPESVVALA 850


HSP 2 Score: 69.3 bits (168), Expect = 1.6e-11
Identity = 90/374 (24.06%), Positives = 158/374 (42.25%), Query Frame = 1

Query: 409 GFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEAR-GLSKETHKVLKKYSPT 468
           G A+VD            DD+    L + L+ +  KE I+ A  G S E   +       
Sbjct: 159 GMAYVDLTRRVLGLAEFLDDSRFTNLESSLIALGAKECIFPAESGKSNECKSLYDSLERC 218

Query: 469 GSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHD-----DIALCALG 528
                E          L+ S++K LV      KG++        V D     D+A  ALG
Sbjct: 219 AVMITERKKHEFKGRDLD-SDLKRLV------KGNIE------PVRDLVSGFDLATPALG 278

Query: 529 GLINHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLD 588
            L++    L  +D   N  +  Y +  G +R+D   M  L +  +  D   + +L+  ++
Sbjct: 279 ALLSFSELLSNEDNYGNFTIRRYDI-GGFMRLDSAAMRALNVMESKTDANKNFSLFGLMN 338

Query: 589 N-CVTSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERL 648
             C    GKRLL +W+  PL D+ EI  RL++V+  + ++ +   L   +L+++ D+ERL
Sbjct: 339 RTCTAGMGKRLLHMWLKQPLVDLNEIKTRLDIVQCFVEEAGLRQDL-RQHLKRISDVERL 398

Query: 649 L-------GQIKATVQSSASLV-LPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEG 708
           L       G ++  ++   S + LP I+  +Q+   +    + +     L+ L      G
Sbjct: 399 LRSLERRRGGLQHIIKLYQSTIRLPFIKTAMQQYTGEFASLISERYLKKLEALSDQDHLG 458

Query: 709 LIISLPKV-VKLPQLSGNGGL--DQFLTQFEAAVD-SEFPDYQNHDVTDSGAERLSILIE 762
             I L +  V L QL     +    + T+  +  D  E  + Q H++    A  L + ++
Sbjct: 459 KFIDLVECSVDLDQLENGEYMISSSYDTKLASLKDQKELLEQQIHELHKKTAIELDLQVD 517

BLAST of Csa2G004730 vs. TAIR10
Match: AT4G25540.1 (AT4G25540.1 homolog of DNA mismatch repair protein MSH3)

HSP 1 Score: 160.6 bits (405), Expect = 5.4e-39
Identity = 99/298 (33.22%), Positives = 152/298 (51.01%), Query Frame = 1

Query: 748  IELFVEKATEWSEVIHALNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPV 807
            ++ F    T++   + AL  +D L S + +      S ++  + P+  +     E     
Sbjct: 731  LKSFSRYYTDFKAAVQALAALDCLHSLSTL------SRNKNYVRPEFVDDCEPVE----- 790

Query: 808  LKINGLWHPYALVESGETPVPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVV 867
            + I    HP       +  VPND IL  + +       ++TGPNMGGKS  +R   L  +
Sbjct: 791  INIQSGRHPVLETILQDNFVPNDTILHAEGEYCQ----IITGPNMGGKSCYIRQVALISI 850

Query: 868  LAQLGCYVPCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVIL 927
            +AQ+G +VP     L V+D +FTR+GA+D I  G STFL E SE + +++  +  SLVIL
Sbjct: 851  MAQVGSFVPASFAKLHVLDGVFTRMGASDSIQHGRSTFLEELSEASHIIRTCSSRSLVIL 910

Query: 928  DELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHP---------HVM 987
            DELGRGTST DG AIAYA  +HL+ +  C +LF THY  + +     P         ++ 
Sbjct: 911  DELGRGTSTHDGVAIAYATLQHLLAEKRCLVLFVTHYPEIAEISNGFPGSVGTYHVSYLT 970

Query: 988  LQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIK 1037
            LQ    ++   ++ +LY+L  G C  S+G KVA +A IP   +  A   +  ++  ++
Sbjct: 971  LQKDKGSYDHDDVTYLYKLVRGLCSRSFGFKVAQLAQIPPSCIRRAISMAAKLEAEVR 1013


HSP 2 Score: 104.8 bits (260), Expect = 3.5e-22
Identity = 103/390 (26.41%), Positives = 174/390 (44.62%), Query Frame = 1

Query: 266 KKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVG 325
           +K +  ++Q   +K +Y D++L  +VG  Y  + +DAEI  + L     +          
Sbjct: 102 RKYTPLEQQVVELKSKYPDVVLMVEVGYRYRFFGEDAEIAARVLGIYAHMDH--NFMTAS 161

Query: 326 VPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSRGANSVIP--RKLVQVTTPST-KADG 385
           VP   ++  V++LV  GYK+G V+Q E+A   KS GAN   P  R L  + T +T +A  
Sbjct: 162 VPTFRLNFHVRRLVNAGYKIGVVKQTETAA-IKSHGANRTGPFFRGLSALYTKATLEAAE 221

Query: 386 DI----------GPDAVHLLAIKEE-------SCGLDNN-SISYGFAFVDCAALKFWTGS 445
           DI          G  +  L+ + +E        CG++ +  +  G   V+ +  +     
Sbjct: 222 DISGGCGGEEGFGSQSNFLVCVVDERVKSETLGCGIEMSFDVRVGVVGVEISTGEVVYEE 281

Query: 446 IKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYS-PTGSTALELTSGSPVTNF 505
             D+   + L A+++ +SP E++   + LS++T K L  ++ PT +  +E  S    +N 
Sbjct: 282 FNDNFMRSGLEAVILSLSPAELLL-GQPLSQQTEKFLVAHAGPTSNVRVERASLDCFSNG 341

Query: 506 LEASEVKLLVQS--------------KAYFKGSLNLWNHESTVHDDIALCALGGLINHMS 565
               EV  L +               +A  KG   L  H       + + AL     H+ 
Sbjct: 342 NAVDEVISLCEKISAGNLEDDKEMKLEAAEKGMSCLTVHTIMNMPHLTVQALALTFCHLK 401

Query: 566 RLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSG 620
           +   + +L  G           + +   T+  LE+ +NN DG  SG+L+  +++ +T  G
Sbjct: 402 QFGFERILYQGASFRSLSSNTEMTLSANTLQQLEVVKNNSDGSESGSLFHNMNHTLTVYG 461

BLAST of Csa2G004730 vs. TAIR10
Match: AT4G17380.1 (AT4G17380.1 MUTS-like protein 4)

HSP 1 Score: 154.5 bits (389), Expect = 3.9e-37
Identity = 86/224 (38.39%), Positives = 129/224 (57.59%), Query Frame = 1

Query: 820  VESGETPV----PNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYV 879
            +++G  P+     ND +      S     L++ GPNM GKST L+  CL V+LAQ+GCYV
Sbjct: 520  IDAGRHPILESIHNDFVSNSIFMSEATNMLVVMGPNMSGKSTYLQQVCLVVILAQIGCYV 579

Query: 880  PCETCTLSVVDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTS 939
            P    T+ VVD IFTR+G  D + +  STF+ E  ETA ++Q+ T  SL+++DELGR TS
Sbjct: 580  PARFATIRVVDRIFTRMGTMDNLESNSSTFMTEMRETAFIMQNVTNRSLIVMDELGRATS 639

Query: 940  TFDGYAIAYAVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLY 999
            + DG A+A++   +L+  +    +FATH   L +    +P+V + H     +D+ L F +
Sbjct: 640  SSDGLAMAWSCCEYLL-SLKAYTVFATHMDSLAELATIYPNVKVLHFYVDIRDNRLDFKF 699

Query: 1000 RLRSGAC-PESYGLKVATMAGIPGRVVEAASRASQMMKQTIKEN 1039
            +LR G      YGL +A +AG+P  V++ A   ++ +  T KEN
Sbjct: 700  QLRDGTLHVPHYGLLLAEVAGLPSTVIDTARIITKRI--TDKEN 740


HSP 2 Score: 47.0 bits (110), Expect = 8.8e-05
Identity = 43/159 (27.04%), Positives = 70/159 (44.03%), Query Frame = 1

Query: 552 LRMDGQTMVNLEIFRNNDDGGLSGT------LYKYLDNCVTSSGKRLLRLWICHPLKDVE 611
           + +D  ++ NLE+  +     L GT      L++      T+ G RLLR  +  PLKD+E
Sbjct: 165 MNIDATSVENLELI-DPFHNALLGTSNKKRSLFQMFKTTKTAGGTRLLRANLLQPLKDIE 224

Query: 612 EINNRLNVVEELMAQSDIMVLLGTTYLRKLP-DLERLLGQIKATVQSSASLVLPLIRKKL 671
            IN RL+ ++ELM+   +   L +  LRK P + +R+L       +     V+     + 
Sbjct: 225 TINTRLDCLDELMSNEQLFFGL-SQVLRKFPKETDRVLCHFCFKPKKVTEAVIGFENTRK 284

Query: 672 QKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVK 704
            +  +     L+K     L +L +V K+     L  V K
Sbjct: 285 SQNMISSI-ILLKTALDALPILAKVLKDAKCFLLANVYK 320

BLAST of Csa2G004730 vs. NCBI nr
Match: gi|449443325|ref|XP_004139430.1| (PREDICTED: DNA mismatch repair protein MSH7 [Cucumis sativus])

HSP 1 Score: 2175.2 bits (5635), Expect = 0.0e+00
Identity = 1095/1095 (100.00%), Positives = 1095/1095 (100.00%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR 60
            MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR
Sbjct: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR 60

Query: 61   GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD 120
            GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD
Sbjct: 61   GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD 120

Query: 121  SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR 180
            SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR
Sbjct: 121  SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR 180

Query: 181  PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV 240
            PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV
Sbjct: 181  PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV 240

Query: 241  RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ 300
            RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ
Sbjct: 241  RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ 300

Query: 301  DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR 360
            DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR
Sbjct: 301  DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR 360

Query: 361  GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF 420
            GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF
Sbjct: 361  GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF 420

Query: 421  WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV 480
            WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV
Sbjct: 421  WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV 480

Query: 481  TNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNG 540
            TNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNG
Sbjct: 481  TNFLEASEVKLLVQSKAYFKGSLNLWNHESTVHDDIALCALGGLINHMSRLMLDDVLRNG 540

Query: 541  DLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHP 600
            DLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHP
Sbjct: 541  DLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWICHP 600

Query: 601  LKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLI 660
            LKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLI
Sbjct: 601  LKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLPLI 660

Query: 661  RKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQF 720
            RKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQF
Sbjct: 661  RKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLTQF 720

Query: 721  EAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHS 780
            EAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHS
Sbjct: 721  EAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIAHS 780

Query: 781  SRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSY 840
            SRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSY
Sbjct: 781  SRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQDSY 840

Query: 841  HPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMT 900
            HPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMT
Sbjct: 841  HPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRIMT 900

Query: 901  GESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLF 960
            GESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLF
Sbjct: 901  GESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRLLF 960

Query: 961  ATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRV 1020
            ATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRV
Sbjct: 961  ATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPGRV 1020

Query: 1021 VEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLF 1080
            VEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLF
Sbjct: 1021 VEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDTLF 1080

Query: 1081 CLWYELKRDRITARI 1096
            CLWYELKRDRITARI
Sbjct: 1081 CLWYELKRDRITARI 1095

BLAST of Csa2G004730 vs. NCBI nr
Match: gi|659071128|ref|XP_008458258.1| (PREDICTED: DNA mismatch repair protein MSH7 [Cucumis melo])

HSP 1 Score: 2077.0 bits (5380), Expect = 0.0e+00
Identity = 1045/1090 (95.87%), Positives = 1064/1090 (97.61%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTTADSSLEIR 60
            MQRQKSLLSFFQKSPSD RSSDG ASS+G+RLT F  KPSAAGLEQPAIQTTA SSLEIR
Sbjct: 1    MQRQKSLLSFFQKSPSDYRSSDGGASSIGERLTCFPPKPSAAGLEQPAIQTTAHSSLEIR 60

Query: 61   GTDTPPEKVPRQILPVIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQNEVGKD 120
            GTDTPPEKVPRQILP IEKNRGSSLFSSIMHKFVRVDDKRKANERD VQ+DSSQNEVGKD
Sbjct: 61   GTDTPPEKVPRQILPAIEKNRGSSLFSSIMHKFVRVDDKRKANERDGVQEDSSQNEVGKD 120

Query: 121  SPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNGHRGPVLNIESNEDIAGPETPGMR 180
            SPQLPSI GKVNDPTEFSKLDVASRRHGKFD+ANLNGHRGPVLNIES+EDIAGPETPGMR
Sbjct: 121  SPQLPSIYGKVNDPTEFSKLDVASRRHGKFDIANLNGHRGPVLNIESDEDIAGPETPGMR 180

Query: 181  PSVSRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATSKFEWLNPSQV 240
            PS+SRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINL KIHNEISDATSKFEWLNPSQV
Sbjct: 181  PSISRLKRSQEVSLVNCSGDSLQDSTKRIKLLQDSINLKKIHNEISDATSKFEWLNPSQV 240

Query: 241  RDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ 300
            RDANRRRP HPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ
Sbjct: 241  RDANRRRPGHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFKVGKFYELYEQ 300

Query: 301  DAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQLESAEQTKSR 360
            DAEIGH+ELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVA GYKVGRVEQLESA+QTKSR
Sbjct: 301  DAEIGHRELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVALGYKVGRVEQLESADQTKSR 360

Query: 361  GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF 420
            GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF
Sbjct: 361  GANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGFAFVDCAALKF 420

Query: 421  WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGSTALELTSGSPV 480
            WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTG TALE TSGSPV
Sbjct: 421  WTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGFTALEFTSGSPV 480

Query: 481  TNFLEASEVKLLVQSKAYFKGSLNLWNH--ESTVHDDIALCALGGLINHMSRLMLDDVLR 540
            TNFLEASEVKLLVQSKAYFKGSLNLWN   ESTVHDDIALCALGGLINHMSRLMLDDVLR
Sbjct: 481  TNFLEASEVKLLVQSKAYFKGSLNLWNQTIESTVHDDIALCALGGLINHMSRLMLDDVLR 540

Query: 541  NGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSSGKRLLRLWIC 600
            NGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNC+TSSGKRLLRLWIC
Sbjct: 541  NGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCMTSSGKRLLRLWIC 600

Query: 601  HPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLVLP 660
            HPLKDVEEINNRLNVVEELMAQS+IMVLLGTTYLRKLPDLERLLGQIKATVQSSASL LP
Sbjct: 601  HPLKDVEEINNRLNVVEELMAQSEIMVLLGTTYLRKLPDLERLLGQIKATVQSSASLALP 660

Query: 661  LIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLT 720
            LIRKKLQKRRVKLFGSLVKGL TGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLT
Sbjct: 661  LIRKKLQKRRVKLFGSLVKGLSTGLDLLIQVQKEGLIISLPKVVKLPQLSGNGGLDQFLT 720

Query: 721  QFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNCVDVLRSFAIIA 780
            QFEAA+DSEFPDYQNHDVTDSGAERLSILIELFVEKATEWS+VIHALNC+DVLRSFAIIA
Sbjct: 721  QFEAAIDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSKVIHALNCIDVLRSFAIIA 780

Query: 781  HSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDMILGLDQD 840
            HSSRGSMSRPLILPQS+NSMLSPEKQGPVLKINGLWHPYALVESGETPVPND+ILG DQ 
Sbjct: 781  HSSRGSMSRPLILPQSSNSMLSPEKQGPVLKINGLWHPYALVESGETPVPNDIILGPDQH 840

Query: 841  SYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVDTIFTRLGATDRI 900
             YHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCY+PCETCTLSVVDTIFTRLGATDRI
Sbjct: 841  GYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYIPCETCTLSVVDTIFTRLGATDRI 900

Query: 901  MTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRL 960
            MTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRL
Sbjct: 901  MTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAVFRHLIEKVNCRL 960

Query: 961  LFATHYHPLTKEFASHPHVMLQHMACTFKDHELIFLYRLRSGACPESYGLKVATMAGIPG 1020
            LFATHYHPLTKEFASHPHVMLQHMACTF D ELIFLYRLRSGACPESYGLKVATMAGIPG
Sbjct: 961  LFATHYHPLTKEFASHPHVMLQHMACTFNDQELIFLYRLRSGACPESYGLKVATMAGIPG 1020

Query: 1021 RVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITVLEFKGNNLGENDAFDT 1080
            RVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLIT+ EFKGN+L ENDAFDT
Sbjct: 1021 RVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKTLITISEFKGNDLDENDAFDT 1080

Query: 1081 LFCLWYELKR 1089
            LFCLWYELK+
Sbjct: 1081 LFCLWYELKK 1090

BLAST of Csa2G004730 vs. NCBI nr
Match: gi|802627380|ref|XP_012076663.1| (PREDICTED: DNA mismatch repair protein MSH7 [Jatropha curcas])

HSP 1 Score: 1440.6 bits (3728), Expect = 0.0e+00
Identity = 749/1112 (67.36%), Positives = 879/1112 (79.05%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQTKPSAAGLEQPAIQTT---ADSSL 60
            MQRQKS+LSFFQK    ++ +D   +   ++   F +K     +  P    T    DSSL
Sbjct: 1    MQRQKSILSFFQKPSPASQKADSGGTLNERKAPHFSSKQENQKVVSPGKPDTHGSVDSSL 60

Query: 61   EIRGTDTPPEKVPRQILP----VIEKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSS 120
            E+RGTDTPPEKVPRQ+LP    V E   GSSLFSSIMHKFV+VD K K  ER +V   S+
Sbjct: 61   EVRGTDTPPEKVPRQVLPGSYSVNENTTGSSLFSSIMHKFVKVDSKEKPLERVQVHHPSN 120

Query: 121  QNEVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNG--HRGPVLNIESNEDI 180
                      + S+SG++ D   +SK         K +  + NG    G VL ++S+ D+
Sbjct: 121  D---------ICSVSGRLIDTKGWSKQRTDVLHLEKNNAYSSNGMVDHGDVLLLKSSNDV 180

Query: 181  AGPETPGMRPSVSRLKRSQEVS--LVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDAT 240
             GPETPG++P V RLKR Q+ S    + SG SL +++KR+KLL DS   +K    I D+T
Sbjct: 181  PGPETPGVQPLVPRLKRIQDDSSKFDDRSGCSLLNASKRMKLLLDSTASSKNQGVIFDST 240

Query: 241  SKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFF 300
            SKFEWL+P ++RDAN RR   PLYDKKTLYIPPD LKKMSASQKQYW++K QYMDILLFF
Sbjct: 241  SKFEWLDPLRIRDANGRRLSDPLYDKKTLYIPPDTLKKMSASQKQYWSIKSQYMDILLFF 300

Query: 301  KVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVE 360
            KVGKFYELYE DAEIGHKELDWKMTLSGVGKCRQVG+ ESGID+AV+KLVARGYKVGR+E
Sbjct: 301  KVGKFYELYELDAEIGHKELDWKMTLSGVGKCRQVGISESGIDDAVEKLVARGYKVGRIE 360

Query: 361  QLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYG 420
            QLE++ Q K+RGANSVIPRKLVQV TPST  DG+IGPDAVHLLAIKE +CGLDN + SYG
Sbjct: 361  QLETSGQAKARGANSVIPRKLVQVVTPSTATDGNIGPDAVHLLAIKEGNCGLDNGATSYG 420

Query: 421  FAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGS 480
            FAFVDCAAL+FW GSI DD S AALGALLMQVSPKE+IYE+ G+SKE  K L+KYS TGS
Sbjct: 421  FAFVDCAALRFWVGSINDDTSYAALGALLMQVSPKEVIYESGGMSKEAQKALRKYSLTGS 480

Query: 481  TALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNHE--STVHDDIALCALGGLINH 540
             AL+LT     T+FL  SEV+ L+QSK YF GS N WN+   S +H DIAL ALGGL+ H
Sbjct: 481  -ALQLTPVQSTTDFLHGSEVRNLIQSKGYFSGSSNPWNNAIVSVLHHDIALSALGGLVGH 540

Query: 541  MSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTS 600
            +SRLMLDDVLRNGD+ PYQVY GCLRMDGQT++NLEIF NN DGGLSGTL+ +LDNCVTS
Sbjct: 541  LSRLMLDDVLRNGDIQPYQVYTGCLRMDGQTLINLEIFNNNADGGLSGTLFNHLDNCVTS 600

Query: 601  SGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKA 660
            SGKRLLR WICHPLK V+ IN+RLNVVEEL+ +S+IM+++   YLRKLPD+ER+LG++KA
Sbjct: 601  SGKRLLRKWICHPLKCVKGINDRLNVVEELINRSEIMLVIAQ-YLRKLPDIERMLGRVKA 660

Query: 661  TVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLIISL-PKVVKLPQ 720
            + Q+SASL LPLI KK+ K+RVK+FG LVKGLRTG+DLL+ +QKE  I+ L  K+ KLP+
Sbjct: 661  SFQASASLALPLIGKKMLKQRVKVFGCLVKGLRTGMDLLLLLQKESQIMLLFLKIFKLPE 720

Query: 721  LSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALN 780
            L+G+ GLD+FL QFEAAVDSEFPDYQNHDVTDS AE LS+LIELF+EKAT+WSE+IHA+N
Sbjct: 721  LNGSAGLDKFLAQFEAAVDSEFPDYQNHDVTDSEAETLSVLIELFIEKATQWSEIIHAIN 780

Query: 781  CVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETP 840
            C+DVLRSFA+ A  S GSMSRP+IL  S  +  S E  GPVLKI GLWHP+AL E+G  P
Sbjct: 781  CIDVLRSFAVTASMSSGSMSRPVILSDSKTTTFSREAAGPVLKIKGLWHPFALGENGGLP 840

Query: 841  VPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVD 900
            VPND+ LG    SYHP TLLLTGPNMGGKSTLLR+TCLAV+LAQLGC+VP E C LS+VD
Sbjct: 841  VPNDLNLGEHPGSYHPHTLLLTGPNMGGKSTLLRATCLAVILAQLGCFVPSEMCILSLVD 900

Query: 901  TIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAV 960
             IFTRLGA DRIMTGESTF +EC+ETASVLQ+ATQDSLVILDELGRGTSTFDGYAIAYAV
Sbjct: 901  VIFTRLGAIDRIMTGESTFYIECTETASVLQNATQDSLVILDELGRGTSTFDGYAIAYAV 960

Query: 961  FRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK---------DHELIFLYRL 1020
            FRHL+EKVNCRLLFATHYHPLTKEFASHPHV LQHMAC FK         D EL+FLYRL
Sbjct: 961  FRHLVEKVNCRLLFATHYHPLTKEFASHPHVTLQHMACAFKPKSGSYSKDDEELVFLYRL 1020

Query: 1021 RSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLK 1080
             SGACPESYGL+VA MAGIP +VVEAAS+A Q+MK++I ENF+SSEQRSEFS+LHE+WLK
Sbjct: 1021 ASGACPESYGLQVAAMAGIPEKVVEAASKAGQIMKKSIGENFQSSEQRSEFSSLHEDWLK 1080

Query: 1081 TLITVLEFKGNNLGEN--DAFDTLFCLWYELK 1088
            TL+   + +  N+  N  D +DTLFCLW+ELK
Sbjct: 1081 TLLNASQIEDCNVDNNDDDVYDTLFCLWHELK 1101

BLAST of Csa2G004730 vs. NCBI nr
Match: gi|1009160401|ref|XP_015898331.1| (PREDICTED: DNA mismatch repair protein MSH7 [Ziziphus jujuba])

HSP 1 Score: 1439.5 bits (3725), Expect = 0.0e+00
Identity = 743/1109 (67.00%), Positives = 887/1109 (79.98%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQRLTRFQ-TKPSAAGLEQPAIQTTADSSLEI 60
            MQRQKS+LSFFQK   +N++S        +R+ +F  T+ + +G +QP      D  +EI
Sbjct: 1    MQRQKSILSFFQKPSQENQNSGT------RRVPQFPVTQRNVSGSDQPK---ATDPVVEI 60

Query: 61   RGTDTPPEKVPRQILPVI----EKNRGSSLFSSIMHKFVRVDDKRKANERDEVQKDSSQN 120
            RGTDTPPEKVPRQI P      + + GSSLFSSIMHKFV+ DD+ +A++R++    SS+ 
Sbjct: 61   RGTDTPPEKVPRQIFPASFVANDDSSGSSLFSSIMHKFVKADDRERASDRNQSNCGSSEL 120

Query: 121  EVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNG--HRGPVLNIESNEDIAG 180
             V         +S K+N+P    K+ VA+++ G+ ++    G  ++  VLNIES+++I G
Sbjct: 121  HV---------VSEKINEPEGSPKIGVAAQQGGEDNIVKSTGKVYQSCVLNIESDDNIPG 180

Query: 181  PETPGMRPSVSRLKRSQE---VSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISDATS 240
            PETPGM+P V RLKR QE    S  NC   +L  S+KR+K+ +DS+ LNK   ++SD  S
Sbjct: 181  PETPGMQPLVPRLKRIQEGGPKSEDNCDR-ALLGSSKRMKMFEDSMLLNKNDKDVSDTAS 240

Query: 241  KFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILLFFK 300
            KF+WL+PSQ+RDAN+RRPD PLYDK TLY+PP+   KMSASQKQYW+ KCQYMD+LLFFK
Sbjct: 241  KFDWLDPSQIRDANKRRPDDPLYDKTTLYVPPNAFTKMSASQKQYWSTKCQYMDVLLFFK 300

Query: 301  VGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGRVEQ 360
            VGKFYELYE DAEIGHKELDWK+TLSGVGKCRQVG+ ESGID+AVQKLVARGYKVGR+EQ
Sbjct: 301  VGKFYELYELDAEIGHKELDWKITLSGVGKCRQVGISESGIDDAVQKLVARGYKVGRIEQ 360

Query: 361  LESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSISYGF 420
            LE++++ K+RGANSVI RKLV+V +PST  D  IGPDAVHLLAIKE   G+DN    YGF
Sbjct: 361  LETSDKAKARGANSVISRKLVEVVSPSTATDYHIGPDAVHLLAIKE--VGMDNGQTVYGF 420

Query: 421  AFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPTGST 480
            AFVDCAALKFW GSI DDASCAALGALL+QVSPKE+IYE+RGLSKE  K L+KYS TG +
Sbjct: 421  AFVDCAALKFWVGSINDDASCAALGALLLQVSPKELIYESRGLSKEVQKALRKYSLTGPS 480

Query: 481  ALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNH--ESTVHDDIALCALGGLINHM 540
            AL+LT   P+  F +ASEV+ LV SK YFKGSLNL NH  +S +H D+ L ALGGLINH+
Sbjct: 481  ALQLTPMQPI--FADASEVRNLVHSKGYFKGSLNLQNHALKSVIHPDVTLSALGGLINHL 540

Query: 541  SRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCVTSS 600
            SRLMLDDVLRNGDLLPYQVYRGCL+MDGQT+VNLEIF NN DGG +GTLYKYLDNCVTSS
Sbjct: 541  SRLMLDDVLRNGDLLPYQVYRGCLKMDGQTLVNLEIFSNNADGGPAGTLYKYLDNCVTSS 600

Query: 601  GKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQIKAT 660
            GKRLLR WICHPL DVE INNRLNVVE+++A  +IM+L+G  YLRK+PDLERLLG+I+A+
Sbjct: 601  GKRLLRTWICHPLMDVEGINNRLNVVEDMLAHPEIMLLVG-QYLRKIPDLERLLGRIRAS 660

Query: 661  VQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQ-KEGLIISLPKVVKLPQL 720
             QSSA+L+LPL+ KK+ K+RVK FG+LVKGLR G+DLL+ +Q +E L   L KV KLP  
Sbjct: 661  FQSSAALLLPLLGKKVLKQRVKAFGTLVKGLRVGMDLLLLLQTEEHLSTPLLKVFKLPLF 720

Query: 721  SGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHALNC 780
            SG+ GLD+FLTQFEAA+DS+FP+Y+NHDVTD  AE +SILIELF+EKATEWSE+IHA+NC
Sbjct: 721  SGSDGLDKFLTQFEAAIDSDFPNYKNHDVTDKDAEIISILIELFIEKATEWSEIIHAINC 780

Query: 781  VDVLRSFAIIAHSSRG-SMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGETP 840
            VDVLRSF + A SS G +MSRP ILP   N +LS E +GP+LK  GLWHP+AL E+G  P
Sbjct: 781  VDVLRSFTVTASSSCGAAMSRPFILPLLKNVVLSQETRGPILKAEGLWHPFALGENG-MP 840

Query: 841  VPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSVVD 900
            VPND+ILG D   YHPRTLLLTGPNMGGKSTLLR+ CLAV+LAQLGCYVPCE C +S+VD
Sbjct: 841  VPNDIILGEDTSGYHPRTLLLTGPNMGGKSTLLRTACLAVILAQLGCYVPCEMCVISLVD 900

Query: 901  TIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAYAV 960
            TIFTRLGATDRIM GESTF VEC+ETASVLQ+ATQDSLVILDELGRGTSTFDGYAIAYAV
Sbjct: 901  TIFTRLGATDRIMAGESTFFVECTETASVLQNATQDSLVILDELGRGTSTFDGYAIAYAV 960

Query: 961  FRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK--------DHELIFLYRLR 1020
             RHLIEKVNCRLLFATHYHPLTKEFASHPHV LQHMACTF+          EL+FLYRL 
Sbjct: 961  LRHLIEKVNCRLLFATHYHPLTKEFASHPHVNLQHMACTFRSKSECSSESKELVFLYRLA 1020

Query: 1021 SGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEWLKT 1080
            SGACPESYGL+VA MAGIP +V+  AS+A Q+++++I E+F+ SE+RSEFSTLHEEWL  
Sbjct: 1021 SGACPESYGLQVAVMAGIPEQVIRTASKAGQVIRKSIGESFRVSERRSEFSTLHEEWLTN 1080

Query: 1081 LITVLEFKGNNLGENDAFDTLFCLWYELK 1088
            L+ V   +     E+D  DTLFCLW+ELK
Sbjct: 1081 LMAVSRIEDGKFDEDDVLDTLFCLWHELK 1084

BLAST of Csa2G004730 vs. NCBI nr
Match: gi|567918206|ref|XP_006451109.1| (hypothetical protein CICLE_v10007291mg [Citrus clementina])

HSP 1 Score: 1431.0 bits (3703), Expect = 0.0e+00
Identity = 749/1112 (67.36%), Positives = 875/1112 (78.69%), Query Frame = 1

Query: 1    MQRQKSLLSFFQKSPSDNRSSDGCASSVGQR----LTRFQTKPSAAGLEQPAIQTTADSS 60
            MQRQ+S+ SFFQK    N+S  G A   G R     T  Q  P      QP +  T DSS
Sbjct: 1    MQRQQSIHSFFQKCSPANKS--GAADMSGARKDSNFTTKQRNPVGDSSGQPTVSATEDSS 60

Query: 61   LEIRGTDTPPEKVPRQILPVIEK-----NRGSSLFSSIMHKFVRVDDKRKANERDEVQKD 120
            LEIRGTDTPPEKVPRQILP   K     + GSSLFSSIMHKFV+VD ++ AN+R+E   +
Sbjct: 61   LEIRGTDTPPEKVPRQILPSGFKANEGTSGGSSLFSSIMHKFVKVDARQNANKRNEQHGN 120

Query: 121  SSQNEVGKDSPQLPSISGKVNDPTEFSKLDVASRRHGKFDVANLNG--HRGPVLNIESNE 180
            SS          + S+ GK  D    S+   AS    K +V N NG  ++G V   E NE
Sbjct: 121  SST---------VCSVFGKTGDLEASSQQGTASLYSEKDNVFNCNGLANQGCVSCTEMNE 180

Query: 181  DIAGPETPGMRPSVSRLKRSQE--VSLVNCSGDSLQDSTKRIKLLQDSINLNKIHNEISD 240
            D++GP+TPGM   V RLKR  E  +++ +    SL DS+KR++LLQDS+   K   E +D
Sbjct: 181  DVSGPDTPGMHRVVPRLKRILEDNLNIGDKKNSSLLDSSKRMRLLQDSVAGVKNCEEEAD 240

Query: 241  ATSKFEWLNPSQVRDANRRRPDHPLYDKKTLYIPPDVLKKMSASQKQYWNVKCQYMDILL 300
             TSKFEWL+PS++RDANRRRPD PLYDK+TLYIPP+ LKKMSASQKQYWNVK QYMD+LL
Sbjct: 241  TTSKFEWLDPSKIRDANRRRPDDPLYDKRTLYIPPEALKKMSASQKQYWNVKSQYMDVLL 300

Query: 301  FFKVGKFYELYEQDAEIGHKELDWKMTLSGVGKCRQVGVPESGIDEAVQKLVARGYKVGR 360
            FFKVGKFYELYE DAEIGHKELDWK+TLSGVGKCRQVG+ ESGID+AV+KLVARGYKVGR
Sbjct: 301  FFKVGKFYELYELDAEIGHKELDWKITLSGVGKCRQVGISESGIDDAVEKLVARGYKVGR 360

Query: 361  VEQLESAEQTKSRGANSVIPRKLVQVTTPSTKADGDIGPDAVHLLAIKEESCGLDNNSIS 420
            +EQLE++EQ K+R  NSVI RKLV V TPST  DG IGPDAVHLLAIKE +CG DN S+ 
Sbjct: 361  IEQLETSEQAKARHTNSVISRKLVNVVTPSTTVDGTIGPDAVHLLAIKEGNCGPDNGSVV 420

Query: 421  YGFAFVDCAALKFWTGSIKDDASCAALGALLMQVSPKEIIYEARGLSKETHKVLKKYSPT 480
            YGFAFVDCAAL+ W G+I DDASCAALGALLMQVSPKE+IYE RGL KE  K L+K+S  
Sbjct: 421  YGFAFVDCAALRVWVGTINDDASCAALGALLMQVSPKEVIYENRGLCKEAQKALRKFS-A 480

Query: 481  GSTALELTSGSPVTNFLEASEVKLLVQSKAYFKGSLNLWNH--ESTVHDDIALCALGGLI 540
            GS ALELT    VT+FL+ASEVK LVQ   YF GS + W+   E+ +  DI   ALGGLI
Sbjct: 481  GSAALELTPAMAVTDFLDASEVKKLVQLNGYFNGSSSPWSKALENVMQHDIGFSALGGLI 540

Query: 541  NHMSRLMLDDVLRNGDLLPYQVYRGCLRMDGQTMVNLEIFRNNDDGGLSGTLYKYLDNCV 600
            +H+SRLMLDDVLRNGD+LPY+VYR CLRMDGQT+VNLEIF NN D G SGTL+KYLD+CV
Sbjct: 541  SHLSRLMLDDVLRNGDILPYKVYRDCLRMDGQTLVNLEIFNNNADSGSSGTLFKYLDSCV 600

Query: 601  TSSGKRLLRLWICHPLKDVEEINNRLNVVEELMAQSDIMVLLGTTYLRKLPDLERLLGQI 660
            TSSGKRLLR WICHPLKDVE INNRL+VVE LM  S++++++   YLRKLPDLERLLG++
Sbjct: 601  TSSGKRLLRSWICHPLKDVEGINNRLDVVEYLMKNSEVVMVVAQ-YLRKLPDLERLLGRV 660

Query: 661  KATVQSSASLVLPLIRKKLQKRRVKLFGSLVKGLRTGLDLLIQVQKEGLII-SLPKVVKL 720
            KA VQ+S+ +VLPLI KK+ K++VK+FGSLVKGLR  +DLL+ + KEG II SL ++ K 
Sbjct: 661  KARVQASSCIVLPLIGKKVLKQQVKVFGSLVKGLRIAMDLLMLMHKEGHIIPSLSRIFKP 720

Query: 721  PQLSGNGGLDQFLTQFEAAVDSEFPDYQNHDVTDSGAERLSILIELFVEKATEWSEVIHA 780
            P   G+ GLD+FLTQFEAA+DS+FPDYQNHDVTD  AE LSILIELF+EKA++WSEVIHA
Sbjct: 721  PIFDGSDGLDKFLTQFEAAIDSDFPDYQNHDVTDLDAETLSILIELFIEKASQWSEVIHA 780

Query: 781  LNCVDVLRSFAIIAHSSRGSMSRPLILPQSNNSMLSPEKQGPVLKINGLWHPYALVESGE 840
            ++C+DVLRSFA+ A  S G+M RPLILPQS N  +  +  GPVLKI GLWHP+AL E+G 
Sbjct: 781  ISCIDVLRSFAVTASMSSGAMHRPLILPQSKNPAVRKDNGGPVLKIKGLWHPFALGENGG 840

Query: 841  TPVPNDMILGLDQDSYHPRTLLLTGPNMGGKSTLLRSTCLAVVLAQLGCYVPCETCTLSV 900
             PVPND++LG D D   PRTLLLTGPNMGGKSTLLR+TCLAV+LAQLGC+VPCE C LS+
Sbjct: 841  LPVPNDILLGEDSDDCLPRTLLLTGPNMGGKSTLLRATCLAVILAQLGCFVPCEMCVLSL 900

Query: 901  VDTIFTRLGATDRIMTGESTFLVECSETASVLQHATQDSLVILDELGRGTSTFDGYAIAY 960
             DTIFTRLGATDRIMTGESTFLVEC+ETASVLQ ATQDSLVILDELGRGTSTFDGYAIAY
Sbjct: 901  ADTIFTRLGATDRIMTGESTFLVECTETASVLQKATQDSLVILDELGRGTSTFDGYAIAY 960

Query: 961  AVFRHLIEKVNCRLLFATHYHPLTKEFASHPHVMLQHMACTFK---------DHELIFLY 1020
            AVFR L+E++NCRLLFATHYHPLTKEFASHPHV LQHMAC FK         D EL+FLY
Sbjct: 961  AVFRQLVERINCRLLFATHYHPLTKEFASHPHVTLQHMACAFKSNSENYSKGDQELVFLY 1020

Query: 1021 RLRSGACPESYGLKVATMAGIPGRVVEAASRASQMMKQTIKENFKSSEQRSEFSTLHEEW 1080
            RL SGACPESYGL+VA MAG+P +VVEAAS A+  MK++I E+FKSSEQRSEFS+LHEEW
Sbjct: 1021 RLTSGACPESYGLQVAVMAGVPQKVVEAASHAALAMKKSIGESFKSSEQRSEFSSLHEEW 1080

Query: 1081 LKTLITVLEFKGNNLGENDAFDTLFCLWYELK 1088
            LKT++ V     N+  ++DA+DTLFCLW+ELK
Sbjct: 1081 LKTIVNVSRVDCNS-DDDDAYDTLFCLWHELK 1098
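
Each HSP header above reports identity and positives as a count over the alignment length, followed by the same value as a percentage. The following is a minimal Python sketch, not part of the original report, that parses one such header line and recomputes the two percentages. The names `HSP_RE`, `HspStats` and `parse_hsp_header` are hypothetical, and the pattern is assumed to tolerate both the "Positives" and "Postives" spellings.

```python
import re
from dataclasses import dataclass

# Matches e.g. "Identity = 165/493 (33.47%), Positives = 259/493 (52.54%)".
# "Posi?tives" also accepts the misspelling "Postives" (an assumption about
# how the header may be written in some exports).
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)"
)


@dataclass(frozen=True)
class HspStats:
    identical: int  # aligned positions with identical residues
    positives: int  # aligned positions scored as conservative or identical
    length: int     # alignment length, gaps included

    def pct_identity(self) -> float:
        return round(100.0 * self.identical / self.length, 2)

    def pct_positives(self) -> float:
        return round(100.0 * self.positives / self.length, 2)


def parse_hsp_header(line: str) -> HspStats:
    """Extract the identity/positives counts from one HSP header line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP header line: {line!r}")
    identical, len_a, positives, len_b = map(int, match.groups())
    if len_a != len_b:
        raise ValueError("identity and positives use different lengths")
    return HspStats(identical, positives, len_a)


if __name__ == "__main__":
    header = "Identity = 165/493 (33.47%), Positives = 259/493 (52.54%), Query Frame = 1"
    stats = parse_hsp_header(header)
    print(stats.pct_identity(), stats.pct_positives())
```

Recomputing the percentages from the raw counts is a cheap consistency check when these headers are copied or re-keyed by hand.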

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
MSH7_ARATH    0.0e+00    61.70           DNA mismatch repair protein MSH7 OS=Arabidopsis thaliana GN=MSH7 PE=1 SV=1
MUTS_NEIMB    2.9e-66    28.57           DNA mismatch repair protein MutS OS=Neisseria meningitidis serogroup B (strain M...
MSH6_HUMAN    2.7e-64    42.07           DNA mismatch repair protein Msh6 OS=Homo sapiens GN=MSH6 PE=1 SV=2
MUTS_CHLCH    6.0e-64    28.29           DNA mismatch repair protein MutS OS=Chlorobium chlorochromatii (strain CaD3) GN=...
MUTS_LACH4    3.3e-62    27.40           DNA mismatch repair protein MutS OS=Lactobacillus helveticus (strain DPC 4571) G...
Match Name          E-value    Identity (%)    Description
A0A0A0LHY3_CUCSA    0.0e+00    100.00          Uncharacterized protein OS=Cucumis sativus GN=Csa_2G004730 PE=4 SV=1
A0A067KMZ7_JATCU    0.0e+00    67.36           Uncharacterized protein OS=Jatropha curcas GN=JCGZ_07221 PE=4 SV=1
V4WD14_9ROSI        0.0e+00    67.36           Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007291mg PE=4 SV=1
A0A061GMS3_THECC    0.0e+00    66.52           MUTS isoform 1 OS=Theobroma cacao GN=TCM_037911 PE=4 SV=1
F6HH29_VITVI        0.0e+00    65.83           Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0016g04090 PE=4 SV=...
Match Name   E-value  Identity  Description
AT3G24495.1  0.0e+00  61.70     MUTS homolog 7
AT4G02070.1  3.2e-63  39.75     MUTS homolog 6
AT3G18524.1  1.1e-42  31.03     MUTS homolog 2
AT4G25540.1  5.4e-39  33.22     homolog of DNA mismatch repair protein MSH3
AT4G17380.1  3.9e-37  38.39     MUTS-like protein 4
Match Name                         E-value  Identity  Description
gi|449443325|ref|XP_004139430.1|   0.0e+00  100.00    PREDICTED: DNA mismatch repair protein MSH7 [Cucumis sativus]
gi|659071128|ref|XP_008458258.1|   0.0e+00  95.87     PREDICTED: DNA mismatch repair protein MSH7 [Cucumis melo]
gi|802627380|ref|XP_012076663.1|   0.0e+00  67.36     PREDICTED: DNA mismatch repair protein MSH7 [Jatropha curcas]
gi|1009160401|ref|XP_015898331.1|  0.0e+00  67.00     PREDICTED: DNA mismatch repair protein MSH7 [Ziziphus jujuba]
gi|567918206|ref|XP_006451109.1|   0.0e+00  67.36     hypothetical protein CICLE_v10007291mg [Citrus clementina]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000432  DNA_mismatch_repair_MutS_C
IPR007695  DNA_mismatch_repair_MutS-lik_N
IPR007696  DNA_mismatch_repair_MutS_core
IPR007860  DNA_mmatch_repair_MutS_con_dom
IPR016151  DNA_mismatch_repair_MutS_N
IPR027417  P-loop_NTPase
Vocabulary: Molecular Function
Term        Definition
GO:0005524  ATP binding
GO:0030983  mismatched DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006298  mismatch repair
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006298      mismatch repair
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0030983      mismatched DNA binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU114975      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa2G004730.1  Csa2G004730.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU114975      CU114975     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                                      Source   Source Term         Source Description                                    Alignment
IPR000432  DNA mismatch repair protein MutS, C-terminal         PFAM     PF00488             MutS_V                                                coord: 846..1031; score: 1.0
IPR000432  DNA mismatch repair protein MutS, C-terminal         SMART    SM00534             mutATP5                                               coord: 842..1029; score: 5.2E
IPR000432  DNA mismatch repair protein MutS, C-terminal         PROSITE  PS00486             DNA_MISMATCH_REPAIR_2                                 coord: 923..939; scor
IPR007695  DNA mismatch repair protein MutS-like, N-terminal    PFAM     PF01624             MutS_I                                                coord: 271..381; score: 2.8
IPR007696  DNA mismatch repair protein MutS, core               PFAM     PF05192             MutS_III                                              coord: 557..775; score: 8.1
IPR007696  DNA mismatch repair protein MutS, core               SMART    SM00533             DNAend                                                coord: 572..821; score: 2.4
IPR007696  DNA mismatch repair protein MutS, core               unknown  SSF48334            DNA repair protein MutS, domain III                   coord: 555..779; score: 1.44
IPR007860  DNA mismatch repair protein MutS, connector domain   PFAM     PF05188             MutS_II                                               coord: 391..538; score: 4.9
IPR007860  DNA mismatch repair protein MutS, connector domain   unknown  SSF53150            DNA repair protein MutS, domain II                    coord: 404..452; score: 3.01E-6; coord: 489..567; score: 3.0
IPR016151  DNA mismatch repair protein MutS, N-terminal         GENE3D   G3DSA:3.40.1170.10                                                        coord: 257..382; score: 1.0
IPR016151  DNA mismatch repair protein MutS, N-terminal         unknown  SSF55271            DNA repair protein MutS, domain I                     coord: 257..378; score: 4.58
IPR027417  P-loop containing nucleoside triphosphate hydrolase  GENE3D   G3DSA:3.40.50.300                                                         coord: 804..1032; score: 6.9
IPR027417  P-loop containing nucleoside triphosphate hydrolase  unknown  SSF52540            P-loop containing nucleoside triphosphate hydrolases  coord: 805..1033; score: 4.2
None       No IPR available                                     GENE3D   G3DSA:1.10.1420.10                                                        coord: 551..734; score: 6.5
None       No IPR available                                     PANTHER  PTHR11361           DNA MISMATCH REPAIR MUTS RELATED PROTEINS             coord: 131..1087; score:
None       No IPR available                                     PANTHER  PTHR11361:SF31      DNA MISMATCH REPAIR PROTEIN MSH6                      coord: 131..1087; score:
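
Each Alignment entry above carries one or more `coord: start..end` ranges. As a rough sketch, and not part of the database record, the ranges can be pulled out and converted to span lengths as follows. The names `domain_spans` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based, inclusive residue positions on the protein, which this page does not state explicitly.

```python
import re

# Matches "coord: 846..1031" and captures the start and end positions.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(alignment_field):
    """Return every (start, end) pair found in an Alignment field."""
    return [(int(start), int(end))
            for start, end in COORD_RE.findall(alignment_field)]


def span_length(span):
    """Length of a span, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


# Alignment text for the PF00488 (MutS_V) row, copied from the table above.
for span in domain_spans("coord: 846..1031 score: 1.0"):
    print(span, span_length(span))
```

A field listing two ranges, such as the SSF53150 row, yields two tuples, so a split domain match can be handled one segment at a time.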

The following gene(s) are paralogous to this gene:

None