Cp4.1LG08g12770 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG08g12770
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DNA mismatch repair protein MLH3 isoform X1
Location: Cp4.1LG08: 9268218 .. 9279549 (+)
RNA-Seq Expression: Cp4.1LG08g12770
Synteny: Cp4.1LG08g12770
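
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. As a minimal sketch, the following splits that string and computes the span covered by the gene. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Number of bases covered by the feature.

    Assumption: coordinates are 1-based and inclusive, so both
    endpoints count toward the length.
    """
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG08: 9268218 .. 9279549 (+)")
print(seqid, strand, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.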
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTTTGGCCATTGTCGCTTTCGCATCTGCTCTGTTTCTGAAAACCTGTTTATCACTCATGAAAAGAGTGAATCGTGAAGTAAATCGAATATTAGGACCTGTGAAATTGGCGAATGAATTTATGCTGAAAAAAAAGCATTGAATTGGTTGTTTAGTTCGTTGTTTTCGCGCCATATTTTATCTCCGTGTCGAAGTTCAATCTCCACATGCTTCCAAGTCCATGGTTTGCCGTTTTAAGGACTGAGATTGCCTCGGGTGAGTTATAACGAAGTTAGAAATTGAAACCTGTTCAGTAAAATATGCATCATTTGTGCATGTGTTCTCGCCATTGGAACCTACTTTCAGAAAAAGTGGTACGAATAATGTATTGAGTTTTGGTGAAAGTAATGAACAACTAATCATTATATTTATCTGCTAATTCGCCTCCTATTCTTCATTTTTCCCATTTGATGTTTTTCCTTGTTGATGTAGCGTAATTCTAGATGGGGATTATCAAGCCCTTGCCAAAGTCTGTTCGTAGTTCTGTGCGTGCTGGCGTTATACTCTATGATGCCACGAAGGTTGTGGAAGAGCTTGTTTATAATAGCTTGGATGCTGGTGCGTCAAAGGTAGAGAGTCTAATCGACCATCTTATTATTTTGTTTTATGCTGAGGCATGGTGTTGCCTACTCTCCTAGTTGAAAATGTAAAGTGTTTTATTTCTAATTATCAATGTAAAAGTAAGTTATTCGGGAATTACTAAATATTTTAACTGAGATATTCATCAACCTTGATCCTATAACAGTTAGAAATTGTGCACTGTGAATTTGGCCTAGCCATGAAATAACAGTTTGTCTTCGTCGTTCATCACTGAGCACGTTTCTTCTACAAGTGAAAAAAATACTCCTATGGTTGATTGATTTTCGATATCAAAGAGAGAAATTTAATAACTCTTCGTTCCATTTATATTCCAAAATTTTTGTTTTGGTTCTAATAAGTTCCAGCTGTTGACTTTGTTACTTGTCATGTTAGTTGAACTAAAGTGTGATGATTTGTCAATGGGTAACACAAAATCTGCGATGTGCACTCGCCCTATTGAATTTTTCAGCTTGTCTGCCTTTATTTTTTTATGTCATGTTGTATGAAATCATCACATGGTAGTCAATTAGTAGTTCAATCAATTTGCCAAACAATGTTAACGGTCATAACTTATTAGAATCAAAATAAAAAGTTCAAGGATGTAATTAGAACTTCTAAAATTTAGAGACTAAATGAAACAACCATCAGATTTTAGGGACAATATTTGCCCTTTTATCAAATAAAAATGATCTATAACTAATTTTAAAATTTAAATTTTCTTCTAGTTGTAATTTGTCTGGCTTAATATATGTTACTTACAGCCTATTGTATCAATTCAGATTTCAATTTTCATTGGCATTGGGACATCCTATGTGAAAGTAGTGGACAATGGTTTGTGATTTTCTCAGTATGCTACTTGTTACAGTACTTTTTATATGTCAGGTTTCATACTGATACAGTGAATTTGTCATATTCTTGTAGGATCTGGTATTACTCGGGATGGGTTAGCCTTGCTAGGAGAAAGATATGGTAAGTCTCTCAATTTCTTGGTTTTCTTTGTGTACTCTGTGTATTTACTTCATCATCATTTCGGTGTATTTTGGAATCAAGAATTAAATTGAGATGCTTGGGCTTGCAGGGGGATGGGGATGGTGTTGATAAGTGTGTAATAATTTGATGGGTTAGATAATGATTACTTGCTCTGATTGAGTGGAACGAATGAAAGGGCAAGTTTTGACCTGCAAACCTCTTTGAATATAACTCTTTAACTCCAATGATATTTCCCAAAATTGATTAGCCTTTTCAACTAATGGAAATGGTGCAAGAATGGGCTGAACCTGCTAATGGGCTTTCCCAGCTATCTCTTATAAGAGGTGCAAGGATGTTTTAATGAACCATGCAATAAGTTTGGGTGGTATGGCTATGGCTTGCTTGATGTGATTA
TCTTATAGACAAGTTGGTCAAAAGTAGAATTCTATTGAAGGAAATAGACGTGTTAGCAAGTTGGCAAATATTTAATGTATTTGAATTTACTACCTTGGTGTGTTTATGCTGTGCTTTTCAATGACTATATATGCAGCAACTTCGAAATTCCATGATCTCATAGACATGGATACCAAAGGCAAAACATTTGGCTTTCGAGGAGAGGCATTGGCTTCCATTTCAGATGTATCGTTGTTGGAGATCATAACCAAAGCATGTGGGAGGGCAAATGGATATCGTAAAGTCATAAAGGCACAGAAGTTTACAGCTATTTAATTGTGGAGCAATATGTGGAAATTTTCACATTGTGCTGTCCTTTAAAAATAATTAACTTTTTATTGCAGGGTTGCAAGTGCTTGTACCTTGGAATTGATGATGATATGGAAGATATTGGTACTACAGGTAACAGGTCCATTATCCGCCAGGGCAGTCAGATATTTTCATAGCGAGGGGCAATAGCTTATATTCTAACTACATATACATATCATATATTCTTTCTATGAACAAGTTAGACAGAAAACTTTAACATCTTAACTGTATTATATCTTTCTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNTCTAAGATGAAGTAAGGGTATCTAATTAGGCCTTAGTTATTTGGATACCACCCTTTTGTATCTGGACTGAGTCGTCTATTTCGGGGTCTGCTTTTATAGTTGTACTTTTGCTTTTGTATTTCTGTTCTTACCATTAGCCTCATGAAAAACACTGATGTGGTAAATCGTAATTTTATGGGATGTATATCCATTTTTTGGTCGTTTTAGAAAATTTGTAGTTATCCTTTCAGTTATCTGATCCATAGTCATGAAGATCCACAAAATTAACATCATATATTTGGAAGAAATAGTGTTTGGTTGTATGATTTGTACCTTGGTGCAGTGTATTGATTAGCCACCCATTACACACGTACTAGGAGGAAAATCAAATGTGGGGGTGGAATACAGTGGTGGCTGATGGATATTTATGTTGAGGGGGATCCTTGTCTTTTTGCTGTTTGATATTCTCTTAGATGGTAGCAAGGCAACTATTGGCGTCGGTGAATTTTCTGATGAACTATTCTTGAAAAAAGAAATGAAACTATGTGGATTTTATTATGTTGTATAACCTTACTATTAAACATCCATGCGATTTGCAATTACATTTGAAAAAATAATTTATTTTCTTCTCCTTTGTTATATTCTATTTTTGCATACACTTCTGAAGGCAAATACTCGATCAGAATTCTTTTGTCTATTTCTTTTGTAGTTATTGTTCGAGATCTATTTTACAACCAACCAGTTCGAAGGAAGCATATGCAATTCAGGTTTGATTTTCTTAGATTAGTTTGTTTGTTTTTGTTTTTGTTTTGTTTTTTTTTTTTTTTTTGGGTATGATAAACAAAAATCTTGAGTAAATAGAAGGATTCAAGAGTTGACATATGGATACGGCAGTCCATATGAGGAGAAAGAAGAAAATGAAATATTATATCCTTGCATTTTACTGACGCGCTAATTCATTTTGCGTAATGGAACTGGAAGCCCCAAGAAGGTCTTGCAAGCAGTCAAGAAATGTGTAGTCCGTATTGCCCTTGTGCATTCTAAAGTTTCCTTCAAAATTGTAGATAGTGAAAGGTATATCCATGTTAAATTCAGGTTGAAATATGTAAAACTTTTATGTTAGTTATCTTACAATTTTATCTCTTCCTGTAGTGAGAGCATCCTTCTTTACACGAATCCCTCTCCTTCTCCTTTATCACTTTTGAGAAGTGGCTTTGGCAGCGAGGTTTCCAGGTCTCTCCATGAACTAAAAATCGGTGATGGGGGCTTAAAGCTTTCTGGTTATATATGCAGTCCGTTTGATACGTTTACTATCAAGGTATAATTGTTTTCTTGCTGATATAAAAGAAGGCTAATTCGGCTGTTTAACATTTCAT
TAATTTTTGTGTTCTTTTTTTAAAAAAAATTATGGCAGGCTGTTCAATATGTCTGTATCCACATCATTCTGTGAATTCTTTAAATCTGTGGTGGGTGGATAGGATCCAGTATTTTATGAAATTGTTTTGACCTCAGCAAGGACAGATATTAACAGACGATTCATTTGCAAGGGCCAAATTCATAAATTACTTAATCAACTGGCCAGTAGATTTGTAAGCCTGAGTCCGCAGACTGACCAAGCCTGTCACAGCAGGAAGAGAAGCAGGTTTCAAGCAAATCCTGCTTATATTTTAAATTTAGATTGCCCTGGATCTTTCTATGACCTAACATTTGAATCATCCAAGACCTTTGTTCAGTTTAAGGTAATTCTTTATTTGTTCTCCAAACACCAGTTCATTTGTTTATCTTCTGTTCCTTCATTTGTTTGGGAATAACTTTTTTATTCATTATTAAATGGTTCTATAGAATTCTTTCCTTCGCCTTTTCTAACCAACACCTTCTCATTTCCTTCCTTTTTTGTTTCCCAAGGACTGGACTTCAATACTTACCTTCATTGAAGAGACCATTCAACAATTTTGGAAAGAAAAATATAGTAGTGGTATGGAGTTAGCCATATCTTTGAAACCATAGTGAGTGTTAATATAAACCGTTTGAAGTTGCATTTTAAATTTGCAGGAAAATCTTTGGTCCATACAACCTCCATAGTTGGAGGAGATCAGCTGTGGAAGGATGAAGATAACATGATATCAACAAATTCAGGTATTTTAAGTTGAACTTTCATCTCCAATCTCACTTTAATTTATGAAATGATTTAGTCTTATTTATGCTGCATGAATTGCCCACTCCACCGCCTCTTTAAAAGAGGGTTAAGGTGAAATGTCTCCATTTATTATCTTCACAGGACACTTCCCCATAAAAAATATGTGCACACCGAGTGAACTTGCAAAAACTGACCTGGTTCAATTAATTTCTGAATATCTTTCCTTTTTAGATTTTCGTGAGGATGTTATACGTTTCCCTCCAGAGAGTATTCGATCTCTGAAGAAGAGCAGAATGCGAAGCCCTCGGGCCTCCCTTATTGATTTGTTTTCTCCGTCAGCAATGCTTACAAAAGATGATGACATCTTGTCCAATAGTTTGCATGAAAAGAAGGCATGTGAGAATTCACACACAAGTTCAAGCGAATTGAATGACGTTCACCGACAAGCTAGGATGCAATTTGGTAACCAAGCTGCCGATCATTTCTCAGGATTATGGGGTACTCCCTTGGCAAAATGCTCAACTACAGCTGTCCAAAAAGGTGACAGGCACCCATGGGTACCTGATAACATTTTTGTATCTGAAGACTCCTTTCTGGATAGAAGGCTGGCTTCTCCCAAGAGGTGTGATGACATTGTTGAGGACAATATCTTCAGTTCAGACTTGAAAGGTCAATCTTCTAAAGTGTATATCGATATGATCAATGGGTCTGCTGAAAGTACACCATCTTCTTACTTTCATGAATTTAGTTATGATGACAATATCTTCACGGGTAACAAACCCTCCCTTCGGGGATGCACCTCAGGAAGCAGTTTTCAACTTGAGAGTACTTCCATTCTTGGTGACAAACTGTTCATTCAAAATGATGTCATCAAAAGAATCCAAAAGCAGGGAATCCCTGATGATGAAGTTGATGTTCTAAAGCTTGACGGTTACATCCAGGGTTCTGATTTTTATGCCGGAGACTCATTTCATGCTGAGGTATTGTTGTCTGACTCTTACTAGAGCTAACAATTTATGCGTTTCTCAAGATCTAAAACTTCTGTCCTTTTTTTTTTCCTGTTGTCGATTACTTTTTTTTGGGCAACAGTTTACTGAAGAAAATATGTACTCATGTCATTTGGACAAGCACGTGCAGAAGTTTTTCTCAAGTTATCAGACTAGAAATTCCCCAGATGTTCACGTGACCCCAAATCCCAGATTAGCTTCAGAATGGGATGTTGATTGCTTCAGTGTTAGG
GATGGGGTTGAAAGGAACTGGAGATCTAGAGATAGGACTCCCTTCAGGGATTTGGTGGATGGTGAGGATAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGTATCTTTTCAATTGGAGATTACCTGGAAGAGATTGGGAAAAGGCTTATGGAAGCTCAGAGCTTAAGTTTGGACATCAAGCTTTTAAACAGAGGTACGTTTCTGTTGAAAGGCCTAGAAGATGCAAATCAGCTCCACCTTCTTACAAAAGAAAAACTAGTTTCTATTGCCTGTACCAAAGAAAGGAAGAAAAGCATAATGCCGCCGGTTTCTATGGCCTTGACCAAAGAAAAACTGATAAGTTTAATGCCACAAATTTCTATTGCATGGACCAAGGGAAAGAAGAAAAGCTTAGGGCATCGGCCTTCCTTGACAGCCCACCTCATTTAGGTATTTTCATTTAAGAAGTCATTACTAGTAATGCATTGTCATTGATGAAATACTAATGCTGTGCCAATACGTACAGAACTAGGTGAGCTGAGAGATTCCAAACATTTCTCTAGTACTAATAATCTTTATATTAAGCCAAGTCCTCTTGATGACTTATCGATGGGAACTAGGTAGAGATCCATTCCTTTCCCCTATCAATTTAAAATTGAAGTTGTTATTAGAATATAATGCATCTTTTCATCACTTCCAGAATAGATATGACAAAGACGCCTGCTATTACGGGAAATAATAAAGAGAAACAAGAAGGAAAAATTTCCAAGCAGTTCCAATCCGATGTTAAAGTTACTGCATCTGCTTTAGGTAACTGCACTATGTATATGAAACTGATTATTTCTTTGTTCTTAAGGATTTGGAATTCGAAATTTCAATTCTTAGGGAGGGAATTGGTATTGAAAGATTTTATGTGATCAGAAAAACATGGAAAGAAAATTGTGCTGTAATTTCTTTCCATGCAGATTTAGTAGATTTATCTTTATTATCATTGACATATTAGTGGTTAATAAGTTAAATTCTCATCAAGAATAGTCTTTACAATATATAGCATGCTGCATTAGTTCATAATTTTGTTGTTCTTGAATATTGAGTTTTATATGATGGTAGCATCATCTTTACTGGCGGAAATAAAATTTCAGTTCTTTTTGCAGTTGCCATATTAGTGAAAATTTAAATATCCTCCAATTCTATTTAAATGGCATAGTTTTGATCCTGTTTTGCCTTCAGAATTATGCTCAAAGGAAACTCAAGAGTCAGATTTATGGATCAAATGGAAAAATTGCTGTCCGACTACAGTTAAGTTTGAAAGTTGATTTGTTTTCTATAGTTTGATATGAGGCTTCTCTTTTCTATTTTTATTTATTTATTTATTTGAGTAACATTTAATAGTTTTTCTTTATTGATTTCAGAGAAATGATGGGCCACGTGCTTTTGAAGATGAAGTTAGTATACTTGATATCTCTTCAGGGTTCCTATCTCT
TGCCAGAAATTCCTTAGTTCCCAAATCCATCGATAAGAATTTCCTTGAAGATGCCAAAGTTCTTCTACAGCTTGATAAGAAATTCATTCCAGTTGTTTCTGGTGGAATACTGGCTGTTATTGATCAGGTTAGTTCACTCCCTTCTGGTTCATTCACTTCAGGTTTTAAGTGAAAGACAGTATCTCCACCCCTTAAAAATCTTTCTACTCCTATTTCTCTTTGAGTCTAATGTATGACAAGGTAACGAAAAAGGCAATCATCCTTCCACTTCGTATGAGTGGGTTTCCTGCCAACTTTGTTTTGTTGTGTGATAATATATCCAAGTGTAAGGGTTTATGATATGCCACATAAGAAGAGGCATTAGGTAACATTTAGATAGCATCTCCTTAGAGAGGTCTGAGGAAGATGGCAATCTCTTCTTTCATTAGCTTTGTCGATCCAAGGAATGCAAATGTAGAAACGGAAGTAGTTTAATTGATTCTAGATCAACAACTTTATAGCAATTGGGGAATTTTCTTCCTCAATTTTATAGCGTGTCACCTGACTCACGTTCCAGTGTGGAAACAACAAAGGTGCTCATTGAAGATTGTTGGAACAAGGAAAATCAAGTATGAAGTGTTTATCTTAAAAATAACTTATTTGGTCTTTCAGTCTGGTTTCTTTTCTTTTTTTTTTTTCTTTTTTTTTTTTTCTTTTTAACTGAAGAAGTAGATTAGGTTGTCTGCGTACTTTGATACATTACTTGGATACAGCACACTCATGCCTTGTGATTTTTTGTGCACCCCATATACCAACATATATGTGCACACCAATTGACAAGTCTTTCTAAAGCACGGGGTGGGGTAATGATGTTTAAATGGTGCTTTCTCCAGCATGCTGCAGATGAAAGAATCCGACTTGAAGATCTTCGTCAAAAGGTAAACATTATATATTTGATGGAATAATTCTATGATTTCTAATAAAGCAATATATGTATATACACAGTTGTTGTCTGGTGAAGCAAAGACAATAGCCTATCTGGAGGATGAACATGAACTGGTAAGTTTATGTTTGATGTTGTCTGGTCTCAATCTGTCATGCTTTGATTGAAAGGGATAAATTAATCTTGCTGTTTGTTTACTACAGGTGCTGCCTGAAATTGGGTACCAGTTGTTGTACAACTATAGTGATCAGGTGAAAGAGTGGGGTTGGATCTGTAATATTCATGCTCAAGATTCAAAATCCTTCCAAAGGTATATTTAATTCCGTTGGGGTTTGTCTACTTTGCCAGTACTGCTCAAGTTTGCCAAAGTAATATTTGGTTCTCTGTGCAGGAATTTGAATATCCTATACAAGCAGGAAACGGTCATCACGCTAATGGCAGTTAATCACCATCTTCCTTGCTTCGATAATTATTGCATTTAAACTTCGAAACCAAACCATGCCAACTGTAATAAAACATTTATTGTGATACAATAGCAATTGTCAAATTTGTGCTTGAAATTTGTAACGAATTTATATTTTTGTTTGAAAATTCATGTATTTTTATTTTATGTTTGTATAAAGTTTCATGTGTATGACACAGGTACCTTGCATACTAGGTGTTAATTTATCTGATGCAGATCTGCTGGAGTTTCTTGATCAGGTAACTGAGGACTGCACTTTAAAACCAATTTTGCATGGTTCATCCATTAATTTTGTTTAGCTGTTACATTGTTCCCTAAACACCTGGAATAATTTTGCTAATGTCGCTATAGCTACATATATAAGCTTGCTAAGACTTTTTTCTTGGATCGAAAGCTTGCTGATACAGATGGCTCATCAACAATGCCGCCATCCGTGCTTCGAGTTCTTAATTCAAAGGCCTGCAGAGGTACATGCTTAAGCTTGTTTGATGTGTGTAAATACCGTTCACCATGGGAAATTAAATGTCGTGTATAACAGGTTGACTTGCACCTTTTTCGGATGATGTCCATAGCTGTAACATGACAATTTTTTATTTGTTTTTTGGTAATAAATGC
TGTTGAAGAAATGACTTCATCATGTAGTGTCTCACTTCTCTGGCAACAGTCTGTAGCTCAGTGCTGTGTGTGTATCTGCATGTTGATCCTTTTTCACTGAAAACTTTTTCTTTGATAAAATTACTTGATTAGGTGCAATTATGTTTGGAGACTCTTTGTTACCTTCAGAGTGTTCCCTTATTGTTGAAGAACTGAAGCAGACTTCTCTGTGTTTCCAAGTGAGCGAATAAGTTTTCAAGCATGTTTTCGAAGTTCTGATTATATATACATATATATATTCTTTTGGTGGGTGTTAGGAGGTGAAAATGACTCGTAATTGATGAGTGTGAGACGTTTAATGTGCAGTGCGCCCATGGTCGACCAACTACAGTACCTCTCGTGAACTTGGAGGCATTGCACAAGCAGATAAGGGAGATGGAAATATTAGATAAAAATGGTTCGAATGGAACGTGGCACGGGCTGCGGCGACATGAGCTGAGCATTGAACGGATGTTGCAGCAAGTAGGTTCGGCCTGATGGTAAATGCTGCCGTTCCATCAAGGAGAATAGTTGAGAGGGTGCCCCGGTAATACCAGTCTGCAGAAGTGAATGGATTGAGTAGCATTCTTAGATCATGTATACAACCAGTCTACGATGATTTGTGAAAAAGAGCAATGTTTCTGTAGTTTAGTTTTCCTTGGAATTTCGGTTGCAACTCTTCTTATAAACAGTTGTCTGCCCTCTAGCTGCATAAATCCTTGTGCATTTTGCCTGTCATGGTGCTCAATGTGTGTGAATAAAACTAAATGACTTCTTTTAAGTTGAGAAAAATGGCCATTTAATTAGCCCAAATATATGCTCCTTTAATATTAGTCCATTTTTACAGGAGATTCCATATTGAAATAGTTGTCTTAGTGTTATTTCTAGTGCAAGAAAGGTAGGTATCAAAAGAAACTTACAGAAAGACCATATACATCCTACTAAAATATCAATGATATCATGTACCTGCGATTAACAAAAATTAATGGAAGAGAAGAATGATGAAGGCTATTACTAACATTAAAACATTTACATACAAGACTGTGGCTGGGAGCTATAGCTAATTGCTGATCGCTTTATGAACAGGCTTCTTTGTTTTTACCAGCACATAAAGCCTTCCAGATAGGCACCAACATTTTCACCGGTGGACGTAATTCCTAGAGGGGAAAGAATAATTTTCAGCTGGTATAACACAACAATATATAATGCTCATCATGACTCACCGTTTTGACAAGTGCGGTTTTGTACAATCTAGGAAAACGTTCACGAGTATAAATGTAACATTTTGAAATCTAGATGTCAAATAGAAACTAA

mRNA sequence

TCTTTGGCCATTGTCGCTTTCGCATCTGCTCTGTTTCTGAAAACCTGTTTATCACTCATGAAAAGAGTGAATCGTGAAATGGGGATTATCAAGCCCTTGCCAAAGTCTGTTCGTAGTTCTGTGCGTGCTGGCGTTATACTCTATGATGCCACGAAGGTTGTGGAAGAGCTTGTTTATAATAGCTTGGATGCTGGTGCGTCAAAGGTAGAGAGTCTAATCGACCATCTTATTATTTTGTTTTATGCTGAGGCATGTTTGTCTTCGTCGTTCATCACTGAGCACGTTTCTTCTACAAGTGAAAAAAATACTCCTATGCCTATTGTATCAATTCAGATTTCAATTTTCATTGGCATTGGGACATCCTATGTGAAAGTAGTGGACAATGGATCTGGTATTACTCGGGATGGGTTAGCCTTGCTAGGAGAAAGATATGCAACTTCGAAATTCCATGATCTCATAGACATGGATACCAAAGGCAAAACATTTGGCTTTCGAGGAGAGGCATTGGCTTCCATTTCAGATGTATCGTTGTTGGAGATCATAACCAAAGCATGTGGGAGGGCAAATGGATATCGTAAAGTCATAAAGGCACAGAAGGTTGCAAGTGCTTGTACCTTGGAATTGATGATGATATGGAAGATATTGGTACTACAGGTAACAGTTATTGTTCGAGATCTATTTTACAACCAACCAGTTCGAAGGAAGCATATGCAATTCAGCCCCAAGAAGGTCTTGCAAGCAGTCAAGAAATGTGTAGTCCGTATTGCCCTTGTGCATTCTAAAGTTTCCTTCAAAATTGTAGATAGTGAAAGTGAGAGCATCCTTCTTTACACGAATCCCTCTCCTTCTCCTTTATCACTTTTGAGAAGTGGCTTTGGCAGCGAGGTTTCCAGGTCTCTCCATGAACTAAAAATCGGTGATGGGGGCTTAAAGCTTTCTGGTTATATATGCAGTCCGTTTGATACGTTTACTATCAAGGATCCAGTATTTTATGAAATTGTTTTGACCTCAGCAAGGACAGATATTAACAGACGATTCATTTGCAAGGGCCAAATTCATAAATTACTTAATCAACTGGCCAGTAGATTTGTAAGCCTGAGTCCGCAGACTGACCAAGCCTGTCACAGCAGGAAGAGAAGCAGGTTTCAAGCAAATCCTGCTTATATTTTAAATTTAGATTGCCCTGGATCTTTCTATGACCTAACATTTGAATCATCCAAGACCTTTGTTCAGTTTAAGGACTGGACTTCAATACTTACCTTCATTGAAGAGACCATTCAACAATTTTGGAAAGAAAAATATAGTAGTGGAAAATCTTTGGTCCATACAACCTCCATAGTTGGAGGAGATCAGCTGTGGAAGGATGAAGATAACATGATATCAACAAATTCAGATTTTCGTGAGGATGTTATACGTTTCCCTCCAGAGAGTATTCGATCTCTGAAGAAGAGCAGAATGCGAAGCCCTCGGGCCTCCCTTATTGATTTGTTTTCTCCGTCAGCAATGCTTACAAAAGATGATGACATCTTGTCCAATAGTTTGCATGAAAAGAAGGCATGTGAGAATTCACACACAAGTTCAAGCGAATTGAATGACGTTCACCGACAAGCTAGGATGCAATTTGGTAACCAAGCTGCCGATCATTTCTCAGGATTATGGGGTACTCCCTTGGCAAAATGCTCAACTACAGCTGTCCAAAAAGGTGACAGGCACCCATGGGTACCTGATAACATTTTTGTATCTGAAGACTCCTTTCTGGATAGAAGGCTGGCTTCTCCCAAGAGGTGTGATGACATTGTTGAGGACAATATCTTCAGTTCAGACTTGAAAGGTCAATCTTCTAAAGTGTATATCGATATGATCAATGGGTCTGCTGAAAGTACACCATCTTCTTACTTTCATGAATTTAGTTATGATGACAATATCTTCACGGGTAACAAACCCTCCCTTCGGGGATGCACCTCAGGAAGCAGTTTTCAACTTGAGAGTACTTCCATTCT
TGGTGACAAACTGTTCATTCAAAATGATGTCATCAAAAGAATCCAAAAGCAGGGAATCCCTGATGATGAAGTTGATGTTCTAAAGCTTGACGGTTACATCCAGGGTTCTGATTTTTATGCCGGAGACTCATTTCATGCTGAGTTTACTGAAGAAAATATGTACTCATGTCATTTGGACAAGCACGTGCAGAAGTTTTTCTCAAGTTATCAGACTAGAAATTCCCCAGATGTTCACGTGACCCCAAATCCCAGATTAGCTTCAGAATGGGATGTTGATTGCTTCAGTGTTAGGGATGGGGTTGAAAGGAACTGGAGATCTAGAGATAGGACTCCCTTCAGGGATTTGGTGGATGGTGAGGATAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGTATCTTTTCAATTGGAGATTACCTGGAAGAGATTGGGAAAAGGCTTATGGAAGCTCAGAGCTTAAGTTTGGACATCAAGCTTTTAAACAGAGGTACGTTTCTGTTGAAAGGCCTAGAAGATGCAAATCAGCTCCACCTTCTTACAAAAGAAAAACTAGTTTCTATTGCCTGTACCAAAGAAAGGAAGAAAAGCATAATGCCGCCGGTTTCTATGGCCTTGACCAAAGAAAAACTGATAAGTTTAATGCCACAAATTTCTATTGCATGGACCAAGGGAAAGAAGAAAAGCTTAGGGCATCGGCCTTCCTTGACAGCCCACCTCATTTAGAACTAGGTGAGCTGAGAGATTCCAAACATTTCTCTAGTACTAATAATCTTTATATTAAGCCAAGTCCTCTTGATGACTTATCGATGGGAACTAGAATAGATATGACAAAGACGCCTGCTATTACGGGAAATAATAAAGAGAAACAAGAAGGAAAAATTTCCAAGCAGTTCCAATCCGATGTTAAAGTTACTGCATCTGCTTTAGAATTATGCTCAAAGGAAACTCAAGAGTCAGATTTATGGATCAAATGGAAAAATTGCTGTCCGACTACAAGAAATGATGGGCCACGTGCTTTTGAAGATGAAGTTAGTATACTTGATATCTCTTCAGGGTTCCTATCTCTTGCCAGAAATTCCTTAGTTCCCAAATCCATCGATAAGAATTTCCTTGAAGATGCCAAAGTTCTTCTACAGCTTGATAAGAAATTCATTCCAGTTGTTTCTGGTGGAATACTGGCTGTTATTGATCAGCATGCTGCAGATGAAAGAATCCGACTTGAAGATCTTCGTCAAAAGTTGTTGTCTGGTGAAGCAAAGACAATAGCCTATCTGGAGGATGAACATGAACTGGTGCTGCCTGAAATTGGGTACCAGTTGTTGTACAACTATAGTGATCAGGTGAAAGAGTGGGGTTGGATCTGTAATATTCATGCTCAAGATTCAAAATCCTTCCAAAGGAATTTGAATATCCTATACAAGCAGGAAACGGTCATCACGCTAATGGCAGTACCTTGCATACTAG
GTGTTAATTTATCTGATGCAGATCTGCTGGAGTTTCTTGATCAGCTTGCTGATACAGATGGCTCATCAACAATGCCGCCATCCGTGCTTCGAGTTCTTAATTCAAAGGCCTGCAGAGGTGCAATTATGTTTGGAGACTCTTTGTTACCTTCAGAGTGTTCCCTTATTGTTGAAGAACTGAAGCAGACTTCTCTGTGTTTCCAATGCGCCCATGGTCGACCAACTACAGTACCTCTCGTGAACTTGGAGGCATTGCACAAGCAGATAAGGGAGATGGAAATATTAGATAAAAATGGTTCGAATGGAACGTGGCACGGGCTGCGGCGACATGAGCTGAGCATTGAACGGATGTTGCAGCAAGTAGGTTCGGCCTGATGGTAAATGCTGCCGTTCCATCAAGGAGAATAGTTGAGAGGGTGCCCCGGTAATACCAGTCTGCAGAAGTGAATGGATTGAGTAGCATTCTTAGATCATGTATACAACCAGTCTACGATGATTTGTGAAAAAGAGCAATGTTTCTGTAGTTTAGTTTTCCTTGGAATTTCGGTTGCAACTCTTCTTATAAACAGTTGTCTGCCCTCTAGCTGCATAAATCCTTGTGCATTTTGCCTGTCATGGTGCTCAATGTGTGTGAATAAAACTAAATGACTTCTTTTAAGTTGAGAAAAATGGCCATTTAATTAGCCCAAATATATGCTCCTTTAATATTAGTCCATTTTTACAGGAGATTCCATATTGAAATAGTTGTCTTAGTGTTATTTCTAGTGCAAGAAAGGTAGGTATCAAAAGAAACTTACAGAAAGACCATATACATCCTACTAAAATATCAATGATATCATGTACCTGCGATTAACAAAAATTAATGGAAGAGAAGAATGATGAAGGCTATTACTAACATTAAAACATTTACATACAAGACTGTGGCTGGGAGCTATAGCTAATTGCTGATCGCTTTATGAACAGGCTTCTTTGTTTTTACCAGCACATAAAGCCTTCCAGATAGGCACCAACATTTTCACCGGTGGACGTAATTCCTAGAGGGGAAAGAATAATTTTCAGCTGGTATAACACAACAATATATAATGCTCATCATGACTCACCGTTTTGACAAGTGCGGTTTTGTACAATCTAGGAAAACGTTCACGAGTATAAATGTAACATTTTGAAATCTAGATGTCAAATAGAAACTAA

Coding sequence (CDS)

TCTTTGGCCATTGTCGCTTTCGCATCTGCTCTGTTTCTGAAAACCTGTTTATCACTCATGAAAAGAGTGAATCGTGAAATGGGGATTATCAAGCCCTTGCCAAAGTCTGTTCGTAGTTCTGTGCGTGCTGGCGTTATACTCTATGATGCCACGAAGGTTGTGGAAGAGCTTGTTTATAATAGCTTGGATGCTGGTGCGTCAAAGGTAGAGAGTCTAATCGACCATCTTATTATTTTGTTTTATGCTGAGGCATGTTTGTCTTCGTCGTTCATCACTGAGCACGTTTCTTCTACAAGTGAAAAAAATACTCCTATGCCTATTGTATCAATTCAGATTTCAATTTTCATTGGCATTGGGACATCCTATGTGAAAGTAGTGGACAATGGATCTGGTATTACTCGGGATGGGTTAGCCTTGCTAGGAGAAAGATATGCAACTTCGAAATTCCATGATCTCATAGACATGGATACCAAAGGCAAAACATTTGGCTTTCGAGGAGAGGCATTGGCTTCCATTTCAGATGTATCGTTGTTGGAGATCATAACCAAAGCATGTGGGAGGGCAAATGGATATCGTAAAGTCATAAAGGCACAGAAGGTTGCAAGTGCTTGTACCTTGGAATTGATGATGATATGGAAGATATTGGTACTACAGGTAACAGTTATTGTTCGAGATCTATTTTACAACCAACCAGTTCGAAGGAAGCATATGCAATTCAGCCCCAAGAAGGTCTTGCAAGCAGTCAAGAAATGTGTAGTCCGTATTGCCCTTGTGCATTCTAAAGTTTCCTTCAAAATTGTAGATAGTGAAAGTGAGAGCATCCTTCTTTACACGAATCCCTCTCCTTCTCCTTTATCACTTTTGAGAAGTGGCTTTGGCAGCGAGGTTTCCAGGTCTCTCCATGAACTAAAAATCGGTGATGGGGGCTTAAAGCTTTCTGGTTATATATGCAGTCCGTTTGATACGTTTACTATCAAGGATCCAGTATTTTATGAAATTGTTTTGACCTCAGCAAGGACAGATATTAACAGACGATTCATTTGCAAGGGCCAAATTCATAAATTACTTAATCAACTGGCCAGTAGATTTGTAAGCCTGAGTCCGCAGACTGACCAAGCCTGTCACAGCAGGAAGAGAAGCAGGTTTCAAGCAAATCCTGCTTATATTTTAAATTTAGATTGCCCTGGATCTTTCTATGACCTAACATTTGAATCATCCAAGACCTTTGTTCAGTTTAAGGACTGGACTTCAATACTTACCTTCATTGAAGAGACCATTCAACAATTTTGGAAAGAAAAATATAGTAGTGGAAAATCTTTGGTCCATACAACCTCCATAGTTGGAGGAGATCAGCTGTGGAAGGATGAAGATAACATGATATCAACAAATTCAGATTTTCGTGAGGATGTTATACGTTTCCCTCCAGAGAGTATTCGATCTCTGAAGAAGAGCAGAATGCGAAGCCCTCGGGCCTCCCTTATTGATTTGTTTTCTCCGTCAGCAATGCTTACAAAAGATGATGACATCTTGTCCAATAGTTTGCATGAAAAGAAGGCATGTGAGAATTCACACACAAGTTCAAGCGAATTGAATGACGTTCACCGACAAGCTAGGATGCAATTTGGTAACCAAGCTGCCGATCATTTCTCAGGATTATGGGGTACTCCCTTGGCAAAATGCTCAACTACAGCTGTCCAAAAAGGTGACAGGCACCCATGGGTACCTGATAACATTTTTGTATCTGAAGACTCCTTTCTGGATAGAAGGCTGGCTTCTCCCAAGAGGTGTGATGACATTGTTGAGGACAATATCTTCAGTTCAGACTTGAAAGGTCAATCTTCTAAAGTGTATATCGATATGATCAATGGGTCTGCTGAAAGTACACCATCTTCTTACTTTCATGAATTTAGTTATGATGACAATATCTTCACGGGTAACAAACCCTCCCTTCGGGGATGCACCTCAGGAAGCAGTTTTCAACTTGAGAGTACTTCCATTCT
TGGTGACAAACTGTTCATTCAAAATGATGTCATCAAAAGAATCCAAAAGCAGGGAATCCCTGATGATGAAGTTGATGTTCTAAAGCTTGACGGTTACATCCAGGGTTCTGATTTTTATGCCGGAGACTCATTTCATGCTGAGTTTACTGAAGAAAATATGTACTCATGTCATTTGGACAAGCACGTGCAGAAGTTTTTCTCAAGTTATCAGACTAGAAATTCCCCAGATGTTCACGTGACCCCAAATCCCAGATTAGCTTCAGAATGGGATGTTGATTGCTTCAGTGTTAGGGATGGGGTTGAAAGGAACTGGAGATCTAGAGATAGGACTCCCTTCAGGGATTTGGTGGATGGTGAGGATAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGGGCTGCGGATTTGATTCTGATATCATGTTGAGAAGTTCCAAAAAGAATTACATACCAAGCTGTATAGATAGTGAACTGATAATTGATGATGTTCTTGATACAAGAGAAGACCTTAGTACTTCCCTTGAAAAATCTAATAATTTTGATCATTCTTCTCCTGTGAGTCCTAATATGCACTCCTGTCAGAAGTATCTTTTCAATTGGAGATTACCTGGAAGAGATTGGGAAAAGGCTTATGGAAGCTCAGAGCTTAAGTTTGGACATCAAGCTTTTAAACAGAGGTACGTTTCTGTTGAAAGGCCTAGAAGATGCAAATCAGCTCCACCTTCTTACAAAAGAAAAACTAGTTTCTATTGCCTGTACCAAAGAAAGGAAGAAAAGCATAATGCCGCCGGTTTCTATGGCCTTGACCAAAGAAAAACTGATAAGTTTAATGCCACAAATTTCTATTGCATGGACCAAGGGAAAGAAGAAAAGCTTAGGGCATCGGCCTTCCTTGACAGCCCACCTCATTTAGAACTAGGTGAGCTGAGAGATTCCAAACATTTCTCTAGTACTAATAATCTTTATATTAAGCCAAGTCCTCTTGATGACTTATCGATGGGAACTAGAATAGATATGACAAAGACGCCTGCTATTACGGGAAATAATAAAGAGAAACAAGAAGGAAAAATTTCCAAGCAGTTCCAATCCGATGTTAAAGTTACTGCATCTGCTTTAGAATTATGCTCAAAGGAAACTCAAGAGTCAGATTTATGGATCAAATGGAAAAATTGCTGTCCGACTACAAGAAATGATGGGCCACGTGCTTTTGAAGATGAAGTTAGTATACTTGATATCTCTTCAGGGTTCCTATCTCTTGCCAGAAATTCCTTAGTTCCCAAATCCATCGATAAGAATTTCCTTGAAGATGCCAAAGTTCTTCTACAGCTTGATAAGAAATTCATTCCAGTTGTTTCTGGTGGAATACTGGCTGTTATTGATCAGCATGCTGCAGATGAAAGAATCCGACTTGAAGATCTTCGTCAAAAGTTGTTGTCTGGTGAAGCAAAGACAATAGCCTATCTGGAGGATGAACATGAACTGGTGCTGCCTGAAATTGGGTACCAGTTGTTGTACAACTATAGTGATCAGGTGAAAGAGTGGGGTTGGATCTGTAATATTCATGCTCAAGATTCAAAATCCTTCCAAAGGAATTTGAATATCCTATACAAGCAGGAAACGGTCATCACGCTAATGGCAGTACCTTGCATACTAG
GTGTTAATTTATCTGATGCAGATCTGCTGGAGTTTCTTGATCAGCTTGCTGATACAGATGGCTCATCAACAATGCCGCCATCCGTGCTTCGAGTTCTTAATTCAAAGGCCTGCAGAGGTGCAATTATGTTTGGAGACTCTTTGTTACCTTCAGAGTGTTCCCTTATTGTTGAAGAACTGAAGCAGACTTCTCTGTGTTTCCAATGCGCCCATGGTCGACCAACTACAGTACCTCTCGTGAACTTGGAGGCATTGCACAAGCAGATAAGGGAGATGGAAATATTAGATAAAAATGGTTCGAATGGAACGTGGCACGGGCTGCGGCGACATGAGCTGAGCATTGAACGGATGTTGCAGCAAGTAGGTTCGGCCTGA

Protein sequence

SLAIVAFASALFLKTCLSLMKRVNREMGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACLSSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYATSKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTLELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIKDPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSIVGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKDDDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQKGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPGRDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIKPSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA
Homology
BLAST of Cp4.1LG08g12770 vs. ExPASy Swiss-Prot
Match: F4JN26 (DNA mismatch repair protein MLH3 OS=Arabidopsis thaliana OX=3702 GN=MLH3 PE=2 SV=2)

HSP 1 Score: 609.8 bits (1571), Expect = 8.3e-173
Identity = 492/1449 (33.95%), Positives = 684/1449 (47.20%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            M  IKPLP+ VR S+R+G+I++D  +VVEELV+NSLDAGA+KV                 
Sbjct: 1    MKTIKPLPEGVRHSMRSGIIMFDMARVVEELVFNSLDAGATKV----------------- 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                      SIF+G+ +  VKVVD+GSG++RD L LLGERYAT
Sbjct: 61   --------------------------SIFVGVVSCSVKVVDDGSGVSRDDLVLLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHD  +++T  +TFGFRGEALASISD+SLLE+ TKA GR NGYRKV+K  K      L
Sbjct: 121  SKFHDFTNVETASETFGFRGEALASISDISLLEVRTKAIGRPNGYRKVMKGSK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +         TV VRDLFY+QPVRRK+MQ SPKKVL+++KKCV RIALVHS VSF +
Sbjct: 181  HLGIDDDRKDSGTTVTVRDLFYSQPVRRKYMQSSPKKVLESIKKCVFRIALVHSNVSFSV 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            +D ES+  L  TNPS S  SLL    G+E   SL ++ + DG L +SG+ C+  D +   
Sbjct: 241  LDIESDEELFQTNPSSSAFSLLMRDAGTEAVNSLCKVNVTDGMLNVSGFECA--DDWKPT 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
            D                      GQ                        + +R+R Q+NP
Sbjct: 301  D----------------------GQ-----------------------QTGRRNRLQSNP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
             YIL + CP   Y+ +FE SKT V+FK W  +L FIE      WK      K  +     
Sbjct: 361  GYILCIACPRRLYEFSFEPSKTHVEFKKWGPVLAFIERITLANWK------KDRILELFD 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
             G D L K +        D  +D IR    S+ S+            +D   P AM    
Sbjct: 421  GGADILAKGD------RQDLIDDKIRLQNGSLFSI---------LHFLDADWPEAM---- 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
                     +K    N H   S L  +   A  +   Q  D+FS        +C      
Sbjct: 481  -----EPAKKKLKRSNDHAPCSSL--LFPSADFK---QDGDYFSPRKDVWSPECEVELKI 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            +  +            DS L  R    +  +D  +            SK     +     
Sbjct: 541  QNPKEQGTVAGFESRTDSLLQSRDIEMQTNEDFPQVTDLLETSLVADSKCRKQFLTRCQI 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRI----- 686
            +TP +  H+F  D ++               +FQ +    L D+L + N + K +     
Sbjct: 601  TTPVNINHDFMKDSDVL--------------NFQFQG---LKDELDVSNCIGKHLLRGCS 660

Query: 687  QKQGIPDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNS 746
             +  +   E  +  ++GY         +S       E   S        +   + +  + 
Sbjct: 661  SRVSLTFHEPKLSHVEGY---------ESVVPMIPNEKQSS-------PRVLETREGGSY 720

Query: 747  PDVH--VTPNPRLASEW-DVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGC-GFDSDIM 806
             DV+   TP+  L S W D D F+ +                      D+GC G   D  
Sbjct: 721  CDVYSDKTPDCSLGSSWQDTDWFTPQ-------------------CSSDRGCVGIGEDF- 780

Query: 807  LRSSKKNYIPSCIDSELIIDDVLDTREDLST---SLEKSNNFDHSSPVSPNMHSCQKGCG 866
                  N  P         D+ + +++ LS+       + +F  SS  SP M+S      
Sbjct: 781  ------NITPIDTAEFDSYDEKVGSKKYLSSVNVGSSVTGSFCLSSEWSP-MYSTPSATK 840

Query: 867  FDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQK 926
            ++S+         Y   C   E  +   L    D       +NN      V P M  C+ 
Sbjct: 841  WESE---------YQKGCRILEQSLR--LGRMPDPEFCFSAANNIKFDHEVIPEMDCCET 900

Query: 927  GCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHS 986
            G                     DS   I +     + +  S     ++ H+  V  + +S
Sbjct: 901  G--------------------TDSFTAIQNCTQLADKICKS-----SWGHADDVRIDQYS 960

Query: 987  CQKYLFNWRLPGRDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCL 1046
             +K                     KF +    Q     +R +R +SAPP Y+ K  F  L
Sbjct: 961  IRKE--------------------KFSYMDGTQNNAGKQRSKRSRSAPPFYREKKRFISL 1020

Query: 1047 YQRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQ---GKEEKLRASAFLD-SPPHLELGE 1106
              + + K           + +D     +  C+ Q     +  L+ S   D S  H++  E
Sbjct: 1021 SCKSDTK----------PKNSDPSEPDDLECLTQPCNASQMHLKCSILDDVSYDHIQETE 1080

Query: 1107 LRDSKHFSSTNNLYIKPSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVT 1166
                K  SS ++L          S G R   ++T      +++  E   S++F   +K T
Sbjct: 1081 ----KRLSSASDL--------KASAGCRTVHSET-----QDEDVHEDFSSEEFLDPIKST 1140

Query: 1167 ASALELCSKETQESDLWIKWK-NCCPTTRNDGPRAFEDEVSILDISSGFLSL-ARNSLVP 1226
                              KW+ NC  +           +  + DISSG L L +  SLVP
Sbjct: 1141 T-----------------KWRHNCAVSQVPKESHELHGQDGVFDISSGLLHLRSDESLVP 1153

Query: 1227 KSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIA 1286
            +SI+++ LEDAKVL Q+DKK+IP+V+ G +A++DQHAADERIRLE+LR K+L+G+A+T+ 
Sbjct: 1201 ESINRHSLEDAKVLQQVDKKYIPIVACGTVAIVDQHAADERIRLEELRTKVLAGKARTVT 1153

Query: 1287 YLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMA 1346
            YL  + ELVLPE+GYQLL +YS+Q+++WGWICNI  + S SF++N++I+ ++ T ITL A
Sbjct: 1261 YLSADQELVLPEMGYQLLQSYSEQIRDWGWICNITVEGSTSFKKNMSIIQRKPTPITLNA 1153

Query: 1347 VPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSL 1406
            VPCILGVNLSD DLLEFL QLADTDGSST+PPSVLRVLNSKACRGAIMFGDSLLPSECSL
Sbjct: 1321 VPCILGVNLSDVDLLEFLQQLADTDGSSTIPPSVLRVLNSKACRGAIMFGDSLLPSECSL 1153

Query: 1407 IVEELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIE 1458
            I++ LKQTSLCFQCAHGRPTTVPLV+L+ALHKQI ++           WHGL+R E++++
Sbjct: 1381 IIDGLKQTSLCFQCAHGRPTTVPLVDLKALHKQIAKL------SGRQVWHGLQRREITLD 1153
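
The percentages in each HSP summary line are the matching counts divided by the alignment length. As a minimal sketch, the following re-derives them from the raw counts so a reported value can be checked. The function name `parse_hsp_stats` and the returned dictionary layout are hypothetical, and the pattern accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Matches e.g. "Identity = 492/1449 (33.95%)"; tolerates the "Postives" spelling.
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages per statistic."""
    stats = {}
    for label, hits, length, reported in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        hits, length = int(hits), int(length)
        stats[key] = {
            "hits": hits,
            "length": length,
            "reported_pct": float(reported),
            # Recomputed from the counts, rounded to two decimals like the report.
            "computed_pct": round(100 * hits / length, 2),
        }
    return stats

line = "Identity = 492/1449 (33.95%), Positives = 684/1449 (47.20%), Query Frame = 0"
for name, s in parse_hsp_stats(line).items():
    print(name, s["hits"], s["length"], s["reported_pct"], s["computed_pct"])
```

The same function applies unchanged to the summary lines of the other hits on this page.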

BLAST of Cp4.1LG08g12770 vs. ExPASy Swiss-Prot
Match: Q9UHC1 (DNA mismatch repair protein Mlh3 OS=Homo sapiens OX=9606 GN=MLH3 PE=1 SV=3)

HSP 1 Score: 131.3 bits (329), Expect = 8.6e-29
Identity = 111/412 (26.94%), Positives = 195/412 (47.33%), Query Frame = 0

Query: 29  IIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACLSS 88
           +IK L   V++ +R+G+ +    + VEEL  NS+DA A                  C   
Sbjct: 1   MIKCLSVEVQAKLRSGLAISSLGQCVEELALNSIDAEAK-----------------C--- 60

Query: 89  SFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYATSK 148
                                  +++ + + T  V+V+DNG G+  D +  +G RY TSK
Sbjct: 61  -----------------------VAVRVNMETFQVQVIDNGFGMGSDDVEKVGNRYFTSK 120

Query: 149 FHDLIDMDTKGKTFGFRGEALASISDV-SLLEIITKACGRANGYRKVIKAQKVASACTLE 208
            H + D++   + +GFRGEALA+I+D+ S +EI +K       + K+ ++ K   AC  +
Sbjct: 121 CHSVQDLENP-RFYGFRGEALANIADMASAVEISSKKNRTMKTFVKLFQSGKALKACEAD 180

Query: 209 LMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKIV 268
           +           TV V +LFY  PVRRK M   P+   + V++ +  ++L+H  +SF + 
Sbjct: 181 VTR----ASAGTTVTVYNLFYQLPVRRKCM--DPRLEFEKVRQRIEALSLMHPSISFSLR 240

Query: 269 DSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIKD 328
           +  S S++L    +    S     +G   S+ L E+       +LSGYI S  +    K+
Sbjct: 241 NDVSGSMVLQLPKTKDVCSRFCQIYGLGKSQKLREISFKYKEFELSGYISS--EAHYNKN 300

Query: 329 PVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSL-----SPQTDQACHS-RKRSR 388
             F           +N+R + + ++HKL++ L  +   +      P + Q   S R RS 
Sbjct: 301 MQF---------LFVNKRLVLRTKLHKLIDFLLRKESIICKPKNGPTSRQMNSSLRHRST 351

Query: 389 FQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEK 434
            +    Y++N+ C    YD+  E +KT ++F++W ++L  I+E ++ F K++
Sbjct: 361 PELYGIYVINVQCQFCEYDVCMEPAKTLIEFQNWDTLLFCIQEGVKMFLKQE 351


HSP 2 Score: 98.6 bits (244), Expect = 6.2e-19
Identity = 87/288 (30.21%), Positives = 132/288 (45.83%), Query Frame = 0

Query: 1192 LDISSGFL-SLA---RNSLVPKSIDKNFLEDAKVLLQLDKKFIPVV-----------SGG 1251
            +D+SSG   SLA    N L P    K  +   +VL Q+D KFI  +            G 
Sbjct: 1158 VDVSSGQAESLAVKIHNILYPYRFTKGMIHSMQVLQQVDNKFIACLMSTKTEENGEAGGN 1217

Query: 1252 ILAVIDQHAADERIRLEDL-------RQKLLSGEAKTI-AYLEDEHELVLPEIGYQLLYN 1311
            +L ++DQHAA ERIRLE L       +Q   SG  K + + L    E+ + E   +LL+ 
Sbjct: 1218 LLVLVDQHAAHERIRLEQLIIDSYEKQQAQGSGRKKLLSSTLIPPLEITVTEEQRRLLWC 1277

Query: 1312 YSDQVKEWGW-ICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFLD 1371
            Y   +++ G         DS      + + + +     L      +  ++ +  + E L+
Sbjct: 1278 YHKNLEDLGLEFVFPDTSDSLVLVGKVPLCFVEREANELRRGRSTVTKSIVEEFIREQLE 1337

Query: 1372 QLADTDG-SSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1431
             L  T G   T+P +V +VL S+AC GAI F D L   E   ++E L    L FQCAHGR
Sbjct: 1338 LLQTTGGIQGTLPLTVQKVLASQACHGAIKFNDGLSLQESCRLIEALSSCQLPFQCAHGR 1397

Query: 1432 PTTVPLVNLEALHKQIREMEILDK-NGSNGTWHGLRRHELSIERMLQQ 1454
            P+ +PL +++ L ++ +    L K       W    + E    + LQQ
Sbjct: 1398 PSMLPLADIDHLEQEKQIKPNLTKLRKMAQAWRLFGKAECDTRQSLQQ 1445

BLAST of Cp4.1LG08g12770 vs. ExPASy Swiss-Prot
Match: B8CX97 (DNA mismatch repair protein MutL OS=Halothermothrix orenii (strain H 168 / OCM 544 / DSM 9562) OX=373903 GN=mutL PE=3 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 1.4e-23
Identity = 87/292 (29.79%), Positives = 145/292 (49.66%), Query Frame = 0

Query: 30  IKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACLSSS 89
           IK LP+SV + + AG ++     VV+ELV NSLDAG++K+                    
Sbjct: 4   IKRLPESVANQISAGEVVERPASVVKELVENSLDAGSNKI-------------------- 63

Query: 90  FITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYATSKF 149
                            ++ I+       G   ++V DNG GI  D + +  +RYATSK 
Sbjct: 64  -----------------LIEIENG-----GKDLIRVKDNGHGIPSDEIEIAFDRYATSKI 123

Query: 150 HDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVAS--ACTLE 209
            D+ D+ +  K+ GFRGEALASI+ VS+L+II++   +    +  +K  KV S   C   
Sbjct: 124 TDINDLYSL-KSLGFRGEALASIASVSILDIISRTKSQTKAIKMRLKGGKVISKEPCGAS 183

Query: 210 LMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKIV 269
                    +   +IV+DLF+N P R K+++ + +   + +   + R AL +  V+F ++
Sbjct: 184 ---------VGTDIIVKDLFFNTPARYKYLK-TTRNEFKHISNIITREALAYPGVNFTLI 240

Query: 270 DSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSP 320
              +  I+L T  +   L  + + +G E+++SL ++   D  +K+SGYI  P
Sbjct: 244 --HNGRIVLKTPGTGKTLDCIYAIYGKEMAQSLVKIDYEDRYIKVSGYISRP 240

BLAST of Cp4.1LG08g12770 vs. ExPASy Swiss-Prot
Match: Q01QW7 (DNA mismatch repair protein MutL OS=Solibacter usitatus (strain Ellin6076) OX=234267 GN=mutL PE=3 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 6.0e-22
Identity = 118/419 (28.16%), Positives = 182/419 (43.44%), Query Frame = 0

Query: 27  MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
           MG I+ LP  V + + AG ++     VV+EL+ NSLDAGA++V                 
Sbjct: 1   MGRIRILPDQVANKIAAGEVVERPASVVKELLENSLDAGATEV----------------- 60

Query: 87  SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                   ++ +  G G   +++VD+G G+ RD   L  ER+AT
Sbjct: 61  ------------------------RVEVEAG-GRRLIRIVDDGFGMLRDDALLAFERHAT 120

Query: 147 SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
           SK  D+ D+     T GFRGEAL SI+ VS L + T++     G R  I   K+      
Sbjct: 121 SKLRDVKDL-LSIATLGFRGEALPSIASVSRLLLETRSMEEPTGTRIEIAGGKM------ 180

Query: 207 ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
            L      L     + VRDLFYN P RRK ++  P + L  +   V   +L H   SF++
Sbjct: 181 -LRCEEAALGGGTVITVRDLFYNVPARRKFLRTEPTE-LAHIASLVTHYSLAHPDKSFRL 240

Query: 267 VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSP------F 326
               +E  LL   P  S    +   FGS++   L E+ + +  L L      P      +
Sbjct: 241 STGPTE--LLGVTPVASMKERVYQVFGSQILDELVEIGVRERDLFLPPPSVPPSQAIAEY 300

Query: 327 DTFTIKDPVFYEIVLTS--ARTDI---NRRFI---CKGQIHK---LLNQLASRFVSLSPQ 386
            +   +DP F    LT   +R  I   NR  I     G++ +   +L+ L+S + +L P 
Sbjct: 301 RSTEPEDPPFRRFRLTGFFSRPQIQKSNRNSIYIFVNGRLIRDRLVLHALSSAYHNLMPA 353

Query: 387 TDQACHSRKRSRFQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQ 429
           +             A P  +L L+C     D+    SKT V+F+  + +  FI ++I++
Sbjct: 361 S-------------AYPFALLFLECDAEEVDVNVHPSKTEVRFRHGSFLHDFIRDSIRE 353

BLAST of Cp4.1LG08g12770 vs. ExPASy Swiss-Prot
Match: Q8A120 (DNA mismatch repair protein MutL OS=Bacteroides thetaiotaomicron (strain ATCC 29148 / DSM 2079 / NCTC 10582 / E50 / VPI-5482) OX=226186 GN=mutL PE=3 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 1.5e-20
Identity = 100/405 (24.69%), Positives = 181/405 (44.69%), Query Frame = 0

Query: 29  IIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACLSS 88
           II  LP SV + + AG ++     V++ELV N++DA A  +     H+++    + C   
Sbjct: 4   IIHLLPDSVANQIAAGEVIQRPASVIKELVENAIDADAQNI-----HVLVTDAGKTC--- 63

Query: 89  SFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYATSK 148
                                             ++++D+G G++     L  ER+ATSK
Sbjct: 64  ----------------------------------IQIIDDGKGMSETDARLSFERHATSK 123

Query: 149 FHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTLEL 208
             +  D+    +T GFRGEALASI+ V+ +E+ T+      G + VI   KV S    E 
Sbjct: 124 IREAADLFAL-RTMGFRGEALASIAAVAQVELKTRLESEELGTKLVIAGSKVESQ---EA 183

Query: 209 MMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKIVD 268
           +   K         V++LF+N P RRK ++ +  ++   + +   RIALVH +V+F +  
Sbjct: 184 VSCSK----GSNFSVKNLFFNVPARRKFLKANSTELSNILAE-FERIALVHPEVAFSLYS 243

Query: 269 SESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIKDP 328
           ++SE   L  +P    +  +   FG ++++ L  +++    +K+SGYI  P +T   K  
Sbjct: 244 NDSELFNLPVSPLRQRILAI---FGKKLNQQLLNIEVNTTMVKISGYIAKP-ETARKKGA 303

Query: 329 VFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANPAY 388
             Y  V        N R++     HK + +   + + +  Q     +      F+ +PA 
Sbjct: 304 HQYFFV--------NGRYMRHPYFHKAVMEAYEQLIPVGEQVSYFIY------FEVDPAN 329

Query: 389 ILNLDCPGSFYDLTFESSKTFVQFKD----WTSILTFIEETIQQF 430
           I          D+    +KT ++F++    W  +   ++E++ +F
Sbjct: 364 I----------DVNIHPTKTEIKFENEQAIWQILSASVKESLGKF 329

BLAST of Cp4.1LG08g12770 vs. NCBI nr
Match: XP_023539512.1 (DNA mismatch repair protein MLH3 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2656 bits (6885), Expect = 0.0
Identity = 1353/1431 (94.55%), Positives = 1357/1431 (94.83%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI
Sbjct: 181  YLGIDDDMEDIGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK
Sbjct: 241  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE
Sbjct: 541  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
            PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK
Sbjct: 781  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
            NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS
Sbjct: 841  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG
Sbjct: 901  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG
Sbjct: 961  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1080

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL
Sbjct: 1081 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1140

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1200

Query: 1227 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1286
            KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL
Sbjct: 1201 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1260

Query: 1287 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1346
            YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL
Sbjct: 1261 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1320

Query: 1347 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1406
            DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR
Sbjct: 1321 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1374

Query: 1407 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA
Sbjct: 1381 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1374

BLAST of Cp4.1LG08g12770 vs. NCBI nr
Match: KAG7027832.1 (DNA mismatch repair protein MLH3, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 2352 bits (6094), Expect = 0.0
Identity = 1228/1457 (84.28%), Positives = 1243/1457 (85.31%), Query Frame = 0

Query: 1    SLAIVAFASALFLKTCLSLMKRVNREMGIIKPLPKSVRSSVRAGVILYDATKVVEELVYN 60
            SLAIVAFA ALFLK+CLSLMKR +  MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYN
Sbjct: 101  SLAIVAFAYALFLKSCLSLMKR-SESMGIIKPLPKSVRSSVRAGVILYDATKVVEELVYN 160

Query: 61   SLDAGASKVESLIDHLIILFYAEACLSSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGT 120
            SLDAGASK                                           ISIFIGIGT
Sbjct: 161  SLDAGASK-------------------------------------------ISIFIGIGT 220

Query: 121  SYVKVVDNGSGITRDGLALLGERYATSKFHDLIDMDTKGKTFGFRGEALASISDVSLLEI 180
            SYVKVVDNGSGITRDGLALLGERYATSKFHDLIDM TKGKTFGFRGEALASISDVSL+EI
Sbjct: 221  SYVKVVDNGSGITRDGLALLGERYATSKFHDLIDMHTKGKTFGFRGEALASISDVSLVEI 280

Query: 181  ITKACGRANGYRKVIKAQKVASACTLELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFS 240
            ITKACGRANGYRKV+K  K      L L +   +  +  TVIVRDLFYNQPVRRKHMQFS
Sbjct: 281  ITKACGRANGYRKVMKGCK-----CLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQFS 340

Query: 241  PKKVLQAVKKCVVRIALVHSKVSFKIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSL 300
            PKKVLQAVKKCVVR ALVHSKVSFKIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSL
Sbjct: 341  PKKVLQAVKKCVVRTALVHSKVSFKIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSL 400

Query: 301  HELKIGDGGLKLSGYICSPFDTFTIKDPVFYEIVLTSARTDINRRFICKGQIHKLLNQLA 360
            HELKIGDG LKLSGYICSPFDTFTIKDPVFYEIVLTSARTDINRRFICKGQIHK LNQLA
Sbjct: 401  HELKIGDGDLKLSGYICSPFDTFTIKDPVFYEIVLTSARTDINRRFICKGQIHKSLNQLA 460

Query: 361  SRFVSLSPQTDQACHSRKRSRFQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILT 420
            SRFVSLSPQTDQACHSRKRSRFQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILT
Sbjct: 461  SRFVSLSPQTDQACHSRKRSRFQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILT 520

Query: 421  FIEETIQQFWKEKYSSGKSLVHTTSIVGGDQLWKDEDNMISTNSDFREDVIRFPPESIRS 480
            FIEETIQQFWKE YSSGKSLVHT  IVGGDQLWKDED MISTNSDFREDVI FPPESIRS
Sbjct: 521  FIEETIQQFWKENYSSGKSLVHTNPIVGGDQLWKDEDYMISTNSDFREDVILFPPESIRS 580

Query: 481  LKKSRMRSPRASLIDLFSPSAMLTKDDDILSNSLHEKKACENSHTSSSELNDVHRQARMQ 540
            +KKSRMRSP+A LIDLFSPSAMLTKDD ILSNSLHEKKACENSHTSSSELNDVH QARMQ
Sbjct: 581  VKKSRMRSPQACLIDLFSPSAMLTKDD-ILSNSLHEKKACENSHTSSSELNDVHLQARMQ 640

Query: 541  FGNQAADHFSGLWGTPLAKCSTTAVQKGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIV 600
            FGNQAADHFSGLWGTPLAKCSTTAVQKGDRHPWVPDNIFVSEDSFLDRRLASPKRC DIV
Sbjct: 641  FGNQAADHFSGLWGTPLAKCSTTAVQKGDRHPWVPDNIFVSEDSFLDRRLASPKRCGDIV 700

Query: 601  EDNIFSSDLKGQSSKVYIDMINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQ 660
            EDNIFSSDLKGQSSKVYIDMINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQ
Sbjct: 701  EDNIFSSDLKGQSSKVYIDMINGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQ 760

Query: 661  LESTSILGDKLFIQNDVIKRIQKQGIPDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENM 720
            LESTSILGDKL+IQND IKRIQKQGIPDDEVDVLKLDGYIQGSDFYAG S HAEF EEN+
Sbjct: 761  LESTSILGDKLYIQNDDIKRIQKQGIPDDEVDVLKLDGYIQGSDFYAGGSLHAEFAEENI 820

Query: 721  YSCHLDKHVQKFFSSYQTRNSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFR 780
            YSCHLDKHVQKFFSSYQTRNSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFR
Sbjct: 821  YSCHLDKHVQKFFSSYQTRNSPDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFR 880

Query: 781  DLVDGEDKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSS 840
            DLVDGEDKGCGF                                                
Sbjct: 881  DLVDGEDKGCGF------------------------------------------------ 940

Query: 841  PVSPNMHSCQKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFD 900
                                                                        
Sbjct: 941  ------------------------------------------------------------ 1000

Query: 901  HSSPVSPNMHSCQKGCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSN 960
                              DSDIMLRSSKKNYIPSCIDS+LIIDDVLDTREDLSTSLEKSN
Sbjct: 1001 ------------------DSDIMLRSSKKNYIPSCIDSKLIIDDVLDTREDLSTSLEKSN 1060

Query: 961  NFDHSSPVSPNMHSCQKYLFNWRLPGRDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKS 1020
            NF+HSSPVSPNMHSCQKYLFNWRLPGRDWEKAYGSSELKFGHQAFKQ+YVS ERPRRCKS
Sbjct: 1061 NFEHSSPVSPNMHSCQKYLFNWRLPGRDWEKAYGSSELKFGHQAFKQKYVSAERPRRCKS 1120

Query: 1021 APPSYKRKTSFYCLYQRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFL 1080
            APPSYKRKTSFYCLYQRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFL
Sbjct: 1121 APPSYKRKTSFYCLYQRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFL 1180

Query: 1081 DSPPHLELGELRDSKHFSSTNNLYIKPSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKIS 1140
            DSPPHLELGELRDSKHFS TNNLY+KPSPLDDLSMGTR DMTKTP ITGNNKEKQEGK S
Sbjct: 1181 DSPPHLELGELRDSKHFSGTNNLYVKPSPLDDLSMGTRTDMTKTPTITGNNKEKQEGKFS 1240

Query: 1141 KQFQSDVKVTASALELCSKETQESDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLS 1200
            KQFQSDVKVTASALELCSKETQESDLWIKWKNCCPTTRNDGPRAFEDEVSILDISS FLS
Sbjct: 1241 KQFQSDVKVTASALELCSKETQESDLWIKWKNCCPTTRNDGPRAFEDEVSILDISSEFLS 1300

Query: 1201 LARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLL 1260
            LARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLL
Sbjct: 1301 LARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLL 1360

Query: 1261 SGEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQ 1320
            SGEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQ
Sbjct: 1361 SGEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQ 1381

Query: 1321 ETVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDS 1380
            ETVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDS
Sbjct: 1421 ETVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDS 1381

Query: 1381 LLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGL 1440
            LLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGL
Sbjct: 1481 LLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGL 1381

Query: 1441 RRHELSIERMLQQVGSA 1457
            RRHELSIERMLQ VGSA
Sbjct: 1541 RRHELSIERMLQHVGSA 1381

BLAST of Cp4.1LG08g12770 vs. NCBI nr
Match: XP_022939754.1 (DNA mismatch repair protein MLH3 isoform X1 [Cucurbita moschata])

HSP 1 Score: 2314 bits (5997), Expect = 0.0
Identity = 1207/1431 (84.35%), Positives = 1221/1431 (85.32%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI
Sbjct: 181  YLGIDDDMEDIGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VD ESESILLY NPSPSPLSLLRSGFGSEVSRSLHELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDIESESILLYPNPSPSPLSLLRSGFGSEVSRSLHELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDNMISTNSDFREDVI FPPESIRS+KKSRMRSP+ASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNMISTNSDFREDVILFPPESIRSVKKSRMRSPQASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEK ACENSHTSSSELNDVHRQARMQFGNQAADHFSGLW TPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEK-ACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWSTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE
Sbjct: 541  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGSDFYAGDS HAEFTEEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG
Sbjct: 901  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            +DWEKAYGSSELKFGHQAFKQ+YVSVERPRRCKSAPPSYKRKTSFYCLY+RKEEKHNAAG
Sbjct: 961  KDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLEL ELRDSKHFSSTNNLYIK
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYIK 1080

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKET+ESDL
Sbjct: 1081 PSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESDL 1140

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1200

Query: 1227 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1286
            KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL
Sbjct: 1201 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1247

Query: 1287 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1346
            YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL
Sbjct: 1261 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1247

Query: 1347 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1406
            DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIV+ELKQTSLCFQCAHGR
Sbjct: 1321 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGR 1247

Query: 1407 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQ +GSA
Sbjct: 1381 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 1247

BLAST of Cp4.1LG08g12770 vs. NCBI nr
Match: KAG6596281.1 (DNA mismatch repair protein MLH3, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2283 bits (5917), Expect = 0.0
Identity = 1192/1431 (83.30%), Positives = 1209/1431 (84.49%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDM TKGKTFGFRGEALASISDVSL+EIITKACGRANGYRKV+K  K      L
Sbjct: 121  SKFHDLIDMHTKGKTFGFRGEALASISDVSLVEIITKACGRANGYRKVMKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVR ALVHSKVSFKI
Sbjct: 181  YLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRTALVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +       +RTDINRRFICKGQIHK LNQLASRFVSLSPQTDQACHSRKRSRFQANP
Sbjct: 301  AVQYV------SRTDINRRFICKGQIHKSLNQLASRFVSLSPQTDQACHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKE YSSGKSLVHT  I
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKENYSSGKSLVHTNPI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDED MIST+SDFREDVI FPPESIRS+KKSRMRSP+A LIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDYMISTDSDFREDVILFPPESIRSVKKSRMRSPQACLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            D ILSNSLHEKKACENSHTSSSELNDVH QARMQFGNQA DHFSGLWGTPLAKCSTTAVQ
Sbjct: 481  D-ILSNSLHEKKACENSHTSSSELNDVHLQARMQFGNQAGDHFSGLWGTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            KGDRHPWVPDNIFVSEDSFLDRRLASPKRC DIVEDNIFSSDLKGQSSKVYIDMINGSAE
Sbjct: 541  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCGDIVEDNIFSSDLKGQSSKVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQND IKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDDIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGSDFYAG S HAEF EEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSDFYAGGSLHAEFAEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDS+LIIDDVLDTREDLSTSLEKSNNF+HSSPVSPNMHSCQKYLFNWRLPG
Sbjct: 901  SKKNYIPSCIDSKLIIDDVLDTREDLSTSLEKSNNFEHSSPVSPNMHSCQKYLFNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            RDWEKAYGSSELKFGHQ FK++YVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG
Sbjct: 961  RDWEKAYGSSELKFGHQVFKRKYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSP HLELGELRDSKHFS TNNLY+K
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPSHLELGELRDSKHFSGTNNLYVK 1080

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTKTP ITGNNKEKQEGK SKQFQSDVKVTASALELCSKETQESDL
Sbjct: 1081 PSPLDDLSMGTRTDMTKTPTITGNNKEKQEGKFSKQFQSDVKVTASALELCSKETQESDL 1140

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISS FLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSEFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1200

Query: 1227 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1286
            KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL
Sbjct: 1201 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1250

Query: 1287 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1346
            YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL
Sbjct: 1261 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1250

Query: 1347 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1406
            DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR
Sbjct: 1321 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1250

Query: 1407 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQ VGSA
Sbjct: 1381 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHVGSA 1250

BLAST of Cp4.1LG08g12770 vs. NCBI nr
Match: XP_022971390.1 (DNA mismatch repair protein MLH3 isoform X1 [Cucurbita maxima])

HSP 1 Score: 2273 bits (5890), Expect = 0.0
Identity = 1184/1431 (82.74%), Positives = 1206/1431 (84.28%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MG+IKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGMIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSL+EIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLVEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVR +LVHSKVSFKI
Sbjct: 181  YLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRTSLVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VDSESESILLYTNPSPSPLSLLRSGFGSE+SRSL ELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDSESESILLYTNPSPSPLSLLRSGFGSEISRSLRELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQ CHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQVCHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDW SILTFIEETIQQFWKEKYSSGKSLVHTT I
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWASILTFIEETIQQFWKEKYSSGKSLVHTTPI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDN+ISTNSDFREDVI FPPESIRS+KKSRMRSP+ASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNIISTNSDFREDVILFPPESIRSVKKSRMRSPQASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEKKACENSHTSSSELNDVH+QARMQFGNQAADHFSGLWGTPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEKKACENSHTSSSELNDVHQQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
             GDRHPWVPDNIFVSEDSFLDRRLA PKRCDDIVEDNIFSSDLKGQSS+VYIDMINGSAE
Sbjct: 541  NGDRHPWVPDNIFVSEDSFLDRRLAFPKRCDDIVEDNIFSSDLKGQSSEVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGS FYAGDS HAEF EEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSGFYAGDSLHAEFAEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVD EDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDCEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDS+LIIDDVLD REDLSTSLEKSNNF+HSSPVSPNMHSCQKYL NWRLPG
Sbjct: 901  SKKNYIPSCIDSKLIIDDVLDIREDLSTSLEKSNNFEHSSPVSPNMHSCQKYLSNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            RDWEKAYGSSELKFGH+AFKQ+YVSVER RRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG
Sbjct: 961  RDWEKAYGSSELKFGHKAFKQKYVSVERRRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGK+EKLRASAFLDSPPHLELG+LRDSKHFS TNNLYI 
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKDEKLRASAFLDSPPHLELGQLRDSKHFSGTNNLYIN 1080

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTK P ITGNNKEKQEGK+SKQFQSDVKVTASALELCSKETQES L
Sbjct: 1081 PSPLDDLSMGTRTDMTKMPTITGNNKEKQEGKVSKQFQSDVKVTASALELCSKETQESYL 1140

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1200

Query: 1227 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1286
            KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL
Sbjct: 1201 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1248

Query: 1287 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1346
            YNYSDQVKEWGWICNIHAQDSK FQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL
Sbjct: 1261 YNYSDQVKEWGWICNIHAQDSKCFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1248

Query: 1347 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1406
            DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR
Sbjct: 1321 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1248

Query: 1407 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            PTTVPLVNLEALHKQIREMEILDKNG NGTWHGLRRHELSIERMLQ VGSA
Sbjct: 1381 PTTVPLVNLEALHKQIREMEILDKNGLNGTWHGLRRHELSIERMLQHVGSA 1248

BLAST of Cp4.1LG08g12770 vs. ExPASy TrEMBL
Match: A0A6J1FI48 (DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445300 PE=3 SV=1)

HSP 1 Score: 2314 bits (5997), Expect = 0.0
Identity = 1207/1431 (84.35%), Positives = 1221/1431 (85.32%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI
Sbjct: 181  YLGIDDDMEDIGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VD ESESILLY NPSPSPLSLLRSGFGSEVSRSLHELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDIESESILLYPNPSPSPLSLLRSGFGSEVSRSLHELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDNMISTNSDFREDVI FPPESIRS+KKSRMRSP+ASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNMISTNSDFREDVILFPPESIRSVKKSRMRSPQASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEK ACENSHTSSSELNDVHRQARMQFGNQAADHFSGLW TPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEK-ACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWSTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE
Sbjct: 541  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGSDFYAGDS HAEFTEEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG
Sbjct: 901  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            +DWEKAYGSSELKFGHQAFKQ+YVSVERPRRCKSAPPSYKRKTSFYCLY+RKEEKHNAAG
Sbjct: 961  KDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLEL ELRDSKHFSSTNNLYIK
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYIK 1080

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKET+ESDL
Sbjct: 1081 PSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESDL 1140

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1200

Query: 1227 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1286
            KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL
Sbjct: 1201 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1247

Query: 1287 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1346
            YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL
Sbjct: 1261 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1247

Query: 1347 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1406
            DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIV+ELKQTSLCFQCAHGR
Sbjct: 1321 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVDELKQTSLCFQCAHGR 1247

Query: 1407 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQ +GSA
Sbjct: 1381 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQHIGSA 1247

BLAST of Cp4.1LG08g12770 vs. ExPASy TrEMBL
Match: A0A6J1I5L5 (DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470127 PE=3 SV=1)

HSP 1 Score: 2273 bits (5890), Expect = 0.0
Identity = 1184/1431 (82.74%), Positives = 1206/1431 (84.28%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MG+IKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGMIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSL+EIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLVEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVR +LVHSKVSFKI
Sbjct: 181  YLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRTSLVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VDSESESILLYTNPSPSPLSLLRSGFGSE+SRSL ELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDSESESILLYTNPSPSPLSLLRSGFGSEISRSLRELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQ CHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQVCHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDW SILTFIEETIQQFWKEKYSSGKSLVHTT I
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWASILTFIEETIQQFWKEKYSSGKSLVHTTPI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDN+ISTNSDFREDVI FPPESIRS+KKSRMRSP+ASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNIISTNSDFREDVILFPPESIRSVKKSRMRSPQASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEKKACENSHTSSSELNDVH+QARMQFGNQAADHFSGLWGTPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEKKACENSHTSSSELNDVHQQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
             GDRHPWVPDNIFVSEDSFLDRRLA PKRCDDIVEDNIFSSDLKGQSS+VYIDMINGSAE
Sbjct: 541  NGDRHPWVPDNIFVSEDSFLDRRLAFPKRCDDIVEDNIFSSDLKGQSSEVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGS FYAGDS HAEF EEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSGFYAGDSLHAEFAEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVD EDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDCEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDS+LIIDDVLD REDLSTSLEKSNNF+HSSPVSPNMHSCQKYL NWRLPG
Sbjct: 901  SKKNYIPSCIDSKLIIDDVLDIREDLSTSLEKSNNFEHSSPVSPNMHSCQKYLSNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            RDWEKAYGSSELKFGH+AFKQ+YVSVER RRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG
Sbjct: 961  RDWEKAYGSSELKFGHKAFKQKYVSVERRRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGK+EKLRASAFLDSPPHLELG+LRDSKHFS TNNLYI 
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKDEKLRASAFLDSPPHLELGQLRDSKHFSGTNNLYIN 1080

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTK P ITGNNKEKQEGK+SKQFQSDVKVTASALELCSKETQES L
Sbjct: 1081 PSPLDDLSMGTRTDMTKMPTITGNNKEKQEGKVSKQFQSDVKVTASALELCSKETQESYL 1140

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1200

Query: 1227 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1286
            KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL
Sbjct: 1201 KKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELVLPEIGYQLL 1248

Query: 1287 YNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1346
            YNYSDQVKEWGWICNIHAQDSK FQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL
Sbjct: 1261 YNYSDQVKEWGWICNIHAQDSKCFQRNLNILYKQETVITLMAVPCILGVNLSDADLLEFL 1248

Query: 1347 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1406
            DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR
Sbjct: 1321 DQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGR 1248

Query: 1407 PTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            PTTVPLVNLEALHKQIREMEILDKNG NGTWHGLRRHELSIERMLQ VGSA
Sbjct: 1381 PTTVPLVNLEALHKQIREMEILDKNGLNGTWHGLRRHELSIERMLQHVGSA 1248

BLAST of Cp4.1LG08g12770 vs. ExPASy TrEMBL
Match: A0A6J1FIL6 (DNA mismatch repair protein MLH3 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445300 PE=3 SV=1)

HSP 1 Score: 1893 bits (4903), Expect = 0.0
Identity = 997/1218 (81.86%), Positives = 1009/1218 (82.84%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI
Sbjct: 181  YLGIDDDMEDIGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VD ESESILLY NPSPSPLSLLRSGFGSEVSRSLHELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDIESESILLYPNPSPSPLSLLRSGFGSEVSRSLHELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDNMISTNSDFREDVI FPPESIRS+KKSRMRSP+ASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNMISTNSDFREDVILFPPESIRSVKKSRMRSPQASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEK ACENSHTSSSELNDVHRQARMQFGNQAADHFSGLW TPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEK-ACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWSTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE
Sbjct: 541  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGSDFYAGDS HAEFTEEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSDFYAGDSLHAEFTEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG
Sbjct: 901  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            +DWEKAYGSSELKFGHQAFKQ+YVSVERPRRCKSAPPSYKRKTSFYCLY+RKEEKHNAAG
Sbjct: 961  KDWEKAYGSSELKFGHQAFKQKYVSVERPRRCKSAPPSYKRKTSFYCLYRRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLEL ELRDSKHFSSTNNLYIK
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELAELRDSKHFSSTNNLYIK 1034

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKET+ESDL
Sbjct: 1081 PSPLDDLSMGTRTDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETRESDL 1034

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1034

Query: 1227 KKFIPVVSGGILAVIDQH 1244
            KKFIPVVSGGILAVIDQH
Sbjct: 1201 KKFIPVVSGGILAVIDQH 1034

BLAST of Cp4.1LG08g12770 vs. ExPASy TrEMBL
Match: A0A6J1I371 (DNA mismatch repair protein MLH3 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111470127 PE=3 SV=1)

HSP 1 Score: 1854 bits (4803), Expect = 0.0
Identity = 974/1218 (79.97%), Positives = 996/1218 (81.77%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            MG+IKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK                  
Sbjct: 1    MGMIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASK------------------ 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                     ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT
Sbjct: 61   -------------------------ISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHDLIDMDTKGKTFGFRGEALASISDVSL+EIITKACGRANGYRKVIK  K      L
Sbjct: 121  SKFHDLIDMDTKGKTFGFRGEALASISDVSLVEIITKACGRANGYRKVIKGCK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +   +  +  TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVR +LVHSKVSFKI
Sbjct: 181  YLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRTSLVHSKVSFKI 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            VDSESESILLYTNPSPSPLSLLRSGFGSE+SRSL ELKIGDG LKLSGYICSPFDTFTIK
Sbjct: 241  VDSESESILLYTNPSPSPLSLLRSGFGSEISRSLRELKIGDGDLKLSGYICSPFDTFTIK 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
               +  I         NRRFICKGQIHKLLNQLASRFVSLSPQTDQ CHSRKRSRFQANP
Sbjct: 301  AVQYVYI---------NRRFICKGQIHKLLNQLASRFVSLSPQTDQVCHSRKRSRFQANP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
            AYILNLDCPGSFYDLTFESSKTFVQFKDW SILTFIEETIQQFWKEKYSSGKSLVHTT I
Sbjct: 361  AYILNLDCPGSFYDLTFESSKTFVQFKDWASILTFIEETIQQFWKEKYSSGKSLVHTTPI 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
            VGGDQLWKDEDN+ISTNSDFREDVI FPPESIRS+KKSRMRSP+ASLIDLFSPSAMLTKD
Sbjct: 421  VGGDQLWKDEDNIISTNSDFREDVILFPPESIRSVKKSRMRSPQASLIDLFSPSAMLTKD 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
            DDILSNSLHEKKACENSHTSSSELNDVH+QARMQFGNQAADHFSGLWGTPLAKCSTTAVQ
Sbjct: 481  DDILSNSLHEKKACENSHTSSSELNDVHQQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
             GDRHPWVPDNIFVSEDSFLDRRLA PKRCDDIVEDNIFSSDLKGQSS+VYIDMINGSAE
Sbjct: 541  NGDRHPWVPDNIFVSEDSFLDRRLAFPKRCDDIVEDNIFSSDLKGQSSEVYIDMINGSAE 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRIQKQGI 686
            STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKL+IQNDVIKRIQKQGI
Sbjct: 601  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLYIQNDVIKRIQKQGI 660

Query: 687  PDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNSPDVHV 746
            PDDEVDVLKLDGYIQGS FYAGDS HAEF EEN+YSCHLDKHVQKFFSSYQTRNSPDVHV
Sbjct: 661  PDDEVDVLKLDGYIQGSGFYAGDSLHAEFAEENIYSCHLDKHVQKFFSSYQTRNSPDVHV 720

Query: 747  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSSKKNYI 806
            TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVD EDKGCGF              
Sbjct: 721  TPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDCEDKGCGF-------------- 780

Query: 807  PSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRSSKK 866
                                                                        
Sbjct: 781  ------------------------------------------------------------ 840

Query: 867  NYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIMLRS 926
                                                                DSDIMLRS
Sbjct: 841  ----------------------------------------------------DSDIMLRS 900

Query: 927  SKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFNWRLPG 986
            SKKNYIPSCIDS+LIIDDVLD REDLSTSLEKSNNF+HSSPVSPNMHSCQKYL NWRLPG
Sbjct: 901  SKKNYIPSCIDSKLIIDDVLDIREDLSTSLEKSNNFEHSSPVSPNMHSCQKYLSNWRLPG 960

Query: 987  RDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1046
            RDWEKAYGSSELKFGH+AFKQ+YVSVER RRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG
Sbjct: 961  RDWEKAYGSSELKFGHKAFKQKYVSVERRRRCKSAPPSYKRKTSFYCLYQRKEEKHNAAG 1020

Query: 1047 FYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTNNLYIK 1106
            FYGLDQRKTDKFNATNFYCMDQGK+EKLRASAFLDSPPHLELG+LRDSKHFS TNNLYI 
Sbjct: 1021 FYGLDQRKTDKFNATNFYCMDQGKDEKLRASAFLDSPPHLELGQLRDSKHFSGTNNLYIN 1035

Query: 1107 PSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCSKETQESDL 1166
            PSPLDDLSMGTR DMTK P ITGNNKEKQEGK+SKQFQSDVKVTASALELCSKETQES L
Sbjct: 1081 PSPLDDLSMGTRTDMTKMPTITGNNKEKQEGKVSKQFQSDVKVTASALELCSKETQESYL 1035

Query: 1167 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1226
            WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD
Sbjct: 1141 WIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLEDAKVLLQLD 1035

Query: 1227 KKFIPVVSGGILAVIDQH 1244
            KKFIPVVSGGILAVIDQH
Sbjct: 1201 KKFIPVVSGGILAVIDQH 1035

BLAST of Cp4.1LG08g12770 vs. ExPASy TrEMBL
Match: A0A5A7UH95 (DNA mismatch repair protein MLH3 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold344G00170 PE=3 SV=1)

HSP 1 Score: 1745 bits (4520), Expect = 0.0
Identity = 962/1440 (66.81%), Positives = 1049/1440 (72.85%), Query Frame = 0

Query: 23   VNREMGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYA 82
            V+  MG IKPLPKSVR+SVRAGVILYD TKVVEELVYNSLDAGASK              
Sbjct: 44   VDAAMGTIKPLPKSVRNSVRAGVILYDVTKVVEELVYNSLDAGASK-------------- 103

Query: 83   EACLSSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGE 142
                                         ISIFIGIGTSYVKVVD+GSGITRDGL LLGE
Sbjct: 104  -----------------------------ISIFIGIGTSYVKVVDDGSGITRDGLVLLGE 163

Query: 143  RYATSKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVAS 202
            RY TSKFHDLID D KG TFGFRGEALASISD+SL+EIIT+ACGRANGYRKV+K      
Sbjct: 164  RYVTSKFHDLIDTDLKGGTFGFRGEALASISDLSLVEIITRACGRANGYRKVLKG----- 223

Query: 203  ACTLELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKV 262
             C    + I  +     TVIVRDLFYNQPVRRKHMQ SPKKVL AVKKCVVR ALVHSKV
Sbjct: 224  -CKCLYLGIDDMEDFGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKV 283

Query: 263  SFKIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDT 322
            SFKIVDSESESILL T+PSPSPLSLLRSGFGSEVSRSLHELKIG G LKLSGYICSPFD 
Sbjct: 284  SFKIVDSESESILLCTDPSPSPLSLLRSGFGSEVSRSLHELKIGGGDLKLSGYICSPFDN 343

Query: 323  FTIKDPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRF 382
            F+IKD VFYEIV TSA TDINRRFICKGQIHKLLNQLA RF+SL PQTD   H RKRSR 
Sbjct: 344  FSIKDSVFYEIVWTSAGTDINRRFICKGQIHKLLNQLAGRFMSLDPQTDLVFHRRKRSRS 403

Query: 383  QANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVH 442
            +ANPAY+LNL+CP SFYDLTFESSKTFVQFKDWT ILTFIEE IQQFWKEKY+ GKS+VH
Sbjct: 404  EANPAYVLNLECPVSFYDLTFESSKTFVQFKDWTPILTFIEEAIQQFWKEKYNCGKSVVH 463

Query: 443  TTSIVGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAM 502
            +  IVG D+LWKDEDN IST S+            I S+KK+RM+S +ASLID+FSPS M
Sbjct: 464  SAPIVG-DELWKDEDNTISTKSN-----------DILSVKKNRMQSCQASLIDMFSPSVM 523

Query: 503  LTKDDDILSNSLHEKKACENSHTSSSELNDV-HRQARMQFGNQAADHFSGLWGTPLAKCS 562
             TK DDILS    +KKA E+SHTSS E +D  H  A+MQF +QA  HF   W TPLAKCS
Sbjct: 524  FTKHDDILSYRFCDKKARESSHTSSIEFDDGDHHLAKMQFSHQAG-HFPKSWDTPLAKCS 583

Query: 563  TTAVQKGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMI 622
            TTAV+  D +  VP+  FVSE SFLDRRL SPK CDDIVE+NIF SD KGQSSK++ID I
Sbjct: 584  TTAVRNNDHYQLVPEFPFVSEGSFLDRRLNSPKGCDDIVEENIFCSDFKGQSSKMHIDTI 643

Query: 623  NGSAESTPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRI 682
             GSAESTPSSYFHEFSYDD IF GNKPSL GC+S SSF            +IQNDVI R 
Sbjct: 644  TGSAESTPSSYFHEFSYDDCIFMGNKPSLTGCSSMSSFHP----------YIQNDVIDRT 703

Query: 683  QKQGIPDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNS 742
            Q QG+ DDEVD++KLD YI+GSDF AG S HAE                 F SSYQTRNS
Sbjct: 704  QMQGMLDDEVDIMKLDAYIKGSDFCAGSSLHAEM----------------FLSSYQTRNS 763

Query: 743  PDVHVTPNPRLASEWDVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGCGFDSDIMLRSS 802
            P+ H+T    LA+EWDVDCFSVRD VER+WRSRDRTPF+ LVD ++KGC FD DIML SS
Sbjct: 764  PNAHMTSKSILATEWDVDCFSVRDEVERSWRSRDRTPFKQLVDDDEKGCRFDYDIMLSSS 823

Query: 803  KKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSDIML 862
            KKN                                                         
Sbjct: 824  KKN--------------------------------------------------------- 883

Query: 863  RSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKGCGFDSD 922
                                                                        
Sbjct: 884  ------------------------------------------------------------ 943

Query: 923  IMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQKYLFN 982
                    NY  S  DS  I+DDV DTRE+L   L+KSNNF+HSSP SP+MHS QKY  N
Sbjct: 944  --------NYKSSYTDSATIVDDVFDTRENLGNFLKKSNNFEHSSPRSPDMHSRQKYFSN 1003

Query: 983  WRLPGRDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCLYQRKEEK 1042
            WRLP RD EKAYGSSE KFGHQAFKQ+Y SVERPRR KSAPP YKRKTSFYCL Q+K E+
Sbjct: 1004 WRLPERDCEKAYGSSEPKFGHQAFKQKYCSVERPRRGKSAPPFYKRKTSFYCLDQQKAER 1063

Query: 1043 HNAAGFYGLDQRKTDKFNATNFYCMDQGKEEKLRASAFLDSPPHLELGELRDSKHFSSTN 1102
             NAA FY L++ K D+ +A++FYCMDQGK EKL+AS FLDSPPHLE  ELRDS+H S T+
Sbjct: 1064 PNAASFYCLNEGKADQSSASSFYCMDQGKVEKLKASVFLDSPPHLEPVELRDSEHVSGTS 1123

Query: 1103 NLYIKPSPLDDLSMGTR---IDMTKTPAITGNNKEKQEGKISKQFQSDVKVTASALELCS 1162
            N Y+KP P+DDL + TR    D  K  AI GN++EKQ G+ISKQ QSDVKVT SA+ELCS
Sbjct: 1124 NQYVKPFPVDDLLVETRSSRTDTIKMSAIMGNSEEKQ-GEISKQSQSDVKVTESAIELCS 1183

Query: 1163 KETQES-DLWIKWKNCCPTTRNDGPRAFEDEVSILDISSGFLSLARNSLVPKSIDKNFLE 1222
            KETQES DLWIKWKNCCPTTRN+   AF+DEVSILDISSGFLSLA NSLVP  IDKNFL+
Sbjct: 1184 KETQESSDLWIKWKNCCPTTRNEDSHAFDDEVSILDISSGFLSLASNSLVPDLIDKNFLQ 1243

Query: 1223 DAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIAYLEDEHELV 1282
            +AKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKT AYL+ EHEL 
Sbjct: 1244 NAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTTAYLDAEHELA 1269

Query: 1283 LPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNLNILYKQETVITLMAVPCILGVNL 1342
            LPEIGYQLLYNY+DQVKEWGWICNIHAQDSKSF+ NLNIL+KQETVITLMAVPCILGVNL
Sbjct: 1304 LPEIGYQLLYNYADQVKEWGWICNIHAQDSKSFRSNLNILHKQETVITLMAVPCILGVNL 1269

Query: 1343 SDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTS 1402
            SD DLLEFL QLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTS
Sbjct: 1364 SDVDLLEFLHQLADTDGSSTMPPSVLRVLNSKACRGAIMFGDSLLPSECSLIVEELKQTS 1269

Query: 1403 LCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSNGTWHGLRRHELSIERMLQQVGSA 1457
            LCFQCAHGRPTTVPLVNLEALHKQI+E+EI  K+GSNGTW+GL RHELSIERMLQ++ SA
Sbjct: 1424 LCFQCAHGRPTTVPLVNLEALHKQIKELEIHGKSGSNGTWNGLGRHELSIERMLQRLSSA 1269

BLAST of Cp4.1LG08g12770 vs. TAIR 10
Match: AT4G35520.1 (MUTL protein homolog 3)

HSP 1 Score: 600.1 bits (1546), Expect = 4.7e-171
Identity = 492/1463 (33.63%), Positives = 684/1463 (46.75%), Query Frame = 0

Query: 27   MGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYAEACL 86
            M  IKPLP+ VR S+R+G+I++D  +VVEELV+NSLDAGA+KV                 
Sbjct: 1    MKTIKPLPEGVRHSMRSGIIMFDMARVVEELVFNSLDAGATKV----------------- 60

Query: 87   SSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYAT 146
                                      SIF+G+ +  VKVVD+GSG++RD L LLGERYAT
Sbjct: 61   --------------------------SIFVGVVSCSVKVVDDGSGVSRDDLVLLGERYAT 120

Query: 147  SKFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTL 206
            SKFHD  +++T  +TFGFRGEALASISD+SLLE+ TKA GR NGYRKV+K  K      L
Sbjct: 121  SKFHDFTNVETASETFGFRGEALASISDISLLEVRTKAIGRPNGYRKVMKGSK-----CL 180

Query: 207  ELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKVSFKI 266
             L +         TV VRDLFY+QPVRRK+MQ SPKKVL+++KKCV RIALVHS VSF +
Sbjct: 181  HLGIDDDRKDSGTTVTVRDLFYSQPVRRKYMQSSPKKVLESIKKCVFRIALVHSNVSFSV 240

Query: 267  VDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKIGDGGLKLSGYICSPFDTFTIK 326
            +D ES+  L  TNPS S  SLL    G+E   SL ++ + DG L +SG+ C+  D +   
Sbjct: 241  LDIESDEELFQTNPSSSAFSLLMRDAGTEAVNSLCKVNVTDGMLNVSGFECA--DDWKPT 300

Query: 327  DPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHSRKRSRFQANP 386
            D                      GQ                        + +R+R Q+NP
Sbjct: 301  D----------------------GQ-----------------------QTGRRNRLQSNP 360

Query: 387  AYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSSGKSLVHTTSI 446
             YIL + CP   Y+ +FE SKT V+FK W  +L FIE      WK      K  +     
Sbjct: 361  GYILCIACPRRLYEFSFEPSKTHVEFKKWGPVLAFIERITLANWK------KDRILELFD 420

Query: 447  VGGDQLWKDEDNMISTNSDFREDVIRFPPESIRSLKKSRMRSPRASLIDLFSPSAMLTKD 506
             G D L K +        D  +D IR    S+ S+            +D   P AM    
Sbjct: 421  GGADILAKGD------RQDLIDDKIRLQNGSLFSI---------LHFLDADWPEAM---- 480

Query: 507  DDILSNSLHEKKACENSHTSSSELNDVHRQARMQFGNQAADHFSGLWGTPLAKCSTTAVQ 566
                     +K    N H   S L  +   A  +   Q  D+FS        +C      
Sbjct: 481  -----EPAKKKLKRSNDHAPCSSL--LFPSADFK---QDGDYFSPRKDVWSPECEVELKI 540

Query: 567  KGDRHPWVPDNIFVSEDSFLDRRLASPKRCDDIVEDNIFSSDLKGQSSKVYIDMINGSAE 626
            +  +            DS L  R    +  +D  +            SK     +     
Sbjct: 541  QNPKEQGTVAGFESRTDSLLQSRDIEMQTNEDFPQVTDLLETSLVADSKCRKQFLTRCQI 600

Query: 627  STPSSYFHEFSYDDNIFTGNKPSLRGCTSGSSFQLESTSILGDKLFIQNDVIKRI----- 686
            +TP +  H+F  D ++               +FQ +    L D+L + N + K +     
Sbjct: 601  TTPVNINHDFMKDSDVL--------------NFQFQG---LKDELDVSNCIGKHLLRGCS 660

Query: 687  QKQGIPDDEVDVLKLDGYIQGSDFYAGDSFHAEFTEENMYSCHLDKHVQKFFSSYQTRNS 746
             +  +   E  +  ++GY         +S       E   S        +   + +  + 
Sbjct: 661  SRVSLTFHEPKLSHVEGY---------ESVVPMIPNEKQSS-------PRVLETREGGSY 720

Query: 747  PDVH--VTPNPRLASEW-DVDCFSVRDGVERNWRSRDRTPFRDLVDGEDKGC-GFDSDIM 806
             DV+   TP+  L S W D D F+ +                      D+GC G   D  
Sbjct: 721  CDVYSDKTPDCSLGSSWQDTDWFTPQ-------------------CSSDRGCVGIGEDF- 780

Query: 807  LRSSKKNYIPSCIDSELIIDDVLDTREDLST---SLEKSNNFDHSSPVSPNMHSCQKGCG 866
                  N  P         D+ + +++ LS+       + +F  SS  SP M+S      
Sbjct: 781  ------NITPIDTAEFDSYDEKVGSKKYLSSVNVGSSVTGSFCLSSEWSP-MYSTPSATK 840

Query: 867  FDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHSCQK 926
            ++S+         Y   C   E  +   L    D       +NN      V P M  C+ 
Sbjct: 841  WESE---------YQKGCRILEQSLR--LGRMPDPEFCFSAANNIKFDHEVIPEMDCCET 900

Query: 927  GCGFDSDIMLRSSKKNYIPSCIDSELIIDDVLDTREDLSTSLEKSNNFDHSSPVSPNMHS 986
            G                     DS   I +     + +  S     ++ H+  V  + +S
Sbjct: 901  G--------------------TDSFTAIQNCTQLADKICKS-----SWGHADDVRIDQYS 960

Query: 987  CQKYLFNWRLPGRDWEKAYGSSELKFGHQAFKQRYVSVERPRRCKSAPPSYKRKTSFYCL 1046
             +K                     KF +    Q     +R +R +SAPP Y+ K  F  L
Sbjct: 961  IRKE--------------------KFSYMDGTQNNAGKQRSKRSRSAPPFYREKKRFISL 1020

Query: 1047 YQRKEEKHNAAGFYGLDQRKTDKFNATNFYCMDQ---GKEEKLRASAFLD-SPPHLELGE 1106
              + + K           + +D     +  C+ Q     +  L+ S   D S  H++  E
Sbjct: 1021 SCKSDTK----------PKNSDPSEPDDLECLTQPCNASQMHLKCSILDDVSYDHIQETE 1080

Query: 1107 LRDSKHFSSTNNLYIKPSPLDDLSMGTRIDMTKTPAITGNNKEKQEGKISKQFQSDVKVT 1166
                K  SS ++L          S G R   ++T      +++  E   S++F   +K T
Sbjct: 1081 ----KRLSSASDL--------KASAGCRTVHSET-----QDEDVHEDFSSEEFLDPIKST 1140

Query: 1167 ASALELCSKETQESDLWIKWK-NCCPTTRNDGPRAFEDEVSILDISSGFLSL-ARNSLVP 1226
                              KW+ NC  +           +  + DISSG L L +  SLVP
Sbjct: 1141 T-----------------KWRHNCAVSQVPKESHELHGQDGVFDISSGLLHLRSDESLVP 1167

Query: 1227 KSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLSGEAKTIA 1286
            +SI+++ LEDAKVL Q+DKK+IP+V+ G +A++DQHAADERIRLE+LR K+L+G+A+T+ 
Sbjct: 1201 ESINRHSLEDAKVLQQVDKKYIPIVACGTVAIVDQHAADERIRLEELRTKVLAGKARTVT 1167

Query: 1287 YLEDEHEL--------------VLPEIGYQLLYNYSDQVKEWGWICNIHAQDSKSFQRNL 1346
            YL  + EL              VLPE+GYQLL +YS+Q+++WGWICNI  + S SF++N+
Sbjct: 1261 YLSADQELFINDALLIFVLTLKVLPEMGYQLLQSYSEQIRDWGWICNITVEGSTSFKKNM 1167

Query: 1347 NILYKQETVITLMAVPCILGVNLSDADLLEFLDQLADTDGSSTMPPSVLRVLNSKACRGA 1406
            +I+ ++ T ITL AVPCILGVNLSD DLLEFL QLADTDGSST+PPSVLRVLNSKACRGA
Sbjct: 1321 SIIQRKPTPITLNAVPCILGVNLSDVDLLEFLQQLADTDGSSTIPPSVLRVLNSKACRGA 1167

Query: 1407 IMFGDSLLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLEALHKQIREMEILDKNGSN 1458
            IMFGDSLLPSECSLI++ LKQTSLCFQCAHGRPTTVPLV+L+ALHKQI ++         
Sbjct: 1381 IMFGDSLLPSECSLIIDGLKQTSLCFQCAHGRPTTVPLVDLKALHKQIAKL------SGR 1167

BLAST of Cp4.1LG08g12770 vs. TAIR 10
Match: AT4G02460.1 (DNA mismatch repair protein, putative )

HSP 1 Score: 82.8 bits (203), Expect = 2.5e-15
Identity = 102/420 (24.29%), Positives = 169/420 (40.24%), Query Frame = 0

Query: 29  IIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVE-SLIDHLIILFYAEACLS 88
           +I+P+ ++V   + +G ++ D +  V+ELV NSLDAGA+ +E +L D+            
Sbjct: 16  LIRPINRNVIHRICSGQVILDLSSAVKELVENSLDAGATSIEINLRDY------------ 75

Query: 89  SSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGERYATS 148
                                          G  Y +V+DNG GI+     +L  ++ TS
Sbjct: 76  -------------------------------GEDYFQVIDNGCGISPTNFKVLALKHHTS 135

Query: 149 KFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIKAQKVASACTLE 208
           K  D  D+     T+GFRGEAL+S+  +  L + T+            K + VA+  T +
Sbjct: 136 KLEDFTDL-LNLTTYGFRGEALSSLCALGNLTVETRT-----------KNEPVATLLTFD 195

Query: 209 ---LMMIWKILVLQV--TVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIALVHSKV 268
              L+   K    Q+  TV VR LF N PVR K  + + +K    +   +   AL+   V
Sbjct: 196 HSGLLTAEKKTARQIGTTVTVRKLFSNLPVRSKEFKRNIRKEYGKLVSLLNAYALIAKGV 255

Query: 269 SF---KIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKI-GDGGLKLSGYICS 328
            F          +S++L T    S    + + FG     SL  + I      ++ G++  
Sbjct: 256 RFVCSNTTGKNPKSVVLNTQGRGSLKDNIITVFGISTFTSLQPVSICVSEDCRVEGFLSK 315

Query: 329 PFDTF--TIKDPVFYEIVLTSARTDINRRFICKGQIHKLLNQLASRFVSLSPQTDQACHS 388
           P       + D  ++          IN R +   ++ KL+N+L                 
Sbjct: 316 PGQGTGRNLADRQYF---------FINGRPVDMPKVSKLVNEL----------------- 354

Query: 389 RKRSRFQANPAYILNLDCPGSFYDLTFESSKTFVQFKDWTSILTFIEETIQQFWKEKYSS 437
            K +  +  P  IL+   PG   DL     K  V F D TS++  + E + + +    +S
Sbjct: 376 YKDTSSRKYPVTILDFIVPGGACDLNVTPDKRKVFFSDETSVIGSLREGLNEIYSSSNAS 354


HSP 2 Score: 70.9 bits (172), Expect = 9.8e-12
Identity = 65/232 (28.02%), Positives = 95/232 (40.95%), Query Frame = 0

Query: 1202 ARNSLVPKSIDKNFLEDAKVLLQLDKKFIPVVSGGILAVIDQHAADERIRLEDLRQKLLS 1261
            A  S + +   K      +VL Q +  FI       L ++DQHAADE+   E L +  + 
Sbjct: 688  AATSELERLFRKEDFRRMQVLGQFNLGFIIAKLERDLFIVDQHAADEKFNFEHLARSTVL 747

Query: 1262 GEAKTIAYLEDEHELVLPEIGYQLLYNYSDQVKEWGWIC--NIHAQDSKSFQRNLNILYK 1321
             +   +  L  E   + PE    +L  + D ++E G++   N  A   K F+        
Sbjct: 748  NQQPLLQPLNLE---LSPEEEVTVLM-HMDIIRENGFLLEENPSAPPGKHFR-------- 807

Query: 1322 QETVITLMAVPCILGVNLSDADLLEFLDQLADTDG-------------SSTMPPSVLRVL 1381
                  L A+P    +     DL + +  L D  G              S  P  V  +L
Sbjct: 808  ------LRAIPYSKNITFGVEDLKDLISTLGDNHGECSVASSYKTSKTDSICPSRVRAML 867

Query: 1382 NSKACRGAIMFGDSLLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLEAL 1419
             S+ACR ++M GD L  +E   IVE L      + C HGRPT   LV+L  L
Sbjct: 868  ASRACRSSVMIGDPLRKNEMQKIVEHLADLESPWNCPHGRPTMRHLVDLTTL 901

BLAST of Cp4.1LG08g12770 vs. TAIR 10
Match: AT4G09140.1 (MUTL-homologue 1 )

HSP 1 Score: 82.4 bits (202), Expect = 3.3e-15
Identity = 82/289 (28.37%), Positives = 132/289 (45.67%), Query Frame = 0

Query: 23  VNREMGIIKPLPKSVRSSVRAGVILYDATKVVEELVYNSLDAGASKVESLIDHLIILFYA 82
           V RE   I+ L +SV + + AG ++      V+ELV NSLDA +S               
Sbjct: 22  VPREPPKIQRLEESVVNRIAAGEVIQRPVSAVKELVENSLDADSS--------------- 81

Query: 83  EACLSSSFITEHVSSTSEKNTPMPIVSIQISIFIGIGTSYVKVVDNGSGITRDGLALLGE 142
                                     SI + +  G G   ++V D+G GI R+ L +L E
Sbjct: 82  --------------------------SISVVVKDG-GLKLIQVSDDGHGIRREDLPILCE 141

Query: 143 RYATS---KFHDLIDMDTKGKTFGFRGEALASISDVSLLEIITKACGRANGYRKVIK--- 202
           R+ TS   KF DL  +     + GFRGEALAS++ V+ + + T   G+ +GYR   +   
Sbjct: 142 RHTTSKLTKFEDLFSL----SSMGFRGEALASMTYVAHVTVTTITKGQIHGYRVSYRDGV 201

Query: 203 AQKVASACTLELMMIWKILVLQVTVIVRDLFYNQPVRRKHMQFSPKKVLQAVKKCVVRIA 262
            +    AC           V    ++V +LFYN   RRK +Q S     + V   + R+A
Sbjct: 202 MEHEPKACA---------AVKGTQIMVENLFYNMIARRKTLQNSADDYGKIV-DLLSRMA 254

Query: 263 LVHSKVSFKIVDSESESILLYTNPSPSPLSLLRSGFGSEVSRSLHELKI 306
           + ++ VSF      +    +++  SPS L  +RS +G  V+++L ++++
Sbjct: 262 IHYNNVSFSCRKHGAVKADVHSVVSPSRLDSIRSVYGVSVAKNLMKVEV 254

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
F4JN26 | 8.3e-173 | 33.95 | DNA mismatch repair protein MLH3 OS=Arabidopsis thaliana OX=3702 GN=MLH3 PE=2 SV...
Q9UHC1 | 8.6e-29 | 26.94 | DNA mismatch repair protein Mlh3 OS=Homo sapiens OX=9606 GN=MLH3 PE=1 SV=3
B8CX97 | 1.4e-23 | 29.79 | DNA mismatch repair protein MutL OS=Halothermothrix orenii (strain H 168 / OCM 5...
Q01QW7 | 6.0e-22 | 28.16 | DNA mismatch repair protein MutL OS=Solibacter usitatus (strain Ellin6076) OX=23...
Q8A120 | 1.5e-20 | 24.69 | DNA mismatch repair protein MutL OS=Bacteroides thetaiotaomicron (strain ATCC 29...

Match Name | E-value | Identity (%) | Description
XP_023539512.1 | 0.0 | 94.55 | DNA mismatch repair protein MLH3 isoform X1 [Cucurbita pepo subsp. pepo]
KAG7027832.1 | 0.0 | 84.28 | DNA mismatch repair protein MLH3, partial [Cucurbita argyrosperma subsp. argyros...
XP_022939754.1 | 0.0 | 84.35 | DNA mismatch repair protein MLH3 isoform X1 [Cucurbita moschata]
KAG6596281.1 | 0.0 | 83.30 | DNA mismatch repair protein MLH3, partial [Cucurbita argyrosperma subsp. sororia...
XP_022971390.1 | 0.0 | 82.74 | DNA mismatch repair protein MLH3 isoform X1 [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
A0A6J1FI48 | 0.0 | 84.35 | DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1I5L5 | 0.0 | 82.74 | DNA mismatch repair protein MLH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A6J1FIL6 | 0.0 | 81.86 | DNA mismatch repair protein MLH3 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1I371 | 0.0 | 79.97 | DNA mismatch repair protein MLH3 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A5A7UH95 | 0.0 | 66.81 | DNA mismatch repair protein MLH3 isoform X2 OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity (%) | Description
AT4G35520.1 | 4.7e-171 | 33.63 | MUTL protein homolog 3
AT4G02460.1 | 2.5e-15 | 24.29 | DNA mismatch repair protein, putative
AT4G09140.1 | 3.3e-15 | 28.37 | MUTL-homologue 1
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR014790 | MutL, C-terminal, dimerisation | SMART | SM00853 | MutL_C_2 | coord: 1221..1381; e-value: 3.2E-17; score: 73.2
IPR014790 | MutL, C-terminal, dimerisation | PFAM | PF08676 | MutL_C | coord: 1220..1381; e-value: 5.1E-10; score: 39.3
IPR013507 | DNA mismatch repair protein, S5 domain 2-like | SMART | SM01340 | DNA_mis_repair_2 | coord: 288..431; e-value: 9.0E-6; score: 26.4
IPR013507 | DNA mismatch repair protein, S5 domain 2-like | PFAM | PF01119 | DNA_mis_repair | coord: 290..429; e-value: 1.4E-4; score: 21.5
IPR036890 | Histidine kinase/HSP90-like ATPase superfamily | GENE3D | 3.30.565.10 | - | coord: 29..280; e-value: 3.2E-46; score: 159.4
IPR036890 | Histidine kinase/HSP90-like ATPase superfamily | SUPERFAMILY | 55874 | ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase | coord: 50..275
IPR042120 | MutL, C-terminal domain, dimerisation subdomain | GENE3D | 3.30.1540.20 | - | coord: 1224..1412; e-value: 3.8E-22; score: 81.0
IPR042121 | MutL, C-terminal domain, regulatory subdomain | GENE3D | 3.30.1370.100 | - | coord: 1264..1376; e-value: 3.8E-22; score: 81.0
None | No IPR available | PFAM | PF13589 | HATPase_c_3 | coord: 50..203; e-value: 5.7E-6; score: 26.4
None | No IPR available | CDD | cd00782 | MutL_Trans | coord: 288..430; e-value: 6.2813E-19; score: 82.2041
IPR014721 | Ribosomal protein S5 domain 2-type fold, subgroup | GENE3D | 3.30.230.10 | - | coord: 294..435; e-value: 1.6E-15; score: 58.9
IPR038973 | DNA mismatch repair protein MutL/Mlh/Pms | PANTHER | PTHR10073 | DNA MISMATCH REPAIR PROTEIN MLH, PMS, MUTL | coord: 27..1417
IPR028830 | DNA mismatch repair protein Mlh3 | PANTHER | PTHR10073:SF47 | DNA MISMATCH REPAIR PROTEIN MLH3-RELATED | coord: 27..1417
IPR014762 | DNA mismatch repair, conserved site | PROSITE | PS00058 | DNA_MISMATCH_REPAIR_1 | coord: 163..169
IPR037198 | MutL, C-terminal domain superfamily | SUPERFAMILY | 118116 | DNA mismatch repair protein MutL | coord: 1219..1412
IPR020568 | Ribosomal protein S5 domain 2-type fold | SUPERFAMILY | 54211 | Ribosomal protein S5 domain 2-like | coord: 272..431
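The `coord:` fields in the InterPro table give each domain hit as a start..end range on the protein. A minimal Python sketch for comparing hits, assuming the ranges are 1-based and inclusive (an assumption, since the page does not state the convention) and using illustrative helper names `parse_coord`, `span_length` and `overlap`:

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' field."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no coord field in %r" % text)
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1

def overlap(a, b):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

# SMART and PFAM hits for the MutL C-terminal dimerisation domain (IPR014790).
smart_mutl_c = parse_coord("coord: 1221..1381")
pfam_mutl_c = parse_coord("coord: 1220..1381")
print(span_length(*smart_mutl_c))          # 161
print(overlap(smart_mutl_c, pfam_mutl_c))  # 161
```

Under that assumption the SMART hit lies entirely within the PFAM hit, which is the expected pattern when two signature databases describe the same domain.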

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG08g12770.1 | Cp4.1LG08g12770.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006298 mismatch repair
cellular_component GO:0032300 mismatch repair complex
molecular_function GO:0005524 ATP binding
molecular_function GO:0016887 ATP hydrolysis activity
molecular_function GO:0030983 mismatched DNA binding
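The GO rows above are whitespace-separated, with the category first, the accession second, and the term name taking the rest of the line. A minimal Python sketch, assuming that layout (the function name `group_go_terms` is illustrative), groups the terms by category:

```python
from collections import defaultdict

GO_LINES = """\
Category Term Accession Term Name
biological_process GO:0006298 mismatch repair
cellular_component GO:0032300 mismatch repair complex
molecular_function GO:0005524 ATP binding
molecular_function GO:0016887 ATP hydrolysis activity
molecular_function GO:0030983 mismatched DNA binding
"""

def group_go_terms(lines):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in lines:
        parts = line.strip().split(None, 2)
        # Skip blank lines, short lines, and the header row (no "GO:" accession).
        if len(parts) < 3 or not parts[1].startswith("GO:"):
            continue
        category, accession, name = parts
        grouped[category].append((accession, name))
    return dict(grouped)

terms = group_go_terms(GO_LINES.splitlines())
for category, entries in terms.items():
    print(category, [accession for accession, _ in entries])
```

Splitting on at most two whitespace runs keeps multi-word term names such as "ATP hydrolysis activity" intact.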