Cla001012 (gene) Watermelon (97103) v1

Name: Cla001012
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: DNA mismatch repair protein mutL (AHRD V1 **** E9D2C4_COCPS); contains InterPro domain(s) IPR011186 DNA mismatch repair protein Mlh1
Location: Chr8 : 11652980 .. 11656861 (+)
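The location field can be turned into a chromosome, a coordinate pair, and a strand, from which the genomic span follows. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split a string such as 'Chr8 : 11652980 .. 11656861 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr8 : 11652980 .. 11656861 (+)")
gene_span = span_length(start, end)
print(chrom, strand, gene_span)
```

Under that assumption the feature spans 3882 bases on the forward strand of Chr8.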
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAACCCATGCCAATGAAGACATTATTCCTATGGACGCATGTGGGGATGAAGAACTAGTTCCTTGTAAAGAACCCCCCAAAATCCTCCGACTCGACGAGTCCGTCGTCAATCGAATCGCTGCCGGAGAGGTTATTCAAAGGCCTGTGTCCGCCATTAAAGAGCTCGTCGAGAACAGCCTCGACGCCCAGTCCACCTCCGTCAACGTCGTTGTCAAAGACGGCGGCCTCAAACTCATCCAAGTTTCCGACGACGGCCATGGCATCCGTGTCTGTTTCCTTTTTAAGTCTATTTTTCTACCTACTACTTATCTATCGGGAAATATGGGAAAACCAAAACGAATTTAACCGGAATCTAGATAGTCATTTTTCATAATCTACATTCACAATCGTCGGAATATTGTGTGGGTTATAGCTTTTTCTATCAAATTTCTAGGAGCGAATCCGTGTTAATGTTAAGATTCTAATGGAGAAGGAAATTAAGGAAACCTTGTGGAAGACTGAAAAGTGGCTATAATTCGAGGAAACTCCGCAATTTTATTCTACTAACTTTCTGATTTTCAATTTGTTCTTCTTCCCTAGTATGAAGATTTGCCCATTTTGTGCGAGAGACACACGACGTCCAAGTTGTCAACATTTGAGGATTTACAGTCCATAAAGTCGATGGGATTTCGAGGAGAGGCTCTGGCGAGCATGACCTATGTAGGTCATGTTACGGTCACTACCATTACTAAAGGACAACTACACGGTTATAGGTAGAATTACTACAAGCTATGGCTCTGCCTCTCATGTTCTTCATATTGGGTTGCTTGGGGATTTGTATACGTAGGGAAAATCATAATATTGAGGTTAAAATACTATTTTGGTCGCCATACAATGGGTTCTTGTTTTGTTTTGGTGTTTGTACTTTCAAATGCCCAAATGTTTTAAACTAATGGTTATCTAGATTGTATGTACTATGTTGTCATTTTCACCTTGTTTGAGAGAATGAATTTCTTTTAGTATCTTACTGAGTTTCCTGACATAATGGGACAGAGTATCCTATAGAGATGGTGTGATGGAGCATGAGCCTAAGCCATGTGCTGCTGTAAAAGGAACTCAGATAACGGTAATTGTACTAGTCTAGGCTGCTCCTATTGCAAGGCAACAATTTCTGGGGCTAACAGTGTTTTATCAACTAAATGAATTATGCTCGCATGACTCTTGTACATGTATGATTCATGATTAATTACAGTCAGGAGGTTAAGATTTTTCTTAAATTTGTTCATAGACAATTTTGTGTAAATACTTCACAATTTCAAATTTCATCATCATGCACTGTAATTGGCAATTTAGTTTTCTTGGATGTCTGGTTCCAGGTTGAGAATCTTTTCTATAACATGACTGCTCGGAGGAAGACACTACAAAATGCATCTGATGATTACACGAAGATTGTGGACCTTCTAAGTCGTTTTGCCATTCATCATATAAACATCAGCTTTTCTTGCAGAAAGGTTATTTCCATAATTAGTCAAACTTGAGTGTTTTTATTACTTCGGTATTGCTTTCATAGTCAGATGTTTTGATGGTTTAAAATGCGCTATCTGAGTTGAGATATACTGGTTACTATATTTATTAGCATGGAGCTGCCAGAGCAGATGTTCACTCAGTTGGGCCAACTTCAAGGTTAGATGCCATTCGTACAGTTTATGGTGCATCAGTCGCCCGCAATCTAATAAAAATAGAAGTTTCAGAAAATGGCGAAGCCTGTTCAGGTTTCAAGATGGATGGTCTAATCTCCAACTCAAATTATGTTGCGAAGAAGATCACGATGGTGCTCTTTATCAATGGTACTGAGATTGTTGTTCTTTTTCAAACTAATAATAAAAAATCTCCCCTTCTATAAGCTCTTAATAAGCAACCTTGGAAATTACTCAATTTCTCTTCCAAATTATGATATGATTATTTTCATAAACACCTATATCACAACCCCTTGATCACTTACACTACCCCAAACATCT
GACTACACACCCACTCTTGCCCTTAATGAGCCTTATATCTAAGTCCATACTTAGAGGAAGTGTGAAGAATGAACTGAAAAGATTATATTTACAATAACCTATAAGAGCTTATGTGTCTAATAGTAATTTATTGACATTTTTTTTTTTGTTTCATTAATTTTATTTGAAGAGCATGGATCATAGTTTAGCATAGGATCCTTGTTCTTTTCATTTTTTCAAAAGTTTTTATTTTTTTTATTTTTATTTTTTATTTTTAATTAGATTTTCTTATTGCAGGAAGAATGGTGGAATGCAGTGCTTTAAAAAGAGCTATTGAAATTGTTTATGCTGCAACCTTACCCAAAGCATCCAAACCTTTCATATATATGTCAATTATATTGCCACCTGAGCATGTAGACGTGAATGTTCATCCGACCAAAAAAGAGGTGAATTTTTTGTTAAGGATATCCTTGCATCTTTTCTTGTTGGTGGGTACTTCTGTGTGCATGATATATGGTACCCACAGTTTCTGCACTATAAAATGATGAGTTTCTGCACTCTAAAATGATATATAATATGTTTATACCAAAAGAAAATAAAGTTAATAAGCCTACGTATTTATACTTAAACAAGAAGTTCAATGTTTTTCAAATTTTGATATTTTTGCCCTATACTTAAGTATATGTCCATGTTCTAACGAGTAAGAAAAAATGAATAAAAGTCTGAACATCTTAGTTTATGTGTCTAATCATTGTGGTCATGAGTGTCTGACACATGTTCAATTTGTTGTGTTTAACTAGTGCTAGACACGTGCCTAAATATTGGATTTGTATTCAATTCGTTCGTCTAACCCATGTCTATTGTACTAGCAAGTGCTTTATAGTGTCTAAGAAGTGTCAAATATGTGGATGCTATGAAAAGGAGTTTCTCCCAAAGAGTGTCAGATTTTAATTTCTCTACAGAAGGCTTCTATTTTCTTCCACGCCATAATAGCCAAAAAGGCTCTAATAAATTGAGGCATCAAGTTTTGACTTCATTCTTAAATGCGTGCCTATCCTAAATGGGATTGACGATGCCATGTTGTTTGACTGTGAGGAGTTGGCACTTTCAAGGTGCTGGAAATAGTTTCCAGATTCCAGGCATATGAGCAATTGTTGAAAAGTTGACATTGGGATTCTGAGCTTTCCTTACAAATTAGATTCAGTAAATTACAAGCTATGGTTGCTATCAAAATTCTTATACAGTTTCCATTTGTATACTTCAAATTTCTTTAAAAGTTATCTTTTAACTGTAAAATCCACATATCTTGTTCATTGTTTGAAGTGTTATATTGTGCTGCAGAATAAAGCTCTATCAATATGCAGGTAAGCCTCTTGAACCAGGAAGTTATTATTGAGAGGATACAGTCAGCGGTTGAATCAAAATTGAGAAGTTCTAACGACACGAGGGCATTTCAAGAACAGGTACTTTTAAAAGCTCTTTGTCTATCTGAAACTTGGATTATAAGTTAATTTATTGAATCTCAGTTATTGACTCATATTCCACCATGAACTCATATTATAGGATGTAGAATCTTCTGAGGCTTGTCAAATGGTTCTTAGCAAGGACGATACTCAAAATTGCTCGCAGTCTGGTACAACAGATTTTGTCTTTGTTGGTCTTACACTGTAGTTTCTAAGGTTATTCTTTGGGCTTTTATCTGTTGCTTCAGTGTCATATTGTTATAATGCATATAGGGTCAAAATCGCAAAAGGTTCCAGTGCATAAAATGGTTAGGACAGATTCAACAGATCCAGCTGGAAGGTTGCACGCATATGTGCAAATGAAGCCTCCTGGCCTCCCCGAATCTAGCTTGACTACTGTGAGGTATGTATCCACTATTCAGTTTTCCTCTTGGATTTAA

mRNA sequence

ATGGAAACCCATGCCAATGAAGACATTATTCCTATGGACGCATGTGGGGATGAAGAACTAGTTCCTTGTAAAGAACCCCCCAAAATCCTCCGACTCGACGAGTCCGTCGTCAATCGAATCGCTGCCGGAGAGGTTATTCAAAGGCCTGTGTCCGCCATTAAAGAGCTCGTCGAGAACAGCCTCGACGCCCAGTCCACCTCCGTCAACGTCGTTGTCAAAGACGGCGGCCTCAAACTCATCCAAGTTTCCGACGACGGCCATGGCATCCGTTATGAAGATTTGCCCATTTTGTGCGAGAGACACACGACGTCCAAGTTGTCAACATTTGAGGATTTACAGTCCATAAAGTCGATGGGATTTCGAGGAGAGGCTCTGGCGAGCATGACCTATGTAGGTCATGTTACGGTCACTACCATTACTAAAGGACAACTACACGGTTATAGAGTATCCTATAGAGATGGTGTGATGGAGCATGAGCCTAAGCCATGTGCTGCTGTAAAAGGAACTCAGATAACGGTTGAGAATCTTTTCTATAACATGACTGCTCGGAGGAAGACACTACAAAATGCATCTGATGATTACACGAAGATTGTGGACCTTCTAAGTCGTTTTGCCATTCATCATATAAACATCAGCTTTTCTTGCAGAAAGCATGGAGCTGCCAGAGCAGATGTTCACTCAGTTGGGCCAACTTCAAGGTTAGATGCCATTCGTACAGTTTATGGTGCATCAGTCGCCCGCAATCTAATAAAAATAGAAGTTTCAGAAAATGGCGAAGCCTGTTCAGGTTTCAAGATGGATGGTCTAATCTCCAACTCAAATTATGTTGCGAAGAAGATCACGATGGTGCTCTTTATCAATGGAAGAATGGTGGAATGCAGTGCTTTAAAAAGAGCTATTGAAATTGTTTATGCTGCAACCTTACCCAAAGCATCCAAACCTTTCATATATATGTCAATTATATTGCCACCTGAGCATGTAGACGTGAATGTTCATCCGACCAAAAAAGAGGTAAGCCTCTTGAACCAGGAAGTTATTATTGAGAGGATACAGTCAGCGGTTGAATCAAAATTGAGAAGTTCTAACGACACGAGGGCATTTCAAGAACAGGATGTAGAATCTTCTGAGGCTTGTCAAATGGTTCTTAGCAAGGACGATACTCAAAATTGCTCGCAGTCTGGGTCAAAATCGCAAAAGGTTCCAGTGCATAAAATGGTTAGGACAGATTCAACAGATCCAGCTGGAAGGTTGCACGCATATGTGCAAATGAAGCCTCCTGGCCTCCCCGAATCTAGCTTGACTACTGTGAGGTATGTATCCACTATTCAGTTTTCCTCTTGGATTTAA

Coding sequence (CDS)

ATGGAAACCCATGCCAATGAAGACATTATTCCTATGGACGCATGTGGGGATGAAGAACTAGTTCCTTGTAAAGAACCCCCCAAAATCCTCCGACTCGACGAGTCCGTCGTCAATCGAATCGCTGCCGGAGAGGTTATTCAAAGGCCTGTGTCCGCCATTAAAGAGCTCGTCGAGAACAGCCTCGACGCCCAGTCCACCTCCGTCAACGTCGTTGTCAAAGACGGCGGCCTCAAACTCATCCAAGTTTCCGACGACGGCCATGGCATCCGTTATGAAGATTTGCCCATTTTGTGCGAGAGACACACGACGTCCAAGTTGTCAACATTTGAGGATTTACAGTCCATAAAGTCGATGGGATTTCGAGGAGAGGCTCTGGCGAGCATGACCTATGTAGGTCATGTTACGGTCACTACCATTACTAAAGGACAACTACACGGTTATAGAGTATCCTATAGAGATGGTGTGATGGAGCATGAGCCTAAGCCATGTGCTGCTGTAAAAGGAACTCAGATAACGGTTGAGAATCTTTTCTATAACATGACTGCTCGGAGGAAGACACTACAAAATGCATCTGATGATTACACGAAGATTGTGGACCTTCTAAGTCGTTTTGCCATTCATCATATAAACATCAGCTTTTCTTGCAGAAAGCATGGAGCTGCCAGAGCAGATGTTCACTCAGTTGGGCCAACTTCAAGGTTAGATGCCATTCGTACAGTTTATGGTGCATCAGTCGCCCGCAATCTAATAAAAATAGAAGTTTCAGAAAATGGCGAAGCCTGTTCAGGTTTCAAGATGGATGGTCTAATCTCCAACTCAAATTATGTTGCGAAGAAGATCACGATGGTGCTCTTTATCAATGGAAGAATGGTGGAATGCAGTGCTTTAAAAAGAGCTATTGAAATTGTTTATGCTGCAACCTTACCCAAAGCATCCAAACCTTTCATATATATGTCAATTATATTGCCACCTGAGCATGTAGACGTGAATGTTCATCCGACCAAAAAAGAGGTAAGCCTCTTGAACCAGGAAGTTATTATTGAGAGGATACAGTCAGCGGTTGAATCAAAATTGAGAAGTTCTAACGACACGAGGGCATTTCAAGAACAGGATGTAGAATCTTCTGAGGCTTGTCAAATGGTTCTTAGCAAGGACGATACTCAAAATTGCTCGCAGTCTGGGTCAAAATCGCAAAAGGTTCCAGTGCATAAAATGGTTAGGACAGATTCAACAGATCCAGCTGGAAGGTTGCACGCATATGTGCAAATGAAGCCTCCTGGCCTCCCCGAATCTAGCTTGACTACTGTGAGGTATGTATCCACTATTCAGTTTTCCTCTTGGATTTAA

Protein sequence

METHANEDIIPMDACGDEELVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLSKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTTVRYVSTIQFSSWI
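The protein sequence above should correspond to a translation of the coding sequence. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
cds_start = "ATGGAAACCCATGCCAATGAAGACATTATT"
cds_end = "TCTTGGATTTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the opening residues `METHANEDII` and the closing residues `SWI` of the protein sequence, with the final `TAA` read as the stop codon.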
BLAST of Cla001012 vs. Swiss-Prot
Match: MLH1_ARATH (DNA mismatch repair protein MLH1 OS=Arabidopsis thaliana GN=MLH1 PE=2 SV=1)

HSP 1 Score: 620.2 bits (1598), Expect = 1.8e-176
Identity = 320/420 (76.19%), Positives = 358/420 (85.24%), Query Frame = 1

Query: 20  LVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKL 79
           +VP +EPPKI RL+ESVVNRIAAGEVIQRPVSA+KELVENSLDA S+S++VVVKDGGLKL
Sbjct: 21  IVP-REPPKIQRLEESVVNRIAAGEVIQRPVSAVKELVENSLDADSSSISVVVKDGGLKL 80

Query: 80  IQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTI 139
           IQVSDDGHGIR EDLPILCERHTTSKL+ FEDL S+ SMGFRGEALASMTYV HVTVTTI
Sbjct: 81  IQVSDDGHGIRREDLPILCERHTTSKLTKFEDLFSLSSMGFRGEALASMTYVAHVTVTTI 140

Query: 140 TKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVD 199
           TKGQ+HGYRVSYRDGVMEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDY KIVD
Sbjct: 141 TKGQIHGYRVSYRDGVMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYGKIVD 200

Query: 200 LLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGE 259
           LLSR AIH+ N+SFSCRKHGA +ADVHSV   SRLD+IR+VYG SVA+NL+K+EVS    
Sbjct: 201 LLSRMAIHYNNVSFSCRKHGAVKADVHSVVSPSRLDSIRSVYGVSVAKNLMKVEVSSCDS 260

Query: 260 ACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMS 319
           +   F M+G ISNSNYVAKK  +VLFIN R+VECSALKRAIEIVYAATLPKASKPF+YMS
Sbjct: 261 SGCTFDMEGFISNSNYVAKKTILVLFINDRLVECSALKRAIEIVYAATLPKASKPFVYMS 320

Query: 320 IILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQ 379
           I LP EHVD+N+HPTKKEVSLLNQE+IIE IQS VE KLR++NDTR FQEQ VE  ++  
Sbjct: 321 INLPREHVDINIHPTKKEVSLLNQEIIIEMIQSEVEVKLRNANDTRTFQEQKVEYIQSTL 380

Query: 380 MVLSKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE--SSLTTVR 438
                D   +   SG K+QKVPV+KMVRTDS+DPAGRLHA++Q KP  LP+  SSL+ VR
Sbjct: 381 TSQKSDSPVSQKPSGQKTQKVPVNKMVRTDSSDPAGRLHAFLQPKPQSLPDKVSSLSVVR 439
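
The percentages in an HSP summary line are the matched count divided by the alignment length. The sketch below recomputes them from the fractions in the line; the function name `check_hsp_fractions` is hypothetical, and the regular expression matches only the `n/m (p%)` pattern, so it does not depend on the field labels.

```python
import re

# Matches fragments such as "320/420 (76.19%)".
FRACTION = re.compile(r"(\d+)/(\d+)\s*\((\d+(?:\.\d+)?)%\)")


def check_hsp_fractions(header):
    """Return (matched, total, reported_pct, recomputed_pct) for each fraction."""
    results = []
    for matched, total, shown in FRACTION.findall(header):
        recomputed = round(100 * int(matched) / int(total), 2)
        results.append((int(matched), int(total), float(shown), recomputed))
    return results


header = "Identity = 320/420 (76.19%), Positives = 358/420 (85.24%)"
for matched, total, shown, recomputed in check_hsp_fractions(header):
    # Allow for rounding in the reported value.
    status = "ok" if abs(shown - recomputed) < 0.005 else "mismatch"
    print(f"{matched}/{total}: reported {shown}%, recomputed {recomputed}% ({status})")
```

For the MLH1_ARATH hit, both reported values agree with the recomputed ones to two decimal places.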

BLAST of Cla001012 vs. Swiss-Prot
Match: MLH1_HUMAN (DNA mismatch repair protein Mlh1 OS=Homo sapiens GN=MLH1 PE=1 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 1.5e-109
Identity = 216/407 (53.07%), Positives = 290/407 (71.25%), Query Frame = 1

Query: 29  ILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGHG 88
           I RLDE+VVNRIAAGEVIQRP +AIKE++EN LDA+STS+ V+VK+GGLKLIQ+ D+G G
Sbjct: 8   IRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTSIQVIVKEGGLKLIQIQDNGTG 67

Query: 89  IRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYR 148
           IR EDL I+CER TTSKL +FEDL SI + GFRGEALAS+++V HVT+TT T      YR
Sbjct: 68  IRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYR 127

Query: 149 VSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSRFAIHH 208
            SY DG ++  PKPCA  +GTQITVE+LFYN+  RRK L+N S++Y KI++++ R+++H+
Sbjct: 128 ASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIATRRKALKNPSEEYGKILEVVGRYSVHN 187

Query: 209 INISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSGFKMDG 268
             ISFS +K G   ADV ++   S +D IR+++G +V+R LI+I   +   A   FKM+G
Sbjct: 188 AGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNAVSRELIEIGCEDKTLA---FKMNG 247

Query: 269 LISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILPPEHVD 328
            ISN+NY  KK   +LFIN R+VE ++L++AIE VYAA LPK + PF+Y+S+ + P++VD
Sbjct: 248 YISNANYSVKKCIFLLFINHRLVESTSLRKAIETVYAAYLPKNTHPFLYLSLEISPQNVD 307

Query: 329 VNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDV---ESSEACQMVLSKD 388
           VNVHPTK EV  L++E I+ER+Q  +ESKL  SN +R +  Q +    +  + +MV S  
Sbjct: 308 VNVHPTKHEVHFLHEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEMVKSTT 367

Query: 389 DTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPESS 433
              + S SGS S KV  H+MVRTDS +   +L A++Q  P   P SS
Sbjct: 368 SLTSSSTSGS-SDKVYAHQMVRTDSREQ--KLDAFLQ--PLSKPLSS 406

BLAST of Cla001012 vs. Swiss-Prot
Match: MLH1_MOUSE (DNA mismatch repair protein Mlh1 OS=Mus musculus GN=Mlh1 PE=1 SV=2)

HSP 1 Score: 384.4 bits (986), Expect = 1.7e-105
Identity = 208/400 (52.00%), Positives = 282/400 (70.50%), Query Frame = 1

Query: 29  ILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGHG 88
           I RLDE+VVNRIAAGEVIQRP +AIKE++EN LDA+ST++ VVVK+GGLKLIQ+ D+G G
Sbjct: 8   IRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAKSTNIQVVVKEGGLKLIQIQDNGTG 67

Query: 89  IRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYR 148
           IR EDL I+CER TTSKL TFEDL SI + GFRGEALAS+++V HVT+TT T      YR
Sbjct: 68  IRKEDLDIVCERFTTSKLQTFEDLASISTYGFRGEALASISHVAHVTITTKTADGKCAYR 127

Query: 149 VSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSRFAIHH 208
            SY DG ++  PKPCA  +GT ITVE+LFYN+  RRK L+N S++Y KI++++ R++IH+
Sbjct: 128 ASYSDGKLQAPPKPCAGNQGTLITVEDLFYNIITRRKALKNPSEEYGKILEVVGRYSIHN 187

Query: 209 INISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSGFKMDG 268
             ISFS +K G   +DV ++   + +D IR+++G +V+R LI++   +   A   FKM+G
Sbjct: 188 SGISFSVKKQGETVSDVRTLPNATTVDNIRSIFGNAVSRELIEVGCEDKTLA---FKMNG 247

Query: 269 LISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILPPEHVD 328
            ISN+NY  KK   +LFIN R+VE +AL++AIE VYAA LPK + PF+Y+S+ + P++VD
Sbjct: 248 YISNANYSVKKCIFLLFINHRLVESAALRKAIETVYAAYLPKNTHPFLYLSLEISPQNVD 307

Query: 329 VNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDV------ESSEACQMVL 388
           VNVHPTK EV  L++E I++R+Q  +ESKL  SN +R +  Q +       S EA +   
Sbjct: 308 VNVHPTKHEVHFLHEESILQRVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEAARPTT 367

Query: 389 SKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQ 423
                 + S SGS   KV  ++MVRTDS +   +L A++Q
Sbjct: 368 G---VASSSTSGS-GDKVYAYQMVRTDSREQ--KLDAFLQ 398

BLAST of Cla001012 vs. Swiss-Prot
Match: MLH1_RAT (DNA mismatch repair protein Mlh1 OS=Rattus norvegicus GN=Mlh1 PE=2 SV=1)

HSP 1 Score: 383.3 bits (983), Expect = 3.7e-105
Identity = 207/404 (51.24%), Positives = 282/404 (69.80%), Query Frame = 1

Query: 29  ILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGHG 88
           I RLDE+VVNRIAAGEVIQRP +AIKE+ EN LDA+ST++ V+V++GGLKLIQ+ D+G G
Sbjct: 8   IRRLDETVVNRIAAGEVIQRPANAIKEMTENCLDAKSTNIQVIVREGGLKLIQIQDNGTG 67

Query: 89  IRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYR 148
           IR EDL I+CER TTSKL TFEDL  I + GFRGEALAS+++V HVT+TT T      YR
Sbjct: 68  IRKEDLDIVCERFTTSKLQTFEDLAMISTYGFRGEALASISHVAHVTITTKTADGKCAYR 127

Query: 149 VSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSRFAIHH 208
            SY DG ++  PKPCA  +GT ITVE+LFYN+  R+K L+N S++Y KI++++ R++IH+
Sbjct: 128 ASYSDGKLQAPPKPCAGNQGTLITVEDLFYNIITRKKALKNPSEEYGKILEVVGRYSIHN 187

Query: 209 INISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSGFKMDG 268
             ISFS +K G   +DV ++   + +D IR+++G +V+R LI++   +   A   FKM+G
Sbjct: 188 SGISFSVKKQGETVSDVRTLPNATTVDNIRSIFGNAVSRELIEVGCEDKTLA---FKMNG 247

Query: 269 LISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILPPEHVD 328
            ISN+NY  KK   +LFIN R+VE +ALK+AIE VYAA LPK + PF+Y+ + + P++VD
Sbjct: 248 YISNANYSVKKCIFLLFINHRLVESAALKKAIEAVYAAYLPKNTHPFLYLILEISPQNVD 307

Query: 329 VNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDV---ESSEACQMVLSKD 388
           VNVHPTK EV  L++E I+ER+Q  +ESKL  SN +R +  Q +    +  + + V S  
Sbjct: 308 VNVHPTKHEVHFLHEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEAVKSTT 367

Query: 389 DTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLP 430
              + S SGS   KV  ++MVRTDS D   +L A++Q     LP
Sbjct: 368 GIASSSTSGS-GDKVHAYQMVRTDSRDQ--KLDAFMQPVSRRLP 405

BLAST of Cla001012 vs. Swiss-Prot
Match: MLH1_YEAST (DNA mismatch repair protein MLH1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=MLH1 PE=1 SV=2)

HSP 1 Score: 322.8 bits (826), Expect = 6.0e-87
Identity = 167/364 (45.88%), Positives = 252/364 (69.23%), Query Frame = 1

Query: 28  KILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGH 87
           +I  LD SVVN+IAAGE+I  PV+A+KE++ENS+DA +T ++++VK+GG+K++Q++D+G 
Sbjct: 4   RIKALDASVVNKIAAGEIIISPVNALKEMMENSIDANATMIDILVKEGGIKVLQITDNGS 63

Query: 88  GIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGY 147
           GI   DLPILCER TTSKL  FEDL  I++ GFRGEALAS+++V  VTVTT  K     +
Sbjct: 64  GINKADLPILCERFTTSKLQKFEDLSQIQTYGFRGEALASISHVARVTVTTKVKEDRCAW 123

Query: 148 RVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSRFAIH 207
           RVSY +G M   PKP A   GT I VE+LF+N+ +R + L++ +D+Y+KI+D++ R+AIH
Sbjct: 124 RVSYAEGKMLESPKPVAGKDGTTILVEDLFFNIPSRLRALRSHNDEYSKILDVVGRYAIH 183

Query: 208 HINISFSCRKHGAARADVHSVGPTSRL-DAIRTVYGASVARNLIKIEVSENGEACSGFKM 267
             +I FSC+K G +   + SV P+  + D IRTV+  SVA NLI   +S+  E  +   +
Sbjct: 184 SKDIGFSCKKFGDSNYSL-SVKPSYTVQDRIRTVFNKSVASNLITFHISK-VEDLNLESV 243

Query: 268 DGLISNSNYVAKK-ITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILPPE 327
           DG + N N+++KK I+ + FIN R+V C  L+RA+  VY+  LPK ++PFIY+ I++ P 
Sbjct: 244 DGKVCNLNFISKKSISPIFFINNRLVTCDLLRRALNSVYSNYLPKGNRPFIYLGIVIDPA 303

Query: 328 HVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLSKD 387
            VDVNVHPTK+EV  L+Q+ IIE+I + + ++L + + +R F+   + +++   ++   D
Sbjct: 304 AVDVNVHPTKREVRFLSQDEIIEKIANQLHAELSAIDTSRTFKASSISTNKPESLIPFND 363

Query: 388 DTQN 390
             ++
Sbjct: 364 TIES 365

BLAST of Cla001012 vs. TrEMBL
Match: A0A061FKD1_THECC (MUTL isoform 1 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.2e-187
Identity = 342/416 (82.21%), Positives = 373/416 (89.66%), Query Frame = 1

Query: 24  KEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVS 83
           KE PKI RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA STS++VVVKDGGLKLIQVS
Sbjct: 10  KELPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDASSTSISVVVKDGGLKLIQVS 69

Query: 84  DDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 143
           DDGHGIR+EDLPILCERHTTSKLS +EDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ
Sbjct: 70  DDGHGIRHEDLPILCERHTTSKLSKYEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 129

Query: 144 LHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSR 203
           LHGYRVSYRDG+MEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDYTKIVDLLSR
Sbjct: 130 LHGYRVSYRDGMMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYTKIVDLLSR 189

Query: 204 FAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSG 263
           FAIH+I++SFSCRKHGAARADVHSV  +SRLDAIR+VYG SVARNLIKIE S+N  + S 
Sbjct: 190 FAIHYIDVSFSCRKHGAARADVHSVATSSRLDAIRSVYGLSVARNLIKIEASDNDPSSSV 249

Query: 264 FKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILP 323
           F+MDG ISNSNYV KK TMVLFIN R+VEC+ALKRA+EIVY+ATLPKASKPFIYMSIILP
Sbjct: 250 FEMDGFISNSNYVVKKTTMVLFINDRLVECTALKRALEIVYSATLPKASKPFIYMSIILP 309

Query: 324 PEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLS 383
           PEHVDVNVHPTK+EVSLLNQEVIIE+IQS VES LR+SN++R FQEQ VESS +   + +
Sbjct: 310 PEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESMLRNSNESRTFQEQTVESSPSVPSITN 369

Query: 384 KDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE--SSLTTVR 438
            +   N S SGSKSQKVPVHKMVRTDS+DPAGRLHAY+  KP    E  SSLT VR
Sbjct: 370 NESHLNPSPSGSKSQKVPVHKMVRTDSSDPAGRLHAYLYKKPQNHLEMNSSLTAVR 425

BLAST of Cla001012 vs. TrEMBL
Match: A0A061FS59_THECC (MUTL isoform 2 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.2e-187
Identity = 342/416 (82.21%), Positives = 373/416 (89.66%), Query Frame = 1

Query: 24  KEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVS 83
           KE PKI RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA STS++VVVKDGGLKLIQVS
Sbjct: 10  KELPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDASSTSISVVVKDGGLKLIQVS 69

Query: 84  DDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 143
           DDGHGIR+EDLPILCERHTTSKLS +EDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ
Sbjct: 70  DDGHGIRHEDLPILCERHTTSKLSKYEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 129

Query: 144 LHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSR 203
           LHGYRVSYRDG+MEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDYTKIVDLLSR
Sbjct: 130 LHGYRVSYRDGMMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYTKIVDLLSR 189

Query: 204 FAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSG 263
           FAIH+I++SFSCRKHGAARADVHSV  +SRLDAIR+VYG SVARNLIKIE S+N  + S 
Sbjct: 190 FAIHYIDVSFSCRKHGAARADVHSVATSSRLDAIRSVYGLSVARNLIKIEASDNDPSSSV 249

Query: 264 FKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILP 323
           F+MDG ISNSNYV KK TMVLFIN R+VEC+ALKRA+EIVY+ATLPKASKPFIYMSIILP
Sbjct: 250 FEMDGFISNSNYVVKKTTMVLFINDRLVECTALKRALEIVYSATLPKASKPFIYMSIILP 309

Query: 324 PEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLS 383
           PEHVDVNVHPTK+EVSLLNQEVIIE+IQS VES LR+SN++R FQEQ VESS +   + +
Sbjct: 310 PEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESMLRNSNESRTFQEQTVESSPSVPSITN 369

Query: 384 KDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE--SSLTTVR 438
            +   N S SGSKSQKVPVHKMVRTDS+DPAGRLHAY+  KP    E  SSLT VR
Sbjct: 370 NESHLNPSPSGSKSQKVPVHKMVRTDSSDPAGRLHAYLYKKPQNHLEMNSSLTAVR 425

BLAST of Cla001012 vs. TrEMBL
Match: A0A061FJ50_THECC (MUTL isoform 3 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 1.2e-187
Identity = 342/416 (82.21%), Positives = 373/416 (89.66%), Query Frame = 1

Query: 24  KEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVS 83
           KE PKI RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA STS++VVVKDGGLKLIQVS
Sbjct: 10  KELPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDASSTSISVVVKDGGLKLIQVS 69

Query: 84  DDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 143
           DDGHGIR+EDLPILCERHTTSKLS +EDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ
Sbjct: 70  DDGHGIRHEDLPILCERHTTSKLSKYEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 129

Query: 144 LHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSR 203
           LHGYRVSYRDG+MEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDYTKIVDLLSR
Sbjct: 130 LHGYRVSYRDGMMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYTKIVDLLSR 189

Query: 204 FAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSG 263
           FAIH+I++SFSCRKHGAARADVHSV  +SRLDAIR+VYG SVARNLIKIE S+N  + S 
Sbjct: 190 FAIHYIDVSFSCRKHGAARADVHSVATSSRLDAIRSVYGLSVARNLIKIEASDNDPSSSV 249

Query: 264 FKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILP 323
           F+MDG ISNSNYV KK TMVLFIN R+VEC+ALKRA+EIVY+ATLPKASKPFIYMSIILP
Sbjct: 250 FEMDGFISNSNYVVKKTTMVLFINDRLVECTALKRALEIVYSATLPKASKPFIYMSIILP 309

Query: 324 PEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLS 383
           PEHVDVNVHPTK+EVSLLNQEVIIE+IQS VES LR+SN++R FQEQ VESS +   + +
Sbjct: 310 PEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESMLRNSNESRTFQEQTVESSPSVPSITN 369

Query: 384 KDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE--SSLTTVR 438
            +   N S SGSKSQKVPVHKMVRTDS+DPAGRLHAY+  KP    E  SSLT VR
Sbjct: 370 NESHLNPSPSGSKSQKVPVHKMVRTDSSDPAGRLHAYLYKKPQNHLEMNSSLTAVR 425

BLAST of Cla001012 vs. TrEMBL
Match: A0A097PJQ0_FRAVE (MLH1 (Fragment) OS=Fragaria vesca PE=2 SV=1)

HSP 1 Score: 657.5 bits (1695), Expect = 1.1e-185
Identity = 333/411 (81.02%), Positives = 373/411 (90.75%), Query Frame = 1

Query: 29  ILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGHG 88
           I RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA S+S+NVVVKDGGLKLIQVSD+GHG
Sbjct: 1   IHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDAHSSSINVVVKDGGLKLIQVSDNGHG 60

Query: 89  IRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQLHGYR 148
           IRYEDLPILCERHTTSKLS+FEDLQSIKSMGFRGEALASMTYV HVTVTTITKGQLHGYR
Sbjct: 61  IRYEDLPILCERHTTSKLSSFEDLQSIKSMGFRGEALASMTYVAHVTVTTITKGQLHGYR 120

Query: 149 VSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSRFAIHH 208
           VSY+DGVME+EPK CAAVKGTQI +ENLFYNM+ARRK LQN++DDY+KIVDLLSRFAIHH
Sbjct: 121 VSYKDGVMENEPKACAAVKGTQIMIENLFYNMSARRKNLQNSADDYSKIVDLLSRFAIHH 180

Query: 209 INISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSGFKMDG 268
           IN+SFSCRKHGA RADV SV   SR+DAIR+VYGASVAR+L+KIE S+   + S F+MDG
Sbjct: 181 INVSFSCRKHGAGRADVSSVATVSRIDAIRSVYGASVARSLMKIEASDKDPSSSIFQMDG 240

Query: 269 LISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILPPEHVD 328
           L SNS YVAKKITMVLFIN R+V+C+ALKRA+EIVYAATLPKASKPF+YMSI+LPPEHVD
Sbjct: 241 LFSNSEYVAKKITMVLFINDRLVDCTALKRALEIVYAATLPKASKPFLYMSIVLPPEHVD 300

Query: 329 VNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLSKDDTQ 388
           VNVHPTK+EVSLLNQEVIIE+IQS VES+LRSSN+T+ FQEQ VE S +CQM+ SKD  +
Sbjct: 301 VNVHPTKREVSLLNQEVIIEKIQSVVESRLRSSNETQIFQEQTVEPSSSCQMISSKDSNR 360

Query: 389 NCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPG--LPESSLTTVR 438
           N S SGSKSQKVPV+KMVRTDS+DPAGRLH Y+Q +P G  +  +SLT VR
Sbjct: 361 NPSPSGSKSQKVPVNKMVRTDSSDPAGRLHIYLQAQPHGHLVKNTSLTAVR 411

BLAST of Cla001012 vs. TrEMBL
Match: A0A061FJV6_THECC (MUTL isoform 4 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 1.4e-183
Identity = 338/416 (81.25%), Positives = 369/416 (88.70%), Query Frame = 1

Query: 24  KEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVS 83
           KE PKI RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA STS++VVVKDGGLKLIQVS
Sbjct: 10  KELPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDASSTSISVVVKDGGLKLIQVS 69

Query: 84  DDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 143
           DDGHGIR+EDLPILCERHTTSKLS +EDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ
Sbjct: 70  DDGHGIRHEDLPILCERHTTSKLSKYEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 129

Query: 144 LHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSR 203
           LHG    YRDG+MEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDYTKIVDLLSR
Sbjct: 130 LHG----YRDGMMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYTKIVDLLSR 189

Query: 204 FAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSG 263
           FAIH+I++SFSCRKHGAARADVHSV  +SRLDAIR+VYG SVARNLIKIE S+N  + S 
Sbjct: 190 FAIHYIDVSFSCRKHGAARADVHSVATSSRLDAIRSVYGLSVARNLIKIEASDNDPSSSV 249

Query: 264 FKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILP 323
           F+MDG ISNSNYV KK TMVLFIN R+VEC+ALKRA+EIVY+ATLPKASKPFIYMSIILP
Sbjct: 250 FEMDGFISNSNYVVKKTTMVLFINDRLVECTALKRALEIVYSATLPKASKPFIYMSIILP 309

Query: 324 PEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLS 383
           PEHVDVNVHPTK+EVSLLNQEVIIE+IQS VES LR+SN++R FQEQ VESS +   + +
Sbjct: 310 PEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESMLRNSNESRTFQEQTVESSPSVPSITN 369

Query: 384 KDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE--SSLTTVR 438
            +   N S SGSKSQKVPVHKMVRTDS+DPAGRLHAY+  KP    E  SSLT VR
Sbjct: 370 NESHLNPSPSGSKSQKVPVHKMVRTDSSDPAGRLHAYLYKKPQNHLEMNSSLTAVR 421

BLAST of Cla001012 vs. NCBI nr
Match: gi|778709830|ref|XP_011656465.1| (PREDICTED: DNA mismatch repair protein MLH1 [Cucumis sativus])

HSP 1 Score: 810.8 bits (2093), Expect = 1.2e-231
Identity = 413/438 (94.29%), Positives = 427/438 (97.49%), Query Frame = 1

Query: 1   METHANEDIIPMDACGD-EELVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVEN 60
           METHAN++IIPMD  G+ EE+VPCKEPPKILRL+ESVVNRIAAGEVIQRPVSA+KELVEN
Sbjct: 1   METHANDEIIPMDTAGEQEEVVPCKEPPKILRLEESVVNRIAAGEVIQRPVSAVKELVEN 60

Query: 61  SLDAQSTSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMG 120
           SLDAQ+TSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMG
Sbjct: 61  SLDAQATSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMG 120

Query: 121 FRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYN 180
           FRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYN
Sbjct: 121 FRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYN 180

Query: 181 MTARRKTLQNASDDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRT 240
           MTARRKTLQNASDDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRT
Sbjct: 181 MTARRKTLQNASDDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRT 240

Query: 241 VYGASVARNLIKIEVSENGEACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRA 300
           VYGASVARNL+KIEVSEN EACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRA
Sbjct: 241 VYGASVARNLMKIEVSENDEACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRA 300

Query: 301 IEIVYAATLPKASKPFIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLR 360
           IEIVYAATLPKASKP+IYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLR
Sbjct: 301 IEIVYAATLPKASKPYIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLR 360

Query: 361 SSNDTRAFQEQDVESSEACQMVLSKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHA 420
           SSNDT+AFQEQDVESSEA QM+LS DD+QN S+ GSKSQKVPVHKMVR DSTDPAGRLHA
Sbjct: 361 SSNDTKAFQEQDVESSEAYQMLLSNDDSQNFSKFGSKSQKVPVHKMVRADSTDPAGRLHA 420

Query: 421 YVQMKPPGLPESSLTTVR 438
           YVQMK PGLPES+LT VR
Sbjct: 421 YVQMKRPGLPESTLTAVR 438

BLAST of Cla001012 vs. NCBI nr
Match: gi|659126219|ref|XP_008463072.1| (PREDICTED: DNA mismatch repair protein MLH1 isoform X1 [Cucumis melo])

HSP 1 Score: 807.7 bits (2085), Expect = 9.8e-231
Identity = 413/438 (94.29%), Positives = 424/438 (96.80%), Query Frame = 1

Query: 1   METHANEDIIPMDACGDE-ELVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVEN 60
           METHAN++IIPMD  G+E E+VPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVEN
Sbjct: 1   METHANDEIIPMDTAGEEEEVVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVEN 60

Query: 61  SLDAQSTSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMG 120
           SLDAQ+TSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMG
Sbjct: 61  SLDAQATSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMG 120

Query: 121 FRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYN 180
           FRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYN
Sbjct: 121 FRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYN 180

Query: 181 MTARRKTLQNASDDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRT 240
           MTARRKTLQNASDDYTKIVDLLSRFAIHH NISFSCRKHGAARADVHSVGPTSRLDAIRT
Sbjct: 181 MTARRKTLQNASDDYTKIVDLLSRFAIHHTNISFSCRKHGAARADVHSVGPTSRLDAIRT 240

Query: 241 VYGASVARNLIKIEVSENGEACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRA 300
           VYGASVARNL+KIEVSEN EACSGF+MDGLISNSNYVAKKI MVLFINGRMVECSALKRA
Sbjct: 241 VYGASVARNLMKIEVSENDEACSGFQMDGLISNSNYVAKKIMMVLFINGRMVECSALKRA 300

Query: 301 IEIVYAATLPKASKPFIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLR 360
           IEIVYAATLPKASKP+IYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLR
Sbjct: 301 IEIVYAATLPKASKPYIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLR 360

Query: 361 SSNDTRAFQEQDVESSEACQMVLSKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHA 420
           SSNDT+A+QEQDVESS A QMVLS DDTQN S+SGSKSQKVPVHKMVR DSTDPAGRLHA
Sbjct: 361 SSNDTKAYQEQDVESSVAYQMVLSNDDTQNSSKSGSKSQKVPVHKMVRADSTDPAGRLHA 420

Query: 421 YVQMKPPGLPESSLTTVR 438
           YVQMK PGLPESSL  VR
Sbjct: 421 YVQMKQPGLPESSLPAVR 438

BLAST of Cla001012 vs. NCBI nr
Match: gi|470131067|ref|XP_004301421.1| (PREDICTED: DNA mismatch repair protein MLH1 [Fragaria vesca subsp. vesca])

HSP 1 Score: 668.3 bits (1723), Expect = 9.3e-189
Identity = 339/422 (80.33%), Positives = 379/422 (89.81%), Query Frame = 1

Query: 18  EELVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGL 77
           EE     EPPKI RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA S+S+NVVVKDGGL
Sbjct: 4   EEAQVATEPPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDAHSSSINVVVKDGGL 63

Query: 78  KLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVT 137
           KLIQVSD+GHGIRYEDLPILCERHTTSKLS+FEDLQSIKSMGFRGEALASMTYV HVTVT
Sbjct: 64  KLIQVSDNGHGIRYEDLPILCERHTTSKLSSFEDLQSIKSMGFRGEALASMTYVAHVTVT 123

Query: 138 TITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKI 197
           TITKGQLHGYRVSY+DGVME+EPK CAAVKGTQI +ENLFYNM+ARRK LQN++DDY+KI
Sbjct: 124 TITKGQLHGYRVSYKDGVMENEPKACAAVKGTQIMIENLFYNMSARRKNLQNSADDYSKI 183

Query: 198 VDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSEN 257
           VDLLSRFAIHHIN+SFSCRKHGA RADV SV   SR+DAIR+VYGASVAR+L+KIE S+ 
Sbjct: 184 VDLLSRFAIHHINVSFSCRKHGAGRADVSSVATVSRIDAIRSVYGASVARSLMKIEASDK 243

Query: 258 GEACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIY 317
             + S F+MDGL SNS YVAKKITMVLFIN R+V+C+ALKRA+EIVYAATLPKASKPF+Y
Sbjct: 244 DPSSSIFQMDGLFSNSEYVAKKITMVLFINDRLVDCTALKRALEIVYAATLPKASKPFLY 303

Query: 318 MSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEA 377
           MSI+LPPEHVDVNVHPTK+EVSLLNQEVIIE+IQS VES+LRSSN+T+ FQEQ VE S +
Sbjct: 304 MSIVLPPEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESRLRSSNETQIFQEQTVEPSSS 363

Query: 378 CQMVLSKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPG--LPESSLTT 437
           CQM+ SKD  +N S SGSKSQKVPV+KMVRTDS+DPAGRLH Y+Q +P G  +  +SLT 
Sbjct: 364 CQMISSKDSNRNPSPSGSKSQKVPVNKMVRTDSSDPAGRLHIYLQAQPHGHLVKNTSLTA 423

BLAST of Cla001012 vs. NCBI nr
Match: gi|645224076|ref|XP_008218935.1| (PREDICTED: DNA mismatch repair protein MLH1 isoform X2 [Prunus mume])

HSP 1 Score: 664.5 bits (1713), Expect = 1.3e-187
Identity = 340/428 (79.44%), Positives = 382/428 (89.25%), Query Frame = 1

Query: 12  MDACGDEELVPCKEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVV 71
           M+   +EE VP  EPPKI RLD+SVVNRIAAGEVIQRPVSA+KELVENSLDA S+S+NVV
Sbjct: 3   MEIEAEEEQVPM-EPPKIHRLDDSVVNRIAAGEVIQRPVSAVKELVENSLDACSSSINVV 62

Query: 72  VKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYV 131
           VKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYV
Sbjct: 63  VKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYV 122

Query: 132 GHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNAS 191
            HVTVTTITKGQLHGYRVSY+DGVMEHEPK CAAVKGTQI VENLFYNMTARRKTLQN++
Sbjct: 123 AHVTVTTITKGQLHGYRVSYKDGVMEHEPKACAAVKGTQIMVENLFYNMTARRKTLQNSA 182

Query: 192 DDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIK 251
           DDY+KIVD+LSRFAIHH+N+SFSCRKHGAARADV+SV   SR+DAIR+VYG SVAR L+K
Sbjct: 183 DDYSKIVDVLSRFAIHHMNVSFSCRKHGAARADVNSVATISRIDAIRSVYGVSVARCLMK 242

Query: 252 IEVSENGEACSGFKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKA 311
           +E  +   + S F+M+G ISNSNYVAKKITMVLFIN R+V+C+ALKRA+EIVYAATLPKA
Sbjct: 243 VEALDKDPSSSVFQMEGFISNSNYVAKKITMVLFINDRLVDCTALKRALEIVYAATLPKA 302

Query: 312 SKPFIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQD 371
           SKPFIYM+IILPPEHVDVNVHPTK+EVSLLNQE+IIE+IQS VES+LRSSN+T+ FQEQ 
Sbjct: 303 SKPFIYMAIILPPEHVDVNVHPTKREVSLLNQEIIIEKIQSVVESRLRSSNETQTFQEQA 362

Query: 372 VESSEACQMVLSKDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE- 431
           V+ + +CQMV S D  +N S SGSK QKVPVHKMVRTDS+DPAGRLH Y+Q +  G  E 
Sbjct: 363 VKPTPSCQMVSSNDSNRNPSPSGSKLQKVPVHKMVRTDSSDPAGRLHVYLQPESCGHLER 422

Query: 432 -SSLTTVR 438
            +SLT +R
Sbjct: 423 NTSLTAIR 429
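
The Identity and Positives figures in each HSP header can be recomputed from the alignment rows themselves. The sketch below is illustrative only: it assumes the usual BLAST text convention in which the middle line carries a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. The function names `hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool.

```python
def hsp_stats(query: str, midline: str, subject: str) -> dict:
    """Count identities, positives and gap columns for one aligned block.

    All three strings must be the sequence portion only (no 'Query:' label,
    no coordinates) and must have the same length.
    """
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("query, midline and subject must have equal length")
    identities = sum(1 for c in midline if c.isalpha())
    # Positives include identities plus conservative substitutions ('+').
    positives = sum(1 for c in midline if c.isalpha() or c == "+")
    gaps = sum(1 for q, s in zip(query, subject) if q == "-" or s == "-")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
    }


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / total, 2)


# Final block of the Prunus mume HSP shown above (query 432..438).
last_block = hsp_stats("-SSLTTVR", " +SLT +R", "NTSLTAIR")
print(last_block)

# Header values for the same HSP: Identity = 340/428, Positives = 382/428.
print(percent(340, 428), percent(382, 428))
```

Summing the per-block counts over every block of an HSP should give the numerators reported in its header, and the summed block lengths the denominator.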

BLAST of Cla001012 vs. NCBI nr
Match: gi|590603959|ref|XP_007020138.1| (MUTL isoform 1 [Theobroma cacao])

HSP 1 Score: 664.1 bits (1712), Expect = 1.8e-187
Identity = 342/416 (82.21%), Positives = 373/416 (89.66%), Query Frame = 1

Query: 24  KEPPKILRLDESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVS 83
           KE PKI RLDESVVNRIAAGEVIQRPVSA+KELVENSLDA STS++VVVKDGGLKLIQVS
Sbjct: 10  KELPKIHRLDESVVNRIAAGEVIQRPVSAVKELVENSLDASSTSISVVVKDGGLKLIQVS 69

Query: 84  DDGHGIRYEDLPILCERHTTSKLSTFEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 143
           DDGHGIR+EDLPILCERHTTSKLS +EDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ
Sbjct: 70  DDGHGIRHEDLPILCERHTTSKLSKYEDLQSIKSMGFRGEALASMTYVGHVTVTTITKGQ 129

Query: 144 LHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMTARRKTLQNASDDYTKIVDLLSR 203
           LHGYRVSYRDG+MEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDYTKIVDLLSR
Sbjct: 130 LHGYRVSYRDGMMEHEPKACAAVKGTQIMVENLFYNMIARRKTLQNSADDYTKIVDLLSR 189

Query: 204 FAIHHINISFSCRKHGAARADVHSVGPTSRLDAIRTVYGASVARNLIKIEVSENGEACSG 263
           FAIH+I++SFSCRKHGAARADVHSV  +SRLDAIR+VYG SVARNLIKIE S+N  + S 
Sbjct: 190 FAIHYIDVSFSCRKHGAARADVHSVATSSRLDAIRSVYGLSVARNLIKIEASDNDPSSSV 249

Query: 264 FKMDGLISNSNYVAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIILP 323
           F+MDG ISNSNYV KK TMVLFIN R+VEC+ALKRA+EIVY+ATLPKASKPFIYMSIILP
Sbjct: 250 FEMDGFISNSNYVVKKTTMVLFINDRLVECTALKRALEIVYSATLPKASKPFIYMSIILP 309

Query: 324 PEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRAFQEQDVESSEACQMVLS 383
           PEHVDVNVHPTK+EVSLLNQEVIIE+IQS VES LR+SN++R FQEQ VESS +   + +
Sbjct: 310 PEHVDVNVHPTKREVSLLNQEVIIEKIQSVVESMLRNSNESRTFQEQTVESSPSVPSITN 369

Query: 384 KDDTQNCSQSGSKSQKVPVHKMVRTDSTDPAGRLHAYVQMKPPGLPE--SSLTTVR 438
            +   N S SGSKSQKVPVHKMVRTDS+DPAGRLHAY+  KP    E  SSLT VR
Sbjct: 370 NESHLNPSPSGSKSQKVPVHKMVRTDSSDPAGRLHAYLYKKPQNHLEMNSSLTAVR 425

The following BLAST results are available for this feature:
Match Name  E-value   Identity (%)  Description
MLH1_ARATH  1.8e-176  76.19         DNA mismatch repair protein MLH1 OS=Arabidopsis thaliana GN=MLH1 PE=2 SV=1
MLH1_HUMAN  1.5e-109  53.07         DNA mismatch repair protein Mlh1 OS=Homo sapiens GN=MLH1 PE=1 SV=1
MLH1_MOUSE  1.7e-105  52.00         DNA mismatch repair protein Mlh1 OS=Mus musculus GN=Mlh1 PE=1 SV=2
MLH1_RAT    3.7e-105  51.24         DNA mismatch repair protein Mlh1 OS=Rattus norvegicus GN=Mlh1 PE=2 SV=1
MLH1_YEAST  6.0e-87   45.88         DNA mismatch repair protein MLH1 OS=Saccharomyces cerevisiae (strain ATCC 204508...

Match Name        E-value   Identity (%)  Description
A0A061FKD1_THECC  1.2e-187  82.21         MUTL isoform 1 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1
A0A061FS59_THECC  1.2e-187  82.21         MUTL isoform 2 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1
A0A061FJ50_THECC  1.2e-187  82.21         MUTL isoform 3 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1
A0A097PJQ0_FRAVE  1.1e-185  81.02         MLH1 (Fragment) OS=Fragaria vesca PE=2 SV=1
A0A061FJV6_THECC  1.4e-183  81.25         MUTL isoform 4 OS=Theobroma cacao GN=TCM_036511 PE=4 SV=1

Match Name                        E-value   Identity (%)  Description
gi|778709830|ref|XP_011656465.1|  1.2e-231  94.29         PREDICTED: DNA mismatch repair protein MLH1 [Cucumis sativus]
gi|659126219|ref|XP_008463072.1|  9.8e-231  94.29         PREDICTED: DNA mismatch repair protein MLH1 isoform X1 [Cucumis melo]
gi|470131067|ref|XP_004301421.1|  9.3e-189  80.33         PREDICTED: DNA mismatch repair protein MLH1 [Fragaria vesca subsp. vesca]
gi|645224076|ref|XP_008218935.1|  1.3e-187  79.44         PREDICTED: DNA mismatch repair protein MLH1 isoform X2 [Prunus mume]
gi|590603959|ref|XP_007020138.1|  1.8e-187  82.21         MUTL isoform 1 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002099  DNA_mismatch_repair_N
IPR003594  HATPase_C
IPR013507  DNA_mismatch_S5_2-like
IPR014721  Ribosomal_S5_D2-typ_fold_subgr
IPR014762  DNA_mismatch_repair_CS
IPR020568  Ribosomal_S5_D2-typ_fold
Vocabulary: Molecular Function
Term        Definition
GO:0005524  ATP binding
GO:0030983  mismatched DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006298  mismatch repair
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006298      mismatch repair
biological_process  GO:0009555      pollen development
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0048316      seed development
biological_process  GO:0009845      seed germination
biological_process  GO:0006312      mitotic recombination
cellular_component  GO:0005712      chiasma
cellular_component  GO:0000795      synaptonemal complex
cellular_component  GO:0032389      MutLalpha complex
cellular_component  GO:0000790      nuclear chromatin
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0032300      mismatch repair complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0030983      mismatched DNA binding
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0016887      ATPase activity
molecular_function  GO:0003697      single-stranded DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla001012     Cla001012.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002099  DNA mismatch repair protein family  TIGRFAMs  TIGR00585  TIGR00585  coord: 28..338; score: 4.2
IPR003594  Histidine kinase-like ATPase, C-terminal domain  GENE3D  G3DSA:3.30.565.10  -  coord: 28..235; score: 2.8
IPR003594  Histidine kinase-like ATPase, C-terminal domain  unknown  SSF55874  ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase  coord: 43..220; score: 1.23
IPR013507  DNA mismatch repair protein, C-terminal  PFAM  PF01119  DNA_mis_repair  coord: 237..358; score: 4.2
IPR013507  DNA mismatch repair protein, C-terminal  SMART  SM01340  DNA_mis_repair_2  coord: 237..359; score: 1.8
IPR014721  Ribosomal protein S5 domain 2-type fold, subgroup  GENE3D  G3DSA:3.30.230.10  -  coord: 236..358; score: 4.3
IPR014762  DNA mismatch repair, conserved site  PROSITE  PS00058  DNA_MISMATCH_REPAIR_1  coord: 119..125; score:
IPR020568  Ribosomal protein S5 domain 2-type fold  unknown  SSF54211  Ribosomal protein S5 domain 2-like  coord: 227..359; score: 8.34
None  No IPR available  PANTHER  PTHR10073  DNA MISMATCH REPAIR PROTEIN MLH, PMS, MUTL  coord: 29..412; score: 2.0E
None  No IPR available  PANTHER  PTHR10073:SF40  DNA MISMATCH REPAIR PROTEIN MLH1  coord: 29..412; score: 2.0E
None  No IPR available  PFAM  PF13589  HATPase_c_3  coord: 50..155; score: 4.9
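
The `coord: start..end` values in the Alignment column can be compared to see how the predicted domains sit relative to one another. The following is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this explicitly). The helper names `parse_coord`, `span_length` and `overlap` are hypothetical.

```python
import re

# Matches strings such as "coord: 28..338" (assumed 1-based, inclusive).
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from an Alignment cell."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span: tuple[int, int]) -> int:
    """Number of residues covered by an inclusive span."""
    start, end = span
    return end - start + 1


def overlap(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


# Three of the hits listed in the table above.
hatpase_pfam = parse_coord("coord: 50..155")      # PF13589
hatpase_ssf = parse_coord("coord: 43..220")       # SSF55874
mis_repair_pfam = parse_coord("coord: 237..358")  # PF01119

print(span_length(hatpase_pfam))
print(overlap(hatpase_pfam, hatpase_ssf))
print(overlap(hatpase_pfam, mis_repair_pfam))
```

Under that assumption, the PF13589 hit lies entirely inside the SSF55874 span, while the PF01119 hit does not overlap it.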