Cmc12g0314871 (gene) Melon (Charmono) v1.1

Overview
NameCmc12g0314871
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
LocationCMiso1.1chr12: 265766 .. 268777 (-)
RNA-Seq ExpressionCmc12g0314871
SyntenyCmc12g0314871
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAATATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA

mRNA sequence

ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAATATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA

Coding sequence (CDS)

ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTTCAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTCATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAATATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATCATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACATTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA

Protein sequence

MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Homology
BLAST of Cmc12g0314871 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2027.3 bits (5251), Expect = 0.0e+00
Identity = 1003/1003 (100.00%), Postives = 1003/1003 (100.00%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293  NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS
Sbjct: 353  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI
Sbjct: 413  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
            LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Sbjct: 473  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP
Sbjct: 533  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 420
            RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST
Sbjct: 593  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 480
            RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED
Sbjct: 653  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
            PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541  VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
            VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
Sbjct: 773  VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 832

Query: 601  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 660
            NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP
Sbjct: 833  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

Query: 661  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR 720
            CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
Sbjct: 893  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR 952

Query: 721  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 780
            DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY
Sbjct: 953  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 1012

Query: 781  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK 840
            ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK
Sbjct: 1013 ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK 1072

Query: 841  DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 900
            DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Sbjct: 1073 DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 1132

Query: 901  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 960
            KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
Sbjct: 1133 KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 1192

Query: 961  RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1004
            RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1193 RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc12g0314871 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2000.7 bits (5182), Expect = 0.0e+00
Identity = 989/1003 (98.60%), Postives = 996/1003 (99.30%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293  NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS
Sbjct: 353  LRLGHINLDRIGRLVKDGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI
Sbjct: 413  DLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
             RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Sbjct: 473  FRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP
Sbjct: 533  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 420
            RSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST
Sbjct: 593  RSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 480
            RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED
Sbjct: 653  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
            PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541  VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
            VQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNG
Sbjct: 773  VQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSIAKFYDYEIWQMDVKTAFLNG 832

Query: 601  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 660
            NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP
Sbjct: 833  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

Query: 661  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR 720
            CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIR
Sbjct: 893  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEGQYVLGIQIIR 952

Query: 721  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 780
            DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY
Sbjct: 953  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 1012

Query: 781  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK 840
            ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAK
Sbjct: 1013 ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAK 1072

Query: 841  DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 900
            DLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Sbjct: 1073 DLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 1132

Query: 901  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 960
            KEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
Sbjct: 1133 KEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 1192

Query: 961  RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1004
            RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1193 RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc12g0314871 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1785.4 bits (4623), Expect = 0.0e+00
Identity = 881/882 (99.89%), Postives = 882/882 (100.00%), Query Frame = 0

Query: 122  RLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD 181
            +LGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD
Sbjct: 228  KLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD 287

Query: 182  LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL 241
            LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL
Sbjct: 288  LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL 347

Query: 242  RSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPS 301
            RSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPS
Sbjct: 348  RSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPS 407

Query: 302  SFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPR 361
            SFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPR
Sbjct: 408  SFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPR 467

Query: 362  SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR 421
            SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR
Sbjct: 468  SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR 527

Query: 422  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP 481
            VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP
Sbjct: 528  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP 587

Query: 482  LSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV 541
            LSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV
Sbjct: 588  LSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV 647

Query: 542  QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN 601
            QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN
Sbjct: 648  QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN 707

Query: 602  LEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC 661
            LEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
Sbjct: 708  LEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC 767

Query: 662  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD 721
            VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD
Sbjct: 768  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD 827

Query: 722  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA 781
            RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA
Sbjct: 828  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA 887

Query: 782  SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKD 841
            SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKD
Sbjct: 888  SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKD 947

Query: 842  LILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAK 901
            LILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAK
Sbjct: 948  LILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAK 1007

Query: 902  EAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR 961
            EAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR
Sbjct: 1008 EAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR 1067

Query: 962  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1004
            GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1068 GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1109

BLAST of Cmc12g0314871 vs. NCBI nr
Match: KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1656.0 bits (4287), Expect = 0.0e+00
Identity = 810/1004 (80.68%), Postives = 897/1004 (89.34%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++
Sbjct: 335  MTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNV 394

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWH
Sbjct: 395  NKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWH 454

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HS
Sbjct: 455  LRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHS 514

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARGGFEYFI+F DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK 
Sbjct: 515  DLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT 574

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
             RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Sbjct: 575  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLP 634

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEP
Sbjct: 635  NSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEP 694

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EAT 420
            RS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T
Sbjct: 695  RSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETT 754

Query: 421  DESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDD 480
            + STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D 
Sbjct: 755  EPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDG 814

Query: 481  GVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRD 540
             +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR 
Sbjct: 815  DIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRG 874

Query: 541  SAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA 600
            + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Sbjct: 875  ADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTA 934

Query: 601  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQN 660
            FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ 
Sbjct: 935  FLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQI 994

Query: 661  VDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI 720
            VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGI
Sbjct: 995  VDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGI 1054

Query: 721  QIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMR 780
            QI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR
Sbjct: 1055 QIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMR 1114

Query: 781  RIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLV 840
             IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LV
Sbjct: 1115 HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLV 1174

Query: 841  YGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 900
            YG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Sbjct: 1175 YGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 1234

Query: 901  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR 960
            CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Sbjct: 1235 CEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIR 1294

Query: 961  EIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM 1001
            EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Sbjct: 1295 EIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDM 1338

BLAST of Cmc12g0314871 vs. NCBI nr
Match: KAA0035879.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051221.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051893.1 gag/pol protein [Cucumis melo var. makuwa] >TYK00551.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1656.0 bits (4287), Expect = 0.0e+00
Identity = 810/1004 (80.68%), Postives = 897/1004 (89.34%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++
Sbjct: 336  MTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNV 395

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWH
Sbjct: 396  NKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWH 455

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HS
Sbjct: 456  LRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHS 515

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARGGFEYFI+F DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK 
Sbjct: 516  DLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT 575

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
             RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Sbjct: 576  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLP 635

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEP
Sbjct: 636  NSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEP 695

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EAT 420
            RS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T
Sbjct: 696  RSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETT 755

Query: 421  DESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDD 480
            + STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D 
Sbjct: 756  EPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDG 815

Query: 481  GVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRD 540
             +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR 
Sbjct: 816  DIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRG 875

Query: 541  SAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA 600
            + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Sbjct: 876  ADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTA 935

Query: 601  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQN 660
            FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ 
Sbjct: 936  FLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQI 995

Query: 661  VDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI 720
            VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGI
Sbjct: 996  VDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGI 1055

Query: 721  QIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMR 780
            QI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR
Sbjct: 1056 QIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMR 1115

Query: 781  RIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLV 840
             IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LV
Sbjct: 1116 HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLV 1175

Query: 841  YGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 900
            YG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Sbjct: 1176 YGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 1235

Query: 901  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR 960
            CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Sbjct: 1236 CEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIR 1295

Query: 961  EIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM 1001
            EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Sbjct: 1296 EIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDM 1339

BLAST of Cmc12g0314871 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 719.2 bits (1855), Expect = 6.7e-206
Identity = 400/1027 (38.95%), Postives = 604/1027 (58.81%), Query Frame = 0

Query: 2    TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 61
            T+K+G         +GD  +       + L+++  VP ++ NL+S   L    Y   F+ 
Sbjct: 321  TVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFAN 380

Query: 62   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 121
             +  + K  + I        LY       +  LN            +  IS +    LWH
Sbjct: 381  QKWRLTKGSLVIAKGVARGTLYRTNAEICQGELN----------AAQDEISVD----LWH 440

Query: 122  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 181
             R+GH++   +  L K  L++  K  ++ PC+ CL GK  +  F     R    L+L++S
Sbjct: 441  KRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYS 500

Query: 182  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 241
            D+CGPM +++ GG +YF++FIDD SR  ++Y+++ K +  + F+++   VE    +K+K 
Sbjct: 501  DVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKR 560

Query: 242  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 301
            LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP
Sbjct: 561  LRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLP 620

Query: 302  SSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKK 361
             SFWG AV+TA +++N  PS  ++ E P  +W  ++ S SH +++GC   AHV      K
Sbjct: 621  KSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTK 680

Query: 362  LEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHM----RNHKPRSKLVLS 421
            L+ +S  C F+GY  E  G   +DP + +V  S +  F E +       + K ++ ++ +
Sbjct: 681  LDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPN 740

Query: 422  EATDESTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---R 481
              T  ST   +     S  DE +  G+                    HP+Q        R
Sbjct: 741  FVTIPSTS-NNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLR 800

Query: 482  RSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNS 541
            RS R   +  RY   +   V+I DD   +P S K+ ++  +K+Q +KAM  EMES+  N 
Sbjct: 801  RSERPRVESRRY--PSTEYVLISDD--REPESLKEVLSHPEKNQLMKAMQEEMESLQKNG 860

Query: 542  VWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAML 601
             ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  +
Sbjct: 861  TYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKM 920

Query: 602  KSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIY 661
             SIR +LS+A   D E+ Q+DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+Y
Sbjct: 921  TSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLY 980

Query: 662  GLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGY 721
            GLKQA R W ++FD+ +KS  + +   +PCVY K+ ++     L+LYVDD+L++G D G 
Sbjct: 981  GLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGL 1040

Query: 722  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKG 781
            +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K  
Sbjct: 1041 IAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPV 1100

Query: 782  LLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQS 841
              P    + LSK+  P T +E  +M ++PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  
Sbjct: 1101 STPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLE 1160

Query: 842  NPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLN 901
            NPG +HW AVK +L+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +
Sbjct: 1161 NPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFS 1220

Query: 902  GGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNS 961
            GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+ 
Sbjct: 1221 GGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGL---HQKEYVVYCDSQ 1280

Query: 962  GAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEG 998
             A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE 
Sbjct: 1281 SAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFEL 1325

BLAST of Cmc12g0314871 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 506.5 bits (1303), Expect = 6.8e-142
Identity = 336/1089 (30.85%), Postives = 541/1089 (49.68%), Query Frame = 0

Query: 30   LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNE 89
            LE++    +   NL+SV  L E   SI F  +   I KNG+ +  ++ + NN+       
Sbjct: 345  LEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNV------- 404

Query: 90   AKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN------LDRIGRLVKNGLLNK 149
               V+N + + + N ++K       NN  LWH R GHI+      + R        LLN 
Sbjct: 405  --PVINFQAY-SINAKHK-------NNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNN 464

Query: 150  LKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISF 209
            L ++S   CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F
Sbjct: 465  L-ELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIF 524

Query: 210  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIE 269
            +D ++ Y   YL+++KS+    F+++  + E   + K+  L  D G EY+    + + ++
Sbjct: 525  VDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVK 584

Query: 270  HGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPS 329
             GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA +++N +PS
Sbjct: 585  KGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPS 644

Query: 330  KSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRG 389
            +++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G
Sbjct: 645  RALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQGKFDDKSFKSIFVGY--EPNG 704

Query: 390  GLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDE---------STRVVDEVG 449
               +D    +  V+ +    E + + +   + + V  + + E         S +++    
Sbjct: 705  FKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEF 764

Query: 450  P--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL- 509
            P  S   D       S  S++   P  S +++                     + N+Y  
Sbjct: 765  PNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFL 824

Query: 510  ------------------------GLTETQVVIPDDGVEDP------------------- 569
                                      +ET   + + G+++P                   
Sbjct: 825  NESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTK 884

Query: 570  --LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMESMYFNSVWE 629
              +SY +                NDV           DK  W +A++ E+ +   N+ W 
Sbjct: 885  PQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWT 944

Query: 630  LVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSI 689
            +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S 
Sbjct: 945  ITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSF 1004

Query: 690  RILLSIATFYDYEIWQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLK 749
            R +LS+   Y+ ++ QMDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLK
Sbjct: 1005 RFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLK 1064

Query: 750  QASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLT 809
            QA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD+++   D+  + 
Sbjct: 1065 QAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMN 1124

Query: 810  DVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLL 869
            + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      
Sbjct: 1125 NFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVST 1184

Query: 870  PFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP 929
            P    ++     S       ++    P  S +G LMY MLCTRPD+  AV I+SRY S  
Sbjct: 1185 PLPSKINYELLNS-------DEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKN 1244

Query: 930  GLDHWTAVKIVLKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL 989
              + W  +K VL+YL+ T D  L++    A +  + GY DSD+   +  RKST+G +F +
Sbjct: 1245 NSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKM 1304

Query: 990  -NGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCD 998
             +   + W + +Q  +A S+ EAEY+A  EA +EA+WL+  L  + +   +  PI +Y D
Sbjct: 1305 FDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINI--KLENPIKIYED 1364

BLAST of Cmc12g0314871 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 1.6e-127
Identity = 334/1130 (29.56%), Postives = 523/1130 (46.28%), Query Frame = 0

Query: 5    VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEA 64
            +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F    +
Sbjct: 341  IADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEF-FPAS 400

Query: 65   FIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 124
            F  K+   GV +   K ++ LY      ++AV    MF  A+  +K    S       WH
Sbjct: 401  FQVKDLNTGVPLLQGKTKDELYEWPIASSQAV---SMF--ASPCSKATHSS-------WH 460

Query: 125  LRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIH 184
             RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE I+
Sbjct: 461  SRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIY 520

Query: 185  SDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIK 244
            SD+     + +   + Y++ F+D ++RY +LY ++ KS+  + F  +K+ VEN    +I 
Sbjct: 521  SDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIG 580

Query: 245  ILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQL 304
             L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R +++M  +++S+A +
Sbjct: 581  TLYSDNGGEFVVLR--DYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASV 640

Query: 305  PSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPK 364
            P ++W YA   AV+++N +P+  +  ++PF+   G+ P+    +++GC  +  +   N  
Sbjct: 641  PKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRH 700

Query: 365  KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE------------------ 424
            KLE +S+ C F+GY       L       R++ S +  F E                   
Sbjct: 701  KLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQR 760

Query: 425  ----DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDE 484
                 +  +H   P + LVL                  S +   +T+V     PSS +  
Sbjct: 761  SDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISS 820

Query: 485  TTTSGQSHPSQS--------------------LRMPRRSGRVVSQPNRYLGLTETQVVIP 544
             ++S  + PS +                    L  P  +    + PN+   L ++ +  P
Sbjct: 821  PSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSP 880

Query: 545  ----------------------------------------------------DDGVEDP- 604
                                                                 DG+  P 
Sbjct: 881  HIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPN 940

Query: 605  --LSY----------KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCK 664
               SY          + A+  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+
Sbjct: 941  QKYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCR 1000

Query: 665  WIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI 724
            WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I
Sbjct: 1001 WIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPI 1060

Query: 725  WQMDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAI 784
             Q+DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T +
Sbjct: 1061 RQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYL 1120

Query: 785  KSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLG 844
             + GF  ++ +  ++       + ++++YVDDIL+ GND   L      L+ +F +K+  
Sbjct: 1121 LTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHE 1180

Query: 845  EAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKT 904
            +  Y LGI+    R  + L LSQ  Y   LL R +M  +K    P      L+     K 
Sbjct: 1181 DLHYFLGIE--AKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKL 1240

Query: 905  PQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLR 964
            P   E      Y   VGSL Y +  TRPD+ YAV  +S+Y   P  DHW A+K VL+YL 
Sbjct: 1241 PDPTE------YRGIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLA 1300

Query: 965  RTRDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADS 998
             T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S
Sbjct: 1301 GTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRS 1360

BLAST of Cmc12g0314871 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 9.9e-125
Identity = 325/1128 (28.81%), Postives = 497/1128 (44.06%), Query Frame = 0

Query: 5    VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINF 64
            V  G  I     G   L   ++ + L N+  VP I +NL+SV  L          +  +F
Sbjct: 362  VADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASF 421

Query: 65   SMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYL 124
             + +      GV +   K ++ LY     E     +  +   A+  +K    S       
Sbjct: 422  QVKD---LNTGVPLLQGKTKDELY-----EWPIASSQPVSLFASPSSKATHSS------- 481

Query: 125  WHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLEL 184
            WH RLGH     +  ++ N  L+ L        C  CL  K  K PF+     +  PLE 
Sbjct: 482  WHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEY 541

Query: 185  IHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKK 244
            I+SD+     + +   + Y++ F+D ++RY +LY ++ KS+  E F  +K  +EN    +
Sbjct: 542  IYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTR 601

Query: 245  IKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA 304
            I    SD GGE++ L   +Y  +HGI    S P TP+ NG+SER++R +++   +++S+A
Sbjct: 602  IGTFYSDNGGEFVAL--WEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHA 661

Query: 305  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--N 364
             +P ++W YA   AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +   N
Sbjct: 662  SIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYN 721

Query: 365  PKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE-----------DHMRN 424
              KL+ +SR C F+GY       L    Q +R+++S +  F E              ++ 
Sbjct: 722  QHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQE 781

Query: 425  HKPRSKLVLSEATDESTR------------------------------------------ 484
             +  S  V S  T   TR                                          
Sbjct: 782  QRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSS 841

Query: 485  ---------VVDEVGPSSRVDETTTSGQSHPS-----------------QSLRMPRRSGR 544
                        + GP      T T  Q+H S                 QSL  P +S  
Sbjct: 842  SFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSS 901

Query: 545  VVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND------------------------- 604
                P      + T    P   +  P    Q +N+                         
Sbjct: 902  SSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPK 961

Query: 605  -------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWI 664
                               +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI
Sbjct: 962  YSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWI 1021

Query: 665  YKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQ 724
            + +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q
Sbjct: 1022 FTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQ 1081

Query: 725  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 784
            +DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + +
Sbjct: 1082 LDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLT 1141

Query: 785  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 844
             GF  +V +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  E 
Sbjct: 1142 IGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEEL 1201

Query: 845  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 904
             Y LGI+    R    L LSQ  YI  LL R +M  +K    P      LS     K   
Sbjct: 1202 HYFLGIE--AKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTD 1261

Query: 905  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 964
              E      Y   VGSL Y +  TRPDI YAV  +S++   P  +H  A+K +L+YL  T
Sbjct: 1262 PTE------YRGIVGSLQY-LAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGT 1321

Query: 965  RDY-MLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTM 998
             ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ 
Sbjct: 1322 PNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSST 1381

BLAST of Cmc12g0314871 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 156.0 bits (393), Expect = 2.3e-36
Identity = 106/314 (33.76%), Postives = 147/314 (46.82%), Query Frame = 0

Query: 591 MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 650
           MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 651 YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 710
            GF ++  E  +Y +       ++ +YVDD+L+          VK  L   + MKDLG+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 711 QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 770
              LG+  I    N  + LS   YI K      +   K    P  +   L +  SP    
Sbjct: 121 DKFLGLN-IHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSP---- 180

Query: 771 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 830
            ++D+   PY S VG L++     RPDI Y V ++SR+   P   H  + + VL+YL  T
Sbjct: 181 HLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 831 RDYMLVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADST 890
           R   L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   +
Sbjct: 241 RSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 300

Query: 891 MEAEYVAACEAAKE 903
            EAEY+ A E   E
Sbjct: 301 TEAEYITASETVME 307

BLAST of Cmc12g0314871 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 2027.3 bits (5251), Expect = 0.0e+00
Identity = 1003/1003 (100.00%), Postives = 1003/1003 (100.00%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293  NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS
Sbjct: 353  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI
Sbjct: 413  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
            LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Sbjct: 473  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP
Sbjct: 533  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 420
            RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST
Sbjct: 593  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 480
            RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED
Sbjct: 653  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
            PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541  VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
            VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
Sbjct: 773  VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 832

Query: 601  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 660
            NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP
Sbjct: 833  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

Query: 661  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR 720
            CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
Sbjct: 893  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR 952

Query: 721  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 780
            DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY
Sbjct: 953  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 1012

Query: 781  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK 840
            ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK
Sbjct: 1013 ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK 1072

Query: 841  DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 900
            DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Sbjct: 1073 DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 1132

Query: 901  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 960
            KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
Sbjct: 1133 KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 1192

Query: 961  RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1004
            RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1193 RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc12g0314871 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 2000.7 bits (5182), Expect = 0.0e+00
Identity = 989/1003 (98.60%), Postives = 996/1003 (99.30%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293  NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS
Sbjct: 353  LRLGHINLDRIGRLVKDGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI
Sbjct: 413  DLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
             RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP
Sbjct: 473  FRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP
Sbjct: 533  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 420
            RSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST
Sbjct: 593  RSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 480
            RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED
Sbjct: 653  RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
            PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713  PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541  VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
            VQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNG
Sbjct: 773  VQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSIAKFYDYEIWQMDVKTAFLNG 832

Query: 601  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 660
            NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP
Sbjct: 833  NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

Query: 661  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR 720
            CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIR
Sbjct: 893  CVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEGQYVLGIQIIR 952

Query: 721  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 780
            DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY
Sbjct: 953  DRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPY 1012

Query: 781  ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAK 840
            ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRTRDYMLVYGAK
Sbjct: 1013 ASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRTRDYMLVYGAK 1072

Query: 841  DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 900
            DLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA
Sbjct: 1073 DLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAA 1132

Query: 901  KEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 960
            KEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
Sbjct: 1133 KEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ 1192

Query: 961  RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1004
            RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1193 RGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc12g0314871 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 1785.4 bits (4623), Expect = 0.0e+00
Identity = 881/882 (99.89%), Postives = 882/882 (100.00%), Query Frame = 0

Query: 122  RLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD 181
            +LGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD
Sbjct: 228  KLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSD 287

Query: 182  LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL 241
            LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL
Sbjct: 288  LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKIL 347

Query: 242  RSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPS 301
            RSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPS
Sbjct: 348  RSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPS 407

Query: 302  SFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPR 361
            SFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPR
Sbjct: 408  SFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPR 467

Query: 362  SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR 421
            SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR
Sbjct: 468  SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTR 527

Query: 422  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP 481
            VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP
Sbjct: 528  VVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDP 587

Query: 482  LSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV 541
            LSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV
Sbjct: 588  LSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV 647

Query: 542  QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN 601
            QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN
Sbjct: 648  QTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGN 707

Query: 602  LEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC 661
            LEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC
Sbjct: 708  LEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPC 767

Query: 662  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD 721
            VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD
Sbjct: 768  VYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRD 827

Query: 722  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA 781
            RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA
Sbjct: 828  RKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYA 887

Query: 782  SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKD 841
            SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKD
Sbjct: 888  SAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKD 947

Query: 842  LILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAK 901
            LILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAK
Sbjct: 948  LILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAK 1007

Query: 902  EAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR 961
            EAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR
Sbjct: 1008 EAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQR 1067

Query: 962  GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1004
            GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1068 GDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1109

BLAST of Cmc12g0314871 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 1656.0 bits (4287), Expect = 0.0e+00
Identity = 810/1004 (80.68%), Postives = 897/1004 (89.34%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++
Sbjct: 336  MTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNV 395

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWH
Sbjct: 396  NKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWH 455

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HS
Sbjct: 456  LRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHS 515

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARGGFEYFI+F DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK 
Sbjct: 516  DLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT 575

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
             RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Sbjct: 576  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLP 635

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEP
Sbjct: 636  NSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEP 695

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EAT 420
            RS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T
Sbjct: 696  RSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETT 755

Query: 421  DESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDD 480
            + STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D 
Sbjct: 756  EPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDG 815

Query: 481  GVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRD 540
             +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR 
Sbjct: 816  DIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRG 875

Query: 541  SAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA 600
            + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Sbjct: 876  ADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTA 935

Query: 601  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQN 660
            FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ 
Sbjct: 936  FLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQI 995

Query: 661  VDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI 720
            VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGI
Sbjct: 996  VDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGI 1055

Query: 721  QIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMR 780
            QI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR
Sbjct: 1056 QIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMR 1115

Query: 781  RIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLV 840
             IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LV
Sbjct: 1116 HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLV 1175

Query: 841  YGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 900
            YG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Sbjct: 1176 YGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 1235

Query: 901  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR 960
            CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Sbjct: 1236 CEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIR 1295

Query: 961  EIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM 1001
            EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Sbjct: 1296 EIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDM 1339

BLAST of Cmc12g0314871 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 1656.0 bits (4287), Expect = 0.0e+00
Identity = 810/1004 (80.68%), Postives = 897/1004 (89.34%), Query Frame = 0

Query: 1    MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
            MT++VGTG V+SA AVG  +L     F+ LEN+Y+VP +KRNL+SV CL+E  YS+ F++
Sbjct: 336  MTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNV 395

Query: 61   NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
            N+ FIYKNGV ICSAKLENNLYVLR   +KA+LN EMF+TA TQNKR +ISP  N +LWH
Sbjct: 396  NKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWH 455

Query: 121  LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 180
            LRLGHINL+RI RLVKNGLL++L++ SLP CESCLEGKMTKRPFTGKG+RAKEPLEL+HS
Sbjct: 456  LRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHS 515

Query: 181  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 240
            DLCGPMNVKARGGFEYFI+F DDYSRYGY+YLM+HKSEALEKFKEYK EVEN LSK IK 
Sbjct: 516  DLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT 575

Query: 241  LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 300
             RSDRGGEYMDL+FQ+Y++E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LP
Sbjct: 576  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLP 635

Query: 301  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 360
            +SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SL HFRIWGCPAHVL  NPKKLEP
Sbjct: 636  NSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEP 695

Query: 361  RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLS----EAT 420
            RS+LC FVGYPK TRGG F+DP++N+VFVSTNATFLEEDH+R HKPRSK+VL+    E T
Sbjct: 696  RSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETT 755

Query: 421  DESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDD 480
            + STRVV+E    +RV    +S ++H  QSLR PRRSGRV + P RY+ LTET  VI D 
Sbjct: 756  EPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDG 815

Query: 481  GVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRD 540
             +EDPL++K+AM DVDKD+W+KAM+LE+ESMYFNSVW+LVD P+GVKPIGCKWIYKRKR 
Sbjct: 816  DIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRG 875

Query: 541  SAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTA 600
            + GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTA
Sbjct: 876  ADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTA 935

Query: 601  FLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQN 660
            FLNGNLEE+I+M QPEGFI  GQEQK+CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQ 
Sbjct: 936  FLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQI 995

Query: 661  VDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGI 720
            VDEPCVYK+I    VAFLVLYVDDILLIGND+G LTD+K WLA QFQMKDLGEAQ+VLGI
Sbjct: 996  VDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGI 1055

Query: 721  QIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMR 780
            QI RDRKNK LALSQA+YIDK++V+YSMQNSK+GLLPFRHGV LSKEQ PKTPQ+VE+MR
Sbjct: 1056 QIFRDRKNKMLALSQASYIDKIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMR 1115

Query: 781  RIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLV 840
             IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGL HWTAVK +LKYLRRTRDY LV
Sbjct: 1116 HIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYTLV 1175

Query: 841  YGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 900
            YG+KDLILTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA
Sbjct: 1176 YGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAA 1235

Query: 901  CEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR 960
            CEAAKEAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIR
Sbjct: 1236 CEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIR 1295

Query: 961  EIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDM 1001
            EIV RGDVIVT+IAS HN+ADPFTK LTAKVFEGHLESLGLRDM
Sbjct: 1296 EIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDM 1339

BLAST of Cmc12g0314871 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 307.4 bits (786), Expect = 4.3e-83
Identity = 175/484 (36.16%), Postives = 275/484 (56.82%), Query Frame = 0

Query: 479 EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSA 538
           ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S 
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 539 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL 598
           G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D+  AFL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 599 NGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFD 658
           NG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 659 QNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVL 718
           Q+  +   + KI       +++YVDDI++  N+   + ++K+ L + F+++DLG  +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 719 GIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED 778
           G++I R      + + Q  Y   LL    +   K   +P    V  S         +  D
Sbjct: 324 GLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSG----GDFVD 383

Query: 779 MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYM 838
            +   Y   +G LMY  + TR DI +AV  +S++   P L H  AV  +L Y++ T    
Sbjct: 384 AK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQG 443

Query: 839 LVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEY 898
           L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    + W+S KQ  ++ S+ EAEY
Sbjct: 444 LFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEY 503

Query: 899 VAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYH 958
            A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H
Sbjct: 504 RALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCH 553

BLAST of Cmc12g0314871 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 103.2 bits (256), Expect = 1.2e-21
Identity = 79/234 (33.76%), Postives = 115/234 (49.15%), Query Frame = 0

Query: 673 FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQA 732
           +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQI--KTHPSGLFLSQT 61

Query: 733 TYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAML 792
            Y +++L    M + K    P    + L+   S     +  D R     S VG+L Y  L
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPL--PLKLNSSVSTAKYPDPSDFR-----SIVGALQYLTL 121

Query: 793 CTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSD 852
            TRPDI YAV IV +    P L  +  +K VL+Y++ T  + + ++    L +  + DSD
Sbjct: 122 -TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSD 181

Query: 853 FQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW 906
           +     +R+ST+G    L    + W + +Q  ++ S+ E EY A    A E  W
Sbjct: 182 WAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of Cmc12g0314871 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 82.4 bits (202), Expect = 2.2e-15
Identity = 48/133 (36.09%), Postives = 72/133 (54.14%), Query Frame = 0

Query: 449 MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMY 508
           M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++ 
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTI-----KKEPKSVIFALKD---PGWCQAMQEELDALS 60

Query: 509 FNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPV 568
            N  W LV  P     +GCKW++K K  S G +   KARLVAKG+ Q EG+ + ET+SPV
Sbjct: 61  RNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPV 120

Query: 569 AMLKSIRILLSIA 582
               +IR +L++A
Sbjct: 121 VRTATIRTILNVA 125

BLAST of Cmc12g0314871 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 63.9 bits (154), Expect = 8.3e-10
Identity = 32/82 (39.02%), Postives = 51/82 (62.20%), Query Frame = 0

Query: 283 NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHF 342
           NRT+++ VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+ S+ 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 343 RIWGCPAHVLVTNPKKLEPRSR 364
           R +GC A++   +  KL+PR++
Sbjct: 62  RRFGCVAYI-HCDEGKLKPRAK 82

BLAST of Cmc12g0314871 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 60.1 bits (144), Expect = 1.2e-08
Identity = 30/75 (40.00%), Postives = 42/75 (56.00%), Query Frame = 0

Query: 114 NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKE 173
           + T LWH RL H++   +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K 
Sbjct: 67  DETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKN 126

Query: 174 PLELIHSDLCGPMNV 189
           PL+ +HSDL G  +V
Sbjct: 127 PLDYVHSDLWGAPSV 141

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0025945.10.0e+00100.00gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0035907.10.0e+0098.60gag/pol protein [Cucumis melo var. makuwa][more]
KAA0059226.10.0e+0099.89gag/pol protein [Cucumis melo var. makuwa][more]
KAA0048404.10.0e+0080.68gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035879.10.0e+0080.68gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumi... [more]
Match NameE-valueIdentityDescription
P109786.7e-20638.95Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041466.8e-14230.85Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT941.6e-12729.56Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW29.9e-12528.81Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P256002.3e-3633.76Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A5A7TZD00.0e+00100.00Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7T2V90.0e+0098.60Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760... [more]
A0A5A7UYE80.0e+0099.89Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015... [more]
A0A5A7SMH80.0e+0080.68Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
A0A5D3CPJ60.0e+0080.68Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... [more]
Match NameE-valueIdentityDescription
AT4G23160.14.3e-8336.16cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.11.2e-2133.76DNA/RNA polymerases superfamily protein [more]
ATMG00820.12.2e-1536.09Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.18.3e-1039.02Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
ATMG00300.11.2e-0840.00Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 216..236
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 413..454
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 430..452
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 114..837
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 845..986
e-value: 1.37918E-65
score: 215.025
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 172..273
e-value: 6.0E-12
score: 45.8
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 170..335
score: 24.711876
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 167..343
e-value: 8.1E-42
score: 144.7
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 106..159
e-value: 1.3E-12
score: 47.3
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 510..753
e-value: 1.2E-74
score: 251.0
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 510..953
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 169..333

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc12g0314871.1Cmc12g0314871.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding