Cmc04g0106681 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc04g0106681
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Location: CMiso1.1chr04: 24797099 .. 24799211 (-)
RNA-Seq Expression: Cmc04g0106681
Synteny: Cmc04g0106681
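
The Location field above can be read programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (as in GFF3) and that the strand is the sign in parentheses; the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return seqid, start, end, strand, end - start + 1

print(parse_location("CMiso1.1chr04: 24797099 .. 24799211 (-)"))
```

Under that assumption the feature spans 2113 bp on the minus strand of CMiso1.1chr04.
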
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGCGATGCTAAGTTGTTTTTCAGAAATAAATACATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCGGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCAGATCGGGAGATTGATAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCATTACCTTCATGTGAATCTTGTCACGAAGGAAAAATGACAAAGAGACCTTTTACTGAAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTTTGTGGTCCCATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCAAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGGACCTTGTTAGACAAGGTTCGTTCAATGATGAGTTACACTCAATTGCCTAGTTCGTTTTGGGGCGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGAGGTTGTCCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGGACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTATTCGATCCACAAGAAAATAGAGTGCTTGTATCGACAAATGCTACTTTCTTAGAAGAAGACCACACGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAATTTGGTCCCTCATCAAGAGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGGGTTTAACTGAAACTCAGATTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAATCAGGTAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATTGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACTTTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATATCTCAGCCCAAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCCTGGAACATTAGGTTTGTTACTGCGATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACATAGGAAAAG
TAGCTTTCTTAGTACTTTATGTGGACGATATCCTCTTCATTGGGAATGATGTGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCTCAATTCCAAATGAAAGATTTATGA

mRNA sequence

ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGCGATGCTAAGTTGTTTTTCAGAAATAAATACATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCGGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCAGATCGGGAGATTGATAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCATTACCTTCATGTGAATCTTGTCACGAAGGAAAAATGACAAAGAGACCTTTTACTGAAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTTTGTGGTCCCATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCAAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGGACCTTGTTAGACAAGAAGTTGGGACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTATTCGATCCACAAGAAAATAGAGTGCTTGTATCGACAAATGCTACTTTCTTAGAAGAAGACCACACGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAATTTGGTCCCTCATCAAGAGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGGGTTTAACTGAAACTCAGATTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAATCAGGTAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATTGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACTTTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATATCTCAGCCCAAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCCTGGAACATTAGGTTTGTTACTGCGATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACATAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCTTCATTGGGAATGATGTGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCTCAATTCCAAATGAAAGATTTATGA

Coding sequence (CDS)

ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGCGATGCTAAGTTGTTTTTCAGAAATAAATACATGTTTTTGGAAAACTTGTACATAGTTCCTAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTTCGGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCAATCAGATCGGGAGATTGATAAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCATTACCTTCATGTGAATCTTGTCACGAAGGAAAAATGACAAAGAGACCTTTTACTGAAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTTTGTGGTCCCATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCAAGGTGGAGAGTACATGGATTTGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATAGGACCTTGTTAGACAAGAAGTTGGGACCTCGTTCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTATTCGATCCACAAGAAAATAGAGTGCTTGTATCGACAAATGCTACTTTCTTAGAAGAAGACCACACGAGAGATCATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAATTTGGTCCCTCATCAAGAGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGGGTTTAACTGAAACTCAGATTGTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAATCAGGTAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATTGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTCAAAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCCACTTTTTATGATTATGAAATATGGCAAATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATATCTCAGCCCAAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGTAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCTAGATCCTGGAACATTAGGTTTGTTACTGCGATCAAATCCTACGGTTTTGACCAAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACATAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCTTCATTGGGAATGATGTGGGATACCTTACTGACGTTAAAGCTTGGCTAGCAGCTCAATTCCAAATGAAAGATTTATGA

Protein sequence

MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDKKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVKAWLAAQFQMKDL
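
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame 0; the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGACACTCAAGGTTGGAACGGGAGATGTC"))
```

Passing the full CDS string to `translate` should reproduce the listed protein if the assumptions hold.
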
Homology
BLAST of Cmc04g0106681 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1202.2 bits (3109), Expect = 0.0e+00
Identity = 610/707 (86.28%), Positives = 623/707 (88.12%), Query Frame = 0

Query: 1   MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
           MTLKVGTGDVISARAVGDAKLFF NK+MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233 MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61  NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
           NEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293 NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121 LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 180
           LRLGHINL++IGRL+KNGLLNKL+D SLP CESC EGKMTKRPFT KGYRAKEPLELIHS
Sbjct: 353 LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 240
           DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI
Sbjct: 413 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241 LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 300
           LRSD+GGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD            
Sbjct: 473 LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301 -------------------------------------------------------KKLGP 360
                                                                  KKL P
Sbjct: 533 SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361 RSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKLVLNEATDEST 420
           RSRLCQFVGYPKETRGGL FDPQENRV VSTNATFLEEDH R+HKPRSKLVL+EATDEST
Sbjct: 593 RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421 RVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQIVIPDDGVED 480
           RVVDE GPSSRVDETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQ+VIPDDGVED
Sbjct: 653 RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481 PLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
           PLSY Q MNDVDKDQWVKA+DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713 PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541 VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
           VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
Sbjct: 773 VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 832

Query: 601 NLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAIKSYGFDQNVDEP 641
           NLEESIF+SQP+GFITQGQEQKVCKLNRSIYGLKQASRSWNIRF TAIKSYGFDQNVDEP
Sbjct: 833 NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892
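
The percentages in an HSP summary line are simply the identical (or positive-scoring) positions divided by the alignment length. A minimal Python sketch that recomputes them from such a line follows; the function name `hsp_percentages` is hypothetical, and the label pattern is deliberately loose so that spelling variants of the positives label still match.

```python
import re

# Matches 'Identity = a/b ... Pos... = c/d' in an HSP summary line.
HSP_STATS = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

print(hsp_percentages("Identity = 610/707 (86.28%), Positives = 623/707 (88.12%)"))
```

The recomputed values agree with the percentages printed for this hit, and the denominator (707) is the alignment length including gap columns, not the query length.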

BLAST of Cmc04g0106681 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 601/707 (85.01%), Positives = 618/707 (87.41%), Query Frame = 0

Query: 1   MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
           MTLKVGTGDVISARAVGDAKLFF NK+MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233 MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61  NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
           NEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293 NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121 LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 180
           LRLGHINL++IGRL+K+GLLNKL+D SLP CESC EGKMTKRPFT KGYRAKEPLELIHS
Sbjct: 353 LRLGHINLDRIGRLVKDGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 240
           DLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI
Sbjct: 413 DLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241 LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 300
            RSD+GGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD            
Sbjct: 473 FRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301 -------------------------------------------------------KKLGP 360
                                                                  KKL P
Sbjct: 533 SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361 RSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKLVLNEATDEST 420
           RSRLCQFVGYPKETRGGL FDP+ENRV VSTNATFLEEDH R+HKPRSKLVL+EATDEST
Sbjct: 593 RSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421 RVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQIVIPDDGVED 480
           RVVDE GPSSRVDETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQ+VIPDDGVED
Sbjct: 653 RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481 PLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
           PLSY Q MNDVDKDQWVKA+DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713 PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541 VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
           VQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNG
Sbjct: 773 VQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSIAKFYDYEIWQMDVKTAFLNG 832

Query: 601 NLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAIKSYGFDQNVDEP 641
           NLEESIF+SQP+GFITQGQEQKVCKLNRSIYGLKQASRSWNIRF TAIKSYGFDQNVDEP
Sbjct: 833 NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

BLAST of Cmc04g0106681 vs. NCBI nr
Match: KAA0065400.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1033.1 bits (2670), Expect = 1.0e-297
Identity = 518/552 (93.84%), Positives = 519/552 (94.02%), Query Frame = 0

Query: 122 RLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 181
           +LGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD
Sbjct: 189 KLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 248

Query: 182 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 241
           LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
Sbjct: 249 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 308

Query: 242 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------- 301
           RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD             
Sbjct: 309 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDKVRSMMSYTQLPS 368

Query: 302 --------------------KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 361
                               KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF
Sbjct: 369 SFWGDCSSYLEQSHVLVTNPKKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 428

Query: 362 LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR 421
           LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR
Sbjct: 429 LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR 488

Query: 422 IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL 481
           IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL
Sbjct: 489 IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL 548

Query: 482 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 541
           VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR
Sbjct: 549 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 608

Query: 542 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 601
           ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ
Sbjct: 609 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 668

Query: 602 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 641
           ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK
Sbjct: 669 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 728

BLAST of Cmc04g0106681 vs. NCBI nr
Match: TYK28885.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1025.8 bits (2651), Expect = 1.6e-295
Identity = 513/552 (92.93%), Positives = 516/552 (93.48%), Query Frame = 0

Query: 122 RLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 181
           +LGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD
Sbjct: 200 KLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 259

Query: 182 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 241
           LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
Sbjct: 260 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 319

Query: 242 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------- 301
           RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD             
Sbjct: 320 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDKVRSMMSYTQLPS 379

Query: 302 --------------------KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 361
                               KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF
Sbjct: 380 SFWGDCSSYLEQSHVLVTNPKKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 439

Query: 362 LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR 421
           LEEDHTRDHKPRSKLVLNEATDESTRVVDE GPSSRVDETTTSGQSHPSQSLRMPRRSGR
Sbjct: 440 LEEDHTRDHKPRSKLVLNEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGR 499

Query: 422 IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL 481
           +VSQPNRYLGLTETQ+VIPDDGVEDPLSY Q MNDVDKDQWVKAIDLEMESMYFNSVWEL
Sbjct: 500 VVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAIDLEMESMYFNSVWEL 559

Query: 482 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 541
           VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR
Sbjct: 560 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 619

Query: 542 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 601
           ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ
Sbjct: 620 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 679

Query: 602 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 641
           ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK
Sbjct: 680 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 739

BLAST of Cmc04g0106681 vs. NCBI nr
Match: KAA0067938.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1003.0 bits (2592), Expect = 1.1e-288
Identity = 524/659 (79.51%), Positives = 540/659 (81.94%), Query Frame = 0

Query: 1   MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
           MTL VGTGDVISARAVGD KLFF  K+MFLENLYIVPKIKRNLV VSCLIEHMYSINFSM
Sbjct: 208 MTLMVGTGDVISARAVGDVKLFFGIKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSM 267

Query: 61  NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
           NEAFISKNG     AKLE+NLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 268 NEAFISKNG-----AKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 327

Query: 121 LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 180
           LRL HINL++IGRL+KNGLLNKL+DDSLP CESC EGKMTKRPFT K YRAKEPLELIHS
Sbjct: 328 LRLDHINLDRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKDYRAKEPLELIHS 387

Query: 181 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 240
           DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKI
Sbjct: 388 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKYEALEKFKEYKTEVENLLSKKIKI 447

Query: 241 LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 300
           LRSD+GGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD            
Sbjct: 448 LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 507

Query: 301 -------KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRS 360
                  KKL PRSRLCQFVGYPKE RGGL FDPQENRV VSTN TFLEED  RDHKPRS
Sbjct: 508 SSFWGTPKKLEPRSRLCQFVGYPKERRGGLFFDPQENRVFVSTNTTFLEEDRMRDHKPRS 567

Query: 361 KLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTE 420
           KLVL EATDESTRVVDE  PSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQP RYLGLTE
Sbjct: 568 KLVLCEATDESTRVVDEVDPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPKRYLGLTE 627

Query: 421 TQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCK 480
           TQ+VIPDDGVEDPLSY Q MNDVDK+QWVKA+DLE+ESMYFNSVWEL DL EGVKPIGCK
Sbjct: 628 TQVVIPDDGVEDPLSYKQTMNDVDKNQWVKAMDLEIESMYFNSVWELADLSEGVKPIGCK 687

Query: 481 WIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI 540
           WIYKRKRDS G                                KSIRILLSIATFYDYEI
Sbjct: 688 WIYKRKRDSVG--------------------------------KSIRILLSIATFYDYEI 747

Query: 541 WQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAI 600
           WQ+DVKTAFLNGNLEESIF+SQP+G                              F T I
Sbjct: 748 WQIDVKTAFLNGNLEESIFMSQPEG------------------------------FDTTI 799

Query: 601 KSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVKAWLAAQFQMKDL 641
           KSYGFDQNVDEPCVYKKIN  KVAFLVLYVDDIL IGN+VGYLTDVKAWLAAQFQMK+L
Sbjct: 808 KSYGFDQNVDEPCVYKKINKVKVAFLVLYVDDILLIGNNVGYLTDVKAWLAAQFQMKNL 799

BLAST of Cmc04g0106681 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 1.3e-98
Identity = 232/737 (31.48%), Positives = 368/737 (49.93%), Query Frame = 0

Query: 2    TLKVGTGDVISARAVGDAKLFFR-NKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 61
            T+K+G         +GD  +       + L+++  VP ++ NL+S   L    Y   F+ 
Sbjct: 321  TVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFAN 380

Query: 62   NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 121
             +  ++K  + I        LY       +  LN            +  IS +    LWH
Sbjct: 381  QKWRLTKGSLVIAKGVARGTLYRTNAEICQGELN----------AAQDEISVD----LWH 440

Query: 122  LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 181
             R+GH++   +  L K  L++  +  ++  C+ C  GK  +  F     R    L+L++S
Sbjct: 441  KRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYS 500

Query: 182  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 241
            D+CGPM +++ GG +YF++FIDD SR  ++Y+++ K +  + F+++ A VE    +K+K 
Sbjct: 501  DVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKR 560

Query: 242  LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDK----------- 301
            LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER NRT+++K           
Sbjct: 561  LRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLP 620

Query: 302  -----------------------------------------------------------K 361
                                                                       K
Sbjct: 621  KSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTK 680

Query: 362  LGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEED----HTRDHKPRSKLVLN 421
            L  +S  C F+GY  E  G  L+DP + +V+ S +  F E +         K ++ ++ N
Sbjct: 681  LDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPN 740

Query: 422  EATDESTRVVDEFGPSSRVDETTTSGQ-------------------SHPSQSLRMP---R 481
              T  ST   +     S  DE +  G+                    HP+Q        R
Sbjct: 741  FVTIPSTS-NNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLR 800

Query: 482  RSGRIVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNS 541
            RS R   +  RY   +   ++I DD   +P S  +V++  +K+Q +KA+  EMES+  N 
Sbjct: 801  RSERPRVESRRY--PSTEYVLISDD--REPESLKEVLSHPEKNQLMKAMQEEMESLQKNG 860

Query: 542  VWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAML 601
             ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  +
Sbjct: 861  TYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKM 920

Query: 602  KSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIY 641
             SIR +LS+A   D E+ Q+DVKTAFL+G+LEE I++ QP+GF   G++  VCKLN+S+Y
Sbjct: 921  TSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLY 980

BLAST of Cmc04g0106681 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 252.7 bits (644), Expect = 1.1e-65
Identity = 217/820 (26.46%), Positives = 345/820 (42.07%), Query Frame = 0

Query: 8    GDVISARAVGDAKLFFRNKY-MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIS 67
            G+ I A   G  +L  RN + + LE++    +   NL+SV  L E   SI F  +   IS
Sbjct: 324  GEFIYATKRGIVRL--RNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTIS 383

Query: 68   KNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGH 127
            KNG+ +  ++ + NN+          V+N + + + N ++K       NN  LWH R GH
Sbjct: 384  KNGLMVVKNSGMLNNV---------PVINFQAY-SINAKHK-------NNFRLWHERFGH 443

Query: 128  INLNQIGRLIKNGLLNKLEDDSLPS--------CESCHEGKMTKRPFTEKGYRA--KEPL 187
            I+    G+L++    N   D SL +        CE C  GK  + PF +   +   K PL
Sbjct: 444  IS---DGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPL 503

Query: 188  ELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLS 247
             ++HSD+CGP+         YF+ F+D ++ Y   YL+++KS+    F+++ A+ E   +
Sbjct: 504  FVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFN 563

Query: 248  KKIKILRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDK------ 307
             K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+ +K      
Sbjct: 564  LKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVS 623

Query: 308  ------------------------------------------------------------ 367
                                                                        
Sbjct: 624  GAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHI 683

Query: 368  -----KLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKLV 427
                 K   +S    FVGY  E  G  L+D    + +V+ +    E +       + + V
Sbjct: 684  KNKQGKFDDKSFKSIFVGY--EPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETV 743

Query: 428  LNEATDESTR----------VVDEF-GPSSRVDETTTSGQSHPSQSLRMPRRSGRIV--- 487
              + + ES            +  EF   S   D       S  S++   P  S +I+   
Sbjct: 744  FLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTE 803

Query: 488  -----------------SQPNRYL-------------------------GLTETQIVIPD 547
                              + N+Y                            +ET   + +
Sbjct: 804  FPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKE 863

Query: 548  DGVEDP---------------------LSYNQ--------------VMNDV--------- 607
             G+++P                     +SYN+              + NDV         
Sbjct: 864  IGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQY 923

Query: 608  --DKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLV 641
              DK  W +AI+ E+ +   N+ W +   PE    +  +W++  K +  G    +KARLV
Sbjct: 924  RDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLV 983

BLAST of Cmc04g0106681 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 240.0 bits (611), Expect = 7.6e-62
Identity = 206/836 (24.64%), Positives = 322/836 (38.52%), Query Frame = 0

Query: 5    VGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIE------HMYSINF 64
            V  G  I     G   L  +++ + L N+  VP I +NL+SV  L          +  +F
Sbjct: 362  VADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASF 421

Query: 65   SMNEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYL 124
             + +      GV +   K ++ LY     E     +  +   A+  +K    S       
Sbjct: 422  QVKDL---NTGVPLLQGKTKDELY-----EWPIASSQPVSLFASPSSKATHSS------- 481

Query: 125  WHLRLGHINLNQIGRLIKNGLLNKLE-DDSLPSCESCHEGKMTKRPFTEKGYRAKEPLEL 184
            WH RLGH   + +  +I N  L+ L       SC  C   K  K PF++    +  PLE 
Sbjct: 482  WHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEY 541

Query: 185  IHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKK 244
            I+SD+     + +   + Y++ F+D ++RY +LY ++ KS+  E F  +K  +EN    +
Sbjct: 542  IYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTR 601

Query: 245  IKILRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD--------- 304
            I    SD GGE++ L   +Y  +HGI    S P TP+ NG+SER++R +++         
Sbjct: 602  IGTFYSDNGGEFVAL--WEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHA 661

Query: 305  ------------------------------------------------------------ 364
                                                                        
Sbjct: 662  SIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYN 721

Query: 365  -KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEE-----------DHTRD 424
              KL  +SR C F+GY       L    Q +R+ +S +  F E               ++
Sbjct: 722  QHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQE 781

Query: 425  HKPRSKLVLNEATDESTR------------------------------------------ 484
             +  S  V +  T   TR                                          
Sbjct: 782  QRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSS 841

Query: 485  ---------VVDEFGPSSRVDETTTSGQSHPS-----------------QSLRMPRRSGR 544
                        + GP      T T  Q+H S                 QSL  P +S  
Sbjct: 842  SFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSS 901

Query: 545  IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMND------------------------- 604
                P      + T    P   +  P    Q++N+                         
Sbjct: 902  SSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPK 961

Query: 605  -------------------VDKDQWVKAIDLEMESMYFNSVWELVDLPEG-VKPIGCKWI 640
                               +  ++W  A+  E+ +   N  W+LV  P   V  +GC+WI
Sbjct: 962  YSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWI 1021

BLAST of Cmc04g0106681 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 6.4e-61
Identity = 218/840 (25.95%), Positives = 337/840 (40.12%), Query Frame = 0

Query: 5    VGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEA 64
            +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F     
Sbjct: 341  IADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASF 400

Query: 65   FIS--KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHL 124
             +     GV +   K ++ LY      ++AV    MF  A+  +K    S       WH 
Sbjct: 401  QVKDLNTGVPLLQGKTKDELYEWPIASSQAV---SMF--ASPCSKATHSS-------WHS 460

Query: 125  RLGHINLNQIGRLIKNGLLNKLE-DDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 184
            RLGH +L  +  +I N  L  L     L SC  C   K  K PF+     + +PLE I+S
Sbjct: 461  RLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYS 520

Query: 185  DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 244
            D+     + +   + Y++ F+D ++RY +LY ++ KS+  + F  +K+ VEN    +I  
Sbjct: 521  DVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGT 580

Query: 245  LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 304
            L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R +++            
Sbjct: 581  LYSDNGGEFVVLR--DYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVP 640

Query: 305  ----------------------------------------------------------KK 364
                                                                       K
Sbjct: 641  KTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHK 700

Query: 365  LGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLE---------------EDHTR 424
            L  +S+ C F+GY       L       R+  S +  F E               ++   
Sbjct: 701  LEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRS 760

Query: 425  DHKPR---------SKLV------LNEATDES------------TRVVDEFGPSSRVDET 484
            D  P          + LV      L    D S            T+V     PSS +   
Sbjct: 761  DSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSP 820

Query: 485  TTSGQSHPSQS--------------------LRMPRRSGRIVSQPNRYLGLTETQIVIP- 544
            ++S  + PS +                    L  P  +    + PN+   L ++ I  P 
Sbjct: 821  SSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPH 880

Query: 545  ---------------------------------------------------DDGVEDP-- 604
                                                                DG+  P  
Sbjct: 881  IPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQ 940

Query: 605  -------LSYN-------QVMNDVDKDQWVKAIDLEMESMYFNSVWELV-DLPEGVKPIG 640
                   L+ N       Q M D   D+W +A+  E+ +   N  W+LV   P  V  +G
Sbjct: 941  KYSYATSLAANSEPRTAIQAMKD---DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVG 1000

BLAST of Cmc04g0106681 vs. ExPASy Swiss-Prot
Match: Q03494 (Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-DR2 PE=3 SV=2)

HSP 1 Score: 89.4 bits (220), Expect = 1.7e-16
Identity = 77/293 (26.28%), Positives = 130/293 (44.37%), Query Frame = 0

Query: 11  ISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGV 70
           I   A+G+    F+N           P I  +L+S+S L     +  F+ N      +G 
Sbjct: 490 IPINAIGNLHFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN-TLERSDGT 549

Query: 71  HICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTY-LWHLRLGHINLN 130
            +       + Y L  ++   + +H    T N  NK +  S N   Y L H  LGH N  
Sbjct: 550 VLAPIVKHGDFYWL--SKKYLIPSHISKLTINNVNKSK--SVNKYPYPLIHRMLGHANFR 609

Query: 131 QIGRLIKNGLLNKLEDDSLP-------SCESCHEGKMTKRPFTEKGYRAK-----EPLEL 190
            I + +K   +  L++  +         C  C  GK TK     KG R K     EP + 
Sbjct: 610 SIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHV-KGSRLKYQESYEPFQY 669

Query: 191 IHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLS 250
           +H+D+ GP++   +    YFISF D+ +R+ ++Y +  + E   L  F    A ++N  +
Sbjct: 670 LHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFN 729

Query: 251 KKIKILRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD 289
            ++ +++ D+G EY +     +    GI +  +     + +GV+ER NRTLL+
Sbjct: 730 ARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLN 776

BLAST of Cmc04g0106681 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 1202.2 bits (3109), Expect = 0.0e+00
Identity = 610/707 (86.28%), Positives = 623/707 (88.12%), Query Frame = 0

Query: 1   MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
           MTLKVGTGDVISARAVGDAKLFF NK+MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233 MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61  NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
           NEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293 NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121 LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 180
           LRLGHINL++IGRL+KNGLLNKL+D SLP CESC EGKMTKRPFT KGYRAKEPLELIHS
Sbjct: 353 LRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 240
           DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI
Sbjct: 413 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241 LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 300
           LRSD+GGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD            
Sbjct: 473 LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301 -------------------------------------------------------KKLGP 360
                                                                  KKL P
Sbjct: 533 SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361 RSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKLVLNEATDEST 420
           RSRLCQFVGYPKETRGGL FDPQENRV VSTNATFLEEDH R+HKPRSKLVL+EATDEST
Sbjct: 593 RSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421 RVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQIVIPDDGVED 480
           RVVDE GPSSRVDETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQ+VIPDDGVED
Sbjct: 653 RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481 PLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
           PLSY Q MNDVDKDQWVKA+DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713 PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541 VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
           VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
Sbjct: 773 VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 832

Query: 601 NLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAIKSYGFDQNVDEP 641
           NLEESIF+SQP+GFITQGQEQKVCKLNRSIYGLKQASRSWNIRF TAIKSYGFDQNVDEP
Sbjct: 833 NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

BLAST of Cmc04g0106681 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 1183.7 bits (3061), Expect = 0.0e+00
Identity = 601/707 (85.01%), Positives = 618/707 (87.41%), Query Frame = 0

Query: 1   MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
           MTLKVGTGDVISARAVGDAKLFF NK+MFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM
Sbjct: 233 MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 292

Query: 61  NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
           NEAFI KNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 293 NEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 352

Query: 121 LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 180
           LRLGHINL++IGRL+K+GLLNKL+D SLP CESC EGKMTKRPFT KGYRAKEPLELIHS
Sbjct: 353 LRLGHINLDRIGRLVKDGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHS 412

Query: 181 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 240
           DLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI
Sbjct: 413 DLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI 472

Query: 241 LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 300
            RSD+GGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD            
Sbjct: 473 FRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 532

Query: 301 -------------------------------------------------------KKLGP 360
                                                                  KKL P
Sbjct: 533 SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP 592

Query: 361 RSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRSKLVLNEATDEST 420
           RSRLCQFVGYPKETRGGL FDP+ENRV VSTNATFLEEDH R+HKPRSKLVL+EATDEST
Sbjct: 593 RSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDEST 652

Query: 421 RVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQIVIPDDGVED 480
           RVVDE GPSSRVDETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQ+VIPDDGVED
Sbjct: 653 RVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVED 712

Query: 481 PLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 540
           PLSY Q MNDVDKDQWVKA+DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
Sbjct: 713 PLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK 772

Query: 541 VQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG 600
           VQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNG
Sbjct: 773 VQTFKARLVAKGYTRKEGVDYEETFSSVAMLKSIRILLSIAKFYDYEIWQMDVKTAFLNG 832

Query: 601 NLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAIKSYGFDQNVDEP 641
           NLEESIF+SQP+GFITQGQEQKVCKLNRSIYGLKQASRSWNIRF TAIKSYGFDQNVDEP
Sbjct: 833 NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEP 892

BLAST of Cmc04g0106681 vs. ExPASy TrEMBL
Match: A0A5A7VDX8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold17G00700 PE=4 SV=1)

HSP 1 Score: 1033.1 bits (2670), Expect = 4.9e-298
Identity = 518/552 (93.84%), Positives = 519/552 (94.02%), Query Frame = 0

Query: 122 RLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 181
           +LGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD
Sbjct: 189 KLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 248

Query: 182 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 241
           LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
Sbjct: 249 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 308

Query: 242 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------- 301
           RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD             
Sbjct: 309 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDKVRSMMSYTQLPS 368

Query: 302 --------------------KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 361
                               KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF
Sbjct: 369 SFWGDCSSYLEQSHVLVTNPKKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 428

Query: 362 LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR 421
           LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR
Sbjct: 429 LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR 488

Query: 422 IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL 481
           IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL
Sbjct: 489 IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL 548

Query: 482 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 541
           VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR
Sbjct: 549 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 608

Query: 542 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 601
           ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ
Sbjct: 609 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 668

Query: 602 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 641
           ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK
Sbjct: 669 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 728

BLAST of Cmc04g0106681 vs. ExPASy TrEMBL
Match: A0A5D3DZX8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold381G00320 PE=4 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 7.8e-296
Identity = 513/552 (92.93%), Positives = 516/552 (93.48%), Query Frame = 0

Query: 122 RLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 181
           +LGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD
Sbjct: 200 KLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHSD 259

Query: 182 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 241
           LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL
Sbjct: 260 LCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKIL 319

Query: 242 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------- 301
           RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD             
Sbjct: 320 RSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDKVRSMMSYTQLPS 379

Query: 302 --------------------KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 361
                               KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF
Sbjct: 380 SFWGDCSSYLEQSHVLVTNPKKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATF 439

Query: 362 LEEDHTRDHKPRSKLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGR 421
           LEEDHTRDHKPRSKLVLNEATDESTRVVDE GPSSRVDETTTSGQSHPSQSLRMPRRSGR
Sbjct: 440 LEEDHTRDHKPRSKLVLNEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGR 499

Query: 422 IVSQPNRYLGLTETQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWEL 481
           +VSQPNRYLGLTETQ+VIPDDGVEDPLSY Q MNDVDKDQWVKAIDLEMESMYFNSVWEL
Sbjct: 500 VVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAIDLEMESMYFNSVWEL 559

Query: 482 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 541
           VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR
Sbjct: 560 VDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIR 619

Query: 542 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 601
           ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ
Sbjct: 620 ILLSIATFYDYEIWQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQ 679

Query: 602 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 641
           ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK
Sbjct: 680 ASRSWNIRFVTAIKSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVK 739

BLAST of Cmc04g0106681 vs. ExPASy TrEMBL
Match: A0A5A7VJG3 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold138G001110 PE=4 SV=1)

HSP 1 Score: 1003.0 bits (2592), Expect = 5.4e-289
Identity = 524/659 (79.51%), Positives = 540/659 (81.94%), Query Frame = 0

Query: 1   MTLKVGTGDVISARAVGDAKLFFRNKYMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSM 60
           MTL VGTGDVISARAVGD KLFF  K+MFLENLYIVPKIKRNLV VSCLIEHMYSINFSM
Sbjct: 208 MTLMVGTGDVISARAVGDVKLFFGIKFMFLENLYIVPKIKRNLVFVSCLIEHMYSINFSM 267

Query: 61  NEAFISKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 120
           NEAFISKNG     AKLE+NLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH
Sbjct: 268 NEAFISKNG-----AKLEDNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWH 327

Query: 121 LRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKEPLELIHS 180
           LRL HINL++IGRL+KNGLLNKL+DDSLP CESC EGKMTKRPFT K YRAKEPLELIHS
Sbjct: 328 LRLDHINLDRIGRLVKNGLLNKLKDDSLPPCESCLEGKMTKRPFTGKDYRAKEPLELIHS 387

Query: 181 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKI 240
           DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKI
Sbjct: 388 DLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKYEALEKFKEYKTEVENLLSKKIKI 447

Query: 241 LRSDQGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD------------ 300
           LRSD+GGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLD            
Sbjct: 448 LRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP 507

Query: 301 -------KKLGPRSRLCQFVGYPKETRGGLLFDPQENRVLVSTNATFLEEDHTRDHKPRS 360
                  KKL PRSRLCQFVGYPKE RGGL FDPQENRV VSTN TFLEED  RDHKPRS
Sbjct: 508 SSFWGTPKKLEPRSRLCQFVGYPKERRGGLFFDPQENRVFVSTNTTFLEEDRMRDHKPRS 567

Query: 361 KLVLNEATDESTRVVDEFGPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTE 420
           KLVL EATDESTRVVDE  PSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQP RYLGLTE
Sbjct: 568 KLVLCEATDESTRVVDEVDPSSRVDETTTSGQSHPSQSLRMPRRSGRIVSQPKRYLGLTE 627

Query: 421 TQIVIPDDGVEDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCK 480
           TQ+VIPDDGVEDPLSY Q MNDVDK+QWVKA+DLE+ESMYFNSVWEL DL EGVKPIGCK
Sbjct: 628 TQVVIPDDGVEDPLSYKQTMNDVDKNQWVKAMDLEIESMYFNSVWELADLSEGVKPIGCK 687

Query: 481 WIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEI 540
           WIYKRKRDS G                                KSIRILLSIATFYDYEI
Sbjct: 688 WIYKRKRDSVG--------------------------------KSIRILLSIATFYDYEI 747

Query: 541 WQMDVKTAFLNGNLEESIFISQPKGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFVTAI 600
           WQ+DVKTAFLNGNLEESIF+SQP+G                              F T I
Sbjct: 748 WQIDVKTAFLNGNLEESIFMSQPEG------------------------------FDTTI 799

Query: 601 KSYGFDQNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVKAWLAAQFQMKDL 641
           KSYGFDQNVDEPCVYKKIN  KVAFLVLYVDDIL IGN+VGYLTDVKAWLAAQFQMK+L
Sbjct: 808 KSYGFDQNVDEPCVYKKINKVKVAFLVLYVDDILLIGNNVGYLTDVKAWLAAQFQMKNL 799

BLAST of Cmc04g0106681 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 193.7 bits (491), Expect = 4.4e-49
Identity = 94/233 (40.34%), Positives = 148/233 (63.52%), Query Frame = 0

Query: 412 EDPLSYNQVMNDVDKDQWVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSA 471
           ++P +YN+    +    W  A+D E+ +M     WE+  LP   KPIGCKW+YK K +S 
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 472 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL 531
           G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D+  AFL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 532 NGNLEESIFISQPKGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFVTAIKSYGFD 591
           NG+L+E I++  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF 
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 592 QNVDEPCVYKKINIGKVAFLVLYVDDILFIGNDVGYLTDVKAWLAAQFQMKDL 641
           Q+  +   + KI       +++YVDDI+   N+   + ++K+ L + F+++DL
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDL 313

BLAST of Cmc04g0106681 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 80.9 bits (198), Expect = 4.2e-15
Identity = 36/86 (41.86%), Positives = 54/86 (62.79%), Query Frame = 0

Query: 429 WVKAIDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQ 488
           W +A+  E++++  N  W LV  P     +GCKW++K K  S G +   KARLVAKG+ Q
Sbjct: 40  WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQ 99

Query: 489 REGVDYEETFSPVAMLKSIRILLSIA 515
            EG+ + ET+SPV    +IR +L++A
Sbjct: 100 EEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of Cmc04g0106681 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 57.4 bits (137), Expect = 4.9e-08
Identity = 28/75 (37.33%), Positives = 41/75 (54.67%), Query Frame = 0

Query: 114 NNTYLWHLRLGHINLNQIGRLIKNGLLNKLEDDSLPSCESCHEGKMTKRPFTEKGYRAKE 173
           + T LWH RL H++   +  L+K G L+  +  SL  CE C  GK  +  F+   +  K 
Sbjct: 67  DETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKN 126

Query: 174 PLELIHSDLCGPMNV 189
           PL+ +HSDL G  +V
Sbjct: 127 PLDYVHSDLWGAPSV 141
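
The Identity and Positives figures in the HSP headers can be reproduced from the alignment midline, where a letter marks an identical residue and `+` marks a conservative substitution. The following is a minimal Python sketch of that arithmetic, not the code used by the database; the function names `midline_counts` and `percent` are hypothetical, and the sketch assumes the midline string has been extracted with its spacing intact.

```python
def midline_counts(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for a BLAST-style midline.

    Assumption: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gap columns.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative, len(midline)


def percent(count: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP headers."""
    return round(100.0 * count / length, 2)


# Midline of the final block of the ATMG00300.1 alignment (query 174..189).
midline = "PL+ +HSDL G  +V"
ident, pos, length = midline_counts(midline)
print(ident, pos, length)

# Reported counts for the Q03494 hit: 77 identities, 130 positives, 293 columns.
print(percent(77, 293), percent(130, 293))
```

For the 15-column block used here, the sketch counts 8 identities and 11 positives; applying `percent` to the reported counts for the Q03494 hit gives 26.28 and 44.37, matching its header.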

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAA0025945.1 | 0.0e+00 | 86.28 | gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi...
KAA0035907.1 | 0.0e+00 | 85.01 | gag/pol protein [Cucumis melo var. makuwa]
KAA0065400.1 | 1.0e-297 | 93.84 | gag/pol protein [Cucumis melo var. makuwa]
TYK28885.1 | 1.6e-295 | 92.93 | gag/pol protein [Cucumis melo var. makuwa]
KAA0067938.1 | 1.1e-288 | 79.51 | gag/pol protein [Cucumis melo var. makuwa]

Match Name | E-value | Identity | Description
P10978 | 1.3e-98 | 31.48 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 1.1e-65 | 26.46 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q94HW2 | 7.6e-62 | 24.64 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 6.4e-61 | 25.95 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q03494 | 1.7e-16 | 26.28 | Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name | E-value | Identity | Description
A0A5A7TZD0 | 0.0e+00 | 86.28 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000...
A0A5A7T2V9 | 0.0e+00 | 85.01 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760...
A0A5A7VDX8 | 4.9e-298 | 93.84 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold17G00700...
A0A5D3DZX8 | 7.8e-296 | 92.93 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold381G0032...
A0A5A7VJG3 | 5.4e-289 | 79.51 | Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold138G0011...

Match Name | E-value | Identity | Description
AT4G23160.1 | 4.4e-49 | 40.34 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1 | 4.2e-15 | 41.86 | Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00300.1 | 4.9e-08 | 37.33 | Gag-Pol-related retrotransposon family protein
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 216..236
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 363..385
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 352..385
None | No IPR available | PANTHER | PTHR45895 | FAMILY NOT NAMED | coord: 114..286; coord: 412..639
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 443..640; e-value: 4.1E-65; score: 219.8
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 |  | coord: 167..297; e-value: 2.4E-27; score: 97.5
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 106..159; e-value: 1.4E-12; score: 47.3
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 172..273; e-value: 1.9E-12; score: 47.4
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 170..347; score: 20.299007
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 169..288
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 443..625
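
The `coord: start..end` values in the InterPro table give each domain's position on the predicted protein. The short Python sketch below shows one way to turn those strings into spans and compare them. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names (`parse_coord`, `span_length`, `overlap`) are hypothetical.

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a string such as 'coord: 443..640'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span: tuple[int, int]) -> int:
    """Number of residues covered, assuming inclusive 1-based coordinates."""
    start, end = span
    return end - start + 1


def overlap(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


rvt2 = parse_coord("coord: 443..640")         # PFAM PF07727 (RVT_2)
rve = parse_coord("coord: 172..273")          # PFAM PF00665 (rve)
integrase = parse_coord("coord: 170..347")    # PROSITE PS50994 (INTEGRASE)

print(span_length(rvt2))
print(overlap(rve, integrase))
print(overlap(rve, rvt2))
```

Under the inclusive-coordinate assumption, the PFAM rve span falls entirely within the PROSITE integrase span, while the reverse transcriptase span does not overlap either of them.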

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cmc04g0106681.1 | Cmc04g0106681.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding