Cmc02g0049551 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc02g0049551
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Location: CMiso1.1chr02: 15221971 .. 15223179 (+)
RNA-Seq Expression: Cmc02g0049551
Synteny: Cmc02g0049551
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAAAGAGTATCTTTATGTCTCAGCTCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATAATGCGATCAAATCCTACGGTTTTGACCGAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGAAATGATGTGGGATACCTTACTAACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCATAAGAACAAAACGCTAGCACTGTCTCAAGCAATCTATATCGACAAAATGTTGGTTCGATATTTGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATAAGACGTATTCCCTATGCCTCAACTATGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAATCCAGGGTTAGACCACTGGACGAGGGTTAAAATTATTCTCAAGTATCTTAGGAGAATGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACATTGACTATGATTTCCAAATCGATAAAGATTCTAGAAAATCTACGTCGGGATCAATGTTCACCCTAAATGAGAGAGCTGTAGTATGGCATAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTAAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGATTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTACCAAACATGAATTTGCCCATCACTCCATATTATGATAACAGTCGGGCAGTAGCCAATTCTAAGGAACCTCGCAACTATAAACGAGGGAAACACAAAGAGAGGAAGTATCACCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATTGCTTCGGAGCACAACATTGTTGATCCATTTACGAAGACTTTCACGGCTAAAGTGTTCGAGGGTCATCTATAG

mRNA sequence

ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAAAGAGTATCTTTATGTCTCAGCTCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATAATGCGATCAAATCCTACGGTTTTGACCGAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGAAATGATGTGGGATACCTTACTAACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCATAAGAACAAAACGCTAGCACTGTCTCAAGCAATCTATATCGACAAAATGTTGGTTCGATATTTGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATAAGACGTATTCCCTATGCCTCAACTATGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAATCCAGGGTTAGACCACTGGACGAGGGTTAAAATTATTCTCAAGTATCTTAGGAGAATGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACATTGACTATGATTTCCAAATCGATAAAGATTCTAGAAAATCTACGTCGGGATCAATGTTCACCCTAAATGAGAGAGCTGTAGTATGGCATAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTAAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGATTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTACCAAACATGAATTTGCCCATCACTCCATATTATGATAACAGTCGGGCAGTAGCCAATTCTAAGGAACCTCGCAACTATAAACGAGGGAAACACAAAGAGAGGAAGTATCACCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATTGCTTCGGAGCACAACATTGTTGATCCATTTACGAAGACTTTCACGGCTAAAGTGTTCGAGGGTCATCTATAG

Coding sequence (CDS)

ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAAAGAGTATCTTTATGTCTCAGCTCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATAATGCGATCAAATCCTACGGTTTTGACCGAAACGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGAAATGATGTGGGATACCTTACTAACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCATAAGAACAAAACGCTAGCACTGTCTCAAGCAATCTATATCGACAAAATGTTGGTTCGATATTTGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAGTTGAGGATATAAGACGTATTCCCTATGCCTCAACTATGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAATCCAGGGTTAGACCACTGGACGAGGGTTAAAATTATTCTCAAGTATCTTAGGAGAATGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACATTGACTATGATTTCCAAATCGATAAAGATTCTAGAAAATCTACGTCGGGATCAATGTTCACCCTAAATGAGAGAGCTGTAGTATGGCATAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTAAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGATTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTACCAAACATGAATTTGCCCATCACTCCATATTATGATAACAGTCGGGCAGTAGCCAATTCTAAGGAACCTCGCAACTATAAACGAGGGAAACACAAAGAGAGGAAGTATCACCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATTGCTTCGGAGCACAACATTGTTGATCCATTTACGAAGACTTTCACGGCTAAAGTGTTCGAGGGTCATCTATAG

Protein sequence

MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKSYGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRMRDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTMEAKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKERKYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL
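As a quick consistency check, the genomic location, the CDS and the protein length can be reconciled with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page) and that the final codon is the TAG stop seen at the end of the CDS. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_bp = span_length(15221971, 15223179)

# A CDS with no introns should be a whole number of codons.
codons, remainder = divmod(gene_bp, 3)

# The last codon is the stop codon, so it does not add a residue.
residues = codons - 1

print(gene_bp, codons, remainder, residues)
```

Under these assumptions the span is 1209 bp, or 403 codons, which gives 402 residues once the stop codon is excluded. That agrees with the 402-residue alignment length reported for the closest BLAST hits.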
Homology
BLAST of Cmc02g0049551 vs. NCBI nr
Match: KAA0046800.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 782.7 bits (2020), Expect = 1.5e-222
Identity = 385/402 (95.77%), Positives = 390/402 (97.01%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
           +++K+ + N   E       LEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS
Sbjct: 290 LEMKSMYFNSMWELVDLPEGLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 349

Query: 61  YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
           YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA
Sbjct: 350 YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 409

Query: 121 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
           QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ
Sbjct: 410 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 469

Query: 181 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
           EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM
Sbjct: 470 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 529

Query: 241 RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
           RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME
Sbjct: 530 RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 589

Query: 301 AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
           AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER
Sbjct: 590 AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 649

Query: 361 KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
           KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL
Sbjct: 650 KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 691
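
The percentages in an HSP summary line are the matching counts divided by the alignment length. The following is a minimal sketch for parsing such a line and recomputing the values. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions about this text layout, not part of any BLAST tool.

```python
import re

# Matches lines such as:
#   Identity = 385/402 (95.77%), Positives = 390/402 (97.01%), Query Frame = 0
# "Posi?tives" also tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, id_len, _, positive, pos_len, _ = match.groups()
    identical, id_len = int(identical), int(id_len)
    positive, pos_len = int(positive), int(pos_len)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": id_len,
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * identical / id_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 385/402 (95.77%), Positives = 390/402 (97.01%), Query Frame = 0"
)
print(stats)
```

Recomputing the values this way is a cheap check that a summary line was not damaged during export or copy-paste.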

BLAST of Cmc02g0049551 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 742.3 bits (1915), Expect = 2.2e-210
Identity = 369/402 (91.79%), Positives = 380/402 (94.53%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNGNLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLAAQFQMKDLGEA
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 942

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
            QYVLGIQIIRD KNKTLALSQA YIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            EVED+RRIPYAS +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKI+LKYLRR 
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGY D DFQ DKDSRKSTSGS+FTLN  AVVW SIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
            A+YVAACEAAKEA WLRKFLHDLEVVPNMNLPIT Y DNS AVANSKEPR++KRGKH ER
Sbjct: 1123 AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
            KYHLIREIVQRGDVIVTKIASEHNI DPFTKT TAKVFEGHL
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHL 1224

BLAST of Cmc02g0049551 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 742.3 bits (1915), Expect = 2.2e-210
Identity = 369/402 (91.79%), Positives = 380/402 (94.53%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNGNLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFD AIKS
Sbjct: 697  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 756

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLAAQFQMKDLGEA
Sbjct: 757  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 816

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
            QYVLGIQIIRD KNKTLALSQA YIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQ
Sbjct: 817  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 876

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            EVED+RRIPYAS +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKI+LKYLRR 
Sbjct: 877  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 936

Query: 241  RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGY D DFQ DKDSRKSTSGS+FTLN  AVVW SIKQGCIADSTME
Sbjct: 937  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 996

Query: 301  AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
            A+YVAACEAAKEA WLRKFLHDLEVVPNMNLPIT Y DNS AVANSKEPR++KRGKH ER
Sbjct: 997  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1056

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
            KYHLIREIVQRGDVIVTKIASEHNI DPFTKT TAKVFEGHL
Sbjct: 1057 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHL 1098

BLAST of Cmc02g0049551 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 734.9 bits (1896), Expect = 3.6e-208
Identity = 366/402 (91.04%), Positives = 378/402 (94.03%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNGNLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLAAQFQMKDLGE 
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEG 942

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
            QYVLGIQIIRD KNKTLALSQA YIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            EVED+RRIPYAS +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKIILKYLRR 
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGY + DFQ DKDSRKSTS S+FTLN  AVVW SIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
            A+YVAACEAAKEA WL+KFLHDLEVVPNMNLPIT Y DNS AVANSKEPR++KRGKH ER
Sbjct: 1123 AEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
            KYHLIREIVQRGDVIVTKIASEHNI DPFTKT TAKVFEGHL
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHL 1224

BLAST of Cmc02g0049551 vs. NCBI nr
Match: KAA0040367.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK23337.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 713.4 bits (1840), Expect = 1.1e-201
Identity = 355/402 (88.31%), Positives = 367/402 (91.29%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
           MDVKTAFLN NLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQ+SRSWN+RFD AIKS
Sbjct: 22  MDVKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQSSRSWNMRFDTAIKS 81

Query: 61  YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
           YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLA QFQMKDLGE 
Sbjct: 82  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLATQFQMKDLGET 141

Query: 121 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
           QYVLGIQIIRD KNKTLALSQA YIDKMLVRY MQNSKK LLPFRHGVHLSKEQCPKTPQ
Sbjct: 142 QYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKDLLPFRHGVHLSKEQCPKTPQ 201

Query: 181 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
           E+ED+RRI YAS +GSLMY ML TRPDICYAVGIVSRY  NPGLDHWT VKIILKYLRR 
Sbjct: 202 EIEDMRRILYASAVGSLMYDMLYTRPDICYAVGIVSRYLFNPGLDHWTAVKIILKYLRRT 261

Query: 241 RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
           RDYMLVYG KDLILTGY D DFQ DKDSRKSTSGS+FTLN  AVVWHSIKQGCIADSTME
Sbjct: 262 RDYMLVYGGKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNRGAVVWHSIKQGCIADSTME 321

Query: 301 AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
           A+Y+AACEAAKE  WLRKFLHDLEVVPNMNL IT Y DNS AVANSKEPRN+KRGKH ER
Sbjct: 322 AEYIAACEAAKEVVWLRKFLHDLEVVPNMNLSITLYCDNSGAVANSKEPRNHKRGKHIER 381

Query: 361 KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
           KYHLIREIVQR DVIVTKI SEH I DPFTKT TAKVFEGHL
Sbjct: 382 KYHLIREIVQRRDVIVTKITSEHKITDPFTKTLTAKVFEGHL 423

BLAST of Cmc02g0049551 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 8.0e-94
Identity = 178/400 (44.50%), Positives = 255/400 (63.75%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            +DVKTAFL+G+LE+ I+M Q EGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS
Sbjct: 921  LDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKS 980

Query: 61   YGFDRNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGE 120
              + +   +PCVY K+ ++     L+LYVDD+L++G D G +  +K  L+  F MKDLG 
Sbjct: 981  QTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGP 1040

Query: 121  AQYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTP 180
            AQ +LG++I+R+  ++ L LSQ  YI+++L R+ M+N+K    P    + LSK+ CP T 
Sbjct: 1041 AQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTV 1100

Query: 181  QEVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRR 240
            +E  ++ ++PY+S +GSLMYAM+CTRPDI +AVG+VSR+  NPG +HW  VK IL+YLR 
Sbjct: 1101 EEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRG 1160

Query: 241  MRDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTM 300
                 L +G  D IL GY D D   D D+RKS++G +FT +  A+ W S  Q C+A ST 
Sbjct: 1161 TTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTT 1220

Query: 301  EAKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKE 360
            EA+Y+AA E  KE  WL++FL +L +          Y D+  A+  SK    + R KH +
Sbjct: 1221 EAEYIAATETGKEMIWLKRFLQELGL---HQKEYVVYCDSQSAIDLSKNSMYHARTKHID 1280

Query: 361  RKYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFE 400
             +YH IRE+V    + V KI++  N  D  TK      FE
Sbjct: 1281 VRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFE 1317

BLAST of Cmc02g0049551 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 235.7 bits (600), Expect = 9.0e-61
Identity = 143/408 (35.05%), Positives = 224/408 (54.90%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNG L++ I+M   +G         VCKLN++IYGLKQA+R W   F+ A+K 
Sbjct: 1001 MDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKE 1060

Query: 61   YGFDRNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLG 120
              F  +  + C+Y   K N  +  +++LYVDD+++   D+  + N K +L  +F+M DL 
Sbjct: 1061 CEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLN 1120

Query: 121  EAQYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHL----SKEQ 180
            E ++ +GI+I  + +   + LSQ+ Y+ K+L ++ M+N      P    ++     S E 
Sbjct: 1121 EIKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDED 1180

Query: 181  CPKTPQEVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIIL 240
            C             P  S +G LMY MLCTRPD+  AV I+SRY S    + W  +K +L
Sbjct: 1181 C-----------NTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVL 1240

Query: 241  KYLRRMRDYMLVYG---AKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNE-RAVVWHSIK 300
            +YL+   D  L++    A +  + GY+D D+   +  RKST+G +F + +   + W++ +
Sbjct: 1241 RYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKR 1300

Query: 301  QGCIADSTMEAKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPR 360
            Q  +A S+ EA+Y+A  EA +EA WL+  L  + +   +  PI  Y DN   ++ +  P 
Sbjct: 1301 QNSVAASSTEAEYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPS 1360

Query: 361  NYKRGKHKERKYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVF 399
             +KR KH + KYH  RE VQ   + +  I +E+ + D FTK   A  F
Sbjct: 1361 CHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARF 1391

BLAST of Cmc02g0049551 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 1.3e-48
Identity = 125/400 (31.25%), Positives = 193/400 (48.25%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            +DV  AFL G L   ++MSQ  GF+ + +   VC+L ++IYGLKQA R+W +     + +
Sbjct: 1047 LDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLT 1106

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
             GF  ++ +  ++       + ++++YVDDIL+ GND   L +    L+ +F +K+  + 
Sbjct: 1107 VGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDL 1166

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
             Y LGI+  R  +   L LSQ  Y   +L R  M  +K    P      L+     K P 
Sbjct: 1167 HYFLGIEAKRVPQG--LHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPD 1226

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
              E      Y   +GSL Y +  TRPD+ YAV  +S+Y   P  DHW  +K +L+YL   
Sbjct: 1227 PTE------YRGIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGT 1286

Query: 241  RDY-MLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTM 300
             D+ + +     L L  Y D D+  D D   ST+G +  L    + W S KQ  +  S+ 
Sbjct: 1287 PDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSST 1346

Query: 301  EAKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKE 360
            EA+Y +    + E  W+   L +L +   ++ P   Y DN  A      P  + R KH  
Sbjct: 1347 EAEYRSVANTSSELQWICSLLTELGI--QLSHPPVIYCDNVGATYLCANPVFHSRMKHIA 1406

Query: 361  RKYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFE 400
              YH IR  VQ G + V  +++   + D  TK  +   F+
Sbjct: 1407 LDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRVAFQ 1435

BLAST of Cmc02g0049551 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 3.0e-48
Identity = 131/400 (32.75%), Positives = 192/400 (48.00%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            +DV  AFL G L   ++MSQ  GFI + +   VCKL +++YGLKQA R+W +   N + +
Sbjct: 1064 LDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLT 1123

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
             GF  +V +  ++       + ++++YVDDIL+ GND   L N    L+ +F +KD  E 
Sbjct: 1124 IGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEEL 1183

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
             Y LGI+  R      L LSQ  YI  +L R  M  +K    P      LS     K   
Sbjct: 1184 HYFLGIEAKRVPTG--LHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTD 1243

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
              E      Y   +GSL Y +  TRPDI YAV  +S++   P  +H   +K IL+YL   
Sbjct: 1244 PTE------YRGIVGSLQY-LAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGT 1303

Query: 241  RDY-MLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTM 300
             ++ + +     L L  Y D D+  DKD   ST+G +  L    + W S KQ  +  S+ 
Sbjct: 1304 PNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSST 1363

Query: 301  EAKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKE 360
            EA+Y +    + E  W+   L +L +   +  P   Y DN  A      P  + R KH  
Sbjct: 1364 EAEYRSVANTSSEMQWICSLLTELGI--RLTRPPVIYCDNVGATYLCANPVFHSRMKHIA 1423

Query: 361  RKYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFE 400
              YH IR  VQ G + V  +++   + D  TK  +   F+
Sbjct: 1424 IDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQ 1452

BLAST of Cmc02g0049551 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 142.1 bits (357), Expect = 1.3e-32
Identity = 101/314 (32.17%), Positives = 142/314 (45.22%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
           MDV TAFLN  +++ I++ Q  GF+ +     V +L   +YGLKQA   WN   +N +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 61  YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            GF R+  E  +Y +       ++ +YVDD+L+          VK  L   + MKDLG+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 121 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
              LG+  I    N  + LS   YI K      +   K    P  +    SK     T  
Sbjct: 121 DKFLGLN-IHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCN----SKPLFETTSP 180

Query: 181 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            ++DI   PY S +G L++     RPDI Y V ++SR+   P   H    + +L+YL   
Sbjct: 181 HLKDI--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 241 RDYMLVY-GAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIK-QGCIADST 300
           R   L Y     L LT Y D       D   ST G +  L    V W S K +G I   +
Sbjct: 241 RSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 300

Query: 301 MEAKYVAACEAAKE 313
            EA+Y+ A E   E
Sbjct: 301 TEAEYITASETVME 307

BLAST of Cmc02g0049551 vs. ExPASy TrEMBL
Match: A0A5A7TUI8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold216G00920 PE=4 SV=1)

HSP 1 Score: 782.7 bits (2020), Expect = 7.3e-223
Identity = 385/402 (95.77%), Positives = 390/402 (97.01%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
           +++K+ + N   E       LEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS
Sbjct: 290 LEMKSMYFNSMWELVDLPEGLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 349

Query: 61  YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
           YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA
Sbjct: 350 YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 409

Query: 121 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
           QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ
Sbjct: 410 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 469

Query: 181 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
           EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM
Sbjct: 470 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 529

Query: 241 RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
           RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME
Sbjct: 530 RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 589

Query: 301 AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
           AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER
Sbjct: 590 AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 649

Query: 361 KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
           KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL
Sbjct: 650 KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 691

BLAST of Cmc02g0049551 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.1e-210
Identity = 369/402 (91.79%), Positives = 380/402 (94.53%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNGNLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLAAQFQMKDLGEA
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 942

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
            QYVLGIQIIRD KNKTLALSQA YIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            EVED+RRIPYAS +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKI+LKYLRR 
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGY D DFQ DKDSRKSTSGS+FTLN  AVVW SIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
            A+YVAACEAAKEA WLRKFLHDLEVVPNMNLPIT Y DNS AVANSKEPR++KRGKH ER
Sbjct: 1123 AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
            KYHLIREIVQRGDVIVTKIASEHNI DPFTKT TAKVFEGHL
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHL 1224

BLAST of Cmc02g0049551 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.1e-210
Identity = 369/402 (91.79%), Positives = 380/402 (94.53%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNGNLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFD AIKS
Sbjct: 697  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 756

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLAAQFQMKDLGEA
Sbjct: 757  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 816

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
            QYVLGIQIIRD KNKTLALSQA YIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQ
Sbjct: 817  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 876

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            EVED+RRIPYAS +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKI+LKYLRR 
Sbjct: 877  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 936

Query: 241  RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGY D DFQ DKDSRKSTSGS+FTLN  AVVW SIKQGCIADSTME
Sbjct: 937  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 996

Query: 301  AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
            A+YVAACEAAKEA WLRKFLHDLEVVPNMNLPIT Y DNS AVANSKEPR++KRGKH ER
Sbjct: 997  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1056

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
            KYHLIREIVQRGDVIVTKIASEHNI DPFTKT TAKVFEGHL
Sbjct: 1057 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHL 1098

BLAST of Cmc02g0049551 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 734.9 bits (1896), Expect = 1.7e-208
Identity = 366/402 (91.04%), Positives = 378/402 (94.03%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
            MDVKTAFLNGNLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFD AIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
            YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLAAQFQMKDLGE 
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEG 942

Query: 121  QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
            QYVLGIQIIRD KNKTLALSQA YIDK+LVRY MQNSKKGLLPFRHGVHLSKEQ PKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
            EVED+RRIPYAS +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKIILKYLRR 
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGY + DFQ DKDSRKSTS S+FTLN  AVVW SIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
            A+YVAACEAAKEA WL+KFLHDLEVVPNMNLPIT Y DNS AVANSKEPR++KRGKH ER
Sbjct: 1123 AEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
            KYHLIREIVQRGDVIVTKIASEHNI DPFTKT TAKVFEGHL
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHL 1224

BLAST of Cmc02g0049551 vs. ExPASy TrEMBL
Match: A0A5D3DI92 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G003980 PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 5.4e-202
Identity = 355/402 (88.31%), Positives = 367/402 (91.29%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEKSIFMSQLEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDNAIKS 60
           MDVKTAFLN NLE+SIFMSQ EGFITQGQEQKVCKLNRSIYGLKQ+SRSWN+RFD AIKS
Sbjct: 22  MDVKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQSSRSWNMRFDTAIKS 81

Query: 61  YGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEA 120
           YGFD+NVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLT+VKAWLA QFQMKDLGE 
Sbjct: 82  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLATQFQMKDLGET 141

Query: 121 QYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCPKTPQ 180
           QYVLGIQIIRD KNKTLALSQA YIDKMLVRY MQNSKK LLPFRHGVHLSKEQCPKTPQ
Sbjct: 142 QYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKDLLPFRHGVHLSKEQCPKTPQ 201

Query: 181 EVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLRRM 240
           E+ED+RRI YAS +GSLMY ML TRPDICYAVGIVSRY  NPGLDHWT VKIILKYLRR 
Sbjct: 202 EIEDMRRILYASAVGSLMYDMLYTRPDICYAVGIVSRYLFNPGLDHWTAVKIILKYLRRT 261

Query: 241 RDYMLVYGAKDLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTME 300
           RDYMLVYG KDLILTGY D DFQ DKDSRKSTSGS+FTLN  AVVWHSIKQGCIADSTME
Sbjct: 262 RDYMLVYGGKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNRGAVVWHSIKQGCIADSTME 321

Query: 301 AKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRGKHKER 360
           A+Y+AACEAAKE  WLRKFLHDLEVVPNMNL IT Y DNS AVANSKEPRN+KRGKH ER
Sbjct: 322 AEYIAACEAAKEVVWLRKFLHDLEVVPNMNLSITLYCDNSGAVANSKEPRNHKRGKHIER 381

Query: 361 KYHLIREIVQRGDVIVTKIASEHNIVDPFTKTFTAKVFEGHL 403
           KYHLIREIVQR DVIVTKI SEH I DPFTKT TAKVFEGHL
Sbjct: 382 KYHLIREIVQRRDVIVTKITSEHKITDPFTKTLTAKVFEGHL 423

BLAST of Cmc02g0049551 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 162.5 bits (410), Expect = 6.8e-40
Identity = 114/372 (30.65%), Positives = 188/372 (50.54%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEKSIFMSQLEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDN 60
           +D+  AFLNG+L++ I+M    G+   QG       VC L +SIYGLKQASR W ++F  
Sbjct: 193 LDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSV 252

Query: 61  AIKSYGFDRNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKD 120
            +  +GF ++  +   + KI       +++YVDDI++  N+   +  +K+ L + F+++D
Sbjct: 253 TLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRD 312

Query: 121 LGEAQYVLGIQIIRDHKNKTLALSQAIYIDKMLVRYLMQNSKKGLLPFRHGVHLSKEQCP 180
           LG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S     
Sbjct: 313 LGPLKYFLGLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAH--- 372

Query: 181 KTPQEVEDIRRIPYASTMGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKY 240
            +  +  D +   Y   +G LMY  + TR DI +AV  +S++   P L H   V  IL Y
Sbjct: 373 -SGGDFVDAK--AYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHY 432

Query: 241 LRRMRDYMLVYGAK-DLILTGYIDYDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIA 300
           ++      L Y ++ ++ L  + D  FQ  KD+R+ST+G    L    + W S KQ  ++
Sbjct: 433 IKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVS 492

Query: 301 DSTMEAKYVAACEAAKEADWLRKFLHDLEVVPNMNLPITPYYDNSRAVANSKEPRNYKRG 360
            S+ EA+Y A   A  E  WL +F  +L++   ++ P   + DN+ A+  +     ++R 
Sbjct: 493 KSSAEAEYRALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERT 552

Query: 361 KHKERKYHLIRE 368
           KH E   H +RE
Sbjct: 553 KHIESDCHSVRE 553
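
The Identity and Positives counts for an HSP can be recovered from the aligned rows themselves. In the standard BLAST match line, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below applies that convention to the final alignment block above; `count_matches` is a hypothetical helper name, and the three strings are the block with its coordinates stripped.

```python
def count_matches(query, midline, sbjct):
    """Return (identities, positives, columns) for one aligned block.

    Identities are counted by comparing the two sequences column by column;
    positives add the '+' columns from the match line.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("aligned rows must have the same number of columns")
    identities = sum(
        1 for q, s in zip(query, sbjct) if q == s and q != "-"
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the AT4G23160.1 alignment, coordinates removed.
query = "KHKERKYHLIRE"
midline = "KH E   H +RE"
sbjct = "KHIESDCHSVRE"

identities, positives, columns = count_matches(query, midline, sbjct)
print(identities, positives, columns)
```

Summing these per-block counts over all blocks of an HSP should reproduce the totals on its statistics line.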

BLAST of Cmc02g0049551 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 94.0 bits (232), Expect = 3.0e-19
Identity = 76/236 (32.20%), Positives = 118/236 (50.00%), Query Frame = 0

Query: 83  FLVLYVDDILLIGNDVGYLTNVKAWLAAQFQMKDLGEAQYVLGIQIIRDHKNKTLALSQA 142
           +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQ I+ H +  L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQ-IKTHPS-GLFLSQT 61

Query: 143 IYIDKMLVRYLMQNSK--KGLLPFRHGVHLSKEQCPKTPQEVEDIRRIPYASTMGSLMYA 202
            Y +++L    M + K     LP +    +S  + P    +  D R     S +G+L Y 
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP----DPSDFR-----SIVGALQYL 121

Query: 203 MLCTRPDICYAVGIVSRYQSNPGLDHWTRVKIILKYLR-RMRDYMLVYGAKDLILTGYID 262
            L TRPDI YAV IV +    P L  +  +K +L+Y++  +   + ++    L +  + D
Sbjct: 122 TL-TRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 181

Query: 263 YDFQIDKDSRKSTSGSMFTLNERAVVWHSIKQGCIADSTMEAKYVAACEAAKEADW 316
            D+     +R+ST+G    L    + W + +Q  ++ S+ E +Y A    A E  W
Sbjct: 182 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
KAA0046800.1  1.5e-222  95.77         gag/pol protein [Cucumis melo var. makuwa]
KAA0025945.1  2.2e-210  91.79         gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi...
KAA0059226.1  2.2e-210  91.79         gag/pol protein [Cucumis melo var. makuwa]
KAA0035907.1  3.6e-208  91.04         gag/pol protein [Cucumis melo var. makuwa]
KAA0040367.1  1.1e-201  88.31         gag/pol protein [Cucumis melo var. makuwa] >TYK23337.1 gag/pol protein [Cucumis ...

Match Name    E-value   Identity (%)  Description
P10978        8.0e-94   44.50         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146        9.0e-61   35.05         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94        1.3e-48   31.25         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2        3.0e-48   32.75         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P25600        1.3e-32   32.17         Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

Match Name    E-value   Identity (%)  Description
A0A5A7TUI8    7.3e-223  95.77         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold216G0092...
A0A5A7TZD0    1.1e-210  91.79         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000...
A0A5A7UYE8    1.1e-210  91.79         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015...
A0A5A7T2V9    1.7e-208  91.04         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760...
A0A5D3DI92    5.4e-202  88.31         Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G0039...

Match Name    E-value   Identity (%)  Description
AT4G23160.1   6.8e-40   30.65         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1   3.0e-19   32.20         DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                                      Source       Source Term  Source Description   Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727      RVT_2                coord: 1..162; e-value: 7.4E-39; score: 133.8
None       No IPR available                                     PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 1..268
None       No IPR available                                     CDD          cd09272      RNase_HI_RT_Ty1      coord: 255..391; e-value: 5.06242E-49; score: 161.097
IPR043502  DNA/RNA polymerase superfamily                       SUPERFAMILY  56672        DNA/RNA polymerases  coord: 1..246
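
The `coord` ranges above are inclusive residue positions on the predicted protein, so they can be treated as closed intervals to see how the domain hits relate to each other. The sketch below is illustrative only: the helper names are hypothetical, and the protein length of 402 residues is an assumption taken from the 1209-nt coding sequence (402 codons plus a stop codon).

```python
def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(intervals):
    """Total residues covered by at least one range."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# Coordinates copied from the InterPro table above.
hits = {
    "PFAM PF07727 (RVT_2)": (1, 162),
    "PANTHER PTHR45895": (1, 268),
    "CDD cd09272 (RNase_HI_RT_Ty1)": (255, 391),
    "SUPERFAMILY 56672": (1, 246),
}

PROTEIN_LENGTH = 402  # assumption: 1209-nt CDS, stop codon excluded

rvt = hits["PFAM PF07727 (RVT_2)"]
rnase_h = hits["CDD cd09272 (RNase_HI_RT_Ty1)"]
print(overlap(rvt, rnase_h))                  # residues shared by the two domains
print(merge_intervals(hits.values()))         # union of all annotated ranges
print(covered_length(hits.values()), "of", PROTEIN_LENGTH)
```

Under these coordinates the reverse transcriptase and RNase H hits do not overlap, which is consistent with them annotating separate regions of the polyprotein fragment.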

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc02g0049551.1  Cmc02g0049551.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding