Cmc04g0102941 (gene) Melon (Charmono) v1.1

Overview
NameCmc04g0102941
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
LocationCMiso1.1chr04: 19824525 .. 19825765 (+)
RNA-Seq ExpressionCmc04g0102941
SyntenyCmc04g0102941
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCACAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTTAGGACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGTATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA

mRNA sequence

ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCACAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGCATGGAGTTCACTTGTCTTAGGACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGTATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA

Coding sequence (CDS)

ATGGATGTCAAGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAAGCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTGTATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTAGCAGCACAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATATCGACAAATTGTTGGCATGGAGTTCACTTGTCTTAGGACAGAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTATCAGTCCAACCCAGGGTTAGACCATTGGACGGCGGTTAAAATTGTTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGGAGCTAAGGATTTGATCCTTACAGGATACACTGATTCTGATTTCCAAACCGATAAGGATTCTAGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGTATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTGATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTACATTAGGTAA

Protein sequence

MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Homology
BLAST of Cmc04g0102941 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 773.9 bits (1997), Expect = 6.9e-220
Identity = 391/413 (94.67%), Postives = 391/413 (94.67%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 942

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
            QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
            EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
            AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER
Sbjct: 1123 AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
            KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc04g0102941 vs. NCBI nr
Match: KAA0059226.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 773.9 bits (1997), Expect = 6.9e-220
Identity = 391/413 (94.67%), Postives = 391/413 (94.67%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS
Sbjct: 697  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 756

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA
Sbjct: 757  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 816

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
            QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQ
Sbjct: 817  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 876

Query: 181  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
            EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT
Sbjct: 877  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 936

Query: 241  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Sbjct: 937  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 996

Query: 301  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
            AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER
Sbjct: 997  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1056

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
            KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1057 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1109

BLAST of Cmc04g0102941 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 765.8 bits (1976), Expect = 1.9e-217
Identity = 386/413 (93.46%), Postives = 389/413 (94.19%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE 
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEG 942

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
            QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
            EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
            AEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER
Sbjct: 1123 AEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
            KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc04g0102941 vs. NCBI nr
Match: KAA0061170.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 727.2 bits (1876), Expect = 7.4e-206
Identity = 367/396 (92.68%), Postives = 368/396 (92.93%), Query Frame = 0

Query: 18  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN 77
           MSQPEGFITQ QEQKVCKLNRSIYG KQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN
Sbjct: 1   MSQPEGFITQSQEQKVCKLNRSIYGSKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN 60

Query: 78  KGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL 137
           KGKVAFLVLYVDDILLIGND GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL
Sbjct: 61  KGKVAFLVLYVDDILLIGNDAGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL 120

Query: 138 ALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSL 197
           ALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSL
Sbjct: 121 ALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSL 180

Query: 198 MYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGY 257
           MYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKI+LKYLRRTRDYMLVYGAKDLILTGY
Sbjct: 181 MYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKIILKYLRRTRDYMLVYGAKDLILTGY 240

Query: 258 TDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR 317
           TDSDFQTDKDSRKSTSGSVFTLN GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR
Sbjct: 241 TDSDFQTDKDSRKSTSGSVFTLNEGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR 300

Query: 318 KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT 377
           KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT
Sbjct: 301 KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT 360

Query: 378 KIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
           KIASEHNIADPFTK LTAKVFEGHLESLGLRDMYIR
Sbjct: 361 KIASEHNIADPFTKILTAKVFEGHLESLGLRDMYIR 396

BLAST of Cmc04g0102941 vs. NCBI nr
Match: KAA0040367.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK23337.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 725.7 bits (1872), Expect = 2.1e-205
Identity = 366/413 (88.62%), Postives = 373/413 (90.31%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
           MDVKTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ+SRSWN+RFDTAIKS
Sbjct: 22  MDVKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQSSRSWNMRFDTAIKS 81

Query: 61  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
           YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLA QFQMKDLGE 
Sbjct: 82  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLATQFQMKDLGET 141

Query: 121 QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
           QYVLGIQIIRDRKNKTLALSQATYIDK+L   S                L   Q PKTPQ
Sbjct: 142 QYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKDLLPFRHGVHLSKEQCPKTPQ 201

Query: 181 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
           E+EDMRRI YASAVGSLMY ML TRPDICYAVGIVSRY  NPGLDHWTAVKI+LKYLRRT
Sbjct: 202 EIEDMRRILYASAVGSLMYDMLYTRPDICYAVGIVSRYLFNPGLDHWTAVKIILKYLRRT 261

Query: 241 RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
           RDYMLVYG KDLILTGYTDSDFQTDKDSRKSTSGSVFTLN GAVVW SIKQGCIADSTME
Sbjct: 262 RDYMLVYGGKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNRGAVVWHSIKQGCIADSTME 321

Query: 301 AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
           AEY+AACEAAKE VWLRKFLHDLEVVPNMNL ITLYCDNSGAVANSKEPR+HKRGKHIER
Sbjct: 322 AEYIAACEAAKEVVWLRKFLHDLEVVPNMNLSITLYCDNSGAVANSKEPRNHKRGKHIER 381

Query: 361 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
           KYHLIREIVQR DVIVTKI SEH I DPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 382 KYHLIREIVQRRDVIVTKITSEHKITDPFTKTLTAKVFEGHLESLGLRDMYIR 434

BLAST of Cmc04g0102941 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 3.7e-99
Identity = 187/408 (45.83%), Postives = 262/408 (64.22%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            +DVKTAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS
Sbjct: 921  LDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKS 980

Query: 61   YGFDQNVDEPCVY-KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE 120
              + +   +PCVY K+ ++     L+LYVDD+L++G D G +  +K  L+  F MKDLG 
Sbjct: 981  QTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGP 1040

Query: 121  AQYVLGIQIIRDRKNKTLALSQATYIDKLL----------------AWSSLVLGQSPKTP 180
            AQ +LG++I+R+R ++ L LSQ  YI+++L                    L     P T 
Sbjct: 1041 AQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTV 1100

Query: 181  QEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRR 240
            +E  +M ++PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG +HW AVK +L+YLR 
Sbjct: 1101 EEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRG 1160

Query: 241  TRDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTM 300
            T    L +G  D IL GYTD+D   D D+RKS++G +FT +GGA+ W+S  Q C+A ST 
Sbjct: 1161 TTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTT 1220

Query: 301  EAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIE 360
            EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+
Sbjct: 1221 EAEYIAATETGKEMIWLKRFLQELGL---HQKEYVVYCDSQSAIDLSKNSMYHARTKHID 1280

Query: 361  RKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL 392
             +YH IRE+V    + V KI++  N AD  TK +    FE   E +G+
Sbjct: 1281 VRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFELCKELVGM 1325

BLAST of Cmc04g0102941 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 259.6 bits (662), Expect = 5.7e-68
Identity = 152/406 (37.44%), Postives = 232/406 (57.14%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K 
Sbjct: 1001 MDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKE 1060

Query: 61   YGFDQNVDEPCVY--KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLG 120
              F  +  + C+Y   K N  +  +++LYVDD+++   D+  + + K +L  +F+M DL 
Sbjct: 1061 CEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLN 1120

Query: 121  EAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTP---------QEVEDM 180
            E ++ +GI+I  + +   + LSQ+ Y+ K+L+  ++    +  TP            ++ 
Sbjct: 1121 EIKHFIGIRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDED 1180

Query: 181  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYML 240
               P  S +G LMY MLCTRPD+  AV I+SRY S    + W  +K VL+YL+ T D  L
Sbjct: 1181 CNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKL 1240

Query: 241  VYG---AKDLILTGYTDSDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEA 300
            ++    A +  + GY DSD+   +  RKST+G +F + +   + W + +Q  +A S+ EA
Sbjct: 1241 IFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEA 1300

Query: 301  EYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERK 360
            EY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ K
Sbjct: 1301 EYMALFEAVREALWLKFLLTSINI--KLENPIKIYEDNQGCISIANNPSCHKRAKHIDIK 1360

Query: 361  YHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL 392
            YH  RE VQ   + +  I +E+ +AD FTK L A  F    + LGL
Sbjct: 1361 YHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGL 1400

BLAST of Cmc04g0102941 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 4.9e-59
Identity = 136/402 (33.83%), Postives = 210/402 (52.24%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            +DV  AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + +
Sbjct: 1047 LDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLT 1106

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
             GF  ++ +  ++       + ++++YVDDIL+ GND   L      L+ +F +K+  + 
Sbjct: 1107 VGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDL 1166

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQ------EVEDMRRIP- 180
             Y LGI+    R  + L LSQ  Y   LLA ++++  +   TP        +    ++P 
Sbjct: 1167 HYFLGIE--AKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPD 1226

Query: 181  ---YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-ML 240
               Y   VGSL Y +  TRPD+ YAV  +S+Y   P  DHW A+K VL+YL  T D+ + 
Sbjct: 1227 PTEYRGIVGSLQY-LAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIF 1286

Query: 241  VYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVA 300
            +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ EAEY +
Sbjct: 1287 LKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRS 1346

Query: 301  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLI 360
                + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH I
Sbjct: 1347 VANTSSELQWICSLLTELGI--QLSHPPVIYCDNVGATYLCANPVFHSRMKHIALDYHFI 1406

Query: 361  REIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL 392
            R  VQ G + V  +++   +AD  TK L+   F+     +G+
Sbjct: 1407 RNQVQSGALRVVHVSTHDQLADTLTKPLSRVAFQNFSRKIGV 1443

BLAST of Cmc04g0102941 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 225.7 bits (574), Expect = 9.2e-58
Identity = 136/402 (33.83%), Postives = 206/402 (51.24%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            +DV  AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + +
Sbjct: 1064 LDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLT 1123

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
             GF  +V +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  E 
Sbjct: 1124 IGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEEL 1183

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEVEDMRRI-------- 180
             Y LGI+    R    L LSQ  YI  LLA ++++  +   TP        +        
Sbjct: 1184 HYFLGIE--AKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTD 1243

Query: 181  --PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-ML 240
               Y   VGSL Y +  TRPDI YAV  +S++   P  +H  A+K +L+YL  T ++ + 
Sbjct: 1244 PTEYRGIVGSLQY-LAFTRPDISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIF 1303

Query: 241  VYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVA 300
            +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +
Sbjct: 1304 LKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRS 1363

Query: 301  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLI 360
                + E  W+   L +L +   +  P  +YCDN GA      P  H R KHI   YH I
Sbjct: 1364 VANTSSEMQWICSLLTELGI--RLTRPPVIYCDNVGATYLCANPVFHSRMKHIAIDYHFI 1423

Query: 361  REIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL 392
            R  VQ G + V  +++   +AD  TK L+   F+     +G+
Sbjct: 1424 RNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFASKIGV 1460

BLAST of Cmc04g0102941 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 154.5 bits (389), Expect = 2.6e-36
Identity = 106/310 (34.19%), Postives = 147/310 (47.42%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
           MDV TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K 
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 61  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            GF ++  E  +Y +       ++ +YVDD+L+          VK  L   + MKDLG+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 121 QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSL---VLGQSP---------KTPQEVED 180
              LG+  I    N  + LS   YI K  + S +    L Q+P          T   ++D
Sbjct: 121 DKFLGLN-IHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKD 180

Query: 181 MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYM 240
           +   PY S VG L++     RPDI Y V ++SR+   P   H  + + VL+YL  TR   
Sbjct: 181 I--TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMC 240

Query: 241 LVY-GAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIK-QGCIADSTMEAE 297
           L Y     L LT Y D+      D   ST G V  L G  V W S K +G I   + EAE
Sbjct: 241 LKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAE 300

BLAST of Cmc04g0102941 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 773.9 bits (1997), Expect = 3.3e-220
Identity = 391/413 (94.67%), Postives = 391/413 (94.67%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 942

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
            QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
            EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
            AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER
Sbjct: 1123 AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
            KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc04g0102941 vs. ExPASy TrEMBL
Match: A0A5A7UYE8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001570 PE=4 SV=1)

HSP 1 Score: 773.9 bits (1997), Expect = 3.3e-220
Identity = 391/413 (94.67%), Postives = 391/413 (94.67%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS
Sbjct: 697  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 756

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA
Sbjct: 757  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 816

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
            QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQ
Sbjct: 817  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 876

Query: 181  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
            EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT
Sbjct: 877  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 936

Query: 241  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME
Sbjct: 937  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 996

Query: 301  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
            AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER
Sbjct: 997  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1056

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
            KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1057 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1109

BLAST of Cmc04g0102941 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 9.1e-218
Identity = 386/413 (93.46%), Postives = 389/413 (94.19%), Query Frame = 0

Query: 1    MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
            MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS
Sbjct: 823  MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 882

Query: 61   YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
            YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE 
Sbjct: 883  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEG 942

Query: 121  QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
            QYVLGIQIIRDRKNKTLALSQATYIDKLL   S                L   QSPKTPQ
Sbjct: 943  QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQ 1002

Query: 181  EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
            EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKI+LKYLRRT
Sbjct: 1003 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIILKYLRRT 1062

Query: 241  RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
            RDYMLVYGAKDLILTGYT+SDFQTDKDSRKSTS SVFTLNGGAVVWRSIKQGCIADSTME
Sbjct: 1063 RDYMLVYGAKDLILTGYTNSDFQTDKDSRKSTSRSVFTLNGGAVVWRSIKQGCIADSTME 1122

Query: 301  AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
            AEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER
Sbjct: 1123 AEYVAACEAAKEAVWLKKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 1182

Query: 361  KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
            KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 1183 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 1235

BLAST of Cmc04g0102941 vs. ExPASy TrEMBL
Match: A0A5A7V1F5 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold753G00440 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 3.6e-206
Identity = 367/396 (92.68%), Postives = 368/396 (92.93%), Query Frame = 0

Query: 18  MSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN 77
           MSQPEGFITQ QEQKVCKLNRSIYG KQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN
Sbjct: 1   MSQPEGFITQSQEQKVCKLNRSIYGSKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKIN 60

Query: 78  KGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL 137
           KGKVAFLVLYVDDILLIGND GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL
Sbjct: 61  KGKVAFLVLYVDDILLIGNDAGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL 120

Query: 138 ALSQATYIDKLLAWSS----------------LVLGQSPKTPQEVEDMRRIPYASAVGSL 197
           ALSQATYIDKLL   S                L   QSPKTPQEVEDMRRIPYASAVGSL
Sbjct: 121 ALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSL 180

Query: 198 MYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGAKDLILTGY 257
           MYAMLCTRPDICYAVGIVSRYQSNPGLDHWT VKI+LKYLRRTRDYMLVYGAKDLILTGY
Sbjct: 181 MYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKIILKYLRRTRDYMLVYGAKDLILTGY 240

Query: 258 TDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR 317
           TDSDFQTDKDSRKSTSGSVFTLN GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR
Sbjct: 241 TDSDFQTDKDSRKSTSGSVFTLNEGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLR 300

Query: 318 KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT 377
           KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT
Sbjct: 301 KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVT 360

Query: 378 KIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
           KIASEHNIADPFTK LTAKVFEGHLESLGLRDMYIR
Sbjct: 361 KIASEHNIADPFTKILTAKVFEGHLESLGLRDMYIR 396

BLAST of Cmc04g0102941 vs. ExPASy TrEMBL
Match: A0A5D3DI92 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G003980 PE=4 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 1.0e-205
Identity = 366/413 (88.62%), Postives = 373/413 (90.31%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS 60
           MDVKTAFLN NLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ+SRSWN+RFDTAIKS
Sbjct: 22  MDVKTAFLNDNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQSSRSWNMRFDTAIKS 81

Query: 61  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEA 120
           YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLA QFQMKDLGE 
Sbjct: 82  YGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLATQFQMKDLGET 141

Query: 121 QYVLGIQIIRDRKNKTLALSQATYIDKLLAWSS----------------LVLGQSPKTPQ 180
           QYVLGIQIIRDRKNKTLALSQATYIDK+L   S                L   Q PKTPQ
Sbjct: 142 QYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKDLLPFRHGVHLSKEQCPKTPQ 201

Query: 181 EVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRT 240
           E+EDMRRI YASAVGSLMY ML TRPDICYAVGIVSRY  NPGLDHWTAVKI+LKYLRRT
Sbjct: 202 EIEDMRRILYASAVGSLMYDMLYTRPDICYAVGIVSRYLFNPGLDHWTAVKIILKYLRRT 261

Query: 241 RDYMLVYGAKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTME 300
           RDYMLVYG KDLILTGYTDSDFQTDKDSRKSTSGSVFTLN GAVVW SIKQGCIADSTME
Sbjct: 262 RDYMLVYGGKDLILTGYTDSDFQTDKDSRKSTSGSVFTLNRGAVVWHSIKQGCIADSTME 321

Query: 301 AEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIER 360
           AEY+AACEAAKE VWLRKFLHDLEVVPNMNL ITLYCDNSGAVANSKEPR+HKRGKHIER
Sbjct: 322 AEYIAACEAAKEVVWLRKFLHDLEVVPNMNLSITLYCDNSGAVANSKEPRNHKRGKHIER 381

Query: 361 KYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR 398
           KYHLIREIVQR DVIVTKI SEH I DPFTKTLTAKVFEGHLESLGLRDMYIR
Sbjct: 382 KYHLIREIVQRRDVIVTKITSEHKITDPFTKTLTAKVFEGHLESLGLRDMYIR 434

BLAST of Cmc04g0102941 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 198.4 bits (503), Expect = 1.1e-50
Identity = 121/366 (33.06%), Postives = 198/366 (54.10%), Query Frame = 0

Query: 1   MDVKTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDT 60
           +D+  AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F  
Sbjct: 193 LDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSV 252

Query: 61  AIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKD 120
            +  +GF Q+  +   + KI       +++YVDDI++  N+   + ++K+ L + F+++D
Sbjct: 253 TLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRD 312

Query: 121 LGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLAWSSLVLGQSPKTPQEV---------- 180
           LG  +Y LG++I R      + + Q  Y   LL  + L+  +    P +           
Sbjct: 313 LGPLKYFLGLEIARSAAG--INICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGG 372

Query: 181 EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRD 240
           + +    Y   +G LMY  + TR DI +AV  +S++   P L H  AV  +L Y++ T  
Sbjct: 373 DFVDAKAYRRLIGRLMYLQI-TRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVG 432

Query: 241 YMLVYGAK-DLILTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEA 300
             L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    + W+S KQ  ++ S+ EA
Sbjct: 433 QGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEA 492

Query: 301 EYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERK 352
           EY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE  
Sbjct: 493 EYRALSFATDEMMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIESD 552

BLAST of Cmc04g0102941 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 101.3 bits (251), Expect = 1.9e-21
Identity = 75/227 (33.04%), Postives = 115/227 (50.66%), Query Frame = 0

Query: 83  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQA 142
           +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ 
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQI--KTHPSGLFLSQT 61

Query: 143 TYIDKLLAWSSLVLGQSPKTPQEVE-----DMRRIP----YASAVGSLMYAMLCTRPDIC 202
            Y +++L  + ++  +   TP  ++        + P    + S VG+L Y  L TRPDI 
Sbjct: 62  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTL-TRPDIS 121

Query: 203 YAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTDKDS 262
           YAV IV +    P L  +  +K VL+Y++ T  + + ++    L +  + DSD+     +
Sbjct: 122 YAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTST 181

Query: 263 RKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW 300
           R+ST+G    L    + W + +Q  ++ S+ E EY A    A E  W
Sbjct: 182 RRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of Cmc04g0102941 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 44.3 bits (103), Expect = 2.7e-04
Identity = 27/72 (37.50%), Postives = 40/72 (55.56%), Query Frame = 0

Query: 188 TRPDICYAVGIVSRYQSNPGLDHWTAVKIVLKYLRRTRDYMLVYGA-KDLILTGYTDSDF 247
           TRPD+ +AV  +S++ S        AV  VL Y++ T    L Y A  DL L  + DSD+
Sbjct: 6   TRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFADSDW 65

Query: 248 QTDKDSRKSTSG 259
            +  D+R+S +G
Sbjct: 66  ASCPDTRRSVTG 77

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0025945.16.9e-22094.67gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0059226.16.9e-22094.67gag/pol protein [Cucumis melo var. makuwa][more]
KAA0035907.11.9e-21793.46gag/pol protein [Cucumis melo var. makuwa][more]
KAA0061170.17.4e-20692.68gag/pol protein [Cucumis melo var. makuwa][more]
KAA0040367.12.1e-20588.62gag/pol protein [Cucumis melo var. makuwa] >TYK23337.1 gag/pol protein [Cucumis ... [more]
Match NameE-valueIdentityDescription
P109783.7e-9945.83Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041465.7e-6837.44Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT944.9e-5933.83Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW29.2e-5833.83Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P256002.6e-3634.19Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A5A7TZD03.3e-22094.67Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7UYE83.3e-22094.67Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G0015... [more]
A0A5A7T2V99.1e-21893.46Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760... [more]
A0A5A7V1F53.6e-20692.68Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold753G0044... [more]
A0A5D3DI921.0e-20588.62Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G0039... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.1e-5033.06cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.11.9e-2133.04DNA/RNA polymerases superfamily protein [more]
ATMG00240.12.7e-0437.50Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 1..155
e-value: 9.2E-42
score: 143.3
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 1..235
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 239..380
e-value: 2.47381E-69
score: 213.099
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 1..347

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc04g0102941.1Cmc04g0102941.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding