Cmc02g0043691 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc02g0043691
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr02: 6174739 .. 6176955 (-)
RNA-Seq Expression: Cmc02g0043691
Synteny: Cmc02g0043691
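
The Location field packs a sequence identifier, a coordinate range, and a strand into one string. The following is a minimal sketch of how that string could be split and the span length derived. The `parse_location` and `span_length` helpers and the regular expression are illustrative assumptions, not part of the database's tooling, and the coordinates are assumed to be 1-based and inclusive.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'CMiso1.1chr02: 6174739 .. 6176955 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return loc.end - loc.start + 1


loc = parse_location("CMiso1.1chr02: 6174739 .. 6176955 (-)")
print(loc.seqid, loc.strand, span_length(loc))  # CMiso1.1chr02 - 2217
```

Under that assumption the feature spans 2217 bases on the minus strand of CMiso1.1chr02.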
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTTATTACTTACTCAGAAATCTCAAAATGAGAGCAAATTAATCAGCGAAAATGCTCTACCTTATGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGACCAACGAAAACAACTATCACAACAATCATTTCTACAATCAAAGGGGTGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGCCAAATCTGTACAAATTTTGGACATAGTGCTGATCGCTGCTTCTTTCGATATACTACAAGATCAAATTCATCAGGTTACTCACCGAACTCACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTACCCCCGACCTGAATATTGACAGCAATTGGTATCCTGATTCGGGAGCTACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGGTCTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTACATTACCATTTAAATCGTTTACACTCAATAACTTACTTCATGTTCCATCTATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAATTTTACCCCACTTTATGTTATGTGAAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTCAACCATCACATAAAAGACTTCACCATTCTGACTCAAACACCAAGTCCGTTTTCAATACCGTCGTACCTAAATCTAATACTCCCTTACTTGATCTATGGCATAGAAGACTAGGTCATCCCCATTTACCTACTGTTAAAGCTGTTTTGAATCACATTGACCATTCTTCTGGTACTATAAATAAACTGAATTTTTATGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTTCCTTACTCTTTATACACATCCTTTACAACTTATCACTTGTGATTTATGGGGTCCTGCTGTAAATATATCTCATAATAGTTTTAGATATTACATAAGTTTTTTTGATACCTATAGTAGATACACCTGGATATATTTCTTACATTCCAAGTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGATATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTTCACTAGTGTCTATCTCATAAATCTTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAAGCTACTTTGCCGGAAACCTAACTTTCCTTTTCTTAGAGTTTTTGGCTGCAAGTGTTATCCCTACTTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAAAAATGTTCTATCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATTATCTAAACTCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTACTCAATCTCCAAGTCCTATGGAACCTCCGCATCAAACTGATTCTGGTATGAATACTCAACTTCAATCTACCTC
TATTCATCCCATGATAACACAGAGTAAGCATGATATTTTTAAACCAAAAGCATTCTTGATTGATTATACTCAAACTGAAACTTGCAATGCCAAGGAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGGAGCCTTATTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATAG

mRNA sequence

ATGTCTTTATTACTTACTCAGAAATCTCAAAATGAGAGCAAATTAATCAGCGAAAATGCTCTACCTTATGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGACCAACGAAAACAACTATCACAACAATCATTTCTACAATCAAAGGGGTGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGCCAAATCTGTACAAATTTTGGACATAGTGCTGATCGCTGCTTCTTTCGATATACTACAAGATCAAATTCATCAGGTTACTCACCGAACTCACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTACCCCCGACCTGAATATTGACAGCAATTGGTATCCTGATTCGGGAGCTACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGGTCTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTACATTACCATTTAAATCGTTTACACTCAATAACTTACTTCATGTTCCATCTATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAATTTTACCCCACTTTATGTTATGTGAAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTCAACCATCACATAAAAGACTTCACCATTCTGACTCAAACACCAAGTCCGTTTTCAATACCGTCGTACCTAAATCTAATACTCCCTTACTTGATCTATGGCATAGAAGACTAGGTCATCCCCATTTACCTACTGTTAAAGCTGTTTTGAATCACATTGACCATTCTTCTGGTACTATAAATAAACTGAATTTTTATGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTTCCTTACTCTTTATACACATCCTTTACAACTTATCACTTGTGATTTATGGGGTCCTGCTGTAAATATATCTCATAATAGTTTTAGATATTACATAAGTTTTTTTGATACCTATAGTAGATACACCTGGATATATTTCTTACATTCCAAGTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGATATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTTCACTAGTGTCTATCTCATAAATCTTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAAGCTACTTTGCCGGAAACCTAACTTTCCTTTTCTTAGAGTTTTTGGCTGCAAGTGTTATCCCTACTTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAAAAATGTTCTATCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATTATCTAAACTCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTACTCAATCTCCAAGTCCTATGGAACCTCCGCATCAAACTGATTCTGGTATGAATACTCAACTTCAATCTACCTC
TATTCATCCCATGATAACACAGAGTAAGCATGATATTTTTAAACCAAAAGCATTCTTGATTGATTATACTCAAACTGAAACTTGCAATGCCAAGGAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGGAGCCTTATTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATAG

Coding sequence (CDS)

ATGTCTTTATTACTTACTCAGAAATCTCAAAATGAGAGCAAATTAATCAGCGAAAATGCTCTACCTTATGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAATCTTACATAAGGACCAACGAAAACAACTATCACAACAATCATTTCTACAATCAAAGGGGTGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGCCAAATCTGTACAAATTTTGGACATAGTGCTGATCGCTGCTTCTTTCGATATACTACAAGATCAAATTCATCAGGTTACTCACCGAACTCACATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTACCCCCGACCTGAATATTGACAGCAATTGGTATCCTGATTCGGGAGCTACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGGTCTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTACATTACCATTTAAATCGTTTACACTCAATAACTTACTTCATGTTCCATCTATTACCAAAAACTTAATCAGTGTTTCACAATTTGCCAAAGATAATCATGTTTTCTTTGAATTTTACCCCACTTTATGTTATGTGAAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTTAATGATGGGCTCTACAAATTTACCATTCAACCATCACATAAAAGACTTCACCATTCTGACTCAAACACCAAGTCCGTTTTCAATACCGTCGTACCTAAATCTAATACTCCCTTACTTGATCTATGGCATAGAAGACTAGGTCATCCCCATTTACCTACTGTTAAAGCTGTTTTGAATCACATTGACCATTCTTCTGGTACTATAAATAAACTGAATTTTTATGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTTCCTTACTCTTTATACACATCCTTTACAACTTATCACTTGTGATTTATGGGGTCCTGCTGTAAATATATCTCATAATAGTTTTAGATATTACATAAGTTTTTTTGATACCTATAGTAGATACACCTGGATATATTTCTTACATTCCAAGTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGATATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTTCACTAGTGTCTATCTCATAAATCTTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAAGCTACTTTGCCGGAAACCTAACTTTCCTTTTCTTAGAGTTTTTGGCTGCAAGTGTTATCCCTACTTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCAGATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAAAAATGTTCTATCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATTATCTAAACTCTACTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTACTCAATCTCCAAGTCCTATGGAACCTCCGCATCAAACTGATTCTGGTATGAATACTCAACTTCAATCTACCTC
TATTCATCCCATGATAACACAGAGTAAGCATGATATTTTTAAACCAAAAGCATTCTTGATTGATTATACTCAAACTGAAACTTGCAATGCCAAGGAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGGAGCCTTATTCCACAAAATCCTAATCAGAAAATTGTTGGTTGCAAATAG

Protein sequence

MSLLLTQKSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKAVLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNSFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVGCK
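
The protein sequence above corresponds to a translation of the coding sequence. As a hedged illustration, the sketch below translates a nucleotide string with the standard genetic code. The `translate` function and the `CODON_TABLE` name are assumptions introduced here, and the snippet does not claim to reproduce the annotation pipeline that generated this record. Whitespace is stripped first so that a sequence wrapped across lines can be passed in directly.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGTCTTTATTACTTACTCAGAAATCTCAA"))  # MSLLLTQKSQ
print(translate("GGTTGCAAATAG"))                    # GCK
```

The two fragments translate to the start (MSLLLTQKSQ) and end (GCK) of the listed protein sequence, with the final TAG read as a stop codon.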
Homology
BLAST of Cmc02g0043691 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 637/676 (94.23%), Positives = 649/676 (96.01%), Query Frame = 0

Query: 1   MSLLLTQKSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGN 60
           MSLLLTQ+SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGN
Sbjct: 214 MSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGN 273

Query: 61  GRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQM 120
           GRSNRG RGNRNKPQCQIC   G+SADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQM
Sbjct: 274 GRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQM 333

Query: 121 SAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSM 180
           SAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSM
Sbjct: 334 SAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSM 393

Query: 181 SFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLL 240
           SFNSSTLPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTLCYVKDLDTGQVLL
Sbjct: 394 SFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLL 453

Query: 241 QGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKA 300
           QGLLNDGLYKFTI+PSHKRLHHS+SNTK VFNTVVPKSNTPLLDLWHRRLGHPHLP VKA
Sbjct: 454 QGLLNDGLYKFTIEPSHKRLHHSNSNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKA 513

Query: 301 VLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNS 360
           VLNHID+SSGTINKLNF EACALGKHHALPFSH LTLYTHPLQLITCDLWGPAVN+SHN 
Sbjct: 514 VLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNG 573

Query: 361 FRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 420
           FRYYISF D YSRYTWIYFL+SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK
Sbjct: 574 FRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 633

Query: 421 PFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLI 480
           PFLDQHGIEHRITCPYTSKQNDIVERKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLI
Sbjct: 634 PFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLI 693

Query: 481 NLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYST 540
           N LPTPVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYST
Sbjct: 694 NRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYST 753

Query: 541 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNH 600
           SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSK+VLSPPLHSIIPSSLMNH
Sbjct: 754 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKDVLSPPLHSIIPSSLMNH 813

Query: 601 NEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMN 660
           NEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMN
Sbjct: 814 NEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPSHQTDSGMN 873

Query: 661 TQLQSTSIHPMITQSK 677
           TQLQSTSIHPMITQSK
Sbjct: 874 TQLQSTSIHPMITQSK 889
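
The percentages in each HSP summary line are the identical or positive-scoring positions divided by the alignment length. A minimal sketch of recomputing them from a summary line follows. The `hsp_percentages` helper and its regular expression are hypothetical names introduced for illustration, and the pattern tolerates both spellings of the "Positives" label.

```python
import re
from typing import Tuple

# Matches e.g. "Identity = 637/676 (94.23%), Positives = 649/676 (96.01%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> Tuple[float, float]:
    """Return (percent identity, percent positives) from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 637/676 (94.23%), Positives = 649/676 (96.01%), Query Frame = 0"
print(hsp_percentages(line))  # (94.23, 96.01)
```

For the TYK10642.1 hit this reproduces the reported 94.23% identity and 96.01% positives over 676 aligned positions.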

BLAST of Cmc02g0043691 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 1286.6 bits (3328), Expect = 0.0e+00
Identity = 636/676 (94.08%), Positives = 648/676 (95.86%), Query Frame = 0

Query: 1   MSLLLTQKSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGN 60
           MSLLLTQ+SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGN
Sbjct: 214 MSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGN 273

Query: 61  GRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQM 120
           GRSNRG RGNRNKPQCQIC   G+SADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQM
Sbjct: 274 GRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQM 333

Query: 121 SAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSM 180
           SAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSM
Sbjct: 334 SAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSM 393

Query: 181 SFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLL 240
           SFNSSTLPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTLCYVKDLDTGQVLL
Sbjct: 394 SFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLL 453

Query: 241 QGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKA 300
           QGLLNDGLYKFTI+PSHKRLHHS+SNTK VFNTVVPKSNTPLLDLWHRRLGHPHLP VKA
Sbjct: 454 QGLLNDGLYKFTIEPSHKRLHHSNSNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKA 513

Query: 301 VLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNS 360
           VLNHID+SSGTINKLNF EACALGKHHALPFSH LTLYTHPLQLITCDLWGPAVN+SHN 
Sbjct: 514 VLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNG 573

Query: 361 FRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 420
           FRYYISF D YSRYTWIYFL+SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK
Sbjct: 574 FRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 633

Query: 421 PFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLI 480
           PFLDQHGIEHRITCPYTSKQNDIVERKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLI
Sbjct: 634 PFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLI 693

Query: 481 NLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYST 540
           N LPTPVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYST
Sbjct: 694 NRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYST 753

Query: 541 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNH 600
           SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSK+VLSPPLHSIIPSSLMNH
Sbjct: 754 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSPPLHSIIPSSLMNH 813

Query: 601 NEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMN 660
           NEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMN
Sbjct: 814 NEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPSHQTDSGMN 873

Query: 661 TQLQSTSIHPMITQSK 677
           TQLQSTSIHPMITQSK
Sbjct: 874 TQLQSTSIHPMITQSK 889

BLAST of Cmc02g0043691 vs. NCBI nr
Match: KAA0067212.1 (retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa])

HSP 1 Score: 997.3 bits (2577), Expect = 7.1e-287
Identity = 508/620 (81.94%), Positives = 508/620 (81.94%), Query Frame = 0

Query: 119 QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYG 178
           QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYG
Sbjct: 41  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYG 100

Query: 179 SMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQV 238
           SMSFNSSTLPFKSFTLNNLLH                                DLDTGQV
Sbjct: 101 SMSFNSSTLPFKSFTLNNLLH--------------------------------DLDTGQV 160

Query: 239 LLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTV 298
           LLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTV
Sbjct: 161 LLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTV 220

Query: 299 KAVLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISH 358
           KAVLNHIDHSS                                                 
Sbjct: 221 KAVLNHIDHSS------------------------------------------------- 280

Query: 359 NSFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP 418
                                         AFQKFKTCVEKSLGQSIKSLQTDGGTEFKP
Sbjct: 281 ------------------------------AFQKFKTCVEKSLGQSIKSLQTDGGTEFKP 340

Query: 419 FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY 478
           FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY
Sbjct: 341 FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY 400

Query: 479 LINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY 538
           LINLLPTPVLDNISPLEKL CRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY
Sbjct: 401 LINLLPTPVLDNISPLEKLFCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY 460

Query: 539 STSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLM 598
           STSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLM
Sbjct: 461 STSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLM 520

Query: 599 NHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSG 658
           NHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSG
Sbjct: 521 NHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSG 549

Query: 659 MNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQ 718
           MNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQ
Sbjct: 581 MNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQ 549

Query: 719 KNGTWSLIPQNPNQKIVGCK 739
           KNGTWSLIPQNPNQKIVGCK
Sbjct: 641 KNGTWSLIPQNPNQKIVGCK 549

BLAST of Cmc02g0043691 vs. NCBI nr
Match: TYK18915.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 577.8 bits (1488), Expect = 1.3e-160
Identity = 280/287 (97.56%), Positives = 282/287 (98.26%), Query Frame = 0

Query: 452 MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFG 511
           MEMGLTLLSQATLPLSFWDEAF TSVYLINLLPTPVLDNISPLEK+  RKPNFPFLRVFG
Sbjct: 1   MEMGLTLLSQATLPLSFWDEAFSTSVYLINLLPTPVLDNISPLEKVFFRKPNFPFLRVFG 60

Query: 512 CKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS 571
           CKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS
Sbjct: 61  CKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS 120

Query: 572 FASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQE 631
           FASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLN TIVYPLETGTQE
Sbjct: 121 FASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNPTIVYPLETGTQE 180

Query: 632 SSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQ 691
           SSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQ
Sbjct: 181 SSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQ 240

Query: 692 TETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVGCK 739
           TETCNAKEAFNHPHWKKAMEEEF+ALQKNGTWSLIPQNPNQKIVGCK
Sbjct: 241 TETCNAKEAFNHPHWKKAMEEEFKALQKNGTWSLIPQNPNQKIVGCK 287

BLAST of Cmc02g0043691 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 560.8 bits (1444), Expect = 1.7e-155
Identity = 348/796 (43.72%), Positives = 454/796 (57.04%), Query Frame = 0

Query: 16   ISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFY--NQRGG----RGNGRSNRG-GR 75
            IS N L  VN  +Q + +G  S    N N Y ++ F   NQ GG    RG+   NRG GR
Sbjct: 343  ISSNDLS-VNYTSQYSNRGPSS--SWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRGRGR 402

Query: 76   GNRN---KPQCQICTNFGHSADRCFFRYT--------------------TRSNSSGYSPN 135
            G      KPQCQ+C  FGH+  RCF+RY                      R+ +SG   +
Sbjct: 403  GRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGSISS 462

Query: 136  SHNTSYTNMN-----NHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGG 195
            + N + T  +     ++ +M AMVATP+   +  W+PDSGATNH+TH L NL++G+EY G
Sbjct: 463  AGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLNSGAEYNG 522

Query: 196  GNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVF 255
             ++I+  NG+GL I+H G   F SS+ P K   L N+L VP+I KNL+SVSQFA+DN+V+
Sbjct: 523  NSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDNNVY 582

Query: 256  FEFYPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF------- 315
            FEF+P +C+VKD     +LLQG L+ GLY+F +      K    S SN K+         
Sbjct: 583  FEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASL 642

Query: 316  ----NTVVPK---SNTPLLDLWHRRLGHPHLPTVKAVLNHIDHSSGTINKLNFYEACALG 375
                N+  P+   S+  + DLWH+RLGHP    V  VLN       T +  +   AC LG
Sbjct: 643  VHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLG 702

Query: 376  KHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNSFRYYISFFDTYSRYTWIYFLHSKS 435
            K H LPF    T+YT PLQL+  DLWGPA   S   F YY+SF D YSRYTW+YFL +KS
Sbjct: 703  KSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKS 762

Query: 436  DAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIV 495
                AF  FK   E   G  +K+ QTD G EF+  K + +Q+GI HR++CP+TSKQN I+
Sbjct: 763  QTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGII 822

Query: 496  ERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNF 555
            ERKHRHI+E+GLTLL+QA+LPL +W +AF T+V+LIN LPT VL    P E L   KPN+
Sbjct: 823  ERKHRHIVELGLTLLAQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNY 882

Query: 556  PFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDE 615
              L+VFGC C+P+ RPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE
Sbjct: 883  SQLKVFGCLCFPHLRPYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSVVFDE 942

Query: 616  NSFPYA-------SFASHSS-----IPKSKNV--------LSPPLHSIIPSSLMNHN--E 675
              FP+A          SHS+     IP  KN+        LS P  S   S  ++ N   
Sbjct: 943  TRFPFADRLQKPVQIVSHSTVGLPCIPLVKNLEPLSVSPSLSLPTSSAQSSHQLDENLGS 1002

Query: 676  DRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQ 735
            D R       NTD  ++  +         SS      G I  S +  EP    ++   T 
Sbjct: 1003 DIRSVQQDLSNTDSSSTVPILNESASIPSSSNLYALPGTIPLSTNSDEPNESINTRPVTF 1062

Query: 736  LQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGT 739
             Q    H M+T+SK+ IFKPK + +D    E    +EA +HP WK+AM+EEF AL KN T
Sbjct: 1063 PQQP--HHMVTRSKNGIFKPKVYTVDLNVEEPNTFQEAISHPKWKEAMDEEFRALMKNKT 1122

BLAST of Cmc02g0043691 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 370.2 bits (949), Expect = 5.6e-101
Identity = 279/824 (33.86%), Positives = 398/824 (48.30%), Query Frame = 0

Query: 12  ESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNG--RSNRGGRG 71
           ESKL++ N+   V I T        +    N+NN  +N  YN    R N    S+ G R 
Sbjct: 188 ESKLLALNSAEVVPI-TANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRS 247

Query: 72  NRNKP-----QCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMV 131
           +  +P     +CQIC+  GHSA RC   +  +S ++     S  T +      P+ +  V
Sbjct: 248 DNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPW-----QPRANLAV 307

Query: 132 ATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNS 191
            +P     +NW  DSGAT+H+T   +NLS    Y GG+ +  A+GS +PITH GS S  +
Sbjct: 308 NSP--YNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPT 367

Query: 192 STLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLLQGLL 251
           S+   +S  LN +L+VP+I KNLISV +    N V  EF+P    VKDL+TG  LLQG  
Sbjct: 368 SS---RSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKT 427

Query: 252 NDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKAVLNH 311
            D LY++ I         + S   S+F +   K+       WH RLGHP L  + +V++ 
Sbjct: 428 KDELYEWPI---------ASSQAVSMFASPCSKATH---SSWHSRLGHPSLAILNSVIS- 487

Query: 312 IDHSSGTIN---KLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNSF 371
            +HS   +N   KL     C + K H +PFS+     + PL+ I  D+W   + +S +++
Sbjct: 488 -NHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPI-LSIDNY 547

Query: 372 RYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKP 431
           RYY+ F D ++RYTW+Y L  KS     F  FK+ VE      I +L +D G EF   + 
Sbjct: 548 RYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRD 607

Query: 432 FLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLIN 491
           +L QHGI H  + P+T + N + ERKHRHI+EMGLTLLS A++P ++W  AF  +VYLIN
Sbjct: 608 YLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLIN 667

Query: 492 LLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTS 551
            LPTP+L   SP +KL  + PN+  L+VFGC CYP+ RPY  HKL  +S  C F+GYS +
Sbjct: 668 RLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLT 727

Query: 552 HKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFASHSSIPKSK 611
              Y CL    GRL+ SRHV FDE  FP++                  ++ SH+++P + 
Sbjct: 728 QSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTP 787

Query: 612 NVLSPP-------------------------LHSIIPSS--------------------- 671
            VL  P                           S +PSS                     
Sbjct: 788 LVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPT 847

Query: 672 ---------------LMNHNEDRRHTDTVSDNTDYLNSTIVYP-LETGTQESSRDDGNSG 731
                          L N N +    ++ + N+    S I  P + T +   S  +  S 
Sbjct: 848 AQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSS 907

Query: 732 GITQSPSPMEPPHQTDSGMNTQLQS-TSIHPMITQSKHDIFKPKAFLIDYT----QTETC 739
             T +P P+ P       +    Q+  + H M T++K  I KP       T     +E  
Sbjct: 908 SSTSTP-PLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPR 967

BLAST of Cmc02g0043691 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 1.6e-95
Identity = 245/714 (34.31%), Positives = 364/714 (50.98%), Query Frame = 0

Query: 11  NESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGR------SN 70
           +ESK+++ ++   + I T        +    N NN + N+ Y+ R    N +      +N
Sbjct: 206 HESKILAVSSATVIPI-TANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKPWQQSSTN 265

Query: 71  RGGRGNRNKP---QCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMS 130
                N++KP   +CQIC   GHSA RC       S+ +   P S  T +      P+ +
Sbjct: 266 FHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPW-----QPRAN 325

Query: 131 AMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMS 190
             + +P     +NW  DSGAT+H+T   +NLS    Y GG+ +  A+GS +PI+H GS S
Sbjct: 326 LALGSP--YSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTS 385

Query: 191 FNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLLQ 250
            ++ + P     L+N+L+VP+I KNLISV +    N V  EF+P    VKDL+TG  LLQ
Sbjct: 386 LSTKSRP---LNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQ 445

Query: 251 GLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKAV 310
           G   D LY++ I  S      +  ++K+  ++            WH RLGHP    + +V
Sbjct: 446 GKTKDELYEWPIASSQPVSLFASPSSKATHSS------------WHARLGHPAPSILNSV 505

Query: 311 LNHIDHSSGTINKLNFYEACA---LGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISH 370
           ++  ++S   +N  + + +C+   + K + +PFS      T PL+ I  D+W   + +SH
Sbjct: 506 IS--NYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSSPI-LSH 565

Query: 371 NSFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP 430
           +++RYY+ F D ++RYTW+Y L  KS     F  FK  +E      I +  +D G EF  
Sbjct: 566 DNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVA 625

Query: 431 FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY 490
              +  QHGI H  + P+T + N + ERKHRHI+E GLTLLS A++P ++W  AF  +VY
Sbjct: 626 LWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVY 685

Query: 491 LINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY 550
           LIN LPTP+L   SP +KL    PN+  LRVFGC CYP+ RPY  HKL  +S  C FLGY
Sbjct: 686 LINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGY 745

Query: 551 STSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSIP 610
           S +   Y CL     RL+ISRHV FDEN FP++++                  + H+++P
Sbjct: 746 SLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLP 805

Query: 611 KSKNVL-----SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTI-------VYPLET 670
               VL     S P H+  P S  + +   R++   S N D   S+          P + 
Sbjct: 806 TRTPVLPAPSCSDPHHAATPPS--SPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQN 865

Query: 671 GTQ------ESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQS 676
           G Q      ++     +S   +Q+    E P Q    ++T  QS+S  P  T S
Sbjct: 866 GPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTS 891

BLAST of Cmc02g0043691 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 1.8e-46
Identity = 194/719 (26.98%), Positives = 303/719 (42.14%), Query Frame = 0

Query: 31  TEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHSADRCF 90
           TE    SY R++ N       Y + G RG  + NR     RN   C  C   GH    C 
Sbjct: 198 TEGRGRSYQRSSNN-------YGRSGARGKSK-NRSKSRVRN---CYNCNQPGHFKRDCP 257

Query: 91  FRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNI---DSNWYPDSGATNHLTH 150
                +  +SG   + +  +    N++  +        +++   +S W  D+ A++H T 
Sbjct: 258 NPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHAT- 317

Query: 151 SLSNLSTGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLHVPSITKNL 210
            + +L      G    +   N S   I   G +   ++     +  L ++ HVP +  NL
Sbjct: 318 PVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNV--GCTLVLKDVRHVPDLRMNL 377

Query: 211 ISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSDSNT 270
           IS     +D    +E Y      +      V+ +G+    LY+   +     L+ +    
Sbjct: 378 ISGIALDRDG---YESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQDEI 437

Query: 271 KSVFNTVVPKSNTPLLDLWHRRLGHPHLP--TVKAVLNHIDHSSGTINKLNFYEACALGK 330
                          +DLWH+R+GH       + A  + I ++ GT  K   Y  C  GK
Sbjct: 438 S--------------VDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDY--CLFGK 497

Query: 331 HHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNSFRYYISFFDTYSRYTWIYFLHSKSD 390
            H + F        + L L+  D+ GP    S    +Y+++F D  SR  W+Y L +K  
Sbjct: 498 QHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQ 557

Query: 391 AFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNDI 450
            F  FQKF   VE+  G+ +K L++D G E+  + F+ +   HGI H  T P T + N +
Sbjct: 558 VFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGV 617

Query: 451 VERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPN 510
            ER +R I+E   ++L  A LP SFW EA  T+ YLIN  P+  L    P      ++ +
Sbjct: 618 AERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVS 677

Query: 511 FPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFI-SRHVLF 570
           +  L+VFGC+ + +    Q  KL  +S PC F+GY     GY+      +  I SR V+F
Sbjct: 678 YSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVF 737

Query: 571 DENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIV 630
            E+    A+  S     K KN + P   + IPS+  N       TD VS+  +       
Sbjct: 738 RESEVRTAADMSE----KVKNGIIPNFVT-IPSTSNNPTSAESTTDEVSEQGE------- 797

Query: 631 YPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKP 690
            P E   Q    D+G            E  H T      Q    S  P +   +   +  
Sbjct: 798 QPGEVIEQGEQLDEGVE----------EVEHPTQGEEQHQPLRRSERPRVESRR---YPS 857

Query: 691 KAFLIDYTQTETCNAKEAFNHP---HWKKAMEEEFEALQKNGTWSLIPQNPNQKIVGCK 739
             +++     E  + KE  +HP      KAM+EE E+LQKNGT+ L+     ++ + CK
Sbjct: 858 TEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCK 858

BLAST of Cmc02g0043691 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 120.2 bits (300), Expect = 1.0e-25
Identity = 143/561 (25.49%), Positives = 230/561 (41.00%), Query Frame = 0

Query: 30  TTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGN-RNKPQCQICTNFGHSADR 89
           T++K   + +  N N Y NN F N+       +  +  +GN + K +C  C   GH    
Sbjct: 190 TSKKVMNAIVHNNNNTYKNNLFKNR-----VTKPKKIFKGNSKYKVKCHHCGREGHIKKD 249

Query: 90  CFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHS 149
           CF      +N +    N         +    M   V    +  +  +  DSGA++HL + 
Sbjct: 250 CFHYKRILNNKN--KENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLIND 309

Query: 150 LSNLSTGSEYGGGNQI-YAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLHVPSITKNL 209
            S  +   E     +I  A  G  +  T  G +   +        TL ++L       NL
Sbjct: 310 ESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRND----HEITLEDVLFCKEAAGNL 369

Query: 210 ISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLLQGLLND----GLYKFTIQPSHKRLHHS 269
           +SV +  ++  +  EF  +   +       V   G+LN+        ++I   HK     
Sbjct: 370 MSVKRL-QEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPVINFQAYSINAKHK----- 429

Query: 270 DSNTKSVFNTVVPKSNTPLLDLWHRRLGH-----------PHLPTVKAVLNHIDHSSGTI 329
            +N +                LWH R GH            ++ + +++LN+++ S    
Sbjct: 430 -NNFR----------------LWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELS---- 489

Query: 330 NKLNFYEACALGKHHALPFSHF--LTLYTHPLQLITCDLWGPAVNISHNSFRYYISFFDT 389
                 E C  GK   LPF      T    PL ++  D+ GP   ++ +   Y++ F D 
Sbjct: 490 --CEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQ 549

Query: 390 YSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGI 449
           ++ Y   Y +  KSD F  FQ F    E      +  L  D G E+     + F  + GI
Sbjct: 550 FTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGI 609

Query: 450 EHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVL 509
            + +T P+T + N + ER  R I E   T++S A L  SFW EA  T+ YLIN +P+  L
Sbjct: 610 SYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRAL 669

Query: 510 DNIS--PLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYK 567
            + S  P E    +KP    LRVFG   Y + +  Q  K   +S    F+GY  +  G+K
Sbjct: 670 VDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQG-KFDDKSFKSIFVGYEPN--GFK 707

BLAST of Cmc02g0043691 vs. ExPASy Swiss-Prot
Match: Q03494 (Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-DR2 PE=3 SV=2)

HSP 1 Score: 84.7 bits (208), Expect = 4.7e-15
Identity = 113/458 (24.67%), Positives = 193/458 (42.14%), Query Frame = 0

Query: 68  RGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAM--VA 127
           R N +KP+     N   S+        +R NS   + ++ ++ Y + +N   +      +
Sbjct: 385 RTNSSKPRAAKAHNIATSSK------FSRVNSDHINESTVSSQYLSDDNELSLGQQQKES 444

Query: 128 TPDLNIDSN------WYPDSGATNHL---THSLSNLSTGSEYGGGNQIYAANGSGLPITH 187
            P   IDSN         DSGA+  L    H L + +  SE      I  A    +PI  
Sbjct: 445 KPTRTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNSEI----NIVDAQKQDIPINA 504

Query: 188 YGSMSFNSSTLPFKSFTLNNL--LHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVK--- 247
            G++ FN     F++ T  ++  LH P+I  +L+S+S+ A  N        T C+ +   
Sbjct: 505 IGNLHFN-----FQNGTKTSIKALHTPNIAYDLLSLSELANQN-------ITACFTRNTL 564

Query: 248 DLDTGQVLLQGLLNDGLY---KFTIQPSH--KRLHHSDSNTKSVFNTVVPKSNTPLLDLW 307
           +   G VL   + +   Y   K  + PSH  K   ++ + +KSV     P        L 
Sbjct: 565 ERSDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYP--------LI 624

Query: 308 HRRLGHPHLPTV-----KAVLNHIDHSSGTINKLNFYEA--CALG---KHHALPFSHFLT 367
           HR LGH +  ++     K  + ++  S    +  + Y+   C +G   KH  +  S    
Sbjct: 625 HRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKY 684

Query: 368 LYTH-PLQLITCDLWGPAVNISHNSFRYYISFFDTYSRYTWIYFLHSKSDAFL--AFQKF 427
             ++ P Q +  D++GP  ++  ++  Y+ISF D  +R+ W+Y LH + +  +   F   
Sbjct: 685 QESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSI 744

Query: 428 KTCVEKSLGQSIKSLQTDGGTEF--KPFKPFLDQHGIEHRITCPYTSKQNDIVERKHRHI 487
              ++      +  +Q D G+E+  K    F    GI    T    S+ + + ER +R +
Sbjct: 745 LAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTL 804

Query: 488 MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLD 490
           +    TLL  + LP   W  A   S  + N L +P  D
Sbjct: 805 LNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVSPKND 812

BLAST of Cmc02g0043691 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 1288.5 bits (3333), Expect = 0.0e+00
Identity = 637/676 (94.23%), Positives = 649/676 (96.01%), Query Frame = 0

Query: 1   MSLLLTQKSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGN 60
           MSLLLTQ+SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGN
Sbjct: 214 MSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGN 273

Query: 61  GRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQM 120
           GRSNRG RGNRNKPQCQIC   G+SADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQM
Sbjct: 274 GRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQM 333

Query: 121 SAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSM 180
           SAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSM
Sbjct: 334 SAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSM 393

Query: 181 SFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLL 240
           SFNSSTLPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTLCYVKDLDTGQVLL
Sbjct: 394 SFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLL 453

Query: 241 QGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKA 300
           QGLLNDGLYKFTI+PSHKRLHHS+SNTK VFNTVVPKSNTPLLDLWHRRLGHPHLP VKA
Sbjct: 454 QGLLNDGLYKFTIEPSHKRLHHSNSNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKA 513

Query: 301 VLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNS 360
           VLNHID+SSGTINKLNF EACALGKHHALPFSH LTLYTHPLQLITCDLWGPAVN+SHN 
Sbjct: 514 VLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNG 573

Query: 361 FRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 420
           FRYYISF D YSRYTWIYFL+SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK
Sbjct: 574 FRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 633

Query: 421 PFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLI 480
           PFLDQHGIEHRITCPYTSKQNDIVERKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLI
Sbjct: 634 PFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLI 693

Query: 481 NLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYST 540
           N LPTPVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYST
Sbjct: 694 NRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYST 753

Query: 541 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNH 600
           SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSK+VLSPPLHSIIPSSLMNH
Sbjct: 754 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKDVLSPPLHSIIPSSLMNH 813

Query: 601 NEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMN 660
           NEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMN
Sbjct: 814 NEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPSHQTDSGMN 873

Query: 661 TQLQSTSIHPMITQSK 677
           TQLQSTSIHPMITQSK
Sbjct: 874 TQLQSTSIHPMITQSK 889

BLAST of Cmc02g0043691 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 1286.6 bits (3328), Expect = 0.0e+00
Identity = 636/676 (94.08%), Positives = 648/676 (95.86%), Query Frame = 0

Query: 1   MSLLLTQKSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGN 60
           MSLLLTQ+SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGN
Sbjct: 214 MSLLLTQESQNESKLISETALPSVNIVTQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGN 273

Query: 61  GRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQM 120
           GRSNRG RGNRNKPQCQIC   G+SADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQM
Sbjct: 274 GRSNRGRRGNRNKPQCQICAKLGYSADRCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQM 333

Query: 121 SAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYGSM 180
           SAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGSGLPITHYGSM
Sbjct: 334 SAMVAALDLNIDSNWYPDSGATNHLTHSLSNLSIGSEYGGGNQIYAANGSGLPITHYGSM 393

Query: 181 SFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQVLL 240
           SFNSSTLPFKSFTLNNLL VPSITKNLISVSQFAKDNHVFFEF+PTLCYVKDLDTGQVLL
Sbjct: 394 SFNSSTLPFKSFTLNNLLQVPSITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLL 453

Query: 241 QGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTVKA 300
           QGLLNDGLYKFTI+PSHKRLHHS+SNTK VFNTVVPKSNTPLLDLWHRRLGHPHLP VKA
Sbjct: 454 QGLLNDGLYKFTIEPSHKRLHHSNSNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKA 513

Query: 301 VLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNS 360
           VLNHID+SSGTINKLNF EACALGKHHALPFSH LTLYTHPLQLITCDLWGPAVN+SHN 
Sbjct: 514 VLNHIDNSSGTINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNG 573

Query: 361 FRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 420
           FRYYISF D YSRYTWIYFL+SKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK
Sbjct: 574 FRYYISFVDAYSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFK 633

Query: 421 PFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLI 480
           PFLDQHGIEHRITCPYTSKQNDIVERKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLI
Sbjct: 634 PFLDQHGIEHRITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLI 693

Query: 481 NLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYST 540
           N LPTPVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYST
Sbjct: 694 NRLPTPVLDNISPLEKLFCRKPNFPSLRVFGCKCYPYLRPYQSHKLSLRSTPCTFLGYST 753

Query: 541 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNH 600
           SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSK+VLSPPLHSIIPSSLMNH
Sbjct: 754 SHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSTPKSKDVLSPPLHSIIPSSLMNH 813

Query: 601 NEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMN 660
           NEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMN
Sbjct: 814 NEDRRHTDTVSDNTDHLNPTIVYPLETGTQESSRDDGNSGGITQSPSSMEPSHQTDSGMN 873

Query: 661 TQLQSTSIHPMITQSK 677
           TQLQSTSIHPMITQSK
Sbjct: 874 TQLQSTSIHPMITQSK 889

BLAST of Cmc02g0043691 vs. ExPASy TrEMBL
Match: A0A5A7VFQ6 (Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold418G00150 PE=4 SV=1)

HSP 1 Score: 997.3 bits (2577), Expect = 3.4e-287
Identity = 508/620 (81.94%), Positives = 508/620 (81.94%), Query Frame = 0

Query: 119 QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYG 178
           QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYG
Sbjct: 41  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHYG 100

Query: 179 SMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVFFEFYPTLCYVKDLDTGQV 238
           SMSFNSSTLPFKSFTLNNLLH                                DLDTGQV
Sbjct: 101 SMSFNSSTLPFKSFTLNNLLH--------------------------------DLDTGQV 160

Query: 239 LLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTV 298
           LLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTV
Sbjct: 161 LLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPTV 220

Query: 299 KAVLNHIDHSSGTINKLNFYEACALGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNISH 358
           KAVLNHIDHSS                                                 
Sbjct: 221 KAVLNHIDHSS------------------------------------------------- 280

Query: 359 NSFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKP 418
                                         AFQKFKTCVEKSLGQSIKSLQTDGGTEFKP
Sbjct: 281 ------------------------------AFQKFKTCVEKSLGQSIKSLQTDGGTEFKP 340

Query: 419 FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY 478
           FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY
Sbjct: 341 FKPFLDQHGIEHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVY 400

Query: 479 LINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY 538
           LINLLPTPVLDNISPLEKL CRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY
Sbjct: 401 LINLLPTPVLDNISPLEKLFCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGY 460

Query: 539 STSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLM 598
           STSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLM
Sbjct: 461 STSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLM 520

Query: 599 NHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSG 658
           NHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSG
Sbjct: 521 NHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSG 549

Query: 659 MNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQ 718
           MNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQ
Sbjct: 581 MNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQ 549

Query: 719 KNGTWSLIPQNPNQKIVGCK 739
           KNGTWSLIPQNPNQKIVGCK
Sbjct: 641 KNGTWSLIPQNPNQKIVGCK 549

BLAST of Cmc02g0043691 vs. ExPASy TrEMBL
Match: A0A5D3D5W0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold204G001310 PE=4 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 6.5e-161
Identity = 280/287 (97.56%), Positives = 282/287 (98.26%), Query Frame = 0

Query: 452 MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFG 511
           MEMGLTLLSQATLPLSFWDEAF TSVYLINLLPTPVLDNISPLEK+  RKPNFPFLRVFG
Sbjct: 1   MEMGLTLLSQATLPLSFWDEAFSTSVYLINLLPTPVLDNISPLEKVFFRKPNFPFLRVFG 60

Query: 512 CKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS 571
           CKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS
Sbjct: 61  CKCYPYLRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYAS 120

Query: 572 FASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQE 631
           FASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLN TIVYPLETGTQE
Sbjct: 121 FASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNPTIVYPLETGTQE 180

Query: 632 SSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQ 691
           SSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQ
Sbjct: 181 SSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQ 240

Query: 692 TETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVGCK 739
           TETCNAKEAFNHPHWKKAMEEEF+ALQKNGTWSLIPQNPNQKIVGCK
Sbjct: 241 TETCNAKEAFNHPHWKKAMEEEFKALQKNGTWSLIPQNPNQKIVGCK 287

BLAST of Cmc02g0043691 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 560.8 bits (1444), Expect = 8.2e-156
Identity = 348/796 (43.72%), Positives = 454/796 (57.04%), Query Frame = 0

Query: 16   ISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFY--NQRGG----RGNGRSNRG-GR 75
            IS N L  VN  +Q + +G  S    N N Y ++ F   NQ GG    RG+   NRG GR
Sbjct: 343  ISSNDLS-VNYTSQYSNRGPSS--SWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRGRGR 402

Query: 76   GNRN---KPQCQICTNFGHSADRCFFRYT--------------------TRSNSSGYSPN 135
            G      KPQCQ+C  FGH+  RCF+RY                      R+ +SG   +
Sbjct: 403  GRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGSISS 462

Query: 136  SHNTSYTNMN-----NHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGG 195
            + N + T  +     ++ +M AMVATP+   +  W+PDSGATNH+TH L NL++G+EY G
Sbjct: 463  AGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLNSGAEYNG 522

Query: 196  GNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLHVPSITKNLISVSQFAKDNHVF 255
             ++I+  NG+GL I+H G   F SS+ P K   L N+L VP+I KNL+SVSQFA+DN+V+
Sbjct: 523  NSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDNNVY 582

Query: 256  FEFYPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPS--HKRLHHSDSNTKSVF------- 315
            FEF+P +C+VKD     +LLQG L+ GLY+F +      K    S SN K+         
Sbjct: 583  FEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASL 642

Query: 316  ----NTVVPK---SNTPLLDLWHRRLGHPHLPTVKAVLNHIDHSSGTINKLNFYEACALG 375
                N+  P+   S+  + DLWH+RLGHP    V  VLN       T +  +   AC LG
Sbjct: 643  VHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLG 702

Query: 376  KHHALPFSHFLTLYTHPLQLITCDLWGPAVNISHNSFRYYISFFDTYSRYTWIYFLHSKS 435
            K H LPF    T+YT PLQL+  DLWGPA   S   F YY+SF D YSRYTW+YFL +KS
Sbjct: 703  KSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKS 762

Query: 436  DAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIV 495
                AF  FK   E   G  +K+ QTD G EF+  K + +Q+GI HR++CP+TSKQN I+
Sbjct: 763  QTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGII 822

Query: 496  ERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNF 555
            ERKHRHI+E+GLTLL+QA+LPL +W +AF T+V+LIN LPT VL    P E L   KPN+
Sbjct: 823  ERKHRHIVELGLTLLAQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNY 882

Query: 556  PFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDE 615
              L+VFGC C+P+ RPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE
Sbjct: 883  SQLKVFGCLCFPHLRPYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSVVFDE 942

Query: 616  NSFPYA-------SFASHSS-----IPKSKNV--------LSPPLHSIIPSSLMNHN--E 675
              FP+A          SHS+     IP  KN+        LS P  S   S  ++ N   
Sbjct: 943  TRFPFADRLQKPVQIVSHSTVGLPCIPLVKNLEPLSVSPSLSLPTSSAQSSHQLDENLGS 1002

Query: 676  DRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQ 735
            D R       NTD  ++  +         SS      G I  S +  EP    ++   T 
Sbjct: 1003 DIRSVQQDLSNTDSSSTVPILNESASIPSSSNLYALPGTIPLSTNSDEPNESINTRPVTF 1062

Query: 736  LQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGT 739
             Q    H M+T+SK+ IFKPK + +D    E    +EA +HP WK+AM+EEF AL KN T
Sbjct: 1063 PQQP--HHMVTRSKNGIFKPKVYTVDLNVEEPNTFQEAISHPKWKEAMDEEFRALMKNKT 1122

BLAST of Cmc02g0043691 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 50.8 bits (120), Expect = 5.3e-06
Identity = 32/77 (41.56%), Positives = 42/77 (54.55%), Query Frame = 0

Query: 671 MITQSKHDIFK--PKAFLIDYTQTETCNAKE-------AFNHPHWKKAMEEEFEALQKNG 730
           M+T+SK  I K  PK     Y+ T T   K+       A   P W +AM+EE +AL +N 
Sbjct: 1   MLTRSKAGINKLNPK-----YSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNK 60

Query: 731 TWSLIPQNPNQKIVGCK 739
           TW L+P   NQ I+GCK
Sbjct: 61  TWILVPPPVNQNILGCK 72

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
TYK10642.1	0.0e+00	94.23	Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0048297.1	0.0e+00	94.08	Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0067212.1	7.1e-287	81.94	retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]
TYK18915.1	1.3e-160	97.56	Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
RVW60229.1	1.7e-155	43.72	Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]

Match Name	E-value	Identity (%)	Description
Q9ZT94	5.6e-101	33.86	Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2	1.6e-95	34.31	Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P10978	1.8e-46	26.98	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146	1.0e-25	25.49	Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q03494	4.7e-15	24.67	Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name	E-value	Identity (%)	Description
A0A5D3CH97	0.0e+00	94.23	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5A7U233	0.0e+00	94.08	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5A7VFQ6	3.4e-287	81.94	Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuw...
A0A5D3D5W0	6.5e-161	97.56	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A438FJP6	8.2e-156	43.72	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...

Match Name	E-value	Identity (%)	Description
ATMG00820.1	5.3e-06	41.56	Reverse transcriptase (RNA-dependent DNA polymerase)
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR025724	GAG-pre-integrase domain	PFAM	PF13976	gag_pre-integrs	coord: 247..326; e-value: 5.3E-9; score: 35.8
IPR036397	Ribonuclease H superfamily	GENE3D	3.30.420.10		coord: 336..509; e-value: 1.2E-30; score: 108.3
IPR001584	Integrase, catalytic core	PFAM	PF00665	rve	coord: 339..437; e-value: 5.3E-10; score: 39.5
IPR001584	Integrase, catalytic core	PROSITE	PS50994	INTEGRASE	coord: 337..501; score: 21.021112
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 46..75
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 625..661
None	No IPR available	PANTHER	PTHR11439	GAG-POL-RELATED RETROTRANSPOSON	coord: 217..561
None	No IPR available	PANTHER	PTHR11439:SF324	RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED	coord: 217..561
IPR012337	Ribonuclease H-like superfamily	SUPERFAMILY	53098	Ribonuclease H-like	coord: 338..499

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cmc02g0043691.1	Cmc02g0043691.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0015074	DNA integration
molecular_function	GO:0003676	nucleic acid binding