Cmc04g0104881 (gene) Melon (Charmono) v1.1

Overview
NameCmc04g0104881
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationCMiso1.1chr04: 23131154 .. 23132401 (-)
RNA-Seq ExpressionCmc04g0104881
SyntenyCmc04g0104881
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTTTTTAAGACCAAACGTGACTCAAATGGCAATATTGAACGATGTAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCATTGACTACAAAGAGACTTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATTATGGTTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAAACAGTGGTATCTTAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGAGTGAGGCATCTTATGTGATTGGAATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAACTTAAGATGAACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTTCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACTAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAGCAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGAATATTCAGATTCAGATTTTGCCAGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAGTCTATTATCGCTGCATCCACTATGGAAGCTAAATTTGTAACATGCTTTGAGGCTACCAGTTCATGGTTTATGGCTGTGGAACTTTATCTCAGGACTTAG

mRNA sequence

ATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTTTTTAAGACCAAACGTGACTCAAATGGCAATATTGAACGATGTAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCATTGACTACAAAGAGACTTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATTATGGTTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAAACAGTGGTATCTTAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGAGTGAGGCATCTTATGTGATTGGAATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAACTTAAGATGAACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTTCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACTAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAGCAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGAATATTCAGATTCAGATTTTGCCAGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAGTCTATTATCGCTGCATCCACTATGGAAGCTAAATTTGTAACATGCTTTGAGGCTACCAGTTCATGGTTTATGGCTGTGGAACTTTATCTCAGGACTTAG

Coding sequence (CDS)

ATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTTTTTAAGACCAAACGTGACTCAAATGGCAATATTGAACGATGTAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCATTGACTACAAAGAGACTTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATTATGGTTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAAACAGTGGTATCTTAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGAGTGAGGCATCTTATGTGATTGGAATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAACTTAAGATGAACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTTCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACTAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAGCAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGAATATTCAGATTCAGATTTTGCCAGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAGTCTATTATCGCTGCATCCACTATGGAAGCTAAATTTGTAACATGCTTTGAGGCTACCAGTTCATGGTTTATGGCTGTGGAACTTTATCTCAGGACTTAG

Protein sequence

MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEATSSWFMAVELYLRT
Homology
BLAST of Cmc04g0104881 vs. NCBI nr
Match: KAA0052755.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 817.4 bits (2110), Expect = 5.7e-233
Identity = 408/415 (98.31%), Postives = 411/415 (99.04%), Query Frame = 0

Query: 1   MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
           MKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
           DYKETFSPVSKKDSLRIIM LVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 180
           EHMVCKLKRSIYGLKQAS+QWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 240
           DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 300
           EK KMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDT 360
           FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFA CVDT
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEATSSWFMAVELYLRT 416
           RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEA+FVTCFEATSSWFMAVELYLRT
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRT 955

BLAST of Cmc04g0104881 vs. NCBI nr
Match: TYK04201.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 817.4 bits (2110), Expect = 5.7e-233
Identity = 408/415 (98.31%), Postives = 411/415 (99.04%), Query Frame = 0

Query: 1   MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
           MKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
           DYKETFSPVSKKDSLRIIM LVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 180
           EHMVCKLKRSIYGLKQAS+QWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 240
           DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 300
           EK KMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDT 360
           FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFA CVDT
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEATSSWFMAVELYLRT 416
           RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEA+FVTCFEATSSWFMAVELYLRT
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRT 955

BLAST of Cmc04g0104881 vs. NCBI nr
Match: TYK00088.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 688.0 bits (1774), Expect = 5.2e-194
Identity = 343/370 (92.70%), Postives = 353/370 (95.41%), Query Frame = 0

Query: 8   MNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFS 67
           MND+EVWDLVEL KESKRVGCKWVFKTKRDSNGNIER KARLVAKGYTQKDGIDYKETFS
Sbjct: 1   MNDSEVWDLVELLKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYTQKDGIDYKETFS 60

Query: 68  PVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKL 127
           PVSKKDSLRIIM LVAHYDLELHQMDVKTAF+NGNLDE++FMDQPEGFMVEGKEHMVCKL
Sbjct: 61  PVSKKDSLRIIMALVAHYDLELHQMDVKTAFINGNLDEKLFMDQPEGFMVEGKEHMVCKL 120

Query: 128 KRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN 187
           KRSIYGLKQAS+QWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN
Sbjct: 121 KRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN 180

Query: 188 DFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKLKMNK 247
           DFGLLCQTKEFLSKNFEMKDM EASYVIGIEIFRDRTHGLL LSQKAYINKVL+K KM+K
Sbjct: 181 DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLRLSQKAYINKVLDKFKMDK 240

Query: 248 CSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLG 307
           CSSSVVPI KGDKFSLMQCPKNELERNQMETI YASIVGSLLYAQT TR DISFAVGML 
Sbjct: 241 CSSSVVPIHKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTYTRPDISFAVGMLD 300

Query: 308 RYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDTRKSTFGY 367
           RYQSNPGM+HWKAA KVLRYLQG KDYMLTYKRSDHL+VI YSDSDF  CVDTRKST GY
Sbjct: 301 RYQSNPGMNHWKAAMKVLRYLQGTKDYMLTYKRSDHLKVIGYSDSDFTGCVDTRKSTVGY 360

Query: 368 LFLLAEGAIS 378
           LFLLA+GAIS
Sbjct: 361 LFLLAKGAIS 370

BLAST of Cmc04g0104881 vs. NCBI nr
Match: RYE20331.1 (hypothetical protein EOP45_11235, partial [Sphingobacteriaceae bacterium])

HSP 1 Score: 682.2 bits (1759), Expect = 2.8e-192
Identity = 336/384 (87.50%), Postives = 360/384 (93.75%), Query Frame = 0

Query: 19  LPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRII 78
           +P+  KRVGCKWVFKTK DSNGNIER KARLVAKG+TQKDG+DYKETFSPVSKKDSLRI+
Sbjct: 1   MPEGCKRVGCKWVFKTKLDSNGNIERHKARLVAKGFTQKDGVDYKETFSPVSKKDSLRIV 60

Query: 79  MVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQAS 138
           M +VAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGF+V+GKEHMVCKLK+SIYGLKQAS
Sbjct: 61  MAMVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFVVDGKEHMVCKLKKSIYGLKQAS 120

Query: 139 KQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEF 198
           +QWYLKFNDT+TSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATND GLL QTKEF
Sbjct: 121 RQWYLKFNDTVTSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDLGLLRQTKEF 180

Query: 199 LSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKLKMNKCSSSVVPIQKG 258
           LSKNFEMKDM EASYVIGIEI RDR+ G LGLSQKAYINKVLE+ KM+KCS+ +VPIQKG
Sbjct: 181 LSKNFEMKDMGEASYVIGIEISRDRSQGWLGLSQKAYINKVLERFKMDKCSAGIVPIQKG 240

Query: 259 DKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHW 318
           DKFSL QCPKNELER QME I YASIVGSL+YAQTCTR  ISFAVGMLGRYQSNPG+DHW
Sbjct: 241 DKFSLNQCPKNELERKQMEQIPYASIVGSLMYAQTCTRPGISFAVGMLGRYQSNPGIDHW 300

Query: 319 KAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDTRKSTFGYLFLLAEGAISW 378
           KAAKKVLRYLQG KD MLTYKRSDHLEVI YSDSD+A CVD+RKSTFGY+FLLAEGAISW
Sbjct: 301 KAAKKVLRYLQGTKDRMLTYKRSDHLEVIGYSDSDYAGCVDSRKSTFGYMFLLAEGAISW 360

Query: 379 KSAKQSIIAASTMEAKFVTCFEAT 403
           KSAKQ++IAASTMEA+FV CFEAT
Sbjct: 361 KSAKQTVIAASTMEAEFVACFEAT 384

BLAST of Cmc04g0104881 vs. NCBI nr
Match: RZB61294.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 678.7 bits (1750), Expect = 3.2e-191
Identity = 331/402 (82.34%), Postives = 365/402 (90.80%), Query Frame = 0

Query: 1    MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
            MKEE+ SM  N VWDLVELPK  KRVGCKWVFKTKRDS+GN+ER KARLVAKG+TQKDGI
Sbjct: 688  MKEEIDSMEHNGVWDLVELPKGCKRVGCKWVFKTKRDSHGNLERYKARLVAKGFTQKDGI 747

Query: 61   DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
            DYKETFSPVS+KDS RIIM LV HYDLELHQMDVKTAFLNG+L+E+V+MDQP GF VEGK
Sbjct: 748  DYKETFSPVSRKDSFRIIMALVTHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSVEGK 807

Query: 121  EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 180
            EHMVCKLK+SIYGLKQAS+QWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVLYVD
Sbjct: 808  EHMVCKLKKSIYGLKQASRQWYLKFNDTIVSFGFKENTVDRCVYLKVSGSKVMFLVLYVD 867

Query: 181  DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 240
            DIL+ATND GLL +TK+FLS NFEMKDM EA+YVIGIEIFR+R+ GLLGLSQK YINKVL
Sbjct: 868  DILIATNDLGLLHETKKFLSSNFEMKDMGEANYVIGIEIFRNRSQGLLGLSQKTYINKVL 927

Query: 241  EKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 300
            E+ +M KCS+S VPIQK DKFSL QCPKN+LER QME ISYAS+VGS++YAQTCTR DIS
Sbjct: 928  ERFRMEKCSASPVPIQKRDKFSLAQCPKNDLERKQMEEISYASVVGSIMYAQTCTRPDIS 987

Query: 301  FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDT 360
            FA GMLGRYQSNPGM+HWKAAKKVLRYLQG KD+MLTYKRSDHLEVI YSDSDFA CVDT
Sbjct: 988  FATGMLGRYQSNPGMEHWKAAKKVLRYLQGTKDHMLTYKRSDHLEVIGYSDSDFAGCVDT 1047

Query: 361  RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEAT 403
            RKST G++FLLA GAISWKSAKQS++AASTMEA+FV CFEAT
Sbjct: 1048 RKSTLGFVFLLAGGAISWKSAKQSVVAASTMEAEFVACFEAT 1089

BLAST of Cmc04g0104881 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 376.3 bits (965), Expect = 4.4e-103
Identity = 192/401 (47.88%), Postives = 267/401 (66.58%), Query Frame = 0

Query: 1    MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
            M+EE++S+  N  + LVELPK  + + CKWVFK K+D +  + R KARLV KG+ QK GI
Sbjct: 830  MQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGI 889

Query: 61   DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
            D+ E FSPV K  S+R I+ L A  DLE+ Q+DVKTAFL+G+L+EE++M+QPEGF V GK
Sbjct: 890  DFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGK 949

Query: 121  EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLK-ISGSKFIILVLYV 180
            +HMVCKL +S+YGLKQA +QWY+KF+  + S  + +   D C+Y K  S + FIIL+LYV
Sbjct: 950  KHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYV 1009

Query: 181  DDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKV 240
            DD+L+   D GL+ + K  LSK+F+MKD+  A  ++G++I R+RT   L LSQ+ YI +V
Sbjct: 1010 DDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERV 1069

Query: 241  LEKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDI 300
            LE+  M        P+    K S   CP    E+  M  + Y+S VGSL+YA  CTR DI
Sbjct: 1070 LERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDI 1129

Query: 301  SFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVD 360
            + AVG++ R+  NPG +HW+A K +LRYL+G     L +  SD + +  Y+D+D A  +D
Sbjct: 1130 AHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYTDADMAGDID 1189

Query: 361  TRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFE 401
             RKS+ GYLF  + GAISW+S  Q  +A ST EA+++   E
Sbjct: 1190 NRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATE 1229

BLAST of Cmc04g0104881 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 283.1 bits (723), Expect = 5.1e-75
Identity = 160/403 (39.70%), Postives = 236/403 (58.56%), Query Frame = 0

Query: 4    ELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYK 63
            EL +   N  W + + P+    V  +WVF  K +  GN  R KARLVA+G+TQK  IDY+
Sbjct: 913  ELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYE 972

Query: 64   ETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHM 123
            ETF+PV++  S R I+ LV  Y+L++HQMDVKTAFLNG L EE++M  P+G  +      
Sbjct: 973  ETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQG--ISCNSDN 1032

Query: 124  VCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISG--SKFIILVLYVDD 183
            VCKL ++IYGLKQA++ W+  F   +    F  + VDRCIY+   G  ++ I ++LYVDD
Sbjct: 1033 VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDD 1092

Query: 184  ILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVLE 243
            +++AT D   +   K +L + F M D++E  + IGI I  +     + LSQ AY+ K+L 
Sbjct: 1093 VVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKKILS 1152

Query: 244  KLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISF 303
            K  M  C++   P+     + L       L  ++       S++G L+Y   CTR D++ 
Sbjct: 1153 KFNMENCNAVSTPLPSKINYEL-------LNSDEDCNTPCRSLIGCLMYIMLCTRPDLTT 1212

Query: 304  AVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLE--VIEYSDSDFARCVD 363
            AV +L RY S    + W+  K+VLRYL+G  D  L +K++   E  +I Y DSD+A    
Sbjct: 1213 AVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEI 1272

Query: 364  TRKSTFGYLFLLAE-GAISWKSAKQSIIAASTMEAKFVTCFEA 402
             RKST GYLF + +   I W + +Q+ +AAS+ EA+++  FEA
Sbjct: 1273 DRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEA 1304

BLAST of Cmc04g0104881 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.0e-71
Identity = 155/396 (39.14%), Postives = 221/396 (55.81%), Query Frame = 0

Query: 1    MKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDG 60
            M  E+ +   N  WDLV  P  S   VGC+W+F  K +S+G++ R KARLVAKGY Q+ G
Sbjct: 955  MGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPG 1014

Query: 61   IDYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEG 120
            +DY ETFSPV K  S+RI++ +       + Q+DV  AFL G L +EV+M QP GF+ + 
Sbjct: 1015 LDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKD 1074

Query: 121  KEHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYV 180
            +   VC+L+++IYGLKQA + WY++    + + GF  +I D  +++   G   I +++YV
Sbjct: 1075 RPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYV 1134

Query: 181  DDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKV 240
            DDIL+  ND  LL  T + LS+ F +K+  +  Y +GIE    R    L LSQ+ Y   +
Sbjct: 1135 DDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIE--AKRVPQGLHLSQRRYTLDL 1194

Query: 241  LEKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDI 300
            L +  M        P+    K +L    K        +   Y  IVGSL Y    TR D+
Sbjct: 1195 LARTNMLTAKPVATPMATSPKLTLHSGTK------LPDPTEYRGIVGSLQYL-AFTRPDL 1254

Query: 301  SFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVD 360
            S+AV  L +Y   P  DHW A K+VLRYL G  D+ +  K+ + L +  YSD+D+A   D
Sbjct: 1255 SYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTD 1314

Query: 361  TRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKF 396
               ST GY+  L    ISW S KQ  +  S+ EA++
Sbjct: 1315 DYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEY 1341

BLAST of Cmc04g0104881 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 268.5 bits (685), Expect = 1.3e-70
Identity = 152/396 (38.38%), Postives = 224/396 (56.57%), Query Frame = 0

Query: 1    MKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDG 60
            M  E+ +   N  WDLV  P      VGC+W+F  K +S+G++ R KARLVAKGY Q+ G
Sbjct: 972  MGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPG 1031

Query: 61   IDYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEG 120
            +DY ETFSPV K  S+RI++ +       + Q+DV  AFL G L ++V+M QP GF+ + 
Sbjct: 1032 LDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKD 1091

Query: 121  KEHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYV 180
            + + VCKL++++YGLKQA + WY++  + + + GF  ++ D  +++   G   + +++YV
Sbjct: 1092 RPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYV 1151

Query: 181  DDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKV 240
            DDIL+  ND  LL  T + LS+ F +KD  E  Y +GIE  R  T   L LSQ+ YI  +
Sbjct: 1152 DDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTG--LHLSQRRYILDL 1211

Query: 241  LEKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDI 300
            L +  M        P+    K SL    K        +   Y  IVGSL Y    TR DI
Sbjct: 1212 LARTNMITAKPVTTPMAPSPKLSLYSGTK------LTDPTEYRGIVGSLQYL-AFTRPDI 1271

Query: 301  SFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVD 360
            S+AV  L ++   P  +H +A K++LRYL G  ++ +  K+ + L +  YSD+D+A   D
Sbjct: 1272 SYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKD 1331

Query: 361  TRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKF 396
               ST GY+  L    ISW S KQ  +  S+ EA++
Sbjct: 1332 DYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEY 1358

BLAST of Cmc04g0104881 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 154.8 bits (390), Expect = 2.1e-36
Identity = 98/311 (31.51%), Postives = 159/311 (51.13%), Query Frame = 0

Query: 92  MDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASKQWYLKFNDTITS 151
           MDV TAFLN  +DE +++ QP GF+ E     V +L   +YGLKQA   W    N+T+  
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 152 FGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEA 211
            GF  +  +  +Y + +    I + +YVDD+L+A     +  + K+ L+K + MKD+ + 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 212 SYVIGIEIFRDRTHGLLGLSQKAYINKVLEKLKMNKCSSSVVPIQKGDKFSLMQCPKNEL 271
              +G+ I +  ++G + LS + YI K   + ++N    +  P+           P  E 
Sbjct: 121 DKFLGLNIHQS-SNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSK-------PLFET 180

Query: 272 ERNQMETIS-YASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQG 331
               ++ I+ Y SIVG LL+     R DIS+ V +L R+   P   H ++A++VLRYL  
Sbjct: 181 TSPHLKDITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYT 240

Query: 332 AKDYMLTYKRSDHLEVIEYSDSDFARCVDTRKSTFGYLFLLAEGAISWKSAK-QSIIAAS 391
            +   L Y+    L +  Y D+      D   ST GY+ LLA   ++W S K + +I   
Sbjct: 241 TRSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVP 300

Query: 392 TMEAKFVTCFE 401
           + EA+++T  E
Sbjct: 301 STEAEYITASE 303

BLAST of Cmc04g0104881 vs. ExPASy TrEMBL
Match: A0A5A7UG95 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43055G00040 PE=4 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 2.7e-233
Identity = 408/415 (98.31%), Postives = 411/415 (99.04%), Query Frame = 0

Query: 1   MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
           MKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
           DYKETFSPVSKKDSLRIIM LVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 180
           EHMVCKLKRSIYGLKQAS+QWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 240
           DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 300
           EK KMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDT 360
           FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFA CVDT
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEATSSWFMAVELYLRT 416
           RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEA+FVTCFEATSSWFMAVELYLRT
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRT 955

BLAST of Cmc04g0104881 vs. ExPASy TrEMBL
Match: A0A5D3BWW5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1856G00300 PE=4 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 2.7e-233
Identity = 408/415 (98.31%), Postives = 411/415 (99.04%), Query Frame = 0

Query: 1   MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
           MKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
           DYKETFSPVSKKDSLRIIM LVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 180
           EHMVCKLKRSIYGLKQAS+QWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 240
           DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 300
           EK KMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDT 360
           FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFA CVDT
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEATSSWFMAVELYLRT 416
           RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEA+FVTCFEATSSWFMAVELYLRT
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRT 955

BLAST of Cmc04g0104881 vs. ExPASy TrEMBL
Match: A0A5D3BLU0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold596G00400 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 2.5e-194
Identity = 343/370 (92.70%), Postives = 353/370 (95.41%), Query Frame = 0

Query: 8   MNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFS 67
           MND+EVWDLVEL KESKRVGCKWVFKTKRDSNGNIER KARLVAKGYTQKDGIDYKETFS
Sbjct: 1   MNDSEVWDLVELLKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYTQKDGIDYKETFS 60

Query: 68  PVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKL 127
           PVSKKDSLRIIM LVAHYDLELHQMDVKTAF+NGNLDE++FMDQPEGFMVEGKEHMVCKL
Sbjct: 61  PVSKKDSLRIIMALVAHYDLELHQMDVKTAFINGNLDEKLFMDQPEGFMVEGKEHMVCKL 120

Query: 128 KRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN 187
           KRSIYGLKQAS+QWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN
Sbjct: 121 KRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN 180

Query: 188 DFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKLKMNK 247
           DFGLLCQTKEFLSKNFEMKDM EASYVIGIEIFRDRTHGLL LSQKAYINKVL+K KM+K
Sbjct: 181 DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLRLSQKAYINKVLDKFKMDK 240

Query: 248 CSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLG 307
           CSSSVVPI KGDKFSLMQCPKNELERNQMETI YASIVGSLLYAQT TR DISFAVGML 
Sbjct: 241 CSSSVVPIHKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTYTRPDISFAVGMLD 300

Query: 308 RYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDTRKSTFGY 367
           RYQSNPGM+HWKAA KVLRYLQG KDYMLTYKRSDHL+VI YSDSDF  CVDTRKST GY
Sbjct: 301 RYQSNPGMNHWKAAMKVLRYLQGTKDYMLTYKRSDHLKVIGYSDSDFTGCVDTRKSTVGY 360

Query: 368 LFLLAEGAIS 378
           LFLLA+GAIS
Sbjct: 361 LFLLAKGAIS 370

BLAST of Cmc04g0104881 vs. ExPASy TrEMBL
Match: A0A4V1T029 (Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment) OS=Sphingobacteriaceae bacterium OX=2021370 GN=EOP45_11235 PE=4 SV=1)

HSP 1 Score: 682.2 bits (1759), Expect = 1.4e-192
Identity = 336/384 (87.50%), Postives = 360/384 (93.75%), Query Frame = 0

Query: 19  LPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRII 78
           +P+  KRVGCKWVFKTK DSNGNIER KARLVAKG+TQKDG+DYKETFSPVSKKDSLRI+
Sbjct: 1   MPEGCKRVGCKWVFKTKLDSNGNIERHKARLVAKGFTQKDGVDYKETFSPVSKKDSLRIV 60

Query: 79  MVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQAS 138
           M +VAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGF+V+GKEHMVCKLK+SIYGLKQAS
Sbjct: 61  MAMVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFVVDGKEHMVCKLKKSIYGLKQAS 120

Query: 139 KQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEF 198
           +QWYLKFNDT+TSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATND GLL QTKEF
Sbjct: 121 RQWYLKFNDTVTSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDLGLLRQTKEF 180

Query: 199 LSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKLKMNKCSSSVVPIQKG 258
           LSKNFEMKDM EASYVIGIEI RDR+ G LGLSQKAYINKVLE+ KM+KCS+ +VPIQKG
Sbjct: 181 LSKNFEMKDMGEASYVIGIEISRDRSQGWLGLSQKAYINKVLERFKMDKCSAGIVPIQKG 240

Query: 259 DKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDISFAVGMLGRYQSNPGMDHW 318
           DKFSL QCPKNELER QME I YASIVGSL+YAQTCTR  ISFAVGMLGRYQSNPG+DHW
Sbjct: 241 DKFSLNQCPKNELERKQMEQIPYASIVGSLMYAQTCTRPGISFAVGMLGRYQSNPGIDHW 300

Query: 319 KAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDTRKSTFGYLFLLAEGAISW 378
           KAAKKVLRYLQG KD MLTYKRSDHLEVI YSDSD+A CVD+RKSTFGY+FLLAEGAISW
Sbjct: 301 KAAKKVLRYLQGTKDRMLTYKRSDHLEVIGYSDSDYAGCVDSRKSTFGYMFLLAEGAISW 360

Query: 379 KSAKQSIIAASTMEAKFVTCFEAT 403
           KSAKQ++IAASTMEA+FV CFEAT
Sbjct: 361 KSAKQTVIAASTMEAEFVACFEAT 384

BLAST of Cmc04g0104881 vs. ExPASy TrEMBL
Match: A0A445GJ88 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_043849 PE=4 SV=1)

HSP 1 Score: 678.7 bits (1750), Expect = 1.5e-191
Identity = 331/402 (82.34%), Postives = 365/402 (90.80%), Query Frame = 0

Query: 1    MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
            MKEE+ SM  N VWDLVELPK  KRVGCKWVFKTKRDS+GN+ER KARLVAKG+TQKDGI
Sbjct: 688  MKEEIDSMEHNGVWDLVELPKGCKRVGCKWVFKTKRDSHGNLERYKARLVAKGFTQKDGI 747

Query: 61   DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
            DYKETFSPVS+KDS RIIM LV HYDLELHQMDVKTAFLNG+L+E+V+MDQP GF VEGK
Sbjct: 748  DYKETFSPVSRKDSFRIIMALVTHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSVEGK 807

Query: 121  EHMVCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 180
            EHMVCKLK+SIYGLKQAS+QWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVLYVD
Sbjct: 808  EHMVCKLKKSIYGLKQASRQWYLKFNDTIVSFGFKENTVDRCVYLKVSGSKVMFLVLYVD 867

Query: 181  DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 240
            DIL+ATND GLL +TK+FLS NFEMKDM EA+YVIGIEIFR+R+ GLLGLSQK YINKVL
Sbjct: 868  DILIATNDLGLLHETKKFLSSNFEMKDMGEANYVIGIEIFRNRSQGLLGLSQKTYINKVL 927

Query: 241  EKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 300
            E+ +M KCS+S VPIQK DKFSL QCPKN+LER QME ISYAS+VGS++YAQTCTR DIS
Sbjct: 928  ERFRMEKCSASPVPIQKRDKFSLAQCPKNDLERKQMEEISYASVVGSIMYAQTCTRPDIS 987

Query: 301  FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFARCVDT 360
            FA GMLGRYQSNPGM+HWKAAKKVLRYLQG KD+MLTYKRSDHLEVI YSDSDFA CVDT
Sbjct: 988  FATGMLGRYQSNPGMEHWKAAKKVLRYLQGTKDHMLTYKRSDHLEVIGYSDSDFAGCVDT 1047

Query: 361  RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEAT 403
            RKST G++FLLA GAISWKSAKQS++AASTMEA+FV CFEAT
Sbjct: 1048 RKSTLGFVFLLAGGAISWKSAKQSVVAASTMEAEFVACFEAT 1089

BLAST of Cmc04g0104881 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 303.1 bits (775), Expect = 3.4e-82
Identity = 159/416 (38.22%), Postives = 247/416 (59.38%), Query Frame = 0

Query: 1   MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
           M +E+ +M     W++  LP   K +GCKWV+K K +S+G IER KARLVAKGYTQ++GI
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGI 161

Query: 61  DYKETFSPVSKKDSLRIIMVLVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 120
           D+ ETFSPV K  S+++I+ + A Y+  LHQ+D+  AFLNG+LDEE++M  P G+     
Sbjct: 162 DFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQG 221

Query: 121 EHM----VCKLKRSIYGLKQASKQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILV 180
           + +    VC LK+SIYGLKQAS+QW+LKF+ T+  FGF ++  D   +LKI+ + F+ ++
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281

Query: 181 LYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQKAYI 240
           +YVDDI++ +N+   + + K  L   F+++D+    Y +G+EI R      + + Q+ Y 
Sbjct: 282 VYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAG--INICQRKYA 341

Query: 241 NKVLEKLKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTR 300
             +L++  +  C  S VP+     FS           + ++  +Y  ++G L+Y Q  TR
Sbjct: 342 LDLLDETGLLGCKPSSVPMDPSVTFSA------HSGGDFVDAKAYRRLIGRLMYLQ-ITR 401

Query: 301 LDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAR 360
           LDISFAV  L ++   P + H +A  K+L Y++G     L Y     +++  +SD+ F  
Sbjct: 402 LDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQS 461

Query: 361 CVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKFVTCFEATSSWFMAVELY 413
           C DTR+ST GY   L    ISWKS KQ +++ S+ EA++     AT       + +
Sbjct: 462 CKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFF 508

BLAST of Cmc04g0104881 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 107.8 bits (268), Expect = 2.1e-23
Identity = 76/224 (33.93%), Postives = 115/224 (51.34%), Query Frame = 0

Query: 175 LVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTH-GLLGLSQK 234
           L+LYVDDILL  +   LL      LS  F MKD+    Y +GI+I   +TH   L LSQ 
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQI---KTHPSGLFLSQT 62

Query: 235 AYINKVLEKLKMNKCS--SSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYA 294
            Y  ++L    M  C   S+ +P++     S  + P         +   + SIVG+L Y 
Sbjct: 63  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP---------DPSDFRSIVGALQYL 122

Query: 295 QTCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSD 354
            T TR DIS+AV ++ +    P +  +   K+VLRY++G   + L   ++  L V  + D
Sbjct: 123 -TLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 182

Query: 355 SDFARCVDTRKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAKF 396
           SD+A C  TR+ST G+   L    ISW + +Q  ++ S+ E ++
Sbjct: 183 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEY 213

BLAST of Cmc04g0104881 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 83.6 bits (205), Expect = 4.2e-16
Identity = 39/79 (49.37%), Postives = 57/79 (72.15%), Query Frame = 0

Query: 1   MKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 60
           M+EEL +++ N+ W LV  P     +GCKWVFKTK  S+G ++R KARLVAKG+ Q++GI
Sbjct: 44  MQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGI 103

Query: 61  DYKETFSPVSKKDSLRIIM 80
            + ET+SPV +  ++R I+
Sbjct: 104 YFVETYSPVVRTATIRTIL 122

BLAST of Cmc04g0104881 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 57.0 bits (136), Expect = 4.2e-08
Identity = 29/79 (36.71%), Postives = 45/79 (56.96%), Query Frame = 0

Query: 293 TCTRLDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDS 352
           T TR D++FAV  L ++ S       +A  KVL Y++G     L Y  +  L++  ++DS
Sbjct: 4   TITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFADS 63

Query: 353 DFARCVDTRKSTFGYLFLL 372
           D+A C DTR+S  G+  L+
Sbjct: 64  DWASCPDTRRSVTGFCSLV 82

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0052755.15.7e-23398.31Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK04201.15.7e-23398.31Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK00088.15.2e-19492.70Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
RYE20331.12.8e-19287.50hypothetical protein EOP45_11235, partial [Sphingobacteriaceae bacterium][more]
RZB61294.13.2e-19182.34Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja][more]
Match NameE-valueIdentityDescription
P109784.4e-10347.88Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041465.1e-7539.70Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT942.0e-7139.14Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW21.3e-7038.38Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P256002.1e-3631.51Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A5A7UG952.7e-23398.31Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3BWW52.7e-23398.31Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3BLU02.5e-19492.70Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A4V1T0291.4e-19287.50Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment) OS=Sph... [more]
A0A445GJ881.5e-19182.34Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3... [more]
Match NameE-valueIdentityDescription
AT4G23160.13.4e-8238.22cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.12.1e-2333.93DNA/RNA polymerases superfamily protein [more]
ATMG00820.14.2e-1649.37Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00240.14.2e-0836.71Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 11..255
e-value: 8.8E-75
score: 251.4
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 2..375
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 349..402
e-value: 3.59412E-22
score: 89.8349
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 11..389

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc04g0104881.1Cmc04g0104881.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006952 defense response
biological_process GO:0015074 DNA integration
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
biological_process GO:0007165 signal transduction
molecular_function GO:0043531 ADP binding
molecular_function GO:0003887 DNA-directed DNA polymerase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0008270 zinc ion binding