Cmc06g0163371 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc06g0163371
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr06: 12922282 .. 12923571 (-)
RNA-Seq Expression: Cmc06g0163371
Synteny: Cmc06g0163371
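
The Location field above can be turned into a feature length with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the `Location` class and `parse_location` helper are hypothetical names introduced only for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Location:
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not confirmed by the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'CMiso1.1chr06: 12922282 .. 12923571 (-)'."""
    match = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CMiso1.1chr06: 12922282 .. 12923571 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under that assumption the span is a multiple of three, which is consistent with the gene, mRNA, and CDS sequences listed for this feature being identical.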
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAGAAGAGTTAAAATCTATGAATGATAATAAAGTCTGGGATCTTGTAGAATTACCTAAAGAAAGTAAAAGAGTTGAGTGTAAATGGGTCTTTAAGACCAAATGTGACTCAAATGGCAATATCGAACAATACAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCGTTGACTACAAAGAGACCTTTTCTCCTATCTCGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTTATCAAATAGATGTGAAGACCGCCTTTCTATATAGAAATTTAGATGAAAAAGTGTTCATGGATCAACTAGAAGGTTTTATGGTTGAAGGAAAGGAATATATGGTGTGTAAATTAAAGAGGTCAATATATGAACTTAAACAAGCTTCCAGACAGTGCTATCTTAAGTTCAATGATATCATCACATCTTTTGGTTTTAAAGAAAACATCATTGATCAATGTATATGCCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGTTTGCTACAAATGACTTTGGTTTGTTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTTTGTGACCAAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGATAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAGTTCAGTCTCATGCAATATCCAAAAAATGAATTGGAACGAAATCAAATGGAAACTATTCCTTATGCATCTATTGTCGGAAGCTTATTGTATGCACAGACTTGCACTAGACCAGACATTAGTTTTGTTATGGGTATGCTAGGCATATATCAAAGCAATCCAGGAATGGATCATTGGAAAACTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAGGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTTAGATTTTACCGGATGTATGGATAAAAGAAAATCCACTTTTGGCTATTTGTTTTTGTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAATCTATTGTCGCTGTATCCACTATGAAAGCTGAATTTGTAGCATGCTTTGAAGCTATAGTTCATAGTTTTATGGCTGCGGAACTTTATCTCAGGACTTGGAATTGTCGACAGTATTGTCAAGCCGCTGAGAATTTATCGTGA

mRNA sequence

ATGAAAGAAGAGTTAAAATCTATGAATGATAATAAAGTCTGGGATCTTGTAGAATTACCTAAAGAAAGTAAAAGAGTTGAGTGTAAATGGGTCTTTAAGACCAAATGTGACTCAAATGGCAATATCGAACAATACAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCGTTGACTACAAAGAGACCTTTTCTCCTATCTCGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTTATCAAATAGATGTGAAGACCGCCTTTCTATATAGAAATTTAGATGAAAAAGTGTTCATGGATCAACTAGAAGGTTTTATGGTTGAAGGAAAGGAATATATGGTGTGTAAATTAAAGAGGTCAATATATGAACTTAAACAAGCTTCCAGACAGTGCTATCTTAAGTTCAATGATATCATCACATCTTTTGGTTTTAAAGAAAACATCATTGATCAATGTATATGCCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGTTTGCTACAAATGACTTTGGTTTGTTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTTTGTGACCAAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGATAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAGTTCAGTCTCATGCAATATCCAAAAAATGAATTGGAACGAAATCAAATGGAAACTATTCCTTATGCATCTATTGTCGGAAGCTTATTGTATGCACAGACTTGCACTAGACCAGACATTAGTTTTGTTATGGGTATGCTAGGCATATATCAAAGCAATCCAGGAATGGATCATTGGAAAACTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAGGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTTAGATTTTACCGGATGTATGGATAAAAGAAAATCCACTTTTGGCTATTTGTTTTTGTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAATCTATTGTCGCTGTATCCACTATGAAAGCTGAATTTGTAGCATGCTTTGAAGCTATAGTTCATAGTTTTATGGCTGCGGAACTTTATCTCAGGACTTGGAATTGTCGACAGTATTGTCAAGCCGCTGAGAATTTATCGTGA

Coding sequence (CDS)

ATGAAAGAAGAGTTAAAATCTATGAATGATAATAAAGTCTGGGATCTTGTAGAATTACCTAAAGAAAGTAAAAGAGTTGAGTGTAAATGGGTCTTTAAGACCAAATGTGACTCAAATGGCAATATCGAACAATACAAGGCTAGACTTGTTGCCAAAGGTTATACTCAGAAAGATGGCGTTGACTACAAAGAGACCTTTTCTCCTATCTCGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTTATCAAATAGATGTGAAGACCGCCTTTCTATATAGAAATTTAGATGAAAAAGTGTTCATGGATCAACTAGAAGGTTTTATGGTTGAAGGAAAGGAATATATGGTGTGTAAATTAAAGAGGTCAATATATGAACTTAAACAAGCTTCCAGACAGTGCTATCTTAAGTTCAATGATATCATCACATCTTTTGGTTTTAAAGAAAACATCATTGATCAATGTATATGCCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGTTTGCTACAAATGACTTTGGTTTGTTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTTTGTGACCAAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGATAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAGTTCAGTCTCATGCAATATCCAAAAAATGAATTGGAACGAAATCAAATGGAAACTATTCCTTATGCATCTATTGTCGGAAGCTTATTGTATGCACAGACTTGCACTAGACCAGACATTAGTTTTGTTATGGGTATGCTAGGCATATATCAAAGCAATCCAGGAATGGATCATTGGAAAACTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAGGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTTAGATTTTACCGGATGTATGGATAAAAGAAAATCCACTTTTGGCTATTTGTTTTTGTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAAAGCAATCTATTGTCGCTGTATCCACTATGAAAGCTGAATTTGTAGCATGCTTTGAAGCTATAGTTCATAGTTTTATGGCTGCGGAACTTTATCTCAGGACTTGGAATTGTCGACAGTATTGTCAAGCCGCTGAGAATTTATCGTGA

Protein sequence

MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGVDYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVDDILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDKRKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHSFMAAELYLRTWNCRQYCQAAENLS
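
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and translation in frame 0 up to the first stop codon; the `translate` helper is a hypothetical name, not part of the database's tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last three codons of the CDS listed above.
print(translate("ATGAAAGAAGAGTTAAAATCTATG"))
print(translate("TTATCGTGA"))
```

Passing the full CDS string to `translate` should reproduce the protein sequence, provided the assumptions about the genetic code and reading frame hold.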
Homology
BLAST of Cmc06g0163371 vs. NCBI nr
Match: KAA0052755.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 753.1 bits (1943), Expect = 1.4e-213
Identity = 378/428 (88.32%), Positives = 395/428 (92.29%), Query Frame = 0

Query: 1   MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
           MKEELKSMNDN+VWDLVELPK+SKRV CKWVFKTK DSNGNIE+ KARLVAKGYTQKDG+
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
           DYKETFSP+SKKDSLRIIMALVAHYDLEL+Q+DVKTAFL  NLDE+VFMDQ EGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVD 180
           E+MVCKLKRSIY LKQASRQ YLKFND ITSFGFKENI+D+CI LKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVL 240
           DIL ATNDFGLLCQTKEFLSKNFEMKDM EASYVIGIEIF D+THGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 300
           EKFKM+KCSSSVVPIQKGDKFSLMQ PKNELERNQMETI YASIVGSLLYAQTCTR DIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDK 360
           F +GMLG YQSNPGMDHWK AKKVLRYLQG KDYMLTYKRSDHLEVI YSD DF GC+D 
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHSFMAAELYLRTWNCRQ 420
           RKSTFGYLFLLAEGAISWKSAKQSI+A STM+AEFV CFEA    FMA ELYLRTWNCRQ
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNCRQ 960

Query: 421 YCQAAENL 429
           YCQAA NL
Sbjct: 961 YCQAAGNL 968
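
The percentages on each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch for recomputing them from a summary line; the `hsp_percentages` helper is a hypothetical name, and the pattern matches the positives label loosely so that minor spelling variants in the exported text are still accepted.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 378/428 (88.32%), Positives = 395/428 (92.29%), Query Frame = 0"
print(hsp_percentages(line))
```

Recomputing the values this way is a quick sanity check that the counts and the printed percentages on a given line agree.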

BLAST of Cmc06g0163371 vs. NCBI nr
Match: TYK04201.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 753.1 bits (1943), Expect = 1.4e-213
Identity = 378/428 (88.32%), Positives = 395/428 (92.29%), Query Frame = 0

Query: 1   MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
           MKEELKSMNDN+VWDLVELPK+SKRV CKWVFKTK DSNGNIE+ KARLVAKGYTQKDG+
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
           DYKETFSP+SKKDSLRIIMALVAHYDLEL+Q+DVKTAFL  NLDE+VFMDQ EGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVD 180
           E+MVCKLKRSIY LKQASRQ YLKFND ITSFGFKENI+D+CI LKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVL 240
           DIL ATNDFGLLCQTKEFLSKNFEMKDM EASYVIGIEIF D+THGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 300
           EKFKM+KCSSSVVPIQKGDKFSLMQ PKNELERNQMETI YASIVGSLLYAQTCTR DIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDK 360
           F +GMLG YQSNPGMDHWK AKKVLRYLQG KDYMLTYKRSDHLEVI YSD DF GC+D 
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHSFMAAELYLRTWNCRQ 420
           RKSTFGYLFLLAEGAISWKSAKQSI+A STM+AEFV CFEA    FMA ELYLRTWNCRQ
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNCRQ 960

Query: 421 YCQAAENL 429
           YCQAA NL
Sbjct: 961 YCQAAGNL 968

BLAST of Cmc06g0163371 vs. NCBI nr
Match: TYK00088.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 654.8 bits (1688), Expect = 5.0e-184
Identity = 327/370 (88.38%), Positives = 346/370 (93.51%), Query Frame = 0

Query: 8   MNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGVDYKETFS 67
           MND++VWDLVEL KESKRV CKWVFKTK DSNGNIE+YKARLVAKGYTQKDG+DYKETFS
Sbjct: 1   MNDSEVWDLVELLKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYTQKDGIDYKETFS 60

Query: 68  PISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYMVCKL 127
           P+SKKDSLRIIMALVAHYDLEL+Q+DVKTAF+  NLDEK+FMDQ EGFMVEGKE+MVCKL
Sbjct: 61  PVSKKDSLRIIMALVAHYDLELHQMDVKTAFINGNLDEKLFMDQPEGFMVEGKEHMVCKL 120

Query: 128 KRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVDDILFATN 187
           KRSIY LKQASRQ YLKFND ITSFGFKENI+D+CI LKISGSKFIILVLYVDDIL ATN
Sbjct: 121 KRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN 180

Query: 188 DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVLEKFKMDK 247
           DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIF D+THGLL LSQKAYINKVL+KFKMDK
Sbjct: 181 DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLRLSQKAYINKVLDKFKMDK 240

Query: 248 CSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFVMGMLG 307
           CSSSVVPI KGDKFSLMQ PKNELERNQMETIPYASIVGSLLYAQT TRPDISF +GML 
Sbjct: 241 CSSSVVPIHKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTYTRPDISFAVGMLD 300

Query: 308 IYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDKRKSTFGY 367
            YQSNPGM+HWK A KVLRYLQGTKDYMLTYKRSDHL+VIGYSD DFTGC+D RKST GY
Sbjct: 301 RYQSNPGMNHWKAAMKVLRYLQGTKDYMLTYKRSDHLKVIGYSDSDFTGCVDTRKSTVGY 360

Query: 368 LFLLAEGAIS 378
           LFLLA+GAIS
Sbjct: 361 LFLLAKGAIS 370

BLAST of Cmc06g0163371 vs. NCBI nr
Match: RYE20331.1 (hypothetical protein EOP45_11235, partial [Sphingobacteriaceae bacterium])

HSP 1 Score: 651.0 bits (1678), Expect = 7.3e-183
Identity = 320/386 (82.90%), Positives = 353/386 (91.45%), Query Frame = 0

Query: 19  LPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGVDYKETFSPISKKDSLRII 78
           +P+  KRV CKWVFKTK DSNGNIE++KARLVAKG+TQKDGVDYKETFSP+SKKDSLRI+
Sbjct: 1   MPEGCKRVGCKWVFKTKLDSNGNIERHKARLVAKGFTQKDGVDYKETFSPVSKKDSLRIV 60

Query: 79  MALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYMVCKLKRSIYELKQAS 138
           MA+VAHYDLEL+Q+DVKTAFL  NLDE+VFMDQ EGF+V+GKE+MVCKLK+SIY LKQAS
Sbjct: 61  MAMVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFVVDGKEHMVCKLKKSIYGLKQAS 120

Query: 139 RQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVDDILFATNDFGLLCQTKEF 198
           RQ YLKFND +TSFGFKENI+D+CI LKISGSKFIILVLYVDDIL ATND GLL QTKEF
Sbjct: 121 RQWYLKFNDTVTSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDLGLLRQTKEF 180

Query: 199 LSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKG 258
           LSKNFEMKDMGEASYVIGIEI  D++ G LGLSQKAYINKVLE+FKMDKCS+ +VPIQKG
Sbjct: 181 LSKNFEMKDMGEASYVIGIEISRDRSQGWLGLSQKAYINKVLERFKMDKCSAGIVPIQKG 240

Query: 259 DKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFVMGMLGIYQSNPGMDHW 318
           DKFSL Q PKNELER QME IPYASIVGSL+YAQTCTRP ISF +GMLG YQSNPG+DHW
Sbjct: 241 DKFSLNQCPKNELERKQMEQIPYASIVGSLMYAQTCTRPGISFAVGMLGRYQSNPGIDHW 300

Query: 319 KTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDKRKSTFGYLFLLAEGAISW 378
           K AKKVLRYLQGTKD MLTYKRSDHLEVIGYSD D+ GC+D RKSTFGY+FLLAEGAISW
Sbjct: 301 KAAKKVLRYLQGTKDRMLTYKRSDHLEVIGYSDSDYAGCVDSRKSTFGYMFLLAEGAISW 360

Query: 379 KSAKQSIVAVSTMKAEFVACFEAIVH 405
           KSAKQ+++A STM+AEFVACFEA VH
Sbjct: 361 KSAKQTVIAASTMEAEFVACFEATVH 386

BLAST of Cmc06g0163371 vs. NCBI nr
Match: RZC25410.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 637.9 bits (1644), Expect = 6.4e-179
Identity = 313/405 (77.28%), Positives = 356/405 (87.90%), Query Frame = 0

Query: 1    MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
            MKEE+ SM  N VWDLVELPK  KRV  KWVFKTK DS+GN+E+YKARLVAKG+TQKDG+
Sbjct: 913  MKEEIDSMEHNDVWDLVELPKGCKRVGYKWVFKTKRDSHGNLERYKARLVAKGFTQKDGI 972

Query: 61   DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
            DYKETFSP+S+KDS RIIMALVAHYDLEL+Q+DVKTAFL  +L+E V+MDQ  GF VEGK
Sbjct: 973  DYKETFSPVSRKDSFRIIMALVAHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSVEGK 1032

Query: 121  EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVD 180
            E+MVCKLK+SIY LKQASRQ YLKFND I SFGFKEN +D+C+ LK+SGSK + LVLYVD
Sbjct: 1033 EHMVCKLKKSIYGLKQASRQWYLKFNDTIVSFGFKENTVDRCVYLKVSGSKVMFLVLYVD 1092

Query: 181  DILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVL 240
            DIL ATND GL  +TK+FLS NFEMKDMGEASYVIGIEIF +++ GLLGLSQKAYINKVL
Sbjct: 1093 DILLATNDLGLFHETKKFLSSNFEMKDMGEASYVIGIEIFRNRSQGLLGLSQKAYINKVL 1152

Query: 241  EKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 300
            E+F+M+KCS+S VPIQKGDKFSL Q PKN+LER QME IPYAS+VGS++YAQTCTRPDIS
Sbjct: 1153 ERFRMEKCSASPVPIQKGDKFSLAQCPKNDLERKQMEAIPYASVVGSIMYAQTCTRPDIS 1212

Query: 301  FVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDK 360
            F  GMLG YQSNPGM+HWK AKKVLRYLQGTKD++LTYKRSDHLEVIGYSD DF GC+D 
Sbjct: 1213 FATGMLGRYQSNPGMEHWKAAKKVLRYLQGTKDHILTYKRSDHLEVIGYSDSDFAGCVDT 1272

Query: 361  RKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHS 406
            RKST G++FLLA GAISWKSAKQS+VA STM+A FVACFEA + +
Sbjct: 1273 RKSTLGFVFLLAGGAISWKSAKQSVVAASTMEAAFVACFEATIQA 1317

BLAST of Cmc06g0163371 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 2.6e-98
Identity = 185/401 (46.13%), Positives = 265/401 (66.08%), Query Frame = 0

Query: 1    MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
            M+EE++S+  N  + LVELPK  + ++CKWVFK K D +  + +YKARLV KG+ QK G+
Sbjct: 830  MQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGI 889

Query: 61   DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
            D+ E FSP+ K  S+R I++L A  DLE+ Q+DVKTAFL+ +L+E+++M+Q EGF V GK
Sbjct: 890  DFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGK 949

Query: 121  EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLK-ISGSKFIILVLYV 180
            ++MVCKL +S+Y LKQA RQ Y+KF+  + S  + +   D C+  K  S + FIIL+LYV
Sbjct: 950  KHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYV 1009

Query: 181  DDILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKV 240
            DD+L    D GL+ + K  LSK+F+MKD+G A  ++G++I  ++T   L LSQ+ YI +V
Sbjct: 1010 DDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERV 1069

Query: 241  LEKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDI 300
            LE+F M        P+    K S    P    E+  M  +PY+S VGSL+YA  CTRPDI
Sbjct: 1070 LERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDI 1129

Query: 301  SFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMD 360
            +  +G++  +  NPG +HW+  K +LRYL+GT    L +  SD + + GY+D D  G +D
Sbjct: 1130 AHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYTDADMAGDID 1189

Query: 361  KRKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFE 401
             RKS+ GYLF  + GAISW+S  Q  VA+ST +AE++A  E
Sbjct: 1190 NRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATE 1229

BLAST of Cmc06g0163371 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 269.6 bits (688), Expect = 6.0e-71
Identity = 150/404 (37.13%), Positives = 234/404 (57.92%), Query Frame = 0

Query: 4    ELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGVDYK 63
            EL +   N  W + + P+    V+ +WVF  K +  GN  +YKARLVA+G+TQK  +DY+
Sbjct: 913  ELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYE 972

Query: 64   ETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYM 123
            ETF+P+++  S R I++LV  Y+L+++Q+DVKTAFL   L E+++M   +G         
Sbjct: 973  ETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDN-- 1032

Query: 124  VCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISG--SKFIILVLYVDD 183
            VCKL ++IY LKQA+R  +  F   +    F  + +D+CI +   G  ++ I ++LYVDD
Sbjct: 1033 VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDD 1092

Query: 184  ILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVLE 243
            ++ AT D   +   K +L + F M D+ E  + IGI I  +     + LSQ AY+ K+L 
Sbjct: 1093 VVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYVKKILS 1152

Query: 244  KFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDISF 303
            KF M+ C++   P+     + L       L  ++    P  S++G L+Y   CTRPD++ 
Sbjct: 1153 KFNMENCNAVSTPLPSKINYEL-------LNSDEDCNTPCRSLIGCLMYIMLCTRPDLTT 1212

Query: 304  VMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLE--VIGYSDLDFTGCMD 363
             + +L  Y S    + W+  K+VLRYL+GT D  L +K++   E  +IGY D D+ G   
Sbjct: 1213 AVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEI 1272

Query: 364  KRKSTFGYLFLLAE-GAISWKSAKQSIVAVSTMKAEFVACFEAI 403
             RKST GYLF + +   I W + +Q+ VA S+ +AE++A FEA+
Sbjct: 1273 DRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAV 1305

BLAST of Cmc06g0163371 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 6.4e-65
Identity = 148/396 (37.37%), Positives = 213/396 (53.79%), Query Frame = 0

Query: 1    MKEELKSMNDNKVWDLVELPKESKR-VECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDG 60
            M  E+ +   N  WDLV  P  S   V C+W+F  K +S+G++ +YKARLVAKGY Q+ G
Sbjct: 955  MGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPG 1014

Query: 61   VDYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEG 120
            +DY ETFSP+ K  S+RI++ +       + Q+DV  AFL   L ++V+M Q  GF+ + 
Sbjct: 1015 LDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKD 1074

Query: 121  KEYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYV 180
            +   VC+L+++IY LKQA R  Y++    + + GF  +I D  + +   G   I +++YV
Sbjct: 1075 RPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYV 1134

Query: 181  DDILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKV 240
            DDIL   ND  LL  T + LS+ F +K+  +  Y +GIE    +    L LSQ+ Y   +
Sbjct: 1135 DDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIE--AKRVPQGLHLSQRRYTLDL 1194

Query: 241  LEKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDI 300
            L +  M        P+    K +L    K        +   Y  IVGSL Y    TRPD+
Sbjct: 1195 LARTNMLTAKPVATPMATSPKLTLHSGTK------LPDPTEYRGIVGSLQYL-AFTRPDL 1254

Query: 301  SFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMD 360
            S+ +  L  Y   P  DHW   K+VLRYL GT D+ +  K+ + L +  YSD D+ G  D
Sbjct: 1255 SYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTD 1314

Query: 361  KRKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEF 396
               ST GY+  L    ISW S KQ  V  S+ +AE+
Sbjct: 1315 DYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEY 1341

BLAST of Cmc06g0163371 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 7.1e-64
Identity = 145/396 (36.62%), Positives = 213/396 (53.79%), Query Frame = 0

Query: 1    MKEELKSMNDNKVWDLVELPKESKR-VECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDG 60
            M  E+ +   N  WDLV  P      V C+W+F  K +S+G++ +YKARLVAKGY Q+ G
Sbjct: 972  MGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPG 1031

Query: 61   VDYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEG 120
            +DY ETFSP+ K  S+RI++ +       + Q+DV  AFL   L + V+M Q  GF+ + 
Sbjct: 1032 LDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKD 1091

Query: 121  KEYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYV 180
            +   VCKL++++Y LKQA R  Y++  + + + GF  ++ D  + +   G   + +++YV
Sbjct: 1092 RPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYV 1151

Query: 181  DDILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKV 240
            DDIL   ND  LL  T + LS+ F +KD  E  Y +GIE    +    L LSQ+ YI  +
Sbjct: 1152 DDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIE--AKRVPTGLHLSQRRYILDL 1211

Query: 241  LEKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDI 300
            L +  M        P+    K SL    K        +   Y  IVGSL Y    TRPDI
Sbjct: 1212 LARTNMITAKPVTTPMAPSPKLSLYSGTK------LTDPTEYRGIVGSLQYL-AFTRPDI 1271

Query: 301  SFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMD 360
            S+ +  L  +   P  +H +  K++LRYL GT ++ +  K+ + L +  YSD D+ G  D
Sbjct: 1272 SYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKD 1331

Query: 361  KRKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEF 396
               ST GY+  L    ISW S KQ  V  S+ +AE+
Sbjct: 1332 DYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEY 1358

BLAST of Cmc06g0163371 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 134.0 bits (336), Expect = 3.9e-30
Identity = 89/313 (28.43%), Positives = 152/313 (48.56%), Query Frame = 0

Query: 92  IDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYMVCKLKRSIYELKQASRQCYLKFNDIITS 151
           +DV TAFL   +DE +++ Q  GF+ E     V +L   +Y LKQA        N+ +  
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 152 FGFKENIIDQCICLKISGSKFIILVLYVDDILFATNDFGLLCQTKEFLSKNFEMKDMGEA 211
            GF  +  +  +  + +    I + +YVDD+L A     +  + K+ L+K + MKD+G+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 212 SYVIGIEIFCDQTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQYPKNEL 271
              +G+ I    ++G + LS + YI K   + +++    +  P+           P    
Sbjct: 121 DKFLGLNIH-QSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSP---- 180

Query: 272 ERNQMETIPYASIVGSLLYAQTCTRPDISFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGT 331
             +  +  PY SIVG LL+     RPDIS+ + +L  +   P   H ++A++VLRYL  T
Sbjct: 181 --HLKDITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 332 KDYMLTYKRSDHLEVIGYSDLDFTGCMDKRKSTFGYLFLLAEGAISWKSAK-QSIVAVST 391
           +   L Y+    L +  Y D       D   ST GY+ LLA   ++W S K + ++ V +
Sbjct: 241 RSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 300

Query: 392 MKAEFVACFEAIV 404
            +AE++   E ++
Sbjct: 301 TEAEYITASETVM 306

BLAST of Cmc06g0163371 vs. ExPASy TrEMBL
Match: A0A5A7UG95 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43055G00040 PE=4 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 6.6e-214
Identity = 378/428 (88.32%), Positives = 395/428 (92.29%), Query Frame = 0

Query: 1   MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
           MKEELKSMNDN+VWDLVELPK+SKRV CKWVFKTK DSNGNIE+ KARLVAKGYTQKDG+
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
           DYKETFSP+SKKDSLRIIMALVAHYDLEL+Q+DVKTAFL  NLDE+VFMDQ EGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVD 180
           E+MVCKLKRSIY LKQASRQ YLKFND ITSFGFKENI+D+CI LKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVL 240
           DIL ATNDFGLLCQTKEFLSKNFEMKDM EASYVIGIEIF D+THGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 300
           EKFKM+KCSSSVVPIQKGDKFSLMQ PKNELERNQMETI YASIVGSLLYAQTCTR DIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDK 360
           F +GMLG YQSNPGMDHWK AKKVLRYLQG KDYMLTYKRSDHLEVI YSD DF GC+D 
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHSFMAAELYLRTWNCRQ 420
           RKSTFGYLFLLAEGAISWKSAKQSI+A STM+AEFV CFEA    FMA ELYLRTWNCRQ
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNCRQ 960

Query: 421 YCQAAENL 429
           YCQAA NL
Sbjct: 961 YCQAAGNL 968

BLAST of Cmc06g0163371 vs. ExPASy TrEMBL
Match: A0A5D3BWW5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1856G00300 PE=4 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 6.6e-214
Identity = 378/428 (88.32%), Positives = 395/428 (92.29%), Query Frame = 0

Query: 1   MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
           MKEELKSMNDN+VWDLVELPK+SKRV CKWVFKTK DSNGNIE+ KARLVAKGYTQKDG+
Sbjct: 541 MKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKRDSNGNIERCKARLVAKGYTQKDGI 600

Query: 61  DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
           DYKETFSP+SKKDSLRIIMALVAHYDLEL+Q+DVKTAFL  NLDE+VFMDQ EGFMVEGK
Sbjct: 601 DYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 660

Query: 121 EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVD 180
           E+MVCKLKRSIY LKQASRQ YLKFND ITSFGFKENI+D+CI LKISGSKFIILVLYVD
Sbjct: 661 EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 720

Query: 181 DILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVL 240
           DIL ATNDFGLLCQTKEFLSKNFEMKDM EASYVIGIEIF D+THGLLGLSQ AYINKVL
Sbjct: 721 DILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIGIEIFRDRTHGLLGLSQNAYINKVL 780

Query: 241 EKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 300
           EKFKM+KCSSSVVPIQKGDKFSLMQ PKNELERNQMETI YASIVGSLLYAQTCTR DIS
Sbjct: 781 EKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQMETISYASIVGSLLYAQTCTRLDIS 840

Query: 301 FVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDK 360
           F +GMLG YQSNPGMDHWK AKKVLRYLQG KDYMLTYKRSDHLEVI YSD DF GC+D 
Sbjct: 841 FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGAKDYMLTYKRSDHLEVIEYSDSDFAGCVDT 900

Query: 361 RKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHSFMAAELYLRTWNCRQ 420
           RKSTFGYLFLLAEGAISWKSAKQSI+A STM+AEFV CFEA    FMA ELYLRTWNCRQ
Sbjct: 901 RKSTFGYLFLLAEGAISWKSAKQSIIAASTMEAEFVTCFEATSSWFMAVELYLRTWNCRQ 960

Query: 421 YCQAAENL 429
           YCQAA NL
Sbjct: 961 YCQAAGNL 968

BLAST of Cmc06g0163371 vs. ExPASy TrEMBL
Match: A0A5D3BLU0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold596G00400 PE=4 SV=1)

HSP 1 Score: 654.8 bits (1688), Expect = 2.4e-184
Identity = 327/370 (88.38%), Positives = 346/370 (93.51%), Query Frame = 0

Query: 8   MNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGVDYKETFS 67
           MND++VWDLVEL KESKRV CKWVFKTK DSNGNIE+YKARLVAKGYTQKDG+DYKETFS
Sbjct: 1   MNDSEVWDLVELLKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYTQKDGIDYKETFS 60

Query: 68  PISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYMVCKL 127
           P+SKKDSLRIIMALVAHYDLEL+Q+DVKTAF+  NLDEK+FMDQ EGFMVEGKE+MVCKL
Sbjct: 61  PVSKKDSLRIIMALVAHYDLELHQMDVKTAFINGNLDEKLFMDQPEGFMVEGKEHMVCKL 120

Query: 128 KRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVDDILFATN 187
           KRSIY LKQASRQ YLKFND ITSFGFKENI+D+CI LKISGSKFIILVLYVDDIL ATN
Sbjct: 121 KRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATN 180

Query: 188 DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVLEKFKMDK 247
           DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIF D+THGLL LSQKAYINKVL+KFKMDK
Sbjct: 181 DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLRLSQKAYINKVLDKFKMDK 240

Query: 248 CSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFVMGMLG 307
           CSSSVVPI KGDKFSLMQ PKNELERNQMETIPYASIVGSLLYAQT TRPDISF +GML 
Sbjct: 241 CSSSVVPIHKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTYTRPDISFAVGMLD 300

Query: 308 IYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDKRKSTFGY 367
            YQSNPGM+HWK A KVLRYLQGTKDYMLTYKRSDHL+VIGYSD DFTGC+D RKST GY
Sbjct: 301 RYQSNPGMNHWKAAMKVLRYLQGTKDYMLTYKRSDHLKVIGYSDSDFTGCVDTRKSTVGY 360

Query: 368 LFLLAEGAIS 378
           LFLLA+GAIS
Sbjct: 361 LFLLAKGAIS 370

BLAST of Cmc06g0163371 vs. ExPASy TrEMBL
Match: A0A4V1T029 (Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment) OS=Sphingobacteriaceae bacterium OX=2021370 GN=EOP45_11235 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 3.5e-183
Identity = 320/386 (82.90%), Positives = 353/386 (91.45%), Query Frame = 0

Query: 19  LPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGVDYKETFSPISKKDSLRII 78
           +P+  KRV CKWVFKTK DSNGNIE++KARLVAKG+TQKDGVDYKETFSP+SKKDSLRI+
Sbjct: 1   MPEGCKRVGCKWVFKTKLDSNGNIERHKARLVAKGFTQKDGVDYKETFSPVSKKDSLRIV 60

Query: 79  MALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGKEYMVCKLKRSIYELKQAS 138
           MA+VAHYDLEL+Q+DVKTAFL  NLDE+VFMDQ EGF+V+GKE+MVCKLK+SIY LKQAS
Sbjct: 61  MAMVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFVVDGKEHMVCKLKKSIYGLKQAS 120

Query: 139 RQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVDDILFATNDFGLLCQTKEF 198
           RQ YLKFND +TSFGFKENI+D+CI LKISGSKFIILVLYVDDIL ATND GLL QTKEF
Sbjct: 121 RQWYLKFNDTVTSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDLGLLRQTKEF 180

Query: 199 LSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKG 258
           LSKNFEMKDMGEASYVIGIEI  D++ G LGLSQKAYINKVLE+FKMDKCS+ +VPIQKG
Sbjct: 181 LSKNFEMKDMGEASYVIGIEISRDRSQGWLGLSQKAYINKVLERFKMDKCSAGIVPIQKG 240

Query: 259 DKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFVMGMLGIYQSNPGMDHW 318
           DKFSL Q PKNELER QME IPYASIVGSL+YAQTCTRP ISF +GMLG YQSNPG+DHW
Sbjct: 241 DKFSLNQCPKNELERKQMEQIPYASIVGSLMYAQTCTRPGISFAVGMLGRYQSNPGIDHW 300

Query: 319 KTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDKRKSTFGYLFLLAEGAISW 378
           K AKKVLRYLQGTKD MLTYKRSDHLEVIGYSD D+ GC+D RKSTFGY+FLLAEGAISW
Sbjct: 301 KAAKKVLRYLQGTKDRMLTYKRSDHLEVIGYSDSDYAGCVDSRKSTFGYMFLLAEGAISW 360

Query: 379 KSAKQSIVAVSTMKAEFVACFEAIVH 405
           KSAKQ+++A STM+AEFVACFEA VH
Sbjct: 361 KSAKQTVIAASTMEAEFVACFEATVH 386

BLAST of Cmc06g0163371 vs. ExPASy TrEMBL
Match: A0A445LQ30 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_004205 PE=4 SV=1)

HSP 1 Score: 637.9 bits (1644), Expect = 3.1e-179
Identity = 313/405 (77.28%), Positives = 356/405 (87.90%), Query Frame = 0

Query: 1    MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
            MKEE+ SM  N VWDLVELPK  KRV  KWVFKTK DS+GN+E+YKARLVAKG+TQKDG+
Sbjct: 913  MKEEIDSMEHNDVWDLVELPKGCKRVGYKWVFKTKRDSHGNLERYKARLVAKGFTQKDGI 972

Query: 61   DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
            DYKETFSP+S+KDS RIIMALVAHYDLEL+Q+DVKTAFL  +L+E V+MDQ  GF VEGK
Sbjct: 973  DYKETFSPVSRKDSFRIIMALVAHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSVEGK 1032

Query: 121  EYMVCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILVLYVD 180
            E+MVCKLK+SIY LKQASRQ YLKFND I SFGFKEN +D+C+ LK+SGSK + LVLYVD
Sbjct: 1033 EHMVCKLKKSIYGLKQASRQWYLKFNDTIVSFGFKENTVDRCVYLKVSGSKVMFLVLYVD 1092

Query: 181  DILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYINKVL 240
            DIL ATND GL  +TK+FLS NFEMKDMGEASYVIGIEIF +++ GLLGLSQKAYINKVL
Sbjct: 1093 DILLATNDLGLFHETKKFLSSNFEMKDMGEASYVIGIEIFRNRSQGLLGLSQKAYINKVL 1152

Query: 241  EKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 300
            E+F+M+KCS+S VPIQKGDKFSL Q PKN+LER QME IPYAS+VGS++YAQTCTRPDIS
Sbjct: 1153 ERFRMEKCSASPVPIQKGDKFSLAQCPKNDLERKQMEAIPYASVVGSIMYAQTCTRPDIS 1212

Query: 301  FVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTGCMDK 360
            F  GMLG YQSNPGM+HWK AKKVLRYLQGTKD++LTYKRSDHLEVIGYSD DF GC+D 
Sbjct: 1213 FATGMLGRYQSNPGMEHWKAAKKVLRYLQGTKDHILTYKRSDHLEVIGYSDSDFAGCVDT 1272

Query: 361  RKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHS 406
            RKST G++FLLA GAISWKSAKQS+VA STM+A FVACFEA + +
Sbjct: 1273 RKSTLGFVFLLAGGAISWKSAKQSVVAASTMEAAFVACFEATIQA 1317
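
The percentages in an HSP summary line follow directly from the residue counts it reports. The sketch below is a minimal illustration and not part of the database's own tooling: the helper name `parse_hsp_summary` and the regular expression are assumptions made for this example. It recomputes the identity and positives percentages for the Glycine soja hit above.

```python
import re

# Hypothetical pattern for the HSP summary lines shown on this page.
# "Posi?tives" also accepts the "Postives" spelling seen in some page exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Return residue counts and recomputed percentages from one summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positives, pos_length, _ = m.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned_length": int(length),
        # Percentages are count / aligned length, rounded to two decimals.
        "identity_pct": round(100 * int(identical) / int(length), 2),
        "positives_pct": round(100 * int(positives) / int(pos_length), 2),
    }

stats = parse_hsp_summary(
    "Identity = 313/405 (77.28%), Positives = 356/405 (87.90%), Query Frame = 0"
)
print(stats)
```

The recomputed values agree with the percentages printed in the summary line, which gives a quick consistency check when these lines are scraped from the page.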

BLAST of Cmc06g0163371 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 267.7 bits (683), Expect = 1.6e-71
Identity = 146/401 (36.41%), Positives = 233/401 (58.10%), Query Frame = 0

Query: 1   MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
           M +E+ +M     W++  LP   K + CKWV+K K +S+G IE+YKARLVAKGYTQ++G+
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGI 161

Query: 61  DYKETFSPISKKDSLRIIMALVAHYDLELYQIDVKTAFLYRNLDEKVFMDQLEGFMVEGK 120
           D+ ETFSP+ K  S+++I+A+ A Y+  L+Q+D+  AFL  +LDE+++M    G+     
Sbjct: 162 DFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQG 221

Query: 121 EYM----VCKLKRSIYELKQASRQCYLKFNDIITSFGFKENIIDQCICLKISGSKFIILV 180
           + +    VC LK+SIY LKQASRQ +LKF+  +  FGF ++  D    LKI+ + F+ ++
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281

Query: 181 LYVDDILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTHGLLGLSQKAYI 240
           +YVDDI+  +N+   + + K  L   F+++D+G   Y +G+EI   ++   + + Q+ Y 
Sbjct: 282 VYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEI--ARSAAGINICQRKYA 341

Query: 241 NKVLEKFKMDKCSSSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYAQTCTR 300
             +L++  +  C  S VP+     FS           + ++   Y  ++G L+Y Q  TR
Sbjct: 342 LDLLDETGLLGCKPSSVPMDPSVTFSA------HSGGDFVDAKAYRRLIGRLMYLQ-ITR 401

Query: 301 PDISFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDLDFTG 360
            DISF +  L  +   P + H +   K+L Y++GT    L Y     +++  +SD  F  
Sbjct: 402 LDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQS 461

Query: 361 CMDKRKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVA 398
           C D R+ST GY   L    ISWKS KQ +V+ S+ +AE+ A
Sbjct: 462 CKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRA 493

BLAST of Cmc06g0163371 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 111.7 bits (278), Expect = 1.5e-24
Identity = 83/246 (33.74%), Positives = 123/246 (50.00%), Query Frame = 0

Query: 175 LVLYVDDILFATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFCDQTH-GLLGLSQK 234
           L+LYVDDIL   +   LL      LS  F MKD+G   Y +GI+I   +TH   L LSQ 
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQI---KTHPSGLFLSQT 62

Query: 235 AYINKVLEKFKMDKCS--SSVVPIQKGDKFSLMQYPKNELERNQMETIPYASIVGSLLYA 294
            Y  ++L    M  C   S+ +P++     S  +YP         +   + SIVG+L Y 
Sbjct: 63  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP---------DPSDFRSIVGALQYL 122

Query: 295 QTCTRPDISFVMGMLGIYQSNPGMDHWKTAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSD 354
            T TRPDIS+ + ++      P +  +   K+VLRY++GT  + L   ++  L V  + D
Sbjct: 123 -TLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 182

Query: 355 LDFTGCMDKRKSTFGYLFLLAEGAISWKSAKQSIVAVSTMKAEFVACFEAIVHSFMAAEL 414
            D+ GC   R+ST G+   L    ISW + +Q  V+ S+ + E+ A       +  AAEL
Sbjct: 183 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRAL------ALTAAEL 226

Query: 415 YLRTWN 418
              TW+
Sbjct: 243 ---TWS 226

BLAST of Cmc06g0163371 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 79.7 bits (195), Expect = 6.2e-15
Identity = 36/79 (45.57%), Positives = 56/79 (70.89%), Query Frame = 0

Query: 1   MKEELKSMNDNKVWDLVELPKESKRVECKWVFKTKCDSNGNIEQYKARLVAKGYTQKDGV 60
           M+EEL +++ NK W LV  P     + CKWVFKTK  S+G +++ KARLVAKG+ Q++G+
Sbjct: 44  MQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGI 103

Query: 61  DYKETFSPISKKDSLRIIM 80
            + ET+SP+ +  ++R I+
Sbjct: 104 YFVETYSPVVRTATIRTIL 122

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
KAA0052755.1  1.4e-213  88.32     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
TYK04201.1    1.4e-213  88.32     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
TYK00088.1    5.0e-184  88.38     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
RYE20331.1    7.3e-183  82.90     hypothetical protein EOP45_11235, partial [Sphingobacteriaceae bacterium]
RZC25410.1    6.4e-179  77.28     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]

Match Name  E-value  Identity  Description
P10978      2.6e-98  46.13     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146      6.0e-71  37.13     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94      6.4e-65  37.37     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2      7.1e-64  36.62     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P25600      3.9e-30  28.43     Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

Match Name  E-value   Identity  Description
A0A5A7UG95  6.6e-214  88.32     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3BWW5  6.6e-214  88.32     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3BLU0  2.4e-184  88.38     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A4V1T029  3.5e-183  82.90     Reverse transcriptase Ty1/copia-type domain-containing protein (Fragment) OS=Sph...
A0A445LQ30  3.1e-179  77.28     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3...

Match Name   E-value  Identity  Description
AT4G23160.1  1.6e-71  36.41     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1  1.5e-24  33.74     DNA/RNA polymerases superfamily protein
ATMG00820.1  6.2e-15  45.57     Reverse transcriptase (RNA-dependent DNA polymerase)

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                                      Source       Source Term  Source Description   Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727      RVT_2                coord: 11..255; e-value: 3.4E-65; score: 220.0
None       No IPR available                                     PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 2..375
None       No IPR available                                     CDD          cd09272      RNase_HI_RT_Ty1      coord: 347..402; e-value: 8.41339E-22; score: 88.6793
IPR043502  DNA/RNA polymerase superfamily                       SUPERFAMILY  56672        DNA/RNA polymerases  coord: 11..396
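
The `coord:` values above give the span of each domain hit on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention), the covered length of each hit can be computed as follows. The helper name `span_length` and the `DOMAINS` mapping are hypothetical and exist only for this example.

```python
import re

# Matches the "coord: start..end" notation used in the InterPro table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_length(alignment):
    """Length of a domain hit, assuming 1-based inclusive coordinates."""
    m = COORD_RE.search(alignment)
    if m is None:
        raise ValueError("no coord field in %r" % alignment)
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1

# Source descriptions and coordinates copied from the InterPro table.
DOMAINS = {
    "RVT_2": "coord: 11..255",
    "FAMILY NOT NAMED": "coord: 2..375",
    "RNase_HI_RT_Ty1": "coord: 347..402",
    "DNA/RNA polymerases": "coord: 11..396",
}

lengths = {name: span_length(coord) for name, coord in DOMAINS.items()}
for name, length in lengths.items():
    print(name, length)
```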

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc06g0163371.1  Cmc06g0163371.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006952      defense response
biological_process  GO:0015074      DNA integration
biological_process  GO:0006278      RNA-dependent DNA biosynthetic process
biological_process  GO:0007165      signal transduction
molecular_function  GO:0043531      ADP binding
molecular_function  GO:0003887      DNA-directed DNA polymerase activity
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0003964      RNA-directed DNA polymerase activity
molecular_function  GO:0008270      zinc ion binding
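
For downstream use it can help to group the GO assignments by category. The following is a minimal sketch under the assumption that each row is whitespace-separated into category, accession, and term name. The function name `group_go_terms` is hypothetical, and the rows are copied from the table above.

```python
from collections import defaultdict

# GO rows copied from the annotation table on this page.
GO_LINES = """\
biological_process GO:0006952 defense response
biological_process GO:0015074 DNA integration
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
biological_process GO:0007165 signal transduction
molecular_function GO:0043531 ADP binding
molecular_function GO:0003887 DNA-directed DNA polymerase activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0008270 zinc ion binding
""".splitlines()

def group_go_terms(lines):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in lines:
        # Split on the first two runs of whitespace; the rest is the term name.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name.strip()))
    return dict(grouped)

go_by_category = group_go_terms(GO_LINES)
for category, terms in go_by_category.items():
    print(category, len(terms))
```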