Cmc06g0169941 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc06g0169941
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr06: 24714007 .. 24715728 (+)
RNA-Seq Expression: Cmc06g0169941
Synteny: Cmc06g0169941
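
As a quick sanity check on the Location field, the sketch below computes the feature length from its coordinates. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this), and the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
length_bp = feature_length(24714007, 24715728)

# A single-exon coding feature should be a whole number of codons.
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)
```

Under that assumption the feature spans 1722 bp, which is 574 codons with no remainder, that is, 573 residues plus a stop codon. This agrees with the 573-residue alignment length reported for the top BLAST hit.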
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAACGAGGATCAAACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTGTTATAATTGTAATAAGCTTGGTCATTTAGCTAGAAATTGTAGAAATAAGAGTCGTCCTGCTGCGCATGCGAACCTGATAAAAGATGAATTAGTAGCTATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACTAGTGCATCCCGCCATGTCTGCTACGACCTTAGTCTTTTTAGAAAATATAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACGACCAAGGTGGCCGGCATTGGAGAAGTAGAACTGAAATTCACATCCGGCAAGATGCTTGTGCTGAAGGAATTTCTGCATACTCCAGAAATTCAAAAGAATTTGGTCTCCAGATATCTCCTCAACAAGGCTGGATTCACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTGTTTGTGGAGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGTTAGACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTATGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCAGACTTATGTGAATTTGATGGCGCTTTAACTAGAAACAGTAAAAGGTATGTAATTACCTTTATAGATGATTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCATATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAAACAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAATTGAATATGATTCAGTTTCTTTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTATTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAATTGTTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGGAGAAATATTTAAGACTATTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACCATACGAAGTCCTTAAACATAAAGCACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGAGCCTATGAATGTGTCTTCATAGAATACGCTAAAAATAATAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATGGAATGACGTAGATTTTTTCGAGGACAAATTTCCTTTTAAATCTAGAAATAGTGGGGACCTATATAGTCAAATTAGTGGGGGCTCAATTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCTAGAAGAAGCAAGAGAGCTAGAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATAATGAAATGGACTCTCTTGAACCCAATAGAACTTGA

mRNA sequence

ATGAAACGAGGATCAAACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTGTTATAATTGTAATAAGCTTGGTCATTTAGCTAGAAATTGTAGAAATAAGAGTCGTCCTGCTGCGCATGCGAACCTGATAAAAGATGAATTAGTAGCTATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACTAGTGCATCCCGCCATGTCTGCTACGACCTTAGTCTTTTTAGAAAATATAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACGACCAAGGTGGCCGGCATTGGAGAAGTAGAACTGAAATTCACATCCGGCAAGATGCTTGTGCTGAAGGAATTTCTGCATACTCCAGAAATTCAAAAGAATTTGGTCTCCAGATATCTCCTCAACAAGGCTGGATTCACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTGTTTGTGGAGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGTTAGACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTATGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCAGACTTATGTGAATTTGATGGCGCTTTAACTAGAAACAGTAAAAGGTATGTAATTACCTTTATAGATGATTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCATATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAAACAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAATTGAATATGATTCAGTTTCTTTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTATTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAATTGTTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGGAGAAATATTTAAGACTATTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACCATACGAAGTCCTTAAACATAAAGCACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGAGCCTATGAATGTGTCTTCATAGAATACGCTAAAAATAATAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATGGAATGACGTAGATTTTTTCGAGGACAAATTTCCTTTTAAATCTAGAAATAGTGGGGACCTATATAGTCAAATTAGTGGGGGCTCAATTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCTAGAAGAAGCAAGAGAGCTAGAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATAATGAAATGGACTCTCTTGAACCCAATAGAACTTGA

Coding sequence (CDS)

ATGAAACGAGGATCAAACAAACAAAACAACCCACAGTCCAGAAGTACGGTACAAATTGTTTGTTATAATTGTAATAAGCTTGGTCATTTAGCTAGAAATTGTAGAAATAAGAGTCGTCCTGCTGCGCATGCGAACCTGATAAAAGATGAATTAGTAGCTATGATATCTAAAGTTAATGTGATTGGGGGGTCTGAAGGTTGGTGGCTAGACACTAGTGCATCCCGCCATGTCTGCTACGACCTTAGTCTTTTTAGAAAATATAATGAAGTTAAGGATAAAAATATCCTTCTAGGAGATCATCACACGACCAAGGTGGCCGGCATTGGAGAAGTAGAACTGAAATTCACATCCGGCAAGATGCTTGTGCTGAAGGAATTTCTGCATACTCCAGAAATTCAAAAGAATTTGGTCTCCAGATATCTCCTCAACAAGGCTGGATTCACACAAACCATAGGATCAGACTTGTTTACTTTAACTAAAAACAATGTGTTTGTGGAGAAGGGTTACGCTACTGATGGCATGTTCAAATTGAATCTGGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGTTAGACTTTGTCATGTTAATAAAAGATTAATTAGTAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAACCTCGCATAAGTATGTAACTAGAGTAACAGAGCCTTTAGAATTAATTCATTCAGACTTATGTGAATTTGATGGCGCTTTAACTAGAAACAGTAAAAGGTATGTAATTACCTTTATAGATGATTGTTCTGACTACACTTTTATTTATCTGCTTAAAAATAAAAGTGATGCATATGAAATGTTCAAAGTCTTTGTAACTGAAATAGAGAAACAGTTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAATTGAATATGATTCAGTTTCTTTCAATGAGTTTTATAGCTCAAAAGGAATAATACATGAAACTATTGCGCCTTATTCTCCTGAAATGAATGGAAAAGCAGAAAGAAAGAATAGAACTCTAACTGAGTTAGTAATTGTTATCTTACTTGAGTCAGGAGCCGCACCATCTTGGTGGGGAGAAATATTTAAGACTATTAATTATGTTCTTAATAGGATTCCTAAATCAAACAGTAAAACTTCACCATACGAAGTCCTTAAACATAAAGCACCAAACCTATCTTATCTTCGAACTTGGGGTTGTCTAGCTTATGTTAGAATACCTGATCCAAAAAGAAGGAAATTAGCAAGTAGAGCCTATGAATGTGTCTTCATAGAATACGCTAAAAATAATAAAGCCTATAGATTCTATGACTTAGAAAACAAAGTAATTATAGAATGGAATGACGTAGATTTTTTCGAGGACAAATTTCCTTTTAAATCTAGAAATAGTGGGGACCTATATAGTCAAATTAGTGGGGGCTCAATTTCCAATAGTCTACCTTCAATTAGGATCCAAACCCAAGACAAGGAAGTAGATCCTGAACCTAGAAGAAGCAAGAGAGCTAGAACAGTAAAAGACTTTGGAGAAGACTTCGAAATGTACAACGTAGAAGATCCAAAAGATCTAACAGAAGCATTGTCATCAGTAGATGCTAATTTATGGCAAGAAGCTATCAATAATGAAATGGACTCTCTTGAACCCAATAGAACTTGA

Protein sequence

MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT
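
The relationship between the coding sequence and the protein sequence above can be checked with a small translation sketch. It assumes the standard genetic code (NCBI translation table 1); the helper name `translate` and the choice to stop at the first stop codon are illustrative choices, not something the database page specifies.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt and last 15 nt of the CDS listed on this page.
print(translate("ATGAAACGAGGATCAAACAAACAAAACAACCCACAGTCC"))
print(translate("CCCAATAGAACTTGA"))
```

The two fragments translate to the start (`MKRGSNKQNNPQS`) and the end (`PNRT`) of the protein sequence shown above; passing the full CDS string to `translate` would reproduce the whole polypeptide in the same way.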
Homology
BLAST of Cmc06g0169941 vs. NCBI nr
Match: KAA0034938.1 (putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 1047.3 bits (2707), Expect = 4.7e-302
Identity = 515/573 (89.88%), Positives = 539/573 (94.07%), Query Frame = 0

Query: 1   MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNV 60
           MKRGSNKQNN QSRSTVQI CYNCNK GHLA+NCRN+SRPAA ANLI+DELVAMISKVNV
Sbjct: 105 MKRGSNKQNNLQSRSTVQIFCYNCNKPGHLAKNCRNRSRPAAQANLIEDELVAMISKVNV 164

Query: 61  IGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKM 120
           IGGSEGWWLDT AS HVC++LSLFRKYNEVKDKNILLGDHHTTKV GIGEVELKFTS K 
Sbjct: 165 IGGSEGWWLDTGASHHVCHELSLFRKYNEVKDKNILLGDHHTTKVVGIGEVELKFTSDKT 224

Query: 121 LVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLE 180
           LV+KE LHTPEI+KNLV  YLLNKAGFTQTIGS+LFTLTKNNVFV KGYATDGMFKLNLE
Sbjct: 225 LVVKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFTLTKNNVFVGKGYATDGMFKLNLE 284

Query: 181 INKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT 240
           INKIASSAYMLTSFNVWH RLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT
Sbjct: 285 INKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT 344

Query: 241 SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF 300
           SHK+VTRVT+PLELIHSDLCEFDG LTRNSKRYV+TFIDDCSDYTFIYLLKNKSDAYEMF
Sbjct: 345 SHKFVTRVTKPLELIHSDLCEFDGTLTRNSKRYVVTFIDDCSDYTFIYLLKNKSDAYEMF 404

Query: 301 KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKN 360
           KVFVTEIE QFNKRIKRLRSDRG EYDSV+FNEFY+SKGIIHET  PYSPEMNGK ERKN
Sbjct: 405 KVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYNSKGIIHETTTPYSPEMNGKEERKN 464

Query: 361 RTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRT 420
           RTLTEL + ILLES AAPSWWGEI KT+NYVLNRIPKSNSKTSPYEVLKHK PNLSYLRT
Sbjct: 465 RTLTELAVAILLESEAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKIPNLSYLRT 524

Query: 421 WGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFP 480
           WGCLAYVRIP+P+RRKLAS+AYECVFI YA+N+KAYRFYDLENKVIIE NDVDFFEDKFP
Sbjct: 525 WGCLAYVRIPNPERRKLASKAYECVFIGYAENSKAYRFYDLENKVIIESNDVDFFEDKFP 584

Query: 481 FKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVE 540
           FKSRNSG LYSQ SGGS  +SLPSIRIQTQDKEVDPEPRRSKRARTVKDF EDFEMYNVE
Sbjct: 585 FKSRNSGGLYSQTSGGSSFSSLPSIRIQTQDKEVDPEPRRSKRARTVKDFREDFEMYNVE 644

Query: 541 DPKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           DPKDLT+ALSSVDANLWQEAIN+ +DSLE NRT
Sbjct: 645 DPKDLTKALSSVDANLWQEAINDGIDSLESNRT 677
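
The HSP summary lines in this section follow a regular pattern, so their numbers can be extracted and re-derived programmatically. The sketch below is one possible way to do that with regular expressions; the function name `parse_hsp` and the returned dictionary keys are hypothetical, and the patterns are written only against the line layout seen on this page.

```python
import re

# Patterns written against the HSP header layout shown on this page.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(r"Identity = (\d+)/(\d+)")


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract score, E-value and identity counts from two HSP header lines."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    identical, aligned = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "aligned": aligned,
        # Percent identity = identical positions / alignment length.
        "pct_identity": round(100 * identical / aligned, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 1047.3 bits (2707), Expect = 4.7e-302",
    "Identity = 515/573 (89.88%)",
)
print(hsp["bits"], hsp["evalue"], hsp["pct_identity"])
```

For the top hit, the recomputed percent identity (515 of 573 positions) matches the 89.88% printed in the header line.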

BLAST of Cmc06g0169941 vs. NCBI nr
Match: ABI34306.1 (Polyprotein, putative [Solanum demissum])

HSP 1 Score: 582.0 bits (1499), Expect = 5.5e-162
Identity = 296/558 (53.05%), Positives = 389/558 (69.71%), Query Frame = 0

Query: 21  CYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCY 80
           C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++   +GWW D+ A+RHVCY
Sbjct: 263 CFVCGKSGHIARFCRFRKRGPNPQANVTEEPFVAVITDINMVENVDGWWADSGANRHVCY 322

Query: 81  DLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVS 140
           D   F+KY   ++ K I+LGD HTT+V G G+VEL FTSG++L LK+ L+TP ++K L+S
Sbjct: 323 DKDWFKKYTHFEEPKTIMLGDSHTTQVLGTGDVELCFTSGRVLTLKDVLYTPSMRKKLMS 382

Query: 141 RYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWH 200
            +LLNKAGF Q I S+ + + K  +FV KGYA DGMFKLN+E+NK ++S YML+S N WH
Sbjct: 383 SFLLNKAGFKQIIESNQYVIVKKGIFVGKGYACDGMFKLNVEMNKTSTSVYMLSSTNFWH 442

Query: 201 VRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSD 260
            RLCH+N R +  MS L LIP +   +FEKC  CS+AKITK  H  V R T+ LEL+H+D
Sbjct: 443 ARLCHINDRYVGIMSSLGLIPMIK-KNFEKCEACSKAKITKRPHFQVERKTDLLELVHTD 502

Query: 261 LCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRL 320
           +CE  G LTR   RY ITFIDD S +T++YL+KNKSDA+E FK ++ E+E QF ++IKR+
Sbjct: 503 ICELGGILTRGGNRYFITFIDDFSKFTYVYLMKNKSDAFENFKTYLHEVENQFGRKIKRI 562

Query: 321 RSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAP 380
           RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKNRTL EL   +L+ES A  
Sbjct: 563 RSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGVAERKNRTLVELTNAMLIESHAPL 622

Query: 381 SWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLA 440
           ++WGE   T  YVLNR+P   SK +P+E+ K   P+L YLR WGCLA+VR+ DPK  KL 
Sbjct: 623 NFWGEAILTACYVLNRVPHKKSKLTPFELWKGYKPSLGYLRVWGCLAFVRLMDPKITKLG 682

Query: 441 SRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSI 500
            +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFPF S+NSG    +I    +
Sbjct: 683 KKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKFPFDSKNSGG--QRIEQNIL 742

Query: 501 SNSLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYNVEDPK-DLTEALSSVDAN 560
             +LPS    T ++KEV D E RRSKRAR  KDFG DF ++NV D +  L EALSS D+ 
Sbjct: 743 --TLPSSSTSTLKNKEVNDFELRRSKRARVEKDFGPDFYVFNVGDDRLTLKEALSSHDSI 802

Query: 561 LWQEAINNEMDSLEPNRT 574
            W+EA+N+EM+SL  N+T
Sbjct: 803 FWKEAVNDEMESLISNKT 815

BLAST of Cmc06g0169941 vs. NCBI nr
Match: KAG5527251.1 (hypothetical protein RHGRI_028223 [Rhododendron griersonianum])

HSP 1 Score: 496.5 bits (1277), Expect = 3.1e-136
Identity = 270/597 (45.23%), Positives = 370/597 (61.98%), Query Frame = 0

Query: 4   GSNKQNNPQSRSTVQ--IVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNVI 63
           GSN+ NN + R+  +    CYNC K GH A++CR+K +  + AN+++++LVAM++++N+ 
Sbjct: 151 GSNRPNNKRKRNNKKKNDKCYNCGKKGHYAKDCRSKKQKTS-ANMVEEQLVAMVTEINMA 210

Query: 64  GGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKML 123
             S GWW D+ A+ HVC D SLF+ Y ++  +   +G+    KVAG G  EL FTSGK L
Sbjct: 211 DTSSGWWFDSGATVHVCKDRSLFKTYEKLDGQEAQMGNQDCAKVAGKGSAELNFTSGKTL 270

Query: 124 VLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEI 183
            L   LH P+++KNLVS  L+ K GF     SD   LTKN +FV KGY  +GMFKL++  
Sbjct: 271 TLLNVLHVPDMRKNLVSVDLVCKKGFRVVFESDKLILTKNGMFVGKGYGCNGMFKLSINE 330

Query: 184 NKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHD--FEKCACCSQAKITK 243
           N  ASS Y++ SF++WH RL H+N R I NMSR  LI   + HD   +KC  C++AK+ K
Sbjct: 331 NNTASS-YIVDSFSLWHSRLAHLNFRSIKNMSRFGLI---NYHDNVCDKCEICAKAKMAK 390

Query: 244 TSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEM 303
            S   V R +E L+LIHSD+CE +G LTR  KRY  TF+DD S YTF+YLL+ K + +  
Sbjct: 391 KSFPSVQRNSEILDLIHSDICELNGVLTRGGKRYFATFVDDFSKYTFVYLLRTKDEVFNK 450

Query: 304 FKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERK 363
           F+ +  E+E Q NK+IK LRSDRG+EY    F++F    GIIH+  APYSP+ NG AERK
Sbjct: 451 FQAYKNEVENQLNKKIKVLRSDRGMEYFPHEFDDFCEMHGIIHQKTAPYSPQQNGLAERK 510

Query: 364 NRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLR 423
           NRTLTE+   +++ + A    WGE   T  Y+ NRI    +   PYE+ K + PNLSYL+
Sbjct: 511 NRTLTEMANCMIVHANAPLYLWGEALYTACYLHNRIISRKTNLCPYELWKGRKPNLSYLK 570

Query: 424 TWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKF 483
            WGCLAY R+PDPKR KL  RA + +F+ YA+++KAYR  DLE+  I+E  +V FFE KF
Sbjct: 571 VWGCLAYYRVPDPKRTKLGPRAMKSIFVGYAEHSKAYRLLDLESNTIVESRNVKFFEHKF 630

Query: 484 PFKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDF----- 543
              S++   + ++ SG S++  +   +   +   ++ EPRRS R R  K    D      
Sbjct: 631 RADSKS---VVTENSGSSLNQEVEP-QNGVKRHVLNEEPRRSTRQRKEKGLDPDHISSQS 690

Query: 544 EMYNVE------------------DPKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
            ++ VE                  DPK  TEA+SS DA  W+EAIN+EMDSL  N T
Sbjct: 691 IIFLVEGDREGVTNKIPMVLQIEADPKTFTEAISSRDAAFWKEAINDEMDSLLSNGT 738

BLAST of Cmc06g0169941 vs. NCBI nr
Match: AAU90333.1 (Putative gag and pol polyprotein, identical [Solanum demissum])

HSP 1 Score: 493.4 bits (1269), Expect = 2.6e-135
Identity = 271/576 (47.05%), Positives = 358/576 (62.15%), Query Frame = 0

Query: 5   SNKQNNPQSRSTVQI--VCYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVI 64
           +N +NN      VQ    C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++
Sbjct: 92  NNGENNQAQNQQVQDKGPCFVCGKSGHIARFCRFRKRGPNPQANVTEEPFVAVITDINMV 151

Query: 65  GGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSGKM 124
              +GWW+D+ A+RHVCYD   F+KY   ++ K I+LGD HTT+V G G+VEL F+SG+ 
Sbjct: 152 ENVDGWWVDSGANRHVCYDKDWFKKYTHFEEPKTIMLGDAHTTQVLGKGDVELCFSSGRE 211

Query: 125 LVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLE 184
           L LK+ L+TP ++KNL+S +L NK GF Q I SD + + K  +FV KG            
Sbjct: 212 LTLKDVLYTPSMRKNLMSSFLFNKVGFKQIIESDQYVIVKKGIFVGKG------------ 271

Query: 185 INKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT 244
                                             L LIP +   +FEKC  CS+AKITK 
Sbjct: 272 ----------------------------------LRLIPMIK-KNFEKCEACSKAKITKR 331

Query: 245 SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF 304
            H  V R T  LEL+H+D+CE  G LTR   R  ITFIDD S +T++YL+KNKSDA+E F
Sbjct: 332 PHFQVERKTNLLELVHTDICELGGILTRGGNRNFITFIDDFSKFTYVYLMKNKSDAFENF 391

Query: 305 KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKN 364
           K ++ E+E QF ++IKR+RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKN
Sbjct: 392 KTYLHEVENQFGRKIKRIRSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGAAERKN 451

Query: 365 RTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRT 424
           RTL EL   +L+ES A  ++WGE   T  YVLNR+P   SK + +E+ K   P+L YLR 
Sbjct: 452 RTLVELTNAMLIESHAPLNFWGETILTACYVLNRVPHKKSKLTHFELWKGYKPSLGYLRV 511

Query: 425 WGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFP 484
           WGCLA+VR+ DPK  KL  +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFP
Sbjct: 512 WGCLAFVRLMDPKITKLGKKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKFP 571

Query: 485 FKSRNSGDLYSQISGGSISNSLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYN 544
           F S+NSG    +I    +  SLPS    T ++KEV D E RRSKRAR  KDFG +F ++N
Sbjct: 572 FDSKNSGG--QRIEQNIL--SLPSSSTSTLKNKEVNDFELRRSKRARIEKDFGPNFYVFN 616

Query: 545 V-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           V +DP  L EALSS D+  W+EA+N+EM+SL  N+T
Sbjct: 632 VGDDPLTLKEALSSHDSIFWKEAVNDEMESLISNKT 616

BLAST of Cmc06g0169941 vs. NCBI nr
Match: XP_021732277.1 (uncharacterized protein LOC110699091 [Chenopodium quinoa])

HSP 1 Score: 485.7 bits (1249), Expect = 5.4e-133
Identity = 273/576 (47.40%), Positives = 328/576 (56.94%), Query Frame = 0

Query: 2   KRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPA-AHANLIKDELVAMISKVNV 61
           K    +Q+ P    T Q +CY C K GH+AR CRN   P  A A++I++  VAMI+++N+
Sbjct: 434 KNNGRQQSQPPKNGTGQFLCYRCGKPGHMARKCRNMPNPVPAQASMIEEPFVAMITEINL 493

Query: 62  IGGSEGWWLDTSASRHVCYDLSLFRKYNE-VKDKNILLGDHHTTKVAGIGEVELKFTSGK 121
            GGS+GWW+DT A+RHVCYD  +F+ Y E   DK +LLGD H+T +AG+G VELKFTSG+
Sbjct: 494 TGGSDGWWIDTGATRHVCYDRRMFKTYTEKTDDKKVLLGDSHSTNIAGVGNVELKFTSGR 553

Query: 122 MLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNL 181
            L+LK+ LHTPE++KNLVS +LLNKAGF QTIGSDLFTLTKN +FV KGYATDGMFKLN+
Sbjct: 554 TLILKDVLHTPEMRKNLVSGFLLNKAGFIQTIGSDLFTLTKNGIFVGKGYATDGMFKLNV 613

Query: 182 EINKIASSAYMLTS-FNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKIT 241
           E+NKI++SAYML S  NVWH RLCHVNKRLI NMS L LIP +SL+DF+KC  CSQAKIT
Sbjct: 614 EVNKISNSAYMLCSTINVWHTRLCHVNKRLIKNMSNLGLIPNISLNDFDKCDSCSQAKIT 673

Query: 242 KTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYE 301
           KT HK V R +EPL+LIHSD+CE +G LTRN +RY ITFIDDCSDYT IYL+KNKSDA+E
Sbjct: 674 KTPHKSVIRNSEPLDLIHSDICELNGTLTRNGQRYFITFIDDCSDYTCIYLMKNKSDAFE 733

Query: 302 MFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAER 361
           M                                                        AER
Sbjct: 734 M--------------------------------------------------------AER 793

Query: 362 KNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYL 421
           KNRT TELV+ I L SGAA  W                                      
Sbjct: 794 KNRTFTELVVAISLYSGAADHW-------------------------------------- 847

Query: 422 RTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDK 481
                                                                       
Sbjct: 854 ------------------------------------------------------------ 847

Query: 482 FPFKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYN 541
           FPFKSRN        SGG+ S+ +P  R  +QD + +PE R+SKRAR  KDFG DF + N
Sbjct: 914 FPFKSRN--------SGGTSSSIIPKDRSNSQDLDAEPELRKSKRARVAKDFGPDFVVLN 847

Query: 542 V-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           V EDP  L EAL+SVDA+LWQEA+N+EMDSLE NRT
Sbjct: 974 VEEDPSTLQEALTSVDADLWQEAVNDEMDSLESNRT 847

BLAST of Cmc06g0169941 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 4.4e-69
Identity = 183/634 (28.86%), Positives = 303/634 (47.79%), Query Frame = 0

Query: 2   KRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRN--KSRPAAHANLIKDELVAMISK-- 61
           + G+  ++  +S+S V+  CYNCN+ GH  R+C N  K +         D   AM+    
Sbjct: 214 RSGARGKSKNRSKSRVR-NCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNND 273

Query: 62  ------------VNVIGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKV 121
                       +++ G    W +DT+AS H      LF +Y       + +G+   +K+
Sbjct: 274 NVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKI 333

Query: 122 AGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFV 181
           AGIG++ +K   G  LVLK+  H P+++ NL+S   L++ G+     +  + LTK ++ +
Sbjct: 334 AGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVI 393

Query: 182 EKGYATDGMFKLNLEI-NKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLH 241
            KG A   +++ N EI     ++A    S ++WH R+ H++++ +  +++ +LI      
Sbjct: 394 AKGVARGTLYRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGT 453

Query: 242 DFEKCACCSQAKITKTSHKYVT-RVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSD 301
             + C  C   K  + S +  + R    L+L++SD+C      +    +Y +TFIDD S 
Sbjct: 454 TVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASR 513

Query: 302 YTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHE 361
             ++Y+LK K   +++F+ F   +E++  +++KRLRSD G EY S  F E+ SS GI HE
Sbjct: 514 KLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHE 573

Query: 362 TIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPK-SNSKT 421
              P +P+ NG AER NRT+ E V  +L  +    S+WGE  +T  Y++NR P    +  
Sbjct: 574 KTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFE 633

Query: 422 SPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLE 481
            P  V  +K  + S+L+ +GC A+  +P  +R KL  ++  C+FI Y      YR +D  
Sbjct: 634 IPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPV 693

Query: 482 NKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISN--SLPS---------------- 541
            K +I   DV F E +     R + D+  ++  G I N  ++PS                
Sbjct: 694 KKKVIRSRDVVFRESEV----RTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVS 753

Query: 542 ----------------------IRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED- 574
                                 +   TQ +E     RRS+R R         E   + D 
Sbjct: 754 EQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDD 813

BLAST of Cmc06g0169941 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 193.4 bits (490), Expect = 7.3e-48
Identity = 159/590 (26.95%), Positives = 268/590 (45.42%), Query Frame = 0

Query: 12  QSRSTVQIVCYNCNKLGHLARNC---------RNKSRPAAHANLIKDELVAMISKVN--V 71
           +  S  ++ C++C + GH+ ++C         +NK             +  M+ +VN   
Sbjct: 223 KGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNTS 282

Query: 72  IGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKM 131
           +  + G+ LD+ AS H+  D SL+    EV     +        +       ++  +   
Sbjct: 283 VMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHE 342

Query: 132 LVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLE 191
           + L++ L   E   NL+S   L +AG +        T++KN + V K     GM      
Sbjct: 343 ITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVK---NSGMLNNVPV 402

Query: 192 INKIASS--AYMLTSFNVWHVRLCHVN---------KRLISNMSRLNLIPKLSLHDFEKC 251
           IN  A S  A    +F +WH R  H++         K + S+ S LN + +LS    E C
Sbjct: 403 INFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNL-ELSCEICEPC 462

Query: 252 ACCSQAKITKTSHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYL 311
               QA++     K  T +  PL ++HSD+C     +T + K Y + F+D  + Y   YL
Sbjct: 463 LNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYL 522

Query: 312 LKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYS 371
           +K KSD + MF+ FV + E  FN ++  L  D G EY S    +F   KGI +    P++
Sbjct: 523 IKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHT 582

Query: 372 PEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKS---NSKTSPYE 431
           P++NG +ER  RT+TE    ++  +    S+WGE   T  Y++NRIP     +S  +PYE
Sbjct: 583 PQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYE 642

Query: 432 VLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVI 491
           +  +K P L +LR +G   YV I + K+ K   ++++ +F+ Y  N   ++ +D  N+  
Sbjct: 643 MWHNKKPYLKHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGYEPN--GFKLWDAVNEKF 702

Query: 492 IEWNDVDFFE-DKFPFKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPE-PRRSKRA 551
           I   DV   E +    ++     ++ + S  S + + P+       K +  E P  SK  
Sbjct: 703 IVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPN----DSRKIIQTEFPNESKEC 762

Query: 552 RTVKDFGEDFEMYNVEDPKDLTEALSSVDANLWQEAINNEM--DSLEPNR 573
             ++   +  E  N   P D  + + +   N  +E  N +   DS E N+
Sbjct: 763 DNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNK 801

BLAST of Cmc06g0169941 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 1.3e-44
Identity = 146/496 (29.44%), Positives = 226/496 (45.56%), Query Frame = 0

Query: 9   NNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIK--DELVAMISKVNVIGG--- 68
           NN QS+  +   C  C   GH A+ C       +  N  +          + N+  G   
Sbjct: 268 NNNQSKPYLG-KCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRANLALGSPY 327

Query: 69  -SEGWWLDTSASRHVCYD---LSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGK 128
            S  W LD+ A+ H+  D   LSL + Y    D  +++ D  T  ++  G   L  T  +
Sbjct: 328 SSNNWLLDSGATHHITSDFNNLSLHQPYTGGDD--VMVADGSTIPISHTGSTSLS-TKSR 387

Query: 129 MLVLKEFLHTPEIQKNLVSRY-LLNKAGFTQTIGSDLFTLTKNNVFVE--KGYATDGMFK 188
            L L   L+ P I KNL+S Y L N  G +       F +   N  V   +G   D +++
Sbjct: 388 PLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYE 447

Query: 189 LNLEINK---IASSAYMLTSFNVWHVRLCH----VNKRLISNMSRLNLIPKLSLHDFEKC 248
             +  ++   + +S     + + WH RL H    +   +ISN S   L P    H F  C
Sbjct: 448 WPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVISNYSLSVLNPS---HKFLSC 507

Query: 249 ACCSQAKITKTSHKYVT-RVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIY 308
           + C   K  K      T   T PLE I+SD+      L+ ++ RY + F+D  + YT++Y
Sbjct: 508 SDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLY 567

Query: 309 LLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPY 368
            LK KS   E F  F   +E +F  RI    SD G E+  V+  E++S  GI H T  P+
Sbjct: 568 PLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEF--VALWEYFSQHGISHLTSPPH 627

Query: 369 SPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSK-TSPYEV 428
           +PE NG +ERK+R + E  + +L  +    ++W   F    Y++NR+P    +  SP++ 
Sbjct: 628 TPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQK 687

Query: 429 LKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVII 484
           L   +PN   LR +GC  Y  +    + KL  ++ +CVF+ Y+    AY    L+   + 
Sbjct: 688 LFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLY 747

BLAST of Cmc06g0169941 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 2.0e-42
Identity = 142/526 (27.00%), Positives = 240/526 (45.63%), Query Frame = 0

Query: 3   RGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMIS------ 62
           R  N+Q  P         C  C+  GH A+ C    +  +  N  + +  +  +      
Sbjct: 245 RSDNRQPKPYLGR-----CQICSVQGHSAKRCPQLHQFQSTTN--QQQSTSPFTPWQPRA 304

Query: 63  --KVNVIGGSEGWWLDTSASRHVCYD---LSLFRKYNEVKDKNILLGDHHTTKVAGIGEV 122
              VN    +  W LD+ A+ H+  D   LS  + Y    D  +++ D  T  +   G  
Sbjct: 305 NLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDD--VMIADGSTIPITHTGSA 364

Query: 123 ELKFTSGKMLVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLT------KNNVFV 182
            L  TS + L L + L+ P I KNL+S Y L     T  +  + F  +         V +
Sbjct: 365 SLP-TSSRSLDLNKVLYVPNIHKNLISVYRLCN---TNRVSVEFFPASFQVKDLNTGVPL 424

Query: 183 EKGYATDGMFKLNLEINKIAS---SAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLS 242
            +G   D +++  +  ++  S   S     + + WH RL H +  +++++   + +P L+
Sbjct: 425 LQGKTKDELYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLN 484

Query: 243 -LHDFEKCACCSQAKITKT--SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFID 302
             H    C+ C   K  K   S+  +T  ++PLE I+SD+      L+ ++ RY + F+D
Sbjct: 485 PSHKLLSCSDCFINKSHKVPFSNSTITS-SKPLEYIYSDVWS-SPILSIDNYRYYVIFVD 544

Query: 303 DCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKG 362
             + YT++Y LK KS   + F +F + +E +F  RI  L SD G E+  V   ++ S  G
Sbjct: 545 HFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEF--VVLRDYLSQHG 604

Query: 363 IIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSN 422
           I H T  P++PE NG +ERK+R + E+ + +L  +    ++W   F    Y++NR+P   
Sbjct: 605 ISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPL 664

Query: 423 SK-TSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRF 482
            +  SP++ L  + PN   L+ +GC  Y  +    R KL  ++ +C F+ Y+    AY  
Sbjct: 665 LQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLC 724

Query: 483 YDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSISNSLPS 505
             +    +     V F E  FPF + N G   SQ      + + PS
Sbjct: 725 LHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPS 753

BLAST of Cmc06g0169941 vs. ExPASy Swiss-Prot
Match: P47024 (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 80.9 bits (198), Expect = 5.3e-14
Identity = 91/440 (20.68%), Positives = 179/440 (40.68%), Query Frame = 0

Query: 69  LDTSASRHVCYDLSLFRKYNEVKDKNIL--LGDHHTTKVAGIGEVELK----FTSGKMLV 128
           +DT +  ++  D +L   Y +         +G + +  V G G +++K     T  K L+
Sbjct: 414 IDTGSGVNITNDKTLLHNYEDSNRSTRFFGIGKNSSVSVKGYGYIKIKNGHNNTDNKCLL 473

Query: 129 LKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNL--- 188
                + PE +  ++S Y L K   T+ + S  +T   N +   K    +G+  + +   
Sbjct: 474 T---YYVPEEESTIISCYDLAKK--TKMVLSRKYTRLGNKIIKIKTKIVNGVIHVKMNEL 533

Query: 189 --------EINKI---ASSAYMLTSFNVW----HVRLCHVNKRLISNMSRLNLIPKLSLH 248
                   +IN I   +S  + L   ++     H R+ H   + I N  + N   + SL 
Sbjct: 534 IERPSDDSKINAIKPTSSPGFKLNKRSITLEDAHKRMGHTGIQQIENSIKHNHYEE-SLD 593

Query: 249 DFEK-----CACCSQAKITKTSHKYVTRVT------EPLELIHSDLCEFDGALTRNSKRY 308
             ++     C  C  +K TK +H Y   +       EP      D+     +   ++KRY
Sbjct: 594 LIKEPNEFWCQTCKISKATKRNH-YTGSMNNHSTDHEPGSSWCMDIFGPVSSSNADTKRY 653

Query: 309 VITFIDDCSDY--TFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRLRSDRGIEYDSVSF 368
           ++  +D+ + Y  T  +  KN        +  +  +E QF+++++ + SDRG E+ +   
Sbjct: 654 MLIMVDNNTRYCMTSTHFNKNAETILAQVRKNIQYVETQFDRKVREINSDRGTEFTNDQI 713

Query: 369 NEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAPSWWGEIFKTINYV 428
            E++ SKGI H   +      NG+AER  RT+      +L +S     +W     +   +
Sbjct: 714 EEYFISKGIHHILTSTQDHAANGRAERYIRTIITDATTLLRQSNLRVKFWEYAVTSATNI 773

Query: 429 LNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLASRAYECVFIEYAK 471
            N +   ++   P + +  +   +  +          I +   +KL       + +    
Sbjct: 774 RNYLEHKSTGKLPLKAISRQPVTVRLMSFLPFGEKGIIWNHNHKKLKPSGLPSIILCKDP 833

BLAST of Cmc06g0169941 vs. ExPASy TrEMBL
Match: A0A5D3DCJ1 (Putative Polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1199G00010 PE=4 SV=1)

HSP 1 Score: 1047.3 bits (2707), Expect = 2.3e-302
Identity = 515/573 (89.88%), Positives = 539/573 (94.07%), Query Frame = 0

Query: 1   MKRGSNKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAAHANLIKDELVAMISKVNV 60
           MKRGSNKQNN QSRSTVQI CYNCNK GHLA+NCRN+SRPAA ANLI+DELVAMISKVNV
Sbjct: 105 MKRGSNKQNNLQSRSTVQIFCYNCNKPGHLAKNCRNRSRPAAQANLIEDELVAMISKVNV 164

Query: 61  IGGSEGWWLDTSASRHVCYDLSLFRKYNEVKDKNILLGDHHTTKVAGIGEVELKFTSGKM 120
           IGGSEGWWLDT AS HVC++LSLFRKYNEVKDKNILLGDHHTTKV GIGEVELKFTS K 
Sbjct: 165 IGGSEGWWLDTGASHHVCHELSLFRKYNEVKDKNILLGDHHTTKVVGIGEVELKFTSDKT 224

Query: 121 LVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLE 180
           LV+KE LHTPEI+KNLV  YLLNKAGFTQTIGS+LFTLTKNNVFV KGYATDGMFKLNLE
Sbjct: 225 LVVKEGLHTPEIRKNLVFGYLLNKAGFTQTIGSNLFTLTKNNVFVGKGYATDGMFKLNLE 284

Query: 181 INKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT 240
           INKIASSAYMLTSFNVWH RLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT
Sbjct: 285 INKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT 344

Query: 241 SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF 300
           SHK+VTRVT+PLELIHSDLCEFDG LTRNSKRYV+TFIDDCSDYTFIYLLKNKSDAYEMF
Sbjct: 345 SHKFVTRVTKPLELIHSDLCEFDGTLTRNSKRYVVTFIDDCSDYTFIYLLKNKSDAYEMF 404

Query: 301 KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKN 360
           KVFVTEIE QFNKRIKRLRSDRG EYDSV+FNEFY+SKGIIHET  PYSPEMNGK ERKN
Sbjct: 405 KVFVTEIENQFNKRIKRLRSDRGTEYDSVAFNEFYNSKGIIHETTTPYSPEMNGKEERKN 464

Query: 361 RTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRT 420
           RTLTEL + ILLES AAPSWWGEI KT+NYVLNRIPKSNSKTSPYEVLKHK PNLSYLRT
Sbjct: 465 RTLTELAVAILLESEAAPSWWGEIIKTVNYVLNRIPKSNSKTSPYEVLKHKIPNLSYLRT 524

Query: 421 WGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFP 480
           WGCLAYVRIP+P+RRKLAS+AYECVFI YA+N+KAYRFYDLENKVIIE NDVDFFEDKFP
Sbjct: 525 WGCLAYVRIPNPERRKLASKAYECVFIGYAENSKAYRFYDLENKVIIESNDVDFFEDKFP 584

Query: 481 FKSRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVE 540
           FKSRNSG LYSQ SGGS  +SLPSIRIQTQDKEVDPEPRRSKRARTVKDF EDFEMYNVE
Sbjct: 585 FKSRNSGGLYSQTSGGSSFSSLPSIRIQTQDKEVDPEPRRSKRARTVKDFREDFEMYNVE 644

Query: 541 DPKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           DPKDLT+ALSSVDANLWQEAIN+ +DSLE NRT
Sbjct: 645 DPKDLTKALSSVDANLWQEAINDGIDSLESNRT 677

BLAST of Cmc06g0169941 vs. ExPASy TrEMBL
Match: A0A7N2L531 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 624.4 bits (1609), Expect = 4.7e-175
Identity = 323/572 (56.47%), Positives = 403/572 (70.45%), Query Frame = 0

Query: 6   NKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAA-HANLIKDELVAMISKVNVIGGS 65
           NK   P S+      C+ C K GH+AR C+ + R +   AN+ ++ LVAMI+ +N++   
Sbjct: 271 NKNQGPPSQDQFNRSCFVCGKSGHIARFCKFRKRESVPQANVTEEPLVAMITDINMVQYV 330

Query: 66  EGWWLDTSASRHVCYDLSLFRKYNEV-KDKNILLGDHHTTKVAGIGEVELKFTSGKMLVL 125
           EGWW D+ A+RHVCYD + F+ Y    ++K ++LGD   TKV G GEVELKFTSG++L L
Sbjct: 331 EGWWADSGANRHVCYDKNWFKLYTPFEEEKTVMLGDSSKTKVLGSGEVELKFTSGRVLTL 390

Query: 126 KEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINK 185
           K+ L+TP ++KNL+S +LLNKAGF QT+ SD + +TK  +FV KGYA DGMFKLN+E NK
Sbjct: 391 KDVLYTPSMRKNLMSSFLLNKAGFKQTMESDNYVITKKGLFVGKGYACDGMFKLNVENNK 450

Query: 186 IA-SSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSH 245
            + SS YML+S N WH RLCH+N R +  MS L LIP+LS  DFEKC  CSQAKITK  H
Sbjct: 451 ASTSSVYMLSSINFWHARLCHINSRYVGIMSSLGLIPRLS-KDFEKCETCSQAKITKRPH 510

Query: 246 KYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKV 305
           K V R TE LELIHSDLCEF+G LTR   RY+ITFIDD S YT IYLLKNKSDA+E F+ 
Sbjct: 511 KNVVRNTELLELIHSDLCEFEGILTRGGNRYIITFIDDFSKYTTIYLLKNKSDAFEKFQD 570

Query: 306 FVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRT 365
           F+ E+E QF ++IKR+RSDRG EY+S +FN F  S GIIHET APYSP  NG AERKNRT
Sbjct: 571 FLQEVENQFGRKIKRIRSDRGREYESSAFNSFAQSLGIIHETTAPYSPASNGVAERKNRT 630

Query: 366 LTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWG 425
           L EL   +L+ESGA   +WGE   T  +VLNR+P   S T+P+E+ K   PNL YLR W 
Sbjct: 631 LIELTNAMLIESGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLRAWD 690

Query: 426 CLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFK 485
           CLAYVR+ DPK  KL  RA  C F+ YA N+ AYRF+DLENK+I E  D  F E+KFPFK
Sbjct: 691 CLAYVRLTDPKMPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKFPFK 750

Query: 486 SRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED- 545
            +NSG   + +S  S S S      Q Q+   + EPRRSKRAR  KDFG D+ ++N+E+ 
Sbjct: 751 LKNSGGEENILSQPSSSTS----HFQNQE-NFEMEPRRSKRARVEKDFGPDYYVFNIEEN 810

Query: 546 PKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           PK+L EAL+S DA  W+EA+N+EM+SL  NRT
Sbjct: 811 PKNLKEALTSPDAIFWKEAVNDEMESLISNRT 836

BLAST of Cmc06g0169941 vs. ExPASy TrEMBL
Match: A0A7N2R9F3 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 612.8 bits (1579), Expect = 1.4e-171
Identity = 318/572 (55.59%), Positives = 399/572 (69.76%), Query Frame = 0

Query: 6   NKQNNPQSRSTVQIVCYNCNKLGHLARNCRNKSRPAA-HANLIKDELVAMISKVNVIGGS 65
           NK   P S+      C+ C K GH+AR C+ + R +    N+ ++ LVA+I+ +N++   
Sbjct: 271 NKNQGPPSQDQFNRSCFVCGKSGHIARFCKFRKRESVPQVNVTEEPLVAIITDINMVQYV 330

Query: 66  EGWWLDTSASRHVCYDLSLFRKYNEV-KDKNILLGDHHTTKVAGIGEVELKFTSGKMLVL 125
           EGWW D  A+RHVCYD + F+ Y    ++K I+LGD   TKV G GEVELKFTSG++L L
Sbjct: 331 EGWWADYGANRHVCYDKNWFKLYTPFEEEKTIMLGDSSKTKVLGSGEVELKFTSGRVLTL 390

Query: 126 KEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINK 185
           K+  +TP ++KNL+S +LLNKAGF QT+ SD + +TK  +FV KGYA DGMFKLN+E NK
Sbjct: 391 KDVFYTPSMRKNLMSSFLLNKAGFKQTMESDNYVITKKGLFVGKGYACDGMFKLNVENNK 450

Query: 186 IA-SSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSH 245
            + SS YML+S N WH RLCH+N R +  MS L LIP+LS  DFEKC  CSQAKITK  H
Sbjct: 451 ASTSSVYMLSSINFWHARLCHINSRYVGIMSSLGLIPRLS-KDFEKCETCSQAKITKKPH 510

Query: 246 KYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKV 305
           K V R TE LELIHSDLCEF+G LTR   RY+ITFIDD S YT IYLLKNKSDA+E F+ 
Sbjct: 511 KSVVRNTELLELIHSDLCEFEGILTRGGNRYIITFIDDFSKYTTIYLLKNKSDAFEKFQD 570

Query: 306 FVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRT 365
           F+ E+E QF ++IKR+RSDRG EY+S +FN F  S GIIHET APYSP  NG  ERKNRT
Sbjct: 571 FLKEVENQFGRKIKRIRSDRGREYESSAFNSFVQSLGIIHETTAPYSPASNGVVERKNRT 630

Query: 366 LTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWG 425
           L EL   +L+ESGA   +WGE   T  +VLNR+P   S T+P+E+ K   PNL YLR WG
Sbjct: 631 LIELTNAMLIESGAPLHFWGEAILTACHVLNRVPHKKSHTTPFEMWKGHKPNLGYLRVWG 690

Query: 426 CLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFK 485
           CLAYVR+ DPK  KL  RA  C F+ YA N+ AYRF+DLENK+I E  D  F E+KFPFK
Sbjct: 691 CLAYVRLTDPKIPKLGIRATTCAFLGYAINSAAYRFFDLENKIIFESGDAIFHEEKFPFK 750

Query: 486 SRNSGDLYSQISGGSISNSLPSIRIQTQDKEVDPEPRRSKRARTVKDFGEDFEMYNVED- 545
            +NSG   + +   S S S     +Q Q+   + E RRSKRAR  KDFG D+ ++N+E+ 
Sbjct: 751 LKNSGGEENILLQPSSSTS----HLQNQE-NFEMELRRSKRARVEKDFGPDYYVFNIEEN 810

Query: 546 PKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           P++L EAL+S DA  W+EA+N+EM+SL  NRT
Sbjct: 811 PQNLKEALTSSDAIFWKEAVNDEMESLISNRT 836

BLAST of Cmc06g0169941 vs. ExPASy TrEMBL
Match: Q0KIN7 (Polyprotein, putative OS=Solanum demissum OX=50514 GN=SDM1_27t00018 PE=4 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 2.7e-162
Identity = 296/558 (53.05%), Positives = 389/558 (69.71%), Query Frame = 0

Query: 21  CYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVIGGSEGWWLDTSASRHVCY 80
           C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++   +GWW D+ A+RHVCY
Sbjct: 263 CFVCGKSGHIARFCRFRKRGPNPQANVTEEPFVAVITDINMVENVDGWWADSGANRHVCY 322

Query: 81  DLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSGKMLVLKEFLHTPEIQKNLVS 140
           D   F+KY   ++ K I+LGD HTT+V G G+VEL FTSG++L LK+ L+TP ++K L+S
Sbjct: 323 DKDWFKKYTHFEEPKTIMLGDSHTTQVLGTGDVELCFTSGRVLTLKDVLYTPSMRKKLMS 382

Query: 141 RYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLEINKIASSAYMLTSFNVWH 200
            +LLNKAGF Q I S+ + + K  +FV KGYA DGMFKLN+E+NK ++S YML+S N WH
Sbjct: 383 SFLLNKAGFKQIIESNQYVIVKKGIFVGKGYACDGMFKLNVEMNKTSTSVYMLSSTNFWH 442

Query: 201 VRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKTSHKYVTRVTEPLELIHSD 260
            RLCH+N R +  MS L LIP +   +FEKC  CS+AKITK  H  V R T+ LEL+H+D
Sbjct: 443 ARLCHINDRYVGIMSSLGLIPMIK-KNFEKCEACSKAKITKRPHFQVERKTDLLELVHTD 502

Query: 261 LCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMFKVFVTEIEKQFNKRIKRL 320
           +CE  G LTR   RY ITFIDD S +T++YL+KNKSDA+E FK ++ E+E QF ++IKR+
Sbjct: 503 ICELGGILTRGGNRYFITFIDDFSKFTYVYLMKNKSDAFENFKTYLHEVENQFGRKIKRI 562

Query: 321 RSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKNRTLTELVIVILLESGAAP 380
           RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKNRTL EL   +L+ES A  
Sbjct: 563 RSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGVAERKNRTLVELTNAMLIESHAPL 622

Query: 381 SWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRTWGCLAYVRIPDPKRRKLA 440
           ++WGE   T  YVLNR+P   SK +P+E+ K   P+L YLR WGCLA+VR+ DPK  KL 
Sbjct: 623 NFWGEAILTACYVLNRVPHKKSKLTPFELWKGYKPSLGYLRVWGCLAFVRLMDPKITKLG 682

Query: 441 SRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFPFKSRNSGDLYSQISGGSI 500
            +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFPF S+NSG    +I    +
Sbjct: 683 KKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKFPFDSKNSGG--QRIEQNIL 742

Query: 501 SNSLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYNVEDPK-DLTEALSSVDAN 560
             +LPS    T ++KEV D E RRSKRAR  KDFG DF ++NV D +  L EALSS D+ 
Sbjct: 743 --TLPSSSTSTLKNKEVNDFELRRSKRARVEKDFGPDFYVFNVGDDRLTLKEALSSHDSI 802

Query: 561 LWQEAINNEMDSLEPNRT 574
            W+EA+N+EM+SL  N+T
Sbjct: 803 FWKEAVNDEMESLISNKT 815

BLAST of Cmc06g0169941 vs. ExPASy TrEMBL
Match: Q60D13 (Putative gag and pol polyprotein, identical OS=Solanum demissum OX=50514 GN=SDM1_56t00010 PE=4 SV=1)

HSP 1 Score: 493.4 bits (1269), Expect = 1.3e-135
Identity = 271/576 (47.05%), Positives = 358/576 (62.15%), Query Frame = 0

Query: 5   SNKQNNPQSRSTVQI--VCYNCNKLGHLARNCRNKSR-PAAHANLIKDELVAMISKVNVI 64
           +N +NN      VQ    C+ C K GH+AR CR + R P   AN+ ++  VA+I+ +N++
Sbjct: 92  NNGENNQAQNQQVQDKGPCFVCGKSGHIARFCRFRKRGPNPQANVTEEPFVAVITDINMV 151

Query: 65  GGSEGWWLDTSASRHVCYDLSLFRKYNEVKD-KNILLGDHHTTKVAGIGEVELKFTSGKM 124
              +GWW+D+ A+RHVCYD   F+KY   ++ K I+LGD HTT+V G G+VEL F+SG+ 
Sbjct: 152 ENVDGWWVDSGANRHVCYDKDWFKKYTHFEEPKTIMLGDAHTTQVLGKGDVELCFSSGRE 211

Query: 125 LVLKEFLHTPEIQKNLVSRYLLNKAGFTQTIGSDLFTLTKNNVFVEKGYATDGMFKLNLE 184
           L LK+ L+TP ++KNL+S +L NK GF Q I SD + + K  +FV KG            
Sbjct: 212 LTLKDVLYTPSMRKNLMSSFLFNKVGFKQIIESDQYVIVKKGIFVGKG------------ 271

Query: 185 INKIASSAYMLTSFNVWHVRLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKT 244
                                             L LIP +   +FEKC  CS+AKITK 
Sbjct: 272 ----------------------------------LRLIPMIK-KNFEKCEACSKAKITKR 331

Query: 245 SHKYVTRVTEPLELIHSDLCEFDGALTRNSKRYVITFIDDCSDYTFIYLLKNKSDAYEMF 304
            H  V R T  LEL+H+D+CE  G LTR   R  ITFIDD S +T++YL+KNKSDA+E F
Sbjct: 332 PHFQVERKTNLLELVHTDICELGGILTRGGNRNFITFIDDFSKFTYVYLMKNKSDAFENF 391

Query: 305 KVFVTEIEKQFNKRIKRLRSDRGIEYDSVSFNEFYSSKGIIHETIAPYSPEMNGKAERKN 364
           K ++ E+E QF ++IKR+RSDRG EY+S  FN F  S GIIHET  PYSP  NG AERKN
Sbjct: 392 KTYLHEVENQFGRKIKRIRSDRGREYESNEFNSFVRSLGIIHETTPPYSPSSNGAAERKN 451

Query: 365 RTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTSPYEVLKHKAPNLSYLRT 424
           RTL EL   +L+ES A  ++WGE   T  YVLNR+P   SK + +E+ K   P+L YLR 
Sbjct: 452 RTLVELTNAMLIESHAPLNFWGETILTACYVLNRVPHKKSKLTHFELWKGYKPSLGYLRV 511

Query: 425 WGCLAYVRIPDPKRRKLASRAYECVFIEYAKNNKAYRFYDLENKVIIEWNDVDFFEDKFP 484
           WGCLA+VR+ DPK  KL  +   C F+ YA N+ AYRF++LE+ ++IE  D  F E+KFP
Sbjct: 512 WGCLAFVRLMDPKITKLGKKVTTCAFLGYASNSTAYRFFNLEDNIVIESGDAIFHENKFP 571

Query: 485 FKSRNSGDLYSQISGGSISNSLPSIRIQT-QDKEV-DPEPRRSKRARTVKDFGEDFEMYN 544
           F S+NSG    +I    +  SLPS    T ++KEV D E RRSKRAR  KDFG +F ++N
Sbjct: 572 FDSKNSGG--QRIEQNIL--SLPSSSTSTLKNKEVNDFELRRSKRARIEKDFGPNFYVFN 616

Query: 545 V-EDPKDLTEALSSVDANLWQEAINNEMDSLEPNRT 574
           V +DP  L EALSS D+  W+EA+N+EM+SL  N+T
Sbjct: 632 VGDDPLTLKEALSSHDSIFWKEAVNDEMESLISNKT 616

BLAST of Cmc06g0169941 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 47.8 bits (112), Expect = 3.5e-05
Identity = 26/85 (30.59%), Positives = 42/85 (49.41%), Query Frame = 0

Query: 360 NRTLTELVIVILLESGAAPSWWGEIFKTINYVLNRIPKSNSKTS-PYEVLKHKAPNLSYL 419
           NRT+ E V  +L E G   ++  +   T  +++N+ P +      P EV     P  SYL
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 420 RTWGCLAYVRIPDPKRRKLASRAYE 444
           R +GC+AY+   + K +  A +  E
Sbjct: 62  RRFGCVAYIHCDEGKLKPRAKKGEE 86
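
Each HSP header above reports identity and positives as a residue count over the alignment length, followed by a percentage. The following is a minimal Python sketch for reading those two fields from a header line and recomputing the percentage from the counts. The function name `parse_hsp_stats` and the returned tuple layout are illustrative assumptions, not part of any BLAST tool, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches "Identity = 323/572 (56.47%)" and the positives field in either spelling.
HSP_STAT = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line):
    """Return {field: (matches, length, recomputed_pct, reported_pct)}.

    Hypothetical helper: the tuple layout is an assumption made for
    illustration only.
    """
    stats = {}
    for name, num, den, pct in HSP_STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        matches, length = int(num), int(den)
        # Recompute the percentage from the raw counts, rounded to 2 decimals.
        recomputed = round(100 * matches / length, 2)
        stats[key] = (matches, length, recomputed, float(pct))
    return stats


if __name__ == "__main__":
    header = "Identity = 323/572 (56.47%), Postives = 403/572 (70.45%), Query Frame = 0"
    for field, values in parse_hsp_stats(header).items():
        print(field, values)
```

Comparing the recomputed and reported percentages is a cheap sanity check when these headers are copied out of a web page by hand.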

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
KAA0034938.1    4.7e-302  89.88         putative Polyprotein [Cucumis melo var. makuwa] >TYK21293.1 putative Polyprotein...
ABI34306.1      5.5e-162  53.05         Polyprotein, putative [Solanum demissum]
KAG5527251.1    3.1e-136  45.23         hypothetical protein RHGRI_028223 [Rhododendron griersonianum]
AAU90333.1      2.6e-135  47.05         Putative gag and pol polyprotein, identical [Solanum demissum]
XP_021732277.1  5.4e-133  47.40         uncharacterized protein LOC110699091 [Chenopodium quinoa]

Match Name      E-value   Identity (%)  Description
P10978          4.4e-69   28.86         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146          7.3e-48   26.95         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q94HW2          1.3e-44   29.44         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94          2.0e-42   27.00         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P47024          5.3e-14   20.68         Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name      E-value   Identity (%)  Description
A0A5D3DCJ1      2.3e-302  89.88         Putative Polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119...
A0A7N2L531      4.7e-175  56.47         Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
A0A7N2R9F3      1.4e-171  55.59         Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1
Q0KIN7          2.7e-162  53.05         Polyprotein, putative OS=Solanum demissum OX=50514 GN=SDM1_27t00018 PE=4 SV=1
Q60D13          1.3e-135  47.05         Putative gag and pol polyprotein, identical OS=Solanum demissum OX=50514 GN=SDM1...

Match Name      E-value   Identity (%)  Description
ATMG00710.1     3.5e-05   30.59         Polynucleotidyl transferase, ribonuclease H-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term   IPR Description                     Source       Source Term    Source Description                  Alignment
IPR001878  Zinc finger, CCHC-type              SMART        SM00343        c2hcfinal6                          coord: 20..36; e-value: 2.9E-5; score: 33.5
IPR001878  Zinc finger, CCHC-type              PFAM         PF00098        zf-CCHC                             coord: 20..35; e-value: 2.2E-6; score: 27.5
IPR001878  Zinc finger, CCHC-type              PROSITE      PS50158        ZF_CCHC                             coord: 21..36; score: 10.279065
IPR001584  Integrase, catalytic core           PFAM         PF00665        rve                                 coord: 249..350; e-value: 5.4E-16; score: 58.8
IPR001584  Integrase, catalytic core           PROSITE      PS50994        INTEGRASE                           coord: 247..412; score: 22.866495
IPR025724  GAG-pre-integrase domain            PFAM         PF13976        gag_pre-integrs                     coord: 185..236; e-value: 1.4E-10; score: 40.9
IPR036397  Ribonuclease H superfamily          GENE3D       3.30.420.10    -                                   coord: 242..421; e-value: 4.3E-37; score: 129.3
None       No IPR available                    GENE3D       4.10.60.10     -                                   coord: 1..79; e-value: 1.3E-7; score: 33.2
None       No IPR available                    MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 502..523
None       No IPR available                    PANTHER      PTHR47592      PBF68 PROTEIN                       coord: 31..166
None       No IPR available                    PANTHER      PTHR47592:SF5  OS08G0421300 PROTEIN                coord: 31..166
IPR012337  Ribonuclease H-like superfamily     SUPERFAMILY  53098          Ribonuclease H-like                 coord: 247..409
IPR036875  Zinc finger, CCHC-type superfamily  SUPERFAMILY  57756          Retrovirus zinc finger-like domains coord: 4..41

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Cmc06g0169941.1  Cmc06g0169941.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding