Cmc08g0226621 (gene) Melon (Charmono) v1.1

Overview
NameCmc08g0226621
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationCMiso1.1chr08: 18911000 .. 18913426 (-)
RNA-Seq ExpressionCmc08g0226621
SyntenyCmc08g0226621
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATAAAGAACACTGAAGATGCTAAGGAATTTATGAAATATGTGGAAAAATGTTCTCAGTCAGAGTCGGCTAACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGATGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAACGTTTATCCTTAATTCCTTACCTTCAAAGTATGGTCCATTTCACATGAACTATAACACTCTAAAAGATAAATGGAATGTGCATGAATTACAAAGTATGCTCATTCAAGAGGAAGCGAGACTTAAGAAACCAATAATTCACTCTATCAATCCCATGGGTCATAAAGGAGCTGGAAAGAAACCTAGAAAAAAGAATGGCAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTTCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTTGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTTCAATACGATGCAGGGATTCCTTACGACTCGAACCACAAACCTAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTAACTTTAGATACTAGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACATGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAAAACATTTTTATTGGTTCTGGTATTCTTTGTGATGGCTTATATAAATTAAAGTTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGCTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTAATAAAGAATGAAATTCTTCCAGATTTGAATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGGAAAACAAACTAAACACTCAGTTAATAAAGAAGCCACAAGAAGCTCACAGCTCCTTGAAATTATACACACTGATATTTGTGGGTCTTTTGATGTTCCATCTTTTGGTGAAGAAAAGTATTTCATCACCTTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGAAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACTTGTGTCCTTGTGGATGTATGCATTAAGAACCGCTCAATATTTATTAAACAGGGTTCCTAATAAGTCAGTTCCAAAGACACCTTTTGAACTATGGACAAGAAGGAAACCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGCGGAAGTAAGAATTTGTAATCCACATGAAAAGAAACTGGATTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCGGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACCACAGTACGAGAATAGTTGAAACTGGAAATGTAAGGTTCATTGAGAATGACATAATTAGTGGGAGTTTGGAACCGCGAAAAGTGAAAATTCAAGAAGTTAGGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGTTCAAACACCACATAATGATATTGTAACAAATGAACTTGTAACTAAGGGATCACAAGAAATAGAATTAAGAAGATTTGTAAGATCAAGAAGAGCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGATTTAAGCATTGATAATGATCCAGTTTCGTTTTCACAAGCCATTAAAGTAGATAATTCTACCAAATGGTTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGA

mRNA sequence

ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATAAAGAACACTGAAGATGCTAAGGAATTTATGAAATATGTGGAAAAATGTTCTCAGTCAGAGTCGGCTAACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGATGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAACGTTTATCCTTAATTCCTTACCTTCAAAGTATGGTCCATTTCACATGAACTATAACACTCTAAAAGATAAATGGAATGTGCATGAATTACAAAGTATGCTCATTCAAGAGGAAGCGAGACTTAAGAAACCAATAATTCACTCTATCAATCCCATGGGTCATAAAGGAGCTGGAAAGAAACCTAGAAAAAAGAATGGCAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTTCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTTGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTTCAATACGATGCAGGGATTCCTTACGACTCGAACCACAAACCTAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTAACTTTAGATACTAGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACATGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAAAACATTTTTATTGGTTCTGGTATTCTTTGTGATGGCTTATATAAATTAAAGTTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGCTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTAATAAAGAATGAAATTCTTCCAGATTTGAATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGGAAAACAAACTAAACACTCAGTTAATAAAGAAGCCACAAGAAGCTCACAGCTCCTTGAAATTATACACACTGATATTTGTGGGTCTTTTGATGTTCCATCTTTTGGTGAAGAAAAGTATTTCATCACCTTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGAAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACTTGTGTCCTTGTGGATGTATGCATTAAGAACCGCTCAATATTTATTAAACAGGGTTCCTAATAAGTCAGTTCCAAAGACACCTTTTGAACTATGGACAAGAAGGAAACCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGCGGAAGTAAGAATTTGTAATCCACATGAAAAGAAACTGGATTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCGGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACCACAGTACGAGAATAGTTGAAACTGGAAATGTAAGGTTCATTGAGAATGACATAATTAGTGGGAGTTTGGAACCGCGAAAAGTGAAAATTCAAGAAGTTAGGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGTTCAAACACCACATAATGATATTGTAACAAATGAACTTGTAACTAAGGGATCACAAGAAATAGAATTAAGAAGATTTGTAAGATCAAGAAGAGCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGATTTAAGCATTGATAATGATCCAGTTTCGTTTTCACAAGCCATTAAAGTAGATAATTCTACCAAATGGTTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGA

Coding sequence (CDS)

ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATAAAGAACACTGAAGATGCTAAGGAATTTATGAAATATGTGGAAAAATGTTCTCAGTCAGAGTCGGCTAACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGATGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATGAGAATTTTTTGGTAACGTTTATCCTTAATTCCTTACCTTCAAAGTATGGTCCATTTCACATGAACTATAACACTCTAAAAGATAAATGGAATGTGCATGAATTACAAAGTATGCTCATTCAAGAGGAAGCGAGACTTAAGAAACCAATAATTCACTCTATCAATCCCATGGGTCATAAAGGAGCTGGAAAGAAACCTAGAAAAAAGAATGGCAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTTCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTTGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTTCAATACGATGCAGGGATTCCTTACGACTCGAACCACAAACCTAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTAACTTTAGATACTAGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACATGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAAAACATTTTTATTGGTTCTGGTATTCTTTGTGATGGCTTATATAAATTAAAGTTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGCTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTAATAAAGAATGAAATTCTTCCAGATTTGAATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGGAAAACAAACTAAACACTCAGTTAATAAAGAAGCCACAAGAAGCTCACAGCTCCTTGAAATTATACACACTGATATTTGTGGGTCTTTTGATGTTCCATCTTTTGGTGAAGAAAAGTATTTCATCACCTTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGAAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACTTGTGTCCTTGTGGATGTATGCATTAAGAACCGCTCAATATTTATTAAACAGGGTTCCTAATAAGTCAGTTCCAAAGACACCTTTTGAACTATGGACAAGAAGGAAACCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGCGGAAGTAAGAATTTGTAATCCACATGAAAAGAAACTGGATTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCGGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACCACAGTACGAGAATAGTTGAAACTGGAAATGTAAGGTTCATTGAGAATGACATAATTAGTGGGAGTTTGGAACCGCGAAAAGTGAAAATTCAAGAAGTTAGGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGTTCAAACACCACATAATGATATTGTAACAAATGAACTTGTAACTAAGGGATCACAAGAAATAGAATTAAGAAGATTTGTAAGATCAAGAAGAGCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGATTTAAGCATTGATAATGATCCAGTTTCGTTTTCACAAGCCATTAAAGTAGATAATTCTACCAAATGGTTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGA

Protein sequence

MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSMLIQEEARLKKPIIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTMQGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEATRSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINEVERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRHLHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIENDIISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHNDIVTNELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSIDNDPVSFSQAIKVDNSTKWLDAMKEELKSMNDNEVWDLVELPKESKRVGCK
Homology
BLAST of Cmc08g0226621 vs. NCBI nr
Match: RZC12927.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja] >RZC12928.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform B [Glycine soja])

HSP 1 Score: 1153.7 bits (2983), Expect = 0.0e+00
Identity = 574/814 (70.52%), Postives = 670/814 (82.31%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++A+KSLAGTLMSTLT +KFDGSRT+
Sbjct: 149 MFMRMTVADSIKTALPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 208

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
           HEH++EM N+AARLKT+GM VNENFLV FILNSLPS+YGPF M+YNT+KDKWNVHEL SM
Sbjct: 209 HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYGPFQMSYNTMKDKWNVHELHSM 268

Query: 121 LIQEEARLKKPIIHSINPMGHK---GAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKD 180
           L+QEE RLK    HSI+ + H+   GAGK   KK+ KG  G LK+K     I KK    +
Sbjct: 269 LVQEETRLKNQGSHSIHYVSHRGNQGAGKNFVKKHDKGK-GPLKIKDGPVQIQKKASKNN 328

Query: 181 KCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTM 240
            C FC K GH+QKDC KRK+WFE KG+ NALV FESNLTEVP+NTWWIDSGCT HV NTM
Sbjct: 329 NCHFCGKSGHFQKDCPKRKSWFEKKGELNALVYFESNLTEVPHNTWWIDSGCTTHVSNTM 388

Query: 241 QGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISL 300
           QGFLT +T + NE+F+FMGNRVK PVEAVGTYRL LDT HHLDL +T YVPS+SRNL+SL
Sbjct: 389 QGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSL 448

Query: 301 SKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRG 360
           SK D +GY F FGN CFSLFK N  IG+G+LCDGLYKLK D ++ E++LTLHHNVGTKR 
Sbjct: 449 SKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRS 508

Query: 361 QTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEAT 420
             NE SA+LWHKRLGHIS+ERI+RLIKNEILPDL+FTDL ICVDCIKGKQTKH+  K AT
Sbjct: 509 LVNERSAFLWHKRLGHISRERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGAT 568

Query: 421 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 480
           RS+QLLEI+HTDICG FDV SFG E+YFITFIDD+SRYGY+YLLHEKSQ ++AL++++NE
Sbjct: 569 RSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNE 628

Query: 481 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 540
           VERQLDRKVKI+RSDRGGEYYG+YDE GQ PGPFAK L+  GICAQYTMPGTPQQNGV+E
Sbjct: 629 VERQLDRKVKIIRSDRGGEYYGRYDETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 688

Query: 541 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRH 600
           RRNRTLM+MVRSMLINS+L VSLWMYAL+TA YLLN VP+K+VPKTPFELWT R PS+RH
Sbjct: 689 RRNRTLMDMVRSMLINSTLPVSLWMYALKTAMYLLNMVPSKAVPKTPFELWTNRTPSMRH 748

Query: 601 LHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIE 660
           LHVWGCQAE+RI NP E+KLD+RT SG+FIGYPEKSKGY FYCPNHSTRIVETGN RFIE
Sbjct: 749 LHVWGCQAEIRIYNPQERKLDARTISGYFIGYPEKSKGYMFYCPNHSTRIVETGNARFIE 808

Query: 661 NDIISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHND--IV 720
           N  ISGS  PR+V+I+EVRV++P +  SS  V+   V + N+ +E Q      HND  ++
Sbjct: 809 NGEISGSTVPREVEIKEVRVQVPLAFASSSKVITTSVTATNSNEEVQ------HNDEPMI 868

Query: 721 TNELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSI-DNDPVSFSQAIKVDNST 780
            NE + +  QE+ L++  R RR AIS+DY+VYLHE E +LSI DNDPVSFSQAI  DNS 
Sbjct: 869 HNEPIMEEPQEVALKKSQRERRPAISNDYVVYLHEIETNLSINDNDPVSFSQAISCDNSE 928

Query: 781 KWLDAMKEELKSMNDNEVWDLVELPKESKRVGCK 809
           KWL+AMKEE+ SM  N VWDLVELPK  KRVGCK
Sbjct: 929 KWLNAMKEEIDSMEHNGVWDLVELPKGCKRVGCK 952

BLAST of Cmc08g0226621 vs. NCBI nr
Match: RZC25410.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 1153.7 bits (2983), Expect = 0.0e+00
Identity = 574/814 (70.52%), Postives = 671/814 (82.43%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++A+KSLAGTLMSTLT +KFDGSRT+
Sbjct: 138 MFMRMTVADSIKTALPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 197

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
           HEH++EM N+AARLKT+GM VNENFLV FILNSLPS+YGPF M+YNT+KDKWNVHEL SM
Sbjct: 198 HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYGPFQMSYNTMKDKWNVHELHSM 257

Query: 121 LIQEEARLKKPIIHSINPMGHK---GAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKD 180
           L+QEE RLK    HSI+ + H+   GAGKK  KK+ KG  G LK+K     I KK    +
Sbjct: 258 LVQEETRLKNQGSHSIHYVSHRGNQGAGKKFVKKHDKGK-GPLKIKDGPVQIQKKASKNN 317

Query: 181 KCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTM 240
            C FC K GH+QKDC KRK+WFE KG+ NALVCFESNLTEVP+NTWWIDSGCT HV NTM
Sbjct: 318 NCHFCGKSGHFQKDCPKRKSWFEKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTM 377

Query: 241 QGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISL 300
           QGFLT +T + NE+F+FMGNRVK PVEAVGTYRL LDT HHLDL +T YVPS+SRNL+SL
Sbjct: 378 QGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSL 437

Query: 301 SKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRG 360
           SK D +GY F FGN CFSLFK N  IG+G+LCDGLYKLK D ++ E++LTLHHNVGTKR 
Sbjct: 438 SKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRS 497

Query: 361 QTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEAT 420
             NE SA+LWHKRLGHIS ERI+RLIKNEILPDL+FTDL ICVDCIKGKQTKH+  K AT
Sbjct: 498 LVNERSAFLWHKRLGHISGERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGAT 557

Query: 421 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 480
           RS+QLLEI+HTDICG FDV SFG E+YFITFIDD+SRYGY+YLLHEKSQ ++AL++++NE
Sbjct: 558 RSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNE 617

Query: 481 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 540
           VERQLDRKVKI+RSDRGGEYY +YDE GQ P PFAK L+  GICAQYTMPGTPQQNGV+E
Sbjct: 618 VERQLDRKVKIIRSDRGGEYYRRYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSE 677

Query: 541 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRH 600
           RRN+TLM+MVRSMLINS+L VSLWMYAL+TA YLLNRVP+K+VPKTPFELWT R PS+RH
Sbjct: 678 RRNKTLMDMVRSMLINSTLPVSLWMYALKTAMYLLNRVPSKAVPKTPFELWTNRTPSMRH 737

Query: 601 LHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIE 660
           LHVWGCQAE+RI NP E+KLD+RT SG+FIGYPEKSKGY FYCPNHSTRIVETGN RFIE
Sbjct: 738 LHVWGCQAEIRIYNPQERKLDARTISGYFIGYPEKSKGYMFYCPNHSTRIVETGNARFIE 797

Query: 661 NDIISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHND--IV 720
           N  ISGS  PR+V+I+EVRV++P +  SS  V+   V + N+ +E Q      HND  ++
Sbjct: 798 NGEISGSTVPREVEIKEVRVQVPLAFASSSKVITTSVTATNSNEEVQ------HNDEPMI 857

Query: 721 TNELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSI-DNDPVSFSQAIKVDNST 780
            NE + +  QE+ LR+  R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQAI  DNS 
Sbjct: 858 HNEPIMEEPQEVALRKSQRERRPAISNDYVVYLHETETNLSINDNDPVSFSQAISCDNSE 917

Query: 781 KWLDAMKEELKSMNDNEVWDLVELPKESKRVGCK 809
           KWL+AMKEE+ SM  N+VWDLVELPK  KRVG K
Sbjct: 918 KWLNAMKEEIDSMEHNDVWDLVELPKGCKRVGYK 941

BLAST of Cmc08g0226621 vs. NCBI nr
Match: KYP36562.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 1147.9 bits (2968), Expect = 0.0e+00
Identity = 567/813 (69.74%), Postives = 675/813 (83.03%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMTVA++IK+T+  TE AKEFM +V +  +S++A+KSLAGTLMSTLT +KFDGSRT+
Sbjct: 1   MFMRMTVADSIKTTLPKTESAKEFMGFVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 60

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
           HEH++EM N+AARLK++GM VNENFLV FILNSLP++YGPF M+YNT+KDKWNVHEL SM
Sbjct: 61  HEHVIEMTNIAARLKSLGMAVNENFLVQFILNSLPTEYGPFQMSYNTMKDKWNVHELHSM 120

Query: 121 LIQEEARLKKP---IIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKD 180
           L+QEE RLK      IH ++  G++GAGKK  KK+ KG    LK+ ++S PI KK    +
Sbjct: 121 LVQEETRLKNQGSHSIHYVSHQGNQGAGKKFVKKHDKGKK-PLKINEASVPIQKKASKGN 180

Query: 181 KCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTM 240
            C FC K GH+QKDC KRKAWFE KGK NA VCFESNLTEVP+NTWWIDSGCT HV NTM
Sbjct: 181 NCHFCGKSGHFQKDCPKRKAWFEKKGKLNAYVCFESNLTEVPHNTWWIDSGCTTHVSNTM 240

Query: 241 QGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISL 300
           QGF T +T + NE+F+FMGNRVKVPVEAVGTYRL L+T HHLDL +T YVPS+SRNL+SL
Sbjct: 241 QGFTTIQTISPNEKFVFMGNRVKVPVEAVGTYRLILNTGHHLDLLETLYVPSLSRNLVSL 300

Query: 301 SKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRG 360
           SK D  GY F FGN CFSLFK+N  IG+GILCDGLYKL  D ++ E+LLTLHHN+GTKR 
Sbjct: 301 SKLDAIGYSFTFGNGCFSLFKRNHLIGTGILCDGLYKLNLDGLYDETLLTLHHNIGTKRS 360

Query: 361 QTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEAT 420
             NE SA+LWH+RLGHIS+ER++RLIKNEILP+L+FTDL ICVDCIKGKQTKH+  K AT
Sbjct: 361 LVNERSAFLWHRRLGHISRERMERLIKNEILPNLDFTDLNICVDCIKGKQTKHT-KKGAT 420

Query: 421 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 480
           RS+QLLEI+HTDICG FDV SFG+EKYFITFIDD+SRYGY+YLLHEKSQ +DAL++++NE
Sbjct: 421 RSTQLLEIVHTDICGPFDVNSFGKEKYFITFIDDYSRYGYVYLLHEKSQAVDALEIYLNE 480

Query: 481 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 540
           VERQLD+KVK++RSDRGGEYYG+Y+E GQ PGPFAK L+  GICAQYTMPGTPQQNGV+E
Sbjct: 481 VERQLDKKVKVVRSDRGGEYYGRYNETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 540

Query: 541 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRH 600
           RRNRTLM+MVRSML NS+L + LWMYAL+TA YLLNRVP+K+V KTPFELWT R PSLRH
Sbjct: 541 RRNRTLMDMVRSMLSNSTLPIYLWMYALKTAMYLLNRVPSKAVSKTPFELWTGRTPSLRH 600

Query: 601 LHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIE 660
           LHVWGCQAE+RI NP EKKLD+RT SG+FIGYPEKSKGY FYCPNH+ RIVETGN RFIE
Sbjct: 601 LHVWGCQAEIRIYNPQEKKLDARTISGYFIGYPEKSKGYMFYCPNHNMRIVETGNARFIE 660

Query: 661 NDIISGSLEPRKVKIQEVRVEIPSSITS-SQVVVPVVVDSVNNPQEQQINVQTPHNDIVT 720
           N  +SGS  P+KV+++EVR+++P +  S ++V VP+ V+S NN +EQ  N       ++ 
Sbjct: 661 NGEVSGSTIPQKVEVKEVRMQVPLTYASGNKVSVPLTVESNNNEEEQHNN-----EPMIH 720

Query: 721 NELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSI-DNDPVSFSQAIKVDNSTK 780
           NE + +  QEI LRR  R +R AIS+DY+VYLHE E D SI +NDPVSFSQA+  DNS K
Sbjct: 721 NEPIVEQPQEIALRRSQREKRPAISNDYMVYLHELENDSSINENDPVSFSQAVSCDNSEK 780

Query: 781 WLDAMKEELKSMNDNEVWDLVELPKESKRVGCK 809
           WL+AMKEELKSM  N+VWDLVELP+  KRVGCK
Sbjct: 781 WLNAMKEELKSMEQNDVWDLVELPEGCKRVGCK 804

BLAST of Cmc08g0226621 vs. NCBI nr
Match: RVX08602.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 988.4 bits (2554), Expect = 3.6e-284
Identity = 496/808 (61.39%), Postives = 613/808 (75.87%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMT+ANNIK+++  TE A EF+K VE+  + + A+KSLAGTLM+ LT +K+DG + I
Sbjct: 61  MFMRMTIANNIKTSLPQTEFASEFLKSVEE--RFKRADKSLAGTLMAELTTMKYDGQKGI 120

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
            +HIL M   AA+LK +GM ++E+FLV F+LNSLPS++ PF ++YNT  D+WN++EL S 
Sbjct: 121 QQHILNMTEKAAKLKALGMGMDESFLVQFVLNSLPSQFAPFKIHYNTNSDQWNLNELTSK 180

Query: 121 LIQEEARLKKPIIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCR 180
            IQEE RL++   +    + H    KK + K GK    +       S  H  G+    C 
Sbjct: 181 CIQEEVRLRQEGHNLAFAVTHGVTKKKGKFKKGKNFPPKKSGPGEGSQSH-DGKFTVSCY 240

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTMQGF 240
           FC K GH +KDC+KRKAWFE +G + + VC+ESNL EVP NTWWIDSG T HV N MQGF
Sbjct: 241 FCGKKGHVKKDCIKRKAWFEKRGINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGF 300

Query: 241 LTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKH 300
           LTTR    +E+F++MGNR+KV V AVGTYRL L+T H +DL +TFYVPSISRNL+SLSK 
Sbjct: 301 LTTRKPKESEKFLYMGNRLKVEVVAVGTYRLLLETGHRMDLLNTFYVPSISRNLVSLSKL 360

Query: 301 DTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTN 360
           D +GY   F +   SL    + +GSGILCDGLYK+  ++ FA++L+TLH NVG+KRG  N
Sbjct: 361 DATGYSVLFNSGQLSLMLNYVTVGSGILCDGLYKISLNHEFAQALITLHSNVGSKRGLIN 420

Query: 361 ESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEATRSS 420
           E+S+ LWH+RLGHIS+ERI+RL+K  IL +L+FTD  +CVDCIKGKQTKH+  K ATRS+
Sbjct: 421 ENSSILWHRRLGHISRERIERLVKEGILQNLDFTDFHVCVDCIKGKQTKHT-KKGATRSN 480

Query: 421 QLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINEVER 480
           +LLEIIHTDICG   VP F  EKYFITFIDD SRYGY+YL+HEKSQ ID  ++FI EVER
Sbjct: 481 ELLEIIHTDICGPLSVPCFTGEKYFITFIDDLSRYGYVYLMHEKSQAIDIFEMFITEVER 540

Query: 481 QLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRN 540
           QLD+K+KI+RSDRGGEYYG+YDE+GQ PGPFAKFLE HGI AQYTMPGTPQQNGVAERRN
Sbjct: 541 QLDKKIKIVRSDRGGEYYGRYDESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRN 600

Query: 541 RTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRHLHV 600
           RTLM MVRSM+  SS+ +SLW  AL+TA Y+LNRVP+K+VPKTPFELWT RKPSLRH+H+
Sbjct: 601 RTLMEMVRSMMSYSSVPISLWGEALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHI 660

Query: 601 WGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIENDI 660
           WGC AE RI NPHEKKLDSRT SG+FIGYP+KSKGYRFYCPNHS RIVETGN RF+EN  
Sbjct: 661 WGCPAEARIYNPHEKKLDSRTVSGYFIGYPDKSKGYRFYCPNHSVRIVETGNARFLENGE 720

Query: 661 ISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHNDIVTNELV 720
           ISGS EPRKV I+E+RV+IP      +++VP  V  V + ++   +   P  +I   E V
Sbjct: 721 ISGSNEPRKVDIEEIRVDIPPPFLPQEIIVPQPVQQVEDNEQNNRDGSLPLENIAI-ENV 780

Query: 721 TKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSIDNDPVSFSQAIKVDNSTKWLDAM 780
            +  Q   LRR  R RR AI+DDY+VYL ES++D+ I  DPVSFSQA++ D+S+KW++AM
Sbjct: 781 VEPPQPAPLRRSQRERRPAITDDYVVYLQESDYDIGIRKDPVSFSQAMESDDSSKWMEAM 840

Query: 781 KEELKSMNDNEVWDLVELPKESKRVGCK 809
            EELKSM  N VWDL+ELP   K VGCK
Sbjct: 841 NEELKSMAHNGVWDLIELPNNCKPVGCK 863

BLAST of Cmc08g0226621 vs. NCBI nr
Match: RVW55286.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 987.6 bits (2552), Expect = 6.2e-284
Identity = 495/808 (61.26%), Postives = 613/808 (75.87%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMT+ANNIK+++  TE A EF+K VE+  + + A+KSLAGTLM+ LT +K+DG + I
Sbjct: 61  MFMRMTIANNIKTSLPQTEFASEFLKSVEE--RFKRADKSLAGTLMAELTTMKYDGQKGI 120

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
            +HIL M   AA+LK +GM ++E+FLV F+LNSLPS++ PF ++YNT  D+WN++EL S 
Sbjct: 121 QQHILNMTEKAAKLKALGMGMDESFLVQFVLNSLPSQFAPFKIHYNTNSDQWNLNELTSK 180

Query: 121 LIQEEARLKKPIIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCR 180
            IQEE RL++   +    + H    KK + K GK    +       S  H  G+    C 
Sbjct: 181 CIQEEVRLRQEGHNHAFAVTHGVTKKKGKFKKGKNFPPKKSGPGEGSQSH-DGKFTVSCY 240

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTMQGF 240
           FC K GH +KDC+KRKAWFE +G + + VC+ESNL EVP NTWWIDSG T HV N MQGF
Sbjct: 241 FCGKKGHVKKDCIKRKAWFEKRGINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGF 300

Query: 241 LTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKH 300
           LTTR    +E+F++MGNR+KV V AVGTYRL L+T H +DL +TFYVPSISRNL+SLSK 
Sbjct: 301 LTTRKPKESEKFLYMGNRLKVEVVAVGTYRLLLETGHRMDLLNTFYVPSISRNLVSLSKL 360

Query: 301 DTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTN 360
           D +GY   F +   SL   ++ +GSGILCDGLYK+  ++ FA++L+TLH NVG+KRG  N
Sbjct: 361 DATGYSVLFSSGQLSLMLNSVTVGSGILCDGLYKISLNHEFAQALITLHSNVGSKRGLIN 420

Query: 361 ESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEATRSS 420
           E+S+ LWH+RLGHIS+ERI+RL+K  IL +L+FTD  +CVDCIKGKQTKH+  K ATRS+
Sbjct: 421 ENSSILWHRRLGHISRERIERLVKEGILQNLDFTDFHVCVDCIKGKQTKHT-KKGATRSN 480

Query: 421 QLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINEVER 480
           +LLEIIH DICG   VP F  EKYFITFIDD SRYGY+YL+HEKSQ ID  ++FI EVER
Sbjct: 481 ELLEIIHIDICGPLSVPCFTGEKYFITFIDDLSRYGYVYLMHEKSQAIDIFEMFITEVER 540

Query: 481 QLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRN 540
           QLD+K+KI+RSDRGGEYYG+YDE+GQ PGPFAKFLE HGI AQYTMPGTPQQNGVAERRN
Sbjct: 541 QLDKKIKIVRSDRGGEYYGRYDESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRN 600

Query: 541 RTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRHLHV 600
           RTLM MVRSM+  SS+ +SLW  AL+TA Y+LNRVP+K+VPKTPFELWT RKPSLRH+H+
Sbjct: 601 RTLMEMVRSMMSYSSVPISLWGEALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHI 660

Query: 601 WGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIENDI 660
           WGC AE RI NPHEKKLDSRT SG+FIGYP+KSKGYRFYCPNHS RIVETGN RF+EN  
Sbjct: 661 WGCPAEARIYNPHEKKLDSRTVSGYFIGYPDKSKGYRFYCPNHSVRIVETGNARFLENGE 720

Query: 661 ISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHNDIVTNELV 720
           ISGS EPRKV I+E+RV+IP      +++VP  V  V + ++   +   P  +I   E V
Sbjct: 721 ISGSNEPRKVDIEEIRVDIPPPFLPQEIIVPQPVQQVEDNEQNNRDGSLPLENIAI-ENV 780

Query: 721 TKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSIDNDPVSFSQAIKVDNSTKWLDAM 780
            +  Q   LRR  R RR AI+DDY+VYL ES++D+ I  DPVSFSQA++ D+S+KW++AM
Sbjct: 781 VEPPQPAPLRRSQRERRPAITDDYVVYLQESDYDIGIRKDPVSFSQAMESDDSSKWMEAM 840

Query: 781 KEELKSMNDNEVWDLVELPKESKRVGCK 809
            EELKSM  N VWDL+ELP   K VGCK
Sbjct: 841 NEELKSMAHNGVWDLIELPNNCKPVGCK 863

BLAST of Cmc08g0226621 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 6.6e-79
Identity = 245/844 (29.03%), Postives = 393/844 (46.56%), Query Frame = 0

Query: 3   MRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTIHE 62
           +R+ +++++ + I + + A+     +E    S++    L   L   L  +          
Sbjct: 63  IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKL--YLKKQLYALHMSEGTNFLS 122

Query: 63  HILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSMLI 122
           H+     L  +L  +G+++ E      +LNSLPS Y          K    + ++ S L+
Sbjct: 123 HLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALL 182

Query: 123 QEEARLKKPIIHSINPMGHKGAGKK-PRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCRF 182
             E   KKP  +    +  +G G+   R  N  G  G     +  S    K ++++ C  
Sbjct: 183 LNEKMRKKP-ENQGQALITEGRGRSYQRSSNNYGRSG----ARGKSKNRSKSRVRN-CYN 242

Query: 183 CNKPGHYQKDCLK-RKAWFENKGKHN-----ALVCFESNLT------------EVPYNTW 242
           CN+PGH+++DC   RK   E  G+ N     A+V    N+               P + W
Sbjct: 243 CNQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEW 302

Query: 243 WIDSGCTIHVFNTMQGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFD 302
            +D+  + H   T    L  R    +   + MGN     +  +G   +  +    L L D
Sbjct: 303 VVDTAASHHA--TPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKD 362

Query: 303 TFYVPSISRNLISLSKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAE 362
             +VP +  NLIS    D  GY   F N+ + L K ++ I  G+    LY+   +     
Sbjct: 363 VRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEIC--- 422

Query: 363 SLLTLHHNVGTKRGQTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCI 422
                    G      +E S  LWHKR+GH+S++ ++ L K  ++     T +  C  C+
Sbjct: 423 --------QGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCL 482

Query: 423 KGKQTKHSVNKEATRSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHE 482
            GKQ + S    + R   +L+++++D+CG  ++ S G  KYF+TFIDD SR  ++Y+L  
Sbjct: 483 FGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKT 542

Query: 483 KSQEIDALKVFINEVERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQ 542
           K Q     + F   VER+  RK+K LRSD GGEY  +          F ++  SHGI  +
Sbjct: 543 KDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSR---------EFEEYCSSHGIRHE 602

Query: 543 YTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVP-- 602
            T+PGTPQ NGVAER NRT++  VRSML  + L  S W  A++TA YL+NR P  SVP  
Sbjct: 603 KTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSP--SVPLA 662

Query: 603 -KTPFELWTRRKPSLRHLHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYC 662
            + P  +WT ++ S  HL V+GC+A   +      KLD ++    FIGY ++  GYR + 
Sbjct: 663 FEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWD 722

Query: 663 PNHSTRIVETGNVRFIENDIISGSLEPRKVK--IQEVRVEIPSSITSSQVVVPVVVDSVN 722
           P    +++ + +V F E+++ + +    KVK  I    V IPS+ +++        D V+
Sbjct: 723 P-VKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPNFVTIPST-SNNPTSAESTTDEVS 782

Query: 723 NPQEQQINVQTPHNDIVTNELVTKGSQEIE-----------LRRFVRSR---RAAISDDY 782
              EQ      P   I   E + +G +E+E           LRR  R R   R   S +Y
Sbjct: 783 EQGEQ------PGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEY 842

Query: 783 LVYLHESEFDLSIDNDPVSFSQAIKVDNSTKWLDAMKEELKSMNDNEVWDLVELPKESKR 809
           ++        +S D +P S  + +      + + AM+EE++S+  N  + LVELPK  + 
Sbjct: 843 VL--------ISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRP 858

BLAST of Cmc08g0226621 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 191.8 bits (486), Expect = 3.0e-47
Identity = 196/801 (24.47%), Postives = 332/801 (41.45%), Query Frame = 0

Query: 21  AKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGME 80
           A++ ++ ++   + +S    LA  L   L ++K     ++  H      L + L   G +
Sbjct: 77  ARQILENLDAVYERKSLASQLA--LRKRLLSLKLSSEMSLLSHFHIFDELISELLAAGAK 136

Query: 81  VNENFLVTFILNSLPSKYGPFHMNYNTL-KDKWNVHELQSMLIQEEARLKKPIIHSINPM 140
           + E   ++ +L +LPS Y        TL ++   +  +++ L+ +E ++K    H+    
Sbjct: 137 IEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVKNRLLDQEIKIKND--HNDTSK 196

Query: 141 GHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWF 200
               A         K N  + +V +         + K KC  C + GH +KDC   K   
Sbjct: 197 KVMNAIVHNNNNTYKNNLFKNRVTKPKKIFKGNSKYKVKCHHCGREGHIKKDCFHYKRIL 256

Query: 201 ENKGKHN------------ALVCFESNLTEVPYNTWWI-DSGCTIHVFNTMQGFLTTRTT 260
            NK K N            A +  E N T V  N  ++ DSG + H+ N    +  +   
Sbjct: 257 NNKNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESLYTDSVEV 316

Query: 261 NLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKHDTSGYY 320
               +         +     G  RL  D  H + L D  +    + NL+S+ +   +G  
Sbjct: 317 VPPLKIAVAKQGEFIYATKRGIVRLRND--HEITLEDVLFCKEAAGNLMSVKRLQEAGMS 376

Query: 321 FKFGNECFSLFKQNIFI--GSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTNESSA 380
            +F     ++ K  + +   SG+    L  +   N  A S+   H N           + 
Sbjct: 377 IEFDKSGVTISKNGLMVVKNSGM----LNNVPVINFQAYSINAKHKN-----------NF 436

Query: 381 YLWHKRLGHISKERIKRLIKNEILPD---LNFTDLG--ICVDCIKGKQTKHSVN--KEAT 440
            LWH+R GHIS  ++  + +  +  D   LN  +L   IC  C+ GKQ +      K+ T
Sbjct: 437 RLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKT 496

Query: 441 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 500
              + L ++H+D+CG     +  ++ YF+ F+D F+ Y   YL+  KS      + F+ +
Sbjct: 497 HIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAK 556

Query: 501 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 560
            E   + KV  L  D G EY               +F    GI    T+P TPQ NGV+E
Sbjct: 557 SEAHFNLKVVYLYIDNGREYLS---------NEMRQFCVKKGISYHLTVPHTPQLNGVSE 616

Query: 561 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSV---PKTPFELWTRRKPS 620
           R  RT+    R+M+  + L  S W  A+ TA YL+NR+P++++    KTP+E+W  +KP 
Sbjct: 617 RMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPY 676

Query: 621 LRHLHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVR 680
           L+HL V+G    V I N  + K D ++    F+GY  +  G++ +         +  N +
Sbjct: 677 LKHLRVFGATVYVHIKN-KQGKFDDKSFKSIFVGY--EPNGFKLW---------DAVNEK 736

Query: 681 FI-ENDIISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTP--- 740
           FI   D++    E   V  + V+ E      S +       ++ N P + +  +QT    
Sbjct: 737 FIVARDVVVD--ETNMVNSRAVKFETVFLKDSKE------SENKNFPNDSRKIIQTEFPN 796

Query: 741 HNDIVTNELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSIDNDPVSFSQAIKV 792
            +    N    K S+E E + F    R  I  ++     E +    + +   S    +  
Sbjct: 797 ESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNE 827

BLAST of Cmc08g0226621 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 2.4e-41
Identity = 160/668 (23.95%), Postives = 295/668 (44.16%), Query Frame = 0

Query: 56  GSRTIHEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVH 115
           G++TI +++  ++    +L  +G  ++ +  V  +L +LP +Y P             + 
Sbjct: 138 GTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLT 197

Query: 116 ELQSMLIQEEARLKKPIIHSINPM--------------GHKGAGKKPRKKNGKGNHGQLK 175
           E+   L+  E+++      ++ P+               +    +  R  N   N+    
Sbjct: 198 EIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKP 257

Query: 176 VKQSSSPIH-KKGQIK---DKCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCF-----E 235
            +QSS+  H    Q K    KC+ C   GH  K C + + +  +         F      
Sbjct: 258 WQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPR 317

Query: 236 SNLT-EVPY--NTWWIDSGCTIHVFNTMQGFLTTRTTNLNERFIFMGNRVKVPVEAVGTY 295
           +NL    PY  N W +DSG T H+ +     L+          + + +   +P+   G+ 
Sbjct: 318 ANLALGSPYSSNNWLLDSGATHHITSDFNN-LSLHQPYTGGDDVMVADGSTIPISHTGST 377

Query: 296 RLTLDTRHHLDLFDTFYVPSISRNLISLSK-HDTSGYYFKFGNECFSLFKQN--IFIGSG 355
            L+  +R  L+L +  YVP+I +NLIS+ +  + +G   +F    F +   N  + +  G
Sbjct: 378 SLSTKSR-PLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQG 437

Query: 356 ILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLGHISKERIKRLIKNE 415
              D LY+    +    SL             +++++   WH RLGH +   +  +I N 
Sbjct: 438 KTKDELYEWPIASSQPVSLFA---------SPSSKATHSSWHARLGHPAPSILNSVISNY 497

Query: 416 ILPDLNFTDLGI-CVDCIKGKQTKHSVNKEATRSSQLLEIIHTDICGSFDVPSFGEEKYF 475
            L  LN +   + C DC+  K  K   ++    S++ LE I++D+  S  + S    +Y+
Sbjct: 498 SLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSS-PILSHDNYRYY 557

Query: 476 ITFIDDFSRYGYIYLLHEKSQEIDALKVFINEVERQLDRKVKILRSDRGGEYYGKYDENG 535
           + F+D F+RY ++Y L +KSQ  +    F N +E +   ++    SD GGE+   ++   
Sbjct: 558 VIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWE--- 617

Query: 536 QCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLLVSLWMYAL 595
                   +   HGI    + P TP+ NG++ER++R ++    ++L ++S+  + W YA 
Sbjct: 618 --------YFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAF 677

Query: 596 RTAQYLLNRVPNKSVP-KTPFELWTRRKPSLRHLHVWGCQAEVRICNPHEKKLDSRTTSG 655
             A YL+NR+P   +  ++PF+      P+   L V+GC     +   ++ KLD ++   
Sbjct: 678 AVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQC 737

Query: 656 FFIGYPEKSKGYRFYCPN-HSTRIVETGNVRFIENDIISGSLEPRKVKIQEVRVEIPSSI 692
            F+GY      Y   C +  ++R+  + +VRF EN     +       +QE R E  S +
Sbjct: 738 VFLGYSLTQSAY--LCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRE-SSCV 779

BLAST of Cmc08g0226621 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 9.3e-41
Identity = 151/593 (25.46%), Postives = 259/593 (43.68%), Query Frame = 0

Query: 73  RLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSMLIQEEARLKK-- 132
           +L  +G  ++ +  V  +L +LP  Y P            ++ E+   LI  E++L    
Sbjct: 136 QLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLALN 195

Query: 133 -----PIIHSI---------NPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIK 192
                PI  ++             ++G  +     N + N  Q     S S   +     
Sbjct: 196 SAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYL 255

Query: 193 DKCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCF-----ESNL-TEVPY--NTWWIDSG 252
            +C+ C+  GH  K C +   +     +  +   F      +NL    PY  N W +DSG
Sbjct: 256 GRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSG 315

Query: 253 CTIHVFNTMQGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVP 312
            T H+ +     L+          + + +   +P+   G+  L   +R  LDL    YVP
Sbjct: 316 ATHHITSDFNN-LSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSR-SLDLNKVLYVP 375

Query: 313 SISRNLISLSK-HDTSGYYFKFGNECFSLFKQN--IFIGSGILCDGLYKLKFDNVFAESL 372
           +I +NLIS+ +  +T+    +F    F +   N  + +  G   D LY+    +  A S+
Sbjct: 376 NIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSM 435

Query: 373 LTLHHNVGTKRGQTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFT-DLGICVDCIK 432
                         ++++   WH RLGH S   +  +I N  LP LN +  L  C DC  
Sbjct: 436 FA---------SPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFI 495

Query: 433 GKQTKHSVNKEATRSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEK 492
            K  K   +     SS+ LE I++D+  S  + S    +Y++ F+D F+RY ++Y L +K
Sbjct: 496 NKSHKVPFSNSTITSSKPLEYIYSDVWSS-PILSIDNYRYYVIFVDHFTRYTWLYPLKQK 555

Query: 493 SQEIDALKVFINEVERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQY 552
           SQ  D   +F + VE +   ++  L SD GGE+    D           +L  HGI    
Sbjct: 556 SQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRD-----------YLSQHGISHFT 615

Query: 553 TMPGTPQQNGVAERRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVP-KT 612
           + P TP+ NG++ER++R ++ M  ++L ++S+  + W YA   A YL+NR+P   +  ++
Sbjct: 616 SPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQS 675

Query: 613 PFELWTRRKPSLRHLHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGY 637
           PF+    + P+   L V+GC     +   +  KL+ ++    F+GY      Y
Sbjct: 676 PFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAY 705

BLAST of Cmc08g0226621 vs. ExPASy Swiss-Prot
Match: Q04214 (Transposon Ty1-MR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY1B-MR1 PE=1 SV=2)

HSP 1 Score: 109.8 bits (273), Expect = 1.5e-22
Identity = 149/667 (22.34%), Postives = 272/667 (40.78%), Query Frame = 0

Query: 34  SESANKSLAGTL----MSTLTNIKFDGSRTIHEHILEMMNLAARLKTMGMEVNENFLVTF 93
           S+S NK  + T     ++TL  + ++GS        E+ N+  RL   G+ +N      F
Sbjct: 253 SKSINKMQSDTQEVNDITTLATLHYNGSTPADAFEAEVTNILDRLNNNGIPINNKVACQF 312

Query: 94  ILNSLPS-----KYGPFHMNYNTLKDKWNVHELQSMLIQEEARLKKPIIHSINPMGHKGA 153
           I+  L       +Y      + T+ D ++  ++ SM  +++   +    H  +P   K  
Sbjct: 313 IMRGLSGEYKFLRYARHRCIHMTVADLFS--DIHSMYEEQQESKRNKSTHRRSPSDEKKD 372

Query: 154 GK------KPR---KKNGKGNHGQLKVKQSSSPIHKKGQIKDKCRFCNKPGHYQKDCLKR 213
            +      KP+   + + K N+ Q +  ++           +   F N PG    D ++ 
Sbjct: 373 SRTYTNTTKPKSITRNSQKPNNSQSRTARA----------HNVSTFNNSPGP-DNDLIRG 432

Query: 214 KAWFENKGKHNALVCFESNLTEVPYN-----------TWWIDSGCTIHVFNTMQGFLTTR 273
                 + K+   +     LTE   N              +DSG +  +  +    + + 
Sbjct: 433 STTEPIQLKNTHDLHLGQELTESTVNHTNHSDDKLPGHLLLDSGASRTLIRSAH-HIHSA 492

Query: 274 TTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKHDTSG 333
           ++N +   +    R  +P+ A+G  +          +    + P+I+ +L+SL++     
Sbjct: 493 SSNPDINVVDAQKR-NIPINAIGDLQFHFQDNTKTSI-KVLHTPNIAYDLLSLNELAAVD 552

Query: 334 YYFKFGNECFSLFKQNIFIGSGILCDGLYKL-KFDNVFAESLLTLHHNVGT-KRGQTNES 393
                   CF+  K  +    G +   + K   F  V  + LL  + +V T     T+ES
Sbjct: 553 I-----TACFT--KNVLERSDGTVLAPIVKYGDFYWVSKKYLLPSNISVPTINNVHTSES 612

Query: 394 SAY----LWHKRLGHISKERIKRLIKNEILPDLNFTDLG-------ICVDCIKGKQTKHS 453
           +        H+ L H + + I+  +KN  +   N +D+         C DC+ GK TKH 
Sbjct: 613 TRKYPYPFIHRMLAHANAQTIRYSLKNNTITYFNESDVDWSSAIDYQCPDCLIGKSTKHR 672

Query: 454 VNK----EATRSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQE 513
             K    +   S +  + +HTDI G           YFI+F D+ +++ ++Y LH++ ++
Sbjct: 673 HIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTKFRWVYPLHDRRED 732

Query: 514 --IDALKVFINEVERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYT 573
             +D     +  ++ Q    V +++ DRG EY  +            KFLE +GI   YT
Sbjct: 733 SILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNR---------TLHKFLEKNGITPCYT 792

Query: 574 MPGTPQQNGVAERRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPF 633
                + +GVAER NRTL++  R+ L  S L   LW  A+  +  + N + +    K+  
Sbjct: 793 TTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASPKSKKSAR 852

Query: 634 ELWTRRKPSLRHLHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHST 653
           +        +  L  +G    V   NP+  K+  R   G+ +     S GY  Y P+   
Sbjct: 853 QHAGLAGLDISTLLPFGQPVIVNDHNPN-SKIHPRGIPGYALHPSRNSYGYIIYLPS-LK 885

BLAST of Cmc08g0226621 vs. ExPASy TrEMBL
Match: A0A445KPR8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine soja OX=3848 GN=D0Y65_012596 PE=4 SV=1)

HSP 1 Score: 1153.7 bits (2983), Expect = 0.0e+00
Identity = 574/814 (70.52%), Postives = 670/814 (82.31%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++A+KSLAGTLMSTLT +KFDGSRT+
Sbjct: 149 MFMRMTVADSIKTALPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 208

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
           HEH++EM N+AARLKT+GM VNENFLV FILNSLPS+YGPF M+YNT+KDKWNVHEL SM
Sbjct: 209 HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYGPFQMSYNTMKDKWNVHELHSM 268

Query: 121 LIQEEARLKKPIIHSINPMGHK---GAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKD 180
           L+QEE RLK    HSI+ + H+   GAGK   KK+ KG  G LK+K     I KK    +
Sbjct: 269 LVQEETRLKNQGSHSIHYVSHRGNQGAGKNFVKKHDKGK-GPLKIKDGPVQIQKKASKNN 328

Query: 181 KCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTM 240
            C FC K GH+QKDC KRK+WFE KG+ NALV FESNLTEVP+NTWWIDSGCT HV NTM
Sbjct: 329 NCHFCGKSGHFQKDCPKRKSWFEKKGELNALVYFESNLTEVPHNTWWIDSGCTTHVSNTM 388

Query: 241 QGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISL 300
           QGFLT +T + NE+F+FMGNRVK PVEAVGTYRL LDT HHLDL +T YVPS+SRNL+SL
Sbjct: 389 QGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSL 448

Query: 301 SKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRG 360
           SK D +GY F FGN CFSLFK N  IG+G+LCDGLYKLK D ++ E++LTLHHNVGTKR 
Sbjct: 449 SKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRS 508

Query: 361 QTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEAT 420
             NE SA+LWHKRLGHIS+ERI+RLIKNEILPDL+FTDL ICVDCIKGKQTKH+  K AT
Sbjct: 509 LVNERSAFLWHKRLGHISRERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGAT 568

Query: 421 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 480
           RS+QLLEI+HTDICG FDV SFG E+YFITFIDD+SRYGY+YLLHEKSQ ++AL++++NE
Sbjct: 569 RSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNE 628

Query: 481 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 540
           VERQLDRKVKI+RSDRGGEYYG+YDE GQ PGPFAK L+  GICAQYTMPGTPQQNGV+E
Sbjct: 629 VERQLDRKVKIIRSDRGGEYYGRYDETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 688

Query: 541 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRH 600
           RRNRTLM+MVRSMLINS+L VSLWMYAL+TA YLLN VP+K+VPKTPFELWT R PS+RH
Sbjct: 689 RRNRTLMDMVRSMLINSTLPVSLWMYALKTAMYLLNMVPSKAVPKTPFELWTNRTPSMRH 748

Query: 601 LHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIE 660
           LHVWGCQAE+RI NP E+KLD+RT SG+FIGYPEKSKGY FYCPNHSTRIVETGN RFIE
Sbjct: 749 LHVWGCQAEIRIYNPQERKLDARTISGYFIGYPEKSKGYMFYCPNHSTRIVETGNARFIE 808

Query: 661 NDIISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHND--IV 720
           N  ISGS  PR+V+I+EVRV++P +  SS  V+   V + N+ +E Q      HND  ++
Sbjct: 809 NGEISGSTVPREVEIKEVRVQVPLAFASSSKVITTSVTATNSNEEVQ------HNDEPMI 868

Query: 721 TNELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSI-DNDPVSFSQAIKVDNST 780
            NE + +  QE+ L++  R RR AIS+DY+VYLHE E +LSI DNDPVSFSQAI  DNS 
Sbjct: 869 HNEPIMEEPQEVALKKSQRERRPAISNDYVVYLHEIETNLSINDNDPVSFSQAISCDNSE 928

Query: 781 KWLDAMKEELKSMNDNEVWDLVELPKESKRVGCK 809
           KWL+AMKEE+ SM  N VWDLVELPK  KRVGCK
Sbjct: 929 KWLNAMKEEIDSMEHNGVWDLVELPKGCKRVGCK 952

BLAST of Cmc08g0226621 vs. ExPASy TrEMBL
Match: A0A445LQ30 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_004205 PE=4 SV=1)

HSP 1 Score: 1153.7 bits (2983), Expect = 0.0e+00
Identity = 574/814 (70.52%), Postives = 671/814 (82.43%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++A+KSLAGTLMSTLT +KFDGSRT+
Sbjct: 138 MFMRMTVADSIKTALPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 197

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
           HEH++EM N+AARLKT+GM VNENFLV FILNSLPS+YGPF M+YNT+KDKWNVHEL SM
Sbjct: 198 HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYGPFQMSYNTMKDKWNVHELHSM 257

Query: 121 LIQEEARLKKPIIHSINPMGHK---GAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKD 180
           L+QEE RLK    HSI+ + H+   GAGKK  KK+ KG  G LK+K     I KK    +
Sbjct: 258 LVQEETRLKNQGSHSIHYVSHRGNQGAGKKFVKKHDKGK-GPLKIKDGPVQIQKKASKNN 317

Query: 181 KCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTM 240
            C FC K GH+QKDC KRK+WFE KG+ NALVCFESNLTEVP+NTWWIDSGCT HV NTM
Sbjct: 318 NCHFCGKSGHFQKDCPKRKSWFEKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTM 377

Query: 241 QGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISL 300
           QGFLT +T + NE+F+FMGNRVK PVEAVGTYRL LDT HHLDL +T YVPS+SRNL+SL
Sbjct: 378 QGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSL 437

Query: 301 SKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRG 360
           SK D +GY F FGN CFSLFK N  IG+G+LCDGLYKLK D ++ E++LTLHHNVGTKR 
Sbjct: 438 SKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRS 497

Query: 361 QTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEAT 420
             NE SA+LWHKRLGHIS ERI+RLIKNEILPDL+FTDL ICVDCIKGKQTKH+  K AT
Sbjct: 498 LVNERSAFLWHKRLGHISGERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGAT 557

Query: 421 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 480
           RS+QLLEI+HTDICG FDV SFG E+YFITFIDD+SRYGY+YLLHEKSQ ++AL++++NE
Sbjct: 558 RSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNE 617

Query: 481 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 540
           VERQLDRKVKI+RSDRGGEYY +YDE GQ P PFAK L+  GICAQYTMPGTPQQNGV+E
Sbjct: 618 VERQLDRKVKIIRSDRGGEYYRRYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSE 677

Query: 541 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRH 600
           RRN+TLM+MVRSMLINS+L VSLWMYAL+TA YLLNRVP+K+VPKTPFELWT R PS+RH
Sbjct: 678 RRNKTLMDMVRSMLINSTLPVSLWMYALKTAMYLLNRVPSKAVPKTPFELWTNRTPSMRH 737

Query: 601 LHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIE 660
           LHVWGCQAE+RI NP E+KLD+RT SG+FIGYPEKSKGY FYCPNHSTRIVETGN RFIE
Sbjct: 738 LHVWGCQAEIRIYNPQERKLDARTISGYFIGYPEKSKGYMFYCPNHSTRIVETGNARFIE 797

Query: 661 NDIISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHND--IV 720
           N  ISGS  PR+V+I+EVRV++P +  SS  V+   V + N+ +E Q      HND  ++
Sbjct: 798 NGEISGSTVPREVEIKEVRVQVPLAFASSSKVITTSVTATNSNEEVQ------HNDEPMI 857

Query: 721 TNELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSI-DNDPVSFSQAIKVDNST 780
            NE + +  QE+ LR+  R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQAI  DNS 
Sbjct: 858 HNEPIMEEPQEVALRKSQRERRPAISNDYVVYLHETETNLSINDNDPVSFSQAISCDNSE 917

Query: 781 KWLDAMKEELKSMNDNEVWDLVELPKESKRVGCK 809
           KWL+AMKEE+ SM  N+VWDLVELPK  KRVG K
Sbjct: 918 KWLNAMKEEIDSMEHNDVWDLVELPKGCKRVGYK 941

BLAST of Cmc08g0226621 vs. ExPASy TrEMBL
Match: A0A151R237 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_042301 PE=4 SV=1)

HSP 1 Score: 1147.9 bits (2968), Expect = 0.0e+00
Identity = 567/813 (69.74%), Postives = 675/813 (83.03%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMTVA++IK+T+  TE AKEFM +V +  +S++A+KSLAGTLMSTLT +KFDGSRT+
Sbjct: 1   MFMRMTVADSIKTTLPKTESAKEFMGFVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 60

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
           HEH++EM N+AARLK++GM VNENFLV FILNSLP++YGPF M+YNT+KDKWNVHEL SM
Sbjct: 61  HEHVIEMTNIAARLKSLGMAVNENFLVQFILNSLPTEYGPFQMSYNTMKDKWNVHELHSM 120

Query: 121 LIQEEARLKKP---IIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKD 180
           L+QEE RLK      IH ++  G++GAGKK  KK+ KG    LK+ ++S PI KK    +
Sbjct: 121 LVQEETRLKNQGSHSIHYVSHQGNQGAGKKFVKKHDKGKK-PLKINEASVPIQKKASKGN 180

Query: 181 KCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTM 240
            C FC K GH+QKDC KRKAWFE KGK NA VCFESNLTEVP+NTWWIDSGCT HV NTM
Sbjct: 181 NCHFCGKSGHFQKDCPKRKAWFEKKGKLNAYVCFESNLTEVPHNTWWIDSGCTTHVSNTM 240

Query: 241 QGFLTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISL 300
           QGF T +T + NE+F+FMGNRVKVPVEAVGTYRL L+T HHLDL +T YVPS+SRNL+SL
Sbjct: 241 QGFTTIQTISPNEKFVFMGNRVKVPVEAVGTYRLILNTGHHLDLLETLYVPSLSRNLVSL 300

Query: 301 SKHDTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRG 360
           SK D  GY F FGN CFSLFK+N  IG+GILCDGLYKL  D ++ E+LLTLHHN+GTKR 
Sbjct: 301 SKLDAIGYSFTFGNGCFSLFKRNHLIGTGILCDGLYKLNLDGLYDETLLTLHHNIGTKRS 360

Query: 361 QTNESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEAT 420
             NE SA+LWH+RLGHIS+ER++RLIKNEILP+L+FTDL ICVDCIKGKQTKH+  K AT
Sbjct: 361 LVNERSAFLWHRRLGHISRERMERLIKNEILPNLDFTDLNICVDCIKGKQTKHT-KKGAT 420

Query: 421 RSSQLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINE 480
           RS+QLLEI+HTDICG FDV SFG+EKYFITFIDD+SRYGY+YLLHEKSQ +DAL++++NE
Sbjct: 421 RSTQLLEIVHTDICGPFDVNSFGKEKYFITFIDDYSRYGYVYLLHEKSQAVDALEIYLNE 480

Query: 481 VERQLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAE 540
           VERQLD+KVK++RSDRGGEYYG+Y+E GQ PGPFAK L+  GICAQYTMPGTPQQNGV+E
Sbjct: 481 VERQLDKKVKVVRSDRGGEYYGRYNETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 540

Query: 541 RRNRTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRH 600
           RRNRTLM+MVRSML NS+L + LWMYAL+TA YLLNRVP+K+V KTPFELWT R PSLRH
Sbjct: 541 RRNRTLMDMVRSMLSNSTLPIYLWMYALKTAMYLLNRVPSKAVSKTPFELWTGRTPSLRH 600

Query: 601 LHVWGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIE 660
           LHVWGCQAE+RI NP EKKLD+RT SG+FIGYPEKSKGY FYCPNH+ RIVETGN RFIE
Sbjct: 601 LHVWGCQAEIRIYNPQEKKLDARTISGYFIGYPEKSKGYMFYCPNHNMRIVETGNARFIE 660

Query: 661 NDIISGSLEPRKVKIQEVRVEIPSSITS-SQVVVPVVVDSVNNPQEQQINVQTPHNDIVT 720
           N  +SGS  P+KV+++EVR+++P +  S ++V VP+ V+S NN +EQ  N       ++ 
Sbjct: 661 NGEVSGSTIPQKVEVKEVRMQVPLTYASGNKVSVPLTVESNNNEEEQHNN-----EPMIH 720

Query: 721 NELVTKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSI-DNDPVSFSQAIKVDNSTK 780
           NE + +  QEI LRR  R +R AIS+DY+VYLHE E D SI +NDPVSFSQA+  DNS K
Sbjct: 721 NEPIVEQPQEIALRRSQREKRPAISNDYMVYLHELENDSSINENDPVSFSQAVSCDNSEK 780

Query: 781 WLDAMKEELKSMNDNEVWDLVELPKESKRVGCK 809
           WL+AMKEELKSM  N+VWDLVELP+  KRVGCK
Sbjct: 781 WLNAMKEELKSMEQNDVWDLVELPEGCKRVGCK 804

BLAST of Cmc08g0226621 vs. ExPASy TrEMBL
Match: A0A438JI44 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2658 PE=4 SV=1)

HSP 1 Score: 988.4 bits (2554), Expect = 1.8e-284
Identity = 496/808 (61.39%), Postives = 613/808 (75.87%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMT+ANNIK+++  TE A EF+K VE+  + + A+KSLAGTLM+ LT +K+DG + I
Sbjct: 61  MFMRMTIANNIKTSLPQTEFASEFLKSVEE--RFKRADKSLAGTLMAELTTMKYDGQKGI 120

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
            +HIL M   AA+LK +GM ++E+FLV F+LNSLPS++ PF ++YNT  D+WN++EL S 
Sbjct: 121 QQHILNMTEKAAKLKALGMGMDESFLVQFVLNSLPSQFAPFKIHYNTNSDQWNLNELTSK 180

Query: 121 LIQEEARLKKPIIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCR 180
            IQEE RL++   +    + H    KK + K GK    +       S  H  G+    C 
Sbjct: 181 CIQEEVRLRQEGHNLAFAVTHGVTKKKGKFKKGKNFPPKKSGPGEGSQSH-DGKFTVSCY 240

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTMQGF 240
           FC K GH +KDC+KRKAWFE +G + + VC+ESNL EVP NTWWIDSG T HV N MQGF
Sbjct: 241 FCGKKGHVKKDCIKRKAWFEKRGINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGF 300

Query: 241 LTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKH 300
           LTTR    +E+F++MGNR+KV V AVGTYRL L+T H +DL +TFYVPSISRNL+SLSK 
Sbjct: 301 LTTRKPKESEKFLYMGNRLKVEVVAVGTYRLLLETGHRMDLLNTFYVPSISRNLVSLSKL 360

Query: 301 DTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTN 360
           D +GY   F +   SL    + +GSGILCDGLYK+  ++ FA++L+TLH NVG+KRG  N
Sbjct: 361 DATGYSVLFNSGQLSLMLNYVTVGSGILCDGLYKISLNHEFAQALITLHSNVGSKRGLIN 420

Query: 361 ESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEATRSS 420
           E+S+ LWH+RLGHIS+ERI+RL+K  IL +L+FTD  +CVDCIKGKQTKH+  K ATRS+
Sbjct: 421 ENSSILWHRRLGHISRERIERLVKEGILQNLDFTDFHVCVDCIKGKQTKHT-KKGATRSN 480

Query: 421 QLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINEVER 480
           +LLEIIHTDICG   VP F  EKYFITFIDD SRYGY+YL+HEKSQ ID  ++FI EVER
Sbjct: 481 ELLEIIHTDICGPLSVPCFTGEKYFITFIDDLSRYGYVYLMHEKSQAIDIFEMFITEVER 540

Query: 481 QLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRN 540
           QLD+K+KI+RSDRGGEYYG+YDE+GQ PGPFAKFLE HGI AQYTMPGTPQQNGVAERRN
Sbjct: 541 QLDKKIKIVRSDRGGEYYGRYDESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRN 600

Query: 541 RTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRHLHV 600
           RTLM MVRSM+  SS+ +SLW  AL+TA Y+LNRVP+K+VPKTPFELWT RKPSLRH+H+
Sbjct: 601 RTLMEMVRSMMSYSSVPISLWGEALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHI 660

Query: 601 WGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIENDI 660
           WGC AE RI NPHEKKLDSRT SG+FIGYP+KSKGYRFYCPNHS RIVETGN RF+EN  
Sbjct: 661 WGCPAEARIYNPHEKKLDSRTVSGYFIGYPDKSKGYRFYCPNHSVRIVETGNARFLENGE 720

Query: 661 ISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHNDIVTNELV 720
           ISGS EPRKV I+E+RV+IP      +++VP  V  V + ++   +   P  +I   E V
Sbjct: 721 ISGSNEPRKVDIEEIRVDIPPPFLPQEIIVPQPVQQVEDNEQNNRDGSLPLENIAI-ENV 780

Query: 721 TKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSIDNDPVSFSQAIKVDNSTKWLDAM 780
            +  Q   LRR  R RR AI+DDY+VYL ES++D+ I  DPVSFSQA++ D+S+KW++AM
Sbjct: 781 VEPPQPAPLRRSQRERRPAITDDYVVYLQESDYDIGIRKDPVSFSQAMESDDSSKWMEAM 840

Query: 781 KEELKSMNDNEVWDLVELPKESKRVGCK 809
            EELKSM  N VWDL+ELP   K VGCK
Sbjct: 841 NEELKSMAHNGVWDLIELPNNCKPVGCK 863

BLAST of Cmc08g0226621 vs. ExPASy TrEMBL
Match: A0A438F5W4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_3508 PE=4 SV=1)

HSP 1 Score: 987.6 bits (2552), Expect = 3.0e-284
Identity = 495/808 (61.26%), Postives = 613/808 (75.87%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKYVEKCSQSESANKSLAGTLMSTLTNIKFDGSRTI 60
           MFMRMT+ANNIK+++  TE A EF+K VE+  + + A+KSLAGTLM+ LT +K+DG + I
Sbjct: 61  MFMRMTIANNIKTSLPQTEFASEFLKSVEE--RFKRADKSLAGTLMAELTTMKYDGQKGI 120

Query: 61  HEHILEMMNLAARLKTMGMEVNENFLVTFILNSLPSKYGPFHMNYNTLKDKWNVHELQSM 120
            +HIL M   AA+LK +GM ++E+FLV F+LNSLPS++ PF ++YNT  D+WN++EL S 
Sbjct: 121 QQHILNMTEKAAKLKALGMGMDESFLVQFVLNSLPSQFAPFKIHYNTNSDQWNLNELTSK 180

Query: 121 LIQEEARLKKPIIHSINPMGHKGAGKKPRKKNGKGNHGQLKVKQSSSPIHKKGQIKDKCR 180
            IQEE RL++   +    + H    KK + K GK    +       S  H  G+    C 
Sbjct: 181 CIQEEVRLRQEGHNHAFAVTHGVTKKKGKFKKGKNFPPKKSGPGEGSQSH-DGKFTVSCY 240

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVFNTMQGF 240
           FC K GH +KDC+KRKAWFE +G + + VC+ESNL EVP NTWWIDSG T HV N MQGF
Sbjct: 241 FCGKKGHVKKDCIKRKAWFEKRGINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGF 300

Query: 241 LTTRTTNLNERFIFMGNRVKVPVEAVGTYRLTLDTRHHLDLFDTFYVPSISRNLISLSKH 300
           LTTR    +E+F++MGNR+KV V AVGTYRL L+T H +DL +TFYVPSISRNL+SLSK 
Sbjct: 301 LTTRKPKESEKFLYMGNRLKVEVVAVGTYRLLLETGHRMDLLNTFYVPSISRNLVSLSKL 360

Query: 301 DTSGYYFKFGNECFSLFKQNIFIGSGILCDGLYKLKFDNVFAESLLTLHHNVGTKRGQTN 360
           D +GY   F +   SL   ++ +GSGILCDGLYK+  ++ FA++L+TLH NVG+KRG  N
Sbjct: 361 DATGYSVLFSSGQLSLMLNSVTVGSGILCDGLYKISLNHEFAQALITLHSNVGSKRGLIN 420

Query: 361 ESSAYLWHKRLGHISKERIKRLIKNEILPDLNFTDLGICVDCIKGKQTKHSVNKEATRSS 420
           E+S+ LWH+RLGHIS+ERI+RL+K  IL +L+FTD  +CVDCIKGKQTKH+  K ATRS+
Sbjct: 421 ENSSILWHRRLGHISRERIERLVKEGILQNLDFTDFHVCVDCIKGKQTKHT-KKGATRSN 480

Query: 421 QLLEIIHTDICGSFDVPSFGEEKYFITFIDDFSRYGYIYLLHEKSQEIDALKVFINEVER 480
           +LLEIIH DICG   VP F  EKYFITFIDD SRYGY+YL+HEKSQ ID  ++FI EVER
Sbjct: 481 ELLEIIHIDICGPLSVPCFTGEKYFITFIDDLSRYGYVYLMHEKSQAIDIFEMFITEVER 540

Query: 481 QLDRKVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRN 540
           QLD+K+KI+RSDRGGEYYG+YDE+GQ PGPFAKFLE HGI AQYTMPGTPQQNGVAERRN
Sbjct: 541 QLDKKIKIVRSDRGGEYYGRYDESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRN 600

Query: 541 RTLMNMVRSMLINSSLLVSLWMYALRTAQYLLNRVPNKSVPKTPFELWTRRKPSLRHLHV 600
           RTLM MVRSM+  SS+ +SLW  AL+TA Y+LNRVP+K+VPKTPFELWT RKPSLRH+H+
Sbjct: 601 RTLMEMVRSMMSYSSVPISLWGEALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHI 660

Query: 601 WGCQAEVRICNPHEKKLDSRTTSGFFIGYPEKSKGYRFYCPNHSTRIVETGNVRFIENDI 660
           WGC AE RI NPHEKKLDSRT SG+FIGYP+KSKGYRFYCPNHS RIVETGN RF+EN  
Sbjct: 661 WGCPAEARIYNPHEKKLDSRTVSGYFIGYPDKSKGYRFYCPNHSVRIVETGNARFLENGE 720

Query: 661 ISGSLEPRKVKIQEVRVEIPSSITSSQVVVPVVVDSVNNPQEQQINVQTPHNDIVTNELV 720
           ISGS EPRKV I+E+RV+IP      +++VP  V  V + ++   +   P  +I   E V
Sbjct: 721 ISGSNEPRKVDIEEIRVDIPPPFLPQEIIVPQPVQQVEDNEQNNRDGSLPLENIAI-ENV 780

Query: 721 TKGSQEIELRRFVRSRRAAISDDYLVYLHESEFDLSIDNDPVSFSQAIKVDNSTKWLDAM 780
            +  Q   LRR  R RR AI+DDY+VYL ES++D+ I  DPVSFSQA++ D+S+KW++AM
Sbjct: 781 VEPPQPAPLRRSQRERRPAITDDYVVYLQESDYDIGIRKDPVSFSQAMESDDSSKWMEAM 840

Query: 781 KEELKSMNDNEVWDLVELPKESKRVGCK 809
            EELKSM  N VWDL+ELP   K VGCK
Sbjct: 841 NEELKSMAHNGVWDLIELPNNCKPVGCK 863

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RZC12927.10.0e+0070.52Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine s... [more]
RZC25410.10.0e+0070.52Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja][more]
KYP36562.10.0e+0069.74Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
RVX08602.13.6e-28461.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW55286.16.2e-28461.26Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
P109786.6e-7929.03Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041463.0e-4724.47Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q94HW22.4e-4123.95Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT949.3e-4125.46Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q042141.5e-2222.34Transposon Ty1-MR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
A0A445KPR80.0e+0070.52Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine... [more]
A0A445LQ300.0e+0070.52Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3... [more]
A0A151R2370.0e+0069.74Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A438JI441.8e-28461.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438F5W43.0e-28461.26Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 461..481
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 6..130
e-value: 1.4E-18
score: 67.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 143..157
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 135..167
NoneNo IPR availablePANTHERPTHR35317:SF10ZINC FINGER, CCHC-TYPE-RELATEDcoord: 1..152
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 1..152
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 331..407
e-value: 1.7E-16
score: 59.8
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 424..530
e-value: 9.3E-12
score: 45.2
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 418..592
score: 21.839499
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 413..609
e-value: 5.2E-41
score: 142.1
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 178..192
score: 9.125269
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 175..200
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 418..590

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc08g0226621.1Cmc08g0226621.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0015074 DNA integration
biological_process GO:0009231 riboflavin biosynthetic process
cellular_component GO:0016020 membrane
molecular_function GO:0008686 3,4-dihydroxy-2-butanone-4-phosphate synthase activity
molecular_function GO:0003887 DNA-directed DNA polymerase activity
molecular_function GO:0004491 methylmalonate-semialdehyde dehydrogenase (acylating) activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding