Cmc11g0307031 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc11g0307031
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr11: 28517292 .. 28518809 (-)
RNA-Seq Expression: Cmc11g0307031
Synteny: Cmc11g0307031
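
The location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("CMiso1.1chr11: 28517292 .. 28518809 (-)")
print(seqid, strand, span_length(start, end))  # CMiso1.1chr11 - 1518
```

Under that assumption the feature spans 1518 bp, a multiple of three, on the minus strand.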
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTCATATAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGACAAGGGTAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGAACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATTTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATATGATGTTGGGATTCCTTACGACTCGAACCACAAACCTAAATGAGAGACTCATTTTTATGGGAAACAAAGTCAAAGTTTCAGTTGAAGCTGTGGAAACCTATCGTTTAACTTTAGATACTGGATATCATTTAGACCTTTTTGATACCTTTTATATTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGGTATTCTTTGTGATGACTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGCTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAAGAATGAAATTCTTCCATATTTGGATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGAAAAACAAACAAAACATACAGTTAATAAAGAAGCCACAAGAAGCTCACAGCTCCTTGAAATTATACACACTGATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTCGCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGACAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATAGAAACTATGACGAGAATGGACAATGCCCCGGTCCATTAGCTAAATTCCTAGAAAGCCATGACATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTGTCCTTGTGGATGTATGCATTAAGAACCGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAAGACACCTTTTGAACTGTGGACAGGAAGGAAATCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGTGGAAGTAAGAATTTATAATAGAAACTTGATTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCAGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACTACAGTACGGGAATAG

mRNA sequence

ATGGGTCATATAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGACAAGGGTAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGAACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATTTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATATGATGTTGGGATTCCTTACGACTCGAACCACAAACCTAAATGAGAGACTCATTTTTATGGGAAACAAAGTCAAAGTTTCAGTTGAAGCTGTGGAAACCTATCGTTTAACTTTAGATACTGGATATCATTTAGACCTTTTTGATACCTTTTATATTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGGTATTCTTTGTGATGACTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGCTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAAGAATGAAATTCTTCCATATTTGGATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGAAAAACAAACAAAACATACAGTTAATAAAGAAGCCACAAGAAGCTCACAGCTCCTTGAAATTATACACACTGATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTCGCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGACAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATAGAAACTATGACGAGAATGGACAATGCCCCGGTCCATTAGCTAAATTCCTAGAAAGCCATGACATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTGTCCTTGTGGATGTATGCATTAAGAACCGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAAGACACCTTTTGAACTGTGGACAGGAAGGAAATCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGTGGAAGTAAGAATTTATAATAGAAACTTGATTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCAGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACTACAGTACGGGAATAG

Coding sequence (CDS)

ATGGGTCATATAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGACAAGGGTAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGAACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATTTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATATGATGTTGGGATTCCTTACGACTCGAACCACAAACCTAAATGAGAGACTCATTTTTATGGGAAACAAAGTCAAAGTTTCAGTTGAAGCTGTGGAAACCTATCGTTTAACTTTAGATACTGGATATCATTTAGACCTTTTTGATACCTTTTATATTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGGTATTCTTTGTGATGACTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGCTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAAGAATGAAATTCTTCCATATTTGGATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGAAAAACAAACAAAACATACAGTTAATAAAGAAGCCACAAGAAGCTCACAGCTCCTTGAAATTATACACACTGATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTCGCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGACAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATAGAAACTATGACGAGAATGGACAATGCCCCGGTCCATTAGCTAAATTCCTAGAAAGCCATGACATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTGTCCTTGTGGATGTATGCATTAAGAACCGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAAGACACCTTTTGAACTGTGGACAGGAAGGAAATCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGTGGAAGTAAGAATTTATAATAGAAACTTGATTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCAGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACTACAGTACGGGAATAG

Protein sequence

MGHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWFENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKVKVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERIKRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYRNYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYNRNLIQEQPVVSSLVIQKNQKGIDFIVLTTVRE
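
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 nucleotides of the CDS listed above.
print(translate("ATGGGTCATATAGGAGCTGGAAAGAAACCTGGA"))  # MGHIGAGKKPG
```

The output matches the first eleven residues of the protein sequence; passing the full CDS string to `translate` gives the complete comparison.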
Homology
BLAST of Cmc11g0307031 vs. NCBI nr
Match: RZC12927.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja] >RZC12928.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform B [Glycine soja])

HSP 1 Score: 727.6 bits (1877), Expect = 7.2e-206
Identity = 347/472 (73.52%), Positives = 397/472 (84.11%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGK   KK+DKG  G LK+K     I KK    + C FC K GH+QKDC KRK+WF
Sbjct: 289 GNQGAGKNFVKKHDKGK-GPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 348

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KG+ NALV+FESNLTEVP+NTWWIDSGCT HVSN M GFLT +T + NE+ +FMGN+V
Sbjct: 349 EKKGELNALVYFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 408

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           K  VEAV TYRL LDTG+HLDL +T Y+PS+SRNL+SLSKLD +GY F FGN CFSLFK 
Sbjct: 409 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 468

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+G+LCD LYKLKLD ++ E++LTLHHN GTKR   NE SA+LWHKRLGHIS+ERI
Sbjct: 469 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 528

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 529 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 588

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G E+YFITFIDD++RYGY+YLLHEKSQA++AL++++NEVERQLDRKVKI+RSDRGGEYY 
Sbjct: 589 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYG 648

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            YDE GQ PGP AK L+   ICAQYTMPGTPQQNGV+ERRNRTLM+MVRSMLINS+LPVS
Sbjct: 649 RYDETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSERRNRTLMDMVRSMLINSTLPVS 708

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLN VPSK+VPKTPFELWT R  S+RHLHVWGCQ E+RIYN
Sbjct: 709 LWMYALKTAMYLLNMVPSKAVPKTPFELWTNRTPSMRHLHVWGCQAEIRIYN 758
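
The percentages in an HSP summary line follow directly from the counts it reports: identical (or positive-scoring) positions divided by the alignment length. A minimal sketch of that arithmetic, using a hypothetical helper `hsp_percentages` and a pattern that tolerates spelling variants of the "Positives" label:

```python
import re

# Captures identical/length and positive/length counts from a summary line.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 347/472 (73.52%), Positives = 397/472 (84.11%), Query Frame = 0"
print(hsp_percentages(line))  # (73.52, 84.11)
```

Recomputing the values this way is a quick consistency check when summary lines are copied between reports.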

BLAST of Cmc11g0307031 vs. NCBI nr
Match: RZC25410.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 727.6 bits (1877), Expect = 7.2e-206
Identity = 348/472 (73.73%), Positives = 397/472 (84.11%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGKK  KK+DKG  G LK+K     I KK    + C FC K GH+QKDC KRK+WF
Sbjct: 278 GNQGAGKKFVKKHDKGK-GPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 337

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KG+ NALV FESNLTEVP+NTWWIDSGCT HVSN M GFLT +T + NE+ +FMGN+V
Sbjct: 338 EKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 397

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           K  VEAV TYRL LDTG+HLDL +T Y+PS+SRNL+SLSKLD +GY F FGN CFSLFK 
Sbjct: 398 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 457

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+G+LCD LYKLKLD ++ E++LTLHHN GTKR   NE SA+LWHKRLGHIS ERI
Sbjct: 458 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISGERI 517

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 518 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 577

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G E+YFITFIDD++RYGY+YLLHEKSQA++AL++++NEVERQLDRKVKI+RSDRGGEYYR
Sbjct: 578 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYR 637

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            YDE GQ P P AK L+   ICAQYTMPGTPQQNGV+ERRN+TLM+MVRSMLINS+LPVS
Sbjct: 638 RYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSERRNKTLMDMVRSMLINSTLPVS 697

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLNRVPSK+VPKTPFELWT R  S+RHLHVWGCQ E+RIYN
Sbjct: 698 LWMYALKTAMYLLNRVPSKAVPKTPFELWTNRTPSMRHLHVWGCQAEIRIYN 747

BLAST of Cmc11g0307031 vs. NCBI nr
Match: KYP36562.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 719.2 bits (1855), Expect = 2.6e-203
Identity = 345/472 (73.09%), Positives = 396/472 (83.90%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGKK  KK+DKG    LK+ ++S PI KK    + C FC K GH+QKDC KRKAWF
Sbjct: 141 GNQGAGKKFVKKHDKGKK-PLKINEASVPIQKKASKGNNCHFCGKSGHFQKDCPKRKAWF 200

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KGK NA V FESNLTEVP+NTWWIDSGCT HVSN M GF T +T + NE+ +FMGN+V
Sbjct: 201 EKKGKLNAYVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFTTIQTISPNEKFVFMGNRV 260

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           KV VEAV TYRL L+TG+HLDL +T Y+PS+SRNL+SLSKLD  GY F FGN CFSLFK+
Sbjct: 261 KVPVEAVGTYRLILNTGHHLDLLETLYVPSLSRNLVSLSKLDAIGYSFTFGNGCFSLFKR 320

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+GILCD LYKL LD ++ E+LLTLHHN GTKR   NE SA+LWH+RLGHIS+ER+
Sbjct: 321 NHLIGTGILCDGLYKLNLDGLYDETLLTLHHNIGTKRSLVNERSAFLWHRRLGHISRERM 380

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 381 ERLIKNEILPNLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVNSF 440

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G EKYFITFIDD++RYGY+YLLHEKSQA+DAL++++NEVERQLD+KVK++RSDRGGEYY 
Sbjct: 441 GKEKYFITFIDDYSRYGYVYLLHEKSQAVDALEIYLNEVERQLDKKVKVVRSDRGGEYYG 500

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            Y+E GQ PGP AK L+   ICAQYTMPGTPQQNGV+ERRNRTLM+MVRSML NS+LP+ 
Sbjct: 501 RYNETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSERRNRTLMDMVRSMLSNSTLPIY 560

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLNRVPSK+V KTPFELWTGR  SLRHLHVWGCQ E+RIYN
Sbjct: 561 LWMYALKTAMYLLNRVPSKAVSKTPFELWTGRTPSLRHLHVWGCQAEIRIYN 610

BLAST of Cmc11g0307031 vs. NCBI nr
Match: RZC09906.1 (B2 protein isoform D [Glycine soja])

HSP 1 Score: 650.2 bits (1676), Expect = 1.5e-182
Identity = 320/472 (67.80%), Positives = 370/472 (78.39%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGKK  KK+DKG  G LK+K     I KK    + C FC K GH+QKDC KRK+WF
Sbjct: 230 GNQGAGKKFVKKHDKGK-GPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 289

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KG+ NA                              LGFLT +T + N++ +FMGN+V
Sbjct: 290 EKKGELNA------------------------------LGFLTIQTISPNKKFVFMGNRV 349

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           K  VEAV TYRL LDTG+HLDL +T Y+PS+SRNL+SLSKLD +GY F FGN CFSLFK 
Sbjct: 350 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 409

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+G+LCD LYKLKLD ++ E++LTLHHN GTKR   NE SA+LWHKRLGHIS+ERI
Sbjct: 410 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 469

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 470 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 529

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G E+YFITFIDD++RYGY+YLLHEKSQA++AL++++NEVERQLDRKVKI+RSDR GEYYR
Sbjct: 530 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRDGEYYR 589

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            YDE GQ  GP AK L+   ICAQYTMPGT QQNGV+ERRNRTLM+MVRSMLINS+LPVS
Sbjct: 590 RYDETGQHSGPFAKLLQKRGICAQYTMPGTLQQNGVSERRNRTLMDMVRSMLINSTLPVS 649

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLNRVPSK++PKTPFELWT R  S+RHLHVWGCQ E+RIYN
Sbjct: 650 LWMYALKTAMYLLNRVPSKAIPKTPFELWTNRTPSMRHLHVWGCQAEIRIYN 669

BLAST of Cmc11g0307031 vs. NCBI nr
Match: RVX08602.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 622.5 bits (1604), Expect = 3.3e-174
Identity = 302/469 (64.39%), Positives = 360/469 (76.76%), Query Frame = 0

Query: 5   GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWFENK 64
           G  KK GK     N    K            +    C FC K GH +KDC+KRKAWFE +
Sbjct: 200 GVTKKKGKFKKGKNFPPKKSGPGEGSQSHDGKFTVSCYFCGKKGHVKKDCIKRKAWFEKR 259

Query: 65  GKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKVKVS 124
           G + + V +ESNL EVP NTWWIDSG T HV+N+M GFLTTR    +E+ ++MGN++KV 
Sbjct: 260 GINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGFLTTRKPKESEKFLYMGNRLKVE 319

Query: 125 VEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIF 184
           V AV TYRL L+TG+ +DL +TFY+PSISRNL+SLSKLD +GY   F +   SL    + 
Sbjct: 320 VVAVGTYRLLLETGHRMDLLNTFYVPSISRNLVSLSKLDATGYSVLFNSGQLSLMLNYVT 379

Query: 185 IGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERIKRL 244
           +GSGILCD LYK+ L++ FA++L+TLH N G+KRG  NE+S+ LWH+RLGHIS+ERI+RL
Sbjct: 380 VGSGILCDGLYKISLNHEFAQALITLHSNVGSKRGLINENSSILWHRRLGHISRERIERL 439

Query: 245 IKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSFGGE 304
           +K  IL  LDFTD  +CVDCIK KQTKHT  K ATRS++LLEIIHTDICGP  VP F GE
Sbjct: 440 VKEGILQNLDFTDFHVCVDCIKGKQTKHT-KKGATRSNELLEIIHTDICGPLSVPCFTGE 499

Query: 305 KYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYRNYD 364
           KYFITFIDD +RYGY+YL+HEKSQAID  ++FI EVERQLD+K+KI+RSDRGGEYY  YD
Sbjct: 500 KYFITFIDDLSRYGYVYLMHEKSQAIDIFEMFITEVERQLDKKIKIVRSDRGGEYYGRYD 559

Query: 365 ENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWM 424
           E+GQ PGP AKFLE H I AQYTMPGTPQQNGVAERRNRTLM MVRSM+  SS+P+SLW 
Sbjct: 560 ESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRNRTLMEMVRSMMSYSSVPISLWG 619

Query: 425 YALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
            AL+TA Y+LNRVPSK+VPKTPFELWTGRK SLRH+H+WGC  E RIYN
Sbjct: 620 EALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHIWGCPAEARIYN 667

BLAST of Cmc11g0307031 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 2.5e-60
Identity = 158/487 (32.44%), Positives = 245/487 (50.31%), Query Frame = 0

Query: 4   IGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLK-RKAWFE 63
           I  G+    +    N+G+   +  S     K ++++ C  CN+PGH+++DC   RK   E
Sbjct: 197 ITEGRGRSYQRSSNNYGRSGARGKSKN-RSKSRVRN-CYNCNQPGHFKRDCPNPRKGKGE 256

Query: 64  NKGKHN-----ALVFFESNLT------------EVPYNTWWIDSGCTIHVSNMMLGFLTT 123
             G+ N     A+V    N+               P + W +D+  + H + +    L  
Sbjct: 257 TSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVR--DLFC 316

Query: 124 RTTNLNERLIFMGNKVKVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTS 183
           R    +   + MGN     +  +    +  + G  L L D  ++P +  NLIS   LD  
Sbjct: 317 RYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRD 376

Query: 184 GYYFKFGNECFSLFKQNIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTN--- 243
           GY   F N+ + L K ++ I  G+    LY+                NA   +G+ N   
Sbjct: 377 GYESYFANQKWRLTKGSLVIAKGVARGTLYRT---------------NAEICQGELNAAQ 436

Query: 244 -ESSAYLWHKRLGHISKERIKRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRS 303
            E S  LWHKR+GH+S++ ++ L K  ++ Y   T +  C  C+  KQ + +    + R 
Sbjct: 437 DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERK 496

Query: 304 SQLLEIIHTDICGPFDVPSFGGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVE 363
             +L+++++D+CGP ++ S GG KYF+TFIDD +R  ++Y+L  K Q     + F   VE
Sbjct: 497 LNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVE 556

Query: 364 RQLDRKVKILRSDRGGEY-YRNYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAER 423
           R+  RK+K LRSD GGEY  R ++E          +  SH I  + T+PGTPQ NGVAER
Sbjct: 557 RETGRKLKRLRSDNGGEYTSREFEE----------YCSSHGIRHEKTVPGTPQHNGVAER 616

Query: 424 RNRTLMNMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVP-KTPFELWTGRKSSLRH 467
            NRT++  VRSML  + LP S W  A++TA YL+NR PS  +  + P  +WT ++ S  H
Sbjct: 617 MNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSH 654

BLAST of Cmc11g0307031 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 179.5 bits (454), Expect = 9.6e-44
Identity = 137/484 (28.31%), Positives = 216/484 (44.63%), Query Frame = 0

Query: 16  KGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWFENKGKHN------- 75
           K N  + +V +         + K KC  C + GH +KDC   K    NK K N       
Sbjct: 207 KNNLFKNRVTKPKKIFKGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTA 266

Query: 76  -----ALVFFESNLTEVPYNTWWI-DSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKVK 135
                A +  E N T V  N  ++ DSG + H+ N     L T +  +   L     K  
Sbjct: 267 TSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLIND--ESLYTDSVEVVPPLKIAVAKQG 326

Query: 136 VSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQN 195
             + A +   + L   + + L D  +    + NL+S+ +L  +G   +F     ++ K  
Sbjct: 327 EFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNG 386

Query: 196 IFI--GSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKER 255
           + +   SG+    L  + + N  A S+   H N           +  LWH+R GHIS  +
Sbjct: 387 LMVVKNSGM----LNNVPVINFQAYSINAKHKN-----------NFRLWHERFGHISDGK 446

Query: 256 IKRLIKNEIL---PYLDFTDLG--ICVDCIKEKQTKHTVN--KEATRSSQLLEIIHTDIC 315
           +  + +  +      L+  +L   IC  C+  KQ +      K+ T   + L ++H+D+C
Sbjct: 447 LLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVC 506

Query: 316 GPFDVPSFGGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRS 375
           GP    +   + YF+ F+D F  Y   YL+  KS      + F+ + E   + KV  L  
Sbjct: 507 GPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYI 566

Query: 376 DRGGEYYRNYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSML 435
           D G EY  N          + +F     I    T+P TPQ NGV+ER  RT+    R+M+
Sbjct: 567 DNGREYLSN---------EMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMV 626

Query: 436 INSSLPVSLWMYALRTAQYLLNRVPSKSV---PKTPFELWTGRKSSLRHLHVWGCQVEVR 475
             + L  S W  A+ TA YL+NR+PS+++    KTP+E+W  +K  L+HL V+G  V V 
Sbjct: 627 SGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVH 664

BLAST of Cmc11g0307031 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.5e-38
Identity = 134/494 (27.13%), Positives = 234/494 (47.37%), Query Frame = 0

Query: 5   GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWFENK 64
           G  +     N++ N  Q     S +   + +    +C+ C+  GH  K C +   +    
Sbjct: 222 GDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTT 281

Query: 65  GKHNALVFF-----ESNL-TEVPY--NTWWIDSGCTIHVSNMM--LGFLTTRTTNLNERL 124
            +  +   F      +NL    PY  N W +DSG T H+++    L F    T   ++ +
Sbjct: 282 NQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGG-DDVM 341

Query: 125 IFMGNKVKVSVEAVETYRLTLDTGYH-LDLFDTFYIPSISRNLISLSKL-DTSGYYFKFG 184
           I  G+ + ++     T   +L T    LDL    Y+P+I +NLIS+ +L +T+    +F 
Sbjct: 342 IADGSTIPIT----HTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFF 401

Query: 185 NECFSLFKQN--IFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWH 244
              F +   N  + +  G   D+LY+  + +  A S+              ++++   WH
Sbjct: 402 PASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFA---------SPCSKATHSSWH 461

Query: 245 KRLGHISKERIKRLIKNEILPYLDFT-DLGICVDCIKEKQTKHTVNKEATRSSQLLEIIH 304
            RLGH S   +  +I N  LP L+ +  L  C DC   K  K   +     SS+ LE I+
Sbjct: 462 SRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIY 521

Query: 305 TDICGPFDVPSFGGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVK 364
           +D+     + S    +Y++ F+D F RY ++Y L +KSQ  D   +F + VE +   ++ 
Sbjct: 522 SDVWSS-PILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIG 581

Query: 365 ILRSDRGGEYYRNYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMV 424
            L SD GGE+             L  +L  H I    + P TP+ NG++ER++R ++ M 
Sbjct: 582 TLYSDNGGEFV-----------VLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMG 641

Query: 425 RSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVP-KTPFELWTGRKSSLRHLHVWGCQVE 481
            ++L ++S+P + W YA   A YL+NR+P+  +  ++PF+   G+  +   L V+GC   
Sbjct: 642 LTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACY 689

BLAST of Cmc11g0307031 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 1.8e-34
Identity = 126/496 (25.40%), Positives = 234/496 (47.18%), Query Frame = 0

Query: 7   GKKPGKKNDKGNHGQLKV-KQSSAPIH----KKEQIKDKCRFCNKPGHYQKDCLKRKAWF 66
           G +  + +++ N+   K  +QSS   H    + +    KC+ C   GH  K C + + + 
Sbjct: 240 GNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFL 299

Query: 67  ENKGKHNALVFF-----ESNLT-EVPY--NTWWIDSGCTIHVSNMMLGF-LTTRTTNLNE 126
            +         F      +NL    PY  N W +DSG T H+++      L    T  ++
Sbjct: 300 SSVNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDD 359

Query: 127 RLIFMGNKVKVSVEAVETYRLTLDT-GYHLDLFDTFYIPSISRNLISLSKL-DTSGYYFK 186
            ++  G+ + +S     T   +L T    L+L +  Y+P+I +NLIS+ +L + +G   +
Sbjct: 360 VMVADGSTIPIS----HTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVE 419

Query: 187 FGNECFSLFKQN--IFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYL 246
           F    F +   N  + +  G   D+LY+  + +    SL             +++++   
Sbjct: 420 FFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFA---------SPSSKATHSS 479

Query: 247 WHKRLGHISKERIKRLIKNEILPYLDFTDLGI-CVDCIKEKQTKHTVNKEATRSSQLLEI 306
           WH RLGH +   +  +I N  L  L+ +   + C DC+  K  K   ++    S++ LE 
Sbjct: 480 WHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEY 539

Query: 307 IHTDICGPFDVPSFGGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRK 366
           I++D+     + S    +Y++ F+D F RY ++Y L +KSQ  +    F N +E +   +
Sbjct: 540 IYSDVWSS-PILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTR 599

Query: 367 VKILRSDRGGEYYRNYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMN 426
           +    SD GGE+             L ++   H I    + P TP+ NG++ER++R ++ 
Sbjct: 600 IGTFYSDNGGEFV-----------ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVE 659

Query: 427 MVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVP-KTPFELWTGRKSSLRHLHVWGCQ 481
              ++L ++S+P + W YA   A YL+NR+P+  +  ++PF+   G   +   L V+GC 
Sbjct: 660 TGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCA 710

BLAST of Cmc11g0307031 vs. ExPASy Swiss-Prot
Match: Q12337 (Transposon Ty2-GR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-GR1 PE=5 SV=2)

HSP 1 Score: 112.1 bits (279), Expect = 1.9e-23
Identity = 93/361 (25.76%), Positives = 153/361 (42.38%), Query Frame = 0

Query: 120 KVKVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLF 179
           K  + + A+         G    +    + P+I+ +L+SLS+L        F        
Sbjct: 487 KQDIPINAIGNLHFNFQNGTKTSI-KALHTPNIAYDLLSLSELANQNITACFTRNTLER- 546

Query: 180 KQNIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKE 239
                +   +   D Y L    +    +  L  N   K    N+    L H+ LGH +  
Sbjct: 547 SDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFR 606

Query: 240 RIKRLIKNEILPYLDFTDLG-------ICVDCIKEKQTKHTVNK----EATRSSQLLEII 299
            I++ +K   + YL  +D+         C DC+  K TKH   K    +   S +  + +
Sbjct: 607 SIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYL 666

Query: 300 HTDICGPFDVPSFGGEKYFITFIDDFARYGYIYLLHEKSQ--AIDALKVFINEVERQLDR 359
           HTDI GP          YFI+F D+  R+ ++Y LH++ +   ++     +  ++ Q + 
Sbjct: 667 HTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNA 726

Query: 360 KVKILRSDRGGEYYRNYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLM 419
           +V +++ DRG EY             L KF  +  I A YT     + +GVAER NRTL+
Sbjct: 727 RVLVIQMDRGSEYTNK---------TLHKFFTNRGITACYTTTADSRAHGVAERLNRTLL 786

Query: 420 NMVRSMLINSSLPVSLWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQ 468
           N  R++L  S LP  LW  A+  +  + N + S   PK        RKS+ +H  + G  
Sbjct: 787 NDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVS---PKK-------RKSARQHAGLAGLD 826

BLAST of Cmc11g0307031 vs. ExPASy TrEMBL
Match: A0A445KPR8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine soja OX=3848 GN=D0Y65_012596 PE=4 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 3.5e-206
Identity = 347/472 (73.52%), Positives = 397/472 (84.11%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGK   KK+DKG  G LK+K     I KK    + C FC K GH+QKDC KRK+WF
Sbjct: 289 GNQGAGKNFVKKHDKGK-GPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 348

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KG+ NALV+FESNLTEVP+NTWWIDSGCT HVSN M GFLT +T + NE+ +FMGN+V
Sbjct: 349 EKKGELNALVYFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 408

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           K  VEAV TYRL LDTG+HLDL +T Y+PS+SRNL+SLSKLD +GY F FGN CFSLFK 
Sbjct: 409 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 468

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+G+LCD LYKLKLD ++ E++LTLHHN GTKR   NE SA+LWHKRLGHIS+ERI
Sbjct: 469 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 528

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 529 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 588

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G E+YFITFIDD++RYGY+YLLHEKSQA++AL++++NEVERQLDRKVKI+RSDRGGEYY 
Sbjct: 589 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYG 648

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            YDE GQ PGP AK L+   ICAQYTMPGTPQQNGV+ERRNRTLM+MVRSMLINS+LPVS
Sbjct: 649 RYDETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSERRNRTLMDMVRSMLINSTLPVS 708

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLN VPSK+VPKTPFELWT R  S+RHLHVWGCQ E+RIYN
Sbjct: 709 LWMYALKTAMYLLNMVPSKAVPKTPFELWTNRTPSMRHLHVWGCQAEIRIYN 758

BLAST of Cmc11g0307031 vs. ExPASy TrEMBL
Match: A0A445LQ30 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_004205 PE=4 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 3.5e-206
Identity = 348/472 (73.73%), Positives = 397/472 (84.11%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGKK  KK+DKG  G LK+K     I KK    + C FC K GH+QKDC KRK+WF
Sbjct: 278 GNQGAGKKFVKKHDKGK-GPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 337

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KG+ NALV FESNLTEVP+NTWWIDSGCT HVSN M GFLT +T + NE+ +FMGN+V
Sbjct: 338 EKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 397

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           K  VEAV TYRL LDTG+HLDL +T Y+PS+SRNL+SLSKLD +GY F FGN CFSLFK 
Sbjct: 398 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 457

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+G+LCD LYKLKLD ++ E++LTLHHN GTKR   NE SA+LWHKRLGHIS ERI
Sbjct: 458 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISGERI 517

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 518 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 577

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G E+YFITFIDD++RYGY+YLLHEKSQA++AL++++NEVERQLDRKVKI+RSDRGGEYYR
Sbjct: 578 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYR 637

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            YDE GQ P P AK L+   ICAQYTMPGTPQQNGV+ERRN+TLM+MVRSMLINS+LPVS
Sbjct: 638 RYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSERRNKTLMDMVRSMLINSTLPVS 697

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLNRVPSK+VPKTPFELWT R  S+RHLHVWGCQ E+RIYN
Sbjct: 698 LWMYALKTAMYLLNRVPSKAVPKTPFELWTNRTPSMRHLHVWGCQAEIRIYN 747

BLAST of Cmc11g0307031 vs. ExPASy TrEMBL
Match: A0A151R237 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_042301 PE=4 SV=1)

HSP 1 Score: 719.2 bits (1855), Expect = 1.2e-203
Identity = 345/472 (73.09%), Positives = 396/472 (83.90%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGKK  KK+DKG    LK+ ++S PI KK    + C FC K GH+QKDC KRKAWF
Sbjct: 141 GNQGAGKKFVKKHDKGKK-PLKINEASVPIQKKASKGNNCHFCGKSGHFQKDCPKRKAWF 200

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KGK NA V FESNLTEVP+NTWWIDSGCT HVSN M GF T +T + NE+ +FMGN+V
Sbjct: 201 EKKGKLNAYVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFTTIQTISPNEKFVFMGNRV 260

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           KV VEAV TYRL L+TG+HLDL +T Y+PS+SRNL+SLSKLD  GY F FGN CFSLFK+
Sbjct: 261 KVPVEAVGTYRLILNTGHHLDLLETLYVPSLSRNLVSLSKLDAIGYSFTFGNGCFSLFKR 320

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+GILCD LYKL LD ++ E+LLTLHHN GTKR   NE SA+LWH+RLGHIS+ER+
Sbjct: 321 NHLIGTGILCDGLYKLNLDGLYDETLLTLHHNIGTKRSLVNERSAFLWHRRLGHISRERM 380

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 381 ERLIKNEILPNLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVNSF 440

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G EKYFITFIDD++RYGY+YLLHEKSQA+DAL++++NEVERQLD+KVK++RSDRGGEYY 
Sbjct: 441 GKEKYFITFIDDYSRYGYVYLLHEKSQAVDALEIYLNEVERQLDKKVKVVRSDRGGEYYG 500

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            Y+E GQ PGP AK L+   ICAQYTMPGTPQQNGV+ERRNRTLM+MVRSML NS+LP+ 
Sbjct: 501 RYNETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSERRNRTLMDMVRSMLSNSTLPIY 560

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLNRVPSK+V KTPFELWTGR  SLRHLHVWGCQ E+RIYN
Sbjct: 561 LWMYALKTAMYLLNRVPSKAVSKTPFELWTGRTPSLRHLHVWGCQAEIRIYN 610

BLAST of Cmc11g0307031 vs. ExPASy TrEMBL
Match: A0A445KGB1 (B2 protein isoform D OS=Glycine soja OX=3848 GN=D0Y65_016300 PE=4 SV=1)

HSP 1 Score: 650.2 bits (1676), Expect = 7.1e-183
Identity = 320/472 (67.80%), Positives = 370/472 (78.39%), Query Frame = 0

Query: 2   GHIGAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWF 61
           G+ GAGKK  KK+DKG  G LK+K     I KK    + C FC K GH+QKDC KRK+WF
Sbjct: 230 GNQGAGKKFVKKHDKGK-GPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 289

Query: 62  ENKGKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKV 121
           E KG+ NA                              LGFLT +T + N++ +FMGN+V
Sbjct: 290 EKKGELNA------------------------------LGFLTIQTISPNKKFVFMGNRV 349

Query: 122 KVSVEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQ 181
           K  VEAV TYRL LDTG+HLDL +T Y+PS+SRNL+SLSKLD +GY F FGN CFSLFK 
Sbjct: 350 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 409

Query: 182 NIFIGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERI 241
           N  IG+G+LCD LYKLKLD ++ E++LTLHHN GTKR   NE SA+LWHKRLGHIS+ERI
Sbjct: 410 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 469

Query: 242 KRLIKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSF 301
           +RLIKNEILP LDFTDL ICVDCIK KQTKHT  K ATRS+QLLEI+HTDICGPFDV SF
Sbjct: 470 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 529

Query: 302 GGEKYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYR 361
           G E+YFITFIDD++RYGY+YLLHEKSQA++AL++++NEVERQLDRKVKI+RSDR GEYYR
Sbjct: 530 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRDGEYYR 589

Query: 362 NYDENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVS 421
            YDE GQ  GP AK L+   ICAQYTMPGT QQNGV+ERRNRTLM+MVRSMLINS+LPVS
Sbjct: 590 RYDETGQHSGPFAKLLQKRGICAQYTMPGTLQQNGVSERRNRTLMDMVRSMLINSTLPVS 649

Query: 422 LWMYALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
           LWMYAL+TA YLLNRVPSK++PKTPFELWT R  S+RHLHVWGCQ E+RIYN
Sbjct: 650 LWMYALKTAMYLLNRVPSKAIPKTPFELWTNRTPSMRHLHVWGCQAEIRIYN 669

BLAST of Cmc11g0307031 vs. ExPASy TrEMBL
Match: A0A438JI44 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2658 PE=4 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 1.6e-174
Identity = 302/469 (64.39%), Positives = 360/469 (76.76%), Query Frame = 0

Query: 5   GAGKKPGKKNDKGNHGQLKVKQSSAPIHKKEQIKDKCRFCNKPGHYQKDCLKRKAWFENK 64
           G  KK GK     N    K            +    C FC K GH +KDC+KRKAWFE +
Sbjct: 200 GVTKKKGKFKKGKNFPPKKSGPGEGSQSHDGKFTVSCYFCGKKGHVKKDCIKRKAWFEKR 259

Query: 65  GKHNALVFFESNLTEVPYNTWWIDSGCTIHVSNMMLGFLTTRTTNLNERLIFMGNKVKVS 124
           G + + V +ESNL EVP NTWWIDSG T HV+N+M GFLTTR    +E+ ++MGN++KV 
Sbjct: 260 GINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGFLTTRKPKESEKFLYMGNRLKVE 319

Query: 125 VEAVETYRLTLDTGYHLDLFDTFYIPSISRNLISLSKLDTSGYYFKFGNECFSLFKQNIF 184
           V AV TYRL L+TG+ +DL +TFY+PSISRNL+SLSKLD +GY   F +   SL    + 
Sbjct: 320 VVAVGTYRLLLETGHRMDLLNTFYVPSISRNLVSLSKLDATGYSVLFNSGQLSLMLNYVT 379

Query: 185 IGSGILCDDLYKLKLDNVFAESLLTLHHNAGTKRGQTNESSAYLWHKRLGHISKERIKRL 244
           +GSGILCD LYK+ L++ FA++L+TLH N G+KRG  NE+S+ LWH+RLGHIS+ERI+RL
Sbjct: 380 VGSGILCDGLYKISLNHEFAQALITLHSNVGSKRGLINENSSILWHRRLGHISRERIERL 439

Query: 245 IKNEILPYLDFTDLGICVDCIKEKQTKHTVNKEATRSSQLLEIIHTDICGPFDVPSFGGE 304
           +K  IL  LDFTD  +CVDCIK KQTKHT  K ATRS++LLEIIHTDICGP  VP F GE
Sbjct: 440 VKEGILQNLDFTDFHVCVDCIKGKQTKHT-KKGATRSNELLEIIHTDICGPLSVPCFTGE 499

Query: 305 KYFITFIDDFARYGYIYLLHEKSQAIDALKVFINEVERQLDRKVKILRSDRGGEYYRNYD 364
           KYFITFIDD +RYGY+YL+HEKSQAID  ++FI EVERQLD+K+KI+RSDRGGEYY  YD
Sbjct: 500 KYFITFIDDLSRYGYVYLMHEKSQAIDIFEMFITEVERQLDKKIKIVRSDRGGEYYGRYD 559

Query: 365 ENGQCPGPLAKFLESHDICAQYTMPGTPQQNGVAERRNRTLMNMVRSMLINSSLPVSLWM 424
           E+GQ PGP AKFLE H I AQYTMPGTPQQNGVAERRNRTLM MVRSM+  SS+P+SLW 
Sbjct: 560 ESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRNRTLMEMVRSMMSYSSVPISLWG 619

Query: 425 YALRTAQYLLNRVPSKSVPKTPFELWTGRKSSLRHLHVWGCQVEVRIYN 474
            AL+TA Y+LNRVPSK+VPKTPFELWTGRK SLRH+H+WGC  E RIYN
Sbjct: 620 EALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHIWGCPAEARIYN 667
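
The percentages in each HSP header follow from the counts printed beside them: the number of identical (or positively scoring) positions divided by the alignment length. A minimal Python sketch of that arithmetic, using the counts from the three HSPs above; the function name `percent` and the dictionary layout are illustrative assumptions, not part of the BLAST output:

```python
# Counts copied from the HSP headers above:
# accession -> (identities, positives, alignment length)
hsps = {
    "A0A151R237": (345, 396, 472),
    "A0A445KGB1": (320, 370, 472),
    "A0A438JI44": (302, 360, 469),
}


def percent(count: int, length: int) -> float:
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


for accession, (identities, positives, length) in hsps.items():
    # Print the recomputed identity and positives percentages per hit.
    print(accession, percent(identities, length), percent(positives, length))
```

The recomputed values can be compared against the percentages in the headers as a quick consistency check on the parsed counts.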

The following BLAST results are available for this feature:
Match Name    E-value     Identity    Description
RZC12927.1    7.2e-206    73.52       Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine s...
RZC25410.1    7.2e-206    73.73       Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]
KYP36562.1    2.6e-203    73.09       Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
RZC09906.1    1.5e-182    67.80       B2 protein isoform D [Glycine soja]
RVX08602.1    3.3e-174    64.39       Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]

Match Name    E-value     Identity    Description
P10978        2.5e-60     32.44       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146        9.6e-44     28.31       Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q9ZT94        3.5e-38     27.13       Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2        1.8e-34     25.40       Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q12337        1.9e-23     25.76       Transposon Ty2-GR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name    E-value     Identity    Description
A0A445KPR8    3.5e-206    73.52       Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine...
A0A445LQ30    3.5e-206    73.73       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3...
A0A151R237    1.2e-203    73.09       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A445KGB1    7.1e-183    67.80       B2 protein isoform D OS=Glycine soja OX=3848 GN=D0Y65_016300 PE=4 SV=1
A0A438JI44    1.6e-174    64.39       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...

Match Name    E-value     Identity    Description

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term     IPR Description                       Source         Source Term    Source Description                     Alignment
IPR001878    Zinc finger, CCHC-type                PFAM           PF00098        zf-CCHC                                coord: 40..54; e-value: 9.3E-4; score: 19.2
IPR001878    Zinc finger, CCHC-type                PROSITE        PS50158        ZF_CCHC                                coord: 40..54; score: 9.125269
IPR025724    GAG-pre-integrase domain              PFAM           PF13976        gag_pre-integrs                        coord: 194..269; e-value: 4.1E-15; score: 55.4
IPR036397    Ribonuclease H superfamily            GENE3D         3.30.420.10    -                                      coord: 275..469; e-value: 7.0E-40; score: 138.4
IPR001584    Integrase, catalytic core             PFAM           PF00665        rve                                    coord: 286..392; e-value: 6.9E-11; score: 42.4
IPR001584    Integrase, catalytic core             PROSITE        PS50994        INTEGRASE                              coord: 280..454; score: 20.908785
None         No IPR available                      MOBIDB_LITE    mobidb-lite    disorder_prediction                    coord: 1..33
None         No IPR available                      PANTHER        PTHR45895      FAMILY NOT NAMED                       coord: 84..471
IPR012337    Ribonuclease H-like superfamily       SUPERFAMILY    53098          Ribonuclease H-like                    coord: 280..448
IPR036875    Zinc finger, CCHC-type superfamily    SUPERFAMILY    57756          Retrovirus zinc finger-like domains    coord: 38..62
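
The alignment coordinates in the InterPro table give the order of the annotated regions along the protein. A minimal Python sketch that sorts the three Pfam hits by start position and reports the length of each; the tuple layout and the names `pfam_hits` and `span` are illustrative assumptions, and the length calculation assumes the coordinates are 1-based and inclusive:

```python
# Pfam hits copied from the InterPro table: (accession, name, start, end).
pfam_hits = [
    ("PF00665", "rve", 286, 392),
    ("PF00098", "zf-CCHC", 40, 54),
    ("PF13976", "gag_pre-integrs", 194, 269),
]


def span(start: int, end: int) -> int:
    """Length of a region, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Order the hits from N-terminus to C-terminus by their start coordinate.
ordered = sorted(pfam_hits, key=lambda hit: hit[2])

for accession, name, start, end in ordered:
    print(f"{start:>4}..{end:<4} {accession} {name} ({span(start, end)} residues)")
```

Sorted this way, the hits run zf-CCHC, then gag_pre-integrs, then rve from the N-terminus onward.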

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cmc11g0307031.1    Cmc11g0307031.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0015074        DNA integration
molecular_function    GO:0003676        nucleic acid binding
molecular_function    GO:0008270        zinc ion binding