Cmc05g0130471 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc05g0130471
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr05: 8938857 .. 8940059 (+)
RNA-Seq Expression: Cmc05g0130471
Synteny: Cmc05g0130471
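
The Location field can be turned into a feature length with a few lines of Python. This is only a sketch: the helper name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("CMiso1.1chr05: 8938857 .. 8940059 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
print(seqid, strand, length, length % 3)
```

Under that assumption the feature spans 1203 bases, a multiple of three.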
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTCATAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCGTGGACATTTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATAAAAAAAATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACGATTCATGTTTCCAATACGATGCATGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTTCAGTTGAAGCTGTGAGAACCTATTGTTTAACTTTAGATACTGGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGGATGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCCGGTATTCTTTGTGATGGCTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGATCACATATCCAAAGAAAGAATTAAAAGATTGATAAAGAATGAAATTCTTCCAGATTTGGATTTTACTGACCTTGGAATTAGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAGAAGCCACAAGAAGCTCACAACTCCTTGAAATTATACACACTGATATTTGTAGGTCTTTTGATGTTCGATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGAGGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGCTCCATTCGCTAAATTCCTAGAAAGCCATGGCAGATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAATGTGA

mRNA sequence

ATGGGTCATAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCGTGGACATTTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATAAAAAAAATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACGATTCATGTTTCCAATACGATGCATGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTTCAGTTGAAGCTGTGAGAACCTATTGTTTAACTTTAGATACTGGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGGATGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCCGGTATTCTTTGTGATGGCTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGATCACATATCCAAAGAAAGAATTAAAAGATTGATAAAGAATGAAATTCTTCCAGATTTGGATTTTACTGACCTTGGAATTAGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAGAAGCCACAAGAAGCTCACAACTCCTTGAAATTATACACACTGATATTTGTAGGTCTTTTGATGTTCGATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGAGGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGCTCCATTCGCTAAATTCCTAGAAAGCCATGGCAGATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAATGTGA

Coding sequence (CDS)

ATGGGTCATAAAGGAGCTGGAAAGAAACCTGGAAAAAAGAATGGCAAGGGCAATCGTGGACATTTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATAAAAAAAATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACGATTCATGTTTCCAATACGATGCATGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTTCAGTTGAAGCTGTGAGAACCTATTGTTTAACTTTAGATACTGGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGGATGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCCGGTATTCTTTGTGATGGCTTATATAAATTAAAGCTTGATAATGTTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGATCACATATCCAAAGAAAGAATTAAAAGATTGATAAAGAATGAAATTCTTCCAGATTTGGATTTTACTGACCTTGGAATTAGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAGAAGCCACAAGAAGCTCACAACTCCTTGAAATTATACACACTGATATTTGTAGGTCTTTTGATGTTCGATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGAGGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATATGACGAGAATGGACAATGCCCCGCTCCATTCGCTAAATTCCTAGAAAGCCATGGCAGATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAATGTGA

Protein sequence

MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAEM
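
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code; the function name translate is hypothetical, and only the first 30 and last 18 bases of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGGTCATAAAGGAGCTGGAAAGAAACCT"  # first 30 bases of the CDS
cds_end = "GGTGTTGCAGAAATGTGA"                # last 18 bases of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues and the last five residues of the protein sequence listed above, with the terminal TGA read as a stop codon.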
Homology
BLAST of Cmc05g0130471 vs. NCBI nr
Match: RZC25410.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 600.9 bits (1548), Expect = 8.0e-168
Identity = 289/398 (72.61%), Positives = 334/398 (83.92%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WF
Sbjct: 278 GNQGAGKKFVKKHDKG-KGPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 337

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTM GFLT +T +PNE+F+FMGNRV
Sbjct: 338 EKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 397

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           K  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK 
Sbjct: 398 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 457

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+G+LCDGLYKLKLD ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS ERI
Sbjct: 458 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISGERI 517

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 518 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 577

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYY 
Sbjct: 578 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYR 637

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +YDE GQ P+PFAK L+  G CAQYTMPGTPQQNGV+E
Sbjct: 638 RYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSE 673
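
The percentages in an HSP header follow directly from the match counts, and the counts themselves can be read off the middle line of each alignment row. The sketch below uses hypothetical helper names (percent, count_midline) and assumes the usual BLAST convention that a letter on the middle line marks an identical residue and "+" marks a conservative substitution.

```python
def percent(matches, aligned):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100 * matches / aligned, 2)


def count_midline(midline):
    """Return (identities, positives) for one BLAST midline string.

    Assumption: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps. Positives include
    identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


# Header values of the HSP above: 289 identities and 334 positive
# columns out of 398 aligned columns.
print(percent(289, 398), percent(334, 398))

# Middle line of the final alignment row (query residues 362 onward).
last_row = "+YDE GQ P+PFAK L+  G CAQYTMPGTPQQNGV+E"
print(count_midline(last_row), len(last_row))
```

Summing the per-row counts over all rows of an HSP should reproduce the header totals, provided the leading indentation of each middle line is stripped consistently.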

BLAST of Cmc05g0130471 vs. NCBI nr
Match: RZC12927.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja] >RZC12928.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform B [Glycine soja])

HSP 1 Score: 599.0 bits (1543), Expect = 3.1e-167
Identity = 288/398 (72.36%), Positives = 333/398 (83.67%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGK   KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WF
Sbjct: 289 GNQGAGKNFVKKHDKG-KGPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 348

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KG+ NALV FESNLTEVP+NTWWIDSGCT HVSNTM GFLT +T +PNE+F+FMGNRV
Sbjct: 349 EKKGELNALVYFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 408

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           K  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK 
Sbjct: 409 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 468

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+G+LCDGLYKLKLD ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI
Sbjct: 469 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 528

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 529 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 588

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYYG
Sbjct: 589 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYG 648

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +YDE GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Sbjct: 649 RYDETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 684

BLAST of Cmc05g0130471 vs. NCBI nr
Match: KYP36562.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 598.6 bits (1542), Expect = 4.0e-167
Identity = 286/398 (71.86%), Positives = 335/398 (84.17%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGKK  KK+ KG +  LK+ ++S PI KK    + C FC K GH++K+C KRKAWF
Sbjct: 141 GNQGAGKKFVKKHDKGKK-PLKINEASVPIQKKASKGNNCHFCGKSGHFQKDCPKRKAWF 200

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KGK NA VCFESNLTEVP+NTWWIDSGCT HVSNTM GF T +T +PNE+F+FMGNRV
Sbjct: 201 EKKGKLNAYVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFTTIQTISPNEKFVFMGNRV 260

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           KV VEAV TY L L+TGHHLDL +T YVPS+SRNL+SLSKLD  GY F FGNGCFSLFK+
Sbjct: 261 KVPVEAVGTYRLILNTGHHLDLLETLYVPSLSRNLVSLSKLDAIGYSFTFGNGCFSLFKR 320

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+GILCDGLYKL LD ++ E+LLTLHHN+GTKR   NE SA+LWH+RL HIS+ER+
Sbjct: 321 NHLIGTGILCDGLYKLNLDGLYDETLLTLHHNIGTKRSLVNERSAFLWHRRLGHISRERM 380

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILP+LDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 381 ERLIKNEILPNLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVNSF 440

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G EKYFITFI+D+SRYGY+YLLHEKSQA+DAL++++NEVERQLD+ VK++RSDRGGEYYG
Sbjct: 441 GKEKYFITFIDDYSRYGYVYLLHEKSQAVDALEIYLNEVERQLDKKVKVVRSDRGGEYYG 500

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +Y+E GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Sbjct: 501 RYNETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 536

BLAST of Cmc05g0130471 vs. NCBI nr
Match: RYE18822.1 (hypothetical protein EOP45_13565, partial [Sphingobacteriaceae bacterium])

HSP 1 Score: 582.0 bits (1499), Expect = 3.9e-162
Identity = 276/367 (75.20%), Positives = 315/367 (85.83%), Query Frame = 0

Query: 1   MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAW 60
           +GHKGA KKP  K GKG +G  K+ +SS  IHKK Q  D CRFC K GH++K+C KRK W
Sbjct: 108 VGHKGARKKPW-KTGKGKQGLSKLNESSVQIHKKEQSNDVCRFCKKNGHWQKDCPKRKTW 167

Query: 61  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNR 120
           FE KGK +A VCFESN  EVPYNTWW+DSGCT HVSNTM GFLTT+T + NE+FI MGNR
Sbjct: 168 FERKGKPSAFVCFESNFAEVPYNTWWLDSGCTTHVSNTMQGFLTTQTISQNEKFILMGNR 227

Query: 121 VKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFK 180
            KV VEA+ TY L LDTGHHLDLF TFYVPS+SRNL+S+SKLD +GY F FGNGCFSLFK
Sbjct: 228 AKVQVEAIGTYRLVLDTGHHLDLFQTFYVPSVSRNLVSISKLDKAGYSFNFGNGCFSLFK 287

Query: 181 QNIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKER 240
           QN+F+GSGILCDGLYKLKLD  FAE+LLT+HHNVGTKRG +NESSAYLWHKRL HISKER
Sbjct: 288 QNLFLGSGILCDGLYKLKLDTFFAETLLTVHHNVGTKRGLSNESSAYLWHKRLGHISKER 347

Query: 241 IKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRS 300
           I+RL+KN+ILP+LDFTDLG+ V+CIKGK T+ T+ K ATRSSQLLEIIHTDIC  FDV S
Sbjct: 348 IERLVKNDILPNLDFTDLGVCVECIKGKHTRQTLKKAATRSSQLLEIIHTDICGPFDVPS 407

Query: 301 FGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY 360
            GGE+YFITFI+DFSRYGY+YLLHEKSQ++D L+VF+NEVERQLDR VKI+RSDRGGEYY
Sbjct: 408 LGGERYFITFIDDFSRYGYVYLLHEKSQSVDTLEVFVNEVERQLDRKVKIVRSDRGGEYY 467

Query: 361 GKYDENG 368
           G+YDENG
Sbjct: 468 GRYDENG 473

BLAST of Cmc05g0130471 vs. NCBI nr
Match: RZC09906.1 (B2 protein isoform D [Glycine soja])

HSP 1 Score: 513.5 bits (1321), Expect = 1.7e-141
Identity = 258/398 (64.82%), Positives = 303/398 (76.13%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WF
Sbjct: 230 GNQGAGKKFVKKHDKG-KGPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 289

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KG+ NAL                              GFLT +T +PN++F+FMGNRV
Sbjct: 290 EKKGELNAL------------------------------GFLTIQTISPNKKFVFMGNRV 349

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           K  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK 
Sbjct: 350 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 409

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+G+LCDGLYKLKLD ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI
Sbjct: 410 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 469

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 470 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 529

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDR GEYY 
Sbjct: 530 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRDGEYYR 589

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +YDE GQ   PFAK L+  G CAQYTMPGT QQNGV+E
Sbjct: 590 RYDETGQHSGPFAKLLQKRGICAQYTMPGTLQQNGVSE 595

BLAST of Cmc05g0130471 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 4.3e-39
Identity = 119/414 (28.74%), Positives = 191/414 (46.14%), Query Frame = 0

Query: 4   KGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLK-RKAWFE 63
           +G G+   + +    R   + K  +     K ++++ C  CN+PGH+K++C   RK   E
Sbjct: 199 EGRGRSYQRSSNNYGRSGARGKSKN---RSKSRVRN-CYNCNQPGHFKRDCPNPRKGKGE 258

Query: 64  NKGKHN-----ALVCFESNLT------------EVPYNTWWIDSGCTIHVSNTMHGFLTT 123
             G+ N     A+V    N+               P + W +D+  + H +      L  
Sbjct: 259 TSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRD--LFC 318

Query: 124 RTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTS 183
           R    +   + MGN     +  +   C+  + G  L L D  +VP +  NLIS   LD  
Sbjct: 319 RYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRD 378

Query: 184 GYYFKFGNGCFSLFKQNIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESS 243
           GY   F N  + L K ++ I  G+    LY+   +              G      +E S
Sbjct: 379 GYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEIC-----------QGELNAAQDEIS 438

Query: 244 AYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLL 303
             LWHKR+ H+S++ ++ L K  ++     T +     C+ GKQ + +    + R   +L
Sbjct: 439 VDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNIL 498

Query: 304 EIIHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLD 363
           +++++D+C   ++ S GG KYF+TFI+D SR  ++Y+L  K Q     + F   VER+  
Sbjct: 499 DLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETG 558

Query: 364 RNVKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           R +K LRSD GGEY  +          F ++  SHG   + T+PGTPQ NGVAE
Sbjct: 559 RKLKRLRSDNGGEYTSR---------EFEEYCSSHGIRHEKTVPGTPQHNGVAE 586

BLAST of Cmc05g0130471 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 1.1e-21
Identity = 104/411 (25.30%), Positives = 179/411 (43.55%), Query Frame = 0

Query: 3   HKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWFE 62
           ++G  +     N + N        S +   +      +C+ C+  GH  K C +   +  
Sbjct: 220 NRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQS 279

Query: 63  NKGKHNALVCF-----ESNL-TEVPY--NTWWIDSGCTIHVSNTMHGFLTTRT-TNPNER 122
              +  +   F      +NL    PY  N W +DSG T H+++  +     +  T  ++ 
Sbjct: 280 TTNQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDV 339

Query: 123 FIFMGNRVKVSVEAVRTYCLTLDT-GHHLDLFDTFYVPSISRNLISLSKL-DTSGYYFKF 182
            I  G+ + ++     T   +L T    LDL    YVP+I +NLIS+ +L +T+    +F
Sbjct: 340 MIADGSTIPIT----HTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEF 399

Query: 183 GNGCFSLFKQN--IFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLW 242
               F +   N  + +  G   D LY+  + +  A S+              ++++   W
Sbjct: 400 FPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFA---------SPCSKATHSSW 459

Query: 243 HKRLDHISKERIKRLIKNEILPDLDFTDLGISV-DCIKGKQTKHTVNKEATRSSQLLEII 302
           H RL H S   +  +I N  LP L+ +   +S  DC   K  K   +     SS+ LE I
Sbjct: 460 HSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYI 519

Query: 303 HTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNV 362
           ++D+  S  + S    +Y++ F++ F+RY ++Y L +KSQ  D   +F + VE +    +
Sbjct: 520 YSDVWSS-PILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRI 579

Query: 363 KILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
             L SD GGE+    D           +L  HG     + P TP+ NG++E
Sbjct: 580 GTLYSDNGGEFVVLRD-----------YLSQHGISHFTSPPHTPEHNGLSE 605

BLAST of Cmc05g0130471 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 6.9e-21
Identity = 103/412 (25.00%), Positives = 183/412 (44.42%), Query Frame = 0

Query: 7   GKKPGKKNGKGNRGHLKV-KQSSAPIH-KKGQIK---DKCRFCNKPGHYKKNCLKRKAWF 66
           G +  + + + N  + K  +QSS   H    Q K    KC+ C   GH  K C + + + 
Sbjct: 240 GNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFL 299

Query: 67  ENKGKHNALVCF-----ESNLT-EVPY--NTWWIDSGCTIHVSNTMHGF-LTTRTTNPNE 126
            +         F      +NL    PY  N W +DSG T H+++  +   L    T  ++
Sbjct: 300 SSVNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDD 359

Query: 127 RFIFMGNRVKVSVEAVRTYCLTLDT-GHHLDLFDTFYVPSISRNLISLSKL-DTSGYYFK 186
             +  G+ + +S     T   +L T    L+L +  YVP+I +NLIS+ +L + +G   +
Sbjct: 360 VMVADGSTIPIS----HTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVE 419

Query: 187 FGNGCFSLFKQN--IFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYL 246
           F    F +   N  + +  G   D LY+  + +    SL             +++++   
Sbjct: 420 FFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFA---------SPSSKATHSS 479

Query: 247 WHKRLDHISKERIKRLIKNEILPDLDFTDLGISV-DCIKGKQTKHTVNKEATRSSQLLEI 306
           WH RL H +   +  +I N  L  L+ +   +S  DC+  K  K   ++    S++ LE 
Sbjct: 480 WHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEY 539

Query: 307 IHTDICRSFDVRSFGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRN 366
           I++D+  S  + S    +Y++ F++ F+RY ++Y L +KSQ  +    F N +E +    
Sbjct: 540 IYSDVWSS-PILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTR 599

Query: 367 VKILRSDRGGEYYGKYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +    SD GGE+   ++           +   HG     + P TP+ NG++E
Sbjct: 600 IGTFYSDNGGEFVALWE-----------YFSQHGISHLTSPPHTPEHNGLSE 626

BLAST of Cmc05g0130471 vs. ExPASy Swiss-Prot
Match: Q12501 (Transposon Ty2-OR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-OR2 PE=1 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 1.0e-11
Identity = 78/330 (23.64%), Positives = 138/330 (41.82%), Query Frame = 0

Query: 87  IDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDT 146
           IDSG +  +  + H +L   T N +E  I    +  + + A+         G    +   
Sbjct: 456 IDSGASQTLVRSAH-YLHHATPN-SEINIVDAQKQDIPINAIGNLHFNFQNGTKTSI-KA 515

Query: 147 FYVPSISRNLISLSKLDTSGYYFKFGNGCF---SLFKQNIFIGSGILCDG-LYKLKLDNV 206
            + P+I+ +L+SLS+L            CF   +L + +  + + I+  G  Y L    +
Sbjct: 516 LHTPNIAYDLLSLSELTNQNI-----TACFTRNTLERSDGTVLAPIVKHGDFYWLSKKYL 575

Query: 207 FAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGIS- 266
               +  L  N   K    N+    L H+ L H +   I++ +K   +  L  +D+  S 
Sbjct: 576 IPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSN 635

Query: 267 ------VDCIKGKQTKHTVNK----EATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFI 326
                  DC+ GK TKH   K    +   S +  + +HTDI             YFI+F 
Sbjct: 636 ASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFT 695

Query: 327 EDFSRYGYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQC 386
           ++ +R+ ++Y LH++ +   ++     +  ++ Q +  V +++ DRG EY  K       
Sbjct: 696 DEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNK------- 755

Query: 387 PAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
                KF  + G  A YT     + +GVAE
Sbjct: 756 --TLHKFFTNRGITACYTTTADSRAHGVAE 768

BLAST of Cmc05g0130471 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 1.3e-11
Identity = 78/330 (23.64%), Positives = 138/330 (41.82%), Query Frame = 0

Query: 87  IDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRVKVSVEAVRTYCLTLDTGHHLDLFDT 146
           IDSG +  +  + H +L   T N +E  I    +  + + A+         G    +   
Sbjct: 456 IDSGASQTLVRSAH-YLHHATPN-SEINIVDAQKQDIPINAIGNLHFNFQNGTKTSI-KA 515

Query: 147 FYVPSISRNLISLSKLDTSGYYFKFGNGCF---SLFKQNIFIGSGILCDG-LYKLKLDNV 206
            + P+I+ +L+SLS+L            CF   +L + +  + + I+  G  Y L    +
Sbjct: 516 LHTPNIAYDLLSLSELANQNI-----TACFTRNTLERSDGTVLAPIVKHGDFYWLSKKYL 575

Query: 207 FAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERIKRLIKNEILPDLDFTDLGIS- 266
               +  L  N   K    N+    L H+ L H +   I++ +K   +  L  +D+  S 
Sbjct: 576 IPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSN 635

Query: 267 ------VDCIKGKQTKHTVNK----EATRSSQLLEIIHTDICRSFDVRSFGGEKYFITFI 326
                  DC+ GK TKH   K    +   S +  + +HTDI             YFI+F 
Sbjct: 636 ASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFT 695

Query: 327 EDFSRYGYIYLLHEKSQ--AIDALKVFINEVERQLDRNVKILRSDRGGEYYGKYDENGQC 386
           ++ +R+ ++Y LH++ +   ++     +  ++ Q +  V +++ DRG EY  K       
Sbjct: 696 DEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNK------- 755

Query: 387 PAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
                KF  + G  A YT     + +GVAE
Sbjct: 756 --TLHKFFTNRGITACYTTTADSRAHGVAE 768

BLAST of Cmc05g0130471 vs. ExPASy TrEMBL
Match: A0A445LQ30 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_004205 PE=4 SV=1)

HSP 1 Score: 600.9 bits (1548), Expect = 3.9e-168
Identity = 289/398 (72.61%), Positives = 334/398 (83.92%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WF
Sbjct: 278 GNQGAGKKFVKKHDKG-KGPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 337

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTM GFLT +T +PNE+F+FMGNRV
Sbjct: 338 EKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 397

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           K  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK 
Sbjct: 398 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 457

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+G+LCDGLYKLKLD ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS ERI
Sbjct: 458 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISGERI 517

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 518 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 577

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYY 
Sbjct: 578 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYR 637

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +YDE GQ P+PFAK L+  G CAQYTMPGTPQQNGV+E
Sbjct: 638 RYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSE 673

BLAST of Cmc05g0130471 vs. ExPASy TrEMBL
Match: A0A445KPR8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine soja OX=3848 GN=D0Y65_012596 PE=4 SV=1)

HSP 1 Score: 599.0 bits (1543), Expect = 1.5e-167
Identity = 288/398 (72.36%), Positives = 333/398 (83.67%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGK   KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WF
Sbjct: 289 GNQGAGKNFVKKHDKG-KGPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 348

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KG+ NALV FESNLTEVP+NTWWIDSGCT HVSNTM GFLT +T +PNE+F+FMGNRV
Sbjct: 349 EKKGELNALVYFESNLTEVPHNTWWIDSGCTTHVSNTMQGFLTIQTISPNEKFVFMGNRV 408

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           K  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK 
Sbjct: 409 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 468

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+G+LCDGLYKLKLD ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI
Sbjct: 469 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 528

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 529 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 588

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDRGGEYYG
Sbjct: 589 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRGGEYYG 648

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +YDE GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Sbjct: 649 RYDETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 684

BLAST of Cmc05g0130471 vs. ExPASy TrEMBL
Match: A0A151R237 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_042301 PE=4 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 1.9e-167
Identity = 286/398 (71.86%), Positives = 335/398 (84.17%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGKK  KK+ KG +  LK+ ++S PI KK    + C FC K GH++K+C KRKAWF
Sbjct: 141 GNQGAGKKFVKKHDKGKK-PLKINEASVPIQKKASKGNNCHFCGKSGHFQKDCPKRKAWF 200

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KGK NA VCFESNLTEVP+NTWWIDSGCT HVSNTM GF T +T +PNE+F+FMGNRV
Sbjct: 201 EKKGKLNAYVCFESNLTEVPHNTWWIDSGCTTHVSNTMQGFTTIQTISPNEKFVFMGNRV 260

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           KV VEAV TY L L+TGHHLDL +T YVPS+SRNL+SLSKLD  GY F FGNGCFSLFK+
Sbjct: 261 KVPVEAVGTYRLILNTGHHLDLLETLYVPSLSRNLVSLSKLDAIGYSFTFGNGCFSLFKR 320

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+GILCDGLYKL LD ++ E+LLTLHHN+GTKR   NE SA+LWH+RL HIS+ER+
Sbjct: 321 NHLIGTGILCDGLYKLNLDGLYDETLLTLHHNIGTKRSLVNERSAFLWHRRLGHISRERM 380

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILP+LDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 381 ERLIKNEILPNLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVNSF 440

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G EKYFITFI+D+SRYGY+YLLHEKSQA+DAL++++NEVERQLD+ VK++RSDRGGEYYG
Sbjct: 441 GKEKYFITFIDDYSRYGYVYLLHEKSQAVDALEIYLNEVERQLDKKVKVVRSDRGGEYYG 500

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +Y+E GQ P PFAK L+  G CAQYTMPGTPQQNGV+E
Sbjct: 501 RYNETGQHPGPFAKLLQKRGICAQYTMPGTPQQNGVSE 536

BLAST of Cmc05g0130471 vs. ExPASy TrEMBL
Match: A0A4Q3EHL3 (Uncharacterized protein (Fragment) OS=Sphingobacteriaceae bacterium OX=2021370 GN=EOP45_13565 PE=4 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 1.9e-162
Identity = 276/367 (75.20%), Positives = 315/367 (85.83%), Query Frame = 0

Query: 1   MGHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAW 60
           +GHKGA KKP  K GKG +G  K+ +SS  IHKK Q  D CRFC K GH++K+C KRK W
Sbjct: 108 VGHKGARKKPW-KTGKGKQGLSKLNESSVQIHKKEQSNDVCRFCKKNGHWQKDCPKRKTW 167

Query: 61  FENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNR 120
           FE KGK +A VCFESN  EVPYNTWW+DSGCT HVSNTM GFLTT+T + NE+FI MGNR
Sbjct: 168 FERKGKPSAFVCFESNFAEVPYNTWWLDSGCTTHVSNTMQGFLTTQTISQNEKFILMGNR 227

Query: 121 VKVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFK 180
            KV VEA+ TY L LDTGHHLDLF TFYVPS+SRNL+S+SKLD +GY F FGNGCFSLFK
Sbjct: 228 AKVQVEAIGTYRLVLDTGHHLDLFQTFYVPSVSRNLVSISKLDKAGYSFNFGNGCFSLFK 287

Query: 181 QNIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKER 240
           QN+F+GSGILCDGLYKLKLD  FAE+LLT+HHNVGTKRG +NESSAYLWHKRL HISKER
Sbjct: 288 QNLFLGSGILCDGLYKLKLDTFFAETLLTVHHNVGTKRGLSNESSAYLWHKRLGHISKER 347

Query: 241 IKRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRS 300
           I+RL+KN+ILP+LDFTDLG+ V+CIKGK T+ T+ K ATRSSQLLEIIHTDIC  FDV S
Sbjct: 348 IERLVKNDILPNLDFTDLGVCVECIKGKHTRQTLKKAATRSSQLLEIIHTDICGPFDVPS 407

Query: 301 FGGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYY 360
            GGE+YFITFI+DFSRYGY+YLLHEKSQ++D L+VF+NEVERQLDR VKI+RSDRGGEYY
Sbjct: 408 LGGERYFITFIDDFSRYGYVYLLHEKSQSVDTLEVFVNEVERQLDRKVKIVRSDRGGEYY 467

Query: 361 GKYDENG 368
           G+YDENG
Sbjct: 468 GRYDENG 473

BLAST of Cmc05g0130471 vs. ExPASy TrEMBL
Match: A0A445KGB1 (B2 protein isoform D OS=Glycine soja OX=3848 GN=D0Y65_016300 PE=4 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 8.2e-142
Identity = 258/398 (64.82%), Positives = 303/398 (76.13%), Query Frame = 0

Query: 2   GHKGAGKKPGKKNGKGNRGHLKVKQSSAPIHKKGQIKDKCRFCNKPGHYKKNCLKRKAWF 61
           G++GAGKK  KK+ KG +G LK+K     I KK    + C FC K GH++K+C KRK+WF
Sbjct: 230 GNQGAGKKFVKKHDKG-KGPLKIKDGPVQIQKKASKNNNCHFCGKSGHFQKDCPKRKSWF 289

Query: 62  ENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMHGFLTTRTTNPNERFIFMGNRV 121
           E KG+ NAL                              GFLT +T +PN++F+FMGNRV
Sbjct: 290 EKKGELNAL------------------------------GFLTIQTISPNKKFVFMGNRV 349

Query: 122 KVSVEAVRTYCLTLDTGHHLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNGCFSLFKQ 181
           K  VEAV TY L LDTGHHLDL +T YVPS+SRNL+SLSKLD +GY F FGNGCFSLFK 
Sbjct: 350 KAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLSKLDITGYSFNFGNGCFSLFKY 409

Query: 182 NIFIGSGILCDGLYKLKLDNVFAESLLTLHHNVGTKRGQTNESSAYLWHKRLDHISKERI 241
           N  IG+G+LCDGLYKLKLD ++ E++LTLHHNVGTKR   NE SA+LWHKRL HIS+ERI
Sbjct: 410 NHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSLVNERSAFLWHKRLGHISRERI 469

Query: 242 KRLIKNEILPDLDFTDLGISVDCIKGKQTKHTVNKEATRSSQLLEIIHTDICRSFDVRSF 301
           +RLIKNEILPDLDFTDL I VDCIKGKQTKHT  K ATRS+QLLEI+HTDIC  FDV SF
Sbjct: 470 ERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATRSTQLLEIVHTDICGPFDVSSF 529

Query: 302 GGEKYFITFIEDFSRYGYIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYG 361
           G E+YFITFI+D+SRYGY+YLLHEKSQA++AL++++NEVERQLDR VKI+RSDR GEYY 
Sbjct: 530 GRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVERQLDRKVKIIRSDRDGEYYR 589

Query: 362 KYDENGQCPAPFAKFLESHGRCAQYTMPGTPQQNGVAE 400
           +YDE GQ   PFAK L+  G CAQYTMPGT QQNGV+E
Sbjct: 590 RYDETGQHSGPFAKLLQKRGICAQYTMPGTLQQNGVSE 595

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
RZC25410.1 | 8.0e-168 | 72.61 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]
RZC12927.1 | 3.1e-167 | 72.36 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine s...
KYP36562.1 | 4.0e-167 | 71.86 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
RYE18822.1 | 3.9e-162 | 75.20 | hypothetical protein EOP45_13565, partial [Sphingobacteriaceae bacterium]
RZC09906.1 | 1.7e-141 | 64.82 | B2 protein isoform D [Glycine soja]

Match Name | E-value | Identity (%) | Description
P10978 | 4.3e-39 | 28.74 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q9ZT94 | 1.1e-21 | 25.30 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 6.9e-21 | 25.00 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q12501 | 1.0e-11 | 23.64 | Transposon Ty2-OR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
Q12491 | 1.3e-11 | 23.64 | Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

Match Name | E-value | Identity (%) | Description
A0A445LQ30 | 3.9e-168 | 72.61 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3...
A0A445KPR8 | 1.5e-167 | 72.36 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine...
A0A151R237 | 1.9e-167 | 71.86 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A4Q3EHL3 | 1.9e-162 | 75.20 | Uncharacterized protein (Fragment) OS=Sphingobacteriaceae bacterium OX=2021370 G...
A0A445KGB1 | 8.2e-142 | 64.82 | B2 protein isoform D OS=Glycine soja OX=3848 GN=D0Y65_016300 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 193..269; e-value: 3.9E-13; score: 49.0
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | | coord: 275..400; e-value: 8.9E-19; score: 69.5
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 286..392; e-value: 1.1E-9; score: 38.5
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 280..400; score: 14.650533
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..32
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..20
None | No IPR available | PANTHER | PTHR11439 | GAG-POL-RELATED RETROTRANSPOSON | coord: 269..360
None | No IPR available | PANTHER | PTHR11439:SF324 | RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED | coord: 269..360
IPR001878 | Zinc finger, CCHC-type | PROSITE | PS50158 | ZF_CCHC | coord: 40..54; score: 8.845061
IPR036875 | Zinc finger, CCHC-type superfamily | SUPERFAMILY | 57756 | Retrovirus zinc finger-like domains | coord: 37..62
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 280..399
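
Because several InterPro member databases report overlapping hits, it can help to collapse the alignment coordinates into non-overlapping annotated regions of the protein. The following is a minimal sketch, not part of the InterPro analysis itself: the names HITS and merge_intervals are hypothetical, and coordinates are assumed to be 1-based inclusive residue positions.

```python
# (source term, start, end) copied from the alignment coordinates above.
HITS = [
    ("PF13976", 193, 269),
    ("3.30.420.10", 275, 400),
    ("PF00665", 286, 392),
    ("PS50994", 280, 400),
    ("mobidb-lite", 1, 32),
    ("mobidb-lite", 1, 20),
    ("PTHR11439", 269, 360),
    ("PTHR11439:SF324", 269, 360),
    ("PS50158", 40, 54),
    ("57756", 37, 62),
    ("53098", 280, 399),
]


def merge_intervals(intervals):
    """Merge inclusive (start, end) intervals that share at least one position."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


regions = merge_intervals([(start, end) for _, start, end in HITS])
for start, end in regions:
    print(f"{start}..{end}")
```

With these inputs the hits collapse into three regions: the predicted disordered N-terminus, the CCHC zinc-finger region, and one C-terminal block covering the GAG-pre-integrase and integrase/ribonuclease H-like annotations.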

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cmc05g0130471.1 | Cmc05g0130471.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0071897 | DNA biosynthetic process
biological_process | GO:0015074 | DNA integration
biological_process | GO:0009231 | riboflavin biosynthetic process
cellular_component | GO:0016020 | membrane
molecular_function | GO:0008686 | 3,4-dihydroxy-2-butanone-4-phosphate synthase activity
molecular_function | GO:0003887 | DNA-directed DNA polymerase activity
molecular_function | GO:0004491 | methylmalonate-semialdehyde dehydrogenase (acylating) activity
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0008270 | zinc ion binding