Cmc01g0013861 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc01g0013861
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr01: 10658044 .. 10658853 (-)
RNA-Seq Expression: Cmc01g0013861
Synteny: Cmc01g0013861
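
The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3-style convention; the page itself does not state it). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CMiso1.1chr01: 10658044 .. 10658853 (-)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
print(seqid, strand, length)
```

Under that assumption the feature spans 810 bp on the minus strand, which corresponds to 270 codons (269 residues plus a stop codon).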
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGGGATTCCTTACGACCCAAACCACAAACCCAAATGAGAGATTCATTTTTATGAGAAACAAAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTTACTTTAGATACTGGACATTATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACGGTTTTATTGGTTCTGGTATTCTTTGTGATGACTTACATAAATTAAAGCTTGATAATGTTTTAACTAAGAGTTTGTTAACCTTGCATTATAATGTTGGCACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTATGGCATAAACGTTTGGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAATAATGAAATTCTTCCAGATTTGGATTTTACTAACCTTGGAATTTTTGTGAATTGTATTAAAGGAAAACAAACAAAATACACAGTTAATAAAGAAGCCACAAGAAGCTCACAACTTCTTGAAATTATACACACTGATATTTGTAGGCCTTTTGATGTTTCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTTTCACGTTATGGTTGTATCTATTTATTGCATGAAAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATAGAAAATATGATGAGAATGGATAA

mRNA sequence

ATGCAGGGATTCCTTACGACCCAAACCACAAACCCAAATGAGAGATTCATTTTTATGAGAAACAAAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTTACTTTAGATACTGGACATTATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACGGTTTTATTGGTTCTGGTATTCTTTGTGATGACTTACATAAATTAAAGCTTGATAATGTTTTAACTAAGAGTTTGTTAACCTTGCATTATAATGTTGGCACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTATGGCATAAACGTTTGGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAATAATGAAATTCTTCCAGATTTGGATTTTACTAACCTTGGAATTTTTGTGAATTGTATTAAAGGAAAACAAACAAAATACACAGTTAATAAAGAAGCCACAAGAAGCTCACAACTTCTTGAAATTATACACACTGATATTTGTAGGCCTTTTGATGTTTCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTTTCACGTTATGGTTGTATCTATTTATTGCATGAAAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATAGAAAATATGATGAGAATGGATAA

Coding sequence (CDS)

ATGCAGGGATTCCTTACGACCCAAACCACAAACCCAAATGAGAGATTCATTTTTATGAGAAACAAAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTTACTTTAGATACTGGACATTATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACGGTTTTATTGGTTCTGGTATTCTTTGTGATGACTTACATAAATTAAAGCTTGATAATGTTTTAACTAAGAGTTTGTTAACCTTGCATTATAATGTTGGCACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTATGGCATAAACGTTTGGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAATAATGAAATTCTTCCAGATTTGGATTTTACTAACCTTGGAATTTTTGTGAATTGTATTAAAGGAAAACAAACAAAATACACAGTTAATAAAGAAGCCACAAGAAGCTCACAACTTCTTGAAATTATACACACTGATATTTGTAGGCCTTTTGATGTTTCATCTTTTGGTGGAGAAAAGTATTTTATCACCTTTATTGATGATTTTTCACGTTATGGTTGTATCTATTTATTGCATGAAAAATCTCAAGCAATAGATGCCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAATGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATAGAAAATATGATGAGAATGGATAA

Protein sequence

MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKRGQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEATRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEYYRKYDENG
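
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and only translates the first and last few codons of the CDS shown above; `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)

start_of_cds = "ATGCAGGGATTCCTTACG"  # first six codons of the CDS above
end_of_cds = "GATGAGAATGGATAA"       # last five codons, including the stop
print(translate(start_of_cds))
print(translate(end_of_cds))
```

The two fragments translate to MQGFLT and DENG, matching the start and end of the protein sequence listed above.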
Homology
BLAST of Cmc01g0013861 vs. NCBI nr
Match: RYE18822.1 (hypothetical protein EOP45_13565, partial [Sphingobacteriaceae bacterium])

HSP 1 Score: 430.6 bits (1106), Expect = 9.7e-117
Identity = 203/269 (75.46%), Positives = 235/269 (87.36%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLTTQT + NE+FI M N+ KV VEA+GTYR  LDTGH+LDLF TFYVPS+SRNL+S
Sbjct: 205 MQGFLTTQTISQNEKFILMGNRAKVQVEAIGTYRLVLDTGHHLDLFQTFYVPSVSRNLVS 264

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           +SKLD +GY F FGN CFSLFKQN F+GSGILCD L+KLKLD    ++LLT+H+NVGTKR
Sbjct: 265 ISKLDKAGYSFNFGNGCFSLFKQNLFLGSGILCDGLYKLKLDTFFAETLLTVHHNVGTKR 324

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
           G +NESSAYLWHKRLGHISKERI+RL+ N+ILP+LDFT+LG+ V CIKGK T+ T+ K A
Sbjct: 325 GLSNESSAYLWHKRLGHISKERIERLVKNDILPNLDFTDLGVCVECIKGKHTRQTLKKAA 384

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRSSQLLEIIHTDIC PFDV S GGE+YFITFIDDFSRYG +YLLHEKSQ++D L+VF+N
Sbjct: 385 TRSSQLLEIIHTDICGPFDVPSLGGERYFITFIDDFSRYGYVYLLHEKSQSVDTLEVFVN 444

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLDR VKI+RSDRGGEYY +YDENG
Sbjct: 445 EVERQLDRKVKIVRSDRGGEYYGRYDENG 473
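
The percentages on the HSP statistics line are simply the match counts divided by the alignment length. A minimal sketch of parsing such a line and recomputing them follows; the regular expression and the helper name `parse_hsp_stats` are assumptions made for illustration and are not part of BLAST itself.

```python
import re

# Accepts "Positives" as well as the variant spelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return match counts and recomputed percentages for one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, length, positives, pos_length = map(int, m.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 203/269 (75.46%), Positives = 235/269 (87.36%), Query Frame = 0"
)
print(stats)
```

For the top hit this reproduces the reported 75.46% identity and 87.36% positives over 269 aligned positions.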

BLAST of Cmc01g0013861 vs. NCBI nr
Match: KYP65984.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 418.7 bits (1075), Expect = 3.8e-113
Identity = 199/269 (73.98%), Positives = 233/269 (86.62%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLTT+TT PNE+F+FM N+VKVPVEAVGTYR  LDTGH+LDLF+T YVPSISRNL+S
Sbjct: 1   MQGFLTTRTTKPNEKFVFMGNRVKVPVEAVGTYRLILDTGHHLDLFETLYVPSISRNLVS 60

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           LSKLD +GY  KFGN CFSL+K    IGSGILCD L+KL LDN+  ++LLTLH+N+GTKR
Sbjct: 61  LSKLDVNGYSIKFGNGCFSLYKHTHLIGSGILCDGLYKLNLDNLFAETLLTLHHNIGTKR 120

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
           G  NE  AYLWHKRLGH+SKER++RL+ NEILPDLDFT+L + V+CIKGKQTK+T  K A
Sbjct: 121 GLENERFAYLWHKRLGHVSKERLQRLVKNEILPDLDFTDLNVCVDCIKGKQTKHT-KKGA 180

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRS+QLLEIIHTDIC PFDV+SF  EKYFITFIDD+SRYG +YLLH+KSQAI+AL+++I 
Sbjct: 181 TRSTQLLEIIHTDICGPFDVNSFNKEKYFITFIDDYSRYGYVYLLHDKSQAINALEIYIE 240

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLD  VKI+RSDRGGEYY +YDE G
Sbjct: 241 EVERQLDSKVKIIRSDRGGEYYGRYDERG 268

BLAST of Cmc01g0013861 vs. NCBI nr
Match: RZC25410.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 418.7 bits (1075), Expect = 3.8e-113
Identity = 199/269 (73.98%), Positives = 234/269 (86.99%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLT QT +PNE+F+FM N+VK PVEAVGTYR  LDTGH+LDL +T YVPS+SRNL+S
Sbjct: 374 MQGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVS 433

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           LSKLD +GY F FGN CFSLFK N  IG+G+LCD L+KLKLD +  +++LTLH+NVGTKR
Sbjct: 434 LSKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKR 493

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
              NE SA+LWHKRLGHIS ERI+RLI NEILPDLDFT+L I V+CIKGKQTK+T  K A
Sbjct: 494 SLVNERSAFLWHKRLGHISGERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGA 553

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRS+QLLEI+HTDIC PFDVSSFG E+YFITFIDD+SRYG +YLLHEKSQA++AL++++N
Sbjct: 554 TRSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLN 613

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLDR VKI+RSDRGGEYYR+YDE G
Sbjct: 614 EVERQLDRKVKIIRSDRGGEYYRRYDETG 641

BLAST of Cmc01g0013861 vs. NCBI nr
Match: RZC12927.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine soja] >RZC12928.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform B [Glycine soja])

HSP 1 Score: 417.5 bits (1072), Expect = 8.5e-113
Identity = 198/269 (73.61%), Positives = 234/269 (86.99%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLT QT +PNE+F+FM N+VK PVEAVGTYR  LDTGH+LDL +T YVPS+SRNL+S
Sbjct: 385 MQGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVS 444

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           LSKLD +GY F FGN CFSLFK N  IG+G+LCD L+KLKLD +  +++LTLH+NVGTKR
Sbjct: 445 LSKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKR 504

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
              NE SA+LWHKRLGHIS+ERI+RLI NEILPDLDFT+L I V+CIKGKQTK+T  K A
Sbjct: 505 SLVNERSAFLWHKRLGHISRERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGA 564

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRS+QLLEI+HTDIC PFDVSSFG E+YFITFIDD+SRYG +YLLHEKSQA++AL++++N
Sbjct: 565 TRSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLN 624

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLDR VKI+RSDRGGEYY +YDE G
Sbjct: 625 EVERQLDRKVKIIRSDRGGEYYGRYDETG 652

BLAST of Cmc01g0013861 vs. NCBI nr
Match: RZC09906.1 (B2 protein isoform D [Glycine soja])

HSP 1 Score: 412.1 bits (1058), Expect = 3.6e-111
Identity = 195/267 (73.03%), Positives = 232/267 (86.89%), Query Frame = 0

Query: 3   GFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLISLS 62
           GFLT QT +PN++F+FM N+VK PVEAVGTYR  LDTGH+LDL +T YVPS+SRNL+SLS
Sbjct: 298 GFLTIQTISPNKKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLS 357

Query: 63  KLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKRGQ 122
           KLD +GY F FGN CFSLFK N  IG+G+LCD L+KLKLD +  +++LTLH+NVGTKR  
Sbjct: 358 KLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSL 417

Query: 123 TNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEATR 182
            NE SA+LWHKRLGHIS+ERI+RLI NEILPDLDFT+L I V+CIKGKQTK+T  K ATR
Sbjct: 418 VNERSAFLWHKRLGHISRERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATR 477

Query: 183 SSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFINEV 242
           S+QLLEI+HTDIC PFDVSSFG E+YFITFIDD+SRYG +YLLHEKSQA++AL++++NEV
Sbjct: 478 STQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEV 537

Query: 243 ERQLDRNVKILRSDRGGEYYRKYDENG 270
           ERQLDR VKI+RSDR GEYYR+YDE G
Sbjct: 538 ERQLDRKVKIIRSDRDGEYYRRYDETG 563

BLAST of Cmc01g0013861 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 130.6 bits (327), Expect = 2.7e-29
Identity = 79/250 (31.60%), Positives = 123/250 (49.20%), Query Frame = 0

Query: 19  MRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNECF 78
           M N     +  +G      + G  L L D  +VP +  NLIS   LD  GY   F N+ +
Sbjct: 324 MGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKW 383

Query: 79  SLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKRGQTNESSAYLWHKRLGHI 138
            L K +  I  G+    L++   +              G      +E S  LWHKR+GH+
Sbjct: 384 RLTKGSLVIAKGVARGTLYRTNAE-----------ICQGELNAAQDEISVDLWHKRMGHM 443

Query: 139 SKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEATRSSQLLEIIHTDICRPF 198
           S++ ++ L    ++     T +     C+ GKQ + +    + R   +L+++++D+C P 
Sbjct: 444 SEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPM 503

Query: 199 DVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRG 258
           ++ S GG KYF+TFIDD SR   +Y+L  K Q     + F   VER+  R +K LRSD G
Sbjct: 504 EIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNG 562

Query: 259 GEY-YRKYDE 268
           GEY  R+++E
Sbjct: 564 GEYTSREFEE 562

BLAST of Cmc01g0013861 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 1.7e-15
Identity = 64/223 (28.70%), Positives = 108/223 (48.43%), Query Frame = 0

Query: 43  LDLFDTFYVPSISRNLISLSKL-DTSGYYFKFGNECFSLFKQNGFIG--SGILCDDLHKL 102
           LDL    YVP+I +NLIS+ +L +T+    +F    F +   N  +    G   D+L++ 
Sbjct: 364 LDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEW 423

Query: 103 KLDNVLTKSLLTLHYNVGTKRGQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTN 162
            + +    S+              ++++   WH RLGH S   +  +I+N  LP L+ ++
Sbjct: 424 PIASSQAVSMFA---------SPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSH 483

Query: 163 -LGIFVNCIKGKQTKYTVNKEATRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSR 222
            L    +C   K  K   +     SS+ LE I++D+     + S    +Y++ F+D F+R
Sbjct: 484 KLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSS-PILSIDNYRYYVIFVDHFTR 543

Query: 223 YGCIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEY 262
           Y  +Y L +KSQ  D   +F + VE +    +  L SD GGE+
Sbjct: 544 YTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEF 576

BLAST of Cmc01g0013861 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 9.4e-14
Identity = 58/223 (26.01%), Positives = 109/223 (48.88%), Query Frame = 0

Query: 43  LDLFDTFYVPSISRNLISLSKL-DTSGYYFKFGNECFSLFKQNGFIG--SGILCDDLHKL 102
           L+L +  YVP+I +NLIS+ +L + +G   +F    F +   N  +    G   D+L++ 
Sbjct: 385 LNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEW 444

Query: 103 KLDNVLTKSLLTLHYNVGTKRGQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTN 162
            + +    SL             +++++   WH RLGH +   +  +I+N  L  L+ ++
Sbjct: 445 PIASSQPVSLFA---------SPSSKATHSSWHARLGHPAPSILNSVISNYSLSVLNPSH 504

Query: 163 LGIFV-NCIKGKQTKYTVNKEATRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSR 222
             +   +C+  K  K   ++    S++ LE I++D+     + S    +Y++ F+D F+R
Sbjct: 505 KFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSS-PILSHDNYRYYVIFVDHFTR 564

Query: 223 YGCIYLLHEKSQAIDALKVFINEVERQLDRNVKILRSDRGGEY 262
           Y  +Y L +KSQ  +    F N +E +    +    SD GGE+
Sbjct: 565 YTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEF 597

BLAST of Cmc01g0013861 vs. ExPASy Swiss-Prot
Match: Q12501 (Transposon Ty2-OR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-OR2 PE=1 SV=1)

HSP 1 Score: 76.3 bits (186), Expect = 6.1e-13
Identity = 60/257 (23.35%), Positives = 116/257 (45.14%), Query Frame = 0

Query: 22  KVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNECFSLF 81
           K  +P+ A+G   F    G    +    + P+I+ +L+SLS+L        F     +L 
Sbjct: 487 KQDIPINAIGNLHFNFQNGTKTSI-KALHTPNIAYDLLSLSELTNQNITACFTRN--TLE 546

Query: 82  KQNGFIGSGIL-CDDLHKLKLDNVLTKSLLTLHYNVGTKRGQTNESSAYLWHKRLGHISK 141
           + +G + + I+   D + L    ++   +  L  N   K    N+    L H+ LGH + 
Sbjct: 547 RSDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANF 606

Query: 142 ERIKRLINNEIL-----PDLDFTNLGIF--VNCIKGKQTKYTVNK----EATRSSQLLEI 201
             I++ +    +      D++++N   +   +C+ GK TK+   K    +   S +  + 
Sbjct: 607 RSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQY 666

Query: 202 IHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQ--AIDALKVFINEVERQLD 261
           +HTDI  P          YFI+F D+ +R+  +Y LH++ +   ++     +  ++ Q +
Sbjct: 667 LHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFN 726

Query: 262 RNVKILRSDRGGEYYRK 265
             V +++ DRG EY  K
Sbjct: 727 ARVLVIQMDRGSEYTNK 740

BLAST of Cmc01g0013861 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 7.9e-13
Identity = 60/257 (23.35%), Positives = 116/257 (45.14%), Query Frame = 0

Query: 22  KVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLISLSKLDTSGYYFKFGNECFSLF 81
           K  +P+ A+G   F    G    +    + P+I+ +L+SLS+L        F     +L 
Sbjct: 487 KQDIPINAIGNLHFNFQNGTKTSI-KALHTPNIAYDLLSLSELANQNITACFTRN--TLE 546

Query: 82  KQNGFIGSGIL-CDDLHKLKLDNVLTKSLLTLHYNVGTKRGQTNESSAYLWHKRLGHISK 141
           + +G + + I+   D + L    ++   +  L  N   K    N+    L H+ LGH + 
Sbjct: 547 RSDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANF 606

Query: 142 ERIKRLINNEIL-----PDLDFTNLGIF--VNCIKGKQTKYTVNK----EATRSSQLLEI 201
             I++ +    +      D++++N   +   +C+ GK TK+   K    +   S +  + 
Sbjct: 607 RSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQY 666

Query: 202 IHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQ--AIDALKVFINEVERQLD 261
           +HTDI  P          YFI+F D+ +R+  +Y LH++ +   ++     +  ++ Q +
Sbjct: 667 LHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFN 726

Query: 262 RNVKILRSDRGGEYYRK 265
             V +++ DRG EY  K
Sbjct: 727 ARVLVIQMDRGSEYTNK 740

BLAST of Cmc01g0013861 vs. ExPASy TrEMBL
Match: A0A4Q3EHL3 (Uncharacterized protein (Fragment) OS=Sphingobacteriaceae bacterium OX=2021370 GN=EOP45_13565 PE=4 SV=1)

HSP 1 Score: 430.6 bits (1106), Expect = 4.7e-117
Identity = 203/269 (75.46%), Positives = 235/269 (87.36%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLTTQT + NE+FI M N+ KV VEA+GTYR  LDTGH+LDLF TFYVPS+SRNL+S
Sbjct: 205 MQGFLTTQTISQNEKFILMGNRAKVQVEAIGTYRLVLDTGHHLDLFQTFYVPSVSRNLVS 264

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           +SKLD +GY F FGN CFSLFKQN F+GSGILCD L+KLKLD    ++LLT+H+NVGTKR
Sbjct: 265 ISKLDKAGYSFNFGNGCFSLFKQNLFLGSGILCDGLYKLKLDTFFAETLLTVHHNVGTKR 324

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
           G +NESSAYLWHKRLGHISKERI+RL+ N+ILP+LDFT+LG+ V CIKGK T+ T+ K A
Sbjct: 325 GLSNESSAYLWHKRLGHISKERIERLVKNDILPNLDFTDLGVCVECIKGKHTRQTLKKAA 384

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRSSQLLEIIHTDIC PFDV S GGE+YFITFIDDFSRYG +YLLHEKSQ++D L+VF+N
Sbjct: 385 TRSSQLLEIIHTDICGPFDVPSLGGERYFITFIDDFSRYGYVYLLHEKSQSVDTLEVFVN 444

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLDR VKI+RSDRGGEYY +YDENG
Sbjct: 445 EVERQLDRKVKIVRSDRGGEYYGRYDENG 473

BLAST of Cmc01g0013861 vs. ExPASy TrEMBL
Match: A0A151TG02 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_012262 PE=4 SV=1)

HSP 1 Score: 418.7 bits (1075), Expect = 1.8e-113
Identity = 199/269 (73.98%), Positives = 233/269 (86.62%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLTT+TT PNE+F+FM N+VKVPVEAVGTYR  LDTGH+LDLF+T YVPSISRNL+S
Sbjct: 1   MQGFLTTRTTKPNEKFVFMGNRVKVPVEAVGTYRLILDTGHHLDLFETLYVPSISRNLVS 60

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           LSKLD +GY  KFGN CFSL+K    IGSGILCD L+KL LDN+  ++LLTLH+N+GTKR
Sbjct: 61  LSKLDVNGYSIKFGNGCFSLYKHTHLIGSGILCDGLYKLNLDNLFAETLLTLHHNIGTKR 120

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
           G  NE  AYLWHKRLGH+SKER++RL+ NEILPDLDFT+L + V+CIKGKQTK+T  K A
Sbjct: 121 GLENERFAYLWHKRLGHVSKERLQRLVKNEILPDLDFTDLNVCVDCIKGKQTKHT-KKGA 180

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRS+QLLEIIHTDIC PFDV+SF  EKYFITFIDD+SRYG +YLLH+KSQAI+AL+++I 
Sbjct: 181 TRSTQLLEIIHTDICGPFDVNSFNKEKYFITFIDDYSRYGYVYLLHDKSQAINALEIYIE 240

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLD  VKI+RSDRGGEYY +YDE G
Sbjct: 241 EVERQLDSKVKIIRSDRGGEYYGRYDERG 268

BLAST of Cmc01g0013861 vs. ExPASy TrEMBL
Match: A0A445LQ30 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_004205 PE=4 SV=1)

HSP 1 Score: 418.7 bits (1075), Expect = 1.8e-113
Identity = 199/269 (73.98%), Positives = 234/269 (86.99%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLT QT +PNE+F+FM N+VK PVEAVGTYR  LDTGH+LDL +T YVPS+SRNL+S
Sbjct: 374 MQGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVS 433

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           LSKLD +GY F FGN CFSLFK N  IG+G+LCD L+KLKLD +  +++LTLH+NVGTKR
Sbjct: 434 LSKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKR 493

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
              NE SA+LWHKRLGHIS ERI+RLI NEILPDLDFT+L I V+CIKGKQTK+T  K A
Sbjct: 494 SLVNERSAFLWHKRLGHISGERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGA 553

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRS+QLLEI+HTDIC PFDVSSFG E+YFITFIDD+SRYG +YLLHEKSQA++AL++++N
Sbjct: 554 TRSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLN 613

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLDR VKI+RSDRGGEYYR+YDE G
Sbjct: 614 EVERQLDRKVKIIRSDRGGEYYRRYDETG 641

BLAST of Cmc01g0013861 vs. ExPASy TrEMBL
Match: A0A445KPR8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine soja OX=3848 GN=D0Y65_012596 PE=4 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 4.1e-113
Identity = 198/269 (73.61%), Positives = 234/269 (86.99%), Query Frame = 0

Query: 1   MQGFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLIS 60
           MQGFLT QT +PNE+F+FM N+VK PVEAVGTYR  LDTGH+LDL +T YVPS+SRNL+S
Sbjct: 385 MQGFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVS 444

Query: 61  LSKLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKR 120
           LSKLD +GY F FGN CFSLFK N  IG+G+LCD L+KLKLD +  +++LTLH+NVGTKR
Sbjct: 445 LSKLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKR 504

Query: 121 GQTNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEA 180
              NE SA+LWHKRLGHIS+ERI+RLI NEILPDLDFT+L I V+CIKGKQTK+T  K A
Sbjct: 505 SLVNERSAFLWHKRLGHISRERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGA 564

Query: 181 TRSSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFIN 240
           TRS+QLLEI+HTDIC PFDVSSFG E+YFITFIDD+SRYG +YLLHEKSQA++AL++++N
Sbjct: 565 TRSTQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLN 624

Query: 241 EVERQLDRNVKILRSDRGGEYYRKYDENG 270
           EVERQLDR VKI+RSDRGGEYY +YDE G
Sbjct: 625 EVERQLDRKVKIIRSDRGGEYYGRYDETG 652

BLAST of Cmc01g0013861 vs. ExPASy TrEMBL
Match: A0A445KGB1 (B2 protein isoform D OS=Glycine soja OX=3848 GN=D0Y65_016300 PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 1.7e-111
Identity = 195/267 (73.03%), Positives = 232/267 (86.89%), Query Frame = 0

Query: 3   GFLTTQTTNPNERFIFMRNKVKVPVEAVGTYRFTLDTGHYLDLFDTFYVPSISRNLISLS 62
           GFLT QT +PN++F+FM N+VK PVEAVGTYR  LDTGH+LDL +T YVPS+SRNL+SLS
Sbjct: 298 GFLTIQTISPNKKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLS 357

Query: 63  KLDTSGYYFKFGNECFSLFKQNGFIGSGILCDDLHKLKLDNVLTKSLLTLHYNVGTKRGQ 122
           KLD +GY F FGN CFSLFK N  IG+G+LCD L+KLKLD +  +++LTLH+NVGTKR  
Sbjct: 358 KLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSL 417

Query: 123 TNESSAYLWHKRLGHISKERIKRLINNEILPDLDFTNLGIFVNCIKGKQTKYTVNKEATR 182
            NE SA+LWHKRLGHIS+ERI+RLI NEILPDLDFT+L I V+CIKGKQTK+T  K ATR
Sbjct: 418 VNERSAFLWHKRLGHISRERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHT-KKGATR 477

Query: 183 SSQLLEIIHTDICRPFDVSSFGGEKYFITFIDDFSRYGCIYLLHEKSQAIDALKVFINEV 242
           S+QLLEI+HTDIC PFDVSSFG E+YFITFIDD+SRYG +YLLHEKSQA++AL++++NEV
Sbjct: 478 STQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEV 537

Query: 243 ERQLDRNVKILRSDRGGEYYRKYDENG 270
           ERQLDR VKI+RSDR GEYYR+YDE G
Sbjct: 538 ERQLDRKVKIIRSDRDGEYYRRYDETG 563

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
RYE18822.1 | 9.7e-117 | 75.46 | hypothetical protein EOP45_13565, partial [Sphingobacteriaceae bacterium]
KYP65984.1 | 3.8e-113 | 73.98 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
RZC25410.1 | 3.8e-113 | 73.98 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]
RZC12927.1 | 8.5e-113 | 73.61 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A [Glycine s...
RZC09906.1 | 3.6e-111 | 73.03 | B2 protein isoform D [Glycine soja]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 2.7e-29 | 31.60 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q9ZT94 | 1.7e-15 | 28.70 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 9.4e-14 | 26.01 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q12501 | 6.1e-13 | 23.35 | Transposon Ty2-OR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
Q12491 | 7.9e-13 | 23.35 | Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A4Q3EHL3 | 4.7e-117 | 75.46 | Uncharacterized protein (Fragment) OS=Sphingobacteriaceae bacterium OX=2021370 G...
A0A151TG02 | 1.8e-113 | 73.98 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A445LQ30 | 1.8e-113 | 73.98 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3...
A0A445KPR8 | 4.1e-113 | 73.61 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform A OS=Glycine...
A0A445KGB1 | 1.7e-111 | 73.03 | B2 protein isoform D OS=Glycine soja OX=3848 GN=D0Y65_016300 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 122..171; e-value: 4.2E-13; score: 48.9
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 188..262; e-value: 6.3E-9; score: 36.1
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 182..269; score: 9.210668
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 177..266; e-value: 3.0E-13; score: 51.5
None | No IPR available | PANTHER | PTHR11439:SF324 | RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED | coord: 124..261
None | No IPR available | PANTHER | PTHR11439 | GAG-POL-RELATED RETROTRANSPOSON | coord: 124..261
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 182..263
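
The domain coordinates above can be sanity-checked against the protein length. The following sketch assumes the coordinates are 1-based, inclusive residue positions and that the protein is 269 residues long (the 810-nt CDS minus its stop codon); the variable names are illustrative only.

```python
PROTEIN_LENGTH = 269  # assumption: 810-nt CDS = 269 codons + 1 stop codon

# (source term, start, end) copied from the InterPro annotation table.
DOMAIN_HITS = [
    ("PF13976", 122, 171),
    ("PF00665", 188, 262),
    ("PS50994", 182, 269),
    ("3.30.420.10", 177, 266),
    ("PTHR11439:SF324", 124, 261),
    ("PTHR11439", 124, 261),
    ("53098", 182, 263),
]

# Span of each hit in residues, assuming inclusive coordinates.
spans = {term: end - start + 1 for term, start, end in DOMAIN_HITS}

# Every hit should start at residue 1 or later and end within the protein.
all_in_range = all(
    1 <= start <= end <= PROTEIN_LENGTH for _, start, end in DOMAIN_HITS
)

for term, span in spans.items():
    print(f"{term}: {span} residues")
print("all hits within protein:", all_in_range)
```

Under these assumptions every hit falls inside the protein, with the annotated domains concentrated in its C-terminal half (residues 122 to 269).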

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cmc01g0013861.1 | Cmc01g0013861.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0071897 | DNA biosynthetic process
biological_process | GO:0015074 | DNA integration
biological_process | GO:0009231 | riboflavin biosynthetic process
cellular_component | GO:0016020 | membrane
molecular_function | GO:0008686 | 3,4-dihydroxy-2-butanone-4-phosphate synthase activity
molecular_function | GO:0003887 | DNA-directed DNA polymerase activity
molecular_function | GO:0004491 | methylmalonate-semialdehyde dehydrogenase (acylating) activity
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0008270 | zinc ion binding