Cmc12g0320231 (gene) Melon (Charmono) v1.1

Overview
NameCmc12g0320231
Typegene
OrganismCucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTransposon Ty3-G Gag-Pol polyprotein
LocationCMiso1.1chr12: 6477161 .. 6478015 (+)
RNA-Seq ExpressionCmc12g0320231
SyntenyCmc12g0320231
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTGAACAACTAGAAGGGAAATCTTACTTTCTCTTTCTAGATGGATTCTCTGGATTTTATCAAATAATCATTGCATGTGTAGACCAACATAAGACCATCTTCTCATGTGAATTTGGACCATTTTCTTTTAAAAGAATGCCCTTTGGACTATGTAATGCTCTTGCAACATTTCAAAGATGCATGTTAAGCATATTCACTGATTTCATAAGAAAATGCATAGAAGTGTTTATGGACGATTTCACAGTTTATGGGAACGATTTTGATTCTTTCTTGAATAGTTTAAATTTGATTTTAAAGAGATGCATTGGTACTAACTTGGTGCTTAACTTTGAAAAGTGTCATTTCATGGCCTCTCACGGTATAATACTAGGACGCTTAGTATCATCTAAGGGAATAGAAGTTGACAAAGCTAAAATTAATGTAATTCAAAACTTACCCTACCCCATTTGCTTAAAAGATATTAGATCATTTTTTAGCAGTGCCGGATTTTATAGAAAGTTCATAAAAGACTTTTCTAAGATAGCTTTGTCTTTGACAAATTTACTTCAAAAAGATGTCTCTGTTGTAATTGATGATAAATGCATGCATGCTTTTGATACTTTGAAAGATAAATTGACTTCTTCTCCTATCTTGCAAACACCTTATTGGAACTTACCCTTTGAAATATTGTGTGATGCAAGTGATTACGCATTAGGTGCAATGCTAGGACAAATAGTAGATAACAAATTCCATGCTATATATTTTGCATATCGAACTCTAAACTCTGCTCAAGCTAATTACTCCTCAACTGAAAAAGAGTTTTTGACTATAATCTTTTCTCTTGATAAGTTTCGTAGCTACATAATTGGATAA

mRNA sequence

ATGATTGAACAACTAGAAGGGAAATCTTACTTTCTCTTTCTAGATGGATTCTCTGGATTTTATCAAATAATCATTGCATGTGTAGACCAACATAAGACCATCTTCTCATGTGAATTTGGACCATTTTCTTTTAAAAGAATGCCCTTTGGACTATGTAATGCTCTTGCAACATTTCAAAGATGCATGTTAAGCATATTCACTGATTTCATAAGAAAATGCATAGAAGTGTTTATGGACGATTTCACAGTTTATGGGAACGATTTTGATTCTTTCTTGAATAGTTTAAATTTGATTTTAAAGAGATGCATTGGTACTAACTTGGTGCTTAACTTTGAAAAGTGTCATTTCATGGCCTCTCACGGTATAATACTAGGACGCTTAGTATCATCTAAGGGAATAGAAGTTGACAAAGCTAAAATTAATGTAATTCAAAACTTACCCTACCCCATTTGCTTAAAAGATATTAGATCATTTTTTAGCAGTGCCGGATTTTATAGAAAGTTCATAAAAGACTTTTCTAAGATAGCTTTGTCTTTGACAAATTTACTTCAAAAAGATGTCTCTGTTGTAATTGATGATAAATGCATGCATGCTTTTGATACTTTGAAAGATAAATTGACTTCTTCTCCTATCTTGCAAACACCTTATTGGAACTTACCCTTTGAAATATTGTGTGATGCAAGTGATTACGCATTAGGTGCAATGCTAGGACAAATAGTAGATAACAAATTCCATGCTATATATTTTGCATATCGAACTCTAAACTCTGCTCAAGCTAATTACTCCTCAACTGAAAAAGAGTTTTTGACTATAATCTTTTCTCTTGATAAGTTTCGTAGCTACATAATTGGATAA

Coding sequence (CDS)

ATGATTGAACAACTAGAAGGGAAATCTTACTTTCTCTTTCTAGATGGATTCTCTGGATTTTATCAAATAATCATTGCATGTGTAGACCAACATAAGACCATCTTCTCATGTGAATTTGGACCATTTTCTTTTAAAAGAATGCCCTTTGGACTATGTAATGCTCTTGCAACATTTCAAAGATGCATGTTAAGCATATTCACTGATTTCATAAGAAAATGCATAGAAGTGTTTATGGACGATTTCACAGTTTATGGGAACGATTTTGATTCTTTCTTGAATAGTTTAAATTTGATTTTAAAGAGATGCATTGGTACTAACTTGGTGCTTAACTTTGAAAAGTGTCATTTCATGGCCTCTCACGGTATAATACTAGGACGCTTAGTATCATCTAAGGGAATAGAAGTTGACAAAGCTAAAATTAATGTAATTCAAAACTTACCCTACCCCATTTGCTTAAAAGATATTAGATCATTTTTTAGCAGTGCCGGATTTTATAGAAAGTTCATAAAAGACTTTTCTAAGATAGCTTTGTCTTTGACAAATTTACTTCAAAAAGATGTCTCTGTTGTAATTGATGATAAATGCATGCATGCTTTTGATACTTTGAAAGATAAATTGACTTCTTCTCCTATCTTGCAAACACCTTATTGGAACTTACCCTTTGAAATATTGTGTGATGCAAGTGATTACGCATTAGGTGCAATGCTAGGACAAATAGTAGATAACAAATTCCATGCTATATATTTTGCATATCGAACTCTAAACTCTGCTCAAGCTAATTACTCCTCAACTGAAAAAGAGTTTTTGACTATAATCTTTTCTCTTGATAAGTTTCGTAGCTACATAATTGGATAA

Protein sequence

MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASHGIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG
Homology
BLAST of Cmc12g0320231 vs. NCBI nr
Match: KYP35881.1 (Transposon Ty3-G Gag-Pol polyprotein, partial [Cajanus cajan])

HSP 1 Score: 400.2 bits (1027), Expect = 1.5e-107
Identity = 184/284 (64.79%), Postives = 233/284 (82.04%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L GKS++ FLDGFSG++QI IA  DQ KTIF+C FG F+++RMPFGLCNA  TFQR
Sbjct: 434 MLERLAGKSHYYFLDGFSGYFQIHIAPEDQEKTIFTCPFGTFAYRRMPFGLCNAPGTFQR 493

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CMLSIF+DF+  CIE+FMDDFTVYG+ FD+ L+SL+  L RCI TNLVLNFEKCHFM   
Sbjct: 494 CMLSIFSDFLENCIELFMDDFTVYGSSFDACLDSLDRFLNRCIETNLVLNFEKCHFMVEQ 553

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG ++SSKGIEVD AK++VI  LPYP C++++RSF   AGFYR+F+K+FSK AL L+
Sbjct: 554 GIVLGHIISSKGIEVDPAKVSVISQLPYPSCVREVRSFLGHAGFYRRFVKEFSKKALPLS 613

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
           NLLQKDV  V DD+C  AFD LK+ LT++PI+Q P W +PFE++CDAS+YALGA+L Q V
Sbjct: 614 NLLQKDVDFVFDDRCKQAFDCLKEALTTTPIIQAPDWTVPFELMCDASNYALGAVLAQRV 673

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           D     IY+A RTL++AQANY++TEKE L I+F+LDKFRSY++G
Sbjct: 674 DKLPRVIYYASRTLDAAQANYTTTEKELLAIVFALDKFRSYLLG 717

BLAST of Cmc12g0320231 vs. NCBI nr
Match: XP_027102722.1 (uncharacterized protein LOC113723965 [Coffea arabica])

HSP 1 Score: 396.4 bits (1017), Expect = 2.1e-106
Identity = 185/284 (65.14%), Postives = 231/284 (81.34%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQR
Sbjct: 284 MVERLAGRAYYCFLDGFSGYFQIAIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 343

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL RCI TNLVLN+EKCHFM  H
Sbjct: 344 CMVSIFSEYVEKIIEVFMDDFSVYGDSFDTCLDNLKLILIRCIETNLVLNWEKCHFMVEH 403

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L 
Sbjct: 404 GIVLGHIVSSKGIEVDKAKIDIISALPYPASVREVRSFLGHAGFYRRFIKDFSKIGAPLF 463

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            LLQKDV+   DDKC  AF+ LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V
Sbjct: 464 QLLQKDVAFEFDDKCERAFNKLKELLTSPPIIQPPDWNLPFEIMCDASDHAVGAVLGQRV 523

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
               H IY+A R LN AQ NYS+TEKEFL +IF+L+KFRSY++G
Sbjct: 524 GKAAHVIYYASRALNGAQLNYSTTEKEFLAVIFALEKFRSYLLG 567

BLAST of Cmc12g0320231 vs. NCBI nr
Match: XP_027118748.1 (uncharacterized protein LOC113735992 [Coffea arabica])

HSP 1 Score: 395.2 bits (1014), Expect = 4.8e-106
Identity = 184/284 (64.79%), Postives = 231/284 (81.34%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQR
Sbjct: 494 MVERLAGRAYYCFLDGFSGYFQIAIALEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 553

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL RCI TNLVLN++KCHFM  H
Sbjct: 554 CMVSIFSEYVEKIIEVFMDDFSVYGDSFDTCLDNLKLILIRCIETNLVLNWKKCHFMVEH 613

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L 
Sbjct: 614 GIVLGHIVSSKGIEVDKAKIDIISALPYPASVREVRSFLGHAGFYRRFIKDFSKIGAPLF 673

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            LLQKDV+   DDKC  AF+ LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V
Sbjct: 674 QLLQKDVAFEFDDKCERAFNKLKELLTSPPIIQPPDWNLPFEIMCDASDHAVGAVLGQRV 733

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
               H IY+A R LN AQ NYS+TEKE LT+IF+L+KFRSY++G
Sbjct: 734 GKAAHVIYYASRALNGAQLNYSTTEKELLTVIFALEKFRSYLLG 777

BLAST of Cmc12g0320231 vs. NCBI nr
Match: RZB41284.1 (Transposon Ty3-G Gag-Pol polyprotein [Glycine soja])

HSP 1 Score: 394.0 bits (1011), Expect = 1.1e-105
Identity = 180/284 (63.38%), Postives = 229/284 (80.63%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L GKS++ FLDGFSG+ QI IA  DQ KT F+C FG F+++RMPFGLCNA  TFQR
Sbjct: 596 MLERLAGKSHYCFLDGFSGYMQITIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPGTFQR 655

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF+DF+  CIEVFMDDFTVYG+ FD  LNSL  +L RCI TNLVLNFEKCHFM   
Sbjct: 656 CMISIFSDFLENCIEVFMDDFTVYGSSFDGCLNSLEKVLNRCIETNLVLNFEKCHFMVEQ 715

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG ++S+KGIEVD AKI+VI  LPYP C++++RSF   AGFYR+FI+DFSK+AL L+
Sbjct: 716 GIVLGHIISNKGIEVDPAKISVISQLPYPSCVREVRSFLGHAGFYRRFIRDFSKVALPLS 775

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
           NLLQK+V    +D+C  AFD LK  LT++PI+Q P W  PFE++CDAS+YALGA+L Q +
Sbjct: 776 NLLQKEVEFDFNDRCKEAFDCLKRALTTTPIIQAPDWTAPFELMCDASNYALGAVLAQKI 835

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           D     IY+A+RTL++AQANY++TEKE L I+F+L+KFRSY++G
Sbjct: 836 DKLPRVIYYAFRTLDAAQANYTTTEKELLAIVFALEKFRSYLLG 879

BLAST of Cmc12g0320231 vs. NCBI nr
Match: XP_027065608.1 (uncharacterized protein LOC113691594 [Coffea arabica])

HSP 1 Score: 393.3 bits (1009), Expect = 1.8e-105
Identity = 184/284 (64.79%), Postives = 230/284 (80.99%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQR
Sbjct: 596 MVERLAGRAYYCFLDGFSGYFQIAIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 655

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL RCI TNLVLN+EKCHFM  H
Sbjct: 656 CMVSIFSEYVEKIIEVFMDDFSVYGDSFDTCLDNLKLILIRCIETNLVLNWEKCHFMVEH 715

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L 
Sbjct: 716 GIVLGHIVSSKGIEVDKAKIDIISALPYPASVREVRSFLGHAGFYRRFIKDFSKIGAPLF 775

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            LLQKDV+   DDKC  AF+ LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V
Sbjct: 776 QLLQKDVTFEFDDKCEGAFNKLKELLTSPPIIQPPDWNLPFEIMCDASDHAVGAVLGQRV 835

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
               H IY+A R LN AQ NYS+TEKE L +IF+L+KFRSY++G
Sbjct: 836 GKAAHVIYYASRALNGAQLNYSTTEKELLAVIFALEKFRSYLLG 879

BLAST of Cmc12g0320231 vs. ExPASy Swiss-Prot
Match: Q8I7P9 (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.3e-50
Identity = 110/294 (37.41%), Postives = 164/294 (55.78%), Query Frame = 0

Query: 2   IEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRC 61
           +  L    YF  LD  SGF+QI +   D  KT FS   G + F R+PFGL NA A FQR 
Sbjct: 205 LASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRM 264

Query: 62  MLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASHG 121
           +  I  + I K   V++DD  V+  D+D+   +L L+L      NL +N EK HF+ +  
Sbjct: 265 IDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQV 324

Query: 122 IILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTN 181
             LG +V++ GI+ D  K+  I  +P P  +K+++ F     +YRKFI+D++K+A  LTN
Sbjct: 325 EFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTN 384

Query: 182 LLQ-----------KDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDY 241
           L +             V + +D+  + +F+ LK  L SS IL  P +  PF +  DAS++
Sbjct: 385 LTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNW 444

Query: 242 ALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           A+GA+L Q    +   I +  R+LN  + NY++ EKE L II+SLD  R+Y+ G
Sbjct: 445 AIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYG 498

BLAST of Cmc12g0320231 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 4.0e-47
Identity = 105/277 (37.91%), Postives = 154/277 (55.60%), Query Frame = 0

Query: 9   SYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTD 68
           +YF  +D   GF+QI +      KT FS + G + + RMPFGL NA ATFQRCM  I   
Sbjct: 296 NYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRP 355

Query: 69  FIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASHGIILGRLV 128
            + K   V++DD  V+    D  L SL L+ ++    NL L  +KC F+      LG ++
Sbjct: 356 LLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLDKCEFLKQETTFLGHVL 415

Query: 129 SSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVS 188
           +  GI+ +  KI  IQ  P P   K+I++F    G+YRKFI +F+ IA  +T  L+K++ 
Sbjct: 416 TPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPNFADIAKPMTKCLKKNMK 475

Query: 189 V-VIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAI 248
           +   + +   AF  LK  ++  PIL+ P +   F +  DASD ALGA+L Q      H +
Sbjct: 476 IDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDVALGAVLSQ----DGHPL 535

Query: 249 YFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
            +  RTLN  + NYS+ EKE L I+++   FR Y++G
Sbjct: 536 SYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLG 568

BLAST of Cmc12g0320231 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 7.5e-46
Identity = 104/276 (37.68%), Postives = 152/276 (55.07%), Query Frame = 0

Query: 10  YFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQRCMLSIFTDF 69
           YF  +D   GF+QI +      KT FS + G + + RMPFGL NA ATFQRCM +I    
Sbjct: 296 YFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPL 355

Query: 70  IRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASHGIILGRLVS 129
           + K   V++DD  ++       LNS+ L+  +    NL L  +KC F+      LG +V+
Sbjct: 356 LNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVT 415

Query: 130 SKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSV 189
             GI+ +  K+  I + P P   K+IR+F    G+YRKFI +++ IA  +T+ L+K   +
Sbjct: 416 PDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKI 475

Query: 190 VIDD-KCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIVDNKFHAIY 249
                + + AF+ LK  +   PILQ P +   F +  DAS+ ALGA+L Q      H I 
Sbjct: 476 DTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNLALGAVLSQ----NGHPIS 535

Query: 250 FAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           F  RTLN  + NYS+ EKE L I+++   FR Y++G
Sbjct: 536 FISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLG 567

BLAST of Cmc12g0320231 vs. ExPASy Swiss-Prot
Match: P10401 (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 1.2e-38
Identity = 97/296 (32.77%), Postives = 157/296 (53.04%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           ++  L    +F  LD  SG++QI +A  D+ KT FS   G + F R+PFGL NA + FQR
Sbjct: 263 ILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEFCRLPFGLRNASSIFQR 322

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
            +  +  + I K   V++DD  ++  +    +  ++ +LK  I  N+ ++ EK  F    
Sbjct: 323 ALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLIDANMRVSQEKTRFFKES 382

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
              LG +VS  G + D  K+  IQ  P P C+  +RSF   A +YR FIKDF+ IA  +T
Sbjct: 383 VEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYYRVFIKDFAAIARPIT 442

Query: 181 NLLQ-----------KDVSVVIDDKCMHAFDTLKDKLTSSP-ILQTPYWNLPFEILCDAS 240
           ++L+           K + V  ++   +AF  L++ L S   IL+ P +  PF++  DAS
Sbjct: 443 DILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVILKYPDFKKPFDLTTDAS 502

Query: 241 DYALGAMLGQIVDNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
              +GA+L Q    +   I    RTL   + NY++ E+E L I+++L K ++++ G
Sbjct: 503 ASGIGAVLSQ----EGRPITMISRTLKQPEQNYATNERELLAIVWALGKLQNFLYG 554

BLAST of Cmc12g0320231 vs. ExPASy Swiss-Prot
Match: P10394 (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster OX=7227 GN=POL PE=4 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 2.2e-37
Identity = 100/284 (35.21%), Postives = 144/284 (50.70%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           +++QL    YF  LD  SGF+QI +    +  T FS   G + F R+PFGL  A  +FQR
Sbjct: 396 ILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSYRFTRLPFGLKIAPNSFQR 455

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
            M   F+        ++MDD  V G      L +L  +  +C   NL L+ EKC F    
Sbjct: 456 MMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKCREYNLKLHPEKCSFFMHE 515

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
              LG   + KGI  D  K +VIQN P P      R F +   +YR+FIK+F+  +  +T
Sbjct: 516 VTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFCNYYRRFIKNFADYSRHIT 575

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            L +K+V     D+C  AF  LK +L +  +LQ P ++  F I  DAS  A GA+L Q  
Sbjct: 576 RLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFCITTDASKQACGAVLTQNH 635

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           +     + +A R     ++N S+TE+E   I +++  FR YI G
Sbjct: 636 NGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYG 679

BLAST of Cmc12g0320231 vs. ExPASy TrEMBL
Match: A0A151QZW2 (Transposon Ty3-G Gag-Pol polyprotein (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_043040 PE=4 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 7.2e-108
Identity = 184/284 (64.79%), Postives = 233/284 (82.04%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L GKS++ FLDGFSG++QI IA  DQ KTIF+C FG F+++RMPFGLCNA  TFQR
Sbjct: 434 MLERLAGKSHYYFLDGFSGYFQIHIAPEDQEKTIFTCPFGTFAYRRMPFGLCNAPGTFQR 493

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CMLSIF+DF+  CIE+FMDDFTVYG+ FD+ L+SL+  L RCI TNLVLNFEKCHFM   
Sbjct: 494 CMLSIFSDFLENCIELFMDDFTVYGSSFDACLDSLDRFLNRCIETNLVLNFEKCHFMVEQ 553

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG ++SSKGIEVD AK++VI  LPYP C++++RSF   AGFYR+F+K+FSK AL L+
Sbjct: 554 GIVLGHIISSKGIEVDPAKVSVISQLPYPSCVREVRSFLGHAGFYRRFVKEFSKKALPLS 613

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
           NLLQKDV  V DD+C  AFD LK+ LT++PI+Q P W +PFE++CDAS+YALGA+L Q V
Sbjct: 614 NLLQKDVDFVFDDRCKQAFDCLKEALTTTPIIQAPDWTVPFELMCDASNYALGAVLAQRV 673

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           D     IY+A RTL++AQANY++TEKE L I+F+LDKFRSY++G
Sbjct: 674 DKLPRVIYYASRTLDAAQANYTTTEKELLAIVFALDKFRSYLLG 717

BLAST of Cmc12g0320231 vs. ExPASy TrEMBL
Match: A0A6P6VL84 (uncharacterized protein LOC113723965 OS=Coffea arabica OX=13443 GN=LOC113723965 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 1.0e-106
Identity = 185/284 (65.14%), Postives = 231/284 (81.34%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQR
Sbjct: 284 MVERLAGRAYYCFLDGFSGYFQIAIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 343

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL RCI TNLVLN+EKCHFM  H
Sbjct: 344 CMVSIFSEYVEKIIEVFMDDFSVYGDSFDTCLDNLKLILIRCIETNLVLNWEKCHFMVEH 403

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L 
Sbjct: 404 GIVLGHIVSSKGIEVDKAKIDIISALPYPASVREVRSFLGHAGFYRRFIKDFSKIGAPLF 463

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            LLQKDV+   DDKC  AF+ LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V
Sbjct: 464 QLLQKDVAFEFDDKCERAFNKLKELLTSPPIIQPPDWNLPFEIMCDASDHAVGAVLGQRV 523

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
               H IY+A R LN AQ NYS+TEKEFL +IF+L+KFRSY++G
Sbjct: 524 GKAAHVIYYASRALNGAQLNYSTTEKEFLAVIFALEKFRSYLLG 567

BLAST of Cmc12g0320231 vs. ExPASy TrEMBL
Match: A0A6P6WTG8 (uncharacterized protein LOC113735992 OS=Coffea arabica OX=13443 GN=LOC113735992 PE=4 SV=1)

HSP 1 Score: 395.2 bits (1014), Expect = 2.3e-106
Identity = 184/284 (64.79%), Postives = 231/284 (81.34%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQR
Sbjct: 494 MVERLAGRAYYCFLDGFSGYFQIAIALEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 553

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL RCI TNLVLN++KCHFM  H
Sbjct: 554 CMVSIFSEYVEKIIEVFMDDFSVYGDSFDTCLDNLKLILIRCIETNLVLNWKKCHFMVEH 613

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L 
Sbjct: 614 GIVLGHIVSSKGIEVDKAKIDIISALPYPASVREVRSFLGHAGFYRRFIKDFSKIGAPLF 673

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            LLQKDV+   DDKC  AF+ LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V
Sbjct: 674 QLLQKDVAFEFDDKCERAFNKLKELLTSPPIIQPPDWNLPFEIMCDASDHAVGAVLGQRV 733

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
               H IY+A R LN AQ NYS+TEKE LT+IF+L+KFRSY++G
Sbjct: 734 GKAAHVIYYASRALNGAQLNYSTTEKELLTVIFALEKFRSYLLG 777

BLAST of Cmc12g0320231 vs. ExPASy TrEMBL
Match: A0A445EY74 (Reverse transcriptase OS=Glycine soja OX=3848 GN=D0Y65_055343 PE=4 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 5.1e-106
Identity = 180/284 (63.38%), Postives = 229/284 (80.63%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L GKS++ FLDGFSG+ QI IA  DQ KT F+C FG F+++RMPFGLCNA  TFQR
Sbjct: 596 MLERLAGKSHYCFLDGFSGYMQITIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPGTFQR 655

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF+DF+  CIEVFMDDFTVYG+ FD  LNSL  +L RCI TNLVLNFEKCHFM   
Sbjct: 656 CMISIFSDFLENCIEVFMDDFTVYGSSFDGCLNSLEKVLNRCIETNLVLNFEKCHFMVEQ 715

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG ++S+KGIEVD AKI+VI  LPYP C++++RSF   AGFYR+FI+DFSK+AL L+
Sbjct: 716 GIVLGHIISNKGIEVDPAKISVISQLPYPSCVREVRSFLGHAGFYRRFIRDFSKVALPLS 775

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
           NLLQK+V    +D+C  AFD LK  LT++PI+Q P W  PFE++CDAS+YALGA+L Q +
Sbjct: 776 NLLQKEVEFDFNDRCKEAFDCLKRALTTTPIIQAPDWTAPFELMCDASNYALGAVLAQKI 835

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
           D     IY+A+RTL++AQANY++TEKE L I+F+L+KFRSY++G
Sbjct: 836 DKLPRVIYYAFRTLDAAQANYTTTEKELLAIVFALEKFRSYLLG 879

BLAST of Cmc12g0320231 vs. ExPASy TrEMBL
Match: A0A6P6SHK4 (uncharacterized protein LOC113691594 OS=Coffea arabica OX=13443 GN=LOC113691594 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 8.8e-106
Identity = 184/284 (64.79%), Postives = 230/284 (80.99%), Query Frame = 0

Query: 1   MIEQLEGKSYFLFLDGFSGFYQIIIACVDQHKTIFSCEFGPFSFKRMPFGLCNALATFQR 60
           M+E+L G++Y+ FLDGFSG++QI IA  DQ KT F+C FG F+++RMPFGLCNA ATFQR
Sbjct: 596 MVERLAGRAYYCFLDGFSGYFQIAIAPEDQEKTTFTCPFGTFAYRRMPFGLCNAPATFQR 655

Query: 61  CMLSIFTDFIRKCIEVFMDDFTVYGNDFDSFLNSLNLILKRCIGTNLVLNFEKCHFMASH 120
           CM+SIF++++ K IEVFMDDF+VYG+ FD+ L++L LIL RCI TNLVLN+EKCHFM  H
Sbjct: 656 CMVSIFSEYVEKIIEVFMDDFSVYGDSFDTCLDNLKLILIRCIETNLVLNWEKCHFMVEH 715

Query: 121 GIILGRLVSSKGIEVDKAKINVIQNLPYPICLKDIRSFFSSAGFYRKFIKDFSKIALSLT 180
           GI+LG +VSSKGIEVDKAKI++I  LPYP  ++++RSF   AGFYR+FIKDFSKI   L 
Sbjct: 716 GIVLGHIVSSKGIEVDKAKIDIISALPYPASVREVRSFLGHAGFYRRFIKDFSKIGAPLF 775

Query: 181 NLLQKDVSVVIDDKCMHAFDTLKDKLTSSPILQTPYWNLPFEILCDASDYALGAMLGQIV 240
            LLQKDV+   DDKC  AF+ LK+ LTS PI+Q P WNLPFEI+CDASD+A+GA+LGQ V
Sbjct: 776 QLLQKDVTFEFDDKCEGAFNKLKELLTSPPIIQPPDWNLPFEIMCDASDHAVGAVLGQRV 835

Query: 241 DNKFHAIYFAYRTLNSAQANYSSTEKEFLTIIFSLDKFRSYIIG 285
               H IY+A R LN AQ NYS+TEKE L +IF+L+KFRSY++G
Sbjct: 836 GKAAHVIYYASRALNGAQLNYSTTEKELLAVIFALEKFRSYLLG 879

BLAST of Cmc12g0320231 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 71.2 bits (173), Expect = 1.5e-12
Identity = 42/132 (31.82%), Postives = 67/132 (50.76%), Query Frame = 0

Query: 92  LNSLNLILKRCIGTNLVLNFEKCHFMASHGIILG--RLVSSKGIEVDKAKINVIQNLPYP 151
           +N L ++L+         N +KC F       LG   ++S +G+  D AK+  +   P P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 152 ICLKDIRSFFSSAGFYRKFIKDFSKIALSLTNLLQKDVSVVIDDKCMHAFDTLKDKLTSS 211
               ++R F    G+YR+F+K++ KI   LT LL+K+ S+   +    AF  LK  +T+ 
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKN-SLKWTEMAALAFKALKGAVTTL 120

Query: 212 PILQTPYWNLPF 222
           P+L  P   LPF
Sbjct: 121 PVLALPDLKLPF 131

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KYP35881.11.5e-10764.79Transposon Ty3-G Gag-Pol polyprotein, partial [Cajanus cajan][more]
XP_027102722.12.1e-10665.14uncharacterized protein LOC113723965 [Coffea arabica][more]
XP_027118748.14.8e-10664.79uncharacterized protein LOC113735992 [Coffea arabica][more]
RZB41284.11.1e-10563.38Transposon Ty3-G Gag-Pol polyprotein [Glycine soja][more]
XP_027065608.11.8e-10564.79uncharacterized protein LOC113691594 [Coffea arabica][more]
Match NameE-valueIdentityDescription
Q8I7P92.3e-5037.41Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast... [more]
P043234.0e-4737.91Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast... [more]
P208257.5e-4637.68Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste... [more]
P104011.2e-3832.77Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas... [more]
P103942.2e-3735.21Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste... [more]
Match NameE-valueIdentityDescription
A0A151QZW27.2e-10864.79Transposon Ty3-G Gag-Pol polyprotein (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_... [more]
A0A6P6VL841.0e-10665.14uncharacterized protein LOC113723965 OS=Coffea arabica OX=13443 GN=LOC113723965 ... [more]
A0A6P6WTG82.3e-10664.79uncharacterized protein LOC113735992 OS=Coffea arabica OX=13443 GN=LOC113735992 ... [more]
A0A445EY745.1e-10663.38Reverse transcriptase OS=Glycine soja OX=3848 GN=D0Y65_055343 PE=4 SV=1[more]
A0A6P6SHK48.8e-10664.79uncharacterized protein LOC113691594 OS=Coffea arabica OX=13443 GN=LOC113691594 ... [more]
Match NameE-valueIdentityDescription
ATMG00860.11.5e-1231.82DNA/RNA polymerases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR041577Reverse transcriptase/retrotransposon-derived protein, RNase H-like domainPFAMPF17919RT_RNaseH_2coord: 192..284
e-value: 3.8E-28
score: 97.4
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 4..119
e-value: 6.5E-13
score: 48.7
NoneNo IPR availableGENE3D3.10.10.10HIV Type 1 Reverse Transcriptase, subunit A, domain 1coord: 21..53
e-value: 9.4E-30
score: 105.5
NoneNo IPR availablePANTHERPTHR24559:SF324TRANSPOSON TY3-I GAG-POL POLYPROTEIN-LIKE PROTEINcoord: 2..284
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 2..284
NoneNo IPR availableCDDcd01647RT_LTRcoord: 1..128
e-value: 1.11898E-43
score: 145.046
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 137..229
e-value: 1.2E-21
score: 78.5
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 1..128
e-value: 9.4E-30
score: 105.5
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 2..284

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cmc12g0320231.1Cmc12g0320231.1mRNA