Cmc02g0043461 (gene) Melon (Charmono) v1.1

Overview
Name: Cmc02g0043461
Type: gene
Organism: Cucumis melo L. var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CMiso1.1chr02: 5763399 .. 5764424 (-)
RNA-Seq Expression: Cmc02g0043461
Synteny: Cmc02g0043461
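
The Location field can be turned into a feature length and a strand-aware sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly) and that "(-)" means the feature lies on the reverse strand. The function names are illustrative and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


# Translation table for complementing DNA bases, preserving case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement, as needed for a minus-strand feature."""
    return seq.translate(_COMPLEMENT)[::-1]


# Coordinates copied from the Location field above.
print(span_length(5763399, 5764424))  # 1026
```

Under these assumptions, a sequence sliced from the forward strand of the chromosome would need `reverse_complement` applied before it could be compared with the gene sequence listed on this page.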
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAAAAAGGATTTAGCCCTTAAAAATATCCTATGTGTTCCTGATATGACTAAGAATCTGATTAGTGTTTCTAAACTTACACGAGATAATCATATTTATCTTAAATATCATGGTTATTGTTGTTTTATTAAGGACAAGGCTATAAGGGACATTTTGTTGAAAGGAACTCTTAAGGATGGGTTTTACCACCTAGAAAGTGTTAGCAGGAAGAAAGGAGTAGCACCAGTTTACAGTAATATTACAAATCAGCAGTTTATGCACAAAAATAAGGATATCTCAACCTTTGTCTTGACCGGAGGAACAAATCCTGTCAAAATTAATGTTGATGTATCCAAAGTTGTTTGGCACAGGAGACTTGAATATCCATTGTCCAAAATTTTGAATTCTATACTAAAAGGTTGTAATTTGATAGTTAATGACAACAATGGTAAAACTAAATTTTGTGACTACTGTCAGTTTGGGAAATCTCATAATCTTCCATTTCCTAATTCTTAATCTCATGCTAAGGAATTCTTTGTTATTATATACTCTGATTTATGGGGTCCAGTTCCTTATTGCCCAAATGATGGTTTCATATACTATATATTGTTTATGGATGATTACAGTAGATATACGTGGATTTATCCACTCAAACAGAAAAGTGCAGCAGTTGAAACATTTCAGCACTTTGTTACATATGTTAAAAACGAGTTTAATAAAACCAATAAAGTGTTTCAATCTGACAATGGAGGAAAATACAAAAAAATACGACATTTATGTTTAAATCTGGGGATCAGTTGTCGGTTCTATTGTCCTTATACCTCAACTCAGAATGGTAAAGCGGAAAGGAAACATCGACATATTGTTGAAACTGGGCTTACGCTTCTTGCACAAGCCAACATGACCATGAATTATTGGTGGGATGCCTTCCTAACCACTATCATCTTGATAAATGGAATGCCCACACCTATACTTCAAGGGCTATCTCTAATTGAGCTTATATTTCATCAAAAATTAAAATTCTTAGAATTGAAGATTTTATAG

mRNA sequence

ATGGAAAAAAAGGATTTAGCCCTTAAAAATATCCTATGTGTTCCTGATATGACTAAGAATCTGATTAGTGTTTCTAAACTTACACGAGATAATCATATTTATCTTAAATATCATGGTTATTGTTGTTTTATTAAGGACAAGGCTATAAGGGACATTTTGTTGAAAGGAACTCTTAAGGATGGGTTTTACCACCTAGAAAGTGTTAGCAGGAAGAAAGGAGTAGCACCAGTTTACAGTAATATTACAAATCAGCAGTTTATGCACAAAAATAAGGATATCTCAACCTTTGTCTTGACCGGAGGAACAAATCCTGTCAAAATTAATGTTGATGTATCCAAAGTTGTTTGGCACAGGAGACTTGAATATCCATTGTCCAAAATTTTGAATTCTATACTAAAAGGTTGTAATTTGATAGTTAATGACAACAATGGTAAAACTAAATTTTGTGACTACTTTCCTTATTGCCCAAATGATGGTTTCATATACTATATATTGTTTATGGATGATTACAGTAGATATACGTGGATTTATCCACTCAAACAGAAAAGTGCAGCAGTTGAAACATTTCAGCACTTTGTTACATATGTTAAAAACGAGTTTAATAAAACCAATAAAGTGTTTCAATCTGACAATGGAGGAAAATACAAAAAAATACGACATTTATGTTTAAATCTGGGGATCAGTTGTCGGTTCTATTGTCCTTATACCTCAACTCAGAATGGTAAAGCGGAAAGGAAACATCGACATATTGTTGAAACTGGGCTTACGCTTCTTGCACAAGCCAACATGACCATGAATTATTGGTGGGATGCCTTCCTAACCACTATCATCTTGATAAATGGAATGCCCACACCTATACTTCAAGGGCTATCTCTAATTGAGCTTATATTTCATCAAAAATTAAAATTCTTAGAATTGAAGATTTTATAG

Coding sequence (CDS)

ATGGAAAAAAAGGATTTAGCCCTTAAAAATATCCTATGTGTTCCTGATATGACTAAGAATCTGATTAGTGTTTCTAAACTTACACGAGATAATCATATTTATCTTAAATATCATGGTTATTGTTGTTTTATTAAGGACAAGGCTATAAGGGACATTTTGTTGAAAGGAACTCTTAAGGATGGGTTTTACCACCTAGAAAGTGTTAGCAGGAAGAAAGGAGTAGCACCAGTTTACAGTAATATTACAAATCAGCAGTTTATGCACAAAAATAAGGATATCTCAACCTTTGTCTTGACCGGAGGAACAAATCCTGTCAAAATTAATGTTGATGTATCCAAAGTTGTTTGGCACAGGAGACTTGAATATCCATTGTCCAAAATTTTGAATTCTATACTAAAAGGTTGTAATTTGATAGTTAATGACAACAATGGTAAAACTAAATTTTGTGACTACTTTCCTTATTGCCCAAATGATGGTTTCATATACTATATATTGTTTATGGATGATTACAGTAGATATACGTGGATTTATCCACTCAAACAGAAAAGTGCAGCAGTTGAAACATTTCAGCACTTTGTTACATATGTTAAAAACGAGTTTAATAAAACCAATAAAGTGTTTCAATCTGACAATGGAGGAAAATACAAAAAAATACGACATTTATGTTTAAATCTGGGGATCAGTTGTCGGTTCTATTGTCCTTATACCTCAACTCAGAATGGTAAAGCGGAAAGGAAACATCGACATATTGTTGAAACTGGGCTTACGCTTCTTGCACAAGCCAACATGACCATGAATTATTGGTGGGATGCCTTCCTAACCACTATCATCTTGATAAATGGAATGCCCACACCTATACTTCAAGGGCTATCTCTAATTGAGCTTATATTTCATCAAAAATTAAAATTCTTAGAATTGAAGATTTTATAG

Protein sequence

MEKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFYHLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYPLSKILNSILKGCNLIVNDNNGKTKFCDYFPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELKIL
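
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and written here for illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGAAAAAAAGGATTTAGCC"))  # MEKKDLA
print(translate("AAGATTTTATAG"))           # KIL
```

The two fragments reproduce the start (MEKKDLA) and the end (KIL) of the protein sequence shown above.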
Homology
BLAST of Cmc02g0043461 vs. NCBI nr
Match: PNY02796.1 (copia protein (gag-int-pol protein), partial [Trifolium pratense])

HSP 1 Score: 243.8 bits (621), Expect = 1.9e-60
Identity = 135/336 (40.18%), Positives = 188/336 (55.95%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K+L L ++L VP++TKNL+SVSKLT DN+I +++   CC +KDK     LLKG LK+G Y
Sbjct: 85  KNLNLYDVLYVPEITKNLLSVSKLTADNNIIVEFDADCCSVKDKLTGKALLKGKLKEGLY 144

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            +             SN+++Q     NKD  T++               K  WHR+L +P
Sbjct: 145 QV-------------SNVSSQ----SNKDACTYMSV-------------KESWHRKLGHP 204

Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
            +K+L+ +LK CN +   ++ + KFC+                                 
Sbjct: 205 NNKVLDKVLKHCN-VKTSSSDQFKFCEACQFGKLHLLPFKSSYSHAQEPLDLIHTDVWGP 264

Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
            P   N GF YY+ F+DD+SR+TWIYPLKQKS  +  F  F T V+N+FNK  K+ Q D 
Sbjct: 265 APIMSNSGFKYYVHFIDDFSRFTWIYPLKQKSETIHAFTQFKTLVENQFNKRIKIVQCDG 324

Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
           GG+YK ++ L L  GI  R  CPYTS QNG+AERKHRH+ E GLT+LAQA M + YWW+A
Sbjct: 325 GGEYKAVQKLALEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTMLAQARMPLCYWWEA 384

Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           F T++ LIN +P+ I Q      LI+ ++  +  LK
Sbjct: 385 FSTSVYLINRLPSSINQNACPYTLIYKKEPDYSVLK 389
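
The percentages in each HSP header appear to be the match count divided by the alignment length, rounded to two decimals. A small sketch that checks this against the first HSP above; the `percent` helper is an illustrative name, not part of BLAST.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the PNY02796.1 alignment.
print(percent(135, 336))  # 40.18 (identical residues)
print(percent(188, 336))  # 55.95 (positive-scoring residues)
```

The denominator is the alignment length including gap columns, which is why it can exceed the 308-residue query length.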

BLAST of Cmc02g0043461 vs. NCBI nr
Match: KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 240.7 bits (613), Expect = 1.6e-59
Identity = 134/345 (38.84%), Positives = 185/345 (53.62%), Query Frame = 0

Query: 2   EKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDG 61
           ++K L LK+IL VP +TKNL+S+SKLT DN IY+++H   CF+KDK    ILL+G +KDG
Sbjct: 283 QQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDG 342

Query: 62  FYHLESVSRKKGVAP-VYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRL 121
            Y L   S      P V+ +I                               K  WHR+L
Sbjct: 343 LYQLPGGSTSTNKRPHVFFSI-------------------------------KETWHRKL 402

Query: 122 EYPLSKILNSILKGCNLIVNDNNGKTKFCDYFPYC----------------------PND 181
            +P SK+LN ++K CN+       +   C+ F +C                      P D
Sbjct: 403 GHPNSKVLNEVMKLCNI-------EASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLD 462

Query: 182 ----------------GFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNK 241
                           GF YY+LF+DD+SR+TWIYPLKQKS   + F  F   V+N+FNK
Sbjct: 463 LVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNK 522

Query: 242 TNKVFQSDNGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQAN 301
             K  Q D GG++K +  + +  GI  R  CPYTS QNG+AERKHRH+VE+GLTLLAQA 
Sbjct: 523 RIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAK 582

Query: 302 MTMNYWWDAFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           M ++YWW+AF T + LIN +PT +++  S  + +F +   +  +K
Sbjct: 583 MPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMK 589

BLAST of Cmc02g0043461 vs. NCBI nr
Match: GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])

HSP 1 Score: 236.9 bits (603), Expect = 2.4e-58
Identity = 134/335 (40.00%), Positives = 178/335 (53.13%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K L L +IL VP++TKNL+SVSKL  DN+I +++   CCF+KDK    ++LKG LKDG Y
Sbjct: 358 KSLNLHDILYVPNITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLY 417

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            L    R                       S FV               K  WHRRL +P
Sbjct: 418 QLSGTKRNP---------------------SAFVSV-------------KESWHRRLGHP 477

Query: 124 LSKILNSILKGCNLIV--NDNNGKTKFCDY-----------------------------F 183
            +K+L+ +L+ C + V  +DN    + C Y                              
Sbjct: 478 NNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPA 537

Query: 184 PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNG 243
           P   + GF YY+ F+DD+SR+TWIYPLKQKS  V+ F  F    +N+FNK  KV Q D G
Sbjct: 538 PIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGG 597

Query: 244 GKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAF 303
           G+YK ++ L +  GI  R  CPYTS QNG+AERKHRHI E GLTLLAQA M ++YWW+AF
Sbjct: 598 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 657

Query: 304 LTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
            T + LIN +P+ + Q  S   L+  ++  +  LK
Sbjct: 658 STAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLK 658

BLAST of Cmc02g0043461 vs. NCBI nr
Match: PNX78574.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense])

HSP 1 Score: 234.2 bits (596), Expect = 1.5e-57
Identity = 133/336 (39.58%), Positives = 186/336 (55.36%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K+L L ++L VP +TKNL+SVSKLT DN+I +++   CCF+KDK    +LL+G LKDG Y
Sbjct: 362 KNLNLHDVLYVPQITKNLLSVSKLTSDNNIIVEFDNDCCFVKDKLTGKVLLRGILKDGLY 421

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            L             SN ++Q     NKD   ++               K  WHR+L +P
Sbjct: 422 QL-------------SNGSSQ----TNKDPCVYLSV-------------KESWHRKLGHP 481

Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
            + +L+ +LK CN+  + ++ K KFC+                                 
Sbjct: 482 SNNVLDKVLKICNVKTSPSD-KFKFCEACQLGKSHLLPFKSSSSHAQEVLELIHTDVWGP 541

Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
            P     GF YY+ F+DD SR+TWIYPLKQKS  +  F  F   V+N+FNK  K+ Q D 
Sbjct: 542 APINSISGFKYYVHFIDDSSRFTWIYPLKQKSDTIHAFMQFKNMVENQFNKRIKIIQCDG 601

Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
           GG++K ++ + L  GI  R  CPYTS QNG+AERKHRH+ E GLTLLAQANM+++YWW+A
Sbjct: 602 GGEFKPVQKVALETGIKFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQANMSLHYWWEA 661

Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           F T + LIN +P+ + +  S   LI  ++  +  LK
Sbjct: 662 FSTAVYLINRLPSSVTENESPYFLIHKKEPDYNVLK 666

BLAST of Cmc02g0043461 vs. NCBI nr
Match: PNY01489.1 (copia-like polyprotein, partial [Trifolium pratense])

HSP 1 Score: 228.8 bits (582), Expect = 6.4e-56
Identity = 127/334 (38.02%), Positives = 178/334 (53.29%), Query Frame = 0

Query: 6   LALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFYHL 65
           L L ++L VP +TKNL+SVSKLT DN+I++++   CC +KDK     LLKG LKDG Y L
Sbjct: 362 LNLHDVLYVPQITKNLLSVSKLTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQL 421

Query: 66  ESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYPLS 125
             VS +                  NKD   ++               K  WHR+L +P +
Sbjct: 422 SDVSPQ-----------------SNKDPCVYMSV-------------KESWHRKLGHPNN 481

Query: 126 KILNSILKGCNLIVNDNNGKTKFCDY--------------------------------FP 185
           K+L  +LK CN+ ++ ++ +  FC+                                  P
Sbjct: 482 KVLEKVLKDCNVKISPSD-QFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIHSDVWGPAP 541

Query: 186 YCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGG 245
                GF YY+ F+DD+SR+TWI+PLKQKS  +  F  F    +N+FNK  K+ Q D GG
Sbjct: 542 ILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGG 601

Query: 246 KYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAFL 305
           +YK ++ + +  GI  R  CPYTS QNG+AERKHRH+VE GLTLLAQA M + YWW+AF 
Sbjct: 602 EYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPLRYWWEAFS 661

Query: 306 TTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           T + LIN + + +    S   L+F ++  +  LK
Sbjct: 662 TAVYLINRLSSSVNPNESPYSLMFKREPDYNALK 664

BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 9.1e-37
Identity = 108/338 (31.95%), Positives = 160/338 (47.34%), Query Frame = 0

Query: 2   EKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDG 61
           + + L L NIL VP++ KNLISV +L   N + +++      +KD      LL+G  KD 
Sbjct: 381 KSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDE 440

Query: 62  FYHLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLE 121
            Y            P+ S          ++ +S F               +   WH RL 
Sbjct: 441 LYEW----------PIAS----------SQPVSLFASPSS--------KATHSSWHARLG 500

Query: 122 YPLSKILNSILKGCNL---------------IVNDNN---------GKTKFCDYF----- 181
           +P   ILNS++   +L               ++N +N           T+  +Y      
Sbjct: 501 HPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVW 560

Query: 182 --PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSD 241
             P   +D + YY++F+D ++RYTW+YPLKQKS   ETF  F   ++N F      F SD
Sbjct: 561 SSPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSD 620

Query: 242 NGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWD 301
           NGG++  +       GIS     P+T   NG +ERKHRHIVETGLTLL+ A++   YW  
Sbjct: 621 NGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPY 680

Query: 302 AFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELKI 309
           AF   + LIN +PTP+LQ  S  + +F     + +L++
Sbjct: 681 AFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRV 690

BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 2.2e-35
Identity = 110/336 (32.74%), Positives = 158/336 (47.02%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           + L L  +L VP++ KNLISV +L   N + +++      +KD      LL+G  KD  Y
Sbjct: 362 RSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELY 421

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
                       P+ S          ++ +S F      +P       S   WH RL +P
Sbjct: 422 EW----------PIAS----------SQAVSMF-----ASPCSKATHSS---WHSRLGHP 481

Query: 124 LSKILNSILKGCNLIVNDNNGKTKFC-DYF------------------------------ 183
              ILNS++   +L V + + K   C D F                              
Sbjct: 482 SLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSS 541

Query: 184 PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNG 243
           P    D + YY++F+D ++RYTW+YPLKQKS   +TF  F + V+N F        SDNG
Sbjct: 542 PILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNG 601

Query: 244 GKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAF 303
           G++  +R      GIS     P+T   NG +ERKHRHIVE GLTLL+ A++   YW  AF
Sbjct: 602 GEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAF 661

Query: 304 LTTIILINGMPTPILQGLSLIELIFHQKLKFLELKI 309
              + LIN +PTP+LQ  S  + +F Q   + +LK+
Sbjct: 662 SVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKV 669

BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 2.1e-17
Identity = 89/313 (28.43%), Positives = 126/313 (40.26%), Query Frame = 0

Query: 6   LALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFYHL 65
           L LK++  VPD+  NLIS   L RD        GY  +  ++  R  L KG+L       
Sbjct: 348 LVLKDVRHVPDLRMNLISGIALDRD--------GYESYFANQKWR--LTKGSLVIA---- 407

Query: 66  ESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYPLS 125
                 KGVA      TN +           +  G  N  +  + V   +WH+R+ +   
Sbjct: 408 ------KGVARGTLYRTNAE-----------ICQGELNAAQDEISVD--LWHKRMGHMSE 467

Query: 126 KILNSILKGCNLIVNDNNGKTKFCDYFPYCPN---------------------------- 185
           K L  IL   +LI        K CDY  +                               
Sbjct: 468 KGL-QILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPME 527

Query: 186 ----DGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGG 245
                G  Y++ F+DD SR  W+Y LK K    + FQ F   V+ E  +  K  +SDNGG
Sbjct: 528 IESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGG 587

Query: 246 KY--KKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 285
           +Y  ++    C + GI      P T   NG AER +R IVE   ++L  A +  ++W +A
Sbjct: 588 EYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEA 626

BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 76.3 bits (186), Expect = 7.0e-13
Identity = 47/147 (31.97%), Positives = 74/147 (50.34%), Query Frame = 0

Query: 162 YYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGGKY--KKIR 221
           Y+++F+D ++ Y   Y +K KS     FQ FV   +  FN        DNG +Y   ++R
Sbjct: 502 YFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMR 561

Query: 222 HLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAFLTTIILI 281
             C+  GIS     P+T   NG +ER  R I E   T+++ A +  ++W +A LT   LI
Sbjct: 562 QFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLI 621

Query: 282 NGMPTPILQGLSLIEL-IFHQKLKFLE 306
           N +P+  L   S     ++H K  +L+
Sbjct: 622 NRIPSRALVDSSKTPYEMWHNKKPYLK 648

BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match: Q87040 (Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol PE=3 SV=1)

HSP 1 Score: 55.8 bits (133), Expect = 9.8e-07
Identity = 66/268 (24.63%), Positives = 107/268 (39.93%), Query Frame = 0

Query: 46   DKAIRDILLKGTLKDGFYHLE----SVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGG 105
            D+ ++   +KG  K   Y+LE     VSR +GV  +      Q+ + +  +++       
Sbjct: 764  DQLLQGNNVKGYPKQYTYYLEDGKVKVSRPEGVKIIPPQSDRQKIVLQAHNLA------H 823

Query: 106  TNPVKINVDVSKVVWHRRLEYPLSKILNSILKGCNLIVNDNNGKTK-------------- 165
            T      + ++ + W   +   + K L    K C LI N +N KT               
Sbjct: 824  TGREATLLKIANLYWWPNMRKDVVKQLGR-CKQC-LITNASN-KTSGPILRPDRPQKPFD 883

Query: 166  --FCDYF-PYCPNDGFIYYILFMDDYSRYTWIYPLK--QKSAAVETFQHFVTYVKNEFNK 225
              F DY  P  P+ G++Y ++ +D  + +TW+YP K    SA V++     +        
Sbjct: 884  KFFIDYIGPLPPSQGYLYVLVIVDGMTGFTWLYPTKAPSTSATVKSLNVLTSIA------ 943

Query: 226  TNKVFQSDNGGKY--KKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQ 285
              KV  SD G  +            GI   F  PY    +GK ERK+  I    LT L  
Sbjct: 944  IPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSSGKVERKNSDIKRL-LTKLLV 1003

Query: 286  ANMTMNYWWDAFLTTIILINGMPTPILQ 289
               T   W+D      + +N   +P+L+
Sbjct: 1004 GRPTK--WYDLLPVVQLALNNTYSPVLK 1013

BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match: A0A2K3NIC3 (Copia protein (Gag-int-pol protein) (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g026116 PE=4 SV=1)

HSP 1 Score: 243.8 bits (621), Expect = 9.3e-61
Identity = 135/336 (40.18%), Positives = 188/336 (55.95%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K+L L ++L VP++TKNL+SVSKLT DN+I +++   CC +KDK     LLKG LK+G Y
Sbjct: 85  KNLNLYDVLYVPEITKNLLSVSKLTADNNIIVEFDADCCSVKDKLTGKALLKGKLKEGLY 144

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            +             SN+++Q     NKD  T++               K  WHR+L +P
Sbjct: 145 QV-------------SNVSSQ----SNKDACTYMSV-------------KESWHRKLGHP 204

Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
            +K+L+ +LK CN +   ++ + KFC+                                 
Sbjct: 205 NNKVLDKVLKHCN-VKTSSSDQFKFCEACQFGKLHLLPFKSSYSHAQEPLDLIHTDVWGP 264

Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
            P   N GF YY+ F+DD+SR+TWIYPLKQKS  +  F  F T V+N+FNK  K+ Q D 
Sbjct: 265 APIMSNSGFKYYVHFIDDFSRFTWIYPLKQKSETIHAFTQFKTLVENQFNKRIKIVQCDG 324

Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
           GG+YK ++ L L  GI  R  CPYTS QNG+AERKHRH+ E GLT+LAQA M + YWW+A
Sbjct: 325 GGEYKAVQKLALEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTMLAQARMPLCYWWEA 384

Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           F T++ LIN +P+ I Q      LI+ ++  +  LK
Sbjct: 385 FSTSVYLINRLPSSINQNACPYTLIYKKEPDYSVLK 389

BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match: A0A151S6M8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027809 PE=4 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 7.9e-60
Identity = 134/345 (38.84%), Positives = 185/345 (53.62%), Query Frame = 0

Query: 2   EKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDG 61
           ++K L LK+IL VP +TKNL+S+SKLT DN IY+++H   CF+KDK    ILL+G +KDG
Sbjct: 283 QQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDG 342

Query: 62  FYHLESVSRKKGVAP-VYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRL 121
            Y L   S      P V+ +I                               K  WHR+L
Sbjct: 343 LYQLPGGSTSTNKRPHVFFSI-------------------------------KETWHRKL 402

Query: 122 EYPLSKILNSILKGCNLIVNDNNGKTKFCDYFPYC----------------------PND 181
            +P SK+LN ++K CN+       +   C+ F +C                      P D
Sbjct: 403 GHPNSKVLNEVMKLCNI-------EASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLD 462

Query: 182 ----------------GFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNK 241
                           GF YY+LF+DD+SR+TWIYPLKQKS   + F  F   V+N+FNK
Sbjct: 463 LVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNK 522

Query: 242 TNKVFQSDNGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQAN 301
             K  Q D GG++K +  + +  GI  R  CPYTS QNG+AERKHRH+VE+GLTLLAQA 
Sbjct: 523 RIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAK 582

Query: 302 MTMNYWWDAFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           M ++YWW+AF T + LIN +PT +++  S  + +F +   +  +K
Sbjct: 583 MPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMK 589

BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match: A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 1.1e-58
Identity = 134/335 (40.00%), Positives = 178/335 (53.13%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K L L +IL VP++TKNL+SVSKL  DN+I +++   CCF+KDK    ++LKG LKDG Y
Sbjct: 358 KSLNLHDILYVPNITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLY 417

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            L    R                       S FV               K  WHRRL +P
Sbjct: 418 QLSGTKRNP---------------------SAFVSV-------------KESWHRRLGHP 477

Query: 124 LSKILNSILKGCNLIV--NDNNGKTKFCDY-----------------------------F 183
            +K+L+ +L+ C + V  +DN    + C Y                              
Sbjct: 478 NNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPA 537

Query: 184 PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNG 243
           P   + GF YY+ F+DD+SR+TWIYPLKQKS  V+ F  F    +N+FNK  KV Q D G
Sbjct: 538 PIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGG 597

Query: 244 GKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAF 303
           G+YK ++ L +  GI  R  CPYTS QNG+AERKHRHI E GLTLLAQA M ++YWW+AF
Sbjct: 598 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 657

Query: 304 LTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
            T + LIN +P+ + Q  S   L+  ++  +  LK
Sbjct: 658 STAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLK 658

BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match: A0A2K3LJ49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratense OX=57577 GN=L195_g034552 PE=4 SV=1)

HSP 1 Score: 234.2 bits (596), Expect = 7.4e-58
Identity = 133/336 (39.58%), Positives = 186/336 (55.36%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K+L L ++L VP +TKNL+SVSKLT DN+I +++   CCF+KDK    +LL+G LKDG Y
Sbjct: 362 KNLNLHDVLYVPQITKNLLSVSKLTSDNNIIVEFDNDCCFVKDKLTGKVLLRGILKDGLY 421

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            L             SN ++Q     NKD   ++               K  WHR+L +P
Sbjct: 422 QL-------------SNGSSQ----TNKDPCVYLSV-------------KESWHRKLGHP 481

Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
            + +L+ +LK CN+  + ++ K KFC+                                 
Sbjct: 482 SNNVLDKVLKICNVKTSPSD-KFKFCEACQLGKSHLLPFKSSSSHAQEVLELIHTDVWGP 541

Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
            P     GF YY+ F+DD SR+TWIYPLKQKS  +  F  F   V+N+FNK  K+ Q D 
Sbjct: 542 APINSISGFKYYVHFIDDSSRFTWIYPLKQKSDTIHAFMQFKNMVENQFNKRIKIIQCDG 601

Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
           GG++K ++ + L  GI  R  CPYTS QNG+AERKHRH+ E GLTLLAQANM+++YWW+A
Sbjct: 602 GGEFKPVQKVALETGIKFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQANMSLHYWWEA 661

Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
           F T + LIN +P+ + +  S   LI  ++  +  LK
Sbjct: 662 FSTAVYLINRLPSSVTENESPYFLIHKKEPDYNVLK 666

BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match: A0A803P5A9 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 1.4e-56
Identity = 132/329 (40.12%), Positives = 176/329 (53.50%), Query Frame = 0

Query: 4   KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
           K L LK++L VP+M K LIS+SKLT DN I +++    CF+KDK  R +LL G LKDG Y
Sbjct: 239 KTLVLKDVLLVPEMAKKLISISKLTTDNDILIEFDSDFCFVKDKVTRKVLLTGMLKDGLY 298

Query: 64  HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
            L S   K    P  S  +             FV +   N  + N    K VWHRRL +P
Sbjct: 299 QLNSPLSKPVCQPTQSAPSTHD----------FVCSASINR-QSNFLSKKDVWHRRLGHP 358

Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
            SKIL  +L   N+ V+ NN ++ FCD                                 
Sbjct: 359 SSKILKLVLNSSNVPVSFNNNES-FCDACQYGKSHALPFKLSNSRATKMLELIHTDLWGP 418

Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
            P   N  F +YI F+DDYSR+TW+YPLKQKS A+  F  F T  +N+F    K   +D 
Sbjct: 419 APINSNTNFKFYIHFLDDYSRFTWLYPLKQKSDALNGFTQFKTMAENQFETKIKFITTDW 478

Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 301
           GG+++      +  GI     CP+TS QNG+ E  +RHIVE GLTLLAQA+M + +W DA
Sbjct: 479 GGEFQAFDQFLITHGIQFHHSCPHTSAQNGRNEENYRHIVEMGLTLLAQASMPLKFWVDA 538

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
PNY02796.1 | 1.9e-60 | 40.18 | copia protein (gag-int-pol protein), partial [Trifolium pratense]
KYP50444.1 | 1.6e-59 | 38.84 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
GAU19483.1 | 2.4e-58 | 40.00 | hypothetical protein TSUD_77270 [Trifolium subterraneum]
PNX78574.1 | 1.5e-57 | 39.58 | retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense]
PNY01489.1 | 6.4e-56 | 38.02 | copia-like polyprotein, partial [Trifolium pratense]

Match Name | E-value | Identity (%) | Description
Q94HW2 | 9.1e-37 | 31.95 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 2.2e-35 | 32.74 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978 | 2.1e-17 | 28.43 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 7.0e-13 | 31.97 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q87040 | 9.8e-07 | 24.63 | Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol ...

Match Name | E-value | Identity (%) | Description
A0A2K3NIC3 | 9.3e-61 | 40.18 | Copia protein (Gag-int-pol protein) (Fragment) OS=Trifolium pratense OX=57577 GN...
A0A151S6M8 | 7.9e-60 | 38.84 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A2Z6MBG6 | 1.1e-58 | 40.00 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A2K3LJ49 | 7.4e-58 | 39.58 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratens...
A0A803P5A9 | 1.4e-56 | 40.12 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Melon (Charmono) v1.1
Date Performed: 2022-10-13
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 149..236; e-value: 1.1E-9; score: 38.5
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 119..300; score: 16.142885
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 146..306; e-value: 9.5E-25; score: 89.2
None | No IPR available | PANTHER | PTHR11439:SF324 | RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED | coord: 162..280
None | No IPR available | PANTHER | PTHR11439 | GAG-POL-RELATED RETROTRANSPOSON | coord: 162..280
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 149..291

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cmc02g0043461.1 | Cmc02g0043461.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding