Homology
BLAST of Cmc02g0043461 vs. NCBI nr
Match:
PNY02796.1 (copia protein (gag-int-pol protein), partial [Trifolium pratense])
HSP 1 Score: 243.8 bits (621), Expect = 1.9e-60
Identity = 135/336 (40.18%), Postives = 188/336 (55.95%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K+L L ++L VP++TKNL+SVSKLT DN+I +++ CC +KDK LLKG LK+G Y
Sbjct: 85 KNLNLYDVLYVPEITKNLLSVSKLTADNNIIVEFDADCCSVKDKLTGKALLKGKLKEGLY 144
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
+ SN+++Q NKD T++ K WHR+L +P
Sbjct: 145 QV-------------SNVSSQ----SNKDACTYMSV-------------KESWHRKLGHP 204
Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
+K+L+ +LK CN + ++ + KFC+
Sbjct: 205 NNKVLDKVLKHCN-VKTSSSDQFKFCEACQFGKLHLLPFKSSYSHAQEPLDLIHTDVWGP 264
Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
P N GF YY+ F+DD+SR+TWIYPLKQKS + F F T V+N+FNK K+ Q D
Sbjct: 265 APIMSNSGFKYYVHFIDDFSRFTWIYPLKQKSETIHAFTQFKTLVENQFNKRIKIVQCDG 324
Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
GG+YK ++ L L GI R CPYTS QNG+AERKHRH+ E GLT+LAQA M + YWW+A
Sbjct: 325 GGEYKAVQKLALEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTMLAQARMPLCYWWEA 384
Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
F T++ LIN +P+ I Q LI+ ++ + LK
Sbjct: 385 FSTSVYLINRLPSSINQNACPYTLIYKKEPDYSVLK 389
BLAST of Cmc02g0043461 vs. NCBI nr
Match:
KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])
HSP 1 Score: 240.7 bits (613), Expect = 1.6e-59
Identity = 134/345 (38.84%), Postives = 185/345 (53.62%), Query Frame = 0
Query: 2 EKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDG 61
++K L LK+IL VP +TKNL+S+SKLT DN IY+++H CF+KDK ILL+G +KDG
Sbjct: 283 QQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDG 342
Query: 62 FYHLESVSRKKGVAP-VYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRL 121
Y L S P V+ +I K WHR+L
Sbjct: 343 LYQLPGGSTSTNKRPHVFFSI-------------------------------KETWHRKL 402
Query: 122 EYPLSKILNSILKGCNLIVNDNNGKTKFCDYFPYC----------------------PND 181
+P SK+LN ++K CN+ + C+ F +C P D
Sbjct: 403 GHPNSKVLNEVMKLCNI-------EASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLD 462
Query: 182 ----------------GFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNK 241
GF YY+LF+DD+SR+TWIYPLKQKS + F F V+N+FNK
Sbjct: 463 LVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNK 522
Query: 242 TNKVFQSDNGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQAN 301
K Q D GG++K + + + GI R CPYTS QNG+AERKHRH+VE+GLTLLAQA
Sbjct: 523 RIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAK 582
Query: 302 MTMNYWWDAFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
M ++YWW+AF T + LIN +PT +++ S + +F + + +K
Sbjct: 583 MPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMK 589
BLAST of Cmc02g0043461 vs. NCBI nr
Match:
GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])
HSP 1 Score: 236.9 bits (603), Expect = 2.4e-58
Identity = 134/335 (40.00%), Postives = 178/335 (53.13%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K L L +IL VP++TKNL+SVSKL DN+I +++ CCF+KDK ++LKG LKDG Y
Sbjct: 358 KSLNLHDILYVPNITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLY 417
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
L R S FV K WHRRL +P
Sbjct: 418 QLSGTKRNP---------------------SAFVSV-------------KESWHRRLGHP 477
Query: 124 LSKILNSILKGCNLIV--NDNNGKTKFCDY-----------------------------F 183
+K+L+ +L+ C + V +DN + C Y
Sbjct: 478 NNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPA 537
Query: 184 PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNG 243
P + GF YY+ F+DD+SR+TWIYPLKQKS V+ F F +N+FNK KV Q D G
Sbjct: 538 PIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGG 597
Query: 244 GKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAF 303
G+YK ++ L + GI R CPYTS QNG+AERKHRHI E GLTLLAQA M ++YWW+AF
Sbjct: 598 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 657
Query: 304 LTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
T + LIN +P+ + Q S L+ ++ + LK
Sbjct: 658 STAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLK 658
BLAST of Cmc02g0043461 vs. NCBI nr
Match:
PNX78574.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense])
HSP 1 Score: 234.2 bits (596), Expect = 1.5e-57
Identity = 133/336 (39.58%), Postives = 186/336 (55.36%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K+L L ++L VP +TKNL+SVSKLT DN+I +++ CCF+KDK +LL+G LKDG Y
Sbjct: 362 KNLNLHDVLYVPQITKNLLSVSKLTSDNNIIVEFDNDCCFVKDKLTGKVLLRGILKDGLY 421
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
L SN ++Q NKD ++ K WHR+L +P
Sbjct: 422 QL-------------SNGSSQ----TNKDPCVYLSV-------------KESWHRKLGHP 481
Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
+ +L+ +LK CN+ + ++ K KFC+
Sbjct: 482 SNNVLDKVLKICNVKTSPSD-KFKFCEACQLGKSHLLPFKSSSSHAQEVLELIHTDVWGP 541
Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
P GF YY+ F+DD SR+TWIYPLKQKS + F F V+N+FNK K+ Q D
Sbjct: 542 APINSISGFKYYVHFIDDSSRFTWIYPLKQKSDTIHAFMQFKNMVENQFNKRIKIIQCDG 601
Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
GG++K ++ + L GI R CPYTS QNG+AERKHRH+ E GLTLLAQANM+++YWW+A
Sbjct: 602 GGEFKPVQKVALETGIKFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQANMSLHYWWEA 661
Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
F T + LIN +P+ + + S LI ++ + LK
Sbjct: 662 FSTAVYLINRLPSSVTENESPYFLIHKKEPDYNVLK 666
BLAST of Cmc02g0043461 vs. NCBI nr
Match:
PNY01489.1 (copia-like polyprotein, partial [Trifolium pratense])
HSP 1 Score: 228.8 bits (582), Expect = 6.4e-56
Identity = 127/334 (38.02%), Postives = 178/334 (53.29%), Query Frame = 0
Query: 6 LALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFYHL 65
L L ++L VP +TKNL+SVSKLT DN+I++++ CC +KDK LLKG LKDG Y L
Sbjct: 362 LNLHDVLYVPQITKNLLSVSKLTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQL 421
Query: 66 ESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYPLS 125
VS + NKD ++ K WHR+L +P +
Sbjct: 422 SDVSPQ-----------------SNKDPCVYMSV-------------KESWHRKLGHPNN 481
Query: 126 KILNSILKGCNLIVNDNNGKTKFCDY--------------------------------FP 185
K+L +LK CN+ ++ ++ + FC+ P
Sbjct: 482 KVLEKVLKDCNVKISPSD-QFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIHSDVWGPAP 541
Query: 186 YCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGG 245
GF YY+ F+DD+SR+TWI+PLKQKS + F F +N+FNK K+ Q D GG
Sbjct: 542 ILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGG 601
Query: 246 KYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAFL 305
+YK ++ + + GI R CPYTS QNG+AERKHRH+VE GLTLLAQA M + YWW+AF
Sbjct: 602 EYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPLRYWWEAFS 661
Query: 306 TTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
T + LIN + + + S L+F ++ + LK
Sbjct: 662 TAVYLINRLSSSVNPNESPYSLMFKREPDYNALK 664
BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 155.6 bits (392), Expect = 9.1e-37
Identity = 108/338 (31.95%), Postives = 160/338 (47.34%), Query Frame = 0
Query: 2 EKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDG 61
+ + L L NIL VP++ KNLISV +L N + +++ +KD LL+G KD
Sbjct: 381 KSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDE 440
Query: 62 FYHLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLE 121
Y P+ S ++ +S F + WH RL
Sbjct: 441 LYEW----------PIAS----------SQPVSLFASPSS--------KATHSSWHARLG 500
Query: 122 YPLSKILNSILKGCNL---------------IVNDNN---------GKTKFCDYF----- 181
+P ILNS++ +L ++N +N T+ +Y
Sbjct: 501 HPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVW 560
Query: 182 --PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSD 241
P +D + YY++F+D ++RYTW+YPLKQKS ETF F ++N F F SD
Sbjct: 561 SSPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSD 620
Query: 242 NGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWD 301
NGG++ + GIS P+T NG +ERKHRHIVETGLTLL+ A++ YW
Sbjct: 621 NGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPY 680
Query: 302 AFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELKI 309
AF + LIN +PTP+LQ S + +F + +L++
Sbjct: 681 AFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRV 690
BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 151.0 bits (380), Expect = 2.2e-35
Identity = 110/336 (32.74%), Postives = 158/336 (47.02%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
+ L L +L VP++ KNLISV +L N + +++ +KD LL+G KD Y
Sbjct: 362 RSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELY 421
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
P+ S ++ +S F +P S WH RL +P
Sbjct: 422 EW----------PIAS----------SQAVSMF-----ASPCSKATHSS---WHSRLGHP 481
Query: 124 LSKILNSILKGCNLIVNDNNGKTKFC-DYF------------------------------ 183
ILNS++ +L V + + K C D F
Sbjct: 482 SLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSS 541
Query: 184 PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNG 243
P D + YY++F+D ++RYTW+YPLKQKS +TF F + V+N F SDNG
Sbjct: 542 PILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNG 601
Query: 244 GKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAF 303
G++ +R GIS P+T NG +ERKHRHIVE GLTLL+ A++ YW AF
Sbjct: 602 GEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAF 661
Query: 304 LTTIILINGMPTPILQGLSLIELIFHQKLKFLELKI 309
+ LIN +PTP+LQ S + +F Q + +LK+
Sbjct: 662 SVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKV 669
BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 91.3 bits (225), Expect = 2.1e-17
Identity = 89/313 (28.43%), Postives = 126/313 (40.26%), Query Frame = 0
Query: 6 LALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFYHL 65
L LK++ VPD+ NLIS L RD GY + ++ R L KG+L
Sbjct: 348 LVLKDVRHVPDLRMNLISGIALDRD--------GYESYFANQKWR--LTKGSLVIA---- 407
Query: 66 ESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYPLS 125
KGVA TN + + G N + + V +WH+R+ +
Sbjct: 408 ------KGVARGTLYRTNAE-----------ICQGELNAAQDEISVD--LWHKRMGHMSE 467
Query: 126 KILNSILKGCNLIVNDNNGKTKFCDYFPYCPN---------------------------- 185
K L IL +LI K CDY +
Sbjct: 468 KGL-QILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPME 527
Query: 186 ----DGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGG 245
G Y++ F+DD SR W+Y LK K + FQ F V+ E + K +SDNGG
Sbjct: 528 IESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGG 587
Query: 246 KY--KKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 285
+Y ++ C + GI P T NG AER +R IVE ++L A + ++W +A
Sbjct: 588 EYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEA 626
BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 76.3 bits (186), Expect = 7.0e-13
Identity = 47/147 (31.97%), Postives = 74/147 (50.34%), Query Frame = 0
Query: 162 YYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNGGKY--KKIR 221
Y+++F+D ++ Y Y +K KS FQ FV + FN DNG +Y ++R
Sbjct: 502 YFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMR 561
Query: 222 HLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAFLTTIILI 281
C+ GIS P+T NG +ER R I E T+++ A + ++W +A LT LI
Sbjct: 562 QFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLI 621
Query: 282 NGMPTPILQGLSLIEL-IFHQKLKFLE 306
N +P+ L S ++H K +L+
Sbjct: 622 NRIPSRALVDSSKTPYEMWHNKKPYLK 648
BLAST of Cmc02g0043461 vs. ExPASy Swiss-Prot
Match:
Q87040 (Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol PE=3 SV=1)
HSP 1 Score: 55.8 bits (133), Expect = 9.8e-07
Identity = 66/268 (24.63%), Postives = 107/268 (39.93%), Query Frame = 0
Query: 46 DKAIRDILLKGTLKDGFYHLE----SVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGG 105
D+ ++ +KG K Y+LE VSR +GV + Q+ + + +++
Sbjct: 764 DQLLQGNNVKGYPKQYTYYLEDGKVKVSRPEGVKIIPPQSDRQKIVLQAHNLA------H 823
Query: 106 TNPVKINVDVSKVVWHRRLEYPLSKILNSILKGCNLIVNDNNGKTK-------------- 165
T + ++ + W + + K L K C LI N +N KT
Sbjct: 824 TGREATLLKIANLYWWPNMRKDVVKQLGR-CKQC-LITNASN-KTSGPILRPDRPQKPFD 883
Query: 166 --FCDYF-PYCPNDGFIYYILFMDDYSRYTWIYPLK--QKSAAVETFQHFVTYVKNEFNK 225
F DY P P+ G++Y ++ +D + +TW+YP K SA V++ +
Sbjct: 884 KFFIDYIGPLPPSQGYLYVLVIVDGMTGFTWLYPTKAPSTSATVKSLNVLTSIA------ 943
Query: 226 TNKVFQSDNGGKY--KKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQ 285
KV SD G + GI F PY +GK ERK+ I LT L
Sbjct: 944 IPKVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSSGKVERKNSDIKRL-LTKLLV 1003
Query: 286 ANMTMNYWWDAFLTTIILINGMPTPILQ 289
T W+D + +N +P+L+
Sbjct: 1004 GRPTK--WYDLLPVVQLALNNTYSPVLK 1013
BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match:
A0A2K3NIC3 (Copia protein (Gag-int-pol protein) (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g026116 PE=4 SV=1)
HSP 1 Score: 243.8 bits (621), Expect = 9.3e-61
Identity = 135/336 (40.18%), Postives = 188/336 (55.95%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K+L L ++L VP++TKNL+SVSKLT DN+I +++ CC +KDK LLKG LK+G Y
Sbjct: 85 KNLNLYDVLYVPEITKNLLSVSKLTADNNIIVEFDADCCSVKDKLTGKALLKGKLKEGLY 144
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
+ SN+++Q NKD T++ K WHR+L +P
Sbjct: 145 QV-------------SNVSSQ----SNKDACTYMSV-------------KESWHRKLGHP 204
Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
+K+L+ +LK CN + ++ + KFC+
Sbjct: 205 NNKVLDKVLKHCN-VKTSSSDQFKFCEACQFGKLHLLPFKSSYSHAQEPLDLIHTDVWGP 264
Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
P N GF YY+ F+DD+SR+TWIYPLKQKS + F F T V+N+FNK K+ Q D
Sbjct: 265 APIMSNSGFKYYVHFIDDFSRFTWIYPLKQKSETIHAFTQFKTLVENQFNKRIKIVQCDG 324
Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
GG+YK ++ L L GI R CPYTS QNG+AERKHRH+ E GLT+LAQA M + YWW+A
Sbjct: 325 GGEYKAVQKLALEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTMLAQARMPLCYWWEA 384
Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
F T++ LIN +P+ I Q LI+ ++ + LK
Sbjct: 385 FSTSVYLINRLPSSINQNACPYTLIYKKEPDYSVLK 389
BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match:
A0A151S6M8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027809 PE=4 SV=1)
HSP 1 Score: 240.7 bits (613), Expect = 7.9e-60
Identity = 134/345 (38.84%), Postives = 185/345 (53.62%), Query Frame = 0
Query: 2 EKKDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDG 61
++K L LK+IL VP +TKNL+S+SKLT DN IY+++H CF+KDK ILL+G +KDG
Sbjct: 283 QQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDG 342
Query: 62 FYHLESVSRKKGVAP-VYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRL 121
Y L S P V+ +I K WHR+L
Sbjct: 343 LYQLPGGSTSTNKRPHVFFSI-------------------------------KETWHRKL 402
Query: 122 EYPLSKILNSILKGCNLIVNDNNGKTKFCDYFPYC----------------------PND 181
+P SK+LN ++K CN+ + C+ F +C P D
Sbjct: 403 GHPNSKVLNEVMKLCNI-------EASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLD 462
Query: 182 ----------------GFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNK 241
GF YY+LF+DD+SR+TWIYPLKQKS + F F V+N+FNK
Sbjct: 463 LVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNK 522
Query: 242 TNKVFQSDNGGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQAN 301
K Q D GG++K + + + GI R CPYTS QNG+AERKHRH+VE+GLTLLAQA
Sbjct: 523 RIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAK 582
Query: 302 MTMNYWWDAFLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
M ++YWW+AF T + LIN +PT +++ S + +F + + +K
Sbjct: 583 MPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMK 589
BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match:
A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)
HSP 1 Score: 236.9 bits (603), Expect = 1.1e-58
Identity = 134/335 (40.00%), Postives = 178/335 (53.13%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K L L +IL VP++TKNL+SVSKL DN+I +++ CCF+KDK ++LKG LKDG Y
Sbjct: 358 KSLNLHDILYVPNITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLY 417
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
L R S FV K WHRRL +P
Sbjct: 418 QLSGTKRNP---------------------SAFVSV-------------KESWHRRLGHP 477
Query: 124 LSKILNSILKGCNLIV--NDNNGKTKFCDY-----------------------------F 183
+K+L+ +L+ C + V +DN + C Y
Sbjct: 478 NNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPA 537
Query: 184 PYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDNG 243
P + GF YY+ F+DD+SR+TWIYPLKQKS V+ F F +N+FNK KV Q D G
Sbjct: 538 PIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGG 597
Query: 244 GKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDAF 303
G+YK ++ L + GI R CPYTS QNG+AERKHRHI E GLTLLAQA M ++YWW+AF
Sbjct: 598 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 657
Query: 304 LTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
T + LIN +P+ + Q S L+ ++ + LK
Sbjct: 658 STAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLK 658
BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match:
A0A2K3LJ49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratense OX=57577 GN=L195_g034552 PE=4 SV=1)
HSP 1 Score: 234.2 bits (596), Expect = 7.4e-58
Identity = 133/336 (39.58%), Postives = 186/336 (55.36%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K+L L ++L VP +TKNL+SVSKLT DN+I +++ CCF+KDK +LL+G LKDG Y
Sbjct: 362 KNLNLHDVLYVPQITKNLLSVSKLTSDNNIIVEFDNDCCFVKDKLTGKVLLRGILKDGLY 421
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
L SN ++Q NKD ++ K WHR+L +P
Sbjct: 422 QL-------------SNGSSQ----TNKDPCVYLSV-------------KESWHRKLGHP 481
Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
+ +L+ +LK CN+ + ++ K KFC+
Sbjct: 482 SNNVLDKVLKICNVKTSPSD-KFKFCEACQLGKSHLLPFKSSSSHAQEVLELIHTDVWGP 541
Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
P GF YY+ F+DD SR+TWIYPLKQKS + F F V+N+FNK K+ Q D
Sbjct: 542 APINSISGFKYYVHFIDDSSRFTWIYPLKQKSDTIHAFMQFKNMVENQFNKRIKIIQCDG 601
Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 303
GG++K ++ + L GI R CPYTS QNG+AERKHRH+ E GLTLLAQANM+++YWW+A
Sbjct: 602 GGEFKPVQKVALETGIKFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQANMSLHYWWEA 661
Query: 304 FLTTIILINGMPTPILQGLSLIELIFHQKLKFLELK 308
F T + LIN +P+ + + S LI ++ + LK
Sbjct: 662 FSTAVYLINRLPSSVTENESPYFLIHKKEPDYNVLK 666
BLAST of Cmc02g0043461 vs. ExPASy TrEMBL
Match:
A0A803P5A9 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 229.9 bits (585), Expect = 1.4e-56
Identity = 132/329 (40.12%), Postives = 176/329 (53.50%), Query Frame = 0
Query: 4 KDLALKNILCVPDMTKNLISVSKLTRDNHIYLKYHGYCCFIKDKAIRDILLKGTLKDGFY 63
K L LK++L VP+M K LIS+SKLT DN I +++ CF+KDK R +LL G LKDG Y
Sbjct: 239 KTLVLKDVLLVPEMAKKLISISKLTTDNDILIEFDSDFCFVKDKVTRKVLLTGMLKDGLY 298
Query: 64 HLESVSRKKGVAPVYSNITNQQFMHKNKDISTFVLTGGTNPVKINVDVSKVVWHRRLEYP 123
L S K P S + FV + N + N K VWHRRL +P
Sbjct: 299 QLNSPLSKPVCQPTQSAPSTHD----------FVCSASINR-QSNFLSKKDVWHRRLGHP 358
Query: 124 LSKILNSILKGCNLIVNDNNGKTKFCDY-------------------------------- 183
SKIL +L N+ V+ NN ++ FCD
Sbjct: 359 SSKILKLVLNSSNVPVSFNNNES-FCDACQYGKSHALPFKLSNSRATKMLELIHTDLWGP 418
Query: 184 FPYCPNDGFIYYILFMDDYSRYTWIYPLKQKSAAVETFQHFVTYVKNEFNKTNKVFQSDN 243
P N F +YI F+DDYSR+TW+YPLKQKS A+ F F T +N+F K +D
Sbjct: 419 APINSNTNFKFYIHFLDDYSRFTWLYPLKQKSDALNGFTQFKTMAENQFETKIKFITTDW 478
Query: 244 GGKYKKIRHLCLNLGISCRFYCPYTSTQNGKAERKHRHIVETGLTLLAQANMTMNYWWDA 301
GG+++ + GI CP+TS QNG+ E +RHIVE GLTLLAQA+M + +W DA
Sbjct: 479 GGEFQAFDQFLITHGIQFHHSCPHTSAQNGRNEENYRHIVEMGLTLLAQASMPLKFWVDA 538
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
PNY02796.1 | 1.9e-60 | 40.18 | copia protein (gag-int-pol protein), partial [Trifolium pratense] | [more] |
KYP50444.1 | 1.6e-59 | 38.84 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | [more] |
GAU19483.1 | 2.4e-58 | 40.00 | hypothetical protein TSUD_77270 [Trifolium subterraneum] | [more] |
PNX78574.1 | 1.5e-57 | 39.58 | retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] | [more] |
PNY01489.1 | 6.4e-56 | 38.02 | copia-like polyprotein, partial [Trifolium pratense] | [more] |
Match Name | E-value | Identity | Description | |
Q94HW2 | 9.1e-37 | 31.95 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... | [more] |
Q9ZT94 | 2.2e-35 | 32.74 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... | [more] |
P10978 | 2.1e-17 | 28.43 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... | [more] |
P04146 | 7.0e-13 | 31.97 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3 | [more] |
Q87040 | 9.8e-07 | 24.63 | Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol ... | [more] |
Match Name | E-value | Identity | Description | |
A0A2K3NIC3 | 9.3e-61 | 40.18 | Copia protein (Gag-int-pol protein) (Fragment) OS=Trifolium pratense OX=57577 GN... | [more] |
A0A151S6M8 | 7.9e-60 | 38.84 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... | [more] |
A0A2Z6MBG6 | 1.1e-58 | 40.00 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ... | [more] |
A0A2K3LJ49 | 7.4e-58 | 39.58 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratens... | [more] |
A0A803P5A9 | 1.4e-56 | 40.12 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |