Homology
BLAST of Lag0008934 vs. NCBI nr
Match:
GAU51268.1 (hypothetical protein TSUD_412550 [Trifolium subterraneum])
HSP 1 Score: 725.7 bits (1872), Expect = 5.5e-205
Identity = 422/1060 (39.81%), Postives = 601/1060 (56.70%), Query Frame = 0
Query: 19 SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIE 78
SSA N S + L ++++VKLD NY LWK +VL+++RG K+D Y+LGT P + +
Sbjct: 2 SSAAN---SPKKNDLPSIISVKLDRDNYPLWKSLVLSLIRGCKLDGYILGTTECPEQFVT 61
Query: 79 TTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGD 138
+ +S K+ N + +W+ DQAL GWL SM+ IA +++ +T +++W + + G
Sbjct: 62 SADKSKKV---NPDFGDWIANDQALLGWLMNSMAIDIATQLLHCETSKQLWDETQSLAGA 121
Query: 139 TSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSE 198
+K+ + + NT+KG MKM +YL MK S+ LKL G+P+S DL+ L GLD+E
Sbjct: 122 HTKSRITYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLAGSPISNSDLMIQTLNGLDAE 181
Query: 199 YIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDN 258
Y P+V + D+ +W ++ + L+ FE L +++ + L LN + F N
Sbjct: 182 YNPVVVKLSDQINLSWVDVQAQLLAFESRLDQFNN-----------FSGLTLNASANFAN 241
Query: 259 QRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR 318
+ +F S GN N G G G G+M N Q GT + ++
Sbjct: 242 KTEFRGNKFNSRGNWRRSNFRGMRGGRGKGRMSNTKCQVCNGTGHIAVD----------- 301
Query: 319 EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHV 378
C RF+ + + S QG S SA+IA+P D +W DSGA NHV
Sbjct: 302 -----CSYRFDRPYTGRNYSTEADKQG------SHSAFIASPYHGQDYEWYFDSGANNHV 361
Query: 379 T--------------------ANARNLAV------KMDYNVLNNILHVPEIRKNLISIAS 438
T N L + K++ L+++L+VP+I KNL+S++
Sbjct: 362 THQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLNNLNLHDVLYVPQITKNLLSVSK 421
Query: 439 LTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLW 498
LT DNN++VEF +N C VKDK + + +L L++ LYQ+ + K S W
Sbjct: 422 LTADNNILVEFDANCCSVKDKLTGQTLLKGRLKDGLYQL------SNKEPCVYMSVKESW 481
Query: 499 HNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVH 558
H +LGH ++KV+ VLK CNV ++ FC+ACQ GK H LPF S SH +PL L+H
Sbjct: 482 HRKLGHPNNKVLDKVLKDCNVKISHSDQFSFCEACQFGKLHLLPFKPSSSHVQEPLALIH 541
Query: 559 CDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK 618
D+WGP+PI+S G+KYY+ F+DDF+R I+PLK K + +F Q+K L N+F KKIK
Sbjct: 542 SDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIK 601
Query: 619 TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPL 678
+Q D GGE+++ + GI+FR SCP+TSQQNG ERKHRH+ E+GLTLLAQA MPL
Sbjct: 602 IIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQAKMPL 661
Query: 679 TYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKF 738
YWWEAFS+AVY+IN LP+ + + SP+ F YPCL+ Y HK
Sbjct: 662 RYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKL 721
Query: 739 QHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGP------S 798
Q H+T+CVF+GYS +HKGYKC++S GR+F+S HV+FNE+ FPF + + P +
Sbjct: 722 QFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVIFNENHFPFHGGFLDTKNPLKTLTDN 781
Query: 799 NIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPV 858
+ L+P S+ P +T S + S ++ +S + T++
Sbjct: 782 SSILLPTCSAGATTQDAIEPDNNTTSDQNTHSIESSDNNENEEQVDSSEFFVNTNNSSTQ 841
Query: 859 DGSSDGSPELPLQISTAL-------------NSHPMQTRAKSGIFKQKDWGAFLVNSSFS 918
D +D S + + ++ + N+H M+TR+K GI K K + + S
Sbjct: 842 DIEADNSVDSEDRNNSTMTGTIQQQAQQDNSNTHWMRTRSKDGIHKPKIPYVGMAETD-S 901
Query: 919 PEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNS 978
E EP SVKEAL WK AM+ E L N TWTLVP N+I SKWIFK K KS+
Sbjct: 902 EEKEPKSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTKYKSDG 961
Query: 979 SFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLN 1012
S +R KARLVA+GF Q G+DF ETFSPVVK+ T++IIL +AV NW +RQLD+NN FLN
Sbjct: 962 SIERRKARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLN 1015
BLAST of Lag0008934 vs. NCBI nr
Match:
GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])
HSP 1 Score: 723.0 bits (1865), Expect = 3.5e-204
Identity = 417/1054 (39.56%), Postives = 588/1054 (55.79%), Query Frame = 0
Query: 14 AVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQP 73
A AA S+ N + SS ++VKLD NY LWK +VL ++RG K+D Y+LGT+ P
Sbjct: 2 ASAAGSNNKNDLPSS--------VSVKLDRNNYPLWKSLVLPVIRGCKLDGYMLGTEGCP 61
Query: 74 SELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALE 133
E I T+++S K N + EW DQ L GW+ SM+ IA +++ +T +++W +
Sbjct: 62 EEFI-TSSDSSKN--KNSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQ 121
Query: 134 EVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLA 193
+ G +++ + + + +KG MKM DYL MK + LKL GNPVS DL+ L
Sbjct: 122 SLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLN 181
Query: 194 GLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQ 253
GLDSEY P+V + D+ +W +L + L+ FE + + + TN + AT NR
Sbjct: 182 GLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFESRIEQLNNLTNLTLN----ATANVANR- 241
Query: 254 SMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREE 313
+ + SN N N+ G G G G+ G NP Q +
Sbjct: 242 ----SDHRGKSSNNNWRGSNSRGWRGGRGRGKSGKNPCQVCG-------------LSNHI 301
Query: 314 ITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT- 373
C+ RF++ ++ + S QG S +A++A+ + D W DSGA+NHVT
Sbjct: 302 AIDCFHRFDKTYSRSNHSAGHDKQG------SHNAFLASQNSVEDYDWYFDSGASNHVTH 361
Query: 374 -------------------ANARNLAV------KMDYNVLNNILHVPEIRKNLISIASLT 433
N LA+ K+ L++IL+VP I KNL+S++ L
Sbjct: 362 QTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKLKSLNLHDILYVPNITKNLLSVSKLA 421
Query: 434 VDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHN 493
DNN++VEF N C VKDK + KV+L +L++ LYQ+ T ++ S WH
Sbjct: 422 ADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLS----GTKRNPSAFVSVKESWHR 481
Query: 494 RLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCD 553
RLGH ++KV+ VL+SC V +++ FC+ACQ GK H LPF S SH +PLELVH D
Sbjct: 482 RLGHPNNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTD 541
Query: 554 LWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTL 613
+WGP+PI++ G+KYY+ FVDDF+R IYPLK K E +F Q+K L N+F K+IK +
Sbjct: 542 VWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVI 601
Query: 614 QTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTY 673
Q D GGE++ + GI+FR SCP+TSQQNG ERKHRHI E GLTLLAQA MPL Y
Sbjct: 602 QCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHY 661
Query: 674 WWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKFQH 733
WWEAFS+AVY+IN LP+ + + SP+ YPCL+ Y HK Q+
Sbjct: 662 WWEAFSTAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQY 721
Query: 734 HSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL---- 793
H+T+CVFLGYS +HKGYKCL+S GR+FIS HV+FNE FPF + + P +
Sbjct: 722 HTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPFHDGFLNTRSPLKTTINVPS 781
Query: 794 -----------VPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLY 853
+ S P+ E + + T + QD +++ + + ++ P+ +
Sbjct: 782 TSFPLCTAGNVIDDASMPILEAENPAETNTEDSQD------VNSDTEQTNNGPSEDNTTH 841
Query: 854 TSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPT 913
++ S G SH + TR+KSGI K K + ++ +EP
Sbjct: 842 EETLDITQQQSVGEAS-----QNTNTSHAIHTRSKSGIHKPK-LPYIGLTETYKDTMEPA 901
Query: 914 SVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCK 973
+ KEAL WK AM E L NKTW LVP N++ SKW+FK K K + S +R K
Sbjct: 902 NAKEALSRPLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLERRK 961
Query: 974 ARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEA 1012
ARLVA+GF Q G+D+ ETFSPV+KA T++IIL++AV NW +RQLD+NN FLNG L+E
Sbjct: 962 ARLVAKGFQQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLKET 1000
BLAST of Lag0008934 vs. NCBI nr
Match:
PNX94503.1 (putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense])
HSP 1 Score: 712.2 bits (1837), Expect = 6.3e-201
Identity = 417/1048 (39.79%), Postives = 595/1048 (56.77%), Query Frame = 0
Query: 33 LSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNML 92
L + ++VKLD N+ LWK +VL ++RG K D Y+LGTK P + + + + K+ N
Sbjct: 12 LPSTVSVKLDRDNFPLWKSLVLPLIRGCKYDGYMLGTKKCPDQFVTSIDNTEKI---NPD 71
Query: 93 YEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQ 152
Y++W DQAL GWL SM+ IA V++ +T +++W + + G +++ + +
Sbjct: 72 YQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLWDEAQSLAGAHTRSRIIYLKSEFH 131
Query: 153 NTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLK 212
NT K MKM YLA MK ++ LKL G+P+S DL+ L GLDSEY P+V + D+
Sbjct: 132 NTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMIQTLNGLDSEYNPVVVKLSDQTNI 191
Query: 213 TWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGND 272
+W + + L+ FE L + + N + + + + A +S +F G RG+
Sbjct: 192 SWVDFQAQLLAFESRLDQLNNFNNINL---NASANFASKNES---GGNKFGSRGGWRGS- 251
Query: 273 NNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHF--NNLHA 332
N+ G G G +M P + F CY RF++ + N +A
Sbjct: 252 NSRGMRGGRGRARMSKPP----------RPICQICGKFGHTAAQCYYRFDKSYTEKNHYA 311
Query: 333 SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNL------------ 392
G G S SA++A+P D +W DSGA+NHVT + L
Sbjct: 312 EGEG----------SHSAFVASPYHGQDYEWYFDSGASNHVTHQSGQLQDLNENNGKNSL 371
Query: 393 --------------AVKMDYNVLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVK 452
+ K++ L N+L+VPEI KNL+S++ LT+DNN +VEF NYC VK
Sbjct: 372 LVGNGEKLKILASGSTKLNDVNLRNVLYVPEITKNLLSVSKLTIDNNALVEFDENYCYVK 431
Query: 453 DKASKKVMLHEILRNDLYQI----ELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSV 512
DK + K +L L++ LYQ+ E P+ + P + I S +WH +LGH ++KV++ V
Sbjct: 432 DKLTGKALLKGRLKDGLYQLSANKEPPTNKDPCAYI---SLKEIWHRKLGHPNNKVLEKV 491
Query: 513 LKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGY 572
LK NV ++ FC+ACQ GK H LPF S SH +PL+L+H D+WGP+PI+S +
Sbjct: 492 LKDNNVKISPSDKFTFCEACQFGKLHLLPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNF 551
Query: 573 KYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTS 632
KYY+ F+DDF+R I+PLK K E +F+Q+K LV N+F KKIK ++ D GGE++
Sbjct: 552 KYYVHFLDDFSRFTWIFPLKQKSETIHAFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQK 611
Query: 633 FLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIIN 692
D+GI+F+ SCP+TSQQNG ERKHRH+ E+GLTLLAQA MPL+YWWEAFS+AVY+IN
Sbjct: 612 CAIDSGIQFQMSCPYTSQQNGRAERKHRHVTELGLTLLAQAKMPLSYWWEAFSTAVYLIN 671
Query: 693 CLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKFQHHSTKCVFLGYSLA 752
LP+ + + SP+ F YPCL+ Y HK Q H+T+CVFLGYS +
Sbjct: 672 RLPSSVNPNESPYTLVFKKEPDYTALKPFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNS 731
Query: 753 HKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL------VPHHSSPVAEF 812
HKGYKC++S GR+F+S HVVFNE+ FPF+ + + P + P + +
Sbjct: 732 HKGYKCVNSHGRVFVSRHVVFNENHFPFQEGFLDTRNPIKVVTNDTPIGFPSFPAGITTN 791
Query: 813 QTSSPTLSTPPQ------------DQYVSPQLSAHSPEGH----SCPASVMPLYTSSMLP 872
T+ T + Q DQ V H+ E + S SM
Sbjct: 792 NTAEATDNIVDQQEPELNDINTVADQSVESDTFEHTDENNFSNGETEDSTEAAGRESMEE 851
Query: 873 VDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL 932
+ + P Q T N+H M+TR+K+G++K K L + + EP SV EAL
Sbjct: 852 ISQPITETNPPPQQDIT--NTHWMRTRSKAGVYKPKLPYIGLTEEAKEGK-EPESVSEAL 911
Query: 933 KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQ 992
+W AM+ E L NKTWTLVP N+I SKWIFK K K++ + +R KARLVA+
Sbjct: 912 SIPEWLNAMDAEYKALMNNKTWTLVPFEGQENVISSKWIFKTKYKADGTIERRKARLVAR 971
Query: 993 GFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQL 1012
GF Q GVD+ ETFSPVVK+ T++IIL++AV +W +RQLD+NN FLNG L+E+V+M Q
Sbjct: 972 GFQQTAGVDYDETFSPVVKSSTVRIILSIAVHLSWEVRQLDINNAFLNGNLKESVFMHQP 1023
BLAST of Lag0008934 vs. NCBI nr
Match:
PNY01489.1 (copia-like polyprotein, partial [Trifolium pratense])
HSP 1 Score: 684.9 bits (1766), Expect = 1.1e-192
Identity = 412/1041 (39.58%), Postives = 584/1041 (56.10%), Query Frame = 0
Query: 19 SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIE 78
SSA N S+ + L ++++VKLD NY LWK +VL ++RG K D Y+LGTK P + +
Sbjct: 2 SSAAN---SNKKNDLPSIISVKLDRDNYPLWKSLVLPLIRGCKFDGYILGTKECPEQFVT 61
Query: 79 TTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGD 138
+ +S K+ N +++WM DQAL GWL SM+ IA +++ +T +++W + + G
Sbjct: 62 SADKSKKV---NPDFQDWMADDQALLGWLMNSMAIDIATQLLHCETSKQLWDEAQSLAGA 121
Query: 139 TSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSE 198
+K+ + + NT+KG MKM +YL MK S+ LKL G+P+S DL+ L GLD+E
Sbjct: 122 HTKSRIIYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLSGSPISNSDLMIQTLNGLDAE 181
Query: 199 YIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDN 258
Y P+V + D+ +W ++ + L+ FE L D + + L LN + F N
Sbjct: 182 YNPVVVKLSDQINLSWVDVQAQLLAFESRL-----------DQLNNFSGLTLNASANFAN 241
Query: 259 QRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR 318
+ +F S GN N G G G G+M N Q GT ++
Sbjct: 242 KTEFRGNKFHSRGNWRRSNFRGMRGGRGKGRMSNTKCQVCSGTGHTAVD----------- 301
Query: 319 EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHV 378
C RF+ + + S QG S SA++A+P D +W DSGA+NHV
Sbjct: 302 -----CSYRFDRSYTGRNYSTEADKQG------SHSAFVASPYHGQDYEWYFDSGASNHV 361
Query: 379 T--------------------ANARNLAV------KMDYNVLNNILHVPEIRKNLISIAS 438
T N L + K++ L+++L+VP+I KNL+S++
Sbjct: 362 THQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLNTLNLHDVLYVPQITKNLLSVSK 421
Query: 439 LTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLW 498
LT DNN+ VEF +N C VKDK + + +L L++ LYQ+ S Q+ K S W
Sbjct: 422 LTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQLSDVSPQSNKDPCVYMSVKESW 481
Query: 499 HNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVH 558
H +LGH ++KV++ VLK CNV ++ FC+ACQ GK H LPF S SH +PL L+H
Sbjct: 482 HRKLGHPNNKVLEKVLKDCNVKISPSDQFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIH 541
Query: 559 CDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK 618
D+WGP+PI+S G+KYY+ F+DDF+R I+PLK K + +F Q+K L N+F KKIK
Sbjct: 542 SDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIK 601
Query: 619 TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPL 678
+Q D GGE+++ + GI+FR SCP+TSQQNG ERKHRH+VE+GLTLLAQA MPL
Sbjct: 602 IIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPL 661
Query: 679 TYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKF 738
YWWEAFS+AVY+IN L + + + SP+ F YPCL+ Y HK
Sbjct: 662 RYWWEAFSTAVYLINRLSSSVNPNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKL 721
Query: 739 QHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVP 798
Q H+T+CVF+GYS +HKG + G S + + N+ + + S+ S+
Sbjct: 722 QFHTTRCVFMGYSNSHKGSTTQDAIG----SDNNIVNDQD-TTNDQNTHSTESSDNNEEE 781
Query: 799 HHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDG 858
H + + T++ + D +V + +SP + G+S
Sbjct: 782 HADNSESFVNTNNGSTQDIEVDNFVDSE-DRNSP------------------TITGTSQQ 841
Query: 859 SPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKA 918
Q +T N+H ++TR+K+GI K K + + S E EP SVKEAL WK
Sbjct: 842 QAH---QDNT--NTHGIRTRSKNGIHKPKLPYVGMTETD-SEEKEPESVKEALDKPMWKE 901
Query: 919 AMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPG 978
AM+ E L N TWTLVP N+I SKWIFK K KS+ S +R KARLVA+GF Q G
Sbjct: 902 AMDKEYKALMSNYTWTLVPFQAQENIIDSKWIFKTKYKSDGSIERRKARLVAKGFQQTAG 961
Query: 979 VDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQS 1012
+DFHETFSPVVK+ T++IIL +AV NW +RQLD+NN FLNG+L+E V+M Q GYID +
Sbjct: 962 LDFHETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLKETVFMHQPEGYIDTT 973
BLAST of Lag0008934 vs. NCBI nr
Match:
KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])
HSP 1 Score: 674.5 bits (1739), Expect = 1.4e-189
Identity = 379/947 (40.02%), Postives = 557/947 (58.82%), Query Frame = 0
Query: 111 MSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQ 170
M+ +A +++ +T +++W+ + + G +++ + + T+KG +KM +YL MK+
Sbjct: 1 MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60
Query: 171 ASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLAR 230
+++L L G+ VS DLV+ LAGLD+EY PIV + DK+ TW E+ + L+ +E L +
Sbjct: 61 IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120
Query: 231 YSTPTNAHFDLPDLATHLALNRQSMFDNQR-QFNPSNGNRGNDNNSGSYYGSGNGQMGNN 290
+ +N L + + N ++ N+R + N G RG N G+ G G G+
Sbjct: 121 INNQSN-------LTLNPSSNISTILYNRRGKSNAFGGGRGGQINRGARGGRGRGR---- 180
Query: 291 PTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAY 350
T+ + ++ + A + CY RF +++ ++ S + + N + +AY
Sbjct: 181 ATKDRIVCQVCCKPGHAA-------SHCYHRFNKNYIGQNSDEQKS-EKDKEQNYNFNAY 240
Query: 351 IATPEILHDPKWLADSGATNHVT--------------------ANARNLAV------KMD 410
+A+P + D W DSGA+NHVT N NL + +D
Sbjct: 241 VASPSTVEDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLD 300
Query: 411 YNV----LNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRN 470
L +IL+VP+I KNL+SI+ LT DN++ VEFH C VKDK + +++L +++
Sbjct: 301 TQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKD 360
Query: 471 DLYQIELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDA 530
LYQ+ S T K S WH +LGH +SKV+ V+K CN+ E+ FC+A
Sbjct: 361 GLYQLPGGSTSTNKRPHVFFSIKETWHRKLGHPNSKVLNEVMKLCNIEASPCENFEFCEA 420
Query: 531 CQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPL 590
CQ GK+H LPF SVS +PL+LVH D+WGP+PI S+ G+KYY+ F+DD++R IYPL
Sbjct: 421 CQFGKAHNLPFQNSVSCAKEPLDLVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPL 480
Query: 591 KTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQ 650
K K + F +F Q++ LV N+F K+IKTLQ D GGEF+S + L GI+ R SCP+TS Q
Sbjct: 481 KQKSDVFQAFIQFRNLVENQFNKRIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQ 540
Query: 651 NGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH- 710
NG ERKHRH+VE GLTLLAQA MPL YWWEAFS+AV++IN LPT ++ + SP++Q F
Sbjct: 541 NGRAERKHRHVVESGLTLLAQAKMPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDK 600
Query: 711 --------------YPCLRLYQSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHV 770
YPCL+ Y HK Q H+TKCVFLGYS +HKGYKCL+S+GR+FIS HV
Sbjct: 601 NPDYTAMKTFGCACYPCLKPYNQHKLQFHTTKCVFLGYSGSHKGYKCLNSTGRIFISRHV 660
Query: 771 VFNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSP 830
VFNE FPF + + P+ I P +S + + ++ Q + + S+++
Sbjct: 661 VFNEHHFPFHDGFLNTRKPAEIITDP--TSLLFPISPTGSNVANEEQRLHTNNNSSSNTK 720
Query: 831 EGHSCPASVMPLYTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAF 890
H + + + + ++ E ++ ++N H M TR+K GI K K
Sbjct: 721 SKHQVEQAENQNTIDATISQNTFANSRIENNIE---SINQHQMTTRSKMGIIKPKKPYVG 780
Query: 891 LVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFK 950
V + E EP + EAL++ +WK AM E L NKTWTLVP N+I KW+FK
Sbjct: 781 AVEKTLE-EQEPETTYEALENPEWKKAMIAEFKALMMNKTWTLVPYQGQKNIIDCKWVFK 840
Query: 951 VKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLD 1010
K K++ + +R KARLVA+GF Q G+D+ ETFSPV+KA T++IIL++AV NW IRQ+D
Sbjct: 841 TKYKADGTIERRKARLVAKGFQQTLGLDYDETFSPVIKAITVRIILSIAVHFNWEIRQMD 900
Query: 1011 VNNIFLNGRLQEAVYMRQLTGYIDQSCPDYVCKLDKALYGLRQASRA 1012
+NN FLNG L+E V+MRQ G++D+S P ++CKL KA+YGL+QA R+
Sbjct: 901 INNAFLNGELKETVFMRQPEGFLDKSRPQHICKLTKAIYGLKQAPRS 922
BLAST of Lag0008934 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 510.4 bits (1313), Expect = 4.7e-143
Identity = 376/1114 (33.75%), Postives = 535/1114 (48.03%), Query Frame = 0
Query: 40 KLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTV 99
KL NYL+W V A+ G ++ ++ G+ P I T N Y W
Sbjct: 25 KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAP----RVNPDYTRWKRQ 84
Query: 100 DQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSM 159
D+ + + G++S ++ V T ++W+ L ++Y + S V Q R L+ KG+
Sbjct: 85 DKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTK 144
Query: 160 KMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDL-KTWQELS 219
+ DY+ + + L L+G P+ D+ V VL L EY P++ I KD T E+
Sbjct: 145 TIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIH 204
Query: 220 SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSN--GNRGNDNNSG 279
L+N E + S+ T +P A ++ + +N N +N NR N+NNS
Sbjct: 205 ERLLNHESKILAVSSAT----VIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSK 264
Query: 280 SYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGNGSV 339
+ S NN + + + V+G + R + F N+ +
Sbjct: 265 PWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKR---CSQLQHFLSSVNSQQPPSPFTP 324
Query: 340 QGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLAVKMDYN----------- 399
+N A S Y + WL DSGAT+H+T++ NL++ Y
Sbjct: 325 WQPRANLALGSPYSSN-------NWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGS 384
Query: 400 -------------------VLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDK 459
L+NIL+VP I KNLIS+ L N V VEF VKD
Sbjct: 385 TIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDL 444
Query: 460 ASKKVMLHEILRNDLYQIELPSIQ------TPKSEIRSTSFAGLWHNRLGHASSKVIKSV 519
+ +L +++LY+ + S Q +P S+ +S WH RLGH + ++ SV
Sbjct: 445 NTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSS----WHARLGHPAPSILNSV 504
Query: 520 LKSCNVSTFLNESLHF--CDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIV 579
+ + ++S LN S F C C KS+++PFS+S ++ +PLE ++ D+W SPI+S
Sbjct: 505 ISNYSLSV-LNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHD 564
Query: 580 GYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSF 639
Y+YY+ FVD FTR +YPLK K + +F +K L+ NRF+ +I T +D GGEF +
Sbjct: 565 NYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVAL 624
Query: 640 TSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYI 699
+ +GI S PHT + NG+ ERKHRHIVE GLTLL+ AS+P TYW AF+ AVY+
Sbjct: 625 WEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYL 684
Query: 700 INCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKFQHHSTKCVFLGYS 759
IN LPTP+L SP+++ F YP LR Y HK S +CVFLGYS
Sbjct: 685 INRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYS 744
Query: 760 LAHKGYKCLS-SSGRLFISCHVVFNESEFPFKSELVPSSG-------------------- 819
L Y CL + RL+IS HV F+E+ FPF + L S
Sbjct: 745 LTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPT 804
Query: 820 -----PSNIALVPHH-----SSPVAEFQT-----------------SSPTLSTP------ 879
P+ PHH SSP A F+ SSP + P
Sbjct: 805 RTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQ 864
Query: 880 PQDQYVSPQLSAHSPEGHS--CPASVMPLYTSSMLPVDGSSDGSPELPLQISTA------ 939
P Q Q HS + S P + P + L S S P +++
Sbjct: 865 PTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPT 924
Query: 940 -----------------------LNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTS 999
LN+H M TRAK+GI K + V S + E EP +
Sbjct: 925 PPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAV--SLAAESEPRT 984
Query: 1000 VKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLP-NLNLIGSKWIFKVKRKSNSSFDRCK 1012
+ALK +W+ AM EI N TW LVP P ++ ++G +WIF K S+ S +R K
Sbjct: 985 AIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYK 1044
BLAST of Lag0008934 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 490.3 bits (1261), Expect = 5.1e-137
Identity = 370/1132 (32.69%), Postives = 535/1132 (47.26%), Query Frame = 0
Query: 22 TNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTT 81
TNI++ ++ + KL NYL+W V A+ G ++ ++ G+ P I T
Sbjct: 13 TNILNVNMSN------VTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDA 72
Query: 82 ESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSK 141
+ N Y W D+ + + G++S ++ V T ++W+ L ++Y + S
Sbjct: 73 ----VPRVNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSY 132
Query: 142 ACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIP 201
V Q R I + + L L+G P+ D+ V VL L +Y P
Sbjct: 133 GHVTQLRFITR-------------------FDQLALLGKPMDHDEQVERVLENLPDDYKP 192
Query: 202 IVCAIDDKDL-KTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQR 261
++ I KD + E+ LIN E L ++ ++ + ++ +R + + +
Sbjct: 193 VIDQIAAKDTPPSLTEIHERLINRESKLLALNSA-----EVVPITANVVTHRNTNTNRNQ 252
Query: 262 QFNPSNGNRGNDNN-SGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYM 321
N N N+NN S S+ S +G +N + + V+G + R
Sbjct: 253 NNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKR---CPQLH 312
Query: 322 RFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNLA 381
+F+ N ++ + +N A +S Y A WL DSGAT+H+T++ NL+
Sbjct: 313 QFQSTTNQQQSTSPFTPWQPRANLAVNSPYNAN-------NWLLDSGATHHITSDFNNLS 372
Query: 382 VKMDYN------------------------------VLNNILHVPEIRKNLISIASLTVD 441
Y LN +L+VP I KNLIS+ L
Sbjct: 373 FHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNT 432
Query: 442 NNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFA--GLWHN 501
N V VEF VKD + +L +++LY+ + S Q S A WH+
Sbjct: 433 NRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFASPCSKATHSSWHS 492
Query: 502 RLGHASSKVIKSVLKSCNVSTFLNES--LHFCDACQKGKSHRLPFSRSVSHTWQPLELVH 561
RLGH S ++ SV+ + ++ LN S L C C KSH++PFS S + +PLE ++
Sbjct: 493 RLGHPSLAILNSVISNHSLPV-LNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIY 552
Query: 562 CDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK 621
D+W SPI+SI Y+YY+ FVD FTR +YPLK K + +F +K LV NRF+ +I
Sbjct: 553 SDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIG 612
Query: 622 TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPL 681
TL +D GGEF +L +GI S PHT + NG+ ERKHRHIVEMGLTLL+ AS+P
Sbjct: 613 TLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPK 672
Query: 682 TYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKF 741
TYW AFS AVY+IN LPTP+L SP+++ F YP LR Y HK
Sbjct: 673 TYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKL 732
Query: 742 QHHSTKCVFLGYSLAHKGYKCLS-SSGRLFISCHVVFNESEFPFK--------------- 801
+ S +C F+GYSL Y CL +GRL+ S HV F+E FPF
Sbjct: 733 EDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSD 792
Query: 802 -------------SELV----------------PSSGPSNIALVPHHSSPVAEFQTSSPT 861
+ LV P S PS + SS + SSP+
Sbjct: 793 SAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPS 852
Query: 862 LSTPPQDQYVSPQLSA--------------------HSPEGHSCPASVMPL----YTSSM 921
S P + PQ +A +SP +S P PL +S
Sbjct: 853 SSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNS-PNQNSPLPQSPISSPH 912
Query: 922 LPVDGSSDGSPELPLQISTA---------------------LNSHPMQTRAKSGIFKQKD 981
+P +S P P ST+ +N+H M TRAK GI K
Sbjct: 913 IPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQ 972
Query: 982 WGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLV-PLLPNLNLIGS 1012
+ +S + EP + +A+K +W+ AM EI N TW LV P P++ ++G
Sbjct: 973 --KYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGC 1032
BLAST of Lag0008934 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 277.7 bits (709), Expect = 5.2e-73
Identity = 259/994 (26.06%), Postives = 425/994 (42.76%), Query Frame = 0
Query: 91 MLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQF--- 150
M E+W +D+ + + +S + ++I+ T R +W LE +Y SK N+
Sbjct: 47 MKAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLY--MSKTLTNKLYLK 106
Query: 151 RGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAID 210
+ + + +L + L +G + +D +L L S Y
Sbjct: 107 KQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSY-------- 166
Query: 211 DKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNR--QSMFDNQRQFNPS 270
L++ +++ + T+ +L D+ + L LN + +NQ Q +
Sbjct: 167 -------DNLATTILHGKTTI-----------ELKDVTSALLLNEKMRKKPENQGQALIT 226
Query: 271 NGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCY-MRFEEH 330
G + S + YG +G G + + + + CY H
Sbjct: 227 EGRGRSYQRSSNNYGR-SGARGKSKNRS-----------------KSRVRNCYNCNQPGH 286
Query: 331 F-----NNLHASGNGSVQGNNSNNASSSA--------YIATPEILH----DPKWLADSGA 390
F N G S Q N+ N A+ E +H + +W+ D+ A
Sbjct: 287 FKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAA 346
Query: 391 TNHVT---------------------------ANARNLAVKMDYN---VLNNILHVPEIR 450
++H T A ++ +K + VL ++ HVP++R
Sbjct: 347 SHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLR 406
Query: 451 KNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIR 510
NLIS + +D + + +N K S V+ + R LY+ Q + +
Sbjct: 407 MNLIS--GIALDRDGYESYFANQKWRLTKGS-LVIAKGVARGTLYRTNAEICQGELNAAQ 466
Query: 511 STSFAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHT 570
LWH R+GH S K ++ + K +S ++ CD C GK HR+ F S
Sbjct: 467 DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERK 526
Query: 571 WQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVA 630
L+LV+ D+ GP I S+ G KY+++F+DD +R +Y LKTK + F F ++ LV
Sbjct: 527 LNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVE 586
Query: 631 NRFEKKIKTLQTDWGGEF--RSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGL 690
+K+K L++D GGE+ R F + +GI + P T Q NG+ ER +R IVE
Sbjct: 587 RETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVR 646
Query: 691 TLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVSP---W-EQAFHYPCLRLYQSHKFQ 750
++L A +P ++W EA +A Y+IN P+ L P W + Y L+++ F
Sbjct: 647 SMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFA 706
Query: 751 H-----------HSTKCVFLGYSLAHKGYKCLSSSGRLFI-SCHVVFNESEFPFKSELVP 810
H S C+F+GY GY+ + I S VVF ESE +++
Sbjct: 707 HVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADM-- 766
Query: 811 SSGPSNIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTS 870
S ++P+ F T T + P + + ++S + P V+
Sbjct: 767 -SEKVKNGIIPN-------FVTIPSTSNNPTSAESTTDEVSEQGEQ----PGEVIE---- 826
Query: 871 SMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSV 930
+ +G E+ P++ + + ++ V S + EP S+
Sbjct: 827 ---QGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYV--LISDDREPESL 886
Query: 931 KEAL---KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRC 990
KE L + +Q AM +E+ L +N T+ LV L + KW+FK+K+ + R
Sbjct: 887 KEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRY 946
Query: 991 KARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQE 1011
KARLV +GF Q G+DF E FSPVVK +I+ IL++A + + QLDV FL+G L+E
Sbjct: 947 KARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEE 968
BLAST of Lag0008934 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 212.6 bits (540), Expect = 2.0e-53
Identity = 261/1069 (24.42%), Postives = 433/1069 (40.51%), Query Frame = 0
Query: 42 DDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVDQ 101
D + Y +WK + A+L Q V V G L+ N + + W ++
Sbjct: 12 DGEKYAIWKFRIRALLAEQDVLKVVDG------------------LMPNEVDDSWKKAER 71
Query: 102 ALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGS-MK 161
+ +S + + T R++ + L+ VY S A R L + K S M
Sbjct: 72 CAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMS 131
Query: 162 MIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAID--DKDLKTWQELS 221
++ + I + L G + D +S++L L S Y I+ AI+ ++ T +
Sbjct: 132 LLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVK 191
Query: 222 SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGNDNNSGSY 281
+ L++ E + T+ + + + ++F N R P +GN
Sbjct: 192 NRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKN-RVTKPKKIFKGNSKYKVKC 251
Query: 282 YGSG-NGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHFNNLHASGN-GSV 341
+ G G + + ++ + +E E +M + NN N G V
Sbjct: 252 HHCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHGIAFM--VKEVNNTSVMDNCGFV 311
Query: 342 QGNNSNN---ASSSAYIATPEILHDPKWLADSGATNHVTANARNLA-VKMDYNV-LNNIL 401
+ +++ S Y + E++ P +A + + A R + ++ D+ + L ++L
Sbjct: 312 LDSGASDHLINDESLYTDSVEVV-PPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVL 371
Query: 402 HVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQT 461
E NL+S+ L + + +EF + + V +L N + + + Q
Sbjct: 372 FCKEAAGNLMSVKRLQ-EAGMSIEFDKSGVTISKNGLMVVKNSGMLNN----VPVINFQA 431
Query: 462 PKSEIRSTSFAGLWHNRLGHASSKVI-----KSVLKSCNVSTFLNESLHFCDACQKGKSH 521
+ + LWH R GH S + K++ ++ L S C+ C GK
Sbjct: 432 YSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQA 491
Query: 522 RLPFS--RSVSHTWQPLELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGE 581
RLPF + +H +PL +VH D+ GP V++ Y++ FVD FT Y +K K +
Sbjct: 492 RLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSD 551
Query: 582 AFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRS--FTSFLRDNGIEFRHSCPHTSQQNGI 641
FS F + F K+ L D G E+ S F GI + + PHT Q NG+
Sbjct: 552 VFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGV 611
Query: 642 VERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIINCLPTPILGDVS--PWE----QA 701
ER R I E T+++ A + ++W EA +A Y+IN +P+ L D S P+E +
Sbjct: 612 SERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKK 671
Query: 702 FHYPCLRLY----------QSHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVV- 761
+ LR++ + KF S K +F+GY G+K + FI V
Sbjct: 672 PYLKHLRVFGATVYVHIKNKQGKFDDKSFKSIFVGYE--PNGFKLWDAVNEKFIVARDVV 731
Query: 762 ------FNESEFPFKSELVPSSGPSNIALVPHHSSPVAEFQTSSPTLSTP---------- 821
N F++ + S S P+ S + QT P S
Sbjct: 732 VDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKI--IQTEFPNESKECDNIQFLKDS 791
Query: 822 --------PQD--QYVSPQLSAHSPE-------GHSCPASVMPLYTSSMLPVD-----GS 881
P D + + + S E S ++ L S D
Sbjct: 792 KESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESK 851
Query: 882 SDGSPELPLQISTA-------------------LNSHPMQTRAKSGI-FKQKD--WGAFL 941
G+P + TA +N + + K I + ++D +
Sbjct: 852 GSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVV 911
Query: 942 VNSSFSPEVEPTSVKEAL---KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWI 1001
+N+ P S E S W+ A+N E+ N TWT+ N N++ S+W+
Sbjct: 912 LNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWV 971
Query: 1002 FKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQ 1011
F VK + R KARLVA+GF Q +D+ ETF+PV + + + IL++ + N + Q
Sbjct: 972 FSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQ 1031
BLAST of Lag0008934 vs. ExPASy Swiss-Prot
Match:
P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)
HSP 1 Score: 111.7 bits (278), Expect = 4.9e-23
Identity = 61/127 (48.03%), Postives = 82/127 (64.57%), Query Frame = 0
Query: 827 MQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWT 886
M TR+K+GI K + + ++ + EP SV ALK W AM +E+ L+RNKTW
Sbjct: 1 MLTRSKAGINKLNPKYSLTITTTI--KKEPKSVIFALKDPGWCQAMQEELDALSRNKTWI 60
Query: 887 LVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTI 946
LVP N N++G KW+FK K S+ + DR KARLVA+GF+Q G+ F ET+SPVV+ TI
Sbjct: 61 LVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATI 120
Query: 947 QIILAVA 954
+ IL VA
Sbjct: 121 RTILNVA 125
BLAST of Lag0008934 vs. ExPASy TrEMBL
Match:
A0A803PM38 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 737.3 bits (1902), Expect = 8.8e-209
Identity = 450/1083 (41.55%), Postives = 612/1083 (56.51%), Query Frame = 0
Query: 21 ATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETT 80
A NI+ G L+ +KLD N+ LW+ MV AI+RG ++D Y+ GT +P E + +T
Sbjct: 32 APNIVVPQFGSTLNQPFALKLDRNNFSLWRTMVSAIVRGHRLDGYLKGTLPKPQEFLSST 91
Query: 81 TESGKML---LSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYG 140
G + N +E+W+ DQ L GWL+GSM+ IA +V+ + +W ALEE++G
Sbjct: 92 DLDGSVSSVGQVNPAFEQWIVNDQLLLGWLYGSMTEGIACEVMGCDSSASLWTALEELFG 151
Query: 141 DTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDS 200
SKA ++++R +Q +KG++ M DYL +Q ++ L L G P + LVS VL+GLD
Sbjct: 152 AHSKAKMDEYRTKIQTARKGALSMADYLRQKRQWADVLALAGEPYPENQLVSNVLSGLDI 211
Query: 201 EYIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFD 260
EY+P+V I+ + TWQ+L +L++ + + R + F T + +N +
Sbjct: 212 EYLPMVLLIEARGSTTWQQLQDMLLSLDSKMERLHS-----FSGSSKLTGVPMNPSASLA 271
Query: 261 NQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIF-REEITT 320
N+ +N N+NN G G N + NN ++G G T R
Sbjct: 272 NKGPHPGANRGNHNNNNRG---GHSNNRGSNNRSRGRG----------GRTSGPRPTCQV 331
Query: 321 C--YMRFEEHFNNLHASGNGSVQGNNSN-----NASSSAYIATPEILHDPKWLADSGATN 380
C Y H N AS + + + N N N +A L + G +
Sbjct: 332 CGKYGHSAAHCYNRGASNHITSEINKMNLKEEYNGKEKVTVANGNRLP----IHHIGLGS 391
Query: 381 HVTANARNLAVKMDYNVLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVKDKASK 440
T +A L +L ILHVP I KNL+SI+ LT DNNV VEF S+ C VKDK +
Sbjct: 392 LQTLSASPL-------ILKEILHVPSITKNLLSISKLTSDNNVCVEFLSDLCFVKDKETG 451
Query: 441 KVMLHEILRNDLYQIELPSIQTPKSEIRS----TSFAGL--------------------- 500
+V+L L++ LYQ + P+ T S RS TSF+GL
Sbjct: 452 QVVLKGKLKDGLYQFDAPTSTTSMSSNRSISCPTSFSGLVVSAVESNVTKPMANQLLCSI 511
Query: 501 ---WHNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPL 560
WH RLGH S +V+ +VL NV +N SL FCDACQ GKSH LPF + PL
Sbjct: 512 KDRWHRRLGHPSIRVLDTVLHKINVKN-INSSLSFCDACQLGKSHSLPFKVNPKRATAPL 571
Query: 561 ELVHCDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFE 620
ELVH D+WGPSPI+S ++YYI F+DDF+R IYPLK K EA ++F Q+KLLV N+F
Sbjct: 572 ELVHTDIWGPSPIMSNTNFRYYIHFIDDFSRYTWIYPLKAKSEALAAFVQFKLLVENQFN 631
Query: 621 KKIKTLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQA 680
++K +QTDWGGE++ F F D+GI F+H CPHTS QNG ERKHRHIVEMGLTLLAQA
Sbjct: 632 SRVKRVQTDWGGEYQGFPRFGSDHGIGFQHPCPHTSGQNGRAERKHRHIVEMGLTLLAQA 691
Query: 681 SMPLTYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQ 740
+P YWW+AF +AVY+IN LPTP+L +P+E F +PCLR YQ
Sbjct: 692 HVPQKYWWDAFQTAVYLINRLPTPVLKLKTPFEVLFKQQPDYKFLKVFGVSCFPCLRAYQ 751
Query: 741 SHKFQHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSN- 800
+HKFQ HSTKCV LGYS HKGYKCLSS+GRL+IS V+FNE EFPFKS + ++ P
Sbjct: 752 NHKFQFHSTKCVNLGYSDKHKGYKCLSSTGRLYISRDVIFNEDEFPFKSGFLNTNKPETP 811
Query: 801 -IALVP----------------HHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAH------ 860
LVP SS + QT TP + V P LS
Sbjct: 812 VSVLVPFWTASSFVNSQSSSQNDFSSSIGNNQTDEVDHGTPTTSRVV-PDLSTFQGNDTD 871
Query: 861 ---SPEGHSCPASVMPL--------YTSSMLPVDGSSDGSPELPLQISTALNSHPMQTRA 920
S G+ S + + S+ P+D S+ + +++HPM TRA
Sbjct: 872 HVISDFGNIDRISDVQIQQHADTTTLESAADPIDTSASDH-----NLKAVVSTHPMITRA 931
Query: 921 KSGIFKQKDW---GAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLV 980
K+GIFK K + ++ NSS EP S++EAL+ W AM+ E+ L RN TW LV
Sbjct: 932 KAGIFKPKTYLTQTKWIGNSS-----EPQSIEEALQHKGWNNAMSSEVHALARNGTWKLV 991
Query: 981 PLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQI 1012
P LP++++I +KW++K KR ++ SF R KARLVA+GF Q PGVDF ETFSPV+KA T++I
Sbjct: 992 PRLPHMHIIDNKWVYKEKRNADGSFQRLKARLVAKGFTQRPGVDFSETFSPVIKASTVRI 1051
BLAST of Lag0008934 vs. ExPASy TrEMBL
Match:
A0A2Z6P4D5 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_412550 PE=4 SV=1)
HSP 1 Score: 725.7 bits (1872), Expect = 2.7e-205
Identity = 422/1060 (39.81%), Postives = 601/1060 (56.70%), Query Frame = 0
Query: 19 SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIE 78
SSA N S + L ++++VKLD NY LWK +VL+++RG K+D Y+LGT P + +
Sbjct: 2 SSAAN---SPKKNDLPSIISVKLDRDNYPLWKSLVLSLIRGCKLDGYILGTTECPEQFVT 61
Query: 79 TTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGD 138
+ +S K+ N + +W+ DQAL GWL SM+ IA +++ +T +++W + + G
Sbjct: 62 SADKSKKV---NPDFGDWIANDQALLGWLMNSMAIDIATQLLHCETSKQLWDETQSLAGA 121
Query: 139 TSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSE 198
+K+ + + NT+KG MKM +YL MK S+ LKL G+P+S DL+ L GLD+E
Sbjct: 122 HTKSRITYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLAGSPISNSDLMIQTLNGLDAE 181
Query: 199 YIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDN 258
Y P+V + D+ +W ++ + L+ FE L +++ + L LN + F N
Sbjct: 182 YNPVVVKLSDQINLSWVDVQAQLLAFESRLDQFNN-----------FSGLTLNASANFAN 241
Query: 259 QRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR 318
+ +F S GN N G G G G+M N Q GT + ++
Sbjct: 242 KTEFRGNKFNSRGNWRRSNFRGMRGGRGKGRMSNTKCQVCNGTGHIAVD----------- 301
Query: 319 EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHV 378
C RF+ + + S QG S SA+IA+P D +W DSGA NHV
Sbjct: 302 -----CSYRFDRPYTGRNYSTEADKQG------SHSAFIASPYHGQDYEWYFDSGANNHV 361
Query: 379 T--------------------ANARNLAV------KMDYNVLNNILHVPEIRKNLISIAS 438
T N L + K++ L+++L+VP+I KNL+S++
Sbjct: 362 THQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLNNLNLHDVLYVPQITKNLLSVSK 421
Query: 439 LTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLW 498
LT DNN++VEF +N C VKDK + + +L L++ LYQ+ + K S W
Sbjct: 422 LTADNNILVEFDANCCSVKDKLTGQTLLKGRLKDGLYQL------SNKEPCVYMSVKESW 481
Query: 499 HNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVH 558
H +LGH ++KV+ VLK CNV ++ FC+ACQ GK H LPF S SH +PL L+H
Sbjct: 482 HRKLGHPNNKVLDKVLKDCNVKISHSDQFSFCEACQFGKLHLLPFKPSSSHVQEPLALIH 541
Query: 559 CDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK 618
D+WGP+PI+S G+KYY+ F+DDF+R I+PLK K + +F Q+K L N+F KKIK
Sbjct: 542 SDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIK 601
Query: 619 TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPL 678
+Q D GGE+++ + GI+FR SCP+TSQQNG ERKHRH+ E+GLTLLAQA MPL
Sbjct: 602 IIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQAKMPL 661
Query: 679 TYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKF 738
YWWEAFS+AVY+IN LP+ + + SP+ F YPCL+ Y HK
Sbjct: 662 RYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKL 721
Query: 739 QHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGP------S 798
Q H+T+CVF+GYS +HKGYKC++S GR+F+S HV+FNE+ FPF + + P +
Sbjct: 722 QFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVIFNENHFPFHGGFLDTKNPLKTLTDN 781
Query: 799 NIALVPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPV 858
+ L+P S+ P +T S + S ++ +S + T++
Sbjct: 782 SSILLPTCSAGATTQDAIEPDNNTTSDQNTHSIESSDNNENEEQVDSSEFFVNTNNSSTQ 841
Query: 859 DGSSDGSPELPLQISTAL-------------NSHPMQTRAKSGIFKQKDWGAFLVNSSFS 918
D +D S + + ++ + N+H M+TR+K GI K K + + S
Sbjct: 842 DIEADNSVDSEDRNNSTMTGTIQQQAQQDNSNTHWMRTRSKDGIHKPKIPYVGMAETD-S 901
Query: 919 PEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNS 978
E EP SVKEAL WK AM+ E L N TWTLVP N+I SKWIFK K KS+
Sbjct: 902 EEKEPKSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTKYKSDG 961
Query: 979 SFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLN 1012
S +R KARLVA+GF Q G+DF ETFSPVVK+ T++IIL +AV NW +RQLD+NN FLN
Sbjct: 962 SIERRKARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLN 1015
BLAST of Lag0008934 vs. ExPASy TrEMBL
Match:
A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)
HSP 1 Score: 723.0 bits (1865), Expect = 1.7e-204
Identity = 417/1054 (39.56%), Postives = 588/1054 (55.79%), Query Frame = 0
Query: 14 AVAASSSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQP 73
A AA S+ N + SS ++VKLD NY LWK +VL ++RG K+D Y+LGT+ P
Sbjct: 2 ASAAGSNNKNDLPSS--------VSVKLDRNNYPLWKSLVLPVIRGCKLDGYMLGTEGCP 61
Query: 74 SELIETTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALE 133
E I T+++S K N + EW DQ L GW+ SM+ IA +++ +T +++W +
Sbjct: 62 EEFI-TSSDSSKN--KNSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQ 121
Query: 134 EVYGDTSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLA 193
+ G +++ + + + +KG MKM DYL MK + LKL GNPVS DL+ L
Sbjct: 122 SLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLN 181
Query: 194 GLDSEYIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQ 253
GLDSEY P+V + D+ +W +L + L+ FE + + + TN + AT NR
Sbjct: 182 GLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFESRIEQLNNLTNLTLN----ATANVANR- 241
Query: 254 SMFDNQRQFNPSNGNRGNDNNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREE 313
+ + SN N N+ G G G G+ G NP Q +
Sbjct: 242 ----SDHRGKSSNNNWRGSNSRGWRGGRGRGKSGKNPCQVCG-------------LSNHI 301
Query: 314 ITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVT- 373
C+ RF++ ++ + S QG S +A++A+ + D W DSGA+NHVT
Sbjct: 302 AIDCFHRFDKTYSRSNHSAGHDKQG------SHNAFLASQNSVEDYDWYFDSGASNHVTH 361
Query: 374 -------------------ANARNLAV------KMDYNVLNNILHVPEIRKNLISIASLT 433
N LA+ K+ L++IL+VP I KNL+S++ L
Sbjct: 362 QTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKLKSLNLHDILYVPNITKNLLSVSKLA 421
Query: 434 VDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLWHN 493
DNN++VEF N C VKDK + KV+L +L++ LYQ+ T ++ S WH
Sbjct: 422 ADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLS----GTKRNPSAFVSVKESWHR 481
Query: 494 RLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCD 553
RLGH ++KV+ VL+SC V +++ FC+ACQ GK H LPF S SH +PLELVH D
Sbjct: 482 RLGHPNNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTD 541
Query: 554 LWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTL 613
+WGP+PI++ G+KYY+ FVDDF+R IYPLK K E +F Q+K L N+F K+IK +
Sbjct: 542 VWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVI 601
Query: 614 QTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTY 673
Q D GGE++ + GI+FR SCP+TSQQNG ERKHRHI E GLTLLAQA MPL Y
Sbjct: 602 QCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHY 661
Query: 674 WWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKFQH 733
WWEAFS+AVY+IN LP+ + + SP+ YPCL+ Y HK Q+
Sbjct: 662 WWEAFSTAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQY 721
Query: 734 HSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL---- 793
H+T+CVFLGYS +HKGYKCL+S GR+FIS HV+FNE FPF + + P +
Sbjct: 722 HTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPFHDGFLNTRSPLKTTINVPS 781
Query: 794 -----------VPHHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLY 853
+ S P+ E + + T + QD +++ + + ++ P+ +
Sbjct: 782 TSFPLCTAGNVIDDASMPILEAENPAETNTEDSQD------VNSDTEQTNNGPSEDNTTH 841
Query: 854 TSSMLPVDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPT 913
++ S G SH + TR+KSGI K K + ++ +EP
Sbjct: 842 EETLDITQQQSVGEAS-----QNTNTSHAIHTRSKSGIHKPK-LPYIGLTETYKDTMEPA 901
Query: 914 SVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCK 973
+ KEAL WK AM E L NKTW LVP N++ SKW+FK K K + S +R K
Sbjct: 902 NAKEALSRPLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLERRK 961
Query: 974 ARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEA 1012
ARLVA+GF Q G+D+ ETFSPV+KA T++IIL++AV NW +RQLD+NN FLNG L+E
Sbjct: 962 ARLVAKGFQQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLKET 1000
BLAST of Lag0008934 vs. ExPASy TrEMBL
Match:
A0A2K3MUJ9 (Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g017679 PE=4 SV=1)
HSP 1 Score: 712.2 bits (1837), Expect = 3.0e-201
Identity = 417/1048 (39.79%), Postives = 595/1048 (56.77%), Query Frame = 0
Query: 33 LSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNML 92
L + ++VKLD N+ LWK +VL ++RG K D Y+LGTK P + + + + K+ N
Sbjct: 12 LPSTVSVKLDRDNFPLWKSLVLPLIRGCKYDGYMLGTKKCPDQFVTSIDNTEKI---NPD 71
Query: 93 YEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQ 152
Y++W DQAL GWL SM+ IA V++ +T +++W + + G +++ + +
Sbjct: 72 YQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLWDEAQSLAGAHTRSRIIYLKSEFH 131
Query: 153 NTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKDLK 212
NT K MKM YLA MK ++ LKL G+P+S DL+ L GLDSEY P+V + D+
Sbjct: 132 NTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMIQTLNGLDSEYNPVVVKLSDQTNI 191
Query: 213 TWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDNQRQFNPSNGNRGND 272
+W + + L+ FE L + + N + + + + A +S +F G RG+
Sbjct: 192 SWVDFQAQLLAFESRLDQLNNFNNINL---NASANFASKNES---GGNKFGSRGGWRGS- 251
Query: 273 NNSGSYYGSGNGQMGNNPTQGTAMVEIEEEEVEGATIFREEITTCYMRFEEHF--NNLHA 332
N+ G G G +M P + F CY RF++ + N +A
Sbjct: 252 NSRGMRGGRGRARMSKPP----------RPICQICGKFGHTAAQCYYRFDKSYTEKNHYA 311
Query: 333 SGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHVTANARNL------------ 392
G G S SA++A+P D +W DSGA+NHVT + L
Sbjct: 312 EGEG----------SHSAFVASPYHGQDYEWYFDSGASNHVTHQSGQLQDLNENNGKNSL 371
Query: 393 --------------AVKMDYNVLNNILHVPEIRKNLISIASLTVDNNVVVEFHSNYCVVK 452
+ K++ L N+L+VPEI KNL+S++ LT+DNN +VEF NYC VK
Sbjct: 372 LVGNGEKLKILASGSTKLNDVNLRNVLYVPEITKNLLSVSKLTIDNNALVEFDENYCYVK 431
Query: 453 DKASKKVMLHEILRNDLYQI----ELPSIQTPKSEIRSTSFAGLWHNRLGHASSKVIKSV 512
DK + K +L L++ LYQ+ E P+ + P + I S +WH +LGH ++KV++ V
Sbjct: 432 DKLTGKALLKGRLKDGLYQLSANKEPPTNKDPCAYI---SLKEIWHRKLGHPNNKVLEKV 491
Query: 513 LKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWGPSPIVSIVGY 572
LK NV ++ FC+ACQ GK H LPF S SH +PL+L+H D+WGP+PI+S +
Sbjct: 492 LKDNNVKISPSDKFTFCEACQFGKLHLLPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNF 551
Query: 573 KYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIKTLQTDWGGEFRSFTS 632
KYY+ F+DDF+R I+PLK K E +F+Q+K LV N+F KKIK ++ D GGE++
Sbjct: 552 KYYVHFLDDFSRFTWIFPLKQKSETIHAFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQK 611
Query: 633 FLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPLTYWWEAFSSAVYIIN 692
D+GI+F+ SCP+TSQQNG ERKHRH+ E+GLTLLAQA MPL+YWWEAFS+AVY+IN
Sbjct: 612 CAIDSGIQFQMSCPYTSQQNGRAERKHRHVTELGLTLLAQAKMPLSYWWEAFSTAVYLIN 671
Query: 693 CLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKFQHHSTKCVFLGYSLA 752
LP+ + + SP+ F YPCL+ Y HK Q H+T+CVFLGYS +
Sbjct: 672 RLPSSVNPNESPYTLVFKKEPDYTALKPFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNS 731
Query: 753 HKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIAL------VPHHSSPVAEF 812
HKGYKC++S GR+F+S HVVFNE+ FPF+ + + P + P + +
Sbjct: 732 HKGYKCVNSHGRVFVSRHVVFNENHFPFQEGFLDTRNPIKVVTNDTPIGFPSFPAGITTN 791
Query: 813 QTSSPTLSTPPQ------------DQYVSPQLSAHSPEGH----SCPASVMPLYTSSMLP 872
T+ T + Q DQ V H+ E + S SM
Sbjct: 792 NTAEATDNIVDQQEPELNDINTVADQSVESDTFEHTDENNFSNGETEDSTEAAGRESMEE 851
Query: 873 VDGSSDGSPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEAL 932
+ + P Q T N+H M+TR+K+G++K K L + + EP SV EAL
Sbjct: 852 ISQPITETNPPPQQDIT--NTHWMRTRSKAGVYKPKLPYIGLTEEAKEGK-EPESVSEAL 911
Query: 933 KSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQ 992
+W AM+ E L NKTWTLVP N+I SKWIFK K K++ + +R KARLVA+
Sbjct: 912 SIPEWLNAMDAEYKALMNNKTWTLVPFEGQENVISSKWIFKTKYKADGTIERRKARLVAR 971
Query: 993 GFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQL 1012
GF Q GVD+ ETFSPVVK+ T++IIL++AV +W +RQLD+NN FLNG L+E+V+M Q
Sbjct: 972 GFQQTAGVDYDETFSPVVKSSTVRIILSIAVHLSWEVRQLDINNAFLNGNLKESVFMHQP 1023
BLAST of Lag0008934 vs. ExPASy TrEMBL
Match:
A0A2K3NEN7 (Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g024786 PE=4 SV=1)
HSP 1 Score: 684.9 bits (1766), Expect = 5.2e-193
Identity = 412/1041 (39.58%), Postives = 584/1041 (56.10%), Query Frame = 0
Query: 19 SSATNIISSSLGHPLSTVLTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIE 78
SSA N S+ + L ++++VKLD NY LWK +VL ++RG K D Y+LGTK P + +
Sbjct: 2 SSAAN---SNKKNDLPSIISVKLDRDNYPLWKSLVLPLIRGCKFDGYILGTKECPEQFVT 61
Query: 79 TTTESGKMLLSNMLYEEWMTVDQALSGWLFGSMSPAIAADVINFKTLREVWKALEEVYGD 138
+ +S K+ N +++WM DQAL GWL SM+ IA +++ +T +++W + + G
Sbjct: 62 SADKSKKV---NPDFQDWMADDQALLGWLMNSMAIDIATQLLHCETSKQLWDEAQSLAGA 121
Query: 139 TSKACVNQFRGILQNTKKGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSE 198
+K+ + + NT+KG MKM +YL MK S+ LKL G+P+S DL+ L GLD+E
Sbjct: 122 HTKSRIIYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLSGSPISNSDLMIQTLNGLDAE 181
Query: 199 YIPIVCAIDDKDLKTWQELSSILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDN 258
Y P+V + D+ +W ++ + L+ FE L D + + L LN + F N
Sbjct: 182 YNPVVVKLSDQINLSWVDVQAQLLAFESRL-----------DQLNNFSGLTLNASANFAN 241
Query: 259 QRQFN----PSNGNRGNDNNSGSYYGSGNGQMGNNPTQ---GTAMVEIEEEEVEGATIFR 318
+ +F S GN N G G G G+M N Q GT ++
Sbjct: 242 KTEFRGNKFHSRGNWRRSNFRGMRGGRGKGRMSNTKCQVCSGTGHTAVD----------- 301
Query: 319 EEITTCYMRFEEHFNNLHASGNGSVQGNNSNNASSSAYIATPEILHDPKWLADSGATNHV 378
C RF+ + + S QG S SA++A+P D +W DSGA+NHV
Sbjct: 302 -----CSYRFDRSYTGRNYSTEADKQG------SHSAFVASPYHGQDYEWYFDSGASNHV 361
Query: 379 T--------------------ANARNLAV------KMDYNVLNNILHVPEIRKNLISIAS 438
T N L + K++ L+++L+VP+I KNL+S++
Sbjct: 362 THQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLNTLNLHDVLYVPQITKNLLSVSK 421
Query: 439 LTVDNNVVVEFHSNYCVVKDKASKKVMLHEILRNDLYQIELPSIQTPKSEIRSTSFAGLW 498
LT DNN+ VEF +N C VKDK + + +L L++ LYQ+ S Q+ K S W
Sbjct: 422 LTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQLSDVSPQSNKDPCVYMSVKESW 481
Query: 499 HNRLGHASSKVIKSVLKSCNVSTFLNESLHFCDACQKGKSHRLPFSRSVSHTWQPLELVH 558
H +LGH ++KV++ VLK CNV ++ FC+ACQ GK H LPF S SH +PL L+H
Sbjct: 482 HRKLGHPNNKVLEKVLKDCNVKISPSDQFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIH 541
Query: 559 CDLWGPSPIVSIVGYKYYISFVDDFTRLAHIYPLKTKGEAFSSFSQYKLLVANRFEKKIK 618
D+WGP+PI+S G+KYY+ F+DDF+R I+PLK K + +F Q+K L N+F KKIK
Sbjct: 542 SDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIK 601
Query: 619 TLQTDWGGEFRSFTSFLRDNGIEFRHSCPHTSQQNGIVERKHRHIVEMGLTLLAQASMPL 678
+Q D GGE+++ + GI+FR SCP+TSQQNG ERKHRH+VE+GLTLLAQA MPL
Sbjct: 602 IIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPL 661
Query: 679 TYWWEAFSSAVYIINCLPTPILGDVSPWEQAFH---------------YPCLRLYQSHKF 738
YWWEAFS+AVY+IN L + + + SP+ F YPCL+ Y HK
Sbjct: 662 RYWWEAFSTAVYLINRLSSSVNPNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKL 721
Query: 739 QHHSTKCVFLGYSLAHKGYKCLSSSGRLFISCHVVFNESEFPFKSELVPSSGPSNIALVP 798
Q H+T+CVF+GYS +HKG + G S + + N+ + + S+ S+
Sbjct: 722 QFHTTRCVFMGYSNSHKGSTTQDAIG----SDNNIVNDQD-TTNDQNTHSTESSDNNEEE 781
Query: 799 HHSSPVAEFQTSSPTLSTPPQDQYVSPQLSAHSPEGHSCPASVMPLYTSSMLPVDGSSDG 858
H + + T++ + D +V + +SP + G+S
Sbjct: 782 HADNSESFVNTNNGSTQDIEVDNFVDSE-DRNSP------------------TITGTSQQ 841
Query: 859 SPELPLQISTALNSHPMQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKA 918
Q +T N+H ++TR+K+GI K K + + S E EP SVKEAL WK
Sbjct: 842 QAH---QDNT--NTHGIRTRSKNGIHKPKLPYVGMTETD-SEEKEPESVKEALDKPMWKE 901
Query: 919 AMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPG 978
AM+ E L N TWTLVP N+I SKWIFK K KS+ S +R KARLVA+GF Q G
Sbjct: 902 AMDKEYKALMSNYTWTLVPFQAQENIIDSKWIFKTKYKSDGSIERRKARLVAKGFQQTAG 961
Query: 979 VDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRLQEAVYMRQLTGYIDQS 1012
+DFHETFSPVVK+ T++IIL +AV NW +RQLD+NN FLNG+L+E V+M Q GYID +
Sbjct: 962 LDFHETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLKETVFMHQPEGYIDTT 973
BLAST of Lag0008934 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )
HSP 1 Score: 148.7 bits (374), Expect = 2.6e-35
Identity = 72/160 (45.00%), Postives = 105/160 (65.62%), Query Frame = 0
Query: 855 EPTSVKEALKSSQWKAAMNDEIAVLTRNKTWTLVPLLPNLNLIGSKWIFKVKRKSNSSFD 914
EP++ EA + W AM+DEI + TW + L PN IG KW++K+K S+ + +
Sbjct: 85 EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144
Query: 915 RCKARLVAQGFNQVPGVDFHETFSPVVKAPTIQIILAVAVMKNWSIRQLDVNNIFLNGRL 974
R KARLVA+G+ Q G+DF ETFSPV K ++++ILA++ + N+++ QLD++N FLNG L
Sbjct: 145 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204
Query: 975 QEAVYMRQLTGYI----DQSCPDYVCKLDKALYGLRQASR 1011
E +YM+ GY D P+ VC L K++YGL+QASR
Sbjct: 205 DEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASR 244
BLAST of Lag0008934 vs. TAIR 10
Match:
ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )
HSP 1 Score: 111.7 bits (278), Expect = 3.5e-24
Identity = 61/127 (48.03%), Postives = 82/127 (64.57%), Query Frame = 0
Query: 827 MQTRAKSGIFKQKDWGAFLVNSSFSPEVEPTSVKEALKSSQWKAAMNDEIAVLTRNKTWT 886
M TR+K+GI K + + ++ + EP SV ALK W AM +E+ L+RNKTW
Sbjct: 1 MLTRSKAGINKLNPKYSLTITTTI--KKEPKSVIFALKDPGWCQAMQEELDALSRNKTWI 60
Query: 887 LVPLLPNLNLIGSKWIFKVKRKSNSSFDRCKARLVAQGFNQVPGVDFHETFSPVVKAPTI 946
LVP N N++G KW+FK K S+ + DR KARLVA+GF+Q G+ F ET+SPVV+ TI
Sbjct: 61 LVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATI 120
Query: 947 QIILAVA 954
+ IL VA
Sbjct: 121 RTILNVA 125
BLAST of Lag0008934 vs. TAIR 10
Match:
AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 67.4 bits (163), Expect = 7.5e-11
Identity = 62/257 (24.12%), Postives = 118/257 (45.91%), Query Frame = 0
Query: 37 LTVKLDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEW 96
+T+ L+ NY +W+++ + V ++ G+ + P+ + E + W
Sbjct: 24 VTLDLNKLNYDVWRELFETLCLSFGVLGHIDGS-STPTPMTE---------------KRW 83
Query: 97 MTVDQALSGWLFGSMSPAIAADVINFK-TLREVWKALEEVYGDTSKACVNQFRGILQNTK 156
D + W++G+++ ++ +I T R++W +LE ++ D +A QF L+ T
Sbjct: 84 KERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTTT 143
Query: 157 KGSMKMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD-LKTW 216
+ + +Y +K S+ L V +P+S LV ++L GL +Y I+ I K ++
Sbjct: 144 IDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPSF 203
Query: 217 QELSSILINFEGTLARYSTPTNAHFDLPDLATHL--ALNRQSMFDNQRQFNPSNGNRGND 276
E S+L+ E L+ S + +H + P L+ L +Q + + N SN RG
Sbjct: 204 TEARSMLLMEESRLSNKSKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGRGRS 263
Query: 277 NNSGSYYGSGNGQMGNN 290
GS +G+ NN
Sbjct: 264 KKKNRGGGSSDGRYNNN 264
BLAST of Lag0008934 vs. TAIR 10
Match:
ATMG00300.1 (Gag-Pol-related retrotransposon family protein )
HSP 1 Score: 67.0 bits (162), Expect = 9.8e-11
Identity = 35/95 (36.84%), Postives = 53/95 (55.79%), Query Frame = 0
Query: 438 RNDLYQIELPSIQTPKSEIRSTS--FAGLWHNRLGHASSKVIKSVLKSCNVSTFLNESLH 497
R+D I S++T +S + T+ LWH+RL H S + ++ ++K + + SL
Sbjct: 43 RHDSLYILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLK 102
Query: 498 FCDACQKGKSHRLPFSRSVSHTWQPLELVHCDLWG 531
FC+ C GK+HR+ FS T PL+ VH DLWG
Sbjct: 103 FCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWG 137
BLAST of Lag0008934 vs. TAIR 10
Match:
AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 58.2 bits (139), Expect = 4.6e-08
Identity = 56/245 (22.86%), Postives = 104/245 (42.45%), Query Frame = 0
Query: 41 LDDKNYLLWKDMVLAILRGQKVDMYVLGTKAQPSELIETTTESGKMLLSNMLYEEWMTVD 100
+++ NY W+++ L V ++ GT +L +N W D
Sbjct: 26 IEESNYDAWRELFLTHCLSFDVMGHIDGT----------------LLPTNANDVNWQKRD 85
Query: 101 QALSGWLFGSMSP-AIAADVINFKTLREVWKALEEVYGDTSKACVNQFRGILQNTKKGSM 160
+ L+G+++P + T R++W ++ + + A + L+ G M
Sbjct: 86 GIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDSELRTKDIGDM 145
Query: 161 KMIDYLAIMKQASENLKLVGNPVSLDDLVSYVLAGLDSEYIPIVCAIDDKD-LKTWQELS 220
++ DY MK+ +++L+ V PV+ +LV YVL GL+ ++ I+ I + ++ + +
Sbjct: 146 RVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFDDAA 205
Query: 221 SILINFEGTLARYSTPTNAHFDLPDLATHLALNRQSMFDN-QRQFNPSNGNRGNDNNSGS 280
++L E L R P H D +T LA + N QR G RG +
Sbjct: 206 TMLQEEEDRLKRAIKPNPTHVDHSSSSTVLACSEAPPVTNFQRSGGNQMGYRGRGRGNNI 254
Query: 281 YYGSG 283
+ G G
Sbjct: 266 FRGRG 254
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
GAU51268.1 | 5.5e-205 | 39.81 | hypothetical protein TSUD_412550 [Trifolium subterraneum] | [more] |
GAU19483.1 | 3.5e-204 | 39.56 | hypothetical protein TSUD_77270 [Trifolium subterraneum] | [more] |
PNX94503.1 | 6.3e-201 | 39.79 | putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense... | [more] |
PNY01489.1 | 1.1e-192 | 39.58 | copia-like polyprotein, partial [Trifolium pratense] | [more] |
KYP50444.1 | 1.4e-189 | 40.02 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | [more] |
Match Name | E-value | Identity | Description | |
Q94HW2 | 4.7e-143 | 33.75 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... | [more] |
Q9ZT94 | 5.1e-137 | 32.69 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... | [more] |
P10978 | 5.2e-73 | 26.06 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... | [more] |
P04146 | 2.0e-53 | 24.42 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3 | [more] |
P92520 | 4.9e-23 | 48.03 | Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ... | [more] |
Match Name | E-value | Identity | Description | |
A0A803PM38 | 8.8e-209 | 41.55 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A2Z6P4D5 | 2.7e-205 | 39.81 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ... | [more] |
A0A2Z6MBG6 | 1.7e-204 | 39.56 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ... | [more] |
A0A2K3MUJ9 | 3.0e-201 | 39.79 | Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium prat... | [more] |
A0A2K3NEN7 | 5.2e-193 | 39.58 | Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g024786... | [more] |
Match Name | E-value | Identity | Description | |
AT4G23160.1 | 2.6e-35 | 45.00 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | [more] |
ATMG00820.1 | 3.5e-24 | 48.03 | Reverse transcriptase (RNA-dependent DNA polymerase) | [more] |
AT5G48050.1 | 7.5e-11 | 24.12 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... | [more] |
ATMG00300.1 | 9.8e-11 | 36.84 | Gag-Pol-related retrotransposon family protein | [more] |
AT1G34070.1 | 4.6e-08 | 22.86 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... | [more] |