Homology
BLAST of Lag0039869 vs. NCBI nr
Match:
GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])
HSP 1 Score: 847.4 bits (2188), Expect = 1.9e-241
Identity = 536/1457 (36.79%), Positives = 714/1457 (49.00%), Query Frame = 0
Query: 33 TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSG 92
++KLDR N+ LWK+L LP++R KL+G++ G CPE+F+ ++ +S N
Sbjct: 18 SVKLDRNNYPLWKSLVLPVIRGCKLDGYMLGTEGCPEEFITSSDSSKNK----------- 77
Query: 93 AVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQE 152
N + W A DQ LLGW+ NSMT+E+ATQ++ E +++LW Q
Sbjct: 78 ---------------NSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQS 137
Query: 153 LFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLG 212
L G +R++ YL+ F RKG KM DYL MKN D L AG+PV+T LI Q L G
Sbjct: 138 LAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLNG 197
Query: 213 LDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDS 272
LD EYNPVV + D+ +SW ++QA+LL FE R+E NNL +L+ A+ N+AN
Sbjct: 198 LDSEYNPVVVKLSDQTTLSWVDLQAQLLTFESRIEQLNNL---TNLTLNATANVAN---- 257
Query: 273 GNQRNQSYGGRTNNGFGRGNQRG-AGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQ 332
R+ G +NN + N RG GGRGRG+ S K CQVCG + H A+ C+
Sbjct: 258 ---RSDHRGKSSNNNWRGSNSRGWRGGRGRGK-------SGKNPCQVCGLSNHIAIDCFH 317
Query: 333 RFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSG 392
RF+K + N G ++Q + N AF+A+QN +V D +WY DSG
Sbjct: 318 RFDKTY----SRSNHSAGHDKQGSHN------AFLASQN---------SVEDYDWYFDSG 377
Query: 393 ASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSI 452
ASNHVT + E+ GK S+ +G+G KL I G+S L K+ L +L VP+I
Sbjct: 378 ASNHVTHQTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKL----KSLNLHDILYVPNI 437
Query: 453 AKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNE 512
KNL+SVSKLA DN + VEF + C VKDK TGKV+LKG+L +GLY+ G K
Sbjct: 438 TKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLSGTK-------- 497
Query: 513 LPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVN 572
S FV K WH+RLGHP+ +VL V++SC + + +
Sbjct: 498 -------------RNPSAFV--------SVKESWHRRLGHPNNKVLDKVLESCKVKVPPS 557
Query: 573 ESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFS 632
++F FCEACQYGK H LPF +S+SHA + +LVHTD+WGPAP+ + +GFKYY+ F+DDFS
Sbjct: 558 DNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFS 617
Query: 633 RFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRL 692
RF WIYPLK+K+E ++AF+ F L +NQFN IK +Q D GGEY + +L E G++ R+
Sbjct: 618 RFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRM 677
Query: 693 SCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKS 752
SCP+TSQQNGRAERKHRH+ E GLTLLAQA MPL +W +AF AV LIN LPSQV +S
Sbjct: 678 SCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNES 737
Query: 753 PMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSG 812
P L+ K+ D++ L+ FG AC+PCL+ Y HK QYHT +CV+LG S HKG+KCL+S G
Sbjct: 738 PYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHG 797
Query: 813 RLFISRHVRFNEEEFPFASGFQQVVTTAASSSNV-SPSIPLWFSNISGPDIKTTQQNREV 872
R+FISRHV FNE+ FPF GF + ++ NV S S PL T +
Sbjct: 798 RIFISRHVIFNEDHFPFHDGFLNTRSPLKTTINVPSTSFPLC----------TAGNVIDD 857
Query: 873 PSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESS 932
S + E P T + S DV + D+ + +E++ + E+
Sbjct: 858 ASMPILEAENPAETNTEDSQDV----------------NSDTEQTNNGPSEDNTTHEETL 917
Query: 933 ASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALAT 992
TQ ++ H + TR K+GI KPKL LT D EP +AL+
Sbjct: 918 DITQQQSVGEASQNTNTSHAIHTRSKSGIHKPKLPYIGLTETYKD--TMEPANAKEALSR 977
Query: 993 PQWREAMD---------------------------------------------------- 1052
P W+EAM
Sbjct: 978 PLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLERRKARLVAKGF 1037
Query: 1053 -----------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPG 1112
PV+KAST+R+ILSIAV W++RQLD NNAFLNG L E V+M QP G
Sbjct: 1038 QQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLKETVFMHQPEG 1097
Query: 1113 YEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGYS--------IIFKYRD---- 1172
+ D+ N++CKL KAIYGLKQAPRAW ++LK LL+WG+ + K +D
Sbjct: 1098 FVDSTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDSSLFLLKGKDHITF 1157
Query: 1173 ------------------------------------------------------------ 1207
Sbjct: 1158 LLIYVDDIIVTGSNGKFLQAFIKQLNDAFSLKDLGHLHYFLGIEVQRDASGMYLKQSKYI 1217
BLAST of Lag0039869 vs. NCBI nr
Match:
GAU51268.1 (hypothetical protein TSUD_412550 [Trifolium subterraneum])
HSP 1 Score: 789.6 bits (2038), Expect = 4.8e-224
Identity = 524/1496 (35.03%), Positives = 717/1496 (47.93%), Query Frame = 0
Query: 16 TTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAA 75
+++ NSP N L I ++KLDR N+ LWK+L L ++R KL+G++ G + CPE+FV +A
Sbjct: 2 SSAANSPKKND-LPSIISVKLDRDNYPLWKSLVLSLIRGCKLDGYILGTTECPEQFVTSA 61
Query: 76 SASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVAT 135
S +++NP + W+A DQ LLGWL NSM ++AT
Sbjct: 62 DKS--------------------------KKVNPDFGDWIANDQALLGWLMNSMAIDIAT 121
Query: 136 QVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQ 195
Q++ E +++LW Q L G +++ YL+ F +RKG KM +YL MKN +D L
Sbjct: 122 QLLHCETSKQLWDETQSLAGAHTKSRITYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKL 181
Query: 196 AGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSS 255
AGSP++ L+ Q L GLD EYNPVV + D+ +SW ++QA+LL FE RL+ NN S
Sbjct: 182 AGSPISNSDLMIQTLNGLDAEYNPVVVKLSDQINLSWVDVQAQLLAFESRLDQFNNF-SG 241
Query: 256 LSLSPGASVNMANSRD-SGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKP 315
L+L+ AS N AN + GN+ N RGN R + RG RG G SN
Sbjct: 242 LTLN--ASANFANKTEFRGNKFN-----------SRGNWRRSNFRGMRGGRGKGRMSNTK 301
Query: 316 VCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSC 375
CQVC TGH A+ C RF++ + G N + QG +AF
Sbjct: 302 -CQVCNGTGHIAVDCSYRFDRPYT----------GRNYSTEADKQGSHSAF--------- 361
Query: 376 VAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLA 435
+A+P D WY DSGA+NHVT + E+ GK S+ +G+G KLKI G++ L
Sbjct: 362 IASPYHGQDYEWYFDSGANNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKL- 421
Query: 436 AEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDE 495
N L VL VP I KNL+SVSKL DN + VEF C VKDK TG+ LLKG L +
Sbjct: 422 ---NNLNLHDVLYVPQITKNLLSVSKLTADNNILVEFDANCCSVKDKLTGQTLLKGRLKD 481
Query: 496 GLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSE 555
GLY+ +N+ P C ++ K+ WH++LGHP+
Sbjct: 482 GLYQL---------SNKEP-CVYMSVKE---------------------SWHRKLGHPNN 541
Query: 556 EVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPL 615
+VL V+K CN+ +S ++ F FCEACQ+GK H LPF S+SH + L+H+D+WGPAP+
Sbjct: 542 KVLDKVLKDCNVKISHSDQFSFCEACQFGKLHLLPFKPSSSHVQEPLALIHSDVWGPAPI 601
Query: 616 DSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGE 675
S +GFKYY+ FIDDFSRF WI+PLK+K++ + AF+ F L +NQFN IK +Q D GGE
Sbjct: 602 LSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGE 661
Query: 676 YVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLV 735
Y + ++ E G++ R+SCP+TSQQNGRAERKHRHV E GLTLLAQA MPL++W +AF
Sbjct: 662 YKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQAKMPLRYWWEAFST 721
Query: 736 AVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVY 795
AV LIN LPS V +SP L++ ++ D+ AL+ FG AC+PCL+ Y HK Q+HT +CV+
Sbjct: 722 AVYLINRLPSSVNPNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVF 781
Query: 796 LGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGF----QQVVTTAASSSNVSPSIP 855
+G S HKG+KC++S GR+F+SRHV FNE FPF GF + T +SS + P+
Sbjct: 782 VGYSNSHKGYKCINSHGRIFVSRHVIFNENHFPFHGGFLDTKNPLKTLTDNSSILLPTC- 841
Query: 856 LWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHE 915
T Q+ P + ++ T S S+D ++ + T+
Sbjct: 842 ---------SAGATTQDAIEPDNNTTSDQ---NTHSIESSDNNENEEQVDSSEFFVNTNN 901
Query: 916 DSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAG-HPMVTRGKAGIFKPKL-WLTH 975
S +Q A+N + + + ST + + + H M TR K GI KPK+ ++
Sbjct: 902 SS--TQDIEADNSVDSEDRNNSTMTGTIQQQAQQDNSNTHWMRTRSKDGIHKPKIPYVGM 961
Query: 976 ACTDWSVTEPTRIADALATPQWREAMD--------------------------------- 1035
A TD EP + +AL P W+EAMD
Sbjct: 962 AETDSEEKEPKSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTK 1021
Query: 1036 ------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFN 1095
PVVK+ST+R+IL+IAV W++RQLD N
Sbjct: 1022 YKSDGSIERRKARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDIN 1081
Query: 1096 NAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY- 1155
NAFLNG+L E V+M QP GY DA N++CKL KAIYGLKQAPRAW ++L++TL++WG+
Sbjct: 1082 NAFLNGKLKETVFMHQPEGYIDAAKPNHICKLSKAIYGLKQAPRAWYDSLRSTLVNWGFQ 1141
Query: 1156 ------SIIF-------------------------------------------------- 1208
S+ F
Sbjct: 1142 NAKNDTSLFFLKGADHTTFLLIYVDDIIVTGSNIKFLEAFTNQLNTAYSLKDLGPLHYFL 1201
BLAST of Lag0039869 vs. NCBI nr
Match:
PNX94503.1 (putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense])
HSP 1 Score: 775.8 bits (2002), Expect = 7.1e-220
Identity = 463/1123 (41.23%), Positives = 618/1123 (55.03%), Query Frame = 0
Query: 28 LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAG 87
L ++KLDR NF LWK+L LP++R K +G++ G CP++FV S++ T
Sbjct: 12 LPSTVSVKLDRDNFPLWKSLVLPLIRGCKYDGYMLGTKKCPDQFV----TSIDNT----- 71
Query: 88 ATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELW 147
++NP Y+ W A DQ LLGWL NSMT ++ATQV+ E +++LW
Sbjct: 72 -----------------EKINPDYQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLW 131
Query: 148 AAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLIS 207
Q L G +R+ YL+ F + K KM YL MKN AD L AGSP+++ L+
Sbjct: 132 DEAQSLAGAHTRSRIIYLKSEFHNTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMI 191
Query: 208 QVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMA 267
Q L GLD EYNPVV + D+ ISW + QA+LL FE RL+ NN +++ AS N A
Sbjct: 192 QTLNGLDSEYNPVVVKLSDQTNISWVDFQAQLLAFESRLDQLNNFN---NINLNASANFA 251
Query: 268 NSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL 327
+ +SG + +G R G+ N RG G GRGRAR S +P+CQ+CGK GHTA
Sbjct: 252 SKNESGGNK---FGSR--GGWRGSNSRGMRG-GRGRAR--MSKPPRPICQICGKFGHTAA 311
Query: 328 MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWY 387
CY RF+K + + + G+G +AF VA+P D WY
Sbjct: 312 QCYYRFDKSY------------TEKNHYAEGEGSHSAF---------VASPYHGQDYEWY 371
Query: 388 ADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLC 447
DSGASNHVT L + E GK S+ +G+G KLKI G++ L + L+ VL
Sbjct: 372 FDSGASNHVTHQSGQLQDLNENNGKNSLLVGNGEKLKILASGSTKL----NDVNLRNVLY 431
Query: 448 VPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTT 507
VP I KNL+SVSKL DN VEF + +C VKDK TGK LLKG L +GLY+ K T
Sbjct: 432 VPEITKNLLSVSKLTIDNNALVEFDENYCYVKDKLTGKALLKGRLKDGLYQLSANKEPPT 491
Query: 508 DTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLP 567
+ + +L K IWH++LGHP+ +VL+ V+K N+
Sbjct: 492 NKDPCAYISL------------------------KEIWHRKLGHPNNKVLEKVLKDNNVK 551
Query: 568 LSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI 627
+S ++ F FCEACQ+GK H LPF S+SHA + DL+HTD+WGPAP+ S + FKYY+ F+
Sbjct: 552 ISPSDKFTFCEACQFGKLHLLPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNFKYYVHFL 611
Query: 628 DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGV 687
DDFSRF WI+PLK+K+E + AF F LV+NQFN IK ++ D GGEY + + + G+
Sbjct: 612 DDFSRFTWIFPLKQKSETIHAFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQKCAIDSGI 671
Query: 688 KIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVL 747
+ ++SCP+TSQQNGRAERKHRHV E GLTLLAQA MPL +W +AF AV LIN LPS V
Sbjct: 672 QFQMSCPYTSQQNGRAERKHRHVTELGLTLLAQAKMPLSYWWEAFSTAVYLINRLPSSVN 731
Query: 748 GGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCL 807
+SP L++ K+ D+ AL+ FG AC+PCL+ Y HK Q+HT +CV+LG S HKG+KC+
Sbjct: 732 PNESPYTLVFKKEPDYTALKPFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCV 791
Query: 808 SSSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQN 867
+S GR+F+SRHV FNE FPF GF + T V+ P+ F S P TT
Sbjct: 792 NSHGRVFVSRHVVFNENHFPFQEGF---LDTRNPIKVVTNDTPIGFP--SFPAGITTNNT 851
Query: 868 REVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS 927
E V E P N+ D Q + ++ + E A
Sbjct: 852 AEATDNIVDQQE-PELNDINTVAD-QSVESDTFEHTDENNFSNGETEDSTEAAGRESMEE 911
Query: 928 ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADA 987
S T+++ PP + H M TR KAG++KPKL LT + EP +++A
Sbjct: 912 ISQPITETN--PPPQQDITNTHWMRTRSKAGVYKPKLPYIGLTEEAKEGK--EPESVSEA 971
Query: 988 LATPQWREAMDID----------------------------------------------- 1047
L+ P+W AMD +
Sbjct: 972 LSIPEWLNAMDAEYKALMNNKTWTLVPFEGQENVISSKWIFKTKYKADGTIERRKARLVA 1031
Query: 1048 ----------------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQ 1085
PVVK+ST+R+ILSIAV W++RQLD NNAFLNG L E V+M Q
Sbjct: 1032 RGFQQTAGVDYDETFSPVVKSSTVRIILSIAVHLSWEVRQLDINNAFLNGNLKESVFMHQ 1037
BLAST of Lag0039869 vs. NCBI nr
Match:
KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])
HSP 1 Score: 753.8 bits (1945), Expect = 2.9e-213
Identity = 438/1025 (42.73%), Positives = 589/1025 (57.46%), Query Frame = 0
Query: 129 MTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKN 188
MT EVATQ++ E +Q++W Q L G +R+ +L+ F ++RKG KM +YL MK
Sbjct: 1 MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60
Query: 189 HADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLEL 248
AD+L AGS V+T L++Q L GLD EYNP+V + D+ ++W EMQA+LL +E RLE
Sbjct: 61 IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120
Query: 249 QNNLKSSLSLSPGASVN--MANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARG 308
NN +S+L+L+P ++++ + N R N GG+ N G GGRGRGRA
Sbjct: 121 INN-QSNLTLNPSSNISTILYNRRGKSNAFGGGRGGQINRG-------ARGGRGRGRAT- 180
Query: 309 YGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFV 368
++ VCQVC K GH A CY RFNK ++G QN E + ++ +
Sbjct: 181 ----KDRIVCQVCCKPGHAASHCYHRFNKNYIG----QNSDEQKSEKDKEQ--------- 240
Query: 369 ANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIA 428
N N N+ VA+P TV D +WY DSGASNHVT D N + E +GK +T+G+G LKI
Sbjct: 241 -NYNFNAYVASPSTVEDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKII 300
Query: 429 FIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKV 488
G+S L + K+ LK +L VP I KNL+S+SKL DN+++VEFHD C VKDK TG++
Sbjct: 301 ACGDSSLDTQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRI 360
Query: 489 LLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAIS-KAIW 548
LL+G + +GLY+ G +T TN+ P +V S K W
Sbjct: 361 LLEGKIKDGLYQLPG---GSTSTNKRP-----------------------HVFFSIKETW 420
Query: 549 HKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVH 608
H++LGHP+ +VL V+K CN+ S E+F+FCEACQ+GK+H LPF NS S A + DLVH
Sbjct: 421 HRKLGHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLDLVH 480
Query: 609 TDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIK 668
+D+WGPAP+ SV+GFKYY+LF+DD+SRF WIYPLK+K++ +AF+ F LV+NQFN IK
Sbjct: 481 SDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNKRIK 540
Query: 669 ALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPL 728
LQ D GGE+ + ++ + G+++R SCP+TS QNGRAERKHRHVVE+GLTLLAQA MPL
Sbjct: 541 TLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAKMPL 600
Query: 729 KFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKF 788
+W +AF AV LIN LP+QV+ KSP + L+ K D+ A++ FG AC+PCL+ Y HK
Sbjct: 601 HYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPCLKPYNQHKL 660
Query: 789 QYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNV 848
Q+HT KCV+LG S HKG+KCL+S+GR+FISRHV FNE FPF GF + T + +
Sbjct: 661 QFHTTKCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDGF---LNTRKPAEII 720
Query: 849 SPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSL 908
+ L F +SP T SN + + Q L + S+
Sbjct: 721 TDPTSLLFP--------------------ISP------TGSNVANEEQRLHTNNNSSSNT 780
Query: 909 MPTHEDSPPSQVEVAENH--CSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPK 968
H QVE AEN A+ S + +S + ++ H M TR K GI KPK
Sbjct: 781 KSKH------QVEQAENQNTIDATISQNTFANSRIENNIESINQ-HQMTTRSKMGIIKPK 840
Query: 969 LWLTHAC-TDWSVTEPTRIADALATPQWREAM---------------------------- 1028
A EP +AL P+W++AM
Sbjct: 841 KPYVGAVEKTLEEQEPETTYEALENPEWKKAMIAEFKALMMNKTWTLVPYQGQKNIIDCK 900
Query: 1029 -------------------------------DID----PVVKASTIRVILSIAVTKGWQL 1085
D D PV+KA T+R+ILSIAV W++
Sbjct: 901 WVFKTKYKADGTIERRKARLVAKGFQQTLGLDYDETFSPVIKAITVRIILSIAVHFNWEI 936
BLAST of Lag0039869 vs. NCBI nr
Match:
PNY01489.1 (copia-like polyprotein, partial [Trifolium pratense])
HSP 1 Score: 713.4 bits (1840), Expect = 4.3e-201
Identity = 435/1121 (38.80%), Positives = 604/1121 (53.88%), Query Frame = 0
Query: 28 LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAG 87
L I ++KLDR N+ LWK+L LP++R K +G++ G CPE+FV +A S
Sbjct: 13 LPSIISVKLDRDNYPLWKSLVLPLIRGCKFDGYILGTKECPEQFVTSADKS--------- 72
Query: 88 ATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELW 147
+++NP ++ W+A DQ LLGWL NSM ++ATQ++ E +++LW
Sbjct: 73 -----------------KKVNPDFQDWMADDQALLGWLMNSMAIDIATQLLHCETSKQLW 132
Query: 148 AAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLIS 207
Q L G +++ YL+ F +RKG KM +YL MKN +D L +GSP++ L+
Sbjct: 133 DEAQSLAGAHTKSRIIYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLSGSPISNSDLMI 192
Query: 208 QVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMA 267
Q L GLD EYNPVV + D+ +SW ++QA+LL FE RL+ NN S L+L+ AS N A
Sbjct: 193 QTLNGLDAEYNPVVVKLSDQINLSWVDVQAQLLAFESRLDQLNNF-SGLTLN--ASANFA 252
Query: 268 NSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL 327
N + R N RGN R + RG RG G SN CQVC TGHTA+
Sbjct: 253 NKTEF----------RGNKFHSRGNWRRSNFRGMRGGRGKGRMSNTK-CQVCSGTGHTAV 312
Query: 328 MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWY 387
C RF++ + G N + QG +AF VA+P D WY
Sbjct: 313 DCSYRFDRSYT----------GRNYSTEADKQGSHSAF---------VASPYHGQDYEWY 372
Query: 388 ADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLC 447
DSGASNHVT + E+ GK S+ +G+G KLKI G++ L L VL
Sbjct: 373 FDSGASNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKL----NTLNLHDVLY 432
Query: 448 VPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTT 507
VP I KNL+SVSKL DN +FVEF C VKDK TG+ LLKG L +GLY+ + V+
Sbjct: 433 VPQITKNLLSVSKLTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQ---LSDVSP 492
Query: 508 DTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLP 567
+N+ P C ++ K+ WH++LGHP+ +VL+ V+K CN+
Sbjct: 493 QSNKDP-CVYMSVKE---------------------SWHRKLGHPNNKVLEKVLKDCNVK 552
Query: 568 LSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI 627
+S ++ F FCEACQ+GK H LPF +S+SH + L+H+D+WGPAP+ S +GFKYY+ FI
Sbjct: 553 ISPSDQFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIHSDVWGPAPILSPSGFKYYVHFI 612
Query: 628 DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGV 687
DDFSRF WI+PLK+K++ + AF+ F L +NQFN IK +Q D GGEY + ++ E G+
Sbjct: 613 DDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGI 672
Query: 688 KIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVL 747
+ R+SCP+TSQQNGRAERKHRHVVE GLTLLAQA MPL++W +AF AV LIN L S V
Sbjct: 673 QFRMSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPLRYWWEAFSTAVYLINRLSSSVN 732
Query: 748 GGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCL 807
+SP L++ ++ D+ AL+ FG AC+PCL+ Y HK Q+HT +CV++G S HKG
Sbjct: 733 PNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVFMGYSNSHKGSTTQ 792
Query: 808 SSSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQN 867
+ G S + N+++ + Q +T +S +N +++
Sbjct: 793 DAIG----SDNNIVNDQD---TTNDQNTHSTESSDNN-------------------EEEH 852
Query: 868 REVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS 927
+ + V+ N N ST +D +E+ S +
Sbjct: 853 ADNSESFVNTN--------NGST-------------------QDIEVDNFVDSEDRNSPT 912
Query: 928 ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALA 987
+ S Q ++ T + H + TR K GI KPKL ++ TD EP + +AL
Sbjct: 913 ITGTSQQQAHQDNTNT-----HGIRTRSKNGIHKPKLPYVGMTETDSEEKEPESVKEALD 972
Query: 988 TPQWREAMD--------------------------------------------------- 1047
P W+EAMD
Sbjct: 973 KPMWKEAMDKEYKALMSNYTWTLVPFQAQENIIDSKWIFKTKYKSDGSIERRKARLVAKG 987
Query: 1048 ------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPP 1085
PVVK+ST+R+IL+IAV W++RQLD NNAFLNG+L E V+M QP
Sbjct: 1033 FQQTAGLDFHETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLKETVFMHQPE 987
BLAST of Lag0039869 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 495.4 bits (1274), Expect = 2.4e-138
Identity = 421/1540 (27.34%), Positives = 641/1540 (41.62%), Query Frame = 0
Query: 20 NSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASV 79
N+ LN ++ +T KL N+L+W + Y+L G L G++ P
Sbjct: 12 NTSILNVNMSNVT--KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMP----------- 71
Query: 80 NPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMG 139
P T A A +NP Y W D+L+ + +++ V V
Sbjct: 72 -PATIGTDA---------------APRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSR 131
Query: 140 YENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSP 199
A ++W +++++ S LR +Q KG + DY++ + D L G P
Sbjct: 132 ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKP 191
Query: 200 VNTRSLISQVLLGLDEEYNPVVAMIQDR-AGISWSEMQAELLVFEKRLELQNNLKSSLSL 259
++ + +VL L EEY PV+ I + + +E+ LL E ++ SS ++
Sbjct: 192 MDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKI----LAVSSATV 251
Query: 260 SPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGY-GSFSNKPV-- 319
P + N + R++ N + G R N R N + + + + +KP
Sbjct: 252 IP-ITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLG 311
Query: 320 -CQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQ-NVNS 379
CQ+CG GH+A C Q + F+ + N Q P+ F Q N
Sbjct: 312 KCQICGVQGHSAKRCSQL--QHFLSSV---------------NSQQPPSPFTPWQPRANL 371
Query: 380 CVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCL 439
+ +P + NW DSGA++H+T+D+NNL+ Y G + V + DG+ + I+ G++ L
Sbjct: 372 ALGSPYS--SNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSL 431
Query: 440 AAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLD 499
+ + + L +L VP+I KNL+SV +L N V VEF VKD +TG LL+G
Sbjct: 432 STKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTK 491
Query: 500 EGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPS 559
+ LY E PI + + +S F S+ + + WH RLGHP+
Sbjct: 492 DELY-------------EWPIAS-------SQPVSLFASPSS---KATHSSWHARLGHPA 551
Query: 560 EEVLKSVVKSCNLPLSVNESFKF--CEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGP 619
+L SV+ + +L + +N S KF C C KS+ +PF ST ++ + +++D+W
Sbjct: 552 PSILNSVISNYSLSV-LNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS- 611
Query: 620 APLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDN 679
+P+ S + ++YY++F+D F+R+ W+YPLK+K++ E F+ F L++N+F + I +DN
Sbjct: 612 SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDN 671
Query: 680 GGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDA 739
GGE+V + + ++ G+ S PHT + NG +ERKHRH+VETGLTLL+ AS+P +W A
Sbjct: 672 GGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYA 731
Query: 740 FLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEK 799
F VAV LIN LP+ +L +SP + L+G ++ LRVFG AC+P LR Y HK + +
Sbjct: 732 FAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQ 791
Query: 800 CVYLGPSPLHKGHKCLS-SSGRLFISRHVRFNEEEFPFASGFQQV--------------- 859
CV+LG S + CL + RL+ISRHVRF+E FPF++ +
Sbjct: 792 CVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWS 851
Query: 860 ----------VTTAASSSN--------VSPSIPLWFSNISGPDIKTTQQNREVPSTGVSP 919
V A S S+ SPS P S +S N + + P
Sbjct: 852 PHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVS-------SSNLDSSFSSSFP 911
Query: 920 NECPPTTPSNSSTDVQPLPAELSPQS-SLMPTHEDSP----PSQV-----EVAENHCSAS 979
+ PT P + P + Q+ S T +++P PSQ+ A++ S+
Sbjct: 912 SSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSP 971
Query: 980 ESSASTQSSNLPPTPSTV----------------QA---GHPMVTRGKAGIFKPKLWLTH 1039
+ S SS+ PTP ++ QA H M TR KAGI KP +
Sbjct: 972 SPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSL 1031
Query: 1040 ACTDWSVTEPTRIADALATPQWREAM---------------------------------- 1099
A + + +EP AL +WR AM
Sbjct: 1032 AVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTK 1091
Query: 1100 ------------------------------DIDPVVKASTIRVILSIAVTKGWQLRQLDF 1159
PV+K+++IR++L +AV + W +RQLD
Sbjct: 1092 KYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDV 1151
Query: 1160 NNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY 1209
NNAFL G L +DVYM QPPG+ D NYVCKL KA+YGLKQAPRAW L+ LL+ G+
Sbjct: 1152 NNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGF 1211
BLAST of Lag0039869 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 486.5 bits (1251), Expect = 1.1e-135
Identity = 410/1535 (26.71%), Positives = 628/1535 (40.91%), Query Frame = 0
Query: 20 NSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASV 79
N+ LN ++ +T KL N+L+W + Y+L G L G++P P
Sbjct: 12 NTNILNVNMSNVT--KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPP---------- 71
Query: 80 NPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMG 139
A+ ++ P +NP Y W D+L+ + +++ V V
Sbjct: 72 ---------------ATIGTDAVP--RVNPDYTRWRRQDKLIYSAILGAISMSVQPAVSR 131
Query: 140 YENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSP 199
A ++W +++++ S LR + D L G P
Sbjct: 132 ATTAAQIWETLRKIYANPSYGHVTQLRFI-------------------TRFDQLALLGKP 191
Query: 200 VNTRSLISQVLLGLDEEYNPVVAMIQDR-AGISWSEMQAELLVFEKRLELQNNLKSSLSL 259
++ + +VL L ++Y PV+ I + S +E+ L+ E +L N S +
Sbjct: 192 MDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLALN----SAEV 251
Query: 260 SPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPV--- 319
P + N+ R++ RNQ+ G N + N R + + KP
Sbjct: 252 VP-ITANVVTHRNTNTNRNQNNRG-DNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGR 311
Query: 320 CQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCV 379
CQ+C GH+A C Q ++ + Q P AN VNS
Sbjct: 312 CQICSVQGHSAKRCPQ------------LHQFQSTTNQQQSTSPFTPWQPRANLAVNSPY 371
Query: 380 AAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAA 439
A NW DSGA++H+T+D+NNL+ Y G + V I DG+ + I G++ L
Sbjct: 372 NA------NNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPT 431
Query: 440 EHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEG 499
++ L KVL VP+I KNL+SV +L N V VEF VKD +TG LL+G +
Sbjct: 432 SSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDE 491
Query: 500 LYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEE 559
LY + + P +S WH RLGHPS
Sbjct: 492 LYEWPIASSQAVSMFASPCSKATHSS-----------------------WHSRLGHPSLA 551
Query: 560 VLKSVVKSCNLPLSVNESFKF--CEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAP 619
+L SV+ + +LP+ +N S K C C KSH +PF NST + + +++D+W +P
Sbjct: 552 ILNSVISNHSLPV-LNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SP 611
Query: 620 LDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGG 679
+ S++ ++YY++F+D F+R+ W+YPLK+K++ + F+ F +LV+N+F + I L +DNGG
Sbjct: 612 ILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGG 671
Query: 680 EYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFL 739
E+V + ++ G+ S PHT + NG +ERKHRH+VE GLTLL+ AS+P +W AF
Sbjct: 672 EFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFS 731
Query: 740 VAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCV 799
VAV LIN LP+ +L +SP + L+G+ +++ L+VFG AC+P LR Y HK + +++C
Sbjct: 732 VAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCA 791
Query: 800 YLGPSPLHKGHKCLS-SSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIP-- 859
++G S + CL +GRL+ SRHV+F+E FPF++ V T+ S+ +P+ P
Sbjct: 792 FMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSH 851
Query: 860 ---------LWFSNISGPDIKTTQQNREVPS----TGVSPNECPPTTPSNSSTDVQPLPA 919
L GP + T+ + PS T VS + P ++ S+ S+ P+
Sbjct: 852 TTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPS 911
Query: 920 ELSPQSSLMP-------------------------THEDSPPSQVEVAENHCSA------ 979
PQ + P +++SP Q ++ H
Sbjct: 912 HNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSIS 971
Query: 980 ---SESSASTQSSNLP---PTPSTVQAG-------HPMVTRGKAGIFKPKLWLTHACTDW 1039
S SS+ST + LP P P +Q H M TR K GI KP ++A +
Sbjct: 972 EPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLA 1031
Query: 1040 SVTEPTRIADALATPQWREAM--------------------------------------- 1099
+ +EP A+ +WR+AM
Sbjct: 1032 ANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSD 1091
Query: 1100 -------------------------DIDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFL 1159
PV+K+++IR++L +AV + W +RQLD NNAFL
Sbjct: 1092 GSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFL 1151
Query: 1160 NGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY----- 1209
G L ++VYM QPPG+ D +YVC+L KAIYGLKQAPRAW L+ LL+ G+
Sbjct: 1152 QGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSIS 1211
BLAST of Lag0039869 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 278.5 bits (711), Expect = 4.7e-73
Identity = 273/1014 (26.92%), Positives = 433/1014 (42.70%), Query Frame = 0
Query: 112 ESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYL-RQVFQ 171
E W +D+ + ++ +V ++ + A+ +W ++ L+ ++ + YL +Q++
Sbjct: 50 EDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYA 109
Query: 172 QSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEY-NPVVAMIQDRAG 231
+ +L + L G + +L L Y N ++ +
Sbjct: 110 LHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTT 169
Query: 232 ISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFG 291
I ++ + LL+ EK + N +L ++ G R +SY R++N +G
Sbjct: 170 IELKDVTSALLLNEKMRKKPENQGQAL-ITEG--------------RGRSY-QRSSNNYG 229
Query: 292 RGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEG 351
R +G RG+ + R N C C + GH C P + +GE
Sbjct: 230 R-----SGARGKSKNRSKSRVRN---CYNCNQPGHFKRDC----------PNPRKGKGET 289
Query: 352 ANRQNNQNGQGQPAAFVA-NQNVNSCVAAPETVI-----DPNWYADSGASNHVTADYNNL 411
+ ++N+ N AA V N NV + E + + W D+ AS+H T +
Sbjct: 290 SGQKNDDN----TAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRDLF 349
Query: 412 ANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAE-HKNFKLKKVLCVPSIAKNLVSVSKLA 471
V + +V +G+ + KIA IG+ C+ LK V VP + NL+S L
Sbjct: 350 CRYVAGD-FGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALD 409
Query: 472 KDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKD 531
+D + + L K V+ KGV LYR N++
Sbjct: 410 RDGYESYFANQKWRLTKG---SLVIAKGVARGTLYR-------------------TNAEI 469
Query: 532 INNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQY 591
EL + IS +WHKR+GH SE+ L+ + K + + + K C+ C +
Sbjct: 470 CQGEL------NAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLF 529
Query: 592 GKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRK 651
GK H + F S+ L+ DLV++D+ GP ++S+ G KY++ FIDD SR +W+Y LK K
Sbjct: 530 GKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTK 589
Query: 652 NEALEAFVHFTALVKNQFNSSIKALQTDNGGEYV--KIHQLCNEMGVKIRLSCPHTSQQN 711
++ + F F ALV+ + +K L++DNGGEY + + C+ G++ + P T Q N
Sbjct: 590 DQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHN 649
Query: 712 GRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKK 771
G AER +R +VE ++L A +P FW +A A LIN PS L + P + K+
Sbjct: 650 GVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKE 709
Query: 772 IDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFI-SRHV 831
+ + L+VFG F + K Q K + C+++G G++ + I SR V
Sbjct: 710 VSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDV 769
Query: 832 RFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNE 891
F E E V T A S V I P+ T +PST +P
Sbjct: 770 VFRESE---------VRTAADMSEKVKNGII--------PNFVT------IPSTSNNPTS 829
Query: 892 CPPTTPSNSSTDVQPLPAELSPQSSLM---------PTHEDSPPSQVEVAENHCSASESS 951
TT S Q P E+ Q + PT + + +E S
Sbjct: 830 AESTTDEVSEQGEQ--PGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRY 889
Query: 952 ASTQ----SSNLPPTPSTVQAGHP------------MVTRGKAGIFK----PK------- 1011
ST+ S + P HP M + K G +K PK
Sbjct: 890 PSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKC 949
Query: 1012 LWLTHACTDWSVTEPTRIADALATPQWREAMDID------PVVKASTIRVILSIAVTKGW 1071
W+ D + R L + + ID PVVK ++IR ILS+A +
Sbjct: 950 KWVFKLKKDGD-CKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDL 970
HSP 2 Score: 77.0 bits (188), Expect = 2.1e-12
Identity = 45/134 (33.58%), Positives = 75/134 (55.97%), Query Frame = 0
Query: 1081 SWGYSIIFKYRDSSYRSSNISALALAST------------EVIWLRQLMTEIGFPCCSKS 1140
S GY F S++S +AL++T E+IWL++ + E+G +
Sbjct: 1193 STGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLH-QKEY 1252
Query: 1141 VLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTL 1200
V++CD+ SA L+ N ++HARTKHI++ H++R+ V L+V I +NE PAD LTK +
Sbjct: 1253 VVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVV 1312
Query: 1201 SHSQFAYLRCKLGV 1203
++F + +G+
Sbjct: 1313 PRNKFELCKELVGM 1325
BLAST of Lag0039869 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 171.8 bits (434), Expect = 6.2e-41
Identity = 240/1092 (21.98%), Positives = 416/1092 (38.10%), Query Frame = 0
Query: 108 NPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLR- 167
N + +SW ++ + ++ A+++ + ++ +S A + LR
Sbjct: 42 NEVDDSWKKAERCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRK 101
Query: 168 QVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQD 227
++ + + I L AG+ + IS +L+ L Y+ ++ I+
Sbjct: 102 RLLSLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIE- 161
Query: 228 RAGISWSEMQAELLVFEKR-LELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTN 287
+ SE L + R L+ + +K+ + + +N ++ +N + R
Sbjct: 162 ----TLSEENLTLAFVKNRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVT 221
Query: 288 NGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQN 347
+ + G+ K C CG+ GH C+ + + N+N
Sbjct: 222 ---------------KPKKIFKGNSKYKVKCHHCGREGHIKKDCF-----HYKRILNNKN 281
Query: 348 RGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLAN 407
+ N + Q AF+ + N+ V + + + DSGAS+H+ D + +
Sbjct: 282 K---ENEKQVQTATSHGIAFMVKEVNNTSV-----MDNCGFVLDSGASDHLINDESLYTD 341
Query: 408 PVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDN 467
VE + + + A L+ VL A NL+SV +L ++
Sbjct: 342 SVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRL-QEA 401
Query: 468 EVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNL----VNSK 527
+ +EF + V G+L N +P+ N +N+K
Sbjct: 402 GMSIEFDKSGVTISKNGLMVVKNSGML-----------------NNVPVINFQAYSINAK 461
Query: 528 DINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVL-----KSVVKSCNLPLSVNESFKF 587
NN +WH+R GH S+ L K++ +L ++ S +
Sbjct: 462 HKNN----------------FRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEI 521
Query: 588 CEACQYGKSHALPFP--NSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFV 647
CE C GK LPF +H +VH+D+ GP +++ Y+++F+D F+ +
Sbjct: 522 CEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYC 581
Query: 648 WIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYV--KIHQLCNEMGVKIRLS 707
Y +K K++ F F A + FN + L DNG EY+ ++ Q C + G+ L+
Sbjct: 582 VTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLT 641
Query: 708 CPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVL--GGK 767
PHT Q NG +ER R + E T+++ A + FW +A L A LIN +PS+ L K
Sbjct: 642 VPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSK 701
Query: 768 SPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSS 827
+P E+ + KK + LRVFG+ + ++ Q KF + K +++G P G K +
Sbjct: 702 TPYEMWHNKKPYLKHLRVFGATVYVHIKNKQG-KFDDKSFKSIFVGYEP--NGFKLWDAV 761
Query: 828 GRLFI-SRHVRFNEEEFPFASGFQ-QVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNR 887
FI +R V +E + + + V S + + + P N S I+T N
Sbjct: 762 NEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFP----NDSRKIIQTEFPNE 821
Query: 888 EVPSTGV-----SPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSP--------PS 947
+ S P++S +Q S + + +DS S
Sbjct: 822 SKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNES 881
Query: 948 QVEVAENHCSASESSASTQSSNLPPTPS----------TVQAGHPMVTRGKAGI-FKPKL 1007
+ ++H + S+ S + S T T G ++ R + KP++
Sbjct: 882 KKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQI 941
Query: 1008 --------------------------------------WLTHACTD---------WSVTE 1067
W T+ W++T+
Sbjct: 942 SYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITK 1001
Query: 1068 -----------------------PTRIADALATPQWREAMDID------PVVKASTIRVI 1080
P R L + + ID PV + S+ R I
Sbjct: 1002 RPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFI 1056
HSP 2 Score: 74.7 bits (182), Expect = 1.0e-11
Identity = 41/112 (36.61%), Positives = 60/112 (53.57%), Query Frame = 0
Query: 1093 SSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKH 1152
+S + AL A E +WL+ L+T I + ++ DN ++A NP H R KH
Sbjct: 1291 ASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKH 1350
Query: 1153 IEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAE 1205
I+I HF R+QV + + YIP+ Q AD TK L ++F LR KLG+ +
Sbjct: 1351 IDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQ 1402
BLAST of Lag0039869 vs. ExPASy Swiss-Prot
Match:
Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)
HSP 1 Score: 110.2 bits (274), Expect = 2.2e-22
Identity = 102/378 (26.98%), Positives = 159/378 (42.06%), Query Frame = 0
Query: 389 DSGASNHV--TADYNNLANPVEYEGKESVTIGDGNK--LKIAFIGNSCLAAEHKNFKLKK 448
DSGAS + +A Y + A P + I D K + I IGN ++ K
Sbjct: 457 DSGASQTLVRSAHYLHHATP-----NSEINIVDAQKQDIPINAIGNLHFNFQNGTKTSIK 516
Query: 449 VLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVK---DKDTGKVLLKGVLDEGLYRFDG 508
L P+IA +L+S+S+LA N C + ++ G VL V Y
Sbjct: 517 ALHTPNIAYDLLSLSELANQNIT-------ACFTRNTLERSDGTVLAPIVKHGDFYWLSK 576
Query: 509 VKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPS-EEVLKSV 568
+ + ++L I N+ SK +N + H+ LGH + + KS+
Sbjct: 577 KYLIPSHISKLTINNVNKSKSVNK--------------YPYPLIHRMLGHANFRSIQKSL 636
Query: 569 VKSCNLPLS------VNESFKFCEACQYGKS----HALPFPNSTSHALDKFDLVHTDLWG 628
K+ L N S C C GKS H + + F +HTD++G
Sbjct: 637 KKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFG 696
Query: 629 PAPLDSVNGFKYYILFIDDFSRFVWIYPL--KRKNEALEAFVHFTALVKNQFNSSIKALQ 688
P + Y+I F D+ +RF W+YPL +R+ L F A +KNQFN+ + +Q
Sbjct: 697 PVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQ 756
Query: 689 TDNGGEYVK--IHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLK 745
D G EY +H+ G+ + S+ +G AER +R ++ TLL + +P
Sbjct: 757 MDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNH 808
BLAST of Lag0039869 vs. ExPASy TrEMBL
Match:
A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)
HSP 1 Score: 847.4 bits (2188), Expect = 9.3e-242
Identity = 536/1457 (36.79%), Positives = 714/1457 (49.00%), Query Frame = 0
Query: 33 TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSG 92
++KLDR N+ LWK+L LP++R KL+G++ G CPE+F+ ++ +S N
Sbjct: 18 SVKLDRNNYPLWKSLVLPVIRGCKLDGYMLGTEGCPEEFITSSDSSKNK----------- 77
Query: 93 AVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQE 152
N + W A DQ LLGW+ NSMT+E+ATQ++ E +++LW Q
Sbjct: 78 ---------------NSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQS 137
Query: 153 LFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLG 212
L G +R++ YL+ F RKG KM DYL MKN D L AG+PV+T LI Q L G
Sbjct: 138 LAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLNG 197
Query: 213 LDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDS 272
LD EYNPVV + D+ +SW ++QA+LL FE R+E NNL +L+ A+ N+AN
Sbjct: 198 LDSEYNPVVVKLSDQTTLSWVDLQAQLLTFESRIEQLNNL---TNLTLNATANVAN---- 257
Query: 273 GNQRNQSYGGRTNNGFGRGNQRG-AGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQ 332
R+ G +NN + N RG GGRGRG+ S K CQVCG + H A+ C+
Sbjct: 258 ---RSDHRGKSSNNNWRGSNSRGWRGGRGRGK-------SGKNPCQVCGLSNHIAIDCFH 317
Query: 333 RFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSG 392
RF+K + N G ++Q + N AF+A+QN +V D +WY DSG
Sbjct: 318 RFDKTY----SRSNHSAGHDKQGSHN------AFLASQN---------SVEDYDWYFDSG 377
Query: 393 ASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSI 452
ASNHVT + E+ GK S+ +G+G KL I G+S L K+ L +L VP+I
Sbjct: 378 ASNHVTHQTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKL----KSLNLHDILYVPNI 437
Query: 453 AKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNE 512
KNL+SVSKLA DN + VEF + C VKDK TGKV+LKG+L +GLY+ G K
Sbjct: 438 TKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLSGTK-------- 497
Query: 513 LPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVN 572
S FV K WH+RLGHP+ +VL V++SC + + +
Sbjct: 498 -------------RNPSAFV--------SVKESWHRRLGHPNNKVLDKVLESCKVKVPPS 557
Query: 573 ESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFS 632
++F FCEACQYGK H LPF +S+SHA + +LVHTD+WGPAP+ + +GFKYY+ F+DDFS
Sbjct: 558 DNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFS 617
Query: 633 RFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRL 692
RF WIYPLK+K+E ++AF+ F L +NQFN IK +Q D GGEY + +L E G++ R+
Sbjct: 618 RFTWIYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRM 677
Query: 693 SCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKS 752
SCP+TSQQNGRAERKHRH+ E GLTLLAQA MPL +W +AF AV LIN LPSQV +S
Sbjct: 678 SCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNES 737
Query: 753 PMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSG 812
P L+ K+ D++ L+ FG AC+PCL+ Y HK QYHT +CV+LG S HKG+KCL+S G
Sbjct: 738 PYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHG 797
Query: 813 RLFISRHVRFNEEEFPFASGFQQVVTTAASSSNV-SPSIPLWFSNISGPDIKTTQQNREV 872
R+FISRHV FNE+ FPF GF + ++ NV S S PL T +
Sbjct: 798 RIFISRHVIFNEDHFPFHDGFLNTRSPLKTTINVPSTSFPLC----------TAGNVIDD 857
Query: 873 PSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESS 932
S + E P T + S DV + D+ + +E++ + E+
Sbjct: 858 ASMPILEAENPAETNTEDSQDV----------------NSDTEQTNNGPSEDNTTHEETL 917
Query: 933 ASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALAT 992
TQ ++ H + TR K+GI KPKL LT D EP +AL+
Sbjct: 918 DITQQQSVGEASQNTNTSHAIHTRSKSGIHKPKLPYIGLTETYKD--TMEPANAKEALSR 977
Query: 993 PQWREAMD---------------------------------------------------- 1052
P W+EAM
Sbjct: 978 PLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLERRKARLVAKGF 1037
Query: 1053 -----------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPG 1112
PV+KAST+R+ILSIAV W++RQLD NNAFLNG L E V+M QP G
Sbjct: 1038 QQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLKETVFMHQPEG 1097
Query: 1113 YEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGYS--------IIFKYRD---- 1172
+ D+ N++CKL KAIYGLKQAPRAW ++LK LL+WG+ + K +D
Sbjct: 1098 FVDSTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDSSLFLLKGKDHITF 1157
Query: 1173 ------------------------------------------------------------ 1207
Sbjct: 1158 LLIYVDDIIVTGSNGKFLQAFIKQLNDAFSLKDLGHLHYFLGIEVQRDASGMYLKQSKYI 1217
BLAST of Lag0039869 vs. ExPASy TrEMBL
Match:
A0A803PM38 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 838.6 bits (2165), Expect = 4.3e-239
Identity = 534/1434 (37.24%), Positives = 718/1434 (50.07%), Query Frame = 0
Query: 22 PPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNP 81
P LNQ +KLDR NF LW+ + I+R ++L+G+L G P P++F+ +
Sbjct: 38 PQFGSTLNQPFALKLDRNNFSLWRTMVSAIVRGHRLDGYLKGTLPKPQEFLSS------- 97
Query: 82 TTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYE 141
T + +S G ++NP +E W+ DQLLLGWLY SMT +A +VMG +
Sbjct: 98 TDLDGSVSSVG-------------QVNPAFEQWIVNDQLLLGWLYGSMTEGIACEVMGCD 157
Query: 142 NAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVN 201
++ LW A++ELFG S+A+ D R Q +RKG M DYLR + AD L AG P
Sbjct: 158 SSASLWTALEELFGAHSKAKMDEYRTKIQTARKGALSMADYLRQKRQWADVLALAGEPYP 217
Query: 202 TRSLISQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLS-- 261
L+S VL GLD EY P+V +I+ R +W ++Q LL + ++E ++ S L+
Sbjct: 218 ENQLVSNVLSGLDIEYLPMVLLIEARGSTTWQQLQDMLLSLDSKMERLHSFSGSSKLTGV 277
Query: 262 -PGASVNMAN-----SRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNK 321
S ++AN + GN N + GG +NN RG GGR G +
Sbjct: 278 PMNPSASLANKGPHPGANRGNHNNNNRGGHSNNRGSNNRSRGRGGRTSG---------PR 337
Query: 322 PVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNS 381
P CQVCGK GH+A CY R
Sbjct: 338 PTCQVCGKYGHSAAHCYNR----------------------------------------- 397
Query: 382 CVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIG-NSC 441
GASNH+T++ N + EY GKE VT+ +GN+L I IG S
Sbjct: 398 -----------------GASNHITSEINKMNLKEEYNGKEKVTVANGNRLPIHHIGLGSL 457
Query: 442 LAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVL 501
LK++L VPSI KNL+S+SKL DN V VEF C VKDK+TG+V+LKG L
Sbjct: 458 QTLSASPLILKEILHVPSITKNLLSISKLTSDNNVCVEFLSDLCFVKDKETGQVVLKGKL 517
Query: 502 DEGLYRFDGVKAVTT-DTNELPICNLVNSKDINNELSGFVLSSTLNVAIS--KAIWHKRL 561
+GLY+FD + T+ +N C S + + + V N + K WH+RL
Sbjct: 518 KDGLYQFDAPTSTTSMSSNRSISCPTSFSGLVVSAVESNVTKPMANQLLCSIKDRWHRRL 577
Query: 562 GHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLW 621
GHPS VL +V+ N+ ++N S FC+ACQ GKSH+LPF + A +LVHTD+W
Sbjct: 578 GHPSIRVLDTVLHKINVK-NINSSLSFCDACQLGKSHSLPFKVNPKRATAPLELVHTDIW 637
Query: 622 GPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQT 681
GP+P+ S F+YYI FIDDFSR+ WIYPLK K+EAL AFV F LV+NQFNS +K +QT
Sbjct: 638 GPSPIMSNTNFRYYIHFIDDFSRYTWIYPLKAKSEALAAFVQFKLLVENQFNSRVKRVQT 697
Query: 682 DNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWV 741
D GGEY + ++ G+ + CPHTS QNGRAERKHRH+VE GLTLLAQA +P K+W
Sbjct: 698 DWGGEYQGFPRFGSDHGIGFQHPCPHTSGQNGRAERKHRHIVEMGLTLLAQAHVPQKYWW 757
Query: 742 DAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHT 801
DAF AV LIN LP+ VL K+P E+L+ ++ D++ L+VFG +CFPCLR YQ HKFQ+H+
Sbjct: 758 DAFQTAVYLINRLPTPVLKLKTPFEVLFKQQPDYKFLKVFGVSCFPCLRAYQNHKFQFHS 817
Query: 802 EKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSI 861
KCV LG S HKG+KCLSS+GRL+ISR V FNE+EFPF SGF + T + VS +
Sbjct: 818 TKCVNLGYSDKHKGYKCLSSTGRLYISRDVIFNEDEFPFKSGF---LNTNKPETPVSVLV 877
Query: 862 PLWFSNISGPDIKTTQQNREVPSTGVSPNE----CPPTT----PSNSSTDVQPLPAELSP 921
P W ++ S + +++ QN S G + + PTT P S+ +S
Sbjct: 878 PFWTAS-SFVNSQSSSQNDFSSSIGNNQTDEVDHGTPTTSRVVPDLSTFQGNDTDHVISD 937
Query: 922 QSSLMPTHEDSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFK 981
++ + + +A S NL ST HPM+TR KAGIFK
Sbjct: 938 FGNIDRISDVQIQQHADTTTLESAADPIDTSASDHNLKAVVST----HPMITRAKAGIFK 997
Query: 982 PKLWLTHACTDWSVTEPTRIADALATPQWREAMD-------------------------- 1041
PK +LT + +EP I +AL W AM
Sbjct: 998 PKTYLTQTKWIGNSSEPQSIEEALQHKGWNNAMSSEVHALARNGTWKLVPRLPHMHIIDN 1057
Query: 1042 -------------------------------------IDPVVKASTIRVILSIAVTKGWQ 1101
PV+KAST+R++LSIAVTK W+
Sbjct: 1058 KWVYKEKRNADGSFQRLKARLVAKGFTQRPGVDFSETFSPVIKASTVRIVLSIAVTKEWE 1117
Query: 1102 LRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKAT 1161
+RQLD NNAFLNG + ED+YM+QP G+ED N+VCKL K+IYGL+QAPRAW + LKAT
Sbjct: 1118 VRQLDINNAFLNGHITEDIYMKQPLGFEDKNKPNHVCKLIKSIYGLRQAPRAWFDKLKAT 1177
Query: 1162 LLSWGY------SIIFKYRDSSY------------------------------------- 1221
L SW + S +F + SSY
Sbjct: 1178 LASWKFKNSKADSSLFFLKTSSYIILVLIYVDDIIITGNNSAVMQTFINKLNQQFALKDL 1237
Query: 1222 ------------------------------------------------------------ 1232
Sbjct: 1238 GKLHYFLGIEVNRDATGMYLSQPKYIEELLKKMNMINLKACPTPMATGKVLSIEDGDSLR 1297
BLAST of Lag0039869 vs. ExPASy TrEMBL
Match:
A0A803QCY3 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 791.2 bits (2042), Expect = 7.9e-225
Identity = 516/1336 (38.62%), Positives = 696/1336 (52.10%), Query Frame = 0
Query: 5 TSTGNPAYVTGTTSFN------SPPLN-----QLLNQITTIKLDRGNFLLWKNLALPILR 64
T+ GNP VT S N +P L+ L Q ++KLD N+ LWK + I+R
Sbjct: 3 TNAGNPT-VTAAVSTNTGARSTNPQLHVPHHFSTLKQPFSLKLDMNNYSLWKTMVSTIVR 62
Query: 65 SYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYES 124
++L+G L+G + CP ++V G+T G S + LNP +E+
Sbjct: 63 GHRLDGFLNGTNVCPSEYVY------------TGSTEDG--------SKTIKTLNPEFEN 122
Query: 125 WVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSR 184
W+ DQLL+GWLY+SMT +AT+VMG +A LW A+++L+G S+++ D R + Q ++
Sbjct: 123 WIVNDQLLMGWLYSSMTETIATEVMGSTSAAGLWHALEQLYGAHSKSKMDDTRTLIQTTK 182
Query: 185 KGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISWS 244
KG + M +YLR K+ AD+L AG P L + VL LD Y +V I+ R SW
Sbjct: 183 KGGTPMIEYLRQKKSWADSLALAGEPYPEAQLATNVLSRLDINYLTLVLQIKARTKTSWQ 242
Query: 245 EMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQ 304
E+Q LL FE ++E G NN GR
Sbjct: 243 ELQELLLSFESKVE-------------------------------RLGRGNNNSRGRHFN 302
Query: 305 RGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQ 364
RG GGR RGR R + ++KP CQVCGK H+A++CY F+ ++G
Sbjct: 303 RGNGGRSRGRGR---TNNSKPTCQVCGKYDHSAVVCYNWFDDSYMG---------SDPHS 362
Query: 365 NNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKE 424
+NQN GQ N N ++ +A PE + W+ADSGASN++TAD + + EY GKE
Sbjct: 363 SNQNKTGQ-----NNNNPSAFIATPEFLDSEAWFADSGASNNITADPSVIPQKQEYGGKE 422
Query: 425 SVTIGDGNKLKIAFIGNSCLAAEHKNF-KLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFH 484
VT+G+G+KL I+ GN L + + KL ++L VP IAKN +SVSKL DN+V +EFH
Sbjct: 423 KVTVGNGDKLVISHFGNGKLYTKTGQWLKLNEMLLVPIIAKNFLSVSKLTTDNDVIIEFH 482
Query: 485 DGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVL 544
C VKD T +VLL+G+L +GLY+ + T N+ S ++
Sbjct: 483 SNSCFVKDIATRRVLLQGMLKDGLYQ------LQTPRNK----------------SAYLR 542
Query: 545 SSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPN 604
ST K +V CN P +V+ FC+ACQYGKSH+LPF +
Sbjct: 543 FST---------------------SKFIVSDCN-PFTVDH---FCDACQYGKSHSLPFKH 602
Query: 605 STSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHF 664
S S AL DLVHTDLWGP+P+ S FKYY+ F+DD +RF WIYPLK K+EA +AF+ F
Sbjct: 603 SNSKALKVLDLVHTDLWGPSPITSNQDFKYYVHFVDDCTRFTWIYPLKNKSEACDAFLAF 662
Query: 665 TALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVE 724
+L +NQF IKAL+TD GGEY + G+ SCPHTS QNGRAERKHRH+VE
Sbjct: 663 KSLAENQFERKIKALRTDGGGEYQVLSDFVVTHGINFHHSCPHTSSQNGRAERKHRHIVE 722
Query: 725 TGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSA 784
GLTLLAQ+ MPLK+W DAF AV LIN LP+ +L K+P E+L+ K D++ L+ FG A
Sbjct: 723 MGLTLLAQSIMPLKYWWDAFSTAVYLINRLPTPILDHKTPFEMLHKKIPDYKFLKTFGVA 782
Query: 785 CFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGF 844
CFPCLR YQAHKFQ+H+ KCV LG S HKG+KCLS +GR++I R V FNE EFPF F
Sbjct: 783 CFPCLRPYQAHKFQFHSIKCVNLGYSDAHKGYKCLSPTGRIYILRDVVFNELEFPFQISF 842
Query: 845 ------QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPS 904
+ +V +S+ +V PS+ P+TG + PS
Sbjct: 843 LNNYQSENLVIIQSSTWSVLPSVS--------------------PATG---SRFTSANPS 902
Query: 905 NSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVA-----ENHCSASESSASTQSSN---L 964
+SS + +P +PQS P+ V A N E +A + +
Sbjct: 903 SSSQEAEP----STPQSRNFQNPNSREPTSVAEALAQKGWNKAMNEEIAALKTNKTYVLV 962
Query: 965 PPTPSTVQAGHPMVTRGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAMDID- 1024
PP PS G+ V R K + D ++ R+ L + + + ID
Sbjct: 963 PPAPSQNLIGNKWVFREKFNL------------DGTL---QRLKARLVAKGFHQRLGIDF 1022
Query: 1025 -----PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSN 1084
PV+KAST+R+IL+IAV+KGW +RQLD NNAFLNG L+EDV+M QP G+E+ +
Sbjct: 1023 GETYSPVIKASTVRIILAIAVSKGWDIRQLDINNAFLNGTLEEDVFMAQPEGFEEKGKEH 1082
Query: 1085 YVCKLDKAIYGLKQAPRAWNNT-------------------------------------- 1144
+VCKL+K++YGLKQA RAW+ T
Sbjct: 1083 FVCKLNKSLYGLKQALRAWDETGLFLTQTKYIEDLLKRTKFINLKSCPTPAIAGKPMSQN 1142
Query: 1145 -------------------------------------------------------LKATL 1204
L TL
Sbjct: 1143 DGEPLRDITTYKSIIGRLQYLCHTRPDIAYAVNKLNTDWACCPDDRKSVAGYCVYLGNTL 1178
Query: 1205 LSWGYSIIFKYRDSSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGA 1216
+SW SS S AL S ++ W+ L+ EIGFP V WCDN+ A A
Sbjct: 1203 VSWSSKKQHVVSRSS-TESEYRALPHVSAKISWIESLLKEIGFP-LKTVVTWCDNLGASA 1178
BLAST of Lag0039869 vs. ExPASy TrEMBL
Match:
A0A2Z6P4D5 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_412550 PE=4 SV=1)
HSP 1 Score: 789.6 bits (2038), Expect = 2.3e-224
Identity = 524/1496 (35.03%), Positives = 717/1496 (47.93%), Query Frame = 0
Query: 16 TTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAA 75
+++ NSP N L I ++KLDR N+ LWK+L L ++R KL+G++ G + CPE+FV +A
Sbjct: 2 SSAANSPKKND-LPSIISVKLDRDNYPLWKSLVLSLIRGCKLDGYILGTTECPEQFVTSA 61
Query: 76 SASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVAT 135
S +++NP + W+A DQ LLGWL NSM ++AT
Sbjct: 62 DKS--------------------------KKVNPDFGDWIANDQALLGWLMNSMAIDIAT 121
Query: 136 QVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQ 195
Q++ E +++LW Q L G +++ YL+ F +RKG KM +YL MKN +D L
Sbjct: 122 QLLHCETSKQLWDETQSLAGAHTKSRITYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKL 181
Query: 196 AGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSS 255
AGSP++ L+ Q L GLD EYNPVV + D+ +SW ++QA+LL FE RL+ NN S
Sbjct: 182 AGSPISNSDLMIQTLNGLDAEYNPVVVKLSDQINLSWVDVQAQLLAFESRLDQFNNF-SG 241
Query: 256 LSLSPGASVNMANSRD-SGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKP 315
L+L+ AS N AN + GN+ N RGN R + RG RG G SN
Sbjct: 242 LTLN--ASANFANKTEFRGNKFN-----------SRGNWRRSNFRGMRGGRGKGRMSNTK 301
Query: 316 VCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSC 375
CQVC TGH A+ C RF++ + G N + QG +AF
Sbjct: 302 -CQVCNGTGHIAVDCSYRFDRPYT----------GRNYSTEADKQGSHSAF--------- 361
Query: 376 VAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLA 435
+A+P D WY DSGA+NHVT + E+ GK S+ +G+G KLKI G++ L
Sbjct: 362 IASPYHGQDYEWYFDSGANNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKL- 421
Query: 436 AEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDE 495
N L VL VP I KNL+SVSKL DN + VEF C VKDK TG+ LLKG L +
Sbjct: 422 ---NNLNLHDVLYVPQITKNLLSVSKLTADNNILVEFDANCCSVKDKLTGQTLLKGRLKD 481
Query: 496 GLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSE 555
GLY+ +N+ P C ++ K+ WH++LGHP+
Sbjct: 482 GLYQL---------SNKEP-CVYMSVKE---------------------SWHRKLGHPNN 541
Query: 556 EVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPL 615
+VL V+K CN+ +S ++ F FCEACQ+GK H LPF S+SH + L+H+D+WGPAP+
Sbjct: 542 KVLDKVLKDCNVKISHSDQFSFCEACQFGKLHLLPFKPSSSHVQEPLALIHSDVWGPAPI 601
Query: 616 DSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGE 675
S +GFKYY+ FIDDFSRF WI+PLK+K++ + AF+ F L +NQFN IK +Q D GGE
Sbjct: 602 LSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGE 661
Query: 676 YVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLV 735
Y + ++ E G++ R+SCP+TSQQNGRAERKHRHV E GLTLLAQA MPL++W +AF
Sbjct: 662 YKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVAELGLTLLAQAKMPLRYWWEAFST 721
Query: 736 AVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVY 795
AV LIN LPS V +SP L++ ++ D+ AL+ FG AC+PCL+ Y HK Q+HT +CV+
Sbjct: 722 AVYLINRLPSSVNPNESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVF 781
Query: 796 LGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGF----QQVVTTAASSSNVSPSIP 855
+G S HKG+KC++S GR+F+SRHV FNE FPF GF + T +SS + P+
Sbjct: 782 VGYSNSHKGYKCINSHGRIFVSRHVIFNENHFPFHGGFLDTKNPLKTLTDNSSILLPTC- 841
Query: 856 LWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHE 915
T Q+ P + ++ T S S+D ++ + T+
Sbjct: 842 ---------SAGATTQDAIEPDNNTTSDQ---NTHSIESSDNNENEEQVDSSEFFVNTNN 901
Query: 916 DSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAG-HPMVTRGKAGIFKPKL-WLTH 975
S +Q A+N + + + ST + + + H M TR K GI KPK+ ++
Sbjct: 902 SS--TQDIEADNSVDSEDRNNSTMTGTIQQQAQQDNSNTHWMRTRSKDGIHKPKIPYVGM 961
Query: 976 ACTDWSVTEPTRIADALATPQWREAMD--------------------------------- 1035
A TD EP + +AL P W+EAMD
Sbjct: 962 AETDSEEKEPKSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTK 1021
Query: 1036 ------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFN 1095
PVVK+ST+R+IL+IAV W++RQLD N
Sbjct: 1022 YKSDGSIERRKARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDIN 1081
Query: 1096 NAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY- 1155
NAFLNG+L E V+M QP GY DA N++CKL KAIYGLKQAPRAW ++L++TL++WG+
Sbjct: 1082 NAFLNGKLKETVFMHQPEGYIDAAKPNHICKLSKAIYGLKQAPRAWYDSLRSTLVNWGFQ 1141
Query: 1156 ------SIIF-------------------------------------------------- 1208
S+ F
Sbjct: 1142 NAKNDTSLFFLKGADHTTFLLIYVDDIIVTGSNIKFLEAFTNQLNTAYSLKDLGPLHYFL 1201
BLAST of Lag0039869 vs. ExPASy TrEMBL
Match:
A0A2K3MUJ9 (Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g017679 PE=4 SV=1)
HSP 1 Score: 775.8 bits (2002), Expect = 3.4e-220
Identity = 463/1123 (41.23%), Postives = 618/1123 (55.03%), Query Frame = 0
Query: 28 LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAG 87
L ++KLDR NF LWK+L LP++R K +G++ G CP++FV S++ T
Sbjct: 12 LPSTVSVKLDRDNFPLWKSLVLPLIRGCKYDGYMLGTKKCPDQFV----TSIDNT----- 71
Query: 88 ATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELW 147
++NP Y+ W A DQ LLGWL NSMT ++ATQV+ E +++LW
Sbjct: 72 -----------------EKINPDYQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLW 131
Query: 148 AAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLIS 207
Q L G +R+ YL+ F + K KM YL MKN AD L AGSP+++ L+
Sbjct: 132 DEAQSLAGAHTRSRIIYLKSEFHNTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMI 191
Query: 208 QVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMA 267
Q L GLD EYNPVV + D+ ISW + QA+LL FE RL+ NN +++ AS N A
Sbjct: 192 QTLNGLDSEYNPVVVKLSDQTNISWVDFQAQLLAFESRLDQLNNFN---NINLNASANFA 251
Query: 268 NSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL 327
+ +SG + +G R G+ N RG G GRGRAR S +P+CQ+CGK GHTA
Sbjct: 252 SKNESGGNK---FGSR--GGWRGSNSRGMRG-GRGRAR--MSKPPRPICQICGKFGHTAA 311
Query: 328 MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWY 387
CY RF+K + + + G+G +AF VA+P D WY
Sbjct: 312 QCYYRFDKSY------------TEKNHYAEGEGSHSAF---------VASPYHGQDYEWY 371
Query: 388 ADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLC 447
DSGASNHVT L + E GK S+ +G+G KLKI G++ L + L+ VL
Sbjct: 372 FDSGASNHVTHQSGQLQDLNENNGKNSLLVGNGEKLKILASGSTKL----NDVNLRNVLY 431
Query: 448 VPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTT 507
VP I KNL+SVSKL DN VEF + +C VKDK TGK LLKG L +GLY+ K T
Sbjct: 432 VPEITKNLLSVSKLTIDNNALVEFDENYCYVKDKLTGKALLKGRLKDGLYQLSANKEPPT 491
Query: 508 DTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLP 567
+ + +L K IWH++LGHP+ +VL+ V+K N+
Sbjct: 492 NKDPCAYISL------------------------KEIWHRKLGHPNNKVLEKVLKDNNVK 551
Query: 568 LSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI 627
+S ++ F FCEACQ+GK H LPF S+SHA + DL+HTD+WGPAP+ S + FKYY+ F+
Sbjct: 552 ISPSDKFTFCEACQFGKLHLLPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNFKYYVHFL 611
Query: 628 DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGV 687
DDFSRF WI+PLK+K+E + AF F LV+NQFN IK ++ D GGEY + + + G+
Sbjct: 612 DDFSRFTWIFPLKQKSETIHAFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQKCAIDSGI 671
Query: 688 KIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVL 747
+ ++SCP+TSQQNGRAERKHRHV E GLTLLAQA MPL +W +AF AV LIN LPS V
Sbjct: 672 QFQMSCPYTSQQNGRAERKHRHVTELGLTLLAQAKMPLSYWWEAFSTAVYLINRLPSSVN 731
Query: 748 GGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCL 807
+SP L++ K+ D+ AL+ FG AC+PCL+ Y HK Q+HT +CV+LG S HKG+KC+
Sbjct: 732 PNESPYTLVFKKEPDYTALKPFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCV 791
Query: 808 SSSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQN 867
+S GR+F+SRHV FNE FPF GF + T V+ P+ F S P TT
Sbjct: 792 NSHGRVFVSRHVVFNENHFPFQEGF---LDTRNPIKVVTNDTPIGFP--SFPAGITTNNT 851
Query: 868 REVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS 927
E V E P N+ D Q + ++ + E A
Sbjct: 852 AEATDNIVDQQE-PELNDINTVAD-QSVESDTFEHTDENNFSNGETEDSTEAAGRESMEE 911
Query: 928 ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADA 987
S T+++ PP + H M TR KAG++KPKL LT + EP +++A
Sbjct: 912 ISQPITETN--PPPQQDITNTHWMRTRSKAGVYKPKLPYIGLTEEAKEGK--EPESVSEA 971
Query: 988 LATPQWREAMDID----------------------------------------------- 1047
L+ P+W AMD +
Sbjct: 972 LSIPEWLNAMDAEYKALMNNKTWTLVPFEGQENVISSKWIFKTKYKADGTIERRKARLVA 1031
Query: 1048 ----------------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQ 1085
PVVK+ST+R+ILSIAV W++RQLD NNAFLNG L E V+M Q
Sbjct: 1032 RGFQQTAGVDYDETFSPVVKSSTVRIILSIAVHLSWEVRQLDINNAFLNGNLKESVFMHQ 1037
BLAST of Lag0039869 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )
HSP 1 Score: 89.7 bits (221), Expect = 2.2e-17
Identity = 45/102 (44.12%), Positives = 65/102 (63.73%), Query Frame = 0
Query: 998 PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYE----DAKCSNY 1057
PV K +++++IL+I+ + L QLD +NAFLNG LDE++YM+ PPGY D+ N
Sbjct: 169 PVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNA 228
Query: 1058 VCKLDKAIYGLKQAPRAWNNTLKATLLSWGYSIIFKYRDSSY 1096
VC L K+IYGLKQA R W TL+ +G+ + + D +Y
Sbjct: 229 VCYLKKSIYGLKQASRQWFLKFSVTLIGFGF--VQSHSDHTY 268
HSP 2 Score: 70.9 bits (172), Expect = 1.1e-11
Identity = 52/155 (33.55%), Positives = 75/155 (48.39%), Query Frame = 0
Query: 1058 DKAIYGLKQAPRAWNN---TLKATLLSWGYSIIFKYRDSSYRSSNISALALASTEVIWLR 1117
D + K R+ N L +L+SW S + S + AL+ A+ E++WL
Sbjct: 447 DASFQSCKDTRRSTNGYCMFLGTSLISW-KSKKQQVVSKSSAEAEYRALSFATDEMMWLA 506
Query: 1118 QLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQ-VLQGKLEVRY 1177
Q E+ P ++L+CDN +A +A N VFH RTKHIE D H VR++ V Q L +
Sbjct: 507 QFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVYQATLSYSF 566
Query: 1178 IPSNEQPADCLTKTLS---HSQFAYLRCKLGVAEL 1206
+EQ D T+ LS Y+ G+A L
Sbjct: 567 QAYDEQ--DGFTEYLSPILRGTIMYIVSMFGLAGL 598
BLAST of Lag0039869 vs. TAIR 10
Match:
ATMG00300.1 (Gag-Pol-related retrotransposon family protein )
HSP 1 Score: 63.2 bits (152), Expect = 2.2e-09
Identity = 28/67 (41.79%), Positives = 37/67 (55.22%), Query Frame = 0
Query: 544 IWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDL 603
+WH RL H S+ ++ +VK L S S KFCE C YGK+H + F + D
Sbjct: 71 LWHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDY 130
Query: 604 VHTDLWG 611
VH+DLWG
Sbjct: 131 VHSDLWG 137
BLAST of Lag0039869 vs. TAIR 10
Match:
AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 53.1 bits (126), Expect = 2.3e-06
Identity = 47/218 (21.56%), Positives = 85/218 (38.99%), Query Frame = 0
Query: 1319 PLNIKLSDSNYMLWKNQLLNHIIAFDMESFIDGTPPPVK-------------------YL 1378
P+ + + +SNY W+ L H ++FD+ IDGT P L
Sbjct: 21 PVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGTLLPTNANDVNWQKRDGIVKLSLYGTL 80
Query: 1379 DPTQTQ-------------------------LQKIRKDS---------LPVSQYLAQIKD 1438
P Q Q + +R DS + V+ Y ++K
Sbjct: 81 TPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARALRLDSELRTKDIGDMRVADYYRKMKK 140
Query: 1439 ISDKFSAIDEPLSYRDHLAYILEGFGPEYNPFVTSIQNRTDRPSLVDVRSLLLAYDAKLE 1484
++D +D P++ R+ + Y+L G P+++ + I++R PS D ++L + +L+
Sbjct: 141 LADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPSFDDAATMLQEEEDRLK 200
BLAST of Lag0039869 vs. TAIR 10
Match:
AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 51.6 bits (122), Expect = 6.6e-06
Identity = 60/271 (22.14%), Positives = 118/271 (43.54%), Query Frame = 0
Query: 33 TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSG 92
T+ L++ N+ +W+ L + S+ + GH+ G+S
Sbjct: 25 TLDLNKLNYDVWRELFETLCLSFGVLGHIDGSS--------------------------- 84
Query: 93 AVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYE-NAQELWAAIQ 152
TP E + W D L+ W+Y ++T + ++ A++LW +++
Sbjct: 85 -------TPTPMTE-----KRWKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLE 144
Query: 153 ELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLL 212
LF A + + + + +Y + +K+ +D L SP++ R L+ +L
Sbjct: 145 NLFRDNKEARALQFENELRTTTIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLN 204
Query: 213 GLDEEYNPVVAMIQDRAGI-SWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVN---MA 272
GL E+Y+ ++ +I+ ++ S++E ++ LL+ E R L N KSSLS + S++
Sbjct: 205 GLTEKYDYILNVIKHKSPFPSFTEARSMLLMEESR--LSNKSKSSLSHTNHPSLSNVLFT 254
Query: 273 NSRDSGNQRNQSYGGRTNNGFGRGNQRGAGG 299
R + + +N G GR ++ GG
Sbjct: 265 VPRQQERYPQEYHNNNSNMGRGRSKKKNRGG 254
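The percentages in each HSP header follow directly from the counts: identity is identical positions divided by alignment length, and positives is conservative matches divided by the same length. The sketch below is not part of the BLAST output. It is a minimal Python illustration, assuming the two header lines have exactly the layout shown in this report; the function name `parse_hsp_header` and the returned field names are hypothetical. The pattern accepts the label spelled either "Positives" or "Postives".

```python
import re

# Patterns for the two HSP header lines as they appear in this report.
HSP_SCORE = re.compile(
    r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)"
)
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_header(score_line, identity_line):
    """Parse one HSP header and recompute its percentages from the counts."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "expect": float(s.group(4)),
        "alignment_length": length,
        # Recomputed values; compare with the reported ones below.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        "reported_identity_pct": float(i.group(3)),
        "reported_positives_pct": float(i.group(6)),
    }


if __name__ == "__main__":
    header = parse_hsp_header(
        "HSP 2 Score: 77.0 bits (188), Expect = 2.1e-12",
        "Identity = 45/134 (33.58%), Postives = 75/134 (55.97%), Query Frame = 0",
    )
    print(header["identity_pct"], header["positives_pct"])
```

Comparing the recomputed and reported percentages is a cheap sanity check that a header line was not damaged during extraction.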
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity | Description
GAU19483.1 | 1.9e-241 | 36.79 | hypothetical protein TSUD_77270 [Trifolium subterraneum]
GAU51268.1 | 4.8e-224 | 35.03 | hypothetical protein TSUD_412550 [Trifolium subterraneum]
PNX94503.1 | 7.1e-220 | 41.23 | putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense...
KYP50444.1 | 2.9e-213 | 42.73 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
PNY01489.1 | 4.3e-201 | 38.80 | copia-like polyprotein, partial [Trifolium pratense]

ExPASy Swiss-Prot
Match Name | E-value | Identity | Description
Q94HW2 | 2.4e-138 | 27.34 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 1.1e-135 | 26.71 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978 | 4.7e-73 | 26.92 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 6.2e-41 | 21.98 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q12491 | 2.2e-22 | 26.98 | Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A2Z6MBG6 | 9.3e-242 | 36.79 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A803PM38 | 4.3e-239 | 37.24 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803QCY3 | 7.9e-225 | 38.62 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A2Z6P4D5 | 2.3e-224 | 35.03 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A2K3MUJ9 | 3.4e-220 | 41.23 | Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium prat...

TAIR 10
Match Name | E-value | Identity | Description
AT4G23160.1 | 2.2e-17 | 44.12 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00300.1 | 2.2e-09 | 41.79 | Gag-Pol-related retrotransposon family protein
AT1G34070.1 | 2.3e-06 | 21.56 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT5G48050.1 | 6.6e-06 | 22.14 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
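The pipe-delimited summary rows above can be loaded into records for sorting or filtering by E-value. The following is a minimal Python sketch, not something supplied by the database: the function name `parse_summary_rows` and the record keys are hypothetical, and it assumes each data row carries the match name, E-value, identity and description in its first four cells. Header rows, caption lines and any trailing cells are skipped or ignored.

```python
def parse_summary_rows(lines):
    """Turn 'Match | E-value | Identity | Description' rows into dicts.

    Lines with fewer than four cells, or whose second and third cells
    are not numeric (for example the header row), are skipped.
    """
    rows = []
    for line in lines:
        cells = [cell.strip() for cell in line.split("|")]
        if len(cells) < 4:
            continue
        try:
            evalue = float(cells[1])
            identity = float(cells[2])
        except ValueError:
            continue  # header row or other non-data line
        rows.append(
            {
                "match": cells[0],
                "evalue": evalue,
                "identity": identity,
                "description": cells[3],
            }
        )
    return rows


if __name__ == "__main__":
    sample = [
        "Match Name | E-value | Identity | Description",
        "ATMG00300.1 | 2.2e-09 | 41.79 | Gag-Pol-related retrotransposon family protein",
        "AT4G23160.1 | 2.2e-17 | 44.12 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8",
    ]
    # Smallest E-value first, i.e. most significant hit first.
    for row in sorted(parse_summary_rows(sample), key=lambda r: r["evalue"]):
        print(row["match"], row["evalue"], row["identity"])
```

Sorting on the parsed float rather than on the text matters here, because E-values such as 2.2e-09 and 2.2e-17 do not order correctly as strings.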