Homology
BLAST of Lag0007984 vs. NCBI nr
Match:
GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])
HSP 1 Score: 1119.4 bits (2894), Expect = 0.0e+00
Identity = 628/1407 (44.63%), Positives = 845/1407 (60.06%), Query Frame = 0
Query: 4 NPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLRQ 63
N + W A DQ LLGW+ NSMT E+ATQ++ E +K LW Q L G +R++ +L+
Sbjct: 67 NSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKS 126
Query: 64 TFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGR 123
F RKG +KM DYL MK D L L G+P+S +L+ Q L GLD EYN VV + +
Sbjct: 127 EFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLNGLDSEYNPVVVKLSDQ 186
Query: 124 ANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQG 183
+SW +LQA+LL FE R+E N NATAN+A N S + ++N N +G
Sbjct: 187 TTLSWVDLQAQLLTFESRIE---QLNNLTNLTLNATANVA----NRSDHRGKSSNNNWRG 246
Query: 184 YNNGHQRGNGYGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQNRNT 243
N+ RG GRGRG + N CQVCG H A+ C+HRFDK +S N +
Sbjct: 247 SNSRGWRG--------GRGRGKSGKN---PCQVCGLSNHIAIDCFHRFDKTYSR-SNHSA 306
Query: 244 GNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYENIANPT 303
G+ + + NAF+ +Q ++ D WY DSGASNHVT+ E + T
Sbjct: 307 GHDKQGSH-----------NAFLASQ-----NSVEDYDWYFDSGASNHVTHQTEKFQDLT 366
Query: 304 DYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQDNNV 363
++ GK + VGNG+KL+I + G+S L LNL ++L VP I KNL+S+SKLA DNN+
Sbjct: 367 EHHGKNSLVVGNGEKLAILATGSSKLKS----LNLHDILYVPNITKNLLSVSKLAADNNI 426
Query: 364 YIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNSAFI 423
+EF + C VKDK +G+V+LKG LKDGLYQL S + N SAF+
Sbjct: 427 LVEFDENCCFVKDKLTGKVILKGLLKDGLYQL------------SGTKRNP-----SAFV 486
Query: 424 VSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKAHAL 483
K WHRRLGHP+ KVLD +++ CK++V ++ FC++CQ+GK H L
Sbjct: 487 -----------SVKESWHRRLGHPNNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLL 546
Query: 484 PFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDTVQA 543
PF S+S A + +L+HTDVWGPAPI++ G++YY F+DD SR+ W+YPLKQKS+TVQA
Sbjct: 547 PFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQA 606
Query: 544 FNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAERKHR 603
F + + QF IK +Q D GGEY P+ K+ + GI+ R+SCP+TS QNGRAERKHR
Sbjct: 607 FIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHR 666
Query: 604 HVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEMLKT 663
H+ E GLTLLAQA MPL +WW+A TA LIN LP+ V Q +SP LM K+ +Y++LKT
Sbjct: 667 HITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLKT 726
Query: 664 FGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESEFPF 723
FGC+CYPCL+PY++HK YHT RCVFLG S SHKGY+C+N GR+FISRHV FNE FPF
Sbjct: 727 FGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPF 786
Query: 724 ATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPTSP 783
GF + S + PS FP N ++M
Sbjct: 787 HDGFLNTRSPLKTTINVPSTS--FPLCTAGNVIDDASM---------------------- 846
Query: 784 FPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPS-SNPVQPSQISATTP 843
P L+ P S + ++ T+ N PS N + T
Sbjct: 847 -PILEAENPAETNTEDSQDVNSDTEQTN-------------NGPSEDNTTHEETLDITQQ 906
Query: 844 ISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPK---AWLTQQHTDWSLTE 903
S+ S T+ +H + TR K+GI KPK LT+ + D E
Sbjct: 907 QSVGEASQNTNT---------------SHAIHTRSKSGIHKPKLPYIGLTETYKD--TME 966
Query: 904 PTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQR 963
P ++A+S P WK+AM E+ ALM N+TW+LVP N+V +KW+F+ K DG+++R
Sbjct: 967 PANAKEALSRPLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLER 1026
Query: 964 YKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILV 1023
KARLVAKGF Q G+D+ ETFSPV+KAST+R+++S+AV W +RQLD NNAFLNG L
Sbjct: 1027 RKARLVAKGFQQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLK 1086
Query: 1024 EDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLY 1083
E V+M QP G+ D T P ++CKL KAIYGLKQAPRAW +LK+ LL+WGF N++SDSSL+
Sbjct: 1087 ETVFMHQPEGFVDSTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDSSLF 1146
Query: 1084 IFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPS 1143
+ K + + LL+YVDD+++TG+N K + I +L+ F+LKDLG L+YFLGI+V S
Sbjct: 1147 LLKGKDHITFLLIYVDDIIVTGSNGKFLQAFIKQLNDAFSLKDLGHLHYFLGIEVQRDAS 1206
Query: 1144 GLILNQAKY-----------------TNLVN------DGKLLEDPFLYRSTIGALQYLTY 1203
G+ L Q+KY T ++ +G+ L+DP ++R IG LQYLT+
Sbjct: 1207 GMYLKQSKYIGDLLKKFKMDNASPCPTPMITGRQFTVEGEKLKDPTVFRQAIGGLQYLTH 1266
Query: 1204 TRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADW 1263
T PDI+ VN+LSQ++ SP+ HWQ +KR+LRY+ GT + L +PST I+ FSDADW
Sbjct: 1267 TTPDIAFSVNKLSQYMSSPSIDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADW 1326
Query: 1264 ASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIWIQQLLTE 1323
A++IDDR+S++ CVF+G L+SWSS+KQ VV+RSSTESEYRALA +AEI WI+ LLTE
Sbjct: 1327 ATSIDDRKSMSGQCVFLGETLISWSSRKQKVVSRSSTESEYRALADLAAEIAWIRSLLTE 1351
Query: 1324 IGC------------LSSLDPSSGVTI----------------SVLRGSLDVRYVPSYDQ 1356
+ LS+ +S + VL+ + V YVP+ DQ
Sbjct: 1387 LELPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNEVVVAYVPTTDQ 1351
BLAST of Lag0007984 vs. NCBI nr
Match:
GAU51268.1 (hypothetical protein TSUD_412550 [Trifolium subterraneum])
HSP 1 Score: 1066.2 bits (2756), Expect = 2.4e-307
Identity = 610/1426 (42.78%), Positives = 826/1426 (57.92%), Query Frame = 0
Query: 3 MNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLR 62
+NP + W+A DQ LLGWL NSM ++ATQ++ E +K LW Q L G +++ +L+
Sbjct: 66 VNPDFGDWIANDQALLGWLMNSMAIDIATQLLHCETSKQLWDETQSLAGAHTKSRITYLK 125
Query: 63 QTFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQG 122
F TRKG +KM +YL MK +D L L GSPISN +L+ Q L GLD EYN VV +
Sbjct: 126 SEFHNTRKGEMKMEEYLIKMKNLSDKLKLAGSPISNSDLMIQTLNGLDAEYNPVVVKLSD 185
Query: 123 RANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQ 182
+ N+SW ++QA+LL FE RL+ N NA+AN A NK T GN+
Sbjct: 186 QINLSWVDVQAQLLAFESRLD---QFNNFSGLTLNASANFA-NK--------TEFRGNK- 245
Query: 183 GYNNGHQRGNGYGNRYRGR--GRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQN 242
+N+ RGN + +RG GRG +N CQVC GH AV C +RFD+ P
Sbjct: 246 -FNS---RGNWRRSNFRGMRGGRGKGRMSN-TKCQVCNGTGHIAVDCSYRFDR---PYTG 305
Query: 243 RNTGNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYENIA 302
RN TE+ S+ +AF+ A+P D WY DSGA+NHVT+ +
Sbjct: 306 RN--YSTEADKQGSH-------SAFI-----ASPYHGQDYEWYFDSGANNHVTHQTDKFQ 365
Query: 303 NPTDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQD 362
++ GK + VGNG+KL I + G++ L + LNL +VL VP+I KNL+S+SKL D
Sbjct: 366 GFNEHNGKNSLMVGNGEKLKIVASGSTKLNN----LNLHDVLYVPQITKNLLSVSKLTAD 425
Query: 363 NNVYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNS 422
NN+ +EF + C VKDK +GQ LLKG LKDGLYQL
Sbjct: 426 NNILVEFDANCCSVKDKLTGQTLLKGRLKDGLYQL------------------------- 485
Query: 423 AFIVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKA 482
SN P V ++V K WHR+LGHP+ KVLD ++KDC +++ ++ FC++CQFGK
Sbjct: 486 ----SNKEPCVYMSV-KESWHRKLGHPNNKVLDKVLKDCNVKISHSDQFSFCEACQFGKL 545
Query: 483 HALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDT 542
H LPF S+S + LIH+DVWGPAPILS G++YY F+DD SR+ W++PLKQKSDT
Sbjct: 546 HLLPFKPSSSHVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDT 605
Query: 543 VQAFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAER 602
+ AF + + QF IK +Q D GGEY + KV + GI+ R+SCP+TS QNGRAER
Sbjct: 606 IHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAER 665
Query: 603 KHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEM 662
KHRHV E GLTLLAQA MPL +WW+A TA LIN LP++V +SP LM+ ++ +Y
Sbjct: 666 KHRHVAELGLTLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKREPDYNA 725
Query: 663 LKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESE 722
LK FGC+CYPCL+PY++HK +HT RCVF+G S SHKGY+C+N GR+F+SRHV FNE+
Sbjct: 726 LKPFGCACYPCLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVIFNENH 785
Query: 723 FPFATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNP 782
FPF GF + + + SIL LP ++ +T P++
Sbjct: 786 FPFHGGFLDTKNPLKTLTDNSSIL-------LPTCSAGATTQDAIEPDN----------- 845
Query: 783 TSPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISAT 842
NT S + S++S + ++ Q+ S ++N I A
Sbjct: 846 --------------NTTSDQNTHSIESSDNN---ENEEQVDSSEFFVNTNNSSTQDIEAD 905
Query: 843 TPISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPK-AWLTQQHTDWSLTE 902
S+ S A TH M TR K GI KPK ++ TD E
Sbjct: 906 N--SVDSEDRNNSTMTGTIQQQAQQDNSNTHWMRTRSKDGIHKPKIPYVGMAETDSEEKE 965
Query: 903 PTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQR 962
P V++A+ P WK+AMD EY AL+ N TW LVP N++ +KWIF+ K +DG+++R
Sbjct: 966 PKSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTKYKSDGSIER 1025
Query: 963 YKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILV 1022
KARLVAKGF Q G+DF ETFSPVVK+ST+R+++++AV W +RQLD NNAFLNG L
Sbjct: 1026 RKARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLK 1085
Query: 1023 EDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLY 1082
E V+M QP GY D P ++CKL KAIYGLKQAPRAW +L+S L++WGF N+++D+SL+
Sbjct: 1086 ETVFMHQPEGYIDAAKPNHICKLSKAIYGLKQAPRAWYDSLRSTLVNWGFQNAKNDTSLF 1145
Query: 1083 IFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPS 1142
K LL+YVDD+++TG+N+K + +L+ ++LKDLG L+YFLG++VH S
Sbjct: 1146 FLKGADHTTFLLIYVDDIIVTGSNIKFLEAFTNQLNTAYSLKDLGPLHYFLGVEVHRDDS 1205
Query: 1143 GLILNQAKY-----------------------TNLVNDGKLLEDPFLYRSTIGALQYLTY 1202
G+ L Q KY + +G+L+ +P LYR IGALQYLT
Sbjct: 1206 GMYLRQTKYIRDVLKKFNMENTSACPTPMVTGRQFIAEGELMSNPTLYRQAIGALQYLTN 1265
Query: 1203 TRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADW 1262
TRPDI+ VN+LSQ++ +PT HWQ +KR+LRY+ GTK+ L +PST+ I+ F DADW
Sbjct: 1266 TRPDIAFAVNKLSQYMSTPTIEHWQGIKRILRYLQGTKNHSLHIKPSTNLHIAGFLDADW 1325
Query: 1263 ASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAE---------- 1322
A++ DDR+S CVF+G LVSW+S+KQ VV+RSSTESEYR+LA AE
Sbjct: 1326 ATSTDDRKSTGGQCVFLGETLVSWASRKQKVVSRSSTESEYRSLADLVAEVSTSSVATLL 1385
Query: 1323 ----------------------------IIWIQQLLTEIGCLSSLDPSSGVTI------- 1356
++W L + + + + I
Sbjct: 1386 SSERFLLAHFSTRFTLLEELKLPILRKPVLWCDNLSAKALASNPVMHARSKHIEIDMHYI 1385
BLAST of Lag0007984 vs. NCBI nr
Match:
PNX94503.1 (putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense])
HSP 1 Score: 1037.7 bits (2682), Expect = 9.0e-299
Identity = 583/1276 (45.69%), Positives = 780/1276 (61.13%), Query Frame = 0
Query: 3 MNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLR 62
+NP Y W A DQ LLGWL NSMT ++ATQV+ E +K LW Q L G +R+ +L+
Sbjct: 65 INPDYQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLWDEAQSLAGAHTRSRIIYLK 124
Query: 63 QTFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQG 122
F T K +KM YL MK AD L L GSPIS+ +L+ Q L GLD EYN VV +
Sbjct: 125 SEFHNTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMIQTLNGLDSEYNPVVVKLSD 184
Query: 123 RANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQ 182
+ N+SW + QA+LL FE RL+ Q+++ N + N NA+AN A + GN+
Sbjct: 185 QTNISWVDFQAQLLAFESRLD-QLNNFNNI--NLNASANFA---------SKNESGGNKF 244
Query: 183 GYNNGHQRGNGYGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQNRN 242
G G + N G R GRGR + RP CQ+CGK GH+A CY+RFDK ++ +
Sbjct: 245 GSRGGWRGSNSRGMR-GGRGRARMSKPPRPICQICGKFGHTAAQCYYRFDKSYTEKNHYA 304
Query: 243 TGNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYENIANP 302
G G+ S AF+ A+P D WY DSGASNHVT+ + +
Sbjct: 305 EGEGSHS--------------AFV-----ASPYHGQDYEWYFDSGASNHVTHQSGQLQDL 364
Query: 303 TDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQDNN 362
+ GK + VGNG+KL I + G++ L D +NL NVL VPEI KNL+S+SKL DNN
Sbjct: 365 NENNGKNSLLVGNGEKLKILASGSTKLND----VNLRNVLYVPEITKNLLSVSKLTIDNN 424
Query: 363 VYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNSAF 422
+EF ++C VKDK +G+ LLKG LKDGLYQL SA+ ++ A+
Sbjct: 425 ALVEFDENYCYVKDKLTGKALLKGRLKDGLYQL------------SANKEPPTNKDPCAY 484
Query: 423 IVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKAHA 482
I SL K IWHR+LGHP+ KVL+ ++KD +++ ++ FC++CQFGK H
Sbjct: 485 I--------SL---KEIWHRKLGHPNNKVLEKVLKDNNVKISPSDKFTFCEACQFGKLHL 544
Query: 483 LPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDTVQ 542
LPF S+S A + DLIHTDVWGPAPILS ++YY FLDD SR+ W++PLKQKS+T+
Sbjct: 545 LPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNFKYYVHFLDDFSRFTWIFPLKQKSETIH 604
Query: 543 AFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAERKH 602
AFN +++ QF IK ++ D GGEY P+ K GI+ ++SCP+TS QNGRAERKH
Sbjct: 605 AFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQKCAIDSGIQFQMSCPYTSQQNGRAERKH 664
Query: 603 RHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEMLK 662
RHV E GLTLLAQA MPLS+WW+A TA LIN LP++V +SP L++ K+ +Y LK
Sbjct: 665 RHVTELGLTLLAQAKMPLSYWWEAFSTAVYLINRLPSSVNPNESPYTLVFKKEPDYTALK 724
Query: 663 TFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESEFP 722
FGC+CYPCL+PY++HK +HT RCVFLG S SHKGY+C+N GRVF+SRHV FNE+ FP
Sbjct: 725 PFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCVNSHGRVFVSRHVVFNENHFP 784
Query: 723 FATGF-GSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPT 782
F GF + + ++ +P FP N T+++T +++V +
Sbjct: 785 FQEGFLDTRNPIKVVTNDTPIGFPSFPAGITTNNTAEAT-------DNIVDQQE------ 844
Query: 783 SPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISATT 842
P+ N + + S++S + + T + N + + + + +
Sbjct: 845 ----------PELNDINTVADQSVESDTFE-----HTDENNFSNGETEDSTEAAGRESME 904
Query: 843 PISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPK---AWLTQQHTDWSLT 902
IS P T ET+ P T TH M TR KAG++KPK LT++ +
Sbjct: 905 EISQPIT--ETNPPPQQDIT-------NTHWMRTRSKAGVYKPKLPYIGLTEEAKEGK-- 964
Query: 903 EPTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQ 962
EP V +A+S P+W AMD EY ALM N+TW LVP NV+ +KWIF+ K ADGT++
Sbjct: 965 EPESVSEALSIPEWLNAMDAEYKALMNNKTWTLVPFEGQENVISSKWIFKTKYKADGTIE 1024
Query: 963 RYKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGIL 1022
R KARLVA+GF Q GVD+ ETFSPVVK+ST+R+++S+AV W +RQLD NNAFLNG L
Sbjct: 1025 RRKARLVARGFQQTAGVDYDETFSPVVKSSTVRIILSIAVHLSWEVRQLDINNAFLNGNL 1084
Query: 1023 VEDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSL 1082
E V+M QP GY D T P ++C+L KAIYGLKQAPRAW L+ LLSWGF N++SDSSL
Sbjct: 1085 KESVFMHQPEGYIDQTKPHHICRLNKAIYGLKQAPRAWFDRLRHTLLSWGFQNTKSDSSL 1144
Query: 1083 YIFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMP 1142
++ K LL+YVDD+++TG+N K + I +L+ F+LKDLG L+YFLGI+VH
Sbjct: 1145 FVLKETDHTTFLLIYVDDIIITGSNNKFLEAFISQLNLVFSLKDLGNLHYFLGIEVHRDS 1204
Query: 1143 SGLILNQAKYT-------NLVN----------------DGKLLEDPFLYRSTIGALQYLT 1202
SG+ L Q KY N+ N +G+ + +P LYR IGALQYLT
Sbjct: 1205 SGMYLTQTKYIRDLLKKFNMENASSCPTPMITGRQFTIEGEPMSNPTLYRQAIGALQYLT 1242
Query: 1203 YTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDAD 1252
TRPDI+ VN+LSQ++ SPT HWQ +KR+LRY+ G+ +LGL +PST I+ FSDAD
Sbjct: 1265 NTRPDIAFAVNKLSQYMCSPTTDHWQGIKRILRYLHGSTNLGLHIKPSTDLDIAGFSDAD 1242
BLAST of Lag0007984 vs. NCBI nr
Match:
KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])
HSP 1 Score: 1023.1 bits (2644), Expect = 2.3e-294
Identity = 562/1275 (44.08%), Positives = 774/1275 (60.71%), Query Frame = 0
Query: 25 MTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLRQTFQQTRKGNLKMADYLRTMKT 84
MT EVATQ++ E ++ +W Q L G +R+ FL+ F +TRKG LKM +YL MK
Sbjct: 1 MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60
Query: 85 HADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGRANVSWSELQAELLVFEKRLEL 144
AD+L L GS +S +LV+Q L GLD EYN +V + + +++W E+QA+LL +E RLE
Sbjct: 61 IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLE- 120
Query: 145 QISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQGYNNGHQRGNGYGNRYRGRGRG 204
QI++++ + N ++N++ N K G N G + G RGRGR
Sbjct: 121 QINNQSNLTL--NPSSNISTILYNRRGKSNAFGGGRGGQINRGARGG-------RGRGRA 180
Query: 205 YNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQNRNTGNGTESGNFQSNRGIGQQPNA 264
+R CQVC K GH+A CYHRF+K + G ++ + ++ NA
Sbjct: 181 ---TKDRIVCQVCCKPGHAASHCYHRFNKNY-------IGQNSDEQKSEKDKEQNYNFNA 240
Query: 265 FMTTQQTATPETLADPSWYADSGASNHVTNNYENIANPTDYRGKECVTVGNGDKLSITSV 324
++ A+P T+ D WY DSGASNHVT + + + GK +TVGNG L I +
Sbjct: 241 YV-----ASPSTVEDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIAC 300
Query: 325 GNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQDNNVYIEFHGDFCLVKDKSSGQVLL 384
G+S L LNL+++L VP+I KNL+S+SKL DN++Y+EFH C VKDK +G++LL
Sbjct: 301 GDSSLDTQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILL 360
Query: 385 KGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNSAFIVSNVVPHVSLAVSKTIWHRRL 444
+G +KDGLYQL +TS +N PHV ++ +T WHR+L
Sbjct: 361 EGKIKDGLYQLPGGSTS-----------------------TNKRPHVFFSIKET-WHRKL 420
Query: 445 GHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKAHALPFPLSNSRAAKKFDLIHTDVW 504
GHP++KVL+ ++K C ++ E +FC++CQFGKAH LPF S S A + DL+H+DVW
Sbjct: 421 GHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLDLVHSDVW 480
Query: 505 GPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDTVQAFNHLLTVIKTQFGCGIKSVQT 564
GPAPI SV G++YY LFLDD SR+ W+YPLKQKSD QAF +++ QF IK++Q
Sbjct: 481 GPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNKRIKTLQC 540
Query: 565 DNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLSHWW 624
D GGE+ + KV + GI+ R SCP+TSAQNGRAERKHRHVVE+GLTLLAQA MPL +WW
Sbjct: 541 DGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAKMPLHYWW 600
Query: 625 DALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEMLKTFGCSCYPCLRPYHKHKFHYHT 684
+A TA LIN LPT V++ KSP + ++ K +Y +KTFGC+CYPCL+PY++HK +HT
Sbjct: 601 EAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPCLKPYNQHKLQFHT 660
Query: 685 ERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESEFPFATGFGSISSANTASSGSPSIL 744
+CVFLG S SHKGY+C+N GR+FISRHV FNE FPF GF NT
Sbjct: 661 TKCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDGF-----LNTRK------- 720
Query: 745 EWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPTSPFPTLQPTCPQPNTNSYSSPTS 804
P ++ +PTS + PT +N +
Sbjct: 721 ----------------------------PAEIITDPTSLLFPISPT----GSNVANEEQR 780
Query: 805 LQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISATTPISLPPTSPETSVPVSDSPTAAP 864
L T S N S + V+ ++ T ++ + ++S
Sbjct: 781 LH-----------TNNNSSSNTKSKHQVEQAENQNTIDATISQNT------FANSRIENN 840
Query: 865 IQPQPTHPMITRGKAGIFKP-KAWLTQQHTDWSLTEPTRVQDAISTPQWKQAMDCEYSAL 924
I+ H M TR K GI KP K ++ EP +A+ P+WK+AM E+ AL
Sbjct: 841 IESINQHQMTTRSKMGIIKPKKPYVGAVEKTLEEQEPETTYEALENPEWKKAMIAEFKAL 900
Query: 925 MKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDFFETFSP 984
M N+TW LVP N++ KW+F+ K ADGT++R KARLVAKGF Q G+D+ ETFSP
Sbjct: 901 MMNKTWTLVPYQGQKNIIDCKWVFKTKYKADGTIERRKARLVAKGFQQTLGLDYDETFSP 960
Query: 985 VVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDPTCPKYVCKLK 1044
V+KA T+R+++S+AV W +RQ+D NNAFLNG L E V+M+QP G+ D + P+++CKL
Sbjct: 961 VIKAITVRIILSIAVHFNWEIRQMDINNAFLNGELKETVFMRQPEGFLDKSRPQHICKLT 1020
Query: 1045 KAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLYIFKSQSTVLLLLVYVDDVVLTGNN 1104
KAIYGLKQAPR+W L++ LL WGF N+RSDSSL++ S++ + LL+YVDD+++TG++
Sbjct: 1021 KAIYGLKQAPRSWYDRLRNALLKWGFKNTRSDSSLFVLMSKAHITFLLIYVDDIIITGSS 1080
Query: 1105 LKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAKYT-------------- 1164
++ I +L+ FALKDLG L+YFLG++ SGL L Q KY
Sbjct: 1081 SSFLSSFIKQLNIMFALKDLGSLHYFLGVEACRDASGLYLKQTKYVLDLLKKFNLEHVSS 1140
Query: 1165 ---------NLVNDGKLLEDPFLYRSTIGALQYLTYTRPDISHVVNQLSQFLKSPTDIHW 1224
+L + +L+++P LYR IG LQYLT TRPDI++ VN+LSQ++++PT IHW
Sbjct: 1141 CPTPMVTGRSLSEEAELMKNPTLYRRAIGVLQYLTNTRPDIAYSVNRLSQYMQAPTTIHW 1165
Query: 1225 QMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADWASNIDDRRSVAAYCVFVGSNLVSW 1276
Q VKRV RY+ GT + L +PS I+ FSDADWA+NI+DR+SVA YCVF+G +L++W
Sbjct: 1201 QSVKRVFRYLKGTMNHCLHIKPSVDLDITGFSDADWATNIEDRKSVAGYCVFLGESLITW 1165
BLAST of Lag0007984 vs. NCBI nr
Match:
RHN69202.1 (putative RNA-directed DNA polymerase [Medicago truncatula])
HSP 1 Score: 991.1 bits (2561), Expect = 9.7e-285
Identity = 621/1501 (41.37%), Positives = 858/1501 (57.16%), Query Frame = 0
Query: 4 NPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLRQ 63
NP Y W D LL W+ ++++P + ++ + + ++ +W I Q + LR
Sbjct: 26 NPAYTEWEEQDSLLCTWILSTISPSLLSRFVLLRHSWQVWDEIHSYCFTQMKTRSRQLRS 85
Query: 64 TFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGR 123
+ KG+ +A+++ ++ +++L G P+S+R+L+ VL L EE++ +VA V +
Sbjct: 86 ELRSITKGSRTVAEFIARIRAISESLASIGDPVSHRDLIEVVLEALPEEFDPIVASVNAK 145
Query: 124 AN-VSWSELQAELLVFEKRLE----LQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNN 183
+ VS EL+++LL E R E IS +V A + + NS T+
Sbjct: 146 SEVVSLDELESQLLTQESRKEKFKKAAISEPVSVNLTETANSESQSHGPNSQNHNYTDGT 205
Query: 184 GNRQ--------GYNNGHQRGNG--YGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCY 243
GN Q G NG RG G +G R+RGRG + +N CQ+C K GH A C+
Sbjct: 206 GNNQFPNSNPNFGGRNGQFRGRGGRFGGRFRGRGGRFGGRSN-VQCQICSKTGHDASYCH 265
Query: 244 HRF----DKEFSP------------IQNRNTGNGTESGNF-----QSNRGIGQQPNAFMT 303
+RF + +SP + +N SG F Q+ GQ P AF+T
Sbjct: 266 YRFFAPQNDYYSPYGSPGGYGAPPNVWMQNMSRPQHSGQFLRPPTQAANQRGQAPQAFLT 325
Query: 304 TQQTATPETLADPSWYADSGASNHVTNNYENIANPTDYRGKECVTVGNGDKLSITSVGNS 363
+ P + +WY DSGA++HVT + N+ + T G + V +GNG L+ITSVG+
Sbjct: 326 ---GSDPYNSFNNAWYPDSGATHHVTPDASNLMDSTSLSGSDQVHIGNGQGLAITSVGSL 385
Query: 364 VLTDGYH---VLNLENVLCVPEIAKNLVSMSKLAQDNNVYIEFHGDFCLVKDKSSGQVLL 423
T H L L N+L VP I KNLVS+S+ A+DNNVY EFH + C VK + S +VLL
Sbjct: 386 QFTSPLHPQTTLKLNNLLLVPSITKNLVSVSQFAKDNNVYFEFHPNHCFVKSQDSSKVLL 445
Query: 424 KGTL-KDGLYQLQDANT--SAASVSASASSNN-------QSDN-----------FNSAFI 483
+G L DGLYQ + + + A VS ++S N Q+DN FN
Sbjct: 446 RGILGHDGLYQFEHTKSFKTTAPVSQNSSVNTVCNKVPAQTDNSASFHLSPSTGFNFNNF 505
Query: 484 VSNVVPHV--SLAVSKT--------IWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQ 543
N V H+ S S T IWH RLGHP +VL I+K C +++ + +S FC
Sbjct: 506 QCNNVEHLPSSSTSSSTQSFPSMYGIWHSRLGHPHHEVLQSIIKLCNIKLPNKSLSDFCT 565
Query: 544 SCQFGKAHALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYP 603
+C GK H LP S K +LI D+WGPAP+ S GY Y+ +D +SRY W+YP
Sbjct: 566 ACCHGKVHRLPSFASQMTYTKPLELIFCDLWGPAPVESSCGYTYFLTCVDAYSRYTWIYP 625
Query: 604 LKQKSDTVQAFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSA 663
LK KS T+ F + T+I+ Q I SVQTD GGE++P K + LGI R +CPHT
Sbjct: 626 LKLKSHTLSTFQNFKTMIELQLNHKITSVQTDGGGEFLPFTKYLNSLGITHRFTCPHTHH 685
Query: 664 QNGRAERKHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWS 723
QNG ERKHRH+VETGLTLL+ A MPL W A +TAT LIN LPT VL KSP L+
Sbjct: 686 QNGSVERKHRHIVETGLTLLSHAQMPLKFWDHAFLTATYLINRLPTPVLANKSPFFLLHL 745
Query: 724 KKMNYEMLKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRH 783
+ +Y+ LK+FGC+C+P LRPY+ HKF +H++ CVFLG S SHKGY+C++ GR+FIS+
Sbjct: 746 QFPDYKFLKSFGCACFPFLRPYNSHKFDFHSKECVFLGYSNSHKGYKCLDASGRIFISKD 805
Query: 784 VRFNESEFPFATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLV-- 843
V FNE +FP+ F S + LP+ + ST TPV +
Sbjct: 806 VVFNEVKFPYLDLFPSQKVCSV----------------LPDGPTLSTFLPTPVSTTFTVN 865
Query: 844 -HPPDLPHNPTSPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSN 903
H P H+ + P PT PQ ++S S PT+ S + PQ+P+ + S +E S
Sbjct: 866 SHTPQNSHSESGPHTVNSPT-PQ-TSHSESVPTTPISNT----PQTPS-ISSHHSESSHR 925
Query: 904 ---PVQPSQISATTPISLPPTSPETSVPV-------SDSPTAAP--IQPQPTHPMITRGK 963
+ P+ I+ +P + +SPE+S V S+SP P I PQ H M TRGK
Sbjct: 926 NNVVLNPTPITILSPSASQNSSPESSASVTSSQSTNSESPPPVPHRIHPQNCHTMRTRGK 985
Query: 964 AGIFKPKAWLTQQHTDWSLTEPTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDF 1023
GI +P+ T T EPT + A+ P+W AM EY+AL+ NQTW LV +
Sbjct: 986 HGIVQPRINPTLLLTH---VEPTTYKTALQDPKWHLAMQEEYNALLHNQTWSLVSLPANR 1045
Query: 1024 NVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAV 1083
+G KW+FR+K+N DGTV +YKARLVAKGFHQ G D+ ETFSPVVK T+R V++LAV
Sbjct: 1046 LAIGCKWVFRVKENPDGTVNKYKARLVAKGFHQQTGFDYNETFSPVVKPVTVRTVLTLAV 1105
Query: 1084 SKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNT 1143
+ W+L+QLD NNAFLNG+L E+VYM QPPG+ + + VCKL KA+YGLKQAPRAW
Sbjct: 1106 TYNWTLQQLDVNNAFLNGVLTEEVYMVQPPGF-ESSDKNLVCKLHKALYGLKQAPRAWFE 1165
Query: 1144 ALKSVLLSWGFINSRSDSSLYIFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRF 1203
LKS LLS+GF +SR D SL+ +Q+ + +LVYVDD+++TGN+ AI L+ +L+ F
Sbjct: 1166 RLKSSLLSFGFKSSRCDPSLFTLHTQAHCIFILVYVDDIIITGNSKLAIQNLVHQLNSEF 1225
Query: 1204 ALKDLGKLNYFLGIQVHYMPSG-LILNQAKY-------TNLVNDGKL------------- 1263
+LKDLG L+YFLGI+VH+ PSG L+L+Q KY N++N +
Sbjct: 1226 SLKDLGILDYFLGIEVHHSPSGSLLLSQTKYIKDLLQKANMINANSMPSPMASSTKLSKF 1285
Query: 1264 ----LEDPFLYRSTIGALQYLTYTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGT 1323
+ DP +RS +GALQY T TRP+IS+ VN++ QFL +P + HW+ VKR+LRY+ GT
Sbjct: 1286 GSSTVSDPTFFRSIVGALQYATITRPEISYSVNKVCQFLSNPLEDHWKAVKRILRYLQGT 1345
Query: 1324 KHLGLLFQPSTST---SISAFSDADWASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVAR 1365
H GL+ P++ST +I+ F DADWAS+ DDRRS + C+F+G NLVSW ++KQ++VAR
Sbjct: 1346 LHHGLMLTPASSTEPIAITGFCDADWASDPDDRRSTSGACIFLGPNLVSWWARKQTLVAR 1405
BLAST of Lag0007984 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 838.2 bits (2164), Expect = 1.4e-241
Identity = 525/1454 (36.11%), Positives = 775/1454 (53.30%), Query Frame = 0
Query: 3 MNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLR 62
+NP Y W D+L+ + +++ V V A +W ++ ++ S LR
Sbjct: 70 VNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR 129
Query: 63 QTFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQG 122
+Q KG + DY++ + T D L L G P+ + V +VL L EEY V+ +
Sbjct: 130 TQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAA 189
Query: 123 R-ANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNR 182
+ + +E+ LL E ++ L +S + NA ++ N+ NNNGNR
Sbjct: 190 KDTPPTLTEIHERLLNHESKI-LAVSSATVIPITANAVSHRNTTTTNN------NNNGNR 249
Query: 183 QG-YNNGHQRGNGYGNRYRGRGRGYNNWNNRP---TCQVCGKVGHSAVVCYHRFDKEFSP 242
Y+N + N + NN ++P CQ+CG GHSA C
Sbjct: 250 NNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRC---------- 309
Query: 243 IQNRNTGNGTESGNFQSNRGIGQQPNAFMTTQ---QTATPETLADPSWYADSGASNHVTN 302
++ +F S+ Q P+ F Q A + +W DSGA++H+T+
Sbjct: 310 ---------SQLQHFLSSVNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITS 369
Query: 303 NYENIANPTDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSM 362
++ N++ Y G + V V +G + I+ G++ L+ LNL N+L VP I KNL+S+
Sbjct: 370 DFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISV 429
Query: 363 SKLAQDNNVYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQ 422
+L N V +EF VKD ++G LL+G KD LY+ A++ S+ AS SS
Sbjct: 430 YRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSK-- 489
Query: 423 SDNFNSAFIVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQF--C 482
+ + WH RLGHP+ +L+ ++ + L V N +F C
Sbjct: 490 --------------------ATHSSWHARLGHPAPSILNSVISNYSLSV-LNPSHKFLSC 549
Query: 483 QSCQFGKAHALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLY 542
C K++ +PF S + + + I++DVW +PILS + YRYY +F+D +RY WLY
Sbjct: 550 SDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLY 609
Query: 543 PLKQKSDTVQAFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTS 602
PLKQKS + F +++ +F I + +DNGGE++ + + Q GI S PHT
Sbjct: 610 PLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTP 669
Query: 603 AQNGRAERKHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMW 662
NG +ERKHRH+VETGLTLL+ AS+P ++W A A LIN LPT +LQ +SP + ++
Sbjct: 670 EHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLF 729
Query: 663 SKKMNYEMLKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMN-EPGRVFIS 722
NY+ L+ FGC+CYP LRPY++HK + +CVFLG S + Y C++ + R++IS
Sbjct: 730 GTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYIS 789
Query: 723 RHVRFNESEFPFATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLV 782
RHVRF+E+ FPF+ ++S S + W PH LP T S P+
Sbjct: 790 RHVRFDENCFPFSNYLATLSPVQEQRRESSCV--WSPHTTLPTRTPVLPAPSCSDPHHAA 849
Query: 783 HPPDLPHNP------------------------------TSPFPTLQPTCPQPNTNSYSS 842
PP P P P PT QPT Q T ++SS
Sbjct: 850 TPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPT--QTQTQTHSS 909
Query: 843 PTSLQSQSTDVLPQSPTQL-QSLPNEPSSNPVQPSQISATTPISLPPTSPETSVPVSDSP 902
+ Q+ T+ +SP+QL QSL S+ PS ++ + S PT P S+ + P
Sbjct: 910 QNTSQNNPTN---ESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPP--SILIHPPP 969
Query: 903 TAAPI------QPQPTHPMITRGKAGIFKPKAWLTQQHTDWSLTEPTRVQDAISTPQWKQ 962
A I P TH M TR KAGI KP + + + +EP A+ +W+
Sbjct: 970 PLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKDERWRN 1029
Query: 963 AMDCEYSALMKNQTWVLVPSSPD-FNVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYP 1022
AM E +A + N TW LVP P +VG +WIF K N+DG++ RYKARLVAKG++Q P
Sbjct: 1030 AMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRP 1089
Query: 1023 GVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDP 1082
G+D+ ETFSPV+K+++IR+V+ +AV + W +RQLD NNAFL G L +DVYM QPPG+ D
Sbjct: 1090 GLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDK 1149
Query: 1083 TCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLYIFKSQSTVLLLLVY 1142
P YVCKL+KA+YGLKQAPRAW L++ LL+ GF+NS SD+SL++ + +++ +LVY
Sbjct: 1150 DRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVY 1209
Query: 1143 VDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAKY----- 1202
VDD+++TGN+ ++ + L +RF++KD +L+YFLGI+ +P+GL L+Q +Y
Sbjct: 1210 VDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLL 1269
Query: 1203 --TNLVN-----------------DGKLLEDPFLYRSTIGALQYLTYTRPDISHVVNQLS 1262
TN++ G L DP YR +G+LQYL +TRPDIS+ VN+LS
Sbjct: 1270 ARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLS 1329
Query: 1263 QFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADWASNIDDRRSVAAY 1322
QF+ PT+ H Q +KR+LRY++GT + G+ + + S+ A+SDADWA + DD S Y
Sbjct: 1330 QFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGY 1389
Query: 1323 CVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIWIQQLLTEIGCLSSLDP---- 1356
V++G + +SWSSKKQ V RSSTE+EYR++A+ S+E+ WI LLTE+G + P
Sbjct: 1390 IVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYC 1449
BLAST of Lag0007984 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 812.8 bits (2098), Expect = 6.2e-234
Identity = 517/1450 (35.66%), Positives = 769/1450 (53.03%), Query Frame = 0
Query: 3 MNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLR 62
+NP Y W D+L+ + +++ V V A +W ++ ++ S LR
Sbjct: 70 VNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR 129
Query: 63 QTFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQG 122
T D L L G P+ + V +VL L ++Y V+ +
Sbjct: 130 -------------------FITRFDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAA 189
Query: 123 R-ANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNR 182
+ S +E+ L+ E +L L ++ V TAN+ ++ N++ + NN G+
Sbjct: 190 KDTPPSLTEIHERLINRESKL-LALNSAEVVPI----TANVVTHR-NTNTNRNQNNRGDN 249
Query: 183 QGYNNGHQRGNGYGNRYRGRGRGYNNWNNRP---TCQVCGKVGHSAVVCYHRFDKEFSPI 242
+ YNN + R N + + G +N +P CQ+C GHSA C + +
Sbjct: 250 RNYNNNNNRSNSW--QPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTN 309
Query: 243 QNRNTGNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYEN 302
Q ++T T QP A + +W DSGA++H+T+++ N
Sbjct: 310 QQQSTSPFTP-----------WQPRANLAVNSPYNAN-----NWLLDSGATHHITSDFNN 369
Query: 303 IANPTDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLA 362
++ Y G + V + +G + IT G++ L L+L VL VP I KNL+S+ +L
Sbjct: 370 LSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLC 429
Query: 363 QDNNVYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNF 422
N V +EF VKD ++G LL+G KD LY+ A++ A S+ AS S
Sbjct: 430 NTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFASPCSK------ 489
Query: 423 NSAFIVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQV-KSNEMSQFCQSCQF 482
+ + WH RLGHPS +L+ ++ + L V + C C
Sbjct: 490 ----------------ATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFI 549
Query: 483 GKAHALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQK 542
K+H +PF S ++K + I++DVW +PILS++ YRYY +F+D +RY WLYPLKQK
Sbjct: 550 NKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQK 609
Query: 543 SDTVQAFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGR 602
S F ++++ +F I ++ +DNGGE++ + Q GI S PHT NG
Sbjct: 610 SQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGL 669
Query: 603 AERKHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMN 662
+ERKHRH+VE GLTLL+ AS+P ++W A A LIN LPT +LQ +SP + ++ + N
Sbjct: 670 SERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPN 729
Query: 663 YEMLKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEP-GRVFISRHVRF 722
YE LK FGC+CYP LRPY++HK +++C F+G S + Y C++ P GR++ SRHV+F
Sbjct: 730 YEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQF 789
Query: 723 NESEFPFA-TGFGSISSANTASSGSPSILEWFPHVHLPN--------------------- 782
+E FPF+ T FG +S S +P+ W H LP
Sbjct: 790 DERCFPFSTTNFGVSTSQEQRSDSAPN---WPSHTTLPTTPLVLPAPPCLGPHLDTSPRP 849
Query: 783 PTSQSTMHSTPV-----PNSLVHPPDLPHNPTSPF-----PTLQPTCPQPNTNSYSSPTS 842
P+S S + +T V P+S + P PT+P PT QP Q N+NS +SP
Sbjct: 850 PSSPSPLCTTQVSSSNLPSSSISSPS-SSEPTAPSHNGPQPTAQPHQTQ-NSNS-NSPIL 909
Query: 843 LQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISATTPISLPPTSPETSVPVSDSPTAAP 902
P SP Q LP P S+P P+ ++ + + P +S ++ P+ A P
Sbjct: 910 NNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPP 969
Query: 903 I------QPQPTHPMITRGKAGIFKPKAWLTQQHTDWSLTEPTRVQDAISTPQWKQAMDC 962
I P TH M TR K GI KP + + + +EP A+ +W+QAM
Sbjct: 970 IIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAIQAMKDDRWRQAMGS 1029
Query: 963 EYSALMKNQTWVLV-PSSPDFNVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDF 1022
E +A + N TW LV P P +VG +WIF K N+DG++ RYKARLVAKG++Q PG+D+
Sbjct: 1030 EINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDY 1089
Query: 1023 FETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDPTCPK 1082
ETFSPV+K+++IR+V+ +AV + W +RQLD NNAFL G L ++VYM QPPG+ D P
Sbjct: 1090 AETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPD 1149
Query: 1083 YVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLYIFKSQSTVLLLLVYVDDV 1142
YVC+L+KAIYGLKQAPRAW L++ LL+ GF+NS SD+SL++ + +++ +LVYVDD+
Sbjct: 1150 YVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDI 1209
Query: 1143 VLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAKY-------TN 1202
++TGN+ + + L +RF++K+ L+YFLGI+ +P GL L+Q +Y TN
Sbjct: 1210 LITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLDLLARTN 1269
Query: 1203 L-----------------VNDGKLLEDPFLYRSTIGALQYLTYTRPDISHVVNQLSQFLK 1262
+ ++ G L DP YR +G+LQYL +TRPD+S+ VN+LSQ++
Sbjct: 1270 MLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMH 1329
Query: 1263 SPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADWASNIDDRRSVAAYCVFV 1322
PTD HW +KRVLRY++GT G+ + + S+ A+SDADWA + DD S Y V++
Sbjct: 1330 MPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYL 1389
Query: 1323 GSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIWIQQLLTEIGCLSSLDP-----SSG 1356
G + +SWSSKKQ V RSSTE+EYR++A+ S+E+ WI LLTE+G S P + G
Sbjct: 1390 GHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDNVG 1447
BLAST of Lag0007984 vs. ExPASy Swiss-Prot
Match:
P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)
HSP 1 Score: 431.0 bits (1107), Expect = 5.1e-119
Identity = 380/1427 (26.63%), Positives = 609/1427 (42.68%), Query Frame = 0
Query: 6 KYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFL-RQT 65
K + W +D+ + ++ +V ++ + A+ +W+ ++ L+ ++ + +L +Q
Sbjct: 48 KAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQL 107
Query: 66 FQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEY-NAVVAMVQGR 125
+ +L L G I + +L L Y N ++ G+
Sbjct: 108 YALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGK 167
Query: 126 ANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQG 185
+ ++ + LL+ EK + K + Q G +
Sbjct: 168 TTIELKDVTSALLLNEK-----------------------MRKKPENQGQALITEGRGRS 227
Query: 186 YNNGHQRGNGYGNRYRGRGRGYNNWNNR-PTCQVCGKVGHSAVVCYHRFDKEFSPIQNRN 245
Y + N YG R RG+ N +R C C + GH C N
Sbjct: 228 Y---QRSSNNYG-RSGARGKSKNRSKSRVRNCYNCNQPGHFKRDC-----------PNPR 287
Query: 246 TGNGTESGNFQSNRGIGQQPN-----AFMTTQQTATPETLADPSWYADSGASNHVTNNYE 305
G G SG + N F+ ++ + + W D+ AS+H T +
Sbjct: 288 KGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRD 347
Query: 306 NIANPTDYRGKECVTV--GNGDKLSITSVGN-SVLTDGYHVLNLENVLCVPEIAKNLVSM 365
Y + TV GN I +G+ + T+ L L++V VP++ NL+
Sbjct: 348 LFCR---YVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLI-- 407
Query: 366 SKLAQDNNVYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQ 425
S +A D + Y + + K S V+ KG + LY+ N +A+ +
Sbjct: 408 SGIALDRDGYESYFANQKWRLTKGS-LVIAKGVARGTLYR---TNAEICQGELNAAQDE- 467
Query: 426 SDNFNSAFIVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQS 485
+S +WH+R+GH S K L + K + + C
Sbjct: 468 --------------------ISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDY 527
Query: 486 CQFGKAHALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPL 545
C FGK H + F S+ R DL+++DV GP I S+ G +Y+ F+DD SR LW+Y L
Sbjct: 528 CLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIL 587
Query: 546 KQKSDTVQAFNHLLTVIKTQFGCGIKSVQTDNGGEYI--PIHKVCHQLGIKTRLSCPHTS 605
K K Q F +++ + G +K +++DNGGEY + C GI+ + P T
Sbjct: 588 KTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTP 647
Query: 606 AQNGRAERKHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMW 665
NG AER +R +VE ++L A +P S W +A+ TA LIN P+ L + P +
Sbjct: 648 QHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWT 707
Query: 666 SKKMNYEMLKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNE-PGRVFIS 725
+K+++Y LK FGC + + + K + C+F+G GYR + +V S
Sbjct: 708 NKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRS 767
Query: 726 RHVRFNESEFPFATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLV 785
R V F ESE TA+ S + +PN +
Sbjct: 768 RDVVFRESE------------VRTAADMSEKVKNGI------------------IPNFVT 827
Query: 786 HPPDLPHNPTSPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSNP 845
P ++ ++PTS +S + +V Q +P
Sbjct: 828 IP-----------------------STSNNPTSAESTTDEVSEQG--------EQPGEVI 887
Query: 846 VQPSQISATTPISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPKAWLTQQ 905
Q Q+ + V + PT Q QP + R + + + + + +
Sbjct: 888 EQGEQL-------------DEGVEEVEHPTQGEEQHQP----LRRSERPRVESRRYPSTE 947
Query: 906 HTDWS-LTEPTRVQDAISTPQWKQ---AMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIF 965
+ S EP +++ +S P+ Q AM E +L KN T+ LV + KW+F
Sbjct: 948 YVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVF 1007
Query: 966 RIKKNADGTVQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQL 1025
++KK+ D + RYKARLV KGF Q G+DF E FSPVVK ++IR ++SLA S + QL
Sbjct: 1008 KLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQL 1067
Query: 1026 DFNNAFLNGILVEDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSW 1085
D AFL+G L E++YM+QP G+ VCKL K++YGLKQAPR W S + S
Sbjct: 1068 DVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQ 1127
Query: 1086 GFINSRSDSSLYIFK-SQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKL 1145
++ + SD +Y + S++ ++LL+YVDD+++ G + I +L G+L K F +KDLG
Sbjct: 1128 TYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPA 1187
Query: 1146 NYFLGIQV--HYMPSGLILNQAKY--------------------------------TNLV 1205
LG+++ L L+Q KY T +
Sbjct: 1188 QQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVE 1247
Query: 1206 NDGKLLEDPFLYRSTIGALQY-LTYTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYIS 1265
G + + P Y S +G+L Y + TRPDI+H V +S+FL++P HW+ VK +LRY+
Sbjct: 1248 EKGNMAKVP--YSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLR 1307
Query: 1266 GTKHLGLLFQPSTSTSISAFSDADWASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVARS 1325
GT L F S + ++DAD A +ID+R+S Y +SW SK Q VA S
Sbjct: 1308 GTTGDCLCFGGS-DPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALS 1325
Query: 1326 STESEYRALAHASAEIIWIQQLLTEIG-----------CLSSLDPSSGVTISVLRGSLDV 1352
+TE+EY A E+IW+++ L E+G S++D S +DV
Sbjct: 1368 TTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDV 1325
BLAST of Lag0007984 vs. ExPASy Swiss-Prot
Match:
P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)
HSP 1 Score: 365.2 bits (936), Expect = 3.4e-99
Identity = 366/1456 (25.14%), Positives = 620/1456 (42.58%), Query Frame = 0
Query: 1 MVMNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDF 60
++ N D+W ++ + ++ A+ + + ++ +S A +
Sbjct: 39 LMPNEVDDSWKKAERCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLA 98
Query: 61 LRQTFQQTR-KGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAM 120
LR+ + + + + L G+ I + +S +L+ L Y+ ++
Sbjct: 99 LRKRLLSLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITA 158
Query: 121 VQGRANVSWSELQAELLVFEKR-LELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNN 180
++ + SE L + R L+ +I KN +HN T+ +N + +NN
Sbjct: 159 IE-----TLSEENLTLAFVKNRLLDQEIKIKN----DHNDTSKKVMNAI-------VHNN 218
Query: 181 GNRQGYNNGHQRGNGYGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPI 240
N N R ++G N + C CG+ GH C+H + I
Sbjct: 219 NNTYKNNLFKNRVTKPKKIFKG------NSKYKVKCHHCGREGHIKKDCFH-----YKRI 278
Query: 241 QNRNTGNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYEN 300
N + ++ GI AFM + T + + + DSGAS+H+ N+
Sbjct: 279 LNNKNKENEKQVQTATSHGI-----AFMVKEVNNT-SVMDNCGFVLDSGASDHLINDESL 338
Query: 301 IANPTDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLA 360
+ + + V + + V H + LE+VL E A NL+S+ +L
Sbjct: 339 YTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRL- 398
Query: 361 QDNNVYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDA----NTSAASVSASASSNNQ 420
Q+ + IEF DKS + K+GL ++++ N + A + +
Sbjct: 399 QEAGMSIEF--------DKSGVTI-----SKNGLMVVKNSGMLNNVPVINFQAYSINAKH 458
Query: 421 SDNFNSAFIVSNVVPHVSLAVSKTIWHRRLGHPS-AKVLDF----IVKDCKLQVKSNEMS 480
+NF +WH R GH S K+L+ + D L
Sbjct: 459 KNNFR-------------------LWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSC 518
Query: 481 QFCQSCQFGKAHALPFPLSNSRAAKKFDL--IHTDVWGPAPILSVEGYRYYALFLDDHSR 540
+ C+ C GK LPF + K L +H+DV GP ++++ Y+ +F+D +
Sbjct: 519 EICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTH 578
Query: 541 YLWLYPLKQKSDTVQAFNHLLTVIKTQFGCGIKSVQTDNGGEYI--PIHKVCHQLGIKTR 600
Y Y +K KSD F + + F + + DNG EY+ + + C + GI
Sbjct: 579 YCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYH 638
Query: 601 LSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVL--Q 660
L+ PHT NG +ER R + E T+++ A + S W +A++TAT LIN +P+ L
Sbjct: 639 LTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDS 698
Query: 661 GKSPMELMWSKKMNYEMLKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMN 720
K+P E+ +KK + L+ FG + Y ++ + KF + + +F+G + G++ +
Sbjct: 699 SKTPYEMWHNKKPYLKHLRVFGATVYVHIK-NKQGKFDDKSFKSIFVGYEPN--GFKLWD 758
Query: 721 EPGRVFI--------------SRHVRFNESEFPFATGFGSISSANTASSGSPSILEWFP- 780
FI SR V+F E F + N + I FP
Sbjct: 759 AVNEKFIVARDVVVDETNMVNSRAVKF---ETVFLKDSKESENKNFPNDSRKIIQTEFPN 818
Query: 781 ------HVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPTSPFPTLQPTCPQPNTNSYSSP 840
++ + +S + P + + + P N + +Q +N Y
Sbjct: 819 ESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFP-NESKECDNIQFLKDSKESNKYFLN 878
Query: 841 TSLQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISATTPISLPPTSPETSVPVSDSPTA 900
S + + D L +S + S +I P + + + + +
Sbjct: 879 ESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNP------TKNDGIEIINRRS- 938
Query: 901 APIQPQPTHPMITRGKAGIFKPKAWLTQQHTDWSLTEPT--RVQDAISTPQWKQAMDCEY 960
+ T P I+ + K L HT ++ + +Q W++A++ E
Sbjct: 939 ---ERLKTKPQISYNEEDNSLNKVVL-NAHTIFNDVPNSFDEIQYRDDKSSWEEAINTEL 998
Query: 961 SALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDFFET 1020
+A N TW + + N+V ++W+F +K N G RYKARLVA+GF Q +D+ ET
Sbjct: 999 NAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEET 1058
Query: 1021 FSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDPTCPKYVC 1080
F+PV + S+ R ++SL + + Q+D AFLNG L E++YM+ P G + + VC
Sbjct: 1059 FAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNS--DNVC 1118
Query: 1081 KLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLYIFK--SQSTVLLLLVYVDDVV 1140
KL KAIYGLKQA R W + L F+NS D +YI + + + +L+YVDDVV
Sbjct: 1119 KLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVV 1178
Query: 1141 LTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAKYT--------- 1200
+ ++ +N L ++F + DL ++ +F+GI++ + L+Q+ Y
Sbjct: 1179 IATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNM 1238
Query: 1201 ----------------NLVNDGKLLEDPFLYRSTIGALQYLTY-TRPDISHVVNQLSQFL 1260
L+N + P RS IG L Y+ TRPD++ VN LS++
Sbjct: 1239 ENCNAVSTPLPSKINYELLNSDEDCNTP--CRSLIGCLMYIMLCTRPDLTTAVNILSRYS 1298
Query: 1261 KSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTS--TSISAFSDADWASNIDDRRSVAAYC 1320
WQ +KRVLRY+ GT + L+F+ + + I + D+DWA + DR+S Y
Sbjct: 1299 SKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYL 1358
Query: 1321 V-FVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIWIQQLLTEI------------ 1358
NL+ W++K+Q+ VA SSTE+EY AL A E +W++ LLT I
Sbjct: 1359 FKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYE 1406
BLAST of Lag0007984 vs. ExPASy Swiss-Prot
Match:
P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)
HSP 1 Score: 190.7 bits (483), Expect = 1.2e-46
Identity = 99/225 (44.00%), Positives = 142/225 (63.11%), Query Frame = 0
Query: 1088 LLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAK 1147
+ LL+YVDD++LTG++ +N LI +L F++KDLG ++YFLGIQ+ PSGL L+Q K
Sbjct: 1 MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60
Query: 1148 YT-NLVNDGKLLE----------------------DPFLYRSTIGALQYLTYTRPDISHV 1207
Y ++N+ +L+ DP +RS +GALQYLT TRPDIS+
Sbjct: 61 YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 120
Query: 1208 VNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADWASNIDDRR 1267
VN + Q + PT + ++KRVLRY+ GT GL ++ ++ AF D+DWA RR
Sbjct: 121 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRR 180
Query: 1268 SVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIW 1290
S +C F+G N++SWS+K+Q V+RSSTE+EYRALA +AE+ W
Sbjct: 181 STTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of Lag0007984 vs. ExPASy TrEMBL
Match:
A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)
HSP 1 Score: 1119.4 bits (2894), Expect = 0.0e+00
Identity = 628/1407 (44.63%), Positives = 845/1407 (60.06%), Query Frame = 0
Query: 4 NPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLRQ 63
N + W A DQ LLGW+ NSMT E+ATQ++ E +K LW Q L G +R++ +L+
Sbjct: 67 NSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKS 126
Query: 64 TFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGR 123
F RKG +KM DYL MK D L L G+P+S +L+ Q L GLD EYN VV + +
Sbjct: 127 EFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLNGLDSEYNPVVVKLSDQ 186
Query: 124 ANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQG 183
+SW +LQA+LL FE R+E N NATAN+A N S + ++N N +G
Sbjct: 187 TTLSWVDLQAQLLTFESRIE---QLNNLTNLTLNATANVA----NRSDHRGKSSNNNWRG 246
Query: 184 YNNGHQRGNGYGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQNRNT 243
N+ RG GRGRG + N CQVCG H A+ C+HRFDK +S N +
Sbjct: 247 SNSRGWRG--------GRGRGKSGKN---PCQVCGLSNHIAIDCFHRFDKTYSR-SNHSA 306
Query: 244 GNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYENIANPT 303
G+ + + NAF+ +Q ++ D WY DSGASNHVT+ E + T
Sbjct: 307 GHDKQGSH-----------NAFLASQ-----NSVEDYDWYFDSGASNHVTHQTEKFQDLT 366
Query: 304 DYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQDNNV 363
++ GK + VGNG+KL+I + G+S L LNL ++L VP I KNL+S+SKLA DNN+
Sbjct: 367 EHHGKNSLVVGNGEKLAILATGSSKLKS----LNLHDILYVPNITKNLLSVSKLAADNNI 426
Query: 364 YIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNSAFI 423
+EF + C VKDK +G+V+LKG LKDGLYQL S + N SAF+
Sbjct: 427 LVEFDENCCFVKDKLTGKVILKGLLKDGLYQL------------SGTKRNP-----SAFV 486
Query: 424 VSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKAHAL 483
K WHRRLGHP+ KVLD +++ CK++V ++ FC++CQ+GK H L
Sbjct: 487 -----------SVKESWHRRLGHPNNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLL 546
Query: 484 PFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDTVQA 543
PF S+S A + +L+HTDVWGPAPI++ G++YY F+DD SR+ W+YPLKQKS+TVQA
Sbjct: 547 PFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQA 606
Query: 544 FNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAERKHR 603
F + + QF IK +Q D GGEY P+ K+ + GI+ R+SCP+TS QNGRAERKHR
Sbjct: 607 FIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHR 666
Query: 604 HVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEMLKT 663
H+ E GLTLLAQA MPL +WW+A TA LIN LP+ V Q +SP LM K+ +Y++LKT
Sbjct: 667 HITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLKT 726
Query: 664 FGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESEFPF 723
FGC+CYPCL+PY++HK YHT RCVFLG S SHKGY+C+N GR+FISRHV FNE FPF
Sbjct: 727 FGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPF 786
Query: 724 ATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPTSP 783
GF + S + PS FP N ++M
Sbjct: 787 HDGFLNTRSPLKTTINVPSTS--FPLCTAGNVIDDASM---------------------- 846
Query: 784 FPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPS-SNPVQPSQISATTP 843
P L+ P S + ++ T+ N PS N + T
Sbjct: 847 -PILEAENPAETNTEDSQDVNSDTEQTN-------------NGPSEDNTTHEETLDITQQ 906
Query: 844 ISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPK---AWLTQQHTDWSLTE 903
S+ S T+ +H + TR K+GI KPK LT+ + D E
Sbjct: 907 QSVGEASQNTNT---------------SHAIHTRSKSGIHKPKLPYIGLTETYKD--TME 966
Query: 904 PTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQR 963
P ++A+S P WK+AM E+ ALM N+TW+LVP N+V +KW+F+ K DG+++R
Sbjct: 967 PANAKEALSRPLWKEAMQKEFEALMSNKTWILVPYQNQENIVDSKWVFKTKYKPDGSLER 1026
Query: 964 YKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILV 1023
KARLVAKGF Q G+D+ ETFSPV+KAST+R+++S+AV W +RQLD NNAFLNG L
Sbjct: 1027 RKARLVAKGFQQTAGIDYEETFSPVIKASTVRIILSIAVHLNWEVRQLDINNAFLNGHLK 1086
Query: 1024 EDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLY 1083
E V+M QP G+ D T P ++CKL KAIYGLKQAPRAW +LK+ LL+WGF N++SDSSL+
Sbjct: 1087 ETVFMHQPEGFVDSTKPNHICKLSKAIYGLKQAPRAWFDSLKTALLNWGFQNTKSDSSLF 1146
Query: 1084 IFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPS 1143
+ K + + LL+YVDD+++TG+N K + I +L+ F+LKDLG L+YFLGI+V S
Sbjct: 1147 LLKGKDHITFLLIYVDDIIVTGSNGKFLQAFIKQLNDAFSLKDLGHLHYFLGIEVQRDAS 1206
Query: 1144 GLILNQAKY-----------------TNLVN------DGKLLEDPFLYRSTIGALQYLTY 1203
G+ L Q+KY T ++ +G+ L+DP ++R IG LQYLT+
Sbjct: 1207 GMYLKQSKYIGDLLKKFKMDNASPCPTPMITGRQFTVEGEKLKDPTVFRQAIGGLQYLTH 1266
Query: 1204 TRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADW 1263
T PDI+ VN+LSQ++ SP+ HWQ +KR+LRY+ GT + L +PST I+ FSDADW
Sbjct: 1267 TTPDIAFSVNKLSQYMSSPSIDHWQGIKRILRYLQGTINYCLHIKPSTDLDITGFSDADW 1326
Query: 1264 ASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIWIQQLLTE 1323
A++IDDR+S++ CVF+G L+SWSS+KQ VV+RSSTESEYRALA +AEI WI+ LLTE
Sbjct: 1327 ATSIDDRKSMSGQCVFLGETLISWSSRKQKVVSRSSTESEYRALADLAAEIAWIRSLLTE 1351
Query: 1324 IGC------------LSSLDPSSGVTI----------------SVLRGSLDVRYVPSYDQ 1356
+ LS+ +S + VL+ + V YVP+ DQ
Sbjct: 1387 LELPLPRKPILWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNEVVVAYVPTTDQ 1351
BLAST of Lag0007984 vs. ExPASy TrEMBL
Match:
A0A2Z6P4D5 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_412550 PE=4 SV=1)
HSP 1 Score: 1066.2 bits (2756), Expect = 1.1e-307
Identity = 610/1426 (42.78%), Positives = 826/1426 (57.92%), Query Frame = 0
Query: 3 MNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLR 62
+NP + W+A DQ LLGWL NSM ++ATQ++ E +K LW Q L G +++ +L+
Sbjct: 66 VNPDFGDWIANDQALLGWLMNSMAIDIATQLLHCETSKQLWDETQSLAGAHTKSRITYLK 125
Query: 63 QTFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQG 122
F TRKG +KM +YL MK +D L L GSPISN +L+ Q L GLD EYN VV +
Sbjct: 126 SEFHNTRKGEMKMEEYLIKMKNLSDKLKLAGSPISNSDLMIQTLNGLDAEYNPVVVKLSD 185
Query: 123 RANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQ 182
+ N+SW ++QA+LL FE RL+ N NA+AN A NK T GN+
Sbjct: 186 QINLSWVDVQAQLLAFESRLD---QFNNFSGLTLNASANFA-NK--------TEFRGNK- 245
Query: 183 GYNNGHQRGNGYGNRYRGR--GRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQN 242
+N+ RGN + +RG GRG +N CQVC GH AV C +RFD+ P
Sbjct: 246 -FNS---RGNWRRSNFRGMRGGRGKGRMSN-TKCQVCNGTGHIAVDCSYRFDR---PYTG 305
Query: 243 RNTGNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYENIA 302
RN TE+ S+ +AF+ A+P D WY DSGA+NHVT+ +
Sbjct: 306 RN--YSTEADKQGSH-------SAFI-----ASPYHGQDYEWYFDSGANNHVTHQTDKFQ 365
Query: 303 NPTDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQD 362
++ GK + VGNG+KL I + G++ L + LNL +VL VP+I KNL+S+SKL D
Sbjct: 366 GFNEHNGKNSLMVGNGEKLKIVASGSTKLNN----LNLHDVLYVPQITKNLLSVSKLTAD 425
Query: 363 NNVYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNS 422
NN+ +EF + C VKDK +GQ LLKG LKDGLYQL
Sbjct: 426 NNILVEFDANCCSVKDKLTGQTLLKGRLKDGLYQL------------------------- 485
Query: 423 AFIVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKA 482
SN P V ++V K WHR+LGHP+ KVLD ++KDC +++ ++ FC++CQFGK
Sbjct: 486 ----SNKEPCVYMSV-KESWHRKLGHPNNKVLDKVLKDCNVKISHSDQFSFCEACQFGKL 545
Query: 483 HALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDT 542
H LPF S+S + LIH+DVWGPAPILS G++YY F+DD SR+ W++PLKQKSDT
Sbjct: 546 HLLPFKPSSSHVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDT 605
Query: 543 VQAFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAER 602
+ AF + + QF IK +Q D GGEY + KV + GI+ R+SCP+TS QNGRAER
Sbjct: 606 IHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAER 665
Query: 603 KHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEM 662
KHRHV E GLTLLAQA MPL +WW+A TA LIN LP++V +SP LM+ ++ +Y
Sbjct: 666 KHRHVAELGLTLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKREPDYNA 725
Query: 663 LKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESE 722
LK FGC+CYPCL+PY++HK +HT RCVF+G S SHKGY+C+N GR+F+SRHV FNE+
Sbjct: 726 LKPFGCACYPCLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVIFNENH 785
Query: 723 FPFATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNP 782
FPF GF + + + SIL LP ++ +T P++
Sbjct: 786 FPFHGGFLDTKNPLKTLTDNSSIL-------LPTCSAGATTQDAIEPDN----------- 845
Query: 783 TSPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISAT 842
NT S + S++S + ++ Q+ S ++N I A
Sbjct: 846 --------------NTTSDQNTHSIESSDNN---ENEEQVDSSEFFVNTNNSSTQDIEAD 905
Query: 843 TPISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPK-AWLTQQHTDWSLTE 902
S+ S A TH M TR K GI KPK ++ TD E
Sbjct: 906 N--SVDSEDRNNSTMTGTIQQQAQQDNSNTHWMRTRSKDGIHKPKIPYVGMAETDSEEKE 965
Query: 903 PTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQR 962
P V++A+ P WK+AMD EY AL+ N TW LVP N++ +KWIF+ K +DG+++R
Sbjct: 966 PKSVKEALGRPMWKEAMDKEYKALVSNHTWTLVPYQEQENIIDSKWIFKTKYKSDGSIER 1025
Query: 963 YKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILV 1022
KARLVAKGF Q G+DF ETFSPVVK+ST+R+++++AV W +RQLD NNAFLNG L
Sbjct: 1026 RKARLVAKGFQQTAGLDFGETFSPVVKSSTVRIILTIAVHFNWEVRQLDINNAFLNGKLK 1085
Query: 1023 EDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLY 1082
E V+M QP GY D P ++CKL KAIYGLKQAPRAW +L+S L++WGF N+++D+SL+
Sbjct: 1086 ETVFMHQPEGYIDAAKPNHICKLSKAIYGLKQAPRAWYDSLRSTLVNWGFQNAKNDTSLF 1145
Query: 1083 IFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPS 1142
K LL+YVDD+++TG+N+K + +L+ ++LKDLG L+YFLG++VH S
Sbjct: 1146 FLKGADHTTFLLIYVDDIIVTGSNIKFLEAFTNQLNTAYSLKDLGPLHYFLGVEVHRDDS 1205
Query: 1143 GLILNQAKY-----------------------TNLVNDGKLLEDPFLYRSTIGALQYLTY 1202
G+ L Q KY + +G+L+ +P LYR IGALQYLT
Sbjct: 1206 GMYLRQTKYIRDVLKKFNMENTSACPTPMVTGRQFIAEGELMSNPTLYRQAIGALQYLTN 1265
Query: 1203 TRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADW 1262
TRPDI+ VN+LSQ++ +PT HWQ +KR+LRY+ GTK+ L +PST+ I+ F DADW
Sbjct: 1266 TRPDIAFAVNKLSQYMSTPTIEHWQGIKRILRYLQGTKNHSLHIKPSTNLHIAGFLDADW 1325
Query: 1263 ASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAE---------- 1322
A++ DDR+S CVF+G LVSW+S+KQ VV+RSSTESEYR+LA AE
Sbjct: 1326 ATSTDDRKSTGGQCVFLGETLVSWASRKQKVVSRSSTESEYRSLADLVAEVSTSSVATLL 1385
Query: 1323 ----------------------------IIWIQQLLTEIGCLSSLDPSSGVTI------- 1356
++W L + + + + I
Sbjct: 1386 SSERFLLAHFSTRFTLLEELKLPILRKPVLWCDNLSAKALASNPVMHARSKHIEIDMHYI 1385
BLAST of Lag0007984 vs. ExPASy TrEMBL
Match:
A0A2K3MUJ9 (Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g017679 PE=4 SV=1)
HSP 1 Score: 1037.7 bits (2682), Expect = 4.4e-299
Identity = 583/1276 (45.69%), Positives = 780/1276 (61.13%), Query Frame = 0
Query: 3 MNPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLR 62
+NP Y W A DQ LLGWL NSMT ++ATQV+ E +K LW Q L G +R+ +L+
Sbjct: 65 INPDYQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLWDEAQSLAGAHTRSRIIYLK 124
Query: 63 QTFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQG 122
F T K +KM YL MK AD L L GSPIS+ +L+ Q L GLD EYN VV +
Sbjct: 125 SEFHNTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMIQTLNGLDSEYNPVVVKLSD 184
Query: 123 RANVSWSELQAELLVFEKRLELQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQ 182
+ N+SW + QA+LL FE RL+ Q+++ N + N NA+AN A + GN+
Sbjct: 185 QTNISWVDFQAQLLAFESRLD-QLNNFNNI--NLNASANFA---------SKNESGGNKF 244
Query: 183 GYNNGHQRGNGYGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQNRN 242
G G + N G R GRGR + RP CQ+CGK GH+A CY+RFDK ++ +
Sbjct: 245 GSRGGWRGSNSRGMR-GGRGRARMSKPPRPICQICGKFGHTAAQCYYRFDKSYTEKNHYA 304
Query: 243 TGNGTESGNFQSNRGIGQQPNAFMTTQQTATPETLADPSWYADSGASNHVTNNYENIANP 302
G G+ S AF+ A+P D WY DSGASNHVT+ + +
Sbjct: 305 EGEGSHS--------------AFV-----ASPYHGQDYEWYFDSGASNHVTHQSGQLQDL 364
Query: 303 TDYRGKECVTVGNGDKLSITSVGNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQDNN 362
+ GK + VGNG+KL I + G++ L D +NL NVL VPEI KNL+S+SKL DNN
Sbjct: 365 NENNGKNSLLVGNGEKLKILASGSTKLND----VNLRNVLYVPEITKNLLSVSKLTIDNN 424
Query: 363 VYIEFHGDFCLVKDKSSGQVLLKGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNSAF 422
+EF ++C VKDK +G+ LLKG LKDGLYQL SA+ ++ A+
Sbjct: 425 ALVEFDENYCYVKDKLTGKALLKGRLKDGLYQL------------SANKEPPTNKDPCAY 484
Query: 423 IVSNVVPHVSLAVSKTIWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKAHA 482
I SL K IWHR+LGHP+ KVL+ ++KD +++ ++ FC++CQFGK H
Sbjct: 485 I--------SL---KEIWHRKLGHPNNKVLEKVLKDNNVKISPSDKFTFCEACQFGKLHL 544
Query: 483 LPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDTVQ 542
LPF S+S A + DLIHTDVWGPAPILS ++YY FLDD SR+ W++PLKQKS+T+
Sbjct: 545 LPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNFKYYVHFLDDFSRFTWIFPLKQKSETIH 604
Query: 543 AFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAERKH 602
AFN +++ QF IK ++ D GGEY P+ K GI+ ++SCP+TS QNGRAERKH
Sbjct: 605 AFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQKCAIDSGIQFQMSCPYTSQQNGRAERKH 664
Query: 603 RHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEMLK 662
RHV E GLTLLAQA MPLS+WW+A TA LIN LP++V +SP L++ K+ +Y LK
Sbjct: 665 RHVTELGLTLLAQAKMPLSYWWEAFSTAVYLINRLPSSVNPNESPYTLVFKKEPDYTALK 724
Query: 663 TFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESEFP 722
FGC+CYPCL+PY++HK +HT RCVFLG S SHKGY+C+N GRVF+SRHV FNE+ FP
Sbjct: 725 PFGCACYPCLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCVNSHGRVFVSRHVVFNENHFP 784
Query: 723 FATGF-GSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPT 782
F GF + + ++ +P FP N T+++T +++V +
Sbjct: 785 FQEGFLDTRNPIKVVTNDTPIGFPSFPAGITTNNTAEAT-------DNIVDQQE------ 844
Query: 783 SPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISATT 842
P+ N + + S++S + + T + N + + + + +
Sbjct: 845 ----------PELNDINTVADQSVESDTFE-----HTDENNFSNGETEDSTEAAGRESME 904
Query: 843 PISLPPTSPETSVPVSDSPTAAPIQPQPTHPMITRGKAGIFKPK---AWLTQQHTDWSLT 902
IS P T ET+ P T TH M TR KAG++KPK LT++ +
Sbjct: 905 EISQPIT--ETNPPPQQDIT-------NTHWMRTRSKAGVYKPKLPYIGLTEEAKEGK-- 964
Query: 903 EPTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQ 962
EP V +A+S P+W AMD EY ALM N+TW LVP NV+ +KWIF+ K ADGT++
Sbjct: 965 EPESVSEALSIPEWLNAMDAEYKALMNNKTWTLVPFEGQENVISSKWIFKTKYKADGTIE 1024
Query: 963 RYKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGIL 1022
R KARLVA+GF Q GVD+ ETFSPVVK+ST+R+++S+AV W +RQLD NNAFLNG L
Sbjct: 1025 RRKARLVARGFQQTAGVDYDETFSPVVKSSTVRIILSIAVHLSWEVRQLDINNAFLNGNL 1084
Query: 1023 VEDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSL 1082
E V+M QP GY D T P ++C+L KAIYGLKQAPRAW L+ LLSWGF N++SDSSL
Sbjct: 1085 KESVFMHQPEGYIDQTKPHHICRLNKAIYGLKQAPRAWFDRLRHTLLSWGFQNTKSDSSL 1144
Query: 1083 YIFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMP 1142
++ K LL+YVDD+++TG+N K + I +L+ F+LKDLG L+YFLGI+VH
Sbjct: 1145 FVLKETDHTTFLLIYVDDIIITGSNNKFLEAFISQLNLVFSLKDLGNLHYFLGIEVHRDS 1204
Query: 1143 SGLILNQAKYT-------NLVN----------------DGKLLEDPFLYRSTIGALQYLT 1202
SG+ L Q KY N+ N +G+ + +P LYR IGALQYLT
Sbjct: 1205 SGMYLTQTKYIRDLLKKFNMENASSCPTPMITGRQFTIEGEPMSNPTLYRQAIGALQYLT 1242
Query: 1203 YTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDAD 1252
TRPDI+ VN+LSQ++ SPT HWQ +KR+LRY+ G+ +LGL +PST I+ FSDAD
Sbjct: 1265 NTRPDIAFAVNKLSQYMCSPTTDHWQGIKRILRYLHGSTNLGLHIKPSTDLDIAGFSDAD 1242
BLAST of Lag0007984 vs. ExPASy TrEMBL
Match:
A0A151S6M8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027809 PE=4 SV=1)
HSP 1 Score: 1023.1 bits (2644), Expect = 1.1e-294
Identity = 562/1275 (44.08%), Positives = 774/1275 (60.71%), Query Frame = 0
Query: 25 MTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLRQTFQQTRKGNLKMADYLRTMKT 84
MT EVATQ++ E ++ +W Q L G +R+ FL+ F +TRKG LKM +YL MK
Sbjct: 1 MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60
Query: 85 HADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGRANVSWSELQAELLVFEKRLEL 144
AD+L L GS +S +LV+Q L GLD EYN +V + + +++W E+QA+LL +E RLE
Sbjct: 61 IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLE- 120
Query: 145 QISHKNTVAFNHNATANMAVNKVNSSPKQTTNNNGNRQGYNNGHQRGNGYGNRYRGRGRG 204
QI++++ + N ++N++ N K G N G + G RGRGR
Sbjct: 121 QINNQSNLTL--NPSSNISTILYNRRGKSNAFGGGRGGQINRGARGG-------RGRGRA 180
Query: 205 YNNWNNRPTCQVCGKVGHSAVVCYHRFDKEFSPIQNRNTGNGTESGNFQSNRGIGQQPNA 264
+R CQVC K GH+A CYHRF+K + G ++ + ++ NA
Sbjct: 181 ---TKDRIVCQVCCKPGHAASHCYHRFNKNY-------IGQNSDEQKSEKDKEQNYNFNA 240
Query: 265 FMTTQQTATPETLADPSWYADSGASNHVTNNYENIANPTDYRGKECVTVGNGDKLSITSV 324
++ A+P T+ D WY DSGASNHVT + + + GK +TVGNG L I +
Sbjct: 241 YV-----ASPSTVEDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIAC 300
Query: 325 GNSVLTDGYHVLNLENVLCVPEIAKNLVSMSKLAQDNNVYIEFHGDFCLVKDKSSGQVLL 384
G+S L LNL+++L VP+I KNL+S+SKL DN++Y+EFH C VKDK +G++LL
Sbjct: 301 GDSSLDTQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILL 360
Query: 385 KGTLKDGLYQLQDANTSAASVSASASSNNQSDNFNSAFIVSNVVPHVSLAVSKTIWHRRL 444
+G +KDGLYQL +TS +N PHV ++ +T WHR+L
Sbjct: 361 EGKIKDGLYQLPGGSTS-----------------------TNKRPHVFFSIKET-WHRKL 420
Query: 445 GHPSAKVLDFIVKDCKLQVKSNEMSQFCQSCQFGKAHALPFPLSNSRAAKKFDLIHTDVW 504
GHP++KVL+ ++K C ++ E +FC++CQFGKAH LPF S S A + DL+H+DVW
Sbjct: 421 GHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSCAKEPLDLVHSDVW 480
Query: 505 GPAPILSVEGYRYYALFLDDHSRYLWLYPLKQKSDTVQAFNHLLTVIKTQFGCGIKSVQT 564
GPAPI SV G++YY LFLDD SR+ W+YPLKQKSD QAF +++ QF IK++Q
Sbjct: 481 GPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLVENQFNKRIKTLQC 540
Query: 565 DNGGEYIPIHKVCHQLGIKTRLSCPHTSAQNGRAERKHRHVVETGLTLLAQASMPLSHWW 624
D GGE+ + KV + GI+ R SCP+TSAQNGRAERKHRHVVE+GLTLLAQA MPL +WW
Sbjct: 541 DGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAKMPLHYWW 600
Query: 625 DALVTATQLINGLPTTVLQGKSPMELMWSKKMNYEMLKTFGCSCYPCLRPYHKHKFHYHT 684
+A TA LIN LPT V++ KSP + ++ K +Y +KTFGC+CYPCL+PY++HK +HT
Sbjct: 601 EAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPCLKPYNQHKLQFHT 660
Query: 685 ERCVFLGISASHKGYRCMNEPGRVFISRHVRFNESEFPFATGFGSISSANTASSGSPSIL 744
+CVFLG S SHKGY+C+N GR+FISRHV FNE FPF GF NT
Sbjct: 661 TKCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDGF-----LNTRK------- 720
Query: 745 EWFPHVHLPNPTSQSTMHSTPVPNSLVHPPDLPHNPTSPFPTLQPTCPQPNTNSYSSPTS 804
P ++ +PTS + PT +N +
Sbjct: 721 ----------------------------PAEIITDPTSLLFPISPT----GSNVANEEQR 780
Query: 805 LQSQSTDVLPQSPTQLQSLPNEPSSNPVQPSQISATTPISLPPTSPETSVPVSDSPTAAP 864
L T S N S + V+ ++ T ++ + ++S
Sbjct: 781 LH-----------TNNNSSSNTKSKHQVEQAENQNTIDATISQNT------FANSRIENN 840
Query: 865 IQPQPTHPMITRGKAGIFKP-KAWLTQQHTDWSLTEPTRVQDAISTPQWKQAMDCEYSAL 924
I+ H M TR K GI KP K ++ EP +A+ P+WK+AM E+ AL
Sbjct: 841 IESINQHQMTTRSKMGIIKPKKPYVGAVEKTLEEQEPETTYEALENPEWKKAMIAEFKAL 900
Query: 925 MKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDFFETFSP 984
M N+TW LVP N++ KW+F+ K ADGT++R KARLVAKGF Q G+D+ ETFSP
Sbjct: 901 MMNKTWTLVPYQGQKNIIDCKWVFKTKYKADGTIERRKARLVAKGFQQTLGLDYDETFSP 960
Query: 985 VVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDPTCPKYVCKLK 1044
V+KA T+R+++S+AV W +RQ+D NNAFLNG L E V+M+QP G+ D + P+++CKL
Sbjct: 961 VIKAITVRIILSIAVHFNWEIRQMDINNAFLNGELKETVFMRQPEGFLDKSRPQHICKLT 1020
Query: 1045 KAIYGLKQAPRAWNTALKSVLLSWGFINSRSDSSLYIFKSQSTVLLLLVYVDDVVLTGNN 1104
KAIYGLKQAPR+W L++ LL WGF N+RSDSSL++ S++ + LL+YVDD+++TG++
Sbjct: 1021 KAIYGLKQAPRSWYDRLRNALLKWGFKNTRSDSSLFVLMSKAHITFLLIYVDDIIITGSS 1080
Query: 1105 LKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAKYT-------------- 1164
++ I +L+ FALKDLG L+YFLG++ SGL L Q KY
Sbjct: 1081 SSFLSSFIKQLNIMFALKDLGSLHYFLGVEACRDASGLYLKQTKYVLDLLKKFNLEHVSS 1140
Query: 1165 ---------NLVNDGKLLEDPFLYRSTIGALQYLTYTRPDISHVVNQLSQFLKSPTDIHW 1224
+L + +L+++P LYR IG LQYLT TRPDI++ VN+LSQ++++PT IHW
Sbjct: 1141 CPTPMVTGRSLSEEAELMKNPTLYRRAIGVLQYLTNTRPDIAYSVNRLSQYMQAPTTIHW 1165
Query: 1225 QMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADWASNIDDRRSVAAYCVFVGSNLVSW 1276
Q VKRV RY+ GT + L +PS I+ FSDADWA+NI+DR+SVA YCVF+G +L++W
Sbjct: 1201 QSVKRVFRYLKGTMNHCLHIKPSVDLDITGFSDADWATNIEDRKSVAGYCVFLGESLITW 1165
BLAST of Lag0007984 vs. ExPASy TrEMBL
Match:
A0A396IUH5 (Putative RNA-directed DNA polymerase OS=Medicago truncatula OX=3880 GN=MtrunA17_Chr3g0122161 PE=4 SV=1)
HSP 1 Score: 991.1 bits (2561), Expect = 4.7e-285
Identity = 621/1501 (41.37%), Positives = 858/1501 (57.16%), Query Frame = 0
Query: 4 NPKYDAWLAVDQLLLGWLYNSMTPEVATQVMGVENAKDLWSAIQDLFGVQSRAEEDFLRQ 63
NP Y W D LL W+ ++++P + ++ + + ++ +W I Q + LR
Sbjct: 26 NPAYTEWEEQDSLLCTWILSTISPSLLSRFVLLRHSWQVWDEIHSYCFTQMKTRSRQLRS 85
Query: 64 TFQQTRKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGR 123
+ KG+ +A+++ ++ +++L G P+S+R+L+ VL L EE++ +VA V +
Sbjct: 86 ELRSITKGSRTVAEFIARIRAISESLASIGDPVSHRDLIEVVLEALPEEFDPIVASVNAK 145
Query: 124 AN-VSWSELQAELLVFEKRLE----LQISHKNTVAFNHNATANMAVNKVNSSPKQTTNNN 183
+ VS EL+++LL E R E IS +V A + + NS T+
Sbjct: 146 SEVVSLDELESQLLTQESRKEKFKKAAISEPVSVNLTETANSESQSHGPNSQNHNYTDGT 205
Query: 184 GNRQ--------GYNNGHQRGNG--YGNRYRGRGRGYNNWNNRPTCQVCGKVGHSAVVCY 243
GN Q G NG RG G +G R+RGRG + +N CQ+C K GH A C+
Sbjct: 206 GNNQFPNSNPNFGGRNGQFRGRGGRFGGRFRGRGGRFGGRSN-VQCQICSKTGHDASYCH 265
Query: 244 HRF----DKEFSP------------IQNRNTGNGTESGNF-----QSNRGIGQQPNAFMT 303
+RF + +SP + +N SG F Q+ GQ P AF+T
Sbjct: 266 YRFFAPQNDYYSPYGSPGGYGAPPNVWMQNMSRPQHSGQFLRPPTQAANQRGQAPQAFLT 325
Query: 304 TQQTATPETLADPSWYADSGASNHVTNNYENIANPTDYRGKECVTVGNGDKLSITSVGNS 363
+ P + +WY DSGA++HVT + N+ + T G + V +GNG L+ITSVG+
Sbjct: 326 ---GSDPYNSFNNAWYPDSGATHHVTPDASNLMDSTSLSGSDQVHIGNGQGLAITSVGSL 385
Query: 364 VLTDGYH---VLNLENVLCVPEIAKNLVSMSKLAQDNNVYIEFHGDFCLVKDKSSGQVLL 423
T H L L N+L VP I KNLVS+S+ A+DNNVY EFH + C VK + S +VLL
Sbjct: 386 QFTSPLHPQTTLKLNNLLLVPSITKNLVSVSQFAKDNNVYFEFHPNHCFVKSQDSSKVLL 445
Query: 424 KGTL-KDGLYQLQDANT--SAASVSASASSNN-------QSDN-----------FNSAFI 483
+G L DGLYQ + + + A VS ++S N Q+DN FN
Sbjct: 446 RGILGHDGLYQFEHTKSFKTTAPVSQNSSVNTVCNKVPAQTDNSASFHLSPSTGFNFNNF 505
Query: 484 VSNVVPHV--SLAVSKT--------IWHRRLGHPSAKVLDFIVKDCKLQVKSNEMSQFCQ 543
N V H+ S S T IWH RLGHP +VL I+K C +++ + +S FC
Sbjct: 506 QCNNVEHLPSSSTSSSTQSFPSMYGIWHSRLGHPHHEVLQSIIKLCNIKLPNKSLSDFCT 565
Query: 544 SCQFGKAHALPFPLSNSRAAKKFDLIHTDVWGPAPILSVEGYRYYALFLDDHSRYLWLYP 603
+C GK H LP S K +LI D+WGPAP+ S GY Y+ +D +SRY W+YP
Sbjct: 566 ACCHGKVHRLPSFASQMTYTKPLELIFCDLWGPAPVESSCGYTYFLTCVDAYSRYTWIYP 625
Query: 604 LKQKSDTVQAFNHLLTVIKTQFGCGIKSVQTDNGGEYIPIHKVCHQLGIKTRLSCPHTSA 663
LK KS T+ F + T+I+ Q I SVQTD GGE++P K + LGI R +CPHT
Sbjct: 626 LKLKSHTLSTFQNFKTMIELQLNHKITSVQTDGGGEFLPFTKYLNSLGITHRFTCPHTHH 685
Query: 664 QNGRAERKHRHVVETGLTLLAQASMPLSHWWDALVTATQLINGLPTTVLQGKSPMELMWS 723
QNG ERKHRH+VETGLTLL+ A MPL W A +TAT LIN LPT VL KSP L+
Sbjct: 686 QNGSVERKHRHIVETGLTLLSHAQMPLKFWDHAFLTATYLINRLPTPVLANKSPFFLLHL 745
Query: 724 KKMNYEMLKTFGCSCYPCLRPYHKHKFHYHTERCVFLGISASHKGYRCMNEPGRVFISRH 783
+ +Y+ LK+FGC+C+P LRPY+ HKF +H++ CVFLG S SHKGY+C++ GR+FIS+
Sbjct: 746 QFPDYKFLKSFGCACFPFLRPYNSHKFDFHSKECVFLGYSNSHKGYKCLDASGRIFISKD 805
Query: 784 VRFNESEFPFATGFGSISSANTASSGSPSILEWFPHVHLPNPTSQSTMHSTPVPNSLV-- 843
V FNE +FP+ F S + LP+ + ST TPV +
Sbjct: 806 VVFNEVKFPYLDLFPSQKVCSV----------------LPDGPTLSTFLPTPVSTTFTVN 865
Query: 844 -HPPDLPHNPTSPFPTLQPTCPQPNTNSYSSPTSLQSQSTDVLPQSPTQLQSLPNEPSSN 903
H P H+ + P PT PQ ++S S PT+ S + PQ+P+ + S +E S
Sbjct: 866 SHTPQNSHSESGPHTVNSPT-PQ-TSHSESVPTTPISNT----PQTPS-ISSHHSESSHR 925
Query: 904 ---PVQPSQISATTPISLPPTSPETSVPV-------SDSPTAAP--IQPQPTHPMITRGK 963
+ P+ I+ +P + +SPE+S V S+SP P I PQ H M TRGK
Sbjct: 926 NNVVLNPTPITILSPSASQNSSPESSASVTSSQSTNSESPPPVPHRIHPQNCHTMRTRGK 985
Query: 964 AGIFKPKAWLTQQHTDWSLTEPTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDF 1023
GI +P+ T T EPT + A+ P+W AM EY+AL+ NQTW LV +
Sbjct: 986 HGIVQPRINPTLLLTH---VEPTTYKTALQDPKWHLAMQEEYNALLHNQTWSLVSLPANR 1045
Query: 1024 NVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAV 1083
+G KW+FR+K+N DGTV +YKARLVAKGFHQ G D+ ETFSPVVK T+R V++LAV
Sbjct: 1046 LAIGCKWVFRVKENPDGTVNKYKARLVAKGFHQQTGFDYNETFSPVVKPVTVRTVLTLAV 1105
Query: 1084 SKGWSLRQLDFNNAFLNGILVEDVYMQQPPGYTDPTCPKYVCKLKKAIYGLKQAPRAWNT 1143
+ W+L+QLD NNAFLNG+L E+VYM QPPG+ + + VCKL KA+YGLKQAPRAW
Sbjct: 1106 TYNWTLQQLDVNNAFLNGVLTEEVYMVQPPGF-ESSDKNLVCKLHKALYGLKQAPRAWFE 1165
Query: 1144 ALKSVLLSWGFINSRSDSSLYIFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRF 1203
LKS LLS+GF +SR D SL+ +Q+ + +LVYVDD+++TGN+ AI L+ +L+ F
Sbjct: 1166 RLKSSLLSFGFKSSRCDPSLFTLHTQAHCIFILVYVDDIIITGNSKLAIQNLVHQLNSEF 1225
Query: 1204 ALKDLGKLNYFLGIQVHYMPSG-LILNQAKY-------TNLVNDGKL------------- 1263
+LKDLG L+YFLGI+VH+ PSG L+L+Q KY N++N +
Sbjct: 1226 SLKDLGILDYFLGIEVHHSPSGSLLLSQTKYIKDLLQKANMINANSMPSPMASSTKLSKF 1285
Query: 1264 ----LEDPFLYRSTIGALQYLTYTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGT 1323
+ DP +RS +GALQY T TRP+IS+ VN++ QFL +P + HW+ VKR+LRY+ GT
Sbjct: 1286 GSSTVSDPTFFRSIVGALQYATITRPEISYSVNKVCQFLSNPLEDHWKAVKRILRYLQGT 1345
Query: 1324 KHLGLLFQPSTST---SISAFSDADWASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVAR 1365
H GL+ P++ST +I+ F DADWAS+ DDRRS + C+F+G NLVSW ++KQ++VAR
Sbjct: 1346 LHHGLMLTPASSTEPIAITGFCDADWASDPDDRRSTSGACIFLGPNLVSWWARKQTLVAR 1405
BLAST of Lag0007984 vs. TAIR 10
Match:
AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )
HSP 1 Score: 333.6 bits (854), Expect = 7.8e-91
Identity = 176/427 (41.22%), Positives = 255/427 (59.72%), Query Frame = 0
Query: 899 EPTRVQDAISTPQWKQAMDCEYSALMKNQTWVLVPSSPDFNVVGNKWIFRIKKNADGTVQ 958
EP+ +A W AMD E A+ TW + P+ +G KW+++IK N+DGT++
Sbjct: 85 EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144
Query: 959 RYKARLVAKGFHQYPGVDFFETFSPVVKASTIRVVISLAVSKGWSLRQLDFNNAFLNGIL 1018
RYKARLVAKG+ Q G+DF ETFSPV K ++++++++++ ++L QLD +NAFLNG L
Sbjct: 145 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDL 204
Query: 1019 VEDVYMQQPPGYT----DPTCPKYVCKLKKAIYGLKQAPRAWNTALKSVLLSWGFINSRS 1078
E++YM+ PPGY D P VC LKK+IYGLKQA R W L+ +GF+ S S
Sbjct: 205 DEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHS 264
Query: 1079 DSSLYIFKSQSTVLLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQV 1138
D + ++ + + L +LVYVDD+++ NN A++ L +L F L+DLG L YFLG+++
Sbjct: 265 DHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEI 324
Query: 1139 HYMPSGLILNQAKYT-NLVNDGKLL-----------------------EDPFLYRSTIGA 1198
+G+ + Q KY +L+++ LL D YR IG
Sbjct: 325 ARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGR 384
Query: 1199 LQYLTYTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISA 1258
L YL TR DIS VN+LSQF ++P H Q V ++L YI GT GL + +
Sbjct: 385 LMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQV 444
Query: 1259 FSDADWASNIDDRRSVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIWI 1298
FSDA + S D RRS YC+F+G++L+SW SKKQ VV++SS E+EYRAL+ A+ E++W+
Sbjct: 445 FSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWL 504
BLAST of Lag0007984 vs. TAIR 10
Match:
ATMG00810.1 (DNA/RNA polymerases superfamily protein )
HSP 1 Score: 190.7 bits (483), Expect = 8.2e-48
Identity = 99/225 (44.00%), Positives = 142/225 (63.11%), Query Frame = 0
Query: 1088 LLLLVYVDDVVLTGNNLKAINRLIGELDKRFALKDLGKLNYFLGIQVHYMPSGLILNQAK 1147
+ LL+YVDD++LTG++ +N LI +L F++KDLG ++YFLGIQ+ PSGL L+Q K
Sbjct: 1 MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTK 60
Query: 1148 YT-NLVNDGKLLE----------------------DPFLYRSTIGALQYLTYTRPDISHV 1207
Y ++N+ +L+ DP +RS +GALQYLT TRPDIS+
Sbjct: 61 YAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 120
Query: 1208 VNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFSDADWASNIDDRR 1267
VN + Q + PT + ++KRVLRY+ GT GL ++ ++ AF D+DWA RR
Sbjct: 121 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRR 180
Query: 1268 SVAAYCVFVGSNLVSWSSKKQSVVARSSTESEYRALAHASAEIIW 1290
S +C F+G N++SWS+K+Q V+RSSTE+EYRALA +AE+ W
Sbjct: 181 STTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
BLAST of Lag0007984 vs. TAIR 10
Match:
ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )
HSP 1 Score: 120.9 bits (302), Expect = 8.0e-27
Identity = 60/125 (48.00%), Positives = 82/125 (65.60%), Query Frame = 0
Query: 873 MITRGKAGIFKPKAWLTQQHTDWSLTEPTRVQDAISTPQWKQAMDCEYSALMKNQTWVLV 932
M+TR KAGI K + T EP V A+ P W QAM E AL +N+TW+LV
Sbjct: 1 MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60
Query: 933 PSSPDFNVVGNKWIFRIKKNADGTVQRYKARLVAKGFHQYPGVDFFETFSPVVKASTIRV 992
P + N++G KW+F+ K ++DGT+ R KARLVAKGFHQ G+ F ET+SPVV+ +TIR
Sbjct: 61 PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120
Query: 993 VISLA 998
++++A
Sbjct: 121 ILNVA 125
BLAST of Lag0007984 vs. TAIR 10
Match:
ATMG00240.1 (Gag-Pol-related retrotransposon family protein )
HSP 1 Score: 75.5 bits (184), Expect = 3.8e-13
Identity = 36/81 (44.44%), Positives = 49/81 (60.49%), Query Frame = 0
Query: 1173 YLTYTRPDISHVVNQLSQFLKSPTDIHWQMVKRVLRYISGTKHLGLLFQPSTSTSISAFS 1232
YLT TRPD++ VN+LSQF + Q V +VL Y+ GT GL + ++ + AF+
Sbjct: 2 YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61
Query: 1233 DADWASNIDDRRSVAAYCVFV 1254
D+DWAS D RRSV +C V
Sbjct: 62 DSDWASCPDTRRSVTGFCSLV 82
BLAST of Lag0007984 vs. TAIR 10
Match:
AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 59.7 bits (143), Expect = 2.2e-08
Identity = 60/216 (27.78%), Positives = 109/216 (50.46%), Query Frame = 0
Query: 10 WLAVDQLLLGWLYNSMTPEVATQVMGVE-NAKDLWSAIQDLFGVQSRAEEDFLRQTFQQT 69
W D L+ W+Y ++T + ++ V A+DLW ++++LF A + T
Sbjct: 67 WKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRTT 126
Query: 70 RKGNLKMADYLRTMKTHADNLGLTGSPISNRNLVSQVLLGLDEEYNAVVAMVQGRANV-S 129
+L + +Y + +K+ +D L SPIS+R LV +L GL E+Y+ ++ +++ ++ S
Sbjct: 127 TIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPS 186
Query: 130 WSELQAELLVFEKRLELQISHKNTVAFNHNATANM---AVNKVNSSPKQTTNNNGNR-QG 189
++E ++ LL+ E RL + S + NH + +N+ + P++ NNN N +G
Sbjct: 187 FTEARSMLLMEESRLSNK-SKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGRG 246
Query: 190 YNNGHQRGNGYGNRYRGRGRGYNNWN-NRPTCQVCG 219
+ RG G + GR NNW N+P + G
Sbjct: 247 RSKKKNRGGGSSD---GRYNNNNNWRLNQPPTWIYG 278
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
GAU19483.1 | 0.0e+00 | 44.63 | hypothetical protein TSUD_77270 [Trifolium subterraneum]
GAU51268.1 | 2.4e-307 | 42.78 | hypothetical protein TSUD_412550 [Trifolium subterraneum]
PNX94503.1 | 9.0e-299 | 45.69 | putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense...
KYP50444.1 | 2.3e-294 | 44.08 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
RHN69202.1 | 9.7e-285 | 41.37 | putative RNA-directed DNA polymerase [Medicago truncatula]
Match Name | E-value | Identity | Description
Q94HW2 | 1.4e-241 | 36.11 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 6.2e-234 | 35.66 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978 | 5.1e-119 | 26.63 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 3.4e-99 | 25.14 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519 | 1.2e-46 | 44.00 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...
Match Name | E-value | Identity | Description
A0A2Z6MBG6 | 0.0e+00 | 44.63 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A2Z6P4D5 | 1.1e-307 | 42.78 | Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A2K3MUJ9 | 4.4e-299 | 45.69 | Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium prat...
A0A151S6M8 | 1.1e-294 | 44.08 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A396IUH5 | 4.7e-285 | 41.37 | Putative RNA-directed DNA polymerase OS=Medicago truncatula OX=3880 GN=MtrunA17_...
Match Name | E-value | Identity | Description
AT4G23160.1 | 7.8e-91 | 41.22 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 8.2e-48 | 44.00 | DNA/RNA polymerases superfamily protein
ATMG00820.1 | 8.0e-27 | 48.00 | Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00240.1 | 3.8e-13 | 44.44 | Gag-Pol-related retrotransposon family protein
AT5G48050.1 | 2.2e-08 | 27.78 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
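
The Identity column in the tables above appears to be the percentage given in each HSP header, that is, identical residues divided by alignment length. A minimal Python sketch of that arithmetic follows, assuming the header layout used on this page; the names HSP_LINE, parse_hsp_stats and percent are hypothetical helpers for illustration, not part of BLAST or of this database.

```python
import re

# Assumed layout of an HSP header line on this page. "Posi?tives" also
# accepts the "Postives" spelling that some report generators emit.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract (count, alignment length, reported percent) for both statistics."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


# Header of the ATMG00810.1 hit, copied from the report above.
stats = parse_hsp_stats(
    "Identity = 99/225 (44.00%), Positives = 142/225 (63.11%), Query Frame = 0"
)
ident_n, ident_len, ident_reported = stats["identity"]
recomputed_identity = percent(ident_n, ident_len)
```

Recomputing the percentage from the raw counts is a cheap sanity check when these lines are scraped into a spreadsheet or pipeline.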